Supporting Information
Adenine methylation in eukaryotes: apprehending the complex evolutionary history and functional potential of an epigenetic modification


Lakshminarayan M. Iyer , Dapeng Zhang, and L. Aravind*

* Address for correspondence: L. Aravind (aravind@mail.nih.gov)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA


Abstract

While N6-methyladenosine (m6A) is a well-known epigenetic modification in bacterial DNA, it remained largely unstudied in eukaryotes. Recent studies have brought to fore its potential epigenetic role across diverse eukaryotes with biological consequences, which are distinct and possibly even opposite to the well-studied 5-methylcytosine mark. Adenine methyltransferases appear to have been independently acquired by eukaryotes on at least 13 occasions from prokaryotic restriction-modification and counter-restriction systems. On at least 4-5 instances these methyltransferases were recruited as RNA methylases. Thus, m6A marks in eukaryotic DNA and RNA might be more widespread and diversified than previously believed. Several m6A-binding protein domains from prokaryotes were also acquired by eukaryotes, facilitating prediction potential readers for these marks. Further, multiple lineages of the AlkB family of dioxygenases have been recruited as m6A demethylases. Although members of the TET/JBP family of dioxygenases have also been suggested to be m6A demethylases, this proposal needs more careful evaluation.

Contents

  • Phyletic distribution, domain architectures, gene neighborhoods and multiple sequence alignments of Adenine methylases and other domains described in this study
    1. Materials and Methods
    2. Group 1 Methyltransferases
    3. Group 2 Methyltransferases
    4. Group 3/Trichomonas-like N6 Methyltransferases
    5. Eukaryotic AlkB families proteins
    6. Phyletic distribution and alignments of N6mA readers
    7. Fasta sequences of entries with temporary ids, and not found in Genbank


    • Materials and Methods

      Detection of distant sequence similarities

      Iterative sequence profile searches were done using PSI-BLAST [1] and the web version of the JACKHMMER ( http://www.ebi.ac.uk/Tools/hmmer/search/jackhmmer) programs. For m6A DNA methylases, previously identified families were used as seed in iterative profile searches against the non-redundant (NR) protein database of National Center for Biotechnology Information (NCBI) and a local database that additionally contained sequences from completed eukaryotic genomes that have not been deposited in the NR database [2-5]. The HHpred program [6] was used for profile–profile comparisons which compares HMMs derived from a given alignment created using either PSIBLAST of HHBLITZ against a library of HMM created from Pfam and PDB. For previously known domains, the Pfam database [7] was used as a guide and augmented by addition of newly detected divergent members using a local database of profiles.

      Creation of multiple alignments

      Similarity-based clustering for both classification and culling of nearly identical sequences was performed using the BLASTCLUST program (ftp://ftp.ncbi.nih.gov/blast/documents/blastclust.html). Multiple sequence alignments were built by the Kalign2 [8] and Muscle [9] programs, followed by manual adjustments on the basis of profile-profile and structural alignments.

      Prediction of operons in prokaryotic genomes

      Gene neighborhoods were obtained by isolating all conserved prokaryotic genes, in the neighborhood of the gene under consideration, which showed a separation of less than 70 nucleotides between their termini. Genes fulfilling this criterion and occurring in the same direction were considered likely to form operons. These were further filtered using BLASTCLUST (-L 0.3 –S 0.3) to cluster all those proteins that were encoded by the putative operons to determine conserved gene-neighborhoods. If such conserved gene-neighborhoods were found across more than one major bacterial lineage (phylum) mapped using the NCBI taxonomy id then they were seen as notable associations, which was further analyzed for functional significance.

      Structural analysis

      Structure similarity searches were performed using the DaliLite program [10] which orders alignments based on Z-scored derived from C-alpha matches. Secondary structures were predicted using the JPred program [11]. Structural visualization and manipulations were performed using the PyMol ( http://pymol.org) program

      Phylogenetic analysis

      Phylogenetic analysis was conducted using an approximately maximum-likelihood method implemented in the FastTree 2.1 [12] program under default parameters. Independent ML analysis was done using the MEGA5 program with the JTT substitution model and alpha parameter of 1. The in-house TASS package comprising Perl scripts was used to automate the analysis.

      REFERENCES

      1 Altschul SF, Madden TL, Schaffer AA, Zhang J, et al. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research 25: 3389-402.
      2 Iyer LM, Anantharaman V, Wolf MY, Aravind L. 2008. Comparative genomics of transcription factors and chromatin proteins in parasitic protists and other eukaryotes. International journal for parasitology 38: 1-31.
      3 Iyer LM, Abhiman S, Aravind L. 2011. Natural history of eukaryotic DNA methylation systems. Progress in molecular biology and translational science 101: 25-104.
      4 Iyer LM, Zhang D, Burroughs AM, Aravind L. 2013. Computational identification of novel biochemical systems involved in oxidation, glycosylation and other complex modifications of bases in DNA. Nucleic acids research 41: 7635-55.
      5 Iyer LM, Zhang D, de Souza RF, Pukkila PJ, et al. 2014. Lineage-specific expansions of TET/JBP genes and a new class of DNA transposons shape fungal genomic and epigenetic landscapes. Proceedings of the National Academy of Sciences of the United States of America 111: 1676-83.
      6 Soding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic acids research 33: W244-8.
      7 Finn RD, Bateman A, Clements J, Coggill P, et al. 2014. Pfam: the protein families database. Nucleic acids research 42: D222-30.
      8 Lassmann T, Frings O, Sonnhammer EL. 2009. Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features. Nucleic acids research 37: 858-65.
      9 Edgar RC. 2004. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5: 113.
      10 Holm L, Kaariainen S, Rosenstrom P, Schenkel A. 2008. Searching protein structure databases with DaliLite v.3. Bioinformatics 24: 2780-1.
      11 Cole C, Barber JD, Barton GJ. 2008. The Jpred 3 secondary structure prediction server. Nucleic acids research 36: W197-201.
      12 Price MN, Dehal PS, Arkin AP. 2010. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One 5: e9490.

    • Multiple sequence alignment of the MT-A70 clade (Clade 1) of adenine methylases

      Note: Many members, especially those of the METTL14 group, show disruptions of the canonical active site residues.                                                
      Sequence features annotation                                                 Str-3                                    Str-4                                                                            Str-5                                                Str-6                                                              Str-7                                                                                       Synapomorphic strand            conservedK                                                       Str-1               Str-2                     
      RES                                                                     PLPPQWINCDLRRFDYSVL-------------------------GKFHVIMADPPWDIHM------------------------SLP-------------YGTMTDDE-----MRAMPIPALQ-DE-GLLFLWVTGR--------AMEVGRE----------CLR----VWGYTRVD--EVVWVKTNQ-------------------------LQRVIRTGR----------------TGHWLNHTKEHMLVGIKNPPGVTQGSNT-------------------------------------GETPTLKFPSWI-----NRGLDT------------------DVIVSEV------------------------RETSRKPDE---------VYNMIERMCP------------------------------GGRKVEIF-GRKHN------VRPGWITLG-----------NQLGNVD
      ALIGN                                                                   ------EE----E---------------------------------EEEEE------------------------------------------------------HHH-----HH---HH--------EEEEEE------------HHHHHH----------HHH----H---HHHH--HEEEEEE-------------------------------------------------------EE---HHHEEEH-------------------------------------------------------------------------------------------EEEEEE-------------------------------------------HHEHHHH------------------------------------HHEHH-H---------------EE--------------------
      HMM                                                                     ------E----------------------------------------EEEEEE---------------------------------------------------HHHH-----HHHHHHHHHH-----EEEEEE-----------HHHHHHH----------HHH----HHHHHEEE--EEEEEEEE----------------------------EEEEE--------------------EEEE---EEEEEEEE---------------------------------------------------------EEE-----EE----------------------EEEEEE-------------------------------HHH---------HHHHHHHH-----------------------------------EEEEE-E--------------EEEEE-----------E------
      FREQ                                                                    ------EE--HHHHHHHH----------------------------EEEEE------------------------------------------------------HHH-----HHH-------------EEEEEHHH--------HHHH-HH----------HHH----HH---EEE--EEEEEHH----------------------------EEEEEE--------------------EEE----EEEEEEE-----------------------------------------------------------------------E------------------EEEEEEE------------------------E-----------------EEEEEEE------------------------------------EEEEE----------------EEE--------------------
      PSSM                                                                    -----------------------------------------------EEEEE---HHHH----------------------------------------------HHH-----HH-------------EEEEEE-----------HHHHHHH----------HHH----H---EEE----EEEEEE--------------------------------------------------------------EEEEEEE----------------------------------------------------------------------E------------------EEEEE----------------------------------H---------HHHHHHHH-----------------------------------EEEEE-----------------EEEE------------------
      FINAL                                                                   ----------HHHHH-------------------------------EEEEEE-----------------------------------------------------HHH-----HHH------------EEEEEE--H--------HHHHHHH----------HHH----H---EEEE--EEEEEEE----------------------------EEEEEE--------------------EEE----EEEEEEEE----------------------------------------------------------------------E------------------EEEEEEE------------------------E-------H---------HHHHHHH------------------------------------EEEEE----------------EEEEE------------------
      CC1G_14583_Coprinopsis_cinerea_okayama7#130_299747281                   PLPPQWINCDLRRFDYSVL-------------------------GKFHVIMADPPWDIHM------------------------SLP-------------YGTMTDDE-----MRAMPIPALQ-DE-GLLFLWVTGR--------AMEVGRE----------CLR----VWGYTRVD--EVVWVKTNQ-------------------------LQRVIRTGR----------------TGHWLNHTKEHMLVGIKNPPGVTQGSNT-------------------------------------GETPTLKFPSWI-----NRGLDT------------------DVIVSEV------------------------RETSRKPDE---------VYNMIERMCP------------------------------GGRKVEIF-GRKHN------VRPGWITLG-----------NQLGNVD
      NEMVEDRAFT_v1g33607_Nematostella_vectensis_156398086                    LYPPQWISCDVRSLQMDVL-------------------------GKFSVIMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRNLSVPSLQ-DN-GYIFLWVTGR--------AMELGRE----------CLE----IWGYERCD--ELIWVKTNQ-------------------------LQRLIRTGR----------------TGHWINHGKEHCLIGVK------------------------------------------------GDTTGF-----------NRGMDC------------------DV------------------------------------------------------------------------------------------------------------------------------------LV
      PF07_0123_Plasmodium_falciparum_3D7_124512114                           VYGPQWIRCDLRNFDLSIF-------------------------KYVSVVMADPPWDIHM------------------------DLP-------------YGTMTDNE-----MKLLPVQLIQ-DE-GMIFLWVTGR--------AMELARE----------CLQ----IWGYKRVE--EILWVKTNH-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GNPII------------NRNIDC------------------NVIVSEV------------------------RETSRKPDE---------IYSLIERLCP------------------------------QNLKIELF-GRPHN------CRSNWITLG-----------NQLNGVV
      TGFOU_217350_Toxoplasma_gondii_FOU_672280105                            EYPAQWIRCDIRTFDFSIF-------------------------KLIRVVMADPPWDIHM------------------------DLP-------------YGTMTDQE-----MRSLRVDLIQ-EE-GLLFLWVTGR--------AMELARE----------CLQ----LWGYRRVE--EILWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVAVK------------------------------------------------GNMAF------------NRNIDC------------------DVIVSEV------------------------RETSRKPDEIYRQGEAR-HRGMIERMAP------------------------------DSLKVELF-GRMHN------VRNNWITLG-----------NQLKGVK
      LOC100641238_Amphimedon_queenslandica_340369522                         LVPPQWLNCDLRNFDTSVL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRQLDIPSLQ-DD-GFIFLWVTGR--------AMELGRE----------CLT----LWGYERID--ELVWVKTNQ-------------------------LQRLIRTGR----------------TGHWINHGKEHCLVGAK------------------------------------------------GNLQGV-----------NRGIDT------------------DVIVAEV------------------------RATSRKPDE---------IYGVIERLSP------------------------------GTRKIELF-GRQHN------CQPNWLTLG-----------NQLEGDN
      EMIHUDRAFT_211665_Emiliania_huxleyi_CCMP1516_551562135                  HYESQFVNCDIRTFPMQTL-------------------------GKFPVIMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRRMNVQVLQ-DD-GVLFLWVTGR--------AMELGRE----------CLE----IWGYRFVQ--ELLWVKTNQ-------------------------LQRIIRTGR----------------TGHWINHSKEHCLIGVK------------------------------------------------GDLDDRF----------NQNLDC------------------DVICAEV------------------------RETSRKPDE---------MYDLLERLAP------------------------------GQRKLELF-GRPHN------VHKGWTTLG-----------NQLGKTQ
      _Danio_rerio_597501008                                                  LFPSQWICCDIRYLDVSIL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTLTDDE-----MRKLNIPILQ-DD-GFLFLWVTGR--------AMELGRE----------CLS----LWGYDRVD--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHGKEHCLVGVK------------------------------------------------GNPQGF-----------NRGLDC------------------DVIVAEV------------------------RSTSHKPDE---------IYGMIERLSP------------------------------GTRKIELF-GRPHN------VQPNWITLG-----------NQLDGIH
      METTL3_Homo_sapiens_21361827                                            LFPPQWICCDIRYLDVSIL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTLTDDE-----MRRLNIPVLQ-DD-GFLFLWVTGR--------AMELGRE----------CLN----LWGYERVD--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHGKEHCLVGVK------------------------------------------------GNPQGF-----------NQGLDC------------------DVIVAEV------------------------RSTSHKPDE---------IYGMIERLSP------------------------------GTRKIELF-GRPHN------VQPNWITLG-----------NQLDGIH
      Dmel_CG5933_Drosophila_melanogaster_21355141                            LYPPQWIQCDLRFLDMTVL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRALGVPALQ-DD-GLIFLWVTGR--------AMELGRD----------CLK----LWGYERVD--ELIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHGKEHCLVGMK------------------------------------------------GNPTNL-----------NRGLDC------------------DVIVAEV------------------------RATSHKPDE---------IYGIIERLSP------------------------------GTRKIELF-GRPHN------IQPNWITLG-----------NQLDGIR
      _Saccharomyces_cerevisiae_S288c_1174426                                 ALPAQWIRCDVRKFDFRVL-------------------------GKFSVVIADPAWNIHM------------------------NLP-------------YGTCNDIE-----LLGLPLHELQ-DE-GIIFLWVTGR--------AIELGKE----------SLN----NWGYNVIN--EVSWIKTNQ-------------------------LGRTIVTGR----------------TGHWLNHSKEHLLVGLK------------------------------------------------GNPKWI-----------NKHIDV------------------DLIVSMT------------------------RETSRKPDE---------LYGIAERLAGT-----------------------------HARKLEIF-GRDHN------TRPGWFTIG-----------NQLTGNC
      AT4G10760_Arabidopsis_thaliana_15236910                                 LGEAQWINCDIRSFRMDIL-------------------------GTFGVVMADPPWDIHM------------------------ELP-------------YGTMADDE-----MRTLNVPSLQ-TD-GLIFLWVTGR--------AMELGRE----------CLE----LWGYKRVE--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GNPEV------------NRNIDT------------------DVIVAEV------------------------RETSRKPDE---------MYAMLERIMP------------------------------RARKLELF-ARMHN------AHAGWLSLG-----------NQLNGVR
      PTSG_03395_Salpingoeca_rosetta_514696822                                TFPAQWIQCDVRYIDFSVL-------------------------GKFSVIMADPPWRINM------------------------ELP-------------YGTMSDEE-----MRQLPVQDLQ-DN-GVIFLWVTAR--------CVDLGRE----------LLK----RWGYNYAN--DLIWIKINQ-------------------------LQNLVRTGR----------------TGHWMNHAKEHCMIGVK------------------------------------------------GNLDGI-----------YPGIDC------------------DVLVSEV------------------------RDTSRKPDE---------IYGLIERLSP------------------------------GTRKIELF-GRPHN------VQSNWLTLG-----------DQLQGVQ
      TTHERM_00962190_Tetrahymena_thermophila_SB210_586734236                 KLNPQWINCDLRQIDFNIL-------------------------GKFNCIMADPPWDIHM------------------------TLP-------------YGTLKDRE-----MKAMRVDLLQ-EE-GVIFLWVTGR--------AMELGRE----------CLT----NWGYRRVE--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GNPKI------------NRKIDC------------------DVIVSEV------------------------RETSRKPDE---------IYNLIERMCP------------------------------GGKKIELF-GRPHN------TMPGWLTLG-----------NQLPGIY
      GSPATT00017263001_Paramecium_tetraurelia_strain_d4-2_145529029          SMPPQWINCDLRIFDFRVL-------------------------GKFDVIMADPPWDIHM------------------------NLP-------------YGTLKDKE-----MKALRVDLLQ-ND-GIIFLWVTGR--------AMELGRE----------CLI----LWGYRRVE--ELVWIKVNQ-------------------------LHRIIRTGR----------------TGHWLNHSKEHCLIGIK------------------------------------------------GNPQL------------IKGLDC------------------DVIVSEV------------------------RETSRKPDE---------VYGIINRMCP------------------------------NGKKVELF-GRPHN------CRPNWITLG-----------NQLPGVY
      ACA1_074420_Acanthamoeba_castellanii_str_Neff_470518935                 WADRTFINCDLRYYNLASL-------------------------GKFDAILIDPPWRIKGNQLISNEKTMFNNSKW--------GLS-------------YGTMSNDE-----IIDIDVGCLS-DK-GFIFLWVINS--------QIEFGFK----------CLQ----KWGYTYVD--RITWVKKTA-------------------------SGNIAIS------------------QGYYFLHSSEICLVGVKYDAK--------------------------------------------GKSLEF-----------ISKTSN------------------DLLFAEI------------------------REKSRKPDQ---------LYHIIERMVP------------------------------GGRKVEIF-ARNHN------MRPGWLSLG-----------NQLGEYY
      SPRG_03347_Saprolegnia_parasitica_CBS_22365_641538296                   PTTAIAIACDVTTYDVARL-------------------------GTFDAIVMDPPWEINL------------------------QLP-------------YTTLSDEA-----IGALAIPALQ-TA-GWIFLWVATG--------KLVVGRQ----------LLR----QWGYTVVD--DIVWIKIDQ-------------------------LQHVAHQGR----------------TGHWLNHSQEHCLVGRK------------------------------------------------GLAPS------------AARLDC------------------DVIVAAP------------------------RENSRKPDE---------LYHLVERVVP------------------------------AGRKLELF-GRRHN------LRDGWTTLG----------------DQ
      CHLREDRAFT_128290_Chlamydomonas_reinhardtii_159466562                   ALDPQWINCDVRSFDMTVL-------------------------GKFGVIMADPPWEIHQ------------------------DLP-------------YGTMKDDE-----MVNLNVGCLQ-DN-GVLFLWVTGR--------AMELARE----------CMA----KWGYKRVD--ELIWVKTNQ-------------------------LQRLIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GSPQL------------NRYVDT------------------DVVVAEV------------------------RETSRKPDE---------MYSLLERLSP------------------------------GTRKLEIF-ARVHN------CKPGWVGLG-----------NQLKNVN
      NEMVEDRAFT_v1g95490_Nematostella_vectensis_156393637                    PSMSSFLNSDATKLQPVIEHGKVKI-------------------VPFDLIVIDPPWYNK-------------------------SAKRKRM---------YSFMSLWQ-----IKALPVPELIAPG-GLLAVWVTNKAK------YIRFTRSE---------LLP----SWGVDVIA--EWHWIKVTK-------------------------TGEYVVG------------------MESAHKKPYETLIIGRLPILPGASID---------------------------------------GGVKQ------------V--PEH------------------QVICSVPC-----------------------LKHSRKPPL---------GDVFKDFLPR------------------------------HPHCLEMF-AR--N------LTPGWTSWG-----------NQVLKFQ
      GSPATT00037207001_Paramecium_tetraurelia_strain_d4-2_145499669          NHPPNYIKADLRTFDLQQL-------------------------GKFDVILIDPPWAEYAKRLMQANMQV--------------KEH-------------QQSWTLEE-----LKQLHIDKIADIP-SFIFLWCGSE--------HLDDGRE----------LFK----TWGFKRCE--DIVWLKTNKDHSK---------------------QNQYVAGQDY---------------GDNLFRRVKEHCLVGLR------------------------------------------------GDVKRASDQHFI-----HANIDT------------------DVIITEEEV----------------------MGSTKKPEE---------LYEIIERFCL------------------------------GRKRIELF-GEIHN------IRDGWLTIG-----------TQLRDTR
      NEMVEDRAFT_v1g224635_Nematostella_vectensis_156328704                   ATPPMYLRCDLETFALHDLD------------------------NKFDVILVDPPLEEYQRRHAGV------------------SFN-------------FKPWTWDD-----IMKLDIEEVAAQR-SFIFLWCGSHE-------GLTEGRKVQHLKLKSDMCLR----KWGFRRCE--DICWIKTNKTNP----------------------GNTKYLE------------------PIAIFQHTKEHCLMGIR------------------------------------------------GTVRRSTDGDFI-----HANVDI------------------DLIITEECK----------------------GV--------------------------------------------------------------------------------------------------VTRDN
      CHLREDRAFT_174824_Chlamydomonas_reinhardtii_159474530                   PPHCVPIHANVTTFDWPSLYSH----------------------AQFDVIMMDPPWQLA-------------------------TANPTRGVALG-----YSQLNDDH-----ISRLPVPQLQRQG-GYLFVWVINA--------KYKWTLD----------LFD----RWGYRLVD--EVVWVKMTV-------------------------NRRLAKS------------------HGYYLQHAKEVCLVAKR------------------------------------------------GNPPVPPGC--------EGGVGS------------------DIIFSER------------------------RGQSQKPEE---------IYHLIEQLVP------------------------------NGRYLEIF-ARKNN------LRNYWVSIG-----------NEVTGTG
      PHYPADRAFT_206270_Physcomitrella_patens_168011388                       PKSCTFLISDISEVHRLI--------PGD--S-K----------DGFNLMVIDPPWENK-------------------------SVHRKSL---------YPTLPNKY-----LLSLPVKQLAHADGALVALWITNRE-------KLRHFAETE--------LFP----AWGVKMAA--VWYWLKVTV-------------------------EGTMVSP------------------LDLAHHKPYECLLLGYLPSKIGSTSEESVQFT---------------------------------GSREH------------ADLPDK------------------FVLISIP------------------------GDHSRKPPL---------KSLLSKHIPGQRH---------------------------AERGLELF-AR--E------LSAGWTSWG-----------NEPLRFQ
      CELE_C18A3.1_Caenorhabditis_elegans_17531953                            PPKSTFHVGDVKDIEQYS--------RAH--D------------LLFDLIIADPPWFSK-------------------------SVKRKR----------TYQMDEEV-----LDCLDIPVILTHD-ALIAFWITNRIG------IEEEMIE----------RFD----KWGMEVVA--TWKLLKITT-------------------------QGDPVYDF-----------------DNQKHKVPFESLMLAKK------------------------------------------------KDSMR------------KFELPE------------------NFVFASVPM----------------------SVHSHKPPLLDLLRHF--GIEFTEPL-------------------------------------ELF-AR--S------LLPSTHSVG-----------YEPFLLQ
      AT1G19340_Arabidopsis_thaliana_18394726                                 PRNSCFYMSDLHHIRNLV--------PAK--S-E----------EGYNLIVIDPPWENA-------------------------SAHQKSK---------YPTLPNQY-----FLSLPIKQLAHAEGALVALWVTNRE-------KLLSFVEKE--------LFP----AWGIKYVA--TMYWLKVKP-------------------------DGTLICD------------------LDLVHHKPYEYLLLGYHFTELA-------------------------------------------GSEKRSDF---------KLLDKN------------------QIIMSIP------------------------GDFSRKPPI---------GDILLKHTPGSQ----------------------------PARCLELF-AR--E------MAAGWTSWG-----------NEPLHFQ
      mettl4_Danio_rerio_189522093                                            PPRCRFLLSDVTRMDPLV--------NSG---------------DKFDLIVLDPPWENK-------------------------SVKRSNR---------YSSLPSSQ-----LKKLPVPALAAPG-GLVVTWVTNRAK------HRRFVREE---------LYP----HWAVEVLA--EWLWVKVTR-------------------------SGEFVFP------------------LDSQHKKPYEVLVLGRC------------------------------------------------RSTSD------------HTDRCSAVNELPDQ----------RLLVSVPS-----------------------TLHSHKPSL---------AAVLKPYIRR------------------------------EPRCLELF-AR--S------LQSDWSCWG-----------NEVLKFQ
      Dmel_CG7818_Drosophila_melanogaster_19920926                            ASAPMYLKADLKSLDVKT--L----------G------------AKFDVILIEPPLEEYARAAPSVATVG--------------GAP-------------RVFWNWDD-----ILNLDVGEIAAHR-SFVFLWCGSSE-------GLDMGRN----------CLK----KWGFRRCE--DICWIRTNI-------------------------NKPGHSKQLE---------------PKAVFQRTKEHCLMGIK------------------------------------------------GTVRRSTDGDFI-----HANVDI------------------DLIISEEEE----------------------FGSFEKPIE---------IFHIIEHFCL------------------------------GRRRLHLF-GRDSS------IRPGWLTVG-----------PELTNSN
      GSPATT00032234001_Paramecium_tetraurelia_strain_d4-2_145486788          --IKSYINCDIRYFNIDF--------LVE--K------V-----GGFDVVLMDPPWRIKGGQQNDSSFMFTNSKF---------SLD-------------YNTMSNQE-----IMDIKIEKLS-KK-GFLFLWILNT--------QLNIAYE----------MAS----KWGYEIVD--QIIWVKLNPQ------------------------GNNVYLS------------------TGYYFMHSFEICLVGYTN-----------------------------------------------KHVEY------------HSKISN------------------NIIFSPV------------------------RNKSQKPIE---------LYEIIELMMP------------------------------GSKKVEIF-ARNHN------LRHGWFSIG-----------NQLGETF
      GSPATT00027481001_Paramecium_tetraurelia_strain_d4-2_145473723          --VKSYINCDIRYFNLDF--------LVE--K------V-----GGFDVVLMDPPWRIKGGQQNDSSFMFTNSKF---------SLD-------------YNTMSNQE-----IMDIKIEKLS-KK-GFLFLWILNT--------QLNIAYE----------MAS----KWGYEIVD--QIIWVKLNPQ------------------------GNNVYLS------------------TGYYFMHSFEICLVGYK------------------------------------------------CPPGEHVEY--------HSKISN------------------NIIFSPV------------------------RNKSQKPIE---------MYEIIEIMMP------------------------------GAKKVEIF-ARNNN------LRHGWFSIG-----------NQLGETY
      METTL14_Homo_sapiens_24308265                                           NTPPMYLQADIEAFDIRE--L----------T------------PKFDVILLEPPLEEYYRETGI-------------------TAN-------------EKCWTWDD-----IMKLEIDEIAAPR-SFIFLWCGSGE-------GLDLGRV----------CLR----KWGYRRCE--DICWIKTNKNNP----------------------GKTKTLD------------------PKAVFQRTKEHCLMGIK------------------------------------------------GTVKRSTDGDFI-----HANVDI------------------DLIITEEPE----------------------IGNIEKPVE---------IFHIIEHFCL------------------------------GRRRLHLF-GRDST------IRPGWLTVG-----------PTLTNSN
      Dmel_CG14906_Drosophila_melanogaster_24647514                           PNQSRFFNHNVDNLPALL-----HQ-LL----------------PAYDLIVLDPPWRNKYIRRLKRAKP---------------ELG-------------YSMLSNEQ-----LSHIPLSKLTHPR-SLVAIWCTNST-------LHQLALEQQ--------LLP----SWNLRLLH--KLRWYKLST-------------------------DHELIAPPQ----------------SDLTQKQPYEMLYVACR------------------------------------------------SDASENYG---------KDIQQT------------------ELIFSVPS-----------------------IVHSHKPPL---------LSWLREHLLLDKDQL-------------------------EPNCLELF-AR--Y------LHPHFTSIG-----------LEVLKLM
      NAEGRDRAFT_72415_Naegleria_gruberi_strain_NEG-M_290979461               PKHCVPIRTDVRNMNWKA--------LAR--V------------AQFDVILMDPPWQLA-------------------------TSNPLRGVAIS-----YKPLSDKH-----IQSMDISSLQEKNGGFLFVWVINA--------KYVKTLE----------MIE----KWGYKFVD--EITWVKQTK-------------------------HRRLAKG------------------HGYYLQHAKENCIIAIK------------------------------------------------NTTPEREKEMMEAA---RQKVTSLC----------------DVILSDR------------------------RGQSQKPED---------LYHFIEQMVP------------------------------DGKYLEIF-GRRNN------LRDYWVTIG----------------NE
      NAEGRDRAFT_30463_Naegleria_gruberi_strain_NEG-M_290998263               FKGGHYINCDLRYFTLS--------------S------L-----GKFDVILIDPPWRVIQSRPQEAMMFSNTNF----------KLN-------------YNTLSYEE-----IMDINVGSLC-DQ-GFCFMWVLNS--------SLQFGLN----------LLN----HWGFSYID--KITWIKKTK-------------------------NDQIFAG------------------TGYYFLHSTELLLVGVKH-----------------------------------------------GSTKKNGQKLQY-----ISKITN------------------DILFSKV------------------------GIQSQKPNE---------VYEIIESMVP------------------------------GARKIEIF-ARNHN------IRKGWLSIG-----------NRLGEAF
      CC1G_11190_Coprinopsis_cinerea_okayama7#130_299741172                   SLPPSYLPYSQLSTLNS---------------------------SKFDVILLDPPFS--------------------------------------------SSFTWDN-----LQELPIPSLAADP-SFVFLWVGSGAGE-----GLERGRE----------VLA----KWGYRRCE--DVVWVKTNK-------------------------TTNQGPGTDPP--------------TTSLLTRTKQHCLMGIR------------------------------------------------GTVRRSTDSWFV-----HCNVDT------------------DVIIWEGDP----------------------TDPTRKPPE---------MYTLIENFCL------------------------------GIRRLEIF-GRPSS------LRRGWVTVL---------------GPN
      AT4G09980_Arabidopsis_thaliana_145340055                                ASAPMYLKGDLHEVELSPELF----------G------------TKFDVILVDPPWEEYVHRAPGV------------------SDS-------------MEYWTFED-----IINLKIEAIADTP-SFLFLWVGDGV-------GLEQGRQ----------CLK----KWGFRRCE--DICWVKTNKSNA----------------------APTLRHD------------------SRTVFQRSKEHCLMGIK------------------------------------------------GTVRRSTDGHII-----HANIDT------------------DVIIAEEPP----------------------YGSTQKPED---------MYRIIEHFAL------------------------------GRRRLELF-GEDHN------IRAGWLTVG-----------KGLSSSN
      EAG_10107_Camponotus_floridanus_307172265                               PKKCTFYCYDVRDIDKKI--------ELN---------------NQYDFILLDPPWWNKSIRRKKIKCA---------------EAS-------------YKMMYNEE-----LIKIPIRKLLHSN-GIVAIWCTNSSN------HLNSIFNE---------IFP----SWGITYRA--KWYWIKVTQ-------------------------AGDTICNFNLA--------------PG---KQPFELLILGSA------------------------------------------------LEEDK------------VNIPDA------------------KLMISIPS-----------------------AVHSHKPPL---------TEIIKDYLPN------------------------------EPKCLEIF-AR--Y------LLPGWTSWG-----------LEILKFQ
      Ot11g01290_Ostreococcus_tauri_308809243                                 PPQCIPVHANVTTYDWRP--------MYE--H------------EQFDVIMMDPPWQLATANPTRGV-----------------SLG-------------YSQLTDQD-----IANLPLPQLQ-KN-GLLFVWVINA--------KYQWCLN----------QFK----KWGYEFVD--EIVWVKVTN-------------------------SRRLAKS------------------HGFYLQHAKEVCLVARR------------------------------------------------GDTPPGLK---------DKAIGS------------------DIIFAPR------------------------RGQSQKPTE---------IYELIEELVP------------------------------NGRYLEIF-ARKNN------LRDFWVSVG-----------NEVTGTG
      SINV_06005_Solenopsis_invicta_322796786                                 PRKCTFYSYDVRDIEKKI--------ELS---------------NQYDFILLDPPWWNKSIRRKKMKCA---------------EAS-------------YKMMYNEE-----LVKIPIKKLLHSN-GIVAVWCTNSSN------HLNSIINE---------IFP----SWGIIYRA--KWYWLKVTQ-------------------------AGDTICNFNSA--------------PG---KQPYELLVLGTA------------------------------------------------LEKGK------------IDIPGG------------------KLMISVPS-----------------------AVHSHKPPLTDLFI----FLEIIKDYLPD-----------------------------EPKCLEIF-AR--Y------LLPGWTSWG-----------LEILKFQ
      gp5_EBPR_siphovirus_4_337731296                                         --------MSSTE-EFIA--L----------R------P-A---GGFSLIMADPPWSYEMRSEKGYAKAP--------------EAQ-------------YATMPLAE-----IAAMPVELLAAED-CLLWLWAVNP--------QLPQALE----------VLV----AWGFTFKT--AGTWLKRST-------------------------RGKVSFG------------------TGYILRSANEPFLIGAR------------------------------------------------GRPKT------------TRATRS------------------AVITRDERLRGMEDNWPLGTITIEAAG----REHSRKPDE---------AYVACEELMP------------------------------GARRLDLF-SR--Q------RRQGWVSWG-----------NEVGKFE
      METTL4_Homo_sapiens_145275206                                           PPKSSFLLSDISCMQPLL--------NYR---------------KTFDVIVIDPPWQNK-------------------------SVKRSNR---------YSYLSPLQ-----IQQIPIPKLAAPN-CLLVTWVTNRQK------HLRFIKEE---------LYP----SWSVEVVA--EWHWVKITN-------------------------SGEFVFP------------------LDSPHKKPYEGLILGRVQEKTALPLRN--------------------------------------ADVNV------------LPIPDH------------------KLIVSVPC-----------------------TLHSHKPPL---------AEVLKDYIKP------------------------------DGEYLELF-AR--N------LQPGWTSWG-----------NEVLKFQ
      LOC100633541_Amphimedon_queenslandica_340370562                         PPSSSFLLSDISKIQLLK--------RFS--RYAD---A-----NGYNIIVLDPPWENRSAIR---------------------GGK-------------YKWLDKED-----ISQLPIPELIAPG-GLVALWVTNKRQ------LVQWTVQE---------LLP----KWGLEYIG--EWLWIKVTT-------------------------EGDFVFD------------------VDSVHKKPYESLIIGKRPSLPADSDPTPPPAKAQRLTESHCIPPSNPLMLKDLS-----------GDKSVQSLSADSASSV-KSRSGD------------------EILVDNVSVITGSCKPPSQYSLMCIPS----TTHSQKPYLGDILQ----LYAESKDM----------------------------------KCLELF-AR--N------LLPGWTSWG-----------NEVLQFQ
      LOC100635916_Amphimedon_queenslandica_340382361                         AIPPVYYKVDLSSFDLTS--L----------D------------AKFDVILIDPPLEEY-------------------------QRRTTGITYP------WQPWDFEE-----IMNLKIEDVSAPR-SFVFLWCGSCE-------GLDLGRE----------CLK----KWGFRRCE--DICWVKTNMNDP----------------------GNTTHLE------------------QKSIFQHTKEHCLMGIK------------------------------------------------GTVRRNQDGHFI-----HANIDL------------------DIIISEEPE----------------------MGNNDKPEE---------IFHIIEHFCL------------------------------GRKRLHLF-GNDAT------VRPGWLTLG-----------PNLSSSN
      mettl14_Danio_rerio_46309507                                            NTPPMYLQADPDTFDLRE--L----------K------------CKFDVILIEPPLEEY-------------------------YRESGIIAN-------ERFWNWDD-----IMKLNIEEISSIR-SFVFLWCGSGE-------GLDLGRM----------CLR----KWGFRRCE--DICWIKTNKNNP----------------------GKTKTLD------------------PKAVFQRTKEHCLMGIK------------------------------------------------GTVRRSTDGDFI-----HANVDI------------------DLIITEEPE----------------------MGNIEKPVE---------IFHIIEHFCL------------------------------GRRRLHLF-GRDST------IRPGWLTVG-----------PTLTNSN
      CAOG_07090_Capsaspora_owczarzaki_ATCC_30864_470293128                   APGARWLVADILRSNLAE--L-T--------G------------QTFEGILIDAPLARS-------------------------GEPAT-----------PGMVTVDE-----LKAAGISPALIPR-GFIFMWAEKE--------WIPDLLE----------VAQ----AWGFHYVE--NICWVRHNI-------------------------NNKVSRE------------------DSRFFRKSKLLLLIFRN------------------------------------------------FDIGA------------K----------------------------QV------------------------AIPEEKPQP---------IYSTIETLLPNANATALPDGSIG-----------------PGKLLELS-WRAQPF-----LRRGWTTIS---------------HSQ
      ACA1_156200_Acanthamoeba_castellanii_str_Neff_470390089                 IPGSRYVEATIEELELEK--L-----------------------GSFEAILMDPPWDLSPRASASDKARQCHQLVPKRG-----KNG-------------ERLISPEE-----LGKWPVTDKLIPK-GFLFIWTEKE--------LIPRVLT----------MAQ----KWGFHYVE--NFAWVKFDV-------------------------NNKIHTE------------------DYAYFRKSKLTLLMLRKP-----------------------------------------------GDIEL------------RHQRNP------------------DVKFDFV------------------------RKGRERLPF---------VFDIIETLLPTAKYNPKTG---------------------KGKMLQLW-CLPQE------RRSGWTSVH----------------LS
      ACA1_219460_Acanthamoeba_castellanii_str_Neff_470419934                 PIGSTYLEGDILKMELKN--Y-----------------------GEFEAILMDPPWHTG-------------------------AKDDPARL--------PGTVTPEE-----LGKLKITDALLPK-GLAFVWVEKE--------LIPKVFA----------LMK----KWNFIYVE--NFAWVKKSV-------------------------NNKFVSQ------------------PYKYFQKSKTTLFIFRKFTAE--------------------------------------------GKDQLEL----------RHQRNP------------------DV-----------------------------------PDF---------TYHVIETLLPNAAYKEDAG---------------------RGKLLELW-GRAGSQ-----RRTGWT-------------------TI
      ACA1_366350_Acanthamoeba_castellanii_str_Neff_470407427                 PPHCVPIRADVHNPSLRSSLASAWASKQVDGQQALGKQ------VQFDVIVMDPPWQLA-------------------------GSAPTRGVALG-----YKQLHNKD-----IEKIPIPLLQ-TN-GFLFIWVINA--------RYAFALD----------LME----KWGYRFVD--DIAWVKATV-------------------------NRRMAKG------------------HGFYLQHAKETCLVGLK------------------------------------------------GEDPPNM----------RGNRCS------------------DVIFSER------------------------RGQSQKPEE---------IYHIMEALVP------------------------------NGRYLEIF-ARRNN------LRNHWVSVG----------------LE
      ACA1_149840_Acanthamoeba_castellanii_str_Neff_470510758                 AAPLVVVFVAGQQQQQKQEEEEEQEAGGE--G-EGEAEV--A--GGFGLVVVDPPWENR-------------------------SLSRSHN---------YGTLAPHE-----IAKLPVRSLLSSAGAYVVVWVTNNP-------AIHNFVKRN--------LFP----RWRVQYVA--THYWLKLTS-------------------------SGEPVMP------------------LNSAHRKPYEEL----------------------------------------------------------------------AGLRKE------------------MVVCSVA------------------------NRYARKPS------------------------------------------------------------------------------------------LDAVAAA
      PFL1715w_Plasmodium_falciparum_3D7_124806530                            STPARYIRCDLRTFDLGS--L----------D------------TKFDVILIDPPWKEYYDRKIHNLHVLNNINLDQDLNNDMNNEK-------------DKFWTLED-----LANIEIEKIAEVP-SFLFIWCGVT--------HLEDARV----------LLN----KWGYRRCE--DICWLKTNI-------------------------NEKNKKNKYLNEINN----------ENSYLQRTTEHCLVGIK------------------------------------------------GAVRRSYDIHLI-----HANLDT------------------DVIIAEETEQN--------------------IYDNNKPEE---------LYKIIEKFCL------------------------------GRRKIELF-GTNRN------IRNGWLTLG---------------KHI
      _Maritimibacter_alkaliphilus_495609045                                  ---MARMTAN-VVQQFAN--L----------R------P-G---GGFGLIMADPPWRFE-------------------------NFSAKGEGKNATAH--YECTSLDW-----IKSLPVEVLAADN-CLLWLWATNP--------MLREAFE----------VLD----AWDFEFAT--AGTWVKRTV-------------------------HGKVAFG------------------TGYVLRSSNEPFLIGKR------------------------------------------------GKPKAT-----------RSTRSTIPTYCDIDLFEGDWPKSAITIEAVA------------------------REHSRKPDE---------AFAAAEALLP------------------------------DVPRIELF-SR--Q------TRPGWRAWG-----------NQTDKFG
      TVAG_136190_Trichomonas_vaginalis_G3_154414896                          IKHSASINCDVRTFPFDK--------LGE--I------------TQFDVITMDPPWLIA-------------------------QAGITRGVAIN-----YDQLSTDI-----IGQIPLQKIQ-KN-GYIFVWVIAS--------QLENGIQ----------LLQ----NWGYEFLT--YLNWVKISK-------------------------YGRYMPS------------------HGYYLQHNKETVLIGHK------------------------------------------------GKDPENM----------RPNKFN------------------DLIIQQRS-----------------------LRQSHKPIE---------IYELIERVFP------------------------------NSMYCEIF-ARPHN------LRQGWVSVG----------------LE
      _Afipia_sp_1NLS2_496698392                                              ---MTLPAKDLLSFAGQ---------------------------RRFSTILADPPWQFT-------------------------NKTGKVAPEHKRLSR-YGTMKLDE-----IMMLPVADIAAPT-SHLYLWCPNA--------LLPEGLA----------VMK----AWGFNYKS--NIVWHKVRKD------------------------GGSDGRG------------------VGFYFRNVTEVILFGVR------------------------------------------------GKNARTLA---------PGRRQV------------------NLLATRK------------------------REHSRKPDE---------QYEIIESCSP------------------------------GP-FLELF-AR--G------TRKNWATWG-----------NQADDDY
      _Actinomyces_sp_oral_taxon_175_497433097                                AGRSSSEGTDSS-PAIPG--L----------P------P-----GGFATILVDPPWPLQSG-----------------------EKH-------------YRTMSLAR-----IKALPVGALAARD-AHLWLWTTNA--------LLPKAYE----------VAE----AWGFTVRS--PLTWVKFRL-------------------------------GLG----------------GRYQLRNATEQLLFCTR------------------------------------------------GRAPL------------GSRSQP------------------TWFNAPV------------------------TEHSRKPAE---------QFAIIERVSP------------------------------GP-YLELF-ARRRPE-----SNQPWAVWG-----------DQVASDI
      XAUT_RS18300_Xanthobacter_autotrophicus_501064335                       --------MNG-LWQFGD--L----------K------M-----FGYDLIVADPPWDFELYSEAGEGKSA--------------KAH-------------YGTMKLDE-----IAALRVGDLARGD-CLLLLWCCEW--------MPPAARQR---------VLD----AWGFTYKT--TIIWRKVTR-------------------------AGKVRMG------------------PGYRARTMHEPVIVATV------------------------------------------------GNPKH------------T--PFS------------------SVFDGVA------------------------REHSRKPEA---------FYRMVEAAAP------------------------------KAARADLF-SR--Q------RRDGWDAFG-----------NEVEKFD
      _Cenarchaeum_symbiosum_503247195                                        NRTKQNKIPQEILYQKLPN-------------------------RKFDIIYADPPWDYNGKLQYDKTDLYVSTS----------SFK-------------YPTMKTKK-----MMEIPIKKIASSN-SLLFLWATSP--------HLEQAIQ----------LGK----AWGFEYRTV-AFVWDKMNH-------------------------------N------------------PGKYTLSNCELCLLFKH------------------------------------------------GKIPTP-----------RGARNV------------------RQLITIPR-----------------------TEHSRKPVQ---------AMQGIERMFP------------------------------FQKKIELF-AR--E------KYRGWSAWG-----------LDLVLKN
      ETHHA_RS08455_Ethanoligenens_harbinense_503250901                       MSTAKETANNLLQFCGE---------------------------KKYATVYADPPWRFQ-------------------------NRTGKVAPENKKLNR-YPTMDLED-----IKALPVGKIAAEK-SHLYLWVPNA--------LLPDGLE----------VMK----AWGFEYKG--NIIWEKVRKD------------------------GEPDGRG------------------VGFYFRNVTEILLFGIR------------------------------------------------GGNNRTLA---------PARSQV------------------NLIRTQK------------------------REHSRKPDE---------IITIIESCSP------------------------------GP-YLELF-AR--G------DRENWDMWG-----------NQATAEY
      _Mycobacterium_abscessus_511283520                                      MAAPLREVNEPPPLPVTD--------------------------GGFSTILADPPWRFT-------------------------NRTGKVAPEHRRLDR-YSTLSLDE-----ICALGVSDVTADN-AHLYLWVPNA--------LLPDGLR----------VME----EWGFRYVS--NIVWSKVRRD------------------------GLPDGRG------------------VGFYFRNTTELLLFGVR------------------------------------------------GSMRTLQ----------PARSQV------------------NQIVTRK------------------------REHSRKPDE---------QYELIEACSP------------------------------GP-YLEMF-GR--Y------RRPNWAVWG-----------DEANEDV
      CAOG_04822_Capsaspora_owczarzaki_ATCC_30864_514485079                   PPHCVPIKANVLEFDWAS--------LAA--H------------CQFDVIMMDPPWQLA-------------------------SNAPTRGIALT-----YNQLPDAA-----IEDIPIASLQRNG-GFVFVWVINN--------RYAKAFD----------MLK----RWGYRFVD--SIDWVKFTV-------------------------NRRLAKC------------------HGFYLQHAKETCLIGLK------------------------------------------------GDPPPGC----------VGNVAS------------------DVIFSER------------------------RGNSQKPDE---------MYELVEALVP------------------------------NGKYLEIF-GRRNN------LRNYWVTIG----------------NE
      PTSG_05864_Salpingoeca_rosetta_514690366                                PSPCRFLLANIQHLRPHM--------Q-------D---L-----GVFDLIVMDPPWHNG-------------------------SVRRGSR---------YGTMDYDA-----IMDIPIPFLMSPR-CLLALWITNND-------RCATFVHER--------LLP----HWGLKKVT--EWKWLKVTT-------------------------QGEPVFP------------------LSSRHKRPYEVLILATNAPGAFETAPAYGIVHQWQAAMRQQQDEDRREQQQDEEKQQKLETKENEGEKQEGEQHQQVAKCSNHTEHSQ------------------QVKHAPPAVVQHGADAPIALPADLRIAGVPSLVHSEKPPA---------IHRLLVSLLTRGSNTSPLRQQQQQQQRQEEDGVLASTPRQRPRCLEVF-AR--R------LHRHWTSVG-----------NQVFKLQ
      PTSG_04805_Salpingoeca_rosetta_514693100                                STPPMSIRADPLCLDASS--L----------G------------TSFGVIYIDAPLPEYARRAP--------------------GLK-------------LDTVSWEE-----LGRLDVRGLAGEI-AFVFMWVGCSE-------GLEKGAQ----------LLR----RWGFRRCE--DICWVKTNKQQP----------------------RRRGIME------------------PHSLLQHTKEHCLLGIR------------------------------------------------GAPNRKTEPHIL-----HSNMDV------------------DVIVSEDPP----------------------IESTEKPSE---------IFAVMERMCQ------------------------------SRKRLHLF-AS--GT-----VRPGWVGVG-----------KDLPQTD
      TVAG_062450_Trichomonas_vaginalis_G3_154413191                          IKHSAAIACDVREFPFDK--------LGA--I------------TQFDVITMDPPWLIA-------------------------QASITRGVAIN-----YDQLGTDT-----ITQIPLHKIQ-KN-GYIFLWVIAS--------QLENGIQ----------ILN----KWGYEFLT--YLNWVKISK-------------------------YGRYMPS------------------HGYYLQHNKETVLIGRK------------------------------------------------GRDPENM----------RAEMFD------------------DLIIQQRG-----------------------LRQSHKPVE---------IYELIERVFP------------------------------NSMYLEIF-ARPHN------LREGWVSMG----------------LE
      RPHASCH2410_RS00155_Rhizobium_phaseoli_515104987                        ----MRLFPDL-WPFGD---L----------Q------P-----HSFDFIMADPPWKMQEWSDNGDKSKST-------------QSK-------------YRLMPLDE-----IKAMPVLDLAAPN-CLLWLWATNP--------MLPQALD----------VLH----AWGFTFAT--AGSWMKTTR-------------------------NGKQAFG------------------TGYIFRTSNEPILIGKR------------------------------------------------GEPKTT-----------RSVRSS--------------------FPGLA------------------------REHSRKPEE---------GYREAERLMP------------------------------RARRLELF-SR--T------NRVGWTTWG-----------DEVGKFG
      H156_RS0101780_Methylococcus_capsulatus_515934135                       TENTLDPAADLLERLGD---------------------------KRFRTILADPPWQFQ-------------------------NRTGKMAPEHKRLNR-YGTMSLEA-----IAGLPVERLTADT-AHLYLWVPNA--------LLLEGLK----------VME----AWGFTYKT--NLVWHKIRKD------------------------GGPDGRG------------------VGFYFRNVTELVLFGVR------------------------------------------------GKNARTLA---------AGRRQV------------------NFLATRK------------------------REHSRKPDE---------MYGIIEACSP------------------------------GP-YLELF-AR--G------ARDRWSVWG-----------NEADENY
      LOKHON_RS09140_Loktanella_hongkongensis_516541036                       --------------------M-----------------------GPFDIILADPPWRFASNSEAKPGRNP--------------RRH-------------YPCMRDEE-----ICALPVARWAAPA-ALLLMWTTSP--------MLDRSMA----------IPR----AWGFRYVS--SLVWTKDRI-------------------------------G------------------TGYWARNRHELVLICKR------------------------------------------------GRFDCP-----------RPAPFA------------------DSVISGQQ-----------------------REHSRKPDA---------LHAQIDAAWP------------------------------EARKLELF-AR--Q------ERPGWTAWG---------------NDT
      CYME_CME116C_Cyanidioschyzon_merolae_strain_10D_544210442               PPHCIPVRADVRFADWDQ--------IAAAAN------------GNYDVILMDPPWQLA-------------------------TANPTRGVALG-----YNQLSDES-----ILAIPLEKLQ-RC-GLLLIWVINA--------KYRVALQ----------MFE----RWGYRLVD--EIVWVKLTV-------------------------NRRLAKN------------------HGFYLQHAKETCLVGVK------------------------------------------------GNDLSALSTA-------PGMPRP------------------DVILSER------------------------RGQSQKPDE---------LYEWIEALVP------------------------------NGKYIEIF-ARKNN------LRNFWVSIG-----------NEVTGES
      CYME_CMH026C_Cyanidioschyzon_merolae_strain_10D_544211235               LNDGYYINCDLRYFNLAY--------LRE--C------V-----GNFDVVLIDPPWRIAGGQRASTPNGPMFTNNHW-------AVN-------------YNTLSNEE-----ILDLDIGCLS-NS-GLCFLWVVSS--------QLPTGMA----------CLS----RWGYEYID--KITWIKKRQ--------------------------GKLHVS------------------HGYHFMHSSELCLIGVK------------------------------------------------RPCEF------------IGKVSN------------------DLIFAEV------------------------REKSRKPDE---------LYHVVETMLP------------------------------GTAKIELF-ARNHN------IRRGWLSLG-----------NELGEQF
      _Thalassobacter_arenae_544667667                                        MDMSSNPSQDLRDFLSG---------------------------DSFGCVMADPPWRFT-------------------------NRTGKVAPEHKRLAR-YPTMTVED-----ICALPVSDHLMDR-AHCYMWVPNA--------LLPEGLR----------VLN----AWGFEYKS--NIIWHKIRKD------------------------GGSDGRG------------------VGFYFRNVTEILLFGVR------------------------------------------------GKNIRTLA---------PGRRQV------------------NMMQTRK------------------------REHSRKPDE---------QYELIESCSW------------------------------GP-YLELF-GR--G------IRDGWTVWG-----------NQADADY
      COCSUDRAFT_36424_Coccomyxa_subellipsoidea_C-169_545366117               ------------MSKHASAPE-----------------------GGYDCIVMDPPWENK-------------------------SAKRSGH---------YPTLPSRH-----LLSIPIARLLNQQGGLLALWVTNRE-------RLRRFVDQE--------LLA----KWGLEQVA--TWFWLKVTN-------------------------SGQLVSP------------------LEVAHRRPYEALLLARLRPSAASGS----------------------------------------DTNEE------------GRAVRN------------------MVFLAVP------------------------GEHSRKPHI---------GSLLAPHLLA------------------------------QPACLEMF-AR--E------LAADWTSWG-----------NEALRFQ
      TVAG_002370_Trichomonas_vaginalis_G3_123390303                          IKQAAPIKADIRYFDWET--------LGK--I------------CQFDVILMDPPWNIQPAQTTRGV-----------------ELG-------------YELMLESE-----IASMKIPLVQ-TN-GYCFMWVVAS--------FLPVGVS----------MLQ----GWGYKVID--FINWIKTSK-------------------------YGRYRPS------------------NGYFLQHDKETCLVGIK------------------------------------------------GKPLD------------GEDVDIFN----------------DLIIDERG-----------------------ARQSHKPPS---------LYDIIERMFP------------------------------GRLYLEIF-ARAHN------EREGWVSLG----------------LE
      EMIHUDRAFT_205550_Emiliania_huxleyi_CCMP1516_551588467                  PARSSFSLCRLAYWPRLS-------------R-IL---------PRFRCLVVDPPWPSR-------------------------SVQRAGA---------YKVLALEELAEA-LRSLP--ALCDRRGCLICVWMTNAI-------KVQEMVEQT--------LLP----AWGATKVG--LWYWLKLSP--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DG
      EMIHUDRAFT_449178_Emiliania_huxleyi_CCMP1516_551613601                  -----------------------------------------------------------------------------------------------MEGDEDEVWTPQE-----VMNLRIEAITETP-SFCFLWSGSGV-------SLQWGRA----------CLR----KWGFRRCE--DISWVKSNRAT-----------------------GRNTHFL------------------PDSVVTPTTEHCLVGIK------------------------------------------------GTVRRNYDGHII-----HANVDT------------------DVMLSEEPP----------------------YGSTEKPTE---------LYAIIEHFSN------------------------------GRRRLELF-GEDHN------LRRGWLTLG-----------KGLSSSS
      HMPREF0742_RS10030_Rothia_aeria_553802969                               MLDPMNTNEEFAPLPTVE--------------------------GGFQTVLADPPWRFT-------------------------NRTGKVAPEHHRLGR-YGTMSLDE-----IKALRVGDVTADN-AHLYLWVPNA--------LLPEGLE----------VMQ----AWGFRYVS--NIIWAKRRKD------------------------GGPDGRG------------------VGFYFRNVTEPILFGVK------------------------------------------------GSMRTLA----------PGRSTV------------------NMIETRK------------------------REHSRKPDE---------QYDLIEACSP------------------------------GP-YLELF-AR--Y------ARPGWSVWG-----------NEASNEI
      G966_02949_Escherichia_coli_UMEA_3323-1_554729604                       ------------MGWFMT--------------------------KKYTLIYADPPWVYRDKAADGNRGA---------------GFK-------------YPVMSVLD-----ICRLPVWDLADEN-CLLAMWWVPT--------QPLEALK----------VVE----AWGFRLMTMKGFTWIKCGSRQ-----------------------PDKLVMG------------------MGHMTRANSEDCLFAVK------------------------------------------------GKLPT------------R--INA------------------GIVQSFTAPR---------------------LEHSRKPDI---------VREKLVQLLG------------------------------DVSRIELF-AR--Q------TSHGFDVWG-----------NQCEDPA
      AGZ61752.1_Phormidium_phage_MIS-PhV1A_556471807                         --------------------------------------------KDYRLIVVDPPWSYSLRETDATHRG---------------RCP-------------YPSMSDEQ-----ILNLPIGAIAHQN-SYLLLWVTNN--------HLPLGFR----------CLE----RWGFTYKS--IFTWVKTTKAST----------------------EEKIAPNMG----------------IGHYGRNCTEHFLIATR------------------------------------------------GNPGSFTS---------HGLTDIR-----------------NIIFAPR------------------------SKHSEKPPE---------FWTIADRLAEHL----------------------------DGPRIELF-ARSSGLF----KREGWDSWG----------------AE
      RFI_31139_Reticulomyxa_filosa_569355952                                 PPHSVPIRADVMHFDFKA--------LAN--E------QLRISGRLFDVIMMDPPWQLA-------------------------SSNPTRGVAIG-----YEQLTDES-----ILALPIPKLQ-SD-GFLLVWTINA--------KYRLALQ----------MFK----KWGYRIVN--DIAWVKQTV-------------------------NRRIARG------------------HGFYLQHAKETCLVGFK------------------------------------------------GEEKKVGF---------VSGVCA------------------DVIYSVR------------------------RGQSQKPVE---------IYEYIERLVP------------------------------NGCYLEIF-GRRNN------LRDYWVTIG----------------NE
      F442_02656_Phytophthora_parasitica_P10297_570995458                     PAGSSFAQRDVRTLHQLA--------L-----------------GQHKLILMDPPWQNK-------------------------SVSRGKR---------YNTFDHTD-----LLRINIPHIADPNECILAVWVTNRPR------YMTYLREQ---------ALP----SWGFTYHA--CWYWLKLSK-------------------------NGELVTP------------------LDSTHRLPVETLLVAYR------------------------------------------------AKDQKHEQLL-------RRRLGEKM----------------RIVVSIP------------------------LRHSWKPPPE--------CFFDKDIMST------------------------------SHRKVELF-AR--E------LRPHWTSIG-----------NEVLKFQ
      ETSY1_42765_Candidatus_Entotheonella_sp_TSY1_575405212                  SNSPHSAADDLLA-------C---G-FPP---------------HSFSTVLADPPWRFT-------------------------NRTGKMAPEHRRLSR-YPTLTLEE-----IADLPLAQLVQPD-SHLYLWVPNA--------LLAEGLD----------VMR----RWGFTYKT--NLVWYKIRRD------------------------GGPDRRG------------------VGFYFRNVTELVLFGVR------------------------------------------------GRMRTLA----------PGRRQE------------------NLLASQK------------------------QEHSRKPDT---------FYDLIERCSP------------------------------GP-YLELF-AR--H------PRPGWHQFG----------------NE
      GbCGDNIH3_7033_Granulibacter_bethesdensis_CGDNIH3_586601520             MTKQPDPIAEFRN-------Q-----LNG---------------GNFATVLADPPWRFQ-------------------------NRTGKMAPEHRRLSR-YGTMELPE-----IMALPVSEVTAKT-AHLYLWVPNA--------LLPEGLA----------VMQ----AWGFNYKS--NLVWHKIRKD------------------------GGSDGRG------------------VGFYFRNVTELVLFGVK------------------------------------------------GKNARTEA---------PGRRQV------------------NLLATQK------------------------REHSRKPDE---------FYDIVEACSP------------------------------GP-YLELF-AR--G------TRPGWCAWG-----------NQAEEYD
      TTHERM_00704040_Tetrahymena_thermophila_SB210_586728217                 PDNSIPICSDVTKLNFQA--L-----IDA--Q------M-RHAGKMFDVIMMDPPWQLS-------------------------SSQPSRGVAIA-----YDSLSDEK-----IQNMPIQSLQ-QD-GFIFVWAINA--------KYRVTIK----------MIE----NWGYKLVD--EITWVKKTV-------------------------NGKIAKG------------------HGFYLQHAKESCLIGVK------------------------------------------------GDVDNGRF---------KKNIAS------------------DVIFSER------------------------RGQSQKPEE---------IYQYINQLCP------------------------------NGNYLEIF-ARRNN------LHDNWVSIG----------------NE
      TTHERM_00136470_Tetrahymena_thermophila_146175568                       RTSQSIIECDLRYFDFTY--L-T--------N------I-F---DSFDVVMIAPPF---------------------------------------------DAISIQE-----VFELKVELIS-KQ-GFLFFWAKDV--------PTATSYE----------IMS----KWGYDVID--QIIWVKIDKN------------------------EGKMILEDK----------------PDKYFYNSNEMCLIGVK------------------------------------------------KHPSSKGVEY-------QSKVSN------------------NIIVSHQ------------------------PQNQSCPDQ---------IYDIIDLMMP------------------------------GSKKIELF-TKQKV------IRGGWFGLE---------------HKF
      RirG_092820_Rhizophagus_irregularis_DAOM_197198w_595481916              PEWCVPIKADVLTFEWDE--------FAK--E------------CQFDVILMDPPWQLA-------------------------THAPTRGVAIA-----YQQLPDVC-----IEELPIPKLQ-KN-GFLFIWVINN--------KYSKAFE----------MMK----KWGYTYCD--DITWVKQTV-------------------------NRRMAKG------------------HGFYLQHAKETCLMGRK------------------------------------------------GEDPPGC----------NHSISS------------------DVIFSER------------------------RGQSQKPEE---------LYEMIEELVP------------------------------NGNYLEIF-GRKNN------LRDYWVTIG----------------NE
      RirG_018940_Rhizophagus_irregularis_DAOM_197198w_595497236              ATPPTYLKADLRTFDFKS--L----------G------------TKFDVILIDPPLEEYCRRSPLVA-----------------GSN-------------LDYWDYDE-----IANLKIEDAAATP-SFIFIWSGDAD-------GLDRGRQ----------LLL----KWGYRRCE--DIVWIKTNKKW-----------------------DGSHHIE------------------PRSIFQRTKEHCIMGIK------------------------------------------------GTVRRSTDGHFI-----HCNVDT------------------DVIISEEPHY---------------------GIGTAKPEE---------LYHIIEHFCL------------------------------GRRRLELF-GEDHN------IRPGWLTVG-----------LSLSSSN
      TTHERM_00558100_Tetrahymena_thermophila_SB210_118378397                 NHPPVYLKADLKYYDLSK--L-----------------------GKFDVIMMDPPWKEY-------------------------EERVQGLPIYSQYPEKFNSWDLNE-----IAALPIDEISDKP-SFLFLWVGSD--------HLDQGRE----------LFR----KWGYKRCE--DIVWVKTNKDKT----------------------KEYIELP------------------HSNLLVRVKEHCLVGLR------------------------------------------------GDVKRASDSHFI-----HANIDT------------------DVIVAEEPP----------------------LGSTQKPAE---------IYDIIERFCL------------------------------GRKRLELF-GEVHN------VRQGWLTIG-----------KLLDESN
      YCL055W_Saccharomyces_cerevisiae_S288c_6319795                          TPFGCKIGIDSIVPTLNHWI------QNE--N------------LTFDVVMI-------------------------------------------------GCLTENQFIYPILTQLPLDRLISKP-GFLFIWANSQ--------KINELTK----------LLNNE--IWAKKFRRSEELVFVPIDK-------------------------KSPFYPGLDQD--------------DETLMEKMQWHCWMCIT------------------------------------------------GTVRRSTDGHLI-----HCNVDT------------------DLSIETKD-----------------------TTNGAVPSH---------LYRIAENFST------------------------------ATRRLHIIPARTGYETPVK-VRPGWVIVS-----------PDVMLDN
      H556_RS0109535_Brevundimonas_naejangsanensis_636828153                  -------MIEPLPA------------------------------GPYSCILADPPWHHA-------------------------SRSPKGQTRRSPSHH-YRTMALAE-----IKALPVADVAAKD-CHLFLWTTGP--------HLQQAFL----------VMN----AWGFRYSSL-AFVWVKRRKQPDGDDDGVLFMD------------RRDLFTG------------------MGYTTRQNAELVLLGRR------------------------------------------------GAPKR------------LSKAIH------------------QIITAPR------------------------QEHSRKPSE---------AHSRIERYCD------------------------------GP-RLELF-AR--A------PRDGWTVWG----------------NE
      KPNIH27_19120_Klebsiella_pneumoniae_subsp_pneumoniae_KPNIH27_640854256  --------------------------------------------MNYDLIYCDPPWEYG-------------------------NRISNGAACNH-----YSTMSIDD-----LKFLPVRKLAADN-AVLAMWYTGT--------HNREAVE----------LAE----SWGFRVRTMKGFTWVKLNQNAADRFNKALSTGELVDFNDLLEMLDRETRMN------------------GGNHTRSNTEDVLIATR------------------------------------------------GTGLP------------RASASVK-----------------QVVHTCL------------------------GEHSAKPWE---------VRNRLEQLYG------------------------------DVKRIELF-AR--E------EWKGWDRWG-----------NQCNNSI
      SPRG_10355_Saprolegnia_parasitica_CBS_22365_641530415                   APQRVHFAPDMQLTELG---------------------------MQFDVIVVDPPWAEVA------------------------TSG-------------EAIWTAQD-----LARLDVNGIGAYP-GVLFLWCGSGGTYDGVHSHFDEACD----------LVA----TW-----------WAKVQDV------------------------NERSGHG------------------LV---RRSKELCLLALR------------------------------------------------DHVWRDTSGHFV-----HANVDA------------------DVVIAPT------------------------TAGRAKPAA---------FYEVVERFCL------------------------------GRRRLDVF-GS--T------ARNGWVTLD---------------RFA
      _Lachnospiraceae_bacterium_3_1_46FAA_496677811                          MPAVLFL-LELHRRRKGGYKI-----ENN---------------QKYNIIYADPPWRYQQKRLSGAA-----------------EHH-------------YPTMSVKD-----ICGLKVEEIAAKD-CVLFLWATFP--------QLPEALR----------VIK----AWGFQYKTV-AFVWLKQNKS------------------------GKGWFFG------------------LGFWTRGNAEICLLAIK------------------------------------------------GKPHR------------NSNRVH------------------QFLISPI------------------------RGHSQKPEE---------AREKIVELMG------------------------------DLPRVELF-AR--E------KTEGWDAWG-----------NEVESDI
      F820_RS0109105_Xylella_fastidiosa_653894579                             TKHKANTASDVGRDLLARHGG-----------------------QRFHTILADPPWQFQ-------------------------NRTGKMAPEHKRLSR-YGTMTLDD-----IMMLPVEQLVTDT-AHLYLWVPNA--------LLPEGIK----------VLE----AWGFSYKS--NIVWHKVRKD------------------------GGPDGRG------------------VGFYFRNVTELVLFGVR------------------------------------------------GKNARTLA---------PGRRQV------------------NFLATQK------------------------REHSRKPDE---------FYDIVESCSP------------------------------GP-FLELF-AR--G------PRDGWKVWG-----------NQADKYY
      ON05_RS35435_Acaryochloris_sp_CCMEE_5410_657200356                      ------------MPNTPPLPV-----------------------GAFSLIVVDPPWSYHLRESDKTHRG---------------RCP-------------YPSMTDEE-----IVAMPVSSIAAPD-SYLLLWTTNN--------HLPLAFK----------VME----SWGFEYKA--IHTWVKTTLD------------------------RSKIRYG------------------VGHYGRNATEHVLIGRK------------------------------------------------GKAKTFTALGL------TNIPTA--------------------FQAPL------------------------GQHSQKPEE---------FYQMADRLGDAL----------------------------GGQRIELF-SR--C------PRPGWESWG----------------AE
      K291_RS0125225_Ensifer_sp_USDA_6670_661268459                           --------MTGWFFDPLLP-------------------------LHYEMIVIDPPWGFDLYSKEGAKKSA--------------LAK-------------YELMKDAE-----VRTLPVGKLASMD-CLLYCWATAP--------QLPLAIE----------CVK----AWGFQYKS--ILVWRKTTP-------------------------SGKIRMG------------------TGYRVRTTGEVVVVATL------------------------------------------------GNPKQ------------EAIPQT--------------------IFDGIA-----------------------REHSRKPDE---------LYALCDRVMP------------------------------HARRADVF-AR--E------QREGWHAFG-----------NEVTKFN
      SS17_3321_Escherichia_coli_O157:H7_str_SS17_666005014                   ------------MT------------------------------KKYTLIYADPPWTFRDKATDGQRGA---------------SFK-------------YPVMSLLD-----ICRLPVWELAADN-CLLAMWWVPT--------QPLEALK----------VVE----AWGFRLVTMKGLTWNKCGKRQ-----------------------TDKLVMG------------------MGSTTRANSEDCLFAVK------------------------------------------------GNLPE------------R--INA------------------GIIQSFTAPR---------------------LDHSRKPDM---------AREKLVQLLG------------------------------DVPRIELF-AR--H------TSHGFDVWG-----------NQCGTPS
      EL18_01388_Nitratireductor_basaltis_667917580                           ----MHLF-DWPFGDLNP--------------------------HSFDLIMADPPWAFELRSDKGEGKSA--------------QSH-------------YKCQTLDE-----IKALPVLDLAAPD-CLLWLWATNP--------MLPQAFE----------VMA----AWGFTFKT--AGAWGKTTV-------------------------NGKLAFG------------------TGYIFRSAHEPILIGTR------------------------------------------------GEPRTT-----------KSVRSL--------------------IMGQV------------------------REHSRKPEE---------AYAAAEKLIP------------------------------NARRLELF-SR--T------DRAGWEVWG-----------DEAGKFG
      BM_Bm2284d_Brugia_malayi_671410008                                      PPFSSFIINDACVAEALI--------RYG---------------KKFDFILLDPPWENK-------------------------SVK-------------RKTVYPTYGDHRWMLDLCLPELLKES-GLLAIWVTNNAK------HLKFTDN----------MIE----YFGFEKIA--TWRWLKVTN-------------------------SGEPVYN------------------LNSQHKQPFENIVFASC------------------------------------------------VAARRHY----------MNIANE------------------FALISTPS-----------------------AIHSRKPPLFPVLQALGILEETAEQL-------------------------------------ELY-GR--Y------LLPRTITVG-----------FEAAKLQ
      GSPATT00018667001_Paramecium_tetraurelia_strain_d4-2_145532196          VKS--YINCDIRYFNLDF--------LVE--K------V-----GGFDVVLMDPPWRIKGGQQNDSSFMFTNSKF---------SLD-------------YNTMSNQE-----IMDIKIEKLS-KK-GFLFLWILNT--------QLNIAYE----------MAS----KWGYEIVD--QIIWVKLNPQ------------------------GNNVYLS------------------TGYYFMHSFEICLVGYK------------------------------------------------CPPGEHVEY--------HSKISN------------------NIIFSPV------------------------RNKSQKPIE---------MYEIIELMMP------------------------------GAKKVEIF-ARNNN------LRHGWFSIG-----------NQLGETY
      TGFOU_268840_Toxoplasma_gondii_FOU_672286124                            NTPSLCIQANLHHFDWGI--L----------G----G-------VKFDVILVDPPWQEYFDRCAAIGAT---------------NED-------------LTPWTLEE-----MLQLPVEKIGDTP-SFCFLWCGVT--------HLEDARQ----------LLH----KWGYRRCE--DICWLKTNKKAAQRRREQNAAHVNDVLDYK----ATQLVHD------------------ETSILQRTTEHCLMGIK------------------------------------------------GTVRRSQDSHFI-----HANLDT------------------DILISEQEEE---------------------VGCTRKPEE---------LYDIIERFCL------------------------------GRRRIELF-GRDWN------RRAGWVTVG-----------CEFGLTT
      JP75_07920_Devosia_riboflavina_674766351                                ----------MTAWPFGA--M----------P------M-----FSFDVVMADPPWSFDNWSEGGNAKNA--------------KAQ-------------YDCMPTPD-----IKRLPVGHLAAGD-CWLWLWATYP--------MLPDAIE----------VMD----AWGFRYVT--AGPWVKRGT-------------------------SGKLAMG------------------TGYVLRSCSEIFLIGKN------------------------------------------------GEPKT------------HARDVR------------------NVLEAPR------------------------REHSRKPDE---------AYAMAEKLFG------------------------------PGRRADLF-SR--E------TRPGWTSWG-----------NESTKFD
      AF48_RS10595_Enterobacter_aerogenes_695800969                           --------------------------MT----------------GKYTLIYADPPWSYRDKAADGDRGA---------------GFK-------------YPVMNVMD-----ICRLPVWELSADD-CLLAMWWVPT--------QPVEALK----------VVE----AWGFRLMTMKGFTWHKINKH------------------------KGNSAIG------------------MGHMTRANSEDCLFAVR------------------------------------------------GKLPERMDASICQ----H-------------------------VTAPR------------------------LENSRKPDV---------IREKLVQLLG------------------------------DVPRIELF-AR--Q------SSHGFDVWG-----------NQCIAPA
      RMATCC62417_10014_Rhizopus_microsporus_727142779                        PPRSSFIMGSMTDSSLQQLS------DYV--S--S---L-----GGADLIIIDPPWPNK-------------------------SVHRSSK---------YETQDIYD-----LFTIPMKDMINVN-SVVAVWVTNKP-------KFRKFIIHK--------LFP----AWELECKA--EWVWLKVTT-------------------------EGQCIFP------------------LDSSHKKPYEQLIIGHR------------------------------------------------QKTSD------------L--PSR------------------HVIVSVPS-----------------------LRHSRKPPL---------QDVILPYLKNKD----------------------------RPVCVEMF-AR--C------LTPGWISWG-----------NECLKFQ
      RMATCC62417_07548_Rhizopus_microsporus_727145762                        PEWCIPIKANVMTYDWDS--------LAK--E------------VQFDVIVTDPPWQLA-------------------------THAPTRGVAIA-----YQQLPDIC-----IEEIPIPKLQ-KN-GFLFIWVINN--------KYAKAFE----------LME----KWGYTYVD--DITWVKQTV-------------------------NRRMAKG------------------HGYYLQHAKETCLVGKK------------------------------------------------GQDPPNC----------RHSVGS------------------DVIFSER------------------------RGQSQKPEE---------LYELIEELVP------------------------------NGKYLEIF-GRKNN------LRDYWVTIG----------------NE
      MVEG_02535_Mortierella_verticillata_NRRL_6337_672827354                 PGSRYEETNNVLDMDLKR--F----------G------------TDYQVIYMDPPLLRA-------------------------GEEPG-----------PNKITMEQ-----LATLDIGSIL-PK-GFLFVWIEKE--------FLPDIVR----------LAE----RWEFRYVE--NFCWIKRNV-------------------------NNLIARE------------------PSPYFNSSKLSCLIFRKE-----------------------------------------------GDVEL------------RHQRSP------------------DCVFDFPKPVNAA------------------TLSEEKPKF---------MYELIETLLPQAVYSESNPN--------------------GDKMMELW-ARPGT------RRKGWTSIC---------------QTK
      OT_ostta06g01320_Ostreococcus_tauri_693499469                           VPDCVHMQKNIKSMKYET--L----------G------------TDYLGVLLNPPWDIE-------------------------NSPD------------RGDVTVDD-----IEAIPLEKLT-PL-GFIFIWVEKE--------NLSKVCD----------VMH----EKNFVYVE--NLTWVHLKP-------------------------NNTIVES------------------AARYLGRSHRTLLIFRRDVRDKRFVEG--------------------------------------KKIEL------------RHQRNSDVTL--------------DIIQTTK------------------------SGRRAIPEH---------VYKSIETLLPKAYEPGT-----------------------PGKLLELW-AEPGA------KRAGWTSVA----------------DS
      GLOINDRAFT_123982_Rhizophagus_irregularis_DAOM_181602_552937933         IHGSRYFENDILSMDLKK--L----------G------------QDFQAVYIDPPFLLP-------------------------DEEPS-----------PEKITLQQ-----FESLKVPDIV-PK-GFLFIWVEKE--------FIPDIVQ----------IAE----KWNFRYVE--NFCWIKKHI-------------------------NNQISRA------------------PYRYFNKSKLSLLIFRKE-----------------------------------------------GDIEL------------RHQRNP------------------DCVFDFIKPRTLE------------------MLTEGKPRV---------MYDIIETMLPQAVYNEQNPN--------------------GDKLLELW-AKKGS------HRQGWTTVV----------------QI
      RMATCC62417_05354_Rhizopus_microsporus_727148790                        IVGSRYYEVDNIVSTDLTQ-Y----------G------------TDFNAVYMDPPFLLP-------------------------GEEPV-----------AGKITIDD-----FGALNVADIV-KA-GFLFIWLEKE--------WIQRVVN----------ITA----KWGFKYVE--NFCWIKKDV-------------------------NNQIHKS------------------PYRYFNKSKLSLLIFRKE-----------------------------------------------GDIEL------------RHQRNP------------------DCVFDFIRPKLPD------------------EISEKKPPF---------MYKVIETLLPTANYHIENNPN-------------------GERLLELW-AKKGQ------KRQGWTTVV----------------ER
      GSPATT00005554001_Paramecium_tetraurelia_strain_d4-2_145487402          YNGKNQLSANLMKQIKEGN-H----------Q------L-----FKLRIIVLKKILGTDLSQ---------------------YVKGVQGIFI--------DNLFRKD-----LKNLDLSKKLISN-GILFIWSDKG--------LINEILE----------IME----SKGFTYIE--NLVVVQLSLEKALEELNKHMKIEQ----------TEEAVLDNLNFLQQKVQVKDLIVNCPSKVLNQSKQVLIMFRK------------------------------------------------FDEQKTQLEL-------RHQRTP------------------DVLFDFVSN----------------------GKKSEKSKEY--------IYQTIETLLP------------------------------KSQLMEIF-AQRDQ------PRKDWISVC---------------ESK
      T424_RS0114345_Rhizobium_undicola_653315751                             NTDAPSPSDDFTN-------F-----ISG---------------RKFATIMADPPWQFM-------------------------NRTGKVAPEHKRLNR-YGTMELDA-----IKALPVATACAPT-AHLYLWVPNA--------LLPEGLE----------VMK----AWGFNYKA--NIVWHKLRKD------------------------GGSDGRG------------------VGFYFRNVTELILFGTR------------------------------------------------GKNARTLP---------PGRSQV------------------NYIGTRK------------------------REHSRKPDE---------QYPLIESCSP------------------------------GP-YLEMF-GR--G------LRKGWTTWG-----------NQADETY
      STYLEM_4843_Stylonychia_lemnae_678336696                                GLAQVIHPKEGIVKTNFDN-Y----------A------------KNVEAILINPCWVTQKDK----------------------KGK-------------VKGVTMDE-----FQQLNFSKNLMID-GLIFVWVEKE--------IISPVIK----------YFE----SQGLIYVE--NVCWVMLDQTKKEQVEATQSIDV-----------SPAYIRD------------------DYQYIRKSHKTLLMFRRLQKKN-------------------------------------------GNPLEL-----------RHQRTC------------------DVCFDFVDT----------------------NVHNYKPNEY--------LYKLIETLLPQSIVDEEKK---------------------HLRMIELW-AQDPK------PRKGWI------------------KFI
      consensus/100%                                                          .................................................................................................................h..h..........s.hh.W.....................................b...........h.........................................................................................................................................................................................................................................................................................................
      consensus/95%                                                           ..............................................h..lhhcPPh...............................................h.........h..h.l..h.....s.hhhW.............................hh......Wshp........W.+.p..........................................................s.b.hhh.................................................................................................................................P..................h.....................................chh.sp....................................
      consensus/90%                                                           ..............................................a..lhhDPPh...............................................h.........h..h.l..l.....s.lhhWh............h...............hh......Wuhp......h.W.K.p.........................................................ps.c.hlh..p...........................................................................................h.s.............................p.KP..................h....................................hcla.up.............W.shs..................
      consensus/85%                                                           .........p....................................a.hlhhDPPa............................................b..h...p.....h..l.l..l.....shlhlWh.s..........h....p..........hh......WGhpb.....h.WhK.p................................s.......................pps.E.hlhu.p................................................sp.......................................phh.s............................pspKP............b..hc.h....................................l-lF.uc..........p..W.shu..................
      consensus/80%                                                           .........s....................................ashlhhDPPW............................p...............h..hs.pp.....l..lsl..l...p.uhlhlWh.s..........h..sbp..........hh......WGapb.p...h.WlK.s............................p...s...................s.h.pps.E.hLhu.+................................................up...................p...................phl.s............................popKP.b.........hb.hh-.h...................................blElF.uR..........p.sW.shG..................
      consensus/75%                                                           .........sh...................................FshlhhDPPWpb..........................p...............a..hs.pc.....l..lsl.pl...p.uhlhlWhss..........h.bsbp..........hhp.....WGapbhp..ph.WlK.s............................p...s...................s.h.ppspE.hLhu.+................................................Gp...................ss..................pllhs............................popKP.b.........ha.hh-ph.s.................................blElF.uR..p.......p.sW.shG..................
      consensus/70%                                                           ...s..h.s-h..hp..............................pFslIhhDPPWpb..........................p...............Ysshs.p-.....l..lsl.pl...s.uhlalWhss..........h.bubp..........hhp.....WGaphhp..ph.WlK.s............................p..bs...................s.hhppspE.hLlu.+................................................Gps..................ss..................sllhs............................pSpKP.b.........ha.hlEph.s..............................ss.blElF.uR..p.......+.sW.shG..................
      
      
      Back to Contents
    • Phyletic distribution and gene neighborhoods of the MT-A70 clade (Clade 1)of adenine methylases

      Eukaryotic versions (For a proper resolution of the relationships please refer to tree
      GI                Domain-architecture            Pfam                         Gene name             Len   Taxonomy                                              Species name                                   Genbank
      # 178; METTL3/METTL14                                                                                                                                                                                                                        
      159466562         CCCH+CCCH+N6-MTase             MT-A70                       CHLREDRAFT_128290     367   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      hypothetical protein CHLREDRAFT_128290 [Chlamydomonas reinhardtii].
      300265564         CCCH+CCCH+N6-MTase             MT-A70                       VOLCADRAFT_59216      287   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_59216 [Volvox carteri f. nagariensis].
      15236910          CCCH+CCCH+N6-MTase             MT-A70                       AT4G10760             685   eukaryota>viridiplantae                               Arabidopsis thaliana                           N6-adenosine-methyltransferase MT-A70-like protein [Arabidopsis thaliana].
      302815848         CCCH+CCCH+N6-MTase             MT-A70                       SELMODRAFT_130187     383   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_130187 [Selaginella moellendorffii].
      168053183         CCCH+CCCH+N6-MTase             MT-A70                       PHYPADRAFT_40965      556   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein, partial [Physcomitrella patens].
      641538296         CCCH+CCCH+N6-MTase             MT-A70                       SPRG_03347            394   eukaryota>stramenopiles                               Saprolegnia parasitica CBS 223.65              hypothetical protein SPRG_03347 [Saprolegnia parasitica CBS 223.65].
      Aque1000027926    CCCH+CCCH+N6-MTase             MT-A70                       Aque1000027926        510   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.228194                            
      Lgig1000006160    CCCH+CCCH+N6-MTase             MT-A70                       Lgig1000006160        526   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.88.185.1                         
      21355141          CCCH+CCCH+N6-MTase             MT-A70                       Dmel_CG5933           608   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        inducer of meiosis 4 [Drosophila melanogaster].
      307194509         CCCH+CCCH+N6-MTase             MT-A70                       EAI_16445             549   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          N6-adenosine-methyltransferase 70 kDa subunit [Harpegnathos saltator].
      307182701         CCCH+CCCH+N6-MTase             MT-A70                       EAG_11443             548   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          N6-adenosine-methyltransferase 70 kDa subunit [Camponotus floridanus].
      189238819         CCCH+CCCH+N6-MTase             MT-A70                       LOC656280             540   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit [Tribolium castaneum].
      193683437         CCCH+CCCH+N6-MTase             MT-A70                       LOC100159080          550   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit [Acyrthosiphon pisum].
      110749760         CCCH+CCCH+N6-MTase             MT-A70                       LOC551911             556   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit-like [Apis mellifera].
      158290414         CCCH+CCCH+N6-MTase             MT-A70                       AgaP_AGAP002895       621   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP002895-PA [Anopheles gambiae str. PEST].
      156548054         CCCH+CCCH+N6-MTase             Pox_A31+MT-A70               LOC100122395          527   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                            PREDICTED: similar to n6-adenosine-methyltransferase ime4 [Nasonia vitripennis].
      72085101          CCCH+CCCH+N6-MTase             MT-A70                       LOC589354             242   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to (N6-adenosine)-methyltransferase, partial [Strongylocentrotus purpuratus].
      321476680         CCCH+CCCH+N6-MTase             MT-A70                       DAPPUDRAFT_192406     537   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_192406 [Daphnia pulex].
      321452889         CCCH+CCCH+N6-MTase             MT-A70                       DAPPUDRAFT_305196     260   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_305196 [Daphnia pulex].
      321452885         CCCH+CCCH+N6-MTase             MT-A70                       DAPPUDRAFT_66395      225   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_66395, partial [Daphnia pulex].
      156398086         CCCH+CCCH+N6-MTase             MT-A70                       NEMVEDRAFT_v1g33607   237   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      221126117         CCCH+CCCH+N6-MTase             MT-A70                       LOC100197970          335   eukaryota>metazoa>cnidaria                            Hydra magnipapillata                           PREDICTED: similar to Methyltransferase like 3, partial [Hydra magnipapillata].
      47086489          CCCH+CCCH+N6-MTase             MT-A70                       mettl3                584   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    N6-adenosine-methyltransferase subunit METTL3 [Danio rerio].
      597501008         CCCH+CCCH+N6-MTase             MT-A70                       -                     584   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    RecName: Full=N6-adenosine-methyltransferase subunit METTL3; AltName: Full=N6-adenosine-methyltransferase 70 kDa subunit; Short=MT-A70.
      47227445          CCCH+CCCH+N6-MTase             MT-A70                       GSTEN:00024364:G:001  530   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      114651893         CCCH+CCCH+N6-MTase             MT-A70                       METTL3                558   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: methyltransferase like 3 isoform 5 [Pan troglodytes].
      114651897         CCCH+CCCH+N6-MTase             MT-A70                       METTL3                505   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: methyltransferase like 3 isoform 4 [Pan troglodytes].
      67078430          CCCH+CCCH+N6-MTase             MT-A70                       Mettl3                580   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              N6-adenosine-methyltransferase 70 kDa subunit [Rattus norvegicus].
      21361827          CCCH+CCCH+N6-MTase             MT-A70                       METTL3                580   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   N6-adenosine-methyltransferase 70 kDa subunit [Homo sapiens].
      327285111         CCCH+CCCH+N6-MTase             MT-A70                       mettl3                569   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit [Anolis carolinensis].
      77627973          CCCH+CCCH+N6-MTase             MT-A70                       Mettl3                580   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   N6-adenosine-methyltransferase subunit METTL3 [Mus musculus].
      114651889         CCCH+CCCH+N6-MTase             MT-A70                       METTL3                580   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: n6-adenosine-methyltransferase 70 kDa subunit isoform 3 [Pan troglodytes].
      114651891         CCCH+CCCH+N6-MTase             MT-A70                       METTL3                592   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: methyltransferase like 3 isoform 2 [Pan troglodytes].
      198413310         CCCH+CCCH+N6-MTase             MT-A70                       LOC100176137          305   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: similar to methyltransferase like 3 [Ciona intestinalis].
      210086854         CCCH+CCCH+N6-MTase             MT-A70                       BRAFLDRAFT_288147     554   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_288147 [Branchiostoma floridae].
      219434179         CCCH+CCCH+N6-MTase             MT-A70                       BRAFLDRAFT_217213     568   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_217213 [Branchiostoma floridae].
      Smar1000006443    CCCH+CCCH+N6-MTase             MT-A70                       Smar1000006443        559   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                             SMAR007641-PA pep:novel scaffold:Smar1:JH431796:151221:153694:-1 gene:SMAR007641 transcript:SMAR007641-RA
      Hrob1000005423    CCCH+CCCH+N6-MTase             MT-A70                       Hrob1000005423        264   eukaryota>metazoa>annelida                            Helobdella robusta                             68987                                  
      Hrob1000012135    CCCH+CCCH+N6-MTase             MT-A70                       Hrob1000012135        262   eukaryota>metazoa>annelida                            Helobdella robusta                             121421                                 
      Caps1000024491    CCCH+CCCH+N6-MTase             MT-A70                       Caps1000024491        517   eukaryota>metazoa>annelida                            Capitella spI                                  fgenesh1_pg.C_scaffold_975000002       
      576692605         CCCH+CCCH+N6-MTase             MT-A70                       EGR_08908             500   eukaryota>metazoa                                     Echinococcus granulosus                        N6-adenosine-methyltransferase subunit [Echinococcus granulosus].
      238661046         CCCH+CCCH+N6-MTase             MT-A70                       Smp_146300            630   eukaryota>metazoa                                     Schistosoma mansoni                            expressed protein [Schistosoma mansoni].
      340369522         CCCH+CCCH+N6-MTase             MT-A70                       LOC100641238          509   eukaryota>metazoa                                     Amphimedon queenslandica                       PREDICTED: N6-adenosine-methyltransferase subunit METTL3-like [Amphimedon queenslandica].
      674587266         CCCH+CCCH+N6-MTase             MT-A70                       HmN_000529400         505   eukaryota>metazoa                                     Hymenolepis microstoma                         n6 adenosine methyltransferase 70 kDa [Hymenolepis microstoma].
      674263102         CCCH+CCCH+N6-MTase             MT-A70                       EmuJ_000651200        517   eukaryota>metazoa                                     Echinococcus multilocularis                    n6 adenosine methyltransferase 70 kDa [Echinococcus multilocularis].
      674567341         CCCH+CCCH+N6-MTase             MT-A70                       EgrG_000651200        393   eukaryota>metazoa                                     Echinococcus granulosus                        n6 adenosine methyltransferase 70 kDa [Echinococcus granulosus].
      485635833         CCCH+CCCH+N6-MTase             MT-A70                       EMIHUDRAFT_232802     315   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_232802 [Emiliania huxleyi CCMP1516].
      551562135         CCCH+CCCH+N6-MTase             MT-A70                       EMIHUDRAFT_211665     315   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_211665 [Emiliania huxleyi CCMP1516].
      Uram1000000276    CCCH+CCCH+N6-MTase             MT-A70                       Uram1000000276        372   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.1_#_1214_#_combest_scaffold_1_3769
      Spun1000003158    CCCH+CCCH+N6-MTase             MT-A70                       Spun1000003158        581   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (581 aa)
      Wseb1000002161    CCCH+CCCH+N6-MTase             MT-A70                       Wseb1000002161        416   eukaryota>fungi>basidiomycota                         Wallemia sebi                                  estExt_fgenesh1_kg.C_70035             
      242208543         CCCH+CCCH+N6-MTase             MT-A70                       POSPLDRAFT_25577      295   eukaryota>fungi>basidiomycota                         Postia placenta Mad-698-R                      predicted protein, partial [Postia placenta Mad-698-R].
      527302065         CCCH+CCCH+N6-MTase             MT-A70                       FOMPIDRAFT_1022720    566   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_1022720 [Fomitopsis pinicola FP-58527 SS1].
      58268080          CCCH+CCCH+N6-MTase             MT-A70                       CNE03860              406   eukaryota>fungi>basidiomycota                         Cryptococcus neoformans var. neoformans JEC21  mRNA methyltransferase [Cryptococcus neoformans var. neoformans JEC21].
      299747281         CCCH+CCCH+N6-MTase             MT-A70                       CC1G_14583            596   eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               m6a methyltransferase [Coprinopsis cinerea okayama7#130].
      71017811          CCCH+CCCH+N6-MTase             MT-A70                       UM02989.1             395   eukaryota>fungi>basidiomycota                         Ustilago maydis 521                            hypothetical protein UM02989.1 [Ustilago maydis 521].
      Abis1000003455    CCCH+CCCH+N6-MTase             MT-A70                       Abis1000003455        312   eukaryota>fungi>basidiomycota                         Agaricus bisporus                              e_gw1.4.1246.1                         
      164649925         CCCH+CCCH+N6-MTase             MT-A70                       LACBIDRAFT_243879     309   eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein, partial [Laccaria bicolor S238N-H82].
      50545848          CCCH+CCCH+N6-MTase             MT-A70                       YALI0B03498g          587   eukaryota>fungi>ascomycota                            Yarrowia lipolytica CLIB122                    YALI0B03498p [Yarrowia lipolytica CLIB122].
      50310943          CCCH+N6-MTase                  MT-A70                       KLLA0F09097g          524   eukaryota>fungi>ascomycota                            Kluyveromyces lactis NRRL Y-1140               hypothetical protein [Kluyveromyces lactis NRRL Y-1140].
      6321246           CCCH+N6-MTase                  MT-A70                       YGL192W               600   eukaryota>fungi>ascomycota                            Saccharomyces cerevisiae S288c                 Ime4p [Saccharomyces cerevisiae S288c].
      50412773          CCCH+N6-MTase                  MT-A70                       DEHA0B04491           536   eukaryota>fungi>ascomycota                            Debaryomyces hansenii CBS767                   hypothetical protein DEHA0B04491 [Debaryomyces hansenii CBS767].
      1174426           CCCH+CCCH+N6-MTase             MT-A70                       -                     600   eukaryota>fungi>ascomycota                            Saccharomyces cerevisiae S288c                 RecName: Full=N6-adenosine-methyltransferase IME4.
      150864816         CCCH+CCCH+N6-MTase             MT-A70                       PICST_57562           531   eukaryota>fungi>ascomycota                            Scheffersomyces stipitis CBS 6054              activator of IME1 Predicted N6-adenine RNA methylase IME4 [Scheffersomyces stipitis CBS 6054].
      50284965          CCCH+N6-MTase                  MT-A70                       CAGL0A03300g          488   eukaryota>fungi>ascomycota                            Candida glabrata CBS 138                       hypothetical protein [Candida glabrata CBS 138].
      45198691          CCCH+N6-MTase                  MT-A70                       AGOS_AFR173W          559   eukaryota>fungi>ascomycota                            Eremothecium gossypii ATCC 10895               AFR173Wp [Eremothecium gossypii ATCC 10895].
      68466659          CCCH+CCCH+N6-MTase             MT-A70                       CaO19.1476            543   eukaryota>fungi>ascomycota                            Candida albicans SC5314                        hypothetical protein CaO19.1476 [Candida albicans SC5314].
      Adig1000001851    CCCH+CCCH+N6-MTase             MT-A70                       Adig1000001851        262   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.03360                           
      514696822         CCCH+CCCH+N6-MTase             MT-A70                       PTSG_03395            797   eukaryota>choanoflagellida                            Salpingoeca rosetta                            N6-adenosine-methyltransferase 70 kDa subunit [Salpingoeca rosetta].
      145534770         CCCH+CCCH+N6-MTase             Methyltransf_26+MT-A70       GSPATT00019450001     493   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145529029         CCCH+CCCH+N6-MTase             Methyltransf_26+MT-A70       GSPATT00017263001     539   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      586734236         CCCH+CCCH+N6-MTase             MT-A70                       TTHERM_00962190       741   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  N6-adenosine-methyltransferase 70 kDa subunit (macronuclear) [Tetrahymena thermophila SB210].
      672280105         CCCH+CCCH+N6-MTase             MT-A70                       TGFOU_217350          453   eukaryota>alveolata>apicomplexa                       Toxoplasma gondii FOU                          putative methyltransferase MTA70, partial [Toxoplasma gondii FOU].
      124512114         CCCH+CCCH+N6-MTase             MT-A70                       PF07_0123             760   eukaryota>alveolata>apicomplexa                       Plasmodium falciparum 3D7                      mRNA (N6-adenosine)-methyltransferase, putative [Plasmodium falciparum 3D7].
      221485567         CCCH+CCCH+N6-MTase             MT-A70                       TGGT1_028850          823   eukaryota>alveolata>apicomplexa                       Toxoplasma gondii GT1                          N6-adenosine-methyltransferase 70 kDa subunit, putative [Toxoplasma gondii GT1].
      156087837         CCCH+CCCH+N6-MTase             MT-A70                       BBOV_III001900        641   eukaryota>alveolata>apicomplexa                       Babesia bovis T2Bo                             MT-A70 family protein [Babesia bovis T2Bo].
      307110630         N6-MTase                       MT-A70                       CHLNCDRAFT_50385      309   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_50385 [Chlorella variabilis].
      116060398         N6-MTase                       MT-A70                       Ot11g01290            371   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             Predicted N6-adenine methylase involved in transcription regulation (ISS) [Ostreococcus tauri].
      308809243         N6-MTase                       MT-A70                       Ot11g01290            371   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             Predicted N6-adenine methylase involved in transcription regulation (ISS) [Ostreococcus tauri].
      255086517         N6-MTase                       MT-A70                       MICPUN_76124          165   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein, partial [Micromonas sp. RCC299].
      158275861         N6-MTase                       MT-A70                       CHLREDRAFT_174824     331   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      predicted protein [Chlamydomonas reinhardtii].
      300268372         N6-MTase                       MT-A70                       VOLCADRAFT_116042     245   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_116042, partial [Volvox carteri f. nagariensis].
      159474530         N6-MTase                       MT-A70                       CHLREDRAFT_174824     331   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      predicted protein [Chlamydomonas reinhardtii].
      303284481         N6-MTase                       MT-A70                       MICPUCDRAFT_51320     357   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    hypothetical protein MICPUCDRAFT_51320 [Micromonas pusilla CCMP1545].
      Cmer1000000605    N6-MTase                       MT-A70                       Cmer1000000605        393   eukaryota>rhodophyta                                  Cyanidioschyzon merolae                        CME116C similar to (N6-adenosine)-methyltransferase
      452822052         N6-MTase                       MT-A70                       Gasu_34680            285   eukaryota>rhodophyta                                  Galdieria sulphuraria                          mRNA (2'-O-methyladenosine-N6-)-methyltransferase [Galdieria sulphuraria].
      544210442         N6-MTase                       MT-A70                       CYME_CME116C          392   eukaryota>rhodophyta                                  Cyanidioschyzon merolae strain 10D             similar to (N6-adenosine)-methyltransferase [Cyanidioschyzon merolae strain 10D].
      569355952         N6-MTase                       MT-A70                       RFI_31139             322   eukaryota>rhizaria                                    Reticulomyxa filosa                            MT-A70 family protein, partial [Reticulomyxa filosa].
      121913835         N6-MTase                       MT-A70                       TVAG_062450           392   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      123390303         N6-MTase                       MT-A70                       TVAG_002370           412   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      154414896         N6-MTase                       MT-A70                       TVAG_136190           397   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      121880798         N6-MTase                       MT-A70                       TVAG_002370           412   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      121908694         N6-MTase                       MT-A70                       TVAG_389630           359   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      154413191         N6-MTase                       MT-A70                       TVAG_062450           392   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      121914692         N6-MTase                       MT-A70                       TVAG_136190           397   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       MT-A70 family protein [Trichomonas vaginalis G3].
      Aque1000012323    N6-MTase                       MT-A70                       Aque1000012323        451   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.212591                            
      307170446         N6-MTase                       MT-A70                       EAG_00575             205   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Methyltransferase-like protein KIAA1627-like protein [Camponotus floridanus].
      307177286         N6-MTase                       MT-A70                       EAG_12613             382   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Methyltransferase-like protein KIAA1627-like protein [Camponotus floridanus].
      19920926          N6-MTase                       MT-A70                       Dmel_CG7818           397   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG7818 [Drosophila melanogaster].      
      158297043         N6-MTase                       MT-A70                       AgaP_AGAP008111       392   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP008111-PA [Anopheles gambiae str. PEST].
      48138147          N6-MTase                       MT-A70                       LOC409900             390   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: methyltransferase-like protein 14 homolog [Apis mellifera].
      193577905         N6-MTase                       MT-A70                       LOC100163326          387   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: methyltransferase-like protein 14 homolog [Acyrthosiphon pisum].
      193664699         N6-MTase                       MT-A70                       LOC100169342          366   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: methyltransferase-like protein 14 homolog [Acyrthosiphon pisum].
      91089719          N6-MTase                       MT-A70                       LOC663857             390   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: methyltransferase-like protein 14 homolog [Tribolium castaneum].
      156553899         N6-MTase                       MT-A70                       LOC100117110          390   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                            PREDICTED: methyltransferase-like protein 14 homolog [Nasonia vitripennis].
      291232903         N6-MTase                       MT-A70                       LOC100376676          456   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: methyltransferase-like protein 14-like [Saccoglossus kowalevskii].
      115935304         N6-MTase                       MT-A70                       LOC579598             363   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to MGC79735 protein [Strongylocentrotus purpuratus].
      321457952         N6-MTase                       MT-A70                       DAPPUDRAFT_130106     168   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_130106 [Daphnia pulex].
      221131975         N6-MTase                       MT-A70                       LOC100197023          446   eukaryota>metazoa>cnidaria                            Hydra vulgaris                                 PREDICTED: N6-adenosine-methyltransferase subunit METTL14-like [Hydra vulgaris].
      42517136          N6-MTase                       MT-A70                       Mettl14               456   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   methyltransferase like 14 [Mus musculus].
      327274188         N6-MTase                       MT-A70                       mettl14               456   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: N6-adenosine-methyltransferase subunit METTL14 [Anolis carolinensis].
      24308265          N6-MTase                       MT-A70                       METTL14               456   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   N6-adenosine-methyltransferase subunit METTL14 [Homo sapiens].
      109467560         N6-MTase                       MT-A70                       RGD1304822_predicted  456   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to CG7818-PA [Rattus norvegicus].
      71896697          N6-MTase                       MT-A70                       METTL14               459   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  N6-adenosine-methyltransferase subunit METTL14 [Gallus gallus].
      224049178         N6-MTase                       MT-A70                       METTL14               459   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: methyltransferase-like protein 14 [Taeniopygia guttata].
      46309507          N6-MTase                       MT-A70                       mettl14               455   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    N6-adenosine-methyltransferase subunit METTL14 [Danio rerio].
      326918994         N6-MTase                       MT-A70                       LOC100540744          490   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: methyltransferase-like protein 14-like [Meleagris gallopavo].
      47207634          N6-MTase                       MT-A70                       GSTEN:00009248:G:001  241   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      114595809         N6-MTase                       MT-A70                       METTL14               456   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: N6-adenosine-methyltransferase subunit METTL14 [Pan troglodytes].
      219440655         N6-MTase                       MT-A70                       BRAFLDRAFT_58874      164   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_58874 [Branchiostoma floridae].
      198424026         N6-MTase                       MT-A70                       LOC100176240          474   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: methyltransferase-like protein 14 homolog [Ciona intestinalis].
      Smar1000012137    N6-MTase                       MT-A70                       Smar1000012137        431   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                             SMAR002698-PA pep:novel scaffold:Smar1:JH431216:18104:19856:1 gene:SMAR002698 transcript:SMAR002698-RA
      Caps1000008038    N6-MTase                       MT-A70                       Caps1000008038        338   eukaryota>metazoa>annelida                            Capitella spI                                  fgenesh1_pg.C_scaffold_355000005       
      Hrob1000012559    N6-MTase                       MT-A70                       Hrob1000012559        369   eukaryota>metazoa>annelida                            Helobdella robusta                             79167                                  
      576693404         N6-MTase                       MT-A70                       EGR_08094             418   eukaryota>metazoa                                     Echinococcus granulosus                        N6-adenosine-methyltransferase subunit [Echinococcus granulosus].
      340382361         N6-MTase                       MT-A70                       LOC100635916          450   eukaryota>metazoa                                     Amphimedon queenslandica                       PREDICTED: methyltransferase-like protein 14 homolog [Amphimedon queenslandica].
      Sarc1000000133    N6-MTase                       MT-A70                       Sarc1000000133        361   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (361 aa)
      290979461         N6-MTase                       MT-A70                       NAEGRDRAFT_72415      451   eukaryota>heterolobosea                               Naegleria gruberi strain NEG-M                 predicted protein [Naegleria gruberi strain NEG-M].
      284086029         N6-MTase                       MT-A70                       NAEGRDRAFT_72415      451   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi]. 
      551613601         N6-MTase                       MT-A70                       EMIHUDRAFT_449178     245   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_449178 [Emiliania huxleyi CCMP1516].
      Mver1000003113    N6-MTase                       MT-A70                       Mver1000003113        309   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 mRNA (2'-O-methyladenosine-N6-)-methyltransferase (309 aa)
      Pisp1000005866    N6-MTase                       MT-A70                       Pisp1000005866        165   eukaryota>fungi>neocallimastigomycota                 Piromyces sp                                   estExt_Genewise1Plus.C_1020015         
      Uram1000008906    N6-MTase                       MT-A70                       Uram1000008906        267   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.75_#_199_#_combest_scaffold_75_134233
      Bcir1000003454    N6-MTase                       MT-A70                       Bcir1000003454        264   eukaryota>fungi>mucoromycotina                        Backusella circina                             estExt_fgenesh1_pg.C_270053            
      Lhya1000001414    N6-MTase                       MT-A70                       Lhya1000001414        272   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         estExt_Genewise1.C_100084              
      Crev1000002253    N6-MTase                       Mnd1+MT-A70                  Crev1000002253        412   eukaryota>fungi>kickxellomycotina                     Coemansia reversa                              fgenesh1_kg.9_#_19_#_isotig04348       
      595497236         N6-MTase                       MT-A70                       RirG_018940           470   eukaryota>fungi>glomeromycota                         Rhizophagus irregularis DAOM 197198w           Kar4p [Rhizophagus irregularis DAOM 197198w].
      595481916         N6-MTase                       MT-A70                       RirG_092820           303   eukaryota>fungi>glomeromycota                         Rhizophagus irregularis DAOM 197198w           Ime4p [Rhizophagus irregularis DAOM 197198w].
      Ccor1000007140    N6-MTase                       MT-A70                       Ccor1000007140        165   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         CE21212_12273                          
      Spun1000000693    N6-MTase                       MT-A70                       Spun1000000693        357   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (357 aa)
      Spun1000006218    N6-MTase                       MT-A70                       Spun1000006218        358   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (358 aa)
      164645185         N6-MTase                       MT-A70                       LACBIDRAFT_319029     563   eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein [Laccaria bicolor S238N-H82].
      Wseb1000004329    N6-MTase                       MT-A70                       Wseb1000004329        501   eukaryota>fungi>basidiomycota                         Wallemia sebi                                  estExt_fgenesh1_kg.C_210042            
      58271296          N6-MTase                       MT-A70                       CNI00700              586   eukaryota>fungi>basidiomycota                         Cryptococcus neoformans var. neoformans JEC21  transcription regulator [Cryptococcus neoformans var. neoformans JEC21].
      527293720         N6-MTase                       MT-A70                       FOMPIDRAFT_1134066    613   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_1134066 [Fomitopsis pinicola FP-58527 SS1].
      299741172         N6-MTase                       MT-A70                       CC1G_11190            708   eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               transcription regulator [Coprinopsis cinerea okayama7#130].
      220730232         N6-MTase                       MT-A70                       POSPLDRAFT_106379     611   eukaryota>fungi>basidiomycota                         Postia placenta Mad-698-R                      predicted protein [Postia placenta Mad-698-R].
      116504623         N6-MTase                       MT-A70                       CC1G_11190            680   eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               hypothetical protein CC1G_11190 [Coprinopsis cinerea okayama7#130].
      Abis1000008599    N6-MTase                       MT-A70                       Abis1000008599        649   eukaryota>fungi>basidiomycota                         Agaricus bisporus                              Genemark.8250_g                        
      Pbla1000007785    N6-MTase                       MT-A70                       Pbla1000007785        331   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       fgeneshPB_pg.16__43                    
      Mcir1000003212    N6-MTase                       MT-A70                       Mcir1000003212        195   eukaryota>fungi>basal                                 Mucor circinelloides                           e_gw1.02.942.1                         
      50548917          N6-MTase                       MT-A70                       YALI0C17017g          311   eukaryota>fungi>ascomycota                            Yarrowia lipolytica CLIB122                    YALI0C17017p [Yarrowia lipolytica CLIB122].
      150864497         N6-MTase                       MT-A70                       PICST_35413           385   eukaryota>fungi>ascomycota                            Scheffersomyces stipitis CBS 6054              hypothetical protein PICST_35413 [Scheffersomyces stipitis CBS 6054].
      46434912          N6-MTase                       MT-A70                       CaO19.3736            369   eukaryota>fungi>ascomycota                            Candida albicans SC5314                        hypothetical protein CaO19.3736 [Candida albicans SC5314].
      6319795           N6-MTase                       MT-A70                       YCL055W               335   eukaryota>fungi>ascomycota                            Saccharomyces cerevisiae S288c                 Kar4p [Saccharomyces cerevisiae S288c].
      50285123          N6-MTase                       MT-A70                       CAGL0B00462g          324   eukaryota>fungi>ascomycota                            Candida glabrata CBS 138                       hypothetical protein [Candida glabrata CBS 138].
      68485255          N6-MTase                       MT-A70                       CaO19.11221           369   eukaryota>fungi>ascomycota                            Candida albicans SC5314                        hypothetical protein CaO19.11221 [Candida albicans SC5314].
      50423415          N6-MTase                       MT-A70                       DEHA0E24156g          402   eukaryota>fungi>ascomycota                            Debaryomyces hansenii CBS767                   hypothetical protein DEHA0E24156g [Debaryomyces hansenii CBS767].
      50304541          N6-MTase                       MT-A70                       KLLA0C00693g          323   eukaryota>fungi>ascomycota                            Kluyveromyces lactis NRRL Y-1140               hypothetical protein [Kluyveromyces lactis NRRL Y-1140].
      45199255          N6-MTase                       MT-A70                       AGOS_AFR736C          322   eukaryota>fungi>ascomycota                            Eremothecium gossypii ATCC 10895               AFR736Cp [Eremothecium gossypii ATCC 10895].
      384484780         N6-MTase                       MT-A70                       RO3G_01664            250   eukaryota>fungi                                       Rhizopus delemar RA 99-880                     hypothetical protein RO3G_01664 [Rhizopus delemar RA 99-880].
      727145762         N6-MTase                       MT-A70                       RMATCC62417_07548     263   eukaryota>fungi                                       Rhizopus microsporus                           Putative mRNA (2'-O-methyladenosine-N6-)-methyltransferase [Rhizopus microsporus].
      Adig1000009633    N6-MTase                       MT-A70+MT-A70                Adig1000009633        214   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.06363                           
      514693100         N6-MTase                       MT-A70                       PTSG_04805            593   eukaryota>choanoflagellida                            Salpingoeca rosetta                            hypothetical protein PTSG_04805 [Salpingoeca rosetta].
      Ttra1000005250    N6-MTase                       MT-A70                       Ttra1000005250        304   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 hypothetical protein (304 aa)
      470407427         N6-MTase                       MT-A70                       ACA1_366350           289   eukaryota>amoebozoa>acanthamoebidae                   Acanthamoeba castellanii str. Neff             MTA70 family protein [Acanthamoeba castellanii str. Neff].
      403359546         N6-MTase                       MT-A70                       OXYTRI_23292          520   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            MT-A70 family protein (macronuclear) [Oxytricha trifallax].
      145544559         N6-MTase                       MT-A70                       GSPATT00003428001     317   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      586728217         N6-MTase                       MT-A70                       TTHERM_00704040       428   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  MT-a70 family protein (macronuclear) [Tetrahymena thermophila SB210].
      145474019         N6-MTase                       MT-A70                       GSPATT00000069001     251   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145525104         N6-MTase                       MT-A70                       GSPATT00015836001     364   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      146142755         N6-MTase                       MT-A70                       TTHERM_00704040       372   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  MT-A70 family protein (macronuclear) [Tetrahymena thermophila SB210].
      145499669         N6-MTase                       MT-A70                       GSPATT00037207001     319   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      403376498         N6-MTase                       MT-A70                       OXYTRI_18855          698   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            MT-A70 family protein (macronuclear) [Oxytricha trifallax].
      118378397         N6-MTase                       MT-A70                       TTHERM_00558100       392   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  MT-a70 family protein [Tetrahymena thermophila SB210].
      145492063         N6-MTase                       MT-A70                       GSPATT00034109001     319   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145476411         N6-MTase                       MT-A70                       GSPATT00027862001     364   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      221488029         N6-MTase                       MT-A70                       TGGT1_107030          525   eukaryota>alveolata>apicomplexa                       Toxoplasma gondii GT1                          N6-adenosine-methyltransferase subunit, putative [Toxoplasma gondii GT1].
      156086210         N6-MTase                       MT-A70                       BBOV_IV005850         448   eukaryota>alveolata>apicomplexa                       Babesia bovis T2Bo                             hypothetical protein [Babesia bovis T2Bo].
      672286124         N6-MTase                       MT-A70                       TGFOU_268840          525   eukaryota>alveolata>apicomplexa                       Toxoplasma gondii FOU                          putative N6-adenosine-methyltransferase [Toxoplasma gondii FOU].
      514485079         N6-MTase                       MT-A70                       CAOG_04822            317   eukaryota                                             Capsaspora owczarzaki ATCC 30864               MT-A70 family protein [Capsaspora owczarzaki ATCC 30864].
      320169965         N6-MTase                       MT-A70                       CAOG_04822            424   eukaryota                                             Capsaspora owczarzaki ATCC 30864               MT-A70 family protein [Capsaspora owczarzaki ATCC 30864].
      # 45; METTL4                                                                                                                                                                                                                                                
      224046124         alpha-helical+N6-MTase         MT-A70                       METTL4                481   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: methyltransferase-like protein 4 [Taeniopygia guttata].
      327269895         alpha-helical+N6-MTase         MT-A70                       mettl4                479   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: methyltransferase-like protein 4 [Anolis carolinensis].
      118086830         alpha-helical+N6-MTase         MT-A70                       METTL4                475   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: similar to Methyltransferase like 4 [Gallus gallus].
      326917444         alpha-helical+N6-MTase         MT-A70                       LOC100547828          474   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: methyltransferase-like protein 4-like [Meleagris gallopavo].
      145275206         alpha-helical+N6-MTase         MT-A70                       METTL4                472   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   methyltransferase-like protein 4 isoform 1 [Homo sapiens].
      55647269          alpha-helical+N6-MTase         MT-A70                       METTL4                472   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: methyltransferase-like protein 4 isoform X1 [Pan troglodytes].
      109487662         Nalpha-helical+N6-MTase        MT-A70                       RGD1306451_predicted  471   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to methyltransferase like 4 [Rattus norvegicus].
      74315949          alpha-helical+N6-MTase         MT-A70                       Mettl4                471   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   methyltransferase-like protein 4 [Mus musculus].
      189522093         alpha-helical+N6-MTase         MT-A70                       mettl4                450   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    PREDICTED: methyltransferase-like protein 4 isoform X1 [Danio rerio].
      47217445          alpha-helical+N6-MTase         MT-A70                       GSTEN:00031779:G:001  308   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      340370562         alpha-helical+N6-MTase         MT-A70+MT-A70                LOC100633541          403   eukaryota>metazoa                                     Amphimedon queenslandica                       PREDICTED: methyltransferase-like protein 4 [Amphimedon queenslandica].
      190582266         alpha-helical+N6-MTase         MT-A70                       TRIADDRAFT_58838      331   eukaryota>metazoa>placozoa                            Trichoplax adhaerens                           hypothetical protein TRIADDRAFT_58838 [Trichoplax adhaerens].
      162690420         N6-MTase                       MT-A70                       PHYPADRAFT_206270     428   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein [Physcomitrella patens].
      168011388         N6-MTase                       MT-A70                       PHYPADRAFT_206270     428   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein [Physcomitrella patens].
      Mver1000006359    N6-MTase                       Methyltransf_26+MT-A70       Mver1000006359        466   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (466 aa)
      Spun1000004502    N6-MTase                       MT-A70                       Spun1000004502        418   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (418 aa)
      18394726          N6-MTase                       MT-A70                       AT1G19340             414   eukaryota>viridiplantae                               Arabidopsis thaliana                           methyltransferase-like protein 2 [Arabidopsis thaliana].
      Lhya1000004692    N6-MTase                       MT-A70                       Lhya1000004692        398   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         estExt_Genemark1.C_470043              
      198422905         N6-MTase                       MT-A70                       LOC100186432          385   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: methyltransferase-like protein 4 [Ciona intestinalis].
      307172265         N6-MTase                       MT-A70                       EAG_10107             385   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Methyltransferase-like protein 4 [Camponotus floridanus].
      727142779         N6-MTase                       MT-A70                       RMATCC62417_10014     371   eukaryota>fungi                                       Rhizopus microsporus                           hypothetical protein RMATCC62417_10014 [Rhizopus microsporus].
      Bcir1000011661    N6-MTase                       Methyltransf_26+MT-A70       Bcir1000011661        371   eukaryota>fungi>mucoromycotina                        Backusella circina                             fgenesh1_pg.4_#_30                     
      161078350         N6-MTase                       MT-A70                       Dmel_CG14906          359   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG14906 [Drosophila melanogaster].     
      24647514          N6-MTase                       MT-A70                       Dmel_CG14906          359   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG14906 [Drosophila melanogaster].     
      322796786         N6-MTase                       MT-A70                       SINV_06005            356   eukaryota>metazoa>hexapoda                            Solenopsis invicta                             hypothetical protein SINV_06005, partial [Solenopsis invicta].
      221118051         N6-MTase                       MT-A70                       LOC100203523          355   eukaryota>metazoa>cnidaria                            Hydra magnipapillata                           PREDICTED: similar to Methyltransferase-like protein 4 [Hydra magnipapillata].
      46123695          N6-MTase                       MT-A70                       FG06225.1             333   eukaryota>fungi>ascomycota                            Fusarium graminearum PH-1                      hypothetical protein FG06225.1 [Fusarium graminearum PH-1].
      Ccor1000001300    N6-MTase                       Methyltransf_26+MT-A70       Ccor1000001300        308   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         gm1.1428_g                             
      384493544         N6-MTase                       MT-A70                       RO3G_08740            305   eukaryota>fungi                                       Rhizopus delemar RA 99-880                     hypothetical protein RO3G_08740 [Rhizopus delemar RA 99-880].
      189238810         N6-MTase                       MT-A70                       LOC655139             288   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: similar to Methyltransferase-like protein 4, partial [Tribolium castaneum].
      307197295         N6-MTase                       MT-A70                       EAI_06255             287   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          Methyltransferase-like protein 4 [Harpegnathos saltator].
      210094981         N6-MTase                       MT-A70                       BRAFLDRAFT_237642     244   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_237642, partial [Branchiostoma floridae].
      Hrob1000005904    N6-MTase                       MT-A70                       Hrob1000005904        243   eukaryota>metazoa>annelida                            Helobdella robusta                             69400                                  
      Caps1000003296    N6-MTase                       MT-A70                       Caps1000003296        236   eukaryota>metazoa>annelida                            Capitella spI                                  e_gw1.295.23.1                         
      156223537         N6-MTase                       MT-A70                       NEMVEDRAFT_v1g95490   230   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      156393637         N6-MTase                       MT-A70                       NEMVEDRAFT_v1g95490   230   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      158293422         N6-MTase                       MT-A70                       AgaP_AGAP008665       225   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP008665-PA, partial [Anopheles gambiae str. PEST].
      321477348         N6-MTase                       MT-A70                       DAPPUDRAFT_25900      216   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_25900, partial [Daphnia pulex].
      Lgig1000004573    N6-MTase                       MT-A70                       Lgig1000004573        215   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.35.28.1                          
      302799717         N6-MTase                       MT-A70                       SELMODRAFT_114834     207   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_114834, partial [Selaginella moellendorffii].
      Uram1000004450    N6-MTase                       MT-A70                       Uram1000004450        208   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.22_#_168_#_combest_scaffold_22_46365
      545366117         N6-MTase                       MT-A70                       COCSUDRAFT_36424      197   eukaryota>viridiplantae>chlorophyta                   Coccomyxa subellipsoidea C-169                 MT-A70 [Coccomyxa subellipsoidea C-169].
      470510758         N6-MTase                       MT-A70                       ACA1_149840           196   eukaryota>amoebozoa>acanthamoebidae                   Acanthamoeba castellanii str. Neff             MTA70 family [Acanthamoeba castellanii str. Neff].
      Mcir1000000489    N6-MTase                       MT-A70                       Mcir1000000489        195   eukaryota>fungi>basal                                 Mucor circinelloides                           Mucci1.e_gw1.1.979.1                   
      302759497         N6-MTase                       MT-A70                       SELMODRAFT_80323      190   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_80323, partial [Selaginella moellendorffii].
      Pbla1000000301    N6-MTase                       MT-A70                       Pbla1000000301        185   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       gw1.70.18.1                            
      307104593         N6-MTase                       MT-A70                       CHLNCDRAFT_36694      164   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_36694 [Chlorella variabilis].
      # 11;                                                                                                                                                                                                                                                 
      403348598         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ+ZZ+ZZ           OXYTRI_05013          822   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            MT-A70 family protein (macronuclear) [Oxytricha trifallax].
      145473723         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ+ZZ              GSPATT00027481001     712   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145486788         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ+ZZ              GSPATT00032234001     711   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145532196         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ+ZZ              GSPATT00018667001     707   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      544211235         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ+ZZ              CYME_CMH026C          626   eukaryota>rhodophyta                                  Cyanidioschyzon merolae strain 10D             similar to (N6-adenosine)-methyltransferase [Cyanidioschyzon merolae strain 10D].
      Cmer1000000996    N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ+ZZ              Cmer1000000996        627   eukaryota>rhodophyta                                  Cyanidioschyzon merolae                        CMH026C similar to (N6-adenosine)-methyltransferase
      545702233         N6-MTase+ZZ+ZZ+ZZ              MT-A70+ZZ+ZZ+ZZ              Gasu_54970            601   eukaryota>rhodophyta                                  Galdieria sulphuraria                          mRNA (2'-O-methyladenosine-N6-)-methyltransferase [Galdieria sulphuraria].
      284095325         N6-MTase+ZZ+ZZ+ZZ              MT-A70+ZZ+ZZ                 NAEGRDRAFT_30463      473   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi]. 
      290998263         N6-MTase+ZZ+ZZ+ZZ              MT-A70+ZZ+ZZ                 NAEGRDRAFT_30463      473   eukaryota>heterolobosea                               Naegleria gruberi strain NEG-M                 predicted protein [Naegleria gruberi strain NEG-M].
      145512105         N6-MTase                       MT-A70                       GSPATT00039029001     454   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein, partial (macronuclear) [Paramecium tetraurelia strain d4-2].
      145493475         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ                 GSPATT00034815001     424   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      146144702         N6-MTase+ZZ+ZZ                 MT-A70                       TTHERM_00136470       540   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  MT-A70 family protein (macronuclear) [Tetrahymena thermophila SB210].
      146175568         N6-MTase+ZZ+ZZ                 MT-A70                       TTHERM_00136470       540   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila                        MT-A70 family protein (macronuclear) [Tetrahymena thermophila].
      Ttra1000006356    N6-MTase+ZZ+ZZ                 MT-A70+ZZ+ZZ                 Ttra1000006356        1237  eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 hypothetical protein (1237 aa)
      470518935         N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ                 ACA1_074420           1067  eukaryota>amoebozoa>acanthamoebidae                   Acanthamoeba castellanii str. Neff             Putative N6adenosine-methyltransferase [Acanthamoeba castellanii str. Neff].
      89301120          N6-MTase+ZZ+ZZ+ZZ+ZZ           MT-A70+ZZ+ZZ                 TTHERM_00388490       2070  eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  MT-A70 family protein (macronuclear) [Tetrahymena thermophila SB210].
      # 5;                                                                                                                                                                                                                                                  
      238665683         N6-MTase+BIR                   MT-A70+BIR                   Smp_172190.1          779   eukaryota>metazoa                                     Schistosoma mansoni                            expressed protein [Schistosoma mansoni].
      238665684         N6-MTase                       MT-A70                       Smp_172190.2          629   eukaryota>metazoa                                     Schistosoma mansoni                            expressed protein [Schistosoma mansoni].
      674260189         N6-MTase                       MT-A70                       EmuJ_000931900        618   eukaryota>metazoa                                     Echinococcus multilocularis                    methyltransferase protein 14 [Echinococcus multilocularis].
      674562007         N6-MTase                       MT-A70                       EgrG_000931900        618   eukaryota>metazoa                                     Echinococcus granulosus                        methyltransferase protein 14 [Echinococcus granulosus].
      674588329         N6-MTase                       MT-A70                       HmN_000449900         612   eukaryota>metazoa                                     Hymenolepis microstoma                         methyltransferase protein 14 [Hymenolepis microstoma].
      # 4;                                                                                                                                                                                                                                                  
      693499469         N6-MTase                       MT-A70                       OT_ostta06g01320      439   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             MT-A70-like [Ostreococcus tauri].      
      255076305         N6-MTase                       -                            MICPUN_108123         323   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      116058220         N6-MTase                       MT-A70                       Ot06g01500            270   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             methyltransferase MT-A70, putative (ISS) [Ostreococcus tauri].
      303279795         N6-MTase                       -                            MICPUCDRAFT_58792     264   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      # 4;                                                                                                                                                                                                                                                  
      321445585         N6-MTase                       MT-A70                       DAPPUDRAFT_70851      101   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_70851, partial [Daphnia pulex].
      115803226         N6-MTase                       MT-A70                       LOC594744             96    eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to Methyltransferase like 3, partial [Strongylocentrotus purpuratus].
      Hrob1000012694    N6-MTase                       MT-A70                       Hrob1000012694        75    eukaryota>metazoa>annelida                            Helobdella robusta                             153237                                 
      321445586         N6-MTase                       MT-A70                       DAPPUDRAFT_70850      61    eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_70850, partial [Daphnia pulex].
      # 13; Eukaryotic subclade-6-related
      672827354         N6-MTase                       MT-A70                       MVEG_02535            470   eukaryota>fungi                                       Mortierella verticillata NRRL 6337             hypothetical protein MVEG_02535 [Mortierella verticillata NRRL 6337].
      Mver1000002542    N6-MTase                       MT-A70                       Mver1000002542        471   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (471 aa)
      Uram1000004716    N6-MTase                       MT-A70                       Uram1000004716        457   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.24_#_143_#_combest_scaffold_24_50604
      Pbla1000012755    N6-MTase                       MT-A70                       Pbla1000012755        454   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       estExt_fgeneshPB_pg.C_40189            
      Spun1000005457    N6-MTase                       MT-A70                       Spun1000005457        454   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (454 aa)
      Lhya1000007007    N6-MTase                       MT-A70                       Lhya1000007007        441   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         estExt_Genewise1Plus.C_880061          
      727148790         N6-MTase                       GAD+MT-A70                   RMATCC62417_05354     435   eukaryota>fungi                                       Rhizopus microsporus                           hypothetical protein RMATCC62417_05354 [Rhizopus microsporus].
      Mcir1000010337    N6-MTase                       MT-A70                       Mcir1000010337        436   eukaryota>fungi>basal                                 Mucor circinelloides                           fgenesh1_kg.09_#_17_#_987_1_CCIA_CCIB_EXTA
      Bcir1000006779    N6-MTase                       MT-A70                       Bcir1000006779        435   eukaryota>fungi>mucoromycotina                        Backusella circina                             estExt_Genewise1.C_720022              
      384502026         N6-MTase                       MT-A70                       RO3G_17115            426   eukaryota>fungi                                       Rhizopus delemar RA 99-880                     hypothetical protein RO3G_17115 [Rhizopus delemar RA 99-880].
      552937933         N6-MTase                       MT-A70                       GLOINDRAFT_123982     426   eukaryota>fungi>glomeromycota                         Rhizophagus irregularis DAOM 181602            hypothetical protein GLOINDRAFT_123982 [Rhizophagus irregularis DAOM 181602].
      Crev1000000518    N6-MTase                       MT-A70                       Crev1000000518        423   eukaryota>fungi>kickxellomycotina                     Coemansia reversa                              e_gw1.2.97.1                           
      470419934         N6-MTase                       MT-A70                       ACA1_219460           387   eukaryota>amoebozoa>acanthamoebidae                   Acanthamoeba castellanii str. Neff             MT-A70 protein [Acanthamoeba castellanii str. Neff].
      Uram1000001485    N6-MTase                       MT-A70                       Uram1000001485        829   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.5_#_767_#_combest_scaffold_5_103034
      Mver1000001135    N6-MTase                       MT-A70                       Mver1000001135        786   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (786 aa)
      Pbla1000005949    MYB+N6-MTase                   MT-A70                       Pbla1000005949        761   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       fgeneshPB_pg.6__310                    
      Lhya1000007505    N6-MTase                       -                            Lhya1000007505        198   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         e_gw1.100.12.1                         
      Pisp1000009414    N6-MTase                       MT-A70                       Pisp1000009414        197   eukaryota>fungi>neocallimastigomycota                 Piromyces sp                                   gm1.11659_g                            
      384494112         N6-MTase                       -                            RO3G_09313            164   eukaryota>fungi                                       Rhizopus delemar RA 99-880                     hypothetical protein RO3G_09313 [Rhizopus delemar RA 99-880].
      Bcir1000007777    N6-MTase                       -                            Bcir1000007777        155   eukaryota>fungi>mucoromycotina                        Backusella circina                             fgenesh1_pm.3_#_58                     
      # 2;                                                                                                                                                                                                                                          
      326428930         N6-MTase                       MT-A70+MT-A70                PTSG_05864            555   eukaryota>choanoflagellida                            Salpingoeca rosetta                            hypothetical protein PTSG_05864 [Salpingoeca rosetta].
      514690366         N6-MTase                       MT-A70+MT-A70                PTSG_05864            555   eukaryota>choanoflagellida                            Salpingoeca rosetta                            hypothetical protein PTSG_05864 [Salpingoeca rosetta].
      # 2;                                                                                                                                                                                                                                                  
      67901006          N6-MTase                       MT-A70                       AN7490.2              546   eukaryota>fungi>ascomycota                            Aspergillus nidulans FGSC A4                   hypothetical protein AN7490.2 [Aspergillus nidulans FGSC A4].
      70989679          N6-MTase                       MT-A70                       AFUA_2G05600          455   eukaryota>fungi>ascomycota                            Aspergillus fumigatus Af293                    MT-A70 family [Aspergillus fumigatus Af293].
      # 2;                                                                                                                                                                                                                                                  
      320165059         N6-MTase                       MT-A70                       CAOG_07090            511   eukaryota                                             Capsaspora owczarzaki ATCC 30864               hypothetical protein CAOG_07090 [Capsaspora owczarzaki ATCC 30864].
      470293128         N6-MTase                       MT-A70                       CAOG_07090            511   eukaryota                                             Capsaspora owczarzaki ATCC 30864               hypothetical protein CAOG_07090 [Capsaspora owczarzaki ATCC 30864].
      # 2;                                                                                                                                                                                                                                                  
      Chet1000009456    N6-MTase                       MT-A70                       Chet1000009456        494   eukaryota>fungi>ascomycota                            Cochliobolus heterostrophus                    estExt_fgenesh1_pg.C_390006            
      111064233         N6-MTase                       MT-A70                       SNOG_06702            472   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_06702 [Phaeosphaeria nodorum SN15].
      # 2;                                                                                                                                                                                                                                                  
      678336696         N6-MTase                       -                            STYLEM_4843           437   eukaryota>alveolata>ciliophora                        Stylonychia lemnae                             methyltransferase mt- [Stylonychia lemnae].
      403331225         N6-MTase                       MT-A70                       OXYTRI_15421          384   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            methyltransferase MT-A70, putative (ISS) (macronuclear) [Oxytricha trifallax].
      # 2;                                                                                                                                                                                                                                                  
      671410008         N6-MTase                       MT-A70                       BM_Bm2284d            398   eukaryota>metazoa>nematoda                            Brugia malayi                                  Protein Bm2284, isoform d [Brugia malayi].
      170590806         N6-MTase                       MT-A70                       Bm1_43505             338   eukaryota>metazoa>nematoda                            Brugia malayi                                  MT-A70 family protein [Brugia malayi]. 
      # 2;                                                                                                                                                                                                                                                  
      123486533         N6-MTase                       -                            TVAG_138980           366   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       hypothetical protein [Trichomonas vaginalis G3].
      123473707         N6-MTase                       -                            TVAG_312160           365   eukaryota>parabasalia                                 Trichomonas vaginalis G3                       hypothetical protein [Trichomonas vaginalis G3].
      # 2;                                                                                                                                                                                                                                                  
      Adig1000003645    N6-MTase                       MT-A70+MT-A70                Adig1000003645        320   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.21218                           
      Adig1000021966    N6-MTase                       MT-A70+MT-A70                Adig1000021966        320   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.16957                           
      # 2;                                                                                                                                                                                                                                                  
      485631354         N6-MTase                       MT-A70                       EMIHUDRAFT_205550     311   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_205550 [Emiliania huxleyi CCMP1516].
      551588467         N6-MTase                       MT-A70                       EMIHUDRAFT_205550     311   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_205550 [Emiliania huxleyi CCMP1516].
      # 2;                                                                                                                                                                                                                                                  
      145487402         N6-MTase                       -                            GSPATT00005554001     308   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145546436         N6-MTase                       -                            GSPATT00024232001     302   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      # 2;                                                                                                                                                                                                                                                  
      115915952         N6-MTase                       MT-A70                       LOC579669             237   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus].
      115974039         N6-MTase                       MT-A70                       LOC579669             196   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus].
      # 2;                                                                                                                                                                                                                                                  
      156201156         N6-MTase                       MT-A70                       NEMVEDRAFT_v1g224635  197   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      156328704         N6-MTase                       MT-A70                       NEMVEDRAFT_v1g224635  197   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         hypothetical protein NEMVEDRAFT_v1g224635, partial [Nematostella vectensis].
      # 2;                                                                                                                                                                                                                                                  
      674560581         N6-MTase                       MT-A70                       EgrG_000328700        69    eukaryota>metazoa                                     Echinococcus granulosus                        n6 adenosine methyltransferase ime4 [Echinococcus granulosus].
      674580335         N6-MTase                       MT-A70                       EmuJ_000328700        69    eukaryota>metazoa                                     Echinococcus multilocularis                    n6 adenosine methyltransferase ime4 [Echinococcus multilocularis].
      # 1;                                                                                                                                                                                                                                                  
      299472858         N6-MTase+SWIB                  SWIB                         Esi_0052_0135         513   eukaryota>stramenopiles                               Ectocarpus siliculosus                         EsV-1-129 [Ectocarpus siliculosus].    
      # 1;                                                                                                                                                                                                                                                  
      323453071         N6-MTase+ANK                   RCC1_2+Ank_2+Ank_2+MT-A70    AURANDRAFT_63473      2507  eukaryota>stramenopiles                               Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_63473 [Aureococcus anophagefferens].
      300257334         N6-MTase                       MT-A70                       VOLCADRAFT_98443      1121  eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_98443 [Volvox carteri f. nagariensis].
      145340055         N6-MTase                       MT-A70                       AT4G09980             963   eukaryota>viridiplantae                               Arabidopsis thaliana                           methyltransferase-like protein 1 [Arabidopsis thaliana].
      Pram1000002112    N6-MTase                       Glyco_hydro_1+MT-A70         Pram1000002112        930   eukaryota>stramenopiles                               Phytophthora ramorum                           82533                                  
      307215508         N6-MTase                       MT-A70                       EAI_03217             902   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          Methyltransferase-like protein KIAA1627-like protein [Harpegnathos saltator].
      124806530         N6-MTase                       MT-A70                       PFL1715w              646   eukaryota>alveolata>apicomplexa                       Plasmodium falciparum 3D7                      mRNA methyltransferase, putative [Plasmodium falciparum 3D7].
      Smar1000010256    N6-MTase+DUF572                DUF572+MT-A70                Smar1000010256        640   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                             SMAR004352-PA pep:novel scaffold:Smar1:JH431477:154003:157599:1 gene:SMAR004352 transcript:SMAR004352-RA
      71013063          N6-MTase                       MT-A70                       UM02405.1             630   eukaryota>fungi>basidiomycota                         Ustilago maydis 521                            hypothetical protein UM02405.1 [Ustilago maydis 521].
      Cmer1000000469    N6-MTase                       -                            Cmer1000000469        628   eukaryota>rhodophyta                                  Cyanidioschyzon merolae                        CMD131C hypothetical protein           
      403366155         N6-MTase                       -                            OXYTRI_19511          588   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            methyltransferase MT-A70, putative (ISS) (macronuclear) [Oxytricha trifallax].
      Ttra1000008696    N6-MTase                       MT-A70                       Ttra1000008696        556   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 hypothetical protein (556 aa)
      307108462         N6-MTase                       -                            CHLNCDRAFT_144074     531   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_144074 [Chlorella variabilis].
      156207497         N6-MTase                       MT-A70                       NEMVEDRAFT_v1g221917  497   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      89306372          N6-MTase                       -                            TTHERM_00301770       475   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  hypothetical protein TTHERM_00301770 (macronuclear) [Tetrahymena thermophila SB210].
      Ccor1000008813    N6-MTase                       MT-A70                       Ccor1000008813        469   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         estExt_fgenesh1_pg.C_3190005           
      545711291         N6-MTase                       MT-A70                       Gasu_13930            449   eukaryota>rhodophyta                                  Galdieria sulphuraria                          methyltransferase [Galdieria sulphuraria].
      470390089         N6-MTase                       MT-A70                       ACA1_156200           440   eukaryota>amoebozoa>acanthamoebidae                   Acanthamoeba castellanii str. Neff             MT-A70 protein [Acanthamoeba castellanii str. Neff].
      159480906         N6-MTase                       MT-A70                       CHLREDRAFT_168079     430   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      predicted protein, partial [Chlamydomonas reinhardtii].
      19113968          N6-MTase                       MT-A70                       SPAC22G7.07c          413   eukaryota>fungi>ascomycota                            Schizosaccharomyces pombe 972h-                mRNA (N6-adenosine)-methyltransferase (predicted) [Schizosaccharomyces pombe 972h-].
      570995458         N6-MTase                       MT-A70                       F442_02656            382   eukaryota>stramenopiles                               Phytophthora parasitica P10297                 hypothetical protein F442_02656 [Phytophthora parasitica P10297].
      17531953          N6-MTase                       MT-A70                       CELE_C18A3.1          365   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                         C18A3.1 [Caenorhabditis elegans].      
      118354144         N6-MTase                       -                            TTHERM_01005150       342   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila                        hypothetical protein TTHERM_01005150 (macronuclear) [Tetrahymena thermophila].
      313234377         N6-MTase                       MT-A70                       GSOID_T00012466001    333   eukaryota>metazoa>chordata                            Oikopleura dioica                              unnamed protein product [Oikopleura dioica].
      85108254          N6-MTase                       MT-A70                       NCU08328              322   eukaryota>fungi>ascomycota                            Neurospora crassa OR74A                        hypothetical protein NCU08328 [Neurospora crassa OR74A].
      641530415         N6-MTase                       MT-A70                       SPRG_10355            286   eukaryota>stramenopiles                               Saprolegnia parasitica CBS 223.65              hypothetical protein SPRG_10355 [Saprolegnia parasitica CBS 223.65].
      Aque1000011545    N6-MTase                       MT-A70                       Aque1000011545        282   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.211813                            
      307107027         N6-MTase                       -                            CHLNCDRAFT_134188     278   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_134188 [Chlorella variabilis].
      145512411         N6-MTase                       -                            GSPATT00010893001     275   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      569429466         N6-MTase                       MT-A70                       RFI_04550             237   eukaryota>rhizaria                                    Reticulomyxa filosa                            hypothetical protein RFI_04550, partial [Reticulomyxa filosa].
      Psoj1000016503    N6-MTase                       MT-A70                       Psoj1000016503        230   eukaryota>stramenopiles                               Phytophthora sojae                             144700                                 
      Bden1000006054    N6-MTase                       MT-A70                       Bden1000006054        183   eukaryota>fungi>chytridiomycota                       Batrachochytrium dendrobatidis                  Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (183 aa)
      Bnat1000020351    N6-MTase                       MT-A70                       Bnat1000020351        144   eukaryota>rhizaria                                    Bigelowiella natans                            fgenesh1_pg.85_#_83                    
      Hrob1000005324    N6-MTase                       MT-A70+MT-A70                Hrob1000005324        142   eukaryota>metazoa>annelida                            Helobdella robusta                             164465                                 
      159464034         N6-MTase                       MT-A70                       CHLREDRAFT_146896     122   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      predicted protein, partial [Chlamydomonas reinhardtii].
      115759069         N6-MTase                       MT-A70                       LOC757579             49    eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to n6-adenosine-methyltransferase ime4, partial [Strongylocentrotus purpuratus].
      
      Proakaryotic homologs
      GI           Operons                                                                                              Arch        Pfam-Arch                         Gene name              len  phylogeny                                              Species                                          Genbank descriptions
      #; BglII-/REase-associated
      503247195    N6-MTase*-><-MunI                                                                                    N6-MTase    MT-A70                            CENSYa_0595            216   archaea                                               Cenarchaeum symbiosum                            transcriptional regulator [Cenarchaeum symbiosum].                                                                                      <-118575782_?<-118575783_?<-118575784_?||118575785_?->118575786_?->118575787_?-><-118575788_?||503247195_N6-MTase*-><-118575790_MunI||118575791_?->118575792_?->118575793_?-><-118575794_?||118575795_?-><-118575796_?
      553802969    BglII-><-N6-MTase*                                                                                   N6-MTase    MT-A70                            HMPREF0742_RS10030     283   bacteria>actinobacteria                               Rothia aeria                                     MT-A70 protein [Rothia aeria].                                                                                                          <-739427288_?<-553802963_?<-553802964_?<-553802965_?||553802966_?->553802967_?->553802968_BglII-><-553802969_N6-MTase*<-553802970_?<-553802972_?||553802973_?->553802974_?-><-739427273_?<-553802977_?||739427276_?->
      503250901    BglII->N6-MTase*->                                                                                   N6-MTase    MT-A70                            ETHHA_RS08455          222   bacteria>firmicutes                                   Ethanoligenens harbinense                        S-adenosylmethionine-binding protein [Ethanoligenens harbinense].                                                                       503250895_?->503250896_?-><-503250897_?<-503249903_?<-754032519_?<-503249901_?||503250900_BglII->503250901_N6-MTase*-><-503250902_?<-503250904_?<-503250905_?<-503250906_?||503250907_?-><-754031295_?||754032521_?->
      653315751    BglII-><-N6-MTase*||?->?-><-?||?->?->?->ParB->                                                       N6-MTase    MT-A70                            T424_RS0114345         220   bacteria>proteobacteria>alphaproteobacteria           Rhizobium undicola                               S-adenosylmethionine-binding protein [Rhizobium undicola].                                                                              739204316_?->739204347_?->653315740_?->653315743_?->653315745_?->739204348_?->653315749_BglII-><-653315751_N6-MTase*||653315753_?->739204349_?-><-653315755_?||653315757_?->653315759_?->653315761_?->653315763_ParB->
      586601520    <-N6-MTase*<-BglII<-?<-?||?->HNH->                                                                   N6-MTase    MT-A70                            GbCGDNIH3_7033         215   bacteria>proteobacteria>alphaproteobacteria           Granulibacter bethesdensis CGDNIH3               Adenine-specific methyltransferase [Granulibacter bethesdensis CGDNIH3].                                                                586601513_?->586601514_?->586601515_?->586601516_?->586601517_?->586601518_?-><-586601519_?<-586601520_N6-MTase*<-586601521_BglII<-586601522_?<-586601523_?||586601524_?->586601525_HNH->586601526_?-><-586601527_?
      661268459    N6-MTase*-><-McrB-NTD+REase                                                                          N6-MTase    MT-A70                            K291_RS0125225         189   bacteria>proteobacteria>alphaproteobacteria           Ensifer sp. USDA 6670                            MT-A70 family protein [Ensifer sp. USDA 6670].                                                                                          696510181_?-><-661268448_?<-661268450_?||661268452_?->661268455_?->696510167_?->661268457_?->661268459_N6-MTase*-><-661268460_McrB-NTD+REase||661268462_?->661268464_?-><-661268618_?||661268466_?->696510168_?->661268470_?->
      515934135    BglII->N6-MTase*->                                                                                   N6-MTase    MT-A70                            H156_RS0101780         216   bacteria>proteobacteria>gammaproteobacteria           Methylococcus capsulatus                         S-adenosylmethionine-binding protein [Methylococcus capsulatus].                                                                        651602654_?->651602655_?->515934136_BglII->515934135_N6-MTase*->515934134_?->499262072_?->515934133_?->499262074_?->499262075_?->515934131_?->499262077_?->
      653894579    ASCH->?->?->?->?->BglII->N6-MTase*->                                                                 N6-MTase    MT-A70                            F820_RS0109105         230   bacteria>proteobacteria>gammaproteobacteria           Xylella fastidiosa                               S-adenosylmethionine-binding protein [Xylella fastidiosa].                                                                              653894443_?->490185352_ASCH->490185644_?->544952717_?->490185356_?->490185357_?->653894557_BglII->653894579_N6-MTase*-><-740452319_?<-653894719_?<-653894722_?
      
      # 27; DCM/DAM associated                                                                                                                                                                                                                                                                                
      657200356    REase-><-?<-?<-N6-MTase*<-?<-DCM+DCM                                                                 N6-MTase    MT-A70                            ON05_RS35435           186   bacteria>cyanobacteria                                Acaryochloris sp. CCMEE 5410                     hypothetical protein [Acaryochloris sp. CCMEE 5410].                                                                                    <-498167725_?||498167728_REase-><-748211640_?<-748211643_?<-657200356_N6-MTase*<-498167740_?<-498167742_DCM+DCM<-657200357_?<-498167747_?<-498167748_?
      667917580    <-N6-MTase*<-DCM                                                                                     N6-MTase    MT-A70                            EL18_01388             190   bacteria>proteobacteria>alphaproteobacteria           Nitratireductor basaltis                         Adenine-specific methyltransferase [Nitratireductor basaltis].                                                                          <-667917573_?<-667917574_?<-667917575_?<-667917576_?<-667917577_?<-667917578_?<-667917579_?<-667917580_N6-MTase*<-667917581_DCM<-667917582_?<-667917583_?||667917584_?-><-667917585_?<-667917586_?<-667917587_?
      515104987    <-N6-MTase*<-Methylase<-?<-DCM                                                                       N6-MTase    MT-A70                            RPHASCH2410_RS0109895  193   bacteria>proteobacteria>alphaproteobacteria           Rhizobium phaseoli                               DNA methyltransferase [Rhizobium phaseoli].                                                                                             <-515104976_?<-748177413_?<-515104978_?<-515104979_?<-515104981_?<-515104982_?<-515104984_?<-515104987_N6-MTase*<-515104989_Methylase<-515104990_?<-515104991_DCM<-657763447_?||515104993_?-><-657763450_?<-515104997_?
      501064335    N6-MTase*->Methylase->                                                                               N6-MTase    MT-A70                            XAUT_RS18300           194   bacteria>proteobacteria>alphaproteobacteria           Xanthobacter autotrophicus                       MT-A70 family protein [Xanthobacter autotrophicus].                                                                                     <-753820203_?||501064331_?->501064332_?-><-753820205_?<-753820207_?||501064334_?->753820214_?->501064335_N6-MTase*->501064336_Methylase->753820216_?->753820218_?->501064339_?-><-753820221_?||753820227_?->501064341_?->
      674766351    DCM+DCM->N6-MTase*-><-?||?->?->?->?->Terminase_LS->Terminase_LS->                                    N6-MTase    MT-A70                            JP75_07920             190   bacteria>proteobacteria>alphaproteobacteria           Devosia riboflavina                              DNA methyltransferase [Devosia riboflavina].                                                                                            <-674766344_?||674766345_?-><-674766346_?||674766347_?->674766348_?->674766349_?->674766350_DCM+DCM->674766351_N6-MTase*-><-674766352_?||674766353_?->674766354_?->674766355_?->674766356_?->674766357_Terminase_LS->674766358_Terminase_LS->
      # 27;                                                                                                                                                                                                                                                                                 
      511283520    -                                                                                                    N6-MTase    MT-A70                                                   279   bacteria>actinobacteria                               Mycobacterium abscessus                          adenine-specific DNA methyltransferase [Mycobacterium abscessus].                                                                             
      496677811    -                                                                                                    N6-MTase    MT-A70                                                   208   bacteria>firmicutes                                   Lachnospiraceae bacterium 3_1_46FAA              DNA methyltransferase [Lachnospiraceae bacterium 3_1_46FAA].                                                                                  
      636828153    RuvC->?->?->N6-MTase*->                                                                              N6-MTase    MT-A70                            H556_RS0109535         199   bacteria>proteobacteria>alphaproteobacteria           Brevundimonas naejangsanensis                    hypothetical protein [Brevundimonas naejangsanensis].                                                                                   <-737320074_?||636828147_?->636828148_?->636828149_?->636828150_RuvC->658447872_?->636828152_?->636828153_N6-MTase*->737320076_?->
      544667667    -                                                                                                    N6-MTase    MT-A70                                                   212   bacteria>proteobacteria>alphaproteobacteria           Thalassobacter arenae                            type II DNA modification methyltransferase [Thalassobacter arenae].                                                                           
      496698392    -                                                                                                    N6-MTase    MT-A70                                                   212   bacteria>proteobacteria>alphaproteobacteria           Afipia sp. 1NLS2                                 S-adenosylmethionine-binding protein [Afipia sp. 1NLS2].                                                                                      
      495609045    -                                                                                                    N6-MTase    MT-A70                                                   217   bacteria>proteobacteria>alphaproteobacteria           Maritimibacter alkaliphilus                      DNA methyltransferase [Maritimibacter alkaliphilus].                                                                                          
      516541036    ParB->?->N6-MTase*->?->VRR-NUC->VRR-NUC->                                                            N6-MTase    MT-A70                            LOKHON_RS09140         173   bacteria>proteobacteria>alphaproteobacteria           Loktanella hongkongensis                         hypothetical protein [Loktanella hongkongensis].                                                                                        <-516541043_?<-702932437_?||702932435_?->516541040_?->516541039_?->702932433_ParB->516541037_?->516541036_N6-MTase*->648455747_?->702932431_VRR-NUC->702932430_VRR-NUC->702932429_?->516541031_?->516541030_?->516541029_?->
      575405212    <-N6-MTase*                                                                                          N6-MTase    MT-A70                            ETSY1_42765            198   bacteria>proteobacteria>deltaproteobacteria           Candidatus Entotheonella sp. TSY1                S-adenosylmethionine-binding protein [Candidatus Entotheonella sp. TSY1].                                                               575405210_?-><-575405211_?<-575405212_N6-MTase*
      640854256    <-N6-MTase*<-?<-?<-?<-NUMOD4                                                                         N6-MTase    MT-A70                            KPNIH27_19120          214   bacteria>proteobacteria>gammaproteobacteria           Klebsiella pneumoniae subsp. pneumoniae KPNIH27  DNA methyltransferase [Klebsiella pneumoniae subsp. pneumoniae KPNIH27].                                                                <-640854249_?<-640854250_?||640854251_?-><-640854252_?<-640854253_?<-640854254_?<-640854255_?<-640854256_N6-MTase*<-640854257_?<-640854258_?<-640854259_?<-640854260_NUMOD4<-640854261_?<-640854262_?||640854263_?->
      554729604    RecT->?->?->N6-MTase*->?-><-Phage_integrase<-?<-?||Exonuc_VII->                                      N6-MTase    MT-A70                            G966_02949             220   bacteria>proteobacteria>gammaproteobacteria           Escherichia coli UMEA 3323-1                     hypothetical protein G966_02949 [Escherichia coli UMEA 3323-1].                                                                         554729597_?->554729598_?->554729599_?->554729600_?->554729601_RecT->554729602_?->554729603_?->554729604_N6-MTase*->554729605_?-><-554729606_Phage_integrase<-554729607_?<-554729608_?||554729609_Exonuc_VII-><-554729610_?<-554729611_?
      666005014    N6-MTase*->?-><-?||Phage_integrase->                                                                 N6-MTase    MT-A70                            SS17_3321              208   bacteria>proteobacteria>gammaproteobacteria           Escherichia coli O157:H7 str. SS17               Adenine DNA methyltransferase, phage-associated [Escherichia coli O157:H7 str. SS17].                                                   666005007_?->666005008_?->666005009_?->666005010_?->666005011_?->666005012_?->666005013_?->666005014_N6-MTase*->666005015_?-><-666005016_?||666005017_Phage_integrase->666005018_?-><-666005019_?<-666005020_?<-666005021_?
      695800969    RecT->?->?->?->N6-MTase*->?-><-Phage_integrase<-?<-?||Exonuc_VII->                                   N6-MTase    MT-A70                            AF48_RS10595           197   bacteria>proteobacteria>gammaproteobacteria           Enterobacter aerogenes                           adenine methylase [Enterobacter aerogenes].                                                                                             695800963_?->695801432_?->695800964_?->695800965_RecT->695800966_?->695801433_?->695800967_?->695800969_N6-MTase*->695800970_?-><-695800972_Phage_integrase<-505805395_?<-505183232_?||518922309_Exonuc_VII->695800973_?->505805397_?->
      556471807    N6-MTase*->                                                                                          N6-MTase    MT-A70                            AGZ61752.1             214   viruses                                               Phormidium phage MIS-PhV1A                       DNA Methyltransferase [Phormidium phage MIS-PhV1A].                                                                                     556471801_?->556471802_?->556471803_?->556471804_?->556471805_?->556471806_?->556471807_N6-MTase*->556471808_?->556471809_?->556471810_?->556471811_?->556471812_?->556471813_?->556471814_?->
      337731296    N6-MTase*->                                                                                          N6-MTase    MT-A70                            gp5                    211   viruses>dsdna viruses, no rna stage>caudovirales      EBPR siphovirus 4                                hypothetical protein [EBPR siphovirus 4].                                                                                               337731292_?->337731293_?->337731294_?->337731295_?->337731296_N6-MTase*->337731297_?->337731298_?->337731299_?->337731300_?->337731301_?->337731302_?->337731303_?->
      # 1;                                                                                                                                                                                                                                                                                  
      497433097    -                                                                                                    N6-MTase    SP+MT-A70                                                290   bacteria>actinobacteria                               Actinomyces sp. oral taxon 175                   SAM-binding domain protein [Actinomyces sp. oral taxon 175].                                                                                  
      
      
      Back to Contents
    • Multiple sequence alignment of the Group I, clade 2 Adenine methylases(Dictyostelium DICPUDRAFT_50950-like)

      Boundaries and core MTase elements                                                      Str-3                       Str-4                                          Str-5                                           Str-6                  Str-7                                                 Str-1                   Str-2                                                                                                                
      FINAL                                                          -------------------------EEEHHHHHHHH--H--------------EEEE----------------HHHHH--HHHHHHHHHHHHHHH-----EEEEEEEEEEEE-------------HHHHHHHHHHHHHHHHH---EEEEEEEEEEE----EE--H---HHHHHHHHHHH-------------------------------HHHHHHHHHH------EEEEEE-----HHHHHHHHH---EEEEE--HHHHHHHHH--
      ALIGN                                                          -------------------------EEEHHHHHHHH--H--------------EEEEE-------------HHHHHHH--HHHHHHHHHHHHHHH-----EEEEEEE-------------------HEHHHHHHHHHHHHHHHH-HH-HEEEEE------EEE--------EEEEEE-------------------------------HHHHHHHHHHHHH-----EEEEE------HHHHHHHH----EEEHHHHHHHHHHHH---
      HMM                                                            --------------------E----EEEHHHHHHHH--HH---------EEEEEEEEE---EEEEEEE------HHHH--HHHHHHHHHHHHHHHH----EEEEEEEEEEEE-------------HHHHHHHHHHHHHHHHH---HEEEEEEEEE-----EEEE------EEEEEEE----------------E------EEE----HHHHHHHHHHHHHH-----EEEEEE------HHHHHHHHH----EEE--HHHHHH-----
      FREQ                                                           ------------------------------HHHHHH--H--------EE-----EE----------------EEE-----HHH---HHHHHHHHH-----HHHHHHHHHHHHH------------EEEE--HHHHHHHHHHH---EEE-EEEEEEE-----HHHH---HHHHHHHHHHH--------------EE------------------H-EEEEE------EEEE--------HHHHHHHH-H--------HHHHHHHHH--
      PSSM                                                           -------------------------EEEEHHHHHHH--H--------------EEEE----HHHHHH---------HH--HHHHHHHHHHHHHHH-----EEEEEEE-------------------HHHHHHHHHHHHHHHH----E----EEEEE------------HHHHHHHHHH--------------------------------HHHHHHHHHH-------EEEE------HHHHHHHHH---EE-EE--HHHHHH-----
      GUITHDRAFT_103022_Guillardia_theta_CCMP2712_551670651          STTE--KGWDVAEG---SSKR----TVVCMDALEWM--MQSENEGLKGGMFVGSVLTSLPDISELQFPQVSEGEKLER--YKGWFVDTAAMILNRIPAGQFAIFYQSDVRVCTKE--------GQVEDWIDKAALCYEASKRTSCKQLWHKYALTCSPGTRSVGR---PTLSHIVCFSNG-ATYKRDRFPAPDVFYR-GEMIWPRAIGLDACVLCLAFLRNL-GNVSTVIDPFCGRGTTLAVANALGMDAVGVELSPKRCRIATSLS
      Smin1000020133_Symbiodinium_minutum_Mf_105b01_Smin1000020133   RTGG--RKMMPKEA--PNGRR----EVICEDALEWI-EKQGHFPS---G---SMVFTSLPDMSE--VVEFA-PR-FED--WEDFFMKAVRHILTALPYGSVAAFYQTDVRLP-TE--------GQ----VSKAFLVLKAAEAV--------------PEARGFGG---G------CV---------------------------KVMGVSATATVLKWATRRLAGLHTVIDPFCGAGTVLAMANAFGLDAIGVDLSPKRIKQAQRLD
      DICPUDRAFT_50950_Dictyostelium_purpureum_330842901             LQLN--KEKGLIDK--FGVYR----DVYCMDAVQWL-NNNAIDPN-------TSVITSLPDITE--VSGFT----LEQ--YKQWFTNTVQLIASKLSDNNVGIVYQTDIK---YHWKHDRSLIEE---YIDKGYLAMKGIEAAGCKVVWHKIMAASDLTKMIITK-NKSSFTHMICFAKQPTNIKYQD-NTPDINTR-GDMVWSRAMGLNACEISTSYVRG--IGSHTVLDPFCGKGSVLAVANVYGLNGIGIDLSTSKFRNSFNLQ
      DDB_G0285285_Dictyostelium_discoideum_AX4_66809181             LELN--KKNGILDR--VGVYR----DIYCMDAVQWL-KENEISPQ-------SSVITSLPDISE--VSSMN----LEQ--YKQWFTDTVSLITSKLDEKNVAIVYQTDIK---KKWKMDRGIIEE---YVDKGYLAQKGAEISGCKVIWHKIMTAHS----NISN-NKATFTHMICFSKNPTNIKYQE-NTPDIGGR-GSMVWSKAMGLNACVIAILYCRS--IGSTTIIDPFCGKGSVLAVANVYGLDSVGVDLSSGKTKNSFNLQ
      DFA_07250_Dictyostelium_fasciculatum_470249763                 IKEN--KIHVPKDR--LKVTR----QVYCIDAIEWL-KNNELSPN-------TSVITSLPDIVE--MSGYT----LPQ--YKEWFVNAVRLITSKLSDNNVAIFYQTDIK---RKWKKDKSVTEE---YVDKGYMVQKGAELSECKVLFHKLMLSHPVETEVVTR-NKASFTHMICIAKNPSSIMHQD-NTPDVAPR-GAMVWPKAMGLNACLVAVRFIRG--VGSTSILDCFCGKGSVLGTANLLGLSSIGVDLSVAKCRNSHSLV
      SAMD00019534_083600_Acytostelium_subglobosum_LB1_735852337     IKEN--RTNASRIQ--KRVMR----DITCQDAVQWL-KDNTVASG-------TSVITSLPDIVE--MTGYS----LQQ--YRDWFVDTVELITSKLSDNNVAIFYQTDIR---RKVKGNKGIVDE---YLDKGYLCAKGAERSGCKMVWHKLMYSVAPELGKVSRGQTPGFSHMLCFAKKPGLLTYQE-QTPDIAPR-GGMVWKKAMGLNACMVALRYIRG--VGCNTVLDTFCGKGSVLAAANMLGLHAIGVDLSISKTRHSSNLV
      MXDZ_RS0208475_Myxococcus_xanthus_499869570                    MVDE--QAGATGAA-----KR----TVYCEDALVWL-EARPVLEG-------SSAIASLPDWSE--FPSLS----LAE--WKAWFIRAAALILARVPPEGVAIFYQTDVK---DE--------GT---WVDKGYLVARAAEEVGVDLLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDMAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARNLR
      LILAB_RS07805_Myxococcus_fulvus_760026550                      MVDE--REGAAGAA-----KR----TVYCEDALVWL-EARPALAG-------SSAIASLPDGSE--FPSLS----LAD--WKAWFIRAAALILARVPPEGVAIFYQTDVK---EE--------GT---WVDKGYLVSRAAEEVGVELLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDLAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDSVGVELSRKRARKARNLR
      MXAN_RS00775_Myxococcus_xanthus_763416965                      MVDE--QAGATGAA-----KR----TVYCEDALVWL-EARPVLEG-------SSAIASLPDWSE--FPSLS----LAE--WKAWFIRAAALILARVPPEGVAIFYQTDVK---DE--------GT---WVDKGYLVARAAEEVGVDLLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDMAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARNLR
      LILAB_07935_Myxococcus_fulvus_HW-1_337257340                   MVDE--REGAAGAA-----KR----TVYCEDALVWL-EARPALAG-------SSAIASLPDGSE--FPSLS----LAD--WKAWFIRAAALILARVPPEGVAIFYQTDVK---EE--------GT---WVDKGYLVSRAAEEVGVELLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDLAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDSVGVELSRKRARKARNLR
      A176_RS20440_Myxococcus_sp_(contaminant_ex_DSM_436)_488713785  MVDE--RDGDTGAA-----RREPQRTVDCEDALAWL-EARPVLEG-------SSAIASLPDWSE--FPTLS----LAD--WKAWFIRAAALILARVPPEGVAIFYQTDVK---EE--------GT---WVDKGYLVSRAAEEVGVDLLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDMGK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARNLR
      DB31_RS18015_Hyalangium_minutum_763330777                      MTES--GAGGASER--PEGQR----TVECADARVWL-EGRQVLEG-------CSAITSLPDVSE--FPELS----LAE--WKQWFIRAAVLVMSKVPAQGVAIFYQTDVK---KD--------GA---WVDKGYLISKAAEEAGCELLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-IRVDLGK-ATPDVLPDAGEVTWTRGMGLHACLAACRFILEH-TATRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRAKKARVLR
      PPL_08445_Polysphondylium_pallidum_PN500_281204782             LKDN--K-NKDNVA--KSVYR----NLFCMDALEYI-KNNELEKT-------TSIITSLPDIVE--MSGYT----LDR--YKTWFVNAITLICSKLTDNNVAIFYQTDVK---RKVKGNKGVVDE---YLDKGYMCSKGAEIAGCKMVWHKMMTSSPPELGKVARGTKSSFSHMICIARTPSNLIYQE-QTPDIAPR-GAMTWPKAMGLNACMVAAKYIRG--IGSTTILDPFCGKGSVLAIANLIGLNSIGVDLAISKVRHSCNLL
      MFUL124B02_RS00860_Myxococcus_fulvus_819023527                 MLHT--TAG-----------R----TVHCEDALTWL-AAQPILTG-------CSAVASLPDASE--FPTLS----LAE--WKAWFIRAAALVMSRVPDDGVAIFYQTDVK---DE--------GL---WVDKGYLVSRAAEDSGMGLLWHKVVCRRAPGTVTFGR---PAYSHMLCFSRG-IRVDLGK-STADVLPDAGEVTWTRGMGVEACQLACRFILEH-TPTRTVVDPFCGHGTALAVANAMGLEAIGVELSRKRARKARNLR
      MYSTI_RS00680_Myxococcus_stipitatus_505158657                  MADE--RDGTDGRA--LDARR----TVHCEDALTWL-AAQPVLTG-------CSAVASLPDASE--FPTLS----LAE--WKAWFTRAAALVMSRVPDDGVAIFYQTDVK---DE--------GL---WVDKGYLVSRAAEEAGLGMLWHKVVCRRAPGTVTFGR---PAYSHMLCFSKG-VRPDLAK-STADVLPEAGEVTWTRGMGVEACQLACRFILEQ-TSTRTVVDPFCGHGTALAVANAMGLQAVGVELSRKRARKARNLF
      Q664_RS19790_Cystobacter_violaceus_759680543                   ---------MEQAP--SQGKR----TVHCADALAWL-EAQGVLAG-------CSLITSMPDVSE--FPSLS----LAQ--WKEWFVRTASLVLSRCPDDGVTIFYQTDIK---KD--------GT---WVDKGYLVQKAAEQLGHSLLWHKVVCRTPPGSITFGR---PAYSHMLCFSRG-LRAALSK-STADVLPQAGEVTWTRGMGVQACLVACRYVLEN-TPTRTIVDPFCGHGSVLAVANWLGLEAVGVELSRKRAKKARALQ
      D187_RS06910_Cystobacter_fuscus_488707209                      MTSD--ERRAEREA--PQGER----TVHCADALAWL-EAQGVLEG-------CSLITSMPDVSE--FPTLT----LAE--WKDWFVRTAALVLSRCPDEGVTIFYQTDIK---KD--------GT---WVDKGYLVQKAAEQQGHALLWHKVVCRAPAGQTTFGR---PAYSHLLCFSRD-VRADLSR-STPDVLPQAGEVTWTRGMGVEACLAACRYVLEN-TSTRRIVDPFCGHGTVLAVANDLGLDAVGVELSRKRAKKARALR
      STAUR_RS01010_Stigmatella_aurantiaca_488687410                 MAEH--GEAENTPP--PPGRR----TVECAEAVAWL-SGRGVLEG-------CSVITSLPDLSE--FPALS----LAE--WKQWFIRAAALVMAKVPPEGVALFYQTDVK---HE--------GT---WVDKGYLVSRAAEEAGQETLFHKVVCRRPPGTVTFGR---PAYSHLLGFSRG-VRLALSK-ATADVLPEAGEVTWTRGMGVRACLAACRFIQEH-TPTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARALR
      COCOR_RS39470_Corallococcus_coralloides_504213579              MTDG--RSGTAAA------KR----TVYCEDALAWL-DARPVLEG-------CSMVASLPDVSE--FPQLT----VPQ--WKDWFVGAAAKVLSRVPEDGVAVFYQSDVK---KD--------GA---WVDKGYLVSKAAEAAGCDTLWHKVVCRRTPGTVTFGR---PAYSHLLCFSRG-LKADAAK-STADVLPDPGEVTWTRGMGLNACLVACRFILEQ-TRTRTVVDPFCGHGTALAVANALGLDAVGVELSRKRARRARNLQ
      Anae109_2889_Anaeromyxobacter_sp_Fw109-5_152029321             ------MPGSLLSRPISAPRR----AVHVGDGVAWL-EAGALPAD-------HALVTSLPDASE--LPALG----ADG--WRRWFLAAAERACRAVADEAVAIFYQTDVK---RD--------GA---WVDKAFLVQLAAERAGSALLWHKIVCRVAPGTTTVGR---PAYAHLLCVSRA-LRLAPGQ-SSPDVLPVAGAMTWPRAMPLEACAAAARFLVAH-TRCRTVVDPFCGLGSMLAVANAHGLDAVGVELSRQRAERARALA
      PSR1_03713_Anaeromyxobacter_sp_PSR-1_775300299                 MADGRPQAGEVGAR---APRR----DVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AAG--WEAWFVDVAALACAAVDPGAPAVFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLLFHKIVCRVPPGTATFGR---PAYAHLLCCARA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAD-TACRTVVDPFCGLGTMLAVANAHGLDAIGVELSRRRADRARRLH
      ANAE109_RS14815_Anaeromyxobacter_sp_Fw109-5_752809907          ---------------MSAPRR----AVHVGDGVAWL-EAGALPAD-------HALVTSLPDASE--LPALG----ADG--WRRWFLAAAERACRAVADEAVAIFYQTDVK---RD--------GA---WVDKAFLVQLAAERAGSALLWHKIVCRVAPGTTTVGR---PAYAHLLCVSRA-LRLAPGQ-SSPDVLPVAGAMTWPRAMPLEACAAAARFLVAH-TRCRTVVDPFCGLGSMLAVANAHGLDAVGVELSRQRAERARALA
      Adeh_4079_Anaeromyxobacter_dehalogenans_2CP-C_85777006         MADGRPQAGEVGAR---APRR----DVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AAG--WEAWFVDVAALACAAVDAGAPAIFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLRFHKIVCRVPPGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDAVGVELSRRRADRARRLQ
      A2CP1_RS21285_Anaeromyxobacter_dehalogenans_506415536          MSDGR-QAGGVGAR---APRR----EVRCGDGVSFLREAAPLPPD-------HALVTSLPDASE--LPALG----AEG--WEAWFVDVAALACAAVAPGAPAIFYQTDVK---RD--------GA---WVDKAQLVARGAARAGARLLFHKIVCRVPPGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDALGVELSRRRADRARRLH
      ANAEK_RS21100_Anaeromyxobacter_sp_K_501520222                  MSDGR-QAGGVGAR---APRR----EVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AEG--WEAWFVDVAALACAAVAPGAPAIFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLLFHKIVCRVPAGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACQAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDGVGVELSRRRADRARRLQ
      ADEH_RS21070_Anaeromyxobacter_dehalogenans_752814769           ----------MGAR---APRR----DVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AAG--WEAWFVDVAALACAAVDAGAPAIFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLRFHKIVCRVPPGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDAVGVELSRRRADRARRLQ
      M201_gp12_Halovirus_HCTV-2_509139482                           -------------------MS--C-DIYQADGIEWC-RENPNA---------GAVVTSLPDPENIIFPEGS-A--YEGRPW-AWFREAIDACAEATHPNAPLVLRQTDRR---DN--------GT-KSKAALAFDVLLENQDDDWRCLWHKIVLHQDPETTNIHR---PTYSHLLAFGRP--QVGPGS-RTPDVLRP-GDKLYANGMGLATAERAVQFAG---SAHEVIVDPFCGRGTVPVMADALGYSAIGVDLDPEQVQHARGLT
      consensus/100%                                                 ....................p.....l...-u..ah..................hhsShPD..p..hs..s....h....a..aF..sh..h..........hhbQoD.+....p..............hsbu..h.b........................h.....s......sh...........................pshsh.ss..s..ah.......p.llDsFCG.Go..s.As..GhpulGl-Ls..b.p.u..L.
      consensus/95%                                                  ....................R.....l.s.Dul.ah.................uhlsShPD.sE..hs.hs....h....ac.WF..sh..h...hs..ssslhYQoDl+...pc.............blsKu.hh.bus........aHKhh...s.....hsp...sshsHhlshup........p..ssDl....G...a.+uMsh.As..sh.ah.......psllDsFCG.GohLuhANh.GhpulGV-Lu..+.cpu..L.
      consensus/90%                                                  ...................bR....pl.s.Dul.al........s.......puhlsSLPD.sE..hs.hs....h....ac.WF..sh.bh...hs..ssslhYQTDl+...cc.............alDKuahh.buu..ss....aHKhh...ss....hs+...ssasHhlChu+...ph...p..osDl....G.h.Ws+uMGh.AC.hshbah......spollDPFCG.GohLAhANh.GLpulGV-LS.p+scpup.L.
      consensus/85%                                                  ...................bR....sV.C.Dul.aL.ps.sh..s.......puhlsSLPDhoE..hs.hs....h....ac.WFhpsh.hhhs.lss.ssAlFYQTDlK...cc........s....alDKuaLh.buA..sG..hlaHKlhhp.ss.p.shuR...suauHhlChu+s..ph...c.sTsDlhs..G.hsWs+uMGh.AC.hsh+alb...s.spTllDPFCG.GohLAlANh.GLsulGV-LS.p+scpup.L.
      consensus/80%                                                  ...................bR....sV.C.Dul.aL.cs.sl..s.......puhlsSLPDhSE..hsshs....h....ac.WFlpsh.hhhu.lss.ssAIFYQTDlK...cc........G....aVDKuaLl.buAbbsG..hlaHKlhhp.sP.s.shuR...PuauHhlChu+s..ph...c.sTsDVhP..G.hsWs+uMGl.ACbhsh+alb...s.scTllDPFCG.GohLAVANh.GLculGV-LS.p+sc+Ap.L.
      consensus/75%                                                  b..................bR....sV.C.Dul.WL.cs.sl..s.......puhlsSLPDhSE..hPshu....h....Wc.WFlcshsLhhu.lss.usAIFYQTDlK...c-........G....aVDKuaLV.buAcbsGhphlaHKlhhp.sPsosshGR...PuauHhLChu+s..pls.sc.sTPDVhP..GphsWsRuMGlpACbhshRalb...s.s+TVlDPFCG.GohLAVANh.GL-ulGV-LSbpRsc+A+sLp
      consensus/70%                                                  h.p...b...........scR....sV.C.DAlsWL.cupsl..s.......suhlTSLPDhSE..hPsLu....hsp..W+sWFlcsAsLlhu.lsspuVAIFYQTDVK...c-........G....WVDKuaLV.+uAEcsGhclLWHKlVCR.sPGTsThGR...PAYoHhLChSRu.l+ls.uc.uTPDVLP..GphsWsRuMGlpACbhAhRFlb.p.TssRTVVDPFCG+GThLAVANAhGLDAlGVELSR+RA++ARsLp
      
      Back to Contents
    • General notes, Phyletic distribution and gene neighborhoods of the Group I, Clade 2 adenine methylases (Dictyostelium DICPUDRAFT_50950-like)

      General notes

      In eukaryotes, this clade of MTases are found in Guillardia, Dinoflagellates and Amoebozoans. The Dictyostelium DICPUDRAFT_50950 protein is the prototype of this family.Interestingly, they do not possess the frequently observed DAM -strand-4 motifs, and instead have a SLPD motif. This type of motif is only present in this clade of methylases and not outside of it. The characteristic sequence features of this family include a DxxC motif after strand-1, D/E in strand-2, D after strand-3, SLPD instead of DPPY after strand-4, DxK after strand-5, HK in strand-6 and H and R flanking strand-7. The Methylase fused to NTF2 in dinoflagellate,seems to be a fragment, but Guillardia (gi|551670651) has a full sequence and was used to reconstitute the Dinoflagellate methylase as it is divided between two proteins, Smin1000020134 and Smin1000020133. Smin1000020134 is N-terminal and has PPR repeats at the N-terminus, whereas Smin1000020133 is fused to an NTF2 domain at the C-terminus. This fusion is like the Ot12g00270 clade where the predicted Dinoflagellate RNA methylases (two of them) are fused to PPR repeats, although they have been independently derived in the eukaryotes.
      General notes:
      GI           Operons/Domain architectures                                                                        Arch   Pfam-Arch         Gene name            len  phylogeny                                     Species                                  Genbank descriptions
      #; Eukaryotic versions
      66809181     <-DAM*                                                                                              DAM                      DDB_G0285285         440  eukaryota>amoebozoa>mycetozoa>dictyosteliida  Dictyostelium discoideum AX4             hypothetical protein DDB_G0285285 [Dictyostelium discoideum AX4].                 <-66809167_?||66809169_?-><-66809171_?<-66809173_?||66809175_?->66809177_?-><-66809179_?<-66809181_DAM*<-111226459_?<-66809185_?<-66809187_?||66809189_?-><-66809191_?<-66809193_?||66809195_?->
      470249763    <-DAM*                                                                                              DAM                      DFA_07250            372  eukaryota>amoebozoa>mycetozoa>dictyosteliida  Dictyostelium fasciculatum               hypothetical protein DFA_07250 [Dictyostelium fasciculatum].                      <-470249749_?||470249751_?->470249753_?-><-470249755_?||470249757_?-><-470249759_?||470249761_?-><-470249763_DAM*<-470249765_?<-470249767_?||470249769_?-><-470249771_?||470249773_?-><-470249775_?<-470249777_?
      281204782    DAM*->                                                                                              DAM                      PPL_08445            361  eukaryota>amoebozoa>mycetozoa>dictyosteliida  Polysphondylium pallidum PN500           hypothetical protein PPL_08445 [Polysphondylium pallidum PN500].                  <-281204775_?||281204776_?->281204777_?-><-281204778_?<-281204779_?<-281204780_?<-281204781_?||281204782_DAM*-><-281204783_?<-281204784_?||281204785_?->281204786_?-><-281204787_?<-281204788_?||281204789_?->
      735852337    <-DAM*                                                                                              DAM    N6_N4_Mtase       SAMD00019534_083600  341  eukaryota>amoebozoa>mycetozoa>dictyosteliida  Acytostelium subglobosum LB1             hypothetical protein SAMD00019534_083600, partial [Acytostelium subglobosum LB1]. <-735852330_?<-735852331_?<-735852332_?||735852333_?-><-735852334_?<-735852335_?||735852336_?-><-735852337_DAM*<-735852338_?<-735852339_?<-735852340_?<-735852341_?<-735852342_?<-735852343_?||735852344_?->
      551670651    <-DAM*                                                                                              DAM    SP+N6_N4_Mtase    GUITHDRAFT_103022    318  eukaryota>cryptophyta                         Guillardia theta CCMP2712                hypothetical protein GUITHDRAFT_103022 [Guillardia theta CCMP2712].               <-551670637_?||551670639_?-><-551670641_?||551670643_?->551670645_?->551670647_?-><-551670649_?<-551670651_DAM*<-551670653_?<-551670655_?||551670657_?->551670659_?-><-551670661_?||551670663_?->551670665_?->
      330842901    DAM*->                                                                                              DAM    N6_N4_Mtase       DICPUDRAFT_50950     259  eukaryota>amoebozoa>mycetozoa>dictyosteliida  Dictyostelium purpureum                  hypothetical protein DICPUDRAFT_50950, partial [Dictyostelium purpureum].         330842901_DAM*-><-330842915_?||330842903_?->330842905_?->330842907_?->330842909_?-><-330842917_?||330842911_?->
      Smin1000020133\                                                                                                  DAM                      Smin1000020133       193  eukaryota>alveolata>dinophyceae               Symbiodinium minutum Mf 1.05b.01         
      Smin1000020134/                                                                                                  DAM                      Smin1000020134       633  eukaryota>alveolata>dinophyceae               Symbiodinium minutum Mf 1.05b.01        
      # 25; Prokaryotic homologs
      488687410    <-METHYLASE<-?<-5xTM+HISKIN<-?<-DAM*||?->?->SUKH->?-><-DOC                                          DAM    N6_N4_Mtase       STAUR_RS01010        231  bacteria>proteobacteria>deltaproteobacteria   Stigmatella aurantiaca                   hypothetical protein [Stigmatella aurantiaca].                                    <-739729589_?<-488687416_?<-488687441_?<-488687447_METHYLASE<-488687432_?<-503139379_5xTM+HISKIN<-488687400_?<-488687410_DAM*||488687402_?->488687445_?->488687427_SUKH->488687389_?-><-739729599_DOC||503139381_?->739729588_?->
      760026550    <-HISKIN||3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE->                  DAM    N6_N4_Mtase       LILAB_RS07805        264  bacteria>proteobacteria>deltaproteobacteria   Myxococcus fulvus                        hypothetical protein [Myxococcus fulvus].                                         503702586_?->503702587_?-><-503702588_HISKIN||760026545_3xTM->503702590_?-><-503702591_?<-760026547_ABHYDROLASE3||760026550_DAM*-><-760029538_?<-760026553_?||760029540_?->503702597_5xTM+HISKIN->503702598_?->760026556_METHYLASE->503702600_?->
      819023527    <-METHYLASE<-?<-5xTM+HISKIN<-?||?->ABHYDROLASE3->?-><-DAM*||ABHYDROLASE3->                          DAM    N6_N4_Mtase       MFUL124B02_RS00860   237  bacteria>proteobacteria>deltaproteobacteria   Myxococcus fulvus                        hypothetical protein [Myxococcus fulvus].                                         <-819023518_METHYLASE<-819023520_?<-819023522_5xTM+HISKIN<-819023524_?||819023525_?->819038976_ABHYDROLASE3->819023526_?-><-819023527_DAM*||819023528_ABHYDROLASE3->819023531_?->819038978_?-><-819023533_?<-819023534_?||819023535_?->819023536_?->
      505158657    <-METHYLASE<-?<-5xTM+HISKIN<-?||?->ABHYDROLASE3->?-><-DAM*||?->ABHYDROLASE3->?-><-?<-?<-3xTM        DAM    N6_N4_Mtase       MYSTI_RS00680        245  bacteria>proteobacteria>deltaproteobacteria   Myxococcus stipitatus                    hypothetical protein [Myxococcus stipitatus].                                     <-505158650_METHYLASE<-505158651_?<-505158652_5xTM+HISKIN<-505158653_?||505158654_?->763426938_ABHYDROLASE3->505158656_?-><-505158657_DAM*||505158658_?->505158659_ABHYDROLASE3->505158660_?-><-505158661_?<-505158662_?<-505158663_3xTM||505158664_?->
      488713785    3xTM->?-><-?<-ABHYDROLASE3<-?||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE->                         DAM    N6_N4_Mtase       A176_RS20440         250  bacteria>proteobacteria>deltaproteobacteria   Myxococcus sp. (contaminant ex DSM 436)  hypothetical protein [Myxococcus sp. (contaminant ex DSM 436)].                   <-488713778_?<-488713779_?||768721037_3xTM->768721038_?-><-488713782_?<-488713783_ABHYDROLASE3<-488713784_?||488713785_DAM*-><-488713786_?<-768721088_?||768721089_?->768721039_5xTM+HISKIN->488713790_?->768721040_METHYLASE->488713792_?->
      763416965    <-METHYLASE<-?<-5xTM+HISKIN<-?||?->?-><-DAM*||ABHYDROLASE3->?-><-?<-3xTM                            DAM    N6_N4_Mtase       MXAN_RS00775         246  bacteria>proteobacteria>deltaproteobacteria   Myxococcus xanthus                       hypothetical protein [Myxococcus xanthus].                                        <-521966331_?<-499869564_METHYLASE<-499869565_?<-499869566_5xTM+HISKIN<-521966329_?||499869568_?->759926767_?-><-763416965_DAM*||499869571_ABHYDROLASE3->499869572_?-><-499869573_?<-521966328_3xTM||499869575_?->521966327_?->499869577_?->
      337257340    <-HISKIN||3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE->                  DAM    N6_N4_Mtase       LILAB_07935          246  bacteria>proteobacteria>deltaproteobacteria   Myxococcus fulvus HW-1                   hypothetical protein LILAB_07935 [Myxococcus fulvus HW-1].                        337257333_?->337257334_?-><-337257335_HISKIN||337257336_3xTM->337257337_?-><-337257338_?<-337257339_ABHYDROLASE3||337257340_DAM*-><-337257341_?<-337257342_?||337257343_?->337257344_5xTM+HISKIN->337257345_?->337257346_METHYLASE->337257347_?->
      499869570    3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE->                            DAM    N6_N4_Mtase       MXDZ_RS0208475       266  bacteria>proteobacteria>deltaproteobacteria   Myxococcus xanthus                       hypothetical protein [Myxococcus xanthus].                                        <-499869577_?<-521966327_?<-499869575_?||521966328_3xTM->499869573_?-><-499869572_?<-499869571_ABHYDROLASE3||499869570_DAM*-><-759926767_?<-499869568_?||521966329_?->521966330_5xTM+HISKIN->499869565_?->499869564_METHYLASE->521966331_?->
      763330777    DAM*->?->5xTM+HISKIN->?->METHYLASE->                                                                DAM    N6_N4_Mtase       DB31_RS18015         234  bacteria>proteobacteria>deltaproteobacteria   Hyalangium minutum                       hypothetical protein [Hyalangium minutum].                                        763331669_?-><-763330769_?<-763331672_?<-763330771_?<-763331675_?<-763330773_?||763330775_?->763330777_DAM*->763330780_?->763330783_5xTM+HISKIN->763330787_?->763330788_METHYLASE->763331677_?->763331678_?-><-763330791_?
      759680543    <-5xTM+HISKIN<-?<-?<-?<-?||?->ABHYDROLASE3-><-DAM*<-?<-?<-?||?-><-?||ABHYDROLASE3->                 DAM    N6_N4_Mtase       Q664_RS19790         237  bacteria>proteobacteria>deltaproteobacteria   Cystobacter violaceus                    hypothetical protein [Cystobacter violaceus].                                     <-759680531_5xTM+HISKIN<-759680612_?<-759680533_?<-759680534_?<-759680537_?||759680540_?->759680613_ABHYDROLASE3-><-759680543_DAM*<-759680545_?<-759680548_?<-759680552_?||759680555_?-><-759680557_?||759680560_ABHYDROLASE3->759680563_?->
      488707209    <-DAM*<-?||?-><-?||?->STYKIN->STYKIN->                                                              DAM    N6_N4_Mtase       D187_RS06910         240  bacteria>proteobacteria>deltaproteobacteria   Cystobacter fuscus                       hypothetical protein [Cystobacter fuscus].                                        <-759717612_?<-488707202_?<-488707203_?||488707204_?-><-488707206_?||759717613_?-><-759717277_?<-488707209_DAM*<-488707210_?||759717614_?-><-759717615_?||759717616_?->759717617_STYKIN->759717619_STYKIN->759717279_?->
      504213579    3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?||5xTM+HISKIN->METHYLASE->                                     DAM    N6_N4_Mtase       COCOR_RS39470        243  bacteria>proteobacteria>deltaproteobacteria   Corallococcus coralloides                hypothetical protein [Corallococcus coralloides].                                 <-759606359_?<-504213574_?<-759604248_?||504213575_3xTM->504213576_?-><-504213577_?<-504213578_ABHYDROLASE3||504213579_DAM*-><-504213580_?||504213581_5xTM+HISKIN->759606361_METHYLASE-><-759606363_?||759604249_?->504213586_?->504213587_?->
      85777006     <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM->    DAM    N6_N4_Mtase       Adeh_4079            252  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter dehalogenans 2CP-C      conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-C].             85776999_?-><-85777000_?<-85777001_Glyoxalase_4<-85777002_TRANSGLUTAMINASE||85777003_?->85777004_?-><-85777005_Acyl-ACP_TE||85777006_DAM*->85777007_METHYLASE-><-85777008_DnaJ||85777009_Ferredoxin-RRM->85777010_?->85777011_?-><-85777012_?||85777013_?->
      501520222    <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM->    DAM    N6_N4_Mtase       ANAEK_RS21100        247  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter sp. K                   hypothetical protein [Anaeromyxobacter sp. K].                                    501520215_?-><-501520216_?<-501520217_Glyoxalase_4<-501520218_TRANSGLUTAMINASE||501520219_?->501520220_?-><-501520221_Acyl-ACP_TE||501520222_DAM*->501520223_METHYLASE-><-501520224_DnaJ||501520225_Ferredoxin-RRM->501520226_?->501520227_?-><-501520228_?||501520229_?->
      506415536    <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM->    DAM    N6_N4_Mtase       A2CP1_RS21285        247  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter dehalogenans            hypothetical protein [Anaeromyxobacter dehalogenans].                             506415530_?-><-506415531_?<-506415532_Glyoxalase_4<-506415533_TRANSGLUTAMINASE||506415534_?->501520220_?-><-506415535_Acyl-ACP_TE||506415536_DAM*->506415537_METHYLASE-><-506415538_DnaJ||506415539_Ferredoxin-RRM->506415540_?->506415541_?-><-506415542_?||506415543_?->
      775300299    <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ                      DAM    N6_N4_Mtase       PSR1_03713           244  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter sp. PSR-1               hypothetical protein PSR1_03713 [Anaeromyxobacter sp. PSR-1].                     775300292_?-><-775300293_?<-775300294_Glyoxalase_4<-775300295_TRANSGLUTAMINASE||775300296_?->775300297_?-><-775300298_Acyl-ACP_TE||775300299_DAM*->775300300_METHYLASE-><-775300301_DnaJ
      152029321    <-Endonuclease_5<-?<-TIMbarrel_redox||?-><-?||DAM*-><-?<-URI<-Patatin                               DAM    N6_N4_Mtase       Anae109_2889         225  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter sp. Fw109-5             conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5].                    <-152029314_?||152029315_?-><-152029316_Endonuclease_5<-152029317_?<-152029318_TIMbarrel_redox||152029319_?-><-152029320_?||152029321_DAM*-><-152029322_?<-152029323_URI<-152029324_Patatin||152029325_?-><-152029326_?||152029327_?-><-152029328_?
      752809907    <-Endonuclease_5<-?<-TIMbarrel_redox<-?||DAM*-><-?<-URI<-Patatin                                    DAM    SP+N6_N4_Mtase    ANAE109_RS14815      216  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter sp. Fw109-5             hypothetical protein [Anaeromyxobacter sp. Fw109-5].                              752809903_?-><-752809904_?||752809905_?-><-752809906_Endonuclease_5<-752809043_?<-501045893_TIMbarrel_redox<-501045895_?||752809907_DAM*-><-501045897_?<-501045898_URI<-501045899_Patatin||752809908_?->501045902_?-><-501045903_?<-501045904_?
      752814769    <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM->    DAM    SP+N6_N4_Mtase    ADEH_RS21070         215  bacteria>proteobacteria>deltaproteobacteria   Anaeromyxobacter dehalogenans            hypothetical protein, partial [Anaeromyxobacter dehalogenans].                    499742384_?-><-499742385_?<-499742386_Glyoxalase_4<-499742387_TRANSGLUTAMINASE||499742388_?->499742389_?-><-499742390_Acyl-ACP_TE||752814769_DAM*->499742392_METHYLASE-><-499742393_DnaJ||499742394_Ferredoxin-RRM->499742395_?->499742396_?-><-752814525_?||499742398_?->
      # 1;
      509139482    DnaJ->?->?->?->?->?->ParB+DAM->DAM*->?->?->?->?->?->DAM->                                           DAM    UPF0020           M201_gp12            213  viruses>dsdna viruses, no rna stage           Halovirus HCTV-2                         AdoMet-MTase [Halovirus HCTV-2].                                                  509139477_DnaJ->509139478_?->509139479_?->509139480_?->509139548_?->509139549_?->509139481_ParB+DAM->509139482_DAM*->509139483_?->509139484_?->509139485_?->509139550_?->509139486_?->509139487_DAM->509139488_?->
      
      Back to Contents
    • Multiple sequence alignment of the Group I, clade 3 adenine methylases (Naegleria NAEGRDRAFT_76461-like)

                                                                                                       Str-3               Str-4                                                                                        Str-5                            Str-6                   Str  7                                                                                                       Synapomorphic strand                                Str-1                Str-2
      ALIGN                                                                                   ------HH-HH-----H------------EEEEEE--------------------------------------------------HH-HHHHH-HHH----------------HEEEE----EEEEEE-------HHHHHHH----EEE------EEEEE--------------------HHEEEE----------E--------------------------------------------------------------------------------------------EEEE----E-------------------------HHHHHHHHHHEE-----EEEEEE-----EHHEEHH---EEEEE--HHHHH-HHHHHHHHHHHHHHH-----HHHHHHH---------HHHHHH----------------EEE----------------HHHHHHHHH--------------------
      HMM                                                                                     --HH-HHH-HHHHHHHHHHH-------EEEEEEEE--------------------------E--E--E-------E-E------HHH-HHHHH-HH-----------------HEEE-----EEEEEEE------HHHHHHH--HHHHHH-HHHHHEEEE-------HHHHHHHHHHH--EEEEEE--------------------EEEEE---------------------------------------------E-----E-EE------------EE-------EEEHEH----H--HH---------------------HHHHHHHHHHHH----EEEEEE----HHHHHHHHH---EEEEEE--HHHH-HHHHHHHHHHHHH-----HEEHEEHHH--------HHHHHHHH--------E-----EEEEE-----EEHHHHHHHHHHHHHHHHEEE-----EEHHHHEEEE---
      FREQ                                                                                    ---------------HHHHH---------EEEEE--------------------------------------------------HHH-HHHHH-HHH----------------HHH------EEEEEEE---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      PSSM                                                                                    -----HHH-HHHHH-HHHHHH--------EEEEE------------------------------------------------------HHHHH-HHH----------------HHH------EEEEEEE-------HHHHHHH----H-------EEEEEE--------------------EEEEE-------------------------------------------------------------------------------------------------------EEE-------------------------------HHHHHHHHHH-------EEE-------HHHHHHHH----EEEE---HHHH-HHHHHHHHHHHHH------HHEE------------HHHHHHH-------EEE-----EEEEE-----EEE------HHHHHHHHHH-------------EEEE---
      FINAL                                                                                   --HH-HHH-HHHHHHHHHHH---------EEEEEE-----------------------------E-------------------HHH-HHHHH-HHH----------------HHH------EEEEEEE-------HHHHHH-------------EEEEEE-------------------EEEEEE-------------------------------------------------------------------------------E------------E---------EEEE-------------------------------HHHHHHHHHH-------EEE------HHHHHHHHH----EEEE---HHHH-HHHHHHHHHHHH--------EE-------------HHHHHHH---------------EEEEE-----EE-------HHHHHHHHH--------------EEEE---
      NAEGRDRAFT_76461_Naegleria_gruberi_strain_NEG-M_290971699                               ERNG-YSA-ILLYGDVLELANI-FPKKVSYDLMICDLPY--------------------G-V--F--KD-SP-YDV-LFT----DDQ-LKEFI-DNL----------------YQITPDTNFPTWIFFG-----EYKQIVKLQELIELKK-GNAVICIWVK----NGRQFGANFGTYKYESFLLCF----------PNKQV-----NIKPQ---------------------------------------------GFSLSPF-SI------------CF------PSETNFL----K--DS-NS---------RE-CNKGQKPLSLISWLVYQFSNVDGIVLDLCSGTATTAVAAVSYGRNSISLESNHGQF-EHAAERLKLSEFEKVNFEPLVVCTVSEDKEKKEKKKSTKRKTAPTKK-TPKKASKKKKTETPLKGSKMQSKKAEEIAMKYIDSEAVEMASDEENVSEKDEEEDDEE----------------------
      ADA73_RS21195_Bacteroides_fragilis_695344547                                            IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERQIQEELECRNKVQEEKEI------REEKQNQ--------------------------------------------------------------------------------------------
      LEP1GSC165_RS0218310_Leptospira_santarosai_696345163                                    -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIKTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLELNFD-----------------------------------------------------------------------------------------------------------
      LEP1GSC071_RS16130_Leptospira_santarosai_696349061                                      -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIKTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLGLNFES----------------------------------------------------------------------------------------------------------
      LEP1GSC076_RS08120_Leptospira_696229311                                                 -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIRTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLELNFES----------------------------------------------------------------------------------------------------------
      LEP1GSC068_2949_Leptospira_sp_Fiocruz_LV3954_410015573                                  -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIRTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLELNFES----------------------------------------------------------------------------------------------------------
      LEP1GSC071_3962_Leptospira_santarosai_str_JET_410804029                                 -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIKTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLGLNFES----------------------------------------------------------------------------------------------------------
      T343_RS0105895_Leptospira_licerasiae_495865040                                          -----MST-ELHLGDCLKILPK-IPDS-SIDLIFCDLPY--------------------G-T-----TD-CP-WDK-IIP--------MEKLW-PEY----------------ERISKENT--PIILTA---SQPFTTYLINSNPKNFRY-E----LIWYK-TKASGFLLAKKRPNKSHEN-ILVF----------YKKQP-----VYNPI--K--YEI-D-ERYRR--K-G-K-T-LGNGN--QSTVFRI---T-GEKSKNY-QY--LD-S-GS---RY------PDSVLCF-P--S--ES-EV--G---------MHPTQKPTRLLRFLIKSFSNPGDLVLDNCMGHGTTGIAAVELGRNFIGIEKERSYF-KKAESKIRMAEKRYSLGLDFET----------------------------------------------------------------------------------------------------------
      LEP1GSC132_RS14950_Leptospira_kirschneri_490906211                                      -----MAT-LLYHGDCLNHLPK-IPDA-SVDLIFCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKKFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILVF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFAI---R-GEKSENY-QY--LD-D-GS---RF------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLRLLRYLIRTYSNPGDTVLDNCMGHGTTGIAAVELARDFIGMEMDKEYF-EKAKRKIQMAETRTQLELNFES----------------------------------------------------------------------------------------------------------
      LEP2GSC066_RS0103865_Leptospira_santarosai_648272077                                    -----MATHHIYHGDCLKILPK-ISDA-SVDLIFCDLPY--------------------G-T-----TD-CA-WDI-IIP--------MEKLW-PEY----------------ERISKKKT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKTRPNKSHEN-ILVF----------YKKQP-----IYNPI--K--YEI-D-ERYRR--K-G-K-T-LGNGN--QSTVFSI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-ET--G---------MHPTQKPLRLLRYLIKTFSNPGDTVLDNCMGHGTTGIASIELGRNFIGIERDKDYF-QKAKSKIKMAETRTQLGLNFES----------------------------------------------------------------------------------------------------------
      LEP1GSC193_RS01970_Leptospira_alstonii_738085263                                        -----MAT-HLYHGDCLDNLPK-IPDA-SVDLIFCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SHPFTNYLINSNPKNFRYYE----LIWYK-TKASGFLNAKTRPNKSHEN-ILVF----------YKNQP-----VYNPI--K--YQI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GF---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLRLLRYLIKTYSNVGDTVLDNCMGHGTTGIAAIELARDFIGMEMDKEYF-EKAKRKIQMAETRIQLELNFES----------------------------------------------------------------------------------------------------------
      AAY48_RS15275_Leptospira_interrogans_446543012                                          -----MAT-HLYHGDCLSHLPK-IPDT-SVDLIFCDLPY--------------------A-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKEKT--PIILTG---SHPFTNYLINSNPKNFRYYE----LIWYK-TKASGFLNAKTRPNKSHEN-ILIF----------YKNQP-----VYNPI--K--YEI-D-KRYKR--K-G-K-I-QGKGH--QSTVFTI---S-GEKSENY-QY--LD-D-GF---RY------PDSVLCF-P--S--EF-EI--G---------MHPTQKPLRLLRYLIKTYSHVGDTVLDNCMGHGTTGIAAVELARNFIGMEMDKEYF-EKAKRKIQMAETRTQLELNFES----------------------------------------------------------------------------------------------------------
      BAPNAU_RS08760_Bacillus_amyloliquefaciens_549781736                                     LELN-----RIYQMDCLEGMKL-IPDN-SIDMILCDLPY--------------------G-Q-----TS-NS-WDS-VLP--------LDKLW-KQY----------------NRIIKDNG--AIVLTA---KGKFKINLINSNFKNYRY-E----WVWDK-NKGANFPHVKRMPLNVHEY-VVVF----------YNQQP-----VYNPQ--M--TEG-K-PYKQI--R-------KEESL--K---GIA---D-NINR----KT-TIS-N-GK---RY------PKSIIRV-EGIA-----QR---------NI-CHPTQKPVELFEYLIKTYSNEGDIVLDNCMGSGTTAVACEKLNRKWIGFEIVKEYI-AIANKRLDYF---HSISES-------------------------------------------------------------------------------------------------------------
      JO84_gp345_Aureococcus_anophagefferens_virus_672551258                                  ---M-----NILEGDCLFHMKN-ISDK-SVDMILCDLPY--------------------G-T-----TK-NK-WDS-VIP--------FYDLW-ENY----------------NRIIKDNG--AIVLFG---SQPFTTKLISSNMKDFRY-C----LVWEK-NKFSDFLNAKRKPMKTNED-ICIF----------YKKQP-----TYNPQ--Y--TYS-T-PYTRW--N-------TQSAV--D---KQT---N-YGGHKQN-IS--KS-D-GK---RL------PTTVLKF-N--R-----IE---------RP-DHPTQKPIDLLEWLIKTYSNENELILDNCMGVGSTGIAAKNTNRRFIGIEKDQNYF-KKATENLI------------------------------------------------------------------------------------------------------------------------
      BCPBV781_gp10_Burkholderia_phage_Bcep781_23752321                                       HVNY-----ELYWGDCLDLMRL-LPDA-SVDMVMCDLPY--------------------G-T-----TA-CA-WDS-VLP--------FDALW-AQY----------------RRIVKSRG--AVVLTA---AQPFTSALVASNFEWFKY-D----WVWAK-NRPTNFAHAKNKPMPKHES-VLVFSPGTTVHASQSKLRM-----TYNPQG-L--TRI-E-PRKMK----------TYNTD--A---MFS---K-RGSHGEY-----TQ-E-FT---NY------PHSLLEF-S--T-----DQ-----LN-----LHPTAKPVALMEYLIRTYTSEGDTVLDNCMGSGTTGVACINTGRRFIGMEKDADYA-LIATGRMR---EAIDRD----------IPIDLC-----------------------------------------------------------------------------------------------
      BCPBV43_gp10_Burkholderia_phage_Bcep43_41057660                                         HVNY-----ELYWGDCLDLMRL-LPDA-SVDMVMCDLPY--------------------G-T-----TA-CA-WDS-VLP--------FDALW-AQY----------------RRIVKSRG--AVVLTA---AQPFTSALVASNFEWFKY-D----WVWAK-NRPTNFAHAKNKPMPKHES-VLVFSPGTTVHASQFKLRM-----TYNPQG-L--TRI-E-PRKMK----------TYNTD--A---MFS---K-RGSHGEY-----TQ-E-FT---NY------PHSLLEF-S--T-----DQ-----LN-----LHPTAKPVALMEYLIRTYTSEGDTVLDNCMGSGTTGVACINTGRRFIGMEKDADYA-LIATGRMR---EAIDRD----------IPIDLC-----------------------------------------------------------------------------------------------
      HMPREF9022_RS14035_Erysipelotrichaceae_bacterium_2_2_44A_496003735                      -MES-----YIKHGDCLEVMKD-IPDK-SIDMILCDLPY--------------------G-T-----TQ-CK-WDV-VIP--------FDKLW-EQY----------------CRVAKDNA--AIVLFG---AEPFSSRLRLSNVQMYKY-D----WIWDK-VKGTGFLNAKKQPLRNHEV-ICVF----------YKSQC-----TYNPQ--M--TSG-Q-RKVSY--R-------RKGLQ--T---DVY---G-QADEDYI-----YD-S-AA---RY------PRSIQVF-S--A--DT-QK---------CS-LQPTQKPIALLEYLIRTYTNDYDIVLDNCMGSGSTCIAAQNTNRKYIGIESEESIF-NTAKDRIK----------------------------MNKTQL-----QLF------------------------------------------------------------------------------
      LF41_RS05185_Lysobacter_dokdonensis_738211128                                           --MI-----DLYQGDCLEVMGR-LPSN-SVDLILCDLPY--------------------G-T-----TS-CK-WDS-VIP--------FDALW-SQY----------------RRIAKRNA--AIVLTA---NQPFTTALIASNLCEFRY-T----WVWDKVNRPTGFLNAKLRPLRAFED-VCVF----------YRAQP-----TYNPQK-W---RG-E-PYKTT----------HGSSG--E---AYH---R-TETRTQV-----CA-D-GM---RY------PQDLIRI-K--A-----DN-----RGVEGR-VHPTQKPVALMEYLVKTYSNEGDTVLDNCMGSGTTGVACANTGRRFIGIERDADYF-TIASKRVG---VAGAKP------R-VWVPLAVSGD---------------------------------------------------------------------------------------------
      B4072_RS12970_Bacillus_subtilis_752704809                                               LKKK-----RIYQMDCLEGMPL-IPDK-SIDMILCDLPY--------------------G-T-----TR-NK-WDS-IIP--------FDKLW-EQY----------------KRIIKDNG--AIVLTA---AQPFTSALIMSNVKDFKY-E----WIWKK-SNGTGHLNAKRMPMKDHES-ILVF----------YKKQP-----TYNPQ-------GIV-PYNRV--T-------RRGGN--G---GNY---N-SSNT----SN--FQ-E-YT---NY------PRTIQQF-A--Y-----DK---------KK-YHPTQKPVALFEYLIKTYSNEGDTVLDNCMGSGTTAVACENLNRKWIGFETESKYI-EIANNRLKEL-HSISNF---------------------------------------------------------------------------------------------------------------
      M089_3211_Bacteroides_ovatus_str_3725_D9_iii_649530658                                  -------------MDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-AN-WDR-QIP--------LTALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PSERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EITERRIQ---EELEC--------------------RNKAQEEKEI-------------------REEKQINKSMTWTKKES----------------------------------------------
      B4069_2083_Bacillus_subtilis_751875430                                                  LKKK-----RIYQMDCLEGMPL-IPDK-SIDMILCDLPY--------------------G-T-----TR-NK-WDS-IIP--------FDKLW-EQY----------------KRIIKDNG--AIVLTA---AQPFTSALIMSNVKDFKY-E----WIWKK-SNGTGHLNAKRMPMKDHES-ILVF----------YKKQP-----TYNPQ-------GIV-PYNRV--T-------RRGGN--G---GNY---N-SSNT----SN--FQ-E-YT---NY------PRTIQQF-A--Y-----DK---------KK-YHPTQKPVALFEYLIKTYSNEGDTVLDNCMGSGTTAVACENLNRKWIGFETESKYI-EIANNRLKEL-HSISNF---------------------------------------------------------------------------------------------------------------
      BAT_RS05535_Bacillus_pumilus_489305499                                                  LELN-----RIYQMDCLEGMKL-IPDE-SVDLILCDLPY--------------------G-T-----TD-VKRWDK-IIP--------IEKLW-EQY----------------KRIIKETG--NVVLFG---SQPFTSYLVNSNPSMFRY-E----WIWDK-TKGANFLNSNHQPLKVHEN-ILVF--SKLPASPNKKGTA-----TYFPQK-T--EGK-E-YKVKR----------SSHKG--E---IFN---G-GSLRDNF-----EKVN-EG---RH------PVSIQTF-L--K-----DK---------DN-IHPTQKPVEMCEYLIRTYTDQSDIVLDNCMGSGTTAVASIISQRKWIGFETDPTFY-QLANKRLE---QVQLGD------D-L-ASYQ-------------------------------------------------------------------------------------------------
      JI66_RS07000_Lactobacillus_kunkeei_736516671                                            MNKA-----QLQKGDCIKLMHE-LPDK-SVDMILCDLPY--------------------G-I-----TN-HK-WDS-IIP--------YDDLW-TEY----------------ERIIKDNG--AIVLFG---AEPFSTKLRMSNIKLYRY-D----WVWLK-SRATLFQMSHKRPMNKHEL-ISVF----------YKHLP-----TYNPQ--M--SKG-K-PYKTN--G-------RRERK--A---SGF---L-SSGMVNI-PR--NN-K-GT---RY------PTTILDFPN--S-----NA---------KR-YHPTEKPINILSYLIKTYTNENEVVLDNCMGSGSTGVACIRNNRKFIGFELNGHYF-EVAQKRIN---NELDSL---------------------------------------------------------------------------------------------------------------
      BAT_RS05555_Bacillus_pumilus_489305329                                                  LELN-----RIYQRDCIEGMRM-LPDK-SIDMILCDLPY--------------------G-T-----TR-NK-WDI-VIP--------LDSLW-EQY----------------ERVVKDNG--AIVLTA---AQPFTSLLVSSNPKLFRY-D----ITWDK-KQITGFLNAKRMPLRKHED-ILIF----------YKKPP-----TYNPQ--F--TFG-D-SYEV---R-------RKHST--S---NYG---S-QNEN----ET--KS-D-GR---RY------PTSIIEI-P--QIR----E---------KG-GHPTQKPVKLFEWLIKTFTNEGDIILDSCIGSGTTAVAATQLNRNFIGFEIETEYA-KRANQRLD---SSVRGSSIKEAPHDK------------------------------------------------------------------------------------------------------
      HMPREF9505_RS14705_Enterococcus_faecalis_488335929                                      MELN-----KIYNEDCLEGMKR-ISDK-SIDMILCDLPY--------------------G-T-----TD-NK-WDV-IIP--------FDKLW-EQY----------------ERIIKDSG--AIVLTG---SQPFTTDIIMSNRKLFRY-E----WIWNK-NQASNFFMANKMPLKVHEN-ILVF----------YKKLP-----TYNKQ--MIPRTN-P-SVAI---A-------QERGY--V---YDG---A-KSDNYNISTV--KM-S-PK---GYDKNWKNPISILNI-N--QLKNNSNE---------RC-GHPTQKPVALFEHLIKTYTNEGEIVLDNCIGSGTTAVAAINTNRQFIGFEKEKEYF-DVAIERIK---KASEEDDSKV-----------------------------------------------------------------------------------------------------------
      H581_RS0105900_Paenibacillus_harenae_655162666                                          --LN-----TIIHGDCLDVMEE-IDAA-SIDMILCDLPY--------------------G-T-----TQ-ND-WDS-VIP--------LEKLW-TQY----------------KRIIKDNG--AIVLTA---QTPFDKVLGCSNLSMLRY-E----WIWEK-TSATGHLNANRMPLKAHEN-ILVF----------YKSLP-----DYNPQ--M--TTGHK-PVNSY--T-------KHQDD--G---SNY---G-KTKIGI--SG--GG-R-TD---RY------PRSVICF-P--T--DK-QI---------EA-LHPTQKPIELFKYLIETYSNVGDTVLDNCIGSGTTAAAALSCGRNFIGIEKEWKYV-QIARNRMEYV-QPVINF---------------------------------------------------------------------------------------------------------------
      LPP122_RS10265_Lactobacillus_paracasei_695862061                                        -------------------MAE-LPTA-SIDMILCDLPY--------------------G------TTA-NT-WDK-IIP--------FASLW-GQY----------------ERLIKPQG--AIVLTA---NERFSADVVQSNPALYRY-K----WVWVK-NTVTNFVNAKNRPLSRFEE-ILVF---SKSGTANYGNSPDTIGMNYFPQG-L--MPY---------------------NK--TVTSRKY---EQSNQLHPW-NA--PD-TYTQEWTNY------PSDVLNY----K--SD-RT--G---------WHPTQKPVELFAYLIKTYSQPNDLVLDNCMGSGTTAIAAIDTDRHFIGYEISHEYW-QRANDRIA---NHHNT--------------------QTALF---------------------------------------------------------------------------------------
      C171_RS19470_Paenibacillus_sp_FSL_H8-237_738793084                                      --LN-----QIINADCFDVFPN-IKTG-SVDMILCDLPY--------------------G-T-----TQ-SP-WDS-ILP--------FDQLW-MAY----------------ERIIKDNG--AIVLFA---KAPFDKALAASNMKLFKY-E----WIWEK-NKATGHLNKSLMPLQAHEN-ILVF----------YKRPP-----TYNAQ--M--SQGHK-PMNAA--T-------NNHKS--S---V-Y---G-DGIPWS--NE--AG-K-TE---RL------PRSVLYY-PVVN--ND-DP---------ER-IHPNQKPVELCEYFIRTYTNPGETVLDNCAGSCTTAVAASRTGRNYIAIERDKRHA-ADGTQRLKNM-QLTLF----------------------------------------------------------------------------------------------------------------
      EH55_RS12840_Synergistes_jonesii_740126887                                              NPNG-----KLYHGDCLEIMKD-IPDG-SVDMVLCDLPY--------------------G-M-----TA-CD-WDV-VIP--------FEPLW-EHY----------------NRICKRNA--AVVLFS---QQPFTTDIINSNRKKFRY-E----IIYRK-TMKMGFLNAHKMPLKGHEN-ICVF----------YKALP-----TYNPQKTQ--SRN-R-PRIRV----------QEDAR--C---RIY---S-KFKGGVM-----VD-D-GS---RY------PESVIDF-S--N-----FNGGGFVKNRERT-KHPTQKPVPLLEYLIRTYSNEKETILDNCIGSGSTAVAAENTGRRWIGIEKEEHFC-EVAKKRIA---EAAAQG------R-LLLLQG-------------------------------------------------------------------------------------------------
      BCPBV1_gp10_Burkholderia_phage_Bcep1_38638617                                           -----------MFGDCLLAMHE-LPAQ-SVDLVLCDLPY--------------------G-T-----TR-NR-WDT-PLD--------LSRLW-VAY----------------RHVCKPGA--PVLLFA---QTPFDKVLGASNLPELRY-E----WIWEK-TNATGFLNAKRAPLKAHEN-ILVF----------CDRAP-----TYRPI--K--TSG-H-VRKTS--T-------RLG-Y--S---SNY---G-AQAVSS------YD-S-TE---RY------PRSVLRF-A--S--DK-QR---------SK-LHPTQKPVALLEYLIRTHAAPGAVVLDNCMGCASTALAAMQAGCAFIGIENDVEHF-ETAQRRVR------------------------DY--QP------------------------------------------------------------------------------------------
      EH55_RS07365_Synergistes_jonesii_740127826                                              NPNG-----KLYHGDCLEIMKD-IPDG-SVDMVLCDLPY--------------------G-M-----TA-CD-WDV-VIP--------FEPLW-EHY----------------NRICKRNA--AVVLFS---QQPFTTDIINSNRKKFRY-E----IIYRK-TMKMGFLNAHKMPLKGHEN-ICVF----------YKALP-----TYNPQKTQ--SRN-R-PRIRV----------QEDAR--C---RIY---S-KFKGGVM-----VD-D-GS---RY------PESVIDF-S--N-----FNGGGFVKNRERT-KHPTQKPVPLLEYLIRTYSNEKETILDNCIGSGSTAVAAENTGRRWIGIEKEERFC-EVAKKRIA---EAAAQG------R---LLQG-------------------------------------------------------------------------------------------------
      HMPREF1981_RS00510_Bacteroides_pyogenes_545404645                                       IRID-----EIYNEDCLEGMKR-IADR-SIDAIVCDLPY--------------------G-MLNRRNRY-AA-WDR-LIA--------LEPLW-EQY----------------RRIIKPDS--PVILFA---QGMFTARLLMSQPRLWRY-N----LVWYK-DRASGHLNANRMPLRKHED-ILVF----------YEHLP-----VYHPQ--M--IPC---DKSQR--N-H---G-RRTRQ--TFTNRCY---G-DMRMTEV-RI--AD-D------KY------PTSVIQI-A--K--EH-KN--G--AF-----YHPTQKPVALVEYLIRTYTDKGDVVLDNCMGSGTTAIAAIRSGRHYIGFETDAGYC-RIAGERIK---AE-KL--------------------TETENENNPI------NNENNA----------------------------------------------------------------------
      BcepNY3gene09_Burkholderia_phage_BcepNY3_149882909                                      ANRC-----ELMFGDCLLAMHE-LPAQ-SVDLVLCDLPY--------------------G-T-----TR-NR-WDT-PLD--------LSRLW-VAY----------------RHVCKPGA--PVLLFA---QTPFDKVLGASNLPELRY-E----WIWEK-TNATGFLNAKRAPLKAHEN-ILVF----------CDRAP-----TYRPI--K--TSG-H-VRKTS--T-------RLG-Y--S---SNY---G-AQAVSS------YD-S-TE---RY------PRSVLRF-A--S--DK-QR---------SK-LHPTQKPVALLEYLIRTHAAPGAVVLDNCMGCASTALAAMQAGCAFIGIENDVEHF-ETAQRRVR------------------------DY--RS------------------------------------------------------------------------------------------
      N355_gp092_Cellulophaga_phage_phi13:2_526178238                                         MKRN-----EIYLGDCLELMPKHVEDK-SIDMIFCDLPY--------------------G-T-----TQ-CK-WDS-IID--------LDKLW-NEY----------------RRVIKDNG--VIVLFA---SQPFTSILTSSNLKMFKY-S----YTWDK-ITKTNHLNAKKQPLRQVED-ICVF----------YKKQP-----TYKPQ--G--LIE-C-EVSNF--RPN---HFKYKKG--E---KVY---G-EQKEHGN-----KS-T-YT---NY------PSNLIQY-S-----NG-NH---------NS-LHPTQKPLDLIEYMIKTYTNEGDLILDNTCGSGTTGLGAKNLGRNFIMMEQDPKYY-DVACKRVLT-----------------------------------------------------------------------------------------------------------------------
      P667_3626_Acinetobacter_baumannii_691080530                                             -MTF-----KLHHGDCLEIMAN-IPDQ-SIDMILCDLPY--------------------G-T-----TC-CA-WDT-VIS--------FNPLW-AHY----------------ERIIKPNG--AIVLFA---ANPFAAVLATSNLKLFRY-E----MIWEK-PAATGFLNAKKQPLRAHEN-ILVF----------YKSQP-----TYNPQ--K--TTG-H-KRKTA--K-------RKDIG--S---EHY---G-KQLNIKD-----YD-S-TE---RY------PRSVQLF-S--S--DK-QK---------SN-LHPTQKPVALCEYLIRTYTNVGEVVLDNCMGSGTTGIACINTDRKFIGIEKEAKYF-EIAKKRLA------------------------DA--VEIKQT-----ELFSEVV--------------------------------------------------------------------------
      LSJ_RS10550_Lactobacillus_salivarius_763125951                                          MDLK-----K---GDCLELLGG-VQDM-SIDLILCDLPY--------------------G-T-----TR-NK-WDK-IID--------LDKLW-EHY----------------NRIIKDNG--AIVLFS---QQPFSSKLIESNPKMFRY-E----RIWTK-GLATGHLNAKKMPLKKHEN-ILVF----------YKKLP-----TYNPQ--W--WYS-T-PYKV---K-------QGRSK--S---SNY---D-KQRPYTP-SE--SK-D-GR---RY------PVDIIEF-K--H-----DG---------KK-LHPTQKPVALLEYLIKTYTNEGDTVLDNCMGSGSTGVACANTNRNFIGIELSSEYY-NIAKDRIE---KAVAK----------------------------------------------------------------------------------------------------------------
      LSJ_3100c_Lactobacillus_salivarius_690349817                                            --MK-----K---GDCLELLGG-VQDM-SIDLILCDLPY--------------------G-T-----TR-NK-WDK-IID--------LDKLW-EHY----------------NRIIKDNG--AIVLFS---QQPFSSKLIESNPKMFRY-E----RIWTK-GLATGHLNAKKMPLKKHEN-ILVF----------YKKLP-----TYNPQ--W--WYS-T-PYKV---K-------QGRSK--S---SNY---D-KQRPYTP-SE--SK-D-GR---RY------PVDIIEF-K--H-----DG---------KK-LHPTQKPVALLEYLIKTYTNEGDTVLDNCMGSGSTGVACANTNRNFIGIELSSEYY-NIAKDRIE---KAVAK----------------------------------------------------------------------------------------------------------------
      B4145_RS14775_Bacillus_subtilis_516293791                                               IQLN-----KAYQLDCLEGMKL-IPDK-SVDMILCDLPY--------------------G-T-----TQ-NK-WDS-IIP--------LDKLW-EQY----------------ERIIKDNG--AIVLTA---QTPFDKVLGGSNLKLLKY-E----WIWEK-NRGTGHLNAKKMPMKNHEN-ILVF----------YKKLP-----TYNPQ--M--REG-E-PYQRLNCS-------KNALN--K---GNY---G-KTKD----SHSTVS-D-GK---RY------PLSVLDF-A--V-----VE---------RT-IHPTQKPVELFEYLIKTYTNEGEIVLDNCLGSGTTAIACELNNRKWIGFETEQQYI-ELINKRLDSIQLNYNLENLNGLT---------------------------------------------------------------------------------------------------------
      LF41_2421_Lysobacter_dokdonensis_DS-58_702087568                                        -------------------MGR-LPSN-SVDLILCDLPY--------------------G-T-----TS-CK-WDS-VIP--------FDALW-SQY----------------RRIAKRNA--AIVLTA---NQPFTTALIASNLCEFRY-T----WVWDKVNRPTGFLNAKLRPLRAFED-VCVF----------YRAQP-----TYNPQK-W---RG-E-PYKTT----------HGSSG--E---AYH---R-TETRTQV-----CA-D-GM---RY------PQDLIRI-K--A-----DN-----RGVEGR-VHPTQKPVALMEYLVKTYSNEGDTVLDNCMGSGTTGVACANTGRRFIGIERDADYF-TIASKRVG-----VAGAKP------R-VWVPLAVSGD-------------------------------------------------------------------------------------------
      PLA107_32876_Pseudomonas_amygdali_pv_lachrymans_str_M301315_330989854                   KEEI-----QLYKGDCLELMKS-IPDA-SVDMILCDLPY--------------------G-T-----TQ-NK-WDC-PID--------LSRLW-PEY----------------WRICKPSA--AIILTA---QTPFDKILGASQIGHLKY-E----WIWEK-TAATGFLNAKKSPLKAHEN-VLVF----------YRKQP-----TYNPA--M--TAG-HTIKRTN--A-------SYANH--G---ANY---G-KSSSVRA----PYE-S-TE---RY------PRSVQKL-P--K--DN-RL---------KN-QHPTQKPVALMEYLIRTYTNEGDIVLDNCMGSGTTGVACIHSGRRFIGIERDEKIF-GTASDRIASAIALRNTPVPQIELFGTA-----------------------------------------------------------------------------------------------------
      AAY85_RS20710_Pseudomonas_amygdali_763469483                                            KEEI-----QLYKGDCLELMKS-IPDA-SVDMILCDLPY--------------------G-T-----TQ-NK-WDC-PID--------LSRLW-PEY----------------WRICKPSA--AIILTA---QTPFDKILGASQIGHLKY-E----WIWEK-TAATGFLNAKKSPLKAHEN-VLVF----------YRKQP-----TYNPA--M--TAG-HTIKRTN--A-------SYANH--G---ANY---G-KSSSVRA----PYE-S-TE---RY------PRSVQKL-P--K--DN-RL---------KN-QHPTQKPVALMEYLIRTYTNEGDIVLDNCMGSGTTGVACIHSGRRFIGIERDEKIF-GTASDRIASAIALRNTPVPQIELFGTA-----------------------------------------------------------------------------------------------------
      LEP1GSC108_RS06340_Leptospira_weilii_490637745                                          -----MRT-DLHYANCFKIFPT-IPDK-SIHLILCDLPY--------------------G-T-----TD-CE-WDI-LLP--------FEALW-KEY----------------ERIITDNG--AIILTA---SQPFTTKLINSNPKLFRY-E----LIWYK-SKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----TYNPQ--K--YQI-D-PKFQR--K-G---K-SKKKP--QSSLFNI---R-GKKSESY-QY--FD-N-GL---RH------PDSVLCF-P--S--EM-RK--G---------MHPTQKPVALMKFLVSSYSNVGDTVLDNCMGSGTTGIACVELDRNFIGIEQEEEFF-ELASRRIATANKIRRLESIESAFSKQKKETEKNL----------------------------------------------------------------------------------------------
      LSS_RS02805_Leptospira_santarosai_490593464                                             NQKIEPSI-QLFNDDCFNRLPQ-IPDK-SIKMILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EQY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-STQKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLISSYSNAGDTVLDNCMGSGTTGVACVQTDRNFIGIEKEVEYF-ELAERRIEIAIKIRKLKTITSLFSEKENTND-------------------------------------------------------------------------------------------------
      LEP1GSC187_RS02085_Leptospira_santarosai_490621523                                      TQEIKSSI-QLFNDDCFNRLPQ-IPDK-SINMILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EHY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKRLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SAKKN--YSNLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPIALMNFLISSYSNAGDTILDNCMGSGTTGVACIQTDRNFIGIEKEEEYF-ELAKRRIEIAIKIRKLKTITSLFSEKENTND-------------------------------------------------------------------------------------------------
      LEP1GSC070_RS05690_Leptospira_santarosai_490626771                                      KQKIEPSI-QLFNDDCFNRLPQ-IPDK-SIKMILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EQY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKRPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-STQKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLIKSYSNTGDTVLDNCMGSGTTGVACVQTDRNFIGIEKEVEYF-ELAERRIEIAIKIRKLKTITSLFSEKENTND-------------------------------------------------------------------------------------------------
      LEP2GSC171_RS25015_Leptospira_weilii_757125221                                          ------------------MFPA-IPDK-SIHLILCDLPY--------------------G-T-----TD-CE-WDI-ILP--------FEALW-KEY----------------ERIITDNG--AIILTA---SQPFTTKLINSNPKLFRY-E----LIWYK-SKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----TYNPQ--K--YQI-D-PKFQR--K-G---K-SKKKP--QSSLFNI---R-GKKSESY-QY--FD-N-GL---RH------PDSVLCF-P--S--EM-RK--G---------MHPTQKPVALMKFLVSSYSNVGDTVLDNCMGSGTTGVACAELDRNFIGIEQEEEFF-ELASRRIATANKIRRLESLESAFSKQKKETEKNL----------------------------------------------------------------------------------------------
      LEP1GSC086_RS09945_Leptospira_weilii_490633882                                          NKNTEPSI-QLFNDDCFNIFPQ-IPDK-SVNLVLCDLPY--------------------G-T-----TD-CS-WDK-VLP--------FKELW-EQY----------------NRMIVENG--AVILTA---SQPFTTALINSNPKNFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SSKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLISSYSNVGDTILDNCMGSGTTGVACIQTDRNFIGIEKEEEYF-ELAQRRIEIAKKIRRLKTLPSIFSEKEKTDE-------------------------------------------------------------------------------------------------
      LEP1GSC133_0802_Leptospira_borgpetersenii_serovar_Pomona_str_200901868_464395818        LERAETSI-QLFNDDCFNIFPK-IPDK-SINLVLCDLPY--------------------G-T-----TD-CS-WDK-ILP--------FKELW-EQY----------------NRMIVENG--AVILTA---SQPFTTALINSNPKNFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKRP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SSKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLIRSYSNVGDTILDNCMGSGTTGVACVKTDRNFIGVEKEEEYF-DLANRRIEIAKKVRRLKALPSIFSEKEKTDE-------------------------------------------------------------------------------------------------
      LSS_RS12690_Leptospira_santarosai_490596716                                             NQKIEPSI-QLFNDDCFNIFPQ-IPDK-SINLILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EQY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-STKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLISSYSNVGDRILDNCMGSGTTGVACVQTDRNFIGIEKEVEYF-ELAERRIEIAKKVRRLKILPSIFSEKENKDE-------------------------------------------------------------------------------------------------
      LEP1GSC133_RS04220_Leptospira_borgpetersenii_763222313                                  MERAETSI-QLFNDDCFNIFPK-IPDK-SINLVLCDLPY--------------------G-T-----TD-CS-WDK-ILP--------FKELW-EQY----------------NRMIVENG--AVILTA---SQPFTTALINSNPKNFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKRP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SSKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLIRSYSNVGDTILDNCMGSGTTGVACVKTDRNFIGVEKEEEYF-DLANRRIEIAKKVRRLKALPSIFSEKEKTDE-------------------------------------------------------------------------------------------------
      BF33_RS19405_Bacillus_cereus_756411756                                                  --LN-----EIHNMDCLEGMKL-LQSK-SIDMILCDLPY--------------------GVT-----AR-NK-WDV-IIP--------FDKLW-EQY----------------ERIIKDNG--AIVLTA---TQPFASKLIMSNPDLFRY-D----WIWEK-TLATGHLNAKKMPMRAHES-ILVF----------YKKLP-----TYNPM--K--TKGHA-PVNSY--T-------KHQDD--G---TNY---G-KTKVGI--SG--GG-S-TE---RY------PRSVQRF-S--A--DK-QK---------EA-IHPTQKPVALFEYLIKTYTNEGETILDNCMGSGTTAVAAINTNRNFVGFEKDLEIH-AAANQRINNL---QQVLQ------------V--------I----------------------------------------------------------------------------------------
      BCEP1808_RS36475_Burkholderia_vietnamiensis_500205672                                   KNNP-----VLMQGDCLELLET-IPDN-SIDMVCCDMPY--------------------G-T-----TN-CR-WDA-TLD--------LRRLW-AQY----------------RRVTTENA--AIVLFA---QTPFDKVLGVSNLEWLRY-E----LIWQK-THATGHLNAKKMPMKAHEN-ILVF----------YNKLP-----TYNPQ--K--TTG-H-IRKTS--V-------KRRDN--T---SVY---G-EQNFVEL----SYE-S-TD---RH------PRSVLTF-P--K--DT-QR---------IA-LHPTQKPLALIEWLVSTFTNEGDAVLDNCMGSGTTGEACQRLGRRFVGMELDESHF-AVASSRILSGGVPALRNAA-------------------------------------------------------------------------------------------------------------
      BACPLE_RS11710_Bacteroides_plebeius_494836062                                           AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---G-KVPNPTF-RN--EN-R-GT---RY------PRSVIYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL--------------------------------------------------------------------------------------------------------
      HMPREF1007_RS17585_Bacteroides_sp_4_1_36_495941257                                      AKDI-----TLYKADCLEVMPL-LPES-SIDLVLCDPPF--------------------G-T-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTALFG---SEPFSSLLRYSNLDEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------CKGKT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---N-NVPNPAF-RN--EN-R-GI---RY------PRSVKYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIKTYTKEGDTVLDFASGSMSTAIACIYTNRKCICIEKDEKYF-SQGEGRVR-----NEYQHTAGLRFLNKVICK-------------------------------------------------------------------------------------------------
      ADC57_RS01530_Bacteroides_490416978                                                     AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---N-NVPNPTF-RN--EN-K-GT---RY------PRSVIYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL--------------------------------------------------------------------------------------------------------
      M099_RS00805_Bacteroides_vulgatus_696379522                                             AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---G-KVPNPTF-RN--EN-R-GT---RY------PRSVIYF-K--A--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL--------------------------------------------------------------------------------------------------------
      AMCSP14_RS14090_Streptococcus_pneumoniae_446323973                                      MEID-----KIIKKDVLEFMET-IPDN-KIDLIVTDPPYLINYKT--------------N---WRK-EK-HK-FSN-VIKNDNNPEL-IKEYI-KEC----------------YRILKDDT--AIYIFC---SFDKVDFFKKEIEKYFSV-K-NI-IIWRK-NNHT-AGDLEAQFGKQYEM-IILA----------NKGRK--------------------------------------KFN--------------GERLTDV-----WD---FK---RV--------------S--S-----DK------L-----LHQNQKPIELIKRCIVKHSDVGDTVFDGFMGSGTTALAALETDRHFIGTEIDEYYF-GIAEERIK-----NHNAQ---LSLFDEV----------------------------------------------------------------------------------------------------
      EL26_10775_Tumebacillus_flagellatus_660643078                                           MTNI-----TITNEDCMCLLRR-TESE-SVDLVLTDPPY--------------------G-I-EF--RA-TR-GSK-VAT--------AKGIL-NDH-KDNIGFLESVAVEL-YRVLKPNS--HLYWFT-R-WDKVEEQLPMLRRCGFRP-K-NA-MIWIK-GGHGMSDTLG-AYAPEYEV-VLFC----------HKGRR--------------------------------------LLN------------E-------------VD---GR---KR------HTDVLRF----S--KI-AP--G------SL-VHSHQKPTALLEFLIQKSSNAGDLVLDPFLGSGSTALAARNTGRSFVGCELSEEIF-QIAQQQLAA-----------------------------------------------------------------------------------------------------------------------
      OBV_RS14990_Oscillibacter_valericigenes_503885074                                       TDGL-----HLMDG--IKGLLS-LPQH-SVDMLLTDPPY--------------------G-T-----TR-NF-WDV-PLP--------LPKLW-EAV----------------RWAVKPEG--AVLLFA---QCPYDKVLGASNLPMLRY-E----WIWYK-ERGTGFLNANRAPLKKSEN-ILVF---------YQKSP------VYHPQ--F--TYG-EPYRKTF--P-------RSG-T--S---SNY---G-KFERTAS----VSN-D-GR---RY------PGNVLFI-P--T-----VT---------GG-VHPTQKPVELCEYLIKTYTDEGAVVADICAGSGTTAVAALNTGRHFVCFEIAPAFY-SSATGRLEQAR-LAVERGEKGV----------------------------------------------------------------------------------------------------------
      N007_RS30730_Alicyclobacillus_acidoterrestris_665851256                                 MELN-----RIYQMDVLDGVKL-VADN-SVDLVVTDPPYLMNYRS--------------N---RRV-VR-NK-FDY-IHNDQSSYDL-IATFI-DEC----------------YRVMKDNT--AIYMFC---SWHHIDYFKQQFERKFKL-K-NL-IVWNK-NNHG-SGDLKGAYAPKHEL-ILFG----------HKG--------------------------------------RSLLQ--------------HKRIPDV-----ID---CD---KI--------------P--S-----AK------L-----THPTEKPVELLTIFILNSSQPGDVVLDGFIGTGATAVACVNTGRNFIGFETEPQYI-EIANKRLE-----GLL----------------------------------------------------------------------------------------------------------------
      N007_05790_Alicyclobacillus_acidoterrestris_ATCC_49025_529047177                        MELN-----RIYQMDVLDGVKL-VADN-SVDLVVTDPPYLMNYRS--------------N---RRV-VR-NK-FDY-IHNDQSSYDL-IATFI-DEC----------------YRVMKDNT--AIYMFC---SWHHIDYFKQQFERKFKL-K-NL-IVWNK-NNHG-SGDLKGAYAPKHEL-ILFG----------HKG--------------------------------------RSLLQ--------------HKRIPDV-----ID---CD---KI--------------P--S-----AK------L-----THPTEKPVELLTIFILNSSQPGDVVLDGFIGTGATAVACVNTGRNFIGFETEPQYI-EIANKRLE-----GLL----------------------------------------------------------------------------------------------------------------
      CDQ29993.1_Streptococcus_pneumoniae_698840876                                           MEID-----KIIKKDVLEVMAT-IPDN-KIDLIVTDPPYLINYKT--------------N---WRK-EK-HK-FSN-VIKNDNNPEL-IKEYI-KEC----------------YRILKDDT--AIYIFC---SFDKVDFFKKEIEKYFSV-K-NI-IIWRK-NNHT-AGDLEAQFGKQYEM-IILA----------NKG--------------------------------------RKKFN--------------GERLTDV-----WD---FK---RV--------------S--S-----DK------L-----LHQNQKPIELIKRCIVKHSDVGDIVFDGFMGSGTTALAALETDRHFIGAEIDGYYF-GIAEERIK-----NHNAQ---LSLFDEV----------------------------------------------------------------------------------------------------
      SEP9_059_Staphylococcus_phage_vB_SepS_SEP9_589891490                                    MELN-----KIYNEDCVAGMKN-MESG-SVDLVVTDPPYLVNYKT--------------G---RRK-DKTHR-FNK-VILNDDNEQL-IINYI-NEC----------------YRILKNNS--AMYMFC---SSDKVDFFKQQLEKKFKI-K-NM-IIWVK-NNHT-AGDLKGSFGRKYEI-IFLV----------VKG--------------------------------------KKHFN--------------GKRLTDI-----WG---FD---KV--------------S--G-----KN------Q-----LHQNQKPLDLIKQCIEKHSDKGDLVFDGFAGSGTTAIACKELERNFIGFELDKGYF-DIAIKRLE-----DYKGE--------------------------------------------------------------------------------------------------------------
      G500_RS21745_Flexibacter_roseolus_737788178                                             KNVE-IKN-QLFLGDCLEILKA-IPSN-SIDCLITDPPYNISGYDHKKQI---------G---WLK-SN-DF-WKK-QKA----FKK-IDENW-DKFSDDDYESFTIEWLSEIKRIVKPNG--NIAIFGSY-HNIYKIGYLIEKLDLKTI-N-S--IIWYK-RNAFPNVTQ-RMFCESTEQ-IIWC-------VNESKKNA-KNW-TFNYK--I-----------------------MKELN------------G-GVQMRNL-----FD-----------V----PLTK----Q--S--ER-EF--G---------KHPSQKPLEVLNNLMLALTNEGDVVLDCFLGSGTTAVSALQHKRNFVGIEQNYDYL-QIAQRRLENIESVIFNKTEI------------------------------------------------------------------------------------------------------------
      HPS42_RS08645_[Haemophilus]_parasuis_737515587                                          -----MNI-NLMQGDCLELLRD-IPDA-AVDMILTDPSY--------------------S-V-GM--TS-NS-IKS-SFNELSMVKPFFSQLF-KEF----------------KRVLKSDG--VAYIFTDWRTISFIQPILDAELGVKNV------LVWDK-AGRMSSSYG-----FYYEL-ILFA--------GNNKR---------------------------------------------------------KIHKKNI-----LK---AP---SF------ASNARKT----N---------G------EK-LHNAQKPIELLQELIINSSDEGDVVLDCFMGSGSTGVACLNTNRKFIGFEIDDKYF-HIAKDRIG-L-H-NRVSV--------------------------------------------------------------------------------------------------------------
      EL26_RS10195_Tumebacillus_flagellatus_740246844                                         ----------------MCLLRR-TESE-SVDLVLTDPPY--------------------G-I-EF--RA-TR-GSK-VAT--------AKGIL-NDH-KDNIGFLESVAVEL-YRVLKPNS--HLYWFT-R-WDKVEEQLPMLRRCGFRP-K-NA-MIWIK-GGHGMSDTLG-AYAPEYEV-VLFC----------HKGRR--------------------------------------LLN------------E-------------VD---GR---KR------HTDVLRF----S--KI-AP--G------SL-VHSHQKPTALLEFLIQKSSNAGDLVLDPFLGSGSTALAARNTGRSFVGCELSEEIF-QIAQQQLAA-----------------------------------------------------------------------------------------------------------------------
      LEP1GSC194_RS08595_Leptospira_alstonii_523636963                                        -----LNT-KLFYDDCFNVLPK-IPDK-SVDLILSDLPY--------------------G-T-----TD-CF-WDK-ILP--------LDLLW-KEY----------------ERIIKDNG--AIILTS---CQPLTTRLICSNQKLFRY-E----LVWYK-SKPSGFLNAKKMPNKSHEN-ILIF----------YKRLP-----TYNPQ--K--FRI-D-PKFQK--K-G---K-SSKAG--I-NVFKV---S-GPKSENY-QY--LD-E-GL---RY------PDSVLCF-P--S--EF-AK--G---------MHPTQKPVSLMKFLVQSYSNVGDLVLDNCMGAGTTGVACVESDRNFIGIEKEKIYF-DLAKTRISNAKKLKSQNL-FVS----------------------------------------------------------------------------------------------------------
      LEP1GSC172_RS05255_Leptospira_interrogans_488105867                                     -----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNLQ--K--YCI-D-PKFQV--K-G---K-SSLQT--T-KFINI---S-GKKTLSY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAKKENEGYL-FSDIALSS-----------------------------------------------------------------------------------------------------
      LEP2GSC076_RS0118050_Leptospira_interrogans_446276826                                   -----MDI-RLYNRDCFKVLPK-IGDK-SVHLIFSDLPY--------------------G-K-----TV-CK-WDQ-ILS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPRLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-PKFHV--K-G---K-HSLQT--A-NFINI---K-GTKPLNY-QY--LE-D-GT---RY------PDSVLCF-P--S--ES-SK--G---------MHPTQKPVSLLNFLILSYTNKFDTVLDHCMGSGTTGVSCVKTERRFIGIEKDKGYF-KLAKSRISKAQKEKVETL-FSDLALSS-----------------------------------------------------------------------------------------------------
      IQ65_RS20335_Leptospira_interrogans_516471781                                           -----MDI-RLYNRDCFKVLPK-IGDK-SVHLIFSDLPY--------------------G-K-----TV-CK-WDQ-ILS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPRLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-PKFHV--K-G---K-HSLQT--A-NFINI---K-GTKLLNY-QY--LE-D-GT---RY------PDSVLCF-P--S--ES-SK--G---------MHPTQKPVSLLNFLILSYTNKFDTVLDHCMGSGTTGVSCVKTERRFIGIEKDKGYF-KLAKSRISKAQKEKVETL-FSDLALSS-----------------------------------------------------------------------------------------------------
      LEP1GSC041_RS17345_Leptospira_noguchii_490560754                                        -----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNPQ--K--YCI-D-PKFQV--K-G---K-RSLQT--I-KFINI---S-GKKTLNY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAKKEKEENL-FSDIALSS-----------------------------------------------------------------------------------------------------
      LEP1GSC084_RS211440_Leptospira_interrogans_446276825                                    -----MDI-RLYNRDCFKVLPK-IGDK-SVHLIFSDLPY--------------------G-K-----TV-CK-WDQ-ILS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPRLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-PKFHV--K-G---K-HSLQT--A-NFINI---K-GTKLLNY-QY--LE-D-GT---RY------PDSVLCF-P--S--ES-SK--G---------MHPTQKPVSLLNFLILSYTNKFDTVLDHCMGSGTTGVSCVKTERKFIGIEKDKGYF-KLAKSRISKAQKEKVETL-FSDLALSS-----------------------------------------------------------------------------------------------------
      LEP2GSC066_RS0113465_Leptospira_santarosai_490627908                                    -----MDI-RLYNEDCFKVLPT-IKDK-SVHLIFSDLPY--------------------G-T-----ID-CV-WDR-VLP--------FDNLW-KEY----------------NRILIDNG--VILFTG---SQPFTTKIILSNPKHFRY-E----LIWYK-SKASGFLNAKLMPNKSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-LKFRA--K-G---K-FNKQT--S-KFINI---T-GPKNLNY-QY--ID-E-GL---RY------PDSVLCF-P--S--ES-QK--G---------MHPTQKPVSLLNFLILSYTNEFNTVLDHCMGSGTTGVSCVNTNRRFIGIEKDKGYF-DLAQSRISKAKNSISLDL-FSKVNLNS-----------------------------------------------------------------------------------------------------
      LEP1GSC186_RS16950_Leptospira_noguchii_490575676                                        -----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNPQ--K--YCI-D-PKFQV--K-G---K-RSLQT--I-KFINI---S-GKKTLNY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAMKEKEENL-FSDIALSS-----------------------------------------------------------------------------------------------------
      _Ruminococcus_sp_SR1/5_505338314                                                        MSEA-----TLLQGDCLELMNR-IPDS-SIDMVLSDLPY--------------------G-T-----TR-CR-WDA-PIN--------LQELW-EQY----------------RRVVKENG--AIALFS---AQPFTTELISSNKAMYRY-E----WIWRK-TQPSGFMNAKKMPLRTHEN-IEIF----------YRKPP-----TYNPQ--M--THG-H-QRKTA--TAY---GTRESDG--S---SCY---G-REERNYT-----YD-S-TD---RY------PVDVLQY-S-----TG-DK---------SKRLHPTQKPVDLLEYLVKTYTNPGETVLDNCMGAGSTGVACLNTGREFVGIELDPEYY-QIAKERIE-----QHVEN------------I--------F----------------------------------------------------------------------------------------
      LEP1GSC059_0080_Leptospira_phage_vB_LnoZ_CZ214-LE1_529283433                            -----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIENG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNPQ--K--YCI-D-PKFQV--K-G---K-RSLQT--I-KFINI---S-GKKTLNY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAKKEKEENL-FSDIALSS-----------------------------------------------------------------------------------------------------
      M123_RS17125_Bacteroides_fragilis_492201466                                             AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRI-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---N-NVPNPTF-RN--EN-K-GT---RY------PRSVIYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL--------------------------------------------------------------------------------------------------------
      HMPREF1033_RS05085_Tannerella_sp_6_1_58FAA_CT1_748669634                                IEQD-----KIYNMDCLEGMKG-IADR-SIDAVIADLPY--------------------G-VLNRQNGA-AR-WDN-RIP--------LKPLW-EQY----------------LRITKPDS--PIILFA---QGMFSAELVLSQPKLWRY-N----LVWHK-DRVSGHLNANRMPLRQHED-ILVF----------YRKLP-----VYHPQ--M--IPC---PPEKR--N-H---G-RRKTE--GFTNRCY---G-GMKLAPV-RI--AD-D------KY------PTSVISV-P--K--EH-RK--G--TF-----YHPTQKPVALIEYLIRTYTDEGDTVLDNCIGSGTTAVAALRTGRHYIGFETDSGYC-GIAERRIREEIDQRERKDNEKNQ---------------------------------------------------------------------------------------------------------
      M099_RS00920_Bacteroides_494836074                                                      IEAD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------FAALW-EQY----------------QRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PPERR--Y-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKAQEEKEIREEKQNQ--------------------------------------------------------------------------------------------------
      HMPREF1070_RS16045_Bacteroides_ovatus_490454463                                         IETD-----KIYHMDCIEGMRL-MADG-SVDAVIADLPY--------------------G-MLNHKNKA-AR-WDR-QIP--------LEPLW-EQY----------------LRVTKPES--PIILFA---QGMFTAELLLSQPRIWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-IIVF----------YKRQP-----VYHPQ--M--TPC---LPERR--N-H---G-RRKTE--GFTNRCY---G-AMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KR--G--AF-----YHPTQKPVALMEYLIRTYTDKGAVVLDNCIGSGTTAVAAIRTGRHYIGFETEKVYC-EIAERRIREEIERRNEAK--------------------------------------------------------------------------------------------------------------
      EE52_RS20200_Bacteroides_494843167                                                      IETD-----RIYLMDCMEGMKQ-IADS-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------FAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQVYC-EIAERRIQEELECRNKAQEEKEIREEKQNQ--------------------------------------------------------------------------------------------------
      BACCOPRO_RS07720_Bacteroides_coprophilus_495417764                                      IKKD-----QIYHMDCLKGMKQ-MADR-SVDAIIADLPY--------------------G-VLNNRNTS-AG-WDK-QLP--------LEKLW-EEY----------------LRISKPES--PVILFG---QGMFTARLVLSQPKIWRY-N----LVWHK-DRVTGHLNANRMPLRQHED-IIVF----------YRKQP-----VYHPQ--M--KPC---PAEQR--N-H---G-RSKTR--GFTNRCY---G-QMNLTPI-RI--AD-D------KY------PTSVIAI-A--K--EH-CK--G--CF-----YHPTQKPVALLEYLIRTYTNEGDTVLDSCIGSGTTMVAAIRTGRHFIGFETEQSYF-ETALLRIAEETE-QNHQTTEINIQ--------------------------------------------------------------------------------------------------------
      M082_RS10705_Bacteroides_490422204                                                      IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-AN-WDR-QIP--------LTALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PSERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EITERRIQEELECRNKAQEEKEIREEKQINKSMTWTKKES----------------------------------------------------------------------------------------
      ADC57_RS01565_Bacteroides_494743555                                                     IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKVQEEKEIREEKQNQ--------------------------------------------------------------------------------------------------
      BSFG_RS19480_Bacteroides_490416986                                                      IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPFRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKVQEEKEIREEKQNQ--------------------------------------------------------------------------------------------------
      BFAG_00704_Bacteroides_fragilis_3_1_12_313134650                                        -------------MDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERQIQEELECRNKVQEEKEIREEKQNQ--------------------------------------------------------------------------------------------------
      HMPREF1062_RS27355_Bacteroides_cellulosilyticus_494418810                               IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-AN-WDR-QIP--------LTALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PSERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKVQEEKEIREEKQNQ--------------------------------------------------------------------------------------------------
      IQ65_RS13625_Leptospira_interrogans_446127730                                           -----ITT-DLYLDDCLDRLPK-IPDE-SIRLILADLPY--------------------G-T-----TR-CK-WDK-ALP--------LEFLW-REY----------------ERIIIDNG--AIILTA---SQPFTTALINSNPRLFRY-E----LIWYK-SKASGFLNAKKMPQKSHEN-ILIF----------YKKPP-----VYNPQ--T--YKI-N-PIYQR--K-GVKLR-KSHKP--E-SLFKL---S-NSDMNQY-RY--ID-D-GT---RL------PDSVLCF-A--S--EF-QK--G---------MHPTQKPVALMDFLIRSYSNISDTVLDNCMGSGTTGVACIRAGRNFVGIEKDKDIF-DVASRRIEIAHTIHKLNSLPSLFR--------------------------------------------------------------------------------------------------------
      LEP2GSC168_RS0121290_Leptospira_interrogans_446127731                                   -----ITT-DLYLDDCLDRLPK-IPDE-SIRLILADLPY--------------------G-T-----TR-CK-WDK-VLP--------LEFLW-REY----------------ERIIIDNG--AIILTA---SQPFTTALINSNPRLFRY-E----LIWYK-SKASGFLNAKKKPQKSHEN-ILIF----------YKKPP-----VYNPQ--T--YKI-N-PIYQR--K-GVKLR-KSHKP--E-SLFKL---S-NSDMNQY-RY--ID-D-GT---RL------PDSVLCF-A--S--EF-QK--G---------MHPTQKPVALMDFLIRSYSNISDTVLDNCMGSGTTGVACIRAGRNFVGIEKDKDIF-DVASRRIEIAHTIHKLNSLPSLFR--------------------------------------------------------------------------------------------------------
      AZ40_RS07215_Aeromonas_jandaei_752537274                                                ---M-----KLLQGDCLSLLPS-LPDN-SIDMVLADPPY--------------------G-T-----TQ-CK-WDS-VID--------LAAMW-REL----------------ERVCKPNS--AIVMTA---AQPFTAQLVCSNIGMFKY-E----IIWEK-GNATGFLNAKKQPLRAHES-VLVF----------YRQQP-----TYNPQ--M--TSG-H-ARKTS--K-------RKTVN--S---ECY---G-KALSLTE-----YD-S-TE---RY------PRSVQFF-S--S--DK-QR---------GS-YHATQKPVALMEWLIRSFSNPADVVLDFCMGSGTTGVACLNTGREFIGMEMDTEIF-KVATSRID----SLINKEAA------------------------------------------------------------------------------------------------------------
      HMPREF1019_RS05155_Campylobacter_sp_10_1_50_496651971                                   ---------IILQGNSLEIIKG-IPTN-SIDLIFADPPYWMRVDG--V-LKRPEGKEFDG-------CN-DE-WDNTFLN-NDDYVD-FTRKWLNEC----------------KRVLKQNG--SIWVIGGM-QCIYTIGGIMQELGFWFI-N-D--VIWQK-SNPTPNFMGTRL-NNSHET-LIWA----------TKSKKSK--FTFNYK------------------T-------AKELNTENIDINLFEKGE-RRQLGSV-------------W-RF------SVCSGNE-R--L--KD-EN--G------NK-LHSTQKPESLLYRVIAISSKIGDIVLDPFGGTMTTAAMAKKLGRNYISIEQNDKYI-KFGKKRVN-D-IVFEDS--DIAHAK-FDKKPLKVNLDQMIDANFLNLGERFYLKNSDEFAILKRGSRLEYNNI-LYDIHSLAAKLKS-AKSERL-NGFKFWHVMRDNKKILLDDIRSHFREINA----
      DV59_RS09130_Helicobacter_pylori_446268888                                              ---------TIIEGDCLEKLKD-FPNK-SVDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FEEYDT-FCLVWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LIWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICMGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSTTKPKDIVLDPFFGTGTTGAVAKSMNRYFIGIEKDSFYI-KEAAKRLN-N--TRDKS--DFITNLELETKPPKIPMSLLISKQLLKIGDFLYSPNKEKICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFYAYYQNQFLLLDELRYICQRDS-----
      QT55_RS02770_Helicobacter_pylori_727328309                                              ---------TIIEGDCLEKLKD-FPDK-SIDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FKEYDT-FCLGWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LIWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICIGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSATKPKDIVLDPFFGTGTTGAVAKSMNRHFIGIEKDSFYI-KEATKRLN-N--TMDKS--DFITNLNLETKPPKIPMSLLISKQLLKIGDFLYSPNKERICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFHAYYQNQFLLLDELRYICQKEF-----
      GZ76_RS01670_Helicobacter_pylori_726979196                                              ---------AIIEGDCLEKLKD-FPNK-SVDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FKEYDT-FCLGWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LIWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICMGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSATKPKDIVLDPFFGTGTTGAVAKSMNRHFIGIEKDSFYI-KEAAKRLN-N--TRDKS--DFITNLELETKPPKIPMSLLISKQLLKIGDFLYSSNKEKICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFYAYYQNQFLLLDELRYICQRDS-----
      NPL7_RS01720_Mycoplasma_hyosynoviae_738495747                                           ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKIYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN-----
      NPL1_02345_Mycoplasma_hyosynoviae_635203155                                             ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKTYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN-----
      NPL7_01825_Mycoplasma_hyosynoviae_635202114                                             ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKIYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN-----
      JF44_RS0106430_Thalassospira_australica_696534926                                       ---------KVLVGDCIELMNS-LPEK-SVDLIFADPPYNLQLGG--D-LLRPNNSKVDA-------VD-DH-WDQ-FDS-FRHYDD-FTRDWLTAA----------------RRVLKDTG--AIWVIGSY-HNIYRVGNTLQDIGFWIL-N-D--IVWRK-TNPMPNFRGKRF-CNAHET-LLWC----------SKSEEQK-AITFNYE------------------A-------MKQLN------------E-GLQMRSD-------------W-LM------PICSGSE-R--L--KD-DK--G------KK-VHPTQKPEALLQRVLMATTRQGDVVLDPFFGTGTTGAAARRLGRHFIGLEREEGYA-EAARDRIA-K-VQMLDG--DSLELTESKRSLPRIPFGAVIERGLLAPGDKIYDNRGNVAAMVRADGSISHKDN-AGSIHQVGAHVQG-A--QAC-NGWTYWHYKCDGRLVSIDNLRSQLRKELGQVPA
      QU01_RS03135_Helicobacter_pylori_727305172                                              ---------TIIEGDCLEKLKD-FPDK-SIDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FKEYDT-FCLGWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LLWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICIGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSATKPKDIVLDPFFGTGTTGAVAKSMNRHFIGIEKDSFYI-KEAAKRLN-N--TMDKS--DFITNLNLETKPPKIPMSLLISKQLLKIGDFLYSPNKERICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFHAYYQNQFLLLDELRYICQKEF-----
      _Candidatus_Hepatoplasma_crinochetorum_740676991                                        ---------KIKLGNCLEELKK-IPSK-SIDLIFADPPYFMQTGT--GTLYRINGNKYNG-------VD-DE-WDK-FDS-YKEYDK-FTRKWLTQC----------------RRILKDKG--SIWVMGTF-HNIYRLGYIIQDLNFWII-N-D--ITWEK-TNPTPNFRGTKF-VNSNEN-LIWF----------TKSQNSK--FTFNYK------------------T-------MKNEN------------K-KKQMGSV-------------W-KF------SICSGKE-R--L--KD-NN--G------NK-LHNTQKPEDLLRRIILASTKINDVILDPFFGTGTTGAVAKKLHRNFIGIENNEKYINYANDRIKNVDKISKNDPFFDYIKAK-FDEKIKYPKILDLIRENKIKSKYLYNL--KEDKVYLNSEGKIKINNV-KYSIHK-ATEIFE-N--GRYLNGWKYWYIKEGNNYISIDDIRKDVK--------
      MALK_RS00800_Mycoplasma_alkalescens_488970128                                           ---------KILYGDCIENLKK-IPDE-TFDFCFADPPYFMQIERGKK-LFRVDGTEFNG-------CD-DE-WDK-FES-ITAYKK-WTKQWLTEV----------------HRVLKKDG--SICVIAGM-QSIFEIGSILREIGYWVI-N-D--IIWHK-SNPTPNFGGTRL-NNSHET-LIWA----------TKTKKSK--FTFNYK------------------T-------GKFLN------------G-GKQMGSI-------------W-KF------SVCSGNE-R--L--KD-YN--G------KK-VHNTQKPEALLYRIITLFTKKDDLILDPFGGTMTTAYVAKKTGRNYTMIERDPNYI-KHGQKRID-S-AIPSIG--DVENAI-FDLKPPKVQFSKMVEANYFNIGEPFYTKNKEKALLNSKNGHLKYNAE-INSMHEIAGKMIG-LD-RRV-NAFNYLYVIRDDELISINQIRNKYRAKLKEDI-
      NPL4_RS01685_Mycoplasma_hyosynoviae_738491218                                           ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKTYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN-----
      K355_RS0107980_Thalassospira_lucentensis_550983501                                      ---------QVLVGDCIEMMNS-LPEK-SVDLIFADPPYNLQLGG--D-LLRPNNSKVDA-------VD-DH-WDQ-FDS-FRHYDE-FTRDWLTAA----------------RRVLKDTG--AIWVIGSY-HNIYRVGNTLQDIGYWIL-N-D--IVWRK-TNPMPNFRGKRF-CNAHET-LLWC----------SKSEEQK-AITFNYE------------------A-------MKQLN------------E-GLQMRSD-------------W-LM------PICSGSE-R--L--KD-DN--G------KK-VHPTQKPEALLQRVLLATTRQGDVVLDPFFGTGTTGAAARRLGRHFIGLEREETYA-EAARERIA-K-VQMLDG--DSLEVTESKRSLPRIPFGAVIERGLLSPGEKIYDNRGNVAAMVRADGSISHKDN-AGSIHQIGARVQG-A--EAC-NGWTYWHYKCDGRLVSIDNLRSQLRKEMGQVPA
      consensus/100%                                                                          ...................h.........hphhhsD.sa....................s..............p.................h......................h....s....hh.s..................b..........a.K..................E..h.h.....................................................................................................................................p...KP..hh..hh...s.....lhD...G..sT...s....p.....E..........................<---RAMA domain---------------------------------------------------------------------------------------------->
      consensus/95%                                                                           ..............ssh..h...h.p..SlchlhsD.PY....................u..........s..ac....s........h...h...h.................Rl...ps..shhh.s..........h...p...hbh.p....hlW.K.s........p.......E..l.hh............................................................................................p.........p.............................HssQKP..Lhpbhl...op..p.lLD.h.Gp.oTuhss....RphlshE.p...h...u..bl.........................................................................................................................
      consensus/90%                                                                           ..........l...Dshp.h...l.s..SlchlhsD.PY....................G.......s..s..WD....s........h..hW..ph.................Rl...ps..slhh.u......ap..h..pp...hbh.p....hlW.K.s..ss...up....p.aE..lhhh...........c........sa..b...................................................................p.......s.s...b........................hHssQKP..Lhpbll.s.op..-.lLD.h.GsuoTuhus....RpaIuhE.p..ah...u.p+l.........................................................................................................................
      consensus/85%                                                                           .........plb..DChp.h...lss..SlchlhsD.PY....................G.......s..sp.WD...hs........h..hW..ph.................Rl.b.pu..slhl.u...p..ap..h..pp...abh.p....hlW.K.spsss.h.up.b..p.aEs.llhh...........+........sap.b...............................................p...................ph......P.s...b........................hHsTQKP..LhpblIbshop.s-.VLD.hhGoGoTulAs.p.sRpaIGhEb-..ah.p.upp+l.........................................................................................................................
      consensus/80%                                                                           .........plh..DChc.h...ls-..SlDhlhsD.PY....................G.......sp.sp.WD...ls........hp.hW..ph................pRl.b.su..slhl.u...p..Fs..l..pp.p.a+h.p....hlW.K.spsss.hsuppb..p.HEs.lllh...........+pbs.....sap.b..............................p................p...................+h......P.slb.h...........p............hHsTQKPl.LhpaLIboaop.sD.VLDshhGoGTTulAs.p.sRpaIGhEb-..ah.p.AppRlp........................................................................................................................
      consensus/75%                                                                           .........plh..DChc.h.p.lsDp.SlDhlhsD.PY....................G.......sp.sp.WDp.hls........hp.LW.pph................pRl.Kpsu..slll.u...pp.Fs..l..up.p.a+Y.p....hlWbK.spsosahsuppbP.c.HEs.IllF..........hKpbs.....sYpPb..............................p................p..sh......p........+a......P.slb.h....p.....pp............hHPTQKPlsLhpaLIboaos.uD.VLDshhGoGTTulAs.p.sRpaIGhEb-p.Yh.pbAppRlp........................................................................................................................
      consensus/70%                                                                           .........plh..DChc.h.p.lsDp.SlDhIlsDLPY....................G.......sp.sp.WDp.hls........hp.LW.ppY................pRlhKcsu..slll.u...pp.Fss.lh.Ss.cba+Y.p....hlWbK.spsosahsAcpbP.+.HEs.ILlF..........aKpbs.....sYpPb..b...............p.......pp..p........h.......p..sh......s.p......+a......Ppolbph.s..p..p..pp............hHPTQKPlsLhpaLIboYos.GDsVLDsChGoGTTulAs.p.sRpaIGhEb-p.Yh.pbApcRlp......p.................................................................................................................
      
      Back to Contents
    • General notes, Phyletic distribution and gene neighborhoods of the Group I, Clade 3 adenine methylases (Naegleria NAEGRDRAFT_76461-like)

      General notes

      The NAEGRDRAFT_76461 N6-MTase in Naegleria is present in a single copy and has been confirmed to be part of the Naegleria genome (i.e. it isnt a contamination). The protein is a large one with the DAM at the C-terminus. Prokaryotic homologs of this N6-MTase can be distinguished by their unique Str-4 signature with a DLPY motif. Addtionally, members of the family share a K between strand-1 and helix-1, D at the beginning of strand-2 (and the universal D** at the end of strand-2), R and E** flanking str and-3, D in the helix before strand-4, D and R** in helix between strands 4 and 5, K at the end of strand-6 and E** and K** flanking strand-7. Neighborhoods suggest that they are part of Type IV secretion systems or present in phages. There is at least one version which is an R-M system. Phylogenetic analysis groups the Naeglerial DAM with DAMs found in Bacteroidetes species that are part of Type IV secretion systems.
      # 80; Either phage or transposon associated                                                                                                                                                                                                                                                                                               
      GI           Gene neighborhoods                                                                                                   Arch        Pfam arch                      Gene name         Len  Taxonomy                                       Species                                                  Genbank
      464395818    <-ASCH<-?<-N6-MTase*<-?<-?<-Gam-nuclease-inh<-Gam-nuclease-inh<-?<-?<-AAA                                            N6-MTase    N6_N4_Mtase                    LEP1GSC133_0802   307  bacteria>spirochaetes                          Leptospira borgpetersenii serovar Pomona str. 200901868  DNA methylase family protein [Leptospira borgpetersenii serovar Pomona str. 200901868].                           464395810_?-><-464395750_?<-464395824_?<-464395740_?<-464395671_?<-464395689_ASCH<-464395717_?<-464395818_N6-MTase*<-464395803_?<-464395746_?<-464395728_Gam-nuclease-inh<-464395805_Gam-nuclease-inh<-464395713_?<-464395797_?<-464395704_AAA
      410804029    Gam-nuclease-inh->?->?->MuF->N6-MTase->multi-TM->MazG->N6-MTase*->                                                   N6-MTase    SP+N6_N4_Mtase                 LEP1GSC071_3962   288  bacteria>spirochaetes                          Leptospira santarosai str. JET                           putative uncharacterized adenine-specific methylase YhdJ [Leptospira santarosai str. JET].                        410804041_Gam-nuclease-inh->410804133_?->410804034_?->410804117_MuF->410804038_N6-MTase->410804175_multi-TM->410804054_MazG->410804029_N6-MTase*->410804125_?->410804108_?->410804019_?->410804198_?->410804067_?->410804142_?-><-410804048_?
      446543012    Gam-nuclease-inh->?->?->MuF->N6-MTase->?->N6-MTase*->                                                                N6-MTase    SP+N6_N4_Mtase                 -                 285  bacteria>spirochaetes                          Leptospira interrogans                                   hypothetical protein [Leptospira interrogans].                                                                    446148057_?->523650350_Gam-nuclease-inh->446043447_?->447063297_?->523650335_MuF->446545314_N6-MTase->446614561_?->446543012_N6-MTase*->446271796_?->446265643_?->446525580_?->446495114_?->447002706_?->516465345_?-><-523650344_?
      696229311    <-N6-MTase*<-?<-?<-?<-N6-MTase<-MuF                                                                                  N6-MTase    SP+N6_N4_Mtase                 -                 285  bacteria>spirochaetes                          Leptospira                                               MULTISPECIES: DNA methylase [Leptospira].                                                                         495673486_?->696229310_?-><-495673641_?<-495673463_?<-490624138_?<-490624181_?<-495673482_?<-696229311_N6-MTase*<-696229312_?<-696229313_?<-495673450_?<-495673454_N6-MTase<-495673619_MuF<-495673617_?<-495673592_?
      490906211    MuF->?->N6-MTase->?->?->N6-MTase*->                                                                                  N6-MTase    SP+N6_N4_Mtase                 -                 284  bacteria>spirochaetes                          Leptospira kirschneri                                    putative adenine-specific methylase YhdJ [Leptospira kirschneri].                                                 490906210_?->490906225_?->490906216_MuF->642970670_?->490906226_N6-MTase->490906212_?->490906229_?->490906211_N6-MTase*->642970671_?->
      495865040    MuF->N6-MTase->?->?->?->N6-MTase*->                                                                                  N6-MTase    SP+N6_N4_Mtase                 -                 284  bacteria>spirochaetes                          Leptospira licerasiae                                    DNA methylase [Leptospira licerasiae].                                                                            495866424_?->495865716_?->495867158_MuF->495866141_N6-MTase->495867192_?->495871841_?->495866482_?->495865040_N6-MTase*->495864971_?->495871835_?->495866397_?->498200535_?-><-495865257_?<-495867171_?||495865794_?->
      490422204    <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM                                                       N6-MTase    N6_N4_Mtase                    -                 280  bacteria>bacteroidetes                         Bacteroides                                              MULTISPECIES: DNA methylase [Bacteroides].                                                                        490422198_?-><-492267499_?<-490422199_VirD4-like<-490422200_?<-490422201_?<-490422202_?<-490422203_TraK<-490422204_N6-MTase*<-490422205_?<-490422206_TraM<-490422207_?<-490422209_?<-494420144_MultiTM<-490422214_?<-490422215_?
      696349061    Gam-nuclease-inh->?->?->MuF->N6-MTase->multi-TM->MazG->N6-MTase*->                                                   N6-MTase    SP+N6_N4_Mtase                 -                 278  bacteria>spirochaetes                          Leptospira santarosai                                    DNA methylase [Leptospira santarosai].                                                                            696349045_Gam-nuclease-inh->490624197_?->490624147_?->490624187_MuF->490624149_N6-MTase->490624220_multi-TM->490624160_MazG->696349061_N6-MTase*->648272076_?->490624181_?->490624138_?->490624232_?->490624165_?->490624203_?-><-696349066_?
      490593464    <-MazG<-?<-N6-MTase*<-?<-?<-?<-?<-Gam-nuclease-inh<-?<-AAA                                                           N6-MTase    N6_N4_Mtase                    -                 277  bacteria>spirochaetes                          Leptospira santarosai                                    phage DNA methylase [Leptospira santarosai].                                                                      490593448_?-><-490601417_?<-490593451_?<-490626751_?<-696346268_?<-490593460_MazG<-490593462_?<-490593464_N6-MTase*<-490593466_?<-490593468_?<-490593470_?<-696346260_?<-514360257_Gam-nuclease-inh<-490593475_?<-490593476_AAA
      490596716    <-N6-MTase*<-?<-?<-?<-?<-Gam-nuclease-inh<-?<-AAA                                                                    N6-MTase    N6_N4_Mtase                    -                 277  bacteria>spirochaetes                          Leptospira santarosai                                    phage DNA methylase [Leptospira santarosai].                                                                      490598338_?-><-696346163_?<-490596721_?<-490596720_?<-696346179_?<-490596718_?<-490596717_?<-490596716_N6-MTase*<-490596715_?<-490596714_?<-490596713_?<-490596712_?<-490596711_Gam-nuclease-inh<-490596709_?<-490596708_AAA
      490621523    DCM->?->?->?->N6-MTase*->                                                                                            N6-MTase    N6_N4_Mtase                    -                 277  bacteria>spirochaetes                          Leptospira santarosai                                    DNA methylase family protein [Leptospira santarosai].                                                             490614287_?->490621546_?->696347273_?->696347305_DCM->696347306_?->696347275_?->696347276_?->490621523_N6-MTase*->696347279_?->490621505_?->696347282_?->490621564_?->696347283_?->490621527_?->490621517_?->
      490626771    AAA->?->Gam-nuclease-inh->?->?->?->?->N6-MTase*->                                                                    N6-MTase    N6_N4_Mtase                    -                 277  bacteria>spirochaetes                          Leptospira santarosai                                    DNA methylase family protein [Leptospira santarosai].                                                             490626833_AAA->490626754_?->490626820_Gam-nuclease-inh->696348306_?->490626759_?->490593468_?->490626829_?->490626771_N6-MTase*->490626767_?->490626844_?->696348307_?->696348327_?->490626751_?-><-696348308_?<-490602563_?
      490633882    AAA->?->?->Gam-nuclease-inh->?->?->?->N6-MTase*->                                                                    N6-MTase    N6_N4_Mtase                    -                 277  bacteria>spirochaetes                          Leptospira weilii                                        DNA methylase family protein [Leptospira weilii].                                                                 490633900_AAA->490636543_?->490633845_?->490633859_Gam-nuclease-inh->490633887_?->490633842_?->738117417_?->490633882_N6-MTase*->490633893_?->490633841_?->490633904_?->490633890_?-><-738117420_?||490633762_?->738117406_?->
      696345163    Gam-nuclease-inh->?->?->MuF->N6-MTase->multi-TM->MazG->N6-MTase*->                                                   N6-MTase    SP+N6_N4_Mtase                 -                 277  bacteria>spirochaetes                          Leptospira santarosai                                    DNA methylase [Leptospira santarosai].                                                                            490613760_Gam-nuclease-inh->490613694_?->490613847_?->490613713_MuF->490613894_N6-MTase->490613846_multi-TM->490613768_MazG->696345163_N6-MTase*->696345164_?->490613748_?->490613729_?->490613809_?->490613876_?->490613815_?-><-696345092_?
      649530658    <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM                                                       N6-MTase    N6_N4_Mtase                    M089_3211         271  bacteria>bacteroidetes                         Bacteroides ovatus str. 3725 D9 iii                      DNA methylase family protein [Bacteroides ovatus str. 3725 D9 iii].                                               <-649530652_?<-649530653_VirD4-like<-649530654_?<-649530655_?<-649530656_?<-649530657_TraK<-649530658_N6-MTase*<-649530659_?<-649530660_TraM<-649530661_?<-649530662_?<-649530663_MultiTM<-649530664_?<-649530665_?
      490416986    N6-MTase->?->MultiTM->?->?->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like->                                          N6-MTase    N6_N4_Mtase                    -                 270  bacteria>bacteroidetes                         Bacteroides                                              MULTISPECIES: hypothetical protein [Bacteroides].                                                                 490416978_N6-MTase->490416979_?->495942219_MultiTM->490422209_?->490416983_?->490416984_TraM->490416985_?->490416986_N6-MTase*->490416987_TraK->490416988_?->490416989_?->490416990_?->495942226_VirD4-like-><-494418804_?<-490416993_?
      494418810    <-VirD4-like<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM<-?<-N6-MTase                                             N6-MTase    N6_N4_Mtase                    -                 270  bacteria>bacteroidetes                         Bacteroides cellulosilyticus                             hypothetical protein [Bacteroides cellulosilyticus].                                                              490416994_?->490416993_?->494418804_?-><-494418806_VirD4-like<-494418808_?<-490416988_?<-490416987_TraK<-494418810_N6-MTase*<-490422205_?<-494418815_TraM<-490422207_?<-490422209_?<-492201471_MultiTM<-492201468_?<-490416978_N6-MTase
      494743555    TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like->                                                                      N6-MTase    N6_N4_Mtase                    -                 270  bacteria>bacteroidetes                         Bacteroides                                              MULTISPECIES: hypothetical protein [Bacteroides].                                                                 <-495124505_?<-495124515_?<-495124517_?||490422209_?->490416983_?->494743553_TraM->490416985_?->494743555_N6-MTase*->490416987_TraK->490416988_?->490416989_?->490416990_?->494418806_VirD4-like-><-494418804_?<-490416993_?
      494836074    <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM                                                                      N6-MTase    N6_N4_Mtase                    -                 270  bacteria>bacteroidetes                         Bacteroides                                              MULTISPECIES: hypothetical protein [Bacteroides].                                                                 <-494836086_?<-696379496_?<-696379521_VirD4-like<-494836080_?<-494836078_?<-494836077_?<-494836075_TraK<-494836074_N6-MTase*<-494836072_?<-494836070_TraM<-490422207_?<-494836068_?||490439559_?->490439558_?->490439557_?->
      494843167    N6-MTase->?->MultiTM->?->?->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like->                                          N6-MTase    N6_N4_Mtase                    -                 270  bacteria>bacteroidetes                         Bacteroides                                              MULTISPECIES: hypothetical protein [Bacteroides].                                                                 494843158_N6-MTase->494843161_?->495930520_MultiTM->763472918_?->763472920_?->494843165_TraM->494843166_?->494843167_N6-MTase*->763472922_TraK->763472924_?->494843175_?->494843179_?->763472926_VirD4-like->695476477_?->494843186_?->
      695344547    <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM<-?<-N6-MTase                                          N6-MTase    N6_N4_Mtase                    -                 270  bacteria>bacteroidetes                         Bacteroides fragilis                                     DNA methylase [Bacteroides fragilis].                                                                             490416993_?->494418804_?-><-495942226_VirD4-like<-490416990_?<-757748891_?<-490416988_?<-492201739_TraK<-695344547_N6-MTase*<-490416985_?<-490416984_TraM<-490416983_?<-490422209_?<-757748895_MultiTM<-492201468_?<-490416978_N6-MTase
      763222313    <-ASCH<-?<-?<-N6-MTase*<-?<-?<-?<-AAA                                                                                N6-MTase    N6_N4_Mtase                    -                 270  bacteria>spirochaetes                          Leptospira borgpetersenii                                cytosine methyltransferase [Leptospira borgpetersenii].                                                           763222312_?->488839006_?-><-488839031_?<-763222280_?<-488838822_ASCH<-488838861_?<-763222284_?<-763222313_N6-MTase*<-488838988_?<-763222285_?<-763222286_?<-488838843_AAA||763222314_?->763222289_?->488838960_?->
      410015573    <-N6-MTase*<-?<-?<-?<-N6-MTase<-MuF                                                                                  N6-MTase    SP+N6_N4_Mtase                 LEP1GSC068_2949   269  bacteria>spirochaetes                          Leptospira sp. Fiocruz LV3954                            putative uncharacterized adenine-specific methylase YhdJ [Leptospira sp. Fiocruz LV3954].                         410015635_?->410015575_?-><-410015681_?<-410015622_?<-410015634_?<-410015655_?<-410015632_?<-410015573_N6-MTase*<-410015572_?<-410015584_?<-410015615_?<-410015617_N6-MTase<-410015662_MuF<-410015660_?<-410015643_?
      490637745    AAA->?->Gam-nuclease-inh->?->?->?->?->N6-MTase*->?->ASCH->                                                           N6-MTase    N6_N4_Mtase                    -                 268  bacteria>spirochaetes                          Leptospira weilii                                        DNA methylase family protein [Leptospira weilii].                                                                 490637739_AAA->490637823_?->490637758_Gam-nuclease-inh->490637820_?->490637784_?->738101833_?->490637799_?->490637745_N6-MTase*->738101836_?->490637751_ASCH->490637803_?->490637863_?->738101882_?->490637836_?->515129620_?->
      545404645    <-VirD4-like<-?<-?<-TraK<-N6-MTase*<-TraM<-?<-?<-MultiTM                                                             N6-MTase    N6_N4_Mtase                    -                 268  bacteria>bacteroidetes                         Bacteroides pyogenes                                     DNA (cytosine-5-)-methyltransferase [Bacteroides pyogenes].                                                       748714185_?->545404639_?-><-545404640_?<-545404641_VirD4-like<-748714186_?<-545404643_?<-545404644_TraK<-545404645_N6-MTase*<-545404646_TraM<-545404647_?<-545404648_?<-545404649_MultiTM<-545404650_?<-545404651_?<-748714187_?
      648272077    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 268  bacteria>spirochaetes                          Leptospira santarosai                                    DNA methylase, partial [Leptospira santarosai].                                                                   515125024_?-><-515125025_?<-515125026_?<-515125027_?<-515125028_?<-515125029_?<-648272076_?<-648272077_N6-MTase*
      489305499    <-N6-MTase*<-?<-URI<-N6-MTase                                                                                        N6-MTase    N6_N4_Mtase                    -                 265  bacteria>firmicutes                            Bacillus pumilus                                         DNA-cytosine methyltransferase [Bacillus pumilus].                                                                <-736611728_?<-736611801_?<-736611802_?<-736611803_?<-489305170_?<-736611729_?<-489305717_?<-489305499_N6-MTase*<-736611804_?<-489305926_URI<-489305329_N6-MTase<-736611730_?<-489306211_?<-736611731_?<-489305838_?
      748669634    MultiTM->?->?->TraM->N6-MTase*->TraK->?->?->?->VirD4-like->                                                          N6-MTase    N6_N4_Mtase                    -                 265  bacteria>bacteroidetes                         Tannerella sp. 6_1_58FAA_CT1                             DNA methylase [Tannerella sp. 6_1_58FAA_CT1].                                                                     748669630_?->496674958_?->748669827_?->496674960_MultiTM->496674961_?->496674962_?->496674963_TraM->748669634_N6-MTase*->748669635_TraK->748669828_?->748669830_?->748669832_?->748669636_VirD4-like->496674969_?->748669833_?->
      330989854    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    PLA107_32876      264  bacteria>proteobacteria>gammaproteobacteria    Pseudomonas amygdali pv. lachrymans str. M301315         DNA methylase N-4/N-6 domain-containing protein, partial [Pseudomonas amygdali pv. lachrymans str. M301315].      330989854_N6-MTase*->
      446127730    <-N6-MTase*<-?<-Collar                                                                                               N6-MTase    N6_N4_Mtase                    -                 264  bacteria>spirochaetes                          Leptospira interrogans                                   hypothetical protein [Leptospira interrogans].                                                                    <-446127730_N6-MTase*<-447029443_?<-446808019_Collar<-757477342_?<-446127269_?<-447014100_?<-446555036_?<-642966717_?
      446127731    Collar->?->N6-MTase*->                                                                                               N6-MTase    N6_N4_Mtase                    -                 264  bacteria>spirochaetes                          Leptospira interrogans                                   hypothetical protein [Leptospira interrogans].                                                                    447082902_?->516465892_?->516465893_?->516465894_?->446272244_?->446808018_Collar->447029444_?->446127731_N6-MTase*->658829992_?-><-446558080_?<-446799003_?<-446767232_?<-447143376_?<-516465896_?<-446325284_?
      495941257    <-MutS_I+N6-MTase<-DCM<-?<-?<-?<-?<-N6-MTase<-N6-MTase*                                                              N6-MTase    Methyltransf_26+N6_N4_Mtase    -                 264  bacteria>bacteroidetes                         Bacteroides sp. 4_1_36                                   DNA methyltransferase [Bacteroides sp. 4_1_36].                                                                   <-736517283_MutS_I+N6-MTase<-495941252_DCM<-495941253_?<-736517241_?<-495941254_?<-736517242_?<-495941256_N6-MTase<-495941257_N6-MTase*<-495941258_?<-495941260_?<-495941261_?<-495941262_?<-495941263_?<-495941264_?<-495941265_?
      495417764    MultiTM->?->?->?->N6-MTase->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like->                                          N6-MTase    N6_N4_Mtase                    -                 263  bacteria>bacteroidetes                         Bacteroides coprophilus                                  hypothetical protein [Bacteroides coprophilus].                                                                   495417751_MultiTM->495417753_?->495417755_?->495417757_?->495417759_N6-MTase->495417760_TraM->495417762_?->495417764_N6-MTase*->749915300_TraK->495417768_?->749915273_?->495417771_?->495417773_VirD4-like->495417775_?->
      740126887    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 263  bacteria>synergistetes                         Synergistes jonesii                                      hypothetical protein [Synergistes jonesii].                                                                       <-740126887_N6-MTase*<-740130694_?<-740130686_?<-740130688_?<-740130690_?
      23752321     <-N6-MTase*<-?<-?<-?<-?||?-><-MuF<-NUDIX                                                                             N6-MTase    N6_N4_Mtase                    BCPBV781_gp10     262  dsdna viruses, no rna stage>caudovirales       Burkholderia phage Bcep781                               gp10 [Burkholderia phage Bcep781].                                                                                <-23752315_?<-23752316_?<-47835036_?<-47835021_?<-47835022_?<-23752319_?<-23752320_?<-23752321_N6-MTase*<-23752322_?<-23752323_?<-23752324_?<-47835023_?||23752326_?-><-23752327_MuF<-23752328_NUDIX
      41057660     <-N6-MTase*<-?<-?<-?<-?||?-><-MuF<-NUDIX                                                                             N6-MTase    N6_N4_Mtase                    BCPBV43_gp10      262  dsdna viruses, no rna stage>caudovirales       Burkholderia phage Bcep43                                gp10 [Burkholderia phage Bcep43].                                                                                 <-41057653_?<-41057654_?<-41057655_?<-41057656_?<-41057657_?<-41057658_?<-41057659_?<-41057660_N6-MTase*<-41057662_?<-41057663_?<-41057664_?<-41057665_?||41057666_?-><-41057667_MuF<-41057668_NUDIX
      503885074    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 262  bacteria>firmicutes                            Oscillibacter valericigenes                              DNA-cytosine methyltransferase [Oscillibacter valericigenes].                                                     <-503885062_?<-503885063_?<-503885068_?<-503885069_?<-503885070_?<-753860327_?<-503885072_?<-503885074_N6-MTase*<-503885075_?<-503885076_?<-753859673_?<-503885079_?||753859675_?-><-753859676_?<-503885081_?
      313134650    N6-MTase->?->MultiTM->?->?->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like->                                          N6-MTase    N6_N4_Mtase                    BFAG_00704        261  bacteria>bacteroidetes                         Bacteroides fragilis 3_1_12                              DNA (cytosine-5-)-methyltransferase [Bacteroides fragilis 3_1_12].                                                313134643_N6-MTase->313134644_?->313134645_MultiTM->313134646_?->313134647_?->313134648_TraM->313134649_?->313134650_N6-MTase*->313134651_TraK->313134652_?->313134653_?->313134654_?->313134655_VirD4-like-><-313134656_?<-313134657_?
      516293791    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 261  bacteria>firmicutes                            Bacillus subtilis                                        DNA-cytosine methyltransferase [Bacillus subtilis].                                                               <-751587985_?<-516293791_N6-MTase*<-751587988_?<-751587990_?<-751587993_?<-751587996_?<-751587999_?<-516293781_?<-751588002_?
      740127826    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 261  bacteria>synergistetes                         Synergistes jonesii                                      hypothetical protein [Synergistes jonesii].                                                                       740127826_N6-MTase*->740127829_?->740127831_?->740127834_?->740127836_?->
      488335929    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 260  bacteria>firmicutes                            Enterococcus faecalis                                    DNA-cytosine methyltransferase [Enterococcus faecalis].                                                           <-488335922_?<-488335923_?<-488335924_?<-488335925_?<-488335926_?<-488335927_?<-488335928_?<-488335929_N6-MTase*<-488326316_?<-488335930_?<-488335931_?
      523636963    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 260  bacteria>spirochaetes                          Leptospira alstonii                                      DNA methylase family protein [Leptospira alstonii].                                                               523636950_?->738083161_?->523636910_?->523636962_?->523637035_?->738083163_?->523636883_?->523636963_N6-MTase*-><-516420500_?||523636965_?->516420495_?->738083107_?->523637017_?->523636918_?-><-523636982_?
      446276825    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira interrogans                                   hypothetical protein [Leptospira interrogans].                                                                    <-446276825_N6-MTase*<-488029201_?
      446276826    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira interrogans                                   hypothetical protein [Leptospira interrogans].                                                                    446010275_?->446857835_?->446042321_?->447190987_?->446733555_?->446710508_?->446742328_?->446276826_N6-MTase*-><-642966814_?
      488105867    N6-MTase*->?-><-?<-?<-RADICAL-SAM<-RADICAL-SAM                                                                       N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira interrogans                                   DNA methylase family protein [Leptospira interrogans].                                                            488105905_?->488105867_N6-MTase*->488105884_?-><-488105878_?<-488105912_?<-488105908_RADICAL-SAM<-488105906_RADICAL-SAM<-488105911_?<-488105916_?
      490560754    N6-MTase*->?-><-?<-?<-RADICAL-SAM<-RADICAL-SAM                                                                       N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira noguchii                                      DNA methylase [Leptospira noguchii].                                                                              490560818_?->490560729_?->490560817_?->490560745_?->490560733_?->490560819_?->490560738_?->490560754_N6-MTase*->748682039_?-><-748682041_?<-488105912_?<-490560723_RADICAL-SAM<-488105906_RADICAL-SAM<-490560812_?<-488105916_?
      490575676    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira noguchii                                      DNA methylase family protein [Leptospira noguchii].                                                               738074846_?->490560729_?->490560817_?->490575678_?->490575727_?->490575700_?->490575698_?->490575676_N6-MTase*-><-738074849_?<-490575711_?<-490575720_?<-738074852_?<-490575822_?||490575802_?->738074875_?->
      490627908    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira santarosai                                    DNA methylase family protein [Leptospira santarosai].                                                             648272267_?->490613267_?->490625571_?->648272268_?->490627898_?->490613256_?->490625561_?->490627908_N6-MTase*-><-515125978_?||490615892_?-><-490606321_?<-490615889_?
      516471781    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 259  bacteria>spirochaetes                          Leptospira interrogans                                   DNA-cytosine methyltransferase [Leptospira interrogans].                                                          488060392_?->446857835_?->757477947_?->516471783_?->446733555_?->446710508_?->757477948_?->516471781_N6-MTase*-><-757477949_?<-642966825_?||757477950_?->
      529283433    N6-MTase*->?->?->?-><-?<-?<-RADICAL-SAM<-RADICAL-SAM                                                                 N6-MTase    N6_N4_Mtase                    LEP1GSC059_0080   259                                                 Leptospira phage vB_LnoZ_CZ214-LE1                       DNA methylase family protein [Leptospira phage vB_LnoZ_CZ214-LE1].                                                529283421_?->529283359_?->529283377_?->529283378_?->529283410_?->529283402_?->529283348_?->529283433_N6-MTase*->529283414_?->529283408_?->529283440_?-><-529283441_?<-529283405_?<-529283374_RADICAL-SAM<-529283380_RADICAL-SAM
      763469483    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 259  bacteria>proteobacteria>gammaproteobacteria    Pseudomonas amygdali                                     hypothetical protein [Pseudomonas amygdali].                                                                      763469483_N6-MTase*->
      490454463    <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-TraM<-?<-?<-?<-MultiTM                                                       N6-MTase    N6_N4_Mtase                    -                 258  bacteria>bacteroidetes                         Bacteroides ovatus                                       hypothetical protein [Bacteroides ovatus].                                                                        <-696272134_?<-696272136_?<-490454458_VirD4-like<-490454459_?<-490454460_?<-490454461_?<-696272353_TraK<-490454463_N6-MTase*<-696272355_TraM<-490454465_?<-490454466_?<-490454467_?<-490454468_MultiTM<-490454469_?<-490454470_?
      738085263    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 258  bacteria>spirochaetes                          Leptospira alstonii                                      DNA methylase [Leptospira alstonii].                                                                              523635656_?-><-523635655_?||738085263_N6-MTase*->523639106_?->523639078_?->544626688_?->
      757125221    N6-MTase*->?->ASCH->                                                                                                 N6-MTase    N6_N4_Mtase                    -                 256  bacteria>spirochaetes                          Leptospira weilii                                        cytosine methyltransferase [Leptospira weilii].                                                                   515130822_?->757125221_N6-MTase*->515130824_?->648273328_ASCH->515130826_?->515130827_?-><-490634456_?||490634520_?->
      490416978    <-N6-MTase<-?<-TraM<-?<-?<-MultiTM<-?<-N6-MTase*<-?<-?<-?<-VirB4-FtsK<-?<-?<-Int-maturase+HNH                        N6-MTase    Methyltransf_26+N6_N4_Mtase    -                 254  bacteria>bacteroidetes                         Bacteroides                                              MULTISPECIES: DNA methyltransferase [Bacteroides].                                                                <-695344547_N6-MTase<-490416985_?<-490416984_TraM<-490416983_?<-490422209_?<-757748895_MultiTM<-492201468_?<-490416978_N6-MTase*<-490416977_?<-757748908_?<-490416974_?<-757748898_VirB4-FtsK<-490416972_?<-490416970_?<-757748901_Int-maturase+HNH
      492201466    Int-maturase+HNH->?->?->VirB4-FtsK->?->?->?->N6-MTase*->?->MultiTM->                                                 N6-MTase    Methyltransf_26+N6_N4_Mtase    -                 254  bacteria>bacteroidetes                         Bacteroides fragilis                                     DNA methyltransferase [Bacteroides fragilis].                                                                     490416968_Int-maturase+HNH->490416970_?->695344546_?->490416973_VirB4-FtsK->490416974_?->695551343_?->490416977_?->492201466_N6-MTase*->492201468_?->695551347_MultiTM-><-695551206_?<-695551210_?<-695551214_?<-695551218_?<-695551229_?
      494836062    VirB4-FtsK->?->?->?->N6-MTase*->?->MultiTM->?->?->TraM->?->N6-MTase->                                                N6-MTase    Methyltransf_26+N6_N4_Mtase    -                 254  bacteria>bacteroidetes                         Bacteroides plebeius                                     DNA methyltransferase [Bacteroides plebeius].                                                                     490416966_?->494836051_?->494836054_?->494836055_VirB4-FtsK->490416974_?->696379523_?->494836059_?->494836062_N6-MTase*->494836064_?->494836066_MultiTM->494836068_?->490422207_?->494836070_TraM->494836072_?->494836074_N6-MTase->
      696379522    <-MultiTM<-?<-N6-MTase*<-?<-?<-?<-VirB4-FtsK                                                                         N6-MTase    Methyltransf_26+N6_N4_Mtase    -                 254  bacteria>bacteroidetes                         Bacteroides vulgatus                                     DNA methyltransferase [Bacteroides vulgatus].                                                                     <-490439537_?<-490439536_?<-490439534_?<-647589678_?<-490439531_?<-696379543_MultiTM<-494836064_?<-696379522_N6-MTase*<-494836059_?<-696379523_?<-490416974_?<-696379524_VirB4-FtsK<-494836054_?<-494836051_?<-490416966_?
      738211128    N6-MTase*->?->?->?->Terminase_LS->?->?->MuF->                                                                        N6-MTase    N6_N4_Mtase                    -                 254  bacteria>proteobacteria>gammaproteobacteria    Lysobacter dokdonensis                                   hypothetical protein [Lysobacter dokdonensis].                                                                    738211113_?->738211116_?->738211118_?->738211121_?->738211123_?->738211126_?->738211371_?->738211128_N6-MTase*->738211131_?->738211133_?->738211136_?->738211139_Terminase_LS->738211142_?->738211144_?->738211147_MuF->
      489305329    <-N6-MTase<-?<-URI<-N6-MTase*                                                                                        N6-MTase    N6_N4_Mtase                    -                 253  bacteria>firmicutes                            Bacillus pumilus                                         DNA-cytosine methyltransferase [Bacillus pumilus].                                                                <-736611803_?<-489305170_?<-736611729_?<-489305717_?<-489305499_N6-MTase<-736611804_?<-489305926_URI<-489305329_N6-MTase*<-736611730_?<-489306211_?<-736611731_?<-489305838_?<-489305408_?<-489305272_?<-736611732_?
      549781736    DHH->?->?->?->?->N6-MTase*->N6-MTase->                                                                               N6-MTase    N6_N4_Mtase                    -                 253  bacteria>firmicutes                            Bacillus amyloliquefaciens                               Modification methylase RsrI [Bacillus amyloliquefaciens].                                                         549781723_?->549781725_?->549781726_DHH->549781728_?->549781730_?->549781732_?->549781735_?->549781736_N6-MTase*->549781738_N6-MTase->504230838_?->752856931_?->549781740_?->752856932_?->545132120_?->545132119_?->
      500205672    METHYLASE-><-NucA<-?||?->?->N6-MTase*->                                                                              N6-MTase    N6_N4_Mtase                    -                 252  bacteria>proteobacteria>betaproteobacteria     Burkholderia vietnamiensis                               DNA-cytosine methyltransferase [Burkholderia vietnamiensis].                                                      500205679_?-><-500205677_?||500205676_METHYLASE-><-759573915_NucA<-759573856_?||759573857_?->500205673_?->500205672_N6-MTase*->759573859_?->759573916_?-><-759573860_?<-500205667_?<-759573861_?<-759573862_?<-500205665_?
      691080530    DCM->?->N6-MTase*->?->?->?->?->Terminase_SS->GT->                                                                    N6-MTase    N6_N4_Mtase                    -                 251  bacteria>proteobacteria>gammaproteobacteria    Acinetobacter baumannii                                  cytosine methyltransferase [Acinetobacter baumannii].                                                             <-447065173_?<-487955263_?||446150822_?->446882801_?->446105617_?->446969423_DCM->691080528_?->691080530_N6-MTase*->691080532_?->446915980_?->691080536_?->447067279_?->691080538_Terminase_SS->691080540_GT->691080542_?->
      751875430    <-N6-MTase<-N6-MTase*                                                                                                N6-MTase    N6_N4_Mtase                    B4069_2083        249  bacteria>firmicutes                            Bacillus subtilis                                        Adenine-specific methyltransferase [Bacillus subtilis].                                                           <-751875423_?<-751875424_?<-751875425_?<-751875426_?<-751875427_?<-751875428_?<-751875429_N6-MTase<-751875430_N6-MTase*<-751875431_?<-751875432_?<-751875433_?<-751875434_?<-751875435_?<-751875436_?<-751875437_?
      505338314    -                                                                                                                    N6-MTase    N6_N4_Mtase                    -                 248  bacteria>firmicutes                            Ruminococcus sp. SR1/5                                   DNA modification methylase [Ruminococcus sp. SR1/5].                                                              
      756411756    <-HNH<-?<-?<-?<-?<-N6-MTase*                                                                                         N6-MTase    N6_N4_Mtase                    -                 248  bacteria>firmicutes                            Bacillus cereus                                          hypothetical protein [Bacillus cereus].                                                                           <-447151656_?<-446510145_?<-446579460_HNH<-446891195_?<-446977878_?<-446667118_?<-446680756_?<-756411756_N6-MTase*<-756411780_?<-446637401_?<-756411757_?<-446569170_?<-756411667_?<-446700868_?<-446897432_?
      655162666    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 246  bacteria>firmicutes                            Paenibacillus harenae                                    DNA methylase [Paenibacillus harenae].                                                                            655162659_?->655162660_?->655162661_?->655162662_?->655162663_?->655162664_?->655162665_?->655162666_N6-MTase*-><-655162667_?||738828102_?->655162669_?->655162670_?->655162671_?->655162672_?->655162673_?->
      738793084    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 246  bacteria>firmicutes                            Paenibacillus sp. FSL H8-237                             hypothetical protein [Paenibacillus sp. FSL H8-237].                                                              <-738793073_?<-738793332_?<-738793334_?<-738793337_?<-738793076_?<-738793079_?<-738793082_?<-738793084_N6-MTase*<-738793087_?<-738793090_?<-738793093_?<-738793097_?<-738793099_?<-738793101_?<-738793338_?
      496003735    N6-MTase*->N6-MTase->METHYLASE->                                                                                     N6-MTase    N6_N4_Mtase                    -                 245  bacteria>firmicutes                            Erysipelotrichaceae bacterium 2_2_44A                    DNA-cytosine methyltransferase [Erysipelotrichaceae bacterium 2_2_44A].                                           496003721_?->496003723_?->748747028_?->496003727_?->496003731_?->748747029_?->748747030_?->496003735_N6-MTase*->496003737_N6-MTase->496003740_METHYLASE->496003742_?->496003744_?->496003745_?->496003746_?->496003748_?->
      695862061    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 244  bacteria>firmicutes                            Lactobacillus paracasei                                  DNA methyltransferase [Lactobacillus paracasei].                                                                  511748281_?-><-695862042_?<-695862043_?||511748282_?-><-695862045_?||511748283_?->511748284_?->695862061_N6-MTase*->511748286_?->511748287_?->511748288_?->511748289_?->511748290_?->511748291_?->511748292_?->
      736516671    <-MuF<-Phage_portal<-Terminase_SS<-?<-Terminase_SS<-?<-N6-MTase*                                                     N6-MTase    N6_N4_Mtase                    -                 244  bacteria>firmicutes                            Lactobacillus kunkeei                                    hypothetical protein [Lactobacillus kunkeei].                                                                     <-736516664_?<-736516665_MuF<-736516666_Phage_portal<-736516667_Terminase_SS<-736516668_?<-736516669_Terminase_SS<-736516670_?<-736516671_N6-MTase*<-736516672_?<-736516673_?<-736516709_?<-736516675_?<-736516676_?<-736516677_?<-736516678_?
      149882909    <-N6-MTase*<-?<-?<-?<-?<-?||?-><-MuF                                                                                 N6-MTase    N6_N4_Mtase                    BcepNY3gene09     243  dsdna viruses, no rna stage>caudovirales       Burkholderia phage BcepNY3                               DNA methylase [Burkholderia phage BcepNY3].                                                                       <-149882902_?<-149882903_?<-149882904_?<-149882905_?<-149882906_?<-149882907_?<-149882908_?<-149882909_N6-MTase*<-149882910_?<-149882911_?<-149882912_?<-149882913_?<-149882914_?||149882915_?-><-149882916_MuF
      526178238    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    N355_gp092        242  dsdna viruses, no rna stage>caudovirales       Cellulophaga phage phi13:2                               DNA methylase [Cellulophaga phage phi13:2].                                                                       526178231_?->526178232_?->526178233_?->526178234_?->526178235_?->526178236_?->526178237_?->526178238_N6-MTase*->526178239_?->526178240_?->526178241_?->526178242_?->526178243_?-><-526178244_?<-526178245_?
      702087568    N6-MTase*->?->?->?->Terminase_LS->?->?->MuF->                                                                        N6-MTase    N6_N4_Mtase                    LF41_2421         242  bacteria>proteobacteria>gammaproteobacteria    Lysobacter dokdonensis DS-58                             Adenine-specific methyltransferase [Lysobacter dokdonensis DS-58].                                                702087561_?->702087562_?->702087563_?->702087564_?->702087565_?->702087566_?->702087567_?->702087568_N6-MTase*->702087569_?->702087570_?->702087571_?->702087572_Terminase_LS->702087573_?->702087574_?->702087575_MuF->
      752537274    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 242  bacteria>proteobacteria>gammaproteobacteria    Aeromonas jandaei                                        hypothetical protein [Aeromonas jandaei].                                                                         <-752537266_?<-752537267_?<-752537268_?<-752537269_?<-752537270_?<-752537272_?<-752537273_?<-752537274_N6-MTase*<-752537487_?||752537275_?-><-752537488_?<-752537276_?<-752537277_?||752537489_?-><-752537490_?
      752704809    N6-MTase*->N6-MTase->                                                                                                N6-MTase    SP+N6_N4_Mtase                 -                 241  bacteria>firmicutes                            Bacillus subtilis                                        cytosine methyltransferase [Bacillus subtilis].                                                                   752704719_?->752704717_?->516293771_?->695807941_?->498016884_?->752704715_?->752704713_?->752704809_N6-MTase*->752704807_N6-MTase->752704711_?->516293779_?->751588002_?->516293781_?->752704709_?->695807779_?->
      763125951    <-METHYLASE<-?<-?<-?<-?<-?<-?<-N6-MTase*                                                                             N6-MTase    N6_N4_Mtase                    -                 238  bacteria>firmicutes                            Lactobacillus salivarius                                 hypothetical protein [Lactobacillus salivarius].                                                                  <-763125944_METHYLASE<-763125945_?<-763125946_?<-763125947_?<-763125948_?<-763125949_?<-763125950_?<-763125951_N6-MTase*<-763125952_?<-763125953_?<-763125954_?<-763125955_?<-763125956_?<-763125957_?<-763125958_?
      690349817    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    LSJ_3100c         236  bacteria>firmicutes                            Lactobacillus salivarius                                 DNA modification methylase [Lactobacillus salivarius].                                                            <-690349810_?<-690349811_?<-690349812_?<-690349813_?<-690349814_?<-690349815_?<-690349816_?<-690349817_N6-MTase*<-690349818_?<-690349819_?<-690349820_?<-690349821_?<-690349822_?<-690349823_?<-690349824_?
      672551258    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    JO84_gp345        234  dsdna viruses, no rna stage                    Aureococcus anophagefferens virus                        putative cytosine-5 methyltransferase [Aureococcus anophagefferens virus].                                        672551140_?->672551027_?->672551135_?->672551187_?-><-672551213_?<-672551058_?<-672551059_?||672551258_N6-MTase*-><-672551060_?<-672551198_?||672551141_?->672551214_?-><-672551007_?<-672551171_?||672551028_?->
      38638617     <-N6-MTase*<-?<-?<-?<-?<-?||?-><-MuF                                                                                 N6-MTase    N6_N4_Mtase                    BCPBV1_gp10       233  dsdna viruses, no rna stage>caudovirales       Burkholderia phage Bcep1                                 gp10 [Burkholderia phage Bcep1].                                                                                  <-38638610_?<-38638611_?<-38638612_?<-38638613_?<-38638614_?<-38638615_?<-38638616_?<-38638617_N6-MTase*<-38638618_?<-38638619_?<-38638620_?<-38638621_?<-38638622_?||38638623_?-><-38638624_MuF
      # 14; R-M                                                                                                                                                                                                                                                                         
      496651971    HNH-><-?<-?||?-><-METHYLASE<-?<-?<-N6-MTase*<-REase<-METHYLASE<-?<-?<-?<-?<-DOC                                      N6-MTase    N6_N4_Mtase                    -                 369  bacteria>proteobacteria>epsilonproteobacteria  Campylobacter sp. 10_1_50                                modification methylase [Campylobacter sp. 10_1_50].                                                               736902613_HNH-><-496651959_?<-496651961_?||496651963_?-><-496651965_METHYLASE<-496651967_?<-496651969_?<-496651971_N6-MTase*<-496651973_REase<-496651975_METHYLASE<-736902809_?<-496651979_?<-496651981_?<-496651983_?<-489029481_DOC
      550983501    <-METHYLASE<-?<-?<-?||adenine-glycosylase-><-?<-N6-MTase*||?->ABC->                                                  N6-MTase    N6_N4_Mtase                    -                 366  bacteria>proteobacteria>alphaproteobacteria    Thalassospira lucentensis                                modification methylase [Thalassospira lucentensis].                                                               <-655387197_?<-703179655_METHYLASE<-550983496_?<-550983497_?<-703179657_?||655387198_adenine-glycosylase-><-550983500_?<-550983501_N6-MTase*||550983502_?->703179660_ABC->550983504_?->550983505_?->550983506_?->550983507_?->550983508_?->
      696534926    <-ABC<-?||N6-MTase*->?-><-adenine-glycosylase||?->?->?->METHYLASE->                                                  N6-MTase    N6_N4_Mtase                    -                 366  bacteria>proteobacteria>alphaproteobacteria    Thalassospira australica                                 modification methylase [Thalassospira australica].                                                                696535023_?->696534922_?-><-696534923_?<-696534924_?<-696534925_?<-696535024_ABC<-696535025_?||696534926_N6-MTase*->696534927_?-><-696534928_adenine-glycosylase||696534929_?->696534930_?->696534931_?->696534932_METHYLASE->696534933_?->
      488970128    REase->N6-MTase*->                                                                                                   N6-MTase    N6_N4_Mtase                    -                 362  bacteria>tenericutes                           Mycoplasma alkalescens                                   Type II restriction modification system N4-cytosine or N6-adenine DNA methyltransferase [Mycoplasma alkalescens]. <-488970114_?<-488970115_?<-488970117_?<-488970121_?<-488970123_?<-750250321_?||750250339_REase->488970128_N6-MTase*-><-488970129_?<-488970130_?<-488970131_?<-488970133_?<-488970134_?<-488970136_?<-488970137_?
      740676991    -                                                                                                                    N6-MTase    N6_N4_Mtase                    -                 361  bacteria>tenericutes                           Candidatus Hepatoplasma crinochetorum                    hypothetical protein [Candidatus Hepatoplasma crinochetorum].                                                     
      446268888    <-REase<-N6-MTase*<-?<-QRPTase_N<-?<-PS_Dcarbxylase                                                                  N6-MTase    N6_N4_Mtase                    -                 359  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter pylori                                      DNA methyltransferase [Helicobacter pylori].                                                                      <-447055814_?||487802840_?-><-446116267_?||446003551_?->446761496_?-><-658502684_?<-727092483_REase<-446268888_N6-MTase*<-446833673_?<-446375435_QRPTase_N<-447064608_?<-446148357_PS_Dcarbxylase<-446875834_?<-727086548_?<-446836634_?
      726979196    PS_Dcarbxylase->?->QRPTase_N->N6-MTase->?->N6-MTase*->REase->                                                        N6-MTase    N6_N4_Mtase                    -                 359  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter pylori                                      DNA methyltransferase [Helicobacter pylori].                                                                      726979213_?->446880902_?->726979188_PS_Dcarbxylase->726979190_?->726979192_QRPTase_N->726979215_N6-MTase->726979194_?->726979196_N6-MTase*->726979198_REase->726979200_?->726979202_?->726979204_?->726979206_?->447045345_?->446802786_?->
      727305172    <-REase<-N6-MTase*<-?<-QRPTase_N<-?<-PS_Dcarbxylase                                                                  N6-MTase    N6_N4_Mtase                    -                 359  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter pylori                                      DNA methyltransferase [Helicobacter pylori].                                                                      <-727305182_?||727305180_?-><-727305178_?||727305177_?->727305176_?-><-727305339_?<-727305174_REase<-727305172_N6-MTase*<-727305171_?<-727305170_QRPTase_N<-727305168_?<-727305167_PS_Dcarbxylase<-446875834_?<-727305165_?<-727305163_?
      727328309    <-REase<-N6-MTase*<-N6-MTase<-QRPTase_N<-?<-PS_Dcarbxylase                                                           N6-MTase    N6_N4_Mtase                    -                 359  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter pylori                                      DNA methyltransferase [Helicobacter pylori].                                                                      545063097_?-><-727340056_?<-727328235_?<-727060838_?<-727328232_?<-727340057_?<-727340058_REase<-727328309_N6-MTase*<-727340059_N6-MTase<-727327806_QRPTase_N<-727327808_?<-727327810_PS_Dcarbxylase<-727105271_?<-727327829_?<-727327812_?
      738491218    <-ABC<-ABC<-?||N6-MTase*-><-REase<-SAM-synthetase                                                                    N6-MTase    N6_N4_Mtase                    -                 355  bacteria>tenericutes                           Mycoplasma hyosynoviae                                   hypothetical protein [Mycoplasma hyosynoviae].                                                                    <-738509207_?<-738509210_?<-738489806_?<-738493702_?<-738509238_ABC<-738509213_ABC<-738509216_?||738491218_N6-MTase*-><-738491226_REase<-738509218_SAM-synthetase<-738491210_?<-738493659_?||738491204_?->738491202_?->738509221_?->
      738495747    SAM-synthetase->REase-><-N6-MTase*||?->ABC->ABC->                                                                    N6-MTase    N6_N4_Mtase                    -                 355  bacteria>tenericutes                           Mycoplasma hyosynoviae                                   hypothetical protein [Mycoplasma hyosynoviae].                                                                    <-738508624_?<-738491202_?<-738491204_?||738493659_?->738493662_?->738495750_SAM-synthetase->738495763_REase-><-738495747_N6-MTase*||738495744_?->738495741_ABC->738495760_ABC->738508625_?->
      635202114    SAM-synthetase->REase-><-N6-MTase*||?->ABC->ABC->                                                                    N6-MTase    N6_N4_Mtase                    NPL7_01825        350  bacteria>tenericutes                           Mycoplasma hyosynoviae                                   DNA methyltransferase [Mycoplasma hyosynoviae].                                                                   <-635202118_?<-635202108_?<-635202109_?||635202110_?->635202111_?->635202112_SAM-synthetase->635202113_REase-><-635202114_N6-MTase*||635202115_?->635202116_ABC->635202117_ABC->635202119_?->
      635203155    SAM-synthetase->REase-><-N6-MTase*||?->ABC->                                                                         N6-MTase    N6_N4_Mtase                    NPL1_02345        350  bacteria>tenericutes                           Mycoplasma hyosynoviae                                   DNA methyltransferase [Mycoplasma hyosynoviae].                                                                   <-635203148_?<-635203149_?<-635203150_?||635203151_?->635203152_?->635203153_SAM-synthetase->635203154_REase-><-635203155_N6-MTase*||635203156_?->635203157_ABC->635203158_?->
      737788178    ABC-><-?||?->?->?->?-><-METHYLASE||N6-MTase*->                                                                       N6-MTase    N6_N4_Mtase                    -                 281  bacteria>bacteroidetes                         Flexibacter roseolus                                     hypothetical protein, partial [Flexibacter roseolus].                                                             652629928_ABC-><-652629930_?||652629932_?->652629934_?->737788172_?->652629936_?-><-737788175_METHYLASE||737788178_N6-MTase*-><-652629939_?||652629940_?-><-652629941_?||652629943_?->652629944_?->737788181_?->652629945_?->
      # 7;                                                                                                                                                                                                                                                                           
      446323973    N6-MTase*->                                                                                                          N6-MTase    N6_N4_Mtase                    -                 231  bacteria>firmicutes                            Streptococcus pneumoniae                                 DNA-cytosine methyltransferase [Streptococcus pneumoniae].                                                        446197668_?->446393604_?->446106149_?->447079773_?->446276775_?->446520999_?->446532213_?->446323973_N6-MTase*->446377415_?->446079215_?->446701795_?->446719036_?->487776690_?->446963895_?->446742073_?->
      698840876    IstB_IS21->?->?->N6-MTase*->?->DCM->                                                                                 N6-MTase    N6_N4_Mtase                    -                 231  bacteria>firmicutes                            Streptococcus pneumoniae                                 putative prophage protein [Streptococcus pneumoniae].                                                             <-698840869_?<-698840870_?||698840871_?->698840872_?->698840873_IstB_IS21->698840874_?->698840875_?->698840876_N6-MTase*->698840877_?->698840878_DCM->698840879_?->698840880_?->698840881_?->698840882_?->698840883_?->
      660643078    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    EL26_10775        226  bacteria>firmicutes                            Tumebacillus flagellatus                                 hypothetical protein EL26_10775 [Tumebacillus flagellatus].                                                       <-660643071_?||660643072_?-><-660643073_?||660643074_?-><-660643075_?||660643076_?-><-660643077_?<-660643078_N6-MTase*<-660643079_?
      589891490    <-HNH<-?<-?<-N6-MTase*<-?<-HNH<-?<-?<-?<-?<-DHH                                                                      N6-MTase    N6_N4_Mtase                    SEP9_059          225  dsdna viruses, no rna stage>caudovirales       Staphylococcus phage vB_SepS_SEP9                        cytosine specific DNA methyltransferase [Staphylococcus phage vB_SepS_SEP9].                                      <-589891483_?<-589891484_?<-589891485_?<-589891486_?<-589891487_HNH<-589891488_?<-589891489_?<-589891490_N6-MTase*<-589891491_?<-589891492_HNH<-589891493_?<-589891494_?<-589891495_?<-589891496_?<-589891497_DHH
      529047177    URI->Toprim->N6-MTase*->                                                                                             N6-MTase    N6_N4_Mtase                    N007_05790        223  bacteria>firmicutes                            Alicyclobacillus acidoterrestris ATCC 49025              hypothetical protein N007_05790 [Alicyclobacillus acidoterrestris ATCC 49025].                                    529047170_?->529047171_?->529047172_?->529047173_?->529047174_?->529047175_URI->529047176_Toprim->529047177_N6-MTase*->529047178_?->529047179_?->529047180_?->
      665851256    URI->Toprim->N6-MTase*->                                                                                             N6-MTase    N6_N4_Mtase                    -                 222  bacteria>firmicutes                            Alicyclobacillus acidoterrestris                         hypothetical protein [Alicyclobacillus acidoterrestris].                                                          750137128_?->750137130_?->544884002_?->544884003_?->544884004_?->665851255_URI->544884007_Toprim->665851256_N6-MTase*->544884009_?->665851257_?->544884011_?->
      740246844    <-N6-MTase*                                                                                                          N6-MTase    N6_N4_Mtase                    -                 215  bacteria>firmicutes                            Tumebacillus flagellatus                                 hypothetical protein [Tumebacillus flagellatus].                                                                  <-740246740_?||740246840_?-><-740246743_?||740246745_?-><-740246746_?||740246748_?-><-740246750_?<-740246844_N6-MTase*<-740246847_?
      # 1;                                                                                                                                                                                                                                                                           
      290971699    N6-MTase*->                                                                                                          N6-MTase    SP+N6_N4_Mtase                 NAEGRDRAFT_76461  994  eukaryota>heterolobosea                        Naegleria gruberi strain NEG-M                           predicted protein [Naegleria gruberi].                                                                            <-290971703_?||290971697_?-><-290971705_?||290971699_N6-MTase*->290971701_?-><-290971707_?
      737515587    <-Terminase_SS<-Terminase_SS<-N6-MTase*                                                                              N6-MTase    N6_N4_Mtase                    -                 219  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis                                     adenine methyltransferase [Haemophilus parasuis].                                                                 <-737515584_Terminase_SS<-538043063_Terminase_SS<-737515587_N6-MTase*<-737515588_?
      
      Back to Contents
    • Multiple sequence alignments of the Group I, clade 4 adenine methylases (Fungal N6- DNA MTases).

      Two alignments are shown, the first alignment is only of the Fungal N6-MTases ilustrating the various domains in the proteins. The second shows the core N6-MTase domain
      
      Boundaries                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   <----Treble-clef-DNMT3-like----->                                                                                                            <------ Chromo ---------------------------------------------------------------------------------------->                                                                                                                                         <----------------------Chromo----------------------------------------------------------->                                                                                                     <-AT-hook----->                                                       <-----------------------------------chromo----------------------------------------------------------->                                                                                                                                                                                                                                                                           <----At-hook----------->                                           <--------------------------chromo-------------------------------------------------------------------->                                                                                                                                                                                                                                                             <---------*--*----------*--*----ZZ-finger---------*--*--------*-*---?                                                                                                                                                <----------*--*-------------------PHD finger------------------------*--*-----------------------><-----*--*-PHD----*----* fin*er*---------------*--*-->           <-------*--*----------ZZ finger-------*--*-----------*--*------------------->                                                                             <--- GATA finger------------>                                                                              <--- DAM methylase--------------                               <---------Syanapomorphic strand-helix-------->    Str-1                   Str-2                              Str-3                       Str-4 ****                                          Str-5->                        Str-6                             Str-7  <--C-terminal N6-MTase                  Str-1              Str-2                                Str-3                                                                              <---- KRI domain------------->                                             EEEEEE                                      EEEEE                                                  EEEEEEEE                                                                                      <--false fusion of SPX, CYTH and tMs----                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           
      FINAL                                                                          ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------E----------------------------------------------------------------E-----EE---------------------------EE------------------------------------------HHHHHHH----HH--------------------------------------HHHHH----------------------E--------------------------------------------EEEE-E-----EEEEEEEEE----EEEEEE-----------------EEEE----E--------------------------------------------------------------------------------------------E-----------------------------------------------------------------------------------HHHHHHHHHH----EEEE------HHHHHHHHH----EE--------HHHHH-----------------HHHH------------------------------------------------------------------------------------------------------------------------EE--------------------EE--HHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHH--HHHHH--------EEEEEEEEE--------EEE-------------------------------HHHHH--------HHHHEE-EE-----------EEE---------EEEE-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------H-HHH-----------------------------------------------E-----EEE----------EEEEEE----EEEEE---------EE--------------------------------------------------------------E--------------EE------------EE-------------------------------------------------------------------------E------EEEE-----E-EEEE----------------------------------------------------EEE---------------------------------------------------------HHHH----EEEEEEEEEEEE------------------HHHHHHHHHH---------------EEEEE------------------------------------E-EEEEEEEEE-------------------------------------------------------------------------------HHHHHHHHHHH----E------EEEE-----------------------------------------------E--EEEEE--------E-EE----HHHHHHHHHHH---HHHEE------EEEEEE----EEEE--------------EEEEE-------------------EEEEEEEEEE-----------------------EEEE------------------------------------HHHHH----------------------------HHHHH-----------------------------------E-----EEEEEEE----HHHHH-----EEE-----EEE---------------------EE-------------------EEE--------EEEEE-----------------------HHHH-------EEE---E-EE---------------------E------EEE---------------EEEEE--------------------HHHHHHHHHH------EEEEE------HHHHHHHHH---EEEEE--HHHHHHHHHH-------------------EEEEE--HHHHH---------------EEEEEE------EEE--------------HHHHHHHHHHHHHHHHHHH----EEEEEEE------EEEE-HHHHHHHHHH---EEEEHHHHHHH-------HHHHHHHH--EEEHHHHHEEEE------------------------------------------EEEEEEEEE-----EEEEEEEEEEEEEEE------HHHHHHHHHHHH-------EEEEEEEE-----------------------------------------------------------------------------HHHHHHHHHHHHHHHHHHHHH----EE----------HHHHHHH------------------------------EEEEEE-------------H-HHHHHHHHHHHHHHHHHH-----EEEEEE------------------------------EEHHHHHHHHHH------EEEEEEEEE----------------------------------------------------HHHHHH--------------HHHHHHH-------
      ALIGN                                                                          ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EEEE------------------------------------------------------------------------------------------------EEEHHHHH---------------------------EEEE-EE-----EEHHHEHH-----EEEE-------HHHHHHHH--------------------------------------------------------------------------------------------------HHHHHHHH-H------------EEEE---------E-----HHHHHHHHHHH----------------------EEE-EE-E--------HHHHHHHHHH-----E--------------E-----HEE---HHHH-HHHHH-------H----HHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------E----HHHHHHHHHHHH---------------------EEEEEE---------------EEEEEE-------EEEEEEE-------HHHHHHHH--HHHH--------------------------EE-------HHHHHH-HH---HE------------------------HHHHH------------------------------------------------------------EEEE-----E-----------------------------------------------------------------H-----------------------HHHHHHHHH--------------------EE-EE----------HHHHHHHHHHH------HEE-E------------------------------H----HHH------HHHHEHH-------HHHHHHHHHHHHHHHHH---EEEEE---------EEE--------------------HEEEH---------------------------------EEEE--------------E----------------------------------------------------------------------------------------------------------------------HHHHHHHHHH-----------------------------------------------------------------------------------------------EE-----EEE-EEEEEEEE------------------HHHHHHHHH---------------EEEEEE------------------------------------E-EEEEE--HHH---------H------------------------------------------------------------------------HHHHHHH---EE------EEEE--------------------EE-----------EEEEE------------EEHH---------E-EE------HEHHHHHHH----EEEEE-----EEE-EE----------------------EEEEE-HHHHH-------H-----EEEEEE-----------------------------EEEE------------HHHH-------------------HHHHH---------------HHH---HHHHHHHHHHHHHHHHHHHHHHH--HHH------------------------EEEEEE----HHHHH------EEE---HHHHH---------------------H--------------------EEE-------------------------------EEEEEE-------EEEE------------------------------------------------EEE-----E-----------------------HHHHHHHHHH------EEEE--------HHHHHHHHHH---------HHHHH-----------------------HHHHHH-HHHH-----------------EEE---------EEE-------------HHHHHHHHHHHHHHHHHHHHH---EEEE-------EEE----HHHHHHHH----HHHHHHHHHHHHHHH-----EEEEEH-HHEHHHHHHHHHH------------------------------------------EEEEEE------------EEEEEEEEEE-----HHHHHHHHHHHHHH-------HHHHHHHH------------------------------------------------HHHHH------------------------HHHHHHHHHHHHH--HHHHHH----HH----------HHHHHHH------------------------------EEEEEE-----------HHH-HHHHHHHHHHHHHHHHHHH-----EEE--------------------------------HHHHHHHHHHHH--------EEEEEEE--------------------------------HHH------------------HHHHH--------------HHHHHHH-H-----
      HMM                                                                            -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------E--H-HHHHHH---EEEEE-----HH------------------------EEEE--EEE----------------------------------------------------EEE--------E-----E----E-EEEE-HHHH-HHHHHHE-----EEEE--------------------HHHHHHHHH---EEE-EEE-HHHHHHH----HHHH-----------------------EEEEE----EEE-HHHH---------E----EEEE---EEE------EEEEEE----HHH------H----HHHHHHHHHHHHHHHHHH-HHH---EEEEEEEEE----EEEEEHHHHHHHHHHHH-HEH-HEEEE----EEEE---HH-HHHHHHHH-------EEEHHHHHHEEEEE---------EEEEE----EEE-----------------------------------------------------------------------------------------------------------------------------------------------E--H-HHHHHH---EEEEE------HH-----------------EEEE--EEEEEE----------EE----EEE-----EE-HHHHHHHHHHE-----EEEE------HHHHH----HHHH---EEEEEE---HHHHHHHHHHH---------------E---EEEEE--------------------EE-HHHH---E----EEEEE---EE----EEEEEE----HHHH----HHHHHHHHHHHHHHHHHHHHH---EEEEEEEEE----EEEEEHHHHHHHHHHHH-HE-HHEEEEEEEE---HHHHHHHHHH--EEEHHHHHHEE-EEE--------EEEEEEE----E--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HHHHHHHHHHH--EEEEEEEE----------------HHHHHHHHHHHH-------------EEEEEE-----------------------------------EE-EEEEEEEEE-----------------------------------------------EEEEE---------------EE-------------HHHHHHHH---EE------EEEE-------------E-------E-----------EEE---------EEE--EEEE--------EE-EE----HHHHHHHHHHH---EEEHHHH---EEEEEE-----EEEEEE-----HHH----EEEEE--HHHH-------HH-HHHHEEEEEE---------------------------EEHHHHH-HHHH--HHHHHHHHH---------------HHHHHHH---------------HHH---HHHHHHHHHHHHHHHHHHHHHHH----------H--------EE-EE-----EEEEEEEE---HHHHH---E-EEEEHH---EE---------------------EE-------------------EEEE--------EEEE-----------------EEEEEEHHHH---EEEE-------------------------------E------E-------EEEEEEE----EEEE-----EEEE-----EEEEE-HHHHHHHHHHHH----EEEEEE-----HHHHHHHHHH---EEEEE--EHHH----EEEEE--------------HEEEEHHHHHHHHHHHH-----------EEEEEE---HHHHHHHHHHHH-HH------HHHHHHHHHHHHH-EEEEE---EEEEEEE-----EEEEE-HHHHHHHHHHH--EEEEEEEHHHHHHHH----EEEEHHHHHEEEHHHHHHEEEE-----------------------------------------EEEEEEEEEEE--------EEEEEEEEEE------HHHHHHHHHEEEE-----EEEEEEEEE---------------------------------------HH----HHHHHHHHHH-----------------------HHHHHHHHHHHHHHHHHHHE---E----------HHHHHHHHHHH-----EEE--------------------EEEEEE--HHH--HHHHHH---HHHHHHHHHHHHHHHHH-----EEEEEEEEEE---------------------EEEEEEEEEHHHHHHHHH-----EEEEEEEEEE--------------------E---------EEE-----E----H---EE---HHHHH--------------HHHHHHH-H-----
      FREQ                                                                           ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EE-------------------------------------------------------------------HHHHHHHHHHH----H--------------------------------------------------------------------------------------------------------HHHHH--EEEE-E-----EEEEEEEE----EEEEEEE--------HHH---H--EHHE-----------------------------------------------------------------------------------------------------------------------------------------------------------------E------------------HHHHHHHHHH-----EEEE------HHHHHHHHH---EEE--------EE--------------------HHHHHHH------------------------------------------------------------------------------------------------------------------------------------------------HHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHH---EEEEE--H------EEEEEEEE---------EE---------------------------------EEE---------HEEEE-E------------EE----------EEEE-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------H-HHH------------------------------------------------E----HHHHH-HH-----EEEEEE----EEEEEE--------EE--------------------------------------------------------------E--------------EEE---------EEEE-------------------------------------------------------------------------E-----EEEEE---EEE-EEEEE---E------------------------------------------------EE---------------------------------------------------------EEE----EEEEEEEEEEEE--------E----------HHHHHHHHH----------------EEEEE-----------------------------------EE-EEEEEEEE--------------------------------------------------------------------------------HHHHHHHHHH-------------EE---------------------------------------------------HEEHHE----------HH---HHHHHHHHH-----EEEEEE------------------------E-EEEEE--------------------------EEE--EEEE------------------------EEEE-----------------------------------EEEEE------------------------------------------------------------------E-EE-----EEEEE-----------------E--HH--HEE---------------------E--------------------EEEE-----EEEEEEE----HHHHHH-----------------------HHHH---H-HE----------------------------------------------EEEE------------------------HHHHHHH------EEEE-------HHHHHHHHH----EEE---HHHHHHHHHH------------------EEEEE-----------------------EEEEE------EEEEE-------------HHHHHHHHHHHHHHHHHHHH----EEEEE--------E----HHHHHHHHHH-HHHHHHHHHHH---------EHHHHHHHHHHHHHHHHHHE---------HHHH------------------------------EEEEEE-------EEEE-EEEEEEEEEE-----HEH-HEHH-EEEEEE------HHHHHHHHH---------------------------------------------------------------------------HHHHHHHHHHHHHHHHHHHHHH----EEE---------HHHHHHH-------------------------------EEEEE-----------H-H-HHHHHHHHHHHHHHHHHH------EEEE-------------------------------HHHHHHHHHHH-------EEEEEEEE-----------------------------------------------------HHHHHH--------------HHHHHH--------
      PSSM                                                                           ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HHHH--------------------------------------------------------------------------------------------------------------------E-----EEE---------------------------------------------------------EEEE-------------------------------------------------------------HHHHHH------H------------HH---------------------------------E----------EEEE-EE----EEEEEEEE-----EEEEE------------------EE------------------------------------EEE----------HHH-----------------------------------------------E-----------------------------------------------------------------------------------HHHHHHHHHHH-----E------------------E---------H-HHHHHH-----------HHHHHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------------HHHHHHH----------------------EEE------------------------------EEEEEEEE----------E-----------------------------------------------HHHH--------------------------EEEE---------------------------------------------------------------------------------------------------------------------------------------------------H------------------------------------------------------------------HHHHH-------------HHH----------------------------HHHHHHH------HHHEEE-----EEE-------------EEE----EEEEE----------------------------------E------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------E-----------------EE---------------E--------------------------------------------HH--HHHH---EEEEEE---------------------HHHHHHHH--------------------------------------------------------E-EEE-----------------------------------------------------------------------------------------HHHHHH----EE------EEE----------------------------------H---H------------EEEEE-----------------HHHHHHHHHHHHHH--H--------EEE-------EE-----HH--------EEEEE--------------------EEEEEEEEE----------------------------------------------------------------------------------------------------------------------------------------------EEEEEE---------------EEE-----E-E-------------------------------------------------------------------H--------HHHHHHHHH-------EEE---E-EE---------------------E------EEEE----------------E----------------------HHHHHHHHHH------EEE---------HHHHHHH----EEEEE--HHHHHHHHHHH-------------------EEEE----H-----------------EEEEE------------------------HHHHHHHHHHHHHHHHHHH----EEEEEEEE------EEE--HHHHHHHHH---EE---EEEH---------EEEHHH-----EEEE-EEEEEE--------------------------------------------EEEEEE-----------EE--EEEEE--------HHHHHHHHHHHH-------EEEEEEE------------------------------------------------------HH----HHH--------------HHHHHHHHHHHH---HHEEEE--EE------------HHHHHHH------------------------------EEEEEE----------------HHHHHHHHHHHHHHHHH----EEEEE---------------------------------EEEHHHHHHHH---------EEEEE-----------------------------------------------------HHHHHH--------------HHHHHHH-------
      CONF                                                                           --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------87-----8755535221356678883321288888446--3-323224--------56778654322000----------------------------------------------------013887886402-----530065-566686443-345677765542002301--------------2457655566652454222-12314655565----3036777---7777888--------8865544----34511110134------602677653---101013--33532335543025------78765422456410001562677-6178616668885726870888712345665202304430-00134----314554312-4434445754-----5346522223445510-04330044555----54121112121233346677667778877766542010-2566777-8888766666775565554777863235666654578777888-------7651456211-25-5555676604678888874378615662776312111354414410008453211-0244220366435----442121310011223333765--67887665555445-----5576444456777788888444588877530011----35676774122125--20244455556888-88877--76673---32024--------------------40410344677778875254---328887344555870145442366765532343003221010025566234354314898831110421356654535865-542224344587722210121124654663255313-304542455775034125----6726775456543322345------------------------------68-888888878765-------567666676444667888888------------------------------88888888788876543----------421005123346777777888876655566666567888--88887776556633656888888887765543343345467765410-233012334543443443356777776642155433455------675410068720210243343452146664188267777178877787022777554---------4566422345----------4667-----------657766545210--------------0103333334542012-------------------------------------------------------------------------16543212332678401-21100---1-55566677766677766665676673-00-----------------12035------5788843--0-------------------------------1589--3543402001132002567503266888637888---885482566665347687788888877313441888------------------------888-888712-3306765413---------588----------------------------88888866666-----------88776688--88887-1557234441268303------3732578888643---2412--0020-----------245420035678421--78873133458622-3227865839689998467300000058883254330689825301211002000017704552120001-------16966532445776516888----88----------8-804623445-333587565444456-8---88-78887772113201---------------356---66644212210245676300357--77788--8688-88---31-13-----57886602784112208983-58501220046---------------------6078988---887--54445604507655630345522344010100002244142024001210565440224---6-622---------10011--57530------321337886300124665277715788853247898836764207766256530699874865118887-51677664468006785235888988875032256787766655---16888726312022237-------999525887078953303103455476666752878999999999999875317972999872213683366401678999987185011123460722565321000366620023215664047336778888201088--88777-------------------76040024652245503204301510356326766003537285676520278996155544431--------6775445667777-----------------66----63333455655----424157--888888782736899997875835207652--3021166766--4021112310668888556------------88---8-617887236524--3155110-447889999888888875687453897410112----------------777-8512021004145455636788-613624776337777788-----------886888-----7412388886765616--41-1335578--------------8999865-589888
      MVEG_09762_Mortierella_verticillata_NRRL_6337_672819038                        --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MD-----ASTAPPKKTVLPQFHSVYPSALSSSSSNNPV--L-EPKKRG--------RKPKARPVEQAPVS----------------------------------------------------VTSSPGESGESL-----VDIDSI-STSSSSLSS-ALSLSSIATSPSFSLSSL--------------SPLSSPTSTSSIDIVFDTH-QSPRFESIDPS----VTLLTPS---STASAGR--------KRKQSQP----PQHKHNPVAQD------HHDSVDNNI---MLDNFH--TTGAFLTRTASKLL------GGQSVTGLDSNIHGYSLGRPCKV-FGTDKVWYFGTMVGLKNGKIRIRFDDWGSDWDEWLASDSR-RLVKS----LTPEETEQR-NARLATLNEG-----STSTTTISATTVVKTS-ALGVQASAPTI----GPNAKGKSVKEAKMPMEPKYKKIKIAPATHTSDTVVRT-LGISKMD-QQNTASTSALTSTLDPTLIQKQKTTKTTKPPKTPKTPKTPKAP-------KKKEQELLPS-EI-PSPTGNETAITSIAVSEATSTVKKKPKTPKNTSKKAGTVAPSSTDLVAAPDAMSA-TSVATDTHLNMQL----ETTHAVAKVKKELKRPKKASK--TAAQDTLDGDQVSS-----LVDATFTPGAPGDTVKPKKRPKAKAAEVPISLAA----LFASPVPTEQTLAL--ASEATIEITTQPDT-LSSTE--STLST---PPPRS--------------------MSATPTLSEENQTTDESKDS---ESSPDPPVTRTLGNPIDLGLLPLEITEFEYEDESKELMGMIHRVLDTGKIQNRVVEYVPEDDYYDTAGGYRKKKKRPNGDG-ENGDEEDDEGSKRTQTKTKDKFVAILESRHQQQILK-QHISGMPLPSSFPFVPPP----PPKKKLTKEQREALAQKCE------------------------------HA-MRNMCHEPMIQL-------VRQVIYTNSDRVEAINNYNNN------------------------------YSKSQPKIVDRKERFAQ----------ALAGRKLTKLNRGLPLARKIIGPTSFAAAQGLTFNEAGY--IETPVTKIIAAAKTIATSTPVKKKRGRKPLSLKAAVASTAGGGTGLD-ESQAGFGKDGKDGKINPFQSLRSLTQTKKRKIGVAEDHI------ANLKRLYTPGTRIQARDKQMEWLLARVRDLRNSRVLVQYEGFPAFYNEWIDINSERL---------KYDTTLEQTP----------WDPN-----------ATSTRNPLTTTT--------------ITASHPVSSNDSGTTA-------------------------------------------------------------------------PPPTDPEADSVPQDTPE-DKALT---G-KKAIREKKGGVTESTPTDQEKAEESI-SA-----------------EEGPV------DDGLDAV--E-------------------------------EENA--AVVNCIQCQVKISQFRIYCMYCEVESKAVVQSDP---PCEPFNLCLWCFSNAFPEHHDHPRSSFATKVIVGPK------------------------GVR-PVKGGI-ITRFEKDVLD---------LEY----------------------------KEPEKPAAPTL-----------SPEDQLNA--MMRLD-NDQSYVYLDQWRERKV------CAFCNDEGLANKD---PFIG--PYPF-----------LLASTNRYGDAKKKN--FWAHDACARHSPEV-IQGKDGTWYNVSMAMRRGRTVKCTLCKEKGATIGCFEPKCYRSFHVPCTGKPMSHFEDGVIFWCPQHEKA-------YLQRDAYDETFSCDRCSKIL----GV----------N-PWSTCIKCS-DDFFHTFDLCRECFS-K---DD-INHEHGKDDFKITS---------------LEL---LRSEQLEKEAAMVIPMDEALANK--KKPIS--YKPK-MR---GL-SR-----LVCSYCWSATSTKWRKGYNG-VLMCEDCFSAG---------------------PVNDTPM---QPP--TTLEESLSDQALSNPNLPPLVVGGSGVGFVDTENPKGVGRYATSAEDYSHTPYLTRTSV---S-AVR---------FDHSS--SQAVY------LDSYGPSENQLYSLPIDTTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRAVTKYTNPNDKILSNFLGRG-TDAIECFLLGRCCTAVDINPAAITLSIRNCSFAIPPNGTVKAEH---RPTILQGDSRKLTGPLF-------ESESFDHVLSHPPYKDCVAYSTHIDGDLSRFGNSIEFQREMTHVVQETYRLLKMGRRCTLGIGDNREHCFYIPVSFQLIRQYINQGFELEELIVKRQRYCAMFGLGTYLCVQFDFLCFTHEFIATLRKVPKQGHDTMILEP--DYSLL-------------------DLVDVTGTVRAIPSCPIERKSVVMGSVWTFKPTEEFDFPTLCASRMVERFGKNDSNWEEFQIEFK--------QTIDPNVVSASDD-----------------DD----TLNDIEDEKEW----TEDAVA--ILPPEEENLVSYERDRLQQIQENNRMLLAL--GLITELSETS--DDIGHQIKLKNSNDNTCFP------------PP---A-ETVLWLVAHIPC--TQMKTHQ-VPAYRTAIMNLARKALVQLPLTGVFVVGAQDIR----------------TEK-GKLLPLGMLILEDIVRVVGDDC-LRLKELIEAVPDGYQKDR-----------RKITSW-----EEYQEEACSPNDQIPK--KH-LPIVHAC--------------YLVFTKV-KEPVKP------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Crev1000002507_Coemansia_reversa_Crev1000002507                                -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSGPDRCHGADGPLSPLA---------------------------AAFDYDS-----------------SGSLSSLDSFLDALSDCADDPLRPDSSLLPE-------RQFNRSASATDAKSSQTVPLK---QQEHKKEKQNL------------P-SRNKRER--------TADVSSP----KPRQTVNLQRQ---------SMQTLV---SSNNMI--TTGAFLTRTTLRKLGVADIVADPE-----ASNAGLAVGSRIKV-LNLDKHWYTAVVLAIDSGKALAHYPGWEHCYNEWVLIESR-RL-LY----RGKADLGVS----------------GSTKAMEQLE-------ALYSGVEEPVL-I-------------------------GFDLAQAINDAFGIS---------TGCNLKNGGADE-AAANVFVNSVDNIDSGTLARKQS--------------GVKESKEHRS----RGRPAGTRNRRRAGHIKAKSSRKQR---P--TIQNDK------AKKTLVGDQEES-AAAECPSTPGVP-----------AEL----RVPSVRLV--RAAENPYARCHRSE--------DIFCGDSDENEATARESCLTAVRDVHNGGNEE---PSGKKARIADD-----NDTATASSGNAIWH-LTRGD-----------------------------------YVTTGAFSTRRTIKALAHSGSTGGIMQD-HHGYYPGQRVEI--MNANQ-----SWYQGRVIAY-----ANKKFL---IHYGGWDHANNEWIVAGSRRMRPASDI----NDMAVTET-EEMARKACVVLVDEYNTYIDGVER-KNAEKADAKRKARELRKTPVNVRVAKLAQSMSADSMGDDEA-EEMTHACLPENE-----EEDDEDVDPI-----SVEAGYTPVPQL-----LRVKDYVQLFRKGMQIAARDRNKLWWRATIAEIKTFRLRIHYTGFGSSWDEWVEMNTQRIMFEESAESSRCDAEMAGDPLHVSGENSSALSKEPNCMGQVISSSSHGKDAGQTDYGTVE--ESQETKGIPVPR---RLGRPPGPETKSTPLSLRLALKALM-SDREMF-EQCHPE-------ELDVFHLPKEHMSMR--------DYS------TFLKVGDRVRIRDRDK----QWYDCTIIDLRHGRIRICFNGHSDEFNQWIPVNSDRI-RILR-ETIDGDKRLEKME----KESQIAQRRKQEKLRAQRRKRSQASIASLVRLAES-----------------------------------------------------------------------------------------------------LEYIVDCEDTFVSGHTQ-VGPQD---G-ITELDAATEVQSRLEGSTEDDASGGD-DK-----------------PLLQLM---M-ESDIDNV--DGMPLLTRILLAEHFKRQRFGALIRHGSMVAMQDSA-TWFVYCNQCNIVISTFRYYCLSCERPSD-----GY---DYESYDLCLMCFSRQFPSDHPHSQASFARAAVGDAESIVKFTADALSRCRDHERLAAASAHML-DLFSGL-IAVYEPDAFDTSYKPRTP-GTSLWSKLAVGLHGTTTSTLDTSAVVGKIIGNTRRSRITSIINP---DSEML-SSCNGHDA--DASSD-KEDKDETDRQLCKADVDDLPPRCAFCSEDDQSQRDLLGTFAAEQP--F-----------VLSMVRDDGTVRRRR--FWAHTACAKYSPEV-LVTEAGQWFNVAAALRRARTIKCAECKRRGATIGCFHDRCQKSFHVACAGMSKSFFESGRIFWCPKHARMAAGVVEGNAGPEPVSLEARCANCNHEL----SG----------DLMWMECLECL-AEPERQFSLCLTCYD-SKDALA--DHPHKKRCFREH----------------------------LSHTGGVSSNGQYLADI--AAQDS-------RR---RV-GKGT---TCCHYCRSRQSRRWRKGYAG-VVMCEACFNTAHSLRGGAQAKQVQAGTVCDQDLFAEADN---DS----PGELEVVALNPFGRS---LITGSDAQPLPPPQQQ----QQGALIEDYTQGIYFTREACIAPN-RVG---------LPSVSQ-QPLGE------LSSYGPTDSMLFTLPVNTSYFDIPGRAPRWASHSGTDYHGTWLPQTVRRALLRYTQRGEHVLSNFLGRG-TDAIECFLLNRKCVGVDINPSAVSLSQRNCSFTITPGCGMSIEF---RPTIMQGDARDLRSDLWPGASYFAESESFDHILSHPPYKDCVLYSTNIDGDLSRFPGPDEFQREMEKVVTESWRLLKMGRHLTLGIGDNRAECFYIPVSYQLIRTYISSGFELEELVVKRQRYCQAFGLGTYLCVQFDFLMFTHEFIATLRKVPKDQIDSMHLAD--RHYAEDSEFGLQTVTVDKDPLDFRLVAISHRCLREVPASPIERKGVVMGSVWTFEHHPVHSFTHMCMSRMVERFGRDGSNWEQIDLALRP---LE--QGTTENAADGTNAASDIAAASDTQCTGDVIDD----KCASLNNARN-----QAESDP--ELLDSDTEEGGYERARQRQIQQNREQLLQL--GLVSELGEDS--TDIAHYQKMIAMTP----L------------PPTSSA-PLALIVVPHILN--TEFARCH-VEPYRRTLVQITHDASHRLCPSGLLVLGVQDVR----------------DEH-GKLWPLGMLVLEDVQRAVGSIR-LRLKEFIVVVENGYARKR-----------DDVMSR-----ETFVDEQCVVEVNTPD--IH-VPIVHAY--------------YLVFMKL---K---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RMCBS344292_09167_Rhizopus_microsporus_729708575                               -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSSRPFAWQLFQNCGATEELKLETRADPKNAPDKKKFDT--------FMGKLLLKSNNNNNKPTN--------KQSTLTEEPFANLFRV---------RKNKLQDS--NDDA-----KFTFKYLPND-----------GKKENRL----------------SNG--KKTRW---D--VG-----QEEEQKKDEVTQKEEPSLEE--KEENPIDLF----DDDS-----------------------------ESLTSLDTLQSDDDVLGPWL----------------NLA--------SDKKEQE---------CESCVQLNDKDRDPMDA-AHCCSTCADQWKTLLSDLLEKI---------QSSVVNTSAGKKKKVKLD---KSIREQPSTKK--------------EKQPQQR--------TRGKKSA-I--PPTQSRHSQ--------------STN--KKMETFV--STGAFMTRKTAETL------ADPE-NGFYPNPYGYVHLQVVEV-LNINGIWYRGTLEKMDKGKVKVKYSDWDDQ-E-WIIMGSR-RL-RI----VPPEVIAKE---------------------------------RDE-KKEKD--------------------------------SSKDALTVIPRA----K--G-RDDDYVSSTLD--KDPHQLFNDNEVFMTRRMARAMVDEYGFRPNSF----GYRRNRAVAV-VF------NVKDKECIGFLREMRDNQVRVWYP--DLYQSEWVRVGSRRLRLLSSEEEE--KYKQHVDLDVQ------------------EVPAVTEQ--KKVEDAPKEAAPKE--------STPEPKTKKLRQ-KKAKSKTPTPEPETHQEKQ--------------------------QQKELQK-PVESS-----------------------------------FLTTGAFATRRAMRQLQDEN---GFVPN-PYNYTYNQPVEI--LNTRSGKTH-FWECGRLVAM-----RPGQVK---VHYDGWDEAYDEWVMVGSRRIRILSKE--------------------------------EEENK-KKHNELLVAEANPEVQDEVKRKRKHQVIRPEDYAKLGLLES-EQT-----------------------------VIKQKKKPSKEI---------------------------T------------------------------PVESDSSSSEEDEEEYE----------EPSIRRRSRKASKNKKKKA------VKQKQQLQQQKAAE--HPTTTTGAEEEKQIIS---------------LRVAQAKAS-EKYEFV-ANVYGY------------------------------DYM------QHVTVLNLDK----------KMYEARLVSMHKNKVKVHYCGWPDIFDEYITVGSRRI------QPIENDHQVECIE--------------------------------------------------------------------------------------------------------------------------------------------PDYQKRYEKIMQDGPTE-CQHQH---QHQPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETPIE-GDQGYKFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSTSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKARETARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTTEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLTFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTAKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTTIEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENVDRMLINE--EDQ---------------------HRIPTKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPAEL-----------------EK-----------------------GK--ETNEEQEEISDYERQRLKRIEENNKTLLKL--GLISELSEKS--DDIIHYENMMSKPP----Y------------VE---S-DLVLMIVSHQQ-----ILPQY-INSYRQTLVGIAKEAIERLAPKGMLIIGAQDIR----------------DPVSGKLWPMSMLILEDIERAVGRDD-IRLKELVVTVPDGYSKDR-----------QQKPRS-----EEEEEEMIDIE--TID--DF-VPIVHAV--------------YLIFQKL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Pbla1000013272_Phycomyces_blakesleeanus_Pbla1000013272                         ------------------------------------------------------------------------------------------------MPSDPFSW-----------QLVRPTAEA------EGVHPKPIKPPLDRWKGAVHGNCENSTQKHSFGTA---------------DTHKSQASDASLSSSTPIPVLEKPKQTNPIILTSKFGQ--------FMGKMLLQSSNNRTKPVPKINE----HSQQSNEPSYTGYFNIERKRRSSDGLKVVLRAV--GQEERSESSTQKVVELKDSIP--EDTIM--DDTMDDTMDDMSIEEESIEDLPTSNSLLKQNLT------VN-----TTIGEASTNNTPIEEVVLKNTPSPISPIGLN----QPKDITLSDPRYHHKKGKRTRWDVGPILIPEDMECTQSFASLQLDDTSKA------------------------------LDYEPENLCDTLATGDCDACNIMNSQERGPLDQ-VALCESCKQTWLPNAKSLLDRL------SKHSKDFVKPTQKQTSKQTLK---QTSKLLSKQQK----QPHQQDN---QSQSQPQ--------SKQKQTP----KQTSKKTTKRLPPAKSVHLHSKKSSI--QEAKNYY--TTGAFLTRNTAKQL------VD-E-AGFHPNPHGFTNMQKVKV-LNINGHWYRGILTMMYGSKVKVHYLDWDDQ-EEWIVMGSR-RL-RG----LTKDEEEEDEDANEEEDGEDAAIENENKDEENEEKDEDKDKEEEEEEEEDDILSIPAESKTTQPVKKGEHLSVSP-----KSHSRKSQSNNVSAL---------NPKKHYTTPID--TDPTQIFNDNEIFMTRRMAHQLTDEHGFKPNSF----GYRYNRAVAV-SL-RAEKGKRNRMEYNGLLREMRGNQVRVWYP--SLRQSDWLIIGSRRLRVLTDQEAS-ELDNLGTELVRTMDTRAKDSEISTKLPTETKTPSPSTT--ETLPEPQSESVPKH-----VEETVEDTVEDTVEGEVEGIAEEPVEAPVPEPAPGPI-KRGRGRPRKVALPVDPSNPHIVTIPKTPTK-TALKKRNDSIKNGIKETKKAAAAAAMVVEMGTKDTEDALDYLTTGAFATRRAMRQLKDEH---GFVPN-PYGYVYDQPIEI--LNTRSSKNK-FWERGRLIGM-----CPGKVL---VRYDGWGEVYDEWVMVGSRRIRPAAAQIESSGDQKSSTMASTENTAPTSTLNGGSASTKKRAKQ-AARNDLLVTEANPEVEDEARKKRQHRVLGPEDYERLGLLAGSEKVEKIERRGRKKMVRDVETPKETETMKDKAVLIEEEPKPIEPS-------VEAPIVQNEDQEMAEPESDLT------------------------------KPNGAMTVPEGTQNDLPVKRKKQKAKTQQRKRKPAKATSPSPSVSSSTSLQQTQAQTELSTELSTE--TETETPTPISTSIVSVAATEADHDTSTLTSSYRRHIPSDE-SNHGFV-ANVYGY------------------------------DYL------QHVQVLHLDK----------KWYEGRLVSMERNRVRVHYCGWLDKFDENIAVGSRRI------QVIENDHEVVCIE--------------------------------------------------------------------------------------------------------------------------------------------PTYSERLEKMQEEKEKK-AVEPE---D-AQVVKPSKRREVAPTVVPAPEEPVHG-TH-----------------DMVEYH-----MEAVDGM--E-------------------------------VEENDTWKVYCNQCNIIIKQFRYYCTYCETPSE-----GH---DYQSFELCLRCFDQNFPFWHEHPRSSFAVQAVIDAD-----------------------MGPM-PIKGEL-VTVWEEDILEEIPDDTQ--DDL----------------------------NDPDDMFSGTM-----------EASEVFSG--VAPLD-EDQGYKFLKRWQRRKV------CAFCNDDDDTSTEL-GKFIG--P--F-----------VITSFNKNGTEKKRS--FWAHDACARYSPEV-FCTSEGKWYNVTIALRRGRGMKCYGCKEKGATIGCFESKCSKSFHLPCAQKPVSYFQSGVIFWCNTHEAY-------YKKKDTYVNIFNCDGCSKRL----ED----------E-TWFTCIPCA-SSYFSSFDLCAECFH-N---FP-QDHAHDEDQFEETS---------------FAI---LKEVEAQKATEAAKAKEELRAAN--PKKKP--LFPKRKR---RL-ADGSVP-LTCSYCGTEEAESWRKGYDGGVLMCTPCFELA---------------------LFIDNDG---N----TASNESLVIDSE-ETH-----------------------RYVMSIEDYTHKPYLTRDAV---S-ATK---------FSDH---RTGPR------LASYGPQPNQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTNKDERVLSNFLGRG-TDAIECFLLQRRCCGVDINPAAVALSQRNCCFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------ADESFHHVLSHPPYKDCVAYSTHLEGDLSRFTSVEDFRAEYGRVVRESWRLLKMGRRLTLGIGDNREHCFYIPVGFHLLREYINHGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFVATFRKIPLECTDKMLPID--NSD----CR---------------DHVRVEHVTKAVPQSAISRKSVVMGTVWIFKPTDTHTFEQLCISRMLERFGKDDGNWEQVLLDFMSPESMMIQNNVQQQYQSSTSS-----------------QNVHKDKPEEQEHDRDLDLDQDQEQEN--NKEDSQNQLSDYEKLRLKRIEENNQTLLKL--GLISEMSEDS--DDVIHYESMMSKKP----L------------EN---A-PLVLVMVGHQP-----IEPRQ-IGLYRETIVQIALEAVKKLAPLGMLIIGTKDIR----------------QKDNGKLWPMSMLVLEDIERAIDRSV-LKLKEMVVTVPEGHSKDR-----------QQKNLN-----TEVEEE---LE--IVD--EH-LTIVHAI--------------YLVFQRMNYSHNYN------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Bcir1000010688_Backusella_circina_Bcir1000010688                               ---------------------------------------------------------------------------------------------------------------------------------MEDIPTHPKYEWSSTQEKRVFNMLDEKTKEYRKQKE---------------DPFSWQLPCSVKFQQDNPNTGETTTNKQ---KTSKFGN--------FMGKLLLKSSTNASKIKKSVPKPLLVEKKDSTEMPLTGYFRF---------MKRRTHHS--NTSEDNGFPKINFKFSTARVLGNENTQIVGLNTQTNI----KPKYRLING--------KRTRWDVVSPVAN-----NSKDEDSSLDLKNKEVVLPQDQSQEPTSSTE----KSDMRE-KCIDF--DFQYKEDFNDDVV------DDGSSLGSLQSDDDVLGPWIELGMFDSNVSETKKSDHFSSFYSVDNQQNLKSLDLPEIQ----CLDCKLRTAYDKKPLDI-SNLCLSCQNKWSDTLSNVFRKFEFAVL-TQTCKEVTEKPKSKKVKVTND---NKKASVEKNDV----LHKEKTG---GEKLLPQ--------KLKKSPPAIGIPPTKSNYTRKT-NKKMATGSSKKNDI--PKKTNAT--AKPIVVATPETNDD------DDPP-FGFCSNPRGLVYKQVVEV-LNINGHWYRGTLELMDKRKVKVKYIDWDDQ-EEWVIIGSK-RL-RT----IQLEDKESD----------------QQTDQQMKSKGEST---ADEVSKNNPIY-------------------------------FRKIAAAVKSK---------EPDDYVSSTLD--KDPTQIFNDNEVFMTRRLAQELVDEHGFMPNSF----GYRRNRAVAV-TFYTSSKQRKQKEESVGYLREMHKNQVRVWYP--DLHQSEWLLVGSRRLRILTEEEEESILFDSSIDLDRQ------------------EVPKIQEI--AQIENKIDEI------------PIINPPPKRSRGRPKKTLPTEVVEIATEEDTN---NVYEPEQVPQSTI--------ILEKKVTEE-HGQGD-EAKVSN----------------------------FLTTGAFATRRAMRQLTDQS---GFVPN-PYGYTNNQAVEV--LNTRSGKKK-FWEFGRLVEM-----KPGKVR---VHYEGWSDLYDEWIMVGSRRIRVAQEQ-------------IPQKEDNDEVIAAPVP----------KTNDLLMTELNPEIRDEVKRNKKHKILSAKDYQELGLLVNIEELAAKELRKKK--------------------LHEKKTEEMGTT-----VKVKAVSKTKSKKSEIGGDKYED------------------------------EHDDEDIDEGDLDNDYQ----------DTVVKKRLKSASKFKRKVK-------KSKTKIAKQTPCE--HHSPSPPPANDTQVIS---------------LRLAQARAS-NSQSFV-ANVYGY------------------------------DYM------QHVTVLHLDK----------KFYEGRLVSMRKNKIKVHYCGWLDAFDEYITCGSRRL------QVIENDHEVVCIE--------------------------------------------------------------------------------------------------------------------------------------------PNFKERYESM---KSTG-EPSLP---E-ITPVNRIVRKRITLDDVCEEDSEGQR-EY-----------------HKEPSG-----EGEDEEE--L-------------------------------VEMD-AWKVYCNQCNIVIKQFRYYCTYCETPSA-----GC---DYHSFELCLRCFDQNFPFWHDHPRSSFAIQAVIDKE-----------------------VGPM-PIKGEL-VTVWEEDVLEESVNITNE-DEE----------------------------KNGEENIEPMFES---KIDSV-DASKVFSG--DASIT-TDQGYKYLKRWKRRKV------CAFCNDDDDTSNEL-GQFIG--P--F-----------IIATFNKNGVEKKRS--FWAHDSCARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYGCKEKGATIGCFESKCSKSFHLPCSQKPASYFKNGVIFWCQTHEAY-------YNKKDTYVNIFNCDGCSKKL----EE----------E-TWFTCVQCA-TSYFSTFDLCVDCYE-K---FP-ADHRHGEEDFEETS---------------LAI---LKEMEAQKATEAAREKEELRAANARKKKKS--LFPRRRR---KL-PDGSTP-VSCCYCGTYEAETWRKGYDGGVIMCNTCFELA---------------------LLIDNDG---DT---NVTDMPLVVDNDGLQQ-----------------------RYVSSIEDYSHKPYFTREAL---S-STK---------FSDA---STGRR------LESYEPQPNQYFSLTFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTVKDERVLSNFLGRG-TDAIECFLLQRRCCGIDINPAAVSLSQRNCCFEIPPGLT-SAEY---RPIVAQADARQLTGSLF-------GDESFHHVLSHPPYKDCVAYSTHIDGDLSRYTHIDDFKVEYNKVVKESWRLLKMSRRLTLGIGDNREHCFYIPVGFHLIRLYIDQGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIGTFKKIPLENIDRMLIKN--EEEEDSAERA--------------SHVRLTSMQRGVPSSAILRKSVVMGTVWVFRPTESFRFSQLCTSRMVERFGKDDNNWEHIELDFSF------QDQPRCEQIESCHA-----------------ET---------------------EKDQ--SIDEEESPLSEYEQQRLRRIEENNKTLLKL--GLISELSEES--NDVIHYENMMDKAP----L------------ED---G-KLVLMITAHQT-----LAPCQ-INLYRKTIVQLAKDATKKLAHHGMLIIGTQDIR----------------NNTSGKLWPMTMLVLEDIERAVDQST-LKLKEMVVTVPDGYSKNR-----------KQNMDEQPD--TEHNEEEIDIE--TVD--DY-VPIVHAV--------------YLVFQRL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      LCOR_11540.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661176173                   -----MSYSDESSASNAFNWQLMSNGQGASPQQTSEQQAFDIHYN-QYQAYYYNHP--HVYNNYAADQNAAYYYHIYYGYHPGVNQHAYHHHHHYPMVENGYSYVENSTSYHHSSSTVEPSAPA--PP------PPPSKSSQSQTQNAPSSSQPRKDEKVSISEG---------------SSSRSHEKTVCNIEVDGAVPTTTTTNRKPVFKQSKFGS--------FMGKLLLVKSKGGNGKDDKGHE----KTNISKKSGL---------HRSAAEKTTSSSRH--HHQQQQNLMATNYFQIRKEVPGADNVVMATNKTEDDN---------------------KEKKW------VS-----DSITAAT----TTANVVIDDAPPSPSSKSTQ----EEDNVT--------QPHDKGNVDVDSLF-----DDVSSLSSLQSDDSVLGPWM-----------------------LDEQDDYNDDEDSQSW----CIDCDAL-PEQQNPLEP-TLMCDGCKTKWMFRTVSLINKIAAAAIHQRRRRPAVRPPKEKRSKVSGQ---QKEKMGNKTTRKRATTPTSGAK---RKAKAQQ--------HREQKLP----PPTKSRASKAI---------QKKTNGGKKKKDEFV--TTGAFMTRNTAKQL------ADPE-HGFHPNPHGFTRMQQVEV-LNFNGHWYRGVLTMMNANRVKVEYIDWQDQ-EEWIIMSSR-RL-RT----IKPDKIMTE---------------------------------SVHEEDRDPLL---------------------P-----QTTSTTENMTMTIKA---------KDDDYVTATLD--TDPHQISNDNEVYLTRRMAKELKDEHGFRINTF----GYRYNRAVAV-TC-KRGVMGKKNIEYLGYLREMRDTQIRVWYP--TLRQSDWLVVGSRRLRLLTSEEEK-ALEEEGKKMESLL-----------------QQPPPQSS--APTKEKNESDAAIS-----MNPSSVSPSTKRRKA-SRSSAKKDTPKPTTCETPSQQPKVSKGKEVETNVSIAPATTAPKSTKKISSQ-ECTTQ-----------------------------------FTTTGAFATRRAMRQLQDEH---GFVPN-PYGYYNNQPIEV--LNTRTAKGKFFWERGHLVGM-----KPGYVK---VRYDGWSDIYDEWFMVGSRKIRPASTE-------------SNEPSASTATTATAAGAGNEGGIA-ATGGDLLCLEDNPELRHE---KRPHRLIGPEDYLQLGYLVP---------------------------------IVDPPPPPPPPPATISTSPLSRSVPTKSKLTRIPDNEN-D------------------------------DKDDDAIIDNDDDEDYTCKKRIGR---RRRKRQASATNNRGKRRRRNNTTATTKAQSRKRDREKEEEEDDGEWEEQVFPSKIPI------------STLIRRGRPVDDDDNHGFI-ANVYGY------------------------------DYM------QHVQVLHLDK----------KWYEARLVRMERNMVRVHFCGWIDKFDEYIRVGSRRI------QVIENDHEVECIE--------------------------------------------------------------------------------------------------------------------------------------------PFYKERYESAAYQQCQH-DHQMD---Q-ERAKAAAATAAELAQRMAEMRRSRRR-TL-----------------ENMPTE-----EEGSGDI--D-------------------------------VDGN---KVFCRQCGVIIKQFRYYCTYCESSAE-----DG---TTHSFDLCLLCFDQQFPFWHEHPRSSFAVQAVIDSE-----------------------AGPM-PIKGEL-VTVWEEDVIEDTSAATAADKDT----------------------------EGGETATTTAAAS---KQDHVEEASQVFTG--SSAIDTAEQGYKYLKRWQRRKV------CAFCNDDDDTSEDL-GKFIG--P--F-----------VIATFNKNGVERKRQ--FWVHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYGCKEKGATIGCFESKCNKSFHLPCADKPVNYFRNGLIFFCPTHEAY-------YNKKDTYVNVFKCDGCQKEM----QD----------E-SWYTCLPCA-SSYFRSYDLCGECFD-T---LPDRNHPHDEDDFEETS---------------FAI---LKEVEAEKAREEARAKE--LAAA--KRKKS--LFPK-KR---RL-RAGESPDITCCYCGTTESEEWRKGYDGGIVMCRPCFEMA---------------------LLVDNND---GGRPLISEPNTLINDPVVAAD-----------------------SYVTQIEDYTHKPYLTRDAL---S-STK---------FSNDGKVAPVPR------LSTYEPQPHQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRSILKHTDKDERVLSNFLGRG-TDAIECFLLQRRCVGVDINPAAVALSQRNCCFEIPPGMT-SAEY---RPIIAQADSRHLEGSLF-------GDESFHHILSHPPYKDCVAYSTHLEGDLSRFTNIEEFKMEYVKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPLSNMDRMMPQE--TDAQQQPSQ---------------DQIKLSYTQHGVPSSAILRKSVVMGTVWVFKPTEQYRFEQLCMSRMVERFGRDDTNWEQVHLEFQT-------NEAEQQLLLSRGN-----------------ED-------------------NSAMKC--KQKKDESTLSEYEQQRLKRIEENTRMLVQL--GLISELSEES--TDVMHYETMMTKPS----L------------PE---A-PLRLIISAHQPQ----LLAHQ-INAYRQTLMQLARDAVNKLAPQGMLIIGTQDIR----------------SAD-GKLWPMGMLVLEDIERTVDATM-LKLKEMVVAVPDGYSKDR-----------KQETTASLPSSTQLDKEEDIVD--IVD--EH-LPIVHAV--------------YLVFQKL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      LRAMOSA00608_Absidia_idahoensis_var_thermophila_671688888                      -----MSSSNESSASNAFNWQLMAHGQGASPQQTSEQQAFDIHYNSQYQSYYYNHP--HAY-DYAVDQHAAYYYQQYYGYYPVVNQHSYH-----PM-SNAYSYMEHSSSY-QSSSTIAPSVPSQLPPTTQVAAPSSSKSSLSQHQNLSSSSQPGKDEKVSISEG---------------SSSKNTEKTLCNIETDGNGTVPMTTNQKSALKQSKFGS--------FMGKLLLVKGKGGS---DKGHD----KTSGSKKSDL---------YRSAAEKTTSLKHH--HQQKQQSLMTANYFQIRKEAPVADNVVV--KKTEENT---------------------KKEKC------VS-----DSTTATTMTTTTKADVVIEDDATQSPSNKRT----SQHNIK--------QQNDKDD-DLDSLF-----DDVSSLSSLQSDDSVLGPWM-----------------------SDD-DNNDDGDDSQPW----CINCDAL-PEQQNALET-RHMCDGCKDKWVFQTVSLINRIAGAAM-QRGKRPT-RPPKEKQNKVFIQ---QKDKMSNSTVR--TSTPTSGTR---RKGKIQQ--------HREQNLP----PPTKSRASKVI---------QKKSSG--KKKDAFV--TTGAFMTRNTAKQL------ADPE-HGFHPNPHGFTRMQQVEV-LNFNGHWYRGVLTEMNANRVKVEYIDWQDQ-EEWIIMSSR-RL-RT----IKPDKIMTD---------------------------------HVQEVDRDPLL---------------------P-----QVTTTAAENMATTKA---------KDDDYVTATLD--TDPHQISNDNEVYLTRRMAQELKDEHGFRLNTF----GYRYNRAVAV-TC-KRGVMGKKNIEYLGYLREMRDTQIRVWYP--TLKQSDWLVVGSRRLRLLTPEEEK-ILEEEGKNMELLL-----------------QQPPPQSS--GQEKEKIQPDAA----------SPVTPSSKRRKGNSRSSARKETSQPATRQVSQQ--KVNKGKSVDINASKA-SLTVTKSINKVPSQ-ECTSQ-----------------------------------FTTTGAFATRRAMRQLQDEH---GFVPN-PYGYYNNQPIEV--LNTRTAKGKFFWERGHLVGM-----KPGYVK---VRYDGWSDIYDEWFMVGSRKIRPASTE-------------TNEASSSSATTTTATG---NEAVT-ATGGDLLCLEDNPELRHE---KRPHRLIGPEDYLQLGYLVP---------------------------------IVDPPPPPPPPT--------SHPISATSKSTHVFNNENDD------------------------------DDDDDAVIDNDDDEDYTCKKRIGR---RQRKRQTSATSSRVKRRRR------TKAQPKKQVNEKDE---DGEWEEQVFPSKLPI------------STLIRRGRPVDD-DNHGFI-ANVYGY------------------------------DYM------QHVQVLHLDK----------KWYEARLVKMERNMVRVHFCGWIDKFDEYIRVGSRRI------QVIENDHEVECIE--------------------------------------------------------------------------------------------------------------------------------------------PFYKERYESAAYQQCQH-DNQVD---Q-ERAKAAAATAAELAQKMAEMRRSRRR-TL-----------------ENMPTE-----EEGSGDI--D-------------------------------VDGS---KVFCRQCGVIIKQFRYYCTYCESPTE-----DG---IMHSFDLCLLCFDQQFPFWHEHPRSSFAVQAVIDAE-----------------------AGPM-PIKGEL-VTVWEEDVIEDLNSAAAV-KNT----------------------------DGEDTTTTTTTATTSKQADHVEEASQVFTG--SSAIDTAEQGYKYLKRWQRRKV------CAFCNDDDDTSEDL-GKFIG--P--F-----------VIATFNKNGVERKRQ--FWVHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYACKEKGATIGCFESKCNKSFHLPCADKPVNYFRNGLIYFCPTHEAY-------YNKKDTYVNVFKCDGCQKLM----QD----------E-SWYTCLPCA-SSYFRTYDLCAECFD-T---LPDRNHPHDEDDFEETS---------------FAI---LKEVEAEKAREEARAKE--LAAA--KRKKS--LFPK-KR---RL-RAGESPDITCCYCGTTESEEWRKGYDGGIVMCRPCFEMA---------------------LLVDNND---GGRPLISEPNTLINDPVVAAD-----------------------SYVTQIEDYTHKPYLTRDAL---S-STK---------FSNDGKIAHVPR------LSTYEPQPHQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRSILKHTEKDERVLSNFLGREWYKESMC----RRGYQSGKYKAAVALSQRNCCFEIPPGMT-SAEY---RPIIAQADSRHLEGSLF-------GDESFHHILSHPPYKDCVAYSTHLEGDLSRFTNIEDFKMEYIKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPLNNIDRMMPQE--TDARQQTTQ---------------DQIKLSYTQHGVPSSAILRKSVVMGTVWVFKPTEQYRFEQLCMSRMVERFGRDDTNWEQVHFEFQT-------NEAEQQLLSSRSG-----------------ED-------------------SSAMKT--KQKDDQDILSEYEQQRLKRIEENTRMLVQL--GLISELSEES--TDVMHYETMMTKPS----L------------PE---A-PLRLIISAHQPQ----LLAHQ-VNAYRQTLMQLAQDAVDKLAPQGMLIIGTQDIR----------------SAD-GKLWPMGMLVLEDIERTVDATM-LKLKEMVVAVPDGYSKDR-----------KQESAASF---TSLEKEEDIVD--IVD--EH-LPIVHAV--------------YLVFQKL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RO3G_02774_Rhizopus_delemar_RA_99-880_384485890                                -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGKLMLKSKQ---QTPA--------KQPTLTEEPFAGLFRM---------RKNRMQDGVLNEETTVTANKFTFRFLPEN-----------DKKNYRI---------------SSNG--RKTRW---D--VSVHKDKQEEQSNSEEIIPKENPKSEKTPSQNQEEDIF-----DDA-----------------------------ESLTSLDSLQSDDDVLGPWI----------------ELGMIH-----PSKKQED---------CKDCIQLNHKDRNPLDS-VRPCSTCTDQWRELFQELVEKI---------QQHTPQTTKKKKVKLSSK---EEETKIPSIKETTKTSSKNNAKFNNKESSEKR--------SRGKKSITI--PPTQSKHSQ--------------STN--RKMMDFV--STGAFMTRKTAETL------ADPE-NGFYPNPYGYIHHQIVEV-LNINGIWYRGTLEMMDKGKVLVKYSDWDDQ-EEWVIMGSR-RL-RI----VPLEIIAKE---------------------------------KDEETKGQQEV---------------------------NSLAIKDAPTLIPRV----KISG-KEDDYVSCTLD--KDPQQLFNDNEVFMTRRMARALVDEHGFRPNSF----GYRRNRPVAV-VF------NIKDKECIGYLREMRKDQVRVWYP--DLHQSEWIVMGSRRLRLLKPEEEE--KYKKEVDLDAQ------------------EVPVPIQKPTEHKEKQPEASKPKKGRSLKKIKAKSVPEPEMASE-EETVASTPPSSVVVKEDQT--------------------------SEKDSIKLSNSSS-----------------------------------FLTTGAFATRRAMRQLQDEN---GFVPN-PYNYTYNQSVEI--LNTRSGKTH-FWECGKLVAM-----RPGQVK---VHYDGWDDAYDEWIMVGSRRIRVLSKK--------------------------------EEEDKQKRYNDLLVAESNPEVQDEVKRKRKHQVIRPEDYQKLGLLEN-EQV-----------------------------I---KKKKIKEP---------------------------T------------------------------YIESDSSS----EEEFN----------P---KRRSKKASNSHQKK---------KKAATKQPIVEE--EPPV---LEEEKQIIS---------------LRVAQAKAS-EKYEFV-ANVYGY------------------------------DYM------QHITILHLDK----------KLYEGRLVSMHKNKVKVHYCGWPDAFDEYITVGSRRI------QPIENDHQVECNE--------------------------------------------------------------------------------------------------------------------------------------------PDYRERYEKMMQDGPVETCQHKH---Q-PPVSKKLNRKRLTLEDVQDEEGEAAQVEY-----------------YKGPTN-E---DDEIEDTIVV-------------------------------VEMD-SWRVYCNQCNVIIKQFRYYCTYCENPSI-----GH---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------IGPR-PIKGEL-VTVWEEDVLEEEPLQEDT------------------------------------------------------NASNIFTG--EIPID-SDQGYKYLKRWKRRKV------CAFCNDDDDTSEEL-GQFIG--P--F-----------VIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYTCKEKGATIGCFESKCSKSFHLPCSKKPVSYFKSGVIFWCRIHEAY-------YNKKDTYVNVFNCDGCGKKM----ED----------E-SWFTCVPCSSSSYFSSFDLCSECYE-K---FP-SDHPHNEDEFEETS---------------LAI---LKEMEAQKAREAAKKKEEAREAN--AKKKKKSLFPRKRR---KL-PDGSTP-ISCCYCGTFEAESWRKGYDGGVIMCNPCFELA---------------------LMVDNDE---RP----SSDMPLVIHN--TEQ-----------------------QYMTSIEDYSHKPYFTRDTA---T-KVN---------NDSA---V-GQR------LGSYEPQPNQLFSLTFDSTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTSKDERILSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCCFEIPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLEGDLSRFTSIEEFNREYTKVVEESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRIYIDEGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPKENIDRMLIKD--DQD---------------------ECIATKRTLHGVPSSAIMRKSVVMGTVWVFKPTEAFRFDQLCISRMVERFGKDDGNWEHIELDLLV-------NLSVDE------------------------ET-----------------------PK--DEEEKEDLISDYEKQRLKRIEENNKTLLKL--GLISELSEKS--DDVIHYENMINKIP----Y------------SD---S-DLVLMVIGHQK-----VEPQS-INAYRKTLVSIAREATQRLAPKGMLIIGTQDIR----------------DPVNGKLWPMSMLVLEDIERELGRDE-IRLKELVVTVPDGYSKDR-----------QQKFPL-----EQEEEEVIDIE--TID--DF-VPIVHAV--------------YLIFQRL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RMCBS344292_14260_Rhizopus_microsporus_729703045                               -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGKLLLKSNNNNNKSTN--------KQSTLTEEPFANLFRV---------RKNKLQDS--NDDA-----KFTFKYLPDD-----------DKKENRL----------------SDG--KKTRW---D--VG-----QEEEQKKDEVTQKEEPSLEE--QEEDPIDLF----DDDS-----------------------------ESLTSLDTLQSDDDVLGPWM----------------NLA--------SDKKEQE---------CESCIQLNDKDRDPMDA-AHCCSTCTDQWKTLLGDLLEKI---------QSLAVNASAGKKKRVKLD---KSIQEQPSTKK--------------EKQPQQR--------TRGKKSV-I--PPTQSRHSQ--------------STN--KKMETFV--STGAFMTRKTAETL------ADPE-NGFYPNPYGYAHLQVVEV-LNINGIWYRGTLEKMDKGKVKVKYSDWDDQ-E-WIIMGSR-RL-RI----VPPDVIAKE---------------------------------RDEEKKEKD--------------------------------SSKDTLTVIPRA----K--G-RDDDYVSSTLD--KDPQQLFNDNEVFITRRMARAMVDEYGFRPNSF----GYRRNRAVAV-VF------NVKNKECIGFLREMRDNQVRVWYP--DLYQSEWIRVGSRRLRLLSSEEEE--KYKQQVDLDVQ------------------EVPAVIEQ--KKAEDEPKEATPKE--------STSEPKTKKLRQ-KKAKSKAPTPEPETHQEKQ--------------------------QQKEPQR-PVESS-----------------------------------FLTTGAFATRRAMRQLQDEN---GFVPN-PYNYTYNQPVEI--LNTRSGKTH-FWECGRLVAM-----RPGQVK---VHYDGWDEAYDEWVMVGSRRIRILSKE--------------------------------EEENK-KKHNELLVAEANPEVQDEVKRKRKHQVIRPEDYAKLGLLES-EQT-----------------------------VIKQKKKVSKEV---------------------------T------------------------------PVESDSSSSEEDEEEYE----------EPSVRRRSRKAGKNKKKKA------VKQKQQLQQQKIAE--YSTTTTGAEEEKQIIS---------------LRVAQAKAS-EKYEFV-ANVYGY------------------------------DYM------QHVTVLNLDK----------KMYEARLVSMHKNKVKVHYCGWPDIFDEYITVGSRRI------QPIENDHQVECIE--------------------------------------------------------------------------------------------------------------------------------------------PDYQERYEKVMQDGPTE-CQHQH---Q--PAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-KDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VIGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPAEL-----------------EK-----------------------GN--ETNEEQEEISDYERQRLKRIEENNKTLLKL--GLISELSEKS--DDIIHYENMMSKSP----Y------------VE---S-DLVLMIVGHQQ-----ILPRY-INSYRQTLVDIAKEAIQRLAPKGMLIIGAQDIR----------------DPVSGKLWPMSMLILEDIERAVGRDD-IRLKELVVTVPDGYSKDR-----------QQKPRS-----EEEEDEMIDIE--TID--DF-VPIVHAV--------------YLIFQKL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      PARPA_01280.1 scaffold 1359_Parasitella_parasitica_758369443                   ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSQNDPDIVYQ------------P-YISDFPDVDKFSATTTTTATTT----EPIDALMALSD------------YAV--NQSNQIGIVPQDNMIHANSVDNL--P---DN-N-FQWITCSPAVNGDNISFV-QNWSISSSSNNEGLHSNNIASLDLTVMQDN---NNLTAHISTI-QTDIIDLTRTNGSSN-------------N--HECKVC-----------IINSMDRGPFD---------------------------KINTCNKCQAQWTNF---------IASTTFKKTND--ASVSQILHDRQVRLTRNMAQELMDDYGFGPNIH----GYRRDRPVFINTC----HQDQKKKDYIGVLQEQKQGKVKVWVP--DFESFEWLPVGSKRLKTMTPQEEQ----EAYMRIGT-------------------TNIPIQDE--APLLPTQVEKTHHK--------------SNKKQPRIKRPPVKS--NSRQKRTRA-------------------SVTAKASKAAVTAS-AAIHG-----------------I-----------------HLTTGNFSKEATKSQSKDND---GFIPN-SYGYAKNRSVQI--LDIENNKIE-AWSHGTLVAM-----RPGFVK---VHYEKGSKRYYEWIDTGSQRIKLLAEE-------------NTAT------------------------ACLMMLDEDSNAENEPKKSGR------QSEKSIGDISQ-----------------------------------QNNARPITSL--------------------------------------------------------------------------------------------------------------------------------------------------------------RIAQIKAS-ETEHFA-PNAYGY------------------------------HYM------QHITVLDSNK----------KYYEARITSLQKNKVKIHYCGWIDKFDELIPLGSKRI------RVLEDDKEADCLE--------------------------------------------------------------------------------------------------------------------------------------------PNYCERYEQSLHDNNPP-CRPNQE--K-ISAVEDLKQYQVAKNKDND----------------------------------------------------------------------------------T-VTKICCSQCNSEIKRFRYYCSYCEAPSP-STP-CSIDQNSQSFQLCPACFDYSFPSWHQHPRSGFAFQAMTNDF-----------------------HQEE-EHDTTM---LWEHDILPEQGHNVG---------------------------------------------------MPL-EASKVFTGTEEISAQ-DGNGYLFLQKWKDRKI------CGFCNDDDDNSQEL-GPFIG--P--F-----------T-STSMKLGQEKKRT--VWAHYACARYSPEVSYSAEEKKWYNVTKALKRGRSM-------------------------------------------------------------------------------------------------------------ELTKCLHVFR-Q-ST---------------------------------------------EKISSNA-----------TKKTQ--LFPQRPR---KL-PDGSTS-ISCCYCGTSEAKTWRKGYDGGIMMCEPCFDVV---------------------YAKHSSL---QD-------GLFAVD-----------------------------SYAASIEDYSHKPYFTRDTL---S-LTK---------P------VVGPR------LTSYEPQPNQTFSLTFDSTYFDIPGRAPRWASHSGTDYHGTWLPQTVRRALLKYTKKDERVLSNFLGRG-TDAIECFLLQRKCCGIDINPAAIALSQRNCCFQVPEGLT-FAEY---RPIIALTDARQLNGSLF-------NDESYHHVLSHPPYKDCIAYSTHLDGDLSRFTRIEDFKQEYTRVIRESYRVLKMSRRVTLGIGDNREHCFYVPVGFHLLRLYIDQGFELEELIVKRQRYCSAFGLGTYLCVQFDFLIFTHEFIATFKKIPHHRVDKMTLAL-KTGTSLI------------------QPPVATTVLCMASHLVLFPEKA---------------------DRMIERFGQDDCNWLHVELVTDM--LSR--EHNQQEESGSSEG-----------------LQ-------------------MKI-----SRSTEVETISEYEQERLKKIQENNEILLKL--GLVSDLSQESAVNDSIYCDTLLSRKP----Y------------VH---A-DLAVMATGHIAK----LAPEQ-ITVYRQSVVKLAQDAMVKLPVKGLLIVGTMDIR----------------DEKTGKLWPISMLVLEDIERT--TSG-LKLKELVTTVPEGYSKNR-----------DRTEV----------QQEP----------EH-LPIMSFLAEFNKSIYEPWRIEYVAFEAILNGLRAICESGHWTRQDEEDFESAIRLEAGKVDLFINCKQREIESRVLYCQRTLVQQKSMSEKTRNSTDDTLTDILADINDLTKFTRLNFKALERLIQEHDRLTNTNRQPLLVEVCRTRPLDSQRFDGILVQVSSLLDKCRGRLALDSNTDNNSSNTTKSRRQGESSSARYWVHQDNATEVKAVLLFNLPIFGDDSYKQSERAMSYVYLDNASFSEYTAQLQSDNGAELITCRWDGDIHSASQVFVERHVFVKGGFSTQDGIALNANRLHDFVVTKSYSAEEYAQDLTSVGFDQNYVDSSYTIAKSIQGTIIDKQLKPKLRVQFNRLHFEAPHDKSLSVSLDNDVSLSATLDKPSSIDWLNGFLDNRQRFPYAILETRVQDQEPPPWLSRLLESNLVYEVPRFSICLHGVALLWGPQLPLLPWWLSQIDVDIRTAKKQDKLLVEGASEYSGLTRSNSLRPLIDGQYRMGYLEAQLQKRLPQRRQRSLARHGSQYSSNSSRHQSIVITDETLHAAVDSKEKAPEYVVQLEDANASRLTLQSTPTANDEQEGLRSRSSLKQLQTFSDFYRPQEGGSQAYMLQDPHTIKDNDQMRKAMLTELAQEKEEKKKKKKKQKPPQHTMEPKLFFANERTFINWLQFSALIMTAALTLLNFGDHVSTIAGATFFGISMVIALYAFFRYRYRAYQMSTRPDIRYDDLFGPVGLCCLLVGAMALNFALRWQHPSASDTYLGVNNKTDEQS
      HMPREF1544_03082_Mucor_circinelloides_f_circinelloides_1006PhL_511008850       ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSQNMEEIVWE------------P-QASHSHK--------TNTAEYI----APTDQVHLHQA------------TDE--LYQRELG--PQGSINQAIAAINQ--P---INIE-HQWISCNSEPIGKNLHSVLLESDNDSSQNSILNQSSSNSEINQTSTRGN---NHVLSSS----------FQPDETSKS-------------I--NDCENC-----------IINNRDRHPFD---------------------------KISICNKCQCQWAHF---------LPNSIDVDSDN--NSVTQMLHDRHIRLTRKKAQELIDDYGFGPNIH----GYRRHRPVKV-AF----PLDKTKSECIGVLCRVHQGKVKVWVP--ELRMVEWLPMGTRRIKLMNPQEEK----DAETMLRG-------------------SNPTIDDV--ELLPDKQDQQAFKV--------------SHSKRASLDPARLKKPSKVQSKQRKL-------------------TELNTVDSQMNTTL-ATSNR-----------------A-----------------YLTTGAFATRRAVHQLKDDN---GFIPN-PFGYAKNQAVQI--LDTKHSKSK-SWYDGTLVEM-----KPGYIK---VHYNQWPETYDEWLMIGSRRIRIADGG-------------SVATEDIC-----------------KTDEHLMVLAEDPDARHEAKRKRRSVGMQQKRQKSTNSRPE-----------------------------------QSSTRPITSK--------------------------------------------------------------------------------------------------------------------------------------------------------------RVAHLKAA-EAEQFV-PNVYGY------------------------------YYM------QHVTVLYCDK----------RHYEARIVGIQKNKVKIHYCGWADEFDELIPNDSKRL------QAIDTTNQVECIE--------------------------------------------------------------------------------------------------------------------------------------------PDYSERDKKAPLSTNSN-ATDNEVLES-IDVFQEKEAKEIINLNASEHGTEEEE--------------------DIIIVD-----DVTVDNA----------------------------------VVAS-TSKVYCSHCKKDLEQFRYYCTYCEASSS-ASD-QT---NLNSFQLCLACFDQCFPDWHQHPRSGFAIQAITDSP-----------------------KQHQ-NKDTSSSLSIWEEDIMQKQDDCIDK------------------------------------------------TALSL-EASKIFTGVDNTATQ-DEYGYLLLQKWKDRKI------CAFCNDDDDNWQEL-GPFVG--P--F-----------V-SITTKLGQEKKRT--FWAHEACARYSPEV-------------------------------------------SFHLACTNKPINNFRNGVIFWCHVHEAA-------HNKKDTYINIFHCDGCSKRF----SN----------DETWLTCEQCSLVNYFSSFDICNECYN-D-DAVI-GEHQHDKSAYQETS---------------YSL---IERIEARKQIKKEEGKFH------SVKKAQ--LFPRRTR---KLPSRSTTT-TSCCYCGTLEADAWRRGYDGGILMCNTCFGMV---------------------YNKDRPA---ED----ACEGPLAIE-----------------------------SYAASIEDYSHKPYLTRDTV---SSSNK---------P------FIGPR------LTSYEPQSNQLFSLTFDSTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRALLKYTKQNERILSNFLGRG-TDAIECFLLQRKCCGIDINPAAVALSQRNCCFEVPAGLT-FAEY---RPIIALADARQLKGSLF-------GDESYDHILSHPPYKDCVAYSTHLEGDLSRFTQLNDFKAEYTRVIRESYRVLKMDRRLTLGIGDNREHCFYVPVGFHLIRLYIDQGFELEELIVKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPQQQVNRMPLTQDEEDSDTA------------------NKPIYATTLYGIPQSAITRKSRVMGTVWTFNLSHQYSFQLLCISRMIERFGQDDCNWLLVELAIDK--TTQ--EHHQQQQRSGSRA-----------------NR-------------------LSIIAI--PSAPEVETISEYEQERQRKIQQNKETLLKL--GLISDLSQDSVVNDSIYCDTLLNKKP----Y------------AH---A-DLVVMATGHIEN----LLPNQ-IDLYRKSIIQLAQDATSQLAVKGMLIIGTKDVR----------------DQTNGKLWPLSMLVLEDVERT--GNG-LKLKEMVITVPEGYSKNK-----------DTFTTE-----SSIEEEPP----------QHLLPIVHAI--------------YLIFQKQ-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      MAM1_0127c06017_Mucor_ambiguus_758351301                                       ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSGNHREPQCE------------P-QAEYFHE--------RSRAKHA----GPTDQVHLYHA------------SDE--LDQRELG--PLCNINQTTTIKDH--P---KSID-YQWVPYN--PAESSLDLV----QHNFLQNTCSNHSLPNGEINQASTQG----NIALANL----------MQPTKAIES-------------I--IDCKNC-----------ITSSRDRKPFD---------------------------KIALCNQCQCKWINF---------LPSPFNTHSDN--GNVAHMLQHGHKHL----PQELADSSSFGPNVH----GYHRHRLIKI-AF----PKDKTNKECIGVLCRLHQGKVKVWVP--ALQMVEWLPAGTRRIKLMDSEEEK----DAETVLRG-------------------SIPSIHDV--ELLFDEQNQQARGM--------------SQRKRATSESPAQKRSFSAPHKNKEL-------------------AQANTADSQPCIAL-PRTHR-----------------A-----------------YLTTGSFATRKTIHQLKDDS---GFIPN-PFGYAKNQPVQI--LDTKHGRSK-SWYNGTLVEM-----RPGYVK---VHYNQWPETYDEWLMAGSRRIRIADGV-------------SAAVNDMS-----------------KSDEQLMAIAEDPDTQHNAKRKKQSLEKQQKGLKRTSSGYE-----------------------------------QSGARTIASR--------------------------------------------------------------------------------------------------------------------------------------------------------------RLAHLRAA-EVEEFV-PNLYGY------------------------------SYM------QHVNVLYHDK----------RYYEARIVGVQKNKVKVHYCGWTDDFDELIPNGSHRL------QAIDTK---ECLE--------------------------------------------------------------------------------------------------------------------------------------------PDNLERDKHMPLAENTI-SNDEAV--S-VNVPQKIKALEATKLGINDQAVEKEE-EGRQYVDLRQLFSFLPLFVDIIMVD-----DVLVEDK----------------------------------VEPD-TAGVKCSHCKAAIEDFRYYCTYCEATST-TCN-VN---NLESFQLCLVCFGHCFPDWHPHPRSGFAIQAITDGP-----------------------RQSQGSRPLSLSSSMWEEDVMETQDECMGE------------------------------------------------AAISL-EASKIFTGVDNITAQ-DKHGYLFLEKWSNRKI------CGFCNDDDDNSQEL-GSFVG--P--F-----------V-STMTKLGQEKKRT--FWVHDACARYSPEVRFSVVDGKWYNVTRALKRGRSMRCFACKEKGATIGCFDSKCSKSFHLSCTNKPVNNFRNGVIFWCHIHEAA-------LEKKDAYINVFHCDGCSKRF----SN----------DETWLTCEQCSLDNYFSSFDLCKECYK-K-NSVL-REHQHERSVYKETS---------------YLQ---LEQAEVLEQIKKGDGKY--------TKKAQ--LFPWRSR---KL-SNGSTP-TSCCYCGTLQADNWRKGYDGGILMCDTCFGMI---------------------YDRQQPT---QD----ASEGSAAIE-----------------------------NYIASIEDYSHKPYFTRETL---S-MNK---------S------LIGSR------LTSYGPQSNQLFSLTFDSTYFDIPGRAPRWATHSGTDYQGTWLPQTVRRALLRHTKKDERILSNFLGRG-TDAIECFLLQRKCCGIDINPVAVALSQRNCCFEVPAGLT-FAKH---RPIIALTDARQLNGSLF-------GDESYHHILSHPPYKDCIAYSTHLEGDLSRFTQLEEFKKEYMRVVQESYRVLKMDRRLTLGIGDNREHCFYVPIGFHLIRLYIDQGFELEELIVKRQRYCSAYGLGTYLCVQFDFLIFTHEFIATFKKVPQQQVNKMPLMQ--EGLPTT------------------NAPEYTTTLYGIPHSAIARNSRVMGTVWTFKPSHQYSFQILCISRMVNRFGQDDCNWLHVELAIDM--TTQ--EHCQQQQDTESRA-----------------DR-------------------LQITCN--PSTMKVHPISEYEHKRQRKIQENRETLLKL--GLISDLSQDSVVIDSIFCDTMLNKKP----Y------------PH---A-DLVIMATGHIEN----LLPNQ-IDMYRKSIIQLAQDATRQLAVKGRLVIGTKDVR----------------DQISGKLWPISMLVLEDIERT--SHGLLKLKEMVITVPEGYAKDK-----------NAFTAK-----PLIEEGNP----------AH-LPIVHAI--------------YLVFQK--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RirG_262840_Rhizophagus_irregularis_DAOM_197198w_595436939                     MYLPYENFEQNVEATSTSNWQL---NRTETPPGVNPHHLLPP----PPQNYFPQRPIVSTF-NNAHQINPPLYFRPPNQSHDKFKKNKRGFIRHNEVTNNQYAYTSDQNLK-NNSYSASPRFNS--HS-GYQTWHAPNNNNNSNWSIANSTECQYAPKQFDRQNRQMFPLGTQPYPIYHFSQQYSHHPVPNYSYPQQPNNCQYFPSQRPVQHFYQHGTQPNDFHQAIIPQFPTYPTDNNNENVENVEN-IT-QAKKSKRNKKKKVV-----EHSKNNINTVNDQI--IPKPSSEFIKQNDNESKKRIRPSDNERSSTVPLSNDS---LTKSFHIYE---------NDGRK------LF-----TKKKNVNKIEINLKERVSKDKINRLNNNSSKSFLADDENYE--VFGI--KKKIKLDVEIDSIL-SV--HDSTSSENIQILQQVNTDML----------SSKGISNISEV--VNDDSYSNIIDSSESI----IFDDSLSQNENSSRRDS-GVACDNME----------LDDM------VIKEKDINDPPITISNTVLENSYYNSNPIINNSET----INDIVVS---NFASLDA--------NETINL-----ESKLTKYTKLI------ADSSLNNNL--LNSNNEF--QENSLVSSNSIPEI------SA------TPILSPIIRNESTNI-ISNSENEIQREKDEKISNQSKLSY-----E-SDTPVVSET-LE-DS----FTSTNVTND-DFAIPIDKKAG----NDLKESLPINHLKNNS-EDIQSPKESFI-IVDDSSEFEDSDIKLTGNENP-----DVDSTAESTNSFSKVDFSYAKLRLKPEETIQVTCDDKENPNHSSNNSTKIEKGKFSGTIFTRRSIRDLAFAKFEGYSVNQRVKVL--------NVDSIWYPGTIIAMDKTKVRVRFDGWDAKYDEWVSKDSRRLRVMSDEEIL-EIQENSSQNNYEM----IKQHLIDKI----EKRESISI--TTVTEDPQTDFSLK-----SKRTKKRPTKKQTSSGDNTKKVRGSPKLQLSQTKK---SIDHKQKKNN------SHSRPKSPTNVVKK-KCNSC-ENIFDK----------------------------LEKVGALELCTKCVPLFAPQ-LTRIKAF-AHDFTLDQKVQV--LNIDK-----IWYPARIVNV-----EKSRVK---VHFDGWGKRFDEWISVESRRLKALSED-E-----------VNEVQKENHLSDDADNRQNKSNQD-QNQTKASTSESSPKFDSV-DDKPKKDSVCSRDIDKERQLNA--------YKLKE--------------------ILGSDSESLSSV----PDSDSDSLSSCTESISSSSLSLSA------------------------------SESHYNSLSSSSDDEFEVKM-------PKIKRTVRHNNNKTPIRKK------ILQSSKEKICESCK--IAHLNVQRIGSLDLCT---------------YCRSLFGDD-ATFRFKRGGQYGF------------------------------ELH------KRVKVLSRDG----------EWYPATMVDVEDSRIRVHFDGWSDYFDEWIPAGSQRMRDMTLEEIIEAQKALEQLDKETLKQREIIYKPQKRRRSNLKRTIQSSTVTPLNRVTKSMDSTETANNKVDVDSSLPDQQSDHSTDSTISFDWNEYYIGRDTRRSLRNDKLLSNSDVLAKLKYRFKPGNQIEVRDRLKEWVPATIIETKGCRVLIHYDDVPAFYDEWIDISSERLRE-KCAKN---E-IKEVAKNVKSVTTSLETKKDLKKKKD-KV-----------------ENDDYRLEFVVNGALNGR--L-------------------------------ITANDKWCIYCDQCNVVIKQFRYFCTYCESRSE-----GN---DYKSFELCVWCFAHQFPNYHEHPRCSFAVQSVIDDE-----------------------AIKM-SSKGEV-VKTFERDVFDTTYKEPEF---------------------------------------------------------DITSD--KMPLD-TDMGYLYLQAWNMRKI------CGFCNDDD--TNQL-GGFIA--PYPF-----------VSNTYTRYG-EKQKT--FWSHYACAKYSPEV-FFTKSNEWYNVTLAWKRGRSMKCGKCKERGATIGCFEPKCAKSYHLSCTDKPLSHFEMGVIFYCPSHEAR-------YNQKELYNEVYRCDVCSCEL----QE----------D-KWMTCRPCE-SNFFSSFDLCLQCFEVK---FP--EHEHKKDEFEETS---------------VKK---IKDAQITKQATLAVANQKARNAG--MRKKS-------KN---SL-QNKGGR-IQCSYCWAEESSRWRKGYNG-VLMCEDCFELV---------------------LVNNNTG---EPQ---------------EKD-----------------------KLLVTSEDYSYQPYLTRNFC---S-DKK---------FDDFE--SQAMY------LDSYEPVENQLFSLSFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDITAEH---RPIILQADSRNLTGPMF-------EVESYDHILSHPPYKNCVEYSTHIDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKRQRYCQAFGLGTYLCVQYDFLMFTHEFIATFKKVDKAHNNRMLVTP--DESTLS------------------GTVIFSRNLREIPVLPIARKSVVMGTTWTFKPTRTHSFVQLCTSRMVERFGRDFANWEEIQIKFNN---------MEPNNIANDNS-----------------SD----K--------------LTKFED--QIDNDEEDMPEYERIRQKRIKENQKMLLSL--GLKCDLGETS--DDISHLEKILHSMP----L------------PP---PVPTALIVVPHIPN--NLLTSQI-IPIYRTAIKHLAKEAYERLPPSGFFVIGGQDVR----------------TSD-NKLWPLSMLFMEDVNNSVGEDK-MPLKELVVTVPEGYAKDK-----------KKITKK-----DDYIEEQCILD--EEDKIEH-LSIVHAC--------------YLIFMKL---R---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Spun1000004719_Spizellomyces_punctatus_Spun1000004719                          ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MPTKSRPSICEMARR-------------YVCRRNG--QSPLQAFER----FCYPLALSVCRHAGLRFRTL--------HVKLPV-----TGVTGLCNVLCMPANSNTCSARIC------SILGNDAPSTRSARMSVFQTLPRS----------------K--------------ENPNNHSR--RMLRGMGQKRTAGC--CC-VVQEDRDPFDTQIPCTSCCQFFASCLPSFFSQD-----LVKKKKKV--------SRKSIKGRKSSSAN-ERDDD-----------------------------------FLEPVILPPIINSRKL-------ALPAN-CPAFHIDAIVEV-----RDGAK--TWWPGRVVTV-----QSGKVC---VHYDGWGDQYDEWIDCESQRIRLATQM-------------PADQCSERISQENEHG---------QTGEAGIGPDAVILVGRSVREKRKKSAAYTNQKNAKRKKAN----------------S------------------RTNVTVVNPI----------------------------------------------------------TSKSTESATPARPQGFN------------AQQPRSSSTSPVDNFGI------FRAAANREAREAYI---------------------------------SRRALANDA-TADDM--KRLYGK---G-----------------------------------ARVEVCCAGG----------ERYMATVIKTRSWQVLVQYDGWDEAWNEWIDMNSSKM------KLVEAA---------------------------------------------------------------------------------------------------------------------------------------------------------------------SGENN----------------------SSEDGNSSA--------------------GECSSE-----EDDEQ-----------------------------------------KWKIFCNRCEKRIRQYRYFCTYCEVPSE-----GF---EYESFDLCLACFQQDFPLDHPHPIQSFAVEPLLDTD-------DP--------------TRRK-FKDGEL-VSTFVLDEFDTSYIAMGT------------------------------------------------NQSQI-DAETVVTA--IPMVP------------------R----CAFCHSER--TDIV-GPFIG--PHPFRNTRISGRRMPLPSSEKKNSGKNRRVPIFWAHDACARFSPEVYFMKDSGKWHNVLKALARGRGVKCAACKERGATIGCFDVRCTRSYHVGCTRKPLSQFEEGVIFWCPRHESL-------VNKADNYKDVYNCDVCSNSLGLSDND----------E-QWHTCDECA-QNHFNTFDLCKECFE-G-R-FP-ETHDHGKDRFITTC---------------MSQRKEIREMEQQLARELVAAN---------ASRKS--LGQRRKK---KL-ERASG--IRCAYCWIDSSSRWRKGYNG-IPMCEDCFQMA--SSAFASSKAPELCATPEA-IPSPSPS---ES----LSQSPVNAPTPVPVR----IEADSPAPVLDPTSMER--VYRTEIEAYSHEYYLTRGVV-G-K-ASG---------ADEIGA-AEVNKSSEFGILQSYAPTDDQLFTMGFDTSFYDIPGRAPRWATHSGGDYHGTWLPQIVRMSLLRYTSEGERVLSNFSGRG-TDAIECFLLKRRCCSVDINPASVALSQRNVSFSVPPELGLTAAY---RPVIVLADSRELIGSLF-------EDESYDHILSHPPYKDCVSYSAHIEGDLSHFPDMEDFQKEMEKIVAETWRLLKPNRRCTLGIGDNRRECFYQPVSFQTIRTYINDGFELEELIVKRQRYCQMAPLGTYLCTQYNFLMFTHEFIAILRKVDDRQHSGLFSYLKVDDDHD-------------------FHVNPTRILRVIPAAPIDRTSIVMGTVWTFRVTQKHSLARLAMSKLIERFGTDSAYWEEVSISEFR-----------NKVIRAAVY-----------------DD--------LCAERD-----PPEEED--EEEENGTEVTEYERRRREQLSKNTRELLSM--GLISELSPEGE-DDAKHLETLLAMPP----VQTIQHEGCPTMHPP---S-SPVIIFVPHINAPSTAILPHAWINEYRKFVIDCARDAAARLTDGGYFIIGVKDARIFLPPKTDPSSENEQPCGIQTKYVPLGLLVSEDLSRYFEGSE-MRLKDFVVAVPEGYSRDKGIEFEEMKARIDEDEQE-WK--SEQEKEANGQS--NVR--RL-LPIVQAY--------------YFIYAKQ--------------AGNRKLSEPSS------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RMATCC62417_10446_Rhizopus_microsporus_727142291                               -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQDGPTE-CQHQH-----QPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPA------------------------------------------ELEKGNETNEEQEEISDYERQRLKRIEENNKTLLKL--GLISELSEKS--DDIIHYENMMSKSP----Y------------VE---S-DLVLMIVGHQQ-----ILPRY-INSYRQTLVDIAKEAIQRLAPKGMLIIGAQDIR----------------DPVSGKLWPMSMLILEDIERAVGRDD-IRLKELVVTVPDGYSKDR-----------QQKPRS-----EEEEDEMIDIE--TID--DF-VPIVHAV--------------YLIFQKL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RMATCC62417_10446_Rhizopus_microsporus_727142293                               -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQDGPTE-CQHQH-----QPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEV-------KIYKMSFT----Y---VF---------------------------------IYIYHF--RSLIKSNSL---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RMATCC62417_10446_Rhizopus_microsporus_727142292                               -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQDGPTE-CQHQH-----QPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPGLY---FV---------------------------------LLIFHS--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Ccor1000008322_Conidiobolus_coronatus_Ccor1000008322                           --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MKCTKCRMMGATVGCFNAKCPRIYHLTCCDKNPKLFLQGYIFYCPKHEAI-------ENKKQTYEEYYHCDHCKNSL-PR-ANTLGPFPDYSPD-EWFTCQACVEENFFSGFDLCTECFT-H-K-FKSIKHNHKANRFIRTTKEKLEVLLDVLANNKLTL-R-NDKLKGRKSLKNDKLNSIENIKD-IAKPED--SAPKKRKIIKKI-VQYQPN-IHCSYCWSTSSTIWRRGYMG-VLLCSKCFMNT--------STDKNLASIATQ-STSEVNE---DD----QDSEEIIIDV--VND----------LPSPPAQQDKF--GYHGNYEDYIHQPYHTRNLP-----QLNCLKYDPSQIAESS---VMANK--AIH-LETYGPTIYQAFSLDYKSTYYDIPGSAPRWASHSGSDYHGTWLPQTVRRAITRYTKEGDMVLSNFLGRG-TDAIECFLLKRKCIGIDINPVAVSLSQKNISFALPPSLLANSEFKYHRPTIIQGDARNLFNIL--------TNESISHVLSHPPYKDCVEYSNNIDGDLSKFSTNMEFCKEMQNVVNETWRVLKMGGRCTLGIGDNRDQCFYQPVSFDLLLLYMETGFQVEEIIVKRQRQCRAFGLGTFLCVKYDFLMFTHEFIITLRKVPITSRGSMS-----RNMKK-------------------SKFFQQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      GLOINDRAFT_316719_Rhizophagus_irregularis_DAOM_181602_552908586                -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRKKSKN---SL-QNKGGR-IQCSYCWAEESSRWRKGYNG-VLMCEDCFELV---------------------LVNNNTG---EP----------------QEK-D---------------------KLLVTSEDYSYQPYLTRNFC---S-DKK---------FDDFE--SQAMY------LDSYEPVENQLFSLSFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDITAEH---RPIILQADSRNLTGPMF-------EVESYDHILSHPPYKNCVEYSTHIDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKRQRYCQAFGLGTYLCVQYDFLMFTHEFIATFKKVDKAHNNRMLVTP--DESTLS------------------GTVIFSRNLREIPVLPIARKSVVMGTTWTFKPTRTHSFVQLCTSRMVERFGRDFANWEEIQIKFNNM------EPNNIANDNSSDK--------------------------------------LTKFED--QIDNDEEDMPEYERIRQKRIKENQKMLLSL--GLKCDLGETS--DDISHLEKILHSMP----L--------P---PP-V---PTALIVVPHIPNN--LLTSQI-IPIYRTAIKHLAKEAYERLPPSGFFVIGGQDVR----------------TSD-NKLWPLSMLFMEDVNNSVGEDK-MPLKELVVTVPEGYAKDK-----------KKITKK-----DDYIEEQCILD--EEDKIEH-LSIVHAC--------------YLIFMKL---------R---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Uram1000000474_Umbelopsis_ramanniana_Uram1000000474                            ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MCEACFELA-SLD-----------------LIQDELPLVLDE----------------EAD-H---------------------TYAASIEDYSHKPYLTREAL---S-STK---------FDDMK--SNAVR------LASYAPVEHQLFSLSFDSTYFDIPGRAPRWASHSGTDYHGTWLPQTVRRAILRHTRKDDRILSNFLGRG-TDAIECFLLQRRCCGVDINPAAVSLSQRSCSFETPPGLT-TAEH---RPIIVQADSRKLTGALF-------ADESYDHVLSHPPYKDCVAYSLHIEGDLSRYTNPLDFQEQYDKCVRESWRLLKMDRRLTLGIGDNREHCFYIPVGFQLIRLYINNGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKVPRNSNDKM-DSF--DASNTK------------------SQMRITYTCREIPRSPIARKSVVMGTVWLFKPSSRHSFAQLCTSRMVERFGKDESNWEHVELEIISD-------NDDSSLKPSLAN--------------------------------------IATDSM--VVEDEGLSISSYEIERQKRIDENRLALLQLVSSLSTPFIHIN--DDLTYL-FCMAGVD----I--------R---PQ-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      
      Alignment 2
      
                                                                                                                                                                                                       <-----Syapomorphic strand-helix unit-------->    Str-1                  Str-2*                                                                 Str-3                    Str-4 ****                                               Str-5                            Str-6                                                    Str-7 |                                            
      FINAL                                                                           -HHHHHHHHHHHHHHHH-------EEEEE----------------------------------------------------------------------------------------E---EEEEE---------------------HHHHHHHHHH------EEEEE------HHHHHHHHH---EEEEE--HHHHHHHHHHHHH------------------------------------------------EEEEEE--HHHHHHHH------------EEEEEE------EEEE----------------HHHHHHHHHHHHHHHHHHHHH---EEEEEEE------EEE--HHHHHHHHHHH--HHHHHHHHH----H------HHHH---HH-----------HHH---H----EEEEHHHHEEEEE-|---------------------------------------------
      ALIGN                                                                           ------E-----------------EEEEH------H--HH--------H-----------------------------------------------------------------E-EE---EEEE---------------------------EEEEEE-----EEEEE-------HHHHHHHH---EEEEE--HHHHHHHHH----------------------------------------------------EEEEE-----------------------EEEEE------EEEEE--------------H-HHHHHHHHHHHHHHHHHHHHH----EEEEE------------HHHHHHHHHHH-HHHHHHHHHH----H-------------EE-----------------HH--HHHHHHHHHHHHHH-|---------------------------------------------
      HMM                                                                             ------E--H-HHHHHH-------EEEEE-------------------HH---------------------------------------------------------------EE-EE--EEEEEE--------EE-----EEEEE-HHHHHHHHHHE-----EEEE-------HHHHHHHHH---EEEEEE-HHHHHHHHHHH--------------------------------------------------EEEEEEE-HHHH---E-----------EEEEEEE----EEEEEE--------HHHH----HHHHHHHHHHHHHHHHHHHHH---EEEEEEEEE----EEEEEHHHHHHHHHHHH-HEHHEEEEE----EEE----HHHH---HH-----------HHH---H----EEEHHHHHHEEEEE|---------------------------------------------
      FREQ                                                                            -HHHHHHHHHHHHHHHH--------EEEE----------------------------------------------------------------------------------------EEE--EEEE------------------------HHHHHHE------EEE--------HHHHHHHHH----EEE---HHHHHHHHHHH--------------------------------------------------EEEEE---HHHHHHHH------------EEEEE-------EEEE-----------------HHHHHHHHHHHHHHHHHHHH----EEEEE--------E----HHHHHHHHHH-HHHHHHHHHH------------HH----HH-----------HHH---HH--HHHHHHHHHHEEE--|---------------------------------------------
      PSSM                                                                            -------HHHHHHHHH--------EEEEE----------------------------------------------------------------------------------------------EE----------------------HHHHHHHHHHH-----EEEE-------HHHHHHHH----EEEEE--HHHHHHHHHHHHHHHH-----------------------------------------------EEEE----HHHHHH--------------EEEEE------------------------HHHHHHHHHHHHHHHHHHHHH----EEEEEE------------HHHHHHHHHH----HHHHHHHH----H---------------------------------------EEE--EEEEEEE|---------------------------------------------
      RMATCC62417_10446_Rhizopus_microsporus_727142292                                QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L-------
      RMATCC62417_10446_Rhizopus_microsporus_727142293                                QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L-------
      RMATCC62417_10446_Rhizopus_microsporus_727142291                                QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L-------
      RMCBS344292_14260_Rhizopus_microsporus_729703045                                QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----IGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L-------
      RMCBS344292_09167_Rhizopus_microsporus_729708575                                QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LTFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTAKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTTIEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENVDRM---------L-------
      RO3G_02774_Rhizopus_delemar_RA_99-880_384485890                                 QQYMTSIEDYSHKPYFTRD----TATKVN------N-DSA--V------GQ-----R--------L---GS----Y--EP------------------------QPN-QL---FS-LTFDSTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKYTSKDERILSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCCFEIPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LEGDLSRFTSIEEFNREYTKVVEESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRIYIDEGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---K-ENIDRM---------L-------
      HMPREF1544_03082_Mucor_circinelloides_f_circinelloides_1006PhL_511008850        ESYAASIEDYSHKPYLTRD----T---VS------SSNKP--F-----IGP-----R--------L---TS----Y--EP------------------------QSN-QL---FS-LTFDSTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRALLKYTKQNERILSNFLGRG-TDAIECFLLQRKCCGIDINPAAVALSQRNCCFEVPAGLT----------------------------------FAE---YRPIIALADARQLK-----GS--LFGDESYDHILSHPPYKDCVAYS-TH---LEGDLSRFTQLNDFKAEYTRVIRESYRVLKMDRRLTLGIGDNREHCFYVPVGFHLIRLYIDQGFELEELIVKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---Q-QQVNRM---------P-------
      MAM1_0127c06017_Mucor_ambiguus_758351301                                        ENYIASIEDYSHKPYFTRE----T---LS------M-NKS--L-----IGS-----R--------L---TS----Y--GP------------------------QSN-QL---FS-LTFDSTYFDIPGRAPRWATHS-GTDYQGTWLPQTVRRALLRHTKKDERILSNFLGRG-TDAIECFLLQRKCCGIDINPVAVALSQRNCCFEVPAGLT----------------------------------FAK---HRPIIALTDARQLN-----GS--LFGDESYHHILSHPPYKDCIAYS-TH---LEGDLSRFTQLEEFKKEYMRVVQESYRVLKMDRRLTLGIGDNREHCFYVPIGFHLIRLYIDQGFELEELIVKR----QRYCS-AYGLG---TY-----------LCV---QF--DFLIFTHEFIATFKK|----V------------------P---Q-QQVNKM---------P-------
      GLOINDRAFT_316719_Rhizophagus_irregularis_DAOM_181602_552908586                 DKLLVTSEDYSYQPYLTRN----FCSDKK------F-DDF--E-----SQA---M-Y--------L---DS----Y--EP------------------------VEN-QL---FS-LSFDSSYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDI---------------------------------TAE---HRPIILQADSRNLT-----GP--MFEVESYDHILSHPPYKNCVEYS-TH---IDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKR----QRYCQ-AFGLG---TY-----------LCV---QY--DFLMFTHEFIATFKK|----V------------------D---K-AHNNRM---------L-------
      LCOR_11540.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661176173                    DSYVTQIEDYTHKPYLTRD----ALSSTK------F-SND---------GKVAPVPR--------L---ST----Y--EP------------------------QPH-QL---FS-LVFDSTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRSILKHTDKDERVLSNFLGRG-TDAIECFLLQRRCVGVDINPAAVALSQRNCCFEIPPGMT----------------------------------SAE---YRPIIAQADSRHLE-----GS--LFGDESFHHILSHPPYKDCVAYS-TH---LEGDLSRFTNIEEFKMEYVKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---L-SNMDRM---------M-------
      MVEG_09762_Mortierella_verticillata_NRRL_6337_672819038                         GRYATSAEDYSHTPYLTRT----SVSAVR------F-DHS--S-----SQA---V-Y--------L---DS----Y--GP------------------------SEN-QL---YS-LPIDTTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRAVTKYTNPNDKILSNFLGRG-TDAIECFLLGRCCTAVDINPAAITLSIRNCSFAIPPNGTV---------------------------------KAE---HRPTILQGDSRKLT-----GP--LFESESFDHVLSHPPYKDCVAYS-TH---IDGDLSRFGNSIEFQREMTHVVQETYRLLKMGRRCTLGIGDNREHCFYIPVSFQLIRQYINQGFELEELIVKR----QRYCA-MFGLG---TY-----------LCV---QF--DFLCFTHEFIATLRK|----V------------------P---K-QGHDTM---------I-------
      LRAMOSA00608_Absidia_idahoensis_var_thermophila_671688888                       DSYVTQIEDYTHKPYLTRD----ALSSTK------F-SND---------GKIAHVPR--------L---ST----Y--EP------------------------QPH-QL---FS-LVFDSTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRSILKHTEKDERVLSNFLGREWYKESMC----RRGYQSGKYKAAVALSQRNCCFEIPPGMT----------------------------------SAE---YRPIIAQADSRHLE-----GS--LFGDESFHHILSHPPYKDCVAYS-TH---LEGDLSRFTNIEDFKMEYIKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---L-NNIDRM---------M-------
      RirG_262840_Rhizophagus_irregularis_DAOM_197198w_595436939                      DKLLVTSEDYSYQPYLTRN----FCSDKK------F-DDF--E-----SQA---M-Y--------L---DS----Y--EP------------------------VEN-QL---FS-LSFDSSYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDI---------------------------------TAE---HRPIILQADSRNLT-----GP--MFEVESYDHILSHPPYKNCVEYS-TH---IDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKR----QRYCQ-AFGLG---TY-----------LCV---QY--DFLMFTHEFIATFKK|----V------------------D---K-AHNNRM---------L-------
      PARPA_01280.1 scaffold 1359_Parasitella_parasitica_758369443                    DSYAASIEDYSHKPYFTRD----T---LS------L-TKP--V-----VGP-----R--------L---TS----Y--EP------------------------QPN-QT---FS-LTFDSTYFDIPGRAPRWASHS-GTDYHGTWLPQTVRRALLKYTKKDERVLSNFLGRG-TDAIECFLLQRKCCGIDINPAAIALSQRNCCFQVPEGLT----------------------------------FAE---YRPIIALTDARQLN-----GS--LFNDESYHHVLSHPPYKDCIAYS-TH---LDGDLSRFTRIEDFKQEYTRVIRESYRVLKMSRRVTLGIGDNREHCFYVPVGFHLLRLYIDQGFELEELIVKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLIFTHEFIATFKK|----I------------------P---H-HRVDKM---------T-------
      Bcir1000010688_Backusella_circina_Bcir1000010688                                QRYVSSIEDYSHKPYFTRE----ALSSTK------F-SDA--S-----TGR-----R--------L---ES----Y--EP------------------------QPN-QY---FS-LTFDSSYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKYTVKDERVLSNFLGRG-TDAIECFLLQRRCCGIDINPAAVSLSQRNCCFEIPPGLT----------------------------------SAE---YRPIVAQADARQLT-----GS--LFGDESFHHVLSHPPYKDCVAYS-TH---IDGDLSRYTHIDDFKVEYNKVVKESWRLLKMSRRLTLGIGDNREHCFYIPVGFHLIRLYIDQGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIGTFKK|----I------------------P---L-ENIDRM---------L-------
      Pbla1000013272_Phycomyces_blakesleeanus_Pbla1000013272                          HRYVMSIEDYTHKPYLTRD----AVSATK------F-SDH--R-----TGP-----R--------L---AS----Y--GP------------------------QPN-QL---FS-LVFDSTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKYTNKDERVLSNFLGRG-TDAIECFLLQRRCCGVDINPAAVALSQRNCCFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFADESFHHVLSHPPYKDCVAYS-TH---LEGDLSRFTSVEDFRAEYGRVVRESWRLLKMGRRLTLGIGDNREHCFYIPVGFHLLREYINHGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFVATFRK|----I------------------P---L-ECTDKM---------L-------
      Uram1000000474_Umbelopsis_ramanniana_Uram1000000474                             HTYAASIEDYSHKPYLTRE----ALSSTK------F-DDM--K-----SNA---V-R--------L---AS----Y--AP------------------------VEH-QL---FS-LSFDSTYFDIPGRAPRWASHS-GTDYHGTWLPQTVRRAILRHTRKDDRILSNFLGRG-TDAIECFLLQRRCCGVDINPAAVSLSQRSCSFETPPGLT----------------------------------TAE---HRPIIVQADSRKLT-----GA--LFADESYDHVLSHPPYKDCVAYS-LH---IEGDLSRYTNPLDFQEQYDKCVRESWRLLKMDRRLTLGIGDNREHCFYIPVGFQLIRLYINNGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----V------------------P---R-NSNDKM---------D-------
      Crev1000002507_Coemansia_reversa_Crev1000002507                                 QQQGALIEDYTQGIYFTRE----ACIAPNRVGLPSV-SQQ--P-----LGE--------------L---SS----Y--GP------------------------TDS-ML---FT-LPVNTSYFDIPGRAPRWASHS-GTDYHGTWLPQTVRRALLRYTQRGEHVLSNFLGRG-TDAIECFLLNRKCVGVDINPSAVSLSQRNCSFTITPGCGM---------------------------------SIE---FRPTIMQGDARDLRSDLWPGASYFAESESFDHILSHPPYKDCVLYS-TN---IDGDLSRFPGPDEFQREMEKVVTESWRLLKMGRHLTLGIGDNRAECFYIPVSYQLIRTYISSGFELEELVVKR----QRYCQ-AFGLG---TY-----------LCV---QF--DFLMFTHEFIATLRK|----V------------------P---K-DQIDSM---------H-------
      Spun1000004719_Spizellomyces_punctatus_Spun1000004719                           RVYRTEIEAYSHEYYLTRG----VVGKAS--------GAD--E-----IGA-AEV-NKSS-EFGIL---QS----Y--AP------------------------TDD-QL---FT-MGFDTSFYDIPGRAPRWATHS-GGDYHGTWLPQIVRMSLLRYTSEGERVLSNFSGRG-TDAIECFLLKRRCCSVDINPASVALSQRNVSFSVPPELGL---------------------------------TAA---YRPVIVLADSRELIGS-------LFEDESYDHILSHPPYKDCVSYS-AH---IEGDLSHFPDMEDFQKEMEKIVAETWRLLKPNRRCTLGIGDNRRECFYQPVSFQTIRTYINDGFELEELIVKR----QRYCQ-MAPLG---TY-----------LCT---QY--NFLMFTHEFIAILRK|----V------------------D---D-RQHSGL---------F-------
      Ccor1000008322_Conidiobolus_coronatus_Ccor1000008322                            FGYHGNYEDYIHQPYHTRNLPQLNCLKYD-PSQI-A-ESS--V-----MAN-----K-AI-H---L---ET----Y--GP------------------------TIY-QA---FS-LDYKSTYYDIPGSAPRWASHS-GSDYHGTWLPQTVRRAITRYTKEGDMVLSNFLGRG-TDAIECFLLKRKCIGIDINPVAVSLSQKNISFALPPSLLANSE------------------------------FKY---HRPTIIQGDARNLFN--------ILTNESISHVLSHPPYKDCVEYS-NN---IDGDLSKFSTNMEFCKEMQNVVNETWRVLKMGGRCTLGIGDNRDQCFYQPVSFDLLLLYMETGFQVEEIIVKR----QRQCR-AFGLG---TF-----------LCV---KY--DFLMFTHEFIITLRK|----V------------------P---I-TSRGSM---------S-------
      DESFE_RS02250_Desulfurococcus_fermentans_504580152                              ---MRLVNYNEYLNHVSKR----NTVIVE--------GEE----IE--LKP--------I-K---V---KR----M----------------------------TPL-PE-E-LL-DSS-STVWSFPKRG-SWATH--RGDYRGNWPPQVARLLIERYSDPGNIVLDPMIGSG-TTCIEAKLLGRNCIGVDISYEAVILTLHRLYWLEKTLENPPDDAGS---------------------IDL-ENARR---AVVEIYHGDARRLS---------RVRDGTIDLVITHPPYFNIIKYS-S--R-VDGDLSRASSLEEYLKWFNEAAGEIYRVLKPGGHLGILIGDTRIRKYYVPISHHVLEILLRRGFILREEVVKI----QHKMKTTREVW------------S---RLK---DR--DFLLIYHEKLYVMRK|-----------------------P-----RNQEEY--EKYKY----------
      SPHMEL_RS03490_Desulfurococcus_amylolyticus_756979360                           ---MRLVDYNEYLNYVSKR----NTVIVE--------GEE----IE--LKP--------I-K---V---KR----M----------------------------MPL-PE-E-LP-DSS-STVWSFPKRG-TWATH--RGDYRGNWPPQVARLLIERYSNPGDIVLDPMIGSG-TTCIEAKLLGRNCIGVDISYEAVILTLHRLYWLEKTLENPPNDACS---------------------IDL-ENARR---AVVEIYHGDARRLS---------RVRDETIDLVITHPPYFNIIKYS-S--R-VDGDLSRASSLEEYLKWFNEATGEIYRVLKPGGHLGILIGDTRIRKYYVPISHHVLEILLRRGFILREEVVKI----QHKMKTTREVW------------S---RLK---DR--DFLLIYHEKLYVMRK|-----------------------P-----RNQEEY--EKYKY----------
      DKAM_RS02320_Desulfurococcus_kamchatkensis_501637311                            ---MRLVDYNEYLNYVSKR----NTVIVE--------GEE----IE--LKP--------I-K---V---KR----M----------------------------MPL-PE-E-LP-DSS-STVWSFPKRG-TWATH--RGDYRGNWPPQVARLLIERYSNPGDIVLDPMIGSG-TTCIEAKLLGRNCIGVDISYEAVILTLHRLYWLEKTLENPPNDAGS---------------------IDL-ENARR---AVVEIYHGDARRLS---------RVRDETIDLVITHPPYFNIIKYS-S--R-VDGDLSRASSLEEYLKWFNEATGEIYRVLKPGGHLGILIGDTRIRKYYVPISHHVLEILLRRGFILREEVVKI----QHKMKTTREVW------------S---RLK---DR--DFLLIYHEKLYVMRK|-----------------------P-----RNQEEY--EKYKY----------
      DICTH_1800_Dictyoglomus_thermophilum_H-6-12_206739986                           YKDMKEITKEDYIRFVEEN----EFVIIE--------DVK----VK--LNK--------N-W---D-I-KS----Y----------------------------SPP----ENYT-PEK-TTVWSFPDRG-SWATH--KGNYRGNWSPYIPRNLILKYTAKGDWVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFSYNPLFP---------------------------------KYSE---PIIKTYWGDARNLN---------KIEDNSIDLIATHPPYAGIISYT-KN-KKQSDDLSQL-PLEEYLKEMEKVAEESFRVLKPGKVCAILIGDTRKHKYYVPIAYRVMQVFLEVGFILKEDIIKL----QWNMKATRERW----------------RAK---EY--EFYLIGHEHIFVFRK|-----------------------P-----EDEKEY--KKYKF----------
      Smar_0588_Staphylothermus_marinus_500164270                                     VS-MRKVSYEDYLEFLKNN----KVIEIE--------GSK----IS--LEP--------I-H---V---KH----L----------------------------YPL-PE-E-LT-DIS-TTVWSFPKRG-SWATH--RGNYRGNWPPQMARALIQKYTMPGDTVLDPMIGSG-TTCIEAKLLGRNCIGVDINYNALMLTLHRLYWLEK-YLEKKA------------SKPQEIIEGENSPISI---EDILN-AKVEIYHGDARNLD---------KISNNSIDLVATHPPYFNIIRYS-RGEK-IPGDLSGARKLEEYLSMIQQVISEAYRVLKPGHYMGILVGDTRIHKHYVPITHYVLQTLLKTGFILKEEVVKI----QHKMKTTREVW------------S---KLK---NK--DFLLIYHEKLFILRK|-----------------------P-----INKKEY--RKYKY----------
      Shell_0210_Staphylothermus_hellenicus_502907573                                 AG-MRRVSYDDYLEFLKNN----RAVEIE--------GNR----IS--LEP--------I-R---V---KR----L----------------------------YPL-PQ-E-LT-DIS-TTVWSFPKRG-SWATH--RGDYRGNWPPQMARALILAYTMPGETVLDPMIGSG-TTCIEAKLLGRNCIGVDINYNAVILTLHRLYWLEK-YLEKQA------------S-TQEIFGGEYSPVSI---EDILK-ARVEIYHGDARNLD---------KISSNSIDLVATHPPYYNIIRYS-RTKK-IPGDLSGARRLEEYLAMIQQVGKEAFRVLKPGRILGILIGDTRIHKHYVPITHHVLETLLKTGFILKEEVVKI----QHKMKTTREIW------------S---KLK---NK--DFLLIYHEKLFILRK|-----------------------P-----IDKKEY--RKYKY----------
      D891_RS0103440_Hippea_sp_KM1_643385301                                          ---MREIKQEDYLEFIKNH----SEVVIG--------NSA----VK--LEG--------N-F---VII-SN----C----------------------------SPS----ENYI-PER-TSVWSFPDRG-KWATH--RGNYRGNWSPYIPRNLILKYTEKNDWVLDQMMGSG-TTLVEAKLLQRNAIGIDINLEAVMVSRDRLNFSCDSSVHNDY--R----------------------------E-----PIIKTYWGDARNLD---------KIDNNSIDLIATHPPYANIISYS-RK-KKIESDLSSM-PLKKYISGMKEVARESYRVLKPGKICSILIGDARKHKHYIPISNMIMEIFLNSGFILKEDIIKI----QWNMKATRENW----------------RAK---QY--DFFLIAHEHIFVFRK|-----------------------P-----ENEKDL--KKHRF----------
      DICTH_RS08720_Dictyoglomus_thermophilum_754082338                               ---MKEITKEDYIRFVEEN----EFVIIE--------DVK----VK--LNK--------N-W---D-I-KS----Y----------------------------SPP----ENYT-PEK-TTVWSFPDRG-SWATH--KGNYRGNWSPYIPRNLILKYTAKGDWVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFSYN-PLFPKY--S----------------------------E-----PIIKTYWGDARNLN---------KIEDNSIDLIATHPPYAGIISYT-KN-KKQSDDLSQL-PLEEYLKEMEKVAEESFRVLKPGKVCAILIGDTRKHKYYVPIAYRVMQVFLEVGFILKEDIIKL----QWNMKATRERW----------------RAK---EY--EFYLIGHEHIFVFRK|-----------------------P-----EDEKEY--KKYKF----------
      _Thermofilum_sp_1910b_530785168                                                 ---MRLVTREEYLEYIKTH----RTVRIE--------DEE----IP--IGK--------P-H---R-I-EK----Y----------------------------APD----D-SA-LET-TTVWSFPDRG-DWATH--RGDYRGNWAPQIPRNLILRYSRPGETVLDQMCGSG-TTLVESKLLGRNAIGVDINYEAVMLTMDRLNFSYR-PLDPDY--R----------------------------E-----PEIRVYHGDARNLN---------LIEDESIDLIATHPPYANIISYS-KA-KRIEGDLSQVYSLEEYLQGIREVAKESFRVLKPGRYCAILIGDTRRHRHYVPIAFRVMQQFLEVGFILREDIIKI----QWNTKTTEKKWARLAKTSEENWID-KPENK---KHWTDFYLIAHEHLFVFRK|-----------------------P-----AEGEDI--EKYRD----------
      CSUB_C0599_Candidatus_Caldiarchaeum_subterraneum_526884977                      ---MRPVTWEDYRRYVAEK----GYVEVE--------DVR----IE--IGK--------P-H---K-I-NS----Y----------------------------SPP----ASYN-LEA-TTVWSFPDRG-DWATH--SGDYRGNWSPYVPRNLILKYTKPGELVLDQMVGSG-TTLVEATLLGRNAIGVDINYEACILTLDRLNFEFH-PL-DEQ--Q----------------------------E-----PVIKVYHGDAAKLN---------IIEDESVDLIATHPPYWNIIPYS-RK-RPEGDLSAYR-KLEDYLGKMMQIARESYRVLKPGRYCAILIGDTRKHKHYVPISTYVMLKFLQAGFVLAEDIIKL----QHKMKTTREKW----------------SGKNFEQY--GFHKIAHEHLYIFRK|-----------------------P-----ENEEER--GRLSL----------
      I774_RS01625_Aigarchaeota_archaeon_JGI_0000106-J15_756971970                    ---MREITDEDYREFVKTH----REIMVE--------NVK----IR--IGQ--------E-R---R-I-YE----Y----------------------------QPR----D-FV-LET-TNVWSFPERG-AWATH--QGNFRGNWPPQLVRNIILRYSKPGETVLDQMCGSG-TTLIECKLLGRNAIGVDINLDCVMLTRDRLNFEYT-LLDADY--P----------------------------R-----VTIKTYVGDARNLN---------LVEDDSIHLIATHPPYVNIIPYS-RR-KEIEGDLSAVHSIDEYIEGMRKIAEECYRVLKPGRFCTILVGDIRRHRHHVPVAFRTMQVFLESGFILREDIIKH----QWKTKTTREKWEGLTKVAEECWVDIDRKVR---KYYMDFYLLYYEHLFIFRK|-----------------------P-----DKNENL--DQYKD----S-----
      Mpt1_c10100_Candidatus_Methanoplasma_termitum_731481703                         LYISKDIVYQPMIEILDSK----AEDVVV--------ENKKIVPWE--IYK--------K-G---K-I-SG----I----------------------------QPS----D-FK-LER-TTVWSFPDRG-DWASH--TPQYRGNWSPRVVRNIIELYSKPGDLVLDPMVGGG-TTPVECMLTGRNSISIDINQGAISITRNRLELPESMK----------------------------------KQIPK---TVHRTFIGDVRNLD---------KIADESIDLIATHPPYANIIKYA-PS---VDGDLSQINDYDVFFSEFKKAIKEFHRVLKPGAYCSILMGDTHNRSHFVPITARLMFDFLKEGFVLKEDIIKK----EWNCE-SDRNL------------G---KYS---NS--SFLLTMHEHLFVFRK|-----------------------L-N---KGEGAL--KNSSR----S-F-FE
      _Thermogladius_cellulolyticus_504550199                                         MVGMREVPISEYLEFVSRN----REVVVE--------DQV----IR--LDP--------I-E---V---KR----L----------------------------EPL-PE-E-LT-DVS-TTVWSFPVRG-GWATH--RGDYRGNWAPQIPRALILMYTRPGDVVLDPMVGSG-TTLIEAKLLGRNSIGVDINYNAVMLALHRLYYLEK-AAL------EYLKRLREGA--GAVGGGAGPAFGDAMPEDVER-AWYKVYHGDARSLS---------LLGSESVDLVATHPPYFNIIDYG-GGER-PEGDLSAARDLEEYLRWVREVAGELYRVLKPGKYCAVLVGDTRVHKHYVPISHYVLQAFLDAGFLLKEEVIKV----QHKMKTTREVW------------S---RVK---NR--DFLLIYHEKLFILRK|-----------------------P-----GQSEESRPSRLKYSGR-------
      DESMU_RS04730_Desulfurococcus_mucosus_503327799                                 ---MREVTVGEYLDFVSRN----RRIIVG--------GQE----VD--LSP--------I-E---V---RR----L----------------------------EPS-AD-E-LT-DVS-TTIWSFPKRG-SWATH--RGDYRGNWPPQMARALILGYTEPGEIVLDPMAGSG-TTCIEAVLLGRKCIAVDINYNAVMLTHHRLYYLVN-ARL------------KQGALPGLDAGGEGTGV-----------QGYRVFHGDARRLD---------EIRDNTVDLVATHPPYFNIIGYG-GN---VDGDLSNARTLEEYLEWLREVAGEIYRVLKPGRYCGILIGDTRVHGHYVPITHYALEVFLDAGFILKEEVIKI----QHKMKTTREVW------------N---RLR---KR--NFLLIYHEKLFVFRK|-----------------------P-----GVGEDT--GKLRYSMKLP-----
      Tagg_1290_Thermosphaera_aggregans_502895170                                     MY-VREVTVEEYLDFVSRN----TSITID--------GQS----IP--LKP--------I-K---V---NR----L----------------------------DPS-PQ-E-LP-DVS-TTVWSFPKRG-SWATH--KGDYRGNWPPQIPRALILKYTSEGDVVLDPMVGSG-TTCIEAVLLGRNCIGVDLNYHAVMLTHHRLYYLVK-AEL---------------------------SRGREPGR-----AWYKIYHGDARRLD---------KIRDDTVDLVLTHPPYLNIVRYG-EE-R-SEGDLSAVRGLEEFLVLFKEIAREVYRVLKPGKTLAVLVGDTRIKKHYVPLTHYVLLTLLDTGFVMMEEVVKI----QHKMKTTREVW------------S---RLR---NR--DFLLIYHEKLFILRK|-----------------------P-----VDRE----PKVKYSG--------
      METIN_RS02115_Methanocaldococcus_infernus_502864863                             ---MKEVTYDDYFEFIKEH----SYVTIE--------DTK----LE--IGK--------D-W---K-I-KK----F----------------------------QPD----N-FE-LEP-TNVWSFPKRG-DWATHYLNSKYRGNWAPQVARNLILRYSKEGETVLDPFVGSG-TTLIEAKLLFRNAIGVDINRDAVMLTLDRLRFNYNPL-------D-------------------------INEKPK---TWIKVFVGDARNLD---------KIEDESIDLIATHPPYVNIVKYT-KK-SEVDGDLSKVRSVEDFVNEMRKVAREFFRVLKPGRYCAILIGDTRRNKHHVPVSFRVMQAFLEEGFILKEDIIKI----QHNMR-VTPLW---KK-----------RSQ---EL--NFLLLKYEHLFVFRK|-----------------------P-----ESDEKL--SKFKE----S-----
      MBMB1_RS02390_Methanobacterium_sp_MB1_746331486                                 ---MKEKTHEDYNSFLKNN----RFIVIE--------DGKKTLELK--IGK--------K-H---D-P-IE----F----------------------------APE----D-FK-LEI-VNVWSFPKRG-KWATH--GGEYRGNWAPEIPRNILLRYSEAGDVVLDQFLGSG-TTLIECKLLGRKGIGIDVNLNAIMLTRDRLNFNYNPF-------E----------------------------IPI---YEQKTFMGDARDLD---------LIKNESIDLIATHPPYANIIRYS-KD-K-IPEDISNVKNIDEYIKEMEKVASESYRVLKNGKHAAILVGDTRRNKHHIPVAFRVMQAFLEAGFILREDIIKV----QHQMK-GTTFW---AK-----------RSQ---EL--NFLLLKHEHLFVFRK|-----------------------P-----EKDEKT--GKYKF----S-----
      MBMB1_0500_Methanobacterium_sp_MB1_557946003                                    HYYMKEKTHEDYNSFLKNN----RFIVIE--------DGKKTLELK--IGK--------K-H---D-P-IE----F----------------------------APE----D-FK-LEI-VNVWSFPKRG-KWATH--GGEYRGNWAPEIPRNILLRYSEAGDVVLDQFLGSG-TTLIECKLLGRKGIGIDVNLNAIMLTRDRLNFNYNPF-------E----------------------------IPI---YEQKTFMGDARDLD---------LIKNESIDLIATHPPYANIIRYS-KD-K-IPEDISNVKNIDEYIKEMEKVASESYRVLKNGKHAAILVGDTRRNKHHIPVAFRVMQAFLEAGFILREDIIKV----QHQMK-GTTFW---AK-----------RSQ---EL--NFLLLKHEHLFVFRK|-----------------------P-----EKDEKT--GKYKF----S-----
      FACI_RS02255_Ferroplasma_acidarmanus_518679720                                  ---MKRITLDDYNNYKKLN----DIVTIE--------DNK----IK--IGE--------K-N---I-I-ET----L----------------------------EPE----K-FN-LEI-DNVWSFPERG-KWCTHYLNAKYRGNYAPQLPRNIILRYSKENDLILDPFSGSG-TTLIEAKLLKRHGIGMDINLGSAMITMDRLNFNNSEN-------N----------------------------L-----IEPEIFNGDARNLN---------EIEDESIDLIMTHPPYANIIKYS-KD-NIIKDDLSSIESLEEYYKKFKKVIKEMHRTLKKGKYCAILIGDTRKKGYQIPISFTIMQLFLKEGFVLKEDIIKV----QHNTK-TRHYW---AS-----------LSI---KN--NFMLLAYEHLFVFKK|-----------------------L----------------------------
      BJBARM5_0369_Candidatus_Parvarchaeum_acidophilus_ARMAN-5_290559536              ---MKEVTLDNFRDFARTH----NSVKIE--------DNT----IE--IGM--------Q-K---Q-I-TM----L----------------------------QPT----D-FS-PET-TTVWSFPKRG-DWATHYLNSKYRGNWAPQIPRNLILEYTNPEDIVLDPMNGSG-TTLIECKLLGRNGIGVDINEEAIMIALDRLNFQAHEL-------P----------------------------S-----SEIKTFVGDARNLN---------LIKDNAIDLILTHPPYVNIISYT-YN-R-VEGDLSSISSVSEFIEEINKLAVEFFRVIKPGKYCAILMGDTRRHSHYIPVTFRTMQAFLEAGFALKEDIIKL----QWNMQSTRQNW---AG-----------------KQ--NFYKIAHEHLFVFRK|-----------------------P-----THDERL--SELKE----------
      I759_RS06660_Euryarchaeota_archaeon_SCGC_AAA252-I15_754482757                   --------------------------------------------------------------------------------------------------------------------------MWSFPKRG-DWATH--RGDYRGNWAPEIPRNLILRYSTEGDTVLDQMVGGG-TTLIECKLLGRNGIGVDINSDAIMITRDRLRFDSIDE-------N----------------------------FPE---TGQKTYVGNARNLD---------KISDESIDLIATHPPYLNIIPYT-QE-Q-VKGDLSSVHDLNEFAEEMKIVAQESIRVLKSGKYCGILIGDTRRHKHYVPISARILQAFLKAGFILKEDIIKQ----QWNCK-ATGFW---KK-----------KSQ---ES--NFLLIMHEHLYVFRK|-----------------------P-----EKDEET--TRLKD----S-----
      _EM3_bacterium_JGI_0000106-B10_658542249                                        -------------------------------------------------------------------M-EK----F----------------------------EPE----N-FK-LET-TTVWSFPERG-EWATH--KGNYRANWSPYIPRNLILRYTQEGDLVLDQMVGSG-TTLIECKLLNRRGIGVDINHDAIMVTRNRLDFKYK------Y-------------------------------D-----PEIKTYVGDARNLN---------LIPDETIDLIATHPPYANIVKFS-NN-R-IEGDLSNVKNIDEFINEMIKVARESYRVLKPGKHCAILIGDTRKRKHFVPIATRVLEVFLKVGFILREDVIKL----QWKMKGTREKW----------------RGS---KY--DFLLLAHEHLFIFRK|-----------------------P-----GKDEKL--TLFKD----SII---
      H17AP60334_RS10630_Thermosipho_africanus_490206260                              ---------------------------------------------------------------------------M----------------------------EIN----N--K-LEI-TTVWSFPERG-KWETH--NSKYRGNFAPQIPRNLILKYSKEGEVILDPMVGSG-TTLIEAKILNRKSLGYDINPKSVEITKQNLEFQGD------Y-------------------------------K-----YEPIVKVGDARNLS---------EIEDNTIDLIITHPPYLNIIKYS-EG-T-IEGDLSNISNVEKFIKEIDKIAKELFRVLKENKYCAILIGDTRKRGHYVPLSYYVLKAFLNNGFVLKEDIIKV----QHNCK-STPYW---EK-----------QVK---KY--NFHLIMHEHLFIFRK|-----------------------P-----SKNENL--SPIKF----S-TNYL
      TMEL_RS01630_Thermosipho_melanesiensis_501003459                                ---------------------------------------------------------------------------M----------------------------DNF----D--K-LEI-TTVWSFPKRG-KWKTH--NSRYRGNFAPQIPRNVILRYSNESETILDPMVGSG-TTLIEAKILNRKSIGYDINPESIELTKRNLNFEGN------Y-------------------------------K-----YEPAVKIGDARNLY---------EIKNETIDLIITHPPYLNIIKYS-SG-K-IKQDLSNISDVNKFILEFEKIVKELYRVLKENKYCAILIGDTRRKGHYIPLSFYVMKIFLKNRFVLKEDIIKI----QHNCQ-STPFW---EK-----------QVK---KY--NFYLIMHEHLFVFRK|-----------------------P-----KKDENL--THIKY----S-TGLF
      _Candidatus_Calescibacterium_nevadense_551115149                                ---MREITEEDYRTFLKTH----DFVIIE--------NVK----VP--LIK--------E-H---K-I-EK----F----------------------------EPE----N-FK-LET-TTVWSFPERG-EWATH--KGNYRANWSPYIPRNLILRYTQEGDLVLDQMVGSG-TTLIECKLLNRRGIGVDINPDAIMVTRNRLDFKYK------Y-------------------------------D-----PEIKTYVGDARNLN---------LIPDETIDLIATHPPYANIVKFS-NN-R-IEGDLSNVKNIDEFINEMIKVARESYRVLKPGKHCAILIGDTRKRKHFVPIATRVLEVFLKVGFILREDVIKL----QWKMKGTREKW----------------RGS---KY--DFLLLAHEHLFIFRK|-----------------------P-----GKDEKL--TLFKD----SII---
      TTHWC1_RS07155_Thermoanaerobacter_489963634                                     ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGNYRGNFAPQIPRNVILRYSQEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGD------Y-------------------------------K-----YEQVVRVGDVRNLK---------EISDISIDLIITHPPYLNIIKYS-NG-R-IEGDLSNISDVKKFCDELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---ER-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY
      THYS13_RS12835_Thermoanaerobacter_sp_YS13_757582161                             ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGNYRGNFAPQIPRNVILRYSHEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGN------Y-------------------------------K-----YEQIVRVGDVRNLK---------DIGDSSIDLIITHPPYLNIIKYS-NG-T-IEGDLSNISDVKKFCDELKKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---ER-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY
      M663_RS0111020_Thermoanaerobacter_sp_A7A_658480004                              ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGNYRGNFAPQIPRNVILRYSHEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGD------Y-------------------------------K-----YEQIVRVGDVRNLK---------EIGDSSIDLIITHPPYLNIIKYS-NG-R-IKGDLSNISDVKKFCNELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---ER-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY
      THIT_RS02440_Thermoanaerobacter_italicus_502759633                              ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGDYRGNFAPQIPRNVILRYSQEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGD------Y-------------------------------K-----YEQIVRVGDVRNLK---------EISDSSIDLIITHPPYLNIIKYS-NG-R-IEGDLSNISDVKKFCDELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---EK-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY
      TKV_c04890_Thermoanaerobacter_kivui_694165517                                   ---------------------------------------------------------------------------M----------------------------QNI----E-FR-KEI-TTVWSFPERG-NWKTH--NGSYRGNFAPQIPRNVILRYSNEGDIVLDPMVGSG-TTLIEAKLLNRRGIGFDINPESVELAKRNLEFDGE------Y-------------------------------K-----YEQIVRVGDVRNLK---------EISDSSIDLIITHPPYLNIIKYS-NG-R-IEGDLSNISDVKKFCDELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---EK-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY
      HYDTH_RS09530_Hydrogenobacter_thermophilus_502729540                            ---MKEITMNDYLEFIKEN----DFVIIE--------SVK----VK--LNK--------T-W---S-I-KS----Y----------------------------GPK----E-YF-PEK-TTVWSFPNRG-SWATH--KGNYRGNWSPYVPRNLILKYTNKGDWVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFP--------Y--G----------------------------Q-----STIKTYWGDARNLD---------KIESQSIDLIATHPPYANMISYT-KN-KKLSDDLSLL-SPEEYLKEMRKVAEESYRVLKPGKVCAILIGDTRKYKHYVPIAFRVMQVFLEAGFILREDIIKL----QWKMKATREKW----------------RAK---EY--DFYLIAHEHIFVFRK|-----------------------P-----EKEEEY--RKYKL----S-----
      HGMM_F16H05C22_uncultured_Aquificae_bacterium_374851611                         ---MKEITMNDYLEFIKEN----DFVIIE--------SVK----VK--LNK--------T-W---S-I-KS----Y----------------------------GPK----E-YF-PEK-TTVWSFPNRG-SWATH--KGNYRGNWSPYVPRNLILKYTNKGDCVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFP--------Y--G----------------------------Q-----STIKTYWGDARNLD---------KIESQSIDLIATHPPYANMISYT-KN-KKLSDDLSLL-SPEEYLKEMRKVAEESYRVLKPGKVCAILIGDTRKYKHYVPIAFRVMQVFLEAGFILREDIIKL----QWKMKATREKW----------------RAK---EY--DFYLIAHEHIFVFRK|-----------------------P-----EKEEEY--RKYKL----S-----
      MTC_RS07180_Methanocella_conradii_504218926                                     -------------------------------------------------------------------MKNNSNISL----------------------------APT----N-FE-PEF-TTLWSFPVRG-NWATH--SPDYRGNFAPQIARNLILKYSKEGDTVLDPMAGGG-TTLIEAKLLNRKGIGFDINPKAVDITIKNLRFECN------S-------------------------------N-----YEPKVKVGDVRNLK---------EIPDSSIDLIITHPPYLNIIKYS-DG-K-IEGDLSNISSLKKFCDELELGIKEFYRVLKEDSYCAILIGDTRRAKHYVPLSYYVMERFLDNGFVLKEDIIKA----QHNCE-STPYW---KS-----------KAE---KL--NIYLIMHEHLFVFRK|-----------------------P-----SEHENL--SRLRY----S-----
      _Gracilibacteria_bacterium_JGI_0000069-K10_742671763                            -------------------------------------------------------------------M-AKKLK-L----------------------------PPE----D-FE-QEC-STVWSFPRRG-NWATH--NSKYRGNWSPEVVRNLILRYSKEGDYLLDPMIGGG-TTAIEAKLLGRNLLCYDINPEAIKLTESFLDFEIP------S-----------------------------PTKER---ARVRLKKHNATKKNK--------DLKDESIDFVLMHPPYVDIIKYS-D--G-IKGDLSHIHDLDEFSDEIEKVAKESFRVLKKGGYCAVLMGDTRREKMYQPLAFKTMERFLKVGFALKEDIIKV----QHNCK-ATGFW---VN-----------KSK---DY--NFLLIMHEHLFIFKK|-----------------------I----------------------------
      ACD_71C00187G0001_uncultured_bacterium_(gcode_4)_406901678                      -------------------------------------------------------------------M-PKKIKKL----------------------------QPE----E-FD-QEC-TTVWSFPRRG-NWATH--NSKYRGNWSPDVVRNLIVRYSKEGDTLLDPMIGGG-TTAIECKLLNRNLIAFDVNPASIELSESMLDFEYD------S-------------------------------S-----AKIRIVQGDARELMK--------KVGDESVDFILHHPPYADIIKYS-EW-K-IPEDLSNIHDIDEFADEMEKIARECFRVLKKWQYCAILIGDTRREKMYQPMAFKVMERFLRVGFALKEDIVKV----QHNCK-ATGYW---KT-----------SSQ---KY--NFLLIMHEHLFIFKK|-----------------------P----------------------------
      _Methanocella_conradii_504218923                                                -------------------------------------------------------------------MKNNSNISL----------------------------APT----N-FE-PEF-TTLWSFPVRG-NWATH--SPDYRGNFAPQIARNLILKYSKEGDTVLDPMAGGG-TTLIEAKLLNRKGIGFDINPKAVDITIKNLRFECN------S-------------------------------N-----YEPKVKVGDVRNLK---------EIPDSSIDLIITHPPYLNIIKYS-DG-K-IEGDLSNISSLKKFCDELELGIKEFYRVLKEDSYCAILIGDTRRAKHYVPLSYYVMERFLDNGFVLKEDIIKA----QHNCE-STPYW---KS-----------KAE---KL--NIYLIMHEHHLFLGS|-----------------------R-----VNMKI------------------
      ACD_81C00186G0010_uncultured_bacterium_406873648                                -------------------------------------------------------------------MSIKDFK-L----------------------------HPE----E-FD-LEC-TTVWAFPRRG-NWATH--ASDWRGNWAPEVVRNLILRYSSEKDHLLDCMIGGG-TTAIEAKILNRHITCIDVNEEALERTRKSLEFEVE------N-------------------------------K-----AKQRVMKCDARDMS---------FIKDNEIDFVLTHPPYADIIKYS-DG-Q-IEEDISGIHDIDAFVDEIEKVAKELYRVLRPGKYCAILMGDTRRNKMYQPLAFKVMERFLRVGFVLKEDIIKR----QFNCK-ATGFW---VN-----------KSK---ES--NFLLIMHEHLFVFQK|-----------------------LDSIKSPDFATV--SKIKT----I-----
      FERPE_RS08390_Fervidobacterium_pennivorans_752594110                            ---------------------------------------------------------------------------M----------------------------HDLNVEFD-FK-PEI-TTVWSFPERG-KWSTH--KGTYRGNFAPQVARNLLLRYTKEGDVILDPMMGSG-TTLIEAKLLKRRAIGIDINPTSVELTKRNLSFNCP------N-------------------------------S-----YEPEVFIGDARDLS---------FIEDETVDFVLLHPPYLNIIKYS-EG-N-INGDLSNISDVKRFCTELEKVIIELFRVLKPGKFCSVLIGDTRKNGHYVPLSYYVLTLFLKNGFVLKEEIIKI----QHNCT-STPYW---RK-----------KVS---EN--NFYLIMHEHLFVFKK|-----------------------P-----ELGENV--SKIKY----S-----
      Ferpe_1711_Fervidobacterium_pennivorans_DSM_9078_383110164                      ---------------------------------------------------------------------------M----------------------------HDLNVEFD-FK-PEI-TTVWSFPERG-KWSTH--KGTYRGNFAPQVARNLLLRYTKEGDVILDPMMGSG-TTLIEAKLLKRRAIGIDINPTSVELTKRNLSFNCP------N-------------------------------S-----YEPEVFIGDARDLS---------FIEDETVDFVLLHPPYLNIIKYS-EG-N-INGDLSNISDVKRFCTELEKVIIELFRVLKPGKFCSVLIGDTRKNGHYVPLSYYVLTLFLKNGFVLKEEIIKI----QHNCT-STPYW---RK-----------KVS---EN--NFYLIMHEHLFVFKK|-----------------------P-----ELGENV--SKIKY----S-----
      ACD_28C00322G0004_uncultured_bacterium_406967845                                ---------------------------------------------------------------------MSEIK-L----------------------------HPE----E-FE-LEC-TTVWAFPRRG-NWATH--KSDWRGNWSPEVARNLILRYSKEKDHLLDCMIGGG-TTAIEAKILNRHITCIDVNEEALERTKKSLEFEVD------N-------------------------------K-----AKQRVAKCDARNMS---------FIKDNEIDFVLTHPPYADIIKYS-EG-K-IEEDLSGIHDIDAFVDEIEKVAKELFRVLKKGKYCAILMGDTRRNKMYQPLAFKVMQKFLDTGFVLKEDIIKR----QFNCK-ATGFW---VT-----------KSK---ES--NFLLIMHEHLFVFQK|-----------------------V----------------------------
      ACD_18C00096G0009_uncultured_bacterium_406986924                                ---------------------------------------------------------------------MVKMK-L----------------------------HPD----N-FD-LEC-STVWSFPRRG-KWATH--KSDWRGNWAPEVVRNLILRYSGEKDHLLDCMIGGG-TTAIEAKILNRHITCIDVNEEALERTRKSLNFEVN------N-------------------------------K-----ARQRIIKCDARKMD---------FIKDNEIDFVLTHPPYADIIKYG-EG-K-IKEDLSNIHDIEKFAEEMELVAKELYRVLKPQKYCAILIGDTRRNKMYQPMAYKVMDKFLKQGFKLKEDIIKQ----QHNCK-ATGFW---VK-----------KSK---KL--NFLLIMHEHLFVFQK|----------------------------------------------------
      NA23_RS09565_Fervidobacterium_islandicum_701167223                              ---------------------------------------------------------------------------M----------------------------RDLSREIE-FK-PEI-TSVWSFPDRG-KWSTH--RGNYRGNFAPQVARNLLLKYTQEGDLVLDPMMGSG-TTLIEAKLLKRKAIGIDINPESVELTRKNLDFNCD------N-------------------------------C-----YKPEVLLGDARKMS---------FLNDEVVDFIILHPPYLNIIKYS-NG-N-IVGDLSTISDVKTFCLELEKVVHELFRVLKQNKYCAVLIGDTRKNGHYVPLSYYVATLFLKNGFVLKEEIIKV----QHNCS-STPFW---EK-----------KVQ---EH--NFYLIMHEHLFVFRK|-----------------------P-----AQDENL--SRIKY----S-SGRF
      TTHE_RS09375_Thermoanaerobacterium_thermosaccharolyticum_503063371              ---------------------------------------------------------------------------M----------------------------ENI----N-FK-KEI-TTVWSFPERG-DWATH--NGKYRGNFAPQVPRNIILRYSKENDIVLDPMVGSG-TTLVEAKLLNRRGIGFDINPDAIDITKRNLNFGAN------FSGK---------------------------CK-----FEPDAKIGDIRNLK---------EIDDNSIDLIITHPPYLNIIKYS-NG-N-IEGDLSNISGVKKFLNELEKGVSELFRVLKNNRYCAILIGDTRKSGHYVPLAFYVMQLFLKNGFILKEDIIKV----QHNCK-STPYW---ES-----------QVE---KY--NFYLIMHEHLFVFRK|-----------------------P-----DIDEDV--SKVRY----S-----
      HMPREF9131_RS04375_Peptoniphilus_490963643                                      ------------------------------------------------MTS----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPERG-DWATH--DAKWRGNWSPYIPRNIILRYSKEKDLILDQFAGGG-TTLVEAKLLKRNIIGLDVNDVALNRCREKIDFEHE---------G----------------------------AD----GKVFLRKGDARNLD---------FISDNSIDLICTHPPYANIIKYS-EN---IKEDLSQL-KINDFLDEMKKVASESYRVLKKDKFCAVLMGDTRKNGHMIPLSFYVMQVFENAGFKMKEMIIKE----QHNCR-ATGFW---KT-----------NSI---KY--NFLLIAHEHLFIFRK|----------------------------------------------------
      SPICO_RS02800_Sphaerochaeta_coccoides_503504517                                 -----------------------------------------------------------M-T---I---RK----W----------------------------EPD----N-FE-LET-NTVWSFPDRG-NWATH--DAKWRGNWSPYIPRNILLRYSGEGDWVLDQFVGGG-TTLVEAKLLNRNIIGIDVNPDALNRCKAKIDFEC----------P----------------------------NA----GTVKLYQNSAGNLS---------FIEANSIDLICTHPPYADIIHYS-ED---IEGDLSLM-SVRDFLGAMKPVAEECYRVLKKGKFCAVLMGDTRKKGCVIPMSFDVMKIFEAAGFVTKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KH--NFLLLAHEYLFVFRK|----------------------------------------------------
      P159_RS0116605_Selenomonas_ruminantium_657829373                                -----------------------------------------------------------------M---IK----W----------------------------EPE----D-FE-LRM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDMALERCREKTDFEHE---------G----------------------------AE----GRVTLQKGDARNLD---------FLKDEQIDLICTHPPYANIIQYS-ED---IPADLSRM-AIADFLEEMKKVAKESYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      BN820_00869_Acetobacter_sp_CAG:977_547265226                                    ------------------------------------------------MKR--------------I---KC----W----------------------------QPE----N-FE-LEM-TSVWSFPNRG-KWATH--DAKWRGNWSPYIPRNLILRYSQEGDVVLDQFVGGG-TTLVEAKLLNRNAIGVDINDAALERCAEKTSFQYE---------G----------------------------SE----GEISIVKADARDLS---------FISNESIDLICTHPPYANIIQYS-DD---LENDLSRL-SLKDFLAEIQKVASESYRVLKKGKYCAVLMGDLRKKGHVFPLGMNVMQIFESVGFSLKEIIIKE----QHNCK-ATGYW---KT-----------SSI---KY--NFFLLAHEYLFVFKK|--K-------------------------------------------------
      Q388_RS0120175_Ruminococcus_albus_503262746                                     -------------------------------------------------MK----------K---I---KK----W----------------------------EPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGVDCNDEALTRCREKIDFDYP---------P----------------------------AH----GKVFLYKGDARDLY---------FQSDESVDLICTHPPYADIIKYS-DG---IPEDLSQL-KVKDFLEAMKPVAAECYRVLKKGKFCAVLMGDTRQKGCMIPMSFDVMKIFQEAGFTLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      I825_RS0102520_Atribacteria_bacterium_SCGC_AAA255-G05_658522169                 -------------------------------------------------MS--------K-K---I---KT----F----------------------------YPK----D-FK-EKQ-STVWSFKQRG-NWATH--SGEYRGNWSPFIPRNVILKYSNPGELILDYFCGAG-TTAVECKLLNRKCIAIDINDKAIELAKENVNFNTE---------S--RQLTF--E------------------KNHTQIYEPELLVGDARDLS---------SLKDNSVDLICAHPPYSNIIHYT-DS---KEGDLSFF-DIDEFLKEMEKVAKESFRVLKPGRQCAILIGDTRRKKHIIPLGFKLINIYLEAGFKLRELVIKR----QHNCK-TTGFW---YT-----------NSI---KY--NFLLLAHEYLPIFEK|----------------------------------------------------
      _[Eubacterium]_siraeum_505332319                                                -----------------------------------------------MANK----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGIDINDVALERCREKTDFDYE---------P----------------------------AK----GKVYINKGDARHLD---------SIPDDSIDLICTHPPYADIIKYS-DG---IDGDLSQL-KVKEFLEQMKPVAEESYRVLKKGKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      N469_RS0107485_Marinimicrobia_703200955                                         ---------------------------------------------M--KVS--------K-K---I---KR----L----------------------------YPE----D-FK-EEL-TTVWSFKQRG-NWATH--SGEYRGNWSPYIPRNVILKYSKPDELVLDYFCGAG-TTAVECKLLGRKCIAFDINDKAIELARRNLNFIVE---------S--QQLSLIDE------------------KLHPQIYEPALSVGDARELS---------LLQDNSIDLICAHPPYANIIHYT-DS---KEGDLSFC-DIDEFLKEMGKVAKESFRVLKPGRQCVILIGDIRKKKHVIPLGFKLINVYLNAGFKLRELVIKR----QHNCK-TTGFW---YA-----------NSI---KY--NFLLLAHEYLPIFEK|----------------------------------------------------
      BN720_00766_Eubacterium_sp_CAG:581_548315511                                    ------------------------------------------------MNK----------R---I---TK----W----------------------------GPD----D-FE-LEM-TTHWSFPDRG-KWATH--DAKWRGNWSPYIPRNILLRYSNEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDCNDVALKRCKEKIDFDYE---------P----------------------------AK----GKVYIRKGDARNLD---------FIKDDSIDLICTHPPYANIIQYS-DD---ITEDLSLL-KINDFLEQMKKVASESYRVLKKGKFCAVLMGDTRQKGHMIPMSFDVMNIFQNTGFKLKELIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLIAHEYLFVFHK|--I-------------------------------------------------
      Q355_RS15275_Meiothermus_cerbereus_738305516                                    --------------------------------------------MT--RRK--------S-K---I---TQ----W----------------------------EPK----G-FQ-LET-TTVWSFKNRG-KWATH--DGRYRGNWSPYIPRNLILRYSQPHEVVLDYFSGGG-TTAVEAKLLTRRCIARDINPDALALTKENLDFQLP---------Q--DMFS---G------------------NGH---FPIQIELGDARDLS---------SIEDESIDLICAHPPYAGIISYSANA---VDGDLSTL-CVPEFIDEMQKVARESYRVLKAGRQCAILIGDSRKSKHIVPIGFLTIRAFLNAGFVLRELIIKR----QHNCK-TTGFW---YS-----------NSI---RY--NFLLLAHEFLPVFEK|----------------------------------------------------
      HMPREF1497_RS08290_Fusobacterium_492656844                                      ------------------------------------------------MNK----------K---N---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGVDVNNVAIERCKEKINFNFE---------N----------------------------S-----GKVYIHKGDARKLD---------FIKDETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KISEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|----------------------------------------------------
      L21TH_RS00440_Caldisalinibacter_kiritimatiensis_493349121                       --------------------------MDK--------KLG----KYEDKKS--------H-K---V---KE----W----------------------------EPK----D-FK-LEA-TTVWSFPNRG-KWATH--SGKYRGNWSPYIPRNIILRYSKKDDTVLDQFLGSG-TTLIETKLLHRNGIGVDVNQNAINIAKENLEFTKN---------K----------------------------E-----YEPKIIKGDARDLD---------FISDESIDLICTHPPYANIIKYS-DN---IKEDLSRY-DINQFLVEMKKVASECYRVLKKDKYCAILIGDTRRKKHMIPLGFKVMEVFLDAGFVLKENIIKE----QHNCK-ATGFW---YK-----------RSI---EY--NFLLIAHEYLLVFRK|-PVDNEDKKV------------------------------------------
      HMPREF9093_RS05275_Fusobacterium_sp_oral_taxon_370_496969638                    ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTQEKDLILDQFAGGG-TTLVEAKLLNRNIIGIDINDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIRKGDARNLD---------FIKDETIDFVCTHPPYANIIEYS-ED---IEGDLSHL-KIPEFLKEIEKVATESYRVLKKDNFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      HMPREF1504_RS04555_Veillonella_sp_ICM51a_740284293                              ------------------------------------------------MAK-------NIKK---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLLLRYSQEGDLILDQFAGGG-TTLVEAKLLNRDIIGIDINEVALERCKEKIDFDYE---------S----------------------------AK----GRVELHKGDARNLD---------FISDDSIDFVCTHPPYANIIKYS-EG---IEGDLSQL-KVPEFLEEMKLVASESYRVLKKGRFCAILMGDTRQKGHMVPMSFDVMRIFEEAGFKLKELIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFKK|----------------------------------------------------
      MSUIS_RS03915_Mycoplasma_suis_503374808                                         --------------------------------------------MS--SKV--------K-K---F---TK----W----------------------------GPD----N-FE-LET-STIWNFPNRG-KWATH--DAKYRGNWSPYIPRNILLRYSSEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEESLKRCREKTSFEFN---------G---------P------------------K-----GQVEIVKGDARDLN---------FIKSESIDLICTHPPYANIIHYS-EG-QVIEEDLSNL-KVSEFLEEMKKVAQECCRVLKKNKYCVILMGDTRKNGHMIPLSFDVMKLFEDVGFKLKELIIKA----QHNCK-ATGFW---KT-----------NSV---KH--NFLLIAHEYLFVFRK|----------------------------------------------------
      COPRO5265_RS06650_Coprothermobacter_proteolyticus_754097257                     ----------------------------------------------------------------------M----W----------------------------EPE----D-FS-LET-TTVWGFPDRG-DWATH--SGKYRGNWSPYIPRNVILRYSNENDVVLDQFVGSG-TTLVEAKLLGRRGLGVDINPDAVKLALSNVNFEHK--------------------------------------C-----GLADVHIGDARNLD---------FVKDSSIDLICTHPPYSNIIKYS-DN---IEGDLSHY-DIPEFLKEMYKVASESYRVLKRGRFCAVLMGDTRRKGNIIPLGFRVMEVFCKAGLTLKEIVIKE----QHNCT-STGYW---KK-----------QSI---KY--NFLLIAHEYLFIFKK|----------------------------------------------------
      SELSP_RS03130_Selenomonas_sputigena_493205676                                   ------------------------------------------------MVK----------K---I---TK----W----------------------------EPE----E-FE-LEM-TTHWSFPKRG-NWATH--DAKWRGNWSPYIPRNILLRYSEEKDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDTALERCKEKIDFEHD---------G----------------------------AD----GKVYIHKGDARNLD---------FIPDGSIDLICTHPPYADIIKYS-ED---IEADLSHL-KVKDFLEEMNAVAAESYRVMKKGKFCVVLMGDTRQKGHMIPMSFQVMRIFEDAGFTLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      MSU_RS04145_Mycoplasma_suis_762897706                                           -----------------------------------------------------------------M---LK----L----------------------------GPD----N-FE-LET-STIWNFPNRG-KWATH--DAKYRGNWSPYIPRNILLRYSSEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEESLKRCKEKTSFEFN---------G---------P------------------Q-----GQVEIVKGDARDLN---------FIKSESIDLICTHPPYANIIHYS-EG-QVIEEDLSNL-KVSEFLEEMKKVAQECYRVLKKNKYCVILMGDTRKNGHMIPLSFDVMKLFEDVGFKLKELIIKA----QHNCK-ATGFW---KT-----------NSV---KH--NFLLIAHEYLFVFRK|----------------------------------------------------
      J145_RS0109710_Fusobacterium_hwasookii_657692329                                ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGE-TTLVEAKLLNRNIIGIDVNDVAIERCKEKINFEFE---------N----------------------------S-----GKVYINKGDARKLD---------FIKDESIDFVCTHPPYANIIEYS-EN---IDEDLSHL-KIPEFLKEMKKVASESHRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      MSU_0848_Mycoplasma_suis_str_Illinois_323652279                                 ---------------------------------------------------------------------MK----L----------------------------GPD----N-FE-LET-STIWNFPNRG-KWATH--DAKYRGNWSPYIPRNILLRYSSEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEESLKRCKEKTSFEFN---------G---------P------------------Q-----GQVEIVKGDARDLN---------FIKSESIDLICTHPPYANIIHYS-EG-QVIEEDLSNL-KVSEFLEEMKKVAQECYRVLKKNKYCVILMGDTRKNGHMIPLSFDVMKLFEDVGFKLKELIIKA----QHNCK-ATGFW---KT-----------NSV---KH--NFLLIAHEYLFVFRK|----------------------------------------------------
      G497_RS0101670_Desulfovibrio_cuneatus_652933168                                 ------------------------------------------------MLK--------N-K---E---QK----W----------------------------GPD----N-FE-LEM-NTVWSFPQRG-NWATH--DAKYRGNWSPYIPRNILLRYSSEGDYVLDQFAGGG-TTLVEAKLLKRNVLGVDVNESALECCRVKCDFESE---------N----------------------------A-----GRVVIRHGDARNLN---------FIKDECIDLVCTHPPYANIIQYS-EN---NLNDLSHL-DVTSFLEQMKLVAAESYRVLKKDKFCAILMGDTRKKGHIIPMSFEVMRIFEHAKFKTKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLFIFKK|----------------------------------------------------
      FSCG_RS00585_Fusobacterium_nucleatum_496076017                                  ------------------------------------------------MN-------------------KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARRLD---------FIKDETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|----------------------------------------------------
      HMPREF1583_RS06235_Gardnerella_vaginalis_515155565                              ------------------------------------------------MTV--------T-A---I---KR----W----------------------------EPE----N-FE-LEM-TTHWSFPDRG-NWATH--DSKWRGNWSPYVVRNLLLRYSAEKDLVLDQFVGGG-TTLVEAKLLNRDVIGVDVNDIAINRCREKVSFNHE---------G----------------------------AD----GRVYIRKGDARNLD---------FLDDESIDFICTHPPYANIIKYS-EN---IPEDLSLL-KVDAFLSQMKKVAEESYRVLKTNKFCAVLMGDTRQKGCMIPMSFDVMKIFQNAGFTLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      BN788_01674_Eubacterium_siraeum_CAG:80_547865125                                ------------------------------------------------MHK----------K---I---TK----W----------------------------QPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGIDINDVALERCREKTDFDYE---------P----------------------------AK----GKVYINKGDARHLD---------SIPDDSIDLICTHPPYADIIKYS-DG---VDGDLSQL-KVKDFLEQMKPVAEESYRVLKKGKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      FUSO3_RS09465_Fusobacterium_necrophorum_737952516                               ------------------------------------------------MVK----------K---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSKEEDLVLDQFAGGG-TTLVEAKLLNRDVIGVDVNEFALERCQEKISFEYE---------T----------------------------AK----GKVYLRKGDARKLD---------FIPDESVDLICTHPPYANIIQYS-ED---IEEDLSHL-KIKDFLEEMKKVAGESYRVLKKDKFCAILMGDTRQKGHMMPMSFEVMKIFEEVGFKLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      BN656_01315_Bacteroides_pectinophilus_CAG:437_547961389                         -------------------------------------------------------------------------------------------------------MQPN----N-FQ-LEP-TTVWSFPDRG-SWATH--SGKYRGNWSPYVPRNLILRYSKPGEWVLDQFMGSG-TTLVEAKLLGRNAVGIDINPQSVSISETNLKFQCE---------T----------------------------K-----SKIFTKNADATNLH---------FIKDEHIDFICTHPPYADIIKYS-KG---ISGDISLL-CVDKFLGEMNKVAAESYRVLKRGKMCAVMIGDVREHGKVIPLGFRMMEGFLNAGFSNKEIIIKE----QHNCR-STKYW---EN-----------HNN---S----FLMLAHEYIFVFQK|----------------------------------------------------
      HMPREF0405_RS07885_Fusobacterium_nucleatum_496073207                            ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARRLD---------FIKDETIDFICTHPPYANIIEYS-EE---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCVILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|----------------------------------------------------
      CLOHIR_RS00470_[Clostridium]_hiranonis_493484190                                -----------------------------------------------------------------M---IN----W----------------------------EPS----N-FK-LET-GTVWIFPERG-SWATH--TPKYRGNFSPYVPRNLILRYSKKGDMILDQFAGGG-TTLIEAKLLGRNIIGVDVNIQALALCRSSTNFEYK---------N----------------------------S-----SKVYLRRGDARNLN---------FIPDEKIDFICTHPPYADAIKYS-KD---IVEDISLL-DYKSFLKEMEKVAKESYRVLKKGKYCAILMGDIRKNGNVIPLGFEVMNIFKNVGFINKEIIIKE----QYNCK-STDYW---IK-----------KSF---ER--NFLLLEHEYLFVFRK|----------------------------------------------------
      HMPREF1090_RS26280_[Clostridium]_clostridioforme_488666942                      ------------------------------------------------MAK----------K---I---TK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLRRNIIGVDVNDVALARCREKIDFEHE---------G----------------------------AD----GKVYIHKGDARHLD---------FIPDGSIDLICTHPPYADIIRYS-ED---IDEDLSHL-KVKDFLEEMKTVAQESYRVLKKDKFCAVLMGDTRQKGHMVPMSFEVMRIFEDAGFKLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      HMPREF1501_RS03285_Fusobacterium_sp_OBRC1_492605690                             ------------------------------------------------MKK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDINDIAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARKLD---------FIKDESIDFICTHPPYANIIEYS-ED---IDEDLSHL-KIPEFLKEIKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------SSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      BN647_00380_Firmicutes_bacterium_CAG:41_547820585                               -----------------------------------------------MENK----------K---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGVDVNDAALDRCREKTDFDYE---------P----------------------------AK----GKVYIKKGDARNLD---------FVPDESIDLICTHPPYADIIKYS-DG---LKNDLSQL-KVKDFFEEMKKVASESYRVLKKDKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFKLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      HGRM_RS15055_Ruminococcus_sp_JC304_517992866                                    --------------------------------------------------------------------------------------------MSKKKGEILIREAPE----K-FK-LED-TTIWSFPERG-SWATH--SGKYRGNWSPYIPRNLILRYSKKNDWILDQFLGSG-TTLIEAKLLGRNAIGVDINSEAIKLSNTNLNFTCQ---------E----------------------------S-----SKIFTKQGNATELS---------FIKDESINLICTHPPYADIIRYS-NK---IPGDISHL-KYEEFLKALEQVAREAYRVLKKQGICAFMIGDIRRAGYVLPLGMNSMQKFVNAGFKLKEIVIKE----QHNCR-SADYW---DG-----------KER---N----FLMLAHEYIFILKK|-----------------TDDYKS-----------------------------
      HMPREF9124_RS05925_Oribacterium_sp_oral_taxon_108_738699070                     -----------------------------------------------------------------------------------------------------MLLQPN----S-FN-LEQ-TSIWSFPERG-KWATH--SGKYRGNWSPYIPRNLILRYSKPGDWVLDQFLGSG-TTLVEAKLLNRNGIGIDINPKALSLSRTNLSFHSN---------S----------------------------R-----AQIFLKKGNAAKLA---------FIKDNRIDFICTHPPYSNIISYS-SD---LAGDISLC-NEKEFIIAMKKVAAESFRVLKKGKYCAVMIGDKRIHGNVIPLGFQLLTCFLETGFVLKEIIIKV----QHNCR-ATSNW---QN-----------KNR---N----FLMLAHEYIFVFYK|---------------PNF----------------------------------
      HMPREF9124_1256_Oribacterium_sp_oral_taxon_108_str_F0425_333759614              ------------------------------------------------------------------------M--YSKNSIDFVYFQRVIKLNYYKTGGDFILLQPN----S-FN-LEQ-TSIWSFPERG-KWATH--SGKYRGNWSPYIPRNLILRYSKPGDWVLDQFLGSG-TTLVEAKLLNRNGIGIDINPKALSLSRTNLSFHSN---------S----------------------------R-----AQIFLKKGNAAKLA---------FIKDNRIDFICTHPPYSNIISYS-SD---LAGDISLC-NEKEFIIAMKKVAAESFRVLKKGKYCAVMIGDKRIHGNVIPLGFQLLTCFLETGFVLKEIIIKV----QHNCR-ATSNW---QN-----------KNR---N----FLMLAHEYIFVFYK|---------------PNF----------------------------------
      J142_RS0109970_Fusobacterium_hwasookii_657695114                                ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKINFEFE---------N----------------------------S-----GKVYINKGDARKLD---------FIKDESIDFVCTHPPYANIIEYS-EN---IDEDLSHL-KIPEFLKEMKKVASESHRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      _Fusobacterium_nucleatum_496296625                                              ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARKLN---------FIKNETIDFICTHPPYANIIKYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      G397_RS0107800_[Eubacterium]_siraeum_491499778                                  -----------------------------------------------MANK----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGIDINDVALERCSEKTAFDYE---------P----------------------------AK----GKVYINKGDARCLD---------SIPDDSIDLICTHPPYADIIKYS-DG---IDGDLSQL-KVKDFLEQMKPVAEESYRVLKKGKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      BN678_01434_Dialister_sp_CAG:486_547523036                                      ------------------------------------------------MKK--------I-K-------------W----------------------------EPE----N-FE-LEM-NTVWDFPERG-SWATH--DAKYRGNWSPYIPRNLLLRYSKEGDWVLDQFAGGG-TTLVEAKLLHRNCIGLDVNPAALSRCHEKCEFPFE---------N----------------------------A-----GKIIIREGDARHLD---------FLPDASIDFICTHPPYADIIRYS-ED---LAGDLSHL-RGEAFLAEMEKVAGESYRVLKKDKFCAVLMGDMRQKGCMIPLSFQVMERFLAAGFTLKELIVKT----QHNCR-ATGFW---KT-----------NSV---KY--NFLLIAHEHLFVFRK|----------------------------------------------------
      FSDG_RS00930_Fusobacterium_nucleatum_495977000                                  ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFG---------N----------------------------S-----GKVYIHKGDARKLN---------FIKNETIDFICTHPPYANIIKYS-ED---VEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      FSAG_RS07575_Fusobacterium_periodonticum_496069501                              ------------------------------------------------MKK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTQEKDLILDQFAGGG-TTLVEAKLLNRNIIGIDVNDIAIERCREKIDFEFE---------N----------------------------S-----GKVYIHKGDARNLD---------FIKNETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KIPEFLKEIEKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      CTHBC1_RS07545_Ruminiclostridium_thermocellum_490598869                         TAANNDVTCAFVKKVAKES----TICLEE--------KSK----SY-FADK--------L-N---I---KS----W----------------------------EPE----N-FN-LET-TTVWSFPDRG-DWATH--SGKYRGNWSPFIPRNVILRYSKEGETVLDQFVGSG-TTLVEAKLLKRKGIGVDINPEAVNLTCRNINFEKE---------D----------------------------C-----GETEVHVGDARHLG---------FIKDESVDLICTHPPYSNIIKYS-ED---IEGDLSHC-DINEFLVEMEKVAKESYRVLKKGRFCAILIGDTRRKGHMIPIGFNVMQTFLRAGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      JCM21531_RS04440_[Clostridium]_straminisolvens_740456070                        TAANNDVTQAFVKKVAKES----RIYFEE--------NAK----GY-FVGK--------P-N---I---KL----W----------------------------EPE----N-FN-LET-STVWSFPDRG-NWATH--SGKYRGNWSPFIPRNVIMRYTKEGETVLDQFVGSG-TTIVEAKLLKRKGIGVDINPEAVNLTSRNINFEKE---------D----------------------------C-----GEVEVHVGDARHLG---------FIKDESIDLICTHPPYSNIIKYS-ED---IQGDLSHY-DIDDFLVEMEKVAKESYRVLKKDRFCAILMGDTRRKGHMIPIGFNVMQTFLRAGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      CLOCLA_RS0112375_[Clostridium]_clariflavum_653611723                            TAYRNDVTQLFVKKIMKES----MIYIKE--------KES----DY-YVNK--------L-N---I---KS----W----------------------------EPE----N-FS-LET-TTIWGFPDRG-SWATH--SGKYRGNWSPFIPRNVILRYSTEGEIVLDQFVGSG-TTLVEAKLLNRKSIGIDINPEAVNIARHNTNFERD---------G----------------------------S-----GEVEVHVGDARHLE---------FIDDESIDLICTHPPYSNIIRYS-EN---IQGDLSHC-DIKEFYKEMEKVAIECYRVLKKNKFCAILIGDTRKKGHMIPIGFNVMEIFLRTGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      K364_RS0114940_Desulfitibacter_alkalitolerans_654856343                         --------------------------MTS--------NKK----L----K-------------------------W----------------------------EPD----K-FE-LQT-NTVWSFPDRG-NWATH--NSKYRGNWSPYIPRNLILRYSKEGDTILDQFAGSG-TTLIEAKLLNRNCIGVDINAVSIELCRENTDFERE---------N----------------------------C-----GHVTIKRGDARDLS---------FINDKSIDFICTHPPYANIIKYS-ND---IIGDLSCY-EVGDFLKEMKKVAAECYRILKEDKYCAILIGDTRKKGHIVPIGFEVMKVFEAIGLKIKEIIIKE----QHNCT-STCYW---RN-----------KST---KY--NFLLLAHEYLFVFKK|----------------------------------------------------
      TVG_RS02300_Thermoplasma_volcanium_499219160                                    -------HTTELARHYANE-VY-NMAFIS-R------KLP--------YSD--------I-D---L---SR----W------------R---------------EYD----F----VIT-DSLWLFDKRD-YRGSK--LGWYWGNFVPQIPRQLILRFSRKDEWVLDPFSGSG-TTLIEAKKLGRNSLGIEINEEVCKKSLEILNSIDG---------D----------------------------G-----FSTAIS-GDSASVN-LTKVME--YYGIPEFNLVIMHPPYHDIIKFT-DI----GGDLSNARDTKEFLSMLGKVTRNVSKYLQKGRFLALVIGDKYSNGEWIPLGFYSMQKVMDQGFRLKSTIVKNFEYTRGKAS-SSDLW---RY-----------RAL---AG--GFYVFKHEYIFVFQK|----T-----------------------------------------------
      AMDU3_IPLC00003G0029_Thermoplasmatales_archaeon_I-plasma_546149073              -------KSSKLPKL-----------------------AP--------FSD--------I-D---L---SR----W------------K---------------DYS----D----VWT-DSLWVIDRRD-TSGAH--INSYHGNFIPQIPHQLMMRYTKKGDWVLDPFLGSG-TTLIECIRMGRNGIGVELNESVADKSKSIIASEPN---------S-F--------------------------G-----VTTVTSVGDSSTLP-FRDLLD--SINVKSVQLAVLHPPYHGIIKFS-DN----PADLSNAGTVDDFLLQLSRVVSNTMQVLDEGRYMALVIGDKYEKGKWIPLGFYAMQKVMDAGMELKSIVVKNYGETRGKSH-QQSLW---RY-----------RAL---QG--GYYLFKHEYVFIFRK|----A-----------------------------------------------
      AMDU1_APLC00004G0008_environmental_samples_546147902                            -------RGIDRAHYYVRS-VI-RDIGSS-A------SAS--------SLE--------F-N---I---NL----W------------K---------------AYD----E----IRT-DSLWILGKRD-REAGH--KGWYWGNFVPQIPHQLMMRYTRTSDWILDPFCGSG-TTLIEAIRMERNSVGIEINPEVYSRTREAVQSLPH---------D----------------------------G-----TRAEIILGDSYTVD-LVPVME--RNGVSSFDMVLLHPPYWDIIRFS-DD----QGDLSTSPDMESFISRFTQIAKKSIAVLKSGGYMGLVIGDAYRDGEIVPLGFRCMDAVASLNMKIRGIIVKDIQNTRGKRS-SENLW---RY-----------RAL---KS--GFYVFKHEYVFVFQK|----P-----------------------------------------------
      FFONT_0867_Fervidicoccus_fontis_504370902                                       RRPLKEITLEDFESIAKRK----KY-VTI--------GCK----K---IEL----------E---I---EG----F--KEL-----------------------QPK----E-FV-VEK-TSVWSFPERG-KWATHKYNAKFRGNWSPQVARNLLLLYSKSGDTVLDPFLGSG-TSMIECILLKRRCYGVDINIDSVMLSWSRIKPIYS---------S----------------------------D-----SFVKLFEGDAEYLD---------AFEDEKFDFILGHPPYASIIKYS-KG---SDGDLSKM-SIQEYLEKMRRIARELYRILKKDKYLAIMVGDIRRKKHVIPLGYMVMKIFLEEGFIIKEHIIKV----QHNMI-GTAFW---KN-----------KKN---D----FLLLKHEHIFVFRK|----PLDSSD-YEK---FQYYMLY----------------------------
      _Taylorella_asinigenitalis_505364599                                            ------------------------------------------------MQK--------------S---KKIKT-W----------------------------EPE----N-FE-LEM-TTHWSFPKRG-NWATH--DAKWRGNWSPYIPRNVILRYSKEGDLILDQFAGGG-TTLVEAKLLNRNIIGIDINQDSIDRCKEKTDFKLT---------L----------------------------EL----GNVDIKKGDARELT---------NIKDESIDLICTHPPYADIIKYS-ED---IPEDLSRL-KIKEFLNEMTKVADESYRVLKKGKFCAILIGDMRKNGNVIPLSTKVMNVFTDAGFVLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLYIFKK|--T-------------------------------------------------
      Y919_RS08595_Caloranaerobacter_azorensis_737178046                              IEEYTYLPEKLVLEKLEN--------LKE--------DRE----LYRVKKP--------L-R---I---QS----W----------------------------EPK----D-FA-LEA-TTVWSFPDRG-KWATH--NGKYRGNWSPYIPRNIILSYTKKGDIVLDQFLGSG-TTLVETKLLERRGIGVDINLDAIKVARANLRFNKN---------K----------------------------E-----YEPKIYKGDARNLD---------FIPDNSIDLICTHPPYANIIKYS-ND---IEGDLSLC-NIDEFINEMKKVAKEAFRVLKENKYCAILIGDTRKNKHMIPLGFKVMQVFLDAGFILKEIIIKE----QHNCK-ATGFW---YK-----------RSI---EY--NFLLIAHEYLFVFRK|-PKSK-----------------------------------------------
      HMPREF1630_RS01030_Anaerococcus_lactolyticus_490965414                          ------------------------------------------------MSK--------------I---KK----W----------------------------EPD----D-FE-LEM-TTHWSFPQRG-NWATH--DAKWRGNWSPYIPRNIILRYSEEGDLVLDQFAGGG-TTLVEAKLLNRNIIGLDVNDVALNRCKEKIDFNLTD--------R----------------------------PL----GKVKLLKGDARNLD---------FLTDESIDLVCTHPPYADIIKYS-DG---IENDLSLL-KINDFLKEMNKVAAEAYRVLKKDKFCAILMGDTRKNKHMIHLGFDVLKVFEDEGFKLKELIIKE----QHNTR-ATGFW---KK-----------RSV---DY--NFLLIAHEYLFILKK|----------------------------------------------------
      CAAU_RS08905_Caloramator_australicus_496184606                                  CVEKYNFTKDFIKKIFSGN----DMLLKE--------SKQ----LY-LTN-------------------------L----------------------------SPE----K-FE-LET-TTIWSFPDRG-NWATH--SGKYRGNWSPYIPRNILLRYSNEGELILDQFVGSG-TTLVEAKLLNRNAVGVDINPIALEITRENLKFNYE---------Y----------------------------N-----PKIDIKLGDARNLY---------FIEDNSIDLICTHPPYANIIKYS-EN---INGDLSHL-DVKEFLLEIEKVAKECYRVLKKDKYCAILIGDTRKKGHIIPIGFSVMQKFIDAGFKLKEIIIKE----QHNCN-STPYW---KN-----------KSL---KY--NFLLIMHEYLFVFRK|----------------------------------------------------
      MB41_RS06705_Anaerosalibacter_sp_ND1_757126172                                  -----------------------------------------------MREP--------L-K---V---DS----W----------------------------APE----N-FA-LEA-TTVWSFPERG-KWATH--NGKYRGNWSPYIPRNIILRYSEENDTVLDQFLGSG-TTLIETKLLKRRGIGVDINSETVKLSKENLRFNKN---------K----------------------------E-----YQPIIYNADARNLN---------FISDNSIDLICTHPPYANIIKYS-KN---INGDLSHL-NIDKFIIEIRKVAEESFRVLKNNRYCAILIGDTRKDKHIIPLGFKVMEEFLNAGFVLKETIIKE----QHNCK-ATGFW---YN-----------KSL---QY--NFLLIAHEYLFVFRK|-NCNY-----------------------------------------------
      CTM_RS16890_Clostridium_tetanomorphum_737163966                                 -----------------------------------------------------------------M---DE----------------------------------------A-FK-LET-KSLWSFKERG-EWATH--KGDYPGNWSPFVPKNIILRYSKQNDVVLDQFLGSG-TTLIEAKLLNRRAIGCDINPKALEIAKDRISRVKA--------------------------------------N-----TAIQLMECNARNLD---------CIKDNSIDLICTHPPYSNIIKYS-KN---IQGDLSLL-NLNEFYEAIKEVSIECFRVLKKTKYCTIMMGDIRKNGCVIPLGFNVMNLFLNQGFKLKEIIIKE----QHNCN-STKFW---KE-----------ISL---KK--NFYLLAHEYLFVFFK|----------------------------------------------------
      LEBU_RS11230_Leptotrichia_buccalis_506250654                                    ------------------------------------------------MVK--------K-----I---KK----W----------------------------EPE----E-FE-LEM-NTVWSFPDRG-KWATH--DAKYRGNWSPYIPRNLLLRYSNEGDLILDQFAGGG-TTLVEAKLLNRNIIGVDINSNALKRCKEKCDFEYE---------N----------------------------L-----GKVYFYEADARNLN---------FIPDENIDFICTHPPYANIIKYS-ED---IENDLSHL-KVKDFLIEMEKVASESYRVLKKDKFCAILMGDTRQKGHIIPMSFEVMKIFEKVGFKTKEIIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLIAHEYLFVFKK|----------------------------------------------------
      VE20218_RS15590_Clostridiales_bacterium_VE202-18_657680312                      ---------------------------------------------------------------------------------------------------------MK----T-YE-LQN-TTIWSFPDRG-SWATH--KGDYRGNWSPHIPKNLILKYTKQNDLILDCFAGSG-TTLIEAKLLNRNAIGVDINNDALNISKKRLHFDCH---------N----------------------------S-----AKIELYQCDAKKMT---------MLKDNSIDFICTHPPYTNIIKYS-KN---LENDLSLL-DYKDFLLHMDKVSKELYRVLKQGHNCSFMIGDIRKNGNVIPLGFNTMQVFLNNGFTLKEIIIKE----QHNCS-STKYW---QN-----------KIQ---NL--NFYLLAHEYIFVLSK|----------------------------------------------------
      CC89_RS03170_Clostridium_sp_KNHs214_737306053                                   -----------------------------------------------------------------M---EK----------------------------------------N-FK-LET-DTIWNFEERG-NWCTH--RGDYPGNWSPYVPKNIILRYSKEGEFVLDQFVGSG-TTLIEACLLNRKIIGCDINDRALNICSDRIKNLSK---------K----------------------------------DNVFLKKRDARNLY---------DIKDESIDLICTHPPYANIIKYS-KN---IDGDISLL-DIEEFYEAMKDVAKECYRVLKKEKYCSILMGDTRKRGFIIPLAFNVLNIFMNSGFKLKEIIIKQ----QHNCK-STEYW---RD-----------ISI---KR--NFYLIAHEYLFVFKK|----------------------------------------------------
      MELS_RS05115_Megasphaera_elsdenii_503781935                                     -----------------------------------------------------------M-K-------------W----------------------------QPD----D-FT-LEM-TSVWSFPQRG-KWATH--DGNYRGNWSPYIPRNLILRYSGEGDRILDCFVGGG-TTLVEAKLLSRNCIGVDVNEQALDRCRKKCDFSC----------P----------------------------NM----GKIYLKQGDARNLH---------FIQDASIDFICTHPPYANIIQYS-QD---IEQDLSRL-DVDSFLAEIKKVVCECYRVLKKGKFCAILMGDIRKKGHVIPLSFWVMDLFLQQGFSLKEMIIKE----QHNCR-ATGFW---KT-----------NSV---KY--NFLLLAHEHLFVFHK|-IS-------------------------------------------------
      HMPREF1580_RS03105_Gardnerella_vaginalis_515278394                              ------------------------------------------------MTV--------T-S---I---KR----W----------------------------EPE----N-FE-LEM-TTHWSFPDRG-NWATH--DSKWRGNWSPYVVRNLLLRYSAEKDLVLDQFVGGG-TTLVEAKLLNRDVIGVDVNDIAINRCREKVSFNHE---------G----------------------------AD----GRVYIRKGDARNLD---------FLDDESIDFICTHPPYANIIKYS-EN---IPEDLSLL-KVDAFLSQMKKVAEESYRVLKTNKFCAVLMGDTRQKGCMIPMSFDVMKIFQNAGFTLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      HGPG_RS01070_Peptoniphilus_grossensis_517953989                                 ------------------------------------------------MTN----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPERG-DWATH--DAKWRGNWSPYIPRNIILRYSKEKDLILDQFAGGG-TTLVEAKLLNRDIIGIDVNDVALNRCKEKIDFHHE---------G----------------------------AD----GKVFLRKGDARNLD---------FIPDNSIDLICTHPPYANIIEYS-EN---IEEDLSHL-KTNEFLEEMKKVASESYRVLKKDKFCAVLMGDTRKNGHMIPLSFYVMQVFENAGFKLKEMIIKE----QHNCK-ATGFW---KT-----------NSI---KY--NFLLIAHEHLFIFRK|----------------------------------------------------
      QSI_RS08630_Clostridiales_496091935                                             ---------------------------------------------------------------------------------------------------------MN----T-YK-LQN-TTIWNFPDRG-NWATH--KGDYRGNWSPHVPKNLILKYTEQKDLVLDCFVGSG-TTLIEAKLLDRNAIGIDINKKALEITRNRLNFDCN---------N----------------------------N-----AHIQLHLGDAQNLK---------MVKDNSIDFICTHPPYADIIKYS-KN---IENDISNL-EYNEFLAHMNQVSKELYRVLKPSHFCSFMIGDIRKKGNVIPLGFLTMQTFINNGFTLKEIIIKE----QHNCS-STSYW---ND-----------KSK---TL--GFYLLAHEYIFVLYK|----------------------------------------------------
      BN748_01131_Fusobacterium_sp_CAG:649_547450181                                  ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFMGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARRLD---------FIKDETIDFICTHPPYTNIIQYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCVILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|----------------------------------------------------
      HMPREF1586_RS06065_Gardnerella_vaginalis_490232946                              ------------------------------------------------MTV--------T-S---I---KR----W----------------------------EPE----N-FE-LEM-TTHWSFPDRG-NWATH--DSKWRGNWSPYVVRNLLLRYSAEKDLVLDQFVGGG-TTLVEAKLLNRDVIGVDVNDIAINRCREKVSFNHE---------G----------------------------AD----GRVYIRKGDARNLD---------FLDDESIDFICTHPPYANIIKYS-EN---IPEDLSLL-KVDAFLSQMKKVAEESYRVLKANKFCAVLMGDTRQKGCMIPMSFDVMKIFQNAGFTLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      HMPREF0993_RS14405_Lachnospiraceae_bacterium_5_1_57FAA_496543443                ------------------------------------------------------------------------M--FLLSELGIILFYFN---GQKKKGEIFIREAPE----K-FK-LED-TTIWSFPERG-SWATH--SGKYRGNWSPYIPRNLILRYSKKKDWILDQFLGSG-TTLIEAKLLGRNAIGVDINSEAVKLSNTNLNFTCQ---------E----------------------------R-----SKIFTKQGNANNLS---------FIKDESIDLICTHPPYADIIRYS-KE---IPGDISHL-KYEEFLKELEQVARESYRVLKRQGICAIMIGDIRRKGYVLPLGMNSMQKFVEAGFKLKEIIIKE----QHNCR-SAYYW---EG-----------RER---K----FLMLAHEYIFILEK|-----------------TDCYNSM----------------------------
      BN715_00862_Megasphaera_elsdenii_CAG:570_548306764                              -----------------------------------------------------------M-K-------------W----------------------------QPD----E-FT-LEM-TSVWSFPQRG-KWATH--DGKYRGNWSPYIPRNLILRYSSEGDRILDCFVGGG-TTLVEAKLLNRNCIGIDINAKALDRCREKCDFSC----------P----------------------------NM----GKIYLKQGDARKLH---------FIQDEAIDFICTHPPYANIIQYS-QD---IEQDLSRL-EVELFLEEMKKVVCECYRVLKKGKFCAILMGDIRKKGYVIPLSFFVMDLFLRQGFSLKEMIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLLAHEHLFVFHK|-IS-------------------------------------------------
      F553_RS0104430_Megamonas_rupellensis_648605175                                  --------------------------------------------------------------------------------------------------------MKD----N-FK-LEM-TTVWSFPKRG-NWATH--SGMYRGNWSPYVPRNLILKFTAEHDWILDQFMGSG-TTLIEAKLLNRNIIGIDVNEKAYKITEKNLNFECK---------T----------------------------S-----SHIHIRLCSAENIY---------FIKNNSIDCICTHPPYANIIKYS-KD---NQYDISLL-SVEKYLLAMKNVAKESYRVLKSNHICAIMVGDIRKEGILIPLGFYVMNIFKQQGFILKDIIIKE----QHNCK-STSKW---VN-----------IKH---S----FYLLAHEYIFIFEK|---------------------K------------------------------
      LEPGO_RS0100700_Leptotrichia_goodfellowii_652339649                             ------------------------------------------------MGK--------K-KF--I---KK----S----------------------------EPE----N-FE-LEM-NTVWSFPNRG-KWGTH--DAKYRGNWSPYIPRNLLLRYSNENDLILDQFAGGG-TTLVEAKLLNRNIIGIDVNDEALNRCKEKCNFEYE---------N----------------------------S-----GKVKICKGDARNLD---------FISNESIDFICTHPPYANIIQYS-ET---IENDLSHL-KVKDFLVEMKKVAEESYRVLKKNKFCAVLMGDIRQKGHIIPMSFEVMKIFESVGFKTKEIIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      HMPREF1040_RS02760_Megasphaera_sp_UPII_135-E_494634458                          ------------------------------------------------MGK----------K---I---VK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSKENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEVALARCREKINFDHS---------G----------------------------AN----GKVYLYKGDARTLD---------FIKDNSIDLICTHPPYADIIKYS-ED---IETDLSHL-KVKDFLIAMRDVAAESYRVLKKDKFCAVLMGDTRQKGHMIPMSFEVMKLFQSAGFKLKELVIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      FNV_RS04020_Fusobacterium_nucleatum_492571686                                   ------------------------------------------------MNK----------K---N---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGVDVNNVAIERCKEKINFNFE---------N----------------------------S-----GKVYIHKGDARKLD---------FIKDETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|----------------------------------------------------
      HMPREF9454_RS05820_Megamonas_funiformis_495813929                               --------------------------------------------------------------------------------------------------------MKD----N-FK-LEM-TTVWSFPKRG-NWATH--SGMYRGNWSPYVPRNLILKFTAEHDWILDQFMGSG-TTLIEAKLLNRNIIGIDVNEKAYKITEKNLNFECK---------T----------------------------S-----SHIHIRLCSAENIY---------FIKNNSIDCICTHPPYANIIKYS-KD---NRYDISLL-SVEKYLLAMKNVAKESYRVLKSNHICAIMVGDIRKKGILIPLGFYVMNIFKQQGFILKDIIIKE----QHNCK-STLKW---VN-----------IKH---S----FYLLAHEYIFIFEK|---------------------K------------------------------
      NW74_RS05595_Parvimonas_micra_754560689                                         ------------------------------------------------MKK--------------E---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPNRG-NWATH--DAKWRGNWSPYIPRNIILRYSKENDVVLDQFVGGG-TTLVEAKLLNRNIIGVDVNDIAIQRCKEKVDFEYK---------Q----------------------------SN----SKVIIKKGDARNLF---------FLENESIDLICTHPPYANIINYS-DD---LENDLSRL-NIKDFLIQMEEVANESYRVLKKGKFCAILMGDTRQKGNMIPMSFKVMEIFKKTGFTLKEIIIKE----QHNCK-ATGFW---KT-----------NSI---KY--NFLLIAHEYLFIFKK|----------------------------------------------------
      CLPA_RS15010_Clostridium_pasteurianum_489540792                                 -----------------------------------------------------------------M---SE----------------------------------------V-FN-LET-KTLWSFKERG-DWGTH--KGDYPGNWSPFVPRNIILRYSKDNELILDQFLGSG-TTLIEAKLLNRRGIGCDVNSTALETSKNRIEGVNG---------N----------------------------------NSIKLVKGSAKNMN---------FIKNESIDLICTHPPYSNIIKYS-KD---IDEDLSLL-NIDEFYESIKEVSKEAFRVLKKGKYCAIMMGDIRRNGCVIPLGFNVMNLFLNQGFRLKEIIIKE----QHNCS-STKYW---EE-----------ISL---KK--NFYLLAHEYLFVFLK|----------------------------------------------------
      Smon_1038_Streptobacillus_moniliformis_DSM_12112_268315129                      ------------------------------------------------MIN--------K-K---L---TK----W----------------------------EPE----N-FE-LEM-NTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLLLRYSKENDLVLDQFAGGG-TTLVEAKLLNRDIIGVDINEVSLERCREKVNFEHE---------G----------------------------SN----GKVYIHKGDARNLD---------FISDESIDFICTHPPYANIIQYS-DN---IEEDLSHL-KIPQFLEEMKKVAFESYRVLKNDKFCAVLMGDTRIKGYMQPMSFEVMKIFESEGFKLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFIFKK|----------------------------------------------------
      T504_RS0104020_Selenomonas_sp_ND2010_697204360                                  -------------------------------------------------MK----------K---I---KK----W----------------------------EPE----E-FE-LRM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDVALERCREKTDFEHE---------G----------------------------AN----GKVYLKKGDARNLS---------FIPDEHVDLICTHPPYADIIKYS-ED---IEEDLSRL-KIADFLEEMKKVAGECYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEVGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      NZ47_RS07170_Anaerovibrio_lipolyticus_746146226                                 ------------------------------------------------MAE----------KI--I---RK----W----------------------------EPD----D-FN-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSTENDLIIDQFAGGG-TTLVEAKLLNRDIIGVDVNENAITRCKEKIAFEHE---------G----------------------------AN----GKVSLYKGDARNLD---------FIDDESIDLICTHPPYADIIKYS-ED---IPEDLSLL-KVKDFLEEMKKVAAESYRILKKDKFCAILMGDTRKKGNMVPMSFGVMKIFEEAGFKLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      G598_RS0111930_Selenomonas_ruminantium_652371152                                ------------------------------------------------MAK----------K---I---TK----W----------------------------EPD----D-FE-LEM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDVALERCREKTNFEHE---------G----------------------------AN----GKVYLKKGDARNLS---------FISNEHVDLICTHPPYADIIKYS-ED---IEEDLSRL-KIADFLEEMKKVAGECYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFKK|TMK-------------------------------------------------
      Q428_RS06840_Fervidicella_metallireducens_737398024                             -----------------------------------------------------------------M---LD----------------------------------------N-FK-LET-TTIWSFKDRG-NYYTH--KGDYPGNWSPYVPRNIILRYSKENDVVLDQFAGSG-TTLIECRLLNRIGVGCDVNEVALKMAWERTKCIQS---------K----------------------------------SKTILLKRDARNLY---------DIKDSSIDLICTHPPYSNAIKYS-ED---IEEDISLL-EYDKFLNEIVKVASECYRVLKKGKYCALLIGDIRKNGYIKPLGYETLNKFLNQSFKLKEIIIKE----QHNCR-KTEYW---KE-----------ISI---KN--NFYLIAHEYLFVFQK|----------------------------------------------------
      HMPREF9629_RS00455_unclassified_Peptostreptococcaceae_497210069                 ------------------------------------------------MTK--------T-K---I---TK----W----------------------------EPE----N-FE-LEM-NTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLILRYSKENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDINDIALERCKEKTSFNYD---------G----------------------------AN----GKVYINKGDARNLS---------FIQDESIDFICTHPPYANIIRYS-EN---IEGDLSCC-KIPEFLKEMQKVANESYRVLKKEKFCAILMGDTRIKGNVQPMSFEVMKIFENTGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLFIFKK|V---------------------------------------------------
      DP68_RS13195_Clostridium_sp_HMP27_737327616                                     -----------------------------------------------------------------M---NS----------------------------------------I-KEELQC-TTIWSFKDRG-DWATH--KGDYPGNWSPYVPRNIILRYSREGDLVLDQFLGSG-TTAVEAALLNRKFIGIDINDNALLLASKRCSNYI----------N----------------------------------KNISIIKGDAKDLK---------DIKDETINLICTHPPYSNIIKYS-KY---NKDDISLL-SLEGYYKAMDKVAKECFRVLKGNSYCAILIGDTRKNGFIQPLGFNVMNSFINAGFILKEIIIKE----QHNCS-STKKW---IE-----------ISK---KR--NFLLIAHEYLFVFKK|---------ILKELPQ------------------------------------
      HMPREF0889_RS01300_Megasphaera_genomosp_type_1_496777936                        ------------------------------------------------MGK----------K---I---VK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSEENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNETALARCREKINFEHS---------G----------------------------AN----GKVYLYKGDARTLD---------FIKDNSIDLICTHPPYADIIKYS-ED---IEADLSHL-KVKDFLIAMRDVAAESYRVLKKDKFCAVLMGDTRQKGHMIPMSFEVMKLFQSAGFKLKELVIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      QX51_RS12035_Terrisporobacter_othiniensis_746722773                             -----------------------------------------------------------------M---NN----------------------------------------N----IET-TTIWSFPDRG-NWLTH--KGDYPGNWSPHIPKNIILRYSKEKDKVLDQFIGSG-TTLIETNRLNRIGIGSDINIEALKLCQIRVPQ------------N----------------------------------NKTYIRKQDARYLK---------LIKDNTIDLICTHPPYANIVKYS-DS---IKEDISLL-DFESYYESMKFVAQSCYRVLKPQKHCAILIGDTRKNGLIEPLGFNVMNIFLKEGFKLKEIIIKE----QHNCK-CTDKW---KE-----------LSK---QR--NFLLIAHEYLFIFKK|-----------D----------------------------------------
      G598_RS0113740_Selenomonas_ruminantium_652371446                                ------------------------------------------------MAK----------K---I---TK----W----------------------------EPD----D-FE-LEM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDVALERCREKTNFEHE---------G----------------------------AN----GKVYLKKGDARNLS---------FISNEHVDLICTHPPYADIIKYS-ED---IEEDLSRL-KIADFLEEMKKVAGECYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      HMPREF1039_RS02295_Megasphaera_sp_UPII_199-6_494632851                          ------------------------------------------------MGK----------K---I---VK----W----------------------------EPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSEENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNETALARCREKINFEHS---------G----------------------------AN----GKVYLYKGDARTLD---------FIKDNSIDLICTHPPYADIIKYS-ED---IEADLSHL-KVKDFLIAMRDVAAESYRVLKKDKFCAVLMGDTRQKGHMIPMSFEVMKLFQSAGFKLKELVIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      HMPREF9286_RS08365_Peptoniphilus_harei_492766054                                ------------------------------------------------MAN----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPQRG-NWATH--DAKWRGNWSPYIPRNIILRYSNEKDLILDQFAGGG-TTLVEAKLLNRNIFGIDVNDVALNRCKEKVDFEHV---------G----------------------------AD----GKVFLRKGDARNLD---------FIPDNSIDLICTHPPYANIIEYS-ED---IEEDLSRL-KIKDFLAEMKKVAAESYRVLKKDKFCAVLIGDTRQKGHMIPLSFYVMQIFEEAGFKMKEMIIKE----QHNCK-ATGFW---KT-----------NSI---KY--NFLLIAHEHLFIFRK|----------------------------------------------------
      SMON_RS05265_Streptobacillus_moniliformis_754175633                             -------------------------------------------------MN--------K-K---L---TK----W----------------------------EPE----N-FE-LEM-NTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLLLRYSKENDLVLDQFAGGG-TTLVEAKLLNRDIIGVDINEVSLERCREKVNFEHE---------G----------------------------SN----GKVYIHKGDARNLD---------FISDESIDFICTHPPYANIIQYS-DN---IEEDLSHL-KIPQFLEEMKKVAFESYRVLKNDKFCAVLMGDTRIKGYMQPMSFEVMKIFESEGFKLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFIFKK|----------------------------------------------------
      CD05_RS0101160_Ruminococcus_sp_NK3A76_655060651                                 -------------------------------------------------MK----------K---I---KK----W----------------------------EPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDCNDEALTRCREKIDFDYPP--------------------------------------AQ---GKVFLYKGDARDLY---------FQSDESVDLICTHPPYADIIKYS-DG---IPEDLSQL-KVKDFLEAMKPVAAECYRVLKKGKFCAVLMGDTRQKGCMIPMSFDVMKIFQEAGFTLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|----------------------------------------------------
      MR07_RS03745_Mycoplasma_ovis_568197217                                          --------------------------------------------MI--NRK--------------I---TK----W----------------------------EPE----N-FQ-LQT-NTLWSFPDRG-SWATH--DAKWRGNWSPYIPRNILLRYSKEGDLVLDQFAGGG-TTLVEAKLLNRNIIGIDVNGEAIKRCKEKIDFDYS---------N---------A------------------N-----GKVTTMKGDVRNLC---------FLDSDSIDLVCTHPPYADIIRYS-EG-KEIIEDLSNL-EINEFLSQMQLVAFECYRVLKKGKFCVILMEDTRKNGHMIPLSYKVMKIFEDKGFKLKELIIKV----QHNCK-TTGYW---AT-----------NSV---KY--NFLLIAHEYLFVFKK|----------------------------------------------------
      J144_RS0109740_Fusobacterium_hwasookii_657696170                                ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKINFEFE---------N----------------------------S-----GKVYINKGDARKLD---------FIKDESIDFVCTHPPYANIIEYS-EN---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|----------------------------------------------------
      MD85_RS04325_[Clostridium]_cellulosi_736835888                                  ------------------------------------------------MKR--------------I---VK----W----------------------------EPD----N-FE-LEM-NTVWSFPERG-NWATH--DAKYRGNWSPYIPRNLLLRYSSKGDLVLDQFVGGG-TTLVEAKLLGRNIIGVDVNPRALERCQEKIDFDYD---------N----------------------------A-----GEVYLYNGDARNLY---------FIKNESIDFICTHPPYANIIRYS-ED---IEADLSHL-NVKDFLVEMHKVASESFRVLKKNKFCAILMGDTRKRGHVIPMSFEVMKIFESAGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLFVFKK|----------------------------------------------------
      Q346_RS0100700_Mycoplasma_gallinarum_653082102                                  -------------------------------------------------MK----------K---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQENDLILDQFAGGG-TTLVEAKLLNRDIIGVDINDVAIERCKEKTAFDYQL--------------------------------------AT---GKVYIKKGDARNLD---------FIPDESIDLICTHPPYADIIKYS-EG---IDGDLSQL-KVKDFIEEMKKVASESYRVLKKDRFCAVLMGDTRQKGHMIPMSFDVMRVFEEAGFKLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|EEK-------------------------------------------------
      _Candidatus_Magnetobacterium_casensis_749954601                                 -----------------------------------------------------------------M---KT----I----------------------------TPK----G-FA-VEK-TTVWSFKSRG-TWATH--NGNYRGNWSPYIPRNVILKYSKMHDLVLDCFCGAG-TTGVECKLLGRNFIGIDINAAAIGLATENMDFDPG---------Q--DH-----D------------------D-----ANAELFVGDARDLN---------GIGDATVDLICAHPPYADIIRYT-HD---NDKDLSAY-GVGRFLDEIDKVARESYRVLKRSGHCAILIGDMRKNKNVIPLGFRTIERYLMAGFVLKELIIKR----QHNCK-TTGFW---YN-----------NSV---KY--NFLLLAHEYLAVFVK|----------------------------------------------------
      T263_RS0108015_Fusobacterium_nucleatum_696307008                                -------------------------------------MPK--------FND--------F-D---L---KN----W------------K---------------EYE----D----IYT-DTLWIIEKRD-NSGVH--TSKYHGNFVPQIPNQLFRRYTKKGEWILDPFLGSG-TSIIEAQRLGRNSIGIELQEDVLKEAYERILVEKS---------N----------------------------D-----CRGKLYIGDSKEIN-ISKILK--SNSIKKVQFIIFHPPYWDIIKFS-DK----ENDLSNSKSVEDFLSSLGKVVDNTTEYLEKNRYCSIVIGDKYENSQIVPLGFYCMNLFLERNFLLKAIIVKNFEETKGKRN-QKSIW---RY-----------RAL---AS--DFFIFKHEYIMVFKK|----I-N---------------------------------------------
      HMPREF1498_RS09070_Fusobacterium_sp_CM1_696259451                               -------------------------------------MPK--------FND--------F-D---L---KN----W------------K---------------EYE----D----IYT-DTLWIMEKRD-NSGVH--TSKYHGNFVPQIPNQLFRRYTKKGEWILDPFLGSG-TSIIEAQRLGRNSIGIELQEDVLKEAYERILVEKS---------N----------------------------D-----CRGKLYIGDSKEIN-ISKILK--SNSIKKVQFIIFHPPYWDIIKFS-DK----ENDLSNSKSVEDFLSSLGKVVDNTTEYLEKNRYCSIVIGDKYENSQIVPLGFYCMNLFLERNFLLKAIIVKNFEETKGKRN-QKSIW---RY-----------RAL---AS--DFFIFKHEYIMVFKK|----I-K---------------------------------------------
      G550_RS0101585_Megamonas_hypermegale_738301178                                  -------------------------------------------------------------------------------------------MYYTRGKKQVFIMDLK----N-FK-LET-TSVWSFPDRG-NWYTH--YGDYPGNWSPYVPRNLILKYSLEKEWILDQFMGSG-TTLIEAKLLNRNIIGTDINPKAYAITKSRLNFAYD---------S----------------------------T-----SHIHIRINDAQDLS---------FIKDNSISLICTHPPYANIIKYS-AD---IKNDLSLM-SYNSYFKAMAKVAKEAHRVLKNGRICAIMVADIRKNWKFIPLGHYVINEFLKVGFILKDIIIKE----QHNCK-SLIKW---SQ-----------REH---E----CYLIKHEYIFIFQK|---------------------ESR----------------------------
      BN531_00658_Eubacterium_sp_CAG:202_547826215                                    -------------------------------------MAK--------YND--------I-D---M---KH----W------------K---------------EYD----D----ILT-DTLWMFDKRD-NSGVH--SASYHGNFVPQIPNQLFRRYTKKGDWILDPFMGSG-TSLIEAQRLGRNSIGIELQEDVAKNTRNLLLEEKN---------N----------------------------Y-----TKGKIIIGDSRNVN-LSEKLI--SIGIKKVQFVIYHPPYWDIIKFS-DK----KEDLSNCLSLEDFLKSFGQVIDNTVPFLEKNRYCAVVIGDKYANSEIVPLGFHCMNLFIQKGLKLKAILVKNFEETKGKAN-QKAIW---RY-----------RAL---AS--DFFIFKHEYIFVFKN|----A-QK---RGK--------------------------------------
      K292_RS0108670_Anaerovorax_odorimutans_739513201                                ---------------------------------------------------------------------------------------------------MILH--PN----N-FE-LER-TTIWSFSERG-SWATH--SGGYRGNWSPYVPRNLILRYSKSNDWVLDQFLGSG-TSLIEAKLLGRNAIGVDINEQAINLASSNIEFKCL---------A----------------------------N-----SKICIRLADAKKLN---------FIKSESIDLICTHPPYANIIKYS-DE---IENDLSLL-SYEEFLRAMEDVALESYRVLKRQKVCSIMIGDIRKNGNVVPLGMEVMNIFLKIGFKSKEIIIKQ----QHNCS-STPYW---RN-----------KNN---E----FLMLAHEYIFIFEK|----------------------------------------------------
      H122_RS0107615_Clostridium_saccharoperbutylacetonicum_505207017                 ---------------------------------------------------------------------------M------------------------------D----N-FS-QEL-TSIWSFRDRG-DWNNH--KGDYPGNCSPRVIRNLLLKYTKENDTVLDQFLGSG-TTAIEVLLLNRKIIGIDINKKALDISNCRIKDLN---------------------------------------------GNKILKVGNAEKLE----------ISNETVNFICTHPPYLDIIKYS-KD---IEGDLSLL-NKVDFYTAIKNVANECYRVLKFKSKCAIIIGDVRKKGYIEPLGFNVMNIFLSTGFLLKEIIIKE----QHNCK-STDKW---KE-----------IAK---QK--NFLLIQHEYIFVFEK|------NY---FK---------------------------------------
      BN462_00619_Ruminococcus_sp_CAG:108_546656251                                   -------------------------------------MGK--------YND--------L-D---L---SQ----W------------R---------------EYT----D----IET-DSLWIIDKRD-NSGAH--SGHYHGNFVPQIPHQLFSRYTKKGNWILDPFMGSG-TSLIEAQRMGRNSIGIEIQHDVAKEAYDRIYTEKN---------D----------------------------V-----VRTKVVVADSQTCD-MNKILL--SEGINKVQFVIMHPPYWDIVKFS-EN----PNDLSNCDSINEFLDSFSKVIDNSLSVLEKNRYCAVVIGDKYANSQVIPLGFYCMNLIMEKGLLLKAILVKNFGETKGKSN-KQGIW---RY-----------RAL---AS--DFYIFKHEYIFVFKK|----V-K---------------------------------------------
      G594_RS0107390_Clostridium_paraputrificum_736866276                             ---------------------------------------------------------------------------M------------------------------D----S-FY-VEE-TSIWSFKDRG-DWATH--RGDYPGNCSPRVVRNLLIKYTKENDVILDQFLGGG-TTAIECLLLNRKIIGIDINKNAISITQDRTRKLN---------------------------------------------GDKSLYLGDAKKLN----------LQGESVDFICTHPPYLDIIKYS-NN---IKDDLSLL-RKEEFYSAMLEVATESFRVLKKHSRCAVIIGDVRKKGYIEPLGFTVMNIFINARFLLKEIIIKE----QHNCK-NTEKW---RE-----------IAK---KK--NFLLIQHEYIFVFEK|------SY---SN---------------------------------------
      BN584_00043_Clostridium_sp_CAG:277_548245492                                    -------------------------------------MAK--------YND--------L-D---P---KK----W------------K---------------EYS----D----INT-DSLWLIEKRD-NSGAH--SGDYHGNFVPQIPHQLFTRYTKKGDWILDPFMGSG-TSLIEAQRLGRNSIGIDLQPDVVQEAEERIRTEQR---------K----------------------------N-----CIVRTVTGDSRTVN-IEEVMS--SVGIDKLQFVMMHPPYWDIIKFS-DN----EKDLSNTSTLDEFLESFGQVIDNSTKYLEKNRYCACVIGDKYANSQVIPLGFYCMNQFMERGFLLKAILVKNFGETKGKAN-QQGIW---RY-----------RAI---TN--DFYIFKHEYIFVFKK|----V-K---------------------------------------------
      CHY_RS10255_Carboxydothermus_hydrogenoformans_499664364                         IANNYFVTQKFIKEVVKHS----QSLLKE--------EEK----NY-TPVN--------N-K---P---RT----W----------------------------APE----N-FS-LET-TTVWSFPDRG-SWATH--SGKYRGNWSPFIPRNIILRYSKEGEVVLDQFVGSG-TTLVEAKLLKRKGIGVDINPEAVSLTLKNTNFEIE---------E----------------------------G-----GEIEVRVGDARNLY---------FLKDESIDLICTHPPYSNIIKYS-DN---IEGDLSHF-DVNDFLLEMEKVAKECYRVLKKGKFCAILIGDTRRKGYIIPIGFSVMEIFRKIGFKLKEIIIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFVFKK|----------------------------------------------------
      CLOSCI_RS12555_[Clostridium]_scindens_490745127                                 ------------------------------------------------------------------------M--FLLSELGIILFYFN---GQKKKGEIFIREAPE----K-FK-LED-TTIWSFPERG-SWATH--SGKYRGNWSPYIPRNLILRYSKKKDWILDQFLGSG-TTLIEAKLLGRNAIGVDINSEAVKLSNTNLNFTCQ---------E----------------------------R-----SKIFTKQGNANNLS---------FIKDESIDLICTHPPYADIIRYS-KE---IPGDISHL-KYEKFLKELEQVARESYRVLKRQGICAIMIGDIRRKGYVLPLGMNSMQKFVEAGFKLKEIIIKE----QHNCR-SAYYW---EG-----------RER---K----FLMLAHEYIFILEK|-----------------TDCYNSM----------------------------
      YSBL_RS07445_Ruminiclostridium_thermocellum_489613307                           TAANNDVTCAFVKKVAKES----TICLEE--------KSK----SY-FADK--------L-N---I---KS----W----------------------------EPE----N-FN-LET-TTVWSFPDRG-DWATH--SGKYRGNWSPFIPRNVILRYSKEGETVLDQFVGSG-TTLVEAKLLKRKGIGVDINPEAVNLTCRNINFEKE---------D----------------------------C-----GETEVHVGDARHLG---------FIKDESIDLICTHPPYSNIIKYS-ED---IEGDLSHC-DINEFLVEMEKVAKESYRVLKKGRFCAILIGDTRRKGHMIPIGFNVMQTFLRAGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|----------------------------------------------------
      COPRO5265_1445_Coprothermobacter_proteolyticus_DSM_5265_206738469               VAQKNSLTERFVKGVLGYR----TL-VKE--------EMT----NY-NVSK--------A-K---T---EI----W----------------------------EPE----D-FS-LET-TTVWGFPDRG-DWATH--SGKYRGNWSPYIPRNVILRYSNENDVVLDQFVGSG-TTLVEAKLLGRRGLGVDINPDAVKLALSNVNFEHK--------------------------------------C-----GLADVHIGDARNLD---------FVKDSSIDLICTHPPYSNIIKYS-DN---IEGDLSHY-DIPEFLKEMYKVASESYRVLKRGRFCAVLMGDTRRKGNIIPLGFRVMEVFCKAGLTLKEIVIKE----QHNCT-STGYW---KK-----------QSI---KY--NFLLIAHEYLFIFKK|----------------------------------------------------
      consensus/100%                                                                  ...........................................................................................................................a.h..ps....s+...s.a.ush.P...p..h..ao.....lls.h.G....p..bs....R..........sh..s.............................................................s....................hp.hh.HPPY.shl.as...........u.......a...h.....p....hp....hsh.h.D.........hs...h..h....h.....lhK.....b.....................................h....aE....h.p|....................................................
      consensus/95%                                                                   ........................................................................................................................so.ash..Ru.pa.oH...spa.Gsa.Pb.s+.hhb+ao...-.lLs.hhG.G.TshlEsbbL.Rp.huhDls..sh..s.pp.........................................................ssspph.............spphp.lh.HPPY.shl.ao.p.......DlS.h.p...a...h..hh.c..RlLK....hsl.hGD.+.p..h.Phu...hp.hb..GF.hc-.llK.....b...p............................p....Fhhh.HEalh.hbK|....................................................
      consensus/90%                                                                   .................................................................................................................h......so.ash..Ru.pWuTH...spa+Gsa.Pb.sRphll+Yop..-.lLs.hhG.G.TshlEsblL.Rp.lGhDlN..ul.bsbpp..h................................................p.....uDucpl.............spphc.lhsHPPY.shlpYS.p.......DlS.h.ph.pa...h.plh.E.aRlLK.sp.hsl.hGD.Rbp..h.Phua.hhp.abp.GF.LcE.llK.....Q..hp.s..b.......................p...sFhhh.HEalh.hcK|....................................................
      consensus/85%                                                                   ...........................................................................h.....................................ap.bp..so.Wsh.pRu.pWATH...upa+GNWsPblsRpllL+Yopp.-.lLD.hhGuG.TohlEsbLL.Rp.lGlDlN..ul.bsbpphpa................................................p..l..uDu+pLp..........h.spolDhlhoHPPY.sIIpYS.p....l..DLS.h.ph.cFh..hppVh.E.aRVLK.s+bhslhhGD.Rbp..h.Phua.hhp.abp.GF.L+E.lIK.....Q+php.sp.hW.................s....p...sFhlh.HEalhlFcK|....................................................
      consensus/80%                                                                   ...........................................................................h.............................sp....p.Fp.lp..sohWsFPpRG.pWATH..puca+GNWsPblsRsllL+Yopc.-.lLD.hhGuG.TThlEsbLL.Rp.IGlDlN..Al.bsbcphpFp...............................................c.bl..uDARpLp..........l.spolDhlhoHPPY.sIIpYS.p....lp.DLS.h.plpcFh.bhpcVh.EsaRVLK.s+bCulhhGDsRbp..hlPhua.lhp.Fbp.GF.L+E.IIKb....Q+pCp.spshW................ps....pb..sFhll.HEalalF+K|....................................................
      consensus/75%                                                                   ......................................................................p....h.............................sp....p.Fp.lc..oThWoFPcRG.pWATH..sucaRGNWuPblsRNllL+Yopcs-.lLDpFhGuG.TTllEsKLLsRp.IGlDlN..Al.bsbcphsFp..............................................sc.bl..GDARpLs.........blpspolDhIhTHPPYhsIIpYS.ps...lp.DLSph.plp-FhpbhpcVhpEsaRVLK.s+aCullhGDsRbps.hlPluaplhphFbp.GF.LKEbIIKb....Q+pCp.upshW...p............ps....pb..sFhLlhHEalFlF+K|....................................................
      consensus/70%                                                                   ......................................................................p....h............................pPp....p.Fp.LE..oThWSFPcRG.pWATH..sucaRGNWuPblPRNlILRYopcsDhlLDpFhGuG.TTLlEAKLLsRphIGlDlN..AlpbsbcplsFp.p............................................schbl.bGDARpLs.........blpDpSIDhIhTHPPYhsIIcYS.cs...lc.DLSph.plp-Flpchc+VupEsaRVLKbs+aCAlLhGDsRbcsahlPluFpVMplFbpsGF.LKEhIIKb....QHNC+.usshW...pp...........ps....ch..sFhLlhHEaLFlF+K|....................................................
      
      
      Back to Contents
    • General notes, Phyletic distribution and gene neighborhoods of the Group I, Clade 4 adenine methylases (Fungal N6- DNA MTases)

      General notes

      The novel Fungal adenine methylase is present in a variety of early branching fungi such as Spizellomyces punctatus ( a euchytrid), Conidiobolus coronatus (Entomophthorales), Coemansia reversa (Zygomycota proper), the Mucoromycotina and Rhizophagus irregularis( glomeromycota). The phyletic pattern suggests that the domain was acquired early in an ancestral fungus and lost independently on multiple occasions including in Batrachochytridum, Ascomycetes and Basidiomycetes. Fungal DAMs are fused to multiple domains including Chromo, DNMT3-like finger, ZZ finger, PHD, GATA, AT-hooks and KRI domain, suggesting a strong association with chromatin. The c-terminal methylase lacks the characteristic PPY motif in strand-4 suggesting that it is inactive. The KRI domain is inserted between strands-3 and 4 of the C-terminal methylase. These methylases are characterized by a HPPY motif in strand-4. Additional conserved structural elements and motifs include a very typical strand-helix unit before the methylase core which contains a highly conserved histidine and arginine between the two elements. Additionally they contain a D** in strand -1, E in helix before strand-2 and R** before strand-2, Universal D** after strand-2 HPPY motif in strand-4, R**xxK motif in helix before strand-5, D after strand-5, K after strand-6, HE** before strand -7 and K** after strand-7.
      
      # 1; Eukaryotic versions
      GI                Architecture                                                                                          Gene_name                    Len   Taxonomy                             Species                                         Genbank                            
      Bcir1000010688    DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI)                       Bcir1000010688               2188  eukaryota>fungi>mucoromycotina       Backusella circina                              estExt_fgenesh1_pm.C_3310003
      Uram1000000474    GATA(frag)+N6-MTase+N6-MTase(KRI)                                                                     Uram1000000474               482   eukaryota>fungi>mucoromycotina       Umbelopsis ramanniana                           fgenesh1_kg.2_#_546_#_combest_scaffold_2_37280
      Crev1000002507    CHROMO+CHROMO+CHROMO+AT-hook+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI)                         Crev1000002507               2200  eukaryota>fungi>kickxellomycotina    Coemansia reversa                               fgenesh1_pg.10_#_86
      552908586         GATA+N6-MTase++N6-MTase(KRI)                                                                          GLOINDRAFT_316719            649   eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 181602             hypothetical protein GLOINDRAFT_316719 [Rhizophagus irregularis DAOM 181602].
      595436939         CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase++N6-MTase(KRI)                                RirG_262840                  2470  eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 197198w            hypothetical protein RirG_262840 [Rhizophagus irregularis DAOM 197198w].
      Ccor1000008322    ZZ+GATA+N6-MTase++N6-MTase(KRI)                                                                       Ccor1000008322               597   eukaryota>fungi>entomophthoromycota  Conidiobolus coronatus                          CE35304_1103
      Spun1000004719    CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI)                                        Spun1000004719               1591  eukaryota>fungi>chytridiomycota      Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (1591 aa)
      Pbla1000013272    DNMT3-Trebleclef+CHROMO+CHROMO+AT-hook+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI)        Pbla1000013272               2382  eukaryota>fungi>basal                Phycomyces blakesleeanus                        estExt_fgeneshPB_pg.C_100091
      671688888         DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI)                LRAMOSA00608                 2236  eukaryota>fungi                      Absidia idahoensis var. thermophila             hypothetical protein LRAMOSA00608 [Absidia idahoensis var. thermophila].
      511008850         CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+N6-MTase+MTase(KRI)                                         HMPREF1544_03082             1600  eukaryota>fungi                      Mucor circinelloides f. circinelloides 1006PhL  hypothetical protein HMPREF1544_03082 [Mucor circinelloides f. circinelloides 1006PhL].
      758351301         CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ-like+GATA+N6-MTase+MTase(KRI)                               MAM1_0127c06017              1640  eukaryota>fungi                      Mucor ambiguus                                  DNA N6-MTase N-4 [Mucor ambiguus].
      729708575         DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+SAM-N6-MTase+MTase(KRI)               RMCBS344292_09167            1919  eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMCBS344292_09167 [Rhizopus microsporus].
      729703045         DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTaseMTase(KRI)                    RMCBS344292_14260            1878  eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMCBS344292_14260 [Rhizopus microsporus].
      384485890         DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+SAM-N6-MTase+MTase(KRI)               RO3G_02774                   1914  eukaryota>fungi                      Rhizopus delemar RA 99-880                      hypothetical protein RO3G_02774 [Rhizopus delemar RA 99-880].
      727142291         ZZ+PHD+ZZ+GATA+N6-MTaseMTase(KRI)                                                                     RMATCC62417_10446            1039  eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMATCC62417_10446 [Rhizopus microsporus].
      727142293         ZZ+ZZ+GATA+N6-MTaseMTase(KRI)                                                                         RMATCC62417_10446            861   eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMATCC62417_10446 [Rhizopus microsporus].
      727142292         ZZ+ZZ+GATA+N6-MTaseMTase(KRI)                                                                         RMATCC62417_10446            856   eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMATCC62417_10446 [Rhizopus microsporus].
      661176173         DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase MTase(KRI)                   LCOR_11540.1                 2275  eukaryota>fungi                      Lichtheimia corymbifera JMRC:FSU:9682           dna N6-MTase [Lichtheimia corymbifera JMRC:FSU:9682].
      672819038         CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+MTase(KRI)                                           MVEG_09762                   2147  eukaryota>fungi                      Mortierella verticillata NRRL 6337              hypothetical protein MVEG_09762 [Mortierella verticillata NRRL 6337].
      758369443         CHROMO+CHROMO+ZZ+GATA+SAM-N6-MTase+MTase(KRI)                                                         PARPA_01280.1                2216  eukaryota>fungi                      Parasitella parasitica                          hypothetical protein [Parasitella parasitica].
      
      
      # 146; Bacterial homologs                                                                                                                                                                                                                                   
      GI           Gene neighborhoods                                                                                                	Arch         Pfam-arch                                          Gene name                    Len   Taxonomy                                     Species name                                        Genbank
      557946003    N6-MTase*-><-DpnII<-DpnII<-DpnM-N6-MTase<-Phage_integrase                                                           N6-MTase    UPF0020+N6_N4_Mtase                                MBMB1_0500                   364   archaea>euryarchaeota                        Methanobacterium sp. MB1                            DNA methylase N-4/N-6 domain protein [Methanobacterium sp. MB1].                           <-557945996_?||557945997_?-><-557945998_?<-557945999_?<-557946000_?<-557946001_?<-557946002_?||557946003_N6-MTase*-><-557946004_DpnII<-557946005_DpnII<-557946006_DpnM-N6-MTase<-557946007_Phage_integrase<-557946008_?||557946009_?-><-557946010_?
      499219160    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        TVN0442                      347   archaea>euryarchaeota                        Thermoplasma volcanium                              DNA methyltransferase [Thermoplasma volcanium].                                            <-13541266_?||13541267_?-><-13541268_?<-13541269_?<-13541270_?||13541271_?->13541272_?-><-499219160_N6-MTase*||13541274_?->13541275_?-><-13541276_?<-13541277_?||13541278_?->13541279_?->13541280_?->
      546147902    N6-MTase*->                                                                                                         N6-MTase    Methyltransf_26                                    AMDU1_APLC00004G0008         346   archaea>euryarchaeota                        environmental samples                               MULTISPECIES: DNA adenine modification methylase [environmental samples].                  546151579_?-><-546151580_?<-546147897_?<-546151581_?<-546147899_?||546147900_?->546147901_?->546147902_N6-MTase*-><-546151582_?<-546147904_?<-546147905_?<-546147906_?<-546151583_?<-546151584_?||546147910_?->
      500164270    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            Smar_0588                    342   archaea>crenarchaeota                        Staphylothermus marinus                             DNA methylase [Staphylothermus marinus].                                                   126465487_?->126465488_?-><-126465489_?<-126465490_?<-126465491_?<-126465492_?<-126465493_?<-500164270_N6-MTase*||126465495_?->126465496_?->126465497_?->126465498_?-><-126465499_?||126465500_?->126465501_?->
      731481703    <-N6-MTase*||DpnII->                                                                                                N6-MTase    N6_N4_Mtase+Methyltransf_26                        Mpt1_c10100                  341   archaea>euryarchaeota                        Candidatus Methanoplasma termitum                   modification methylase MjaII [Candidatus Methanoplasma termitum].                          731481696_?-><-731481697_?<-731481698_?<-731481699_?||731481700_?->731481701_?->731481702_?-><-731481703_N6-MTase*||731481704_DpnII-><-731481705_?<-731481706_?<-731481707_?<-731481708_?<-731481709_?<-731481710_?
      504550199    N6-MTase*->                                                                                                         N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            TCELL_0627                   340   archaea>crenarchaeota                        Thermogladius cellulolyticus                        DNA methylase [Thermogladius cellulolyticus].                                              <-389860942_?||389860943_?->389860944_?->389860945_?-><-389860946_?<-389860947_?||389860948_?->504550199_N6-MTase*->389860950_?->389860951_?->389860952_?->389860953_?-><-389860954_?<-389860955_?||389860956_?->
      502907573    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            Shell_0210                   338   archaea>crenarchaeota                        Staphylothermus hellenicus                          DNA methylase [Staphylothermus hellenicus].                                                <-297526219_?||297526220_?->297526221_?-><-297526222_?||297526223_?->297526224_?->297526225_?-><-502907573_N6-MTase*||297526227_?->297526228_?-><-297526229_?<-297526230_?<-297526231_?<-297526232_?<-297526233_?
      530785168    -                                                                                                                   N6-MTase    Methyltransf_26                                                                 330   archaea>crenarchaeota                        Thermofilum sp. 1910b                               hypothetical protein [Thermofilum sp. 1910b].                                              
      737178046    <-N6-MTase*<-DpnM-N6-MTase<-?<-Spermidine-synthase                                                                  N6-MTase    SP+N6_N4_Mtase+Methyltransf_26                     Y919_RS08595                 330   bacteria>firmicutes                          Caloranaerobacter azorensis                         DNA methylase N-4 [Caloranaerobacter azorensis].                                           737178066_?-><-737178067_?<-737178040_?<-737178041_?<-737178042_?<-737178044_?<-737178046_N6-MTase*<-737178049_DpnM-N6-MTase<-737178051_?<-737178052_Spermidine-synthase<-737178053_?<-737178054_?||737178056_?->737178057_?->
      489613307    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    SP+Methyltransf_26                                 YSBL_RS07445                 329   bacteria>firmicutes                          Ruminiclostridium thermocellum                      DNA methylase N-4 [Ruminiclostridium thermocellum].                                        489613325_?-><-489613323_?<-489613321_?<-489613315_?<-489613313_?<-489613311_?<-489613309_?<-489613307_N6-MTase*<-489613306_DpnII<-489613301_DpnM-N6-MTase||489613300_?-><-489613298_?<-489613295_?<-489613292_?<-739434939_?
      490598869    <-N6-MTase*<-DpnM-N6-MTase||?-><-?<-?<-?||?->METHYLASE->                                                            N6-MTase    SP+Methyltransf_26                                 CTHBC1_RS07545               329   bacteria>firmicutes                          Ruminiclostridium thermocellum                      DNA methylase N-4 [Ruminiclostridium thermocellum].                                        490598858_?-><-490598859_?<-489613321_?<-500163438_?<-500163439_?<-490598866_?<-490598867_?<-490598869_N6-MTase*<-489613301_DpnM-N6-MTase||489613300_?-><-553726118_?<-490598882_?<-739435388_?||739445830_?->553726119_METHYLASE->
      499664364    <-DpnII<-N6-MTase*<-DpnM-N6-MTase                                                                                   N6-MTase    SP+Methyltransf_26                                 CHY_RS10255                  329   bacteria>firmicutes                          Carboxydothermus hydrogenoformans                   DNA methylase N-4 [Carboxydothermus hydrogenoformans].                                     <-753782109_?<-499664357_?<-499664359_?<-499664360_?<-499664361_?<-736527008_?<-753782391_DpnII<-499664364_N6-MTase*<-753782392_DpnM-N6-MTase<-499664366_?<-499664367_?||499664368_?-><-499664369_?<-499664370_?<-499664371_?
      653611723    <-ABC-ATPase<-?<-?||?-><-?<-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase<-?<-Methylase                                     N6-MTase    SP+Methyltransf_26                                 CLOCLA_RS0112375             329   bacteria>firmicutes                          [Clostridium] clariflavum                           DNA methylase N-4 [[Clostridium] clariflavum].                                             <-653611718_ABC-ATPase<-653611719_?<-504022761_?||653611720_?-><-759895769_?<-653611721_?<-653611722_?<-653611723_N6-MTase*<-653611724_DpnII<-653611725_DpnM-N6-MTase<-653611726_?<-504022770_Methylase<-504022771_?||653611727_?->653611728_?->
      740456070    <-N6-MTase*<-DpnM-N6-MTase                                                                                          N6-MTase    SP+N6_N4_Mtase+Methyltransf_26                     JCM21531_RS04440             329   bacteria>firmicutes                          [Clostridium] straminisolvens                       DNA methylase N-4 [[Clostridium] straminisolvens].                                         <-740456184_?<-740456070_N6-MTase*<-740456072_DpnM-N6-MTase<-740456075_?<-740456077_?<-740456079_?<-740456080_?<-740456187_?<-740456082_?
      206738469    <-METHYLASE<-?<-PLD+SFII-helicase<-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase                                            N6-MTase    Methyltransf_26                                    COPRO5265_1445               327   bacteria>firmicutes                          Coprothermobacter proteolyticus DSM 5265            DNA methylase N-4 [Coprothermobacter proteolyticus DSM 5265].                              639380273_?-><-206737760_?<-206739175_METHYLASE<-206737708_?<-206738620_PLD+SFII-helicase<-639380274_?<-206738701_?<-206738469_N6-MTase*<-206738522_DpnII<-206738073_DpnM-N6-MTase<-206738705_?<-639380275_?||639380276_?-><-639380277_?<-206737751_?
      526884977    <-N6-MTase*                                                                                                         N6-MTase    Methyltransf_26                                    CSUB_C0599                   325   archaea                                      Candidatus Caldiarchaeum subterraneum               DNA methylase N-4/N-6 [Candidatus Caldiarchaeum subterraneum].                             557694027_?-><-557694028_?<-557694029_?||557694030_?->557694031_?->557694032_?-><-557694033_?<-526884977_N6-MTase*<-557694035_?<-557694036_?||557694037_?-><-557694038_?<-557694039_?<-557694040_?<-557694041_?
      496184606    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    SP+N6_N4_Mtase+Methyltransf_26                     CAAU_RS08905                 322   bacteria>firmicutes                          Caloramator australicus                             DNA methylase N-4 [Caloramator australicus].                                               749868668_?->496184600_?->496184601_?->496184602_?->496184603_?->749868670_DpnM-N6-MTase->496184605_DpnII->496184606_N6-MTase*-><-496184607_?||749868035_?->749868780_?->496184610_?->749868781_?->496184614_?->749868037_?->
      503327799    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            Desmu_0935                   319   archaea>crenarchaeota                        Desulfurococcus mucosus                             DNA methylase [Desulfurococcus mucosus].                                                   320101121_?->320101122_?->320101123_?->320101124_?->320101125_?->320101126_?->320101127_?-><-503327799_N6-MTase*||320101129_?-><-320101130_?||320101131_?-><-320101132_?||320101133_?-><-320101134_?<-320101135_?
      756971970    <-N6-MTase*||?->?->?->?->REase-4->                                                                                  N6-MTase    Methyltransf_26                                    I774_RS01625                 319   archaea                                      Aigarchaeota archaeon JGI 0000106-J15               DNA methylase [Aigarchaeota archaeon JGI 0000106-J15].                                     756971964_?-><-756971969_?<-756971970_N6-MTase*||756971971_?->756971965_?->756971966_?->756971967_?->756971968_REase-4->756971972_?->
      504580152    <-ABC-ATPase||N6-MTase*-><-?||?->?->?->?->?->Pribosyltran->                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        Desfe_0448                   318   archaea>crenarchaeota                        Desulfurococcus fermentans                          DNA methylase [Desulfurococcus fermentans].                                                390938183_?->390938184_?-><-390938185_?||390938186_?->390938187_?-><-390938188_?<-390938189_ABC-ATPase||504580152_N6-MTase*-><-390938191_?||390938192_?->390938193_?->390938194_?->390938195_?->390938196_?->390938197_Pribosyltran->
      501637311    <-Pribosyltran<-?<-?<-?<-?||?->?-><-N6-MTase*||?->ABC-ATPase->                                                      N6-MTase    N6_N4_Mtase+Methyltransf_26                        DKAM_0485                    317   archaea>crenarchaeota                        Desulfurococcus kamchatkensis                       DNA methylase [Desulfurococcus kamchatkensis].                                             <-218883789_Pribosyltran<-218883790_?<-218883791_?<-218883792_?<-218883793_?||218883794_?->218883795_?-><-501637311_N6-MTase*||218883797_?->218883798_ABC-ATPase->218883799_?-><-218883800_?<-218883801_?<-218883802_?||218883803_?->
      756979360    <-Pribosyltran<-?<-?<-?<-?||?-><-N6-MTase*||ABC-ATPase->                                                            N6-MTase    N6_N4_Mtase+Methyltransf_26                        SPHMEL_RS03490               317   archaea>crenarchaeota                        Desulfurococcus amylolyticus                        DNA methylase [Desulfurococcus amylolyticus].                                              <-756979355_?<-756979356_Pribosyltran<-756979357_?<-756979358_?<-756979917_?<-501637305_?||756979359_?-><-756979360_N6-MTase*||756979361_ABC-ATPase->756979362_?-><-756979363_?<-756979918_?<-756979364_?||756979365_?-><-756979366_?
      504370902    N6-MTase*->                                                                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        FFONT_0867                   314   archaea>crenarchaeota                        Fervidicoccus fontis                                DNA methylase N-4 [Fervidicoccus fontis].                                                  385805902_?-><-385805903_?<-385805904_?||385805905_?->385805906_?-><-385805907_?<-385805908_?||504370902_N6-MTase*-><-385805910_?||385805911_?->385805912_?->385805913_?-><-385805914_?<-385805915_?||385805916_?->
      206739986    <-ABC-ATPase<-DpnII<-N6-MTase*                                                                                      N6-MTase    Methyltransf_26                                    DICTH_1800                   311   bacteria>dictyoglomi                         Dictyoglomus thermophilum H-6-12                    putative DNA methylase [Dictyoglomus thermophilum H-6-12].                                 <-206740402_?<-206739665_?||206740399_?-><-206739566_?<-206740100_?<-206741023_ABC-ATPase<-206740340_DpnII<-206739986_N6-MTase*<-206741073_?<-206739739_?<-206739820_?||206739604_?-><-206740154_?||206740432_?->206739563_?->
      502895170    N6-MTase*->?->DCM->                                                                                                 N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            Tagg_1290                    311   archaea>crenarchaeota                        Thermosphaera aggregans                             DNA methylase [Thermosphaera aggregans].                                                   <-296243011_?<-296243012_?||296243013_?->296243014_?->296243015_?-><-296243016_?<-296243017_?||502895170_N6-MTase*->296243019_?->296243020_DCM->296243021_?-><-296243022_?||296243023_?-><-296243024_?<-296243025_?
      643385301    <-DpnM-N6-MTase<-DpnII<-N6-MTase*                                                                                   N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            D891_RS0103440               306   bacteria>proteobacteria>deltaproteobacteria  Hippea sp. KM1                                      DNA methylase [Hippea sp. KM1].                                                            643385287_?->643385289_?->643385291_?->643385293_?->643385295_?-><-737585466_DpnM-N6-MTase<-643385299_DpnII<-643385301_N6-MTase*||643385304_?->643385305_?->643385306_?->737585802_?->661256721_?-><-643385310_?<-643385312_?
      502864863    N6-MTase*->                                                                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        Metin_0423                   305   archaea>euryarchaeota                        Methanocaldococcus infernus                         DNA methylase N-4 [Methanocaldococcus infernus].                                           <-296109101_?<-296109102_?<-296109103_?<-296109104_?||296109105_?->296109106_?-><-296109107_?||502864863_N6-MTase*->296109109_?-><-296109110_?<-296109111_?<-296109112_?<-296109113_?<-296109114_?<-296109115_?
      754082338    <-ABC-ATPase<-DpnII<-N6-MTase*                                                                                      N6-MTase    UPF0020                                            DICTH_RS08720                304   bacteria>dictyoglomi                         Dictyoglomus thermophilum                           DNA methylase [Dictyoglomus thermophilum].                                                 754082337_?->501542015_?->501542972_?-><-501542137_?<-501542672_?<-501543600_ABC-ATPase<-501542913_DpnII<-754082338_N6-MTase*<-501543650_?<-501542311_?<-501542392_?<-501542727_?<-754082042_?<-754082049_?<-754082339_?
      746331486    N6-MTase*-><-DpnII<-DpnII<-DpnM-N6-MTase<-Phage_integrase                                                           N6-MTase    UPF0020+N6_N4_Mtase                                MBMB1_RS02390                303   archaea>euryarchaeota                        Methanobacterium sp. MB1                            hypothetical protein [Methanobacterium sp. MB1].                                           746330837_?-><-746331483_?<-566004529_?||566004530_?-><-566004531_?<-566004532_?<-566004533_?||746331486_N6-MTase*-><-746331489_DpnII<-746331493_DpnII<-746331496_DpnM-N6-MTase<-746331499_Phage_integrase<-566004541_?||566004542_?-><-566004543_?
      290559536    N6-MTase*->DpnM-N6-MTase->                                                                                          N6-MTase    N6_N4_Mtase+Methyltransf_26                        BJBARM5_0369                 297   archaea                                      Candidatus Parvarchaeum acidophilus ARMAN-5         Methyltransferase type 11 [Candidatus Parvarchaeum acidophilus ARMAN-5].                   290559529_?-><-290559530_?||290559531_?-><-290559532_?<-290559533_?<-290559534_?<-290559535_?||290559536_N6-MTase*->290559537_DpnM-N6-MTase-><-290559538_?<-290559539_?||290559540_?->290559541_?->290559542_?-><-290559543_?
      374851611    N6-MTase*->                                                                                                         N6-MTase    Methyltransf_26                                    HGMM_F16H05C22               293   bacteria>aquificae                           uncultured Aquificae bacterium                      DNA methylase [uncultured Aquificae bacterium].                                            <-374851604_?<-374851605_?||374851606_?-><-374851607_?||374851608_?->374851609_?->374851610_?->374851611_N6-MTase*-><-374851612_?<-374851613_?<-374851614_?<-374851615_?<-374851616_?||374851617_?->374851618_?->
      502729540    N6-MTase*->                                                                                                         N6-MTase    Methyltransf_26                                    HYDTH_RS09530                293   bacteria>aquificae                           Hydrogenobacter thermophilus                        DNA methylase N-4 [Hydrogenobacter thermophilus].                                          <-502729533_?||502729534_?-><-502729535_?||502729536_?->502729537_?->502729538_?->502729539_?->502729540_N6-MTase*-><-502729541_?<-502729542_?<-502729543_?<-502729544_?<-502729545_?<-502729546_?||502729547_?->
      551115149    -                                                                                                                   N6-MTase    Methyltransf_26                                                                 292   bacteria                                     Candidatus Calescibacterium nevadense               RNA methyltransferase [Candidatus Calescibacterium nevadense].                             
      518679720    N6-MTase*-><-DpnM-N6-MTase                                                                                          N6-MTase    N6_N4_Mtase+Methyltransf_26                        FACI_IFERC00001G0455         284   archaea>euryarchaeota                        Ferroplasma acidarmanus                             hypothetical protein [Ferroplasma acidarmanus].                                            518651809_?-><-518651810_?||518651811_?->518651812_?-><-518651813_?<-518651814_?||518651815_?->518679720_N6-MTase*-><-518651817_DpnM-N6-MTase<-518651818_?<-518651819_?<-518651820_?<-518651821_?<-518651822_?||518651823_?->
      383110164    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    Ferpe_1711                   278   bacteria>thermotogae                         Fervidobacterium pennivorans DSM 9078               DNA methylase [Fervidobacterium pennivorans DSM 9078].                                     383110157_?->383110158_?->383110159_?->383110160_?->383110161_?->383110162_DpnM-N6-MTase->383110163_DpnII->383110164_N6-MTase*->383110165_?->383110166_?-><-383110167_?<-383110168_?<-383110169_?<-383110170_?<-383110171_?
      546149073    ASCH->?->?->?->?->?->?->N6-MTase*->                                                                                 N6-MTase    N6_N4_Mtase                                        AMDU3_IPLC00003G0029         272   archaea>euryarchaeota                        Thermoplasmatales archaeon I-plasma                 hypothetical protein [Thermoplasmatales archaeon I-plasma].                                546149066_ASCH->546149067_?->546149068_?->546149069_?->546149070_?->546149071_?->546149072_?->546149073_N6-MTase*->546149074_?->546149075_?->546149076_?-><-546149077_?||546149078_?-><-546149079_?<-546149080_?
      490745127    N6-MTase*->?->DpnM-N6-MTase->                                                                                       N6-MTase    N6_N4_Mtase+Methyltransf_11                        CLOSCI_RS12555               271   bacteria>firmicutes                          [Clostridium] scindens                              DNA methylase N-4 [[Clostridium] scindens].                                                490745118_?->490745119_?->748651615_?->490745121_?->490745122_?->496543445_?->490745127_N6-MTase*->490745128_?->490745129_DpnM-N6-MTase->490745130_?->490745132_?->490745133_?->490745134_?->748651604_?->
      496543443    <-DpnM-N6-MTase<-?<-N6-MTase*                                                                                       N6-MTase    N6_N4_Mtase+Methyltransf_11                        HMPREF0993_RS14405           271   bacteria>firmicutes                          Lachnospiraceae bacterium 5_1_57FAA                 DNA methylase N-4 [Lachnospiraceae bacterium 5_1_57FAA].                                   <-496543439_?<-496543440_?<-496543441_?<-769170250_?<-496543442_?<-490745129_DpnM-N6-MTase<-490745128_?<-496543443_N6-MTase*<-496543445_?<-490745122_?<-496543446_?<-769170261_?<-490745119_?<-496543448_?<-496543449_?
      333759614    MACRODOMAIN->?->?->?->?->?->?->N6-MTase*->DpnM-N6-MTase->?->N6-MTase->DpnII->Primase+SNF+PLD->                      N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF9124_1256              270   bacteria>firmicutes                          Oribacterium sp. oral taxon 108 str. F0425          putative DNA (cytosine-5-)-methyltransferase [Oribacterium sp. oral taxon 108 str. F0425]. 333759038_MACRODOMAIN->333759543_?->333759315_?->333758850_?->333758658_?->333759566_?->333759020_?->333759614_N6-MTase*->333760338_DpnM-N6-MTase->333758772_?->333760010_N6-MTase->333760591_DpnII->333760705_Primase+SNF+PLD->333758828_?->333759066_?->
      701167223    DpnM-N6-MTase->DpnII->N6-MTase*-><-?<-ABC-ATPase                                                                    N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            NA23_RS09565                 268   bacteria>thermotogae                         Fervidobacterium islandicum                         DNA methylase N-4 [Fervidobacterium islandicum].                                           701167713_?->701167218_?->701167219_?-><-701167220_?<-701167221_?||701167714_DpnM-N6-MTase->701167222_DpnII->701167223_N6-MTase*-><-701167224_?<-701167715_ABC-ATPase||701167225_?-><-701167226_?||737436023_?->701167227_?->701167228_?->
      493349121    <-N6-MTase*<-MutH                                                                                                   N6-MTase    Methyltransf_26                                    L21TH_RS00440                267   bacteria>firmicutes                          Caldisalinibacter kiritimatiensis                   sensor protein fixL [Caldisalinibacter kiritimatiensis].                                   493349112_?->493349115_?->736405791_?-><-493349117_?<-493349118_?<-493349119_?<-493349120_?<-493349121_N6-MTase*<-493349122_MutH
      504218926    N6-MTase-><-?||?-><-?<-?<-?||DpnM-N6-MTase->N6-MTase*-><-?||?->MACRODOMAIN->?->Ploop+REase->                        N6-MTase    UPF0020+N6_N4_Mtase                                Mtc_1443                     267   archaea>euryarchaeota                        Methanocella conradii                               RNA methyltransferase [Methanocella conradii].                                             383319865_N6-MTase-><-383319866_?||383319867_?-><-383319868_?<-383319869_?<-383319870_?||383319871_DpnM-N6-MTase->504218926_N6-MTase*-><-383319873_?||383319874_?->383319875_MACRODOMAIN->383319876_?->383319877_Ploop+REase->383319878_?->383319879_?->
      503063371    <-ABC-ATPase||?-><-?<-?<-?<-?<-N6-MTase*<-DpnII<-DpnII<-DpnM-N6-MTase                                               N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            TTHE_RS09375                 266   bacteria>firmicutes                          Thermoanaerobacterium thermosaccharolyticum         RNA methyltransferase [Thermoanaerobacterium thermosaccharolyticum].                       <-503063364_?<-503063365_ABC-ATPase||503063366_?-><-503063367_?<-503063368_?<-503063369_?<-753831842_?<-503063371_N6-MTase*<-753831957_DpnII<-753831958_DpnII<-503063372_DpnM-N6-MTase<-503063373_?<-503063374_?||753831959_?->503063376_?->
      547826215    <-N6-MTase*<-?<-DpnII-like-REase||?-><-?<-ABC-ATPase                                                                N6-MTase    N6_N4_Mtase+Methyltransf_26                        BN531_00658                  266   bacteria>firmicutes                          Eubacterium sp. CAG:202                             DNA modification methylase [Eubacterium sp. CAG:202].                                      547826208_?->547826209_?->547826210_?->547826211_?->547826212_?->547826213_?->547826214_?-><-547826215_N6-MTase*<-547826216_?<-547826217_DpnII-like-REase||547826218_?-><-547826219_?<-547826220_ABC-ATPase<-547826221_?<-547826222_?
      752594110    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    FERPE_RS08390                263   bacteria>thermotogae                         Fervidobacterium pennivorans                        DNA methylase N-4, partial [Fervidobacterium pennivorans].                                 504265091_?->504265092_?->504265093_?->504265094_?->504265095_?->752594108_DpnM-N6-MTase->752594109_DpnII->752594110_N6-MTase*->504265099_?->504265100_?-><-504265101_?<-504265102_?<-504265103_?<-504265104_?<-504265105_?
      546656251    <-N6-MTase*<-DpnII-like-REase                                                                                       N6-MTase    Methyltransf_26                                    BN462_00619                  262   bacteria>firmicutes                          Ruminococcus sp. CAG:108                            DNA methylase N-4/N-6 domain protein [Ruminococcus sp. CAG:108].                           546656244_?-><-546656245_?<-546656246_?<-546656247_?||546656248_?->546656249_?->546656250_?-><-546656251_N6-MTase*<-546656252_DpnII-like-REase<-546656253_?<-546656254_?<-546656255_?<-546656256_?<-546656257_?<-546656258_?
      548245492    DpnII-like-REase->N6-MTase*->                                                                                       N6-MTase    N6_N4_Mtase                                        BN584_00043                  262   bacteria>firmicutes                          Clostridium sp. CAG:277                             DNA adenine modification methylase [Clostridium sp. CAG:277].                              <-548245485_?<-548245486_?<-548245487_?<-548245488_?<-548245489_?<-548245490_?||548245491_DpnII-like-REase->548245492_N6-MTase*-><-548245493_?<-548245494_?<-548245495_?<-548245496_?||548245497_?->548245498_?-><-548245499_?
      696259451    <-N6-MTase*<-DpnII-like-REase                                                                                       N6-MTase    N6_N4_Mtase                                        HMPREF1498_RS09070           262   bacteria>fusobacteria                        Fusobacterium sp. CM1                               hypothetical protein [Fusobacterium sp. CM1].                                              <-696259449_?||696259450_?-><-696259451_N6-MTase*<-696259452_DpnII-like-REase
      696307008    <-N6-MTase*<-DpnII-like-REase                                                                                       N6-MTase    N6_N4_Mtase                                        T263_RS0108015               262   bacteria>fusobacteria                        Fusobacterium nucleatum                             hypothetical protein [Fusobacterium nucleatum].                                            <-658616850_?<-658616852_?<-658616855_?<-658616858_?<-492627213_?<-696307007_?<-658616862_?<-696307008_N6-MTase*<-658616866_DpnII-like-REase
      703200955    <-N6-MTase*                                                                                                         N6-MTase    SP+Methyltransf_26                                 N469_RS0107485               262   bacteria                                     Marinimicrobia                                      MULTISPECIES: DNA methylase N-4, partial [Marinimicrobia].                                 <-703200955_N6-MTase*||551208956_?->551208955_?->655262737_?-><-551208953_?<-661311063_?<-703200953_?<-661311064_?
      489963634    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    Methyltransf_26                                    TTHWC1_RS07155               259   bacteria>firmicutes                          Thermoanaerobacter                                  MULTISPECIES: RNA methyltransferase [Thermoanaerobacter].                                  490535680_?-><-490535682_?<-490535684_?<-490535685_?<-490535686_?<-489966042_?||490535687_?-><-489963634_N6-MTase*<-490535690_DpnII<-490535692_DpnM-N6-MTase||489963637_?-><-490535694_?<-490535698_?<-490535699_?<-490535700_?
      490206260    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    Methyltransf_26                                    H17AP60334_RS10630           259   bacteria>thermotogae                         Thermosipho africanus                               RNA methyltransferase [Thermosipho africanus].                                             740200843_?->490206258_?-><-490206260_N6-MTase*<-490206262_DpnII<-490206263_DpnM-N6-MTase||490206264_?->490206265_?->490206266_?->490206268_?->490206269_?->
      501003459    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    Methyltransf_26                                    TMEL_RS01630                 259   bacteria>thermotogae                         Thermosipho melanesiensis                           RNA methyltransferase [Thermosipho melanesiensis].                                         <-501003453_?<-501003454_?||752786045_?->501003455_?->501003456_?->501003457_?->752786296_?-><-501003459_N6-MTase*<-501003460_DpnII<-501003461_DpnM-N6-MTase<-501003463_?<-752786297_?<-501003465_?<-501003466_?<-501003467_?
      502759633    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    THIT_RS02440                 259   bacteria>firmicutes                          Thermoanaerobacter italicus                         RNA methyltransferase [Thermoanaerobacter italicus].                                       502759626_?->502759627_?->502759628_?->502759629_?->502759630_?->502759631_DpnM-N6-MTase->502759632_DpnII->502759633_N6-MTase*-><-502759634_?||502759635_?->502759636_?->502759637_?->502759638_?->502759639_?->502759640_?->
      658480004    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    Methyltransf_26                                    M663_RS0111020               259   bacteria>firmicutes                          Thermoanaerobacter sp. A7A                          DNA methylase N-4 [Thermoanaerobacter sp. A7A].                                            <-658479990_?<-658479992_?<-658479994_?<-658479996_?<-658479998_?<-658480000_?||658480002_?-><-658480004_N6-MTase*<-658480006_DpnII<-658480009_DpnM-N6-MTase<-658480011_?<-658480013_?<-658480016_?<-658480018_?<-658480020_?
      658542249    -                                                                                                                   N6-MTase    Methyltransf_26                                                                 259   bacteria                                     EM3 bacterium JGI 0000106-B10                       hypothetical protein [EM3 bacterium JGI 0000106-B10].                                      
      694165517    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    TKV_c04890                   259   bacteria>firmicutes                          Thermoanaerobacter kivui                            DNA methylase N-4/N-6 domain protein [Thermoanaerobacter kivui].                           694165510_?->694165511_?->694165512_?->694165513_?->694165514_?->694165515_DpnM-N6-MTase->694165516_DpnII->694165517_N6-MTase*-><-694165518_?||694165519_?->694165520_?->694165521_?->694165522_?->694165523_?-><-694165524_?
      757582161    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    THYS13_RS12835               259   bacteria>firmicutes                          Thermoanaerobacter sp. YS13                         DNA methylase N-4 [Thermoanaerobacter sp. YS13].                                           757582157_?->757582158_?->757582317_?->757582318_?->757582319_?->757582159_DpnM-N6-MTase->757582160_DpnII->757582161_N6-MTase*-><-757582162_?||757582163_?->757582164_?->757582165_?->757582166_?->757582320_?-><-757582167_?
      658522169    N6-MTase*->?->?->?->?->N6-MTase->                                                                                   N6-MTase    Methyltransf_26                                    I825_RS0102520               258   bacteria                                     Atribacteria bacterium SCGC AAA255-G05              DNA methylase N-4, partial [Atribacteria bacterium SCGC AAA255-G05].                       <-658522163_?<-658522164_?||658522165_?->658522166_?-><-658522167_?||658522168_?->658522169_N6-MTase*->658522170_?->658522171_?->658522172_?->658522173_?->658522174_N6-MTase->658522176_?->658522177_?->
      738305516    <-N6-MTase*                                                                                                         N6-MTase    SP+Methyltransf_26                                 Q355_RS15275                 258   bacteria>deinococci                          Meiothermus cerbereus                               DNA methylase N-4, partial [Meiothermus cerbereus].                                        654400345_?->654400346_?->654400347_?->654400348_?->654400349_?->654400350_?-><-738305515_?<-738305516_N6-MTase*<-654400352_?<-738305505_?<-654400354_?<-738305517_?||738305518_?->654400372_?-><-654400356_?
      504218923    DpnM-N6-MTase->N6-MTase*-><-?||?-><-?<-?<-?||DpnM-N6-MTase->N6-MTase->                                              N6-MTase    UPF0020+N6_N4_Mtase                                Mtc_1434                     256   archaea>euryarchaeota                        Methanocella conradii                               RNA methyltransferase [Methanocella conradii].                                             383319858_?->383319859_?-><-383319860_?<-383319861_?<-383319862_?<-383319863_?||383319864_DpnM-N6-MTase->504218923_N6-MTase*-><-383319866_?||383319867_?-><-383319868_?<-383319869_?<-383319870_?||383319871_DpnM-N6-MTase->383319872_N6-MTase->
      517992866    <-DpnM-N6-MTase<-?<-N6-MTase*                                                                                       N6-MTase    N6_N4_Mtase+Methyltransf_11                        HGRM_RS15055                 255   bacteria>firmicutes                          Ruminococcus sp. JC304                              DNA methylase N-4 [Ruminococcus sp. JC304].                                                <-517992859_?<-517992860_?<-517992861_?<-517992862_?<-517992863_?<-517992864_DpnM-N6-MTase<-517992865_?<-517992866_N6-MTase*<-769258210_?<-517992871_?<-517992872_?<-517992873_?||769258211_?-><-517992878_?<-517992879_?
      503374808    <-N6-MTase*                                                                                                         N6-MTase    SP+Methyltransf_26                                 MSUIS_RS03915                253   bacteria>tenericutes                         Mycoplasma suis                                     DNA methylase N-4 [Mycoplasma suis].                                                       <-503374801_?<-503374802_?<-503374803_?<-503374804_?<-503374805_?<-503374806_?<-503374807_?<-503374808_N6-MTase*<-503374809_?<-503374810_?<-763029221_?||503374812_?->503374813_?-><-503374814_?<-503374815_?
      738301178    N6-MTase*->DpnM-N6-MTase->DpnII-><-?<-?||METHYLASE->                                                                N6-MTase    Methyltransf_26                                    G550_RS0101585               253   bacteria>firmicutes                          Megamonas hypermegale                               DNA methylase N-4 [Megamonas hypermegale].                                                 654417979_?->654417980_?->654417981_?->654417982_?-><-654417983_?<-654417984_?<-738301202_?||738301178_N6-MTase*->654417986_DpnM-N6-MTase->738301180_DpnII-><-738301182_?<-738301184_?||654417990_METHYLASE->654417991_?->654417992_?->
      742671763    -                                                                                                                   N6-MTase    Methyltransf_26                                                                 253   bacteria                                     Gracilibacteria bacterium JGI 0000069-K10           hypothetical protein [Gracilibacteria bacterium JGI 0000069-K10].                          
      757126172    <-N6-MTase*<-DpnII<-DpnM-N6-MTase||?-><-?<-?<-ABC-ATPase                                                            N6-MTase    Methyltransf_26                                    MB41_RS06705                 253   bacteria>firmicutes                          Anaerosalibacter sp. ND1                            DNA methylase N-4 [Anaerosalibacter sp. ND1].                                              <-757126039_?<-757126041_?<-757126043_?<-757126045_?<-757126047_?||757126049_?->757126051_?-><-757126172_N6-MTase*<-757126053_DpnII<-757126055_DpnM-N6-MTase||757126057_?-><-757126059_?<-757126061_?<-757126063_ABC-ATPase<-757126065_?
      406901678    <-N6-MTase*<-?<-?<-DpnII<-?<-?<-DpnM-N6-MTase                                                                       N6-MTase    Methyltransf_26                                    ACD_71C00187G0001            251   bacteria                                     uncultured bacterium (gcode 4)                      hypothetical protein ACD_71C00187G0001 [uncultured bacterium (gcode 4)].                   <-406901678_N6-MTase*<-406901679_?<-406901680_?<-406901681_DpnII<-406901682_?<-406901683_?<-406901684_DpnM-N6-MTase||406901685_?->
      505364599    -                                                                                                                   N6-MTase    SP+Methyltransf_26                                                              251   bacteria>proteobacteria>betaproteobacteria   Taylorella asinigenitalis                           DNA methylase [Taylorella asinigenitalis].                                                 
      568197217    N6-MTase*->                                                                                                         N6-MTase    Methyltransf_26                                    MR07_RS03745                 251   bacteria>tenericutes                         Mycoplasma ovis                                     DNA methylase N-4 [Mycoplasma ovis].                                                       <-568197213_?<-568197214_?<-763016747_?<-763016748_?<-763016749_?||763016750_?-><-568197216_?||568197217_N6-MTase*-><-763016751_?||568197219_?->763016752_?->568197221_?-><-568197222_?||568197223_?-><-568197224_?
      652371152    <-N6-MTase*<-DpnII                                                                                                  N6-MTase    UPF0020+N6_N4_Mtase                                G598_RS0111930               251   bacteria>firmicutes                          Selenomonas ruminantium                             DNA methylase N-4 [Selenomonas ruminantium].                                               <-739504480_?<-739504482_?<-652371147_?<-652371148_?<-652371149_?||652371150_?-><-652371151_?<-652371152_N6-MTase*<-652371153_DpnII
      740284293    <-N6-MTase*<-DpnII<-HTH+DpnM-N6-MTase                                                                               N6-MTase    SP+Methyltransf_26                                 HMPREF1504_RS04555           251   bacteria>firmicutes                          Veillonella sp. ICM51a                              DNA methylase N-4 [Veillonella sp. ICM51a].                                                <-740284278_?<-740284279_?<-740284280_?||740284281_?->740284286_?->740284289_?-><-740284291_?<-740284293_N6-MTase*<-740284295_DpnII<-740284300_HTH+DpnM-N6-MTase<-740284301_?<-740284302_?<-740284306_?<-491520967_?<-491525280_?
      754482757    <-N6-MTase*||?->?-><-DpnM-N6-MTase                                                                                  N6-MTase    UPF0020                                            I759_RS06660                 251   archaea>euryarchaeota                        Euryarchaeota archaeon SCGC AAA252-I15              hypothetical protein [Euryarchaeota archaeon SCGC AAA252-I15].                             754482742_?->754482744_?->754482746_?->754482749_?->754482751_?->754482753_?->754482755_?-><-754482757_N6-MTase*||754482934_?->754482760_?-><-754482937_DpnM-N6-MTase<-754482763_?||754482765_?->754482767_?->754482770_?->
      497210069    <-ParB<-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                             N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF9629_RS00455           250   bacteria>firmicutes                          unclassified Peptostreptococcaceae (miscellaneous)  MULTISPECIES: DNA methylase N-4 [unclassified Peptostreptococcaceae (miscellaneous)].      <-497210062_?<-497210063_?<-497210064_?<-497210065_?<-497210066_?<-497210067_?<-497210068_ParB<-497210069_N6-MTase*<-497210070_DpnII<-497210071_DpnM-N6-MTase<-497210072_?<-497210073_?<-497210074_?<-497210075_?<-497210076_?
      653082102    <-N6-MTase*<-?<-?<-MutH<-HTH+DpnM-N6-MTase                                                                          N6-MTase    Methyltransf_26                                    Q346_RS0100700               250   bacteria>tenericutes                         Mycoplasma gallinarum                               DNA methylase N-4 [Mycoplasma gallinarum].                                                 653082096_?->653082097_?->653082098_?->738487736_?->653082099_?->653082100_?-><-653082101_?<-653082102_N6-MTase*<-653082103_?<-738487739_?<-653082105_MutH<-653082106_HTH+DpnM-N6-MTase||653082107_?->653082108_?->653082109_?->
      268315129    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        Smon_1038                    249   bacteria>fusobacteria                        Streptobacillus moniliformis DSM 12112              putative RNA methylase [Streptobacillus moniliformis DSM 12112].                           <-268315122_?<-268315123_?||268315124_?-><-268315125_?<-268315126_?<-268315127_?<-268315128_?<-268315129_N6-MTase*<-268315130_?<-268315131_?<-268315132_?<-268315133_?<-268315134_?<-268315135_?<-268315136_?
      490232946    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF1586_RS06065           249   bacteria>actinobacteria                      Gardnerella vaginalis                               DNA methylase N-4 [Gardnerella vaginalis].                                                 <-514910845_?<-514910846_?<-490232946_N6-MTase*<-490232945_DpnII<-514910847_DpnM-N6-MTase<-696236856_?
      491499778    <-ABC-ATPase<-?<-Pribosyltran<-?||?->?->?-><-N6-MTase*<-?<-DpnII<-?<-DpnM-N6-MTase                                  N6-MTase    N6_N4_Mtase+Methyltransf_26                        G397_RS0107800               249   bacteria>firmicutes                          [Eubacterium] siraeum                               DNA methylase N-4 [[Eubacterium] siraeum].                                                 <-491494962_ABC-ATPase<-491494960_?<-491494958_Pribosyltran<-491494955_?||518492361_?->518492362_?->491499780_?-><-491499778_N6-MTase*<-491499771_?<-491499769_DpnII<-491499767_?<-491499766_DpnM-N6-MTase<-769258050_?||491499757_?->491499748_?->
      505332319    -                                                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                                                     249   bacteria>firmicutes                          [Eubacterium] siraeum                               DNA methylase [[Eubacterium] siraeum].                                                     
      515155565    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF1583_RS06235           249   bacteria>actinobacteria                      Gardnerella vaginalis                               hypothetical protein [Gardnerella vaginalis].                                              <-696236014_?<-515155563_?<-515155565_N6-MTase*<-515155567_DpnII<-490232944_DpnM-N6-MTase<-515155570_?||696235466_?->515155571_?-><-515155573_?<-515155575_?
      515278394    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF1580_RS03105           249   bacteria>actinobacteria                      Gardnerella vaginalis                               hypothetical protein [Gardnerella vaginalis].                                              515278393_?->490232944_DpnM-N6-MTase->490232945_DpnII->515278394_N6-MTase*->490232948_?->515278396_?->
      547820585    <-N6-MTase*<-?<-DpnII<-Mrr_cat<-HTH+DpnM-N6-MTase                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        BN647_00380                  249   bacteria>firmicutes                          Firmicutes bacterium CAG:41                         DNA methylase [Firmicutes bacterium CAG:41].                                               <-547820578_?<-547820579_?<-547820580_?||547820581_?->547820582_?->547820583_?-><-547820584_?<-547820585_N6-MTase*<-547820586_?<-547820587_DpnII<-547820588_Mrr_cat<-547820589_HTH+DpnM-N6-MTase<-547820590_?<-547820591_?<-547820592_?
      548315511    <-N6-MTase*<-DpnII<-HTH+DpnM-N6-MTase                                                                               N6-MTase    Methyltransf_26                                    BN720_00766                  249   bacteria>firmicutes                          Eubacterium sp. CAG:581                             DNA methylase [Eubacterium sp. CAG:581].                                                   548315508_?->548315509_?-><-548315510_?<-548315511_N6-MTase*<-548315512_DpnII<-548315513_HTH+DpnM-N6-MTase
      652339649    <-N6-MTase*||DpnM-N6-MTase->wHTH+REase-DpnII->                                                                      N6-MTase    SP+Methyltransf_26                                 LEPGO_RS0100700              249   bacteria>fusobacteria                        Leptotrichia goodfellowii                           DNA methylase N-4 [Leptotrichia goodfellowii].                                             652339642_?->652339643_?->652339644_?->652339645_?->652339646_?->652339647_?->738097603_?-><-652339649_N6-MTase*||652339650_DpnM-N6-MTase->652339651_wHTH+REase-DpnII->652339652_?-><-652339653_?<-652339654_?<-652339655_?<-652339656_?
      746146226    <-METHYLASE<-?<-?<-?||?-><-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase<-?<-ABC-ATPase<-?<-ABC-ATPase                      N6-MTase    Methyltransf_26                                    NZ47_RS07170                 249   bacteria>firmicutes                          Anaerovibrio lipolyticus                            DNA methylase N-4 [Anaerovibrio lipolyticus].                                              <-746146211_METHYLASE<-746146240_?<-746146214_?<-746146217_?||653148288_?-><-746146221_?<-653148286_?<-746146226_N6-MTase*<-746146229_DpnII<-746146242_DpnM-N6-MTase<-746146231_?<-746146245_ABC-ATPase<-746146233_?<-746146235_ABC-ATPase<-746146249_?
      406967845    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    ACD_28C00322G0004            248   bacteria                                     uncultured bacterium                                hypothetical protein ACD_28C00322G0004 [uncultured bacterium].                             406967842_?->406967843_DpnM-N6-MTase->406967844_DpnII->406967845_N6-MTase*->406967846_?->
      488666942    <-N6-MTase*<-HNH<-?<-DpnII<-?<-HTH+DpnM-N6-MTase                                                                    N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF1090_RS26280           248   bacteria>firmicutes                          [Clostridium] clostridioforme                       hypothetical protein [[Clostridium] clostridioforme].                                      488666936_?-><-488666938_?||740461489_?->740461506_?->740461507_?->740461509_?-><-740461490_?<-488666942_N6-MTase*<-740461510_HNH<-488666944_?<-488666945_DpnII<-488666946_?<-488666947_HTH+DpnM-N6-MTase<-488666948_?<-488666949_?
      490963643    DpnM-N6-MTase->DpnII->N6-MTase*-><-?||ABC-ATPase->                                                                  N6-MTase    Methyltransf_26                                    HMPREF9131_RS04375           248   bacteria>firmicutes                          Peptoniphilus                                       MULTISPECIES: DNA methylase N-4 [Peptoniphilus].                                           496704092_?->496703981_?->496704035_?->496704093_?-><-496704027_?||738855338_DpnM-N6-MTase->496703967_DpnII->490963643_N6-MTase*-><-490963751_?||496704011_ABC-ATPase->496704063_?->496704052_?->496703991_?->
      490965414    DpnM-N6-MTase->wHTH+REase-DpnII->N6-MTase*->                                                                        N6-MTase    Methyltransf_26                                    HMPREF1630_RS01030           248   bacteria>firmicutes                          Anaerococcus lactolyticus                           DNA methylase N-4 [Anaerococcus lactolyticus].                                             739466286_?->490965420_?->490965419_?->490965418_?->739466289_?->739466291_DpnM-N6-MTase->739466292_wHTH+REase-DpnII->490965414_N6-MTase*->
      492766054    <-N6-MTase*<-wHTH+REase-DpnII<-DpnM-N6-MTase                                                                        N6-MTase    Methyltransf_26                                    HMPREF9286_RS08365           248   bacteria>firmicutes                          Peptoniphilus harei                                 DNA methylase N-4 [Peptoniphilus harei].                                                   <-492766020_?<-492766045_?<-492765924_?<-492766041_?<-492766025_?<-750245934_?||492766021_?-><-492766054_N6-MTase*<-492766035_wHTH+REase-DpnII<-492766033_DpnM-N6-MTase<-492766005_?||492766011_?->492766051_?-><-492765968_?<-492766003_?
      493205676    HTH+DpnM-N6-MTase->DpnII->?->N6-MTase*->                                                                            N6-MTase    Methyltransf_26                                    SELSP_RS03130                248   bacteria>firmicutes                          Selenomonas sputigena                               DNA methylase N-4 [Selenomonas sputigena].                                                 <-493205689_?||493205688_?->493205687_?->493205686_?->493205685_HTH+DpnM-N6-MTase->739508203_DpnII->493205681_?->493205676_N6-MTase*->503505981_?-><-503505982_?||493205670_?->493205663_?-><-493205364_?<-493205363_?<-493205362_?
      494632851    <-N6-MTase*<-DpnII<-DpnM-N6-MTase<-Radical_SAM                                                                      N6-MTase    Methyltransf_26                                    HMPREF1039_RS02295           248   bacteria>firmicutes                          Megasphaera sp. UPII 199-6                          DNA methylase N-4 [Megasphaera sp. UPII 199-6].                                            494632843_?-><-494632817_?||494632828_?->494632849_?-><-494632855_?<-494632825_?<-494632827_?<-494632851_N6-MTase*<-494632821_DpnII<-494632835_DpnM-N6-MTase<-494632831_Radical_SAM<-494632848_?<-494632839_?<-494632818_?<-494632853_?
      494634458    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF1040_RS02760           248   bacteria>firmicutes                          Megasphaera sp. UPII 135-E                          DNA methylase N-4 [Megasphaera sp. UPII 135-E].                                            494634456_?->494634450_?-><-738304381_?||738304383_?-><-738304385_?||494634454_DpnM-N6-MTase->494634445_DpnII->494634458_N6-MTase*->494634451_?->738304386_?->494634457_?-><-494634453_?<-494634452_?||494634448_?->
      496777936    <-N6-MTase*<-DpnII<-DpnM-N6-MTase<-Radical_SAM                                                                      N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF0889_RS01300           248   bacteria>firmicutes                          Megasphaera genomosp. type_1                        DNA methylase N-4 [Megasphaera genomosp. type_1].                                          494632843_?-><-496777933_?||494632828_?->496777934_?-><-496777935_?<-494632825_?<-494632827_?<-496777936_N6-MTase*<-496777937_DpnII<-496777938_DpnM-N6-MTase<-494632831_Radical_SAM<-494632848_?<-494632839_?<-496777939_?<-494632853_?
      517953989    DpnM-N6-MTase->wHTH+REase-DpnII->N6-MTase*->                                                                        N6-MTase    Methyltransf_26                                    HGPG_RS01070                 248   bacteria>firmicutes                          Peptoniphilus grossensis                            DNA methylase N-4, partial [Peptoniphilus grossensis].                                     517953982_?-><-517953983_?<-517953984_?<-517953985_?||517953986_?->517953987_DpnM-N6-MTase->517953988_wHTH+REase-DpnII->517953989_N6-MTase*-><-517953990_?||517953991_?->517953992_?->738841560_?->738841562_?->738841762_?->517953996_?->
      547265226    Radical_SAM-><-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                      N6-MTase    Methyltransf_26                                    BN820_00869                  248   bacteria>proteobacteria>alphaproteobacteria  Acetobacter sp. CAG:977                             sensor protein fixL [Acetobacter sp. CAG:977].                                             547265219_?->547265220_?->547265221_?->547265222_?-><-547265223_?<-547265224_?||547265225_Radical_SAM-><-547265226_N6-MTase*<-547265227_DpnII<-547265228_DpnM-N6-MTase||547265229_?-><-547265230_?<-547265231_?||547265232_?->
      547865125    DpnM-N6-MTase->?->DpnII->?->N6-MTase*-><-?<-?<-?||?->Pribosyltran->?->ABC-ATPase->                                  N6-MTase    Methyltransf_26                                    BN788_01674                  248   bacteria>firmicutes                          Eubacterium siraeum CAG:80                          DNA methylase [Eubacterium siraeum CAG:80].                                                <-547865118_?<-547865119_?||547865120_?->547865121_DpnM-N6-MTase->547865122_?->547865123_DpnII->547865124_?->547865125_N6-MTase*-><-547865126_?<-547865127_?<-547865128_?||491494955_?->547865129_Pribosyltran->547865130_?->505332324_ABC-ATPase->
      652371446    <-N6-MTase*                                                                                                         N6-MTase    UPF0020+N6_N4_Mtase                                G598_RS0113740               248   bacteria>firmicutes                          Selenomonas ruminantium                             DNA methylase N-4 [Selenomonas ruminantium].                                               652371439_?-><-652371440_?<-652371441_?<-652371442_?<-739504782_?<-652371444_?<-652371445_?<-652371446_N6-MTase*
      652933168    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        G497_RS0101670               248   bacteria>proteobacteria>deltaproteobacteria  Desulfovibrio cuneatus                              DNA methylase N-4 [Desulfovibrio cuneatus].                                                652933161_?->652933162_?->652933163_?-><-652933164_?<-652933165_?||652933166_?->652933167_?-><-652933168_N6-MTase*<-652933169_DpnII<-652933170_DpnM-N6-MTase<-652933171_?<-652933172_?<-652933173_?<-652933174_?<-652933175_?
      654856343    <-N6-MTase*<-wHTH+REase-DpnII<-DpnM-N6-MTase                                                                        N6-MTase    UPF0020+N6_N4_Mtase                                K364_RS0114940               248   bacteria>firmicutes                          Desulfitibacter alkalitolerans                      DNA methylase N-4 [Desulfitibacter alkalitolerans].                                        <-654856335_?||654856336_?-><-654856337_?<-654856338_?<-737286368_?<-654856340_?<-654856341_?<-654856343_N6-MTase*<-737286550_wHTH+REase-DpnII<-654856345_DpnM-N6-MTase<-654856346_?<-654856347_?<-654856348_?<-654856349_?<-654856350_?
      737952516    HTH+DpnM-N6-MTase->DpnII->N6-MTase*-><-?<-?<-?<-?||N6-MTase->                                                       N6-MTase    Methyltransf_26                                    FUSO3_RS09465                248   bacteria>fusobacteria                        Fusobacterium necrophorum                           DNA methylase N-4 [Fusobacterium necrophorum].                                             737971124_?->737952519_HTH+DpnM-N6-MTase->737952518_DpnII->737952516_N6-MTase*-><-737952514_?<-737952511_?<-737952508_?<-492764123_?||492765822_N6-MTase->492764117_?->737971127_?->
      754175633    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        SMON_RS05265                 248   bacteria>fusobacteria                        Streptobacillus moniliformis                        DNA methylase N-4 [Streptobacillus moniliformis].                                          <-502622237_?<-502622238_?||502622239_?-><-502622240_?<-502622241_?<-502622242_?<-502622243_?<-754175633_N6-MTase*<-754175102_?<-502622246_?<-502622247_?<-502622248_?<-502622249_?<-502622250_?<-502622251_?
      406986924    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    ACD_18C00096G0009            247   bacteria                                     uncultured bacterium                                hypothetical protein ACD_18C00096G0009 [uncultured bacterium].                             <-406986917_?<-406986918_?||406986919_?->406986920_?->406986921_?->406986922_DpnM-N6-MTase->406986923_DpnII->406986924_N6-MTase*->406986925_?->
      492571686    <-N6-MTase*<-Mrr_cat<-DpnII                                                                                         N6-MTase    UPF0020+N6_N4_Mtase                                FNV_RS04020                  247   bacteria>fusobacteria                        Fusobacterium nucleatum                             DNA methylase N-4 [Fusobacterium nucleatum].                                               740585784_?->492571673_?-><-492571675_?<-492571678_?<-492571680_?<-492571682_?<-740585788_?<-492571686_N6-MTase*<-492571690_Mrr_cat<-740585790_DpnII
      492605690    <-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                                                   N6-MTase    SP+Methyltransf_26                                 HMPREF1501_RS03285           247   bacteria>fusobacteria                        Fusobacterium sp. OBRC1                             DNA methylase N-4 [Fusobacterium sp. OBRC1].                                               492596573_?-><-696257839_?<-696257842_?<-696257846_?<-696257853_?<-696257855_?<-696257856_?<-492605690_N6-MTase*<-696257867_DpnII<-696257868_DpnM-N6-MTase<-696257859_?<-492605693_?||696257861_?->
      492656844    <-N6-MTase*<-Mrr_cat<-DpnII<-DpnM-N6-MTase<-PHP+DNApolIIIa<-?<-?<-SNF                                               N6-MTase    UPF0020+N6_N4_Mtase                                HMPREF1497_RS08290           247   bacteria>fusobacteria                        Fusobacterium                                       MULTISPECIES: DNA methylase N-4 [Fusobacterium].                                           492652534_?-><-496079217_?<-552904728_?<-552904729_?<-552904730_?<-552904731_?<-696263596_?<-492656844_N6-MTase*<-492571690_Mrr_cat<-492656846_DpnII<-552904732_DpnM-N6-MTase<-492653383_PHP+DNApolIIIa<-552904733_?<-492653385_?<-492653388_SNF
      495977000    SNF->?->PHP+DNApolIIIa->?->?->DpnM-N6-MTase->DpnII->N6-MTase*->                                                     N6-MTase    N6_N4_Mtase+UPF0020+N6_N4_Mtase                    FSDG_RS00930                 247   bacteria>fusobacteria                        Fusobacterium nucleatum                             DNA methylase N-4 [Fusobacterium nucleatum].                                               495977008_SNF->495977006_?->495977005_PHP+DNApolIIIa->511541204_?->495977003_?->495977002_DpnM-N6-MTase->492632699_DpnII->495977000_N6-MTase*->696266057_?->495976997_?->495976996_?->495976995_?->495976994_?->511541205_?-><-495976992_?
      496069501    <-Ploop+REase||DpnM-N6-MTase->DpnII->N6-MTase*->                                                                    N6-MTase    SP+Methyltransf_26                                 FSAG_RS07575                 247   bacteria>fusobacteria                        Fusobacterium periodonticum                         DNA methylase N-4 [Fusobacterium periodonticum].                                           496069507_?->496069506_?->496069505_?->492793635_?-><-496069504_Ploop+REase||496069503_DpnM-N6-MTase->496069502_DpnII->496069501_N6-MTase*->737488074_?->496069499_?->496069498_?->496069497_?->492810497_?->496069496_?->496069495_?->
      496073207    <-N6-MTase*<-DpnII<-DpnM-N6-MTase<-PHP+DNApolIIIa<-?<-?<-SNF                                                        N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF0405_RS07885           247   bacteria>fusobacteria                        Fusobacterium nucleatum                             DNA methylase N-4 [Fusobacterium nucleatum].                                               496073200_?-><-496073201_?<-496073202_?<-496073203_?<-496073204_?<-496073205_?<-740654570_?<-496073207_N6-MTase*<-496073208_DpnII<-496073209_DpnM-N6-MTase<-496073210_PHP+DNApolIIIa<-496073211_?<-644872653_?<-496073213_SNF<-492573569_?
      496296625    -                                                                                                                   N6-MTase    N6_N4_Mtase+UPF0020+N6_N4_Mtase                                                 247   bacteria>fusobacteria                        Fusobacterium nucleatum                             DNA methylase N-4 [Fusobacterium nucleatum].                                               
      496969638    Mrr_cat->N6-MTase*->                                                                                                N6-MTase    N6_N4_Mtase+Methyltransf_26                        HMPREF9093_RS05275           247   bacteria>fusobacteria                        Fusobacterium sp. oral taxon 370                    DNA methylase N-4 [Fusobacterium sp. oral taxon 370].                                      <-496969577_?<-496969579_?<-737521001_?<-737521008_?<-496969609_?||737521003_?->496969633_Mrr_cat->496969638_N6-MTase*->496969641_?->496969644_?->496969660_?-><-496969666_?<-496969669_?<-496969672_?<-737521006_?
      503262746    DpnM-N6-MTase->DpnII->?->N6-MTase*->                                                                                N6-MTase    Methyltransf_26                                    Q388_RS0120175               247   bacteria>firmicutes                          Ruminococcus albus                                  DNA methylase N-4 [Ruminococcus albus].                                                    <-503262738_?<-503262739_?<-503262740_?||503262741_?->503262742_DpnM-N6-MTase->503262744_DpnII->739445280_?->503262746_N6-MTase*-><-739443706_?
      506250654    <-N6-MTase*<-DpnM-N6-MTase                                                                                          N6-MTase    Methyltransf_26                                    LEBU_RS11230                 247   bacteria>fusobacteria                        Leptotrichia buccalis                               DNA methylase N-4 [Leptotrichia buccalis].                                                 <-506250648_?<-506250649_?<-506250650_?<-506250651_?<-506250652_?<-506250653_?<-493858211_?<-506250654_N6-MTase*<-754128986_DpnM-N6-MTase<-506250656_?<-506250657_?<-506250658_?<-506250659_?<-506250660_?<-754128988_?
      547450181    <-N6-MTase*<-DpnII<-DpnII<-DpnM-N6-MTase<-PHP+DNApolIIIa<-?<-?<-SNF                                                 N6-MTase    N6_N4_Mtase+Methyltransf_26                        BN748_01131                  247   bacteria>fusobacteria                        Fusobacterium sp. CAG:649                           sensor protein fixL [Fusobacterium sp. CAG:649].                                           547450176_?->547450177_?->547450178_?->492574489_?->492574493_?->547450179_?-><-547450180_?<-547450181_N6-MTase*<-547450182_DpnII<-547450183_DpnII<-547450184_DpnM-N6-MTase<-547450185_PHP+DNApolIIIa<-492676764_?<-547450186_?<-547450187_SNF
      655060651    <-MutH<-?<-N6-MTase*<-?<-DpnII<-DpnM-N6-MTase                                                                       N6-MTase    Methyltransf_26                                    CD05_RS0101160               247   bacteria>firmicutes                          Ruminococcus sp. NK3A76                             DNA methylase N-4 [Ruminococcus sp. NK3A76].                                               655060645_?->739463238_?->739463240_?->655060647_?->655060648_?-><-655060649_MutH<-655060650_?<-655060651_N6-MTase*<-739463242_?<-655060653_DpnII<-655060654_DpnM-N6-MTase<-655060655_?||655060656_?->655060657_?->655060658_?->
      657692329    DpnM-N6-MTase->?->DpnII->N6-MTase*->                                                                                N6-MTase    N6_N4_Mtase+Methyltransf_11                        J145_RS0109710               247   bacteria>fusobacteria                        Fusobacterium hwasookii                             DNA methylase N-4 [Fusobacterium hwasookii].                                               657692322_?->657692323_?->657692324_?->657692325_?->657692326_DpnM-N6-MTase->657692327_?->657692328_DpnII->657692329_N6-MTase*->696306406_?->657692331_?->657692332_?->492676742_?->657692333_?->657692334_?-><-492676735_?
      657695114    DpnM-N6-MTase->?->DpnII->N6-MTase*->                                                                                N6-MTase    N6_N4_Mtase+Methyltransf_26                        J142_RS0109970               247   bacteria>fusobacteria                        Fusobacterium hwasookii                             DNA methylase N-4 [Fusobacterium hwasookii].                                               657692322_?->657695109_?->657695111_?->657695112_?->657692326_DpnM-N6-MTase->657692327_?->657695113_DpnII->657695114_N6-MTase*->696306406_?->657692331_?->657695115_?->492676742_?->657695116_?->657695117_?-><-657695118_?
      657696170    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_11                        J144_RS0109740               247   bacteria>fusobacteria                        Fusobacterium hwasookii                             DNA methylase N-4 [Fusobacterium hwasookii].                                               657692321_?->657692322_?->657692323_?->657692324_?->657692325_?->657692326_DpnM-N6-MTase->657696169_DpnII->657696170_N6-MTase*->696311518_?->657696172_?->
      697204360    <-N6-MTase*<-?<-?<-HTH+DpnM-N6-MTase                                                                                N6-MTase    Methyltransf_26                                    T504_RS0104020               247   bacteria>firmicutes                          Selenomonas sp. ND2010                              DNA methylase N-4 [Selenomonas sp. ND2010].                                                <-697204304_?<-697204305_?||697204359_?-><-697204306_?<-697204307_?||697204308_?->697204309_?-><-697204360_N6-MTase*<-697204310_?<-697204311_?<-697204361_HTH+DpnM-N6-MTase<-697204312_?<-697204313_?||697204314_?-><-697204315_?
      754560689    ABC-ATPase->?->?->?-><-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                           N6-MTase    N6_N4_Mtase+Methyltransf_26                        NW74_RS05595                 247   bacteria>firmicutes                          Parvimonas micra                                    DNA methylase N-4 [Parvimonas micra].                                                      <-754560678_?<-754560680_?||754560681_ABC-ATPase->754560683_?->754560684_?->754560686_?-><-754560688_?<-754560689_N6-MTase*<-754560690_DpnII<-754560692_DpnM-N6-MTase<-754560693_?<-754560694_?<-754560696_?||754560698_?->754561342_?->
      736835888    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    Methyltransf_26                                    MD85_RS04325                 246   bacteria>firmicutes                          [Clostridium] cellulosi                             DNA methylase N-4 [[Clostridium] cellulosi].                                               <-736835869_?<-736835872_?||736835875_?->736835877_?->736835880_?->736835882_DpnM-N6-MTase->736835885_DpnII->736835888_N6-MTase*->736835890_?->736836198_?->736835893_?-><-736835895_?||736835898_?->736835900_?->736835903_?->
      749954601    -                                                                                                                   N6-MTase    N6_N4_Mtase+N6_N4_Mtase                                                         246   bacteria>nitrospirae                         Candidatus Magnetobacterium casensis                DNA methylase N-4, partial [Candidatus Magnetobacterium casensis].                         
      762897706    <-N6-MTase*<-DpnII                                                                                                  N6-MTase    Methyltransf_26                                    MSU_RS04145                  246   bacteria>tenericutes                         Mycoplasma suis                                     DNA methylase N-4 [Mycoplasma suis].                                                       <-503375509_?<-503375510_?<-503375511_?||762897703_?-><-503375512_?<-503375513_?<-503374807_?<-762897706_N6-MTase*<-762897832_DpnII<-503375516_?<-503375517_?<-762897716_?||503375520_?->503374813_?-><-503375521_?
      323652279    <-N6-MTase*<-DpnII                                                                                                  N6-MTase    Methyltransf_26                                    MSU_0848                     245   bacteria>tenericutes                         Mycoplasma suis str. Illinois                       DNA methylase N-4/N-6 domain-containing protein [Mycoplasma suis str. Illinois].           <-323652272_?<-323652273_?<-323652274_?<-323652275_?<-323652276_?<-323652277_?<-323652278_?<-323652279_N6-MTase*<-323652280_DpnII<-323652281_?<-323652282_?<-323652283_?<-323652284_?||323652285_?->323652286_?->
      503504517    HTH+DpnM-N6-MTase->DpnII->N6-MTase*->                                                                               N6-MTase    N6_N4_Mtase+Methyltransf_26                        SPICO_RS02800                245   bacteria>spirochaetes                        Sphaerochaeta coccoides                             DNA methylase N-4 [Sphaerochaeta coccoides].                                               503504509_?-><-503504510_?||752748369_?->503504512_?->503504513_?->503504515_HTH+DpnM-N6-MTase->503504516_DpnII->503504517_N6-MTase*->752747888_?->503504518_?-><-503504519_?||752748371_?->752748373_?->752747890_?->752748375_?->
      547523036    DpnM-N6-MTase->DpnII->N6-MTase*->                                                                                   N6-MTase    N6_N4_Mtase+Methyltransf_26                        BN678_01434                  245   bacteria>firmicutes                          Dialister sp. CAG:486                               putative RNA methylase, partial [Dialister sp. CAG:486].                                   <-547523009_?<-547523013_?<-547523017_?||547523021_?->547523025_?->547523028_DpnM-N6-MTase->547523032_DpnII->547523036_N6-MTase*->547523040_?->547523043_?-><-547523047_?<-547523051_?||547523055_?->547523059_?->547523063_?->
      737327616    <-DpnII<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*<-?<-?<-?<-?<-PLD+SFII-helicase                                          N6-MTase    Methyltransf_26                                    DP68_RS13195                 245   bacteria>firmicutes                          Clostridium sp. HMP27                               DNA methylase N-4 [Clostridium sp. HMP27].                                                 <-737327604_?<-737327606_?||737327608_?-><-737327610_?<-737327613_DpnII<-737327716_DpnM-N6-MTase<-737327615_N6-MTase<-737327616_N6-MTase*<-737327617_?<-737327619_?<-737327621_?<-737327623_?<-737327625_PLD+SFII-helicase||737327628_?-><-737327630_?
      496076017    SNF->?->PHP+DNApolIIIa->DpnM-N6-MTase->DpnII->Mrr_cat->N6-MTase*->                                                  N6-MTase    Methyltransf_26+N6_N4_Mtase                        HMPREF0946_RS02930           244   bacteria>fusobacteria                        Fusobacterium nucleatum                             DNA methylase N-4 [Fusobacterium nucleatum].                                               496076024_?->496076023_SNF->496076022_?->496076021_PHP+DNApolIIIa->496076020_DpnM-N6-MTase->496076019_DpnII->496076018_Mrr_cat->496076017_N6-MTase*->696308233_?->496076014_?->496076013_?->496076012_?->496076011_?->496076010_?-><-492652534_?
      503781935    N6-MTase*->DpnM-N6-MTase->N6-MTase->DpnII->PHP+DNApolIIIa->                                                         N6-MTase    UPF0020+N6_N4_Mtase                                MELS_RS05115                 244   bacteria>firmicutes                          Megasphaera elsdenii                                DNA methylase N-4 [Megasphaera elsdenii].                                                  503781928_?->503781929_?->503781930_?-><-503781931_?||753929833_?->503781933_?-><-503781934_?||503781935_N6-MTase*->503781936_DpnM-N6-MTase->503781937_N6-MTase->503781938_DpnII->503781939_PHP+DNApolIIIa-><-503781940_?||503781941_?->503781942_?->
      548306764    <-PHP+DNApolIIIa||?-><-DpnII<-N6-MTase<-DpnM-N6-MTase<-N6-MTase*                                                    N6-MTase    UPF0020                                            BN715_00862                  244   bacteria>firmicutes                          Megasphaera elsdenii CAG:570                        putative RNA methylase [Megasphaera elsdenii CAG:570].                                     <-548306758_?||503781940_?-><-548306759_PHP+DNApolIIIa||548306760_?-><-548306761_DpnII<-548306762_N6-MTase<-548306763_DpnM-N6-MTase<-548306764_N6-MTase*||548306765_?-><-548306766_?<-548306767_?<-548306768_?<-548306769_?<-548306770_?<-548306771_?
      657829373    <-N6-MTase*                                                                                                         N6-MTase    N6_N4_Mtase+Methyltransf_26                        P159_RS0116605               244   bacteria>firmicutes                          Selenomonas ruminantium                             DNA methylase N-4 [Selenomonas ruminantium].                                               <-657829360_?<-657829361_?<-739517657_?||657829365_?->739517658_?-><-657829369_?<-739517500_?<-657829373_N6-MTase*<-657829375_?<-657829377_?<-657829379_?<-657829381_?<-657829383_?<-657829385_?<-657829387_?
      493484190    N6-MTase*->DpnM-N6-MTase->N6-MTase->DpnII->METHYLASE->                                                              N6-MTase    Methyltransf_26                                    CLOHIR_RS00470               243   bacteria>firmicutes                          [Clostridium] hiranonis                             DNA methylase N-4 [[Clostridium] hiranonis].                                               <-493484183_?||493484184_?->750105496_?->493484186_?->493484187_?->493484188_?->750105505_?->493484190_N6-MTase*->493484191_DpnM-N6-MTase->493484192_N6-MTase->493484193_DpnII->750105506_METHYLASE->750105497_?->493484196_?-><-493484197_?
      738699070    MACRODOMAIN->?->?->?->?->N6-MTase*->N6-MTase->DpnII->Primase+SNF+PLD->                                              N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            HMPREF9124_RS05925           243   bacteria>firmicutes                          Oribacterium sp. oral taxon 108                     DNA methylase N-4 [Oribacterium sp. oral taxon 108].                                       738698697_?->738698699_?->496987335_MACRODOMAIN->496986404_?->496988548_?->738698704_?->738698705_?->738699070_N6-MTase*->496990603_N6-MTase->496992270_DpnII->496992383_Primase+SNF+PLD->496987124_?->496987391_?->496992206_?-><-496987099_?
      739513201    <-MutH<-?||N6-MTase*->DpnM-N6-MTase->N6-MTase->DpnII->                                                              N6-MTase    UPF0020+N6_N4_Mtase                                K292_RS0108670               240   bacteria>firmicutes                          Anaerovorax odorimutans                             DNA methylase N-4 [Anaerovorax odorimutans].                                               739513208_?->653150344_?->653150345_?-><-653150346_?||653150347_?-><-653150348_MutH<-653150349_?||739513201_N6-MTase*->653150351_DpnM-N6-MTase->653150352_N6-MTase->653150353_DpnII->739513202_?->739513209_?->653150354_?->739513203_?->
      754097257    <-PLD+SFII-helicase<-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase                                                          N6-MTase    Methyltransf_26                                    COPRO5265_RS06650            240   bacteria>firmicutes                          Coprothermobacter proteolyticus                     DNA methylase N-4 [Coprothermobacter proteolyticus].                                       754096854_?-><-501537781_?<-754097253_?<-501537729_?<-501538863_PLD+SFII-helicase<-754097255_?<-501538944_?<-754097257_N6-MTase*<-501538765_DpnII<-501538213_DpnM-N6-MTase<-501538948_?<-754096859_?||501539125_?-><-501538866_?<-501537772_?
      489540792    <-DpnII<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*<-?<-?<-?<-?<-PLD+SFII-helicase                                          N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            CLPA_RS15010                 238   bacteria>firmicutes                          Clostridium pasteurianum                            DNA methylase N-4/N-6 domain-containing protein [Clostridium pasteurianum].                <-489540779_?<-489540781_?||489540782_?-><-736828614_?<-489540787_DpnII<-736828612_DpnM-N6-MTase<-489540790_N6-MTase<-489540792_N6-MTase*<-489540794_?<-489540795_?<-489540797_?<-489540799_?<-489540801_PLD+SFII-helicase<-736828393_?<-489540806_?
      495813929    <-DpnII<-DpnM-N6-MTase<-N6-MTase*                                                                                   N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            HMPREF9454_RS05820           238   bacteria>firmicutes                          Megamonas funiformis                                DNA methylase N-4 [Megamonas funiformis].                                                  <-748623258_?<-495813923_?<-495813924_?<-495813925_?<-495813926_?<-495813927_DpnII<-495813928_DpnM-N6-MTase<-495813929_N6-MTase*<-495813930_?<-495813931_?<-495813932_?<-748623259_?<-495813934_?<-495813935_?<-495813936_?
      496091935    <-wHTH+REase-DpnII<-DpnM-N6-MTase||N6-MTase*->                                                                      N6-MTase    Methyltransf_26                                    QSI_RS08630                  238   bacteria>firmicutes                          Clostridiales                                       MULTISPECIES: DNA methylase N-4 [Clostridiales].                                           <-738856073_?<-496091933_wHTH+REase-DpnII<-497274454_DpnM-N6-MTase||496091935_N6-MTase*->496091936_?->738856076_?->496091938_?->488677428_?->488677427_?->545045779_?-><-497274460_?
      505207017    HNH->?->?->?->?->?->N6-MTase*->N6-MTase->DpnM-N6-MTase->DpnII->                                                     N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            H122_RS0107615               238   bacteria>firmicutes                          Clostridium saccharoperbutylacetonicum              DNA methylase, partial [Clostridium saccharoperbutylacetonicum].                           516421054_?->505207023_HNH->505207022_?->505207021_?->505207020_?->505207019_?->505207018_?->505207017_N6-MTase*->505207016_N6-MTase->505207015_DpnM-N6-MTase->505207014_DpnII->505207013_?->505207012_?->505207011_?->505207010_?->
      547961389    <-DpnII<-N6-MTase<-DpnM-N6-MTase<-N6-MTase*                                                                         N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            BN656_01315                  238   bacteria>bacteroidetes                       Bacteroides pectinophilus CAG:437                   putative DNA (Cytosine-5-)-methyltransferase [Bacteroides pectinophilus CAG:437].          <-547961385_?<-547961386_DpnII<-547961387_N6-MTase<-547961388_DpnM-N6-MTase<-547961389_N6-MTase*||547961390_?->547961391_?->
      648605175    N6-MTase*->DpnM-N6-MTase->DpnII->                                                                                   N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            F553_RS0104430               238   bacteria>firmicutes                          Megamonas rupellensis                               DNA methylase N-4 [Megamonas rupellensis].                                                 <-517828949_?||517828950_?->517828951_?->648605173_?->495813932_?->517828953_?->517828954_?->648605175_N6-MTase*->517828956_DpnM-N6-MTase->517828957_DpnII->517828958_?->495813924_?->517828959_?->517828960_?->517828961_?->
      657680312    <-N6-MTase*||DpnM-N6-MTase->                                                                                        N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            VE20218_RS15590              238   bacteria>firmicutes                          Clostridiales bacterium VE202-18                    DNA methylase N-4 [Clostridiales bacterium VE202-18].                                      <-657680300_?||657680302_?->657680304_?->736546666_?->657680307_?->657680309_?->657680311_?-><-657680312_N6-MTase*||657680314_DpnM-N6-MTase->657680316_?->657680318_?->657680320_?->657680322_?->657680324_?->489634871_?->
      736866276    <-DpnII<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*<-?<-?<-HNH                                                              N6-MTase    N6_N4_Mtase+N6_N4_Mtase                            G594_RS0107390               238   bacteria>firmicutes                          Clostridium paraputrificum                          DNA methylase N-4 [Clostridium paraputrificum].                                            652800875_?-><-652800876_?<-652800877_?<-736866318_?<-652800878_DpnII<-652800879_DpnM-N6-MTase<-652800880_N6-MTase<-736866276_N6-MTase*<-652800882_?<-652800883_?<-652800884_HNH<-652800885_?<-652800886_?<-736866322_?<-652800888_?
      737163966    <-MutH||N6-MTase*->N6-MTase->DpnM-N6-MTase->                                                                        N6-MTase    Methyltransf_26                                    CTM_RS16890                  238   bacteria>firmicutes                          Clostridium tetanomorphum                           DNA methylase N-4 [Clostridium tetanomorphum].                                             737164062_?->737164063_?->737163956_?->737163958_?->737163960_?->737163962_?-><-737163964_MutH||737163966_N6-MTase*->737163967_N6-MTase->737164065_DpnM-N6-MTase->737163969_?->737163971_?->737163974_?->737163975_?->737163977_?->
      737306053    PLD+SFII-helicase->?->?->McrC-NTD->METHYLASE-><-?||N6-MTase*->N6-MTase->DpnM-N6-MTase->DpnII->Mrr_cat->             N6-MTase    UPF0020+N6_N4_Mtase                                CC89_RS03170                 238   bacteria>firmicutes                          Clostridium sp. KNHs214                             DNA methylase N-4 [Clostridium sp. KNHs214].                                               737308255_?->737306041_PLD+SFII-helicase->737306043_?->737306045_?->737306048_McrC-NTD->737308257_METHYLASE-><-737306051_?||737306053_N6-MTase*->737306055_N6-MTase->737306058_DpnM-N6-MTase->737306060_DpnII->737306062_Mrr_cat->737306064_?->737306066_?->737306068_?->
      737398024    <-SNF<-?<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*||DpnII-><-ABC-ATPase<-ABC-ATPase                                       N6-MTase    Methyltransf_26                                    Q428_RS06840                 238   bacteria>firmicutes                          Fervidicella metallireducens                        DNA methylase N-4 [Fervidicella metallireducens].                                          <-737398017_?<-737398043_?||737398019_?-><-737398045_SNF<-737398047_?<-737398021_DpnM-N6-MTase<-737398022_N6-MTase<-737398024_N6-MTase*||737398026_DpnII-><-737398049_ABC-ATPase<-737398027_ABC-ATPase<-737398029_?<-737398031_?||737398033_?-><-737398035_?
      746722773    N6-MTase*->DpnII->N6-MTase->?->?->?-><-?||PLD+SFII-helicase->                                                       N6-MTase    Methyltransf_26                                    QX51_RS12035                 234   bacteria>firmicutes                          Terrisporobacter othiniensis                        DNA methylase N-4 [Terrisporobacter othiniensis].                                          746722766_?->746722767_?->746722768_?->746722769_?->746722770_?->746722771_?->746722772_?->746722773_N6-MTase*->746722774_DpnII->746722865_N6-MTase->746722775_?->746722776_?->746722777_?-><-746722778_?||746722866_PLD+SFII-helicase->
      # 1;                                                                                                                                                                                                                                       
      406873648    DpnM-N6-MTase->wHTH+REase-DpnII->N6-MTase*->                                                                        N6-MTase    Methyltransf_11+Peptidase_S24                      ACD_81C00186G0010            395   bacteria                                     uncultured bacterium                                Sensor protein fixL [uncultured bacterium].                                                406873641_?->406873642_?-><-406873643_?||406873644_?->406873645_?->406873646_DpnM-N6-MTase->406873647_wHTH+REase-DpnII->406873648_N6-MTase*->406873649_?->406873650_?->406873651_?-><-406873652_?<-406873653_?||406873654_?->
      
      Back to Contents
    • Multiple sequence alignment of the Group I/Clade 5/Ot12g00270-like N6-MTases

      General notes

      <---------------------------PPR -repeats --------------------------------------------------------------------------------------------------------------------------------------------------> Extra-strand Str-1 Str-2 Str-3 Str-4 Str-5 Str-6 * Str-7 * ALIGN --------------------------------------------------------------------------------------------HHH--------------------------------------------------------------------------------------------------------------------------------------------------------------------------HHHH-HHHHHHHHHHHHHHHHHHH----HHHHH-----------------------------------------------------------------------------------------------------------------------HHHHHHH---------------HHH---------------------------------HHHHHHH-H--H----------------------------EEEEE------HHHHHHHH----------HHHHHHHHH-------------HHH--HHHHHHHHH-----------HHHH-----H---------HH---------------------------------------------H--HHHHHHHHHHHHH--------------H-----HHHHHHHHHHHH------------------------------------------------HHHHH--------------HHHHHHHHHHHHHHHHH-----H-----------HEEHH-------------E---EE--------EEEEE-----------HHHHHHHHHHH--------------------------------------------------------------------H---HHHH----HHHHHHHHHH---------------------------------HHHHH-------------------HHHHHHHHHHHHHHHHH------EEEEEE-------------HHHHHHHHHHHH-HHHHHHHHH---------------------------------HHHHHHHHH---------------------------------- HMM --HHHHHHHHHHHH-------HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---EEEEEEEEEE--H--HHHHHHHHHHHHH-EEEEE---EEEEHHHHHHHHH--HHHHHHHHHHHHHHHH---EEEEEE-------H--HHEHHHHHHHHHHH-------EEEEEEE-----HHHHHHHHHHHH---EE---EEE---HHHHHHH----HHHHEEEEE----HHHHHHHHH-HEE-------EEEEEE---HHHHHHHHHHHHHHHH-HHH--HHEHEEH--HHHHHHHH---E--------------------------EEEEE-------------------------------------------------------------------HH----HHHHHHH---------------HHH-HHH---EEE---------EE-------H-HHHHHHHHHH-H--H-H--------------------------EEEEEE----HHHHHHHHHH-HHHEEHHHHHHHHHHHH------------HHHH--HHHHHHHHH-----------HHHH-----H---------HHH------------------H-----------------------HHH--HHHHHHHHHHHHH--------------H-----HHHHHHHHHHHH-------------H-H-----EE--EEE-----------H------HHHHHHH--------------HHHHHHHHHHHHHHHHH-----H-----------EEEEE--EEE-------------E--------EEEEEE-----H---HHHHHHHHHHHH--------HH---EEEEE---------------------------------EEEEE------------HH-HHHHH----HHHHHHHHHHHHH------------------------------HHHHH-------------------HHHHHHHHHHHHHHHHHH----EEEEEEE--EEEEEEEEE---HHHHHHHHHHH--EEEEEEE-----------------------------------HEEEEEEE---------------------------------- FREQ -----HHHHHHHHHH---HHHHHHHHHHHHHHHH-------HHHHHHHHH---HHHHHHHHHHHHHH------EEEEEHHHHHH----HHHHHHHHHHHHHH------EEEEEEHHHH------HHHHHHHHHHHHH-------EEEEHHHHHH-----HHHHHHHHHHHHH------EEEEEEHHHH----HHHHHHHHHHHH-------EEEEEHHHHH-----HHHHHHHHHHHHHH------EEEEEHHHHHH-----HHHHHHH-HHHHHH------EEEEHHHHHH-----HHHHHHHHHHHHH--------EE--------------EEHHHHHH------H-------------------------------------------------------------HHH----HHHHHHH---------------HH------EE-----------EEEE----------HHHHHHHH-H--H-----------------------------EE-------EEEEHHHH-----------HHHHHHHHH--------------HH--HHHHHHHHH-----------H---------------------------------------H-----------------------HHH--HHHHHHHHHHHH------------------------EEEEEEEEE-------------E-EEE--------------------------------HHH--------------HHHHHHHHHH--HHHHH-----------------EEEEE---------------------------EEEEE----------HEHHHHHHHHHH--------HHHH-----------HHH-----------------------------E-E--------EEE------H----HHHHHHHHHHHHHH----------HHHH----------------HHHE-------------------E-----HHHHHHHHHEE-----EEEEEEE--EEEEEEE----HHHHHHHHHHH---EEEEEEEEE---------------------------------EHEEEHHHHH-------------------------------- PSSM -------HHHEHHH-----------HHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHH-----EEEEEHHHHHHH-----HHHHHHHHHHHH-------EEEHHHHHHHHH----HHHHHHHHHHHHH------EEE-HHHHHHH-----HHHHHHHHHHHHH-------EEE-HHH-H--HHHHHHHHHHHHHH-------HE-HHHHHHHHHH--HHHHHHHHHHHHH-------EEEEEHHHHHH-H-HHHHHHHHH-HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH----------------------------------HH-HHH-------------------------------------------------------------HHH----HHH---H--------------------HHHH--------------------------HHHHHHHH-H--H-----------------------------EE-------HHHHHHHHH---EEEEE--HHHHHHHHH-------------HHH--HHHHHHHHH-----------HHHH------------------------------------H------------------------HH--HHHHHHHHHHHHH--------------H-----HHHHHHHHHHHH-------------H-HH--------------------------------HHHH--------------HHHHHHHHHHHHHHHH-------------------EEEE-----------------------------EEEE----------HHHHHHHHHHH--------HHHH----HHH---HHHHHH-----------------------H----------------------------HHHHHHHHHHHHHH-----------------------------HHHHH-------------------HHHHHHHHHHHHHHHHH-----EEEEEEE------EEE---HHHHHHHHHHHH---EEEEEEEE---------------------------------EEEEEEEE----------------------------------- FINAL -----HHHHHHHHH-----HHHHHHHHHHHHHHH-------HHHHHHHHHH--HHHHHHHHHHHHHH-----EEEEEEHHHHHH----HHHHHHHHHHHHHH------EEEEHHHHHHH-----HHHHHHHHHHHHH------EEEEEHHHHHH-----HHHHHHHHHHHHH-------EEEEHHHH---HHHHHHHHHHHHHH-------EEEEHHHHHHHH---HHHHHHHHHHHHHH------HHHHHHHHHHH----HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------HHHHHHHH------H-------------------------------------------------------------HHH----HHHHHHH---------------HHH------EE----------EEE-----------HHHHHHHH-H--H----------------------------EEEE------HHHHHHHHH----EEE---HHHHHHHHH-------------HHH--HHHHHHHHH-----------HHH-------------------------------------H-----------------------HHH--HHHHHHHHHHHHH--------------H-----HHHHHHHHHHHH-------------H-----------------------------------HHHH--------------HHHHHHHHHHHHHHHHH-----------------EEEEE---EE----------------------EEEEEE---------HHHHHHHHHHHH--------HHHH-----------HHHH----------------------------E----------HHHH-HHHHH----HHHHHHHHHHHHHH----------H------------------HHHHH-------------------HHHHHHHHHHHHHHHHH-----EEEEEEE--EEEEEEEE---HHHHHHHHHHH---EEEEEEEEE---------------------------------EEEEEEEE---------------------------------- Smin1000031348_Symbiodinium_minutum_Mf_105b01_Smin1000031348 MAISSDLAAIKVLCKNQRPTSESINHDLSPFLDELKSSPRFGTSLLNKLARRKKLRQLSLVLDALLLNRCDVNVFHYGVSISAFEKAEKWDRALLLLRQMDVVRVEPNTVTYSACISAMEKCGLWRQAVDLLAEMEYRRVEKNVITISAGISACSKAGHWWLALDVLDKMCKDQIDPDTVAFNACISSCSSQWQVALHLFQQMSGFKLQPDVISYSGAIAACEEGLQWETTLQLFTDLQSKEVHLDDFSYNALISSFSRGSAWQLCLHV-LQLKQLASCASTASIASETNHSNEFEFATNDTNDTNDTTDICKLLVPSVP--------------SVNFVKLVEGL-DIF-------------------------------------------------------------AVS----ATVSASL---------------ELQ-WAVGYPFLP----AKEEQLTHGFYKYIAGMQALCARELL-E--L-V-----P-----------HA---Q---NIMDMFCGSGTVLVEALRTGKRAIGCDVSPLALLVATHHS---DAA----RIDL--YELFEVARE-----------LVAS-----M---------EAR------------------N-----------------------EGW--HYLKSRISNLRSK--------------N-----LRDALHFILLVS-------------L-SRVQDVTY--LHSSSKVIKSS-VPD------HGLPPCM--------------FLGVAQLYVARVRSLRA-----RA------LESECEIYRCDARVL------RL---EP--------VDAIVTGPPYPG---VYDYHSPANMCA--------DLLGENILYDF---CAPGYS-----------------------IRGSKAP--------TNVE-MAHEK----SSTYAAGREIGQNR----------LWLED--------------SDFAE-------------------IWQSEQEAWLTSAFENLRE-GGTATLMIGDGDLHSAGDGGFDNLEPTIIAAEKVGFATIATATIRGKSK---HPK--QP-------KGMK-------RTEHVVHLKKP------------------------------KL--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Ot12g00270_Ostreococcus_tauri_308809742 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MALARARGLDARALARTRTRTRNVARVIGGGPRGGSNDADGRAWRDDVDD------ETLAYVRKASTASSFEERAFGDD-------------------------------------------------------------VIG----EKLLRAM---------------RVQ-WTHGFASTP----SGFPRCTHGFGEILAGMQPAACDLILRD--V-L-----------------SG---ST--SVLDPFAGGGTTLVCAATLGMRAVGTDVSPLACFVSAHRCW--RPD----ETTI--DEMVHRASAARD--------LAKS-----A---------EEQSVESDAE--------EAGD-------------------GGVPRAW--RRIRDALR---ASNGGVSGG-------R-----VDESLWFCMSVA------LQRT--QK-SQSKARPR---KGKGKRSGGS-R-----------------GGED-NADI---FYKVVEEYADRLRQFRA-----MCST-CDSQAPDAEIHNIDVRRF------KL------ADEA--KVDAVLTSCPYPA---VYDYLSFARKVR--------AGSGAVVSTSSP--RSQGPTSS---------------------YVNTVVP--------GDRN-WPKE--------WMEG-EIGSRR----------ALRSD-------------PYAFRD-------------------AWQREQEEWLEVVAHSLKP-RGRACIMVGDGANIDTRAS-------MVSAAEKFGLKTLAGATMKLTTT---TES-GNV-------YNQA-------RTEHLILFEKH----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- OSTLU_26692_Ostreococcus_lucimarinus_CCE9901_145352834 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------M---------------RVQ-WTHGYASAP----SGFARCTHGFGEILAGMQPAACDLIMRE--C-F-----------------EA---TETRSVLDPFAGGGTTLVCAQTLGWRAVGADVSPLACFVSANRNW--RAS----EEEV--EVMLESARTVTD--------RAGT-----R---------EEKD---GAN--------DEDA-------------------SDVPRSW--RAVRDALREYLASDAFVGAP-------R-----VAEGLWFCMSVA------LQRT--QK-SQNKRRHKG--KSKGKRGGGG-D-----------------DASRVNAET---FLKTVEEYANRLEQFRE-----ECAKHAGAPAPPASIHNVDVRKF------KL------SAED--KVDAVLTSCPYPA---VYDYLSFARKVR--------AGSGAVVPSSASFSARSGPTSS---------------------YVNTVVP--------GDRN-WPAG--------WTEG-EIGSRR----------ALRSD-------------PYAFKD-------------------VWQREQEEWLAVVADSLTP-RGRAAIMVGDGANIDTRAS-------IVAAAAKFALREVAGCTMKLTTT---TEE-GRV-------YNQA-------RTEHLILFDRA----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Esi_0134_0029_Ectocarpus_siliculosus_298705971 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MYPTPCPSRKVQRCFVP-SILAPVCTFGICVSHQPRSRKRKEYRYDTMLTSGS----------------------------------VLAESQ-GAP-------------------------------------------------------------EIR----EALEEGL---------------RLQ-WSFGFPYTP----KELPVLTNAFFPYPAGMQPATAHHLL-E--T-V-----L-----------TG---R---SVLDPFVGGGTVIVEALRAGRLGVGSDVSPLSLLVSRGRTW--IAS----DSQL--EELREAVRS-----------VCEA-----A---------AAKLGETHTR--------QVSG-----------------------KDW--DCIESEIANYLARSKDGVKD-------E-----LRDALWFVLAVARPKAATKRRR--GV-EHVVAWEV--FHSVCREYTIA-VKNLASAATAGAAPCLPQTPVILRRDARHPLPLVVDVGVGGKENAGG-----GG------LENLAACFKPKAPVG------GD---EHNSNSL--VVDAVLTSPPYPG---VYDYLAHARQVR--------SVMGRLGSTEQ---SRSMHRTSI--------------------FSNSRVP--------TGRN-WADE--------WVIG-EIGSKR----------KIRRDRKREAYSGGSGDVSTRRED-------------------NWEKDQKDWILATTNALRV-GGRIGIMLGDGDEVDTRSSLLRTVEVLRTQEGGAVLDVLGWATFRSAAG---ARR--R-----------M-------RTEHIILLEK--------------------------------L--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Bathy10g01900_Bathycoccus_prasinos_612390323 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRVVA--SSLFAFRQQELRRKQHHHRHHHQHHAYERKRRRNRCAIASNIATKTTESP------PLEYYEHTKIPSTEFTETCHGPE-------------------------------------------------------------EYT----SVLLHAM---------------RMQ-WKHGFTHVP----NGFRRVTHGFGEILAGMQPAAAERCV-E--L-LLRRGEAEDDDHDTS-KTEN---KE--SVLDPFCGGGTVPVVAMTNGLIAYGTDVSPLALHVCGHRVW--VPENV--DEEE--TWLRERMKKI----------EAIV-----A---------DEKDADENVN--------PLGSFA-SIA-------------VAVEKAI--SVESEDDESNSSSSGNESNS-NSIDSTT-----RKTALFFCLAEA------ERRHFVRQ-KRNKRKKR--FGKRRATSNGS-K-----SDRD--------DSYYSPLSF---YQRCVDEYIFRVKDLHD-----AVG-----GAADVVIRKGDARFD------DL---VPKDKIG--TVSGILTSPPYPG---VYDYVSFARKQRSRFRVGSTAGNGSAITTTIEETESEKPPTS----------------Y----YVDAKVPDSNELDENGNRAPWSDD--------FSSANEFGAKR----------ELRKD-------------PSAFND-------------------KWTQSQTQWLKNAYKTLKP-SGRMLVMIGDGANVDALKS-------TVVCAEQLGFTLLASCSLALLRT---MESTGTV-------YNGA-------RKEHLILFEKK------------------------------LLS--KSE--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MICPUN_104425_Micromonas_sp_RCC299_255089364 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MAAMSASGAAAAARCGASILTPRRARRLVQLVRAATAAGGDVEVGTAAER--------ETYLEIASRPSRFQESQWGPE-------------------------------------------------------------DVC----VVLLEAM---------------RVQ-WNHGYPSTP----RGYPRTTHGFHEYPAGMQAAAADRIL-D--V-L-----------------PG---N---SLLDPFAGGGTSLVVGMSKGRETFGVDVSPLAAFVATHRTW--RPAPGAEEETF--EWMRATADAALDGLDAAA--VASR-----E---------EESSPTTGLE--------PETSSG-STSEEPPTTKTGAKRGGGVPRAW--LAIRRSLRTALESPTGIEAPPPGLGGTT-----PEGALWFCLSVA------LQRS--QK-GRGKKRHKRGYKNQRRKTRAE-T-------ENAFKSQLATEAELDAAAS---FRKCVDEYCDRVNELCA-----CVPV----GTPPPVIVNGDVRTL------PEASGGGQLPAG--SIAAALTSPPYPG---VYDYLSFARKVR--------AGSGAALMADGDGSRSPGTDDGSRSPGNGSGSDADSDRHGSLRYFATAVP--------GDRT-WPDE--------WNVG-EFGSRR----------ELRRD-------------PRAFKE-------------------AWQEDQTRWIDVVANALAP-GGRVAIMIGDGANVDTRVS-------TIDAGVKCGLTHVASVSMALTHE---TAE-GLV-------WNAA-------RREHLVLLEKP------------------------------RDR-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MICPUCDRAFT_63711_Micromonas_pusilla_CCMP1545_303287789 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MTTLPAPRATWAYRWASRAFPTSSSTTTTSRASPSSVALEASRGGDEGARRRARAAHNHQHRRHRRGVAAAASTTSGSDAPAAAARVAANDDEPESYLDVASRPSRFEESAWGPK-------------------------------------------------------------DLS----SILLAAM---------------RIQ-WTHGYPNTP----RGYPRLTHGFHEYPAGMQAAAASKAL-E--A-L-----------------PG---R---ALLDPFAGGGTTLVVGMADGRATKGVDVSPLACAVAAHRAW--RPSGG--DATL--ALMRDAARAVVERMDVTAGEMRRE-----E---------REDAKAAAAR--------GEDDDG-EVDDD---DGGGKRKPGGVPRDW--RVAQRALADVVGNAASIGGGGSGGDGDG-----VEGALWHCFAVA------LQRS--SK-SKNKRRP---YKRQRKKAAAA-AADSETADSDAAAAETATAAETATADA---FARVVDEYCDRVGDVLA-----AVPP----ETPRAIIVNSDVR-------------GVDASAF--QTDAVLTSPPYPG---VYDYLSFARKVK--------SGSGSA----AGGAAAAGAET----------------------YFRDAAP--------GDRN-WPAA--------WTTG-ELGSRR----------ALRSD-------------PRAFKE-------------------AWQREQEQWLAFVSASLRP-GGRAAIMIGDGANVDTRLS-------IEDAGAKVGLARVAGCTMALTRT---MSD-GGV-------WCQA-------RREHLILLEKT------------------------------PTPTTPTE--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- EMIHUDRAFT_258012_Emiliania_huxleyi_CCMP1516_551547865 ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRRD-------------PQSFGA-------------------AWQRDQELWLGAARRSMRP-GARAAVVIGDGGGIDTLDS-------TRRAAEAVGMRVVACASIRSDLP---VEE--RL-------QGNR-------RTEHALLLEAP----------------------------------FTLSPAFAPS--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- DB32_008855_Sandaracinus_amylolyticus_816962200 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MNVG-GPTRSQ-GD-PKIAATL-----VE--------AME-AAAH---------EDPDALTHGFHAWPARMHRAIARTVI-A--R-A-------------T-R-EG---E---RVLDPFCGSGTVLLEAMLAGRRSAGVDLNPLAGALVGVKCE--RRD----LESR--DAFEQLAEL-----------VGEA---------------SLERVQARER--------AIAKLP---PSM---------------IGWYGPHVLKELAGLLEEIRSVED--------E---A-DRRALEMVFSSL------VVKL--SK-KRAETS----QQEAEKRIRKG-L----------------------STEM---FVRKAHELAQRWGTLDE-----ALPE----GAIAPRFFEGDSREL---E--RL-------LGARTKARLILTSPPYGG---TYDYHAQHALRL----------------------------------------------A------------------------WLGL--DA-RA-FARR-EMGARR----------SAAVD-------------PDEAAE-------------------QWWDDVAAMLRGMERVLDR-DGFIVLLMGDGRFGQLDVP-LIPQ--LIEIAPHVGLELVASASQPRPAW---GG-------------GAT-------REEHLVALKRA-E---RRR--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- HGMM_F46H12C04_uncultured_delta_proteobacterium_374854167 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MLRPGQ----------------------GKRALIHVG-GPVETA-GD-PVLARTM-----AE--------ALC-VRPG---------ERDATLTHPFHAYPARLHPEVARRLV-AMIP-M-------------S-R-PA-------TVLDPFCGSGTVLVEALVAGHRAIGTDLSPLAIELAGLKTR--LTT----EAER--RAMVQAARA-----------VARE---------------AGA--MVRAG--------VRLPVP---PGE---------------ARWFAPHTLREVAALASIVLAMGP--------G---F-ERDALRLLLSSI------LVKV--SF-QASDSD----PRWVTKRLAPG-A----------------------ALRF---FARKAFELDCCLAALAK-----AVPS----GAPPVDVHVDDATVL---R--TV-------ADA--TVDVVMTSPPYAA---TYDYVTHHARRY----------------------------------------------A------------------------WLGL--DP-GA-LRTK-EMGAAR----------WFDV---------------PEVGA------------------RRFARELASMCAAFARVLRP-GGLALVVMADGAAGDRPLL-ADEM--LARAIQGTGLDLVARVSQARPVF---DRV-SRR----AF-ARGS-------KREHLMAFVRR----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- HW49_RS06065_Porphyromonadaceae_bacterium_COT-184_OH4590_738943261 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MT--------DYS-IDFNFG------GADTMYLTHSLHPYPAKFPPQLPNVIL-S--K-Y-------------G-I-KG---Q---TVLDPFCGSGTTLVESRLLGFNAIGVDVNGLSTLLSKVKAT--PLS----NEEI--ATIKDFISL-----------VENE---------------NFQWSMNR----------PQIEVK-QIEGL---------------EHWFQHNVAEELTHLLNLISKINS--------E---N-VRDFLKIVVSSI------IVRV--SN-QESDTR----FVAKNKEIQDN-F----------------------TFRQ---FLVRAKEYLERISVFSQ-----KAHK-----ECFLKLINADSRNL---N--ML-------EDG--SIDMIITSPPYAN---TYDYYLYHKFRK----------------------------------------------R------------------------WLDI--DV-KF-AQNN-EIGSRR----------EYSSL-------------KKTAEQ--------------------WTTDLKLCFHEMFRLIKK-DGLAFIVIGDSVIKKQLIK-INEV--ICDFIPEIGFEVCNVISSSLSEH---SRM-FNP----TF-TQKD-------KKEHLI-----------------------ILKKY------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PPSIR1_RS28425_Plesiocystis_pacifica_494032514 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSGDEDTRADTEAPSEQERSAKNEPGHEPAGKKPEGKKPKKGKAPRRGARREALSNLG-GPVDSR-GD-RAFAFAL-----AS--------AME-A----------QAEEADVLTHGFHTYPARMHPALARVVL-R--EFD-------------L-G-PG---S---EVLDPFCGGGTVAVEAMVAGWRCLGSDLDPLALRLSRVKVE--RRR----EPQR--ARFTEVLEA-----------VAAA---------------SERRVRGRVE--------VRAALS---PEE---------------RSWYDVHVLKELAGLLEEIRAVRD--------K---R-DRQALEMVFSAI------VVKF--SR-QRSETA----DRDAPRRLRKG-L----------------------VTEF---FLRKGRELVERWAALSE-----ALPE----RSHEPRFVQSDARRL---P--TT-------LSGEYRCDLVLTSPPYGG---TYDYARHHARRH----------------------------------------------A------------------------WLGI--NA-KH-LREG-EIGARR----------DLSQP-------------HSKSGSARRGERRSGRVEAGKDAQARWDQQVLDTLRTLHELLSP-DAVAVLLMGDAQLGRVAVP-ADRQ--LEALAPVAELEFLAAASQPRPEWWTGGH-------------GQE-------RREHLVALRRP-A--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCE_RS37400_Sorangium_cellulosum_501196891 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSTPRSADPKPKPGPQQQKARASRPAPQERRALTHIG-GATVLE-GD-PESAAKL-----AH--------ALD-VASS-----STAEEAARAHVHGFHSYPARMHPDTARRLI-E--G-L-------------S-R-PG---E---RVLDPFCGSGTVLVEARLAGRAAIGVDANPLAVRLARLKVQ--GST----PGER--ERLVAAARE-----------VAAA---------------ADERRKARAG--------PSRRYG---PED---------------VALFEPHVLLELDGLRVGLDRIAG--------HGADA-LRADLELVLSAI------LTKL--SR-RTSDTS----EHELPRRIAAG-Y----------------------PARL---FIRKAEELAQRLAEVAA-----PLEA-----SPPALVEEGDARVL---R--GV-------EAG--SVHLAITSPPYPG---VYDYLAHHEARL----------------------------------------------R------------------------WLRL--RS-AR-FEQD-EIGARR----------HLDPL-------------GPEAGR------------------ARWRDELGGVLAALARVLRS-GAPLVLLIADSVVAGAPVY-AVDV--VKAAASGARFAVRAVASQPRPHF---HRP-TAR----AF-QRRP-------RHEHAILLIRA----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MURRU_RS15895_Muricauda_ruestringensis_503800515 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MNCSIFGKQTNQYNMD-VA--IPFESL---------------SVS-EDWTFKN-----VKSTEQWTHGYHRYPAKFLPNIVKKII-E--D-Y-------------T-K-EG---D---VVADLFAGCGTTLVESKIHGRKSVGVDINPVAQLIARVKTQ--PID----PKELD-RVFKDLVNS-----------LEFY---------------DEK---------------NYHNIE-KHDRI---------------DYWFFPENKYRIAYLYDLISGLPE--------S-Q-K-IKDFFLVALSHI------LKNC--SR-WLQTST----KPQIDPDKVPV-S----------------------VFFA---FKKQVKTMIRKNSDFFK--ELKKLQY----SNVESQIFLQDARKT------EI-------EDS--SISTVITSPPYVT---SYEYADLHQLTG----------------------------------------------Y------------------------WFDYVSNL-LE-FRKN-FIGTFY---SYGTELKTESKT-------------AQDLID-QLKNIHLRTAKEV----ANYFNDMKMVADEMYRILKN-DGYAFIVIGNTTFKNVKIL-SAEI--FSELLELSGFEVHDVIKRSIPHK-L-IPT-IRDKTTGKF-TTVANKNSKKVYPEEYIIIARK----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- L956_RS0103135_Elizabethkingia_anophelis_639235853 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MNSK-VQ-NISFDNI---------------PVS-PEWTFKN-----VRSVENWTHGYHRYPAKFLPNIVKKII-E--E-H-------------T-E-VG---D---TVADVFAGCGTTLVEAKIHGRKSVGVDINPVAQLITSVKTN--PID----PAKLN-ETYSLILEK-----------FDDF---------------ILE---------------DYINVA-THERI---------------DYWFFPENKGKIAFLYGVILNLQE--------N-Q-R-IKDFFLVALSNI------LKNC--SK-WLQSST----KPQIDPDKIPT-D----------------------PFIA---FKRQIKSMLRKNAEFYK--ELANLNF----LDTECNIYLEDARAT------GI-------DSE--SVNAVITSPPYVT---SYEYADLHQLTG----------------------------------------------Y------------------------WFEYVENL-LE-FRKQ-FIGTFY---SYGEVVETVSDL-------------ANQTVK-KISEKHLRTAKEM----SNYFSDMAAVSNEMFRILKV-GGKAFIVIGNTTYKNVKIN-SAEI--FVEMLRSSGFEIDEVIKRSIPHK-L-IPT-IRDNVSGKF-TTLSNKNSKEVYPEEYIIIAKK----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- bsrIM2_Geobacillus_stearothermophilus_313667099 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MV--------DIT-AIQNLKIFE-EDYSNTTDLNHGIHSYPAKFPPQIPGKLL-D--K-F-------------A-K-DN---Y---VVLDPFCGSGTTLVEASLRNLDSVGNDINPIALLISTVKTT--KYF----DTDF--EELEKIIYD-----------LKDD---------------YMN---NK----------NNLKSTIDFPNK---------------DHWFQKNVQKEIELILKHINLCSN--------E---K-YRNLLKVVLSEI------IVTV--SN-QESDTR----YAAIDKNIPDG-K----------------------TIEL---FEKRYFAIKDKILSFSQ-----MVKD----FEYETKIISNDARNL---I--DI-------RSE--SIDIIITSPPYAN---TYDYYLYHKHRM----------------------------------------------N------------------------WLGY--NF-KE-TQNI-EIGSRN----------EYSSK-------------KQKPEK--------------------WKHDLMLVLQEMYRVMKK-DRLCFIIIGDSVINKEHIK-INDV--IREIATKIGFEYLNEESVPLSKN---SRK-FNK----KFGTDQT-------KLEHLISLYKP--------------KLTIIMKAH------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CAP_RS22030_Chondromyces_apiculatus_763387130 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSTPQKSERR------------------DRRALTHVG-GPARLN-GH-PESAAKL-----AA--------ALD-VEGALTA-----EEAARAHVHGFHSYPARMHPVTARRLV-E--S-L-------------S-A-PG---T---RVLDPFCGSGTVLVEARLAGRTALGVDANPLAVRLAWLKVR--GVG----EAER--DHLVAAARE-----------VAAT---------------ADARRRARAG--------ASKRYG---PED---------------TALFDPHVLLELDGLRMGLDKIED--------Q---P-TRATLELAFSAI------LTKV--SR-RAADTA----AHELPRRIAAG-Y----------------------PARL---LIRKTEELVARLAEVAP-----ALSS-----APPATVDAGDARIL---R--GI-------APG--SVDLIVTSPPYPG---VYDYVAHHEARL----------------------------------------------R------------------------WLRL--PR-AP-FEEA-EIGARR----------RLDQL-------------GPAQGL------------------ERWHAELGAVLSAMARVLHP-AGSAVLLIADSVLAGEPVY-AIDA--LRAAAPAAGLALCAAASQDRPHF---HAP-TAR----AF-AQRP-------REEHAIMLVHRQDSPPRDRRKLHPPRGPVAGSRRPGKP-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MB26_RS00775_Sphingobacteriaceae_bacterium_DW12_723440940 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSIR-IQKNTSFDNI---------------PVS-SEWTFKN-----VRSVENWTHGYHRYPAKFLPNIVKKII-E--E-H-------------T-E-VG---D---TVADVFAGCGTTLVEAKIHGRKSVGVDINPVAQLITSVKTN--PID----PAKLN-EAYSMILEK-----------FDDF---------------ILE---------------DYIHVA-THERI---------------DYWFFPENKGKIAFLYGVILNLQE--------D-Q-R-IKDFFLVALSNI------LKNC--SK-WLQSST----KPQIDPDKIPT-D----------------------PFIA---FKRQIKSMLRKNAEFYK--ELANLNF----LDTECNIYLEDARAT------GI-------DSE--SVNAVITSPPYVT---SYEYADLHQLTG----------------------------------------------Y------------------------WFEYVENL-LA-FRKQ-FIGTFY---SYGEVVETVSDL-------------ANQTVK-KISKKHLRTAKEM----SNYFSDMAAVSNEMFRILKV-GGKAFIVIGNTTYKNVKIN-SAEI--FVEMLISSGFEIDEVIKRSIPHK-L-IPT-IRDNVSGKF-TTLSNKNSKEVYPEEYIIIAKK----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- AHM02030.1_uncultured_miscellaneous_Crenarchaeota_group_594540697 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSEWSAKLIENVQFIYELALAEL--------------------ELEAFGVDFTVRNSLREFV------------------------LKNPPNIDR-------------------LVRRLAYFK----TVG-DN-LTDYYRL-------------------TRYNRT------RSVNQYLTHWIYPYKGKFHPQMIRALL-N--I-L-KL------------K-EK---E---TVLDPFIGSGTTAVEAQLLGINCIGRDISPLCVLQSKVKTE--SIQ-A--IDKIR-EYRVEATTS-----------FKRS---------------------------------NTPLGD-EEPNN---------------------------KTYSEFLNSIND--------E---Q-IKNFYLMA------K---LVAV--SD-----------SARRRREIVKS-F-DK-------------------NLAL---MIASVEDYRKAKDELDL-------------QLGSVDISLGDARQL------RL-------EDS--SVQGIVTSPPYSI---ALDYVANDAHAL----------------------------------------------E------------------------AMGC--NP-GK-MREK-FVGVRG---------------------KG-----ETRIGL--------------------YNEDMQRSFQEMQRVLEE-GRYCTIVIGNATYQQRKVE-TIEF--TIEECEKLGFTLVKNINKIIYGL-----------------YNIM-------QTDNTLIFRKD-N--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- bmrIM1_Bacillus_megaterium_123187375 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSQ----------------------RIT-DK-KDIREKL-----YE------------IDWDFV------KSNTTRGIHSIHPYPAKFIPEIPRTLI-E--T-L-------------P-L-PE-G-T---SILDPFCGSGTTLVEAQNRGISTVGVDLNPIACLISKVKTN--PIP----------INFIESAEL-----------CVAN---------------AQS---------N-----NSAINN-KIPNL---------------DHWFKTDIQEAVSSLVEEVNQVED--------E---D-IRNGLRLALSSI------IVKV--SN-QESDTR----YAAIEKKVNKD-D----------------------VFTY---FLGACQKLSEYLDE--------NLFN----SNVTANIINKSILEV-S-P--ED-------IKK--PIGMVITSPPYPN---AYEYWLYHKYRM----------------------------------------------W------------------------WLGY--DP-NK-VKVN-EIGARA----------HYFKK-------------NHQTIE-------------------DFISQMDSVMTLLSEVVVS-DGYICFVVGRSIIHGKEYD-NSKI--IEELALKHNLKVIAIIDRNIAKH---RKS-FNL-------SHAK------IKKETVLVLQKR----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- A3IY_RS0101600_crenarchaeote_SCGC_AAA261-L14_516669726 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGINESVD---TR-EDWSFA------GENTKYLTHGFHPYPARMMPLISKKVI-E--T-Y-------------AVK-ED---D---IILDPFCGSGTVLVESLIHNKDAIGFDINPLAVLIAKAKTT--PIE----PKKLH-EKINEVIRD-----------ILLD---------------KS----------------NHPEPE-SIPNL---------------HYWFKPKVVSQLSKILYHIKNIDD--------E---E-MYNFFATAFSYT------VWKV--SNARKSEYK----LYRMDVNELDK-W----------------------NPDV---LSIFKAILFDNLKGMEE--FYKTMQE----KKAKATIYLRD-FRM---S--DV-------EDE---VTIVLTSPPYGDSRTTVAYEQFSKYSA----------------------------------------------L------------------------WLGF--KEILN-MEAK-AIGGVRKVGKVEKL--GSKTL-------------EEVFKK--VYEKDPERGWDL----YTYFYDMDVSIQKIVKALKKGRSYVIFVIGSRTIRRIKIP-TDRI--LIEMAEKHNLQYEKIIYRKIPSK---RLP-WKN----AP-ENVPGIKGET-IGSEFIIIWKY----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- A3MW_RS0105500_crenarchaeote_SCGC_AAA261-N23_516682681 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGINESVD---TR-EDWSFA------GENTKYLTHGFHPYPARMMPLISKKVI-E--T-Y-------------AVK-ED---D---IILDPFCGSGTVLVESLIHNKDAIGFDINPLAVLIAKAKTT--PIE----PKKLH-EKINEVIRN-----------ILLD---------------KS----------------NHPEPE-SIPNL---------------HYWFKPKVVSQLSKILYHIKNIDD--------E---E-MYNFFATAFSYT------VWKV--SNARKSEYK----LYRMDVNELDK-W----------------------NPDV---LSIFKAILFDNLKGMEE--FYKTMQE----KKAKATIYLRD-FRM---S--DV-------EDE---VTIVLTSPPYGDSRTTVAYEQFSKYSA----------------------------------------------L------------------------WLGF--KEILN-MEAK-AIGGVRKVGKVEKL--GSKTL-------------EEVFKK--VYEKDPERGWDL----YTYFYDMDVSIQKIVKALKKGRSYVIFVIGSRTIRRIKIP-TDRI--LIEMAEKHNLQYEKIIYRKIPSK---RLP-WKN----AP-ENVPGIKGET-IGSEFIIIWKY----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- btsIM2_Geobacillus_thermoglucosidasius_85720925 --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MK--KTLLE----NTDLQSL---------------VCN-IDWEFK------DANTQYLTHNLHRYSGKFIPQIAKSAI-E--L-L-------------T-Q-PG---D---TILDPYMGSGTTLVEAVLLNRFSIGIDLNPLAVLIAQAKVT--PIE----REKL--DFLITTFTD-----------LCESLDLYFEPSIFNPPLSNIEELVEEAR--------KDFRFT--NDWF---------------TKWFQEKVLLQLIVIKRAIDSISD--------L---D-CRNLATVAFSNI------LRRS--SN-AHSGYP----NVMYDKNAKERPL----------------------PAMV---FLQSLKESVAMVESLDY------LKF----KSFKPRIYLCDNNNM------PI-------PDN--TIDAIITHPPYIG---AIPYAEYGMLSL----------------------------------------------G------------------------WLGY--NW-RE-LDEK-LTGGKR-------------------------------------------QSKNV-V--HRFKVGYTRMLQESYRVLKP-GKKMFLLVGNPVVKGEVVD-LGEM--TKNLATEVGFSLIAESTRMGTNR-R-ANK-MGN--------------------EVLLFFQKN----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- C801_RS06010_Bacteroides_uniformis_511018363 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MNVTLENYR-----------SLDFDTI---------------SVK-EEWN-------MPAERERRMHSIHAYPAKFPAFITTKAI-H--K-A-------------E-E-YNISVK---TVADIFCGCGTVAFETVRSGKHFWGCDINPVATLIAETKSN--VYQ----DKQLK-DIFDQIIAV-----------YKTS---------------GVD---------------ESNRIY-SNERI---------------RYWFDETHIDDLLKLRSAIYQVTN--------D-G-L-YRNFFLCAFSNI------LKSC--SR-WLTKSI----KPQIDPKKQPK-D----------------------VLSS---YICQVNMMRKANME--------NINE----EYGEADIIRNNILDI------NI-------DKP--FTDLIVTSPPYVT---SYEYADLHQLST----------------------------------------------L------------------------WLEYTDDF-RA-LREG-TIGSLYHSKEFNENLKKLNNT-------------GQDIVF-KMYSIDKRKARSI----AQYYIDIQSTVHKVAEMLNT-RGACLFVIGNTEYKGVKID-NAKH--LTECLLAEGFVNIEVDRRKISNK-I-LTP-YRD-TNGKF-ASIKG-SCRKVYSEEFVIFARK----------CN----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- BACOVA_04794_Bacteroides_ovatus_ATCC_8483_156107192 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MNVTLENYR-----------SLDFDTI---------------SVK-EEWN-------MPAERERRMHSIHAYPAKFPAFITTKAI-H--K-A-------------E-E-YNISVK---TVADIFCGCGTVAFETVRSGKHFWGCDINPVATLIAETKSN--VYQ----DKQLK-DIFDQIIAV-----------YKTS---------------GVD---------------KSNRIY-SNERI---------------RYWFDEAHIDDLLKLRSAIYQVTN--------D-G-L-YRNFFLCAFSNI------LKSC--SR-WLTKSI----KPQIDPKKQPK-D----------------------VLSS---YICQVNMMRKANME--------NINE----EYGEADIIRNNILDI------NI-------DKP--FTDLIVTSPPYVT---SYEYADLHQLST----------------------------------------------L------------------------WLEYTDDF-RA-LREG-TIGSLYHSKEFNENLKKLNNT-------------GQDIVF-KMYSIDKRKARSI----AQYYIDMQSTVHKVAEMLNT-RGACLFVIGNTEYKGVKID-NAKH--LTECLLAEGFVNIEVDRRKISNK-I-LTP-YRD-TNGKF-ASIKG-SCRKVYSEEFVIFARK----------CN----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- TRICHSKD4_2254_Roseibium_sp_TrichSKD4_307773237 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MKQRLVLETTEQRKAALAERLDLRGESFADWIEGQIENAVA-PPFAGEIDE-PVTIGELLEGQVVE--------KLTHADWAFT------SANTRYLTHDLHPYPAKFPPQIPGQLI-A--A-L-------------S-F-PG---D---LVFDPFGGSGTTAVEAVRLGRRTVSLDANPLSALLGRAK-T--AYL----DADM--AAQLHSLEG-----------VVES---------------HIAAACGKGSDWSEGLLATHSKYVPQIPNF---------------EKWFCDAAGGELALLRHLISKVTE--------G---A-AKDAALVALSRI------VTRV--SN-QDSETR----YVSVAKEIPPG-F----------------------TLRA---YVESLRFVSKKMESARR-----PML------GASAEFVEGDARTD---IGHTV-------APA--SVDLIVTSPPYPN---ATDYHLYHRFRL----------------------------------------------F------------------------WLGW--DP-RE-LGAV-EIGSHL----------KHQRN-------------GTDFAE--------------------YETEMAKVLTDCFEALQP-GRHAVFIVGDAVFKGEQFS-TSGA--LADAAAKCGFEVLGTVNRPIHAT---KRS-FSK----A--ARRA-------RSEELLILRRP--------------NAPVAVAIDAPAYKMWPFERTLRAKELSALGVADLKAPKAAKTIETELAQPALWNLRRSTFSAQFCAEKDPRPQTVWQKVLENGDADAATRKDPKYATHGLHAYKGKFYPQLAKSLLNTSGVECGAKVLDPYCGSGTVPLECLLNGYQAFGFDMNPLAAKIAKAKSGILLRDQELIELSAASITDMLKTGVGSDELDQFPENVHGELLSWFPKPVLSKLNAILARIRLLGDETLVDYFEVVLSSIIREVSQQEPSDLRIRRRKEPLTDAPVFELFSERLAAQMLRLQKYRSVQARQPGRRYLPRIEEGDSRNSECFLKACVGSASIDCVVTSPPYATALPYIDTDRLSILALMGTPSNERAVVEGSLTGSREIKRKEREELEEQLNDGSVNLPHPIVRTLKGILDGNRGSDAGFRRQNMPALLSRYFTDIQKTLAQVHRVMKPGAKAFYVVGDSRTKVGDNWFAIPTCQHTREIAADVGFKVHPSISIDVTTERMLHLKNAITENDILVFERA DB30_07152_Enhygromyxa_salina_747219293 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MAFRFPL-----VE--------ALR-AQ---------DLDEPSELIHGFHAYPARMHPAVARVLL-R--ELS-------------V-G-PG---S---EVLDPFCGSGTVAVEALISGWRALGSDLDPLALRLARVKTE--RRG----PKTR--ARFLDRLAA-----------IGAA---------------SSARVRDRDD--------VQAKIS---ADE---------------RRWYEVHVLKELAGLLEEIRKVDD--------E---R-DRLALEMVFSAI------VVKF--SR-QRSETA----ARTTDKRLRKG-L----------------------VTEF---FVRKGEELAAAWAELSA-----AIPR----PHHLPRFVLSDARRL---P--RT-------LAGEYRCDLVLTSPPYGG---TYDYLHHHARRH----------------------------------------------A------------------------WLGI--SP-KK-LREK-EIGARR----------YLSRG-------------NGGAGS-----------------RARWDDEMTAVLGSIAELLRP-DAVAVLLVGDAELGGERIA-ADSQ--LERLAPAAELEFLAAASQPRPDW---RG-------------GPP-------RREHLVALRRV-DFAGPRG--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- TRICHSKD4_RS09860_Roseibium_sp_TrichSKD4_750334774 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DE-PVTIGELLEGQVVE--------KLTHADWAF------TSANTRYLTHDLHPYPAKFPPQIPGQLI-A--A-L-------------S-F-PG---D---LVFDPFGGSGTTAVEAVRLGRRTVSLDANPLSALLGRAK-T--AYL----DADM--AAQLHSLEG-----------VVES---------------HIAAACGKGSDWSEGLLATHSKYVPQIPNF---------------EKWFCDAAGGELALLRHLISKVTE--------G---A-AKDAALVALSRI------VTRV--SN-QDSETR----YVSVAKEIPPG-F----------------------TLRA---YVESLRFVSKKMESARR-----PML------GASAEFVEGDARTD---IGHTV-------APA--SVDLIVTSPPYPN---ATDYHLYHRFRL----------------------------------------------F------------------------WLGW--DP-RE-LGAV-EIGSHL----------KHQRN-------------GTDFAE--------------------YETEMAKVLTDCFEALQP-GRHAVFIVGDAVFKGEQFS-TSGA--LADAAAKCGFEVLGTVNRPIHAT---KRS-FSK----A--ARRA-------RSEELLILRRP--------------NAPVAVAIDAPAYKMWPFERTLRAKELSALGVADLKAPKAAKTIETELAQPALWNLRRSTFSAQFCAEKDPRPQTVWQKVLENGDADAATRKDPKYATHGLHAYKGKFYPQLAKSLLNTSGVECGAKVLDPYCGSGTVPLECLLNGYQAFGFDMNPLAAKIAKAKSGILLRDQELIELSAASITDMLKTGVGSDELDQFPENVHGELLSWFPKPVLSKLNAILARIRLLGDETLVDYFEVVLSSIIREVSQQEPSDLRIRRRKEPLTDAPVFELFSERLAAQMLRLQKYRSVQARQPGRRYLPRIEEGDSRNSECFLKACVGSASIDCVVTSPPYATALPYIDTDRLSILALMGTPSNERAVVEGSLTGSREIKRKEREELEEQLNDGSVNLPHPIVRTLKGILDGNRGSDAGFRRQNMPALLSRYFTDIQKTLAQVHRVMKPGAKAFYVVGDSRTKVGDNWFAIPTCQHTREIAADVGFKVHPSISIDVTTERMLHLKNAITENDILVFERA P279_RS13360_Rhodobacteraceae_bacterium_PD-2_746588169 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MNEANS----------------WPASVD-AE-TKAKSSL-----SE------------VDWGF------PDRLSHSEIEGVHPYPAKFISEIPRALI-S--H-L-------------P-L-PR-N-S---AVLDPFCGSGSTLVESQRLGIPAVGIDLNPIACLMSRVKTM--ACP----------TDLEQHAAR-----------ITLS---------------SDE---------T-----TDIEIP-EIPNL---------------DHWFQPNVQIEINRIARSIVTAPS--------E---A--RDALKLALSSI------IVRV--SN-QESDTR----YAAVKKDVDPS-T----------------------VPKL---FLRAVNRINAALQK--------RNYP----LT-PVEVVESDILKV-E-P--SV-------IKR--PIGLVVTSPPYPN---AYEYWLYHKYRM----------------------------------------------W------------------------WLGY--DP-LK-VKTD-EIGARA----------HFFKK-------------NHHTAE-------------------DFARQMRQTLTLLDGVIVK-GGFASFVVGRSKIHGKIYD-NAKI--ITDEAAPLGFKPFFVTERILSAN---RKS-FNL-------SHAN------IKTETVLVLRKE----------VE----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- GEMOB_RS19995_Gemmata_obscuriglobus_497728440 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MPGGLEPR---------------DPNRRRSLTNIG-GAVTKE-GD-RADADRL-----AH--------ALD-VPPATDA----EDDAARGHVHGFHTYPARMHPITAARLV-R--A-F-------------A-P-PG---G---TVLDPFCGSGTVLVESLVAGRNARGTDLNPLAVLLAQCKTR--PRT----PADI--DRLVADAVA-----------CAEE---------------ADARRKAKAG--------ASRRFS---PED---------------VELFEPHVLLELDSLRAKIGTFGG------------P-NRSDLELLLSSV------LVKL--SR-KPGDTG----WGVRTRRTAGG-F----------------------AAKL---FVQRAQDLAARLSALRV-----LLPA----PAPTATVVQDDATDL---K--RL-------PPG--FVGAVVTSPPYAA---TYDYIAHHSLRL----------------------------------------------R------------------------WLGL--DP-AP-LARG-EIGSRT----------AYARL-------------TPAAAR------------------TAWCRELDRFFRAVGRLLPP-GAPMVLLMADSAVGDGAIR-ADEV--AAEVGRACGLLPAARASQPRPHF---HGP-TAN----AF-RDRP-------RFEHALLLRKA----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- ATHE_RS08295_Caldicellulosiruptor_bescii_506388366 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ME-NLKFDLFLSN-VDIDFK------DYKIKESVHGIHSYPAMMPAPLAEFLI-Q--S-F-------------T-K-KN---D---IVLDPFCGSGTVLYEALKNGRNAIGVDINPLAILISNVKINIGKIE----LSKLE-KFFIEIFKA-----------YQQL---------------EG----------------KEFELP-KFKNI---------------DFWFKKEVQINLQRLKTAIEVVDQ--------D---I-YKLFFKLVFAKT------VRNV--SNTRNSEFK----LYRLEEEKLKQ-H----------------------NPDV---WKTFE----RDFKVTEEKLLYREITN----NSNYVKIFHKNILDL---D--EV-------ENE--TVDLILTSPPYGDARTTVAYGQFSRLSL----------------------------------------------Q------------------------WLNL--WE-YD-VDKE-SLGGKKKIGEFDPILFQLPVL-------------NSVFNK--ILQLDSKRAEEV----LRFFHDYFYSIKKLTKLVRK-KGYAVYVVANRKVRGIEIP-TDEI--TKEMFEFFGFVWVDTLERNIINK---RMP-LKN----SP-SNIQGQKDNTMLKEKVVILRKI----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CALLA_RS05395_Caldicellulosiruptor_lactoaceticus_503808472 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ME-NLKFDLFLSN-VDIDFK------DYKIKESVHGIHSYPAMMPAPLAEFLI-Q--S-F-------------T-K-RN---D---IVLDPFCGSGTVLYEALKNGRNAIGVDINPLAILISNVKINIGKIE----LSKLE-KFFIEIFKA-----------YQQL---------------EG----------------KEFELP-KFKNI---------------DFWFKKEVQINLQRLKTAIEVVDQ--------D---I-YKLFFKLVFAKT------VRNV--SNTRNSEFK----LYRLEEEKLKQ-H----------------------NPDV---WKTFE----RDFKVTEEKLLYREITN----NSNYVKIFHKNILDL---D--EV-------ENE--TVDLILTSPPYGDARTTVAYGQFSRLSL----------------------------------------------Q------------------------WLNL--WE-YD-VDKE-SLGGKKKIGEFDPILFQLPVL-------------NSVFNK--ILQLDSKRAEEV----LRFFHDYFYSIKKLTKLVRK-KGYAVYVVANRKVRGIEIP-TDEI--TKEMFEFFGFVWVDTLERNIINK---RMP-LKN----SP-SNIQGQKDNTMLKEKVVILRKI----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- N447_RS0100840_Candidatus_Fervidibacter_sacchari_658440082 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------M--LTDIE-ALKS-IDWTFR------GVKAKEGIHGLHSYPAMMAPPVAKKLI-E--A-L-------------S-Q-QG---D---TVFDPFCGSGTVTTEAIRLGRHAIGFDINPLALLIAEVKAT--PIM----PRRLW-QALSQIESN-----------LQGL---------------------------------SEVETP-MFANI---------------DYWFKPEVQKQLARLKAAIDALED--------E---A-VKKFSLVVFSRV------VREA--SNTRPNEFK----LFRLPPEKLER-H----------------------NPDV---FRLFRQRFAECVGIMSD--WWREVGG----SLLKPQLKLHDARQP---F--PL-------PPE--SVDLLLTSPPYGDARTTVAYGQFSRLSL----------------------------------------------Q------------------------WLGL--WR-ED-LDRL-SLGG-----HLRSMTVPIPTL-------------KEVLEE--IAKVSPKRAKEV----EAFYADLYDCLRNIVPVVKQ-DGFAVFVIANRKVKGVKLP-TDKI--IVEMLPEFELI--TALPRQIPNK---RMP-LRN----SP-SNIPGETDTTMLEEHIVILRK------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CALKRO_RS05380_Caldicellulosiruptor_kronotskyensis_503195393 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ME-NLKFDLFLSN-VDIDFK------DYKIKESIHGIHSYPAMMPAPLAEFLI-Q--S-F-------------T-R-RN---D---IVLDPFCGSGTVLYEALKNGRNAIGVDINPLAILISNVKINIGKIE----LSKLE-KFFIEIFKA-----------YQQL---------------EG----------------KEFELP-KFKNI---------------DFWFKKEVQINLQRLKTAIEVVDQ--------D---I-YKLFFKLVFAKT------VRNV--SNTRNSEFK----LYRLEEEKLKQ-H----------------------NPDV---WKTFE----RDFKVTEEKLLYREITN----NSNYVKIFHKNILDL---D--EV-------ENE--TVDLILTSPPYGDARTTVAYGQFSRLSL----------------------------------------------Q------------------------WLNL--WE-YD-VDKE-SLGGKKKIGEFDPILFQLPVL-------------NSVFNK--ILQLDSKRAEEV----LRFFHDYFYSIKKLAKLVRK-KGYAVYVVANRKVRGIEIP-TDEI--TKEMFEFFGFVWVDTLERNIINK---RMP-LKN----SP-SNIQGQKDNTMLKEKVVILRKI----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- BF638R_RS09490_Bacteroides_fragilis_504064743 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQVG-NNFY-----------NIDFEQI---------------PVD-EYWN-------TGSEKECRMHKIHAYPAKFPAFITTKAI-E--F-V-------------K-R-RGGEVN---LIADIFCGCGTVAYETKKNGIDFWGCDINPVATLIAEVKSQ--KYQ----DDLLK-KYSNEIVNV-----------FSSL---------------IIA---------------NEDIEN-INERI---------------KYWFKNQQIRDLLALKRAISIVLI--------E-N-VSYKKFFLCAFSNI------LKST--SV-WLTKSI----KPQVDPDKVPA-V----------------------VIDA---FVRQVDMMIKANNE--------NLIS----NENQIKIENINFLDK------KI-------ENS--FADLIVTSPPYVT---SYEYADLHQLST----------------------------------------------L------------------------WLGFVDDY-KI-LRKG-TIGSIYHESDYEDNLNQLSPI-------------GLDIVS-SLYKVDKSKAKAA----SKYFVDMQKVTSKAFQLLNN-NGYSLFVIGNTEYKKVKID-NAKH--LIESMYSAGFRDLEIIQRKISKK-I-LTP-YRD-SRGKF-TSNKN-S-RQVYSEEFIIIGKK----------YED---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- TT66_RS06615_Dehalococcoides_sp_UCH007_822531632 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MVSNPLLD---AQ-LDWTFK------ESNVREDTHCYHDYPARMIPQIAQQLL-K--L-Y-------------S---NG---G---LLFDPYCGTGTSLVEAMRYGINAVGTDINPLACLIARTKTKMLDIK----SVQLQITAFEKFASK-----------MDGC---------------DL----------------KQFDVN-RIENL---------------NFWFKPEVISKITWILSFINDMSD--------V---D-IKDFFKVAASET------IRKS--SNTRQGEFK----LYRRSQSDLIK-F----------------------NPDV---FKIMLDTLYRNQRGLID-LFAKTYAD----KSLRIRVSNFNTVDCIPKY--EI-------EPE--SVDIVITSPPYGDSHTTVAYGQYSRLSA----------------------------------------------E------------------------WLDL--ENPRS-VDRI-AMGGQLPKTIIK---FNFDQL-------------DTVISE--IENKDIRRAREV----CGFYDDLNKSIANVSATIIS-GGYACYVVANRKVKGNTLP-TDLV--IRQFFEQQGFDHINTFNRSIPNK---RMP-LRN----CP-SNISGLLEETMSKEYIVVMKKQ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- consensus/100% .....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................a..............h........hhhup.........................h...........................................p..h.................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... consensus/95% ...............................................................................................................................................................................................................................................................................................................................................................................................................................................................................p.h..h.u.h.......hh..................................lhD.a.GsGos...s...s....u.Dhssls...s..+..................................................................................................................................................h.......................................................................h...................................h...p............................hhT.sPY.....s..Y...............................................................................h..................G.........................................................a..p...........h......h.hhhup.........................h...........................................p.hl.h.................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................. consensus/90% ...............................................................................................................................................................................................................................................................................................................................................................................................................................................................................Hsh..h.A.h.s..s..hl..................................lhD.FsGsGTshh.s...sb..hG.DhsPlu..ls..+...................b...h.............h.................................................................h........l..h........................b..h..hhu..........p...p..................p............................s......h...................................h...s............................llTSPPYss...sh.Y...............................................................................W...........h.....hGs........................................................a..pb...h..h...l......h.hhhus.........................h......pb.................................b.Ephlhh.+................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................ consensus/85% .........................................................................................................................................................................................................................................................................................................................................................................................................................................................p....sb...............Hsha.Y.A.h.s.hsp.hl.p....h..................s...p....lhD.FsGsGTshhpu...Gb.shG.DlsPlu.blu..+.............pb....h...h.............h.................................................................h........l..h......h.................c..hbhhhu........l.p...pp...sp............p............................s......h......................h..........s.h...s.........................hshllTSPPYss...sh.Y.....b.........................................................................W..b........h.....hGs........................................................a..-b...h..h.p.lp...u.hhhhlus.......h...........hh...sh.....hpb.b...............................b.Echlhh.+................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................ consensus/80% .........................................................................................................................................................................................................................................................................................................................................................................................................................................................p..p.sb.......b.......HshH.YPA.h.s.hsp.ll.p....h..................s...p....lhDsFsGsGTshhEu...GbpshG.DlsPlA.hlu..+s............pb....h.p.hp............h...............................................................p.W.......pl..l...h..h.................+.hhbhhhu........l.p...Sp...sp............p............................s..h...a......h...............h..........spl...shb.........l.............hshllTSPPYss...sh.Y....pbp........................................................................Wh.h........h.....lGu.b...........h.....................p....................a..-b...h..h.chlp...u.hhhlluss.h....h..........phh...sh..h..hpb.b.....................p.........b.Echlhh.+................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................ consensus/75% .........................................................................................................................................................................................................................................................................................................................................................................................................................................................p..p.sh.s.....b......sHuhH.YPAbh.s.hsp.ll.p....h..................s...p....lhDPFsGuGTshVEub..GbpslGsDlsPlAhhlupsKsp...........pb..p.h.p.hp............h.....................................p.p......p................c.Wa....b.pl..lb..l..l..........p......+phhbhhhS.h......l.ps..Sp.b.sc......b.p.p.c............................s.ph...a....p.h.p....h........h........s.spl...chb.........l........s....lshllTSPPYss...sh-Y...ppbp........................................................................Whsh........hpp..plGubb...........h.p...............p.h.p....................abp-h...lp.h.chlp..su.hhhllGss.h.s..hs.......h.phh...Gh..hsshpb.bsp...................p.........bpEchllh.+................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................ consensus/70% .......................................................................................................................................................................................................................................................................................................................................................................................................................................h................hp.hp.sh.s.....b......sHuhH.YPAbh.s.hspbll.p....h..................s...p....VhDPFsGuGTshVEubb.GbpslGsDlsPLAhhlupsKsp...........ph..p.h.p.hp............h.....................................p.p......p................c.Wa..phb.pl..lb..l..l.p........p.....h+phhbhhhS.h......l.ps..Sp.b.sc......b.p.p.c............................s.ph...a.b.hp.h.pph..h........h........s.spl...Dhbp........l........s....lshllTSPPYss...sh-Y..appbp........................................................................Whsh........hpp..plGubb...........h.p..............sp.h.p....................abp-h...lp.h.chLp..subhhhllGss.h.s.bls..sp...h.chh..hGh..hsshpb.bspp........p.........s.........bpEchlllb+................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
      Back to Contents
    • General notes, phyletic distribution and gene neighborhoods of Group I/Clade 5/Ot12g00270-like N6-MTases of adenine methylases

      General notes

      The Group I/Clade 5/Ot12g00270 group of methylases are found in Dinoflagellates, chlorophytes and Emiliania. The Symbiodinium version is fused to PPR repeats. Perhaps there is a common plastid function across these. This group and its prokaryotic homologs are distinguished by a H before the Rossmann methylase, D in Str-1*, D after strand-2*, S at the end of Strand-3, T before the characteristic SPPY motif in strand-4 and E and R flanking strand-7. They also seem to have a large insert between Strands-2 and 3 and Strands 3 and 4. The SPPY motif suffests that they are N4 methylases. Operons suggest that they are derived from R-M systems.
      GI           Gene neighborhoods                                                                               Architectures             Pfam archs                                      Gene name          Len  Taxonomy                                     Species                                       Genbank
      # 1; Eukaryotic versions
      Smin1000031348                                                                                                 PPR-repeats+N6-MTase     PPR_2+PPR_2+PPR_2+PPR_2+PPR_2+N6_N4_Mtase       Smin1000031348     722  eukaryota>alveolata>dinophyceae              Symbiodinium minutum Mf 1.05b.01              1
      551547865                                                                                                      N6-MTase                                                                 EMIHUDRAFT_258012  101  eukaryota>haptophyceae                       Emiliania huxleyi CCMP1516                    hypothetical protein EMIHUDRAFT_258012 [Emiliania huxleyi CCMP1516].
      298705971                                                                                                      N6-MTase                 Methyltransf_26                                 Esi_0134_0029      510  eukaryota>stramenopiles                      Ectocarpus siliculosus                        conserved unknown protein [Ectocarpus siliculosus].
      145352834                                                                                                      N6-MTase                 N6_N4_Mtase                                     OSTLU_26692        411  eukaryota>viridiplantae>chlorophyta          Ostreococcus lucimarinus CCE9901              predicted protein [Ostreococcus lucimarinus CCE9901].
      303287789                                                                                                      N6-MTase                                                                 MICPUCDRAFT_63711  580  eukaryota>viridiplantae>chlorophyta          Micromonas pusilla CCMP1545                   DNA methyltransferase [Micromonas pusilla CCMP1545].
      308809742                                                                                                      N6-MTase                 N6_N4_Mtase+N6_N4_Mtase                         Ot12g00270         486  eukaryota>viridiplantae>chlorophyta          Ostreococcus tauri                            unnamed protein product [Ostreococcus tauri].
      255089364                                                                                                      N6-MTase                                                                 MICPUN_104425      563  eukaryota>viridiplantae>chlorophyta          Micromonas sp. RCC299                         DNA methyltransferase [Micromonas sp. RCC299].
      612390323                                                                                                      N6-MTase                                                                 Bathy10g01900      555  eukaryota>viridiplantae>chlorophyta          Bathycoccus prasinos                          DNA methyltransferase [Bathycoccus prasinos].
      # Prokaryotic homologs
      # 4;
      503195393    EcoRII->N6-MTase*-><-PolB_NTase                                                                   N6-MTase                 N6_N4_Mtase                                     CALKRO_RS05380     416  bacteria>firmicutes                          Caldicellulosiruptor kronotskyensis           DNA methyltransferase [Caldicellulosiruptor kronotskyensis].                         503195387_?->503195388_?->503195389_?->503195390_?->754103640_?->503195391_?->503195392_EcoRII->503195393_N6-MTase*-><-503195394_PolB_NTase||503195395_?->503195396_?->503195397_?->503195398_?-><-503195399_?<-500266339_?
      503808472    <-N6-MTase*<-EcoRII                                                                               N6-MTase                 N6_N4_Mtase                                     CALLA_RS05395      416  bacteria>firmicutes                          Caldicellulosiruptor lactoaceticus            DNA methyltransferase [Caldicellulosiruptor lactoaceticus].                          <-503808467_?||503808468_?->503808469_?-><-503808470_?||754084957_?->754084959_?-><-503808471_?<-503808472_N6-MTase*<-503808473_EcoRII<-503808474_?<-503808475_?<-503808476_?<-754084960_?<-503808478_?<-503808479_?
      506388366    PolB_NTase-><-?<-?<-N6-MTase*<-EcoRII                                                             N6-MTase                 N6_N4_Mtase                                     ATHE_RS08295       416  bacteria>firmicutes                          Caldicellulosiruptor bescii                   DNA methyltransferase [Caldicellulosiruptor bescii].                                 <-754086977_?<-506388360_?<-506388361_?<-506388362_?||754086978_PolB_NTase-><-506388364_?<-506388365_?<-506388366_N6-MTase*<-506388367_EcoRII<-754086980_?<-506388369_?<-506388370_?<-754086982_?<-506388372_?<-506388373_?
      658440082    <-N6-MTase*<-EcoRII                                                                               N6-MTase                 N6_N4_Mtase                                     N447_RS0100840     404  bacteria                                     Candidatus Fervidibacter sacchari             hypothetical protein, partial [Candidatus Fervidibacter sacchari].                   <-516980737_?<-516980739_?<-658440081_?||516980743_?-><-658440082_N6-MTase*<-516980747_EcoRII<-658440083_?<-658440084_?<-516980752_?||516980755_?->516980757_?-><-658440085_?
      # 2;
      516669726    <-N6-MTase+N6-MTase*<-EcoRII                                                                      N6-MTase+N6-MTase        N6_N4_Mtase                                     A3IY_RS0101600     412  archaea>crenarchaeota                        crenarchaeote SCGC AAA261-L14                 hypothetical protein [crenarchaeote SCGC AAA261-L14].                                <-516669726_N6-MTase+N6-MTase*<-516669730_EcoRII<-516669731_?||516669732_?-><-516669733_?<-516668117_?||516669734_?-><-516669735_?
      516682681    EcoRII-><-N6-MTase+N6-MTase*                                                                      N6-MTase+N6-MTase        N6_N4_Mtase                                     A3MW_RS0105500     412  archaea>crenarchaeota                        crenarchaeote SCGC AAA261-N23                 hypothetical protein [crenarchaeote SCGC AAA261-N23].                                516668114_?->516682676_?-><-516669734_?||516682677_?-><-516669732_?||516682679_?->516682680_EcoRII-><-516682681_N6-MTase+N6-MTase*
      # 3;
      494032514    <-DnaJ||?->?->?->?->?->N6-MTase*->STYKIN->                                                        N6-MTase                 UPF0020                                         PPSIR1_RS28425     465  bacteria>proteobacteria>deltaproteobacteria  Plesiocystis pacifica                         RNA methylase [Plesiocystis pacifica].                                               <-494032507_?<-494032508_DnaJ||494032509_?->494032510_?->494032511_?->494032512_?->494032513_?->494032514_N6-MTase*->770228038_STYKIN-><-494032516_?||770228041_?->770228043_?->494032519_?->770228020_?->494032521_?->
      816962200    <-STYKIN||?->?->?-><-?||Methylase->N6-MTase*->?->STYKIN->                                         N6-MTase                 N6_N4_Mtase                                     DB32_008855        392  bacteria>proteobacteria>deltaproteobacteria  Sandaracinus amylolyticus                     modification methylase, putative [Sandaracinus amylolyticus].                        <-816962193_?<-816962194_STYKIN||816962195_?->816962196_?->816962197_?-><-816962198_?||816962199_Methylase->816962200_N6-MTase*->816962201_?->816962202_STYKIN->816962203_?-><-816962204_?||816962205_?-><-816962206_?||816962207_?->
      747219293    <-DnaJ||?->?->N6-MTase*->STYKIN->                                                                 N6-MTase                 UPF0020                                         DB30_07152         386  bacteria>proteobacteria>deltaproteobacteria  Enhygromyxa salina                            putative modification methylase [Enhygromyxa salina].                                747219286_?-><-747219287_?<-747219288_?<-747219289_?<-747219290_DnaJ||747219291_?->747219292_?->747219293_N6-MTase*->747219294_STYKIN-><-747219295_?||747219296_?-><-747219297_?<-747219298_?||747219299_?->747219300_?->
      # 3;
      763387130    <-N6-MTase*<-?<-?||?-><-REC                                                                       N6-MTase                 SP+N6_N4_Mtase                                  CAP_RS22030        439  bacteria>proteobacteria>deltaproteobacteria  Chondromyces apiculatus                       hypothetical protein [Chondromyces apiculatus].                                      <-763387118_?<-763387121_?||763387124_?-><-763387187_?||763387126_?-><-763387128_?<-763387189_?<-763387130_N6-MTase*<-763387133_?<-763387135_?||763387138_?-><-763387141_REC||763387143_?->763387190_?->763387145_?->
      501196891    <-STYKIN<-?<-?||REC-><-?||?->N6-MTase*->                                                          N6-MTase                 SP+N6_N4_Mtase                                  SCE_RS37400        433  bacteria>proteobacteria>deltaproteobacteria  Sorangium cellulosum                          hypothetical protein [Sorangium cellulosum].                                         <-501196884_?<-769204570_STYKIN<-501196886_?<-769204573_?||501196888_REC-><-501196889_?||769204575_?->501196891_N6-MTase*->501196892_?-><-501196893_?||769200681_?->501196895_?->501196896_?->501196897_?->501196898_?->
      497728440    N6-MTase+N6-MTase*->                                                                              N6-MTase+N6-MTase        N6_N4_Mtase+Methyltransf_26                     GEMOB_RS19995      414  bacteria>planctomycetes                      Gemmata obscuriglobus                         hypothetical protein [Gemmata obscuriglobus].                                        497728423_?-><-497728426_?<-702544373_?<-497728433_?<-702544379_?||497728437_?->702544385_?->497728440_N6-MTase+N6-MTase*-><-497728446_?
      # 3;
      503800515    RadC-><-?<-HISKIN<-N6-MTase+N6-MTase*                                                             N6-MTase+N6-MTase        N6_N4_Mtase+N6_N4_Mtase                         MURRU_RS15895      434  bacteria>bacteroidetes                       Muricauda ruestringensis                      DNA methylase N-4/N-6 domain-containing protein, partial [Muricauda ruestringensis]. <-503800507_?<-503800509_?<-503800510_?||503800511_?->503800512_RadC-><-503800513_?<-503800514_HISKIN<-503800515_N6-MTase+N6-MTase*<-503800517_?<-503800518_?<-754185092_?<-503800521_?||754184508_?-><-754184510_?||503800523_?->
      723440940    N6-MTase+N6-MTase*->HISKIN->                                                                      N6-MTase+N6-MTase        N6_N4_Mtase                                     MB26_RS00775       424  bacteria>bacteroidetes                       Sphingobacteriaceae bacterium DW12            hypothetical protein [Sphingobacteriaceae bacterium DW12].                           <-723440934_?<-723440935_?<-723440981_?||723440936_?->723440937_?->723440938_?->723440939_?->723440940_N6-MTase+N6-MTase*->723440941_HISKIN->723440942_?->723440943_?->723440982_?-><-723440983_?<-723440944_?||723440984_?->
      639235853    N6-MTase+N6-MTase*->HISKIN->                                                                      N6-MTase+N6-MTase        N6_N4_Mtase                                     L956_RS0103135     423  bacteria>bacteroidetes                       Elizabethkingia anophelis                     hypothetical protein [Elizabethkingia anophelis].                                    <-639235847_?<-639235848_?<-639235849_?<-639235850_?<-639235851_?<-489069457_?||639235852_?->639235853_N6-MTase+N6-MTase*->639235854_HISKIN->639235855_?->639235856_?->639235857_?->639235858_?-><-639235859_?<-639235860_?
      # 3;
      156107192    <-Phage_integrase<-?||Eco57I-REase-><-?<-HISKIN<-N6-MTase*<-Phage_integrase                       N6-MTase                 MethyltransfD12                                 BACOVA_04794       424  bacteria>bacteroidetes                       Bacteroides ovatus ATCC 8483                  hypothetical protein BACOVA_04794 [Bacteroides ovatus ATCC 8483].                    <-156107185_?<-156107186_?<-156107187_Phage_integrase<-156107188_?||156107189_Eco57I-REase-><-156107190_?<-156107191_HISKIN<-156107192_N6-MTase*<-156107193_Phage_integrase<-156107194_?<-156107195_?||156107196_?-><-156107197_?<-156107198_?<-156107199_?
      504064743    <-Phage_integrase<-?||N6-MTase*->                                                                 N6-MTase                 Methyltransf_23                                 BF638R_RS09490     424  bacteria>bacteroidetes                       Bacteroides fragilis                          DNA methylase [Bacteroides fragilis].                                                495935276_?->492243001_?->499516031_?-><-504064739_?<-504064740_?<-504064741_Phage_integrase<-504064742_?||504064743_N6-MTase*->504064744_?-><-752443818_?<-504064746_?<-504064747_?<-504064748_?<-695461310_?<-504064751_?
      511018363    Phage_integrase->N6-MTase*->HISKIN->?-><-Eco57I-REase                                             N6-MTase                 MethyltransfD12                                 C801_RS06010       424  bacteria>bacteroidetes                       Bacteroides uniformis                         hypothetical protein [Bacteroides uniformis].                                        511018359_?->511018360_?->492387032_?->511018361_?->492411218_?-><-492411213_?||511018362_Phage_integrase->511018363_N6-MTase*->511018364_HISKIN->490429217_?-><-737477984_Eco57I-REase||737477986_?->511018059_?->737477988_?->511018368_?->
      # 2;
      307773237    <-MPTase-PVC+PSBP<-Thermonuclease||?->?->N6-MTase+N6-MTase*->                                     N6-MTase+N6-MTase        N6_N4_Mtase+N6_N4_Mtase+UPF0020                 TRICHSKD4_2254     961  bacteria>proteobacteria>alphaproteobacteria  Roseibium sp. TrichSKD4                       DNA methylase N-4/N-6 domain-containing protein [Roseibium sp. TrichSKD4].           <-307773230_?||307773231_?->307773232_?-><-307773233_MPTase-PVC+PSBP<-307773234_Thermonuclease||307773235_?->307773236_?->307773237_N6-MTase+N6-MTase*-><-307773238_?||307773239_?-><-307773240_?<-307773241_?||307773242_?->307773243_?->307773244_?->
      750334774    <-MPTase-PVC+PSBP<-Thermonuclease||?->N6-MTase+N6-MTase*->                                        N6-MTase+N6-MTase        N6_N4_Mtase+N6_N4_Mtase+UPF0020                 TRICHSKD4_RS09860  913  bacteria>proteobacteria>alphaproteobacteria  Roseibium sp. TrichSKD4                       hypothetical protein, partial [Roseibium sp. TrichSKD4].                             497444828_?->750334695_?->497444832_?->497444833_?-><-750334771_MPTase-PVC+PSBP<-750334772_Thermonuclease||750334773_?->750334774_N6-MTase+N6-MTase*-><-497444839_?<-750334696_?||750334775_?->750334697_?->497444845_?->497444846_?->497444847_?->
      # 2;
      313667099    N6-MTase+N6-MTase->N6-MTase+N6-MTase*->                                                           N6-MTase+N6-MTase        N6_N4_Mtase+Methyltransf_26                     bsrIM2             389  bacteria>firmicutes                          Geobacillus stearothermophilus                M2.BsrI [Geobacillus stearothermophilus].                                            313667098_N6-MTase+N6-MTase->313667099_N6-MTase+N6-MTase*-><-313667100_?
      738943261    N6-MTase+N6-MTase->N6-MTase+N6-MTase*->                                                           N6-MTase+N6-MTase        N6_N4_Mtase+N6_N4_Mtase                         HW49_RS06065       375  bacteria>bacteroidetes                       Porphyromonadaceae bacterium COT-184 OH4590   hypothetical protein [Porphyromonadaceae bacterium COT-184 OH4590].                  738943253_?->738943255_?->738943257_?->738943290_?->738943291_?->738943259_?->738943293_N6-MTase+N6-MTase->738943261_N6-MTase+N6-MTase*->738943263_?->738943265_?->738943268_?-><-738943270_?<-738943272_?<-738943274_?||738943276_?->
      # 2;
      746588169    N6-MTase*->                                                                                       N6-MTase                 N6_N4_Mtase                                     P279_RS13360       385  bacteria>proteobacteria>alphaproteobacteria  Rhodobacteraceae bacterium PD-2               RNA methyltransferase [Rhodobacteraceae bacterium PD-2].                             <-564610271_?||746588168_?-><-665956792_?<-564610274_?||564610275_?->564610276_?->564610277_?->746588169_N6-MTase*->746588109_?->564610281_?->564610282_?->564610283_?->564610284_?->564610285_?-><-564610286_?
      123187375    N6-MTase*->N6-MTase->                                                                             N6-MTase                 N6_N4_Mtase                                     bmrIM1             379  bacteria>firmicutes                          Bacillus megaterium                           M1.BmrI [Bacillus megaterium].                                                       123187375_N6-MTase*->123187376_N6-MTase-><-123187377_?
      # 1;
      822531632    <-ParB<-?<-?<-?<-?<-?<-?<-N6-MTase*                                                               N6-MTase                 Methyltransf_26+Methyltransf_26                 TT66_RS06615       418  bacteria>chloroflexi                         Dehalococcoides sp. UCH007                    DNA methyltransferase [Dehalococcoides sp. UCH007].                                  <-822531626_ParB<-822531627_?<-822531628_?<-822531629_?<-822531630_?<-822531820_?<-822531631_?<-822531632_N6-MTase*<-822531633_?||822531634_?->822531635_?-><-822531636_?<-505220856_?<-505220857_?<-505220858_?
      594540697    Methylase-><-?<-?||?->?-><-?<-N6-MTase*                                                           N6-MTase                 UPF0020                                         AHM02030.1         404  archaea>crenarchaeota                        uncultured miscellaneous Crenarchaeota group  putative RNA methylase [uncultured miscellaneous Crenarchaeota group].               594540690_?->594540691_Methylase-><-594540692_?<-594540693_?||594540694_?->594540695_?-><-594540696_?<-594540697_N6-MTase*||594540698_?->
      374854167    <-Methylase||?-><-?||N6-MTase*->                                                                  N6-MTase                 N6_N4_Mtase                                     HGMM_F46H12C04     402  bacteria>proteobacteria>deltaproteobacteria  uncultured delta proteobacterium              RNA methylase [uncultured delta proteobacterium].                                    <-374854164_Methylase||374854165_?-><-374854166_?||374854167_N6-MTase*->374854168_?-><-374854169_?<-374854170_?<-374854171_?<-374854172_?||374854173_?-><-374854174_?
      85720925     N6-MTase->?->N6-MTase+N6-MTase*->                                                                 N6-MTase+N6-MTase        N6_N4_Mtase                                     btsIM2             393  bacteria>firmicutes                          Geobacillus thermoglucosidasius               M2.BtsI [Geobacillus thermoglucosidasius].                                           85720923_N6-MTase->85720924_?->85720925_N6-MTase+N6-MTase*-><-85720926_?
      
      Back to Contents


    • Multiple sequence alignment of the Group I-Clade6/ Emiliania-ver1 N6-MTases

                                                                                                  Str (-2)                                                                                     Str (-1)                                                                                 Helix-1                         Str-1                  Str-2                                                          Str-3                                            Str-4                                                  Str-5                                    Str-6                        Str-7                                                       
      RES                                                                           QGGVLRARF-----VAPPFSSLDA-RQ----------------PVWQERKKHWNAL---F--D--SG--------------------A---GRS-------------NTLLSAGRTSGLG----VLEK--ADGS--------------------------------------------V-----V-G---TSLFDPVLAEVMVRWFAPRPRLVAAG----SRLAPAGRPVIVLDPFAGGCVRGIVCAGLGCLYVGVDISAKQVRAN-------QEQ-W------QQL-E---R-S---G-R---LGEVK-HA--PAWLHGDGEQ-I---------ASH-YQ--G-VL-ASRGLPPETLA----DLILACPPYYDREQYNG-----GPLDLSMLPTYKAFLAKYQRIIAAAVSLLKPQHFACFVVGNAR--AK--G-GGLHVLLSDTLQAFAKVDCTPYNVAVLLTALGTAPQRAEQTMAAASKLVPAHQDVLVVVKGRG---------------F---DKSDARACG-IRAKE-G-----ASSQS
      FINAL                                                                         -------------------EEE---------------------HHHHHHHHHHHHH---H----------------------------------------------------------------EEE--------------------------------------------------------------------HHHHHHHHHH----------------EEE------EEEEE-----HHHHHHHHH---EEEEE--HHHHHHH-------HHH-H------HHH----------------------------EEEE----H-H---------HHH-HH--H-HH-HHH-------E----EEEEE------EEEE-----------------HHHHHHHHHHHHHHHHHHH----EEEEEEE-------------EE--HHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHH-------EEEEEEEEE----------------------------------EE--------------
      ALIGN                                                                         --------------------------------------------HHHHHHHHHHE------------------------------------------------------EEEE-----------HH----------------------------------------------------------------------HHHHHHHHH------------------------EEEE-------EEEEEEEH----------HHHHHHHH-------HHH-H------HHH-H--------------------------EEE-------------------H--------------------------EE-----------------------------HHHHHHHHHHHHHHHHHHH----EEEEEEEE------------EEE---HHHHHHHH---HHHHHHHHHHH---HHHHHHHHHHHHHHH---EEEEEEEE---------------------------------E----------------
      HMM                                                                           -------EE-----EEEEEEEE----H----------------HHHHHHHHHHHHH----------------------------------------------------EEEEEE-----------E----------------------------------------------------E-----E-E---EEEE-HHHHHHHHHHH---HHEHH-------EEE----EEEEEEE----EEEEEEHHHH--EEEEEE--HHHHHHH-------HHH-H------HH----------------------------EEEEE--HHH-H---------HHH-HH--H-HH-HHH-------E----EEEEE---EEEEEEE--------HHH------HHHHHHHHHHHHHHHHHHH----EEEEEEEE-H--H------EEEE-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----EEEEEEEEEEEEEE------------------H---HHHHHHH----EEE-------------
      FREQ                                                                          --------------------E-----H----------------HHHHHHHHHHHHH---H------H----------------------------------------------------E----EEE---------------------------------------------------------------------HHHHHHH-----------HE----HEEE------EEE-------HHHHHHHHH----EEE---HHHHHHH-------HHH-H------HHH---------------------------EEEE-------H---------HHH-HH----HH-HHHH------E----EEEE-------EEE------------------HHHHHHHHHHHHHHHHHHH-----EEEEE-------------------HHHHHHHHH--HHHHHHHHHHH------HHHHHHHHHHHHHHHH--EEEEEE--------------------------------------------------
      PSSM                                                                          --------------------E-----------------------HHHHHHHHHHHH---H---------------------------------------------------------------------------------------------------------------------------------------HHHHHHHHH------HHHHHH----HH--------EEEE------HHHHHHHH----EEEEE--HHHHHHH-------HHH-H------HHH-H--------------------------EEEEE---H-H----------HH--------H-HHH-------E----EEEEE---------------------------HHHHHHHHHHHHHHHHHHH----EEEEEEE-------------EE--HHHHHHHHHHH--HHH--HEEEH------------------------EEEEEE--------------------------------------------------
      EMIHUDRAFT_459692_Emiliania_huxleyi_CCMP1516_551554922                        QGGVLRARF-----VAPPFSSLDA-RQ----------------PVWQERKKHWNAL---F--D--SG--------------------A---GRS-------------NTLLSAGRTSGLG----VLEK--ADGS--------------------------------------------V-----V-G---TSLFDPVLAEVMVRWFAPRPRLVAAG----SRLAPAGRPVIVLDPFAGGCVRGIVCAGLGCLYVGVDISAKQVRAN-------QEQ-W------QQL-E---R-S---G-R---LGEVK-HA--PAWLHGDGEQ-I---------ASH-YQ--G-VL-ASRGLPPETLA----DLILACPPYYDREQYNG-----GPLDLSMLPTYKAFLAKYQRIIAAAVSLLKPQHFACFVVGNAR--AK--G-GGLHVLLSDTLQAFAKVDCTPYNVAVLLTALGTAPQRAEQTMAAASKLVPAHQDVLVVVKGRG---------------F---DKSDARACG-IRAKE-G-----ASSQS
      EMIHUDRAFT_464003_Emiliania_huxleyi_CCMP1516_551572077                        QGGVLRARF-----VAPPFSSLDA-RQ----------------PVWQERKKHWNAL---F--D--SG--------------------A---GRS-------------NTLLSAGRTSGLG----VLEK--ADGS--------------------------------------------V-----V-G---TSLFDPVLAEVMVRWFAPRPRLVAAG----SRLAPAGRPVIVLDPFAGGCVRGIVCAGLGCLYVGVDISAKQVRAN-------QEQ-W------QQL-E---R-S---G-R---LGEVK-HA--PAWLHGDGEQ-I---------ASH-YQ--G-VL-ASRGLPPETLA----DLILACPPYYDREQYNG-----GPLDLSMLPTYKAFLAKYQRIIAAAVSLLKPQHFACFVVGNAR--AK--G-GGLHVLLSDTLQAFAKVDCTPYNVAVLLTALGTAPQRAEQTMAAASKLVPAHQDVLVVVKGRG---------------F---DKSDARACG-IRAKE-G-----ASSQS
      EMIHUDRAFT_231186_Emiliania_huxleyi_CCMP1516_551605992                        CGSAERARL-----EALSVRQLKD-EGSRRAVDLSGCIEKCDLGVWQRRKKEWHDL---F--D--SS--------------------L---GRD-------------EDLLG----AGLR----HLLP--ASSK--------------------------------------------L-----N-G---TSVFDPVLMENVVSWWVPRPR------------KVRHRPVVVIDPFAGGVVRGFVAAAKGVQYIGVDVSARQVEAN-------RKQ-V------PQG-Q---W-------------DFR-HP--PIWVVGDSEQ-I---------ATL-VP--K-EL-HRLGLEPL--A----DAIVSCPPYFNLEEYKA-----GPKDLSGLPSYATFLVKYKRIVANAVSLLRPKALSTFVVGNFR--GK--G-GALELFHSDTISAFKEAGCSLYNDAVLTTSTGTAALRATRTMGSGAKLVPTHQNVVVFVKGEG---------------F---TPADARASG-IRPNA-A-----ESQSQ
      EMIHUDRAFT_95251_Emiliania_huxleyi_CCMP1516_551629083                         GQGELGHRF-----GAPPFSVLNS-QR----------------GYWKERRHFWEGT---Y--NVHSE--------------------E---GRG-------------DNLIG---YKGLG----------GEGA--------------------------------------------------R-G---TSVFCPVLSELCCRWWCP-----AGG--------------TVLDPFAGGSVRGVVAGWLGLRYVGLELRREQIARN-------REQ-A------ARA-G---AVA---G-S---SGHRW-RP--PEWVEADARE-L---------ADG-D--------GPAGAPRQ--A----DFVFSCPPYFDLEQYSD-----DPRDLSRAHSYATFLAAYEAIIAAAAHRLKPRRFACFVVGEIR--DG--D-GFMRNFVGDTVSAFQKAGCRLYNSCVMLLPFNTLPVRAGKAMASSAKLGMCHQHVLVFYNGR---------------------NPAAEVKG-LKLAN-M-----TRPLE
      EMIHUDRAFT_115516_Emiliania_huxleyi_CCMP1516_551585076                        GQGELGHRF-----GAPPFSVLNS-QR----------------GYWKERRHFWEGT---Y--NVHSE--------------------E---GRG-------------DNLIG---YKGLG----------GEGA--------------------------------------------------R-G---TSVFCPVLSELCCRWWCP-----AGG--------------TVLDPFAGGSVRGVVAGWLGLRYVGLELRREQIARN-------REQ-A------ARA-G---AVA---G-S---SGHRW-RP--PEWVEADARE-L---------ADG-D--------GPAGAPRQ--A----DFVFSCPPYFDLEQYSD-----DPRDLSRAHSYATFLAAYEAIIAAAAHRLKPRRFACFVVGEIR--DG--D-GFMRNFVGDTVSAFQKAGCRLYNSCVMLLPFNTLPVRAGKAMASSAKLGMCHQHVLVFYNGR---------------------NPAAEVKG-LNLAN-M-----TRPLE
      EMIHUDRAFT_209088_Emiliania_huxleyi_CCMP1516_551572199                        -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRSAAPPST-------SST--------GRM-----------P-S---------PS-------------A---A-----AKQ-LA-----F-RQ--------A----DFICTCPPYFDLEVYDG-----GANDLSMLPSYDAFLVKYSRIIKEAAKLLRPTQLAAFVVGN----------------------------------------------RAGRQMAATSKWVQVHQDTLVVVKGDR---------------F---TTKEAKRAG-IDASG-------SGIAG
      C266_RS12110_Pandoraea_sp_SD6-2_498505877                                     GAGSLAERF-----MVPPFSTLDA-RA----------------TAWQDRKDAWLAL------GIESE--------------------V---GRD-------------APSYG---NASET---QKAAR--GATA---------------Q-HR-------------------------------------TSVFDPVLCELAYRWFCP-----DGG--------------TVIDPFAGGSVRGIVAARLGRQYVGMELRDEQVAAN-------REQ-L------HLI-A-----P---D-D---------PA--PAWQVGDSRH-I---------GKA-LK--G----IA--------A----DFLFSCPPYADLERYSD-----TPEDLSTM-KYPEFLAAYRDVIAGAAALLKPDRFACFVVGDVR--AR--S-GAYRNFVSDTITAFLDAGLTLYNEAILLTALGSAPIRAGKQFAASRKLGKVHQNVLVFVKGDW-------------------KKAVAACGD-VDMTD----VQFPDPDE
      U875_RS13765_Pandoraea_pnomenusa_560179856                                    GDGSLAELF-----MVPPFSTLDA-RA----------------AAWQERKDAWLAL------GIQSE--------------------V---GRD-------------APAYG---DASEA---QKAAR--GATA---------------Q-HR-------------------------------------TSVFDPVLCELAYRWFCP-----EGG--------------IVLDPFAGGSVRGIVAARLGRQYVGMELREEQVAAN-------REQ-L------HLI-A-----P---D-D---------PA--PAWTVGDSRK-I---------GQA-LK--G----VE--------A----DFLFSCPPYADLERYSD-----RPEDLSTM-KYSEFLAAYRDIVAGAAALLKPDRFACFVVGDVR--AR--S-GAYRNFVSDTITAFLDAGLTLYNEAILLTALGSAPIRAGKQFAASRKLCKVHQNVLVFVKGDW-------------------KKAVAACGT-VDMTN----VEFPEPDE
      LV28_24705_Pandoraea_pnomenusa_698968189                                      GDGSLAELF-----MVPPFSTLDA-RA----------------AAWQERKEAWLAL------GIQSE--------------------V---GRD-------------APAYG---DASEA---QKAAR--GATA---------------Q-HR-------------------------------------TSVFDPVLCELAYRWFCP-----EGG--------------IVLDPFAGGSVRGIVAARLGRQYVGMELRDEQVAAN-------REQ-L------HLI-A-----P---D-D---------PA--PAWTVGDSRK-I---------GQA-VK--G----VE--------A----DFLFSCPPYADLERYSD-----RPEDLSTM-KYPEFLAAYRDVVTGAAALLKPDRFACFVVGDVR--AR--S-GAYRNFVSDTIAAFLDAGLTLYNEAILLTALGSAPIRAGKQFASSRKLGKVHQNVLVFVKGDW-------------------KKAVAACGT-VDMTN----VEFPEADE
      UB46_RS43165_Burkholderiaceae_bacterium_16_771680244                          GAGSLAERF-----MVPPFSTLNA-RD----------------TAWQDRKRDWLAL------GIQSE--------------------L---GRD-------------APAYS---SASDQ---QKAER--GVPA---------------Q-HR-------------------------------------TSIFDPVLCELVYRWFCP-----ADG--------------LVLDPFAGGSVRGIVAARLGRQYIGMDLRAEQVQAN-------RAQ-L------DLL-R-----A---D-D---------PA--PAWHVGDSRK-I---------GHH-LA--D----VD--------A----DFLFSCPPYADLERYSD-----NPADLSTM-DYPAFLAAYREVISGAVGQLKPDRFACFVVGDVR--SHRGT-GCYRNFVADTIEAFLDAGTQMYNHAVLLTALGSLPIRVGKQFSASRKLGTAHQHVLVFVKGDW-------------------KRAVAACGE-VAISD----NLFPEAEE
      UB46_RS23670_Burkholderiaceae_bacterium_16_771670996                          GAGSLAEQF-----MVPPFSTLNA-RD----------------ADWQDRKKAWLAL------GIQSE--------------------L---GRD-------------APAFS---SASDK---QKAAS--GATA---------------Q-HR-------------------------------------TSIFDPVLCELAYRWFCP-----ADG--------------LVLDPFAGGSVRGIVAARLGRQYIGMDLRAEQVQAN-------RAQ-L------DLL-R-----A---G-D---------PA--PAWHVGDSRK-I---------GQH-LA--D----VE--------A----DFVFSCPPYADLERYSD-----DPADLSTM-DYPAFLTAYREVISGAVGQLKPDRFACFVVGDVR--SHRGT-GCYRNFVADTIEAFLDAGMQLYNEAILLTALGSLPIRAAKQFSASRKLGKAHQNVLVFVKGDW-------------------KRAVAACGD-VEVSD----DLFPEAAE
      W930_RS0102395_Pandoraea_564969930                                            GDGSLAELF-----MVPPFSTLDA-RA----------------AAWQERKDAWLSL------GIQSE--------------------V---GRD-------------APAFG---GSSEA---QKAAR--GATA---------------Q-HR-------------------------------------TSVFDPVLCELAYRWFCP-----EGG--------------IVLDPFAGGSVRGIVAARLGRQYVGMELREEQVAAN-------REQ-L------HLI-A-----P---D-D---------PA--PAWTVGDSRK-I---------GQA-LK--G----VE--------A----DFLFSCPPYADLERYSD-----RTEDLSTM-KYSEFLAAYRDIVAGAAALLKPDRFAGFVVGDVR--AR--S-GAYRNFVSDTIAAFLDAGLTLYNEAILLTALGSAPIRAGKQFAASRKLGKVHQNVLVFVKGDW-------------------KKAVAACGT-VDMTN----VEFPEPDE
      RR42_RS08225_Cupriavidus_basilensis_759627761                                 GAGSLAEQF-----MVPPFSTLNA-RD----------------AAWQDRKKAWLAL------GIQSE--------------------L---GRD-------------APAFS---SASDQ---QKADR--GAPA---------------Q-HR-------------------------------------TSIFDPVLCELAYRWFCP-----ADG--------------LVLDPFAGGSVRGIVAARLGRQYVGMELRAEQVQAN-------RAQ-L------DLL-R-----A---D-D---------PA--PAWHVGDSRK-I---------GQH-LA--D----VE--------A----DFVFSCPPYANLERYSD-----DPADLSTM-DYPAFLTAYREVISGAVGQLKPDRFACFVVGDVR--EKRGT-GCYRNFVADTIEAFLDAGTELYNHAVLLTALGSLPIRAGKQFSASRKLGTAHQHVLVFVKGDW-------------------KRAVAACGE-VDVSD----DLFPEAEE
      REUT_RS12130_Cupriavidus_pinatubonensis_499617803                             ARRSLAEQF-----MVPPFSTLNA-RD----------------AAWQERKQSLLAL------GIQSE--------------------L---GRD-------------APAYA---SASDH---QKADQ--GAAP---------------Q-HR-------------------------------------TSIFDPVLCELAYRWFCP-----ADG--------------LVLDPFAGGSVRGIVAARLGRPYVGMELRPEQVEAN-------RGQ-L------HLV-Q-----T---H-D---------PA--PVWHVGDSRQ-I---------ARR-LP--D----VQ--------A----DFLFSCPPYADLERYSD-----DLADLSTM-DYPTFLAAYREVIAGAVSLLKPDRFACFVVGDVR--EKRGT-GPYRNFVADTIQAFLDAGMRLYNEAILLTALGSAPIRAGKQFAASRKLGKTHQNVLVFVKGDW-------------------KRAVAACGD-VLLAD----DLFPESDE
      HIBPF_RS01765_Haemophilus_influenzae_501001894                                EKLDLKQAF-----LAPPFSILNS-RE----------------GWWQDRKRKWKAL------GIDSG--------------------S---GRD-------------SGMQR---GMNSM---ARYNS--KED----------------------TSMSESD----------------------------DSIFDPVLCEILYSWFSP-----KGC--------------QILDPFAGGSVRGIVASKLNRNYIGVDLRAEQIEAN-------LKQ-R------NLV-------C---S-N---------DELQPIWICGDSIN-I---------DKL-AN--G----VS--------A----DLVFSCPPYADLEVYSN-----NPNDLSNM-DYDNFKKAYFEIIKKSCDMLKDNRFACFVVGEVR--DK--K-GNYYNFVGDTIQAFLEAGLSYYNEMILVNVCGTMPMRAATPFKGSRKIGKVHQNVLVFVKGNP-------------------KIAAEYCGD-VDVYI-------PNEDN
      HIAG_01531_Haemophilus_influenzae_NT127_260093825                             EKLDLKQAF-----LAPPFSILNS-RE----------------GWWQDRKRKWKAL------GIDSG--------------------S---GRD-------------SGMQR---GMNSM---ARYNS--KED----------------------TSMSESD----------------------------DSIFDPVLCEILYSWFSP-----KGC--------------QILDPFAGGSVRGIVASKLNRNYIGVDLRAEQIEAN-------LKQ-R------NLV-------C---S-N---------DELQPIWICGDSIN-I---------DKL-AN--G----VS--------A----DLVFSCPPYADLEVYSN-----NPNDLSNM-DYDNFKKAYFEIIKKSCDMLKDNRFACFVVGEVR--DK--K-GNYYNFVGDTIQAFLEAGLSYYNEMILVNVCGTMPMRAATPFKGSRKIGKVHQNVLVFVKGNP-------------------KIAAEYCGD-VDVYI-------PNEDN
      OPIT5_RS03245_Opitutaceae_bacterium_TAV5_497199068                            GSGALSERF-----GAPPFSVFDA-RA----------------GWWQDRKNEWLAY------GLASE--------------------E---GRS-------------DSLIF---ESETA---KNFGRMHGGNA---------------P-KG-------------------------------------TSIFDPVLCELLYTWFCA-----AGG--------------VVLDPFAGGSVRGVVASELGRHYVGVDLRSEQVEAN-------RAQ-A------EILCK-----P---P-H---------PA--PVWHVGDSRN-I---------AHL-AS--D----VR--------A----DFLFSCPPYADLEVYSD-----DPADLSTM-EYSQFITCYREIISASAALLRNDRFACFVVGDIR--DP-AT-GLYRNFVADTIAAFHAAGLALYNEAVLVTMAGSLPLRINKQFTVARKLGKTHQNVLVFVKGDP-------------------RRAAASCGD-IQPWE----G---RADE
      BAQ92410.1_uncultured_Mediterranean_phage_uvMED_775458612                     ASGILKEKF-----GVPPFSILNA-RE----------------GWWQNRKKLWLEL------GIKSE--------------------V---GRD-------------EELTY---SISKG--------------------------------DVGKRI----------------------MQA-G-GS--TSVFDPVISELIYRWFSN-----PNA--------------VILDPFAGGSVRGIVAATLGRKYIGVDLRKEQVEAN-------IQQ-G------EDI-L-----E---S-T---------CE-KPIWITGNSQN-I---------DDL-VV--E-----K--------A----DLIFSCPPYVDLEVYSK-----DPNDLSNM-DFEAFKENYAEIIKKSCDLLNDNSFAAFVVGEVR--KK--D-GTYYNFVGETIEAFIKAGLSFYNEAILVTMVGSLPLRCGNGFTKSRKLGKTHQNILVFVKGDP-------------------VLATEKCGE-CEFADPAAFEE-ENVEI
      TRICHSKD4_2718_Roseibium_sp_TrichSKD4_307772411                               ALR---ESF-----IYPPFSVLDA-RS----------------RWWQDRKRQWLDL------GIESG--------------------K---GRK-------------EDLLK---GYAVS-MARWHQI--QGQD---------------H------------------KP----------ADW-M-E---KSVFDPVLTELLVAWFSP-----KGG--------------RVLDPFAGGSVRGIVSALLGRTYLGVDISEEQIIEN-------RRQAE------ALS-----------V-Q---------KE--ARWKQGDAVR-L-----------Y-KS--G----VR--------GWF--DMILTCPPYGDLEKYSD-----DPADISNM-PYELFLGGFRQAVKDAAARLSPDRFAVFVVGDFR--DK--Q-GINRGFVADTIQACKNAELQFYNDGVLVTQAGSLPMRVRGMFEASRKLGKTHQNVLVFVKGDP-------------------KRATEHVGH-VT--P-F-----TFPDS
      TRICHSKD4_RS11975_Roseibium_sp_TrichSKD4_750334917                            -LR---ESF-----IYPPFSVLDA-RS----------------RWWQDRKRQWLDL------GIESG--------------------K---GRK-------------EDLLK---GYAVS-MARWHQI--QGQD---------------H------------------KP----------ADW-M-E---KSVFDPVLTELLVAWFSP-----KGG--------------RVLDPFAGGSVRGIVSALLGRTYLGVDISEEQIIEN-------RRQAE------ALS-----------V-Q---------KE--ARWKQGDAVR-L-----------Y-KS--G----VR--------GWF--DMILTCPPYGDLEKYSD-----DPADISNM-PYELFLGGFRQAVKDAAARLSPDRFAVFVVGDFR--DK--Q-GINRGFVADTIQACKNAELQFYNDGVLVTQAGSLPMRVRGMFEASRKLGKTHQNVLVFVKGDP-------------------KRATEHVGH-VT--P-F-----TFPDS
      C511_RS16595_Bacteroides_graminisolvens_736778761                             KPNNVAERF-----IIPPFSILDA-KQ----------------GRWQERKRAWLSL------GIKSE--------------------E---GRD-------------KEITY---SRSAQ-NPAIYEV--RNRM---------------R-EKLGYDPSWDEITEYC-----KKHDI---PML-D-G---TSVFDPVLCELAYRWFNI-----PAG--------------VILDPFAGGSVRGIVAAKLGMRYRGVDLRPEQIKAN-------YEN-A------AEM-Q-----PPFTE-N---------DC--LVWKCGDSRD-I---------DQH-YA--G----LK--------A----DMIFSCPPYADLEVYSD-----DPRDLSNM-EYEEFLNAYRTIIQKSCSLLKENRFAVFVIGEVR--GK--N-GAYYNFVGDTINAFLEAGLHYYNEMILATQIGSLAMRVTNQFNHSRKIGKTHQNVLVFFKGDL-------------------K--------------------------
      JCM15093_3233_Bacteroides_graminisolvens_DSM_19988_=_JCM_15093_665395990      KPNNVAERF-----IIPPFSILDA-KQ----------------GRWQERKRAWLSL------GIKSE--------------------E---GRD-------------KEITY---SRSAQ-NPAIYEV--RNRM---------------R-EKLGYDPSWDEITEYC-----KKHDI---PML-D-G---TSVFDPVLCELAYRWFNI-----PAG--------------VILDPFAGGSVRGIVAAKLGMRYRGVDLRPEQIKAN-------YEN-A------AEM-Q-----PPFTE-N---------DC--LVWKCGDSRD-I---------DQH-YA--G----LK--------A----DMIFSCPPYADLEVYSD-----DPRDLSNM-EYEEFLNAYRTIIQKSCSLLKENRFAVFVIGEVR--GK--N-GAYYNFVGDTINAFLEAGLHYYNEMILATQIGSLAMRVTNQFNHSRKIGKTHQNVLVFFKGDL-------------------KQIPSLYPE-LDFRE------------
      AZL_RS04495_Azospirillum_lipoferum_502738489                                  SGKSLVERF-----GVPPFTVLDA-KQ----------------GYWRDRKAAWTAL---G--V-HAE--------------------E---GR--------------EHLPD----TNVA----TDWM--RRGS--------------------------------------------A-----V-G---GSAFDPVLAELVFRWFTP-----GAG---A----------AILDPFGGEATKGVIAATLGYAYTGVELRGEQVQAN-------RAQ-W------LKV-RDRLP-P---E-R---LAQVR-HE--PVWIEGDSAK-I---------DSL--------------LPAGRLY----DLVFTSPPYYDLEIYSK-----GEKDGSAFESYDRFISWYRDIFRQACGRLKPNRFAVVKIGDVR--DE--R-GFYRNFLGDNIACFLDAGLGFYNEAVLATPIGSLALRAGRQFTASRKLGRGHQNVLCFFGGDP---------------G---RIKAEFPQE-IEVGD-V-----APDDP
      EM79_05715_Vibrio_parahaemolyticus_686232662                                  KVGSLADRF-----IISPFSVFNS-RK----------------GWWQQRKRAWLDL------GIRSE--------------------V---GRN-------------EKLSI---TALST---NQYSE--KNEI---------------E-QKLGRTLSTEEYLSEY-------------CTL-T-KLHTTSIFDPVLTELCYEWFCP-----QKG--------------HIVDPFAGGSVRGVVASKTARKYTGNDLREEQVIAN-------REQ-A------NAI-------C---S-S---------PA--PKWIVGDSVE-L---------ESL-IS--E-----K--------A----DMIFSCPPYADLEKYSD-----NPKDLSNM-GYDDFLENYRKIIRCCFSILKDDRFAVFVVGEVR--DK--N-GNYRNFVSDTISAFLGSGFSYYNEAILVNVVGSLPIRVGKQFSQSRKLGKTHQNVLVFVKGDA-------------------KKAAKACGD-IEVHL----EE-PEEES
      AMBLS11_12430_Alteromonas_macleodii_str_'Black_Sea_11'_407249874              ARETLAERF-----IVPPFSVLDA-KQ----------------GYWNERKRAWQSL------GIQSE--------------------L---GRD-------------DALTY---AVSSQ-PPHVYEF--KNAV---------------E-KDIGRKLTWKEFAEKY-----PEE-----ITL-T-G---TSIFDPVLTEVLYSWFCP-----QGG--------------KILDPFAGGSVRGVVAGFMGYNYTGVELRPEQVKAN-------QRQ-G------RTI-L-----G---E-H---------AT--AQWINADSRT-I---------PEV-ID--Q----DE--------EF---DLVFSCPPYADLEVYSD-----DPNDLSTL-GYAEFVEAYTDIIKKACDKLKDNSFAVFVVGEVR--NK--K-GGYYGFVQDTIAAFEAAGLDYYNEVILLTNIGSNAIRAAGQFTKSRKLAKGHQNALVFAKGSPVPQAIAGLSSAIADHFNDHRHVFKAYEN-VMVFC--------KGDP
      EM98_RS02575_Vibrio_parahaemolyticus_646357048                                KVGSLADRF-----IISPFSVFNS-RK----------------GWWQQRKRAWLDL------GIRSE--------------------V---GRN-------------EKLSI---TALST---NQYSE--KNEI---------------E-QKLGRTLSTEEYLSEY-------------CTL-T-KLHTTSIFDPVLTELCYEWFCP-----QKG--------------HIVDPFAGGSVRGVVASKTARKYTGNDLREEQVIAN-------REQ-A------NAI-------C---S-S---------PA--PKWIVGDSVE-L---------ESL-IS--E-----K--------A----DMIFSCPPYADLEKYSD-----NPKDLSNM-GYDDFLENYRKIIRCCFSILKDDRFAVFVVGEVR--DK--N-GNYRNFVSDTISAFLGSGFSYYNEAILVNVVGSLPIRVGKQFSQSRKLGKTHQNVLVFVKGDA-------------------KKAAKACGD-IEVHL----EE-PEEES
      TY03_RS09575_Bacteroidaceae_bacterium_MS4_755016279                           RTGVLKERF-----IIPPFSVLDA-KS----------------GNWQERKRAWLDL------GIKSD--------------------D---GRD-------------GNITF---NRSAQ-PPRVYEA--KNML---------------R-AKTGIEPSWDEVMDYC-----QKNNI---PVM-S-G---TSIFDPVLCELSYRWFNL-----NGG--------------SVLDPFAGGSVRGIVASKLGMPYFGVDLRNEQIESN-------YKN-A------LEV-I-----GENTM-P---------SY--PTWVCGDSCH-I---------DTL-YS--G----MQ--------V----DFIFSCPPYADLEVYSD-----DPKDLSTM-KYDDFKQAYFEIIKKSCSMLKDDRFAVFVVGEVR--SK--Q-GVYRNFVSDTISAFIEAGMVYYNEMILVNQIGSLAMRASNQFNHSRKIGKHHQNVLVFYKGDT-------------------NAIGSNYPK-LDLSD------------
      EM93_14610_Vibrio_parahaemolyticus_655611541                                  KVGSLADRF-----IISPFSVFNS-RK----------------GWWQQRKRAWLDL------GIRSE--------------------V---GRN-------------EKLSI---TALST---NQYSE--KNEI---------------E-QKLGRTLSTEEYLSEY-------------CTL-T-KLHTTSIFDPVLTELCYEWFCP-----QKG--------------HIVDPFAGGSVRGVVASKTARKYTGNDLREEQVIAN-------REQ-A------NAI-------C---S-S---------PA--PKWIVGDSVE-L---------ESL-IS--E-----K--------A----DMIFSCPPYADLEKYSD-----NPKDLSNM-GYDDFLENYRKIIRCCFSILKDDRFAVFVVGEVR--DK--N-GNYRNFVSDTISAFLGSGFSYYNEAILVNVVGSLPIRVGKQFSQSRKLGKTHQNVLVFVKGDA-------------------KKAAKACGD-IEVHL----EE-PEEES
      M201_gp11_Halovirus_HCTV-2_509139481                                          TAGSLEQDF-----GVPPFSVLNT-TK----------------AYWTERREQWKEM------GLDNL--------------------RETPGRE-------------DAMVE---GEGGV-YTGDWGG--GEDS---------------G-GV---------------GGG---------MDG-T-G---TSVFDPVLAELLYRWFAP-----SEG--------------TVLDPFAGGPARAVVSAVTGRAYHGIDLNEAQVQHNRESWDGVADS-D------IDV-------------D---------NA--PQWAHGDSAE-M---------GEH-IE--AQEWPDE--------Y----DFLFSCPPYHDLETYTD-----QDEDLSNM-DYPEFLETYRTIIAQGVERLKEDRFAAFVVSEVR--DD--E-GFYRGFVSDTVQAFEDAGMHLYNDAVLVNTPGTLPVRVRNYFERGRKLGRMHQNVLVFFKGDP---DPSNIR----------EQVGQVSVP-SVVMG-DG----DSEDD
      QI63_RS07900_Treponema_sp_OMZ_838_763137539                                   TKRTLREQF-----LVPPFSVLDT-RQ----------------GYWQDRKKQWKSI------GIKSE--------------------E---GRD-------------NKMLS---HLKKN-AEKVNGT--AN-------------------------------------------------TL-T-E---LSIFDPVLCETMYTWFCT-----PDG--------------KVLDPFAGGSVRGIVASYKQMDYTGFDIRPEQIAAN-------EKQ-L------YIC-N-----G---N-K---------HA--PRWICCDSLK-M---------DTQ-LN--E----FD--------RY---DFLFSCPPYADLETYSD-----IEGDISNM-DYPRFMQTYREIIRKACTYLNDNRFVVFVVGEVR--NK--KTGAYYNFVPDTIKAFEDAGLTYYNEIILLNVAGNKAYTAGHDAKQSRKIAKVHQNILAFVKGDA-------------------DKAAKIHEN-VLVFL--------KGNS
      SINME_RS11520_Sinorhizobium_meliloti_754504471                                ----------------------------------------------------------------------------------------------------------------------------IAEG--DIEG---------------G------------------MS----------AGQ-T-G---TSIFDPVLCELAYRWFCP-----PGG--------------LILDPFSGGSVRGIVASKLGRSYLGIDLSARQIAAN-------QEQAA------KIC-----------G-A---------PE--PRWINADSRD-I---------DSL-AG--G----LA--------A----DFIFSCPPYADLEIYSD-----DPRDLSNM-TYPEFVAAYRDIIAKSCAILKDDRFACFVVGEVR--DK--R-GAYYGFVPDTIAAFQAAGLAFHNEAILITAAGSLPIRAAKQFEATRKLGKTHQNVLVFCKGDA-------------------KKATAAIGE-VEFGA-I-----DGADA
      BAQ89246.1_uncultured_Mediterranean_phage_uvMED_775455096                     KKVLLREKY-----LEPPFSICDT-KQ----------------GTWQRRRQNWKNL------GIESE--------------------V---GRE-------------GTDGA---HFAGR-HRQAERS--GKKP---------------A-ESTQRILDVGE----------------------------VSIFDPALCEILYTWFVD-----KGG--------------SILDPFSGGSVRGIVAHYLGWKYVGIDVRQSQITSN-------KIQ-G------EKI-L-----D---K-D---------NQ--PKWIFGDSNK-ILDTMIDGKEANQ-LE--E--F----------------DFVFSCPPYGNLEIYSD-----QPDDISTL-DYPAFLEVYESIIAKSCKLLKQGALACFVVGEFR--DK--K-GNYVGFVPDTIRAFTKCGMKYYNEAILLNAIGSASVRANTNMK-NKKLVKVHQNILVFKKA------------------------------------------------
      ERS075618_03274_Mycobacterium_abscessus_806900839                             --STLASRF-----GVPPFTVLDA-KQ----------------GAWKQRVKAWKAL------GIASA--------------------S---GRSVDTEGGSDARAATGIMAF---KAPQS-IYANWYE--IKNA---------------AEKKIGRTLTTAEIMERY-GD--ELKFY---AEG-G-A---ISVFDPVLAELLVSWFSP-----VEA--------------DVIDPWSGGSVRGVVSAALGRRYTGIDLSGDQLAVN-------GEQ-W------AVV-----------E-PRLPQMATTVTA--PQWIQGDSRD-V---L-----KTL-DD--E-SF----------------DMMIGCPPYYDLEQYSK-----DPADLSAM-STDEFDAAFIETIAEVARVLRPDSFAALVVGSAR--DK--R-GDLRDMRSLVSRAADATGMKLANDAVLLTAIGSNAARAARPFSKGRALGRVHQDILVLVKGDR-------------------IRAARRCGEEVTLIA-E-----PELDE
      FSDG_RS10625_Fusobacterium_nucleatum_495977294                                QKGNLIKKF-----IVPPFTIIDS-TK----------------NPWLSIKNKWKE-------SINSL--------------------K---GRS-------------KELI------GEM-Y----------------------------------------------------------------G---TSLFDGAISEVFYKWFLP-----NKDIETK----------KILDPFCGGVIRGAVAELLGYKYTGFDIRKEQIDVN-------IEQ-S------KEL-------------G---------IS--PKFILDDSEN-V---------DKY-ID--D----NT--------Q----DLIFSCPPYLDLEVYSN-----NENDLSNM-EYEDFINKYNRIIKKHCKKLKDNRFAIFVIGDVR--DK--N-GVLIDFVGDTIKAFKNAGLNYYNQVIYKEPLGSVTIRAGRAFNATRKISKVHQNIIIFYKGNV---K--DIK----------KYFNEFYSE-KDVDD-FN----SITDD
      HMPREF5175_00726_Lactobacillus_gasseri_SV-16A-US_675161105                    TPDALTNDF-----MVPPFSVLDT-RQ----------------GNWQDRKREWLDL------GIKSE------L-----------------GRK-------------GDLTF---AKSIN----------IEGS--------------------------------------------------S-G---TSVFDPVLTEIIYKWFIP-----YAN--A-----------NIYDPFAGGSVRGIVAATLGHHYTGIDLRKEQIDAN-------YQN-A------KEI-------------G-TNI-----DL--INWYVDDSQN-A---------DKY-VK--D----NT--------Q----DLIIACPPYLDLEVYSD-----DPKDLSTM-SDDEFDNIYQKILQNAVKKLKNNRFAVVIVSDVRGKGK--N-SGYRDLTGMTKRAFIDSGCCFYNDIILLNAIGSAAVRARRYMK-SRKVARVHQNVLVFYKGDI-------------------TKIKDEFDE-IHGLN-------DALEN
      EL88_RS12025_Bacteroides_dorei_740824887                                      K--RLKDDF-----ILPPFSVLNT-RT----------------AEWQERRRAWLEI------GIKSD--------------------E---GRS-------------EDLTF---AKTAQ-PPIFYDT--KNAL---------------R-ETLGREPQTEEVIAEM-----ERLGL---KTM-T-T---TSIFDPVLTELSYRWFNI-----EGG--------------RILDPFAGGSVRGIVAAKLNMPYVGNDLSEAQIRAN-------RTN-A------EEV-L-----G-VSC-P---------FF--PQWTVGDSSQ-L---------EDV-LSTNG----IS--------G--DFDMIFSCPPYADLEVYSN-----DPRDISNM-DYAQFIEAYKRIIKQSCTRLKNNRFAVFVVGDIR--DK--K-GIYRNFVSHTIEAFTGCGLHYYNSLILVNQITSLAIRVRRQFNGTRKVGKVHQNVLVFCKGSV-------------------EETIDSFEE-LQVKK------------
      H590_RS0105875_Propionibacterium_jensenii_655293784                           PTPGLAGRF-----LEPPFSVLDR-RQ----------------GRWQDRRRQWDAI------GLHSE--------------------T---GRS-------------PELVH---DYSAT-M----------RV---------------K------------------SR----------SSD-A-G---TSIFDPALAELIIAWWSR-----EGD--------------QILDPFAGGSVRGVVSSCMGRRYTGVDLSADQVKAN-------RRQ-A------DLG-----------SPD---------MP--PQWICGDSAD-I---------DALLDD--A----VE--------A----DLIMTCPPYGDLERYSD-----DPADLSTM-GWEDFQTAYRTILSTTVERLRQDRFVAVVVSDIK--DR--D-GAYRGLPALTSDALTAAGCRIVTDAVILDPLGSKQMMCERPFKANRTLTRVHQNLIVGVKGDR-------------------HVATARLDA-AQHDP-I-----AS---
      U713_RS23890_Rhodobacter_capsulatus_565833115                                 PARTFGQDL-----MRGEHVVGAG--------------------QPAPQNGGVLMP------SHTSGD--------------PSFYAK---KRA-------------KEAEI---GRELT-TEEFLAD--HYAA---------------S------------------DA----------PTA-S-G---TSIFDPVLCEIAYRWFCP-----PGG--------------TVLDPFAGGSVRGIVAARLGRPYVGIELRAKQVAAN-------QAQ-A------GLA-----------G-A---------PA--PRWIAGDSRD-L---------AGL-AA--G----ID--------A----DLVFSCPPYWNLERYSD-----DPSDLSTM-PLADFLKAQGEIIAQAVARLRPNRFAVWVIGDVR--DA--D-GFFVNLPGLTVEAFEAAGARFYNDAILVTAVGSLPIRVGRQFTAARKLGRTHQNVLVFCKGDP-------------------RKATEACGP-VEFGE-I-----EGPET
      T245_RS0104245_Vibrio_parahaemolyticus_686217757                              ---------------------------------------------WQQRKRAWLDL------GIRSE--------------------V---GRN-------------EKLSI---TALST---NQYSE--KNEI---------------E-QKLGRTLSTEEYLSEY-------------CTL-T-KLHTTSIFDPVLTELCYEWFCP-----QKG--------------HIVDPFAGGSVRGVVASKTARKYTGNDLREEQVIAN-------REQ-A------NAI-------C---S-S---------PA--PKWIVGDSVE-L---------ESL-IS--E-----K--------A----DMIFSCPPYADLEKYSD-----NPKDLSNM-GYDDFLENYRKIIRCCFSILKDDRFAVFVVGEVR--DK--N-GNYRNFVSDTISAFLGSGFSYYNEAILVNVVGSLPIRVGKQFSQSRKLGKTHQNVLVFVKGDA-------------------KKAAKACGD-IEVHL----EE-PEEES
      HGGM_RS12685_Alistipes_shahii_505358723                                       R--RLKDDF-----VMPPFSVLNT-RT----------------AEWQERRRAWLEI------GIKSE--------------------E---GRD-------------EDLTF---AKSAQ-PPAFYDT--KNAL---------------R-ETLGREPSTDELLAEM-----EKQGI---QAM-A-T---TSIFDPVLTELSYRWFNI-----EGG--------------RILDPFAGGSVRGIVAAKLNMPYVGNDLREKQVVAN-------IEN-A------KEV-L-----GNMPA-D---------IA--PRWTVGDSTQ-L---------EDV-LQKNG----VT--------G--DFDMVFSCPPYADLEVYSN-----DPRDISNM-DYPQFLEAYKAAIKQACARLKNNRFAVFVVGDIR--DK--K-GIYRNFIGHTIEAFTECGLSYYNHLILVNQVTSLAIRVRKQMNTGRKIGKLHQNVLVFCKGSV-------------------EETVDQFEE-VQVTK------------
      CG50_RS14910_Paenirhodobacter_enshiensis_738746241                            PARTFGQDL-----MRGEHVVGAGSTS----------------QQAAPQNGGVLMP------SHTSGD--------------PSFYAK---KRA-------------KEAEL---GRELT-TEEFLAD--HYAA---------------S------------------DA----------PTA-S-G---TSIFDPVLCEIAYRWFCP-----PGG--------------TVLDPFAGGSVRGIVAARLGRPYVGIELRAEQVAAN-------QAQ-A------DLA-----------G-A---------PA--PRWIAGDSRD-L---------AGL-AA--G----IE--------A----DLVFSCPPYWNLERYSD-----DPSDLSSM-PLADFMKAQGEIIAQAVARLRPNRFAVWVIGDVR--DA--D-GFFVNLPGLTVEAFEAAGARFYNDAILVTAVGSLPIRVGRQFTAARKLGRTHQNVLVFCKGDP-------------------RKATEACGP-VEFGE-I-----EGPDT
      LIN_RS06450_Listeria_innocua_499299925                                        LDSNLFESF-----LFPPFSYLDS-KT----------------KRWRDRKDQWKNL------GIRSE------L-----------------GRE-------------GNLTF---ASSLR----------SASL--------------------------------------------------T-G---TSIFDPVLCELAYRWFTP-----GES--A-----------KIYDPFAGGSVRGIVAKVLGHEYTGIDLRKEQVEAN-------RIN-A------KEI-------------G---L-----DG--INWITDNSLN-A---------DKH-IE--D----NS--------M----DLLFTCPPYFDLEVYSD-----NKEDISNM-EYEEFIKVYSEILAKGANKLKDNRFAIVVISDVR--DK--A-GFYRDLTGLTKSVFEKNGIYFYNDLILLNSLGSGALRARRNMR-NRKLVRIHQNVLVFYKGNP-------------------DEIQEHFPI-LEVLE-------DNLEE
      IG92_RS0128815_Streptomyces_sp_NRRL_S-1868_740017278                          EATQTEALP-----GIHPFSILRS-DM----------------GPWRERRRAWHAL------GMTSR--------------------A---GRE-------------HVRTW---DTSSP-FG---------RE---------------K------------------LA----------AIS-D-G---LSTFDPVLAECSYRWYAP-----QAG--------------HVLDPFAGGSTRGLVAGHLGYGYTGIDLSPAQVDAN-------EEQ-AIAWTEKDLL-----------A-----------RQ--PIWTLGDSADVL---------PGL-DD--G----AY-------------DYVFTCPPYHCLERYSD-----HPADLSAM-RWRTFSQTYSRIIAESVRCLADHSFATWVVGEIR--NS--T-GTLRSLIPLTIAAHEAAGARFYNDAILMNALGTVPMRIGQQWRASRKMGRHHQYVLTFVKGDP-------------------KKATRRVDD-RPAR-------------
      C892_RS0103080_Nocardiopsis_baichengensis_516125430                           QQNTLAPKPRIDVGGILPFSILRA-DR----------------GNWQERKRAWQDL------GFDSQ--------------------A---GRE-------------GVKTW---DTSSP-FG---------KQ---------------Q------------------LM----------KIS-N-G---LSTFAPVLAELCYEWYAP-----PEA--------------RILDPFAGGSVRGLVAGNGGYHYTGIDLSERQVEAN-------REQ-AADWAERDLI-----------N-----------GT--TRWITGDAVQVL---------PTL-GE--E----SF-------------DYVLTCPPYHDLEVYSD-----HPDDLSAM-DWDEFVDAYIEAIIYTVRNMKPDTFATWVVGETR--DR--K-GAIRGIVPLTIEAHARAGAKLYNDAILQNVLGTVPLRVGNQWRASRKMGRHHQYVLTFVKGDA-------------------KRATRNLAG-AAA--------------
      AMBLS11_RS12625_Alteromonas_macleodii_764990690                               ---------------------------------------------------------------MQSE--------------------L---GRD-------------DALTY---AVSSQ-PPHVYEF--KNAV---------------E-KDIGRKLTWKEFAEKY-----PEE-----ITL-T-G---TSIFDPVLTEVLYSWFCP-----QGG--------------KILDPFAGGSVRGVVAGFMGYNYTGVELRPEQVKAN-------QRQ-G------RTI-L-----G---E-H---------AT--AQWINADSRT-I---------PEV-ID--Q----DE--------EF---DLVFSCPPYADLEVYSD-----DPNDLSTL-GYAEFVEAYTDIIKKACDKLKDNSFAVFVVGEVR--NK--K-GGYYGFVQDTIAAFEAAGLDYYNEVILLTNIGSNAIRAAGQFTKSRKLAKGHQNALVFAKGSP----------------------------------------------
      HMPREF9454_RS03645_Megamonas_funiformis_495813435                             EKVSLSEKF-----LFTPTSVLNT-RC----------------AQWQERKRAWFKY------GIKSD--------------------L---SRE-------------NIKTT---GSAAGSVPRFYEY--KEKC---------------E-KEIGRKLSVAEFTDNYLHRYMKEDSLLKFTNT-G-GI--LSVFDPVLCELMYYWFSF-----DKA--------------KILDPFAGGSVRGIIASKLNRQYTGVDLRKEQIEAN-------INQ-G------DEL-L-----S---T-D---------DI-KPKWICGNSLN-I---------SSL-AK--D-----E--------Y----DFIFSCPPYYDLEIYSD-----DKEDLSNQ-TYEDFLSMYRKIIFDSVNMLKDNRFACFVVGDIR-NRK--T-GMYRNFVSETIAAFHNAGMELYNEIILLTTLGSLPIRMGRGFSISRKVGKTHQNVLVFYKGDQ-------------------KKIRDLYGD-IDILE---ISD-EDLDI
      HS95_01700_Listeria_monocytogenes_685911099                                   LDSNLFESF-----LFPPFSYLDS-KT----------------KRWRDRKDQWKNL------GIRSE------L-----------------GRE-------------GNLTF---ASSLR----------SASL--------------------------------------------------T-G---TSIFDPVLCELAYRWFTP-----KES--A-----------KIYDPFAGGSVRGIVAKVLGHEYTGIDLRKEQVEAN-------HIN-A------KEI-------------G---L-----DG--INWITDNSLN-A---------DKH-IE--D----NS--------M----DLLFTCPPYFDLEVYSD-----NKEDISNM-EYEEFIKVYSEILDKAANKLKDNRFAIVVISDVR--DK--A-GFYRDLTGLTKSVFEKNGIYFYNDLILLNSLGSGALRARRNMR-NRKLVRIHQNVLVFYKGNP-------------------DEIQEHFPI-LEVLE-------DNLEV
      I794_RS04725_Listeria_monocytogenes_746332729                                 LDSNLFESF-----LFPPFSYLDS-KT----------------KRWRDRKDQWKNL------GIRSE------L-----------------GRE-------------GNLTF---ASSLR----------SASL--------------------------------------------------T-G---TSIFDPVLCELAYRWFTP-----KES--A-----------KIYDPFAGGSVRGIVAKVLGHEYTGIDLREEQVEAN-------HIN-A------KEI-------------G---L-----DG--INWITDNSLN-A---------DKH-IE--D----NS--------M----DLLFTCPPYFDLEVYSD-----NKEDISNM-EYEEFIKVYSEILDKGANKLKDNRFAIVVISDVR--DK--A-GFYRDLTGLTKSIFEKNGIYFYNDLILLNSLGSGALRARRNMR-NRKLVRIHQNVLVFYKGNP-------------------DEIQEHFPI-LEVLE-------DNLEE
      HT51_04190_Listeria_monocytogenes_489827744                                   LDSNLFESF-----LFPPFSYLDS-KT----------------RRWRDRKDQWKNL------GIRSE------L-----------------GRE-------------GNLTF---ASSLR----------SASL--------------------------------------------------T-G---TSIFDPVLCELAYRWFTP-----KES--A-----------KIYDPFAGGSVRGIVAKVLGHEYTGIDLRKEQVEAN-------HIN-A------KEI-------------G---L-----DG--INWITDNSLN-A---------DKH-IE--D----NS--------M----DLLFTCPPYFDLEVYSD-----NKEDISNM-EYEEFIKVYSEILDKAANKLKDNRFAIVVISDVR--DK--A-GFYRDLTGLTKSVFEKNGIYFYNDLILLNSLGSGALRARRNMR-NRKLVRIHQNVLVFYKGNP-------------------DEIQEHFPI-LEVLE-------DNLEV
      LIN_RS08830_Listeria_innocua_499300722                                        LDSNLFESF-----LFPPFSYLDS-KT----------------RRWRDRKDQWKNL------GIRSE------L-----------------GRE-------------GNLTF---ASSLR----------SASL--------------------------------------------------T-G---TSIFDPVLCELAYRWFTP-----GES--A-----------KIYDPFAGGSVRGIVAKVLGHEYTGIDLRKEQVEAN-------RIN-A------KEI-------------G---L-----DG--INWITDNSLN-A---------DKH-IE--D----NS--------M----DLLFTCPPYFDLEVYSD-----NKEDISNM-EYDEFIKVYSEILDKGANKLKDNRFAIVVISDVR--DK--A-GFYRDLTGLTKSIFEKNGIYFYNDLILLNSLGSGALRARRNMR-NRKLVRIHQNVLVFYKGNP-------------------DEIQEHFPI-LEVLE-------DNLEE
      U717_RS25485_Rhodobacter_capsulatus_502831711                                 PARTFGQDL-----MRGEHVVGAGPAS----------------QSATPQNGGVLMP------SHTSGD--------------PSFYVK---KRA-------------KEAEL---GREMT-TEEFLAD--HYAA---------------S------------------DA----------PTA-S-G---TSIFDPVLCEIAYRWFCP-----PGG--------------TVLDPFAGGSVRGIVAARLGRPYVGIELRAEQVAAN-------QAQ-A------DLA-----------G-D---------PA--PHWIAGDSRD-L---------ARL-TA--G----IE--------A----DLVFSSPPYWNLERYSD-----DPSDLSTM-PLADFLKAQAEIIAQAVARLRPNRFAVWVIGDVR--DT--D-GFFVNLPGLTVEAFEAAGARFYNDAILVTAVGSLPIRVGRQFTVARKLGRTHQNVLVFRKGDP-------------------RKATEACGP-VEFGE-I-----EGPDT
      F823_RS0101690_Peptostreptococcus_anaerobius_518425096                        VDSNLADTF-----LFSPFSYIDT-KT----------------DRWQNRKKAWKEL------GIKSE------V-----------------GRE-------------DGLIF---SKALI----------NESL--------------------------------------------------A-G---TSIFDPVLCELGYRWFSP-----NKN--C-----------NIIDPFAGGSVRGIVANVLGHSYTGIDLRQEQIDAN-------FNN-A------NEM-------------G---L-----SN--IKWICDDSQN-V---------LEH-VN--E----ES--------Q----DLMFTCPPYFDLEVYSD-----NDKDISNM-DYNSFSEIYSNILRRTARTLKDNRFGVVVISDVR--DK--K-GFYRDLTGLTKQALAEEGMYFYNDLILLNSIGTAAIRARRYMA-NRKVARLHQNVLVFYKGDP-------------------KKIKDEFGE-LETLE-------EEEIF
      D350_RS16800_Enterococcus_faecalis_514907821                                  VNSNLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKKQWKEL------GIKSE------L-----------------GRE-------------DNLVF---SANLQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEDFAEVYSEILKRSAKKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSKEGLYFYNDMILLNTAGSAALRARQSMN-NRKVVRIHQNVLVFYKGNP-------------------QKISKHFEA-LETLD-------DELET
      LiPB054_gp77_Listeria_phage_B054_157325361                                    LDSNLFESF-----LFPPFSYLDS-KT----------------RRWRDRKDQWKNL------GIRSE------L-----------------GRE-------------GNLTF---ASSLR----------SASL--------------------------------------------------T-G---TSIFDPVLCELAYRWFTP-----KES--A-----------KIYDPFAGGSVRGIVAKVLGHEYTGIDLRKEQVEAN-------HIN-A------KEI-------------G---L-----DG--INWITDNSLN-A---------DKH-IK--D----NS--------M----DLLFTCPPYFDLEVYSD-----NKEDISNM-EYDEFIKVYSEILDKGANKLKDNRFAIVVISDVR--DK--A-GFYRDLTGLTKSIFEKNGIYFYNDLILLNSLGSGALRARRNMR-NRKLVRIHQNVLVFYKGNP-------------------DEIQEHFPI-LEVLE-------DNLEE
      HMPREF9998_01728_Peptostreptococcus_anaerobius_VPI_4330_=_DSM_2949_429146233  VDSNLADTF-----LFSPFSYIDT-KT----------------DRWQNRKKAWKEL------GIKSE------V-----------------GRE-------------DGLIF---SKALI----------NESL--------------------------------------------------A-G---TSIFDPVLCELGYRWFSP-----NKN--C-----------NIIDPFAGGSVRGIVANVLGHSYTGIDLRQEQIDAN-------FNN-A------NEM-------------G---L-----SN--IKWICDDSQN-V---------LEH-VN--E----ES--------Q----DLMFTCPPYFDLEVYSD-----NDKDISNM-DYNSFSEIYSNILRRTARTLKDNRFGVVVISDVR--DK--K-GFYRDLTGLTKQALAEEGMYFYNDLILLNSIGTAAIRARRYMA-NRKVARLHQNVLVFYKGDP-------------------KKIKDEFGE-LETLE-------EEEIF
      D351_RS17740_Enterococcus_506557932                                           VNSNLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKKQWKEL------GIKSE------L-----------------GRE-------------DNLVF---SANLQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTK--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLD-I---------DEH-IE--N----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFEEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYRDLTGLTKQAFSEEGLFFYNDMILLNTAGSAALRARQSMN-NRKVVRIHQNVLVFYKGNP-------------------QKISKHFEA-LETLD-------DELET
      ATH1_RS0100585_Anaerophaga_thermohalophila_498101954                          SAGVLRERF-----IVPPFSILDS-RR----------------GEWQKRKKAWRAI---I--GDNGE------------------------SRN-------------DTLIK---SIELK----------YKDLYQKTRAHRQKLGISFK-EYLEKYVSEEEKRKAE-KK--------V-TAT---G---VSILDPVMAEIVCKWFGV------DG--G-----------NVFDCFAGDSVFGYVAAHEGFNFTGIELRPEQARLN-------NER-V------EGM-------------T-------------AKYINDDGQN-V---------AKY-IE--P----ES--------Q----DLLFSCPPYFDLEVYSD-----DPKDASNQDSYEDFIHILKNAFTESLKCLKENRFAVVVIGDVR-NKK--T-GFYYNMIDDLKRTFKEAGAPLYNEIILLEQTANSALRAAKAME-SRKVVKVHQNILVFYKGDP-------------------KHIKDNFKI-IEYDE-------QDLES
      B157_RS0110235_Spirosoma_spitsbergense_754558370                              NTGGLSERF-----IVPPFSILDT-RQ----------------GYWKERKEQWRTR---I--GDFGE------------------------SRN-------------ATLRK---SKSGD----------DPGYYRQKSGVEKQLG---------RKLTAQEFERDH-YV--------R-LTKLPVG---VSLLDPVLSEIIVQWFGL------AG--G-----------GAFDPFAGDSVFGFVSAATGMTFTGIELRQEQADLN-------QTR-L----DTEGL-------------P-------------GRYICDDGRN-V---------AQH-LP--A----ES--------Q----DLLFSCPPYFDLEVYSD-----SPQDASNQKTYAGFYAILHEAFTNAVLCLKPNRFAVIVCGDVR-DKK--T-GAYYGFPQDVIQTMRSAGLHFYNECILIEQAGNAAIRASSQMK-HRKVVKTHQQVLVFYKG------------------------------------------------
      P791_RS10720_Enterococcus_faecalis_727050824                                  ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------NNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------DIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSAKKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSKEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      D927_RS13925_Enterococcus_faecalis_488292151                                  TNTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------NNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSKEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      UMY_RS13155_Enterococcus_faecalis_498481713                                   TNTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------NNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSKEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      T261_5811_Streptomyces_lydicus_A02_768317646                                  RPPTLGPLP-----GIHPFSVIRT-DL----------------GPWRERRRAWHDL------GIASR--------------------A---GRE-------------HVTTF---ATRGK-FA---------QE---------------K------------------LT----------SIN-D-G---LSTFDPVLAETSYSWYCP-----DGG--------------TILDPFAGGSVRGLVAGNGGFRYTGIDLSPVQLDAN-------EEQ-AAAWRENDLL-----------A-----------AQ--PEWLLGDAADVL---------PDL-EE--G----AY-------------DYVFTCPPYHNLEKYSD-----HPADLSAM-RWKEFAEVYREIIAASVRCLAEDRFATWVVGEVR--NS--L-GMIRGLIPLTIAAHEAAGARLYNDAVLMNTLGTVPLRIGNQWRASRKMGRHHQYVLTFVKGDP-------------------KRATASLGE-GTA--------------
      HMPREF9518_RS16950_Enterococcus_faecalis_488328083                            ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSKEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQYFEV-LETLD-------DELEN
      CULC0102_0528_Corynebacterium_ulcerans_0102_393402237                         QAATLSERF-----GVPPVTVINT-RS----------------GEWQGRKRAWTAK------GIASF--------------------E---GRR-------------DKMIY---THGAN-LYSNWFD--IKSK---------------A-RKTHPDITDKEIAENY-QD--QLKPF---TNG-S-G---TSTFDPVLAELLLAWFSK-----RDD--------------RVLDPWAGGSVRGIVSAAVGRQYVGHELRPEQVEEN-------TNQ-W------GTY-----------D-Q-----TECAHP--PQWITGDSRD-T---M-----RHH-PA--G-AF----------------DMIIGCPPYYDLETYSD-----DPSDLSTL-TTEEFDEAMANTLRIADKALANDRFAAFVIGPVR--DK--H-GALRDMKRCMINAA-PTGWSYVNDMVLVNPMGTAQLRAARSFTSTRTLTRVHQDIVIFAKGDR-------------------KRAAERLGD-VELVN-F-----DTLDE
      B073_RS0131580_Streptomyces_sp_MspMP-M5_739963074                             MAPTLGPLP-----GIRPFSVLRT-DL----------------GPWRDRRAAWNDL------GLASR--------------------T---GRE-------------HATTY---ASRGK-FA---------EK---------------Y------------------LT----------SIN-G-G---LSTFDPVLAEVLYEWYCP-----ERG--------------SVLDPFAGGSVRGLVAGNGGYRYTGIDLSPSQVDAN-------EDQ-AADWAARDLL-----------A-----------AK--PRWILGDAADVL---------PDF-AT--G----AY-------------DYVMTCPPYHNLEKYSD-----HPADLSAM-RWKEFTEAYREIIADSVRCLAEDSFATWVVGEVR--NS--A-GIIRGLIPLTIAAHEAAGARLYNDAILMNTLGTTPMRLGNQWRASRKMGRHHQYVLTFVKGDP-------------------KCATARLAE-VAP--------------
      ES21_06495_Enterococcus_faecalis_694245431                                    ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSAKKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSTEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      JF27_RS10635_Enterococcus_faecalis_640121481                                  ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSAKKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSTEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      WUC_RS08035_Enterococcus_faecalis_488295148                                   ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQISAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSAKKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSTEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      WOK_RS09735_Enterococcus_faecalis_498526876                                   ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQISAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSAKKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSTEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      WU9_RS08245_Enterococcus_faecalis_498397880                                   ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQISAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSTEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      D349_RS04690_Enterococcus_faecalis_514889617                                  ANTSLFDSF-----LFPPFSYLDT-KT----------------KRWLDRKRQWKEL------GIKSE------L-----------------GRE-------------DNLVF---NPSMQ----------APGL--------------------------------------------------E-G---TSIFDPVLCELGYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQISAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSTEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      DM07_RS11075_Oceanicaulis_sp_HL-87_738605470                                  -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SIFDPVICEIAYRWFCP-----PAG--------------MVLDPFAGGSVRGIVASRLGRGYCGIELRAEQVDAN-------RAQ-A------HLA-----------G-D---------PA--PEWRQGDSRD-L---------AKL-AG--D----IE--------A----DLVFSCPPYWNLEQYSE-----DPADLSNM-GREEFFAAQADIIAAAVARLRPDRFAVWIVGDVR--DG--D-GCFVNLPGRTIEAFESAGARFYNDAILVTAVGSLPIRAGRQFEASRKLGRTHQNVLVFVKGDP-------------------KKATEACGP-VE---------------
      K252_RS0101690_Butyricimonas_virosa_652946519                                 PESSLFDRF-----VVPPFSILDT-RK----------------GYWQDRKKKWYDI---I--GDMGE------------------------SRN-------------DTLVT---SLEIK----------YKDLYQRTREHRKELGISFK-EYIEKYVPKEELEREQ-SK--------I-VAQ---G---VSILDPVMAEIVCRWFGF------KN--C-----------QTFDCFAGDSVFGFVSAYLGNQFTGIELREQQASLN-------NER-V------AEM-------------T-------------ARYICDDGQN-V---------AKH-IT--P----ES--------Q----DLLFSCPPYFDLEKYSD-----LPNDASNQDSYEDFIQILKNAFTAAVGCLKNNRFAVICVGDVR-DRK--T-GFYYDFCGDIKRIFKEAGVLLYNEIILVEQTASTALRAARYME-TRKVAKTHQHILVFFKGNP-------------------KDIKKEYPK-IEYTE-------EDMVQ
      BACCOP_RS00640_Bacteroides_coprocola_749916162                                PESSLFDRF-----VVPPFSILDT-RK----------------GYWQDRKKKWYDI---I--GDMGE------------------------SRN-------------DTLVT---SLEIK----------YKDLYQRTREHRKELGISFK-EYIEKYVPKEELEREQ-SK--------I-VAQ---G---VSILDPVMAEIVCRWFGF------KN--C-----------QTFDCFAGDSVFGFVSAYLGNDFTGIELREQQASLN-------NER-V------ADM-------------T-------------ARYICDDGQN-V---------AKH-IN--P----ES--------Q----DLLFSCPPYFDLEKYSD-----LPNDASNQDSYEDFIQILKNAFTAAVGCLKNNRFAVICVGDVR-DRK--T-GFYYDFCGDIKRIFKEAGVLLYNEIILVEQTASTALRAARYME-TRKVAKTHQHILVFFKGNP-------------------KDIKKEYPK-IEYTE-------EDM--
      BACCOP_01158_Bacteroides_coprocola_DSM_17136_189432761                        PESSLFDRF-----VVPPFSILDT-RK----------------GYWQDRKKKWYDI---I--GDMGE------------------------SRN-------------DTLVT---SLEIK----------YKDLYQRTREHRKELGISFK-EYIEKYVPKEELEREQ-SK--------I-VAQ---G---VSILDPVMAEIVCRWFGF------KN--C-----------QTFDCFAGDSVFGFVSAYLGNDFTGIELREQQASLN-------NER-V------ADM-------------T-------------ARYICDDGQN-V---------AKH-IN--P----ES--------Q----DLLFSCPPYFDLEKYSD-----LPNDASNQDSYEDFIQILKNAFTAAVGCLKNNRFAVICVGDVR-DRK--T-GFYYDFCGDIKRIFKEAGVLLYNEIILVEQTASTALRAARYME-TRKVAKTHQHILVFFKGNP-------------------KDIKKEYPK-IEYTE-------EDMVQ
      phiCTP1_gp51_Clostridium_phage_phiCTP1_304360713                              DKTSLLNSF-----VVPPFSVLDT-RQ----------------GYWQDRKRKWLKYTGNLSQTRDGE------F-----------------GRV-------------GQGT----EDNLF----------GTIN--------------------------------------------------N-G---TSNFDPVLAEIAYKWFCP------LG--G-----------KILDPFGGEQTKGVVAGILGFNYNAVEFRKDQVDLN-------KKC-V------AP------------------Y-----AG--VDYICGDSNN-I---------EDL-IK--D----RD--------F----DMIFTSPPYYDLEVYSK-----D--DMSALGTYEEFMKQYKNIFAHCFNMLADDRFLVIKIGEIR-DKK--T-GIYRNFIGDNISIMKDIGFKYYNEAILINSFGTAPIRARGQMR-NRKMVKVHQNILVFYKGNE-------------------KNIS-NLKF-WGNKN-------E----
      GV66_RS18355_Bacteroides_dorei_740821721                                      PESSLFDRF-----VVPPFSILDT-RK----------------GYWQDRKKKWYDI---I--GDMGE------------------------SRN-------------DTLVT---SLEIK----------YKDLYQRTREHRKELGISFK-EYIEKYVPKEELEREQ-SK--------I-VAQ---G---VSILDPVMAEIVCRWFGF------KN--C-----------QTFDCFAGDSVFGFVSAYLGNSFTGIELREQQASLN-------NER-V------ADM-------------T-------------ARYICDDGQN-V---------AKH-IT--P----ES--------Q----DLLFSCPPYFDLEKYSD-----LPNDASNQDSYEDFIQILKNAFTAAVGCLRNNRFAVICVGDVR-DRK--T-GFYYDFCGDIKRIFKEAGVLLYNEIILVEQTASTALRAARYME-TRKVAKTHQHILVFFKGNP-------------------KDIKKEYPK-IEYTE-------EDMVQ
      D881_RS16175_Corynebacterium_ulcerans_752848289                               -----------------------------------------------------------------------------------------------------------------------------------KSK---------------A-RKTHPDITDKEIAENY-QD--QLKPF---TNG-S-G---TSTFDPVLAELLLAWFSK-----RDD--------------RVLDPWAGGSVRGIVSAAVGRQYVGHELRPEQVEEN-------TNQ-W------GTY-----------D-Q-----TECAHP--PQWITGDSRD-T---M-----RHH-PA--G-AF----------------DMIIGCPPYYDLETYSD-----DPSDLSTL-TTEEFDEAMANTLRIADKALANDRFAAFVIGPVR--DK--H-GALRDMKRCMINAA-PTGWSYVNDMVLVNPMGTAQLRAARSFTSTRTLTRVHQDIVIFAKGDR-------------------KRAAERLGD-VELVN-F-----DTLDE
      BN352_RS08925_Candidatus_Alistipes_marseilloanorexicus_518076130              ANGSLADRF-----VIPPFSILDT-RK----------------GYWQARKKVWREL---I--GDMGE------------------------SRN-------------DTLIT---SPEIK----------YKDIYQKTREHRESLGLSFK-EYLDKYVPEEVKEREA-AK--------V-LSA---G---VSLLDPVMAELICRWFGL------EK--C-----------KTFDCFAGDSVFGYVSAYLGNEFTGIELRPEQAQLN-------NER-V------EGM-------------A-------------ARYICDDGQN-V---------GQH-IE--P----NS--------M----DLLFSCPPYYDLEKYSD-----LENDASNQGTYEEFLTILTNAFKSALGCLKENRFAVIVVGDVR-DKK--T-GFYYNFIDDMKRIFKENGAALYNELILIETGASTALRAARYME-SRKVAKMHQNILVFYKGNT-------------------KEIKNNYKK-IEYAS-------EDLEL
      B620_gp66_Croceibacter_phage_P2559S_399528699                                 AHRKLTERF-----IVPPFSVLDS-RK----------------GYWMQRKRAWKAL---I--QDDGQ------------------------SRE-------------GTLGK---DTIME----------QINS----------------------------------------------------G---VSILDPVMAELVSLWYGI------KG--G-----------NAFDCFAGDTVFGFVAGRYGMNFKGIELRQQQADLN-------NMR-V----KAAGL-------------P-------------AEYICDDGQN-V---------DKH-FK--Q----LS--------Q----DLFFSCPPYFDLEVYSD-----MEQDASNQASYKEFLQILETAFEKSYNALKENRFAVIVVGDIR-GKD--S-AYRL-FVDDMRRMWTGFGAMLYNEMIYVEPIGTLPQRAARLMK-NRKIGKCHQNILVFYKGDP-------------------KQIQNNFKE-LEYDS-------EDLEF
      NWI_RS04225_Nitrobacter_winogradskyi_499633394                                LARTFGQDL-----MRGEHEVGAT------------------------TNGGVLMP------SHTSGD--------------PGFYSK---KRA-------------REAEI---GRELT-TEEFLAD--HYEA---------------S------------------DA----------PTA-S-G---TSIFDPVLCEIAYRWFCP-----QGG--------------TVLDPFAGGSVRGIVASRLGRRYVGIELRHEQVEAN-------RAQ-V------TIA-----------V-E---------PS--PEWRVGDARD-L---------GAI-AA--D----VA--------A----DLIFSCPPYWNLERYSD-----DPADLSTM-GEAAFFEAQAAIIAAAVARLKDDRFAVWVVGDVR--DD--R-GFYVNLPGRTVEAFEAAGARFYNEAILVTAVGSLPIRTGRQFTAARKLGRTHQSVLVFVKGDP-------------------RRATEACGE-VEFGE-I-----EEALA
      HMPREF1261_00448_Corynebacterium_sp_KPL1818_550761804                         AALTLAERF-----LVVPTTVIDT-RR----------------GEWRERKKSWLKL------GVAAQ--------------------E---GRA-------------GQLIY---APPSG-NFINWYE--IKNK---------------A-LALNSELTNKEILEKY-ED--QLKPY---NEG-R-G---TSVFDPALCEVLYLWFSN-----PGD--------------HTLDPWAGGSVRGIVGAMLNRHYQGHELRAEQCVEN-------RKQ-A------DRI-----------K-E-RGHLPHGVDK--PVWIDGDSAK-T---M-----QDN-QA--E-SF----------------DFIIGCPPYYDLEQYSD-----DEADISNL-STEDFNAAMAVTLQEVDRCLRPNRFAAFVVGSAR--DK--R-GDLRDMKACMMNAM-PDGWHLANDAVLVNNVGTGAIRAKKMFEGGRSLARVHQDILVFVKGDR-------------------KTAAKRLDA-IKVAE-L-----DSMDN
      HMPREF1267_02363_Corynebacterium_sp_KPL1824_550751922                         AALTLAERF-----LVVPTTVIDT-RR----------------GEWRERKQSWLKL------GVAAQ--------------------E---GRA-------------GQLIY---APPSG-NFINWYE--IKNK---------------A-LALNSELTNKEILEKY-ED--QLKPY---NEG-R-G---TSVFDPALCEILYLWFSN-----PGD--------------RTLDPWAGGSVRGIVGAVLNRHYQGHELRAEQCVEN-------RKQ-A------DRI-----------K-E-RGHLPHGVDK--PVWIDGDSAK-T---M-----QDN-QA--E-SF----------------DFIIGCPPYYDLEQYSD-----DEADISNL-STEDFNAAMAVTLQEVDRCLRPNRFAAFVVGSAR--DK--R-GDLRDMKACMMNAM-PDGWHLANDAVLVNNVGTGAIRAKKMFEGGRSLARVHQDILVFVKGDR-------------------KTAAKRLDA-IKVAE-L-----DSMDN
      HMPREF1267_RS11500_Corynebacterium_sp_KPL1824_736660310                       -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EG-R-G---TSVFDPALCEILYLWFSN-----PGD--------------RTLDPWAGGSVRGIVGAVLNRHYQGHELRAEQCVEN-------RKQ-A------DRI-----------K-E-RGHLPHGVDK--PVWIDGDSAK-T---M-----QDN-QA--E-SF----------------DFIIGCPPYYDLEQYSD-----DEADISNL-STEDFNAAMAVTLQEVDRCLRPNRFAAFVVGSAR--DK--R-GDLRDMKACMMNAM-PDGWHLANDAVLVNNVGTGAIRAKKMFEGGRSLARVHQDILVFVKG------------------------------------------------
      HMPREF1261_RS02180_Corynebacterium_sp_KPL1818_736650700                       -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EG-R-G---TSVFDPALCEVLYLWFSN-----PGD--------------HTLDPWAGGSVRGIVGAMLNRHYQGHELRAEQCVEN-------RKQ-A------DRI-----------K-E-RGHLPHGVDK--PVWIDGDSAK-T---M-----QDN-QA--E-SF----------------DFIIGCPPYYDLEQYSD-----DEADISNL-STEDFNAAMAVTLQEVDRCLRPNRFAAFVVGSAR--DK--R-GDLRDMKACMMNAM-PDGWHLANDAVLVNNVGTGAIRAKKMFEGGRSLARVHQDILVFVKG------------------------------------------------
      D471_RS0130950_Nocardiopsis_lucentensis_648436510                             PPPPLEAR------EVTPLSVLQT-AR----------------GPWQVRKKAWAQA------GLTGD--------------------A---GRD-------------HITIW---PTISK-TA---------G------------------------------------I----------EWQ-R-A---VSVFDPHLTDVHYTWYCP-----PGG--------------LILDPFAGGATRGLVAAHRGYHYTGIDLSEQQVEAN-------RQQ-YQAWQDRGLV-----------T-----------GS--AKWIVGSAQEAL---------PCY-QT--G----WA-------------DYIFTCPPYHGLERYSD-----DPRDLSAM-DWDEYLHMVNLIAAECARILKQDRFTTWITGDLR--DK--N-GHLRRLPSKVDDAHEHAGLALANDTIIAAPLGGKFGVIWRNWVPTRSTTRIHQHAHTWVSGNR-------------------KTATTAATR------------------
      QR19_RS07135_Enterococcus_faecalis_728810984                                  ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------GYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQVSAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLTGLTKRAFSKEGLYFYNDMILLNAVGSGSLRARRLMN-NRKVTRMHQNVLVFYKGNP-------------------KNINQHFEV-LETLD-------DELEN
      A3EC_RS11435_Corynebacterium_ulceribovis_750124080                            ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------G---VSRFDPTLAEVFYRWWTA-----SGD--------------LVFDPCCGGVTRGLVASAMRRRYVGVDVRSEQVAAN-------SAY------------------------G-------------DMWRVGDGR--V---------GHM-SG-----------------V----DAVFTCPPYWRLERYSD-----QSDDMSAM-TLAQYRQAVNEIAAASYGALAENRYIGVVISDAR--GR--D-GLYAGLPAMWWQAMSDAGFGMLADMVVLDPVGRKYLTGWKLLGTSRKPTRVHQYLLVGVKGDA-------------------RQA------------------------
      QR19_RS04500_Enterococcus_faecalis_728810915                                  ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------GYRWFTP-----KTE--S-----------NIFDPFAGGSVRGIVAKVLGHNYTGIDLRAEQISAN-------YAN-A------REI-------------G---L-----SD--INWICDDSLN-I---------DHH-IE--D----ES--------Q----DLLFTCPPYADLEVYSD-----DERDISNM-SYEEFAEVYSEILKRSARKLKDNRFAVVTISDVR--DK--K-GFYQDLT------------------------------------------------------------------------------------------------------
      ER57_RS06425_Smithella_sp_SCADC_739526353                                     AHATMTNTW----------------NS----------VK----GDWLKMKKEWNER-IEAA-GEKHG----I-L----N---PKF--A---SRE-GCWQGESGF---SNVVL-T-ERSID----------DDKV------------VV-N------------------KG----------KSL-N-G-N-ASVLDPVACECILRFFMP-----KEG--R-----------RIYNPFGGGVQFGFVSGAYGDEYVASEIRQNQCDAN------------------NKI-------C---S-E---F-----PG--VRWVKSDSAT-Y---------KPE-----G-MF----------------DLVFTCPPYYRVEKYVDYDGQPPEGEINHLGTYNDFRDTLFAGYKVAIDHLNDNRFFVVMTGDSR--DK--N-GSYHCHEAETEIFFKESGLSVYNKIIYLESEFTRLAQAKKTLD-YRKFPKREQKIIVAYKGDA----------SV---I---KDIFPPIGR-L----------------
      consensus/100%                                                                .................................................................................................................................................................................................................................................................hp......s.................................................................................................DhhhssPPY...E.Y..........-.s.......a..................................................................................................................................................
      consensus/95%                                                                 ...............................................................................................................................................................................................S.hsPshsE..h.Wa..........................hDsauGsssbGhVu...s..a.G.-lp..Qh..N.........p........................................a..ssu.p.......................................DhlhsCPPY.sLE.Ysp........DhS.b.....F...b...h..s...L..pph..h.hup.R........u....h...................................h......s..cQ.......u................................................
      consensus/90%                                                                 ............................................................................................p..................................................................................................S.hDPshsEl.h.Was.........................hDPFAGssVbGhVu...s..Y.G.-lp.pQh..N.........p..........h.............................a..ssu.p.h..........p..........................DhlhsCPPY.sLE.YSs........DhSsh..b..F...h..hh..sh..Lp.spFushsluphR...p....G.h.sh.s....h....Gh.hhNphlhhp..ss..hRh.p.h..spphsb.HQpllshhKGp...............................................
      consensus/85%                                                                 .......................s.....................h..bp..h.........s..u..........................R.................................................................................................sSlFDPVlsEl.hpWFs........................lhDPFAGGSVRGlVu..bs..Y.G.-Lp.pQh..N.........p..........h.............................W..sDu.p.h..........p..........................DhlhoCPPYhcLE.YSc.....p..DlSsh..h.pF...h.phh..sh..L+.s+FAshslu-hR...c....G.hbsh.s....hh...Gh.hhNchlLhp..uo.shRh.p.h..sRKls+.HQplLlFhKGs....................p..........................
      consensus/80%                                                                 ....h...h......hsPao.hss.p...................WbpR+p.W..b......u..u.........................uR................h................................................................................sSlFDPVLsEl.hpWFss.......s...............lhDPFAGGSVRGlVu..hG.pY.Gh-LR.cQl.hN.......b.p.h........h.............................W.ssDu.p.l..........p..........................DhlFoCPPYhDLE.YSD.....s..DlSsh.sa.pF..hh.phl..uh..L+ssRFAshslu-lR..s+..p.G.hbsh.s..bphh...Gh.hYN-hlLls.hGo.shRh.p.h..sRKls+.HQplLVFhKGs....................pph...h...h................
      consensus/75%                                                                 ...sL.ppF......hsPFShlss.cp..................WbpR+c.Wb.h......G..u.........................uRp...............h.....s..........................................................................sSlFDPVLsElhYpWFss.......s..............plhDPFAGGSVRGlVAs.LG.pYsGh-LR.-Ql.uN.......b.p.h.......bh.............................WhssDS.p.l.........sp..h.........p.............DhlFoCPPYhDLE.YSD.....s..DlSsh.sY.pFhphappll..uh..L+ssRFAshllu-lR..c+..p.G.hbshsu.sbpAa..sGh.hYN-hlLlsshGo.slRh.p.hp.sRKls+.HQslLVFhKGs....................cph...h...lp...............
      consensus/70%                                                                 ...sL.ppF.....hhsPFShlss.+p..................WbcRKc.Wb.l......GhpSp........................GRp...............h.....s......................................................................u...sSlFDPVLsElhYcWFsP.......s..............plhDPFAGGSVRGlVAu.LGppYsGl-LR.EQlpAN.......b.p.h......pbh.............s.........s...spWlssDSbp.l.........sp..h...s.....p.............DhlFoCPPYhDLE.YSD.....s.pDlSsM.sYpcFhphYppIl.puhpbLKssRFAshVlu-lR..cK..p.Ghabshsu.TbpAF..sGh.hYN-hILlsshGo.slRh.+.hp.sRKls+.HQslLVFhKGs....................cph...h...lp...............
      
      Back to Contents
    • General notes, phyletic distribution and gene neighborhoods of Group I-Clade6/ Emiliania-ver1 N6-MTases

    • General notes:

      Members of this clade are only found in Emiliania huxleyi in eukaryotes. E. huxleyi has 6 copies of this family of which 5 are definitely complete and 4 are fused to an N-terminal FHA domain. This methylase shares with its prokaryotic homologs several characteristic motifs including a CPPYxxDxE motif in Strand-4, E in helix before strand-1, D at the end of strand-1, R in helix after strand-1 QxxxN in helix after strand 2, [DN] after strand-3, D beginning of Strand-4, D between strand -4 and following helix, R at the end of Strand-5, N and R in helix after strand-6, HQ and K flanking strand-7. Operons of the prokaryotic homologs are very stereotypic and they are present in the ParB-TLs locus where the methylase is usually fused to ParB. This suggests that the DAM is basically derived from a phage-type DAM and was transferred into Emiliania in a lineage specific way, where it fused to a FHA domain and expanded. The presence of the FHA domain suggests a role for protein phosphorylation in regulating the Methylase activity.
      GI           Gene neigborhoods                                                                                                                      Archs                                   Pfam architectures             Gene name                Len  Taxonomy                                          Species                                            Genbank
      #; Eukaryotic versions
      551572077    <-FHA+N6-MTase*                                                                                                                        FHA+N6-MTase                             FAP+Methyltransf_26            EMIHUDRAFT_464003       688  eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                         hypothetical protein EMIHUDRAFT_464003 [Emiliania huxleyi CCMP1516].             551572063_?-><-551572065_?||551572067_?-><-551572069_?||551572071_?-><-551572073_?||551572075_?-><-551572077_FHA+N6-MTase*||551572079_?-><-551572081_?||551572083_?->551572085_?-><-551572087_?<-551572089_?<-551572091_?
      551554922    FHA+N6-MTase*->                                                                                                                        FHA+N6-MTase                             Methyltransf_26                EMIHUDRAFT_459692       604  eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                         hypothetical protein EMIHUDRAFT_459692 [Emiliania huxleyi CCMP1516].             551554918_?-><-551554920_?||551554922_FHA+N6-MTase*-><-551554924_?||551554926_?-><-551554928_?||551554930_?->551554932_?-><-551554934_?<-551554936_?
      551585076    FHA+N6-MTase*->                                                                                                                        FHA+N6-MTase                             FHA+Methyltransf_26            EMIHUDRAFT_115516       683  eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                         hypothetical protein EMIHUDRAFT_115516 [Emiliania huxleyi CCMP1516].             551585076_FHA+N6-MTase*-><-551585078_?||551585080_?-><-551585082_?||551585084_?-><-551585086_?||551585088_?-><-551585090_?
      551629083    FHA+N6-MTase*->                                                                                                                        FHA+N6-MTase                             FHA+Methyltransf_26            EMIHUDRAFT_95251        658  eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                         hypothetical protein EMIHUDRAFT_95251 [Emiliania huxleyi CCMP1516].              551629071_?-><-551629073_?<-551629075_?||551629077_?-><-551629263_?||551629079_?-><-551629081_?||551629083_FHA+N6-MTase*->551629085_?-><-551629087_?<-551629089_?||551629091_?-><-551629093_?||551629095_?-><-551629097_?
      551572199    <-N6-MTase*<-?||?-><-?||?-><-?||N6-MTase->                                                                                             N6-MTase                                 SP                             EMIHUDRAFT_209088       136  eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                         hypothetical protein EMIHUDRAFT_209088 [Emiliania huxleyi CCMP1516].             <-551572185_?<-551572187_?<-551572189_?<-551572191_?||551572193_?-><-551572195_?||551572197_?-><-551572199_N6-MTase*<-551572201_?||551572203_?-><-551572205_?||551572207_?-><-551572209_?||551572211_N6-MTase->551572213_?->
      551605992    N6-MTase*->                                                                                                                            N6-MTase                                 DUF3597+Methyltransf_26        EMIHUDRAFT_231186       572  eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                         hypothetical protein EMIHUDRAFT_231186 [Emiliania huxleyi CCMP1516].             551605978_?->551605980_?-><-551605982_?||551605984_?->551605986_?->551605988_?-><-551605990_?||551605992_N6-MTase*-><-551605994_?<-551605996_?<-551605998_?||551606000_?-><-551606002_?||551606004_?-><-551606006_?
      # 73; Prokaryotic homologs                                                                                                                                                                                                                   
      509139481    DnaJ->?->?->?->?->?->ParB+N6-MTase*->N6-MTase->?->?->?->?->?->N6-MTase->                                                               ParB+N6-MTase                            ParBc+Methyltransf_26          M201_gp11               526  viruses>dsdna viruses, no rna stage               Halovirus HCTV-2                                   hypothetical protein HCTV2_11 [Halovirus HCTV-2].                                509139476_?->509139477_DnaJ->509139478_?->509139479_?->509139480_?->509139548_?->509139549_?->509139481_ParB+N6-MTase*->509139482_N6-MTase->509139483_?->509139484_?->509139485_?->509139550_?->509139486_?->509139487_N6-MTase->
      505358723    <-MuF+ART<-?<-Phage_portal<-Terminase_LS<-HTH<-ParB+N6-MTase*<-?<-VRRNUC                                                               ParB+N6-MTase                            Methyltransf_26                HGGM_RS12685            523  bacteria>bacteroidetes                            Alistipes shahii                                   hypothetical protein [Alistipes shahii].                                         505358716_?-><-505358717_?<-505358718_MuF+ART<-505358719_?<-505358720_Phage_portal<-648293960_Terminase_LS<-505358722_HTH<-505358723_ParB+N6-MTase*<-505358724_?<-505358725_VRRNUC<-505358726_?<-736777341_?<-505358727_?<-505358728_?<-505358729_?
      740824887    VRRNUC->ParB+N6-MTase*->HTH->Terminase_LS->Phage_portal->MuF->                                                                         ParB+N6-MTase                            Methyltransf_26                EL88_RS12025            518  bacteria>bacteroidetes                            Bacteroides dorei                                  hypothetical protein [Bacteroides dorei].                                        740827101_?->740827104_?->740824875_?->740824878_?->740824881_?->740824884_?->740827107_VRRNUC->740824887_ParB+N6-MTase*->740827110_HTH->740827113_Terminase_LS->740824891_Phage_portal->740827116_MuF-><-740824894_?<-740824899_?<-740824902_?
      407249874    <-Antirestrict<-?<-?<-MuF<-?<-Terminase_LS<-?<-ParB+N6-MTase*<-?<-?<-?<-?<-VRRNUC                                                      ParB+N6-MTase                            SP+ParBc+Methyltransf_26       AMBLS11_12430           517  bacteria>proteobacteria>gammaproteobacteria       Alteromonas macleodii str. 'Black Sea 11'          hypothetical protein AMBLS11_12430 [Alteromonas macleodii str. 'Black Sea 11'].  <-407249867_Antirestrict<-407249868_?<-407249869_?<-407249870_MuF<-407249871_?<-407249872_Terminase_LS<-407249873_?<-407249874_ParB+N6-MTase*<-407249875_?<-407249876_?<-407249877_?<-407249878_?<-407249879_VRRNUC<-407249880_?<-407249881_?
      495813435    <-MuF<-Phage_portal<-Terminase_LS<-?<-?<-ParB+N6-MTase*                                                                                ParB+N6-MTase                            ParBc+Methyltransf_26          HMPREF9454_RS03645      481  bacteria>firmicutes                               Megamonas funiformis                               ParB-like partition protein [Megamonas funiformis].                              <-495813426_?<-748623224_?<-748623225_MuF<-495813429_Phage_portal<-495813430_Terminase_LS<-495813431_?<-495813433_?<-495813435_ParB+N6-MTase*<-495813436_?<-748623226_?<-495813439_?<-495813441_?<-495813442_?<-495813443_?<-495813444_?
      655611541    <-AHH||?->ParB+N6-MTase*->?->Terminase_LS->                                                                                            ParB+N6-MTase                            ParBc+Methyltransf_26          EM93_14610              478  bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus                            hypothetical protein EM93_14610 [Vibrio parahaemolyticus].                       655611534_?->655611535_?-><-655611536_?<-655611537_?<-655611538_?<-655611539_AHH||655611540_?->655611541_ParB+N6-MTase*->655611542_?->655611543_Terminase_LS->655611544_?->655611545_?->655611546_?->655611547_?->655611548_?->
      502738489    ParB+N6-MTase*->                                                                                                                       ParB+N6-MTase                            ParBc+Methyltransf_26          AZL_RS04495             469  bacteria>proteobacteria>alphaproteobacteria       Azospirillum lipoferum                             hypothetical protein [Azospirillum lipoferum].                                   755092447_?->755092445_?->755092443_?->755091482_?->755091480_?->755091478_?->502738490_?->502738489_ParB+N6-MTase*->502738488_?->502738487_?-><-502738486_?<-755091476_?<-755092441_?||755091475_?->755091474_?->
      665395990    ParB+N6-MTase*->?->URI1->HTH->Terminase_LS->?->?->Phage_portal->                                                                       ParB+N6-MTase                            Methyltransf_26                JCM15093_3233           467  bacteria>bacteroidetes                            Bacteroides graminisolvens DSM 19988 = JCM 15093   sensor protein FixL [Bacteroides graminisolvens DSM 19988 = JCM 15093].          665395983_?->665395984_?->665395985_?->665395986_?-><-665395987_?<-665395988_?<-665395989_?||665395990_ParB+N6-MTase*->665395991_?->665395992_URI1->665395993_HTH->665395994_Terminase_LS->665395995_?->665395996_?->665395997_Phage_portal->
      646357048    <-AHH||?->?->ParB+N6-MTase*->?->?->Terminase_LS->                                                                                      ParB+N6-MTase                            SP+ParBc+Methyltransf_26       EM98_RS02575            465  bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus                            hypothetical protein [Vibrio parahaemolyticus].                                  639555748_?-><-491625684_?<-639577452_?<-545083456_?<-646357052_AHH||491637856_?->686139500_?->646357048_ParB+N6-MTase*->757509302_?->686140497_?->686140496_Terminase_LS->545125087_?->545125048_?->639577449_?->686140495_?->
      686232662    <-Terminase_LS<-ParB+N6-MTase*<-?<-?||AHH->                                                                                            ParB+N6-MTase                            SP+ParBc+Methyltransf_26       EM79_05715              465  bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus                            hypothetical protein [Vibrio parahaemolyticus].                                  <-545084109_?<-686232657_?<-686232658_?<-686232659_?<-545125048_?<-686232660_?<-686232661_Terminase_LS<-686232662_ParB+N6-MTase*<-686232663_?<-491637856_?||646357052_AHH->686232664_?->646368128_?->491625684_?-><-639555748_?
      755016279    N6-MTase->?->?->?->ParB+N6-MTase*->?->HTH->Terminase_LS->Phage_portal->                                                                ParB+N6-MTase                            Methyltransf_26                TY03_RS09575            463  bacteria>bacteroidetes                            Bacteroidaceae bacterium MS4                       hypothetical protein [Bacteroidaceae bacterium MS4].                             755016269_?->755016271_?->755016274_?->755016853_N6-MTase->755016855_?->755016275_?->755016277_?->755016279_ParB+N6-MTase*->755016281_?->755016857_HTH->755016284_Terminase_LS->755016859_Phage_portal->755016286_?->755016288_?->755016289_?->
      771670996    ParB+N6-MTase*-><-?||HTH->Terminase_LS->Phage_portal->                                                                                 ParB+N6-MTase                            ParBc+Methyltransf_26          UB46_RS23670            461  bacteria>proteobacteria>betaproteobacteria        Burkholderiaceae bacterium 16                      chromosome partitioning protein ParB [Burkholderiaceae bacterium 16].            771670978_?-><-771670981_?||771670984_?->771670986_?->771670990_?->771670992_?->771670994_?->771670996_ParB+N6-MTase*-><-771670998_?||771671000_HTH->771671002_Terminase_LS->771671004_Phage_portal->771671007_?->771671010_?->771671012_?->
      771680244    <-MuF<-Terminase_LS<-HTH<-ParB+N6-MTase*||?->?-><-?<-?||?-><-DCM                                                                       ParB+N6-MTase                            ParBc+Methyltransf_26          UB46_RS43165            461  bacteria>proteobacteria>betaproteobacteria        Burkholderiaceae bacterium 16                      chromosome partitioning protein ParB [Burkholderiaceae bacterium 16].            <-771680264_MuF<-771680266_Terminase_LS<-771680268_HTH<-771680244_ParB+N6-MTase*||771680247_?->771680249_?-><-771680252_?<-771680255_?||771680257_?-><-771680259_DCM<-771680261_?
      429146233    <-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-?<-ParB+N6-MTase*                                                                           ParB+N6-MTase                            ParBc+Methyltransf_26          HMPREF9998_01728        460  bacteria>firmicutes                               Peptostreptococcus anaerobius VPI 4330 = DSM 2949  ParB-like protein [Peptostreptococcus anaerobius VPI 4330 = DSM 2949].           <-429146226_?<-429146227_MuF<-429146228_Phage_portal<-429146229_Terminase_LS<-429146230_HTH<-429146231_?<-429146232_?<-429146233_ParB+N6-MTase*<-429146234_?<-429146235_?<-429146236_?<-429146237_?<-429146238_?<-429146239_?<-429146240_?
      759627761    HNH->?->HNH->ParB+N6-MTase*->HTH->Terminase_LS->?->?->P22_CoatProtein->                                                                ParB+N6-MTase                            ParBc+Methyltransf_26          RR42_RS08225            460  bacteria>proteobacteria>betaproteobacteria        Cupriavidus basilensis                             chromosome partitioning protein ParB [Cupriavidus basilensis].                   <-759627756_?||759627757_?->759627758_?->759627759_?->759633912_HNH->759627760_?->759633914_HNH->759627761_ParB+N6-MTase*->759633917_HTH->759627762_Terminase_LS->759627763_?->759633919_?->759627764_P22_CoatProtein->759627765_?->759627768_?->
      499617803    ParB+N6-MTase*->HTH->Terminase_LS->Phage_portal->MuF->                                                                                 ParB+N6-MTase                            ParBc+Methyltransf_26          REUT_RS12130            459  bacteria>proteobacteria>betaproteobacteria        Cupriavidus pinatubonensis                         hypothetical protein [Cupriavidus pinatubonensis].                               <-499617796_?<-754010951_?||499617798_?->499617799_?->754010954_?->499617801_?->499617802_?->499617803_ParB+N6-MTase*->499617804_HTH->499617805_Terminase_LS->499617806_Phage_portal->499617807_MuF->499617808_?->499617809_?->499617810_?->
      498505877    <-UPF0150+RHH_1<-?<-?||ParB+N6-MTase*->?->Terminase_LS->?->?->P22_CoatProtein->                                                        ParB+N6-MTase                            ParBc+Methyltransf_26          C266_RS12110            457  bacteria>proteobacteria>betaproteobacteria        Pandoraea sp. SD6-2                                hypothetical protein [Pandoraea sp. SD6-2].                                      498505870_?->498505871_?->738769516_?-><-498505873_?<-498505874_UPF0150+RHH_1<-498505875_?<-738769443_?||498505877_ParB+N6-MTase*->738769518_?->738769519_Terminase_LS->498505880_?->498505881_?->498505882_P22_CoatProtein->498505883_?->498505884_?->
      560179856    <-MuF<-Phage_portal<-Terminase_LS<-HTH<-ParB+N6-MTase*                                                                                 ParB+N6-MTase                            ParBc+Methyltransf_26          U875_RS13765            457  bacteria>proteobacteria>betaproteobacteria        Pandoraea pnomenusa                                chromosome partitioning protein ParB [Pandoraea pnomenusa].                      <-560179849_?<-560179850_?<-685479048_?<-685479057_MuF<-560179853_Phage_portal<-560179854_Terminase_LS<-560179855_HTH<-560179856_ParB+N6-MTase*<-560179857_?<-560179858_?<-560179860_?<-753868762_?<-753868765_?<-560179862_?<-685479087_?
      564969930    HNH->?-><-?<-UPF0150+RHH_1<-?||ParB+N6-MTase*->HTH->Terminase_LS->Phage_portal->MuF->                                                  ParB+N6-MTase                            ParBc+Methyltransf_26          W930_RS0102395          457  bacteria>proteobacteria>betaproteobacteria        Pandoraea                                          MULTISPECIES: DNA methyltransferase [Pandoraea].                                 738747188_?->564969915_?->738747191_HNH->564969918_?-><-564969922_?<-564969925_UPF0150+RHH_1<-498505875_?||564969930_ParB+N6-MTase*->564969934_HTH->564969935_Terminase_LS->564969937_Phage_portal->564969940_MuF->655322380_?->564969947_?->564969955_?->
      698968189    <-UPF0150+RHH_1<-?<-?||ParB+N6-MTase*->?->Terminase_LS->?->?->P22_CoatProtein->                                                        ParB+N6-MTase                            ParBc+Methyltransf_26          LV28_24705              457  bacteria>proteobacteria>betaproteobacteria        Pandoraea pnomenusa                                chromosome partitioning protein ParB [Pandoraea pnomenusa].                      698968182_?->698968183_?->698968184_?-><-698968185_?<-698968186_UPF0150+RHH_1<-698968187_?<-698968188_?||698968189_ParB+N6-MTase*->698968190_?->698968191_Terminase_LS->698968192_?->698968193_?->698968194_P22_CoatProtein->698968195_?->698968196_?->
      260093825    <-MuF+PBECR3<-Phage_portal<-Terminase_LS<-HTH||?-><-ParB+N6-MTase*                                                                     ParB+N6-MTase                            ParBc+Methyltransf_26          HIAG_01531              452  bacteria>proteobacteria>gammaproteobacteria       Haemophilus influenzae NT127                       conserved hypothetical protein [Haemophilus influenzae NT127].                   <-260093818_?<-260093819_?<-260093820_MuF+PBECR3<-260093821_Phage_portal<-260093822_Terminase_LS<-260093823_HTH||260093824_?-><-260093825_ParB+N6-MTase*<-260093826_?<-260093827_?<-260093828_?<-260093829_?<-260093830_?||260093831_?->260093832_?->
      501001894    <-MuF+PBECR3<-Phage_portal<-Terminase_LS<-HTH||?-><-ParB+N6-MTase*                                                                     ParB+N6-MTase                            ParBc+Methyltransf_26          HIBPF_RS01765           451  bacteria>proteobacteria>gammaproteobacteria       Haemophilus influenzae                             hypothetical protein [Haemophilus influenzae].                                   <-503290935_?<-491959484_?<-752488870_MuF+PBECR3<-503290937_Phage_portal<-503290938_Terminase_LS<-491883473_HTH||491959498_?-><-501001894_ParB+N6-MTase*<-503290939_?<-503292806_?<-503290941_?<-503290942_?<-503290943_?||494052963_?->491883430_?->
      497199068    <-Phage_capsid<-Phage_portal<-?<-?<-?<-ParB+N6-MTase*                                                                                  ParB+N6-MTase                            ParBc+Methyltransf_26          OPIT5_RS03245           447  bacteria>verrucomicrobia                          Opitutaceae bacterium TAV5                         ParB domain protein nuclease [Opitutaceae bacterium TAV5].                       <-497199061_?<-645069321_?<-763429376_Phage_capsid<-763429377_Phage_portal<-497199065_?<-645069322_?<-497199067_?<-497199068_ParB+N6-MTase*<-497199069_?<-497199070_?<-645069323_?<-763429378_?<-497199073_?<-497199074_?<-645069324_?
      736778761    <-MuF<-Phage_portal<-?<-Terminase_LS<-HTH<-URI1<-?<-ParB+N6-MTase*                                                                     ParB+N6-MTase                            Methyltransf_26                C511_RS16595            446  bacteria>bacteroidetes                            Bacteroides graminisolvens                         hypothetical protein, partial [Bacteroides graminisolvens].                      <-736778755_MuF<-640566463_Phage_portal<-653066796_?<-640566462_Terminase_LS<-736778758_HTH<-640566460_URI1<-640566459_?<-736778761_ParB+N6-MTase*||640566457_?->736778764_?->640566455_?-><-640566454_?<-640566453_?<-640566452_?<-653066797_?
      157325361    ParB+N6-MTase*->?->gp79->                                                                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          LiPB054_gp77            445  viruses>dsdna viruses, no rna stage>caudovirales  Listeria phage B054                                gp77 [Listeria phage B054].                                                      157325353_?->157325354_?->157325355_?->157325357_?->157325358_?->157325359_?->157325360_?->157325361_ParB+N6-MTase*->157325362_?->157325363_gp79->157325364_?->
      489827744    <-MuF+MPTase<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase*                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          HT51_04190              445  bacteria>firmicutes                               Listeria monocytogenes                             chromosome partitioning protein ParB [Listeria monocytogenes].                   <-685901802_MuF+MPTase<-685901805_Phage_portal<-685901807_Terminase_LS<-685901809_HTH<-489827747_?<-489827746_gp79<-489827745_?<-489827744_ParB+N6-MTase*<-489827743_?<-489827741_?<-489827740_?<-489827739_?<-489827738_?<-489827736_?<-489827735_?
      499299925    ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->?->MuF->                                                                                ParB+N6-MTase                            ParBc+Methyltransf_26          LIN_RS06450             445  bacteria>firmicutes                               Listeria innocua                                   chromosome partitioning protein ParB [Listeria innocua].                         499299918_?->499299919_?-><-499299920_?||754507716_?->754507717_?->499299923_?->499299924_?->499299925_ParB+N6-MTase*->499299926_?->499299927_gp79->499299928_?->754507766_HTH->499299930_Terminase_LS->499299931_?->754507767_MuF->
      499300722    <-MuF+MPTase<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase*                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          LIN_RS08830             445  bacteria>firmicutes                               Listeria innocua                                   chromosome partitioning protein ParB [Listeria innocua].                         <-489827751_MuF+MPTase<-489827750_Phage_portal<-489827749_Terminase_LS<-489827748_HTH<-499299928_?<-499300721_gp79<-489827745_?<-499300722_ParB+N6-MTase*<-489827743_?<-499300723_?<-754507717_?<-499300724_?<-499300725_?<-499299919_?<-499299918_?
      518425096    <-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-ParB+N6-MTase*                                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          F823_RS0101690          445  bacteria>firmicutes                               Peptostreptococcus anaerobius                      chromosome partitioning protein ParB [Peptostreptococcus anaerobius].            <-518425093_?<-488934558_?<-488934559_MuF<-488934560_Phage_portal<-488934562_Terminase_LS<-518425094_HTH<-488934568_?<-518425096_ParB+N6-MTase*<-488934571_?<-518425098_?<-488934574_?<-488934575_?<-488934576_?<-488934577_?<-488934578_?
      685911099    ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF+MPTase->                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          HS95_01700              445  bacteria>firmicutes                               Listeria monocytogenes                             chromosome partitioning protein ParB [Listeria monocytogenes].                   685911096_?->685911097_?->489827738_?->489827739_?->489827740_?->489827741_?->489827743_?->685911099_ParB+N6-MTase*->489827745_?->489827746_gp79->685911100_?->489827748_HTH->685911102_Terminase_LS->685911103_Phage_portal->685911104_MuF+MPTase->
      746332729    <-MuF+MPTase<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase*                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          I794_RS04725            445  bacteria>firmicutes                               Listeria monocytogenes                             chromosome partitioning protein ParB [Listeria monocytogenes].                   <-746332714_MuF+MPTase<-489827750_Phage_portal<-746332716_Terminase_LS<-746332718_HTH<-746332721_?<-746332725_gp79<-489827745_?<-746332729_ParB+N6-MTase*<-746332790_?<-746332731_?<-746332734_?<-746332792_?<-746332736_?<-746332739_?<-746332741_?
      488292151    <-Terminase_LS<-HTH<-gp79<-?<-ParB+N6-MTase*                                                                                           ParB+N6-MTase                            ParBc+Methyltransf_26          D927_RS13925            444  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB [Enterococcus faecalis].                    <-658549994_Terminase_LS<-488292154_HTH<-498481714_gp79<-488292152_?<-488292151_ParB+N6-MTase*<-727131312_?<-488340732_?<-514887749_?<-642973505_?<-514887750_?<-488295138_?<-488295137_?
      488295148    N6-MTase->?->?->?->ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF->                                                  ParB+N6-MTase                            ParBc+Methyltransf_26          WUC_RS08035             444  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB [Enterococcus faecalis].                    488297008_?->488297009_?->488320992_?->488297012_N6-MTase->488331601_?->488295144_?->488295146_?->488295148_ParB+N6-MTase*->488292152_?->498479911_gp79->488297017_?->488295152_HTH->504337688_Terminase_LS->727003452_Phage_portal->488295155_MuF->
      488328083    <-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase*                                                                     ParB+N6-MTase                            ParBc+Methyltransf_26          HMPREF9518_RS16950      444  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB [Enterococcus faecalis].                    <-727197464_MuF<-488328077_Phage_portal<-488328078_Terminase_LS<-488328079_HTH<-488328080_?<-488328081_gp79<-488292152_?<-488328083_ParB+N6-MTase*<-488295146_?<-488328084_?<-488328086_?
      498397880    N6-MTase->?->?->?->?->ParB+N6-MTase*->?->gp79->Terminase_SS->Terminase_LS->Phage_portal->MuF->                                         ParB+N6-MTase                            ParBc+Methyltransf_26          WU9_RS08245             444  bacteria>firmicutes                               Enterococcus faecalis                              hypothetical protein [Enterococcus faecalis].                                    488297009_?->488320992_?->488299640_N6-MTase->488299639_?->498398227_?->488293129_?->498397879_?->498397880_ParB+N6-MTase*->498397881_?->498397882_gp79->498397883_Terminase_SS->498397884_Terminase_LS->498397885_Phage_portal->514889620_MuF->498397887_?->
      498481713    N6-MTase->?->?->?->?->ParB+N6-MTase*->?->gp79->HTH->Terminase_LS->Phage_portal->MuF->                                                  ParB+N6-MTase                            ParBc+Methyltransf_26          UMY_RS13155             444  bacteria>firmicutes                               Enterococcus faecalis                              hypothetical protein [Enterococcus faecalis].                                    488298694_?->488298693_?->488298692_N6-MTase->488298691_?->498481711_?->488298687_?->498481712_?->498481713_ParB+N6-MTase*->488292152_?->498481714_gp79->498481715_HTH->727155214_Terminase_LS->727155215_Phage_portal->498481718_MuF->642980108_?->
      498526876    N6-MTase->?->?->?->?->ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF->                                               ParB+N6-MTase                            ParBc+Methyltransf_26          WOK_RS09735             444  bacteria>firmicutes                               Enterococcus faecalis                              hypothetical protein [Enterococcus faecalis].                                    488299642_?->488320992_?->488299640_N6-MTase->488299639_?->498398227_?->498526875_?->488295146_?->498526876_ParB+N6-MTase*->488292152_?->498479911_gp79->498526877_?->498526878_HTH->727020034_Terminase_LS->727020036_Phage_portal->488293090_MuF->
      506557932    ParB+N6-MTase*->?->gp79->                                                                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          D351_RS17740            444  bacteria>firmicutes                               Enterococcus                                       MULTISPECIES: DNA adenine methyltransferase [Enterococcus].                      642981009_?->498470081_?->514905202_?->514905317_?->506557932_ParB+N6-MTase*->506557933_?->488319782_gp79->
      514889617    <-MuF<-Phage_portal<-Terminase_LS<-Terminase_SS<-gp79<-?<-ParB+N6-MTase*                                                               ParB+N6-MTase                            ParBc+Methyltransf_26          D349_RS04690            444  bacteria>firmicutes                               Enterococcus faecalis                              ParB-like protein [Enterococcus faecalis].                                       <-498397887_?<-514889620_MuF<-514889619_Phage_portal<-498397884_Terminase_LS<-514889618_Terminase_SS<-498397882_gp79<-498397881_?<-514889617_ParB+N6-MTase*<-514908240_?||514908242_?->
      514907821    ParB+N6-MTase*->?->gp79->HTH->Terminase_LS->Phage_portal->?->Phage_GP20->                                                              ParB+N6-MTase                            ParBc+Methyltransf_26          D350_RS16800            444  bacteria>firmicutes                               Enterococcus faecalis                              ParB-like protein [Enterococcus faecalis].                                       <-514907818_?||514907819_?->727198410_?->514907821_ParB+N6-MTase*->514907822_?->642980399_gp79->514907824_HTH->514907825_Terminase_LS->514907826_Phage_portal->514907827_?->514907828_Phage_GP20->
      640121481    ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF->                                                                     ParB+N6-MTase                            ParBc+Methyltransf_26          JF27_RS10635            444  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB [Enterococcus faecalis].                    640121466_?->488327247_?->640121471_?->488333040_?->488296063_?->488293129_?->488295146_?->640121481_ParB+N6-MTase*->488292152_?->498479911_gp79->488297017_?->640121490_HTH->498481013_Terminase_LS->640121493_Phage_portal->488337259_MuF->
      694245431    <-Terminase_LS<-Terminase_SS<-gp79<-?<-ParB+N6-MTase*                                                                                  ParB+N6-MTase                            ParBc+Methyltransf_26          ES21_06495              444  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB [Enterococcus faecalis].                    <-694245427_Terminase_LS<-694245428_Terminase_SS<-694245429_gp79<-694245430_?<-694245431_ParB+N6-MTase*<-694245432_?<-694245433_?<-694245434_?<-694245435_?<-694245436_?<-694245437_?<-694245438_?
      727050824    MazG-Phage->?->?->?->?->?->?->ParB+N6-MTase*->?->gp79->Terminase_SS->Terminase_LS->Phage_portal->MuF->                                 ParB+N6-MTase                            ParBc+Methyltransf_26          P791_RS10720            444  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB [Enterococcus faecalis].                    488300839_MazG-Phage->488333106_?->488285809_?->488300837_?->498521068_?->498521067_?->727050818_?->727050824_ParB+N6-MTase*->488292152_?->498397882_gp79->498406766_Terminase_SS->658430911_Terminase_LS->498407969_Phage_portal->727050857_MuF->488311051_?->
      495977294    <-P22_CoatProtein<-?<-MuF<-?<-Terminase_LS<-HTH<-ParB+N6-MTase*<-?||?-><-?<-?<-?<-?<-DAM                                               ParB+N6-MTase                            ParBc+Methyltransf_26          FSDG_RS10625            441  bacteria>fusobacteria                             Fusobacterium nucleatum                            hypothetical protein [Fusobacterium nucleatum].                                  <-696266129_?<-495977306_P22_CoatProtein<-495977303_?<-495977301_MuF<-495977299_?<-495977297_Terminase_LS<-495977295_HTH<-495977294_ParB+N6-MTase*<-495977293_?||495977291_?-><-495977289_?<-495977286_?<-495977285_?<-495977284_?<-495977283_DAM
      775458612    <-MuF<-?<-Terminase_LS<-?<-ParB+N6-MTase*                                                                                              ParB+N6-MTase                            Methyltransf_26                BAQ92410.1              441  viruses                                           uncultured Mediterranean phage uvMED               DNA modification N6-MTase (COG0863) [uncultured Mediterranean phage uvMED].     <-775458605_?<-775458606_?<-775458607_?<-775458608_MuF<-775458609_?<-775458610_Terminase_LS<-775458611_?<-775458612_ParB+N6-MTase*<-775458613_?<-775458614_?<-775458615_?||775458616_?-><-775458617_?<-775458618_?||775458619_?->
      491883456    -                                                                                                                                      ParB+N6-MTase                            ParBc+Methyltransf_26                                  376  bacteria>proteobacteria>gammaproteobacteria       Haemophilus influenzae                             plasmid partitioning protein ParB [Haemophilus influenzae].                      
      675161105    <-Phage_GP20<-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-ParB+N6-MTase*                                                                  ParB+N6-MTase                            Methyltransf_26                HMPREF5175_00726        369  bacteria>firmicutes                               Lactobacillus gasseri SV-16A-US                    hypothetical protein HMPREF5175_00726 [Lactobacillus gasseri SV-16A-US].         <-675161098_?<-675161099_Phage_GP20<-675161100_MuF<-675161101_Phage_portal<-675161102_Terminase_LS<-675161103_HTH<-675161104_?<-675161105_ParB+N6-MTase*<-675161106_?<-675161107_?<-675161108_?<-675161109_?<-675161110_?<-675161111_?<-675161112_?
      763137539    <-MuF<-?<-?<-Terminase_LS<-?<-N6-MTase*                                                                                                N6-MTase                                 Methyltransf_26                QI63_RS07900            330  bacteria>spirochaetes                             Treponema sp. OMZ 838                              hypothetical protein, partial [Treponema sp. OMZ 838].                           <-763135633_?<-763135634_?<-763135636_MuF<-763135638_?<-763137537_?<-763135640_Terminase_LS<-763135642_?<-763137539_N6-MTase*<-763135644_?<-763135645_?<-763135647_?<-763135649_?<-763135651_?<-763135653_?<-763135656_?
      738746241    <-N6-MTase*                                                                                                                            N6-MTase                                 Methyltransf_26                CG50_RS14910            327  bacteria>proteobacteria>alphaproteobacteria       Paenirhodobacter enshiensis                        DNA methyltransferase [Paenirhodobacter enshiensis].                             <-738746247_?<-738746249_?<-738746232_?||738746234_?->738746251_?->738746236_?-><-738746238_?<-738746241_N6-MTase*||738746242_?->
      282557516    ParB+N6-MTase*->?->?->HTH->Terminase_LS->Phage_portal->MuF->Phage_GP20->                                                               ParB+N6-MTase                            ParBc+Methyltransf_26          HMPREF9209_0590         325  bacteria>firmicutes                               Lactobacillus gasseri 224-1                        ParB-like protein [Lactobacillus gasseri 224-1].                                 282557509_?->282557510_?->282557511_?->282557512_?->282557513_?->282557514_?->282557515_?->282557516_ParB+N6-MTase*->282557517_?->282557518_?->282557519_HTH->282557520_Terminase_LS->282557521_Phage_portal->282557522_MuF->282557523_Phage_GP20->
      502831711    <-N6-MTase*<-ParB<-?<-?<-?<-HTH                                                                                                        N6-MTase                                 Methyltransf_26                U717_RS25485            324  bacteria>proteobacteria>alphaproteobacteria       Rhodobacter capsulatus                             DNA methyltransferase [Rhodobacter capsulatus].                                  <-739227891_?<-502831705_?<-665954923_?<-502831707_?||502831708_?->665954925_?-><-502831710_?<-502831711_N6-MTase*<-739227894_ParB<-739227896_?<-665954928_?<-502831715_?<-502831716_HTH||665954930_?->502831718_?->
      565833115    HTH->?->?->?->ParB->N6-MTase*->                                                                                                        N6-MTase                                 Methyltransf_26                U713_RS23890            322  bacteria>proteobacteria>alphaproteobacteria       Rhodobacter capsulatus                             DNA methyltransferase [Rhodobacter capsulatus].                                  <-565833101_?<-565833103_?||565833105_HTH->665960137_?->665960138_?->739227896_?->565833113_ParB->565833115_N6-MTase*->565833117_?-><-665960140_?<-665960141_?<-665960142_?||565833125_?->665960143_?->565833129_?->
      68058078     ParB+N6-MTase*->?-><-?||HTH->Terminase_LS->Phage_portal->MuF->                                                                         ParB+N6-MTase                            ParBc+N6_N4_Mtase              NTHI1522                319  bacteria>proteobacteria>gammaproteobacteria       Haemophilus influenzae 86-028NP                    hypothetical protein NTHI1522 [Haemophilus influenzae 86-028NP].                 <-68058071_?<-68058072_?||68058073_?->68058074_?->68058075_?->68058076_?->68058077_?->68058078_ParB+N6-MTase*->68058079_?-><-68058080_?||68058081_HTH->68058082_Terminase_LS->68058083_Phage_portal->68058084_MuF->68058085_?->
      775455096    N6-MTase*->?->?-><-?<-DAMT1                                                                                                            N6-MTase                                 Methyltransf_26                 BAQ89246.1             316  viruses                                           uncultured Mediterranean phage uvMED               DNA modification N6-MTase (COG0863) [uncultured Mediterranean phage uvMED].     <-775455089_?||775455090_?->775455091_?->775455092_?->775455093_?->775455094_?->775455095_?->775455096_N6-MTase*->775455097_?->775455098_?-><-775455099_?<-775455100_DAMT1<-775455101_?||775455102_?->775455103_?->
      655293784    <-N6-MTase*<-?<-?<-?<-?<-DAM<-SNF<-DCM                                                                                                 N6-MTase                                 Methyltransf_26                 H590_RS0105875         309  bacteria>actinobacteria                           Propionibacterium jensenii                         hypothetical protein [Propionibacterium jensenii].                               <-655293778_?<-655293779_?<-655293780_?<-739111574_?<-655293781_?<-655293782_?<-655293783_?<-655293784_N6-MTase*<-655293785_?<-739111630_?<-655293787_?<-655293788_?<-655293789_DAM<-655293790_SNF<-655293791_DCM
      768317646    N6-MTase*->?->?->?-><-?||?->?->HNH->                                                                                                   N6-MTase                                 Methyltransf_26                 T261_5811              309  bacteria>actinobacteria                           Streptomyces lydicus A02                           ParB domain protein nuclease [Streptomyces lydicus A02].                         768317639_?->768317640_?->768317641_?->768317642_?->768317643_?->768317644_?->768317645_?->768317646_N6-MTase*->768317647_?->768317648_?->768317649_?-><-768317650_?||768317651_?->768317652_?->768317653_HNH->
      499633394    <-N6-MTase*<-ParB                                                                                                                      N6-MTase                                 Methyltransf_26                 NWI_RS04225            308  bacteria>proteobacteria>alphaproteobacteria       Nitrobacter winogradskyi                           DNA methyltransferase [Nitrobacter winogradskyi].                                <-499633388_?<-752697874_?<-499633390_?||499633391_?->752697876_?->752697160_?-><-499633393_?<-499633394_N6-MTase*<-752697878_ParB<-499633396_?<-499633397_?<-499633398_?<-499633399_?<-499633400_?<-499633401_?
      307772411    Phage_GPD->?->?->N6-MTase*->                                                                                                           N6-MTase                                 Methyltransf_26                 TRICHSKD4_2718         305  bacteria>proteobacteria>alphaproteobacteria       Roseibium sp. TrichSKD4                            gp77 [Roseibium sp. TrichSKD4].                                                  307772404_?->307772405_?->307772406_?->307772407_?->307772408_Phage_GPD->307772409_?->307772410_?->307772411_N6-MTase*->307772412_?->307772413_?->307772414_?-><-307772415_?||307772416_?->307772417_?->307772418_?->
      740017278    <-N6-MTase*                                                                                                                            N6-MTase                                 Methyltransf_26                 IG92_RS0128815         304  bacteria>actinobacteria                           Streptomyces sp. NRRL S-1868                       chromosome partitioning protein ParB [Streptomyces sp. NRRL S-1868].             <-664361403_?<-664361406_?<-664361409_?<-664361411_?<-664361413_?<-740017278_N6-MTase*||664361418_?->664361421_?->664361424_?->664361427_?->664361430_?->664361433_?->664361436_?->
      516125430    <-N6-MTase*                                                                                                                            N6-MTase                                 Methyltransf_26                 C892_RS0103080         301  bacteria>actinobacteria                           Nocardiopsis baichengensis                         chromosome partitioning protein ParB [Nocardiopsis baichengensis].               516125423_?-><-516125424_?||648432354_?-><-516125426_?<-750403313_?||750403314_?-><-516125429_?<-516125430_N6-MTase*<-516125431_?||516125433_?-><-516125434_?<-516125435_?||516125436_?->516125437_?->516125438_?->
      750334917    Phage_GPD->?->N6-MTase*->                                                                                                              N6-MTase                                 Methyltransf_26                 TRICHSKD4_RS11975      300  bacteria>proteobacteria>alphaproteobacteria       Roseibium sp. TrichSKD4                            hypothetical protein, partial [Roseibium sp. TrichSKD4].                         <-497094044_?||497094045_?->497092480_?->497092478_?->497092476_?->497092474_Phage_GPD->497092472_?->750334917_N6-MTase*->497094050_?->750334871_?->750334872_?->750334873_?->750334874_?-><-750334918_?<-750334875_?
      304360713    N6-MTase*->                                                                                                                            N6-MTase                                 Methyltransf_26                 phiCTP1_gp51           299  viruses>dsdna viruses, no rna stage>caudovirales  Clostridium phage phiCTP1                          hypothetical protein phiCTP1_gp51 [Clostridium phage phiCTP1].                   304360706_?->304360707_?->304360708_?->304360709_?->304360710_?->304360711_?->304360712_?->304360713_N6-MTase*->304360714_?->304360715_?->304360716_?->304360717_?->304360718_?->304360719_?->304360720_?->
      686217757    N6-MTase*->                                                                                                                            N6-MTase                                 Methyltransf_26                 T245_RS0104245         298  bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus                            hypothetical protein, partial [Vibrio parahaemolyticus].                         686217757_N6-MTase*->686217758_?->686217759_?->
      739963074    <-MuF<-Phage_portal<-?<-HNH<-?<-?<-N6-MTase*                                                                                           N6-MTase                                 Methyltransf_26                 B073_RS0131580         295  bacteria>actinobacteria                           Streptomyces sp. MspMP-M5                          chromosome partitioning protein ParB [Streptomyces sp. MspMP-M5].                <-739963064_?<-739963066_MuF<-739963069_Phage_portal<-648556155_?<-648556156_HNH<-739963071_?<-517366965_?<-739963074_N6-MTase*<-739963075_?<-517366968_?<-517366969_?<-739963077_?<-517366972_?<-517366973_?<-739963079_?
      648436510    N6-MTase*->?->?->MuF->?->Terminase_LS-><-Phage_portal                                                                                  N6-MTase                                 Methyltransf_26+N6_N4_Mtase     D471_RS0130950         292  bacteria>actinobacteria                           Nocardiopsis lucentensis                           hypothetical protein [Nocardiopsis lucentensis].                                 516189936_?->648436510_N6-MTase*->516189946_?->516189949_?->516189952_MuF->516189954_?->516189956_Terminase_LS-><-750410233_Phage_portal||750410236_?->
      752848289    ParB->?->ParB->N6-MTase*->ParB->?->Terminase_LS->Phage_portal->MuF->?->P22_CoatProtein->                                               N6-MTase                                 SP+Methyltransf_26              D881_RS16175           270  bacteria>actinobacteria                           Corynebacterium ulcerans                           hypothetical protein, partial [Corynebacterium ulcerans].                        665903459_?->504648867_?->560537641_?->504648869_?->757634954_ParB->560537654_?->757634957_ParB->752848289_N6-MTase*->757634964_ParB->503677968_?->560537665_Terminase_LS->560537671_Phage_portal->560537676_MuF->560537682_?->560537686_P22_CoatProtein->
      764990690    <-Antirestrict<-?<-?<-MuF<-?<-Terminase_LS<-?<-N6-MTase*<-?<-?<-?<-?<-VRRNUC                                                           N6-MTase                                 Methyltransf_26                 AMBLS11_RS12625        270  bacteria>proteobacteria>gammaproteobacteria       Alteromonas macleodii                              hypothetical protein, partial [Alteromonas macleodii].                           <-764990687_Antirestrict<-504811868_?<-504811869_?<-764990688_MuF<-504811871_?<-764990689_Terminase_LS<-504811873_?<-764990690_N6-MTase*<-504811875_?<-504811876_?<-764990487_?<-504811878_?<-504811879_VRRNUC<-504811880_?<-504811881_?
      754504471    <-N6-MTase*<-ParB||N6-MTase->?-><-HNH                                                                                                  N6-MTase                                 Methyltransf_26                 SINME_RS11520          256  bacteria>proteobacteria>alphaproteobacteria       Sinorhizobium meliloti                             hypothetical protein, partial [Sinorhizobium meliloti].                          <-754504418_?||754504470_?->503610625_?-><-503610626_?<-503610627_?<-503610628_?<-503610629_?<-754504471_N6-MTase*<-754504472_ParB||503610630_N6-MTase->503610631_?-><-503610632_HNH<-503610633_?<-754504473_?<-754504474_?
      728810984    N6-MTase*->?->gp79->HTH->Terminase_LS->Phage_portal->MuF+MPTase->                                                                      N6-MTase                                 Methyltransf_26                 QR19_RS07135           228  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB, partial [Enterococcus faecalis].           728810984_N6-MTase*->488292152_?->498481714_gp79->728810986_HTH->728810987_Terminase_LS->728810989_Phage_portal->728810990_MuF+MPTase->488311052_?->
      738605470    N6-MTase->?->?->?->?->ParB->N6-MTase*->?-><-?||?->?->?->?->MuF->                                                                       N6-MTase                                 Methyltransf_26                 DM07_RS11075           223  bacteria>proteobacteria>alphaproteobacteria       Oceanicaulis sp. HL-87                             DNA methyltransferase, partial [Oceanicaulis sp. HL-87].                         738603777_?->738603780_N6-MTase->738603783_?->738603785_?->738603788_?->738605465_?->738605468_ParB->738605470_N6-MTase*->738603791_?-><-738603794_?||738603798_?->738605473_?->738605476_?->738605479_?->738603799_MuF->
      736650700    ParB->N6-MTase*->ParB->CRISPR_assoc->?->?->?->?->N6-MTase->                                                                            N6-MTase                                 Methyltransf_26                 HMPREF1261_RS02180     222  bacteria>actinobacteria                           Corynebacterium sp. KPL1818                        hypothetical protein, partial [Corynebacterium sp. KPL1818].                     736650695_?->552852507_?->552852510_?->736650696_?->552852517_?->552852521_?->736650698_ParB->736650700_N6-MTase*->552852530_ParB->552852532_CRISPR_assoc->552852535_?->552852539_?->552852542_?->552852545_?->736650701_N6-MTase->
      736660310    <-Thy1<-?<-?<-?<-ParB<-N6-MTase*<-ParB                                                                                                 N6-MTase                                 Methyltransf_26                 HMPREF1267_RS11500     222  bacteria>actinobacteria                           Corynebacterium sp. KPL1824                        hypothetical protein, partial [Corynebacterium sp. KPL1824].                     <-736660162_?||736660164_?-><-552836611_Thy1<-552836618_?<-552836623_?<-552836631_?<-552836639_ParB<-736660310_N6-MTase*<-736660312_ParB<-552836652_?<-552836659_?<-552836663_?<-552836671_?<-552836675_?<-552836682_?
      750124080    <-N6-MTase*<-?<-?<-?<-?<-?<-CRISPR_assoc                                                                                               N6-MTase                                 N6_N4_Mtase                     A3EC_RS11435           206  bacteria>actinobacteria                           Corynebacterium ulceribovis                        hypothetical protein, partial [Corynebacterium ulceribovis].                     <-516655108_?<-516655109_?<-516655110_?<-516655111_?<-750124076_?<-750124078_?<-516655115_?<-750124080_N6-MTase*<-516655117_?<-516655118_?<-516655119_?<-516655120_?<-516655121_?<-750124082_CRISPR_assoc<-750124084_?
      # 7;                                                                                                                                                                                                                                                   
      189432761    <-MuF<-?<-Phage_portal<-Terminase_LS<-?<-?<-ParB+N6-MTase*                                                                             ParB+N6-MTase                            ParBc+Methyltransf_26          BACCOP_01158            518  bacteria>bacteroidetes                            Bacteroides coprocola DSM 17136                    hypothetical protein BACCOP_01158 [Bacteroides coprocola DSM 17136].             189432754_?-><-189432755_MuF<-189432756_?<-189432757_Phage_portal<-189432758_Terminase_LS<-189432759_?<-189432760_?<-189432761_ParB+N6-MTase*<-189432762_?<-189432763_?<-189432764_?<-189432765_?<-189432766_?<-189432767_?<-189432768_?
      652946519    ParB+N6-MTase*->?->?->Terminase_LS->Phage_portal->MuF->                                                                                ParB+N6-MTase                            ParBc+Methyltransf_26          K252_RS0101690          518  bacteria>bacteroidetes                            Butyricimonas virosa                               hypothetical protein [Butyricimonas virosa].                                     652946514_?->736480518_?->652946515_?->494838952_?->652946516_?->652946517_?->652946518_?->652946519_ParB+N6-MTase*->652946520_?->652946521_?->736480520_Terminase_LS->652946522_Phage_portal->736480522_MuF-><-652946523_?<-736480477_?
      740821721    ParB+N6-MTase*->?->?->Terminase_LS->Phage_portal->MuF->                                                                                ParB+N6-MTase                            Methyltransf_26                GV66_RS18355            518  bacteria>bacteroidetes                            Bacteroides dorei                                  chromosome partitioning protein ParB [Bacteroides dorei].                        740821704_?->652946513_?->740821707_?->740821710_?->740821713_?->740821715_?->740821718_?->740821721_ParB+N6-MTase*->652946520_?->494838940_?->740822740_Terminase_LS->740821727_Phage_portal->740822743_MuF-><-740821731_?<-494838927_?
      498101954    <-MuF<-?<-Phage_portal<-Terminase_LS<-?<-?<-?<-ParB+N6-MTase*<-?<-?<-?<-?<-?<-N6-MTase                                                 ParB+N6-MTase                            Methyltransf_26                ATH1_RS0100585          515  bacteria>bacteroidetes                            Anaerophaga thermohalophila                        hypothetical protein [Anaerophaga thermohalophila].                              <-763245641_MuF<-498101940_?<-498101943_Phage_portal<-498101946_Terminase_LS<-656214691_?<-498101951_?<-656214692_?<-498101954_ParB+N6-MTase*<-498101957_?<-498101960_?<-656214693_?<-498101969_?<-498101972_?<-498101979_N6-MTase<-498101981_?
      518076130    DCM->?->?->?->?->?->ParB+N6-MTase*->?->?->Terminase_LS->Phage_portal->ParB->MuF->                                                      ParB+N6-MTase                            Methyltransf_26                BN352_RS08925           512  bacteria>bacteroidetes                            Candidatus Alistipes marseilloanorexicus           chromosome partitioning protein ParB [Candidatus Alistipes marseilloanorexicus]. 518076122_?->518076123_DCM->518076125_?->518076126_?->518076127_?->736151789_?->518076129_?->518076130_ParB+N6-MTase*->648397961_?->518076131_?->518076132_Terminase_LS->518076133_Phage_portal->518076134_ParB->648397962_MuF-><-518076136_?
      749916162    <-MuF<-Phage_portal<-Terminase_LS<-?<-?<-ParB+N6-MTase*                                                                                ParB+N6-MTase                            ParBc+Methyltransf_26          BACCOP_RS00640          500  bacteria>bacteroidetes                            Bacteroides coprocola                              chromosome partitioning protein ParB, partial [Bacteroides coprocola].           494838927_?->494838929_?-><-749916157_MuF<-494838935_Phage_portal<-749916159_Terminase_LS<-494838940_?<-749916160_?<-749916162_ParB+N6-MTase*<-494838946_?<-494838948_?<-494838950_?<-494838952_?<-494838954_?<-749916163_?<-494838962_?
      754558370    ParB+N6-MTase*->?->?->Terminase_LS->                                                                                                   ParB+N6-MTase                            Methyltransf_26                B157_RS0110235          444  bacteria>bacteroidetes                            Spirosoma spitsbergense                            hypothetical protein, partial [Spirosoma spitsbergense].                         <-522092021_?<-522092022_?<-522092023_?<-754558369_?<-522092025_?<-522092026_?<-522092027_?||754558370_ParB+N6-MTase*->522092029_?->522092030_?->754558371_Terminase_LS->522092032_?->522092033_?->522092034_?->522092035_?->
      # 4;                                                                                                                                                                                                                                                       
      393402237    ParB->?->ParB+N6-MTase*->ParB->?->Terminase_LS->Phage_portal->MuF->?->P22_CoatProtein->                                                ParB+N6-MTase                            ParBc+Methyltransf_26          CULC0102_0528           516  bacteria>actinobacteria                           Corynebacterium ulcerans 0102                      hypothetical protein CULC0102_0528 [Corynebacterium ulcerans 0102].              393402230_?->393402231_?->393402232_?->393402233_?->393402234_?->393402235_ParB->393402236_?->393402237_ParB+N6-MTase*->393402238_ParB->393402239_?->393402240_Terminase_LS->393402241_Phage_portal->393402242_MuF->393402243_?->393402244_P22_CoatProtein->
      550751922    <-Thy1<-?<-?<-?<-ParB<-ParB+N6-MTase*                                                                                                  ParB+N6-MTase                            ParBc+Methyltransf_26          HMPREF1267_02363        505  bacteria>actinobacteria                           Corynebacterium sp. KPL1824                        hypothetical protein HMPREF1267_02363 [Corynebacterium sp. KPL1824].             <-550751915_?<-550751916_?<-550751917_Thy1<-550751918_?<-550751919_?<-550751920_?<-550751921_ParB<-550751922_ParB+N6-MTase*<-550751923_?<-550751924_?<-550751925_?<-550751926_?<-550751927_?<-550751928_?<-550751929_?
      550761804    ParB+N6-MTase*->ParB->CRISPR_assoc->?->?->?->?->N6-MTase->                                                                             ParB+N6-MTase                            ParBc+Methyltransf_26          HMPREF1261_00448        505  bacteria>actinobacteria                           Corynebacterium sp. KPL1818                        hypothetical protein HMPREF1261_00448 [Corynebacterium sp. KPL1818].             550761797_?->550761798_?->550761799_?->550761800_?->550761801_?->550761802_?->550761803_?->550761804_ParB+N6-MTase*->550761805_ParB->550761806_CRISPR_assoc->550761807_?->550761808_?->550761809_?->550761810_?->550761811_N6-MTase->
      806900839    ParB+N6-MTase*->?->Terminase_LS->Phage_portal->?->Phage_capsid->                                                                       ParB+N6-MTase                            ParBc+Methyltransf_26          ERS075618_03274         503  bacteria>actinobacteria                           Mycobacterium abscessus                            ParB-like nuclease domain [Mycobacterium abscessus].                             806900832_?->806900833_?->806900834_?->806900835_?->806900836_?->806900837_?->806900838_?->806900839_ParB+N6-MTase*->806900840_?->806900841_Terminase_LS->806900842_Phage_portal->806900843_?->806900844_Phage_capsid->806900845_?->806900846_?->
      # 1;                                                                                                                                                                                                                                                   
      728810915    N6-MTase*->                                                                                                                            N6-MTase                                 Methyltransf_26                QR19_RS04500            147  bacteria>firmicutes                               Enterococcus faecalis                              chromosome partitioning protein ParB, partial [Enterococcus faecalis].           728810915_N6-MTase*->
      68058079     ParB+N6-MTase->?*-><-?||HTH->Terminase_LS->Phage_portal->MuF->                                                                         -                                        -                              NTHI1523                139  bacteria>proteobacteria>gammaproteobacteria       Haemophilus influenzae 86-028NP                    predicted DNA modification N6-MTase [Haemophilus influenzae 86-028NP].          <-68058072_?||68058073_?->68058074_?->68058075_?->68058076_?->68058077_?->68058078_ParB+N6-MTase->68058079_?*-><-68058080_?||68058081_HTH->68058082_Terminase_LS->68058083_Phage_portal->68058084_MuF->68058085_?->68058086_?->
      738080913    <-MuF<-Phage_portal<-Terminase_LS<-HTH||?-><-N6-MTase*<-ParB                                                                           N6-MTase                                 N6_N4_Mtase                    LA03_RS30870            212  bacteria>proteobacteria>betaproteobacteria        Burkholderia gladioli                              hypothetical protein, partial [Burkholderia gladioli].                           <-738080803_?<-738080907_?<-738080804_MuF<-738080909_Phage_portal<-738080806_Terminase_LS<-738080911_HTH||738080808_?-><-738080913_N6-MTase*<-738080914_ParB||738080810_?-><-738080811_?<-738080813_?<-738080815_?<-738080817_?<-738080819_?
      739526353    Radical-SAM+N6-MTase*->                                                                                                                Radical-SAM+N6-MTase                     Methyltransf_26                ER57_RS06425            739  bacteria>proteobacteria>deltaproteobacteria       Smithella sp. SCADC                                hypothetical protein [Smithella sp. SCADC].                                      739526334_?->739526335_?->739526336_?->739526338_?->739526342_?->739526345_?->739526348_?->739526353_Radical-SAM+N6-MTase*->739526355_?->739526358_?->739526361_?->
      399528699    ASCH->ParB+N6-MTase*->                                                                                                                 ParB+N6-MTase                            SP+Methyltransf_26             B620_gp66               452  viruses>dsdna viruses, no rna stage>caudovirales  Croceibacter phage P2559S                          hypothetical protein P2559S_66 [Croceibacter phage P2559S].                      399528692_?->399528693_?->399528694_?->399528695_?->399528696_?->399528697_?->399528698_ASCH->399528699_ParB+N6-MTase*->399528700_?->399528701_?->
      
      Back to Contents
    • Multiple sequence alignment of the PCIF1-like/Group2-Clade 1 of N6-MTase

                                                                                           Str-1                                                                                                                   Str-1                            Str-2                                        Str-3                              Str-4                                                               Str-5                                             Str-6                                               Str-7                                                                                                                                          
      FINAL                                  --EEEEE-E-------------------------------------EEEE-HHHHHHHHHHHH----------------H------------------HH-------------------------HHHHHHHHHHHHHHHH-H---------------------------------------HHHHHHHHHH-------EEE------------------------------------------------------------------------------EE-----HHH---------HHHHHH-------HHHHHHHH--------------------------EEEEEE-----------------------HH-HHHHHHH------------EEEE-----EE------E-----------EE-------EEEEE--E----EEEEEEHHHH-----HH--------H--H--HHHHHHHHH--
      ALIGN                                  ----EEE----------------------------------------HHHHHHHHHHHHHHHH----------------H------------------HH-------------------------HHHHHHHHHHHHHHHH-H---------------------------------------HHHHHHHHH--------EE------------------------------E------------EE------------------------------------------HH---------HHHHHH-------HHHHHHHHH-------------------------EEEEEE------------------------H-HHHHHH-------------HHHHH------------H-----------H----------EE--------EEEEEEE-----------------HH--H--HHHHHHHH---
      HMM                                    --EEEEE-EE-------------------------------------HHHHHHHHHHHHHHHH-----------------------------------HH-------------------------HHHHHHHHHHHHHHHH--------------------------EEE------------HHHHHHHHHH--H-----H---HHHHHHH-------HHHH-HHHHHHHH-------------EE--E--EE--------------------------EEEE----HHH---------HHHHHH-------HHHHHHHHHH---------------H--------EEEEEE---E-------------------HH-HHHHHHH----HH-------HEEHHH--EEEE----E-----------EE--E----EEEEE--E----EEEEEEHHHH-----HH--H----HH--H--HHHHHHHHH--
      FREQ                                   --EEEEE-E-------------------------------------EEEE---HHHHHHHHHH----------------E------------------------------------------------HHHHHHHHHHHHH-H--------------------------------------HHEEHHHHHHH---EE--EEE----------------EEE-----------------------------------------------------------------HHHH---------HHHHHH-------HHHHHH-----------------------------EEEEE--------------------------HHHHHHH---HHHH-----HHEE----HH-------E-----------EE----H--HHHEE--H-HHHHHEEEHH---------------------H--HHHHHHHHH--
      PSSM                                   --EEEEE-E--------------------------------------EE--HHHHHHHHHHHH----------------H----------------------------------------------HHHHHHHHHHHHHHH-------------------------------------------HHHHHHHH----E--E--------------------------------------------------------------------------------EEE-----HH---------HHHHHH-------HHHHHHHHH-------------------------EEEEE-------------------------H-HHHHHH----------------------------------------------------EEEE--------EEEEE--HH-----HH--H-----H--H--HHHHHHHH---
      Homo_sapiens_18034767                  NNVVCIR-YK--------GE--------------------------MVKVSRNYFSKLWLLYR----------------YS----C--IDDSA-----FE-------------------------RFLPRVWCLLRRYQMM-FG-VGLY---EG------T-----GLQGSL---------PVHVFEALHRL-FGVS--FECFASPLNCYF-------RQYCSAFPDTDGYFGSRGP-------CL--D--FAPL---S-------------------GSFEANPPFCEE---------LMDAMV-------SHFERLLESS---------------P---E--PLSFIVFI---PEW-REP---------P---TP-ALTRMEQ---SRFKR----HQLILPAFEHEYRSGSQH-----------IC--K-K-EEMHYK--A-VHNTAVLFLQNDP-----GF--AKWAPTP--E--RLQELSAAYRQ
      Aureococcus_anophagefferens_323451439  LPQTREL-LR--------HE--------------------------LAPASNLAYDTLGCVVI---DH---VAPA----RR-------ARDAD-----QE---------ALDA------------AAEAARLALVDDAAAL-VA-ARRG---GG------G-----GDAARLWARLDGRAADCRRLAALAAA-SAADR-ARADAARRSSLK-------APWATALDDV---FGPDGS--LEL--NV--A--LAPT---RRMKGLDTWRAVRWLGAQHPRRVSLQPPVLFE--Y------YAEALAWGDAGATKAVGRMLLDG------GEG------E------PGRHALLY-A-PPR-ARG---------P---AS-VFCALCR---AMERSG---DPSEIAASAADYVACFDW-----------VR--S-------------VCGRGAFAAALAD-R---GD--DAFGGAA-----VLHALCAATPS
      Tetrahymena_thermophila_89295554       LQQRYQN-FQ--T--I--QD------------------------E-QDQSDQEEVNQLLSQEE---QN-----------KD--L-S--ISDQR-SQG-NN-IAHKDT------------------IMNQKIFLTLFWYDYI--G-I------QN------------GQQWSL---------NSEVFDLLKQF-LNIN--TEVFASPFNRNL-------ENYFSLF-ESDKYFGSFGN--FNKN-YL--N-------I-Q-------------------QNFQANPPFIDN---------LFTHFA-------AQILQILEIN---------------TQNNR--EIGCVIVF---P-W-QDN---------Q------GYYQLQN---SDYFI----DEIELLKNAHYYTDQNSC--------SS-IK--S-------------KFNTYILILGNTF-FK------DKYLQCP--Y--LSDSIVQAFQI
      Naegleria_gruberi_284097202            NRERVQR-FS----NI--LT--------------------------SHQLNMSHYEKLKKHFH----------------HRLNIIS--ANDSRKREVLQD-------------IFFNRSNDYTTCVFDYFVYCLLLRYGAM-FG-AGEKRF-EG------T-----GLHAAC---------PVEVFETLNKN-MNVN--SENFASPLNSYF-------VDFCSACGDLDFWFGSLGG-------FF--E--FFPK---S-------------------GCFESNPPFSEE---------LMQMMV-------SHMENLLANS---------------T---E--SLSFLIVV---PNW-MDA---------L---S---LQRLCS---SSFLT----HEHVLGANVHAYITGSQH------N----EK--N-V-HKRKYN--A-VHETHFFVLQNEK-----GK--TINPITS--Q--FIDYFLKSFAS
      Emiliania_huxleyi_551547167            GSGGTVT-LS--C-----GG------------------------E-SVVCKRSHLEKLRAL------------------LR----S--GDAGD-----AA-AASGER------------------LFERRAYCVLARVLAL-----------QG------GEPRAGGMQAAV---------GYRVFDALARH-HGAA--FELFASPLNARF-------SSFCSAAPDVDRAFGSVGS-------FFSPS--LDPL-LAS-------------------GAFQANPPYDPP---------LVAAMG-------ERMHALLASA---------------DARRD--ALTFIVII---PHW-QDK---------P------CWRALEQ---SCRCS----AHLRLPQAEHGFFEGGQH-----------YR--P---ALWRAA----NHDTSLFFLQSAA-----AP--R--PSEA--S--LAA-LRTAFRA
      Ostreococcus_tauri_116057704           RVDKDVVTLS--V-----GK------------------------S-DVRVNAEHLEKLKEMY-----------------RI----A--NGDRF-----TE-K-----------------------VFMADVYTMVARYDAA-----------QGGQYRFAG-----GHHTAL---------HGEVFDVLRDA-FFVS--CELFASPLNARW-------PTFCSAHIDVDYAFGSLGS-------YR--D--FRPS---H-------------------GSYEVNPPFDEE---------LVGDMS-------NHLFELLQNA---------------T---G--ALTFVVIT---PYW-LNR---------P------CWEDMRR---SKFCT----RCEVLSVREAGYFEGAQH-----------RK--K---SRFRFA----TSDTSVLFLQNEP-----XX--XXXXXKSSTG--RSRSPNERMPR
      Toxoplasma_gondii_221486586            GRG-----WR--------GG----DW--------------------TCECVRDLIDRIRSVKS----------------IG----S---------------------------------------LFASAVFALLCRYHSI-CG-CQN----QG------K-----GLQYAV---------PPNVLDVLRDD-LKVN--CELFASPFNVHF-------DNYCSIFPDVDVMFGSRGS-------FF--D----PS-F-Q------------LLE----GSFEANPPFDEV---------LMARMV-------QRLLSWLKKS------EERQRESL--------PLSFCLSL---PDW-SNG---------P---SE-FMYLLKK---SEYLR----YSEVIPEGKHVYLNGFQH-----------FC--H-T-CDLEVP--A-VCGTFFAVLQNEA-----GT--KAWPVTD--T--FINRLKEAWS-
      Micromonas_pusilla_226459779           TAKAKAK-LT--C-----GK------------------------V-SVECNAAHLEKLRALHR---------AYAR---RR-GG-G--GGGRR-RAR-AR-SAADDE----R-------------IFREDAFSLLARYSSL-QG-AHYK---AG------------AMQAAL---------PPAMFDALREH-FDVS--MELCASPFNCRW-------RRYCGAALDVDAAFGSLGSGAFYFHTVF--E--FTPS---G-------------------GSFEVNPPFDPG---------FVERLV-------AHLESVLSRSATRTTDGDDDDDDDDDDDAE--ALSFVVVV---PYW-PEK---------R------AWLRLVN---SEFAR----KVLKLRAGAHGFVAGAQH-----------LR--P---DKVVPS----AAATSVVFLQNDA-----GA--KRWPVTS--E--KIDAVKEGFRG
      Sphaeroforma_arctica_Sarc1000002473    NEAKKGD-TA--I-----TG------------AIKRSNSSTVK-----RVTSDKHGKPIELDD---KQ---------------------------------HKNTEE------------------LFHNDLYSMLLRYNAL-----------SG------M-----GFQAAC---------SEHVFAALKSL-FRTD--FECFSSPLNTHH-------PLYCSAYIDTDFHFGSRGN-------FF--D--FYPA---S-------------------GSFQANPPFVTG---------VMERMA-------KHIETLLQKA---------------NTNSQ--PLSFIVVV---PGWVNEV---------S------YQTMLAS--PFMEGD----GPLLISKDDHGFCDGAQH-----------QR--Q---DRYRNS----PFDTAIFFLRTDAARNVYGE--RTFESDA--I--LRKAFAEALPS
      Nasonia_vitripennis_156553336          REQTMLR-FH--------ND--------------------------AMCINNTHLAKLEHLYR----------------HN----C--FDDRK-----FE-------------------------MFLPRVWCMLKRYQTY-LG-S--T---ES------Q-----ATQMAL---------PVTVFECLQRM-FGVT--FECFASPLNCYF-------RQYCSAFADTDAYFGSRGP-------FL--D--FRPI---S-------------------GSFQANPPYCEE---------LMEAMV-------NHFERLLSDS---------------T---E--ALSFVVFL---PEW-RDP---------A---PN-ALLKLEA---SHFKR----KQVVVPAMEHEYRHGFQH-----------VL--P-K-GEVNIR--A-IHGTLVVWLQNAA-----GH--ARWGPTE--E--RVEALLEAWRP
      Lottia_gigantea_Lgig1000003858         NDITSLR-YK-------SET---------------------------VKINSSHFHKLEQLYK----------------LN----C--RDDPR-----FD-------------------------HFLCRVWCLLRRYQTY-FG-IHTN---EG------F-----GLQGAL---------PVTVFECLHRV-FGVT--FECFASPLNCYF-------RQFCSAFTDTDGYFGSRGD-------IL--N--FFPK---S-------------------GSFEANPPFCEE---------LMEAMV-------DHFENLLHES---------------N---E--PLSFIVFI---PEW-RDP---------P---TE-ALMRLES---SRFKK----KQITFPAYEHEYRNGFQH-----------IC--P-K-NDMSVK--S-LHGTVAIFLQNDA-----GF--SKWGPTP--E--RIKELLLSSKP
      Phytophthora_sojae_Psoj1000010133      GMRQ----LT--Y-----GN------------------------S-TVKLSAAHFAKLREMYA---RK-----------QG----L--GGDGS-----SM-APKDQR------------------SFESALFCLLLRYDSL-----------DG------G-----GFQAAL---------NEECFDVLLKE-FDCK--MECFASPLNCRY-------SRFCSAFLDTDCAFGSVGS-------FF--D--FSPR---S-------------------GCFEANPPFIPK---------VIKRMA-------DHMTALLNAA---------------D---G--PLAFIVII---PAW-QDT---------E------GWQQLNS---SRYNQ----THLLIPQKQHGYCEGKQQ-----------IR--K---TRWRIA----SFDTSVFFWQNSK-----AC--NKWPVTE--K--KLDSLKSAFKS
      Capitella_spI_Caps1000025183           GDVVSLR-FK-------SEH--------------------------LIKVNTSHFHKLEQLYR----------------CN----C--RDDPK-----FE-------------------------NFLPRVWCLLRRYHTY-FG-LSAD---EG------S-----GLQGAL---------PVPVFECLHRV-FNVT--FECFASPLNCYF-------KQYCSAFVDTDGYFGSRGP-------LL--D--FSPT---S-------------------GSFEANPPFGEE---------LMEAMV-------DHFESLLSES---------------N---D--PLSFIVFV---PDW-RDP---------P---TE-ALMRLES---SRFKR----KQATVVAYEHEYRQGFQH-----------IV--N-K-ADTNIR--A-SHGTVIIFLQNEA-----GF--NKWGPTR--E--RLNELLLAYNP
      Ostreococcus_tauri_116058754           GLDARVV-IR--D-----TG------------PFVSFQLNQKK-P-YVKVTKTHLGKLRALYC---RT-----------CR----R--GKPLS-----EDVNSSEYI------------------VFAYCVFALLLRYGESL----------GG------A-----GYQAAL---------GEDAFDVLKER-LGVS--CECFASPLNARY-------ARFCSAFFDVDKYFGSLGN-------FF--GHGFKPR---A-------------------GSFEMNPPFVPE---------TMLAAV-------EKASALLDEA---------------QKRGA--ALSFVVVV---PAW-KEC---------K------FWHFLQS--CLHLQH----CD-IVDAESHGFCDGAQH-----------VRPMH---ERHRVS----SFDTGVCYLQTAA-----AA--AHRPCDR--A--LRDVVTSAMKR
      Phytophthora_infestans_262109934       GMRQ----LT--Y-----GR------------------------S-TVKLSANHFTKLREMFA---KK-----------QG----L--GGDGS-----NM-APKDQQ------------------QLECALFCLLLRYESL-----------DG------G-----GFQAAL---------NEECFDVLLKE-FDCK--MECFASPLNCRY-------SRFCSAFLDTDFAFGSVGS-------FF--D--FSPR---S-------------------GCFEANPPFIPK---------VIKRMA-------DHMTALLNAA---------------D---G--PLAFIVII---PAW-QET---------E------GWQQLNA---SRFNQ----RHLLIPQKQHGYCEGKQQ-----------IR--K---TRWRIA----SFDTSVFFWQNSK-----AC--NKWPVTE--K--KLEGLKQAFRS
      Helobdella_robusta_Hrob1000018333      GEVVTLR-LKLAAAGVANND--------------------------VMKINSMHFRKLEQLYM----------------LN----C--RDDPK-----ME-------------------------HFLHRTWCLLKRYNTF-FG-TKEN---EG------F-----GLQGAL---------PVSVFQCLNRS-FGVT--FECFASPLNCYF-------KQFCSAFPDTDGYFGSRGS-------IL--D--FYPI---S-------------------GSFEANPPFNEE---------LMEAMV-------DHFESLLSET---------------P---L--PLSFIIFL---PDW-KDP---------P---TE-ALIKLES---SRYKR----QQMTIPAMEHEYRHGFQH-----------IC--Q-R-KDLNVR--S-LHGTLVIFLQNDA-----GA--NKWSVNN--D--NMRELLYAYQL
      Phaeodactylum_tricornutum_219110361    SRVFSLV-FH--------RK----SWKKPF----------------RVKINVSHYHKLKTAFL---RV-----------HN----S--DHQLK-PIL-LY-DHGKPT-KAIH-------------SFHLIIMSLLLRYSAL-SG-GQLL---VD-LRG--G-----GMQGAV---------HDEVFEALQTC-FPNESFLECFASPLNCYA-------ANFGSAFTDIDFHFGSVGD-------FL--D--QSIS---H-------------------GVCEANPPFSPG---------LMDTMV-------DRIEYNLTLA---------------DQTSS--CLTFVVII---PTA-STSEDVRTAKRFA---TK-SFQRMLG---SAACR----LHISLAARDHGYIEGAQH-----------LR--P---TRYKES----NFDTSVILLQSSA-----AR--KENIDEN--N--LEKRLRSAFTS
      Harpegnathos_saltator_307208075        REQTMLR-FH--------GD--------------------------TMCINNIHLTKLEHLYR----------------YN----C--FDDKK-----FE-------------------------MFLPRVWCMLKRYQTY-LG-I--N---EG------Q-----ATQMAL---------PVTVFECLQRS-FGVT--FECFASPLNCYF-------RQYCSAFADTDSYFGSRGP-------FL--D--FRPV---S-------------------GSFQANPPYCEE---------LMEAMV-------NHFERLLADS---------------T---E--PLSFVVFL---PEW-RDP---------A---PN-ALIKLEG---SHFKR----KQVVVPAMEHEYRHGFQH-----------IL--P-K-GEVNIR--A-AHGTLVVWLQNPA-----GA--ARWGPTE--E--RVEALLEAWRP
      Sphaeroforma_arctica_Sarc1000006366    ------T-AE--T-----SARARSVDCVEV--VITGKKAREAL-L-ERATTDLRAFDLTYNQM---TVRINKAHHDKLRLL----H--SRNAP-----QS-ERGDDS------------------ALNSRVFSLLVRYHTLQGG------HVQG------G-----GMQAAL---------IEDTFDALLRN-FGVN--FECFASPLNSRY-------GQYCSMFADTDGPFGSVGS-------FF--D--FYPL---S-------------------GSFEANPPFEDG---------VIHRMA-------MHIDVLLDRS---------------DRENK--PLSFVVVI---PAW-AES---------S------GWQRLNQ---STHLK----RLLTLSQRDHGFCEGTQH-----------SR--P---TRYRIS----TYVVPIVVGLAMT-----GDFFAYWVISA--V--MVGMVLSLVRL
      Emiliania_huxleyi_551601616            GEEDGVG-LR--E-----AEVGGGVWCVSLPPALLAPLPQELR-K-PLKISDEWLSKLREMHT---ATVASTAPA----SA----P--ASASA-----AS-AAAAEL------------------RFRSDLARLLLRYKAL-----------GG------S-----GFQAAI---------GGGAFAVLRAS-FGAR--LECFASPLNARS-------APFCSAFPDVDAPFGSLGS-------FL--E--FEPE---A-------------------GAYEANPPFVPL---------VLRAMC-------AHMHRLLDRA---------------EASRR--PLLFVVVVGASSAL-KRH---------A------AWEDLQGLAAGRHGR----AQWLLPLHAHGYTEGHAH-----------IA--K---GGARAARRMSSCDTAVFVWASSA-----GA--EQWPVTD--G--AEAALRAAMKA
      Emiliania_huxleyi_551569354            GEGGLVE-VH--V-----HG--------PL--LRLSLSSAPGG-T-HVDVSHEHYAKLAALHA---KH---------------------------------APSVAG------------------NLRTRVLCMMLRYQSL-----------GA------H-----GNQCAL---------PPAGFEVLRQR-LGIR--FECFASPLNARY-------DRYCSAFADTDAAFGSIGS-------FF--G--FRPT---H-------------------GSFECNPPFVPEAPLAAVRPPVLLAAV-------KHAEALLSAA---------------EASGG--ALSFAFVV---PSW-ERV---------P------FHHQLCR-SAFLRGG----APLRLAAEAHGFVDGAQH-----------LK--AAAGDRLRVS----SFASTVGVLQTAA-----AA--ERWPVDA--A--LYSRLSAAFAG
      Saccoglossus_kowalevskii_291221943     GELTCLK-YK--------DQ--------------------------VCKVNSAHFQKLEQLYK----------------LH----C--LDDPR-----FD-------------------------NFLGRAWCLLKRYQTM-FG-LRTN---EG------S-----GLQGAL---------PIPVFQVLNRH-FGVT--FECFASPLNCYF-------KQYCSAFNDVDSYFGSRGP-------VL--D--FYPV---S-------------------GSFEANPPFGEE---------LMEAMV-------DHFESLLDKS---------------T---D--PLSFIVFI---PEW-RDP---------P---TP-ALVRMEA---SRFKR----KQCLIPALEHEYRSGTQH-----------TC--S-S-HELYYR--A-VHGTLAFFLQNDA-----GF--EKWGPTP--D--RVKALLDAFIP
      Emiliania_huxleyi_485621692            --------------------------------------------R-PRSADESMRADLRRAGM---APAASAAVVQAVRSA----S--ARAAT-----RV-DNFRST------------------AAEGGRLRVRLKREED-----------GA------T-----RLEAAL---------GGAAFSALQRC-LGVN--FECFASPLNCYY-------GAYCSAFPDVDAPFGSRGS-------FR--G--FAPR---R-------------------GSYEVNPPFVDG---------LIARMA-------ERLLSLLAAA---------------HAACE--PLTFVVVL---PGW-LDS---------E------GYRALDG---SSHLR----AKLLVAAADHGFVDGGQH-----------AR--T---RTFRES----PYDTALFFLQSGA-----AA--AV---DE--A--CVESVRTALAR
      Giardia_lamblia_159115031              SHSLDLK-FK---------------------------------------LSSIHFYKLRELYK----------------RT----SGKRFDPE-----MK-------------------------MFSKLLFILLRRYHTF-FG-TERF---EG------T-----SFHAAA---------PENIFRRLKSF-LEVS--QECFASPLNCFF-------SQFCSAFPEIDVFFGSLGS-------FF--D--YDIA---E-------------------GSFECGPPYTLE---------CMDRTA-------KHIIRTLDKS---------------E---NRRPIMFVVFV---PEW-RVP---------P---AQ-YHLDLEE---SAYTR----FHFCAPGGKHYYVSGEQHEPKCIASKGALTN--E-K-VGRYYL--V-PHGTHVYFVCNDA-----GF--KRYAKGS--EDYLEKAADDILRV
      Emiliania_huxleyi_485638053            GEEDGVG-LR--E-----AEVGGGVWCVSLPPALLAPLPQELR-K-PLKISDEWLSKLREMHT---ATVASTAPA----SA----P--ASASA-----AS-AAAAEL------------------RFRSDLARLLLRYKAL-----------GG------S-----GFQAAI---------GGGAFAVLRAS-FGAR--LECFASPLNARS-------APFCSAFPDVDAPFGSLGS-------FL--E--FEPE---E-------------------GAYEANPPFVPL---------VLRAMC-------AHMHRLLDRA---------------EASRR--PLLFVVVVGASSAL-KRH---------A------AWEDLQGLAAGRHGR----AQWLLPLHAHGYTEGHAH-----------IA--K---GGARAARRMSSCDTAVFVWASSA-----GA--EQWPVTD--G--AEAALRAAMKA
      Aureococcus_anophagefferens_323449955  --------------------------------------------T-ARATTVARRDGGARHFF---CGDATAELPEKV-DA----K--LRELA-----RR-AGTKPT------------------DVDACILAMTMRYDAL-----------GG------S-----GFQAAL---------PGAAFRALRDR-FGVN--FECFASPLNAYY-------ERYCSAHADVDAPFGSLGS-------FY--D--FSPR---R-------------------GAFECNPPFAPA---------PLLRAA-------RRCDALLAAA---------------EARRD--ALAFAFVA---PVW-TDQ---------A------AWAAVDG---SRFKR----GAVRVPREDHAWRD---------------AR--T---ARARRV----PVDTAIFILATSA-----AE--AAHPCDA--A--ALGEVRAALLV
      Ectocarpus_siliculosus_298709108       SAGQTSG-RR--------EA--------------------------SFAITGAHLHKTWSAYC----------------R-----C--VGGDS-PVW-DR-------------------------NFLRRLFCVLSRYETL-SA--------TS------D-----GYQMAF---------PASGFRLLRHL-VSVD--CECFASPLNCTL-------SRFCSVAYDTDKFFGSEGN-------FF--Q--SEYQ---Q-------------------GSFEANPPFVEE---------VMERMV-------DHMHHLLRRA---------------T---G--PMSFAVIV---PGW-DDD---------G---CV-SYQNMKN---SRFARPHPGFYLTLQKGMHNYRPGMQH---------------R-Q-DVEEKP--S-NCNTFLFILQNDW-----GA--DAWPVST--T--SLGQLQAELES
      Sphaeroforma_arctica_Sarc1000000137    HSTHQAQ-VE--E-----RG------------ALLAYSLRTKKKP-FFSLSRPHAAKLRSLYA---RT-----------R-----G--GKWVD-----KE-G-SDND------------------RFVDAVFCVLARYDAL-----------GG------A-----GYQAAL---------NEASFDVLKDK-MRVD--CECFASPLNCRY-------GQFCSAFPDTDSPFGSLGS-------FF--D--FYPS---K-------------------GSFEMNPPFVPE---------VLCAAA-------EHANALLSLT---------------KEP-----LSFVVVV---PAW-KEV---------R------MWQVLSN--SAYNKH----EPLILTASNHGYCDGQQH-----------QRRPS---ERYRVS----SYDTAVFFLQNDA-----GA--KKWPVSE--A--IRNELVESMHK
      Danio_rerio_125819445                  NDIACLR-FK--------GE--------------------------MVKVSRGHFNKLELLYR----------------YS----C--IDDPR-----FE-------------------------KFLSRVWCLIKRYQVM-FG-SGVN---EG------S-----GLQGSL---------PVPVFEALNKQ-FGVT--FECFASPLNCYF-------KQFCSAFPDIDGFFGSRGP-------FL--S--FSPA---S-------------------GSFEANPPFCEE---------LMDAMV-------THFEDLLGRS---------------S---E--PLSFIIFV---PEW-RDP---------P---TP-ALTRMEA---SRFRR----HQMTVPAFEHEYRSGSQH-----------IC--K-R-EEIYYK--A-IHGTAVIFLQNNA-----GF--AKWEPTT--E--RIQELLAAYKV
      Ciona_intestinalis_198420771           KRHVCLT-YN--------GE--------------------------LVRLNYLYLEKLEALYR----------------IS----C--KDDPK-----ME-------------------------LFLQRVWCLLRRYQTF-FG-PNQY---EG------I-----MLQGAL---------PSTVFECLYSV-FGVT--MECFASPLNSYY-------KNYSSAFADTDCYFGSSGP-------LM--K--LFPV---S-------------------GSFEVNPPFAEE---------LMEAMV-------DHFEKLLAQS---------------N---E--PLSFIVFV---PEW-RDP---------T---PI-AILRMET---SKFKR----KQVLVPAFEHEYRSGLQH-----------VA--P-L-KEVYHK--A-VHGTMVFFLQNES-----GF--QKWGPTQ--D--RLRKLLTAFRP
      Camponotus_floridanus_307182697        REQTMLR-FH--------GD--------------------------TMYINNTHLTKLEHLYR----------------YN----C--FDDKK-----FE-------------------------MFLPRVWCMLKRYQTY-FG-I--N---EG------Q-----ATQMAL---------PVTVFECLQRS-FGVT--FECFASPLNCYF-------RQYCSAFADTDSYFGSRGP-------FL--D--FRPV---S-------------------GSFQANPPYCEE---------LMEAMV-------NHFERLLADS---------------A---E--PLSFVVFL---PEW-RDP---------A---PN-ALIKLEG---SHFKR----KQVVVPAMEHEYRHGFQH-----------IL--P-K-GEVNIR--A-VHGTLVVWLQNPA-----GA--ARWGPTE--E--RVEALLEAWRP
      Naegleria_gruberi_284095070            GTVVD-----------------------------------------SFKLNLVHFNKLRLLYQ----------------KH----N-QEIDPD-----LK-------------------------IFPYRLYALLRRYQTF-FGDSESE---EG------A-----NFHAAL---------PEKGFEFLYKE-FNVC--HECFASPINCYF-------SSFCSAFPDTDVYFGSRGS-------FF--E--FRPT---Q-------------------GFFECGPPYTLE---------VMNKTA-------EYCLQLLKAS---------------D---E--PLSFAVFV---PEW-TDT---------E---YG-RMLHPDS---TPLCT----GHLLAEQGKHEYVIGMQH-----------FK--E-N-EKRYWT--L-PFPTHVYFLQNEK-----GK--EKWPITP--Q--LIERYKKVMEI
      Ectocarpus_siliculosus_298711525       NTYV----LQ--L-----GK------------------------N-KLRMNSAHYDKMKELFS---RS-----------RV----E--GAARR-----QS-SSTEHPPPAWIG------------DFHDCLFSCLMRYEAL-----------QG------G-----GFQASM---------GGDAFDVLLKR-FGAR--MECFASPFNCRY-------SRYCSAFPDTDGPFGSAGS-------FF--D--FQPT---Q-------------------GAYEANPPFVRD---------VILKMA-------NHMDGLLQAT---------------A---K--ALTFVVII---PCW-EDS---------A------GWKRLRD---SAFLS----KHIKLDQKDHGYCEGKQH-----------LR--R---NRYRLA----SFHTSVFFLQTDV-----AR--RQQSPETLGQ--ACRELERAFAL
      Aureococcus_anophagefferens_323451821  --------------------------------------------S-ARAVPAATLDDANDEAE---AALAARPGRIPK-RK----R--PQARG-----DG-DDERPP------------------AFLSKVFSMLLRYDDL-----------AG------D---A-GQHGAI---------PAAVFDVLR-R-WGCD--AECCATPFNATL-------GSYCSPFRDTDAPFGSVGS-------FF--A--FEPA---S-------------------GCYEINPPFTLN----------SDVVE-------RHLRTLLDAA---------------ERGGR--PLMFVMVH-A-AAH-ARH---------ARDGATRALPARDG-PCARYLR----RDFLLAAGAHHYREGKFY-----------AR--A-L-PRAYVP----PMPSIVLFLATDA-----GA--RRWPATR--E--LQAGIEDAFAW
      Micromonas_pusilla_226462261           SIK-----LT--L-----HD------------------------A-EVEINEQQYEKLRWLYESDIQA-----------TQ----F--GGTCA-----KP-SLANTFCESGI-------------TFHSAVFAMLCRYASA-----------HGGMHCMAG-----GHHNAL---------HGDVFDALNIG-LGVH--AECCASPLNCHW-------RLYCSGHPDTDLTFGSLGS-------VF--S--FDPV---D-------------------GYFECNPPFEES---------VLLDCI-------KHIDSLLDVA---------------EVAGK--SLSCVFIV---PHW-PGR---------R------AWETLFR---SVHKS----HTEVIPLREHGFLEVLTG-----------PF--S---ATF-------SYNFHSCFLSKLK-----TF--VSHLIET--H--LTKFLCRGHRK
      Emiliania_huxleyi_485623173            AVEAPAW-WS--T---------------------------------EGSPGGVEIEAEVAG------------------AR----L--SAQAS-----EA-TPDPRA------------------AFHQRLFALLLRYKTL-----------RG------H-----GFHAAI---------APAVWRVLTSR-LGVG--FEAFASPLNCYL-------PTFGSGFSDVDGAFGSSGS-------FF--R--LKPAQLAS-------------------GSCACNPPFVHA---------ILDAAA-------ARVEELLAAA---------------AAADA--PLSFAFIM---PGW-KET---------R------AHASLSA---SPFLR----RAVLVAAADHGFCDGASH-----------QR--A---DPLRAS----PYDTVVFVLQTER-----GS--RKWPADG--R--FEAELRAAFAA
      Micromonas_pusilla_226460900           DADAVVT-VD--D-----AG------------PLLALKVNAQK-P-YMQVSKQHMGKLRALYS---RH-----------SL----G--GAPLP-----PE-GSSEHA------------------AFAASVFALLARYDAC-----------GG------A-----GYQAAL---------GEKAFDVLKKR-VGVG--CEAFASPLNARY-------GRFCSAFPDVDGPFGSLGS-------FF--D--FAPT---R-------------------GSFEMNPPFVPE---------VLLAAA-------ERAEKLLRTA---------------EESDS--RLSFVVVV---PAW-RDV---------P------MWTALEK--SAFKRG----DALIVPASAHGYCDGAQQ-----------IRSPS---ERHRVS----SYDTGVFFLQTTA-----GA--RRWPVTE--E--IRAELLEGMKA
      Thecamonas_trahens_Ttra1000007497      RKDR----FV--P-----TK------------------------SLAVAINVEHVTKLAARYT---GP-----------RP----A--EGDVD-----V-----------------------------ELLFLLLFQYNAL-----------EG------G-----GFQAAL---------PDPVFDLLAAAPFHAD--TEAFASPLNVTL-------PRYHSALPAIDAAFGSSGS-------FF--D--AMPD---A-------------------GVVEANPPFTEP---------FITRML-------AHMHKCLAAA---------------S---G--PLTFVVIV---PAW-RQS---------P------AWLALTT---SPHAS----RTAVLPAAEHAYCEGKQH-----------LR--K---TRFRLA----SNDTSIVFLQNEL-----AA--ASLTITD--A--HVEAIAAAFAA
      Monosiga_brevicollis_167534686         --ELVIY-FA----------------------------------A-SQPVALHRLRRLDWMVR-G-AG-----------RH----I--PDDVP-PAK-HD------------Q------------WQLAQVARCIFRY--------------AA------------HLHTTA--QHWGH--PQEFYNFLARR-LGLL--REAFACPLNSRVLGYNDPAARFCSLYRDTDAPFGSLGS-------FF--E--TDML-A-S---G---------------YGWVVHPPFTED---------ILNRLS-------AQCQSALQQA---AS--------------Q--DRQLIVGIGW-PNW-TDM---------P------SYHQLRD---SPFKR----SEVLQVKYNYHYERLNGT----L------VK--P-------------NFTNIYLILSAQP---------LNEDQQA-----AVREFETIVAH
      Monosiga_brevicollis_163777248         DDKVVLR-CP--TVGA--DC--------------------------LVSMKSDHYHRLAKLYR----------------TH----C-EAFDPE-----RQ-------------------------HFHRLALCVGLRYQTI-LF-EPYV-----------------DGENSM---------PAAFKAFLTKQ-LHCC--FDSFASPFNAYY-------RNFASAAPDLDSFFGSVGS-------FF--D--FTPS---Q-------------------GSFVAFPPFVEL---------VLDRTA-------DRIEALLNAS---------------S---D--PLSFVVMM---PEW-RLY---------K---LH-ALDVLDQ---SPHRR----ADFLIEGNKQAILSGQSW-----------YG--D-V-GQQWKL--A-RFGYRVYVLQNEA-----GF--AQWGAGV--A--ALEENKRAFGA
      Proteobacteria_bacterium_655448079     PAVSMFR-VE--V--L--RS------------RVLGAMR-----S-ACEVSGLKFVNVEELLT----------------KV-IF-S--VRAHN-----LP-------------------------QYDAVLPSIIAGK----------E---DG------D-REV-GGSAVV---------SKVFFNYITGN-VPSE--TKAMEN-LPSTL-------ATICGQAAALVYHHGS----------TL--Q-----------------------------GS--AEPP---------------KNVV-------EVKKRLWPGQ-T--------------------NAHYAALL-------YNG---------R------EVCRLDM---SRYKR------LVVAHNRYDPSSPGQC-----------------I-DRIFTM----AMRYQSLMLSTKC-L---GM--HAALPNN-----VFKLLVDALGC
      Psychroflexus_gondwanensis_489532476   SSPDTLL-ID------------------------------------DFKLKRNQYIDKAYHFV----------------KS-------KTDDV--------------------------------TATNAILRSALRY----------------------------GSIYAETRHIGP---PQKVYDLFYK--WGIR--NEGFASPFNARL-LGKPK-AQFYSLFEDTDEIFGSGGS-------FF--N--LNHP---E-----NP------------GHWCLDPPFTSE---------LMTKVD-------SILASWLETY-K--------------------DLSFLLII---PE--SHA---------P----------------SNVPD----ESVTLKKDLH-YYEGLEG----V------LK--P-L-----------PVNVCIHRYGNIE-----GF--------------SSDAIEEGYSK
      Psychroflexus_tropicus_517867866       IDATSFG-IG------------------------------------EFKLHRNQYIDKAFHFV----------------QK-------KSDKT--------------------------------EAFNAILRSALRY----------------------------ASIYAETRHIGP---PQRVYDMFYD--WGIR--NEGFASPFNARV-LGKPQ-AQFYSLFKDTDKIFGSGGS-------FF--N--LEMP---E-----NP------------GHWCLDPPFTTE---------IMTKVD-------HILETWLETY-K--------------------ELSFLLII---PE--SHT---------P----------------ANQPD----ETVTLKKDTH-YYEGLEG----V------LK--P-L-----------PVNVCIHRYGQFE-----GF--------------SAKAILDGYSK
      Psychroflexus_torquis_504836046        SSPDTLS-ID------------------------------------DFELKRNQYIDKAYHFV----------------NS-------ETDDA--------------------------------TATNAILRSALRY----------------------------GSIYAETRHIGP---PQKVYDLFYK--WGVR--NEGFASPFNARL-LGKPK-AQFYSLFENTDEIFGSGGS-------FF--N--LNSP---E-----NP------------GHWCLDPPFTSE---------LMIKVD-------SILASWLKTY-T--------------------DLSFLLII---PQ--SHT---------P----------------SNKPD----ETITLKKDLH-YYEGLEG----V------LK--P-L-----------PVNVCIHRYGNIE-----GF--------------SSDAILEGYSK
      Listeria_monocytogenes_733112108       --------MK------------------------------------TIANEYETLEAIKKAMA-M--------------YE--L-K--KADKD--------------------------------HVATPRYVVEDIYSLI-----------DI------E-----SFKSLWF--------PFNHYDSLFK--LRAE--ELNLKY---------------------KATHIFDNVGN-----D-FF--T--TEPP-I-D---------C---------DLMISNPPFSQQ-----------NEII-------ERSFQLIDEK-------------------K--IKSFALLL---P---LST--L------E------TEKRANI-F-AQYSDK---LAILIFKKRIKFLGHSTS-----------FN--R---GCCWVC-------YNISALEDKR---------IQWV-------------------
      consensus/100%                         .........................................................................................................................................................................................b..h..........................................ass..................................................PP................................h.............................hh...........................................................................................................................................
      consensus/95%                          ..............................................................................................................................h....h....bh................................s..............h..h....h......c.hus.hss...........a.u...p.s..FGS.Gs.......hh..................................h.h.PPa...........................h..hL..........................b.hhhhh...s.............................................h......a........................................h..h....................................
      consensus/90%                          ..................................................p...h.p.....................................................................h...hh.hh.cY............................s.p.uh.........s...a.hL....h.hp...EshAoPhNs...........a.S.h.-.D..FGS.Gs.......hh................................G.h.hsPPa.............h..hs.........h..hL..........................h.ahhhh...P....p...........................u...p........l....a.a..s..................................s..h.hh.s.......s...................h...h..
      consensus/85%                          ........h........................................hs...h.cb....................................................................h...hh.hh.RY...............s............u.p.uh.........s...aphL....h.hp...EsFASPhNs.h.........ahShh.D.D..FGS.Gs.......hh..p.........p...................G.aphsPPa............hh..hs........ph..hL..s......................sl.Fhlhh...P....p.....................h.....u...p......h.l....H.a.ps.p................................ss.hhhh.s.......u...................h..sh..
      consensus/80%                          ........hp.......................................hs..ph.cl................................s...................................h...lh.hl.RY..h............u............u.psAh.........s..sFphL.p..hshp...EsFASPhNsbh........pahSha.DsD..FGS.Gs.......hh..p..h.P....p...................GsaphNPPFs...........lh..hs.......p+h..lL..u......................sLsFllhl...P.a..c..................h..hp....u...p......h.ls...H.a.pG.pp.............................s.ss.lhhl.s.......u........p..........l..uh..
      consensus/75%                          ........hp.....................................hphs..ph.cl..hh............................s...................................h...lhshl.RYpsh............u............u.psAl.........s..sFchL.p..hshp..hEsFASPlNsbh........paCSua.DsD..FGS.Gs.......ah..s..h.P....p...................GsaphNPPFs...........lhp.hs.......p+hppLL..u......................sLoFllhl...P.W.pcs.................h..hp....o.b.p......h.ls...H.a.pG.pp...........hp................s.so.lhhLps.......u.....bs.s........p.l..uh..
      consensus/70%                          ........hp.....................................hpls..ch.+Lb.ha............................s...................................a...lashLbRYpsh............G............GbpsAl.........s..sFchL.p..hshp..hEsFASPLNsbh........paCSAa.DsD..FGS.Gs.......Fh..s..h.P....p...................GsaphNPPFs...........lhp.hs.......p+hppLLp.u...............p......sLoFllhl...P.W.pcs................sh..lp....S.a.c.....ph.ls..pH.Y.pG.pa...........hp.......p........s.sT.lhhLpsps.....u....pas.s........p.lb.uh..
      
      Back to Contents
    • General notes, phyletic distribution and gene neighborhoods of Group2-Clade 1

      General notes:

      ##7. PCIF1. This type of MTase is highly conserved in the eukaryotes and goes back to LECA. Although there are some interesting anomalies where lineages such as fungi and Trichomonas have entirely lost it. This far only a smattering of close prokaryotic homologs can be found in Psychroflexus and a Listeriaceae that are truly very closely related to the eukaryotic versions. In general thought they belong to the Group 2 (M.EcoKI/M.TaqI-like group) One of the homologs in Proteobacteria bacterium JGI 0000113-E04 is fused to a Thump domain, although it looks very dubious in that it is likely to be derived from a mis-assembly. This is because the neighborhood of this domain in this bacterium also contains eukaryotic proteins. We hence did not consider this sequence as prokaryotic.The PCIF1 protein contains a WW domain N-terminal to the methylase domain. Conserved residues include an E in strand-3, a H in strand-6 and a T before strand-7.
      GI                Domain-architecture                               Pfam                                            Gene name               Len   Taxonomy                                              Species                                Genbank                                
      116059373         N6-MTase                                          PCIF1_WW                                        Ot08g03420              422   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                     unnamed protein product, partial [Ostreococcus tauri].
      116058754         N6-MTase                                          PCIF1_WW                                        Ot07g01880              729   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                     unnamed protein product [Ostreococcus tauri].
      116057704         N6-MTase                                          PCIF1_WW                                        Ot05g01520              1115  eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                     unnamed protein product [Ostreococcus tauri].
      226514838         N6-MTase                                          PCIF1_WW                                        MICPUN_70757            203   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                  predicted protein, partial [Micromonas sp. RCC299].
      226523283         N6-MTase                                          PCIF1_WW                                        MICPUN_84425            98    eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                  predicted protein, partial [Micromonas sp. RCC299].
      226522980         N6-MTase                                          PCIF1_WW                                        MICPUN_55447            432   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                  predicted protein [Micromonas sp. RCC299].
      226522164         N6-MTase                                          PCIF1_WW                                        MICPUN_64863            669   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                  predicted protein [Micromonas sp. RCC299].
      226463465         N6-MTase                                          PCIF1_WW                                        MICPUCDRAFT_12799       101   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545            predicted protein [Micromonas pusilla CCMP1545].
      226462261         N6-MTase                                          PCIF1_WW                                        MICPUCDRAFT_70191       490   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545            hypothetical protein MICPUCDRAFT_70191 [Micromonas pusilla CCMP1545].
      303290244         N6-MTase                                          PCIF1_WW                                        MICPUCDRAFT_43506       385   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545            predicted protein [Micromonas pusilla CCMP1545].
      226460900         N6-MTase                                          PCIF1_WW                                        MICPUCDRAFT_57528       547   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545            predicted protein [Micromonas pusilla CCMP1545].
      303290246         N6-MTase                                          -                                               MICPUCDRAFT_54412       180   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545            predicted protein [Micromonas pusilla CCMP1545].
      226459779         N6-MTase                                          PCIF1_WW                                        MICPUCDRAFT_58337       686   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545            predicted protein [Micromonas pusilla CCMP1545].
      220970802         N6-MTase                                          PCIF1_WW                                        THAPSDRAFT_24679        722   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335      predicted protein [Thalassiosira pseudonana CCMP1335].
      220978161         N6-MTase                                          PCIF1_WW                                        THAPSDRAFT_20873        816   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335      predicted protein [Thalassiosira pseudonana CCMP1335].
      220970802         N6-MTase                                          PCIF1_WW                                        THAPSDRAFT_24679        722   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335      predicted protein [Thalassiosira pseudonana CCMP1335].
      Psoj1000010133    N6-MTase                                          PCIF1_WW                                        Psoj1000010133          468   eukaryota>stramenopiles                               Phytophthora sojae                     137293             
      Pram1000005973    N6-MTase                                          SMC_N+AAA_21+APG6+SMC_hinge+PCIF1_WW            Pram1000005973          1665  eukaryota>stramenopiles                               Phytophthora ramorum                   77822              
      262109934         N6-MTase                                          PCIF1_WW                                        PITG_18050              344   eukaryota>stramenopiles                               Phytophthora infestans T30-4           conserved hypothetical protein [Phytophthora infestans T30-4].
      568046702         N6-MTase                                          PCIF1_WW                                        L914_10859              461   eukaryota>stramenopiles                               Phytophthora parasitica                hypothetical protein L914_10859 [Phytophthora parasitica].
      219110361         N6-MTase                                          PCIF1_WW                                        PHATRDRAFT_43163        549   eukaryota>stramenopiles                               Phaeodactylum tricornutum CCAP 1055/1  predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      Fcyl1000123364    N6-MTase                                          PCIF1_WW                                        Fcyl1000123364          276   eukaryota>stramenopiles                               Fragilariopsis cylindrus               estExt_Genewise1.C_350073
      Fcyl1000111862    N6-MTase                                          PCIF1_WW                                        Fcyl1000111862          300   eukaryota>stramenopiles                               Fragilariopsis cylindrus               gw1.6.1306.1       
      Fcyl1000123395    N6-MTase                                          PCIF1_WW                                        Fcyl1000123395          271   eukaryota>stramenopiles                               Fragilariopsis cylindrus               estExt_Genewise1.C_350074
      Fcyl1000020973    N6-MTase                                          PCIF1_WW                                        Fcyl1000020973          514   eukaryota>stramenopiles                               Fragilariopsis cylindrus               e_gw1.6.1362.1     
      Fcyl1000034107    N6-MTase                                          PCIF1_WW                                        Fcyl1000034107          514   eukaryota>stramenopiles                               Fragilariopsis cylindrus               e_gw1.39.270.1     
      Fcyl1000046637    N6-MTase                                          PCIF1_WW                                        Fcyl1000046637          871   eukaryota>stramenopiles                               Fragilariopsis cylindrus               fgenesh2_kg.27_#_20_#_0_0_CCUX4586.b1_CCUX_EXTA
      Fcyl1000123964    N6-MTase                                          PCIF1_WW                                        Fcyl1000123964          300   eukaryota>stramenopiles                               Fragilariopsis cylindrus               gw1.39.264.1       
      Fcyl1000047105    N6-MTase                                          PCIF1_WW                                        Fcyl1000047105          846   eukaryota>stramenopiles                               Fragilariopsis cylindrus               fgenesh2_kg.35_#_10_#_0_0_CCUX4586.b1_CCUX_EXTA
      Fcyl1000016119    N6-MTase                                          PCIF1_WW                                        Fcyl1000016119          354   eukaryota>stramenopiles                               Fragilariopsis cylindrus               fgenesh2_pg.2_#_93 
      Fcyl1000014177    N6-MTase                                          PCIF1_WW                                        Fcyl1000014177          577   eukaryota>stramenopiles                               Fragilariopsis cylindrus               gw1.1.2000.1       
      Fcyl1000088053    N6-MTase                                          PCIF1_WW                                        Fcyl1000088053          190   eukaryota>stramenopiles                               Fragilariopsis cylindrus               gw1.27.207.1       
      Fcyl1000099591    N6-MTase                                          PCIF1_WW                                        Fcyl1000099591          185   eukaryota>stramenopiles                               Fragilariopsis cylindrus               estExt_Genewise1.C_270101
      Fcyl1000049607    N6-MTase                                          PCIF1_WW                                        Fcyl1000049607          618   eukaryota>stramenopiles                               Fragilariopsis cylindrus               gw1.1.174.1        
      Fcyl1000104830    N6-MTase                                          PCIF1_WW                                        Fcyl1000104830          269   eukaryota>stramenopiles                               Fragilariopsis cylindrus               gw1.1.1865.1       
      Fcyl1000014986    N6-MTase                                          PCIF1_WW                                        Fcyl1000014986          593   eukaryota>stramenopiles                               Fragilariopsis cylindrus               fgenesh2_pg.1_#_1050
      298711525         N6-MTase                                          PCIF1_WW                                        Esi_0035_0124           771   eukaryota>stramenopiles                               Ectocarpus siliculosus                 conserved unknown protein [Ectocarpus siliculosus].
      298709108         N6-MTase                                          PCIF1_WW                                        Esi_0231_0005           625   eukaryota>stramenopiles                               Ectocarpus siliculosus                 conserved unknown protein [Ectocarpus siliculosus].
      323451821         N6-MTase                                          PCIF1_WW                                        AURANDRAFT_71790        749   eukaryota>stramenopiles                               Aureococcus anophagefferens            hypothetical protein AURANDRAFT_71790 [Aureococcus anophagefferens].
      323451439         SPOUT+C2+Tox-ABHYDROLASE3+N6-MTase                Aa_trans+SpoU_methylase+C2+Lipase_3+PCIF1_WW    AURANDRAFT_71849        3487  eukaryota>stramenopiles                               Aureococcus anophagefferens            hypothetical protein AURANDRAFT_71849 [Aureococcus anophagefferens].
      323449955         N6-MTase                                          PCIF1_WW                                        AURANDRAFT_66058        306   eukaryota>stramenopiles                               Aureococcus anophagefferens            hypothetical protein AURANDRAFT_66058 [Aureococcus anophagefferens].
      Bnat1000017417    N6-MTase                                          PCIF1_WW                                        Bnat1000017417          214   eukaryota>rhizaria                                    Bigelowiella natans                    gw1.33.39.1        
      Lgig1000003858    WW+N6-MTase                                       WW+PCIF1_WW                                     Lgig1000003858          637   eukaryota>metazoa>mollusca                            Lottia gigantea                        e_gw1.21.357.1     
      24667685          WW+N6-MTase                                       WW+PCIF1_WW                                     Dmel_CG11399            920   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                CG11399 [Drosophila melanogaster].
      156553336         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC100116693            763   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                    PREDICTED: phosphorylated CTD-interacting factor 1 [Nasonia vitripennis].
      307208075         WW+N6-MTase                                       WW+PCIF1_WW                                     EAI_15124               733   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                  Phosphorylated CTD-interacting factor 1 [Harpegnathos saltator].
      91095163          WW+N6-MTase                                       WW+PCIF1_WW                                     LOC656483               664   eukaryota>metazoa>hexapoda                            Tribolium castaneum                    PREDICTED: phosphorylated CTD-interacting factor 1 [Tribolium castaneum].
      66535528          WW+N6-MTase                                       WW+PCIF1_WW                                     LOC551754               729   eukaryota>metazoa>hexapoda                            Apis mellifera                         PREDICTED: similar to CG11399-PB [Apis mellifera].
      158300743         WW+N6-MTase                                       WW+PCIF1_WW                                     AgaP_AGAP011933         789   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST            AGAP011933-PA [Anopheles gambiae str. PEST].
      193654861         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC100159733            693   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                    PREDICTED: phosphorylated CTD-interacting factor 1-like [Acyrthosiphon pisum].
      307182697         WW+N6-MTase                                       PCIF1_WW                                        EAG_11439               735   eukaryota>metazoa>hexapoda                            Camponotus floridanus                  Phosphorylated CTD-interacting factor 1 [Camponotus floridanus].
      291221943         WW+N6-MTase                                       WW+PCIF1_WW                                     PCIF1                   713   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii               PREDICTED: phosphorylated CTD-interacting factor 1 [Saccoglossus kowalevskii].
      115957714         N6-MTase                                          PCIF1_WW                                        LOC581054               1094  eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus          PREDICTED: similar to Chromosome 20 open reading frame 67 [Strongylocentrotus purpuratus].
      321469809         WW+N6-MTase                                       WW+PCIF1_WW                                     DAPPUDRAFT_51102        640   eukaryota>metazoa>crustacea                           Daphnia pulex                          hypothetical protein DAPPUDRAFT_51102 [Daphnia pulex].
      321459458         N6-MTase                                          PCIF1_WW                                        DAPPUDRAFT_61197        232   eukaryota>metazoa>crustacea                           Daphnia pulex                          hypothetical protein DAPPUDRAFT_61197, partial [Daphnia pulex].
      47212430          N6-MTase                                          PCIF1_WW                                        GSTEN:00009293:G:001    673   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                 unnamed protein product [Tetraodon nigroviridis].
      125819445         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC553360               716   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                            PREDICTED: phosphorylated CTD-interacting factor 1 [Danio rerio].
      114682331         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC458292               658   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                        PREDICTED: phosphorylated CTD interacting factor 1 isoform 3 [Pan troglodytes].
      224078117         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC100228688            617   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                    PREDICTED: hypothetical protein, partial [Taeniopygia guttata].
      114682333         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC458292               685   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                        PREDICTED: phosphorylated CTD interacting factor 1 isoform 4 [Pan troglodytes].
      62646369          WW+N6-MTase                                       WW+PCIF1_WW                                     RGD1310800_predicted    704   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                      PREDICTED: similar to posphorylated CTD interacting factor PCIF1 [Rattus norvegicus].
      326936323         WW+N6-MTase                                       WW+PCIF1_WW                                     PCIF1                   829   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                    PREDICTED: phosphorylated CTD-interacting factor 1-like [Meleagris gallopavo].
      22122647          WW+N6-MTase                                       WW+PCIF1_WW                                     Pcif1                   706   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                           phosphorylated CTD-interacting factor 1 [Mus musculus].
      18034767          WW+N6-MTase                                       WW+PCIF1_WW                                     PCIF1                   704   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                           phosphorylated CTD-interacting factor 1 [Homo sapiens].
      114682325         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC458292               704   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                        PREDICTED: phosphorylated CTD interacting factor 1 isoform 1 [Pan troglodytes].
      118100805         WW+N6-MTase                                       WW+PCIF1_WW                                     LOC771963               707   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                          PREDICTED: similar to Chromosome 20 open reading frame 67 [Gallus gallus].
      327288387         WW++N6-MTase+RING                                 WW+PCIF1_WW+zf-RING_2                           pcif1                   860   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                    PREDICTED: phosphorylated CTD-interacting factor 1-like [Anolis carolinensis].
      210126797         WW++N6-MTase                                      WW+PCIF1_WW                                     BRAFLDRAFT_118827       781   eukaryota>metazoa>chordata                            Branchiostoma floridae                 hypothetical protein BRAFLDRAFT_118827 [Branchiostoma floridae].
      210099455         N6-MTase                                          PCIF1_WW                                        BRAFLDRAFT_267292       730   eukaryota>metazoa>chordata                            Branchiostoma floridae                 hypothetical protein BRAFLDRAFT_267292, partial [Branchiostoma floridae].
      198420771         N6-MTase                                          PCIF1_WW                                        LOC100187259            515   eukaryota>metazoa>chordata                            Ciona intestinalis                     PREDICTED: phosphorylated CTD-interacting factor 1-like [Ciona intestinalis].
      Smar1000005290    WW+N6-MTase                                       WW+PCIF1_WW                                     Smar1000005290          715   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                     SMAR008745-PA pep:novel scaffold:Smar1:JH431850:1046606:1050059:-1 gene:SMAR008745 transcript:SMAR008745-RA
      Hrob1000018333    N6-MTase                                          PCIF1_WW                                        Hrob1000018333          499   eukaryota>metazoa>annelida                            Helobdella robusta                     102536             
      Caps1000025183    WW+N6-MTase                                       WW+PCIF1_WW                                     Caps1000025183          650   eukaryota>metazoa>annelida                            Capitella spI                          estExt_fgenesh1_pg.C_4020007
      Sarc1000002473    N6-MTase                                          PCIF1_WW                                        Sarc1000002473          686   eukaryota>ichthyosporea                               Sphaeroforma arctica                    Sphaeroforma arctica JP610 hypothetical protein (686 aa)
      Sarc1000000137    N6-MTase                                          PCIF1_WW                                        Sarc1000000137          556   eukaryota>ichthyosporea                               Sphaeroforma arctica                    Sphaeroforma arctica JP610 hypothetical protein (556 aa)
      Sarc1000006366    N6-MTase                                          PCIF1_WW                                        Sarc1000006366          1130  eukaryota>ichthyosporea                               Sphaeroforma arctica                    Sphaeroforma arctica JP610 hypothetical protein (1129 aa)
      284097202         N6-MTase                                          PCIF1_WW                                        NAEGRDRAFT_56544        724   eukaryota>heterolobosea                               Naegleria gruberi                      phosphorylated carboxy-terminal domain interacting factor [Naegleria gruberi].
      284095070         N6-MTase                                          PCIF1_WW                                        NAEGRDRAFT_78364        756   eukaryota>heterolobosea                               Naegleria gruberi                      phosphorylated CTD interacting factor [Naegleria gruberi].
      485621692         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_196229       460   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_196229 [Emiliania huxleyi CCMP1516].
      551547167         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_422415       480   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_422415 [Emiliania huxleyi CCMP1516].
      485623173         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_242884       541   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_242884 [Emiliania huxleyi CCMP1516].
      551569354         N6-MTase                                          Stc1+PCIF1_WW                                   EMIHUDRAFT_461366       920   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_461366 [Emiliania huxleyi CCMP1516].
      485634611         N6-MTase                                          -                                               EMIHUDRAFT_434687       237   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_434687, partial [Emiliania huxleyi CCMP1516].
      551536260         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_221253       311   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_221253 [Emiliania huxleyi CCMP1516].
      485623605         N6-MTase                                          Stc1+PCIF1_WW                                   EMIHUDRAFT_117856       920   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_117856 [Emiliania huxleyi CCMP1516].
      485612123         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_437886       208   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_437886 [Emiliania huxleyi CCMP1516].
      485638541         N6-MTase                                          Dam                                             EMIHUDRAFT_111979       250   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_111979 [Emiliania huxleyi CCMP1516].
      485641476         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_440930       483   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_440930 [Emiliania huxleyi CCMP1516].
      551536260         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_221253       311   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_221253 [Emiliania huxleyi CCMP1516].
      485638053         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_112355       538   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_112355 [Emiliania huxleyi CCMP1516].
      551601616         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_99458        538   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_99458 [Emiliania huxleyi CCMP1516].
      485617795         N6-MTase                                          PCIF1_WW                                        EMIHUDRAFT_197672       379   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516             hypothetical protein EMIHUDRAFT_197672 [Emiliania huxleyi CCMP1516].
      528270883         N6-MTase                                          PCIF1_WW                                        AGDE_02289              379   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                       hypothetical protein AGDE_02289 [Angomonas deanei].
      528228842         N6-MTase                                          PCIF1_WW                                        AGDE_11734              502   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                       hypothetical protein AGDE_11734 [Angomonas deanei].
      528244288         N6-MTase                                          PCIF1_WW                                        AGDE_08890              400   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                       hypothetical protein AGDE_08890 [Angomonas deanei].
      528274252         N6-MTase                                          PCIF1_WW                                        AGDE_01076              528   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                       hypothetical protein AGDE_01076 [Angomonas deanei].
      528260169         N6-MTase                                          PCIF1_WW                                        AGDE_05813              455   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                       hypothetical protein AGDE_05813 [Angomonas deanei].
      159115031         N6-MTase                                          PCIF1_WW                                        GL50803_24111           599   eukaryota                                             Giardia lamblia ATCC 50803             Phosphorylated CTD interacting factor PCIF1 [Giardia lamblia ATCC 50803].
      146083518         N6-MTase                                          PCIF1_WW                                        LINJ_17_1050            1068  eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5              conserved hypothetical protein [Leishmania infantum JPCM5].
      146088136         N6-MTase                                          PCIF1_WW                                        LINJ_24_2080            589   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5              conserved hypothetical protein [Leishmania infantum JPCM5].
      146104898         N6-MTase                                          PCIF1_WW                                        LinJ36.6020             657   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5              hypothetical protein [Leishmania infantum].
      157867596         N6-MTase                                          PCIF1_WW                                        LMJF_17_0940            1068  eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin       conserved hypothetical protein [Leishmania major strain Friedlin].
      157870335         N6-MTase                                          PCIF1_WW                                        LMJF_24_2000            592   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin       conserved hypothetical protein [Leishmania major strain Friedlin].
      157877658         N6-MTase                                          PCIF1_WW                                        LMJF_36_5530            656   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin       conserved hypothetical protein [Leishmania major strain Friedlin].
      163777248         N6-MTase                                          PCIF1_WW                                        MONBRDRAFT_6793         638   eukaryota>choanoflagellida                            Monosiga brevicollis MX1               predicted protein [Monosiga brevicollis MX1].
      167534686         N6-MTase                                          PCIF1_WW                                        MONBRDRAFT_28562        509   eukaryota>choanoflagellida                            Monosiga brevicollis MX1               hypothetical protein [Monosiga brevicollis MX1].
      239900371         N6-MTase                                          PCIF1_WW                                        Pmar_PMAR019893         212   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983           conserved hypothetical protein, partial [Perkinsus marinus ATCC 50983].
      594149527         N6-MTase                                          PCIF1_WW                                        GSHART1_T00000036001    565   eukaryota>euglenozoa>kinetoplastida                   Phytomonas sp. isolate Hart1           unnamed protein product [Phytomonas sp. isolate Hart1].
      124512340         N6-MTase                                          PCIF1_WW                                        MAL8P1.49               1466  eukaryota>alveolata>apicomplexa                       Plasmodium falciparum 3D7              hypothetical protein [Plasmodium falciparum 3D7].
      514681002         N6-MTase                                          PCIF1_WW                                        PTSG_10553              877   eukaryota>choanoflagellida                            Salpingoeca rosetta                    hypothetical protein PTSG_10553 [Salpingoeca rosetta].
      326429537         N6-MTase                                          PCIF1_WW                                        PTSG_06762              352   eukaryota>choanoflagellida                            Salpingoeca rosetta                    hypothetical protein PTSG_06762 [Salpingoeca rosetta].
      558598873         N6-MTase                                          PCIF1_WW                                        SS50377_14344           517   eukaryota                                             Spironucleus salmonicida               Phosphorylated CTD interacting factor PCIF1 [Spironucleus salmonicida].
      528231724         N6-MTase                                          PCIF1_WW                                        STCU_06168              583   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                    hypothetical protein STCU_06168 [Strigomonas culicis].
      89295554          N6-MTase                                          PCIF1_WW                                        TTHERM_00426090         413   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210          phosphorylated CTD-interacting factor 1 (macronuclear) [Tetrahymena thermophila SB210].
      Ttra1000007497    N6-MTase                                          PCIF1_WW                                        Ttra1000007497          512   eukaryota>apusozoa                                    Thecamonas trahens                      Thecamonas trahens ATCC 50062 hypothetical protein (512 aa)
      221486586         N6-MTase                                          PCIF1_WW                                        TGGT1_082560            722   eukaryota>alveolata>apicomplexa                       Toxoplasma gondii GT1                  conserved hypothetical protein [Toxoplasma gondii GT1].
      72389220          N6-MTase                                          PCIF1_WW                                        Tb927.5.2310            711   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927      hypothetical protein [Trypanosoma brucei brucei TREU927].
      72393303          N6-MTase                                          PCIF1_WW                                        Tb927.8.6220            572   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927      hypothetical protein [Trypanosoma brucei brucei TREU927].
      71661834          N6-MTase                                          PCIF1_WW                                        Tc00.1047053511903.170  524   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener     hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71403247          N6-MTase                                          PCIF1_WW                                        Tc00.1047053511065.36   524   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener     hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71651022          N6-MTase                                          PCIF1_WW                                        Tc00.1047053509799.80   532   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener     hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71651450          N6-MTase                                          PCIF1_WW                                        Tc00.1047053511755.90   694   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener     hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71650340          N6-MTase                                          PCIF1_WW                                        Tc00.1047053504137.130  532   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener     hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71412328          N6-MTase                                          PCIF1_WW                                        Tc00.1047053511313.20   694   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener     hypothetical protein [Trypanosoma cruzi strain CL Brener].
      
      Back to Contents
    • Multiple sequence alignment of the DIRS1-like/Group2-Clade 1 of N6-MTase
      
      Back to Contents
      
      
    • Multiple sequence alignment of the DIRS1-like/Group2-Clade 1 of N6-MTase

                                                                                                                  Str-1                    Str-2                    Str-4                                                 Str-5
      RES                                                                            HGRSP-SLD-VFADAHTTKVPG------SFFAA-NWCPG----------VKGVDAFAQ----DWGSPGWGRRDGGETVRPLLYINPPF--SA------VARVLRKVAEER--------------------PDCV-LILPVWP-RAWVAILRT-----LPIRAQMTLAHR--ELF------IPGPQVPNAAKRGPMTPRYR-VQAVY
      ALIGN                                                                          -------HH-HHHH-------E------EEEE----------------------EEE-------------------HH-------------H------HHHHHHHHH-------------------------EE-EE---------HHHHHH-----HHHH---------------------------------------------
      HMM                                                                            ------HHE-EEHHHHHHHHH-------EEEEE-----------------EEEEEEEE-----------HHHHHHHH---EEEE---H--HH------HHHHHHHHHH------------------------EE-EEE--------HHHHHH-----H--HH-----------E--------------------------------
      FREQ                                                                           -----------E----------------EEE-------------------HHHHHHHH------------HHHHHHHH--HH---------H------HHHHHHHHHH------------------------EE-EEE---------HHHHH-----HHH-------------------------------------HHH-HHHH-
      PSSM                                                                           --------------------------------------------------------------------------------E----------H------HHHHHHHHHH-----------------------EEE-EEE--------HHHHHH-----HH--------------------------------------------E--
      FINAL                                                                          ----------------------------EEEE------------------HHHHEEEE------------HHHHHHHH--EEE-------HH------HHHHHHHHHH------------------------EE-EEE--------HHHHHH-----HHH--------------------------------------HH-HHEE-
      Vcar1000004728_Vcar_Vcar1000004728                                             ---L--S-KLQDPD-------------DW-----MLKPSIFRRLHDTWG---P-FDIDLFASHA-------TFQLP------VYYS---------RYF-----TRTTSGV-------DAFRS---------SW-----G--RRCW--------ANP----PFH----LLLRVLQHAE-----AC--QS-R---L-----C--LV------APF-------WPTRDWW-PFITA-D-GV-WFKPFASGVRLLGRAAD-V-FLARTSGSPSPKAL--------PA-DLD--ARAEA-LL-----------------
      AAA33195.1_Dictyostelium_discoideum_167739                                     ---L--S-RLSEMNHKSSTR--VIKSYNW-----QLKKEVFNRIQLQFG---Q-IQMDLFASHL-------NHQTT------NY---------------------STIRM-------NTLHL---------DW-----SQWKQCL--------AFP----PPI----LLPSILEKMN-----SS--SS-KKVSI-----I--LI------FPI-------WRSATWY-PMIQA-Q-VP-RHHRHMFP-QVLGTFQE-V-L-----TKQSVESI--------PI-QIQ--QRWKL-GI-----------------
      AAA70202.1_Dictyostelium_discoideum_903714                                     ---L--S-RLSEMNHKSSTR--VIKSYNW-----QLKKEVFNRIQLQFG---Q-IQMDLFASHL-------NHQTN------NY---------------------STIRM-------NALHL---------DW-----SQWKQCL--------AFP----PPI----LLPSILEKMN-----SS--SS-KKVSI-----I--LI------FPI-------WRSATWY-PMIQA-Q-VP-RHHRHMFP-QVLGTFQE-V-L-----TKQSVESI--------PI-QIQ--QRWKL-GI-----------------
      AAL35360.1_Tetraodon_nigroviridis_17066696                                     ---L--S-R----------G--NPLYGEW-----RLHPQVVAQIWQRYG---K-AAVDLFASQE-------NAHCP------LFFS---------LAE-----GSAPLGV-------DALAH---------PW-----PD-VLLY--------AFP----PLS----LISPTLARVR-----EQ--GL-S---L-----I--LV------APR-------WPSKHWI-AEIVQ-L-LM-AEP------WPLPCRRD-L-L------SQARGEI--------FH-PRP--DRLSL-WA-----------------
      LOC100333442_Danio_rerio_292630533                                             ---L--S-R----------Q--RLEPGGW-----RLHPKVVAAIWQRFS---K-ADINLFACQK-------TTHCP------LWFS---------LTH------PAPLGL-------DAMVQ---------KW-----PR-LRLY--------AFP----PIA----LLPGILERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-LD-GLP------WEIPIQRD-L-L------SQAGGMI--------VH-PRP--DLWKL-WV-----------------
      LOC100334277_Danio_rerio_292615760                                             ---L--S-R----------Q--GVRSGEW-----KLHPEVVETIWERFG---K-AQVDLFASQE-------TTHCV------LWFS---------LSH------PAPLGL-------DAMVQ---------TW-----PR-LRLY--------AFP----PVA----LLPGVLERIR-----QD--GV-Q---L-----L--LV------APF-------WPTRIWF-SDLIA-L-LA-GLP------WEIPIRRD-L-L------SQAGGMI--------LH-PRP--DLWKL-WV-----------------
      LOC558928_Danio_rerio_189516844                                                ---L--S-R----------Q--GLEPGGW-----RLHPKVLAAIWQRFG---R-ADVDLFACQK-------TTHCP------LWFS---------QTH------PAPLGL-------DAMVQ---------TW-----PR-LRLY--------AFP----PIA----LLPGVLERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-LD-GLP------WEIPVQRD-L-L------SQAEGMI--------VY-PRP--DLWKL-WV-----------------
      LOC100331476_Danio_rerio_292613204                                             ---L--S-R----------Q--GLEPGGW-----RLHPKVVAAIWQRFG---R-ADVDLFACQK-------TTHCP------LWFS---------QTH------PAPLGL-------DAMVQ---------TR-----PR-LCLY--------VFP----PIA----LLRGVLERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-LD-GLP------WKNTVQRD-L-L------SQAEGMI--------VH-PRP--DLWKL-WV-----------------
      LOC100329396_Danio_rerio_292611038                                             ---L--S-R----------Q--LLRPGEW-----RLHPKSVQLIWARFG---E-AQIDLFASPE-------NAHCQ------LFFS---------LTE-------GSLGT-------DALAH---------SW-----PRGMRKY--------AFP----PVS----LLAQFLCKVR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WRIPLRED-L-L------SQGQGTI--------WH-PRP--DLWNL-HS-----------------
      LOC561204_Danio_rerio_292626487                                                ---L--S-R----------Q--LLRPGEW-----RLHPESVQLIWARFG---E-AQIDLFASPE-------NAHCQ------LFFS---------LTE-------GSLGT-------DALAH---------SW-----PRGMRKY--------AFP----PVS----LLTQFLCKVR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WRIPLRED-L-L------SQGQGTI--------WH-PRP--DLWNL-HV-----------------
      LOC100005823_Danio_rerio_125838616                                             ---L--S-R----------Q--GLEPGGW-----RLHPKVVAAIWQRFG---R-ADVDLFACQK-------TTHCP------LWFS---------QTH------PAPLGL-------DAMVQ---------TW-----PR-LRLY--------VFP----PIA----LLPGVLERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-------------------RD-L-L------SQAEGMI--------VH-PRP--DLWKL-WV-----------------
      LOC100369420_Saccoglossus_kowalevskii_291232955                                ---L--S-R-------------IVDTDDW-----SLNPKIFGMLDKIWG---P-HSIDRFASCH-------NAQLP------RFNS---------AAS-----NPGTEAV-------DAFCQ---------DW-----ST-ENNWIQNWKAELSNP----ALDDFCDTLPDVMLASR-----AG--NT-L---S-----K--YT------GSW-------LRWKRWC-QSNLS-A-GA-ACP--AKP-LHIAIYLR-S-L------LDNANTV--------AP-MDS--ALYSIRWA-----------------
      CBG11637_Caenorhabditis_briggsae_268535232                                     ---A--S-R-------------NFDFDDW-----GVADRVFRQAQKLWG---E-IKVDWFADAN-------NRKTE------VYFS---------RYP-----EFGTSGV-------NVFDHIERAERMGCAW--------------------WVP----PPM----LIPHLLKIEG-----AK--R------------Q--FV------EELKQKAGEGFEHHIER-LKRIP-F-EA-K--------ATSTAAAY-K-A------ENDKRTLEGRRSFFNYE-PGSDAVILMC--------------------
      TcasGA2_TC015886_Tribolium_castaneum_270017202                                 ---E--S-R----------R--LSPETEF-----ELAPYAFRKICTFFQ---I-PEVDLFASRN-------NTKCR------RFFS---------WFR-----DPEAEVV-------DAFTV---------PW-----TD-LKFY--------AFP----PFS----LVAHCLQKIV-----SD--RA-R---G-----I--LV------VPY-------WPTQPWF-PIFTS-L-LR-KEP------IFFEPDTN-L-L------LTSHRVP---HP---LH---------------QKLSLVAGLLSK----
      TcasGA2_TC010690_Tribolium_castaneum_270016221                                 ---K--S-R----------R--LKSETEW-----QLSDFAYSDIIKRFG---Y-PEIDLFANRH-------NAKCE------KFVA---------WLR-----DPGALAI-------DAFTV---------SW-----ED-YYFY--------AFP----PFS----VVLRTLRKII-----SD--RP-C---G-----I--LV------VPN-------WPIQPWF-PLFIS-L-LT-NKP------FYCKPSKY-L-L------TSPDRRR---HP---IW---------------NQLSLVVGRLSG----
      VOLCADRAFT_118754_Volvox_carteri_f_nagariensis_302846025                       ---L--A-N-----------------SKW-----HETLEPLVCTYPTLF---H-GLLFKLQDMR-------DEDVR------GFVR---------CLT------PSTLQT-------GGYRG---------------------------RPQ-SVN----PLM--A-MSQRGMTKAH-----AV--DR-V---K-----L--LI--Y---QQL----VE-GLKKVSY-RIIKT-F-FE-ESD---QR-LLNDMAQF-P-I---F--AEFVGLV---AN---MD-TGK--------AI-GESD------------
      LOC100493936_Xenopus_(Silurana)_tropicalis_301612402                           ---L--S-R----------Q--RLDPGEW-----ALNPGIFQDIVALWG---L-PEVDLMASRQ-------NRKVT------QFMS---------RCR-----DPLALAA-------DALTT---------TW-----DF-DLAY--------AFP----PLP----LLPRVIRKIR-----SE--RC-T---V-----I--LI------APH-------WPKRAWF-TELVA-L-SR-SEP------WPLPQIPD-L-L------AQGPILH--------PN-PAF--LNLTA-WR-----------------
      LOC100331913_Danio_rerio_292614277                                             ---------------------------QW-----RLHPESVQLIWARFR---E-AQIDLFASPE-------NAHCQ------LFYS---------LTE-------GSLGT-------DALAH---------SW-----PRGVRKY--------AFP----PVS----LIAQLMCKFR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WQIPLRED-L-L------SQGQGTV--------WH-PHP-----TT-KL-----------------
      LOC561204_Danio_rerio_125850303                                                ---L--S-R----------Q--LLRPGEW-----RLHPESVQLIWARFG---E-AQIDLFASPE-------NAHCQ------LFFS---------LTE-------GSLGT-------DALAH---------SW-----PRGMRKY--------AFP----PVS----LLTQFLCKVR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WRIPLRED-L-L------SQGQGTI--------WH-PRP--DLWNL-HL-----------------
      IscW_ISCW000383_Ixodes_scapularis_240992967                                    ---A--S-R----------R--VMDASSW-----KLCPFTFSRVNFLWG---P-VHMDLFADFS-------NHQVR------HYYS---------WKP-----DPQAAAV-------DALSH---------DG-----TG-QGLY--------AFP----PFS----LVSRCIAKLQ-----TS--NS-L---L-----I--LV------APV-------RPSQPWY-ASLLY-H-SY-EEP------RLLPQSHD-L-L------RSHDGQV---HP---ML------------ST-GTLIL-----------
      LOC100487094_Xenopus_(Silurana)_tropicalis_301609594                           ---S--Q-S----------T--RFP--EW-----ELHSDAFQDLTRRWG---T-PQIDLMASRS-------NQNVL------KFFT---------HCR-----DPLTTGI-------VAMTQ---------HW-----RF-DLVY--------VFP----PLP----MLPQVLKKIR-----QS--PT-T---V-----I--VI------APY-------WPRRTWF-SDLQE-L-CD-TQK------GP---------S------SSGPNRP-----------------------------------------
      LOC100378332_Saccoglossus_kowalevskii_291236647                                ---L--S-R------R---L--CLQNTEW-----QILQWVTSRLFWLWD---G-PKIDLFAALR-------NAKLP------TYVT---------LLP-----TPGAWAV-------DALSI---------PW-----SH-MDST-GKSYAV-RLP----GGR----LQHSVLLMLK-----RF--HV-D----------------Y---RSK-------YDVLS---EKLSN-L-LK-TKP-DQMD-VAVNVKCE-M-G------LEYEASA--------FK------KLLKY-AF-GLNFVKMVNVTKKV--
      ACB38666.1_Daphnia_pulex_170819724                                             ---E--S-R----------A--GPDSGDW-----KLDPMVFERIQQLW----P-SDVDVFASPW-------NAHLP------AFIS---------WFP-----QPGAMAT-------NAFSV---------NW-----KG-LSGY--------IFP----PFA----LIFKCIEKIR-----RE--RA-T---A-----V--FV------CPV-------WTGQPWF-PLLLE-L-VC-DVP------RLLTSSPV-L-L------TSALGES---HP---LI------------SS-NALHLAA--------W
      ACB38665.1_Daphnia_pulex_170819710                                             ---E--S-R----------A--EADTSDW-----RLDATIFSRISEIW----E-MDVDLFASSW-------NSQLP------RFIA---------WGP-----QPGAFAA-------NAFSI---------RW-----EN-IYGY--------AFP----PFS----LIFRCIEKIR-----RE--KA-S---I-----I--LI------CPV-------WTGQPWF-PVLLE-H-AC-DIP------RLLRPSPE-L-L------TSARGEP---HP---LI------------QS-GALSLAA--------W
      NEMVEDRAFT_v1g220156_Nematostella_vectensis_156352960                          ---P--S-R-------------VLSDLDC-----TLSAQTWRSIDIAFG---P-HSIDLMALPS-------NVMHDHAGRPLRFFS---------QLP-----CVQAEST-------NVFAH---S-----LL-----PE-ENAY--------VFP----PFI----LMGPLLGHLS-----KR--AC-P-F-S-----I--VV------PDI-------TPRKYWW-SVLKR-R-AA-A------S-FKLGSRGS-L-S-SLL--FPAKSGA---AP---WL-NQE--RLRGS-RA-----------------
      ORF2_Panagrellus_redivivus_10058                                               ---A--S-R-------------ETDPDDW-----AISKEIFEKLTAKFQ---K-CQCDRFASHK-------TKQLD------KFMS---------RVP-----CPGSAGV-------NAFAY---------QW-----TD-WSSW--------CVP----PPA----LLVRTWKHIE-----SH--AC-E---G-----L--LV------SPD-------WP------ANVVA-T-AA-SRA------VRKGFAKL-V-Y--RI--RAGTRCI-T-PP-A-FS-TGA--------FQ-TPYAQSDLLVYR---F
      CBG07308_Caenorhabditis_briggsae_268581363                                     ---A--S-R-------------NFDFDDW-----GIAERVFIQAQRMWG---E-IKVDWFADAN-------NKKTE------LYFS---------RYP-----EFGTSGV-------NVFEHVERAERMGLPW--------------------LVP----PPV----LIPQLIKIMR-----RR--RL-R---G-----V--LV------APL-------WKSHISY-QALVD-Y-SG-R--------FIREVKDY-I-I------YQKNDCI--------FI-PGE---------------------------
      CBG06557_Caenorhabditis_briggsae_268566149                                     ---A--S-R-------------NFDFDDW-----GIAERVFSHAQRMWG---E-IKVDWFADAN-------NKKTE------LYFS---------RYP-----EFGTSGV-------NVFEHVERAERMGLPW--------------------LVP----PPV----LIPQLIKIMR-----RR--RL-R---G-----V--LV------APL-------WKSHISY-QALVD-Y-SG-R--------FIREVKDY-I-I------YEKNDCI--------FI-PGE---------------------------
      CBG03513_Caenorhabditis_briggsae_268553337                                     ---A--S-R-------------NFDFDDW-----GIAERVFSQAQRMWG---E-IKVDWFADAN-------NKKTE------LYFS---------RYP-----EFGTSGV-------NVFEHVERAERMGLPW--------------------LVP----PPV----LIPQLIKIMR-----RR--RL-R---G-----V--LV------APL-------WESHISY-QALVD-Y-SG-R--------FIREVKDY-I-I------YEKNDCI--------FI-PGE---------------------------
      CRE_11685_Caenorhabditis_remanei_308493269                                     ---A--S-R-------------NFDFDDW-----GIADRVFKQAQRLWG---E-IKVDWFADAQ-------NKKTE------RFFS---------RYP-----EFGSSGV-------NVFEHIPRAERMGLAW--------------------WVP----PPV----MIPQLLKIAK-----SR--GL-K---G-----V--LV------APL-------WKSHPSY-QALVD-S-SG-R--------FVRYVRDY-I-I------YEKNDNI--------FI-PGE---------------------------
      CBG19494_Caenorhabditis_briggsae_268562950                                     ---A--S-R-------------NFDFGDW-----GVADKVFRQAQRLWG---E-IKVDWFADAN-------NRKTE------LYLS---------RYP-----EFGTSGV-------NVFDHIERAERMGCAW--------------------WVP----PPV----LLPHLLKIAR-----KR--RL-R---G-----V--LV------APL-------WQSHASY-QSLVD-K-TR-R--------FIREIKDY-I-I------YEKNDSI--------FI-PGD---------------------------
      CRE_13303_Caenorhabditis_remanei_308490919                                     ---A--S-R-------------DFDYDDW-----AVQNWAFEWAQKRWG---E-VKCDWFADEQ-------NTKTE------LFFS---------RLP-----EPGTLGA-------DVFEHVDKAGAIGLAW--------------------WVP----PPA----LIPRLMRVAR-----QK--KL-R---G-----I--LA------TPL-------WKAHPSY-QALVN-E-RG-E--------FIPEIRDS-R-I------FKVNTKI--------IS-PGR---------------------------
      CRE_30767_Caenorhabditis_remanei_308500876                                     ---A--S-R-------------DFDYDDW-----AVQNWAFEWAQKRWG---E-VKCDWFADEQ-------NTKTE------LFFS---------RLP-----EPGTLGA-------DVFEHVDKAGAIGLAW--------------------WVP----PPA----LIPRLMRVAR-----QK--KL-R---G-----I--LA------TPL-------WKAHPSY-QALVN-E-RG-E--------FIPEIRDS-R-I------FKVNTKI--------IS-PGR---------------------------
      CRE_30766_Caenorhabditis_remanei_308500992                                     ---A--S-R-------------DFDYDDW-----AVQNWAFEWAQKRWG---E-VKCDWFADEQ-------NTKTE------LFFS---------RLP-----EPGTLGA-------DVFENVDKAGSIGLAW--------------------WVP----PPV----LIPRLMRVAR-----QK--KL-R---G-----I--LA------TPL-------WKTHPSY-QALVN-E-RG-E--------FIPEIRDS-R-I------FKVNTKI--------IS-PGR---------------------------
      CBG17454_Caenorhabditis_briggsae_268571541                                     ---A--S-R-------------EFDTDDW-----GVQNWAFEWAQKRWG---H-VKCDLFASER-------NAKHS------VYFS---------RYP-----EPTSSGT-------DAFDHF-TCAAKSLTW--------------------WVP----PPV----LVPKLIQVAR-----RN--RC-R---G-----I--LA------TPL-------WKTHPSY-LALVD-Q-NG-N--------FIREIRDL-R-R------RTRENNF--------SM-TRS---------------------------
      CRE_21296_Caenorhabditis_remanei_308475765                                     ---A--S-R-------------DFDFDDW-----GVDQKVFLWAQTRWG---K-FKCDWFADEA-------NAKTQ------LFYS---------RDP-----CKCSQGA-------NVFDHIDVAKELGFAW--------------------WVP----PSN----LVPQLIAECR-----KT--SM-R---G-----V--LA------MPL-------WENHVSF-QAILD-S-RG-N--------WIRQLVDL-R-V------YPAKDRI--------IV-PGT---------------------------
      CRE_26772_Caenorhabditis_remanei_308462401                                     ---A--S-R-------------DFDFDDW-----GVDQKVFLWAQTRWG---E-FKCDWFADEA-------NAKTQ------LFYS---------RDP-----GKCSQGA-------NVFDHIDVAKQLGFAW--------------------WVP----PPN----LVPRLIAECR-----KT--SM-R---G-----V--LA------MPL-------WENHVSF-QAILD-S-RG-N--------WIRQLVDL-R-V------YPAKDRI--------IV-PGA---------------------------
      LOC100492777_Xenopus_(Silurana)_tropicalis_301606863                           ---L--V-K---FK-----K--DSPHKNYIDNLKNIFNSHIADCMNLWD---D-TNIMDIEDNC-------KATCY-LDPT-GLLE---------KVK-----ELLEIAQ-------NFIKG------------------------K------NTP----SDD-CG-QYFEMCVATE-----KN--ST-G--------------------PPH-------WESTSPV-TPVHP-L-TV-NQK---PQ-SDMPSTNH-T-S------HSSMDIS--------VK-SSD--SYHHM----PTR-------------
      LOC100489531_Xenopus_(Silurana)_tropicalis_301620562                           ---L--S-R----------Y--QIDPTEW-----ELHPEVFDLIVTQWG---E-PDLDLMASRH-------NRKTP------LFIS---------KTR-----DHLANEE-------RKRHVHSNRSKLA-----------SEVV--------VYR----PGE----SLNRSTHNTP-----HS--GR-P---A-----S--PR--SDL-PPQ-SPNVP-FDGVALE-TAILN-Q-KG----------FSPVVAQT-M-I------NARKAVS-S-KA---YH------RIWKI-FI-----------------
      LOC100487066_Xenopus_(Silurana)_tropicalis_301624526                           ---L--S-R----------T--TLDPGEW-----KLKEEIFQQLVAKWG---Q-PCLDVMASRF-------NSQTP------RFLS---------KVH-----DPMVEGL-------DALTS---------PW-----HC-QLAY--------AFP----PIP----LIPRLLHKIR-----RE--RV-P---T-----I--LI------APW-------WPRRAWF-AELIQ-M-SA-EQP------WTIPLSSD-L-L------SQGPATA--------EN-LHK--LNLTA-WM-----------------
      LOC100490727_Xenopus_(Silurana)_tropicalis_301619133                           ---L--S-R----------T--TLDPGEW-----KLKPEIFQQIVKKWG---L-PCLDIMASRF-------NSQIP------RFLS---------KVH-----DPKAEGV-------DALTS---------PW-----HC-QLAY--------AFP----PIP----LIPRLLHKIR-----RE--NI-P---T-----I--LI------APW-------WPRRAWF-AELIQ-M-SA-EQP------WTFPLYAD-L-L------SQGPAKA--------EN-IHN--LNLTA-WM-----------------
      CHLREDRAFT_180868_Chlamydomonas_reinhardtii_159465941                          ---L--S-R------QL--A--QARDQNL-----RLKPAVFRSLVTTDGGQYR-PTVDCCADVL-------GLNAQ-PGCA-EFFS-----------P-----ERSVLGQ-------EQRLA----GKV--LW--------------------AFP----PVS----LTGEVLATIAAAAQLDE--RT-R---A-----T--VV------VPY-------QPSYPWF-QQWAS-Q-RL-AYK------TLQGNISA-L-A--DW--QRSKGRS-G-ED-L----------------------------------
      EAI_09447_Harpegnathos_saltator_307201692                                      ---E--S-R----------I--SDTNTEW-----SLSEQAFRAVEGVFG---P-FDIDLFASII-------NAKLD------LYVS---------WFP-----DPGSWAI-------DAFTL---------SW-----QS-LYFY--------AFP----PFI----IIPRILRKII-----DD--EA-T---G-----V--LI------VPW-------WPSQSWF-PMFTC-L-LQ----------------------------------------------------------------------------
      EAI_17025_Harpegnathos_saltator_307196129                                      ---L--S-R----------L--KNLDTEW-----ELATYAFNKITTSFG---F-PELDLFATSL-------NAKCE------KFCS---------WAT-----DPNAWAI-------DAFSI---------SW-----ST-FFSY--------AFP----PFS----MILRMLNKIV-----QD--KA-R---G-----I--IV------VPN-------WKGQAWY-PMFRN-L-L-----------------------------------------------------------------------------
      NEMVEDRAFT_v1g211073_Nematostella_vectensis_156375001                          ---L--S-R-------------FVDKDDW-----SVNQSVFRLLDAKGG---P-HTIDRFASAY-------NTKLT------CFNSSSLPGLFDLLLG-----AKAVSAV-------RKYHT---------GW-----MR-LRVW-ALSKFD-VKPIPAKPLH-VA-LFLTELTRSA-----EE--KG-V---G-----I--SN--VEG-VAY---VIT-WRAL----PTLLH-G-CT-SWR-DYGR-------------------------------------------------------------------
      BRAFLDRAFT_131954_Branchiostoma_floridae_260797342                             ---T--G-Q-CRCL-----D--DFSGRQC-NMC-QFGYFDFPTCRECTC-N-Q-AGTDPNTCNA-------NDVCA-CADN-GTCS---------CKP-----NVEGKSC-------TLCKE-------G-SF-----NL----------EE-ENP--N-GCT-SC-FCFGITDQCR-----QA--NL-V---T-----E--QV------TPD-------ADNNNFF---LSN-I-RR-TQQ-T----------------------------------------------------------------------
      NEMVEDRAFT_v1g208020_Nematostella_vectensis_156382077                          ---A--N-K------------------SW--LK-KLSEEQLRDTADKYL-R-P-NNCSHVVVPKVNEEIWLNFKCR------VNIS---------YQP-----DPGAYAV-------NAFHT---------SW-----KN-LCFY--------AFS----PFG----IIQKVLSKIS-----ED--QA-T---G-----I--LV------APH-------WPT-----PTMVA-I-SC-K--------FTYRTTSY-F-T------QKEEHPV---LT---IQ-SRA--ETSTP----QDPTVLGLPLV-----
      EAI_12430_Harpegnathos_saltator_307198641                                      ---Q--S-H----------I--VSTETEW-----SLSCDYFHRIESGFD---P-FDIDLFASSI-------YTKCP------CFVS---------WLP-----DPLAHSI-------DAFSL---------DW-----SK-FYFF--------AFP----PFI----LILRVLRKII-----SD--KA-E---R-----V--LV------VP------------------------------------------------------------------------------------------------------
      EAI_06111_Harpegnathos_saltator_307212135                                      ---E--S-R----------C--KDPGTEW-----CLSDEAFQQVNKAFG---P-FDINLFASAI-------NNKCD------VCVS---------WFP-----NPGSFTT-------DAFAV---------AW-----EA-LNFY--------AFP----PFI----LLPRVLRKLI-----DD--EA-T---G-----T--LV------V-------------------------------------------------------------------------------------------------------
      EAI_10577_Harpegnathos_saltator_307193617                                      ---E--S-R----------I--SDTDTEW-----SLTDCAFQLIDRHFG---P-FAIDLFASAI-------NTKND------LYVS---------WFP-----NPGSWAT---------FTL---------DW-----HR-FYFY--------AFP----PFI----LFSRGLRKFI-----DD--KA-I---G-----V--LV------VPW-------W---------------------------------------------------------------------------------------------
      GIP_L7_0070_Glyptapanteles_indiensis_190702585                                 ---G--S-R----------I--VNPDTEW-----ELADWAFQRIVKNFG---T-PEIDLFASRT-------NRKCK------KFCS---------WHR-----DPDAYCV-------DAFTM---------VC-----TD-LKFY--------AFP----PFS----LILRTLKKIE-----AD--QA-Q---D-----T--STSTVSKNAAN-------GRDIVWQ-TFLKL-D-FN-EKA------VELLVGSI-T-D------STMKQYN---KP---LQEWKNF-------SSEQKIDMLKPQTNQVINW
      EAG_00458_Camponotus_floridanus_307183886                                      ---E--S-R----------K--LQPETEF-----ELDNSAFQKIVKVFG---Q-PEIDLFASRA-------NAKCR------RYVS---------SRK-----DSGSIAI-------DAFIL---------EW-----KR-FLFY--------AFP----PFS----VILKVLRKIE-----YE--GS-S---G-----I--VV--------------------------------------------------------------------------------------------------------------
      NEMVEDRAFT_v1g220590_Nematostella_vectensis_156351485                          ---L--D-R------V---I--TDEHSTF-REL-ARIAAVFRLLNIKWG---P-YTIDRFATHY-------NAQLS------RFHS---------KFA-----APGSCGV-------DAFTQ---------EW-----S--------------GLP----EKG----YLERGLTFRS-TI--AI--GP-T---Q-----V--SC-EF---RPL-----------------------------------------------------------------------------------------------------
      Dpul1000019018_Daphnia_pulex_Dpul1000019018                                    ---G--S-K-------------MTDTDDW-----QVDHETYQRINRRYS-----FTIDLFASDR-------NTKCQ------NFSQ-------------------------------------------------------------------IFT----ART----LLASMRFRTR----------------G-----K--MK----------------WLGSA---HQLEK---------------------------------------------------------------------------------
      CBG23377_Caenorhabditis_briggsae_268557352                                     ---T--S-R-------------EFDTDDW-----GVQDWAFEWAQKRWS---R-VKCDLFASER-------NAKHS------VYFS---------RYP-----EPTSSGT-------DAFDHF-TCAAKSLTW--------------------WVP----PPV----LVPN-----------------------------------------------------------------------------------------------------------------------------------------------
      CBG19482_Caenorhabditis_briggsae_268561666                                     ---A--S-R-------------NFFFDDW-----GVAGRVFRQAQRL-------------------------------------------------YP-----EFGTSGV-------NVFDHIERAERMGCAL--------------------WVP----PPV----LIPHLLKMGR-----KR--RL-R---G-----V--LV------APL-------WRSHASY-QALVD-H-SG-R--------FIRAIKDY-I-I------YEKNDNI--------FI-PEG---------------------------
      LOC582271_Strongylocentrotus_purpuratus_115621795                              ---L--S-R----------G--KCLPSEW-----TLSPTVFRQLVRVFS---T-SISSQLRSTI------------------VFLG-------------------------------------------------------------------FVR----ESG------NQELLKIR-----ED--QA-M---V-----V--LI------APW-------WPARSWF-QDLLT-L-LV-GTL------WSLPCHPD-L-V------SQPLSGI--------LH-QRP--EILHL-TA----------------W
      LOC100123785_Nasonia_vitripennis_156546508                                     -------------------------------------------------------------------------------------------------------------M-------NAFTI---------NW-----NN-KFWY--------AFP----PFA----LLTKTLKKIR-----DD--KA-E---G-----I--LI------VPH-------WPGQPWF-PEFKR-L-LE-THA----P-FSVPAFTD-C-R-SIV--REAYRQK-G-LE---EA-PVE--II-----------------------
      MNEG_14804_Monoraphidium_neglectum_761958716                                   ---L--S-KIVDKN-------------DW-----MLHEEEFGRLARRFG---P-FEVDLFASHT-------TRQLP------KYFS---------LYH-----TPDTAGI-------DAFAQ---------HW-----G--RGCW--------CNP----PFT----LIGRVLRHAR-----EC--GA-R---M-----C--LL------APA-------WPSAAWWHQLVLP-G-GT-HFRPFVRECVVLPKRRD-L-F------------------------------------------------------
      D478_26539_Brevibacillus_agri_BAB-2500_432181416                               ------N-K--AMF--------TSEREEW-----ETPQDFFEKLNKEF----G-FQLDVCALPT-------NAKCE------RYFT---------PDE-------------------DGLKQ---------EW-----TG--VCW--------MNP----PYG--R-EIGKWVKKAY-----ES--AK-Q---G-AT--V--VC--L---LPA-------RTDVKWW-HDYCM-K-G--EIR------LVRGRMKF----------VGADNMA-----------PFP--NAVVI-FS-----------------
      C236_RS0118880_Brevibacillus_laterosporus_517503045                            ------N-E--GMF--------TSSTDLW-----ETPQDFFNQLNKEF----G-FQLDVCALPE-------NAKCE------RYFS---------PDE-------------------DGLQQ---------EW-----TG--ICW--------MNP----PYG--R-QIGKWIKKAY-----ES--SL-N---G-AT--V--VC--L---IPA-------RTDASWW-HAHCM-K-G--EIR------LVKGRLKF----------GGSKWNA-----------PFP--NAVVI-FR-----------------
      ABOUO_79_Paenibacillus_phage_Abouo_525335850                                   ------N-E--GMF--------TSSTDLW-----ETPQEFFNQLNQEF----G-FQIDVCALPE-------NAKCE------RYFS---------PDE-------------------DGLQQ---------EW-----TG--ICW--------MNP----PYG--R-QIGKWIKKAY-----ES--SL-N---G-AT--V--VC--L---IPA-------RTDARWW-HDYCM-K-G--EIR------LVKGRLKF----------GSSKWSA-----------PFP--NALVI-FK-----------------
      D478_RS25245_Brevibacillus_agri_748713908                                      ------------MF--------TSEREEW-----ETPQDFFEKLNKEF----G-FQLDVCALPT-------NAKCE------RYFT---------PDE-------------------DGLKQ---------EW-----TG--VCW--------MNP----PYG--R-EIGKWVKKAY-----ES--AK-Q---G-AT--V--VC--L---LPA-------RTDVKWW-HDYCM-K-G--EIR------LVRGRMKF----------VGADNMA-----------PFP--NAVVI-FS-----------------
      M655_RS0109725_Bacillus_sp_NSP21_737442515                                     ------------MF--------KSEREEW-----ETPQEFFDKLNDEF----G-FQLDVCALPT-------NAKCE------RYFT---------PDD-------------------DGLHQ---------EW-----TG--VCW--------MNP----PYG--R-EIGKWVKKAY-----ES--AK-Q---G-AT--V--VC--L---LPA-------RTDVKWW-HDYCM-K-A--EIR------LVRGRMKF----------VGADNMA-----------PFP--NAVVI-FS-----------------
      ABBL099_02355_Acinetobacter_baumannii_690996743                                ---T--K-N--KLFGL-----AEERTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AN-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      FL80_RS05360_Acinetobacter_baumannii_690988986                                 ---T--K-N--KLFGL-----AEERTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AN-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      RQ87_RS18135_Acinetobacter_baumannii_447010248                                 ---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      ERIC1_1c08270_Paenibacillus_larvae_subsp_larvae_DSM_25719_567770034            ------N-K--VHY--------SSKTDMW-----ETPQNLFDRLNEEF----K-FDLDVCAIPE-------NAKCK------RYFT---------PSE-------------------DGLKQ---------EW-----KG--ACW--------MNP----PYG--R-QIGKWIAKAY-----ES--SL-E---G-AT--V--VC--L---VPS-------RTDTKWW-HGYCM-K-G--EIR------FIRGRLKF----------GGSPHNA-----------PFP--NAVVI-FR-----------------
      J517_3010_Acinetobacter_baumannii_691065210                                    ---T--K-N--KLFGL-----AEERTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      J523_3197_Acinetobacter_baumannii_691027491                                    ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      J660_1691_Acinetobacter_baumannii_691157882                                    ---S--K-N--KLFGL-----AEDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      ERIC1_RS03940_Paenibacillus_larvae_738763505                                   ------N-K--VHY--------SSKTDMW-----ETPQNLFDRLNEEF----K-FDLDVCAIPE-------NAKCK------RYFT---------PSE-------------------DGLKQ---------EW-----KG--ACW--------MNP----PYG--R-QIGKWIAKAY-----ES--SL-E---G-AT--V--VC--L---VPS-------RTDTKWW-HGYCM-K-G--EIR------FIRGRLKF----------GGSPHNA-----------PFP--NAVVI-FR-----------------
      K035_3853_Acinetobacter_baumannii_691039522                                    ---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      J697_3983_Acinetobacter_baumannii_691093639                                    ---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      B4086_RS03845_Bacillus_cereus_822506548                                        ------N-K--GMF--------TSKTDLW-----ATPQYFFDELHKEF----N-FELDVCALED-------NAKCE------KYFT---------PEM-------------------DGLKQ---------EW-----NG--TCW--------MNP----PYG--R-GIGKWVQKAY-----ES--SL-T---G-ST--V--VC--L---LPA-------RTDTRWW-HDYCM-N-G--EIR------LVKGRLKF----------GDSKNSA-----------PFP--NAVVI-FG-----------------
      J689_1368_Acinetobacter_calcoaceticus/baumannii_complex_645913983              ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      W9I_03525_Acinetobacter_nosocomialis_493629840                                 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      K041_RS17240_Acinetobacter_baumannii_690981431                                 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      J532_4398_Acinetobacter_baumannii_691154760                                    ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      ACIN5021_2863_Acinetobacter_sp_OIFC021_444754682                               ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      J595_RS19805_Acinetobacter_baumannii_691047241                                 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWIAKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      LJ44_RS16470_Acinetobacter_baumannii_447017697                                 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      FL80_RS15355_Acinetobacter_baumannii_690990657                                 ---A--K-L--GLFGN-----AEGRTDVW-----ATPQTLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWISKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CAVVV-FR-----------------
      J532_4398_Acinetobacter_baumannii_940793_630464595                             ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      F985_01871_Acinetobacter_sp_NIPH_973_490838153                                 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPG-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      TT45_RS11045_Acinetobacter_baumannii_758882462                                 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWISKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      ABSDF2497_Acinetobacter_baumannii_SDF_169152788                                ---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----K-FDLDVCALPD-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----DT--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      J689_1349_Acinetobacter_baumannii_691068978                                    ---A--Q-R--KLFGL-----AENRTDVW-----ATPQDFFDKLNAVF----N-FDLDVCALPE-------NAKCE------RFFS---------PEQ-------------------NGLKQ---------EW-----IG--TCW--------MNP----PYG--R-EIVDWIAKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      BTS2_0497_Bacillus_sp_TS-2_591276954                                           ------N-Q--AMF--------SSSTDKW-----STPQSFYDKLNQEF----Q-FDIDVCATDS-------DKKCE------RYFS---------PEQ-------------------DGLKQ---------EW-----TG--ICW--------MNP----PYG--R-GIGPWIQKAY-----ES--SQ-Q---G-AT--V--VC--L---LPS-------RTDTKWW-HEYCM-K-G--EIR------FIKGRLKF----------GDSKNSA-----------PFP--SVVVI-FR-----------------
      J594_4091_Acinetobacter_baumannii_259052_588219826                             ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWIAKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      BTS2_RS02440_Bacillus_sp_TS-2_780117918                                        ------N-Q--AMF--------SSSTDKW-----STPQSFYDKLNQEF----Q-FDIDVCATDS-------DKKCE------RYFS---------PEQ-------------------DGLKQ---------EW-----TG--ICW--------MNP----PYG--R-GIGPWIQKAY-----ES--SQ-Q---G-AT--V--VC--L---LPS-------RTDTKWW-HEYCM-K-G--EIR------FIKGRLKF----------GDSKNSA-----------PFP--SVVVI-FR-----------------
      J635_1953_Acinetobacter_baumannii_690997976                                    ---A--K-L--GLFGN-----AEGRTDVW-----ATPQKLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----AG--TCW--------MNP----PYG--R-EIVDWISKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      LILY_61_Bacteriophage_Lily_755258783                                           ---SNTM-A--VHY--------SSKTDMW-----ETPQDFFDKLHAEF----G-FTLDVCAVPE-------NAKCE------RFFS---------PDD-------------------NGLLQ---------NW-----KG--VCW--------MNP----PYG--R-QIGAWIAKAY-----ES--SL-E---G-AT--V--VC--L---VPS-------RTDTKWW-HDYCL-K-G--EVR------FIKGRLKF----------GGSPHNA-----------PFP--NAIVI-FR-----------------
      J546_RS10975_Acinetobacter_baumannii_736663998                                 ---A--N-H--QLFGL-----AENRTDIW-----ATPQDFFDKLNAVF----K-FDLDVCALPN-------NAKCE------RFFS---------PED-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIIEWVAKAA-----CT--AK-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKSNA-----------PFG--CCVVV-FR-----------------
      J660_0735_Acinetobacter_calcoaceticus/baumannii_complex_493629922              ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--C-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      J479_2646_Acinetobacter_baumannii_691127129                                    ---A--K-L--GLFGN-----AEGRTDVW-----ATPQTLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EITLWIDKAV-----QT--AN-Q---G-HT--V--VG--L---LPA-------RTDVTWW-QEHVM-N-R--EIH------YIKGRLKF----------GGCKHNA-----------PFG--CAVVV-FR-----------------
      ACINWC323_RS01110_Acinetobacter_sp_WC-323_696306260                            ---A--K-S--KLFGL-----AEDRTDVW-----ATPQDFFDKLNAIF----D-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLSQ---------EW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-A---G-YT--V--VA--L---LPA-------RTDVGWW-QSHCL-N-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CAVVV-FR-----------------
      J596_3741_Acinetobacter_baumannii_691117543                                    ---A--K-L--GLYGN-----AEGKTDVW-----ATPQNLFDALDQIF----N-FDLDVCALPE-------NAKCE------RYFT---------PEL-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-N---G-HT--V--VG--L---LPV-------RTDVVWW-QEHIL-H-R--EIH------YIKGRLKF----------GGSKHNA-----------PFG--CALVV-FR-----------------
      J660_0735_Acinetobacter_baumannii_88816_593668543                              ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--C-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR-----------------
      ACINWC323_A0077_Acinetobacter_sp_WC-323_425484490                              ---A--K-S--KLFGL-----AEDRTDVW-----ATPQDFFDKLNAIF----D-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLSQ---------EW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-A---G-YT--V--VA--L---LPA-------RTDVGWW-QSHCL-N-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CAVVV-FR-----------------
      X858_RS0107890_Bacillus_subtilis_647261410                                     ------------HF--------SSKTDLW-----ATPQYFFDELHKEF----D-FELDVCALED-------NAKCE------KYFT---------PEM-------------------DGLKQ---------EW-----NS--TCW--------MNP----PYG--R-GIGEWVQKAY-----ES--SL-K---G-ST--V--VC--L---LPA-------RTDTRWW-HDYCM-K-G--EIR------LVKGRLKF----------GESKDNA-----------PFP--NAVVI-FG-----------------
      J635_2258_Acinetobacter_baumannii_690998264                                    ---A--K-L--GLFGN-----AEGRTDVW-----ATPQKLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------DW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----QT--AN-Q---G-HT--V--VG--L---LPT-------RTDVAWW-QEHVM-N-R--EIH------YIKGRLKF----------GGCKHNA-----------PFG--CAVVV-FR-----------------
      G454_RS0114655_Desulfovirgula_thermocuniculi_654109520                         ------N-R--GLF--------SSASSEW-----ETPQKFFETLDVEF----G-FTLDVCARPE-------NAKCP------RYFS---------PEE-------------------DGLRQ---------EW-----AP-EVCW--------MNP----PYG--R-EIGKWIQKAY-----EE--AQ-K---G-AT--V--VC--L---LPS-------RTDTAWW-HEYVM-RAA--EVR------FIRGRLRF----------GGAENGA-----------PFP--SCVVV-FR-----------------
      F931_01759_Acinetobacter_pittii_507070967                                      ---A--K-L--GLYGN-----AEGKTDVW-----ATPQNLFDAIDHIF----N-FDLDVCALPE-------NAKCD------RYFT---------PEL-------------------DGLKQ---------EW-----VG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-N---G-HT--V--VG--L---LPV-------RTDVVWW-QEHIL-H-R--EIH------YIKGRLKF----------GGCKHNA-----------PFG--CALVV-FR-----------------
      PL75_03330_Neisseria_sp_KH1503_831387832                                       ------------HF--------SSKTDLW-----ATPQDFFDNLNEEF----G-FELDVCALPE-------NAKCE------KYFT---------PEN-------------------DGLKQ---------DW-----TG--TCW--------MNP----PYG--R-EIGKWMKKAY-----ES--SL-T---GNAT--V--VC--L---VPA-------RTDTKWF-HDFAM-K-G--EVR------FIKGRLKF----------GGSKNSA-----------PFP--SAVVI-FR-----------------
      K035_3825_Acinetobacter_baumannii_42057_4_629017472                            -------------------------------------QDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      DESKU_RS03925_Desulfotomaculum_kuznetsovii_503587829                           ------N-E--SMF--------SSRTGEW-----ETPQTFFDALDAEF----H-FTLDVCARPE-------NAKCA------RFFT---------PEQ-------------------DGLRQ---------SW-----AG-ETCW--------MNP----PYG--R-EIGRWVEKAY-----NE--AR-R---G-AV--V--VA--L---LPA-------RTDTRWW-HRYVM-RAA--EIR------FVEGRLKF----------GGAENSA-----------PFP--SVVVV-FT-----------------
      RBAU_RS10310_Bacillus_amyloliquefaciens_752856685                              ---E--T-K--TNFNQGVFFNPEDRTDVW-----ATPIDFFNKINERY----K-LNLDVCAKPS-------NAKCK------NFFT---------PEI-------------------DGLKQ---------KW-----VG--RVW--------MNP----PYG--R-EIKKWIKKAY-----EE--VE-N---G-NS--EIAVC--L---VPA-------RTCSAWW-HEYCM-K-G--EIL------FIRHRLKF----------GGSKINA-----------PFP--NALVI-FS-----------------
      G454_RS0102995_Desulfovirgula_thermocuniculi_654100680                         ------N-R--VLF--------SSATSEW-----ETPQELFARLHAEF----G-FTLDVCARPW-------NAKCT------RYFS---------PEQ-------------------NGLIQ---------EW-----AP-ETCW--------MNP----PYG--R-EISRWVRKAW-----EE--AQ-K---G-AT--V--VC--L---LPS-------RTDTAWW-HEYVM-RAA--EIR------FIRGRLHF----------EGAKNGA-----------PFP--SCVVV-FR-----------------
      K035_3825_Acinetobacter_baumannii_691039509                                    -------------------------------------QDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR-----------------
      TS65_RS13365_Aneurinibacillus_migulanus_759006369                              ------T-A--VMF--------SSATDEW-----ATPQDFFDQLNQEF----H-FTLDPCATHE-------SAKCA------RYFT---------EED-------------------NGLAQ---------DW-----TG-EIVF--------MNP----PYG--R-VLGQWVKKAF-----EE--SI-K---G-AT--V--VC--L---LPA-------RTDTRWF-HDYIY-HRA--EIR------FVKGRLKF----------GDSKNSA-----------PFP--SMVVI-FN-----------------
      MMA_RS11485_Janthinobacterium_sp_Marseille_501027971                           ------S-K--VHF--------SSATPEW-----YTPQSTFDVLNAEF----G-FTLDPCCTHE-------NAKCD------RHFT---------MAE-------------------NGLSQ---------DW-----SN-EVTF--------MNP----PYG--R-EIKEWMRKAY-----ES--SL-S---G-AT--V--VC--L---VPA-------RTDTAWW-HDYSI-K-G--EIR------FLRGRLKF----------GGAKTNA-----------PFP--SAIVI-FR-----------------
      Q332_RS01180_Pseudobacteroides_cellulosolvens_739064083                        ------T-E--IMF--------SSKSDEW-----ETPQQFFDKLHKEF----N-FQLDVCATAE-------NAKCD------KYYT---------KID-------------------DGLSQ---------SW-H-HWAQ--RCW--------MNP----PYG--R-NIDKWIKKAF-----DE--SQ-E---G-AT--V--VC--L---IPA-------RTDTKYW-HTYCM-K-AH-EIR------FVKGRLKF-S---------NSKDCA-----------PFP--SAIVV-FK-----------------
      HMPREF0179_03455_Bilophila_wadsworthia_3_1_6_316921487                         ------MNP--ALF--------SSAKEDW-----ETPREFFERLDGEF----H-FDLDVCAFPH-------NAKCP------TYFT---------KED-------------------DGLAR---------DW-----GN-RVCW--------MNP----PYG--K-AIKAWMTKAL-----DA--SR-R---G-AT--V--VC--L---VPS-------RTDTAWW-HDTVI-A-GGAEVR------FARGRLRF----------VGAEHPA-----------PFP--SAVVI-FR-----------------
      HMPREF0179_RS04985_Bilophila_wadsworthia_749811142                             ------MNP--ALF--------SSAKEDW-----ETPREFFERLDGEF----H-FDLDVCAFPH-------NAKCP------TYFT---------KED-------------------DGLAR---------DW-----GN-RVCW--------MNP----PYG--K-AIKAWMTKAL-----DA--SR-R---G-AT--V--VC--L---VPS-------RTDTAWW-HDTVI-A-GGAEVR------FARGRLRF----------VGAEHPA-----------PFP--SAVVI-FR-----------------
      RDMS_RS01750_Deinococcus_sp_RL_736377798                                       ------M-A--VHY--------SSEKHDW-----TTPRSFFDELNAEF----N-FTLDAAASPH-------NALCS------RYFT---------EAD-------------------DGLSQ---------PW-----TG-TV-W--------CNP----PYG--R-QIGRWIAKAA-----QS--AC-E---G-AT--V--VM--L---IPA-------RTDTAAW-HDHILFN-PQAEVR------FVRGRLRF----------GDATANA-----------PFP--SAVII-FR-----------------
      NZ45_03810_Clostridium_botulinum_700273311                                     ------T-A--VMF--------SSETDLW-----ATPQDFFDKLNKEF----D-FDLDPCATHE-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----QG-HKVF--------CNP----PYG--R-GIKDWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTRYF-HEYIY-H-KAKEIR------FVKGRLKF----------GSAKNSA-----------PFP--SMVVV-FR-----------------
      BZ26_RS0118830_Clostridium_botulinum_489480013                                 ------T-A--VMF--------SSETDLW-----ATPQDFFDKLNKEF----N-FDLDPCATKE-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----GR-YRVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SK-K-Q-N-TT--V--VM--L---IPA-------RTDTKYF-HSYIY-H-KAKEIR------FIKGRLKF----------GNAKNSA-----------PFP--SMIVV-FR-----------------
      A11W_RS0107210_Staphylococcus_hominis_515743089                                ------M-E--VHY--------SSKSNEW-----ATPQNLFDELNEEF----N-FTLDPCATDE-------NAKCS------KYFT---------IED-------------------DGLSK---------DW-----SK-DVVF--------MNP----PYG--R-EIKKWNKKAY-----EE--SL-N---G-AT--V--VC--L---IPA-------RTDTTYW-HDFIF-D-RADDIR------FLRGRLKF----------GNSKNSA-----------PFP--SAIVV-YR-----------------
      V006_02512_Staphylococcus_aureus_686297326                                     ------M-E--VHY--------SSKTNEW-----TTPQNLFDELNGEF----N-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      HMPREF9988_RS10060_Staphylococcus_epidermidis_488427723                        ------M-E--VHY--------SSKSNEW-----ATPQKLFDELDKEF----N-FTLDPCATDE-------NAKCN------KHFT---------IED-------------------DGLSK---------DW-----SK-DVVF--------MNP----PYG--R-EIKKWIKKAY-----EE--SL-N---G-AT--V--VC--L---IPA-------RTDTTYW-HDFIF-D-KADDIR------FLRGRLKF----------GNSKNSA-----------PFP--SAIVV-YL-----------------
      T259_RS08765_Clostridium_botulinum_748203410                                   ------T-A--VMF--------SSETDLW-----ATPQDFFDKLNKEF----N-FDLDPCATHE-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----QG-YKVF--------CNP----PYG--R-VLKDWVKKCY-----EE--SL-K-P-N-TT--V--VM--L---IPA-------RTDTKYF-HEYIY-H-KVKEIR------FVKGRLKF----------GDAKNSA-----------PFP--SMVVV-F------------------
      CO98_RS04645_Staphylococcus_aureus_739716594                                   ------M-S--VHF--------SSKSNEW-----TTPQYLFDELNEEF----N-FTLDPCATDE-------NAKCS------KYFT---------IED-------------------DGLSK---------DW-----SN-DVVF--------MNP----PYG--R-EIKKWIKKAY-----EE--SL-N---G-AT--V--VC--L---IPA-------RTDTTYW-HDFIF-D-KADDIR------FLKGRLKF----------GNSKNSA-----------PFP--SSIVI-YE-----------------
      TH16_RS01985_Staphylococcus_caprae_488372936                                   ------M-S--VHF--------SSKSNEW-----YTPQYLFDELNEKY----Q-FTLDPCASHE-------NAKCD------KYFT---------IED-------------------DGLTK---------DW-----SK-DIVF--------MNP----PYG--R-NIKHWIKKAY-----EE--SV-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-NAYNIK------FLKGRIKF----------GGAVNSA-----------PFP--SAIVV-FK-----------------
      PI74_RS05125_Clostridium_botulinum_500994137                                   ------T-A--VMF--------SSGTDLW-----ATPQDFFDKLNKEF----D-FDLDPCATHK-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----QG-YKVF--------CNP----PYG--R-SIKDWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTRYF-HEYIY-N-KAKEIR------FVKGRLKF----------GDAKNSA-----------PFP--SMVVV-F------------------
      RK90_RS13240_Staphylococcus_aureus_446374006                                   ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      SA930_RS14870_Staphylococcus_aureus_446374005                                  ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      AS94_12270_Staphylococcus_aureus_686449191                                     ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVEKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      U183_02276_Staphylococcus_aureus_686300364                                     ------M-E--VHY--------SSKTNEW-----TTPQNLFDDLNREF----N-FTLDPCSTDE-------NAKCQ------KHYT---------END-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKHWVKKAY-----EE--SI-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GESKNSA-----------PFP--SAIIV-YR-----------------
      Phi93_04_Lactococcus_phage_phi93_673939868                                     ------N-E--LMF--------SSKTDLW-----STPNDFFDKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------KEE-------------------NGLLQ---------DW-----GN-EVVF--------CNP----PYG--R-QIKEWIKKSY-----EE--SQ-K-D-N-TT--V--VM--L---IPA-------RTDTIYF-HEYIY-H-KA-EIR------FIKGRLKF----------GNAKNSA-----------PFP--SMVVI-FE-----------------
      RL05_RS02180_Staphylococcus_aureus_446374007                                   ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLSEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      SAZ172_RS05790_Staphylococcus_aureus_554679133                                 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PLP--SAIIV-YR-----------------
      CLOSCI_00567_[Clostridium]_scindens_ATCC_35704_167664126                       --------K--ALF--------SSAKEDW-----ATPQDFFDELNKEF----H-FDLDPCADAE-------NAKCK------EFFT---------KEQ-------------------NGLLQ---------DW-----GG-RCVF--------CNP----PYG--RTSTGEWIKKCY-----EE--AQ-K-P-G-TV--V--VA--L---IPA-------RTDTRFF-HDYIY-H-KA-EIR------FIKGRLHF----------GGCKDAA-----------PFP--SMVVV-FR-----------------
      ERS140248_02184_Staphylococcus_aureus_678260344                                ------M-E--VHY--------SSKTNEW-----ATPQNLFDDLNREF----N-FTLDPCSTDE-------NAKCQ------KHYT---------AKD-------------------NGLIQ---------DW-----SE-DVVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SV-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------SESKNSA-----------PFP--SAIIV-YR-----------------
      T666_02640_Staphylococcus_aureus_686391504                                     ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKCWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      SD74_RS18965_Clostridium_botulinum_752703286                                   ------T-A--VMF--------SSETDLW-----ATPQDFFDELNKEF----D-FDLDPCATHE-------NAKCD------KYYT---------IVE-------------------DGLKQ---------DW-----QG-HKVF--------CNP----PYG--R-GIKDWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTKYF-HSYIY-H-KAKEIR------FIKGRLKF----------GDAKNSA-----------PFP--SMVVV-F------------------
      W619_00569_Staphylococcus_aureus_686419170                                     ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNGEF----N-FTLDPCSTDE-------NAKCQ------KHYT---------AKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKHWVKKAY-----EE--SV-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GESKNSA-----------PFP--SAIIV-YR-----------------
      CLOSCI_RS06430_[Clostridium]_scindens_748651356                                --------K--ALF--------SSAKEDW-----ATPQDFFDELNKEF----H-FDLDPCADAE-------NAKCK------EFFT---------KEQ-------------------NGLLQ---------DW-----GG-RCVF--------CNP----PYG--RTSTGEWIKKCY-----EE--AQ-K-P-G-TV--V--VA--L---IPA-------RTDTRFF-HDYIY-H-KA-EIR------FIKGRLHF----------GGCKDAA-----------PFP--SMVVV-FR-----------------
      SAGV69_RS11740_Staphylococcus_aureus_506511035                                 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-RDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR-----------------
      OR63_RS06485_Clostridium_tetani_737140426                                      ------T-A--VMF--------SSETDLW-----ATPQEFYNELNKEF----N-FDLDPCATHE-------NAKCP------KYYT---------VVE-------------------DGLKQ---------DW-----QG-HKVF--------CNP----PYG--R-EISKWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTKYF-HSYIY-R-KAKEIR------FIKGRLKF----------GNAKNSA-----------PFP--SMVVV-F------------------
      AWRIB429_RS09790_Oenococcus_oeni_768719850                                     ------N-E--LMF--------SSKTDLW-----STPNDFFDKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------KEE-------------------NGLLQ---------DL-----GN-EVVF--------CNP----PYG--R-QIKDWVKKSY-----EE--SQ-K-D-N-TT--V--VM--L---IPA-------RTDTIYF-HEYIY-H-KA-EIR------FIKGRLKF----------GNAKNSA-----------PFP--SMVVI-FE-----------------
      BN927_RS09785_Lactococcus_lactis_554763517                                     ------K-E--LMF--------SSKTDLW-----STPWNFFDKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------IEE-------------------DGLLQ---------DW-----GN-EVVF--------CNP----PYG--R-QIKDWVKKAY-----EE--SQ-K-D-D-TT--V--VM--L---IPA-------RTDTIYF-HEYIY-H-KA-EIR------FIKGRLKF----------GDAKNAA-----------PFP--SMVVI-FR-----------------
      EMTOL_RS19950_Emticicia_oligotrophica_504839093                                ---N--I-K--AIFSC--------KTTNW-----ETPQDLFDELDKQY----N-FTLDVCATSE-------NAKCN------EFFT---------PEI-------------------DGLKQ---------EW-----KG--MCW--------MNP----PYG--R-EIGKWVRKAH-----LE--VI-T---G-RC--RI-IA--L---LPA-------RTDTKWF-HEWVLNK-H--EIK------FIKGRLRF----------SDSKNSA-----------PFP--SMLVI-FE-----------------
      BN981_00304_Halobacillus_trueperi_635344555                                    ------M-N--VHY--------SSKSNDW-----ATPQDFFDGLDNEF----N-FTLDPCATSE-------NAKCD------NYFT---------IED-------------------DGLKQ---------SW-----EG-ETVF--------CNP----PYG--R-EIKLWVKKAF-----QE--SK-K-P-N-TK--V--VM--L---IPA-------RTDTKYF-HDYIY-M-QA-RVR------FIKGRLKF----------GNGKGNA-----------PFP--SMVVI-F------------------
      BN981_RS01350_Halobacillus_737533832                                           ------M-N--VHY--------SSKSNDW-----ATPQDFFDGLDNEF----N-FTLDPCATSE-------NAKCD------NYFT---------IED-------------------DGLKQ---------SW-----EG-ETVF--------CNP----PYG--R-EIKLWVKKAF-----QE--SK-K-P-N-TK--V--VM--L---IPA-------RTDTKYF-HDYIY-M-QA-RVR------FIKGRLKF----------GNGKGNA-----------PFP--SMVVI-F------------------
      BN981_RS01320_Halobacillus_737532221                                           ------M-D--VHY--------SSKTNEW-----ATPQDFFDELNTEF----N-FTLDPCATPD-------NAKCD------KYFT---------EKD-------------------DGLEQ---------SW-----EG-ETVF--------CNP----PYG--R-GIKHWVKKAY-----QE--ST-K-P-N-TT--V--VL--L---IPS-------RTDTRYF-HDYVY-H-KS-EIR------FLKGRLKF----------GDGSGNA-----------PFP--SMVAI-YR-----------------
      QI18_RS10395_Lactococcus_lactis_746045508                                      ------R-E--LMF--------SSKTDLW-----STPWNFFEKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------IKE-------------------DGLLQ---------DW-----GN-EVVF--------CNP----PYG--R-KIKDWVKKAY-----EE--SQ-K-D-N-TT--V--VM--L---IPA-------RTDTIYF-HEYVY-H-KA-EVR------FIKGRLKF----------GDAKNAA-----------PFP--SMVVI-FR-----------------
      RM98_RS18265_Chromobacterium_violaceum_759932528                               ---S--E-Q--VHF--------SSKTDEW-----PTPQALFDQLHEEF----G-FTLDVCATAE-------NAKCE------RFFT---------REQ-------------------DGLAQ---------DW-----SR-DVVW--------MNP----PFG--H-QIKLWMAKAY-----RS--SI-D---G-AL--V--VC--L---VPA-------RTDTRWF-HRHALKA-A--EIR------ALDKRLRF----------DGAKAKA-----------PFP--AVLVV-Y------------------
      RN16_RS04075_Chromobacterium_subtsugae_759887196                               ---S--E-Q--IHF--------SSKTDEW-----PTPQALFDQLHAEF----G-FTLDVCATQE-------NAKCE------RFFT---------REQ-------------------DGLAQ---------DW-----SR-EVVW--------MNP----PFG--H-QIKLWMAKAY-----RS--SI-D---G-AL--V--VC--L---VPA-------RTDTRWF-HRHALKA-A--EIR------ALDKRLRF----------DGAKAKA-----------PFP--AVLVV-Y------------------
      KU40_RS04850_Clostridium_botulinum_737823765                                   ------------MF--------SSKTDMW-----STPQDFYNKLNQEF----N-FNLDPCSTNE-------NAKCE------RHYT---------IAE-------------------DGLKQ---------NW-----VG-STVF--------CNP----PYG--R-VLKDWVKKCY-----EE--SK-K-D-N-TT--V--VM--L---IPA-------RTDTTYF-HNYIY-K-KVKEIR------FIRGRLKF----------GDCKNAA-----------PFP--SMVVV-F------------------
      DALK_RS23730_Desulfatibacillum_alkenivorans_506429612                          -------------------------NCEW-----ATPQDLFDSLNKEF----H-FTLDPCCTIE-------NAKCE------RFYT---------KAE-------------------DGLSQ---------DW-----TG-ETVF--------MNP----PYS--RSEMPKWIQRAY-----ES--SL-A---G-SK--V--VC--L---LPA-------KTDTRWF-HDFCL-K-G--EIR------FIKGRICF----------GSGEGRA-----------PFP--SMVVI-FN-----------------
      CC61_RS14530_Chromobacterium_sp_C-61_748184431                                 ---A--E-N--VHF--------STGKDEW-----PTPQALFDQLNAEF----G-FTIDVCATAK-------NAKCT------KFYT---------QVD-------------------DGLAQ---------NW-----AG-EVVW--------MNP----PFG--H-SIKLWMAKAY-----RS--SL-D---G-AL--V--VC--L---VPA-------RTDTRWW-HRVVMKA-S--EVR------VLDKRLRF----------DGGNHKA-----------PFP--AVVVV-F------------------
      SAG0375_00225_Streptococcus_agalactiae_GB00984_527786367                       ------Q-K--SLL--------SSDKDYW-----ETPQTFFKKLNNEF----D-FDLDVASSHD-------NAKCK------NHFT---------VVE-------------------DGLSQ---------DW-----TG--NVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SL-K-PYN-NV--I--VL--L---IPA-------RTDTKYW-HDYIF-G-KAKDIR------YLKGRLKF-T-I-----NGKENYPA-----------PFP--SAVII-F------------------
      SAG0375_RS111635_Streptococcus_agalactiae_487848063                            ------Q-K--SLL--------SSDKDYW-----ETPQTFFKKLNNEF----D-FDLDVASSHD-------NAKCK------NHFT---------VVE-------------------DGLSQ---------DW-----TG--NVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SL-K-PYN-NV--I--VL--L---IPA-------RTDTKYW-HDYIF-G-KAKDIR------YLKGRLKF-T-I-----NGKENYPA-----------PFP--SAVII-F------------------
      DK41_RS08970_Streptococcus_agalactiae_642982737                                ------Q-K--SLL--------SSDKDYW-----ETPQTFFKKLNNEF----D-FDLDVASSHD-------NAKCK------NHFT---------VVE-------------------DGLSQ---------DW-----TG--NVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SL-K-PYN-NV--I--VL--L---IPA-------RTDTKYW-HDYIF-G-KAKDIR------YLKGRLKF-T-I-----NGKENYPA-----------PFP--SAVII-Y------------------
      AB660_RS05030_Chromobacterium_subtsugae_828144310                              ---D--A-S--IHF--------RSSTDEW-----PTPQLLFDELHAEF----Q-FTVDVCATPG-------NAKCP------RYYT---------RAD-------------------DGLAQ---------DW-----SA-ETVW--------MNP----PFG--H-GIKFWMEKAL-----KS--AR-A---G-AT--V--VC--L---VPS-------RTDTRWW-HRYAMWA-A--EIR------CLDKRLQF----------DGGSAKA-----------PFP--AVVIV-F------------------
      ANACOL_RS13845_Anaerotruncus_colihominis_493931641                             ------N-K--ALL--------SSKRLDW-----CTPRDFFDALDVEF----H-FTLDAAATEK-------SAKCA------KYYT---------PET-------------------DGLSA---------SW-----AG-ETVF--------CNP----PYG--R-EIKAWIKKGF-----EE--GQ-Q-S-G-TT--V--VL--L---IPS-------RTDTEYF-HKYIL-G-KA-EIR------FLKGRLKFTD-E-----EGLTQDAA-----------PFP--SMLVI-YR-----------------
      ELEN_RS13090_Eggerthella_lenta_506241510                                       ------G-G--VAF--------SSERHYW-----ETPQDLFDTLDNEF----H-FTLDPASTDE-------NAKCE------KHYT---------IED-------------------DGLCQ---------SW-----AG-ERVF--------CNP----PYG--R-ELSKWVKKAH-----AE--VALN-P-G-TV--V--VM--L---IPA-------RTDTTYF-HDYIY-H-KA-EVR------FIRGRLRF-C-I-----QGKAKDAA-----------PFP--SMVVV-FR-----------------
      VE20213_RS09880_Clostridiales_bacterium_VE202-13_639741003                     --------------------------MDY-----CTPQDFFDKLNQEF----H-FTLDAAATSK-------SAKCP------QYYT---------PEI-------------------DGIKN---------PW-SIAGGG--AVF--------CNP----PYG--R-KIGKWVRKAY-----EE--SR-N---G-TT--V--VL--L---IPA-------RTDTAYF-HDYIY-G-CA-EIR------FVRGRLHF-TDE-----DGNTYDRA-----------PFP--SMVVI-YN-----------------
      T370_RS0102475_Bilophila_wadsworthia_736486878                                 --------N--VHF--------LSKKHDW-----ATPWPLFRELNARF----GPCELDVCATAR-------NAKCG------NFFS---------PEE-------------------DGLRQ---------VW-----HG--VCW--------MNP----PYG--R-ALPHWMAKAV-----NEIEME-R---A-ER--V--IC--L---LPA-------RTDTAWW-HRYVL-P-FAAEIH------YLRGRIRF----------EGAGSSA-----------PFP--SAVVI-F------------------
      HMPREF0555_0745_Leuconostoc_mesenteroides_subsp_cremoris_ATCC_19254_227352467  ------D-K--VLF--------SSNSMVW-----ETPKDYFDKLNRKF----K-FDLDACASDT-------NHKVD------TYFT---------EDD-------------------NALEQ---------KW-----GG--NVF--------MNP----PYG--R-HIGKFIKKAY-----EE--HL-R-DPN-RF--I--VM--L---IPS-------RTDTKYW-HEYIQ-D-KAT-VK------FIKGRLKF-E-I-----DGESMDAA-----------PFP--SALVV-YG-----------------
      HMPREF0555_RS01180_Leuconostoc_mesenteroides_738135700                         ------D-K--VLF--------SSNSMVW-----ETPKDYFDKLNRKF----K-FDLDACASDT-------NHKVD------TYFT---------EDD-------------------NALEQ---------KW-----GG--NVF--------MNP----PYG--R-HIGKFIKKAY-----EE--HL-R-DPN-RF--I--VM--L---IPS-------RTDTKYW-HEYIQ-D-KAT-VK------FIKGRLKF-E-I-----DGESMDAA-----------PFP--SALVV-YG-----------------
      L964_RS00605_Leuconostoc_pseudomesenteroides_491052808                         ------S-K--ALF--------SSKSMVW-----ETPKDYFDKLNRKF----K-FDLDACASDT-------NHKVD------TYFT---------EDD-------------------DALEQ---------KW-----GG--NVF--------MNP----PYG--R-HIGEFIKKAY-----EE--HL-R-DPN-RF--I--VM--L---IPS-------RTDTKYW-HEYIQ-D-KAT-VK------FIKGRLKF-E-L-----DGRPMNTA-----------PFP--SALII-YG-----------------
      OPIT5_22060_Opitutaceae_bacterium_TAV5_573475515                               -------------------M-TSSMDMTW-----GTPQVWFDYLHLEF----G-FTLDPCCLHQ-------TAKCK------KHYT---------PAE-------------------DGLAQ---------SW-----AE-ERVF--------MNP----PYG--R-DLPKWMKKAY-----EE--AR-D-N-G-TL--I--VC--F---VPA-------RVDTEWW-HRYAT-K-G--EVR------FPKGRVKF----------ADALDSA-----------PFP--VAVVI-FR-----------------
      OPIT5_RS20660_Opitutaceae_bacterium_TAV5_763429761                             ------------------------MDMTW-----GTPQVWFDYLHLEF----G-FTLDPCCLHQ-------TAKCK------KHYT---------PAE-------------------DGLAQ---------SW-----AE-ERVF--------MNP----PYG--R-DLPKWMKKAY-----EE--AR-D-N-G-TL--I--VC--F---VPA-------RVDTEWW-HRYAT-K-G--EVR------FPKGRVKF----------ADALDSA-----------PFP--VAVVI-FR-----------------
      BR71_RS03710_Chromobacterium_haemolyticum_759948263                            -TDE--A-S--IHF--------RSTRDDW-----ETPQDLFDALHAEF----G-FTVDVCASDK-------TAKCV------RYYT---------KAD-------------------NGLAK---------DW-----SN-EVVW--------MNP----PFG--H-VTKRWMDKAR-----LS--SM-R---G-AT--V--VC--L---VPA-------RVSVLWW-HRNVFLA-S--EVR------CLRPRLQF----------VGAAQKA-----------PFD--AVLVI-FR-----------------
      RM98_RS09640_Chromobacterium_violaceum_759929100                               ---A--E-N--IHF--------RSGRDDW-----ETPHDLFASLNAEF----G-FTVDVCASEK-------TAKCP------RYYT---------PAM-------------------NGLAQ---------DW-----GG-ETVW--------MNP----PFG--H-VTKRWMDKAR-----LS--SL-Q---G-AT--V--VC--L---VPA-------RTSVLWW-HRNVFLA-S--EVR------CIRPRLQF----------VGAAQKA-----------PFD--AVLVV-F------------------
      CLOM621_RS14915_Clostridiales_492715347                                        ------N-D--ALL--------SSKNMCW-----CTPPDFFAELDREF----H-FELDPASTDK-------SAKCA------KHFT---------PDD-------------------DGLKQ---------DW-----GG-YCVF--------CNP----PYG--R-AIADWVRKGY-----EE--SR-K-P-G-TT--V--VM--L---IPS-------RTDTAYF-HDWIF-G-KASEVR------FLRGRLKFTD-E-----DGNGEDAA-----------PFP--SAVIV-WR-----------------
      HMPREF1020_RS23965_Clostridium_sp_7_3_54FAA_496656604                          ------N-D--ALL--------SSKNMCW-----CTPPDFFAELDREF----H-FELDPASTDK-------SAKCA------KHFT---------PDD-------------------DGLKQ---------DW-----GG-YRVF--------CNP----PYG--R-AIADWVRKGY-----EE--SR-K-P-G-TT--V--VM--L---IPS-------RTDTAYF-HDWIF-G-KASEVR------FLRGRLKFTD-E-----DGNGEDAA-----------PFP--SAVIV-WR-----------------
      N644_0465_Lactobacillus_plantarum_AY01_544589963                               ------N-K--ALF--------TSNKEDW-----ETPQDFYDRLNAKY----H-FEWDLAASDG-------NAKCG------DYFT---------SDD-------------------NSLEQ---------DW-E-RLSG--NLF--------LNP----PYG--R-ELKLWVKKAS-----ET--QL-K---H-DQF-L--VM--L---IPS-------RTDTSYW-HDYIF-N-HA-EIE------FLRGRLKF-E-V-----DGVGGDSA-----------PFP--SAVVI-YT-----------------
      ZJ316_RS06725_Lactobacillus_plantarum_505193070                                ------N-K--ALF--------TSNKEDW-----ETPQDFYDRLNAKY----H-FEWDLAASDG-------NAKCG------HYFT---------SDD-------------------NSLEQ---------DW-E-RLSG--NLF--------LNP----PYG--R-ELKLWVKKAS-----ET--QL-K---H-DQF-L--VM--L---IPS-------RTDTSYW-HDYIF-N-HA-EIE------FLRGRLKF-E-V-----DGVGGDSA-----------PFP--SAVVI-YT-----------------
      HMPREF0178_RS14615_Bilophila_sp_4_1_30_496774991                               ------N-R--ALF--------SSVKDDW-----PTPWEFFHNLDLEF----D-FTLDVCAVPW-------SAKVW------RYCV---------PPHALRVWGETTFRRLFPDALVDGLAH---------SW-----AG-ERCY--------MNP----PYG--R-EIGPWVEKAR-----RE--AE-R---G-AL--V--VG--L---LPA-------RTDTAWF-HEHVY-R-AATEIR------FLKGRLKF----------EGAAASA-----------PFP--SMIAV-WG-----------------
      HMPREF0179_RS15850_Bilophila_wadsworthia_491165768                             ------N-R--ALF--------SSVKDDW-----PTPWEFFRNLDLEF----D-FTLDVCAVPW-------SAKVC------RYCV---------PPHALRVWGETTFRRLFPDALVDGLAH---------SW-----AG-ERCY--------MNP----PYG--R-EIGPWVEKAR-----RE--GE-R---G-AL--V--VG--L---LPA-------RTDTAWF-HEHVY-R-AATEIR------FLKGRLKF----------EGAAASA-----------PFP--SMIAV-WG-----------------
      H627_RS17735_Lactobacillus_harbinensis_737460398                               KP----G-G--AAL--------TSNKDDW-----ETPQAFFESLNAKY----H-FAIDLAASKD-------NAKCD------RYFS---------VAD-------------------DSLLQ---------DWSD-DFGG--AMY--------LNP----PYG--R-HIGDWVKKAY-----ET--SL-R---V-NVP-I--VL--L---IPA-------RTDTSYW-HDYIF-G-KA-SIK------FIRGRLKF-E-Q-----NGMAGGPA-----------PFP--SAIIV-YN-----------------
      A323_gp73_Acinetobacter_bacteriophage_AP22_388570840                           ------M-N--VHF--------SSDKQTW-----ETPQDLFDKLNDIF----N-FNLDACAEHD-------TAKVK------KYFT---------IDD-------------------NALIQ---------DW-----IG-S-VW--------CNP----PYN--R-EQIKFIEKAL-----NE--SL-K-H-K-ST--V--VL--L---IPA-------RPETKVW-QNVIF-K-SASQIC------FIKGRLKF----------GNSKYNA-----------PFP--SALIV-FG-----------------
      TY47_RS06930_Lactobacillus_brevis_754895979                                    ------N-N--ALL--------SSEKNYW-----ETPHDFFKKLNEKY----Y-FSFDLAASPE-------NTKCE------NFFS---------EED-------------------NSLTK---------AW-H-ELKG--NLF--------LNP----PYG--R-ELRKWVKKAY-----EE--SL-K---K-HDGYI--VL--L---IPA-------RTDTSYW-HDFIF-G-KA-QIN------FLRGRIKF-E-L-----HGESKDAA-----------PFP--SAIVI-YG-----------------
      N644_RS02335_Lactobacillus_plantarum_727092536                                 ------N-K--ALF--------TSNKEDW-----ETPQDFYDRLNAKY----H-FEWDLAASDG-------NAKCG------DYFT---------SDD-------------------NSLEQ---------DW-E-RLSG--NLF--------LNP----PYG--R-ELKLWVKKAS-----ET--QL-K---H-DQF-L--VM--L---IPS-------RTDTSYW-HDYIF-N-HA-EIE------FLRGRLKF-E-V-----DGVGGDSA-----------PFP--SAVVI-Y------------------
      MCOL2_RS04700_Listeria_fleischmannii_738104299                                 ------D-R--VIF--------SSERDDW-----ETPTDLFNELDKEF----L-FDLDATANKN-------NAKCP------KFFT---------KEQ-------------------NALVQ---------EW-----RG--SVF--------CNP----PYG--R-EIQKFIEKAY-----IE--SK-K-AYC-ER--V--VL--L---IPA-------RTDTKIW-HDFIF-P-FSKEII------FIKGRLKY-E-L-----NKISNSPA-----------PFP--SAIII-FE-----------------
      G469_RS0106650_Atopobium_fossor_654811069                                      ------T-S--GLR--------SSASNEW-----TTPKDLFDELNREF----K-FTVDAASTHE-------NALVD------KHWT---------LAE-------------------DGLAQ---------CW-----DG-ERVW--------CNP----PYG--R-QIAQWVKKAS-----EA--V------G-GV--V--VM--L---IPA-------RTDTSYW-HDYVF-P-NASDIR------FIRGRLHF----------SQSKTAA-----------PFP--SAIVV-FE-----------------
      B7017_p0034_Bifidobacterium_breve_704484626                                    ------G-A--AAM--------TSNKDDW-----ETPQALFDQLDKEF----H-FTLDAASNDQ-------NAKCE------HHYT---------AEN-------------------SGLEH---------SW-----GG-ETVF--------CNP----PYG--R-NIGDWIRKAS-----QE--AS-K-P-D-TL--V--VL--L---VPA-------RTDTRWF-QNYIL-H-RA-EVR------FLPGRLKY-E-V-----DGQAGEAA-----------PFP--SMVVI-MR-----------------
      VPUCM_1151_Vibrio_parahaemolyticus_UCM-V493_584469889                          ------L-D--VMFSS-ANS-GDKSKDKW-----QTPPEIFAQLNDRF----G-FTLDAAAEPE-------TALCE------KYFT---------EED-------------------DALKQ---------DW-----SG-HVVF--------CNP----PYS--K--LRVFAKKAY-----EE--SL-K---G-TT--V--VM--L---VPA-------RTDTQAC-HDYLA-N-G--EMY------FIRGRLKF-L-K-----VGELQDAA-----------PFP--SVVCV-LG-----------------
      Q331_RS21100_Afifella_pfennigii_736470177                                      ------H-Q--SLY--------SSRTEEW-----ETPPALFERLDRIF----G-FRLDACASPA-------NRKCE------TWFS---------AAD-------------------NALER---------SW---AEHG--RVW--------LNP----PYG--R-RIAGFMRKAF-----EE--SQ-K---G-AL--V--VA--L---VPA-------RTDTLWW-HEWVN-G-KA-DIV------FLKGRLKY-LDE-----NRRERSPA-----------PFP--SALVV-Y------------------
      CLOM621_08346_Clostridium_sp_M62/1_291074040                                   --------------------------MCW-----CTPPDFFAELDREF----H-FELDPASTDK-------SAKCA------KHFT---------PDD-------------------DGLKQ---------DW-----GG-YCVF--------CNP----PYG--R-AIADWVRKGY-----EE--SR-K-P-G-TT--V--VM--L---IPS-------RTDTAYF-HDWIF-G-KASEVR------FLRGRLKFTD-E-----DGNGEDAA-----------PFP--SAVIV-WR-----------------
      consensus/100%                                                                 ..........................................................................................................................................................................................................................................................................................................................
      consensus/95%                                                                  ............................a...........h..h...a.......phD.hs..........s.p.........ahs...............................ssh............h....................h.P....P.......h...h.............................h........s........b.............................................................................................
      consensus/90%                                                                  ............................W..........ha..h...a.......phD.hu..........s.ps........aho...............................ssh............W....................h.P....P.s.....l..hh.bh.......p...............l..lh......hP........b.s...h.......................................................................................
      consensus/85%                                                                  ............................W..........hFp.lp..a......hplD.hu..........s.+s........aho...............................suh.p..........W...........a........h.P....Phs.....l..hh.+h......pp.....p...s.....l..lh......hP........b.s..ha.p.hh................h.....p.................h.........................................
      consensus/80%                                                                  ........p.............p.....W.......s.phFpplp..a......hplD.hA..p.......NsKs.......paao...............................suh.p.........pW..........ha........h.P....Phu.....l.phl.Kh......pp.....p...s.....l..lh......hP........bss..aa.pphh.......p........h.bsbbp.............s...h...........P.s...........................
      consensus/75%                                                                  ........p.............p.pps.W.......spphFpplp.bF....p.hplD.hA..p.......NuKC.......paao...............................suh.p.........pW..........ha........hsP....Pau.....l.phlpKh......pp..s..p...s.....l..lh......hPh.......bss..aa.pphh.......ch.......albsbhch..........s.s.s.h...........Pbs...hh.l.h..................
      consensus/70%                                                                  ......p.p.............pspps.W......sspphFcpLspbF....p.FslDhhA..p.......NAKCp......paao...............................suLpp.........pW.....s....ha........hsP....Pau....bl.chlpKu......cp..u..p...s.....l..lh..h...lPh.......bscs.aa.pchhh......clp......alcsbh+a..........s.upssu...........Phs..shl.l.a..................
      
    • Multiple sequence alignment of the CrRem1 subgroup of the DIRS1-like/Group2-Clade 1 of N6-MTase

                                          <----N6A DNA methylase-------------------------------------------Str-2---------------------------------------Strand-4---------------------------------------------------------------------->
      RES                                  M-FRRDLFESI---------Q--------------SSLG-V----------------------TFTYDAAC-NDEGTN----ALCARYASPGRSFLASNVAG----E-CVWINPPYSHIRDWQQHYMRCKASDPEH-T-S--AVFCVPA-WPQVHRLMQKAKYSLVARYPA---GTPLFSKPGPDGQR
      ALIGN                                -----HHHHHH-----------------------------------------------------EEEEEE-------------EEEEE--------EE----------EEEE-------HHHHHHHHH-----------E--EEEEE-------HHHHHHHHHHHHEEE-------EEEE--------
      HMM                                  ----HHHHHHH---------H--------------HH--------------------------EEEHHHHH-H----------EE--EE-----EEEE--------E-EEEE------HHHHHHHHH------------E--EEEEE----------------EEEEEEE------EEEEE-------
      FREQ                                 ----HHHHHHH---------H--------------HHHH-H----------------------HHHHHHHH-----HH----HHHHHHHHH-----------------EE----------HHHHHHHHHHHHHH-----E--EEEE---------HHHHH-H---EEEEEE------EEEE-------
      PSSM                                 ------HHHHH---------H--------------HH--------------------------EEEEEEE--------------------------------------EEEE-------HHHHHHHHHHHHH-------E--EEEEEE-----HHHHHHHH--EEEEEEE------EEEEE-------
      CONF                                 7-848847687---------8--------------8715-4----------------------56405651-457723----56532211052111443488----0-5964488878157899989887740299-5-0--8998515-776004566505310699970---86377747888864
      FINAL                                ----HHHHHHH---------H--------------HHH-------------------------EEEHHHHH------H----HHHHHHH-----------------E-EEEE-------HHHHHHHHHHHHHH------E--EEEEE-------HHHHHHH---EEEEEEE-----EEEEE-------
      SOL25                                B-B---BB--B--------------------------B--B-----------------------BBBBBBB-----------BBB--BBB----BB---B--------BBBBBBBB--B--BB--BB-B--------B-B--BBBBBB--B----BB---B-B-BBB-BB-------BBB--------
      SOL5                                 -------B--B-----------------------------------------------------BBB-B----------------------------------------BBB---------B--------------------BBBBBB-----------------B----------------------
      SOL0                                 ------------------------------------------------------------------B-B--------------------------------------------------------------------------BBB------------------------------------------
      _Crei_29423677                      M-FRRDLFESI---------Q--------------SSLG-V----------------------TFTYDAAC-NDEGTN----ALCARYASPGRSFLASNVAG----E-CVWINPPYSHIRDWQQHYMRCKASDPEH-T-S--AVFCVPA-WPQVHRLMQKAKYSLVARYPA---GTPLFSKPGPDGQR
      _Crei_29423694                      M-FRRDLFESI---------Q--------------SSLG-V----------------------TFTYDAAC-NDEGTN----ALCARYASPGRSFLASNVAG----E-CVWINPPYSHIRDWQQHYMRCKASDPEH-T-S--AVFCVPA-WPQVHRLMQKAKYSLVARYPA---GTPLFSKPGPDGQR
      Vcar1000014369_Vcar_Vcar1000014369  M-FLPDEFRNV---------E--------------NMLG-R----------------------QFTFDAAC-NNSGDN----SLCTRFASPSNSFLTSDVSG----EFFVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR
      Vcar1000006797_Vcar_Vcar1000006797  M-FLPDEFRNV---------E--------------NMLG-R----------------------QFTFDAAC-NNSGDN----SLCTRFASPSNSFLTSDVSG----E-FVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR
      Vcar1000013547_Vcar_Vcar1000013547  M-YLPDEFRNV---------E--------------NMLG-R----------------------QFTFDAAC-NNSGDN----SLCTRFASPSNSFLTSDVSG----E-FVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR
      Vcar1000003571_Vcar_Vcar1000003571  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTHVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000013363_Vcar_Vcar1000013363  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFVVPKDVFTFECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000006689_Vcar_Vcar1000006689  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAIPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000010818_Vcar_Vcar1000010818  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFLDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000011421_Vcar_Vcar1000011421  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSCFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000012306_Vcar_Vcar1000012306  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000012957_Vcar_Vcar1000012957  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFAQPVDVGTR
      Vcar1000013369_Vcar_Vcar1000013369  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000013410_Vcar_Vcar1000013410  M-FLRDEFRRV---------E--------------TELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000006131_Vcar_Vcar1000006131  M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR
      Vcar1000007918_Vcar_Vcar1000007918  ---------------------------------------------------------------------------GDN----SLCTRFASPSNSFLTSDVSG----E-FVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR
      Vcar1000012324_Vcar_Vcar1000012324  --LSCTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVRMVPRTSTPSTFISQYLESKTTNPRT---S--AIIILPD-RPTAPWAPLIRHMTVVRRFPA---GARIVCRRDPSDAS
      Vcar1000014693_Vcar_Vcar1000014693  --IAQLLFLEY---------D--------------SQYG------------------------PFTVDAYC-DDLGLT----AQLSPFFSPSRPFLSTDIEG----E-CVWMVPPVDNASTNIARYLDAKTANPNT---S--AIIVLPD-RPQAPWGPLIRHMTIVRRFPA---GAQIVCRPLSSDPS
      Vcar1000005651_Vcar_Vcar1000005651  --IARSLFLEY---------D--------------SQYG------------------------PFTVDAYC-DDLGLT----AQLSPFFSPSRPFLSTDIEG----E-CVWMVPPVDNASTIVARYLDAKTASPNT---S--AIIVLPD-RPQAPWAPLIRHMTIVRRFPA---GAQIVCRPTTSDPS
      Vcar1000014193_Vcar_Vcar1000014193  ------------------------------------------------------------------------------------LSPFFSPSRPFLSTDIEG----E-CIWMVPPVDNVSTIVARYLDAKTANPKT---S--AIIVLPD-RPQAPWAPLIRHMTIVHRFPA---GAQIVCRPTTSDPS
      Vcar1000006954_Vcar_Vcar1000006954  ------------------------------------------------------------------------------------LSPFFSPSRPFLSTDIEG----E-CVWMVPPVDNASTIVARYLDAKTANPKT---S--AIIVLPD-RPQASWAPLIRHMTIVRRFPA---GAQIVCRPTSSDPS
      Vcar1000001334_Vcar_Vcar1000001334  --LTRAIFLDL---------D--------------SQYG------------------------PFTVDACC-D--GIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIILPD-RPTAPWAPLIRHMTVMRRFPA---GAWIVCRRDPSDAS
      Vcar1000012130_Vcar_Vcar1000012130  --LARTVFLDL---------D--------------SQYS------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSTPSTFISQYLESKSTNPRT---S--AVIVLPD-RPTAPWTPLIRHMTVVRRFPA---GARIVCHRDPSDAS
      Vcar1000003269_Vcar_Vcar1000003269  --LSCTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----THVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSTLSTFISQYLESKTTNPCTAPIS--TLVII-----------------------------------------
      Vcar1000012920_Vcar_Vcar1000012920  --LSRTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAHVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIVLPD-RPTAPWAPLIRHMTVVRRFPT---GARIVCRRDPSDAS
      Vcar1000014935_Vcar_Vcar1000014935  --LSRTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIVLPD-RPTAPWAPLIRHMTVVRRFPA---GARIVCRRDPSDAS
      Vcar1000010947_Vcar_Vcar1000010947  --L---------------------------------HYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIVLPD-RPTAPWAPLIRHMTVVRRFPA---GARIVCRRDPSDAS
      Vcar1000003043_Vcar_Vcar1000003043  --LSCTIFLDL---------D--------------SQYG------------------------PFTIDACC-DDFGIN-------VPFFSPPHSFLSAQVDG----E-CVWMVLPTLNPSAFISQYLESKTTNPCT---S--AIIVLPD-RPTAPWALLIHHMAIVRRFPA---GVQIFCRRDPSDAS
      Vcar1000010860_Vcar_Vcar1000010860  --VMGKAAEWY---------HDLFTHKGSMLTIQGMCDD-F----------------------VLLFSDEC-ATDANGNNWFAFDTLINFAQDSVTSHDLNG----Q-HIWCSTPADRVIPWLNNYSTFKQRTPDT-T-R--AVILVPK-CIHLEKEFQTRGWTLLKEFTK---NSRIFSEPKPGGGH
      Vcar1000008009_Vcar_Vcar1000008009  ----RSVFLRL---------Q--------------KASG-R----------------------VFTFDATC-NGGSD-----ALCPKFACSSSPIISHDVSG----Q-HVWCQPPPKCVNDWLDHYSACKQRSPES-T-S--AIFVVPK-CTQFEQTFQKRGWTLLKEFLS---DAHIFSVPKSGGGR
      Vcar1000014314_Vcar_Vcar1000014314  I-IDYELLRQL---------E--------------RRIG-R----------------------TFSLDAAA-NDDGSN----SVCKMFASPTRSFLNSDCSG----H-TIWMNPPMKLLSDFLRHYHRCKSYDPSI-S-SWTAPWASTA-WPRVNCLDVNCSMCRVLTFPA---KSVLFNGLSLEGKT
      Vcar1000000459_Vcar_Vcar1000000459  --VLRSVFLRL---------Q--------------KASG-R----------------------VFAIDASC-NVRSDN----SLCPIFAC-PDSFTSHNLSG----Q-HIWCNAPADRAIPWLNNYSTFKQRTPDT-T-S--AVILVPK-CAHLEKEFQTRGWTLLKEFTK---NSSIFSEPKPGGGR
      Vcar1000000870_Vcar_Vcar1000000870  --------ERY---------Q--------------LRQP-T----------------------AAAAEPAR-VTGPTA----ALQPQRGTRQDSFTSHNLSG----Q-HIWCNAPADRAIPWLNNFSTFKQRAPDT-T-S--AVILVPK-CAHLEKEFQTRGWTLLKEFTK---NSNIFSEPKPGGGR
      Vcar1000014202_Vcar_Vcar1000014202  M-FLPGEFRNV---------E--------------NMLG-R----------------------QFTFDTAC-NNSGDN----SLCTRFASLSSSFLTSDVSG----E-FPGFKPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQVERRRQGASGARTAGGKC---VPPVCGKHLPQMPR
      Vcar1000001335_Vcar_Vcar1000001335  --LSCTIFLDL---------Y--------------SQYG------------------------LFTVDACC-DDFGIN----AHIIPFFSPSCSFLSAQVDGLWFAY-SLWMFWAVR-----------------------------------------------------------------------
      Vcar1000010806_Vcar_Vcar1000010806  --VAPPRFVRR---PDGNCLW--------------SKYW---QGLGRSVHESEVASIATTMGREFTLDACA-SDCGLS----AVCNAFSCTARPFLDTNIAG----H-TVWMAPNAADLPAYVTHYRACKPLAPQS---T--AACILVP-SGTEP--SLLKGMKLVRRYPV---GTSLFYVPDVQGSR
      Vcar1000015077_Vcar_Vcar1000015077  --VAPPRFVRR---PDGNCLW--------------SKYW---QGLGRSVHESEVASIATTMGREFTLDACA-SDCGLS----AVCNAFSCTARPFLDTNIAG----H-TVWMAPNAADLPAYVTHYRACKPLAPQS---T--AACILVP-SGTEP--SLLKGMKLVRRYPV---GTSLFYVPDVQGSR
      Vcar1000003045_Vcar_Vcar1000003045  A-LANPVVLSKLSLPQPGA-W--------------STFLLSPPASGRYL-------VVRGPQRGATMTLCELQAYGES----AFEMELVPAPRP-------------------PQPPPMP-----FPSPQPPSPP-------------P-PPSPP--PSSKSSSRSREWPV---SKVSTNPTAVTAVT
      Vcar1000007748_Vcar_Vcar1000007748  --LARTIFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLGPSR-----------------------RRYLESKSTNPRT---S--AVIVLPD-RPTAPWTPLIRHMTVVRRFPA---GARIVCHRDPSDAS
      Vcar1000007858_Vcar_Vcar1000007858  --LARTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQ---------------------------VDALKQHNRL---H--QPPLPPV-DENTP-TSRSTTLDMVKQWATDVGGGDCNSHDDSKGHL
      Vcar1000005961_Vcar_Vcar1000005961  AGIAPPRFVRR---PDGNCLW--------------SKYW---QGLGRSVHESEVASIATIMDREFTLDACA-SDCGLS----AVCNAFSCTARPFLDTNVAG----H-TVWMAPNAADLPAYVTHYRACKPLAPQS---T--AACILVP-SGTEP--SLLKGMKLVRRYPV---GTSLFYVPDVQGSR
      consensus/100%                      .......................................................................................b....ps..............................................................................................
      consensus/95%                       ...........................................................................s...........h.s.spsh.s.p..........................a...b...Pp....p.....h......p...................................
      consensus/90%                       ................................................................hshss...ss.u.s....u.hs.a.ssspsFhs.pl.G....p..lbh.ss.........pY..sK...Ppp...o..Ahhhls....p.....b....p.h.pas....ss.lh....s.s.p
      consensus/85%                       ..h....h.p.........................p.b..........................FohDsss.ss.G.s....u.hs.FhssupsFhsspl.G....p..lWhsss.sp...h..pY.psK..pPpo...o..Ahhhls...sp.....b....pll+pass...ss.lh....s.ssp
      consensus/80%                       ..h..s.Fbph........................p.hs.........................FThDAss.ss.G.s....u.hs.FhSsupsFhssplsG....c..lWhsPP.sp...a.ppY.psK..sPpo...S..AlhhlP...sph....b...hpll+pass...usplh.p.sssssp
      consensus/75%                       ..h..s.Fbph.........p..............sphG.........................FThDAsC.ss.G.s....u.hs.FhSPopSFhssplsG....c..VWhsPP.sphp.a.ppY.csK..sPcT...S..AlhhlPc..sph....b...hpll+pass...Gsplhsp.sssssp
      consensus/70%                       ..h..s.Fbpl.........p..............sphG.........................FThDAsC.ss.G.N....u.hs.FhSPSpSFhssslsG....E.hVWhsPPhsphp.a.ppY.csK..sPcT...S..AlhhlPc.bsph....b...hpll+pass...Gsplhsp.sssssp
      

    • General notes, phyletic distribution and gene neighborhoods of DIRS1-like/Group2-Clade 1 of N6-MTase

      General notes:

      DIRS1-type/CrRem1-like. These are usually degenerate in their active site suggesting that they are inactive. Further, several proteins are fragmentary suggesting that they are inactive or degenerate transposons. The CrRem1-like transposase versions, however, are likely to be active. In a previous study, we had shown that CrRem1-like transposon is present in upto 39 copies in the draft genome of Volvox. However, these sequence seem to have been eliminated completely from the Genbank assembly. They can be unified to the PCIF1 group in searches suggesting that they are likely to have been derived from PCIF1. The CrRem1-like MTases are shown separately just to illustrate the Volvox expansion (These are derived from an earlier study).

    • General notes, phyletic distribution and gene neighborhoods of DIRS1-like/Group2-Clade 1 of N6-MTase

      
      GI                Domain architecture                                                                                  Pfam                                                                                                                                                                                                                                                                                                                                                                                                                       Gene_name             len   Taxonomy                                              Species                        Genbank/other annotation
      # 1;
      Adig1000018270    -                                                                                                    LMF1                                                                                                                                                                                                                                                                                                                                                                                                                       Adig1000018270        178   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.14096                                                        
      Adig1000000319    -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Adig1000000319        1180  eukaryota>cnidaria                                    Acropora digitifera            adi_v1.18602                                                        
      Adig1000005510    RnaseH                                                                                               RNase_H+Dam+Phage_integrase                                                                                                                                                                                                                                                                                                                                                                                                Adig1000005510        720   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.22623                                                        
      Adig1000019867    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Adig1000019867        132   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.15446                                                        
      Adig1000014879    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Adig1000014879        599   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.11174                                                        
      Adig1000005157    CASPASE                                                                                              RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Adig1000005157        1007  eukaryota>cnidaria                                    Acropora digitifera            adi_v1.00366                                                        
      Adig1000023250    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Adig1000023250        374   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.02813                                                        
      Adig1000019364    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Adig1000019364        374   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.15024                                                        
      Adig1000005444    CASPASE                                                                                              Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Adig1000005444        970   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.22572                                                        
      Adig1000006995    NACHT+RnaseH                                                                                         RVT_1+RNase_H                                                                                                                                                                                                                                                                                                                                                                                                              Adig1000006995        2418  eukaryota>cnidaria                                    Acropora digitifera            adi_v1.04684                                                        
      Adig1000000407    -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Adig1000000407        1291  eukaryota>cnidaria                                    Acropora digitifera            adi_v1.18677                                                        
      Adig1000012708    RnaseH+ZNKNUCK                                                                                       RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Adig1000012708        747   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.00852                                                        
      Adig1000012208    -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Adig1000012208        895   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.00785                                                        
      Adig1000005671    MYB                                                                                                  RVT_1+Phage_integrase                                                                                                                                                                                                                                                                                                                                                                                                      Adig1000005671        1221  eukaryota>cnidaria                                    Acropora digitifera            adi_v1.22768                                                        
      Adig1000014194    SbcC                                                                                                 -                                                                                                                                                                                                                                                                                                                                                                                                                          Adig1000014194        412   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.10547                                                        
      Adig1000010273    SWC3                                                                                                 YkyA                                                                                                                                                                                                                                                                                                                                                                                                                       Adig1000010273        349   eukaryota>cnidaria                                    Acropora digitifera            adi_v1.06943                                                        
      384498610         -                                                                                                    RVT_1+Dam                                                                                                                                                                                                                                                                                                                                                                                                                  RO3G_13812            370   eukaryota>fungi                                       Rhizopus delemar RA 99-880     hypothetical protein RO3G_13812 [Rhizopus delemar RA 99-880].       
      384486240         -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        RO3G_03124            172   eukaryota>fungi                                       Rhizopus delemar RA 99-880     hypothetical protein RO3G_03124 [Rhizopus delemar RA 99-880].       
      384495516         RnaseH                                                                                               RNase_H                                                                                                                                                                                                                                                                                                                                                                                                                    RO3G_10717            264   eukaryota>fungi                                       Rhizopus delemar RA 99-880     hypothetical protein RO3G_10717 [Rhizopus delemar RA 99-880].       
      384497823         -                                                                                                    RVT_1+Dam                                                                                                                                                                                                                                                                                                                                                                                                                  RO3G_13025            1062  eukaryota>fungi                                       Rhizopus delemar RA 99-880     hypothetical protein RO3G_13025 [Rhizopus delemar RA 99-880].       
      Mcir1000003087    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Mcir1000003087        229   eukaryota>fungi>basal                                 Mucor circinelloides           Genemark1.3173_g                                                    
      Pbla1000001570    RnaseH                                                                                               RVT_1+DNA_pol_viral_C+Dam                                                                                                                                                                                                                                                                                                                                                                                                  Pbla1000001570        570   eukaryota>fungi>basal                                 Phycomyces blakesleeanus       e_gw1.22.32.1                                                       
      Amac1000015656    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Amac1000015656        424   eukaryota>fungi>blastocladiomycota                    Allomyces macrogynus            Allomyces macrogynus ATCC 38327 hypothetical protein (424 aa)      
      Rall1000003656    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Rall1000003656        306   eukaryota>fungi>cryptomycota                          Rozella allomycis              O9G_006133m.01                                                      
      Rall1000004614    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Rall1000004614        193   eukaryota>fungi>cryptomycota                          Rozella allomycis              O9G_005572m.01                                                      
      Bcir1000007834    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Bcir1000007834        192   eukaryota>fungi>mucoromycotina                        Backusella circina             fgenesh1_pg.3_#_153                                                 
      Bcir1000015312    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Bcir1000015312        227   eukaryota>fungi>mucoromycotina                        Backusella circina             fgenesh1_pg.351_#_5                                                 
      Mver1000012212    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Mver1000012212        214   eukaryota>fungi>zygomycete                            Mortierella verticillata        Mortierella verticillata NRRL 6337 hypothetical protein (214 aa)   
      Mver1000007812    RnaseH                                                                                               Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Mver1000007812        408   eukaryota>fungi>zygomycete                            Mortierella verticillata        Mortierella verticillata NRRL 6337 hypothetical protein (408 aa)   
      Mver1000006934    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Mver1000006934        281   eukaryota>fungi>zygomycete                            Mortierella verticillata        Mortierella verticillata NRRL 6337 hypothetical protein (281 aa)   
      485639708         -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        EMIHUDRAFT_252968     232   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516     hypothetical protein EMIHUDRAFT_252968 [Emiliania huxleyi CCMP1516].
      Sarc1000007323    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000007323        147   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (147 aa)           
      Sarc1000003931    HPC2                                                                                                 -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000003931        111   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (111 aa)           
      Sarc1000000340    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Sarc1000000340        291   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (291 aa)           
      Sarc1000008354    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Sarc1000008354        611   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (611 aa)           
      Sarc1000010122    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Sarc1000010122        129   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (129 aa)           
      Sarc1000005744    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000005744        206   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (206 aa)           
      Sarc1000001093    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Sarc1000001093        320   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (320 aa)           
      Sarc1000009775    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Sarc1000009775        395   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (395 aa)           
      Sarc1000012860    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000012860        81    eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (80 aa)            
      Sarc1000000559    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000000559        181   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (181 aa)           
      Sarc1000002310    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000002310        167   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (167 aa)           
      Sarc1000000227    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000000227        158   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (158 aa)           
      Sarc1000007502    -                                                                                                    rve                                                                                                                                                                                                                                                                                                                                                                                                                        Sarc1000007502        346   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (346 aa)           
      Sarc1000008574    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000008574        91    eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (91 aa)            
      Sarc1000004600    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000004600        341   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (341 aa)           
      Sarc1000009974    -                                                                                                    Spc7_N                                                                                                                                                                                                                                                                                                                                                                                                                     Sarc1000009974        333   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (333 aa)           
      Sarc1000013043    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000013043        244   eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (243 aa)           
      Sarc1000011591    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Sarc1000011591        77    eukaryota>ichthyosporea                               Sphaeroforma arctica            Sphaeroforma arctica JP610 hypothetical protein (77 aa)
      Smar1000013635    RnaseH                                                                                               RNase_H+Dam                                                                                                                                                                                                                                                                                                                                                                                                                Smar1000013635        229   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR001291-PA pep:novel scaffold:Smar1:AFFK01015044:37482:38168:-1 gene:SMAR001291 transcript:SMAR001291-RA
      Smar1000013446    RnaseH                                                                                               -                                                                                                                                                                                                                                                                                                                                                                                                                          Smar1000013446        296   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR001465-PA pep:novel scaffold:Smar1:JH430561:19939:21493:-1 gene:SMAR001465 transcript:SMAR001465-RA
      Smar1000007923    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Smar1000007923        77    eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR006470-PA pep:novel scaffold:Smar1:JH431701:276342:276938:1 gene:SMAR006470 transcript:SMAR006470-RA
      Smar1000009144    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Smar1000009144        159   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR005333-PA pep:novel scaffold:Smar1:JH431599:48277:48814:-1 gene:SMAR005333 transcript:SMAR005333-RA
      Smar1000001056    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Smar1000001056        193   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR012323-PA pep:novel scaffold:Smar1:JH432129:50962:52819:1 gene:SMAR012323 transcript:SMAR012323-RA
      Smar1000003623    RnaseH                                                                                               RNase_H+Dam                                                                                                                                                                                                                                                                                                                                                                                                                Smar1000003623        408   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR010103-PA pep:novel scaffold:Smar1:JH431960:112870:114093:-1 gene:SMAR010103 transcript:SMAR010103-RA
      Smar1000006848    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Smar1000006848        89    eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR015629-PA pep:novel scaffold:Smar1:JH431783:5867:6133:1 gene:SMAR015629 transcript:SMAR015629-RA
      Smar1000011252    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Smar1000011252        183   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR003465-PA pep:novel scaffold:Smar1:AFFK01018421:2875:3482:-1 gene:SMAR003465 transcript:SMAR003465-RA
      Smar1000004446    RnaseH                                                                                               RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Smar1000004446        425   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR009389-PA pep:novel scaffold:Smar1:JH431896:51121:52792:1 gene:SMAR009389 transcript:SMAR009389-RA
      Smar1000007917    -                                                                                                    RNase_H                                                                                                                                                                                                                                                                                                                                                                                                                    Smar1000007917        429   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR006464-PA pep:novel scaffold:Smar1:JH431701:250462:251748:1 gene:SMAR006464 transcript:SMAR006464-RA
      Smar1000004632    SUN                                                                                                  F5_F8_type_C                                                                                                                                                                                                                                                                                                                                                                                                               Smar1000004632        655   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima             SMAR009255-PA pep:novel scaffold:Smar1:JH431878:386467:389171:-1 gene:SMAR009255 transcript:SMAR009255-RA
      210086106         PIN+SET+CHASE3+MA+WXG+RNASE-EG+MA+LAMG+LamG+LEVANB+LamG+LAMG                                         Laminin_B+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_B+Laminin_EGF+Laminin_EGF+Laminin_I+Flagellar_rod+MAD+Myosin_tail_1+Troponin+Laminin_II+Laminin_G_1+Laminin_G_2+Laminin_G_1+Laminin_G_2                                                                                                                                                                              BRAFLDRAFT_131954     2475  eukaryota>metazoa>chordata                            Branchiostoma floridae         hypothetical protein BRAFLDRAFT_131954 [Branchiostoma floridae].    
      313236395         CDC27                                                                                                RVT_1+GRP+FoP_duplication+Phage_integrase+DUF3807                                                                                                                                                                                                                                                                                                                                                                          GSOID_T00016890001    1568  eukaryota>metazoa>chordata                            Oikopleura dioica              unnamed protein product [Oikopleura dioica].                        
      313244116         -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      GSOID_T00010067001    725   eukaryota>metazoa>chordata                            Oikopleura dioica              unnamed protein product [Oikopleura dioica].                        
      210086104         DISCOIDIN+PIN+SET+MA+MODE-HTH+NIT+XpaC+sigma+CHASE3+MODE-HTH+MA+WXG+BZIP+LamG+LamG+LamG+LamG+LamG    Acyl-CoA_ox_N+Laminin_N+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_B+Laminin_B+Laminin_EGF+VSP+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_B+Laminin_EGF+Laminin_EGF+Laminin_I+Myosin_tail_1+AAA_27+AAA_27+MAD+Myosin_tail_1+PspA_IM30+Cast+Lebercilin+FadA+Troponin+Laminin_II+Laminin_G_1+Laminin_G_2+Laminin_G_2+Herpes_BLLF1+Herpes_BLLF1+Pneumo_att_G+Laminin_G_1+Laminin_G_2    BRAFLDRAFT_131952     3505  eukaryota>metazoa>chordata                            Branchiostoma floridae         hypothetical protein BRAFLDRAFT_131952 [Branchiostoma floridae].    
      313235579         -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          GSOID_T00014774001    357   eukaryota>metazoa>chordata                            Oikopleura dioica              unnamed protein product [Oikopleura dioica].                        
      327268405         TPR+TPR+TPR+TPR+TPR+JOR                                                                              TPR_11+TPR_11+TPR_12+TPR_11+TPR_17+TPR_11+TPR_11+TPR_1+TPR_1+Herpes_BLLF1+JmjC                                                                                                                                                                                                                                                                                                                                             LOC100564779          1580  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis            PREDICTED: lysine-specific demethylase 6A-like [Anolis carolinensis].
      327286446         RnaseH+TBC                                                                                           RVT_1+RabGAP-TBC                                                                                                                                                                                                                                                                                                                                                                                                           LOC100566709          1049  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis            PREDICTED: hypothetical protein LOC100566709 [Anolis carolinensis]. 
      327274991         NLPC                                                                                                 LRAT                                                                                                                                                                                                                                                                                                                                                                                                                       LOC100563610          382   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis            PREDICTED: hypothetical protein LOC100563610 [Anolis carolinensis]. 
      125838616         RnaseH                                                                                               RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      LOC100005823          560   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                    PREDICTED: reverse transcriptase/ribonuclease H/putative methyltransferase-like [Danio rerio].
      189519778         RnaseH                                                                                               RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      LOC100003059          790   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                    PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio].
      189546720         -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      LOC100149602          762   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                    PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio].
      189516844         RnaseH                                                                                               RVT_1+RNase_H                                                                                                                                                                                                                                                                                                                                                                                                              LOC558928             684   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                    PREDICTED: reverse transcriptase/ribonuclease H/putative methyltransferase-like [Danio rerio].
      17066696          RnaseH                                                                                               RVT_1+Dam                                                                                                                                                                                                                                                                                                                                                                                                                  -                     785   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis         reverse transcriptase/ribonuclease H/putative methyltransferase, partial [Tetraodon nigroviridis].
      125850303         RnaseH                                                                                               RVT_1+RNase_H                                                                                                                                                                                                                                                                                                                                                                                                              LOC561204             892   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                    PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio].
      125835610         RnaseH                                                                                               RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      LOC100008043          684   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                    PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio].
      156209094         -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        NEMVEDRAFT_v1g220590  131   eukaryota>metazoa>cnidaria                            Nematostella vectensis         predicted protein [Nematostella vectensis].                         
      156219436         -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          NEMVEDRAFT_v1g208020  360   eukaryota>metazoa>cnidaria                            Nematostella vectensis         predicted protein [Nematostella vectensis].                         
      156216881         STYKIN                                                                                               Pkinase_Tyr+Dam                                                                                                                                                                                                                                                                                                                                                                                                            NEMVEDRAFT_v1g211073  426   eukaryota>metazoa>cnidaria                            Nematostella vectensis         predicted protein [Nematostella vectensis].                         
      156209473         -                                                                                                    DUF640                                                                                                                                                                                                                                                                                                                                                                                                                     NEMVEDRAFT_v1g220156  672   eukaryota>metazoa>cnidaria                            Nematostella vectensis         predicted protein [Nematostella vectensis].                         
      321452244         -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          DAPPUDRAFT_267974     433   eukaryota>metazoa>crustacea                           Daphnia pulex                  hypothetical protein DAPPUDRAFT_267974 [Daphnia pulex].             
      170819710         SRDOMAIN                                                                                             Herpes_BLLF1+RVT_1                                                                                                                                                                                                                                                                                                                                                                                                         -                     1291  eukaryota>metazoa>crustacea                           Daphnia pulex                  reverse transcriptase [Daphnia pulex].                              
      321459175         -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        DAPPUDRAFT_257338     330   eukaryota>metazoa>crustacea                           Daphnia pulex                  hypothetical protein DAPPUDRAFT_257338 [Daphnia pulex].             
      170819724         RnaseH                                                                                               RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      -                     757   eukaryota>metazoa>crustacea                           Daphnia pulex                  reverse transcriptase [Daphnia pulex].                              
      115628917         -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          LOC582271             229   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus  PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Strongylocentrotus purpuratus].
      291220884         -                                                                                                    DUF829                                                                                                                                                                                                                                                                                                                                                                                                                     LOC100368707          403   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii       PREDICTED: hypothetical protein [Saccoglossus kowalevskii].         
      291236647         ZZ-like                                                                                              -                                                                                                                                                                                                                                                                                                                                                                                                                          LOC100378332          667   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii       PREDICTED: hypothetical protein [Saccoglossus kowalevskii].         
      291232955         -                                                                                                    Dam+Phage_integrase                                                                                                                                                                                                                                                                                                                                                                                                        LOC100369420          685   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii       PREDICTED: predicted protein-like [Saccoglossus kowalevskii].       
      156542171         RnaseH                                                                                               RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      LOC100116731          585   eukaryota>metazoa>hexapoda                            Nasonia vitripennis            PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Nasonia vitripennis].
      307196129         RnaseH                                                                                               RVT_3+PCIF1_WW                                                                                                                                                                                                                                                                                                                                                                                                             EAI_17025             251   eukaryota>metazoa>hexapoda                            Harpegnathos saltator          hypothetical protein EAI_17025, partial [Harpegnathos saltator].    
      156538873         -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      LOC100115061          405   eukaryota>metazoa>hexapoda                            Nasonia vitripennis            PREDICTED: hypothetical protein, partial [Nasonia vitripennis].     
      156542658         -                                                                                                    Phage_integrase                                                                                                                                                                                                                                                                                                                                                                                                            LOC100121360          1054  eukaryota>metazoa>hexapoda                            Nasonia vitripennis            PREDICTED: similar to tyrosine recombinase [Nasonia vitripennis].   
      156539065         -                                                                                                    RVT_1+Phage_int_SAM_1                                                                                                                                                                                                                                                                                                                                                                                                      LOC100117226          787   eukaryota>metazoa>hexapoda                            Nasonia vitripennis            PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase, partial [Nasonia vitripennis].
      156546508         -                                                                                                    Phage_integrase                                                                                                                                                                                                                                                                                                                                                                                                            LOC100123785          389   eukaryota>metazoa>hexapoda                            Nasonia vitripennis            PREDICTED: similar to tyrosine recombinase [Nasonia vitripennis].   
      307212135         -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          EAI_06111             213   eukaryota>metazoa>hexapoda                            Harpegnathos saltator          hypothetical protein EAI_06111, partial [Harpegnathos saltator].    
      307193617         -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        EAI_10577             149   eukaryota>metazoa>hexapoda                            Harpegnathos saltator          hypothetical protein EAI_10577, partial [Harpegnathos saltator].    
      307201692         -                                                                                                    DNA_pol_viral_C+Dam                                                                                                                                                                                                                                                                                                                                                                                                        EAI_09447             251   eukaryota>metazoa>hexapoda                            Harpegnathos saltator          hypothetical protein EAI_09447, partial [Harpegnathos saltator].    
      307198641         -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        EAI_12430             155   eukaryota>metazoa>hexapoda                            Harpegnathos saltator          hypothetical protein EAI_12430, partial [Harpegnathos saltator].    
      307183886         -                                                                                                    DNA_pol_viral_C                                                                                                                                                                                                                                                                                                                                                                                                            EAG_00458             240   eukaryota>metazoa>hexapoda                            Camponotus floridanus          hypothetical protein EAG_00458, partial [Camponotus floridanus].    
      156542106         -                                                                                                    Phage_integrase                                                                                                                                                                                                                                                                                                                                                                                                            LOC100115034          858   eukaryota>metazoa>hexapoda                            Nasonia vitripennis            PREDICTED: similar to tyrosine recombinase [Nasonia vitripennis].   
      Aque1000012689    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Aque1000012689        284   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.212957                                                         
      Aque1000016213    -                                                                                                    Phage_int_SAM_1                                                                                                                                                                                                                                                                                                                                                                                                            Aque1000016213        334   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.216481                                                         
      Aque1000017746    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000017746        158   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.218014                                                         
      Aque1000024217    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000024217        222   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.224485                                                         
      Aque1000022247    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Aque1000022247        129   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.222515                                                         
      Aque1000026010    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000026010        147   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.226278                                                         
      Aque1000015330    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000015330        199   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.215598                                                         
      Aque1000019121    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000019121        220   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.219389                                                         
      Aque1000008102    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000008102        232   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.208349                                                         
      Aque1000012578    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000012578        221   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.212846                                                         
      Aque1000018252    RnaseH                                                                                               RVT_1+Dam                                                                                                                                                                                                                                                                                                                                                                                                                  Aque1000018252        708   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.218520                                                         
      Aque1000010245    -                                                                                                    Dam                                                                                                                                                                                                                                                                                                                                                                                                                        Aque1000010245        256   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.210513                                                         
      Aque1000019993    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000019993        556   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.220261                                                         
      Aque1000019816    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000019816        230   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.220084                                                         
      Aque1000001701    -                                                                                                    Phage_int_SAM_1                                                                                                                                                                                                                                                                                                                                                                                                            Aque1000001701        312   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.201769                                                         
      Aque1000022602    -                                                                                                    RVT_1+Dam                                                                                                                                                                                                                                                                                                                                                                                                                  Aque1000022602        350   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.222870                                                         
      Aque1000008788    -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      Aque1000008788        577   eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.209041                                                         
      Aque1000011876    -                                                                                                    -                                                                                                                                                                                                                                                                                                                                                                                                                          Aque1000011876        79    eukaryota>metazoa>porifera                            Amphimedon queenslandica       Aqu1.212144                                                         
      159465941         Phd-YefM+RnaseH                                                                                      RVT_1+RNase_H                                                                                                                                                                                                                                                                                                                                                                                                              CHLREDRAFT_180868     1199  eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii      hypothetical protein CHLREDRAFT_180868 [Chlamydomonas reinhardtii]. 
      22415757          -                                                                                                    RVT_1                                                                                                                                                                                                                                                                                                                                                                                                                      ORF-B                 829   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis  reverse transcriptase [Volvox carteri f. nagariensis].              
      
      
      CrRemI subclade
      GI                      Domain architecture                      Gene_name        len    Species                     Class                                 Genbank annotation
      Vcar1000015077          N6-MTase                                 Vcar1000015077   707    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_231000001
      Vcar1000014693          N6-MTase                                 Vcar1000014693   671    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_fgenesh4_pg.C_1190017
      Vcar1000010806          N6-MTase                                 Vcar1000010806   511    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_83000041
      Vcar1000005651          N6-MTase                                 Vcar1000005651   483    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_107000001
      Vcar1000014193          N6-MTase                                 Vcar1000014193   466    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_80000013
      Vcar1000007748          N6-MTase                                 Vcar1000007748   460    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_102000028
      Vcar1000001334          N6-MTase                                 Vcar1000001334   443    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_fgenesh4_pg.C_120014
      Vcar1000007858          N6-MTase                                 Vcar1000007858   397    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_fgenesh4_pg.C_300114
      Vcar1000012920          N6-MTase                                 Vcar1000012920   372    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_95000030
      Vcar1000014935          N6-MTase                                 Vcar1000014935   321    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_273000001
      Vcar1000003269          N6-MTase                                 Vcar1000003269   313    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_25000154
      Vcar1000013363          N6-MTase                                 Vcar1000013363   311    Volvox carteri              eukaryota>viridiplantae>chlorophyta   e_gw1.101.32.1
      Vcar1000003043          N6-MTase                                 Vcar1000003043   302    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_23000205
      Vcar1000012324          N6-MTase                                 Vcar1000012324   287    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_22000129
      Vcar1000000870          N6-MTase+pepsin                          Vcar1000000870   267    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_fgenesh4_pg.C_10481
      Vcar1000014369          N6-MTase                                 Vcar1000014369   252    Volvox carteri              eukaryota>viridiplantae>chlorophyta   e_gw1.112.21.1
      Vcar1000006797          N6-MTase                                 Vcar1000006797   251    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_Genewise1Plus.C_190101
      Vcar1000013547          N6-MTase                                 Vcar1000013547   251    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.120.3.1
      Vcar1000003571          N6-MTase                                 Vcar1000003571   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.10.241.1
      Vcar1000006131          N6-MTase                                 Vcar1000006131   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   e_gw1.31.78.1
      Vcar1000006689          N6-MTase                                 Vcar1000006689   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.75.44.1
      Vcar1000010818          N6-MTase                                 Vcar1000010818   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_Genewise1.C_830117
      Vcar1000011421          N6-MTase                                 Vcar1000011421   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.116.24.1
      Vcar1000012306          N6-MTase                                 Vcar1000012306   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.61.90.1
      Vcar1000012957          N6-MTase                                 Vcar1000012957   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.105.40.1
      Vcar1000013369          N6-MTase                                 Vcar1000013369   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.101.33.1
      Vcar1000013410          N6-MTase                                 Vcar1000013410   250    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.111.24.1
      Vcar1000010947          N6-MTase                                 Vcar1000010947   225    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_fgenesh4_pg.C_410059
      Vcar1000007918          N6-MTase                                 Vcar1000007918   224    Volvox carteri              eukaryota>viridiplantae>chlorophyta   e_gw1.11.237.1
      Vcar1000014202          N6-MTase                                 Vcar1000014202   213    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_Genewise1Plus.C_800030
      Vcar1000012130          N6-MTase                                 Vcar1000012130   160    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_57000068
      Vcar1000000459          N6-MTase                                 Vcar1000000459   159    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_Genewise1Plus.C_10079
      Vcar1000001335          N6-MTase                                 Vcar1000001335   151    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_12000015
      Vcar1000014314          N6-MTase                                 Vcar1000014314   150    Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_Genewise1.C_1320001
      Vcar1000008009          N6-MTase                                 Vcar1000008009   139    Volvox carteri              eukaryota>viridiplantae>chlorophyta   gw1.11.234.1
      Vcar1000006954          N6-MTase                                 Vcar1000006954   346    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_9000001
      Vcar1000010860          N6-MTase+RT                              Vcar1000010860   569    Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_89000033
      Vcar1000003045          N6-MTase                                 Vcar1000003045   2927   Volvox carteri              eukaryota>viridiplantae>chlorophyta   fgenesh4_pg.C_scaffold_23000206
      Vcar1000005961          N6-MTase                                 Vcar1000005961   1336   Volvox carteri              eukaryota>viridiplantae>chlorophyta   estExt_fgenesh5_synt.C_620028
      29423694                N6-MTase                                 -                258    Chlamydomonas reinhardtii   eukaryota>viridiplantae>chlorophyta   pol protein [Chlamydomonas reinhardtii].
      159468287               N6-MTase                                 -                171    Chlamydomonas reinhardtii   eukaryota>viridiplantae>chlorophyta   hypothetical protein CHLREDRAFT_171026 [Chlamydomonas reinhardtii].
      29423677                N6-MTase+pepsin+RT                       -                776    Chlamydomonas reinhardtii   eukaryota>viridiplantae>chlorophyta   reverse-transcriptase [Chlamydomonas reinhardtii].
      
      Back to Contents
    • Multiple sequence alignment of the Group2/clade 2/Chlorophyte type N6-MTases

      consensus/100%                                                                  .......................................................................D...s.................................s.h.....h.......................................................................................................h....s.......................................................................................................................................................................................
      consensus/95%                                                                   .........................................b.pP..b...h..................lDss.s.............s...hs..............suL.....W...........hhhNPPa...........................................ah.+h...................................lhl....s.sp...ha.........................h.....ch.b...............................................s................hhhh........................................................................
      consensus/90%                                                                   .................................p.......a.TP..hhp.l.....h...........pLDss.u.............s.p.as..............suL...p.W...........hahNPPa..........................s................ah.+h..p.........s......................lhLh...s.os.s.hap..h...................h.h.h.p.Rl.F...............................................sshs.............lhha........................................................................
      consensus/85%                                                                   ...............b.................p.......a.TP..hhp.l.....h..........hsLDss.u.............s.p.as..............sGL...p.W...........hahNPPY..........................sp......h........ah.+h.pp.........s.....................hlhLl...spT-.s.aapphh...................l.a.l.c.RlpF........................................s......sshss............lhha........................................................................
      consensus/80%                                                                   ...............h.................pp......W.TP..hhc.lp....h..........hsLDss.u..p......ps..sppaao.......p......sGL...p.W...........hahNPPY..........................up......l........ah.+hhpp.........s.....................hlhLl...scT-.s.aapphh...................l.a.l.+uRlpF........s.......................p.......s......ushss...........hllha........................................................................
      consensus/75%                                                                   ...............a.................ps......W.TP..hhc.Ls....a..........hsLDss.u.sp......ss.psppaaT.......pp.....DGL...ppW...........hahNPPY..........................up......l.......pWlpKhhpp.........s.....................hlhLl..ssRTD.spaapchh...................lpF.l.+GRl+F........s..................p....s.......s......ushss...........hllla........................................................................
      consensus/70%                                                                   ...............a.................ss......W.TPpphh-bLsp...a..........hsLDsC.ussp......ss.psp+aaT.......cp.....DGLp..ppW..........plahNPPY..........................uc.p....l.......cWlpKuhpp.......p.u.....................lVhLl..PuRTD.opaapchh...........b.......lpF.l.+GRL+F........u.............s....p....s.......s......APhss...........hllla........................................................................
      Annot                                                                                        Str(-1)                                                Str-1                  Str-2                                Str-4                                                                                   Str-5                                              Str-6                                                            Str-7                                                              
      FINAL                                                                           ---------------------------------------------HHHHHHHHH-H------------------------------H-HHHH----------------------------------E-EEEE--------------------------------------H-----HHHHHHHHHHH--H-H-------E-----------------EEEEEE--E-----HHHHHHHH-----------H-HH----HHH-H-H---EE----------------------------------------------------EEEEE----EEEEEE--------------------------------------------HHHH-HH-HH--------
      ALIGN                                                                           -----------------------------------------------HHHH--H-H-H----------------------------------EEE-------------------------------E-EEE--------------------------------------HH-----HHHHHHHHHHH--H---------------------------EEEEEE--E------HHHHHHH-----------H-HH----HHH-H-H---EE-----------------------------------------------------------EEEEEEEE-E-----------------------------------------------EE-E---------
      HMM                                                                             --------------EEE---------------------------HHHHHHHHHH-H-------------EEE-------------------EEEEE--------------------------------EEEEEE------------------------------------------HHHHHHHHHHH--H-H-H-----E-----------------EEEEEE--E---H-HHHHHHHH-----------H-H-----HHH-H-HH-HEEE---------------------------------------EE-----------------EEEEEEE--------------------------------------------HHHHH-HH-HHH--H----
      FREQ                                                                            ---------------------------------------------HHHHHHHHH---------------------------H---HH-HHHH-----------------------EEE--------E-EEE---------------------------------------H-----HHHHHHHHHHH--H-H-------E-----------------EEEEEE----------HHHHHH-----------H-HH----HHH-H-H---EE---------------------------------------------------EEEEEE----EEEEEE-----------------------------------------------E-EE-EE--------
      PSSM                                                                            ---------------------------------------------HHHHHHHHH-H-H--------------------------------------------------------------------E-EEEE--------------------------------------H-----HHHHHHHHHHH--H-H-------------------------EEEEEE--------HHHHHHHH-----------H-----------E----EE--------------------------------------------------------------EEEEE---------------------------------------------HHHH-HH-HH--------
      VOLCADRAFT_104970_Volvox_carteri_f_nagariensis_302839284                        GG-N-V-------RLFLSS-------------ESP----E-WFTPLSIIELVHE-V-FTPGG------INLDSC-SSAAANT---RV-GATAYYDM------ES-----DGLLECNAW-M------G-NVFVNPPF--------------------------GV-HGGASY-----QSLFFQRCATE--Y-M-AG-R-IH-----------------QAVLLL--KAAVG-YAWFDAIL-----------Q-WP----VCF-L-RQRLAFV-------------------R----R-Q--SGSQQQEGGPLTWGVRVANPHG---------SVVVYMGP-----------------------------------------DVQRF-VS-VFG--C--MG-----------\ParB-HTH fused
      VOLCADRAFT_100579_Volvox_carteri_f_nagariensis_302855367                        GG-N-V-------RLFLSS-------------ESP----E-WFTPLSIIELVRE-V-FTPGR------IDLDPC-SSAAANT---RV-GATVYYDM------ES-----DGLLECNAW-M------G-NVFVNPPF--------------------------GV-RGGASY-----QSLFFQRCATE--Y-T-AG-R-IH-----------------QAVLLL--KSAVG-YAWFDAIL-----------Q-WP----VCF-L-RQRLAFV-------------------R----G-Q--SGSQQQEGGPLTWGARVANPHG---------SVVVYMGP-----------------------------------------DVQRF-VS-VFG--C--MG-----------|
      VOLCADRAFT_104840_Volvox_carteri_f_nagariensis_302838546                        YG-G-L-------RVFMQS-------------DTC----E-WYTPDFILDLVRE-L-FTPGC------IDLDPC-SCAAANT---RV-RATSFYDE------AT-----DGLAEGSAW-R------G-N----PAF--------------------------GV-RRGQSL-----QGLFFGRCMRE--Y-Q-AG-N-VR-----------------QAVVLLILKAGIG-YSWFNDVL-----------N-WP----VCF-L-REHLSFV-------------------R----Q-V--GTS-----DELQWGARAQNPHG---------SVIVYMGP-----------------------------------------AVERF-AT-LFS--R--IG-----------|
      VOLCADRAFT_106473_Volvox_carteri_f_nagariensis_302846292                        AG-S-R-------PIFLQS-------------ASV----E-WYTPQCILDKVAE-M-FGPGG------IDLDPC-SSEAANT---RV-KAGRFFDV------AL-----DGLSEACRW-E------G-NVFVNPPF--------------------------GS-RGVLSM-----QNLFFERCVKE--Y-R-QG-A-VK-----------------QAVVLL--KAAVG-YKWFRAVL-----------E-WP----VCF-L-WERLAFV-------------------Q----P-Q--HTSVGEE-SELKWGSRVQNPHG---------SVVVYLGT-----------------------------------------NVDKF-VR-IFG--D--IG-----------|
      VOLCADRAFT_108225_Volvox_carteri_f_nagariensis_302854263                        AG-S-R-------PIFLQS-------------ASV----E-WYTPQCILDKVAE-M-FGPGG------IDLDPC-SSEAANT---RV-KAGRFFDV------AL-----DGLSEACRW-E------G-NVFVNPPF--------------------------GS-RGVLSM-----QNLFFERCVKE--Y-R-QG-A-VK-----------------QAVVLL--KAAVG-YKWFRAVL-----------E-WP----VCF-L-WERLAFV-------------------Q----P-Q--HTSVGEE-SELKWGSRVQNPHG---------SVVVYLGT-----------------------------------------NVDKF-VR-IFG--D--IG-----------|
      VOLCADRAFT_91459_Volvox_carteri_f_nagariensis_302838997                         YG-G-L-------PVITRS-------------DTC----E-WYTPDFILDLVRE-L-FTPGC------IDLDPC-SCAAANT---RV-RATSFYDE------AT-----DGLAEGNAW-R------G-NVFLNPAF--------------------------GV-RRGQSL-----QELFFGRCKRE--Y-Q-AG-N-VR-----------------QAVVLL--KAGIG-CSWFNDVL-----------N-WP----VCF-L-RERLSFV-------------------R----Q-V--GTS-----DELQWGARALNPHG---------SVIAYMGP-----------------------------------------AVERF-AT-LFS--R--IG-----------|
      VOLCADRAFT_104908_Volvox_carteri_f_nagariensis_302838722                        AG-S-R-------PIFLQS-------------ASV----E-WYTPQCILDKVAE-M-FGPGG------IDLDPC-SSEAANT---RV-KAGRFFDV------AL-----DGLSEACRW-E------G-NVFVNPPF--------------------------GS-RGVLSM-----QNLFFERCVKE--Y-R-QG-A-VK-----------------QAVVLL--KAAVG-YKWFRAVL-----------E-WP----VCF-L-WERLAFV-------------------Q----P-Q--HTSVGEE-SELKWGSRVQNPHG---------SVVVYLGT-----------------------------------------NVDKF-VR-IFG--D--IG-----------|
      VOLCADRAFT_118198_Volvox_carteri_f_nagariensis_302842945                        AR-D-W-------QDYVSP-------------DSE----Y-YATPPYILTAVRK-L-YG-GA------IDLDPA-SDEKANE---AV-QAAKFYTA------EE-----DGLSPELPW-S------G-KIFINPPS--------------------------GI-VGSEPL-----QGLFFNRAIRE--A-AVAP-T-IT-----------------ECVILL--KAAVG-QRWFGPVF-----------D-HP----HCW-L-AERTVKK-------------------GA---A-A--AAAAAGGGGNGGDGGGGKGPRG---------MVVVYVGR-----------------------------------------RVQDF-CN-AFG--E--LG-----------|
      VOLCADRAFT_106408_Volvox_carteri_f_nagariensis_302845993                        GG-N-V-------RLFLSS-------------ESL----E-WFTPLSIIELVRE-V-FTPGR------INLDPC-SSAAANT---RV-GATVYYDM------ES-----DGLLECNTW-M------G-NVFVNLPF--------------------------GV-HGGASY-----QSLFFQRCATE--Y-T-AG-R-IH-----------------QAVLLL--KAALCLFSGDASAL-----------T-NF----ICV-L-PCSGDTS----------------------------------------T-----FTNFI---------CSLAWSGDS---------------------------------------SALTTF-IC-SLA--C--SG-----------|
      BATDEDRAFT_85509_Batrachochytrium_dendrobatidis_JAM81_575474002                 TG-V-S-------FTELNN--------------------E-LYTPKAIIMAAKK-V-IEKKQ------FDLDPA-SCAFANTLHGDT-IANTIYTE------AE-----DGLQ--KIW-N------G-HVWLSPPS--------------------------GI-DEAGLI-R---MKKWFLAAESK--Y-L-AG-E-IV-----------------SCHILL--RVDMQ-NDWFLRAL-----------Y-YP----HCF-F-HERIQFS---------------------------------------T---------PT---------GREKLLTD-----------------S----------HMLVYMG-----TNTERF-CI-QFA--Q--LG-----------/
      Ot07g02900_Ostreococcus_tauri_308806169                                         VG-L-T-------PDWIVH-------------ATC-KVFE-MDLPTIEAPLIK---------------GLLDPC-TNSHLRP---NI-PAEKCYDK------KD-----DGLKMENSW-E------GYHVLVNPPY--------------------------EA-Q----V-----QWRFINRAINE--V-E-WE-R-CP-----------------GVILVC--RNSTD-TSYFQRLL-----------P-FP----RIH-L-RRTAVQF---------------------K----D-Y--S-------H------CPVGF---------GICVFCIV-----------------SP--T--NPK-QA----------EMYHRF-YD-EFH--Q--SG-----------\Fused to multiple chromatinic domains
      OT_ostta07g03040_Ostreococcus_tauri_693499233                                   VG-L-T-------PDWIVH-------------ATC-KVFE-MDLPTIEAPLIK---------------GLLDPC-TNSHLRP---NI-PAEKCYDK------KD-----DGLKMENSW-E------GYHVLVNPPY--------------------------EA-Q----V-----QWRFINRAINE--V-E-WE-R-CP-----------------GVILVC--RNSTD-TSYFQRLL-----------P-FP----RIH-L-RRTAVQF---------------------K----D-Y--S-------H------CPVGF---------GICVFCIV-----------------SP--T--NPK-QA----------EMYHRF-YD-EFH--Q--SG-----------|
      F751_3154_Auxenochlorella_protothecoides_760440511                              MG-L-T-------PDWIIE-------------TVCFGVFG-LQRPTAEVPFIK---------------GLLDPC-SNSRVAP---NI-PAEVLYDKHVGAGVED-----NGLALKNEW-K------GFYILLNPPF--------------------------HS-Q----M-----QWRFVNRAIDA--V-E-RG-E-VP-----------------GVLLVC--RNSTD-ANYFQRLR-----------P-YP----RVL-L-GRKCALF---------------------K----D-Y--D-------K------SPNGF---------GIAAVMLA-----------------K---R--E---RT----------DLYLRF-YD-AFE--R--FG-----------||
      MICPUCDRAFT_57353_Micromonas_pusilla_CCMP1545_303277723                         DG-L-T-------PDWIVD-------------AAC-RVFC-LNVPTVDEPIIR---------------GLLDPC-TNNKRRP---NI-PAEKTFDK------KQ-----DGLKQENEW-K------GYHVVLNPSY--------------------------ES-Q----V-----QWRFINRAINE--V-E-WG-F-CP-----------------GILLVC--RNSTD-TSYFQRLH-----------P-FP----RIF-L-RRDAIRF---------------------K----D-Y--D-------N------TPIGF---------GIAVFCMV-----------------APTVT--KSE-KL----------ETYRRF-YD-EFS--H--AG-----------|
      OSTLU_87805_Ostreococcus_lucimarinus_CCE9901_145349057                          VG-L-T-------PDWIVH-------------GAC-KVFG-LDLPTIEAPLIK---------------GLLDPC-TNSHLRP---NI-PAEKCYDK------KD-----DGLKMSNPW-A------GYHVLVNPPY--------------------------EA-Q----V-----QWRFINRAINE--V-E-WE-H-CP-----------------GIILVC--RNSTD-TSYFQRLL-----------P-FP----RIH-L-RRKAVQF---------------------K----D-Y--S-------N------CPIGF---------GICVFCIV-----------------SP--T--HPQ-QA----------DIYSRF-HD-EFH--A--AG-----------|
      VOLCADRAFT_89771_Volvox_carteri_f_nagariensis_302835622                         MG-L-T-------PDWIIM-------------AAAFKVFQ-LPRPTASAPYIR---------------GLLDPC-TNSKANP---NI-PAEKLYDK------SD-----DGLKLSNSW-S------GYHVILNPEY--------------------------TS-Q----T-----QWRFVNRAIDE--V-E-NG-C-VP-----------------AVLLLC--RNSTD-TAYFQRLR-----------P-YP----RVM-L-KRTSARF---------------------K----D-Y--E-------K------TPIGF---------GIAVFCIA-----------------P---R--GPG-RT----------TLYRRF-ID-AFG--D--WG-----------|
      Bathy04g03050_Bathycoccus_prasinos_612396523                                    EG-L-T-------PDWIIE-------------GCY-KVFG-LEKPTVEVPFVK---------------DLLDPC-TNSLSNP---NI-PAEVLYDK------SI-----NGLLSKNSW-A------NKFALLNPPY--------------------------ET-Q----T-----QWRFIHRAINE--V-E-WG-F-SK-----------------GILLVC--RNSTD-TNYFQKLL-----------P-FP----RVM-L-RRNAVQF---------------------K----D-Y--T-------S------SPIGF---------GIAVFCMV-----------------AP--G--NPEVQR----------ETYARF-YD-EFA--S--AG-----------|
      MICPUN_55980_Micromonas_sp_RCC299_255071987                                     DG-L-T-------PDWIID-------------AGC-RIFG-LNVPTVQEPIVK---------------GLLDPC-TNDKRDP---NI-PAEKTYDK------RQ-----DGLKQENPW-K------GYHVILNPSY--------------------------ES-Q----V-----QWRFINRAINE--V-E-WG-F-CP-----------------GILLVC--RNSTD-TSYFQRLL-----------P-FP----RIF-L-RRDAVRF---------------------K----D-Y--T-------H------TPIGF---------GIAVFCLV-----------------SPIVT--PEE-KM----------ATYSRF-YN-EFR--H--AG-----------|
      H632_c3034p0_Helicosporidium_sp_ATCC_50920_633905054                            FG-L-T-------PDWIIS-------------AACFDVLQ-LARPTPERPFIR---------------GLLDPC-SNSLLAP---NI-PAERLYDR------AA-----DGLSAANPW-R------GFHVLLNPPF--------------------------SA-Q----M-----QWRFVNRAIDA--V-E-ND-E-VP-----------------AVVLLC--RNSTD-AGFFQRLR-----------P-YP----RVL-L-RRKSAHF---------------------K----D-Y--E-------K------TPIGF---------GIAVFMLA-----------------K---E--S---RI----------HLYERF-LK-TFE--R--AG-----------|
      COCSUDRAFT_83615_Coccomyxa_subellipsoidea_C-169_545372676                       MG-L-T-------PDWIIQ-------------AASFVVFR-LPRPTPEQPFIA---------------GLLDPC-TNSMVAP---NI-PAQVLYDK------KM-----NGLLMSNSW-A------GFHVLLNPDY--------------------------SA-A----T-----QWRFVNRAIDE-----------VP-----------------AVLLVC--RNSTD-TAYFQRLR-----------P-YP----RVM-L-RRGNARF---------------------K----D-Y--D-------K------TPIGF---------GVAVFCIA-----------------K---A--P---AT----------ELYERF-FD-GFA--A--MG-----------|
      CHLNCDRAFT_138470_Chlorella_variabilis_552817679                                MG-L-T-------PDWIVE-------------AAAFRVFG-LERPTAERPYIA---------------GLLDPC-TNSKLAP---NI-PAESLYDK------QPRAAQDNGLKLSNSW-Q------GRYVLLNPDY--------------------------RA-Q----V-----QWRFVNRAIDE--V-E-NG-G-VP-----------------AVVLVC--RNSTD-TGYFQRLR-----------P-YP----RVL-L-RRLSARF---------------------K----D-Y--E-------K------TPIGF---------GIAVFCIA-----------------K---S--NVR-RV----------ELYSRF-YD-AFE--G--MG-----------/
      Npun_F2574_Nostoc_punctiforme_PCC_73102_186465327                               SH-S-E-PC----PPPPK--------------ESD----K-WYTPPNIQDLLTQ-V-L---GA-----VDLDPC-ADDG------KHIKAANHYTA------SD-----DGLA--QEW-Y------G-RVFMNPPY--------------------------SC------------PGKWMAKLQAE--I-E-AG-R-VT-----------------EAIALV--PAATD-TNWLHPLL-----------D-TQP---ICF-W-KGRIKF--------LD------------T----N----Y-------Q------PKLSARQ-------SHCLLYW------G----------------------------------TNAQKF-KQ-VFD--E--VG-----------\Prokaryotic homologs
      VR70_RS06925_Rhodospirillaceae_bacterium_BRH_c57_783127364                      MD-R-G-KH----GLFVN-LEP-G------Q-RNT----E-WYTPDWILNPLYE-A-MGNQP------FDLDPC-SPIK-GPDA-PV-WAKKHFTR------KD-----DGLS--QDW-H------G-RVWLNPPY--------------------------AS------------LADWIRKAADA--T-W-CR-N--MRNPPTEESAQREHPLCESVVALI--PARTH-TVYWQDYI-----------T-NHA-R-VLF-L-HGKIGF----RMP-TP-------E----G-LV-Q----A-------K------TQFPE---------GLAFVIW------------------------------------------GNHR-----PFT--Q------AL-------|
      VR70_RS05265_Rhodospirillaceae_bacterium_BRH_c57_783126461                      MD-R-G-KH----GLFVN-LEP-G------Q-RNT----E-WYTPKWILEPLYK-A-MGNQP------FDLDPC-SPIK-GPNS-PV-WAKKHFTK------DD-----DGLN--QNW-H------G-RVWLNPPY--------------------------AS------------LADWIRKAADA--T-W-CS-S--MPNPPTEEAGLREQPLCESVVALI--PARTH-TVYWQDYI-----------T-NHA-R-VIF-I-HGKIGF----LMP-TP-------E----G-LV-Q----A-------K------TQFPE---------GLAFVIW------------------------------------------GNHR-----PFT--D------AL-------|
      Riv7116_4895_Rivularia_sp_PCC_7116_427373349                                    RG-Q-G-SL----SNKSL--------------SSD----E-WYTPPHISDLVTQ-V-L---GQ-----ITLDPC-ADEG------KHIRAAQHYTV------LD-----DGLI--QEW-N------G-RIFMNPPY--------------------------SA------------PSVWIKKLQAE--F-E-SG-R-VT-----------------EAIALV--PAATD-TRWLSPLL-----------K-SQP---VCF-W-TGRIKF--------LD------------M----S----Y-------K------PRLSARQ-------SHCLVYW------G-G--NWE-------------------------------RF-KE-VFD--P--YG-----------|
      OSG_eHP4_00155_Environmental_Halophage_eHP-4_383396772                          EH-T-V-VD----AATKQ--------------ETD----E-WASPRELVEPLNT-A-V---NG-----FDLDPC-SGAE------VSPFADKTYTE------SD-----NGLS--QPW-S------G-IVWVNPPY--------------------------SA------------MDTWTEKAIAE--I----E-N-TG-----------------TICYLC--KGDSS-TEWWQTAA-----------Q-EAT-V-ICA-I-DHRLQF--------GD------------G----D----N-------S------APF-----------ASHIVVF------G-R--ASD-------------------------------SL-IL-ELQ--N--HG-----------|
      ACAty_RS09645_Acidithiobacillus_caldus_491011364                                -----A-------VADPA--------------WSD----E-WYTPDYILDAARA-V-L-G-D------IDLDPA-SCAA-AN-E-AV-QAKRFFAK------EQ-----DGLQ--QAW-R------G-KVWLNPPY--------------------------SY-P--Q-------ILDFCEALVQR--Y---AD-G--S--------------VT-EAIVLV--NSGTE-TQWGQMLL-----------S-HGS-A-ACF-P-ASRLKF--------RR----P--E----G----K----S-------G------LPS-----------QGQMLVY--F---G-P--HVD-------------------------------RF-KT-VFL--S--IG-----------|
      ATC_RS06425_Acidithiobacillus_caldus_503768726                                  -----A-------VADPA--------------WSD----E-WYTPDYILDAARA-V-L-G-D------IDLDPA-SCAA-AN-E-VV-RARQFFDK------TQ-----DGLQ--QDW-Q------G-TVWLNPPY--------------------------SY-P--A-------ILDFCEALVQR--Y---AD-G--S--------------VT-EAIVLV--NSGTE-TQWGQMLL-----------S-HGS-A-ACF-P-ASRLKF--------RR----P--E----G----K----S-------G------LPS-----------QGQMLVY--F---G-P--RAD-------------------------------RF-KT-VFL--S--IG-----------|
      HMPREF0731_4170_Roseomonas_cervicalis_ATCC_49957_296263068                      -----T-------AAFSS--------------AYE----A-WATPPDLLERLYA-A-V-G-S------IDLDPC-SPGK-LR-S-RV-KAPRHFTE------RD-----DGLA--QEW-S------G-KVYMNPPY--------------------------GR-T----------IGAWTTKARVE--V---TA-G--R--------------AE-CVVGLV--PARTD-TRWWHADV-----------A-GHA-H-VWL-L-KGRLAF--------GD------------G----S----T-------P------APF-----------PSALLLW------G-G--NAP-----------TIA-----------------EM-SA-SFP------------------|
      IL54_RS15525_Sphingobium_sp_ba1_739620247                                       ---T-P------SLWLPS---V----------AAD-RN-R-RFTPIEFLRAIEQ-V-W-G-M------IDLDPC-GHPD----S-PV-NARRRISL------EE-GG--DGLR--DDW-S---G--D-VVYLNPPF--------------------------SE------------MVTWLKRADQM----W-IE--------------MR---VQ-KIIALV--PARTD-IGYFHDRI-----------A-QVC-D-VGL-M-RGRLQF--------GQ----P-----I-G----K-K--D-------D--RSR-ATF-----------ALMVCLW------G-A--TAE-------------------------------EI--A-AFD-----AI-----------|
      G407_RS29165_Salinarimonas_rosea_759845022                                      ------------VANWGS---G----------SRD----E-RFTPEDVVRRIEA-V-L-D-G------IELDPC-GHPK----S-PV-RARRFIHR------EE-----DGLK--QPW-N---A--R-TVFVNPPY--------------------------SE------------TGEFVRKAAHE----W-RS-K--R--------------AQ-TVLMLL--PVKTH-FAWHQDHI-----------Q-GIA-D-VFF-L-RGRITF----ERL-GM------------P----A----T-------P------APF-----------PTMLVLY------G----GTD---------------VMIA------------RI-MA-LFE-----CG-----------|
      AFERRI_560020_Acidithiobacillus_ferrivorans_669953369                           IN-P-R------AALWTA--------------KDN----T-WMTPPSLLEQLYP-L-LPSKT------FDIDPC-SPCV-GP-AAPV-RAYVHYTE------RH-----DGLR--QSW-G-K-G--T-YCYVNPPF--------------------------SH------------LRKWIHKALAE--T---DN-G-------V------------VSILLC--PARVD-SIWWHALV-----------A-NRI-P-VVM-L-RGRLHF--------GG------------G----D----N-------R--QQK-APF-----------ASALLII------G----GSA-------------------------------QL-PK-RVA--D--AT-----------|
      SYN7509_RS0224085_Synechocystis_sp_PCC_7509_497315962                           LQ-----------DKFSK-SSK-T------PTRKK-AP-Q-LYTPPEIIDLVRV-V-M-G-E------IDLDPA-SDDI-AQ-Q-WV-QARNYYTL------AL-----DGLF--HPW---F----G-RVWLHPPA--------------------------DG-K----------TAKWTSKLLNE----Y-SS-G--R--------------VT-EAVLLV--RPSAG-SKWFQKLT-----------R-LFP---VCF-P-DERLKF----L---DD------------Q----E-IP-Q-------T------QPK-----------NGNAIFY--L---G-Q--NRQ-------------------------------QF-GQ-VFG--T--IG-----------|
      GGI1_15033_Acidithiobacillus_sp_GGI-221_339835072                               IK-K-P-RC-A--PCLSS--------------GKD----D-WTTPSHLLNLILN-V-LGRKG------FDLDPC-SPSL-KG---PV-PASRYYTR------RE-----DGLK--QAW-E------G-LVFVNPPY--------------------------SQ------------MRHWSSKLVDA--A---AC-G-------V------------QIIALV--PSRTG-TQWWHQVL-----------D-GGA-R-PIY-L-RGRLRF--------GE------------G------I--G-------Q------APF-----------DSTILLF-----------NFS-------------------------------DF----LAE--Q--MA-----------|
      TREAZ_0592_Treponema_azotonutricium_ZAS-9_333734957                             -----H-------VAHNS--------------GNN----E-WYTPAEYIEAARK-A-M-G-G------IDLDPA-SCEA-AN-R-TV-KAKKIHTI------DD-----DGLG--HPW-E------G-RVWLNPPY--------------------------AR-E--L-------IGKFIEKLKTH--V---CR-G--E--------------VT-EAIVLV--NNATE-TAWFGALV-----------S-FSN-A-IVF-P-ASRVKF--------NG----P--D----G----K----M-------G------SPL-----------QGQAVLY--A---G-P--NSE-------------------------------KF-LD-AYK--S--FG-----------|
      Metlim_0419_Methanoplanus_limicola_490177569                                    -----K-------GIRGV--------------PKN----E-WTTPPEIVEASLE-V-L-G-V------IDIDPC-AESK-DC-P-NI-PARAHYTI------WD-----NGMS--VHW-E------G-RVFLNPPY--------------------------GN-S----------LARWIAYLRDE--Y---RL-G--Y--------------VR-EAIVLV--PARTD-SRWFH--Y-----------M-GSN-F-IWCGV-KGRLRF--------SE------------I----D----G-------P------APF-----------PSAIFYI------G-K--NRK-----------RFV-----------------EI-FS-RFG------------------|
      Riv7116_1753_Rivularia_sp_PCC_7116_427370342                                    KT-S-S-KE----TERTN--------------KTD----C-WYSPPHIVELVIQ-V-L---GE-----INLDPC-ADDG------RHIRATKHYTF------DD-----NGLE--QSW-C------G-KVYMNPPY--------------------------SH------------PGAWMKKLELE--F-E-TG-N-VD-----------------EAIALV--PAATD-TNWLSPVL-----------K-TQP---VCF-W-KGRIKF--------LG------------Q----D----Y-------Q------PKLSARQ-------SHVLVYW------G-N--NWQ-------------------------------KF-RE-VFE--D--YG-----------|
      SPUTW3181_RS15120_Shewanella_sp_W3-18-1_500114172                               IN-Q-S------NAEQGF---------------------E-YYTPAPWPQLASQ-L-M-G-G------IDLDPA-SNEI-AN-A-SI-KAKSIFTK------EV-----DGLS--KTW-H---G----TVWMNHPFHRGEQPC---SSKCKKKACIKRGHHIDK-P----I-PG--NGDWINKVISE--Y---ES-G--N--------------IK-EAVIIT--FCNSS-EGWFLPLL-----------K-YA----QCF-P-NGRVHY----IKE-DG------------S---------K-------A------DSCT----------KGSVITY--I---G-K--NVA-------------------------------EF-AR-LYG--E--HG-----------|
      SHEWPOL2_RS06540_Shewanella_sp_POL2_739569226                                   IN-Q-S------NAEQGF---------------------E-YYTPEPWPQLASQ-L-M-G-G------IDLDPA-SNDI-AN-A-SI-NAKTIFTK------EI-----DGLS--QRW-Y---G----TVWMNHPFHRGEKPC---KAKCNKKACIKRGHHIDK-P----I-PG--NGDWINKIIKE--Y---ES-G--H--------------IK-EAVIIT--FCNSS-ETWFLPLL-----------K-FP----QCF-P-HGRVHY----KKA-DG------------S---------K-------A------DSCT----------KGSVITY--L---G-K--NVA-------------------------------EF-SR-LFG--A--HG-----------|
      VII_RS00060_Vibrio_mimicus_446980525                                            IN-Q-T--------------------------SGD--V-E-YYTPLEWVEPARQ-V-M-G-S------IELDPA-SSDI-AN-Q-TV-KAQRIFTI------DD-----DGLS--RPW-T---AQ---TLWMNHPFHRGEKACPADHSKCKKITCLKRGFHIDK-D----I-PS--NNDWINKFIAE--Y---EA-G--H--------------FK-EAICIT--FGNTS-EAWFRKLL-----------P-HL----QCF-P-NGRVHY----RKP-DG------------T---------I-------N------RNVT----------KGSVLTY--L---G-D--RPK-------------------------------AF-KK-VFS--R--LG-----------|
      C475_14433_Halosimplex_carlsbadense_493940376                                   TS-E-Y-QQ----VHWSS--------------ESD----E-WATPPSLLRPLDD-A-V---DG-----FDLDPC-SGAE------ERSIAAETYTE------AD-----DGLA--QRW-H------G-VVWCNPPY--------------------------SD-V----A-------DWIEKARFE--G-A-RD-A-VE-----------------LVIVLV--PARTS-TQWFHKFA-----------S-HAA-A-VCF-I-EGRLSF--------GD------------A----D----N-------S------APF-----------PSMLLAF------G-E--PTD-------------------------------AV-ID-AFD--D--RG-----------|
      VPUCM_1151_Vibrio_parahaemolyticus_UCM-V493_584469889                           ---L-D-------VMFSS-ANS-G------DKSKD----K-WQTPPEIFAQLND-R-F---G------FTLDAA-AEPE------TA-LCEKYFTE------ED-----DALK--QDW-S---G--H-VVFCNPPY--------------------------SK------L-----R-VFAKKAYEE--S---LK-G--T-----------------TVVMLV--PARTD-TQACHDYL-----------A-N--GE-MYF-I-RGRLKF-----LKVGE------------L----Q----D-------A------APF-----------PSVVCVL------G-P--GVE--R-------------------------------------------------------|
      C478_07432_Haloterrigena_thermotolerans_493699302                               --------------VYEE--------------GDD----K-HDTPVEFVAPLIE-A-V---GG-----FDLDP--SASQ------SSDLAERNVTK------DE-----DGLS--TPW-H------G-DVWLNPPY--------------------------SG-V----S-------DWLEYGRDE--Y-Y-RG-A-VD-----------------SIIALV--FARTS-TQWFHNHA-----------T-TAD-L-ACF-V-EGRLSF--------AG------------S----D----H-------S------APA-----------PSVVLVW------G-D--AAG---------------------------NP--DV-VE-YLD--S--QG-----------|
      HLRTI_001342_Halorhabdus_tiamatea_495800631                                     --------------VYEE--------------GDD----D-HDTPGEFVEPLID-A-V---SG-----FDLDP--SAST------SSNLAERNVTK------DE-----DGLS--IPW-H------G-DVWLNPPY--------------------------ST-V----S-------DWLEYARNE--Y-H-RG-A-VD-----------------SIISLV--FARTS-TQWFHNHA-----------T-TAD-L-ACF-V-EGRLSF--------GE------------A----A----N-------S------APA-----------PSLVLVW------G-D--AAE---------------------------NT--DV-VE-YLS--S--QG-----------|
      Metfor_2481_Methanoregula_formicica_505099336                                   -----A-------ALLSH--------------AST----E-HYTPQYILDAVIA-C-M-E-A------IDLDPC-SNSR-KI-P-NV-PAARHYTV------QD-----NGLL--RPW-V------G-RVFLNPPF--------------------------GY-E----V-----E-DWFSKLFLE--T---LE-G--R--------------TT-EAIILW--KSATE-TSAWKTLT-----------R-LSC-R-VCF-P-SARVRF--------GG----P--G----S--D-E-R--K-------S------PTF-----------SPALFYV------G-P--RPE-------------------------------RF-EE-AFR--H--IG-----------|
      HMPREF0731_RS06220_Roseomonas_cervicalis_750330482                              -----T-------AAFSS--------------AYE----A-WATPPDLLERLYA-A-V-G-S------IDLDPC-SPGK-LR-S-RV-KAPRHFTE------RD-----DGLA--QEW-S------G-KVYMNPPY--------------------------GR-T----I-----G-AWTTKARVE--V---TA-G--R--------------AE-CVVGLV--PARTD-TRWWHADV-----------A-GHA-H-VWL-L-KGRLAF--------GD------------G----S----T-------P------APF-----------PSALLLW------G-G--NAP----------------------------------------------------------|
      LEPBO_RS38120_Leptolyngbya_boryana_738087862                                    -------------ERMTS---A----------KTD----E-HYTPPELLELVYE-C-FSPLG------IELDPC-SNAH-GEEA-NV-KASQYFTI------ED-----DGLA--QEW-N---A--K-TVYINPPY--------------------------SD------V-----A-AWVDKVVTE----Q-DR-N--N--------------IG-DVLLLV--KADTS-TQWFAQIW-----------E-SAT-A-VCF-L-KKRVRF-----IN-AE------------S----E-G--N-------A------APF-----------ASAIAYF------G-S--EID-------------------------------RF-YY-AFE--S--AG-----------|
      AGC34828.1_Escherichia_phage_PBECO_4_441462540                                  ---H-S-------VHFST--------------GKD----N-WTTPKDFFEDLDE-L-W---E------FTLDAA-CVKE------TA-LCDNFFTP------ED-----DSLS--QDW-G---N--N-IVWLNPPY--------------------------SD------L-----K-TWLSKAVDA--Y---NN-G--A-----------------TVVILV--PSRTD-TIAFQDYA-----------A-KICDC-ICF-I-KGRLRF--GIPEEPDK------------K----T----D-------S------APF-----------PSCLIVL------D-K--YLT--T-------------------------------------------------------|
      PBI_121Q_417_Escherichia_phage_121Q_712914615                                   ---H-S-------VHFST--------------GKD----N-WTTPKDFFEDLDE-L-W---G------FTLDAA-CVDE------TA-LCDNFFTP------ED-----DSLS--QDW-G---N--N-IVWLNPPY--------------------------SD------L-----K-TWLSKAVDA--Y---NN-G--A-----------------TVVILV--PSRTD-TIAFQDYA-----------A-KICDC-ICF-I-KGRLRF--GIPEEPDK------------K----T----D-------S------APF-----------PSCLIVL------D-K--YLT--T-------------------------------------------------------|
      F403_gp088_Enterobacteria_phage_vB_KleM-RaK2_422937337                          ---H-A-------VHFST--------------RKN----DLWTTPKPLFDKLNA-L-W---N------FTVDVA-CSNE------TA-LCLKHYTP------ED-----DGLS--QDW-S---N--E-TFWLNPPY--------------------------SD------L-----S-PWLSKSVED--Y---NR-G--A-----------------TGLILV--PARTD-TRAFQNFA-----------S-PFCDA-MCF-I-KGRLKF--GNPLKPND------------K----L----T-------S------APF-----------PSCIIVL------D-K--NLT--Q-------------------------------------------------------|
      DX12_RS0110285_Vibrio_parahaemolyticus_646896396                                ---N-K-------LFFSS-ARN-G------SSKQD----K-WQTPPAVFEKLNE-E-F---N------FTLDAT-AEPE------TA-LCDHYFTI------DD-----DALT--QDW-G---N--Q-TVYCNPPY--------------------------SQ------L-----K-DFAKKAQEE--A---KK-G--A-----------------TVVMLV--PARTD-TKAFHDYL-----------S-H--GE-VRL-I-KGRLKF-----LMEGK------------E----Q----D-------A------APF-----------PSMVCVM------G-K--DRE--Q-------------------------------------------------------|
      ABSDF2497_Acinetobacter_baumannii_SDF_169152788                                 MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---K------FDLDVC-ALPD------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAADT--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLV-------------------------------DV----NWE--K--SA-----------|
      A148_RS0111015_Vibrio_splendidus_695353200                                      ---N-K-------LFFSS-ART-G------NPKRD----K-WQTPPAVFKKLNE-E-F---H------FTLDAT-AEPE------TA-LCDHYFTM------DD-----DALT--QDW-S---N--Q-TVYCNPPY--------------------------SQ------L-----K-DFAKKAQEE--A---KK-G--A-----------------TVVMLV--PARTD-TKAFHDHL-----------S-H--GE-VRL-I-KGRLKF-----LQDGE------------E----Q----D-------A------APF-----------PSMVCV-------------------------------------------------------------------------|
      OAC_RS0107480_Vibrio_cyclitrophicus_515155813                                   ---N-K-------LFFSS-ART-G------NPKRD----K-WQTPPAVFKKLNE-E-F---H------FTLDAT-AEPE------TA-LCDHYFTM------DD-----DALT--QDW-S---N--Q-TVYCNPPY--------------------------SQ------L-----K-DFAKKAQEE--A---NK-G--A-----------------TVVMLV--PARTD-TKAFHDHL-----------S-H--GE-VRL-I-KGRLKF-----LQDGE------------E----Q----D-------A------APF-----------PSMVCVM------G-N--DVE--Q-------------------------------------------------------|
      I59_RS11655_Curtobacterium_sp_B8_551283918                                      AG-R-G------GWTHEA-P-S----------ATI----D-WYTPDYIFQALAV-T------------FDLDPC-SPGS-SR-S-NV-PAGAVYTL------AD-----DGLS--SPW-R------G-LCWVNPPY--------------------------D--D----T-----R-TWLQRLADH------G-----------------------EGIALV--FARTD-TKWFHEAA-----------K-SAD-L-VCF-T-SGRIKF-----ID-GR-------T----M----Q----P-------G------GSPGA-GS--------VFLAW------G----ATA-------------------------------AA----ALG------------------|
      Q354_RS0120435_Marinobacterium_jannaschii_654380487                             FG-E---------GANNA-NGR----------KSV----E-WYTPKWIFDELNV-V------------FDLDPS-SPHD-HE-S-FV-PADEKYTI------FD-----DGLS--KPW-H------G-RVWLNPPY--------------------------GR-D----T-----P-FWMNRMIDH------G-----------------------NGIALV--FSRTD-AKWFQDAM-----------K-AAT-A-VLF-V-AGRIEF-----VP-GN-------E----N----K----H-------K----K-SRSGA-GT--------ALFAF------G----EDN-------------------------------AR----VLR------------------|
      D478_26539_Brevibacillus_agri_BAB-2500_432181416                                IN-K---------AMF--------------TSERE----E-WETPQDFFEKLNK-E-F---G------FQLDVC-ALPT------NA-KCERYFTP------DE-----DGLK--QEW-T------G-VCWMNPPY--------------------------GR-E----I-----G-KWVKKAYES--A---KQ-G--A-----------------TVVCLL--PARTD-VKWWHDYC-----------M-KG--E-IRL-V-RGRMKF--------VG------------A----D----N-------M------APF-----------PNAVVIF------S-P--ASA-------------------------------GC----SYK--A--ID-----------|
      J479_2646_Acinetobacter_baumannii_691127129                                     MA-K-L-------GLFGN-----A------EGRTD----V-WATPQTLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----T-LWIDKAVQT--A---NQ-G--H-----------------TVVGLL--PARTD-VTWWQEHV-----------M-NR--E-IHY-I-KGRLKF--------GG------------C----K----H-------N------APF-----------GCAVVVF------R-P--SLK-------------------------------DV----QWG--A--Q------------|
      FL80_RS15355_Acinetobacter_baumannii_690990657                                  MA-K-L-------GLFGN-----A------EGRTD----V-WATPQTLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWISKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCAVVVF------R-P--SLK-------------------------------DV----QWG--A--Q------------|
      C462_04300_Halorubrum_arcis_495269178                                           MS-L-F-SH----EFHED--------------SSD----E-FGTPAEFHRPLAD-A-V---GG-----FDLDPA-SGAE------SQPLASTRFTK------ED-----DGLS--KEW-F------G-TVWLNPPF--------------------------SE-K----T-------RWVRKARAE--V-A-EG-N-VE-----------------TAVVLL--PVDTS-TKLFHDHV-----------T-DAT-A-ICF-V-EGRLSF--------DG------------G----D----R-------N------PNF-----------GTLLAVF------G-E--ASD-------------------------------DL-LD-ALD--R--KG-----------|
      AEQU_RS00240_Coriobacteriaceae_496663041                                        AG---G-------AAFSS--------------ARD----D-WETPAWLFSALDS-E-F---H------FTLDAA-SSDA------NA-KCERHLTK------RD-----DGLA--ADW-----G--GERVWVNPPY--------------------------GR-G----V-----G-AWARKAAIE--G-A-KP-R--T-----------------TVALLV--AARTD-TEWFLRYI-----------L-GHA-E-IRL-V-RGRIRF----ELA-GV------------A----Q----G-------P------APF-----------PSMVAVF------G-E--GAA----------------------------------PG-KVS--SIANA--AARGGKGAS|
      J689_1368_Acinetobacter_calcoaceticus/baumannii_complex_645913983               MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      J635_1953_Acinetobacter_baumannii_690997976                                     MA-K-L-------GLFGN-----A------EGRTD----V-WATPQKLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-A------G-TCWMNPPY--------------------------GR-E----I-----V-DWISKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----KWG--D--Q------------|
      LJ44_RS16470_Acinetobacter_baumannii_447017697                                  MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      J635_2258_Acinetobacter_baumannii_690998264                                     TA-K-L-------GLFGN-----A------EGRTD----V-WATPQKLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QDW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVQT--A---NQ-G--H-----------------TVVGLL--PTRTD-VAWWQEHV-----------M-NR--E-IHY-I-KGRLKF--------GG------------C----K----H-------N------APF-----------GCAVVVF------R-P--SLK-------------------------------DV----QWG--T--Q------------|
      TT45_RS11045_Acinetobacter_baumannii_758882462                                  MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWISKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWA--M--AV-----------|
      J660_0735_Acinetobacter_calcoaceticus/baumannii_complex_493629922               MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GC-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      J689_1349_Acinetobacter_baumannii_691068978                                     MA-Q-R-------KLFGL-----A------ENRTD----V-WATPQDFFDKLNA-V-F---N------FDLDVC-ALPE------NA-KCERFFSP------EQ-----NGLK--QEW-I------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QW-------------------|
      W9I_03525_Acinetobacter_nosocomialis_493629840                                  MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      F985_01871_Acinetobacter_sp_NIPH_973_490838153                                  MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPG------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      J523_3197_Acinetobacter_baumannii_691027491                                     MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------|
      K041_RS17240_Acinetobacter_baumannii_690981431                                  MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------|
      J517_3010_Acinetobacter_baumannii_691065210                                     MT-K-N-------KLFGL-----A------EERTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----NWE--K--SA-----------|
      J595_RS19805_Acinetobacter_baumannii_691047241                                  MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      J697_3983_Acinetobacter_baumannii_691093639                                     MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-Q--SLI-------------------------------DV----SWE--K--SA-----------|
      BN776_01939_Clostridium_sp_CAG:768_548223533                                    MN-S-T-EF-KK-YNFMQ---E----------RSD------YLTPPEMIQEIFQ-E-LNLLGIYSGDKFDLDTC-CSQK----N-IP-ACNHYIEG------EN-----DGLS--LDW-H------N-LNYCNPPY--------------------------KT------C-----D-KWVKKAFAE----F-QN-G--K-----------------ISVLLI--PARTE-TKYWQEYILKNGFAIRENVY-------VRF-L-RKGLCF-----LN-PE------------T----N----E-------K----M-GVFKN---------ALAIVIF------D----GSK--N-K--------------------------EV-------------------------|
      J660_1691_Acinetobacter_baumannii_691157882                                     MS-K-N-------KLFGL-----A------EDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----NWE--K--SA-----------|
      RQ87_RS18135_Acinetobacter_baumannii_447010248                                  MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------|
      J596_3741_Acinetobacter_baumannii_691117543                                     MA-K-L-------GLYGN-----A------EGKTD----V-WATPQNLFDALDQ-I-F---N------FDLDVC-ALPE------NA-KCERYFTP------EL-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NN-G--H-----------------TVVGLL--PVRTD-VVWWQEHI-----------L-HR--E-IHY-I-KGRLKF--------GG------------S----K----H-------N------APF-----------GCALVVF------R-P--SLK-------------------------------DV----QSD--K--SI-----------|
      ACINWC323_RS01110_Acinetobacter_sp_WC-323_696306260                             MA-K-S-------KLFGL-----A------EDRTD----V-WATPQDFFDKLNA-I-F---D------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLS--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NA-G--Y-----------------TVVALL--PARTD-VGWWQSHC-----------L-NR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCAVVVF------R-P--SLN-------------------------------DV----RWE--Q--SQ-----------|
      K035_3853_Acinetobacter_baumannii_691039522                                     MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------|
      F931_01759_Acinetobacter_pittii_507070967                                       MA-K-L-------GLYGN-----A------EGKTD----V-WATPQNLFDAIDH-I-F---N------FDLDVC-ALPE------NA-KCDRYFTP------EL-----DGLK--QEW-V------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NN-G--H-----------------TVVGLL--PVRTD-VVWWQEHI-----------L-HR--E-IHY-I-KGRLKF--------GG------------C----K----H-------N------APF-----------GCALVVF------R-P--SLK-------------------------------DV----RWE--S--SI-----------|
      FL80_RS05360_Acinetobacter_baumannii_690988986                                  MT-K-N-------KLFGL-----A------EERTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---NK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----NWE--K--SA-----------|
      ABBL099_02355_Acinetobacter_baumannii_690996743                                 MT-K-N-------KLFGL-----A------EERTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---NK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------|
      SALWKB2_RS02465_Snodgrassella_alvi_644547413                                    MN-K-G----FTHE-KNA-S-N----------NSD----E-WYTPEWMFRILNL-D------------FDLDPA-APKG-GL-P-WI-PAQQFYCK------ED-----DGLS--KPW-H------G-LVWLNPPY--------------------------GK-E----T-----G-KWLQRMHEH------R-----------------------QGIALV--FSRTD-SRWFHDYA-----------V-KAD-A-ILY-L-KGRVRF-----V--NA-------D----G----D----P-------G----K-SSLGC---------GSVLIGW------G----EVA-------------------------------VS----ALQ------------------|
      MZ39_RS07370_Pseudomonas_fluorescens_734904253                                  MG-A-R-------E-APP-K-H----------KSV----E-WYTPAWIFERLGL-Q------------FDLDPS-SPHD-YV-T-AV-PAKTKYTI------FD-----DGLS--KEW-S------G-RVWMNPPY--------------------------GP-E----T-----S-FWMRRLIAH------G-----------------------DGIALV--FSRTD-AEWFQDAM-----------A-NAS-A-TLL-V-KGRIAF-----VP-GH-------E----N----S----H-------K----K-GRSGA---------GSALFAF------G----DEC-------------------------------AI----ALQ------------------|
      H621_RS26760_Pseudomonas_vranovensis_739119776                                  MG-A-R-------P-EQP-K-H----------KSV----E-WYTPAWIFERLGV-E------------FDLDPS-SPHD-YV-T-PV-PAKRKYTV------FD-----DGLS--KDW-A------G-RVWMNPPY--------------------------GP-D----T-----G-FWMRRLIAH------G-----------------------NGIALV--FSRTD-AEWFQEAM-----------S-SAS-A-TLL-I-KGRIAF-----IP-GH-------E----N----S----H-------K----K-GRSGA---------GSAMFAF------G----DEC-------------------------------AI----ALQ------------------|
      J532_4398_Acinetobacter_baumannii_691154760                                     MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------|
      BTS2_0497_Bacillus_sp_TS-2_591276954                                            IN-Q---------AMF--------------SSSTD----K-WSTPQSFYDKLNQ-E-F---Q------FDIDVC-ATDS------DK-KCERYFSP------EQ-----DGLK--QEW-T------G-ICWMNPPY--------------------------GR-G----I-----G-PWIQKAYES--S---QQ-G--A-----------------TVVCLL--PSRTD-TKWWHEYC-----------M-KG--E-IRF-I-KGRLKF--------GD------------S----K----N-------S------APF-----------PSVVVIF------R-P--KVV-------------------------------SM-------------------------|
      HMPREF1315_RS07015_Bifidobacterium_longum_494116860                             AG---A-------AAMTS--------------NKD----D-WETPQSLFDQLDE-E-F---H------FILDAA-SSDQ------NA-KCEHHYTA------EN-----SGLE--HSW-----E--GETVFCNPPY--------------------------GR-N----I-----G-DWIRKASQE--A-S-KP-D--T-----------------LVVLLV--PARTD-TRWFQNHI-----------L-HRA-E-VRF-L-PGRLKY----EVN-GQ------------A----G---------------------------------EAAPSFW------R-E--GTP----------------------------------SF----------------------|
      B7017_p0034_Bifidobacterium_breve_704484626                                     AG---A-------AAMTS--------------NKD----D-WETPQALFDQLDK-E-F---H------FTLDAA-SNDQ------NA-KCEHHYTA------EN-----SGLE--HSW-----G--GETVFCNPPY--------------------------GR-N----I-----G-DWIRKASQE--A-S-KP-D--T-----------------LVVLLV--PARTD-TRWFQNYI-----------L-HRA-E-VRF-L-PGRLKY----EVD-GQ------------A----G----E-------A------APF-----------PSMVVIM------R-T--GER----------------------------------------------------------|
      BBRE_RS02915_Bifidobacterium_breve_518557238                                    AG---G-------AAYMS--------------NRM----N-WETPQELFDQLDA-E-F---H------FTLDAA-SSAT------NH-KCQKYYTA------ED-----SAFD--HEW-----G--GETVFCNPPY--------------------------GK-A----I-----A-EWVRKCSAE--A-S-RK-D--T-----------------LVVMLL--PARTD-TRWFQQFI-----------L-NRA-E-VRF-L-KGRLRF----ETN-GI------------P----G----G-------P------APF-----------PSMIVVM------R-T--GER----------------------------------------------------------|
      ELEN_RS13090_Eggerthella_lenta_506241510                                        ---G-G-------VAFSS--------------ERH----Y-WETPQDLFDTLDN-E-F---H------FTLDPA-STDE------NA-KCEKHYTI------ED-----DGLC--QSW-A---G--E-RVFCNPPY--------------------------GR-E----L-----S-KWVKKAHAEVAL---NP-G--T-----------------VVVMLI--PARTD-TTYFHDYI-----------Y-HKA-E-VRF-I-RGRLRFCIQ-----GK------------A----K----D-------A------APF-----------PSMVVVFR-----------------------------------------------------------------------|
      CLOSCI_00567_[Clostridium]_scindens_ATCC_35704_167664126                        LN-K-A--------LFSS--------------AKE----D-WATPQDFFDELNK-E-F---H------FDLDPC-ADAE------NA-KCKEFFTK------EQ-----NGLL--QDW-G---G--R-CVFCNPPY--------------------------GRTS----T-----G-EWIKKCYEE-AQ---KP-G--T-----------------VVVALI--PARTD-TRFFHDYI-----------Y-HKA-E-IRF-I-KGRLHF--------GG------------C----K----D-------A------APF-----------PSMVVVF---RKGK----ENEEEK-KTGCTAAGHT-EEKAAEKDDGSENGVDGI-------------------------|
      SAG0375_00225_Streptococcus_agalactiae_GB00984_527786367                        VQ---K-------SLLSS--------------DKD----Y-WETPQTFFKKLNN-E-F---D------FDLDVA-SSHD------NA-KCKNHFTV------VE-----DGLS--QDW-----T--G-NVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE--SLK-PY-N--N-----------------VIVLLI--PARTD-TKYWHDYI-----------F-GKA-KDIRY-L-KGRLKF----TIN-GK------------E----N----Y-------P------APF-----------PSAVIIF------------------------------------------------------------------------|
      GSM_RS11735_Lactobacillus_mali_497764146                                        LN---K-------SMFTS--------------DKQ----Y-WETPRDFFNKINK-V-F---H------FNWDLA-STDD------NA-LCTNHLTE------KD-----DSLS--IDW-G-GLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKAAET--K-L-KH-N--Q-----------------YLVLLI--PSRTD-TSYWHDYI-----------F-GKA-E-IKF-I-RGRLKF----AID-GE------------Q----K----D-------A------APF-----------PSALIIYK-----G-E---------------------------------------------------------------|
      HMPREF0555_0745_Leuconostoc_mesenteroides_subsp_cremoris_ATCC_19254_227352467   VD---K-------VLFSS--------------NSM----V-WETPKDYFDKLNR-K-F---K------FDLDAC-ASDT------NH-KVDTYFTE------DD-----NALE--QKW-----G--G-NVFMNPPY--------------------------GR-H----I-----G-KFIKKAYEE--HLR-DP-N--R-----------------FIVMLI--PSRTD-TKYWHEYI-----------Q-DKA-T-VKF-I-KGRLKF----EID-GE------------S----M----D-------A------APF-----------PSALVVY-GF---------------------------------------------------------------------|
      HMPREF0555_RS01180_Leuconostoc_mesenteroides_738135700                          VD---K-------VLFSS--------------NSM----V-WETPKDYFDKLNR-K-F---K------FDLDAC-ASDT------NH-KVDTYFTE------DD-----NALE--QKW-----G--G-NVFMNPPY--------------------------GR-H----I-----G-KFIKKAYEE--HLR-DP-N--R-----------------FIVMLI--PSRTD-TKYWHEYI-----------Q-DKA-T-VKF-I-KGRLKF----EID-GE------------S----M----D-------A------APF-----------PSALVVY-GF---------------------------------------------------------------------|
      N644_0465_Lactobacillus_plantarum_AY01_544589963                                IN---K-------ALFTS--------------NKE----D-WETPQDFYDRLNA-K-Y---H------FEWDLA-ASDG------NA-KCGDYFTS------DD-----NSLE--QDW-E-RLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKASET--Q-L-KH-D--Q-----------------FLVMLI--PSRTD-TSYWHDYI-----------F-NHA-E-IEF-L-RGRLKF----EVD-GV------------G----G----D-------S------APF-----------PSAVVIYT-----G-E--GNV----------------------------------HE-NPE--L------------LEE|
      DK41_RS08970_Streptococcus_agalactiae_642982737                                 VQ---K-------SLLSS--------------DKD----Y-WETPQTFFKKLNN-E-F---D------FDLDVA-SSHD------NA-KCKNHFTV------VE-----DGLS--QDW-----T--G-NVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE--SLK-PY-N--N-----------------VIVLLI--PARTD-TKYWHDYI-----------F-GKA-KDIRY-L-KGRLKF----TIN-GK------------E----N----Y-------P------APF-----------PSAVIIY------------------------------------------------------------------------|
      SAG0375_RS111635_Streptococcus_agalactiae_487848063                             VQ---K-------SLLSS--------------DKD----Y-WETPQTFFKKLNN-E-F---D------FDLDVA-SSHD------NA-KCKNHFTV------VE-----DGLS--QDW-----T--G-NVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE--SLK-PY-N--N-----------------VIVLLI--PARTD-TKYWHDYI-----------F-GKA-KDIRY-L-KGRLKF----TIN-GK------------E----N----Y-------P------APF-----------PSAVIIF------------------------------------------------------------------------|
      L964_RS00605_Leuconostoc_pseudomesenteroides_491052808                          NS---K-------ALFSS--------------KSM----V-WETPKDYFDKLNR-K-F---K------FDLDAC-ASDT------NH-KVDTYFTE------DD-----DALE--QKW-----G--G-NVFMNPPY--------------------------GR-H----I-----G-EFIKKAYEE--HLR-DP-N--R-----------------FIVMLI--PSRTD-TKYWHEYI-----------Q-DKA-T-VKF-I-KGRLKF----ELD-GR------------P----M----N-------T------APF-----------PSALIIY-GL---------------------------------------------------------------------|
      ZJ316_RS06725_Lactobacillus_plantarum_505193070                                 VN---K-------ALFTS--------------NKE----D-WETPQDFYDRLNA-K-Y---H------FEWDLA-ASDG------NA-KCGHYFTS------DD-----NSLE--QDW-E-RLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKASET--Q-L-KH-D--Q-----------------FLVMLI--PSRTD-TSYWHDYI-----------F-NHA-E-IEF-L-RGRLKF----EVD-GV------------G----G----D-------S------APF-----------PSAVVIYT-----G-E--GNV----------------------------------HE-NPE--L------------LEE|
      T370_RS0102475_Bilophila_wadsworthia_736486878                                  MN-----------VHFLS--------------KKH----D-WATPWPLFRELNA-R-F---GP-----CELDVC-ATAR------NA-KCGNFFSP------EE-----DGLR--QVW-H------G-VCWMNPPY--------------------------GR-A----L-----P-HWMAKAVNEIEM---ER-A--E-----------------RVICLL--PARTD-TAWWHRYV-----------L-PFAAE-IHY-L-RGRIRF--------EG------------A----G----S-------S------APF-----------PSAVVIF------------------------------------------------------------------------|
      RBAU_RS10310_Bacillus_amyloliquefaciens_752856685                               ME-T-K-------TNFNQGVFFNP------EDRTD----V-WATPIDFFNKINE-R-Y---K------LNLDVC-AKPS------NA-KCKNFFTP------EI-----DGLK--QKW-V------G-RVWMNPPY--------------------------GR-E----I-----K-KWIKKAYEE--V---EN-G--N---------------SEIAVCLV--PARTC-SAWWHEYC-----------M-KG--E-ILF-I-RHRLKF--------GG------------S----K----I-------N------APF-----------PNALVIF------S-N--EHV-------------------------------NT-YK-AID--R--EGNLVI-------|
      JCM13658_RS00605_Bacteroidales_490421344                                        MD-----------VTFEG---K-S------STGKN----E-WLTPPCLLRRLGP---F-----------DLDPC-SPVN-RP---WD-TARHHYTI------ED-----DGLQ--QPW-F------G-RVFCNPPY--------------------------DT-A--L-I-----V-RFIRRCVEH------R-----------------------NAVALT--FARTD-TRLFHELI----FP-----N-ADS---ILF-I-KGRLSF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLIAF------N----KEN----------------------------------TA-VLE--T--CG-----------|
      ACINWC323_A0077_Acinetobacter_sp_WC-323_425484490                               MA-K-S-------KLFGL-----A------EDRTD----V-WATPQDFFDKLNA-I-F---D------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLS--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NA-G--Y-----------------TVVALL--PARTD-VGWWQSHC-----------L-NR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCAVVVF------R-P--SLN-------------------------------DV----RWE--Q--SQ-----------|
      HMPREF1069_RS24300_Bacteroides_ovatus_490451898                                 MD-----------VTFEG---K-S------STGKD----E-WLTPPCLLRRLGP---F-----------DLDPC-SPVN-RP---WD-TARHHYTI------ED-----DGLQ--QPW-F------G-RVFCNPPY--------------------------DT-A--L-I-----V-RFIRRCVEH------R-----------------------NAVALT--FARTD-TRLFHELI----FP-----N-ADS---ILF-I-KGRLSF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLIAF------N----KEN----------------------------------TA-VLE--T--CG-----------|
      N007_RS30575_Alicyclobacillus_acidoterrestris_750137118                         QG-Q-D-------VLFSS--------------ASI----E-WGTPQHIFDALDA-E-F---H------FTLDAA-ANVH------NH-KCDKWYGT-QS---DG-TFI-DGLA--QDW-S---G--E-TIWLNPPY--------------------------QR-N--V-I-----D-KWAHKAYTS--A-R-DN-G--T-----------------TVVLLL--PARLD-VKWWNKYC-----------V-YAP-E-IRF-V-EGRIRF----EQE-GK-------------------Y--N-------S------ATF-----------PSAIVIF------------------------------------------------------------------------|
      N007_05570_Alicyclobacillus_acidoterrestris_ATCC_49025_529047023                QG-Q-D-------VLFSS--------------ASI----E-WGTPQHIFDALDA-E-F---H------FTLDAA-ANVH------NH-KCDKWYGT-QS---DG-TFI-DGLA--QDW-S---G--E-TIWLNPPY--------------------------QR-N--V-I-----D-KWAHKAYTS--A-R-DN-G--T-----------------TVVLLL--PARLD-VKWWNKYC-----------V-YAP-E-IRF-V-EGRIRF----EQE-GK-------------------Y--N-------S------ATF-----------PSAIVIF----R-G----GVS-------------------------------DV----PYQ------------------|
      HMPREF0179_RS04985_Bilophila_wadsworthia_749811142                              ---MNP-------ALFSS--------------AKE----D-WETPREFFERLDG-E-F---H------FDLDVC-AFPH------NA-KCPTYFTK------ED-----DGLA--RDW-G---N--R-VCWMNPPY--------------------------GK-A----I-----K-AWMTKALDA--S---RR-G--A-----------------TVVCLV--PSRTD-TAWWHDTV-----------I-AGGAE-VRF-A-RGRLRF--------VG------------A----E----H-------P------APF-----------PSAVVIF------R-P--PPS--P-------------------------------------------------------|
      BZ26_RS0118830_Clostridium_botulinum_489480013                                  MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDKLNK-E-F---N------FDLDPC-ATKE------NA-KCSKYFTK------EI-----DGLK--QDW-G---R--Y-RVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE-SK---KQ-N--T-----------------TVVMLI--PARTD-TKYFHSYI-----------Y-HKAKE-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMIVVF-RG---------------------------------------------------------------------|
      H627_RS17735_Lactobacillus_harbinensis_737460398                                MS---D-FLKPGGAALTS--------------NKD----D-WETPQAFFESLNA-K-Y---H------FAIDLA-ASKD------NA-KCDRYFSV------AD-----DSLL--QDWSD-DFG--G-AMYLNPPY--------------------------GR-H----I-----G-DWVKKAYET--S-L-RV-N--V-----------------PIVLLI--PARTD-TSYWHDYI-----------F-GKA-S-IKF-I-RGRLKF----EQN-GM------------A----G----G-------P------APF-----------PSAIIVY--N---G-D--GAE----------------------------------K-----------------------|
      K288_RS0104020_Bradyrhizobium_sp_Ai1a-2_653543986                               MT-T-A-------PLFAG---I-GAHQTPRRTRTD----E-WLTPPAVLKALGP--------------FDLDPC-APIV-RP---WP-TAAHHYTI------RD-----NGLL--LPW-F------G-RVFLNPPY--------------------------HR-S--V-I-----G-KWLARMSGH------G-----------------------RGIALI--FARTE-TEAFFRYV----WE-----Q-ASA---LLF-L-RGRLDF-H--TVD-GG------------T----A----QRQSGRAAN------AGA-----------PSVLCAY------G----PRD----------------------------------AE-MLA--F--CG-----------|
      EX05_RS06230_Agrobacterium_rhizogenes_736484955                                 MT-L---------NLFAG---M-GTHQSA-RSKTD----V-WFTPPAIIEALGGPDS-----------FDLDPC-SSVE-RP---WP-TARRHFT----P--ED-----NGLM--RPW-Q------G-RVWMNPPY--------------------------ST-Q--L-L-----R-KFMARMAEH------D-----------------------HGVALV--FARTE-TDPFHRYV----WG-----A-ASG---LLF-V-RGRLNF-H--RID-GE------------P----A----R------KN------GGA-----------PSVLIAY------G----DED----------------------------------RD-ILA--A--AP-----------|
      I569_RS06865_Enterococcus_dispar_510798824                                      MS---L-SY--K-AIMTS--------------DNQ----D-WETPQELFDNLNN-E-F---D------FELDAF-ASDK------NA-KCKHFFTE------RD-----DAFQ--QDW-T-KYK----SIFINPPY--------------------------TS-K----V-----Q-DEVLKKIND--T-I-SS-NWMG-----------------VIVLLI--PARTD-TKRWHDYI-----------F-NKA-DDIRF-I-KGRLRF----EVD-GI------------P----R----G-------S------STF-----------PSAVIVY--D---L-R--NKE----------------------------------E--------------------VAE|
      YWU_RS14235_Corynebacterium_sputi_736638732                                     MT-A-G------RKSVSD--------------TKH------WCTPPGILDSVRS-V-F---G---GK-IDLDPC-SNEH----S-LV-NASVEYKL-P----EN-----DGLA--ESW-D-F----E-RIFVNPPY--GSDPV-R-----------------KT-R----I-----A-HWFAKIAES--V---RN-G--S-----------------EVIALV--PVATN-TRHWKNHV----FP-----L-AAA---VCF-LYEPRVKF----YID-GR------------E----D-P--K-------G------APM-----------SCAIIYY------G-R--HLE-------------------------------SF-AE-NFR--H--HG-----------|
      Q333_RS16720_Brevundimonas_bacteroides_737311698                                MS---A------SHRFDN-A-K-R-RRSD-DHPRQ----A-LATPAYVLEPVRR-L-L-G-G------IGLDPC-TDPD----N-PT-GADRFYCL------PQ-----DGAS--LPW-D---A--P-SIFVNPPY--------------------------GE-A----R-----K-RWVERCVEA---------G--T--------------RT-RVVLLI--PAHTE-SKVFQLAL--R--------S-CDS---VLF-I-DARLRF--------GV--M---------R----D----N-GRQE--A------ASH-----------GSALLSW------N-V--DLS-------------------------------RI----VED--V--CG-----------|
      BSTEL_RS07490_Bifidobacterium_stellenboschense_736512951                        MT-A-G------RQPVSV--------------TKH------WCTPQKYVDAVTE-V-F---G---GT-IDLDPC-SNEY----S-TV-NARVEYRL-P----EH-----DGLR--DSW-D-Y----P-RIYVNPPY--GRDKE-H-----------------GT-T----I-----A-DWFVRIAEA--A---RN-G--S-----------------EVMALV--PVATN-TAHWKEYV----YP-----V-ASA---VCF-LYDTRLHF----VIN-GN------------E----D-T--K-------G------APM-----------SCAMIYY------G-N--HPQ-------------------------------EF-GR-VFS--R--YG-----------|
      BBIA_RS00390_Bifidobacterium_biavatii_705399968                                 MT-A-G------RHPVSQ--------------TKH------WCTPQKYVDAVTE-V-F---D---GQ-IDLDPC-SNEY----S-TV-NARVEYIL-P----EN-----DGLR--DSW-D-Y----D-RIYVNPPY--GRDVE-H-----------------GT-T----I-----A-DWFVRIADA--V---GR-G--S-----------------EVMALV--PVATN-TAHWKDFV----YP-----V-ASA---ICF-LYDTRLHF----VIN-GN------------E----D-T--K-------G------APM-----------SCCMIYY------G-D--NPR-------------------------------KF-GR-VFS--R--YG-----------|
      HMPREF0179_03455_Bilophila_wadsworthia_3_1_6_316921487                          ---MNP-------ALFSS--------------AKE----D-WETPREFFERLDG-E-F---H------FDLDVC-AFPH------NA-KCPTYFTK------ED-----DGLA--RDW-G---N--R-VCWMNPPY--------------------------GK-A----I-----K-AWMTKALDA--S---RR-G--A-----------------TVVCLV--PSRTD-TAWWHDTV-----------I-AGGAE-VRF-A-RGRLRF--------VG------------A----E----H-------P------APF-----------PSAVVIF------R-P--PPS--P-------------------------------------------------------|
      BN981_RS01320_Halobacillus_737532221                                            MNKM-D-------VHYSS--------------KTN----E-WATPQDFFDELNT-E-F---N------FTLDPC-ATPD------NA-KCDKYFTE------KD-----DGLE--QSW-E---G--E-TVFCNPPY--------------------------GR-G----I-----K-HWVKKAYQE-ST---KP-N--T-----------------TVVLLI--PSRTD-TRYFHDYV-----------Y-HKS-E-IRF-L-KGRLKF--------GD------------G----S----G-------N------APF-----------PSMVAIY-R----------------------------------------------------------------------|
      V006_02512_Staphylococcus_aureus_686297326                                      ---M-E-------VHYSS--------------KTN----E-WTTPQNLFDELNG-E-F---N------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      RDMS_RS01750_Deinococcus_sp_RL_736377798                                        ---M-A-------VHYSS--------------EKH----D-WTTPRSFFDELNA-E-F---N------FTLDAA-ASPH------NA-LCSRYFTE------AD-----DGLS--QPW-T---G--T-V-WCNPPY--------------------------GR-Q----I-----G-RWIAKAAQS--A---CE-G--A-----------------TVVMLI--PARTD-TAAWHDHI-----------LFNPQAE-VRF-V-RGRLRF--------GD------------A----T----A-------N------APF-----------PSAVIIF------R-P--GGQ--G-------------------------------------------------------|
      T666_02640_Staphylococcus_aureus_686391504                                      ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-CWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      A11W_RS0107210_Staphylococcus_hominis_515743089                                 ---M-E-------VHYSS--------------KSN----E-WATPQNLFDELNE-E-F---N------FTLDPC-ATDE------NA-KCSKYFTI------ED-----DGLS--KDW-S---K--D-VVFMNPPY--------------------------GR-E----I-----K-KWNKKAYEE--S---LN-G--A-----------------TVVCLI--PARTD-TTYWHDFI-----------F-DRADD-IRF-L-RGRLKF--------GN------------S----K----N-------S------APF-----------PSAIVVY------R----GVTT---------------------------------------------------------|
      SAGV69_RS11740_Staphylococcus_aureus_506511035                                  ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWRDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      X998_RS01715_Staphylococcus_aureus_446374007                                    ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLSE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      U183_02276_Staphylococcus_aureus_686300364                                      ---M-E-------VHYSS--------------KTN----E-WTTPQNLFDDLNR-E-F---N------FTLDPC-STDE------NA-KCQKHYTE------ND-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-HWVKKAYEE--S---IK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GE------------S----K----N-------S------APF-----------PSAIIVY------R----GVR----------------------------------------------------------|
      QZ29_RS14215_Staphylococcus_aureus_446374006                                    ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      ERS140248_02184_Staphylococcus_aureus_678260344                                 ---M-E-------VHYSS--------------KTN----E-WATPQNLFDDLNR-E-F---N------FTLDPC-STDE------NA-KCQKHYTA------KD-----NGLI--QDW-S---E--D-VVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---VK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------SE------------S----K----N-------S------APF-----------PSAIIVY------R----GGR----------------------------------------------------------|
      HMPREF9988_RS10060_Staphylococcus_epidermidis_488427723                         ---M-E-------VHYSS--------------KSN----E-WATPQKLFDELDK-E-F---N------FTLDPC-ATDE------NA-KCNKHFTI------ED-----DGLS--KDW-S---K--D-VVFMNPPY--------------------------GR-E----I-----K-KWIKKAYEE--S---LN-G--A-----------------TVVCLI--PARTD-TTYWHDFI-----------F-DKADD-IRF-L-RGRLKF--------GN------------S----K----N-------S------APF-----------PSAIVVY------L----GVTT---------------------------------------------------------|
      SAZ172_RS05790_Staphylococcus_aureus_554679133                                  ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APL-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      SA930_RS14870_Staphylococcus_aureus_446374005                                   ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----G------------------------------------------------------------|
      AS94_12270_Staphylococcus_aureus_686449191                                      ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVEKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      BN981_RS01350_Halobacillus_737533832                                            ---M-N-------VHYSS--------------KSN----D-WATPQDFFDGLDN-E-F---N------FTLDPC-ATSE------NA-KCDNYFTI------ED-----DGLK--QSW-E---G--E-TVFCNPPY--------------------------GR-E----I-----K-LWVKKAFQE-SK---KP-N--T-----------------KVVMLI--PARTD-TKYFHDYI-----------Y-MQA-R-VRF-I-KGRLKF--------GN------------G----K----G-------N------APF-----------PSMVVIF------------------------------------------------------------------------|
      W619_00569_Staphylococcus_aureus_686419170                                      ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNG-E-F---N------FTLDPC-STDE------NA-KCQKHYTA------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-HWVKKAYEE--S---VK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GE------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------|
      BN981_00304_Halobacillus_trueperi_635344555                                     MGKM-N-------VHYSS--------------KSN----D-WATPQDFFDGLDN-E-F---N------FTLDPC-ATSE------NA-KCDNYFTI------ED-----DGLK--QSW-E---G--E-TVFCNPPY--------------------------GR-E----I-----K-LWVKKAFQE-SK---KP-N--T-----------------KVVMLI--PARTD-TKYFHDYI-----------Y-MQA-R-VRF-I-KGRLKF--------GN------------G----K----G-------N------APF-----------PSMVVIF------------------------------------------------------------------------|
      LILY_61_Bacteriophage_Lily_755258783                                            MS-NTMA------VHYSS--------------KTD----M-WETPQDFFDKLHA-E-F---G------FTLDVC-AVPE------NA-KCERFFSP------DD-----NGLL--QNW-K------G-VCWMNPPY--------------------------GR-Q----I-----G-AWIAKAYES--S---LE-G--A-----------------TVVCLV--PSRTD-TKWWHDYC-----------L-KG--E-VRF-I-KGRLKF--------GG------------S----P----H-------N------APF-----------PNAIVIF------R-G--KGQ----------------------------------------------------------|
      ERIC1_RS03940_Paenibacillus_larvae_738763505                                    MN-K---------VHYSS--------------KTD----M-WETPQNLFDRLNE-E-F---K------FDLDVC-AIPE------NA-KCKRYFTP------SE-----DGLK--QEW-K------G-ACWMNPPY--------------------------GR-Q----I-----G-KWIAKAYES--S---LE-G--A-----------------TVVCLV--PSRTD-TKWWHGYC-----------M-KG--E-IRF-I-RGRLKF--------GG------------S----P----H-------N------APF-----------PNAVVIF------R-----------------------------------------------------------------|
      ERIC1_1c08270_Paenibacillus_larvae_subsp_larvae_DSM_25719_567770034             MN-K---------VHYSS--------------KTD----M-WETPQNLFDRLNE-E-F---K------FDLDVC-AIPE------NA-KCKRYFTP------SE-----DGLK--QEW-K------G-ACWMNPPY--------------------------GR-Q----I-----G-KWIAKAYES--S---LE-G--A-----------------TVVCLV--PSRTD-TKWWHGYC-----------M-KG--E-IRF-I-RGRLKF--------GG------------S----P----H-------N------APF-----------PNAVVIF------R-G--RKE-------------------------------SLHGQKRNE--T--KD-----------|
      Q397_RS23350_Terasakiella_pusilla_740171937                                     MN--WTH------VQRTNGG------------TSD----Q-WFTPFELLNTLYD-A-C---G-V-MQ-FDLDPC-SPPE-YL-A-HT-KAKKRFCV------ES-GD--DGLA--EDW-R---G--K-TIFMNPPY--------------------------GR-G----I-----D-KWVEKACTE--V---AN-G--N--------------AK-TVIGLL--PVKAD-TDWWHNHV--A-MK--AD-M-------FVF---NGRLKF--------GN------------A----K----G-------S------GRF-----------ASALAIW------------------------------------------------------------------------|
      OPIT5_22060_Opitutaceae_bacterium_TAV5_573475515                                --------------MTSS--------------MDM----T-WGTPQVWFDYLHL-E-F---G------FTLDPC-CLHQ------TA-KCKKHYTP------AE-----DGLA--QSW-A---E--E-RVFMNPPY--------------------------GR-D----L-----P-KWMKKAYEE--A---RDNG--T-----------------LIVCFV--PARVD-TEWWHRYA-----------T-K-G-E-VRF-P-KGRVKF--------AD------------A----L----D-------S------APF-----------PVAVVIF------R-S--RL-----------------------------------------------------------|
      GL4_RS02905_Methyloceanibacter_caenitepidi_779886230                            MT-L-G-----SHQRCVG--------------KSQ----Q-HLTPRWILDPLGE--------------FELDPC-AASP-RP---WS-CADVNYTE------ED-----DGLS--QVW-S------G-RVWLNPPF--------------------------NR-Y--V-V-----G-DWMDRFFDH------G-----------------------RGIALL--HARTE-TNWF-RLV----WK-----C-ASA---LLF-L-DKRVKF-C--RSD-GS------------M----Q----E--A----N------SGA-----------PVVLVAA------D----DLN----------------------------------AA-CLR--R--CG-----------|
      TY47_RS06930_Lactobacillus_brevis_754895979                                     MN---N-------ALLSS--------------EKN----Y-WETPHDFFKKLNE-K-Y---Y------FSFDLA-ASPE------NT-KCENFFSE------ED-----NSLT--KAW-H-ELK--G-NLFLNPPY--------------------------GR-E----L-----R-KWVKKAYEE--S-LKKH-D--G-----------------YIVLLI--PARTD-TSYWHDFI-----------F-GKA-Q-INF-L-RGRIKF----ELH-GE------------S----K----D-------A------APF-----------PSAIVIY-G----G-S--Q------------------------------------------------------------|
      HMPREF1020_RS23965_Clostridium_sp_7_3_54FAA_496656604                           MN---D-------ALLSS--------------KNM----C-WCTPPDFFAELDR-E-F---H------FELDPA-STDK------SA-KCAKHFTP------DD-----DGLK--QDW-----G--GYRVFCNPPY--------------------------GR-A----I-----A-DWVRKGYEE--S-R-KP-G--T-----------------TVVMLI--PSRTD-TAYFHDWI-----------F-GKA-SEVRF-L-RGRLKF-T--DED-GN------------G----E----D-------A------APF-----------PSAVIVW-RSPE-S-T--GRE----------------------------------FA-TWH--I---------------|
      CLOM621_RS14915_Clostridiales_492715347                                         MN---D-------ALLSS--------------KNM----C-WCTPPDFFAELDR-E-F---H------FELDPA-STDK------SA-KCAKHFTP------DD-----DGLK--QDW-----G--GYCVFCNPPY--------------------------GR-A----I-----A-DWVRKGYEE--S-R-KP-G--T-----------------TVVMLI--PSRTD-TAYFHDWI-----------F-GKA-SEVRF-L-RGRLKF-T--DED-GN------------G----E----D-------A------APF-----------PSAVIVW-RSPE-S-T--GRE----------------------------------FA-TWH--I---------------|
      ANACOL_RS13845_Anaerotruncus_colihominis_493931641                              ---N-K-------ALLSS--------------KRL----D-WCTPRDFFDALDV-E-F---H------FTLDAA-ATEK------SA-KCAKYYTP------ET-----DGLS--ASW-A---G--E-TVFCNPPY--------------------------GR-E----I-----K-AWIKKGFEE-GQ---QS-G--T-----------------TVVLLI--PSRTD-TEYFHKYI-----------L-GKA-E-IRF-L-KGRLKF--------TD------------E----EGLTQD-------A------APF-----------PSMLVIY------R----GQGKEQ-NDG---------------------------------------------------|
      ND2E_3441_Colwellia_psychrerythraea_694338559                                   VK-K-L-------AYIGS---K-P-GDIT-SRDSD----S-WYTPNIYTDMTRK-V-L-G-T------IDLDPF-SSSL-AN-E-YV-KAERYFDA------DS-----NAFK--QIW-F-K-EQ-G-TVFMNPPY--SRKLI-------------------DK-A----V-----E-IFLQNISDS--S-I-S-----------------------QAVVLV--NNATE-TKWFQSLT--R--------K-SDA---LCL-V-DKRIPF-E--SFD-GK-----------------H----S-------S------GNT-----------RGQVFLY--Y---G-V--NKK-------------------------------AF-KK-VFK--E--IG-----------|
      JCM19241_5986_Vibrio_sp_JCM_19241_749448467                                     MT-Q-H-------AKIAN--------------MNN----E-WHTPHQYIDSARK-V-M-G-S------IDTDPA-SNDI-AQ-E-YI-QADTYYTI------DN-----SSLD--KEW-S------G-NVWMNPPY--------------------------GR-T----I-----K-DFCNKLVDE----F-ES-G--R--------------VK-QAIVLT--NNGTD-TQWFDALS--G--------I-SSA---ICH-H-KKRIAF-L--RPT-GE-----------------R-V--N--------------NNT-----------KGQIFMY--I---G-D--NSQ-------------------------------AF-RD-EFN--Q--YG-----------|
      G469_RS0106650_Atopobium_fossor_654811069                                       -----M-------STFTS-GLR-S------S-ASN----E-WTTPKDLFDELNR-E-F---K------FTVDAA-STHE------NA-LVDKHWTL------AE-----DGLA--QCW-D---G--E-RVWCNPPY--------------------------GR-Q----I-----A-QWVKKASEA--V------G--G-----------------VVVMLI--PARTD-TSYWHDYV-----------F-PNASD-IRF-I-RGRLHF--------SQ------------S----K----T-------A------APF-----------PSAIVVF------E-R--WA-----------------------------------------------------------|
      HMPREF1247_RS02895_Atopobium_488626325                                          -----M-------TAFTS-GLR-S------S-TSN----E-WTTPKYLFDELNR-E-F---K------FTVDAA-STHE------NA-LVDKHWTI------EE-----DGLS--QCW-D---N--E-RVWCNPPY--------------------------GR-Q----I-----A-KWVKKASEA--V------G--G-----------------VVVMLI--PARTD-TAYWHDYI-----------F-SNASD-IRF-I-CGRLHF--------SN------------S----K----N-------A------APF-----------PSAIVVF------E-R--WQ-----------------------------------------------------------|
      N644_RS02335_Lactobacillus_plantarum_727092536                                  MN---K-------ALFTS--------------NKE----D-WETPQDFYDRLNA-K-Y---H------FEWDLA-ASDG------NA-KCGDYFTS------DD-----NSLE--QDW-E-RLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKASET--Q-L-KH-D--Q-----------------FLVMLI--PSRTD-TSYWHDYI-----------F-NHA-E-IEF-L-RGRLKF----EVD-GV------------G----G----D-------S------APF-----------PSAVVIY------------------------------------------------------------------------|
      CC61_RS14530_Chromobacterium_sp_C-61_748184431                                  MA-E-N-------VHFST--------------GKD----E-WPTPQALFDQLNA-E-F---G------FTIDVC-ATAK------NA-KCTKFYTQ------VD-----DGLA--QNW-A---G--E-VVWMNPPF--------------------------GH-S----I-----K-LWMAKAYRS--S---LD-G--A-----------------LVVCLV--PARTD-TRWWHRVV-----------M-KAS-E-VRV-L-DKRLRF--------DG------------G----N----H-------K------APF-----------PAVVVVF------------------------------------------------------------------------|
      QI18_RS10395_Lactococcus_lactis_746045508                                       MN-R-E-------LMFSS--------------KTD----L-WSTPWNFFEKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTI------KE-----DGLL--QDW-G---N--E-VVFCNPPY--------------------------GR-K----I-----K-DWVKKAYEE-SQ---KD-N--T-----------------TVVMLI--PARTD-TIYFHEYV-----------Y-HKA-E-VRF-I-KGRLKF--------GD------------A----K----N-------A------APF-----------PSMVVIFRKDNQ-------------------------------------------------------------------|
      RN16_RS04075_Chromobacterium_subtsugae_759887196                                LS-E-Q-------IHFSS--------------KTD----E-WPTPQALFDQLHA-E-F---G------FTLDVC-ATQE------NA-KCERFFTR------EQ-----DGLA--QDW-S---R--E-VVWMNPPF--------------------------GH-Q----I-----K-LWMAKAYRS--S---ID-G--A-----------------LVVCLV--PARTD-TRWFHRHA-----------L-KAA-E-IRA-L-DKRLRF--------DG------------A----K----A-------K------APF-----------PAVLVVY------------------------------------------------------------------------|
      MMA_RS11485_Janthinobacterium_sp_Marseille_501027971                            -M-S-K-------VHFSS--------------ATP----E-WYTPQSTFDVLNA-E-F---G------FTLDPC-CTHE------NA-KCDRHFTM------AE-----NGLS--QDW-S---N--E-VTFMNPPY--------------------------GR-E----I-----K-EWMRKAYES--S---LS-G--A-----------------TVVCLV--PARTD-TAWWHDYS-----------I-K-G-E-IRF-L-RGRLKF--------GG------------A----K----T-------N------APF-----------PSAIVIF------R-P--------------------LPIKELA------------------------------------|
      AWRIB429_RS09790_Oenococcus_oeni_768719850                                      MN-N-E-------LMFSS--------------KTD----L-WSTPNDFFDKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTK------EE-----NGLL--QDL-G---N--E-VVFCNPPY--------------------------GR-Q----I-----K-DWVKKSYEE-SQ---KD-N--T-----------------TVVMLI--PARTD-TIYFHEYI-----------Y-HKA-E-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMVVIFE-----------------------------------------------------------------------|
      TH16_RS01985_Staphylococcus_caprae_488372936                                    ---M-S-------VHFSS--------------KSN----E-WYTPQYLFDELNE-K-Y---Q------FTLDPC-ASHE------NA-KCDKYFTI------ED-----DGLT--KDW-S---K--D-IVFMNPPY--------------------------GR-N----I-----K-HWIKKAYEE--S---VK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NNAYN-IKF-L-KGRIKF--------GG------------A----V----N-------S------APF-----------PSAIVVF------KPKGDGLK----------------------------------------------------------|
      OR63_RS06485_Clostridium_tetani_737140426                                       MN-T-A-------VMFSS--------------ETD----L-WATPQEFYNELNK-E-F---N------FDLDPC-ATHE------NA-KCPKYYTV------VE-----DGLK--QDW-Q---G--H-KVFCNPPY--------------------------GR-E----I-----S-KWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TKYFHSYI-----------Y-RKAKE-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------|
      G454_RS0114655_Desulfovirgula_thermocuniculi_654109520                          ML-N-R-------GLFSS--------------ASS----E-WETPQKFFETLDV-E-F---G------FTLDVC-ARPE------NA-KCPRYFSP------EE-----DGLR--QEW-A---P--E-VCWMNPPY--------------------------GR-E----I-----G-KWIQKAYEE--A---QK-G--A-----------------TVVCLL--PSRTD-TAWWHEYV-----------M-RAA-E-VRF-I-RGRLRF--------GG------------A----E----N-------G------APF-----------PSCVVVF------R-P--GYS--G--------LPV-VKSMAAR------------------------------------|
      RM98_RS18265_Chromobacterium_violaceum_759932528                                LS-E-Q-------VHFSS--------------KTD----E-WPTPQALFDQLHE-E-F---G------FTLDVC-ATAE------NA-KCERFFTR------EQ-----DGLA--QDW-S---R--D-VVWMNPPF--------------------------GH-Q----I-----K-LWMAKAYRS--S---ID-G--A-----------------LVVCLV--PARTD-TRWFHRHA-----------L-KAA-E-IRA-L-DKRLRF--------DG------------A----K----A-------K------APF-----------PAVLVVY------------------------------------------------------------------------|
      T259_RS08765_Clostridium_botulinum_748203410                                    MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDKLNK-E-F---N------FDLDPC-ATHE------NA-KCSKYFTK------EI-----DGLK--QDW-Q---G--Y-KVFCNPPY--------------------------GR-V----L-----K-DWVKKCYEE-SL---KP-N--T-----------------TVVMLI--PARTD-TKYFHEYI-----------Y-HKVKE-IRF-V-KGRLKF--------GD------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------|
      DESKU_RS03925_Desulfotomaculum_kuznetsovii_503587829                            ML-N-E-------SMFSS--------------RTG----E-WETPQTFFDALDA-E-F---H------FTLDVC-ARPE------NA-KCARFFTP------EQ-----DGLR--QSW-A---G--E-TCWMNPPY--------------------------GR-E----I-----G-RWVEKAYNE--A---RR-G--A-----------------VVVALL--PARTD-TRWWHRYV-----------M-RAA-E-IRF-V-EGRLKF--------GG------------A----E----N-------S------APF-----------PSVVVVF------T-PEKAVS--D--------GPV-VRSMRVK------------------------------------|
      CLOSCI_RS06430_[Clostridium]_scindens_748651356                                 LN-K-A--------LFSS--------------AKE----D-WATPQDFFDELNK-E-F---H------FDLDPC-ADAE------NA-KCKEFFTK------EQ-----NGLL--QDW-G---G--R-CVFCNPPY--------------------------GRTS----T-----G-EWIKKCYEE-AQ---KP-G--T-----------------VVVALI--PARTD-TRFFHDYI-----------Y-HKA-E-IRF-I-KGRLHF--------GG------------C----K----D-------A------APF-----------PSMVVVF---RKGK----ENEEEK-KTGCTAAGHT-EEKAAEKDDGSENGVDGI-------------------------|
      NZ45_03810_Clostridium_botulinum_700273311                                      MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDKLNK-E-F---D------FDLDPC-ATHE------NA-KCSKYFTK------EI-----DGLK--QDW-Q---G--H-KVFCNPPY--------------------------GR-G----I-----K-DWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TRYFHEYI-----------Y-HKAKE-IRF-V-KGRLKF--------GS------------A----K----N-------S------APF-----------PSMVVVF---RGE------------------------------------------------------------------|
      MCOL2_RS04700_Listeria_fleischmannii_738104299                                  MD---R-------VIFSS--------------ERD----D-WETPTDLFNELDK-E-F---L------FDLDAT-ANKN------NA-KCPKFFTK------EQ-----NALV--QEW-----R--G-SVFCNPPY--------------------------GR-E----I-----Q-KFIEKAYIE--SKK-AY-C--E-----------------RVVLLI--PARTD-TKIWHDFI-----------F-PFS-KEIIF-I-KGRLKY----ELN-KI------------S----N----S-------P------APF-----------PSAIIIF---EECNL----------------------------------------------------------------|
      BN927_RS09785_Lactococcus_lactis_554763517                                      MN-K-E-------LMFSS--------------KTD----L-WSTPWNFFDKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTI------EE-----DGLL--QDW-G---N--E-VVFCNPPY--------------------------GR-Q----I-----K-DWVKKAYEE-SQ---KD-D--T-----------------TVVMLI--PARTD-TIYFHEYI-----------Y-HKA-E-IRF-I-KGRLKF--------GD------------A----K----N-------A------APF-----------PSMVVIF-----RKDNQ--------------------------------------------------------------|
      PI74_RS05125_Clostridium_botulinum_500994137                                    MN-T-A-------VMFSS--------------GTD----L-WATPQDFFDKLNK-E-F---D------FDLDPC-ATHK------NA-KCSKYFTK------EI-----DGLK--QDW-Q---G--Y-KVFCNPPY--------------------------GR-S----I-----K-DWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TRYFHEYI-----------Y-NKAKE-IRF-V-KGRLKF--------GD------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------|
      Q332_RS01180_Pseudobacteroides_cellulosolvens_739064083                         ---T-E-------IMFSS--------------KSD----E-WETPQQFFDKLHK-E-F---N------FQLDVC-ATAE------NA-KCDKYYTK------ID-----DGLS--QSW-H---HWAQ-RCWMNPPY--------------------------GR-N----I-----D-KWIKKAFDE--S---QE-G--A-----------------TVVCLI--PARTD-TKYWHTYC-----------M--KAHE-IRF-V-KGRLKF--------SN------------S----K----D-------C------APF-----------PSAIVVF------K-P--TLK--QLKVSSY-------------------------------------------------|
      G454_RS0102995_Desulfovirgula_thermocuniculi_654100680                          MF-N-R-------VLFSS--------------ATS----E-WETPQELFARLHA-E-F---G------FTLDVC-ARPW------NA-KCTRYFSP------EQ-----NGLI--QEW-A---P--E-TCWMNPPY--------------------------GR-E----I-----S-RWVRKAWEE--A---QK-G--A-----------------TVVCLL--PSRTD-TAWWHEYV-----------M-RAA-E-IRF-I-RGRLHF--------EG------------A----K----N-------G------APF-----------PSCVVVF------R-P--GCT--G--------PPV-IRSMAAR------------------------------------|
      Phi93_04_Lactococcus_phage_phi93_673939868                                      MN-N-E-------LMFSS--------------KTD----L-WSTPNDFFDKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTK------EE-----NGLL--QDW-G---N--E-VVFCNPPY--------------------------GR-Q----I-----K-EWIKKSYEE-SQ---KD-N--T-----------------TVVMLI--PARTD-TIYFHEYI-----------Y-HKA-E-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMVVIF----E-------------------------------------------------------------------|
      GAP32_068_Cronobacter_phage_vB_CsaM_GAP32_414086984                             NN-M-S-------VHFSS--------------ASN----T-WDTPDDFYQKLHA-V-W---N------FTLDPA-AMDE------TA-KCEKYYTP------ET-----DGLA--HSW-A---G--E-TVWCNPPY--------------------------GR-E----I-----S-KWFKKFDEE-FK---QN-G--T-----------------TIIALP--PARTD-TTYFHKYV-----------R-DSATA-ICF-V-KGRLKF--------DNRSLPSWKEDGSHK----K----T-------G------APF-----------PSMIVIY----D-N----NITQEK-YEVLNSLGFV-VQPFLLG------------------------------------|
      CO98_RS04645_Staphylococcus_aureus_739716594                                    ---M-S-------VHFSS--------------KSN----E-WTTPQYLFDELNE-E-F---N------FTLDPC-ATDE------NA-KCSKYFTI------ED-----DGLS--KDW-S---N--D-VVFMNPPY--------------------------GR-E----I-----K-KWIKKAYEE--S---LN-G--A-----------------TVVCLI--PARTD-TTYWHDFI-----------F-DKADD-IRF-L-KGRLKF--------GN------------S----K----N-------S------APF-----------PSSIVIY------E----CKEAEQ-------------------------------------------------------|
      TS65_RS13365_Aneurinibacillus_migulanus_759006369                               MN-T-A-------VMFSS--------------ATD----E-WATPQDFFDQLNQ-E-F---H------FTLDPC-ATHE------SA-KCARYFTE------ED-----NGLA--QDW-T---G--E-IVFMNPPY--------------------------GR-V----L-----G-QWVKKAFEE--S---IK-G--A-----------------TVVCLL--PARTD-TRWFHDYI-----------Y-HRA-E-IRF-V-KGRLKF--------GD------------S----K----N-------S------APF-----------PSMVVIF------N-RA-GVKVGG-------------------------------------------------------|
      KU40_RS04850_Clostridium_botulinum_737823765                                    --------------MFSS--------------KTD----M-WSTPQDFYNKLNQ-E-F---N------FNLDPC-STNE------NA-KCERHYTI------AE-----DGLK--QNW-V---G--S-TVFCNPPY--------------------------GR-V----L-----K-DWVKKCYEE-SK---KD-N--T-----------------TVVMLI--PARTD-TTYFHNYI-----------Y-KKVKE-IRF-I-RGRLKF--------GD------------C----K----N-------A------APF-----------PSMVVVF------------------------------------------------------------------------|
      SD74_RS18965_Clostridium_botulinum_752703286                                    MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDELNK-E-F---D------FDLDPC-ATHE------NA-KCDKYYTI------VE-----DGLK--QDW-Q---G--H-KVFCNPPY--------------------------GR-G----I-----K-DWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TKYFHSYI-----------Y-HKAKE-IRF-I-KGRLKF--------GD------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------|
      EMTOL_RS19950_Emticicia_oligotrophica_504839093                                 MN-I-K-------AIFSC--------------KTT----N-WETPQDLFDELDK-Q-Y---N------FTLDVC-ATSE------NA-KCNEFFTP------EI-----DGLK--QEW-K------G-MCWMNPPY--------------------------GR-E----I-----G-KWVRKAHLE--V---IT-G--R----------------CRIIALL--PARTD-TKWFHEWV-----------LNKH--E-IKF-I-KGRLRF--------SD------------S----K----N-------S------APF-----------PSMLVIF------E-G--RP-----------------------------------------------------------|
      J546_RS10975_Acinetobacter_baumannii_736663998                                  MA-N-H-------QLFGL-----A------ENRTD----I-WATPQDFFDKLNA-V-F---K------FDLDVC-ALPN------NA-KCERFFSP------ED-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----I-EWVAKAACT--A---KQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----S-------N------APF-----------GCCVVVF------R-P--TLN-------------------------------DV----EWE--N--AG-----------|
      J532_4398_Acinetobacter_baumannii_940793_630464595                              MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------|
      J594_4091_Acinetobacter_baumannii_259052_588219826                              MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      ACIN5021_2863_Acinetobacter_sp_OIFC021_444754682                                MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      J660_0735_Acinetobacter_baumannii_88816_593668543                               MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GC-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------|
      PaVLD_ORF117R_Planktothrix_phage_PaV-LD_371496242                               IQ-Q-L-------CLFET---Q-P--SLI---DSN----E-NYTPSDLIDLVHK---F-Y-G-F----PELDPF-SCEQ-AN-Q-II-KAQKIFTI------QD-----DGFK--QNW-R-R-A--K-TLWLNPPY----------SA--------------GF------I-----E-KVVDKLIAT--L---NE-T--E----A------------EAFLLT--NTDNS-TAWYKKAL----NR-----C-DR----FCL-P-STRLTF-Y--SPK-RA--V----E----G--K-K-Q--N-------Q------NRF-----------SQTLFYF------G-L--QPQ-------------------------------RF-EE-IFE--G--WG-----------|
      M095_RS22645_Bacteroidales_492455391                                            MN-----------TSFER---S--------KQTTD----E-WYTPKWIVDALGS--------------FDLDPC-APEN-RL---WN-TAKRHITP------SE-----DGLK--TEW-G------GVRVWLNPPY--------------------------SR-P--L-I-----E-RFVEKMVRN------N-----------------------NGIALL--FNRCD-SKMFQDLI----FP-----N-ASA---IMF-V-KGRIKF-Y--RPD-GT------------Q----G----D-------S------PGC-----------GSVLIAF------G----EEN----------------------------------AK-ILE--Y--SN-----------|
      BN796_00478_Alistipes_sp_CAG:831_547185524                                      MN-----------TSFER---C--------ANTTD----E-WYTPKWIIDSLGE--------------FDLDPC-SPAN-RL---WN-TAKRHITP------QE-----DGLK--TSW-G------GVRVWLNPPY--------------------------SR-P--L-I-----E-RFVEKMVAN------N-----------------------NGIALL--FNRCD-SKMFQDLI----FP-----N-ASA---ILF-V-RGRIKF-Y--RPD-GT------------Q----G----D-------S------PGC-----------GSVLIAF------G----ESN----------------------------------AE-ALE--K--SN-----------|
      JCM13658_RS05485_Bacteroides_gallinarum_517496590                               MD-----------VRFEG---R-S------STGKN----E-WLTPPDLLERLGP--------------FDLDPC-APVN-RP---WA-TAAHHYTI------ED-----DGLK--QPW-F------G-RVFCNPPY--------------------------DT-S--L-I-----V-QFIRRCSEH------G-----------------------NAVALT--FARTD-TRLFHEWI----FP-----R-ADS---VLF-I-KGRLSF-H--HVS-GE------------R----G----S-------T------AGA-----------PSCLIAF------G----KAN----------------------------------TA-VLK--S--CG-----------|
      HMPREF9447_RS03430_Bacteroides_oleiciplenus_496419253                           MN-----------VTFEG---N-S------HTGKN----E-WLTPPDLLKKLGH--------------FDLDPC-SPVN-RP---WS-TAHRHYTI------LD-----NGLE--QEW-T------G-RVFCNPPY--------------------------DT-N--L-I-----V-RFIHRCAEH------G-----------------------NAIALT--FARTD-TRLFHDEI----FR-----K-ADS---ILF-I-KGRLRF-Y--HVN-GE------------Q----G----G-------T------AGA-----------PSCLIAF------N----KEN----------------------------------TE-VLR--N--CG-----------|
      BN938_RS08150_Mucinivorans_hirudinis_740870005                                  MN-----------VTFEG---N-S------STGKN----E-WLTPPDILAKLGE--------------FDLDPC-APIN-RP---WA-TANNHFTI------ED-----DGLV--QPW-Q------G-RVFCNPPY--------------------------DT-R--L-I-----I-QFIERCIEH------K-----------------------NAIALT--FARTE-TKLFQELI----FR-----H-AHS---ILF-I-KGRLSF-H--HVT-GE------------R----G----G-------T------AGA-----------PSCLIAF------D----EAN----------------------------------SQ-VLK--N--CG-----------|
      TY03_RS11290_Bacteroides_graminisolvens_640565353                               MN-----------VTFEG---N-S------ATGKN----Q-WLTPPELLAKLGQ--------------FDLDPC-APIN-RP---WP-TATQHYTI------ED-----DGLK--QPW-F------G-RCWVNPPY--------------------------DT-Q--L-I-----I-QFIERCVEH------K-----------------------NAIALT--FSRTE-TKLFQELI----FK-----K-AHS---ILF-I-KGRLSF-H--HVT-GE------------R----G----G-------T------AGA-----------PSCLISF------N----EVN----------------------------------SE-ILK--S--CG-----------|
      M120_RS21850_Bacteroides_fragilis_695522728                                     MN-----------VTFEG---K-S------STGKN----E-WLTPPCLLDRLGE--------------FDLDPC-SPVN-RP---WD-TARHHYTV------GD-----DRLR--QPW-F------G-RVFCNPPY--------------------------DT-P--L-I-----V-RFIRKCVEH------R-----------------------NAIALT--FARTD-TRLFHELI----FP-----Y-ADT---ILF-I-RGRLRF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLISF------N----REN----------------------------------TA-ALK--M--CG-----------|
      M121_RS02435_Bacteroides_fragilis_695336745                                     MN-----------VTFEG---K-S------STGKN----E-WLTPPCLLDRLGE--------------FDLDPC-SPVN-RP---WD-TARHHYTV------GD-----DRLR--QPW-F------G-RGFCNPPY--------------------------DT-P--L-I-----V-RFIRKCVEH------R-----------------------NAIALT--FARTD-TRLFHELI----FP-----Y-ADT---ILF-I-RGRLRF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLISF------N----REN----------------------------------TA-ALK--M--CG-----------|
      H599_RS0112420_Flavobacterium_daejeonense_652309842                             MN-----------TSFER-----C------ENTKV----E-WLTPPELVKKLGE--------------FDLDPC-SPIN-AP---FL-HAKNNFTV------LD-----NGLS--QKW-F------G-RVYLNPPY--------------------------GR-G--M-E-----L--WLEKLKFH------G-----------------------NGIALI--FARTE-TKCFFEHI----WN-----D-ADA---VLF-V-KGRIRF-Y--HIS-GI------------Q----A----G-------T------PGA-----------PSVFIAY------G----KEN----------------------------------AF-ALK--N--CG-----------|
      D478_RS25245_Brevibacillus_agri_748713908                                       --------------MFTS--------------ERE----E-WETPQDFFEKLNK-E-F---G------FQLDVC-ALPT------NA-KCERYFTP------DE-----DGLK--QEW-T------G-VCWMNPPY--------------------------GR-E----I-----G-KWVKKAYES--A---KQ-G--A-----------------TVVCLL--PARTD-VKWWHDYC-----------M-KG--E-IRL-V-RGRMKF--------VG------------A----D----N-------M------APF-----------PNAVVIF------S-P--ASA-------------------------------GC----SYK--A--ID-----------|
      M655_RS0109725_Bacillus_sp_NSP21_737442515                                      --------------MFKS--------------ERE----E-WETPQEFFDKLND-E-F---G------FQLDVC-ALPT------NA-KCERYFTP------DD-----DGLH--QEW-T------G-VCWMNPPY--------------------------GR-E----I-----G-KWVKKAYES--A---KQ-G--A-----------------TVVCLL--PARTD-VKWWHDYC-----------M-KA--E-IRL-V-RGRMKF--------VG------------A----D----N-------M------APF-----------PNAVVIF------S-P--ASA-------------------------------GC----SYK--A--ID-----------|
      BTS2_RS02440_Bacillus_sp_TS-2_780117918                                         MN-Q---------AMFSS--------------STD----K-WSTPQSFYDKLNQ-E-F---Q------FDIDVC-ATDS------DK-KCERYFSP------EQ-----DGLK--QEW-T------G-ICWMNPPY--------------------------GR-G----I-----G-PWIQKAYES--S---QQ-G--A-----------------TVVCLL--PSRTD-TKWWHEYC-----------M-KG--E-IRF-I-KGRLKF--------GD------------S----K----N-------S------APF-----------PSVVVIF------R-P--KVV-------------------------------SM-------------------------|
      SBVP3_0091_Vibrio_phage_phi_3_751186426                                         ----------------MN--------------SND----E-WYTPEFIMDKVRR-V-L-G-E------IDLDPA-SNPT-AN-T-IV-RAKTYYTK------EQ-----NGLN--YPW-L------G-KVWCNPPY--------------------------SA-A--L-I-----K-KFTKYFAEE--Y---KR-G--V--------------MT-EGIMLT--NSGTD-TQWNIAL--------------QGG-V-QAY-T-NGRISF--------LQ----P--DL---T----P----K-------G------KGS-----------RGQCFTY--F---G-P--NPE-------------------------------LF-IK-VFTEDN--FC-----------|
      METEXDRAFT_RS01570_Methylobacterium_extorquens_489692296                        MG------------ETLG---I-GGHQRPRKERTD----T-WLTPPGIVRALGP--------------FDLDPCAAPDP-KP---WA-TAATHYTW---P--AQ-----DGLL--LPW-Y------G-RVWLNPPY--------------------------GR-A----L-----G-TWLAKMARH------GC-----------------------GTAFT--FARTE-TKAFFDHV----WN-----E-ADA---ILF-L-KGRVSF-H--HQD-GS------------P----A----R-------N------GGA-----------PSVLIAF------G----ADD----------------------------------VE-RLM--E--SG-----------|
      MICLODRAFT_RS13290_Microvirga_lotononidis_497160926                             MT------------LNKG---M-GGHHSA-AAMTE----T-WLTPPGIIQALGSSSS-----------FDLDPCAAPKS-RP---WD-TARNHYTW---P--EQ-----DGLR--LPW-E------G-RVWLNPPY--------------------------GR-A----M-----T-DWLKKMSRH------NK-----------------------GTALI--FARTE-TEAYHEFV----WP-----Y-ASG---LLF-L-RGRLHF-H--YPD-GR------------R----A----E------AN------SGA-----------PSVLVAY------G----EED----------------------------------VE-RLI--Q--SG-----------|
      K035_3825_Acinetobacter_baumannii_691039509                                     ---------------------------------------------QDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-----------------------------------------------------------------|
      ND2E_RS09310_Colwellia_psychrerythraea_696562339                                ----------------SD---------------------S-WYTPNIYTDMTRK-V-L-G-T------IDLDPF-SSSL-AN-E-YV-KAERYFDA------DS-----NAFK--QIW-F-K-EQ-G-TVFMNPPY--SRKLI-------------------DK-A----V-----E-IFLQNISDS--S-I-S-----------------------QAVVLV--NNATE-TKWFQSLT--R--------K-SDA---LCL-V-DKRIPF-E--SFD-GK-----------------H----S-------S------GNT-----------RGQVFLY--Y---G-V--NKK-------------------------------AF-KK-VFK--E--IG-----------|
      K035_3825_Acinetobacter_baumannii_42057_4_629017472                             ---------------------------------------------QDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------|
      CLOM621_08346_Clostridium_sp_M62/1_291074040                                    -----------------M---------------------C-WCTPPDFFAELDR-E-F---H------FELDPA-STDK------SA-KCAKHFTP------DD-----DGLK--QDW-----G--GYCVFCNPPY--------------------------GR-A----I-----A-DWVRKGYEE--S-R-KP-G--T-----------------TVVMLI--PSRTD-TAYFHDWI-----------F-GKA-SEVRF-L-RGRLKF-T--DED-GN------------G----E----D-------A------APF-----------PSAVIVW-RSPE-S-T--GRE----------------------------------FA-TWH--I---------------|
      C471_08405_Halorubrum_saccharovorum_490147912                                   ----------------------------------------------RIGRPLSW-A-V---DG-----FDLDPA-SGAE------PVPIADQRYTE------AD-----DGLA--QPW-H------G-DVFLNPPW--------------------------TS-E----DSDGTPKRRWLRKARNE--A-Q-RD-A-VD-----------------TMIVLL--PAATE-AGWFRDHM-----------W-GAP-A-LCF-VGPGRIPF--------IG------------E----D----R-------N------PSFP-----------LAIAAF------G-D--VPA-------------------------------AL-LD-VLD--S--FG-----------|
      RIV7116_RS23705_Rivularia_sp_PCC_7116_763462851                                 ----------------SD---------------------E-WYTPPHISDLVTQ-V-L---GQ-----ITLDPC-ADEG------KHIRAAQHYTV------LD-----DGLI--QEW-N------G-RIFMNPPY--------------------------SA-P----S-------VWIKKLQAE--F-E-SG-R-VT-----------------EAIALV--PAATD-TRWLSPLL-----------K-SQP---VCF-W-TGRIKF--------LD------------M----S----Y-------K------PRLSARQ-------SHCLVYW------G-G--NWE-------------------------------RF-KE-VF-------------------|
      OPIT5_RS20660_Opitutaceae_bacterium_TAV5_763429761                              ---------------MDM---------------------T-WGTPQVWFDYLHL-E-F---G------FTLDPC-CLHQ------TA-KCKKHYTP------AE-----DGLA--QSW-A---E--E-RVFMNPPY--------------------------GR-D----L-----P-KWMKKAYEE--A---RDNG--T-----------------LIVCFV--PARVD-TEWWHRYA-----------T-K-G-E-VRF-P-KGRVKF--------AD------------A----L----D-------S------APF-----------PVAVVIF------R-S--RL-----------------------------------------------------------|
      DALK_RS23730_Desulfatibacillum_alkenivorans_506429612                           ---------------MNC---------------------E-WATPQDLFDSLNK-E-F---H------FTLDPC-CTIE------NA-KCERFYTK------AE-----DGLS--QDW-T---G--E-TVFMNPPY--------------------------SRSE----M-----P-KWIQRAYES--S---LA-G--S-----------------KVVCLL--PAKTD-TRWFHDFC-----------L-K-G-E-IRF-I-KGRICF--------GS------------G----E----G-------R------APF-----------PSMVVIF------N-G--------------------AK-----------------------------------------|
      VE20213_RS09880_Clostridiales_bacterium_VE202-13_639741003                      -----------------M---------------------D-YCTPQDFFDKLNQ-E-F---H------FTLDAA-ATSK------SA-KCPQYYTP------EI-----DGIK--NPW-SIAGG--G-AVFCNPPY--------------------------GR-K----I-----G-KWVRKAYEE-S----RN-G--T-----------------TVVLLI--PARTD-TAYFHDYI-----------Y-GCA-E-IRF-V-RGRLHF--------TD------------E----DGNTYD-------R------APF-----------PSMVVIY------N----G----N-RVG---------------------------------------------------/
      consensus/100%                                                                  .......................................................................D...s.................................s.h.....h.......................................................................................................h....s.......................................................................................................................................................................................
      consensus/95%                                                                   .........................................b.pP..b...h..................lDss.s.............s...hs..............suL.....W...........hhhNPPa...........................................ah.+h...................................lhl....s.sp...ha.........................h.....ch.b...............................................s................hhhh........................................................................
      consensus/90%                                                                   .................................p.......a.TP..hhp.l.....h...........pLDss.u.............s.p.as..............suL...p.W...........hahNPPa..........................s................ah.+h..p.........s......................lhLh...s.os.s.hap..h...................h.h.h.p.Rl.F...............................................sshs.............lhha........................................................................
      consensus/85%                                                                   ...............b.................p.......a.TP..hhp.l.....h..........hsLDss.u.............s.p.as..............sGL...p.W...........hahNPPY..........................sp......h........ah.+h.pp.........s.....................hlhLl...spT-.s.aapphh...................l.a.l.c.RlpF........................................s......sshss............lhha........................................................................
      consensus/80%                                                                   ...............h.................pp......W.TP..hhc.lp....h..........hsLDss.u..p......ps..sppaao.......p......sGL...p.W...........hahNPPY..........................up......l........ah.+hhpp.........s.....................hlhLl...scT-.s.aapphh...................l.a.l.+uRlpF........s.......................p.......s......ushss...........hllha........................................................................
      consensus/75%                                                                   ...............a.................ps......W.TP..hhc.Ls....a..........hsLDss.u.sp......ss.psppaaT.......pp.....DGL...ppW...........hahNPPY..........................up......l.......pWlpKhhpp.........s.....................hlhLl..ssRTD.spaapchh...................lpF.l.+GRl+F........s..................p....s.......s......ushss...........hllla........................................................................
      consensus/70%                                                                   ...............a.................ss......W.TPpphh-bLsp...a..........hsLDsC.ussp......ss.psp+aaT.......cp.....DGLp..ppW..........plahNPPY..........................uc.p....l.......cWlpKuhpp.......p.u.....................lVhLl..PuRTD.opaapchh...........b.......lpF.l.+GRL+F........u.............s....p....s.......s......APhss...........hllla........................................................................
      
      
      Back to Contents
    • General notes, phyletic distribution and domain architectures of the Group2/clade 2/Chlorophyte type N6-MTases

      General notes:

      The Chlorophyte-type N6-MTase is distinguished by the presence of a axTP motif N-terminal to the helix before the first strand, a DP motif in strand-1, a T/D in place of the ancestral D in strand-2, a [D/N]G motif before strand -4 and an NPP[F/Y] motif in strand-4, a high conserved positively charged residue in the helix before strand-5. Interestingly, strand-3 is either absent or highly abbreviated in this family and this is typical of members of the EcoK1/M.TaqI group of N6-MTases where strand-3 and the helices around it are somewhat protracted. There are essentially two families in this clade. One where the methylase is fused to a ParB-type HTH. This far this has been detected in multiple copies in Volvox, and is also found in Chlamydomonas and Batrachochytrium. Due to the expansion in Volvox and the presence in the Chytrid, it appears ot behave like a mobile element. However, no known family of transposases are found in the vicinity. The second family which is usually found as a single copy in chlorophytes shows fusions to one or two BMB/PWWP domains at the N-terminus and a ZfCW/PHD-X domain at the C-terminus. These architectures suggest that the methylation mark is closely regulated or coordinated with histone modifications. The prokaryotic versions closest to the chlorophyte DAMs show a variety of contexts, including fusions to the ParB and ASCH domain, and association with Phage Packaging, a DpnII-restriction enzyme-like RE system and in phages/prophages, where they might serve for defensive function. Its presence in cyanobacteria suggests a likely source for the eukaryotic chlorophyte versions.
      The ParB-type HTH domain which is found in the mobile cyanobacterial family of N6-MTases belongs to the larger ParB-like HTH which is pan-bacterial and is fused to the ParB-like nuclease domain. However, this is a distinct subfamily and the eukaryotic versions is closely related to cyanobacterial versions. The prokaryotic homologs of these show a variety of interesting fusions such to the ASCH domain in 3 cyanobacteria, to ParB in Mycobacterium kansasii and Nitrolancea hollandica (Chloroflexi) (Distinct from the pan-bacterial ParB ),and to a cytosine methylase domain in cyanobacteria. Further, in cyanobacteria the domain is fused to a Tudor-like SH3 fold domain in Cyanobacteria. This tudor-like domain is likely to be involved in protein-protein interactions. In terms of gene neighborhoods, the ParB-HTH is mostly ParA associated in a phage terminase context, suggesting its role in partitioning. Note that in these systems there is no ParB nuclease domain, just the HTH. Some versions are associated with a transposase but this is not very common and could be just be background associations of mobile systems. The Eukaryotic versions appear closer to versions which associated with the ParA protein and play a role in partitioning. This suggests that in the eukaryotes, the ParB domain might recognize a specific sequence or a DNA feature.
      GI           Gene neighborhood                                                                                         Domain arch                  Pfam-architectures            Gene name           Len   Taxonomy                                          Species name                                          Genbank
      # ; Eukaryotic Chlorophyte DAM -ParB-HTH fused                                                                                                                                                                                                                                           
      302838997    <-ParB-HTH+N6-MTase*                                                                                      ParB-HTH+N6-MTase            HSP90+Dam                     VOLCADRAFT_91459    473   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_91459 [Volvox carteri f. nagariensis].                              <-302838989_?||302838783_?->302838785_?-><-302838991_?<-302838993_?<-302838995_?||302838787_?-><-302838997_ParB-HTH+N6-MTase*<-302838999_?||302838789_?->302838791_?->302838793_?-><-302839001_?||302838795_?-><-302839003_?
      302842945    <-ParB-HTH+N6-MTase*||?->?-><-?||?-><-?||?-><-Guanylate_kin                                               ParB-HTH+N6-MTase            Dam                           VOLCADRAFT_118198   357   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_118198 [Volvox carteri f. nagariensis].                             <-302842935_?<-302842937_?||302842757_?-><-302842939_?<-302842941_?<-302842943_?||302842759_?-><-302842945_ParB-HTH+N6-MTase*||302842761_?->302842763_?-><-302842947_?||302842765_?-><-302842949_?||302842767_?-><-302842951_Guanylate_kin
      302845993    <-ParB-HTH+N6-MTase*                                                                                      ParB-HTH+N6-MTase            -                             VOLCADRAFT_106408   816   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_106408 [Volvox carteri f. nagariensis].                             302845869_?->302845871_?-><-302845987_?||302845873_?-><-302845989_?||302845875_?-><-302845991_?<-302845993_ParB-HTH+N6-MTase*||302845877_?->302845879_?->302845881_?-><-302845995_?<-302845997_?<-302845999_?||302845883_?->
      302846292    <-ParB-HTH+N6-MTase*<-?<-?||?->?->?-><-METHYLASE                                                          ParB-HTH+N6-MTase            Dam                           VOLCADRAFT_106473   528   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_106473 [Volvox carteri f. nagariensis].                             302846154_?-><-302846286_?||302846156_?->302846158_?-><-302846288_?||302846160_?-><-302846290_?<-302846292_ParB-HTH+N6-MTase*<-302846294_?<-302846296_?||302846162_?->302846164_?->302846166_?-><-302846298_METHYLASE<-302846300_?
      302854263    <-ParB-HTH+N6-MTase*||?->?-><-?<-RelA                                                                     ParB-HTH+N6-MTase            Dam                           VOLCADRAFT_108225   570   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_108225 [Volvox carteri f. nagariensis].                             <-302854251_?<-302854253_?<-302854255_?<-302854257_?||302854221_?-><-302854259_?<-302854261_?<-302854263_ParB-HTH+N6-MTase*||302854223_?->302854225_?-><-302854265_?<-302854267_RelA<-302854269_?||302854227_?->302854229_?->
      302838546    <-ParB-HTH+N6-MTase*||?->?-><-?<-?<-P-kinase                                                              ParB-HTH+N6-MTase            Dam                           VOLCADRAFT_104840   490   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_104840 [Volvox carteri f. nagariensis].                             302838326_?->302838328_?-><-302838542_?||302838330_?-><-302838544_?||302838332_?->302838334_?-><-302838546_ParB-HTH+N6-MTase*||302838336_?->302838338_?-><-302838548_?<-302838550_?<-302838552_P-kinase||302838340_?-><-302838554_?
      302855015    <-N6-MTase+N6-MTase*                                                                                      N6-MTase+N6-MTase            -                             VOLCADRAFT_108410   146   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_108410 [Volvox carteri f. nagariensis].                             302854993_?->302854995_?->302854997_?-><-302855013_?||302854999_?-><-302855015_N6-MTase+N6-MTase*<-302855017_?||302855001_?-><-302855019_?<-302855021_?<-302855023_?<-302855025_?<-302855027_?
      302839284    <-ParB-HTH+N6-MTase*                                                                                      ParB-HTH+N6-MTase            Dam                           VOLCADRAFT_104970   364   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_104970 [Volvox carteri f. nagariensis].                             <-302839284_ParB-HTH+N6-MTase*<-302839286_?||302839130_?-><-302839288_?<-302839290_?<-302839292_?||302839132_?-><-302839294_?
      302855367    <-ParB-HTH+N6-MTase*                                                                                      ParB-HTH+N6-MTase            Dam                           VOLCADRAFT_100579   287   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_100579 [Volvox carteri f. nagariensis].                             302855347_?-><-302855357_?<-302855359_?<-302855361_?<-302855363_?||302855349_?-><-302855365_?<-302855367_ParB-HTH+N6-MTase*
      302838722    <-N6-MTase*                                                                                               N6-MTase                     Dam                           VOLCADRAFT_104908   495   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_104908 [Volvox carteri f. nagariensis].                             302838498_?->302838500_?-><-302838714_?<-302838716_?<-302838718_?||302838502_?-><-302838720_?<-302838722_N6-MTase*||302838504_?->
      302843631    <-ParB-HTH*                                                                                               ParB-HTH                     PAT1                          VOLCADRAFT_105875   819   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_105875 [Volvox carteri f. nagariensis].                             302843457_?-><-302843625_?<-302843627_?||302843459_?->302843461_?-><-302843629_?||302843463_?-><-302843631_ParB-HTH*||302843465_?-><-302843633_?||302843467_?-><-302843635_?||302843469_?-><-302843637_?||302843471_?->
      302839946    METHYLASE->?-><-?||?-><-?||N6-MTase+N6-MTase*->                                                           N6-MTase+N6-MTase            TM                            VOLCADRAFT_92117    1075  eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_92117 [Volvox carteri f. nagariensis].                              302839936_?->302839938_?->302839940_METHYLASE->302839942_?-><-302840106_?||302839944_?-><-302840108_?||302839946_N6-MTase+N6-MTase*-><-302840110_?||302839948_?-><-302840112_?||302839950_?-><-302840114_?||302839952_?->302839954_?->
      159472462    <-ParB-HTH+N6-MTase*||?-><-?||?-><-?||?->?-><-Guanylate_kin                                               ParB-HTH+N6-MTase            Dam                           CHLREDRAFT_191158   321   eukaryota>viridiplantae>chlorophyta               Chlamydomonas reinhardtii                             predicted protein [Chlamydomonas reinhardtii].                                                      <-159472452_?<-159472454_?||159472222_?-><-159472456_?<-159472458_?<-159472460_?||159472224_?-><-159472462_ParB-HTH+N6-MTase*||159472226_?-><-159472464_?||159472228_?-><-159472466_?||159472230_?->159472232_?-><-159472468_Guanylate_kin
      302852824    N6-MTase*->                                                                                               N6-MTase                     Nop14                         VOLCADRAFT_99082    300   eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_99082 [Volvox carteri f. nagariensis].                              <-302852874_?<-302852876_?||302852822_?-><-302852878_?||302852824_N6-MTase*-><-302852880_?<-302852882_?<-302852884_?||302852826_?-><-302852886_?<-302852888_?||302852828_?->
      575474002    APSES->?-><-MBL<-?||P-kinase-><-?||?-><-ParB-HTH+N6-MTase*                                                ParB-HTH+N6-MTase            Dam                           BATDEDRAFT_85509    496   eukaryota>fungi>chytridiomycota                   Batrachochytrium dendrobatidis JAM81                  hypothetical protein BATDEDRAFT_85509 [Batrachochytrium dendrobatidis JAM81].                       575471282_APSES->575472862_?-><-575472060_MBL<-575472062_?||575471354_P-kinase-><-575474000_?||575472864_?-><-575474002_ParB-HTH+N6-MTase*||575472064_?->575471574_?->575472666_?-><-575472066_?||575473118_C6FunFin-><-575471562_?||575472708_?->
      # ; Eukaryotic Chlorophyte DAM                                                                                                                                                                                                                                            
      760440511    BMB+PHD+N6-MTase*->                                                                                       BMB+PHD+N6-MTase+ZFCW        PHD+zf-CW                     F751_3154           516   eukaryota>viridiplantae>chlorophyta               Auxenochlorella protothecoides                        hypothetical protein F751_3154 [Auxenochlorella protothecoides].                                    760440497_?-><-760440499_?||760440501_?-><-760440503_?||760440505_?-><-760440507_?<-760440509_?||760440511_BMB+PHD+N6-MTase*-><-760440513_?||760440515_?->760440517_?->760440519_?-><-760440521_?<-760440523_?<-760440525_?
      612396523    <-FtsJ_methylase||?-><-?||?-><-?<-?<-RAMA+N6-MTase+ZFCW*<-RNase_T                                         RAMA+N6-MTase+ZFCW           Nucleoplasmin+Drf_FH1         Bathy04g03050       1310  eukaryota>viridiplantae>chlorophyta               Bathycoccus prasinos                                  predicted protein [Bathycoccus prasinos].                                                           612396033_?-><-612396669_FtsJ_methylase||612396113_?-><-612395985_?||612395879_?-><-612396147_?<-612396601_?<-612396523_N6-MTase*<-612395721_RNase_T||612396527_?-><-612396483_?||612395921_?-><-612395871_?||612396661_?->612395853_?->
      159476182    <-Histone||?->?->BMB+N6-MTase*-><-?||Histone-><-?||?-><-?||ABC-transporter->                              BMB+N6-MTase+ZFCW            PWWP+zf-CW                    CHLREDRAFT_167032   1174  eukaryota>viridiplantae>chlorophyta               Chlamydomonas reinhardtii                             hypothetical protein CHLREDRAFT_167032, partial [Chlamydomonas reinhardtii].                        <-159476714_?<-159476716_?<-159476718_?||159476176_?-><-159476720_Histone||159476178_?->159476180_?->159476182_BMB+N6-MTase*-><-159476722_?||159476184_Histone-><-159476724_?||159476186_?-><-159476726_?||159476188_ABC-transporter->159476190_?->
      552817679    <-METHYLASE||?->?-><-?||?->?->?-><-BMB+PHD+N6-MTase+ZFCW*                                                 BMB+PHD+N6-MTase+ZFCW        PHD+zf-CW                     CHLNCDRAFT_138470   865   eukaryota>viridiplantae>chlorophyta               Chlorella variabilis                                  hypothetical protein CHLNCDRAFT_138470 [Chlorella variabilis].                                      <-552817673_METHYLASE||552817310_?->552817313_?-><-552817676_?||552817317_?->552817320_?->552817323_?-><-552817679_BMB+PHD+N6-MTase+ZFCW*<-552817682_?<-552817686_?||552817327_?-><-552817690_?||552817330_?->552817333_?-><-552817694_?
      545372676    BMB+N6-MTase*->                                                                                           BMB+N6-MTase                 -                             COCSUDRAFT_83615    537   eukaryota>viridiplantae>chlorophyta               Coccomyxa subellipsoidea C-169                        hypothetical protein COCSUDRAFT_83615 [Coccomyxa subellipsoidea C-169].                             545372133_?-><-545372135_?<-545372137_?<-545372140_?||545372142_?->545372145_?->545372147_?->545372676_BMB+N6-MTase*-><-545372150_?<-545372152_?<-545372155_?||545372157_?-><-545372160_?<-545372162_?<-545372165_?
      633905054    PHD+N6-MTase+ZFCW*->                                                                                      PHD+N6-MTase+ZFCW            PHD+zf-CW                     H632_c3034p0        488   eukaryota>viridiplantae>chlorophyta               Helicosporidium sp. ATCC 50920                        hypothetical protein H632_c3034p0, partial [Helicosporidium sp. ATCC 50920].                        633905054_PHD+N6-MTase+ZFCW*->
      303277723    <-BMB+N6-MTase+ZFCW*                                                                                      BMB+N6-MTase+ZFCW            Nucleoplasmin+zf-CW           MICPUCDRAFT_57353   1004  eukaryota>viridiplantae>chlorophyta               Micromonas pusilla CCMP1545                           predicted protein [Micromonas pusilla CCMP1545].                                                    303276985_?-><-303277717_?||303276987_?-><-303277719_?||303276989_?-><-303277721_?||303276991_?-><-303277723_BMB+N6-MTase+ZFCW*<-303277725_?||303276993_?-><-303277727_?<-303277729_?||303276995_?->303276997_?->303276999_?->
      255071987    BMB+N6-MTase+ZFCW*->                                                                                      BMB+N6-MTase+ZFCW            DUF3987+zf-CW                 MICPUN_55980        1012  eukaryota>viridiplantae>chlorophyta               Micromonas sp. RCC299                                 predicted protein [Micromonas sp. RCC299].                                                          255071979_?-><-255072943_?<-255072945_?||255071981_?->255071983_?-><-255072947_?||255071985_?->255071987_BMB+N6-MTase+ZFCW*->255071989_?-><-255072949_?||255071991_?-><-255072951_?||255071993_?-><-255072953_?||255071995_?->
      145349057    <-FtsJ_methylase||?->?-><-BMB+N6-MTase*<-RNase_T||?-><-?||?-><-?||?->ABC-transporter->                    BMB+N6-MTase                 SP+PWWP+SMC_N                 OSTLU_87805         847   eukaryota>viridiplantae>chlorophyta               Ostreococcus lucimarinus CCE9901                      predicted protein [Ostreococcus lucimarinus CCE9901].                                               145348583_?-><-145349053_?||145348585_?->145348588_?-><-145349055_FtsJ_methylase||145348590_?->145348592_?-><-145349057_BMB+N6-MTase*<-145349060_RNase_T||145348594_?-><-145349062_?||145348596_?-><-145349064_?||145348598_?->145348600_ABC-transporter->
      308806169    <-ubiquitin||?->?-><-FtsJ_methylase||?->?-><-BMB+N6-MTase*<-RNase_T||?-><-?||?->?->ABC-transporter->      BMB+N6-MTase                 PWWP                          Ot07g02900          854   eukaryota>viridiplantae>chlorophyta               Ostreococcus tauri                                    Actin filament-coating protein tropomyosin (ISS) [Ostreococcus tauri].                              308806155_?-><-308806157_ubiquitin||308806159_?->308806161_?-><-308806163_FtsJ_methylase||308806165_?->308806167_?-><-308806169_BMB+N6-MTase*<-308806171_RNase_T||308806173_?-><-308806175_?||308806177_?->308806179_?->308806181_ABC-transporter-><-308806183_?
      693499233    <-ubiquitin||?->?-><-FtsJ_methylase||?->?-><-BMB+N6-MTase*<-RNase_T||?-><-?||?->?->ABC-transporter->      BMB+N6-MTase                 PWWP                          OT_ostta07g03040    1018  eukaryota>viridiplantae>chlorophyta               Ostreococcus tauri                                    Zinc finger, CW-type [Ostreococcus tauri].                                                          693499229_?-><-116058850_ubiquitin||693499230_?->116058852_?-><-693499231_FtsJ_methylase||693499232_?->116058855_?-><-693499233_BMB+N6-MTase*<-693499234_RNase_T||693499235_?-><-693499236_?||693499237_?->693499238_?->116058862_ABC-transporter-><-116058863_?
      302835622    BMB+BMB+PHD+N6-MTase+ZFCW*->                                                                              BMB+BMB+PHD+N6-MTase+ZFCW    PWWP+MSP1_C+PWWP+PHD+zf-CW    VOLCADRAFT_89771    1214  eukaryota>viridiplantae>chlorophyta               Volvox carteri f. nagariensis                         hypothetical protein VOLCADRAFT_89771 [Volvox carteri f. nagariensis].                              <-302835846_?||302835614_?-><-302835848_?||302835616_?->302835618_?-><-302835850_?||302835620_?->302835622_BMB+BMB+PHD+N6-MTase+ZFCW*-><-302835852_?||302835624_?-><-302835854_?||302835626_?-><-302835856_?||302835628_?-><-302835858_?
      ------------------------Prokaryotic homologs------------------------
      # 2;  Versions somewhat closer to the eukaryotic ones                                                                                                                                                                                                                                          
      491011364    <-Methylase<-?<-RusA<-?<-?<-?<-?<-ParB+N6-MTase*<-?<-?||Peptidase_S24->                                   ParB+N6-MTase                ParBc+Dam                     ACAty_RS09645       388   bacteria>proteobacteria>gammaproteobacteria       Acidithiobacillus caldus                              hypothetical protein [Acidithiobacillus caldus].                                                    <-491011357_Methylase<-491011359_?<-740686876_RusA<-740686878_?<-491011362_?<-740687613_?<-740686880_?<-491011364_ParB+N6-MTase*<-740686882_?<-491011365_?||740686884_Peptidase_S24-><-740686886_?||491011368_?->491011369_?->740687616_?->
      503768726    <-ArdC+MPTase<-Methylase<-?<-RusA<-?<-?<-?<-ParB+N6-MTase*<-?<-?||Peptidase_S24->                         ParB+N6-MTase                ParBc+Dam                     ATC_RS06425         388   bacteria>proteobacteria>gammaproteobacteria       Acidithiobacillus caldus                              hypothetical protein [Acidithiobacillus caldus].                                                    <-503768720_ArdC+MPTase<-503768721_Methylase<-503768722_?<-753905032_RusA<-503768724_?<-753905034_?<-753902617_?<-503768726_ParB+N6-MTase*<-503768727_?<-503768728_?||753905037_Peptidase_S24->503768729_?->753902625_?->491011368_?->503768731_?->
      # 3; Packaging associated                                                                                                                                                                                                                                           
      500114172    Terminase_LS->Phage_portal-><-?<-?||?-><-?<-N6-MTase*                                                     N6-MTase                     Dam                           SPUTW3181_RS15120   208   bacteria>proteobacteria>gammaproteobacteria       Shewanella sp. W3-18-1                                hypothetical protein [Shewanella sp. W3-18-1].                                                      <-500114165_?||500114166_Terminase_LS->500114167_Phage_portal-><-500114168_?<-500114169_?||500114170_?-><-500114171_?<-500114172_N6-MTase*<-500114173_?<-500114174_?<-752761115_?<-500114175_?<-500114176_?||500114177_?->500114178_?->
      739569226    TET-JBP->?-><-?<-?<-?<-?<-?<-N6-MTase*                                                                    N6-MTase                     Dam                           SHEWPOL2_RS06540    196   bacteria>proteobacteria>gammaproteobacteria       Shewanella sp. POL2                                   hypothetical protein, partial [Shewanella sp. POL2].                                                739569175_TET-JBP->739569177_?-><-739569179_?<-739569180_?<-739569181_?<-739569224_?<-739569183_?<-739569226_N6-MTase*<-739569228_?<-739569184_?<-739569186_?<-739569229_?<-739569187_?<-739569189_?<-739569192_?
      446980525    <-N6-MTase*                                                                                               N6-MTase                     -                             VII_RS00060         195   bacteria>proteobacteria>gammaproteobacteria       Vibrio mimicus                                        hypothetical protein [Vibrio mimicus].                                                              <-694128903_?<-446367425_?<-447182778_?<-447051858_?<-446937185_?<-694128904_?<-446915745_?<-446980525_N6-MTase*<-446925120_?<-447034144_?<-694128905_?||446144374_?->446829364_?->446123684_?->694128906_?->
      # 4; the circularly permuted methylase is of the HpaI family                                                                                                                                                                                                                                            
      694338559    <-DpnII-likeRE<-cpDAM<-N6-MTase*                                                                          N6-MTase                     SP+Dam                        ND2E_3441           184   bacteria>proteobacteria>gammaproteobacteria       Colwellia psychrerythraea                             DNA N-6-adenine-methyltransferase [Colwellia psychrerythraea].                                      694338552_?->694338553_?-><-694338554_?<-694338555_?||694338556_?-><-694338557_DpnII-likeRE<-694338558_cpDAM<-694338559_N6-MTase*
      696562339    <-DpnII-likeRE<-cpDAM<-N6-MTase*                                                                          N6-MTase                     Dam                           ND2E_RS09310        162   bacteria>proteobacteria>gammaproteobacteria       Colwellia psychrerythraea                             hypothetical protein, partial [Colwellia psychrerythraea].                                          696562293_?->696562294_?-><-696562295_?<-696562337_?||696562296_?-><-696562297_DpnII-likeRE<-696562338_cpDAM<-696562339_N6-MTase*
      # 4; phage/prophage                                                                                                                                                                                                                                            
      749448467    <-N6-MTase*<-?<-Phage_integrase                                                                           N6-MTase                     Dam                           JCM19241_5986       168   bacteria>proteobacteria>gammaproteobacteria       Vibrio sp. JCM 19241                                  modification methylase Bsp6I [Vibrio sp. JCM 19241].                                                <-749448460_?<-749448461_?<-749448462_?<-749448463_?<-749448464_?<-749448465_?<-749448466_?<-749448467_N6-MTase*<-749448468_?<-749448469_Phage_integrase||749448470_?->749448471_?->749448472_?->749448473_?->749448474_?->
      751186426    <-N6-MTase*                                                                                               N6-MTase                     Dam                           SBVP3_0091          167   viruses>dsdna viruses, no rna stage>caudovirales  Vibrio phage phi 3                                    hypothetical protein SBVP3_0091 [Vibrio phage phi 3].                                               <-751186419_?<-751186420_?<-751186421_?<-751186422_?<-751186423_?<-751186424_?<-751186425_?<-751186426_N6-MTase*<-751186427_?<-751186428_?<-751186429_?||751186430_?->751186431_?->751186432_?->751186433_?->
      # 1;  Prophage                                                                                                                                                                                                                                           
      333734957    <-N6-MTase*<-?<-?<-?||SFII-RAD3->                                                                         N6-MTase                     Dam                           TREAZ_0592          238   bacteria>spirochaetes                             Treponema azotonutricium ZAS-9                        gp44 [Treponema azotonutricium ZAS-9].                                                              <-333734241_?<-333735058_?<-333734368_?<-333735683_?<-333736204_?<-333736985_?<-333737240_?<-333734957_N6-MTase*<-333736693_?<-333737435_?<-333734288_?||333734581_SFII-RAD3-><-333735236_?<-333734247_?<-333736610_?
      # 1;                                                                                                                                                                                                                                            
      497315962    <-Phage_integrase<-?||?-><-N6-MTase*                                                                      N6-MTase                     Dam                           SYN7509_RS0224085   273   bacteria>cyanobacteria                            Synechocystis sp. PCC 7509                            DNA N-6-adenine-methyltransferase (Dam) [Synechocystis sp. PCC 7509].                               <-497315669_?<-497315670_?<-497315671_?||655839734_?-><-497315638_Phage_integrase<-740179817_?||497315960_?-><-497315962_N6-MTase*||497315963_?->497315964_?-><-655839735_?<-497315966_?<-497315967_?<-740179819_?<-740179822_?
      505099336    <-N6-MTase*                                                                                               N6-MTase                     Dam                           Metfor_2481         181   archaea>euryarchaeota                             Methanoregula formicica                               DNA N-6-adenine-methyltransferase (Dam) [Methanoregula formicica].                                  432331833_?->432331834_?->432331835_?->432331836_?->432331837_?->432331838_?-><-432331839_?<-505099336_N6-MTase*<-432331841_?<-432331842_?<-432331843_?<-432331844_?<-432331845_?<-432331846_?<-432331847_?
      490177569    <-N6-MTase*                                                                                               N6-MTase                     SP+Dam                        Metlim_0419         212   archaea>euryarchaeota                             Methanoplanus limicola                                hypothetical protein [Methanoplanus limicola].                                                      490177562_?->490177563_?-><-490177564_?<-490177565_?<-490177566_?<-490177567_?<-490177568_?<-490177569_N6-MTase*<-490177570_?<-490177571_?||490177572_?->490177573_?-><-490177574_?<-490177575_?||490177576_?->
      # 1;                                                                                                                                                                                                                                            
      427370342    N6-MTase*->                                                                                               N6-MTase                     Dam                           Riv7116_1753        519   bacteria>cyanobacteria                            Rivularia sp. PCC 7116                                DNA N-6-adenine-methyltransferase (Dam) [Rivularia sp. PCC 7116].                                   427370335_?->427370336_?->427370337_?->427370338_?-><-427370339_?<-427370340_?||427370341_?->427370342_N6-MTase*-><-427370343_?||427370344_?-><-427370345_?||427370346_?->427370347_?-><-427370348_?<-427370349_?
      # 1; Type IV system                                                                                                                                                                                                                                            
      763462851    AAA-><-?||ASCH->N6-MTase*->                                                                               N6-MTase                     Dam                           RIV7116_RS23705     144   bacteria>cyanobacteria                            Rivularia sp. PCC 7116                                hypothetical protein, partial [Rivularia sp. PCC 7116].                                             504933756_?->504933757_?->504933758_?->504933759_?->504933760_AAA-><-504933761_?||763462848_ASCH->763462851_N6-MTase*-><-504933763_?||504933764_?->504933765_?->504933766_?-><-504933768_?||763462853_?->763461708_?->
      427373349    AAA-><-?||ASCH+N6-MTase*->                                                                                ASCH+N6-MTase                Dam                           Riv7116_4895        629   bacteria>cyanobacteria                            Rivularia sp. PCC 7116                                ASCH domain-containing protein [Rivularia sp. PCC 7116].                                            427373342_?->427373343_?->427373344_?->427373345_?->427373346_?->427373347_AAA-><-427373348_?||427373349_ASCH+N6-MTase*-><-427373350_?||427373351_?->427373352_?->427373353_?->427373354_?-><-427373355_?||427373356_?->
      # 1;                                                                                                                                                                                                                                            
      186465327    ParB-HTH->?->?->?->?->?->DCM+N6-MTase*->                                                                  DCM+N6-MTase                 DNA_methylase+Dam             Npun_F2574          1180  bacteria>cyanobacteria                            Nostoc punctiforme PCC 73102                          C-5 cytosine-specific DNA methylase [Nostoc punctiforme PCC 73102].                                 186465320_?->186465321_ParB-HTH->186465322_?->186465323_?->186465324_?->186465325_?->186465326_?->186465327_DCM+N6-MTase*->186465328_?->186465329_?-><-186465330_?<-186465331_?<-186465332_?<-186465333_?<-186465334_?
      
      Back to Contents
    • Multiple sequence alignment of the Tudor-like SH3 domain associated with the ParB-like HTH domain
    • Multiple sequence alignment of the ParB-HTH domain associated with the Chlorophyte-type N6-MTases

      Alignment of eukaryotic members only.
      
      FINAL                                      -H--HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------HHHEEE-----E-------HHHHHHH--HHHHHHHHH----------HHHH-----------------------HHHHH------HHHHHH-HH---------EEEEEEEEEE--E--------------
      ALIGN                                      ----HHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH------HEEHHH-----HHH-----HHHHHHH--HHHHHHHHH-------------------------------------HHHHH------HHHHHH-HHH---------EEEEEEE-E--EE--E----------
      HMM                                        ----HHHHHHHHHHHHH----HHHHHHHHHHHHHH---HHE------EEEEEE-----E---E---HHHHHHH--HHHHHHHHH-----EE-----EE---------------EEEE------HHH------HHHHHH-HH------EEEEEEEEEEEE---E--EEEEEEE-----
      FREQ                                       -H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHH-----HH------HHHHHHH--HHHHHHHHH----------H-H-----------------HH-----HHHHH------HHHHHH-HH---------EEEEEEEHHH--H--------------
      PSSM                                       -H--HHHHHHHHHHHHHH---HHHHHHHHHHHHHH-------------HHEEH-----------------HHH--HHHHHHHHH-----------HH------------------------HHHHH------HHHHHH-HH--------EEEEEHHEE-------------------
      CHLREDRAFT_191158_Creinhardtii_159472462   RP--EDLEGCEKLIEGSSLKNNFLSIGQALVTINDRKLYKDSG-YTSFTQYIE-----QKGDFGFGPRQALRL--LAATRLVRNFPPNIALPSSERQV---------------RALVGLEQAEAIE------VWSKAT-KISQDTNTPLTHRLVESVLGK--ELPTTYRQVTRDWQD
      VOLCADRAFT_118198_Vcarteri_302842945       QP--EDLEGCEKLIEGSSLKNNFLRIGQALVTINDRKLYKRAG-SSSFTQYIE-----QKSDFGFGPRQALRL--LAATRLVRNFPPTIALPTSERQV---------------RALVGLEQQQAVK------VWVKAN-LIAQETGVPLTHRLVESVLGK--ELPASYRQAARDWQD
      VOLCADRAFT_91459_Vcarteri_302838997        EG--NALASAERRVARAA-PAYFAEASLAMLEIAEGKLYSFAG-HASFQDYIR---K-SSAVLGFGLRQARNL--IAAARVIRNLPADVARPSNERQV---------------RPLVGCHPDVQLK------VWVLALERADGDTRHEDALSLQYGGL-P--VITRSDTCEWYTPDF
      VOLCADRAFT_108225_Vcarteri_302854263       RA--RTLGEAEARVTMAA-GGFFVEASCALLDIAEQHLFEAEG-YKSFRAYIL-----ERKSLGFGYRQARAW--VAAARFIRSLPPSMPVPQYERLV---------------RPLTRCEPGVARL------AWERVL-RYHNEEGRRMTAELVVSCI-Q--EVGTASTVAQSDSED
      VOLCADRAFT_105875_Vcarteri_302843631       RA--RTLGEAEARVTMAA-GGFFVEASCALLDIAEQHLFEAEG-YKSFRAYIL-----ERKSLGFGYRQARAW--VAAARFIRSLPPSMPVPQYERLV---------------RPLTRCEPGVARL------AWERVL-RYHNEEGRRMTAELVVSCI-Q--EVGTASTVAQSDSED
      VOLCADRAFT_106408_Vcarteri_302845993       EM--ALLLRAEQVVQGS--GSQFRQMANSLLDIQERRLYSCLG-FGSFVQYVS-----ESGRIDIAPRYAQAL--VAAAHFLRLLSATDVVPNSECQV---------------RPLTALPPCDALG------AWRLAV-AHSVAESRLLSGRLVEQCA-L--EVTGRTGSDNSSSDM
      VOLCADRAFT_104970_Vcarteri_302839284       -M--ARLLRAEQLVQGC--GSQFRQMANSLLDIQERRLYSCLG-FGSFVQYVS-----ESGRIDIAPRYAQAL--VAAARFLRLLSATDVVPNSGRQV---------------RPLTALPPCDALG------AWRLAV-ARSVAESRLLGGRLVEQCA-L--EVTGRSRLLRGTGSD
      VOLCADRAFT_104840_Vcarteri_302838546       EG--NALASAERRVPMAA-LGFFAEASLALLDIAEGKLYSFAG-HASFKAYIR---R-SLAVLGFGLHQARNL--IAAARVIRNLPAGVARPSNQMQV---------------RPFVGCHPDVQLK------AWVLALGRAGGLESARVSGRLVRECL-R--EVKGSLADDVLAGSA
      BATDEDRAFT_85509_Bdendrobatidis_575474002  HL--ERLHNLEKAITEHLSTGKFFIVAAALRCIEEERLF-----YPERTVYSY-----AKSRFGFSRRTTNTY--LCSSYVYESITEDKTLPIPVNIS-----------HV--RSLHKYPPEVRRQ------IW-----KQLNDSGLTITEENVVAM-----TIKYETGVSFTELNN
      BATDEDRAFT_90358_Bdendrobatidis_575483232  EY--ARIPLEQK-------QKQFVETVTAIRAIICRKLYRDEG-YDSLQTYFL-------SKWDVSRAQVYRL--MDCWPILTTICKAHVIPYKERLC---------------RTLKQCTRSPSELVL----LWDNVI---GSCDPAFVSPKFIFDVW-D--RLQSTLHTFLDQDTE
      VOLCADRAFT_108630_Vcarteri_302856103       EM--ARLLRAEQLVQGC--GSQFRQMANSLLDIQERRLYSCLG-FGSFVQYVS-----ESGRIDIAPRYAQAL--VAAARFLRLLSATDVVPNSGRQPSHDSWVGGSLSNVHWRSQAGAVCCGVLAVTTAAATWMSAI-ASLVRESIWFEVEGVGMCA-CFSPVRVRSGSPH-----
      MVEG_03971_Mverticillata_672826234         DS--SVIRGSSKTSHQSK-PTLFETTVLAFRDIIVRRLWRSDNRFQSESEFCK-------HHWEIQRSRRDEL--IECAELLVELSRIPCRPTSESVC---------------RVLA---------------NWSA---KYQHEQPSLELGKTTVSV-----KIWTKVLDE------
      VOLCADRAFT_106473_Vcarteri_302846292       RRPMRVRGRGLAWL-------LAVVIKCWARGEHGRWNWRRWG-LPPLVHQVVVVAKDETRSEHGGLTSRPMWPMPGSRRLQVAIGRSMPVPQYERLV---------------RPLTRCEPGVARL------AWERVL-RYHNEEGRRMTAELVVSCI-Q--EVGTASTVAQSDSED
      VOLCADRAFT_100579_Vcarteri_302855367       ------------------------------------------------------------------------------------------MPNSERQV---------------RPLTALPPCDALG------AWRLAV-ARSEAESRLLSGRLVEQCA-L--EVTGRSRLLRGTGSD
      consensus/100%                             ...........................................................................................P.....s...............Rsb.................W..........p......p...........l.............
      consensus/95%                              ...........................................................................................P.....s...............Rsb.................W..........p......p...........l.............
      consensus/90%                              ......b...............h......h..b.....a........b..b...........b.........h....s..h...hs...s.P...pbs...............Rsh..h..............W..........ps.....c....s......l.............
      consensus/85%                              p.....l...b...........F...s.uh.sI...+Lap..s...sb..ah..........h.hu.p....h..hss..hh..hs.s.s.Pp..pbs...............RsLs.h.....b.......sW..s.......ps..hs.c.l..sh....pl.s.........p.
      consensus/80%                              p.....l...b...........F...s.uh.sI...+Lap..s...sb..ah..........h.hu.p....h..hss..hh..hs.s.s.Pp..pbs...............RsLs.h.....b.......sW..s.......ps..hs.c.l..sh....pl.s.........p.
      consensus/75%                              c.....l..sb..l......sbF.phs.uhbsI.pb+Lap..G.a.Sh..Yh.........phshu.pbs..h..lsuschlp.ls.s.shPp.bpbs...............RsLs.h..ss.b.......sW..s.......ps.bhs.chV.psh....cl.sp......p.p.
      consensus/70%                              c.....L..sEb.l..s...sbF.phu.ulbsI.-b+Lap..G.asSh..Yl......pp.phshu.+bsb.h..luus+hlc.ls.s.slPpsERbV...............RsLs.h.ssssb.......sW.bsl....sppu.bhoscLV.psh....cl.sps.s...s.p.
      
      Alignment with prokaryotic homologs.
      
      RES                                                                        R-P--ED-LEGCEKL---IEGSSLK---NNF-------LSIGQALVTIND-----RKLYK--D---------SG-YT--S-FTQYIE-----QKGDF-GF-GPRQALRLL--AATRLVRNF-----------------------------------------------------------PPN-I------------------------------------------------A-------LP-S-------------SERQV---------------RAL----------VG--L--EQ-AE-A---IE------VW--S-KAT-KISQD-TN-----T-PLTHRLVESVLG
      ALIGN                                                                      ------H-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-------HHH--H-------------------EHEHH--------HH-HH-HHHHHHHHH--HHHHHHHHH-----------------------------------------------------------HH------------------------------------------------------------------------------------------------------------------------------HH------HH--H-HHH-HHH--------------HHHHHH----
      HMM                                                                        -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH---------E---------------------HHHHHH-----HHHHH-----HHHHHHHH--HHHHHHHH---------------------------------------------------------------------------------------------------------------------------------------------EE---------------EEE----------E---------H--H---HH------HH--H-HHH-HH---------------HHHHHHHH--
      FREQ                                                                       -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-----HHHHH-------------------H-HHHHHH-----HHH-------HHHHHHHH--HHHHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HH-H---HH------HH--H-HHH-HHH---------------EEEEEEE--
      PSSM                                                                       -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-----H--H-----------------------HHHHH-----HH--------HHHHHHHH--HHHHHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HH------HH--H-HHH-HHH---------------HHHHHH---
      FINAL                                                                      -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-----HHHHH---------------------HHHHHH-----HHH-------HHHHHHHH--HHHHHHHHH------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HH-H---HH------HH--H-HHH-HHH---------------HHHHHHH--
      CHLREDRAFT_191158_Creinhardtii_159472462                                   R-P--ED-LEGCEKL---IEGSSLK---NNF-------LSIGQALVTIND-----RKLYK--D---------SG-YT--S-FTQYIE-----QKGDF-GF-GPRQALRLL--AATRLVRNF-----------------------------------------------------------PPN-I------------------------------------------------A-------LP-S-------------SERQV---------------RAL----------VG--L--EQ-AE-A---IE------VW--S-KAT-KISQD-TN-----T-PLTHRLVESVLG
      VOLCADRAFT_118198_Vcarteri_302842945                                       Q-P--ED-LEGCEKL---IEGSSLK---NNF-------LRIGQALVTIND-----RKLYK--R---------AG-SS--S-FTQYIE-----QKSDF-GF-GPRQALRLL--AATRLVRNF-----------------------------------------------------------PPT-I------------------------------------------------A-------LP-T-------------SERQV---------------RAL----------VG--L--EQ-QQ-A---VK------VW--V-KAN-LIAQE-TG-----V-PLTHRLVESVLG
      VOLCADRAFT_91459_Vcarteri_302838997                                        E-G--NA-LASAERR---VARAA-P---AYF-------AEASLAMLEIAE-----GKLYS--F---------AG-HA--S-FQDYIR---K-SSAVL-GF-GLRQARNLI--AAARVIRNL-----------------------------------------------------------PAD-V------------------------------------------------A-------RP-S-------------NERQV---------------RPL----------VG--C--HP-DV-Q---LK------VW--V-LALERADGD-TR-----H-EDALSLQYGGLP
      VOLCADRAFT_108225_Vcarteri_302854263                                       R-A--RT-LGEAEAR---VTMAA-G---GFF-------VEASCALLDIAE-----QHLFE--A---------EG-YK--S-FRAYIL-----ERKSL-GF-GYRQARAWV--AAARFIRSL-----------------------------------------------------------PPS-M------------------------------------------------P-------VP-Q-------------YERLV---------------RPL----------TR--C--EP-GV-A---RL------AW--E-RVL-RYHNE-EG-----R-RMTAELVVSCIQ
      VOLCADRAFT_105875_Vcarteri_302843631                                       R-A--RT-LGEAEAR---VTMAA-G---GFF-------VEASCALLDIAE-----QHLFE--A---------EG-YK--S-FRAYIL-----ERKSL-GF-GYRQARAWV--AAARFIRSL-----------------------------------------------------------PPS-M------------------------------------------------P-------VP-Q-------------YERLV---------------RPL----------TR--C--EP-GV-A---RL------AW--E-RVL-RYHNE-EG-----R-RMTAELVVSCIQ
      VOLCADRAFT_106408_Vcarteri_302845993                                       E-M--AL-LLRAEQV---VQGS--G---SQF-------RQMANSLLDIQE-----RRLYS--C---------LG-FG--S-FVQYVS-----ESGRI-DI-APRYAQALV--AAAHFLRLL-----------------------------------------------------------SAT-D------------------------------------------------V-------VP-N-------------SECQV---------------RPL----------TA--L--PP-CD-A---LG------AW--R-LAV-AHSVA-ES-----R-LLSGRLVEQCAL
      VOLCADRAFT_104970_Vcarteri_302839284                                       --M--AR-LLRAEQL---VQGC--G---SQF-------RQMANSLLDIQE-----RRLYS--C---------LG-FG--S-FVQYVS-----ESGRI-DI-APRYAQALV--AAARFLRLL-----------------------------------------------------------SAT-D------------------------------------------------V-------VP-N-------------SGRQV---------------RPL----------TA--L--PP-CD-A---LG------AW--R-LAV-ARSVA-ES-----R-LLGGRLVEQCAL
      VOLCADRAFT_104840_Vcarteri_302838546                                       E-G--NA-LASAERR---VPMAA-L---GFF-------AEASLALLDIAE-----GKLYS--F---------AG-HA--S-FKAYIR---R-SLAVL-GF-GLHQARNLI--AAARVIRNL-----------------------------------------------------------PAG-V------------------------------------------------A-------RP-S-------------NQMQV---------------RPF----------VG--C--HP-DV-Q---LK------AW--V-LALGRAGGL-ES-----A-RVSGRLVRECLR
      BATDEDRAFT_85509_Bdendrobatidis_575474002                                  H-L--ER-LHNLEKA---ITEHLST---GKF-------FIVAAALRCIEE-----ERLF----------------YP--E-RTVYSY-----AKSRF-GF-SRRTTNTYL--CSSYVYESI-----------------------------------------------------------TED-K------------------------------------------------T-------LP-I-------------PVNIS-----------HV--RSL----------HK--Y--PP-EV-R---RQ------IW--------KQLND-SG-----L-TITEENVVAMTI
      BATDEDRAFT_90358_Bdendrobatidis_575483232                                  E-Y--AR-IPLEQK----------Q---KQF-------VETVTAIRAIIC-----RKLYR--D---------EG-YD--S-LQTYFL-------SKW-DV-SRAQVYRLM--DCWPILTTI-----------------------------------------------------------CKA-H------------------------------------------------V-------IP-Y-------------KERLC---------------RTL----------KQ--C--TR-SP-S---EL---VL-LW--D-NVI---GSC-DP-----A-FVSPKFIFDVWD
      VOLCADRAFT_108630_Vcarteri_302856103                                       E-M--AR-LLRAEQL---VQGC--G---SQF-------RQMANSLLDIQE-----RRLYS--C---------LG-FG--S-FVQYVS-----ESGRI-DI-APRYAQALV--AAARFLRLL-----------------------------------------------------------SAT-D------------------------------------------------V-------VP-N-------------SGRQPSHDSWVGGSLSNVHWRSQ----------AG--A--VC-CG-V---LAVTTAAATW--M-SAI-ASLVR-ES-----I-WFEVEGVGMCAC
      MVEG_03971_Mverticillata_672826234                                         D-S--SV-IRGSSKT---SHQSK-P---TLF-------ETTVLAFRDIIV-----RRLWR--S---------DNRFQ--S-ESEFCK-------HHW-EI-QRSRRDELI--ECAELLVEL-----------------------------------------------------------SRI-P------------------------------------------------C-------RP-T-------------SESVC---------------RVL----------A------------------------NW--S-AKY--QHEQ-PS-----L-ELGKTTVSVKIW
      VOLCADRAFT_106473_Vcarteri_302846292                                       R-RPMRV-RGRGLAW---L----------LA-------VVIKCWARGEHG-----RWNWR--R---------WG-LP--P-LVHQVVVVAKDETRSE-HG-GLTSRPMWPMPGSRRLQVAI-----------------------------------------------------------GRS-M------------------------------------------------P-------VP-Q-------------YERLV---------------RPL----------TR--C--EP-GV-A---RL------AW--E-RVL-RYHNE-EG-----R-RMTAELVVSCIQ
      -_Fischerella_sp_PCC_9431_737132827                                        L-T--EE-EQCLRLH---LERKV-E---RAF-------YEAGKALRELRD-----RKLYR--S---------T--HQ--T-FEEYCR-------DRF-GY-SRRHPYLLM--EAAVIVDNL-SE--------------------------------------------------------KCD-P---------------------------------MDH------------I-------PP-T-------------SERQV---------------RPL----------TK--L--DP-DT-Q---CE------AW--Q-QAV-SEAGG--------K-VPSSRIVKDIVQ
      -_Nostoc_sp_PCC_7120_764953510                                             L-T--EQ-EQSDRLF---LKRKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HA--T-FEEYCK-------DRF-GY-NRSRSYQLI--DAAIVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------FVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EP-QE-Q---QE------AW--P-TAV-EETGG--------K-VPTGRIVKDVVQ
      -_Nostoc_punctiforme_501381481                                             L-T--EE-EQRDRLH---LERRV-E---RAF-------FEAGKALAELRD-----RRLYR--S---------S--HR--T-FEEYCR-------DRF-GH-SRRQSYLLM--DAAVIFDNL-EQ--------------------------------------------------------KCD-R---------------------------------SDH------------I-------LP-T-------------NEWQV---------------RPL----------SK--L--DP-DI-Q---PE------AW--E-QAV-ESANG--------K-VPSHRIVKDVVQ
      -_[Scytonema_hofmanni]_UTEX_B_1581_740464136                               L-S--DA-EAVELRR---LEAKV-ELGLKAF-------WEIGQALSQIRD-----KRLYR--E---------T--HK--T-FEEYCI-------TRW-EM-SRRSAYQLI--GAAIVVENV----R------------------------------------------------------NCA-Q------------------------------------------------I-------LP-L-------------NEAQA---------------RPL----------VA--L--PP-EQ-Q---RE------AW--K-TAV-STAAN--------G-KVTALHVAQVAR
      -_Chroococcales_cyanobacterium_CENA595_769921346                           L-S--DD-ELSDRHR---LELRV-E---RVF-------YEAGTALRELRD-----RKLYR--D---------T--HR--T-FEDYCK-------NRF-GY-HRRHCYQLI--DAADVVENL------C----ANS---------------------------------------------AQK-K----------------S-GTS------------GAH------------I-------LP-T-------------NEYQV---------------RPL----------TK--L--EP-AQ-Q---IM------IW--Q-QAV-ESAGG--------K-APSGRIVKSIVE
      alr7299_Nostoc_sp_PCC_7120_17135837                                        L-T--EQ-EQSDRLF---LKRKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HA--T-FEEYCK-------DRF-GY-NRSRSYQLI--DAAIVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------FVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EP-QE-Q---QE------AW--P-TAV-EETGG--------K-VPTGRIVKDVVQ
      Sta7437_4876_Stanieria_cyanosphaera_PCC_7437_428272365                     L-T--DE-EQQERLH---LERQV-E---RSF-------YVAGKALQQLRD-----RRLYR--S---------T--HS--T-FEDYCR-------ERF-GY-SRRHPYLLI--DAAIVVDNL-SQ--------------------------------------------------------KCD-P---------------------------------LDH------------I-------LP-T-------------SERQV---------------RPL----------SK--L--DR-YQ-Q---VE------VW--Q-QAV-EEAGG--------V-VPSSRIVRDLVQ
      -_Tolypothrix_campylonemoides_751574024                                    L-T--DG-ELRLRLE---LERQV-E---SAF-------YEAGKALRELRD-----KRLYR--S---------T--HK--T-FEEYCK-------DRF-GF-ERRHPYRLI--DGADIVDNL-IQ--------------------------------------------------------MCP-N---------------------------------GTQ------------I-------LP-T-------------SERQV---------------RPL----------TK--L--ER-EE-Q---RQ------AW--Q-MAL-EQAGG--------K-VPTGNIVKDIVQ
      -_Microcystis_aeruginosa_501223295                                         L-S--EE-EVRDRER---LERTV-E---RAF-------YQAGSALQELRD-----RRLYR--D---------G--YD--S-FEDYCR-------GRF-GH-SRQKANYLI--TGAAIYRTL-----------------------------------------------------------SAA-N----------------------------------CP------------L-------LP-S-------------SEYQV---------------RPL----------AV--L--TP-QQ-Q---PT------VW--N-EAV-AVAGG--------R-TPDHRIVRETVG
      -_Fischerella_sp_PCC_9431_737134277                                        L-T--LE-EQRDRLH---LERKV-E---RAF-------YEAGKALRELRD-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-GH-SRQKSNYLI--AAAGVFDNL-----------------------------------------------------------TTI-G-----CQNLPSED---L-TTN------------GSQ------------I-------LP-T-------------NERQV---------------RPL----------TQ--L--EP-DQ-Q---RE------VW--Q-QAV-TEAGG--------K-VPSGRIVKDIVQ
      -_Cylindrospermum_stagnale_505141377                                       L-T--EE-EERDRLH---LERQV-E---RAF-------YEAGKALRQLRD-----RKLYR--N---------T--HK--T-FEEYCK-------DRF-SY-NRSRSYQLI--DAAFVVDNL-E---------------------------------------------------------ECP-Q---------------------------------IVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EA-EE-Q---VT------CW--Q-EAV-ESAGG--------K-VPSGRIVKSIVD
      -_Scytonema_millei_748136445                                               L-S--ET-EAAERHR---LELRV-E---RAF-------YEAGRALRELKQ-----KKLYR--S---------T--HN--T-FEDYCI-------ERF-GF-SRRHPYRLI--EAASVFENL------C----PIG---------------------------------------------TQN-D----------------L-PTN------------ERQ------------I-------LP-T-------------SERQI---------------RDL----------VS--L--EP-QQ-Q---RE------IW--Q-SAV-LIANG--------K-VPSSRIVKGIVE
      Npun_BR102_Nostoc_punctiforme_PCC_73102_186469442                          L-T--EA-EERDRLS---LERKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HK--T-FEEYCR-------FRF-AY-TYRHVNYLI--AGSVIVDNI------------------------------K----------------------------MGT-N-----SSQNEKSH--EM-GTN------------SSQ------------I-------LP-T-------------SEVQV---------------RPL----------AK--L--EP-QQ-Q---PE------AW--Q-QAV-EQAEG--------K-VPSGRIVKDVVQ
      -_Nostoc_punctiforme_753811080                                             L-T--EA-EERDRLS---LERKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HK--T-FEEYCR-------FRF-AY-TYRHVNYLI--AGSVIVDNI------------------------------K----------------------------MGT-N-----SSQNEKSH--EM-GTN------------SSQ------------I-------LP-T-------------SEVQV---------------RPL----------AK--L--EP-QQ-Q---PE------AW--Q-QAV-EQAEG--------K-VPSGRIVKDVVQ
      CWATWH0402_1907_Crocosphaera_watsonii_WH_0402_543531309                    L-T--HS-EERDRLH---LERKV-E---RAF-------YEAGKALQELRD-----RRLYR--S---------T--HK--T-FERYCR-------ERF-GY-NRSRSYQLI--DAGMVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------IVD------------I-------FP-T-------------KESLV---------------RPL----------AS--L--NP-SQ-Q---VE------VW--T-KAV-ELVNG--------Q-VPPARVVKNIVD
      -_Scytonema_millei_748134961                                               L-T--ED-EERDRHR---LELRV-E---QAF-------YQAGAALRELKE-----RRLYR--S---------T--HS--T-FEEYCQ-------DRF-GY-HRRHSYQLI--DAAVVFENL------C----AIG---------------------------------------------AQK-N----------------A-DTR------------GAR------------I-------LP-T-------------SERQC---------------RPL----------TQ--L--EP-AQ-Q---VK------AW--Q-QAI-ELTGG--------K-APSGRTVKGIVE
      -_Pleurocapsa_sp_PCC_7319_518335686                                        L-T--TE-EEGDRLH---LERKV-E---RAF-------YEAGMALMQLRD-----RRLYR--S---------T--HA--T-FEDYCR-------DRF-DY-VRRRSYQLI--DAAKIYNNL-SE--------------------------------------------------------KCV-Q---------------------------------FVH------------I-------LP-T-------------REGQV---------------RPM----------SQ--L--NA-EE-Q---VL------AW--E-TAV-EEAGG--------K-VPTGKIVKDVVQ
      -_Anabaena_cylindrica_505030514                                            L-T--EE-EERDRFR---LERQV-E---RAF-------SAAGKALRQLRD-----RKLYR--S---------T--HK--T-FEEYCK-------DRF-SY-NRSRSYQLI--DAADVVDNL-E---------------------------------------------------------ECP-Q---------------------------------IVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EA-EE-Q---VS------CW--Q-EAV-AAVGG--------K-VPSGRIVKSIVD
      -_Stanieria_cyanosphaera_753865019                                         L-T--DE-EQQERLH---LERQV-E---RSF-------YVAGKALQQLRD-----RRLYR--S---------T--HS--T-FEDYCR-------ERF-GY-SRRHPYLLI--DAAIVVDNL-SQ--------------------------------------------------------KCD-P---------------------------------LDH------------I-------LP-T-------------SERQV---------------RPL----------SK--L--DR-YQ-Q---VE------VW--Q-QAV-EEAGG--------V-VPSSRIVRDLVQ
      -_Calothrix_sp_PCC_7103_737188140                                          L-S--YD-EEQERII---LEKQV-E---RSF-------YVAGRALRILRD-----KKLYR--N---------S--HK--N-FEEYCQ-------YKF-AF-TRRNVNYLI--ASSQVVDNL--------------------------------------------------AGTNI----EGT-E--FL-GTNCSQ-------------------------------------I-------LP-T-------------NECQV---------------RPL----------TK--L--EP-SE-Q---RE------CW--H-QAV-EAAGN--------K-VPSGRQVKDIVT
      -_Synechocystis_sp_PCC_7509_740179759                                      L-T--ED-EEKERHW---LERKV-E---LAF-------VEAGTALRRLRD-----ERLYR--S---------T--HK--T-FEAYCR-------DRF-GF-TRRRPYQLI--DAANVIENL------C----TNG---------------------------------------------TQ---------------------------------------------------I-------LP-S-------------SERQI---------------RDL----------IE--L--NP-KE-Q---CK------VW--Q-QAV-DESGG--------K-VPSGRIVKGIVE
      DA73_0201705_Tolypothrix_bouteillei_VB521301_744453553                     L-T--PE-EQKERLR---LERVI-E---RSF-------YEAGKALRELRD-----RRLYR--S---------T--HK--T-FEEYCK-------NRF-GY-NRSRSYQFI--DAATVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------FVD------------I-------FP-T-------------AESQV---------------RPL----------VP--L--ES-DQ-Q---WE------AW--Q-LSV-EAAGK--------K-VPSARIVKDIVE
      -_Scytonema_millei_748135946                                               L-S--EP-EAAERHR---LEQKV-E---RAF-------YEAALALRELHE-----RKLYR--S---------T--HS--R-FDHYCR-------DRF-GF-SQQNADLLI--RAAGVIDNL-----------------------------------------------------------KIT-T----------------------I----------GCN------------F-------XP-T-------------NERQV---------------RPL----------TK--L--EP-NE-Q---RQ------VW--Q-QAI-EAAGN--------R-VPSGRVVKDIVV
      -_Anabaena_variabilis_499635872                                            L-T--PE-EQSDRLL---LERKV-E---RAF-------FEAGKALAELRD-----RRLYR--S---------T--HR--T-FEEYCK-------DRF-SY-THRHVNYLI--AASLIVDNI------------------------------I----------------------------MGT-N-----SSQIEEAQADEM-GTN------------SSQ------------I-------FP-I-------------SEVQV---------------RPL----------SK--L--EP-QQ-Q---RK------AW--Q-DAV-QEAGD--------K-VPTGRIVKDVVQ
      -_Tolypothrix_sp_PCC_7601_797212730                                        I-S--ET-EAQELRR---LEATV-ERGLRAF-------WEIGQALRQIQD-----QRLYR--Q---------D--YK--N-FEEYCI-------TRW-EM-SRRSAYQLI--EAASVYENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------TV--L--PP-EK-Q---RE------AW--N-KAV-STAPS--------G-KVTSVHVAQVAK
      -_Tolypothrix_campylonemoides_751574204                                    L-S--EA-EAQELRK---LEATV-ERCLKAF-------WQIGQALRGIRD-----KHLYR--Q---------Q--YK--T-FEEYCI-------TRW-EM-SRRSAYQLI--EAASVYENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------VA--L--SP-EQ-Q---RE------AW--A-KAV-STAPS--------G-KVTAVHVTQVAR
      -_Fischerella_muscicola_515347403                                          L-T--EE-EKADRHR---LELKI-E---RAF-------YEAGCALKELWE-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-NY-SRDTAYLKM--AAAVVYDNI---------------QKF-----------------------------------------LPT-I-----GRQTP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANL--EP-EL-Q---AA------TW--L-QGV-EEAGG--------K-VPSGRIIKGIVE
      -_Oscillatoria_nigro-viridis_504992580                                     L-T--DA-EALELSS---LEATV-ERSLKAF-------WEIGQALRQIRD-----RRLYR--Q---------D--FS--T-FEDYCT-------NRW-EM-SRRWAYQLI--EAATVYENV----R------------------------------------------------------HGA-P------------------------------------------------I-------LP-A-------------NERQV---------------RPL----------TA--L--PS-QE-Q---PR------AW--A-QAV-STAPN--------G-KLTAFHVARVVE
      -_Tolypothrix_sp_PCC_7601_797208446                                        L-T--PE-EQSDRLH---LERKV-E---RAF-------FEAGKALAELRD-----RRLYR--S---------T--HR--T-FEDYCR-------DRF-GH-SRQQSNYLI--AAAGVYENL-----------------------------------------------------------TTI-G-----CQNVENEN---L-TTI------------CCQ------------I-------LP-T-------------NERQV---------------RPL----------TK--L--EP-QQ-Q---QE------VW--Q-QAV-EEAGG--------K-VPTGKIVKDVVQ
      -_Stanieria_cyanosphaera_505024902                                         L-T--VE-EESDRYS---LERKV-E---RAF-------YEAGMALMELRD-----RKLYR--S---------T--HA--T-FEDYCR-------DRF-DY-TRRRPYQLI--EAALIYDNL-SE--------------------------------------------------------KCV-K---------------------------------FLH------------I-------LP-T-------------KEGQV---------------QPL----------TQ--L--EW-ES-Q---PS------AW--E-TAV-EEAGG--------K-VPTGRIVKDVVR
      -_Anabaena_sp_PCC_7108_515515560                                           L-T--EE-EERDRFR---LERQV-E---RAF-------SAAGKALRELRD-----RKLYR--N---------S--HQ--T-FEEYCK-------DRF-SY-NRSRSYQLI--DAADVVDNL-E---------------------------------------------------------ECP-Q---------------------------------FVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EA-EE-Q---VS------CW--Q-EAV-EAAGG--------K-VPSGRIVKSIVD
      FDUTEX481_04373_Tolypothrix_sp_PCC_7601_407266820                          I-S--ET-EAQELRR---LEATV-ERGLRAF-------WEIGQALRQIQD-----QRLYR--Q---------D--YK--N-FEEYCI-------TRW-EM-SRRSAYQLI--EAASVYENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------TV--L--PP-EK-Q---RE------AW--N-KAV-STAPS--------G-KVTSVHVAQVAK
      BegalDRAFT_1574_Beggiatoa_alba_B18LD_386428626                             L-S--PE-EFQALAQ---HEAIV-KAGLQTF-------YDIGEALLTIRD-----KRLYR--A---------E--FN--S-FEEYCQ-------EKW-GF-VRRQADRLI--QAFEVTENL----R------------------------------------------------------PVG-L------------------------------------------------S-------MP-H-------------NEAQA---------------RPL----------VK--L--EP-EL-Q---RQ------AW--Q-KAV-EMAPD--------G-KPTSSLVKKIVK
      MICAB_900014_Microcystis_aeruginosa_PCC_9717_389714985                     L-S--EE-EVRDRER---LERTV-E---RAF-------YQAGSALQELRD-----RRLYR--D---------G--YD--S-FEDYCR-------GRF-GH-SRQKANYLI--TGAAIYRTL-----------------------------------------------------------SAA-N----------------------------------CP------------L-------LP-S-------------SEYQV---------------RPL----------AV--L--AP-QQ-Q---PT------VW--N-EAV-AEAGG--------R-TPDHRVVRETVG
      -_Calothrix_sp_PCC_7103_737188608                                          L-S--EA-EVLELES---LESTV-QRGLRAF-------WEIGQALRILRD-----KRLYR--Q---------C--YD--T-FEEYCI-------NRW-EM-SRRSAYYLI--DAAAVYENV----N------------------------------------------------------HGS-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------TA--L--TP-SE-Q---QK------VW--Q-QAV-STAPN--------G-KITATHIIQVVK
      -_Crocosphaera_watsonii_737857352                                          L-S--EA-EQSEKKR---LEGVV-S---EAV-------WNAGKALRELRD-----KKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYHNL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KSV-EIANG--------K-VPTHRIVKQVVR
      -_Chlorogloeopsis_fritschii_515383623                                      L-T--ED-EQRDRLY---LERKI-E---RAF-------FEAGKALMELRD-----HRLYR--S---------T--HK--T-FEEYCK-------DRF-GF-ERRHPYRLI--EAAVVVDNL-MQ--------------------------------------------------------MCP-NGTQIEANSNDEQKQSIG-TQIEIESSEQQMRPNGTQ------------I-------LP-T-------------SERQV---------------RPL----------TE--L--EP-SQ-Q---QE------VW--Q-TAV-QEAGG--------K-VPTGRIVKDVVQ
      -_Crocosphaera_watsonii_494519775                                          L-S--EA-EQSEKKR---LEGVV-S---EAV-------WNAGKALRELRD-----KKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYHNL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KSV-EIANG--------K-VPTHRIVKQVVR
      DA73_0214905_Tolypothrix_bouteillei_VB521301_744450902                     L-T--PE-ELRERLQ---LERKV-E---RAF-------YEAGKALMELRN-----QRLYR--S---------T--HK--T-FEEYCR-------DRF-GH-TRQKSNYLI--AAADVFENL-----------------------------------------------------------TTS-G-----CQ-----------------------------------------I-------LP-T-------------SERQI---------------RPL----------TK--L--EP-VK-Q---PE------AW--Q-LSI-EAADG--------K-SPPSRIVNDIVE
      -_Crocosphaera_watsonii_757158775                                          L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRSLSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR
      N44_02315_Microcystis_aeruginosa_NIES-44_718251661                         L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYVQ-------DRF-GM-KRAHSYRLI--DAAAVVDNL-F---------------------------------------PLCLQIGDNLSEMSPQE-MSP-N-----WRQNSTGE---K-LTN---------------------------P-------VP-T-------------NESQC---------------RPL----------TQ--L--EP-DQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ
      -_Mastigocladopsis_repens_703170672                                        L-T--DG-ELRLRLE---LERQV-E---SAF-------YEAGKALRELRD-----KRLYR--S---------T--HK--T-FEEYCK-------DRF-GF-ERRHPYRLI--DGADIVDNL-IQ--------------------------------------------------------MCP-N---------------------------------GTQ------------I-------LP-T-------------SERQV---------------RPL----------TK--L--ER-EE-Q---RQ------AW--Q-MAV-EQAGG--------K-VPTGNIVKDIVQ
      -_Crocosphaera_watsonii_546222413                                          L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR
      -_Microcystis_aeruginosa_763118968                                         L-S--EE-EVRDRER---LERTV-E---RAF-------YQAGSALQELRD-----RRLYR--D---------G--YD--S-FEDYCR-------GRF-GH-SRQKANYLI--TGAAIYRTL-----------------------------------------------------------SAA-N----------------------------------CP------------L-------LP-S-------------SEYQV---------------RPL----------AV--L--AP-QQ-Q---PT------VW--N-EAV-AEAGG--------R-TPDHRVVRETVG
      Sta7437_4607_Stanieria_cyanosphaera_PCC_7437_428272125                     L-S--LE-DERDKLK---LEREV-E---RAF-------YRAGCALKELRD-----RRLYR--S---------T--HK--T-FKEYCQ-------DRF-GF-TRRRSDYLI--GAAEVVDNL---------------------------------------------------------------S-----GEPKPKRE-------P------------LVL------------I-------LP-T-------------SERQC---------------RPL----------TK--L--EP-EQ-Q---RE------IW--R-EAV-ESSKG--------K-VPSGKVVADLVA
      Cyan7822_6833_Cyanothece_sp_PCC_7822_306986606                             L-S--AD-EEKELLR---LERVV-E---RSF-------YEAGSALRKIRA-----LRLYR--A---------R--FN--S-FEEYTQ-------ERF-GF-TRRQPYYLI--EAANVVDNL-----------------------------------------------------KS----ECE-P--LV-------------------------------H------------I-------LP-S-------------SERQV---------------RPL----------TK--L--NA-TE-Q---RS------VW--N-DAV-SRAQG--------K-VPSGRIVTEALE
      -_Crocosphaera_watsonii_546206668                                          L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRSLSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR
      CwatDRAFT_0109_Crocosphaera_watsonii_WH_8501_67852287                      L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRSLSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR
      -_Crocosphaera_watsonii_494523801                                          L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR
      -_Microcystis_aeruginosa_779871805                                         L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYVQ-------DRF-GM-KRAHSYRLI--DAAAVVDNL-F---------------------------------------PLCLQIGDNLSEMSPQE-MSP-N-----WRQNSTGE---K-LTN---------------------------P-------VP-T-------------NESQC---------------RPL----------TQ--L--EP-DQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ
      DA73_0203765_Tolypothrix_bouteillei_VB521301_744452929                     L-K--EE-ELRLRLH---LERKV-E---RSF-------YEAGKALMELRD-----KRLYR--S---------T--HK--T-FEEYCR-------DRF-SH-SRQKSNYLI--AAADVFENL-----------------------------------------------------------TTI-R-----CQNSSSED--------------------DLQ------------I-------LP-S-------------SEYQI---------------RPL----------TK--L--EP-EQ-Q---LQ------AW--Q-ISV-EEAGG--------V-APAARIVKDVVQ
      -_Cylindrospermum_stagnale_505141386                                       E-A--SA-IALELDR---LEGRI-EKGLRAF-------WEIGQSLGQIRD-----KQLYR--Q---------T--YK--T-FEEYCL-------NRW-EM-SRRSAYRLI--EAASVYENV----T------------------------------------------------------HGS-Q-IPE-NV----------------------------------THGSHFKI-------LP-A-------------NERQV---------------RPL----------AT--L--TP-EQ-Q---RQ------AW--A-KAV-STAPG--------G-KVTSGHVAQVAR
      -_Cyanothece_497232044                                                     L-T--DS-EQKERLR---LERQV-E---RAF-------YVAGCALAKLKT-----DKLYR--S---------T--HS--T-FEDYCQ-------DRF-SF-TRRHVNYLI--AAAGVVDNL--------------------------------------------------K----------------M-GTNCSQNN---------------EDA--ENL------------I-------LP-T-------------TASQC---------------RPL----------TA--L--EP-LK-Q---VE------AW--S-EAI-TQAGG--------K-VPPARIVQEVVQ
      -_Calothrix_sp_PCC_7103_518327692                                          L-T--IS-EQEERDY---LEKLV-E---RAF-------YSAGKALQTLRD-----KKLYR--S---------T--HK--S-FESYCL-------DRF-NY-NSSRSYQLM--DAADVVDNL-K---------------------------------------------------------KVP-Q---------------------------------IVE------------L-------LP-T-------------AEGQV---------------RPL----------VK--L--DF-DT-R---RE------AW--K-MAV-EEVNG--------K-VPSGRVVKDIVN
      MICAK_2860002_Microcystis_aeruginosa_PCC_9701_389882556                    L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-GY-SRRKMDYLI--SGSEVFENL-Q-----------------------------------------TRTIGSQSDRDETRT-IGS-Q-----SDRDETRT---I-GSQ---------------------------I-------LP-I-------------SERQV---------------RPL----------TQ--L--EP-EQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ
      -_Calothrix_sp_PCC_7103_737187200                                          L-T--YD-EQRERER---LERLV-E---RAF-------YQAGLALKELRD-----KRLYR--N---------T--HT--S-FDKYCK-------DRF-AY-HRSRYYQLI--NAATIVDNL-Q---------------------------------------------------------PCL-Q---------------------------------IVD------------I-------LP-T-------------AESQV---------------RPL----------VL--L--DP-DE-Q---RL------AW--T-QAV-KAASG--------K-VPSAKVVKDIVD
      MC7420_4124_Coleofasciculus_chthonoplastes_PCC_7420_196179143              L-T--DA-EIVEFRS---LEATV-EKGLRAF-------WQIGQALRQIRD-----KRLYR--Q---------D--YG--T-FEDYCL-------TRW-EI-SRRSAYQLI--EAASVVENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------IP-A-------------NERQA---------------RPL----------TA--L--KP-EQ-Q---QA------AW--A-KAV-STAPR--------G-KVTAAHVAQVAQ
      -_Crocosphaera_watsonii_494514224                                          L-T--EV-ELAEKQR---LEAVV-I---GAV-------WAAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--VGANIYENL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--ET-KD-Q---VV------AW--G-KAV-EIANG--------K-VPTHRIVKQVVR
      -_Cyanothece_sp_CCY0110_737832178                                          L-S--EA-EEVEKQR---LEAVV-S---GAV-------WAAGKALQKLRD-----KKLYR--D---------S--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------EAS-G----------------------------------CE------------V-------LP-Q-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--N-KAV-DICNG--------K-VPSHRIVKQVVR
      -_Crocosphaera_watsonii_737862397                                          L-S--EV-ELAEKQR---LEAVV-I---GAV-------WSAGFALQQLRD-----QKLYR--D---------T--HS--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-IPTHRIVKQVVR
      -_Synechocystis_sp_PCC_7509_655839534                                      L-S--ED-EEKERHR---LELKV-E---RAF-------VEAGTALRKLRD-----RRLYR--S---------T--HK--T-FEEYCS-------DRF-GF-SRRHPYRLI--DAANVVENL-EK--------------------------------------------------------FCV-Q---------------------------------FGH------------I-------LP-A-------------KEFVC---------------RPL----------TI--L--RP-DQ-Q---RE------VW--Q-EIL-QETEG--------K-HPTGKEVKSIVE
      -_Desulfococcus_multivorans_527022036                                      M-------TADRLSE---LEAII-DRNRRSF-------YVIGKALYEIRE-----NRLYR--L---------LG-FK--T-FEAYVK-------DRW-SM-GKSHAHRFI--EAYRVIENL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-E-------------NESQV---------------RPL----------VP--L--TP-LE-Q---RN------IW--R-QFL---ASG--------M-ALTAKNICRLVS
      -_Anabaena_sp_PCC_7108_515520582                                           R-G--EA-ITVELGR---LEDRI-EKGLRAF-------WDIGQSLGQIRD-----KQLYR--Q---------S--YK--T-FDEYCL-------NRW-EM-SRRSAYRLI--QAALVYENV----T------------------------------------------------------RGS-Q-SFV-NV----------------------------------THGSQNQI-------LP-T-------------NERQI---------------RPL----------VT--L--PP-EK-Q---RE------AW--A-KAV-STAPN--------G-KVTADHVAQVAR
      -_Cyanothece_sp_PCC_7424_501601085                                         L-T--PL-ELSEKEQ---LERQV-E---EAF-------FIAGEALRSLRD-----RRLYR--D---------T--HR--T-FEQYCQ-------DRF-GH-TRQKINYLI--AGAAIYSNL-----------------------------------------------------------TTA-R----------------------------------CQ------------V-------LP-A-------------GEYQV---------------RPL----------SV--L--ES-EL-Q---PE------AW--N-KAV-SLADG--------K-VPTSRIVREVVE
      -_Cyanothece_sp_PCC_7424_752567372                                         L-T--PP-ELSEKEQ---LEQQV-E---EAF-------FIAGEALRSLRD-----RRLYR--D---------T--HR--S-FEQYCQ-------DRF-GH-TRQKINYLI--AGAAIYSNL-----------------------------------------------------------TTA-R----------------------------------CQ------------V-------LP-A-------------GEYQV---------------RPL----------SV--L--ES-EL-Q---PE------AW--N-KAV-SLADG--------K-VPTSRIVREVVE
      -_Scytonema_millei_748137603                                               L-N--EV-EERDRHR---LELRV-E---RAF-------YEAGKAIKELRD-----RRLYR--S---------T--HN--NDFVGYCR-------DRF-GK-TKQAVNYLI--AAAEVYENL-T--------------------------------------------------------------------------------TTN------------CCR------------V-------LP-T-------------SEGQV---------------RSL----------SG--L--KL-EK-Q---VE------VW--Q-QAI-DLAEG--------K-VPSARIVKGIVE
      Sta7437_4542_Stanieria_cyanosphaera_PCC_7437_428272064                     L-T--AS-QTKELLR---LEKTI-E---TSF-------YLAGLALRQIQS-----KRLYR--E---------N--YR--T-FEAYCR-------NRF-DF-TRASAYYLI--KAASVVDNL-----------------------------------------------------------KCQ-Q---------------------------------FVD------------I-------LP-T-------------KESQC---------------RPL----------MS--L--PP-EK-Q---TQ------VW--L-EAI-SQAKG--------K-VPSARLVKNIVA
      -_Crocosphaera_watsonii_546220971                                          L-T--EV-ELAEKQR---LEAVV-I---GAV-------WAAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--VGANIYENL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--ET-KD-Q---VV------AW--G-KAV-EIANG--------K-VPTHRIVKQVVR
      -_Nostoc_sp_PCC_7120_499309017                                             M-T--EE-EQRDRLN---LERKV-E---RAF-------VEAGKALMELRD-----RRLYR--N---------T--HK--T-FEEYCR-------DRF-GY-SRDAAYLKM--SATNVYENI---------------QKH-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RSL----------AKAEL--EP-KV-Q---AK------VW--R-QAV-QEAKG--------K-TPSGRIVQDVID
      PCC7424_5542_Cyanothece_sp_PCC_7424_218175378                              L-T--PP-ELSEKEQ---LEQQV-E---EAF-------FIAGEALRSLRD-----RRLYR--D---------T--HR--S-FEQYCQ-------DRF-GH-TRQKINYLI--AGAAIYSNL-----------------------------------------------------------TTA-R----------------------------------CQ------------V-------LP-A-------------GEYQV---------------RPL----------SV--L--ES-EL-Q---PE------AW--N-KAV-SLADG--------K-VPTSRIVREVVE
      -_Anabaena_variabilis_499635567                                            M-T--EE-EQRDRLN---LERKV-E---RAF-------VEAGKALMELRD-----RRLYR--N---------T--HK--T-FEEYCR-------DRF-GY-SRDAAYLKM--SATNVYENI---------------QKH-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RSL----------AKAEL--EP-KV-Q---AK------VW--R-QAV-QEAKG--------K-TPSGRIVQDVID
      -_Stanieria_cyanosphaera_753864885                                         L-S--LE-DERDKLK---LEREV-E---RAF-------YRAGCALKELRD-----RRLYR--S---------T--HK--T-FKEYCQ-------DRF-GF-TRRRSDYLI--GAAEVVDNL---------------------------------------------------------------S-----GEPKPKRE-------P------------LVL------------I-------LP-T-------------SERQC---------------RPL----------TK--L--EP-EQ-Q---RE------IW--R-EAV-ESSKG--------K-VPSGKVVADLVA
      -_Chroococcales_cyanobacterium_CENA595_769922127                           L-T--ED-EEKERHR---LELKV-E---RAF-------YEAGSALRELRD-----RRLYR--S---------T--HK--T-FEAYSQ-------ERF-GM-TPRPAYYLI--AAAGVVENL-E---------------------------------------------------------MRT-N---------------------------------GSQ------------I-------LP-T-------------TERQV---------------RPL----------AN--L--EP-EE-Q---RQ------IW--Q-QAV-QEAGN--------K-VPSGRIVKDIVQ
      CWATWH0401_4234_Crocosphaera_watsonii_WH_0401_543428839                    L-T--EV-ELAEKQR---LEAIV-I---GAV-------WAAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--VGANIYENL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--ET-KD-Q---VV------AW--G-KAV-EIANG--------K-VPTHRIVKQVVR
      Sta7437_4575_Stanieria_cyanosphaera_PCC_7437_428272094                     L-S--FE-EERDRLR---LERQV-E---RAF-------YQAGIALKELRD-----RRLYR--S---------T--HE--T-FEKYCQ-------DRF-GM-QRRHPYRLI--DAAAVVDNI------LQMCPI-----------------------------------------------RTQ-N-----GSDTSDAN-------K------------TLE------------I-------IP-T-------------SEWQI---------------RSL----------TK--L--EP-RQ-Q---RE------IW--A-RAI-ELAGN--------K-VPSGKIVSELVS
      UH38_20050_Chroococcales_cyanobacterium_CENA595_768384071                  L-T--ED-EEKERHR---LELKV-E---RAF-------YEAGSALRELRD-----RRLYR--S---------T--HK--T-FEAYSQ-------ERF-GM-TPRPAYYLI--AAAGVVENL-E---------------------------------------------------------MRT-N---------------------------------GSQ------------I-------LP-T-------------TERQV---------------RPL----------AN--L--EP-EE-Q---RQ------IW--Q-QAV-QEAGN--------K-VPSGRIVKDIVQ
      -_Chroococcales_cyanobacterium_CENA595_769920071                           L-T--PE-EQRDRQR---LELGV-E---QAF-------YQAGKALAQLRE-----RRLYR--T---------T--HK--T-FEAYCQ-------DRF-GF-TRRHSDYLI--NGAKVVENL-----------------LSI---------------------------------------RTI-S-----PPNYAQGN---L-RTI------------PAQ------------I-------LP-T-------------KLEQV---------------KPL----------TS--L--EP-DQ-W---RL------AW--N-KAV-EKAHG--------K-VPSGQIVRAVVE
      -_Cyanothece_sp_PCC_8802_752568031                                         L-T--LA-EQAEKQH---LESIV-T---GAV-------WSAGLALRELRD-----LRLYR--D---------T--HA--N-FAEYCR-------ERF-GH-SRQKSDYLI--VAAKIYENL-----------------------------------------------------------SEN-H----------------------------------CQ------------V-------LP-T-------------TEFQV---------------RPL----------GG--L--EP-DL-Q---VQ------AW--Q-EAV-AIASDTSSRNAHPK-VPSNQIVKQVVR
      -_Microcystis_aeruginosa_763118064                                         L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-GY-SRRKMDYLI--SGSEVFENL-Q-----------------------------------------TRTIGSQSDRDETRT-IGS-Q-----SDRDETRT---I-GSQ---------------------------I-------LP-I-------------SERQV---------------RPL----------TQ--L--EP-EQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ
      -_Cyanothece_sp_PCC_7424_752567338                                         P-T--PQ-EEEDLQR---LEKIV-E---CSF-------LDAGLALQEINT-----RKLYR--F---------S--HK--T-FEDYCR-------DRF-GYLNRRHPYRLI--EAALVVENL------L------K---------------------------------------------KCD-Q----------------I-GHK------------KIP------------M--------P-N-------------NEAQV---------------RPL----------TQ--L--DE-EQ-Q---WE------AW--E-NAV-TESKT--------K-VPSAAFVKKSVE
      -_Nostoc_punctiforme_501381405                                             L-T--DQ-EQSLRLQ---LERQV-E---RAF-------LSAGQALMELRD-----RRLYR--S---------T--HR--T-FEEYCR-------ERF-NY-SRDAAYLKI--SATVVYENL---------------QKF-----------------------------------------LPT-I-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RFL----------AKAEL--EP-AV-Q---AD------VW--Q-QAV-EQAGN--------K-IPSGRIVKDVVD
      Anacy_5838_Anabaena_cylindrica_PCC_7122_428682367                          R-G--EA-ITVELGR---LEDRI-EKGLRAF-------WDIGQSLGQIRD-----KQLYR--Q---------S--YK--T-FDEYCL-------NRW-EM-SRRSAYRLI--QAALVYENV----T------------------------------------------------------RGS-Q-SFV-NVIHGSQN------------PETLTC--GSRSFVNVTHGSQNQI-------LP-T-------------NERQI---------------RPL----------VT--L--PP-EK-Q---RE------AW--A-KAV-STAPN--------S-KVTAAHVAQVAR
      -_Anabaena_cylindrica_755115685                                            R-G--EA-ITVELGR---LEDRI-EKGLRAF-------WDIGQSLGQIRD-----KQLYR--Q---------S--YK--T-FDEYCL-------NRW-EM-SRRSAYRLI--QAALVYENV----T------------------------------------------------------RGS-Q-SFV-NVIHGSQN------------PETLTC--GSRSFVNVTHGSQNQI-------LP-T-------------NERQI---------------RPL----------VT--L--PP-EK-Q---RE------AW--A-KAV-STAPN--------S-KVTAAHVAQVAR
      -_Cyanothece_sp_PCC_7822_754536191                                         L-S--AD-EEKELLR---LERVV-E---RSF-------YEAGSALRKIRA-----LRLYR--A---------R--FN--S-FEEYTQ-------ERF-GF-TRRQPYYLI--EAANVVDNL-----------------------------------------------------KS----ECE-P--LV-------------------------------H------------I-------LP-S-------------SERQV---------------RPL----------TK--L--NA-TE-Q---RS------VW--N-DAV-SRAQG--------K-VPSGRIVTEALE
      PCC7424_5430_Cyanothece_sp_PCC_7424_218175274                              P-T--PQ-EEEDLQR---LEKIV-E---CSF-------LDAGLALQEINT-----RKLYR--F---------S--HK--T-FEDYCR-------DRF-GYLNRRHPYRLI--EAALVVENL------L------K---------------------------------------------KCD-Q----------------I-GHK------------KIP------------M--------P-N-------------NEAQV---------------RPL----------TQ--L--DE-EQ-Q---WE------AW--E-NAV-TESKT--------K-VPSAAFVKKSVE
      -_Microcystis_aeruginosa_763120073                                         L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYVQ-------DRF-GM-KRAHSYRLI--EATGVVDNL-LA-----------------------------------KVPPMVELLGDSSDKVPP--------------------------LVE---------------------------V-------LP-T-------------NERQV---------------RPL----------IQ--L--EP-DQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDI--
      Xen7305DRAFT_00000510_Xenococcus_sp_PCC_7305_442790849                     L-T--IE-EESIRFS---LEKKV-E---RAF-------YEAGKALRELRN-----RRLYR--S---------T--HV--T-FEEYCR-------DRF-DF-TRRRPYQLI--EAAQIYDNL-ID--------------------------------------------------------KCE-P---------------------------------IVP------------V-------LP-T-------------KEGQV---------------RPL----------SE--L--TI-DE-Q---PI------AW--E-TAV-EQAGG--------K-VPTGRIVKEVVK
      -_Xenococcus_sp_PCC_7305_750617827                                         L-T--IE-EESIRFS---LEKKV-E---RAF-------YEAGKALRELRN-----RRLYR--S---------T--HV--T-FEEYCR-------DRF-DF-TRRRPYQLI--EAAQIYDNL-ID--------------------------------------------------------KCE-P---------------------------------IVP------------V-------LP-T-------------KEGQV---------------RPL----------SE--L--TI-DE-Q---PI------AW--E-TAV-EQAGG--------K-VPTGRIVKEVVK
      CY0110_32445_Cyanothece_sp_CCY0110_126620031                               L-S--EA-EEVEKQR---LEAVV-S---GAV-------WAAGKALQKLRD-----KKLYR--D---------S--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------EAS-G----------------------------------CE------------V-------LP-Q-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--N-KAV-DICNG--------K-VPSHRIVKQVVR
      Cyan8802_4571_Cyanothece_sp_PCC_8802_256592473                             L-T--LA-EQAEKQH---LESIV-T---GAV-------WSAGLALRELRD-----LRLYR--D---------T--HA--N-FAEYCR-------ERF-GH-SRQKSDYLI--VAAKIYENL-----------------------------------------------------------SEN-H----------------------------------CQ------------V-------LP-T-------------TEFQV---------------RPL----------GG--L--EP-DL-Q---VQ------AW--Q-EAV-AIASDTSSRNAHPK-VPSNQIVKQVVR
      Pse7367_3831_Pseudanabaena_sp_PCC_7367_427992361                           L-S--VA-ERQRLHK---YEQMI-R---QNI-------IEIGLALLDIQE-----SRLYR--E---------T--HA--N-FEAYAF-------EQF-GI-SKTYAYGKI--AAAKVIKNL----T------------------------------------------------------GVA-P------------------------------------------------M-------LP-Q-------------NERQC---------------RPL----------AG--L--DA-QQ-Q---RL------AW--Q-EVL---ATG--------D-RITGKLVKEIVA
      -_Crocosphaera_watsonii_737859558                                          L-T--ED-EEKEKLR---LERKV-E---RSF-------YEAGIALKLLRD-----GRYYR--N---------T--HP--S-FESYCQ-------DRF-GYRNRRHPYRLI--EAAVTIENL------L------E---------------------------------------------NCD-Q----------------F-GHI------------SSP------------I-------IP-V-------------NESQA---------------RPL----------TS--L--DDPSQ-Q---VK------AW--T-QAI-EKAGG--------K-VPPARIVKEV--
      Glo7428_4930_Gloeocapsa_sp_PCC_7428_428267400                              L-S--DS-EERERYR---LEFKV-D---RGI-------AQAWLALKELRD-----RRLYR--S---------T--HK--T-FEEYAK-------ERF-GY-NRAHAYRLI--EAAQVLENL------SPNWRQNE---------------------------------------------LQD-E----------------M-SPI------------WRQ------------K-------FP-N-------------SESQC---------------REL----------AK--L--PP-HF-Q---PI------AW--E-KVL-EASGN--------K-APTAKLIKGIVE
      -_Desulfobacterium_autotrophicum_506384528                                 --------EQDRLTR---LENLI-ARNQSHF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVINNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-N-------------NESQV---------------RPL----------AP--L--DP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID
      -_Cyanothece_sp_PCC_7822_754535993                                         L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGQALKAIRD-----KRLYR--F---------L--YA--T-FEDYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEAQCESGAQTENISDNLC---------------ANGA----QTE-T--II-SGETPVAK---------------NIT--PRQ------------I-------LP-T-------------SERQV---------------RPL----------TS--L--NP-SQ-Q---RE------AW--A-KAV-HLAKG--------K-VPSNRIVTRVAE
      -_Desulfobacterium_autotrophicum_501881616                                 --------EQDRLTR---LENLI-ARNQSHF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVINNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-N-------------NESQV---------------RPL----------AP--L--DP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID
      -_Gloeocapsa_sp_PCC_7428_754508876                                         L-S--DS-EERERYR---LEFKV-D---RGI-------AQAWLALKELRD-----RRLYR--S---------T--HK--T-FEEYAK-------ERF-GY-NRAHAYRLI--EAAQVLENL------SPNWRQNE---------------------------------------------LQD-E----------------M-SPI------------WRQ------------K-------FP-N-------------SESQC---------------REL----------AK--L--PP-HF-Q---PI------AW--E-KVL-EASGN--------K-APTAKLIKGIVE
      Cyan7822_6546_Cyanothece_sp_PCC_7822_306986431                             L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGQALKAIRD-----KRLYR--F---------L--YA--T-FEDYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEAQCESGAQTENISDNLC---------------ANGA----QTE-T--II-SGETPVAK---------------NIT--PRQ------------I-------LP-T-------------SERQV---------------RPL----------TS--L--NP-SQ-Q---RE------AW--A-KAV-HLAKG--------K-VPSNRIVTRVAE
      Syn6312_1142_Synechococcus_sp_PCC_6312_427376379                           L-S--LI-ERSDLER---LEQTI-RAGLNTF-------VEVGQALQKIRE-----QRLYR--E---------T--HQ--T-FEAYCE-------DKF-DL-RRNYADKTI--AASSFVERI----S------------------------------------------------------TIG-V------------------------------------------------I-------LP-T-------------NESQV---------------REI----------LT--L--PE-DR-Q---VE------AW--R-EVA-EAAAS----E---G-KLTADLVKTVVK
      -_Calothrix_sp_PCC_7103_737187623                                          L-T--TA-EAEEFRY---LETRV-EECLKSF-------WEIGRALARIRD-----ERLYR--E---------N--YK--T-FEEYCM-------TRW-EM-SRRSAYQLI--DAAVIYRNI----S------------------------------------------------------ENI-I-----------DD------------DVSVAY--GRQKIQ---------I-------LP-A-------------NERQI---------------RPL----------VA--L--SP-KQ-Q---QE------AW--N-QVV-STAPN--------G-KVTAVHVACVVN
      -_Desulfobacterium_autotrophicum_506384753                                 --------EQDRLTR---LENLI-ARNQGRF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVISNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-S-------------NESQV---------------RPL----------AP--L--GP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID
      -_Desulfobacterium_autotrophicum_501880589                                 --------EQDRLTR---LENLI-ARNQSRF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVINNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-N-------------NESQV---------------RPL----------AP--L--DP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID
      BegalDRAFT_1454_Beggiatoa_alba_B18LD_386428514                             M-D--ER-LDELEQV---IEKEL-----SAF-------YRVGNALAEIKE-----SRLYR--S---------KG-YE--N-FEAYCV-------EVW-GM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQAVLAV
      -_delta_proteobacterium_PSCGC_5296_654515559                               --------SNQRLVH---LESVI-KKYRQDF-------YSVGKALTEIRD-----GRYYL--K---------LS-FK--S-FESYLK-------HRW-DM-GRSQAYRLI--QAAYVIDNL----S------------------------------------------------------PIG-D------------------------------------------------V-------LP-Q-------------NEAQA---------------RAL----------NK--L--DL-FS-Q---RK------VW--R-NFL---KTQ--------K-PLSALNISKFVS
      -_delta_proteobacterium_PSCGC_5451_654517946                               --------SNQRLVH---LESVI-KKYRQDF-------YSVGKALTEIRD-----GRYYL--K---------LS-FK--S-FESYLK-------HRW-DM-GRSQAYRLI--QAAYVIDNL----S------------------------------------------------------PIG-D------------------------------------------------V-------LP-Q-------------NEAQA---------------RAL----------NK--L--DL-FS-Q---RK------VW--R-NFL---KTQ--------K-PLSALNISKFVS
      -_Cyanothece_497232068                                                     L-S--KA-EQDEKQR---LEAVI-S---GAV-------WAAGKALKELRD-----KKLYR--D---------T--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------KAG-G----------------------------------CE------------V-------LP-Q-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--D-KAV-AIESG--------K-VPRHHIVKKVVR
      -_Stanieria_cyanosphaera_753864865                                         L-T--AS-QTKELLR---LEKTI-E---TSF-------YLAGLALRQIQS-----KRLYR--E---------N--YR--T-FEAYCR-------NRF-DF-TRASAYYLI--KAASVVDNL-----------------------------------------------------------KCQ-Q---------------------------------FVD------------I-------LP-T-------------KESQC---------------RPL----------MS--L--PP-EK-Q---TQ------VW--L-EAI-SQAKG--------K-VPSARLVKNIVA
      BegalDRAFT_1483_Beggiatoa_alba_B18LD_386428542                             M-D--ER-LEELEQV---IEKEL-----SAF-------YRVGNALVEIRD-----KRLYR--L---------KG-YE--N-FEAYCV-------EVW-KM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQAVLAV
      -_Tolypothrix_sp_PCC_7601_797212629                                        L-T--TE-EWSDRIF---LERQV-E---RAF-------YAAAKALKALRD-----RRLYR--S---------T--HA--T-FEDYCR-------SRF-GF-THRHVNYLI--AGSLVVDNL------------------------------MGTNG---SQIENSDKTGTNGSQVENLDEMGT-N-----GSQIENSD--ET-GTN------------GSQ------------I-------LP-T-------------SERQV---------------RPL----------VP--L--EP-EQ-Q---RQ------AW--Q-KAV-ELAGG--------K-IPSGRIVQDIVD
      -_Beggiatoa_alba_749816531                                                 M-D--ER-LDELEQV---IEKEL-----SAF-------YRVGNALAEIKE-----SRLYR--S---------KG-YE--N-FEAYCV-------EVW-GM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQ-----
      -_Beggiatoa_alba_749816534                                                 M-D--ER-LEELEQV---IEKEL-----SAF-------YRVGNALVEIRD-----KRLYR--L---------KG-YE--N-FEAYCV-------EVW-KM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQAVLA-
      -_Coleofasciculus_chthonoplastes_763350225                                 L-T--DA-EIVEFRS---LEATV-EKGLRAF-------WQIGQALRQIRD-----KRLYR--Q---------D--YG--T-FEDYCL-------TRW-EI-SRRSAYQLI--EAASVVENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------IP-A-------------NERQA---------------RPL----------TA--L--KP-EQ-Q---QA------AW--A-KAV-STAPR--------G-KVTAAHVAQVAQ
      FDUTEX481_04300_Tolypothrix_sp_PCC_7601_407266570                          L-T--TE-EWSDRIF---LERQV-E---RAF-------YAAAKALKALRD-----RRLYR--S---------T--HA--T-FEDYCR-------SRF-GF-THRHVNYLI--AGSLVVDNL------------------------------MGTNG---SQIENSDKTGTNGSQVENLDEMGT-N-----GSQIENSD--ET-GTN------------GSQ------------I-------LP-T-------------SERQV---------------RPL----------VP--L--EP-EQ-Q---RQ------AW--Q-KAV-ELAGG--------K-IPSGRIVQDIVD
      -_Stanieria_cyanosphaera_753864872                                         L-S--FE-EERDRLR---LERQV-E---RAF-------YQAGIALKELRD-----RRLYR--S---------T--HE--T-FEKYCQ-------DRF-GM-QRRHPYRLI--DAAAVVDNI------LQMCPI-----------------------------------------------RTQ-N-----GSDTSDAN-------K------------TLE------------I-------IP-T-------------SEWQI---------------RSL----------TK--L--EP-RQ-Q---RE------IW--A-RAI-ELAGN--------K-VPSGKIVSELVS
      CWATWH0003_2674t1_Crocosphaera_watsonii_WH_0003_357263645                  L-T--ED-EEKEKLR---LERKV-E---RSF-------YEAGIALKLLRD-----GRYYR--N---------T--HP--S-FESYCQ-------DRF-GYRNRRHPYRLI--EAAVTIENL------L------E---------------------------------------------NCD-Q----------------F-GHI------------SSP------------I-------IP-V-------------NESQA---------------RPL----------TS--L--DDPSQ-Q---VK------AW--T-QAI-EKAGG--------K-VPPARIVKEV--
      -_Myxosarcina_sp_GI1_738538560                                             L-T--ES-ERQERNN---LEITV-Q---QAF-------FVAGQALKLLRD-----KRLYR--E---------T--HA--T-FEAYVR-------DRF-DY-TRRAVDYLI--LAAEVVENL-----------------------------------------------------------KRE-Q--IV------------------L----------KTN------------V-------LP-T-------------KESQC---------------RPL----------AK--L--SP-EQ-Q---RE------VW--L-TAV-EKTGG--------K-VPSARIVKEVVN
      -_Cyanothece_sp_CCY0110_495554039                                          L-S--EA-ELAQKQE---LESIV-S---SAV-------WSAGRALRELRD-----KKLYR--D---------T--HQ--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAANIYENL-----------------------------------------------------------KDS-G----------------------------------CE------------V-------LP-K-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--D-KAV-DICNG--------R-VPKHQIVKQVVR
      -_Crocosphaera_watsonii_494523812                                          ------------------MEAVV-I---GAV-------WSAGFALQQLRD-----QKLYR--D---------T--HS--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-IPTHRIVKQVVR
      -_Synechocystis_sp_PCC_7509_740179430                                      L-T--TD-EEQERHR---LELKV-E---LGF-------QEAVKALKQLRD-----KKLYR--S---------T--HQ--T-FEDYVV-------ERF-GM-QRAHAYRLI--NAAVVIENL------S----PIG----------------------------------------------D---------------------------------------------------I-------LP-I-------------TESLC---------------REV----------AK--LP-NC-AQ-Q---QK------AW--R-QTL-VGTGG--------K-MPTIKQVRGIVE
      -_Desulfobacter_postgatei_748757961                                        MTS--VDSGHKQLAH---LESLI-SSNQEDF-------CQAGRALKEIRD-----NRLYK--L---------AL-FD--T-FEAYTK-------ARW-DI-SRAHAYRLI--KYCEVIHNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-V-------------NESQV---------------RHL----------AP--L--MP-ME-Q---RR------VW--K-DFL---AGG--------S-ELTAQNIKRFIT
      -_Scytonema_millei_748135960                                               L-S--DE-EKGRLFE---LERQV-E---ESF-------YRAGIALKEIRD-----SRLYR--I---------T--HP--T-FEEYCR-------ERF-GF-ERRYPYQLI--DAAIVADNI---------------------------------RQC---------------------------------------------------------VR--DAH------------I-------FP-T-------------NEYQL---------------RPL----------AK--LKGDP-AK-Q---AE------VW--L-RAV-ERAQG--------K-QPTYEAVKETVQ
      DespoDRAFT_03587_Desulfobacter_postgatei_2ac9_389403119                    VTS--VDSGHKQLAH---LESLI-SSNQEDF-------CQAGRALKEIRD-----NRLYK--L---------AL-FD--T-FEAYTK-------ARW-DI-SRAHAYRLI--KYCEVIHNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-V-------------NESQV---------------RHL----------AP--L--MP-ME-Q---RR------VW--K-DFL---AGG--------S-ELTAQNIKRFIT
      -_Streptomyces_roseochromogenus_559034988                                  L-T--PE-EEKTFEA---CKAGM-DNLHKAF-------WIAGKSLETMAT-----GNLHR--N---------SG-HP--N-FADFVW-------VHW-EI-SESQTYRLM--DEWRIGEAL----S------------------------------------------------------QMG-W---------------------------------------------------------H-P-------------RESQV---------------RKL----------VD--I--KN-AA-G---NT-AAVA-VY--D-AVA---RTG--------K-RVTASLLEDVAR
      -_Fischerella_sp_PCC_9339_737126426                                        Q-S--SK-ELEQKIC---LLRNK-E---AKF-------YELGKVLRELRD-----KKLYA--A---------T--HK--T-FKDYCK-------S-F-GL-GNRYVYLLI--AAADVVDNL--------------------------------------AQ-------------------RCP-P----------------------------------GS------------P-------LP-T-------------SERQI---------------RPL----------LR--L--PL-EQ-Q---CM------VW--Q-EAI-ALASG--------Q-VPTCRIVEEVVQ
      -_Tolypothrix_campylonemoides_751570983                                    L-T--EE-EVADRHR---LELKI-E---RAF-------YEAGCALRELRE-----RRLYR--S---------T--HK--T-FEEYCR-------ARF-NY-SRDTAYLKI--AAAVVCDNI---------------QKF-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANF--EP-EI-Q---AA------AW--L-QGV-EEAGG--------K-VPSGRIVKGIVE
      -_Scytonema_millei_748136747                                               L-S--PQ-EERERHR---LELRV-E---RAF-------YEAGVALRELRD-----KKLYR--S---------T--HR--T-FEAYCR-------DRF-NY-SRDTAYLKI--AAAVVYENI---------------QKF-----------------------------------------LPT-N-----CRQIP----------------------------------------------MP-M-------------NEYQL---------------RAI----------AKAEL--EP-EI-Q---AS------MW--L-QGV-EEAGG--------K-SPSGRIVKGIVE
      -_Scytonema_millei_748136457                                               L-T--PE-EERERHR---LELKV-E---RAF-------IESALALRELRD-----RRLYR--D---------T--HP--NDFVGYCR-------DRF-GK-TKQAVNYLI--AALEVYENL-T--------------------------------------------------------------------------------TTI------------GCR------------I-------LP-T-------------NERQC---------------REL----------AK--L--PN-EL-Q---PQ------VW--D-AAV-EQNNG--------K-VPTSSIVKNAVE
      -_Chroococcidiopsis_thermalis_752825464                                    L-S--PD-EERERHR---LEIRV-D---RAL-------GEGWSALKQLRD-----LRLYR--S---------T--HK--T-FEEYAK-------DRF-GY-NRAHAYRLI--NAAAVLENL------S----HTD---------------------------------------------RKE-E----------------M-SPN------------WRQ------------K-------MP-S-------------SESQC---------------REL----------AK--L--PA-NK-Q---PK------AW--E-KVL-SVSGD--------K-APTAQIVKTVVE
      -_Fischerella_sp_PCC_9339_515877940                                        L-N--EE-EKADRHR---LELKI-E---RAF-------FEAGSALRELRE-----RRLYR--S---------T--HR--T-FEEYCR-------DRF-NY-SRDTAYLKI--AAAVVYDNI---------------QNF-----------------------------------------LPT-N-----GRQTP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKGNL--EP-EL-Q---AA------VW--L-QGV-EEAGG--------K-VPSGRIIKGIVE
      -_Pseudanabaena_sp_PCC_7367_754053711                                      L-S--VA-ERQRLHK---YEQMI-R---QNI-------IEIGLALLDIQE-----SRLYR--E---------T--HA--N-FEAYAF-------EQF-GI-SKTYAYGKI--AAAKVIKNL----T------------------------------------------------------GVA-P------------------------------------------------M-------LP-Q-------------NERQC---------------RPL----------AG--L--DA-QQ-Q---RL------AW--Q-EVL---ATG--------D-RITGKLVKEIVA
      Chro_5819_Chroococcidiopsis_thermalis_PCC_7203_428013042                   L-S--PD-EERERHR---LEIRV-D---RAL-------GEGWSALKQLRD-----LRLYR--S---------T--HK--T-FEEYAK-------DRF-GY-NRAHAYRLI--NAAAVLENL------S----HTD---------------------------------------------RKE-E----------------M-SPN------------WRQ------------K-------MP-S-------------SESQC---------------REL----------AK--L--PA-NK-Q---PK------AW--E-KVL-SVSGD--------K-APTAQIVKTVVE
      -_Cyanothece_497231939                                                     L-S--AA-ELSEKQR---LEAIV-V---GAV-------WAAGKALRELRD-----KKLYR--D---------T--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------EAS-G----------------------------------CE------------V-------LP-K-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--D-KAV-AISDG--------K-VPRHHIVKKVVR
      -_Nocardia_jiangxiensis_750537552                                          L-S--DG-EQNQLSA---CESSI-STLRMAF-------WAAGRALQIVRD-----GRLYR--N---------A--YP--S-FDDYVE-------QRW-DM-QRSYAHKLI--RAWPLAAKL----H------------------------------------------------------PLA----------------------------------------------------------PG-I-------------NEGQI---------------REL----------LP--V--AA-EY-G---ED-AAVT-VY----ATL-VAG-D--------V-KITAGKLREAVA
      -_[Scytonema_hofmanni]_UTEX_B_1581_657929542                               L-T--QD-EADDRHR---LELKI-E---RAF-------YEAGCALRELRE-----RRLYR--S---------T--HS--N-FEEYCR-------DRF-NY-SRDTAYLKI--AAAVVYDNI---------------QKF-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANF--EP-EV-Q---AD------AW--M-QGV-EEAGG--------K-APSGRIIKGIVE
      -_Fischerella_sp_PCC_9339_648361686                                        --------EQNVKKA---LEKNT-P---LYF-------YDAGQALSELFQ-----QKLYR--S---------S--HS--S-FDEYCL-------ERF-RL-GRSQVYRYI--YAASTYDNL----K------------F-----------------------------------------SHG-S---------------------------------NQQ-----A------L-------LP-T-------------TERQI---------------RDL----------YN--L--EP-TL-Q---RE------VW--Q-TAI-DLAVG------C---SPSSRMVKEALL
      -_Streptacidiphilus_melanogenes_755026932                                  L-S--EV-ELHDLGT---CERAV-ENLATAT-------WLAGKALQTIRD-----GKLYR--Q---------T--HR--T-FEEYVT-------ERW-EI-GERTAYQMI--EEWPLAERL----N------------------------------------------------------QAY-G--------------------------------------------------------KP-V-------------TASHI---------------RAL----------LP--V--TT-RF-G---LD-DAVE-LYQQL-RAR-AQADG--------V-RLTAQITGQIAK
      Ple7327_4170_Pleurocapsa_sp_PCC_7327_427981701                             L-N--DT-EKMRLQE---IEAIV-AQGLQTF-------YEVGQALIEIRD-----RKLYR--E---------T--HK--T-FEAYCK-------EKW-SL-TRPSAYRLL--KAAEVIKNL----S------------------------------------------------------PMG-D------------------------------------------------K-------FP-T-------------NERQV---------------RPP----------TK--L--PP-AQ-Q---LE------IW--Q-KAV-EESPN--------G-TPTAKIVERLVK
      -_Nocardia_otitidiscaviarum_748262016                                      L-S--EC-EHAQLAA---CESSI-DTLRQAF-------WSAGRALQIVRD-----GRLYR--T---------G--YA--T-FDDYVE-------QRW-DM-RRSYAHKLI--RAWPLAARL----H------------------------------------------------------RHA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AA-EH-G---DD-AAVT-VY----TTL-AAE-N--------V-KITAATLREAVA
      -_Nocardia_otitidiscaviarum_759915784                                      L-S--EC-EHAQLAA---CESSI-DTLRQAF-------WSAGRALQIVRD-----GRLYR--N---------G--YA--T-FDDYVE-------QRW-DM-RRSYAHKLI--RAWPLAARL----H------------------------------------------------------RHA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AA-EH-G---DD-AAVT-VY----TTL-AAE-N--------V-KITAATLREAVA
      -_Oscillatoria_sp_PCC_10802_516328499                                      L-N--GS-ERARLEQ---LESLI-DQEVHLF-------SQVGKALDEICD-----KRLYR--E---------T--HN--T-FQGYCQ-------DKW-GI-ARRRAYQLI--DAAQIVENL----S------------------------------------------------------ALG-A------------------------------------------------Q-------IP-T-------------SERQV---------------RPL----------TG--L--PK-DA-Q---VE------IW--Q-KAV-ALASN--------G-IPTGTAVQRLVD
      -_Oscillatoria_sp_PCC_10802_763312164                                      L-T--QP-EQIELEN---LEAQV-QRGIKAF-------WEMGEALRQIRD-----KRLYR--Q---------N--YS--S-FEKYCP-------ARW-QI-SWRSAYQLI--EAAVLMENL----R------------------------------------------------------HGA-G-------------------------------------IE---------T-------LP-A-------------NERQA---------------RPL----------TA--L--PA-EK-Q---RE------AW--V-KAV-TTAPS--------G-RITYHHVVKIAK
      -_Desulfatibacillum_aliphaticivorans_654862385                             L-T--EF-EQDRRDA---LEGII-KRNMAGF-------IAVGLALKEMLE-----SRLYR--S---------T--HP--T-WEAYIR-------DFF-EI-SRSYALRLI--DAADTVRLI----S---------------------------------NEGIDPVDFDDGRQ-------NVA-N---------------------------------WQH-----P--------------VP-A-------------NEAQV---------------RPL----------SK--L--PV-ED-R---PG------AW--F-EAL-KTAPE--------G-KITARHVSDTVK
      OMM_03956_Candidatus_Magnetoglobus_multicellularis_str_Araruama_571788483  --D--QK-RIKQLHS---FEAVI-KKQQSNF-------HVLGKTLSKIKD-----LSLYK--H---------IG-FK--S-FEDYTI-------KRL-DI-KKSQAYRMI--NASKVIENL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-Q-------------NEAQA---------------RLL----------TK--F--DT-FT-Q---QQ------LW--Q-KFL---ETG--------M-ALTTSNIRKSII
      -_Fischerella_sp_PCC_9605_737153646                                        L-S--AE-ERERLLA---LEREV-V---ESF-------LTAARALREIRD-----RRLYR--E---------S--YP--N-FEEYCE-------ARF-GY-GKRLAYYYI--DAANVADNL-----------------------------------------------------EG----S-E-Q--IV-------------------------------H------------V-------LP-T-------------SESQV---------------RPL----------KG--L--AP-DA-Q---RL------VW--S-KAV-EKAQG--------K-APSINVVRETLK
      -_Desulfobacterium_autotrophicum_506386923                                 ---------MDRLIE---LETLI-ARNQERF-------CQIGRALKAIRD-----GRLYR--Q---------AL-FD--T-FEAYAR-------TRW-DM-GRSQAYRLI--KSYEVIHNL----S------------------------------------------------------PIG-D------------------------------------------------R-------MP-A-------------NESQV---------------RPL----------AQ--L--AP-DE-Q---RK------TW--K-DFI---NSG--------V-ESSALNIRRFID
      C789_3692_Microcystis_aeruginosa_DIANCHI905_443331859                      --------ETPSLED---LERII-DRGQKAF-------YVVGTALKSIRD-----ARLYQ--H---------QQNYP--D-FDSYCR-------ERW-DM-SRRKADNFI--RASVFIDNL-----------------------------------------------------------RRN-N---------------------------------CSE--------------------LP-S-------------NESQI---------------RPL----------LS--I--KS-EE-E-Q-IE------IW--L-DII----------Q---S-APLGKITAKYVQ
      -_Streptacidiphilus_neutrinimicus_755021339                                L-S--EV-ELHDLGV---CERAV-ENLATAT-------WLAGKALQTIRD-----GKLYR--H---------T--HA--R-FEDYIT-------ERW-DI-SERAAYQMI--EEWPLAERL----N------------------------------------------------------QAY-G--------------------------------------------------------KP-V-------------TASHI---------------RAL----------LP--V--TT-RF-G---LD-AATE-LYQQL-RTR-ADADG--------V-RLTAQITGQIAK
      -_Oscillatoria_nigro-viridis_504989405                                     L-D--VT-ERARLEE---LESIV-EKGLQTF-------YEVGKALDEIRE-----QKLYR--E---------S--HK--T-FDAYCR-------EKW-GI-AKQTANRFI--AAAQVIENL----T------------------------------------------------------PMG-V------------------------------------------------K-------IP-A-------------NERQV---------------RPL----------TG--L--SP-EL-Q---LE------IW--Q-EAL-ESSPN--------G-IPSGAAVQRLVE
      -_Pleurocapsa_minor_752746526                                              L-N--DT-EKMRLQE---IEAIV-AQGLQTF-------YEVGQALIEIRD-----RKLYR--E---------T--HK--T-FEAYCK-------EKW-SL-TRPSAYRLL--KAAEVIKNL----S------------------------------------------------------PMG-D------------------------------------------------K-------FP-T-------------NERQV---------------RPP----------TK--L--PP-AQ-Q---LE------IW--Q-KAV-EESPN--------G-TPTAKIVERLVK
      -_Chlorogloeopsis_fritschii_515385753                                      L-T--EE-EKADRHR---LELKI-E---RAF-------YEAGCALRELRE-----RRLYR--S---------T--HR--T-FEEYCR-------DRF-NY-SRDTAYLKI--AAAVVYDNI---------------QKF-----------------------------------------LPT-I-----GRQTP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANF--EP-EL-Q---AA------AW--L-QGI-EEAGG--------K-VPSGRIIKGIVE
      -_Pleurocapsa_sp_PCC_7319_738911651                                        L-S--SP-ELDLRSQ---LENQV-R---SAF-------YTAGMALTQLKE-----LRLYR--S---------T--HL--S-FEEFCQ-------DVF-GY-SRDYAYLKM--TAAQVYQNL------LDN--------------------------------------------------LPT-N-----GRQVP----------------------------------------------LP-T-------------RQRQL---------------RPI----------IKAKL--KD-DV-Q---VQ------VW--Q-EAV-DLAHN--------Q-VPTSSIVAQAVR
      IPF_3218_Microcystis_aeruginosa_PCC_7806_159026604                         --------ETPSLED---LERII-DRGQKAF-------YVVGTALKSIRD-----ARLYQ--H---------QQNYP--D-FDSYCR-------ERW-DM-SRRKADNFI--RASVFIDNL-----------------------------------------------------------RRN-N---------------------------------CSE--------------------LP-S-------------NESQI---------------RPL----------LS--I--KS-EE-E-Q-IE------IW--L-DII----------Q---S-APLGKITAKYVQ
      -_Fischerella_sp_PCC_9605_652339044                                        L-N--SS-IKETFTA---IDRFE---------------WQAIDEILQMRE-----EQIYR--E---------VG-YK--T-FEEYCQ-REL---YAW-G--GYRRINQLL--GAKKVIDAV---------------------------------------G-------------------ELG-E----------------------H-------------------------I----------K-------------NERQA---------------RPL----------LH--L-----VK-E---PE------KL--K-TAV-AIALK-EN-----P-SPSESDFAAAAQ
      Emtol_0315_Emticicia_oligotrophica_DSM_17448_387857486                     L-S--NE-ENERLTI---CEEVI-DKGLKTF-------IEVGNALFEIRN-----NKLYR--G---------S--FT--T-FEAYCK-------ERW-QL-KRQRAYELM--GAAEVVNQL----S----------------------ENNLS---------------------------EIS-D---------------------------------KSN------------L-------LP-T-------------KESHA---------------NAL----------TQ--I--PV-TL-R---FQ------VW--R-AVV-EESLT-TK-----K-PITAKMIVEQTE
      -_Streptomyces_sp_CNQ865_654253752                                         L-S--AQ-EQQDREA---CEAGV-TNLATAF-------WVAGKSLETLEQ-----AKLYR--E---------T--HP--N-FAEYVW-------ERW-EI-SESHLHRLK--AEWRIGEKL----S------------------------------------------------------EFG-Y---------------------------------------------------------R-P-------------REAQV---------------REL----------LP--V--AE-QH-G---PD-AAIR-IY--D-TVA---RQA--------P-RVTAKLLQQAAA
      -_Synechococcus_sp_PCC_6312_752791755                                      L-S--LI-ERSDLER---LEQTI-RAGLNTF-------VEVGQALQKIRE-----QRLYR--E---------T--HQ--T-FEAYCE-------DKF-DL-RRNYADKTI--AASSFVERI----S------------------------------------------------------TIG-V------------------------------------------------I-------LP-T-------------NESQV---------------REI----------LT--L--PE-DR-Q---VE------AW--R-EVA-EAAAS----E---G-KLTADLVKTVVK
      -_Nocardia_seriolae_696559281                                              L-S--RS-ESEQLEV---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDEYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PS-I-------------NEGQI---------------REL----------LP--V--VA-AH-G---EE-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA
      -_Streptosporangium_roseum_740087909                                       M-D--DG-EQADLAA---CEAAI-DTLRIAF-------WAAGKALQVIRD-----GRLYR--A---------T--HS--T-FEEYTI-------DRW-EM-SRTQADRLI--RAWPLAERL----A------------------------------------------------------PIG----------------------------------------------------------VKII-------------NESQV---------------REL----------VP--L--AE-QH-G---QD-AAAV-VY----QTI-VEADG--------V-RVTA--------
      -_Nocardia_asiatica_760034517                                              L-S--ER-ERAQLTA---CESSI-DTLRIAF-------WAAGRALQIVRD-----GRLYR--D---------S--HE--T-FDEYVE-------QRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PA-I-------------NEGQV---------------REL----------LP--V--AA-VY-G---ED-AAVT-VY----TTV-AAGAE--------V-KVTAGKLRQAIA
      -_Nocardia_sp_CNY236_738617228                                             L-S--DR-ERAQLTA---CESSI-DTLRIAF-------WAAGRALQIVRD-----GRLYR--D---------T--HE--T-FDQYVE-------QRW-DM-QRSYAHKLI--RAWPLADRL----H------------------------------------------------------PMA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AV-EH-G---DD-AAVT-VY----TTI-ADGIG--------T-RVTAARLREAVA
      -_Streptacidiphilus_melanogenes_755027016                                  L-S--EV-ELHDLGV---CERAV-ENLATAT-------WLAGKALQSIRD-----GKLYR--H---------T--HA--R-FEDYVT-------ERW-EI-SERAAYQMI--EEWPLAERL----N------------------------------------------------------QAY-G--------------------------------------------------------KP-V-------------TASHI---------------RAL----------LP--V--TT-RF-G---LD-AATE-LYQQL-RTR-AQADG--------V-RLTAQITGQIAK
      -_Myxosarcina_sp_GI1_738538439                                             L-S--AE-ELELRAT---LEHQV-T---SSF-------HTSGMALAKLNE-----LRLYR--N---------T--HS--N-FEEFCL-------DVF-GY-SSDYAYLKM--AAARIYQNL------SDN--------------------------------------------------LPT-N-----GRHFP----------------------------------------------LP-T-------------RQRQL---------------RPI----------VKAKL--DK-DA-Q---LE------VW--L-DAI-ALAEG--------K-IPSYAIVAEAVR
      I546_4173_Mycobacterium_kansasii_732_576415619                             M-N--PA-EARALTQ---HETVI-ERGIKTF-------IAVGTALAAIRD-----QRLYR--E---------R--YA--T-FENYCH-------MRW-GL-SRSRAYRLI--DAANVVDSM----S------------------------------------------------------PIG-D------------------------------------------------T-------VP-A-------------TESQA---------------REL----------MG--L--TP-TQ---A-AT------VM--R-VAH-EQTSG--------K-ITAAAIRAARSR
      GM3708_3465_Geminocystis_sp_NIES-3708_770470161                            --------KYNQLEQ---ITNSI-KYNKISY-------IKLGMQLYQVKY-----YRLYK--N---------N--YK--S-FKDYCE-------KAV-YY-PVWRANQVI--ESSSVAIQL----I------------------------------------------------------KAG----FN--------------------------------------------I-------IP-Q-------------NEAQA---------------RLL----------IK--L--NE-EE---L-IR------KW--Q-EVL-DTYEV------H-K-ITANRIENIVFG
      -_Nocardia_concava_750531062                                               L-S--RS-ESEQLEV---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDDYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PS-I-------------NEGQI---------------REL----------LP--V--AA-AH-G---EE-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA
      -_Opitutaceae_bacterium_TAV1_494604192                                     L-T--SD-ERRVLRC---CKKAV-QAAMNNL-------MEAAPLLREVKE-----KRLYR--E---------G--YA--S-FEEFCR-------AEF-SM-DRTHAYRLI--AAANVVEAI----D------------------------------------------------------EAR-S-----KARYGQPD---T-GED-------VAL--GDT------------V-LP----LP-T-------------NERQA---------------RPL----------AQ--L--PA-AD-Q---PK------AW--K-KAV-NMAGG--------K-QPTGKQVAAAVK
      -_Myxosarcina_sp_GI1_738540713                                             L-S--ES-EKAERDN---LERTV-Q---QAS-------FSAWNALKILRD-----KRLYR--E---------T--HA--T-FESYVR-------DRF-GF-TRRSADYFI--SAAKIVENL----K---------------------------------AENSFPFPQNKKSEL------KRE-Q--FV------------------L----------KTN------------V-------LP-T-------------KESQC---------------RSL----------AK--L--SP-EE-Q---RQ------AW--G-RAV-ELAGN--------K-VPSSRLVKEAVR
      -_Fischerella_sp_PCC_9339_737126563                                        L-S--KE-EKKLLER---LEQQV-K---DSF-------LAAAHALREINE-----KRLYR--E---------T--HK--T-FDSYCE-------ERF-GF-KRRQAYHYI--EGAKVTDAL-Q---------------------------------------------------------QSA-R---------------------------------TVH------------I-------LP-A-------------NEYQI---------------RPL----------AS--LK-EP-EK-Q---IE------AW--E-RAV-EHAGG--------K-LPTHELVKKTVQ
      -_Streptomyces_aurantiacus_514922043                                       L-T--AE-EREALDA---CKAGL-NNLHNAF-------WIAGKSLETMQT-----GNLHR--N---------EG-IG--S-FAEYVW-------INW-EI-SESQMHRLI--GEWRIGEQL----A------------------------------------------------------QLG-H---------------------------------------------------------R-P-------------RESQV---------------REL----------AD--I--KQ-AA-G---DR-AAVA-VY--D-AVV---RAG--------Q-RVTARLLKDVSR
      -_Nocardia_abscessus_760001072                                             L-S--DR-ERVQLTA---CESSI-DTLRIAF-------WAAGRALQIVRD-----GRLYR--D---------S--HE--T-FDEYVE-------QRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PA-I-------------NEGQV---------------REL----------LP--I--AA-EY-G---ED-AAVT-VY----TTI-AAGAD--------V-KVTAGKLRQAIA
      -_Streptomyces_canus_703383829                                             L-S--EL-ERETLEA---CKAGM-NNLHNAF-------WVAGKSLETMAV-----GNLHR--N---------EG-FA--N-FAEFVW-------TNW-EI-SESQVYRLM--DGWRIGESL----S------------------------------------------------------QLG-H---------------------------------------------------------R-P-------------RESQV---------------REL----------TD--I--KR-TA-G---DE-AAVA-VY--D-AVA---RSD--------K-RVTARLLARVAR
      -_Streptomyces_sp_NRRL_F-5123_759461293                                    L-S--DQ-EQQDLAA---CKAGV-DNLRNAF-------WIAGKSLETLRT-----AELHR--G---------E--NP--N-FAEWVW-------DTW-EI-SETQLYRLM--DEWRVGEAL----A------------------------------------------------------NLG-H---------------------------------------------------------K-P-------------LEGQV---------------RKL----------TE--V--RR-QT-N---DK-IAIT-VY--D-TIA---RCT--------E-RVTGKLVETVVD
      -_Nocardia_niigatensis_750579664                                           L-S--RT-ETDQLEL---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDEYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PLA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AA-AY-G---ED-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA
      Dalk_3579_Desulfatibacillum_alkenivorans_AK-01_218762801                   L-S--KK-EQDRRRE---LEGLV-MKNMAAF-------LEMGAALAEIQR-----DRLYR--S---------T--HR--N-FEAYVR-------DVF-EI-GKSYAHRQI--AGYQVVENI----R---------------------------------SA-------------------MAP-D-----GKNVA------------N----------WRQ------------I-------LP-A-------------NEAQV---------------RPL----------TL--L--DD-PE-E-Q-VE------AW--K-HAV-KIGED-SK-R---G-KVTARHVAQAVG
      -_Myxosarcina_sp_GI1_738539870                                             L-E--QL-ERTIEKG---LKVLR-----QTF-------FEVGLALAEVKK-----RELYL--A---------KG-YS--S-YTSYCA-------GEW-KI-NKTYAYDLL--KAAEVVVNL-I--P------------------------------------------------------QLE-S----------------F-PQS-------FSA--IAE-----K------L-DYP---LP-R-------------NESQC---------------REI----------AK--L--KT-AE-L-Q-RQ------AW--Q-EIL-ESDEN----------KITAKQIRHTVA
      -_Prauserella_rugosa_738995333                                             L-T--VA-EADQLAD---LEAVI-AQGLQTF-------VRVGQALLTIRD-----NRLYR--K---------T--HE--T-FEEYCR-------ERW-EM-TKDSANRVI--RAAEVVEVM----S------------------------------------------------------PIG-L------------------------------------------------T--------P-A-------------TESQA---------------REL----------AP--LKDDP-DA-M---RA------VW--E-TAN-E-RTD--------G-KPTAKVIRECRE
      -_Deinococcus_sp_2009_760136477                                            L-A--PH-EVQRLHS---LEATV-RDGLRDF-------QRTGQALSEIRD-----NELFR--A---------T--HD--T-FEAYLE-------ERW-GF-TPTQADRII--EANEVTKVL----E------------------------------------------------------PLG--------------------------------------------------I-------AP-I-------------SERQA---------------RAF----------KG-----AA----K-----------IL----TEL--------------E-PEQRRLVARLAQ
      -_Deinococcus_ficus_760094872                                              L-A--PH-EVQRLHN---LEATV-RDGLRDF-------QRTGQALSEIRD-----NELFR--A---------T--HD--T-FEAYLE-------ERW-DF-TPSQADRII--EANEVTKVL----E------------------------------------------------------PLG--------------------------------------------------I-------AP-I-------------SERQA---------------RAF----------KG-----AA----K-----------IL----TEL--------------E-PEQRRLVARLAQ
      -_Myxosarcina_sp_GI1_738540774                                             L-S--PQ-EQQLRDK---LEQQV----LTGF-------VLRGQALRTIKR-----LRLYR--D---------S--YD--N-FESYCE-------DVF-GF-SMLYIERCM--RAAETYYQI----V----------EYL-----------------------------------------KTQ-G------------------------------------------------L-KEA---LP-N-------------KQKQL---------------RPI----------FQAHL--SP-IE-A---GE------VW--V-MAV-DIALG--------K-VPSYSMVKTAVK
      -_Deinococcus_radiodurans_499190814                                        L-A--PH-EQQRLDD---LEQTV-EGGLRDF-------QRTGQALSEIRD-----NELYR--A---------T--HD--S-FEAYLQ-------DRW-GF-GVRQADRLI--DAAQVAKQL----E------------------------------------------------------PLG--------------------------------------------------I-------SP-R-------------HEAQA---------------RSF----------RP-----AA----R-----------IV----EEL--------------E-PEQQRLVARLVE
      NS07_v2contig00189-0005_Nocardia_seriolae_749286507                        L-S--RS-ESEQLEV---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDEYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PS-I-------------NEGQS---------------REL----------LP--V--VA-AH-G---EE-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA
      -_Streptomyces_vietnamensis_751920075                                      L-S--PQ-EEADFEA---CKAGV-RNLQNAF-------WVAGKSLETIKT-----GNLQR--R---------V--HA--N-FATFVW-------EEF-EI-SEPQMHRLV--EEWRVGQAL----S------------------------------------------------------QLG-W---------------------------------------------------------K-P-------------KESQV---------------REL----------TG--I--TK-EA-G---DQ-TAVT-VY--D-TIA---RNV--------K-RVTAQVIRDVVA
      -_Streptacidiphilus_melanogenes_755027075                                  L-S--TV-ELHDLGV---CERAV-DNLATAT-------WLAGKALQSIRD-----GKLYR--E---------T--HR--T-FEEYVT-------ERW-EI-GERTAYQMI--EEWPLAERL----N------------------------------------------------------QAL-G--------------------------------------------------------KP-A-------------TASHT---------------RAL----------LP--V--VA-RF-GADGLD-AAAG-LYEEL-RDR-AQAEG--------V-RVTAALTGQIVK
      -_Cyanothece_sp_PCC_7822_503099618                                         L-------ELQIQEG---LRLSM-----QGF-------YLIGSALRQLKA-----LKLHR--N---------S--HL--R-FDEYAK-------ERF-KI-SKRYQHYLI--NAVEVIDVF-----------------------------------------------------------SND-K----------------------Q----------YIA-L----------I-NGQ---IP-E-------------REFHC---------------RQL----------LK--L--GN----Q---PD-IWKK-AW--Y-KSV-MLASG--------E-TPTAKIVGEVVK
      -_Deinococcus_516480931                                                    L-A--PH-EEQRFQA---LEQTV-EGGLRDF-------QRTGQALAEIRD-----NHLFR--E---------T--HA--D-FETYLR-------DRW-GF-NLRQADRII--DAAVVARQL----E------------------------------------------------------PLG--------------------------------------------------I-------EP-R-------------HERQA---------------STF----------KP-----AV----K-----------II----GAL--------------E-PEQQRLISRLVE
      -_Deinococcus_radiodurans_736351733                                        L-A--PH-EQQRLDD---LEQTV-EGGLRDF-------QRTGQALSEIRD-----NELYR--A---------T--HD--S-FEAYLQ-------DRW-GF-GVRQADRLI--DAAQVAKQL----E------------------------------------------------------PLG--------------------------------------------------I-------SP-R-------------HEAQA---------------RSF----------RP-----AA----R-----------IV----EEL--------------E-PEQQRLVARLVE
      -_Crocosphaera_watsonii_494523440                                          L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD
      -_Crocosphaera_watsonii_737861903                                          L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD
      -_Crocosphaera_watsonii_494523440                                          L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD
      -_Crocosphaera_watsonii_737861903                                          L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD
      CWATWH0003_2673b1_Crocosphaera_watsonii_WH_0003_357263649                  L-T--YE-EERDRLH---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYENL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDELPTSLTTNGTQNPEVE-MTT-N-----GRQTEMAK---M-TTN------------GRQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--A-LAV-EQAGG--------K-VPSGRIVKSIVS
      -_Crocosphaera_watsonii_546230520                                          L-T--YE-EERDRLH---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYENL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDELPTSLTTNGTQNPEVE-MTT-N-----GRQTEMAK---M-TTN------------GRQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--A-LAV-EQAGG--------K-VPSGRIVKSIVS
      -_Crocosphaera_watsonii_737859551                                          L-T--YE-EERDRLH---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYENL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDELPTSLTTNGTQNPEVE-MTT-N-----GRQTEMAK---M-TTN------------GRQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--A-LAV-EQAGG--------K-VPSGRIVKSIVS
      CWATWH0402_1321_Crocosphaera_watsonii_WH_0402_543538779                    L-T--HE-EERDRLH---LEGQV-E---RAF-------FSAGIALQELRD-----RRLYR--S---------T--HK--T-FEDYCQ-------ERF-GY-SRRKMDYLI--AGSEVYQNLLLPSEMRTNCSQTD---------LPDDQSQMRTNGSQITESDDNLQMRTNCSQNADDE-MRT-N-----CSQNADGK---T-RTN------------CSQ------------I-------LP-T-------------REAQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD
      -_Streptomyces_sp_CT34_759552136                                           L-S--AE-EEDLLHL---CMRGI-EQFQNAW-------WVMAKSLANINA-----RRLYR--K---------T--HA--N-FEDFCW-------DNF-KK-SRPTAYEEM--TAYAMGELL----S-A----------------------------------------------------RAD-K----------------------P---------------------------------FD-E-------------NSNEV-----------SA--RAD--------T-PA--I-----GK-K-V-AS------AY----NPI-TKDYG----------AEVSVAVHETIE
      -_uncultured_Mediterranean_phage_uvMED_787066260                           L-------ETEISAA---YADRL-----------HQD-LAIGKALTQIFR-----RRLYR--G-----KDG----GR--D-WETWLT-ECS---AKF----TQGRGPLTK--KPALYLRGF---------------YQF-----------------------------------------RCE----VL------------L-KGS-G----------RSP-----D------I-----P-LP-A-------------SPYQV---------------RPL--LA----Q-LE--T--HP-EA-A---VD------MW--K-SAC-ADAAR-EK-V-G-K-VPSYEQVQRAAL
      -_Borrelia_bissettii_503783569                                             L-------KDKLKTL---TTDDI-----------YNK-IETAKVLNTINQ-----KKLYI--------LDG----YK--N-FYSFLA--------DF-KI-AKSQAYKYI-KIVSGVEKGI-I--D----------YNF-----------------------------------------IAN-N-----GIEKTIKQ---L---E------------SNN------------V-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNG----------------KFTGFLLEELLE
      -_Borrelia_burgdorferi_695263564                                           L-------KDKLKTL---TTDDI-----------YNK-IETAKILNTINQ-----KKLYI--------LDG----YK--N-FYSFLA--------DF-KI-AKSQAYKYI-KIVSGVEKGI-I--D----------YNF-----------------------------------------IAN-N-----GIEKTIKQ---L---E------------SNN------------V-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNG----------------KFTGFLLEELLE
      -_Borrelia_garinii_696415789                                               L-------KDKLKTL---TTDDI-----------YNK-IETAKILNTINQ-----KKLYI--------LDG----YK--N-FYSFLA--------NF-KI-AKSQAYKYI-KIVSGVEQGI-I--D----------YNF-----------------------------------------IAN-N-----GIEKAIKQ---L---E------------GSN------------I-------IK-K-----------S-NQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS
      OY14_04355_Borrelia_chilensis_741043351                                    L-------KEKLKVL---IKEES-----------YNK-IETARILKEIND-----NKYYI--------VDG----YK--N-FSHFLK--------DY-NM-AKTSVYRYI-KIAVGIDSGK-I--D----------YEL-----------------------------------------ILK-K-----GIYYAMQI---L---E------------NNN------------I-------TI-N-----------P-KAILN---------------RSF--------K-LK--I--EE-EE-I---FN------FY--K-SNT----------------SFVSFLLKELYH
      -_Borrelia_miyamotoi_764988637                                             L-------KEKLKIL---IKEES-----------YNK-IETARILKEINE-----SKYYA--------LDG----YK--S-FTAFIK--------SY-KI-AKTSVYRYI-KLVSGIDSGK-I--D----------YDL-----------------------------------------ILN-R-----GIDYAIKI---L---E------------SNN------------I-------IS-K-----------S-NVNPL---------------RPL--------R-FQ--L--DD-EE-C---FH------FY--K-SNT----------------KFASFLLKEIFK
      BHO_0016000_Borrelia_hermsii_YBT_576092807                                 L-------KEKLKIL---IKEES-----------YNK-IETARILKEINE-----SKYYA--------LDG----YK--S-FTAFIK--------SY-KI-AKTSIYRYI-KLVTGIDSGK-I--D----------YDL-----------------------------------------ILS-R-----GVDYAIKV---L---E------------NNS------------I-------IS-K-----------S-NVNPL---------------RPL--------R-FQ--L--DD-EE-S---FH------FY--K-SNT----------------KFASFLLKEIYK
      I871_B18_Borrelia_miyamotoi_LB-2001_736012165                              L-------KEKLKIL---IKEES-----------YNK-IETARILKEINE-----SKYYA--------LDG----YK--S-FTAFIK--------SY-KI-AKTSVYRYI-KLVSGIDSGK-I--D----------YDL-----------------------------------------ILN-R-----GIDYAIKI---L---E------------SNN------------I-------IS-K-----------S-NVNPL---------------RPL--------R-FQ--L--DD-EE-C---FH------FY--K-SNT----------------KFASFLLKEIFK
      -_Borrelia_duttonii_740581845                                              L-------KDRLKSL---VVDDI-----------YNK-IETAKILSLINE-----KKLYI--------FDG----YK--S-FYGFLA--------DF-KI-AKSQAYKYI-KIASGMEQGV-I--D----------YDF-----------------------------------------IIN-N-----GIENTIKK---L---G------------SKN------------I-------VK-K-----------S-KHNLT---------------KQL--------C-FE--F--KS-QD-S---YD------FY--K-RDT----------------KFMCFVLDELFI
      -_Borrelia_crocidurae_504496248                                            L-------KDRLKSL---VVDDI-----------YNK-IETAKILSLINE-----KKLYI--------FDG----YK--S-FYGFLA--------DF-KI-AKSQAYKYI-KIVSGMAQGI-I--D----------YDF-----------------------------------------IIN-N-----GIENTIKK---L---G------------SKN------------I-------VK-K-----------S-KHNLT---------------KQL--------C-FQ--F--KS-QD-S---YD------FY--K-RDT----------------KFMCFVLDELFI
      -_Borrelia_hispanica_639482667                                             L-------KDRLKSL---VVDDI-----------YNK-IETAKILSLINE-----KKLYI--------FDG----YK--S-FYGFLA--------DF-KI-AKSQAYKYI-KIASGMAQGI-I--D----------YDF-----------------------------------------IIN-N-----GIENTIKK---L---G------------SKN------------S-------IK-K-----------S-KHNLT---------------KQL--------C-FQ--F--KN-QD-S---YD------FY--K-SDT----------------KFVCFVLDELFI
      -_Borrelia_persica_639480863                                               C-------KHSLRKS---MVNDI-----------ENK-IQIMEILYNVRK-----KKLYR--------FDH----HA--T-FDAFIK--------AF-GI-GKTQAYLYL-KIYEQILKGT-L--T----------VKE-----------------------------------------IRE-K-----GLIEIYRN---I-KLK------------EIS------------A-------KK-S-------------RQNLI---------------KPL--------R-FQ--L--KD-HK-S---YD------FY--K-KRS----------------KFAAFILDKLFL
      AM1_B0079_Acaryochloris_marina_MBIC11017_158310190                         M-S-----VNDLSEC---VEAGL-NAGAYAI-------AQAGLALREIQR-----RGEYP--T---------T--------FEAFVK-------DKF-AL-TRARAYQLM--YAADIIADL----A----------SVF-----------------------------------------ESN-K--------------------------------------------------------LP-R-------------SESAV---------------RPM----------IG--L--TK-QQ-R---IE------VW--R-RAL-KGKQR----------SPGYGTVKAIVE
      -_Acaryochloris_marina_753958401                                           M-S-----VNDLSEC---VEAGL-NAGAYAI-------AQAGLALREIQR-----RGEYP--T---------T--------FEAFVK-------DKF-AL-TRARAYQLM--YAADIIADL----A----------SVF-----------------------------------------ESN-K--------------------------------------------------------LP-R-------------SESAV---------------RPM----------IG--L--TK-QQ-R---IE------VW--R-RAL-KGKQR----------SPGYGTVKAIVE
      -_Scytonema_millei_748136693                                               L-T--PE-EERERHR---LELRV-E---RAF-------FEAGKALRELRE-----RRLYR--S---------T--HK--S-WEAYCQ-------ERF-GF-GRDSADIKI--SASRVVEEI---------------REY-----------------------------------------LPT-N-----RRQI-----------------------------------------------LP-T-------------TLEQV---------------RPL----------LKLKA--SS-E--R---IE------AW--L-KAI-DTNHG--------R-IPNGRIVKGIVK
      Strvi_0238_Streptomyces_violaceusniger_Tu_4113_344043288                   L-S--PD-EAEDLRQ---CERAF-ANADEAE-------WMRGKAAHAVRD-----RRLYR---------------PR--T-WPDYCE-------EVL-GE-SESEVNRMI--QEWPIGAMI----T----------QIW-----------------------------------------VTP-R------------------------------------------------P-------TP-A-------------SHRRA-----L---------LPL--VDL-----YG--L--EA-TA-R---GY-VLLR-TW--------AAENN--------E-RVTATVLTAMVD
      -_Streptomyces_violaceusniger_759522371                                    L-S--PD-EAEDLRQ---CERAF-ANADEAE-------WMRGKAAHAVRD-----RRLYR---------------PR--T-WPDYCE-------EVL-GE-SESEVNRMI--QEWPIGAMI----T----------QIW-----------------------------------------VTP-R------------------------------------------------P-------TP-A-------------SHRRA-----L---------LPL--VDL-----YG--L--EA-TA-R---GY-VLLR-TW--------AAENN--------E-RVTATVLTAMVD
      -_Reyranella_massiliensis_522187926                                        V-------TEAEAHR---LAAEI-A---EAS-D-FDA-FRLGGLLARIHR-----ERWYR--------GAG----YP--D-FRSYVE-------ARH-GF-KLRKALYLA-----AIYESV-I--D----------LGL-----------------------------------------TWQ-E----------------L---------------------------------------RP-V-------------GWSKL---------------KEL----------VG--V--VD-RD-N---AR------DW----LAI-AAAEG-----------MTVLKLHALVQ
      -_Cyanothece_sp_PCC_7822_754535969                                         L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGLALKTIRD-----KRLYR--F---------L--YA--T-FEEYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEPQYEPGAQTENPFDHLCA-IGAQIENTFDHLSANGT----QTQ-T--II-SDETPAAK---------------SLA--SRQ------------I-------LP-T-------------SERQV---------------RPL----------IS--L--NP-SQ-Q---RE------AW--V-KAV-NLAQG--------K-VPSNRIVSLVAD
      Cyan7822_6496_Cyanothece_sp_PCC_7822_306986392                             L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGLALKTIRD-----KRLYR--F---------L--YA--T-FEEYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEPQYEPGAQTENPFDHLCA-IGAQIENTFDHLSANGT----QTQ-T--II-SDETPAAK---------------SLA--SRQ------------I-------LP-T-------------SERQV---------------RPL----------IS--L--NP-SQ-Q---RE------AW--V-KAV-NLAQG--------K-VPSNRIVSLVAD
      -_Lachnospiraceae_bacterium_10-1_510895729                                 I----EI-IKDESFR---VQKSF---------------VKIGWYLKHIRD-----NELFK--------EDG----YA--S-IWECAA-------DQL-GY-SQATASRFI-----NICEKF----S----------KNH---------------------------------------------N---------------------------------SPE------------L-----D-VK----YAGF-------DKSQM---------------IEM----------LP--M--EP-EQ-----LE------------KVV--------------P-EMTVKQIRDIKT
      BDCR2A_01333_Borrelia_duttonii_CR2A_576313683                              L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV
      -_Borrelia_valaisiana_501894927                                            L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIQEGI-L--E----------EVY-----------------------------------------VIE-N-----GVSKAIAV---L-R-E------------SPS------------G-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KT-KE-S---YD------FY--K-SNV----------------KFTGFMMHEIFE
      -_Borrelia_duttonii_740581624                                              L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV
      -_Borrelia_hispanica_639482723                                             L-------KKQLKSN---FKNEV-----------YNR-IETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTIIL---L-RNS------------DSN------------L-------VK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNS----------------KLASFILDELFA
      BCD_1877_Borrelia_crocidurae_DOU_576102765                                 L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV
      -_Borrelia_crocidurae_644981026                                            L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV
      -_Borrelia_bissettii_503783476                                             L-------KNKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--E----------ETY-----------------------------------------IIE-N-----GLTMSLLS---I-RDK------------ESS------------S-------FK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YA------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_finlandensis_501928340                                          L-------KNKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--E----------ETY-----------------------------------------IIE-N-----GLTMSLLS---I-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_persica_639480210                                               L-------KQKLKFN---FQQEV-----------YYK-IESIKVLKEIKD-----NEYYK--------FDG----YR--T-FEDFIK--------EY-KL-ARSQVYDYL-KIATAIENGI-L--E----------ESY-----------------------------------------VVE-N-----GITRTIAF---L-R-T------------TTS------------K-------LK-K-----------S-KRNLI---------------KPL--------R-FQ--L--KN-QK-S---YD------YY--K-KNA----------------KLTGFILDRLFL
      -_Borrelia_hermsii_645010627                                               L-------KQKLKPN---FQQEI-----------YYK-MEAIKILKEIKD-----NEYYK--------LDG----YR--I-LEDFIK--------DY-KL-ARSQAYDYL-KIATALENGI-L--D----------ESY-----------------------------------------VVE-N-----GITQAIAF---L-R-T------------TSN------------K-------LK-K-----------S-KRNLI---------------KPL--------R-FQ--L--KS-QE-S---YN------FY--K-KNA----------------RFTGFILDILFS
      -_Borrelia_bissettii_503783755                                             L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIKDGI-L--E----------ESY-----------------------------------------VIE-N-----GVTKTLEF---L-R-K------------SPN------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFMLDKLFS
      -_Borrelia_hermsii_644979647                                               L-------KQKLKSN---FQQEI-----------YYK-MEAIKILKEIKD-----NEYYK--------LDG----YR--I-FEDFIK--------DY-KL-ARSQAYDYL-KIATALENGI-L--D----------ESY-----------------------------------------VVE-N-----GITQAIAF---L-R-T------------TSN------------K-------LK-K-----------S-KRNLI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-KNA----------------RFTGFILDILFS
      -_Borrelia_duttonii_501531293                                              L-------KQKLKSH---CQQEI-----------YYK-MEIIKILKEIKD-----NEYYK--------LDN----YK--T-FEDFIR--------DY-KL-ARSQVYDYL-KIANAIENGI-L--Q----------ESY-----------------------------------------VVE-N-----GITHTIAF---L-R-S------------DSG------------L-------FK-K-----------KLRRNAL---------------KPL--------K-FQ--L--KK-QE-S---YN------FY--K-KNV----------------KFAEFLLDTLFL
      -_Borrelia_crocidurae_504499910                                            L-------KQKLKSH---CQQEI-----------YYK-MEIIKILKEVKD-----NEYYK--------LDN----YK--T-FEDFIR--------DY-KL-ARSQVYDYL-KIANAIENGI-L--Q----------ESY-----------------------------------------VVE-N-----GITHTIAF---L-R-S------------DSR------------L-------FK-K-----------KLRRDSL---------------KPL--------K-FQ--L--KK-HE-S---YN------FY--K-KNV----------------KFAEFLLDTLFL
      -_Borrelia_hermsii_644979702                                               L-------KSKLIIN---FKSEI-----------CSR-IETMKVLKEIKD-----NEYYK--------LDG----YK--N-FEDFTK--------DY-KL-AKSQAYDYL-KVAGAIEEGI-I--E----------ESF-----------------------------------------LIE-N-----GFRQTLYV---L-RNS------------DSN------------T-------LN-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-KNA----------------KFTSFLMDNLFE
      -_Borrelia_hermsii_645063163                                               L-------KSKLIIN---FKSEI-----------CSR-IETMKVLKEIKD-----NEYYK--------LDG----YK--N-FEDFTK--------DY-KL-AKSQAYDYL-KVAGAIEEGI-I--E----------ESF-----------------------------------------LIE-N-----GFRQTLYV---L-RNS------------DSN------------T-------LN-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-KNA----------------KFTSFLMDNLFE
      -_Borrelia_hermsii_645010701                                               L-------KDKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YN--S-FNSFAK--------NY-KI-ARTQVYDYL-RLANAMEEGL-L--E----------ERF-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------EGV------------N-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-QN-S---YA------FY--K-SNA----------------KFTSFLMDELFE
      -_Borrelia_hermsii_644979468                                               L-------KDKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YN--S-FNSFAK--------NY-KI-ARTQVYDYL-RLANAMEEGL-L--E----------ERF-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------EGV------------N-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-QN-S---YA------FY--K-SNA----------------KFTSFLMDELFE
      -_Borrelia_duttonii_752506021                                              L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------R-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFAGFLMDALFK
      -_Borrelia_hermsii_645010853                                               L-------KEKLKQN---ARKEI-----------YYK-VESIRILKEIKD-----NGYYK--------LDG----HK--N-FDSFIK--------SY-RM-AKTQVYAYL-RLANAIEEGM-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---IKKNK------------ESI------------A-------IK-K-----------S-RQKAI---------------NPL--------R-FQ--L--KS-QD-S---YD------FY--K-QNS----------------KFTSFVLDTLFL
      -_Borrelia_hispanica_639481672                                             L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--N-FDSFIK--------DY-RM-AKTQVYAYL-RLANAIEAGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTV------------K-------TK-K-----------V-KQDSI---------------KLL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFMAFMLDTIFL
      -_Borrelia_burgdorferi_group_496158399                                     L-------KKKLYVN---LREGV-----------SNR-VECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIEAGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---L-KDK------------ESP------------V-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMQEIFE
      -_Borrelia_coriaceae_645024139                                             L-------KKKLYIN---LREGI-----------YNR-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIATALEEGL-L--E----------EQY-----------------------------------------VLE-N-----GFRQILGL---L-KDK------------ESE------------K-------LK-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QA-S---YD------FY--K-QNA----------------KFTSFLMDRLFA
      -_Borrelia_miyamotoi_763123871                                             L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--N-FDNFIK--------SY-RM-AKTQVYAYL-RLANAIEDGL-L--A----------EQY-----------------------------------------IIE-N-----GINESLAM---I-KNK------------ESV------------K-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-QD-S---YD------FY--K-EHS----------------KFTAFILDTLFS
      BCD_1474_Borrelia_crocidurae_DOU_576102339                                 L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------K-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFASFLMDTLFK
      -_Borrelia_burgdorferi_695262165                                           L-------KKKLYVN---LREGV-----------SNR-VECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIEAGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---L-KDK------------ESP------------V-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KA-QE-S---YD------FY--K-SNA----------------KFTSFMMQEIFE
      -_Borrelia_garinii_657235060                                               L-------KKKLYVN---LREGV-----------SNR-IACMKILKEIKD-----NEYYK--------IDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIESGV-I--E----------EQY-----------------------------------------VLD-N-----GFRSILSV---L-KDK------------ESP------------A-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YE------FY--K-SNA----------------KFTGFLLDKLFS
      -_Borrelia_hermsii_695262844                                               L-------KKKLYIN---LREGI-----------YNR-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------NY-DV-AKTQAYNYL-KIATALEEGL-L--E----------EQY-----------------------------------------VLE-N-----GFRQILSL---L-KDK------------ESA------------T-------IK-K-----------S-KVNPI---------------KPL--------R-FQ--L--KS-QE-S---YG------FY--K-SNA----------------KFTSFLMDELFE
      -_Borrelia_coriaceae_654876319                                             L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------FDG----HK--N-FDSFIK--------SY-RM-AKTQVYAYL-RLANAIAEGM-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KDK------------VSQ------------T-------VR-K-----------S-KQNSI---------------KPL--------R-FQ--L--KS-QE-S---YN------FY--K-ENS----------------KFTAFVLDTLFS
      -_Borrelia_bissettii_503783548                                             L-------KKKLYVN---LREGV-----------SNR-VECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIEAGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---L-KDK------------ESP------------V-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMQEIFE
      -_Borrelia_persica_740577787                                               L-------MAKMKQN---SKKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LEG----HK--S-FDKFIE--------SY-RM-AKTQVYAYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-H-----GINESLAL---I-KER------------KLL------------K-------LK-R-----------S-TQDLV---------------KPL--------R-FQ--L--ET-YE-S---YD------FY--K-KNS----------------KFISFLLEKLFA
      -_Borrelia_duttonii_501533328                                              L-------KEKLKQN---ARKEI-----------YYK-IENIRILKEIKD-----NEYYK--------LDG----HK--H-FDSFIK--------DY-RM-AKTQVYAYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTI------------K-------TK-K-----------I-RRSSI---------------NSL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFIAFMLDTIFL
      -_Borrelia_hermsii_645063171                                               L-------KKKLYIN---LREGI-----------YNR-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------NY-DV-AKTQAYNYL-KIATALEEGL-L--E----------EQY-----------------------------------------VLE-N-----GFRQILSL---L-KDK------------ESA------------T-------IK-K-----------S-KVNPI---------------KPL--------R-FQ--L--KS-QE-S---YG------FY--K-SNA----------------KFTSFLMDELFE
      -_Borrelia_crocidurae_644980725                                            L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--N-FDSFIK--------DY-RM-AKTQVYVYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTI------------K-------TK-K-----------I-RRDSI---------------NSL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFIAFMLDTIFL
      BDU_7025_Borrelia_duttonii_Ly_201084505                                    L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------R-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFAGFLMDALFK
      -_Borrelia_recurrentis_501533114                                           L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--H-FDSFIK--------DY-RM-AKTQVYAYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTI------------K-------IK-K-----------I-RRGSI---------------NSL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFIAFMLDTIFL
      BHO_0006701_Borrelia_hermsii_YBT_576093010                                 L-------KEKLKQN---ARKEI-----------YYK-VESIRILKEIKD-----NGYYK--------LDG----HK--N-FDSFIK--------SY-RM-AKTQVYAYL-RLANAIEEGM-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---IKKNK------------ESI------------A-------IK-K-----------S-RQKAI---------------NPL--------R-FQ--L--KS-QD-S---YD------FY--K-QNS----------------KFTSFVLDTLFL
      -_Borrelia_crocidurae_749307948                                            L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------K-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFASFLMDTLFK
      -_Borrelia_valaisiana_506379547                                            L-------KKKLYVN---LREGI-----------SNR-IECMKILKEIKD-----NKYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIESGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---F-KNK------------ESP------------T-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS
      BCD_1485_Borrelia_crocidurae_DOU_576102351                                 L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD
      -_Borrelia_crocidurae_644980346                                            L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------KY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ
      -_Borrelia_duttonii_501533271                                              L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ
      -_Borrelia_hispanica_639481710                                             L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ
      -_Borrelia_duttonii_740582129                                              L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ
      BDCR2A_01875_Borrelia_duttonii_CR2A_576313055                              L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD
      -_Borrelia_crocidurae_504509673                                            L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD
      -_Borrelia_duttonii_740582639                                              L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD
      -_Borrelia_hispanica_639482094                                             L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGFLLDTLFD
      -_Borrelia_persica_740577610                                               L-------KEKIINN---FKKEI-----------FHK-IETIKALKEIKD-----NKYYK--------LDG----HN--S-FNSFSK--------NF-RL-ARSQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-Y-----GIKYTISF---L-RNK------------EGI------------S-------LK-K-----------S-KVNPI---------------KPL--------R-FQ--L--KC-QE-S---YD------YY--K-KDS----------------KFTSFVMDTLFR
      BOM_0964_Borrelia_miyamotoi_FR64b_576103756                                L-------KDKLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NQYYK--------LDG----YN--N-FEEFTR--------HY-KI-AKTQAYEYL-KIANAMEEGL-I--Q----------EQD-----------------------------------------IIK-N-----GIHNIILS---L-RDK------------EGT------------N-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------YY--K-KNA----------------KFTSFLMDELFS
      -_Borrelia_hispanica_639482644                                             L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NAYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EQD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------AGF------------N-------IK-K-----------S-RQNVI---------------KPL--------K-FR--L--KR-QE-S---YD------FY--K-KNP----------------KFTGFILDEIFF
      -_Borrelia_miyamotoi_763123770                                             L-------KDKLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NQYYK--------LDG----YN--N-FEEFTR--------HY-KI-AKTQAYEYL-KIANAMEEGL-I--Q----------EQD-----------------------------------------IIK-N-----GIHNIILS---L-RDK------------EGT------------N-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------YY--K-KNA----------------KFTSFLMDELFS
      -_Borrelia_crocidurae_504509579                                            L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------A-------VK-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE
      -_Borrelia_persica_740577602                                               L-------KLKLKSN---FKEGI-----------YNK-LEAMKILKEIKD-----NHYYR--------YDG----YK--K-FSDFLG--------SY-DV-AKSQAYNYL-KIATAIEQGI-I--E----------ENY-----------------------------------------VLE-N-----GFREVLHL---I-RSK------------GCE------------K-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KN-QA-S---YD------FY--K-KNA----------------KFTSFLMDKLFL
      -_Borrelia_duttonii_499985609                                              L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------T-------VR-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE
      -_Borrelia_garinii_657235493                                               L-------KKKLKDS---FRSEI-----------YYK-MEVIKILKEIKD-----NKYYK--------LDG----YR--I-FEDFIK--------DY-DL-ARTQVYGYL-KIANAIQEGL-L--K----------ENY-----------------------------------------VIQ-N-----GVTKTIAF---L-K-K------------SID------------V-------SK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGYLLDKLFN
      -_Borrelia_hispanica_740573754                                             L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------T-------VR-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE
      -_Borrelia_crocidurae_504496143                                            L-------KERLKAN---FRKEI-----------FHK-IDSIRILKEIKD-----NKYYK--------LDG----YK--S-FDSFIK--------SY-RL-ARSQVYVYL-KIANAIEEGL-I--E----------ENY-----------------------------------------IIE-N-----GIHDTFNL---I-QNT------------GNK------------T-------IN-K-----------S-KKESI---------------QSL--------S-FH--L--KH-QE-S---YD------FY--K-QNI----------------KIISFILDELVV
      -_Borrelia_burgdorferi_499985629                                           L-------KEKLKTN---FKKEI-----------FHK-VENIRILKEIKD-----NEYYK--------FDG----YK--N-FLDFVK--------NF-NV-AKSQAYKYL-KLATALQDGV-L--N----------ENY-----------------------------------------VIE-N-----GIHNSFNY---I-KDK------------ESP------------S-------LK-K-----------S-KENPI---------------KPL--------R-LK--L--KT-QE-S---YD------FY--K-SKA----------------KFTSFMMNEIFE
      -_Borrelia_coriaceae_645023888                                             L-------KQKLKSN---FQQEI-----------YYK-MEAIKILKEIKD-----NEYYK--------LDG----YR--I-FEDFIK--------DY-KL-ARSQAYDYL-KIATALANGT-L--E----------ENY-----------------------------------------VIE-N-----GITQTIAF---L-R-T------------TSS------------K-------LK-K-----------S-KYNLI---------------KPL--------H-LQ--L--KS-QE-S---YD------FY--K-KNA----------------KFTGFILDILFS
      BCD_1669_Borrelia_crocidurae_DOU_576102548                                 L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------A-------VK-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE
      -_Borrelia_garinii_657235558                                               L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NKYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------ENY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------KTS------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN
      -_Borrelia_burgdorferi_497942842                                           L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KRLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNQ------------ANG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK
      -_Borrelia_burgdorferi_671550272                                           L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KRLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNQ------------ANG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK
      -_Borrelia_garinii_657248004                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_garinii_696413767                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_garinii_671520434                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_garinii_657234804                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SHT----------------RFTSFMMDEIFK
      -_Borrelia_garinii_671556237                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_garinii_501710213                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      BGP219_Borrelia_garinii_PBi_52696733                                       L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAVETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_afzelii_501669395                                               L-------ITQLRNN---IKSEI-----------YNI-IDTMKILKKIND-----KKLYV--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-RLATAVETGL-L--E----------ENF-----------------------------------------ITS-N-----GIRASIRY---V-KNK------------TSG------------I-------IK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------NFTNFMMNEIFE
      -_Borrelia_spielmanii_501898261                                            L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAVETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YT------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_valaisiana_506379500                                            L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_finlandensis_501928245                                          L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KKLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRF---I-KNK------------TNG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FK--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK
      -_Borrelia_burgdorferi_740592163                                           L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KRLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNQ------------ANG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK
      -_Borrelia_garinii_696412166                                               L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK
      -_Borrelia_bissettii_503789140                                             L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KKLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLATAVEEGV-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNK------------ANG------------T-------MK-K-----------S-KQNLI---------------KPL--------K-FQ--L--KN-QE-S---YA------FY--K-SNS----------------KFASFMMDEIFK
      -_Borrelia_hispanica_639481943                                             L-------KEQLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NAYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIESGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RKN------------NAD------------S-------VK-K-----------S-RINPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNS----------------KLTSFILDELFE
      -_Borrelia_duttonii_740582201                                              L-------KEQLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NAYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNN------------NAD------------S-------IK-K-----------S-RINPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNS----------------KLTSFILDELFE
      -_Borrelia_crocidurae_504509606                                            L-------KEQLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NAYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNN------------NAD------------S-------IK-K-----------S-RINPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNS----------------KLTSFILDELFE
      -_Borrelia_burgdorferi_671563339                                           L-------KDRLRAN---FRKEI-----------FHK-VDNIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_garinii_696419050                                               L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--T-FDAFIK--------NY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_afzelii_501574765                                               L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--T-FDAFIK--------NY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_burgdorferi_695263537                                           L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_burgdorferi_499186196                                           L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_turicatae_519700232                                             L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNA----------------KLTSFLLERVFS
      BAN_0003100_Borrelia_anserina_BA2_576100681                                L-------KERLKVN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-KNR------------EGV------------G-------IK-R-----------S-KQNPL---------------SPL--------R-FQ--L--KC-PE-A---YA------FY--K-RNA----------------KLTSFLLEKVFS
      BHO_0003100_Borrelia_hermsii_YBT_576092650                                 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----KEYYK--------IDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNA----------------KLTSFLLEKVFS
      BCO_0003100_Borrelia_coriaceae_Co53_576094173                              L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--T-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINNTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNA----------------KLTSFLLEKVFS
      BHY_1114_Borrelia_hermsii_YOR_576105484                                    L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS
      -_Borrelia_miyamotoi_645073074                                             L-------KEKLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----KEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINNTMFL---I-RNK------------EGV------------S-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-RNA----------------KLTSFLLEKVFL
      -_Borrelia_hermsii_644979602                                               L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS
      -_Borrelia_anserina_645048715                                              L-------KERLKVN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-KNR------------EGV------------G-------IK-R-----------S-KQNPL---------------SPL--------R-FQ--L--KC-PE-A---YA------FY--K-RNA----------------KLTSFLLEKVFS
      BDU_1115_Borrelia_duttonii_Ly_201084318                                    L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS
      -_Borrelia_hispanica_639481996                                             L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------T-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS
      BHW_0003100_Borrelia_hermsii_MTW_576091528                                 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS
      -_Borrelia_parkeri_644922901                                               L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--N-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNS----------------KLTSFLLEKVFS
      -_Borrelia_hermsii_645062976                                               L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS
      -_Borrelia_crocidurae_504496098                                            L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS
      -_Borrelia_644980614                                                       L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS
      -_Borrelia_lonestari_145652250                                             L-------KEKLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----KEYYK--------LDH----YS--S-FDDFAR--------DY-RL-ARTQTYKYL-KIATAIEEGI-I--E----------EKY-----------------------------------------VIN-N-----GINSTMFL---L-RNK------------EGV------------S-------IK-K-----------S-RQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLEKVFL
      -_Borrelia_coriaceae_645023282                                             L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--T-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINNTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNA----------------KLTSFLLEKVFS
      -_Borrelia_persica_639480295                                               L-------KESLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YN--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAIAEGL-L--E----------EKF-----------------------------------------IIE-N-----GLTMSLLS---I-RDK------------HGT------------T-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-YE-S---YD------FY--K-KNA----------------KFTGFVLDKLFW
      -_Borrelia_recurrentis_501533150                                           L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-NV-AKTQAYNYL-KIAAALEEGL-L--E----------EKF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-R-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ
      -_Borrelia_duttonii_501533243                                              L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NEYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EKD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------AGF------------N-------TK-K-----------S-KQNVI---------------KPL--------K-FQ--L--KR-QE-S---YD------FY--K-KNP----------------KFTSFILDEIFF
      -_Borrelia_crocidurae_504496216                                            L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NEYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EKD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------TGF------------S-------IK-K-----------S-RQNMI---------------KPL--------K-FQ--L--KR-QE-S---YD------FY--K-KNP----------------KFASFILDEIFF
      -_Borrelia_crocidurae_644980530                                            L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NEYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EKD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------AGF------------N-------TK-K-----------S-RQNVI---------------KPL--------K-FQ--L--KR-QE-S---YD------FY--K-KNP----------------KFTSFILDEIFF
      -_Borrelia_garinii_696422229                                               L-------KNELKNR---IEDDI-----------RNK-INTMKILLKIRN-----GKLYI--------LDG----YK--R-FEDFIF--------DF-KI-ARTQAYKYI-KIAKLIFEGK-L--K----------EIN-----------------------------------------IIE-D-----GIDKTLFN---L---M------------KDR------------K-------V--K-----------S-RANLV---------------KPL--------R-IR--L--ET-QE-A---YD------FY--K-RNP----------------KFTNHILE----
      -_Borrelia_burgdorferi_499192746                                           L-------KNELKNR---IEDDI-----------RNK-INTMKILLEIRN-----RKLYI--------LDG----YK--K-FEDFIF--------DF-KI-ARTQAYKYI-KIAKLIFEGK-L--E----------EID-----------------------------------------IIE-N-----GIDKTLFN---L---M------------KDK------------K-------I--N-----------S-KANLI---------------TPL--------R-VR--L--ET-QE-A---CD------FY--K-MNP----------------KFANYILEDFYQ
      -_Borrelia_coriaceae_645024479                                             L-------KEKLKEN---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------QY-KV-AKTQAYAYL-KLANALQNGI-L--E----------EGY-----------------------------------------IIE-N-----GIHNSLVL---I-ENK------------KNK------------T-------MK-K-----------S-RQKPI---------------RSL--------R-FQ--F--EN-QE-S---YD------FY--K-KNA----------------KFTSFLMDVLFR
      -_Borrelia_501532751                                                       L-------KSKLKDN---IKDDI-----------YNK-IEAMHILREIKD-----KEYYK--------LDG----YK--S-FSRFIK--------DY-KL-AKSQAYSYL-RIASAIQDGI-L--K----------EEY-----------------------------------------LIE-N-----GFRQSLSF---L-MEK------------ESK------------N-------LK-K-----------S-KINPV---------------KPL--------R-FQ--L--KS-QD-S---YN------YY--K-KNA----------------KLTGFILDKLFL
      -_Borrelia_hispanica_639481645                                             L-------KSKLKDN---IKDDI-----------YNK-IEAMHILREIKD-----KEYYK--------LDG----YK--S-FSRFIK--------DY-KL-AKSQAYSYL-RIASAIQDGI-L--K----------EEY-----------------------------------------LIE-N-----GFRQSLSF---L-MEK------------ESK------------N-------LK-K-----------S-KINPV---------------KPL--------R-FQ--L--KS-QD-S---YN------YY--K-KNA----------------KLTGFILDKLFL
      -_Borrelia_hermsii_644979506                                               L-------KRKLMIN---LKDEI-----------HAK-IITMKILKEIND-----KELYV--------QEG----YK--T-FSDFIS--------EF-NL-ARTQVYGYI-RMAAAISEGV-L--S----------EEY-----------------------------------------IIQ-N-----GIQNSLLF---I-RST------------NSD------------T-------IK-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-QNA----------------KFTSFLMDELFK
      -_Borrelia_crocidurae_644980942                                            L-------KSKLKDN---IKDDI-----------YNK-IEAMHILREIKD-----KEYYR--------LDG----YK--S-FSRFIK--------DY-KL-AKSQAYSYL-RIASAIQDGI-L--K----------EEY-----------------------------------------LIE-N-----GFRQSLSF---L-MEK------------ESK------------N-------LK-K-----------S-KINPV---------------KPL--------R-FQ--L--KS-QD-S---YN------YY--K-KNA----------------KLTGFILDKLFL
      -_Borrelia_persica_639480216                                               L-------KCKLKDN---IKEDI-----------YNK-IEAMYILKEIKE-----KRYYK--------LDG----YK--S-FSQFIK--------NY-KL-GRSQAYSYL-RIASAIEYGI-L--K----------EEY-----------------------------------------LIE-N-----GVRQCLIF---L-TKS------------ENI------------K-------IK-K-----------S-RQNLI---------------KPL--------R-FQ--L--KC-QE-T---YD------YY--K-KNS----------------KFTSFLMEELFR
      BHY_1499_Borrelia_hermsii_YOR_576105904                                    L-------KENFINS---FKKEI-----------VYK-IECMKILKEIKD-----NQYYK--------LDG----FK--T-FDSFTK--------NF-KI-ARSQIYNYL-KLAGAMEDGL-I--S----------EEY-----------------------------------------LLE-N-----GINDSLDL---I-KNK------------ERA------------T-------LK-K-----------S-TQNSI---------------KPL--------R-FQ--L--KD-RK-V---MI------FT--K-S---------------------ILSLQHSF-
      -_Borrelia_garinii_671520608                                               L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESL------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE
      -_Borrelia_burgdorferi_497943789                                           L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFL---L-RNK------------ESV------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE
      -_Borrelia_burgdorferi_group_493479353                                     L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESI------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE
      -_Borrelia_valaisiana_501894859                                            L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---I-RNK------------KGL------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE
      -_Borrelia_bissettii_503783725                                             L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESV------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE
      -_Borrelia_garinii_657248267                                               L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESL------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE
      -_Borrelia_burgdorferi_501588721                                           L-------KEKLKIN---SKKEI-----------YCK-LETLKILKEIKD-----NHYYR--------FDG----YK--S-FEAFSK--------DY-RL-ARAQVYNYL-KIANAIEDGI-I--Q----------EEF-----------------------------------------LIK-N-----GILETLIV---L-RNK------------ESK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-SNA----------------KFTGFMLDKLFS
      -_Borrelia_garinii_657235047                                               L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------KTS------------I-------IK-K-----------T-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-KNA----------------KFTGFLLDKLFN
      -_Borrelia_spielmanii_493479385                                            L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETA------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS
      -_Borrelia_burgdorferi_671580297                                           L-------KNRLKTN---IKRKF-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDF---I-EGQ------------ETS------------I-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS
      -_Borrelia_garinii_501710973                                               L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETS------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN
      -_Borrelia_burgdorferi_497945190                                           L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KKS----------------KFTSFMMHEIFE
      -_Borrelia_burgdorferi_488739923                                           L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KNA----------------KFTAFILEELLK
      -_Borrelia_coriaceae_752506999                                             L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NEYYK--------IDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVAAAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------T-------IR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ
      BCO_0130002_Borrelia_coriaceae_Co53_576094619                              L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NEYYK--------IDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVAAAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------T-------IR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ
      -_Borrelia_burgdorferi_group_488735361                                     L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDF---I-EGQ------------ETS------------I-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS
      -_Borrelia_burgdorferi_497943336                                           L-------KDRLKAN---FRKEI-----------YHK-LDSIKILKEIKD-----NQYYK--------IDG----YK--K-FDYFIK--------DY-KI-ARSQAYNYL-KLATALQEGI-L--K----------EDY-----------------------------------------LIE-N-----GIHNSLDL---I-KDK------------ESP------------T-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFM
      -_Leptospira_interrogans_446063157                                         --K-----EQNFRAQ--FLHERI-Q---ANF-------IGLVFDLKEMRD-----KKLYS--R---L---G----FD--N-FKDYLK-STL---PKF--V-TLSFASNLM-----LLSDKM-S--E----------EDY-----------------------------------------KKT-N---P-NQVQILAK---I----------------ASN--------PD--V-FE----IS-H-----------K-VKGEI-----------HLS-NGM--V-------MD--L-----EE-----YE----T-TY--A-DEI----------------AQQTDVYREAIK
      -_Leptospira_santarosai_490596705                                          --N-----EQNFRAQ--FLHERI-Q---ANF-------IGLVFDLKEMRD-----RKLYA--R---L---G----FD--N-FKDYLK-SAL---PKF--V-TLSFASNLM-----LLSDKM-S--E----------EDY-----------------------------------------KKT-N---P-SKVQVLAK---I----------------ASN--------PD--V-FE----IS-H-----------K-VKGEI-----------HLS-NGT--V-------MD--L-----EE-----YE----T-IY--A-NEI----------------AQQTDAYREAIK
      -_Borrelia_hermsii_645063111                                               L-------KDRLKES---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------EY-KV-AKTQAYAYL-KLASALQDGI-L--Q----------EDY-----------------------------------------IIE-H-----GIHNSLVL---I-GNE------------RNK------------T-------IR-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-HD-S---YN------FY--K-KNA----------------KFTSFLMDEL--
      -_Borrelia_burgdorferi_497942632                                           L-------KDRLRAN---FRKEI-----------FHK-VDNIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DF-NI-ARSQAYNYL-KLAAALQEGI-L--K----------EDY-----------------------------------------VIE-N-----GIHNSLNL---I-QDK------------ESP------------T-------FK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFMLDKLFS
      -_Borrelia_coriaceae_654876378                                             L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NEYYK--------IDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVAAAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------T-------IR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ
      -_Borrelia_hermsii_645010591                                               L-------KDRLKES---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------EY-KV-AKTQAYAYL-KLASALQDGI-L--Q----------EDY-----------------------------------------IIE-H-----GIHNSLVL---I-GNE------------RNK------------T-------IK-K-----------L-RQNPI---------------KPL--------R-FQ--L--KS-HD-S---YE------FY--K-KNA----------------KFTSFLMDELFR
      -_Borrelia_hermsii_644979720                                               L-------KDRLKES---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------EY-KV-AKTQAYAYL-KLASALQDGI-L--Q----------EDY-----------------------------------------IIE-H-----GIHNSLVL---I-GNG------------RNK------------T-------IR-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-HD-S---YN------FY--K-KNA----------------KFTSFLMDELFR
      -_Borrelia_afzelii_500023248                                               L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETL------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN
      BCO_0130005_Borrelia_coriaceae_Co53_576095359                              L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NDYYK--------LDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVATAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------M-------MR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ
      -_Borrelia_burgdorferi_497944835                                           L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KKS----------------KFTSFMMHEIFE
      -_Borrelia_garinii_501704211                                               L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETL------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN
      -_Borrelia_burgdorferi_501930839                                           L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KKS----------------KFTSFMMHEIFE
      -_Borrelia_garinii_671501759                                               L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETS------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------RFTGFLLDKLFN
      -_Borrelia_coriaceae_645024774                                             L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NDYYK--------LDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVATAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------M-------MR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ
      -_Borrelia_afzelii_504299060                                               L-------KDRLKAN---FRKEI-----------FHK-VDNIRILKEIKD-----NEYYK--------LDG----YK--S-FFAFVK--------DY-NI-ARTQAYNYL-KLATALQEGF-I--K----------EDY-----------------------------------------IIE-N-----GIHNSLDL---I-QDK------------ESP------------T-------FK-K-----------S-KKNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN
      -_Borrelia_duttonii_740582299                                              L-------VKQLKNN---IKSEI-----------YNV-IDTMKILKKIND-----KKLYI--------EGG----FK--S-FKDFLS--------EF-KL-AKTQSYEYI-KLATAIETGL-L--E----------EDF-----------------------------------------ITL-N-----GIRASIRY---I-KTK------------TNG------------I-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNA----------------KLTGFILDRLFL
      -_Borrelia_persica_639480329                                               L-------VKQLKNN---IKSEI-----------YNI-IDTMKILKRIND-----NKLYA--------EGG----FS--S-FKDFLS--------EF-KL-AKTQSYEYI-KLARAIETGL-L--E----------EDF-----------------------------------------ITL-H-----GIRASIRY---I-KTQ------------ANG------------I-------IK-K-----------S-KQNPV---------------KPL--------R-FQ--L--KH-QE-S---YN------FY--K-KNA----------------KLTGFILDNLFL
      -_Borrelia_burgdorferi_499186290                                           L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIKDGI-L--E----------EAY-----------------------------------------VIE-N-----GVTKTLEF---L-R-K------------SPN------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGYLLDKLFN
      -_Borrelia_burgdorferi_group_501704326                                     L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIKDGI-L--E----------EAY-----------------------------------------VIE-N-----GVTKTLEF---L-R-K------------SPN------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGYLLDKLFN
      -_Borrelia_garinii_696415807                                               L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIQDGV-L--E----------EAY-----------------------------------------VIE-N-----GVTKAIAF---L-R-K------------SPG------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-HE-S---YD------FY--K-SNA----------------KFTSFMMHELFE
      -_Borrelia_valaisiana_501894944                                            L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIQEGI-L--E----------EAY-----------------------------------------VIE-N-----GISKAIAV---L-R-E------------SPS------------G-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-KE-S---YD------FY--K-SNA----------------KFTSFMMHEIFE
      -_Borrelia_garinii_671481046                                               L-------KQRLKSN---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIEEGV-L--E----------EAY-----------------------------------------VIE-N-----GVTKTIAF---L-R-K------------SPS------------I-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMHEIFE
      -_Borrelia_burgdorferi_501704894                                           L-------KNRLVNN---FKKEI-----------FHK-IEFIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--D----------EAY-----------------------------------------VIE-N-----GLTISLLS---L-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      BBUBOL26_W05_Borrelia_burgdorferi_Bol26_226232418                          L-------KNRLVNN---FKKEI-----------FHK-IEFIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--D----------EAY-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_burgdorferi_497943851                                           L-------KNRLVNN---FKKEI-----------FHK-IEFIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--D----------EAY-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_afzelii_504299173                                               L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIEEGV-L--E----------EAY-----------------------------------------VIE-N-----GVTKTIAF---L-R-K------------SPG------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMHEIFE
      -_Borrelia_afzelii_504299035                                               L-------KNKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--E----------EAF-----------------------------------------IIE-N-----GLTMSLLS---L-REK------------ESP------------T-------FK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE
      -_Borrelia_garinii_671501078                                               L-------KEKLKTN---FKKEI-----------FHK-VENIRILKEIKD-----NQYYK--------FDG----YK--N-FLDFVK--------DF-NV-AKSQAYKYL-KLAAALQDGV-L--N----------DDY-----------------------------------------VIK-N-----GIHNSFNY---I-KDK------------EGP------------S-------LK-K-----------S-KQNPI---------------KPL--------R-LK--L--KT-QE-S---YD------FY--K-SKA----------------KFTSFMMNEIFE
      -_Planctomyces_brasiliensis_503393144                                      L-T--DA-QQAKLQD---CEIVI-EDGLKAF-------LKTCIAVVVIDD-----LELYK--P------------HK--S-LHAYCA-------FRF-DY-SDTETGRLR--NAGHVLVNL----S-----------GL-----------------------------------------SAD-D--L-------------F-SGK------------ESD------------I-V-----LP-A-------------NEGQC---------------REM----------AK--L--KK-GK-K-QDAD-LQRK-VW--A-EVI-KRAGQ--------D-KITARLIKEVVD
      -_uncultured_Mediterranean_phage_uvMED_787047096                           M-T--EQ-EQKELIE---AETVI-K---SSFQGKMERDLAIGAGLLKIKR-----QKLYR-GV---------SG-GR--L-WIDYLK-EES-A-KLT-GN-AEPISDQLA--RNLRGFYEF----R------------------------------CE----------------------ILQ-D---------------------------------LYN-YI---------I-------LP-T-------------NKSQV---------------TPI--LGY-----LK-----NP-KE-A---VE------IW--K-AAC-SEAGS----N---K-VPTYHQVNKAYY
      -_Gloeocapsa_sp_PCC_7428_505002935                                         I-Q--QH-TQELKER---LQRTA-----QDI-------WEIGQKLAEVRS------------R-----LK-----HG--Q-FDNWLK-------AEF-GW-SRRTAYNFI-----NVYETF------------------------------------N----------------------ERA-K----------------F--------------------------------------------------------AHFNI-AT------------SAL----------YL--L--AS-PS-T---PQ----D-IK--D-QFI-EVAQT----G---Q-KVTHKDIRKALE
      -_Nostoc_sp_PCC_7120_499306042                                             V-E--QR-TSEIREQ---LRRTA-----QDI-------WEIGQSLAEVRA------------Q-----LK-----HG--Q-FETWLK-------AEF-GW-SRRTAYNFI-----NVYETF------------------------------------G----------------------NRA-N----------------L--------------------------------------------------------AQIDI-AT------------SAL----------YL--L--AA-PS-T---PE----N-LR--E-QYI-EEAKA----G---K-RITHKELVQTIK
      -_Cyanothece_497233611                                                     I-Q--QL-TQEIRDC---LRRSA-----QDI-------WEIGQKLADVRD------------R-----LK-----YG--Q-FDTWLK-------VEF-GW-SRRTAYNFI-----SVYQTF------------------------------------G----------------------ERA-N----------------L--------------------------------------------------------AQVDI-AT------------SAL----------YL--L--AA-PS-T---PQ----K-VR--E-EFL-QKAQA----G---Q-TITHKQLSEVIQ
      -_Crocosphaera_watsonii_494515216                                          I-Q--QL-TQEIRDC---LRRSA-----QDI-------WEIGQKLADVRD------------R-----LK-----YG--Q-FDTWLK-------TEF-GW-SRRTAYNFI-----SVYQTF------------------------------------G----------------------ERA-N----------------L--------------------------------------------------------AKVNI-AT------------SAL----------YL--L--SA-PS-T---SQ----K-VR--E-EFL-QKARS----G---E-TITYKQLSEVIQ
      -_Fischerella_sp_PCC_9339_515877202                                        I-Q--QR-TGEIKER---LRRSA-----QDI-------WEIGQKLADVRS------------Q-----LK-----HG--Q-FDTWLK-------AEF-GW-SRRTAYNFI-----NVYEAF------------------------------------D----------------------ECA-N----------------L--------------------------------------------------------AQIDI-AT------------TAL----------YL--L--AA-PS-T---PE----N-VR--E-EIL-QRAKG----G---E-TLTHKDIRQVIK
      -_Nostoc_punctiforme_501381574                                             V-N--TS-IQETLTA---IDRFE---------------WQAIEELRLMRD-----NGYYS--D---------AG-YV--S-FEDYCE-KEL---TKH-G--GYRRVRDLL--SAKKVVDTL--------------------------------------PE-------------------ELR-E----------------------K-------------------------I----------T-------------KPSQT---------------RSL----------LR--L-----VK-T---PD------KL--H-EAV-AIAAK-EK-----P-FPTAADFAKAVQ
      -_Nostoc_punctiforme_501381627                                             V-N--TS-IQETFSA---INRFE---------------WQAIDELRLMRD-----NHYYK--D---------GG-YL--S-FEDYCE-KEL---IKH-G--GYRRVRDLL--SAKKVVDTL--------------------------------------PE-------------------ELK-D----------------------K-------------------------I----------T-------------KPSQT---------------RSL----------LR--L-----VK-T---PD------KL--E-QAV-AIAAK-EK-----P-FPTAADFTKAVQ
      -_Nostoc_punctiforme_501381520                                             L-N--NG-ILDTFTA---IDRFE---------------WQASDELRLMRD-----KGYYQ--D---------GG-HK--S-FEAYCE-SEL---TKH-G--GYRRVKDLF--AAKRVVDTL--------------------------------------PE-------------------ELR-P----------------------H-------------------------I----------T-------------KPSQT---------------RSL----------LR--L-----VK-T---PD------KL--E-QAV-AIAAK-EK-----P-FPTAADFAKAVQ
      -_Cyanothece_497232009                                                     --------QIAKRAE---LENIV-IEHSQSF-------TIIGKALREIQE-----KKLHQIED---------P--NK--R-WDVYVD-------ETF-GI-VKSRAYQLI--AGTVLYEVL----E---------------------------------DN-------------------LKG-D--YS----------------------------------------------------LP-K-------------SDTQL---------------RPL----------YS--L------------VK------GW--R-KAT-AKNDV----------EEKERIEQLIVK
      -_Cyanothece_sp_CCY0110_495553174                                          --------QIAKRAE---LENIV-IEHSQSF-------TIIGKALREIQE-----KKLHQIED---------P--NK--R-WDVYVD-------ETF-GI-VKSRAYQLI--AGTVLYEVL----E---------------------------------DN-------------------LEG-D--YS----------------------------------------------------LP-K-------------SDTQL---------------RPL----------YS--L------------VK------GW--R-KAT-AKNDV----------EEKERIEQSIVK
      -_Anabaena_variabilis_499635690                                            L-N--SS-IQETFTS---LDRFE---------------WQAVGELLQMRD-----QELYI--E---------AG-YA--D-FKEYCQ-REL---SAW-G--GYRRISQLL--GAKKVIDSV---------------------------------------G-------------------ELG-Q----------------------H-------------------------I----------K-------------NERQA---------------LPL----------LR--L-----VK-E---PQ------KL--R-EAV-AIAVQ-EN-----P-SPSESDFAAAAQ
      -_Nodularia_spumigena_493212749                                            L-N--TS-IKETFAA---IDRFE---------------WQAVDKILQMRE-----QQIYL--E---------GG-YK--N-FEEYCQ-REL---SAW-G--GYRRINQLL--GAKKVIEAA---------------------------------------G-------------------EFG-G----------------------H-------------------------I----------K-------------NERQS---------------RPL----------LR--L-----VK-E---PE------KL--K-QAL-AIALE-QN-----P-SPSESDFAAAAK
      -_Nostoc_sp_PCC_7120_499308918                                             L-N--SG-IKETFTA---IDRFE---------------WQAVGEILQMRD-----QELYL--E---------AG-YA--D-FKEYCQ-REL---SAW-G--GYRRITQLL--GAKRVIDTV---------------------------------------G-------------------ELG-Q----------------------H-------------------------I----------K-------------NERQA---------------RPL----------LR--L-----AK-E---PE------KL--K-QAV-TIALQ-EN-----P-SPSESDFAAAAQ
      -_Anabaena_variabilis_499635648                                            L-N--SS-IKETFTT---IDRFE---------------WQAVEEILQMRD-----QQLHR--E---------AG-YK--S-FEEYCQ-AEL---SAW-G--GYRRITQLL--GAKKVIDAV---------------------------------------G-------------------ELG-E----------------------H-------------------------I----------K-------------NERQA---------------RPL----------LR--L-----VK-E---PD------KL--K-EAV-AIALQ-EN-----P-NPSESDFAAAAR
      -_Kitasatospora_sp_MBT63_727525039                                         F-E--AD-KQTAHTR---FAQQA------------------GPALWEIHD-----RKLYR--S---------T--HS--T-WEEYLG-------ERW-GL-SRSYAHRLL--EMIPVQAAL-L--P------------------------------------------------NP----AFG-N-------------------------------------L----------V-------LR-E-------------SQARV-----L---------VPV----------LR-EY--GP-AQ-V---RE------VV--E-KAL---ADG--------A-RPTAKALTAART
      -_Anaeromyxobacter_dehalogenans_501750516                                  L-A--AR-AEDLPEG-S-MRRKV-LEGAQRF-K-SAW-VELGRLLSEVRR-----KELWR--G---W---G----YP--S-FERYCT-------KEL-FI-RGATADKLT--ASYGFLERH----E----------------------------------------------------P-ELA-K------------------------------------------------A----------R-------------GETRA---------------PPF--------E-----V------------IE------VL----SRA----------------EATGRLSDSGWR
      PSR1_03440_Anaeromyxobacter_sp_PSR-1_775300647                             L-A--AR-AEDLPEG-S-MRRKV-LEGAQRF-K-SAW-VELGRLLSEVRR-----KELWR--G---W---G----YP--S-FERYCT-------KEL-FI-RGATADKLT--ASYGFLERH----E----------------------------------------------------P-ELA-K------------------------------------------------A----------R-------------GETRA---------------PPF--------E-----V------------IE------VL----SRA----------------EATGRLSDSGWR
      -_Anaeromyxobacter_sp_K_501518403                                          L-A--AR-AEDLPEG-S-MRRKV-LEGAQRF-K-SAW-VELGRLLSEVRR-----KELWR--G---W---G----YP--S-FERYCT-------KEL-FI-RGATADKLT--ASYGFLERH----E----------------------------------------------------P-ELA-K------------------------------------------------A----------R-------------GETRA---------------PPF--------E-----V------------IE------VL----SRA----------------EAAGRLSDSGWR
      -_Streptomyces_sp_NRRL_F-5702_664543262                                    L-T--PK-EQQTLGR---VHAAR-DHHQAAK-------WMRGKALAVAFS-----RRLFR--G-----EDG----RR--T-RQEYLD-------DEWDGI-SESAAYREI--GEWPVAKAI----S----------------------------------------------------D-ACE----------------------------------------------------------RP-A-------------PDSHV---------------RAL----------VD--V--AK-QQ-G---AE--PVA-RW--Y-AEL-RRHGQ-QA-G---H-RVTADVVANLAD
      -_Streptomyces_sp_150FB_748778099                                          L-N--AR-EQQDLDR---VHAAR-DHHRAAK-------WMRGKALEAAFR-----RRLFR--G-----EDG----TR--S-RQQYLD-------EEWDGL-SESAAYREI--GEWRLAKEI----T----------------------------------------------------D-ACE----------------------------------------------------------RP-A-------------PDSHV---------------RAL----------LD--V--AG-AQ-G---HK--QVA-HW--Y-AEL-RRHGQ-QT-G---R-RVTADAVANLAD
      -_Streptomyces_sp_NRRL_F-6628_739996264                                    L-N--AK-EQQQLER---IHSAR-DHHQAAK-------WMRGKALDSAFR-----RRLFR--G-----EDG----QR--T-RQQYLD-------AEWDGM-SESAAYLEI--REWPLAAQI----S----------------------------------------------------A-TFG----------------------------------------------------------RP-A-------------PDSHV---------------RAL----------VG--V--AE-NQ-G---HE--TVA-AW--Y-ADL-RRHGQ-EL-G---Q-RVTADVVANLAD
      -_Scytonema_millei_748141416                                               V-Q--QR-TKELKER---LQRTA-----QDI-------WEIGKKLVEVRA------------E-----LKG----HG--Y-FDAWLR-------AEF-GW-SRRTAYNFI-----YVYEAF-----------------------------------------------------------PYA-K----------------F--------------------------------------------------------AQMII-EP------------SAL----------YR--L--AS-PS-T---PD----A-IR--D-KFI-QQANA----G---S-KVSHKEVLKAVT
      -_Streptomyces_664512363                                                   I-F--AV-QYAAKAN---HERAE-Q---QKL-------IGLGLRLQAIKD-----EELHK--H---------TG-FE--T-FGALTD-------ARF-GI-KKHQANNIL--RVLGVAQAL----E------------------------------------------------------DVT-T--------------------------------------------------------QE-L-------------KERPL---------------RVL----------VP--I--LD-TH-G---AD-AVRE-TW--A-EAA---RHG----------NVTDTALKEAAN
      -_Streptomyces_coelicolor_499350288                                        L-N--DQ-ERGYLDV---CEQAL-HGFRKSV-------VVAGKALEVINR-----GRLYR--E---------T--HE--T-FADYVT-------EVW-DM-KRAHAYRMI--EGWRPADLV----S------------------------------------------------------PIG-D-----------------------------------------------------------I-------------NEGQA---------------REL----------AP--V--LK-EY-G---PE-VTVT-LY----RGV-KELRG----D---R-RVTAADLSEARA
      -_Streptomyces_yeochonensis_740055334                                      --Q--KD-QTETVIR---TAHAA---GKAAV-------WVMGQGIAAAAK-----GKWFR--R---------T--HS--S-LEQYVV-----D-LIP-DV-VPRQARRWV--TGYPIALAI-T--S------------------------------------------------------RTG-E--------------------------------------------------------SP---------------VEGQV---------------REI----------AD--L--PE-SV-A---VE------LY----AAA-DTAAR-AA-G---G-RLTAKHLTDLRR
      -_[Clostridium]_clostridioforme_488660258                                  --------DTRRLAN-I-AYKDI-K---NGF-------VGFGYYLKIIRD-----EKLWQ--------GQG----YD--S-FNEFLG-------DEY-GK-DKSWASRCI-----NLYDKF---------------------------------------------------------------G--------------------------------------------------------IP-I-------------EPGEL---------------PRL--------------------EE-Q---YE------VY----NVS-QLIEM----------LPMSEELREQVT
      -_Borrelia_duttonii_740582340                                              --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK
      -_Crocosphaera_watsonii_494514846                                          L-T--RD-DKYGYED-A-LAQLK-S---HQF-G-W---VKFGLWAFQFKV-----KRFYK--Y---------H--HK--T-WKQFCE-------NVL-HR-NRYYVDKLI--KAVRVMKDL-----------------------------------------------------------ICA-G--FE--------------------------------------------V-------LP-Q-------------NEYQC---------------RFL----------TK--F--WG-EE---L-TE------NW--A-MIV-DAVPP--------H-LITGDLLKAQFS
      -_Borrelia_miyamotoi_645073449                                             --------EEIRART---LNEAI------NK-------VELAKALYEIKK-----NKLYR--F---------DG-YD--N-FYGFCL--------NY-KF-SRTMIYRYI--KIGAYLEKD---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDV-IH------------SSL----------NK--I------------IN----------D-IRV-KGSDS----------EYKPVII-----
      EUS_21090_[Eubacterium]_siraeum_70/3_291531542                             V----HQ-DLMIQEQ-V-VAQSL---------------TQIAIDLKEIRD-----RRLYA--E---L---G----YS--D-FAEYCE-------NAT-KT-GKRQAYNLI-----SLVEQY-----------------------------------------------------------KID-D----------------L-------------------------------S-----R-LA----------YL---GSTKL---------------IAL----------KS--L--GK-EE-----RE------------ELI-ESGKA--------E-ELSVRELKEKIK
      -_Pedosphaera_parvula_750252315                                            L-T--TE-ETTTLSE---CEAVI-EEGMKTF-------VEVGSAVLTISD-----RRLYR--A---------T--HS--T-FEDYIQ-------DKW-DM-TARRAYQLC--EAAEVVMKL----E------------------------------------------------------NVK-H---------------------------------ASQ------------I----------E-------------NARQA---------------EAL----------AK-----AP-EE-K---RD----E-VL--E-KAI----------Q---T-APEGKLTSKH--
      NITHO_3110002_Nitrolancea_hollandica_Lb_390172790                          --P--SA-SHHARHA---CPRCA-----ISS-------ISRNPTCRVLSSLLRPYRRLYR--E---------AG-YR--T-FEDYCQ-------KRW-GW-TRQRAHQLM--LAAEVSTTV---------------------------------------------------------------D------------------------------------------------I-------PP-A-------------NEAQA---------------REL----------SR--L--KS-TE-A-I-RE------TW----QEV-REERG--------E-SVTARDVRETGY
      -_Streptomyces_virginiae_664375955                                         I-E--AS-VGQYQQS---LRRLQ-A---RHR-------VEVGQLLDEINE-----SGLWE--L---------EG-HE--K-FGHYVK-------ARW-GW-DRSYGYRLI--DLALVHRAL----A------------------------------------------------------PLG-P----------------------A----------VLD-----T------V-------VE-S-------------HAREL---------------APV----------VK--V--NG-DE-G---AR------HF--V-EVL-RLESG----G---K-KVTAAVIAARRD
      -_Chamaesiphon_minutus_504973660                                           L----GG-GLGDFLS---LDELA-KDNLFGF-------VRVGIAADRIRK-----RKLWQ--C-AKI--------YK--D-WNDYCT-------KGL-GK-TAWYINRTI--DAANVVMTL----I------------------------------------------------------SAG----FK--------------------------------------------I-------LP-T-------------CEAQC---------------RPL--VKL--LG-VG--L--DA------I-AD------VW--T-KVV-SAFTP--------D-KITAGKILAIAE
      -_Kitasatospora_sp_MBT63_727520012                                         F-A--GA-EYAARAN---IQQST-Q---QRV-------IVQGQILLAMRE-----EELWK--A---------LG-WT--D-FDVLVK-------HRF-GI-GRNYANKII--RSMPVVRAL----E------------------------------------------------------HVT-S--------------------------------------------------------ME-M-------------AEKHL---------------RAL----------VP--V--QE-RH-G---DE-AVRR-TW--E-EAL---RKG----------KITEKSLKEAAR
      -_Ruminococcus_flavefaciens_497670003                                      Y-T--EA-YNLNVRI-C-INAQM---AQQNL-------YEVCKGLKEMRD-----GKLYK--E------LG----YN--S-FEDYTE-------NEV-GL-SRFMAYKY---AAIADMKNV----E----------------------------------------------------------S------------------------------------------------I-------QQ-I-------------GVTKL---------------ALL----------AK--L-----DE-----PQ----------R-EEI-QQSVN------V-E-EVSVRELKAEID
      -_Streptomyces_sp_NRRL_F-5140_664445691                                    L-T--PE-EQEQLAE---CHRAV-DNARSAQ-------WMLGRALEIVRR-----RRLYR--G------DG----TR--T-WPQYLA-----A-EHD-GM-TERDARRLQ--EEWRLAKAV-----------------------------------------------------------QEA--------------------------------------------------L-----G-KP-A-------------PASHV---------------RAM----------LE--Y--AD-NT-S---IE-QAAI-DY----AML-RAAFD-SG-R-A-R-LVAHQITARVTR
      -_Crocosphaera_watsonii_494520212                                          L-T--RD-DKYGYED-A-LAQLK-S---HQF-G-W---VKFGLWAFSLRS-----KRFYK--Y---------H--HK--T-WKQFCE-------NVL-HR-NRYYVDKLI--KAVRVMKDL-----------------------------------------------------------ICA-G--FE--------------------------------------------V-------LP-Q-------------NEYQC---------------RFL----------TK--F--WG-EE---L-TE------NW--A-MIV-DAVPP--------H-LITGDLLKAQFS
      -_Streptosporangium_roseum_502659622                                       V-P-----TLEDCER-H-ITTIT-----TQW---L---LGVGRALAAIRD-----HELFQ--E------KG----YT--S-FTAYLR-TE----HPW----HPSYVSRVI--ANIPVVEAL----E------------------------------------------------------RHG-A---------------------------------DRD-----------------------L-------------NEGQA-TA--I---------RPV--------W-EQ-----HG-EE-A-L-FE------VW--------DATTG--------K-RSAAALVRVARA
      -_Kamptonema_formosum_518317229                                            L-D--YL-DVDVDRE---IQVGF-----FSY-------VKVGYFLDKMRY-----YKLYQ--K---------QG-FN--S-FKEYCL-------KVL-RK-SAHYCIKII--SAAEVCLRL----A------------------------------------------------------ALG-----------------------------------FEQ--------------------LP-N-------------CVAQA---------------LPL----------VK--F--NP-VF-G-L-DS-PLYD-KW--Q-DVL-DNTPP----G---Q-PITAKHIAEILD
      -_Streptomyces_sp_FR1_501453636                                            I-G--KA-QDNAELT---VRQAK-D---RFT-------REAGPALALIHD-----DELWR--P---------E--YE--S-FVDYVK-------RRW-DY-SSTHGYLLV--ATAKVQKAL-----------------------------------------------------------PEG----------------------------------------------------------AS-A-------------NTGHV---------------QVL----------AP--V--LR-HN-G---LE-AVSE-AW----AKS-EKRNG----------QPTAATLKAAVD
      HMPREF1089_00435_[Clostridium]_bolteae_90B3_480702301                      --------DTRRLAN-I-SYKDI-K---NGF-------VGFGYYLKIIRD-----EKLWQ--------GQG----YD--S-FNEFLG-------DEY-GK-DKSWASRCI-----NLYDKF---------------------------------------------------------------G--------------------------------------------------------IP-V-------------EPGEL---------------PRL--------------------ED-A---YE------SY----NVS-QLIEM----------IPMQEELQEQVT
      NITHO_3110002_Nitrolancea_hollandica_Lb_390172790                          --P--SA-SHHARHA---CPRCA-----ISS-------ISRNPTCRVLSSLLRPYRRLYR--E---------AG-YR--T-FEDYCQ-------KRW-GW-TRQRAHQLM--LAAEVSTTV---------------------------------------------------------------D------------------------------------------------I-------PP-A-------------NEAQA---------------REL----------SR--L--KS-TE-A-I-RE------TW----QEV-REERG--------E-SVTARDVRETGY
      -_Streptomyces_sp_PAMC26508_505393262                                      L-D--DR-EREHLAV---CEQAL-TGFRKSV-------IVAGKALEVINR-----GRLYR--E---------T--HS--T-FVEYLD-------DVW-EI-RKSQAYRMI--EAWPVAAAV----S------------------------------------------------------PIG-D-----------------------------------------------------------I-------------NEGQA---------------RQL----------QP--V--FK-DY-G---HE-AALA-VY----REV-KALRG----D---R-KVTAADLAEARA
      -_Streptomyces_739808094                                                   L-D--DQ-QRAHLLV---CEQAL-TGFRKSV-------IVAGKALEVISR-----GRLYR--E---------T--HA--T-FVEYLD-------DVW-EI-RKSQAYRMI--EAWPVAAAV----S------------------------------------------------------PIG-D-----------------------------------------------------------I-------------NEGQA---------------REL----------RP--V--FT-DY-G---QE-AAVA-LY----REV-KELRG----N---R-KVTAADLAEARA
      -_Acaryochloris_marina_501118686                                           L-A--DL-EEIFLEP---TETGS-----DAL-------LSSGLALKVIQD-----NKLYL--P---------D--SK--G-FKVYVE-------ENL-GV-TYIHAFRCI--QAAELVLFL-Q--E------------------------------------------------------HFS--------------------------------------------------V-------LP-Q-------------SESAA---------------RPL----------VK--L--SR-AN-Q---LK------AW--G-EVL-RITAG-DK---W---APGKDRIKKTIA
      -_Streptomyces_sp_NRRL_B-24484_663245548                                   --------RQGAAET---IRAAK-A---RHD-------MQVGQALELIRD-----QKLYE--A---------TG-FG--S-FREYVE-------QRW-GY-SLSRAYQMM--DTILVMSAV----S------------------------------------------------------TIV-E------------------------------------------------T-------VP---------------PEGQQ---------------RVL----------AT--V--IR-QH-S---PE-AACM-LL----ESA-RTAPG----------KLTAKKLTELRD
      -_Streptomyces_sp_CNQ865_654253933                                         L-I--GE-EEEVFQR---CEAAV-ETLKFAF-------WAAGKGLQVIRD-----GRLYR--A---------T--HG--T-FDDYVQ-------DRW-GM-TRAQANKLI--RMWPIAEAL----F------------------------------------------------------ESQ-A----------------------Q----------ESN------------D-LARTRAKR-L-------------SQSVV---------------WEL----------VP--V--AE-RY-D---VD-AAQH-LY----STT-VEASG--------G-EVTAAVLKGAVA
      -_Desulfococcus_multivorans_750110637                                      I----ND-EDEDEFI---IEYEP-----PWF-------VQVGQALSHLKE-ALL-TEFPC--A---------E--QG--P-LPRWGE--TC---KEL-SI-SQSYANRLI--AAAEVYVAL----R------------------------------------------------------SAG--------------------------------------------------I-DEDD--LP-I-------------YERQV---------------RPL----------VR--F--KQ-DP-S-I-LK----Y-LW--E-EAL-VIAED-IE-F-N-S-LPRAGVVEYVVG
      -_[Kitasatospora]_papulosa_662754816                                       L-T--DA-DRADLEL---CEQAV-RSHHATF-------WMTGKALDAVAK-----RHLYR--A---------R--YA--N-FDALL--------EDW-DV-TLADSSRMR--RGWPLAARL-L--P------------------------------------------------------DVP-K--------------------------------------------------------LT-R-------------SHVEA-----L---------LPV--VER-----YG--V--DA-AA-T-L-HA------ML--R-EAL--------------P-KVTAKAITDVVR
      -_Acaryochloris_marina_501117833                                           L-A--DL-EEIFLEP---PDTGS-----DAL-------LSSGLALKTIQD-----KKLYL--P---------D--SK--G-FKVYVE-------ENL-GV-TYIHAFRCI--QAAELVLFL-Q--E------------------------------------------------------HFS--------------------------------------------------V-------LP-Q-------------SESAA---------------RPL----------VK--L--SR-AN-Q---LK------AW--G-EVV-RITAG-DK---W---APGKDRIKKTIA
      -_Streptacidiphilus_carbonis_755052115                                     L-I--AQ-EQDMLIK---CESAI-ENLRFAF-------WAAGKALQVIRN-----ARLYR--E---------Q--FE--T-FDEYTQ-------SKW-DI-TPQYANKLI--RTWRVAEAL----L------------------------------------------------------Q--------------------------P----------RSG------------GVLETIVSTK-L-------------GYGHA---------------WAL----------VP--L--VE-QH-S---VQ-AAVY-LY----MGI-VKVKG--------A-GVTAALVQGAVE
      -_Streptomyces_varsoviensis_664363832                                      V-T-----EQVIHAA---LAAGD-----AAI-------WVIGKALTVAAK-----GKFHR--D---------Q--GM--T-FDEYAR-------AET-GK-SPAHARRWM--DGAPLALAV-A--A------------------------------------------------------ATS-S--------------------------------------------------------TP---------------PEGHV---------------RPL----------RK--I--EK-EI-G---TR-PAIE-LY----RSA-DKASG-EG-G---R-KVTGAVLVEIRK
      -_Acaryochloris_marina_501119208                                           L-A--EL-EEIFLEP---PETGS-----DAL-------LSSGLALKTIQD-----NKLYL--P---------D--SK--G-FKVYVE-------ANL-GV-TYIHAFRCI--QAAELVLFL-Q--Q------------------------------------------------------HFS--------------------------------------------------V-------LP-Q-------------SESAA---------------RPL----------VK--L--SR-AN-Q---LK------AW--G-EVV-RITAG-DK---W---APGKDRIKKTIA
      -_Borrelia_hermsii_645011182                                               --------EEINART---LEEAV------NR-------VELAKALYEIKK-----NKLYR--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYI--KIGAYLEKE---------------------------------------------------------------N-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KKSSP----------ECKPVVIRFNLE
      -_Kitasatospora_sp_MBT66_759753188                                         F-A--GA-EYAARAN---IQQST-Q---QRV-------IVQGQILLAMRE-----EELWK--A---------LG-WT--D-FDVLVK-------HRF-NI-GRNYANKII--RSMPVVRAL----E------------------------------------------------------HVT-S--------------------------------------------------------ME-M-------------AEKHL---------------RAL----------VP--V--QE-RH-G---DE-AVRR-TW--E-EAL---RKG----------KITEKSLKEAAR
      -_Borrelia_crocidurae_644980358                                            --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK
      -_Streptomyces_griseofuscus_739811243                                      I-S--AA-QSAAETS---LIVAH-N---EFV-------MQAGPALIKIKE-----DDLWQ--A---------GG-YT--S-FKDYFE-------KKW-KY-TEQRAYQLI--RAVPVIAAL----K------------------------------------------------------GVA-T--------------------------------------------------------VK-I-------------NEGQC---------------REL----------AP--V--AR-DH-D---AA-TVQK-IW----RAA----------------ESKGKVTAKSLA
      -_Borrelia_coriaceae_645024919                                             --------EEIKART---LEEAV------NK-------LELAKALYEIKK-----NKLYR--F---------DG-YD--Y-FYEFCL--------DY-KF-SRTMIYKYI--RIGAYLEKE---------------------------------------------------------------D-----------------------------------------------------------V-------------KEQDI-IQ------------GSL----------NK--I------------IN----------D-IRV-KKSSS----------MCKPVIIKLNLE
      -_[Eubacterium]_siraeum_491496822                                          V----HQ-DLMIQEQ-V-AAQSL---------------TQIAIDLKEIRD-----RRLYA--E---L---G----YS--D-FAEYCE-------NAT-KT-GKRQAYNLI-----SLVEQY-----------------------------------------------------------KID-D----------------L-------------------------------S-----R-LA----------YL---GSTKL---------------IAL----------KS--L--GK-EE-----RE------------ELI-ESGKA--------E-ELSVRELKEKIK
      -_Xenococcus_sp_PCC_7305_493559029                                         I-A--GD-FDLQYLT---FEFAY-NQ--LAF-------VRNGLLLAKIKF-----LKLYK--N---------YG-DG--T-FASFCR-------EKL-RK-QRWQINDTI--RAARVVLEL-----------------------------------------------------------MYA-G--FD--------------------------------------------V-------LP-T-------------NISQA---------------IAL----------AK--L--TG-EK---L-VE------TW--R-SII-NIIPL--------D-KITAKSIRNLLN
      -_Streptomyces_violaceorubidus_663148255                                   I-L--AV-QYAARAN---HERAE-Q---QKL-------IGLGLRLQAMKD-----EELHK--T---------AG-FN--T-FGELTD-------SRF-GI-KKHQANNIL--RVMPVAQAL----E------------------------------------------------------DIT-T--------------------------------------------------------QE-L-------------KERPL---------------RVL----------VP--V--LE-AH-G---RE-AVRE-TW--L-EAA---RHG----------NVTDKTLMQAAN
      -_Borrelia_crocidurae_504495970                                            --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK
      -_Streptomyces_sp_CNH099_654240343                                         L-I--GE-EEEVFQR---CEAAV-ETLKFAF-------WAAGKGLQVIRD-----GRLYR--A---------T--HG--T-FDDYVQ-------DRW-GM-TRAQANKLI--RMWPIAEAL----F------------------------------------------------------ESE-A----------------------Q----------GSN------------D-LARTRAKR-L-------------SQSVV---------------WEL----------VP--V--AE-RY-D---VD-AAQH-LY----STT-VEASG--------G-EITAAVLKGAVA
      -_Deinococcus_peraridilitoris_505048307                                    L-T--PE-EKARLAA---LEQVV-GYGLRSF-------IEMAQALQEIQE-----RRLYR--E---------Q--YV--T-FEHYCM-------KVW-NF-SRTWAYQIM-QSKEAALLAL-----------------------------------------------------------DHG--------------------------------------------------V---P---VP---------------TERHA---------------RAL----------IG--V--SA-EN-----LE------IV--A-SVV-KAATG----K---E-NPTSADYQAVVE
      BCO_0118100_Borrelia_coriaceae_Co53_576095549                              --------EEIKART---LEEAV------NK-------LELAKALYEIKK-----NKLYR--F---------DG-YD--Y-FYEFCL--------DY-KF-SRTMIYKYI--RIGAYLEKE---------------------------------------------------------------D-----------------------------------------------------------V-------------KEQDI-IQ------------GSL----------NK--I------------IN----------D-IRV-KKSSS----------MCKPVIIKLNLE
      -_Chroococcidiopsis_thermalis_504967303                                    V-Q--QR-TKELKER---LQRTA-----QDI-------WDIGKKLVEVRA------------E-----LKG----HG--Y-FDAWLR-------AEF-GW-SRRTAYNFI-----YVYEAF-----------------------------------------------------------PYA-K----------------F--------------------------------------------------------AQMII-EP------------SAL----------YR--L--AS-PS-T---PD----A-IR--D-KFI-QQANA----G---S-KVTHKEVLKAVT
      -_Streptacidiphilus_anmyonensis_755076504                                  --Q--KE-QTETVIR---TAHAT---GKAAV-------WVIGQGIAIMTK-----GKLYR--R---------T--HS--T-LEDYVA-----E-LIP-DV-VPRQTRRWV--TGSKVALAI-A--T------------------------------------------------------RQG-E--------------------------------------------------------AP---------------VESQV---------------RKL----------TD--L--PE-EV-A---VE------LY----VAA-NDAAT-AA-G---Q-RLTAESLGQLAQ
      -_Streptomyces_sp_R1-NS-10_759540387                                       L-T--AD-EQERLAA---CVAGI-ELLSTAT-------WVAGKSLDTVAT-----GRLFR--VIPHKLEPERC--YK--T-IEEWSE-------TEY-GI-SRSRCSQLR--DGWELGEVL----T------------------------------------------------------VRG-H----------------------K---------------------------------AP----------------EGQV---------------REL----------VP--F--FK-QH-G---LK-AAVG-VY--E-MVV-QAAGA--------D-KITAKRLRETVK
      -_Streptomyces_rochei_690403288                                            I-Q--EA-DRRTELA---TEQIT-Q---QYL-------LWVGEPYRIVRD-----EELYR--V---------AG-YS--S-FDDWGR-------ALN-GR-SGDYMNKII--RVAPVVRAL----S------------------------------------------------------HIT-R--------------------------------------------------------RQ-L-------------KEQPL---------------RPL----------IA--V--QR-ER-G---DE-AVRR-CW--R-KAE---ASG----------DLTERGLRAAAV
      -_Cyanothece_sp_PCC_8802_506264213                                         M-S-----ISWATNE---IKQHL-----LNW-------CRVGIVAQQVKR-----FCKWK--D---------LK-LT--S-FKEYCE-------TIL-GV-SCGYINQII--KCAKVTLDL----A------------------------------------------------------SMG----FE--------------------------------------------V-------LP-T-------------NPSQA---------------KHL----------LK--F--EG-ED---L-KA------AW--Q-QVL-DENPK--------H-LITAKAIEKTLN
      -_Nostoc_punctiforme_501377550                                             L-------EQSILEG---IEAGK-----KGF-------QQAAQALLRIDE-----LALWR----------G-E--AV--S-FDAYRQ--------KF-----------------KAVLEDL---------------------------------------------------------------D------------------------------------------------I------------------------TDRHL---------------NRL--------------L--AA-EK-C---VQ------ML----RPI----------------GLNICTHKEVIS
      -_Borrelia_recurrentis_501533313                                           --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK
      -_Streptomyces_albidoflavus_663309700                                      L-N--PA-EVLRLNQ---AESQI-RAFGKAA-------AAAGEAFDMIKK-----DGLHH--H---------Y--GL--T-WAQYTL-------THW-GL-SASQADRLI--AAAPVMREL---------------------------------------------------------------S------------------------------------------------I----------A-------------NEGTA---------------REF--VSIY----RE--W--GA-TS-A---RA------IW----SGA-KEGAD--------G-KPTAKIANIAVK
      -_Streptomyces_sp_NRRL_S-1022_663349955                                    L-R--PD-ELEQLAE---CHRAI-DNARSAQ-------WMLGRALEIVRR-----RRLYR--G------DG----SR--T-WPQYLA-----A-EHD-GM-TERDARRLQ--EEWRLAKAV-----------------------------------------------------------QDA--------------------------------------------------L-----G-KP-A-------------PASHV---------------RAM----------LD--Y--AD-ST-S---DE-QAAH-AY----VML-RSAFE-AA-Q-V-R-LAAHQITARVAK
      -_Kitasatospora_sp_MBT66_759768668                                         L-I--AQ-ERDLLVK---CEAAL-ENLRIAY-------WAAGKALEAIRG-----GRLYR--A---------T--HG--T-FEAYCL-------ELW-DI-SPQYAGKLI--RAWRVAEKV----F------------------------------------------------------ESL-G----------------------P----------KSN------------D-LETIVSKR-L-------------GYGQA---------------WEL----------VA--L--SE-EH-G---VD-AAAL-LY----VAL-IQAKG--------M-ALTAAMVAGAAK
      -_Streptomyces_sp_Tu_6176_740047622                                        --------QKEQTEA-V-ISTAL-AAGDAAV-------WVIAQGLERAAK-----GRWWR--R---------T--HT--S-LGSYVE-----A-----KI-GRSAVYGRQ--LRKNAPLAL----E------------------------------------------------------TAH-K---------------------------------TGT--------------------VP---------------KPSQV---------------KVT----------SK--T--EE-QY-G---RE-AAVT-LY----EVV-RDVSS-EL-G---A-HPTADSLMAVHK
      -_Deinococcus_marmoris_736389644                                           L-E--PH-QKARLTA---LETTV-RDGLRDF-------RRTGQALSEIRD-----NEFFR--A---------G--YD--S-FESYLQ-------DRW-GF-TPPQAGRLM--EAADVAKVL----D------------------------------------------------------PLG--------------------------------------------------I-------QP-K-------------NEAQA---------------RTF----------KA-----AA----K-----------LV----TEM--------------E-PEQQRVVARLVE
      -_Nonomuraea_candida_759952224                                             L----GA-QEAHDGA---VERAD-----RYL-----T-LTQGLALEAVKK-----DDLWR--------LLG----FK--S-FQEYVE-------QRL-NI-SRQHAYKMM--QAAPVHRDL-------------------------------------------------------------------------------------------------------------------------P-H-------------VERLT-----F---------RQI--AIL-----AR--L--KD-AA-T---RQ----K-VW--------SLAEK------W-E-DTSPPSLQKAVD
      dsmv_2585_Desulfococcus_multivorans_DSM_2059_523467872                     I----ND-EDEDEFI---IEYEP-----PWF-------VQVGQALSHLKE-ALL-TEFPC--A---------E--QG--P-LPRWGE--TC---KEL-SI-SQSYANRLI--AAAEVYVAL----R------------------------------------------------------SAG--------------------------------------------------I-DEDD--LP-I-------------YERQV---------------RPL----------VR--F--KQ-DP-S-I-LK----Y-LW--E-EAL-VIAED-IE-F-N-S-LPRAGVVEYVVG
      Cflav_PD5941_Pedosphaera_parvula_Ellin514_223896866                        L-T--TE-ETTTLSE---CEAVI-EEGMKTF-------VEVGSAVLTISD-----RRLYR--A---------T--HS--T-FEDYIQ-------DKW-DM-TARRAYQLC--EAAEVVMKL----E------------------------------------------------------NVK-H---------------------------------ASQ------------I----------E-------------NARQA---------------EAL----------AK-----AP-EE-K---RD----E-VL--E-KAI----------Q---T-APEGKLTSKH--
      -_Streptomyces_turgidiscabies_493426429                                    L-M--AA-EVTKSKS---IAWSK-----LRW-T-----VETGAALRVLIE-----EDLYK--------EDP-E--FT--S-LETYAD-------NRL-HL-SRGHVYELV-DDASRLLAVA-----------------------------------------------------------PLS-E----------------I----------------SDK------------P-------FN-A-------------SQAKV-----L---------APL----------ME--V--YA-ED-G---VE-GGRT-----K-AEL-VVADV-DS-T-G-K-KRTAAALRKAAE
      -_Borrelia_hispanica_639481918                                             --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK
      -_Borrelia_persica_639480287                                               --------EEIKART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYI--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-ID------------SSL----------NK--I------------IN----------D-IRV-KRNSS----------YCKPVVVKFNLK
      -_Streptomyces_sp_NRRL_S-455_663210974                                     -----------ASHS---LKAAR-A---RFV-------VEAGTALRAIRD--ED-GGLYK--V---------T--HE--T-FEQYIS-------DRW-DM-DRSRAYQLI--DAAPTMNLL----S----------------------------------------------------------K------------------------------------------------I-FD----TA-P-------------VESQA---------------RAL----------AP--V--LE-AH-G---EE-AVRE-VV----VAV-KQAGA----------KVTAATIKEAAH
      -_Geminocystis_herdmanii_515865463                                         --------KYNQLKQ---ITNSI-KYNRINY-------IKLGLQLYQVKY-----YHLYK--N---------E--YK--S-FKDYCE-------KAV-YY-PVWRANQVI--DSANVAIKL----I------------------------------------------------------KLG----FN--------------------------------------------I-------IP-Q-------------NEAQA---------------RLL----------IK--L--NE-EQ---L-ME------KW--Q-EVL-NTYEP------Y-K-ITANRIERIVFG
      -_Opitutaceae_bacterium_TAV5_645069929                                     L-T--RD-ERDQLTR---CELNI-E---RNL-------SSIGAALKTIRD-----QRLYR--E---------T--HD--S-FDRYCY-------ERW-QK-SARWAHYQI--AAAIFAEEH-----------------------------------------------------------PEA-D---------------------------------GGE------------V-RR----IA-A-------------GRAPD--P------------SPA----------AK--A--AP-AK-P---TK-SD---RY--R-EQI-RALAE-MDREAL-D-RATQQSIGKLAE
      -_Deinococcus_misasensis_736313124                                         L-S--PA-ERLALQD---RESII-EGGQQAA-------KATWKALAEIHD-----LQLYR--E------------LG--T-WEDYLQ-------KRW-GI-KAAHGYRQV--AAAAIDVIL----T------------------------------------------------------GAG--------------------------------------------------V-------AI-P-------------TERRL---------------RPL--TKI-----LE--L--PE-DL-Q---DA------TA----RAM-KAIFG--------S-KPSSSEVEAFAE
      -_Streptomyces_sp_URHA0041_697211997                                       --------QLQRTEE-V-IAAAV-AAGDTAL-------WVIAQALERAAK-----GRWWR--A---------T--HD--S-LTAYVE-----E-----TV-GRSAVYVRQ--LRMNAPLAL----E------------------------------------------------------TAR-R---------------------------------TGT--------------------VP---------------KPSQI---------------KAT----------RK--T--EQ-RH-G---LD-AAVT-LY----EVV-RDVAA-EL-G---G-EPTARGLIAVHN
      -_Deinococcus_radiodurans_653294307                                        L-A--PH-EQQRLDD---LEQTV-EGGLRDF-------QRTGQALSEIRD-----NELYR--A---------T--HD--S-FEAYLQ-------DRW-GF-GVRQADRLI--DAAQVAKQL----E------------------------------------------------------PLG--------------------------------------------------I-------SP-R-------------HEAQA---------------RSF----------RP-----AA----R-----------IV----EEL--------------E-PEQQRLVARLVE
      -_Streptomyces_niveus_558889206                                            L-S--EE-DRLLLSQ---CEGRI-QAFGKAA-------ADAGEAFDTIKD-----KELHR--H---------Y--RM--T-WAEYTM-------ARW-GV-SVSQVDRLI--AAAPVMREL---------------------------------------------------------------S------------------------------------------------I----------P-------------NEGTA---------------REL--VPAY----RE--W--GA-DS-A---RA------LW----DGT-KEGSE--------N-KPTAKVLRAAVQ
      -_Streptomyces_griseus_702687171                                           L-I--GE-EQDHLAR---CEAAV-ETLKGAF-------WAAGKALQIIRD-----ARLYR--Q---------T--HG--T-FDAYCD-------DRW-GM-NRQYADKLI--RTWPIAEAL----Y------------------------------------------------------ERQLA----------------------A----------AKG------------E-TTPIGVKK-L-------------NQAQM---------------WEL----------VP--V--AD-SW-D---VD-AATF-VYETVADTV-VQVDG--------R-DVTAAVIQGAVK
      -_Streptomyces_sp_S4_498331267                                             L-S--EE-DRLLLSQ---CEGRI-QAFGKAA-------ADAGEAFDTIKN-----KELHR--H---------Y--RM--T-WAEYTL-------ARW-GV-SVSQVDRLI--AAAPVMREL---------------------------------------------------------------S------------------------------------------------I----------P-------------NEGTA---------------REL--VPAY----RD--W--GA-DS-A---RA------LW----NGT-KEGSG--------N-KPTAKVLKAAVQ
      -_Kitasatospora_sp_MBT66_759768796                                         L-T--AE-EEQRLAA---CVEGV-ELLSTAY-------WVAGKSLDTMAV-----GRLFR--KLPHRLEPARC--YA--T-IEEWAD-------VEH-GI-RQSRCSKLR--AGWELGEVL----N------------------------------------------------------AHG-H----------------------K---------------------------------VP----------------EGQV---------------REL----------VP--L--KN-RH-G---LK-AAVG-VY--Q-LVV-NAVGA--------E-KVTA--------
      -_Deinococcus_swuensis_746727627                                           L-E--PH-QKARLTA---LETTV-RDGLRDF-------RRTGQALSEIRD-----NEFFR--A---------G--YD--S-FEAYLQ-------DRW-GF-TPPQAGRLM--EAADVAKVL----D------------------------------------------------------PLG--------------------------------------------------I-------QP-R-------------NEAQA---------------RTF----------KA-----AA----K-----------IV----TEL--------------E-PEQQRVVARLVE
      -_Deinococcus_frigens_736394879                                            L-E--PH-QKARLTA---LETTV-RDGLRDF-------RRTGQALSEIRD-----NEFFR--A---------G--YG--S-FEAYLQ-------NRW-GF-TPPQAGRLM--DAADVARVL----D------------------------------------------------------PLG--------------------------------------------------I-------QP-K-------------NEAQA---------------RTF----------KA-----AA----R-----------IV----TEL--------------E-PEDQRVVARLVE
      GM3709_2810_Geminocystis_sp_NIES-3709_770473153                            --------KYNQLTH---ITNSI-KYNRINY-------IKLGMQLYQVRY-----YKLYK--S---------S--YT--S-FKDYCE-------KAV-YY-PVWRANQVI--ESASIAIKL----I------------------------------------------------------KAG----FN--------------------------------------------I-------IP-Q-------------NEAQA---------------RLL----------IK--L--NE-EE---L-MR------KW--Q-EVL-DTYEP------Y-K-ITANRIEKIVFG
      -_Streptomyces_sp_NRRL_F-5008_740027662                                    L-T--PE-ERADLET---CERAV-SGLQTAF-------TVAGKALATINQ-----ARLYK--E---------T--HS--S-FAAYVE-------DRW-GM-RKSQAYRLI--EAWPVAVAL-----------------------------------------------------------SSG-P----------------------N-------------------------V-------SP-R-------------GDTSA-------PPEKHV--RAL----------LP--V--VK-RH-G---LD-AARV-VY----EEL-REQDA--------R-VTTTRVTQAVRV
      -_Deinococcus_misasensis_736303905                                         L-T--SE-EQSQLDQ---LESTI-QQAVEQV-------KAGWHALKEIHD-----KGLYR--L------------YG--T-WEEYLQ-------KRW-NI-SATHGHRQV--AAAALDTIL----L------------------------------------------------------NAG--------------------------------------------------V-------VV-E-------------AERRL---------------RPL--TPL-----LD--L--PD-ED-Q---VA------IV----RTL-RETCG--------S-KPSTAQVKAFAE
      -_Streptomyces_halstedii_664288086                                         V-T-----ERVIHAA---LAAGD-----AAI-------WVIGKALTVAAK-----GKFHR--D---------Q--GM--T-FDEYAR-------AET-GK-SPAHARRWM--DGAPLALAV-A--A------------------------------------------------------ATS-S--------------------------------------------------------TP---------------PEGHV---------------RPL----------RK--I--EK-EI-G---TR-PAIE-LY----RSA-DKASG-EG-G---R-KVTGAVLVEIRK
      -_Streptomyces_bikiniensis_663184423                                       L-S--DQ-EREDVEA---CKAGV-DNLRNAF-------WVAGKSLETMST-----AKLHR--E---------E--NP--N-FAEWIW-------EKW-EI-SESNLYRLI--DEWRVGEAL----A------------------------------------------------------NLG-H---------------------------------------------------------K-P-------------LESHV---------------RKM----------TE--L--RR-QT-S---DK-VAIT-VY--D-TIA---RCR--------T-RVTGDLVEKVVN
      -_Streptomyces_sp_CNT372_739896924                                         L-T--PS-ERADLET---CERAV-SGLQTAF-------TVAGKALATINQ-----ARLYK--E---------T--HP--S-FAAYVE-------DRW-GM-RRAQAYRLI--EAWPVAVAL-----------------------------------------------------------SSG-P----------------------D-------------------------V-------SP-R-------------GDTSA-------PPERHV--RAL----------LP--V--VK-RH-G---LD-AARA-VY----EEL-REQDA--------R-VTTTRVTQAVRA
      -_Diplosphaera_colitermitum_759901356                                      L-T--RD-ERDQLTR---CEANI-E---RGL-------TAVGQALKTIRD-----NRLYR--E---------T--HD--S-FDIYCY-------ERW-QK-SVRWANYQI--AAAIFAGEH-----------------------------------------------------------PEA-E------------------------ITS------ERQ------------A-RA----LR-S-------------GGSTP--E------------ATS----------PA--A--TP-SK-P---TK-ND---RY--R-EQI-RALAE-MDREAL-D-RATQQSIAKLAE
      -_Opitutaceae_bacterium_TAV5_497194662                                     L-T--RE-ERDQLTR---CEANI-E---RGL-------TAVGQALKTIRD-----ARLYR--E---------T--HD--N-FEAYCY-------ERW-QR-SKRWANYQI--AAAIFAEEN-----------------------------------------------------------PEA-D---------------------------------EGE------------V-RR----IA-A-------------GRAPS--R------------ESA----------PA--A--AP-AK-P---TK-SD---RY--R-EQI-RALAE-MDREAL-D-RATQQSIGKLAE
      -_Pleurocapsa_sp_PCC_7319_518337503                                        I-E--GD-FNLDYIV---FEFAY-NR--LAY-------VRNGLLLAKLKF-----LKLYK--N---------FG-DG--T-FATFCR-------EQL-KI-TRWQVNDNI--KAARVCLEL-----------------------------------------------------------IYA-G--FE--------------------------------------------I-------LP-T-------------NISQA---------------IAL----------AS--L--AG-DE---L-IH------AW--R-SVI-ESIEP--------D-KITHKSIKSFLF
      -_Streptomyces_megasporus_671527277                                        L-T--EA-ERADLTT---CQAVL-QQHHASF-------WLTGKALETISK-----RRLYR--A---------D--HP--T-FEAFL--------EDW-DI-TPADAYRMM--NGWPLANRL-L--R------------------------------------------------------DVP-K--------------------------------------------------------LT-R-------------SHVEA-----L---------LPV--VNR-----YG--V--EA-AA-T-L-HA------LL--R-DSL--------------P-KVTAAAIAQVVR
      -_Lamprocystis_purpurea_521992951                                          M-T--AE-ECDLYVK---LFQKS-ESD-QRF-------Y-----LLKIRE-----EKGWK--A---------KG-FE--S-FDAFGE-------SVL-GV-TIGRLNQLA--RAAEVQLSI-----------------------------------------------------------GND-T------------------------------------------------I-VSK---IP----------------EGQL---------------RPL----------AP--L--TD-EE-R---RT------VW--A-EAT-AKAEE-DG-----R-KLTARLVQEAVD
      consensus/100%                                                             ............................................h....................................b...............................................................................................................................................................................................................................................................................
      consensus/95%                                                              ..................h.........................l..hpp......pha....................p.a..a...........h...........bh......h...h.............................................................................................................................................h.................h..............h.....................h...................................
      consensus/90%                                                              ........b.........hp..................h.....L..lpp......chab...............a...s.F..ah.........pa....s...s..hh......h...h.............................................................................................................................................h...............p.L..............h.....................h.........................s...h...h.
      consensus/85%                                                              h.......b...b.....hp..h...............h...bhL.plpc......chYc...............a...s.Fp.ah.........pa..h.sbp.s..hl...u..l.p.l.........................................................................................................................................pp..h...............+sL..............l....................ha....p....................s...h..hh.
      consensus/80%                                                              l.......cp..b.....hp..l...............hp..bhLbpl+-.....pchY+...............a...s.Fp.ahb........pa..h.scpps..hl...u..l.psl.........................................................................................................................................pp..h...............+sL..............l.............p......ha....psh..................ss..lpphh.
      consensus/75%                                                              L.......cpp.b.p...hcp.l...............hps.bhLbpl+-.....pchY+...............ap..o.Fcpahb........ca..h.s+pps.bhl...us.l.psl...............................................................p........................................................h................pp..l...............+sL..............L.....pp......p......ha....psh..................oubhlpphhp
      consensus/70%                                                              L.......cppbb.p...hcp.l...............hps.bhLbpl+-.....pchY+...............ap..o.Fcpahb........ca..h.u+ppsabhl...Assl.csl...............................................................p........................................................h..p.............ppp.l...............+sL..........hp..L..p..pp.p....p......ha..p.psh..................oubhlcplhp
      
      
      Back to Contents
    • General notes, phyletic distribution and domain architectures of the ParB-HTH domain associated with the Chlorophyte-type N6-MTases

      General notes:

      The ParB-type HTH domain which is found in the mobile cyanobacterial family of N6-MTases belongs to the larger ParB-like HTH which is pan-bacterial and is fused to the ParB-like nuclease domain. However, this family fused to the N6-MTase is a distinct subfamily and the eukaryotic versions is closely related to cyanobacterial versions. The prokaryotic homologs of these show a variety of interesting fusions such to the ASCH domain in 3 cyanobacteria, to ParB in Mycobacterium kansasii and Nitrolancea hollandica (Chloroflexi) (Distinct from the pan-bacterial ParB ),and to a cytosine methylase domain in cyanobacteria. Further, in cyanobacteria the domain is fused to a Tudor-like SH3 fold domain in Cyanobacteria. This tudor-like domain is likely to be involved in protein-protein interactions. In terms of gene neighborhoods, the ParB-HTH is mostly ParA associated in a phage terminase context, suggesting its role in partitioning. Note that in these systems there is no ParB nuclease domain, just the HTH. Some versions are associated with a transposase but this is not very common and could be just be background associations of mobile systems. The Eukaryotic versions appear closer to versions which associate with the ParA protein and play a role in partitioning. This suggests that in the eukaryotes, the ParB domain might recognize a specific sequence or a DNA feature.
      GI           Gene neighborhood                                                                                                                                                                             Domain-architecture    Pfam                      Gene name                Len  Taxonomy                                     Species                                                 Genbank annotation
      # 202; Mostly ParA associated in a terminase context                                                                                                                                                                                                                                                         
      576100681    BlyB-holin->orfD->Borrelia_orfA->DUF226->ParA->ParB-HTH*->?->Terminase_LS->                                                                                                                   ParB-HTH               Plasmid_parti             BAN_0003100              195  bacteria>spirochaetes                        Borrelia anserina BA2                                   Putative plasmid partition protein (plasmid) [Borrelia anserina BA2].                                             576100674_?->576100675_?->576100676_BlyB-holin->576100677_orfD->576100678_Borrelia_orfA->576100679_DUF226->576100680_ParA->576100681_ParB-HTH*->576100682_?->576100683_Terminase_LS-><-576100684_?<-576100685_?||576100686_?->576100687_?->576100688_?->
      503783548    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->XerD->?->?->Terminase_LS->                                                                                              ParB-HTH               Plasmid_parti             BBIDN127_RS05350         192  bacteria>spirochaetes                        Borrelia bissettii                                      permease [Borrelia bissettii].                                                                                    503783542_orfD->503783543_BdrA->503783544_Mlp-><-503783545_ERF||503783546_Borrelia_orfA->497943782_DUF226->503783547_ParA->503783548_ParB-HTH*->503783549_BdrA->503783518_XerD->503783554_?->503783555_?->503783556_Terminase_LS->
      576091528    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             BHW_0003100              191  bacteria>spirochaetes                        Borrelia hermsii MTW                                    Plasmid partition family protein (plasmid) [Borrelia hermsii MTW].                                                <-576091521_?||576091522_?->576091523_?-><-576091524_?||576091525_?-><-576091526_Terminase_LS<-576091527_?<-576091528_ParB-HTH*<-576091529_ParA<-576091530_DUF226<-576091531_Borrelia_orfA<-576091532_orfD<-576091533_BlyB-holin<-576091534_?<-576091535_?
      576092650    BlyB-holin->orfD->?->Borrelia_orfA->DUF226->ParA->ParB-HTH*->?->Terminase_LS->                                                                                                                ParB-HTH               Plasmid_parti             BHO_0003100              191  bacteria>spirochaetes                        Borrelia hermsii YBT                                    Plasmid partition family protein (plasmid) [Borrelia hermsii YBT].                                                576092643_?->576092644_BlyB-holin->576092645_orfD->576092646_?->576092647_Borrelia_orfA->576092648_DUF226->576092649_ParA->576092650_ParB-HTH*->576092651_?->576092652_Terminase_LS->576092653_?-><-576092654_?<-576092655_?<-576092656_?||576092657_?->
      576094173    BlyB-holin->orfD->Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-?||?->?->Terminase_LS->                                                                                                           ParB-HTH               Plasmid_parti             BCO_0003100              191  bacteria>spirochaetes                        Borrelia coriaceae Co53                                 Plasmid partition family protein (plasmid) [Borrelia coriaceae Co53].                                             576094166_?->576094167_?->576094168_BlyB-holin->576094169_orfD->576094170_Borrelia_orfA->576094171_DUF226->576094172_ParA->576094173_ParB-HTH*-><-576094174_?||576094175_?->576094176_?->576094177_Terminase_LS->576094178_?-><-576094179_?<-576094180_?
      576105484    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             BHY_1114                 191  bacteria>spirochaetes                        Borrelia hermsii YOR                                    Plasmid partition family protein (plasmid) [Borrelia hermsii YOR].                                                <-576105477_?||576105478_?->576105479_?-><-576105480_?||576105481_?-><-576105482_Terminase_LS<-576105483_?<-576105484_ParB-HTH*<-576105485_ParA<-576105486_DUF226<-576105487_Borrelia_orfA<-576105488_orfD<-576105489_BlyB-holin<-576105490_?<-576105491_?
      497942632    BlyB-holin->Borrelia_lipo_2->orfD-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase||?->?->Terminase_LS->                                                   ParB-HTH               Plasmid_parti             BBUN40_RS06040           186  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           497942599_BlyB-holin->497942602_Borrelia_lipo_2->497942605_orfD-><-504353774_ERF||497942614_Borrelia_orfA->497942617_DUF226->497942626_ParA->497942632_ParB-HTH*->497942636_BdrA->504353775_BppA->497943617_XerD-><-497943614_Phage-integrase||497942645_?->497942648_?->497943610_Terminase_LS->
      201084505    <-Lipoprotein_2<-Lipoprotein_2<-?<-?<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226||Borrelia_orfA->?-><-BdrA                                                          ParB-HTH               SP+Plasmid_parti          BDU_7025                 205  bacteria>spirochaetes                        Borrelia duttonii Ly                                    PF49 plasmid partition protein (plasmid) [Borrelia duttonii Ly].                                                  <-201084498_Lipoprotein_2<-201084499_Lipoprotein_2<-201084500_?<-201084501_?<-201084502_Lipoprotein_2<-201084503_Lipoprotein_2<-201084504_Lipoprotein_2<-201084505_ParB-HTH*<-201084506_ParA<-201084507_DUF226||201084508_Borrelia_orfA->201084509_?-><-201084510_BdrA
      644979506    DUF226->ParA->ParB-HTH*->                                                                                                                                                                     ParB-HTH               Plasmid_parti             BHW_RS04950              204  bacteria>spirochaetes                        Borrelia hermsii                                        permease, partial [Borrelia hermsii].                                                                             752506912_?->644979504_DUF226->644979505_ParA->644979506_ParB-HTH*->
      501533114    DUF226->ParA->ParB-HTH*->Lipoprotein_2->?-><-?||?->Lipoprotein_2-><-Mlp                                                                                                                       ParB-HTH               Plasmid_parti             BRE_RS05055              199  bacteria>spirochaetes                        Borrelia recurrentis                                    permease [Borrelia recurrentis].                                                                                  <-501533109_?<-752506110_?<-752506368_?<-752506369_?||752506370_?->752506374_DUF226->501533113_ParA->501533114_ParB-HTH*->752506375_Lipoprotein_2->501533115_?-><-501533116_?||501533117_?->501533118_Lipoprotein_2-><-501533119_Mlp||752506371_?->
      501533328    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-?||?-><-?<-ParB-HTH*<-ParA<-DUF226<-?||Lipoprotein_2->Lipoprotein_2->                                                           ParB-HTH               Plasmid_parti             BDU_RS07380              199  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-501533318_Lipoprotein_2<-752506111_Lipoprotein_2<-501533321_Lipoprotein_2<-752506107_Lipoprotein_2<-501533323_?||501533325_?-><-501533326_?<-501533328_ParB-HTH*<-501533113_ParA<-752506112_DUF226<-752506108_?||501533331_Lipoprotein_2->752506113_Lipoprotein_2->752506109_?->752506110_?->
      576092807    <-Lipoprotein_2||?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                  ParB-HTH               Plasmid_parti             BHO_0016000              199  bacteria>spirochaetes                        Borrelia hermsii YBT                                    Putative plasmid partition protein (plasmid) [Borrelia hermsii YBT].                                              <-576092804_?<-576092805_Lipoprotein_2||576092806_?-><-576092807_ParB-HTH*<-576092808_ParA<-576092809_DUF226<-576092810_Borrelia_orfA||576092811_?->576092812_?->576092813_?-><-576092814_?
      644980725    <-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-?||BdrA->Lipoprotein_2->                                                                                                                            ParB-HTH               SP+Plasmid_parti          BCD_RS06505              199  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-749307931_Lipoprotein_2<-644980725_ParB-HTH*<-644980726_ParA<-644980727_DUF226<-644980728_?||644980729_BdrA->644980730_Lipoprotein_2->644980732_?->644980733_?->
      576093010    <-SSB<-BppA<-?<-BppA||?->DUF226->ParA->ParB-HTH*->ERF->?->?->Lipoprotein_2->?->Lipoprotein_2->Lipoprotein_2->                                                                                 ParB-HTH               SP+Plasmid_parti          BHO_0006701              198  bacteria>spirochaetes                        Borrelia hermsii YBT                                    Putative plasmid partition protein (plasmid) [Borrelia hermsii YBT].                                              <-576093003_SSB<-576093004_BppA<-576093005_?<-576093006_BppA||576093007_?->576093008_DUF226->576093009_ParA->576093010_ParB-HTH*->576093011_ERF->576093012_?->576093013_?->576093014_Lipoprotein_2->576093015_?->576093016_Lipoprotein_2->576093017_Lipoprotein_2->
      639481672    <-Lipoprotein_2<-?<-?<-Lipoprotein_2||?->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->?->Lipoprotein_2->Lipoprotein_2->                                                             ParB-HTH               Plasmid_parti             U880_RS0100260           198  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-740572702_Lipoprotein_2<-639481666_?<-639481667_?<-639481668_Lipoprotein_2||639481669_?->639481670_DUF226->639481671_ParA->639481672_ParB-HTH*->740572705_Lipoprotein_2->639481674_Lipoprotein_2->639481675_?->740572708_Lipoprotein_2->639481677_Lipoprotein_2->
      695263537    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               SP+Plasmid_parti          PF-49                    197  bacteria>spirochaetes                        Borrelia burgdorferi                                    PF-49 protein [Borrelia burgdorferi].                                                                             695208857_?->695208858_ParA->695263537_ParB-HTH*->695208860_?->
      736012165    Borrelia_orfA->DUF226->ParA->ParB-HTH*->Lipoprotein_2->                                                                                                                                       ParB-HTH               Plasmid_parti             I871_B18                 197  bacteria>spirochaetes                        Borrelia miyamotoi LB-2001                              plasmid partition protein (plasmid) [Borrelia miyamotoi LB-2001].                                                 736012158_?->736012159_?-><-736012160_?<-736012161_?||736012162_Borrelia_orfA->736012163_DUF226->736012164_ParA->736012165_ParB-HTH*->736012166_Lipoprotein_2->
      576102339    <-Lipoprotein_2<-Lipoprotein_2<-?<-?<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-?<-?||?-><-BdrA                                                                                  ParB-HTH               SP+Plasmid_parti          BCD_1474                 195  bacteria>spirochaetes                        Borrelia crocidurae DOU                                 Putative plasmid partition protein (plasmid) [Borrelia crocidurae DOU].                                           <-576102332_?<-576102333_Lipoprotein_2<-576102334_Lipoprotein_2<-576102335_?<-576102336_?<-576102337_Lipoprotein_2<-576102338_Lipoprotein_2<-576102339_ParB-HTH*<-576102340_ParA<-576102341_DUF226<-576102342_?<-576102343_?||576102344_?-><-576102345_BdrA||576102346_?->
      501669395    Borrelia_lipo_1->Borrelia_lipo_1->Borrelia_lipo_1-><-?||Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-?||METHYLASE->                                                                              ParB-HTH               SP+Plasmid_parti          BAFPKO_RS06170           194  bacteria>spirochaetes                        Borrelia afzelii                                        permease [Borrelia afzelii].                                                                                      500023161_Borrelia_lipo_1->500023159_Borrelia_lipo_1->500023158_Borrelia_lipo_1-><-500023156_?||500023155_Borrelia_orfA->500023154_DUF226->500023153_ParA->501669395_ParB-HTH*-><-500023148_?||500023144_METHYLASE->500023143_?->500023142_?->500023141_?->
      501532751    DUF226->ParA->ParB-HTH*->                                                                                                                                                                     ParB-HTH               Plasmid_parti             Q7M_RS06865              193  bacteria>spirochaetes                        Borrelia                                                MULTISPECIES: permease [Borrelia].                                                                                504499987_DUF226->504499988_ParA->501532751_ParB-HTH*->
      639480216    Borrelia_orfA->DUF226->ParA->ParB-HTH*->                                                                                                                                                      ParB-HTH               Plasmid_parti             U881_RS0101565           193  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      639480213_Borrelia_orfA->639480214_DUF226->639480215_ParA->639480216_ParB-HTH*->
      639480329    <-ERF<-ParB-HTH*<-ParA<-DUF226                                                                                                                                                                ParB-HTH               Plasmid_parti             U881_RS0102340           193  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      <-639480328_ERF<-639480329_ParB-HTH*<-639480330_ParA<-639480331_DUF226<-639480332_?
      639481645    <-BppA||Borrelia_orfA->DUF226->ParA->ParB-HTH*->ERF->                                                                                                                                         ParB-HTH               Plasmid_parti             U880_RS0100085           193  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-740572680_BppA||639481642_Borrelia_orfA->639481643_DUF226->639481644_ParA->639481645_ParB-HTH*->740572681_ERF->
      644980942    Borrelia_orfA->DUF226->ParA->ParB-HTH*->ERF-><-Mlp||BdrA->?->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->                                                                                    ParB-HTH               Plasmid_parti             BCD_RS07715              193  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-644980938_?<-749308044_?<-749308051_?<-749308045_?||644980939_Borrelia_orfA->644980940_DUF226->644980941_ParA->644980942_ParB-HTH*->749308052_ERF-><-644980943_Mlp||644980944_BdrA->644980945_?->749308046_Lipoprotein_2->644980947_Lipoprotein_2->644980948_Lipoprotein_2->
      740577602    <-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                                 ParB-HTH               Plasmid_parti             U881_RS10530             193  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      <-639480289_ERF<-740577602_ParB-HTH*<-639480290_ParA<-639480291_DUF226<-639480292_Borrelia_orfA
      496158399    orfD-><-?||Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase||Borrelia_lipo_1->                                                                         ParB-HTH               SP+Plasmid_parti          NM71_RS06435             192  bacteria>spirochaetes                        Borrelia burgdorferi group                              MULTISPECIES: permease [Borrelia burgdorferi group].                                                              499186263_orfD-><-497944296_?||499186264_Mlp-><-499186265_ERF||499186266_Borrelia_orfA->499186267_DUF226->499186268_ParA->496158399_ParB-HTH*->497944503_BdrA->499186269_BppA->499186270_XerD-><-763427946_Phage-integrase||499186272_Borrelia_lipo_1-><-499186279_?||499186280_?->
      506379547    ParA->ParB-HTH-><-Borrelia_lipo_1<-?<-?<-XerD<-BdrA<-ParB-HTH*<-Borrelia_lipo_2                                                                                                               ParB-HTH               Plasmid_parti             BVAVS116_RS05635         192  bacteria>spirochaetes                        Borrelia valaisiana                                     permease [Borrelia valaisiana].                                                                                   506379527_ParA->506379528_ParB-HTH-><-750014218_Borrelia_lipo_1<-506379538_?<-506379539_?<-506379541_XerD<-506379546_BdrA<-506379547_ParB-HTH*<-750014206_Borrelia_lipo_2<-750014207_?<-506379572_?<-506379580_?<-506379581_?<-506379584_?<-506379590_?
      645063171    Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->                                                                                                                                                ParB-HTH               SP+Plasmid_parti          BHY_RS06695              192  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      749302146_Borrelia_orfA->645063169_DUF226->645063170_ParA->645063171_ParB-HTH*->645063172_BdrA->
      657235060    Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->                                                                                                                                                ParB-HTH               SP+Plasmid_parti          DZ03_RS0105960           192  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      657235057_Borrelia_orfA->657235058_DUF226->657235059_ParA->657235060_ParB-HTH*->657235061_BdrA->
      695262165    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               SP+Plasmid_parti          YP_009075307.1           192  bacteria>spirochaetes                        Borrelia burgdorferi                                    putative plasmid partition protein; orf3 [Borrelia burgdorferi].                                                  695198281_?->695198282_ParA->695262165_ParB-HTH*->695198284_?->
      695262844    <-BdrA<-ParB-HTH*<-ParA<-DUF226<-?||BppA->                                                                                                                                                    ParB-HTH               SP+Plasmid_parti          YP_009076506.1           192  bacteria>spirochaetes                        Borrelia hermsii                                        hypothetical protein [Borrelia hermsii].                                                                          <-695199855_BdrA<-695262844_ParB-HTH*<-695199857_ParA<-695199858_DUF226<-695199859_?||695199860_BppA->
      501533150    <-BdrA||Mlp-><-SSB||Borrelia_lipo_2->?->DUF226->ParA->ParB-HTH*-><-Mlp||?->BdrA->?->Lipoprotein_2->Lipoprotein_2->                                                                            ParB-HTH               SP+Plasmid_parti          BRE_RS05250              191  bacteria>spirochaetes                        Borrelia recurrentis                                    permease [Borrelia recurrentis].                                                                                  <-501533144_BdrA||501533145_Mlp-><-752506379_SSB||752506380_Borrelia_lipo_2->501533147_?->501533148_DUF226->501533149_ParA->501533150_ParB-HTH*-><-501533151_Mlp||501533152_?->501533153_BdrA->752506381_?->501533154_Lipoprotein_2->501533155_Lipoprotein_2->501533156_?->
      501533271    DUF226->ParA->ParB-HTH*->BppA->SSB->SSB->XerD-><-Phage-integrase<-Mlp                                                                                                                         ParB-HTH               SP+Plasmid_parti          BDU_RS05365              191  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     501533268_?->501533269_DUF226->501533270_ParA->501533271_ParB-HTH*->752505977_BppA->752505978_SSB->752505984_SSB->501533274_XerD-><-501533275_Phage-integrase<-501533276_Mlp||501533277_?->
      639481710    <-ERF||?->DUF226->ParA->ParB-HTH*->BppA->                                                                                                                                                     ParB-HTH               Plasmid_parti             U880_RS0100505           191  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-740572765_ERF||639481707_?->639481708_DUF226->639481709_ParA->639481710_ParB-HTH*->639481711_BppA->
      644980346    <-ERF||DUF226->ParA->ParB-HTH*->BppA->                                                                                                                                                        ParB-HTH               SP+Plasmid_parti          BCD_RS04485              191  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-749307677_ERF||644980344_DUF226->644980345_ParA->644980346_ParB-HTH*->644980347_BppA->
      645024139    <-ERF<-ParB-HTH*<-ERF<-ParB-HTH<-ParA<-DUF226<-Borrelia_orfA||BppA-><-Borrelia_orfA                                                                                                           ParB-HTH               SP+Plasmid_parti          T431_RS0107620           191  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    654876536_?->645023990_?->740579677_?->740579685_?->645023977_?-><-654876537_ERF<-645024139_ParB-HTH*<-740579687_ERF<-654876538_ParB-HTH<-645024753_ParA<-645024752_DUF226<-654876539_Borrelia_orfA||740579689_BppA-><-645024131_Borrelia_orfA
      740582129    <-BppA<-ParB-HTH*<-ParA<-DUF226<-?||ERF-><-orfD<-Borrelia_lipo_2                                                                                                                              ParB-HTH               Plasmid_parti             BDCR2A_RS06520           191  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-740582126_BppA<-740582129_ParB-HTH*<-501533270_ParA<-501533269_DUF226<-740582133_?||740582135_ERF-><-740582138_orfD<-740582140_Borrelia_lipo_2<-740582144_?
      497942842    Borrelia_orfA->DUF226->ParA->ParB-HTH*->MultiTM-><-Borrelia_lipo_1<-?<-Borrelia_lipo_1                                                                                                        ParB-HTH               SP+Plasmid_parti          NM71_RS06980             190  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease [Borrelia burgdorferi].                                                                                  497942808_?->497942819_?->499192785_?-><-497942827_?||499192780_Borrelia_orfA->497942834_DUF226->497942837_ParA->497942842_ParB-HTH*->499192782_MultiTM-><-499192790_Borrelia_lipo_1<-501897251_?<-499192791_Borrelia_lipo_1<-499192784_?<-499192786_?<-499192787_?
      501928245    Borrelia_lipo_1->Borrelia_lipo_1-><-?||Borrelia_orfA->DUF226->ParA->ParB-HTH*->MultiTM-><-Borrelia_lipo_1<-Borrelia_lipo_1<-?<-Borrelia_lipo_1||?-><-Borrelia_lipo_1                          ParB-HTH               SP+Plasmid_parti          BSV1_RS05810             190  bacteria>spirochaetes                        Borrelia finlandensis                                   permease [Borrelia finlandensis].                                                                                 497942808_?->501928244_Borrelia_lipo_1->501928236_Borrelia_lipo_1-><-748691647_?||501928229_Borrelia_orfA->501928253_DUF226->501928228_ParA->501928245_ParB-HTH*->501928259_MultiTM-><-501928239_Borrelia_lipo_1<-501928262_Borrelia_lipo_1<-501928248_?<-501928226_Borrelia_lipo_1||501928258_?-><-748691650_Borrelia_lipo_1
      504496248    <-BdrA||Mlp->?->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->?->?->Lipoprotein_2->                                                                    ParB-HTH               Plasmid_parti             Q7M_RS06620              190  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-504496241_BdrA||752505230_Mlp->752505231_?->504496245_DUF226->504496247_ParA->504496248_ParB-HTH*->504496250_Lipoprotein_2->504496251_Lipoprotein_2->504496252_Lipoprotein_2->504496253_Lipoprotein_2->752505232_?->752505233_?->504496261_Lipoprotein_2->
      576102765    Mlp->?-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?->SSB->XerD-><-Mlp                                                                                                                    ParB-HTH               Plasmid_parti             BCD_1877                 190  bacteria>spirochaetes                        Borrelia crocidurae DOU                                 Putative plasmid partition protein (plasmid) [Borrelia crocidurae DOU].                                           <-576102758_?<-576102759_?<-576102760_?<-576102761_?||576102762_Mlp->576102763_?-><-576102764_ERF<-576102765_ParB-HTH*<-576102766_ParA<-576102767_DUF226<-576102768_Borrelia_orfA||576102769_?->576102770_SSB->576102771_XerD-><-576102772_Mlp
      576313683    Mlp->?-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?->SSB->SSB->XerD->                                                                                                                    ParB-HTH               Plasmid_parti             BDCR2A_01333             190  bacteria>spirochaetes                        Borrelia duttonii CR2A                                  putative plasmid partition protein [Borrelia duttonii CR2A].                                                      <-576313676_?<-576313677_?<-576313678_?<-576313679_?||576313680_Mlp->576313681_?-><-576313682_ERF<-576313683_ParB-HTH*<-576313684_ParA<-576313685_DUF226<-576313686_Borrelia_orfA||576313687_?->576313688_SSB->576313689_SSB->576313690_XerD->
      639482667    DUF226->ParA->ParB-HTH*->?->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->                                                                                       ParB-HTH               Plasmid_parti             U880_RS0105985           190  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    639481986_?->639482665_DUF226->639482666_ParA->639482667_ParB-HTH*->639482668_?->740573672_Lipoprotein_2->740573666_Lipoprotein_2->740573675_Lipoprotein_2->740573680_Lipoprotein_2->740573669_Lipoprotein_2->
      740581845    <-BdrA<-?||?->?->ParA->ParB-HTH*->Lipoprotein_2->?->Lipoprotein_2->                                                                                                                           ParB-HTH               Plasmid_parti             BDCR2A_RS05930           190  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-740581832_BdrA<-740581835_?||740581837_?->740581840_?->740581843_ParA->740581845_ParB-HTH*->740581850_Lipoprotein_2->740581848_?->740581853_Lipoprotein_2->
      52696733     Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-Borrelia_lipo_1<-ERF                                                                                                                                ParB-HTH               SP+Plasmid_parti          BGP219                   189  bacteria>spirochaetes                        Borrelia garinii PBi                                    hypothetical protein BGP219 [Borrelia garinii PBi].                                                               52696730_Borrelia_orfA->52696731_DUF226->52696732_ParA->52696733_ParB-HTH*-><-52696734_Borrelia_lipo_1<-52696735_ERF<-52696736_?
      501710213    <-ParB-HTH*<-ParA<-DUF226                                                                                                                                                                     ParB-HTH               SP+Plasmid_parti          DY95_RS0104625           189  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-501710213_ParB-HTH*<-501710214_ParA<-501710226_DUF226
      501898261    Borrelia_lipo_1-><-?||?-><-Borrelia_lipo_1||Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-Borrelia_lipo_1                                                                                         ParB-HTH               SP+Plasmid_parti          BSPA14S_RS06005          189  bacteria>spirochaetes                        Borrelia spielmanii                                     permease [Borrelia spielmanii].                                                                                   501897677_Borrelia_lipo_1-><-750018078_?||501898236_?-><-501898240_Borrelia_lipo_1||501898252_Borrelia_orfA->501898255_DUF226->501898258_ParA->501898261_ParB-HTH*-><-501898271_Borrelia_lipo_1<-501898277_?<-750018079_?
      503789140    <-METHYLASE||?->?-><-Borrelia_lipo_1<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?-><-?<-Borrelia_lipo_1                                                                                         ParB-HTH               SP+Plasmid_parti          BBIDN127_RS06445         189  bacteria>spirochaetes                        Borrelia bissettii                                      permease [Borrelia bissettii].                                                                                    <-503789128_METHYLASE||503789130_?->503789134_?-><-503789135_Borrelia_lipo_1<-503789140_ParB-HTH*<-503789141_ParA<-503789142_DUF226<-763175385_Borrelia_orfA||763175386_?-><-763175387_?<-763175388_Borrelia_lipo_1<-503789147_?
      506379500    Lipoprotein_2->ERF->Borrelia_lipo_1-><-ParB-HTH*<-ParA<-DUF226<-?<-?<-?||MultiTM->Borrelia_orfA->                                                                                             ParB-HTH               SP+Plasmid_parti          BVAVS116_RS05510         189  bacteria>spirochaetes                        Borrelia valaisiana                                     permease [Borrelia valaisiana].                                                                                   506379494_Lipoprotein_2->506379496_ERF->506379499_Borrelia_lipo_1-><-506379500_ParB-HTH*<-750014204_ParA<-506379502_DUF226<-506379503_?<-506379508_?<-506379515_?||750014216_MultiTM->506379525_Borrelia_orfA->
      657234804    <-BdrA<-?||?->?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?->Borrelia_lipo_1->?-><-Borrelia_lipo_1                                                                                           ParB-HTH               SP+Plasmid_parti          DZ03_RS0103740           189  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      657234800_?-><-696419413_BdrA<-657234801_?||657234802_?->657234803_?-><-657234804_ParB-HTH*<-657234805_ParA<-657234806_DUF226<-657234807_Borrelia_orfA||696419402_?->696419404_Borrelia_lipo_1->657234808_?-><-657234809_Borrelia_lipo_1
      657248004    <-BdrA<-DUF226<-?<-?<-ParB-HTH*<-ParA||?->Borrelia_lipo_1->?-><-Borrelia_lipo_1                                                                                                               ParB-HTH               SP+Plasmid_parti          DZ00_RS0100090           189  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-671502334_BdrA<-696414189_DUF226<-657247698_?<-671502337_?<-657248004_ParB-HTH*<-671502341_ParA||696414192_?->696414195_Borrelia_lipo_1->501710212_?-><-696414197_Borrelia_lipo_1
      671520434    Borrelia_lipo_1-><-ParB-HTH*<-ParA<-DUF226                                                                                                                                                    ParB-HTH               SP+Plasmid_parti          DY88_RS0103995           189  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      696421322_Borrelia_lipo_1-><-671520434_ParB-HTH*<-696418599_ParA<-671478354_DUF226<-671478348_?
      671556237    Borrelia_lipo_1-><-?<-Borrelia_lipo_1<-?<-?||ParA->ParB-HTH*-><-Borrelia_lipo_1||?-><-ERF                                                                                                     ParB-HTH               SP+Plasmid_parti          DY90_RS0100165           189  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      501710551_?->671556248_Borrelia_lipo_1-><-696410895_?<-696419824_Borrelia_lipo_1<-696419827_?<-671556244_?||696419830_ParA->671556237_ParB-HTH*-><-671556756_Borrelia_lipo_1||671556757_?-><-671556759_ERF<-696410899_?||671556761_?->
      576102351    <-BppA<-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-><-orfD<-Borrelia_lipo_2                                                                                                            ParB-HTH               Plasmid_parti             BCD_1485                 188  bacteria>spirochaetes                        Borrelia crocidurae DOU                                 hypothetical protein BCD_1485 (plasmid) [Borrelia crocidurae DOU].                                                <-576102349_BppA<-576102350_BdrA<-576102351_ParB-HTH*<-576102352_ParA<-576102353_DUF226<-576102354_Borrelia_orfA||576102355_ERF-><-576102356_orfD<-576102357_Borrelia_lipo_2
      576313055    <-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-Borrelia_orfA||ERF->                                                                                                                          ParB-HTH               Plasmid_parti             BDCR2A_01875             188  bacteria>spirochaetes                        Borrelia duttonii CR2A                                  putative plasmid partition protein [Borrelia duttonii CR2A].                                                      <-576313054_BdrA<-576313055_ParB-HTH*<-576313056_ParA<-576313057_DUF226<-576313058_Borrelia_orfA<-576313059_Borrelia_orfA||576313060_ERF->
      645010853    <-BdrA<-Lipoprotein_2||Mlp-><-XerD<-SSB||?->ParA->ParB-HTH*->?->Lipoprotein_2->Lipoprotein_2-><-Lipoprotein_2<-Lipoprotein_2                                                                  ParB-HTH               Plasmid_parti             BHO_RS05800              188  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-645010825_BdrA<-645010828_Lipoprotein_2||645010831_Mlp-><-645010834_XerD<-749299211_SSB||645010845_?->645010850_ParA->645010853_ParB-HTH*->645010859_?->749299212_Lipoprotein_2->749299213_Lipoprotein_2-><-645010867_Lipoprotein_2<-645010869_Lipoprotein_2
      749307948    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-BdrA||?->Borrelia_orfA->                                                                               ParB-HTH               Plasmid_parti             BCD_RS06730              188  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-644980757_Lipoprotein_2<-644980758_Lipoprotein_2<-644980759_Lipoprotein_2<-749307952_Lipoprotein_2<-749307948_ParB-HTH*<-749307950_ParA<-644980763_DUF226<-644980764_BdrA||644980765_?->749307954_Borrelia_orfA->
      654876319    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2||?-><-ParB-HTH*<-ParA<-DUF226<-?<-Mlp<-Lipoprotein_2                                               ParB-HTH               Plasmid_parti             T431_RS0103865           187  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    <-645024031_Lipoprotein_2<-645024030_Lipoprotein_2<-740579181_Lipoprotein_2<-645024306_Lipoprotein_2<-740579183_Lipoprotein_2<-654876318_Lipoprotein_2||645024312_?-><-654876319_ParB-HTH*<-645024315_ParA<-654876320_DUF226<-645024321_?<-740579185_Mlp<-645024327_Lipoprotein_2
      740582299    Mlp->Phage-integrase-><-XerD<-SSB<-SSB<-ParB-HTH*<-ParA<-DUF226<-?||ERF->                                                                                                                     ParB-HTH               Plasmid_parti             BDCR2A_RS06845           187  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-740582282_?||740582285_Mlp->740582288_Phage-integrase-><-740582292_XerD<-740582315_SSB<-740582296_SSB<-740582299_ParB-HTH*<-740582304_ParA<-740582307_DUF226<-740582310_?||740582313_ERF->
      752506021    <-Lipoprotein_2<-Lipoprotein_2<-?<-?<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-?||?->Borrelia_orfA->                                              ParB-HTH               Plasmid_parti             BDU_RS06030              187  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-752506020_Lipoprotein_2<-752506028_Lipoprotein_2<-501533013_?<-501533014_?<-501533015_Lipoprotein_2<-501533016_Lipoprotein_2<-501533017_Lipoprotein_2<-752506021_ParB-HTH*<-501533019_ParA<-501533020_DUF226<-752506022_Borrelia_orfA<-752506023_?||752506024_?->752506029_Borrelia_orfA->752506030_?->
      763123871    <-XerD<-?<-?<-BppA||?->DUF226->ParA->ParB-HTH*->ERF->ERF-><-Borrelia_lipo_2<-BlyB-holin                                                                                                       ParB-HTH               Plasmid_parti             BOM_RS05530              187  bacteria>spirochaetes                        Borrelia miyamotoi                                      permease [Borrelia miyamotoi].                                                                                    <-645073224_XerD<-645073225_?<-645073226_?<-763123878_BppA||645073228_?->645073229_DUF226->645073230_ParA->763123871_ParB-HTH*->763123880_ERF->645073134_ERF-><-763123873_Borrelia_lipo_2<-645073231_BlyB-holin<-645073136_?<-645073232_?
      497943336    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                           ParB-HTH               Plasmid_parti             NM71_RS05150             186  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           499186329_orfD->499186330_BdrA->499186331_Mlp-><-499186332_ERF||499186319_Borrelia_orfA->499186320_DUF226->499186321_ParA->497943336_ParB-HTH*->499186322_BdrA->499186323_BppA->499186293_XerD-><-499186324_Phage-integrase||499186325_?->499186184_?->497943302_?->
      500023248    orfD->BdrA-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->                                                                                                                             ParB-HTH               Plasmid_parti             BAFPKO_RS06545           186  bacteria>spirochaetes                        Borrelia afzelii                                        chromosome partitioning protein [Borrelia afzelii].                                                               500023234_?->504299370_orfD->500023238_BdrA-><-500023243_ERF||500023245_Borrelia_orfA->500023246_DUF226->500023247_ParA->500023248_ParB-HTH*->500023249_BdrA-><-500023251_?<-500023252_?<-500023253_?||500023254_?->500023255_?->500023259_?->
      501533243    <-Borrelia_orfA||?-><-MultiTM<-ThiF<-ParB-HTH*<-ParA||Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->                                              ParB-HTH               Plasmid_parti             BDU_RS05290              186  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-752505976_Borrelia_orfA||501533240_?-><-501533241_MultiTM<-501533242_ThiF<-501533243_ParB-HTH*<-501533244_ParA||501533245_Lipoprotein_2->501533246_Lipoprotein_2->752505975_Lipoprotein_2->501533247_Lipoprotein_2->501533249_Lipoprotein_2->501533250_Lipoprotein_2->
      501704211    BlyB-holin->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->                                                                                                                                     ParB-HTH               Plasmid_parti             BGAPBR_RS05150           186  bacteria>spirochaetes                        Borrelia garinii                                        chromosome partitioning protein [Borrelia garinii].                                                               696411817_?->501704216_?->501704255_BlyB-holin->501704201_Mlp->501704219_Borrelia_orfA->501704207_DUF226->501704227_ParA->501704211_ParB-HTH*->501704234_?-><-501704233_?
      501710973    Mlp->DUF226->ParA->ParB-HTH*->XerD->?->BlyB-holin->BdrA->Mlp->Borrelia_orfA->                                                                                                                 ParB-HTH               Plasmid_parti             BGAFAR04_RS04830         186  bacteria>spirochaetes                        Borrelia garinii                                        chromosome partitioning protein [Borrelia garinii].                                                               696414862_?->501710881_?->501710891_?->501710887_?->501710904_Mlp->501704207_DUF226->501710876_ParA->501710973_ParB-HTH*->501711020_XerD->501710905_?->696414864_BlyB-holin->501710896_BdrA->501711040_Mlp->501711010_Borrelia_orfA->696414865_?->
      504299060    Borrelia_lipo_2->orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                  ParB-HTH               Plasmid_parti             BAFPKO_RS04750           186  bacteria>spirochaetes                        Borrelia afzelii                                        chromosome partitioning protein [Borrelia afzelii].                                                               504299052_Borrelia_lipo_2->504299053_orfD->504299054_BdrA->504299055_Mlp-><-504299056_ERF||504299058_Borrelia_orfA->504299059_ParA->504299060_ParB-HTH*->504299061_BdrA->504299062_BppA->504299063_XerD-><-501574879_Phage-integrase||504299064_?->504299065_?-><-504299066_?
      504496143    <-Lipoprotein_2<-BdrA||?->?-><-ParB-HTH*<-ParA                                                                                                                                                ParB-HTH               Plasmid_parti             Q7M_RS05445              186  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-504496141_Lipoprotein_2<-504496142_BdrA||752505150_?->752505151_?-><-504496143_ParB-HTH*<-504496144_ParA||752505152_?->752505153_?->752505154_?->
      504496216    <-Lipoprotein_2<-?<-BdrA<-?<-?<-MultiTM<-ThiF<-ParB-HTH*<-ParA||Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->?->METHYLASE->                                                    ParB-HTH               Plasmid_parti             Q7M_RS05925              186  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-752505181_Lipoprotein_2<-752505182_?<-752505187_BdrA<-752505183_?<-752505188_?<-504496214_MultiTM<-504496215_ThiF<-504496216_ParB-HTH*<-504496217_ParA||504496218_Lipoprotein_2->752505184_Lipoprotein_2->752505189_Lipoprotein_2->752505190_Lipoprotein_2->504496225_?->752505191_METHYLASE->
      504509606    <-SSB<-SSB<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-><-orfD                                                                                                                               ParB-HTH               Plasmid_parti             BCD_RS07150              186  bacteria>spirochaetes                        Borrelia crocidurae                                     chromosome partitioning protein [Borrelia crocidurae].                                                            <-644980837_SSB<-644980838_SSB<-504509606_ParB-HTH*<-644980840_ParA<-644980841_DUF226<-644980842_Borrelia_orfA||749308014_ERF-><-644980844_orfD
      639481943    <-BppA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                                ParB-HTH               Plasmid_parti             U880_RS0101780           186  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-740573056_BppA<-639481943_ParB-HTH*<-639481944_ParA<-639481945_DUF226<-639481946_Borrelia_orfA
      639482644    <-Borrelia_orfA<-?<-MultiTM<-ThiF<-ParB-HTH*<-ParA<-?||Lipoprotein_2->Lipoprotein_2->?->Lipoprotein_2->Lipoprotein_2->                                                                        ParB-HTH               Plasmid_parti             U880_RS0105870           186  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-740573640_Borrelia_orfA<-639482641_?<-639482642_MultiTM<-740573637_ThiF<-639482644_ParB-HTH*<-639482645_ParA<-639482646_?||639482647_Lipoprotein_2->740573643_Lipoprotein_2->639482648_?->639482649_Lipoprotein_2->740573646_Lipoprotein_2->
      644980530    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-?||?->ParA->ParB-HTH*->                                                                                                                        ParB-HTH               Plasmid_parti             BCD_RS05395              186  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-644980522_?<-749307822_Lipoprotein_2<-644980525_Lipoprotein_2<-749307824_Lipoprotein_2<-644980527_?||644980528_?->644980529_ParA->644980530_ParB-HTH*->
      657235047    <-ParB-HTH*<-ParA<-DUF226                                                                                                                                                                     ParB-HTH               Plasmid_parti             DZ03_RS0105875           186  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-657235047_ParB-HTH*<-657235048_ParA<-657235049_DUF226
      671563339    <-BdrA<-ParB-HTH*<-ParA                                                                                                                                                                       ParB-HTH               Plasmid_parti             DZ19_RS0105860           186  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease [Borrelia burgdorferi].                                                                                  <-501600498_BdrA<-671563339_ParB-HTH*<-671563341_ParA
      740582201    <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF->                                                                                                                                               ParB-HTH               Plasmid_parti             BDCR2A_RS06670           186  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-740582201_ParB-HTH*<-740582204_ParA<-740582206_DUF226<-740582209_Borrelia_orfA||740582211_ERF->
      499186196    orfD-><-?||Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                            ParB-HTH               Plasmid_parti             NM71_RS05580             185  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           497944293_orfD-><-499186190_?||499186191_Mlp-><-499186192_ERF||499186193_Borrelia_orfA->499186194_DUF226->499186195_ParA->499186196_ParB-HTH*->499186197_BdrA->499186198_BppA->499186199_XerD-><-499186200_Phage-integrase||499186325_?->499186184_?->763427928_?->
      501574765    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                           ParB-HTH               Plasmid_parti             BAFPKO_RS04960           185  bacteria>spirochaetes                        Borrelia afzelii                                        chromosome partitioning protein [Borrelia afzelii].                                                               504299145_orfD->504299146_BdrA->504299147_Mlp-><-504299148_ERF||504299149_Borrelia_orfA->504299150_DUF226->504299151_ParA->501574765_ParB-HTH*->504299152_BdrA->504299153_BppA->504299154_XerD-><-504299155_Phage-integrase||504299156_?-><-504299157_?||504299158_?->
      503783569    BlyB-holin->Borrelia_lipo_2->orfD->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                           ParB-HTH               Plasmid_parti             BBIDN127_RS05685         185  bacteria>spirochaetes                        Borrelia bissettii                                      permease [Borrelia bissettii].                                                                                    503783561_BlyB-holin->503783467_Borrelia_lipo_2->503783562_orfD->503783563_Mlp->503783566_Borrelia_orfA->503783567_DUF226->503783568_ParA->503783569_ParB-HTH*->503783570_BdrA->503783571_BppA->503783572_XerD-><-503783573_Phage-integrase||503783574_?->503783575_?-><-763175365_?
      576103756    orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->?->?->XerD->                                                                                                                       ParB-HTH               Plasmid_parti             BOM_0964                 185  bacteria>spirochaetes                        Borrelia miyamotoi FR64b                                Plasmid partition family protein (plasmid) [Borrelia miyamotoi FR64b].                                            576103754_orfD-><-576103755_ERF<-576103756_ParB-HTH*<-576103757_ParA<-576103758_DUF226<-576103759_Borrelia_orfA||576103760_BppA->576103761_?->576103762_?->576103763_XerD->
      657235558    DUF226->ParA->ParB-HTH*->                                                                                                                                                                     ParB-HTH               Plasmid_parti             DY94_RS0102615           185  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      696420534_?->657235554_DUF226->657235555_ParA->657235558_ParB-HTH*->
      695263564    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               Plasmid_parti             PF-49                    185  bacteria>spirochaetes                        Borrelia burgdorferi                                    PF-49 protein [Borrelia burgdorferi].                                                                             695208921_?->695208922_ParA->695263564_ParB-HTH*->695208924_?->
      696415789    <-XerD<-BppA<-BdrA<-ParB-HTH*<-ParA<-Borrelia_orfA||ERF-><-Mlp<-orfD<-BlyB-holin                                                                                                              ParB-HTH               Plasmid_parti             DM10_RS00505             185  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-696415784_?||696411761_?-><-696415785_?<-696415786_XerD<-696415787_BppA<-696415788_BdrA<-696415789_ParB-HTH*<-696415790_ParA<-696415791_Borrelia_orfA||696415792_ERF-><-696415793_Mlp<-696415794_orfD<-696415795_BlyB-holin<-696415796_?
      499186290    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-><-?||BppA->XerD-><-Phage-integrase                                                                                      ParB-HTH               Plasmid_parti             BB_O33                   184  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           11497260_orfD->11497261_BdrA->11497262_Mlp-><-11497280_ERF||11497237_Borrelia_orfA->11497238_DUF226->11497239_ParA->499186290_ParB-HTH*->11497241_BdrA-><-11497263_?||11497242_BppA->11497243_XerD-><-11497244_Phage-integrase||11497245_?->11497246_?->
      740577610    <-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                                 ParB-HTH               SP+Plasmid_parti          U881_RS0102215           184  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      <-639480299_ERF<-740577610_ParB-HTH*<-639480301_ParA<-639480302_DUF226<-639480303_Borrelia_orfA
      764988637    DUF226->ParA->ParB-HTH*-><-?||Lipoprotein_2->                                                                                                                                                 ParB-HTH               Plasmid_parti             I871_RS04285             184  bacteria>spirochaetes                        Borrelia miyamotoi                                      chromosome partitioning protein [Borrelia miyamotoi].                                                             764988629_?->764988630_?-><-764988631_?<-764988632_?<-764988633_?||764988634_DUF226->764988636_ParA->764988637_ParB-HTH*-><-764988642_?||764988638_Lipoprotein_2->
      201084318    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon||?-><-orfD<-BlyB-holin                                                                    ParB-HTH               Plasmid_parti             BDU_1115                 183  bacteria>spirochaetes                        Borrelia duttonii Ly                                    PF49 plasmid partition protein (plasmid) [Borrelia duttonii Ly].                                                  <-201084311_?<-201084312_?||201084313_?->201084314_?->201084315_?-><-201084316_Terminase_LS<-201084317_?<-201084318_ParB-HTH*<-201084319_ParA<-201084320_DUF226<-201084321_Borrelia_orfA<-201084322_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon||201084323_?-><-201084324_orfD<-201084325_BlyB-holin
      499192746    DUF226->ParA->ParB-HTH*->                                                                                                                                                                     ParB-HTH               Plasmid_parti             NM71_RS06515             183  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           499192760_?->499192743_?->501902608_?->499192744_DUF226->499192745_ParA->499192746_ParB-HTH*-><-763427949_?<-497942530_?<-499192761_?<-497942522_?<-497942517_?<-499192762_?<-497945500_?
      639482723    <-SSB<-SSB||?->DUF226->ParA->ParB-HTH*->ERF->                                                                                                                                                 ParB-HTH               Plasmid_parti             U880_RS0106340           183  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-639482718_SSB<-639482719_SSB||639482720_?->639482721_DUF226->639482722_ParA->639482723_ParB-HTH*->740573743_ERF->
      644981026    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2||?-><-BdrA<-Terminase_LS||?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-Mlp<-Phage-integrase||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-Mlp    ParB-HTH               Plasmid_parti             BCD_RS08230              183  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-644981019_Lipoprotein_2<-749308080_Lipoprotein_2<-644981020_Lipoprotein_2||644981021_?-><-749308079_BdrA<-644981023_Terminase_LS||749308081_?-><-644981026_ParB-HTH*<-644981027_ParA<-644981028_DUF226<-644981029_Borrelia_orfA<-644981030_Mlp<-644981031_Phage-integrase||644981032_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-749308082_Mlp
      740581624    Mlp-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||XerD-><-Mlp                                                                                                                                    ParB-HTH               Plasmid_parti             BDCR2A_RS05505           183  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     740581621_Mlp-><-740581624_ParB-HTH*<-644981027_ParA<-740581626_DUF226<-740581629_Borrelia_orfA||740581631_XerD-><-740581634_Mlp
      741043351    Borrelia_orfA->DUF226->ParA->ParB-HTH*->                                                                                                                                                      ParB-HTH               Plasmid_parti             OY14_04355               182  bacteria>spirochaetes                        Borrelia chilensis                                      chromosome partitioning protein (plasmid) [Borrelia chilensis].                                                   741043344_?->741043345_?-><-741043346_?||741043347_?->741043348_Borrelia_orfA->741043349_DUF226->741043350_ParA->741043351_ParB-HTH*-><-741043352_?<-741043353_?
      145652250    ParA->ParB-HTH*->?->?->?->Terminase_LS->                                                                                                                                                      ParB-HTH               Plasmid_parti             ABP88177.1               181  bacteria>spirochaetes                        Borrelia lonestari                                      hypothetical protein [Borrelia lonestari].                                                                        145652251_ParA->145652250_ParB-HTH*->145652252_?->145652253_?->145652254_?->145652255_Terminase_LS->
      504496098    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             Q7M_RS05150              181  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   <-504496090_?<-504496091_?<-504496092_?||504496093_?->504496094_?-><-504496096_Terminase_LS<-504496097_?<-504496098_ParB-HTH*<-501532856_ParA<-501532978_DUF226<-504496099_Borrelia_orfA<-504496100_orfD<-504496101_BlyB-holin<-504496102_?<-504496103_?
      504509673    <-BppA<-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-><-orfD                                                                                                                             ParB-HTH               Plasmid_parti             BCD_RS06770              181  bacteria>spirochaetes                        Borrelia crocidurae                                     chromosome partitioning protein [Borrelia crocidurae].                                                            <-644980768_BppA<-644980769_BdrA<-504509673_ParB-HTH*<-644980770_ParA<-644980771_DUF226<-644980772_Borrelia_orfA||644980773_ERF-><-749307961_orfD
      519700232    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             BTA100                   181  bacteria>spirochaetes                        Borrelia turicatae                                      hypothetical protein [Borrelia turicatae].                                                                        <-541862209_?||541862210_?->541862211_?->541862212_?-><-541862213_?<-541862214_Terminase_LS<-541862215_?<-519700232_ParB-HTH*<-541862217_ParA<-541862218_DUF226<-541862219_Borrelia_orfA<-541862220_orfD<-541862221_BlyB-holin<-541862222_?<-541862223_?
      639481996    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             U880_RS0102040           181  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-639481989_?<-639481990_?||639481991_?->639481992_?->639481993_?-><-639481994_Terminase_LS<-639481995_?<-639481996_ParB-HTH*<-639481997_ParA<-639481998_DUF226<-639481999_Borrelia_orfA<-639482000_orfD<-639482001_BlyB-holin<-639482002_?<-639482003_?
      639482094    Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->                                                                                                                                          ParB-HTH               SP+Plasmid_parti          U880_RS0102635           181  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    639482091_Borrelia_orfA->639482092_DUF226->639482093_ParA->639482094_ParB-HTH*->639482095_BdrA->740573199_BppA->
      644922901    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             X966_RS04750             181  bacteria>spirochaetes                        Borrelia parkeri                                        permease [Borrelia parkeri].                                                                                      <-749302062_?<-644922885_?||644922887_?->644922891_?-><-644922894_?<-644922896_Terminase_LS<-644922898_?<-644922901_ParB-HTH*<-644922903_ParA<-519700238_DUF226<-749302063_Borrelia_orfA<-749302064_orfD<-644922913_BlyB-holin<-644922915_?<-644922917_?
      644979602    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             BHW_RS05475              181  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-752506919_?||644979596_?->644979597_?-><-644979598_?||644979599_?-><-644979600_Terminase_LS<-644979601_?<-644979602_ParB-HTH*<-644979603_ParA<-644979604_DUF226<-644979605_Borrelia_orfA<-644979606_orfD<-644979607_BlyB-holin<-644979608_?<-644979609_?
      644980614    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             BDCR2A_RS04950           181  bacteria>spirochaetes                        Borrelia                                                MULTISPECIES: permease [Borrelia].                                                                                <-740581381_?<-740581385_?||740581388_?->740581391_?->740581394_?-><-740581398_Terminase_LS<-740581400_?<-644980614_ParB-HTH*<-740581403_ParA<-501532978_DUF226<-740581405_Borrelia_orfA<-740581407_orfD<-740581409_BlyB-holin<-504496102_?<-740581412_?
      645023282    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             T431_RS0103125           181  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    <-645023301_?<-740579078_?<-645023296_?||645023293_?-><-645023290_?<-645023287_Terminase_LS<-645023284_?<-645023282_ParB-HTH*<-645023279_ParA<-645023274_DUF226<-740579080_Borrelia_orfA<-645023268_orfD<-645023266_BlyB-holin<-645023264_?<-645023261_?
      645024774    <-XerD<-?<-SSB<-SSB||Borrelia_orfA->DUF226->ParB-HTH*->ERF->                                                                                                                                  ParB-HTH               Plasmid_parti             BCO_RS07330              181  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    <-645024768_?<-645023953_XerD<-645024769_?<-752507031_SSB<-645024771_SSB||645024772_Borrelia_orfA->645024773_DUF226->645024774_ParB-HTH*->645024775_ERF->
      645048715    BlyB-holin->orfD->Borrelia_orfA->DUF226->ParA->ParB-HTH*->?->Terminase_LS->                                                                                                                   ParB-HTH               Plasmid_parti             BAN_RS04870              181  bacteria>spirochaetes                        Borrelia anserina                                       permease [Borrelia anserina].                                                                                     645048709_?->752506871_?->752506872_BlyB-holin->752506873_orfD->752506876_Borrelia_orfA->645048713_DUF226->645048714_ParA->645048715_ParB-HTH*->645048716_?->645048717_Terminase_LS-><-645048718_?<-645048719_?||645048720_?->645048721_?->645048722_?->
      645062976    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                   ParB-HTH               Plasmid_parti             BHY_RS05115              181  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-749302082_?||645062973_?->645062974_?-><-644979598_?||644979599_?-><-645062975_Terminase_LS<-644979601_?<-645062976_ParB-HTH*<-644979603_ParA<-644979604_DUF226<-645062977_Borrelia_orfA<-644979606_orfD<-644979607_BlyB-holin<-644979608_?<-644979609_?
      645073074    <-Terminase_LS<-?<-ParB-HTH*<-ParA<-Borrelia_orfA<-orfD<-BlyB-holin                                                                                                                           ParB-HTH               Plasmid_parti             BOM_RS04520              181  bacteria>spirochaetes                        Borrelia miyamotoi                                      permease [Borrelia miyamotoi].                                                                                    <-645073067_?<-763123742_?<-763123715_?||645073070_?->645073071_?-><-645073072_Terminase_LS<-645073073_?<-645073074_ParB-HTH*<-645073075_ParA<-763123743_Borrelia_orfA<-645073077_orfD<-645073078_BlyB-holin<-645073079_?<-645073080_?<-645073081_?
      740582639    <-BdrA<-ParB-HTH*<-ParA<-DUF226||ERF->                                                                                                                                                        ParB-HTH               Plasmid_parti             BDCR2A_RS07425           181  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-740582637_BdrA<-740582639_ParB-HTH*<-740582641_ParA<-740582643_DUF226||740582645_ERF->
      501588721    BdrA->Mlp-><-ERF||?->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                              ParB-HTH               Plasmid_parti             NM71_RS06225             180  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease [Borrelia burgdorferi].                                                                                  499186212_BdrA->499186213_Mlp-><-499186214_ERF||742499587_?->499186215_Borrelia_orfA->499186216_DUF226->499186217_ParA->501588721_ParB-HTH*->499186218_BdrA->499186219_BppA->499186220_XerD-><-499186221_Phage-integrase||499186222_?->499186223_?->501895123_?->
      696413767    <-ParB-HTH*<-ParA<-DUF226                                                                                                                                                                     ParB-HTH               Plasmid_parti             DZ02_RS01000000108520    180  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-696413767_ParB-HTH*<-671547733_ParA<-671547734_DUF226
      639480295    <-ERF<-ParB-HTH*<-ParA<-DUF226                                                                                                                                                                ParB-HTH               Plasmid_parti             U881_RS0102195           179  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      <-639480294_ERF<-639480295_ParB-HTH*<-639480296_ParA<-639480297_DUF226
      740577787    <-Lipoprotein_2||?-><-MultiTM<-?||?->DUF226->ParA->ParB-HTH*->                                                                                                                                ParB-HTH               Plasmid_parti             U881_RS0103180           179  bacteria>spirochaetes                        Borrelia persica                                        permease, partial [Borrelia persica].                                                                             <-639480427_Lipoprotein_2||639480428_?-><-639480429_MultiTM<-639480430_?||639480431_?->639480432_DUF226->639480433_ParA->740577787_ParB-HTH*->
      639480863    <-ERF<-ParB-HTH*<-ParA<-DUF226<-?||BppA->                                                                                                                                                     ParB-HTH               Plasmid_parti             U881_RS0105670           178  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      <-639480862_ERF<-639480863_ParB-HTH*<-639480864_ParA<-639480865_DUF226<-639480866_?||639480867_BppA->
      645024479    <-MultiTM<-ThiF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Borrelia_orfA->                            ParB-HTH               Plasmid_parti             T431_RS0104570           178  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    740579281_?-><-645024494_MultiTM<-645024491_ThiF||654876359_Borrelia_orfA->645024484_DUF226->645024481_ParA->645024479_ParB-HTH*->645024476_Lipoprotein_2->740579290_Lipoprotein_2->740579284_Lipoprotein_2->645024470_Lipoprotein_2->740579287_Lipoprotein_2->740579293_Lipoprotein_2->654876362_Borrelia_orfA->
      576102548    Borrelia_lipo_2->orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->BppA->SSB->XerD->                                                                                                 ParB-HTH               Plasmid_parti             BCD_1669                 177  bacteria>spirochaetes                        Borrelia crocidurae DOU                                 Putative plasmid partition protein (plasmid) [Borrelia crocidurae DOU].                                           <-576102541_?<-576102542_?<-576102543_?||576102544_?->576102545_Borrelia_lipo_2->576102546_orfD-><-576102547_ERF<-576102548_ParB-HTH*<-576102549_ParA<-576102550_DUF226<-576102551_Borrelia_orfA||576102552_BppA->576102553_BppA->576102554_SSB->576102555_XerD->
      644979720    <-Lipoprotein_2<-BdrA||?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                            ParB-HTH               Plasmid_parti             BHW_RS06110              177  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-644979715_Lipoprotein_2<-749302125_BdrA||644979718_?-><-644979720_ParB-HTH*<-644979721_ParA<-644979722_DUF226<-644979723_Borrelia_orfA
      645010591    DUF226->ParA->ParB-HTH*->BdrA-><-Lipoprotein_2<-Lipoprotein_2<-?<-Lipoprotein_2<-BdrA<-Terminase_LS                                                                                           ParB-HTH               Plasmid_parti             BHO_RS05255              177  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      645010587_DUF226->645010590_ParA->645010591_ParB-HTH*->645010594_BdrA-><-645010596_Lipoprotein_2<-645010599_Lipoprotein_2<-645010602_?<-645010609_Lipoprotein_2<-645010612_BdrA<-645010615_Terminase_LS
      644979647    <-ParB-HTH*<-ParA<-DUF226<-Lipoprotein_2<-BdrA                                                                                                                                                ParB-HTH               Plasmid_parti             BHY_RS05885              176  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-644979648_?<-644979647_ParB-HTH*<-749302115_ParA<-645063075_DUF226<-749302116_Lipoprotein_2<-645063078_BdrA
      645010701    <-ERF<-BdrA<-ParB-HTH*<-ParA<-DUF226<-?||BppA->                                                                                                                                               ParB-HTH               SP+Plasmid_parti          BHO_RS05480              176  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-645010695_ERF<-645010698_BdrA<-645010701_ParB-HTH*<-645010704_ParA<-645010707_DUF226<-645010709_?||645010711_BppA->
      645023888    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BdrA-><-Lipoprotein_2<-BdrA||Mlp->         ParB-HTH               Plasmid_parti             T431_RS0103305           176  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    <-654876291_Lipoprotein_2<-740579113_Lipoprotein_2<-654876292_Lipoprotein_2<-740579100_Lipoprotein_2<-740579103_Lipoprotein_2<-740579116_Lipoprotein_2<-645023885_Lipoprotein_2<-645023888_ParB-HTH*<-645023890_ParA<-654876296_DUF226<-645023892_Borrelia_orfA||645023893_BdrA-><-645023896_Lipoprotein_2<-645023900_BdrA||645023901_Mlp->
      645063163    <-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                                ParB-HTH               Plasmid_parti             BHY_RS06640              176  bacteria>spirochaetes                        Borrelia hermsii                                        chromosome partitioning protein [Borrelia hermsii].                                                               <-645063162_BdrA<-645063163_ParB-HTH*<-644979703_ParA<-645063164_DUF226<-645063165_Borrelia_orfA
      763123770    orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->?->?->XerD->                                                                                                                       ParB-HTH               Plasmid_parti             BOM_RS04635              176  bacteria>spirochaetes                        Borrelia miyamotoi                                      hypothetical protein [Borrelia miyamotoi].                                                                        645073095_orfD-><-763123769_ERF<-763123770_ParB-HTH*<-645073098_ParA<-645073099_DUF226<-645073100_Borrelia_orfA||763123772_BppA->645073102_?->645073103_?->763123766_XerD->
      497943851    <-BdrA<-ParB-HTH*<-ParA                                                                                                                                                                       ParB-HTH               Plasmid_parti             DZ10_RS0105695           175  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           <-671559758_BdrA<-497943851_ParB-HTH*<-671579798_ParA
      501704894    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                           ParB-HTH               Plasmid_parti             BBUJD1_RS00885           175  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           488735288_orfD->499186212_BdrA->501883602_Mlp-><-504352924_ERF||504352925_Borrelia_orfA->504352926_DUF226->488735209_ParA->501704894_ParB-HTH*->504352927_BdrA->763417052_BppA->488735217_XerD-><-497943318_Phage-integrase||504352928_?->501704944_?->499186310_?->
      501928340    orfD-><-?||Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-><-Phage-integrase||?->?->Terminase_LS->                                                                                  ParB-HTH               Plasmid_parti             BSV1_RS06320             175  bacteria>spirochaetes                        Borrelia finlandensis                                   chromosome partitioning protein [Borrelia finlandensis].                                                          748691687_orfD-><-748691689_?||501928371_Mlp-><-748691701_ERF||501928405_Borrelia_orfA->501928365_DUF226->501928300_ParA->501928340_ParB-HTH*->501928369_BdrA-><-501928327_Phage-integrase||501928367_?->748691703_?->501928390_Terminase_LS->501928324_?->748691705_?->
      503783476    Borrelia_lipo_2->orfD->BdrA->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                 ParB-HTH               Plasmid_parti             BBIDN127_RS04525         175  bacteria>spirochaetes                        Borrelia bissettii                                      chromosome partitioning protein [Borrelia bissettii].                                                             503783467_Borrelia_lipo_2->503783468_orfD->503783469_BdrA->503783470_Mlp->503783473_Borrelia_orfA->503783474_DUF226->503783475_ParA->503783476_ParB-HTH*->503783477_BdrA->503783478_BppA->503783479_XerD-><-503783480_Phage-integrase||503783486_?->
      504299035    Borrelia_lipo_2->orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                  ParB-HTH               Plasmid_parti             BAFPKO_RS04540           175  bacteria>spirochaetes                        Borrelia afzelii                                        chromosome partitioning protein [Borrelia afzelii].                                                               504299028_Borrelia_lipo_2->504299029_orfD->504299030_BdrA->504299031_Mlp-><-504299032_ERF||504299033_Borrelia_orfA->504299034_ParA->504299035_ParB-HTH*->504299036_BdrA->504299037_BppA->504299038_XerD-><-504299039_Phage-integrase||504299040_?->763173059_?->763173060_?->
      226232418    <-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                                            ParB-HTH               Plasmid_parti             BBUBOL26_W05             174  bacteria>spirochaetes                        Borrelia burgdorferi Bol26                              putative plasmid partition protein (plasmid) [Borrelia burgdorferi Bol26].                                        <-226232414_ERF||226232415_Borrelia_orfA->226232416_DUF226->226232417_ParA->226232418_ParB-HTH*->226232419_BdrA->226232420_BppA->226232421_XerD-><-226232422_Phage-integrase||226232423_?->226232424_?->226232425_?->
      493479353    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                           ParB-HTH               Plasmid_parti             BAFPKO_RS05375           174  bacteria>spirochaetes                        Borrelia burgdorferi group                              MULTISPECIES: chromosome partitioning protein [Borrelia burgdorferi group].                                       493479326_orfD->504299104_BdrA->504299105_Mlp-><-504299106_ERF||504299107_Borrelia_orfA->493479359_DUF226->501574801_ParA->493479353_ParB-HTH*->501574776_BdrA->504299109_BppA->504299110_XerD-><-504299111_Phage-integrase||504299112_?->504299113_?->763173064_?->
      501531293    <-Borrelia_orfA||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->?->?->Borrelia_orfA->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->                                        ParB-HTH               Plasmid_parti             BDU_RS04395              174  bacteria>spirochaetes                        Borrelia duttonii                                       chromosome partitioning protein [Borrelia duttonii].                                                              <-752505939_Borrelia_orfA||501531286_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->501531287_?->501531288_?->752505936_Borrelia_orfA->501531291_DUF226->501531292_ParA->501531293_ParB-HTH*->501531294_Lipoprotein_2->752505940_Lipoprotein_2->752505937_Lipoprotein_2->
      501894859    BlyB-holin->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                                  ParB-HTH               Plasmid_parti             BVAVS116_RS06000         174  bacteria>spirochaetes                        Borrelia valaisiana                                     chromosome partitioning protein [Borrelia valaisiana].                                                            501894843_?->501894848_?->501894849_BlyB-holin->501894852_Mlp->501894856_Borrelia_orfA->501894857_DUF226->501894858_ParA->501894859_ParB-HTH*->501894860_BdrA->501894861_BppA->501894862_XerD-><-501894863_Phage-integrase||501894864_?->501894865_?-><-750014224_?
      503783725    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase                                                                                           ParB-HTH               Plasmid_parti             BBIDN127_RS05150         174  bacteria>spirochaetes                        Borrelia bissettii                                      chromosome partitioning protein [Borrelia bissettii].                                                             503783718_orfD->503783719_BdrA->503783720_Mlp-><-503783721_ERF||503783723_Borrelia_orfA->497943782_DUF226->503783724_ParA->503783725_ParB-HTH*->503783726_BdrA->503783727_BppA->503783728_XerD-><-503783729_Phage-integrase||503783730_?->503783731_?->503783732_?->
      504499910    <-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BdrA->HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-Mlp||BdrA->                                                                        ParB-HTH               Plasmid_parti             BCD_RS05470              174  bacteria>spirochaetes                        Borrelia crocidurae                                     chromosome partitioning protein [Borrelia crocidurae].                                                            <-749307847_Lipoprotein_2<-504499910_ParB-HTH*<-644980543_ParA<-644980544_DUF226<-644980545_Borrelia_orfA||644980546_BdrA->644980547_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-644980548_Mlp||644980550_BdrA->
      504509579    <-Lipoprotein_2<-Lipoprotein_2<-?<-?||Borrelia_lipo_2->orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||SSB->SSB->                                                                        ParB-HTH               Plasmid_parti             BCD_RS07490              174  bacteria>spirochaetes                        Borrelia crocidurae                                     chromosome partitioning protein [Borrelia crocidurae].                                                            <-644980895_Lipoprotein_2<-749308030_Lipoprotein_2<-644980897_?<-644980898_?||749308035_Borrelia_lipo_2->749308031_orfD-><-749308036_ERF<-504509579_ParB-HTH*<-644980901_ParA<-504509577_DUF226<-749308032_Borrelia_orfA||644980902_SSB->749308037_SSB->
      644979702    <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                                      ParB-HTH               Plasmid_parti             BHW_RS06025              174  bacteria>spirochaetes                        Borrelia hermsii                                        chromosome partitioning protein, partial [Borrelia hermsii].                                                      <-644979702_ParB-HTH*<-644979703_ParA<-644979704_DUF226<-644979705_Borrelia_orfA
      657248267    <-XerD<-BppA<-ParB-HTH*<-ParA||ERF-><-BdrA<-orfD<-Borrelia_lipo_2<-BlyB-holin                                                                                                                 ParB-HTH               Plasmid_parti             DY95_RS0102845           174  bacteria>spirochaetes                        Borrelia garinii                                        chromosome partitioning protein [Borrelia garinii].                                                               <-671612253_?<-501704835_XerD<-501704830_BppA<-657248267_ParB-HTH*<-501704490_ParA||501704847_ERF-><-501704831_BdrA<-501704842_orfD<-501704499_Borrelia_lipo_2<-501704839_BlyB-holin
      740573754    <-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->                                                                                                                                         ParB-HTH               Plasmid_parti             U880_RS10330             174  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    <-639482730_ERF<-740573754_ParB-HTH*<-639482731_ParA<-639482732_DUF226<-639482733_Borrelia_orfA||639482734_BppA->
      501704326    orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-><-?||BppA->XerD-><-Phage-integrase                                                                                      ParB-HTH               Plasmid_parti             NM71_RS05360             173  bacteria>spirochaetes                        Borrelia burgdorferi group                              MULTISPECIES: chromosome partitioning protein [Borrelia burgdorferi group].                                       499186329_orfD->501883588_BdrA->499186307_Mlp-><-763427922_ERF||499186287_Borrelia_orfA->499186288_DUF226->499186289_ParA->501704326_ParB-HTH*->499186291_BdrA-><-497945423_?||501883650_BppA->499186293_XerD-><-488735128_Phage-integrase||499186294_?->499186295_?->
      501894927    Phage-integrase-><-BdrA<-ParB-HTH*<-ParA<-Borrelia_orfA||Borrelia_orfA->DUF226->ParA->ParB-HTH->XerD->                                                                                        ParB-HTH               Plasmid_parti             BVAVS116_RS04800         173  bacteria>spirochaetes                        Borrelia valaisiana                                     chromosome partitioning protein [Borrelia valaisiana].                                                            750014136_?-><-501894913_?<-750014137_?<-750014139_?||750014140_Phage-integrase-><-501894926_BdrA<-501894927_ParB-HTH*<-501894928_ParA<-750014142_Borrelia_orfA||750014144_Borrelia_orfA->501894942_DUF226->501894943_ParA->501894944_ParB-HTH->501894949_XerD->
      501894944    <-BdrA<-ParB-HTH<-ParA<-Borrelia_orfA||Borrelia_orfA->DUF226->ParA->ParB-HTH*->XerD->                                                                                                         ParB-HTH               Plasmid_parti             BVAVS116_RS04865         173  bacteria>spirochaetes                        Borrelia valaisiana                                     chromosome partitioning protein [Borrelia valaisiana].                                                            <-501894926_BdrA<-501894927_ParB-HTH<-501894928_ParA<-750014142_Borrelia_orfA||750014144_Borrelia_orfA->501894942_DUF226->501894943_ParA->501894944_ParB-HTH*->501894949_XerD->501894951_?->501894952_?->
      503783755    BdrA->?->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->XerD-><-Phage-integrase                                                                                                    ParB-HTH               Plasmid_parti             BBIDN127_RS05560         173  bacteria>spirochaetes                        Borrelia bissettii                                      chromosome partitioning protein [Borrelia bissettii].                                                             503783749_BdrA->503783750_?->503783751_Mlp-><-503783752_ERF||503783753_Borrelia_orfA->497944496_DUF226->503783754_ParA->503783755_ParB-HTH*->503783756_BdrA->503783518_XerD-><-503783759_Phage-integrase||503783760_?-><-763175361_?<-503783762_?||503783763_?->
      504299173    BdrA->Mlp->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase||?->?->Terminase_LS->                                                                      ParB-HTH               Plasmid_parti             BAFPKO_RS05550           173  bacteria>spirochaetes                        Borrelia afzelii                                        chromosome partitioning protein [Borrelia afzelii].                                                               501574871_BdrA->504299167_Mlp->504299168_Mlp-><-504299169_ERF||504299170_Borrelia_orfA->504299171_DUF226->504299172_ParA->504299173_ParB-HTH*->504299174_BdrA->504299175_BppA->504299110_XerD-><-499919955_Phage-integrase||504299176_?->504299177_?->504299179_Terminase_LS->
      671481046    ParA->ParB-HTH*->BdrA->                                                                                                                                                                       ParB-HTH               Plasmid_parti             DY90_RS0105415           173  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      671557949_ParA->671481046_ParB-HTH*->671481040_BdrA->
      696415807    <-Terminase_LS<-?<-?<-?||Phage-integrase-><-ParB-HTH*<-ParA<-DUF226                                                                                                                           ParB-HTH               Plasmid_parti             DM10_RS00395             173  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-696415800_?<-696415801_?<-696415802_Terminase_LS<-696415803_?<-696415804_?<-696415805_?||696415806_Phage-integrase-><-696415807_ParB-HTH*<-696415808_ParA<-696415809_DUF226<-696415810_?
      504495970    <-Borrelia_orfA||?->?->ParA->ParB-HTH*->                                                                                                                                                      ParB-HTH               -                         Q7M_RS04450              172  bacteria>spirochaetes                        Borrelia crocidurae                                     permease [Borrelia crocidurae].                                                                                   752505117_?-><-501533315_Borrelia_orfA||504495968_?->504495969_?->501533314_ParA->504495970_ParB-HTH*->
      639480210    Borrelia_orfA->DUF226->ParA->ParB-HTH*->ERF->                                                                                                                                                 ParB-HTH               Plasmid_parti             U881_RS0101540           172  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      639480207_Borrelia_orfA->639480208_DUF226->639480209_ParA->639480210_ParB-HTH*->639480211_ERF->
      645024919    <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA->                                                                                                                                                      ParB-HTH               -                         BCO_RS07580              172  bacteria>spirochaetes                        Borrelia coriaceae                                      permease [Borrelia coriaceae].                                                                                    <-645024919_ParB-HTH*<-645024927_ParA<-645024928_?<-645024931_?||752507041_Borrelia_orfA-><-752507042_?
      501533313    <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA->                                                                                                                                                      ParB-HTH               -                         BRE_RS05790              171  bacteria>spirochaetes                        Borrelia recurrentis                                    permease [Borrelia recurrentis].                                                                                  <-501533313_ParB-HTH*<-501533314_ParA<-504495969_?<-504495968_?||501533315_Borrelia_orfA-><-752506402_?<-752506403_?
      639480287    <-Borrelia_orfA||?->?->ParA->ParB-HTH*-><-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA->                                                                                                              ParB-HTH               -                         U881_RS0102010           171  bacteria>spirochaetes                        Borrelia persica                                        permease [Borrelia persica].                                                                                      639480281_?-><-639480282_?<-639480283_Borrelia_orfA||639480284_?->639480285_?->639480286_ParA->639480287_ParB-HTH*-><-639480287_ParB-HTH*<-639480286_ParA<-639480285_?<-639480284_?||639480283_Borrelia_orfA->639480282_?-><-639480281_?
      639481918    Borrelia_orfA-><-?||?-><-Borrelia_orfA||?->?->ParA->ParB-HTH*->                                                                                                                               ParB-HTH               -                         U880_RS0101625           171  bacteria>spirochaetes                        Borrelia hispanica                                      permease [Borrelia hispanica].                                                                                    639481921_Borrelia_orfA-><-639481922_?||639481922_?-><-639481921_Borrelia_orfA||504495968_?->639481920_?->639481919_ParA->639481918_ParB-HTH*->
      644980358    <-Borrelia_orfA||?->?->ParA->ParB-HTH*->                                                                                                                                                      ParB-HTH               -                         BCD_RS04550              171  bacteria>spirochaetes                        Borrelia crocidurae                                     permease, partial [Borrelia crocidurae].                                                                          644980356_?-><-644980357_Borrelia_orfA||504495968_?->504495969_?->501533314_ParA->644980358_ParB-HTH*->
      645011182    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH               -                         BHO_RS06460              171  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-645011182_ParB-HTH*<-645011185_ParA<-645011188_?<-645011190_?||749299237_?-><-749299238_?
      740582340    <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA->                                                                                                                                                      ParB-HTH               -                         BDCR2A_RS06900           171  bacteria>spirochaetes                        Borrelia duttonii                                       permease [Borrelia duttonii].                                                                                     <-740582340_ParB-HTH*<-501533314_ParA<-504495969_?<-504495968_?||740582345_Borrelia_orfA->
      576095549    <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA->                                                                                                                                                      ParB-HTH               -                         BCO_0118100              170  bacteria>spirochaetes                        Borrelia coriaceae Co53                                 Putative plasmid partition protein (plasmid) [Borrelia coriaceae Co53].                                           <-576095549_ParB-HTH*<-576095550_ParA<-576095551_?<-576095552_?||576095553_Borrelia_orfA-><-576095554_?
      696412166    DUF226->ParA->ParB-HTH*-><-Borrelia_lipo_1<-?||?-><-BdrA<-DUF226<-Borrelia_orfA||orfD->                                                                                                       ParB-HTH               Plasmid_parti             DY92_RS0102290           170  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      657252264_DUF226->696412165_ParA->696412166_ParB-HTH*-><-696412168_Borrelia_lipo_1<-696412167_?||657252267_?-><-657252268_BdrA<-696412169_DUF226<-696412170_Borrelia_orfA||657252270_orfD->
      671501759    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH               Plasmid_parti             DZ05_RS0105610           169  bacteria>spirochaetes                        Borrelia garinii                                        permease, partial [Borrelia garinii].                                                                             <-671501759_ParB-HTH*<-671501760_ParA
      696422229    <-Borrelia_orfA<-ParB-HTH*<-ParA                                                                                                                                                              ParB-HTH               Plasmid_parti             DZ06_RS01000000107455    163  bacteria>spirochaetes                        Borrelia garinii                                        chromosome partitioning protein [Borrelia garinii].                                                               <-671501216_Borrelia_orfA<-696422229_ParB-HTH*<-696422231_ParA<-696422233_?
      576105904    <-Lipoprotein_2<-?<-?<-ERF<-ParB-HTH*<-ParA<-ParA<-DUF226<-?<-Borrelia_orfA||BppA->BppA->                                                                                                     ParB-HTH               SP+Plasmid_parti          BHY_1499                 160  bacteria>spirochaetes                        Borrelia hermsii YOR                                    hypothetical protein BHY_1499 (plasmid) [Borrelia hermsii YOR].                                                   <-576105900_Lipoprotein_2<-576105901_?<-576105902_?<-576105903_ERF<-576105904_ParB-HTH*<-576105905_ParA<-576105906_ParA<-576105907_DUF226<-576105908_?<-576105909_Borrelia_orfA||576105910_BppA->576105911_BppA->
      645063111    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               Plasmid_parti             BHY_RS06200              157  bacteria>spirochaetes                        Borrelia hermsii                                        permease, partial [Borrelia hermsii].                                                                             644979721_ParA->645063111_ParB-HTH*->
      695263429    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               Plasmid_parti             YP_009077380.1           145  bacteria>spirochaetes                        Borrelia burgdorferi                                    putative plasmid partition protein, partial [Borrelia burgdorferi].                                               695208550_ParA->695263429_ParB-HTH*->
      224513739    ParA->ParB-HTH*->BdrA-><-?<-?||BppA->                                                                                                                                                         ParB-HTH               Plasmid_parti             BSPA14S_PA0096           142  bacteria>spirochaetes                        Borrelia spielmanii A14S                                putative plasmid partition protein (plasmid) [Borrelia spielmanii A14S].                                          224513741_ParA->224513739_ParB-HTH*->224513737_BdrA-><-224513738_?<-224513742_?||224513740_BppA->
      645073449    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH               -                         BOM_RS06715              134  bacteria>spirochaetes                        Borrelia miyamotoi                                      permease, partial [Borrelia miyamotoi].                                                                           <-645073449_ParB-HTH*<-645073450_ParA<-645073451_?<-645073452_?||763124095_?-><-763124097_?
      219694015    Borrelia_lipo_2->orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->ParA->ParB-HTH*->BdrA-><-?<-?||BppA->XerD-><-Phage-integrase                                                                          ParB-HTH               Plasmid_parti             BGAPBR_V0033             133  bacteria>spirochaetes                        Borrelia garinii PBr                                    putative plasmid partition protein (plasmid) [Borrelia garinii PBr].                                              219694004_Borrelia_lipo_2->219694032_orfD->219694021_BdrA->219694017_Mlp-><-219694037_ERF||219694026_Borrelia_orfA->219694001_ParA->219694015_ParB-HTH*->219694014_BdrA-><-219694010_?<-219694038_?||219694020_BppA->219694025_XerD-><-219694007_Phage-integrase||219694027_?->
      644979632    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH               -                         BHW_RS05635              129  bacteria>spirochaetes                        Borrelia hermsii                                        permease, partial [Borrelia hermsii].                                                                             <-644979632_ParB-HTH*<-644979633_ParA<-644979634_?<-644979635_?||752506929_?-><-749302140_?
      671558442    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               Plasmid_parti             DY90_RS0106595           127  bacteria>spirochaetes                        Borrelia garinii                                        permease, partial [Borrelia garinii].                                                                             671558441_ParA->671558442_ParB-HTH*->
      657235150    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               Plasmid_parti             DZ03_RS0106530           123  bacteria>spirochaetes                        Borrelia garinii                                        permease, partial [Borrelia garinii].                                                                             657235149_ParA->657235150_ParB-HTH*->
      657245836    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               Plasmid_parti             DZ07_RS0106275           123  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein, partial [Borrelia burgdorferi].                                                  657245835_ParA->657245836_ParB-HTH*->
      696413957    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH               Plasmid_parti             DZ02_RS0106455           119  bacteria>spirochaetes                        Borrelia garinii                                        permease, partial [Borrelia garinii].                                                                             <-696413957_ParB-HTH*<-671547930_ParA
      219694563    <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?-><-?||?->Borrelia_orfA->                                                                                                                          ParB-HTH               Plasmid_parti             BGAFAR04_E0008           117  bacteria>spirochaetes                        Borrelia garinii Far04                                  hypothetical protein BGAFAR04_E0008 (plasmid) [Borrelia garinii Far04].                                           219694576_?->219694566_?-><-219694581_?||219694571_?-><-219694557_?<-219694559_?<-219694563_ParB-HTH*<-219694558_ParA<-219694565_DUF226<-219694568_Borrelia_orfA||219694560_?-><-219694575_?||219694588_?->219694586_Borrelia_orfA->
      671561023    <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA                                                                                                                                                      ParB-HTH               SP                        DZ15_RS0105800           113  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease, partial [Borrelia burgdorferi].                                                                         <-671561023_ParB-HTH*<-501666664_ParA<-497942834_DUF226<-671561024_Borrelia_orfA
      657249216    DUF226->ParA->ParB-HTH*->                                                                                                                                                                     ParB-HTH               -                         DZ12_RS0106295           112  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease, partial [Borrelia burgdorferi].                                                                         501588777_DUF226->499186342_ParA->657249216_ParB-HTH*->
      576092914    <-Lipoprotein_2||ParA->ParB-HTH*-><-?<-Mlp<-?<-Mlp                                                                                                                                            ParB-HTH               -                         BHO_0117100              106  bacteria>spirochaetes                        Borrelia hermsii YBT                                    Putative plasmid partition protein (plasmid) [Borrelia hermsii YBT].                                              <-576092907_?<-576092908_?||576092909_?-><-576092910_?<-576092911_?<-576092912_Lipoprotein_2||576092913_ParA->576092914_ParB-HTH*-><-576092915_?<-576092916_Mlp<-576092917_?<-576092918_Mlp||576092919_?->576092920_?->576092921_?->
      695263555    ParA->?->ParB-HTH*->                                                                                                                                                                          ParB-HTH               Plasmid_parti             PF-49                    96   bacteria>spirochaetes                        Borrelia burgdorferi                                    PF-49 protein, partial [Borrelia burgdorferi].                                                                    695208901_ParA->695208900_?->695263555_ParB-HTH*->
      695263547    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               -                         PF-49                    95   bacteria>spirochaetes                        Borrelia burgdorferi                                    PF-49 protein, partial [Borrelia burgdorferi].                                                                    695208881_?->695208882_ParA->695263547_ParB-HTH*->
      657236274    <-ParB-HTH*<-ParA<-Borrelia_orfA                                                                                                                                                              ParB-HTH               -                         DY94_RS0105330           92   bacteria>spirochaetes                        Borrelia garinii                                        hypothetical protein, partial [Borrelia garinii].                                                                 <-657236274_ParB-HTH*<-657236277_ParA<-657236280_Borrelia_orfA
      219693935    Borrelia_lipo_1-><-BdrA||?-><-?<-?<-?<-ParB-HTH*<-?<-?<-ParA                                                                                                                                  ParB-HTH               Plasmid_parti             BGAPBR_E0019             91   bacteria>spirochaetes                        Borrelia garinii PBr                                    hypothetical protein BGAPBR_E0019 (plasmid) [Borrelia garinii PBr].                                               <-219693939_?||219693908_Borrelia_lipo_1-><-219693911_BdrA||219693940_?-><-219693903_?<-219693899_?<-219693941_?<-219693935_ParB-HTH*<-219693934_?<-219693930_?<-219693928_ParA<-219693937_?<-219693898_?||219693921_?->219693902_?->
      671563388    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH               -                         DZ19_RS0106050           91   bacteria>spirochaetes                        Borrelia burgdorferi                                    hypothetical protein, partial [Borrelia burgdorferi].                                                             <-671563388_ParB-HTH*<-671563390_ParA
      695263540    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH               -                         YP_009077626.1           89   bacteria>spirochaetes                        Borrelia burgdorferi                                    PF-49 protein, partial [Borrelia burgdorferi].                                                                    695208862_?->695208863_ParA->695263540_ParB-HTH*->
      # 1; Same as above in general
      576094619    <-BppA<-BppA||Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-orfD<-Borrelia_lipo_2                                                                                                                  ParB-HTH               Plasmid_parti             BCO_0130002              201  bacteria>spirochaetes                        Borrelia coriaceae Co53                                 Putative plasmid partition protein (plasmid) [Borrelia coriaceae Co53].                                           <-576094615_BppA<-576094616_BppA||576094617_Borrelia_orfA->576094618_DUF226->576094619_ParB-HTH*->576094620_ERF-><-576094621_orfD<-576094622_Borrelia_lipo_2<-576094623_?<-576094624_?<-576094625_?<-576094626_?
      576095359    <-XerD<-?<-SSB<-SSB<-BppA||Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-orfD                                                                                                                      ParB-HTH               Plasmid_parti             BCO_0130005              200  bacteria>spirochaetes                        Borrelia coriaceae Co53                                 Putative plasmid partition protein (plasmid) [Borrelia coriaceae Co53].                                           <-576095352_XerD<-576095353_?<-576095354_SSB<-576095355_SSB<-576095356_BppA||576095357_Borrelia_orfA->576095358_DUF226->576095359_ParB-HTH*->576095360_ERF-><-576095361_orfD
      488735361    Phage-integrase-><-XerD<-ParB-HTH*<-DUF226<-Borrelia_orfA||ERF->                                                                                                                              ParB-HTH               Plasmid_parti             DZ12_RS0105395           186  bacteria>spirochaetes                        Borrelia burgdorferi group                              MULTISPECIES: chromosome partitioning protein [Borrelia burgdorferi group].                                       <-740590199_?||488735346_?->657249030_?->657249032_Phage-integrase-><-501588716_XerD<-488735361_ParB-HTH*<-488735365_DUF226<-657249036_Borrelia_orfA||740590200_ERF->
      488739923    Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-?<-?||?-><-?||Phage-integrase->                                                                                                                      ParB-HTH               SP+Plasmid_parti          NM71_RS07415             186  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           499193068_Borrelia_orfA->499193062_DUF226->488739923_ParB-HTH*->499193063_ERF-><-499193064_?<-499193070_?||763428009_?-><-499193066_?||497944141_Phage-integrase->
      493479385    ParB-HTH*->BdrA->                                                                                                                                                                             ParB-HTH               Plasmid_parti             BSPA14S_RS04890          186  bacteria>spirochaetes                        Borrelia spielmanii                                     chromosome partitioning protein [Borrelia spielmanii].                                                            493479385_ParB-HTH*->493479383_BdrA->
      497944835    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH               SP+Plasmid_parti          BBU80A_RS09650           186  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           <-497944835_ParB-HTH*
      499985629    Phage-integrase-><-BppA<-BppA<-BppA||Borrelia_orfA->DUF226->ParB-HTH*->ERF->                                                                                                                  ParB-HTH               Plasmid_parti             Ip21p08                  186  bacteria>spirochaetes                        Borrelia burgdorferi                                    hypothetical protein [Borrelia burgdorferi].                                                                      115534917_?->115534918_Phage-integrase-><-115534919_BppA<-115534927_BppA<-115534920_BppA||115534921_Borrelia_orfA->115534922_DUF226->499985629_ParB-HTH*->115534924_ERF->115534925_?-><-115534926_?
      501930839    Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-?<-?||?->Phage-integrase->                                                                                                                           ParB-HTH               SP+Plasmid_parti          BBUN40_RS05120           186  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           504353027_Borrelia_orfA->497944126_DUF226->501930839_ParB-HTH*->497944130_ERF-><-497944134_?<-497944135_?||497944137_?->497944141_Phage-integrase->
      671501078    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH               SP+Plasmid_parti          DZ04_RS0105605           186  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-671501078_ParB-HTH*
      671580297    <-BdrA<-ParB-HTH*                                                                                                                                                                             ParB-HTH               Plasmid_parti             DZ10_RS0106780           186  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease [Borrelia burgdorferi].                                                                                  <-671580290_BdrA<-671580297_ParB-HTH*
      671550272    Borrelia_lipo_1->Borrelia_lipo_1-><-ParB-HTH*                                                                                                                                                 ParB-HTH               Plasmid_parti             DZ09_RS0104855           185  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease [Borrelia burgdorferi].                                                                                  501666678_?->497942861_Borrelia_lipo_1->501666676_Borrelia_lipo_1-><-671550272_ParB-HTH*
      696419050    <-BdrA<-ParB-HTH*                                                                                                                                                                             ParB-HTH               Plasmid_parti             DY99_RS01000000109765    181  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-671481818_BdrA<-696419050_ParB-HTH*
      644979468    Borrelia_orfA->DUF226->ParB-HTH*->BdrA->ERF->                                                                                                                                                 ParB-HTH               SP+Plasmid_parti          BHY_RS06165              176  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      645063105_Borrelia_orfA->645063106_DUF226->644979468_ParB-HTH*->644979469_BdrA->645063107_ERF->
      645010627    <-Lipoprotein_2<-?<-Lipoprotein_2<-BdrA<-Terminase_LS<-MultiTM<-ThiF<-ParB-HTH*<-DUF226<-Borrelia_orfA||Lipoprotein_2->?-><-BdrA||BdrA->                                                      ParB-HTH               Plasmid_parti             BHO_RS05335              175  bacteria>spirochaetes                        Borrelia hermsii                                        permease [Borrelia hermsii].                                                                                      <-645010599_Lipoprotein_2<-645010602_?<-645010609_Lipoprotein_2<-645010612_BdrA<-645010615_Terminase_LS<-645010618_MultiTM<-645010621_ThiF<-645010627_ParB-HTH*<-645010630_DUF226<-645010633_Borrelia_orfA||645010636_Lipoprotein_2->645010639_?-><-749299198_BdrA||645010642_BdrA->749299199_?->
      497943789    ParB-HTH*->BdrA->                                                                                                                                                                             ParB-HTH               Plasmid_parti             DY93_RS0105830           174  bacteria>spirochaetes                        Borrelia burgdorferi                                    chromosome partitioning protein [Borrelia burgdorferi].                                                           740589459_?->497943789_ParB-HTH*->671559884_BdrA->
      499985609    <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Borrelia_orfA<-ParB-HTH*<-Borrelia_orfA||BppA->BppA->XerD-><-Phage-integrase<-Mlp||BdrA->                        ParB-HTH               Plasmid_parti             ORFc                     174  bacteria>spirochaetes                        Borrelia duttonii                                       hypothetical protein [Borrelia duttonii].                                                                         <-115534864_?<-115534865_Lipoprotein_2<-115534866_Lipoprotein_2<-115534867_Lipoprotein_2<-115534868_Lipoprotein_2<-115534869_Lipoprotein_2<-115534870_Borrelia_orfA<-499985609_ParB-HTH*<-115534872_Borrelia_orfA||115534873_BppA->115534874_BppA->115534875_XerD-><-115534876_Phage-integrase<-115534877_Mlp||115534878_BdrA->
      671520608    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH               Plasmid_parti             DY88_RS0105135           174  bacteria>spirochaetes                        Borrelia garinii                                        chromosome partitioning protein [Borrelia garinii].                                                               <-671520608_ParB-HTH*
      657235493    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH               SP+Plasmid_parti          DY94_RS0102265           172  bacteria>spirochaetes                        Borrelia garinii                                        permease [Borrelia garinii].                                                                                      <-657235493_ParB-HTH*||657235495_?->657235496_?->
      752506999    Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-Borrelia_lipo_2                                                                                                                                      ParB-HTH               Plasmid_parti             BCO_RS06215              172  bacteria>spirochaetes                        Borrelia coriaceae                                      hypothetical protein [Borrelia coriaceae].                                                                        645024187_Borrelia_orfA->645024190_DUF226->752506999_ParB-HTH*->645024196_ERF-><-645024198_Borrelia_lipo_2<-645024200_?<-645024203_?<-645024206_?<-645024210_?<-645024212_?
      740592163    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH               Plasmid_parti             DZ19_RS09730             171  bacteria>spirochaetes                        Borrelia burgdorferi                                    permease [Borrelia burgdorferi].                                                                                  <-740592163_ParB-HTH*
      497945190    ParB-HTH*->BppA->                                                                                                                                                                             ParB-HTH               Plasmid_parti             BBU80A_RS08665           166  bacteria>spirochaetes                        Borrelia burgdorferi                                    hypothetical protein, partial [Borrelia burgdorferi].                                                             497945190_ParB-HTH*->497945191_BppA->
      654876378    Borrelia_lipo_2->orfD-><-ERF<-ParB-HTH*                                                                                                                                                       ParB-HTH               Plasmid_parti             T431_RS0104785           153  bacteria>spirochaetes                        Borrelia coriaceae                                      hypothetical protein, partial [Borrelia coriaceae].                                                               645024210_?->645024206_?->645024203_?->645024200_?->654876375_Borrelia_lipo_2->654876376_orfD-><-740579301_ERF<-654876378_ParB-HTH*
      671558427    ParB-HTH*->                                                                                                                                                                                   ParB-HTH               Plasmid_parti             DY90_RS0106570           134  bacteria>spirochaetes                        Borrelia garinii                                        permease, partial [Borrelia garinii].                                                                             671558427_ParB-HTH*->
      671547916    <-BdrA<-ParB-HTH*                                                                                                                                                                             ParB-HTH               Plasmid_parti             DZ02_RS0106405           130  bacteria>spirochaetes                        Borrelia garinii                                        permease, partial [Borrelia garinii].                                                                             <-671481040_BdrA<-671547916_ParB-HTH*
      671560278    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH               -                         DY93_RS0106950           94   bacteria>spirochaetes                        Borrelia burgdorferi                                    permease, partial [Borrelia burgdorferi].                                                                         <-671560278_ParB-HTH*<-671560280_?
      # 155; Often XerD association                                                                                                                                                                                                                                                        
      494523440    Relaxase-><-?<-HNH<-?||Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                       ParB-HTH+Prok-TUDOR         -                         CWATWH0005_2825          431  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     494523435_Relaxase-><-494523436_?<-494523437_HNH<-494523438_?||494523439_Primpol?->494523440_ParB-HTH+Prok-TUDOR*-><-494523442_?||494523443_?->
      428272365    <-ParB<-ParA<-?<-?<-?||ParB-HTH+Prok-TUDOR*-><-XerD||CASPASE-><-?||?->?->METHYLASE->                                                                                                          ParB-HTH+Prok-TUDOR         -                         Sta7437_4876             420  bacteria>cyanobacteria                       Stanieria cyanosphaera PCC 7437                         hypothetical protein Sta7437_4876 (plasmid) [Stanieria cyanosphaera PCC 7437].                                    <-428272358_?<-428272359_?<-428272360_ParB<-428272361_ParA<-428272362_?<-428272363_?<-428272364_?||428272365_ParB-HTH+Prok-TUDOR*-><-428272366_XerD||428272367_CASPASE-><-428272368_?||428272369_?->428272370_?->428272371_METHYLASE->428272372_?->
      428267400    <-ParA<-?<-?<-?<-?<-?||HTH->ParB-HTH+Prok-TUDOR*-><-?<-?||?->?->?->XerD->                                                                                                                     ParB-HTH+Prok-TUDOR         SP                        Glo7428_4930             400  bacteria>cyanobacteria                       Gloeocapsa sp. PCC 7428                                 hypothetical protein Glo7428_4930 (plasmid) [Gloeocapsa sp. PCC 7428].                                            <-428267393_ParA<-428267394_?<-428267395_?<-428267396_?<-428267397_?<-428267398_?||428267399_HTH->428267400_ParB-HTH+Prok-TUDOR*-><-428267401_?<-428267402_?||428267403_?->428267404_?->428267405_?->428267406_XerD-><-428267407_?
      718251661    <-TerD<-TerD||?-><-ParB-HTH+Prok-TUDOR*                                                                                                                                                       ParB-HTH+Prok-TUDOR         -                         N44_02315                370  bacteria>cyanobacteria                       Microcystis aeruginosa NIES-44                          benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Microcystis aeruginosa NIES-44]. 718251654_?-><-718251655_?||718251656_?-><-718251657_?<-718251658_TerD<-718251659_TerD||718251660_?-><-718251661_ParB-HTH+Prok-TUDOR*||718251662_?->718251663_?->718251664_?-><-718251665_?||718251666_?->718251667_?->718251668_?->
      389882556    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         MICAK_2860002            368  bacteria>cyanobacteria                       Microcystis aeruginosa PCC 9701                         conserved hypothetical protein [Microcystis aeruginosa PCC 9701].                                                 389882570_?-><-389882571_?<-389882572_?<-389882573_?<-389882574_?||389882575_?->389882555_?-><-389882556_ParB-HTH+Prok-TUDOR*<-389882557_?||389882558_?->389882559_?-><-389882560_?||389882561_?-><-389882562_?<-389882563_?
      543531309    Relaxase->?-><-?<-ASCH+ParB-HTH+Prok-TUDOR*                                                                                                                                                   ASCH+ParB-HTH+Prok-TUDOR    -                         CWATWH0402_1907          361  bacteria>cyanobacteria                       Crocosphaera watsonii WH 0402                           hypothetical protein CWATWH0402_1907 [Crocosphaera watsonii WH 0402].                                             <-543531302_?||543531303_?-><-543531304_?||543531305_?->543531306_Relaxase->543531307_?-><-543531308_?<-543531309_ASCH+ParB-HTH+Prok-TUDOR*
      737861903    Relaxase-><-?<-HNH<-?||Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                       ParB-HTH+Prok-TUDOR         -                         CWATWH0003_RS24275       344  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     737861894_Relaxase-><-737861897_?<-737861900_HNH<-494523438_?||494523439_Primpol?->737861903_ParB-HTH+Prok-TUDOR*->494523441_?-><-737861906_?||494523443_?->494523444_?->494523445_?->
      754508876    HTH->ParB-HTH+Prok-TUDOR*-><-?||?->?->XerD->                                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         GLO7428_RS24200          338  bacteria>cyanobacteria                       Gloeocapsa sp. PCC 7428                                 hypothetical protein, partial [Gloeocapsa sp. PCC 7428].                                                          <-754508859_?<-505004106_?<-505004107_?<-754508860_?<-505004108_?<-754508861_?||754508875_HTH->754508876_ParB-HTH+Prok-TUDOR*-><-505004112_?||505004114_?->505004115_?->754508877_XerD-><-505004117_?<-505004118_?<-505004119_?
      748135946    RVT+HNH-><-?<-?<-?||?->?->HTH->ParB-HTH+Prok-TUDOR*-><-?<-HU-IHF                                                                                                                              ParB-HTH+Prok-TUDOR         -                         QH73_RS07795             337  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein [Scytonema millei].                                                                          748135943_RVT+HNH-><-748135944_?<-748135878_?<-748135879_?||748135880_?->748135945_?->748135881_HTH->748135946_ParB-HTH+Prok-TUDOR*-><-748135882_?<-748135947_HU-IHF||748135883_?-><-748135948_?<-748135949_?||748135884_?->748135885_?->
      67852287     <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                       ParB-HTH+Prok-TUDOR         -                         CwatDRAFT_0109           334  bacteria>cyanobacteria                       Crocosphaera watsonii WH 8501                           unknown protein [Crocosphaera watsonii WH 8501].                                                                  <-67852291_?||67852285_?-><-67852290_?<-67852289_ExoVII<-67852288_?<-67852287_ParB-HTH+Prok-TUDOR*<-67852286_XerD
      751570983    <-RVT+HNH||?->?->DDE_Tnp_1_2-><-ParB-HTH+Prok-TUDOR*<-Primpol?                                                                                                                                ParB-HTH+Prok-TUDOR         -                         SD81_RS27565             334  bacteria>cyanobacteria                       Tolypothrix campylonemoides                             hypothetical protein [Tolypothrix campylonemoides].                                                               <-751570975_?<-751565841_?<-751571087_?<-751570976_RVT+HNH||751571089_?->751570978_?->751570980_DDE_Tnp_1_2-><-751570983_ParB-HTH+Prok-TUDOR*<-751570984_Primpol?||751570986_?-><-751570988_?<-751570990_?||751570992_?->751570994_?->751571091_?->
      218175378    <-XerD||ParB-HTH+Prok-TUDOR*->                                                                                                                                                                ParB-HTH+Prok-TUDOR         SP                        PCC7424_5542             332  bacteria>cyanobacteria                       Cyanothece sp. PCC 7424                                 hypothetical protein PCC7424_5542 (plasmid) [Cyanothece sp. PCC 7424].                                            218175371_?->218175372_?->218175373_?-><-218175374_?<-218175375_?||218175376_?-><-218175377_XerD||218175378_ParB-HTH+Prok-TUDOR*->218175379_?->218175380_?->218175381_?->
      515877940    Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                                              ParB-HTH+Prok-TUDOR         -                         PCC9339_RS0106675        332  bacteria>cyanobacteria                       Fischerella sp. PCC 9339                                hypothetical protein [Fischerella sp. PCC 9339].                                                                  <-515877934_?<-515877935_?<-515877936_?<-515877936_?<-515877937_?<-515877938_?||737126821_Primpol?->515877940_ParB-HTH+Prok-TUDOR*->515877941_?->737126824_?->515877943_?->515877944_?->515877945_?->515877946_?->515877947_?->
      657929542    <-ParB-HTH+Prok-TUDOR*<-Primpol?                                                                                                                                                              ParB-HTH+Prok-TUDOR         -                         TOL9009_RS0101730        330  bacteria>cyanobacteria                       [Scytonema hofmanni] UTEX B 1581                        hypothetical protein [[Scytonema hofmanni] UTEX B 1581].                                                          <-657929539_?<-657929540_?<-657929541_?<-740464363_?<-657929542_ParB-HTH+Prok-TUDOR*<-657929543_Primpol?||657929544_?->657929545_?->657929546_?->657929547_?->740464366_?->657929548_?->
      407266570    <-DDE||TPR-><-?||?->?->?->?-><-ParB-HTH+Prok-TUDOR*||?->?-><-?||?->?-><-?||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->                                                                         ParB-HTH+Prok-TUDOR         -                         FDUTEX481_04300          326  bacteria>cyanobacteria                       Tolypothrix sp. PCC 7601                                hypothetical protein FDUTEX481_04300 [Tolypothrix sp. PCC 7601].                                                  <-407266564_DDE||407266563_TPR-><-407266565_?||407266566_?->407266567_?->407266568_?->407266569_?-><-407266570_ParB-HTH+Prok-TUDOR*||407266571_?->407266572_?-><-407266573_?||407266574_?->407266575_?-><-407266576_?||407266577_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->
      515385753    <-ParB-HTH+Prok-TUDOR*<-Primpol?||?->?-><-?||?-><-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon                                                                                                    ParB-HTH+Prok-TUDOR         -                         UYC_RS0100505            326  bacteria>cyanobacteria                       Chlorogloeopsis fritschii                               hypothetical protein [Chlorogloeopsis fritschii].                                                                 515385746_?->750127864_?->515385748_?-><-515385749_?||515385750_?->515385751_?->515388784_?-><-515385753_ParB-HTH+Prok-TUDOR*<-515388785_Primpol?||515388786_?->515385106_?-><-515385110_?||515385113_?-><-515385115_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon||515385118_?->
      546206668    PriCT_2->XerD->ParB-HTH+Prok-TUDOR*->?->ExoVII->                                                                                                                                              ParB-HTH+Prok-TUDOR         -                         CWATWH8502_3343          323  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     546206663_PriCT_2->546206666_XerD->546206668_ParB-HTH+Prok-TUDOR*->494518954_?->546206670_ExoVII->494518956_?-><-494518951_?
      546222413    <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                       ParB-HTH+Prok-TUDOR         -                         CWATWH0005_5641          323  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     494524271_?->494524273_?->546222411_?->546222412_?-><-494523800_?<-494518955_ExoVII<-494518954_?<-546222413_ParB-HTH+Prok-TUDOR*<-546222414_XerD
      655839534    RVT+HNH->?->?->ParB-HTH->HTH->Primpol?->ParB-HTH+Prok-TUDOR*-><-?<-?||RecD->XerD->                                                                                                            ParB-HTH+Prok-TUDOR         -                         SYN7509_RS0222055        323  bacteria>cyanobacteria                       Synechocystis sp. PCC 7509                              hypothetical protein [Synechocystis sp. PCC 7509].                                                                740179509_?->497316263_RVT+HNH->497316262_?->497316261_?->740179512_ParB-HTH->497316258_HTH->740179516_Primpol?->655839534_ParB-HTH+Prok-TUDOR*-><-497316255_?<-655839535_?||740179519_RecD->497316172_XerD->497316171_?->740179350_?->740179426_?->
      748136445    HU-IHF-><-?<-ParB-HTH+Prok-TUDOR*<-HTH<-?<-?||?-><-?<-?||NACHT->                                                                                                                              ParB-HTH+Prok-TUDOR         -                         QH73_RS10255             323  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein [Scytonema millei].                                                                          <-748136367_?||748136368_?->748136443_?->748136369_?-><-748136444_?||748136370_HU-IHF-><-748136371_?<-748136445_ParB-HTH+Prok-TUDOR*<-748136372_HTH<-748136373_?<-748136374_?||748136375_?-><-748136376_?<-748136377_?||748136378_NACHT->
      797212629    TPR-><-?||?->?->?-><-ParB-HTH+Prok-TUDOR*||?->?-><-?||?-><-?||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-ATHOOK+ParA                                                                         ParB-HTH+Prok-TUDOR         -                         FDUTEX481_RS32065        323  bacteria>cyanobacteria                       Tolypothrix sp. PCC 7601                                hypothetical protein [Tolypothrix sp. PCC 7601].                                                                  797212624_?->797212718_?->797212719_TPR-><-797212625_?||797212626_?->797212627_?->797212628_?-><-797212629_ParB-HTH+Prok-TUDOR*||797212630_?->797212720_?-><-797212631_?||797212721_?-><-797212722_?||797212632_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-797212633_ATHOOK+ParA
      769922127    HTH->Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                                         ParB-HTH+Prok-TUDOR         -                         UH38_RS20080             322  bacteria>cyanobacteria                       Chroococcales cyanobacterium CENA595                    hypothetical protein [Chroococcales cyanobacterium CENA595].                                                      <-769922079_?||769922080_?->769922081_HTH->769922126_Primpol?->769922127_ParB-HTH+Prok-TUDOR*->769922082_?->769922083_?->769922128_?->769922084_?->769922085_?->769922086_?->769922087_?->
      737859551    Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                                              ParB-HTH+Prok-TUDOR         -                         CWATWH0003_RS12720       320  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein, partial [Crocosphaera watsonii].                                                            737859546_?->737859548_?->494521402_Primpol?->737859551_ParB-HTH+Prok-TUDOR*->
      186469442    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         Npun_BR102               317  bacteria>cyanobacteria                       Nostoc punctiforme PCC 73102                            conserved hypothetical protein (plasmid) [Nostoc punctiforme PCC 73102].                                          <-186469435_?<-186469436_?<-186469437_?||186469438_?-><-186469439_?||186469440_?->186469441_?-><-186469442_ParB-HTH+Prok-TUDOR*<-186469443_?<-186469444_?<-186469445_?||186469446_?->186469447_?->186469448_?->186469449_?->
      797208446    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         FDUTEX481_RS10740        317  bacteria>cyanobacteria                       Tolypothrix sp. PCC 7601                                hypothetical protein [Tolypothrix sp. PCC 7601].                                                                  <-797208721_?<-797208441_?<-797208442_?<-797208722_?<-797208443_?<-797208444_?<-797208445_?<-797208446_ParB-HTH+Prok-TUDOR*<-797208447_?||797208723_?->797208724_?->797208448_?->797208725_?->797208449_?->797208450_?->
      501601085    <-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                                  ParB-HTH+Prok-TUDOR         SP                        PCC7424_RS26725          314  bacteria>cyanobacteria                       Cyanothece sp. PCC 7424                                 hypothetical protein [Cyanothece sp. PCC 7424].                                                                   <-501601078_?<-752567297_?<-501601080_?<-752567298_?<-501601082_?||501601083_?->501601084_?-><-501601085_ParB-HTH+Prok-TUDOR*<-501601086_XerD||501601087_?-><-752567299_?<-501601089_?<-501601090_?<-752567300_?||501601092_?->
      499635872    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         SP                        AVA_RS27595              313  bacteria>cyanobacteria                       Anabaena variabilis                                     hypothetical protein [Anabaena variabilis].                                                                       <-499635864_?<-752818109_?<-499635866_?<-499635867_?||752818153_?-><-499635870_?<-499635871_?<-499635872_ParB-HTH+Prok-TUDOR*<-499635873_?||499635874_?->752818154_?->752818155_?->499635876_?->499635877_?->752818156_?->
      501381405    NACHT->STYKIN->?-><-TPR+CASPASE||?->ParB-HTH+Prok-TUDOR*-><-?<-?<-PIN+CASPASE<-TPR+CASPASE||DDE_3->                                                                                           ParB-HTH+Prok-TUDOR         SP                        NPUN_RS34090             313  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        <-501381398_?<-753810943_?||501381399_NACHT->753810971_STYKIN->753810972_?-><-501381402_TPR+CASPASE||501381404_?->501381405_ParB-HTH+Prok-TUDOR*-><-501381406_?<-501381408_?<-501381409_PIN+CASPASE<-501381410_TPR+CASPASE||501381411_DDE_3->753810944_?->753810973_?->
      389714985    <-ParB-HTH+Prok-TUDOR*<-?||?-><-?<-?<-?<-RVT+HNH                                                                                                                                              ParB-HTH+Prok-TUDOR         SP                        MICAB_900014             309  bacteria>cyanobacteria                       Microcystis aeruginosa PCC 9717                         conserved hypothetical protein [Microcystis aeruginosa PCC 9717].                                                 389714978_?->389714979_?->389714980_?->389714981_?->389714982_?->389714983_?->389714984_?-><-389714985_ParB-HTH+Prok-TUDOR*<-389714986_?||389714970_?-><-389714961_?<-389714962_?<-389714963_?<-389714964_RVT+HNH||389714965_?->
      499309017    STYKIN->?-><-TPR+CASPASE<-?<-?<-?<-?||ParB-HTH+Prok-TUDOR*->?->RVT+HNH->RVT+HNH->RVT+HNH->?-><-?<-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon                                                    ParB-HTH+Prok-TUDOR         SP                        PCC7120DELTA_RS29565     309  bacteria>cyanobacteria                       Nostoc sp. PCC 7120                                     hypothetical protein [Nostoc sp. PCC 7120].                                                                       764953490_STYKIN->764953492_?-><-499309012_TPR+CASPASE<-499309013_?<-499309014_?<-499309015_?<-499309016_?||499309017_ParB-HTH+Prok-TUDOR*->764953376_?->499309018_RVT+HNH->764953494_RVT+HNH->764953500_RVT+HNH->499309021_?-><-499309022_?<-499309023_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon
      499635567    <-ParB-HTH+Prok-TUDOR*||?->?->?->TPR+CASPASE-><-?<-STYKIN||NACHT->                                                                                                                            ParB-HTH+Prok-TUDOR         SP                        AVA_RS26020              309  bacteria>cyanobacteria                       Anabaena variabilis                                     hypothetical protein [Anabaena variabilis].                                                                       499309022_?-><-499309021_?<-499635567_ParB-HTH+Prok-TUDOR*||499635568_?->499309014_?->499635569_?->499635570_TPR+CASPASE-><-499635571_?<-752818111_STYKIN||752818112_NACHT->
      515347403    ABC->ABC-><-?||Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                               ParB-HTH+Prok-TUDOR         -                         UYG_RS0120335            308  bacteria>cyanobacteria                       Fischerella muscicola                                   hypothetical protein [Fischerella muscicola].                                                                     515347396_?->515347397_?->515347398_?->515347399_ABC->515347400_ABC-><-515347401_?||703201084_Primpol?->515347403_ParB-HTH+Prok-TUDOR*-><-515347404_?||515347405_?->515347406_?->515347407_?->515347408_?->703201088_?->703201078_?->
      501381481    ParB-HTH+Prok-TUDOR*->TPR->TPR+CASPASE->NACHT->TPR+CASPASE->                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         NPUN_RS34540             305  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        <-501381475_?<-753810996_?<-501381477_?<-753810949_?<-501381478_?<-753810997_?||501381480_?->501381481_ParB-HTH+Prok-TUDOR*->753810998_TPR->753810999_TPR+CASPASE->753811000_NACHT->753811001_TPR+CASPASE->501381484_?->753811002_?->753811003_?->
      753865019    <-ParB<-ParA<-?<-?<-?||?->ParB-HTH+Prok-TUDOR*-><-XerD||CASPASE-><-?||?->METHYLASE->                                                                                                          ParB-HTH+Prok-TUDOR         -                         STA7437_RS23975          302  bacteria>cyanobacteria                       Stanieria cyanosphaera                                  hypothetical protein [Stanieria cyanosphaera].                                                                    <-505008572_?<-505008573_ParB<-505008574_ParA<-505008575_?<-753865013_?<-753865015_?||753865017_?->753865019_ParB-HTH+Prok-TUDOR*-><-753865021_XerD||753865024_CASPASE-><-505008581_?||753865026_?->505008584_METHYLASE->505008585_?-><-505008586_?
      757158775    <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*                                                                                                                                                             ParB-HTH+Prok-TUDOR         -                         CWATDRAFT_RS29615        301  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     757158773_?-><-494518957_?||494518951_?-><-494518956_?<-494518955_ExoVII<-494518954_?<-757158775_ParB-HTH+Prok-TUDOR*
      744450902    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         SP                        DA73_0214905             298  bacteria>cyanobacteria                       Tolypothrix bouteillei VB521301                         hypothetical protein DA73_0214905 [Tolypothrix bouteillei VB521301].                                              744450881_?-><-744450898_?||744450899_?->744450882_?-><-744450900_?<-744450883_?<-744450901_?<-744450902_ParB-HTH+Prok-TUDOR*<-744450884_?||744450885_?->744450886_?->744450887_?-><-744450888_?<-744450889_?||744450890_?->
      737134277    ParB-HTH+Prok-TUDOR*-><-?<-?<-?<-RVT+HNH                                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         FIS9431_RS33115          295  bacteria>cyanobacteria                       Fischerella sp. PCC 9431                                hypothetical protein [Fischerella sp. PCC 9431].                                                                  <-652326659_?||737135005_?->652326660_?->652326661_?-><-652326662_?||737135007_?->737135017_?->737134277_ParB-HTH+Prok-TUDOR*-><-652326663_?<-652326664_?<-652326665_?<-652326666_RVT+HNH<-652326667_?<-652326668_?<-737135019_?
      769920071    <-ParB-HTH+Prok-TUDOR*<-?<-HISKIN||?->HISKIN->                                                                                                                                                ParB-HTH+Prok-TUDOR         -                         UH38_RS09160             294  bacteria>cyanobacteria                       Chroococcales cyanobacterium CENA595                    hypothetical protein [Chroococcales cyanobacterium CENA595].                                                      769919939_?-><-769919940_?<-769919941_?<-769919942_?<-769919943_?<-769920071_ParB-HTH+Prok-TUDOR*<-769919944_?<-769919945_HISKIN||769919946_?->769920072_HISKIN->769919947_?-><-769919948_?<-769919949_?
      779871805    SNF-helicase->?->?-><-?<-?<-TerD<-TerD<-ParB-HTH+Prok-TUDOR*                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         N44_RS02080              293  bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein [Microcystis aeruginosa].                                                                    779871748_SNF-helicase->779871801_?->779871752_?-><-779871755_?<-779871760_?<-488847691_TerD<-488847692_TerD<-779871805_ParB-HTH+Prok-TUDOR*||779871765_?->779871769_?->779871774_?-><-779871778_?||488837493_?->779871786_?->
      752568031    <-ExoVII<-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                          ParB-HTH+Prok-TUDOR         -                         CYAN8802_RS22020         292  bacteria>cyanobacteria                       Cyanothece sp. PCC 8802                                 hypothetical protein [Cyanothece sp. PCC 8802].                                                                   <-502464563_?<-502464564_?<-502464565_?<-752568030_?||502464569_?-><-502464578_?<-502464579_ExoVII<-752568031_ParB-HTH+Prok-TUDOR*<-752568048_XerD<-502464582_?||502464583_?-><-502464586_?||752568033_?->752568050_?-><-502464594_?
      763118064    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         MICAK_RS15355            291  bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein [Microcystis aeruginosa].                                                                    490389584_?-><-490389586_?<-490389588_?<-490389589_?<-490389590_?||490389591_?->490389593_?-><-763118064_ParB-HTH+Prok-TUDOR*||763118011_?->490389596_?->490389597_?-><-763118012_?<-490389598_?
      505024902    <-ParB-HTH+Prok-TUDOR*<-?||XerD-><-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-?<-Zn_Tnp_IS1595<-McrB                                                                                           ParB-HTH+Prok-TUDOR         SP                        STA7437_RS22850          290  bacteria>cyanobacteria                       Stanieria cyanosphaera                                  hypothetical protein [Stanieria cyanosphaera].                                                                    505024896_?->753864831_?->505024897_?->505024898_?->505024899_?->505024900_?->505024901_?-><-505024902_ParB-HTH+Prok-TUDOR*<-753864890_?||505024905_XerD-><-505024906_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-505024907_?<-753864892_Zn_Tnp_IS1595<-505024909_McrB<-505024910_?
      505141377    TPR->?-><-DDE_Tnp_1_2||?->?-><-?<-ParB-HTH+Prok-TUDOR*||?->?-><-DDE_3                                                                                                                         ParB-HTH+Prok-TUDOR         -                         CYLST_RS31010            290  bacteria>cyanobacteria                       Cylindrospermum stagnale                                hypothetical protein [Cylindrospermum stagnale].                                                                  <-752562980_?||505141373_TPR->505141374_?-><-505141375_DDE_Tnp_1_2||752562981_?->752562892_?-><-752562982_?<-505141377_ParB-HTH+Prok-TUDOR*||505141378_?->505141379_?-><-752561959_DDE_3<-752561958_?||505141380_?->505141381_?->752562984_?->
      518335686    Relaxase->?->?->?->?->?-><-ParB-HTH+Prok-TUDOR*||XerD-><-?<-?<-?<-?<-ABC                                                                                                                      ParB-HTH+Prok-TUDOR         SP                        PLEUR7319_RS0114150      289  bacteria>cyanobacteria                       Pleurocapsa sp. PCC 7319                                hypothetical protein [Pleurocapsa sp. PCC 7319].                                                                  518335678_?->518335679_Relaxase->518335680_?->518335682_?->518335683_?->518335684_?->518335685_?-><-518335686_ParB-HTH+Prok-TUDOR*||518335687_XerD-><-648410763_?<-518335688_?<-518335689_?<-648410764_?<-648410765_ABC<-518335692_?
      738538439    URI->?->?->?->?-><-ParB-HTH+Prok-TUDOR*||XerD-><-?||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-?<-SFII-helicase                                                                              ParB-HTH+Prok-TUDOR         SP                        KV40_RS24315             289  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein [Myxosarcina sp. GI1].                                                                       <-738538426_?||738538428_?->738538430_URI->738538431_?->738538432_?->738538433_?->738538438_?-><-738538439_ParB-HTH+Prok-TUDOR*||738538562_XerD-><-738538440_?||738538442_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-738538445_?<-738538447_SFII-helicase<-738538449_?<-738538564_?
      752567372    <-XerD||ParB-HTH+Prok-TUDOR*->                                                                                                                                                                ParB-HTH+Prok-TUDOR         SP                        PCC7424_RS28605          289  bacteria>cyanobacteria                       Cyanothece sp. PCC 7424                                 hypothetical protein, partial [Cyanothece sp. PCC 7424].                                                          <-501601018_?||501601019_?->501601020_?->501601021_?-><-752567351_?||501601024_?-><-752567371_XerD||752567372_ParB-HTH+Prok-TUDOR*->501601027_?->501601028_?->752567352_?->
      768384071    HTH->ParB-HTH+Prok-TUDOR*->                                                                                                                                                                   ParB-HTH+Prok-TUDOR         -                         UH38_20050               289  bacteria>cyanobacteria                       Chroococcales cyanobacterium CENA595                    hypothetical protein UH38_20050 [Chroococcales cyanobacterium CENA595].                                           <-768384026_?||768384027_?->768384028_HTH->768384071_ParB-HTH+Prok-TUDOR*->768384029_?->768384030_?->768384031_?->768384032_?->768384033_?->768384034_?->768384035_?->
      505030514    <-RDRP||?->?->?-><-?<-?<-?||ParB-HTH+Prok-TUDOR*-><-HISKIN                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         ANACY_RS28045            288  bacteria>cyanobacteria                       Anabaena cylindrica                                     hypothetical protein [Anabaena cylindrica].                                                                       <-505030507_RDRP||505030508_?->505030509_?->505030510_?-><-505030511_?<-505030512_?<-505030513_?||505030514_ParB-HTH+Prok-TUDOR*-><-505030515_HISKIN||505030516_?-><-505030517_?<-505030518_?||755115646_?->505030520_?->505030521_?->
      753811080    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         NPUN_RS35730             288  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        <-501381676_?<-753811078_?<-501381678_?<-501381679_?||753811079_?->501381682_?->501381683_?-><-753811080_ParB-HTH+Prok-TUDOR*<-501381685_?<-501381686_?<-501381687_?||753811081_?->501381689_?->753811043_?->501381690_?->
      744452929    <-XerD||?-><-?<-?<-?||ParB-HTH+Prok-TUDOR*->                                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         DA73_0203765             287  bacteria>cyanobacteria                       Tolypothrix bouteillei VB521301                         hypothetical protein DA73_0203765 [Tolypothrix bouteillei VB521301].                                              <-744452910_?||744452911_?-><-744452912_XerD||744452913_?-><-744452914_?<-744452915_?<-744452928_?||744452929_ParB-HTH+Prok-TUDOR*->744452916_?->744452917_?-><-744452918_?<-744452919_?||744452920_?->744452921_?->744452922_?->
      752567338    <-ParB-HTH+Prok-TUDOR*<-?<-DCM                                                                                                                                                                ParB-HTH+Prok-TUDOR         -                         PCC7424_RS28050          287  bacteria>cyanobacteria                       Cyanothece sp. PCC 7424                                 hypothetical protein [Cyanothece sp. PCC 7424].                                                                   752567363_?->501600808_?-><-501600809_?<-501600810_?||501600811_?-><-752567337_?<-501600813_?<-752567338_ParB-HTH+Prok-TUDOR*<-752567339_?<-752567340_DCM||501600815_?-><-501600816_?<-501600817_?<-501600818_?<-501600819_?
      218175274    RdRP+RNaseH+RNaseH->?-><-?<-?||?-><-?<-?<-ParB-HTH+Prok-TUDOR*                                                                                                                                ParB-HTH+Prok-TUDOR         -                         PCC7424_5430             286  bacteria>cyanobacteria                       Cyanothece sp. PCC 7424                                 conserved hypothetical protein (plasmid) [Cyanothece sp. PCC 7424].                                               218175267_RdRP+RNaseH+RNaseH->218175268_?-><-218175269_?<-218175270_?||218175271_?-><-218175272_?<-218175273_?<-218175274_ParB-HTH+Prok-TUDOR*||218175275_?-><-218175276_?<-218175277_?<-218175278_?<-218175279_?<-218175280_?<-218175281_?
      748134961    <-DCM<-?<-?<-?||?->ParB-HTH->Primpol?->ParB-HTH+Prok-TUDOR*->                                                                                                                                 ParB-HTH+Prok-TUDOR         -                         QH73_RS02585             284  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 <-748134886_DCM<-748134887_?<-748134888_?<-748134889_?||748134890_?->748134960_ParB-HTH->748134891_Primpol?->748134961_ParB-HTH+Prok-TUDOR*-><-748134962_?||748134963_?->748134964_?->748134965_?->748134892_?->748134966_?->748134967_?->
      748137603    <-ParB-HTH+Prok-TUDOR*<-HTH<-HTH<-RecT                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         QH73_RS16405             282  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein [Scytonema millei].                                                                          748137524_?->748137525_?->748137526_?->748137601_?->748137527_?-><-748137602_?||748137528_?-><-748137603_ParB-HTH+Prok-TUDOR*<-748137529_HTH<-748137530_HTH<-748137604_RecT<-748137531_?||748137605_?->748137532_?->748137533_?->
      737857352    ParB->?->?->?->?-><-?<-ExoVII<-ParB-HTH+Prok-TUDOR*<-?<-XerD                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         CWATWH0003_RS03005       281  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     737857345_ParB->494519769_?->494519770_?->737857348_?->494519772_?-><-494519773_?<-494519774_ExoVII<-737857352_ParB-HTH+Prok-TUDOR*<-737857355_?<-737857358_XerD
      750617827    METHYLASE-><-?<-?<-?<-?<-ParB-HTH+Prok-TUDOR*||?->?-><-?||Peptidase_M10->                                                                                                                     ParB-HTH+Prok-TUDOR         SP                        XEN7305_RS25800          281  bacteria>cyanobacteria                       Xenococcus sp. PCC 7305                                 hypothetical protein [Xenococcus sp. PCC 7305].                                                                   493559047_?->493559048_?->493559049_METHYLASE-><-750617862_?<-493559051_?<-493559052_?<-493559053_?<-750617827_ParB-HTH+Prok-TUDOR*||493559055_?->493559056_?-><-493559057_?||750617865_Peptidase_M10-><-493559059_?||493559060_?-><-750617867_?
      494519775    XerD->?->ParB-HTH+Prok-TUDOR*->ExoVII->?-><-?<-?<-?<-?<-ParB                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         CWATWH0005_5327          280  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     546226988_XerD->494519776_?->494519775_ParB-HTH+Prok-TUDOR*->494519774_ExoVII->494519773_?-><-494519772_?<-546226991_?<-494519770_?<-494519769_?<-546226994_ParB
      497232044    TPR+CASPASE->?-><-?||?->?->?->Primpol?->ParB-HTH+Prok-TUDOR*->?->?->?->TerD->                                                                                                                 ParB-HTH+Prok-TUDOR         -                         CY51472DRAFT_RS0225515   280  bacteria>cyanobacteria                       Cyanothece                                              MULTISPECIES: hypothetical protein [Cyanothece].                                                                  501330960_TPR+CASPASE->497232050_?-><-497232049_?||497232048_?->497232047_?->501330961_?->497232045_Primpol?->497232044_ParB-HTH+Prok-TUDOR*->497232043_?->497232042_?->497232041_?->497232040_TerD->639855298_?->639855302_?->497232038_?->
      763118968    <-ParB-HTH+Prok-TUDOR*<-?<-?<-?<-RVT+HNH                                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         MICAB_RS03030            280  bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein [Microcystis aeruginosa].                                                                    488845132_?->488845134_?->488845136_?->763118967_?->488845140_?->488845142_?->488845144_?-><-763118968_ParB-HTH+Prok-TUDOR*<-488845148_?<-763118969_?<-488845152_?<-488845158_RVT+HNH||488845160_?->488845162_?->488845163_?->
      428013042    <-ParA<-ParA<-?<-HNH||Primpol?->ParB-HTH+Prok-TUDOR*-><-?<-?<-HNH||?->?->AAA-ATPase->                                                                                                         ParB-HTH+Prok-TUDOR         -                         Chro_5819                279  bacteria>cyanobacteria                       Chroococcidiopsis thermalis PCC 7203                    hypothetical protein Chro_5819 (plasmid) [Chroococcidiopsis thermalis PCC 7203].                                  428013035_?-><-428013036_?<-428013037_ParA<-428013038_ParA<-428013039_?<-428013040_HNH||428013041_Primpol?->428013042_ParB-HTH+Prok-TUDOR*-><-428013043_?<-428013044_?<-428013045_HNH||428013046_?->428013047_?->428013048_AAA-ATPase->428013049_?->
      494514224    <-METHYLASE<-SNF-helicase<-?<-ExoVII<-ParB-HTH+Prok-TUDOR*||?-><-?<-?<-?||?->ParB->                                                                                                           ParB-HTH+Prok-TUDOR         -                         CWATDRAFT_RS03435        279  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-494514218_?<-546220957_?<-494514219_?<-494514220_METHYLASE<-494514221_SNF-helicase<-494514222_?<-757157015_ExoVII<-494514224_ParB-HTH+Prok-TUDOR*||494514225_?-><-494514226_?<-494514227_?<-757157016_?||494514228_?->494514229_ParB-><-494514230_?
      494523801    <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                       ParB-HTH+Prok-TUDOR         -                         CWATWH0003_RS26285       279  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     494523799_?->546222412_?-><-494523800_?<-494518955_ExoVII<-494518954_?<-494523801_ParB-HTH+Prok-TUDOR*<-494523802_XerD
      497231939    ParB-><-?<-ExoVII<-?<-?<-?<-XerD<-ParB-HTH+Prok-TUDOR*||?-><-?<-?||TPR+CASPASE->                                                                                                              ParB-HTH+Prok-TUDOR         SP                        CY51472DRAFT_RS0223830   279  bacteria>cyanobacteria                       Cyanothece                                              MULTISPECIES: hypothetical protein [Cyanothece].                                                                  497231946_ParB-><-497231945_?<-639854853_ExoVII<-497231943_?<-497231942_?<-497231941_?<-737891485_XerD<-497231939_ParB-HTH+Prok-TUDOR*||497231938_?-><-497231937_?<-501330991_?||501330992_TPR+CASPASE->497231934_?->497231933_?-><-497231932_?
      497232068    Tox-HNH->?-><-ParB-HTH+Prok-TUDOR*||?->?->?->?->?->HNH->                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         CY51472DRAFT_RS0225395   279  bacteria>cyanobacteria                       Cyanothece                                              MULTISPECIES: hypothetical protein [Cyanothece].                                                                  <-497232077_?<-497232075_?||497232074_?-><-737891503_?<-497232071_?||497232070_Tox-HNH->497232069_?-><-497232068_ParB-HTH+Prok-TUDOR*||497232067_?->497232066_?->639855262_?->497232064_?->497232063_?->497232062_HNH->497232061_?->
      543428839    ParB-HTH+Prok-TUDOR*->ExoVII->?->SNF-helicase->METHYLASE->                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         CWATWH0401_4234          279  bacteria>cyanobacteria                       Crocosphaera watsonii WH 0401                           hypothetical protein CWATWH0401_4234 [Crocosphaera watsonii WH 0401].                                             <-543428838_?||543428839_ParB-HTH+Prok-TUDOR*->543428840_ExoVII->543428841_?->543428842_SNF-helicase->543428843_METHYLASE->543428844_?->543428845_?->543428846_?->
      737832178    Tox-HNH->?-><-ParB-HTH+Prok-TUDOR*                                                                                                                                                            ParB-HTH+Prok-TUDOR         SP                        CY0110_RS14620           279  bacteria>cyanobacteria                       Cyanothece sp. CCY0110                                  hypothetical protein [Cyanothece sp. CCY0110].                                                                    495551649_?-><-495551650_?||495551651_?-><-737832174_?<-495551652_?||495551653_Tox-HNH->737832175_?-><-737832178_ParB-HTH+Prok-TUDOR*
      515515560    <-DDE<-McrB+METHYLASE<-?<-URI<-?||ParB-HTH+Prok-TUDOR*-><-?<-?<-?<-?||DDE->                                                                                                                   ParB-HTH+Prok-TUDOR         -                         ANA7108_RS0100620        278  bacteria>cyanobacteria                       Anabaena sp. PCC 7108                                   hypothetical protein [Anabaena sp. PCC 7108].                                                                     755139934_?-><-515515555_?<-755139935_DDE<-515515557_McrB+METHYLASE<-755139938_?<-755139940_URI<-515515559_?||515515560_ParB-HTH+Prok-TUDOR*-><-648412157_?<-515515562_?<-515515563_?<-755139943_?||515515565_DDE->515515569_?-><-515515570_?
      740179759    <-XerD||?->?-><-XerD||HU-IHF-><-?<-ParB-HTH+Prok-TUDOR*<-HTH<-?||?->?->DDE->                                                                                                                  ParB-HTH+Prok-TUDOR         -                         SYN7509_RS0223705        278  bacteria>cyanobacteria                       Synechocystis sp. PCC 7509                              hypothetical protein [Synechocystis sp. PCC 7509].                                                                <-740179750_?<-497316315_XerD||740179753_?->497316313_?-><-740179756_XerD||655839688_HU-IHF-><-497316309_?<-740179759_ParB-HTH+Prok-TUDOR*<-740179762_HTH<-655839696_?||655839701_?->655839706_?->655839485_DDE->497316325_?->740179661_?->
      769921346    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         UH38_RS16060             278  bacteria>cyanobacteria                       Chroococcales cyanobacterium CENA595                    hypothetical protein [Chroococcales cyanobacterium CENA595].                                                      <-769921282_?<-769921343_?<-769921283_?||769921344_?->769921284_?->769921345_?->769921285_?-><-769921346_ParB-HTH+Prok-TUDOR*||769921286_?-><-769921287_?<-769921288_?<-769921289_?||769921347_?-><-769921290_?<-769921291_?
      748136693    <-DDE_Tnp_IS1<-ParB-HTH+Prok-TUDOR*                                                                                                                                                           ParB-HTH+Prok-TUDOR         -                         QH73_RS11110             276  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 748136506_?->748136507_?->748136508_?-><-748136509_?<-748136691_?<-748136692_?<-748136510_DDE_Tnp_IS1<-748136693_ParB-HTH+Prok-TUDOR*<-748136511_?<-748136694_?<-748136512_?||748136695_?-><-748136696_?<-748136513_?<-748136514_?
      501223295    <-ParB-HTH+Prok-TUDOR*<-?<-?<-?<-?||DDE_3-><-?||DDE_Tnp_1_2->                                                                                                                                 ParB-HTH+Prok-TUDOR         -                         MAE_RS14770              275  bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein [Microcystis aeruginosa].                                                                    <-501223289_?||488880879_?->501223290_?->501221134_?-><-501223292_?<-501223293_?||754188503_?-><-501223295_ParB-HTH+Prok-TUDOR*<-754188505_?<-501223296_?<-501220880_?<-501220879_?||501221980_DDE_3-><-501223297_?||754188508_DDE_Tnp_1_2->
      737188140    <-ParB-HTH+Prok-TUDOR*<-?<-?||?-><-?<-HISKIN                                                                                                                                                  ParB-HTH+Prok-TUDOR         -                         CAL7103_RS0120705        274  bacteria>cyanobacteria                       Calothrix sp. PCC 7103                                  hypothetical protein [Calothrix sp. PCC 7103].                                                                    648401412_?-><-518321992_?<-518321993_?<-518321994_?<-518321995_?<-518321996_?<-518321997_?<-737188140_ParB-HTH+Prok-TUDOR*<-518321999_?<-518322000_?||737188142_?-><-648401413_?<-518322003_HISKIN<-518322004_?||518322005_?->
      518327692    NACHT->?-><-?<-?<-ParB-HTH+Prok-TUDOR*||?-><-?<-?<-?<-CASPASE<-?||TPR+HD-RNase->                                                                                                              ParB-HTH+Prok-TUDOR         -                         CAL7103_RS0150440        273  bacteria>cyanobacteria                       Calothrix sp. PCC 7103                                  hypothetical protein [Calothrix sp. PCC 7103].                                                                    518327685_?-><-518327686_?<-518327687_?||648402072_NACHT->518327689_?-><-518327690_?<-518327691_?<-518327692_ParB-HTH+Prok-TUDOR*||518327693_?-><-518327694_?<-518327695_?<-518327696_?<-518327697_CASPASE<-518327698_?||518327699_TPR+HD-RNase->
      754536191    ParB-HTH+Prok-TUDOR*->?->?-><-?<-?<-?<-ParB<-XerD                                                                                                                                             ParB-HTH+Prok-TUDOR         -                         CYAN7822_RS33100         272  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 hypothetical protein, partial [Cyanothece sp. PCC 7822].                                                          754536188_?->503090766_?-><-503090767_?<-503090768_?||503090770_?-><-503090771_?<-503090772_?||754536191_ParB-HTH+Prok-TUDOR*->503090776_?->503090777_?-><-503090778_?<-503090780_?<-754536193_?<-754536196_ParB<-503090783_XerD
      126620031    Tox-HNH->?-><-ParB-HTH+Prok-TUDOR*<-?<-HNH                                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         CY0110_32445             271  bacteria>cyanobacteria                       Cyanothece sp. CCY0110                                  hypothetical protein CY0110_32445 [Cyanothece sp. CCY0110].                                                       126620024_?->126620025_?-><-126620026_?||126620027_?-><-126620028_?||126620029_Tox-HNH->126620030_?-><-126620031_ParB-HTH+Prok-TUDOR*<-126620032_?<-126620033_HNH
      495554039    ParB-HTH+Prok-TUDOR*->?-><-?<-?<-?<-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon                                                                                                                  ParB-HTH+Prok-TUDOR         -                         CY0110_RS25950           271  bacteria>cyanobacteria                       Cyanothece sp. CCY0110                                  hypothetical protein [Cyanothece sp. CCY0110].                                                                    <-495554029_?<-737833440_?||495554031_?-><-495554035_?<-737833445_?<-737833442_?<-495554038_?||495554039_ParB-HTH+Prok-TUDOR*->495554040_?-><-495554041_?<-495554042_?<-495554043_?<-737833447_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-495554045_?||495554046_?->
      737862397    <-ExoVII<-ParB-HTH+Prok-TUDOR*<-XerD<-DDE_Tnp_ISAZ013                                                                                                                                         ParB-HTH+Prok-TUDOR         -                         CWATWH0003_RS26330       269  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-494523803_?<-494523804_?||494523805_?-><-494523807_?<-494523808_?<-494523809_ExoVII<-737862397_ParB-HTH+Prok-TUDOR*<-494523813_XerD<-494523815_DDE_Tnp_ISAZ013
      751574024    XerD->?->?-><-ParB-HTH+Prok-TUDOR*||?-><-?||?->RVT+HNH->                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         SD81_RS35605             269  bacteria>cyanobacteria                       Tolypothrix campylonemoides                             hypothetical protein [Tolypothrix campylonemoides].                                                               751574020_XerD->751574022_?->751573928_?-><-751574024_ParB-HTH+Prok-TUDOR*||515883502_?-><-751573933_?||751573186_?->751574026_RVT+HNH-><-751566262_?<-751573935_?<-751573937_?
      738911651    Peptidase_M10->?-><-?||?->?-><-ParB-HTH||?-><-ParB-HTH+Prok-TUDOR*||XerD->                                                                                                                    ParB-HTH+Prok-TUDOR         -                         PLEUR7319_RS33990        266  bacteria>cyanobacteria                       Pleurocapsa sp. PCC 7319                                hypothetical protein [Pleurocapsa sp. PCC 7319].                                                                  738911648_Peptidase_M10->518333452_?-><-518333453_?||518333454_?->518333455_?-><-518333456_ParB-HTH||518333457_?-><-738911651_ParB-HTH+Prok-TUDOR*||738911654_XerD->518333460_?->518333461_?->738911661_?-><-738911478_?<-518333464_?<-518333465_?
      256592473    <-ExoVII<-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                          ParB-HTH+Prok-TUDOR         -                         Cyan8802_4571            265  bacteria>cyanobacteria                       Cyanothece sp. PCC 8802                                 hypothetical protein Cyan8802_4571 (plasmid) [Cyanothece sp. PCC 8802].                                           <-256592466_?<-256592467_?<-256592468_?<-256592469_?||256592470_?-><-256592471_?<-256592472_ExoVII<-256592473_ParB-HTH+Prok-TUDOR*<-256592474_XerD<-256592475_?||256592476_?-><-256592477_?||256592478_?-><-256592479_?<-256592480_?
      752825464    <-ParA<-ParA<-?<-HNH||Primpol?->ParB-HTH+Prok-TUDOR*->?-><-?<-?<-HNH||?->AAA-ATPase->                                                                                                         ParB-HTH+Prok-TUDOR         -                         CHRO_RS28535             265  bacteria>cyanobacteria                       Chroococcidiopsis thermalis                             hypothetical protein, partial [Chroococcidiopsis thermalis].                                                      504975986_?-><-504975987_?<-504975988_ParA<-752825462_ParA<-504975990_?<-504975991_HNH||752825463_Primpol?->752825464_ParB-HTH+Prok-TUDOR*->752825420_?-><-752825465_?<-504975995_?<-504975996_HNH||752825466_?->504975999_AAA-ATPase->504976000_?->
      17135837     ABC-><-?||ABC->?->ParB-HTH+Prok-TUDOR*->                                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         alr7299                  264  bacteria>cyanobacteria                       Nostoc sp. PCC 7120                                     alr7299 (plasmid) [Nostoc sp. PCC 7120].                                                                          17135830_?->17135831_?->17135832_?->17135833_ABC-><-17135834_?||17135835_ABC->17135836_?->17135837_ParB-HTH+Prok-TUDOR*->17135838_?->17135839_?-><-17135840_?<-17135841_?||17135842_?->17135843_?-><-17135844_?
      546220971    <-METHYLASE<-SNF-helicase<-?<-ExoVII<-ParB-HTH+Prok-TUDOR*                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         CWATWH8502_3723          264  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-546220957_?<-494514219_?<-546220961_?<-546220964_METHYLASE<-494514221_SNF-helicase<-546220966_?<-546220969_ExoVII<-546220971_ParB-HTH+Prok-TUDOR*
      748136747    TPR->?-><-?<-?<-?||?->?->ParB-HTH+Prok-TUDOR*->?->?->?->?->ABC->                                                                                                                              ParB-HTH+Prok-TUDOR         -                         QH73_RS12040             262  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 748136633_TPR->748136742_?-><-748136743_?<-748136744_?<-748136745_?||748136634_?->748136746_?->748136747_ParB-HTH+Prok-TUDOR*->748136748_?->748136635_?->748136636_?->748136637_?->748136638_ABC->748136639_?->748136640_?->
      748136457    RVT+HNH-><-?||?->?->ParB-HTH+Prok-TUDOR*->                                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         QH73_RS10490             260  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 <-748136454_?||748136126_?-><-748136455_?||748136382_RVT+HNH-><-748136456_?||748136394_?->748136395_?->748136457_ParB-HTH+Prok-TUDOR*->748136396_?->748136397_?->748136458_?->748136459_?->748136398_?->748136460_?->748136461_?->
      442790849    METHYLASE-><-?<-?<-?<-?<-ParB-HTH+Prok-TUDOR*||?->?-><-?||Peptidase_M10->                                                                                                                     ParB-HTH+Prok-TUDOR         -                         Xen7305DRAFT_00000510    258  bacteria>cyanobacteria                       Xenococcus sp. PCC 7305                                 hypothetical protein Xen7305DRAFT_00000510 [Xenococcus sp. PCC 7305].                                             442790842_?->442790843_?->442790844_METHYLASE-><-442790845_?<-442790846_?<-442790847_?<-442790848_?<-442790849_ParB-HTH+Prok-TUDOR*||442790850_?->442790851_?-><-442790852_?||442790853_Peptidase_M10-><-442790854_?||442790855_?-><-442790856_?
      738540774    <-ParB-HTH+Prok-TUDOR*||?-><-?<-?||XerD-><-?<-?<-DDE                                                                                                                                          ParB-HTH+Prok-TUDOR         -                         KV40_RS29900             258  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein [Myxosarcina sp. GI1].                                                                       <-738540756_?<-738540809_?<-738540759_?<-738540762_?<-738540765_?||738540768_?->738540771_?-><-738540774_ParB-HTH+Prok-TUDOR*||738540777_?-><-738540780_?<-738540783_?||738540812_XerD-><-738540785_?<-738540787_?<-738540789_DDE
      737132827    ParA->HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->ParB-><-?<-?||?->Relaxase->ParB-HTH+Prok-TUDOR*->?->HNH->?-><-?||?->?-><-HISKIN                                                               ParB-HTH+Prok-TUDOR         -                         FIS9431_RS31145          257  bacteria>cyanobacteria                       Fischerella sp. PCC 9431                                hypothetical protein [Fischerella sp. PCC 9431].                                                                  652319800_ParA->652319802_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->652319809_ParB-><-652319810_?<-652319812_?||652319813_?->737132824_Relaxase->737132827_ParB-HTH+Prok-TUDOR*->652319814_?->652319815_HNH->652319816_?-><-652319817_?||737132828_?->652319819_?-><-652319821_HISKIN
      737187200    DDE-><-?<-?<-?<-ParB-HTH+Prok-TUDOR*                                                                                                                                                          ParB-HTH+Prok-TUDOR         -                         CAL7103_RS0100030        257  bacteria>cyanobacteria                       Calothrix sp. PCC 7103                                  hypothetical protein, partial [Calothrix sp. PCC 7103].                                                           737187199_DDE-><-518317934_?<-518317935_?<-518317936_?<-737187200_ParB-HTH+Prok-TUDOR*||518317938_?->518317939_?->648400969_?->518317941_?->737187201_?->737187203_?->518317944_?->
      648361686    TPR+CASPASE-><-?<-?<-STYKIN<-NACHT||ParB-HTH+Prok-TUDOR*->ParB-HTH-><-?<-?<-?||XerD->                                                                                                         ParB-HTH+Prok-TUDOR         -                         PCC9339_RS0103785        253  bacteria>cyanobacteria                       Fischerella sp. PCC 9339                                hypothetical protein [Fischerella sp. PCC 9339].                                                                  737126431_?->737126434_?->737126438_TPR+CASPASE-><-515877411_?<-515877412_?<-737126440_STYKIN<-515877414_NACHT||648361686_ParB-HTH+Prok-TUDOR*->515877416_ParB-HTH-><-515877417_?<-515877418_?<-737126307_?||515877419_XerD->515877420_?-><-515877421_?
      740179430    <-XerD<-RecD||?-><-ParB-HTH+Prok-TUDOR*<-HTH||DDE-><-ParB-HTH<-ParB-HTH                                                                                                                       ParB-HTH+Prok-TUDOR         -                         SYN7509_RS26630          252  bacteria>cyanobacteria                       Synechocystis sp. PCC 7509                              hypothetical protein, partial [Synechocystis sp. PCC 7509].                                                       <-740179423_?<-740179426_?<-740179350_?<-497316171_?<-497316172_XerD<-740179427_RecD||655839503_?-><-740179430_ParB-HTH+Prok-TUDOR*<-740179432_HTH||497315944_DDE-><-740179434_ParB-HTH<-740179437_ParB-HTH<-497315734_?<-740179354_?||497316078_?->
      515383623    ParB-HTH+Prok-TUDOR*->                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         UYC_RS0133720            242  bacteria>cyanobacteria                       Chlorogloeopsis fritschii                               hypothetical protein [Chlorogloeopsis fritschii].                                                                 <-515383601_?||515383603_?-><-515390425_?<-515383607_?||648395863_?->648395864_?->515383620_?->515383623_ParB-HTH+Prok-TUDOR*->515383626_?-><-515383628_?<-515383630_?<-648395865_?<-515383636_?<-515383639_?<-515383641_?
      494523812    <-ExoVII<-?<-?<-ParB-HTH+Prok-TUDOR*<-XerD                                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         CWATWH0005_5485          238  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     494523805_?-><-494523806_?<-494523807_?<-494523808_?<-494523809_ExoVII<-494523810_?<-494523811_?<-494523812_ParB-HTH+Prok-TUDOR*<-546228489_XerD
      738540904    <-XerD||ParB-HTH+Prok-TUDOR*->                                                                                                                                                                ParB-HTH+Prok-TUDOR         -                         KV40_RS30300             234  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein [Myxosarcina sp. GI1].                                                                       738540898_?->738541035_?-><-738540900_?||738541038_?-><-738540902_?<-738541041_?<-738541044_XerD||738540904_ParB-HTH+Prok-TUDOR*->738540906_?->738540907_?->738540908_?->738540910_?->738540911_?->738540912_?-><-738540914_?
      764953510    ABC-><-?||ABC->?->ParB-HTH+Prok-TUDOR*->                                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         PCC7120DELTA_RS29870     230  bacteria>cyanobacteria                       Nostoc sp. PCC 7120                                     hypothetical protein, partial [Nostoc sp. PCC 7120].                                                              499309069_?->499309070_?->499309071_?->499309072_ABC-><-499309073_?||499309074_ABC->499309075_?->764953510_ParB-HTH+Prok-TUDOR*->764953405_?->499309078_?->764953409_?->499309080_?-><-499309084_?<-499309085_?||499309086_?->
      744453553    <-ParB-HTH+Prok-TUDOR*                                                                                                                                                                        ParB-HTH+Prok-TUDOR         SP                        DA73_0201705             229  bacteria>cyanobacteria                       Tolypothrix bouteillei VB521301                         hypothetical protein DA73_0201705, partial [Tolypothrix bouteillei VB521301].                                     <-744453520_?||744453521_?-><-744453522_?||744453523_?->744453524_?-><-744453525_?||744453526_?-><-744453553_ParB-HTH+Prok-TUDOR*<-744453527_?||744453528_?->744453529_?->744453530_?->
      738539875    <-ParA<-?<-?||?->HNH-><-ParB-HTH+Prok-TUDOR||?-><-ParB-HTH+Prok-TUDOR*<-?<-?||XerD->                                                                                                          ParB-HTH+Prok-TUDOR         -                         KV40_RS28185             224  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein [Myxosarcina sp. GI1].                                                                       <-738539861_ParA<-738539864_?<-738539995_?||738540017_?->738539867_HNH-><-738539870_ParB-HTH+Prok-TUDOR||738539872_?-><-738539875_ParB-HTH+Prok-TUDOR*<-738539878_?<-738540021_?||738540025_XerD-><-738539881_?<-738539884_?<-738539887_?<-738540029_?
      754535969    <-ParB-HTH*||?->?-><-METHYLASE                                                                                                                                                                ParB-HTH                    SP                        CYAN7822_RS31780         204  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 hypothetical protein, partial [Cyanothece sp. PCC 7822].                                                          <-503100216_?||503100217_?->503100218_?-><-503100219_?<-503100220_?<-754535782_?<-503100222_?<-754535969_ParB-HTH*||503100224_?->754535785_?-><-503100225_METHYLASE<-754535972_?||503100227_?->754535974_?-><-503100229_?
      740027662    ParA?->ParB-HTH*->                                                                                                                                                                            ParB-HTH                    -                         IF77_RS0136485           189  bacteria>actinobacteria                      Streptomyces sp. NRRL F-5008                            hypothetical protein, partial [Streptomyces sp. NRRL F-5008].                                                     664267443_?-><-664267445_?<-664267447_?||664267448_?->740027659_?->740027652_?->664267453_ParA?->740027662_ParB-HTH*-><-664267457_?<-740027664_?<-664267461_?||664267466_?->664267468_?-><-664267469_?
      739896924    ParA?->ParB-HTH*->                                                                                                                                                                            ParB-HTH                    -                         C593_RS30785             185  bacteria>actinobacteria                      Streptomyces sp. CNT372                                 hypothetical protein, partial [Streptomyces sp. CNT372].                                                          739896922_?->517680486_?->517680487_ParA?->739896924_ParB-HTH*-><-517680489_?
      754535993    Subtilisin->Subtilisin->?->?-><-DDE_3<-?||ParB-HTH*->                                                                                                                                         ParB-HTH                    -                         CYAN7822_RS32040         183  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 hypothetical protein, partial [Cyanothece sp. PCC 7822].                                                          <-503100252_?||503100253_Subtilisin->754535989_Subtilisin->503100255_?->503100256_?-><-754535991_DDE_3<-503100257_?||754535993_ParB-HTH*->754535996_?->503100259_?->503100260_?->503100261_?->754535807_?->754535809_?->754535812_?->
      738911416    <-XerD||ParB-HTH+Prok-TUDOR*->                                                                                                                                                                ParB-HTH+Prok-TUDOR         -                         PLEUR7319_RS33705        170  bacteria>cyanobacteria                       Pleurocapsa sp. PCC 7319                                hypothetical protein, partial [Pleurocapsa sp. PCC 7319].                                                         738911398_?->738911413_?-><-518333130_?<-518333131_?||518333132_?-><-648410481_XerD||738911416_ParB-HTH+Prok-TUDOR*-><-518333135_?||518333136_?-><-518333137_?||738911419_?->518333139_?->518333140_?-><-518333141_?
      753864872    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         STA7437_RS22520          166  bacteria>cyanobacteria                       Stanieria cyanosphaera                                  hypothetical protein, partial [Stanieria cyanosphaera].                                                           505024834_?->505024835_?-><-505024836_?||505024837_?->753864869_?->753864871_?->505024840_?-><-753864872_ParB-HTH*||753864821_?-><-505024843_?<-753864875_?<-505024845_?<-753864822_?<-505024847_?<-753864824_?
      763312164    NACHT-><-ParB-HTH*||?->?-><-?||CASPASE->?->?-><-ABC                                                                                                                                           ParB-HTH                    -                         OSC10802_RS39075         163  bacteria>cyanobacteria                       Oscillatoria sp. PCC 10802                              hypothetical protein, partial [Oscillatoria sp. PCC 10802].                                                       516325480_?-><-763312157_?<-763312160_?<-763312162_?<-648406499_?<-516325486_?||516325488_NACHT-><-763312164_ParB-HTH*||763312165_?->648406500_?-><-516325491_?||516325492_CASPASE->763311553_?->516325494_?-><-516325495_ABC
      357263645    Primpol?->ParB-HTH*->                                                                                                                                                                         ParB-HTH                    -                         CWATWH0003_2674t1        161  bacteria>cyanobacteria                       Crocosphaera watsonii WH 0003                           hypothetical protein CWATWH0003_2674t1, partial [Crocosphaera watsonii WH 0003].                                  357263644_Primpol?->357263645_ParB-HTH*->
      703170672    DDE-><-?||ParB-HTH+Prok-TUDOR*-><-?<-XerD||?-><-?<-?||CASPASE->CASPASE->                                                                                                                      ParB-HTH+Prok-TUDOR         -                         MAS10914_RS29250         158  bacteria>cyanobacteria                       Mastigocladopsis repens                                 hypothetical protein, partial [Mastigocladopsis repens].                                                          515883494_?->515883495_?-><-515883496_?<-515883498_?||515883500_?->515883501_DDE-><-515883502_?||703170672_ParB-HTH+Prok-TUDOR*-><-515883503_?<-515883504_XerD||703170675_?-><-515883506_?<-515883507_?||703170678_CASPASE->703170682_CASPASE->
      753864885    <-ParB-HTH*<-?||?->?->?-><-AAA-ATPase                                                                                                                                                         ParB-HTH                    -                         STA7437_RS22675          154  bacteria>cyanobacteria                       Stanieria cyanosphaera                                  hypothetical protein, partial [Stanieria cyanosphaera].                                                           <-505024864_?<-505024865_?<-753864827_?<-753864828_?<-753864884_?<-505024869_?<-505024870_?<-753864885_ParB-HTH*<-505024872_?||505024873_?->505024874_?->505024875_?-><-505024876_AAA-ATPase||505024877_?->505024878_?->
      763120073    DDE-><-?<-?<-?||ParB-HTH*->                                                                                                                                                                   ParB-HTH                    -                         MICAE_RS13740            154  bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein, partial [Microcystis aeruginosa].                                                           <-488871383_?<-488871384_?<-513846817_?||488871388_DDE-><-488871389_?<-488871390_?<-763120071_?||763120073_ParB-HTH*-><-488871392_?||488871393_?-><-763120072_?
      763350225    ParA->ParB->DDE_3->METHYLASE-><-HTH||ParB-HTH*-><-?||?->?->?->?->Peptidase_M10->Peptidase_M10->                                                                                               ParB-HTH                    -                         MC7420_RS19625           153  bacteria>cyanobacteria                       Coleofasciculus chthonoplastes                          hypothetical protein, partial [Coleofasciculus chthonoplastes].                                                   <-763350114_?||493033615_?->493033471_ParA->493033522_ParB->493033396_DDE_3->763350222_METHYLASE-><-493033334_HTH||763350225_ParB-HTH*-><-763350228_?||493033541_?->493033509_?->493033259_?->493033571_?->763350230_Peptidase_M10->493033595_Peptidase_M10->
      760034517    ParA->ParB-HTH*->NLPC-><-?||?->?->?->ABC->                                                                                                                                                    ParB-HTH                    -                         ON27_RS00260             150  bacteria>actinobacteria                      Nocardia asiatica                                       hypothetical protein, partial [Nocardia asiatica].                                                                760034450_?->760034453_?->760034456_?->760034458_?->760034460_?->760034462_?->760034465_ParA->760034517_ParB-HTH*->760034520_NLPC-><-760034522_?||760034467_?->760034470_?->760034472_?->760034475_ABC->760034525_?->
      696559281    <-ParB-HTH*<-ParA||NLPC->                                                                                                                                                                     ParB-HTH                    -                         FH09_RS0132835           147  bacteria>actinobacteria                      Nocardia seriolae                                       hypothetical protein, partial [Nocardia seriolae].                                                                696559274_?->696559275_?->696559276_?->696559277_?-><-696559281_ParB-HTH*<-696559282_ParA||696559283_NLPC-><-696559278_?||696559279_?-><-696559284_?||696559280_?->
      738540713    <-ParB-HTH*||Tox-HNH->                                                                                                                                                                        ParB-HTH                    -                         KV40_RS29710             147  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein, partial [Myxosarcina sp. GI1].                                                              738540609_?-><-738540612_?<-738540711_?<-738540615_?<-738540618_?<-738540621_?<-738540623_?<-738540713_ParB-HTH*||738540626_Tox-HNH->738540629_?->738540632_?->738540635_?->738540637_?->738540640_?->738540715_?->
      750579664    <-NLPC<-?||ParA->ParB-HTH*->                                                                                                                                                                  ParB-HTH                    -                         ON33_RS24890             144  bacteria>actinobacteria                      Nocardia niigatensis                                    hypothetical protein, partial [Nocardia niigatensis].                                                             750579613_?-><-750579614_?||750579660_?-><-750579661_?<-750579662_NLPC<-750579663_?||750579615_ParA->750579664_ParB-HTH*-><-750579616_?<-750579665_?<-750579617_?<-750579618_?||750579666_?->750579619_?-><-750579620_?
      754053711    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         PSE7367_RS19220          143  bacteria>cyanobacteria                       Pseudanabaena sp. PCC 7367                              hypothetical protein, partial [Pseudanabaena sp. PCC 7367].                                                       <-754052469_?<-504958988_?<-754053638_?||504959113_?-><-504959114_?||504959115_?-><-504959117_?<-754053711_ParB-HTH*<-754053640_?||754053641_?->754053713_?->504959122_?->504959123_?->754053642_?-><-504959125_?
      737153646    <-XerD||?->?-><-?<-ParB-HTH*                                                                                                                                                                  ParB-HTH                    -                         FIS9605_RS38655          141  bacteria>cyanobacteria                       Fischerella sp. PCC 9605                                hypothetical protein, partial [Fischerella sp. PCC 9605].                                                         652338671_?-><-737153643_?<-737153645_?<-652338672_XerD||652338673_?->652338674_?-><-652338675_?<-737153646_ParB-HTH*<-652338676_?<-652338677_?<-737153594_?<-652338678_?<-652338679_?<-652338680_?<-652338681_?
      737859558    Primpol?->ParB-HTH*->                                                                                                                                                                         ParB-HTH                    -                         CWATWH0003_RS12730       140  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein, partial [Crocosphaera watsonii].                                                            737859555_Primpol?->737859558_ParB-HTH*->
      738538560    <-ParB-HTH*<-?||?->URI->                                                                                                                                                                      ParB-HTH                    -                         KV40_RS24275             139  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein, partial [Myxosarcina sp. GI1].                                                              <-738538553_?||738538419_?->738538555_?->738538420_?->738538556_?-><-738538558_?<-738538422_?<-738538560_ParB-HTH*<-738538426_?||738538428_?->738538430_URI->738538431_?->738538432_?->738538433_?->738538438_?->
      737126563    ParB-HTH*->?->HNH->                                                                                                                                                                           ParB-HTH                    -                         PCC9339_RS35280          137  bacteria>cyanobacteria                       Fischerella sp. PCC 9339                                hypothetical protein, partial [Fischerella sp. PCC 9339].                                                         <-515877580_?<-648361739_?||515877582_?-><-648361741_?<-648361742_?<-515877586_?||737126559_?->737126563_ParB-HTH*->515877589_?->515877590_HNH->737126567_?->737126570_?->515877593_?->515877594_?->515877595_?->
      737126426    <-ParB-HTH*||?->?->?->?->TPR+CASPASE->                                                                                                                                                        ParB-HTH                    -                         PCC9339_RS35040          135  bacteria>cyanobacteria                       Fischerella sp. PCC 9339                                hypothetical protein, partial [Fischerella sp. PCC 9339].                                                         <-515877396_?<-515877397_?<-737126422_?||515877399_?-><-515877400_?||515877401_?->515877402_?-><-737126426_ParB-HTH*||515877404_?->737126429_?->737126431_?->737126434_?->737126438_TPR+CASPASE-><-515877411_?<-515877412_?
      738995333    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         IF64_RS0121970           133  bacteria>actinobacteria                      Prauserella rugosa                                      hypothetical protein, partial [Prauserella rugosa].                                                               <-663758541_?<-738995333_ParB-HTH*<-663758543_?<-663758544_?<-738995336_?<-663758546_?<-663758547_?<-663758548_?<-663758549_?
      737232780    <-P-loop<-?<-ParB-HTH*                                                                                                                                                                        ParB-HTH                    -                         Q362_RS21450             132  bacteria>proteobacteria>deltaproteobacteria  Desulfobulbus elongatus                                 hypothetical protein, partial [Desulfobulbus elongatus].                                                          <-654866285_?<-654866206_?<-654867413_?<-737232778_?<-654867414_P-loop<-737232779_?<-737232780_ParB-HTH*<-654867416_?<-737232781_?<-654867417_?<-654867418_?||737232782_?->654867419_?->654867420_?->
      738617228    ParA->ParB-HTH*->NLPC-><-?||?-><-?||ABC->                                                                                                                                                     ParB-HTH                    -                         K940_RS26470             131  bacteria>actinobacteria                      Nocardia sp. CNY236                                     hypothetical protein, partial [Nocardia sp. CNY236].                                                              <-738617225_?||655027042_?->738617078_?->655027043_?->655027044_?->738617081_?->655027045_ParA->738617228_ParB-HTH*->655027047_NLPC-><-655027048_?||655027049_?-><-655027050_?||738617231_ABC->655027051_?-><-655027052_?
      750252315    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         CFLAV_RS00535            131  bacteria>verrucomicrobia                     Pedosphaera parvula                                     hypothetical protein, partial [Pedosphaera parvula].                                                              494654662_?-><-494654663_?<-750252179_?||494654665_?->494654666_?->494654667_?-><-750252182_?||750252315_ParB-HTH*->494654670_?->494654671_?-><-494654672_?<-494654673_?<-494654675_?<-494654676_?<-494654677_?
      663146205    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    SP                        IF21_RS0142715           130  bacteria>actinobacteria                      Streptomyces capuensis                                  hypothetical protein, partial [Streptomyces capuensis].                                                           663146205_ParB-HTH*->
      748262016    METHYLASE-><-?||?-><-?||STYKIN->?-><-NLPC||ParB-HTH*->                                                                                                                                        ParB-HTH                    -                         ON31_RS25060             130  bacteria>actinobacteria                      Nocardia otitidiscaviarum                               hypothetical protein, partial [Nocardia otitidiscaviarum].                                                        748261919_METHYLASE-><-748261920_?||748261921_?-><-748261922_?||748262013_STYKIN->748261923_?-><-748262015_NLPC||748262016_ParB-HTH*-><-748261924_?<-748261925_?<-659884470_?<-748262017_?||748261926_?->748261927_?->748261928_?->
      759915784    <-ParB-HTH*||NLPC-><-?<-STYKIN||?-><-?||?-><-METHYLASE                                                                                                                                        ParB-HTH                    -                         NOTIT_RS39125            130  bacteria>actinobacteria                      Nocardia otitidiscaviarum                               hypothetical protein, partial [Nocardia otitidiscaviarum].                                                        <-659884466_?<-659884467_?<-659884468_?||659884469_?->659884470_?->659884471_?->659884472_?-><-759915784_ParB-HTH*||659884474_NLPC-><-659884475_?<-759915786_STYKIN||659884477_?-><-659884478_?||659884479_?-><-659884480_METHYLASE
      750537552    ParA->ParB-HTH*->NLPC->                                                                                                                                                                       ParB-HTH                    -                         ON40_RS04835             129  bacteria>actinobacteria                      Nocardia jiangxiensis                                   hypothetical protein, partial [Nocardia jiangxiensis].                                                            750537034_?-><-750537037_?<-750537039_?||750537549_?->750537042_?-><-750537044_?||750537045_ParA->750537552_ParB-HTH*->750537554_NLPC-><-750537556_?<-750537559_?<-750537047_?||750537049_?-><-750537051_?||750537052_?->
      760001072    ParA->ParB-HTH*->NLPC-><-?||?->?->?->ABC->                                                                                                                                                    ParB-HTH                    -                         ON19_RS00180             129  bacteria>actinobacteria                      Nocardia abscessus                                      hypothetical protein, partial [Nocardia abscessus].                                                               760000905_?->760000908_?->760000911_?->760000915_?->760000918_?->760000923_?->760001069_ParA->760001072_ParB-HTH*->760001075_NLPC-><-760001078_?||760001082_?->760000926_?->760000928_?->760001085_ABC->760001088_?->
      752791755    P-loop->?->?->?->?->ParB-HTH*->                                                                                                                                                               ParB-HTH                    -                         SYN6312_RS05530          128  bacteria>cyanobacteria                       Synechococcus sp. PCC 6312                              hypothetical protein, partial [Synechococcus sp. PCC 6312].                                                       504936766_?->504936767_?->504936768_P-loop->504936769_?->752791199_?->752791201_?->752791754_?->752791755_ParB-HTH*->504936774_?->504936775_?->504936776_?->504936777_?->752791756_?->752791203_?->504936779_?->
      753864865    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         STA7437_RS22355          128  bacteria>cyanobacteria                       Stanieria cyanosphaera                                  hypothetical protein, partial [Stanieria cyanosphaera].                                                           753864857_?->753864859_?->505024810_?->505024811_?-><-753864861_?||753864816_?->753864862_?-><-753864865_ParB-HTH*||505024816_?->505024817_?-><-753864817_?||505024818_?->505024819_?->505024820_?->505024821_?->
      750531062    <-NLPC||ParA->ParB-HTH*->                                                                                                                                                                     ParB-HTH                    -                         ON43_RS29550             127  bacteria>actinobacteria                      Nocardia concava                                        hypothetical protein, partial [Nocardia concava].                                                                 <-750530893_?<-750530895_?||750531058_?-><-750530897_?||750531059_?-><-750531061_NLPC||750530898_ParA->750531062_ParB-HTH*-><-750531064_?<-750531065_?<-750530901_?<-750531067_?||750531068_?->750530903_?->750530905_?->
      740087909    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         IO34_RS50180             119  bacteria>actinobacteria                      Streptosporangium roseum                                hypothetical protein, partial [Streptosporangium roseum].                                                         665605533_?->665605536_?-><-665605540_?<-665605542_?<-665605545_?||665605548_?->665605551_?->740087909_ParB-HTH*->740087912_?-><-665605560_?<-665605563_?||665605566_?->665605569_?->665605571_?->665605577_?->
      737153947    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         FIS9605_RS39545          118  bacteria>cyanobacteria                       Fischerella sp. PCC 9605                                hypothetical protein, partial [Fischerella sp. PCC 9605].                                                         <-652339343_?||652339344_?-><-652338464_?<-652339345_?<-652339346_?<-737153945_?||737153916_?->737153947_ParB-HTH*-><-652339348_?<-652339349_?<-652339350_?<-652334111_?<-652339351_?<-652339352_?||652339353_?->
      749816534    <-XerD<-ParB-HTH*                                                                                                                                                                             ParB-HTH                    -                         BEGALDRAFT_RS07320       118  bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba                                          hypothetical protein, partial [Beggiatoa alba].                                                                   488762020_?->488762021_?->488762022_?-><-488762024_?<-749816070_?<-488762025_?<-749816530_XerD<-749816534_ParB-HTH*<-488762035_?<-488762037_?<-488762039_?<-488762042_?||488762044_?->488762046_?->488762048_?->
      738450871    HISKIN->?-><-?||ParB-HTH*->                                                                                                                                                                   ParB-HTH                    -                         C789_RS15355             116  bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein, partial [Microcystis aeruginosa].                                                           488833846_?->488833849_?->488833851_HISKIN->488833853_?-><-738450868_?||738450871_ParB-HTH*->738450872_?->488833865_?->488833867_?->488833868_?->488833870_?->488833871_?->488833876_?->
      749816531    <-XerD<-ParB-HTH*                                                                                                                                                                             ParB-HTH                    -                         BEGALDRAFT_RS07180       114  bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba                                          hypothetical protein, partial [Beggiatoa alba].                                                                   <-488761975_?||488761978_?-><-749816065_?<-749816066_?||488761981_?->488761983_?-><-749816530_XerD<-749816531_ParB-HTH*<-488761989_?<-488761991_?<-488761994_?<-749816067_?<-488761996_?||749816532_?->488762000_?->
      749816550    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         BEGALDRAFT_RS07765       106  bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba                                          hypothetical protein, partial [Beggiatoa alba].                                                                   488762199_?->488762200_?->488762201_?-><-749816548_?<-488762203_?||749816549_?-><-488762206_?||749816550_ParB-HTH*-><-488762211_?||488762213_?->749816551_?->488762217_?->749816552_?->488762222_?->488762224_?->
      750077672    <-ParB-HTH*<-?<-?<-?<-?<-?<-RecT                                                                                                                                                              ParB-HTH                    -                         F784_RS22230             102  bacteria>deinococci                          Deinococcus apachensis                                  hypothetical protein, partial [Deinococcus apachensis].                                                           518415046_?->518415047_?-><-518415048_?<-518415049_?||518415050_?-><-518415051_?<-648640482_?<-750077672_ParB-HTH*<-518415055_?<-518415056_?<-518415057_?<-518415058_?<-518415059_?<-750077674_RecT<-518415061_?
      764930116    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         NOS7107_RS16750          100  bacteria>cyanobacteria                       Nostoc sp. PCC 7107                                     hypothetical protein, partial [Nostoc sp. PCC 7107].                                                              <-504927031_?<-504927032_?||504927033_?->504927034_?->504927035_?->504927036_?->504927037_?-><-764930116_ParB-HTH*||764930119_?->504927039_?-><-504923882_?||764930121_?->504927041_?-><-504927043_?||504927044_?->
      738452741    <-ParB-HTH*<-?||DDE_Tnp_1_2->                                                                                                                                                                 ParB-HTH                    -                         I546_RS16505             98   bacteria>actinobacteria                      Mycobacterium kansasii                                  hypothetical protein, partial [Mycobacterium kansasii].                                                           738447877_?-><-738452475_?<-738452477_?<-738452479_?<-738452481_?<-738452483_?<-738452485_?<-738452741_ParB-HTH*<-738452743_?||738448501_DDE_Tnp_1_2->738447877_?->738452486_?-><-738452745_?<-738452488_?<-738452490_?
      754047437    <-Relaxase<-?||?->?->?-><-ParB-HTH*<-ParA                                                                                                                                                     ParB-HTH                    -                         EMTOL_RS20635            97   bacteria>bacteroidetes                       Emticicia oligotrophica                                 hypothetical protein, partial [Emticicia oligotrophica].                                                          504839219_?-><-504839220_?<-754047436_Relaxase<-754047400_?||504839223_?->504839224_?->504839225_?-><-754047437_ParB-HTH*<-504839227_ParA||754047402_?->504839229_?->504839230_?->504839231_?->504839232_?->754047438_?->
      736383566    <-ParB-HTH*<-?<-?<-?<-HNH                                                                                                                                                                     ParB-HTH                    -                         H565_RS13795             96   bacteria>deinococci                          Deinococcus murrayi                                     hypothetical protein, partial [Deinococcus murrayi].                                                              <-653253874_?<-736383564_?<-653253875_?<-736383449_?<-653253877_?<-736383452_?<-653253879_?<-736383566_ParB-HTH*<-653253881_?<-736383569_?<-653253883_?<-736383572_HNH<-736383575_?<-736383455_?<-653253885_?
      750079283    <-ParB-HTH*<-?<-RNAse_T                                                                                                                                                                       ParB-HTH                    -                         Q424_RS15920             94   bacteria>deinococci                          Deinococcus                                             MULTISPECIES: hypothetical protein, partial [Deinococcus].                                                        <-658539106_?<-648446909_?<-646631546_?<-516480771_?<-516480772_?<-516480773_?<-516480774_?<-750079283_ParB-HTH*<-516480776_?<-658539109_RNAse_T<-516480778_?<-516480779_?<-516480780_?<-658539110_?<-516480782_?
      748248067    ParA->ParB-HTH*->NLPC-><-?<-?||?->ABC->                                                                                                                                                       ParB-HTH                    -                         ON21_RS29990             92   bacteria>actinobacteria                      Nocardia araoensis                                      hypothetical protein, partial [Nocardia araoensis].                                                               <-748248018_?||748248020_?-><-748248021_?||748248023_?-><-748248025_?<-748248065_?||748248027_ParA->748248067_ParB-HTH*->748248069_NLPC-><-748248029_?<-748248070_?||748248030_?->748248032_ABC->748248072_?-><-748248034_?
      750589548    ParA->ParB-HTH*->NLPC-><-?||?->?->ABC->                                                                                                                                                       ParB-HTH                    -                         ON29_RS29715             92   bacteria>actinobacteria                      Nocardia exalbida                                       hypothetical protein, partial [Nocardia exalbida].                                                                <-750589525_?||750589526_?->750589528_?->750589529_?->750589530_?->750589531_?->750589547_ParA->750589548_ParB-HTH*->750589550_NLPC-><-750589551_?||750589533_?->750589534_?->750589535_ABC->750589552_?->750589553_?->
      750595695    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         A3IC_RS57885             92   bacteria>actinobacteria                      Streptomyces scabrisporus                               hypothetical protein, partial [Streptomyces scabrisporus].                                                        750595635_?-><-750595694_?||648527278_?-><-648527279_?<-750595636_?<-648527280_?||522043141_ParA->750595695_ParB-HTH*->648527281_?-><-522043143_?<-522043144_?||522043145_?->522043146_?->750595696_?-><-522043148_?
      783222125    ParB-HTH*->?->P-loop->                                                                                                                                                                        ParB-HTH                    -                         VR65_RS19825             87   bacteria>proteobacteria>deltaproteobacteria  Desulfobulbaceae bacterium BRH_c16a                     hypothetical protein, partial [Desulfobulbaceae bacterium BRH_c16a].                                              <-783221951_?<-783221953_?||783221955_?->783221963_?->783221965_?->783222121_?->783221967_?->783222125_ParB-HTH*->783221970_?->783221973_P-loop->783221975_?->783221977_?->783221979_?->783221981_?->783221983_?->
      750503395    ParA->ParB-HTH*->NLPC-><-?<-?||?->ABC->                                                                                                                                                       ParB-HTH                    -                         ON34_RS28430             84   bacteria>actinobacteria                      Nocardia pneumoniae                                     hypothetical protein, partial [Nocardia pneumoniae].                                                              <-750503299_?||750503301_?->750503303_?->750503393_?->750503304_?->750503307_?->750503310_ParA->750503395_ParB-HTH*->750503400_NLPC-><-750503403_?<-750503313_?||750503314_?->750503407_ABC->750503409_?->750503411_?->
      749816938    ParB-HTH*->XerD->?-><-?||?-><-ParA                                                                                                                                                            ParB-HTH                    -                         BEGALDRAFT_RS17130       82   bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba                                          hypothetical protein, partial [Beggiatoa alba].                                                                   <-488779792_?<-488779796_?<-488779800_?||488779802_?->488779804_?->749816937_?->488779809_?->749816938_ParB-HTH*->749816939_XerD->488779813_?-><-488779816_?||488779820_?-><-488779822_ParA<-488779825_?<-488779827_?
      738659689    <-NLPC<-?*<-ParA                                                                                                                                                                              -                           SP                        D892_RS40665             80   bacteria>actinobacteria                      Nocardia sp. BMG51109                                   hypothetical protein, partial [Nocardia sp. BMG51109].                                                            <-640147789_?||640147792_?-><-640147795_?||640147797_?->738659684_?->640147803_?-><-738659687_NLPC<-738659689_?*<-640147810_ParA||640147813_?-><-738659692_?<-738658134_?||738659694_?->640147822_?->738659696_?->
      750304467    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         B156_RS30075             79   bacteria>bacteroidetes                       Spirosoma luteum                                        hypothetical protein, partial [Spirosoma luteum].                                                                 517452061_?->517452062_?->517452063_?-><-648569338_?||750304465_?->517452066_?->517452067_ParA->750304467_ParB-HTH*->517452069_?->517452070_?->517452071_?->517452072_?->517452073_?->517452074_?->517452075_?->
      737785149    <-ParB-HTH*<-ParA<-?||?->?->?->Mrr_cat-REase->                                                                                                                                                ParB-HTH                    -                         B056_RS38110             78   bacteria>actinobacteria                      Frankia sp. BCU110501                                   hypothetical protein, partial [Frankia sp. BCU110501].                                                            <-517329402_?<-517329403_?<-517329404_?<-517329405_?<-737785146_?||517329407_?->517329408_?-><-737785149_ParB-HTH*<-648548434_ParA<-737785138_?||517329414_?->737785152_?->737785154_?->517329417_Mrr_cat-REase->517329418_?->
      750404616    ParA->ParB-HTH*->NLPC->                                                                                                                                                                       ParB-HTH                    -                         ON44_RS04455             74   bacteria>actinobacteria                      Nocardia vinacea                                        hypothetical protein, partial [Nocardia vinacea].                                                                 <-750404615_?||750404607_?->750404608_ParA->750404616_ParB-HTH*->750404617_NLPC->750404609_?->750404618_?->750404610_?-><-750404611_?<-750404612_?<-750404619_?
      750463730    ParA->?*->NLPC->                                                                                                                                                                              -                           SP                        ON41_RS05075             74   bacteria>actinobacteria                      Nocardia transvalensis                                  hypothetical protein, partial [Nocardia transvalensis].                                                           750463727_?->750463728_?->750463729_?-><-750463710_?||750463712_?-><-750463714_?||750463716_ParA->750463730_?*->750463731_NLPC-><-750463717_?<-750463718_?<-750463732_?<-750463733_?
      763155621    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         MICAH_RS23570            71   bacteria>cyanobacteria                       Microcystis aeruginosa                                  hypothetical protein, partial [Microcystis aeruginosa].                                                           488887368_?->763155621_ParB-HTH*->
      # 13;                                                                                                                                                                                                                                                          
      389403119    zf-CHC2->ParB-HTH*->                                                                                                                                                                          ParB-HTH                    -                         DespoDRAFT_03587         228  bacteria>proteobacteria>deltaproteobacteria  Desulfobacter postgatei 2ac9                            hypothetical protein DespoDRAFT_03587 [Desulfobacter postgatei 2ac9].                                             389403112_?->389403113_?->389403114_?->389403115_?->389403116_?-><-389403117_?||389403118_zf-CHC2->389403119_ParB-HTH*->389403120_?-><-389403121_?||389403122_?-><-389403123_?||389403124_?->389403125_?->389403126_?->
      386428514    <-ABC<-?||?-><-?||?->?-><-XerD<-ParB-HTH*                                                                                                                                                     ParB-HTH                    -                         BegalDRAFT_1454          199  bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba B18LD                                    hypothetical protein BegalDRAFT_1454 [Beggiatoa alba B18LD].                                                      <-386428507_ABC<-386428508_?||386428509_?-><-386428510_?||386428511_?->386428512_?-><-386428513_XerD<-386428514_ParB-HTH*<-386428515_?<-386428516_?<-386428517_?<-386428518_?<-386428519_?<-386428520_?||386428521_?->
      386428542    <-XerD<-ParB-HTH*                                                                                                                                                                             ParB-HTH                    -                         BegalDRAFT_1483          199  bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba B18LD                                    hypothetical protein BegalDRAFT_1483 [Beggiatoa alba B18LD].                                                      386428535_?->386428536_?->386428537_?->386428538_?-><-386428539_?<-386428540_?<-386428541_XerD<-386428542_ParB-HTH*<-386428543_?<-386428544_?<-386428545_?<-386428546_?<-386428547_?||386428548_?->386428549_?->
      501880589    <-ParB-HTH*<-zf-CHC2                                                                                                                                                                          ParB-HTH                    -                         HRM2_RS03160             193  bacteria>proteobacteria>deltaproteobacteria  Desulfobacterium autotrophicum                          DNA methylase [Desulfobacterium autotrophicum].                                                                   752604261_?-><-506383132_?<-506383133_?<-752603663_?<-752604262_?<-752603629_?<-501880588_?<-501880589_ParB-HTH*<-501880590_zf-CHC2<-501880591_?<-506383135_?<-752604263_?<-752603664_?<-506383137_?||506383138_?->
      501881616    zf-CHC2->ParB-HTH*->                                                                                                                                                                          ParB-HTH                    -                         HRM2_RS03920             193  bacteria>proteobacteria>deltaproteobacteria  Desulfobacterium autotrophicum                          DNA methylase [Desulfobacterium autotrophicum].                                                                   <-506386491_?<-501878029_?||752604848_?->506386493_?->752604849_?->501880591_?->506386494_zf-CHC2->501881616_ParB-HTH*->501880784_?->752603629_?->506386495_?-><-506386498_?<-752604046_?<-506386500_?<-752604850_?
      506384528    zf-CHC2->ParB-HTH*->                                                                                                                                                                          ParB-HTH                    -                         HRM2_RS11855             193  bacteria>proteobacteria>deltaproteobacteria  Desulfobacterium autotrophicum                          DNA methylase [Desulfobacterium autotrophicum].                                                                   506384523_?->506384524_?->752604527_?->506384526_?->506384527_?->501880591_?->501880590_zf-CHC2->506384528_ParB-HTH*->506384529_?->752603629_?->752604528_?-><-506384531_?<-506384532_?<-506384533_?<-752604529_?
      506384753    zf-CHC2->ParB-HTH*->?->?-><-?<-?<-?<-ABC                                                                                                                                                      ParB-HTH                    -                         HRM2_RS12975             193  bacteria>proteobacteria>deltaproteobacteria  Desulfobacterium autotrophicum                          DNA methylase [Desulfobacterium autotrophicum].                                                                   <-752604070_?<-506386680_?<-506386681_?<-506386682_?<-752604884_?||506386684_?->506384754_zf-CHC2->506384753_ParB-HTH*->506386685_?->752603629_?-><-752604885_?<-506386687_?<-506386688_?<-752604886_ABC<-752604887_?
      748757961    <-DDE||ParB-HTH*->                                                                                                                                                                            ParB-HTH                    -                         DESPODRAFT_RS17490       192  bacteria>proteobacteria>deltaproteobacteria  Desulfobacter postgatei                                 DNA methylase [Desulfobacter postgatei].                                                                          490176783_?->490176786_?->490176788_?->490176791_?->490176793_?->490176795_?-><-748757960_DDE||748757961_ParB-HTH*->748757962_?->748757963_?->490176812_?-><-748757964_?<-748757650_?<-748757828_?||748758336_?->
      527022036    <-ParB-HTH*<-zf-CHC2                                                                                                                                                                          ParB-HTH                    -                         DSMV_RS01030             188  bacteria>proteobacteria>deltaproteobacteria  Desulfococcus multivorans                               hypothetical protein [Desulfococcus multivorans].                                                                 <-527022035_?<-527022036_ParB-HTH*<-527022037_zf-CHC2
      654515559    zf-CHC2->ParB-HTH*->                                                                                                                                                                          ParB-HTH                    -                         H167_RS0106170           188  bacteria>proteobacteria>deltaproteobacteria  delta proteobacterium PSCGC 5296                        hypothetical protein [delta proteobacterium PSCGC 5296].                                                          654515557_?->654515558_zf-CHC2->654515559_ParB-HTH*->654515560_?->654515561_?->654515769_?-><-654515770_?
      654517946    DDE->zf-CHC2->ParB-HTH*->                                                                                                                                                                     ParB-HTH                    -                         H169_RS0112525           188  bacteria>proteobacteria>deltaproteobacteria  delta proteobacterium PSCGC 5451                        hypothetical protein [delta proteobacterium PSCGC 5451].                                                          740511296_?-><-654517944_?||740511298_DDE->654515558_zf-CHC2->654517946_ParB-HTH*->654515560_?->654517947_?->654517948_?->
      571788483    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         OMM_03956                187  bacteria>proteobacteria>deltaproteobacteria  Candidatus Magnetoglobus multicellularis str. Araruama  DNA modification methylase family protein [Candidatus Magnetoglobus multicellularis str. Araruama].               571788483_ParB-HTH*->571788484_?->571788485_?-><-571788486_?
      506386923    zf-CHC2->ParB-HTH*->                                                                                                                                                                          ParB-HTH                    -                         HRM2_RS24175             183  bacteria>proteobacteria>deltaproteobacteria  Desulfobacterium autotrophicum                          DNA methylase [Desulfobacterium autotrophicum].                                                                   <-506386915_?<-752604933_?<-506386917_?||506386919_?-><-506386920_?||506386921_?->506386922_zf-CHC2->506386923_ParB-HTH*->506386924_?->506386925_?-><-752604934_?<-506386928_?||752604935_?->506386930_?->752604936_?->
      # 12;                                                                                                                                                                                                                                                          
      428682367    ParA->ParA->?-><-?<-HTH||ParB-HTH*->                                                                                                                                                          ParB-HTH                    -                         Anacy_5838               389  bacteria>cyanobacteria                       Anabaena cylindrica PCC 7122                            hypothetical protein Anacy_5838 (plasmid) [Anabaena cylindrica PCC 7122].                                         <-428682360_?<-428682361_?||428682362_ParA->428682363_ParA->428682364_?-><-428682365_?<-428682366_HTH||428682367_ParB-HTH*->428682368_?->428682369_?->428682370_?-><-428682371_?<-428682372_?<-428682373_?||428682374_?->
      505141386    <-RVT+HNH<-?||?->RVT+HNH-><-HTH||ParB-HTH*->?-><-?||?->DDE->                                                                                                                                  ParB-HTH                    -                         CYLST_RS31085            386  bacteria>cyanobacteria                       Cylindrospermum stagnale                                hypothetical protein [Cylindrospermum stagnale].                                                                  505141381_?->752562984_?-><-505141382_RVT+HNH<-505141383_?||505141384_?->505141382_RVT+HNH-><-505141385_HTH||505141386_ParB-HTH*->505141387_?-><-752562986_?||505141389_?->505141390_DDE->505141391_?->505141392_?->505141393_?->
      755115685    ParA->ParA->?-><-?<-HTH||ParB-HTH*->                                                                                                                                                          ParB-HTH                    -                         ANACY_RS28675            378  bacteria>cyanobacteria                       Anabaena cylindrica                                     hypothetical protein [Anabaena cylindrica].                                                                       <-755115707_?||755115708_?->505177176_ParA->505177177_ParA->505177178_?-><-505177179_?<-505177180_HTH||755115685_ParB-HTH*->505177182_?->505177183_?->505177184_?-><-505177185_?<-505177186_?<-505177187_?||505177188_?->
      515520582    <-ParB-HTH*||HTH->?-><-?<-ParA<-ParA||SNF-helicase->                                                                                                                                          ParB-HTH                    -                         ANA7108_RS0126375        356  bacteria>cyanobacteria                       Anabaena sp. PCC 7108                                   hypothetical protein [Anabaena sp. PCC 7108].                                                                     515520575_?-><-515520576_?<-515520577_?<-648412724_?<-515520579_?||755141323_?-><-515520581_?<-515520582_ParB-HTH*||515520583_HTH->515520584_?-><-515520585_?<-755140633_ParA<-755141324_ParA||515520588_SNF-helicase->515520589_?->
      504992580    RVT+HNH->HNH->?->RVT+HNH->?-><-ParB-HTH*||HTH->?->?->?->DDE_Tnp_ISAZ013->?-><-HISKIN                                                                                                          ParB-HTH                    -                         OSC7112_RS32535          333  bacteria>cyanobacteria                       Oscillatoria nigro-viridis                              hypothetical protein [Oscillatoria nigro-viridis].                                                                <-504992574_?||753868290_?->504987353_RVT+HNH->504992576_HNH->753868291_?->504992578_RVT+HNH->504992579_?-><-504992580_ParB-HTH*||753868292_HTH->753868293_?->504992583_?->753868294_?->504987920_DDE_Tnp_ISAZ013->753868295_?-><-504992584_HISKIN
      740464136    TPR+CASPASE->HISKIN->HISKIN->?->?-><-HTH||ParB-HTH*->                                                                                                                                         ParB-HTH                    -                         TOL9009_RS37215          330  bacteria>cyanobacteria                       [Scytonema hofmanni] UTEX B 1581                        hypothetical protein [[Scytonema hofmanni] UTEX B 1581].                                                          740464164_?->657929202_TPR+CASPASE->740464166_HISKIN->657929205_HISKIN->657929206_?->740464168_?-><-657929210_HTH||740464136_ParB-HTH*->657929212_?->657929213_?->657929215_?-><-740464169_?<-740464171_?<-657929217_?<-657929219_?
      751574204    <-DDE||?-><-?<-?<-HTH||ParB-HTH*->                                                                                                                                                            ParB-HTH                    -                         SD81_RS36485             330  bacteria>cyanobacteria                       Tolypothrix campylonemoides                             hypothetical protein [Tolypothrix campylonemoides].                                                               <-751574214_?<-751574216_?<-751574198_DDE||751574218_?-><-751574200_?<-751574202_?<-751574220_HTH||751574204_ParB-HTH*->751574206_?->751574207_?-><-751568714_?
      407266820    <-McrB||ParA->ParB->?->?-><-?<-HTH||ParB-HTH*->?->?-><-?<-?||?-><-AAA-ATPase                                                                                                                  ParB-HTH                    -                         FDUTEX481_04373          329  bacteria>cyanobacteria                       Tolypothrix sp. PCC 7601                                hypothetical protein FDUTEX481_04373 [Tolypothrix sp. PCC 7601].                                                  <-407266813_McrB||407266814_ParA->407266815_ParB->407266816_?->407266817_?-><-407266818_?<-407266819_HTH||407266820_ParB-HTH*->407266821_?->407266822_?-><-407266823_?<-407266824_?||407266825_?-><-407266826_AAA-ATPase||407266827_?->
      196179143    ParA->ParB-><-?||DDE_3->METHYLASE-><-HTH||ParB-HTH*-><-?||?->?->?->?->Peptidase_M10->Peptidase_M10->                                                                                          ParB-HTH                    -                         MC7420_4124              326  bacteria>cyanobacteria                       Coleofasciculus chthonoplastes PCC 7420                 hypothetical protein MC7420_4124 [Coleofasciculus chthonoplastes PCC 7420].                                       196179077_?->196179180_ParA->196179206_ParB-><-196179080_?||196179138_DDE_3->196179087_METHYLASE-><-196179110_HTH||196179143_ParB-HTH*-><-196179150_?||196179219_?->196179199_?->196179072_?->196179235_?->196179093_Peptidase_M10->196179247_Peptidase_M10->
      797212730    <-ABC<-?<-?||ParA->ParB-><-?<-HTH||ParB-HTH*->?->?->?-><-AAA-ATPase                                                                                                                           ParB-HTH                    -                         FDUTEX481_RS32380        302  bacteria>cyanobacteria                       Tolypothrix sp. PCC 7601                                hypothetical protein, partial [Tolypothrix sp. PCC 7601].                                                         <-797212658_ABC<-797212659_?<-797212660_?||797212661_ParA->797212662_ParB-><-797212663_?<-797212729_HTH||797212730_ParB-HTH*->797212664_?->797212665_?->797212666_?-><-797212667_AAA-ATPase||797212668_?->797212731_?-><-797212669_?
      737188608    <-HTH||ParB-HTH*->                                                                                                                                                                            ParB-HTH                    -                         CAL7103_RS0139200        297  bacteria>cyanobacteria                       Calothrix sp. PCC 7103                                  hypothetical protein, partial [Calothrix sp. PCC 7103].                                                           <-518325591_?<-518325592_?||737188605_?-><-518325594_?||648401859_?-><-518325597_?<-737188607_HTH||737188608_ParB-HTH*-><-518325600_?||518325601_?->518325602_?-><-737188609_?<-518325604_?||518325605_?-><-737188610_?
      737187623    HISKIN-><-?<-ABC<-?<-ParB-HTH*||HTH-><-?<-?<-?<-?<-?||STYKIN->                                                                                                                                ParB-HTH                    -                         CAL7103_RS51515          278  bacteria>cyanobacteria                       Calothrix sp. PCC 7103                                  hypothetical protein, partial [Calothrix sp. PCC 7103].                                                           737187620_?-><-518318456_?||518318457_?->737187621_HISKIN-><-518318459_?<-737187276_ABC<-518318461_?<-737187623_ParB-HTH*||648401019_HTH-><-648401020_?<-737187625_?<-518318467_?<-737187626_?<-518318469_?||737187628_STYKIN->
      # 9;                                                                                                                                                                                                                                                           
      499190814    <-ParB-HTH*||?-><-?<-?<-?<-?<-?<-HNH                                                                                                                                                          ParB-HTH                    -                         DR_1719                  287  bacteria>deinococci                          Deinococcus radiodurans                                 hypothetical protein [Deinococcus radiodurans].                                                                   <-15806715_?<-15806716_?||15806717_?->15806718_?-><-15806719_?<-15806720_?<-15806721_?<-499190814_ParB-HTH*||15806723_?-><-15806724_?<-15806725_?<-15806726_?<-15806727_?<-15806728_?<-15806729_HNH
      516480931    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         Q424_RS0102995           280  bacteria>deinococci                          Deinococcus                                             MULTISPECIES: hypothetical protein [Deinococcus].                                                                 <-516480924_?<-516480925_?<-646632761_?<-646632768_?||750079322_?-><-516480929_?<-516480930_?<-516480931_ParB-HTH*||516480932_?-><-760145289_?<-516480934_?<-516480935_?<-646632779_?||516480937_?->516480938_?->
      736351733    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         BS32_RS0115445           280  bacteria>deinococci                          Deinococcus radiodurans                                 hypothetical protein [Deinococcus radiodurans].                                                                   <-654876149_?||499190809_?->499190810_?-><-499190811_?<-499190812_?<-736324533_?<-736351733_ParB-HTH*||499190815_?->
      736394879    DDE-><-?<-?||ParB-HTH*->                                                                                                                                                                      ParB-HTH                    -                         Q322_RS0113730           275  bacteria>deinococci                          Deinococcus frigens                                     hypothetical protein [Deinococcus frigens].                                                                       736394882_DDE-><-657676348_?<-657676349_?||736394879_ParB-HTH*->657676351_?-><-736394883_?||657676353_?->736394885_?->657676355_?->657676356_?->657676357_?->
      736389644    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         Q319_RS0103110           274  bacteria>deinococci                          Deinococcus marmoris                                    hypothetical protein [Deinococcus marmoris].                                                                      <-657678463_?<-657678464_?<-657678466_?<-736389672_?<-657678468_?||736389675_?-><-657678470_?<-736389644_ParB-HTH*||657678472_?->657678473_?->
      746727627    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         QR90_RS12370             274  bacteria>deinococci                          Deinococcus swuensis                                    hypothetical protein [Deinococcus swuensis].                                                                      746727614_?->746727617_?->746727619_?->746727621_?->746730019_?->746727623_?-><-746727625_?||746727627_ParB-HTH*-><-746727629_?<-746727631_?<-746727633_?<-746730022_?<-746727635_?<-746730024_?||746730026_?->
      760094872    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         H564_RS20900             271  bacteria>deinococci                          Deinococcus ficus                                       hypothetical protein, partial [Deinococcus ficus].                                                                760094866_?-><-760094869_?<-653261436_?<-551072753_?<-760094715_?<-653261449_?<-653261454_?<-760094872_ParB-HTH*||653261460_?-><-653261465_?<-760094875_?<-653261469_?<-760094878_?<-760094881_?||653261475_?->
      760136477    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         DEINO_RS19540            271  bacteria>deinococci                          Deinococcus sp. 2009                                    hypothetical protein, partial [Deinococcus sp. 2009].                                                             760136474_?->760136476_?->654853319_?->760094875_?->551072738_?-><-551072740_?<-551072742_?||760136477_ParB-HTH*->551072746_?->551072748_?->654853321_?->551072751_?->551072753_?->551072755_?->760094869_?->
      653294307    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         T313_RS0115790           208  bacteria>deinococci                          Deinococcus radiodurans                                 hypothetical protein, partial [Deinococcus radiodurans].                                                          <-499190815_?||653294307_ParB-HTH*->
      # 7;                                                                                                                                                                                                                                                           
      499635648    ParB-HTH*->?->?->?-><-?||?-><-?<-NACHT                                                                                                                                                        ParB-HTH                    AAA_23                    AVA_RS26435              548  bacteria>cyanobacteria                       Anabaena variabilis                                     hypothetical protein [Anabaena variabilis].                                                                       <-499635640_?<-499635641_?<-499635644_?<-752818090_?<-499635645_?<-752818120_?||499635647_?->499635648_ParB-HTH*->752818121_?->499635650_?->499635651_?-><-499635652_?||499635654_?-><-499635655_?<-499635656_NACHT
      499308918    NACHT->                                                                                                                                                                                       -                           AAA_23                    PCC7120DELTA_RS29045     544  bacteria>cyanobacteria                       Nostoc sp. PCC 7120                                     hypothetical protein [Nostoc sp. PCC 7120].                                                                       499308911_NACHT-><-499308912_?<-499308913_?||499308914_?-><-499308915_?<-499308916_?<-764953466_?<-499308918_?*<-764953350_?||499308920_?->499308921_?->499308922_?->499308923_?->499308924_?->499308925_?->
      493212749    <-DDE<-?<-?<-?<-ParB-HTH*                                                                                                                                                                     ParB-HTH                    AAA_23                    NSP_RS09855              538  bacteria>cyanobacteria                       Nodularia spumigena                                     hypothetical protein [Nodularia spumigena].                                                                       <-493212742_?<-493212743_?<-493212744_?<-493212745_DDE<-493212746_?<-493212747_?<-493212748_?<-493212749_ParB-HTH*<-493212750_?<-493212751_?||493212753_?->493212755_?->493212756_?->493212757_?->493212758_?->
      501381627    Nostoc                                                                                                                                                                                        -                           DUF2869                   NPUN_RS35395             532  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        PCC
      499635690    <-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-?||?-><-?<-?<-?<-?||?*->?-><-?<-NACHT                                                                                                             -                           AAA_23                    AVA_RS26650              526  bacteria>cyanobacteria                       Anabaena variabilis                                     hypothetical protein [Anabaena variabilis].                                                                       <-499635683_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-752818124_?||499635685_?-><-499635686_?<-752818125_?<-499635688_?<-499635689_?||499635690_?*->752818126_?-><-499635692_?<-752818127_NACHT<-499635694_?<-499635695_?||499635696_?->499635697_?->
      501381520    Nostoc                                                                                                                                                                                        -                           Mitofilin                 NPUN_RS34800             526  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        PCC
      501381574    Peptidase_M10->                                                                                                                                                                               -                           Prominin                  NPUN_RS35095             524  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        501381565_Peptidase_M10-><-753811023_?||501381568_?->753811024_?-><-501381571_?<-501381572_?<-753811025_?<-501381574_?*<-501381575_?||501381577_?->501381578_?->753810958_?->501381580_?->501381581_?->753810959_?->
      # 7;                                                                                                                                                                                                                                                           
      494515216    <-ParB-HTH*||ABC->                                                                                                                                                                            ParB-HTH                    DUF3102                   CWATWH0005_3991          333  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-494520597_?<-494515216_ParB-HTH*||546223097_ABC->
      515877202    RVT+HNH-><-?<-?<-?<-?<-?<-?||ParB-HTH*->                                                                                                                                                      ParB-HTH                    SP+DUF3102                PCC9339_RS0102660        331  bacteria>cyanobacteria                       Fischerella sp. PCC 9339                                hypothetical protein [Fischerella sp. PCC 9339].                                                                  648361600_RVT+HNH-><-515877195_?<-515877196_?<-515877197_?<-515877198_?<-515877200_?<-648361604_?||515877202_ParB-HTH*->515877203_?->515877204_?->515877205_?->515877206_?->515877207_?->515877208_?->737126249_?->
      504967303    ParB-HTH*->?->?->ABC->                                                                                                                                                                        ParB-HTH                    DUF3102                   CHRO_RS11615             328  bacteria>cyanobacteria                       Chroococcidiopsis thermalis                             hypothetical protein [Chroococcidiopsis thermalis].                                                               <-504967295_?||504967296_?->504967297_?-><-752824248_?||504967299_?-><-504967300_?<-504967301_?||504967303_ParB-HTH*->504967304_?->504967305_?->504967306_ABC->752824249_?->752824857_?->752824858_?->752824859_?->
      748141416    <-ABC<-?<-?<-ParB-HTH*                                                                                                                                                                        ParB-HTH                    DUF3102                   QH73_RS39240             327  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein [Scytonema millei].                                                                          <-748142456_?<-748142457_?<-748142458_?<-748142459_?<-748141414_ABC<-748141415_?<-748142460_?<-748141416_ParB-HTH*||748141417_?->748141418_?-><-748141419_?||748141420_?-><-748142461_?<-748141421_?||748141422_?->
      499306042    <-ParB-HTH*||?->ABC->                                                                                                                                                                         ParB-HTH                    SP+DUF3102                PCC7120DELTA_RS15095     326  bacteria>cyanobacteria                       Nostoc sp. PCC 7120                                     hypothetical protein [Nostoc sp. PCC 7120].                                                                       <-499306035_?<-499306036_?<-764952782_?<-764951162_?<-499306039_?||499306040_?->499306041_?-><-499306042_ParB-HTH*||499306043_?->764952785_ABC-><-499306045_?||764951163_?->499306048_?->499306049_?->499306050_?->
      497233611    <-ABC<-?<-?||?->?-><-ABC<-ABC||ParB-HTH*->                                                                                                                                                    ParB-HTH                    DUF3102                   CY51472DRAFT_RS0220405   325  bacteria>cyanobacteria                       Cyanothece                                              MULTISPECIES: hypothetical protein [Cyanothece].                                                                  <-737891168_ABC<-497233604_?<-497233605_?||497233607_?->497233608_?-><-497233609_ABC<-497233610_ABC||497233611_ParB-HTH*-><-497233614_?<-497233615_?<-497233616_?<-497233617_?||497233618_?->497233619_?->501330428_?->
      505002935    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    DUF3102                   GLO7428_RS18185          321  bacteria>cyanobacteria                       Gloeocapsa sp. PCC 7428                                 hypothetical protein [Gloeocapsa sp. PCC 7428].                                                                   <-505002927_?<-754508026_?<-505002929_?<-754508640_?<-505002931_?<-754508641_?<-754508642_?||505002935_ParB-HTH*->505002936_?->505002937_?->505002938_?->505002939_?->505002940_?->505002941_?->505002942_?->
      # 7;                                                                                                                                                                                                                                                           
      559034988    ParA->ParB-HTH*-><-?<-ParA                                                                                                                                                                    ParB-HTH                    -                         M878_RS91610             330  bacteria>actinobacteria                      Streptomyces roseochromogenus                           hypothetical protein [Streptomyces roseochromogenus].                                                             559034980_?->665860281_?->739880871_?->665860283_?->559034984_?-><-559034985_?||665860286_ParA->559034988_ParB-HTH*-><-559034989_?<-739880872_ParA<-559034991_?<-559034992_?<-559034993_?<-559034994_?<-739880873_?
      514922043    SFII-helicase->?-><-?||ParA->?-><-ParB-HTH*<-ParA                                                                                                                                             ParB-HTH                    -                         STRAU_RS27165            323  bacteria>actinobacteria                      Streptomyces aurantiacus                                hypothetical protein [Streptomyces aurantiacus].                                                                  514922036_?->739810306_?->739810308_SFII-helicase->739810311_?-><-514922040_?||739810313_ParA->514922042_?-><-514922043_ParB-HTH*<-514922044_ParA
      703383829    ParB-HTH*-><-ParB<-ParA                                                                                                                                                                       ParB-HTH                    -                         H293_RS0145400           317  bacteria>actinobacteria                      Streptomyces canus                                      hypothetical protein [Streptomyces canus].                                                                        <-703383826_?||522154481_?->518968523_?->703383829_ParB-HTH*-><-703383827_ParB<-518968526_ParA||703383828_?-><-703383830_?<-518968529_?||655409978_?->518968531_?->
      751920075    ParA->ParB-HTH*-><-?<-ParA<-?<-Mrr_cat-REase||?->DDE->                                                                                                                                        ParB-HTH                    -                         SVTN_RS39960             311  bacteria>actinobacteria                      Streptomyces vietnamensis                               hypothetical protein, partial [Streptomyces vietnamensis].                                                        <-751919914_?<-751919915_?<-751920072_?<-751920073_?<-751920074_?<-751919916_?||751919917_ParA->751920075_ParB-HTH*-><-751919918_?<-751920076_ParA<-751919919_?<-751920077_Mrr_cat-REase||751920078_?->751920079_DDE-><-751919920_?
      654253752    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         H298_RS0133730           307  bacteria>actinobacteria                      Streptomyces sp. CNQ865                                 hypothetical protein [Streptomyces sp. CNQ865].                                                                   654253746_?->739888683_?->654253747_?->654253748_?->654253749_?->654253750_?-><-654253751_?||654253752_ParB-HTH*-><-654253753_?<-654253754_?||739888709_?-><-654253755_?<-739888710_?<-739888686_?<-654253758_?
      759461293    <-TrwC||HNH-><-?<-?<-?<-ParB-HTH*<-ParA<-?<-?<-?<-?<-?<-ASCH                                                                                                                                  ParB-HTH                    -                         IH54_RS0129315           306  bacteria>actinobacteria                      Streptomyces sp. NRRL F-5123                            hypothetical protein [Streptomyces sp. NRRL F-5123].                                                              <-671538860_?||759461292_?-><-759461305_TrwC||759461307_HNH-><-671538868_?<-671538870_?<-671538872_?<-759461293_ParB-HTH*<-671538876_ParA<-671538879_?<-671538881_?<-671538883_?<-759461309_?<-671538889_?<-671538891_ASCH
      663184423    Mrr_cat-REase-><-?||ParB-HTH*-><-?<-?||TrwC-><-?||TrwC->                                                                                                                                      ParB-HTH                    -                         OO47_RS31770             285  bacteria>actinobacteria                      Streptomyces bikiniensis                                hypothetical protein, partial [Streptomyces bikiniensis].                                                         702585104_?->702585107_?-><-663184414_?||663184417_Mrr_cat-REase-><-663184420_?||663184423_ParB-HTH*-><-663184426_?<-663184430_?||739772506_TrwC-><-663184437_?||702585117_TrwC->663184446_?-><-663184449_?
      # 5;                                                                                                                                                                                                                                                           
      755052115    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         TR46_RS22930             297  bacteria>actinobacteria                      Streptacidiphilus carbonis                              hypothetical protein [Streptacidiphilus carbonis].                                                                755052226_?->755052227_?->755052106_?->755052108_?->755052109_?->755052110_?->755052112_?-><-755052115_ParB-HTH*<-755052229_ParA<-755052118_?||755052120_?->755052122_?->755052125_?->755052126_?->755052128_?->
      702687171    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    SP                        OO55_RS21730             278  bacteria>actinobacteria                      Streptomyces griseus                                    hypothetical protein [Streptomyces griseus].                                                                      702687145_?->702687150_?->702687153_?->702687158_?->702687500_?->702687504_?->702687166_ParA->702687171_ParB-HTH*->702687175_?->702687178_?->702687182_?->702687185_?->702687188_?->702687192_?->702687196_?->
      759768668    DDE_Tnp_1_2->?-><-?<-RVT+HNH<-?||ParA->ParB-HTH*-><-?<-ASCH                                                                                                                                   ParB-HTH                    -                         BI06_RS43855             271  bacteria>actinobacteria                      Kitasatospora sp. MBT66                                 hypothetical protein [Kitasatospora sp. MBT66].                                                                   759752930_?->759752926_DDE_Tnp_1_2->759752923_?-><-759768662_?<-759768875_RVT+HNH<-759768665_?||759768878_ParA->759768668_ParB-HTH*-><-759768881_?<-759768883_ASCH<-759768670_?<-759768673_?||759768886_?-><-759768675_?<-759768678_?
      654253933    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         H298_RS0135225           259  bacteria>actinobacteria                      Streptomyces sp. CNQ865                                 hypothetical protein [Streptomyces sp. CNQ865].                                                                   739888835_?-><-739888841_?<-654253928_?<-654253929_?<-654253930_?<-654253931_?||654253932_ParA->654253933_ParB-HTH*-><-654253934_?<-654253935_?<-654253936_?<-654253937_?<-654253938_?<-654253939_?<-739888843_?
      654240343    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         B121_RS0125930           258  bacteria>actinobacteria                      Streptomyces sp. CNH099                                 hypothetical protein [Streptomyces sp. CNH099].                                                                   654240336_?->654240337_?->654240338_?->739933181_?-><-739933148_?||654240341_?->654240342_ParA->654240343_ParB-HTH*-><-654240410_?<-739933184_?<-739933187_?<-654240344_?<-654240345_?<-654240346_?<-739933189_?
      # 4;                                                                                                                                                                                                                                                           
      755027075    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         TR49_RS10465             378  bacteria>actinobacteria                      Streptacidiphilus melanogenes                           hypothetical protein [Streptacidiphilus melanogenes].                                                             <-755027065_?||755027067_?-><-755027070_?<-755027072_?||755027342_?->755027343_?->755027073_?->755027075_ParB-HTH*->755027078_?-><-755027080_?||755027083_?->755027085_?->755027095_?->755027098_?->755027347_?->
      755021339    <-DDE||?-><-?||?-><-ParB-HTH*                                                                                                                                                                 ParB-HTH                    -                         TR48_RS34600             363  bacteria>actinobacteria                      Streptacidiphilus neutrinimicus                         hypothetical protein [Streptacidiphilus neutrinimicus].                                                           755021332_?-><-755021333_DDE||755021334_?-><-739750968_?||755021337_?-><-755021339_ParB-HTH*<-755021340_?<-755021357_?<-755021358_?<-755021342_?<-755021345_?||755021347_?-><-755021349_?
      755026932    ParB-HTH*->?-><-?<-?<-HNH                                                                                                                                                                     ParB-HTH                    SP                        TR49_RS10120             363  bacteria>actinobacteria                      Streptacidiphilus melanogenes                           hypothetical protein [Streptacidiphilus melanogenes].                                                             755026918_?->755026921_?-><-755027301_?<-755026924_?||755027302_?->755026926_?->755026928_?->755026932_ParB-HTH*->755027305_?-><-755027308_?<-755026935_?<-755026937_HNH<-755026940_?||755027311_?-><-755026944_?
      755027016    <-ABC||?->?->?->?-><-?||ParB-HTH*->                                                                                                                                                           ParB-HTH                    -                         TR49_RS10340             363  bacteria>actinobacteria                      Streptacidiphilus melanogenes                           hypothetical protein [Streptacidiphilus melanogenes].                                                             <-755027008_?<-755027331_ABC||755027010_?->755027012_?->755027334_?->755027337_?-><-755027014_?||755027016_ParB-HTH*->755027339_?->755027019_?->755027021_?->755027023_?-><-755027025_?<-755027027_?||755027029_?->
      # 4;                                                                                                                                                                                                                                                           
      505031549    ParB-HTH*->?->?-><-HISKIN<-HISKIN<-HISKIN                                                                                                                                                     ParB-HTH                    -                         CYAN10605_RS03945        312  bacteria>cyanobacteria                       Cyanobacterium aponinum                                 hypothetical protein [Cyanobacterium aponinum].                                                                   505031542_?-><-505031543_?<-754511752_?||505031545_?-><-754511999_?<-505031547_?<-505031548_?||505031549_ParB-HTH*->505031550_?->505031551_?-><-505031552_HISKIN<-505031553_HISKIN<-505031554_HISKIN<-505031555_?||754512000_?->
      770470161    <-ParB-HTH*<-?<-?<-DnaJ                                                                                                                                                                       ParB-HTH                    -                         GM3708_3465              305  bacteria>cyanobacteria                       Geminocystis sp. NIES-3708                              hypothetical protein GM3708_3465 [Geminocystis sp. NIES-3708].                                                    770470154_?->770470155_?-><-770470156_?||770470157_?->770470158_?->770470159_?-><-770470160_?<-770470161_ParB-HTH*<-770470162_?<-770470163_?<-770470164_DnaJ||770470165_?->770470166_?-><-770470167_?||770470168_?->
      515865463    STYKIN->?->?-><-ParB-HTH*                                                                                                                                                                     ParB-HTH                    -                         SYN6308_RS19250          304  bacteria>cyanobacteria                       Geminocystis herdmanii                                  hypothetical protein [Geminocystis herdmanii].                                                                    <-648410422_?<-515865456_?<-515865457_?<-515865458_?||750163354_STYKIN->515865460_?->515865462_?-><-515865463_ParB-HTH*||515865464_?->750163356_?->515865466_?-><-515865467_?||515865468_?->648410423_?->515865470_?->
      770473153    HISKIN->?->?->?->?->?-><-?||ParB-HTH*->                                                                                                                                                       ParB-HTH                    -                         GM3709_2810              300  bacteria>cyanobacteria                       Geminocystis sp. NIES-3709                              hypothetical protein GM3709_2810 [Geminocystis sp. NIES-3709].                                                    770473146_HISKIN->770473147_?->770473148_?->770473149_?->770473150_?->770473151_?-><-770473152_?||770473153_ParB-HTH*-><-770473154_?||770473155_?->770473156_?->770473157_?-><-770473158_?<-770473159_?||770473160_?->
      # 4;                                                                                                                                                                                                                                                           
      759896111    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         OBACDRAFT_RS02835        287  bacteria>verrucomicrobia                     Diplosphaera colitermitum                               hypothetical protein [Diplosphaera colitermitum].                                                                 759896105_?->759896193_?-><-759896106_?<-759896107_?<-759896108_?||759896109_?->759896110_?->759896111_ParB-HTH*->759896112_?->759896113_?->759896114_?->759896115_?->759896116_?->759896117_?->759896118_?->
      759901356    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         OBACDRAFT_RS17200        283  bacteria>verrucomicrobia                     Diplosphaera colitermitum                               hypothetical protein [Diplosphaera colitermitum].                                                                 <-759901354_?<-759901356_ParB-HTH*<-759901359_?||759901361_?->759901362_?->759896106_?-><-759896185_?<-759901365_?<-759901367_?
      497194662    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         OPIT5_RS20290            280  bacteria>verrucomicrobia                     Opitutaceae bacterium TAV5                              hypothetical protein [Opitutaceae bacterium TAV5].                                                                <-494601358_?<-494601357_?<-497194668_?<-497194667_?<-497194666_?<-497194665_?||497194664_?->497194662_ParB-HTH*->497194661_?->497194660_?->497194659_?->497194658_?->497194657_?->497194656_?->497194655_?->
      645069929    <-ParB-HTH*<-?<-?<-?||?->?->ABC->                                                                                                                                                             ParB-HTH                    -                         OPIT5_RS27705            280  bacteria>verrucomicrobia                     Opitutaceae bacterium TAV5                              hypothetical protein [Opitutaceae bacterium TAV5].                                                                <-497194655_?<-497194656_?<-497194657_?<-497194658_?<-497194659_?<-497194660_?<-497194661_?<-645069929_ParB-HTH*<-497197845_?<-497197846_?<-497197847_?||497197848_?->497197849_?->494605492_ABC->497197850_?->
      # 4;                                                                                                                                                                                                                                                           
      504989405    <-STYKIN||?->?-><-?<-?||?->?->ParB-HTH*->                                                                                                                                                     ParB-HTH                    Mitofilin                 OSC7112_RS14000          284  bacteria>cyanobacteria                       Oscillatoria nigro-viridis                              hypothetical protein [Oscillatoria nigro-viridis].                                                                <-753867507_STYKIN||504992649_?->753866729_?-><-753867508_?<-504989402_?||504989403_?->504989404_?->504989405_ParB-HTH*->504989406_?->504989407_?->504989408_?->753867509_?->504989410_?->753867511_?->504989412_?->
      427981701    ParB-HTH*->?->?->?-><-?<-?<-?||STYKIN->                                                                                                                                                       ParB-HTH                    DUF885                    Ple7327_4170             282  bacteria>cyanobacteria                       Pleurocapsa sp. PCC 7327                                hypothetical protein Ple7327_4170 [Pleurocapsa sp. PCC 7327].                                                     <-427981694_?<-427981695_?<-427981696_?<-427981697_?||427981698_?->427981699_?->427981700_?->427981701_ParB-HTH*->427981702_?->427981703_?->427981704_?-><-427981705_?<-427981706_?<-427981707_?||427981708_STYKIN->
      516328499    DDE_Tnp_IS1->?->?->?->?->?->?->ParB-HTH*->                                                                                                                                                    ParB-HTH                    -                         OSC10802_RS0123170       257  bacteria>cyanobacteria                       Oscillatoria sp. PCC 10802                              hypothetical protein [Oscillatoria sp. PCC 10802].                                                                648406977_DDE_Tnp_IS1->763312939_?->763312942_?->516328495_?->516328496_?->516328497_?->516328498_?->516328499_ParB-HTH*->516328500_?->516328501_?->516328502_?->516328503_?->516328505_?->763312944_?->763312947_?->
      752746526    ParB-HTH*->?->?->?-><-?<-?<-?||STYKIN->                                                                                                                                                       ParB-HTH                    DUF885                    PLE7327_RS20050          255  bacteria>cyanobacteria                       Pleurocapsa minor                                       hypothetical protein [Pleurocapsa minor].                                                                         504958489_?-><-504958490_?<-752745292_?<-504958492_?<-504958493_?||504958495_?->504958496_?->752746526_ParB-HTH*->504958498_?->504958499_?->752746528_?-><-504958501_?<-504958502_?<-504958503_?||752746530_STYKIN->
      # 3;                                                                                                                                                                                                                                                           
      558889206    <-ParB-HTH*<-ParA?                                                                                                                                                                            ParB-HTH                    -                         M877_RS79680             346  bacteria>actinobacteria                      Streptomyces niveus                                     hypothetical protein [Streptomyces niveus].                                                                       665865665_?->558889203_?-><-665865667_?||558889205_?-><-558889206_ParB-HTH*<-665865668_ParA?<-739935100_?<-739935101_?<-558889210_?<-558889211_?<-739935102_?||558889213_?->
      498331267    <-ParB-HTH*<-ParA?                                                                                                                                                                            ParB-HTH                    -                         STREPS4_RS33205          345  bacteria>actinobacteria                      Streptomyces sp. S4                                     hypothetical protein [Streptomyces sp. S4].                                                                       498331253_?->498331255_?->764840581_?->498331260_?-><-764840583_?||498331263_?-><-498331265_?<-498331267_ParB-HTH*<-764840585_ParA?<-764840587_?<-498331273_?<-498331274_?<-498331276_?<-648267387_?||498331278_?->
      663309700    ParA?->ParB-HTH*->                                                                                                                                                                            ParB-HTH                    -                         IF45_RS0127995           343  bacteria>actinobacteria                      Streptomyces albidoflavus                               hypothetical protein [Streptomyces albidoflavus].                                                                 <-663309687_?||663309689_?->663309691_?->663309693_?->663309695_?->663309696_?->663309698_ParA?->663309700_ParB-HTH*-><-663309702_?<-663309703_?<-663309705_?<-739786609_?
      # 3;                                                                                                                                                                                                                                                           
      739808094    <-ParB-HTH*<-ParA<-?<-?<-?||?->?->URI->                                                                                                                                                       ParB-HTH                    SP                        IG72_RS0133105           328  bacteria>actinobacteria                      Streptomyces                                            MULTISPECIES: hypothetical protein [Streptomyces].                                                                739778362_?->662754654_?->662754655_?-><-739808094_ParB-HTH*<-739808096_ParA<-662754660_?<-662754663_?<-662754664_?||662754666_?->739808103_?->662754669_URI->
      505393262    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         F750_RS32695             326  bacteria>actinobacteria                      Streptomyces sp. PAMC26508                              hypothetical protein [Streptomyces sp. PAMC26508].                                                                <-505393252_?||505393253_?-><-505393255_?||753981584_?-><-505393258_?<-753981639_?||505393261_?->505393262_ParB-HTH*->505393263_?->753981587_?-><-505393265_?||753981589_?-><-753981641_?<-505393268_?<-505393269_?
      499350288    DDE_Tnp_1_2->DDE_Tnp_1_2-><-ParB-HTH*<-ParA||?-><-?||?-><-?<-DDE_Tnp_1_2                                                                                                                      ParB-HTH                    -                         SCP2.04c                 325  bacteria>actinobacteria                      Streptomyces coelicolor                                 hypothetical protein [Streptomyces coelicolor].                                                                   21233965_DDE_Tnp_1_2->21233966_DDE_Tnp_1_2-><-499350288_ParB-HTH*<-21233968_ParA||21233969_?-><-21233970_?||21233971_?-><-21233972_?<-21233973_DDE_Tnp_1_2<-21233974_?
      # 3;                                                                                                                                                                                                                                                           
      664543262    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         IH47_RS0134150           296  bacteria>actinobacteria                      Streptomyces sp. NRRL F-5702                            hypothetical protein [Streptomyces sp. NRRL F-5702].                                                              664543246_?-><-664543249_?||695837320_?-><-664543253_?<-695837322_?||664543257_?->664543260_?-><-664543262_ParB-HTH*<-664543266_?<-664543269_?||664543272_?->664543275_?->
      748778099    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         QR77_RS41180             292  bacteria>actinobacteria                      Streptomyces sp. 150FB                                  hypothetical protein [Streptomyces sp. 150FB].                                                                    748778199_?->748778097_?->748778200_?-><-748778201_?||748778202_?->748778203_?->748778098_?->748778099_ParB-HTH*-><-748778100_?||748778101_?-><-748778102_?||748778204_?->748778103_?-><-748778104_?||748778105_?->
      739996264    NLPC->?->?->?->?->?-><-ParB-HTH*                                                                                                                                                              ParB-HTH                    SP                        IG93_RS28555             288  bacteria>actinobacteria                      Streptomyces sp. NRRL F-6628                            hypothetical protein [Streptomyces sp. NRRL F-6628].                                                              739996257_?->739996259_NLPC->557418823_?->739996260_?->739996261_?->739996262_?->739996263_?-><-739996264_ParB-HTH*<-739996269_?<-739996265_?||739996270_?->
      # 3;                                                                                                                                                                                                                                                           
      501518403    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         ANAEK_RS11660            204  bacteria>proteobacteria>deltaproteobacteria  Anaeromyxobacter sp. K                                  hypothetical protein [Anaeromyxobacter sp. K].                                                                    <-499739874_?||501518397_?-><-501518398_?<-501518399_?||501518400_?-><-501518401_?<-501518402_?<-501518403_ParB-HTH*<-501518404_?||501518405_?-><-501518407_?<-501518408_?<-501518409_?<-501518410_?<-501518411_?
      501750516    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         A2CP1_RS12130            204  bacteria>proteobacteria>deltaproteobacteria  Anaeromyxobacter dehalogenans                           hypothetical protein [Anaeromyxobacter dehalogenans].                                                             <-499739874_?||501750493_?-><-501750497_?<-501750502_?||501750508_?-><-501750511_?<-501518402_?<-501750516_ParB-HTH*<-501750519_?||501750522_?-><-501518407_?<-501750538_?<-501750543_?<-501750549_?<-501750556_?
      775300647    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    HrpB2                     PSR1_03440               204  bacteria>proteobacteria>deltaproteobacteria  Anaeromyxobacter sp. PSR-1                              hypothetical protein PSR1_03440 [Anaeromyxobacter sp. PSR-1].                                                     775300640_?->775300641_?->775300642_?->775300643_?->775300644_?-><-775300645_?||775300646_?->775300647_ParB-HTH*->775300648_?->
      # 3;                                                                                                                                                                                                                                                           
      501117833    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         AM1_RS31280              140  bacteria>cyanobacteria                       Acaryochloris marina                                    hypothetical protein [Acaryochloris marina].                                                                      <-501117027_?<-501117825_?<-501117826_?<-501117827_?<-501117828_?||753958425_?->501117831_?-><-501117833_ParB-HTH*||501117834_?->753958545_?->501117838_?->501117839_?->501117840_?-><-501117841_?||753958547_?->
      501118686    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         AM1_RS35145              140  bacteria>cyanobacteria                       Acaryochloris marina                                    hypothetical protein [Acaryochloris marina].                                                                      <-501118694_?<-753959417_?<-501118692_?<-501118690_?<-501118689_?||753958425_?->501118687_?-><-501118686_ParB-HTH*||501118684_?->501118683_?->501118682_?-><-501118681_?||753959431_?->501118678_?-><-501118677_?
      501119208    DDE-><-?||?->?-><-?||ParB-HTH*->                                                                                                                                                              ParB-HTH                    -                         AM1_RS36990              140  bacteria>cyanobacteria                       Acaryochloris marina                                    hypothetical protein [Acaryochloris marina].                                                                      <-753960122_?||753960222_?->753958881_DDE-><-753960126_?||501119205_?->501119207_?-><-501119206_?||501119208_ParB-HTH*-><-501119209_?<-753960137_?<-753960225_?<-753960141_?<-501119214_?||501119215_?->501119216_?->
      # 2;                                                                                                                                                                                                                                                           
      546230520    ASCH+ParB-HTH+Prok-TUDOR*->                                                                                                                                                                   ASCH+ParB-HTH+Prok-TUDOR    ASCH                      CWATWH0005_934           498  bacteria>cyanobacteria                       Crocosphaera watsonii                                   Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Crocosphaera watsonii].          546230519_?->546230520_ASCH+ParB-HTH+Prok-TUDOR*->
      357263649    Primpol?->ASCH+ParB-HTH+Prok-TUDOR*->                                                                                                                                                         ASCH+ParB-HTH+Prok-TUDOR    ASCH                      CWATWH0003_2673b1        492  bacteria>cyanobacteria                       Crocosphaera watsonii WH 0003                           hypothetical protein CWATWH0003_2673b1, partial [Crocosphaera watsonii WH 0003].                                  357263647_?->357263648_Primpol?->357263649_ASCH+ParB-HTH+Prok-TUDOR*->
      # 2;                                                                                                                                                                                                                                                           
      664288086    <-ParB-HTH*<-ParA?                                                                                                                                                                            ParB-HTH                    -                         IG73_RS0132720           409  bacteria>actinobacteria                      Streptomyces halstedii                                  hypothetical protein [Streptomyces halstedii].                                                                    664288217_?->664288079_?->739855292_?->664288082_?->664288084_?-><-664288086_ParB-HTH*<-664288088_ParA?<-739855299_?<-739855301_?<-664288097_?<-664288100_?<-739855303_?||664288106_?->
      664363832    ParA?->ParB-HTH*->                                                                                                                                                                            ParB-HTH                    -                         IF95_RS0132470           409  bacteria>actinobacteria                      Streptomyces varsoviensis                               hypothetical protein [Streptomyces varsoviensis].                                                                 <-664363810_?||664363813_?->664363816_?->664363819_?->740118370_?->740118372_?->664363829_ParA?->664363832_ParB-HTH*-><-664363835_?<-664363837_?<-664363839_?<-664363842_?<-664363845_?
      # 2;                                                                                                                                                                                                                                                           
      497232009    ParB-HTH+Prok-TUDOR*->?->DDE_Tnp1->?->?->?-><-RDRP                                                                                                                                            ParB-HTH+Prok-TUDOR         -                         CY51472DRAFT_RS0223475   378  bacteria>cyanobacteria                       Cyanothece                                              MULTISPECIES: hypothetical protein [Cyanothece].                                                                  <-737891482_?<-639854764_?<-497232014_?<-497232013_?<-497232012_?<-497232011_?<-497232010_?||497232009_ParB-HTH+Prok-TUDOR*->497232008_?->497232007_DDE_Tnp1->497232006_?->501330974_?->501330975_?-><-497232003_RDRP||501330977_?->
      495553174    ParB-HTH+Prok-TUDOR*->?->?->DDE_Tnp1->                                                                                                                                                        ParB-HTH+Prok-TUDOR         SP                        CY0110_RS21885           371  bacteria>cyanobacteria                       Cyanothece sp. CCY0110                                  hypothetical protein [Cyanothece sp. CCY0110].                                                                    <-495553167_?<-495553168_?||495553169_?-><-495553170_?<-495553171_?<-495553172_?<-495553173_?||495553174_ParB-HTH+Prok-TUDOR*->737833010_?->495553176_?->495553177_DDE_Tnp1->495553178_?->
      # 2;                                                                                                                                                                                                                                                           
      727520012    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         AA75_RS00975             374  bacteria>actinobacteria                      Kitasatospora sp. MBT63                                 hypothetical protein [Kitasatospora sp. MBT63].                                                                   <-727520004_?<-727520015_?||727520007_?-><-727520018_?<-727520021_?<-727520024_?||727520029_ParA->727520012_ParB-HTH*->
      759753188    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         BI06_RS00860             374  bacteria>actinobacteria                      Kitasatospora sp. MBT66                                 hypothetical protein [Kitasatospora sp. MBT66].                                                                   759753168_?->759753171_?->759753592_?->759753175_?->759753178_?->759753182_?->759753185_?-><-759753188_ParB-HTH*<-759753594_ParA||759753596_?->759753598_?-><-759753193_?||759753601_?-><-759753197_?<-759753200_?
      # 2;                                                                                                                                                                                                                                                           
      306986392    <-ParB-HTH*||?-><-METHYLASE                                                                                                                                                                   ParB-HTH                    SP                        Cyan7822_6496            361  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 conserved hypothetical protein (plasmid) [Cyanothece sp. PCC 7822].                                               <-306986385_?||306986386_?->306986387_?-><-306986388_?<-306986389_?<-306986390_?<-306986391_?<-306986392_ParB-HTH*||306986393_?-><-306986394_METHYLASE<-306986395_?||306986396_?->306986397_?-><-306986398_?||306986399_?->
      306986431    Subtilisin->METHYLASE->?->?-><-?||ParB-HTH*->                                                                                                                                                 ParB-HTH                    -                         Cyan7822_6546            324  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 conserved hypothetical protein (plasmid) [Cyanothece sp. PCC 7822].                                               <-306986424_?<-306986425_?||306986426_Subtilisin->306986427_METHYLASE->306986428_?->306986429_?-><-306986430_?||306986431_ParB-HTH*->306986432_?->306986433_?->306986434_?-><-306986435_?||306986436_?-><-306986437_?<-306986438_?
      # 2;                                                                                                                                                                                                                                                           
      494514846    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         CWATDRAFT_RS06740        347  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-494514838_?||494514839_?-><-494514840_?<-494514841_?<-494514842_?||757157200_?-><-494513614_?<-494514846_ParB-HTH*||494520213_?-><-757157202_?<-757157203_?||494514850_?->494514851_?-><-494514852_?||494514853_?->
      494520212    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         CWATWH0003_RS05610       255  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-737857974_?<-494520212_ParB-HTH*
      # 2;                                                                                                                                                                                                                                                           
      697211997    Terminase_LS->ParB-HTH*->                                                                                                                                                                     ParB-HTH                    -                         N545_RS29500             346  bacteria>actinobacteria                      Streptomyces sp. URHA0041                               hypothetical protein [Streptomyces sp. URHA0041].                                                                 <-697211992_?<-697211993_?||697211994_?->697211995_?->697212008_?->697211996_Terminase_LS->697211997_ParB-HTH*->697212009_?->697211998_?->697211999_?-><-697212000_?<-697212001_?<-697212002_?<-697212010_?
      740047622    ParA->ParB-HTH*-><-?||TrwC->                                                                                                                                                                  ParB-HTH                    -                         CF54_RS37465             343  bacteria>actinobacteria                      Streptomyces sp. Tu 6176                                hypothetical protein [Streptomyces sp. Tu 6176].                                                                  740047610_?->740047611_?->740047613_?->740047614_?->740047616_?->740047619_?->740047620_ParA->740047622_ParB-HTH*-><-740047623_?||740047658_TrwC-><-740047625_?||740047628_?-><-740047631_?||740047633_?->740047660_?->
      # 2;                                                                                                                                                                                                                                                           
      663349955    HISKIN->?->?->?->?->?->?->ParB-HTH*-><-Mrr_cat-REase                                                                                                                                          ParB-HTH                    SP                        IG05_RS0139385           329  bacteria>actinobacteria                      Streptomyces sp. NRRL S-1022                            hypothetical protein [Streptomyces sp. NRRL S-1022].                                                              663349948_HISKIN->663349949_?->663349950_?->663349951_?->663349952_?->663349953_?->663349954_?->663349955_ParB-HTH*-><-663349956_Mrr_cat-REase<-663349957_?<-739965392_?||663349959_?->663349960_?->663349961_?-><-663349962_?
      664445691    Mrr_cat-REase-><-ParB-HTH*<-?<-?<-?<-?<-?<-ASCH                                                                                                                                               ParB-HTH                    -                         IH78_RS0156160           286  bacteria>actinobacteria                      Streptomyces sp. NRRL F-5140                            hypothetical protein [Streptomyces sp. NRRL F-5140].                                                              <-740023575_?<-664445688_?<-664445689_?||664445690_Mrr_cat-REase-><-664445691_ParB-HTH*<-740023578_?<-664445693_?<-664445694_?<-740023580_?<-740023582_?<-664445697_ASCH<-664445698_?
      # 2;                                                                                                                                                                                                                                                           
      740055334    SFII-helicase->?-><-?||?->?-><-TrwC<-ParB-HTH*<-ParA<-?<-?<-?<-?<-?<-HISKIN                                                                                                                   ParB-HTH                    -                         BS72_RS00040             313  bacteria>actinobacteria                      Streptomyces yeochonensis                               hypothetical protein [Streptomyces yeochonensis].                                                                 740055326_SFII-helicase->740055328_?-><-740055413_?||740055330_?->740055415_?-><-740055418_TrwC<-740055334_ParB-HTH*<-740055337_ParA<-740055340_?<-740055343_?<-740055346_?<-740055349_?<-740055351_?<-740055355_HISKIN
      755076504    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         TR50_RS28230             313  bacteria>actinobacteria                      Streptacidiphilus anmyonensis                           hypothetical protein [Streptacidiphilus anmyonensis].                                                             755076484_?->755076487_?->755076604_?->755076491_?->755076494_?->755076498_?->755076501_ParA->755076504_ParB-HTH*->755076608_?->755076507_?->755076510_?->755076611_?-><-755076513_?||755076516_?->755076520_?->
      # 2;                                                                                                                                                                                                                                                           
      664512363    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         IO31_RS0138135           308  bacteria>actinobacteria                      Streptomyces                                            MULTISPECIES: hypothetical protein [Streptomyces].                                                                739974938_?->664512824_?->664512827_?->664512830_?->739974940_?->664512367_?->739974942_?-><-664512363_ParB-HTH*<-664512361_ParA||665655926_?->664512357_?->664512355_?->664512353_?->664512351_?->664512348_?->
      663148255    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         IE94_RS0124715           242  bacteria>actinobacteria                      Streptomyces violaceorubidus                            hypothetical protein [Streptomyces violaceorubidus].                                                              663148232_?->663148235_?->663148238_?->663148243_?->663148246_?->740073938_?->740073972_?-><-663148255_ParB-HTH*<-663148257_ParA||663148259_?->663148262_?->663148265_?->663148268_?->663148271_?->663148273_?->
      # 2;                                                                                                                                                                                                                                                           
      736303905    ParB->RNAse_T->?->ParB-HTH*->                                                                                                                                                                 ParB-HTH                    -                         Q371_RS01470             298  bacteria>deinococci                          Deinococcus misasensis                                  hypothetical protein [Deinococcus misasensis].                                                                    <-736303875_?||736303881_?->736303885_?->736303889_?->736304540_ParB->736303894_RNAse_T->736303897_?->736303905_ParB-HTH*->736303908_?->736303915_?->736303920_?->736303924_?->736303928_?->736304544_?->736304548_?->
      736313124    <-ParB-HTH*<-?<-?<-RNAse_T                                                                                                                                                                    ParB-HTH                    -                         Q371_RS10895             277  bacteria>deinococci                          Deinococcus misasensis                                  hypothetical protein [Deinococcus misasensis].                                                                    <-736313104_?<-736313106_?<-736313109_?<-736313112_?<-736313115_?<-736313118_?<-736313121_?<-736313124_ParB-HTH*<-736313126_?<-736313127_?<-736313290_RNAse_T<-736313129_?<-736313131_?<-736313132_?<-736313133_?
      # 2;                                                                                                                                                                                                                                                           
      671527277    <-TrwC<-ParB-HTH*<-ParA                                                                                                                                                                       ParB-HTH                    -                         IF47_RS0126170           295  bacteria>actinobacteria                      Streptomyces megasporus                                 hypothetical protein [Streptomyces megasporus].                                                                   <-739882773_?<-671527271_?||671527272_?->671527273_?-><-671527274_?||671527275_?-><-739882774_TrwC<-671527277_ParB-HTH*<-671527278_ParA||671527279_?-><-671527280_?
      662754816    SFII-helicase->?-><-ParB-HTH*<-ParA<-?<-DDE                                                                                                                                                   ParB-HTH                    -                         IA22_RS0132355           288  bacteria>actinobacteria                      [Kitasatospora] papulosa                                hypothetical protein [[Kitasatospora] papulosa].                                                                  <-662754806_?<-662754808_?<-662754809_?<-662754810_?<-662754812_?||740441354_SFII-helicase->662754815_?-><-662754816_ParB-HTH*<-662754817_ParA<-662754818_?<-505392728_DDE<-662754819_?<-662754820_?<-662754821_?<-662754822_?
      # 2;                                                                                                                                                                                                                                                           
      291531542    <-SFII-helicase<-?<-?<-?<-ParB-HTH*                                                                                                                                                           ParB-HTH                    DUF3102                   EUS_21090                291  bacteria>firmicutes                          [Eubacterium] siraeum 70/3                              Protein of unknown function (DUF3102) [[Eubacterium] siraeum 70/3].                                               <-291531535_?<-291531536_?<-291531537_?<-291531538_SFII-helicase<-291531539_?<-291531540_?<-291531541_?<-291531542_ParB-HTH*<-291531543_?<-291531544_?<-291531545_?<-291531546_?<-291531547_?<-291531548_?<-291531549_?
      491496822    <-SFII-helicase<-?<-?<-?<-ParB-HTH*                                                                                                                                                           ParB-HTH                    DUF3102                   G397_RS0111230           291  bacteria>firmicutes                          [Eubacterium] siraeum                                   hypothetical protein [[Eubacterium] siraeum].                                                                     <-518492507_?<-491496813_?<-491496815_?<-491496816_SFII-helicase<-491496818_?<-769258155_?<-491496821_?<-491496822_ParB-HTH*<-491496825_?<-491496826_?<-491496832_?<-491496834_?<-491496840_?||491496842_?->491496845_?->
      # 2;                                                                                                                                                                                                                                                           
      490596705    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         LEP1GSC048_RS07865       286  bacteria>spirochaetes                        Leptospira santarosai                                   hypothetical protein [Leptospira santarosai].                                                                     <-490596713_?<-490596712_?<-490596711_?<-490596709_?<-490596708_?<-490596707_?<-490596706_?<-490596705_ParB-HTH*<-490596704_?<-490596703_?<-490596702_?<-696346182_?<-490596699_?<-490596698_?||696346167_?->
      446063157    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         LEP1GSC034_RS113505      285  bacteria>spirochaetes                        Leptospira interrogans                                  hypothetical protein [Leptospira interrogans].                                                                    <-446521022_?<-447117837_?<-696579207_?<-447008490_?<-447185333_?<-446587050_?<-487902815_?<-446063157_ParB-HTH*<-446991473_?<-487902812_?<-447170069_?<-446586997_?<-446992945_?<-446505006_?||446577653_?->
      # 2;                                                                                                                                                                                                                                                           
      443331859    HISKIN->?-><-?<-?||ParB-HTH*->                                                                                                                                                                ParB-HTH                    SP                        C789_3692                285  bacteria>cyanobacteria                       Microcystis aeruginosa DIANCHI905                       hypothetical protein C789_3692 [Microcystis aeruginosa DIANCHI905].                                               443331853_?->443331854_?->443331855_HISKIN->443331856_?-><-443331857_?<-443331858_?||443331859_ParB-HTH*->443331860_?->443331861_?->443331862_?->443331863_?->443331864_?->443331865_?-><-443331866_?
      159026604    <-ParB-HTH*||?->?-><-?||?-><-HISKIN                                                                                                                                                           ParB-HTH                    SP                        IPF_3218                 274  bacteria>cyanobacteria                       Microcystis aeruginosa PCC 7806                         unnamed protein product [Microcystis aeruginosa PCC 7806].                                                        <-159026604_ParB-HTH*||159026605_?->159026606_?-><-159026607_?||159026608_?-><-159026609_HISKIN<-159026610_?<-159026611_?
      # 2;                                                                                                                                                                                                                                                           
      428272064    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         Sta7437_4542             278  bacteria>cyanobacteria                       Stanieria cyanosphaera PCC 7437                         hypothetical protein Sta7437_4542 (plasmid) [Stanieria cyanosphaera PCC 7437].                                    428272057_?->428272058_?->428272059_?->428272060_?-><-428272061_?||428272062_?->428272063_?-><-428272064_ParB-HTH*||428272065_?->428272066_?->428272067_?->428272068_?->428272069_?->428272070_?->428272071_?->
      428272125    <-ParB-HTH*<-?||?->?->?-><-AAA-ATPase                                                                                                                                                         ParB-HTH                    -                         Sta7437_4607             228  bacteria>cyanobacteria                       Stanieria cyanosphaera PCC 7437                         hypothetical protein Sta7437_4607 (plasmid) [Stanieria cyanosphaera PCC 7437].                                    <-428272118_?<-428272119_?<-428272120_?<-428272121_?<-428272122_?<-428272123_?<-428272124_?<-428272125_ParB-HTH*<-428272126_?||428272127_?->428272128_?->428272129_?-><-428272130_AAA-ATPase||428272131_?->428272132_?->
      # 2;                                                                                                                                                                                                                                                           
      518337503    TPR+CASPASE-><-?||?->?->?-><-?||?->ParB-HTH*-><-?<-?||?->Peptidase_M10->                                                                                                                      ParB-HTH                    -                         PLEUR7319_RS0123660      254  bacteria>cyanobacteria                       Pleurocapsa sp. PCC 7319                                hypothetical protein [Pleurocapsa sp. PCC 7319].                                                                  738913802_TPR+CASPASE-><-518337497_?||518337498_?->648410984_?->648410985_?-><-518337501_?||518337502_?->518337503_ParB-HTH*-><-518337504_?<-738913804_?||518337506_?->738913807_Peptidase_M10->648410986_?->518337509_?->738913809_?->
      493559029    <-ParB-HTH*||?-><-?||?->?-><-?<-?<-HISKIN                                                                                                                                                     ParB-HTH                    -                         XEN7305_RS25675          252  bacteria>cyanobacteria                       Xenococcus sp. PCC 7305                                 hypothetical protein [Xenococcus sp. PCC 7305].                                                                   493559020_?->493559022_?->750617818_?-><-493559024_?<-493559026_?<-493559027_?||493559028_?-><-493559029_ParB-HTH*||750617821_?-><-493559031_?||750617853_?->750617856_?-><-493559034_?<-493559035_?<-493559036_HISKIN
      # 2;                                                                                                                                                                                                                                                           
      664188154    ParB-HTH*->?->?-><-?||?->?-><-?<-HISKIN                                                                                                                                                       ParB-HTH                    -                         IF88_RS0135010           228  bacteria>actinobacteria                      Streptomyces sp. NRRL F-2580                            hypothetical protein [Streptomyces sp. NRRL F-2580].                                                              664188133_?-><-664188136_?<-664188139_?||664188142_?-><-664188145_?<-664188148_?||664188151_?->664188154_ParB-HTH*->664188157_?->664188160_?-><-664188162_?||664188165_?->664188168_?-><-664188171_?<-664188174_HISKIN
      759522371    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         STRVI_RS46070            214  bacteria>actinobacteria                      Streptomyces violaceusniger                             hypothetical protein, partial [Streptomyces violaceusniger].                                                      <-503809945_?<-503809946_?<-503809947_?<-503809948_?<-503809949_?||503809950_?->503809951_?->759522371_ParB-HTH*->759522303_?-><-759522305_?<-503809955_?<-759522373_?||503809958_?->503809959_?->759522375_?->
      # 2;                                                                                                                                                                                                                                                           
      218762801    <-MU-transposase<-ParB-HTH*<-?<-?||?-><-?||?->HISKIN->                                                                                                                                        ParB-HTH                    -                         Dalk_3579                224  bacteria>proteobacteria>deltaproteobacteria  Desulfatibacillum alkenivorans AK-01                    hypothetical protein Dalk_3579 [Desulfatibacillum alkenivorans AK-01].                                            <-218762794_?<-218762795_?<-218762796_?<-218762797_?<-218762798_?<-218762799_?<-218762800_MU-transposase<-218762801_ParB-HTH*<-218762802_?<-218762803_?||218762804_?-><-218762805_?||218762806_?->218762807_HISKIN-><-218762808_?
      654862385    <-P-loop<-?<-ParB-HTH*                                                                                                                                                                        ParB-HTH                    -                         G491_RS0111365           223  bacteria>proteobacteria>deltaproteobacteria  Desulfatibacillum aliphaticivorans                      hypothetical protein [Desulfatibacillum aliphaticivorans].                                                        <-654862378_?<-654862379_?<-654862380_?<-737234295_?<-654862382_?<-654862383_P-loop<-654862384_?<-654862385_ParB-HTH*||654862386_?-><-737234244_?<-737234296_?||654862387_?->654862388_?->654862389_?->654862390_?->
      # 2;                                                                                                                                                                                                                                                           
      523467872    DnaJ->?->?->?-><-?<-?<-?<-ParB-HTH*                                                                                                                                                           ParB-HTH                    -                         dsmv_2585                201  bacteria>proteobacteria>deltaproteobacteria  Desulfococcus multivorans DSM 2059                      hypothetical protein dsmv_2585 [Desulfococcus multivorans DSM 2059].                                              523467865_DnaJ->523467866_?->523467867_?->523467868_?-><-523467869_?<-523467870_?<-523467871_?<-523467872_ParB-HTH*<-523467873_?<-523467874_?<-523467875_?<-523467876_?<-523467877_?<-523467878_?<-523467879_?
      750110637    DnaJ->?->?->?-><-?<-?<-?<-ParB-HTH*<-?<-ParB-HTH                                                                                                                                              ParB-HTH                    -                         DSMV_RS10090             198  bacteria>proteobacteria>deltaproteobacteria  Desulfococcus multivorans                               hypothetical protein [Desulfococcus multivorans].                                                                 527025605_DnaJ->527025606_?->527025607_?->527025608_?-><-527025609_?<-750110705_?<-527025611_?<-750110637_ParB-HTH*<-527025613_?<-750110641_ParB-HTH<-527025615_?<-527025616_?<-527025617_?<-750110707_?<-527025619_?
      # 2;                                                                                                                                                                                                                                                           
      759540387    ParA->ParB-HTH*->?->?->TrwC->                                                                                                                                                                 ParB-HTH                    -                         SSP08S_RS57300           154  bacteria>actinobacteria                      Streptomyces sp. R1-NS-10                               hypothetical protein, partial [Streptomyces sp. R1-NS-10].                                                        517904201_?->517904202_?->517904203_?->759540386_?->517904205_?->517904206_?->517904207_ParA->759540387_ParB-HTH*->759540388_?->517904210_?->759540389_TrwC->517904212_?-><-517904213_?||648487654_?->517904215_?->
      759768796    SFII-helicase-><-?<-ParB-HTH*<-ParA                                                                                                                                                           ParB-HTH                    Ndufs5                    BI06_RS43330             143  bacteria>actinobacteria                      Kitasatospora sp. MBT66                                 hypothetical protein, partial [Kitasatospora sp. MBT66].                                                          759768792_SFII-helicase-><-759768475_?<-759768796_ParB-HTH*<-759768805_ParA||759756574_?-><-759768478_?||759768481_?-><-759752933_?||759753483_?->759752930_?->
      # 2;                                                                                                                                                                                                                                                           
      748137595    RecT->HTH->HTH->ParB-HTH*->ParB-HTH+Prok-TUDOR-><-HU-IHF                                                                                                                                      ParB-HTH                    -                         QH73_RS16185             146  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 <-748137493_?<-748137494_?<-748137495_?||748137496_?->748137497_RecT->748137498_HTH->748137499_HTH->748137595_ParB-HTH*->748137500_ParB-HTH+Prok-TUDOR-><-748137501_HU-IHF||748137502_?-><-748137503_?<-748137504_?||748137505_?->748137596_?->
      748136711    HU-IHF->?->?-><-ParB-HTH*<-HTH<-RecT                                                                                                                                                          ParB-HTH                    -                         QH73_RS11420             127  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 <-748136710_?<-748136545_?<-748136546_?||748136547_?->748136548_HU-IHF->748136549_?->748136550_?-><-748136711_ParB-HTH*<-748136551_HTH<-748136712_RecT<-748136552_?||748136553_?-><-748136554_?||748136555_?-><-748136556_?
      # 2;                                                                                                                                                                                                                                                           
      67856109     <-RVT+HNH<-?<-DDE_Tnp_1                                                                                                                                                                       -                           SP                        CwatDRAFT_4471           138  bacteria>cyanobacteria                       Crocosphaera watsonii WH 8501                           hypothetical protein CwatDRAFT_4471 [Crocosphaera watsonii WH 8501].                                              67856105_?-><-67856266_RVT+HNH<-67856265_?<-67856264_DDE_Tnp_1||67856106_?->67856107_?->67856108_?->67856109_?*->67856110_?->67856111_?->67856112_?->67856113_?->67856114_?->67856115_?->67856116_?->
      757157310    <-RVT+HNH<-?||?-><-DDE_Tnp_1||?->?->?->ParB-HTH*->                                                                                                                                            ParB-HTH                    -                         CWATDRAFT_RS09845        114  bacteria>cyanobacteria                       Crocosphaera watsonii                                   hypothetical protein [Crocosphaera watsonii].                                                                     <-757157323_RVT+HNH<-494515558_?||757157308_?-><-494514515_DDE_Tnp_1||757157309_?->757157324_?->494515407_?->757157310_ParB-HTH*->757157311_?->757157325_?->494515410_?->494515411_?->494515412_?->494515413_?->494515414_?->
      # 2;                                                                                                                                                                                                                                                           
      158310190    ParB-HTH*->?->?->?->?->?->?->HU-IHF->                                                                                                                                                         ParB-HTH                    -                         AM1_B0079                138  bacteria>cyanobacteria                       Acaryochloris marina MBIC11017                          hypothetical protein AM1_B0079 (plasmid) [Acaryochloris marina MBIC11017].                                        158310183_?-><-158310184_?<-158310185_?<-158310186_?||158310187_?-><-158310188_?<-158310189_?||158310190_ParB-HTH*->158310191_?->158310192_?->158310193_?->158310194_?->158310195_?->158310196_?->158310197_HU-IHF->
      753958401    ParB-HTH*->?->?->?->?->?->HU-IHF->                                                                                                                                                            ParB-HTH                    -                         AM1_RS30720              134  bacteria>cyanobacteria                       Acaryochloris marina                                    hypothetical protein [Acaryochloris marina].                                                                      753958398_?->753958491_?-><-501117690_?<-501117691_?<-501117692_?<-753958400_?<-501117695_?||753958401_ParB-HTH*->753958494_?->501117699_?->501117700_?->501117701_?->501117702_?->501117703_HU-IHF->501117704_?->
      # 2;                                                                                                                                                                                                                                                           
      119462359    <-ParB-HTH+Prok-TUDOR*<-ParB-HTH<-?||DCM->                                                                                                                                                    ParB-HTH+Prok-TUDOR         -                         N9414_06389              116  bacteria>cyanobacteria                       Nodularia spumigena CCY9414                             hypothetical protein N9414_06389 [Nodularia spumigena CCY9414].                                                   <-119462352_?<-119462353_?<-119462354_?<-119462355_?<-119462356_?||119462357_?-><-119462358_?<-119462359_ParB-HTH+Prok-TUDOR*<-119462360_ParB-HTH<-119462361_?||119462362_DCM->119462363_?-><-119462364_?||119462365_?-><-119462366_?
      585121647    <-ParB-HTH+Prok-TUDOR*<-ParB-HTH<-?||?->DCM->                                                                                                                                                 ParB-HTH+Prok-TUDOR         -                         NSP_22570                114  bacteria>cyanobacteria                       Nodularia spumigena CCY9414                             Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Nodularia spumigena CCY9414].    <-585121640_?<-585121641_?<-585121642_?<-585121643_?<-585121644_?||585121645_?-><-585121646_?<-585121647_ParB-HTH+Prok-TUDOR*<-585121648_ParB-HTH<-585121649_?||585121650_?->585121651_DCM->585121652_?-><-585121653_?<-585121654_?
      # 1;                                                                                                                                                                                                                                                           
      306986606    DCM+ParB-HTH+Prok-TUDOR*->                                                                                                                                                                    DCM+ParB-HTH+Prok-TUDOR     DNA_methylase             Cyan7822_6833            645  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 DNA-cytosine methyltransferase (plasmid) [Cyanothece sp. PCC 7822].                                               306986599_?-><-306986600_?<-306986601_?<-306986602_?||306986603_?-><-306986604_?<-306986605_?||306986606_DCM+ParB-HTH+Prok-TUDOR*-><-306986607_?||306986608_?->306986609_?->306986610_?-><-306986611_?<-306986612_?<-306986613_?
      543538779    <-ParB-HTH+Prok-TUDOR*<-?<-?||?->?->?-><-RecD<-RecD                                                                                                                                           ParB-HTH+Prok-TUDOR         -                         CWATWH0402_1321          560  bacteria>cyanobacteria                       Crocosphaera watsonii WH 0402                           hypothetical protein CWATWH0402_1321 [Crocosphaera watsonii WH 0402].                                             <-543538778_?<-543538779_ParB-HTH+Prok-TUDOR*<-543538780_?<-543538781_?||543538782_?->543538783_?->543538784_?-><-543538785_RecD<-543538786_RecD
      480702301    HNH->?->?->?->ParB-HTH*->                                                                                                                                                                     ParB-HTH                    DUF2360                   HMPREF1089_00435         551  bacteria>firmicutes                          [Clostridium] bolteae 90B3                              hypothetical protein HMPREF1089_00435 [[Clostridium] bolteae 90B3].                                               480702294_?->480702295_?->480702296_?->480702297_HNH->480702298_?->480702299_?->480702300_?->480702301_ParB-HTH*->480702302_?->480702303_?->480702304_?->480702305_?->480702306_?->480702307_?->480702308_?->
      510895729    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         C819_RS19820             527  bacteria>firmicutes                          Lachnospiraceae bacterium 10-1                          hypothetical protein [Lachnospiraceae bacterium 10-1].                                                            665905575_?->510895723_?->510895724_?->510895725_?-><-510895726_?||665905576_?->510895728_ParA->510895729_ParB-HTH*-><-665905577_?<-510895731_?||510895733_?->736104461_?->550997608_?->510895737_?->510895739_?->
      344043288    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         Strvi_0238               361  bacteria>actinobacteria                      Streptomyces violaceusniger Tu 4113                     hypothetical protein Strvi_0238 (plasmid) [Streptomyces violaceusniger Tu 4113].                                  <-344043281_?<-344043282_?<-344043283_?<-344043284_?<-344043285_?||344043286_?->344043287_?->344043288_ParB-HTH*->344043289_?->344043290_?-><-344043291_?<-344043292_?<-344043293_?||344043294_?->344043295_?->
      427992361    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    SP                        Pse7367_3831             353  bacteria>cyanobacteria                       Pseudanabaena sp. PCC 7367                              hypothetical protein Pse7367_3831 (plasmid) [Pseudanabaena sp. PCC 7367].                                         <-427992354_?<-427992355_?||427992356_?-><-427992357_?||427992358_?->427992359_?-><-427992360_?<-427992361_ParB-HTH*<-427992362_?||427992363_?->427992364_?->427992365_?->427992366_?->427992367_?-><-427992368_?
      386428626    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    Trp_dioxygenase+DUF488    BegalDRAFT_1574          337  bacteria>proteobacteria>gammaproteobacteria  Beggiatoa alba B18LD                                    hypothetical protein BegalDRAFT_1574 [Beggiatoa alba B18LD].                                                      386428619_?->386428620_?->386428621_?-><-386428622_?<-386428623_?||386428624_?-><-386428625_?||386428626_ParB-HTH*->386428627_?-><-386428628_?||386428629_?->386428630_?->386428631_?->386428632_?->386428633_?->
      787066260    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         BAR36866.1               331  viruses                                      uncultured Mediterranean phage uvMED                    unnamed protein product [uncultured Mediterranean phage uvMED].                                                   <-787066253_?<-787066254_?<-787066255_?<-787066256_?||787066257_?->787066258_?->787066259_?->787066260_ParB-HTH*->787066261_?->787066262_?->787066263_?->787066264_?->787066265_?->787066266_?-><-787066267_?
      521992951    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         A39O_RS0108365           328  bacteria>proteobacteria>gammaproteobacteria  Lamprocystis purpurea                                   hypothetical protein [Lamprocystis purpurea].                                                                     750223431_?->521992951_ParB-HTH*->521992952_?->750223432_?->
      749286507    <-ParB-HTH*<-ParA||NLPC->                                                                                                                                                                     ParB-HTH                    -                         NS07_v2contig00189-0005  328  bacteria>actinobacteria                      Nocardia seriolae                                       hypothetical protein NS07_v2contig00189-0005 [Nocardia seriolae].                                                 749286503_?->749286504_?->749286505_?->749286506_?-><-749286507_ParB-HTH*<-749286508_ParA||749286509_NLPC-><-749286510_?||749286511_?-><-749286512_?||749286513_?->
      428272094    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    SP                        Sta7437_4575             326  bacteria>cyanobacteria                       Stanieria cyanosphaera PCC 7437                         hypothetical protein Sta7437_4575 (plasmid) [Stanieria cyanosphaera PCC 7437].                                    428272087_?->428272088_?-><-428272089_?||428272090_?->428272091_?->428272092_?->428272093_?-><-428272094_ParB-HTH*||428272095_?-><-428272096_?<-428272097_?<-428272098_?<-428272099_?<-428272100_?<-428272101_?
      291541580    ParB-HTH*->?->?->SFII-helicase->                                                                                                                                                              ParB-HTH                    AAA_13                    RBR_02590                325  bacteria>firmicutes                          Ruminococcus bromii L2-63                               hypothetical protein RBR_02590 [Ruminococcus bromii L2-63].                                                       291541573_?->291541574_?->291541575_?->291541576_?->291541577_?->291541578_?->291541579_?->291541580_ParB-HTH*->291541581_?->291541582_?->291541583_SFII-helicase->291541584_?->291541585_?->291541586_?->291541587_?->
      759552136    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         RZ84_RS34725             317  bacteria>actinobacteria                      Streptomyces sp. CT34                                   hypothetical protein [Streptomyces sp. CT34].                                                                     <-759552125_?<-759552181_?<-759552127_?||759552183_?->759552129_?->759552131_?->759552134_?->759552136_ParB-HTH*-><-759552138_?||759552140_?->759552185_?->759552142_?->759552144_?->759552147_?->759552150_?->
      727525039    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         AA75_RS07430             309  bacteria>actinobacteria                      Kitasatospora sp. MBT63                                 hypothetical protein [Kitasatospora sp. MBT63].                                                                   727525073_?->727525034_ParA->727525039_ParB-HTH*->727525044_?-><-727525078_?||727525047_?->727525050_?->727525055_?->727525081_?->727525058_?->
      787047096    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         BAR21317.1               309  viruses                                      uncultured Mediterranean phage uvMED                    unnamed protein product [uncultured Mediterranean phage uvMED].                                                   787047089_?->787047090_?->787047091_?->787047092_?->787047093_?->787047094_?->787047095_?->787047096_ParB-HTH*->787047097_?->787047098_?->787047099_?->787047100_?->787047101_?->787047102_?-><-787047103_?
      759952224    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         IO39_RS42050             298  bacteria>actinobacteria                      Nonomuraea candida                                      hypothetical protein [Nonomuraea candida].                                                                        759952206_?->759952209_?-><-759952212_?<-759952215_?||759952296_?->759952218_?->759952221_ParA->759952224_ParB-HTH*-><-759952228_?<-759952230_?<-759952232_?<-759952237_?<-759952240_?||759952243_?-><-759952298_?
      690403288    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         pSLA2-S.15               294  bacteria>actinobacteria                      Streptomyces rochei                                     hypothetical protein [Streptomyces rochei].                                                                       689922287_?->689922288_?->689922289_?-><-689922290_?||689922291_?-><-689922292_?||689922293_?-><-690403288_ParB-HTH*<-689922295_ParA<-689922296_?||689922297_?-><-689922298_?||689922299_?->689922300_?->689922301_?->
      493426429    <-ParB-HTH*<-ParA<-RNAse_T                                                                                                                                                                    ParB-HTH                    -                         STRTUCAR8_RS40225        291  bacteria>actinobacteria                      Streptomyces turgidiscabies                             hypothetical protein [Streptomyces turgidiscabies].                                                               764831940_?->764831942_?->764831945_?->493426431_?->493426426_?->764831947_?-><-493426429_ParB-HTH*<-764831949_ParA<-764831951_RNAse_T
      503099618    NACHT-><-?||?->?-><-ParB-HTH+Prok-TUDOR*                                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         CYAN7822_RS28405         291  bacteria>cyanobacteria                       Cyanothece sp. PCC 7822                                 hypothetical protein [Cyanothece sp. PCC 7822].                                                                   503099611_?-><-754535247_?<-754535459_?||754535462_NACHT-><-754535465_?||503099616_?->503099617_?-><-503099618_ParB-HTH+Prok-TUDOR*||754535249_?->503099620_?->754535467_?->754535470_?->754535252_?->503099623_?->503099625_?->
      497670003    ParB-HTH*->?->?->?->SFII-helicase->                                                                                                                                                           ParB-HTH                    V_ATPase_I                RUMFLAFD1_RS0105685      290  bacteria>firmicutes                          Ruminococcus flavefaciens                               hypothetical protein [Ruminococcus flavefaciens].                                                                 497669993_?->497669994_?->497669996_?->497669998_?->497669999_?->739421887_?->497670001_?->497670003_ParB-HTH*->497670004_?->739421889_?->739421891_?->497670009_SFII-helicase->497670011_?->497670013_?->497670015_?->
      738539870    <-ParA<-?<-?||?->HNH-><-ParB-HTH+Prok-TUDOR*||?-><-ParB-HTH+Prok-TUDOR<-?<-?||XerD->                                                                                                          ParB-HTH+Prok-TUDOR         -                         KV40_RS28175             287  bacteria>cyanobacteria                       Myxosarcina sp. GI1                                     hypothetical protein [Myxosarcina sp. GI1].                                                                       <-738539857_?<-738539859_?<-738539861_ParA<-738539864_?<-738539995_?||738540017_?->738539867_HNH-><-738539870_ParB-HTH+Prok-TUDOR*||738539872_?-><-738539875_ParB-HTH+Prok-TUDOR<-738539878_?<-738540021_?||738540025_XerD-><-738539881_?<-738539884_?
      739811243    <-TrwC||?->?-><-ParB-HTH*                                                                                                                                                                     ParB-HTH                    -                         IF34_RS35665             286  bacteria>actinobacteria                      Streptomyces griseofuscus                               hypothetical protein [Streptomyces griseofuscus].                                                                 <-739811291_?<-739811292_?<-739811236_?||739811293_?-><-739811294_TrwC||739811295_?->739811240_?-><-739811243_ParB-HTH*<-739811297_?<-739811299_?<-739811247_?<-739811301_?<-739811248_?<-739811250_?<-739811253_?
      506264213    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         CYAN8802_RS10090         284  bacteria>cyanobacteria                       Cyanothece sp. PCC 8802                                 hypothetical protein [Cyanothece sp. PCC 8802].                                                                   <-506264220_?<-506264219_?||752567501_?-><-506264217_?<-506264216_?<-752567744_?||506264214_?->506264213_ParB-HTH*->506264212_?->506264211_?->506264210_?->752567742_?->506265425_?-><-506265426_?<-506265427_?
      664375955    ParA?->ParB-HTH*-><-?<-?<-?<-?<-ASCH                                                                                                                                                          ParB-HTH                    -                         IG86_RS0137215           282  bacteria>actinobacteria                      Streptomyces virginiae                                  hypothetical protein [Streptomyces virginiae].                                                                    664375953_ParA?->664375955_ParB-HTH*-><-664375958_?<-664375960_?<-664375962_?<-664375964_?<-664375967_ASCH<-664375970_?<-664375973_?
      502659622    ParA->ParB-HTH*->                                                                                                                                                                             ParB-HTH                    -                         SROS_RS45435             275  bacteria>actinobacteria                      Streptosporangium roseum                                hypothetical protein [Streptosporangium roseum].                                                                  <-502659614_?<-502659615_?<-759974859_?||502659617_?-><-759974841_?||502659620_?->759974844_ParA->502659622_ParB-HTH*-><-759974847_?||502659624_?-><-502659625_?<-502659626_?<-759974862_?||502659628_?->502659629_?->
      663245548    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         IE98_RS0140635           274  bacteria>actinobacteria                      Streptomyces sp. NRRL B-24484                           hypothetical protein [Streptomyces sp. NRRL B-24484].                                                             663245517_?->663245521_?->663245524_?->759454963_?->663245536_?-><-663245541_?<-663245545_?<-663245548_ParB-HTH*<-663245553_ParA
      748135960    ParB-HTH+Prok-TUDOR*->?->HISKIN->HISKIN->                                                                                                                                                     ParB-HTH+Prok-TUDOR         -                         QH73_RS08030             272  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 748135956_?->748135904_?->748135905_?->748135957_?-><-748135906_?<-748135958_?||748135959_?->748135960_ParB-HTH+Prok-TUDOR*->748135961_?->748135907_HISKIN->748135908_HISKIN->748135909_?-><-748135910_?||748135962_?-><-748135963_?
      518317229    <-HISKIN<-?<-?<-?<-?<-ParB-HTH*                                                                                                                                                               ParB-HTH                    -                         OSCIL6407_RS0116325      270  bacteria>cyanobacteria                       Kamptonema formosum                                     hypothetical protein [Kamptonema formosum].                                                                       <-494599497_?<-494599498_?<-494599499_HISKIN<-494599904_?<-518317226_?<-518317227_?<-518317228_?<-518317229_ParB-HTH*<-518317230_?<-518317231_?<-750368297_?||518317233_?->518317234_?->518317235_?->518317236_?->
      505048307    <-ParB-HTH*<-?<-?<-?<-RNAse_T                                                                                                                                                                 ParB-HTH                    -                         DEIPE_RS07610            269  bacteria>deinococci                          Deinococcus peraridilitoris                             hypothetical protein [Deinococcus peraridilitoris].                                                               <-505048301_?<-505048302_?<-505048303_?<-505048304_?<-505048305_?<-752559530_?<-505048306_?<-505048307_ParB-HTH*<-505048308_?<-505048309_?<-505048310_?<-752560050_RNAse_T<-505048312_?<-505048313_?<-752560051_?
      427376379    P-loop->?->?->?->?->ParB-HTH*->                                                                                                                                                               ParB-HTH                    -                         Syn6312_1142             266  bacteria>cyanobacteria                       Synechococcus sp. PCC 6312                              hypothetical protein Syn6312_1142 [Synechococcus sp. PCC 6312].                                                   427376372_?->427376373_?->427376374_P-loop->427376375_?->427376376_?->427376377_?->427376378_?->427376379_ParB-HTH*->427376380_?->427376381_?->427376382_?->427376383_?->427376384_?->427376385_?->427376386_?->
      504973660    ParB-HTH*->?-><-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon                                                                                                                                      ParB-HTH                    MatP                      266  bacteria>cyanobacteria                       Chamaesiphon minutus                                    hypothetical protein [Chamaesiphon minutus].                                                                      <-504973653_?<-753793967_?<-504973656_?<-753792584_?||753793968_?-><-504973658_?<-753792586_?||504973660_ParB-HTH*->504973661_?-><-753793138_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-504971468_?||753793969_?->504973664_?->753792588_?-><-504973666_?
      387857486    <-Relaxase<-?||?->?->?-><-ParB-HTH*<-ParA                                                                                                                                                     ParB-HTH                    -                         Emtol_0315               260  bacteria>bacteroidetes                       Emticicia oligotrophica DSM 17448                       hypothetical protein Emtol_0315 (plasmid) [Emticicia oligotrophica DSM 17448].                                    387857479_?-><-387857480_?<-387857481_Relaxase<-387857482_?||387857483_?->387857484_?->387857485_?-><-387857486_ParB-HTH*<-387857487_ParA||387857488_?->387857489_?->387857490_?->387857491_?->387857492_?->387857493_?->
      501453636    ParA->ParB-HTH*-><-?<-?<-?<-?||?-><-SFII-helicase                                                                                                                                             ParB-HTH                    SP                        260  bacteria>actinobacteria                      Streptomyces sp. FR1                                    hypothetical protein [Streptomyces sp. FR1].                                                                      190410681_?->190410682_?->190410683_?->190410684_?->190410685_?->190410686_?->190410687_ParA->501453636_ParB-HTH*-><-190410689_?<-190410690_?<-190410691_?<-190410692_?||190410693_?-><-190410694_SFII-helicase<-190410695_?
      576415619    <-ParB-HTH+ParB*<-?||DDE_Tnp_1_2->DDE_Tnp_1_2->                                                                                                                                               ParB-HTH+ParB               ParBc                     I546_4173                257  bacteria>actinobacteria                      Mycobacterium kansasii 732                              parB-like nuclease domain protein [Mycobacterium kansasii 732].                                                   <-576415477_?<-576415788_?<-576415759_?<-576415637_?<-576415706_?<-576415701_?<-576415774_?<-576415619_ParB-HTH+ParB*<-576415492_?||576415657_DDE_Tnp_1_2->576415496_DDE_Tnp_1_2-><-576415798_?<-576415497_?||576415587_?->576415793_?->
      390172790    <-ParB-HTH+ParB-HTH*                                                                                                                                                                          ParB-HTH+ParB-HTH           SP                        NITHO_3110002            254  bacteria>chloroflexi                         Nitrolancea hollandica Lb                               hypothetical protein NITHO_3110002 [Nitrolancea hollandica Lb].                                                   390172789_?-><-390172790_ParB-HTH+ParB-HTH*||390172791_?-><-390172792_?||390172793_?-><-390172794_?<-390172795_?<-390172796_?<-390172797_?
      503393144    <-ParB<-?||ParB-HTH*-><-?||DDE_Tnp_1_2->                                                                                                                                                      ParB-HTH                    -                         246  bacteria>planctomycetes                      Planctomyces brasiliensis                               hypothetical protein [Planctomyces brasiliensis].                                                                 <-752752012_?<-503393138_?<-752752013_?<-503393140_?<-503393141_?<-503393142_ParB<-503393143_?||503393144_ParB-HTH*-><-752750814_?||752751618_DDE_Tnp_1_2->752750816_?->752750818_?->503393146_?-><-503393147_?<-752750820_?
      652339044    <-ParB-HTH*<-?<-?<-?<-?<-?||?-><-TPR+CASPASE                                                                                                                                                  ParB-HTH                    -                         240  bacteria>cyanobacteria                       Fischerella sp. PCC 9605                                hypothetical protein [Fischerella sp. PCC 9605].                                                                  737153835_?->737153754_?->652339039_?->652339040_?-><-652339041_?||652339042_?-><-652339043_?<-652339044_ParB-HTH*<-652339045_?<-737153837_?<-652339047_?<-652339048_?<-652339049_?||737153839_?-><-652339052_TPR+CASPASE
      373100107    ABC->                                                                                                                                                                                         -                           -                         OR16_21363               235  bacteria>proteobacteria>betaproteobacteria   Cupriavidus basilensis OR16                             hypothetical protein OR16_21363 [Cupriavidus basilensis OR16].                                                    <-373100100_?<-373100101_?||373100102_?-><-373100103_?||373100104_?->373100105_?->373100106_?-><-373100107_?*<-373100108_?||373100109_?->373100110_?->373100111_?->373100112_ABC->373100113_?->373100114_?->
      522821476    HISKIN-><-ParB-HTH*                                                                                                                                                                           ParB-HTH                    -                         228  bacteria>proteobacteria>deltaproteobacteria  Sorangium cellulosum                                    hypothetical protein [Sorangium cellulosum].                                                                      <-769244403_?||522821470_?->522821471_?->769244405_?-><-769244406_?<-522821474_?||522821475_HISKIN-><-522821476_ParB-HTH*||769241447_?->522821478_?->769244407_?->522821482_?-><-522821483_?<-522821484_?<-522821485_?
      223896866    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         Cflav_PD5941             212  bacteria>verrucomicrobia                     Pedosphaera parvula Ellin514                            hypothetical protein Cflav_PD5941 [Pedosphaera parvula Ellin514].                                                 223896859_?-><-223896860_?<-223896861_?||223896862_?->223896863_?->223896864_?->223896865_?->223896866_ParB-HTH*->223896867_?->223896868_?-><-223896869_?<-223896870_?<-223896871_?<-223896872_?<-223896873_?
      522187926    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         BN60_RS12900             210  bacteria>proteobacteria>alphaproteobacteria  Reyranella massiliensis                                 hypothetical protein [Reyranella massiliensis].                                                                   <-522187918_?||759564486_?->522187920_?-><-522187921_?<-522187923_?<-759566104_?<-522187925_?<-522187926_ParB-HTH*<-522187927_?<-522187928_?<-522187929_?||522187930_?-><-522187931_?<-522187932_?<-522187933_?
      494604192    ParB-HTH*->ParB->                                                                                                                                                                             ParB-HTH                    -                         OPIT1DRAFT_RS19430       209  bacteria>verrucomicrobia                     Opitutaceae bacterium TAV1                              hypothetical protein [Opitutaceae bacterium TAV1].                                                                <-494604186_?<-494604187_?<-494604188_?<-494604189_?||759401514_?-><-759401518_?||494604191_?->494604192_ParB-HTH*->759403425_ParB->494604194_?->494604195_?->494604196_?->494604197_?->759401523_?->759401526_?->
      748137500    RecT->HTH->HTH->ParB-HTH->ParB-HTH+Prok-TUDOR*-><-HU-IHF                                                                                                                                      ParB-HTH+Prok-TUDOR         -                         QH73_RS16190             174  bacteria>cyanobacteria                       Scytonema millei                                        hypothetical protein, partial [Scytonema millei].                                                                 <-748137494_?<-748137495_?||748137496_?->748137497_RecT->748137498_HTH->748137499_HTH->748137595_ParB-HTH->748137500_ParB-HTH+Prok-TUDOR*-><-748137501_HU-IHF||748137502_?-><-748137503_?<-748137504_?||748137505_?->748137596_?->748137597_?->
      488660258    ParB-HTH*->?->?->?->?-><-?<-?<-DDE                                                                                                                                                            ParB-HTH                    -                         HMPREF1086_RS21385       152  bacteria>firmicutes                          [Clostridium] clostridioforme                           hypothetical protein [[Clostridium] clostridioforme].                                                             488660264_?->488660263_?->488660262_?->488660261_?->488660260_?->488633868_?->488660259_?->488660258_ParB-HTH*->488660257_?->488660256_?->488660255_?->488660254_?-><-488663774_?<-488648570_?<-488648571_DDE
      390173641    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    SP                        NITHO_2240002            135  bacteria>chloroflexi                         Nitrolancea hollandica Lb                               hypothetical protein NITHO_2240002 [Nitrolancea hollandica Lb].                                                   390173640_?-><-390173641_ParB-HTH*||390173642_?->390173643_?->390173644_?-><-390173645_?<-390173646_?||390173647_?-><-390173648_?
      663210974    <-ParB-HTH*<-ParA                                                                                                                                                                             ParB-HTH                    -                         IE97_RS0131565           134  bacteria>actinobacteria                      Streptomyces sp. NRRL S-455                             hypothetical protein, partial [Streptomyces sp. NRRL S-455].                                                      663210936_?->663210951_?->663210955_?->663210959_?->663210964_?->663210968_?->663210971_?-><-663210974_ParB-HTH*<-739995163_ParA||663210981_?->663210985_?->663210989_?->663210991_?->663210994_?->663210996_?->
      760128192    <-MU-transposase<-ParB-HTH*<-?<-?||?-><-?||?->HISKIN->                                                                                                                                        ParB-HTH                    -                         DALK_RS18450             128  bacteria>proteobacteria>deltaproteobacteria  Desulfatibacillum alkenivorans                          hypothetical protein, partial [Desulfatibacillum alkenivorans].                                                   <-506428598_?<-760126304_?<-760128188_?<-506428601_?<-506428602_?<-506428603_?<-760128190_MU-transposase<-760128192_ParB-HTH*<-760126305_?<-506428607_?||506428608_?-><-506428609_?||506428610_?->760128194_HISKIN-><-506428612_?
      308205593    ParB-HTH+Prok-TUDOR*->                                                                                                                                                                        ParB-HTH+Prok-TUDOR         -                         Nfla_3501                127  bacteria>cyanobacteria                       Nostoc flagelliforme str. Sunitezuoqi                   hypothetical protein Nfla_3501 [Nostoc flagelliforme str. Sunitezuoqi].                                           308205593_ParB-HTH+Prok-TUDOR*->
      501377550    ParB-HTH*->                                                                                                                                                                                   ParB-HTH                    -                         NPUN_RS12905             107  bacteria>cyanobacteria                       Nostoc punctiforme                                      hypothetical protein [Nostoc punctiforme].                                                                        <-501377544_?<-753809726_?<-501377546_?||753810489_?-><-501377548_?||501377549_?->753809727_?->501377550_ParB-HTH*->753809728_?->753810490_?->501377553_?->501377554_?->753809729_?->501377555_?->753810491_?->
      752563686    HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->ParB-HTH*->                                                                                                                                         ParB-HTH                    -                         CYAST_RS12110            107  bacteria>cyanobacteria                       Cyanobacterium stanieri                                 hypothetical protein, partial [Cyanobacterium stanieri].                                                          505036590_?->505036591_?->505036592_?-><-505036593_?||505036594_?->505036595_?->505036023_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->752563686_ParB-HTH*->752563385_?-><-505036596_?||505036597_?->505036598_?->505036599_?->752563687_?-><-505036601_?
      571784120    <-ParB-HTH*<-zf-CHC2                                                                                                                                                                          ParB-HTH                    -                         OMM_12844                96   bacteria>proteobacteria>deltaproteobacteria  Candidatus Magnetoglobus multicellularis str. Araruama  hypothetical protein OMM_12844, partial [Candidatus Magnetoglobus multicellularis str. Araruama].                 <-571784120_ParB-HTH*<-571784121_zf-CHC2<-571784122_?
      648257288    <-ParB-HTH*                                                                                                                                                                                   ParB-HTH                    -                         RH99_RS08025             90   bacteria>firmicutes                          Clostridium arbusti                                     hypothetical protein [Clostridium arbusti].                                                                       <-497922162_?<-497922164_?<-497922165_?<-497922167_?<-497922169_?<-497922171_?<-497922174_?<-648257288_ParB-HTH*<-497922180_?<-648257289_?<-648257290_?<-497922182_?<-497922185_?<-497922187_?<-497922189_?
      543531433    <-ParB-HTH*<-XerD<-PriCT_2<-DDE_Tnp_ISAZ013<-DDE_Tnp_ISAZ013                                                                                                                                  ParB-HTH                    -                         CWATWH0402_4419          68   bacteria>cyanobacteria                       Crocosphaera watsonii WH 0402                           hypothetical protein CWATWH0402_4419, partial [Crocosphaera watsonii WH 0402].                                    <-543531433_ParB-HTH*<-543531434_XerD<-543531435_PriCT_2<-543531436_DDE_Tnp_ISAZ013<-543531437_DDE_Tnp_ISAZ013
      52696830     <-ParB-HTH*<-ParB-HTH                                                                                                                                                                         ParB-HTH                    -                         BGP305                   50   bacteria>spirochaetes                        Borrelia garinii PBi                                    hypothetical protein BGP305 [Borrelia garinii PBi].                                                               <-52696828_?<-52696829_?<-52696830_ParB-HTH*<-52696831_ParB-HTH<-52696832_?<-52696833_?
      
      Back to Contents
    • Multiple sequence alignment of the Tudor-like SH3 domain associated with the ParB-like HTH domain

      ALIGN                                                           ---EEEE---------------------EEEEEEE--------------EEEEEE---E--------EEEEEE----HH-HHHHHH-----
      HMM                                                             -HHHHHHHH---------------EE-EEEEEEEE-------E-----EEEEEEE-------------EEHHHH-HHHH------------
      FREQ                                                            --EEEEE---------------------EEEEEEE--------------EEEEEE---E-E-----EEEEEEE----HH-HHHHHHH----
      PSSM                                                            -HHHHHHH---------------------EEEEEE--------------EEEEEE------------EEEEEE----EE-EEE--------
      FINAL                                                           -HHHHHHH--------------------EEEEEEE-------------EEEEEEE---E--------EEEEEE-----E-EEHHHH-----
      CAL7103_RS0100030_Calothrix_sp_PCC_7103_737187200               KDIVDKIRERTKLP--------NPYRVGEVCLILPK-DNPD-LRGKSGCWCVVTH---V-G--D--FSCTIDTW-DNEY-TVKIEHLKSLE
      CAL7103_RS0150440_Calothrix_sp_PCC_7103_518327692               KDIVNRIRERTKLP--------NPHRVGEVCMILPK-DNPD-LRGKSGFWCVIVG---V-G--D--FSCTVETW-DGEY-TVKIEHLKSLE
      SD81_RS35605_Tolypothrix_campylonemoides_751574024              KDIVQRIRERTKVP--------NPYQVGEVCRILPK-DNPE-LKGKSGCWCIVTY---V-A--D--YSCTVTTW-DCEY-VVKLEHLKSLD
      CYLST_RS31010_Cylindrospermum_stagnale_505141377                KSIVDKIRERTKLP--------NPYRLGEVCQILPK-DNPE-LKGKSGCWGIVTH---L-G--D--YSCTITTW-DGEY-TVKIENLKSLE
      CWATWH0402_1321_Crocosphaera_watsonii_WH_0402_543538779         KSIVDQIRERTPVP--------NPWRKGEVAMIMVK-DNPD-LRGKGGCWCVISE---V-H--N--FTCTVRLW-DGEY-QVKPENLKELP
      CWATWH0402_RS27635_Crocosphaera_watsonii_737861903              KSIVDQIRERNPVP--------NPWRVNEVAMIMVK-DNPE-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGDY-QVKPENLKELP
      CWATWH0402_1907_Crocosphaera_watsonii_WH_0402_543531309         KNIVDQIRERSPVP--------NPWRKGEVCMILVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVSLW-DGDY-QVKPENLKELP
      _Crocosphaera_watsonii_494523440                                KSIVDQIRERNPVP--------NPWRVNEVAMIMVK-DNPE-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGDY-QVKPENLKELP
      CWATWH0003_RS12720_Crocosphaera_watsonii_737859551              KSIVSQIRERNPIP--------NPWRKGEVAMIIVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGNY-QVKPENLKDLP
      _Crocosphaera_watsonii_546230520                                KSIVSQIRERNPIP--------NPWRKGEVAMIIVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGNY-QVKPENLKDLP
      CWATWH0003_2673b1_Crocosphaera_watsonii_WH_0003_357263649       KSIVSQIRERNPIP--------NPWRKGEVAMIIVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGNY-QVKPENLKDLP
      NPUN_RS34540_Nostoc_punctiforme_501381481                       KDVVQRIMERTQVP--------NTYQIGEVCQILAK-DNPE-LRGKAGCWGIVNH---V-G--E--FSCTVKTW-DGEY-TVGLQHLKSFN
      FIS9431_RS33115_Fischerella_sp_PCC_9431_737134277               KDIVQRIMERTKVT--------NPYRVGEICQIIAK-DNPE-LRGKGGCWCIVNK---V-N--D--FSCTVIAW-DKEY-TMRIEHLKSLD
      STA7437_RS23975_Stanieria_cyanosphaera_753865019                RDLVQRIKEKTKVP--------NPYRVGEVCQIVAK-DNLE-LRGKGGCWCIVSA---V-H--D--FSCTVNTW-DCEY-LVKLEHLKSLD
      FIS9431_RS31145_Fischerella_sp_PCC_9431_737132827               KDIVQRIMERTKVA--------NPYQLGEVCQIIAK-DNPE-LRGKGGCWCIVSQ---V-N--D--FSCTVTTW-DGEY-SIALQHLKSFD
      FDUTEX481_04300_Tolypothrix_sp_PCC_7601_407266570               QDIVDKIRERTKVP--------NPYRLGEVCTLLPK-DNPE-LKGRSGCWGVITH---V-G--D--YSCTLETW-DAEY-TVKIEHLKSLE
      FDUTEX481_RS32065_Tolypothrix_sp_PCC_7601_797212629             QDIVDKIRERTKVP--------NPYRLGEVCTLLPK-DNPE-LKGRSGCWGVITH---V-G--D--YSCTLETW-DAEY-TVKIEHLKSLE
      Sta7437_4876_Stanieria_cyanosphaera_PCC_7437_428272365          RDLVQRIKEKTKVP--------NPYRVGEVCQIVAK-DNLE-LRGKGGCWCIVSA---V-H--D--FSCTVNTW-DCEY-LVKLEHLKSLD
      DA73_0214905_Tolypothrix_bouteillei_VB521301_744450902          NDIVEQIMKRTKVP--------NPYQVGEVCQILPK-DNPE-LRGKSKCWCIVTE---V-N--D--FSCVVRAW-DGEY-VVKMDHLKSLD
      NPUN_RS35730_Nostoc_punctiforme_753811080                       KDVVQQIMERTKVP--------NTYQIGEVCQILAK-DNPE-LRGKGGCWGIVNH---V-G--E--FSCTIKIW-DGEY-TVGLQYLKSYN
      AVA_RS27595_Anabaena_variabilis_499635872                       KDVVQRIMESSKAP--------NPYRVGEVCQFVVK-DNPD-LRGMGSCWCIVTH---V-G--E--FSCTVTAW-NGEY-TVRTDHLKPMD
      Npun_BR102_Nostoc_punctiforme_PCC_73102_186469442               KDVVQQIMERTKVP--------NTYQIGEVCQILAK-DNPE-LRGKGGCWGIVNH---V-G--E--FSCTIKIW-DGEY-TVGLQYLKSYN
      _Microcystis_aeruginosa_763118064                               KDIVQRIRERTKIP--------IPYRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRTW-NGEY-TVREENLRDLQ
      DA73_0203765_Tolypothrix_bouteillei_VB521301_744452929          KDVVQRIMERTKVP--------NPYRVGEVCQILPK-DYPD-LRGKGKCWCIVSQ---V-N--D--LSCTVTAW-DGDY-IVKMDCLKSLD
      alr7299_Nostoc_sp_PCC_7120_17135837                             KDVVQRIMERTQVP--------NSYQIGEVCQILAK-DNPE-LRGKGGCWCIVVA---V-H--D--FSCTVRIW-DGEL-TAGLKHLKSFD
      MICAK_2860002_Microcystis_aeruginosa_PCC_9701_389882556         KDIVQRIRERTKIP--------IPYRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRTW-NGEY-TVREENLRDLQ
      _Nostoc_sp_PCC_7120_764953510                                   KDVVQRIMERTQVP--------NSYQIGEVCQILAK-DNPE-LRGKGGCWCIVVA---V-H--D--FSCTVRIW-DGEL-TAGLKHLKSFD
      NPUN_RS34090_Nostoc_punctiforme_501381405                       KDVVDRIRERTKVP--------NPYHIGEICILLPK-DDPD-LRGKAGYWGVVSH---V-G--D--YSCTVQTW-DGDY-TVKIEHLKLLE
      CAL7103_RS0120705_Calothrix_sp_PCC_7103_737188140               KDIVTRIRERTKLP--------NPYREGEICVLLPK-NNPD-LRGKSGCWGAITH---V-G--D--YSCTIETW-DGEY-TVKIQHLLSLN
      N44_RS02080_Microcystis_aeruginosa_779871805                    KDIVQRIRERTKIP--------IPHRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRAW-NGEY-TVREENLRDLQ
      PLEUR7319_RS0114150_Pleurocapsa_sp_PCC_7319_518335686           KDVVQRIKDKKRPP--------ITLRVGEVCFLIAK-DNPE-LRGKSGCWCIVSE---V-Y--K--FSCSVATW-DNEY-ILRPEHLQSLE
      N44_02315_Microcystis_aeruginosa_NIES-44_718251661              KDIVQRIRERTKIP--------IPHRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRAW-NGEY-TVREENLRDLQ
      CY51472DRAFT_RS0225515_Cyanothece_497232044                     QEVVQNMQQQKKIP--------NPWHVGEVAQIVVK-GNSD-LKGKGGCWAVIVA---V-N--D--FSCSVQLW-DGEY-QVKPENLKELP
      ANACY_RS28045_Anabaena_cylindrica_505030514                     KSIVDKLREKTNLP--------NPYYLGQVCQILPK-DIPE-LKGKNGCWGIITH---V-G--S--YSCRITTW-NGEY-LVKIENLKSLD
      ANA7108_RS0100620_Anabaena_sp_PCC_7108_515515560                KSIVDKLREKTNLP--------NPYYLGQVCQILPK-DIPE-IKGKNGCWGIITH---V-G--N--YSCTITTW-EGEY-LVKIENLKSLD
      FDUTEX481_RS10740_Tolypothrix_sp_PCC_7601_797208446             KDVVQRIMERTQVL--------NSYQLGEVCQILAK-DNPE-LRGKGGCWAIVAQ---V-N--N--FSCTVRNW-DGEL-TVGLKHLKSYE
      STA7437_RS22850_Stanieria_cyanosphaera_505024902                KDVVRRMKEKNSAP--------ISFRVGEVCQILAK-DNPE-LRGKSGCWCIVSE---V-Y--E--FSCLVDTW-NERY-LLRGENLSSLD
      AVA_RS26020_Anabaena_variabilis_499635567                       QDVIDRIRDRTSVP--------NPYQVGEICVLHPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTVKCW-DGDY-TAKVEHLKSLE
      PCC7120DELTA_RS29565_Nostoc_sp_PCC_7120_499309017               QDVIDRIRDRTSVP--------NPYQVGEICVLHPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTVKCW-DGDY-TAKVEHLKSLE
      CWATWH0401_RS11050_Crocosphaera_watsonii_494518295              RSIVDEIRERRPVP--------NPWRVGEVAQIIIK-GNPD-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGII-QVKLENLKDVY
      DA73_0201705_Tolypothrix_bouteillei_VB521301_744453553          KDIVEQIMERTKVP--------NPYQVGEVCQILPK-DNPE-LRGKSKCWCIVTE---V-N--D--FSCVVRAW-DGEY-VV---------
      cce_2332_Cyanothece_sp_ATCC_51142_171698701                     RSVVDEIRERRPVP--------NPWRVGEVAQIVMK-RNPE-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGII-QVKIENLKDVY
      CY51472DRAFT_RS0216845_Cyanothece_497230707                     RSVVDEIRERRPVP--------NPWRVGEVAQIVMK-RNPE-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGII-QVKIENLKDVY
      CY0110_RS20455_Cyanothece_sp_CCY0110_495552874                  ISVIDEIRERRPVP--------NPWRVGEVAQIVVK-KNPD-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGTI-QVKIENLKDVY
      XEN7305_RS25800_Xenococcus_sp_PCC_7305_750617827                KEVVKRMKDNNPKP--------IPFRVGEICQIMTK-DNPE-LRGKGGCWCIVKD---I-Y--K--LSCQVSTW-NDNY-ILRAENLKSLG
      Xen7305DRAFT_00000510_Xenococcus_sp_PCC_7305_442790849          KEVVKRMKDNNPKP--------IPFRVGEICQIMTK-DNPE-LRGKGGCWCIVKD---I-Y--K--LSCQVSTW-NDNY-ILRAENLKSLG
      CAL6303_RS10710_Calothrix_parietina_505010770                   LDIVQQIKDKNPLP--------NPYHIGEVCRILPS-TDPD-LKPFSGCWCIVYE---I-N--P--HSCGVKTW-KANLATVKPEYLERID
      CAL7103_RS55850_Calothrix_sp_PCC_7103_737188944                 PDIVEQIKNKKRVP--------NPHHVGEVCRIISK-GNPD-LKPVAGAWCIVTQ---V-N--P--HSCGIKTW-KMDFPAVKPENLEITY
      _Cyanothece_sp_PCC_7822_754536191                               EALKQRVREKSLAP--------FPYNEGDVCKILVK-GNPD-LKGKGGHWGVIVA---I-N--N--FSADIQLA-DGIY-LAKEENLEELP
      CAL7103_RS55635_Calothrix_sp_PCC_7103_737188848                 TDIVEQIKNKQRVP--------NPHYKGQVCQIIAC-GDKE-LKKFAGCWAIIIQ---V-N--E--HSCNIQTW-REDIPTVKPENLKPLD
      WA1_RS0149465_Scytonema_hofmanni_703031009                      LDITQQIKNKQRVP--------NPRRVGEICQIVAQ-GDPE-LKKFSKCWGIIKE---V-N--L--HSCYIQTW-KMDFPTVKPENLEPVY
      RIV7116_RS23620_Rivularia_sp_PCC_7116_504933746                 TSIVQQIKDKQRVP--------NPRIVGEICQIISK-GDPD-LKQYSRCWCIINA---V-N--L--HSCAIKTW-KMDFPTVKPENLEPTY
      Cyan7822_6833_Cyanothece_sp_PCC_7822_306986606                  EALKQRVREKSLAP--------FPYNEGDVCKILVK-GNPD-LKGKGGHWGVIVA---I-N--N--FSADIQLA-DGIY-LAKEENLEELP
      SD81_RS27565_Tolypothrix_campylonemoides_751570983              KGIVEKLKQKPLFL-----AT-DFCQVGDVFILTRL-EGAE--RKYNGCWAIASV---L-N--D--FTVEVDVH-DTTL-NVKPENLNKID
      SYN7509_RS0223705_Synechocystis_sp_PCC_7509_740179759           KGIVERLKEKPLVK-----AS-DFCQIGGVFILTRL-EGNE--RKYNGCWAIASE---L-R--E--FTVVVDVY-DGEF-AVKPENLNSID
      SYN7509_RS0222055_Synechocystis_sp_PCC_7509_655839534           KSIVERLKEKPLVK-----AS-DFCTIGDPFILTRL-EGAE--RKYNGCWAIARE---H-R--D--FTIAVDVY-DGEL-AVKPENLNPID
      UH38_20050_Chroococcales_cyanobacterium_CENA595_768384071       KDIVQRLKEKPLAL-----AS-DYCSIGDVFTLTRL-EGIE--RKYNGCWAIAKE---L-R--D--YTIAVDVH-DTTL-SVKPDNLQPLD
      UH38_RS20080_Chroococcales_cyanobacterium_CENA595_769922127     KDIVQRLKEKPLAL-----AS-DYCSIGDVFTLTRL-EGIE--RKYNGCWAIAKE---L-R--D--YTIAVDVH-DTTL-SVKPDNLQPLD
      QH73_RS02585_Scytonema_millei_748134961                         KGIVERLKEKPRLH-----AA-DFCCIGDVFVLTKL-EDSD--RKYNGYPCIAVE---L-K--Q--FSVDVDVH-DTTL-TVKPENLKKVD
      _Cyanothece_sp_PCC_7424_752567338                               EELKQRVREKANVP--------FPYKVNDVCKIIVK-ENPQ-LRGKSGHWGIIVE---V-M--N--FSANIQLA-DGIY-QVKEENLEELS
      UYC_RS0100505_Chlorogloeopsis_fritschii_515385753               KGIVEQLKDKPLLL-----AS-DFCQIGDVFTLTRL-EGTE--RKYNGCWAIAVA---L-K--E--FSVEVDVH-DTTL-NVKPENLNKID
      TOL9009_RS0101730_[Scytonema_hofmanni]_UTEX_B_1581_657929542    KGIVERLKEKPLFL-----AT-EFCQIGDVFTITKL-EGVE--RKYNGCWAIAVA---L-N--D--FTLEVDVH-DTTL-NVKPENLNKID
      SYN7509_RS26630_Synechocystis_sp_PCC_7509_740179430             RGIVERLKEKPLVK-----AS-DFCTVGDPFILTRL-EGAE--RKYNGCWAIARE---L-R--D--FTIAVDVH-DTTL-AVKPDNLDPID
      PCC7424_5430_Cyanothece_sp_PCC_7424_218175274                   EELKQRVREKANVP--------FPYKVNDVCKIIVK-ENPQ-LRGKSGHWGIIVE---V-M--N--FSANIQLA-DGIY-QVKEENLEELS
      UYG_RS0120335_Fischerella_muscicola_515347403                   KGIVEQLKEKPLLL-----AS-NFCQIGDVFTLTRL-EGTE--RKYNGCWAIAVV---L-K--E--FSVEVDVY-DTTL-NVKPENLNKID
      _Cyanothece_sp_PCC_7424_501601085                               REVVEQYKEKPE----H-----NPFELGEVVGVESK-DNPL-LRGRNGAWGIVTG---V-S--K--HHCNLQLW-DTEFEEVGVEYLKELN
      PCC9339_RS0106675_Fischerella_sp_PCC_9339_515877940             KGIVEQLKEKPLLL-----AS-DFCQIGDVFTLTRL-EGTE--RKYNGCWAIAVV---L-K--E--FSVEVEVH-DTTL-NVKPENLNKID
      Glo7428_4930_Gloeocapsa_sp_PCC_7428_428267400                   KGIVEQLQEKPLLQ-----AR-DFCTCGDVFTLVKL-EGSM--RKYNGYWAIVCS---I-N--T--FTIAVDVH-DTTI-LVKPENLQPID
      GLO7428_RS24200_Gloeocapsa_sp_PCC_7428_754508876                KGIVEQLQEKPLLQ-----AR-DFCTCGDVFTLVKL-EGSM--RKYNGYWAIVCS---I-N--T--FTIAVDVH-DTTI-LVKPENLQPID
      PLEUR7319_RS33990_Pleurocapsa_sp_PCC_7319_738911651             -IVAQAVRLHQAAE--QN-LV-NPFTSGEICRLVVR-DNSQ-LKGKGGCWCIVDQ---V-Y--L--SSCTVNTW-SDEF-EVPIENLESLG
      QH73_RS11110_Scytonema_millei_748136693                         KGIVKQLKEKGLRY-----AT-EFCSVGDVFVLTKL-EDSE--RKYNGCPCVAIE---L-K--Q--FTVDVDVH-DTTL-TVKPENLQKLD
      PCC7424_5542_Cyanothece_sp_PCC_7424_218175378                   REVVEQYKEKPE----H-----NPFELGEVVGVESK-DNPL-LRGRNGAWGIVTG---V-S--K--HHCHLQLW-DTEIEEVGVEYLKELD
      CHRO_RS28535_Chroococcidiopsis_thermalis_752825464              KTVVERMKEKQLFP-----AR-DFCAVGDVFTLTRL-HSRE--RKYNGYPCVALV---L-K--D--FTIEVDVY-DGTL-IVKPENLKPID
      _Cyanothece_sp_PCC_7424_752567372                               REVVEQYKEKPE----H-----NPFELGEVVGVESK-DNPL-LRGRNGAWGIVTG---V-S--K--HHCHLQLW-DTEIEEVGVEYLKELD
      Chro_5819_Chroococcidiopsis_thermalis_PCC_7203_428013042        KTVVERMKEKQLFP-----AR-DFCAVGDVFTLTRL-HSRE--RKYNGYPCVALV---L-K--D--FTIEVDVY-DGTL-IVKPENLKPID
      QH73_RS16405_Scytonema_millei_748137603                         KGIVEQLKEKPLVI-----AK-DFCQVGDVFTLVRL-EGKE--KKYNGCSCVAVE---S-R--D--FTVMVEVH-DTTL-TVKPENLNKID
      CY0110_RS14620_Cyanothece_sp_CCY0110_737832178                  KQVVREMTREDA----D-----NPFELGEVVGIVAQ-DNPD-LKGKNGCWGIVTA---L-T--K--TTCNLQTW-NDELEAIEIEFLRELE
      CWATDRAFT_RS29615_Crocosphaera_watsonii_757158775               KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      CY0110_32445_Cyanothece_sp_CCY0110_126620031                    KQVVREMTREDA----D-----NPFELGEVVGIVAQ-DNPD-LKGKNGCWGIVTA---L-T--K--TTCNLQTW-NDELEAIEIEFLRELE
      CWATWH0401_4234_Crocosphaera_watsonii_WH_0401_543428839         KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DTELEGVEIEFLQELE
      CWATDRAFT_RS03435_Crocosphaera_watsonii_494514224               KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DTELEGVEIEFLQELE
      _Crocosphaera_watsonii_494523812                                KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEEVEIEFLQELE
      CWATWH0005_RS08635_Crocosphaera_watsonii_737857352              KQVVREMTREEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---V-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      _Crocosphaera_watsonii_546220971                                KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DTELEGVEIEFLQELE
      CwatDRAFT_0109_Crocosphaera_watsonii_WH_8501_67852287           KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      _Crocosphaera_watsonii_494519775                                KQVVREMTREEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---V-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      _Crocosphaera_watsonii_546222413                                KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      _Crocosphaera_watsonii_546206668                                KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      CWATWH0005_RS11590_Crocosphaera_watsonii_737862397              KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEEVEIEFLQELE
      CWATWH0005_RS00905_Crocosphaera_watsonii_494523801              KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      CYAN8802_RS22020_Cyanothece_sp_PCC_8802_752568031               KQVVRQLTRIPG----S-----NPFEVGEVVGIVAK-DHPG-VRGRNGSWAIVTA---I-A--S--DTCDLQLW-DTFLEGIESEYLKEMD
      Cyan8802_4571_Cyanothece_sp_PCC_8802_256592473                  KQVVRQLTRIPG----S-----NPFEVGEVVGIVAK-DHPG-VRGRNGSWAIVTA---I-A--S--DTCDLQLW-DTFLEGIESEYLKEMD
      CY51472DRAFT_RS0223830_Cyanothece_497231939                     KKVVREMTRGDE----D-----NPFELGEVVGIVAQ-DNPE-LKGKNGCWGIVTA---L-T--K--TTCDLQIW-DTELEGIEIEFLRELE
      CY0110_RS25950_Cyanothece_sp_CCY0110_495554039                  KQVVREMTREDA----D-----NPFELGEVVGIVAQ-DNPQ-LKGKNGCWGIVTA---L-T--I--TTCDLQTW-DNELEAIEIEFLRELE
      _Nostoc_sp_PCC_7120_764953501                                   ----------------------DPYRVGEICTLQPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTIKWW-DGDY-TAKVEHLKLLE
      MICAB_RS03030_Microcystis_aeruginosa_763118968                  RETVGKYRDKGK----A-----NVFEVGEVVGILAK-DNPR-LKGKNNCWAIVTA---V-H--P--RSCDLRLH-DGAIDLVKIEYLKELG
      MICAB_900014_Microcystis_aeruginosa_PCC_9717_389714985          RETVGKYRDKGK----A-----NVFEVGEVVGILAK-DNPR-LKGKNNCWAIVTA---V-H--P--RSCDLRLH-DGAIDLVKIEYLKELG
      QH73_RS07795_Scytonema_millei_748135946                         KDIVVQLKQKELFP-----IA-HFCQVGDAFTLMRL-EGCE--RKYNGYPGVATK---L-K--D--FTIEVEVF-DGTM-AVKPENLRPID
      MAE_RS14770_Microcystis_aeruginosa_501223295                    RETVGKYRDKGK----A-----NVFEVGEVVGILAK-DNPR-LKGKNNCWAIVTA---V-H--L--RSCDLRLH-DGAIDLVKIEYLKELG
      CWATWH0402_3406_Crocosphaera_watsonii_WH_0402_543537000         ------MTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEEVEIEFLQELE
      CWATWH0402_3406_Crocosphaera_watsonii_WH_0402_543531669         ------MTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE
      Ava_B0348_Anabaena_variabilis_ATCC_29413_75705382               -----------MLPCDRSWCNSDPYRVGEICTLQPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTIKCW-DGDY-TAKVEHLKLLE
      QH73_RS10255_Scytonema_millei_748136445                         KGIVEQLKEKPLRY-----AT-DFGQVGDAFTLMRL-EGCE--RKYNGYPSVAIE---L-R--D--FTILVEVF-DGTM-AVKPENLKPID
      UH38_RS16060_Chroococcales_cyanobacterium_CENA595_769921346     KSIVEQLKERPLRY-----AT-EFCAVGDVFTLTQL-EGEE--RRYNCYPCVAVD---L-N--E--FTVKVDVC-NTTI-AVKPENLKSVD
      QH73_RS16190_Scytonema_millei_748137500                         KDIVVQLKQKELFP-----IA-HFCQVGDAFTLMRL-EGCE--RKYNGYPGVATK---L-K--D--FTIEVEVF-DGTM-AVKPENLRPID
      KV40_RS24315_Myxosarcina_sp_GI1_738538439                       AEAVRLYLAKDSDR--E-----NPFVEREVCRIVVK-GNSK-LKGKGDCWCIVSQ---V-L--A--NSCMVDTWAEDNI-EVPIENLESMG
      KV40_RS30300_Myxosarcina_sp_GI1_738540904                       KEAVKAYLKQKYPP--V-----NPFSPGEICRI-----TSE-VPGKKNCWCVVAE---L-R--K--DECVVDTW-DDRF-VVSVEDLVPLK
      UH38_RS09160_Chroococcales_cyanobacterium_CENA595_769920071     RAVVEQLQAKPLVL-----AQ-DYCQVGDVFILTRL-QGKD--SQYNGCWAIALK---P-T--N--STVIVDVH-DATL-TVKPNNLNKID
      KV40_RS29900_Myxosarcina_sp_GI1_738540774                       KTAVKTYLNQKYPP--V-----NPFTEGQICRI-----SSG-IKGKLHCWCVISE---V-R--K--DKCVVDTW-DSQY-VVSVEDLQEMK
      KV40_RS28185_Myxosarcina_sp_GI1_738539875                       KEAVKAYLKQKYPP--V-----NPFSPGEICRI-----TSG-VPGKKNCWCVVAE---V-R--K--DECVVNTW-DDRF-VVSVDDLLSLK
      CY51472DRAFT_RS0225395_Cyanothece_497232068                     KKVVREMTREDA----E-----NPFELGEVVGIVAQ-DNPE-LKGKNGCWGIVTA---L-T--K--TTCNLQTW-NDELEEIEIEFLRELE
      PLEUR7319_RS33705_Pleurocapsa_sp_PCC_7319_738911416             KSAVKAYLKQKYPP--V-----NPFSIGEICRI-----KSG-VPGKQNCWCVVAE---V-S--Q--DECVVDTW-DDRF-VVSVDDLMPMK
      CYAN7822_RS28405_Cyanothece_sp_PCC_7822_503099618               GEVVKQLQNITIEQ--------NPFEVGDICLIAPG-DNPA-LRGLQGCWAIIAA---K-N--E--FYCDIKTS-LGLIRQISSSYLLHSA
      QH73_RS12040_Scytonema_millei_748136747                         KGIVERIKEKPLHL-----AS-DYCKEGEVFFLQRL-VGIE--KHYNGCWAIAIE---VES--R--FTIKAAVY-DGTL-ELRQENMKPID
      QH73_RS10490_Scytonema_millei_748136457                         KNAVERIRKKTLHL-----AS-DYCQDGEVFFLQRL-SGKE--KKYNGCWAIALE---VEN--R--LTVKAAVY-DGTL-ELRQEQMKSID
      CY0110_RS21885_Cyanothece_sp_CCY0110_495553174                  REVVRSSQQKEVQT--S--SF-SKKDIGAVIEIIKV-EGNDSLRGQKGNYGIITG---V-N--N--FSVSIETA-MGKYDTIPQQCVRKMP
      CY51472DRAFT_RS0223475_Cyanothece_497232009                     REVVRASQQKEVQA--R--SF-STEDIGAVVEITKV-EGNDNLRGQKGNYGIITG---V-N--N--FSVSIETA-IGEYDTIPQQCVRKMP
      PCC9339_RS0103785_Fischerella_sp_PCC_9339_648361686             EALLHVLIKHGQVQ--------NSYSIGEVCQIIAK-NNPQ-LKDKNGCWCIVKE---I-DATT--CTCTVATI-YGLC-SLSVDHLKSLN
      Nfla_3501_Nostoc_flagelliforme_str_Sunitezuoqi_308205593        KDVVQRMMERTQVP--------NTSQIGEVCQILVK-DNPL------TIYVVNEQGKKI-R--T--SAIPITN--DGTY-SGYKEFLLILE
      consensus/95%                                                   ......h.p..................Gplh.l..........+.b....slh.....l........ph.lp...p..h..h..p.L..h.
      consensus/90%                                                   b.hlpph.pp..............hp.G-lh.l..b..s....+.bss.aslh.....l....p...oh.lp.a.ss.h..l..c.Lp.h.
      consensus/85%                                                   b.lVpph.cc............s.hphG-lh.l..b.pssc..+.bsGhWslh.....l....p...oh.lp.a.ssph..lp.-.Lp.h.
      consensus/80%                                                   cplVpphpcc............s.hplG-Vh.l..b.css-..+.bsGhWsll.....l....p...oh.lp.a.-sph..lc.E.Lp.l.
      consensus/75%                                                   +plVpphpcc............sshplG-Vh.l..b.-ss-.l+sbsGhWslls....l....p...oh.lpsa.Dsph..Vc.E.Lpplp
      consensus/70%                                                   +plVpphpc+pb..........ssaplG-Vh.lh.b.-ss-.L+GbsGCWsIls....l.p..p...oCslpsa.Dsph..Vc.EpLpplp
      
      Back to Contents
    • Multiple sequence alignment of the Group2/clade 3/CHLREDRAFT_205675 -like N6-MTases

                                                                                                                                                   Str-1                             Str-2               Str-3 lost         Str-4                                                                              Str-5                                                                                                                                                                                 Str-6                                                                                                                                                         Str-7                                                                                                
                                                                                                  **                                                   *    *                                                                              *                                                                              *
      RES                                                                       -------M------PASDHCETPREAYADIAPV---VSAIAEMC----------------GKTDDS-VTIYDPY-FCAGAMKQRLASC--------GF-P-NIINRNS---------DFYED---IR-T-NN-I-P-EY-DILITNPPYSTEPYNHIKR-----------------------------------LMQFL----------A--------DMGKPFFILQPVYVYTKPYYQAAR----------------------------------------------------------------------------ERLGE------------G--------------------------------------------------------------------C-F-FITPS--HR---YRF---------------------------------------ET--P-QG-MRNV---------------RQQE-LI-----------------------------------------------------------TS---PFVSL-WYCYVP-A------HM--------------------------FPKLRR---------WWC------AE-G------HR----LSQGCV--MRAQSR
      ALIGN                                                                     -----------------------HHHHHHHHHH---HHHHHHH------------------------EEEE--------HHHHHHHH--------------EEEE-------------EEEE------------------EEEE-----------HHHH-----------------------------------HHHHH----------H--------H----EEEE----HH-----H-EH----------------------------------------------------------------------------H-----------------------------------------------------------------------------------------EEE---------EEE---------------------------------------------------------------------------------------------------------------------------------------------HE-EEEE---------------------------------------HHHHH---------HHH------HH-H--------------------------
      HMM                                                                       ----------------------HHHHHHHHHHH---HHHHHHH------------------------EEEEEEE-E--HHHHHHHHH--------------EEEE--E---------EEEEE---E-----------EE-EEEEE--------HHHHHH-----------------------------------HHHHH----------H--------H---EEEEEE-HH----HHHHHHH----------------------------------------------------------------------------HH------------------------------------------------------------------------------------E-E-EEE---------EEE---------------------------------------E-----------------------------EE-EE-----------------------------------------------------------E------EEE-EEEE-----------------------------------------EEE---------EEE------E-------------------EE--EE----
      FREQ                                                                      -----------------------HHHHHHHHHH---HHHHHHH--------------------------EE--------HHHHHHHH------------E-EEE--------------EEEE----------------E-EEEE--------HHHHHHH-----------------------------------HHHHH----------H--------H----HHHH---HHH------HHH------------------------------------------------------------------------------------------------------------------------------------------------------------------E-E-EEE---------EEE-------------------------------------------------------------------------------------------------------------------------------------------EEEE-EEEE----------------------------------------EEEE---------EEE------------------------------------
      PSSM                                                                      -----------------------HHHHHHHHHH---HHHHHH--------------------------EEEE------HHHHHHHHH----------------EE------------------------------------EEE--------HHHHHHH-----------------------------------HHHHH----------H--------H---EEEEEE---------HHHHH----------------------------------------------------------------------------H-------------------------------------------------------------------------------------E-E-EEE-------------------------------------------------------------------------------------------------------------------------------------------------------EEEE-EEE----------------------------------------HHHHH---------HEH------------------------------------
      FINAL                                                                     -----------------------HHHHHHHHHH---HHHHHHH-------------------------EEEEE-----HHHHHHHHH--------------EEEE------------EEEEE----------------E-EEEE--------HHHHHHH-----------------------------------HHHHH----------H--------H----EEEEE---------HHHHH----------------------------------------------------------------------------H-------------------------------------------------------------------------------------E-E-EEE---------EEE-------------------------------------------------------------------------------------------------------------------------------------------EEEE-EEEE----------------------------------------EEEE---------EEE------------------------------------
      PTSG_07527_Salpingoeca_rosetta_514687493                                  -------M------PASDHCETPREAYADIAPV---VSAIAEMC----------------GKTDDS-VTIYDPY-FCAGAMKQRLASC--------GF-P-NIINRNS---------DFYED---IR-T-NN-I-P-EY-DILITNPPYSTEPYNHIKR-----------------------------------LMQFL----------A--------DMGKPFFILQPVYVYTKPYYQAAR----------------------------------------------------------------------------ERLGE------------G--------------------------------------------------------------------C-F-FITPS--HR---YRF---------------------------------------ET--P-QG-MRNV---------------RQQE-LI-----------------------------------------------------------TS---PFVSL-WYCYVP-A------HM--------------------------FPKLRR---------WWC------AE-G------HR----LSQGCV--MRAQSR
      PTSG_04809_Salpingoeca_rosetta_514693110                                  RQ--W-QC------EDADHCETPLDAVRDIEPI---LEELCQAL----------------GKERAA-LRIYDPY-YCAGGIRVHLARL--------GF-T-SVYNKPE---------DFYSV--AEK-K-AA-S-P-DF-DVVVTNPPYSSVPVNHVLK-----------------------------------LARFL--------------Y----SQPKPWLVVQPNYVYTKGPWHALV-----------------------------------------------------------------------------------R---------K-------H--------------------------------------------------------HPQ-P-F-YVIPG--TP---RAY------------V----Y-Q--------------------A--P-AT-MRGD------------VQRKKQL----K---------------------------------------------------------TS---PFVTF-WFCNLR-E------HQ--------------------------QRVFEK-----------V------QQ-G--LLS-SR----PHVGVA--VKSF--
      MONBRDRAFT_26429_Monosiga_brevicollis_MX1_167525218                       FP--W-EA------DANDHCETPYLAYAHIAPL---LRWLAKSL----------------GKTSQE-LVIYDPY-FCAGSMRAHLARL--------GF-E-TVINECR---------DFYAD---VA-S-GN-V-P-DY-DVLVTNPPFSTTGQDHVAL-----------------------------------LCQFL--------------Q----KTPKPFCVVQPNYVYTKAPWAALT-----------------------------------------------------------------------------------QTTG------R-------A--------------------------------------------------------APF-P-F-YLTPR--QP---RQY------------V----Y-E--------------------T--P-PA-FRP--------------LNAKQR----K---------------------------------------------------------TS---PFVTL-WYCRLP-A------HM--------------------------QGAAFR---------WMV------TP-G------CK----LPLRLS--IVST--
      MONBRDRAFT_25858_Monosiga_brevicollis_MX1_167523896                       HA--F-KT------EREDHCETPFEAYKDIAPL---LHQVAGML----------------GKNADE-VQLYDPY-YCAGTTKKHLARL--------GF-P-NVYNENV---------DFYEA---VR-S-EQ-T-P-AY-DILITNPPFSNTHVNHIKR-----------------------------------LMRFV----------A--------NSGKPFFILQPNYVYTKPYYQEVR----------------------------------------------------------------------------A------------------------------------------------------------------------------------------------------------------------------------------------A--P-MP-LPSV---------------HKPR---------------------------------------------------------------Q---PF------CSVS-S------HR--------------------------ISSDDRDPADCSGPGSAA------CR-G------YT----QPPICV--RHPRGL
      ACA1_068850_Acanthamoeba_castellanii_str_Neff_470514790                   WY--F---------HFGRHHQRVVQWWQAQVQR---QRKASKNT-----K-TKT------KKKKNE-SESEDED-EEDEEEEEKKKAG--------GL-L-PDCVLAR---------DVNSL---P--K-GV-R-P-EA-GR-SGGPPLGG----------------------------------------------------------------------------------QKRRFEEAI--GGGRG-----------------------------------------GRGGRGGGR------------------------------------G--GR------------------------------------------------------------GGD------------GR-G-GSW-----------------R-G-------------------------GG-RGGG-------DRG--GRGGRGR-------GAPSGDG------------------------------------------------GK---RFKSH-HPQPPS-G----------------------------------GGATMR------G-----------AS--------QG----RHTRFD--------
      ACA1_037140_Acanthamoeba_castellanii_str_Neff_470455333                   HP--F-EA------DPRDHAETPFQAYRDIAPL---LRALAQRLYGSVVGDANDDADGGVAAASRQ-LRIYDPY-FCEGSMVAYLARL--------GF-T-NVYNRNE---------DFYGV---VA-A-GR-V-P-DF-DVLVTNPPFSG---DHMER-----------------------------------IVRWA----------E--------GCGRPWLLLMPDFVANKPYYHAFR---------------------------------------------------------------------------------------------HRCTSTSAA--------------------------------------------------------SGK-P-V-YIGPTR-QA---YVF-----------------A-A-----------------------P-LH-TPDG--------------------------VTP-LVG------------------------------------------------QR---PATAV-VPAGLK-R---R------AADKNEDDENEDDEGEG-------------------------------------------------------------
      VOLCADRAFT_105839_Volvox_carteri_f_nagariensis_302843234                  HG--F-AT------EPGDHAETPLQAYEHIEPL---LARLATRL----------------GTTKAA-LRIYDPF-FCEGRMLAHMASL--------GF-E-SVHNRNE---------DFYAM---RD-T-KR-T-P-EF-DVLVTNPPFSG---DHIEA-----------------------------------TFEFA----------I--------ASGRPFFILVPQYVSRKAFYLEWL---------------------------------------------------------------------------------------------N--NGCTPG--------------------------------------------------------LPP-P-V-FVGPKH-EP---YIF-----------------L-A-----------------------P-DR-ADVR--------------------------TAR-APG------------------------------------------------QE---PATAA-ATQG--------------DSVPGDDAASRACAGAGVGSADGGLTDAVE---------WST------ET----------------------------
      VOLCADRAFT_120421_Volvox_carteri_f_nagariensis_302830991                  ---------------------------------------------------------------------------------------------------M-NFIHRNR---------DFYQD---VK-C-GD-V-P-PY-DILVTNPPYSA---DHKER-----------------------------------ILEFC--------------L----RSGKPWALLLPNYVATKAYFGQLL---------------------------DSTTT-------------------------------------------------------------A-------A--------------------------------------------------------SQR-P-F-FLTPQ--AR---YSY-----------------E-H-----------------------P-EG-TGHL---------------------------------------------------------------------------------ES---PFFSI-WYIGLG-E-Q--------------------------------------------------------TE----------------------------
      OT_ostta10g00600_Ostreococcus_tauri_693498069                             LA--FGDV------DEADHCETPFRAYRDVEPF---LFAIAKAL----------------KREKKD-LRIYDPY-YCEGSMVEHLRAL--------GF-E-SVYNRNE---------DFYEK---VA-T-KT-T-P-DF-DVLVTNPPYSG---DHFKR-----------------------------------ILTFC----------R--------DSNKPWLLLLPNFVCRKSYYEECV---------------------------------------------------------------------------------------------G-------E--------------------------------------------------------RAK-P-L-FLIPDSTKP---YRY-----------------W-A-----------------------P-GR-NEDG--------------------------VR--AKG------------------------------------------------TT---PFETF-WYIEFS-G-------------------------------TLDPEAQRA---------WWL------KK--------YK----AHSTCD--IPALDE
      OT_ostta14g01460_Ostreococcus_tauri_693497110                             GG----MS------IEDDEWATATRTWRSLFTV---L--------GAT------------ANGFRT-KKVWAPF-VYDELAGKRMREA--------GF-E-RVVHKRW---------DFFDK---VR-D-GPFV-R-SL-DAVVDNPPYTG-K-GMKQK-----------------------------------VLKSL----------V--------DAELPFCLLLPLGVVHGAFVREML---------------------------------------------------------------------------------------------E--ER------------------------------------------------------------YVQ-----IIVPR--KC---YVF---------------------------------------------KK-NGA----------------------------------------------------------------------------------EI---PFKFLVWLCYKM-E-L--------------------------------ERDLYL-ID---------------------------------------------
      Ot14g01730_Ostreococcus_tauri_308811312                                   GG----MS------IEDDEWATATRTWRSLFTV---L--------GAT------------ANGFRT-KKVWAPF-VYDELAGKRMREA--------GF-E-RVVHKRW---------DFFDK---VR-D-GPFV-R-SL-DAVVDNPPYTG-K-GMKQK-----------------------------------VLKSL----------V--------DAELPFCLLLPLGVVHGAFVREML---------------------------------------------------------------------------------------------E--ER------------------------------------------------------------YVQ-----IIVPR--KC---YVF---------------------------------------------KK-NGA----------------------------------------------------------------------------------EI---PFKFLVWLCYKM-E-L--------------------------------ERDLYL-ID---------------------------------------------
      Ot13g02010_Ostreococcus_tauri_308810727                                   HP--F-ET------DADDHCETSPEAHENVANF---LKAACGQL----------------NKSPAE-LVIYDPY-YCAGGVRRNLALI--------GF-T-NVINRNE---------DFYQV---IE-E-DR-V-P-QH-DVLLTNPPYSE---DHIVK-----------------------------------CVTFA----------AENLA----AHGRPYMLLLPSYVIHKDYYMPALLTGGTRGR-------------------------------EKAKALAAAAERDAKGSDDDDDEAEEMKIHSGS---------------------A--SR---A--------------------------------------------------------QIL-P-F-YVAPK--KR---YYY-----------------W-T-----------------------P-KA-MIKARAAKGQSEES--IKARRKR----TH-IGALGER------------------------------------------------TS---PFLSF-WYCCFG-T-M--------------------------------QRDVLA---------WHK---R--LP--------RA----DVFGYT--LARHPN
      Ot10g00600_Ostreococcus_tauri_308808370                                   LA--FGDV------DEADHCETPFRAYRDVEPF---LFAIAKAL----------------KREKKD-LRIYDPY-YCEGSMVEHLRAL--------GF-E-SVYNRNE---------DFYEK---VA-T-KT-T-P-DF-DVLVTNPPYSG---DHFKR-----------------------------------ILTFC----------R--------DSNKPWLLLLPNFVCRKSYYEECV---------------------------------------------------------------------------------------------G-------E--------------------------------------------------------RAK-P-L---------------Y-----------------W-A-----------------------P-GR-NRR-----------------------------------------------------------------------------------R---RFGTF----EFS-G-------------------------------TLDPEAQRA---------WWL------KK--------YK----AH--C---------
      OSTLU_29451_Ostreococcus_lucimarinus_CCE9901_145356641                    ------MS------RADDEWATSERTWASLFGV---L----------E------------RNGYAK-KKLWAPF-VYDGDAGRRMRSA--------GF-E-RVVHRRA---------DFFER---VR-D-GVFV-R-SL-DAVVDNPPYTG-K-GMKER-----------------------------------ILKAL----------R--------DAEVPFCLLLPLGVLHAAFVRDIL---------------------------------------------------------------------------------------------E--ET------------------------------------------------------------HVQ-----VIVPR--KC---YVF---------------------------------------------KK-GGT----------------------------------------------------------------------------------EV---PFKFLCWLCYKM-E-L--------------------------------KRDLYF-ID---------------------------------------------
      OSTLU_27164_Ostreococcus_lucimarinus_CCE9901_145353330                    HP--F-ET------DADDHCETSPEAHENVANF---LKAACGRL----------------NKTPAD-LVIYDPY-YCAGGTRRNLALI--------GF-T-NVINRNE---------DFYKV---IE-E-GR-V-P-EH-DVFLTNPPYSA---DHIEK-----------------------------------CVTFA----------AENLA----AHGRPYMLLLPSYVIHKDYYLPALLTGGSRGK-------------------------------EKAKSLELQRKDEENNEDDVEENDDGIKRHGGG---------------------A--AR---A--------------------------------------------------------QIL-P-F-YIAPK--KR---YYY-----------------W-T-----------------------P-KA-MVKARAAKGQSEES--AKARRKR----TH-IGALGER------------------------------------------------TS---PFLSF-WYCCFG-S-M--------------------------------QRDVLA---------WHK---K--LP--------RS----EVWGYV--VARHPN
      OSTLU_26028_Ostreococcus_lucimarinus_CCE9901_145351428                    LA--FADA------DEADHCETPFRAYRDVEPF---LFNLAKAM----------------KKEKKT-LRIYDPY-YCEGSMVAHLNAL--------GF-E-NVYNRNE---------DFYEK---VA-T-KS-T-P-EF-DVLVTNPPYSG---DHFKR-----------------------------------ILSYC----------R--------DSNKPWLLLLPNFVCRKSYYAPSI---------------------------------------------------------------------------------------------G-------D--------------------------------------------------------AAK-P-L-FLIPDDAKP---YRY-----------------W-A-----------------------P-GR-QGHE--------------------------IR--AKG------------------------------------------------TT---PFETF-WYIEFA-G-------------------------------VLDAQQQRA---------WWL------KK--------YA----AHSSCS--VPALDE
      MNEG_5609_Monoraphidium_neglectum_761971839                               HP--F-ET------DPGDHAETPFEAYEHLEPL---LARLARRL----------------GTTKAA-LRVYDPF-YCEGRMVQHMARL--------GF-E-SVYNRNE---------DFYQV---RA-E-GR-C-P-EY-DVLLTNPPFSG---DHLER-----------------------------------IFRFA----------V--------NSNKPWFLLIPQYVARKAFYLEWL---------------------------------------------------------------------------------------------T--NRRPKG--------------------------------------------------------APK-P-S-FVAPTR-QP---YVF-----------------T-A-----------------------P-DR-ADVR--------------------------LMRLADG------------------------------------------------QQ---AAAAA-APAAAQ-R---QVAEGAASQEQGTAQGQVECSGLGADGEAEQHQDQLS---------QQQ------QQQQ---QQQQQ----QQQQQQ--QPALSD
      MICPUN_64290_Micromonas_sp_RCC299_255088714                               GR----MS------VEDDEWATAPRTWAALAPY---L------------------------SDYHD-KKIWAPF-YYDGAAGKRLRDA--------GF-T-RVVHKRE---------DFFKR---VN-D-RVFV-K-SV-SAVVDNPPYTG-K-GMKER-----------------------------------VLRAL----------V--------AVDVPFCLLLPLGVLHTATVREIL---------------------------------------------------------------------------------------------D--PE------------------------------------------------------------HVQ-----ALIPR--RC---WVS---------------------------------------------KS-GQR----------------------------------------------------------------------------------EV---PFKYLVWLCYKM-R-L--------------------------------PRDLVL-MPDT-------------------------------------------
      MICPUN_103519_Micromonas_sp_RCC299_255086063                              HP--F-EV------DAADHCETPFQAYQDIEPF---LFRMALAL----------------KKPKDK-LRIYDPY-FCEGSVAKHLARL--------GF-T-SVYNKNE---------DFYKC---IE-E-KR-I-P-EH-DVLLTNPPYSG---DHFRR-----------------------------------ILSFC----------A--------KNKKPWLLLLPNFVCRKQYYQPCV---------------------------------------------------------------------------------------------G-------E--------------------------------------------------------DVK-A-L-FLIPDPTKP---YRY-----------------W-A-----------------------P-GR-RGFE--------------------------DRNQAKG------------------------------------------------TT---PFETF-WYVNYA-G-------------------------------LAPHEEVRA---------WWM------KK--------FA----PHSTCT--LPAPDE
      MICPUN_59514_Micromonas_sp_RCC299_255079482                               PS--A-PR------DPADDCETPDVAYAHIAPL---LRKLAQRL----------------GKPPGA-LRIWDPY-YCAGGVKARLGAL--------GFGD-VVNDPDA---------DFYDV---VD-G-SRPP-P-PH-DICVTNPPFSG---NHARR-----------------------------------LFEYLNSRGERGEGKGTTKG----GPVARFVVLAPEYVHRKA-WF------------------------------------------------------------------------------------------------E-------A--------------------------------------------------------PRG-T-F-FMVPS--RR---YSF------------V----A-A--------------------S--G-GR-RENT------------AVDCRHW----RR-ERSCPRGETCPFVHVGPGIDPEGERVAEEARRARANAGFSGGGGGGD---GGRVTVA---PFDCY-WHCHLG-E------FT--------------------------RSVAAA---------WRQ------KH-G------RR----GRVGVR--MVDGVE
      MICPUN_59025_Micromonas_sp_RCC299_255078302                               HA--F-QV------DADDHCETSPEAHAHILNF---LNKTASAL----------------GKTPKT-LVIYDPY-YCAGGTKRSFAAL--------GF-P-NVINENK---------DFYAV---CQ-R-GE-V-P-EH-DVLVTNPPYSA---DHVER-----------------------------------CITFA----------AKNLY----EHGRPYFLLLPSYCVNKPYYTSALLTGGAAGKKARAAREEAEGGTKTKEEDGRDKKSGDEPRRDENEQQQEEEEEKTEEEEEEEEDGEGFKVHDGGKSRTKKI---------ATRDGG--SR---R--------------------------------------------------------QTL-P-F-YVAPV--KR---YYY-----------------W-T-----------------------P-KP-LIAARKAQQGVELG--GKTRRKK----SH-VGRLGER------------------------------------------------TS---PFLSF-WYCGMG-DDL--------------------------------QPEALR---------WHR---K--LP--------RA----AVGGYT--VARNPN
      MICPUCDRAFT_53312_Micromonas_pusilla_CCMP1545_303288509                   GG----MS------VEDDEWATSARTWRALAPH---L------------------------SAYIA-KKVWAPF-YYDGTAGIRMREA--------GF-R-RVVHTKD---------DFFKR---VN-D-RAFV-K-SL-AAVIDNPPYTG-K-GVKER-----------------------------------VIAAL----------V--------RADVPFCLLLPIGVLHA---QELL---------------------------------------------------------------------------------------------D--AD------------------------------------------------------------KVQ-----TLIPR--RC---RVN---------------------------------------------KA-GGR----------------------------------------------------------------------------------EI---PFKYLVWLCYKM-E-L--------------------------------ERDLVL-MPDE-------------------------------------------
      MICPUCDRAFT_51291_Micromonas_pusilla_CCMP1545_303284947                   HP--F-EY------DAADHCETPFQAYQDVEPF---LFRVALAL----------------KKPKEK-LVIYDPY-FCEGSVVKHLARL--------GF-A-NVINRNE---------DFYQC---ID-E-KR-I-P-EH-DVLLTNPPYSG---DHFRR-----------------------------------ILSFC----------G--------KSKKPWLLLLPNFVCRKQYYEPAI---------------------------------------------------------------------------------------------G-------A--------------------------------------------------------SSK-P-L-FLIPDQLKP---YRY-----------------W-A-----------------------P-GR-KGYE--------------------------DRTRAKG------------------------------------------------TT---PFETF-WYVDFA-G-------------------------------VASHADVRA---------WWM------KK--------FS----PHSNCT--LPDVND
      MICPUCDRAFT_62794_Micromonas_pusilla_CCMP1545_303284171                   ----M---------NPTDEYMTPPSAWEAIQKY---I--------------------------PKN-KVIWEAF-YGDGKSGDTLRTL--------GF---KVIHDEV---------DFFE---------NN-----LG-DVIVSNPPFSR-R-AAGPR-----------------------------------ASVSI-STIAEVYP-S-----D--RRRRDYSVCHEELGITPDFRRVEI------------TR-----------------------R-VNFIHVN----------------------------------W------------D--SR---P----GSPRS--------------------------------------STTPR----RRR-S-I-RNRSR--AR---YTY------RR---------R-R-----------------------P-------------------------------RC-------------------------------------------------------------------WWETAA-P-S--------------------------------------------------------------------------------------
      MICPUCDRAFT_60475_Micromonas_pusilla_CCMP1545_303283096                   HA--F-DA------DGDDHCETSPEAHANVVNF---LNDVASRL----------------GKKPSE-LIIYDPY-YCAGGTERSFNAL--------GF-R-NVINRNE---------DFYAV---AK-R-NE-V-P-EH-DVLVTNPPYSA---DHVEK-----------------------------------CLTFA----------AANLA----EHGRPYFLLLPSYVIHKPYYVDALLTGGAAGRRAKEAREKRERGRAEEKEEDVEEE-------EEEEEDEEEDEEEMVDDDDDDDDDDQMVFKRASDATGERIPLAKKKSNASSSSSS--SR---R--------------------------------------------------------QTL-P-F-YVAPA--KR---YYY-----------------W-T-----------------------P-KA-LLAARRAASARDDGESASARRKK----KH-VGRLGER------------------------------------------------TS---PFPSI-WCCCLG-E-F--------------------------------QTDALR---------RHR---K--LP--------RA----FVDGYT--VSTHPN
      MICPUCDRAFT_57902_Micromonas_pusilla_CCMP1545_303278244                   RD--R-DR------DPADDCETPAVAYAHLAPI---LRKLAQRL----------------KKPPSE-LAVYDPY-RCAGAVESRLGAL--------GF-D-AVANPAD---------DFYAA---LE-E-DR-V-P-PH-DVLVTNPPFSG---EHARR-----------------------------------LVSFL----------ASTRY----SRKRAFCFLAPEYVHRKA-WYAAM-----------------------------------------------------------------------------------T---------R-------A--------------------------------------------------------RPD-V-C-YLVPK--ER---YAF------------V----A-S--------------------S--G-GR-RENT------------AKPCRHW----AR-DGRCPRGDECPFQHGGGG--------GSGASTSSEDAPVARGGGASSSVTGTRTVVA---PFDCI-WHVHAG-E-----GRQ--------------------------RSVVAA---------WRQ------KY-GGGDGK-KE----DALGAR--LVERAE
      CHLREDRAFT_205675_Chlamydomonas_reinhardtii_159485216                     WP--F-EV------DYNDHFETSSAAVDDIQPV---LLALCNRL----------------KKTPAQ-LAIYDPF-FCKGGIRRHYEAR--------GF-T-NFIHRKR---------DFYAD---VE-S-GQ-L-P-EY-DVMVTNPPYSA---DHKER-----------------------------------ALDFC--------------L----RSGKPWALLLPNYVATKAYYSELV---------------------------DAAGT-------------------------------------------------------------P-------P--------------------------------------------------------QQR-P-F-YLTPI--TR---YAY-----------------E-H-----------------------P-EG-TGHA---------------------------------------------------------------------------------ES---PFYSI-WYVGLG-V-H--------------------------------------------------------TE----------------------------
      Bathy01g03440_Bathycoccus_prasinos_612399992                              DR----FS------IEDDEWATSQRAWNALAKH---L------------------------EKFKG-KKIWAPF-YYDGKVKTRLKQA--------GF-RGKVTHEKR---------DFFKL---MN-D-AKFL-A-NV-DAIIDNPPYTG-K-GMKEK-----------------------------------ILTKL----------I--------AKDVPFCLLFPLGVLHSKFLRDLT---------------------------------------------------------------------------------------------A--AK-----AR--------------------------------------------------RK-KVQ-----AIVPR--RV---FVH---------------------------------------------KE-FGE----------------------------------------------------------------------------------EL---PFKYLVWLCYGL-E-L--------------------------------ERDLVL-MDEE-------------------------------------------
      Bathy07g04630_Bathycoccus_prasinos_612393160                              HA--F-EC------NPDDHCETSLEAHKDIVNF---LNIVASQK----------------NKKPSE-LIIYDPY-YCAGATRLNFTEL--------GF-P-NVINENK---------DFYEM---VA-K-NK-V-P-EH-DVFVTNPPYSE---EHVEK-----------------------------------CVTFA----------AKNMT----QFGRPYFMLVPSYVVCKPYFVPALLTGGAQGA-------------------------------------------EERENTKEEDAEDDMNMH------------------------K--KR---G--------------------------------------------------------QVL-P-F-YIAPT--KR---YYY-----------------F-T-----------------------P-KP-LAKLR--NKNIDEN--GIEKKRR----GH-VGRRGER------------------------------------------------TS---PFLSL-WICGFG-D-D--------------------------------QIEALR---------MHK---K--LR--------RE----QVKHYV--VARNPK
      Bathy11g00290_Bathycoccus_prasinos_612389594                              HP--F-DV------DASDHCETPFQAYKDIEPF---LFRIALSL----------------KKTKAS-LKIYDPY-FCEGSAKEHLKRL--------GF-E-SVHNVNE---------DFYEN---VK-K-NT-I-P-EY-DVLLTNPPYSS---DHFKR-----------------------------------ILNFC----------G--------ASEKPFFLLLPNFVCRKTYYANEI---------------------------------------------------------------------------------------------T-------S--------------------------------------------------------RKKEP-L-FLIPDELKP---YRY-----------------W-A-----------------------P-GR-KGFE--------------------------ER--AKG------------------------------------------------TT---PFITF-WYLEFG-D-------------------------------AIDKNEIRG---------WWL------KK--------YS----PHSRCE--LPAPEE
      GUITHDRAFT_46531_Guillardia_theta_CCMP2712_551675275                      ----F-SV------EHQDHCETPGEAYDDIVPV---LLAIASNI----------------GKRADE-LMIYDPY-YCNGLVAQNLRDR--------GF-Q-HVYNKNE---------DFYEA---VK-Q-GT-T-P-PF-DVLVTNPPYSN---DHIER-----------------------------------LFSFC----------S---S----CEK-PWMVLVPNYVYTKDYYEKML-------------------------------------------------------------------------------------------------K---S--------------------------------------------------------GVR-P-F-YVIPP--NR---YEY-----------------I-S-----------------------P-AG-ARGS---------------REKK--------------------------------------------------------------TS---PFVSF-WFI---------------------------------------------------------------------------------------------
      GUITHDRAFT_109156_Guillardia_theta_CCMP2712_551658644                     HP--F-EH------DPADDCETCFQAYCDIAPF---LIKLAQRV----------------GKPKKD-LCIWDPY-YCAGKVKDHLRKL--------GF-H-NVHNNNE---------DFY-S---LK-P-EQ-F-P-PY-DVLLTSPPYSR---NHIEK-----------------------------------ILVFA----------S--------ECKKPWILLMPQYVHRKSYYSAII---------------------------------------------------------------------------------------------E-------G--------------------------------------------------------QH--P-F-YMIPP--KP---YVYHAHHGGRKDNTNVTCRHW-ARDGKCPKGDECAFVHGEVGDSAQP-AI-QSKG------------------------------ITP------------------------------------------------VT---PFKSI-WHMHFP-P-------------------------------EGMNNGIYT---------WAV------HK--------LR----KS------------
      GUITHDRAFT_90353_Guillardia_theta_CCMP2712_551638519                      HS--F-ET------TDADHAETPREAYEHILPL---LHKMAEAA----------------SKKPSE-LRIYDPF-FCTGSMKRHLASL--------GF-T-NVYNKNE---------DFYEM---VK-S-KR-I-P-EH-DMVVTNPPYSL---DHIPR-----------------------------------FLRWL----------S--------VNDKPWLLLVPNYVYTKDYFSSSL---------------------------------------------------------------------------------------------R--GRL---------------------------------------------------------------P-M-FLTPPG-R----YVY-----------------E-S-----------------------P-KH-VAN-------------------------------AQG------------------------------------------------QT---APYVS-FWYVET-R----------------------------------------------------------------------------------------
      GUITHDRAFT_113893_Guillardia_theta_CCMP2712_551648195                     HP--F-PT------EYGDHFETSKVAIHDIAPI---LQQFAKVS----------------GKQASS-LAIYDPY-YCDGAVIEHFRQE--------GF-H-NVHNLNV---------DCYQV---WK-S-AT-TSS-DF-DIVVTNPPFSG---DHKQK-----------------------------------CLEHC--------------V----KREQAWMVLLPAYCATKNYFQELM-----------------------------SNW-------------------------------------------------------------K-------E--------------------------------------------------------RGK-V-F-YGIPK--VR---YDF-----------------E-H-----------------------P-EG-TGHA---------------------------------------------------------------------------------VS---PFFSI-WFVYLG-K-H--------------------------------------------------------TE----------------------------
      GUITHDRAFT_165084_Guillardia_theta_CCMP2712_551646515                     FP--Y-EI------DDADHAETPAEAYADISHV---LEYVAGIL----------------KKDNNT-VKIYDPY-YCNGSVKKRLMRQ--------GF-P-NVYNERE---------DFYKA---IE-D-KR-I-P-SH-DILLTNPPYSG---DHPER-----------------------------------LMNFI----------S---R----TKS-PWFLLMPNWVYTKDYYKDLI----------------------------------LN---------------------------------------------------------K--AC---S--------------------------------------------------------SNP-P-F-YYIPK--KR---YTY-----------------W-T-----------------------P-PW-LHSS----------------------------QFGVS------------------------------------------------TS---PFPSF-WYIH--------------------------------------CGKHTE---------KVKGW----LE--------SN----ASDSMM--FAGGVQ
      THAPSDRAFT_bd1109_Thalassiosira_pseudonana_CCMP1335_224015927             HA--F-ET------NSLDHCETPLCAYENVQTV---LEMMAKHL----------------HVQPSK-LRIWDPY-YCDGTVKQHLASL--------GY-D-RVINENV---------DFYKR---VE-D-NT-I-P-EH-DVLLTNPPYSG---DHIER-----------------------------------LLKFV----------T---T----VNDKPFCLLMPNWVARKKEYKSII-----------------------------------------------------------------------------------------------------G--------------------------------------------------------KTN-L-F-YVSPI--EV---YTY-----------------A-M-----------------------P-TW-NSKP------------EHVDEET----GK--------------------------------------------------------TT---PYLSS-WYVSLR-S----------------------------------NSEATG---------RIE------NK--------LD----SIAKR---------
      THAPSDRAFT_9806_Thalassiosira_pseudonana_CCMP1335_224009558               -P--F-KA------DPDDHCESSPTSYAHIAPI---LNYVAKCI----------------GKKPRK-LEIYDPY-YCAGGMVRHMNKL--------GF-N-KVYNKAE---------DFYQV---IR-D-GN-V-P-SH-DVVVTNPPYSG---DHFDR-----------------------------------LLQF--------------LS----GNHKPALLLLPEHFSKNK---------------------------------------------------------------------------------------------------S--AR---H--------------------------------------------------------AQH-N-FCFLVPT--ER---YHY-----------------W-T-----------------------P-DG-M-------RPDDEG--DKKRKKQ----HR-NLVLGSR------------------------------------------------NS---PFPSH-WFIAME-P-IMT------------------------------NKQLIS---------LVR---D--GE--------IK----LLEGCG--LYERQE
      THAPS_23466_Thalassiosira_pseudonana_CCMP1335_224005064                   FP--Y-PT------NPDDHCETPLQSYQDILPI---LNELRKGT-----G----------ATERET-LKIYDPY-FCNGSVVKHLASL--------GY-T-NVYNKKE---------DCYKV---WK-Q-RK-E-P-PF-DAFLTNPPYSD---DHIDK-----------------------------------LMEYL-ASP------S---F----DN-KPWLLLMPSWVHKKDYYINAT---------------------------------------TGNKKDRKKGK-------------------------------------------D-------S--------------------------------------------------------RSN-P-F-YIVPK--KR---YVY-----------------V-P-----------------------P-PD-FREK---------K--VSDVHKK--------------------------------------------------------------SS---PFTSM-WYIWGG-T----------------------------------NEKNEA---------LIK---A--FQ--------KS----NVDGCD--VARSRS
      THAPSDRAFT_6523_Thalassiosira_pseudonana_CCMP1335_224003919               HA--F-ET------NSLDHCETPLCAYENVQPV---LEMMAKHL----------------HVQPSM-LRIWDPY-YCDGTVKQHLASL--------GY-D-RVINENF---------DFYKR---VE-D-NT-I-P-EH-DVLLTNPPYSG---DHIER-----------------------------------LLKFV----------T---T----VNDKPFCLLMPNWVARKKEYKSII-----------------------------------------------------------------------------------------------------G--------------------------------------------------------KTN-L-F-YVSPI--EV---YTY-----------------A-M-----------------------P-TW-NSKP------------EHVDEET----GK--------------------------------------------------------TT---PYLSS-WYVSLR-S----------------------------------NSEATS---------RIE------NK--------LD----SIAKR---------
      THAPSDRAFT_21256_Thalassiosira_pseudonana_CCMP1335_223996249              YP--Y-PT------DYNDHFETPQRAYEDILPI---IGYVLKKK---I---------KR-YNSQSD-VTIYDPY-FCTGRAATLLNATFEQ--HTTGNKRHTNIRIQHEKR------DFYQD---VR-Q-NN-T-P-QY-DILVTNPPYSG---DHKER-----------------------------------CLEYV-------VD-Q-----LK-NNQRPFFLLMPNYVASKEYFRKIV--------------------------------------------------------------------------------L------------E--EK---I-----------------------------------------------------Q------I-V-FITPS--SKHP-YEY-----------------D-H-----------------------P-EG-TGHE---------------------------------------------------------------------------------TS---PFASV-WFCGLS-C----------------------------------GDTDGT---------WKK----------------NQ------------------
      THAOC_11048_Thalassiosira_oceanica_397625596                              YR--A-TV------DYNDHFETPLRAYTDVFPV---IETLIQQK----------------C-KGKR-VIIYDPF-YCTGRAASLLRQC--------LQ-S-NNEKLAEKVDIQHEKRDFYRD---LR-E-NT-V-P-KF-DILVTNPPYSG---DHKER-----------------------------------CLEFA--------------V----NSSRPFFLLMPNYIATKEYFRKTV-------------------------------L-------------------------------------------------------------E-------T--------------------------------------------------------KKV-QDV-YIIPS--PGES-YEY-----------------H-H-----------------------P-EG-TGKP---------------------------------------------------------------------------------LS---PFESV-WFVGVS-R-R--------------------------------------------------------TS---L------------------------
      THAOC_20767_Thalassiosira_oceanica_397605573                              FP--Y-DV------NPDDHCETPPEAYRDVDPL---LSDLCRRL----------------GKSKSE-LRIYDPY-YCDGSVRRHLADI--------GY-G-DVHNERV---------DCYRV---WE-E-GR-E-P-EF-DVLVTNPPYSH--------------------------------------------IGYS----------Q------------DAFPSLPANEQNKRRSHREA---------------------------------------HEVRHLAVLRGQAVAPPNAAVGAQEGLLRGDHDG--------------------P--SR---P--------------------------------------------------------SPP-A-V-LRRAP--EA---VRL-----------------P-P-----------------------P-RG-PAGE------------EGQRHAQ--------------------------------------------------------------EE---LAVRL-HVVLLG-G----------------------------------EGGGQR---------GMD---RV-VP--------RG----GTGEGG--CDVARS
      PHATRDRAFT_41248_Phaeodactylum_tricornutum_CCAP_1055/1_219130185          YP--Y-PV------NYNDHFETPLLAYKDLQPL---IDWLWSSS---I---CRKVKQGR-NAKATD-ISIYDPY-YCDGRTRSILAEL--------GY-R-NVLHEKR---------DFYKD---VM-R-NT-V-P-EY-DLLLTNPPYSD---QHKTK-----------------------------------CLEYC-------FS-Q-----LR-ESNKPFCILMPNYVASRQYFRNFL--------------------------------------------------------------------------------M------------K--EE---P-----------------------------------------------------ED-----V-V-YLIPT--LQ---YQY-----------------D-H-----------------------P-EG-TGKD---------------------------------------------------------------------------------KS---PFDSL-WFCGIG-R----------------------------------DRAKSA-VEF-----WKG----------------LG----RATFCP--KMAASL
      PHATRDRAFT_36383_Phaeodactylum_tricornutum_CCAP_1055/1_219120494          FP--F-VT------EADDHCESPLDAYHDIMPL---LKHL---------S----------GNETEK-FCIYDPY-YCDGGVTRNLNEL--------GF-P-NVYNRKE---------DCYAV---WS-D-VD-QCP-KF-DCLVTNPPYST---DHIER-----------------------------------LVKHV-TSS------T---F----TTGKPWFLLLPQWVHKKEFYQAAT---------------------------------------DALR-----------------------------------------------------------------------------------------------------------------------P-F-YLVPH--KR---YVY-----------------V-P-----------------------P-KD-FRES---------R--KSDVHKK--------------------------------------------------------------SS---PFVSM-WYVYGG-S----------------------------------AKQTEA---------IIR---T--YL--------QI----QNAPCD--LARSKS
      Naga_100023g2_Nannochloropsis_gaditana_585103216                          YP--F-PT------DYADHFESPLRAYEDLEPF---LQWLRRAL----------------RREKTS-LHIYDPY-FCRGAVVNLLKSL--------GF-P-RVTNKMR---------DFYAD---VA-A-GT-V-P-SY-DVLVTNPPYSD---DHKEK-----------------------------------ILRFC--------------L----GSDKPWCLLLPNYVANKSYYLDAI-----------------------------RPL-------------------------------------------------------------P-------Q--------------------------------------------------------DRQ-P-F-YLVPH--AK---YEY-----------------Q-H-----------------------P-EG-TGHI---------------------------------------------------------------------------------SS---PFFSI-WVCNPG-P-I--------------------------------------------------------PR----------------------------
      Esi_0085_0104_Ectocarpus_siliculosus_298715150                            HP--F-PT------EYGDHFETPLQAYRDIEVA---LALLAKLL----------------DKKRKH-LRIWDPY-YCAGRTPRLLGQL--------GF-P-KVEHSNQ---------DFYKV---VR-E-KR-Q-P-KH-DVLITNPPYSG---DHKKR-----------------------------------CLEYC--------------R----ASGKPWFLLVPNYVATKDYYRLAV---------------------------LGPAA-------------------------------------------------------------G-------P--------------------------------------------------------GGE-P-F-YVVPE--NK---YYF-----------------D-H-----------------------P-EG-TGHA---------------------------------------------------------------------------------DS---PFTGV-WYVHCG-S-H--------------------------------------------------------TE----------------------------
      Esi_0571_0002_Ectocarpus_siliculosus_298713748                            YP--F-EV------EECDHCETSERAYSDISPL---LSALAAEL----------------GKPPED-LVIYDPY-YCQGSTVGRLASL--------GF-P-RVHNRKE---------DFYEV---VK-N-GN-I-P-QH-DVVVTNPPFSG---EHMPK-----------------------------------ILKFC----------A---R----QGAKPWFLLLPNYVYLKDYYEPSL---------------------------------GRR---------------------------------------------------------S--GQ---G--------------------------------------------------------ATR-P-F-YLTPP--KR---YMY-----------------Y-S-----------------------P-QG-SRLK------------VKSSERK--------------------------------------------------------------TS---PFNTF-WYIHLG-D----------------------------------CAVTSK---------ILQSYDAASRK--------LD----INARCC--VARTTQ
      AURANDRAFT_65195_Aureococcus_anophagefferens_676390061                    YA--W-DT------DYGDHFETSEQAFRDVAPA---LRAL--------------------CGDGAG-AAILDPY-YCDGAAETRLRAL--------GF-R-NATNPAT---------DFYASR-AYR-EPGD-R-S-TF-DALVTNPPYSG---DHKER-----------------------------------CLAFA--------------L----ACGRPFALLLPAYVAEKKYFADAC-------------------------------A-------------------------------------------------------------E-------T--------------------------------------------------------GAA-P-F-FVSPA--RGRPPYEY-----------------A-H-----------------------P-HG-TGKA---------------------------------------------------------------------------------AA---PFASA-WVVDSG-R-G--------------------------------------------------------AA---A------------------------
      EMIHUDRAFT_250132_Emiliania_huxleyi_CCMP1516_551539647                    QPLAF-SA------AEDDHCETAPEAYAHIVSL---LRLVARKR----------------GVPPEE-LRIWDPY-YCNGAVARHLAAL--------GF-P-HVHNANE---------DFYAR---LD-S-GD-L-P-EH-DVLLTNPPYTH---PHPER-----------------------------------LLAHC----------A---A----SGT-PWLALMPNWVYTKDYYWAAL---------------------------------GRS---------------------------------------------------------H--GT---A--------------------------------------------------------DTQ-P-F-YIAPR--KR---YNY-----------------W-T-----------------------P-RG-RRSD------------LTSGGAKAKTHGHTNAALGIR------------------------------------------------TS---PFVSF-WY----------------------------------------CGGFGP---------ALR------KR--------VT----PPEGCV--LCWSTE
      EMIHUDRAFT_212966_Emiliania_huxleyi_CCMP1516_551557088                    WS--F-VT------EYNDHFETPRRAYADILPL---LAAASPL-----------------------------PP-KRDGGSAPEAEAL-AAVTAL-GVRRERVLNRNR---------DFYAD---IA-T-GQ-L-P-QY-DVLLTNPPYSG---DHKQRRGTA--------------TSPPTSPPDCNAPFPARLLRFL-ASDGD-------------MRGAPFLLLLPAW----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------A-TGNS---------------------------------------------------------------------------------AS---PFHAV-WFCGGW-A----------------------------------TEGARR------RA-MRA------LR-P------SR----RLREVE--------
      EMIHUDRAFT_119528_Emiliania_huxleyi_CCMP1516_551561218                    WS--F-VT------EYNDHFETPRRAYADILPL---LAAASPL-----------------------------PP-KRDGGSAPEAEAL-AA--AL-GVRRERVLNRNR---------DFYAD---IA-T-GQ-L-P-QY-DVLLTNPPYSG---DHKQRRGVALPCSLPRRSVPLLLPSPEPEARDCNAPFPARLLRFL-ASDGD-------------MRGAPFLLLLPAWVCEKDYWNAFL---------------------------------------ERLATHRAAGGGGGGGDGGGGGGGGGDGGGGGGGAG------------------D--SS---A-----------------------------AHSSGECCKRRRRRRVGEGGLER----AAG-V-F-------------YVR-------------------------------------------P-SA-TGNS---------------------------------------------------------------------------------AS---PFHAV-WFCGGW-A----------------------------------TEGARR------RA-MRA------LR-P------SR----RLREVE--------
      EMIHUDRAFT_440832_Emiliania_huxleyi_CCMP1516_551614698                    GG----AG------ASDD-WQTARRSWAAIAEV---L--------G---------------PAARE-KRIWMPF-YYDGACAEHLREL--------GF-T-RVHHKRE---------DFFVQ---VR-N-PKFL-K-KV-DLILDNPPYTS-P-EMKEA-----------------------------------VLRAL----------A--------STGKPFVMLLPISVLHVGFAREVL---------------------------------------------------------------------------------------------D--TD------------------------------------------------------------KLQ-----AVVPR--RV---YVR---------------------------------------------KT-GGE----------------------------------------------------------------------------------EV---PFKYLCWLCCGV-R-L--------------------------------KRDLIL-IDDDDDA-AAA---D--------------------------------
      EMIHUDRAFT_439419_Emiliania_huxleyi_CCMP1516_551626801                    GG----AG------ASDD-WQTARRSWAAIAEV---L--------G---------------PAARE-KRIWMPF-YYDGACAEHLREL--------GF-T-RVHHKRE---------DFFVQ---VR-N-PKFL-K-KV-DLILDNPPYTS-P-EMKEA-----------------------------------VLRAL----------A--------STGKPFVMLLPISVLHVGFAREVL---------------------------------------------------------------------------------------------D--TD------------------------------------------------------------KLQ-----AVVPR--RV---YVR---------------------------------------------KT-GGE----------------------------------------------------------------------------------EV---PFKYLCWLCCGV-R-L--------------------------------KRDLIL-IDDDDDA-AAA---D--------------------------------
      STCU_02709_Strigomonas_culicis_528246120                                  HP--F-RA------NFNDHFETSIEALRDVLPV---VRELQQLL-----R----------PSTPER-FTLYDPY-YCSGTVKELWAQL--------EV-P-NVVHENV---------DFYAA---VE-A-RT-V-P-PH-DMLVTNPPFSD---DHIER-----------------------------------FLRFV----------L-----LR-NAGRPWAMLAPDYVVTKPWYRELV-------------------------------------E-RHCTKASRIAKGVLTGAA------------------------------------R--PP---P-A--------TFALPPFIQAAAAPAAA------------AAAAPPPPAVVGV------E-P-F-YIVPR--AR---YDY-----------------R-H-----------------------PVEG-AARE---------------------------------------------------------------------------------HS---HFKSM-WYVWAG-R-H--------------------------------TPEVVR------GV-RVA-V----LH--------RA----A------VPDRAPQ
      Pmar_PMAR017622_Perkinsus_marinus_ATCC_50983_294899807                    FP--Y-PT------DSLDHAESPAKAYGHVAPI---LELMAKKL----------------GKTKED-LKIYDPY-YCNGAVVDNLKAL--------GF-N-NVYNKCE---------DFYS----VE-T------P-DF-DVLLTNPPYSG---EHPEK-----------------------------------LAEFT----------A--------KVGKPWLWLVPNWFYMKDYYKKLI---------------------------------------------------------------------------------------------E--QP---G--------------------------------------------------------QSG-M-F-FVAPK--KR---YVY-----------------Q-T-----------------------P-KH-LRAS--------------SDEAK--------------------------------------------------------------TS---PFPSF-WFINGC-D-V--------------------------------CTPVEM------------------QS-A------MA----DAEGVL--AVTDVH
      TRSC58_02209_Trypanosoma_rangeli_SC58_554942173                           HP--F-KA------EFNDHFETSMEALRDVAVV---VDQLRRLQ-----R----------PSAPEN-FVVYDPY-YCAGTVLHHWNAL--------GV-Q-RVIHENR---------DFYRD---IA-E-GK-V-PPDY-DMLVTNPPFSG---DHIER-----------------------------------LFDYL----------V--------TAKKPFAFLVPDYTATKDWYRAAV-------------------------------------R-RHFTAAPPSGKGDINAPR------------------------------------H--TR---P----KASAA-ALQPPPFLKMDPPADAETPLNDTINKINREKDKGGGEETVPI----GTE-P-F-YLVPR--SC---YDF-----------------R-H-----------------------P-KG-AGND---------------------------------------------------------------------------------HS---HFRSM-WFVWAG-R-H--------------------------------TTEVLR------GA-KVE-F----AR--------RH----REALTQ--RQTVLD
      DQ04_18661000_Trypanosoma_grayi_686647126                                 -----------------------MEALQDIMVV---VEQLRQLV-----R----------PSAPEN-FAVYDPY-YCAGGIVQQWKEL--------GV-Q-RVLHDNR---------DFYKD---VA-E-GT-V-PRDY-DMLVTNPPFSG---DHIER-----------------------------------MFDYL----------L--------ASKRPFAFLVPDYTATKEWYRSAV-------------------------------------R-RHFTPAPPTGKGDINAPR------------------------------------R--AR---P----LVPAAVLLQPPPFVKTDADAASG-----NNNNTNSGSGDCGDCDVVPI----GVE-P-F-YLVPR--VR---YDF-----------------K-H-----------------------P-KG-VGNE---------------------------------------------------------------------------------HS---HFRSM-WFVWAG-R-R--------------------------------TTEVLR------GA-KVE-F----SR--------LH----RETMTAAGRRAVPD
      Tc00.1047053506297.320_Trypanosoma_cruzi_strain_CL_Brener_71666516        HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLLLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVRYWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GD-D-PDDY-DMLVTNPPFSG---DHIER-----------------------------------LFNYL----------V--------ARKKPFAFLVPDYTATKDWYRTAV-------------------------------------R-RHFTPVPSSGKGDINASR------------------------------------H--TR---P----KLPAA-LLQPPPFLKMEQTASAEIALDDTKNKKQKEDNKCDNEETIPI----GTE-P-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGKD---------------------------------------------------------------------------------HS---HFRTM-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REDSTQ--RQVSPD
      Tc00.1047053508041.40_Trypanosoma_cruzi_strain_CL_Brener_71409237         HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLRLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVRYWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GD-D-PDDY-DMLVTNPPFSG---DHIER-----------------------------------LFNYL----------V--------TRKKPFAFLVPDYTATKDWYRTAV-------------------------------------R-RHFTPAPPSGKGDINAPR------------------------------------H--TR---P----KVPAA-LLQPPPFLKMEQTTSAGIALDDTNDKKQKEDNKCDGEETIPI----GTE-P-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGKD---------------------------------------------------------------------------------HS---HFRTM-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REDSTQ--RQVSPD
      TCSYLVIO_001620_Trypanosoma_cruzi_407859934                               HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLRLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVRYWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GD-D-PDDY-DKLVTNPPFSG---DHIER-----------------------------------LFNYL----------V--------TRKKPFAFLVPDYTATKDWYRTAV-------------------------------------R-RHFTPVPPSGKGDINASR------------------------------------H--TR---P----NVSAA-LLQPPPFLKMEQTAGAGMAFVDTNEKKQKEDNKCDSEETIPI----GTE-A-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGKD---------------------------------------------------------------------------------HS---HFRTM-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REHSTQ--RQASPD
      MOQ_000466_Trypanosoma_cruzi_marinkellei_407425172                        HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLPLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVQHWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GN-V-PDDY-DMLVTNPPFSG---DHIER-----------------------------------LFKFL----------V--------ARKKPFAFLVPDYTATKDWYRNAV-------------------------------------R-RQFTPAPPTGKGDINAPR------------------------------------H--TR---P----KVPAA-VLQPPPFLKLEQTPSAGIARDDSNDKKQKEDNKSDSEETLPI----GTE-P-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGND---------------------------------------------------------------------------------HS---HFRSI-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REKSTQ--RQVSPD
      AGDE_06588_Angomonas_deanei_528257849                                     HP--F-RA------NFNDHFETSLEALRDVLAA---VQEVRQQL-----R----------PSTPEK-FTLYDPY-YCSGTVVASWAQL--------DM-P-NVINENV---------DFYAT---MA-N-HT-I-P-VH-DMLVTNPPFSD---DHIPR-----------------------------------LMKFL----------A-----DG-NDGRPWAFLAPDYVATKPWYIQFV-------------------------------------N-EHYAKATRVAKGVLRGPA------------------------------------P--TA---P-R--------SFALPPYLAAGNTAAT---------------------KVLPV------E-P-F-YIIPK--QK---YDF-----------------H-H-----------------------PVEG-VGKE---------------------------------------------------------------------------------HS---HFKSM-WYVWAG-R-H--------------------------------TNDVVR------AS-RVE-L----LR--------RH----P-------TGAAPA
      GSEM1_T00001947001_Phytomonas_sp_isolate_EM1_588317381                    HN--F-KA------NFNDHFETTIEALRDLLPV---VQELRRLT-----R----------PSAPER-FVLYDPY-YCAGAIPGLWRDL--------GL-P-HTLHENR---------DFYAD---IA-R-DT-V-PGPY-DLLVTNPPFSD---DHLPR-----------------------------------LLEFL----------ARGRDETRGNRQRPWAFLAPDYIAAKPWYRAWV-------------------------------------R-DHFEAA---GGGNRNPDS------------------------------------D--GA---PGALKKAQIT-RFEAPPFLKASQADEVV-----DGVPQEGANGVCHTVGGSPVCTKLGPE-P-F-YIVPK--GR---YDF-----------------K-H-----------------------P-LN-AGHE---------------------------------------------------------------------------------HS---HFKSM-WYVWMG-S-R--------------------------------TSEIIR------AA-KIE-L----LK--------TS----TSGSSA-------T
      D341_RS0120100_Proteobacteria_bacterium_JGI_0000113-E04_655449388         FP--Y-DA------VERDHCESPRVAYQQIEPL---LRSYASSI----------------GKMAKE-LKIYDPY-YCNGAVKKHLRFL--------GY-Q-DVYNECE---------DFYNK---IE-T-DT-V-P-AF-DVMITNPPYSG---DHMEK-----------------------------------LLKFC----------AGYCA----KKKKPFFLLLPNYVYTKEYYSDVF---------------------------------------------------------------------------------------------T--EQ---P--------------------------------------------------------DRS-I-L-FN-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      ADD95864.1_uncultured_organism_MedDCM-OCT-S12-C54_291336302               AA--M-DA------ALAGKRKRKRKEGVPPREV---TGTVPEGS-----E--------S-EAGLES-LYVYDPY-FCQGGMVDALVEL--------GCARERVINLNR---------DFYQD---VA-D-GS-V-P-SH-DVLLTNPPYSA---DHKQK-----------------------------------LLDYL-------LG-E--------HQHRPGKGM----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      AIA83135.1_Podovirus_Lau218_643012036                                     ------KN------PASDEYYTPPEGIEPLVKF---L--------------------------NKE-LRYYEP---TAGKSQRIVKYL-TS--K--GF---NITGSKP-----DE--DFLK---------GD-F-S-DY-DAIITNPPFSN-KGDFIER---------------------------CY------------------------------EIGKPFALLMPVSTIQGQKRGKRF--------------------------------------------------------------------------------I------------E-------D--------------------------------------------------------GIE-----LLVLN--KR---VSF-----------------I-K-----------------------P-DN-TIAG---------------------------------------------------------------------------------SP---SFGVA-WFCHG----I--------------------------------LPEKLQ---------FHE------AR-K--------------------------
      LS64_RS01615_Helicobacter_sanguini_736557575                              QN----SY------ENSDERYTKPEAIFPLLKY---I--------------------------PKD-KVIWCPCDLESSYFVRIFRLN--------GY---KVIHSHI---NLGQ--DFYHF---EP-K--------EW-DILITNPPFSN-----KKE-----------------------------------FISRV----------L--------SFKKPFCLLLPLTYLNDSTPYHLF---------------------------------------------------------------------------------------------K--DI------------------------------------------------------------DLE-----LLIFD--KR---MEF----------I------NAD-------------------------SK-----------------------------------------------------N--------------------------------RI---SFKSG-YYAWKV-F----------------------------------NKQVVF---------EKL------ES--------LHFLESNKWNEI--IENMKY
      AW14_RS07635_Siansivirga_zeaxanthinifaciens_765310844                     PP--L-KK------GSPDDFQTPPEALTPLLPF---L--------------------------KKD-WTIWECA-SGKGNLSTYLKQQ--------GF---KVISTDI---LAGK--DFLSY---EP-K--------QY-DCIITNPPYAF-----KQE-----------------------------------FLERA----------Y--------CLGKPFAFLLPLTTFETAKRQQYF---------------------------------------------------------------------------------------------K--HC------------------------------------------------------------GLE-----VIFLD--KR---INF-----------------E-T-----------------------P-DG-SGD----------------------------------------------------------------------------------GS---WFATA-WFTNWL-K-I--------------------------------GKQMSF---------TSL------------------------------------
      HMPREF1087_RS05735_[Clostridium]_clostridioforme_740438568                YL----TS--NKK-Q--DDLFTPAYAVDPIIKY---L--------------------------SKD-KIIWCPWDCEWSAFYQRLKEE--------GF---KVVRSSL---EEGE--DFFEY---EP-D--------EW-DIVVSNPPFSI-----KDK-----------------------------------VLERL----------Y--------SFNKPFAILLPLNSLQGKTRYKYF------------------------------------------------------------------------------------------------KQ------------------------------------------------------------GIQ-----ILSFD--AR---VCY-----------------H-D-------------------------KNHMDSV-----------------------------------------------VK--------------------------------GS---PFATA-YFCRDL-L----------------------------------PKDLIV---------EKL------VT--------YE----RPLMTR--------
      LS74_RS07390_Helicobacter_magdeburgensis_736576773                        YL----NA--RHD-ESSDECMTPFYAVEPLLKY---I--------------------------PRN-KTVWCPFDKEWSAFVK-LLST--------RN---EVIHSHI---DDGK--DFFTY---KP-K--------HF-DIIISNPPFSC-----KDK-----------------------------------VLQRC----------Y--------ELNKPFAMLLPVSCIQGKKRVEMF---------------------------------------------------------------------------------------------M--KN------------------------------------------------------------GLQ-----ILAFD--LR---VDY-----------------H-T-------------------------RGNMQET-----------------------------------------------TK--------------------------------AT---YFGSA-FFCKDI-L----------------------------------PLSLMF---------APL------KK--------YE----QSLGEK--ASAKRE
      HMPREF1074_RS00110_Bacteroides_xylanisolvens_495301957                    ----------MFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---NTGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KIE-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--YRDVDN
      VK10_RS18230_Bacteroides_acidifaciens_765333434                           PS----KLSRMFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---NTGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KIE-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQF-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN
      LGA01S_RS01170_Lactococcus_garvieae_754856863                             ----L-DK--VAN-SGNDEFYTPEYAIKPLLKY---I--------------------------PKN-AKVWCPFDTSDSLFVKLLLEH--------GC---EVVNTHI---SRGE--DFF-E---LS-N-SE-I-A-DWCDYIISNPPYSR-----KTE-----------------------------------VLEEL----------F--------MTEKPFAMLLGCVGLFESQKRFEM--------------------------------------------------------------------------------F------------R--DN------------------------------------------------------------SFE-----IMMFN--RR---ISY--FQS------------Y-E-EQK-------------------P-SK--------------------------------------------------------------------------------------NP---PFSS--WYLCKG-I-L--------------------------------NKPFVF---------EEV------VK----------------------------
      YS40_029_Thermus_phage_phiYS40_118197649                                  LV----EH--VKK-EKDDEFHTPRFAVEPLLKY---I--------------------------PKD-KVIWCPFDTEESNYVKVFMEN--------GY---KVVYSHI---SMGQ--DFFFY---EP-E--------NY-DVIVSNPPFSV-----KTE-----------------------------------ILKRA----------Y--------SLGKPFAFLLPITSLEGKKRGELF---------------------------------------------------------------------------------------------R--KY------------------------------------------------------------GLQ-----LIVFD--RR---IEF------------------------------------------------MSTT-----------------------------------------------KT--------------------------------GV---WFNTS-YFCYKL-L----------------------------------PRDLIF---------EQL------EV----------------------------
      TMA_029_Thermus_phage_TMA_343960410                                       LV----EH--VKK-ERDDEFHTPRFAVEPLLKY---I--------------------------PKD-KVIWCPFDTEESNYVKVFMEN--------GY---KVVYSHI---SMGQ--DFFFY---EP-E--------NY-DVIVSNPPFSV-----KTE-----------------------------------ILKRA----------Y--------SLGKPFAFLLPITSLEGKKRGELF---------------------------------------------------------------------------------------------R--KY------------------------------------------------------------GLQ-----LIVFD--RR---IEF------------------------------------------------MSTT-----------------------------------------------KT--------------------------------GV---WFNTS-YFCYKL-L----------------------------------PRDLIF---------EQL------KV----------------------------
      HMPREF1033_RS05355_Tannerella_sp_6_1_58FAA_CT1_496675013                  YN----NW---HI-RANDERYTPRYTVLPIIKY---L--------------------------PQK-AVIWCPFDTENSEFVLTLKEN--------GF---KVTHSHI---VNGD--DFYTY---EP-E--------YW-DIIVSNPPFSN-----KRQ-----------------------------------IFERC----------L--------SFGKPFALIMSNLALNDSFPCRLF---------------------------------------------------------------------------------------------K--DK------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------N-------------------------LL-----------------------------------------------------E--------------------------------RI---PFASS-YFCHKL-L----------------------------------PKQIIF---------ENL------DV--------VK----GQMSRM--YKDMED
      HMPREF1069_06304_Bacteroides_ovatus_CL02T12C04_392661135                  ----------MFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KMD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN
      HMPREF1074_RS00030_Bacteroides_xylanisolvens_495301979                    ----------MFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KKD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSHM--HRDVDN
      _Bacteroides_ovatus_769142550                                             PS----KLSRMFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KMD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN
      NI20_RS0109935_Oscillibacter_sp_ER4_696600425                             ------KA----R-KASDLYPTPPEVTVALMRF---L------------------------KLPAG-TDIWEPA-RGQGDMVRALADC--------GM---AVYGTDI---RDGI--DFLTT---RQ-P-GN-A-P-AA-DWIITNPPFSL-----ADE-----------------------------------FIRHA----------A--------EIGKPFAMLLKAQYWHAAKRAQLF---------------------------------------------------------------------------------------------R----------------------------------------------------------------EIP-P-S-YVLPLTWRP-D-FLF-------------------K-------------------------ER-DGKK------------------------------------------------G--------------------------------AS---PLMDV-MWCVWL------------------------------------TPQMQG-V-------QTV-F----KP-----L--MR----PEKEK---------
      P694_RS05110_Entomoplasma_luminosum_737023823                             MR----NI--LGLQKSNNEFYTPEEPIIDLLDNFLNI--------------------------PKS-KIIWCPFDTEDSEFVKQLKHR--------GY---KIISSHI---ENGK--DFYEY---EPNE--------EW-DMILSNPPFSG-----KRI-----------------------------------LIERC----------E--------SFKKPFCLLYG-----ATIFSQSM----------------------------------------------------------------------------------------GN---T--LN------------------------------------------------------------RCE-----FIFIQ--RN---IKF------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      _Bacteroides_ovatus_490425600                                             --------------------------MAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KKD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQF-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN
      X558_RS03965_Mycoplasma_pirum_738499893                                   LG--L-QK------RENNEFYTPKETVENIVNL---V-----------------------IKKLKN-KVVWCPFDTQDSNFVKVLKEK--------NI---SVINTHI-N-IKNG--DFYK-------N-KT-I-PKKW-DLILSNPPFSK-----KRE-----------------------------------LIERC----------L--------SFNKDFCLL----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Q453_RS00595_Mycoplasma_hyorhinis_504101400                               IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---WDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YIHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------IK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VN--------YK----KDGKEF--F-----
      SY27_07790_Flavobacterium_sp_316_759871867                                KI----DY--IKR-GAFDELYTPKEAIECILPY---I--------------------------PDTVKIIWECTAIENSEIVTVLKAN--------DF---EVIKSHI---KDGL--DFFEY---EP-P--------QY-DLIITNPPYSL-----KDQ-----------------------------------FLKRA----------F--------ELDKPFMMLLPITTLEGKKRSEMF---------------------------------------------------------------------------------------------Q--QH------------------------------------------------------------KVQ-----VLIPS--KR---FNF-----------------I-K-------------------------EK-----------------------------------------------------K--------------------------------GS---WFQTS-WFTWKLNL----------------------------------KSDLIF---------MNV------------------------------------
      MOS_RS00605_Mycoplasma_hyorhinis_504896920                                IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---WDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------IK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VN--------YK----KDGKEF--F-----
      RUMFLAFD1_RS0120915_Ruminococcus_flavefaciens_497673944                   YL----TA--NRT-SAGDEVYTPFYAIEPLLEF---L--------------------------PKD-KKIWCPFDEEWSAFYQFLSEK--------GY---EVERSSL---KEGQ--DFFRY---EP-E--------QW-DILVSNPPFSK-----KND-----------------------------------VLKRA----------F--------SFQKPFALLLPVNSIQGKARYKIF------------------------------------------------------------------------------------------------NN------------------------------------------------------------EIQ-----MLSFD--GR---VDY-----------------H-T-------------------------RQNMECT-----------------------------------------------TK--------------------------------GN---HFGSA-YFCRDL-L----------------------------------PSKLEL---------RQL------VK--------YD----RPLVTP--TIGGDE
      F801_RS0102175_Mycoplasma_hyorhinis_518948704                             IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---WDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------IK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VD--------YK----KDGKEF--F-----
      M081_5001_Bacteroides_fragilis_str_3998_T(B)_4_596015177                  YS----KY--LHS-NKSDEKYTPQYAVLPIIKY---L--------------------------PRK-AVIWCPFDTENSEFVLALKEA--------GY---RVVYSHI---FTGQ--DFFEY---EP-K--------RW-DIIVSNPPFSN-----KAR-----------------------------------IFERC----------L--------AFRKPFALLMSNFWLNDSAPCRLF---------------------------------------------------------------------------------------------K--ER------------------------------------------------------------ELQ-----LLLFD--KR---VEY-------------------N-------------------------DL-----------------------------------------------------S--------------------------------RV---PFGSS-YFCHKV-L----------------------------------PKQIVF---------ENL------TK--------IK----GEKSRM--WADVEK
      MHR_0113_Mycoplasma_hyorhinis_HUB-1_304309105                             IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---LDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------TK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VD--------YK----KDGKEF--F-----
      H740_RS04285_Campylobacter_showae_489041364                               HR------------SKSDFYQTPYAITRRLLEV--------EKF-----S-----------------GRILEPA-CGAGAITAILKEA--------GY-E-DVTAYDL-L-LDGK--DFLA-------E-TR-----KF-DVIITNPPFSL-----AKE-------------------------------F---ILKAC-------------------EIAPRFAFLLPLNYLHGKERLDEI--------------------------------------------------------------------------------Y---------------SR---E--------------------------------------------------------ILE-K-V-YVFAR-------YPL------------L-S--A-Q-------------IR--------P-DG----------------------------KY--------------------------------------------------------ET-G-MMVYA-WYIFDT-K-H--------K-----------------------GAPTIH---------WID------NS-E-D-VV-RK-GK---------------
      _Mycoplasma_hyorhinis_752716488                                           MT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---LDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      M125_5712_Bacteroides_fragilis_str_3998T(B)3_596000932                    YS----KY--LHS-NKSDEKYTPQYAVLPIIKY---L--------------------------PRK-AVIWCPFDTENREFVLALKEA--------GY---RVVYSHI---FTGQ--DFFEY---EP-K--------RW-DIIVSNPPFSN-----KAR-----------------------------------IFERC----------L--------AFRKPFALLMSNFWLNDSAPCRLF---------------------------------------------------------------------------------------------K--ER------------------------------------------------------------ELQ-----LLLFD--KR---VEY-------------------N-------------------------DL-----------------------------------------------------S--------------------------------RV---PFGSS-YFCHKV-L----------------------------------PKQIVF---------ENL------TK--------IK----GEKSRM--WADVEK
      M081_RS20695_Bacteroides_fragilis_695479665                               YS----KY--LHS-NKSDEKYTPQYAVLPIIKY---L--------------------------PRK-AVIWCPFDTENSEFVLALKEA--------GY---RVVYSHI---FTGQ--DFFEY---EP-K--------RW-DIIVSNPPFSN-----KAR-----------------------------------IFERC----------L--------AFRKPFALLMSNFWLNDSAPCRLF---------------------------------------------------------------------------------------------K--ER------------------------------------------------------------ELQ-----LLLFD--KR---VEY-------------------N-------------------------DL-----------------------------------------------------S--------------------------------RV---PFGSS-YFCHKV-L----------------------------------PKQIVF---------ENL------TK--------IK----GEKSRM--WADVEK
      consensus/100%                                                            .....................................................................................................................Dh.....................s...ssPPh..................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
      consensus/95%                                                             .................s...p.......h......l................................hh.s...........h...........sh...ph..............Dhh....................DhhlsNPPap............................................hh..........................sahhl....................................................................................................................................................................................................................................................................................................................................................................................................................................................
      consensus/90%                                                             .................Dc..os..sh..l......l................................la.Ph....u.....h...........Gh...pl...p..........DFa........p...........DhlloNPPao........c...................................hh..h.......................Pahhlhs....p..............................................................................................................................................................................h.............b...........................................................................................................................................h....h...............................................................................................
      consensus/85%                                                             ................sDc.bTs.bsh..l..h...l................................lasPa..p.u.h...h...........Gh...pV..ppb.........DFa........p........pa.DhlloNPPao........c...................................hhp.h......................bPahhLhs...hp.......h......................................................................................................................................................................hh..........h.a...........................................................................................................................................a....ahh.............................................................................................
      consensus/80%                                                             ..............p.sDchbTs.bsh.sl..h...l...........................p..b.lasPa..p.u.h...h...........Gh...pV.pppb.........DFa........p........pa.DhlloNPPaS......bpc...................................hhpbh......................+PahhLhs...hp.......h......................................................................................................................................................................hl..........h.a...........................................................................................................................................F.s..ahh.............................................................................................
      consensus/75%                                                             ..............p.sDchbTs.buh.slh.h...l...........................pp.b.lasPa..p.u.h...h.p.........Ga...pV.pppb.........DFa........p........pa.DhlloNPPaSs.....bpc...................................hhpbh......................+PahhLhP..hhpp.bb..hh......................................................................................................................................................................hl..p.......h.a......................................................................................................................................s....F.s..ahh.............................................................................................
      consensus/70%                                                             .s............p.sDchbTs.buh.slhsh...l...........................pp.b.IasPa.hhsu.h.p.h.ph........Ga...pVhpppb.........DFa........p........pa.DllloNPPaSs.....bp+...................................lhpbh...................p..+PahlLhP.bhhpp.bb.phh................................................................................................................................................................p.....hl..p..p....h.a.............................................p........................................................................................s...sF.oh.aah.............................................................................................
      
      Back to Contents
    • General notes, phyletic distribution and domain architectures of the Group2/clade 3/CHLREDRAFT_205675 -like N6-MTases

      General notes:

      A somewhat mobile group with members in a wide range of eukaryotes. Almost always in two copies in the eukaryotes. Both are usually active.These are likely to be RNA methylases given that some of them are fused to the KH and CCCH domains. In Aureococcus the domain is fused to an N-terminal DUF501 and a C-terminal SAP domain. DUF501 in turn is found in association with the pseudouridine synthase domain (e.g. Thalassiosira oceanica, gi: 397631724) and is also likely to be associated in context with RNA. Like other members of this group, this clade has the characteristic TP motif in the helix before strand-1 and also lacks a well defined strand-3.
      # 1; Eukaryotic homologs
      GI               Domain-arch            Pfam arch                     Gene name                              Len   Taxonomy                                      Species                                        Genbank                                                              
      # 1;
      514687493         N6-MTase                -                           PTSG_07527                             290   eukaryota>choanoflagellida                    Salpingoeca rosetta                            hypothetical protein PTSG_07527 [Salpingoeca rosetta].
      514693110         N6-MTase                -                           PTSG_04809                             713   eukaryota>choanoflagellida                    Salpingoeca rosetta                            hypothetical protein PTSG_04809 [Salpingoeca rosetta].
      167525218         N6-MTase                -                           MONBRDRAFT_26429                       428   eukaryota>choanoflagellida                    Monosiga brevicollis MX1                       hypothetical protein [Monosiga brevicollis MX1].
      167523896         N6-MTase                FG-GAP_2                    MONBRDRAFT_25858                       1215  eukaryota>choanoflagellida                    Monosiga brevicollis MX1                       hypothetical protein [Monosiga brevicollis MX1].
      470514790         N6-MTase                -                           ACA1_068850                            456   eukaryota>amoebozoa>acanthamoebidae           Acanthamoeba castellanii str. Neff             hypothetical protein ACA1_068850 [Acanthamoeba castellanii str. Neff].
      470455333         N6-MTase                -                           ACA1_037140                            442   eukaryota>amoebozoa>acanthamoebidae           Acanthamoeba castellanii str. Neff             hypothetical protein ACA1_037140 [Acanthamoeba castellanii str. Neff].
      302843234         N6-MTase                -                           VOLCADRAFT_105839                      730   eukaryota>viridiplantae>chlorophyta           Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_105839 [Volvox carteri f. nagariensis].
      302830991         N6-MTase                -                           VOLCADRAFT_120421                      198   eukaryota>viridiplantae>chlorophyta           Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_120421 [Volvox carteri f. nagariensis].
      693498069         N6-MTase                -                           OT_ostta10g00600                       336   eukaryota>viridiplantae>chlorophyta           Ostreococcus tauri                             DNA methylase, N-6 adenine-specific, conserved site [Ostreococcus tauri].
      693497110         N6-MTase                -                           OT_ostta14g01460                       234   eukaryota>viridiplantae>chlorophyta           Ostreococcus tauri                             unnamed product [Ostreococcus tauri].
      308811312         N6-MTase                -                           Ot14g01730                             236   eukaryota>viridiplantae>chlorophyta           Ostreococcus tauri                             unnamed protein product [Ostreococcus tauri].
      308810727         N6-MTase                -                           Ot13g02010                             525   eukaryota>viridiplantae>chlorophyta           Ostreococcus tauri                             unnamed protein product [Ostreococcus tauri].
      308808370         N6-MTase                -                           Ot10g00600                             259   eukaryota>viridiplantae>chlorophyta           Ostreococcus tauri                             unnamed protein product [Ostreococcus tauri].
      145356641         N6-MTase                -                           OSTLU_29451                            166   eukaryota>viridiplantae>chlorophyta           Ostreococcus lucimarinus CCE9901               predicted protein [Ostreococcus lucimarinus CCE9901].
      145353330         N6-MTase                -                           OSTLU_27164                            533   eukaryota>viridiplantae>chlorophyta           Ostreococcus lucimarinus CCE9901               predicted protein [Ostreococcus lucimarinus CCE9901].
      145351428         N6-MTase                -                           OSTLU_26028                            333   eukaryota>viridiplantae>chlorophyta           Ostreococcus lucimarinus CCE9901               predicted protein [Ostreococcus lucimarinus CCE9901].
      761971839         N6-MTase                -                           MNEG_5609                              613   eukaryota>viridiplantae>chlorophyta           Monoraphidium neglectum                        hypothetical protein MNEG_5609 [Monoraphidium neglectum].
      255088714         N6-MTase                -                           MICPUN_64290                           293   eukaryota>viridiplantae>chlorophyta           Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      255086063         N6-MTase                -                           MICPUN_103519                          422   eukaryota>viridiplantae>chlorophyta           Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      255079482         N6-MTase+CCCH           zf-CCCH                     MICPUN_59514                           296   eukaryota>viridiplantae>chlorophyta           Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      255078302         N6-MTase                -                           MICPUN_59025                           641   eukaryota>viridiplantae>chlorophyta           Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      303288509         N6-MTase                -                           MICPUCDRAFT_53312                      283   eukaryota>viridiplantae>chlorophyta           Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      303284947         N6-MTase                -                           MICPUCDRAFT_51291                      413   eukaryota>viridiplantae>chlorophyta           Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      303284171         N6-MTase                -                           MICPUCDRAFT_62794                      178   eukaryota>viridiplantae>chlorophyta           Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      303283096         N6-MTase                -                           MICPUCDRAFT_60475                      709   eukaryota>viridiplantae>chlorophyta           Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      303278244         N6-MTase+CCCH           zf-CCCH                     MICPUCDRAFT_57902         CCCH         366   eukaryota>viridiplantae>chlorophyta           Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      159485216         N6-MTase                -                           CHLREDRAFT_205675                      410   eukaryota>viridiplantae>chlorophyta           Chlamydomonas reinhardtii                      predicted protein [Chlamydomonas reinhardtii].
      612399992         N6-MTase                -                           Bathy01g03440                          243   eukaryota>viridiplantae>chlorophyta           Bathycoccus prasinos                           predicted protein [Bathycoccus prasinos].
      612393160         N6-MTase                -                           Bathy07g04630                          585   eukaryota>viridiplantae>chlorophyta           Bathycoccus prasinos                           predicted protein [Bathycoccus prasinos].
      612389594         N6-MTase                MTS                         Bathy11g00290                          307   eukaryota>viridiplantae>chlorophyta           Bathycoccus prasinos                           predicted protein [Bathycoccus prasinos].
      551675275         N6-MTase                -                           GUITHDRAFT_46531                       166   eukaryota>cryptophyta                         Guillardia theta CCMP2712                      hypothetical protein GUITHDRAFT_46531, partial [Guillardia theta CCMP2712].
      551658644         N6-MTase+CCCH           zf-CCCH                     GUITHDRAFT_109156         CCCH         292   eukaryota>cryptophyta                         Guillardia theta CCMP2712                      hypothetical protein GUITHDRAFT_109156 [Guillardia theta CCMP2712].
      551638519         N6-MTase                -                           GUITHDRAFT_90353                       235   eukaryota>cryptophyta                         Guillardia theta CCMP2712                      hypothetical protein GUITHDRAFT_90353 [Guillardia theta CCMP2712].
      551648195         N6-MTase                -                           GUITHDRAFT_113893                      411   eukaryota>cryptophyta                         Guillardia theta CCMP2712                      hypothetical protein GUITHDRAFT_113893 [Guillardia theta CCMP2712].
      551646515         N6-MTase                -                           GUITHDRAFT_165084                      426   eukaryota>cryptophyta                         Guillardia theta CCMP2712                      hypothetical protein GUITHDRAFT_165084 [Guillardia theta CCMP2712].
      224015927         N6-MTase                -                           THAPSDRAFT_bd1109                      244   eukaryota>stramenopiles                       Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      224009558         N6-MTase                -                           THAPSDRAFT_9806                        337   eukaryota>stramenopiles                       Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      224005064         N6-MTase                -                           THAPS_23466                            489   eukaryota>stramenopiles                       Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      224003919         N6-MTase                -                           THAPSDRAFT_6523                        230   eukaryota>stramenopiles                       Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      223996249         N6-MTase                Methyltransf_26             THAPSDRAFT_21256                       257   eukaryota>stramenopiles                       Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      397625596         N6-MTase                Methyltransf_26             THAOC_11048                            320   eukaryota>stramenopiles                       Thalassiosira oceanica                         hypothetical protein THAOC_11048, partial [Thalassiosira oceanica].
      397605573         N6-MTase                -                           THAOC_20767                            347   eukaryota>stramenopiles                       Thalassiosira oceanica                         hypothetical protein THAOC_20767, partial [Thalassiosira oceanica].
      219130185         N6-MTase                -                           PHATRDRAFT_41248                       320   eukaryota>stramenopiles                       Phaeodactylum tricornutum CCAP 1055/1          predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      219120494         N6-MTase                -                           PHATRDRAFT_36383                       217   eukaryota>stramenopiles                       Phaeodactylum tricornutum CCAP 1055/1          predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      585103216         N6-MTase                Methyltransf_26             Naga_100023g2                          371   eukaryota>stramenopiles                       Nannochloropsis gaditana                       DNA methylase, N-6 adenine-specific, conserved site [Nannochloropsis gaditana].
      298715150         N6-MTase                -                           Esi_0085_0104                          334   eukaryota>stramenopiles                       Ectocarpus siliculosus                         conserved unknown protein [Ectocarpus siliculosus].
      298713748         N6-MTase                -                           Esi_0571_0002                          544   eukaryota>stramenopiles                       Ectocarpus siliculosus                         conserved unknown protein [Ectocarpus siliculosus].
      676390061         DUF501+N6-MTase+SAP     DUF501                      AURANDRAFT_65195                       3593  eukaryota>stramenopiles                       Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_65195 [Aureococcus anophagefferens].
      551539647         N6-MTase                -                           EMIHUDRAFT_250132                      371   eukaryota>haptophyceae                        Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_250132 [Emiliania huxleyi CCMP1516].
      551557088         KH+N6-MTase             KH_3                        EMIHUDRAFT_212966       KH             570   eukaryota>haptophyceae                        Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_212966 [Emiliania huxleyi CCMP1516].
      551561218         KH+N6-MTase             KH_3                        EMIHUDRAFT_119528       KH             450   eukaryota>haptophyceae                        Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_119528 [Emiliania huxleyi CCMP1516].
      551614698         N6-MTase                -                           EMIHUDRAFT_440832                      205   eukaryota>haptophyceae                        Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_440832 [Emiliania huxleyi CCMP1516].
      551626801         N6-MTase                -                           EMIHUDRAFT_439419                      205   eukaryota>haptophyceae                        Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_439419 [Emiliania huxleyi CCMP1516].
      528246120         N6-MTase                -                           STCU_02709                             364   eukaryota>euglenozoa>kinetoplastida           Strigomonas culicis                            hypothetical protein STCU_02709 [Strigomonas culicis].
      294899807         N6-MTase                -                           Pmar_PMAR017622                        296   eukaryota>alveolata                           Perkinsus marinus ATCC 50983                   hypothetical protein Pmar_PMAR017622 [Perkinsus marinus ATCC 50983].
      554942173         N6-MTase                -                           TRSC58_02209                           341   eukaryota>euglenozoa>kinetoplastida           Trypanosoma rangeli SC58                       hypothetical protein TRSC58_02209 [Trypanosoma rangeli SC58].
      686647126         N6-MTase                -                           DQ04_18661000                          284   eukaryota>euglenozoa>kinetoplastida           Trypanosoma grayi                              hypothetical protein DQ04_18661000 [Trypanosoma grayi].
      71666516          N6-MTase                -                           Tc00.1047053506297.320                 340   eukaryota>euglenozoa>kinetoplastida           Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71409237          N6-MTase                -                           Tc00.1047053508041.40                  340   eukaryota>euglenozoa>kinetoplastida           Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      407859934         N6-MTase                -                           TCSYLVIO_001620                        340   eukaryota>euglenozoa>kinetoplastida           Trypanosoma cruzi                              hypothetical protein TCSYLVIO_001620 [Trypanosoma cruzi].
      407425172         N6-MTase                -                           MOQ_000466                             340   eukaryota>euglenozoa>kinetoplastida           Trypanosoma cruzi marinkellei                  hypothetical protein MOQ_000466 [Trypanosoma cruzi marinkellei].
      528257849         N6-MTase                -                           AGDE_06588                             301   eukaryota>euglenozoa>kinetoplastida           Angomonas deanei                               hypothetical protein AGDE_06588 [Angomonas deanei].
      588317381         N6-MTase                -                           GSEM1_T00001947001                     369   eukaryota>euglenozoa>kinetoplastida           Phytomonas sp. isolate EM1                     unnamed protein product [Phytomonas sp. isolate EM1].
      Smin1000013068    N6-MTase                -                           Smin1000013068                         252  eukaryota>alveolata>dinophyceae                Symbiodinium minutum Mf 1.05b.01               6
      
      GI           Operon                                                                                               Dom-arch    Pfam arch               Gene name         Len  Taxonomy                                          Species                                         Genbank                                                              
      # 68; Type IV secretion system associated                                                                                                                                                                                                               
      765333434    <-N6-MTase*<-DUF3872-Ig<-TraO<-TraN                                                                  N6-MTase    -                       -                 208  bacteria>bacteroidetes                            Bacteroides acidifaciens                        tRNA (adenine-N6)-methyltransferase [Bacteroides acidifaciens].                  765333433_?-><-765333434_N6-MTase*<-765333437_DUF3872-Ig<-765333435_TraO<-765333436_TraN
      769142550    TraO->DUF3872-Ig->N6-MTase*-><-?||?->?->?-><-N6-MTase                                                N6-MTase    -                       -                 207  bacteria>bacteroidetes                            Bacteroides ovatus                              tRNA (adenine-N6)-methyltransferase [Bacteroides ovatus].                        490452835_TraO->490452836_DUF3872-Ig->769142550_N6-MTase*-><-490452838_?||769142554_?->490452840_?->490452841_?-><-490452845_N6-MTase<-490452848_?<-490452850_?
      736576773    N6-MTase*->                                                                                          N6-MTase    MTS                     -                 202  bacteria>proteobacteria>epsilonproteobacteria     Helicobacter magdeburgensis                     hypothetical protein [Helicobacter magdeburgensis].                              736576728_?->736576730_?->736576773_N6-MTase*->736576732_?->736576735_?->736576737_?->736576740_?->736576743_?->736576746_?->736576749_?->
      499516379    TraK->?->N6-MTase->TraM->?->N6-MTase*->TraN->?->?->?->VirD4_TraG->                                   N6-MTase    -                       -                 201  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      53714095_?->53714096_?->53714097_TraK->53714098_?->53714099_N6-MTase->53714100_TraM->53714101_?->499516379_N6-MTase*->53714103_TraN->53714104_?->53714105_?->53714106_?->53714107_VirD4_TraG-><-53714108_?<-53714109_?
      260623613    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-cpN6-MTase<-TraM<-?<-TraK                                    N6-MTase    -                       BACFIN_05770      199  bacteria>bacteroidetes                            Bacteroides finegoldii DSM 17565                hypothetical protein BACFIN_05770 [Bacteroides finegoldii DSM 17565].            <-260623606_?<-260623607_?<-260623608_VirD4_TraG<-260623609_?<-260623610_?<-260623611_?<-260623612_TraN<-260623613_N6-MTase*<-260623614_cpN6-MTase<-260623615_TraM<-260623616_?<-260623617_TraK<-260623618_?<-260623619_?||260623620_?->
      490439159    TraK->?->N6-MTase->TraM->?->N6-MTase*->TraN->?->?->?->VirD4_TraG->                                   N6-MTase    SP                      -                 199  bacteria>bacteroidetes                            Bacteroidales                                   MULTISPECIES: tRNA (adenine-N6)-methyltransferase [Bacteroidales].               490439165_?->490439164_?->490439163_TraK->490439162_?->499516378_N6-MTase->490439160_TraM->490443600_?->490439159_N6-MTase*->490439158_TraN->490439157_?->496044155_?->490439155_?->655320168_VirD4_TraG-><-490439153_?<-490439152_?
      490451191    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK                                   N6-MTase    -                       -                 199  bacteria>bacteroidetes                            Bacteroidales                                   MULTISPECIES: tRNA (adenine-N6)-methyltransferase [Bacteroidales].               490439152_?->490439153_?-><-490439154_VirD4_TraG<-490439155_?<-490439156_?<-695491981_?<-490439158_TraN<-490451191_N6-MTase*<-490451190_?<-490439160_TraM<-490439161_N6-MTase<-490439162_?<-490439163_TraK<-490439164_?<-695491989_?
      494415009    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK                                   N6-MTase    -                       -                 199  bacteria>bacteroidetes                            Bacteroidales                                   MULTISPECIES: tRNA (adenine-N6)-methyltransferase [Bacteroidales].               490439152_?->494414996_?-><-494414999_VirD4_TraG<-494415001_?<-494415003_?<-494415005_?<-494415007_TraN<-494415009_N6-MTase*<-490451190_?<-490439160_TraM<-496051721_N6-MTase<-496051722_?<-490439163_TraK<-494415019_?<-490439165_?
      496051720    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK                                   N6-MTase    -                       -                 199  bacteria>bacteroidetes                            Bacteroides sp. 2_2_4                           tRNA (adenine-N6)-methyltransferase [Bacteroides sp. 2_2_4].                     490439152_?->494414996_?-><-496051719_VirD4_TraG<-494415001_?<-494415003_?<-494415005_?<-494415007_TraN<-496051720_N6-MTase*<-490451190_?<-490439160_TraM<-496051721_N6-MTase<-496051722_?<-490439163_TraK<-494415019_?<-490439165_?
      496308428    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK                                   N6-MTase    -                       -                 199  bacteria>bacteroidetes                            Parabacteroides sp. D13                         tRNA (adenine-N6)-methyltransferase [Parabacteroides sp. D13].                   490439152_?->496308427_?-><-490439154_VirD4_TraG<-490439155_?<-496044155_?<-494415005_?<-494415007_TraN<-496308428_N6-MTase*<-496308429_?<-490439160_TraM<-496308430_N6-MTase<-496044157_?<-490439163_TraK<-490439164_?<-496308431_?
      495916348    TraK->?->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->?-><-?<-?<-?<-ParB                              N6-MTase    -                       -                 195  bacteria>bacteroidetes                            Bacteroides sp. 1_1_30                          tRNA (adenine-N6)-methyltransferase [Bacteroides sp. 1_1_30].                    495916341_TraK->695334858_?->495916343_?->495916344_TraM->495916345_TraN->495916346_TraO->696264670_DUF3872-Ig->495916348_N6-MTase*->696264709_?-><-490455318_?<-495916364_?<-495916370_?<-495916373_ParB<-495916380_?<-495916381_?
      496422695    TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->?->?-><-?||?-><-?<-?<-ParB                         N6-MTase    MTS                     -                 195  bacteria>bacteroidetes                            Bacteroides oleiciplenus                        hypothetical protein [Bacteroides oleiciplenus].                                 496422688_?->496422689_TraK->496422690_?->763277032_TraM->496422692_TraN->763277085_TraO->496422694_DUF3872-Ig->496422695_N6-MTase*->763277086_?->496422697_?-><-496422699_?||496422700_?-><-496422701_?<-496422702_?<-496422703_ParB
      695334862    TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->?-><-?<-?<-?<-?<-ParB                              N6-MTase    -                       -                 195  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      695334857_?->695334929_TraK->695334858_?->695334859_TraM->495916345_TraN->695334860_TraO->695334861_DUF3872-Ig->695334862_N6-MTase*->695334930_?-><-695334863_?<-490455318_?<-695334864_?<-695334865_?<-695341265_ParB<-695334867_?
      696261854    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-cpN6-MTase<-TraM<-?<-TraK                                    N6-MTase    SP                      -                 191  bacteria>bacteroidetes                            Bacteroides finegoldii                          tRNA (adenine-N6)-methyltransferase [Bacteroides finegoldii].                    <-495024507_?<-495024509_?<-495024511_VirD4_TraG<-495024513_?<-495024516_?<-495024519_?<-495024523_TraN<-696261854_N6-MTase*<-495024528_cpN6-MTase<-495024530_TraM<-495024532_?<-495024534_TraK<-495024536_?<-495024547_?||696261855_?->
      492357380    TraK->?->TraM->TraN->TraO->cpN6-MTase->DUF3872-Ig->N6-MTase*->                                       N6-MTase    -                       -                 190  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      492357371_TraK->492357373_?->492357374_TraM->492357375_TraN->492357376_TraO->492357377_cpN6-MTase->492357378_DUF3872-Ig->492357380_N6-MTase*->492357382_?->695340121_?-><-492357385_?<-492357387_?<-492357388_?<-695340204_?<-492357390_?
      495301957    TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->                                                   N6-MTase    -                       -                 190  bacteria>bacteroidetes                            Bacteroides xylanisolvens                       tRNA (adenine-N6)-methyltransferase [Bacteroides xylanisolvens].                 495299922_?->490425606_TraK->495299921_?->696232227_TraM->696232252_TraN->495301955_TraO->495301956_DUF3872-Ig->495301957_N6-MTase*-><-495301958_?||696232363_?->495301960_?->696232364_?-><-495301962_?
      495301979    TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->                                                   N6-MTase    -                       -                 190  bacteria>bacteroidetes                            Bacteroides xylanisolvens                       tRNA (adenine-N6)-methyltransferase [Bacteroides xylanisolvens].                 495299922_?->490425606_TraK->495299921_?->696232227_TraM->696232252_TraN->495301975_TraO->495301977_DUF3872-Ig->495301979_N6-MTase*-><-495301980_?<-495301962_?
      695345663    TraK->?->TraM->TraN->TraO->cpN6-MTase->DUF3872-Ig->N6-MTase*->                                       N6-MTase    -                       -                 190  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      695345658_TraK->695345659_?->695345660_TraM->492357375_TraN->695345661_TraO->492357377_cpN6-MTase->695345662_DUF3872-Ig->695345663_N6-MTase*->695345664_?->695340121_?-><-492357385_?<-695345665_?<-695345666_?<-695345685_?<-695345667_?
      265525156    N6-MTase*->                                                                                          N6-MTase    -                       PCMG_00077        189  viruses>dsdna viruses, no rna stage>caudovirales  Prochlorococcus phage P-SSM2                    conserved hypothetical protein [Prochlorococcus phage P-SSM2].                   265525149_?->265525150_?->265525151_?->265525152_?->265525153_?->265525154_?->265525155_?->265525156_N6-MTase*->265525157_?->265525158_?->265525159_?->265525160_?->265525161_?->265525162_?->265525163_?->
      291544920    N6-MTase*->VirB6_TrbL->?->VirB4_TraE->                                                               N6-MTase    EcoRI_methylase         RUM_19970         189  bacteria>firmicutes                               Ruminococcus champanellensis 18P13 = JCM 17042  hypothetical protein RUM_19970 [Ruminococcus champanellensis 18P13 = JCM 17042]. 291544913_?->291544914_?->291544915_?->291544916_?->291544917_?->291544918_?->291544919_?->291544920_N6-MTase*->291544921_VirB6_TrbL->291544922_?->291544923_VirB4_TraE-><-291544924_?<-291544925_?||291544926_?->291544927_?->
      392661135    TraO->DUF3872-Ig->N6-MTase*-><-?||?->?->?-><-?<-?<-N6-MTase                                          N6-MTase    -                       HMPREF1069_06304  189  bacteria>bacteroidetes                            Bacteroides ovatus CL02T12C04                   hypothetical protein HMPREF1069_06304 [Bacteroides ovatus CL02T12C04].           392661133_TraO->392661134_DUF3872-Ig->392661135_N6-MTase*-><-392661136_?||392661137_?->392661138_?->392661139_?-><-392661140_?<-392661141_?<-392661142_N6-MTase
      585220856    <-VirB4_TraE<-?<-VirB6_TrbL<-N6-MTase*<-?<-VirD4_TraG<-HNH<-VirD4-TraG                               N6-MTase    EcoRI_methylase         RF007C_04375      189  bacteria>firmicutes                               Ruminococcus flavefaciens 007c                  tRNA (adenine-N6)-methyltransferase [Ruminococcus flavefaciens 007c].            <-585220860_VirB4_TraE<-585220861_?<-585220862_VirB6_TrbL<-585220856_N6-MTase*<-585220863_?<-585220864_VirD4_TraG<-585220865_HNH<-585220866_VirD4-TraG<-585220867_?<-585220868_?<-585220869_?
      815703720    <-VirB4_TraE<-?<-VirB6_TrbL<-N6-MTase*<-?<-VirD4-TraG                                                N6-MTase    EcoRI_methylase         -                 189  bacteria>firmicutes                               Ruminococcus sp. UNK.MGS-30                     tRNA (adenine-N6)-methyltransferase [Ruminococcus sp. UNK.MGS-30].               <-815703705_?<-815703707_?<-815703709_?<-815703712_?<-815703713_VirB4_TraE<-815703715_?<-815703718_VirB6_TrbL<-815703720_N6-MTase*<-815704307_?<-815704309_VirD4-TraG<-547318573_?<-547318574_?<-815703723_?<-815703725_?||815703727_?->
      492311699    <-N6-MTase*<-DUF3872-Ig<-TraO<-TraN<-TraM<-?<-TraK                                                   N6-MTase    MTS                     -                 188  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      492311683_?->492311684_?->695340692_?->492311687_?-><-492311689_?||492311691_?->492311694_?-><-492311699_N6-MTase*<-492311701_DUF3872-Ig<-492311703_TraO<-492311706_TraN<-492311710_TraM<-695340771_?<-492311717_TraK<-492311720_?
      492375163    TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->                                                   N6-MTase    MTS                     -                 188  bacteria>bacteroidetes                            Bacteroides uniformis                           tRNA (adenine-N6)-methyltransferase [Bacteroides uniformis].                     492375149_?->492375151_TraK->736509373_?->492375155_TraM->492375157_TraN->492375159_TraO->492375161_DUF3872-Ig->492375163_N6-MTase*-><-492375167_?<-492375168_?||492311689_?-><-492375179_?<-492375181_?<-492375183_?<-492375185_?
      763415988    TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->                                                            N6-MTase    -                       -                 188  bacteria>bacteroidetes                            Candidatus Bacteroides timonensis               tRNA (adenine-N6)-methyltransferase [Candidatus Bacteroides timonensis].         763415981_TraM->763415983_TraN->763415984_TraO->763415985_DUF3872-Ig->763415988_N6-MTase*-><-763415990_?
      752678067    VirD4-TraG->?->N6-MTase*->VirB6_TrbL->?->VirB4_TraE->                                                N6-MTase    EcoRI_methylase         -                 185  bacteria>firmicutes                               Ruminococcus champanellensis                    tRNA (adenine-N6)-methyltransferase [Ruminococcus champanellensis].              505371826_?->505371827_?->505371828_?->752677690_?->505371830_?->752678065_VirD4-TraG->752678066_?->752678067_N6-MTase*->505371834_VirB6_TrbL->505371835_?->505371836_VirB4_TraE-><-505371837_?<-505371838_?||505371839_?->752677691_?->
      546873189    TraK-><-?||TraM->TraN->DUF3872-Ig->N6-MTase*->                                                       N6-MTase    MTS                     -                 184  bacteria>actinobacteria                           Eggerthella sp. CAG:1427                        hypothetical protein [Eggerthella sp. CAG:1427].                                 <-546873154_?||546873159_?->546873163_TraK-><-546873168_?||546873173_TraM->546873178_TraN->546873184_DUF3872-Ig->546873189_N6-MTase*-><-546873191_?
      596000932    TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH->                             N6-MTase    CoA_binding_2           M125_5712         184  bacteria>bacteroidetes                            Bacteroides fragilis str. 3998T(B)3             hypothetical protein M125_5712 [Bacteroides fragilis str. 3998T(B)3].            596000934_TraK->596000933_?->596000943_TraM->596000946_TraN->596000945_TraO->596000941_DNA-primase->596000940_DUF3872-Ig->596000932_N6-MTase*->596000937_?->596000949_AbiH->596000950_?-><-596000948_?<-596000931_?<-596000942_?||596000944_?->
      596015177    TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH->                             N6-MTase    Methyltransf_26         M081_5001         184  bacteria>bacteroidetes                            Bacteroides fragilis str. 3998 T(B) 4           hypothetical protein M081_5001 [Bacteroides fragilis str. 3998 T(B) 4].          596015170_TraK->596015171_?->596015172_TraM->596015173_TraN->596015174_TraO->596015175_DNA-primase->596015176_DUF3872-Ig->596015177_N6-MTase*->596015178_?->596015179_AbiH->596015180_?->596015181_?-><-596015182_?<-596015183_?<-596015184_?
      695479665    TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH->                             N6-MTase    Methyltransf_26         -                 183  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      695479568_TraK->695479571_?->695479574_TraM->695479576_TraN->695479579_TraO->695542931_DNA-primase->695542948_DUF3872-Ig->695479665_N6-MTase*->695479586_?->695479589_AbiH-><-695479596_?<-695479599_?||695479602_?->695542935_?->
      757750547    TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH->                             N6-MTase    CoA_binding_2           -                 183  bacteria>bacteroidetes                            Bacteroides fragilis                            tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis].                      695479568_TraK->695479571_?->757750544_TraM->695479576_TraN->695479579_TraO->757750538_DNA-primase->757750546_DUF3872-Ig->757750547_N6-MTase*->695479586_?->757750540_AbiH->695479593_?-><-695479596_?<-695479599_?||695479602_?->757750542_?->
      739436373    <-VirB4_TraE<-?<-VirB6_TrbL<-N6-MTase*<-?<-VirD4_TraG<-HNH<-VirD4-TraG                               N6-MTase    EcoRI_methylase         -                 182  bacteria>firmicutes                               Ruminococcus flavefaciens                       tRNA (adenine-N6)-methyltransferase [Ruminococcus flavefaciens].                 <-739436256_VirB4_TraE<-739436259_?<-739436261_VirB6_TrbL<-739436373_N6-MTase*<-739436374_?<-739436375_VirD4_TraG<-739436376_HNH<-739436377_VirD4-TraG<-739436265_?<-739436267_?<-739436269_?
      156112085    N6-MTase->?->?-><-?<-N6-MTase*<-DUF3872-Ig<-TraO<-TraN<-TraM<-?<-TraK                                N6-MTase    -                       BACOVA_00455      175  bacteria>bacteroidetes                            Bacteroides ovatus ATCC 8483                    hypothetical protein BACOVA_00455 [Bacteroides ovatus ATCC 8483].                156112078_?->156112079_?->156112080_?->156112081_N6-MTase->156112082_?->156112083_?-><-156112084_?<-156112085_N6-MTase*<-156112086_DUF3872-Ig<-156112087_TraO<-156112088_TraN<-156112089_TraM<-156112090_?<-156112091_TraK<-156112092_?
      695393939    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK                                   N6-MTase    -                       -                 159  bacteria>bacteroidetes                            Bacteroides sp. 2_1_16                          tRNA (adenine-N6)-methyltransferase, partial [Bacteroides sp. 2_1_16].           496044153_?->490439153_?-><-490439154_VirD4_TraG<-496044154_?<-496044155_?<-490439157_?<-490439158_TraN<-695393939_N6-MTase*<-490443600_?<-490439160_TraM<-490439161_N6-MTase<-496044157_?<-490439163_TraK<-490439164_?<-490439165_?
      263256225    <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK                                   N6-MTase    -                       HMPREF0101_01185  150  bacteria>bacteroidetes                            Bacteroides sp. 2_1_16                          hypothetical protein HMPREF0101_01185 [Bacteroides sp. 2_1_16].                  263256218_?->263256219_?-><-263256220_VirD4_TraG<-263256221_?<-263256222_?<-263256223_?<-263256224_TraN<-263256225_N6-MTase*<-263256226_?<-263256227_TraM<-263256228_N6-MTase<-263256229_?<-263256230_TraK<-263256231_?<-263256232_?
      355531994    <-N6-MTase*<-DUF3872-Ig<-TraO<-TraN<-TraM<-?<-TraK                                                   N6-MTase    -                       HMPREF9441_00695  119  bacteria>bacteroidetes                            Paraprevotella clara YIT 11840                  hypothetical protein HMPREF9441_00695 [Paraprevotella clara YIT 11840].          355531987_?->355531988_?->355531989_?->355531990_?->355531991_?->355531992_?->355531993_?-><-355531994_N6-MTase*<-355531995_DUF3872-Ig<-355531996_TraO<-355531997_TraN<-355531998_TraM<-355531999_?<-355532000_TraK<-355532001_?
      511045000    N6-MTase*->?-><-Relaxase_TraI                                                                        N6-MTase    EcoRI_methylase         -                 179  bacteria>firmicutes                               Lachnospiraceae bacterium COE1                  hypothetical protein [Lachnospiraceae bacterium COE1].                           737642640_?->511044992_?->665909753_?->511044994_?->511044995_?->511044996_?->511044997_?->511045000_N6-MTase*->511045001_?-><-511045002_Relaxase_TraI<-490160044_?||550995956_?->511045003_?->511045004_?->511045005_?->
      
      # 68;DCM associated 
      497673944    <-VirB4_TraE<-PrgI<-VirB6_TrbL<-N6-MTase*||DUF3786->DCM+N6-MTase->HNH->?-><-Resolvase<-Relaxase_TraI N6-MTase    EcoRI_methylase         -                 189  bacteria>firmicutes                               Ruminococcus flavefaciens                       hypothetical protein [Ruminococcus flavefaciens].                                <-497673934_?<-497673935_?<-497673936_?<-497673938_?<-497673939_VirB4_TraE<-497673941_PrgI<-497673943_VirB6_TrbL<-497673944_N6-MTase*||497673945_DUF3786->497673946_DCM+N6-MTase->497673947_HNH->497673949_?-><-497673951_Resolvase<-657802676_Relaxase_TraI<-497673954_?
      738971076    N6-MTase*->CMP-hydrolase->DCM->DCM->HNH->cpN6-MTase->                                                N6-MTase    -                       -                 180  bacteria>bacteroidetes                            Prevotella amnii                                sugar-phospahte nucleotidyltransferase [Prevotella amnii].                       738971062_?->738971265_?->738971267_?->738971064_?->738971068_?->738971071_?->738971073_?->738971076_N6-MTase*->738971079_CMP-hydrolase->738971268_DCM->738971081_DCM->738971272_HNH->738971083_cpN6-MTase->738971085_?->738971274_?->
      739000163    <-cpN6-MTase<-N6-MTase<-DCM<-?<-?<-CMP-hydrolase<-N6-MTase*                                          N6-MTase    -                       -                 180  bacteria>bacteroidetes                            Prevotella disiens                              sugar-phospahte nucleotidyltransferase [Prevotella disiens].                     <-739000148_?<-739000151_cpN6-MTase<-739000155_N6-MTase<-739000157_DCM<-739000158_?<-739000160_?<-739000162_CMP-hydrolase<-739000163_N6-MTase*<-739000242_?<-739000165_?<-739000167_?<-739000169_?<-739000172_?<-739000174_?<-739000243_?
      696600425    DCM->?->N6-MTase*->                                                                                  N6-MTase    EcoRI_methylase         -                 186  bacteria>firmicutes                               Oscillibacter sp. ER4                           hypothetical protein [Oscillibacter sp. ER4].                                    <-696600348_?||696600350_?->696600352_?->696600354_?->696600356_?->696600358_DCM->696600360_?->696600425_N6-MTase*->696600362_?->696600364_?->696600367_?->696600370_?->696600427_?->696600430_?->696600372_?->
      
      # 68; 
      61805950     N6-MTase*->                                                                                          N6-MTase    -                       PSSM2_078         197  viruses>dsdna viruses, no rna stage>caudovirales  Prochlorococcus phage P-SSM2                    hypothetical protein PSSM2_078 [Prochlorococcus phage P-SSM2].                   61805944_?->61805945_?->312281380_?->312281387_?->61805947_?->61805948_?->61805949_?->61805950_N6-MTase*->61805951_?->61805952_?->61805953_?->61805954_?->61805955_?->61805956_?->61805957_?->
      736557575    SSB->?->?->?->?->?->N6-MTase*->N6-MTase->                                                            N6-MTase    EcoRI_methylase         -                 189  bacteria>proteobacteria>epsilonproteobacteria     Helicobacter sanguini                           hypothetical protein, partial [Helicobacter sanguini].                           <-736557556_?||736557574_SSB->736557560_?->736557563_?->736557565_?->736557566_?->736557567_?->736557575_N6-MTase*->736557568_N6-MTase->736557571_?->
      480695228    <-N6-MTase*                                                                                          N6-MTase    -                       HMPREF1097_02603  187  bacteria>firmicutes                               [Clostridium] bolteae 90B8                      hypothetical protein HMPREF1097_02603 [[Clostridium] bolteae 90B8].              <-480695224_?<-480695225_?<-480695226_?<-480695227_?<-480695228_N6-MTase*<-480695229_?<-480695230_?<-480695231_?<-480695232_?<-480695233_?||480695234_?-><-480695235_?
      496675013    <-N6-MTase*                                                                                          N6-MTase    MTS                     -                 186  bacteria>bacteroidetes                            Tannerella sp. 6_1_58FAA_CT1                    tRNA (adenine-N6)-methyltransferase [Tannerella sp. 6_1_58FAA_CT1].              <-496675006_?<-496675007_?||748669647_?-><-496675008_?||748669840_?->496675010_?->748669842_?-><-496675013_N6-MTase*<-496675014_?<-748669844_?<-496675016_?<-748669648_?<-748669649_?<-496675020_?||496675021_?->
      736576648    N6-MTase*->                                                                                          N6-MTase    -                       -                 186  bacteria>proteobacteria>epsilonproteobacteria     Helicobacter magdeburgensis                     hypothetical protein, partial [Helicobacter magdeburgensis].                     736576613_?->736576615_?->736576616_?->736576618_?->736576619_?->736576622_?->736576624_?->736576648_N6-MTase*->736576625_?->736576628_?->736576631_?->736576633_?->736576635_?->736576638_?->736576641_?->
      655453737    N6-MTase*->                                                                                          N6-MTase    -                       -                 185  bacteria>proteobacteria                           Proteobacteria bacterium JGI 0000113-L05        tRNA (adenine-N6)-methyltransferase [Proteobacteria bacterium JGI 0000113-L05].  655453730_?->655453731_?->655453732_?->655453733_?->655453734_?->655453735_?->655453736_?->655453737_N6-MTase*->655453738_?->655453739_?->655453740_?->655453741_?->655453742_?-><-655453743_?<-655453744_?
      740463513    <-N6-MTase*                                                                                          N6-MTase    -                       -                 182  bacteria>firmicutes                               [Clostridium] bolteae                           tRNA (adenine-N6)-methyltransferase [[Clostridium] bolteae].                     <-488633370_?<-488635648_?<-740463510_?<-488635650_?<-740463513_N6-MTase*<-488635653_?<-488628681_?<-740463515_?||488635656_?-><-488635657_?<-740463417_?<-488635660_?
      753855623    N6-MTase*->                                                                                          N6-MTase    -                       -                 182  bacteria>spirochaetes                             Treponema primitia                              hypothetical protein [Treponema primitia].                                       <-505823760_?<-505823761_?||505823762_?->505823763_?->505823764_?->505823765_?->753855622_?->753855623_N6-MTase*->505823768_?->505823769_?->505823770_?->505823771_?->505823772_?->505823773_?->505823774_?->
      740438568    <-N6-MTase*                                                                                          N6-MTase    EcoRI_methylase         -                 181  bacteria>firmicutes                               [Clostridium] clostridioforme                   tRNA (adenine-N6)-methyltransferase [[Clostridium] clostridioforme].             <-488659667_?<-488659666_?<-488659665_?<-488659664_?<-488659663_?<-488659662_?<-488659661_?<-740438568_N6-MTase*<-488659660_?<-488659659_?<-740438566_?<-488659657_?<-488659656_?<-488659655_?<-488659654_?
      763125949    <-REase+N6-MTase<-?<-?<-?<-?<-N6-MTase*<-?<-cpN6-MTase                                               N6-MTase    EcoRI_methylase         -                 179  bacteria>firmicutes                               Lactobacillus salivarius                        sugar-phospahte nucleotidyltransferase [Lactobacillus salivarius].               <-763125942_?<-763125943_?<-763125944_REase+N6-MTase<-763125945_?<-763125946_?<-763125947_?<-763125948_?<-763125949_N6-MTase*<-763125950_?<-763125951_cpN6-MTase<-763125952_?<-763125953_?<-763125954_?<-763125955_?<-763125956_?
      333739213    N6-MTase*->                                                                                          N6-MTase    -                       TREPR_0896        178  bacteria>spirochaetes                             Treponema primitia ZAS-2                        sugar-phospahte nucleotidyltransferase [Treponema primitia ZAS-2].               <-333738183_?<-333739473_?||333740528_?->333738456_?->333738592_?->333739981_?->333738228_?->333739213_N6-MTase*->333738037_?->333740303_?->333739983_?->333738732_?->333741471_?->333740250_?->333739655_?->
      489159998    <-N6-MTase*                                                                                          N6-MTase    CoA_binding_2           -                 176  bacteria>firmicutes                               Streptococcus intermedius                       hypothetical protein [Streptococcus intermedius].                                <-489159987_?<-739738008_?<-739738010_?<-489159991_?<-489159993_?<-489159995_?<-489159997_?<-489159998_N6-MTase*<-489160000_?<-489160003_?<-489160004_?<-489160006_?<-489160010_?||489160012_?->489160014_?->
      754856863    <-Terminase_LS<-?<-HNH<-?<-?<-N6-MTase*                                                              N6-MTase    -                       -                 176  bacteria>firmicutes                               Lactococcus garvieae                            sugar-phospahte nucleotidyltransferase [Lactococcus garvieae].                   <-754856851_?<-754856852_?<-754856854_Terminase_LS<-754856857_?<-754856859_HNH<-754857000_?<-754856861_?<-754856863_N6-MTase*<-754856865_?<-754857002_?<-754856867_?<-754857005_?<-754856869_?<-754857007_?<-754856871_?
      501311522    <-N6-MTase*<-SSB<-?<-?<-RecT                                                                         N6-MTase    -                       -                 174  bacteria>firmicutes                               Clostridium botulinum                           sugar-phospahte nucleotidyltransferase [Clostridium botulinum].                  <-501302841_?<-501306070_?<-501311386_?<-501311817_?<-501313272_?<-501302860_?<-501311840_?<-501311522_N6-MTase*<-501312032_SSB<-501312919_?<-501302806_?<-501306054_RecT<-501310854_?<-501312320_?<-501311584_?
      643012036    N6-MTase*->                                                                                          N6-MTase    -                       -                 174  viruses>dsdna viruses, no rna stage>caudovirales  Podovirus Lau218                                putative phage protein [Podovirus Lau218].                                       643012029_?->643012030_?->643012031_?->643012032_?->643012033_?->643012034_?->643012035_?->643012036_N6-MTase*->643012037_?->643012038_?->643012039_?->643012040_?->643012041_?->643012042_?->643012043_?->
      739429399    REase->?->?->?->?->?->N6-MTase*->N6-MTase->cpN6-MTase->                                             N6-MTase    EcoRI_methylase         -                 174  bacteria>firmicutes                               Ruminococcus albus                              hypothetical protein [Ruminococcus albus].                                       739429387_?->739429389_REasew->739429391_?->739430531_?->739429393_?->739429395_?->739429397_?->739429399_N6-MTase*->739429401_N6-MTase->739429402_cpN6-MTase->739429404_?->739429406_?->739429408_?->739429410_?->739429412_?->
      294983942    N6-MTase*->                                                                                          N6-MTase    -                       ZPR_4103          171  bacteria>bacteroidetes                            Zunongwangia profunda SM-A87                    N-6 adenine-specific DNA methylase [Zunongwangia profunda SM-A87].               <-294983935_?<-294983936_?<-294983937_?||294983938_?->294983939_?->294983940_?->294983941_?->294983942_N6-MTase*-><-294983943_?||294983944_?->294983945_?->294983946_?->294983947_?->294983948_?-><-294983949_?
      118197649    <-REase<-?||?->N6-MTase*->                                                                           N6-MTase    EcoRI_methylase         YS40_029          169  viruses>dsdna viruses, no rna stage>caudovirales  Thermus phage phiYS40                           sugar-phospahte nucleotidyltransferase [Thermus phage phiYS40].                  <-118197642_?<-118197643_?<-118197644_?<-118197645_?<-118197646_REase<-118197647_?||118197648_?->118197649_N6-MTase*-><-118197650_?<-118197651_?<-118197652_?||118197653_?-><-118197654_?<-118197655_?<-118197656_?
      343960410    <-REase<-?||?->N6-MTase*->                                                                           N6-MTase    EcoRI_methylase         TMA_029           169  viruses>dsdna viruses, no rna stage>caudovirales  Thermus phage TMA                               sugar-phospahte nucleotidyltransferase [Thermus phage TMA].                      <-343960403_?<-343960404_?<-343960405_?<-343960406_?<-343960407_REase<-343960408_?||343960409_?->343960410_N6-MTase*-><-343960411_?<-343960412_?<-343960413_?||343960414_?-><-343960415_?<-343960416_?<-343960417_?
      652129793    N6-MTase*->                                                                                          N6-MTase    MTS                     -                 168  bacteria>bacteroidetes                            Flavobacterium soli                             hypothetical protein [Flavobacterium soli].                                      652128964_?->652129035_?->652129206_?->652129327_?->652129406_?->652129588_?->652129708_?->652129793_N6-MTase*-><-652129930_?<-652130157_?<-652130255_?<-652130455_?<-652130548_?<-652130719_?<-652130816_?
      800940615    N6-MTase*-><-?<-?||?-><-N6-MTase<-HNH                                                                N6-MTase    Methyltransf_26         -                 168  bacteria>bacteroidetes                            Flavobacterium sp. 316                          hypothetical protein [Flavobacterium sp. 316].                                   800940601_?->800940603_?->800940605_?-><-800940607_?||800940609_?->800940611_?->800940613_?->800940615_N6-MTase*-><-800940617_?<-800940619_?||800940621_?-><-800940623_N6-MTase<-800940625_HNH<-800940627_?<-800940629_?
      425715747    N6-MTase*->                                                                                          N6-MTase    SP+Dam                  HMPREF9282_00530  165  bacteria>firmicutes                               Veillonella ratti ACS-216-V-Col6b               hypothetical protein HMPREF9282_00530 [Veillonella ratti ACS-216-V-Col6b].       425715740_?->425715741_?->425715742_?->425715743_?->425715744_?->425715745_?->425715746_?->425715747_N6-MTase*->425715748_?->425715749_?->425715750_?->425715751_?->425715752_?->425715753_?->425715754_?->
      753824499    N6-MTase*->                                                                                          N6-MTase    -                       -                 165  bacteria>bacteroidetes                            Zunongwangia profunda                           tRNA (adenine-N6)-methyltransferase, partial [Zunongwangia profunda].            <-502838502_?<-502838503_?<-502838504_?||502838505_?->502838506_?->502838507_?->502838508_?->753824499_N6-MTase*->502838511_?->753823556_?->502838513_?->502838514_?->502838515_?-><-753823558_?||502838519_?->
      765310844    SNF2->?->?->?->?->?->N6-MTase*->Methylase_S->N6-MTase->Methylase_S->                                 N6-MTase    Methyltransf_23         -                 165  bacteria>bacteroidetes                            Siansivirga zeaxanthinifaciens                  hypothetical protein [Siansivirga zeaxanthinifaciens].                           765310837_?->765310838_SNF2->765310839_?->765310840_?->765310841_?->765310842_?->765310843_?->765310844_N6-MTase*->765310845_Methylase_S->765310846_N6-MTase->765312112_Methylase_S->765310847_?->765310848_?->765310849_?->765310850_?->
      # 68; R-M system
      737023823    <-ParB+HNH<-N6-MTase*                                                                                N6-MTase    -                       -                 136  bacteria>tenericutes                              Entomoplasma luminosum                          hypothetical protein, partial [Entomoplasma luminosum].                          <-647290174_ParB+HNH<-737023823_N6-MTase*
      738315783    N6-MTase*->ParB+HNH->                                                                                N6-MTase    -                       -                 120  bacteria>tenericutes                              Mesoplasma seiffertii                           hypothetical protein, partial [Mesoplasma seiffertii].                           652736106_?->652736107_?->652736108_?->652736109_?-><-652736110_?||652736111_?->652736112_?->738315783_N6-MTase*->738315769_ParB+HNH->652736114_?->738315772_?->652736117_?->652736118_?->652736120_?->652736122_?->
      738499893    N6-MTase*->ParB+HNH->cpN6-MTase->                                                                    N6-MTase    EcoRI_methylase         -                 108  bacteria>tenericutes                              Mycoplasma pirum                                hypothetical protein, partial [Mycoplasma pirum].                                <-738499704_?||652846545_?->652846547_?->652846548_?-><-738499706_?<-652846551_?<-738499707_?||738499893_N6-MTase*->652846554_ParB+HNH->738499896_cpN6-MTase->652846556_?-><-652846558_?||738499708_?->652846561_?->738499709_?->
      # 3;                                                                                                                                                                                                                
      497336777    <-N6-MTase*<-SSB<-?<-RecT                                                                            N6-MTase    MTS                     -                 189  bacteria>proteobacteria>epsilonproteobacteria     Campylobacter sp. FOBRC14                       hypothetical protein [Campylobacter sp. FOBRC14].                                497336866_?->497336786_?->497336887_?-><-497336838_?<-736970004_?<-497336817_?<-497336860_?<-497336777_N6-MTase*<-497336833_SSB<-497336829_?<-497336785_RecT<-497336846_?<-497336835_?<-497336831_?<-497336775_?
      500772938    RecT->?->SSB->N6-MTase*->                                                                            N6-MTase    MTS                     -                 189  bacteria>proteobacteria>epsilonproteobacteria     Campylobacter curvus                            hypothetical protein [Campylobacter curvus].                                     <-500772932_?||500772933_?->754105770_?->754105772_?->500772935_RecT->754105775_?->500772937_SSB->500772938_N6-MTase*->500772939_?->500772940_?->754105777_?->754105779_?->500772942_?-><-754105874_?<-500772945_?
      489041364    <-N6-MTase*<-?<-RecT                                                                                 N6-MTase    MTS                     -                 188  bacteria>proteobacteria>epsilonproteobacteria     Campylobacter showae                            hypothetical protein [Campylobacter showae].                                     <-489041352_?<-489041353_?<-489041355_?<-489041357_?<-489041359_?<-489041360_?<-489041362_?<-489041364_N6-MTase*<-489041368_?<-489041371_RecT<-489041373_?<-489041374_?<-489041377_?<-489041378_?<-489041380_?
      # 2;                                                                                                                                                                                                                
      431004013    N6-MTase*->N6-MTase-><-PLDc                                                                          N6-MTase    RelB+EcoRI_methylase    A15U_04136        415  bacteria>proteobacteria>gammaproteobacteria       Escherichia coli KTE210                         RelB/DinJ family addiction module antitoxin [Escherichia coli KTE210].           <-431004006_?<-431004007_?||431004008_?->431004009_?->431004010_?->431004011_?->431004012_?->431004013_N6-MTase*->431004014_N6-MTase-><-431004015_PLDc||431004016_?->431004017_?-><-431004018_?<-431004019_?<-431004020_?
      692950787    N6-MTase*->N6-MTase-><-PLDc                                                                          N6-MTase    EcoRI_methylase         -                 344  bacteria>proteobacteria>gammaproteobacteria       Escherichia coli                                restriction endonuclease subunit M [Escherichia coli].                           <-585312902_?<-505582380_?<-446688926_?<-486190256_?<-486190259_?<-486190260_?<-486190261_?||692950787_N6-MTase*->486190280_N6-MTase-><-692950788_PLDc<-692946043_?||486190285_?->692950789_?-><-585312903_?<-486190286_?
      # 2;                                                                                                                                                                                                                
      323436523    <-ParB<-?<-ASCH<-?<-?<-ASCH||HTH-><-N6-MTase*<-?<-cpN6-MTase<-?||HTH->                               N6-MTase    SP                      Weevi_0265        278  bacteria>bacteroidetes                            Weeksella virosa DSM 16922                      ParB-like nuclease [Weeksella virosa DSM 16922].                                 <-323436516_ParB<-323436517_?<-323436518_ASCH<-323436519_?<-323436520_?<-323436521_ASCH||323436522_HTH-><-323436523_N6-MTase*<-323436524_?<-323436525_cpN6-MTase<-323436526_?||323436527_HTH-><-323436528_?<-323436529_?<-323436530_?
      754544258    <-ParB<-?<-ASCH<-?<-?<-ASCH||HTH-><-N6-MTase*<-?<-cpN6-MTase<-?||HTH->                               N6-MTase    -                       -                 248  bacteria>bacteroidetes                            Weeksella virosa                                chromosome partitioning protein ParB [Weeksella virosa].                         <-754544257_ParB<-503362712_?<-503362713_ASCH<-503362714_?<-503362715_?<-503362716_ASCH||503362717_HTH-><-754544258_N6-MTase*<-503362719_?<-503362720_cpN6-MTase<-503362721_?||503362722_HTH-><-503362723_?<-754544068_?<-503362725_?
      # 2;                                                                                                                                                                                                                
      748595473    <-ParB<-?<-?<-?<-?<-?<-REase<-N6-MTase*                                                              N6-MTase    SP                      -                 224  bacteria>proteobacteria>alphaproteobacteria       Ochrobactrum intermedium                        hypothetical protein [Ochrobactrum intermedium].                                 <-748595461_ParB<-748595463_?<-748595465_?<-748595502_?<-748595466_?<-748595469_?<-493515914_REase<-748595473_N6-MTase*<-748595475_?<-748595477_?<-748595479_?||493515916_?->748595504_?->748595480_?->493515918_?->
      763458170    N6-MTase*->REase->?->?->?->?->?->ParB->                                                              N6-MTase    SP                      -                 224  bacteria>proteobacteria>alphaproteobacteria       Brucella abortus                                hypothetical protein [Brucella abortus].                                         <-763458159_?<-763458161_?<-748595504_?||763458163_?->763458165_?->763458167_?->748595475_?->763458170_N6-MTase*->763458172_REase->763458174_?->763458176_?->763458212_?->763458178_?->748595463_?->748595461_ParB->
      # 2;                                                                                                                                                                                                                
      523845400    <-N6-MTase*<-?<-RecT                                                                                 N6-MTase    -                       M638_00220        193  bacteria>firmicutes                               Listeria monocytogenes                          sugar-phosphate nucleotidyltransferase [Listeria monocytogenes].                 <-523847903_?<-523847904_?<-523847905_?<-523847906_?<-523847907_?<-523847908_?<-523845399_?<-523845400_N6-MTase*<-523847909_?<-523845401_RecT<-523845402_?<-523845403_?<-523847910_?<-523847911_?<-523847912_?
      752526171    N6-MTase*->                                                                                          N6-MTase    -                       -                 176  bacteria>firmicutes                               Listeria monocytogenes                          sugar-phosphate nucleotidyltransferase [Listeria monocytogenes].                 <-502716505_?<-644855401_?||770723372_?-><-489827227_?||497615352_?->506520144_?->559010092_?->752526171_N6-MTase*->770723442_?->558988748_?->770723481_?->
      # 2;                                                                                                                                                                                                                
      488893482    <-N6-MTase*                                                                                          N6-MTase    EcoRI_methylase         -                 180  bacteria>proteobacteria>epsilonproteobacteria     Campylobacter                                   MULTISPECIES: Sugar-phospahte nucleotidyltransferase [Campylobacter].            <-488893482_N6-MTase*<-488932284_?
      488923876    <-N6-MTase*                                                                                          N6-MTase    EcoRI_methylase         -                 180  bacteria>proteobacteria>epsilonproteobacteria     Campylobacter coli                              Sugar-phospahte nucleotidyltransferase [Campylobacter coli].                     <-488923876_N6-MTase*<-488932284_?
      # 1;                                                                                                                                                                                                                
      655449388    <-N6-MTase*                                                                                          N6-MTase    DUF4238                 -                 238  bacteria>proteobacteria                           Proteobacteria bacterium JGI 0000113-E04        hypothetical protein [Proteobacteria bacterium JGI 0000113-E04].                 <-655449381_?||655449382_?-><-655449383_?||655449384_?-><-655449385_?||655449386_?-><-655449387_?<-655449388_N6-MTase*||655449389_?-><-655449390_?
      291336302    <-N6-MTase<-N6-MTase*                                                                                N6-MTase    -                       -                 208                                                    uncultured organism MedDCM-OCT-S12-C54          hypothetical protein, partial [uncultured organism MedDCM-OCT-S12-C54].          <-291336295_?||291336296_?-><-291336297_?||291336298_?-><-291336299_?||291336300_?-><-291336301_N6-MTase<-291336302_N6-MTase*
      291336301    <-N6-MTase*<-N6-MTase                                                                                N6-MTase    -                       -                 123                                                    uncultured organism MedDCM-OCT-S12-C54          hypothetical protein [uncultured organism MedDCM-OCT-S12-C54].                   <-291336295_?||291336296_?-><-291336297_?||291336298_?-><-291336299_?||291336300_?-><-291336301_N6-MTase*<-291336302_N6-MTase
      
      Back to Contents
    • Multiple sequence alignment of the Group2-Clade4/EMIHUDRAFT_111979-like N6-MTases

                                                                                                                                **                      Str-1            Str-2                                                          Str-4                                                          Str-5                                      Str-6                                           Str-7                                                                                                                                                                                                                                                                                                                 
      FINAL                                                                          ------------------------------------------------------HHHHHHHHHH----EEEHHHH---HHHHHHHHHHH-HH-----HH-------------------------------------------E----EEEE------------------------------------HHHHHHH-H-HH-H-------E-EEEEE-----HHHHH-HHH-H---------------------E-EEEEE------------EEE--------------------------E-EEEEE----------------------------------------------------------------------EEEE---------------------HHH---HHHH------------------------------HHHHH----------------------H---
      ALIGN                                                                          ------------------------------------------------------HHHHHHHHHHHH-HHHHHH-----HH-HHH--EE-----------------------------------------------------------EEE-------------------------------------HHHHHHH-H-HH-H-------E-EEEEE------HHHH-HHH-H---H---------------E-E-HEEE-------------EEE---E------------------------EEEEE----------------------------------------------------------------------EEEE-E------HH-----------HHH---HHHH------HH------H----HH---HH-HHHHHHHH--------------------------
      HMM                                                                            -----------------------------------E--EEEE-----------HHHHHHHHHHH---EEEE-------HHHHHHHHH-H-HH-----HHHHH-HH------------------------------E------E----EEEEE------H------H-HHHHHHH-----------HHHHHHHHH-H-HH-H-------E-EEEEE---HHHHHHH-HHH-H---H---------------E-E-EEEEE------------EEE---EE---------------------E-EEEEE----------------------------------------------------------------------EEEE-EEE-----------------HHH---HHHHHH--HHHH------H----H-----H---HHHHHH----------------------HH--
      FREQ                                                                           ------------------------------------------------------HHHHHHHHHHHH-HHHHHHH------HHHHHHHHH-HH--------------H-------------------------H--------------EE-------------------EEEE-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      PSSM                                                                           ------------------------------------------------------HHHHHHHHH------E-----------------------------------------------------------------------------EEEE------------------------------------HHHHHHH-H-HH-H---------EEEEE-----HHHHH-HHH-H---------------------E-EEEEE------------EEE--------------------------E-EEEE-----------------------------------------------------------------------EEE----------------------HHH---HHHHH-----HH------H---------------HHHHH----------------------H---
      consensus/90%                                                                  ...............................................ps.W.TP..hF..Ls..F..F.lDssA.....pN.bs..ahs.........ssL...p.a.........................................hahNPPYup...............................al.+Ah.........p......sVhLlPsc.ss.aa...h................-.........lchl...........GRl.F...................sss..bss.hlhla........................................................................................................................................................................................................
      EMIHUDRAFT_111979_Emiliania_huxleyi_CCMP1516_551608163                         IT-PAA--A-A-------------------D-AD-V--DILS--VQKTQQWETPPQVWEYVSARWA-VDFDACAS---PINALAPRYST-VD-----DDFLA-REDL-------------------------K--D------I----TIYCNPPYAL-D------RYGTGSCAA-----------IEPFVRRLV-E-LA-ST-R-GC-T-CIALVPVLSHQLWFH-TCV-T---G-A--A-S--G-GRAAH-E-IHWVQ----------GLLKW---NN-PF-H-E-REP-ASPYI--YPF-ALCVW------------------------------------------R-P--G-AP----P-DRA-----H----EVVA-SLPRP-SDD-----------VSR---SFHFRR--CRRR------G----CG---KV-RLLPRHVD----------------------LLRA----------------------------------
      EMIHUDRAFT_240085_Emiliania_huxleyi_CCMP1516_551578908                         IA-PAA--A-A-------------------D-AD-V--DILS--VQKTQQWETPLPVWEYVSARWA-VDFDACAS---PINALAPRYST-VD-----DDFLA-REDL-------------------------K--D------I----TIYCNPPYAL-D------RYGTGSCAA-----------IEPFVRRLV-E-LA-ST-R-GC-T-CIALVPVLSHQSWFH-TCV-T---G-A--A-S--G-GRAAH-E-IHWVQ----------GLLKW---NN-PF-H-E-REP-ASPYI--YPF-ALCVW------------------------------------------R-P--G-AP----P-DLG-----D----SQYG-S-----CDA-----------HEY---VMHFT----------------------------------------------------------------------------------------------
      DX12_RS0110285_Vibrio_parahaemolyticus_646896396                               FS------S-A-------------------R-NG----------SSKQDKWQTPPAVFEKLNEEFN-FTLDATAE---PETALCDHYFT-ID-----DDAL--TQDW-------------------------G--N--Q--------TVYCNPPYSQ----------------------------LKDFAKKAQ---EE-AK-K-GA-T-VVMLVPARTDTKAFH-DYL-----S-H--G----E---------VRLIK----------GRLKF---L-------------MEGKE--QDA-A------------------------------------------------P--F-PS----M---------V----CVMG-------KDR-----------EQK---IGTTTQ-------------------------DALTLESK------------------------------------------------------------
      Q331_RS21100_Afifella_pfennigii_736470177                                      ---------------------------M----VH-Q--SLY---SSRTEEWETPPALFERLDRIFG-FRLDACAS---PANRKCETWFS-AA-----DNAL--ERSW-------------------------AEHG-----------RVWLNPPYGR-R--------------------------IAGFMRKAF---EE-SQ-K-GA-L-VVALVPARTDTLWWH-EWV---N-G-K--A----D---------IVFLK----------GRLKY-------L-DEN-RRE-RSPAP--FPS-ALVVY--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      HMPREF0179_03455_Bilophila_wadsworthia_3_1_6_316921487                         MN--------P---------------------------ALF---SSAKEDWETPREFFERLDGEFH-FDLDVCAF---PHNAKCPTYFT-KE-----DDGL--ARDW-------------------------G--NR----------VCWMNPPYGK-A--------------------------IKAWMTKAL---DA-SR-R-GA-T-VVCLVPSRTDTAWWH-DTV-IAG-G-A-------E---------VRFAR----------GRLRF-------------VGA-EHPAP--FPS-AVVIF------------------------------------------R-P--P-PS--------------P----SQQKETNDENNDPQ--------------------------------------------------------------------------------------------------------------------
      OAC_RS0107480_Vibrio_cyclitrophicus_515155813                                  FS------S-A-------------------R-TG----------NPKRDKWQTPPAVFKKLNEEFH-FTLDATAE---PETALCDHYFT-MD-----DDAL--TQDW-------------------------S--N--Q--------TVYCNPPYSQ----------------------------LKDFAKKAQ---EE-AN-K-GA-T-VVMLVPARTDTKAFH-DHL-----S-H--G----E---------VRLIK----------GRLKF---L-------------QDGEE--QDA-A------------------------------------------------P--F-PS----M---------V----CVMG-------NDV-----------EQK---IGTTTQ-------------------------DKLKLEPK------------------------------------------------------------
      A148_RS0111015_Vibrio_splendidus_695353200                                     FS------S-A-------------------R-TG----------NPKRDKWQTPPAVFKKLNEEFH-FTLDATAE---PETALCDHYFT-MD-----DDAL--TQDW-------------------------S--N--Q--------TVYCNPPYSQ----------------------------LKDFAKKAQ---EE-AK-K-GA-T-VVMLVPARTDTKAFH-DHL-----S-H--G----E---------VRLIK----------GRLKF---L-------------QDGEE--QDA-A------------------------------------------------P--F-PS----M---------V----CV--------------------------------------------------------------------------------------------------------------------------------
      HMPREF0179_RS04985_Bilophila_wadsworthia_749811142                             MN--------P---------------------------ALF---SSAKEDWETPREFFERLDGEFH-FDLDVCAF---PHNAKCPTYFT-KE-----DDGL--ARDW-------------------------G--NR----------VCWMNPPYGK-A--------------------------IKAWMTKAL---DA-SR-R-GA-T-VVCLVPSRTDTAWWH-DTV-IAG-G-A-------E---------VRFAR----------GRLRF-------------VGA-EHPAP--FPS-AVVIF------------------------------------------R-P--P-PS--------------P----SQQ-------------------------------------------------------------------------------------------------------------------------------
      OR63_RS06485_Clostridium_tetani_737140426                                      ---------------------------M--N-TA----VMF---SSETDLWATPQEFYNELNKEFN-FDLDPCAT---HENAKCPKYYT-VV-----EDGL--KQDW-------------------------Q--G--H--------KVFCNPPYGR-E--------------------------ISKWVEKAY---KE-SK-KENT-T-VVMLIPARTDTKYFH-SYI-Y---R-K--A---KE---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MVVVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      VPUCM_1151_Vibrio_parahaemolyticus_UCM-V493_584469889                          FS------S-A-------------------N-SG----------DKSKDKWQTPPEIFAQLNDRFG-FTLDAAAE---PETALCEKYFT-EE-----DDAL--KQDW-------------------------S--G--H--------VVFCNPPYSK----------------------------LRVFAKKAY---EE-SL-K-GT-T-VVMLVPARTDTQACH-DYL-----A-N--G----E---------MYFIR----------GRLKF---L-------------KVGEL--QDA-A------------------------------------------------P--F-PS----V---------V----CVLG-------PGV-----------ERKGGGLLTKKT-------------------------CCFGNKNLDEA----G----------------------------------------------------
      HMPREF1020_RS23965_Clostridium_sp_7_3_54FAA_496656604                          -------------M------------------ND----ALL---SSKNMCWCTPPDFFAELDREFH-FELDPAST---DKSAKCAKHFT-PD-----DDGL--KQDW-------------------------G--G--Y--------RVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I----------------------------------------------------------------------------------------------------------------------------
      CLOM621_RS14915_Clostridiales_492715347                                        -------------M------------------ND----ALL---SSKNMCWCTPPDFFAELDREFH-FELDPAST---DKSAKCAKHFT-PD-----DDGL--KQDW-------------------------G--G--Y--------CVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I----------------------------------------------------------------------------------------------------------------------------
      CLOM621_08346_Clostridium_sp_M62/1_291074040                                   ------------------------------------------------MCWCTPPDFFAELDREFH-FELDPAST---DKSAKCAKHFT-PD-----DDGL--KQDW-------------------------G--G--Y--------CVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I----------------------------------------------------------------------------------------------------------------------------
      VPUCM_RS05810_Vibrio_parahaemolyticus_740612810                                FS------S-A-------------------N-SG----------DKSKDKWQTPPEIFAQLNDRFG-FTLDAAAE---PETALCEKYFT-EE-----DDAL--KQDW-------------------------S--G--H--------VVFCNPPYSK----------------------------LRVFAKKAY---EE-SL-K-GT-T-VVMLVPARTDTQACH-DYL-----A-N--G----E---------MYFIR----------GRLKF---L-------------KVGEL--QDA-A------------------------------------------------P--F-PS----V---------V----CVLG------------------------------------------------------------------------------------------------------------------------------
      G454_RS0114655_Desulfovirgula_thermocuniculi_654109520                         M-------------------------------LN-R--GLF---SSASSEWETPQKFFETLDVEFG-FTLDVCAR---PENAKCPRYFS-PE-----EDGL--RQEW-------------------------A--PE----------VCWMNPPYGR-E--------------------------IGKWIQKAY---EE-AQ-K-GA-T-VVCLLPSRTDTAWWH-EYV-M---RAA-------E---------VRFIR----------GRLRF-------------GGA-ENGAP--FPS-CVVVF------------------------------------------R-P----GY--------------S--GLPVVK-------SMA--------AR----------------------------------------------------------------------------------------------------------
      BN981_RS01320_Halobacillus_737532221                                           ---------------------------M--NKMD----VHY---SSKTNEWATPQDFFDELNTEFN-FTLDPCAT---PDNAKCDKYFT-EK-----DDGL--EQSW-------------------------E--G--E--------TVFCNPPYGR-G--------------------------IKHWVKKAY---QE-ST-KPNT-T-VVLLIPSRTDTRYFH-DYV-Y---H-K--S----E---------IRFLK----------GRLKF-------------GDG-SGNAP--FPS-MVAIYR-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RH85_RS11625_Vibrio_ichthyoenteri_748690860                                    YS------S-A-------------------R-TG----------EKQQDRWQTPAEIFRQLNDEFH-FTLDAAAE---PSTALCSNYFT-EQ-----DDAL--AKNW-------------------------G--S--H--------VVYCNPPYSK----------------------------LREFARKAY---EA-SL-T-GA-T-VVMLVPARTDTQAFH-HYL-----S-K--G----E---------VRFIK----------GRLKF---L-------------QAGEA--QNT-A------------------------------------------------P--F-PS----M---------I----CVLG-------AGV-----------ERK---MITVLQ-------------------------DSLHNAVV------------------------------------------------------------
      TH16_RS01985_Staphylococcus_caprae_488372936                                   --------------------------------MS----VHF---SSKSNEWYTPQYLFDELNEKYQ-FTLDPCAS---HENAKCDKYFT-IE-----DDGL--TKDW-------------------------S--K--D--------IVFMNPPYGR-N--------------------------IKHWIKKAY---EE-SV-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-N--A---YN---------IKFLK----------GRIKF-------------GGA-VNSAP--FPS-AIVVF------------------------------------------KPKGDG-LK-----------------------------------------------------------------------------------------------------------------------------------------------------
      BZ26_RS0118830_Clostridium_botulinum_489480013                                 ---------------------------M--N-TA----VMF---SSETDLWATPQDFFDKLNKEFN-FDLDPCAT---KENAKCSKYFT-KE-----IDGL--KQDW-------------------------G--R--Y--------RVFCNPPYGR-E--------------------------IGKWVEKAY---KE-SK-KQNT-T-VVMLIPARTDTKYFH-SYI-Y---H-K--A---KE---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MIVVFRG------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      SD74_RS18965_Clostridium_botulinum_752703286                                   ---------------------------M--N-TA----VMF---SSETDLWATPQDFFDELNKEFD-FDLDPCAT---HENAKCDKYYT-IV-----EDGL--KQDW-------------------------Q--G--H--------KVFCNPPYGR-G--------------------------IKDWVEKAY---KE-SK-KENT-T-VVMLIPARTDTKYFH-SYI-Y---H-K--A---KE---------IRFIK----------GRLKF-------------GDA-KNSAP--FPS-MVVVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      J532_4398_Acinetobacter_baumannii_691154760                                    -N--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE-----------------------------------------------------------------------------------------------------------------
      K041_RS17240_Acinetobacter_baumannii_690981431                                 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE-----------------------------------------------------------------------------------------------------------------
      W9I_03525_Acinetobacter_nosocomialis_493629840                                 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      J532_4398_Acinetobacter_baumannii_940793_630464595                             ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE-----------------------------------------------------------------------------------------------------------------
      J517_3010_Acinetobacter_baumannii_691065210                                    MN--------T----------------M----TK-N--KLFGLAEERTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VNWE-------KSA--------------------------------------------------------------------------------------------------------------------
      LJ44_RS16470_Acinetobacter_baumannii_447017697                                 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      F985_01871_Acinetobacter_sp_NIPH_973_490838153                                 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PGNAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      J689_1368_Acinetobacter_calcoaceticus/baumannii_complex_645913983              MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      J660_0735_Acinetobacter_calcoaceticus/baumannii_complex_493629922              MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGC-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      J523_3197_Acinetobacter_baumannii_691027491                                    MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA--------------------------------------------------------------------------------------------------------------------
      J660_0735_Acinetobacter_baumannii_88816_593668543                              ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGC-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      ACIN5021_2863_Acinetobacter_sp_OIFC021_444754682                               ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      J660_1691_Acinetobacter_baumannii_691157882                                    MN--------S----------------M----SK-N--KLFGLAEDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VNWE-------KSA--------------------------------------------------------------------------------------------------------------------
      K035_3853_Acinetobacter_baumannii_691039522                                    MN--------S----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE-----------------------------------------------------------------------------------------------------------------
      RL05_RS02180_Staphylococcus_aureus_446374007                                   --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLSEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      FL80_RS05360_Acinetobacter_baumannii_690988986                                 MN--------N----------------M----TK-N--KLFGLAEERTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AN-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VNWE-------KSA--------------------------------------------------------------------------------------------------------------------
      PI74_RS05125_Clostridium_botulinum_500994137                                   ---------------------------M--N-TA----VMF---SSGTDLWATPQDFFDKLNKEFD-FDLDPCAT---HKNAKCSKYFT-KE-----IDGL--KQDW-------------------------Q--G--Y--------KVFCNPPYGR-S--------------------------IKDWVEKAY---KE-SK-KENT-T-VVMLIPARTDTRYFH-EYI-Y---N-K--A---KE---------IRFVK----------GRLKF-------------GDA-KNSAP--FPS-MVVVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      ABBL099_02355_Acinetobacter_baumannii_690996743                                MN--------N----------------M----TK-N--KLFGLAEERTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AN-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA--------------------------------------------------------------------------------------------------------------------
      J697_3983_Acinetobacter_baumannii_691093639                                    MN--------S----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-Q--S-LI--------------D----VSWE-------KSA--------------------------------------------------------------------------------------------------------------------
      RQ87_RS18135_Acinetobacter_baumannii_447010248                                 MN--------S----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA--------------------------------------------------------------------------------------------------------------------
      V006_02512_Staphylococcus_aureus_686297326                                     --------------------------------ME----VHY---SSKTNEWTTPQNLFDELNGEFN-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      A11W_RS0107210_Staphylococcus_hominis_515743089                                --------------------------------ME----VHY---SSKSNEWATPQNLFDELNEEFN-FTLDPCAT---DENAKCSKYFT-IE-----DDGL--SKDW-------------------------S--K--D--------VVFMNPPYGR-E--------------------------IKKWNKKAY---EE-SL-N-GA-T-VVCLIPARTDTTYWH-DFI-F---D-R--A---DD---------IRFLR----------GRLKF-------------GNS-KNSAP--FPS-AIVVY------------------------------------------R----G-VTT----------------------------------------------------------------------------------------------------------------------------------------------------
      G454_RS0102995_Desulfovirgula_thermocuniculi_654100680                         M-------------------------------FN-R--VLF---SSATSEWETPQELFARLHAEFG-FTLDVCAR---PWNAKCTRYFS-PE-----QNGL--IQEW-------------------------A--PE----------TCWMNPPYGR-E--------------------------ISRWVRKAW---EE-AQ-K-GA-T-VVCLLPSRTDTAWWH-EYV-M---RAA-------E---------IRFIR----------GRLHF-------------EGA-KNGAP--FPS-CVVVF------------------------------------------R-P----GC--------------T--GPPVIR-------SMA--------AR----------------------------------------------------------------------------------------------------------
      J546_RS10975_Acinetobacter_baumannii_736663998                                 ---------------------------M----AN-H--QLFGLAENRTDIWATPQDFFDKLNAVFK-FDLDVCAL---PNNAKCERFFS-PE-----DDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IIEWVAKAA---CT-AK-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KSNAP--FGC-CVVVF------------------------------------------R-P--T-LN--------------D----VEWE-------NA---GGGV--------------------------------------------------------------------------------------------------------------
      AWRIB429_RS09790_Oenococcus_oeni_768719850                                     ---------------------------M--N-NE----LMF---SSKTDLWSTPNDFFDKLNDEFH-FTLDPCST---HENAKCYKHFT-KE-----ENGL--LQDL-------------------------G--N--E--------VVFCNPPYGR-Q--------------------------IKDWVKKSY---EE-SQ-KDNT-T-VVMLIPARTDTIYFH-EYI-Y---H-K--A----E---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MVVIF--E-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      NZ45_03810_Clostridium_botulinum_700273311                                     ---------------------------M--N-TA----VMF---SSETDLWATPQDFFDKLNKEFD-FDLDPCAT---HENAKCSKYFT-KE-----IDGL--KQDW-------------------------Q--G--H--------KVFCNPPYGR-G--------------------------IKDWVEKAY---KE-SK-KENT-T-VVMLIPARTDTRYFH-EYI-Y---H-K--A---KE---------IRFVK----------GRLKF-------------GSA-KNSAP--FPS-MVVVFRGE-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RK90_RS13240_Staphylococcus_aureus_446374006                                   --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      SA930_RS14870_Staphylococcus_aureus_446374005                                  --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G--------------------------------------------------------------------------------------------------------------------------------------------------------
      VII00023_15021_Vibrio_ichthyoenteri_ATCC_700023_342803448                      YS------S-A-------------------R-TG----------EKQQDRWQTPAEIFRQLNDEFH-FTLDAAAE---PSTALCSNYFT-EQ-----DDAL--AKNW-------------------------G--S--H--------VVYCNPPYSK----------------------------LREFARKAY---EA-SL-T-GA-T-VVMLVPARTDTQAFH-HYL-----S-K--G----E---------VRFIK----------GRLKF---L-------------QAGEA--QNT-A------------------------------------------------P--F-PS----M---------I----CVLG-------AGV-----------ERK---MITVLQ-------------------------DSLHNAVV------------------------------------------------------------
      KU40_RS04850_Clostridium_botulinum_737823765                                   ---------------------------------------MF---SSKTDMWSTPQDFYNKLNQEFN-FNLDPCST---NENAKCERHYT-IA-----EDGL--KQNW-------------------------V--G--S--------TVFCNPPYGR-V--------------------------LKDWVKKCY---EE-SK-KDNT-T-VVMLIPARTDTTYFH-NYI-Y---K-K--V---KE---------IRFIR----------GRLKF-------------GDC-KNAAP--FPS-MVVVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      DESKU_RS03925_Desulfotomaculum_kuznetsovii_503587829                           M-------------------------------LN-E--SMF---SSRTGEWETPQTFFDALDAEFH-FTLDVCAR---PENAKCARFFT-PE-----QDGL--RQSW-------------------------A--GE----------TCWMNPPYGR-E--------------------------IGRWVEKAY---NE-AR-R-GA-V-VVALLPARTDTRWWH-RYV-M---RAA-------E---------IRFVE----------GRLKF-------------GGA-ENSAP--FPS-VVVVF------------------------------------------T-P--EKAV--------------S--DGPVVR-------SMR--------VK----------------------------------------------------------------------------------------------------------
      CO98_RS04645_Staphylococcus_aureus_739716594                                   --------------------------------MS----VHF---SSKSNEWTTPQYLFDELNEEFN-FTLDPCAT---DENAKCSKYFT-IE-----DDGL--SKDW-------------------------S--N--D--------VVFMNPPYGR-E--------------------------IKKWIKKAY---EE-SL-N-GA-T-VVCLIPARTDTTYWH-DFI-F---D-K--A---DD---------IRFLK----------GRLKF-------------GNS-KNSAP--FPS-SIVIY------------------------------------------E----C-KEAEQ--------------------------------------------------------------------------------------------------------------------------------------------------
      ERS140248_02184_Staphylococcus_aureus_678260344                                --------------------------------ME----VHY---SSKTNEWATPQNLFDDLNREFN-FTLDPCST---DENAKCQKHYT-AK-----DNGL--IQDW-------------------------S--E--D--------VVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SV-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------SES-KNSAP--FPS-AIIVY------------------------------------------R----G-GR-----------------------------------------------------------------------------------------------------------------------------------------------------
      HMPREF9988_RS10060_Staphylococcus_epidermidis_488427723                        --------------------------------ME----VHY---SSKSNEWATPQKLFDELDKEFN-FTLDPCAT---DENAKCNKHFT-IE-----DDGL--SKDW-------------------------S--K--D--------VVFMNPPYGR-E--------------------------IKKWIKKAY---EE-SL-N-GA-T-VVCLIPARTDTTYWH-DFI-F---D-K--A---DD---------IRFLR----------GRLKF-------------GNS-KNSAP--FPS-AIVVY------------------------------------------L----G-VTT----------------------------------------------------------------------------------------------------------------------------------------------------
      T666_02640_Staphylococcus_aureus_686391504                                     --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKCWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      BBRE_RS02915_Bifidobacterium_breve_518557238                                   MS--------DFTG------------------AG-G--AAY---MSNRMNWETPQELFDQLDAEFH-FTLDAASS---ATNHKCQKYYT-AE-----DSAF--DHEW-------------------------G--G--E--------TVFCNPPYGK-A--------------------------IAEWVRKCS---AE-AS-RKDT-L-VVMLLPARTDTRWFQ-QFI-L---N-R--A----E---------VRFLK----------GRLRF-E-TN--------GIP-GGPAP--FPS-MIVVM------------------------------------------R-T--G-ER-----------------------------------------------------------------------------------------------------------------------------------------------------
      AS94_12270_Staphylococcus_aureus_686449191                                     --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVEKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      Q332_RS01180_Pseudobacteroides_cellulosolvens_739064083                        -------------MN-----------------TE----IMF---SSKSDEWETPQQFFDKLHKEFN-FQLDVCAT---AENAKCDKYYT-KI-----DDGL--SQSW-------------------------H--HWAQ--------RCWMNPPYGR-N--------------------------IDKWIKKAF---DE-SQ-E-GA-T-VVCLIPARTDTKYWH-TYC-M-----K--A---HE---------IRFVK----------GRLKF-------------SNS-KDCAP--FPS-AIVVF------------------------------------------K-P--T-LK--QLKVSSY--------------------------------------------------------------------------------------------------------------------------------------------
      QI18_RS10395_Lactococcus_lactis_746045508                                      ---------------------------M--N-RE----LMF---SSKTDLWSTPWNFFEKLNDEFH-FTLDPCST---HENAKCYKHFT-IK-----EDGL--LQDW-------------------------G--N--E--------VVFCNPPYGR-K--------------------------IKDWVKKAY---EE-SQ-KDNT-T-VVMLIPARTDTIYFH-EYV-Y---H-K--A----E---------VRFIK----------GRLKF-------------GDA-KNAAP--FPS-MVVIF--RKDNQ-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      APL_RS02615_Actinobacillus_pleuropneumoniae_500173972                          ---------------------------M-------T--------NFDKNTWQTPPECANYVKRRWA-IKWDGAAT---VENKICDYFIT-PEI------------DF-LNFESID---R-II--------E-N--N--A--------RIFINPPYGR-----------------------GY---VEKFVKQAV---RLMNE-K-RC-F-VVMLLNADKSTEWFK-LIR-----E-H--A-T--E-------V-IDIVG----------KRVAF---IN-PI-T---KKP-VEDNP--KWQ-MFAVF------------------------------------------D-P--Y-AE--------------G----FVTS-------YVS-----------YDK---IKQVGT-----------------ND-N-NA---------------------------------------------------------------------
      APPSER11_RS02705_Actinobacillus_pleuropneumoniae_491783102                     ---------------------------M-------T--------DFDKNTWQTPPECANYVKRRWA-IKWDGAAT---VENKICDYFIT-PEI------------DF-LNFESID---R-II--------E-N--N--A--------RIFINPPYGR-----------------------GY---VEKFVKQAV---RLMNE-K-RC-F-VVMLLNADKSTEWFK-LIR-----E-H--A-T--E-------V-IDIVG----------KRVAF---IN-PI-T---KKP-VEDNP--KWQ-MFAVF------------------------------------------D-P--Y-AE--------------G----FVTS-------YVS-----------YDK---IKQVGT-----------------ND-N-NA---------------------------------------------------------------------
      ABSDF2497_Acinetobacter_baumannii_SDF_169152788                                MN--------T----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFK-FDLDVCAL---PDNAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---DT-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LV--------------D----VNWE-------KSA--------------------------------------------------------------------------------------------------------------------
      SAZ172_RS05790_Staphylococcus_aureus_554679133                                 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--LPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      W619_00569_Staphylococcus_aureus_686419170                                     --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNGEFN-FTLDPCST---DENAKCQKHYT-AK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKHWVKKAY---EE-SV-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GES-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      Q7S_RS08715_Rahnella_aquatilis_505727589                                       TS-EFA----S-------------------T-TP----------IEHKDRWQTPVEVFTALDLEFG-FYLDAAAD---YQNALCARYLT-EG-----DDAL--ATEW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWIEKAA---EQ-CRAQ-HQ-P-VVMLLPADTSTGWFS-LAL-----T-T--A-D--E---------IRFIT-----D----GRLSF---IN-AG-T---GKPGKNGNS--KGS-MLVIW------------------------------------------R-P--F-IK----P---------R----SQFT-------TVS-----------RDA---LITAGA-------------------------DYLQEVAA------------------------------------------------------------
      ERIC1_1c08270_Paenibacillus_larvae_subsp_larvae_DSM_25719_567770034            MN--------K---------------------------VHY---SSKTDMWETPQNLFDRLNEEFK-FDLDVCAI---PENAKCKRYFT-PS-----EDGL--KQEW-------------------------K--G-----------ACWMNPPYGR-Q--------------------------IGKWIAKAY---ES-SL-E-GA-T-VVCLVPSRTDTKWWH-GYC-M---K-G-------E---------IRFIR----------GRLKF-------------GGS-PHNAP--FPN-AVVIF------------------------------------------R-G--R-KE-------------SL----HGQK-------RNE--------TKDDCA------------------------------------------------------------------------------------------------------
      J596_3741_Acinetobacter_baumannii_691117543                                    MN--------S----------------M----AK-L--GLYGNAEGKTDVWATPQNLFDALDQIFN-FDLDVCAL---PENAKCERYFT-PE-----LDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-N-GH-T-VVGLLPVRTDVVWWQ-EHI-L---H-R-------E---------IHYIK----------GRLKF-------------GGS-KHNAP--FGC-ALVVF------------------------------------------R-P--S-LK--------------D----VQSD-------KSI--------------------------------------------------------------------------------------------------------------------
      T259_RS08765_Clostridium_botulinum_748203410                                   ---------------------------M--N-TA----VMF---SSETDLWATPQDFFDKLNKEFN-FDLDPCAT---HENAKCSKYFT-KE-----IDGL--KQDW-------------------------Q--G--Y--------KVFCNPPYGR-V--------------------------LKDWVKKCY---EE-SL-KPNT-T-VVMLIPARTDTKYFH-EYI-Y---H-K--V---KE---------IRFVK----------GRLKF-------------GDA-KNSAP--FPS-MVVVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      U183_02276_Staphylococcus_aureus_686300364                                     --------------------------------ME----VHY---SSKTNEWTTPQNLFDDLNREFN-FTLDPCST---DENAKCQKHYT-EN-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKHWVKKAY---EE-SI-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GES-KNSAP--FPS-AIIVY------------------------------------------R----G-VR-----------------------------------------------------------------------------------------------------------------------------------------------------
      IH28_RS0115430_Acinetobacter_baumannii_663438128                               ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------G---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      BN927_RS09785_Lactococcus_lactis_554763517                                     ---------------------------M--N-KE----LMF---SSKTDLWSTPWNFFDKLNDEFH-FTLDPCST---HENAKCYKHFT-IE-----EDGL--LQDW-------------------------G--N--E--------VVFCNPPYGR-Q--------------------------IKDWVKKAY---EE-SQ-KDDT-T-VVMLIPARTDTIYFH-EYI-Y---H-K--A----E---------IRFIK----------GRLKF-------------GDA-KNAAP--FPS-MVVIF--RKDNQ-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      MC49_RS06655_Morganella_morganii_738462811                                     MA-VHT----R---------------------------NTP---GEYKDSWQTPEWLFTALDLEFG-FYLDAAAS---DINALCSRYLT-EQ-----DDAL--KSEW-------------------------V--S--H-----G--AIWCNPPYSN----------------------------IRPWVEKAA---EQ-SRMQ-NQ-P-VVMLVPEDMSVGWFL-EAL-----K-T--V-D--E---------IRVIT-----G----GRINF---VN-PV-T---GEE-KKGNS--KGS-MLLIW------------------------------------------R-P--F-IT----P---------R----RLSS-------FAL-----------KQE---LEAIGN-------------------------QYLAEVSA------------------------------------------------------------
      ERIC1_RS03940_Paenibacillus_larvae_738763505                                   MN--------K---------------------------VHY---SSKTDMWETPQNLFDRLNEEFK-FDLDVCAI---PENAKCKRYFT-PS-----EDGL--KQEW-------------------------K--G-----------ACWMNPPYGR-Q--------------------------IGKWIAKAY---ES-SL-E-GA-T-VVCLVPSRTDTKWWH-GYC-M---K-G-------E---------IRFIR----------GRLKF-------------GGS-PHNAP--FPN-AVVIF------------------------------------------R-------------------------------------------------------------------------------------------------------------------------------------------------------------
      F931_01759_Acinetobacter_pittii_507070967                                      MN--------S----------------M----AK-L--GLYGNAEGKTDVWATPQNLFDAIDHIFN-FDLDVCAL---PENAKCDRYFT-PE-----LDGL--KQEW-------------------------V--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-N-GH-T-VVGLLPVRTDVVWWQ-EHI-L---H-R-------E---------IHYIK----------GRLKF-------------GGC-KHNAP--FGC-ALVVF------------------------------------------R-P--S-LK--------------D----VRWE-------SSI--------------------------------------------------------------------------------------------------------------------
      SAGV69_RS11740_Staphylococcus_aureus_506511035                                 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWR-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      TT45_RS11045_Acinetobacter_baumannii_758882462                                 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWISKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWA-------MA---VNEFRKAESKEG------------------------------------------------------------------------------------------------------
      ANACOL_RS13845_Anaerotruncus_colihominis_493931641                             -------------M------------------NK----ALL---SSKRLDWCTPRDFFDALDVEFH-FTLDAAAT---EKSAKCAKYYT-PE-----TDGL--SASW-------------------------A--G--E--------TVFCNPPYGR-E--------------------------IKAWIKKGF---EE-GQ-QSGT-T-VVLLIPSRTDTEYFH-KYI-L---G-K--A----E---------IRFLK----------GRLKF---------TDEEGLT-QDAAP--FPS-MLVIY------------------------------------------R----G-QG------KEQNDG-----------------------------------------------------------------------------------------------------------------------------------------
      Phi93_04_Lactococcus_phage_phi93_673939868                                     ---------------------------M--N-NE----LMF---SSKTDLWSTPNDFFDKLNDEFH-FTLDPCST---HENAKCYKHFT-KE-----ENGL--LQDW-------------------------G--N--E--------VVFCNPPYGR-Q--------------------------IKEWIKKSY---EE-SQ-KDNT-T-VVMLIPARTDTIYFH-EYI-Y---H-K--A----E---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MVVIF--E-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      D478_RS25245_Brevibacillus_agri_748713908                                      ---------------------------------------MF---TSEREEWETPQDFFEKLNKEFG-FQLDVCAL---PTNAKCERYFT-PD-----EDGL--KQEW-------------------------T--G-----------VCWMNPPYGR-E--------------------------IGKWVKKAY---ES-AK-Q-GA-T-VVCLLPARTDVKWWH-DYC-M---K-G-------E---------IRLVR----------GRMKF-------------VGA-DNMAP--FPN-AVVIF------------------------------------------S-P--A-SA--------------G----CSYK-------AID--------K-----------------------------------------------------------------------------------------------------------
      C236_RS0118880_Brevibacillus_laterosporus_517503045                            ------------------------------MAIN-E--GMF---TSSTDLWETPQDFFNQLNKEFG-FQLDVCAL---PENAKCERYFS-PD-----EDGL--QQEW-------------------------T--G-----------ICWMNPPYGR-Q--------------------------IGKWIKKAY---ES-SL-N-GA-T-VVCLIPARTDASWWH-AHC-M---K-G-------E---------IRLVK----------GRLKF-------------GGS-KWNAP--FPN-AVVIF------------------------------------------R-K--V-GS--------------Q----HSYK-------AID--------KYGYFI------------------------------------------------------------------------------------------------------
      D478_26539_Brevibacillus_agri_BAB-2500_432181416                               MI--------K----------------TSDNIIN-K--AMF---TSEREEWETPQDFFEKLNKEFG-FQLDVCAL---PTNAKCERYFT-PD-----EDGL--KQEW-------------------------T--G-----------VCWMNPPYGR-E--------------------------IGKWVKKAY---ES-AK-Q-GA-T-VVCLLPARTDVKWWH-DYC-M---K-G-------E---------IRLVR----------GRMKF-------------VGA-DNMAP--FPN-AVVIF------------------------------------------S-P--A-SA--------------G----CSYK-------AID--------K-----------------------------------------------------------------------------------------------------------
      BN981_00304_Halobacillus_trueperi_635344555                                    ---------------------------M--GKMN----VHY---SSKSNDWATPQDFFDGLDNEFN-FTLDPCAT---SENAKCDNYFT-IE-----DDGL--KQSW-------------------------E--G--E--------TVFCNPPYGR-E--------------------------IKLWVKKAF---QE-SK-KPNT-K-VVMLIPARTDTKYFH-DYI-Y---M-Q--A----R---------VRFIK----------GRLKF-------------GNG-KGNAP--FPS-MVVIF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      BN911_RS03730_Morganella_morganii_738472851                                    MA-GYA----S------------------------------NTAPEHKDSWQTPEWLFTALDLEFG-FYLDAAAS---DINALCSRYLT-EQ-----DDAL--KSEW-------------------------V--S--H-----G--AIWCNPPFSN----------------------------IRPWVEKAA---EQ-ARMQ-NQ-P-VVMLVPEDMSVGWFL-EAL-----K-T--V-D--E---------IRVIT-----G----GRINF---VN-PV-T---GEE-KKGNS--KGS-MFLIW------------------------------------------R-P--F-IT----P---------R----RLPS-------FAL-----------KQD---LESIGN-------------------------QYLAEVRA--A---------------------------------------------------------
      J689_1349_Acinetobacter_baumannii_691068978                                    MN--------T----------------M----AQ-R--KLFGLAENRTDVWATPQDFFDKLNAVFN-FDLDVCAL---PENAKCERFFS-PE-----QNGL--KQEW-------------------------I--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQW-----------------DKGASRE-------------------------------------------------------------------------------------------------------
      BN981_RS01350_Halobacillus_737533832                                           --------------------------------MN----VHY---SSKSNDWATPQDFFDGLDNEFN-FTLDPCAT---SENAKCDNYFT-IE-----DDGL--KQSW-------------------------E--G--E--------TVFCNPPYGR-E--------------------------IKLWVKKAF---QE-SK-KPNT-K-VVMLIPARTDTKYFH-DYI-Y---M-Q--A----R---------VRFIK----------GRLKF-------------GNG-KGNAP--FPS-MVVIF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RN38_RS21980_Hafnia_paralvei_746124400                                         MS-EFA----S-------------------N-TP----------LEHKDRWQTPIEVFSALDAEFG-FYLDAAAE---HGNALCARYLT-ER-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CKAQ-SQ-P-VVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------GIMAGVKA--A---------------------------------------------------------
      J594_4091_Acinetobacter_baumannii_259052_588219826                             ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      TS65_RS13365_Aneurinibacillus_migulanus_759006369                              M-------------------------------NT-A--VMF---SSATDEWATPQDFFDQLNQEFH-FTLDPCAT---HESAKCARYFT-EE-----DNGL--AQDW-------------------------T--GE----------IVFMNPPYGR-V--------------------------LGQWVKKAF---EE-SI-K-GA-T-VVCLLPARTDTRWFH-DYI-Y---HRA-------E---------IRFVK----------GRLKF-------------GDS-KNSAP--FPS-MVVIF------------------------------------------N-R--A-GV--------------KVGG-----------------------------------------------------------------------------------------------------------------------------------
      J595_RS19805_Acinetobacter_baumannii_691047241                                 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------
      HMPREF0454_RS07315_Hafnia_alvei_490192932                                      MS-EFA----S-------------------N-TP----------LEHKDRWQTPIEVFAALDAEFG-FYLDAAAD---HGNALCARYLT-ES-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CKAQ-SQ-P-IVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RNE---LIAIGS-------------------------SIMAGVKA--A---------------------------------------------------------
      GEAM_RS21330_Ewingella_americana_736793592                                     MN-EFA----S-------------------H-TP----------VEHKDRWQTPLEVFTALDLEFG-FYLDAAAD---DQNALCARYLS-EA-----DNAL--ATEW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CHVQ-NQ-P-VVMLLPADTSTGWFA-QAL-----A-T--A-D--E---------IRFIT-----E----GRLSF---IN-AG-T---GKPGKNGNS--KGS-MLVIW------------------------------------------R-P--F-IK----P---------R----GQFT-------TVC-----------RDV---LLSIGA-------------------------DYLQEVAA------------------------------------------------------------
      Mm0Y_RS16130_Morganella_morganii_802097985                                     MA-GYA----S-------------------K-TA----------PEHKDSWQTPEWLFTALDLEFG-FYLDAAAS---DINALCSRYLT-EQ-----DDAL--KSEW-------------------------I--S--H-----G--AIWCNPPFSN----------------------------IRPWVEKAA---EQ-SRMQ-NQ-P-VVMLVPEDMSVGWFL-EAL-----K-T--V-D--E---------IRVIT-----G----GRINF---VN-PV-T---GEE-KKGNS--KGS-MFLIW------------------------------------------R-P--F-IT----P---------R----RVLN-------TTL-----------KQE---LEAIGN-------------------------QYLAEVSA------------------------------------------------------------
      HMPREF0864_RS08005_Enterobacteriaceae_bacterium_9_2_54FAA_496089880            MS-EFA----S-------------------N-TP----------LEHKDRWQTPIEVFAALDAEFG-FYLDAAAD---HGNALCARYLT-ES-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVGKAT---EQ-CRAQ-SQ-P-VVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRIIT-----G----GRLAF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------DIMAGVKA--A---------------------------------------------------------
      ACINWC323_A0077_Acinetobacter_sp_WC-323_425484490                              ---------------------------M----AK-S--KLFGLAEDRTDVWATPQDFFDKLNAIFD-FDLDVCAL---PENAKCERYFT-PE-----IDGL--SQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-A-GY-T-VVALLPARTDVGWWQ-SHC-L---N-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-AVVVF------------------------------------------R-P--S-LN--------------D----VRWE-------QSQ--------------------------------------------------------------------------------------------------------------------
      ACINWC323_RS01110_Acinetobacter_sp_WC-323_696306260                            MN--------S----------------M----AK-S--KLFGLAEDRTDVWATPQDFFDKLNAIFD-FDLDVCAL---PENAKCERYFT-PE-----IDGL--SQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-A-GY-T-VVALLPARTDVGWWQ-SHC-L---N-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-AVVVF------------------------------------------R-P--S-LN--------------D----VRWE-------QSQ--------------------------------------------------------------------------------------------------------------------
      SG0729_Sodalis_glossinidius_str_'morsitans'_84779227                           LI--------S-------------------N-TP----------KSFKDRWQTPIEVFRALDAEFN-FKLDAAAD---KSNALCKAFLT-EQ-----HDAL--KSDW-------------------------N--S--K-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CKKQ-NQ-T-IVMLLPSDTSTAWFY-EAL-----K-T--S-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GKEGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----P---------G----CWMT-------YVQ-----------RKE---LLKKGV-------NRDNRTPLPAQR--------------------------------------------------------------------------
      SGP1_RS15415_Sodalis_glossinidius_499730784                                    IV--------S-------------------Q-TP----------KACKDKWQTPVEIFRALDAEFG-FGLDAAAD---FANALCRRYLT-EE-----DDAL--NCEW-------------------------H--T--R-----G--AIFCNPPYSN----------------------------ITPWVSKAA---EQ-CAVQ-KQ-T-IVMLLPSDTSTGWFR-MGL-----E-S--V-D--E---------VRVIT-----G----GRLSF---IS-AA-T---GVCGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----N---------R----CQFT-------TVD-----------KSD---LIRIGT-------EAVR----EVAA--------------------------------------------------------------------------
      AB64_RS00770_Escherichia_coli_486273694                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------L--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      J635_1953_Acinetobacter_baumannii_690997976                                    MN--------T----------------M----AK-L--GLFGNAEGRTDVWATPQKLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------A--G-----------TCWMNPPYGR-E--------------------------IVDWISKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VKWG-------DQ---------------------------------------------------------------------------------------------------------------------
      ARN_24250_Arsenophonus_nasoniae_284008293                                      LI--------S-------------------H-TP----------KPFKDRWRTPIGVFKTLDAEFN-FKLDAAAD---KNNALCKAFLT-EQ-----QDAL--TCDW-------------------------N--S--N-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CRKQ-NQ-T-IVMLLPSDTSTAWFY-EGL-----N-T--A-D--E---------IRFIT-----E----GRLSF---VS-AE-T---GEQGISGNS--KGS-VLFIW------------------------------------------R-P--L-GR----E---------M----CRMT-------HIR-----------KKE---LLPLTI-------GCST----------------------------------------------------------------------------------
      M655_RS0109725_Bacillus_sp_NSP21_737442515                                     ---------------------------------------MF---KSEREEWETPQEFFDKLNDEFG-FQLDVCAL---PTNAKCERYFT-PD-----DDGL--HQEW-------------------------T--G-----------VCWMNPPYGR-E--------------------------IGKWVKKAY---ES-AK-Q-GA-T-VVCLLPARTDVKWWH-DYC-M---K-A-------E---------IRLVR----------GRMKF-------------VGA-DNMAP--FPN-AVVIF------------------------------------------S-P--A-SA--------------G----CSYK-------AID--------K-----------------------------------------------------------------------------------------------------------
      BE89_RS22035_Escherichia_coli_446051431                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      AB660_RS05030_Chromobacterium_subtsugae_828144310                              MT--------D----------------A----S-----IHF---RSSTDEWPTPQLLFDELHAEFQ-FTVDVCAT---PGNAKCPRYYT-RA-----DDGL--AQDW-------------------------S--AE----------TVWMNPPFGH-G--------------------------IKFWMEKAL---KS-AR-A-GA-T-VVCLVPSRTDTRWWH-RYA-MW--A-A-------E---------IRCLD----------KRLQF-------------DGG-SAKAP--FPA-VVIVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      CG67_RS0113335_Tatumella_sp_UCD-D_suzukii_647630238                            MT-D----K-S-------------------N-TS----------AEHKDSWQTPPEVFRALNAEFQ-FQLDAAAS---AHNALCRKYIT-AE-----QDTL--QTEW-------------------------G--D--YVE--NG--YAWLNPPYSA----------------------------PLPFVEKAG---KE-KELN-HV-G-CVMLLPADISVGWFK-EAV-----K-T--A-S--E---------VRLIT-----G----GRLAF---IS-SQ-T---GKP-VGGNN--KGS-LLIIW------------------------------------------H-P--W-PT----G---------S----CQFK-------TVD-----------RDQ---LINFGK-------------------------RLMERAA-------------------------------------------------------------
      ABOUO_79_Paenibacillus_phage_Abouo_525335850                                   ------------------------------MAIN-E--GMF---TSSTDLWETPQEFFNQLNQEFG-FQIDVCAL---PENAKCERYFS-PD-----EDGL--QQEW-------------------------T--G-----------ICWMNPPYGR-Q--------------------------IGKWIKKAY---ES-SL-N-GA-T-VVCLIPARTDARWWH-DYC-M---K-G-------E---------IRLVK----------GRLKF-------------GSS-KWSAP--FPN-ALVIF------------------------------------------K-E--A-GS--------------Q----HSYK-------AID--------KYGSLL------------------------------------------------------------------------------------------------------
      SGP1_RS06170_Sodalis_glossinidius_754366340                                    LI--------S-------------------N-TP----------KSFKDRWQTPIEVFRALDAEFN-FKLDAAAD---KSNALCKAFLT-EQ-----HDAL--KSDW-------------------------N--S--K-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CKKQ-NQ-T-IVMLLPSDTSTAWFY-EAL-----K-T--S-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GKEGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----P---------G----CWMT-------YVQ-----------RKE---LL-------------------------------------------------------------------------------------------------
      CLOSCI_00567_[Clostridium]_scindens_ATCC_35704_167664126                       -------------MTDRGKKGDLTMAPL--N--K----ALF---SSAKEDWATPQDFFDELNKEFH-FDLDPCAD---AENAKCKEFFT-KE-----QNGL--LQDW-------------------------G--G--R--------CVFCNPPYGRTS--------------------------TGEWIKKCY---EE-AQ-KPGT-V-VVALIPARTDTRFFH-DYI-Y---H-K--A----E---------IRFIK----------GRLHF-------------GGC-KDAAP--FPS-MVVVF-----RKGKENEEEKKTGCTAAGHTEEKAAEKDDGSENGVDGI-------------------------------------------------------------------------------------------------------------------------------------------------------------
      G468_RS0114315_Arsenophonus_nasoniae_652428396                                 LI--------S-------------------H-TA----------KPFKDRWQTPIEVFRTLDAEFT-FRLDAAAD---ENNALCTAFLS-EK-----ADAL--KCDW-------------------------N--S--D-----G--AIFCNPPYSN----------------------------IKPWVNKAA---EQ-CRKQ-KQ-T-IVMLLPSDTSTAWFY-EGL-----N-T--A-D--E---------IRFIT-----E----GRLLF---VS-AE-T---GEQGTSGNS--KGS-VLFIW------------------------------------------R-P--L-ER----E---------V----CKIT-------HIR-----------KKE---LLPLTT-------GCST----------------------------------------------------------------------------------
      AT03_RS13490_Hafnia_alvei_647467325                                            MS-EFA----S-------------------N-TP----------LEHKDRWQTPIGVFSALDAEFG-FYLDAAAD---HGNALCARYLT-ER-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CKAQ-SQ-P-IVMLLPADTSTGWFP-LAL-----E-S--V-D--E---------VRIIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------SIMAGVKA--A---------------------------------------------------------
      MY84_RS08540_Escherichia_coli_446051432                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAV---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      HA49_RS14705_Tatumella_morbirosei_740176027                                    MT-D----K-S-------------------N-TP----------AEHKDTWQTPPEIFRALNAEFQ-FQLDAAAS---PHNALWRKFIT-AE-----QDTL--RTEW-------------------------G--D--YVE--NG--YAWLNPPYSA----------------------------PLPFVEKAA---KE-KELN-HV-G-CVMLLPADISVGWFR-EAV-----K-T--A-S--E---------VRLIT-----G----GRLAF---IS-SQ-T---GKP-VGGNN--KGS-LLIIW------------------------------------------H-P--W-PT----G---------S----CQFK-------TVD-----------RDQ---LMDFGK-------------------------RLIARAA-------------------------------------------------------------
      J644_3880_Acinetobacter_baumannii_691073319                                    ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      J641_4016_Acinetobacter_baumannii_1188188_589421412                            MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      EMTOL_RS19950_Emticicia_oligotrophica_504839093                                ---------------------------M----NI-K--AIF---SCKTTNWETPQDLFDELDKQYN-FTLDVCAT---SENAKCNEFFT-PE-----IDGL--KQEW-------------------------K--G-----------MCWMNPPYGR-E--------------------------IGKWVRKAH---LE-VI-T-GRCR-IIALLPARTDTKWFH-EWV-L---NKH-------E---------IKFIK----------GRLRF-------------SDS-KNSAP--FPS-MLVIF------------------------------------------E-G--R-----------------P--------------------------------------------------------------------------------------------------------------------------------------
      BTS2_RS02440_Bacillus_sp_TS-2_780117918                                        --------------------------------MN-Q--AMF---SSSTDKWSTPQSFYDKLNQEFQ-FDIDVCAT---DSDKKCERYFS-PE-----QDGL--KQEW-------------------------T--G-----------ICWMNPPYGR-G--------------------------IGPWIQKAY---ES-SQ-Q-GA-T-VVCLLPSRTDTKWWH-EYC-M---K-G-------E---------IRFIK----------GRLKF-------------GDS-KNSAP--FPS-VVVIF------------------------------------------R-P--K-VV---------------------------------------------SM------------------------------------------------------------------------------------------------------
      LILY_61_Bacteriophage_Lily_755258783                                           MS--------N----------------T--MAVH------Y---SSKTDMWETPQDFFDKLHAEFG-FTLDVCAV---PENAKCERFFS-PD-----DNGL--LQNW-------------------------K--G-----------VCWMNPPYGR-Q--------------------------IGAWIAKAY---ES-SL-E-GA-T-VVCLVPSRTDTKWWH-DYC-L---K-G-------E---------VRFIK----------GRLKF-------------GGS-PHNAP--FPN-AIVIF------------------------------------------R-G--K-GQ-----------------------------------------------------------------------------------------------------------------------------------------------------
      GAP32_068_Cronobacter_phage_vB_CsaM_GAP32_414086984                            -------------M------------------EDNNMSVHF---SSASNTWDTPDDFYQKLHAVWN-FTLDPAAM---DETAKCEKYYT-PE-----TDGL--AHSW-------------------------A--G--E--------TVWCNPPYGR-E--------------------------ISKWFKKFD---EE-FK-QNGT-T-IIALPPARTDTTYFH-KYV-R---D-S--A---TA---------ICFVK----------GRLKFDN-RSLPSWKEDGSHK-KTGAP--FPS-MIVIY------------------------------------------D----N-NI------TQEKYEVLNSLGFVVQP-FLLG-------------------------------------------------------------------------------------------------------------------------
      BTS2_0497_Bacillus_sp_TS-2_591276954                                           ------------------------------MTIN-Q--AMF---SSSTDKWSTPQSFYDKLNQEFQ-FDIDVCAT---DSDKKCERYFS-PE-----QDGL--KQEW-------------------------T--G-----------ICWMNPPYGR-G--------------------------IGPWIQKAY---ES-SQ-Q-GA-T-VVCLLPSRTDTKWWH-EYC-M---K-G-------E---------IRFIK----------GRLKF-------------GDS-KNSAP--FPS-VVVIF------------------------------------------R-P--K-VV---------------------------------------------SM------------------------------------------------------------------------------------------------------
      J532_3860_Acinetobacter_baumannii_940793_630469298                             MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      J532_3860_Acinetobacter_baumannii_691154170                                    ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      BLAHAN_04217_Blautia_hansenii_DSM_20583_260542363                              -----------------------------------------------------------------------------------MRETLH-AG-----RRRP--KARL-------------------------G--G--Y--------RVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I----------------------------------------------------------------------------------------------------------------------------
      P262_01673_Cronobacter_sakazakii_CMCC_45402_564117231                          MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KCLEALKC--ENGKAA----------------------------------------------------
      JO80_RS0108885_Cronobacter_malonaticus_696416059                               MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENGKAA----------------------------------------------------
      P262_RS05820_Cronobacter_sakazakii_752821882                                   MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KCLEALKC--ENGKAA----------------------------------------------------
      ECC34793_RS0111695_Escherichia_coli_585346834                                  MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YIWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      FL80_RS15355_Acinetobacter_baumannii_690990657                                 MS--------T----------------M----AK-L--GLFGNAEGRTDVWATPQTLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWISKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-AVVVF------------------------------------------R-P--S-LK--------------D----VQWG-------AQ---------------------------------------------------------------------------------------------------------------------
      H627_RS17735_Lactobacillus_harbinensis_737460398                               MS--------D--F-------------L--K-PG-G--AAL---TSNKDDWETPQAFFESLNAKYH-FAIDLAAS---KDNAKCDRYFS-VA-----DDSL--LQDWSD---------------------DFG--G-----------AMYLNPPYGR-H--------------------------IGDWVKKAY---ET-SL-RVNV-P-IVLLIPARTDTSYWH-DYI-F---G-K-------A--S------IKFIR----------GRLKF-E-QN--------GMA-GGPAP--FPS-AIIVY-------------------------------------N----G-D--G-AE--------------K--------------------------------------------------------------------------------------------------------------------------------------
      ASU2_RS02700_Actinobacillus_491812488                                          ---------------------------M-------T--------DFDKNTWQTPQECRTYAKYRWL-VIWDGAAT---AENAICERFIT-PEI------------DF-LNFDAVT---Q-II--------P-N--H--A--------RIFINPPYGR-----------------------GY---VKKFVRQAI---RLMRE-K-QC-F-IVMLLNADKSTEWFQ-LIR-----E-N--A-T--E-------V-IDIIG----------QRVAF---IN-PV-T---GKP-VSDNP--KWQ-MFAVF------------------------------------------D-P--H-AE--------------G----FTTS-------YVT-----------YDK---ILEVAQ-----------------YD-K------------------------------------------------------------------------
      ECOM_RS18005_Escherichia_coli_485729004                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNQLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      RJ36_RS12145_Enterobacteriaceae_bacterium_strain_FGI_57_506488274              MT-DYN--G-S-------------------N-TP----------ADQRDLWRTPPSLFASLDAEFC-FQLDAAAA---PLNALCRKFIT-AE-----QNTL--ETPW-------------------------A--N--YLT-VPG--YVWLNPPYSD----------------------------ITPFVKKSA---VE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CLFT-------TVE-----------RDH---LMAFGN-------------------------KLLARREA--A---------------------------------------------------------
      IO46_03040_Gallibacterium_anatis_703606824                                     -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTAWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      EC2850400_RS19395_Escherichia_coli_487555289                                   MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--S-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      IO46_RS02995_Gallibacterium_anatis_757675697                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTAWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      SR72_RS20425_Enterobacter_cloacae_complex_695720049                            MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VTG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLSRREA--A---------------------------------------------------------
      SEEM1923_08275_Salmonella_enterica_555233171                                   MT-D----K-S-------------------N-TP----------IEIKDLWRTPPEIFHALNAEFC-FVLDAAAN---AENALCRLYIT-EQ-----QNTL--FTPW-------------------------K--E--VMPDIPG--YVWLNPPYSR----------------------------PMPFVKKAV---NE-NEDN-GI-G-CVMLLPADISVSWFI-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AI-T---GKP-VNGNN--KGS-MLVIW------------------------------------------H-P--Y-PR----S---------GG---CRMN-------TVD-----------RNV---LMKYGK-------------------------RRMKVTA-------------------------------------------------------------
      WQ86_RS10285_Escherichia_446051430                                             MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      PU01_RS20405_Hafnia_alvei_810918351                                            MS-EFS----S-------------------N-TP----------LEHKDRWQTPIEVFAALDAEFG-FYLDAAAD---HRNALCARYLT-DR-----DDAL--NSEW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWVENAA---EQ-CKAQ-SQ-P-VVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------GIMTGVKA--A---------------------------------------------------------
      CLOSCI_RS06430_[Clostridium]_scindens_748651356                                ------------------------MAPL--N--K----ALF---SSAKEDWATPQDFFDELNKEFH-FDLDPCAD---AENAKCKEFFT-KE-----QNGL--LQDW-------------------------G--G--R--------CVFCNPPYGRTS--------------------------TGEWIKKCY---EE-AQ-KPGT-V-VVALIPARTDTRFFH-DYI-Y---H-K--A----E---------IRFIK----------GRLHF-------------GGC-KDAAP--FPS-MVVVF-----RKGKENEEEKKTGCTAAGHTEEKAAEKDDGSENGVDGI-------------------------------------------------------------------------------------------------------------------------------------------------------------
      X858_RS0107890_Bacillus_subtilis_647261410                                     ------------------------------MDVH------F---SSKTDLWATPQYFFDELHKEFD-FELDVCAL---EDNAKCEKYFT-PE-----MDGL--KQEW-------------------------N--S-----------TCWMNPPYGR-G--------------------------IGEWVQKAY---ES-SL-K-GS-T-VVCLLPARTDTRWWH-DYC-M---K-G-------E---------IRLVK----------GRLKF-------------GES-KDNAP--FPN-AVVIF------------------------------------------G-E--K-AK--------------K----HTLI-------AM---------------------------------------------------------------------------------------------------------------------
      EC2867750_RS19465_Escherichia_coli_487513381                                   MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------GRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      CDL33251.1_Enterobacter_cloacae_ISC8_571251222                                 RS-GYG--G-S-------------------N-TP----------SDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AD-----QNTL--ETPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GLLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      EH105704_RS01255_Escherichia_hermannii_488393402                               MT-D----K-S-------------------N-TP----------PEDKDRWRTPPEIFHALNAEFC-FVLDAAAS---KENALCRSYIT-EM-----QDTL--ATDW-------------------------N--A--VMPDIPG--YAWLNPPYSK----------------------------PMPFVKKAA---QE-NADN-FT-G-CVMLLPADTSVAWFR-EAI-----S-T--A-H--E---------VRFIT-----G----GRLSF---LN-AT-T---GKA-VNGNN--KGS-ILVIW------------------------------------------H-P--Y-PR----T---------H----CQFS-------TVE-----------RDV---LMEYGR-------------------------RRTKAAA-------------------------------------------------------------
      SEEA0292_19103_Salmonella_enterica_554632055                                   MT-D----K-S-------------------N-TP----------IEIKDLWRTPPEIFHALNAEFC-FVLDAAAN---AENALCRLYIT-EQ-----QNTL--FTPW-------------------------K--E--VMPDIPG--YVWLNPPYSR----------------------------PMPFVKKAV---NE-NEDN-GI-G-CVMLFPADISVSWFI-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AI-T---GKP-VNGNN--KGS-MLVIW------------------------------------------H-P--Y-PR----S---------GG---CRMN-------TVD-----------RNV---LMKYGK-------------------------RRMKVTV-------------------------------------------------------------
      RDMS_RS01750_Deinococcus_sp_RL_736377798                                       M---------A---------------------------VHY---SSEKHDWTTPRSFFDELNAEFN-FTLDAAAS---PHNALCSRYFT-EA-----DDGL--SQPW-------------------------T--GT----------V-WCNPPYGR-Q--------------------------IGRWIAKAA---QS-AC-E-GA-T-VVMLIPARTDTAAWH-DHI-LFNPQ-A-------E---------VRFVR----------GRLRF-------------GDA-TANAP--FPS-AVIIF------------------------------------------R-P--G-GQ--------------G--------------------------------------------------------------------------------------------------------------------------------------
      NV79_RS06670_Enterobacter_hormaechei_757619257                                 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AD-----QNTL--ETPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-ST-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RGE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      K035_3825_Acinetobacter_baumannii_691039509                                    ------------------------------------------------------QDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-------------------------------------------------------------------------------------------------------------------------------------------------------------
      WQ88_RS24815_Escherichia_coli_823642731                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALHAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--S-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      YA69_RS12920_Cronobacter_sakazakii_765034080                                   MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFIKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENEKAA----------------------------------------------------
      VE18_RS11090_Enterobacter_cloacae_782730169                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFAYLDTEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YIWLNPPYSD----------------------------ITPFVKKAA---AE-SS-N-QI-G-TVMLVPADTLVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      MSQ_RS0117370_Escherichia_coli_485798016                                       MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PV----H---------T----ATLR-------PLI-----------VES------------------------------------------------------------------------------------------------------
      BN129_RS03185_Cronobacter_sakazakii_495122741                                  MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PIPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVD-----------RDE---LMAYGR-------------------------KRLEALKC--ENEKAA----------------------------------------------------
      A323_gp73_Acinetobacter_bacteriophage_AP22_388570840                           --------------------------------MN----VHF---SSDKQTWETPQDLFDKLNDIFN-FNLDACAE---HDTAKVKKYFT-ID-----DNAL--IQDW-------------------------I--G-----------SVWCNPPYNR-E--------------------------QIKFIEKAL---NE-SL-KHKS-T-VVLLIPARPETKVWQ-NVIFK---S-A--S----Q---------ICFIK----------GRLKF-------------GNS-KYNAP--FPS-ALIVF-----------------------------------------------G-KH----------------IDLSEFG-FCVY-------------------------------------------------------------------------------------------------------------------------
      K035_3825_Acinetobacter_baumannii_42057_4_629017472                            ------------------------------------------------------QDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA--------------------------------------------------------------------------------------------------------------------
      CSK29544_RS00070_Cronobacter_sakazakii_655998119                               MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EM-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRMEALKC--ENGKAA----------------------------------------------------
      E05_32470_Plautia_stali_symbiont_549071051                                     LI--------S-------------------N-TP----------KSFKDRWRTPIGVFKTLDAEFG-FKLDAAAD---KSNALCKAHLT-EQ-----QDAL--KCDW-------------------------N--S--K-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CRKQ-KK-T-IVMLLPSDTSTAWFH-EAL-----K-T--S-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GKEGKAGNS--KGS-VLFIW--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      BX81_RS08930_Escherichia_coli_693032238                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALHAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      HMPREF9548_RS22140_Escherichia_coli_446051428                                  MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------IPPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      ECKD1_RS05095_Escherichia_coli_446051426                                       MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAN---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      SG0744_Sodalis_glossinidius_str_'morsitans'_84779242                           II--------S-------------------K-TP----------KVCKDRWQTPVEIFRALDAEFG-FGLDAAAD---HDNTRCRHYLT-EE-----DDAL--SCDW-------------------------H--T--R-----G--AIFCNPPYSN----------------------------IMPWVKKAA---EQ-CALQ-QQ-T-VVMLLPSDTSTAWFA-QAQ-----K-T--A-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GEKGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----T---------P----KGMT-------TVS-----------KQV---LIN------------------RMWG--------------------------------------------------------------------------
      SG64_RS18700_Enterobacter_cloacae_798873157                                    NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFR-FQLDAAAA---PHNALCRRYIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLIARREA--A---------------------------------------------------------
      SGP1_RS06345_Sodalis_glossinidius_754365623                                    II--------S-------------------K-TP----------KVCKDRWQTPVEIFRALDAEFG-FGLDAAAD---HDNTRCRHYLT-EE-----DDAL--SCDW-------------------------H--T--R-----G--AIFCNPPYSN----------------------------IMPWVKKAA---EQ-CALQ-QQ-T-VVMLLPSDTSTAWFA-QAQ-----K-T--A-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GEKGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----T---------P----KGMT-------TVS-----------KQV---LIN------------------RMWG--------------------------------------------------------------------------
      MMA_RS11485_Janthinobacterium_sp_Marseille_501027971                           --------------------------------MS-K--VHF---SSATPEWYTPQSTFDVLNAEFG-FTLDPCCT---HENAKCDRHFT-MA-----ENGL--SQDW-------------------------S--NE----------VTFMNPPYGR-E--------------------------IKEWMRKAY---ES-SL-S-GA-T-VVCLVPARTDTAWWH-DYS-I---K-G-------E---------IRFLR----------GRLKF-------------GGA-KTNAP--FPS-AIVIF------------------------------------------R-P-------------------------LPIK-------ELA--------------------------------------------------------------------------------------------------------------------
      JO78_RS0107935_Cronobacter_malonaticus_696399167                               MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTKFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRMEALKC--ENGKAA----------------------------------------------------
      JP29_01125_Gallibacterium_anatis_702419560                                     -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRVA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDL------------------------------------------------------------------------------------------------------
      JP29_RS01080_Gallibacterium_anatis_746017794                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRVA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDL------------------------------------------------------------------------------------------------------
      L361_01863_Enterobacter_sp_MGH_15_578296709                                    NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-LLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      PU15_RS12295_Escherichia_coli_757742433                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNVEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      EQS_RS0120745_Escherichia_sp_TW15838_446051429                                 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADISVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      MC75_02025_Klebsiella_pneumoniae_721491398                                     MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAV---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      TJ25_RS25930_Escherichia_coli_766962597                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFST-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILACREA--A---------------------------------------------------------
      SG79_RS07240_Enterobacter_cloacae_798841194                                    NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PYNALCRRFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      UOK_RS0120300_Cronobacter_sakazakii_742402431                                  MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTKFID-EM-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSAQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENEKAA----------------------------------------------------
      Q770_03340_Klebsiella_pneumoniae_subsp_pneumoniae_PittNDM01_667708234          MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAI-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      RN00_RS02255_Klebsiella_pneumoniae_742851006                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-FLFIW------------------------------------------R-P--F-IS----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      DY84_RS0103980_Klebsiella_pneumoniae_639443683                                 MT-DYG--G-S-------------------N-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQNNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      L347_RS0123085_Enterobacter_cloacae_complex_695653183                          NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-LLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      L371_00766_Enterobacter_sp_MGH_25_555187647                                    NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A---------------------------------------------------------
      H922_23640_Citrobacter_freundii_GTC_09629_486073301                            SG-DYG--G-S-------------------K-TP----------PDQRDLWRTPPALFASLNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------G--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      UOM_RS0110205_Cronobacter_malonaticus_742403302                                ---D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNTLCTKFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFIKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENGKAA----------------------------------------------------
      BN131_RS17085_Cronobacter_malonaticus_696395149                                MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNTLCTKFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFIKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENGKAA----------------------------------------------------
      ETDT_RS00685_Edwardsiella_tarda_737631614                                      MT--IK----S-------------------N-TP----------ASAKDCWQTPLWLFDALDIEFG-FWLDAAAS---ESNALCAKYLT-EE-----DNAL--GCEW-------------------------E--S--A-----G--AIWCNPPYSK----------------------------IGPWVAKAA---EQ-SDRQ-IQ-T-VVMLVPEDMSVGWFT-DAL-----K-S--V-D--E---------VRVIT-----G----GRVNF---VH-AV-T---GAE-QKGNS--KGS-MLLIW------------------------------------------R-P--F-IN----P---------R----RMIT-------TIS-----------KST---LEAIGR-------------------------PVRSAA--------------------------------------------------------------
      SMDB11_RS12950_Serratia_marcescens_644361110                                   MSLVYA----S-------------------N-TP----------AEHKDRWQTPIEIFSALDVEFG-FYLDAAAD---HGNALCARYLT-EQ-----DNAL--AVDW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CRAQ-NQ-P-VVMLLPADTSTGWFS-LAL-----Q-S--V-D--E---------VRLIT-----D----GRLAF---IN-SA-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIGIGT-------------------------DILREVAA------------------------------------------------------------
      BR71_RS03710_Chromobacterium_haemolyticum_759948263                            MK--------S----------------K----TD-EASIHF---RSTRDDWETPQDLFDALHAEFG-FTVDVCAS---DKTAKCVRYYT-KA-----DNGL--AKDW-------------------------S--NE----------VVWMNPPFGH-V--------------------------TKRWMDKAR---LS-SM-R-GA-T-VVCLVPARVSVLWWH-RNV-FL--A-S-------E---------VRCLR----------PRLQF-------------VGA-AQKAP--FDA-VLVIF------------------------------------------R-P--G-DT--------------Q----AKLS------------------------------------------------------------------------------------------------------------------------------
      Q770_RS00955_Klebsiella_pneumoniae_763022815                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAI-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      RP28_RS19415_Leclercia_adecarboxylata_743514479                                MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-VE-----QNTL--ETPW-------------------------A--D--YLT-IPG--YAWLNPPYSD----------------------------ITPFVKKAA---AE-SK-N-QI-G-TVMLVPADTSVGWFR-EAI-----E-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLLIW------------------------------------------R-P--F-PR----T---------H----CHFA-------TVE-----------RDE---LMTFGA-------------------------KLLARREA--A---------------------------------------------------------
      HMPREF0731_4170_Roseomonas_cervicalis_ATCC_49957_296263068                     LG-RSG--T-T-----PS----F----L--N-TA-A----F---SSAYEAWATPPDLLERLYAAVGSIDLDPCSPGKLRSRVKAPRHFT-ER-----DDGL--AQEW-------------------------S--G-----------KVYMNPPYGR-T--------------------------IGAWTTKAR-V-EV-TAGR-AE-C-VVGLVPARTDTRWWH-ADV-A---G-H--A----H---------VWLLK----------GRLAF-------------GDG-STPAP--FPS-ALLLW---------------------GGN------------------A-P--------T-I--------------AEMS-A-----SFP-----------DAQ-H-IPARHR--------------------S----PDGAKREA--A---------------------------------------------------------
      JP30_07420_Gallibacterium_anatis_IPDH697-78_702412378                          -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIEVLE----F-P--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGW-----------------YG-K------------------------------------------------------------------------
      JP30_RS07235_Gallibacterium_anatis_746078506                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIEVLE----F-P--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGW-----------------YG-K------------------------------------------------------------------------
      AC06_RS01890_Escherichia_coli_693106202                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-A--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      SG74_RS03495_Enterobacter_cloacae_798873142                                    MT-DFT--G-S-------------------N-TP----------AEQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-ST-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RGE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      ECNIH2_RS14490_Enterobacter_cloacae_764909418                                  NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-LLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      J635_2258_Acinetobacter_baumannii_690998264                                    MN--------T----------------T----AK-L--GLFGNAEGRTDVWATPQKLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQDW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---QT-AN-Q-GH-T-VVGLLPTRTDVAWWQ-EHV-M---N-R-------E---------IHYIK----------GRLKF-------------GGC-KHNAP--FGC-AVVVF------------------------------------------R-P--S-LK--------------D----VQWG-------TQ---------------------------------------------------------------------------------------------------------------------
      JP35_RS08425_Gallibacterium_anatis_746004460                                   MS-------------------------F--D----------------KDAYPTPISLFNQINDEFN-FTIDGAAL---PHNAKLDRYIT-PE-----MDFM--TYPL-----------------------E-N--E-----------RIWINPPFSD----------------------------LHSFVKRAV---DL-YENH-DC-L-VVMLLPVDISTRWFS-LIV-----E-K--A-T--E---------IRFIV--G-------GRIKF---LN-PE-T---DK--WTDVC--RGN-HLAIF------------------------------------------D-P--K-HK----A-----M---G----QVIR-------HVH-----------------IDNFAN--LE--------W------------R-------------------------------------------------------------------
      N561_00905_Gallibacterium_anatis_665836508                                     -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      HMPREF0485_04750_Klebsiella_sp_1_1_55_289774595                                MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SIS-----------LAE---LKRIGN--WRLHDARRKRKRSPRPGSSLRRRDNQSDERK--A---------------------------------------------------------
      N561_00905_Gallibacterium_anatis_12656/12_540073363                            ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      F349_RS0103215_Enterobacter_cloacae_complex_516289844                          MT-DFT--G-S-------------------N-TP----------ADQRDLWRTPPALFSSLNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRLIT-----A----GRLAF---IN-PV-T---DKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      X657_RS20595_Klebsiella_pneumoniae_694095222                                   MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SIS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      RK16_RS24410_Escherichia_coli_693100364                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      KPR_RS19370_Klebsiella_pneumoniae_529982416                                    MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      MTE1_RS05745_Klebsiella_pneumoniae_490299083                                   MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIRT--QEAA---------------------------------------------------------------------------------------
      A225_RS06730_Klebsiella_oxytoca_504650526                                      MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RHSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      N625_RS17185_Klebsiella_pneumoniae_757706267                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      L420_RS03025_Enterobacter_cloacae_556358052                                    MT-DYT--G-S-------------------N-TP----------EDQRDLWRTPPALFAALNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--GTPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDD---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      HMPREF0485_RS01290_Klebsiella_sp_1_1_55_695778461                              MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SIS-----------LAE---LKRIGN---------------------------------------------------------------------------------------------
      TB84_RS11250_Klebsiella_pneumoniae_749592663                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVHF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IS----P---------R----HIIT-------TVS-----------LAE---LKRIGT--LEAA---------------------------------------------------------------------------------------
      N035_RS243200_Klebsiella_pneumoniae_589884974                                  MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      L461_RS22430_Klebsiella_pneumoniae_556221454                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      N559_RS20150_Klebsiella_pneumoniae_530706273                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RHSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGT--LEAA---------------------------------------------------------------------------------------
      TB56_RS26830_Klebsiella_pneumoniae_556477177                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQNNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      XALC_0202_Xanthomonas_albilineans_GPE_PC73_283472241                           MS-QYV----D--W-------------Y--G-------------KGRAQNWRTPQSIFDALHDEFQ-FTLDGASE---PGNGLLPLAST------A-DEQI----DW-------------------------T--G--H--------RVFCNPPWSN----------------------------IRPFLERAP---AA------DC---AVFLVPARTNAKWFH-RAI-----D-L--G-A--A---------VRFFE----------GRPKF-E-LP-HR-----SGP-GNSSP--VDC-LLLIL------------------------------------------R-K--D-VA--------------R---------------EVQ--G-----------------------------------------------------------------------------------------------------------------
      P244_RS19660_Klebsiella_pneumoniae_746037254                                   MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LTE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      KPST82_RS05430_Klebsiella_pneumoniae_763385574                                 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      KPHS_12370_Klebsiella_pneumoniae_504108903                                     MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LTE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      IO43_07585_Gallibacterium_anatis_7990_703617381                                -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KV-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      IO43_RS07410_Gallibacterium_anatis_746082790                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KV-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      TB82_RS16260_Klebsiella_pneumoniae_749548111                                   MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASIT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      HMPREF1307_RS05885_Klebsiella_pneumoniae_490281197                             MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      F403_gp088_Enterobacteria_phage_vB_KleM-RaK2_422937337                         KH------A-V-------------------H-FS----------TRKNDLWTTPKPLFDKLNALWN-FTVDVACS---NETALCLKHYT-PE-----DDGL--SQDW-------------------------S--N--E--------TFWLNPPYSD----------------------------LSPWLSKSV---ED-YN-R-GA-T-GLILVPARTDTRAFQ-NFA-----S-PFCD----A---------MCFIK----------GRLKFGNPL-------------KPNDK--LTS-A------------------------------------------------P--F-PS----C---------I----IVLD-------KNL-----------TQA---KIDCLK--------------------------SLGNTMV--N----I----------------------------------------------------
      XA43_RS13245_Escherichia_coli_817696779                                        MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SREQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      nADLYRO1b_RS11635_Yersinia_ruckeri_740410430                                   MS-DYG--G-S-------------------H-TP----------DNLKDLWQTPNDIFAALDLEFG-FYLDAAAS---HQSALCARYLT-ER-----DDAL--NCEW-------------------------I--S--Y-----G--AIWCNPPYSN----------------------------ITPWVQKAA---EQ-CREQ-NQ-I-VVMLIPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CTFT-------IVK-----------RDE---LKAIGQ-------------------------EILTGSKT--A---------------------------------------------------------
      B4086_RS03845_Bacillus_cereus_822506548                                        ------------------------------MTIN-K--GMF---TSKTDLWATPQYFFDELHKEFN-FELDVCAL---EDNAKCEKYFT-PE-----MDGL--KQEW-------------------------N--G-----------TCWMNPPYGR-G--------------------------IGKWVQKAY---ES-SL-T-GS-T-VVCLLPARTDTRWWH-DYC-M---N-G-------E---------IRLVK----------GRLKF-------------GDS-KNSAP--FPN-AVVIF------------------------------------------G-E--K-AK--------------K----HTLI-------AM---------------------------------------------------------------------------------------------------------------------
      L383_01094_Enterobacter_sp_MGH_37_578289375                                    NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      L383_01874_Enterobacter_sp_MGH_37_578286731                                    NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A---------------------------------------------------------
      AIA83183.1_Podovirus_Lau218_643012085                                          MK----------------------------N-RN----------IKHSDNWATPKELYNELDKEFN-FDFDPC-----PLNSSVDGL----------DEDL----SW-------------------------G--K-----------SNFVNPPYSL-K-------------------L------KTDFVKRAV-K-EK-HK---GN-T-CILLLPVSTSTKLFH-EDI-L---P-N--A-D--D---------IRFLK----------GRVKF---IG-TN-T-K-GVL-VSNKCGMHDT-MVVIF------------------------------------------K-G----KR--------------K--------------------------------------------------------------------------------------------------------------------------------------
      AB186_07590_Klebsiella_pneumoniae_828953686                                    MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RMSNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGT--LEAA---------------------------------------------------------------------------------------
      U074_RS0104770_Escherichia_coli_657257859                                      MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RRSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      L415_RS13205_Klebsiella_pneumoniae_556400945                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGT--LEAA---------------------------------------------------------------------------------------
      AB07_0778_Escherichia_coli_5-172-05_S1_C1_660087059                            NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-KI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LVAFGA-------------------------KLLARREA--A---------------------------------------------------------
      A965_RS0108215_Enterobacter_cloacae_648328174                                  NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--EMPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---DKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      L964_RS00605_Leuconostoc_pseudomesenteroides_491052808                         ---------------------------M--N-SK----ALF---SSKSMVWETPKDYFDKLNRKFK-FDLDACAS---DTNHKVDTYFT-ED-----DDAL--EQKW-------------------------G--G-----------NVFMNPPYGR-H--------------------------IGEFIKKAY---EE-HL-RDPN-RFIVMLIPSRTDTKYWH-EYI-Q---D-K--A----T---------VKFIK----------GRLKF-E-LD--------GRP-MNTAP--FPS-ALIIY-----------------------------------------------G-L------------------------------------------------------------------------------------------------------------------------------------------------------
      JP28_09585_Gallibacterium_anatis_702415297                                     -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      ABR28_RS04400_Enterobacter_sp_GN02358_829891415                                MT-DYT--G-S-------------------N-TP----------EDQRDLWRTPPALFAALNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--GTPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNS--KGS-ILIIW------------------------------------------R-P--Y-PR----T---------H----CEFT-------TVE-----------RDV---LMEFGT-------------------------KLLARREA--A---------------------------------------------------------
      L365_RS11955_Klebsiella_pneumoniae_556494180                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCDHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      JL04_RS05590_Gallibacterium_anatis_746010315                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      ASUC_RS06415_Actinobacillus_succinogenes_501020288                             MS--------------K----------F--D----------------KDTYPTPLSLFLPLDAEFN-FTLDGAAL---PNNAKCDRYVT-PE-----MDFL--TYQL-----------------------Q-N--E-----------RIFINPPFSD----------------------------PLSFIKRSI---EL-FEYY-NC-L-VVMLLPVDISTEWFS-LIT-----R-K--A-T--E---------IRFIV--G-------GRIKF---VS-PE-T---GD--WTDVC--RGN-HLAIF------------------------------------------D-P--R-HR----N-----M---G----QVIR-------NIH-----------------IDDLGK--FE--------W------------RVNSRKRK--P---------------------------------------------------------
      ABR33_RS14555_Enterobacter_sp_GN02548_829940915                                MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-IA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      RM98_RS09640_Chromobacterium_violaceum_759929100                               MA--------D----------------Q----AE-N--IHF---RSGRDDWETPHDLFASLNAEFG-FTVDVCAS---EKTAKCPRYYT-PA-----MNGL--AQDW-------------------------G--GE----------TVWMNPPFGH-V--------------------------TKRWMDKAR---LS-SL-Q-GA-T-VVCLVPARTSVLWWH-RNV-FL--A-S-------E---------VRCIR----------PRLQF-------------VGA-AQKAP--FDA-VLVVF--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      H922_RS0124235_Citrobacter_freundii_696358583                                  SG-DYG--G-S-------------------K-TP----------PDQRDLWRTPPALFASLNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------G--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      HK32_RS26080_Klebsiella_pneumoniae_523682820                                   MT-DYV--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RMSNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEA----------------------------------------------------------------------------------------
      J479_2646_Acinetobacter_baumannii_691127129                                    MS--------T----------------M----AK-L--GLFGNAEGRTDVWATPQTLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ITLWIDKAV---QT-AN-Q-GH-T-VVGLLPARTDVTWWQ-EHV-M---N-R-------E---------IHYIK----------GRLKF-------------GGC-KHNAP--FGC-AVVVF------------------------------------------R-P--S-LK--------------D----VQWG-------AQ---------------------------------------------------------------------------------------------------------------------
      ABR28_RS09465_Enterobacter_sp_GN02358_829892043                                MT-DYT--G-S-------------------N-TP----------AEQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETSW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SI-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-N--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLSRREA--A---------------------------------------------------------
      AW35_RS0117640_Klebsiella_pneumoniae_657698125                                 MT-DYV--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RMSNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      L466_01900_Enterobacter_sp_BIDMC_30_578249703                                  NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      AB07_RS0123005_Escherichia_coli_696361303                                      NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-KI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LVAFGA-------------------------KLLARREA--A---------------------------------------------------------
      TA98_RS18955_Klebsiella_pneumoniae_749548558                                   MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------S--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      PROVALCAL_RS03515_Providencia_alcalifaciens_493708059                          MA-VYA----S-------------------H-TA----------PADKDCYQTPQWLFEAMTAEFG-FWLDVAAS---KQNALCVDFFT-QE-----QDAL--KQEW-------------------------F--S--K-----G--AIWCNPPYSN----------------------------IKPWVEKAA---EQ-YLEQ-NQ-P-IVMLVPEDKSTSWFS-LAL-----K-S--V-D--E---------IRVVI-----D----GRINF---VD-PT-T---GKE-KRGNN--KGS-MFLIW------------------------------------------R-P--F-TE----P---------K----RVTT-------HVS-----------KKR---LMEIGY-------------------------SILGVA-A------------------------------------------------------------
      C243_RS0119615_gamma_proteobacterium_WG36_516062979                            MKSDYIGAGQS-------------------Q-TP----------AEHKDRWQTPVEIFDALDLEFG-FYLDAAAD---LSNALCSHYLT-EY-----DDSL--SCDW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------VTPWVSKAA---EQ-CKAQ-NQ-P-VVMLLPADTSTGWFS-EAL-----K-T--V-D--E---------VRFIT-----D----GRIGF---IN-AG-T---GKPGKSGNS--KGS-MLFIW------------------------------------------R-P--F-IK----P---------R----CMFT-------TIS-----------RDD---LIVIGS-------------------------EV-RGVSA--A---------------------------------------------------------
      SS16_RS19255_Enterobacter_cloacae_779858102                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--QTPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-S---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      TB70_RS08620_Klebsiella_pneumoniae_694081399                                   MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      GGE_RS10915_Haemophilus_haemolyticus_763376104                                 ---------------------------MT------EQ-------QFDKDTWQTPRYVFEWLSQRFGWFDLDGCAT---ANNALTWRYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-L-D--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-RD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----YVTR-------SIS-----------LDF---IKKVGG-----------------YS-K------------------------------------------------------------------------
      UO85_RS18470_Enterobacter_cloacae_770797848                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------S--D--YLS-IPG--YVWLNPPYSN----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      KPR_RS15135_Klebsiella_pneumoniae_529980423                                    MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      GRPL_RS04760_Raoultella_planticola_695777676                                   MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSDW-------------------------T--S--Y-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--F-IS----P---------R----HIIT-------SVS-----------LAE---LKRIGT--LEEA---------------------------------------------------------------------------------------
      AAX16_RS03635_Haemophilus_haemolyticus_822519471                               ---------------------------MT------EQ-------KFDKDTWQTPHYVFEWLSQRFGWFDLDGCAT---ANNALTWRYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-A--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-RD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--A-ME--------------D----FVTR-------SIS-----------LDF---IKKVGG-----------------YD-G--A---------------------------------------------------------------------
      L383_RS0123770_Enterobacter_sp_MGH_37_695674014                                ---DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      UA70_28325_Raoultella_planticola_767053612                                     -----------------------------------------------RDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-YQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASIT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--F-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      AF41_RS16340_Citrobacter_sp_MGH_55_757783319                                   SG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------V--D--YLS-IPG--YVWLNPPYSD----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGV-------------------------KLLARREA--A---------------------------------------------------------
      SS39_RS14470_Enterobacter_asburiae_779796092                                   NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPAIFVSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      JP46_RS0122155_Enterobacter_cloacae_692191212                                  HS-GYG--G-S-------------------N-TP----------AEQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNTLCRKFIT-AE-----QDTL--ETPW-------------------------A--D--YLT-IPG--YAWLNPPYSD----------------------------ITPFVKKAA---AE-SK-N-QI-G-TVMLVPADTSVGWFR-EAI-----E-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--F-PR----T---------H----CEST-------FVE-----------RDV---LMTFGA-------------------------KLLARREA--A---------------------------------------------------------
      T636_A2961_Enterobacter_cloacae_MRSN_11489_728967019                           MT-DFT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--QTPW-------------------------A--D--YLN-VPG--SVWLNPPYSD----------------------------ISPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      AAV10_RS05315_Enterobacter_cloacae_complex_695626939                           MT-DFT--G-S-------------------N-TP----------ADQRDLWRTPPVLFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SA-N-QI-G-TVMLVPADTSVGWFN-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      L371_RS0123590_Enterobacter_sp_MGH_25_695714549                                ---DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A---------------------------------------------------------
      ESMG_RS20740_Escherichia_coli_446051427                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAN---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      ABR38_RS19795_Enterobacter_sp_GN02825_829773120                                NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A---------------------------------------------------------
      G468_RS0102435_Arsenophonus_nasoniae_652426387                                 LI--------S-------------------H-TP----------KPFKDRWRTPIEVFRALDAEFN-FKLDAAAD---KNNALCKAFLT-EQ-----QDAL--TCDW-------------------------N--S--N-----G--AIFCNAPYSK----------------------------IMPWVKKAA---EQ-CRKQ-NQ-T-IVMLLPSDTSTAWFY-EGL-----N-T--A-D--E---------IRFIT-----E----GRLSF---VS-AE-T---GEQGISGNS--KGS-V------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      L466_RS0122390_Enterobacter_sp_BIDMC_30_695758762                              NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      AF41_03280_Citrobacter_sp_MGH_55_635724739                                     SG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------V--D--YLS-IPG--YVWLNPPYSD----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGV-------------------------KLLARREA--A---------------------------------------------------------
      SG64_RS05740_Enterobacter_cloacae_798869855                                    SG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNS--KGS-ILIIW------------------------------------------R-P--Y-PR----T---------H----CEFT-------TVE-----------RDV---LMEFGS-------------------------KLLARREA--E---------------------------------------------------------
      L349_RS23330_Enterobacter_cloacae_complex_550795579                            MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFY-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      ABR37_RS08920_Enterobacter_sp_GN02768_829838493                                MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      SG82_RS22470_Enterobacter_cloacae_798866512                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGT-------------------------KLLARREA--A---------------------------------------------------------
      KU61_RS02975_Enterobacter_cloacae_704505064                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFR-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      SGP1_RS10390_Sodalis_glossinidius_499730277                                    IV--------S-------------------Q-TP----------KACKDRWQTPVEIFRALDAEFR-FCLDAAAN---HDNTLCRCYLT-EE-----DDAL--SCDW-------------------------Y--T--R-----G--AIFCNPPYSN----------------------------ITPWVRKAA---EQ-CVVQ-QQ-T-IVMLLPSDTSTGWFR-LGL-----E-S--V-D--E---------VRVIT-----G----GRLSF---IS-AA-T---GVCGKNGNS--KGS-LLFIW------------------------------------------R-P--F-FK----N---------R----CQFT-------TVD-----------KSD---LIRIGT-------EVVR----KVAA--------------------------------------------------------------------------
      QQ39_06370_Pragia_fontium_827401593                                            MS-DFG--G-S-------------------N-TP----------AELKDRWQTPDNIFHALDAEFG-FYLDAASE---PHNALCSRFLT-SA-----DDSL--SCDW-------------------------G--S--Y-----G--SIWCNPPYSN----------------------------ITPWIVKAA---EQ-CKKQ-RQ-P-IVMLLPADTSTGWFS-LAL-----K-S--V-D--E---------IRIVT-----D----GRIQF---IN-AG-T---GKKGKNGPG--KGN-LFLIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TIS-----------RDE---LIGIGE-------------------------SILEGVKT--A---------------------------------------------------------
      RN16_RS04075_Chromobacterium_subtsugae_759887196                               MA--------D----------------L----SE-Q--IHF---SSKTDEWPTPQALFDQLHAEFG-FTLDVCAT---QENAKCERFFT-RE-----QDGL--AQDW-------------------------S--RE----------VVWMNPPFGH-Q--------------------------IKLWMAKAY---RS-SI-D-GA-L-VVCLVPARTDTRWFH-RHA-LK--A-A-------E---------IRALD----------KRLRF-------------DGA-KAKAP--FPA-VLVVY--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      SS33_RS24310_Enterobacter_sp_35730_772624651                                   MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PS----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A---------------------------------------------------------
      eiDWFOrf6_Edwardsiella_phage_eiDWF_318064446                                   MS-GYH--D-S-------------------K-TA----------PEDKDCWRTPPEVFRYAVRTWGAFEIDAAAA---DHNHLVADYWT-LA-----DNAL--VQDW-------------------------S--G--K--------RVWCNPPYSD----------------------------IGPWVEKAA---TA-EF--------CVMLVPADTSVKWFA-TAG-----E-L--G-A--S---------VIFIT-----R----GRLRF---IH-NA-T---GKP-GPSNK--MGS-CFLVF------------------------------------------G-G--S-RP----G---------R------VD-------FVT-----------RAG---VYQIGA----------------------RR-KVTVKRRV-----------RAPHNAT------------------------------------------
      KV31_RS01780_Enterobacter_cloacae_complex_692189073                            RT-GYG--G-S-------------------H-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-SE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      MYSTI_RS09680_Myxococcus_stipitatus_505160458                                  --------------------------------MN-P--VHF---SSASAEWATPRDLFARLHAEHE-FTLDVCAT---EENTVLPRFYT-RN-----DNGL--AQDW-------------------------A--G--E--------RCWMNPPYGT-AKHACKPDCAKKACEKRGQHIPEYVPGIQDWVEKAA---TC------GS-L-VVALLPARTDTRWWH-RHI-W---D-V-------DRDAPRPGVRVKFFR----------GRLKF-------------GGR-KTGAP--FPS-ALVTF-----------------------------------------------G-VQ--------------S--------------------------------------------------------------------------------------------------------------------------------------
      SS49_RS01825_Enterobacter_cloacae_779812391                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------G--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLIARREA--A---------------------------------------------------------
      OOC_RS12555_Providencia_rettgeri_491050038                                     MA-VYS----S-------------------N-TA----------PEDKDCWQTPQWLFEALTLEFG-FWLDAAAN---EQNALCPYFLT-IE-----QNAL--QSDW-------------------------V--S--R-----G--AIWCNPPYSK----------------------------IKPWIAKAA---EQ-CTKQ-NQ-P-IVMLLPADKSTSWYS-LAL-----K-S--V-D--E---------VRTII-----D----GRINF---VD-PN-T---GKE-KKGNS--KGS-ILLIW------------------------------------------R-P--F-VE----P---------K----AIGT-------HIS-----------KNR---LMEIGN-------------------------AILGVA-A------------------------------------------------------------
      JP33_05990_Gallibacterium_anatis_CCM5995_702395340                             -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----K-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      JP33_RS05860_Gallibacterium_anatis_746094489                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----K-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      UMN179_RS11845_Gallibacterium_anatis_503512608                                 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--AEDF-LTFDPLD-----LIEVLE----F-P--H--V--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTAWYK-VIE-----E-K--A-T--E-------V-IDITGYYDEKGRWKNGRISF---LH-PT-E---NVE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VMKCGG-----------------YG-N------------------------------------------------------------------------
      SR65_RS01890_Enterobacter_asburiae_779958989                                   NG-DYG--G-S-------------------K-TP----------IDQRDLWRTPPALFASLDSEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETLW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RNE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      dam_Yersinia_phage_PY100_164414537                                             KR-DFG--G-S-------------------T-TP----------KDIRDLWATPQWLFDYFNEIYK-FDLDAAAN---DINHKCDNYLT-LE-----NDGIVEEHEW-------------------------I--C--E-----S--AVWCNPPYSD----------------------------PQPWIEKAI---NE-SS-L-GV-L-SVMLLPCDPSTEWFH-LAS-----K-S--A-S--K---------IYILT-----G----GRVQF---VR-AD-T---GEE-QRGNP--KGS-VLFVF------------------------------------------D-P--N-DG----D---------Q-------E-------TIY-----------LP----IWEAGG--KEPR--------------WF---KSWTLKEE--E--------E------------------------------------------------
      EB105725_RS05190_Shimwellia_blattae_488371093                                  MS-DYG--G-S-------------------K-TP----------VPERDLWQTPASIFTALDIEFG-FYLDVAAA---PHNALCARFMT-EH-----EDAL--NSDW-------------------------S--S--Y-----G--AIWCNPPYSD----------------------------ITPWIRKAA---EQ-CQKQ-HQ-T-VVMLLPADISTGWFS-LAL-----Q-T--V-D--E---------IRLIT-----N----GRIQF---VP-ASVS---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--F-IS----P---------R----GIFT-------TVS-----------KPA---LEDAGQQYLDEV-AA------------------------------------------------------------------------------------
      BU34_RS16325_Escherichia_coli_643945869                                        MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---KQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RRSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA---------------------------------------------------------------------------------------
      ECCZ_RS00820_Escherichia_coli_559190709                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWCTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LIAFGS-------------------------RILARREA--A---------------------------------------------------------
      SS08_RS15735_Enterobacter_cloacae_749204695                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWMNPPYSD----------------------------ITPFVNKAA---TE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      ERIG_RS16875_Escherichia_fergusonii_446051425                                  MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFD-FQLDAAAN---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      VP22_RS12395_Escherichia_fergusonii_803565941                                  MT-DFT--G-S-------------------N-TP----------AEHRDSWCTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      HMPREF0731_RS06220_Roseomonas_cervicalis_750330482                             ---------------------------M--N-TA-A----F---SSAYEAWATPPDLLERLYAAVGSIDLDPCSPGKLRSRVKAPRHFT-ER-----DDGL--AQEW-------------------------S--G-----------KVYMNPPYGR-T--------------------------IGAWTTKAR-V-EV-TAGR-AE-C-VVGLVPARTDTRWWH-ADV-A---G-H--A----H---------VWLLK----------GRLAF-------------GDG-STPAP--FPS-ALLLW---------------------GGN------------------A-P-----------------------------------------------------------------------------------------------------------------------------------------------------------
      AB28_RS19280_Escherichia_coli_695802868                                        MT-DFT--G-S-------------------K-TP----------VEQRNLWQTPIPLFVALDAEFC-LTLDAAAS---TDNALCNRYIT-EE-----QNTL--TTPW-------------------------A--D--FLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-ST-N-QI-G-TVMLVPADTSVGWFR-EAI-----E-T--A-S--E---------VRFIV-----G----GRLAF---IN-PV-S---GKP-VSDNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CQFT-------TVE-----------RDA---LLSFGA-------------------------RLIAKREA--A---------------------------------------------------------
      G869_RS17520_Escherichia_coli_486132694                                        MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--D---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNS--KGS-ILIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      SEEB0179_06810_Salmonella_enterica_555260527                                   MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPAIFAALDAEFC-FQLDTAAA---PHNALCRRFIT-EE-----QNTL--VTPW-------------------------A--D--YMS-IPG--HVWMNPPYSD----------------------------IMPFVKKAA---AE-SK-N-QI-G-TVMLVPSDTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLLIW------------------------------------------R-P--Y-PR----T---------Q----CDLT-------TVE-----------RDV---LIEFGS-------------------------ARLARREA--A---------------------------------------------------------
      L743_RS07620_Serratia_marcescens_742394131                                     MSAVFA----S-------------------N-TP----------PEHKDRWQTPIEVFNALDVEFG-FFLDAAAD---DGNALCAHYQT-EQ-----DNAL--SIDW-------------------------V--S--Y-----G--AIWCNPPYSD----------------------------ITPWVIKAA---EQ-CHVQ-NQ-P-IVMLLPADTSTGWFS-LAL-----Q-S--V-D--E---------VRFIT-----D----GRLAF---IN-SA-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFS-------TIS-----------RDE---LLRIGK-------------------------GIAMEVSV--A---------------------------------------------------------
      US97_C0007G0018_Microgenomates_bacterium_GW2011_GWF1_38_5_818397581            MD--------A---------------------------VLF---SRKSDEWTTPEATYVGLDAEFH-FTDDPCPL---GA-----------------TDGL--EREW-------------------------K--G-----------SVYVNPPYSK----------------------------IAAFVEKAIQELDE-GH---AH-T-VVFLVPSRTDTRWFH-RYV-L---G-R--G-G--E---------IRFIK----------GRLKF---------S---GS--KNSAP--FPS-MIVIW------------RDRKMGI-------------------------PD-Y-EE----R-L-------D----EVFT-G-AF--TTN-----------HPA---MIDWVQ--VKME-----------LK------RLRNNRRE------------------------------------------------------------
      SPM24T3_RS16925_Serratia_sp_M24T3_497323801                                    MKSDHLGL--S-------------------S-TP----------AEHKDRWQTPVEIFDALDLEFG-FYLDAAAD---QSNALCSHYLT-EQ-----DDSL--SCEW-------------------------T--S--H-----G--AIWCNPPYSA----------------------------PPPWVAKAA---EQ-CRIQ-KQ-P-VVMLMPADTSTGWFS-EAL-----K-T--V-D--E---------VRFIT-----D----GRIGF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CMFT-------TVS-----------RDE---LLVIGS-------------------------EV-RGVSA--A---------------------------------------------------------
      AAX16_RS08715_Haemophilus_haemolyticus_822520202                               ---------------------------MT------EQ-------QFDKDTWQTPKYVFNWLEIKCGSFDVDGCAS---SENALCKEYID---------------SDF--------DFLTCSMRGFQNCCEK-E--N--L--------KIYVNPPYSD----------------------------VTPFLIRAK---EL-RD-A-GH-L-VVMLLNNDKSTQWYQNHIH-----N-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SIS-----------LDF---IKKVGG-----------------YI-HVEK---------------------------------------------------------------------
      P833_RS20130_Enterobacter_cloacae_695744722                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFR-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRLIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      CH09_gp15_Edwardsiella_phage_eiAU-183_589889277                                MS-GYH--D-S-------------------K-TA----------PEDKDCWRTPPEVFRYAVRTWGSFEIDAAAA---DHNHLVADYWT-LA-----DNAL--VQDW-------------------------S--G--K--------RVWCNPPYSD----------------------------IGPWVEKAA---TA-EF--------CVMLVPADTSVKWFA-TAG-----E-L--G-A--S---------VIFIT-----R----GRLRF---IH-NA-T---GKP-GPSNK--MGS-CFLVF------------------------------------------G-G--S-RP----G---------R------VD-------FVT-----------RAG---VYQIGA----------------------RR-KVTVKRRV-----------RAPHNAT------------------------------------------
      RM98_RS18265_Chromobacterium_violaceum_759932528                               MA--------D----------------L----SE-Q--VHF---SSKTDEWPTPQALFDQLHEEFG-FTLDVCAT---AENAKCERFFT-RE-----QDGL--AQDW-------------------------S--RD----------VVWMNPPFGH-Q--------------------------IKLWMAKAY---RS-SI-D-GA-L-VVCLVPARTDTRWFH-RHA-LK--A-A-------E---------IRALD----------KRLRF-------------DGA-KAKAP--FPA-VLVVY--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      ABF72_01710_Enterobacter_cloacae_829343115                                     NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-HI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      ETAC_RS11795_Edwardsiella_piscicida_505274905                                  MS--VK----S-------------------N-TP----------AEAKDCWQTPLWLFDALDLEFG-FWLDAAAS---ESNALCVKYLT-EV-----DNAL--GCEW-------------------------E--S--A-----G--AIWCNPPYSK----------------------------IGPWVAKAA---EQ-SARQ-IQ-T-VVMLVPEDMSVGWFS-EAL-----K-T--V-D--E---------VRVIT-----G----GRVNF---VH-AV-T---GAE-QKGNS--KGS-MLLIW------------------------------------------R-P--F-TT----P---------L----HRIT-------TVS-----------KSM---LEAIGR-------------------------PVRSAA--------------------------------------------------------------
      IO48_RS08405_Gallibacterium_anatis_746097630                                   ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--AEDF-LTFDPLD-----LIEVLE----F-P--H--V--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------
      ETAF_RS16405_Edwardsiella_tarda_504339421                                      MS--IK----S-------------------N-TP----------AEAKDCWQTPLWLFDALDLEFG-FWLDAAAS---ESNALCVKYLT-EV-----DNAL--GCEW-------------------------E--S--A-----G--AIWCNPPYSK----------------------------IGPWVAKAA---EQ-SARQ-IQ-T-VVMLVPEDMSVGWFS-EAL-----K-T--V-D--E---------VRVIT-----G----GRVNF---VH-AV-T---GAE-QKGNS--KGS-MLLIW------------------------------------------R-P--F-TT----P---------L----HRIT-------TVS-----------KSM---LEAIGR-------------------------PVRSAA--------------------------------------------------------------
      ABF80_10955_Enterobacter_cloacae_829278978                                     MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLR-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SD-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARRES--A---------------------------------------------------------
      BGIM_RS49305_Zavarzinella_formosa_750606593                                    MN----------------------------D-QH----------KEIRGCWRTSPAVFNKLEGIFG-FTIDACAD---RDNHLLPRYWT-EE-----DDAL--TQDW-----------------------S-E--E-----------RVFCNPPF----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      L467_RS08095_Klebsiella_pneumoniae_694065919                                   RH------G-A-------------------K-IT----------ETGSDDWQTPRVIYEALNKRFK-FTRDAAAT---KQNSHCARYWT-KE-----DDAL--LMDW-------------------------S--Q--E-------KSIFCNPPYSK----------------------------VAEFLAKAH---EP-------E-T-AVFLIPFRPQTGFFL-QFV-----W-A--SPYLHE---------MMIIH----------RGIRF---I-------------HPDRV--ESVRS------------------------------------------------P--M-PV----V---------V----LVYR-------NKP-----------RKR-D-LLITVN-------------------------CADSLHTL--H----VVAGQRPGHPLEHGHSIRNKIIQEYQRGATVAELVRKYEGKVSRRSIYRWVKG
      HICON_RS06920_Haemophilus_influenzae_503292971                                 ---------------------------MT------GQ-------QFDKDTWQTPHYVFEWLSQRFGLFDLDGCAT---ANNALTCHYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-A--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-CD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SIS-----------LDF---IKKIGG-----------------YS-K------------------------------------------------------------------------
      SS28_RS07355_Enterobacter_cloacae_779872478                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWMNPPYSD----------------------------ITPFVNKAA---TE-SA-N-QI-G-AVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------AVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      L463_02765_Enterobacter_sp_BIDMC_27_578260023                                  RT-GYG--G-S-------------------H-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-SE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      JP31_RS04730_Gallibacterium_anatis_746070689                                   ME-------------------------F--D----------------KDAYPTPISLFNQINDEFN-FTIDGAAL---PHNTKCERYIT-PE-----MDFL--KYPL-----------------------V-N--E-----------RIWINPPFSE----------------------------PLSFVKRAV---EL-YENH-DC-L-VVMLLPVDISTKWFS-LVA-----E-K--A-T--E---------IRFIV--G-------GRIKF---LN-PE-T---DK--WTDVC--RGN-HLAIF------------------------------------------N-P--A-HK----S-----M---G----QAIR-------HIH-----------------ISRFKN--LE--------W------------R-------------------------------------------------------------------
      HMPREF1315_RS07015_Bifidobacterium_longum_494116860                            AS--------NFYK------------------AG-A--AAM---TSNKDDWETPQSLFDQLDEEFH-FILDAASS---DQNAKCEHHYT-AE-----NSGL--EHSW-------------------------E--G--E--------TVFCNPPYGR-N--------------------------IGDWIRKAS---QE-AS-KPDT-L-VVLLVPARTDTRWFQ-NHI-L---H-R--A----E---------VRFLP----------GRLKY-E-VN--------GQA-GEAAP--------SFW------------------------------------------R-E--G-TP--S-F------------------------------------------------------------------------------------------------------------------------------------------------
      DJ57_RS06970_Yersinia_kristensenii_740850846                                   MS-DFG--G-S-------------------N-TP----------DNLKDLWMTPADIFTALDIEFG-FYLDAAAS---NKSALCARYLT-EQ-----DDAL--NSAW-------------------------E--S--Y-----G--AIWCNPPYSD----------------------------ISPWVTKAT---EQ-CKQQ-LQ-T-VVMLVPADSSVGWFS-QAL-----Q-S--V-D--E---------VRFIT-----D----GRISF---LR-SD-T---GKP-INGNN--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CMFT-------RVK-----------RDE---LKAIGQ-------------------------EILTGSKA--A---------------------------------------------------------
      KS43_RS19035_Pectobacterium_carotovorum_746454692                              MT--LK----S-------------------N-TS----------ADDKDRWQTPLWLFDALDIEFG-FYLDVAAS---GKNALCANYLT-ES-----DDAL--NTDW-------------------------V--S--H-----G--SVWCNPPYSK----------------------------ITPWVEKAA---EQ-YRKQ-NR-N-VVMLIPEDMSVGWFS-LAL-----N-S--V-D--E---------VRVIT-----D----GRVNF---VE-PS-T---GME-KKGNS--KGS-MLLIW------------------------------------------R-P--F-TT----P---------R----RIIT-------TVS-----------KPL---LMNIGQ-------------------------GIRRAA--------------------------------------------------------------
      L422_RS03620_Enterobacter_cloacae_556329507                                    MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLR-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SD-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A---------------------------------------------------------
      SU59_RS00780_Haemophilus_influenzae_756151459                                  ---------------------------MT------EQ-------QFDKDTWQTPCYVFEWLSQRFGLFDLDGCAT---ANNALTCHYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-D--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-CD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGYS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SIS-----------LDF---IKKVGG-----------------YI-RMEK---------------------------------------------------------------------
      PSNIH1_RS00725_Pantoea_sp_PSNIH1_746340387                                     MA-GYH--D-S-------------------H-TP----------IDIRDLWQTPPEIFAALNREFR-FVADVAAS---KLNHLLPAYLT-EQ-----DDAL--NQDW-------------------------A--A--QFP--IG--ITWCNPPYSD----------------------------ITPWVVKAT---EE-AR-K-GM-G-TVMLVPADTSVGWFS-AAR-----S-S--C-T--E---------VRFIT-----N----GRLSF---IR-AD-T---GKA-VNGNN--KGS-MLLIW------------------------------------------N-P--F-LS----Y---------F----GLTG-------YVS-----------RDA---LMSIGT-------------------------RLLLSAEK--V--SAA----------------------------------------------------
      IE01_RS08420_Gallibacterium_anatis_517157190                                   ---------------------------M----------------SFDRDAYRTPKYVFKWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--AEDF-LTFDPLD-----LIAVLE----F-C--N--F--------TIFVNPPYSN----------------------------PLPFVERAA---EL-KK-Q-GF-L-VAMLLPADKSTKWYQ-VIQ-----D-N--A-T--E-------V-IDIVG----------GRINF---LH-PE-T---GEE-VKGNN--KGS-LIAVF------------------------------------------D-P--T-MQ--------------G----FITR-------QVT-----------LDF---IKDVGG-----------------YG-I------------------------------------------------------------------------
      NTHI477_RS07245_Haemophilus_influenzae_764356005                               ---------------------------MT------EQ-------QFDKDTWQTPRYVFEWLSQRFGLFDLDGCAT---ANNALTCHYIGESNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-D--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-CD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SVS-----------LDF---VKKCGG-----------------YG-I------------------------------------------------------------------------
      HMPREF0555_0745_Leuconostoc_mesenteroides_subsp_cremoris_ATCC_19254_227352467  ----------------------M----M--V-DK----VLF---SSNSMVWETPKDYFDKLNRKFK-FDLDACAS---DTNHKVDTYFT-ED-----DNAL--EQKW-------------------------G--G-----------NVFMNPPYGR-H--------------------------IGKFIKKAY---EE-HL-RDPN-RFIVMLIPSRTDTKYWH-EYI-Q---D-K--A----T---------VKFIK----------GRLKF-E-ID--------GES-MDAAP--FPS-ALVVY-----------------------------------------------G-F------------------------------------------------------------------------------------------------------------------------------------------------------
      UA45_RS08225_Morganella_morganii_770844418                                     MKADYG--G-S-------------------T-TP----------KELRDLWQTPLPLFSALDAEFG-FYLDAAAD---KNNTLCSHYLT-EK-----DNAL--NSDW-------------------------Q--S--Y-----G--SIWCNPPYSD----------------------------IQPWVRKAA---EQ-CREQ-LQ-P-VVMLVPADTSVGWFK-SAL-----D-T--V-D--E---------VRFIT-----G----GRISF---IN-AG-T---DKS-KNGNT--KGS-MLLIW------------------------------------------R-P--F-TQ----P---------R----RIIT-------TVN-----------RDD---LMDIGN-------------------------RLLESQI-------------------------------------------------------------
      HMPREF0555_RS01180_Leuconostoc_mesenteroides_738135700                         ---------------------------M--V-DK----VLF---SSNSMVWETPKDYFDKLNRKFK-FDLDACAS---DTNHKVDTYFT-ED-----DNAL--EQKW-------------------------G--G-----------NVFMNPPYGR-H--------------------------IGKFIKKAY---EE-HL-RDPN-RFIVMLIPSRTDTKYWH-EYI-Q---D-K--A----T---------VKFIK----------GRLKF-E-ID--------GES-MDAAP--FPS-ALVVY-----------------------------------------------G-F------------------------------------------------------------------------------------------------------------------------------------------------------
      A15Y_RS20360_Escherichia_coli_486190610                                        MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKH-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A---------------------------------------------------------
      consensus/100%                                                                 ..........................................................................................................h..........................................ahNsPa................................................................................................................................................................................................................................................................................................................................................................
      consensus/95%                                                                  ...............................................pp.a.TP..ha..Lp..a..F.lDssu......s.bs..ahs.........ssl.....a.........................................hahNPPYu................................ah.+u..................VhLlPsc.ss.aa....................p.........lphh...........GRl.F...................s.s..bs..h...h........................................................................................................................................................................................................
      consensus/90%                                                                  ...............................................ps.W.TP..hF..Ls..F..F.lDssA.....pN.bs..ahs.........ssL...p.a.........................................hahNPPYup...............................al.+Ah.........p......sVhLlPsc.ss.aa...h................-.........lchl...........GRl.F...................sss..bss.hlhla........................................................................................................................................................................................................
      consensus/85%                                                                  ...............................................p-.W.TP..hF..Ls.bF..F.LDssA.....pNsbC.paho.........ssL...p.W.........................................hahNPPYup............................l..alpKAh....p.s..p....s.sVhLlPsc.ss.aa...hh.......p.......E.........l+hlp..........GRl.F.............s....psss..bss.hlhla........................................................................................................................................................................................................
      consensus/80%                                                                  .............................................pppD.WpTP..hF..Ls.cFs.F.LDssA.....pNsbC.+aho........pssL...ppW.........................s..s............hahNPPYup............................l.salpKAh...pp.s..p....s.sVhLlPucpss.Wap.phh.......p.......E.........l+hlp..........GRl.F.............G....psss..bss.hlhla..........................................c.s...........................................................................................................................................................
      consensus/75%                                                                  ................................s............pppD.WpTP..lF..Ls.cFs.FpLDssAs....pNAbC.+ahT..b.....pssL..pppW.........................s..s............lahNPPYup............................I.saVcKAh...pp.s..p....s.sVhLlPucsss.WFp.phh.....p.p..s....E.........l+hlp..........GRl.F.............G....psss..bsS.hlhla..........................................c.P.....p.....................................................................................................................................................
      consensus/70%                                                                  ................................s............pppD.WpTP..lFs.Ls.EFs.FpLDssAs....pNAbC.+ahT..b.....pssL..pppW.........................s..s............lWhNPPYuc............................I.saVcKAh...pp.s..p.s..s.sVhLlPAcTss.WFp.phh.....p.p..u....E.........lRhIp..........GRL.F.............Gp...puss..bGS.hlhla..........................................+.P.....p.....................h...............................................................................................................................
      
      Back to Contents
    • General notes, phyletic distribution and domain architectures of the Group2-Clade4/EMIHUDRAFT_111979-like N6-MTases

      General notes:

      This N6-MTase is an independent transfer to Emiliania. The eukaryotic enzymes are active. Synapomorphic residues include TP before first helix, D at the end of strand-1, N in helix following strand-1, NPPY motif after strand-4, basic residue in helix after strand-4, P at the end of strand-5, acidic residue beginning of strand-6, R between strands 6 and 7. Strand-3 is degenerate as in other families of this group. Operons show that they are present in phage/prophages where they are close to packaging and replication genes, suggesting that they might be packaged, and might be used to modify self-DNA to counter host REs in the phages/prophages.
      GI           Gene neighborhoods                                                                                                                            Dom archs   Pfam archs     Gene name              Len      Taxonomy                                          Species name                                   Genbank annotation
      # 96; Eukaryotic versions
      551608163                                                                                                                                                  N6-MTase    N6-MTase       EMIHUDRAFT_111979      250      eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_111979 [Emiliania huxleyi CCMP1516].
      551578908                                                                                                                                                  N6-MTase    N6-MTase       EMIHUDRAFT_240085      261      eukaryota>haptophyceae                            Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_240085 [Emiliania huxleyi CCMP1516].
      # 96; Prokaryotic homologs
      446051431    <-Phage_integrase<-?<-?<-small-pep<-?<-N6-MTase*<-?<-?<-RecT-Redbeta                                                                          N6-MTase    N6-MTase       -                      185      bacteria>proteobacteria>gammaproteobacteria       Escherichia coli                               DNA N-6-adenine-methyltransferase [Escherichia coli].                                               447177970_?-><-446728115_?<-485655176_Phage_integrase<-446410552_?<-446686039_?<-446111015_small-pep<-446135997_?<-446051431_N6-MTase*<-692947403_?<-446108978_?<-485810799_RecT-Redbeta<-446918185_?<-446155721_?<-447210913_?<-446336822_?
      486273694    RecT-Redbeta->?->?->N6-MTase*->?->small-pep->?->?->Phage_integrase->                                                                          N6-MTase    N6-MTase       -                      185      bacteria>proteobacteria>gammaproteobacteria       Escherichia coli                               phage N-6-adenine-methyltransferase [Escherichia coli].                                             693027482_RecT-Redbeta->693027485_?->485691706_?->486273694_N6-MTase*->446135997_?->446111015_small-pep->446686022_?->446410552_?->445974032_Phage_integrase->
      84779227     <-Phage_capsid<-Phage_portal<-Terminase_LS<-?<-HNH<-DCM<-Phage_lysozyme<-N6-MTase*<-DnaC<-KilA-N||?-><-SSB                                    N6-MTase    N6-MTase       SG0729                 181      bacteria>proteobacteria>gammaproteobacteria       Sodalis glossinidius str. 'morsitans'          hypothetical phage protein [Sodalis glossinidius str. 'morsitans'].                                 <-84779220_Phage_capsid<-84779221_Phage_portal<-84779222_Terminase_LS<-84779223_?<-84779224_HNH<-84779225_DCM<-84779226_Phage_lysozyme<-84779227_N6-MTase*<-84779228_DnaC<-84779229_KilA-N||84779230_?-><-84779231_SSB<-84779232_?<-84779233_?||84779234_?->
      490192932    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      180      bacteria>proteobacteria>gammaproteobacteria       Hafnia alvei                                   DNA N-6-adenine-methyltransferase [Hafnia alvei].                                                   <-490192932_N6-MTase*<-490192933_?<-737528911_?<-490192935_?
      496089880    <-Phage_integrase<-small-pep<-N6-MTase*<-?<-METHYLASE<-?<-?<-RecT-Redbeta                                                                     N6-MTase    N6-MTase       -                      180      bacteria>proteobacteria>gammaproteobacteria       Enterobacteriaceae bacterium 9_2_54FAA         DNA N-6-adenine-methyltransferase [Enterobacteriaceae bacterium 9_2_54FAA].                         <-490985983_?||496089884_?-><-490985991_?<-496089883_?||496089882_?-><-496089881_Phage_integrase<-748754240_small-pep<-496089880_N6-MTase*<-496089879_?<-496089878_METHYLASE<-748754325_?<-748754241_?<-496089875_RecT-Redbeta<-748754326_?<-496089873_?
      505727589    <-Phage_AlpA<-HTH_3+Peptidase_S24||?->?->HTH->N6-MTase*->RusA->Phage_pRha+ANT->?->Phage_antitermQ->                                           N6-MTase    N6-MTase       -                      180      bacteria>proteobacteria>gammaproteobacteria       Rahnella aquatilis                             DNA N-6-adenine-methyltransferase [Rahnella aquatilis].                                             <-505727580_?||505727582_?-><-753991441_Phage_AlpA<-505727584_HTH_3+Peptidase_S24||505727585_?->505727586_?->505727588_HTH->505727589_N6-MTase*->505727591_RusA->753991050_Phage_pRha+ANT->505727593_?->505727594_Phage_antitermQ-><-505727595_?||505727596_?->505727597_?->
      746124400    <-Phage-tail-tape<-?<-Portal<-Terminase_LS||?->?->?->N6-MTase*->small-pep->Phage_integrase->                                                  N6-MTase    N6-MTase       -                      180      bacteria>proteobacteria>gammaproteobacteria       Hafnia paralvei                                DNA methylase [Hafnia paralvei].                                                                    <-746124389_Phage-tail-tape<-746124673_?<-746124392_Portal<-746124676_Terminase_LS||746124679_?->746124395_?->746124398_?->746124400_N6-MTase*->746124403_small-pep->746124406_Phage_integrase->746124409_?-><-496089958_?<-746124412_?<-746124413_?<-496089961_?
      736793592    <-Phage_antitermQ<-?<-Phage_pRha+ANT<-RusA<-N6-MTase*<-HTH                                                                                    N6-MTase    N6-MTase       -                      179      bacteria>proteobacteria>gammaproteobacteria       Ewingella americana                            DNA methylase [Ewingella americana].                                                                <-736793578_?||736793581_?->736793582_?-><-736793583_Phage_antitermQ<-736793586_?<-736793588_Phage_pRha+ANT<-736793589_RusA<-736793592_N6-MTase*<-736793595_HTH<-736793711_?<-736793597_?||736793598_?->736793600_?->736793602_?->736793604_?->
      738472851    <-Phage_antitermQ<-?<-KilA-N<-RusA<-?<-N6-MTase*                                                                                              N6-MTase    N6-MTase       -                      179      bacteria>proteobacteria>gammaproteobacteria       Morganella morganii                            DNA methylase [Morganella morganii].                                                                485706482_?-><-738472838_Phage_antitermQ<-738472840_?<-738472843_KilA-N<-738473381_RusA<-738472848_?<-738472851_N6-MTase*<-738472855_?<-738472858_?<-640732309_?<-738472860_?<-738472862_?<-738472864_?||738473383_?->
      738462811    N6-MTase*->?->KilA-N->?->Phage_antitermQ->                                                                                                    N6-MTase    N6-MTase       -                      178      bacteria>proteobacteria>gammaproteobacteria       Morganella morganii                            DNA methylase [Morganella morganii].                                                                738462796_?-><-738462799_?||738464960_?->738462802_?->738462805_?->738462808_?->738464963_?->738462811_N6-MTase*->738462814_?->639128326_KilA-N->738462817_?->738462820_Phage_antitermQ->738462824_?-><-738462827_?||639126534_?->
      802097985    N6-MTase*->?->KilA-N->?->Phage_antitermQ->                                                                                                    N6-MTase    N6-MTase       -                      178      bacteria>proteobacteria>gammaproteobacteria       Morganella morganii                            DNA methylase [Morganella morganii].                                                                802097982_?->802097030_?->802097031_?->802097983_?->802097984_?->802097985_N6-MTase*->802097986_?->639128326_KilA-N->802097987_?->738462820_Phage_antitermQ->738462824_?-><-738462827_?||639126534_?->
      499730784    <-RusA<-?<-?<-N6-MTase*<-?<-?<-?||?->?->RecT-Redbeta->                                                                                        N6-MTase    SP             -                      177      bacteria>proteobacteria>gammaproteobacteria       Sodalis glossinidius                           DNA N-6-adenine-methyltransferase [Sodalis glossinidius].                                           <-754366011_?<-643659505_?||499730781_?-><-754366012_?<-499730782_RusA<-754366538_?<-754366013_?<-499730784_N6-MTase*<-754366014_?<-754366015_?<-754366539_?||754366016_?->754366017_?->499730787_RecT-Redbeta->754366018_?->
      169152788    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       ABSDF2497              173      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii SDF                    putative bacteriophage protein [Acinetobacter baumannii SDF].                                       <-169152781_?<-169152782_?<-169152783_?<-169152784_?<-169152785_?<-169152786_?<-169152787_?<-169152788_N6-MTase*<-169152789_?<-169152790_?<-169152791_?<-169152792_?<-169152793_?||169152794_?->169152795_?->
      284008293    KilA-N->?->?->Phage_lambda_P->?->N6-MTase*->                                                                                                  N6-MTase    N6-MTase       ARN_24250              173      bacteria>proteobacteria>gammaproteobacteria       Arsenophonus nasoniae                          phage DNA methyltransferase [Arsenophonus nasoniae].                                                284008286_?-><-284008287_?||284008288_KilA-N->284008289_?->284008290_?->284008291_Phage_lambda_P->284008292_?->284008293_N6-MTase*->
      447017697    <-N6-MTase<-?||?->?->?->N6-MTase*->?->?-><-Phage_integrase                                                                                    N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        N-6-adenine-methyltransferase [Acinetobacter baumannii].                                            <-446466656_?<-447033140_?<-446956397_N6-MTase<-447018352_?||740523977_?->446054157_?->446054521_?->447017697_N6-MTase*->446850517_?->446434453_?-><-446697986_Phage_integrase<-447100339_?<-446730003_?<-446054325_?||446643136_?->
      490838153    <-N6-MTase*<-?<-?<-?<-?<-?<-AAA                                                                                                               N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter sp. NIPH 973                     phage N-6-adenine-methyltransferase [Acinetobacter sp. NIPH 973].                                   <-490838141_?<-490838144_?<-490838145_?||490838148_?->490838149_?-><-446848420_?<-490838151_?<-490838153_N6-MTase*<-445951092_?<-446776857_?<-645915102_?<-446577501_?<-490838159_?<-490838161_AAA<-488063409_?
      493629840    HTH->?->?-><-Phage_AlpA<-N6-MTase*<-?<-?<-?<-?<-?<-?<-HTH_3+Peptidase_S24                                                                     N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter nosocomialis                     N-6-adenine-methyltransferase [Acinetobacter nosocomialis].                                         <-491023730_?<-491023731_?||493629838_?->491023734_HTH->493629839_?->491023738_?-><-490850089_Phage_AlpA<-493629840_N6-MTase*<-445951092_?<-493629841_?<-493629842_?<-445995254_?<-446051578_?<-447183491_?<-493629844_HTH_3+Peptidase_S24
      493629922    <-N6-MTase*<-?<-?<-?<-AAA                                                                                                                     N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter calcoaceticus/baumannii complex  MULTISPECIES: N-6-adenine-methyltransferase [Acinetobacter calcoaceticus/baumannii complex].        <-491024412_?<-446530955_?<-487980009_?||491024414_?->691157783_?-><-491024421_?<-490838151_?<-493629922_N6-MTase*<-445951092_?<-493629923_?<-493629924_?<-493629925_AAA<-491280326_?<-493629926_?<-493629927_?
      515155813    N6-MTase*->Phage_AlpA->                                                                                                                       N6-MTase    SP+N6-MTase    -                      166      bacteria>proteobacteria>gammaproteobacteria       Vibrio cyclitrophicus                          DNA N-6-adenine-methyltransferase [Vibrio cyclitrophicus].                                          656242824_?->515155813_N6-MTase*->515155814_Phage_AlpA-><-515155815_?<-515158394_?<-695348704_?||515155818_?->515155819_?->515155820_?->
      645913983    N6-MTase*->?->Phage_AlpA->                                                                                                                    N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter calcoaceticus/baumannii complex  MULTISPECIES: adenine methyltransferase [Acinetobacter calcoaceticus/baumannii complex].            446913963_?->691068996_?->691068999_?->691069002_?->691069005_?->671585605_?->691069009_?->645913983_N6-MTase*->691069012_?->446324015_Phage_AlpA->446956738_?-><-490848217_?<-691069015_?<-691069016_?||493629626_?->
      646896396    METHYLASE-><-?<-?<-DCM<-Phage_AlpA<-N6-MTase*<-?<-Phage_AlpA||RadC->?->?->?-><-Phage_AlpA                                                     N6-MTase    SP+N6-MTase    -                      166      bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus                        adenine methyltransferase [Vibrio parahaemolyticus].                                                <-545079836_?<-545079837_?||646896368_METHYLASE-><-646896374_?<-686283225_?<-646896385_DCM<-646896390_Phage_AlpA<-646896396_N6-MTase*<-686283226_?<-646896408_Phage_AlpA||658925964_RadC->658925965_?->646896434_?->646896439_?-><-646896445_Phage_AlpA
      691047241    RecT-Redbeta->?->?->?->?->N6-MTase*->?->Phage_integrase-><-?<-Phage_lysozyme                                                                  N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                691048071_?->691048073_?->497182141_RecT-Redbeta->497182088_?->446054153_?->691048076_?->691047238_?->691047241_N6-MTase*->490838151_?->493629963_Phage_integrase-><-446006682_?<-691048213_Phage_lysozyme<-447006407_?<-446060349_?
      758882462    RecT-Redbeta->?->?->?->N6-MTase*->?-><-Phage_integrase                                                                                        N6-MTase    N6-MTase       -                      166      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                758882458_?->446375422_?->758882459_?->445988016_RecT-Redbeta->487936574_?->758882460_?->758882461_?->758882462_N6-MTase*->446046136_?-><-758882463_Phage_integrase||490978840_?->446933280_?->446032311_?->447038484_?->445997369_?->
      444754682    Phage_integrase-><-?<-N6-MTase*                                                                                                               N6-MTase    N6-MTase       ACIN5021_2863          163      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter sp. OIFC021                      DNA N-6-adenine-methyltransferase (N6-MTase) [Acinetobacter sp. OIFC021].                                <-444754612_?<-444754588_?<-444754680_?<-444754736_?<-444754818_?||444754653_Phage_integrase-><-444754626_?<-444754682_N6-MTase*<-444754756_?<-444754570_?<-444754810_?<-444754666_?<-444754593_?<-444754819_?<-444754734_?
      588219826    <-Phage_integrase<-?<-N6-MTase*                                                                                                               N6-MTase    N6-MTase       J594_4091              163      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii 259052                 DNA N-6-adenine-methyltransferase family protein [Acinetobacter baumannii 259052].                  588219822_?-><-588219825_Phage_integrase<-588219827_?<-588219826_N6-MTase*<-588219823_?<-588219821_?<-588219828_?<-588219824_?
      593668543    <-N6-MTase*<-?<-?<-?<-AAA                                                                                                                     N6-MTase    N6-MTase       J660_0735              163      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii 88816                  DNA N-6-adenine-methyltransferase family protein [Acinetobacter baumannii 88816].                   <-593668536_?<-593668537_?<-593668538_?||593668539_?->593668540_?-><-593668541_?<-593668542_?<-593668543_N6-MTase*<-593668544_?<-593668545_?<-593668546_?<-593668547_AAA<-593668548_?<-593668549_?<-593668550_?
      748690860    <-RadC||N6-MTase*->Phage_AlpA->HTH->REase+SFII->METHYLASE->                                                                                   N6-MTase    N6-MTase       -                      161      bacteria>proteobacteria>gammaproteobacteria       Vibrio ichthyoenteri                           adenine methyltransferase [Vibrio ichthyoenteri].                                                   493763790_?-><-493763791_?<-748690858_?<-748690859_?<-493763794_?<-493763795_?<-493763796_RadC||748690860_N6-MTase*->493763798_Phage_AlpA->493763799_HTH->748690861_REase+SFII->493763801_METHYLASE->748690862_?->493763803_?->
      736663998    <-RusA<-?<-?<-?<-?<-?<-N6-MTase*<-DnaB<-Phage_rep_O<-?<-?||HTH_3+Peptidase_S24->                                                              N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-736663988_?<-736663989_RusA<-736582961_?<-651319723_?<-736663993_?<-736663995_?<-736663997_?<-736663998_N6-MTase*<-736664032_DnaB<-736663999_Phage_rep_O<-736664000_?<-736664001_?||736664033_HTH_3+Peptidase_S24->736664002_?-><-736664003_?
      740612810    N6-MTase*->McrB->McrC->?->HNH->                                                                                                               N6-MTase    N6-MTase       -                      141      bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus                        adenine methyltransferase, partial [Vibrio parahaemolyticus].                                       516017700_?-><-545079325_?||645070513_?->491602254_?->645070514_?-><-645070287_?<-645070288_?||740612810_N6-MTase*->740612812_McrB->645070517_McrC->645070518_?->645070519_HNH->645070520_?-><-645070521_?||645070522_?->
      696306260    <-N6-MTase<-?<-?<-N6-MTase*                                                                                                                   N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter sp. WC-323                       adenine methyltransferase [Acinetobacter sp. WC-323].                                               <-497201532_?<-696306258_?<-497201536_?<-497201530_?<-497201538_N6-MTase<-497201541_?<-696306264_?<-696306260_N6-MTase*<-696306262_?<-497201545_?
      690981431    METHYLASE->?->?-><-?<-?<-?<-N6-MTase*<-?<-?<-?<-N6-MTase                                                                                      N6-MTase    N6-MTase       -                      158      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                445987296_METHYLASE->446375043_?->447052417_?-><-690981438_?<-690981435_?<-727739395_?<-690981431_N6-MTase*<-690981312_?<-446667397_?<-446902378_?<-446605937_N6-MTase<-690981373_?<-691001572_?<-691014600_?
      691039522    <-HTH_3+Peptidase_S24||?->?->?->?->Phage_rep_O->?->N6-MTase*->                                                                                N6-MTase    N6-MTase       -                      158      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-445974397_HTH_3+Peptidase_S24||446625677_?->445971061_?->447103404_?->446525189_?->445986772_Phage_rep_O->691016564_?->691039522_N6-MTase*->691015623_?->691015626_?->691026561_?->
      691068978    <-Head-tail_con<-?<-?<-?<-?<-?<-?<-N6-MTase*<-?<-?<-?<-?<-?<-DnaC<-Phage_rep_O                                                                N6-MTase    N6-MTase       -                      158      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-691068966_Head-tail_con<-691068967_?<-691068970_?<-691068972_?<-691068974_?<-691068976_?<-691068977_?<-691068978_N6-MTase*<-691068979_?<-691068980_?<-691068982_?<-691068985_?<-645913892_?<-691068987_DnaC<-691068990_Phage_rep_O
      695353200    <-METHYLASE<-Phage_AlpA<-N6-MTase*<-?<-Phage_AlpA||RadC->                                                                                     N6-MTase    N6-MTase       -                      144      bacteria>proteobacteria>gammaproteobacteria       Vibrio splendidus                              adenine methyltransferase, partial [Vibrio splendidus].                                             <-515656659_?<-515656660_?<-515645430_?||515656661_?-><-515656662_?<-515656663_METHYLASE<-657349588_Phage_AlpA<-695353200_N6-MTase*<-695353203_?<-515656666_Phage_AlpA||515656668_RadC->515656669_?->515656670_?->515656671_?->515656672_?->
      691154760    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      157      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase, partial [Acinetobacter baumannii].                                       <-691154760_N6-MTase*
      691157882    DnaC->N6-MTase*->                                                                                                                             N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                446990918_?->446088269_?->515182860_?->691157878_?->691157879_?->691157880_?->691157881_DnaC->691157882_N6-MTase*->691157883_?->691157884_?->691157885_?->691157886_?->690988505_?->690988506_?->446940447_?->
      425484490    <-N6-MTase<-?<-?<-N6-MTase*                                                                                                                   N6-MTase    N6-MTase       ACINWC323_A0077        152      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter sp. WC-323                       DNA N-6-adenine-methyltransferase (N6-MTase) [Acinetobacter sp. WC-323].                                 <-425484495_?<-425484488_?<-425484497_?<-425484494_?<-425484498_N6-MTase<-425484499_?<-425484487_?<-425484490_N6-MTase*<-425484493_?<-425484501_?
      447010248    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        hypothetical protein [Acinetobacter baumannii].                                                     <-746025322_?<-446082079_?<-447010248_N6-MTase*<-638872311_?<-446202223_?<-638872318_?<-447006889_?<-446995652_?
      507070967    <-N6-MTase*<-N6-MTase<-?<-?<-?<-DnaC<-?<-DCM                                                                                                  N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter pittii                           phage N-6-adenine-methyltransferase [Acinetobacter pittii].                                         <-507070960_?<-507070961_?<-507070962_?<-507070963_?<-507070964_?<-507070965_?<-507070966_?<-507070967_N6-MTase*<-507070968_N6-MTase<-507070969_?<-507070970_?<-507070971_?<-507070972_DnaC<-507070973_?<-690970629_DCM
      690997976    <-Head-tail_con<-?<-?<-small-protein<-small-protein<-N6-MTase*<-?<-?<-RusA<-?<-DnaC<-Phage_rep_O                                              N6-MTase    N6-MTase       -                      154      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-690997963_?<-690997966_?<-690997968_Head-tail_con<-446255980_?<-446652432_?<-690997971_small-protein<-690997973_small-protein<-690997976_N6-MTase*<-690997978_?<-690997980_?<-690997982_RusA<-690997983_?<-690997985_DnaC<-690997990_Phage_rep_O<-446741286_?
      630464595    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       J532_4398              155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii 940793                 DNA N-6-adenine-methyltransferase family protein [Acinetobacter baumannii 940793].                  <-630464593_?<-630464594_?<-630464595_N6-MTase*
      690988986    <-HTH_3+Peptidase_S24||?->?->?->?->Phage_rep_O->?->N6-MTase*->                                                                                N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-446715420_HTH_3+Peptidase_S24||446088064_?->690988978_?->447018345_?->690988980_?->690988982_Phage_rep_O->690988984_?->690988986_N6-MTase*->690988987_?->690988990_?->690988991_?->690988994_?->690988996_?->727739050_?->446019471_?->
      690996743    <-N6-MTase*<-?<-Phage_rep_O<-?<-?<-?<-?||HTH_3+Peptidase_S24->                                                                                N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-638852081_?<-446019471_?<-690988996_?<-690988994_?<-690988991_?<-690988990_?<-690990083_?<-690996743_N6-MTase*<-690988984_?<-690988982_Phage_rep_O<-690988980_?<-447018345_?<-690988978_?<-446088064_?||446715420_HTH_3+Peptidase_S24->
      691027491    <-N6-MTase*<-?<-?<-DnaB<-Phage_rep_O                                                                                                          N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                <-446978217_?<-446022309_?<-691027489_?<-691016566_?<-445966370_?<-446571071_?<-446082079_?<-691027491_N6-MTase*<-446749121_?<-446991165_?<-446028310_DnaB<-446122449_Phage_rep_O<-489397397_?<-446990989_?||446789918_?->
      691065210    N6-MTase*->                                                                                                                                   N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                446990918_?->446088269_?->691065201_?->691065203_?->691065205_?->691065207_?->691065209_?->691065210_N6-MTase*->691065211_?->691065213_?->691065214_?->691065215_?->446074810_?->691065217_?->446300643_?->
      691093639    N6-MTase*->?->?->?->?->?->Phage_antitermQ->                                                                                                   N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                690990088_?->447190729_?->691093632_?->691093635_?->691093639_N6-MTase*->690990083_?->690990082_?->690990080_?->690990079_?->690990078_?->690990076_Phage_antitermQ->690990074_?->
      691117543    small-protein->?->ASCH->?->?->DCM->N6-MTase->N6-MTase*->                                                                                      N6-MTase    N6-MTase       -                      155      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase [Acinetobacter baumannii].                                                691117523_small-protein->691117525_?->691117527_ASCH->691117530_?->691117533_?->691117536_DCM->691117539_N6-MTase->691117543_N6-MTase*->691117546_?-><-691117549_?
      663438128    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      119      bacteria>proteobacteria>gammaproteobacteria       Acinetobacter baumannii                        adenine methyltransferase, partial [Acinetobacter baumannii].                                       <-663438128_N6-MTase*<-446054521_?<-446054157_?
      
      # 96;
      316921487    Phage_endo_I->?->?->Bro-N->?->?->N6-MTase*->?->?->?->MazG-Phage+MazG-Phage->Phage_pRha+ANT->AAA->                                             N6-MTase    N6-MTase       HMPREF0179_03455       158      bacteria>proteobacteria>deltaproteobacteria       Bilophila wadsworthia 3_1_6                    phage N-6-adenine-methyltransferase [Bilophila wadsworthia 3_1_6].                                  316921494_?->316921493_Phage_endo_I->316921492_?->316921491_?->316921490_Bro-N->316921489_?->316921488_?->316921487_N6-MTase*->316921486_?->316921485_?->316921484_?->316921483_MazG-Phage+MazG-Phage->316921482_Phage_pRha+ANT->316921481_AAA->316921480_?->
      749811142    Phage_endo_I->?->?->Bro-N->?->?->N6-MTase*->?->?->?->MazG-Phage+MazG-Phage->Phage_pRha->ANT->AAA->                                            N6-MTase    N6-MTase       -                      147      bacteria>proteobacteria>deltaproteobacteria       Bilophila wadsworthia                          adenine methyltransferase, partial [Bilophila wadsworthia].                                         491171955_?->749811133_Phage_endo_I->749811135_?->491171951_?->749811137_Bro-N->749811140_?->491171946_?->749811142_N6-MTase*->749811143_?->491171940_?->749811145_?->749811147_MazG-Phage+MazG-Phage->749811148_Phage_pRha->749811149_ANT->491171932_AAA->
      
      # 96;
      736470177    <-Phage_integrase||?->N6-MTase*->                                                                                                             N6-MTase    N6-MTase       -                      143      bacteria>proteobacteria>alphaproteobacteria       Afifella pfennigii                             hypothetical protein, partial [Afifella pfennigii].                                                 <-651245532_?||736470174_?->651245534_?->736470147_?->651245535_?-><-651245536_Phage_integrase||736470150_?->736470177_N6-MTase*->736470179_?-><-736470182_?<-651245538_?||651245539_?-><-736470184_?||651245540_?->651245541_?->
      # 96;
      654109520    <-MOM<-N6-MTase*                                                                                                                              N6-MTase    N6-MTase       -                      154      bacteria>firmicutes                               Desulfovirgula thermocuniculi                  adenine methyltransferase [Desulfovirgula thermocuniculi].                                          <-737120374_?<-654109515_?<-737120377_?<-737120378_MOM<-654109520_N6-MTase*<-654109526_?<-654109530_?<-737120375_?
      567770034    <-DUF3310<-Prim-Pol+PriCT_1+D5<-?<-?<-?<-N6-MTase*                                                                                            N6-MTase    N6-MTase       ERIC1_1c08270          155      bacteria>firmicutes                               Paenibacillus larvae subsp. larvae DSM 25719   phage N-6-adenine methyltransferase [Paenibacillus larvae subsp. larvae DSM 25719].                 <-567770027_?<-567770028_?<-567770029_DUF3310<-567770030_Prim-Pol+PriCT_1+D5<-567770031_?<-567770032_?<-567770033_?<-567770034_N6-MTase*<-567770035_?<-567770036_?<-567770037_?<-567770038_?<-567770039_?<-567770040_?<-567770041_?
      517503045    <-N6-MTase*<-?<-?<-?<-?<-Phage_pRha<-?<-DnaC                                                                                                  N6-MTase    N6-MTase       -                      156      bacteria>firmicutes                               Brevibacillus laterosporus                     DNA N-6-adenine-methyltransferase [Brevibacillus laterosporus].                                     <-517503038_?<-517503039_?<-737329766_?<-517503041_?<-517503042_?<-517503043_?<-517503044_?<-517503045_N6-MTase*<-737329767_?<-517503050_?<-517503051_?<-517503052_?<-737329768_Phage_pRha<-517503054_?<-737329760_DnaC
      493931641    DnaC->?->?->?->RecU->N6-MTase*->N6-MTase->HARE-HTH->ASCH->?->?->MPTase->                                                                      N6-MTase    N6-MTase       -                      152      bacteria>firmicutes                               Anaerotruncus colihominis                      DNA N-6-adenine-methyltransferase [Anaerotruncus colihominis].                                      749997332_?->749997333_?->493931636_DnaC->749997334_?->749997338_?->493931639_?->749997339_RecU->493931641_N6-MTase*->749997341_N6-MTase->493931643_HARE-HTH->749997342_ASCH->749997346_?->493931646_?->749997347_MPTase->493931648_?->
      737823765    small-protein->?->?->?->N6-MTase*->?->?->?->?->?->?->SSB->                                                                                    N6-MTase    N6-MTase       -                      135      bacteria>firmicutes                               Clostridium botulinum                          adenine methyltransferase [Clostridium botulinum].                                                  737823833_?->737814057_?->737823755_?->737823757_small-protein->737823759_?->737823761_?->737823763_?->737823765_N6-MTase*->737823767_?->737823769_?->737823771_?->737823773_?->737823775_?->737819447_?->737819450_SSB->
      739064083    METHYLASE->?-><-MPTase<-Phage-tail-tape||?->?->?->N6-MTase*->small-protein->Recombinase->                                                     N6-MTase    N6-MTase       -                      152      bacteria>firmicutes                               Pseudobacteroides cellulosolvens               adenine methyltransferase [Pseudobacteroides cellulosolvens].                                       739064069_METHYLASE->739064071_?-><-739064073_MPTase<-739064075_Phage-tail-tape||739064077_?->739064079_?->739064081_?->739064083_N6-MTase*->739064085_small-protein->739064553_Recombinase->739064087_?->739064089_?->739064556_?->739064091_?->739064093_?->
      291074040    <-HARE-HTH<-N6-MTase*<-?<-RecU<-?<-?<-N6-MTase                                                                                                N6-MTase    N6-MTase       CLOM621_08346          148      bacteria>firmicutes                               Clostridium sp. M62/1                          DNA N-6-adenine-methyltransferase (N6-MTase) [Clostridium sp. M62/1].                                    <-291074033_?<-291074034_?<-291074035_?<-291074036_?<-291074037_?<-291074038_?<-291074039_HARE-HTH<-291074040_N6-MTase*<-291074041_?<-291074042_RecU<-291074043_?<-291074044_?<-291074045_N6-MTase<-291074046_?<-291074047_?
      503587829    Phage_integrase->?->N6-MTase*->?->N6-MTase->                                                                                                  N6-MTase    N6-MTase       -                      156      bacteria>firmicutes                               Desulfotomaculum kuznetsovii                   DNA N-6-adenine-methyltransferase [Desulfotomaculum kuznetsovii].                                   503587822_?->752613398_?->503587824_?->752613399_?->752613400_?->503587827_Phage_integrase->503587828_?->503587829_N6-MTase*->503587830_?->503587831_N6-MTase->503587832_?->503587833_?->503587834_?->752613790_?->503587836_?->
      759006369    N6-MTase*->?->ASCH->?->?->?->Terminase_SS->Terminase_LS->                                                                                     N6-MTase    N6-MTase       -                      147      bacteria>firmicutes                               Aneurinibacillus migulanus                     adenine methyltransferase [Aneurinibacillus migulanus].                                             759006362_?->759006363_?->759006364_?->759006365_?->759006366_?->759006367_?->759006368_?->759006369_N6-MTase*->759006370_?->759006371_ASCH->759006372_?->759006373_?->759006374_?->759006468_Terminase_SS->759006375_Terminase_LS->
      488372936    <-HNH<-PBECR1<-?<-?<-dUTPase<-?<-?<-N6-MTase*<-?<-?<-DUF3310<-?<-PVL_ORF50                                                                    N6-MTase    N6-MTase       -                      145      bacteria>firmicutes                               Staphylococcus caprae                          DNA N-6-adenine-methyltransferase [Staphylococcus caprae].                                          <-488372955_HNH<-488372953_PBECR1<-488372950_?<-488372948_?<-488372946_dUTPase<-488372940_?<-739686961_?<-488372936_N6-MTase*<-488372934_?<-488372931_?<-488372929_DUF3310<-488372927_?<-488372925_PVL_ORF50<-488372923_?<-488372922_?
      737442515    <-Phage_terminase<-?<-DCM<-ParB<-?<-?<-?<-N6-MTase*<-?<-?<-RusA<-DnaB<-DnaC                                                                   N6-MTase    N6-MTase       -                      145      bacteria>firmicutes                               Bacillus sp. NSP2.1                            adenine methyltransferase [Bacillus sp. NSP2.1].                                                    <-737442428_Phage_terminase<-651510613_?<-651510616_DCM<-651510619_ParB<-651510622_?<-651510625_?<-492413769_?<-737442515_N6-MTase*<-651510632_?<-737442516_?<-492413793_RusA<-651510639_DnaB<-737442431_DnaC<-737442517_?<-737442518_?
      748713908    <-Phage_portal<-Terminase_LS<-Phage_terminase<-?<-?<-?<-?<-N6-MTase*<-?<-?<-?<-?<-?<-?<-RusA                                                  N6-MTase    N6-MTase       -                      145      bacteria>firmicutes                               Brevibacillus agri                             adenine methyltransferase [Brevibacillus agri].                                                     <-492413752_Phage_portal<-748713899_Terminase_LS<-748713907_Phage_terminase<-492413760_?<-492413763_?<-492413766_?<-492413769_?<-748713908_N6-MTase*<-492413776_?<-492413778_?<-492413781_?<-492413784_?<-492413787_?<-748713909_?<-492413793_RusA
      554763517    N6-MTase*->?->?->?->Terminase_LS->Phage_portal->MuF->Phage_GP20->                                                                             N6-MTase    N6-MTase       -                      144      bacteria>firmicutes                               Lactococcus lactis                             hypothetical protein [Lactococcus lactis].                                                          554763517_N6-MTase*->696369314_?->554763519_?->696369328_?->554763521_Terminase_LS->554763522_Phage_portal->696369317_MuF->696369330_Phage_GP20->
      432181416    <-Phage_portal<-Terminase_LS<-Phage_terminase<-?<-?<-?<-?<-N6-MTase*<-?<-?<-?<-?<-?<-?<-RusA                                                  N6-MTase    N6-MTase       D478_26539             157      bacteria>firmicutes                               Brevibacillus agri BAB-2500                    DNA N-6-adenine-methyltransferase [Brevibacillus agri BAB-2500].                                    <-432181409_Phage_portal<-432181410_Terminase_LS<-432181411_Phage_terminase<-432181412_?<-432181413_?<-432181414_?<-432181415_?<-432181416_N6-MTase*<-432181417_?<-432181418_?<-432181419_?<-432181420_?<-432181421_?<-432181422_?<-432181423_RusA
      739716594    <-MazG<-?<-?<-HNH<-?<-?<-?<-N6-MTase*<-DUF3310<-?<-?<-RusA<-?<-?<-Phage_rep_org_N                                                             N6-MTase    N6-MTase       -                      144      bacteria>firmicutes                               Staphylococcus aureus                          adenine methyltransferase [Staphylococcus aureus].                                                  <-739716582_MazG<-739716585_?<-739716586_?<-739716589_HNH<-739716637_?<-739716591_?<-739716592_?<-739716594_N6-MTase*<-739716640_DUF3310<-739716596_?<-499595896_?<-739716598_RusA<-739716600_?<-739716602_?<-739716604_Phage_rep_org_N
      746045508    <-Terminase_LS<-?<-HNH<-?<-?<-small-protein<-N6-MTase*                                                                                        N6-MTase    N6-MTase       -                      144      bacteria>firmicutes                               Lactococcus lactis                             adenine methyltransferase [Lactococcus lactis].                                                     <-746045496_?<-746045498_Terminase_LS<-746046287_?<-746045500_HNH<-746045502_?<-746045504_?<-746045506_small-protein<-746045508_N6-MTase*<-746045509_?<-746046289_?<-746045511_?<-746045512_?<-746045513_?<-746046291_?<-746045514_?
      700273311    <-N6-MTase<-N6-MTase*                                                                                                                         N6-MTase    N6-MTase       NZ45_03810             143      bacteria>firmicutes                               Clostridium botulinum                          adenine methyltransferase [Clostridium botulinum].                                                  <-700273305_?<-700273306_?||700273307_?-><-700273308_?<-700273309_?<-700273310_?<-700273313_N6-MTase<-700273311_N6-MTase*<-700273312_?
      496656604    <-ParB+N6-MTase<-?<-?<-?<-?<-HARE-HTH<-N6-MTase*<-?<-RecU<-?<-?<-N6-MTase                                                                     N6-MTase    N6-MTase       -                      158      bacteria>firmicutes                               Clostridium sp. 7_3_54FAA                      DNA N-6-adenine-methyltransferase [Clostridium sp. 7_3_54FAA].                                      <-496656597_?<-496656598_ParB+N6-MTase<-496656599_?<-496656600_?<-496656601_?<-769135258_?<-496656603_HARE-HTH<-496656604_N6-MTase*<-496656605_?<-496656606_RecU<-496656607_?<-496656608_?<-496656609_N6-MTase<-496656610_?<-496656611_?
      488427723    <-small-protein<-?<-?<-DUF3310<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N<-AP2<-?<-DUF968                                       N6-MTase    N6-MTase       -                      142      bacteria>firmicutes                               Staphylococcus epidermidis                     DNA N-6-adenine-methyltransferase [Staphylococcus epidermidis].                                     <-488427716_small-protein<-488427717_?<-488427718_?<-488427719_DUF3310<-488427720_PVL_ORF50<-488427721_?<-488427722_?<-488427723_N6-MTase*<-488427724_?<-488427725_?<-488427726_DnaC<-488427727_Phage_rep_org_N<-488427728_AP2<-488427729_?<-488427730_DUF968
      489480013    Phage_portal->MuF->?->N6-MTase*->N6-MTase->                                                                                                   N6-MTase    N6-MTase       -                      142      bacteria>firmicutes                               Clostridium botulinum                          phage N-6-adenine-methyltransferase [Clostridium botulinum].                                        696516263_Phage_portal->696516265_MuF->666650663_?->489480013_N6-MTase*->696516267_N6-MTase->
      515743089    <-DUF3310<-?<-?<-?<-N6-MTase*<-?<-?<-DnaB<-?<-Phage_rep_org_N<-DUF968                                                                         N6-MTase    N6-MTase       -                      142      bacteria>firmicutes                               Staphylococcus hominis                         DNA N-6-adenine-methyltransferase [Staphylococcus hominis].                                         <-739692513_DUF3310<-515743086_?<-515743087_?<-515743088_?<-515743089_N6-MTase*<-515743090_?<-515743091_?<-515743092_DnaB<-515743093_?<-515743094_Phage_rep_org_N<-515743095_DUF968<-515743096_?
      446374006    SSB->DUF968->Phage_rep_org_N->DnaC->?->?->N6-MTase*->?->?->PVL_ORF50->Phage_Orf51->?->dUTPase->                                               N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          DNA N-6-adenine-methyltransferase [Staphylococcus aureus].                                          447210992_?->446627362_SSB->447122187_DUF968->446427137_Phage_rep_org_N->446725726_DnaC->446159298_?->447046432_?->446374006_N6-MTase*->445971925_?->447109983_?->446458196_PVL_ORF50->447049569_Phage_Orf51->446987809_?->446107781_dUTPase->447204818_?->
      446374007    <-Phage_Orf51<-PVL_ORF50<-?<-?<-?<-N6-MTase*<-?<-?<-DnaB<-?<-Phage_rep_org_N<-DUF968<-SSB                                                     N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          DNA N-6-adenine-methyltransferase [Staphylococcus aureus].                                          <-446377139_?<-827456484_?<-827456486_Phage_Orf51<-827456488_PVL_ORF50<-446695570_?<-827456491_?<-445971951_?<-446374007_N6-MTase*<-447046434_?<-446947149_?<-447028912_DnaB<-447054991_?<-446427118_Phage_rep_org_N<-447122162_DUF968<-446627367_SSB
      506511035    <-Phage_Orf51<-DUF3310<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N||?-><-DUF968<-SSB                                             N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          hypothetical protein [Staphylococcus aureus].                                                       <-686306375_?<-446901953_?<-686148387_Phage_Orf51<-445944872_DUF3310<-506506510_PVL_ORF50<-447109983_?<-445971925_?<-506511035_N6-MTase*<-447046432_?<-446159298_?<-521258099_DnaC<-446112371_Phage_rep_org_N||446633145_?-><-521258098_DUF968<-752533923_SSB
      554679133    SSB->DUF968->Phage_rep_org_N->?->DnaB->?->?->N6-MTase*->?->?->DUF3310->Phage_Orf51->                                                          N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          prophage LambdaSo DNA modification methyltransferase [Staphylococcus aureus].                       554679128_SSB->554679130_DUF968->446427119_Phage_rep_org_N->447054987_?->554679132_DnaB->446947142_?->447046432_?->554679133_N6-MTase*->445971955_?->447110008_?->445944880_DUF3310->554679135_Phage_Orf51->554679137_?->447028362_?->446987770_?->
      678260344    <-small-protein<-?<-Phage_Orf51<-DUF3310<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N||?-><-small-protein<-DUF968                 N6-MTase    N6-MTase       ERS140248_02184        141      bacteria>firmicutes                               Staphylococcus aureus                          prophage L54a%2C N-6-adenine-methyltransferase [Staphylococcus aureus].                             <-678260337_small-protein<-678260338_?<-678260339_Phage_Orf51<-678260340_DUF3310<-678260341_PVL_ORF50<-678260342_?<-678260343_?<-678260344_N6-MTase*<-678260345_?<-678260346_?<-678260347_DnaC<-678260348_Phage_rep_org_N||678260349_?-><-678260350_small-protein<-678260351_DUF968
      686297326    SSB->DUF968->Phage_rep_org_N->?->DnaB->?->?->N6-MTase*->?->?->?->PVL_ORF50->Phage_Orf51->                                                     N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          adenine methyltransferase [Staphylococcus aureus].                                                  446857537_SSB->686174277_DUF968->686297325_Phage_rep_org_N->447054991_?->686169507_DnaB->686169506_?->447046432_?->686297326_N6-MTase*->686297327_?->447110007_?->446695570_?->486217540_PVL_ORF50->686175250_Phage_Orf51->
      686300364    SSB->DUF968-><-?||Phage_rep_org_N->DnaC->?->?->N6-MTase*->?->?->?->PVL_ORF50->Phage_Orf51->                                                   N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          adenine methyltransferase [Staphylococcus aureus].                                                  446627367_SSB->686300367_DUF968-><-446336900_?||523688958_Phage_rep_org_N->686300366_DnaC->446159298_?->686300365_?->686300364_N6-MTase*->445971943_?->447110007_?->446695570_?->686300363_PVL_ORF50->686348626_Phage_Orf51->
      686391504    <-Phage_Orf51<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N<-AP2<-DUF968<-SSB                                                      N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          adenine methyltransferase [Staphylococcus aureus].                                                  <-686391501_?<-686391502_Phage_Orf51<-686391503_PVL_ORF50<-447109983_?<-445971925_?<-686391504_N6-MTase*<-447046432_?<-446159298_?<-686332987_DnaC<-446427132_Phage_rep_org_N<-686391505_AP2<-686298587_DUF968<-686391506_SSB
      686419170    SSB->DUF968->AP2->Phage_rep_org_N->DnaC->?->?->N6-MTase*->?->?->?->PVL_ORF50->Phage_Orf51->                                                   N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          adenine methyltransferase [Staphylococcus aureus].                                                  686419164_SSB->686419165_DUF968->686419166_AP2->686419167_Phage_rep_org_N->686419168_DnaC->446551491_?->686419169_?->686419170_N6-MTase*->445971955_?->686382904_?->446023397_?->445951516_PVL_ORF50->686382906_Phage_Orf51->
      686449191    <-Phage_Orf51<-DUF3310<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N<-AP2<-DUF968<-SSB                                                        N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Staphylococcus aureus                          adenine methyltransferase [Staphylococcus aureus].                                                  <-446066855_?<-446901953_?<-686449190_?<-446053519_Phage_Orf51<-445944880_DUF3310<-447109974_?<-686369055_?<-686449191_N6-MTase*<-447046428_?<-446178742_?<-686449192_DnaC<-686449193_Phage_rep_org_N<-686449194_AP2<-686449195_DUF968<-447021753_SSB
      737532221    <-HNH<-SSB<-?<-?<-HNH<-?<-N6-MTase*<-?<-?<-?<-?<-?<-N6-MTase<-DnaC                                                                            N6-MTase    N6-MTase       -                      141      bacteria>firmicutes                               Halobacillus                                   MULTISPECIES: adenine methyltransferase [Halobacillus].                                             <-737532213_?<-737532215_HNH<-737532216_SSB<-737532218_?<-737532219_?<-737533831_HNH<-737532220_?<-737532221_N6-MTase*<-737532222_?<-737532223_?<-737532225_?<-737532227_?<-737532228_?<-737533832_N6-MTase<-737532230_DnaC
      492715347    <-HARE-HTH<-N6-MTase*<-?<-RecU<-?<-?<-N6-MTase                                                                                                N6-MTase    N6-MTase       -                      158      bacteria>firmicutes                               Clostridiales                                  MULTISPECIES: DNA N-6-adenine-methyltransferase [Clostridiales].                                    <-495674054_?<-495674055_?<-495674056_?<-490331179_?<-490331178_?<-490331176_?<-490331175_HARE-HTH<-492715347_N6-MTase*<-490331171_?<-490331168_RecU<-490331165_?<-495674061_?<-490331152_N6-MTase<-495674063_?<-495674065_?
      500994137    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      140      bacteria>firmicutes                               Clostridium botulinum                          DNA N-6-adenine-methyltransferase [Clostridium botulinum].                                          <-489454419_?<-500994137_N6-MTase*
      635344555    <-N6-MTase<-?<-?<-?<-?<-?<-N6-MTase*<-DnaC<-Phage_rep_org_N<-Phage_pRha+ORF6C<-MetJArc||MetJArc-><-small-protein<-small-protein               N6-MTase    N6-MTase       BN981_00304            140      bacteria>firmicutes                               Halobacillus trueperi                          phage N-6-adenine-methyltransferase [Halobacillus trueperi].                                        <-635344548_?<-635344549_N6-MTase<-635344550_?<-635344551_?<-635344552_?<-635344553_?<-635344554_?<-635344555_N6-MTase*<-635344556_DnaC<-635344557_Phage_rep_org_N<-635344558_Phage_pRha+ORF6C<-635344559_MetJArc||635344560_MetJArc-><-635344561_small-protein<-635344562_small-protein
      738763505    <-DUF3310<-Prim-Pol+PriCT_1+D5<-?<-?<-?<-?<-N6-MTase*                                                                                         N6-MTase    N6-MTase       -                      136      bacteria>firmicutes                               Paenibacillus larvae                           adenine methyltransferase, partial [Paenibacillus larvae].                                          <-738761287_?<-738761289_DUF3310<-738761291_Prim-Pol+PriCT_1+D5<-738761294_?<-738761296_?<-738761298_?<-738761300_?<-738763505_N6-MTase*<-738761302_?<-738761305_?<-738763507_?<-738761307_?<-738761309_?<-738761310_?<-738761313_?
      737140426    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      140      bacteria>firmicutes                               Clostridium tetani                             adenine methyltransferase [Clostridium tetani].                                                     <-746206773_?<-737140426_N6-MTase*<-737140429_?<-737140432_?<-737140435_?
      748203410    small-protein->?->?->?->?->N6-MTase*->                                                                                                        N6-MTase    N6-MTase       -                      140      bacteria>firmicutes                               Clostridium botulinum                          adenine methyltransferase [Clostridium botulinum].                                                  489456872_?->489456871_?->489456870_small-protein->489456868_?->489456866_?->489456865_?->748203409_?->748203410_N6-MTase*->500994144_?->500994146_?->500994147_?->489454419_?->748203411_?->647362810_?->748203413_?->
      752703286    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      140      bacteria>firmicutes                               Clostridium botulinum                          adenine methyltransferase [Clostridium botulinum].                                                  <-752703286_N6-MTase*
      768719850    <-Phage_portal<-HNH<-Terminase_LS<-N6-MTase*                                                                                                  N6-MTase    N6-MTase       -                      140      bacteria>firmicutes                               Oenococcus oeni                                adenine methyltransferase [Oenococcus oeni].                                                        <-488908904_?<-488908905_Phage_portal<-768719849_HNH<-488908906_Terminase_LS<-768719850_N6-MTase*<-488908910_?
      446374005    <-N6-MTase*                                                                                                                                   N6-MTase    N6-MTase       -                      139      bacteria>firmicutes                               Staphylococcus aureus                          hypothetical protein, partial [Staphylococcus aureus].                                              <-446374005_N6-MTase*<-510795933_?
      737533832    <-N6-MTase<-?<-?<-?<-?<-?<-N6-MTase*<-DnaC<-Phage_rep_org_N<-Phage_pRha+ORF6C                                                                 N6-MTase    N6-MTase       -                      137      bacteria>firmicutes                               Halobacillus                                   MULTISPECIES: adenine methyltransferase [Halobacillus].                                             <-737532220_?<-737532221_N6-MTase<-737532222_?<-737532223_?<-737532225_?<-737532227_?<-737532228_?<-737533832_N6-MTase*<-737532230_DnaC<-737533835_Phage_rep_org_N<-737532232_Phage_pRha+ORF6C<-737533836_?<-737532233_?<-737532235_?<-737532237_?
      654100680    <-MOM<-N6-MTase*                                                                                                                              N6-MTase    N6-MTase       -                      154      bacteria>firmicutes                               Desulfovirgula thermocuniculi                  adenine methyltransferase [Desulfovirgula thermocuniculi].                                          <-654100660_?<-654100664_?<-737119001_?<-737119003_?<-654100676_?<-737119021_?<-737119023_MOM<-654100680_N6-MTase*<-654100684_?<-654100688_?<-737119025_?<-654100700_?<-654100704_?<-654100708_?<-654100715_?
      518557238    RecT-Redbeta->SSB->?->?->?->?->?->N6-MTase*->?->?->small-protein->?->small-protein->?->HNH->                                                  N6-MTase    N6-MTase       -                      152      bacteria>actinobacteria                           Bifidobacterium breve                          DNA N-6-adenine-methyltransferase [Bifidobacterium breve].                                          489927158_RecT-Redbeta->489927162_SSB->489927167_?->489927170_?->489927172_?->489927174_?->489927176_?->518557238_N6-MTase*->489927184_?->489927185_?->489927186_small-protein->489927190_?->489927191_small-protein->489927192_?->489927194_HNH->
      673939868    N6-MTase->?->N6-MTase*->Terminase_LS->HNH->Phage_portal->                                                                                     N6-MTase    N6-MTase       Phi93_04               140      viruses>dsdna viruses, no rna stage>caudovirales  Lactococcus phage phi93                        DNA methylase [Lactococcus phage phi93].                                                            673939865_?->673939866_N6-MTase->673939867_?->673939868_N6-MTase*->673939869_Terminase_LS->673939870_HNH->673939871_Phage_portal->673939872_?->673939873_?->673939874_?->673939875_?->
      # 2;
      491783102    <-N6-MTase<-?<-HTH_3+Peptidase_S24||?->Phage_pRha+ANT->?->HTH->N6-MTase*->?->RusA->Phage_antitermQ->?->?->?->KilA-N->                         N6-MTase    N6-MTase       -                      174      bacteria>proteobacteria>gammaproteobacteria       Actinobacillus pleuropneumoniae                DNA methylase [Actinobacillus pleuropneumoniae].                                                    <-491783091_N6-MTase<-491783093_?<-491783095_HTH_3+Peptidase_S24||491783098_?->763111857_Phage_pRha+ANT->491783100_?->491783101_HTH->491783102_N6-MTase*->491783105_?->491783106_RusA->491783107_Phage_antitermQ->491805399_?->491805403_?->491783110_?->491783111_KilA-N->
      500173972    <-N6-MTase<-?<-HTH_3+Peptidase_S24||?->Phage_pRha+ANT->Phage_rep_O->HTH->N6-MTase*->?->RusA->Phage_antitermQ->?->Phage_lysozyme->             N6-MTase    N6-MTase       -                      174      bacteria>proteobacteria>gammaproteobacteria       Actinobacillus pleuropneumoniae                DNA methylase [Actinobacillus pleuropneumoniae].                                                    <-500173966_N6-MTase<-500173967_?<-762512306_HTH_3+Peptidase_S24||500173969_?->500173970_Phage_pRha+ANT->762512559_Phage_rep_O->762512561_HTH->500173972_N6-MTase*->500173973_?->500173974_RusA->500173975_Phage_antitermQ->762512308_?->500173976_Phage_lysozyme->500173977_?->762512310_?->
      # 1;
      342803448    <-RadC||N6-MTase*->Phage_AlpA->HTH->REase+SFII->METHYLASE->                                                                                   N6-MTase    N6-MTase       VII00023_15021         397      bacteria>proteobacteria>gammaproteobacteria       Vibrio ichthyoenteri ATCC 700023               putative phage N-6-adenine-methyltransferase [Vibrio ichthyoenteri ATCC 700023].                    342803441_?-><-342803442_?<-342803443_?<-342803444_?<-342803445_?<-342803446_?<-342803447_RadC||342803448_N6-MTase*->342803449_Phage_AlpA->342803450_HTH->342803451_REase+SFII->342803452_METHYLASE->342803453_?->342803454_?->
      584469889    N6-MTase*->McrB->McrC->?->HNH->                                                                                                               N6-MTase    N6-MTase       VPUCM_1151             247      bacteria>proteobacteria>gammaproteobacteria       Vibrio parahaemolyticus UCM-V493               prophage LambdaSo, DNA modification methyltransferase, putative [Vibrio parahaemolyticus UCM-V493]. 584469882_?->584469883_?-><-584469884_?||584469885_?->584469886_?->584469887_?-><-584469888_?||584469889_N6-MTase*->584469890_McrB->584469891_McrC->584469892_?->584469893_HNH->584469894_?-><-584469895_?||584469896_?->
      
      Back to Contents
    • Multiple sequence alignment of the Group2-Clade5/Reticulomyxa-like N6-MTase

                                                                        <-Restriction endonuclease------------* *--------------------------------------------------------------------------------------------->                                                                                                                                             Str-1                                      Str-2                       Str-3                     Str-4                                               Str-5                                      Str-6            Str-7                             <---N-terminal RAGNYA-------------------------------------------------------------------------------------------------> < Helical coiled coil-----------><----C-terminal RAGNYA-                                                                         ---------------------------><  c-TERMINAL HELIX OF COILED cOIL-------------------------------------------------------------------------------------------------------------------------------------------------------------->                                                                                                                                                 
      ALIGN                                    -------HHHHHHHHHHHHHHHH-------------------------EEEE------HHHHHH-----H-HHHHHHHHHHHHHH-----EEE---------HHHHH-------HHHHHHH------H-H------HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHH-----------------HHHHHHHH-----HHHHHHH----EEEE-----------HHHHHHHHHHHHHH------------------HHHEHHHH----------EEE----EEEEEEEEE------EE---------EEEEEEHHH-----------HH-----------E---HHH-HHHHHHHHHEE---------------------------------EEEEEE------------H---HHHHHH--HHHHHHHHHHHH----------EEEEEE------------EEHHH-------HHHHHHHH-----EEEEE-----EEEE---E---EEEHHHH-------HHHHHH-H---------EEE-------HHHHHHHHHHH--------------------------------------------E---HHHHHHHHHHHHHHHHHHHH---------------E----------------HHHHHHHHH---HHHHH----EEEE-------------------EEEEE-----HHHHHHHHHH----HH----------HH-HHHHHHHHHHHHHHH------EE------EEEEEEH--------HHHHHHHHHH------EEEEE---HHHHH-HHHHHHHHHHHHHH--HHEEH-------HHHHHHHH-----HHHHHHH------HH----EEEE----------HHHHHH-------------HHHHHHHH---E-EEEEE---E------HHHH-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH------HHHHE----
      HMM                                      ---HHHHHHHHHHHHHHH--EEEE----------HHH--EEE-----EEEEEE-----EEEEEE-----HHHHHHHHH--HHHEEH---EEEE----EEE-EEEHHHHHHE-HHHHHHHHHH----HEEHH---HHHHHHHHHHH---EEE---HHHHHHHHHHHHHHHHHHHHHHHH---EEEE---HHHHHHHHHHH--HHHHHHHHHH-HHEEEEEE----------HHHHHHHHHHHHHHH--HEEE-------HHHHHHHHHHHHH-----EEEEEEEE-HHHHHHHHHHHHH----EEEE------EEEHHHHHHHHH-------HHHHH-------EEEEEEEHHH-HHHHHHHHHHHHH--H---E---E-----EEEE---E-------EEEEEEE---------HHE---EHHHHHHHHHHHHHHHE--HH-HHHHH---EEEEEEE--------EEEEEEEEE-----HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHH---E---EEEEE---HHHHHHHHHHHHHH---------HHHHHEE-EHHHHHH------EEEEEE-----HHHHHHHHHHHHHH--HHHHHHHHH---HH-HH------EEEEEE------EEEEEEE----HHHHHHHHHHHHHHHHHHHHHEE---EEE-----EEEEEEEHHHHH-------EEHEEEE----EEEE-H-HHH--HHEHHHEE-HHHHHHHHHH---HHHEE--HHHHHHHHHHHHHHHHH---EEEEEEE-------EEEEEE-HHHHHHHHHHHHHHHHHHHH----EEEEEE----E--HHHHH-EE----H-HHHHH------HH---EEEEE----------HHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH--EEEE---HHHEHHHHHHHHHHHHHHH--HHHHHH----
      FREQ                                     ----------HHHHHHHHHHH---EEE-------EEEEEE--------EEEE-------EEEEEE-----HHHHHHHHHHHHHHHH--HHHHHH------EEE----------HHHHHHHHHHHHHHHHHHH----EEEEEEEEEEEEE----------HHHHHHHHHHHHHHHHH----------------HHHHHH----HHHHHHHHHHHH------HEEEEHHHH-----HHHHHHHHHHH----EE---E-------HHHHHHHHH---------------HEEEHHEEE--------E---------HEEHHHHHHHH-H-HHHHHHHHH-----H--------E-H-HHHHHHHHHHH------------H-----H--------------EEEEE----------EEEE---E-----------------EEEE--HHH----EEEEEHHHH----EEEE-----EE----HHHHHHHHHH----EEEEE----EEEEEHHHEEEEEEEE------------EEEEEE--------EEE-----------------------------------HH-HHHHHHHH-----EEEEH----HHHHH------------HHHHHHHHHHHHHHHHHHHHHHHH-------EEEEE----------------HHHHHHHH---HHHHHH-HHHHHHHHHHHHH-----HEEE-----------------EEEEE-----------HEEEHHHHHHHHHH------EEEEEE-----EEEEE------HHHHHHHHHH--------EEEEEEHHHHHHHHHHHHHHHHHH---------------HEEHHHHHHHH-----HHHHHHH------H-----HHH--------------EEEEEEE----------HHHHHHHHHH---EEEHHHHHHHHHHHHHHH-HHHHHHHH-----HH-----EEEEEEEEE------E---HHHHHHHHHHH------EEEEE---
      PSSM                                     -----HHHHHHHHHHHHH---------------HHHHHHH------EEEEEE-------EEEEEE------HHHHHHHHHHHHHH----EEEEEE---EEEEEE--------------------------------HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHH----------------HHHHHHHH---HHHHHHHHHHHHHH-------------------HHHHHHHHHHH-------------HHHHHHHHHHHHHH--------EEE---HHHHHHHHH-------EEE------HHHHHHHHHHHHHH----HHHHHHH-----HH--EEEEE-HH-HHHHHH---E---------EE-----------------------EEEEEE-----------------------HHHHHHHHHHHHHHHHHHH----EEEEEEE---------------------HHHHHHHHHH----EEEEEE------------EEEEEEEEE----------HHHHHHHHHH----------------HHHHHHHHHHH-------------EEEE------------HHHHHH--------------HHHHHH-----HHHHHHHHHHHHH-------------------EEEEE--------------HHHHH--HHHHHHH-------------------EEEEE-------HHHHHHHH--HHEHHHHHHHHH-H-HHHHEEEHHHHHHH---E-----EEEEE---EEEEEEE--------EEEEEEEE-------EEEEE-E--HHHHHHHHHHHHHHHHHHH-----------EEEEEE--EEEEEE----HHHHHH------HH----EEE----------HHHHHHHHHH-----------HHHHHHHHHHH--EEEEEHHHH--HHHHHHH-HHHHHHHHHHHHHH-----EEHHHEEHHHH---EEEEEE-----HHHHHHH-----E-------
      FINAL                                    ------HHHHHHHHHHHHHH----EEE-------HHHHH--------EEEEEE------EEEEEE-----HHHHHHHHHHHHHHHH---EEEEE-----EEEEE---------HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHH--------------HHHHHHHHH----HHHHHHHHHHHHH-------EEHHHHH-----HHHHHHHHHHH----EE--------HHHHHHHHHHHH---------EEE--HHHHHHHHHH-------EEE-------HHHHHHHHHHHH-----HHHHHHH-----H---EEEEE-HH-HHHHHHHHHHH--------E---------E--------------EEEEEE-----------EE---E-------HHHHHHHHHHHHHHH-------EEEEEEE-------EEEEE---EEE---HHHHHHHHHHH----EEEEE-----EEEE--EEEEEEEEEE-----------HHHHHHH-------EEEE-------HHHHHHHHHH----------------EE--HHHHHHH-----HHHEE-----HHHH---HHHHH-----HHHHHHHHHHHHHHHHHHHHHHHH--------EEEEEEEE-----HHHH---HHHHHHHHHHHHHHHH---HHHHHHHHHHH----EEEEEE------HHHH-------EEEEEE-----------EEEHHHHHHHHHH-------EEEEE----EEEEEE--------HHHEEEEE-------EEEEEEEHHHHHHHHHHHHHHHHHHHH----EEE-----EEEEEE--EEEEE----HHHHHHH------HH---EEEEE----------HHHHHHHHH----------HHHHHHHHHHHH--EEEEHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH----HHHHHHEEEEE---EEEEE--HHHHHHHHHHH-----EEEEE---
      RFI_15181_Reticulomyxa_filosa_569404634  MKYSDLVERDIEAAIDNELRYLKWNDDSKDINSCNRNKKKLLGGKRPDYILYKDNSDEPIAIIEAKKPYEDINKAQQQGIEYAKILNAPVVFATDGIYTKTYHIKKQANLTLNNEEVDDLLNQSTLLNFLQDNIYDSIDKIRLNRQKGFTIKRGINIYIQNYLFSNILFLKVISELAEMNDCTIALPPKDYLWDNFKIKKGLDLVDFLNKQAFDYFKKSYGGKVLSKIEILSGKERILNDIITNLDDLWLS---DTNTDIKGDAFEYFLRNYGGAETDFGEYFTPRHIVKTMVKLLNPKFGEKIYDGFCGTGGMLTESFKYIKRRMPLNPNTIRYL-----KNETVFGGEFST-IFRIAKMNMILAGDGHSNIARQDS-----YEKKQTNK-------LDVVITNIPF-GNKMKTDY---LSQYGYNGKSAEICGVLHCLDALNNQNENARAGIIVPEEEAKQKQNHIWYFDLQNDGYALNKARTKIKGQNDIDVLLSEASLNIDEIERLKRINFDVLYKNKVRNNKYVLLANQYKEQVID---NFAFQEFSLQELEEIKHIEFKKGNALSKTEVENDGIYECILYGELYTKYNNPFIDKVYSTTNVKGKILSNYGDVQNKINPQYLSLVFNYTLKNELAKYARGANILHLSNDNIKKIKIPLPPLEEQQKIVEEIDSYQKVIDGAKQIIDNWNPSFEVREGCEIRNLGDITKLVRGPFGGSLKKEIFVESGFKVYEQSNCIKNDVKIGNYYITESKYKEMIRFSVQENDILMSCSGTIGKVLLLNNNFEKGIINQALLKVTPAQSSKKKIVEQIENERKIVESSKQLIKLQEEKIKNKINLFLLNSNGIDARDYSSNLSKNDTELKAKKK------NLGQDDFIIDNKSAENIISQLEEVNKLCINNCGLASQNHEFATQILNNLFTNVNLVMGDIWNAYANYSNNL-ISSKNLVDMNIANEQLADNFLNIAKQVQLINAQVMLDCCNELVAPSLKQAASCSEKIAKKINSN---------------------------------------------------------------------------------
      RFI_34923_Reticulomyxa_filosa_569334903  ---------------------------------------------------------------------------------------------------------KKTTGFLDKELIREILDKNNPC-IKEHVVPDEVYYQKAIKINEILHNGSINKNNRARVVATMLLALLKDNYINRENNCFS------MINELNSRAE----EILHEKDKRGFIHCIKISVPPTPDNHIKFRKALLEAVQELDSINIRSAMNSGTDILGRFYEQFL-KYGNGSKEIGIVLTPRHIARFAVDVLNITNKDKVLDPACGTGGFLVSAFDKVKSEVDK-----EEL--EKFKTEGIYGIEQDPEVVALALVNMIFRGDGRANLEEGNC-----FTSKK-----FIDLKVSKVLMNPPF-ALKKSDEK-----EYKFIDF------------ALNKMEKGGYLFAVIPSS-------------VMFRSKNFKDWRVKMLENNTLKAVIKLPDALFYPVS---VCTSAIIIQKGVKHDKNAN--------------GVMINSQKNNNIEEIRNALASHLNSIKLGQSKQQFIQKPIDFDKYLECSSEAYLED---KDYSKDEI-----EAQAKIPIDFY-FKIQKCQSGNLENYPTG-DIPFVSNTSLNNGVVKYVVLSEEKELIKNVPCI--AINGFGFATIQTHPFIGSGNGG-----GYVSALIPKKEMTMLELAYYAG---QLNLKSWCFSYGRRAVKHRLSAIKLSEFKKESINPNILSNIKNDLVSEIDSFVKKINSKSDTTWLIMNELKRRGYELFYYIPTNLIQVNGKILAIGNFIKIKKYQPMVYEIGKRQTLNLEDASAILIRQNPPLNMEYLTSTYLLEVIKHKVLILNNPSQIRNCPEKLFVTNFPQFCPPTIIASNYNHEVKKFITSHKEVVIKPIYDFGGSYIKKISLRSKNIKEIIKKYQLKFGNFIIQKFLPFVIEGDKRIILLDGEILGAIKRIPKAGDFRANLVVGGKAAKVEITKNDLVICRTLRPELKRRGLMIAGIDIIGNNLIEINVTSPTGLVAINKLYNQKSEVYVVNAIERKLKDAIKTYG
      RFI_21063_Reticulomyxa_filosa_569384171  ------------------------------------------------------------------------------------------------------------------------------------------------MKHNVLLQSCLCGRFSE--FANILFLKLLSEGNEKS-----------WWSDIK------------SQSNDY-------------------------IINEIDPLVLS---SIDSDIKGDAFEYFLEKTTSTENDLGEYFTPRNVVKTIINLVDPKFKETVYDPFCGTGGFLTESFNYIKENNIIEGE--EDL--KRLKQETIYGREVTA-TARIAKMNMILHGDGHAGIQQINTLSNPDYIEKKGGKWIFVKLQINQVIKRMPFIIQRIQTDRGQEFFAYNVQEKLKEYKIKFRPIKPASPHLNGKYSHLMDKLHTWEK---------YYNKGRPHSALQGKTPWEKYKELEPQIP--TIEEVH----LNYEV---------------------SQE---NFVPQSYNVH--KNIQDI--KRGKSYN--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------<----ATP grasp in sequence above ------------------------------------------------------------------------------------>-------------------------------------------------------------------------------
      RFI_02175_Reticulomyxa_filosa_569435738  -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DKVLDPACGTGGFLVSAFDKVKSEVDK-----EEL--EKFKTEGIYGIEQDPEVVALALVNMIFRGDGRANLEEGNC-----FTSKK-----FIDLKVSKVLMNPPF-ALKKSDEK-----EYKFIDF------------ALNKMEKGGYLFAVIPSS-------------VMFRSKNFKDWRVKMLENNTLKAVIKLPDALFYPVS---VCTSAIIIQKGVKHDKNANVLWGWLKDGFVKKKGVMINSQKNNNIEEIRNALASHLNSIKLGQSKQQFIQKPIDFDKYLECSPEAYLED---KDYSKDEI-----EAQAKIVLQNL-ISFKLCSQ--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RFI_38478_Reticulomyxa_filosa_569312235  ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MYLNVSNPKNETVFGGEVST-IFRIAKMNMILAGDGHSNIARQDS-----YEKKQTNK-------LDVVITNIPF-GNKMKTDY---LSQYGYNGKSAEICGVLHCLDALNNQNENARAGIIVPEGI------------LFNGNKAYTQLRRDLVEKYSLENVVSLPKRTFVDVG----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      consensus/100%                           .............................................................................................................................................................................................................................................................................................................................................b.L.....KpEslaG.E.ss..h.lAbhNMIh.GDG+usl.b.ss.....a.pKb..........ls.Vl.p.PF...+bps-b......Y.h..b............shs...ps.b..hh.................hb..sbs.p.hb.c...p..bc...pbs...h..l...........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
      
      Back to Contents
    • General notes, phyletic distribution and domain architectures of the Group2-Clade5/Reticulomyxa-like N6-MTase

      General notes:

      These methyltransferases are real insertions. Their gene neighborhoods have eukaryotic sequences. Further, they are interrupted by introns. A comparison of closely related sequences suggest that they wree derived from bacteria in particular intracellualr symbionts like Ricketssia. In many bacteria,the three domains (RE, MEthylase and Methylase S) are fused in a single protein.
      # 1; Eukaryotic versions
      569312235       N6-MTase                                                     N6_Mtase                                                     RFI_38478         146  eukaryota>rhizaria                           Reticulomyxa filosa                  restriction-modification protein, partial [Reticulomyxa filosa].
      569334903       N6-MTase+ATP-grasp!                                          N6_Mtase+Methyltransf_26+Eco57I+GSH-S_N+GSH-S_ATP            RFI_34923         892  eukaryota>rhizaria                           Reticulomyxa filosa                  N-6 DNA methylase, partial [Reticulomyxa filosa].         
      569384171       N6-MTase+Reverse-transcriptase                               N6_Mtase+rve+rve_3                                           RFI_21063         326  eukaryota>rhizaria                           Reticulomyxa filosa                  Type I restriction-modification system methyltransferase subunit [Reticulomyxa filosa].
      569404634       REase+N6-MTase+RAGNYA+helix+RAGNYA+helix                     HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S                RFI_15181         978  eukaryota>rhizaria                           Reticulomyxa filosa                  type I restriction-modification system methyltransferase subunit [Reticulomyxa filosa].
      569435738       N6-MTase                                                     N6_Mtase                                                     RFI_02175         275  eukaryota>rhizaria                           Reticulomyxa filosa                  N-6 DNA methylase, partial [Reticulomyxa filosa].         
      # 1; Prokaryotic homologs
      345528927      REase+N6-MTase+RAGNYA+helix                                   HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S                FBFL15_0854       748  bacteria>bacteroidetes                       Flavobacterium branchiophilum FL-15  Probable type I modification methyltransferase [Flavobacterium branchiophilum FL-15].
      495892824      REase+N6-MTase+RAGNYA+helix                                   HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S                -                 977  bacteria>bacteroidetes                       Paraprevotella clara                 restriction endonuclease subunit M [Paraprevotella clara].
      91069778                                                                     HSDR_N_2+N6_Mtase+Methyltransf_26+Eco57I                     RBE_1419          517  bacteria>proteobacteria>alphaproteobacteria  Rickettsia bellii RML369-C           Type I restriction-modification system methyltransferase subunit [Rickettsia bellii RML369-C].
      746565283      REase+N6-MTase+RAGNYA+helix                                   HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S                -                 790  bacteria>proteobacteria>alphaproteobacteria  Rickettsia felis                     hypothetical protein [Rickettsia felis].                  
      67005333       N6-MTase                                                      N6_Mtase+Methyltransf_26                                     RF_p07            332  bacteria>proteobacteria>alphaproteobacteria  Rickettsia felis URRWXCal2           Type I restriction-modification system methyltransferase subunit (plasmid) [Rickettsia felis URRWXCal2].
      827052825      REase+N6-MTase                                                HSDR_N_2+N6_Mtase+Methyltransf_26                            MRECE_1c072       584  bacteria>tenericutes                         Mycoplasmataceae bacterium CE_OT135  type I restriction endonuclease subunit M [Mycoplasmataceae bacterium CE_OT135].
      823691316      N6-MTase                                                      N6_Mtase+Methyltransf_26                                     -                 576  bacteria>spirochaetes                        Brachyspira hyodysenteriae           hypothetical protein, partial [Brachyspira hyodysenteriae].
      763152770      REase+N6-MTase                                                HSDR_N_2+N6_Mtase+Methyltransf_26                            -                 531  bacteria>bacteroidetes                       Flavobacterium branchiophilum        hypothetical protein, partial [Flavobacterium branchiophilum].
      490962086      REase+N6-MTase+RAGNYA+helix+RAGNYA+helix                      HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S    -                 983  bacteria>firmicutes                          Peptoniphilus lacrimalis             restriction endonuclease subunit M [Peptoniphilus lacrimalis].
      480765781      REase+N6-MTase+RAGNYA+helix+RAGNYA+helix                      HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S    HMPREF1083_02457  968  bacteria>firmicutes                          [Clostridium] clostridioforme 90A6   hypothetical protein HMPREF1083_02457 [[Clostridium] clostridioforme 90A6].
      740438970      REase+N6-MTase+RAGNYA+helix+RAGNYA+helix                      HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S    -                 966  bacteria>firmicutes                          [Clostridium] clostridioforme        restriction endonuclease subunit M [[Clostridium] clostridioforme].
      546935999      REase+N6-MTase+RAGNYA+helix+RAGNYA+helix                      HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S    -                 981  bacteria>firmicutes                          Clostridium sp. CAG:81               type I restriction modification DNA specificity domain protein [Clostridium sp. CAG:81].
      
      Back to Contents
    • Multiple sequence alignment of the Group2-Clade6/Rhizophagus-RirG_033390-likeN6-MTase

                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       <-----Methylase core----                                    Str-1                                                Str-2                                                     Str-3                                        Str-4                                                                                                                                            Str-5                                 Str-6                 Str-7                                                                                                                                                                                                                                                                  <------------------------------------------------------------------------------------------------------TaqIC----------------------------------------------------------------------------------------------------------------------------------->                                                                                                                                                                                                                                                                                         
      RES                                                                       MFEQSDYSRYQQIKSQTQRWHHDKTLYHSDCVVASIQHEAIRQAKLQHAANDHAYEAFFLRWLFVYICFMQGRFGGLGSGKRKQQVHSGGVA-------------NRMTKTHDLESPTDNHKRRRPNDD-EDA---------ISEEEINAIVAILGNQLLIVT-----------------------KS-IMVEKE---SYTPSAAVTFLLRRMDVVDAQD---------------------------------------------------------------------------------------------------------------------ARSTEIYCLMTAFMTVYLCFV-------------------RVLIQQ-------PE----FDHFMQLFNHHDSLLNQ----DAQP--------------------------FLWFYALHHM------PIVDDLHKQLSTSTK------QCAMANIDSIFSKFYTQYFLEMSAAKHQKDHGQYY|TPRSVLRFMWDRCATV-----SHLIQLLQQQQTG------------------------MCRVFD-PCLGIGSFLCEFLTRFTKAC--RF----TVWSDPQRLTQLLLQDIPDHIFGIEIDPFAYQLCKMNMMVHLYPLYQRLCELGVQLPP------------Q--SIHRFKLFCNDTLKLNVESNPFRNAETVD---PFEKHWLDQLRDTCKLKFDFIVTNPPYMIRKTGFITQPDPAIYDESRLGGAKL------------------------------------------------------------------------------------------SQAYLYFMWIALQRCDDTQGQVCLITPSQWIVLEFAQQLRT-------WIWEHCKLLDIYEF----EPFKVWPKVQTDSLIFRI------CKRTSSLPN-------------SN-HTLYLRHVGKNMTLMHLLDIYRHFRP-DQQLLCTDNAL---------KYKHTPLTEHNRQLKTKH-------SSFSFLLPSVSFLDQLESMTQHLGRICDTDPAAQ----KANTA-APLIW-NRGPNTNPVYSLV-VRTAWARVTFGKETCDRWLKPCFYWNGKTI--SSATGG--GKEGEFWRH-RDPLRLGKKETSAAEAYLPY--------------CGVDVP-FYSMILVNREDADRLKEDFNNK-GPWSALYLYLH--DARVALQADKKEED----------------IANCQYNKCGL-VPVKIIHPINCGYFTRSQPRPRFFIDRQEMAVTNQ-CIYFTIKPDYPW----Q--DPDYYCGLLNSTLIMFFIKLHCSYDQQGRMRFFGRLMAYVPFAPPPSLEFMQQ--VATLVQGVTLARSCLYPFLHYCKGGQR----LLERVRN----FEWHLTSIES-----------------------------------------------GIVRQFE-----PPADWRQGISTNTAELHWIIDFIHT-----LNK-DNAHDIFIALLKLNSLFQLAIDQMIYHLYRIPQALQLEIEHDLKLDNLRQEW-PHVS-LQIPNEEEHKSSTNISVWYQSTLSMAKSFIDLSNE
      FINAL                                                                     ------------------------------------HHHHHHHHH--------------------------------------EE--------------------------------------------------------------HHHHHHHHHHHHHHHH-----------------------HH-HHHHHH---H-----HHHHHHHHHHHH----------------------------------------------------------------------------------------------------------------------------EEEEEEEE-HHHHHHHHH-------------------HHH----------------HHHHHHHH--------------------------------------------EEHHHHHHH------HHHHHHHHHHHH--H------HHHHHHHHHHHHHHHHHHHHHHHHHHH--------|--HHHHHHHHHHHH--------HHHH---------------------------------EEEE--------HHHHHHHHHHHHHH--HH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------H--HHHHHHHHHHH-EEE------HHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---EEEE----E--HHHHHHHHHHHHH---------------------------------------------------------------------------------------------HHHHHHHHHHHHHH------EEEEEEHHHHHHHHHHHHHHH-------HHHHHHHHHHHEE-----------EEEHHHHHHHHH------HHH-----H-------------HH-HHEEEEE------HHHHHHHHH-----HHHHH----HH---------HH---------------H-------HHHHHHHHHHHHHHHHHHHHHH-----------------HHHH-HHHHH-HHHHH---HHHHH-HHHHHHHHH---HHHHHHHHHHHEE-------EEEE------EEEEEE----EEEEEEEE---HHHHHHH--------------HHHHHH-HHHHH------EEEEEEE-------HHHHHHHHH--HHHHHHHH-----E----------------E-HHHHHHHHH-HHHHHHH-----EEEE-----EEEEE---EEEEE-----EEE-----------------EEEEEE-HHHHH-------------EEEHHHHHEE-----------HHH--HHHHHHHHHHH--------------------HHHHHHH----H--------------------------------------------------------HHHHHHH-------HHHHH----HHHHHHH-------------------HHHHHHHHH------HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH----H-HHHHHHHH------HHHHHHHHHH-----------
      ALIGN                                                                     ---------------------------------HHHHHHHHHHH----------HHHHHHHHH--EEE------------------------------------------------------------------------------HHHHHHHHH--HHHHHH-----------------------HH-HHHHH---------HHHHHHHHH--------------------------------------------------------------------------------------------------------------------------------HHHHHHHHHHHHHHHHHH-------------------HHHHH--------------HHHHEHH--------------------------------------------HHHHHHHHHH------HHHHHHHHHHHH----------EEEE--HHHHHHHHHHHHHHHHHHHHH-------|-----HEEHHHHHH---------HHHHHHHH-----------------------------------------HHHHHHHHHHHHH--HH----HH---HHHHHHHHHH---------H--HHHHHHHHHHHHHH--HHHHHHHHHHH-------------------HHHHHHHH------EE----------------HHHHHHHHHH--------EEEE----EEEE----------HHHHHHHH----H------------------------------------------------------------------------------------------HHHHHHHHHHHHHH-------EEEE---HHHHHHHHHHHHH-------HHHHHHHHHHHHHH--------E-------HHHHHH------HH--------------------HH-HHHHHHHH-----HHHHHHHH----------------H---------HH-------------------------EEEE-----HHHHHHHHH--------------------------EEE----------EEEE-EEHHHHHH--------------EEE-------------------EEE-----HHHHHH-----H---------------------------EEEEEEE----HHHHHHH------HHHHHHHHHH--HHHHHHH------E----------------EEEEE-------EEEEEEE---------------EEE----EEEE---EEEEEE-----------------HHHHHHHHHHHHHHHHHHH-----HHHHHHHH-------------HHHH--HHHHHHHHHHHHHHHHHHHHH-----H----HEHHHHH----H-HHH---HH-----------------------------------------------HHHHH-----------------HHHHHHHHHHHHHHH-----------HHHHHHHHHHHH-HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH-------------------------HHHHHH-----------
      HMM                                                                       --HHHHHHHHHHHH-HH-HHHHHHHHH---EEEE--H-HHHHHHHHHHHHHHHHH-HEE---HHHHHHHHHHH--HHEEEEEHHHHHHHHHH-------------HHHHHHHHHHHHHHHHH--------HEE---------EEEEEEH--EEEE------E-----------------------------HHHH---HHHHHH------EEEEEEEE------------------------------------------------------------------------------------------------------------------------EEEE--EEEE---HHHHHHH-------------------HHH----------HH----HHHHHHHHHHHHH----------EE--------------------------EEEE-HHHHH------HHHHHHHHHHHHHHH------HHHHHHHH---EEEEEE--HHHHHHHHH-------|--HHHHHHHHHHHHHH-----HHHHHHHHHH----------------------------HHEHH-HH----HHHHHHHHHHHHH----E----EEE--HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------H--HHHHHHHHHHH-EEEE-----------HH---HHHHHHHHH------EEEEEEEE---EEEEE-EEEE----HHHHHHHHH---H------------------------------------------------------------------------------------------HHHHHHHHHHHHHH------EEEEE--HHHHHHHHHHHHHH-------HHHHHHHHHHHEE--------EEEEEE--HHHHHHH------HH-----------------------EEEEEEE------HHHHHHHHH-------------------------EEEE----------------------EEEEEE--HHHHHHHHHHHH--HHH------------------EEEEE-E-------EEEEE-HHHHHHHHH--HHHHHHEEEEEEEE---EE--EEE-------EEEEEE----EEEEEE-----HHHHHHH--------------HHH----EEEEEEEHHHHHHHHHH--------HHHHHHHHH--HHHHHHH-----EE----------------EEEE----EEE-EEEEEEEEE---EEEE----EEEEEE---EEEEEE-EEEEEE--------------HHHHHHEHHHHHHHHHHHH--------EEEEHHHHHHH--------H--HHH--HHHHHHHHHHHHHHHHHHHHHHH---H----EEHHHHH----HHEE-----H-----------------------------------------------HHHH----------------H-HHHHHHHHHHHHHH--------H-HHHHHHHHHHHHHH-HHHHHHHHHHHHHH----HHHHHHHH---HHHHHHH--------E----------HHHHHHHHHHHHHHHHHH-----
      FREQ                                                                      ----------------------------------------H-H---------------------------------------------------------------------------------------------------------EEEEEEE--HEEHHH-----------------------HH-HHHHHH---------HHHHHHHHHHH-----------------------------------------------------------------------------------------------------------------------------EEEEEEEHEEEEHHHHHH-------------------HEEE---------------HH-HHEE---------------------------------------------EEEEEE---------HHHHHHHHHH-----------HHEHHHHHHHHHHHHHHHHHHHHHHHH--------|---EEEEEEEEEE---------EEE----------------------------------EEEE---------HHHHHHHHHHHHH--HH----HHHHHH------HHHHHHHHHHHHHHHH---HHHHHHHHHHHHH-HHHHHHHHHHHHH------------H--HH-HHHHHHHH-HHE------HHHHHHHH---HHHHHHHH---HHHHHHHHHHH------------HHHHHHHHHHHHHHHH---------------------------------------------------------------------------------------------HHHHHHHHHHHHH------HH-HHHHHHHHHHHHHH---HH-------HHHHHHHHHHHHH------------HHHHHHHHHHH------HHHH---HH-------------HH-HHHHHHHH-----HHHHHHHHH-----HHHHH---HHH---------HHHHHHEE-------HHH-------HHHHHHHHHHHHH---HHHHHHH-HHEE----------HHHHH-HHHHH-HHHHH----HHHH-HHHHHHHH-----HHHHHHHHHHHHH-------HHHHH--HHHHHHH-----HHHHHHHHH---HHHHHH--------------HHHHHH-HHHHH-----EEEEEE---------HHHHHHHHH--HHHHHHHH-----------------------HHHHHHHHHH-HHHHHHH-----EEEE------EEEE--HHHHH-------EEE--------------E-EEEEEEE------------------EEEEEEEEEEE------------H--HHHHHHHHH--------E--------------EEEEEE----E----------------------------------------------------------HEEEE-----HHHHHH----------------------------------EEEE----------------------HHH-HHHHHHHHH-----HH-HHHH-HHHHHEEE------HHHHHHHHH------EEE---
      PSSM                                                                      ---------E-----------------------HHHHHHHHHHHHH----HHH-HHHH--------------------------EE----------------------------------------------------------HHHHHHHHHHHHHHHHHHH-----------------------HH-HHH------------HHHHHHHHHHHHHHH--------------------------------------------------------------------------------------------------------------------------EEEEEHHHHHHHHHHHH-------------------HHH----------------HHHHHHH---------------------------------------------HHHHHHHHH------HHHHHHHHHHHH--H------HHHHHHHHHHHHHHHHHHHHHH-HHHH--------|---HHHHHHHHHHH-----------------------------------------------EEE-------HHHHHHHHHHHHHH--HH----HH------HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHH--H-------------------------HEEH----------------------HHHHHHHHHHHHHHHHHHHHHHH--HHHHH--------HHHHHHHHHHH----------------------------------------------------------------------------------------------HHHHHHHHHHHHHH------EEEEEE--HHH--HHHHHHHH-------HHHH----EEEEE-----------E-----HHHEEE------E------------------------EEEEEEE------HHHHHHH---------------------------EEE-----------------------EEE------HHHHHHHHHHHH-------------------------EE-E-------HHHHH-HHHHHHHH---EEEE-------------EE--EEEEE----EEEEEEE-----EEEEEEE---HHHHHHH--------------HHHHHH-HHHHHH-----EEEEEEE-------EEEEEHHHH--HHHHHHH-------------------------HHHHHHHHH-HHHHHH-------EE------EEEEE---EEE----------------------------EEEE---HHHHHH----------HHHHHHHHHH------------HHH--HHHHHHHHHHH--------------------HHHHHHH----H--------------------------------------------------------HHHHHH-----------------HHHHHHH-------------------HHHHHHHHHH------------H--------HHHHHHHHHHHHHHHHHH--------------------HHHHEH-HH------------
      MAM1_0525c10839_Mucor_ambiguus_758346042                                  MFEQSDYSRYQQIKSQTQRWHHDKTLYHSDCVVASIQHEAIRQAKLQHAANDHAYEAFFLRWLFVYICFMQGRFGGLGSGKRKQQVHSGGVA-------------NRMTKTHDLESPTDNHKRRRPNDD-EDA---------ISEEEINAIVAILGNQLLIVT-----------------------KS-IMVEKE---SYTPSAAVTFLLRRMDVVDAQD---------------------------------------------------------------------------------------------------------------------ARSTEIYCLMTAFMTVYLCFV-------------------RVLIQQ-------PE----FDHFMQLFNHHDSLLNQ----DAQP--------------------------FLWFYALHHM------PIVDDLHKQLSTSTK------QCAMANIDSIFSKFYTQYFLEMSAAKHQKDHGQYY|TPRSVLRFMWDRCATV-----SHLIQLLQQQQTG------------------------MCRVFD-PCLGIGSFLCEFLTRFTKAC--RF----TVWSDPQRLTQLLLQDIPDHIFGIEIDPFAYQLCKMNMMVHLYPLYQRLCELGVQLPP------------Q--SIHRFKLFCNDTLKLNVESNPFRNAETVD---PFEKHWLDQLRDTCKLKFDFIVTNPPYMIRKTGFITQPDPAIYDESRLGGAKL------------------------------------------------------------------------------------------SQAYLYFMWIALQRCDDTQGQVCLITPSQWIVLEFAQQLRT-------WIWEHCKLLDIYEF----EPFKVWPKVQTDSLIFRI------CKRTSSLPN-------------SN-HTLYLRHVGKNMTLMHLLDIYRHF---RP-DQQLLCTDNAL------KYKHTPLTEHNRQLKTKH-------SSFSFLLPSVSFLDQLESMTQHLGRICDTDPAAQ----KANTA-APLIW-NRGPNTNPVYSLV-VRTAWARVTFGKETCDRWLKPCFYWNGKTI--SSATGG--GKEGEFWRH-RDPLRLGKKETSAAEAYLPY--------------CGVDVP-FYSMILVNREDADRLKEDFNNK-GPWSALYLYLH--DARVALQADKKEED----------------IANCQYNKCGL-VPVKIIHPINCGYFTRSQPRPRFFIDRQEMAVTNQ-CIYFTIKPDYPW----Q--DPDYYCGLLNSTLIMFFIKLHCSYDQQGRMRFFGRLMAYVPFAPPPSLEFMQQ--VATLVQGVTLARSCLYPFLHYCKGGQR----LLERVRN----FEWHLTSIES-----------------------------------------------GIVRQFE-----PPADWRQGISTNTAELHWIIDFIHT-----LNK-DNAHDIFIALLKLNSLFQLAIDQMIYHLYRIPQALQLEIEHDLKLDNLRQEW-PHVS-LQIPNEEEHKSSTNISVWYQSTLSMAKSFIDLSNE
      PARPA_06902.1_scaffold_25125_Parasitella_parasitica_758364669             ------------------------------------------------------------------------------------------MT-------------HKLVSAPPIPATVKN-KRQRPN---EKP---------LGEEDIHVIVTMLGNQLLNVK-----------------------KL-IMVEKD---TYTPSPAVTFLLRRMDV-DTQD---------------------------------------------------------------------------------------------------------------------ARSNEIYCLMTAFMTVYLCFI-------------------RLLIRQ-------QD----FDDFMKLFNDHDSLLNQ----ENQP--------------------------FLWFYAVHHM------SVEDDLHKQLAAAGRNVHSYSHLILADIDSIFSKFYTHYFLEISMAKHQKDHGQFY|TPQAVLRFMWDKCADL-----QHLVRQLQQK--S------------------------LCSVFD-PCLGIGSFICEFLTRLIKAC--QF----TVWDDPQQLAHLLLHDIPDHIYGIEIDPFAYQLCKLNMMVHLFPLYRRINELQVRLPP------------K--SIHRLRLFCNDTLKLTVESNPFWNTGSVD---PFEKHWLDQLRDASKLKFDFVVTNPPYMIRKTGFVTQPDPAIYDESKLGGAKI------------------------------------------------------------------------------------------SQAYLYFMWVALQRCDDANGQVCLITPSQWMVLEFAEQLRN-------WIWSNCKLLDIYEF----EPYKVWPKVQTDSLIFRL------CKRSSTMLN-------------SN-HTLYLRHVGRNTNLIQLLDIYQNF---RP-GQPS--VDLSL------KYKLTAFTQENRVLQTNH-------SSFSFLLPSVSFLDQLNSITQHLGRICDSDLSSS----S---K-APLVW-NRGPNTNPVYSLV-VRTRWAIETFGQETCDRWLKPCFYWNGKTI--YSATGG--GKEGAFWKD-RDPCRLDKKETSAAEAYLPY--------------YNANVP-FYSMIMVNKEDAEKLKENHANN-GTWSALYSYLR--DARIALQADKNEED----------------IANCQYSKSGL-VPVKIIHPINCGYFTRSQPRPRFFVDKREMTVTNQ-CIYFTIKPDYPW----Q--DPDYFCGLLNSTLLMFFIKLHCSYDQQGRMRFFGRLMAYIPFAPPPSVDFMRQ--VAGFVQAVTMARSCLYIIIRYSEGGQK----LIERVRN----FEWHLTQEEL-----------------------------------------------AILGHFQ-----PPLDYKDSLS-NVTQLGWVIELFKK-----ANALENTQDAFIVMLKLNSLFQLAIDQMIYHLYKIPESLQLEIEHDLKLDNVRQEW---LA-YRIPTLTN---SIILEEWYSSILSIAKSLV-----
      HMPREF1544_04357_Mucor_circinelloides_f_circinelloides_1006PhL_511007562  -----------------------------------------------------------------------------------------------------------MTTTPYPDSPIANCKRSRPNDDDEEA---------INQEEINAIVNTLGNQLLIVK-----------------------KS-IMIEKE---SYTPSAAVTFLLRRMDVVDSQD---------------------------------------------------------------------------------------------------------------------ARSTEIYCLMTAFMTVYLCFI-------------------QVLIPQ-------TD----FDRFMQLFNDHDSLLNQ----ETQP--------------------------FLWYYAVHHM------SIVDDLHKQLSTSTK------HLVMANLDSIFSKFYTHYFLEISAAKHQKDHGQYY|TPRSVLRFMWDRCATV-----PHLIQLLQQRQTG------------------------ICRVFD-PCLGIGSFLCEFLTRFIKAC--RF----TIWNDPQRLTELLLQDIPDHIFGIEIDPFAYQLCKMNMMVHLYPLYQRVCELGIQLPP------------R--SIHRLRLFCNDTLKLKVESNPFWNSDNVD---QFEKHWLDQLRDACNLKFDFIVTNPPYMIRKTGFITQPDPAIYDESRLGGAKL------------------------------------------------------------------------------------------SQAYLYFMWIALQRCDDTNGQVCLITPSQWMVLEFAEQLRQNLDLGSLQIARHLRVRTIQSMAQSANRFTYFPPMQTHKSTTKI--------RSHIIPS-------------PY--------------------IYQNF---RP-DQPP--TDSSL------KYKHTPLTDHTTSLKTKH-------SSFSFLLPSVSFLDRLESITQHLGRICDIDPAKL----N---T-APLIW-NRGPNTNPVYSLV-VRTDWAIKTLGQETCDRWLKPCFYWNGKTI--SSTTGG--GKEGEFWKS-RDPVRLSKKETSAAEAYVPY--------------YGVGVP-RYSMILVNKEDADKLKENFNNN-GAWSALYLYLR--DARVALQADKKEED----------------IANCQYNKCGV-VPVKIVHPINCGYFTRSQPRPRFFIDKHQMAVTNQ-CIYFTIKPDCPW----Q--DPDYYCGLLNSTLAMFFIKLHCSYDQQGRMRFFGRLMAHVPFAPPPSTEFMQQ--VATFVQGVTLARSCLYPFLRYCKGGQR----LLERVRN----FEWHLTAMES-----------------------------------------------DIVRQFE-----PPANWTEAISCNTAELEWIIDLIHT-----VNQ-DNALNVFIALLKLNSLFQLAVDQMIYHLYRIPQALQSEIEHDLKLDNLRQEWGPNFS-LHIPSDVD--APTNMAAWIQSTLSMAKSFIDS---
      RMCBS344292_07627_Rhizopus_microsporus_729710234                          -------------------------------------------------------------------------------------------------------------------MTITNEQTTTTSGK-VND---------LGERETTTVVNILGLSLVNIL-----------------------QD-IRNSQEKETCASSSETAAFLAQR----SLDD---------------------------------------------------------------------------------------------------------------------KDGLYIHCLMTGFMAVYLTFI-------------------ELVLSD-------S-----FHDFIRLFSPHDSLL-------STP--------------------------FVWYYFKHHV------SLLYHLRIQLDHL--------DISITSIDTIISKFYTHYFLETSAQKHQKDHGQYY|TPKPVIQFMWEKVIAS-----RPLLVHC-----G------------------------IPRIFD-PCLGIGSFLCEYIHRLIEQC--RQ----YVWNDAERLAKLLTQDIPESIWGVEIDPFTYHLCKLNMMVHLFPIYQRLNELQLSLPP------------H--SINRLRLFCNDTLTLRCDN------GNQD---AFEKDCLNLLRDPSKLKFHYIVTNPPYMIRKTGFITQPDPTLYDLQTLGG-RG------------------------------------------------------------------------------------------TQAYVYFMWAALQRIDDQLGQVCLITPSQWTILEFAQHFRE-------WILMNCKLLDMYEF----EPYKIWPKVQTDSLIFRI------CKRTSILDH-------------FE-YTLYLRNKARNTPLADLLQQYRDF---NP-LTN---CNPQL------QFRYSFCLKSCKGM-----------ASFAFILPTTSCLDELNRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHRFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRVQSTED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQIAVTNQ-CIYFTIQPSTPW----K--DYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSARFMHE--LALFVQSITFTRTWLYTFIRHTHSGQR----LMERVRS----YEWHLDEADK-----------------------------------------------AALSHYDTFDIRPDADLA-SFS-YWQSIQWIDDFVQR-----KR--GDAFHCFVVLLKIASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIINTAKSLVD----
      RMATCC62417_07972_Rhizopus_microsporus_727145438                          -------------------------------------------------------------------------------------------------------------------MTITNEQTTTTSGR-VND---------LGERETTTVVNILGLSLVNIL-----------------------QD-IRNSQEKETCASSSETAAFLAQR----SLDD---------------------------------------------------------------------------------------------------------------------KDGLYIHCLMTGFMAVYLTFI-------------------ELVLSD-------S-----FHDFIRLFSPHDSLL-------STP--------------------------FVWYYFKHHV------SLLYHLRIQLDHL--------DISITSIDTIISKFYTHYFLETSAQKHQKDHGQYY|TPKPVIQFMWEKVIAS-----RPLLVHC-----G------------------------IPRIFD-PCLGIGSFLCEYIHRLIEQC--RQ----NVWNDAERLAKLLTQDIPESIWGVEIDPFTYHLCKLNMMVHLFPIYQRLNELQLSLPS------------H--SINRLRLFCNDTLTLRCDN------GNQD---AFEKDCLNLLRDPSKLKFHYIVTNPPYMIRKTGFITQPDPTLYDLQTLGG-RG------------------------------------------------------------------------------------------TQAYVYFMWAALQRIDDQLGQVCLITPSQWTILEFAQHFRE-------WILMNCKLLDMYEF----EPYKIWPKVQTDSLIFRI------CKRTSILDH-------------FE-YTLYLRNKARNTPLADLLQQYRDF---NP-LTN---CNPQL------QFRYSFCLKSCKGM-----------ASFAFILPTTSCLDELNRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHRFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRVQSTED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQIAVTNQ-CIYFTIQPSTPW----Q--DYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSARFMHE--LALFVQSITFTRTWLYTFIRHAHSGQR----LMERVRS----YEWHLDEADK-----------------------------------------------AALSHYDTFDIRPDADLA-SFS-YWQSIQWIDDFVQR-----KR--GDAFHCFVVLLKIASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIINTAKSLVD----
      RMCBS344292_12151_Rhizopus_microsporus_729705342                          -------------------------------------------------------------------------------------------------------------------MTITNEQTTTTNGK-VND---------LGERETTTVVNILGLSLVNIL-----------------------QD-IRNSQE--TCASSSETATFLAQR----SLDD---------------------------------------------------------------------------------------------------------------------KDRIYIHSLMTGFMAVYLTFI-------------------ELVLSD-------S-----FHDFMRLFSLHDSLL-------STP--------------------------FVWYYYKHHV------SLLYHLRIQLGHL--------DISITSIDTIISKFYTHYFLETSAQKHQKDHGQYY|TPKPVIQFMWEKVIAS-----RPLLVHC-----D------------------------IPRIFD-PCLGIGSFLCEYIHRLIEQC--RH----YVWNDAERLAKLLTQDIPESIWGVEIDPFTYHLCKLNMMVHLFPIYQRLNELQLSLPP------------H--SINRLRLFCNDTLTLRSDD------GNQD---AFEQDCLNLLRDPSKLKFHYIVTNPPYMIRKTGFITQPDPTLYDLQTLGG-KG------------------------------------------------------------------------------------------TQAYVYFMWAALQRIDDQLGQVCLITPSQWTILEFAQHFRE-------WILMNCKLLDMYEF----EPYKIWPKIQTDSLIFRI------CKRKSILDH-------------FE-YTLYLRNKARNTPLADLLQQYRDF---NP-LTN---CNPQL------QFRYSFCLKSCKDM-----------ASFAFILPTTSCLDELNRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHKFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRAQSTED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQIAVTNQ-CIYFTIQPSTPW----Q--DYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSTRFMHE--LALFVQSVTFTRTWLYTFIRHTHSGQR----LMERVRS----YEWRLDEADK-----------------------------------------------AALSHYDTLDIRPDADLA-SFS-CWQSIQWIDDFVQR-----KR--GNAFHCFVILLKIASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIMNTAKSLVD----
      LRAMOSA03076_Absidia_idahoensis_var_thermophila_671695680                 ----------------------------------------------------------------------------MDQQQCKQHEQPLSLL-------------NRAASPASP-IPSSN-ERKRSY---IHM---------NEVDVVKDVVAMLGDVLERIN-----------------------DAWV--------------------------QQED---------------------------------------------------------------------------------------------------------------------EETRRMRGLVTTYVAFVEAVA-------------------DDLLHN--DNNDRP-----FAAYMAIFDPHDQMA------DDDP--------------------------LLHHDTLFRR------SLAKDIRHRLNQLGV--QS--ESLLKCVDTIFARFYT----NKAPPQQQKDHGQFY|TPQTVVRFMWEQCLAN-----NN------KK--H------------------------VPRVLD-PCMGMGAFLCEFLTRWVMQL--DS----ATWDNAVALEQVLCTDIPAHIWGVELDPVALRLAKLNILVHLLPLYRRLRQLTQQNTL------------T-LRVDRLHLFCSDTLRLTP-V-------GDD---PWEHVELQRLH-SGHLVFDYIVTNPPYMIRKTGRITDPDPALYDSRILGG-RG------------------------------------------------------------------------------------------VQAYVYFMWICLQRCDPHDGELCLITPSQWLVLEFARHLRA-------WIWEHCELLQLFQL----EPYKVWPRVQTDSLIFRL------RMRGTRPPN-------------LNTHTLFLRHTARRATLQDILAAYTTF---NPHQQP---PSSDI------AYKYTPTHDRSRIQNSPN-------ASFAFLSPSTSLTGELAQLTHSLSRLCDGP------------G-APLVF-HRGPNTHPVYALV-VRTQWARDYFGPQCCSRWLRPAFYWSGK-----AAGTN--DPESIFWHL-RDPQRLARKETSPAEAYAPF--------------YAPDA--NYSLLLVDKEGADKLESSAATL-DQDARLYEYLQ--AARVALQPTREERK----------------VTWCHYNQSGADVAVKIVHPINCGYFTKSQPRQRFFVDRHQLCVTNQ-CMYFTLSPETDL-------SAEFFCGILNSSTVQFFLREHCAYDQQRRTRFFGRHLANIPCCSLPVASSFEATLMTDLVHAVTISRLWIYAIVWYT-DAQH----VIEHLRA----GTWDIHPGDM-----------------------------------------------PRVSAVSVQD-LNHSHHTSAWS-NDIRSHWISRVLDS-----RQH-TSLDIILVQLLQLASLFQYGIDQLTFVLYHVPIPLQRALEHELELVQATARW-S--H-VSLND----------------IFDTAQAILH--ND
      LCOR_02075.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661187564              -----------------------------------MNSEIIEDQQYEPAAKRRKL---------------------PNQQKQPQIQQPLSSS-------------SKQLVPPNP-LGSSYIERKRAY---VKV---------SGGNLVQDTLAILGGVFTRVN-----------------------TA-L--------------------------RQED---------------------------------------------------------------------------------------------------------------------EGTRHSRTLMTTFAVFTEAVT-------------------GYFLHDVSKPDERP-----FATYMAVFSPQDQLT------DGDP--------------------------LLHPDVAFRQ------SLAMDIQYQLERIGV--QR--DKLYDCIDTLFSRFYA----DKTTSVHQKNKGQFF|TPQNVVHFMWGRCLDH-----KN------KN--Q------------------------MPRVLD-PCMGLGAFLCDFLRRWVTQLQRDG----ATWDNAGVLQQALCTDIPANVWGVEVDPVALKLSKLNAMLHVLPLYRRLRQLTGNNTD------------GFLRVNRLHLFCNNTLQLDPST-------GID---AWEQQELQVLR-SGY--FDYVITNPPYVSQKGKCFAVPDPALYDELVLGD-CI------------------------------------------------------------------------------------------KQAYAFFIWFCLQRCDPQEGEVCFITPSHWMTSEHDYNLRI-------WIWENCEFLQLYYF----KSAKVWPRLNTDSMIFRL------RMRGTRPPD-------------LNARSLYVRNMEIGLPLQDILDSYVAF---DP-QQP---QPKHI------KFKLTPTHDPQRIHQSSG-------AKLTFLCLS-PLTDELQEFTKSMTRLCNSR------------G-APLEF-TSGCSPMPRYGLV-VRTKWALENFGTRCYARWFRPAFYWSSK-----GARSK--GCEVDFWRV-RDPERIVKRELPPSEAFVPF--------------YTAEDAKKYSLILVDKDGIAELEVSTDP---EEERLYEYLQ--EARAEMQP-GNKRK----------------AIFSPFFRSGVDEAVKIVHPALNGYQSKYTPRQRFFVDRDQLGVTDQ-CGFFTLARGVDL-------SPEFFCGLLNSSTLLYLLRHYCTYDQEQRMYFYERHLKNVPCCDLPSASSQEAALMNDLVHAVTTARVWINAIVQAS-GAQY----ITTSLRN----CTWDIHPDDF-----------------------------------------------SAIDGLLIQDFLTSPEHTEGWS-DEVKNHWISQSLFS-----RPH-ARLVVVLVQLLMLSSLFQFGIDQLAYVLYRIPGNMQHAMEEEMEHVEFREQW-S--H-VTLDD----------------IFDTAQSILQ----
      RirG_033390_Rhizophagus_irregularis_DAOM_197198w_595493243                MSNANDNTQTVAPPPKRLKSSDEVSSQSDD-----WLSVVLNEVEVDPRIDTWN-------------CFFQDSHVSRHKPQNSEITFNTPFI-------------FKSVNPAQN-LVNGAESNSQRRQE-FSSNLSSFSTNFLSEDLVGKVVEVLGILLEEVK-----------------------KD-IWITII---NKDHLIPASFLREFNKL-VHDSHPNNVKTPFIKCVEEFRSFISRV---------------------------------QSSYLS-------------------------------------------------PDIELFKNALNSYCQITSFITVVHMLF------DTVVMDFLCENNEEISIRQ--LS---------VETFMNIFHDDDNFLKRNIPFDDQP-----------------PNASENLEAFTWYYRLVVRSQP---QYQCDLSVRFHSLNI--H---FTTLEPIESILSILYTKHFLNVLAKEQQKDHGQFY|TPREVMQFMWDRVLIGKGN--RTWIEKL----LG--GSFQNSSLYPNQSSSNWGVLPQAPSVLD-PCMGIGSFLCEYISRLILAA--QQ--CPIVWNNSVAISNLL-HSLSVNLWGIEIDAFAYHLCKINITLHILPLYKRYLHLTSLIND------------L--KLSRLHLFCNDTLNLYLPK-------REH---TWEYENLWLLRSPQRLKFDFIVTNPPYMIRKTGFISEPDSELFDERVLGK-GG------------------------------------------------------------------------------------------MQAYAYFFWFCVERCREEIGEVCLISASQWMGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCGIDVQTKIIHPINYGYFSKNQPRQRFFIDEDNVCVTNQ-CIYFTVKTTTRL-------PPYFFLGILNSTTIQHFLSHHCKYDQQGRMRLFRENMAKIPYAAPAHSDGVEW--FIRCVQRMILARQMLYEGIRICGDEDK----VASTIVEKLRRGAWQLTGREWQEKESNAYNDETSVILREEQEGNWVMVKQCRGRGDGVEYSLCPGSGGEQVARHETEETTEQTTTFHNFS-SAADAAVLKDHRAPSGNNISKT-HIMKPFFESLLYASACLQYAIDQYTYTLYGINAKFQMALEEELKLELFEAIL---NKYPRLNGTAG---NEDGEEKKGKVPEWGERLFE----
      RirG_033390_Rhizophagus_irregularis_DAOM_197198w_595493244                MSNANDNTQTVAPPPKRLKSSDEVSSQSDD-----WLSVVLNEVEVDPRIDTWN-------------CFFQDSHVSRHKPQNSEITFNTPFI-------------FKSVNPAQN-LVNGAESNSQRRQE-FSSNLSSFSTNFLSEDLVGKVVEVLGILLEEVK-----------------------KD-IWITII---NKDHLIPASFLREFNKL-VHDSHPNNVKTPFIKCVEEFRSFISRV---------------------------------QSSYLS-------------------------------------------------PDIELFKNALNSYCQITSFITVVHMLF------DTVVMDFLCENNEEISIRQ--LS---------VETFMNIFHDDDNFLKRNIPFDDQP-----------------PNASENLEAFTWYYRLVVRSQP---QYQCDLSVRFHSLNI--H---FTTLEPIESILSILYTKHFLNVLAKEQQKDHGQFY|TPREVMQFMWDRVLIGKGN--RTWIEKL----LG--GSFQNSSLYPNQSSSNWGVLPQAPSVLD-PCMGIGSFLCEYISRLILAA--QQ--CPIVWNNSVAISNLL-HSLSVNLWGIEIDAFAYHLCKINITLHILPLYKRYLHLTSLIND------------L--KLSRLHLFCNDTLNLYLPK-------REH---TWEYENLWLLRSPQRLKFDFIVTNPPYMIRKTGFISEPDSELFDERVLGK-GG------------------------------------------------------------------------------------------MQAYAYFFWFCVERCREEIGEVCLISASQWMGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCGIDVQTKIIHPINYGYFSKNQPRQRFFIDEDNVCVTNQVC-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RirG_033390_Rhizophagus_irregularis_DAOM_197198w_595493245                MSNANDNTQTVAPPPKRLKSSDEVSSQSDD-----WLSVVLNEVEVDPRIDTWN-------------CFFQDSHVSRHKPQNSEITFNTPFI-------------FKSVNPAQN-LVNGAESNSQRRQE-FSSNLSSFSTNFLSEDLVGKVVEVLGILLEEVK-----------------------KD-IWITII---NKDHLIPASFLREFNKL-VHDSHPNNVKTPFIKCVEEFRSFISRV---------------------------------QSSYLS-------------------------------------------------PDIELFKNALNSYCQITSFITVVHMLF------DTVVMDFLCENNEEISIRQ--LS---------VETFMNIFHDDDNFLKRNIPFDDQP-----------------PNASENLEAFTWYYRLVVRSQP---QYQCDLSVRFHSLNI--H---FTTLEPIESILSILYTKHFLNVLAKEQQKDHGQFY|TPREVMQFMWDRVLIGKGN--RTWIEKL----LG--GSFQNSSLYPNQSSSNWGVLPQAPSVLD-PCMGIGSFLCEYISRLILAA--QQ--CPIVWNNSVAISNLL-HSLSVNLWGIEIDAFAYHLCKINITLHILPLYKRYLHLTSLIND------------L--KLSRLHLFCNDTLNLYLPK-------REH---TWEYENLWLLRSPQRLKFDFIVTNPPYMIRKTGFISEPDSELFDERVLGK-GG------------------------------------------------------------------------------------------MQAYAYFFWFCVERCREEIGEVCLISASQWMGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCG--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      MVEG_04886_Mortierella_verticillata_NRRL_6337_672825191                   TNSPSDPMLRPSSSPSSFPTSSSASIPMPH-----FRGQLLHRA---PDSAYQR-------------RYFGASTSGSGSSPNPLSSFSPSTVPVPIHSDPLSKMPRKTSTSKRP-QPNDTIGNTSTSGP-FSSGRGSIANQ--DATLTSYLHTQLGRLLLSTKAHVRRLLESFLHQKAASTVHPRLES-VLVELF---N-GMLMSHNALATREHI-SSQEQPMTLRPVLIQSLDHLAALMAHAPSVSSSSTPATLATATPGATRKKAPPHPPSRLSQSIYFSDTDRDTDSDLDLEGVAQGTDATRREATEGTTGRSSGPPRPESELLAIEIETVEMYTSTIEFLSLMTAFVSVVCWLMQARLKRDTSQPTTLGVDIEDVDMEN--RQPTDKDCFDRLHSFLLLFDDHDNVLK---PIQDALHSEEQGGHSTTHPRGAMDDPTRQVGLFAWHFSLFSDDEDGPLQQEVNLIDRQALDSI--H---F------DVILNDLYSTHVLAMTAKEHQKDHGQFY|TPSNVVDFMWRRAIVGRENLLERFVANL----GGAKGQGVQASMAPVESEASL-----VPTALD-PCLGVSTFLSCYVRLLIQKA--RQDHTETIWNSPIASRLLL-AQICENIWGIELDGFAFWMARCGILASLIPLVERVQKLQHQQQQGLQAYQAGRGETT--KLTRLHLFRNDTLQLTVPD-------GVHPDKSWERACILQLRDPQLLRFDFIVTNPPYMIRKTGTFSAPDPEVYDWSILET-GGSPTIITSNVSPSETGSRSKPRRGSISPNPPINEDVVSAAEEEDELEGSDSEATTPDSRSGSPRSSRVKASSSSWPTSSASASMRLGAKGMMQAYGYFIWFAAQRIKPYAGVSCMITASQWLTLEFATKLRA-------WLFENCLMDEFFQF----EPFKVFAKVQTDSLIFKIRSMEPGRTRQDSSIEPSIPLYDRLLEIGAH-RTVFLRHTDHHRPLDGILQDYMDFFAISP-QEQS--SSVNIMVSNKTREELSAVIAAAPQPSSSSTTVTAPTYSFAPMMPSSLLSTFLLSLTQDLPGICSAGTKRVNRL-S---AVEPLLW-HRGPNTNPVYGLV-VRMEYAEVMFGEVMKARWFRPAFYWNGKNSPEVGMMTKALHKEGQFWQG-RDRLRLSKKEGSPAESY---LVPTPG-----------SHR-LYGLCMVDKESVKVLREQMAQGVQGAAALWQYLT--DVRNHFQPGLASKKRKVFLSGKQQMTDDEGVAYCSTNQCGSDVPEKLVHPINYGYFSKTQPRQRFFLDTSSLAVTNQ-CIYLTLNKLSHHYDAAQSPPLIYFLTLLNSSTLQFFVLHHCQYDQQGRMRLFRESMAKIPFQDRDVKSSP---------QRIQYAAQL---GQLMIDLKGT----LYKVVME------WHLTGSSSRTDLGAPRLSEPFIGSVGGNQGLLDWIRRGGDPPTGV-LPKTRDQIWRMLQGHASAPTTRSPSAPSSIA-QLSTSA--PPALPALGAHFHRA-ESLSTQADIDTDTNTGTDTDTDTDDNFESGRRSRFDQEEDFEKPRREYQHPL---QE-PRASGFSP---QQYNQQHSSWLKSSNDPTPS----
      RirG_248540_Rhizophagus_irregularis_DAOM_197198w_595439684                --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MELYSPNI--H---FAILELIESKLLILYAKHFVNVLLKE-QKNHGQFY|TLRKVMRFMWDQIGIK-GN--RKRIEIF-LR-----GTF--------QTSSNWEVFPQALNVLDNSCMKMRSFLCEFVSRLTPAA--HI-------------------EIKDLYWYQKAAEQGFDNAKFN----------------------------------------LGLWYNNEIFIEKDE------AGRF---YWCQKAAEDGDKTAQFNLGI------YYYYGVGIEKDEAKSFYWFQKAAE-SG------------------------------------------------------------------------------------------FKGAQFNLGICYQYGDCIEKDK--IKAFYWYHKAVKNGLKQ----------AQYNLGIYYEN-------GVGIDKNEDKAFYWY---Q--KAAENGVKE-------------AQ-FNLGLCYENGNGIGKDEVKAFYWY-H-RA-------AENGL--KEA-QYNLGTCYKNGDGIEKD--------DVKAFYWYQKAVENGLKEAQLNLEN-CRFNGIGIEK--D---E-VNIFYAHHKPEERSLNAIKWIENALKYEKVKFIPYKEIKNTQPLCKGRFG----------HISKVIWTK-INNYVICKKLINTIDNKNNL---------L--DAFIHELK-INLHLNYSNRIIRCLGISFDQKTSEYLLIMEYANGGDLQSYLKNNFN------Y------------LTWNDKKKLAFQIADGLNYLHNENVIHRDLHSRNIVIHENTAKITDF-GISKNQNDQISI-------AYIDNFGVVAYMEPKCLIDPNFPYTKSSDIYSFGVLMWEISSGYPPFKDNDNI-----VALAISINTDIIYYSMPLF------A--LYL------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      GLOINDRAFT_201382_Rhizophagus_irregularis_DAOM_181602_552925964           ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCG--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      RMATCC62417_18770_Rhizopus_microsporus_727130617                          ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------M-----------ASFAFILPTTSCLDELSRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHRFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRAQSAED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQVAVTNQ-CIYFTIQPSAPW----Q--NYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSARFMHE--LALFVQSVTFTRTWLYTFIRHAHSGQR----LMERVRS----YEWHLDEADK-----------------------------------------------AALNHYDTLDIRPDADLA-SFS-CWQSIQWIDDFVQR-----KR--GDASHCFVVLLKMASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIINTAKSLVD----
      GLOINDRAFT_201358_Rhizophagus_irregularis_DAOM_181602_552925963           ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRLFRENMAKIPYAAPAHSDGVEW--FIRCVQRMILARQMLYEGIRICGDEDKVASTIVEKLRR----GAWQLTGREWQEKESNAYNDETSVILREEQEGNWVMVKQCRGRGDGVEYSLCPGSGGEQVARHETEETTEQTTTFHNFS-SAADAAVLKDHRAPSGNNISKT-HIMKPFFESLLYASACLQYAIDQYTYTLYGINAKFQMALEEELKLELFEAIL-N-KY-PRLNGTAG---NEDGEEKKGKVPEWGERLFE----
      RO3G_16192_Rhizopus_delemar_RA_99-880_384500990                           ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MIRKTGFITQPDPTLYDLQTLGG-RG------------------------------------------------------------------------------------------TQAYIYFMWVALQRIDDQQGQLCLITPSQWTVLEFAQHLRE-------WILANCKLLDMYEF----EPYKVWPKVQTDSLIFRI------CKRTSVLPN-------------TD-YTLYLRNKARNTSLTDILQQYNVF---NP-AES---HDPEL------QYRYGFCTRNYKDL-----------TSFAFILPTTSCLDELNRITGHLPRLCDGEGKKNSWYTD---Q-IPLVW-HRGPNTNPVYALV-VRTSWAKQTFGDKICQLWLKPCFYWNGKSG--SAAKGG--GKEGEFWKS-RDPLRLCKKETSAAEAYWPY------------RLLDSQDS-FYSIIMVNREDADFLKSQVEHD-SSYKAFYSYLR--EARLALQANQNDKD----------------IVYCQYSKSGTDHPVKIVHPINCGYYSRTQPRQRFFVDTTQIAGSLQ-L-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      consensus/100%                                                            .......................................................................................................................................................................................................................................................................................................................................................................................................................................................................................bb..................-sbh..hYs...........QKs+GQaa|...........................................................................................................................................................................................................................................................................................................................................................................................h+............ph.h..hb..........h....ppcp...bh..........p...p....................................a..a.....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
      consensus/95%                                                             .......................................................................................................................................................................................................................................................................................................................................................................................................................................................................................bb..................-sbh..hYs...........QKs+GQaa|...........................................................................................................................................................................................................................................................................................................................................................................................h+............ph.h..hb..........h....ppcp...bh..........p...p....................................a..a.....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
      consensus/90%                                                             ....................................................................................................................s.....ppp......p.............p........LG..h...........................p..l............................pp............................................................................................................................sbhTsahsh...h.......................h.p.............h..ah.lFp.pDph........s............................h.a.................pl..bh..............h..l-olhsbhYs....p....bpQKsHGQaY|.................................................................................................................................................................................................................................h.b.s......s..ha....h................................................................................................buh.a.h.hshbb.c......ChIps..W..bEaspphR........bhh.phph.phabh....p.hpla.chpTDphha+l......p.R.p...p..............p...l.lp...........l..Y..F..................................................bs.h..p..h.s.L.php.ph...Cp.................sl.a.pp.sp..s..ul..lc..h.b..h......bhbps..hhpu+..............p..hWp...s...l.++b.ss.-sb............................h.hspc.h..l..p..........hh.Yhp...hb..hbs......................h..s...b.u..............................................................................................................................................................................................................................................................................................................................................................
      consensus/85%                                                             ....................................................................................................................s.....ppp......p.............p........LG..h...........................p..l............................pp............................................................................................................................sbhTsahsh...h.......................h.p.............h..ah.lFp.pDph........s............................h.a.................pl..bh..............h..l-olhsbhYs....p....bpQKsHGQaY|T.p.VhpFMW.bh.............................................h.phhD.sCh.h.sFlspalpbh...h..p....................pl...ha..chs..sh..s+hs........................................h.La.sspl.l..................ap...h...c.s....h.h......YhhbKsG.hs.PDs.laDbp.Lu...............................................................................................QAY.aFhWhshbRhc...G..ChIosSbW..LEFApphR........Whh.pCcl.phabh....csaKla.+lQTDShIF+l......p.Rpp...p..............p...LaLR..s.p.sL.p.Lp.Y.sF...ps...........h.........bp.s...................shsFhh.o.sh.sbL.phT.pLsplCs.................PLha.p+GPsspPlYuLV.VRs.aAb..hG.b.h.bWh+PsFYWsGK..............Es.FWp...D..Rl.KKE.ssuEua.sh..............h..p...bYuhlhls+-shc.lc.p.sp.......ha.YLp..-hR..hQssbppcc................ls.s..pbsG..............................................................................................................................................................................................................................................................................................................................................................
      consensus/80%                                                             ....................................................................................................................s.ss.pppp......ps..........s.p....ll..LG..L..l........................ps.l............................ps......................................................................................................................ps....sbhTsFhshh..hh...................ph.lpp.............h.pah.lFp.cDphh.......spP..........................h.aa............ph..clp.bh.p............h.sl-oIhubhYopahlp..s.cpQKDHGQaY|TPp.VhpFMWcbhh.......p....................................hsplhD.PChGhuuFLC-alpRh...h..p.......Wsss.....hL..pls.plaGlElDshshphsKhNh.hplhPlhbRh.pL.....................plpRL+LFhNsTLpL...p.........c....aEb..l..L+.s..h.FcallTNPPYMIRKTGhhopPDs.laDbp.LG...............................................................................................QAYhYFhWhslbRhc...GblClIosSQW..LEFAppLR........WhhbpCcLlphabF....EPaKlW.+lQTDSLIFRl......pbRsp.h.p..............p...LaLR..s+p.sL.c.Lp.YpsF...pP..p.....p..h........bac.s......p............SFsFlhPosuh.sbLpphT.pLsplCc................sPLla.pRGPNTpPVYuLV.VRT.WAbppFG.bshpbWh+PsFYWsGK............sbEu.FWp...D.hRL.KKEsSsAEAYhPa..............h..p...bYShlhls+-shcblc.p.sp.......hapYLp..-AR..LQssbppcc................ls.C..s+sG......l.a....shbp+...p.phhlcppph..o.b.........................................................................................................................................................................................................................................................................................................................
      consensus/75%                                                             ....................................................................................................................s.ss.pppp.p....ps..........s.cb...lV.hLG..L.pl........................ps.lb.pb....s.s...s.shL.pb....s.ps......................................................................................................................ps....sbhTuFhsVh..hh...................cl.lpp.............hcsFh.lFpscDshL.......spP..........................F.Waa...........ph..cLp.bh.p.s..........h.sI-oIhSbhYTpaFLp..s.cpQKDHGQaY|TPp.VhpFMWcbhh.......p....................................hsplhD.PChGhuuFLC-alpRh...h..p.......Wsss.....hL..pls.plaGlElDshshphsKhNh.hplhPlhbRh.pL.....................plpRL+LFhNsTLpL...p.........c....aEb..l..L+.s..h.FcallTNPPYMIRKTGhhopPDs.laDbp.LG...............................................................................................QAYhYFhWhslbRhc...GblClIosSQW..LEFAppLR........WhhbpCcLlphabF....EPaKlW.+lQTDSLIFRl......pbRsp.h.p..............p...LaLR..s+p.sL.c.Lp.YpsF...pP..p.....p.ph........ba+ho.....sp............SFsFlhPosuh.sbLpplT.cLsplCD..s.............sPLlW.pRGPNTNPVYuLV.VRT.WAbppFG.csCpbWL+PsFYWNGKs...........GKEubFWpp..D.hRL.KKEsSsAEAYhPa..............h..p.s.hYShIhls+-sh-bl+.p.spp....u.hapYLp..-AR..LQssbspcc................lsaC..sKsG....sKllHPhN.GYho+sbPR.RFFlDppphslTsQ.s................................................b.a...h..ls....s..p.............h..s..h.................l..............................................................................................................................................................................................................
      consensus/70%                                                             ....................................................................................................................s.ss.pppp.p....ps..........s.cb...lV.hLG..L.pl........................ps.lb.pb....s.s...s.shL.pb....s.ps......................................................................................................................ps....sbhTuFhsVh..hh...................cl.lpp.............hcsFh.lFpscDshL.......spP..........................F.Waa...........ph..cLp.bh.p.s..........h.sI-oIhSbhYTpaFLp..s.cpQKDHGQaY|TPp.VhpFMW-+hh.......p.bl..h..............................hsplhD.PChGlGoFLCEalpRhlb.h..p.......Wsss..l.plL.psls.plaGlElDshsapLsKhNh.lHlhPlYbRh.pL....s................plpRL+LFCNDTLpL...p.........c...saEbp.L.bLRps.bLpFcalVTNPPYMIRKTGhlopPDs.lYDbp.LG...............................................................................................QAYhYFhWhslQRhc.p.GblCLIosSQWhsLEFAppLR........WhhbpC+Ll-habF....EPaKVW.KlQTDSLIFRl......pbRsp.l.p..............p...LaLR..s+pssL.-.Lp.YpsF...pP..ps....p.ph........ba+ho.s...sp............SFsFlhPosuh.sbLpplT.HLsplCD..s.p.........p.sPLlW.HRGPNTNPVYuLV.VRTsWAbppFG.csCpbWL+PsFYWNGKs...........GKEu-FWpp.bD.lRL.KKEsSsAEAYhPa..............h..c.s.hYSlIhls+-shDbL+.p.sps....u.hapYL+..-AR..LQssbsp+-................lsaC..sKsG..hssKIlHPINhGYao+sQPR.RFFlDppphsVTsQ.C..bs..............s....hslls....b.h....h.Ysbp.chbhF.c.hh.lPhss.s..p..p......hsp.h.hsp.hl...h............lhp.l........Wplp..p..................................................l............p...shu....p.........................h......ss..p.s.Dp...........hp...-.-b.....p..h.......ph....................p.sp........
      
      Back to Contents

    • General notes, phyletic distribution and domain architectures of the Group2-Clade6/Rhizophagus-RirG_033390-likeN6-MTase

      General notes:

      Another MTase derived from prokaryotic R-M systems. This is a bit more widespread and is present across various fungi and rhizaria.
      GI/Gene label     Domain architecture                                                                                                             Pfam architecture                                                                       Gene name                     Len   Taxonomy                             Species name                                    Genbank/other annotation
      Uram1000007539    N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         Uram1000007539                1090  eukaryota>fungi>mucoromycotina       Umbelopsis ramanniana                           fgenesh1_kg.52_#_84_#_combest_scaffold_52_106915
      Uram1000001377    Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S                                   Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1                             Uram1000001377                655   eukaryota>fungi>mucoromycotina       Umbelopsis ramanniana                           fgenesh1_kg.5_#_213_#_combest_scaffold_5_101108
      Bcir1000010321    Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+N6-MTase+TaqIC/RAGNYA/Methylase_S                               Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1                                  Bcir1000010321                690   eukaryota>fungi>mucoromycotina       Backusella circina                              e_gw1.291.36.1
      Bnat1000001029    Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S          Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1    Bnat1000001029                678   eukaryota>rhizaria                   Bigelowiella natans                             estExt_fgenesh1_pg.C_320094
      Bnat1000018648    Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S                                             Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1                                       Bnat1000018648                394   eukaryota>rhizaria                   Bigelowiella natans                             e_gw1.71.34.1
      Bcir1000008046    Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S                              Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1                        Bcir1000008046                633   eukaryota>fungi>mucoromycotina       Backusella circina                              estExt_Genewise1.C_420006
      384490648         Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S                                             Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1                                       RO3G_06575                    484   eukaryota>fungi                      Rhizopus delemar RA 99-880                      hypothetical protein RO3G_06575 [Rhizopus delemar RA 99-880].
      595439684         N6-MTase[]+N6-MTase[Sel1+Sel1+Sel1+Sel1+Sel1+Kinase+N6-MTase+TaqIC/RAGNYA/Methylase_S                                           Sel1+Sel1+Sel1+Sel1+Sel1+Pkinase                                                        RirG_248540                   637   eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 197198w            Mkk2p [Rhizophagus irregularis DAOM 197198w].
      384491727         Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S                                        Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1                                  RO3G_07628                    644   eukaryota>fungi                      Rhizopus delemar RA 99-880                      hypothetical protein RO3G_07628 [Rhizopus delemar RA 99-880].
      595493245         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         RirG_033390                   886   eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 197198w            hypothetical protein RirG_033390 [Rhizophagus irregularis DAOM 197198w].
      727145438         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         RMATCC62417_07972             936   eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMATCC62417_07972 [Rhizopus microsporus].
      729705342         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         RMCBS344292_12151             934   eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMCBS344292_12151 [Rhizopus microsporus].
      595493244         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         RirG_033390                   925   eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 197198w            hypothetical protein RirG_033390 [Rhizophagus irregularis DAOM 197198w].
      758364669         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                              .N6_Mtase+Eco57I                                                                         
      511007562         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         HMPREF1544_04357              962   eukaryota>fungi                      Mucor circinelloides f. circinelloides 1006PhL  hypothetical protein HMPREF1544_04357 [Mucor circinelloides f. circinelloides 1006PhL].
      758346042         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         MAM1_0525c10839               1074  eukaryota>fungi                      Mucor ambiguus                                  hypothetical protein MAM1_0525c10839 [Mucor ambiguus].
      Ccor1000001613    N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         Ccor1000001613                1177  eukaryota>fungi>entomophthoromycota  Conidiobolus coronatus                          gm1.1775_g
      595493243         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         RirG_033390                   1238  eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 197198w            hypothetical protein RirG_033390 [Rhizophagus irregularis DAOM 197198w].
      729710234         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         RMCBS344292_07627             936   eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMCBS344292_07627 [Rhizopus microsporus].
      384500990         N6-MTase-frag                                                                                                                   -                                                                                       RO3G_16192                    383   eukaryota>fungi                      Rhizopus delemar RA 99-880                      hypothetical protein RO3G_16192 [Rhizopus delemar RA 99-880].
      Pbla1000013531    N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase                                                                                Pbla1000013531                1204  eukaryota>fungi>basal                Phycomyces blakesleeanus                        estExt_fgeneshPB_pg.C_140177
      661187564         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase                                                                                LCOR_02075.1                  946   eukaryota>fungi                      Lichtheimia corymbifera JMRC:FSU:9682           hypothetical protein RO3G_16192 [Lichtheimia corymbifera JMRC:FSU:9682].
      672825191         N6-MTase+TaqIC/RAGNYA/Methylase_S                                                                                               -                                                                                       MVEG_04886                    2159  eukaryota>fungi                      Mortierella verticillata NRRL 6337              hypothetical protein MVEG_04886 [Mortierella verticillata NRRL 6337].
      552925964         N6-MTase+TaqIC/RAGNYA/Methylase_S(frag)                                                                                         -                                                                                       GLOINDRAFT_201382             310   eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 181602             hypothetical protein GLOINDRAFT_201382 [Rhizophagus irregularis DAOM 181602].
      Mver1000004892    N6-MTase                                                                                                                        -                                                                                       Mver1000004892                2160  eukaryota>fungi>zygomycete           Mortierella verticillata                         Mortierella verticillata NRRL 6337 hypothetical protein (2160 aa)
      Lhya1000010458    N6-MTase                                                                                                                        -                                                                                       Lhya1000010458                392   eukaryota>fungi>mucoromycotina       Lichtheimia hyalospora                          estExt_fgenesh1_pm.C_2000009
      727130617         N6-MTase(frag)+TaqIC/RAGNYA/Methylase_S                                                                                         -                                                                                       RMATCC62417_18770             448   eukaryota>fungi                      Rhizopus microsporus                            hypothetical protein RMATCC62417_18770 [Rhizopus microsporus].
      Bcir1000016958    N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase                                                                                Bcir1000016958                329   eukaryota>fungi>mucoromycotina       Backusella circina                              MIX930_10_37
      552925963         TaqIC                                                                                                                           -                                                                                       GLOINDRAFT_201358             274   eukaryota>fungi>glomeromycota        Rhizophagus irregularis DAOM 181602             hypothetical protein GLOINDRAFT_201358 [Rhizophagus irregularis DAOM 181602].
      Bcir1000008318    N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+Eco57I                                                                         Bcir1000008318                992   eukaryota>fungi>mucoromycotina       Backusella circina                              estExt_fgenesh1_pg.C_1200041
      671695680         N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                               N6_Mtase+UPF0020+Eco57I                                                                 LRAMOSA03076                  938   eukaryota>fungi                      Absidia idahoensis var. thermophila             hypothetical protein LRAMOSA03076 [Absidia idahoensis var. thermophila].
      
      ---- Prokaryotic homologs----
      
      # 1;                                                                                                                                                                                                                                         
      658523148        N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                                N6_Mtase+TaqI_C                                         -                866   bacteria                                    Atribacteria bacterium SCGC AAA255-G05                         hypothetical protein, partial [Atribacteria bacterium SCGC AAA255-G05].
      489091935        N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                                N6_Mtase+Methyltransf_26+Eco57I+TaqI_C                  -                1254  bacteria>spirochaetes                       Leptospira weilii                                              N-6 DNA methylase [Leptospira weilii].
      740186027        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N_2+N6_Mtase+Eco57I                                -                1215  bacteria>deinococci                         Thermus sp. NMX2.A1                                            hypothetical protein [Thermus sp. NMX2.A1].
      495592567        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N_2+N6_Mtase+Eco57I                                -                1166  archaea>euryarchaeota                       Haloferax mucosum                                              type i restriction-modification system methyltransferase subunit [Haloferax mucosum].
      489139504        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N_2+N6_Mtase+Eco57I                                -                1214  bacteria>deinococci                         Thermus aquaticus                                              N-6 DNA methylase [Thermus aquaticus].
      495849928        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N_2+N6_Mtase+Eco57I+TaqI_C                         -                1167  archaea>euryarchaeota                       Haloferax                                                      MULTISPECIES: type i restriction-modification system methyltransferase subunit [Haloferax].
      647643842        N6_Mtase                                                                                                                         N6_Mtase+Eco57I                                         -                1198  bacteria>proteobacteria>betaproteobacteria  Herminiimonas sp. CN                                           hypothetical protein [Herminiimonas sp. CN].
      652390117        N6_Mtase                                                                                                                         N6_Mtase+Eco57I                                         -                1284  bacteria>cyanobacteria                      Planktothrix rubescens                                         hypothetical protein [Planktothrix rubescens].
      491099787        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                           HSDR_N_2+N6_Mtase+Eco57I+TaqI_C+DUF4337                 -                1164  archaea>euryarchaeota                       Haloarcula sinaiiensis                                         type i restriction-modification system methyltransferase subunit [Haloarcula sinaiiensis].
      754792650        N6_Mtase                                                                                                                         N6_Mtase+Eco57I                                         -                1284  bacteria>cyanobacteria                      Planktothrix agardhii                                          hypothetical protein [Planktothrix agardhii].
      654626038        N6_Mtase                                                                                                                         N6_Mtase+Eco57I                                         -                833   bacteria>cyanobacteria                      Dolichospermum circinale                                       hypothetical protein [Dolichospermum circinale].
      818764894        HSDR_N+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N+N6_Mtase+Eco57I+TaqI_C                           UX10_C0029G0010  906   bacteria                                    Parcubacteria (Magasanikbacteria) bacterium GW2011_GWA2_45_39  hypothetical protein UX10_C0029G0010 [Parcubacteria (Magasanikbacteria) bacterium GW2011_GWA2_45_39].
      808798668        N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                                N6_Mtase+Eco57I+TaqI_C                                  -                1275  bacteria>cyanobacteria                      Limnoraphis robusta                                            hypothetical protein [Limnoraphis robusta].
      493714293        REase+N6_Mtase++TaqIC/RAGNYA/Methylase_S                                                                                         HSDR_N_2+N6_Mtase+Methyltransf_26+Eco57I+TaqI_C         -                1171  archaea>euryarchaeota                       Natrialba aegyptia                                             type i restriction-modification system methyltransferase subunit [Natrialba aegyptia].
      493035869        N6_Mtase+Eco57I                                                                                                                  N6_Mtase+Eco57I                                         -                664   bacteria>cyanobacteria                      Coleofasciculus chthonoplastes                                 N-6 DNA methylase [Coleofasciculus chthonoplastes].
      755639426        N6_Mtase                                                                                                                         N6_Mtase                                                -                765   bacteria>actinobacteria                     Leucobacter komagatae                                          hypothetical protein [Leucobacter komagatae].
      568633968        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N_2+APG6+N6_Mtase+Methyltransf_26+Eco57I+TaqI_C    BN903_17         1254  archaea>euryarchaeota                       Halorubrum sp. AJ67                                            uncharacterized protein domain protein [Halorubrum sp. AJ67].
      851124860        REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                          HSDR_N_2+APG6+N6_Mtase+Methyltransf_26+Eco57I+TaqI_C    -                1209  archaea>euryarchaeota                       Halorubrum sp. AJ67                                            hypothetical protein [Halorubrum sp. AJ67].
      428252835        N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                                N6_Mtase+Eco57I+TaqI_C                                  Mic7113_3025     1277  bacteria>cyanobacteria                      Microcoleus sp. PCC 7113                                       type I restriction-modification system methyltransferase subunit [Microcoleus sp. PCC 7113].
      754157697        N6_Mtase+TaqIC/RAGNYA/Methylase_S                                                                                                N6_Mtase+Eco57I+TaqI_C                                  -                1280  bacteria>cyanobacteria                      Microcoleus sp. PCC 7113                                       hypothetical protein [Microcoleus sp. PCC 7113].
      
      
      Back to Contents



    • Multiple sequence alignment of the Group3/Trichomonas-like N6-MTases

                                                                                                                                      Str-1                 Str-2                                                                                                                            Str-3                         Str-4                                         Str-5                                    Str-6                   Str-7               
      Conserved residues                                                                          G K                                    D                         D     R     <------------ HTH------------------------------------D------------------>                                 Y                                                                                      +S                                                      Y D                   
      FINAL                                                                             -------------HHHHHHHHHHHH-----------------E---EEEEEE---HHHHHHHHH-----EEEEE----HHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHH---------EEEEEHHHHHHH--HH--H---HHHHHHHH-H--HHHH-------------------EEEEEHHHHHHH----HH-----------EEEEE----------HH--HH-----HHHHHHHHHHHHH--------EEEEE----HHHHHHHHHHH---------------------EEEEEEEE-EE-----E------EEEEEEEE-------------
      ALIGN                                                                             -------------HHHHHHHHHEEE-E-------------------EEEEE----HHHHHHHH------EEEE-----HHHHHHH---HHHHHHHHHHHH-H-------------------HHEEEEEEH-----------EEEHHHHHHH---HH--H---HHHHHHHH-H--HHHHHH-----------------HEEEEHHHHHHH-----------------EEEEE-----------H--HH-----HHHHHHHHHHHHH--------EEEEE---HHHHHHHHHHHH---------HHH---------HHHHEHHH-H------------HHHHHEHHH-------------
      HMM                                                                               --------HHHHHHHHHHHHHHHHH-H---------------E---EEEEEE---HHHHHHHHHH----EEEEE--HHHHHHHHHHHHHHHHHHHHHHHE-E----------------HHHHHHHHHHHHH-------EEEEEEEHHHHHHHHHHH--H---HHHHHHHH-H--H-HEEE----HHHHHH---HHHHEEEEHHHHHHHH----HHHH---------EEEEEE--EEEEE--HH--HH-----EE-HHHHHHHEHHH-------EEEEE----HHHHHHHHHHH----------EE---------EEEEEEEE-EE-----EEE----EEEEEEEE---E---------
      FREQ                                                                              -------------HHHHHHHHHHEE-----------------E---EEEEEE----HHHHHHHH-----EEEE-----HHHHHHH---HHHHHHHHHHHH--------------------HHHHHHHHHHH-----------EEEHHHHHH---HH--H---HHHHHHHH-H----E--------------------EEEEEH-HHH-------------------EEEE-----------HH--HH-----HHHHHHHHHHHHH--------EEEEE----EEEEEHHHHH-------------------------EEEEE-E------------HHH-HEEE--------------
      PSSM                                                                              -------------HHHHHHHHHHHH---------------------EEEEE----HHHHHHHHH----EEEEE------HHHHH--HHHHHHHHHHHHHH------------------HHHHHHHHHHHHH------------HHHHHHH-------------------------------------HHH---H---HEEEHHHHHHHH----HHH----------EEEEE------------------------HHHHHHHHHH--------EEEEE-----HHHHHHHHHH---------------------EEEEEEEE-EE------------EEEEEEE--------------
      TVAG_007390_Trichomonas_vaginalis_G3_123421258                                    FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLYISHLLHIMFPKATIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KPDEKI-PTDQKEIVREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLDTLMK-K--VLYLRYTNLFDENIDA---YLEGLTIVHKDWRILF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKDTVETFDVM-Y-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE
      TVAG_056220_Trichomonas_vaginalis_G3_123471301                                    FT-K-PPLPFFGNKSRCKDILLREL-KK----L------PNGL---TFVDLFGGSFYISHLCHTVFPDSKIICNDFDNYMNRLKHIPDTNKILKELKEKI-P-----I--GKMERI-PLDKKNTVREVLKK----A-EYIDWDSISSRLLYSGAIR--V---HDIETLMS-K--VLYLNYTKVFKENIEK---YTEGIEFVRCHWTELY----EKYKNKEN-----VFFIVDPPFYNTWDFQY--QV-----DWTLRDSLETLDVLHN-HP--CFYFTSDKSGLETVMRWLEDI-----HDYDFR---------YDKIEYER-G----------------------------------
      TVAG_557140_Trichomonas_vaginalis_G3_123207322                                    FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLYISHLLHIMFPKATIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KLDEKI-PTDQKEIVREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLDTLMK-K--VLYLRYTNLFDENIDA---YLEGLTIVHKDWRILF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKDTVETFDVL-N-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE
      TVAG_271330_Trichomonas_vaginalis_G3_123479010                                    FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLFISHLLHTLFPKATIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KLDEKI-PTDQKEIIREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLVTLMK-K--VLYLRYTHLFDENIDD---YLEGLTIVHKDWRILF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKDTVETFDVL-N-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE
      TVAG_344370_Trichomonas_vaginalis_G3_123481438                                    FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLFISHLLHTLFPKSTIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KPDEKI-PTDQKEIVREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLDTLMK-K--VLYLRYTNLFDENIDA---YLEGLTIVHKDWRIIF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKYIVETFDVL-N-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE
      TVAG_039120_Trichomonas_vaginalis_G3_123484516                                    FT-K-PPLPFFGNKSRCKDILLREL-KK----L------PNGL---TFVDLFGGSFYISHLCHTVFPDSKIICNDFDNYMDRLKHIPDTNKILKELKEKI-P-----I--GKMERI-PLDNKNIVREVLKK----A-EYIDWDSISARLLYSGAIR--V---HDIETLMS-K--VLYLNYTNVFKEDIEK---YIEGIEFVRCHWTDLY----EKYKDKEN-----VFFIVDPPFYNTWDFQY--QV-----DWTLRDSLETLDVLHN-HP--CFYFTSDKSGLETVMRWL-------------------------------------------------------------------
      TVAG_051460_Trichomonas_vaginalis_G3_123976294                                    FT-K-PPLPFFGNKSRCKDILLREL-KK----L------PNGL---TFVDLFGGSFYISHLCHTVFPDSKIICNDFDNYMNRLKHIPDTNKILKELKEKI-P-----I--GKMERI-PLDKKNTVREVLKK----A-EYIDWDSISSRLLYSGSIR--V---HDIETLMS-N--VLYLNYTKVFKEDIEK---YTEGIEFVRCRWTELY----EKYKNKEN-----VFFIVDPPFYNTYDFQY--QV-----DWTLRDSLETLDVLHN-HP--CFYFTSDK-----------------------------------------------------------------------------
      BN1088_RS06390_Sphingobacterium_sp_PM2-P1-29_786219197                            YT-S-SPLPFMGQKRRFLKKFKEVL-IN----N------KPDA---IYVDLFGGSGLLSHIVKQYYPKATVVYNDYDNFSERLLHIDQTNELLASIRYLI-K--D--L--PNDKAI-PVDRRQPVIDCIYA-HEKRYGYVDYVSISSNLLFAMNYA------KDMDQLSR-Q--VFYKTIRESSY-NADG---YLEGVERVSMDYKSLF----ERYKDQSN-----VVFLVDPPYLSTETSVY--KSS----HWKLSDYLDVLDVL-K-VPH-YYYFTSNKSQIVELCEWLGSK--VP-GANPFR---N-----TVLYSNVS------S-VNYSSKYTDIMIVK--------------
      M573_RS10255_Prevotella_intermedia_771514766                                      FN-S-APLPFQGQKRKFAKEFAKVL-HQ----Y------PDDT---VFVDLFGGSGLLSHITKHQKPNATVVYNDFDNYRQRLAHISQTNELLATIREIL-K--D--V--PRGKMV-AGGERQLVIDAIKR-HEKCYGYVDYITLSSSIMFSMKYC------TNIDDLEK-Q--GIYNRVRRGDFATCDG---YLDDLTVVSVDYKQLV----EQYKDVPN-----VVFIIDPPYLSTDTASY--NM-----NWQLSDYLDVLLVL-F-KHS-FIYFTSNKSSIIELCEWIARN--SG-MNNPFE---Q-----CNKVEVDT------S-MNYNSTYTDIMLYTT-------I-----
      JCM6334_RS11905_Prevotella_disiens_545432296                                      FN-S-APLPFQGQKRKFAKEFAKVL-QQ----Y------PDDT---MFVDLFGGSGLLSHITKRQKPNATVVYNDFDNYRQRLAHISQTNELLAAIREIL-K--D--V--PRGKMV-AGEERQLVIDAIKR-HEKFYGYVDYITLSSSIMFSMKYC------TNIDDFEK-Q--GIYNRVRRGDFATCDG---YLDDLTVVSVDYKQLV----EQYKDVPN-----VVFIIDPPYLSTDTASY--SM-----NWQLSDYLDVLLVL-F-NHS-FVYFTSNKSSIIELCEWIARN--SG-MNNPFE---R-----CNKVEINT------S-MNYNSAYTDIMLYTT-------I-----
      K941_RS0107590_Moraxella_caprae_656071893                                         HK-T-APLPFTGQKRMFLRHFEKILKDN----I------PNDGEGFTVLDVFGGSGLLAHNAKRILPKATVIYNDFDGYVERLAHIPTTNRLRQELFEIL-K--G--E--PRSVKL-SSTAKAKVLGHLRK-SADNGTFVDVQTLAGWLLFSGRQV------GSLDEFLA-E-STFYNRIVKTDYPNADG---YLDGLILECLDFEKLL----QKYQDTPN-----CLLLLDPPYLCTAQGAY--AKH---GYFGMTKFLRLMQ-F-V-RPP-FIFFSSTKSELMDYMAYVQRY-----EPNTWQRVGD-----FTHIKVNS------S-INAKVSYEDNILAK--------------
      AJF4211_000450_Avibacterium_paragallinarum_JF4211_523674289                       -------MPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVLKF-------------
      AJF4211_000170_Avibacterium_paragallinarum_JF4211_523673311                       -------MPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      HSM_RS03540_Histophilus_somni_501302336                                           FS-Q-APLPFVGQKRLFLNAFKQVL-ND-NI-Q------NDGE-GWTIVDAFGGSGLLSHVAKRIKPKARVIYNDFDGYADRLKHISDTNRLRAELIQIV-G-DI--V--PKNKRL-DDNKKQEIINKIND-FN---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQFL----PKFADNPK-----ALFVLDPPYLCTKQNSY--KMA-N--YFDLVDFLQLIDLT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      AJF4211_RS08790_Avibacterium_paragallinarum_545595880                             YK-Q-APLPFVGQKRLFLNAFKQVL-ND-NI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      AJF4211_RS06820_Avibacterium_paragallinarum_545595679                             YK-Q-APLPFVGQKRLFLNAFKQVL-ND-NI-P------NDGE-GWTIIDTFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FN---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      AJF4211_RS12815_Avibacterium_paragallinarum_545595274                             YK-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      AJF4211_RS06190_Avibacterium_paragallinarum_737726850                             FN-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVLKF-------------
      Z012_RS09750_Avibacterium_paragallinarum_805420685                                FS-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-P------DDGE-GWMIVDAFGGSGLLSHVAKRIKPKARVIYNDFDGYADRLKHISDTNRLRAELIQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDPPYLCTKQNSY--KMA-T--YFDLVDFLQLIDLT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      AJF4211_RS10020_Avibacterium_paragallinarum_737726745                             FN-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      JP34_RS00795_Gallibacterium_anatis_746011652                                      FS-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-P------DDGE-GWTIVDAFGGSGLLSHVAKRIKPKARVIYNDFDGYADRLKHISDTNRLRAELIQIV-D-DI--V--PKNKRL-DNNKNQEIINKING-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKQADYPSAED---YLDGLEIVSEPFQTLL----PKFADNPK-----ALFVLDPPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF-------------
      IE01_RS08000_Gallibacterium_anatis_517157783                                      YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NEGE-GWTIVDVFGGSGLLARNAKDICPKSTVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPGARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFE-K--SLWNNLTKRDYPVADD---YLDGLNIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQTAY--KKE-T--FFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYIDFLIKYK-ER-NYQHFV---D-----VVEQKINV------T-VNCNVNYQDNMIYKF-------------
      UMN179_RS12515_Gallibacterium_anatis_503512750                                    YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLARNAKDICPKARVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEVLFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQTAY--KKE-T--FFDLIDFLELMRLI-R-PP--FIMFSSAKSEFNRYIDFLIKHK-EK-NYQHFV---D-----AVEQKINV------R-VNHNVNYQDNMVYKF-------------
      P375_RS07850_Gallibacterium_genomosp_2_746067969                                  YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLAGNAKDICSKARVIFNDYDNYAERLANIKQTNQLRQQLAYCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLNGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQSTY--KKE-T--YFDLIDFLELMRLI-R-PP--FIMFSSAKSEFNRYIDFLIKHK-EK-NYRHFV---D-----AVEQKINV------R-VNHNVNYQDNMVYKF-------------
      JP33_RS07160_Gallibacterium_anatis_746094831                                      YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDDE-GWTIIDVFGGSGLLARNAKDICPKAQVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQTAY--KKE-T--FFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYINFLIKYK-ER-NYQHFV---D-----AVQQKINV------T-VNCNVNYQDNIVYKF-------------
      IO46_RS12295_Gallibacterium_anatis_746089913                                      YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLARNAKDICSKAQVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQSAY--KKE-S--YFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYIDFLIRYK-ER-NYQHFV---D-----AVEQKINV------T-VNCNVNYQDNIVYKF-------------
      JP28_RS09245_Gallibacterium_anatis_746010293                                      YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLARNAKDICSKAQVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPGTRL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQSAY--KKE-S--YFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYIDFLIRYK-ER-NYQHFV---D-----AVEQKINV------T-VNCNVNYQDNIVYKF-------------
      ERS450003_01064_Haemophilus_influenzae_777210024                                  FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GNGE-GWTIIDTFGGSGLLSHTAKRLKPKARIIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--PKNKRM-TKDCKAECIKIIQN-FK---GYKDLNCLASWLLFSGQQV------ATFDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSDDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      NTHI723_RS04270_Haemophilus_influenzae_764389671                                  FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GEGE-GWTIIDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--SKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSDDPK-----ALFVLDPPYLCTKQESY--KQA-K--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AERITVNA------K-LNYQVAYEDNLVYKF-------------
      HMPREF9095_RS06800_Haemophilus_491953443                                          FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWIIIDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINRLRAELYSVV-G-NA--T--SKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSNDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      C645_RS00620_Haemophilus_influenzae_803453319                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKRLKPKARIIYNDFDGYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRQSDYPKADG---YLDGVEIVRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      C645_RS06690_Haemophilus_influenzae_803453531                                     FK-Q-APLPFIGQKRMFLKQFETIL-ND-NI-P------DDGE-GWTIIDTFGGSGLLSHTAKRLKPKAHVIYNDFDGYAERLVHIDDTNALRAQIFAKI-G-NT--T--PKNKRL-PKSLKAEIIQIIDE-FQ---GYKDLNCLASWLLFSGQQV------GSLEELYR-K--DFWHCVRLSDYPSADG---YLDGVEVIRESFHALL----PKFVDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      HIBPF_RS02220_Haemophilus_influenzae_503290984                                    FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVIKPKAHVIYNDFDSYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFSDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-L-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      SU58_RS08535_Haemophilus_influenzae_756163264                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVMKPKAHVIYNDFDGYAERLVHIDDTNALRAQIFAKI-G-NT--T--PKNKRL-PKSLKAEIIQIIDE-FQ---GYKDLNCLASWLLFSGQQV------GSLEELYR-K--DFWHCVRLSDYPGAEG---YLDGVEIVKESFHTLL----PKFSNDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      CK45_RS04150_Haemophilus_influenzae_696244941                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKRLKPKASVIYNDFDGYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCIRQSDYPKADG---YLDGVEIVRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      JP32_RS09880_Gallibacterium_anatis_746007528                                      FA-Q-APLPFVGQKRMFLQQFRAVL-NQ-LI-T------NDGE-GWTIVDAFGGSGLLSHTAKRLKPNAHVIYNDFDGYAERLKHIDDINRLRQILSELL-A--N--Y--PRDRRL-DIAMRHKVIDAIES-FN---GYKDPHILCAWLLFSGQQV------KSINELYS-R--GFYNCIRQSDYTTADG---YLDGIEVVNESFVTLL----PKFADDSK-----AIFVLDPPYLCTKQASY--KQE-R--YFDLIDFLELIRLT-R-PP--YLFFSSTKSEFIRFVDWLIASK-GD-NWQSFV---D-----YQRIVIQT------S-ASHNGQYEDNLIYNF-------------
      HMPREF1199_RS02420_Prevotella_oralis_565956523                                    YL-S-APLPFVGQKRMFAKEFIKVL-DR----F------SDGT---TFVDLFGGSGLLSHITKCRKPNSTVVYNDFDNYRKRIENIPVTNALLSDLRKIV-R--D--I--PRKKRI-TGETREKVLACLKR-YLQQYGYVDFITISSSILFSMKYV------TIFEDVGK-E--TLYNNIRMNDYPPCSD---YLDGLQVVCCDYKNLY----DKYKDMPK-----VVFLIDPPYLNTEVGTY--NM-----YWKLPDYLDVLKIL-Q-GCS-FVYFTSDKSSIIELCEWMEKN--KE-TGNPFK---N-----CSIASVNA------H-VNHNAGYTDMMLYTV-------------
      HMPREF1475_RS02705_Prevotella_oralis_738993231                                    YN-S-APLPFVGQKRMFAKKFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHIAKWQKPNSKVVYNDFDGYRLRLEHIPQTNELLAEIREIV-R--N--V--PRHKAI-AGETRNYIFESLLR-HQERYGYLDFITVSSSVMFSMKYQ------LSINDMRK-E--TLYNNVRTTDYPICDD---YLEGLTITSVDYKQLF----NQYKDRPD-----VVFLVDPPYLSTDVGTY--KM-----YWRLSDYLDVLEVL-T-GHS-FVYFTSNKSSILELCDWIGRN--KN-MDNPFK---K-----CTKVEVNA------H-MNYNATYTDIMLYTN-------V-----
      HMPREF0663_11914_Prevotella_oralis_ATCC_33269_323094424                           YN-S-APLPFVGQKRMFAKKFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHIAKWQKPNSKVVYNDFDGYRLRLEHIPQTNELLAEIREIV-R--N--V--PRHKAI-AGETRNYIFESLLR-HQERYGYLDFITVSSSVMFSMKYQ------LSINDMRK-E--TLYNNVRTTDYPICDD---YLEGLTITSVDYKQLF----NQYKDRPD-----VVFLVDPPYLSTDVGTY--KM-----YWRLSDYLDVLEVL-T-GHS-FVYFTSNKSSILELCDWIGRN--KN-MDNPFK---K-----CTKVEVNA------H-MNYNATYTDIMLYTN-------V-----
      HMPREF0645_RS12560_Prevotella_bergensis_494312007                                 YN-S-APLPFVGQKRMFAKEFRKVL-EQ----F------PDGT---TFVDLFGGSGLLSHITKCEKPHSKVVYNDFDGYRLRLEHIPQTNELLAKLREIV-R--K--I--PKHKPI-TGEAREQVFECLRE-HQECYGYLDFITISSSIMFSMKYR------LSIDEMRK-E--ALYNNVRSTDYPLCCD---YLDGLTIVSSDYKQVF----NLYKNTPG-----VVFFVDPPYLSTEVGTY--KM-----YWRLADYLDVLTVL-A-GHS-FVYFTSNKSSILELCDWVGRN--KT-VGNPFE---K-----CTKVEFNA------H-MNYNATYTDMMLYKK-------A-----
      L888_RS0101115_Hallella_seregens_654481515                                        YL-S-APLPFVGQKRMFAKEFRKVL-DQ----I------PDGT---TFVDLFGGSGLLSHIAKYDKPHSEVVYNDFDGYRRRLEHIPQTNELLAELRDIV-R--D--V--PRYKAI-TGETREHVFGCLLQ-HEKRYGYIDFITVSSSIMFSAKYC------LSIDDMRK-E--ALYNKVRSSDYSECPD---YLDGLTIVSKDYKQLF----KEYRDKPD-----VVFLVDPPYLGTEVGTY--KM-----FWKLADYLDVLKVL-Q-GHA-YIYFTSNKSSIIELCEWLGQN--RD-MGNPFE---H-----STRVEFKA------Q-MNYNASYTDMMLYKN-------A-----
      M082_RS01650_Bacteroides_696270804                                                YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRQRLANIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITISASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITGEDYKEVF----KRYKDAPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRRVEFSA------N-VNYQAKYTDMMLYTK-------P-----
      M137_RS11600_Bacteroides_494836361                                                NL-S-APLPFVGQKRMFAKEFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKRSKPDATVVYNDFDNYRFRLKNIPQTNKLLADIRELV-G--NS-I--PKHKPI-KGELRERIFKRIEE-EELNVGYVDFITLSSSLMFSMKYK------LSVAEMRK-E--VLYNNIRKTGYPESSD---YLKGLEIVSCDYKAVF----NQYKDVPG-----VVFLIDPPYLSTDVGTY--NM-----YWRLSDYLDVLKIL-E-KHS-FVYFTSNKSSILELCEWIGAN--KT-IGNPFE---G-----CTKKEFNA------H-MNYSAEYTDMMLYKK-------Q-----
      BN456_01886_Prevotella_sp_CAG:1031_512184299                                      YL-S-APLPFVGQKRMFAKQFIEVI-RQ----Y------PADT---VFVDLFGGSGLLSHITKHFHPESRVIYNDFDNYRLRINNIPRTNSLLESIRPIA-S--Q--F--DRHKPI-TGGAREQIFSLLEQ-EEKETCFLDFITLSSSLMFSMKYK------MSIEGMRG-E--TLYNNVRKNGYEPCRD---YLAGLEIVSCDYRELF----EQYKDTPG-----VVFFVDPPYLSTDVGTY--RM-----YWRLADYLDVLSVL-P-GHN-FIYFTSEKSCIIELCEWMGRH--PS-LGDPFA---R-----CQRREFNA------T-MNYNASYKDIMLFTI-------P-----
      HMPREF1070_RS05245_Bacteroides_ovatus_490456001                                   YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDATVVYNDFDNYRRRLANIPATNVLLSDLRWIA-E--G--E--PRNKRI-TGEVCEKMFARIER-EEKERGYVDYITLSSSLLFAMRYM------LSLEDMRK-E--TLYNNIRQTDYPEAKD---YLEGLTITGEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLDVLTVL-K-GHP-FVYFTSNKSSILELCDWIDRN--PF-IGSPFK---N-----CRKVEFNA------H-MNYNSKYTDMMLYTK-------P-----
      BSFG_RS03650_Bacteroides_sp_4_3_47FAA_495946269                                   YL-S-APLPFVGQKRMFAKEFIKVL-DR----F------PDST---VFVDLFGGSGLLSHITKRVRPDAVVVYNDFDNYRQRLDNIPNTNQLLADLRRIT-A--E--L--PRKKRI-TGEARERILARIEK-EEKEHGYVDYITLSSSLLFSMKYV------LNLDNMRK-E--TFYNTIHRTDYSDAKD---YLEGLTIVSEDYKEVF----KRYKDVLG-----VVFLVDPPYLSTEVGTY--KM-----YWHLADYLNVLHVL-K-EHS-FVYFTSNKSSILELCSWIGDN--PS-IGNPFK---D-----CVKVEFNA------C-VNYSSCYTDIMLCKQ-------G-----
      BA92_RS10770_Bacteroides_490455210                                                YL-S-APLPFVGQKRMFAREFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDATVVYNDFDNYRCRLVNIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITVSASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITSEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKVEFSA------S-VNYQAKYTDMMLYTK-------P-----
      JCM15754_RS11780_Prevotella_aurantiaca_640570678                                  YY-S-APLPFVGQKRMFAKEFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHITKSEKPHSKVVYNDFDGYRLRLEHVSQTNELLSELRKIV-R--D--L--PKHKPI-VGEARKRIFECLIK-CQERYGYLDFITISSSLLFSMKYC------LNIDDMNK-E--TLYNNIRSTDYPLCDG---YLDGLTIVSTDYKQVF----NQYKDTPN-----VVFLVDPPYLSTEVGTY--KM-----YWKLADYLDVLSVL-A-GHS-FVYFTSNKSSILELCDWIGRN--KH-IGNPFE---K-----CTKVECNA------R-MNYNSTYTDMMLYKN-------A-----
      JCM17725_RS06630_Prevotella_scopos_647557435                                      YY-S-APLPFVGQKRMFAKEFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHITKSEKPHSKVVYNDFDGYRLRLEHVPQTNELLSELRKIV-R--E--L--PKHKPI-VGGARKQIFECLIK-HQDRYGYLDFITISSSLLFSMKYC------LNIDDMNK-E--TLYNNIRSTDYPLCDG---YLDGLTIVSADYKQVF----NQYKDTPN-----VVFLVDPPYLSTEVGTY--KM-----YWKLADYLDVLSVL-A-GHS-FVYFTSNKSSILELCDWIGRN--KH-IGNPFE---K-----CTKVEFNA------R-MNYNSTYTDMMLYKN-------A-----
      VK67_RS05530_Mannheimia_haemolytica_493291112                                     FK-Q-APLPFVGQKRMFLAQVSQIL-NE-NI-T------DDGQ-GWTIIDVFGGSGLLAHTAKHIKPKAHIIYNDYDGYAERLKHIPDTNRLRKQIYDII-G-KS--T--PKNKRL-DPDKKSQVINIIQS-FD---GYIDVNCVASWLLFSGQQI------NSLEDLFN-K--IFWNGVRQTDYPSAEG---YLDGIEVTHESFHKLL----PRFQHKDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-VP--YIFFSSTKSEFVRFVDFMVSEK-KD-NWQTFE---N-----AKWIKVKA------S-LNYQSTYEDNLVYKF-------------
      L278_RS124350_Mannheimia_haemolytica_544865513                                    FK-Q-APLPFVGQKRMFLAQVSQIL-NE-NI-T------DDGQ-GWTIIDVFGGSGLLAHTAKHIKPKAHIIYNDYDGYAERLKHIPDTNRLRKQIYDII-G-ES--T--PKNKRL-DPDKKSQVINIIQS-FD---GYIDVNCVASWLLFSGQQI------NSLEDLFN-K--IFWNGVRQTDYPSAEG---YLDGIEVTHESFHKLL----PRFQHKDK-----VLLLLDPSYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-VL--YIFFSSTKSEFVRFVDFMVSEK-KD-NWQAFE---N-----AKWIKVKA------S-LNYQSTYEDNLVYKF-------------
      JCM16497_RS23455_Bacteroides_sartorii_640590225                                   YL-S-APLPFVGQKRMFAKEYIKVL-GE----I------KGAK---VFVDLFGGSGLLSHITKQQRPDVTVIYNDFDNYSRRLEHIGHTNAILDRLRDIL-A--S--V--PRLKIV-PKPLKGRIIEMLAE-EEAETGFVDYITLSTSLLFSMKYA------TTLDEMRK-Q--TMYNRIKRTDY-DADG---YLDDVVVESCDYRELF----EKYRNRDD-----VVFLVDPPYLSTDVSTY--SM-----CWKLSDYLDVLKVL-V-GHK-YVYFTSNKSSIVELCDWLGRN--KE-IGNPFI---G-----STRKEFNA------S-MNYNSHYTDIMLYNV-------A-----
      J450_RS04260_Mannheimia_haemolytica_525759492                                     FK-Q-APLPFVGQKRMFLAQVSQIL-NE-NI-P------DDGQ-GWTIIDVFGGSGLLAHTVKHIKPKAHIIYNDYDGYAERLKHIPDTNRLRKQIYDII-G-ES--T--PKNKRL-DPDKKSQVINVIQS-FD---GYIDVNCVASWLLFSGQQI------NSLENLFN-K--IFWNGIRQTDYPSAEG---YLDGIEVTHESFHKLL----PRFQHKDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-VP--YIFFSSTKSEFVRFVDFMVSEK-KD-NWQTFE---N-----AKWIKVKA------S-LNYQSTYEDNLVYKF-------------
      C801_RS14355_Bacteroides_uniformis_511019476                                      YL-S-APLPFVGQKRMFAKEYIKVL-GE----V------KDAK---VFVDLFGGSGLLSHITKRQCPDATVVYNDFDNYRRRIENIPRTNALLVDLRNIV-R--G--V--PKHGCI-KGTMRDEVFVRLEQ-EERTHGYIDFITISSAIMFSMKYK------LSIPEMKK-E--ALYNNIRQSDYPAASD---YLEGLTIVSCDYKVLF----EKYRDRDD-----VVFLVDPPYLSTDVGTY--NM-----YWKLSDYLDVLKVL-V-GHR-YVYFTSNKSSIIELCDWLDKN--KE-IGNPFI---G-----ATRKEFNA------S-MNYNSHYTDIMLYNV-------A-----
      HMPREF1181_RS12035_Bacteroides_490416379                                          YS-Q-APLPFVGQKRMFASEFRKVL-KR----F------SDKT---VFIDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLGNIHRTNELLEDLRETA-K--G--Y--PRHKKI-TGSMRDVFLERILQ-DEQN-GFVDYLTLSSSLLFSMKYV------LNFEELKK-Q--NLYNKLRQNDY-NCDG---YLDGLEVVCCDYKELA----DKYNADTD-----VVFLVDPPYMATDISTY--KM-----DWRLQDYLDVLLVL-S-GHP-FVYFTSGKSPILDFCEWMEQH--PG-IGNPFR---G-----TCKSTLTA------R-MNYNSSYTDIMLYKG-------T--AEA
      HMPREF1079_00192_Bacteroides_fragilis_CL05T00C42_392705106                        YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFSGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      BSBG_02560_Bacteroides_sp_9_1_42FAA_229455867                                     NL-S-APLPFVGQKRMFAKEFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKRSKPDATVVYNDFDNYRFRLKNIPQTNKLLADIRELV-G--NS-I--PKHKPI-KGELRERIFKRIEE-EELNVGYVDFITLSSSLMFSMKYK------LSVAEMRK-E--VLYNNIRKTGYPESSD---YLKGLEIVSCDYKAVF----NQYKDVPG-----VVFLIDPPYLSTDVGTY--NM-----YWRLSDYLDVLKIL-E-KHS-FVYFTSNKSSILELCEWIGAN--KT-IGNPFE---G-----CTKKEFNA------H-MNYSAEYTDMMLYKK-------Q-----
      HMPREF1079_RS0100985_Bacteroides_fragilis_695330037                               YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFSGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      C799_RS11490_Bacteroides_thetaiotaomicron_696234173                               YS-Q-APLPFVGQKRMFASEFRKVL-ER----F------GDKT---VFVDLFGGSGLLAHITKRERPDATVIYNDHDNYRGRLENIGRTNRLLADLRDMA-R--E--H--PRHKMI-TGSLRAAFLERIRQ-EEQT-GAVDYITLSSSLLFSGKYA------LNLEELGK-Q--SFYNNLRLSDY-RCGG---YLDGLEVVCCDYKVLA----DKYGCSPD-----VVFLVDPPYMATDTSTY--QM-----DWKLKDYLDVLLVL-K-GHP-FVYFTSSKSPILDFCSWMEEH--PG-SGNPFR---G-----AGRSTFAA------R-MNYASSYTDIMLYRE-------M--PGA
      BFAG_03319_Bacteroides_fragilis_3_1_12_313137261                                  YL-S-APLPFVGQKRMFAREFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKCRKPDATVVYNDFDNYRRRMAHIPQTNRLIADIRGMV-G--DA-V--PRHRPI-TGELRERIFRRIEQ-EERTVGYVDFITLSSSLMFSMKYR------LSVPEMRK-E--ALYNNIRKADYPECAD---YLDGLEIVSCDYKEVF----GRYKDTPG-----VVFLVDPPYLSTDVGTY--NM-----YWHMSDYLDVLNVL-A-GHS-FVYFTSNKSSILELCEWIGRN--RD-IGNPFE---K-----CTRVEFNA------H-MNYNASYTDMMLYRK-------E-----
      C799_02456_Bacteroides_thetaiotaomicron_dnLKV9_507741308                          YS-Q-APLPFVGQKRMFASEFRKVL-ER----F------GDKT---VFVDLFGGSGLLAHITKRERPDATVIYNDHDNYRGRLENIGRTNRLLADLRDMA-R--E--H--PRHKMI-TGSLRAAFLERIRQ-EEQT-GAVDYITLSSSLLFSGKYA------LNLEELGK-Q--SFYNNLRLSDY-RCGG---YLDGLEVVCCDYKVLA----DKYGCSPD-----VVFLVDPPYMATDTSTY--QM-----DWKLKDYLDVLLVL-K-GHP-FVYFTSSKSPILDFCSWMEEH--PG-SGNPFR---G-----AGRSTFAA------R-MNYASSYTDIMLYRE-------M--PGA
      HMPREF1055_02982_Bacteroides_fragilis_CL07T00C01_387775820                        YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      HMPREF1018_02174_Bacteroides_sp_2_1_56FAA_335946057                               YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      M070_4300_Bacteroides_fragilis_str_A7_(UDC12-2)_596213380                         YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCHWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      JCM12083_RS06185_Prevotella_shahii_647518976                                      YK-S-APLPFVGQKRMFVKEFIKVL-SQ----F------PADT---VFVDLFGGSGLLSHIAKRTKPESTVVYNDFDNYRCRLQHIPQTNRLIADLRDIV-A--DK-V--PRNKPI-TGELRKRIFARIEQ-EERVVGYVDFITLSSSIMFSMKYK------LSVPEMRK-E--TLYNSIRKSDYPTCPD---YLDGLEITSCDYKELF----NQHKDTPS-----VVFLVDPPYLSTEVGTY--NM-----YWKMADYLDVLNVL-A-GHS-FVYFTSNKSSILELCEWLGRN--RS-LGNPFE---N-----AVKVEFNA------H-MNYNASYTDMMLFKK-------E-----
      HMPREF9008_RS11875_Parabacteroides_sp_20_3_496053795                              YL-S-APLPFVGQKRMFARKFMKVL-EQ----Y------PEST---VFVDLFGGSGLLSHITKRCKPEATVIYNDFDNYHKRLENIPRTNRLIADLRSMV-G--NS-V--PRHKTI-TGELRERIFSRILQ-EEHETGYVDFITLSSSLMFSMKYK------LSVPEMRK-E--ALYNNIRKADYPECTD---YLEGLEIVSCDYKELF----NRYKDTPG-----VVFLVDPPYLSTDVGTY--NM-----SWRMSDYLDVLNVL-S-GHP-FVYFTSNKSSILELCEWIGKN--KN-TGNPFE---G-----CTRMEFNA------H-INYSSSYTDMMLFKK-------E-----
      M118_4484_Bacteroides_fragilis_str_3783N1-2_595939381                             YL-S-APLPFVGQKRMLAKEFMKVL-EQ----Y------PDGT---LFVDLFGGSGLLSHITKSLKPHSTVIYNDFDNYRFRMKHIPQTNQLLADIREMV-G--NS-V--PRHKII-KGELRERIFSRIEQ-EENSTGYVDFITLSSSILFSMKYK------LSVQDMRK-E--ALYNNIRKTGYPECTD---YLEGLEIVSCDYKEVF----NRYKDIPG-----VVFLVDPPYLSTDVGTY--NM-----YWNMADYLDVLNVL-K-GHS-YVYFTSNKSSILELCEWIGKN--RD-LGNPFE---N-----CTKVEFNA------H-MNYNSSYTDMMLYKK-------E-----
      M088_0657_Bacteroides_ovatus_str_3725_D1_iv_649508868                             YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRQRLANIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITISASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITGEDYKEVF----KRYKDAPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRRVEFSA------N-VNYQAKYTDMMLYTK-------P-----
      BFAG_RS07280_Bacteroides_fragilis_695344948                                       YL-S-APLPFVGQKRMFAREFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKCRKPDATVVYNDFDNYRRRMAHIPQTNRLIADIRGMV-G--DA-V--PRHRPI-TGELRERIFRRIEQ-EERTVGYVDFITLSSSLMFSMKYR------LSVPEMRK-E--ALYNNIRKADYPECAD---YLDGLEIVSCDYKEVF----GRYKDTPG-----VVFLVDPPYLSTDVGTY--NM-----YWHMSDYLDVLNVL-A-GHS-FVYFTSNKSSILELCEWIGRN--RD-IGNPFE---K-----CTRVEFNA------H-MNYNASYTDMMLYRK-------E-----
      M125_RS18320_Bacteroides_492741740                                                YL-S-APLPFVGQKRMFAREFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRSDATVVYNDFDNYRCRLVNIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEIRDKMFARIER-EEKEHGYVDYITVSASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITSEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKVEFSA------S-VNYQAKYTDMMLYTK-------P-----
      HMPREF9007_RS09530_Bacteroides_sp_1_1_14_496037689                                YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRQRLANIPVTNVLLSDLHRIA-E--G--E--PRNKRI-TGEVRNKMFARIER-EEKEHGYVDYITISASLLFAMKYV------TCLKEMKK-E--TIYNRIRRTDYPEAED---YLEGITVTCEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLNVLNVL-K-GHA-FVYFTSNKSSILELCDWMGRN--PF-LGNPFK---E-----CRKVEFSA------N-VNYQAKYTDMMLYTV-------P-----
      BSFG_RS03795_Bacteroides_sp_4_3_47FAA_495946224                                   YS-Q-APLPFVGQKRMFASEFRKVL-KR----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SRDMHGIFLERIRR-EENT-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNIRLSDY-SCEG---YLDGLEVVCCDYRELT----DKYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------T--TEA
      BSBG_RS01095_Bacteroides_sp_9_1_42FAA_495950973                                   YL-S-APLPFVGQKRMFARKFMKVL-EQ----Y------PEST---VFVDLFGGSGLLSHITKRCKPEATVIYNDFDNYHKRLENIPRTNRLIADLRAMV-G--NS-V--PRHKTI-TGELRERIFSRILQ-EEHETGYVDFITLSSSLMFSMKYK------LNVPEMRK-E--ALYNNIRKADYPECTD---YLEGLEIVSCDYKELF----NRYKNTPG-----VVFLVDPPYLSTDVGTY--NM-----SWRMSDYLDVLNVL-S-GHP-FVYFTSNKSSILELCEWIGKN--KN-IGNPFE---G-----CTRMEFNA------H-INYSSSYTDMMLFKK-------E-----
      HMPREF1981_RS13185_Bacteroides_pyogenes_545407693                                 HL-S-APLPFVGQKRMFAKEFVKVL-EQ----F------SEKT---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRKRLGNIPRTNRLLSDLREIS-N--G--T--PKHKPI-TGEKREKVFARIQK-EEKEYGYVDYITLSSSLLFSMKYK------ICLEEMKK-E--TIYNKIRVSDYPEAGD---YLQGLTITCDDYKKVF----NQYKDVPG-----VLFLIDPPYLSTEVGTY--NM-----SWRLADYLDVLGVL-K-EHS-FVYFTSNKSSILELCDWIGRN--HS-IGNPFK---K-----CRKVEFNA------S-MNYSAKYIDIMLYTV-------P-----
      BVU_RS04850_Bacteroides_vulgatus_500646766                                        YS-Q-APLPFVGQKRMFASEFRKVL-KR----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SKKMHGMFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNMRLSDY-CCEG---YLDGLEVVYCDYRELV----DRYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-E-GHP-FIYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGISTLTA------R-MNYSSSYTDIMLYKE-------M--TEA
      M137_RS14330_Bacteroidales_492281093                                              YL-S-APLPFVGQKRMLAKEFMKVL-EQ----Y------PDGT---LFVDLFGGSGLLSHITKSLKPHSTVIYNDFDNYRFRMKHIPQTNQLLADIREMV-G--NS-V--PRHKII-KGELRERIFSRIEQ-EENSTGYVDFITLSSSILFSMKYK------LSVQDMRK-E--ALYNNIRKTGYPECTD---YLEGLEIVSCDYKEVF----NRYKDIPG-----VVFLVDPPYLSTDVGTY--NM-----YWNMADYLDVLNVL-K-GHS-YVYFTSNKSSILELCEWIGKN--RD-LGNPFE---N-----CTKVEFNA------H-MNYNSSYTDMMLYKK-------E-----
      M080_1486_Bacteroides_fragilis_str_3397_T10_595910038                             YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      M116_RS19700_Bacteroides_fragilis_492352476                                       YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRGRLENIGRTNTLLGDLRKIV-G--I--Y--PHNQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEH-L--CFYNKIRQADY-RCDG---YLDGLEVVCYDYKELA----ETYRVLPG-----VVFLVDPPYMGTDISTY--RM-----DWKLGDYLDVLPVL-K-GHP-FVYFTSSKSPILDFCKWMEEH--PG-TGNPFK---G-----TGRSAITA------R-MNYNSSYTDIMLYNN-------M--ACT
      M098_0958_Bacteroides_vulgatus_str_3775_SR(B)_19_649521449                        YS-Q-APLPFVGQKRMFASEFRKVL-KH----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SKKMHSIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELEK-L--NFYNNMRLSDY-CCEG---YLDGLEVVCCDYRELV----DRYRDSPN-----VVYLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FIYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA
      M098_RS09095_Bacteroides_vulgatus_696374681                                       YS-Q-APLPFVGQKRMFASEFRKVL-KH----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SKKMHSIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELEK-L--NFYNNMRLSDY-CCEG---YLDGLEVVCCDYRELV----DRYRDSPN-----VVYLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FIYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA
      BN535_00547_Bacteroides_492444862                                                 YL-S-APLPFVGQKRMFAKDFIRVL-GQ----F------PGST---VFVDLFGGSGLLSHITKCVRLDAAVVYNDFDNYRRRLANIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITVSASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITSEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--EM-----SWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKAEFSA------S-VNYQAKYTDMMLYTK-------P-----
      M116_4685_Bacteroides_fragilis_str_3719_A10_596095999                             YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRGRLENIGRTNTLLGDLRKIV-G--I--Y--PHNQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEH-L--CFYNKIRQADY-RCDG---YLDGLEVVCYDYKELA----ETYRVLPG-----VVFLVDPPYMGTDISTY--RM-----DWKLGDYLDVLPVL-K-GHP-FVYFTSSKSPILDFCKWMEEH--PG-TGNPFK---G-----TGRSAITA------R-MNYNSSYTDIMLYNN-------M--ACT
      BSEG_RS20295_Bacteroides_dorei_696373063                                          YS-Q-APLPFVGQKRMFASEFRKVL-KC----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SREMHGIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNMRLSDY-SCEG---YLDGLEVVCCDYRELV----DKYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA
      BSEG_01570_Bacteroides_dorei_5_1_36/D4_345456400                                  YS-Q-APLPFVGQKRMFASEFRKVL-KC----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SREMHGIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNMRLSDY-SCEG---YLDGLEVVCCDYRELV----DKYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA
      HMPREF9441_RS15520_Paraprevotella_clara_495898225                                 YL-S-APLPFVGQKRMFAREFIKVL-EQ----F------NDKT---VFVDLFGGSGLLSHITKCQRPDATVVYNDFDGYRERLQAIPQTNILLADFRRLA-A--G--V--PKDKPI-RGAVRERILERIAM-AEREWGYVDYITVSSALMFSMKYA------TSLEAMRK-E--TLYNNIRKTDYPPCPD---YLDGLTITSCDYKELY----EKYKDVPG-----VVFFVDPPYLSTEVGTY--KM-----YWRLSDYLDVLNVL-R-DKP-FVYFTSNKSSIIELCEWLGEN--KT-LGNPFK---N-----CGKVEFNA------H-MNYSAKYTDIMLYKK-------Q-----
      M080_RS26780_Bacteroides_fragilis_499301742                                       YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      M070_RS00960_Bacteroides_fragilis_695540882                                       YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCHWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      HMPREF1057_RS0113675_Bacteroides_finegoldii_495041624                             YS-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDATVVYNDFDNYRCRLANIPATNVLLSDLRRIA-E--G--E--PKNKRI-TGEVRDKMFARIER-EEKEHGYVDYITISASLLFAMKYV------ASLEEMKK-E--AIYNRIRRADYSKAED---YLEGIMVTCKDYKEVF----KCYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKVEFSA------S-VNYQAKYTDMMLYTK-------P-----
      D11S_2165_Aggregatibacter_actinomycetemcomitans_D11S-1_261414145                  FK-Q-APLPFVGQKRQFLKHFKAIL-NE-QI-P------GDGE-AWTIIDTFGGSGLLAHTAKQLKPCARVIYNDFDGYADRIKHIDDINRLRGQIAALL-S-GV--P--RQKRVT-DKAIKTEIVKTIEA-FD---GYVDLASLASWLLFSGQQV------GSFDELCR-K--DFWHCVCASDYPSADG---YLDGVEVVSESFHTLL----PRFTADPQ-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIDYMQEDK-VD-NWQAFA---N-----AKRIAIKT------K-LNYHGEYEDNLVYKF-------------
      HMPREF9011_RS05730_Bacteroides_sp_3_1_40A_496057734                               YL-S-APLPFVGQKRMFAKEFMKVL-EQ----Y------PDGT---LFVDLFGGSGLLSHITKSLKPHSTVIYNDFDNYRFRMKHIPQTNQLLADIREMV-G--NS-V--PRHKII-KGELRERIFSRIEQ-EENTTGYVDFITLSSSLLFSMKYK------LSVQDMRK-E--ALYNNIRKTGYPECTD---YLEGLEIVSCDYKEVF----NRYKDIPG-----VVFLVDPPYLSTDVGTY--NM-----YWNMADYLDVLNVL-K-GHS-YVYFTSNKSSILELCEWIGKN--KD-LGNPFE---N-----CTKVEFNA------H-MNYNSSYTDMMLYKN-------E-----
      M117_RS13145_Bacteroides_fragilis_695509259                                       YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRGRLENIGRTNTLLGDLRKIV-R--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QSMEELEH-L--CFYNKIRQADY-RCDG---YLDGLEVVCYDYKELA----DTYRVLPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLPVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      M127_RS12840_Bacteroides_695294566                                                YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA
      HMPREF9952_RS06315_Haemophilus_pittmaniae_494451533                               YK-Q-APLPFVGQKRQFLKHFEVVL-NE-NI-P------GDGD-DWTIIDTFGGSGLLSHAAKQLKPKARVIYNDFDGYAERIKHIDDINRLRAQIAALL-V-DI--P--RQKRIT-DKALKAQIIDTIKA-FD---GYIDLATLTSWLLFSGQQV------GTFEELFA-K--DFWHCIRQSDYPSADG---YLDGIEVVSESFHTLL----PRFSADQQ-----AVFVLDPPYLCTRQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVEDK-VD-NWEAFY---N-----SERVVVKA------S-ASYSGKYEDNMVYKF-------------
      I926_RS02325_Pasteurella_multocida_810414634                                      FK-Q-APLPFVGQKRMFIKHFEHIL-NE-NI-K------GDGK-DWTIIDVFGGSGLLSHTAKRVKPKARVIYNDFDRYVERLNHITETNQLREILYHSV-S-EI--I--PKNKLI-SKQAKEEIINKIKA-FN---GYKDVNCLSSWLLFSGQQV------DSLDELFK-Q--RFYNCIRQSNYALADG---YLDGLEVINESFHQLL----PRFIDKKK-----VLLVLDPPYLCTRQESY--KQS-T--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFIEAMVEDK-WD-NWQAFN---E-----INRITVNA------S-ASYSGKYEDNLIYKF-------------
      SALWKB29_RS09160_Snodgrassella_alvi_739635808                                     FN-K-APLPFVGQKRNFLKQFAKVL-NE-NI-D------GQGE-DWTIIDVFGGSGLLAHHAKRLKPNARVIYNDFDNYADRLAYIEDINRLRQQLAVTV-K--D--I--PIKSKL-DKRTQSLVIEQIRS-FK---GYIDLGSLQTWLLFSGDIL------ENLDQIYKES--VLYNRVKKSDYPLATG---YLDGVEVVSKSFDVLL----PDYVDNEN-----TLLVLDPPYLFSEQKAY--RKA-E--EFRFISFIKLMTLI-R-PP--FILFSNKHSEILEYLDYAIEQK-D----ERFA---G-----YDFCAINA------T-ISKKRTYKDYMIYKF-------------
      L11_RS07795_Neisseria_weaveri_750388073                                           HA-K-APLPFVGQKRNFIKHYLGVL-DK--I-P------GSGS-GWNIVDVFGGSGLLAHVAKRIKPDARVIYNDFDNYAARVKAIPDINRLRRLISGYL-A--G--Y--VKKQRI-PDDVKQVIIGEIER-FD---GYKCHVVLASWFLFSGRQA------ANLERFYR-S--EWYFNLPLSDYPVADD---YLDGLEIIRQSYETLI----PQFSDDPQ-----ALLVLDPPYLSTTQAAY--AQD-G--RFGLVDYLKLVNLV-R-PP--YLFFSSTRSEFIDYIDAVVSMQ-LD-NWHVFD---H-----STRLTVQA------K-VSKYASYEDNLVYKL-------------
      PSYCG_RS09460_Psychrobacter_sp_G_521257429                                        QH-K-APLPFVGQKRFFLKHFRQVI-DE-HI-P------DNGE-GWTIIDVFGGSGLLSNNAKHLKPAATVIFNDFDNYCERLKHVDDSNRLRRQLMDVL-A--D--Y--PRQTLL-DRNIKSQVIDVIEK-FK---GHIDLRVLSTWLLFAGKHA------TSLDELYA-S--HLYNSLRRTDFTAVDD---YLTGLDIVCESYDTLI----PQYANQPK-----TLLVFDPPYVNTQQGAY--AQK-E--YFGMVQFLKLMQCV-R-PP--YIFFSSTRSELPAYLDFLREHD-SC-AWQRVG---N-----YETISLKA------Q-MNKNSSYEDHMIYRF-Q-----------
      l11_17040_Neisseria_weaveri_LMG_5135_343968128                                    HA-K-APLPFVGQKRNFIKHYLGVL-DK--I-P------GSGS-GWNIVDVFGGSGLLAHVAKRIKPDARVIYNDFDNYAARVKAIPDINRLRRLISGYL-A--G--Y--VKKQRI-PDDVKQVIIGEIER-FD---GYKCHVVLASWFLFSGRQA------ANLERFYR-S--EWYFNLPLSDYPVADD---YLDGLEIIRQSYETLI----PQFSDDPQ-----ALLVLDPPYLSTTQAAY--AQD-G--RFGLVDYLKLVNLV-R-PP--YLFFSSTRSEFIDYIDAVVSMQ-LD-NWHVFD---H-----STRLTVQA------K-VSKYASYEDNLVYKL-------------
      PMCN03_RS01910_Pasteurella_multocida_492125251                                    YK-Q-APLPFVGQKRLFLNHYINII-NE-HI-P------DDGE-GWTIIDAFGGSGLLSHVTKHIKPKARVIYNDFDGYSERLKHIRDLNKLRRILLELL-K--N--E--PRSKQL-SCDMKYKVIQAIEA-FT---GYKDPHVLSTWLLFSGQQV------RTLSELYR-L--SFYNRIRLSDYSEAQD---YFNGFEVANESFHSLL----PRFVDKQK-----TLFVLDPPYLCTHQAAY--SMD-T--YFDLIDFLRLINLT-R-PP--FIFFSSTKSEFIRFVDFMLETK-TH-NWESFT---D-----YKKISINT------S-TNYSGKYEDNLVYKF-------------
      C228_RS0112985_Actinobacillus_capsulatus_517482436                                YK-Q-APLPFVGQKRQFLAQYAAIL-NQ-YI-P------NDGQ-GWTIIDAFGGSGLLSHTAKQLKPAARVIYNDFDGYATRLKHIDDINQLRGKIYTLL-D-GV--P--RQKRIT-DHSIKIKIIETIEA-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRLSDYQNADG---YLDGIEITNESFHTLL----PRFINDQR-----TVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMQTDK-VD-NWQAFD---G-----AQRIAIKT------S-LNYQGEYEDNLVYKF-------------
      GGE_RS05405_Haemophilus_haemolyticus_763375484                                    FK-Q-APLPFVGQKRMFLKHFETIL-NE-NI-K------DDGE-GWTIIDTFGGSGLLSHAAKRLKPKACVIYNDFDGYAERLAHIDDINALRSQLFTIV-G-NA--T--PKNKRM-PKELKAECVKIIQA-FD---GYKDLNCLASWLLFSGQQV------ATIDELFQ-N--DFWHCIRQSDYPKADG---YLDGVEIVRESFHTLL----PKFADNPK-----ALFVLDPPYLCTRQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMQEDK-ID-NWEAFD---N-----TKRIVVNA------S-ASYSGKYEDNMVYKF-------------
      MOMA_RS09420_Moraxella_macacae_750343301                                          YQ-K-APLPFVGQKRMFLKEFRKIL-EK--I-P------NDGE-SWTIVDVFGGSGLLANNAKACKPKATVIYNDFDGYTKRLAHIDDINRLRTILFGLT-K--D--V--PRQKRI-PDGLKGRILQVIAD-FD---GYIDTRSVSTWLLFSGKQI------AHIDELAD-N--QMYNTVRTNDYDRADG---YLNGILVTNESFEILI----PKYADRPN-----TMLLLDPPYICTEQKAY--AMT-G--YFGMTKFLRLMKLV-R-PP--YLLFSSTRSELLDYMDYLKDCE-PV-MWERIG---G-----FEKVSVQS------Y-VNYTSEYEDNMIFKF-------------
      HMPREF9065_RS02985_Aggregatibacter_sp_oral_taxon_458_545363364                    FK-Q-APLPFVGQKRMFLNHFKAIL-NE-QI-P------GDGE-GWTIIDTFGGSGLLSHTAKQLKPRARVIYNDFDGYAERIKHIDDINRLRAQIAALL-A-GV--P--RQKRVT-DKALKAQIIDTIKA-FD---GYVDLASLTSWLLFSGQQV------GSFDELCK-K--DFWHCVRASDYPSADG---YLDGVEVVSESFHTLL----PRFTADPQ-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMLQDK-VD-NWQAFD---G-----AQRIAIKT------S-LNYQGEYEDNLVYKF-------------
      E9K_RS05470_Moraxella_catarrhalis_489757769                                       HH-K-APLPFVGQKRMFLKEFRKIL-DK--I-P------SDGE-NWTIIDVFGGSGLLANNAKAYKPNATVIYNDFDGYTKRLAHIDDINRLRAILFEMT-K--D--V--PRQKRI-SDELKGRILQAIDD-FD---GYVDARSVSTWLLFSGKQI------NHISELTD-H--SMYNTVRTSDYDNAQD---YLDGLVITHESFDTLI----PKFADKPN-----ALLLLDPPYVCTEQKAY--ALK-G--YFGMTKFLRLMKLV-R-PP--YLFFSSTRSELLDYMDYLKDCE-PV-MWELVG---D-----FEKVSVNS------H-VNYNAEYEDNMIFKF-------------
      RHAA1_RS00240_Aggregatibacter_actinomycetemcomitans_491746110                     FK-Q-APLPFVGQKRKFLKHFNAIL-NR-HI-A------GDGQ-GWTIIDTFGGSGLLAHAAKQLKPRARVIYNDFDGYFERIKHIDDINRLRGQIAALL-S-GV--P--RQKRVT-DKALKADIIKTIEA-FD---GYVDLASLASWLLFSGQQV------GSFDELCG-K--DFWDCVRASDYPSAEG---YLDGVEVVCESFHTLL----PRFTADPQ-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMQEDK-VD-NWQAFA---G-----AQRIAIKT------S-LNYQGEYEDNLVYKF-------------
      E9U_RS07920_Moraxella_catarrhalis_489767871                                       HH-K-APLPFVGQKRMFLKEFRKIL-DK--I-P------NYGE-NWTIIDVFGGSGLLANNAKAYKPNATVIYNDFDGYTKRLAHIDDINRLRAILFEMT-K--D--V--PRQKRI-SDELKGRILQAIDD-FD---GYVDARSVSTWLLFSGKQI------NHISELTD-H--SMYNTVRTGDYDNAQD---YLDGLVITHESFDTLI----PKFADKPN-----ALLLLDPPYVCTEQKAY--ALK-G--YFGMTKFLRLMKLV-R-PP--YLFFSSTRSELLDYMDYLKDCE-PV-MWELVG---D-----FEKVSVNS------H-VNYNAEYEDNMIFKF-------------
      E9G_RS00410_Moraxella_catarrhalis_489754581                                       HH-K-APLPFVGQKRMFLKEFRKIL-DK--I-P------SDGE-NWTIIDVFGGSGLLANNAKAYKPNATVIYNDFDGYTKRLAHIDDINRLRGILFEMT-K--D--V--PRQKRI-SDELKGRILQAIDD-FD---GYVDARSVSTWLLFSGKQI------NHISELTD-H--SMYNTVRTSDYDNAQD---YLDGLVITHESFDTLI----PKFADKPN-----ALLLLDPPYVCTEQKAY--ALK-G--YFGMTKFLRLMKLV-R-PP--YLFFSSTRSELLDYMDYLKDCE-PV-MWELVG---D-----FEKVSVNS------H-VNYNAEYEDNMIFKF-------------
      HMPREF1053_RS00285_Haemophilus_haemolyticus_491876509                             FK-Q-APLPFVGQKRMFLKHFETIL-NE-NI-E------DDGE-GWTIIDTFGGSGLLSHAAKAIKPKARVIYNDFDGYAERLAHIDDINKLRAELYSVV-G-NA--T--PKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--DFWHCIRQSDYQKADG---YLDGVEIVQESFHTLL----PKFSDDPK-----ALFVLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIKYMVDDK-VH-NWQNFD---N-----AQRIVVNA------S-ASYSGKYEDNMVYKF-------------
      HMPREF9016_RS00975_Neisseria_sp_oral_taxon_014_496464403                          HS-T-APLPFVGQKRYFIKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQARVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITRQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D---------
      P1062_RS06555_Pasteurella_multocida_514429666                                     FK-Q-APLPFVGQKRMFLKHFEQVL-DE-NI-Q------GDGE-GWTIIDVFGGSGLLSHTAKRLKPKARVIYNDFDRYTERLNHIAETNQLREILYQTV-N-GI--I--PKNKLI-NKRLKEEIINKINR-FN---GYKDVNCLSSWLLFSGQQV------GSLHELFK-R--RFYNCVRKTDYVLTEG---YLEGLEVVSESFHQLL----PKFQNKEK-----VLIVLDPPYLCTRQESY--RQA-T--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFLDYTQEDK-TD-NWQTFE---G-----YKRIIVNT------S-ASYSGKYEDNLIYKF-------------
      P1062_RS03850_Pasteurella_multocida_514429566                                     FK-Q-APLPFVGQKRMFLKHFEQVL-DE-NI-Q------GDGE-GWTIIDVFGGSGLLSHTAKRLKPKSRVIYNDFDGYSERLNHIAEINQLREILYQTV-N-GI--I--PKNKLI-NKRLKEEIINKINR-FN---GYKDVNCLSSWLLFSGQQV------GSLHELFK-R--RFYNCVRKTDYVLTEG---YLEGLEVVSESFHQLL----PKFQNKEK-----VLIVLAPPYLCTRQESY--RQA-T--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFLDYTQEDK-TD-NWQTFE---G-----YKRIIVNT------S-ASYSGKYEDNLIYKF-------------
      NMA510612_RS09285_Neisseria_meningitidis_488141552                                HS-T-APLPFVGQKRYFIKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQARVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITQQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D---------
      NM70082_RS106455_Neisseria_meningitidis_488182095                                 HS-T-APLPFVGQKRYFIKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQAQVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITQQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D---------
      K941_RS0107980_Moraxella_caprae_656071953                                         FK-T-APLPFVGQKRQFIGRFEKLLLNN----I------PNDGEGWTVIDVFGGSGLLAHNAKRLLPKTTVIYNDFDDYTNRLKHIPTTNALRQALSDIL-K--H--E--PRSLKL-SSTVKQQVLDIVKD-FQSQGKFIDVQTIAGWLLFSGRQV------ADLDEFMA-E-STLYNRITKTDYELADG---YLDGLVITCESFEQLL----TKHQATPN-----CLLLLDPPYVCTTQSAY--NLHERGGYFGMTKFLTLMH-Y-V-KPP-YIFFSSTRSELLDYMSYVEQY-----EPHTWERIGG-----FERIVVKV------T-VNKGLGYEDNILAK--------------
      IO48_RS11150_Gallibacterium_anatis_746098344                                      FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKRLKPNARVIYNDFDGYAERLKHIDDINRLRQQLSNLL-T--G--Y--PRQKRL-DIAMRHKVIDAIES-FD---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSELL----PKFANDKK-----AIFVLDPPYLCAHQASY--KQE-S--YFGLINFLELIRLT-R-PP--YLFFSSTKSEFVRFVDWLVETR-SD-NWQSFA---D-----YQRIIVRT------S-ASYIGKYEDNLIYKC-------------
      JL04_RS11025_Gallibacterium_anatis_746100920                                      FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSYAAKQLKPKARVIYNDFDGYAERLKHIDDINRLRQQLSDLL-T--G--C--PRQKRL-DIAMRHKIIDAIES-FD---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSELL----PKFANDKK-----AIFVLDPPYLCTHQASY--KQE-S--YFGLINFLELIRLT-R-PP--YLFFSSTKSEFVRFIDWLIATK-GD-NWQSFA---D-----YQRIVIQT------S-ASHNGQYEDNLIYNF-------------
      P375_RS00515_Gallibacterium_genomosp_2_746064999                                  FS-Q-APLPFVGQKRMFLNQFKSVL-NE-MI-T------NDGE-GWTIVDAFGGSGLLSHTAKRLKQKATVIYNDFDGYVEWLKHIDDINHLRQQLSDLL-A--D--Y--PRRKRL-DIAMRHKIIDVIES-FD---GYKDPHILCAWLLFSGQQV------KSVDELYR-H--GFYNCVRQSDYPTADG---YLDGIEVVHESFSTLL----PQFSDDKR-----VIFVLDPPYLCTHQASY--KQD-G--YFDLINFLGLIRLT-R-PP--YIFFSSTKSEFVRFVDWLIATK-GD-NWQSFI---D-----YKRIIVQI------S-TSYSGKYEDNLIYRA-------------
      GCWU000324_01234_Kingella_oralis_ATCC_51147_237868315                             YR-K-APLPFVGQKRNFLKHLIPVL-QQ-NI-P------NDGA-GWTIVDVFGGSGLLAHTTKRTLPKARVIYNDFDGYAERIKNIPDTNRLRDRLAEVL-D--K--Q--PRDKAL-NADAKAAVVVIIRG-FG---GYKDLNCLRSWLLYSGKEA------AT-PDCLYGE--TLYNRLRLSPYPEAAD---YLDGLDITSQSFETLL----PRYAGQDN-----TLLILDPPYVCTQQGMY--ANQ-T--YFGMVPFLTLAQMV-R-PP--FVFFSSTRSEFLDYLGFLRQYK-PQ-EWANWA---G-----FGQVTIRG------T-LSKGSCYEDNMVYRF--ER---------
      JP30_RS08890_Gallibacterium_anatis_746079094                                      FS-Q-APLPFVGQKRMFLKQFKAVL-NQ-MI-D------NDGE-GWTIVDAFGGSGLLSHTAKQLKPKAKVIYNDFDGYAERLKHIDDINHLRQQLSDLL-A--D--Y--PRRKRL-DIAMRHKIIDVIES-FD---GYKDPHILCAWLLFSGQQV------KSVDELYR-H--GFYNCVRQSDYPTADG---YLDGIEVVHESFSTLL----PQFSDDKR-----VIFVLDPPYLCTHQASY--KQD-G--YFDLINFLELIRLA-R-PP--YIFFSSTKSEFVRFVDRLITTK-GD-NWQSFV---D-----YHRIVVQT------S-TSYSGKYEDNLIYKS-------------
      JP36_RS09335_Gallibacterium_genomosp_1_746108169                                  FT-Q-APLPFVGQKRMFLNHFKTVL-NE-MI-T------NDGE-GWTIVDAFGGSGLLSHTAEQLKPQARVIYNDFDGYAERLKHIDDINRLRQILSKLL-A--N--Y--PRQKRL-DIAMRHKVIDAIES-FD---GYKDPHILCTWLLFSGQQV------KSIDELYR-H--GFYNCVRQSDYPEADG---YLDGIEVVNESFSDLL----PKFFDDSK-----AIFVLDPPYLCTHQDSY--KQE-S--YFDLINFLELIRLT-R-PP--YLFFSSTKSEFVRFVDWLIAAK-GD-NWQSFE---D-----YHRIVVQT------S-TSYSGKYEDNLIYKC-------------
      HMPREF0669_RS04845_Prevotella_sp_oral_taxon_299_496519123                         HM-K-APLPFVGQKRNFIKALTPII-ER----Q------PDNT---IFVDLFGGSGLLSNLVKELKPNARVIYNDFDNYSERLAHIKETEELRHMIGEKL-K--D--V--PKCSKV-SEELKAEICDLIED-FKAKKGFVDIVTVASWLLFSNRTA------GDIDDIRA-KRNTFYNSVIKAPLK-ADG---YLEGAERVCKDFQKLI----DEFKNVPN-----VLFICDPPYMLTEKAHY--KKT----YWGLGKYLNLLKDM-T-GLN-SIYFTSSKSGLLDFYHWWEKN-----MPQAIKK--P-----YKIISNNV------GYFHESREYEDIMMYN--------------
      UMN179_RS08310_Gallibacterium_anatis_762905187                                    FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKRLKPNARVIYNDFDGYAERLKHIDDINRLRQQLSNLL-T--G--Y--PRQKRL-DIAMRHKVIDAIES-FD---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSELL----PKFANDKK-----AIFVLDPPYLCTHQARY--KQE-S--YFDLINFLELIKLT-R-PP--YLFFSSTKSEFVRFVDWLIATK-GD-NWQSFA---D-----YQRIIVQT------S-TSYSGKYEDNLIYKC-------------
      JP35_RS03815_Gallibacterium_anatis_746003746                                      FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKHLKPNARMIYNDFDGYAERLKHIDDINRLRQILSELL-A--N--C--PRDKRL-DIAMRHKVIDAIEA-FE---GYKDPHILCAWLLFSGQQV------SSINELYR-R--GFYNCVRQNDYPTADG---YLDGIEVVNESFVTLL----PKFADDSK-----AIFVLDPPYLCTKQASY--RQD-S--YFDLIAFLDLIKLT-R-PP--YLFFSSTKSEFIRFVDWLIASK-GD-NWQSFV---D-----YQRIIVQT------S-TSYSGKYEDNLIYKC-------------
      ASUC_RS07260_Actinobacillus_succinogenes_501020456                                FN-Q-APLPFVGQKRMFLQHFRKIL-NQ-HI-A------NNGD-GWTIVDVFGGSGLLSHTARNTKSAATVIYNDFDGYSERLQHIGDINRLRADLYALV-D-NA--A--PKNKRL-SKALKSDVIHVIQN-FD---GYKDLNCLSSWLLFSGQQV------GNFTELFN-R--DFWHCIRKSDYPEADG---YLEGITVTNESFDSLI----PRFAGKDK-----VLLILDPPYLCTRQDSY--KQA-T--YFDLIDFLKLINLT-K-PP--YIFFSSTKSEFIRFIDYMIDSK-AD-NWRSFD---N-----CHRIAVNA------S-ASYSGQYEDNLVFKF-------------
      HMPREF1054_1309_Haemophilus_paraphrohaemolyticus_HK411_385696246                  FK-Q-APLPFVGQKRMFLKHVQAVL-DK-HI-D------GEGE-GWTIVDVFGGSGLLSHTAKHIKPKATVIYNDFDGYAKRLKYIDDINRLRQIIFNHL-H-GI--V--PKNGRL-SKEIKAEIINKIND-FQ---GYKDLNCLASWLLFSGQQV------SSFEALFA-K--DFWHCVRQSDYPSAEG---YLDGIEIVSESFHKLI----PRYKDQEK-----VLLLLDPPYLCTRQESY--KQS-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRYLDYMQESK-TD-NWQAFE---N-----YERIVVKA------S-TSKDGIYEDNMIYKF-------------
      NEIPOLOT_RS06080_Neisseria_polysaccharea_489846667                                YR-K-APLPFTGQKRNFLKLFKQVL-ND-HI-P------GDGE-DWTILDAFGGFGLLSHTAKQCKPAARVIYNDYDGYSERLQHIPDINHLRRLLAGLL-T--P--V--PRSKPV-PPAIKAAIVAAIRS-FG---GYIDLDCLVSWLLFSGNTA------ADLDELGR-K--TMYNCISLSDYPEVQD---YLQGVEIVDQSYRELL----PQHIGNPR-----TLLVLDPPYVCTQQGNY--RKA-A--YFGMVEFLRLMAMV-R-PP--FIFFSSTRSELPAYLDLVAELR-LP-GWERFT---G-----SQTLTVSS------T-INRNSSYDDHLIYKF-------------
      Q321_RS0105900_Conchiformibius_steedae_652666097                                  YT-Q-APLPFTGQKRRFLNHFKSLL-NQ-HL-D------GDGA-GWTIIDAFGGSGLLAHTAKRCKPAARVIYNDFDGYAERLAHIPDTNRLRQILADLL-Q--H--Q--PRQKRL-PDAVKKAAEAAING-FG---GYLDLDCLAAWLLFSGKTA------RDLPDLYS-H--QFYHCVRQHDYPDATD---YLDGLEIVSQPYHELL----PPHLDDPK-----TLLVLDPPYVCTQQGSY--RKA-G--YFGMVQFLRLMRLV-R-PP--FVFFSSTRSELLEYLDLVIGDK-ME-GWDRFK---D-----YQKISLTT------H-INQQAVYEDNLVYRF-------------
      HMPREF9021_RS11285_Simonsiella_muelleri_488719230                                 YT-Q-APLPFTGQKRRFLNHFKTLL-KQ-HI-P------NDGD-GWTIVDAFGGSGLLAHTAKQVLPKARVIYNDFDGYTERLKNINDTNQLRKIIFDLT-R--D--Y--PLKQKL-PETLKKTIQATLQT-FG---GYIDLDCVASWILFSSRQT------TDLNDLIYNH--TYFNNVRLSDYPSADG---YLDSVEVVSKSYHELL----PEFQDNSK-----ALLVLDPPYVSTAQGAY--RKA-G--YFGMVEFLRLMRLV-R-PP--FVFFSSTRSELLDYLDLIVNEK-AE-GWQNLQ---D-----YQKISVHI------T-MNKTAHYEDNMIYKF-----QAA-----
      Q338_RS03200_Alysiella_crassa_736171011                                           FY-Q-APLPFTGQKRRFLTAFKQVL-NQ-CI-A------DDGV-GWTIIDVFGGSGLLAHTAKRGKPAARVIYNDFDDYATRLQNINDTNRLRQIIFDLT-Q--H--I--PRNQKL-PENTQKQVQAALKS-FD---GYVDLHSVASWLLFSGRQT------TDLDDLLHNH--TYYNGVRISDYPQADG---YLDGLEITSQSYADLL----PQWIGRDK-----VLFLFDPPYVCTAQGAY--RNE-T--YFGMVEFLKLMRLV-R-PP--FVFFSSTRSEFLDYLNLVIGYK-LD-GWEHFA---G-----YRKINVQV------H-LNKNAQYEDNLVYKL-NSAFEIA-----
      Q338_RS01810_Alysiella_crassa_736169879                                           YH-Q-APLPFTGQKRNFLKAFKQVL-NE-QI-S------GDGD-GWTIIDVFGGSGLLAHTAKRCKPAARVIYNDFDGYATRLQHIDDTNRLRQVIFNLL-Q--D--C--PRKTKL-TPALKAVVQATLQT-FG---GYVDVASVASWILFSGQQA------RTLDDLMQ-H--NFYNRVRLSGYPSADG---YLDGLEIVSQSYVDLL----PQFINQDK-----VLLLLDPPYVCTAQGAY--YND-V--YFGMVEFLKLMSMV-R-PP--FVFFSSTRSEFLDYLDLVIGYK-LD-GWERFA---G-----YHKVSLMA------G-LNYSARYEDNMVYKF-------------
      HMPREF9021_RS03005_Simonsiella_muelleri_488717644                                 YS-Q-APLPFTGQKRNFLKFFQQVL-KE-NI-S------NQGQ-GWTIIDAFGGSGLLAHTAKQTLPEARVIFNDFDGYTERLAHIDDTNRLRELIFNRL-N--D--LNVPKNQGL-IPQEKAEIEVIIHD-FG---GYKDVISLGSWLLFSGRQV------NQLSDLFN-Q--NWYRKIRETPYPSAVG---YLDDVEIVRRNAHELL----PDFVDSPR-----VLLVLDPPYVCTEQGSY--RQD-D--YFGMVQFLRLMSVV-R-PP--FVFFSSTRSEFLAYLDFVIETK-QA-GWERFV---D-----YRKISIHT------S-LNKQSRYEDNLVFKF--E----------
      NELON_RS04600_Neisseria_elongata_489871056                                        YR-K-APLPFTGQKRNFLKLFKQVL-NE-HI-P------GDGE-DWTILDAFGGSGLLSHTAKQCKPAARVIYNDYDGYSERLQHIPDINRLRRLLAGIL-E--P--V--PRSKPV-PPAIKAAIVAAIRS-FG---GYVDLDCLVSWLLFSGNTA------ADLDELCR-K--TMYNCISLSDYPEAQD---YLQGVEIVGQSYRELL----PQHIGNPR-----TLLVLDPPYVCTQQGNY--RKA-A--YFGIVEFLRLMAMV-R-PP--FVFFSSTRSELPAYLDLVPELR-LP-GWERFA---N-----SQTLTVSS------T-INRNSSYDDHLIYKF-------------
      KKB_RS07455_Kingella_kingae_489886467                                             FY-Q-APLPFTGQKRRFLTAFKQVL-NQ-CI-A------DDGA-DWTIVDVFGGSGLLAHTAKRCKPAARVIYNDFDGYSERLQNINDTNKLRTIIADLL-A--H--Y--PRNQKL-PDTLKKTVQATLNS-FG---GYIDLDCVASWLLFSGRQT------TDLHDLLHNH--TYYNGVRLSNYPSADG---YLDNVEVVSKSYHELL----PEFQDNPK-----ALLVLDPPYICTAQGSY--RKA-G--YFGMVQFLRLMRLV-Q-PP--FIFFSSTRSELIDYLDFIVNEK-AE-GWQNLQ---D-----YQKISVHV------T-MNKTAQYEDNMVYKF-----QAA-----
      EIKCOROL_RS01150_Eikenella_corrodens_489918676                                    YR-K-APLPFTGQKRNFLKLFKQVL-NE-HI-P------GDGE-DWTILDAFGGSGLLSHAAKQCKPAARVIYNDYDGYSERLQHIPDINRLRRLLAGIL-E--P--V--PRSKLV-PPAIKAAIVAAIRS-FG---GYVDLDCLVSWLLFSGNTA------ANLDELCR-K--TMYNCISLSDYPEAQD---YLQGVEIVSQSYRELL----PQHISNPC-----TLLVLDPPYVCTQQGNY--RKA-A--YFGMVEFLRLMAMV-R-PP--FIFFSSTRSELPAYLDLVAELR-LP-GWERFA---G-----SQTLTVSS------T-INRNSGYDDHLIYKF-------------
      HD_RS00615_Haemophilus_ducreyi_499246665                                          FK-Q-APLPFTGQKRMFLNHFKAVL-NE-HI-V------GDGE-GWTIVDVFGGSGLLSHTAKQLKPAARVIYNDFDGYAERLKHIDDINRLRGQIHALL-R-DV--P--SQKRIT-DKALKTKIIATINA-FD---GYKDLASLSSWLLFSGQQV------ATFDDLFK-K--DFWCCIRQSDYPRAEG---YLDGIEVTSESFHTLL----PQFIADKK-----TLFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMQTDK-VH-NWQAFD---G-----AQRIAIKT------N-LNYHGEYEDNLVYKF-------------
      HMPREF9021_RS06670_Simonsiella_muelleri_750347144                                 HT-Q-APLPFTGQKRRFLNHFKTLL-KQ-QI-P------NDGD-GWTIVDAFGGSGLLAHTAKQVLPKARVIYNDFDGYTERLQNINDTNQLRKIIFDLT-R--D--Y--PRNQKL-PETLKKTIQTTLQT-FG---GYIDLDCVASWILFSSRQP------TDLHDLIYNH--TYFNNVRLSDYPSADD---YLDGVEVVSKSYHELL----PEFQDNSN-----ALLVLDPPYVSTAQGAY--RKA-G--YFGMVEFLRLMRLV-R-PP--FVFFSSTRSELLDYLDLIVNEK-AE-GWQNLQ---D-----YQKISVHI------T-MNKTAHYEDNMIYKF-----QAA-----
      L278_RS122210_Mannheimia_haemolytica_544865770                                    FK-Q-APLPFSGQKRMFLSHFKRVL-ND-NI-K------DDGK-GWTIIDVFGGSGLLSHTAKYEKSLAKVIYNDFDNYTERLEHIKDTNQLRQEIYRIV-D-RI--I--PKNKRI-SNEVKAKIINKIND-FE---GFKDLKCLSSWLLFSGEQV------ATLEELFK-H--DFWNCVRQSDYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQA-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVEGK-KD-NWKSFE---G-----AKRIVVNA------S-ASYNGQYEDNMVYKF-------------
      D650_21760_Mannheimia_haemolytica_USDA-ARS-USMARC-183_472258915                   FK-Q-APLPFSGQKRMFLSHFKQVL-NA-NI-E------ADGK-DWTIIDVFGGSGLLAHTAKREKPLARVIYNDFDNYAERLNHIKETNQLRQEIYQIV-D-EI--I--PKNKRI-SNEIKAKIINKIND-FE---GFKDLKCLSSWLLFSGEQV------ATLDELFK-H--DFWHCVRQSDYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQE-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVQNR-KE-NWQAFE---G-----AERIVVNA------S-ASYNGKYEDNMVYKF-------------
      BN741_01478_Prevotella_stercorea_CAG:629_548211070                                FS-S-SPLPFRGSKRYYVRRFREVL-AQ----T------QDID---TVVDLFGGSGLLSRVAKDTLPNCRVIYNDFDHYDTRLANAANTNALLRSIAPLL-V--N--V--PDNKKV-PTETKIKILELCAE-EEKR-HAVDYITLSGSLLFSGNWA------QSYEELSK-Q--TMYNRMVKTDY-NVAN---YLSGLEVTHCDYRELF----NAHKANKK-----ALFLLDPPYLQTEHSAYKADT-----YWQLKDYLDVLTLL-D-DTK-YVLFTSGKSQIIELCDWINQS--FG--GKLLK---D-----AQKYVQNS------R-INDFAAYKDIMIAK--------------
      HMPREF9148_RS11290_Prevotella_sp_F0091_545434898                                  YF-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVIYNDFDGYQKRLTMLSETNALLSELRKIV----D--A--PRYKAI-LGVQREKVLECVRK-YERIYGCVDYITLSSSILFSMKYV------TTYADLEK-E--TLYNNIKSTDYPPCDD---YLDGLTITSCDYKEVF----EKYKDVPN-----VVFLVDPPYLSTDSTTY--KM-----YWKLSDYLDVLTIL-A-GHR-FIYFTSNKSSIVELCEWIGKN--KL-IGNPFE---S-----CQRKEFNA------R-MNYNSSYTDIMLYTD-------V-----
      P150_RS0104410_Prevotella_sp_HUN102_655515586                                     YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDCT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLAELRAIM----D--V--PRHKAI-MGEQRKQVLSCIRR-HEREYGYVDYITLSSSILFSMKYA------TEYADLEK-E--TLYNKIKGVDYPPCDD---YLDGLTITSCDYKEVF----ERYKDVAN-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIIELCDWMGRH--PN-LGDPFR---N-----CHRREFNA------H-MNYSSSYTDIMLYTD-------A-----
      WEEVI_RS00470_Weeksella_virosa_503362551                                          YT-Q-SPLPFQGQKRKFINHVKQVL-AN----S------SEDA---TYVDLFGGSGILSHTVKQLKPNAKVVYNDYDDFSKRLAAVAQTNVLLDKIRAIT-K--E--L--PKDKLI-PEVHKLKLLELIKD-EECRLGYVDYITLSSSLLFSAKYV------TNYNDLTK-Q--TFYNNVRQSNY-VTDD---YLAGVEIVHQDYKELF----DQYKDLDN-----VVFLVDPPYLSTDTSTY--TSD---KYWKLKDYLNVLDVL-V-GTN-YLYFTSNKSQIVELCQWMEDRTSMA-DVNPFS---G-----STTVCINT------T-LNHSAKYTDMMLYKL-------K-----
      BZARG_RS04055_Bizionia_argentinensis_495910797                                    FN-T-SPLPFQGQKRRFVKQFKEAL-NV----F------SDSA---TYVDLFGGSGLLAHTVKQKYPNAKVIWNDYDNFQNRLESISETNLLLTELRSFL-I--D--L--PRKQRM-EAIDRERVLRVVKA-HETKYGYVDYVTLSGSLLFSAKYA------TNYKEFAN-E--SFYNRIKLSDY-NATG---YLSGVERVQNDYKALF----DSYKS-DT-----TVFLVDPPYLSTDTSSY--NKD---NYWKLRDYLDVLSVL-D-GSK-YFYFTSNKSQIVELCEWIETR--TM-TGNPFQ---G-----STMTTTAG------T-INHTASYTDIMLFK--------------
      P150_RS0110495_Prevotella_sp_HUN102_655516580                                     YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------LDGT---TFVDLFGGSGLLSHIAKYQKPNSTVVYNDFDGYRKRLEVLPQTNALLAELRTIV----D--V--PRHKAI-MGEQREQVLSCIRR-HEREHGYVDYITLSSSILFSMKYA------IEYADLEK-E--TLYNNIKGVDYPPCND---YLDGLIITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-S-SHH-FIYFTSNKSSIIELCDWMGRH--PN-LGDPLR---N-----CHRREFNA------H-MNYSSSYTDIMFYTD-------A-----
      HMPREF1651_RS08825_Prevotella_bivia_739005860                                     YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLDALPQTNALLAELRAIV----D--V--PRHKPI-MGEQREQVLSCIRR-HERVHGCVDYITLSSSILFSMKYA------TEYAELEK-E--TLYNNIKGVDYPPCED---YLDGLTITSCDYKEVF----ERYKNVPG-----VVFLVDPPYLNTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCDWMGRH--PN-FGDPFK---N-----CHRREFNA------H-MNYNSSYTDIMLYTD-------V-----
      P150_RS0109795_Prevotella_sp_HUN102_655516468                                     YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLAELRAIV----D--V--PRHKAI-MGEQREQVLSCIRR-HERERGYVDYITLSSSILFSMKYA------TEYADLEK-E--TLYNNIKGVDYPPCDD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIIELCDWMGRH--PN-LGDPFR---N-----CHRREFNA------H-MNYSSSYTDIMLYTD-------A-----
      ORNRH_RS05640_Ornithobacterium_rhinotracheale_504603820                           FK-S-APLPFQGQKRNFVKHFKEAL-KG----F------PSNA---IYVDLFGGSGLLSHTVKCVHPEAKVVYNDFDNFQKRLKAIPETNKILEELRALN-L--K--T--PRGKII-QGEEKEKVLEVLKR-ADKR-GFVDWITLSGSLKFSMNYG------LKLEDFTN-D--TLYNTIRKTNFDEASD---YLAGIEVVSEDYRHLF----QKYKDLDN-----VVFLADPPYLSTDTATY--AND---KYWKLTDYLEVLETL-Q-GSN-FFYFTSNKSQVVELCQWLGTRT-NE-SLNPFK---D-----ATCTAMTN------C-PTHKTSYQDIMYHYKK------------
      D468_RS0112575_Prevotella_oris_648594256                                          YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKYQKPNSTVIYNDFDGYRKRLEHIPQTNALLSRLRSIL-C--D--Y--PRKKAI-AGAMRRSVLSCIHE-YEHTFGYVDYITLSGSLLFSMKYA------TCYEELSK-E--TLYNRIKATDYPLADT---YLDGLTVTSCDYRQLF----EQYKNIPD-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDILTIL-A-GHR-FIYFTSNKSSITELCEWIGKN--KL-IGNPFE---N-----CHRREFKA------H-MNYNASYTDIMLYKN-------T-----
      K334_RS0105170_Prevotella_baroniae_647603997                                      YL-S-APLPFQGQKRMFAKEYIKAL-RQ----F------PDDT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRRRLEALSQTNALLAELRAIV----D--V--PRHKAI-VGAQRERVLSCIRK-HEQDYGYVDYITLSSSILFSMKYA------TEYAGLEK-E--TLYNNIKGVDYPPCED---YLDGLTITSCDYKKVF----ERYKNVPG-----VVFLVDPPYLSTDIKTY--RM-----CWKLSDYLDVLTIL-A-GHR-FIYFTSNKSSIIELCEWIGKN--KL-IGNPFE---N-----CRCQEFNA------H-MNHNASYTDIMLYTD-------V-----
      HMPREF0654_RS11780_Prevotella_disiens_739003412                                   YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKHQKPNSTVVYNDFDGYRHRLERIAQTNELLEELRAIV----D--V--PRSKPI-LGEVRKRVLDCIRK-HEQKYGCVDYITLSTSLLFSMKYA------TCFAEMEK-E--ILYNRIKSTNYLLCTD---YLDGLTITSYDYKEVF----EKYKDVPN-----VVFLVDPPYLSTDIKTY--RM-----NWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCDWMGRH--PN-LGDPFK---N-----CHRREFNA------H-INYNSSYTDIMLYTD-------V-----
      PREBIDRAFT_RS00610_Prevotella_bivia_490468432                                     YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLDALPQTNALLAELRAIV----D--V--PRHKPI-MGEQREQVLSCIRR-HERVHGCVDYITLSSSILFSMKYA------TEYAELEK-E--TLYNNIKGVDYPPCED---YLDGLTITSCDYKEVF----ERYKNVPG-----VVFLVDPPYLNTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCDWMGRH--PN-FGDPFK---N-----CHRREFNA------H-MNYNSSYTDIMLYTD-------V-----
      HMPREF0665_RS09490_Prevotella_oris_490512514                                      YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKYQKPNSTVIYNDFDGYRKRLEHIPQTNALLSRLRSIL-C--D--Y--PRKKAI-AGAMRRSVLSCIHE-YEHTFGYVDYITLSGSLLFSMKYA------TCYEELSK-E--TLYNRIKATDYPLADT---YLDGLTVTSCDYRQLF----EQYKNIPD-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDILTIL-A-DHR-FIYFTSNKSSITELCEWIGKN--KL-IGNPFE---N-----CHRREFKA------H-MNYNASYTDIMLYKN-------T-----
      B739_RS09680_Riemerella_anatipestifer_504751701                                   WK-S-APLPFQGQKRGFIRHFSQAV-KE----Y------PNNA---IYIDLFGGSGLLSHTVKCVHPNAKVIYNDYDNFRKRLEAIPKTNQILDELRALN-L--Q--T--PRGKKI-EGAEREAVFKILKK-ADER-GFVDWISLSSSLKFSMNYG------TKLKDFTE-D--TLYNSVRRSNYDPADD---YLEGIEVVSEDYKKLF----DIYRGKNN-----VVFLVDPPYLSTDTSTY--NKE---SYWKLSDYLEVLETL-Q-GSN-YFYFTSNKSQIVELCQWLETRT-SN-NSNPFK---G-----ATRTAVNN------K-TTHNTGYTDLMYHLKKN-----------
      M949_RS00775_Riemerella_anatipestifer_740907932                                   WK-S-APLPFQGQKRGFIRHFSQAV-KE----Y------PNNA---IYIDLFGGSGLLSHTVKCVHPNAKVIYNDYDNFRKRLEAIPKTNQILDELRALN-L--Q--T--PRGKKI-EGAEREAVFKILKK-ADER-GFVDWISLSSSLKFSMNYG------TKLKDFTE-D--TLYNSVRRSNYDPADD---YLEGIEVVSEDYKKLF----DIYRGKNN-----VVFLVDPPYLSTDTSTY--NKE---SYWKLSDYLEVLETL-Q-GSN-YFYFTSNKSQIVELCQWLETRT-SN-NSNPFK---G-----ATRTAVNN------K-TTHNTGYTDLMYHLKKIKIELNALL---
      CCYN49044_RS09840_Capnocytophaga_cynodegmi_517090945                              YK-Q-APLPFQGQKRRFLKEFEKAL-QE----Y------PSKG---FYVDLFGGSGLLSHTVKRLYPNATVIYNDFDDYHKRLEAIPQTNAILSELRNLN-L--T--T--PREKRI-AGLEREAVLAVLKR-ADES-GFVDWITISSSLKFSMNYG------FSYEDFEG-D--TLYNCVHTSNYELATD---YLQGIEIVKLDYKTLF----KQYKDVPN-----VVFLIDPPYLSTDTATY--NSK---DYWRLRDYLDVLDCL-H-GQS-FFYFTSNKSQIVELCEWLETRT-SD-NANPFK---G-----ANISTTAN------A-PSHNTKFTDIMYHIKR------------
      LS70_RS01430_Helicobacter_sp_MIT_11-5569_736161659                                FK-A-PPLPFMGNKKNALKLVESLI-KEIRAKY------NEQD-L-IFLDCFGGSGFLSHTFKYHLPNARVIYNDYDDYLDRVKNAKTTEEILGRISALV-T--S-----PKNAKI-TEEKKQKIISILEE-YEQRGQKIDYVSISSFVLFQGNYA------KDLTKLKK-A--QFYYKFGSIKK-ETRG---YLTGVEAVKMDFKAMI----EKYKAEAKISGKIAFLILDPPYLQTNTDVY--NT--E--FYRLPQFLELIDRI-E-KP--FMLFSSLKSDIVDFLAWYDRL-------NP-----K-----LKGRKIRS--------YNLCDVYSSVIPKTDFCFYE--------
      HMPREF9420_RS08510_Prevotella_salivae_494223274                                   YL-S-APLPFQGQKRMFAKEYIKIL-QQ----F------PDNA---TFVDLFGGSGLLSHIAKHQKPNSTIVYNDFDGYRKRLEALPQTNALLAELRAIV----N--V--PRHKPI-LGGTRERVLSCIRR-HECTYGYVDYITLSSSLMFSMKYA------TEFSDFEK-E--TLYNNIKAVDYPSCSD---YLDGLVITSCDYKELF----EKYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTIL-T-GHR-FIYFTSNKSSIIELCKWIGKN--KI-IGNPFE---N-----CHHKEFNA------H-MNYNSSYTDIMLYTD-------A-----
      HMPREF9304_RS12585_Prevotella_timonensis_739058226                                HL-S-APLPFQGQKRMFAREYIKVL-QQ----Y------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLSELRTIV----D--V--PRHKAI-IGTQREQVLSCIRK-HERTHGYVDYITLSSSILFSMKYA------TEYSDLEK-D--TLYNNIKGVDYPPCDD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-A-GHR-FIYFTSNKSSVIELCDWMGNH--PN-LGNPFR---N-----CHRKEFNA------H-MNYSSSYTDIMLYTD-------V-----
      X919_RS0112015_Prevotella_sp_HJM029_655519412                                     YL-S-APLPFQGQKRMFAREFKNVL-KQ----F------PDTA---TFVDLFGGSGLLSHIAKHEKPNATVVYNDFDGYRDRLAHIPQTNKLLAQLRLIL-N--D--Y--SRGKAI-IGEHRQRVLQCIEE-HQVRYGYVDYVTLSSSIMFAMHYK------QSLNEMRR-E--TLYNRIRQTDYPLCND---YLEGLTIVSADYKQIF----HQYKDVPG-----VVFLVDPPYLSTDCKTY--KM-----SWNLADYLDVLHVL-H-GHR-FIYFTFNKSSILELCDWMGKN--RN-LGNPFE---G-----CTKATFNA------H-ANFNATYTDMMLCKN-------D-----
      JCM14966_RS06695_Prevotella_oulorum_640643393                                     YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLAELRAIV----D--V--PRHKAI-MGEQREQVLSCIRW-HESEHGYVDYITLSSSILFSMKYA------MGYADLEK-E--TLYNNIKGVDYPPCDD---YLEGLTITSSDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----FWKLSDYLDVLTIL-A-GHR-FIYFTSNKSSIIELCDWMGRH--PN-LGDPFR---N-----CHRREFNA------H-MNYNSSYTDIMLYTD-------V-----
      CAPSP0001_RS11720_Capnocytophaga_sputigena_488758369                              HT-T-SPLPFQGQKRKFVKHFKEAL-KH----F------PANA---TYIDLFGGSGLLSHTVKTTHPNARVIWNDYDDFAHRLALIPTTNEIIAQLRPIV-A--N--H--PKGTRI--NEVKPTILEVLRQ-YPPE--ALDYITLSANLLFSGKYA------TSLEALAK-D--GFYAKVSQTPY-NADG---YLAGVERRQTDYRKLI----AEFEYTPN-----TVFILDPPYLSTDISSY--RGA---QDWKLKDYLHIVKAL-N-TMPRYIYFGSNKGQLLDLFDFLANE--YN-LPSPFN---N-----TTRVTVST------S-VNYSSSYEDLMIYK--------------
      TMA01S_RS05515_Tenacibaculum_maritimum_639782857                                  YG-S-SPLPFQGQKRNFIKQFKEAL-KT----Y------PEDA---VYVDLFGGSGLLSHTVKQEKPKAKVIYNDYDSFKNRIAAIPKTNNILRKLRELL-S--D--Y--PKSKKI-NGDKRKAVLELLKL-ENNK-GYVDFITISSSILFSMNYV------QTYEELEK-Q--TFYNRIRKSDF-NAEG---YLNNVEFVYGEYKEVF----KQYKNVPN-----VVFLVDPPYLSTDCTTY--K-----NYWKLTDYLDVLKVL-Y-SNN-YFYFTSNKSSVIELCKWIENN--TG-GVNPFN---K-----AKVVYQYN------K-TTHNTGYTDIMLHKC-------Y-TTDT
      HMPREF9141_RS12020_Prevotella_multiformis_494610799                               YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDNT---TFVDLFGGSGLLSHIAKCQKPDSTVVYNDFDGYRLRLEHIPQTNELLAELREIV-Q--G--T--PRYKPI-TGDAREKVFECLQK-YQDRYGYLDFITISSSIMFSMKYR------LNIEEMRK-E--ALYNTIRKTDYPLCSD---YLDGLNIVSADYKQVF----NQYKDKPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLDVPTVL-A-GHS-FVYFTSNKSSILELCDWIGRN--KT-IGNPFE---K-----CTKVEFNA------H-MNYNATYTDMMLYKK-------A-----
      HMPREF9420_2325_Prevotella_salivae_DSM_15606_315663782                            YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDNT---TFVDLFGGSGLLSHIAKCQKPDSTVVYNDFDGYRLRLEHIPQTNELLAELRKIV----D--V--PRHKPI-LGEARERVLSCILR-HERTHGYVDYITLSSSVMFSMKYA------TEFSDFEK-E--TLYNNIKAADYPSCSD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCEWIGKN--KL-IGNPFE---N-----CHRREFNA------H-MNYNASYTDIMLYTD-------V-----
      JCM15124_RS09550_Prevotella_falsenii_640568267                                    YN-S-APLPFQGQKRRFAREFAKVL-RH----F------PDDA---VFVDLFGGSGLLSHITKCQKPNATVVYNDFDGYRQRLAHVAETNELLAQLRVIL-K--D--V--PHHKLV-PAGTKELAIKCIEK-HEARYGYVDYITLSSSLMFSAEYA------TSLNGIAK-E--SMYNRVRKVDYSTGED---YLGGLTVVSEDYKELF----EQYKNVPN-----VVFLADPPYLNTEADSY--KM-----TWRLNDYLDVLLVL-L-KNS-FIFFTSNKSSVVELCEWLARN--GG-MPNPFE---R-----CDKVELDA------L-LNHHASYTDIMYYTT-------L-----
      H526_RS0116665_Aquimarina_latercula_737095939                                     YM-A-APLPFQGQKRNLAKQFKAAL-NK----R------TPPK---VYVDLFGGSGLLSRIAKDVHPQATVVYNDYDNYRQRIQHIPATNRVLNDIRKMV-V--D--L--PAKTRI-PQIIKNKIIERIEQ-ET---GYLDYITLSTYLLFSMNFV------NSLEELKK-Q--TFYNRVRHTEIPEAKD---YLKGLEVVFYDYRELF----ERYNGSDQ-----VVFIVDPPYLSTDCGSY--KN-----YWKLKEHLDVLKVL-K-GGR-YIYFTSNKSNVVELCEWIETN--TG-GVNPFY---N-----AETISRTN------I-VNYQSSYSDIMLID--------------
      ATCC51562_RS05210_Campylobacter_concisus_544657538                                FN-A-APLPFQGQKRNFIKQFRELIKDE----F------RAYRNG-IFIDAFGGSGLLSHNIKQIYPNARVIYNDYDNYSERLANIETTNEILQTIEPIT-K--K--Y--KKNEKV-SEEDREKIIKIIDE-YIKRGYFIDWLTLSSNLLFSAKYA------HNKDEFKK-E-KTFFATSPKMPLYQKNS---YLKGVEIAHKEAMELI----KEFENK-D-----VVLVLDPPYLQTNKAGY--K-----CFWGLRDFLKLIR-L-V-REP-FIFFSSENSDILPYIDDLVEY-----GDEAFK---G-----YSLKQARL------N-NNNEQAKIDYMIYK--------------
      PIN17_RS06195_Prevotella_intermedia_763168088                                     YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRHRLERIAQTNELLEELRAIV----D--V--PRSKPI-LGEIRKRVLDCIRK-HEQKYGYVDYITLSASLLFSMKYA------TCFADLEK-E--TLYSRVKSTNYPLCTD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIMELCEWIGKN--KI-IGNPFE---N-----CHRREFNA------H-MNYNASYTDIMLYTD-------V-----
      HMPREF9420_RS10145_Prevotella_salivae_763205581                                   YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDNT---TFVDLFGGSGLLSHIAKCQKPDSTVVYNDFDGYRLRLEHIPQTNELLAELRKIV----D--V--PRHKPI-LGEARERVLSCILR-HERTHGYVDYITLSSSVMFSMKYA------TEFSDFEK-E--TLYNNIKAADYPSCSD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCEWIGKN--KL-IGNPFE---N-----CHRREFNA------H-MNYNASYTDIMLYTD-------V-----
      LS72_RS05710_Helicobacter_apodemus_736539564                                      FK---PPLPFMGSKIAMLKRVKEAL-HS----MAFTQAIRKDT---IFYDVFGGSGLLSHYIKQLYPQNEVIWNDFDNFKERLDNIEKTENLRFRLHNLC-K--D--Y--STKIKL-PQEIIDKIKQILEQ----E-QYLDLTTLSSYLCFAGNYA------ITKEQLFN-N--IKYHRIPAKPL-NAKG---YLRGVMRVSKDFKQLLEEIPQEEKDKQQ-----AFLILDPPYLQTQKGNY----R-D--FYTLKDFCLLVENI-F-KP--YLFFSSKNSDILPFIDFYKKYN----PV-------------FEDYKIDK--------ASLKIGGEDYMICSS----------TQT
      JCM21142_RS20860_Saccharicrinis_fermentans_763356669                              YT-S-APLPFMGQKRRFLTLFKSAL-NE----F------KTAN---TFIDLFGGSGLLSHTAKSVRPDAQVVYNDYDDYHTRLLHVDKTNRMLEHIRVLV-K--D--C--PPDKKI-PNEIKQKVIDYIQN-EEQK-GFVDYITLSSSLLFSMNYS------KSLKDLEK-Q--TMYNCVRKSNY-NVAG---YLDGLHIVKYDYKELF----NKYKGIDN-----VVFFVDPPYLSTEVGTY--NN-----YWRLADYLDVLNTL-K-DTS-YFYFTSDKSSIIELCDWLQKN--LE-ANNPFD---G-----AIKYEMPV------K-VNHNAGYTDIMLCK--------------
      C506_RS0110745_Alistipes_497282263                                                YT-S-APLPFMGQKKRFIREFRKAL-RE----F------DHAT---VFVDLFGGSGLLSHVTKRERPDARVIYNDFDDYHVRLENIKRTNALLHDIRSIV-G--D--Y--PTAKRL-TPQMRTTILDTVRS-AEKT-GYVDYITLSSSLLFSSKYV------TDYTELQN-A--GLYNNLRASDY-TCEG---YLDGIEVVHADYRELF----NQYKDIPG-----VVFLVDPPYLSTEVGVY--KC-----RWRLSDYLDVLTLL-S-STS-YFYFTSNKSSIIELCEWISEA--KV-NANPFL---N-----AVRKEMGA------Q-LNYNSRYTDIMIYRR-------------
      BN590_01677_Alistipes_sp_CAG:29_547931225                                         YN-S-APLPFMGQKRRFVGEFKKAL-GQ----F------PNAT---VFVDLFGGSGLLSHVAKQERPDVQVVYNDFDDFHLRLQNIPRTNALLADIRNFVGG--G--I--SKNQRL-SEALRKQILDRVAE-EEKT-GFVDYITLSGSLLFSGKYV------TSFDELAK-D--GFYNTVRQTDF-SAEG---YLDGVEIVKQDYHELF----EQYKDVSG-----AVFLVDPPYLSTEVGAY--KC-----YWRLADYLDVLKIL-Q-GKS-YVYFTSNKSQIVELIDWFTRT--QF-NSNPFE---G-----AERREFNV------S-INHNSKYTDIMLYKQ-------A-----
      ATHG_RS03835_Alistipes_timonensis_497946642                                       YN-S-APLPFMGQKRRFVGEFRKAL-GH----F------PGAT---VFVDLFGGSGLLSHVAKQERPDVQVVYNDFDDFHLRLQNIPRTNALLADIRNFVGG--G--I--SRNQRL-SEALQKQILDRVAE-EEKT-GLVDYITLSGSLLFSGKYV------TSFDELAK-D--GFYNTVRQTDF-SAEG---YLDGVEIVKQDYRELF----EQYKDVPG-----AVFLVDPPYLSTEVGPY--KC-----YWRLADYLDVLKVL-Q-GKS-YVYFTSTKSQIVELIDWFTRT--QF-DSNPFE---G-----AERREFNV------T-INHNSKYTDIMLYRR-------I-----
      JCM21142_114604_Saccharicrinis_fermentans_DSM_9555_=_JCM_21142_588488100          YT-S-APLPFMGQKRRFLTLFKSAL-NE----F------KTAN---TFIDLFGGSGLLSHTAKSVRPDAQVVYNDYDDYHTRLLHVDKTNRMLEHIRVLV-K--D--C--PPDKKI-PNEIKQKVIDYIQN-EEQK-GFVDYITLSSSLLFSMNYS------KSLKDLEK-Q--TMYNCVRKSNY-NVAG---YLDGLHIVKYDYKELF----NKYKGIDN-----VVFFVDPPYLSTEVGTY--NN-----YWRLADYLDVLNTL-K-DTS-YFYFTSDKSSIIELCDWLQKN--LE-ANNPFD---G-----AIKYEMPV------K-VNHNAGYTDIMLCKG-------NTSVSG
      SU65_11745_Flavobacterium_psychrophilum_806965486                                 FT-K-APLPFMGQKRNFIKQFKPAL-NK----Y------SESA---TYVDLFGGSGLLSHTVKSIYPGAKVVYNDFDNYKIRLENIGKTNQLIADLRVIL-K--D--S--PKDKII-LGEFRSKVLERVLL-EENS-GYVDYITLSSSILFSMKYA------LSFEALQK-E--TLYNTMRQSEY-TADG---YLDGLEVVSLCYKELF----AKYKDLPN-----VVFLVDPPYLSTESGTY--KS-----FWKLRDYLDVLQVL-D-GTK-YFYFTSKKSSIIELCEWIETK--MP-MSNPFT---G-----ASLETMNA------T-VTYQSSYTDIMLYKY-------E-----
      JCM12083_RS12170_Prevotella_shahii_647521559                                      YQ-Q-APLPFMGQKRKFVKAFRQIL-RG----Y------PDDV---TIVDLFGGSGLLSHVAKREKPNATVVYNDFDNYQWRIATIPRTNALLARIREVT-D--S--L--PRGKVI-RHPHRDRILEIIAE-EEQC-GFVDYITLSPSLLFSMKYA------NNMDELVK-Q--TFYNTVRRNDY-CADG---YLDGLTIVHKDYKALF----AEYRDKPN-----VLFLVDPPYLSTEVGTY--TM-----SWRLADYLDVLTVL-Q-GHD-YVYFTSNKSQIIELCEWIGRS--RI-NRNPFE---C-----AHRVEVNT------T-MNYNSNYTDIMLYRK-------N-----
      HMPREF0670_RS03300_Prevotella_sp_oral_taxon_317_496521463                         YQ-Q-APLPFMGQKRKFVKAFRQIL-KS----Y------PDNV---TIVDLFGGSGLLSHVAKREKPNATVVYNDYDNYHRRIAAIPRTNALLARIREVT-E--S--L--PRGKVI-RQPHRDRILEIIAE-EEQR-GFVDYITLSPSLLFSMKYA------NKMDELVK-Q--TFYNTVRRNDY-CADG---YLDGLTIVHKDYKALF----NEYRDKPN-----VLFLVDPPYLSTEVGTY--TM-----TWKLADYLDVLTIL-Q-GHD-YVYFTSNKSQIIELCEWIGQS--RI-DRNPFE---C-----AHRVEVNT------T-MNYNSSYTDIMLYRK-------N-----
      BN863_RS14255_Formosa_agariphila_740746518                                        YN-T-APLPFMGQKRKFIKSFKDAL-HN----Y------PPDG---IYVDLFGGSGLLAHTAKQHYPNATVVYNDFDNYRKRINAIPETNNLLEKLRMLI-S--E--W--PKDKRI-TGVTRENVLKAIKR-HEDQYNYVDYITLSSSLLFSMKYV------LNYEDLVK-S--TLYNCIRMSDY-KAEG---YLNGLDIVSLDYKVLF----EQYKDSDK-----VVFLIDPPYLQTTSVTY--KN-----YWNLTDYLDVLSVL-E-GHR-YFYFTSNKSSIIELCEWVGNR--TL-TTNPFA---H-----ATKLEVNT------S-VNYNSSYTDIMLFK--------------
      IW16_RS16985_Chryseobacterium_vrystaatense_736743227                              YV-Q-APLPFQGQKRRFLKSFKEAL-KD----F------PEDA---IYVDLFGGSGLLSHTVKQFYPNSEVIYNDFDGYTFRLENVQKTNSLLSDVREIC-S--K--S-IDRKGKL-SNELHSEIIGRISK---EK-GFVDWVTISSSLLFSMNYA------TSFEQLKK-E--TFYNKVRLSDY-CVDG---YLEGVSKVREDYQCLF----AKYQHYPK-----AVFLIDPPYLSTNCSTY--TNP---DYWKLSDYLNVLNTV-D-NTS-YFYFTSNKSQIIELCDWMSKKK-CF-K-NPFS---C-----STTVSINT------S-LTHNAKYDDIMIYRYKNV----------
      HMPREF9715_RS04510_Myroides_odoratimimus_493305395                                FC-A-SPLPFLGQKRKYLKEVKQVL-NH----T------NPRG---TYVDLFGGSGLLSHTIKRHYPDATVIYNDYDGFSDRISNITTTNNLLERIRLLL-V--D--I--DSKTKV-PDTIKQQILQLIKA-DEEANVYVDYITLSSTLLFTMKYE------QTYEGFAK-Q--TLYNRLTKTPY-NADG---YLEGLIIESSDYKALF----EKYKHIPG-----VCFLVDPPYLSTEVSGY--KMN----YWKLKDYLNVLNVL-D-GHK-YLYFTSNKSQIVELCEWVESR--KD-KGNPFN---H-----SRTVSMTN------K--SKNTTYEDILIHN-----------ITP
      ANH9381_RS06760_Aggregatibacter_actinomycetemcomitans_503933737                   YK-Q-APLPFIGQKQQFLTHYTTIL-NQ-HI-Q------DEGK-GWTIIDAFGGSGLLSHTAKQLKPAARVIYNDFDDYVMRLKHIDDINRLRGKIYTLL-D-GV--P--RQKRIT-DHLLKTKIIKVIET-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRKSDYPNAHG---YLDGIEITNESFHTLL----PRFINDER-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMCEDK-VN-NWQAFD---G-----AQRITIKT------N-LNYQGQYEDNLVYKF-------------
      HMPREF0198_0362_Cardiobacterium_hominis_ATCC_15826_258520722                      HS-K-APLPFIGQKRAFLNQFATVL-HQ-II-P------DDGD-GWTILDAFGGSGLLSHAAKHHKPAARVIYNDYDGYAERLRHIPDINRLRRILEDVL-R--H--H--PRGVHL-KTAKRAEVVAAIRA-FD---GYTDLNCLISWLLFSGNQA------SSIEELCG-K--HMYHAVRRSDFPAADG---YLDGLEITRESYTTLL----PQHTANPR-----CLLILDPPYICTMQGAY--KQQ-G--YFGMVEFLRLMLHV-R-PP--FIFFSSTRSELPAYLQLVIGDR-LA-GWERFI---D-----YQTISINT------V-LNSTARYEDNLIYKC-------------
      SCC393_RS02190_Aggregatibacter_actinomycetemcomitans_491717013                    YK-Q-APLPFIGQKQQFLTHYTTIL-NQ-HI-Q------DEGK-GWTIIDAFGGSGLLSHTAKQLKPAARFIYNDFDDYVMRLKHIDDINRLRGKIYTLL-D-GV--P--RQKRIT-DHLLKTKIIKAIET-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRKSDYPNAHG---YLDGIEITNESFHTLL----PRFINDER-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMCEDK-VN-NWQAFD---G-----AQRITIKT------N-LNYQGQYEDNLVYKF-------------
      HMPREF0198_RS01770_Cardiobacterium_hominis_750049800                              HS-K-APLPFIGQKRAFLNQFATVL-HQ-II-P------DDGD-GWTILDAFGGSGLLSHAAKHHKPAARVIYNDYDGYAERLRHIPDINRLRRILEDVL-R--H--H--PRGVHL-KTAKRAEVVAAIRA-FD---GYTDLNCLISWLLFSGNQA------SSIEELCG-K--HMYHAVRRSDFPAADG---YLDGLEITRESYTTLL----PQHTANPR-----CLLILDPPYICTMQGAY--KQQ-G--YFGMVEFLRLMLHV-R-PP--FIFFSSTRSELPAYLQLVIGDR-LA-GWERFI---D-----YQTISINT------V-LNSTARYEDNLIYKC-------------
      HPNK_00382_Haemophilus_parasuis_str_Nagasaki_598907105                            FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-S------GDGE-GWTIVDVFGGSGLLSHTAKQLKPRARVIYNDFDNYAERLQHIPDINQLRQQLAIAL-A--D--C--SKGKRL-DKAKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSAEG---YLDGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVINT------S-TSYSGKYEDNLVYKF-------------
      GGE_RS03480_Haemophilus_haemolyticus_491864737                                    FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKQLKPKAHVIYNDFDGYAERLVHIDDTNALRTQIFAKI-G-NT--T--PKNKRL-PKSLKAEIIQIIDE-FQ---GYKDLNCLASWLLFSGQQV------GSLEELYR-K--DFWHCVRLSDYPSADG---YLDGVEVVHESFHTLL----PKYANDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      HPS42_05865_Haemophilus_parasuis_ST4-2_633956025                                  FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-E------GDGE-GWIIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKDKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSAEG---YLDGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TSYSGKYEDNLVYKF-------------
      HMPREF1128_RS04375_Haemophilus_sputorum_494789040                                 FK-Q-APLPFIGQKRMFLKHFQNIL-NE-HI-K------DDGE-GWIIIDAFGGSGLLSHVAKAIKPKARVIYNDFDGYSERLAHIGDINTLRSQLFTAV-G-SA--V--PKNKRM-PKEVKAKCVKIIQE-FD---GYKYLNCLASWLLFSGQQV------ATTDELFQ-N--DFWNCIRQSDYPKADC---YLDDIEIIRESFHTLL----PKFSGNRK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFIRFIEYMQKDK-VD-NWQSFD---G-----AKRIVVNG------S-ASYSGKYEDNLVYKF-------------
      HPS9_RS04300_Haemophilus_parasuis_737511689                                       FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-S------GDGE-GWTIVDVFGGSGLLSHTAKQLKPRARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKDKRL-DKAKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSADG---YLEGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVINT------S-TSYSGKYEDNLVYKF-------------
      SASC598J21_017980_Snodgrassella_alvi_SCGC_AB-598-J21_662243730                    FK-Q-APLPFIGQKRYFIKSFCEVL-ND-NI-Q------GLGD-EWTIVDVFGGSGLLSHHAKRLKPHARVIYNDFDNYAQRLKHIDDINRLRQQLAEVL-K--G--I--PRKNKI-DPKTHALIIETIKN-FD---GFIDIDCVWAWLLFSGNQA------ESLNQIYTQP--VLYNRLRKSDYPDAKD---YLTGVEVVSKSFDELL----PEYVNNEK-----TLLVLDPPYLFSEQKGY--RKA-K--DFGLASFLQLMELI-R-PP--FIMFSNYRSEILDYFDYQIKRN-D----ERFL---N-----YKYTSISA------P-LNNVGFYRDNMIYKF-------------
      SASC598J21_RS08380_Snodgrassella_alvi_739535589                                   FK-Q-APLPFIGQKRYFIKSFCEVL-ND-NI-Q------GLGD-EWTIVDVFGGSGLLSHHAKRLKPHARVIYNDFDNYAQRLKHIDDINRLRQQLAEVL-K--G--I--PRKNKI-DPKTHALIIETIKN-FD---GFIDIDCVWAWLLFSGNQA------ESLNQIYTQP--VLYNRLRKSDYPDAKD---YLTGVEVVSKSFDELL----PEYVNNEK-----TLLVLDPPYLFSEQKGY--RKA-K--DFGLASFLQLMELI-R-PP--FIMFSNYRSEILDYFDYQIKRN-D----ERFL---N-----YKYTSISA------P-LNNVGFYRDNMIYKF-------------
      hia5_Haemophilus_influenzae_359359006                                             FK-Q-APLPFIGQKRMFLKQFEQIL-NE-NI-S------DNGE-GWTILDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--SKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSNDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      SU55_RS07055_Haemophilus_influenzae_756154060                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTVIDTFGGSGLLSHAAKRLKPKARVIYNDFDGYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      A160_0967_Aggregatibacter_actinomycetemcomitans_serotype_a_str_A160_443550816     YK-Q-APLPFIGQKQQFLTHYTTIL-NQ-HI-Q------DEGK-GWTIIDAFGGSGLLSHTAKQLKPAARFIYNDFDDYVMRLKHIDDINRLRGKIYTLL-D-GV--P--RQKRIT-DHLLKTKIIKAIET-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRKSDYPNAHG---YLDGIEITNESFHTLL----PRFINDER-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKASSCGLLSICVKIK-SI-TGRHLM---V-----HKELRLRQ------T-STTKGNMKIIWFINFRD-----------
      W820_RS02320_Haemophilus_influenzae_748782878                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKRLKPKARIIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--PKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEVIRESFHTLL----PKFADNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFMNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      SU30_RS04070_Haemophilus_influenzae_756154896                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GEGE-GWTIIDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINQLRTELYSVV-G-NA--T--PKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGIEIVKESFHTLL----PKFSDDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF-------------
      SU30_RS01820_Haemophilus_influenzae_756151906                                     FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGK-DWTIIDVFGGSGLLSNTAKRVKPKSRVIYNG---YSERLNHITEINQLREILYQTV-N-GI--I--TKNKLI-SKRLKEEIINKINN-FS---GYKDVNCLSSWLLFSGQQV------NSLDELFK-Q--RFYNCIRKSDYELADG---YLNGLEVINESFHQLL----PKFIDKEK-----VLLILDPPYLCTRQESY--KQA-S--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFIEAMVEDK-WD-NWQVFN---E-----VNRITVNA------S-ASYNGKYEDNLIYKF-------------
      COK_0640_Mannheimia_haemolytica_serotype_A2_str_BOVINE_261312126                  FK-Q-APLPFIGQKRMFLSHFKQVL-NA-NI-E------ADGK-DWTIIDVFGGSGLLAHTAKREKPLARVIYNDFDNYAERLNHIKDTNQLRQEIYQIV-D-GV--I--PKNKRI-SNEVKAKIINKIND-FE---GFKDPKCLSSWLLFSGEQV------ATLDELFK-H--DFWNCVRQSNYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQA-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVQNK-KE-NWQAFE---G-----AERIVVNA------S-PSYNGKYEDNMVFKF-------------
      J450_RS10910_Mannheimia_haemolytica_493294268                                     FK-Q-APLPFIGQKRMFLSHFKQVL-NA-NI-E------ADGK-DWTIIDVFGGSGLLAHTAKREKPLARVIYNDFDNYAERLNHIKDTNQLRQEIYQIV-D-GV--I--PKNKRI-SNEVKAKIINKIND-FE---GFKDPKCLSSWLLFSGEQV------ATLDELFK-H--DFWNCVRQSNYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQA-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVQNK-KE-NWQAFE---G-----AERIVVNA------S-PSYNGKYEDNMVFKF-------------
      K941_RS0100010_Moraxella_caprae_738435815                                         HS-T-APLPFIGQKRQIIQVFRNTL-DR-IV-P------DDGQ-GWTIIDVFGGSGLLAHNAKYLKPKARVIYNDFDNFSQRLYHLCDTNRLRQQLYAIL-E--P--L--PRAKRI-DEPTKQKLLEIIQN-FD---GFVDCHSVSTWLLFSGNQI------SHIDELPK-H--EFYNTIRRSDYPTADG---YLDGLEIVSESFEILM----PKFYHQDK-----TLFILDPPYLRTKQEAY--GLG-E--YFGMIQFLKLMKWV-R-PP--YLFFSSTKSEFLSYLDYVKEYE-PV-MWERLG---G-----FEKLSFTS------Y-VNKSSTYEDNMIFKI-------------
      MS_RS00345_[Mannheimia]_succiniciproducens_499512619                              FK-Q-APLPFIGQKRMFLKHFERLL-ED--I-P------NDGE-GWTIIDAFGGSGLLSHVAKHLKPEATVIYNDFDGYAERLAHIDDINRLRQAIYPLL-A--N--C--AKSKKV-PNDIKTQIIDVIKG-FD---GYINEHILCSWLCFSGQQV------KTLDELFK-E--DFWNCIRKSDYPSADG---YLDGIEVVSESFHTLL----PKYQTDPK-----ALFVLDPPYLCTQQASY--KQE-N--YFDLIDFLRLVHLT-R-PP--YVFFSSSKSEFVRFIEAMIEDK-WD-NWQAFE---N-----YERVIVKT------S-SSYSGKYEDNMVFKF-------------
      ACEE_RS02875_Actinobacillus_equuli_746131177                                      FK-Q-APLPFIGQKRMFLKQVESVL-NQ-HI-D------GDGK-DWIIVDVFGGSGLLSHTAKRVKPNATVIYNDFDGYSDRLKHIDDINALRRIIYNIC-V-DI--I--PKNSRL-SKELKAKIINEINQ-FK---GYKDLNCLATWLLFSGQQI------GSFDELYA-K--EFYNCVRMTDYPQATG---YLDGLEIMSESFHTLI----PKFANKTN-----VLLLLDPPYLCTRQESY--KQK-N--YFDLVDFLRLVNLT-R-PP--YIFFSSTKSEFIRFIDTAIEDK-WN-NWQAFD---E-----YKRIVVHV------S-ASYTGKYEDNMIYKF-------------
      HMPREF1052_RS08385_Pasteurella_bettyae_492143056                                  FK-Q-APLPFIGQKRMFLKHFQNIL-NE-HI-K------DDGE-DWIIIDAFGGSGLLSHVAKAIKPKARVIYNDFDGYSERLAHIGDINTLRSQLFTAV-G-SA--V--PKNKRM-PKEVKAKCVKIIQE-FD---GYKDLNCLASWLLFSGQQV------ATTDELFQ-N--DFWNCIRQSDYPKADC---YLDDIEIIRESFHTLL----PKFSDNRK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFIRFIEYMQKDK-VD-NWQSFE---G-----AKRIVVNG------S-ASYSGKYEDNLVYKF-------------
      HICG_RS06205_Haemophilus_influenzae_696250595                                     FK-Q-APLPFIGQKRMFLKHVEIVL-NK-HI-D------GEGE-GWTIVDVFGGSGLLSHTAKQLKPKATVIYNDFDGYAERLNHIDDINRLRQIIFNCL-H-GI--I--PKNGRL-SKEIKEEIINKIND-FK---GYKDLNCLASWLLFSGQQV------GSVEALFA-K--DFWNCVRQSDYPTAEG---YLDGIEVISESFHKLI----PRYQNQDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRYLNYMQESK-TD-NWRAFE---N-----YKRIVVKA------S-ASKDGIYEDNMIYKF-------------
      WQG_17550_Bibersteinia_trehalosi_USDA-ARS-USMARC-192_469489659                    FK-Q-APLPFIGQKRMFLKHFEQVL--A-HI-P------DDGN-GWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLM-A--D--T--PKYKRL-DNAKKLQIIEAIEA-FQ---GYKDLHILCSWLAFSGQQV------SSFDELYK-Q--NFWHCIRQSDYLTADG---YLDGVEIVRESFHQLV----PRFTGQPN-----TLLVLDPPYLCTHQESY--KQE-R--YFDLVDFLRLIHLT-K-PP--YVFFSSTKSEFVRFIDAMVEDK-WD-NWQAFD---D-----AQRIVVQT------S-ASYNGKYEDNMVYKF-------------
      IO45_RS00140_Gallibacterium_anatis_746088554                                      FA-Q-APLPFIGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKCLKPNARVIYNDFDGYAERLKHIDDINRLRQILSELL-A--N--C--PRDRRL-DIAMRHKVIDAIES-FN---GYKDPHILCAWLLFSGQQV------KSINELYS-R--GFYNCIRQSDYTTADG---YLDGIEVVNESFVTLL----PKFADDSK-----AIFVLDPPYLCTKQASY--KQE-R--YFDLIDFLELIRLT-R-PP--YLFFSSTKSEFIRFVDWLIASK-GD-NWQSFV---D-----YQRIIVQT------S-TSYSGKYEDNLIYKC-------------
      HD_RS06430_Haemophilus_ducreyi_753848096                                          YK-N-APLPFIGQKRQFLTHYTEIL-NQ-YI-S------GDGQ-GWTIIDAFGGSGLLSDVAKRIKPAARVIYNDFDNYAERLLHIDEINELRLKISDTI-G-NT--I--PKNKKL-TPDVKSKVINVIQS-FQ---GYKDLNCLASWLLFSGNQV------GSLEDLFN-K--DFWHCVRQSDYPRADG---YLDGIEIIQESFHQLL----PKFRDEPN-----TLFVLDPPYLCTRQESY--RQA-S--YFDLIGFLRLIHLT-R-PP--YIFFSSSKSEFVRFIDAMVEDK-WD-NWQAFE---N-----YGKISINT------S-ASYSGKYEDNMVFKF-------------
      HI1523_Haemophilus_influenzae_491961424                                           FK-Q-APLPFIGQKRMFLKHVEIVL-NK-HI-D------GEGE-GWTIVDVFGGSGLLSHTAKQLKPKATVIYNDFDGYAERLNHIDDINRLRQIIFNCL-H-GI--I--PKNGRL-SKEIKEEIINKIND-FK---GYKDLNCLASWLLFSGQQV------GSVEALFA-K--DFWNCVRQSDYPTAEG---YLDGIEVISESFHKLI----PRYQNQDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRYLNYMQESK-TD-NWRAFE---N-----YKRIVVKA------S-ASKDGIYEDNMIYKF-------------
      IE01_RS11495_Gallibacterium_anatis_517158409                                      FS-Q-APLPFIGQKRMFLNQFKTVL-NQ-MI-A------NDGE-GWTIVDAFGGSGLLSYAAKQLKPKARVIYNDFDGYAERLKHIDDINRLRQQLSDLL-T--G--C--PRQKRL-DIAMRHKVIDVIES-FN---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSKLL----PKFANDKK-----AIFVLDPPYLCTHQASY--KQE-N--YFDLIHFLELIRLT-R-PP--YIFFSSTKSEFVRFVDWLVATK-GN-NWQSFV---D-----YKRIIVQT------S-TSYSGKYEDNLIYKC-------------
      HD_1581_Haemophilus_ducreyi_35000HP_33148844                                      YK-N-APLPFIGQKRQFLTHYTEIL-NQ-YI-S------GDGQ-GWTIIDAFGGSGLLSDVAKRIKPAARVIYNDFDNYAERLLHIDEINELRLKISDTI-G-NT--I--PKNKKL-TPDVKSKVINVIQS-FQ---GYKDLNCLASWLLFSGNQV------GSLEDLFN-K--DFWHCVRQSDYPRADG---YLDGIEIIQESFHQLL----PKFRDEPN-----TLFVLDPPYLCTRQESY--RQA-S--YFDLIGFLRLIHLT-R-PP--YIFFSSSKSEFVRFIDAMVEDK-WD-NWQAFE---N-----YGKISINT------S-ASYSGKYEDNMVFKF-------------
      HMPREF9095_RS07250_Haemophilus_aegyptius_494053240                                FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVIKPKAHVIYNDFDSYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQTFD---N-----AERIVVNA------S-ASYSGKYEDNMVYKF-------------
      HPS41_RS06910_Haemophilus_parasuis_737547081                                      FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-D------GNGE-GWTIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKGKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSLEELYT-Q--DFWHCLRQSDYPSAES---YLDGVEIVCESFHQLV----SRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TNCRGKYEDNLVYKF-------------
      A3G3_RS0107195_Moraxella_boevrei_518349657                                        YK-S-APLPFIGQKRFFISHFVKLL-KD-KI-P------NDGE-NWTIIDVFGGSGLLAHNAKRLLPKARVIYNDFDGYAQRLQNIADTERLRQKLFDLL-A--N--A--EDERKL-NDQQRKQIIETINQ-FD---GYLDINAIATWILFSGKQA------KTLDELYQ-N--TFYNTVRKTPYKTADG---YLDGLEITHESFETLI----PKFANQPN-----TLLLLDPPYVFTEQTAY--HQA-K--YFGMVEFLTLMSLV-R-PP--YIFFSSTKSELLDYLAYVQKHQ-PH-DWDRLG---G-----FDRLFLQS------Q-VNYHKSYQDNMIWKF-------------
      F543_RS02755_Bibersteinia_trehalosi_644530078                                     FK-Q-APLPFIGQKRMFLKHFEQVL--A-HI-P------DDGN-GWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLM-A--D--T--PKYKRL-DNAKKLQIIEAIEA-FQ---GYKDLHILCSWLAFSGQQV------SSFDELYK-Q--NFWHCIRQSDYLTADG---YLDGVEIVRESFHQLV----PRFTGQPN-----TLLVLDPPYLCTHQESY--KQE-R--YFDLVDFLRLIHLT-K-PP--YVFFSSTKSEFVRFIDAMVEDK-WD-NWQAFD---D-----AQRIVVQT------S-ASYNGKYEDNMVYKF-------------
      SVR5_RS07195_Haemophilus_parasuis_491999424                                       FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-D------GNGE-GWTIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKDKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSAEG---YLDGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TSYSGKYEDNLVYKF-------------
      HPS41_RS09445_Haemophilus_parasuis_737547480                                      FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-S------GDGE-GWTIVDVFGGSGLLSHVTKRLKPKATVIYNDFDGYAERLAHIDDINRLRRLIYPLL-A--A--C--EKQKKV-PNDVKAQIIEVIKN-FD---GYINEHILCSWLCFSGQQV------ATLDELFK-E--DFWHCIRQSDYSSADG---YLDDIEVVSESFYTLL----PKYQNDPK-----ALFVLDPPYLCTHQASY--KQA-T--YFDLVDFLRLIHLT-R-PP--FVFFSSTKSEFVRYVDAMIEDK-WD-NWQAFQ---D-----YERIVVNT------S-TSYSGKYEDNLVYKF-------------
      F543_5700_Bibersteinia_trehalosi_USDA-ARS-USMARC-189_575451422                    FK-Q-APLPFIGQKRMFLKHFEQVL--A-HI-P------DDGN-GWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLM-A--D--T--PKYKRL-DNAKKLQIIEAIEA-FQ---GYKDLHILCSWLAFSGQQV------SSFDELYK-Q--NFWHCIRQSDYLTADG---YLDGVEIVRESFHQLV----PRFTGQPN-----TLLVLDPPYLCTHQESY--KQE-R--YFDLVDFLRLIHLT-K-PP--YVFFSSTKSEFVRFIDAMVEDK-WD-NWQAFD---D-----AQRIVVQT------S-ASYNGKYEDNMVYKF-------------
      BN1226_RS02290_Mannheimia_sp_MG13_764738267                                       FK-Q-APLPFIGQKRMFLQHFERLL-ND-NI-P------NDGD-GWTILDAFGGSGLLSHVAKRLKPKATVIYNDFDGYAERLQHIDDINRLRRQIAPLL-A--E--Q--PKQKRL-SPELKAQIIDVIKA-FD---GYINVHVLCSWLLFSGQQV------KTLDELFT-Q--DFWHCLRQSDYPSADG---YLDGLTVVSESFHTLL----PKYQHDPK-----ALFVLDPPYLCTHQESY--GQQ-R--YFDLIDFLRLIHLT-R-PP--FVFFSSTKSEFVRFIDAMITDQ-WD-NWQSFA---N-----YERIAVKT------S-TSYSGKYEDNMVFKF-------------
      HPS41_07110_Haemophilus_parasuis_ST4-1_633953678                                  FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-D------GNGE-GWTIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKGKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSLEELYT-Q--DFWHCLRQSDYPSAES---YLDGVEIVCESFHQLV----SRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TNCRGKYEDNLVYKF-------------
      HICON_RS02315_Haemophilus_influenzae_503292691                                    FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVIKPKAHVIYNDFDSYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFSDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-L-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQTFD---N-----AERIVVNA------S-ASYSGKYEDNMVYKF-------------
      APPSER11_RS04990_Actinobacillus_pleuropneumoniae_491784781                        YK-N-APLPFIGQKRQFLTHYTEIL-NQ-YI-P------GDGQ-GWTIIDAFGGSGLLSHVAKRIKPAARVIYNDFDNYAERLLHIDEINELRLKISDTI-G-NA--I--PKNKKL-TPDVKSKVINAIQS-FQ---GYKDLNCLASWLLFSGNQV------GSLEDLFN-K--DFWHCVRQSDYPRADG---YLDGVEIIQESFHQLL----PKFRDEPN-----TLFVLDPPYLCTRQESY--RQA-S--YFDLIDFLRLIHLT-R-PP--YIFFSSSKSEFVRFIDAMVEDK-WD-NWQAFE---N-----YGKISINT------S-ASYSGKYEDNMVFKF-------------
      POREN0001_0004_Porphyromonas_endodontalis_ATCC_35406_229317608                    YT-T-APLPFAGQKRRWLKQLEPII-RS----L------PSNT---IFVDVFGGSGLVSRLCKDVHPAARVIYNDYDNYSERLRHIKETEQLRQEIVSIL-A--P--L--KHNSRV-PEEYKALVLRAVQA-HEKRCQYVDWVTLSGWLLFTNNFA------YSPKDLAS-R--GLYAHPSRTALSDAGASERYLCGLEIVSVDYRELL----SAYKDASN-----TILILDPPYLSTECGGY--RGN----SWSCDDYLDLLTLM-P-TNN-YLYFSSTKFDFVPFL----RR-----TSAAW----G-----YQHPFIDAQVLYRSSPFGSGGANPEVLYYK--------------
      L13_RS00005_Neisseria_weaveri_738545947                                           HA-K-APLPFAGQKRNFIKHYLGVL-DK--I-P------GSGS-GWTIVDVFGGSGLLTHVAKRVKPDARVIYNDFDNYAARVKAIPDINRLRRLISGYL-A--G--Y--VKKQRI-PDDVKQVIIGEIER-FD---GYKCHVVLASWFLFSGRQA------ANLERFYR-S--EWYFNLPLSDYPVADD---YLDGLEITRQSYETLI----PQFSDDPQ-----ALLVLDPPYLSTTQAAY--AQD-G--RFGLVDYLKLVNLV-R-PP--YLFFSSTRSEFIDYIDAVVSMQ-LD-NWHVFD---H-----STRLTVQA------K-VSKYASYEDNLVYKL-------------
      NM70021_RS109520_Neisseria_meningitidis_488180979                                 HS-T-APLPF-------IKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQARVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITQQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D---------
      CUP_RS09075_Campylobacter_upsaliensis_490401642                                   YK-T-PPLAFNGNKKNMLKLYREAL-ED----MKCY--VNKDT---IFYDVFGGSGLLAHETKRQFQNNKVIWNDFDNFQKRLNMLDKTEALRLKIVNII-K--DKRF--QKEERI-KRIERQKIEKLLKE----E-GEFDYIQLSSWLRFGGSYAKDCEDFFRAKEFYN-K--IAYDKV----L-DKKD---YLKGVIRVQKDYKELL----KEAKERGN-----FFFILDPPYIQTDKAHY----E-G--FFGLCEFLELIISI-E-MP--FIFFSSAKSEILNFMDFCKEPK----NLQNLQ---NLQNLHFKRVNLNK--------CIKNKDNSDFIFYKQ----------R--
      HMPREF1052_RS06075_Pasteurella_bettyae_492141771                                  FK-Q-APIPFIGQKRMFLQHFERLL-ND-NI-P------NDGD-GWTIVDAFGGSGLLSHIAKRLKPKATVIYNDFDGYAERLQHIDDINRLRRQIAPLL-ALAE--Q--PKQKRL-SPELKAQIIDVIKA-FD---GYINVHVLCSWLLFSGQQV------KTLDELFT-Q--DFWHCLRQSDYPSADG---YLDGLTVVSESFHTLL----PKYQHDPK-----ALFVLDPPYLCTHQESY--GQQ-R--YFDLIDFLRLIHLT-R-PP--FVFFSSTKSEFVRFIDAMITDQ-WD-KWQSFA---N-----YERIAVKT------S-TSYSGKYEDNMVYKF-------------
      consensus/100%                                                                    .......hsF..........h...l......................hhD.FuG..hls...c.....s.hlhNs...a..bh..h...p.hb..h............................h...h...............l.s.h.a................h........a................Y..sh......h..h...................hhhhsssah.s....Y..........a.h...h..............hhFs..p.............................................................................
      consensus/95%                                                                     a..p.sPLPF.GpKp.hhp.h..hL..p...................hlD.FGGSGLLuc.sK...s.u.llaNDaDsa..Rl..l.p.N.lb..l...................h.....+..hb..l........sa.D...lss.lhFu.p.........ph..h.......ha..hp..sh..s.s...YLpsl.h.p.sap.lh.....pa..........shhllDPPYh.T...sY..........a.h.paLplh..........ahhFoSs+S.h..hh.hh..........p.h...............h...........s....hpD.hhh...............
      consensus/90%                                                                     a..p.APLPF.GQKR.Fhp.h.phL..p...........sp......hlDhFGGSGLLuH.sKp.bP.upVlYNDaDsY..Rl..I.p.N.Lb..l...............p.p.l.....+..hh..lp.......Ga.D..slss.lhFS.pb........sh.ph.......happlp.ssa..sps...YLpGlplsp.sap.lh.....pa.s........slhllDPPYhsTp..sY..........a.h.paLplh......s...alaFoSs+S.hhchhphh.p........p.a..........h.b..hps......p..s.ps.YpD.hlap..............
      consensus/85%                                                                     a..p.APLPF.GQKRbFhppa.plL.pp...........sssp....hlDhFGGSGLLuH.sKp.+PpupVlYNDaDsY.pRL.pIsp.N.Lb..l..hh..........s+pp.l.s...+..hhp.lp...p...Ga.Dh.slso.LLFS.pb........shpph.p.p..shappl+.ssY..sps...YL-Glplsp.sapplh.....pa.s........slhllDPPYlsTp..sY..p.......aph.-aLclhph..p.s...alaFoSs+Sphlchhphh.p........p.F..........h.b..hps......p.hs.ps.YpD.hlap..............
      consensus/80%                                                                     a..p.APLPF.GQKRbFhppabplL.pp...........sssp...shlDhFGGSGLLSH.sKp.+PpApVlYNDaDsY.pRL.pIsp.N.Lb.pl..hl..........P+pp.l.s...+..lhp.Ip...p...Ga.Dh.slso.LLFS.pbh.......shp-hbp.p..shappl+bsDY..sps...YL-GlplsppsacpLh.....pa.s........slhllDPPYlsTc..sY..pb.....bapL.DaLcllpl..p.s...alaFoSsKSphlchhcah.pp.......psF....s.....hpc..hps......p.hsbpspYpD.hlap..............
      consensus/75%                                                                     a..p.APLPFhGQKRbFhppabplL.pp...........s-sp...shlDhFGGSGLLSH.sKpb+PpApVIYNDaDsY.pRL.pIsp.NpLb.pl..hl..........P+pcbl.s.p.+.bllp.Ipp.bp...Ga.Dh.sluo.LLFS.pbh.......shp-lbp.p..shaspl+bsDY.pscs...YL-Glplsppsa+pLh....spa.s.sp.....slhllDPPYlsTc..oY..pb.....aacL.DaLcllpl..p.s...alaFoSsKSphlchhcah.pp.......psFp...s.....hp+..hps......p.hsapupYpD.hlY+..............
      consensus/70%                                                                     ap.p.APLPFhGQKRbFlpcFbplL.pp...........s-up...shlDlFGGSGLLSH.sKpb+PpApVIYNDFDsY.cRLppIsc.NpLb.plbpll....s..h..P+p+bl.s.pb+.bllp.Ipp.bc...GY.Dh.sLuS.LLFSupbh.......shc-Lbp.p..shaNslRboDYspscs...YLDGl-llppsa+pLh....scapspsp.....slFllDPPYLsTcp.oY..+b.....YacL.DaLcllpl..c.s...alaFoSsKSphlchhcah.cp.......psFp...s.....hp+h.lss......p.hsasupYpD.hlYK..............
      
      Back to Contents
    • General notes, phyletic distribution and domain architectures of the Group3/Trichomonas-like N6-MTases

      General notes:

      The Trichomonas N6-MTases are essentially identical and are fused to phage-tail fiber proteins. They appear to be part of a mobile system. Trichomonas possesses several paralogous N6A methylases that are often fused to a domain found in phage structural proteins (e.g., gi: 121901620, TVAG_056220). The Trichomonas methylase contains various conserved features such as a GxK motif in helix-1, an extra D after strand-2, an R in the helix after strand-2, D in the HTH-like insert of strand-2 Y- before strand 3, +S after strand-5 and YxD in strand-7. The GxK motif found at the N-terminus of the E coli/T4/DpnII(DpnB) DAM is a highly conserved feature of this group, along with a winged-HTH insert after strand-2. These are important for base flipping as seen in cocrystal structures. The eukaryotic version appears to be derived from phage DAMs that are associated with packaging and likely to be used in defense against host RE. The architectures of the trichomonas proteins with fusions ot the phage-tail fiber suggests that this system was taken entirely from the same phage packaging system. The neighborhoods also contain a MULE transposase and an A32-like packaging ATPase. These might also be just-so associations as the polintons constitute about 30% of the Trichomonas genome.
      GI           Gene neighborhood                                                                                           Architecture                Pfam architecture     Gene name             Len  Taxonomy                                       Species name                                                Genbank description
      # 1; Eukaryotic versions                                                                                                                                                                                                                                        
      123207322    <-N6-MTase*                                                                                                 N6-MTase                    MethyltransfD12       TVAG_557140           280  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              <-123207322_N6-MTase*
      123481438    MULE-transposase->?->?->?-><-?||?-><-N6-MTase*<-?||?-><-?<-?<-?<-A32-like_ATPase                            N6-MTase                    MethyltransfD12       TVAG_344370           280  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              <-123481412_?||123481416_MULE-transposase->123481420_?->123481423_?->123481427_?-><-123481431_?||123481435_?-><-123481438_N6-MTase*<-123481442_?||123481445_?-><-123481449_?<-123481453_?<-123481456_?<-123481460_A32-like_ATPase||123481463_?->
      123421258    <-N6-MTase*                                                                                                 N6-MTase                    MethyltransfD12       TVAG_007390           273  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              <-123421244_?||123421247_?->123421249_?->123421251_?-><-123421254_?||123421256_?-><-123421258_N6-MTase*<-123421261_?||123421265_?->123421267_?-><-123421269_?<-123421272_?<-123421274_?||123421276_?->
      123479010    T4gp10-like-baseplate->?-><-?||?-><-?||?-><-N6-MTase*                                                       N6-MTase                    MethyltransfD12       TVAG_271330           273  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              <-123478996_?||123478998_T4gp10-like-baseplate->123479000_?-><-123479002_?||123479004_?-><-123479006_?||123479008_?-><-123479010_N6-MTase*||123479012_?->123479014_?-><-123479016_?<-123479018_?||123479020_?->123479022_?->123479024_?->
      123471301    <-N6-MTase+Phage-tailfib<-?<-?||?->?-><-?||?->N6-MTase+Phage-tailfib*->?->DUF3839->                         N6-MTase+Phage-tailfib      PTR                   TVAG_056220           566  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              <-123471287_N6-MTase+Phage-tailfib<-123471289_?<-123471291_?||123471293_?->123471295_?-><-123471297_?||123471299_?->123471301_N6-MTase+Phage-tailfib*->123471303_?->123471305_DUF3839->
      123976294    N6-MTase+Phage-tailfib*->DUF3839->?->?-><-A32-like_ATPase<-DUF4108                                          N6-MTase+Phage-tailfib      PTR                   TVAG_051460           527  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              <-123976292_?||123976294_N6-MTase+Phage-tailfib*->123976296_DUF3839->123976298_?->123976300_?-><-123976302_A32-like_ATPase<-123976304_DUF4108<-123976306_?||123976308_?->
      123484516    <-MULE-transposase<-?<-?||?*->DUF3839-><-A32-like_ATPase||?-><-DUF4108                                      -                           PAT1                  TVAG_039120           451  eukaryota>parabasalia                          Trichomonas vaginalis G3                                    hypothetical protein [Trichomonas vaginalis G3].                                                              123484490_?-><-123484493_?||123484497_?-><-123484501_?<-123484505_MULE-transposase<-123484509_?<-123484512_?||123484516_?*->123484521_DUF3839-><-123484525_A32-like_ATPase||123484529_?-><-123484533_DUF4108||123484536_?-><-123484539_?<-123484543_?
      # 218; Prokaryotic homologs                                                                                                                                                                                                                                      
      472258915    <-N6-MTase*<-?<-?<-?<-?<-?<-?<-Tail_P2_I                                                                    N6-MTase                    SP                    D650_21760            323  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica USDA-ARS-USMARC-183                  hypothetical protein D650_21760 [Mannheimia haemolytica USDA-ARS-USMARC-183].                                 <-472258908_?||472258909_?-><-472258910_?<-472258911_?||472258912_?->472258913_?->472258914_?-><-472258915_N6-MTase*<-472258916_?<-472258917_?<-472258918_?<-472258919_?<-472258920_?<-472258921_?<-472258922_Tail_P2_I
      469489659    GP46->Baseplate_J->DUF2313->?->DUF4376->?->N6-MTase*->                                                      N6-MTase                    SP                    WQG_17550             317  bacteria>proteobacteria>gammaproteobacteria    Bibersteinia trehalosi USDA-ARS-USMARC-192                  D12 class N6 adenine-specific DNA methyltransferase [Bibersteinia trehalosi USDA-ARS-USMARC-192].             469489652_?->469489653_GP46->469489654_Baseplate_J->469489655_DUF2313->469489656_?->469489657_DUF4376->469489658_?->469489659_N6-MTase*->469489660_?->469489661_?->469489662_?->469489663_?->469489664_?->469489665_?-><-469489666_?
      345456400    <-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->                      N6-MTase                    MethyltransfD12       BSEG_01570            316  bacteria>bacteroidetes                         Bacteroides dorei 5_1_36/D4                                 D12 class N6 adenine-specific DNA methyltransferase [Bacteroides dorei 5_1_36/D4].                            <-345456398_?||345456399_?-><-229435357_VirD4-FtsK<-229435356_?<-229435355_LPD3+N4ART+N4ART+MPTase+MPTase<-229435354_?<-229435353_?<-345456400_N6-MTase*<-229435351_?||229435350_ParA->229435349_?->229435348_?->229435347_DOC-><-229435346_?<-345456401_?
      598907105    DUF4376->?->N6-MTase*->                                                                                     N6-MTase                    Methyltransf_26       HPNK_00382            315  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis str. Nagasaki                          hypothetical protein HPNK_00382 [Haemophilus parasuis str. Nagasaki].                                         598907103_DUF4376->598907104_?->598907105_N6-MTase*-><-598907106_?<-598907107_?||598907108_?->598907109_?->598907110_?->598907111_?->598907112_?->
      507741308    <-VirD4-FtsK<-?<-?<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->                                                   N6-MTase                    SP+MethyltransfD12    C799_02456            314  bacteria>bacteroidetes                         Bacteroides thetaiotaomicron dnLKV9                         hypothetical protein C799_02456 [Bacteroides thetaiotaomicron dnLKV9].                                        507741301_?->507741302_?-><-507741303_VirD4-FtsK<-507741304_?<-507741305_?<-507741306_?<-507741307_?<-507741308_N6-MTase*<-507741309_?||507741310_ParA->507741311_?->507741312_?->507741313_DOC->507741314_?-><-507741315_?
      335946057    STN+Cna_B_2+Plug->?->?->?-><-?<-N6-MTase*<-?||ParA->                                                        N6-MTase                    MethyltransfD12       HMPREF1018_02174      313  bacteria>bacteroidetes                         Bacteroides sp. 2_1_56FAA                                   hypothetical protein HMPREF1018_02174 [Bacteroides sp. 2_1_56FAA].                                            335946050_?->335946051_?->335946052_STN+Cna_B_2+Plug->335946053_?->335946054_?->335946055_?-><-335946056_?<-335946057_N6-MTase*<-335946058_?||335946059_ParA->335946060_?->335946061_?->335946062_?-><-335946063_?<-335946064_?
      387775820    STN+Cna_B_2+Plug->?->?->?-><-?<-N6-MTase*<-?||?->ParA->                                                     N6-MTase                    MethyltransfD12       HMPREF1055_02982      313  bacteria>bacteroidetes                         Bacteroides fragilis CL07T00C01                             hypothetical protein HMPREF1055_02982 [Bacteroides fragilis CL07T00C01].                                      387775813_?->387775814_?->387775815_STN+Cna_B_2+Plug->387775816_?->387775817_?->387775818_?-><-387775819_?<-387775820_N6-MTase*<-387775821_?||387775822_?->387775823_ParA->387775824_?->387775825_?-><-387775826_?<-387775827_?
      392705106    STN+Cna_B_2+Plug->?->?->?-><-?<-N6-MTase*<-?||?->ParA->                                                     N6-MTase                    MethyltransfD12       HMPREF1079_00192      313  bacteria>bacteroidetes                         Bacteroides fragilis CL05T00C42                             hypothetical protein HMPREF1079_00192 [Bacteroides fragilis CL05T00C42].                                      392705099_?->392705100_?->392705101_STN+Cna_B_2+Plug->392705102_?->392705103_?->392705104_?-><-392705105_?<-392705106_N6-MTase*<-392705107_?||392705108_?->392705109_ParA->392705110_?->392705111_?->392705112_?-><-392705113_?
      575451422    <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46                                                      N6-MTase                    SP                    F543_5700             313  bacteria>proteobacteria>gammaproteobacteria    Bibersteinia trehalosi USDA-ARS-USMARC-189                  D12 class N6 adenine-specific DNA methyltransferase [Bibersteinia trehalosi USDA-ARS-USMARC-189].             575451415_?-><-575451416_?<-575451417_?<-575451418_?<-575451419_?<-575451420_?<-575451421_?<-575451422_N6-MTase*<-575451423_?<-575451424_DUF4376<-575451425_?<-575451426_DUF2313<-575451427_Baseplate_J<-575451428_GP46<-575451429_?
      596213380    <-ParA||?->N6-MTase*-><-?||?->?-><-?<-?<-?<-STN+Cna_B_2+Plug                                                N6-MTase                    MethyltransfD12       M070_4300             313  bacteria>bacteroidetes                         Bacteroides fragilis str. A7 (UDC12-2)                      D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. A7 (UDC12-2)].  596213373_?->596213374_?-><-596213375_?<-596213376_?<-596213377_?<-596213378_ParA||596213379_?->596213380_N6-MTase*-><-596213381_?||596213382_?->596213383_?-><-596213384_?<-596213385_?<-596213386_?<-596213387_STN+Cna_B_2+Plug
      695509259    <-ParA||?->N6-MTase*->?-><-METHYLASE                                                                        N6-MTase                    MethyltransfD12       M117_RS13145          313  bacteria>bacteroidetes                         Bacteroides fragilis                                        DNA methyltransferase [Bacteroides fragilis].                                                                 <-695509272_ParA||695509275_?->695509259_N6-MTase*->695509264_?-><-695509267_METHYLASE
      33148844     tail_3->N6-MTase*->                                                                                         N6-MTase                    -                     HD_1581               308  bacteria>proteobacteria>gammaproteobacteria    Haemophilus ducreyi 35000HP                                 conserved hypothetical protein [Haemophilus ducreyi 35000HP].                                                 33148837_?->33148838_?->33148839_?->33148840_?->33148841_?->33148842_?->33148843_tail_3->33148844_N6-MTase*-><-33148845_?<-33148846_?<-33148847_?<-33148848_?<-33148849_?||33148850_?->33148851_?->
      595910038    <-ParA<-?||?->N6-MTase*-><-?||?-><-?<-?<-?<-STN+Cna_B_2+Plug                                                N6-MTase                    MethyltransfD12       M080_1486             302  bacteria>bacteroidetes                         Bacteroides fragilis str. 3397 T10                          D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. 3397 T10].      595910031_?->595910032_?-><-595910033_?<-595910034_?<-595910035_ParA<-595910036_?||595910037_?->595910038_N6-MTase*-><-595910039_?||595910040_?-><-595910041_?<-595910042_?<-595910043_?<-595910044_STN+Cna_B_2+Plug<-595910045_?
      261312126    N6-MTase*->                                                                                                 N6-MTase                    SP                    COK_0640              301  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica serotype A2 str. BOVINE              hypothetical protein COK_0640 [Mannheimia haemolytica serotype A2 str. BOVINE].                               261312126_N6-MTase*-><-261312127_?||261312128_?->261312129_?->261312130_?->261312131_?->261312132_?->261312133_?->
      588488100    Collar->?->?->?->?->N6-MTase*->                                                                             N6-MTase                    -                     JCM21142_114604       298  bacteria>bacteroidetes                         Saccharicrinis fermentans DSM 9555 = JCM 21142              site-specific DNA methylase [Saccharicrinis fermentans DSM 9555 = JCM 21142].                                 588488094_?->588488095_Collar->588488096_?->588488097_?->588488098_?->588488099_?->588488100_N6-MTase*->
      491961424    GP46->Baseplate_J->DUF2313->?->?->N6-MTase*->                                                               N6-MTase                    -                     HI1523                296  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                16273417_?->16273418_?->16273419_GP46->16273420_Baseplate_J->30995455_DUF2313->16273422_?->16273423_?->491961424_N6-MTase*-><-16273425_?<-16273426_?||30995456_?->16273428_?->16273429_?->16273430_?-><-16273431_?
      633956025    <-N6-MTase*                                                                                                 N6-MTase                    -                     HPS42_05865           292  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis ST4-2                                  hypothetical protein HPS42_05865, partial [Haemophilus parasuis ST4-2].                                       633956024_?-><-633956025_N6-MTase*
      656071953    <-N6-MTase*                                                                                                 N6-MTase                    -                     K941_RS0107980        292  bacteria>proteobacteria>gammaproteobacteria    Moraxella caprae                                            hypothetical protein [Moraxella caprae].                                                                      <-738436646_?<-656071949_?<-656071950_?<-656071951_?<-738436648_?<-738436650_?<-656071952_?<-656071953_N6-MTase*<-656071954_?<-656071955_?<-656071956_?<-656071957_?<-738436652_?||656071958_?->656071959_?->
      736161659    N6-MTase*->                                                                                                 N6-MTase                    -                     LS70_RS01430          291  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter sp. MIT 11-5569                                hypothetical protein [Helicobacter sp. MIT 11-5569].                                                          736161636_?->736161640_?->736161643_?->736161646_?->736161652_?->736164915_?->736161656_?->736161659_N6-MTase*->736161663_?->736161667_?->736161671_?->736161675_?->736161679_?->736161683_?->736161687_?->
      740907932    <-N6-MTase*                                                                                                 N6-MTase                    -                     M949_RS00775          290  bacteria>bacteroidetes                         Riemerella anatipestifer                                    DNA methyltransferase [Riemerella anatipestifer].                                                             504751695_?->491057476_?->504751696_?->504751697_?->504751698_?->504751699_?->504751700_?-><-740907932_N6-MTase*<-504751703_?<-740908888_?<-740907934_?<-504751706_?<-504751707_?<-504751709_?<-504751710_?
      493294268    Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->?->DUF4376->?->N6-MTase*->                                       N6-MTase                    -                     J450_RS10910          289  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica                                      hypothetical protein [Mannheimia haemolytica].                                                                493293624_Phage_Mu_Gp45->525964514_GP46->525964517_Baseplate_J->493293627_DUF2313->525964520_?->525759286_DUF4376->493294352_?->493294268_N6-MTase*->665824783_?->493291229_?-><-493292924_?<-525964527_?<-493292928_?<-493292929_?<-525964533_?
      237868315    <-Phage_sheath_1<-?<-?<-?<-?<-?<-N6-MTase*<-?<-?<-Collar<-Tail_P2_I<-Baseplate_J<-?<-Phage_base_V           N6-MTase                    MethyltransfD12       GCWU000324_01234      288  bacteria>proteobacteria>betaproteobacteria     Kingella oralis ATCC 51147                                  D12 class N6 adenine-specific DNA methyltransferase [Kingella oralis ATCC 51147].                             <-237868308_?<-237868309_Phage_sheath_1<-237868310_?<-237868311_?<-237868312_?<-237868313_?<-237868314_?<-237868315_N6-MTase*<-237868316_?<-237868317_?<-237868318_Collar<-237868319_Tail_P2_I<-237868320_Baseplate_J<-237868321_?<-237868322_Phage_base_V
      323094424    N6-MTase*->                                                                                                 N6-MTase                    -                     HMPREF0663_11914      288  bacteria>bacteroidetes                         Prevotella oralis ATCC 33269                                D12 class N6 adenine-specific DNA methyltransferase [Prevotella oralis ATCC 33269].                           323094417_?->323094418_?->323094419_?->323094420_?->323094421_?-><-323094422_?||323094423_?->323094424_N6-MTase*-><-323094425_?||323094426_?->323094427_?-><-323094428_?<-323094429_?<-323094430_?<-323094431_?
      491876509    N6-MTase*->                                                                                                 N6-MTase                    -                     HMPREF1053_RS00285    288  bacteria>proteobacteria>gammaproteobacteria    Haemophilus haemolyticus                                    hypothetical protein [Haemophilus haemolyticus].                                                              696248627_?->696248628_?->491876467_?->491876208_?->491876315_?->491876522_?->491876174_?->491876509_N6-MTase*-><-491876092_?<-491876286_?<-491876146_?<-491876339_?||491876477_?-><-491876085_?<-491849867_?
      656071893    GPW_gp25->Baseplate_J->Tail_P2_I->?->DUF4376->?->?->N6-MTase*->                                             N6-MTase                    MethyltransfD12       K941_RS0107590        288  bacteria>proteobacteria>gammaproteobacteria    Moraxella caprae                                            hypothetical protein [Moraxella caprae].                                                                      656071886_GPW_gp25->656071887_Baseplate_J->656071888_Tail_P2_I->738436601_?->656071890_DUF4376->656071891_?->656071892_?->656071893_N6-MTase*->656071894_?->656071895_?->738436603_?->656071897_?->656071898_?-><-656071899_?<-656071900_?
      738435815    <-N6-MTase*<-?<-tail_3                                                                                      N6-MTase                    -                     K941_RS0100010        288  bacteria>proteobacteria>gammaproteobacteria    Moraxella caprae                                            hypothetical protein [Moraxella caprae].                                                                      <-738435815_N6-MTase*<-738435817_?<-738435838_tail_3<-738435841_?<-656070712_?<-656070713_?<-656070714_?<-738435843_?
      762905187    <-Phage_integrase||?->?->N6-MTase*->                                                                        N6-MTase                    -                     UMN179_RS08310        288  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 762905530_?->503511886_?-><-503511887_?<-762905184_?<-503511888_Phage_integrase||503511889_?->503511890_?->762905187_N6-MTase*-><-503511892_?<-503511893_?<-503511894_?<-503511895_?<-503511896_?<-503511897_?<-503511898_?
      494836361    Thymidylate_synthase->N6-MTase*->                                                                           N6-MTase                    -                     M137_RS11600          286  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            757767589_?->495949220_?->494836371_?->494836369_?->494836367_?->494836365_Thymidylate_synthase->494836361_N6-MTase*->695408851_?->695399977_?->499301954_?->492242375_?->695399987_?-><-695399991_?||492348593_?->
      496521463    STN+Cna_B_2+Plug->?-><-Thymidylate_synthase<-N6-MTase*                                                      N6-MTase                    Methyltransf_26       HMPREF0670_RS03300    286  bacteria>bacteroidetes                         Prevotella sp. oral taxon 317                               DNA methyltransferase [Prevotella sp. oral taxon 317].                                                        496521457_?-><-763204112_?<-763204113_?||496521459_?->496521460_STN+Cna_B_2+Plug->496521461_?-><-763204334_Thymidylate_synthase<-496521463_N6-MTase*<-496521464_?<-496521465_?<-496521466_?<-496521467_?<-496521468_?<-496521469_?<-763204114_?
      521257429    N6-MTase*->Phage_base_V->?->Baseplate_J->?->?->?->Phage_sheath_1->                                          N6-MTase                    -                     PSYCG_RS09460         286  bacteria>proteobacteria>gammaproteobacteria    Psychrobacter sp. G                                         hypothetical protein [Psychrobacter sp. G].                                                                   521257422_?->521257423_?->521257424_?->521257425_?->521257426_?->521257427_?->521257428_?->521257429_N6-MTase*->754143218_Phage_base_V->521257431_?->521257432_Baseplate_J->754143219_?->521257434_?->521257435_?->521257436_Phage_sheath_1->
      647521559    N6-MTase*->Thymidylate_synthase-><-?<-ABC-ATPase                                                            N6-MTase                    -                     JCM12083_RS12170      286  bacteria>bacteroidetes                         Prevotella shahii                                           DNA methyltransferase [Prevotella shahii].                                                                    647521551_?->647521553_?->647521557_?->647521559_N6-MTase*->647521563_Thymidylate_synthase-><-763202136_?<-647521566_ABC-ATPase<-647521567_?<-647521568_?<-647521569_?||647521571_?->
      649521449    <-STN+Cna_B_2+Plug<-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->    N6-MTase                    MethyltransfD12       M098_0958             286  bacteria>bacteroidetes                         Bacteroides vulgatus str. 3775 SR(B) 19                     D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides vulgatus str. 3775 SR(B) 19]. <-649521442_?<-649521443_STN+Cna_B_2+Plug<-649521444_VirD4-FtsK<-649521445_?<-649521446_LPD3+N4ART+N4ART+MPTase+MPTase<-649521447_?<-649521448_?<-649521449_N6-MTase*<-649521450_?||649521451_ParA->649521452_?->649521453_?->649521454_DOC-><-649521455_?||649521456_?->
      736171011    Phage_base_V->GPW_gp25->Baseplate_J->Tail_P2_I->Collar->Caudo_TAP->?->N6-MTase*->                           N6-MTase                    Methyltransf_26       Q338_RS03200          286  bacteria>proteobacteria>betaproteobacteria     Alysiella crassa                                            hypothetical protein [Alysiella crassa].                                                                      736170990_Phage_base_V->736170994_GPW_gp25->736170998_Baseplate_J->736171002_Tail_P2_I->736171072_Collar->736171076_Caudo_TAP->736171005_?->736171011_N6-MTase*->736171080_?->736171084_?-><-736171015_?<-736171088_?<-736171019_?<-736171022_?<-736171026_?
      495898225    <-N6-MTase*<-Thymidylate_synthase                                                                           N6-MTase                    -                     HMPREF9441_RS15520    285  bacteria>bacteroidetes                         Paraprevotella clara                                        DNA methyltransferase [Paraprevotella clara].                                                                 <-495898213_?||495898214_?-><-748607312_?||495898218_?->495898219_?->495898220_?->495898221_?-><-495898225_N6-MTase*<-495898226_Thymidylate_synthase<-495898227_?<-495898228_?<-748607316_?<-495898231_?<-748607319_?<-495898233_?
      495946269    <-N6-MTase*<-Thymidylate_synthase                                                                           N6-MTase                    -                     BSFG_RS03650          285  bacteria>bacteroidetes                         Bacteroides sp. 4_3_47FAA                                   DNA methyltransferase [Bacteroides sp. 4_3_47FAA].                                                            <-495946260_?<-495946261_?<-495123637_?<-495123633_?<-495946264_?<-696364638_?<-696364632_?<-495946269_N6-MTase*<-495946270_Thymidylate_synthase<-495946271_?<-495946272_?<-495946273_?<-495123608_?<-495946274_?<-495946275_?
      315663782    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    -                     HMPREF9420_2325       284  bacteria>bacteroidetes                         Prevotella salivae DSM 15606                                hypothetical protein HMPREF9420_2325 [Prevotella salivae DSM 15606].                                          315663841_?-><-315663842_?<-315663843_?||315663778_?->315663779_?->315663780_?->315663781_?->315663782_N6-MTase*->315663783_Thymidylate_synthase->315663784_?->315663785_?->315663786_?-><-315663787_?||315663788_?->315663789_?->
      443550816    tail_3->N6-MTase+Phage-tailfib*->                                                                           N6-MTase+Phage-tailfib      SP                    A160_0967             284  bacteria>proteobacteria>gammaproteobacteria    Aggregatibacter actinomycetemcomitans serotype a str. A160  hypothetical protein A160_0967 [Aggregatibacter actinomycetemcomitans serotype a str. A160].                  443550809_?->443550810_?->443550811_?->443550812_?->443550813_?->443550814_?->443550815_tail_3->443550816_N6-MTase+Phage-tailfib*-><-443550817_?<-443550818_?<-443550819_?<-443550820_?<-443550821_?<-443550822_?<-443550823_?
      490455210    Thymidylate_synthase->N6-MTase*->                                                                           N6-MTase                    MethyltransfD12       BA92_RS10770          284  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            492713924_?->753198973_?->490455216_?->490455215_?->490455214_?->490455213_?->494419100_Thymidylate_synthase->490455210_N6-MTase*-><-753198974_?||753196135_?->753196136_?->753196137_?->753196138_?->753198975_?->753199045_?->
      490456001    Thymidylate_synthase->N6-MTase*->                                                                           N6-MTase                    -                     HMPREF1070_RS05245    284  bacteria>bacteroidetes                         Bacteroides ovatus                                          DNA methyltransferase [Bacteroides ovatus].                                                                   490455991_?->490455993_?->490455995_?->696272652_?->696272653_?->490455999_?->696272654_Thymidylate_synthase->490456001_N6-MTase*->696272656_?->490456003_?->490456004_?->490425270_?->490425271_?-><-490425272_?<-490425273_?
      492444862    <-STN+Cna_B_2+Plug||?-><-?<-?<-?||?-><-N6-MTase*<-Thymidylate_synthase                                      N6-MTase                    MethyltransfD12       BN535_00547           284  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            <-491935049_?<-547309720_STN+Cna_B_2+Plug||547309721_?-><-547309722_?<-491933064_?<-547309723_?||547309724_?-><-492444862_N6-MTase*<-492444861_Thymidylate_synthase<-492444860_?<-492444859_?<-492444858_?||547309725_?->547309726_?->
      492741740    STN+Cna_B_2+Plug-><-N6-MTase*<-Thymidylate_synthase                                                         N6-MTase                    MethyltransfD12       M125_RS18320          284  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            496045361_?->492280413_?->695547684_STN+Cna_B_2+Plug-><-492741740_N6-MTase*<-695547708_Thymidylate_synthase<-695547689_?<-492741743_?<-695547696_?<-757749882_?<-492741756_?<-757749883_?
      495041624    <-N6-MTase*<-Thymidylate_synthase                                                                           N6-MTase                    MethyltransfD12       HMPREF1057_RS0113675  284  bacteria>bacteroidetes                         Bacteroides finegoldii                                      DNA methyltransferase [Bacteroides finegoldii].                                                               <-495041624_N6-MTase*<-696271763_Thymidylate_synthase<-495041629_?<-696271765_?||495033235_?->495034690_?-><-490456521_?<-490456520_?
      496037689    Thymidylate_synthase->N6-MTase*->                                                                           N6-MTase                    -                     HMPREF9007_RS09530    284  bacteria>bacteroidetes                         Bacteroides sp. 1_1_14                                      DNA methyltransferase [Bacteroides sp. 1_1_14].                                                               496037681_?->496037682_?->696274161_?->496037684_?->496037687_?->490455213_?->696274162_Thymidylate_synthase->496037689_N6-MTase*->496037690_?->496037691_?->496037692_?->496037693_?->496037694_?->496037695_?->496037696_?->
      512184299    MuF->?->?->?->?->Thymidylate_synthase->N6-MTase*-><-?<-?<-Phage_tail_S                                      N6-MTase                    MethyltransfD12       BN456_01886           284  bacteria>bacteroidetes                         Prevotella sp. CAG:1031                                     hypothetical protein [Prevotella sp. CAG:1031].                                                               512184292_?->512184293_MuF->512184294_?->512184295_?->512184296_?->512184297_?->512184298_Thymidylate_synthase->512184299_N6-MTase*-><-512184300_?<-512184301_?<-512184302_Phage_tail_S<-512184303_?<-512184328_?<-512184329_?<-512184330_?
      545407693    <-Thymidylate_synthase                                                                                      -                           SP                    HMPREF1981_RS13185    284  bacteria>bacteroidetes                         Bacteroides pyogenes                                        D12 class N6 adenine-specific DNA methyltransferase [Bacteroides pyogenes].                                   <-545407686_?<-545407687_?<-545407688_?<-748714848_?<-748714859_?<-545407690_?<-545407691_?<-545407693_?*<-545407694_Thymidylate_synthase<-545407695_?<-545407696_?<-545407697_?<-545407698_?<-545407699_?<-545407700_?
      545595274    <-N6-MTase*<-?<-DUF4376                                                                                     N6-MTase                    -                     AJF4211_RS12815       284  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum                                 Putative uncharacterized protein [Avibacterium paragallinarum].                                               516417631_?->516417630_?->516417629_?->737727144_?-><-737727161_?||545595273_?->545595597_?-><-545595274_N6-MTase*<-737726748_?<-737726759_DUF4376<-545596245_?
      545595679    Baseplate_J->Tail_P2_I->Collar->?->DUF4376->?->?->N6-MTase*->                                               N6-MTase                    -                     AJF4211_RS06820       284  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum                                 Putative uncharacterized protein [Avibacterium paragallinarum].                                               737726877_Baseplate_J->737726879_Tail_P2_I->737691486_Collar->648446893_?->737726882_DUF4376->737726885_?->545595678_?->545595679_N6-MTase*-><-545595597_?<-545595273_?||545595680_?->
      545595880    N6-MTase*->                                                                                                 N6-MTase                    -                     AJF4211_RS08790       284  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum                                 Putative uncharacterized protein [Avibacterium paragallinarum].                                               545595880_N6-MTase*-><-545595597_?<-545595273_?||545595680_?->737691335_?->516418003_?->516418004_?->545595883_?->
      696270804    Thymidylate_synthase->N6-MTase*-><-?||?->?-><-?<-?<-?<-STN+Cna_B_2+Plug                                     N6-MTase                    MethyltransfD12       M082_RS01650          284  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            757774896_?->496037684_?->757774897_?->696270757_?->696270759_?->490455213_?->696270802_Thymidylate_synthase->696270804_N6-MTase*-><-696270761_?||696270763_?->696270765_?-><-696270768_?<-495295222_?<-696270770_?<-696270806_STN+Cna_B_2+Plug
      229455867    <-STN+Cna_B_2+Plug<-?||?-><-N6-MTase*<-Thymidylate_synthase                                                 N6-MTase                    -                     BSBG_02560            283  bacteria>bacteroidetes                         Bacteroides sp. 9_1_42FAA                                   D12 class N6 adenine-specific DNA methyltransferase [Bacteroides sp. 9_1_42FAA].                              229455860_?-><-229455861_?<-229455862_?<-229455863_?<-229455864_STN+Cna_B_2+Plug<-229455865_?||229455866_?-><-229455867_N6-MTase*<-229455868_Thymidylate_synthase<-229455869_?<-229455870_?<-229455871_?||229455872_?-><-229455873_?<-229455874_?
      490512514    N6-MTase*->Thymidylate_synthase->?-><-?<-?<-STN+Cna_B_2+Plug                                                N6-MTase                    -                     HMPREF0665_RS09490    283  bacteria>bacteroidetes                         Prevotella oris                                             DNA methyltransferase [Prevotella oris].                                                                      490512507_?->490512508_?->490512509_?->490512510_?->490512511_?->490512512_?->490512513_?->490512514_N6-MTase*->490512515_Thymidylate_synthase->739008727_?-><-490512516_?<-490512517_?<-748616038_STN+Cna_B_2+Plug||490512519_?->490512520_?->
      648594256    N6-MTase*->Thymidylate_synthase->?->STN+Cna_B_2+Plug->                                                      N6-MTase                    -                     D468_RS0112575        283  bacteria>bacteroidetes                         Prevotella oris                                             DNA methyltransferase [Prevotella oris].                                                                      517750944_?->517750945_?->490512510_?->490512511_?->517750946_?->648594255_?->648594256_N6-MTase*->490512515_Thymidylate_synthase->739008727_?->517750949_STN+Cna_B_2+Plug->647603487_?->647603486_?->647603485_?->647603484_?->
      489886467    <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45                                       N6-MTase                    -                     KKB_RS07455           282  bacteria>proteobacteria>betaproteobacteria     Kingella kingae                                             hypothetical protein [Kingella kingae].                                                                       <-489886453_?<-489886455_?<-489886457_?||489886460_?-><-489886461_?<-489886463_?||489886465_?-><-489886467_N6-MTase*<-696250480_?<-696250481_DUF4376<-696250482_?<-489886475_DUF2313<-489886478_Baseplate_J<-489886480_GP46<-489886482_Phage_Mu_Gp45
      491717013    N6-MTase+Phage-tailfib*->                                                                                   N6-MTase+Phage-tailfib      SP                    SCC393_RS02190        282  bacteria>proteobacteria>gammaproteobacteria    Aggregatibacter actinomycetemcomitans                       hypothetical protein [Aggregatibacter actinomycetemcomitans].                                                 491716979_?-><-491716981_?||491716992_?->491717001_?->491717003_?->757650929_?->757650930_?->491717013_N6-MTase+Phage-tailfib*-><-757650923_?<-491717022_?<-491700764_?<-491717025_?<-491717030_?<-491700760_?<-757650924_?
      491784781    tail_3->N6-MTase*-><-?||?-><-?<-?<-?<-ABC-ATPase                                                            N6-MTase                    -                     APPSER11_RS04990      282  bacteria>proteobacteria>gammaproteobacteria    Actinobacillus pleuropneumoniae                             hypothetical protein [Actinobacillus pleuropneumoniae].                                                       491784768_?->491784770_?->491784772_?->491806154_?->491784776_?->491806156_?->491806158_tail_3->491784781_N6-MTase*-><-763113371_?||491784785_?-><-491784787_?<-491784789_?<-491784791_?<-491784793_ABC-ATPase<-491784795_?
      499301742    <-ParA||?->N6-MTase*-><-?||?-><-?<-?<-?<-STN+Cna_B_2+Plug                                                   N6-MTase                    MethyltransfD12       M080_RS26780          282  bacteria>bacteroidetes                         Bacteroides fragilis                                        DNA methyltransferase [Bacteroides fragilis].                                                                 492279352_?->492279349_?->492279345_?-><-492279342_?<-492279339_?<-492279337_ParA||492279331_?->499301742_N6-MTase*-><-657213291_?||492279325_?-><-492279322_?<-492279320_?<-492291773_?<-492279316_STN+Cna_B_2+Plug<-492291771_?
      503933737    tail_3->N6-MTase+Phage-tailfib*->                                                                           N6-MTase+Phage-tailfib      SP                    ANH9381_RS06760       282  bacteria>proteobacteria>gammaproteobacteria    Aggregatibacter actinomycetemcomitans                       hypothetical protein [Aggregatibacter actinomycetemcomitans].                                                 <-491690510_?<-491690512_?<-491762168_?||491690521_?->696438157_?->491736689_?->491736696_tail_3->503933737_N6-MTase+Phage-tailfib*-><-491755859_?||754504819_?-><-491731099_?<-491684677_?<-491731104_?<-491684680_?||491731108_?->
      504751701    <-N6-MTase*                                                                                                 N6-MTase                    -                     B739_RS09680          282  bacteria>bacteroidetes                         Riemerella anatipestifer                                    DNA methyltransferase [Riemerella anatipestifer].                                                             504751695_?->491057476_?->504751696_?->504751697_?->504751698_?->504751699_?->504751700_?-><-504751701_N6-MTase*<-504751703_?<-740908888_?<-740907934_?<-504751706_?<-504751707_?<-504751709_?<-504751710_?
      517482436    <-N6-MTase*                                                                                                 N6-MTase                    -                     C228_RS0112985        282  bacteria>proteobacteria>gammaproteobacteria    Actinobacillus capsulatus                                   hypothetical protein [Actinobacillus capsulatus].                                                             <-517482436_N6-MTase*<-748200589_?
      596095999    <-ParA||?->?->N6-MTase*->?-><-?||?-><-?||STN+Cna_B_2+Plug->                                                 N6-MTase                    MethyltransfD12       M116_4685             282  bacteria>bacteroidetes                         Bacteroides fragilis str. 3719 A10                          D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. 3719 A10].      <-596095996_ParA||596095997_?->596095998_?->596095999_N6-MTase*->596096000_?-><-596096001_?||596096002_?-><-596096003_?||596096004_STN+Cna_B_2+Plug->596096005_?->596096006_?->
      640568267    <-N6-MTase*                                                                                                 N6-MTase                    -                     JCM15124_RS09550      282  bacteria>bacteroidetes                         Prevotella falsenii                                         DNA methyltransferase [Prevotella falsenii].                                                                  <-640568267_N6-MTase*<-640568268_?<-640568269_?<-640568270_?<-640568271_?<-640568272_?<-739003492_?<-640568274_?
      696373063    <-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->                      N6-MTase                    MethyltransfD12       BSEG_RS20295          282  bacteria>bacteroidetes                         Bacteroides dorei                                           DNA methyltransferase [Bacteroides dorei].                                                                    <-696372985_?||495114778_?-><-495114781_VirD4-FtsK<-495114782_?<-495114784_LPD3+N4ART+N4ART+MPTase+MPTase<-495114785_?<-495114786_?<-696373063_N6-MTase*<-495114788_?||495114789_ParA->495114791_?->495114793_?->696372991_DOC-><-696372992_?<-495114800_?
      696374681    <-STN+Cna_B_2+Plug<-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->    N6-MTase                    MethyltransfD12       M098_RS09095          282  bacteria>bacteroidetes                         Bacteroides vulgatus                                        DNA methyltransferase [Bacteroides vulgatus].                                                                 <-696374562_?<-696374565_STN+Cna_B_2+Plug<-696374568_VirD4-FtsK<-495946037_?<-696374569_LPD3+N4ART+N4ART+MPTase+MPTase<-696374677_?<-696374574_?<-696374681_N6-MTase*<-495946968_?||696374577_ParA->696374581_?->696374583_?->696374585_DOC-><-696374588_?||696374592_?->
      737547081    N6-MTase*->                                                                                                 N6-MTase                    -                     HPS41_RS06910         282  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis                                        hypothetical protein [Haemophilus parasuis].                                                                  737547081_N6-MTase*-><-737547084_?||737547091_?->737547093_?->737547086_?->491993831_?->737547088_?->491993827_?->
      738993231    <-N6-MTase*                                                                                                 N6-MTase                    -                     HMPREF1475_RS02705    282  bacteria>bacteroidetes                         Prevotella oralis                                           DNA methyltransferase [Prevotella oralis].                                                                    <-490504035_?<-490504034_?||490504033_?->490504032_?->490504030_?-><-490504029_?<-490504028_?<-738993231_N6-MTase*<-514976994_?||490504024_?-><-490504023_?<-490503678_?<-490504022_?<-490504021_?<-490504020_?
      746108169    <-N6-MTase*<-?<-?||Phage_integrase->                                                                        N6-MTase                    -                     JP36_RS09335          282  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium genomosp. 1                                  hypothetical protein [Gallibacterium genomosp. 1].                                                            746108158_?->746108161_?->746108163_?->746068593_?->746108165_?->746108167_?->746108192_?-><-746108169_N6-MTase*<-746108171_?<-746108173_?||746108176_Phage_integrase->746108178_?->
      753848096    tail_3->N6-MTase*->                                                                                         N6-MTase                    -                     HD_RS06430            282  bacteria>proteobacteria>gammaproteobacteria    Haemophilus ducreyi                                         hypothetical protein [Haemophilus ducreyi].                                                                   499246981_?-><-499246982_?||499246983_?->499246984_?->499246990_?->499246991_?->499247853_tail_3->753848096_N6-MTase*-><-499247855_?<-499247856_?<-499247857_?<-499247859_?||499247860_?->499247861_?->499247862_?->
      229317608    <-N6-MTase*                                                                                                 N6-MTase                    MethyltransfD12       POREN0001_0004        281  bacteria>bacteroidetes                         Porphyromonas endodontalis ATCC 35406                       D12 class N6 adenine-specific DNA methyltransferase [Porphyromonas endodontalis ATCC 35406].                  229317606_?->229317610_?->229317612_?-><-229317608_N6-MTase*<-229317617_?<-229317615_?||229317607_?-><-229317609_?<-229317611_?<-229317616_?<-229317614_?
      359359006    N6-MTase*->                                                                                                 N6-MTase                    -                     hia5                  281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      Hia5 [Haemophilus influenzae].                                                                                359359005_?->359359006_N6-MTase*->
      491864737    GPW_gp25->Baseplate_J->Tail_P2_I->Collar->?->?->?->N6-MTase*-><-?||?->?->?->?->?->Collar->                  N6-MTase                    -                     GGE_RS03480           281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus haemolyticus                                    hypothetical protein [Haemophilus haemolyticus].                                                              491864725_GPW_gp25->491864727_Baseplate_J->491864729_Tail_P2_I->763375215_Collar->491864732_?->491864734_?->491864735_?->491864737_N6-MTase*-><-491864739_?||491864742_?->763375197_?->491864746_?->763375217_?->491864749_?->763375218_Collar->
      491953443    <-N6-MTase*<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J<-GP46                                                    N6-MTase                    -                     HMPREF9095_RS06800    281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus                                                 MULTISPECIES: hypothetical protein [Haemophilus].                                                             494053311_?->491916408_?->494053312_?-><-494053313_?<-696248082_?||491910609_?->491864739_?-><-491953443_N6-MTase*<-491866182_?<-494053317_?<-494053318_?<-494053319_Collar<-696248122_DUF2313<-494053321_Baseplate_J<-494053322_GP46
      492125251    tail_3->?->N6-MTase*-><-?||?->?->ABC-ATPase->                                                               N6-MTase                    SP                    PMCN03_RS01910        281  bacteria>proteobacteria>gammaproteobacteria    Pasteurella multocida                                       hypothetical protein [Pasteurella multocida].                                                                 492125273_?->504481140_?->504481139_?->492125264_?->504481138_?->643672669_tail_3->492125255_?->492125251_N6-MTase*-><-514165557_?||512748675_?->504091885_?->492023654_ABC-ATPase->492113843_?-><-643672685_?<-492023662_?
      492143056    <-N6-MTase*<-?<-?<-?<-Collar<-Baseplate_J<-GPW_gp25<-Phage_base_V                                           N6-MTase                    -                     HMPREF1052_RS08385    281  bacteria>proteobacteria>gammaproteobacteria    Pasteurella bettyae                                         hypothetical protein [Pasteurella bettyae].                                                                   492143048_?->492143050_?-><-492142921_?||492143029_?-><-492142909_?||492143018_?->750314533_?-><-492143056_N6-MTase*<-750314506_?<-492143138_?<-492142949_?<-750314535_Collar<-492143012_Baseplate_J<-492143123_GPW_gp25<-492143004_Phage_base_V
      494053240    <-Phage_sheath_1<-N6-MTase*<-?<-?<-?<-Collar<-?<-Baseplate_J                                                N6-MTase                    -                     HMPREF9095_RS07250    281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus aegyptius                                       hypothetical protein [Haemophilus aegyptius].                                                                 696248080_?-><-494053233_?<-494053234_?<-494053237_?<-491890345_?<-494053238_?<-494053239_Phage_sheath_1<-494053240_N6-MTase*<-491866182_?<-494053242_?<-491901935_?<-494053243_Collar<-494053244_?<-494053245_Baseplate_J<-491852313_?
      494789040    <-N6-MTase*<-?<-?<-?<-?<-Tail_P2_I<-Baseplate_J<-GPW_gp25                                                   N6-MTase                    -                     HMPREF1128_RS04375    281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus sputorum                                        hypothetical protein [Haemophilus sputorum].                                                                  <-494789045_?<-494789378_?<-494789560_?<-494788858_?<-494788621_?<-494790093_?<-494789843_?<-494789040_N6-MTase*<-491961422_?<-494788844_?<-494788564_?<-494790005_?<-494789847_Tail_P2_I<-494788657_Baseplate_J<-494788891_GPW_gp25
      503290984    <-N6-MTase*<-?<-?<-?<-Collar<-Tail_P2_I<-Baseplate_J<-GPW_gp25                                              N6-MTase                    -                     HIBPF_RS02220         281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                491889249_?->494053683_?->494053684_?->503290981_?->503290982_?->752488873_?->752488874_?-><-503290984_N6-MTase*<-752488875_?<-503290985_?<-503290986_?<-503290987_Collar<-491897168_Tail_P2_I<-503290988_Baseplate_J<-503290989_GPW_gp25
      503292691    Baseplate_J->?->?->Collar->?->?->?->N6-MTase*->Phage_sheath_1->                                             N6-MTase                    -                     HICON_RS02315         281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                494053245_Baseplate_J->494053244_?->752489194_?->503292689_Collar->503292690_?->503290985_?->752489128_?->503292691_N6-MTase*->494053239_Phage_sheath_1->494053238_?->491890345_?->494053237_?->494053234_?->503292692_?-><-491914347_?
      649508868    Thymidylate_synthase->N6-MTase*->                                                                           N6-MTase                    MethyltransfD12       M088_0657             281  bacteria>bacteroidetes                         Bacteroides ovatus str. 3725 D1 iv                          D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides ovatus str. 3725 D1 iv].      649508861_?->649508862_?->649508863_?->649508864_?->649508865_?->649508866_?->649508867_Thymidylate_synthase->649508868_N6-MTase*-><-649508869_?||649508870_?->649508871_?->649508872_?-><-649508873_?<-649508874_?<-649508875_?
      662243730    N6-MTase*->?->HNH->Terminase_SS->Terminase_LS->                                                             N6-MTase                    SP                    SASC598J21_017980     281  bacteria>proteobacteria>betaproteobacteria     Snodgrassella alvi SCGC AB-598-J21                          hypothetical protein SASC598J21_017980 [Snodgrassella alvi SCGC AB-598-J21].                                  662243723_?->662243724_?->662243725_?->662243726_?->662243727_?->662243728_?->662243729_?->662243730_N6-MTase*->662243731_?->662243732_HNH->662243733_Terminase_SS->662243734_Terminase_LS->662243735_?->662243736_?->662243737_?->
      696244941    GP46->Baseplate_J->DUF2313->Collar->?->?->?->N6-MTase*->                                                    N6-MTase                    -                     CK45_RS04150          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                696244953_GP46->696244951_Baseplate_J->696244949_DUF2313->696245023_Collar->696244947_?->696244945_?->696244943_?->696244941_N6-MTase*-><-696244940_?<-696244939_?||696244937_?-><-491880641_?||491907518_?-><-696244936_?<-696244934_?
      696250595    GP46->Baseplate_J->DUF2313->?->?->N6-MTase*->                                                               N6-MTase                    -                     HICG_RS06205          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                696250594_?->491961409_?->491961411_GP46->491961413_Baseplate_J->491961415_DUF2313->491961417_?->491961422_?->696250595_N6-MTase*-><-696250610_?<-491961426_?<-491961428_?||491961431_?->491961436_?->491961439_?->491961442_?->
      746003746    <-Phage_integrase||?->?->N6-MTase*->                                                                        N6-MTase                    -                     JP35_RS03815          281  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 <-746003737_Phage_integrase||746003738_?->746003739_?->746003746_N6-MTase*-><-746003740_?<-746003741_?||746003742_?-><-746003747_?<-746003743_?||746003744_?->746003745_?->
      746007528    DUF4376->?->Phage_sheath_1-><-N6-MTase*                                                                     N6-MTase                    -                     JP32_RS09880          281  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746007515_?->746007523_?-><-746007517_?<-746007519_?||746007525_DUF4376->746007521_?->746007527_Phage_sheath_1-><-746007528_N6-MTase*
      746088554    <-N6-MTase*<-?<-?||Phage_integrase->                                                                        N6-MTase                    -                     IO45_RS00140          281  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746011065_?->746088543_?->545092680_?-><-746088554_N6-MTase*<-746088546_?<-746088550_?||746088556_Phage_integrase-><-746088553_?
      746098344    GPW_gp25->Baseplate_J->Tail_P2_I->?-><-Phage_integrase||?->?->N6-MTase*->                                   N6-MTase                    -                     IO48_RS11150          281  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746089736_GPW_gp25->746098324_Baseplate_J->746098326_Tail_P2_I->746098340_?-><-746098342_Phage_integrase||746098328_?->746098330_?->746098344_N6-MTase*-><-503511892_?<-746098332_?<-746098346_?<-746003424_?<-503510518_?<-503510519_?||545092895_?->
      746100920    <-Phage_integrase||?->?->N6-MTase*-><-?<-?<-ABC-ATPase                                                      N6-MTase                    -                     JL04_RS11025          281  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746100909_?-><-746100912_Phage_integrase||545093172_?->545093173_?->746100920_N6-MTase*-><-545092600_?<-665836837_?<-746100915_ABC-ATPase||746100917_?->
      746131177    <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45                                       N6-MTase                    -                     ACEE_RS02875          281  bacteria>proteobacteria>gammaproteobacteria    Actinobacillus equuli                                       hypothetical protein [Actinobacillus equuli].                                                                 <-746131168_?<-746131169_?<-746131170_?<-746133702_?<-746131172_?||746131174_?->746131176_?-><-746131177_N6-MTase*<-746131178_?<-746131180_DUF4376<-746133704_?<-746131183_DUF2313<-746131185_Baseplate_J<-746131194_GP46<-746131195_Phage_Mu_Gp45
      748782878    GP46->Baseplate_J->DUF2313->Collar->?->?->?->N6-MTase*->                                                    N6-MTase                    -                     W820_RS02320          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                696246362_GP46->543996823_Baseplate_J->748782883_DUF2313->748782885_Collar->748782874_?->748782875_?->491866182_?->748782878_N6-MTase*-><-491864739_?<-748782880_?
      756154060    Phage_base_V->GPW_gp25->Baseplate_J->Tail_P2_I->?->?->?->N6-MTase*->                                        N6-MTase                    -                     SU55_RS07055          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                491897174_Phage_base_V->491897173_GPW_gp25->491897170_Baseplate_J->491897168_Tail_P2_I->756154059_?->491897340_?->491866182_?->756154060_N6-MTase*-><-491955477_?<-491880641_?||491880643_?-><-696242420_?||756154063_?->
      756154896    <-N6-MTase*                                                                                                 N6-MTase                    -                     SU30_RS04070          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                <-491907958_?<-491909806_?<-491909802_?<-756151526_?||756151519_?-><-491909047_?||499591788_?-><-756154896_N6-MTase*<-491866182_?<-756154897_?
      756163264    ABC-ATPase->ABC-ATPase->?->?-><-N6-MTase*<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J<-GP46                      N6-MTase                    -                     SU58_RS08535          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                756163260_?->527109474_?->491961245_?->756163261_ABC-ATPase->756163262_ABC-ATPase->491906242_?->756163263_?-><-756163264_N6-MTase*<-491866182_?<-756163265_?<-756163266_?<-756163321_Collar<-756163322_DUF2313<-756163267_Baseplate_J<-756163268_GP46
      764389671    <-N6-MTase*<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J<-GP46                                                    N6-MTase                    -                     NTHI723_RS04270       281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                <-740655046_?||491838320_?-><-740655049_?||740655051_?->740655053_?-><-764437855_?||764389667_?-><-764389671_N6-MTase*<-491866182_?<-764436837_?<-764389675_?<-764436842_Collar<-764389994_DUF2313<-764389683_Baseplate_J<-696246362_GP46
      764738267    Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->N6-MTase*-><-?<-METHYLASE                    N6-MTase                    -                     BN1226_RS02290        281  bacteria>proteobacteria>gammaproteobacteria    Mannheimia sp. MG13                                         hypothetical protein [Mannheimia sp. MG13].                                                                   764737997_Phage_Mu_Gp45->764737999_GP46->764738003_Baseplate_J->764738005_DUF2313->764738263_Collar->764738265_DUF4376->764738006_?->764738267_N6-MTase*-><-764738269_?<-764738008_METHYLASE<-764738010_?<-764738270_?<-764738012_?<-764738013_?<-492134135_?
      777210024    GP46->Baseplate_J->DUF2313->Collar->?->?->?->N6-MTase*->                                                    N6-MTase                    -                     ERS450003_01064       281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      D12 class N6 adenine-specific DNA methyltransferase [Haemophilus influenzae].                                 777210017_GP46->777210018_Baseplate_J->777210019_DUF2313->777210020_Collar->777210021_?->777210022_?->777210023_?->777210024_N6-MTase*-><-777210025_?<-777210026_?<-777210027_?||777210028_?->777210029_?-><-777210030_?||777210031_?->
      803453319    GPW_gp25->Baseplate_J->Tail_P2_I->Collar->?->?->?->N6-MTase*->                                              N6-MTase                    -                     C645_RS00620          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                491897173_GPW_gp25->491897170_Baseplate_J->491897168_Tail_P2_I->803453317_Collar->491901935_?->803453318_?->491866182_?->803453319_N6-MTase*-><-803453320_?||803453656_?->491901278_?->491896093_?->491896090_?-><-491896087_?||491896082_?->
      803453531    GPW_gp25->Baseplate_J->Tail_P2_I->Collar->?->?->?->N6-MTase*-><-?||?->?-><-?<-?||?-><-ABC-ATPase            N6-MTase                    -                     C645_RS06690          281  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                803453528_GPW_gp25->491901962_Baseplate_J->491901959_Tail_P2_I->803453529_Collar->803453668_?->803453530_?->491866182_?->803453531_N6-MTase*-><-491864739_?||803453532_?->803453533_?-><-491877319_?<-499591645_?||696245907_?-><-491893936_ABC-ATPase
      488719230    <-N6-MTase*                                                                                                 N6-MTase                    -                     HMPREF9021_RS11285    280  bacteria>proteobacteria>betaproteobacteria     Simonsiella muelleri                                        hypothetical protein [Simonsiella muelleri].                                                                  <-488719230_N6-MTase*<-488717776_?<-750347722_?<-488717778_?<-488717779_?<-750346942_?<-750347723_?
      492352476    <-ParA||?->N6-MTase*->?-><-?<-?||STN+Cna_B_2+Plug->                                                         N6-MTase                    MethyltransfD12       M116_RS19700          280  bacteria>bacteroidetes                         Bacteroides fragilis                                        DNA methyltransferase [Bacteroides fragilis].                                                                 <-695454433_ParA||492352478_?->492352476_N6-MTase*->492352474_?-><-492352472_?<-695337132_?||492352466_STN+Cna_B_2+Plug->492352464_?->492352462_?-><-695337131_?
      493291112    N6-MTase*->METHYLASE->                                                                                      N6-MTase                    -                     VK67_RS05530          280  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica                                      hypothetical protein [Mannheimia haemolytica].                                                                <-505297060_?<-493291101_?||493291104_?->493291105_?->493291108_?->493291109_?->493291110_?->493291112_N6-MTase*->493291113_METHYLASE->493291114_?->493291115_?-><-493291116_?||493291118_?-><-493291119_?<-493291120_?
      501020456    <-N6-MTase*                                                                                                 N6-MTase                    -                     ASUC_RS07260          280  bacteria>proteobacteria>gammaproteobacteria    Actinobacillus succinogenes                                 hypothetical protein [Actinobacillus succinogenes].                                                           <-501020449_?||501020450_?->501020451_?-><-501020452_?||501020453_?->501020454_?->501020455_?-><-501020456_N6-MTase*<-501020458_?<-501020459_?||501020460_?-><-501020461_?||501020462_?->501020463_?-><-501020464_?
      525759492    N6-MTase*->                                                                                                 N6-MTase                    -                     J450_RS04260          280  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica                                      hypothetical protein [Mannheimia haemolytica].                                                                525759486_?->493295450_?->493293312_?->525759488_?->525759489_?->525759490_?->525759491_?->525759492_N6-MTase*->696267597_?->525759493_?->493293694_?-><-493292220_?<-493292218_?<-493294463_?<-493294462_?
      544657538    <-N6-MTase*<-DCM                                                                                            N6-MTase                    MethyltransfD12       ATCC51562_RS05210     280  bacteria>proteobacteria>epsilonproteobacteria  Campylobacter concisus                                      hypothetical protein [Campylobacter concisus].                                                                <-544657507_?<-737181490_?<-737181469_?<-544657524_?<-737181491_?<-544657530_?<-737181471_?<-544657538_N6-MTase*<-544657522_DCM<-544657549_?<-737181492_?<-544657545_?||544657510_?-><-544657536_?||737181493_?->
      544865513    <-N6-MTase*                                                                                                 N6-MTase                    -                     L278_RS124350         280  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica                                      hypothetical protein [Mannheimia haemolytica].                                                                493293008_?->544865512_?->525759340_?->493290115_?-><-493290116_?<-493290117_?<-696267597_?<-544865513_N6-MTase*<-544865514_?<-493291109_?<-544865515_?<-544865518_?<-544865520_?||493293688_?-><-493292580_?
      544865770    GP46->Baseplate_J->DUF2313->?->DUF4376->?->N6-MTase*->                                                      N6-MTase                    -                     L278_RS122210         280  bacteria>proteobacteria>gammaproteobacteria    Mannheimia haemolytica                                      hypothetical protein [Mannheimia haemolytica].                                                                544865763_?->544865764_GP46->544865765_Baseplate_J->544865766_DUF2313->544865767_?->544865768_DUF4376->544865769_?->544865770_N6-MTase*-><-544865771_?<-544865772_?||544865773_?->493296028_?-><-493296027_?||493294126_?->544865774_?->
      639782857    <-METHYLASE||?->?-><-?||?-><-N6-MTase*<-?<-?<-?<-Collar                                                     N6-MTase                    -                     TMA01S_RS05515        280  bacteria>bacteroidetes                         Tenacibaculum maritimum                                     DNA methyltransferase [Tenacibaculum maritimum].                                                              639782849_?->639782851_?-><-639782852_METHYLASE||639782853_?->740182981_?-><-639782855_?||639782856_?-><-639782857_N6-MTase*<-639782858_?<-639782859_?<-639782861_?<-639782863_Collar<-740182974_?<-639782867_?<-639782868_?
      695294566    <-ParA||?->N6-MTase*->                                                                                      N6-MTase                    MethyltransfD12       M127_RS12840          280  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            492291783_?->695339004_?->695339060_?-><-695400500_?<-496602564_?<-499301741_ParA||492279331_?->695294566_N6-MTase*->492279325_?-><-492279322_?
      695330037    STN+Cna_B_2+Plug->?->?->?-><-?||?-><-N6-MTase*<-?||ParA->                                                   N6-MTase                    MethyltransfD12       HMPREF1079_RS0100985  280  bacteria>bacteroidetes                         Bacteroides fragilis                                        DNA methyltransferase [Bacteroides fragilis].                                                                 492291771_?->492279316_STN+Cna_B_2+Plug->492291773_?->492279320_?->492279322_?-><-492279325_?||657213291_?-><-695330037_N6-MTase*<-492279331_?||492291778_ParA->492279339_?->492291779_?->492291780_?-><-492291781_?<-492291782_?
      695344948    MuF->?->?->?->?->?->Thymidylate_synthase->N6-MTase+Phage-tailfib*-><-?<-Phage_tail_S                        N6-MTase+Phage-tailfib      SP                    BFAG_RS07280          280  bacteria>bacteroidetes                         Bacteroides fragilis                                        DNA methyltransferase [Bacteroides fragilis].                                                                 695344945_MuF->492223777_?->695344946_?->492223780_?->492223783_?->492223786_?->695344947_Thymidylate_synthase->695344948_N6-MTase+Phage-tailfib*-><-492223794_?<-492223795_Phage_tail_S<-695344949_?<-695344950_?<-492223806_?<-695344951_?<-492223813_?
      695540882    <-ParA<-?||N6-MTase*-><-?||?-><-?<-?<-?<-STN+Cna_B_2+Plug                                                   N6-MTase                    MethyltransfD12       M070_RS00960          280  bacteria>bacteroidetes                         Bacteroides fragilis                                        DNA methyltransferase [Bacteroides fragilis].                                                                 492279352_?->492279349_?->492279345_?-><-492279342_?<-492279339_?<-492279337_ParA<-515708972_?||695540882_N6-MTase*-><-515708970_?||492279325_?-><-492279322_?<-496602556_?<-492279318_?<-695430123_STN+Cna_B_2+Plug<-492291771_?
      696234173    <-VirD4-FtsK<-?<-?<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->                                                   N6-MTase                    MethyltransfD12       C799_RS11490          280  bacteria>bacteroidetes                         Bacteroides thetaiotaomicron                                DNA methyltransferase [Bacteroides thetaiotaomicron].                                                         511014028_?->696233857_?-><-511014030_VirD4-FtsK<-696234170_?<-696234172_?<-511014033_?<-511014034_?<-696234173_N6-MTase*<-511014036_?||696234174_ParA->511014038_?->511014039_?->511014040_DOC->696234175_?-><-511014042_?
      258520722    <-N6-MTase*                                                                                                 N6-MTase                    SP                    HMPREF0198_0362       279  bacteria>proteobacteria>gammaproteobacteria    Cardiobacterium hominis ATCC 15826                          hypothetical protein HMPREF0198_0362 [Cardiobacterium hominis ATCC 15826].                                    <-258520715_?<-258520716_?<-258520717_?<-258520718_?<-258520719_?||258520720_?->258520721_?-><-258520722_N6-MTase*||258520723_?-><-258520649_?<-258520650_?||258520651_?->258520652_?->258520653_?->258520654_?->
      488141552    <-N6-MTase*<-?<-?<-?<-DUF2313<-Baseplate_J<-Baseplate_J<-GP46                                               N6-MTase                    -                     NMA510612_RS09285     279  bacteria>proteobacteria>betaproteobacteria     Neisseria meningitidis                                      hypothetical protein [Neisseria meningitidis].                                                                <-496710478_?||488147750_?-><-504393489_?||488175119_?->488175120_?->488175121_?-><-488147744_?<-488141552_N6-MTase*<-488141551_?<-488141550_?<-488175122_?<-728043483_DUF2313<-488141547_Baseplate_J<-488141546_Baseplate_J<-488141544_GP46
      488182095    <-N6-MTase*<-?<-?<-?<-?<-DUF2313<-Baseplate_J<-Baseplate_J                                                  N6-MTase                    -                     NM70082_RS106455      279  bacteria>proteobacteria>betaproteobacteria     Neisseria meningitidis                                      D12 class N6 adenine-specific DNA methyltransferase family protein [Neisseria meningitidis].                  488163872_?-><-488147748_?||488147747_?->728042758_?->488149296_?->488166023_?-><-488147744_?<-488182095_N6-MTase*<-488141551_?<-488141550_?<-488166024_?<-488166025_?<-728043483_DUF2313<-488141547_Baseplate_J<-488141546_Baseplate_J
      488717644    <-N6-MTase*                                                                                                 N6-MTase                    -                     HMPREF9021_RS03005    279  bacteria>proteobacteria>betaproteobacteria     Simonsiella muelleri                                        hypothetical protein [Simonsiella muelleri].                                                                  488717637_?->488717638_?->488717639_?->488717640_?->488717641_?->750346891_?->488717643_?-><-488717644_N6-MTase*<-750346893_?<-488717647_?<-488717648_?<-488717649_?<-488717650_?<-488717651_?<-488717652_?
      492281093    Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF                                         N6-MTase                    -                     M137_RS14330          279  bacteria>bacteroidetes                         Bacteroidales                                               MULTISPECIES: DNA methyltransferase [Bacteroidales].                                                          495943786_?->492281109_?->492281107_?->492281103_?->695409052_?->492281099_Phage_tail_S->492281096_?-><-492281093_N6-MTase*<-492281092_Thymidylate_synthase<-495896300_?<-492281090_?<-495896298_?<-492281087_?<-492281085_MuF<-492281083_?
      495950973    Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF                                         N6-MTase                    -                     BSBG_RS01095          279  bacteria>bacteroidetes                         Bacteroides sp. 9_1_42FAA                                   DNA methyltransferase [Bacteroides sp. 9_1_42FAA].                                                            495950962_?->495950963_?->495950965_?->696359225_?->495950968_?->696359215_Phage_tail_S->495950972_?-><-495950973_N6-MTase*<-495950975_Thymidylate_synthase<-496053796_?<-495950980_?<-696359216_?<-495950989_?<-696359226_MuF<-696359218_?
      496053795    Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF                                         N6-MTase                    -                     HMPREF9008_RS11875    279  bacteria>bacteroidetes                         Parabacteroides sp. 20_3                                    DNA methyltransferase [Parabacteroides sp. 20_3].                                                             496053793_?->495950963_?->495950965_?->696359225_?->495950968_?->736514586_Phage_tail_S->495950972_?-><-496053795_N6-MTase*<-495950975_Thymidylate_synthase<-496053796_?<-496053797_?<-496053798_?<-495950989_?<-696359226_MuF<-696359218_?
      496057734    Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF                                         N6-MTase                    -                     HMPREF9011_RS05730    279  bacteria>bacteroidetes                         Bacteroides sp. 3_1_40A                                     DNA methyltransferase [Bacteroides sp. 3_1_40A].                                                              496057730_?->492281109_?->492281107_?->496057731_?->492281101_?->496057732_Phage_tail_S->496057733_?-><-496057734_N6-MTase*<-496057735_Thymidylate_synthase<-696377523_?<-496057737_?<-496057738_?<-496057739_?<-696377528_MuF<-496057741_?
      496464403    Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->?->?->N6-MTase*->                                                N6-MTase                    -                     HMPREF9016_RS00975    279  bacteria>proteobacteria>betaproteobacteria     Neisseria sp. oral taxon 014                                hypothetical protein [Neisseria sp. oral taxon 014].                                                          496464397_?->496464398_Phage_Mu_Gp45->488141544_GP46->748592186_Baseplate_J->496464400_DUF2313->496464401_?->496464402_?->496464403_N6-MTase*-><-496464404_?||496464405_?->496464406_?->748592187_?->748592188_?->496464409_?->496464410_?->
      500646766    <-ParA||?->N6-MTase*->?-><-?||?->STN+Cna_B_2+Plug->                                                         N6-MTase                    MethyltransfD12       BVU_RS04850           279  bacteria>bacteroidetes                         Bacteroides vulgatus                                        DNA methyltransferase [Bacteroides vulgatus].                                                                 500646760_?->500646761_?->500646762_?-><-500646763_?<-500646764_?<-500646765_ParA||495946968_?->500646766_N6-MTase*->752488389_?-><-752488390_?||752488391_?->500646770_STN+Cna_B_2+Plug->500646771_?->500646772_?->500646773_?->
      504603820    <-Thymidylate_synthase<-N6-MTase*                                                                           N6-MTase                    -                     ORNRH_RS05640         279  bacteria>bacteroidetes                         Ornithobacterium rhinotracheale                             DNA methyltransferase [Ornithobacterium rhinotracheale].                                                      <-504603116_?||504603814_?->504603815_?->504603816_?->738702467_?->754917207_?-><-504603819_Thymidylate_synthase<-504603820_N6-MTase*<-504603821_?<-754917212_?<-504603823_?<-504603824_?<-504603825_?<-504603826_?<-754917215_?
      514429566    <-N6-MTase*<-?<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45                                                N6-MTase                    SP                    P1062_RS03850         279  bacteria>proteobacteria>gammaproteobacteria    Pasteurella multocida                                       hypothetical protein [Pasteurella multocida].                                                                 <-492015334_?||504092484_?->512748257_?->492027804_?->504092482_?->492027802_?->757461359_?-><-514429566_N6-MTase*<-514429567_?<-757461361_?<-757461140_DUF2313<-514165916_Baseplate_J<-514165917_GP46<-514165918_Phage_Mu_Gp45<-757461141_?
      514429666    N6-MTase*->                                                                                                 N6-MTase                    SP                    P1062_RS06555         279  bacteria>proteobacteria>gammaproteobacteria    Pasteurella multocida                                       hypothetical protein [Pasteurella multocida].                                                                 <-492019952_?||492126203_?->514165526_?-><-514165525_?||514165524_?->514429660_?->757461336_?->514429666_N6-MTase*-><-492026848_?<-504092255_?<-492020358_?<-504092254_?||492026860_?->492020596_?->512748065_?->
      518349657    N6-MTase*->                                                                                                 N6-MTase                    MethyltransfD12       A3G3_RS0107195        279  bacteria>proteobacteria>gammaproteobacteria    Moraxella boevrei                                           hypothetical protein [Moraxella boevrei].                                                                     750349540_?->750349543_?->518349655_?->750349546_?->518349657_N6-MTase*-><-518349658_?<-518349659_?<-518349660_?||648521159_?->518349662_?->518349663_?->518349665_?->
      647518976    MuF->?->?->?->?->?->N6-MTase*-><-Phage_tail_S                                                               N6-MTase                    -                     JCM12083_RS06185      279  bacteria>bacteroidetes                         Prevotella shahii                                           DNA methyltransferase [Prevotella shahii].                                                                    647518963_?->763201846_MuF->647518967_?->647518969_?->647518972_?->763201833_?->647518974_?->647518976_N6-MTase*-><-647518978_Phage_tail_S<-647518980_?<-647518983_?<-647518986_?<-647518988_?<-763201835_?<-763201837_?
      750347144    <-N6-MTase*                                                                                                 N6-MTase                    MethyltransfD12       HMPREF9021_RS06670    279  bacteria>proteobacteria>betaproteobacteria     Simonsiella muelleri                                        hypothetical protein [Simonsiella muelleri].                                                                  <-750347216_?<-488718372_?<-488718373_?<-488718374_?<-488718375_?<-750347218_?<-750347220_?<-750347144_N6-MTase*<-488718379_?<-488718380_?<-750347146_?<-488718381_?||488718382_?->750347222_?->488718384_?->
      810414634    N6-MTase*->                                                                                                 N6-MTase                    SP                    I926_RS02325          279  bacteria>proteobacteria>gammaproteobacteria    Pasteurella multocida                                       hypothetical protein [Pasteurella multocida].                                                                 <-810414619_?||810414621_?-><-810414623_?||810414625_?->810422398_?->810414628_?->810414631_?->810414634_N6-MTase*->810422401_?-><-810414636_?<-810414639_?||810414642_?->810414644_?->810414646_?->810414648_?->
      343968128    <-N6-MTase*<-?<-?<-?<-Caudo_TAP<-Collar<-DUF2313<-Baseplate_J                                               N6-MTase                    SP                    l11_17040             278  bacteria>proteobacteria>betaproteobacteria     Neisseria weaveri LMG 5135                                  hypothetical protein l11_17040 [Neisseria weaveri LMG 5135].                                                  <-343968128_N6-MTase*<-343968129_?<-343968130_?<-343968131_?<-343968132_Caudo_TAP<-343968133_Collar<-343968134_DUF2313<-343968135_Baseplate_J
      490416379    <-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC->                                     N6-MTase                    MethyltransfD12       HMPREF1181_RS12035    278  bacteria>bacteroidetes                         Bacteroides                                                 MULTISPECIES: DNA methyltransferase [Bacteroides].                                                            491885528_?->491885531_?->491885534_?-><-514974061_?<-514974062_LPD3+N4ART+N4ART+MPTase+MPTase<-514974063_?<-514974064_?<-490416379_N6-MTase*<-490416378_?||514974065_ParA->514974066_?->514974067_?->514974068_DOC->514974069_?-><-514974070_?
      496519123    N6-MTase*->?-><-?<-?||?->?->ABC-ATPase->                                                                    N6-MTase                    -                     HMPREF0669_RS04845    278  bacteria>bacteroidetes                         Prevotella sp. oral taxon 299                               hypothetical protein [Prevotella sp. oral taxon 299].                                                         496519116_?->496519117_?->763165970_?->763166059_?->496519120_?->532473657_?->763165972_?->496519123_N6-MTase*->763166060_?-><-496519125_?<-496519126_?||496519127_?->496519128_?->496519129_ABC-ATPase->496519130_?->
      501302336    GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->DCM->N6-MTase*-><-?<-?<-ABC-ATPase                          N6-MTase                    -                     HSM_RS03540           278  bacteria>proteobacteria>gammaproteobacteria    Histophilus somni                                           hypothetical protein [Histophilus somni].                                                                     501302328_GP46->501302329_Baseplate_J->501302330_DUF2313->753849627_Collar->501302332_DUF4376->501302333_?->501302334_DCM->501302336_N6-MTase*-><-501302337_?<-753849413_?<-753849629_ABC-ATPase<-501302340_?<-501302341_?<-501302342_?<-501302343_?
      503362551    <-McrC<-McrB<-?<-?<-?<-?<-Thymidylate_synthase<-N6-MTase*<-?<-?<-?<-?<-?<-Collar                            N6-MTase                    MethyltransfD12       WEEVI_RS00470         278  bacteria>bacteroidetes                         Weeksella virosa                                            DNA methyltransferase [Weeksella virosa].                                                                     <-503362544_McrC<-754544231_McrB<-503362546_?<-503362547_?<-754544048_?<-503362549_?<-503362550_Thymidylate_synthase<-503362551_N6-MTase*<-503362552_?<-503362553_?<-754544049_?<-503362555_?<-503362556_?<-503362557_Collar<-754544232_?
      503512750    Baseplate_J->Tail_P2_I->?-><-?<-?||DUF4376->?->N6-MTase*->                                                  N6-MTase                    -                     UMN179_RS12515        278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 503512743_Baseplate_J->503512744_Tail_P2_I->762905620_?-><-762905621_?<-503512747_?||762905622_DUF4376->503512749_?->503512750_N6-MTase*->503512751_?->503512752_?->503512753_?-><-503512754_?<-503512755_?<-503512756_?<-503512757_?
      517157783    Baseplate_J->Tail_P2_I->?->Collar->Caudo_TAP->DUF4376->?->N6-MTase*->                                       N6-MTase                    -                     IE01_RS08000          278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 517157788_Baseplate_J->517157787_Tail_P2_I->746085363_?->648519298_Collar->746085367_Caudo_TAP->746085370_DUF4376->517157784_?->517157783_N6-MTase*->517157782_?->517157781_?->517157780_?-><-517157779_?<-517157778_?<-746085373_?<-517157776_?
      547931225    <-N6-MTase*                                                                                                 N6-MTase                    -                     BN590_01677           278  bacteria>bacteroidetes                         Alistipes sp. CAG:29                                        d12 class N6 adenine-specific DNA methyltransferase [Alistipes sp. CAG:29].                                   <-547931222_?<-547931223_?<-547931224_?<-547931225_N6-MTase*<-547931226_?<-547931227_?<-547931228_?
      640570678    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    MethyltransfD12       JCM15754_RS11780      278  bacteria>bacteroidetes                         Prevotella aurantiaca                                       DNA methyltransferase [Prevotella aurantiaca].                                                                640570671_?->640570672_?->640570673_?->640570674_?->640570675_?->640570676_?->640570677_?->640570678_N6-MTase*->640570679_Thymidylate_synthase->
      736169879    <-N6-MTase*                                                                                                 N6-MTase                    -                     Q338_RS01810          278  bacteria>proteobacteria>betaproteobacteria     Alysiella crassa                                            hypothetical protein [Alysiella crassa].                                                                      <-736169847_?<-736169851_?<-736169855_?<-736169861_?||736169865_?->736169871_?->736169875_?-><-736169879_N6-MTase*<-736169883_?<-736170079_?<-736169888_?<-736169894_?<-736170083_?<-736170090_?<-736169898_?
      737726745    ABC-ATPase-><-?||?->?-><-N6-MTase*<-DCM                                                                     N6-MTase                    -                     AJF4211_RS10020       278  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum                                 hypothetical protein [Avibacterium paragallinarum].                                                           516416440_?-><-737727047_?<-648446740_?||516416433_ABC-ATPase-><-737727048_?||545595273_?->545595597_?-><-737726745_N6-MTase*<-737726757_DCM<-516418180_?<-516416426_?<-516416425_?<-516416424_?<-516416423_?<-516416422_?
      737726850    <-N6-MTase*<-DCM<-?<-DUF4376<-Collar                                                                        N6-MTase                    -                     AJF4211_RS06190       278  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum                                 hypothetical protein [Avibacterium paragallinarum].                                                           <-516417945_?<-737726848_?<-545595613_?||516417941_?->516417940_?->737726843_?->545595615_?-><-737726850_N6-MTase*<-737726757_DCM<-516418180_?<-737726746_DUF4376<-545595617_Collar
      746010293    Baseplate_J->Tail_P2_I->?-><-?<-?||DUF4376->?->N6-MTase*->                                                  N6-MTase                    -                     JP28_RS09245          278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746010285_Baseplate_J->746010287_Tail_P2_I->746010297_?-><-746010289_?<-503512747_?||746010298_DUF4376->746010291_?->746010293_N6-MTase*->746010295_?->
      746011652    <-N6-MTase*<-DCM<-?<-?<-?||Phage_integrase->                                                                N6-MTase                    -                     JP34_RS00795          278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 <-746011641_?<-746011644_?<-746011645_?||746011647_?->746011648_?->746011649_?->746011651_?-><-746011652_N6-MTase*<-746011653_DCM<-746011654_?<-746011655_?<-746011657_?||746011659_Phage_integrase->746011660_?-><-746011834_?
      746067969    <-N6-MTase*<-?<-DUF4376<-?<-Tail_P2_I<-Baseplate_J                                                          N6-MTase                    -                     P375_RS07850          278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium genomosp. 2                                  hypothetical protein [Gallibacterium genomosp. 2].                                                            746067963_?->746067964_?->503512755_?->746067965_?-><-746067966_?<-746067967_?<-746067968_?<-746067969_N6-MTase*<-746067970_?<-746068026_DUF4376<-746068028_?<-746067971_Tail_P2_I<-746067972_Baseplate_J<-746067973_?<-746067974_?
      746089913    <-N6-MTase*<-?<-DUF4376                                                                                     N6-MTase                    -                     IO46_RS12295          278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746010848_?->746010845_?->746010843_?->746010841_?-><-746010840_?<-746010838_?<-757676485_?<-746089913_N6-MTase*<-746089910_?<-746089943_DUF4376||757676487_?->757676488_?->
      746094831    Baseplate_J->Tail_P2_I->?-><-?<-?||DUF4376->?->N6-MTase*->                                                  N6-MTase                    -                     JP33_RS07160          278  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 746094824_Baseplate_J->746094826_Tail_P2_I->746094868_?-><-746010289_?<-503512747_?||746094869_DUF4376->746094829_?->746094831_N6-MTase*->746094834_?->746094836_?->746094840_?-><-746094842_?<-746094845_?<-746094848_?<-746094849_?
      756151906    <-N6-MTase*                                                                                                 N6-MTase                    -                     SU30_RS01820          278  bacteria>proteobacteria>gammaproteobacteria    Haemophilus influenzae                                      hypothetical protein [Haemophilus influenzae].                                                                <-756151912_?<-491873780_?<-491853492_?<-491891724_?<-491874530_?<-491906619_?<-756151908_?<-756151906_N6-MTase*<-491866182_?||756151904_?-><-756151952_?||491915004_?-><-491849814_?||491906642_?-><-491906646_?
      763375484    <-N6-MTase*<-?<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J                                                       N6-MTase                    -                     GGE_RS05405           278  bacteria>proteobacteria>gammaproteobacteria    Haemophilus haemolyticus                                    hypothetical protein [Haemophilus haemolyticus].                                                              <-491782491_?<-491782456_?<-696247400_?<-491865825_?<-491865829_?<-763375481_?<-763375528_?<-763375484_N6-MTase*<-763375487_?<-491865842_?<-491865844_?<-491865847_?<-763375490_Collar<-491865850_DUF2313<-491865853_Baseplate_J
      805420685    <-N6-MTase*<-DCM                                                                                            N6-MTase                    -                     Z012_RS09750          278  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum                                 hypothetical protein [Avibacterium paragallinarum].                                                           805420673_?-><-516417306_?||805420675_?->805420677_?->805420679_?->805420681_?->805420683_?-><-805420685_N6-MTase*<-805420687_DCM
      385696246    Collar->?->?->?->?->?->?->N6-MTase*->                                                                       N6-MTase                    -                     HMPREF1054_1309       277  bacteria>proteobacteria>gammaproteobacteria    Haemophilus paraphrohaemolyticus HK411                      D12 class N6 adenine-specific DNA methyltransferase [Haemophilus paraphrohaemolyticus HK411].                 385696272_Collar->385696276_?->385696219_?->385696212_?->385696243_?->385696245_?->385696259_?->385696246_N6-MTase*->385696214_?->385696239_?->385696213_?->385696235_?->385696229_?->385696230_?->385696251_?->
      489754581    <-Portal<-?<-Phage_capsid<-?<-?<-N6-MTase*<-?<-Terminase_LS<-?<-Terminase_SS<-HNH                           N6-MTase                    -                     E9G_RS00410           277  bacteria>proteobacteria>gammaproteobacteria    Moraxella catarrhalis                                       hypothetical protein [Moraxella catarrhalis].                                                                 <-489754569_?<-489754571_?<-489754573_Portal<-489754574_?<-489754575_Phage_capsid<-738393062_?<-489754580_?<-489754581_N6-MTase*<-489754582_?<-489754584_Terminase_LS<-489754585_?<-489754589_Terminase_SS<-489754590_HNH<-489754591_?<-489754592_?
      489757769    <-Portal<-?<-Phage_capsid<-?<-?<-N6-MTase*<-?<-Terminase_LS<-?<-Terminase_SS<-HNH                           N6-MTase                    -                     E9K_RS05470           277  bacteria>proteobacteria>gammaproteobacteria    Moraxella catarrhalis                                       hypothetical protein [Moraxella catarrhalis].                                                                 <-489757759_?<-489757760_?<-489757762_Portal<-489754574_?<-489757764_Phage_capsid<-738383218_?<-489757768_?<-489757769_N6-MTase*<-489754582_?<-738431723_Terminase_LS<-489754585_?<-489757777_Terminase_SS<-489757779_HNH<-489757780_?<-489757782_?
      489767871    <-Portal<-?<-Phage_capsid<-?<-?<-N6-MTase*<-?<-Terminase_LS<-?<-Terminase_SS<-HNH                           N6-MTase                    -                     E9U_RS07920           277  bacteria>proteobacteria>gammaproteobacteria    Moraxella catarrhalis                                       hypothetical protein [Moraxella catarrhalis].                                                                 <-489757759_?<-489757760_?<-489757762_Portal<-489754574_?<-489767867_Phage_capsid<-738383218_?<-489757768_?<-489767871_N6-MTase*<-489767873_?<-738383198_Terminase_LS<-489754585_?<-489757777_Terminase_SS<-489767877_HNH<-489767879_?<-489767881_?
      491999424    DAM+DAM->?->?-><-?<-?<-?||?-><-N6-MTase*                                                                    N6-MTase                    -                     SVR5_RS07195          277  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis                                        hypothetical protein [Haemophilus parasuis].                                                                  491999129_DAM+DAM->491999132_?->544677724_?-><-491999431_?<-491999429_?<-491999427_?||491999425_?-><-491999424_N6-MTase*
      492141771    <-N6-MTase*<-?<-DUF4376                                                                                     N6-MTase                    -                     HMPREF1052_RS06075    277  bacteria>proteobacteria>gammaproteobacteria    Pasteurella bettyae                                         hypothetical protein [Pasteurella bettyae].                                                                   <-492141771_N6-MTase*<-750314401_?<-492141772_DUF4376
      493305395    RadC-><-?<-?<-?<-?<-?<-N6-MTase*                                                                            N6-MTase                    SP+MethyltransfD12    HMPREF9715_RS04510    277  bacteria>bacteroidetes                         Myroides odoratimimus                                       DNA methyltransferase [Myroides odoratimimus].                                                                <-738522835_?||493305389_RadC-><-493305390_?<-493305391_?<-493305392_?<-493305393_?<-493305394_?<-493305395_N6-MTase*<-493305397_?<-493305398_?<-493305399_?<-493305401_?<-493305402_?<-493305403_?<-493305405_?
      494312007    <-Thymidylate_synthase<-N6-MTase*                                                                           N6-MTase                    -                     HMPREF0645_RS12560    277  bacteria>bacteroidetes                         Prevotella bergensis                                        DNA methyltransferase [Prevotella bergensis].                                                                 <-494312003_Thymidylate_synthase<-494312007_N6-MTase*<-763258955_?<-494312011_?<-763258950_?<-494312015_?<-763258952_?<-494312020_?<-763258956_?
      494610799    <-Thymidylate_synthase<-N6-MTase*                                                                           N6-MTase                    -                     HMPREF9141_RS12020    277  bacteria>bacteroidetes                         Prevotella multiformis                                      DNA methyltransferase [Prevotella multiformis].                                                               <-494610798_Thymidylate_synthase<-494610799_N6-MTase*<-494610800_?||494610801_?-><-494610802_?<-494610803_?<-494610804_?<-494610805_?<-750264791_?
      517090945    MuF->?->?->?->?->N6-MTase*->                                                                                N6-MTase                    -                     CCYN49044_RS09840     277  bacteria>bacteroidetes                         Capnocytophaga cynodegmi                                    hypothetical protein [Capnocytophaga cynodegmi].                                                              517090938_?->517090939_?->750049331_MuF->750049334_?->750049335_?->517090942_?->517090944_?->517090945_N6-MTase*->
      517158409    <-N6-MTase*<-DCM<-?<-?||Phage_integrase->                                                                   N6-MTase                    -                     IE01_RS11495          277  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 <-517158405_?<-648519377_?<-517158406_?||517158407_?-><-517158409_N6-MTase*<-648519378_DCM<-517158413_?<-517158414_?||517158415_Phage_integrase->
      545432296    N6-MTase*->                                                                                                 N6-MTase                    -                     JCM6334_RS11905       277  bacteria>bacteroidetes                         Prevotella disiens                                          hypothetical protein [Prevotella disiens].                                                                    545432296_N6-MTase*->545429855_?->545429854_?-><-640636938_?
      644530078    <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46                                                      N6-MTase                    -                     F543_RS02755          277  bacteria>proteobacteria>gammaproteobacteria    Bibersteinia trehalosi                                      hypothetical protein [Bibersteinia trehalosi].                                                                <-505246074_?||740660613_?-><-505246072_?<-505246071_?<-505246069_?<-505246068_?<-505246067_?<-644530078_N6-MTase*<-644530083_?<-505246064_DUF4376<-644530088_?<-505246062_DUF2313<-505246061_Baseplate_J<-505246060_GP46<-505246059_?
      647557435    <-Thymidylate_synthase<-N6-MTase*                                                                           N6-MTase                    -                     JCM17725_RS06630      277  bacteria>bacteroidetes                         Prevotella scopos                                           DNA methyltransferase [Prevotella scopos].                                                                    647557418_?->647557420_?->647557422_?-><-647557426_?<-647557428_?||647557431_?-><-647557433_Thymidylate_synthase<-647557435_N6-MTase*<-647557437_?<-647557438_?<-647557440_?<-647557442_?<-647557444_?<-763207927_?<-647557448_?
      654481515    N6-MTase*->?-><-?||?-><-?<-STN+Cna_B_2+Plug                                                                 N6-MTase                    -                     L888_RS0101115        277  bacteria>bacteroidetes                         Hallella seregens                                           DNA methyltransferase [Hallella seregens].                                                                    654481508_?->654481509_?->654481510_?->654481511_?->654481512_?->654481513_?->654481514_?->654481515_N6-MTase*->654481516_?-><-654481517_?||654481518_?-><-654481519_?<-654481520_STN+Cna_B_2+Plug||654481521_?->763391280_?->
      655519412    N6-MTase*->                                                                                                 N6-MTase                    -                     X919_RS0112015        277  bacteria>bacteroidetes                         Prevotella sp. HJM029                                       DNA methyltransferase [Prevotella sp. HJM029].                                                                655519405_?->655519406_?->655519407_?->655519408_?->655519409_?->655519410_?->655519411_?->655519412_N6-MTase*-><-655519413_?||739036906_?->655519415_?->
      736743227    N6-MTase*->                                                                                                 N6-MTase                    -                     IW16_RS16985          277  bacteria>bacteroidetes                         Chryseobacterium vrystaatense                               DNA methyltransferase [Chryseobacterium vrystaatense].                                                        736743211_?->736743213_?->736743215_?->736743217_?->736743219_?->736743221_?->736743224_?->736743227_N6-MTase*-><-736743230_?<-736743233_?||736743236_?->736743239_?->736743242_?->736743245_?->736743248_?->
      737547480    <-N6-MTase*                                                                                                 N6-MTase                    -                     HPS41_RS09445         277  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis                                        hypothetical protein [Haemophilus parasuis].                                                                  <-492000594_?<-514062143_?<-491999443_?<-491999446_?<-514062144_?<-498485476_?||737547479_?-><-737547480_N6-MTase*<-737547481_?
      738545947    <-N6-MTase*<-?<-?<-Caudo_TAP<-Collar<-DUF2313<-Baseplate_J<-GP46                                            N6-MTase                    SP                    L13_RS00005           277  bacteria>proteobacteria>betaproteobacteria     Neisseria weaveri                                           hypothetical protein [Neisseria weaveri].                                                                     <-738545947_N6-MTase*<-490412638_?<-490412639_?<-490412640_Caudo_TAP<-490412641_Collar<-490411891_DUF2313<-490411892_Baseplate_J<-490412644_GP46
      746064999    <-Phage_integrase||?->?->?->DCM->N6-MTase*->                                                                N6-MTase                    -                     P375_RS00515          277  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium genomosp. 2                                  hypothetical protein [Gallibacterium genomosp. 2].                                                            <-746065027_?||746064987_?-><-746065030_Phage_integrase||746064990_?->746064993_?->746064996_?->746065033_DCM->746064999_N6-MTase*-><-746065002_?<-746065036_?
      746079094    <-N6-MTase*<-DCM<-?<-?<-?||Phage_integrase->                                                                N6-MTase                    -                     JP30_RS08890          277  bacteria>proteobacteria>gammaproteobacteria    Gallibacterium anatis                                       hypothetical protein [Gallibacterium anatis].                                                                 <-746079074_?<-746079076_?<-746007122_?<-746079080_?<-746079084_?||746079087_?->746079090_?-><-746079094_N6-MTase*<-746079097_DCM<-746079102_?<-746079106_?<-746079109_?||746079113_Phage_integrase->746079116_?->
      750343301    <-Portal<-?<-Phage_capsid<-?<-N6-MTase*<-Terminase_LS<-Terminase_SS<-HNH                                    N6-MTase                    -                     MOMA_RS09420          277  bacteria>proteobacteria>gammaproteobacteria    Moraxella macacae                                           hypothetical protein [Moraxella macacae].                                                                     <-497188035_?<-497188036_?<-497188037_?<-497188038_Portal<-497188039_?<-497188040_Phage_capsid<-750343521_?<-750343301_N6-MTase*<-497188044_Terminase_LS<-497188045_Terminase_SS<-497188046_HNH<-750343523_?<-497188048_?<-750342927_?<-750343524_?
      750388073    <-N6-MTase*<-?<-?<-Caudo_TAP<-Collar<-DUF2313<-Baseplate_J<-GP46                                            N6-MTase                    -                     L11_RS07795           277  bacteria>proteobacteria>betaproteobacteria     Neisseria weaveri                                           hypothetical protein [Neisseria weaveri].                                                                     <-750388073_N6-MTase*<-490411886_?<-490411887_?<-490411889_Caudo_TAP<-490411890_Collar<-490411891_DUF2313<-490411892_Baseplate_J<-490411894_GP46
      771514766    N6-MTase*->                                                                                                 N6-MTase                    -                     M573_RS10255          277  bacteria>bacteroidetes                         Prevotella intermedia                                       DNA methyltransferase [Prevotella intermedia].                                                                771514769_?->771514770_?->771514761_?->771514762_?->771514763_?->771514764_?->771514765_?->771514766_N6-MTase*->
      806965486    <-Thymidylate_synthase<-N6-MTase+Phage-tailfib*                                                             N6-MTase+Phage-tailfib      SP                    SU65_11745            277  bacteria>bacteroidetes                         Flavobacterium psychrophilum                                DNA methyltransferase [Flavobacterium psychrophilum].                                                         <-806965479_?<-806965480_?<-806965481_?||806965482_?->806965483_?-><-806965484_?<-806965485_Thymidylate_synthase<-806965486_N6-MTase+Phage-tailfib*<-806965487_?<-806965488_?<-806965489_?<-806965490_?<-806965491_?<-806965492_?||806965493_?->
      261414145    <-METHYLASE<-?<-?<-?<-?<-?<-?<-N6-MTase*<-tail_3                                                            N6-MTase                    -                     D11S_2165             276  bacteria>proteobacteria>gammaproteobacteria    Aggregatibacter actinomycetemcomitans D11S-1                hypothetical protein D11S_2165 [Aggregatibacter actinomycetemcomitans D11S-1].                                <-261414138_METHYLASE<-261414139_?<-261414140_?<-261414141_?<-261414142_?<-261414143_?<-261414144_?<-261414145_N6-MTase*<-261414146_tail_3<-261414147_?<-261414148_?<-261414149_?<-261414150_?<-261414151_?<-261414152_?
      313137261    MuF->?->?->?->?->Thymidylate_synthase->N6-MTase*-><-?<-Phage_tail_S                                         N6-MTase                    MethyltransfD12       BFAG_03319            276  bacteria>bacteroidetes                         Bacteroides fragilis 3_1_12                                 D12 class N6 adenine-specific DNA methyltransferase [Bacteroides fragilis 3_1_12].                            313137254_?->313137255_MuF->313137256_?->313137257_?->313137258_?->313137259_?->313137260_Thymidylate_synthase->313137261_N6-MTase*-><-313137262_?<-313137263_Phage_tail_S||313137264_?-><-313137265_?<-313137266_?<-313137267_?<-313137268_?
      490468432    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    -                     PREBIDRAFT_RS00610    276  bacteria>bacteroidetes                         Prevotella bivia                                            DNA methyltransferase [Prevotella bivia].                                                                     490468423_?-><-738993203_?||490468429_?->490468432_N6-MTase*->490468435_Thymidylate_synthase-><-490465488_?<-490466789_?<-490466791_?<-490466793_?||490466795_?-><-490466797_?
      491746110    <-N6-MTase*<-tail_3                                                                                         N6-MTase                    -                     RHAA1_RS00240         276  bacteria>proteobacteria>gammaproteobacteria    Aggregatibacter actinomycetemcomitans                       hypothetical protein [Aggregatibacter actinomycetemcomitans].                                                 491746101_?->491746102_?->491746103_?->491746104_?->491746105_?-><-491746107_?<-696425996_?<-491746110_N6-MTase*<-491746114_tail_3<-491746116_?<-491746118_?<-491746120_?<-491746122_?<-491746124_?<-491746126_?
      494223274    <-Thymidylate_synthase<-N6-MTase*                                                                           N6-MTase                    -                     HMPREF9420_RS08510    276  bacteria>bacteroidetes                         Prevotella salivae                                          DNA methyltransferase [Prevotella salivae].                                                                   <-494223259_?<-763205527_?<-494223263_?<-494223265_?<-763205528_?<-763205531_?<-494223272_Thymidylate_synthase<-494223274_N6-MTase*<-494223281_?<-494223283_?<-494223285_?<-494223287_?<-494223289_?<-494223291_?<-763205533_?
      494451533    GP46->DUF2313->Collar->Caudo_TAP->?->?->?->N6-MTase*->                                                      N6-MTase                    SP                    HMPREF9952_RS06315    276  bacteria>proteobacteria>gammaproteobacteria    Haemophilus pittmaniae                                      hypothetical protein [Haemophilus pittmaniae].                                                                494451755_GP46->494451582_DUF2313->748589684_Collar->494451690_Caudo_TAP->494451609_?->494451730_?->494451628_?->494451533_N6-MTase*->748589685_?->494451618_?->494451551_?->748589669_?->748589686_?->494451693_?->494451748_?->
      497946642    Collar->?->?->?->?->N6-MTase*->Thymidylate_synthase->                                                       N6-MTase                    -                     ATHG_RS03835          276  bacteria>bacteroidetes                         Alistipes timonensis                                        DNA methyltransferase [Alistipes timonensis].                                                                 497946627_?->497946628_?->648239499_Collar->497946630_?->497946631_?->497946634_?->497946636_?->497946642_N6-MTase*->497946644_Thymidylate_synthase-><-497946646_?<-497946647_?<-497946650_?<-522184605_?<-497946655_?||497946658_?->
      499246665    N6-MTase*->                                                                                                 N6-MTase                    -                     HD_RS00615            276  bacteria>proteobacteria>gammaproteobacteria    Haemophilus ducreyi                                         hypothetical protein [Haemophilus ducreyi].                                                                   499246647_?->499246654_?->499246655_?->499246658_?->499246659_?->499246660_?->499246661_?->499246665_N6-MTase*->499246666_?->753847986_?->753847719_?->499246670_?->499246672_?->499246673_?->499246676_?->
      511019476    <-ABC-ATPase<-?||?->?->?->?-><-N6-MTase*<-Thymidylate_synthase                                              N6-MTase                    MethyltransfD12       C801_RS14355          276  bacteria>bacteroidetes                         Bacteroides uniformis                                       hypothetical protein [Bacteroides uniformis].                                                                 <-511019470_?<-495939893_ABC-ATPase<-511019471_?||511019472_?->511019473_?->511019474_?->737478242_?-><-511019476_N6-MTase*<-511019477_Thymidylate_synthase<-511019478_?<-737478243_?<-511019480_?<-511019481_?<-511019482_?<-511019483_?
      545363364    tail_3->N6-MTase*->?->?->?->ABC-ATPase->                                                                    N6-MTase                    -                     HMPREF9065_RS02985    276  bacteria>proteobacteria>gammaproteobacteria    Aggregatibacter sp. oral taxon 458                          hypothetical protein [Aggregatibacter sp. oral taxon 458].                                                    545363330_?->545363335_?->545363340_?->545363345_?->545363346_?->696449597_?->545363354_tail_3->545363364_N6-MTase*->545363367_?->545363374_?->696449550_?->545363384_ABC-ATPase->696449553_?->696449602_?->545363399_?->
      545434898    <-N6-MTase*                                                                                                 N6-MTase                    MethyltransfD12       HMPREF9148_RS11290    276  bacteria>bacteroidetes                         Prevotella sp. F0091                                        D12 class N6 adenine-specific DNA methyltransferase [Prevotella sp. F0091].                                   <-739039278_?<-545434895_?<-545434896_?<-545434898_N6-MTase*<-545434899_?||545434900_?->545434901_?-><-545434902_?<-739039285_?<-545434904_?<-545434905_?
      595939381    Thymidylate_synthase->N6-MTase*-><-?<-Phage_tail_S                                                          N6-MTase                    -                     M118_4484             276  bacteria>bacteroidetes                         Bacteroides fragilis str. 3783N1-2                          D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. 3783N1-2].      595939376_?->595939377_?->595939378_?->595939379_?->595939380_Thymidylate_synthase->595939381_N6-MTase*-><-595939382_?<-595939383_Phage_tail_S<-595939384_?<-595939385_?<-595939386_?<-595939387_?<-595939388_?
      640643393    <-N6-MTase*                                                                                                 N6-MTase                    -                     JCM14966_RS06695      276  bacteria>bacteroidetes                         Prevotella oulorum                                          DNA methyltransferase [Prevotella oulorum].                                                                   <-739020595_?||496530796_?->496530795_?->640643388_?->640643389_?->640643390_?->640643391_?-><-640643393_N6-MTase*<-640643394_?<-640643395_?<-739020600_?<-640643397_?<-640643398_?<-640643399_?<-739020603_?
      647603997    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    -                     K334_RS0105170        276  bacteria>bacteroidetes                         Prevotella baroniae                                         DNA methyltransferase [Prevotella baroniae].                                                                  653238928_?->647604005_?->653238929_?->739008434_?->647604001_?->653238930_?->647603998_?->647603997_N6-MTase*->647603996_Thymidylate_synthase-><-647603995_?<-653238931_?<-545310047_?<-545309968_?<-653238932_?<-647603994_?
      652666097    <-N6-MTase*<-N6-MTase                                                                                       N6-MTase                    Methyltransf_26       Q321_RS0105900        276  bacteria>proteobacteria>betaproteobacteria     Conchiformibius steedae                                     hypothetical protein [Conchiformibius steedae].                                                               <-652666087_?<-652666088_?<-736299556_?<-652666092_?||652666093_?-><-736299559_?||652666095_?-><-652666097_N6-MTase*<-652666099_N6-MTase<-652666101_?<-652666102_?<-736299566_?<-736299570_?<-652666104_?||652666106_?->
      655515586    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    MethyltransfD12       P150_RS0104410        276  bacteria>bacteroidetes                         Prevotella sp. HUN102                                       DNA methyltransferase [Prevotella sp. HUN102].                                                                739060657_?->739060658_?->655515581_?->739060633_?->655515583_?->655515584_?->655515585_?->655515586_N6-MTase*->655515587_Thymidylate_synthase->739060659_?-><-655515588_?<-655515589_?<-655515590_?<-655515591_?<-655515592_?
      655516468    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    MethyltransfD12       P150_RS0109795        276  bacteria>bacteroidetes                         Prevotella sp. HUN102                                       DNA methyltransferase [Prevotella sp. HUN102].                                                                655516461_?->655516462_?->655516463_?->655516464_?->655516465_?->655516466_?->655516467_?->655516468_N6-MTase*->655516469_Thymidylate_synthase->655516470_?-><-655516471_?<-655516472_?||655516473_?->655516474_?->739060943_?->
      655516580    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    -                     P150_RS0110495        276  bacteria>bacteroidetes                         Prevotella sp. HUN102                                       DNA methyltransferase [Prevotella sp. HUN102].                                                                655516575_?->655516576_?->739060777_?->655516577_?->655516578_?->655516579_?->655515585_?->655516580_N6-MTase*->655516581_Thymidylate_synthase-><-739060778_?<-655516582_?||739060953_?-><-655516583_?<-655516584_?||655516585_?->
      737095939    Collar->N6-MTase*-><-?||?->?->?-><-?<-ABC-ATPase                                                            N6-MTase                    MethyltransfD12       H526_RS0116665        276  bacteria>bacteroidetes                         Aquimarina latercula                                        DNA methyltransferase [Aquimarina latercula].                                                                 737096088_?->653142542_?->737095942_?->653142541_?->653142540_?->653142539_?->737096086_Collar->737095939_N6-MTase*-><-653142537_?||653144827_?->653144828_?->737097563_?-><-653144830_?<-653144831_ABC-ATPase<-737097565_?
      739003412    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    MethyltransfD12       HMPREF0654_RS11780    276  bacteria>bacteroidetes                         Prevotella disiens                                          DNA methyltransferase [Prevotella disiens].                                                                   739003411_?->739003412_N6-MTase*->739003418_Thymidylate_synthase->
      739005860    N6-MTase*->Thymidylate_synthase->?->VirD4-FtsK->                                                            N6-MTase                    -                     HMPREF1651_RS08825    276  bacteria>bacteroidetes                         Prevotella bivia                                            DNA methyltransferase [Prevotella bivia].                                                                     <-739005859_?||490468429_?->739005860_N6-MTase*->739005861_Thymidylate_synthase->739005865_?->739005862_VirD4-FtsK-><-488624280_?<-695331374_?<-488624282_?<-488624283_?
      739058226    <-Thymidylate_synthase<-N6-MTase*<-?<-Phage_tail_S<-MuF                                                     N6-MTase                    MethyltransfD12       HMPREF9304_RS12585    276  bacteria>bacteroidetes                         Prevotella timonensis                                       DNA methyltransferase [Prevotella timonensis].                                                                <-739058243_?<-739058244_?<-739058222_Thymidylate_synthase<-739058226_N6-MTase*<-739058238_?<-739058230_Phage_tail_S<-739058239_MuF<-739058240_?
      763168088    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    MethyltransfD12       PIN17_RS06195         276  bacteria>bacteroidetes                         Prevotella intermedia                                       DNA methyltransferase [Prevotella intermedia].                                                                504522352_?->504522353_?->504522354_?->504522355_?->504522356_?->504522357_?->763168086_?->763168088_N6-MTase*->504522361_Thymidylate_synthase->504522362_?->763168089_?-><-504522364_?<-763168091_?<-490507278_?<-490471607_?
      763205581    N6-MTase*->Thymidylate_synthase->                                                                           N6-MTase                    -                     HMPREF9420_RS10145    276  bacteria>bacteroidetes                         Prevotella salivae                                          DNA methyltransferase [Prevotella salivae].                                                                   494223933_?-><-494223934_?<-763205919_?||494223936_?->763205580_?->763205920_?->494223939_?->763205581_N6-MTase*->494223941_Thymidylate_synthase->494223942_?->494223943_?->494223944_?-><-763205583_?||494223946_?->494223947_?->
      786219197    <-N6-MTase*                                                                                                 N6-MTase                    -                     BN1088_RS06390        276  bacteria>bacteroidetes                         Sphingobacterium sp. PM2-P1-29                              DNA methyltransferase [Sphingobacterium sp. PM2-P1-29].                                                       786219183_?->786219185_?->786219187_?->786219189_?->786219191_?-><-786219193_?<-786219195_?<-786219197_N6-MTase*<-786219199_?<-786219202_?<-786219204_?<-786219206_?<-786219209_?<-786219212_?<-786219214_?
      489846667    Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->N6-MTase*->                                  N6-MTase                    -                     NEIPOLOT_RS06080      275  bacteria>proteobacteria>betaproteobacteria     Neisseria polysaccharea                                     hypothetical protein [Neisseria polysaccharea].                                                               489846657_Phage_Mu_Gp45->489846658_GP46->757453500_Baseplate_J->489846661_DUF2313->489846662_Collar->489846663_DUF4376->489846665_?->489846667_N6-MTase*-><-645192739_?<-489846672_?<-645192740_?||489846675_?-><-489846680_?<-489846682_?<-489846683_?
      489871056    <-N6-MTase*<-?<-?<-?<-?<-?<-METHYLASE                                                                       N6-MTase                    -                     NELON_RS04600         275  bacteria>proteobacteria>betaproteobacteria     Neisseria elongata                                          hypothetical protein [Neisseria elongata].                                                                    489871039_?->489871042_?->750384455_?->489871047_?->489871049_?->489871051_?->489871052_?-><-489871056_N6-MTase*<-489870829_?<-489871070_?<-489871073_?<-750384457_?<-489871079_?<-489871081_METHYLASE||489871086_?->
      489918676    GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->?->N6-MTase*->                                              N6-MTase                    -                     EIKCOROL_RS01150      275  bacteria>proteobacteria>betaproteobacteria     Eikenella corrodens                                         hypothetical protein [Eikenella corrodens].                                                                   489918666_GP46->489918667_Baseplate_J->489918668_DUF2313->737609230_Collar->489918671_DUF4376->489918673_?->737609113_?->489918676_N6-MTase*-><-489918679_?||737609116_?->489918682_?->489918683_?-><-489918684_?<-489918685_?<-489918688_?
      495946224    <-N6-MTase*<-?||?->DOC->                                                                                    N6-MTase                    MethyltransfD12       BSFG_RS03795          275  bacteria>bacteroidetes                         Bacteroides sp. 4_3_47FAA                                   hypothetical protein, partial [Bacteroides sp. 4_3_47FAA].                                                    <-495946217_?<-696364617_?<-495946222_?<-495946224_N6-MTase*<-495946227_?||696364624_?->495946231_DOC-><-495946232_?||696364621_?->696364623_?-><-495946235_?
      565956523    MuF->?->N6-MTase*-><-?<-Phage_tail_S                                                                        N6-MTase                    -                     HMPREF1199_RS02420    275  bacteria>bacteroidetes                         Prevotella oralis                                           hypothetical protein [Prevotella oralis].                                                                     738975553_?->565956498_?->565956503_?->565956506_?->738975744_?->738975745_MuF->565956519_?->565956523_N6-MTase*-><-565956528_?<-565956533_Phage_tail_S<-565956539_?<-565956551_?<-565956557_?<-565956560_?<-490503989_?
      633953678    N6-MTase*->                                                                                                 N6-MTase                    -                     HPS41_07110           275  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis ST4-1                                  hypothetical protein HPS41_07110 [Haemophilus parasuis ST4-1].                                                633953678_N6-MTase*-><-633953679_?||633953680_?->633953681_?->633953682_?->633953683_?->633953684_?->633953685_?->
      640590225    <-N6-MTase*<-Thymidylate_synthase                                                                           N6-MTase                    MethyltransfD12       JCM16497_RS23455      275  bacteria>bacteroidetes                         Bacteroides sartorii                                        DNA methyltransferase [Bacteroides sartorii].                                                                 <-640590216_?<-640590217_?||640590218_?->640590219_?->640590222_?->640590223_?->640590224_?-><-640590225_N6-MTase*<-640590226_Thymidylate_synthase<-640590227_?<-727808809_?<-640590229_?
      737511689    <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45                                       N6-MTase                    -                     HPS9_RS04300          275  bacteria>proteobacteria>gammaproteobacteria    Haemophilus parasuis                                        hypothetical protein [Haemophilus parasuis].                                                                  <-737511689_N6-MTase*<-737511692_?<-737511694_DUF4376<-652522515_?<-737511697_DUF2313<-737511699_Baseplate_J<-506592116_GP46<-737511701_Phage_Mu_Gp45
      750049800    <-N6-MTase*                                                                                                 N6-MTase                    -                     HMPREF0198_RS01770    275  bacteria>proteobacteria>gammaproteobacteria    Cardiobacterium hominis                                     hypothetical protein [Cardiobacterium hominis].                                                               <-490241187_?<-490241188_?<-490241189_?<-490241191_?<-750049796_?||750050068_?->490241194_?-><-750049800_N6-MTase*<-750050071_?<-490241199_?||490241201_?->750050072_?->490241206_?->490241208_?->750050074_?->
      495910797    Collar->?->Thymidylate_synthase->N6-MTase*->                                                                N6-MTase                    MethyltransfD12       BZARG_RS04055         274  bacteria>bacteroidetes                         Bizionia argentinensis                                      DNA methyltransferase [Bizionia argentinensis].                                                               495910929_?->495911099_?->749809173_?->749809174_?->749809175_Collar->495910915_?->495911144_Thymidylate_synthase->495910797_N6-MTase*->495910913_?->495911079_?->749809153_?-><-495910801_?||495911112_?-><-495910949_?<-495911118_?
      499512619    <-N6-MTase*<-?<-?<-Caudo_TAP<-Collar<-Tail_P2_I<-Baseplate_J<-GPW_gp25                                      N6-MTase                    -                     MS_RS00345            274  bacteria>proteobacteria>gammaproteobacteria    [Mannheimia] succiniciproducens                             hypothetical protein [[Mannheimia] succiniciproducens].                                                       499512611_?->753910721_?->499512613_?->499512615_?->499512616_?->499512617_?->499512618_?-><-499512619_N6-MTase*<-499512620_?<-499512621_?<-499512622_Caudo_TAP<-499512623_Collar<-753909406_Tail_P2_I<-499512625_Baseplate_J<-753909410_GPW_gp25
      548211070    <-N6-MTase*                                                                                                 N6-MTase                    SP+Methyltransf_26    BN741_01478           274  bacteria>bacteroidetes                         Prevotella stercorea CAG:629                                d12 class N6 adenine-specific DNA methyltransferase family protein [Prevotella stercorea CAG:629].            <-548211068_?<-548211069_?<-548211070_N6-MTase*<-548211071_?<-548211072_?<-548211073_?<-548211074_?<-548211075_?<-548211076_?<-548211077_?
      739635808    N6-MTase*->?->?->?->?->Phage_capsid->?->Portal->                                                            N6-MTase                    Methyltransf_26       SALWKB29_RS09160      274  bacteria>proteobacteria>betaproteobacteria     Snodgrassella alvi                                          hypothetical protein [Snodgrassella alvi].                                                                    739635869_?->739635798_?->739635799_?->739635802_?->739635804_?->739635805_?->739635807_?->739635808_N6-MTase*->739635812_?->739635815_?->739635818_?->739635876_?->739635879_Phage_capsid->739635821_?->739635826_Portal->
      740746518    <-N6-MTase*<-?<-?<-?<-?<-?<-Collar                                                                          N6-MTase                    MethyltransfD12       BN863_RS14255         274  bacteria>bacteroidetes                         Formosa agariphila                                          DNA methyltransferase [Formosa agariphila].                                                                   740746510_?->740746512_?->740748407_?-><-740746513_?||740746514_?-><-740746516_?<-740746517_?<-740746518_N6-MTase*<-740746519_?<-740746521_?<-740746522_?<-740746523_?<-740746524_?<-740746525_Collar<-740746527_?
      488758369    Collar->?->?->?->?->?->N6-MTase*->N6-MTase->                                                                N6-MTase                    -                     CAPSP0001_RS11720     273  bacteria>bacteroidetes                         Capnocytophaga sputigena                                    D12 class N6 adenine-specific DNA methyltransferase family protein [Capnocytophaga sputigena].                488758360_?->488758329_Collar->488758368_?->488766210_?->488758379_?->488758366_?->488758350_?->488758369_N6-MTase*->488758377_N6-MTase->
      739535589    N6-MTase*->?->HNH->Terminase_SS->Terminase_LS->                                                             N6-MTase                    -                     SASC598J21_RS08380    273  bacteria>proteobacteria>betaproteobacteria     Snodgrassella alvi                                          hypothetical protein [Snodgrassella alvi].                                                                    739535578_?->739535581_?->739535583_?->739535586_?->739535442_?->739535446_?->739535448_?->739535589_N6-MTase*->739535450_?->739535453_HNH->739535456_Terminase_SS->739535458_Terminase_LS->739535461_?->739535592_?->739535595_?->
      763356669    Collar->?->?->?->?->N6-MTase+Phage-tailfib*->                                                               N6-MTase+Phage-tailfib      SP                    JCM21142_RS20860      273  bacteria>bacteroidetes                         Saccharicrinis fermentans                                   hypothetical protein, partial [Saccharicrinis fermentans].                                                    763356667_?->653273643_Collar->653273642_?->653273641_?->653273640_?->653273639_?->763356669_N6-MTase+Phage-tailfib*->
      488180979    Baseplate_J->Baseplate_J->DUF2313->?->?->?->?->N6-MTase*->                                                  N6-MTase                    -                     NM70021_RS109520      272  bacteria>proteobacteria>betaproteobacteria     Neisseria meningitidis                                      D12 class N6 adenine-specific DNA methyltransferase family protein [Neisseria meningitidis].                  488141546_Baseplate_J->488141547_Baseplate_J->728043483_DUF2313->488166025_?->488166024_?->488141550_?->488141551_?->488180979_N6-MTase*-><-488149547_?||488149549_?-><-488149551_?
      497282263    N6-MTase*->                                                                                                 N6-MTase                    -                     C506_RS0110745        272  bacteria>bacteroidetes                         Alistipes                                                   MULTISPECIES: DNA methyltransferase [Alistipes].                                                              517526431_?->703290626_?->517526433_?->648626765_?->497282137_?->517526435_?->497282133_?->497282263_N6-MTase*-><-517526436_?||517526437_?->497282296_?->497282210_?->517526438_?->517526439_?->517526440_?->
      523673311    ABC-ATPase-><-?<-?<-?||?->?-><-N6-MTase*<-DCM                                                               N6-MTase                    -                     AJF4211_000170        270  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum JF4211                          Putative uncharacterized protein [Avibacterium paragallinarum JF4211].                                        <-523673304_?||523673305_ABC-ATPase-><-523673306_?<-523673307_?<-523673308_?||523673309_?->523673310_?-><-523673311_N6-MTase*<-523673312_DCM<-523673313_?||523673314_?-><-523673315_?<-523673316_?<-523673317_?<-523673318_?
      523674289    <-N6-MTase*<-DCM<-?<-?<-DUF4376<-Collar                                                                     N6-MTase                    -                     AJF4211_000450        270  bacteria>proteobacteria>gammaproteobacteria    Avibacterium paragallinarum JF4211                          Putative uncharacterized protein [Avibacterium paragallinarum JF4211].                                        <-523674282_?<-523674283_?<-523674284_?||523674285_?->523674286_?->523674287_?->523674288_?-><-523674289_N6-MTase*<-523674290_DCM<-523674291_?<-523674292_?<-523674293_DUF4376<-523674294_Collar
      # 2;                                                                                                                                                                                                                                         
      490401642    <-N6-MTase*                                                                                                 N6-MTase                    -                     CUP_RS09075           283  bacteria>proteobacteria>epsilonproteobacteria  Campylobacter upsaliensis                                   hypothetical protein [Campylobacter upsaliensis].                                                             <-490401632_?||490401633_?->748624973_?->490401636_?->490401637_?->490401638_?-><-748624639_?<-490401642_N6-MTase*<-748624972_?
      736539564    <-N6-MTase*<-DCM                                                                                            N6-MTase                    SP                    LS72_RS05710          279  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter apodemus                                       hypothetical protein [Helicobacter apodemus].                                                                 <-736539555_?<-736539557_?<-736539558_?<-736539569_?<-736539559_?<-736539561_?<-736539563_?<-736539564_N6-MTase*<-736539566_DCM<-736539568_?
      
      
      Back to Contents

    • General notes and phyletic distribution of the eukaryotic AlkB families

      General notes:

      Phyletic distribution of various families AlkBH1: eukaryota,choanoflagellida, cnidaria,euglenozoa, fungi,haptophyceae heterolobosea,metazoa nucleariidae_and_fonticula,rhizaria, stramenopiles,viridiplantae AlkBH4: alveolates,choanoflagellida,kinetoplastida, metazoa, chlorophytes, some ciliates AlkBH6: eukaryota,amoebozoa apusozoa,kinetoplastida, fungi,ichthyosporea, metazoa,rhodophyta, stramenopiles,viridiplantae AlkBH7: eukaryota,choanoflagellida, kinetoplastida,fungi, haptophyceae, metazoa nucleariidae_and_fonticula,stramenopiles AlkBH8: eukaryota,alveolata apusozoa,euglenozoa fungi,haptophyceae heterolobosea,ichthyosporea metazoa,stramenopiles viridiplantae AlkBH2: eukaryota,choanoflagellida metazoa, rhizaria AlkBH3: haptophyceae,metazoa stramenopiles,viridiplantae AlkBH5: alveolata,amoebozoa cnidaria,ichthyosporea metazoa,viridiplantae FTO: eukaryota,apusozoa, ichthyosporea,metazoa rhizaria,stramenopiles viridiplantae Minor clades - AT4G02485-like found in Capsaspora, basal fungi and plants. -THAPSDRAFT_42543-like group with RNA modifying members in stramenopiles and haptophytes - SAD fused AlkB in fungi (sporadic in Asco, basals, Basidio, Chytrids) From the tree and phyletic patterns, we note that AlkB was transferred to the eukaryotes from a prokaryotic source after the splitting of the basal lineages in the common ancestor of the kinetoplastids and other eukaryotes. This further split to give the 5 basal clades include AlkBH1, H4,H6,H7 and H8. Later derivations that are widespread include six clades AlkBH2 (Animals, fungi, stramenopiles, rhizarians), AlkBH3 (animals, fungi, plants, Rhizarians, alveolates, stramenopiles), AlkBH5 (Metazoa, plants, ciliates) and FTO (Crown, stramenopiles, Rhizarians) AT4G02485-like and THAPSDRAFT_42543-like. The architectures reveal the general context of action. Major architectures include. --fusion to RRM and methylase in AlkBH8 -- fusion to a SAD and zinc ribbon in fungi -- Fusion to PHD in some chlorophytes and an AThook in Oxytricha in AlkBH5 -- Fusion to amidohydrolase and GST in a fungal version related to ALKBH2 -- Independent fusion to SAD in two stramenopiles, a PHD in one of them that has SAD, and an AThook in perkinsus of a group related to AlkBH3. -- Fusion to the CUEand TopC in a variety of fungi with occasional instances of fusions to JAB and UBA in a group related to AlkBH3, suggesting a role in combination with Ubiquitination. -- Fusion to HECT and little finger in P. sojae -- Fusion to ASCH in some stramenopiles of the FTO clade --Fusion to KH in Ttra1000007412 -- Fusion to Thump and Methylase in Fragilariopsis proteins of hte THAPSDRAFT_42543-like clade -- Fusion to ZNF+RRM+TET-JBP+AlkB-2OGFEDO in Aureococcus anophagefferens (323449197) of AlkBH1
      GI               Archs                                Archs                                            Pfam archs                                                                              Gene name               Len   Taxonomy                                              Species                                        Genbank
      # AlkBH1: In kinetoplastids, note fusion to ZNF+RRM+TET-JBP in Aureococcus
      86562425          -                                   -                                                2OG-FeII_Oxy_2      Note fragmented                                                     Y51H7C.5                248   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                         hypothetical protein Y51H7C.5 [Caenorhabditis elegans].
      326429695         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PTSG_06918              427   eukaryota>choanoflagellida                            Salpingoeca rosetta                            hypothetical protein PTSG_06918 [Salpingoeca rosetta].
      Crev1000000444    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Crev1000000444          369   eukaryota>fungi>kickxellomycotina                     Coemansia reversa                              estExt_Genemark1.C_20086
      Mcir1000011152    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Mcir1000011152          420   eukaryota>fungi>basal                                 Mucor circinelloides                           Genemark1.11549_g
      Bcir1000005678    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bcir1000005678          401   eukaryota>fungi>mucoromycotina                        Backusella circina                             e_gw1.67.64.1
      Pbla1000008754    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pbla1000008754          395   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       fgeneshPB_pg.23__178
      Uram1000002459    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Uram1000002459          430   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.10_#_156_#_combest_scaffold_10_5503
      Lhya1000001164    TM+TM+TM+TM                         -                                                TM+TM+TM+TM+2OG-FeII_Oxy_2                                                              Lhya1000001164          647   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         gm1.1132_g
      Bcir1000009260    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Bcir1000009260          427   eukaryota>fungi>mucoromycotina                        Backusella circina                             estExt_Genewise1Plus.C_2670019
      384493338         TM+TM                               -                                                TM+TM+2OG-FeII_Oxy_2                                                                    RO3G_08534              557   eukaryota>fungi                                       Rhizopus delemar RA 99-880                     hypothetical protein RO3G_08534 [Rhizopus delemar RA 99-880].
      Mcir1000002688    -                                   -                                                2OG-FeII_Oxy_2                                                                          Mcir1000002688          428   eukaryota>fungi>basal                                 Mucor circinelloides                           Genemark1.2761_g
      Pbla1000012921    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pbla1000012921          451   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       estExt_fgeneshPB_pg.C_50358
      Bcir1000010874    SP+TM+TM+TM+TM+TM                   -                                                SP+TM+TM+TM+TM+TM+2OG-FeII_Oxy_2                                                        Bcir1000010874          644   eukaryota>fungi>mucoromycotina                        Backusella circina                             estExt_Genewise1Plus.C_890072
      Lhya1000002295    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Lhya1000002295          358   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         estExt_Genemark1.C_180068
      Mver1000004016    -                                   -                                                2OG-FeII_Oxy_2                                                                          Mver1000004016          426   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (426 aa)
      Spun1000007058    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Spun1000007058          379   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 alkylated DNA repair protein AlkB (379 aa)
      58262090          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CNM01450                425   eukaryota>fungi>basidiomycota                         Cryptococcus neoformans var. neoformans JEC21  hypothetical protein CNM01450 [Cryptococcus neoformans var. neoformans JEC21].
      Wseb1000002876    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Wseb1000002876          383   eukaryota>fungi>basidiomycota                         Wallemia sebi                                  estExt_fgenesh1_kg.C_100073
      164650570         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LACBIDRAFT_305933       428   eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein [Laccaria bicolor S238N-H82].
      169853276         -                                   AlkB-2OGFEDO+TBC                                 2OG-FeII_Oxy_2+Ribosomal_S21e+DUF3548+RabGAP-TBC                                        CC1G_04298              1328  eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               hypothetical protein CC1G_04298 [Coprinopsis cinerea okayama7#130].
      527305476         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          FOMPIDRAFT_1027065      446   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_1027065 [Fomitopsis pinicola FP-58527 SS1].
      Abis1000001743    -                                   AlkB-2OGFEDO+TBC                                 2OG-FeII_Oxy_2+Ribosomal_S21e+DUF3548+RabGAP-TBC                                        Abis1000001743          1245  eukaryota>fungi>basidiomycota                         Agaricus bisporus                              estExt_fgenesh2_pm.C_20405
      Rall1000005813    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Rall1000005813          365   eukaryota>fungi>cryptomycota                          Rozella allomycis                              O9G_000964m.01
      Ccor1000005557    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ccor1000005557          411   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         gm1.6199_g
      Amac1000008309    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Amac1000008309          378   eukaryota>fungi>blastocladiomycota                    Allomyces macrogynus                            Allomyces macrogynus ATCC 38327 alkylated DNA repair protein AlkB (378 aa)
      Amac1000014273    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Amac1000014273          378   eukaryota>fungi>blastocladiomycota                    Allomyces macrogynus                            Allomyces macrogynus ATCC 38327 alkylated DNA repair protein AlkB (378 aa)
      Bden1000008328    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Bden1000008328          371   eukaryota>fungi>chytridiomycota                       Batrachochytrium dendrobatidis                  Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (371 aa)
      Falb1000005587    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Falb1000005587          452   eukaryota>nucleariidae_and_fonticula                  Fonticula alba                                  Fonticula alba ATCC 38817 (V2) hypothetical protein (452 aa)
      Falb1000005586    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Falb1000005586          411   eukaryota>nucleariidae_and_fonticula                  Fonticula alba                                  Fonticula alba ATCC 38817 (V2) hypothetical protein (411 aa)
      116202811         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CHGG_09290              296   eukaryota>fungi>ascomycota                            Chaetomium globosum CBS 148.51                 hypothetical protein CHGG_09290 [Chaetomium globosum CBS 148.51].
      85095632          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NCU09779.1              366   eukaryota>fungi>ascomycota                            Neurospora crassa OR74A                        hypothetical protein [Neurospora crassa OR74A].
      Chet1000008148    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Chet1000008148          386   eukaryota>fungi>ascomycota                            Cochliobolus heterostrophus                    estExt_fgenesh1_pg.C_180212
      111061162         -                                   -                                                2OG-FeII_Oxy_2                                                                          SNOG_09947              366   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_09947 [Phaeosphaeria nodorum SN15].
      67527001          -                                   -                                                2OG-FeII_Oxy_2                                                                          AN3958.2                361   eukaryota>fungi>ascomycota                            Aspergillus nidulans FGSC A4                   hypothetical protein AN3958.2 [Aspergillus nidulans FGSC A4].
      70991683          -                                   -                                                2OG-FeII_Oxy_2                                                                          AFUA_6G07990            359   eukaryota>fungi>ascomycota                            Aspergillus fumigatus Af293                    oxidoreductase, 2OG-Fe(II) oxygenase family family [Aspergillus fumigatus Af293].
      160703884         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SNOG_13051              298   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_13051 [Phaeosphaeria nodorum SN15].
      50555041          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          YALI0F03003g            330   eukaryota>fungi>ascomycota                            Yarrowia lipolytica CLIB122                    YALI0F03003p [Yarrowia lipolytica].
      19113345          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SPBC13G1.04c            273   eukaryota>fungi>ascomycota                            Schizosaccharomyces pombe                      alkB homolog [Schizosaccharomyces pombe].
      569432089         -                                   -                                                2OG-FeII_Oxy_2                                                                          RFI_03532               449   eukaryota                                             Reticulomyxa filosa                            hypothetical protein RFI_03532 [Reticulomyxa filosa].
      528235142         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AGDE_10531              368   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 1 [Angomonas deanei].
      528232486         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AGDE_11003              368   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 1 [Angomonas deanei].
      528273468         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AGDE_01394              167   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               hypothetical protein AGDE_01394 [Angomonas deanei].
      528225497         -                                   -                                                2OG-FeII_Oxy_2                                                                          STCU_07353              386   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                            alkylated DNA repair protein alkB like protein 1 [Strigomonas culicis].
      594144045         -                                   -                                                2OG-FeII_Oxy_2                                                                          GSHART1_T00002327001    120   eukaryota>euglenozoa>kinetoplastida                   Phytomonas sp. isolate Hart1                   unnamed protein product [Phytomonas sp. isolate Hart1].
      146082486         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LINJ_16_0360            368   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                      conserved hypothetical protein [Leishmania infantum JPCM5].
      157867117         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LMJF_16_0350            368   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin               conserved hypothetical protein [Leishmania major strain Friedlin].
      72387544          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Tb927.4.460             323   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927              Alkylated DNA repair protein (alkB homolog) [Trypanosoma brucei TREU927].
      71419500          -                                   -                                                2OG-FeII_Oxy_2                                                                          Tc00.1047053510687.140  323   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             alkylated DNA repair protein [Trypanosoma cruzi strain CL Brener].
      238660742         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Smp_041490              334   eukaryota>metazoa                                     Schistosoma mansoni                            expressed protein [Schistosoma mansoni].
      674588814         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2+Transposase_mut                                                          HmN_000414100           344   eukaryota>metazoa                                     Hymenolepis microstoma                         alkylated DNA repair protein alkB [Hymenolepis microstoma].
      674564509         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EgrG_000517300          366   eukaryota>metazoa                                     Echinococcus granulosus                        alkylated DNA repair protein alkB [Echinococcus granulosus].
      576694219         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EGR_07282               347   eukaryota>metazoa                                     Echinococcus granulosus                        Alkylated DNA repair protein AlkB [Echinococcus granulosus].
      674576182         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EmuJ_000517300          366   eukaryota>metazoa                                     Echinococcus multilocularis                    conserved hypothetical protein [Echinococcus multilocularis].
      Hrob1000010247    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Hrob1000010247          263   eukaryota>metazoa>annelida                            Helobdella robusta                             123161
      158593980         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Bm1_35285               339   eukaryota>metazoa>nematoda                            Brugia malayi                                  ALKBH protein, putative [Brugia malayi].
      17537375          -                                   -                                                2OG-FeII_Oxy_2                                                                          Y51H7C.4                169   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                         hypothetical protein Y51H7C.4 [Caenorhabditis elegans].
      Adig1000007449    TM+TM+TM+TM+TM                      AlkB-2OGFEDO                                     2OG-FeII_Oxy_2+TM+TM+TM+TM+TM                                                           Adig1000007449          653   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.04821
      221130857         -                                   -                                                2OG-FeII_Oxy_2                                                                          LOC100210027            350   eukaryota>metazoa>cnidaria                            Hydra magnipapillata                           PREDICTED: similar to predicted protein [Hydra magnipapillata].
      156222983         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       NEMVEDRAFT_v1g96737     323   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      Bnat1000003625    -                                   -                                                2OG-FeII_Oxy_2                                                                          Bnat1000003625          292   eukaryota>rhizaria>cercozoa                           Bigelowiella natans                            fgenesh1_pg.23_#_199
      Aque1000008145    -                                   BetaPropeller                                    2OG-FeII_Oxy_2+WD40                                                                     Aque1000008145          303   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.208392
      115681547         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC579973               488   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: alkylated DNA repair protein alkB homolog 1 [Strongylocentrotus purpuratus].
      Aque1000020897    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Aque1000020897          420   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.221165
      198417894         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100186336            312   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: alkylated DNA repair protein alkB homolog 1 [Ciona intestinalis].
      119115270         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AgaP_AGAP000155         293   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP000155-PA [Anopheles gambiae str. PEST].
      Lgig1000020166    -                                   -                                                2OG-FeII_Oxy_2                                                                          Lgig1000020166          349   eukaryota>metazoa>mollusca                            Lottia gigantea                                estExt_fgenesh2_pg.C_sca_10395
      Caps1000009657    -                                   -                                                2OG-FeII_Oxy_2                                                                          Caps1000009657          317   eukaryota>metazoa>annelida                            Capitella spI                                  fgenesh1_pg.C_scaffold_335000028
      190584785         -                                   SbcC+AlkB-2OGFEDO                                2OG-FeII_Oxy_2                                                                          TRIADDRAFT_24904        271   eukaryota>metazoa>placozoa                            Trichoplax adhaerens                           hypothetical protein TRIADDRAFT_24904, partial [Trichoplax adhaerens].
      91080539          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC661543               297   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: similar to AlkB CG33250-PA [Tribolium castaneum].
      45555401          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Dmel_CG33250            332   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        AlkB [Drosophila melanogaster].
      193641120         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100160270            300   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: similar to AlkB CG33250-PA [Acyrthosiphon pisum].
      156542602         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100120538            252   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                            PREDICTED: similar to HDC19127 [Nasonia vitripennis].
      66515297          -                                   -                                                2OG-FeII_Oxy_2                                                                          AlkB                    310   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: alkylated DNA repair protein alkB homolog 1 [Apis mellifera].
      307171459         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EAG_03150               306   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Alkylated DNA repair protein alkB-like protein 1 [Camponotus floridanus].
      307202053         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EAI_02538               308   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          Alkylated DNA repair protein alkB-like protein 1 [Harpegnathos saltator].
      Smar1000008099    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Smar1000008099          279   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                             SMAR006255-PA pep:novel scaffold:Smar1:JH431682:46249:48654:-1 gene:SMAR006255 transcript:SMAR006255-RA
      321456927         -                                   -                                                2OG-FeII_Oxy_2                                                                          DAPPUDRAFT_218420       288   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_218420 [Daphnia pulex].
      210123411         -                                   HNH-HHH+AlkB-2OGFEDO                             2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_119815       365   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_119815 [Branchiostoma floridae].
      210123430         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_262043       343   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_262043, partial [Branchiostoma floridae].
      291242901         -                                   -                                                2OG-FeII_Oxy_2                                                                          LOC100369152            416   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: alkylated DNA repair protein alkB homolog 1-like [Saccoglossus kowalevskii].
      66472360          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          alkbh1                  363   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    alkylated DNA repair protein alkB homolog 1 [Danio rerio].
      47206791          SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       GSTEN:00005439:G:001    351   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      326920865         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100546801            365   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: alkylated DNA repair protein alkB homolog 1-like [Meleagris gallopavo].
      224051564         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH1                  367   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: alkylated DNA repair protein alkB homolog 1 [Taeniopygia guttata].
      71895969          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH1                  371   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  alkylated DNA repair protein alkB homolog 1 [Gallus gallus].
      327259296         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       alkbh1                  370   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: alkylated DNA repair protein alkB homolog 1 isoform X1 [Anolis carolinensis].
      87298840          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH1                  389   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alkylated DNA repair protein alkB homolog 1 [Homo sapiens].
      114654177         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH1                  389   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alkylated DNA repair protein alkB homolog 1 [Pan troglodytes].
      109478500         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Alkbh_predicted         389   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to Alkylated DNA repair protein alkB homolog [Rattus norvegicus].
      Fcyl1000037010    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Fcyl1000037010          279   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.26.280.1
      Fcyl1000031479    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Fcyl1000031479          279   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.26_#_236
      226523214         METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          MICPUN_60739            684   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      485641088         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_227562       256   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_227562 [Emiliania huxleyi CCMP1516].
      303279975         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          MICPUCDRAFT_69712       167   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      226518964         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          MICPUN_108675           334   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      Pram1000014720    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pram1000014720          217   eukaryota>stramenopiles                               Phytophthora ramorum                           50629
      Psoj1000010940    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Psoj1000010940          292   eukaryota>stramenopiles                               Phytophthora sojae                             138234
      301110256         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PITG_08013              292   eukaryota>stramenopiles                               Phytophthora infestans T30-4                   conserved hypothetical protein [Phytophthora infestans T30-4].
      302838827         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       VOLCADRAFT_101962       250   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_101962 [Volvox carteri f. nagariensis].
      162676748         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PHYPADRAFT_16622        253   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein, partial [Physcomitrella patens].
      186519239         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       AT5G01780               387   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      30683015          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AT3G14160               313   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      15231791          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AT3G14140               452   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      284096896         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_45239        294   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      323449197         SP+TET-JBP                          ZNF+TRUA+TET-JBP+AlkB-2OGFEDO                    SP+2OG-FeII_Oxy_2                                                                       AURANDRAFT_66742        1793  eukaryota>stramenopiles                               Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_66742 [Aureococcus anophagefferens].
      
      # SAD fused group (Note ZnR after SAD reported in original paper)
      Spun1000007354    -                                   SAD+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          Spun1000007354          742   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (742 aa)
      Spun1000003257    -                                   SWC3+AlkB-2OGFEDO                                2OG-FeII_Oxy_2                                                                          Spun1000003257          341   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (341 aa)
      Ccor1000006818    -                                   SAD+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          Ccor1000006818          546   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         fgenesh1_pg.165_#_2
      88182299          -                                   CDC27+SAD+AlkB-2OGFEDO                           2OG-FeII_Oxy_2                                                                          CHGG_06386              724   eukaryota>fungi>ascomycota                            Chaetomium globosum CBS 148.51                 hypothetical protein CHGG_06386 [Chaetomium globosum CBS 148.51].
      169600785         -                                   SAD+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          SNOG_03244              978   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_03244 [Phaeosphaeria nodorum SN15].
      67526987          SP                                  SAD+AlkB-2OGFEDO                                 SP+2OG-FeII_Oxy_2+2OG-FeII_Oxy_2                                                        AN3951.2                686   eukaryota>fungi>ascomycota                            Aspergillus nidulans FGSC A4                   hypothetical protein AN3951.2 [Aspergillus nidulans FGSC A4].
      70998340          -                                   SAD+DDRP-ZNR+AlkB-2OGFEDO                        2OG-FeII_Oxy_2                                                                          AFUA_5G07420            465   eukaryota>fungi>ascomycota                            Aspergillus fumigatus Af293                    hypothetical protein AFUA_5G07420 [Aspergillus fumigatus Af293].
      527303756         -                                   SAD+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          FOMPIDRAFT_1156757      567   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_1156757 [Fomitopsis pinicola FP-58527 SS1].
      164647611         SP                                  SAD+AlkB-2OGFEDO                                 SP+2OG-FeII_Oxy_2                                                                       LACBIDRAFT_313729       1047  eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein [Laccaria bicolor S238N-H82].
      Abis1000004299    -                                   PAF1+SAD+AlkB-2OGFEDO                            2OG-FeII_Oxy_2                                                                          Abis1000004299          1014  eukaryota>fungi>basidiomycota                         Agaricus bisporus                              Genemark.4150_g
      116508555         -                                   SAD+AlkB-2OGFEDO+AlkB-2OGFEDO                    -                                                                                       CC1G_01939              500   eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               predicted protein [Coprinopsis cinerea okayama7#130].
      # Although grouped with above searches put them closer to AlkBH3
      485611490         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_57409        94    eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_57409, partial [Emiliania huxleyi CCMP1516].
      551605040         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_58725        85    eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_58725, partial [Emiliania huxleyi CCMP1516].
      Fcyl1000122023    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000122023          121   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.28.330.1
      Fcyl1000032066    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Fcyl1000032066          429   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.28_#_266
      Fcyl1000030685    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000030685          495   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.24_#_31
      Fcyl1000120875    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000120875          150   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.24.318.1
      
      #; AlkBH2
      569436514         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          RFI_01908               150   eukaryota                                             Reticulomyxa filosa                            hypothetical protein RFI_01908 [Reticulomyxa filosa].
      156221236         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       NEMVEDRAFT_v1g101914    212   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein [Nematostella vectensis].
      Lgig1000002422    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Lgig1000002422          242   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.2.1002.1
      210111343         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       BRAFLDRAFT_83601        267   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_83601 [Branchiostoma floridae].
      219447363         -                                   MYB+MYB+POTRA+POTRA+AlkB-2OGFEDO                 Myb_DNA-bind_6+Myb_DNA-binding+Myb_DNA-binding+YppG+Cmyb_C+2OG-FeII_Oxy_2               BRAFLDRAFT_123541       844   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_123541 [Branchiostoma floridae].
      115953792         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC593121               265   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to AlkB, alkylation repair homolog 2 (E. coli) [Strongylocentrotus purpuratus].
      Adig1000021749    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Adig1000021749          408   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.02480
      Aque1000029667    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Aque1000029667          264   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.229935
      114646812         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH2                  261   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 isoform 1 [Pan troglodytes].
      48717226          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH2                  261   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 isoform 1 [Homo sapiens].
      61098162          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Alkbh2                  239   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 [Mus musculus].
      109497502         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Alkbh2_predicted        239   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to alkB, alkylation repair homolog 2 [Rattus norvegicus].
      118098574         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH2                  241   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: hypothetical protein [Gallus gallus].
      326929758         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH2                  243   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 isoform X1 [Meleagris gallopavo].
      224071680         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH2                  256   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 [Taeniopygia guttata].
      68383159          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          si:dkey-65b12.2         258   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 [Danio rerio].
      47226495          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSTEN:00029512:G:001    257   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      193606231         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100162992            218   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: similar to alkB, alkylation repair homolog 2 [Acyrthosiphon pisum].
      189241463         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       LOC662784               197   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: similar to alkB, alkylation repair homolog 2 [Tribolium castaneum].
      163776713         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       MONBRDRAFT_16363        180   eukaryota>choanoflagellida                            Monosiga brevicollis MX1                       predicted protein, partial [Monosiga brevicollis MX1].
      514688869         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PTSG_06623              325   eukaryota>choanoflagellida                            Salpingoeca rosetta                            Alkbh2 protein [Salpingoeca rosetta].
      221132913         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       LOC100215008            235   eukaryota>metazoa>cnidaria                            Hydra vulgaris                                 PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2-like [Hydra vulgaris].
      Bnat1000020289    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bnat1000020289          95    eukaryota>rhizaria>cercozoa                           Bigelowiella natans                            gw1.85.31.1
      
      
      #; Note group shows fusions to amidohydrolase and GST,  new removal pathway? ?, possible related to AlkBH2
      Bnat1000011351    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bnat1000011351          425   eukaryota>rhizaria>cercozoa                           Bigelowiella natans                            estExt_fgenesh1_pg.C_180193
      220976009         TM                                  AlkB-2OGFEDO                                     UPF0029+2OG-FeII_Oxy_2+TM                                                               THAPSDRAFT_21832        568   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      Fcyl1000027737    -                                   AlkB-2OGFEDO                                     UPF0029+2OG-FeII_Oxy_2                                                                  Fcyl1000027737          482   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       e_gw1.15.516.1
      46136713          -                                   AlkB-2OGFEDO                                     Isochorismatase+2OG-FeII_Oxy_2+GST_C_2                                                  FG09872.1               927   eukaryota>fungi>ascomycota                            Fusarium graminearum PH-1                      hypothetical protein FG09872.1 [Fusarium graminearum PH-1].
      116201083         -                                   MIP-T3+AlkB-2OGFEDO                              Isochorismatase+Herpes_BLLF1+2OG-FeII_Oxy_2                                             CHGG_08426              990   eukaryota>fungi>ascomycota                            Chaetomium globosum CBS 148.51                 hypothetical protein CHGG_08426 [Chaetomium globosum CBS 148.51].
      85095350          SP                                  AlkB-2OGFEDO                                     SP+Isochorismatase+2OG-FeII_Oxy_2+DUF4358                                               NCU05807                1166  eukaryota>fungi>ascomycota                            Neurospora crassa OR74A                        isochorismatase family protein [Neurospora crassa OR74A].
      67524139          -                                   SWC3+AlkB-2OGFEDO                                Isochorismatase+2OG-FeII_Oxy_2+GST_C_2                                                  AN2527.2                817   eukaryota>fungi>ascomycota                            Aspergillus nidulans FGSC A4                   hypothetical protein AN2527.2 [Aspergillus nidulans FGSC A4].
      70998909          SP                                  AlkB-2OGFEDO                                     SP+Isochorismatase+2OG-FeII_Oxy_2+GST_C_2                                               AFUA_3G14500            828   eukaryota>fungi>ascomycota                            Aspergillus fumigatus Af293                    isochorismatase family protein family [Aspergillus fumigatus Af293].
      Chet1000005801    -                                   AlkB-2OGFEDO                                     Isochorismatase+2OG-FeII_Oxy_2                                                          Chet1000005801          879   eukaryota>fungi>ascomycota                            Cochliobolus heterostrophus                    estExt_Genewise1Plus.C_130166
      169622270         -                                   AlkB-2OGFEDO                                     Isochorismatase+Isochorismatase+2OG-FeII_Oxy_2+GST_C_2                                  SNOG_14354              1122  eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_14354 [Phaeosphaeria nodorum SN15].
      Abis1000009270    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Abis1000009270          348   eukaryota>fungi>basidiomycota                         Agaricus bisporus                              estExt_fgenesh2_pg.C_130160
      527304243         -                                   AlkB-2OGFEDO+SbcC                                2OG-FeII_Oxy_2                                                                          FOMPIDRAFT_1112477      295   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_1112477 [Fomitopsis pinicola FP-58527 SS1].
      170104308         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LACBIDRAFT_329183       333   eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein [Laccaria bicolor S238N-H82].
      116500656         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CC1G_04807              347   eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               isochorismatase [Coprinopsis cinerea okayama7#130].
      
      Sarc1000013250    -                                   -                                                FTO_CTD                                                                                 Sarc1000013250          182   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (182 aa)
      #; Kinetoplastid-only
      528262236         -                                   -                                                2OG-FeII_Oxy_2                                                                          AGDE_05127              309   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               hypothetical protein AGDE_05127 [Angomonas deanei].
      528229076         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       STCU_06661              309   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                            hypothetical protein STCU_06661 [Strigomonas culicis].
      594143793         -                                   -                                                2OG-FeII_Oxy_2                                                                          GSHART1_T00002652001    311   eukaryota>euglenozoa>kinetoplastida                   Phytomonas sp. isolate Hart1                   unnamed protein product [Phytomonas sp. isolate Hart1].
      146100771         -                                   -                                                2OG-FeII_Oxy_2                                                                          LINJ_35_1290            318   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                      conserved hypothetical protein [Leishmania infantum JPCM5].
      70905714          -                                   -                                                2OG-FeII_Oxy_2                                                                          LMJ_1059                318   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin               hypothetical protein, conserved [Leishmania major strain Friedlin].
      71423866          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Tc00.1047053508303.40   305   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71425222          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Tc00.1047053511179.120  305   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      72388954          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Tb927.5.980             305   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927              hypothetical protein [Trypanosoma brucei brucei TREU927].
      # ; 
      Fcyl1000018322    -                                   -                                                2OG-FeII_Oxy_2                                                                          Fcyl1000018322          418   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.3.1101.1
      300259504         SP                                  RAD18+SWC3+AlkB-2OGFEDO+POTRA                    SP+2OG-FeII_Oxy_2                                                                       VOLCADRAFT_119009       753   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_119009 [Volvox carteri f. nagariensis].
      551590411         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_450175       375   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_450175 [Emiliania huxleyi CCMP1516].
      323449058         SP+TM+TM+TM+TM+TM+TM+TM+TM+TM+TM    VgrG+AlkB-2OGFEDO                                SP+Beta_helix+Beta_helix+TM+TM+TM+TM+TM+TM+TM+TM+Drf_FH1+DUF488+2OG-FeII_Oxy_2+TM+TM    AURANDRAFT_66792        2180  eukaryota>stramenopiles                               Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_66792 [Aureococcus anophagefferens].
      226520209         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       MICPUN_62550            353   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      226458145         -                                   -                                                2OG-FeII_Oxy_2                                                                          MICPUCDRAFT_60114       377   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      149262140         -                                   -                                                -                                                                                       LOC100045202            127   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   PREDICTED: hypothetical protein [Mus musculus].
      284091784         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_66602        226   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      485643787         -                                   -                                                2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_251931       163   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_251931 [Emiliania huxleyi CCMP1516].
      
      # ; ALKBH7 found in kinetoplastids
      326428939         SP                                  POTRA+POTRA                                      SP                                                                                      PTSG_05873              295   eukaryota>choanoflagellida                            Salpingoeca rosetta                            hypothetical protein PTSG_05873 [Salpingoeca rosetta].
      163776258         -                                   -                                                -                                                                                       MONBRDRAFT_24773        212   eukaryota>choanoflagellida                            Monosiga brevicollis MX1                       predicted protein [Monosiga brevicollis MX1].
      551558472         -                                   -                                                2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_212276       291   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     alkB, alkylation repair 7 [Emiliania huxleyi CCMP1516].
      551626318         -                                   -                                                2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_251025       188   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_251025 [Emiliania huxleyi CCMP1516].
      Uram1000003343    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Uram1000003343          258   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.14_#_839_#_combest_scaffold_14_22505
      Crev1000000227    -                                   -                                                -                                                                                       Crev1000000227          323   eukaryota>fungi>kickxellomycotina                     Coemansia reversa                              fgenesh1_kg.1_#_172_#_isotig04968
      Mver1000004713    SP                                  -                                                SP                                                                                      Mver1000004713          324   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (324 aa)
      Spun1000001767    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Spun1000001767          259   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (259 aa)
      Ccor1000005156    -                                   -                                                -                                                                                       Ccor1000005156          247   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         estExt_Genemark1.C_990013
      Wseb1000002511    -                                   -                                                2OG-FeII_Oxy_2                                                                          Wseb1000002511          272   eukaryota>fungi>basidiomycota                         Wallemia sebi                                  gm1.2421_g
      527292773         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       FOMPIDRAFT_130573       247   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_130573 [Fomitopsis pinicola FP-58527 SS1].
      164647071         -                                   -                                                2OG-FeII_Oxy_2                                                                          LACBIDRAFT_316029       234   eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein [Laccaria bicolor S238N-H82].
      Abis1000001809    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Abis1000001809          248   eukaryota>fungi>basidiomycota                         Agaricus bisporus                              estExt_Genewise1.C_21079
      220730306         -                                   -                                                2OG-FeII_Oxy_2                                                                          POSPLDRAFT_94548        259   eukaryota>fungi>basidiomycota                         Postia placenta Mad-698-R                      predicted protein [Postia placenta Mad-698-R].
      58265654          SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       CNC05880                287   eukaryota>fungi>basidiomycota                         Cryptococcus neoformans var. neoformans JEC21  hypothetical protein CNC05880 [Cryptococcus neoformans var. neoformans JEC21].
      Falb1000000067    SP                                  -                                                SP                                                                                      Falb1000000067          377   eukaryota>nucleariidae_and_fonticula                  Fonticula alba                                  Fonticula alba ATCC 38817 (V2) hypothetical protein (377 aa)
      Amac1000008462    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Amac1000008462          281   eukaryota>fungi>blastocladiomycota                    Allomyces macrogynus                            Allomyces macrogynus ATCC 38327 hypothetical protein (281 aa)
      320163657         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       CAOG_01081              278   eukaryota                                             Capsaspora owczarzaki ATCC 30864               hypothetical protein CAOG_01081 [Capsaspora owczarzaki ATCC 30864].
      71661420          -                                   -                                                -                                                                                       Tc00.1047053511211.100  280   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      74025276          SP                                  -                                                SP                                                                                      Tb11.01.3200            272   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927              hypothetical protein [Trypanosoma brucei brucei TREU927].
      528238560         SP                                  -                                                SP                                                                                      STCU_04463              271   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                            alkylated DNA repair protein alkB like protein 7 [Strigomonas culicis].
      528226932         SP                                  -                                                SP                                                                                      AGDE_12223              300   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 7 [Angomonas deanei].
      528261529         SP                                  -                                                SP                                                                                      AGDE_05364              266   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 7 [Angomonas deanei].
      528264616         SP                                  -                                                SP                                                                                      AGDE_04331              266   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 7 [Angomonas deanei].
      157872016         SP                                  -                                                SP                                                                                      LMJF_28_2710            286   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin               conserved hypothetical protein [Leishmania major strain Friedlin].
      146092519         SP                                  -                                                SP                                                                                      LINJ_28_2910            286   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                      conserved hypothetical protein [Leishmania infantum JPCM5].
      594147457         -                                   -                                                -                                                                                       GSHART1_T00005476001    300   eukaryota>euglenozoa>kinetoplastida                   Phytomonas sp. isolate Hart1                   unnamed protein product [Phytomonas sp. isolate Hart1].
      110758828         -                                   -                                                -                                                                                       LOC411448               185   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: similar to Y46G5A.35 [Apis mellifera].
      307186823         -                                   -                                                2OG-FeII_Oxy_2                                                                          EAG_04133               241   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Alkylated DNA repair protein alkB-like protein 7 [Camponotus floridanus].
      307191857         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       EAI_03104               229   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          Alkylated DNA repair protein alkB-like protein 7 [Harpegnathos saltator].
      238661944         SP                                  SIN18+SIN3A                                      SP+SAP18                                                                                Smp_150930              600   eukaryota>metazoa                                     Schistosoma mansoni                            conserved hypothetical protein [Schistosoma mansoni].
      238652610         -                                   -                                                -                                                                                       Smp_194350              160   eukaryota>metazoa                                     Schistosoma mansoni                            conserved hypothetical protein, partial [Schistosoma mansoni].
      674575674         SP                                  -                                                SP                                                                                      EMUJ_000641600          283   eukaryota>metazoa                                     Echinococcus multilocularis                    alpha ketoglutarate dependent [Echinococcus multilocularis].
      674567274         SP                                  IF2-HTH                                          SP                                                                                      EgrG_000641600          283   eukaryota>metazoa                                     Echinococcus granulosus                        alpha ketoglutarate dependent [Echinococcus granulosus].
      576699638         -                                   IF2-HTH                                          Pkinase_Tyr                                                                             EGR_01968               605   eukaryota>metazoa                                     Echinococcus granulosus                        Alkylated DNA repair protein AlkB [Echinococcus granulosus].
      674595975         SP                                  -                                                SP                                                                                      HmN_000005200           264   eukaryota>metazoa                                     Hymenolepis microstoma                         alpha ketoglutarate dependent [Hymenolepis microstoma].
      193681067         -                                   -                                                -                                                                                       LOC100161950            373   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum].
      291224759         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       LOC100372060            245   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: spermatogenesis associated 11-like [Saccoglossus kowalevskii].
      Smar1000006659    -                                   -                                                2OG-FeII_Oxy_2                                                                          Smar1000006659          178   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                             SMAR007432-PA pep:novel scaffold:Smar1:JH431789:114970:119470:-1 gene:SMAR007432 transcript:SMAR007432-RA
      Lgig1000004215    -                                   -                                                2OG-FeII_Oxy_2                                                                          Lgig1000004215          176   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.28.427.1
      189239805         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       LOC100141950            230   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial isoform X1 [Tribolium castaneum].
      Caps1000000823    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Caps1000000823          234   eukaryota>metazoa>annelida                            Capitella spI                                  estExt_Genewise1.C_120131
      71998257          -                                   -                                                2OG-FeII_Oxy_2                                                                          CELE_Y46G5A.35          227   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                         Y46G5A.35 [Caenorhabditis elegans].
      170571892         -                                   -                                                2OG-FeII_Oxy_2                                                                          Bm1_02010               212   eukaryota>metazoa>nematoda                            Brugia malayi                                  spermatogenesis associated 11 [Brugia malayi].
      Aque1000017310    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Aque1000017310          265   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.217578
      196010015         -                                   -                                                2OG-FeII_Oxy_2                                                                          TRIADDRAFT_28544        211   eukaryota>metazoa>placozoa                            Trichoplax adhaerens                           hypothetical protein TRIADDRAFT_28544 [Trichoplax adhaerens].
      156217303         -                                   -                                                2OG-FeII_Oxy_2                                                                          NEMVEDRAFT_v1g114209    185   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      198434437         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       LOC100180366            264   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial-like [Ciona intestinalis].
      327264013         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       alkbh7                  222   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial [Anolis carolinensis].
      109513091         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       LOC679944               221   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to spermatogenesis associated 11 isoform 2 [Rattus norvegicus].
      109486536         SP                                  SHS2                                             SP+2OG-FeII_Oxy_2                                                                       LOC681562               163   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to spermatogenesis associated 11 isoform 1 [Rattus norvegicus].
      21313470          SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Alkbh7                  221   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial isoform 1 precursor [Mus musculus].
      114674893         -                                   -                                                2OG-FeII_Oxy_2                                                                          ALKBH7                  221   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial isoform X2 [Pan troglodytes].
      14150066          -                                   -                                                2OG-FeII_Oxy_2                                                                          ALKBH7                  221   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial precursor [Homo sapiens].
      47214908          SP                                  -                                                SP                                                                                      GSTEN:00023729:G:001    228   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product [Tetraodon nigroviridis].
      62955187          -                                   -                                                2OG-FeII_Oxy_2                                                                          alkbh7                  233   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial [Danio rerio].
      321472218         -                                   -                                                -                                                                                       DAPPUDRAFT_100794       182   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_100794 [Daphnia pulex].
      115894368         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       LOC757284               203   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to AlkB, alkylation repair homolog 7 (E. coli), partial [Strongylocentrotus purpuratus].
      210122389         -                                   -                                                2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_262579       181   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_262579 [Branchiostoma floridae].
      219443919         -                                   -                                                2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_266169       181   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_266169 [Branchiostoma floridae].
      Hrob1000010803    -                                   -                                                -                                                                                       Hrob1000010803          205   eukaryota>metazoa>annelida                            Helobdella robusta                             76748
      85726474          -                                   -                                                -                                                                                       Dmel_CG14130            255   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG14130 [Drosophila melanogaster].
      118781137         -                                   -                                                -                                                                                       AgaP_AGAP000760         258   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP000760-PA [Anopheles gambiae str. PEST].
      Psoj1000016465    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Psoj1000016465          247   eukaryota>stramenopiles                               Phytophthora sojae                             144652
      Psoj1000016470    METHYLASE                           METHYLASE                                        Methyltransf_16+2OG-FeII_Oxy_2                                                          Psoj1000016470          361   eukaryota>stramenopiles                               Phytophthora sojae                             144657
      Pram1000000142    SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       Pram1000000142          245   eukaryota>stramenopiles                               Phytophthora ramorum                           84939
      262110203         SP                                  -                                                SP                                                                                      PITG_04676              246   eukaryota>stramenopiles                               Phytophthora infestans T30-4                   conserved hypothetical protein [Phytophthora infestans T30-4].
      220978082         -                                   -                                                Urocanase+2OG-FeII_Oxy_2                                                                THAPSDRAFT_31380        177   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335              predicted protein, partial [Thalassiosira pseudonana CCMP1335].
      Fcyl1000050013    -                                   -                                                -                                                                                       Fcyl1000050013          179   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.2.1691.1
      Fcyl1000106316    -                                   -                                                -                                                                                       Fcyl1000106316          280   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.2.998.1
      Fcyl1000016334    TM                                  -                                                TM                                                                                      Fcyl1000016334          354   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.2_#_308
      217403334         SP                                  -                                                SP                                                                                      PHATRDRAFT_50302        249   eukaryota>stramenopiles                               Phaeodactylum tricornutum CCAP 1055/1          predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      
      #; ALKBH4 -- found in kinetoplastids
      403343479         -                                   -                                                2OG-FeII_Oxy                                                                            OXYTRI_08063            342   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            hypothetical protein OXYTRI_08063 (macronuclear) [Oxytricha trifallax].
      403359506         -                                   -                                                -                                                                                       OXYTRI_23312            335   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            hypothetical protein OXYTRI_23312 (macronuclear) [Oxytricha trifallax].
      Caps1000011463    -                                   -                                                -                                                                                       Caps1000011463          319   eukaryota>metazoa>annelida                            Capitella spI                                  estExt_Genewise1.C_2310033
      219504515         -                                   -                                                -                                                                                       BRAFLDRAFT_256230       304   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_256230 [Branchiostoma floridae].
      210116659         -                                   AlkB-2OGFEDO                                     -                                                                                       BRAFLDRAFT_219277       303   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_219277 [Branchiostoma floridae].
      156226754         -                                   PLC                                              2OG-FeII_Oxy_2                                                                          NEMVEDRAFT_v1g34785     267   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein, partial [Nematostella vectensis].
      Lgig1000007221    -                                   -                                                2OG-FeII_Oxy                                                                            Lgig1000007221          273   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.180.14.1
      Adig1000021750    -                                   -                                                2OG-FeII_Oxy_2                                                                          Adig1000021750          294   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.02479
      291242331         -                                   Nimm60                                           2OG-FeII_Oxy_2                                                                          LOC100367945            270   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: alkB, alkylation repair homolog 4-like [Saccoglossus kowalevskii].
      221106579         -                                   -                                                -                                                                                       LOC100204004            272   eukaryota>metazoa>cnidaria                            Hydra vulgaris                                 PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4-like [Hydra vulgaris].
      326433577         -                                   -                                                2OG-FeII_Oxy_2                                                                          PTSG_09879              270   eukaryota>choanoflagellida                            Salpingoeca rosetta                            hypothetical protein PTSG_09879 [Salpingoeca rosetta].
      167524358         -                                   -                                                -                                                                                       MONBRDRAFT_26078        207   eukaryota>choanoflagellida                            Monosiga brevicollis MX1                       hypothetical protein [Monosiga brevicollis MX1].
      557636210         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          HmN_000938900           328   eukaryota>metazoa                                     Hymenolepis microstoma                         alpha ketoglutarate dependent [Hymenolepis microstoma].
      Aque1000014704    -                                   -                                                2OG-FeII_Oxy                                                                            Aque1000014704          284   eukaryota>metazoa>porifera                            Amphimedon queenslandica                       Aqu1.214972
      198434600         -                                   AlkB-2OGFEDO                                     -                                                                                       LOC100186431            303   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Ciona intestinalis].
      91087937          -                                   AlkB-2OGFEDO                                     -                                                                                       LOC660615               306   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Tribolium castaneum].
      158298439         SP                                  -                                                SP                                                                                      AgaP_AGAP009588         315   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP009588-PA [Anopheles gambiae str. PEST].
      156547970         -                                   AlkB-2OGFEDO                                     -                                                                                       LOC100121480            291   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                            PREDICTED: similar to ENSANGP00000020936 [Nasonia vitripennis].
      66554392          -                                   -                                                -                                                                                       LOC551104               296   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Apis mellifera].
      307200597         -                                   -                                                -                                                                                       EAI_15196               240   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          Alkylated DNA repair protein alkB-like protein 4 [Harpegnathos saltator].
      307183142         -                                   -                                                -                                                                                       EAG_14740               298   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Alkylated DNA repair protein alkB-like protein 4 [Camponotus floridanus].
      321472550         SP                                  -                                                SP+2OG-FeII_Oxy                                                                         DAPPUDRAFT_48015        293   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_48015 [Daphnia pulex].
      24583140          -                                   -                                                -                                                                                       Dmel_CG4036             304   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG4036, isoform A [Drosophila melanogaster].
      193599006         -                                   -                                                2OG-FeII_Oxy                                                                            LOC100167916            299   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum].
      224076156         -                                   -                                                2OG-FeII_Oxy                                                                            LOC100229056            389   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: alkB, alkylation repair homolog 4 (E. coli) [Taeniopygia guttata].
      326931240         -                                   -                                                -                                                                                       LOC100543581            289   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4-like [Meleagris gallopavo].
      118100089         -                                   -                                                HpaP                                                                                    ALKBH4                  486   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: hypothetical protein [Gallus gallus].
      8923019           -                                   -                                                -                                                                                       ALKBH4                  302   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Homo sapiens].
      110625894         -                                   -                                                -                                                                                       Alkbh4                  215   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Mus musculus].
      109497175         -                                   -                                                -                                                                                       Alkbh4_predicted        301   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to CG4036-PA [Rattus norvegicus].
      68372246          SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       alkbh4                  315   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 isoform X2 [Danio rerio].
      25148697          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CELE_F09F7.7            291   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                         F09F7.7, isoform a [Caenorhabditis elegans].
      170579486         -                                   -                                                -                                                                                       Bm1_16965               297   eukaryota>metazoa>nematoda                            Brugia malayi                                  LD42289p [Brugia malayi].
      239897326         -                                   -                                                2OG-FeII_Oxy_2                                                                          Pmar_PMAR003551         252   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983                   conserved hypothetical protein [Perkinsus marinus ATCC 50983].
      71747664          -                                   -                                                2OG-FeII_Oxy_2                                                                          Tb10.70.0360            304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927              hypothetical protein [Trypanosoma brucei brucei TREU927].
      71659461          -                                   -                                                2OG-FeII_Oxy                                                                            Tc00.1047053510187.490  304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      146104297         -                                   -                                                2OG-FeII_Oxy_2                                                                          LINJ_36_2080            297   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                      conserved hypothetical protein [Leishmania infantum JPCM5].
      157876860         -                                   -                                                2OG-FeII_Oxy_2                                                                          LMJF_36_1970            297   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin               conserved hypothetical protein [Leishmania major strain Friedlin].
      528254392         -                                   -                                                2OG-FeII_Oxy_2                                                                          STCU_00887              304   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                            alkylated DNA repair protein alkB like protein 4 [Strigomonas culicis].
      528266291         -                                   -                                                2OG-FeII_Oxy_2                                                                          AGDE_03849              297   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 4 [Angomonas deanei].
      528238215         -                                   -                                                2OG-FeII_Oxy_2                                                                          AGDE_09971              297   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 4 [Angomonas deanei].
      528261759         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       AGDE_05286              210   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                               alkylated DNA repair protein alkB like protein 4 [Angomonas deanei].
      672578582                                                                                                                                                                                      TGMAS_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii MAS                          hypothetical protein TGMAS_246140 [Toxoplasma gondii MAS].
      523576915                                                                                                                                                                                      TGGT1_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii GT1                          hypothetical protein TGGT1_246140 [Toxoplasma gondii GT1].
      672285053                                                                                                                                                                                      TGFOU_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii FOU                          hypothetical protein TGFOU_246140 [Toxoplasma gondii FOU].
      672573839                                                                                                                                                                                      TGVAND_246140           1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii VAND                         hypothetical protein TGVAND_246140 [Toxoplasma gondii VAND].
      672301308                                                                                                                                                                                      TGRUB_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii RUB                          hypothetical protein TGRUB_246140 [Toxoplasma gondii RUB].
      675123610                                                                                                                                                                                      HHA_246140              803   eukaryota>alveolata>apicomplexa                       Hammondia hammondi                             hypothetical protein HHA_246140 [Hammondia hammondi].
      557738285                                                                                                                                                                                      TGVEG_246140            1033  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii VEG                          hypothetical protein TGVEG_246140 [Toxoplasma gondii VEG].
      237835449                                                                                                                                                                                      TGME49_046140           1033  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii ME49                         hypothetical protein TGME49_046140 [Toxoplasma gondii ME49].
      672276401                                                                                                                                                                                      TGP89_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii p89                          hypothetical protein TGP89_246140 [Toxoplasma gondii p89].
      
      #; Basal Fungal-only
      Bden1000004472    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bden1000004472          226   eukaryota>fungi>chytridiomycota                       Batrachochytrium dendrobatidis                  Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (226 aa)
      Spun1000008764    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Spun1000008764          195   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (195 aa)
      Uram1000007702    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Uram1000007702          198   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.54_#_164_#_combest_scaffold_54_109393
      Mcir1000007727    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Mcir1000007727          211   eukaryota>fungi>basal                                 Mucor circinelloides                           Mucci1.fgeneshMC_pg.8_#_475
      Bcir1000007546    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bcir1000007546          206   eukaryota>fungi>mucoromycotina                        Backusella circina                             fgenesh1_kg.13_#_14_#_Locus8775v1rpkm8.87
      Lhya1000001028    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Lhya1000001028          218   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         estExt_fgenesh1_pg.C_70030
      Mver1000006163    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Mver1000006163          222   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (222 aa)
      Ccor1000007017    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ccor1000007017          226   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         fgenesh1_pg.176_#_7
      
      #; RNA modifiying?
      284087788         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_58773        259   eukaryota>heterolobosea                               Naegleria gruberi                              hypothetical protein NAEGRDRAFT_58773 [Naegleria gruberi].
      Ttra1000007412    -                                   AlkB-2OGFEDO+KH                                  2OG-FeII_Oxy_2                                                                          Ttra1000007412          330   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (330 aa)
      
      #; AlkBH6-like?
      325078238         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          DICPUDRAFT_99071        256   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium purpureum                        hypothetical protein DICPUDRAFT_99071 [Dictyostelium purpureum].
      66800191          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          DDB_G0293582            247   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium discoideum AX4                   2-oxoglutarate and Fe-dependent oxygenase family protein [Dictyostelium discoideum AX4].
      284085583         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_72926        288   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      Rall1000004359    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Rall1000004359          279   eukaryota>fungi>cryptomycota                          Rozella allomycis                              O9G_000840m.01
      284081917         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_54771        279   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      284094106         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_64270        279   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      89302339          SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       TTHERM_00219000         254   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  2OG-Fe(II) oxygenase family oxidoreductase (macronuclear) [Tetrahymena thermophila SB210].
      Ttra1000001477    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ttra1000001477          253   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase family Oxidoreductase (253 aa)
      
      #;ALKBH6 in kinetoplastids
      Sarc1000005302    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Sarc1000005302          256   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (256 aa)
      109148544         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH6                  266   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 isoform 2 [Homo sapiens].
      34855673          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC292780               238   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to calpain, small subunit 1 [Rattus norvegicus].
      38569508          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Alkbh6                  238   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Mus musculus].
      320169428         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CAOG_04295              256   eukaryota                                             Capsaspora owczarzaki ATCC 30864               calcium-dependent cysteine protease [Capsaspora owczarzaki ATCC 30864].
      674588781         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          HmN_000423000           268   eukaryota>metazoa                                     Hymenolepis microstoma                         nucleic acid binding [Hymenolepis microstoma].
      321463592         -                                   AlkB-2OGFEDO+SWC3                                2OG-FeII_Oxy_2                                                                          DAPPUDRAFT_307189       211   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_307189 [Daphnia pulex].
      Caps1000008447    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Caps1000008447          215   eukaryota>metazoa>annelida                            Capitella spI                                  e_gw1.16.157.1
      47196062          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSTEN:00003217:G:001    234   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      53292605          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          alkbh6                  234   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Danio rerio].
      158288561         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AgaP_AGAP003866         227   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP003866-PA [Anopheles gambiae str. PEST].
      193718445         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100164195            230   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Acyrthosiphon pisum].
      91076692          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC660463               215   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Tribolium castaneum].
      85726418          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Dmel_CG6144             228   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG6144, isoform C [Drosophila melanogaster].
      115960280         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC585652               245   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: hypothetical protein [Strongylocentrotus purpuratus].
      156544714         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100122093            231   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Nasonia vitripennis].
      110758548         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC552732               221   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: similar to calpain, small subunit 1 [Apis mellifera].
      196006752         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          TRIADDRAFT_57191        232   eukaryota>metazoa>placozoa                            Trichoplax adhaerens                           hypothetical protein TRIADDRAFT_57191 [Trichoplax adhaerens].
      156215604         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NEMVEDRAFT_v1g119331    234   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein [Nematostella vectensis].
      238652512         -                                   -                                                2OG-FeII_Oxy_2                                                                          Smp_120440.1            257   eukaryota>metazoa                                     Schistosoma mansoni                            nucleic acid binding, putative [Schistosoma mansoni].
      291241873         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100369792            243   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6-like [Saccoglossus kowalevskii].
      210129621         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       BRAFLDRAFT_202284       231   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae].
      219489319         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_244728       231   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_244728 [Branchiostoma floridae].
      Lgig1000010006    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Lgig1000010006          228   eukaryota>metazoa>mollusca                            Lottia gigantea                                fgenesh2_pg.C_sca_12000135
      168057031         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PHYPADRAFT_148636       258   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein, partial [Physcomitrella patens].
      302823387         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_136984       229   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii].
      302781915         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_98137        231   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_98137 [Selaginella moellendorffii].
      Hrob1000017985    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Hrob1000017985          223   eukaryota>metazoa>annelida                            Helobdella robusta                             86209
      325075616         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          DICPUDRAFT_21733        192   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium purpureum                        hypothetical protein DICPUDRAFT_21733, partial [Dictyostelium purpureum].
      735994710         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SAMD00019534_035740     266   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1                   hypothetical protein SAMD00019534_035740 [Acytostelium subglobosum LB1].
      281203011         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PPL_12421               251   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 hypothetical protein PPL_12421 [Polysphondylium pallidum PN500].
      545710109         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Gasu_18590              268   eukaryota>rhodophyta                                  Galdieria sulphuraria                          hypothetical protein Gasu_18590 [Galdieria sulphuraria].
      Mver1000002798    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Mver1000002798          245   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (245 aa)
      Uram1000003236    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Uram1000003236          230   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.14_#_327_#_combest_scaffold_14_20870
      Spun1000005556    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Spun1000005556          222   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (222 aa)
      111056864         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SNOG_14792              238   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_14792 [Phaeosphaeria nodorum SN15].
      Pram1000009643    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pram1000009643          198   eukaryota>stramenopiles                               Phytophthora ramorum                           73389
      262101095         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PITG_11617              231   eukaryota>stramenopiles                               Phytophthora infestans T30-4                   alkylated DNA repair protein alkB [Phytophthora infestans T30-4].
      Psoj1000004970    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Psoj1000004970          281   eukaryota>stramenopiles                               Phytophthora sojae                             131422
      71409378          SP                                  AlkB-2OGFEDO+DNAA-HTH                            SP                                                                                      Tc00.1047053503971.10   638   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      528222890         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2+Myosin_tail_1                                                         STCU_07865              661   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                            alkylated DNA repair protein alkB like protein 6 [Strigomonas culicis].
      146102795         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LINJ_36_5180            715   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                      conserved hypothetical protein [Leishmania infantum JPCM5].
      157877528         SP                                  AlkB-2OGFEDO                                     SP                                                                                      LMJF_36_4950            716   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin               conserved hypothetical protein [Leishmania major strain Friedlin].
      Ttra1000003432    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ttra1000003432          269   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 alkylated DNA repair protein alkB (269 aa)
      
      #;AT4G02485-like
      Uram1000001443    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Uram1000001443          252   eukaryota>fungi>mucoromycotina                        Umbelopsis ramanniana                          fgenesh1_kg.5_#_553_#_combest_scaffold_5_102347
      Pbla1000008773    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pbla1000008773          238   eukaryota>fungi>basal                                 Phycomyces blakesleeanus                       fgeneshPB_pg.23__224
      Lhya1000000544    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Lhya1000000544          211   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         estExt_Genemark1.C_30112
      Bcir1000016642    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bcir1000016642          204   eukaryota>fungi>mucoromycotina                        Backusella circina                             estExt_Genewise1Plus.C_7650003
      Mcir1000005378    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Mcir1000005378          234   eukaryota>fungi>basal                                 Mucor circinelloides                           Mucci1.e_gw1.4.913.1
      302757747         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_165238       225   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_165238 [Selaginella moellendorffii].
      302763591         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_230539       207   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_230539 [Selaginella moellendorffii].
      162692790         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PHYPADRAFT_26843        198   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein, partial [Physcomitrella patens].
      18411957          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AT4G02485               226   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      320166009         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CAOG_08040              304   eukaryota                                             Capsaspora owczarzaki ATCC 30864               hypothetical protein CAOG_08040 [Capsaspora owczarzaki ATCC 30864].
      119358859         -                                   AlkB-2OGFEDO                                     HEAT_EZ+2OG-FeII_Oxy_2                                                                  Ot02g04110              494   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             SelMay undefined product (IC) [Ostreococcus tauri].
      226521158         -                                   -                                                2OG-FeII_Oxy_2                                                                          MICPUN_63915            319   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      303286859         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       MICPUCDRAFT_42311       244   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      
      #; Related to above?
      167521620         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       MONBRDRAFT_24779        474   eukaryota>choanoflagellida                            Monosiga brevicollis MX1                       hypothetical protein [Monosiga brevicollis MX1].
      
      #; ALKBH8
      189530001         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    LOC556362               657   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    PREDICTED: similar to alkB, alkylation repair homolog 8 [Danio rerio].
      Ttra1000009841    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Ttra1000009841          232   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 hypothetical protein (232 aa)
      89299232          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          TTHERM_00483520         199   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila SB210                  2OG-Fe(II) oxygenase family oxidoreductase (macronuclear) [Tetrahymena thermophila SB210].
      198432246         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   2OG-FeII_Oxy_2+Methyltransf_11                                                          LOC100183670            593   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Ciona intestinalis].
      20270315          -                                   RRM                                              DUF1891+RRM_1                                                                           ALKBH8                  238   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alkB, alkylation repair homolog 8 [Homo sapiens].
      114565310         -                                   -                                                -                                                                                       LOC736490               220   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Pan troglodytes].
      169162451         -                                   -                                                -                                                                                       LOC646804               220   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Homo sapiens].
      569418308         -                                   IG+AlkB-2OGFEDO                                  PCEMA1+2OG-FeII_Oxy_2                                                                   RFI_09511               706   eukaryota                                             Reticulomyxa filosa                            hypothetical protein RFI_09511 [Reticulomyxa filosa].
      209556865         -                                   RRM+AlkB-2OGFEDO                                 RRM_5+2OG-FeII_Oxy_2                                                                    CMU_032950              332   eukaryota>alveolata>apicomplexa                       Cryptosporidium muris RN66                     oxidoreductase, 2og-Fe(II) oxygenase family protein [Cryptosporidium muris RN66].
      46229757          -                                   RRM+AlkB-2OGFEDO                                 RRM_5+2OG-FeII_Oxy_2                                                                    cgd7_1000               350   eukaryota>alveolata>apicomplexa                       Cryptosporidium parvum Iowa II                 F27M3_19 plant like RRM plus AlkB domain containing protein [Cryptosporidium parvum Iowa II].
      255086679         -                                   RRM+AlkB-2OGFEDO                                 FSH1+RRM_5+2OG-FeII_Oxy_2                                                               MICPUN_86885            418   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      116060339         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ot11g00700              232   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             2-Oxoglutarate-and iron-dependent dioxygenase-related proteins (ISS) [Ostreococcus tauri].
      302797440         -                                   RRM+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          SELMODRAFT_112315       315   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_112315, partial [Selaginella moellendorffii].
      302758364         -                                   RRM+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          SELMODRAFT_78643        315   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_78643, partial [Selaginella moellendorffii].
      42571711          -                                   RRM+AlkB-2OGFEDO                                 RRM_5+2OG-FeII_Oxy_2                                                                    AT1G31600               431   eukaryota>viridiplantae                               Arabidopsis thaliana                           tRNA methyltransferase 9 [Arabidopsis thaliana].
      42571709          -                                   RRM+AlkB-2OGFEDO                                 RRM_5+2OG-FeII_Oxy_2                                                                    AT1G31600               344   eukaryota>viridiplantae                               Arabidopsis thaliana                           tRNA methyltransferase 9 [Arabidopsis thaliana].
      168033740         -                                   RRM+AlkB-2OGFEDO                                 RRM_1+2OG-FeII_Oxy_2                                                                    PHYPADRAFT_15669        334   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein, partial [Physcomitrella patens].
      303284329         -                                   RRM+AlkB-2OGFEDO                                 RRM_5+2OG-FeII_Oxy_2                                                                    MICPUCDRAFT_41620       408   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      239884096         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pmar_PMAR017769         325   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983                   conserved hypothetical protein [Perkinsus marinus ATCC 50983].
      551578395         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_240478       226   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_240478 [Emiliania huxleyi CCMP1516].
      72390966          -                                   -                                                2OG-FeII_Oxy_2                                                                          Tb927.7.1530            454   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927              hypothetical protein [Trypanosoma brucei brucei TREU927].
      71423311          -                                   -                                                2OG-FeII_Oxy_2                                                                          Tc00.1047053503579.130  426   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      71418255          -                                   -                                                2OG-FeII_Oxy_2                                                                          Tc00.1047053507517.110  426   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener             hypothetical protein [Trypanosoma cruzi strain CL Brener].
      157870995         -                                   -                                                2OG-FeII_Oxy_2                                                                          LMJF_26_0400            562   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin               conserved hypothetical protein [Leishmania major strain Friedlin].
      146089452         -                                   Nimm73                                           2OG-FeII_Oxy_2                                                                          LINJ_26_0390            563   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                      conserved hypothetical protein [Leishmania infantum JPCM5].
      528216286         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       STCU_09264              424   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                            hypothetical protein STCU_09264 [Strigomonas culicis].
      Ccor1000000123    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ccor1000000123          336   eukaryota>fungi>entomophthoromycota                   Conidiobolus coronatus                         fgenesh1_pg.3_#_43
      85090541          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NCU09959                290   eukaryota>fungi>ascomycota                            Neurospora crassa OR74A                        hypothetical protein NCU09959 [Neurospora crassa OR74A].
      Amac1000013082    SP                                  RRM+AlkB-2OGFEDO                                 SP+2OG-FeII_Oxy_2                                                                       Amac1000013082          375   eukaryota>fungi>blastocladiomycota                    Allomyces macrogynus                            Allomyces macrogynus ATCC 38327 hypothetical protein (375 aa)
      Spun1000006493    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Spun1000006493          383   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (383 aa)
      Bden1000007191    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bden1000007191          367   eukaryota>fungi>chytridiomycota                       Batrachochytrium dendrobatidis                  Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (367 aa)
      284094307         -                                   AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          NAEGRDRAFT_958          460   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein, partial [Naegleria gruberi].
      403335499         -                                   AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          OXYTRI_12781            710   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            hypothetical protein OXYTRI_12781 (macronuclear) [Oxytricha trifallax].
      145491391         METHYLASE                           AlkB-2OGFEDO+SAM-methylase+SbcC                  2OG-FeII_Oxy_2+Methyltransf_11                                                          GSPATT00006150001       634   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145522424         METHYLASE                           AlkB-2OGFEDO+SAM-methylase+SbcC                  2OG-FeII_Oxy_2+Methyltransf_11                                                          GSPATT00014589001       636   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      299115673         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Esi_0186_0021           350   eukaryota>stramenopiles                               Ectocarpus siliculosus                         conserved unknown protein [Ectocarpus siliculosus].
      Sarc1000000358    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Sarc1000000358          335   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (335 aa)
      58394263          METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          AgaP_AGAP011900         621   eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST                    AGAP011900-PA, partial [Anopheles gambiae str. PEST].
      170579523         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    Bm1_17050               576   eukaryota>metazoa>nematoda                            Brugia malayi                                  hypothetical protein [Brugia malayi].
      17552176          METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          CELE_C14B1.10           591   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                         ALKB-8 [Caenorhabditis elegans].
      193643465         DnaJ                                DNAJ                                             2OG-FeII_Oxy_2+DnaJ+zf-CSL                                                              LOC100163323            382   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                            PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) [Acyrthosiphon pisum].
      Fcyl1000110820    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000110820          269   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.6.1356.1
      Fcyl1000110682    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000110682          280   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.6.1329.1
      Fcyl1000021378    METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          Fcyl1000021378          947   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.6_#_646
      91080367          METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    LOC663807               582   eukaryota>metazoa>hexapoda                            Tribolium castaneum                            PREDICTED: alkylated DNA repair protein alkB homolog 8 [Tribolium castaneum].
      24658267          METHYLASE                           SAM-methylase+Tox-HetC                           2OG-FeII_Oxy_2+Methyltransf_11                                                          Dmel_CG17807            615   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                        CG17807 [Drosophila melanogaster].
      307180204         METHYLASE                           TRPR-HTH+AlkB-2OGFEDO+SAM-methylase              2OG-FeII_Oxy_2+Methyltransf_11                                                          EAG_13148               604   eukaryota>metazoa>hexapoda                            Camponotus floridanus                          Alkylated DNA repair protein alkB-like protein 8 [Camponotus floridanus].
      156552181         -                                   AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          LOC100122369            589   eukaryota>metazoa>hexapoda                            Nasonia vitripennis                            PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform X1 [Nasonia vitripennis].
      110756990         METHYLASE                           GHH+AlkB-2OGFEDO+SAM-methylase                   2OG-FeII_Oxy_2+Methyltransf_11                                                          LOC411649               558   eukaryota>metazoa>hexapoda                            Apis mellifera                                 PREDICTED: similar to CG17807-PA [Apis mellifera].
      307214872         METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          EAI_01988               558   eukaryota>metazoa>hexapoda                            Harpegnathos saltator                          Alkylated DNA repair protein alkB-like protein 8 [Harpegnathos saltator].
      115738137         -                                   RRM+AlkB-2OGFEDO                                 RRM_1+2OG-FeII_Oxy_2                                                                    LOC592985               424   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: alkylated DNA repair protein alkB homolog 8 [Strongylocentrotus purpuratus].
      219505941         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase+GAF               RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    BRAFLDRAFT_111176       650   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_111176 [Branchiostoma floridae].
      219431450         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2                                                                    BRAFLDRAFT_215107       641   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_215107 [Branchiostoma floridae].
      Lgig1000007292    METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    Lgig1000007292          624   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.192.7.1
      321463990         -                                   RRM+AlkB-2OGFEDO+SAM-methylase                   2OG-FeII_Oxy_2+Methyltransf_11                                                          DAPPUDRAFT_306906       574   eukaryota>metazoa>crustacea                           Daphnia pulex                                  hypothetical protein DAPPUDRAFT_306906 [Daphnia pulex].
      Caps1000010719    METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    Caps1000010719          611   eukaryota>metazoa>annelida                            Capitella spI                                  estExt_Genewise1.C_3640053
      291238544         SP                                  RRM+AlkB-2OGFEDO+SAM-methylase                   SP+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                 LOC100369536            742   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: hypothetical protein [Saccoglossus kowalevskii].
      327269144         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    alkbh8                  666   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: alkylated DNA repair protein alkB homolog 8 [Anolis carolinensis].
      114640181         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                            ALKBH8                  664   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform X2 [Pan troglodytes].
      61675696          METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase+GAF               DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                            Alkbh8                  664   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   alkylated DNA repair protein alkB homolog 8 [Mus musculus].
      109478839         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                            RGD1304687_predicted    671   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: similar to CG17807-PA [Rattus norvegicus].
      224043547         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    LOC100232062            847   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: similar to Alkylated DNA repair protein alkB homolog 8 [Taeniopygia guttata].
      118085116         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                            LOC418972               679   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: hypothetical protein [Gallus gallus].
      326914414         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    LOC100541852            846   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Meleagris gallopavo].
      Hrob1000005052    METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_1+2OG-FeII_Oxy_2+Methyltransf_11                                                    Hrob1000005052          559   eukaryota>metazoa>annelida                            Helobdella robusta                             69129
      569439499         TM                                  AlkB-2OGFEDO                                     2OG-FeII_Oxy_2+TM                                                                       RFI_00879               195   eukaryota                                             Reticulomyxa filosa                            hypothetical protein RFI_00879 [Reticulomyxa filosa].
      156212272         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   RRM_6+2OG-FeII_Oxy_2+Methyltransf_11                                                    NEMVEDRAFT_v1g235894    648   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein [Nematostella vectensis].
      Psoj1000018446    METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          Psoj1000018446          430   eukaryota>stramenopiles                               Phytophthora sojae                             125662
      301121774         -                                   AlkB-2OGFEDO+SAM-methylase+S2P                   2OG-FeII_Oxy_2+Methyltransf_11                                                          PITG_02033              640   eukaryota>stramenopiles                               Phytophthora infestans T30-4                   alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4].
      Pram1000006453    METHYLASE                           AlkB-2OGFEDO+SAM-methylase                       2OG-FeII_Oxy_2+Methyltransf_11                                                          Pram1000006453          643   eukaryota>stramenopiles                               Phytophthora ramorum                           77246
      196005257         METHYLASE                           RRM+AlkB-2OGFEDO+SAM-methylase                   2OG-FeII_Oxy_2+Methyltransf_11                                                          TRIADDRAFT_26003        653   eukaryota>metazoa>placozoa                            Trichoplax adhaerens                           hypothetical protein TRIADDRAFT_26003 [Trichoplax adhaerens].
      
      #;
      735850808         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SAMD00019534_100830     200   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1                   hypothetical protein SAMD00019534_100830 [Acytostelium subglobosum LB1].
      281212158         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PPL_00108               168   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 2-oxoglutarate and Fe(II)-dependent oxygenase family protein [Polysphondylium pallidum PN500].
      #; ALKBH5 mRNA methylase, crown-SAR
      313240619         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSOID_T00020806001      462   eukaryota>metazoa>chordata                            Oikopleura dioica                              unnamed protein product [Oikopleura dioica].
      115965125         -                                   -                                                2OG-FeII_Oxy_2                                                                          LOC579335               266   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus                  PREDICTED: similar to MGC79570 protein [Strongylocentrotus purpuratus].
      Adig1000002503    -                                   -                                                2OG-FeII_Oxy_2                                                                          Adig1000002503          322   eukaryota>cnidaria                                    Acropora digitifera                            adi_v1.20302
      156219200         -                                   -                                                2OG-FeII_Oxy_2                                                                          NEMVEDRAFT_v1g108628    256   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein [Nematostella vectensis].
      291224533         -                                   -                                                2OG-FeII_Oxy_2                                                                          LOC100378951            362   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: RNA demethylase ALKBH5-like [Saccoglossus kowalevskii].
      Smar1000006760    -                                   -                                                2OG-FeII_Oxy_2                                                                          Smar1000006760          334   eukaryota>metazoa>arthropoda>myriapoda                Strigamia maritima                             SMAR013901-PA pep:novel scaffold:Smar1:JH431789:635068:636630:1 gene:SMAR013901 transcript:SMAR013901-RA
      210101960         -                                   -                                                2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_126925       314   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae].
      Lgig1000000949    -                                   -                                                2OG-FeII_Oxy_2                                                                          Lgig1000000949          256   eukaryota>metazoa>mollusca                            Lottia gigantea                                gw1.8.349.1
      198418993         -                                   MIP-T3                                           2OG-FeII_Oxy_2+MIP-T3                                                                   LOC100176098            308   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: RNA demethylase ALKBH5, partial [Ciona intestinalis].
      221119302         -                                   -                                                2OG-FeII_Oxy_2                                                                          LOC100212340            329   eukaryota>metazoa>cnidaria                            Hydra vulgaris                                 PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5-like [Hydra vulgaris].
      Caps1000015030    -                                   -                                                2OG-FeII_Oxy_2                                                                          Caps1000015030          231   eukaryota>metazoa>annelida                            Capitella spI                                  gw1.256.16.1
      116517268         -                                   -                                                2OG-FeII_Oxy_2                                                                          alkbh5                  352   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    RNA demethylase ALKBH5 [Danio rerio].
      47218956          -                                   -                                                2OG-FeII_Oxy_2                                                                          GSTEN:00015751:G:001    355   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      327287260         -                                   -                                                2OG-FeII_Oxy_2                                                                          alkbh5                  379   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: RNA demethylase ALKBH5 [Anolis carolinensis].
      118097886         -                                   -                                                2OG-FeII_Oxy_2                                                                          ALKBH5                  374   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: RNA demethylase ALKBH5 isoform X1 [Gallus gallus].
      224070277         -                                   -                                                2OG-FeII_Oxy_2                                                                          ALKBH5                  383   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: RNA demethylase ALKBH5 [Taeniopygia guttata].
      148539642         -                                   -                                                              2OG-FeII_Oxy_2                                                            ALKBH5                  394   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   RNA demethylase ALKBH5 [Homo sapiens].
      114668860         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       LOC743473               378   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: hypothetical protein LOC743473 [Pan troglodytes].
      109490878         -                                   -                                                              2OG-FeII_Oxy_2                                                            Alkbh5                  395   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              PREDICTED: alkB, alkylation repair homolog 5-like [Rattus norvegicus].
      31044423          -                                   -                                                              2OG-FeII_Oxy_2                                                            Alkbh5                  395   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   RNA demethylase ALKBH5 [Mus musculus].
      403336135         SP **                               ATHOOK+AlkB                                      SP                                                                                      OXYTRI_12242            943   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            hypothetical protein OXYTRI_12242 (macronuclear) [Oxytricha trifallax].
      116061067         SP   **                             PHD                                              SP                                                                                      Ot13g01270              544   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             unnamed protein product [Ostreococcus tauri].
      303274614         -   **                              C1+PHD                                           2OG-FeII_Oxy_2                                                                          MICPUCDRAFT_38490       897   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      300262451         SP                                  POTRA+POTRA+Asp-B-Hydro+UBC                      SP+MSP1_C+SDA1+SDA1+PAT1+PAT1                                                           VOLCADRAFT_92841        2654  eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis].
      403365523         -                                   CDC27                                            -                                                                                       OXYTRI_19840            1315  eukaryota>alveolata>ciliophora                        Oxytricha trifallax                            hypothetical protein OXYTRI_19840 (macronuclear) [Oxytricha trifallax].
      118352562         -                                   SFII-RAD3+Nimm67+Classical-AAA                   TFIIA+SMC_N+2OG-FeII_Oxy_2                                                              TTHERM_00371220         1999  eukaryota>alveolata>ciliophora                        Tetrahymena thermophila                        hypothetical protein TTHERM_00371220 (macronuclear) [Tetrahymena thermophila].
      145483981         SP+Tox-ABHYDROLASE3                 SFII-RAD3+Metallopeptidase                       SP+Lipase_2                                                                             GSPATT00005366001       1283  eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145520431         SP                                  35exo+NTox3                                      SP                                                                                      GSPATT00001715001       941   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      307105467         -                                                                                    -                                                                                       CHLNCDRAFT_136563       521   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_136563 [Chlorella variabilis].
      Sarc1000012919    -                                   ZNR                                              FTO_NTD                                                                                 Sarc1000012919          292   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (292 aa)
      Sarc1000002122    -                                   PLUS3                                            -                                                                                       Sarc1000002122          178   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (178 aa)
      30690892          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy                                                                            AT2G48080               438   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      15236223          Tox-MCF                             Asp-B-Hydro                                      2OG-FeII_Oxy                                                                            AT4G02940               569   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      302803799         -                                   -                                                2OG-FeII_Oxy_2+Totivirus_coat                                                           SELMODRAFT_422939       556   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii].
      186489643         -                                   -                                                2OG-FeII_Oxy_2                                                                          AT1G48980               325   eukaryota>viridiplantae                               Arabidopsis thaliana                           2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana].
      79319564          -                                   TET-JBP                                          2OG-FeII_Oxy_2                                                                          AT1G48980               327   eukaryota>viridiplantae                               Arabidopsis thaliana                           2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana].
      79361742          -                                   TET-JBP                                          2OG-FeII_Oxy_2                                                                          AT1G48980               331   eukaryota>viridiplantae                               Arabidopsis thaliana                           2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana].
      79326344          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AT4G36090               520   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      302775126         -                                   AlkB-2OGFEDO                                     DUF2052+2OG-FeII_Oxy_2                                                                  SELMODRAFT_94921        307   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii].
      302757365         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_64626        289   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_64626, partial [Selaginella moellendorffii].
      15227938          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AT2G17970               507   eukaryota>viridiplantae                               Arabidopsis thaliana                           2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana].
      116061183         -                                   AlkB-2OGFEDO+EP1+SAM-methylase                   2OG-FeII_Oxy_2+Methyltransf_11                                                          OT_ostta13g02300        597   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             Alpha-ketoglutarate-dependent dioxygenase AlkB-like [Ostreococcus tauri].
      281211827         -                                   -                                                2OG-FeII_Oxy_2                                                                          PPL_01222               280   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 hypothetical protein PPL_01222 [Polysphondylium pallidum PN500].
      
      323447352         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AURANDRAFT_68154        461   eukaryota>stramenopiles                               Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_68154 [Aureococcus anophagefferens].
      485613446         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       EMIHUDRAFT_120329       373   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_120329, partial [Emiliania huxleyi CCMP1516].
      #; FTO, note fusion to ASCH in stramenopiles and kinase in Chlamy
      291236823         -                                   -                                                FTO_NTD+FTO_CTD                                                                         LOC100366525            455   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                       PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO-like, partial [Saccoglossus kowalevskii].
      326927229         -                                   -                                                FTO_NTD+FTO_CTD                                                                         LOC100546308            509   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO-like [Meleagris gallopavo].
      50753676          -                                   -                                                FTO_NTD+FTO_CTD                                                                         LOC415718               120   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: hypothetical protein [Gallus gallus].
      224064302         -                                   -                                                FTO_NTD+FTO_CTD                                                                         LOC100223734            509   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: fat mass and obesity associated [Taeniopygia guttata].
      122937263         -                                   -                                                FTO_NTD+FTO_CTD                                                                         FTO                     505   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alpha-ketoglutarate-dependent dioxygenase FTO [Homo sapiens].
      114662524         -                                   -                                                FTO_NTD+FTO_CTD                                                                         FTO                     505   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO isoform X3 [Pan troglodytes].
      89337260          -                                   -                                                FTO_NTD+FTO_CTD                                                                         Fto                     502   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              alpha-ketoglutarate-dependent dioxygenase FTO [Rattus norvegicus].
      6753916           -                                   -                                                FTO_NTD+FTO_CTD                                                                         Fto                     502   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   similar to FTO [Mus musculus].
      327276419         -                                   -                                                FTO_NTD+FTO_CTD                                                                         LOC100556980            489   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO-like [Anolis carolinensis].
      189521378         SP                                  -                                                SP+FTO_NTD+FTO_CTD+FTO_CTD                                                              fto                     556   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    PREDICTED: fto protein [Danio rerio].
      47216623          TM                                  -                                                FTO_NTD+FTO_CTD+TM                                                                      GSTEN:00025432:G:001    488   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      Fcyl1000025701    SP                                  -                                                SP+FTO_NTD+FTO_CTD                                                                      Fcyl1000025701          545   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.11.582.1
      320169135         -                                   -                                                FTO_NTD+FTO_NTD+FTO_CTD                                                                 CAOG_04002              638   eukaryota                                             Capsaspora owczarzaki ATCC 30864               hypothetical protein CAOG_04002 [Capsaspora owczarzaki ATCC 30864].
      Sarc1000012918    -                                   -                                                FTO_NTD                                                                                 Sarc1000012918          212   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (212 aa)
      298709920         SP                                  SPX                                              SP+FTO_NTD+FTO_CTD                                                                      Esi_0270_0028           715   eukaryota>stramenopiles                               Ectocarpus siliculosus                         conserved unknown protein [Ectocarpus siliculosus].
      Ttra1000008560    -                                   -                                                SRP-alpha_N+FTO_NTD+FTO_CTD                                                             Ttra1000008560          561   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 FATSO protein (561 aa)
      220977073         -                                   -                                                FTO_NTD+FTO_CTD                                                                         THAPSDRAFT_261481       613   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335              hypothetical protein THAPSDRAFT_261481, partial [Thalassiosira pseudonana CCMP1335].
      298709934         ASCH                                ASCH                                             FTO_NTD+FTO_CTD                                                                         Esi_0272_0024           1065  eukaryota>stramenopiles                               Ectocarpus siliculosus                         conserved unknown protein [Ectocarpus siliculosus].
      Fcyl1000039992    ASCH                                ASCH                                             FTO_NTD+FTO_CTD+ASCH                                                                    Fcyl1000039992          886   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.102_#_45
      Fcyl1000029925    ASCH                                ASCH                                             FTO_NTD+FTO_CTD+ASCH                                                                    Fcyl1000029925          870   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.21_#_237
      116060758         TM+TM+TM                            TFIIE-HTH                                        TM+TM+TM+FTO_NTD+FTO_CTD                                                                Ot12g01460              689   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             unnamed protein product [Ostreococcus tauri].
      226457226         -                                   -                                                FTO_NTD+FTO_CTD                                                                         MICPUCDRAFT_60556       541   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      255078368         -                                   -                                                FTO_NTD+FTO_CTD                                                                         MICPUN_112682           520   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          FATSO protein [Micromonas sp. RCC299].
      220977539         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          THAPSDRAFT_20825        573   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      219113643         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       PHATR_44026             384   eukaryota>stramenopiles                               Phaeodactylum tricornutum CCAP 1055/1          predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      Fcyl1000122509    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000122509          409   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.31.61.1
      Fcyl1000129851    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Fcyl1000129851          477   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.113.10.1
      Fcyl1000032662    Tox-ABHYDROLASE3                    Tox-ABHYDROLASE3+AlkB-2OGFEDO                    Lipase_3+2OG-FeII_Oxy_2                                                                 Fcyl1000032662          827   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.31_#_127
      Fcyl1000040299    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Fcyl1000040299          406   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.113_#_6
      299473601         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Esi_0081_0103           484   eukaryota>stramenopiles                               Ectocarpus siliculosus                         conserved unknown protein [Ectocarpus siliculosus].
      303291017         -                                   -                                                2OG-FeII_Oxy_2+API5                                                                     MICPUCDRAFT_67764       233   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      255082812         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2+API5                                                                  MICPUN_61562            577   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      Bnat1000005628    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Bnat1000005628          508   eukaryota>rhizaria>cercozoa                           Bigelowiella natans                            fgenesh1_pg.46_#_69
      302836187         -                                   POTRA+AlkB-2OGFEDO                               PAT1+DUF4175+2OG-FeII_Oxy_2                                                             VOLCADRAFT_104423       765   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_104423 [Volvox carteri f. nagariensis].
      158275615         -                                   STYKIN                                           2OG-FeII_Oxy_2+Pkinase                                                                  CHLREDRAFT_175223       1503  eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      predicted protein [Chlamydomonas reinhardtii].
      320162612         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CAOG_00036              372   eukaryota                                             Capsaspora owczarzaki ATCC 30864               alkylated DNA repair protein [Capsaspora owczarzaki ATCC 30864].
      497649437
      490863615
      502454342
      521074729
      521061610
      499713678
      522048777
      771842312
      657897720
      515934545
      752713757
      53758311
      766766902
      #; ALKBH3, note fusion to HOMEO domain, mehtylase not DNA methylase
      116054890         SP                                  METHYLASE+AlkB-2OGFEDO                           SP+DUF633+2OG-FeII_Oxy_2                                                                Ot14g02210              516   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             alkylated DNA repair protein (ISS), partial [Ostreococcus tauri].
      Caps1000024925    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Caps1000024925          159   eukaryota>metazoa>annelida                            Capitella spI                                  e_gw1.37312.2.1
      125855293         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       LOC792266               282   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                                    PREDICTED: hypothetical protein [Danio rerio].
      47214690          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSTEN:00019672:G:001    302   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                         unnamed protein product, partial [Tetraodon nigroviridis].
      156218294         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NEMVEDRAFT_v1g168448    269   eukaryota>metazoa>cnidaria                            Nematostella vectensis                         predicted protein [Nematostella vectensis].
      198419633         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       LOC100187190            288   eukaryota>metazoa>chordata                            Ciona intestinalis                             PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Ciona intestinalis].
      Hrob1000003787    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Hrob1000003787          293   eukaryota>metazoa>annelida                            Helobdella robusta                             185055
      Caps1000018790    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Caps1000018790          288   eukaryota>metazoa>annelida                            Capitella spI                                  e_gw1.669.6.1
      Lgig1000006095    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Lgig1000006095          278   eukaryota>metazoa>mollusca                            Lottia gigantea                                e_gw1.85.31.1
      114637163         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH3                  286   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 isoform X1 [Pan troglodytes].
      21040275          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          ALKBH3                  286   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Homo sapiens].
      114637165         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC741336               145   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                                PREDICTED: similar to ALKBH3 protein isoform 1 [Pan troglodytes].
      110625726         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Alkbh3                  286   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                                   alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Mus musculus].
      62079085          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Alkbh3                  295   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                              alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Rattus norvegicus].
      118118318         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC776379               70    eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: similar to ALKBH3 protein, partial [Gallus gallus].
      224051018         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100220917            338   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                            PREDICTED: similar to alkB, alkylation repair homolog 3 [Taeniopygia guttata].
      326920364         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          LOC100545576            228   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3-like [Meleagris gallopavo].
      118091513         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       ALKBH3                  333   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                                  PREDICTED: similar to prostate cancer antigen-1 [Gallus gallus].
      327259719         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2+2OG-FeII_Oxy_2                                                           LOC100557858            220   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                            PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3-like [Anolis carolinensis].
      210103637         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          BRAFLDRAFT_126257       287   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_126257 [Branchiostoma floridae].
      219407733         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       BRAFLDRAFT_117316       316   eukaryota>metazoa>chordata                            Branchiostoma floridae                         hypothetical protein BRAFLDRAFT_117316 [Branchiostoma floridae].
      485640232         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_111319       283   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_111319 [Emiliania huxleyi CCMP1516].
      551577156         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_207885       280   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_207885 [Emiliania huxleyi CCMP1516].
      Fcyl1000026428    -                                   HOMEO+AlkB-2OGFEDO                               2OG-FeII_Oxy_2                                                                          Fcyl1000026428          396   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.13_#_88
      743520408
      735848816         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SAMD00019534_121570     167   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1                   hypothetical protein SAMD00019534_121570 [Acytostelium subglobosum LB1].
      281207196         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PPL_05363               121   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 hypothetical protein PPL_05363 [Polysphondylium pallidum PN500].
      735849527         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SAMD00019534_115570     173   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1                   hypothetical protein SAMD00019534_115570 [Acytostelium subglobosum LB1].
      735994552         -                                   Actinomycete-peptide+AlkB-2OGFEDO+STYKIN         2OG-FeII_Oxy_2+DUF605                                                                   SAMD00019534_034160     511   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1                   hypothetical protein SAMD00019534_034160, partial [Acytostelium subglobosum LB1].
      281207383         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PPL_05555               506   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 putative alkylated DNA repair protein [Polysphondylium pallidum PN500].
      569381717         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          RFI_21980               275   eukaryota                                             Reticulomyxa filosa                            alkylated DNA repair protein [Reticulomyxa filosa].
      284094864         Nimm55                              AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_63689        292   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      #; AlkBH3- subgroup Likely to bind DNA, note SAD and PHD in one instance  and ATHOOK in another. 
      568037062
      299115604         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Esi_0181_0046           349   eukaryota>stramenopiles                               Ectocarpus siliculosus                         2OG-Fe(II) oxygenase [Ectocarpus siliculosus].
      Bnat1000014488    -                                   AlkB-2OGFEDO+TET-JBP                             2OG-FeII_Oxy_2                                                                          Bnat1000014488          166   eukaryota>rhizaria>cercozoa                           Bigelowiella natans                            gw1.3.140.1
      217403598         SP                                  AlkB-2OGFEDO+SAD                                 SP+2OG-FeII_Oxy_2+YDG_SRA                                                               PHATRDRAFT_49981        544   eukaryota>stramenopiles                               Phaeodactylum tricornutum CCAP 1055/1          predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      323453703         PHD                                 AlkB-2OGFEDO+SAD+PHD+AN1+PHD+FUNDEAMN+EP1+SH3    2OG-FeII_Oxy_2+YDG_SRA+PAT1+zf-HC5HC2H+MAP65_ASE1+Rib_recp_KP_reg                       AURANDRAFT_63241        2643  eukaryota>stramenopiles                               Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_63241 [Aureococcus anophagefferens].
      Spun1000006275    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Spun1000006275          232   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (232 aa)
      313239117         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSOID_T00007062001      262   eukaryota>metazoa>chordata                            Oikopleura dioica                              unnamed protein product [Oikopleura dioica].
      485619453         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_210584       259   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_210584 [Emiliania huxleyi CCMP1516].
      220973386         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          THAPSDRAFT_6486         318   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335              predicted protein [Thalassiosira pseudonana CCMP1335].
      239886228         -                                   ATHOOK+AlkB-2OGFEDO                              2OG-FeII_Oxy_2                                                                          Pmar_PMAR018546         305   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983                   conserved hypothetical protein [Perkinsus marinus ATCC 50983].
      239895917         -                                   AlkB-2OGFEDO                                     AF-4+2OG-FeII_Oxy_2                                                                     Pmar_PMAR006990         477   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983                   conserved hypothetical protein [Perkinsus marinus ATCC 50983].
      Wseb1000002244    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Wseb1000002244          200   eukaryota>fungi>basidiomycota                         Wallemia sebi                                  estExt_Genemark1.C_70130
      71003245          -                                   SWC3+AlkB-2OGFEDO                                2OG-FeII_Oxy_2                                                                          UM00156.1               421   eukaryota>fungi>basidiomycota                         Ustilago maydis 521                            hypothetical protein UM00156.1 [Ustilago maydis 521].
      18399917          SP                                  AlkB-2OGFEDO                                     SP+DUF4057+2OG-FeII_Oxy_2                                                               AT2G22260               314   eukaryota>viridiplantae                               Arabidopsis thaliana                           DNA repair protein ALKBH2 [Arabidopsis thaliana].
      307102474         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CHLNCDRAFT_13108        199   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_13108, partial [Chlorella variabilis].
      Spun1000007268    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Spun1000007268          281   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (281 aa)
      70996955          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AFUA_5G14250            317   eukaryota>fungi>ascomycota                            Aspergillus fumigatus Af293                    DNA repair family protein [Aspergillus fumigatus Af293].
      67901590          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AN7782.2                335   eukaryota>fungi>ascomycota                            Aspergillus nidulans FGSC A4                   hypothetical protein AN7782.2 [Aspergillus nidulans FGSC A4].
      Sarc1000007962    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Sarc1000007962          360   eukaryota>ichthyosporea                               Sphaeroforma arctica                            Sphaeroforma arctica JP610 hypothetical protein (360 aa)
      46108746          SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       FG01255.1               326   eukaryota>fungi>ascomycota                            Fusarium graminearum PH-1                      hypothetical protein FG01255.1 [Fusarium graminearum PH-1].
      160705040         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SNOG_15119              325   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_15119 [Phaeosphaeria nodorum SN15].
      Chet1000005718    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Chet1000005718          317   eukaryota>fungi>ascomycota                            Cochliobolus heterostrophus                    estExt_fgenesh1_pg.C_410002
      162673787         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PHYPADRAFT_141856       320   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein [Physcomitrella patens].
      Ttra1000002481    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Ttra1000002481          292   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (292 aa)
      
      #; AlkBH3-related Interesting fusion to CUE and TOPC- likely to bind DNA  
      Lhya1000011955    -                                   CUE+AlkB-2OGFEDO+TOPC                            2OG-FeII_Oxy_2+zf-GRF                                                                   Lhya1000011955          397   eukaryota>fungi>mucoromycotina                        Lichtheimia hyalospora                         e_gw1.1115.1.1
      50423311          -                                   CUE+AlkB-2OGFEDO+TOPC                            2OG-FeII_Oxy_2                                                                          DEHA0E22759g            404   eukaryota>fungi>ascomycota                            Debaryomyces hansenii CBS767                   hypothetical protein DEHA0E22759g [Debaryomyces hansenii CBS767].
      45199260          -                                   CUE+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          AGOS_AFR741W            407   eukaryota>fungi>ascomycota                            Ashbya gossypii ATCC 10895                     AFR741Wp [Ashbya gossypii ATCC 10895].
      50550127          -                                   CUE+AlkB-2OGFEDO+TOPC                            2OG-FeII_Oxy_2                                                                          YALI0D07546g            372   eukaryota>fungi>ascomycota                            Yarrowia lipolytica CLIB122                    YALI0D07546p [Yarrowia lipolytica].
      116505443         TM                                  AlkB-2OGFEDO+JAB                                 2OG-FeII_Oxy_2+TM+Peptidase_M13_N+Peptidase_M13                                         CC1G_05104              1263  eukaryota>fungi>basidiomycota                         Coprinopsis cinerea okayama7#130               hypothetical protein CC1G_05104 [Coprinopsis cinerea okayama7#130].
      527305314         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          FOMPIDRAFT_41770        430   eukaryota>fungi>basidiomycota                         Fomitopsis pinicola FP-58527 SS1               hypothetical protein FOMPIDRAFT_41770 [Fomitopsis pinicola FP-58527 SS1].
      Abis1000001814    -                                   AlkB-2OGFEDO+TOPC                                2OG-FeII_Oxy_2                                                                          Abis1000001814          379   eukaryota>fungi>basidiomycota                         Agaricus bisporus                              estExt_Genewise1Plus.C_21078
      164647078         SP                                  AlkB-2OGFEDO+TOPC                                SP+2OG-FeII_Oxy_2                                                                       LACBIDRAFT_232887       357   eukaryota>fungi>basidiomycota                         Laccaria bicolor S238N-H82                     predicted protein [Laccaria bicolor S238N-H82].
      Mver1000002291    -                                   CUE+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          Mver1000002291          480   eukaryota>fungi>zygomycete                            Mortierella verticillata                        Mortierella verticillata NRRL 6337 hypothetical protein (480 aa)
      Chet1000002884    -                                   UBA+AlkB-2OGFEDO+TOPC                            2OG-FeII_Oxy_2+zf-GRF                                                                   Chet1000002884          453   eukaryota>fungi>ascomycota                            Cochliobolus heterostrophus                    estExt_Genewise1Plus.C_320154
      169625210         -                                   CUE+AlkB-2OGFEDO                                 2OG-FeII_Oxy_2                                                                          SNOG_15872              420   eukaryota>fungi>ascomycota                            Phaeosphaeria nodorum SN15                     hypothetical protein SNOG_15872 [Phaeosphaeria nodorum SN15].
      67517207          -                                   CUE+RelE-ParE+AlkB-2OGFEDO+TOPC                  CUE+2OG-FeII_Oxy_2+zf-GRF                                                               AN0881.2                448   eukaryota>fungi>ascomycota                            Aspergillus nidulans FGSC A4                   hypothetical protein AN0881.2 [Aspergillus nidulans FGSC A4].
      70996308          -                                   CUE+AlkB-2OGFEDO+TOPC                            CUE+2OG-FeII_Oxy_2+zf-GRF                                                               AFUA_1G15410            493   eukaryota>fungi>ascomycota                            Aspergillus fumigatus Af293                    CUE domain protein [Aspergillus fumigatus Af293].
      46121463          -                                   AlkB-2OGFEDO+TOPC                                2OG-FeII_Oxy_2+zf-GRF                                                                   FG05110.1               357   eukaryota>fungi>ascomycota                            Fusarium graminearum PH-1                      hypothetical protein FG05110.1 [Fusarium graminearum PH-1].
      85107094          -                                   CUE+AlkB-2OGFEDO+TOPC                            CUE+2OG-FeII_Oxy_2+zf-GRF                                                               NCU07663.1              584   eukaryota>fungi>ascomycota                            Neurospora crassa OR74A                        hypothetical protein [Neurospora crassa OR74A].
      116196186         SP                                  CUE+AlkB-2OGFEDO+TOPC                            SP+2OG-FeII_Oxy_2+zf-GRF                                                                CHGG_04691              473   eukaryota>fungi>ascomycota                            Chaetomium globosum CBS 148.51                 hypothetical protein CHGG_04691 [Chaetomium globosum CBS 148.51].
      Spun1000003263    -                                   CUE+AlkB-2OGFEDO+TOPC                            2OG-FeII_Oxy_2+zf-GRF                                                                   Spun1000003263          427   eukaryota>fungi>chytridiomycota                       Spizellomyces punctatus                         Spizellomyces punctatus DAOM BR117 hypothetical protein (427 aa)
      255085460         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2+zf-GRF                                                                MICPUN_62379            453   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      485619574         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_210764       448   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_210764 [Emiliania huxleyi CCMP1516].
      551544968         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_249216       366   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_249216 [Emiliania huxleyi CCMP1516].
      
      #; THAPSDRAFT_42543-like RNA modifying
      220970302         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          THAPSDRAFT_42543        222   eukaryota>stramenopiles                               Thalassiosira pseudonana CCMP1335              predicted protein, partial [Thalassiosira pseudonana CCMP1335].
      Fcyl1000019629    METHYLASE                           AlkB-2OGFEDO+THUMP+METHYLASE+DUF2431             2OG-FeII_Oxy_2+UPF0020                                                                  Fcyl1000019629          822   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.4_#_1043
      Fcyl1000039366    METHYLASE                           AlkB-2OGFEDO+THUMP+METHYLASE+DUF2431             2OG-FeII_Oxy_2+UPF0020                                                                  Fcyl1000039366          813   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.88.28.1
      Fcyl1000078853    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000078853          234   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.4.648.1
      551550668         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_215572       548   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_215572 [Emiliania huxleyi CCMP1516].
      551618356         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_225460       465   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_225460 [Emiliania huxleyi CCMP1516].
      551576966         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       EMIHUDRAFT_207723       307   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_207723 [Emiliania huxleyi CCMP1516].
      
      #; Naegleria LSE
      284090982         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_79711        271   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      284094920         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_63194        236   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      284086042         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          NAEGRDRAFT_52210        251   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      
      
      #; Note fusion to Little finger and HECT
      Psoj1000002243    SP                                  LittleFinger+HECT+AlkB-2OGFEDO                   SP+HECT+2OG-FeII_Oxy_2                                                                  Psoj1000002243          1001  eukaryota>stramenopiles                               Phytophthora sojae                             128295
      Pram1000006143    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pram1000006143          292   eukaryota>stramenopiles                               Phytophthora ramorum                           77618
      Fcyl1000015510    POTRA                               CYSTM+AlkB-2OGFEDO                               2OG-FeII_Oxy_2                                                                          Fcyl1000015510          379   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.1_#_1574
      Fcyl1000029493    SP+POTRA+POTRA                      AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       Fcyl1000029493          394   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.20_#_150
      262105909         PG_binding_1                        AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PITG_02472              261   eukaryota>stramenopiles                               Phytophthora infestans T30-4                   conserved hypothetical protein [Phytophthora infestans T30-4].
      
      #; 
      Ttra1000010051    SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2+TFIIA+IMCp                                                            Ttra1000010051          935   eukaryota>apusozoa                                    Thecamonas trahens                              Thecamonas trahens ATCC 50062 hypothetical protein (935 aa)
      #; 
      303276360         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       MICPUCDRAFT_56679       318   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein [Micromonas pusilla CCMP1545].
      226517324         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          MICPUN_58441            335   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein [Micromonas sp. RCC299].
      #; 
      118346635         -                                   SbcC+AlkB-2OGFEDO                                2OG-FeII_Oxy_2                                                                          TTHERM_00035460         403   eukaryota>alveolata>ciliophora                        Tetrahymena thermophila                        hypothetical protein TTHERM_00035460 (macronuclear) [Tetrahymena thermophila].
      145491776         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSPATT00033965001       312   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      145488027         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          GSPATT00032472001       304   eukaryota>alveolata>ciliophora                        Paramecium tetraurelia strain d4-2             hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
      #; 
      Pram1000006748    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Pram1000006748          308   eukaryota>stramenopiles                               Phytophthora ramorum                           76882
      Psoj1000014320    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Psoj1000014320          310   eukaryota>stramenopiles                               Phytophthora sojae                             142118
      301093209         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PITG_19142              309   eukaryota>stramenopiles                               Phytophthora infestans T30-4                   alkylated DNA repair protein alkB-like protein [Phytophthora infestans T30-4].
      209582552         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PHATR_46782             352   eukaryota>stramenopiles                               Phaeodactylum tricornutum CCAP 1055/1          predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
      Fcyl1000110306    -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          Fcyl1000110306          534   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       gw1.5.156.1
      Fcyl1000020581    -                                   CheRN-Alpha+AlkB-2OGFEDO                         Pap_E4+2OG-FeII_Oxy_2                                                                   Fcyl1000020581          588   eukaryota>stramenopiles                               Fragilariopsis cylindrus                       fgenesh2_pg.5_#_951
      551549245         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          EMIHUDRAFT_358442       316   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_358442 [Emiliania huxleyi CCMP1516].
      551567129         SP                                  -                                                SP+2OG-FeII_Oxy_2                                                                       EMIHUDRAFT_355877       280   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                     hypothetical protein EMIHUDRAFT_355877, partial [Emiliania huxleyi CCMP1516].
      284095290         SP                                  AlkB-2OGFEDO                                     SP+2OG-FeII_Oxy_2                                                                       NAEGRDRAFT_30537        314   eukaryota>heterolobosea                               Naegleria gruberi                              predicted protein [Naegleria gruberi].
      735859727         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SAMD00019534_002080     394   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1                   hypothetical protein SAMD00019534_002080 [Acytostelium subglobosum LB1].
      66808825          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          DDB_G0285575            393   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium discoideum AX4                   alkylated DNA repair protein [Dictyostelium discoideum AX4].
      325082087         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          DICPUDRAFT_55106        328   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium purpureum                        hypothetical protein DICPUDRAFT_55106 [Dictyostelium purpureum].
      281211828         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PPL_01223               116   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 alkylated DNA repair protein [Polysphondylium pallidum PN500].
      281209952         POTRA+TM+TM+TM+TM                   POTRA+POTRA+AlkB-2OGFEDO+sGTP                    2OG-FeII_Oxy_2+Ras+TM+TM+TM+TM                                                          PPL_03193               780   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500                 alkylated DNA repair protein [Polysphondylium pallidum PN500].
      307109513         -                                   SHELIX+AlkB-2OGFEDO                              2OG-FeII_Oxy_2                                                                          CHLNCDRAFT_143028       800   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                           hypothetical protein CHLNCDRAFT_143028 [Chlorella variabilis].
      239900379         -                                   -                                                2OG-FeII_Oxy_2                                                                          Pmar_PMAR019901         332   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983                   conserved hypothetical protein [Perkinsus marinus ATCC 50983].
      221487023         -                                   AlkB-2OGFEDO                                     Macoilin+2OG-FeII_Oxy_2                                                                 TGGT1_010770            927   eukaryota>alveolata>apicomplexa                       Toxoplasma gondii GT1                          conserved hypothetical protein [Toxoplasma gondii GT1].
      302829380         -                                   SWC3+AlkB-2OGFEDO                                2OG-FeII_Oxy_2                                                                          VOLCADRAFT_115825       365   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis                  hypothetical protein VOLCADRAFT_115825 [Volvox carteri f. nagariensis].
      159479846         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          CHLREDRAFT_151320       398   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                      hypothetical protein CHLREDRAFT_151320, partial [Chlamydomonas reinhardtii].
      116000537         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2+Aha1_N+AHSA1                                                             Ot01g03050              836   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                             DNA alkylation damage repair protein (ISS) [Ostreococcus tauri].
      255081849         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          MICPUN_74198            126   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                          predicted protein, partial [Micromonas sp. RCC299].
      303285390         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          MICPUCDRAFT_35646       144   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545                    predicted protein, partial [Micromonas pusilla CCMP1545].
      323447764         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AURANDRAFT_8404         133   eukaryota>stramenopiles                               Aureococcus anophagefferens                    hypothetical protein AURANDRAFT_8404, partial [Aureococcus anophagefferens].
      162670995         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          PHYPADRAFT_169907       401   eukaryota>viridiplantae                               Physcomitrella patens                          predicted protein [Physcomitrella patens].
      302801802         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_116512       342   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_116512 [Selaginella moellendorffii].
      302798841         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          SELMODRAFT_113969       342   eukaryota>viridiplantae                               Selaginella moellendorffii                     hypothetical protein SELMODRAFT_113969 [Selaginella moellendorffii].
      15221095          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          AT1G11780               345   eukaryota>viridiplantae                               Arabidopsis thaliana                           oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana].
      156082850         -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          BBOV_I002590            336   eukaryota>alveolata>apicomplexa                       Babesia bovis T2Bo                             hypothetical protein [Babesia bovis T2Bo].
      84999908          -                                   AlkB-2OGFEDO+CytC                                2OG-FeII_Oxy_2                                                                          TA19740                 350   eukaryota>alveolata>apicomplexa                       Theileria annulata strain Ankara               alkylated DNA repair protein [Theileria annulata].
      71031831          -                                   AlkB-2OGFEDO                                     2OG-FeII_Oxy_2                                                                          TP01_0030               259   eukaryota>alveolata>apicomplexa                       Theileria parva strain Muguga                  hypothetical protein [Theileria parva strain Muguga].
      
      Back to Contents
    • Multiple sequence alignment of the AlkB-H4 family that was recently shown to have a N6A demethylation activity in C. elegans

                                                                                                                                                                                                                                                                Str (-3)                                                            Str (-2)                     Str (-1)                                                                                                                                           Str-1                                  Str-2     Str-3      Str-4                                                                                                                                  Str-5             Str-6                                                 Str-7                            
      FINAL                                                               -----HHHH-HHH---------------------------------------EEEE-EE-----------------------------------------------------------------------------------------------------------------------------EEEEEEE--HHHHHHHHHHH----------------------------------------------------------------------------E----------------------------------HHHHH-H-HH-----------------------------------------------------------------------------HHHH--------------------------EE-------------EEE--------------------EEE---EEEEEEE---EEEEEE----------------------------------------------------------------------------------------------------------------------------------EEEE-------EE----EEEEE---HHHHHHHHHHHH----------------------------------EEEEEEE-----------------------------------------HHHHHH------HHHHHHHHH------
      ALIGN                                                               ------------------------------------------------------HH-H--------HHHH---------------------E--------------------------------------------------------------------------------------------EEEEE----HHHHHHHHHHH-------------------------------------------------------------------------------HHH-----------------------------HHHHH-H-HH-----------------------------------------------------------------------------HH--------------HHH---------H-HH-------------HHHHH---H--------------------EEEEEE-----EEEEE----------------------------------------------------------------------------------------------------------------------------------EEEE-------EE-----EEEE----HEEEHEHHH------------------------------------EEEEHHHHH---------------------------------------HH-HHH------HHHHHHHH-------
      HMM                                                                 -----HEEH-H------------------------------------------EEE-EE----------EE--H-H-H--HHHHHHHH-HHHH--HHH--------H----------------------------------------------------------------------------EEEEEEEEE-HHHHHHHHHHHH---------------------------------------------E--EEE----------EEEE------EEEEHHHHHEE----------EE---------------HHHHHH-H-HH-----------------------------------------------------------------------------HHHH-----------HHHHH-------EE-EE-------------EEEE-----H------EE--EEEEEE--HEEEEEEE--EEEEEEEE---------------------------------------------------------------------------------------------------------------------------------EEEE-------EE----EEEEE---HHHHHHHHHHHH----------------------------------EEEEEEE----HH----------------------H-HH----H---HHHHHHH------HHHHHHHHHH-----
      FREQ                                                                ------EEE-EEE--------------------------------------EEEEE-E------------------------------------------------------------------------------------------------------------------------------EEEEEE---HHHHHHHHHH--------------------------------------------------------------------------------HHHH----------HHH----------------HHHH-H-HH-----------------------------------------------------------------------------HH-----------------------------E-------------E---------------------HHHHHHHHHHEE-----EEEEE-----------------------------------------------------------------------------------------------------------------------------------EEEE-------E------EEEE---HHHHHHHHHHHHH----------------H-------------HHHHEEHHHHH------------------------------------------HHHH------HHHHHHHHHH-----
      PSSM                                                                ----HHHHH-HHH---------------------------------------HEEE-E------------------------------------------------------------------------------------------------------------------------------EEEE-----HHHHHHHHHHH----------------------------------------------------------------------------------------------------------------HHHH-H-HH-----------------------------------------------------------------------------HHHH------------------------------------------E----------------------------EEEEE-----EEEE------------------------------------------------------------------------------------------------------------------------------------EEE-------EE-----EEEE------EEE-----------------------------------------EEEEE------------------------------------------HHHHHH------HHHHHHH--------
      CELE_F09F7.7_Caenorhabditis_elegans_25148697                        CGCKGARFC-ALC-ET-TE-------RVKK-------LRVVE---DKHVNYKVFIY-DH------IRQIAIPTTNL-N--SQSSLEDI-IDES--TSC-------QSV---------------------------------------------S----TDGS---I-----------E-I--DGLTLIHNFLSESEESKILNMID-------------------------------T--------V--EWA--QSQSG--------RRKQ------DYGPKVNFKHK----------KVK-TD--T--FV--GMPEYADM-L-LN-----------------------------------------------------------------------------KMSE-----------YDVKKLG--NY-QP-FE-------------MCNLE---YEEVKKSAIEMHQDDMWIWGNRLISINLINGSVMTLSNDNK----------------------------------------------------S-------------------------------------------------------------------------FLCY-------VHMPHRSLLCMADECRYDWKHGVLAHH----------------I-------------RGRRIALTMREA--AK----------------------D-FA----E---GGELYEK------YGAELIRLGNIRVPL
      Dmel_CG4036_Drosophila_melanogaster_24583140                        CGCKGVRTC-LSC-EQ-DF-------HIAK-------TSLRE---QFQ-------------------------------------QLEAWSYC--IQC-------DLL-----Q-R-GWDT--NHVQKDHE--------------------NHK----KDEG---L-----------P-L--PGILVQEEFLSVDEGAQLIADLD-------------------------------D--------L--PWD--ISQSG--------RRKQ------NFGPKTNFKKR----------KLR-LG--S--FA--GFPRTTEY-V-QR-----------------------------------------------------------------------------RFED-VP------L-LR-------GF-QT-IE-------------QCSLE---YEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVLTLTPY-EVQQSGKYNLD------LVASYEDELLA-PLLT------------DDQLATFEG-------------------------------------------------------------------------KVLR-------IPMPNLSLIVLYGPARYQFEHSVLRED----------------V-------------QERRVCVAYREF--TP----------------------M-YI----NGV-DIQKGDP--VRE-KSQIFW-----QIN-
      LOC100367945_Saccoglossus_kowalevskii_585709738                     CGCKGIRTC-LVCEKS-KI-------DLSC-------RSGRF---EKP-------------------------------------DAS-YSFC--WLC-------NIA-----W-L-ESNT--EQHPSHQG--------------------KF------------I-----------R-F--PGVTLIENVVSEEEEEAIIQAVD-------------------------------A--------T--PWK--VSQSG--------RRKQ------DYGPKVNFKKR----------KVN-SK--C--FS--GLPAFIRP-L-TE-----------------------------------------------------------------------------RLVQ-MD------G-LA-------DF-QV-VE-------------QCNLE---YVPDRGSSIDPHFDDVWLWGERLVTLNLNSETTLTMTQK-E---------------------------------------------------KD-------------------------------------------------------------------------ICVS-------IPLPKRSVIVLYGPARYEWMHAIHRED----------------I-------------INRRIAVTFREL--SA----------------------E-FL----DGGVNEDTGSR--LLE-IARTYE-----GKAV
      NEMVEDRAFT_v1g34785_Nematostella_vectensis_156402493                CGCTGIRSC-LFC-KD-NT-------KTSQ-------STTSV---DEA-------------------------------------KTK-YLFC--HLC-------SQT-----LPL-GGTC--SHELGDCG--------------------RYG-----------P-----------S-L--DGITLIEDFVSQREEARIVQVID-------------------------------E--------T--VWK--PSQSG--------RRKQ------DYGPQVNFKKK----------KVK-MS--H--FN--GLPAFSEF-L-VR-----------------------------------------------------------------------------RMNDDVP------G-LK-------DF-VP-VE-------------LCNLE---YDEARGSSIDAHFDDFWLWGERLVTLNLLSATRLTMTKD-T---------------------------------------------------YE-------------------------------------------------------------------------I--S-------VPMPRRSLIIVSGAARHLWQHAVKRED----------------I-------------SGRRIAITLREL--SE----------------------E-FC----KGGRNENVGFQ--AIK-TALTFN-----GTSV
      alkbh4_Danio_rerio_688557483                                        CGCKGIRTC-LRC-ET-DE-------TKHL-------L-QKN---DLI-------------------------------------HYD-FIYD--PV------------------L-KSAV--REEEGSTP--------------------Q-C-----------F-----------E-F--PGVLLWENFVSEDEERELVSRMD-------------------------------Q--------D--VWR--ESQSG--------RRKQTSVYPKDFGPKVNFKKR----------RVH-VG--S--FS--GLPAISRR-L-LV-----------------------------------------------------------------------------RMSD-LP------Q-LS-------SF-KP-VE-------------QCNLD---YDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLD-Q--------------------------------------------GWGDMEQGE-------------------------------------------------------------------------VRVA-------VRLPRRSLVVLYGDARHRWKHAIHRKD----------------I-------------HGRRVCSTFREL--SA----------------------E-FL----PGGQQEKLGSE--LLD-IALSFQ-----GAPL
      ALKBH4_Taeniopygia_guttata_823470605                                CGCKGIRSC-LLC-EG-PA-------AAAP-------P-------PQG-------------------------------------EDN-FTYC--PA------------------T-GLAK--GNEHSEFA--------------------GWA-----------F-----------P-F--PGVFLVEEFISEDEECEIVELMD-------------------------------R--------D--DWK--PSQSG--------RKKQ------DYGPKVNFKKQ----------RLK-AG--S--FT--GLPSFSRK-I-VA-----------------------------------------------------------------------------QMKA-CA------V-LS-------GF-LP-VE-------------QCNLD---YSPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCD-SEDTIQLFPISSK----EELSPPSPFMQ-TSACRNSGEEGTQCFLSPRLVPGKE-------------------------------------------------------------------------VSVA-------ILLPQRSLVVLQGDARYKWKHGIHRRH----------------I-------------EHRRVCITFREL--SA----------------------E-FS----AGGRHEELGKE--LLQ-IALSFQ-----GRPV
      LOC100520070_Sus_scrofa_311251081                                   CGCKGIRTC-LIC-ER-QR-------GGDP-------PWQHS---PQK-------------------------------------THR-FIYY--TD------------------T-GWAV--GAEESDFE--------------------GWA-----------F-----------P-F--PGVTLIEDFVTREEEAEMVQLMD-------------------------------R--------D--PWK--LSQSG--------RRKQ------DYGPKVNFRKQ----------KLK-TA--S--FR--GLPSFSRE-V-VR-----------------------------------------------------------------------------RMGL-YP------V-LE-------DF-RP-VE-------------QCNLD---YCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSRE-APGSLLLCL--------APSGFPEALVE-GAVA------------PSRSVLCQE-------------------------------------------------------------------------VEVA-------VPLPRRSLLVLTGAARHQWKHAIHRRH----------------I-------------EARRVSATFREL--SA----------------------D-FG----PGGRQQDLGRE--LLQ-ISLSFQ-----GRPT
      alkbh4_Danio_rerio_68372246                                         CGCKGIRTC-LRC-ET-DE-------TKHL-------L-QKN---DLI-------------------------------------HYD-FIYD--PV------------------L-KSAV--REEEGSTP--------------------Q-C-----------F-----------E-F--PGVLLWENFVSEDEERELVSRMD-------------------------------Q--------D--VWR--ESQSG--------RRKQ------DFGPKVNFKKR----------RVH-VG--S--FS--GLPAISRR-L-LV-----------------------------------------------------------------------------RMSD-LP------Q-LS-------SF-KP-VE-------------QCNLD---YDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLD-Q--------------------------------------------GWGDMEQGE-------------------------------------------------------------------------VRVA-------VRLPRRSLVVLYGDARHRWKHAIHRKD----------------I-------------HGRRVCSTFREL--SA----------------------E-FL----PGGQQEKLGSE--LLD-IALSFQ-----GAPL
      ALKBH4_Monodelphis_domestica_612002272                              CGCKGVRTC-LLC-EG-ER-------DGGT-------AGTLY---PKK-------------------------------------TAH-FIYC--LE------------------T-GLAL--GTEKSGFA--------------------GWA-----------F-----------P-F--PGVAMIKDFVSADEETELVRLMD-------------------------------Q--------D--DWK--LSQSG--------RRKQ------DYGPKVNFRKQ----------KLK-TG--G--FD--GLPSFSRE-I-VH-----------------------------------------------------------------------------RMGQ-HP------V-LE-------RF-LP-VE-------------QCNLD---YHPERGSAIDPHLDDSWLWGERLVSLNLLSPTVLSMSRD-SNERLQLLSVAQQGTRNSPPNDPVPRDP-EDTG------------SRRSVPCDQ-------------------------------------------------------------------------VEVA-------IHLPARSLLVLFGAARYQWKHAIHRQH----------------I-------------ESHRICATFREL--SA----------------------E-FC----PGGKQGELGQE--LLE-IALSFQ-----GKPV
      ALKBH4_Homo_sapiens_8923019                                         CGCKGIRTC-LIC-ER-QR-------GSDP-------PWELP---PAK-------------------------------------TYR-FIYC--SD------------------T-GWAV--GTEESDFE--------------------GWA-----------F-----------P-F--PGVMLIEDFVTREEEAELVRLMD-------------------------------R--------D--PWK--LSQSG--------RRKQ------DYGPKVNFRKQ----------KLK-TE--G--FC--GLPSFSRE-V-VR-----------------------------------------------------------------------------RMGL-YP------G-LE-------GF-RP-VE-------------QCNLD---YCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMCRE-APGSLLLCS--------APSAAPEALVD-SVIA------------PSRSVLCQE-------------------------------------------------------------------------VEVA-------IPLPARSLLVLTGAARHQWKHAIHRRH----------------I-------------EARRVCVTFREL--SA----------------------E-FG----PGGRQQELGQE--LLR-IALSFQ-----GRPV
      _Schmidtea_mediterranea_386783769                                   CTCKGIRTC-SSC-NP-NK-------I------------KIE---NQN-------------------------------------CIV-CYFC--PK----------------I-S-KIVK--ENCISDLK--------------------C---EEFHKVN---I-----------E-L--NGIILIENFLTEDDKNYLLGGIC-------------------------------S--------N--SWV--DSQSG--------RRKQ------DFGPKVNFKKR----------KIN-LT--K--FQ--GLPEYIER-F-VN-----------------------------------------------------------------------------RFSD-IP------E-LK-------DF-NP-VE-------------LCNLE---YNPTRGASIDPHFDDFWLWGERLVTINVQSSTYLTFTPG-FPDMFD--------------ESSQAFFS-SVCS----EN------HENMGNNCA-------------------------------------------------------------------------VSIK-------VPLPEGSLVIVSGDARHKWMHAVSAAD----------------V-------------QSTRIASTLREL--SM----------------------E-FT----EN--DRELSRK--LID-LSLTFN-----GQPV
      MOQ_003722_Trypanosoma_cruzi_marinkellei_407409697                  CCCSGIRYC-GRC-IE-SE-------RAQG-------IIHQK---FLLVKSSDVIS-RQ------YGAGRTSSCSF-T--CVD--SSH-YGYC--WQC-------NRI-----F-LMHHGA--FKSCADHE--------------------GAT----PNLD---I-----------R-I--EGLFVIPDFLSLLDEEKLVSFLD-------------------------------E-P-SS-F-S--GWK--HSQSG--------RRKQ------DFGPRANFKKR----------KLN-TS--G--MR--GMPKQLES-V-ME-----------------------------------------------------------------------------KVKS-----------FVREITSK-EY-HI-VE-------------VSALE---YTSENSSSIDPHIDDTWVWGDRVGGLNLLEDTVMTFVNN-E----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFLPRGAFFLLSQGSRYDWLHGIRLEN----------------I-------------KHRRISFTFREF--SS----------------------D-LD----I---DREIIQN--VRK-ITSTFV---------
      TCSYLVIO_004974_Trypanosoma_cruzi_407849132                         CCCSGIRFC-GRC-IE-SE-------RAQG-------IIHQN---VLLVKSSDVIS-QQ------YSAGRTSSCSF-T--CVE--LSH-YGLC--WQC-------NRI-----F-LMHHGA--FKSCADHE--------------------GAT----PNLD---I-----------R-I--EGLFVIPDFLSLLDEEKLVSFLD-------------------------------E-P-SS-F-S--GWK--HSQSG--------RRKQ------DFGPRANFKKR----------KLN-TS--G--IR--GMPKQLES-V-VK-----------------------------------------------------------------------------KVKS-----------FVRDITSK-EY-HI-VE-------------VSALE---YTSENSSSIDPHIDDTWVWGNRIGGLNLLEDTVMTFVNN-E----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFLPRGAFFLLSNGSRYDWLHGIRLEN----------------I-------------KHRRISFTFREF--SN----------------------D-LD----I---DREIIQN--VIK-ITSTFV---------
      LPMP_352050_Leishmania_panamensis_731709183                         CVCSGIRFC-AKC-RD-TL-------RVQQ-------LFSGS---VFLSSASSVIE-KQ------WHNDRLSSCSF-A--IIG--KST-LSYC--IEC-------MTI-----F-K-SEAP--IKSCVDHQ--------------------G-A----ISTS---V-----------V-I--SGLVVFQDVLTEEEETALIYYLD-------------------------------N-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPKRNFKKK----------KVR-PA--E--IP--AMPLALEP-V-CA-----------------------------------------------------------------------------TISS-----------TTENFTGR-AY-RI-AE-------------VSALE---YVEGKMSNFDPHIDDTWLWGDRIAGLNLNEPCVVTFVEP-D----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRSFFLMSGNCRYKWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DAGTSEV--VLS-AASTFV---------
      LBRM_35_2190_Leishmania_braziliensis_MHOM/BR/75/M2904_154345804     CVCSGIRFC-AKC-RD-TL-------RVQQ-------LFSGS---VFLSSASSVIE-KQ------WHNDRLSSCSF-A--IIG--KST-LSYC--IEC-------MTI-----F-K-SEAP--IKSCVDHQ--------------------G-A----MSTS---I-----------V-I--SGLVVFQDVLTEEEETALIYYLD-------------------------------N-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPKRNFKKK----------KVR-PA--E--IP--AMPLALEP-V-CA-----------------------------------------------------------------------------TISS-----------TTENFTGR-AY-RI-AE-------------VSALE---YVEGKMSNFDPHIDDTWLWGDRIAGLNLNEPCVVTFVEP-D----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRSFFLMSGNCRYKWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DAETSEV--VLS-AASTFV---------
      TCIL3000_10_5510_Trypanosoma_congolense_IL3000_342184304            CHCTGIRFC-GRC-VH-SD-------RAQS-------IINQR---IPLVKSSAVVS-QQ------YTGKRTSSCSF-T--CVE--PSR-RSIC--PVC-------LGI-----F-ETDGSL--IRECKDHM--------------------CLT----LCDD---I-----------I-M--DGFCLMTDFISLKLVTFFD-------------------------------D-P-AP-F-P--PWK--DSQSG--------RRKQ------DYGPKANFKKK----------KLK-LG--N--FQ--GMPQQMEE-L-LG-----------------------------------------------------------------------------RVTS-----------FVSRYTNK-EF-SV-AE-------------VSALE---YTTKC-SSLDPHVDDTWLWGDRIGGLNLLVDVVLTFVNA-S----------------------------------------------------G-------------------------------------------------------------------------IAVA-------AHIPRRSFFMLSKVCRYEWMHGIRRED----------------I-------------VGRRISVTFREL--AD----------------------K-ID----V---DEDLCRG--IKA-AASTFV---------
      TVY486_1006420_Trypanosoma_vivax_Y486_340057250                     CCCSGIRIC-RHC-VM-SD-------RAQS-------IINCH---VPLTKACDVVE-KQ------YSVERISSCSF-I--CIG--AST-HSFC--SNC-------GKI-----F-VFPERA--IRACADHN--------------------GLT----ADSE---T-----------T-M--RGLMVSPDFVSDIEEDYLLRFFD-------------------------------G---AH-H-S--RWK--VSQSG--------RRKQ------DYGPRANFKKR----------KLK-KG--D--GN--GMPIQLKD-I-IT-----------------------------------------------------------------------------RVNQ-----------FISNETMK-RY-QT-IE-------------VSVLE---YSTKCGSSIDTHIDDTWLWGDRIGGLNLLEDVVLTLVDS-K----------------------------------------------------G-------------------------------------------------------------------------TVAT-------VFVPRRSFFLLSGESRYNWMHGIRSED----------------I-------------KSRRISMTFREF--AD----------------------N-LE----V---DERLLQD--ILS-FSLTFV---------
      LMJF_36_1970_Leishmania_major_strain_Friedlin_157876860             CSCAGIRFC-AKC-RG-SS-------RVQQ-------LFNGS---VPLSSARSVIE-KQ------QDNERLSSCSF-A--VIG--KSS-LSFC--IEC-------ASV-----F-K-SEVP--IKSCADHQ--------------------G-A----VATG---I-----------T-I--SGLAVFRDTLTEEDETAVIRFLD-------------------------------D-S-RP-F-P--PWK--ESQSG--------RRKQ------DYGPKRNFKKK----------KIK-VA--E--IP--GMPLVFES-V-FA-----------------------------------------------------------------------------VISS-----------MVETFTGK-AY-RI-AE-------------VSALE---YMEGKMSNFDPHVDDTWLWGDRIAGLNLNEACAVTFVNP-E----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRAFFLMSGNCRYRWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----V---DTKASEM--VLS-AASHFV---------
      Tb10.70.0360_Trypanosoma_brucei_brucei_TREU927_71747664             CRCTGIRFC-QHC-ID-SD-------RAQS-------IVRRH---VSLANASDVVL-RQ------YTDQRMSSCSF-A--CVG--PSL-LSMC--CAC-------RTV-----F-ETAGGV--LKCCGDHK--------------------GFI----ARTD---I-----------E-L--SGLTIQPGFVSPNEEEYLVAFFD-------------------------------N-P-AP-F-A--AWK--VSQSG--------RRKQ------EYGPKANFKKR----------KLK-VG--D--FR--ALPHQMKT-T-LD-----------------------------------------------------------------------------RVRS-----------FVAEQTMR-EY-CI-VE-------------VSVLE---YTAEC-SCLDPHIDDTWLWGDRIGGLNLLDDVTLTFVSA-D----------------------------------------------------E-------------------------------------------------------------------------VAVT-------VFVPRGAFFLLTGVSRYEWMHGIRRED----------------V-------------KNRRVSVTFREF--AD----------------------N-LV----V---DQEILKT--IVM-SATTFI---------
      LINJ_36_2080_Leishmania_infantum_JPCM5_146104297                    CSCAGIRFC-AKC-RD-SS-------RVQQ-------LFSGS---VPLSSARTVIE-KQ------HNDERLSSCSF-A--IIG--KSS-LSFC--IEC-------ASV-----F-K-SEVP--IKSCADHQ--------------------G-A----MATS---I-----------T-I--SGLAVLQHALTEEDEAAVIRFLD-------------------------------D-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPRRNFKKK----------KVK-VA--E--IP--SMPLVFES-V-FA-----------------------------------------------------------------------------VISS-----------MTETFTGK-AY-RI-AE-------------VSALE---YMEGKMSNFDPHVDDTWLWGDRIAGLNLNEACVVTFVNP-E----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRAFFLMSGNCRYKWMHGIRHEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DTEASEM--VLS-AASNFV---------
      TbgDal_X7900_Trypanosoma_brucei_gambiense_DAL972_788018057          CRCTGIRFC-QHC-ID-SD-------RAQS-------IVRRH---VSLANASDVVL-RQ------YTDQRMSSCSF-A--CVG--PSL-LSMC--CAC-------KTV-----F-ETAGGV--LKCCGDHK--------------------GFI----ARTD---I-----------E-L--SGLTIQPGFVSPNEEEYLVAFFD-------------------------------N-P-AP-F-A--AWK--VSQSG--------RRKQ------EYGPKANFKKR----------KLK-VG--D--FR--ALPHQMKT-T-LD-----------------------------------------------------------------------------RVRS-----------FVAEQTMR-EY-CI-VE-------------VSVLE---YTAEC-SCLDPHIDDTWLWGDRIGGLNLLDDVTLTFVSA-D----------------------------------------------------E-------------------------------------------------------------------------VAVT-------VFVPRGAFFLLTGVSRYEWMHGIRRED----------------V-------------KNRRVSVTFREF--AD----------------------N-LV----V---DQEILKT--IVM-SATTFT---------
      GSEM1_T00000659001_Phytomonas_sp_isolate_EM1_588321322              CQCRGIRFC-IFC-KE-SE-------RVRK-------LLSAE---GKLLDPSVVIS-HQ------QEEERTSSCSL-T--LLE--ESR-LRFC--INC-------QAI-----F-V-SSVP--LMSCSQHS--------------------S-S----LRSN---V-----------S-I--LGLHVVRDFLSDDEETYLVKFLD-------------------------------D-P-SP-Y-P--PWK--ESQSG--------RRVQ------EYGPKKNFKKK----------KIK-TT--E--FV--SIPLPFEN-I-LD-----------------------------------------------------------------------------KARG-----------LVERLTEK-PY-II-AE-------------VSVLE---YLRRRLSNFDPHIDDTWLWGDRIAGINLLEDCFITFVNS-E----------------------------------------------------G-------------------------------------------------------------------------VCLE-------VFLPRKCFFLMSGESRYSWMHAIRPEN----------------V-------------GNRRISFTIREL--SD----------------------A-FK----E---ENETACC--QITDAAKNYL---------
      LMXM_36_1970_Leishmania_mexicana_MHOM/GT/2001/U1103_401420114       CSCAGIRFC-AKC-CD-SS-------RVQQ-------LFSGS---VPLSSASSVIE-KQ------HHEERLSSCSF-A--TIG--KSS-LSFC--IEC-------MSV-----F-K-SEVP--IKSCADHH--------------------G-A----MATS---I-----------T-I--SGLAVFQDALPEEDETAVIRFLD-------------------------------D-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPRRNFKKK----------KVK-VA--D--IP--AMPLVFEP-I-FS-----------------------------------------------------------------------------VISS-----------MTETFTGK-AY-RI-AE-------------VSALE---YMEGKMSNFDPHVDDTWLWGDRIAGLNLNEACAVTFVNS-E----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRTFFLMSGDCRYKWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DTEASEM--VLS-AASNFV---------
      STCU_00887_Strigomonas_culicis_528254392                            CACSGIRFC-NRC-IN-SD-------RVKY-------LFSGN---VQLRNADDVIA-NQ------TSNDRTSSLSF-A--LLG--NSH-KSVC--IEC-------GVV-----Y-N-SSVL--ITSCADHK--------------------N-K----TDDK---V-----------D-VMVKGLFVERDVISDQEEADLIHFFD-------------------------------N-P-SP-F-P--DWK--ISQSG--------RRKQ------DFGPKRNFKKK----------KVK-PA--D--FP--HMPKVFEP-L-FC-----------------------------------------------------------------------------KVSK-----------EVSQCTASIPY-SI-AE-------------VSVLE---YTSENMSNFDPHIDDTWLWGDRIAGVNLLEDCVMTFVDS-N----------------------------------------------------G-------------------------------------------------------------------------NVVD-------AFLPRRCLFLMSSDCRSIWMHGIRPEN----------------I-------------KGRRVSITMREL--SD----------------------E-IK----Q---DLAVAAP--LLE-AARSFV---------
      DQ04_02651050_Trypanosoma_grayi_686635833                           CCCSGIRFC-DRC-IS-SD-------RAQA-------IIHQS---VDLTKASDVVS-QQ------YGSERTSSCSF-A--VVG--PSL-YSFC--WEC-------SSV-----F-QMHGRV--VRSCADHV--------------------ELS----SVTN---L-----------S-L--AGLFVAPDFLSLLEEEKLVMFFD-------------------------------N-P-SS-F-P--GWK--PSQSG--------RRKQ------DFGPRANFKKR----------KLR-VP--G--TP--WMPQQLKD-V-LG-----------------------------------------------------------------------------KVSS-----------FVTQQSGK-PF-CI-VE-------------ASVLE---YTSENSSSIDPHIDDTWLWGDRIGGLNLLEDAVMTFVHG-N----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFIPRGAFFLLSKDSRFIWMHGIRQEN----------------I-------------RNRRISITVREL--AE----------------------D-LE----V---DAELLKT--IME-NASSFV---------
      Tc00.1047053510187.490_Trypanosoma_cruzi_strain_CL_Brener_71659461  CCCSGIRFC-GRC-IE-SE-------RAQG-------ITHQN---VLLVKSSDVIS-QQ------YGAGRTSSCSF-T--CVE--LSH-YGLC--WQC-------NRI-----F-LMHHGV--FKSCADHE--------------------GTT----PNLD---I-----------R-I--EGLFVIPDFLSLLDEEKLVSFLD-------------------------------E-P-SS-L-S--GWK--HSQSG--------RRKQ------DFGPRANFKKR----------KLN-TS--G--IR--GMPKQLES-V-ME-----------------------------------------------------------------------------KVKS-----------FVRDITSK-EY-HI-VE-------------VSALE---YTSENSSSIDPHIDDTWVWGNRVGGLNLLEDTVMTFVNN-E----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFLPRGAFFLLSNGSRYDWLHGIRLEN----------------I-------------KHRRISFTFREF--SN----------------------D-LD----I---DREIIQN--VIK-ITYTFV---------
      AGDE_03849_Angomonas_deanei_528266291                               CTCTGIRFC-ALC-AD-SA-------RVRE-------VFENK---ELLRSADAVIA-NQ------SQKDRSSSYSF-A--LLN--KSK-YALC--AEC-------GAT-----F-KLRDHP--FRACAEHA--------------------G-K----EQEP---K-----------R-I--GGLHVMENIVTPDMEQSLLDFFD-------------------------------NTP-EP-F-G--GWK--VSQSG--------RRKQ------DYGPKRNFKKK----------KVK-PS--D--IP--HMPLVFKK-L-FA-----------------------------------------------------------------------------DVSQ-----------AVSGPTGK-PF-DT-VE-------------VSVLE---YTTERMSNFDPHVDDTWLWGDRIVGLNLLEDCIMTFVDA-E----------------------------------------------------G-------------------------------------------------------------------------DALD-------VCLPRRCLFIMSGESRYTWMHGIRPES----------------I-------------LNRRISMTMREL--SD----------------------E-LL----S---DTEAATM--IKE-AAHSFI---------
      AGDE_09971_Angomonas_deanei_528238215                               CTCTGIRFC-ALC-AD-SA-------RVRE-------VFENK---ELLRSADAVIA-NQ------SQKDRSSSYSF-A--LLN--KSK-YALC--AEC-------GAT-----F-KLRDHP--FRACAEHA--------------------G-K----EQEP---K-----------R-I--GGLHVMENIVTPDMEQSLLEFFD-------------------------------NTP-EP-F-G--GWK--VSQSG--------RRKQ------DYGPKRNFKKK----------KVK-PS--D--IP--HMPLVFKK-L-FA-----------------------------------------------------------------------------DVSQ-----------AVSGPTGK-PF-DT-VE-------------VSVLE---YTTERMSNFDPHVDDTWLWGDRIVGLNLLEDCIMTFVDA-E----------------------------------------------------G-------------------------------------------------------------------------DALD-------VCLPRRCLFIMSGESRYTWMHGIRPES----------------I-------------LNRRISMTMREL--SD----------------------E-LL----S---DTEAATM--IKE-AAHSFI---------
      OXYTRI_23312_Oxytricha_trifallax_403359506                          CACTGVRYC-KDC-ND-PEFRK----QFKD-------LYPID---DILEAQQKVLT---------------------------------YVTC--GLC-------NRF-----K-L-KNEL--NISISDQNDGNNDKQDNQNDQSFIDQCIGHL----TAEQ---L-----------D-F--GGLYTIKEIISEDFEYNIVNKLQ-------------------------------D--------Y--KWV--DSQSG--------RKKI------DFGPQVNFKKQ----------KLK-YT--K--FT--GFPLFIKP-I-LD-------LISTLNDQNLQQKEEELKEEQKDQALL-------------------------------------------QLKQ-----------HLPSVLK--DF-QP-IE-------------VNVLE---YDEQRGSNIAPHKDDFWLWGERIIGINLLKDTFMTFQRD-S---------ENQL---------------------------------------G--------------QI---------------------------------------------------------VEIE-------VPVKRRMMYVISGKSRFEWMHGIKSEH----------------I-------------KGKRIVCTFREF--SD----------------------E-FK----S--QDNEDANK--IRE-IAKNFI---------
      OXYTRI_08063_Oxytricha_trifallax_403343479                          CACTGVRYC-KDC-ND-PEFRK----QFKD-------LYPID---DILEAQQKVLT---------------------------------YVTC--ELC-------NRF-----K-L-KNEL--NISQSDQNNGNNDKQDDQNYQSFIDQCIGHL----TSEQ---L-----------D-F--GGLYTINDFISEEFEQDIVNKLQ-------------------------------D--------Y--KWV--DSQSG--------RKKI------DFGPQVNFKKQ----------KLK-YT--K--FT--GFPLFIKP-I-LD-------LIQTLNDEILQEKIEEQKNQQKPQ-----------------------------------------------LKE-----------NLPSVLK--DF-IP-IE-------------VNVLE---YDEQRGSNIAPHKDDFWLWGERIIGINLLQDTFMTFQRD-S---------QNQI---------------------------------------G--------------QI---------------------------------------------------------VEIE-------VPVKRRMLYIISGKSRFEWMHGIKSEH----------------I-------------KGKRIVCTIREF--SD----------------------E-FK----S--QDNEDANK--IRA-IAKTYI-QA--NIKY
      Pmar_PMAR003551_Perkinsus_marinus_ATCC_50983_294944511              CACTGVRSC-RLC---------------EE-------VTGRS---LKSTHPRP-----R------YLPDGVAD--------------------------------------------------------------------------------T----AKSD---I-----------V-P--PGLVVLADAITEAEEATLLGDIY-------------------------------A--------R--PWK--LSQSG--------RRKQ------DYGPQVNFKKR----------KLKCPD--N--FQ--GLPHSIDL-V-LP-----------------------------------------------------------------------------RIHT-----------GLGLLLDH-AW-H---E-------------MVVQE---YAVSRGSSIDLHVDHSWVWADGILDLSLAADCIMAFANPKE----------------------------------------------------G-------------------------------------------------------------------------VYYD-------VGLPRRSACLIAGLSQTQWMHGIKRDN----------------A----------CLGGDTRVSITLRVL--DG----------------------A-VA----L---TAEGQET--IRR-SRMRC----------
      NCLIV_063160_Neospora_caninum_Liverpool_401412938                   LARPSLRD--PS--------------------------ESPA---APACPGRRGLSMKN------LEEELMSLLGLCGDGGAG--ISG-DRGCGAQGG-------SGV-----S---EMQP--MAGADARP--------------------ELV-AG-RRER-P-G-----------D-G-PFGVFLLPDALQPQEETEILAWADGGTEE-TQAREGDSRAEACGGE----GEPRRE-T-RAKEQG--FWA--LSQSG--------RRKI------DFGPKVNFKKK----------RLK-LG--L--FN--GFPPFTKR-L-LALHPDERSPASSCSRSGCSSPS--SSPSSPPPPPLSSRS-VRAFPFPVAASSVCTVER-------------AGVQGAERLTL--Q------A-FRKKLLS--TF-QP-VE-------------LCLLE---YVPSRGSHIEEHFDDFWLWGPRLVTFTLASSTILSFVSP-------VFCVPREL---FEAARPHSLCR-HGED------------TPSP----SSYPSPSSSSAASPEALQASPASAPHSVCGRSAKKSASSVSSSASRESSLPSLSLGPSSLPSSSSLASECGRVRVEIR-------VLLPRRSLVVCEGPCRYTWTHAIRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGESAEVGQK--LLS-LAASFN-----GSPV
      TGMAS_246140_Toxoplasma_gondii_MAS_672578582                        VELQALEH--ATT-----R-------RDRA-------LRAPV---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSAPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASTSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-QSAFFN-----GSPV
      TGGT1_246140_Toxoplasma_gondii_GT1_523576915                        VELQALEH--ATT-----R-------RDRA-------LRAPA---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPGNASSAPPSCSDSFLPSPADPSSPTVSSAPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      TGFOU_246140_Toxoplasma_gondii_FOU_672285053                        VELQALEH--ATT-----R-------RDRA-------LRAPA---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPGNASSAPPSCSDSFLPSPADPSSPTVSSAPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      TGVAND_246140_Toxoplasma_gondii_VAND_672573839                      VELQALEH--ATT-----R-------RDRA-------LRAPV---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------EEK-----S---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMGTGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-CLSSKAVSSSAVASTSGSSSASCSSAAPSDSCSLPSLGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      TGRUB_246140_Toxoplasma_gondii_RUB_672301308                        VELQALEH--ATT-----R-------RDRA-------LRAPA---PPMSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASTSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      HHA_246140_Hammondia_hammondi_675123610                             VELQALEQ--ATT-----K-------RERV-------LRAAV---SPVSEGKLGLS-SD------LEEVLIPLLGL----SDA--HSD-ADDC--EEG-------GEN-----R---EAEV--VVRKRDSS--------------------QFA-TR-RRDG-L-A-----------E-S-PFGVFLLPDALERQEEAAILAWADGNMETGAQSREATHKPRPRGDTREYGGETETE-P-RA---G--FWA--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFTKA-L-LSQPEKASCAPPSRSDSFLPSPSNPSSPDVSSVPSV------------FSPRATSLMESPRAAASNVRTPAGTGKRGVGRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VVCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-QLSPASSCSP-SLSPAAVSSSSGASPSCSSSDPCSSSDPSDSCSLPAPCLASGEFGRVRVEIR-------VVLPRRSLVVVEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGKNEEVGQK--LLS-LSAFFN-----GSPV
      TGVEG_246140_Toxoplasma_gondii_VEG_557738285                        VELQALEP--ATT-----R-------RDRA-------LRAPV---PPVSEGKPGLS-SD------LEEALLPLLGL----SEA--HSD-TEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTSGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      TGME49_046140_Toxoplasma_gondii_ME49_237835449                      VELQALEP--ATT-----R-------RDRA-------LRAPV---PPVSEGKPGLS-SD------LEEALLPLLGL----SEA--HSD-TEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAKTE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTSGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      TGP89_246140_Toxoplasma_gondii_p89_672276401                        VELQALEH--ATI-----R-------RDRA-------LRAPV---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------EEK-----S---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTSGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV
      PTSG_09879_Salpingoeca_rosetta_514683301                            MTMVVMEKK-KDD----RE-------EAAP-------PPATV---SAP-------------------------------------HVQ-YRWC--EA------------------C-GKLR--IMPPGDVP--------------------CTGASTCASAESLGL-----------PAF--EGVHVFREFVTADEEQALLSQMD-------------------------------E--------W--PWK--LSQSG--------RWKQ------DFGPKVNFKRK----------KVK-VG--N--FT--GLPSSSRD-L-VA-----------------------------------------------------------------------------RMQA-LP------C-LQ-------DF-EP-VE-------------QCHLD---YRPERGAAIDMHFDDDWIWGERLVTVNLLSETRLSFEHP-Q----------------HE---------------------------------GA--------------------------------------------------------------------------QVY-------VVLPARSLVAVQGSARTQWKHAVHRGA----------------I-------------TDRRIAVTLREL--GP----------------------D-FV----AGAGRAEEGRH--LLD-IAHTFK-----GTPL
      OT_ostta04g01970_Ostreococcus_tauri_693500295                       KGCGRLASS-EKR-RE-VV-D-----DFRG---------------RRAHTGARTTG---------WVAKDDLAGGW-I-------DRR-------DGG-------GAT-----V---GKST--PVDAEGTK----D---------------AFA-EM-ARAM-K-R----VGAV---K-M--SGHFLLLDFITEDEERAIVEYLD-------------------------------A-D-TS---R--PWK-DSSFNG--A-----HEGK------KYGVEPNLLKR----------TVE-PT-----KV--PIPKILKQLV-IK-----------------------------------------------------------------------------KFAS-AH--E---T-LR-------RF-EP-NE-------------CNAIN---YRKDLGSVLTPHCDDRQLSSDILVNLSLVSDCTMTYIHE------------------------------KHPE----------------------------------------------------------------------------------------------RRVE-------VYLPRRSLQIQSGSTRYDYMHSIANEN----------------L-----------H-GDRRVSVTFRES--GA----FTAKKK----------------------------------------------------
      Ot04g02100_Ostreococcus_tauri_308802830                             KGCGRLASS-EKR-RE-VV-D-----DFRG---------------RRAHTGARTTG---------WVAKDDLAGGW-I-------DRR-------DGG-------GAT-----V---GKST--PVDAEGTK----D---------------AFA-EM-ARAM-K-R----VGAV---K-M--SGHFLLLDFITEDEERAIVEYLD-------------------------------A-D-TS---R--PWK-DSSFNG--A-----HEGK------KYGVEPNLLKR----------TVE-PT-----KV--PIPKILKQLV-IK-----------------------------------------------------------------------------KFAS-AH--E---T-LR-------RF-EP-NE-------------CNAINCTFYS-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      Bathy06g04110_Bathycoccus_prasinos_612393879                        HHVVRLQEHVNEC-LD-ER-A-KR--EEKE---------------EKEEREHSVIT-DA------EKTETLASKNC-R-------KRK-------AFE-------EKE-----E---MPSIPIPTIPNAKN---ND---------------AFL-KI-KRGS-I-EMSKISSPS---H-I--SGHKVIENFITEEEETELLRAVY-------------------------------D-M-SE---Q--TWN-DRNASGNGR-----HNGK------SWGVNADRARR----------KVF-KA-----KR--EMPEAFQR-IVLE-----------------------------------------------------------------------------KLRA-NE--YGYKG-LR-------EFGFACNE-------------CNAIE---YVREKGHELRPHVDDRILSSDIIINLSMVGTCIMRYRMN------------------------------KNQG----------------------------------------------------------------------------------------------IEIA-------KKLPRFSLQIQGGRCRFEWEHGIRNED----------------L-----------I-DGKRVSLTFRCS--GRGNGKDGVYKD---------VVFE-EP----PRGYVNDLEGR---------------------
      MICPUCDRAFT_60114_Micromonas_pusilla_CCMP1545_303282765             GGETAAAAV-NVH-GQ-IA-H-----TARG---------------TPWRRSMAASGMDA------LEDDDARRRGD-D-------DDD-------DDDVCVSRDGNDD-----I---ANDA--PGDASSNV---KD---------------TLA-RL-MRAS-A-A----TSTSNTPS-L--PGHHLLLDFITEDEENALVAFLD-------------------------------D-G-ER---GIHDWK-PSTFNG--A-----HRGK------AWGVRVDLKRR----------TVS-PP-----TR--EMPPRLLA-V-AE-----------------------------------------------------------------------------KMRG-AH--A---L-LA-------RF-SP-NE-------------ANAIS---YDKRLGDRLLSHVDDRQLSSDVLVNLSLCGECVMTYERT----------------T-TRSSGGG-----TRGS----------------------------------------------------------------------------------------------DRVD-------VRLPRRSLQIQSGDARYAFAHSIANEN----------------L-----------L-DPRRVSITFRES--RT----PSTRTT---------------------------TRGK---------------------
      OSTLU_86981_Ostreococcus_lucimarinus_CCE9901_145345998              DEAGKMPSS-EKR-RD-VR-E-----DYRG---------------RRAHTGRTTTG---------FVAKDDLEGGW-I-------ARA-------GEC-------GAR------------------SDGAR---SD---------------AFA-AL-ARGA-K-L----QAKV---K-L--PGHYILENFITEDEERRIVDWLD-------------------------------D-DIAA---G--PWR-DSSFNG--A-----HQGK------KYGVEPNLLKR----------CVE-PA-----RV--PMPKILRDLV-VA-----------------------------------------------------------------------------KFAA-AH--E---T-LK-------HF-TP-NE-------------CNAIN---YRKDLGSVLTPHCDDRQLSSDILVNLSLCSDCTMTYSHE------------------------------KFAS----------------------------------------------------------------------------------------------KRVD-------VRLPRRSLQIQSGSTRYDYMHSIANEN----------------L-----------H-GNRRVSVTFRES--GV----LQKSPQTPKWRPNQ--------------------------------------------
      MICPUN_62550_Micromonas_sp_RCC299_255085018                         RDETPRASV-------------------------------------PDQEEPAATG-EK------VSDQTTTKTGE-L-------DDD-------DDD--------------------DTK--P-----------S---------------ALA-EL-MRAS-A-T----APT----R-L--PGHHLIPDFITPDEESRLIEYLD-------------------------------R-D-ES---DTNPWR-PSNFNG--K-----HRGK------KWGVEVDLKRR----------TVA-PE-----RR--PLPALVRA-V-AD-----------------------------------------------------------------------------RMPA-AH--P---A-LR-------GF-VP-NE-------------ANAID---YDRRGGAELLPHVDDRQMSTDLIVNLSLAGDCVMTYVED----------------A-GRDGRRGGWEGVPAGA----------------------------------------------------------------------------------------------RRVD-------VFLPRRSLQVQSGPCRFNFAHSIRNEN----------------L-----------R-APRRVSITFRRS--QM----PRTRTR---------------------------VRGE---------------------
      CHLREDRAFT_95290_Chlamydomonas_reinhardtii_159473956                --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRNK------RWGVQPDYHRR----------GVA-PA-----AH--PLPPLLLA-L-AR-----------------------------------------------------------------------------RMRA-AV----GGP-LK-------DF-SP-NE-------------ANAID---YRRSHGSWLKPHADDRILSGEVIVNLSLAGDCVMTLARL-------------------APQSGS-----GGGG--GGGG-----------------------------VAGAAGGGGA-------------------------------------------------DEFR-------IRLPRRSLQIMSKAARYSYSHGIWPAD----------------L-----------L-DERRVSITFRHS--KP-R--PCAK------------------------------------------------------
      VOLCADRAFT_119009_Volvox_carteri_f_nagariensis_302847355            SGGGASASC-RHG-LN-EH-G-----LDPG----PVAGNIPL---HPMFLPRGAVA-SR------RVCTTWGPKPA-C-------RQE-------GAE-------EEE-----E---AARA--AVPEVIAE---AA---------------AVA-GE-MRAE-V-S-HP--------A-L--EGQYLVLEFVTPAEEAELLAMCD-------------------------------D-P-VL-KPSWSPWI--GQMYG--NATAQKTRGK------RWGVLPDYHRR----------GVA-PV-----EH--PLPPLLRI-L-TE-----------------------------------------------------------------------------RMRV-QV----G-L-LR-------CF-QP-NE-------------ANAID---YWRSRGSWLRPHVDDRILSGDLIVNLSLGGAAVMTFARE-------------------RDKDEG-----GHPGLQQHQQ-----------------------------QQQHPRQSAG-------------------------------------------------DEVR-------VRLAPRSLQILSRAARYSYTHAIAASD----------------L-----------L-DARRVSITFRRS--EL-R--PFERAE--R-------------------------------------------------
      Ot04g02090_Ostreococcus_tauri_308802828                             ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MFAH-AT--D--------------------------------------------RKDLGSVLTPHCDDRQLSSDILVNLSLVSDCTMTYIHE------------------------------KHPE----------------------------------------------------------------------------------------------RRVE-------VYLPRRSLQIQSGSTRYDYMHSIANEN----------------L-----------H-GDRRVS-TVKHT--DA----DSVCAR------NHLLLHT-LP----RRTPAMSSRDP---------------------
      SELMODRAFT_403903_Selaginella_moellendorffii_302757669              RPTSKVAPK-KRA-LK-KS-S-----SQQQ---------------RLVNTGASSFV-EC------PACQFLLSAGR-D-------ETI-------CLD-------EQA-----S---CSSV-------------------------------FV-AL-SSSR-V-Q----PKPS---V-M--NGYQLVENFISCEEEEKLIFAMD-------------------------------S-D-AR---N--LWK-PYSFTG--L-----NNGK------SYGLVMALGKRFVEINAFVHRKIL-AP-----KF--EMPHVLEP-I-ME-----------------------------------------------------------------------------RMRS-IP--L-----LA-------EF-FP-NE-------------MNSLE---YIRESGHFLRPHVDDRQLSGTLIVNLSMCGECYMTFKRE------------------------------RGAY----------------------------------------------------------------------------------------------ECHK-------IRLKQRSLQILTGDSRYNFTHEIENRD----------------L-----------L-SPRRVSITFREV--IS--------------------------------------------------------------
      SELMODRAFT_406367_Selaginella_moellendorffii_302763503              RPTSKVAPK-KRA-LK-KS-S-SRASPQQQ---------------RLVNTGASSFV-EC------PACQVLLSAGR-E-------ETI-------CLD-------EQA-----S---CSSV-------------------------------FV-AL-SSSR-V-Q----PKPS---V-M--NGYQLVENFISCEEEEKLIFAMD-------------------------------S-D-AR---N--LWK-PYSFTG--L-----NNGK------SYGLVMALGKRFVEINAFVHRKIL-AP-----KF--EMPHVLEP-I-ME-----------------------------------------------------------------------------RMRS-IP--L-----LA-------EF-FP-NE-------------MNSLE---YIRESGHFLRPHVDDRQLSGTLIVNLSMCGECYMTFKRE------------------------------RGAY----------------------------------------------------------------------------------------------ECHK-------IRLKQRSLQILTGDSRYNFTHEIENRD----------------L-----------L-SPRRVSITFREV--IS--------------------------------------------------------------
      AURANDRAFT_66792_Aureococcus_anophagefferens_676394053              RDAAAEAAA-LDD-VA-SR-A-----ARRE------ALCLMC--CEKRYTNCHREG----------LATALVARG------------------------------------------VRVV--HLRADGDA----E---------------AHP-RP-LAAL-R-A----ARHE---A-L--AGQYTVANFLSEAEEASLLAFLD-------------------------------G---EP---G-HPFV-RRDFNG--P-----ARGK------AWGVRTDLKRR----------TFA-EP-----AR--AMPDIFAP-L-VR-----------------------------------------------------------------------------RMRT-IP------E-LR-------SF-RP-NE-------------ANALD---YRRSEGHYLGAHCDDRQLSGPILVNLCLAGDATMTYTRD----------------------QAGR----GSAG----------------------------------------------------------------------------------------------ETVR-------ARLPRRALQIQSGSVRFDYRHGITNAD----------------F-----------H-ADRRVSITFRMN--KH----PGHRFG----------------------------RMMAGWTP-LRAWLI-----VATP
      EMIHUDRAFT_458739_Emiliania_huxleyi_CCMP1516_551571747              RETAHGPAA-TAG-AQ-LR-A-----TPPRLQLLWAALSPACGRFAAEHVQRHALG----------PCQRSVGRG------------------------------------------SESA--VVNAAAGG----R---------------CEV-RA---------------HA---V-L--RGLFLVHDFVTEEEEAALLQWMD-------------------------------G-Q-Q----P--GWR-LRHFNG--P-----ALGM------RWGATTDLRRR----------SVT-LG-----AP--MPPPLLA--L-TA-----------------------------------------------------------------------------RMRT-LPVPS---P-LA-------GF-EA-NE-------------ANALR---YVRAEGHFLGPHCDDRQLSGDTLVNLSLAGEATMTYAHD----------------------RDG-------SR----------------------------------------------------------------------------------------------PPVR-------VRLPRRSLQIQTEDVRFNHTHGIAAED----------------L-------------PELRVSVTFRRA--KL----TQ--------------------------------------------------------
      GUITHDRAFT_150889_Guillardia_theta_CCMP2712_551671246               --------------------------------------------------------------------------------------------------------------------------------------------------------------MKPK-A-I-R---------D-V--PGLFQFPDFITEDEEGLLLQSLD-------------------------------T--------G-NKWQ-LSSFNG--E-----CMTQ------RWGVVTDLKRR----------SVR-PCSIERGEE--DLPSFLRA-I-IE-----------------------------------------------------------------------------KWIN-RC----E-V-IA-------QF-HP-NE-------------ANANS---YEKHKGHSLAAHFDDRFLSGDILVNLSLGADCHMTFARK--------------------------------------------------------------------------------------------------------------------------------DKIK-------VLVPRRSLQVVTGRARFEHTHGIDLDD----------------F-----------H-GPRRVSITFRRA--KL-T--CT--------------------------------------------------------
      MONBRDRAFT_26078_Monosiga_brevicollis_MX1_167524358                 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------M-----------P-F--TGVRVWLDFVTEAEETALVAQMD-------------------------------A--------W--PWT--DSQSG--------RRKQ------DFGPKANFKKK----------KVK-AA--N--FT--GLPALARP-L-IP-----------------------------------------------------------------------------RLQA-LT------ADLK-------DF-VP-VE-------------QCHLD---YEPSRGAAIVPHFDDFWLWGERLITLNLLSTTFLCFQLP-D----------------EP---------------------------------DA-------------------------------------------------------------------------LEIA-------VPLPPRSLLEVKGVARMAWHHAVHRQD----------------V-------------SQRRIAMTWREL--TP----------------------E-FL----SGE-RQAEGHA--LLD-LAATYT-----GLPM
      consensus/100%                                                      ...................................................................................................................................................................................................................................................................................................................................................................................................................h............................................................................................................................................................................................................................................................................................................................................................................
      consensus/95%                                                       .......................................................................................................................................................................................G.......l.......hl..h.............................................h....p..G...........b.......aG...sh.+b...........l..............hP........................................................................................h.......................a.....E..............s.bp...Y.......h..H.DD.bl.u..l.shsh...s.hsh..............................................................................................................................................h.h....h.h....sR..a.H.l...p................h...............pRls.ThR....................................................................
      consensus/90%                                                       .......................................................................................................................................................................................G.....phlp...E..ll..hp............................................W....sbsG........pb.b......paGsp.sh.+b..........pl...s..........hP...b..l.h..............................................................................ph.......................a.....E..............s.lp...Y.......h..HhDD.bl.u..l.shsl...s.hoh.....................................................................................................................................h........l.ls..sh.l....sRa.a.Hul..pp................l...............+RlshThRc...................................................................
      consensus/85%                                                       .....h.................................................................................................................................................................................G..lh.shlp..-E..ll..hD............................................W....SbsG........+c.b......caGsp.sh+++..........plp..s.....b....hP...c..l.h..............................................................................ph.............h.........a....sE..............s.l-...Y...p.u.h..HhDD.bl.G.blsshsL..ss.hoh.p...................................................................................................................................h........l.lPpbuh.lb.s.sRapa.Hul..cp................l...............RRlshThRc...................................................................
      consensus/80%                                                       ..h.sh.................................................................................................................................................................................Gh.lh.shlp..-E..ll.bhD...............................p............W....SboG........++.b......caGsp.Nh+K+..........+lp..s.....b...shP...c..l.h..............................................................................+hp............h.........a.p..sE.............hs.L-...Y..pp.u.lp.HhDD.hL.G.blsslsLh.ss.hohsp...................................................................................................................................h........l.lP+Ruh.lhpG.sRapa.Hulp.cp................l..............sRRluhThREh..ss..............................................................
      consensus/75%                                                       s.h.ulb........p......p.....p......................................................................................................................................................h...Gl.lh.shlo.pEE..ll.hhD...............................p........s...W....SQSG........RRpb......caGPpsNFKK+..........+lp.ss.....h...shP.hhc..l.h..............................................................................+hps...........h.........a.p..sE.............hssL-...Y..ppuS.lcsHhDD.WLWGpblsslsLhpsshhoasps..................................................................................................................................lp.......V.LPRRuhhlhpG.sRapW.Hulc.cp................l..............sRRluhThREh..us........................h..........................h..........
      consensus/70%                                                       s.hpulc........p.pp...p.....p.........................................................p.....s....s......................s.......s........................s...........h.............h...Gl.lh.-hloppEE..ll.hhD...............................p........s...W....SQSG........RRKb......-aGPpsNFKK+..........+lc.ss.....h...uhP.hhc..l.h..............................................................................+hps...........h........sF.ps.sE.............hssL-...Y..pcuS.l-sHhDD.WLWG-RllslNLhssshhoasps................................................................................................................................splc.......V.LPRRSlhlhpG.sRapW.HuI+.cc................l..............sRRlulThREh..us......................p.h.........p.p..p...lb...s..a..........
      hCG_40699_Homo_sapiens_119587492                                    -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------A-L-P--PGLMVVEEIISSEEEKMLLESV--------------------------------------------DWTEDTDNQN-SQKSLK-HRRV-K----HFGYEFHYENN----------NVD-KD-KPL-SG--GLPDICES-F-LE-----------------------------------------------------------------------------KWLR---------K----------GY-IK-HK----PD-------QMTIN-Q-YEPGQG--IPAHIDTHSAFEDEIVSLSLGSEIVMDFKHP-D----------------------------------------------------G-------------------------------------------------------------------------IAV--P-----VMLPRRSLLVMTGESRYLWTHGITCRKFDT-VQASE-SLKSG-IITSDVGDLTLSK-RGLRTSFTFRKV--------------------------------R-------QTPCN--CRA----------------\AlkBH8 used for comparison
      GLOINDRAFT_52996_Rhizophagus_irregularis_DAOM_181602_552928730      ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------P-I--DGLFLIDDFISEQEEFELINSID-----------------------------------LC------EWSGNGIPPN-PE---M-RRRT-Q----HYGYEFSYRYR----------KVV-QN-----LG--VLPNFLDF-L-IK-----------------------------------------------------------------------------RFIE---------K----------KF-IQ-STEQEYPN-------MCIIN-E-YQAGQG--IMPHTDSPEIFGPVILSLSILTSCLITFTHI-Q----------------------------------------------------D-------------------------------------------------------------------------SSNQSI-----ILLKPRSLLVMTKSSRFDYKHSISKDAIEY-YNGEE-------I-----------K-RDRRVSLTFRTI----------------------------V-------------------------------------|
      RirG_035650_Rhizophagus_irregularis_DAOM_197198w_595492923          -----------------------------------------------------------------------------------------------------------------M---TTSE--QLNSQSFP--------TTSQS--------------YLSS---L-N-------S-P-I--DGLFLIDDFISEQEEFELINSID-----------------------------------LC------EWSGNGIPPN-PE---M-RRRT-Q----HYGYEFSYRYR----------KVV-QN-----LG--VLPNFLDF-L-IK-----------------------------------------------------------------------------RFIE---------K----------KF-IQ-STEQEYPN-------MCIIN-E-YQAGQG--IMPHTDSPEIFGPVILSLSILTSCLITFTHI-Q----------------------------------------------------D-------------------------------------------------------------------------SSNQSI-----ILLKPRSLLVMTKSSRFDYKHSISKDAIEY-YNGEE-------I-----------K-RDRRVSLTFRTI----------------------------VN----FNENNENMEKG--GCK-FNNNSV-HY--K---|
      LCOR_06335.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661182209        -----------------------------------------------------------------------------------------------------------------------------------------------M--------------HTID---Y-G-------T-Q-I--PGLLVIEDFVTDEEEVALVTEVD-----------------------------------SR------TWCGLGVSPN-PE---L-KRRT-Q----QYGHLFSYRYR----------KVL-EK-----YG--PLPAFTHT-V-VT-----------------------------------------------------------------------------RIME---------N----------KL-MP-KE----PD-------HLLVN-E-YNAGQG--IMPHTDAPALFGPAILSLSLLSACVMKFTHV-E----------------------------------------------------K-------------------------------------------------------------------------GNSV-D-----ILLPRRSMLVMTGDARYLYKHSISKDLVESSSEGVT-------V-----------H-RDRRVSFTFREI----------------------------IA----WEVPPEDPSCG--CTK-SCSNK----------|
      EMIHUDRAFT_240478_Emiliania_huxleyi_CCMP1516_551578395              -------------------------------------------------------------------------------------------------------M-QPE-----A---CHGV--PLANLEAP--------------------------------------------S-A-V--EGLALHPDFVSEDEERELLRTVD-----------------------------------AQ------PWDC--------T---I-RRRT-Q----HYGRRFDYHNK----------TVG-DE-----AT--PLPAEIRR-L-IL-----------------------------------------------------------------------------ERVD-------VQP----------AL-LP-WR----PAAGREGSLQCTVN-E-YPPGVG--IAPHIDTHSAFEDGIASLSLGGGCAFRLRRG-S----------------------------------------------------D-------------------------------------------------------------------------GADH-S-----VWLPPRCLLVMSGAARYEWQHSISGRKFDR-VEHRESPAGWEWV-----------P-RERRISVTLRRV-LGS----------------------G-TA----------GTGQS--AW-----------------|
      ACA1_127620_Acanthamoeba_castellanii_str_Neff_470376461             --------------------------------------MSRN---QRVQPGGGSRG-GK-----PSGRGRGGAKPA----VPGW-APV-ATTS--TDS-A-----PPL-----A---LAYT--PSTSSSSS--------SGL-R--------------QAAA---L-P--------------PDLEYIEDFITADEERALVQAID-----------------------------------AQ------EWSE--------K---L-HRRT-Q----HYGYEFDYSRQ----------DIN-TS-----VPI-ELPVFAQQ-I-IE-----------------------------------------------------------------------------KMRQ---------R----------GL----PQ----FD-------QLIIN-E-YTPGQG--INPHIDKTHCFGPCVVSLSLLSTCVMTFTSL-E----------------------------------------------------T-------------------------------------------------------------------------GEKI-P-----VVLRPRSLVVLRGQARYGWQHGIEPKRADI-VAGKH-------T-----------P-RARRVSLTYRTV----------------------------AK----------SAGNA---------------------|
      MICPUN_86885_Micromonas_sp_RCC299_255086679                         GDVDGVDMF--------DP---S----MAR--CVV--TMSRV---EDAIAAQAATH-ET-------CRPDLGDRRL----WVRF-SSD-PNAT--GG--------EPE--SR-A---KQEE--TWCAATRD--------SAT---------------------------------L-G-V--PGVTLITDFVTEEEEREMLACVD-----------------------------------SD-E----RWQG----------L-A-KRRV-L----HYGYAFDYGTR----------DAR-DK-----TS--PMPAFVAG-L-LG-----------------------------------------------------------------------------RAAS---------C----------GA-PG-AC----ES--VHCD-QLTVN-E-YVAGVG--IAPHVDTHSAFGPTILSLSLAGRAVMEFRLH-E----------------------------------------------------G-------------------------------------------------------------------------GEKE-PRERRAISMPPRSLLVLHGEARYRWLHYIPHRKRDA-IVGED---ECE-A-----------R-EERRVSFTFRRR--RE----------------------G-------------ACGCE--WPE-ACDSRE-GA--AQRL|
      NAEGRDRAFT_58773_Naegleria_gruberi_strain_NEG-M_290982964           --------------------------------------------------MKRTLN-DF-L----QTSDKKKQSTL---------KKI-------KSN-------SEV---------SGAP--VKQISSYK---------------------------TDSE-----------------I--GGLYIIENIIDVAEERKLVKFID-----------------------------------SQ------KWN---DEIS--------RRTQ------HYGVSYNYGAR----------GVK-EA-LK--VP--PVPSEFSD-L-LE-----------------------------------------------------------------------------EIKN-KE-GL-D-S-IRNLMEGI-DF--------------K----QVIIN-E-YKGAKQG-ISKHVDHCQDFGPLILILSLGDECVMKFHKL-E------------------QVKEEDLKK-KKVK--------RT--EVSP----S-------------------------------------------------------------------------ECYD-------RRMPRRSLIILSGDARYQYQHEIPKTM----VFKID-GKQF--L-KRS-------E-SYRRVSITYRSL--TT----------------------D---------------------------------------|
      SAMD00019534_100830_Acytostelium_subglobosum_LB1_735850808          ----------------------------------------------------------------------------------------------------------------------------MATVEVV----D----------------------MDDG---V-----------Q-L-PPGLSLLTDFITEEEERILVDNID-----------------------------------KS------EWK---TEIA--------RRTQ------QYGYHYCYRLR--GVDELDD-QGQ-PM-----TP--PIPQYLQF-L-VD-----------------------------------------------------------------------------RLAA-TP------Q-------------IP-VG----MD-------QIIIN-E-YEPGQQ--IKPHIDSTKDWDACVVSLSCLSDWRMVFIPE-D----------------------------DDKS----------------------------------------------------------------------------------------------KEVS-------MVLPKRSLLVLKGDARYKWKHGIRSQ-----------------V-----------K-VGRRVSLTFRHY----------------------------IG----SGGNS---------------------------|
      BATDEDRAFT_87049_Batrachochytrium_dendrobatidis_JAM81_575476790     ---------------------------------------------------------------------------------------------------------MDSSVGLDA---ADDP--TRNHANLP--------S---L--------------HKHP---F-E-------P-V-I--SGLRLIPDFITQQEELDLIASID-----------------------------------AH------PWSGYGIPPN-PE---L-KRHT-Q----QYGFLFSFRTR----------TIT-EC-----LG--SLPAFSSF-V-ID-----------------------------------------------------------------------------RMLL---------P----------EF-NV-FPNDP-PN-------HVLVN-E-YQPGQG--IMPHVDSQDTFGDVVTSLSLWSSCVMSFGNK-M----------------------------------------------------T-------------------------------------------------------------------------GEKV-H-----LELPRRSLLILTGDARTHYTHAIPKEDMLF-AGNEC-------V-----------D-RGRRVSLTIRSI----------------------------LK----SAIP----------------------------|
      MVEG_06150_Mortierella_verticillata_NRRL_6337_672822524             -----------------------------------------------------------------------------------------------------------------------------------------------------------------------M-------A-T-I--PGLEVILDFITEEEEQQLITELD-----------------------------------AG------HWAGRGIEPN-PE---M-KRRH-Q----HYGGVFSYRLR----------RVV-GD-----ME--KLPGMFDF-I-TE-----------------------------------------------------------------------------RLLQ---------R----------RI-YD-RS----PN-------SIIVN-E-YEAGQG--IMPHVDAPKLFGKTITALSLLSACVMTFQHV-K----------------------------------------------------D-------------------------------------------------------------------------PSQIYH-----IHLPQRSLVVMNGSSRYDFKHSISKDLIEH-VDGLE-------I-----------V-RARRVSITYRDM----------------------------LVEDRQQDRESDEAGSS--CKE-LCGNGI-SS--CTRS/
      
      Back to Contents
    • General notes and phyletic distribution of the AlkB-H4 family that was recently shown to have a N6A demethylation function in C. elegans. Note this collection is from Genbank and the above is from a local database of eukaryotic sequences

      Some of the key differences/unique synapomorphies of the AlkBH4 group are E in strand-1 '+' in Str (-1), and HxD'D' motif in strand-2

      General notes

      # 1;
      25148697   CELE_F09F7.7            291   eukaryota>metazoa>nematoda                            Caenorhabditis elegans                    F09F7.7, isoform a [Caenorhabditis elegans].
      24583140   Dmel_CG4036             304   eukaryota>metazoa>hexapoda                            Drosophila melanogaster                   CG4036, isoform A [Drosophila melanogaster].
      585709738  LOC100367945            271   eukaryota>metazoa>hemichordata                        Saccoglossus kowalevskii                  PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Saccoglossus kowalevskii].
      156402493  NEMVEDRAFT_v1g34785     267   eukaryota>metazoa>cnidaria                            Nematostella vectensis                    predicted protein, partial [Nematostella vectensis].
      688557483  alkbh4                  321   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                               PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 isoform X1 [Danio rerio].
      823470605  ALKBH4                  402   eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                       PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Taeniopygia guttata].
      311251081  LOC100520070            302   eukaryota>metazoa>chordata>vertebrata                 Sus scrofa                                PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Sus scrofa].
      68372246   alkbh4                  315   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                               PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 isoform X2 [Danio rerio].
      612002272  ALKBH4                  304   eukaryota>metazoa>chordata>vertebrata                 Monodelphis domestica                     PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Monodelphis domestica].
      8923019    ALKBH4                  302   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                              alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Homo sapiens].
      386783769                          287   eukaryota>metazoa                                     Schmidtea mediterranea                    alpha ketoglutarate dependent dioxygenase ABH4 [Schmidtea mediterranea].
      407409697  MOQ_003722              304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi marinkellei             hypothetical protein MOQ_003722 [Trypanosoma cruzi marinkellei].
      407849132  TCSYLVIO_004974         304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi                         hypothetical protein TCSYLVIO_004974 [Trypanosoma cruzi].
      731709183  LPMP_352050             297   eukaryota>euglenozoa>kinetoplastida                   Leishmania panamensis                     alpha-ketoglutarate-dependent dioxygenase AlkB-like, putative [Leishmania panamensis].
      154345804  LBRM_35_2190            297   eukaryota>euglenozoa>kinetoplastida                   Leishmania braziliensis MHOM/BR/75/M2904  conserved hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904].
      342184304  TCIL3000_10_5510        306   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma congolense IL3000             unnamed protein product [Trypanosoma congolense IL3000].
      340057250  TVY486_1006420          299   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma vivax Y486                    conserved hypothetical protein [Trypanosoma vivax Y486].
      157876860  LMJF_36_1970            297   eukaryota>euglenozoa>kinetoplastida                   Leishmania major strain Friedlin          conserved hypothetical protein [Leishmania major strain Friedlin].
      71747664   Tb10.70.0360            304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei brucei TREU927         hypothetical protein [Trypanosoma brucei brucei TREU927].
      146104297  LINJ_36_2080            297   eukaryota>euglenozoa>kinetoplastida                   Leishmania infantum JPCM5                 conserved hypothetical protein [Leishmania infantum JPCM5].
      788018057  TbgDal_X7900            304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma brucei gambiense DAL972       hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972].
      588321322  GSEM1_T00000659001      297   eukaryota>euglenozoa>kinetoplastida                   Phytomonas sp. isolate EM1                unnamed protein product [Phytomonas sp. isolate EM1].
      401420114  LMXM_36_1970            297   eukaryota>euglenozoa>kinetoplastida                   Leishmania mexicana MHOM/GT/2001/U1103    conserved hypothetical protein [Leishmania mexicana MHOM/GT/2001/U1103].
      528254392  STCU_00887              304   eukaryota>euglenozoa>kinetoplastida                   Strigomonas culicis                       alkylated DNA repair protein alkB like protein 4 [Strigomonas culicis].
      686635833  DQ04_02651050           296   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma grayi                         alkylated DNA repair protein alkB like protein 4 [Trypanosoma grayi].
      71659461   Tc00.1047053510187.490  304   eukaryota>euglenozoa>kinetoplastida                   Trypanosoma cruzi strain CL Brener        hypothetical protein [Trypanosoma cruzi strain CL Brener].
      528266291  AGDE_03849              297   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                          alkylated DNA repair protein alkB like protein 4 [Angomonas deanei].
      528238215  AGDE_09971              297   eukaryota>euglenozoa>kinetoplastida                   Angomonas deanei                          alkylated DNA repair protein alkB like protein 4 [Angomonas deanei].
      403359506  OXYTRI_23312            335   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                       hypothetical protein OXYTRI_23312 (macronuclear) [Oxytricha trifallax].
      403343479  OXYTRI_08063            342   eukaryota>alveolata>ciliophora                        Oxytricha trifallax                       hypothetical protein OXYTRI_08063 (macronuclear) [Oxytricha trifallax].
      294944511  Pmar_PMAR003551         252   eukaryota>alveolata                                   Perkinsus marinus ATCC 50983              conserved hypothetical protein [Perkinsus marinus ATCC 50983].
      401412938  NCLIV_063160            951   eukaryota>alveolata>apicomplexa                       Neospora caninum Liverpool                conserved hypothetical protein [Neospora caninum Liverpool].
      672578582  TGMAS_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii MAS                     hypothetical protein TGMAS_246140 [Toxoplasma gondii MAS].
      523576915  TGGT1_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii GT1                     hypothetical protein TGGT1_246140 [Toxoplasma gondii GT1].
      672285053  TGFOU_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii FOU                     hypothetical protein TGFOU_246140 [Toxoplasma gondii FOU].
      672573839  TGVAND_246140           1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii VAND                    hypothetical protein TGVAND_246140 [Toxoplasma gondii VAND].
      672301308  TGRUB_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii RUB                     hypothetical protein TGRUB_246140 [Toxoplasma gondii RUB].
      675123610  HHA_246140              803   eukaryota>alveolata>apicomplexa                       Hammondia hammondi                        hypothetical protein HHA_246140 [Hammondia hammondi].
      557738285  TGVEG_246140            1033  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii VEG                     hypothetical protein TGVEG_246140 [Toxoplasma gondii VEG].
      237835449  TGME49_046140           1033  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii ME49                    hypothetical protein TGME49_046140 [Toxoplasma gondii ME49].
      672276401  TGP89_246140            1030  eukaryota>alveolata>apicomplexa                       Toxoplasma gondii p89                     hypothetical protein TGP89_246140 [Toxoplasma gondii p89].
      514683301  PTSG_09879              270   eukaryota>choanoflagellida                            Salpingoeca rosetta                       hypothetical protein PTSG_09879 [Salpingoeca rosetta].
      167524358  MONBRDRAFT_26078        207   eukaryota>choanoflagellida                            Monosiga brevicollis MX1                  hypothetical protein [Monosiga brevicollis MX1].
      693500295  OT_ostta04g01970        351   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                        Alpha-ketoglutarate-dependent dioxygenase AlkB-like [Ostreococcus tauri].
      308802830  Ot04g02100              245   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                        unnamed protein product [Ostreococcus tauri].
      612393879  Bathy06g04110           317   eukaryota>viridiplantae>chlorophyta                   Bathycoccus prasinos                      predicted protein [Bathycoccus prasinos].
      303282765  MICPUCDRAFT_60114       377   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545               predicted protein [Micromonas pusilla CCMP1545].
      145345998  OSTLU_86981             347   eukaryota>viridiplantae>chlorophyta                   Ostreococcus lucimarinus CCE9901          predicted protein [Ostreococcus lucimarinus CCE9901].
      255085018  MICPUN_62550            353   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                     predicted protein [Micromonas sp. RCC299].
      159473956  CHLREDRAFT_95290        170   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii                 predicted protein [Chlamydomonas reinhardtii].
      302847355  VOLCADRAFT_119009       753   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis             hypothetical protein VOLCADRAFT_119009 [Volvox carteri f. nagariensis].
      308802828  Ot04g02090              147   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                        LOC553475 protein (ISS) [Ostreococcus tauri].
      302757669  SELMODRAFT_403903       268   eukaryota>viridiplantae                               Selaginella moellendorffii                hypothetical protein SELMODRAFT_403903 [Selaginella moellendorffii].
      302763503  SELMODRAFT_406367       272   eukaryota>viridiplantae                               Selaginella moellendorffii                hypothetical protein SELMODRAFT_406367 [Selaginella moellendorffii].
      676394053  AURANDRAFT_66792        2180  eukaryota>stramenopiles                               Aureococcus anophagefferens               hypothetical protein AURANDRAFT_66792 [Aureococcus anophagefferens].
      551571747  EMIHUDRAFT_458739       375   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516                hypothetical protein EMIHUDRAFT_458739 [Emiliania huxleyi CCMP1516].
      551671246  GUITHDRAFT_150889       193   eukaryota>cryptophyta                                 Guillardia theta CCMP2712                 hypothetical protein GUITHDRAFT_150889 [Guillardia theta CCMP2712].
      
      Back to Contents

    • Multiple sequence alignment of the RAMA domain

      RES                                                                     RQPSLARASVA------------RASVD-SLTSLGAMMA----------------------------HRLI-EAAPAALS---VEAASG--------EWLLADLCD-DGTILHLPADG--------GVPQRFRSPGGFVRFLRPWLSAAS-------VEGGSWTAVHYK-----------------GL----------------PLDLIRKAATSDADALGAV---A
      FINAL                                                                   ------------------------------HHHHHHHHH-------------------------------------EEEE---EEE------------EEEEEE-----EEEEE---------------EEE--HHHHHHHHHHH-----------------EEEEEE------------------------------------HHHHHHHHHHHHH----------
      ALIGN                                                                   ------------------------------HHHHHHHHH-------------------------------------EEEE---E---------------EEEEEE----EEEEE---------------------HHHHHHHH--------------------EEEEE-------------------------------------HHHHHHHHHHH-----------
      HMM                                                                     ------------------------------HHHHHHHHH------------------------------EE-E---EEEE---EE-------------EEEEEEE----EEEEE---------------E----HHHHHHHHHH---------------EEEEEEEEE------------------------------------HHHHHHHHHHHH-----------
      FREQ                                                                    ------------------------------HHEEEEHHH--------------------------------------HHH---HHHH----------------------EEEEEE-E---------E--EEE---HHHHHHHHHH-----------------EEEEE---------------------------------------HHHHHHHHHHH----------
      PSSM                                                                    --------------------------------EHHHHH--------------------------------------EEEE---EE-------------EEEEEE-----EEEE---------------------HHHHHHHHHHH------------------EEEEE-------------------------------------HHHHHHHHHHHH----------
      RES                                                                     KPNPND-IELL------SLE--IKPPK---VPMKTLIE-----------------------------ADFL-RVGQTLFD-------KN--------ENAICIVTQ-DGNVKDN------------E---ETLSIHKMSAKYLNK------------TNNNGWDYFYLFR----------------NNN-------------FITLDSLRYEYTNQ
      FINAL                                                                   -----H-HHHH---------------------HHHHHH-----------------------------H--------EEE-------------------EEEEEEEE---EEEE---------------------HHHHHHHHH--------------------EEEEEE-----------------------------------HHHHHHHHHHH-
      ALIGN                                                                   ------------------------------HHHHHHHH-----------------------------H---------EE--------------------EEEEEE-----EEE----------------------HHHHHHHH--------------------EEEEE-----------------------------------HHHHHHHHH----
      HMM                                                                     ------------------------------HHHHHHHH-----------------------------HHH--H---EEEE------------------EEEEEEE----EEEE---------------------HHHHHHHHH-------------------EEEEEEE-------------------H-------------HHHHHHHHHH----
      FREQ                                                                    ----HH-HHHH------H----------------EEEE-------------------------------------HEEE----------------------EEEH-----EEE--------------------EEEEEE-HH--------------------EEEEEE-------------------------------------HHHHHHHHHH-
      PSSM                                                                    ---------------------------------HHHHH-----------------------------HH-------EE---------------------EEEEEEE---EEE----------------------HHHHHHHHH--------------------EEEEE-------------------------------------HHHHHHHH---
      MPND_Homo_sapiens_664805955                                             AGGCGGPGGAL--------------TRR-AVTLRVLLK-----------------------------DALL-EPGAGVLSI----YYLG--------KKFLGDLQP-DGRIMWQET----------G--QTFNSPSAWATHCKKLVNPAK-------KSGCGWASVKYK-----------------GQ----------------KLDKYKATWLRLHQLHT------------------------------------------
      LOC101862270_Aplysia_californica_524883584                              EGALHKAKQAI-------------TCR--SVTLNTLME-----------------------------DGVI-HPGKGVLSI----DYLG--------QRFEADLLE-SGKIKWQ------------GSEEEFGTPSAWALHCKRLVNPIK-------RSGCGWASVKYN-----------------NK----------------KLDIWKSIWARKHRVSSPF---KASSDSSNSSPTKPPQPSTSPAMLASPTLSIDGEKCS
      Dmel_CG4751_Drosophila_melanogaster_19921138                            DDETKENYEGF-----------NGTGR--TVTLQTLMA-----------------------------ANVL-QPGLGLMTI----EYLG--------QKFVGDLLA-DGKIKSHET----------E--TIFLTPSAWAMHCKRIINPDK-------KSGCGWASVKYK-----------------GK----------------KLDAYKNTYLRKCALQK------------------------------------------
      DAPPUDRAFT_23970_Daphnia_pulex_321479075                                --------GGF-------------TGR--GVTLQMLLE-----------------------------DNIL-QPKDGAMSL----EYMG--------QKFNGDLLA-DGKIFSAEV----------Q--EVFSSPSAWALRCKKIVNPEQ-------KYGCGWSSVRYC-----------------GR----------------PLDVYKNQWMRKRRLEQ------------------------------------------
      NEMVEDRAFT_v1g22327_Nematostella_vectensis_156394960                    QVRSRKPRSFL-------------TGR--GVTLAMLME-----------------------------DGIM-QPGEKLLSI----DYLG--------QKFQADLLP-DGKIKWPEA----------N--KVFNSPSAWAIYCKKLVNPSK-------KSGCGWASVKYK-----------------GR----------------KLDQYKSTWFRKQRAQT------------------------------------------
      BRAFLDRAFT_221394_Branchiostoma_floridae_260806149                      ----------L-------------TGR--GVTMAMLLS-----------------------------DGIL-EPGDACLSI----DYLG--------QKFVGDLLP-NGKIKWQ------------GSNRVFNSPSAWAIHCKKMVNPSK-------KSGCGWASVKYK-----------------GK----------------KLDIYKTIWFRKQRGVN------------------------------------------
      LOC100209039_Hydra_vulgaris_449665213                                   KKKVLSKSAML-------------TGR--GVTTQMLIE-----------------------------ENIL-EAGENNLTI----NYLG--------NKFVGDLKE-DGTINCK------------NANRIFSSPSAWAMHCKKQVNPDK-------KSGCGWASVKYK-----------------GR----------------KLDEFKSTWFRKQKIEQ------------------------------------------
      LOC105319737_Crassostrea_gigas_762155186                                DKQGSLAKAAI-------------TGR--SVTLQMLIS-----------------------------EKII-EPGPALLSL----KYLG--------RRFIADLLP-DGKIQMP------------GTKETFTSPSAWAIHCKKQVNPHK-------KSGCGWASVMYK-----------------ER----------------KLDIWKAAWFRKHRSSS------------------------------------------
      CAPTEDRAFT_110378_Capitella_teleta_443694149                            KPFEQRAHHAI-------------TGR--GVTLAMLMA-----------------------------DGFI-QPGNDTMSI----DYLG--------QNFRADLLD-GGRIREN------------G--KVFGTPSAWAIHCKNIVNPGK-------KSGCGWASVKYK-----------------GK----------------KLDAYKLSWLSKHRPLAIA---AVSLTVSNYSYFPYINQCYDIFLEDIKP---------
      TRIADDRAFT_54406_Trichoplax_adhaerens_196002159                         TKNSNKLPYNA-------------AGR--GVSIAVLIK-----------------------------DGIL-KPRKKCLSL----EYLK--------KTFYGDLLP-NGKIQSS------------TTEDIFNSPSAWAIHCKRLVNPAK-------KSGCGWASVKYN-----------------GV----------------KLDEFKATWYKKKKLSC------------------------------------------
      LOC587071_Strongylocentrotus_purpuratus_390342613                       TTSPPKKEKVL-------------TGR--GVTLSMLMG-----------------------------DGVV-EAGKDCLSI----EYLG--------SKFTADLMT-DGRIFWS------------KEKQIFNSPSAWAIHIKSILNPGK-------RSGCGWASVKYN-----------------GK----------------KLDVVKSQWFRNMKIPY------------------------------------------
      mpnd_Maylandia_zebra_498992415                                          LRSSSGRGSLL--------------TRR-GITLRVLLK-----------------------------DGLV-EPGDGVLAI----HYLG--------KNFVGDLLT-DGKIRWVET----------G--QIFNSPSAWATHCKRLVNPAK-------KSGCGWASVRYR-----------------GQ----------------KLVQYKTTWLHKYQPSA------------------------------------------
      LOC100902139_Metaseiulus_occidentalis_391344637                         GERLEEREPRK-------------------LTLETLLR-----------------------------EGVL-ESGDGVLSM----EYMG--------MRFTGDLLS-DGAIRWGES----------G--EVFPSPSAWAIHCKRITQPDK-------KTSCGWSQVKYK-----------------GR----------------KLELYKQEWLHRHQTAK------------------------------------------
      LOC100641685_Amphimedon_queenslandica_761911769                         --------MAV-------------AGR--SISMFSLIT-----------------------------DNIL-EPGDEVLTF----DYLG--------KRYTADLLP-EGTIRGN------------G--QIYASPYAWASYCKNEINPDQ-------KTAIGWGHIRYR-----------------GI----------------KLSQYKNLYLKKHKLCS------------------------------------------
      LOC100178477_Ciona_intestinalis_198425307                               SLDSNSSRKDI---SPRSSR--GGTSR--SVTLQTLVQ-----------------------------EGVL-EPGNGVLSI----DYLS--------HKYLGDLLP-NGKILWD------------N--VQFPSPSTWATHIKKKINPSK-------KSGCGWNSVKYK-----------------GK----------------KLDKLKANWFRKNAGVV------------------------------------------
      ANKRD31_Gallus_gallus_513229710                                         ----NYGYKECEQKQRKHAR----KNKK-KLQLIDLLE-----------------------------LGRI-KPGENVLEF----TLKE--------FTCKATLLT-NGKIKTSK-----------N--KIFQNPVQWVKDLLGSDIYV--------TWKYAWNKVVYR-----------------GT----------------QLSKLVVENAPVSNDLEIP---S------------------------------------
      ANKRD31_Homo_sapiens_256574792                                          ----GSGQQDTIKKALNYST---APKKK-CIQIKDLIL-----------------------------LGRI-NPGNNILEF----KTQE--------TTHKASILL-NGKLKVES-----------G--QIYKNPVTWLKDLLGGNSYV--------TWNYAWSKVTYL-----------------GK----------------ELLRYVSEDAPILPEPNSV---P------------------------------------
      LOC101480116_Maylandia_zebra_499004945                                  GIALEHFSIMIRRKNVLIQN---RAVDN-SRRLSVLIQ-----------------------------RGII-SPGSA-LQL----LLKG--------HCHFANVLA-DGSILSK------------G--KVHLAPECWLKSILGKNIPV--------SSAYAWDKVTFR-----------------GR----------------SLSFYLLNMEGDENTPQRC---L------------------------------------
      GSONMT00016346001_Oncorhynchus_mykiss_642092905                         ---HGNTPDMGSNVLQQGTA---SDGEE-NRKLIRLIK-----------------------------RGVI-TPGEDVLQL----MWRG--------CVHQASLLL-EGWIRDSVT----------G--REFQAPELWVAAILGNNIPV--------SSAYAWDKVNTT-----------------QQ----------------SS---------------------------------------------------------
      CHLREDRAFT_187079_Chlamydomonas_reinhardtii_159476834                   ---GAVSTRALPSPPRNSRG---VSAAA-AGSSARALQ-----------------------------ASLL--VGNQDV--------------------VDVVVHA-DGRIDCQ------------Y--GEFRSVSSLALKVLRQRNPNR-------MACDGWQEVKLN-----------------GV----------------RLDEMRQEAGRLLAREAAA---S------------------------------------
      CHLREDRAFT_150353_Chlamydomonas_reinhardtii_159478713                   DSPKSAPARSGGVVGKDGKR---VYNRT-VPTFGDIIS-----------------------------HGLF-PPGPCRWTV----GTIK--------EEVSVEVRP-SGEILYC------------G--NAYPSISAFALVVLRSRNSER-------IACDGWREVRHN-----------------GI----------------KMEVLRKECLRMMLENG------------------------------------------
      MONBRDRAFT_26083_Monosiga_brevicollis_MX1_167524112                     --------AMSAPGTDRSKL---QYNPE-SVLLRCLVE-----------------------------CRLV-TPGQRQLRL----DHGP--------FQYAADLSP-DGLVTCA------------N--RVFASLTDFVAHCQREVATELGP-----AQLDPWASVRHL-----------------GT----------------PLAELVTNCPPTARASPCL---N------------------------------------
      COCSUDRAFT_48713_Coccomyxa_subellipsoidea_C-169_545357568               ------------------------MTDRRAFTLQPLVE-----------------------------AGVF-EPGENVLSC----SVGG--------VEYFADLGP-AGEIIFE------------G--QFFKSPSAFSVFVKRKVNPSR-------KADDGWTSCKYR-----------------GE----------------LLSIYRPQ-LENLLGGDGR---SG-----------------------------------
      CHLREDRAFT_188109_Chlamydomonas_reinhardtii_159471497                   APRAPPKPPAAKGGTGAKPKAAPKRPAG-AITLRTLLD-----------------------------AGFL-VPGSKVLYV----EYKG--------LITWADLTE-EATIMCD------------G--QTFESPSAFSIFVKRKLNPER-------KADDGWKAVKYA-----------------GK----------------LLEHYKEQYLRQQLAAS------------------------------------------
      COCSUDRAFT_46372_Coccomyxa_subellipsoidea_C-169_545371515               -LLERQRAAVAKKRGPGTGK---PRAGG-GITLKLLID-----------------------------EDIL-QPGDNILSV----EYKS--------SMTYASLEH-DGRISCFVQ----------GQHLTFESPSAFSIYLKRLVNPAR-------KADDGWKTVKCN-----------------GR----------------FLEQYKLELARRRFGKP------------------------------------------
      MICPUCDRAFT_60251_Micromonas_pusilla_CCMP1545_303282299                 KPKPKKERPPA------KAP---SGGAS-GVTLAHLID-----------------------------ANLI-APGVDVVST----LYNG--------VTELATLRD-DGAIAWD------------G--REFHSVSAFSLAVKRRTNPDR-------KADDGWKCVKCD-----------------GV----------------ALDAVKRRYEATVAAGG------------------------------------------
      MPVG_00227_Micromonas_pusilla_virus_12T_472342784                       ---------------MAPSP----------VSLKTLID-----------------------------AGLL-TPGHNVLQI----KINR------KKVTQCASLTN-EGVIVFK------------N--KQYHHPSEWSLYVKNIYNPTL-------TSDRGWTSILYQ-----------------GS----------------TLNVIRSSYISRRD---------------------------------------------
      OSTLU_32332_Ostreococcus_lucimarinus_CCE9901_145347713                  -RKNRAAGEPLVQLGERGKA---GIGKI-SCSLVDLIQ-----------------------------SGLL-KSGAEKMFI----VYQD--------NVWKGDLGE-DGVITFQ------------G--QRFTSPSAWAIFAKRLTNPTK-------KADDGWKSARYGDP--------------DGP----------------TLDQVKGEYARINQLKLA-----------------------------------------
      MICPUCDRAFT_47146_Micromonas_pusilla_CCMP1545_303277185                 --------GTVGAPKRPARSG--VPGRS-TVSLKMLVD-----------------------------DGKC-NPGHDALWI----TYQN--------QTWSGELSA-DGIITFQ------------G--KTFNSPSAWAIYVKRLANPGK-------KADDGWKSVRYGHE--------------EGP----------------MLDDLKHEYLRGESGAGL-----------------------------------------
      Bathy04g03050_Bathycoccus_prasinos_612396523                            NKKKENPNNLMTTKERKLHE---RGPYD-GITLFTLIQ-----------------------------LKLL-KVGANNLLA--SYEYGG------EKVQVVGSLTE-NGHIYVQ------------KEDEMFRNPRSMSIDVKRRLNPEI-------KFTEGWGHVRYVNTGSWHLDGKNDEK--NGQ----------------TLKEIRERIPIKDVPSLV-----------------------------------------
      PHYSODRAFT_347238_Phytophthora_sojae_695436758                          SPRPRPRPQLVRPPPRGLATSF-IMGDA-RIVLTTLID-----------------------------EKLL-SPGPKKLYV----SYYR--------KRVYADLLP-DGSIRFK------------D--QVYTSPVPCALHMKRTLNPSL-------KTDAGWSSMYSAA---------------SGE----------------SLKDIKDRLNIRKRGTNA-----------------------------------------
      CAOG_001362_Capsaspora_owczarzaki_ATCC_30864_765549348                  --QAAPPKPRA---PQRLE----RKDRI-QITLSDLID-----------------------------AGLL-KPGTVLSS--------G---------SAQCLLQA-DSSVVSQPE----------G--KPYASAQAWLATVYTKE-----------QRPSMWSRVSAK-----------------GM----------------VLNIYREMYIKRAESA-------------------------------------------
      GUITHDRAFT_132396_Guillardia_theta_CCMP2712_551676387                   CEDKTWNPEDL-KKNAEDL----ARDPS-SVTLKQLLD-----------------------------AGFL-SLDAELYWQ-KKHPEKG------LLGPVFGKITL-DGQIEFE------------G--QKHKTPAAFASAACKSLTGRSLK-----RKHDGWSSVRYR-----------------NR----------------TLEYYKSQYIDRNSRKN------------------------------------------
      DI09_127p30_Mitosporidium_daphniae_692170605                            -------SDKI---KAANLS---AGSKSETFSLMDLIS-----------------------------SGTV-PIGSVAYCF-----------------DCTAKITS-SGSLIDEID----------L--QEYFDPSDWATYVVRCVENRRL------SRQDGLKAIKVG-----------------GK----------------TLEELYSSVFVQQSTTRND---P------------------------------------
      MVEG_11466_Mortierella_verticillata_NRRL_6337_672817350                 -----ASSATIGLAVNNTVK---LAHQL-RVTLHNLVS-----------------------------TGYL-PAETRVIFR-----------------DHSAIVTA-KGTLIPIYSELNCATHCPWLQ-GEYETPSAWATAVVKGARTGK-------VAVNGWSAIKVNVHQNPALVKMFSGQGLPEV----------------SLDVLRKRYLMDMVDDG------------------------------------------
      Gasu_54090_Galdieria_sulphuraria_545702290                              ---DKMEKSSV-----SDISSP-RSKEA-QQKLLELTK-----------------------------HGLL-KVGEKVQFY-----YKA--------KEFAAQVTS-EGCLLYQGEN---------GESELFLSPSAFVNTLAKRQGPSSRGKSKPKLNLNGWEFCFVA-----------------GV----------------SLSQLKQQLESILQAEGT-----------------------------------------
      DDB_G0293300_Dictyostelium_discoideum_AX4_66800521                      GKIKRKRGIVI------DNR---RKKVP-DITIDDLMK-----------------------------KDLL-RIGDTLCYC-----IGG--------VNHFALLLR-DGYIEYD------------S--LRLPSVHAFVIHVLSNLEKNKKFR-----WFSPWDSVSVR-----------------GK----------------SLNFIRSIFQSKFYKGG------------------------------------------
      Sthe_2269_Sphaerobacter_thermophilus_DSM_20745_269787547                ASVGALWHIPA-SSPQNKGR---YKHSY-GVHLSDLVK-----------------------------SGVL-PAGTPVILV----GPRNK-------DLAHAEVSQ-DGHIIWG------------G--KRYRSLSDRAFCAAFS--PPR-------VSFNGWKHWYAVL--------------PRGRV---------------QLAELREEYLRAVADQK------------------------------------------
      DDB_G0272516_Dictyostelium_discoideum_AX4_66823477                      -----------------------MVNEH-NVTLSNLID-----------------------------FGLI-KPNQEVKYS-----YRG--------VSYTGVILL-NGEIHTN------------G--VSFTNPTHWTRTISG-------------NNCSGWGTVKLSG--------------ASGP----------------PLLKLKREYLFRVGSTGK-----------------------------------------
      CAPTEDRAFT_211270_Capitella_teleta_443691830                            QYVSPQPKNQV-------KQ---SKRVE-LPTMKTLLK-----------------------------RGII-ISGTKVLSV----HTQE--------GIKFASVDV-EGNIITPT-----------G--HKFVSPLRWAMCLKGVGH-MK--------RSTAYKMILYQ-----------------GA----------------TLYDLTQSSAGSNPLILTS---S------------------------------------
      LOC100181315_Ciona_intestinalis_459177714                               TKSPNTSQHPA----QADIQ---QDKLD-L-NLSYLLK-----------------------------NKII-QQGSNALKL----KSMG--------TEHVASLTS-NGSILTSE-----------F--QLYVTPVAWIKGVTGRSY-SK---------VNAFKMVTYL-----------------DE----------------PLYNISMRVAQKTQAAV------------------------------------------
      BRAFLDRAFT_71623_Branchiostoma_floridae_260823234                       HTQGLSTPALM------EGQ---TAGTE-VPTVSELLR-----------------------------AQLI-QPGKDVLSC----KGKA--------GLQFASLQP-DGAVMTQG-----------G--LSFSTVAQWHRAIWGHRT-GQ-------KRAMVFRQVCYK-----------------GT----------------PLADVSSKFTPAAKTITP-----------------------------------------
      LOTGIDRAFT_237598_Lottia_gigantea_676429688                             -------------------T---KKYKK-FPSLKRMMD-----------------------------RGLL-KPGKNKLSI----IRKG--------ERVTATLLN-SGMILDAT-----------G--PGFSTATKWFSAVTGTQL-TT--------KAKAYRMVCYE-----------------NT----------------PLMEFKKQYDKLGNVNLV-----------------------------------------
      LOC105318954_Crassostrea_gigas_762153126                                NMDIGCEQSQY-------EK---YGGLK-FPPLKSLID-----------------------------LHVL-YPAENVLAA----LYMG--------KVFTASLTM-MGNIEGKG-----------G--EVFNTPMKWLSAVKGGEV-VK--------KAQAYREIKYD-----------------GH----------------SLKSYVDGEQQTSIEKINV---L------------------------------------
      Tsp_04073_Trichinella_spiralis_339246505                                FSQRRGRRKSI-----RGKS---KAQLY-VRNIRQLIK-----------------------------EGIL-EAGVNILVY---PQDQD--------NVHYASLLP-NGRVQANVRL---------G--PLFSSIQRWVGFCLGHQNTSR-------TPLFELLRVRYR-----------------NV----------------SLFKINIILGVGLSRDEVV---E------------------------------------
      HELRODRAFT_165788_Helobdella_robusta_675890068                          KPKMAPNYKIV--------K---QQEIK-FRSIEQLIL-----------------------------LKVI-EPGANVLSI----QREK--------VNHLASILP-DGLILEDKS----------G--RVHNSLISWYRFITNSKQ-SK-------IAMNQLMEVKYD-----------------GK----------------PLALRLKEANYRRSEITNI----------------------------------------
      _Haemophilus_influenzae_127456                                          KPNPND-IELL------SLE--IKPPK---VPMKTLIE-----------------------------ADFL-RVGQTLFD-------KN--------ENAICIVTQ-DGNVKDN------------E---ETLSIHKMSAKYLNK------------TNNNGWDYFYLFR----------------NNN-------------FITLDSLRYEYTNQ-----------------------------------------------
      MOMA_RS00825_Moraxella_macacae_497185310                                VVNPND-IEWL------SLE--TKPPK---VAMKTLVA-----------------------------ANYL-NIGQALFD-------KN--------QNRICTVLA-DGKVTDS------------V---DTLSIHKMSAKYLNK------------TNHNGWDYFYVIK----------------DNK-------------LITLDSLRYDYASKMGK--------------------------------------------
      AAR27819.1_Staphylococcus_sp_L1_38906136                                NQVVID-DDYV----NAVFD--KKLIR---VPFKKLVE-----------------------------EGFI-DKNEYIYF-------NN--------TEEYAVISD-DKELLYN------------G----KHSIHSLAGILKGL------------ERANGWNYWYVKR----------------NNK-------------IYFYRSLS-----------------------------------------------------
      HMPREF1766_RS00325_Fusobacterium_nucleatum_696308956                    YQKNMI-TELL-------LE--VKPPK---VPLKKLVE-----------------------------KGYL-KENQVLYN-------SL--------GEAKVTVLS-NGDVFDG------------N---EKLSIHKMSAKILNK------------TNNNGWDYFYVMN---------------NGK--------------LIPLNDLRYQYDKEVNNEK------------------------------------------
      CCAN_RS08180_Capnocytophaga_canimorsus_503763750                        NVVPEI-DLFS----QLELE--VKPPR---ISMKELIT-----------------------------KGFL-KIGQQLFS-------KD--------KKYSVTICQ-NGNVSDG------------E---EMLSIHKMSAKLLKR------------TNNNGWDYFWTDY---------------KGE--------------FISIDSLRYLANKQEKI--------------------------------------------
      DV59_RS09130_Helicobacter_pylori_446268888                              NTRDKS-DFIT----NLELE--TKPPK---IPMSLLIS-----------------------------KQLL-KIGDFLYS-------PN--------KEKICQVLE-NGQVRDN------------E--NYETSIHKMSAKYLNK------------TNHNGWKFFYAYY----------------QNQ-------------FLLLDELRYICQRDS----------------------------------------------
      F811_RS0110085_Brachyspira_innocens_518849236                           NTYVEM-TDLI----NLDYE--VKPPK---VPIKNLIE-----------------------------KGYL-KANQALYS-------KK--------GDEVCKLNG-NGNV-EN------------E--LGNFSIHQMSAKLQNL------------SKYNGWNYFYTYY----------------KDK-------------FISIDELRYIYIGDNHE--------------------------------------------
      G500_RS22855_Flexibacter_roseolus_737789152                             QVKPLS-KNVL----EYKID--RKKPR---IPFGNLVE-----------------------------KGYV-SIGETLYS-------KD--------KKLTAVVQA-NASIIAN------------G--TAVGSIHKVSSVLLNK------------ATNNGWTFWYVMR----------------ENE-------------LISIDELR-----------------------------------------------------
      _Thermoplasma_acidophilum_499204035                                     NTASYQ-QKLL----DYPLE--IRPKR---VPFGSLIE-----------------------------NGYV-KAGEYLYS-------PD--------GEARALVLA-NGTLSYE------------D---KYGSIHKISAMILNK------------PANNGWAFWYVKR---------------DGK--------------LVSINDLRQKLLKDQYANH------------------------------------------
      ANT_RS09810_Anaerolinea_thermophila_503325698                           QIEPYP-QQAL----ALPVR--SRKSR---LPFGRLVE-----------------------------QNLV-QPGQILFF------DRN--------PEIRAVVLS-DGHLSVN------------G---WKGSIHMTAEKICG-------------HPTNGWERWFFLD--------------EQGI--------------FQPISILRQKYLSNVSIEN------------------------------------------
      HAUR_RS15710_Herpetosiphon_aurantiacus_501142320                        -ESPSS-TDAL---QALPSN-KRRIPR---IPFGNLLE-----------------------------HGLL-QAGQQLWF------NRD--------PNLVATLLA-DASLRMS------------DG--TRGSIHKLGTILTGQ------------PSCNGWEHWFFQA--------------SDGT--------------LTSIDVLRQEVRRLREQTP------------------------------------------
      RCAS_RS10325_Roseiflexus_castenholzii_501069455                         -TPVSVCDDAM----LATRS-KRDMPR---VGFGQLVE-----------------------------AQYL-RVGQNLYS-------SD--------RNVVAIVRA-DSQLQWG------------N---ITSSIHRIAALAQHK------------PAFNGWEYWHYED--------------QAGR--------------LVSIDSLREQYRFDQGVAD------------------------------------------
      CCALI_RS04915_Chthonomonas_calidirosea_512724354                        -TPFCGVDERE---LLITPS-KRAAPR---VAFGQLVE-----------------------------AGYL-KVGTVLYS-------RD--------RRIVAYVKA-DSLLRWD------------S---KEGSIHQIAALAEGK------------PACNGWEYWYYED--------------EDGQ--------------LISIDVLRARYRAENGLE-------------------------------------------
      OSCT_3182_Oscillochloris_trichoides_DG-6_308225152                      ---AIS-ATCLEHGELLTRS-KRNAPR---ISFGQLLE-----------------------------AQYI-SVGQPIFS-------QD--------RAVTAIVKA-DAQLICN------------D---QTGSIHKIAASVQNR------------AAANGWEYWYYED--------------AAGN--------------LVSIDELRERYRHENHVN-------------------------------------------
      K355_RS0107980_Thalassospira_lucentensis_550983501                      KVQMLD-GDSL----EVTES-KRSLPR---IPFGAVIE-----------------------------RGLL-SPGEKIYD--------NR-------GNVAAMVRA-DGSISHK------------D---NAGSIHQIGARVQGA------------EACNGWTYWHYKC---------------DGR--------------LVSIDNLRSQLRKEMGQVP------------------------------------------
      MGMSR_RS10205_Magnetospirillum_gryphiswaldense_568205957                KVRPVS-ELSL----LSTPS-KKSEPR---VPFGTVVE-----------------------------RGLL-EVGTVLYG-------NGK-------DSLTAKVRA-DGTLISA------------D---HRGSIHKVGALVQNA------------PACNGWTFWHLKQ----------------GDE-------------LVPIDVLRQKIRAELH---------------------------------------------
      L902_RS0138340_Agrobacterium_tumefaciens_665867244                      AVEPLG-KAEL----TVMTG-KKAEPR---VAFNTLVE-----------------------------SGLV-RPGQVLTD--------AK-------RRYSAIIRA-DGTIASG------------G---TAGSIHRLGAKVQGL------------DACNGWTFWHFED----------------GDA-------------LKPIDDLRTIIRSELAKAE------------------------------------------
      EX02_RS05975_Rhizobium_leguminosarum_739196468                          AVEPLG-KAEL----TVMTG-KKQEVR---VAFNVLVE-----------------------------SGLI-KPGQVLTD--------AR-------RRHSAIVRA-DGTVASG------------G---EAGSIHRLGAKVQGL------------DACNGWTFWHFDD----------------GKS-------------LRPIDDLRSVIRSDLAKAE------------------------------------------
      _Brucella_melitensis_bv_1_str_16M_81851486                              AVEPLG-KAEL----TVMTG-KRAEPR---VAFTSVME-----------------------------AGLL-RPGTVLCD--------ER-------RRFAAIVRA-DGTLTAN------------G---EAGSIHRIGARVQGF------------DACNGWTFWHFEE---------------NGV--------------LKPIDALRKIIREQMAAAG------------------------------------------
      BIND_RS03830_Beijerinckia_indica_501352122                              AIDPLP-SEAI----ASFPN-KRTEPR---IPFMTLIE-----------------------------SGLL-AAGETLTD--------EK-------GRHEAVVRA-DGTLAVG------------P---IIGSIHKIGALVQGL------------PACNGWTFWHFQR---------------DGQ--------------KHPLDRLRIQLRESGKEPV------------------------------------------
      BJ6T_73150_Bradyrhizobium_japonicum_USDA_6_354959883                    AVEPLP-EESL----APFMT-AREAPR---VAFSELIE-----------------------------RGMI-MPGTKLFD--------AK-------KKLGALVRA-DGAIMLG------------D---KVGSIHRIGAVAQGA------------QACNGWTFWHVET--------------KKG---------------LKLIDELRAEIRAGMGAE-------------------------------------------
      QH73_RS47605_Scytonema_millei_748143394                                 SISPAT-REVL----AVTQS-KRAEPR---IPFGNLVE-----------------------------RGLV-KPGDTLYC--------PR-------GERTARVRA-DGTLISG------------R---STGSIHKVGAEIQKA------------PSCNGWTFWHVRV---------------RGG--------------FQPLDTLRAQIRETMAV--------------------------------------------
      YY1_RS0113660_Mastigocoleus_testarum_654345013                          AIEPIE-GAVL----ETEKS-KKSLPR---VPFGALLE-----------------------------SGWL-KPGDRLFS--------PQ-------RRYQARIRV-DGSLTTG------------S---VSGSIHRLGAHVQQA------------PACNGWTYWHYED-------------PKRN---------------LAPIDLLRRRYREEMGLN-------------------------------------------
      HPO_RS06710_Hyphomonas_polymorpha_737625856                             AITPLE-GEVL----ETERS-KKSLAR---VPFGALIE-----------------------------TGWL-KPGDRLFS--------PQ-------RRHQARIRV-DGSLTTG------------A---ITGSIHRLGAQVQQA------------PACNGWTYWHYET-------------EKRD---------------LAPIDLLRRRYREEMGLA-------------------------------------------
      HMPREF1019_RS05155_Campylobacter_sp_10_1_50_496651971                   DIVFED-SDIA----HAKFD-KK-PLK---VNLDQMID-----------------------------ANFL-NLGERFYL--------KN-------SDEFAILKR-GSRLEYN------------N---ILYDIHSLAAKLKSAKS----------ERLNGFKFWHVMR---------------DNK--------------KILLDDIRSHFREINA---------------------------------------------
      EMIHUDRAFT_114853_Emiliania_huxleyi_CCMP1516_551589812                  RQPSLARASVA------------RASVD-SLTSLGAMMA----------------------------HRLI-EAAPAALS---VEAASG--------EWLLADLCD-DGTILHLPADG--------GVPQRFRSPGGFVRFLRPWLSAAS-------VEGGSWTAVHYK-----------------GL----------------PLDLIRKAATSDADALGAV---A------------------------------------
      SFUL_6650_Streptomyces_fulvissimus_DSM_40593_485098347                  -EPQTATADGH------------RVDL---R-TLVAALP----------------------------PAAF-SAGGIALT------GRGKR------GPAKATLLE-DGRIMCF------------R--QPMNTPTSAARMAAGD------------DTVDGWAFWSLT---------------VDGKAR--------------TLADLRDTLIAQRG---------------------------------------------
      G407_RS0122780_Salinarimonas_rosea_655991010                            DAPAEAVGAAP-----------RRKRR---LMRTTEMIE----------------------------RGLL-RAGMRLTI-------KGR-------PGSEAVVV--DGRHVEF-------------------GGERMSFNDWG-------------CRVTGWTAIQIYEWAEM----------PDGR----------------LLSALREA---------------------------------------------------
      Haur_5215_Herpetosiphon_aurantiacus_DSM_785_159894763                   HAVVGEATEQP-----------S-EGR---KPRFNDMVQ----------------------------AKKV-VPGDQLYT-----KKY---------PQRRATVV--DGETVEY-------------------DGVRYPINVWG-------------EKVTGWSSINIYDSVILE---------RTGK----------------PLRSLREEG--------------------------------------------------
      M657_RS0116450_Bacillus_megaterium_651957888                            SSIKRDNSNRI-----------SKKRY---LPRMKELFE----------------------------WGIL-SPNQKVSI-------KNQ-------DNSDATVVD-EKTVSFN--------------------ENIMSYNQWG-------------KVVTGWSAMSVYEWIIPE---------GQTK----------------TLHELRLERLDNRQWD-------------------------------------------
      _Methanococcus_maripaludis_500693945                                    ----KYVSKKT-----------KDITRR-SLPKISDMLE----------------------------WGVV-KPGDVIKA------KD---------HNAEAILLK-NGNVSVIPTEMSIGSADVRKIPMQTAVATEMSMQTWL-------------KSVYGWSSIQTYNFAVLK---------ETGE----------------TLSKIREKYMEKLSSENT-----------------------------------------
      HMPREF1040_RS01335_Megasphaera_sp_UPII_135-E_494634130                  HSNLHLERTIK-----------KI-GK---VEADGSQTS----------------------------EGFVVFKGSHISL-----ADDNTIPA-VIKERRKNALIDEQGVLQED---------------MLFTSPSYAAMFVIG-------------KSANGLTSWKTAD----------------GK----------------TLKSLESSHDTEQTDET------------------------------------------
      PTSG_07559_Salpingoeca_rosetta_514687557                                EEEAQPSYERE-----------TE-TF---CSAVRELHA----------------------------NGML-QTGDVCEF-----K------------GVVAELDG-DGDLVLA----------D-G--QVFPTPGMMACIMQWSG-----------FSSNGWSCVKFYSSSPQLMDDGPTSSTKRGKRARKSSAAAKPAYSMLTIAQLRKQLLKKEQERRQL---KREE---------------------------------
      JONANDRAFT_RS02920_Jonquetella_anthropi_495798143                       WNKVPDGQYFI-----SGSK--KNFGK---IKATMHVEN----------------------------GTFIVERGSVCVP-----FADG---------KVQPGFIG-NARIEE-------NILQE-D--VPCSSPSAAGALVLG-------------RSTNGWTAWKDSS----------------GR----------------LIDVYRMPEPDEKEVAQKG---H------------------------------------
      MBVG_RS02290_Mycoplasma_bovigenitalium_490555808                        SKCIPNGEYFL-----ERNV--KGFGR---VEGRARVHD----------------------------GVFTLLKGSYCAD-----YND----------KYPSQLRK-NAKFKN-------NFLQE-D--IICKSPSAASLIVVG-------------KSTNGWDWWKNID----------------GE----------------SIDIYRK-----KEISD------------------------------------------
      B437_00700_Fusobacterium_hwasookii_ChDC_F128_402258132                  --DMEKITFVL-------------KGR---VTSGTGRLLSN--------------------------EKFEILKGTSIVL----EVKSENPTTFKRNKNLIDDLLR-KNLIEKSG---DKYIFKE-N--YIATSPSAAAILVLG-------------RSANGWSEWKTYE----------------GK----------------LLSEYRR----------------------------------------------------
      CLO_RS14710_Clostridium_botulinum_489467259                             --EIMDIDFYC-----------QG-SR---GKGAGKLKK----------------------------GKFIVLKGSRASK----FFYDSV---KTSNTKLVNKLMN-EGKLREEN---EYYILIE-E--CIFTSPSAAAKFILG-------------RSANGWSEWKTYE----------------GD----------------TLDNFRDKEE-------------------------------------------------
      IF25_RS0123745_Streptomyces_sp_NRRL_B-5680_739980329                    --RLPSGGPVA------------R-GR--LLPEKGSNGS----------------------------QKFLVYAGAPARG----KVVPSYSERRASSSRLRTQLID-EGRLRPSERWPGHLEVAE-D--VEFGSPSAAAEVLLG-------------RSANGWTRWRTKD----------------DR----------------PLSEFMPGVWAGPNRAWLV---R------------------------------------
      NTHER_RS04135_Natranaerobius_thermophilus_501422597                     -------VFVC------------K-GK--EAYAEGDYID----------------------------EGFVVYKGSKANL----TETQSAGEW---LINLRKQLIE-SGVLVKRD---EIYEFSS-N--YIFSSPSAAAATVLA-------------RRANGWTEWKNKD----------------GK----------------TLDDVKRKSD-------------------------------------------------
      N690_RS02735_Brevibacillus_thermoruber_737312527                        -------LLYC------------K-GK--DAKAVGEYTD----------------------------EGLVVLKGSTANL----TESPSSSNT---LKALRKKLID-EGVLVQEG---NVYVFTK-D--YIFGSPSSAADVVLA-------------RSANGWTEWKRED----------------GK----------------TLYQLKRE---------------------------------------------------
      H630_RS36845_Salinibacillus_aidingensis_763304606                       ----NDNLFYC------------K-GR--GIQAVAEYTE----------------------------EGMVVLKNSEMAR----DTTESFKEYMTSSGMRRDTLIK-DGIVKLTG---DVYTFQE-D--YIFTSPSAAAKTVLG-------------RAANGWTKWKTKD----------------GK----------------TLDQIKRKGK-------------------------------------------------
      X941_RS08425_Burkholderia_pseudomallei_759571980                        --VAKEEVFYC------------KTSN---YDAIAQYTT----------------------------EGMVVMKGSKARV----EMVPSAGES---QQKRRQQLIA-EGVLKLED---GFYVFQR-D--VLFKSPSGASDAITG-------------ASTNGWQLWKTKE----------------GK----------------TLDELKRQPASS-----------------------------------------------
      MA05_RS01915_Comamonas_aquatica_772620269                               NESVEADTFYL------------RSTK---YDAVGEYTA----------------------------EGMVVLKGSKARI----DIATSMAQT--PLVPKRQALIE-DGALKLEG---DFYVFQR-D--VLFKSPSGAAAMVRG-------------ASSNGWVEWVSET----------------GK----------------TLDELKRQAPVNKSVS-------------------------------------------
      K225_RS0108435_Acidovorax_sp_JHL-9_736707052                            GEPQPTELFYC------------K-GPD--ASGVGEYTP----------------------------EGFVVHKGSTARI----GNVASIQGT--SQERFREQLVT-DGVLKLQG---TQYVFTR-D--YLFSSPSMAAIAVLG-------------RSANGWMEWKTEQ----------------GQ----------------TLDGAKRQVMNAVN---------------------------------------------
      LI82_07720_Methanococcoides_methylutens_695943706                       --------YFC------------KSKN---ANAIGEYTE----------------------------EGFVVNKGSKSNM----EETTSLQPS---IRAFRANLLE-KGIVKEED---GVYVYQE-D--FTFSSPSMSASVVLG-------------RAANGWMLWKDKD----------------GK----------------TLDELVRQKGISNG---------------------------------------------
      MPSY_RS00900_Methanolobus_psychrophilus_504865027                       --------YFC------------KSKD---ADAVGEYTE----------------------------EGFIVNKGSKSNV----KETPSIQQS---IKTFRANLVD-KGILKEEN---GVYVFQE-D--FTFSSPSMAASVVLA-------------RTANGWAEWRVQD----------------GGNIK-------------TLDDVIRKKEKED----------------------------------------------
      DSM3645_RS18720_Blastopirellula_marina_750016631                        --STNTESFFL------------KTNE---CDAEGNFVE----------------------------DGFVVRAGAIARK----EITPSGIDL---IEPVRTLLIE-SGVLIDQG---ANLRFTQ-D--YLFNSPSRAAIVVLG-------------RRANGWTEWKDAL----------------GR----------------SLDEVYRADGDA-----------------------------------------------
      RS9917_RS10690_Synechococcus_sp_RS9917_494162448                        ---AEVLSLRS-----------PSNG----VEAKGLYTA----------------------------EGFVVLAGSVGRG----DTAPSLGET---NERWRQRLLD-GGVMQPDDR--GRLVFPK-D--HLFKSPSGAAIALLG-------------RTANGWREWKSPQ----------------GP----------------TLHQLIREGCTDDQHP-------------------------------------------
      G442_RS0116315_Acetobacter_nitrogenifigens_651265592                    SSDEEGVTVYC-----------LAPG----VEGQARYTE----------------------------EGLVVLSGSYGRS----EVSDSFSRH--NYYRKRQNLID-QGALRIDG---SRIVYTR-D--TLFKSPSPGAVYLLG-------------RSANGWFEWKDRT----------------GR----------------SLADVMERKQ-------------------------------------------------
      Thi970DRAFT_03846_Thiorhodovibrio_sp_970_380878131                      --------LVC-----------RIKG----ARALGRPTP----------------------------DGFVVFKDSTAVL----HERPATPKRQPYVVALRKRLVD-EGILVEQD---GYLQFLH-D--AEFSSPSAAASVIHG-------------GGANGLTEWRTEL----------------VS------------------DNGANG---------------------------------------------------
      HMPREF9303_0585_Prevotella_denticola_CRIS_18C-A_325482299               --PKEEHLFFT-------------KGR--GCDAKGFYHS----------------------------KGFTVLKGSTIVS----SSSPSFKWK-----NKREKMLS-EYTHSS-K---GKLELSS-D--TTFNSPSTAADFCIG-------------SSNNGWLVWKDKD----------------GN----------------TLDSVYRKQLE------------------------------------------------
      M091_1691_Parabacteroides_distasonis_str_3776_D15_i_649528608           --LKEEHLFYT-------------KGR--GCNAKGFYNS----------------------------SGFTVLKGSIIVN----SSVPSFNWK-----EKREKLIN-EYTVTK-N---GLLVMES-D--KTFSSPSTAADFCIG-------------SSNNGWLVWKDKN----------------GQ----------------TLDSVYRKQLE------------------------------------------------
      JCM15984_RS09020_Porphyromonas_macacae_640573949                        --PKKEHLFYT-------------KGR--GCEANGFYSS----------------------------SGFTVLKGSIIAE----TPTPSFHWK-----EKRDRLIK-EYTSKK-D---GCFVVTS-D--ITFSSPSTAAMFCLG-------------RSANGWDEWKDEN----------------WK----------------TLDAIYRKQLE------------------------------------------------
      M068_1150_Bacteroides_fragilis_str_J38-1_596132180                      --PKDKPIFQI-------------VSKK--CDAKGFYDT----------------------------SGFTVLKGSRISD----KSTDSLSWR-----DKRTKLIE-EYANNK-N-----FVINE-D--ITFSSPSKAADFCLG-------------SSSNGWIMWKTER----------------GQ----------------TLDSVYRKQLE------------------------------------------------
      EH55_RS06500_Synergistes_jonesii_740127358                              -NTKNAQILHT--------------TRN-GITALGVYSG----------------------------DKFDVLEGSEINM----DKPVHLPKY----NKQRQELLD-DGHIISEN---GKSILKI-T--LTFNTPSGASNFVLG-------------GSTNGWAEWKNSD----------------GK----------------TLDELFRKS--------------------------------------------------
      HGEM_RS06435_Enorma_massiliensis_517958401                              -ALADVKLFYT--------------SRR-GVRARGVYTG----------------------------DTFDVLEGSPVDL----KVKPKLDRY----EKLRQELLA-SGDLVQDG---DGGRLVK-T--VSFSTPSGAADFVLG-------------GSNNGWIEWKDGD----------------KQ----------------TLDALYRK---------------------------------------------------
      CCUR_RS01535_Cryptobacterium_curtum_502483234                           --PSVGSATFH--------------TKKLGVKAIGRYDKET--------------------------GKFIVFAGSQIAL-------DKSIIK-----NRIAITVR-AEQFGETT---ERTTLIN-D--VVFPSPSAAAVFVLG-------------GSQNGWTEWVDDN----------------GN----------------TLSDIYRTEEN------------------------------------------------
      l13_07900_Neisseria_weaveri_ATCC_51223_343968291                        TWTGRTVTVFC-------------RSRDKILRGKGLFNVET--------------------------KQILLLEGTMIYRKISETLPPGWK-------EVYQQWLA-SDFLADSKDP-DFYVLQS-E--QLCASPSMAAALVLG-------------NNRNGWQYWRTER----------------GL----------------TLDETYRKK--------------------------------------------------
      T491_RS0113070_Prevotella_sp_P6B4_655526129                             ---SHLIKCLL--------------TR--NASAQGLFNPAD--------------------------QSLTVLSGSKINPVHLNKISPAGR-------KKRDILFA-KYTELRN----GERIVKE-D--ICFDSPSGAAQFCVG-------------GSSNGWSQWKDEN----------------GK----------------ELDSYRSNEVAKVPAPD------------------------------------------
      T495_RS0106435_Prevotella_sp_P6B1_697058363                             ---SHLIKCLL--------------TR--NASAQGLFNPAD--------------------------QSLTVLSGSKINPVHLNKISPAGR-------KKRDILFA-KYTELRN----GERIVKE-D--ICFDSPSGAAQFCVG-------------GSSNGWSQWKDEN----------------GK----------------ELDSYRSNEVAKVPAPD------------------------------------------
      BCH11DRAFT_RS20235_Burkholderia_sp_Ch1-1_494322155                      -------------------------EP--A----WVW------------------------------KGVSFPAGTKLRA-----TFKG---------KEYMGEVH-NGAFLLD------------G--VEYTSPSAAAQSVTQ-------------SPVNGWTFWQCLR----------------PG----------------DT-QWIGINTLRR----------------------------------------------
      Aam_125_009_Acidocella_aminolytica_101_=_DSM_11237_775294809            -------------------------EP--AAGMGWVS------------------------------KGVTFPDGTEFRA-----TYKG----------QHVTARVARGRLRGA------------GD-KVATSLSQAARMVTQ-------------TSVDGWTFWEVKR----------------PN----------------DL-QWQQAGTLRKKS--------------------------------------------
      BI00_RS24945_Rhodococcus_fascians_739317412                             IYVPANHHQRVN---AGNSS---PRQSY-DPDILALIE-----------------------------RGYM-MPGDVLVFY----QKKAQ-------RNYQARVRR-DGTITVG------------N--ETFTAVSTALGFCLG-------------YSVNGWQSWRLQR---------------TGE----------------LIHDLRLRVLGETH---------------------------------------------
      G418_29712_Rhodococcus_qingshengii_BKS_20-40_452756494                  --EQPASNAVV--------D---PEVTY-DDDILQLML-----------------------------FGYL-QVDQQLSY---HDIGRG--------LNFTATVTR-DGALEVN------------G--KRYGSPSAPLTELMG-------------QQRHGWRDWQLA----------------DGR----------------QLSQLRREGRTEMAW--------------------------------------------
      SCP1.291c_Streptomyces_coelicolor_499350070                             ---TLSLSTGA----QTSSS---SPVTP-HGPLAELMQ-----------------------------ADLI-KAGTVLTF---HQRRAK--------RSGRAVVTA-DGQLIVD------------GHASPFPSPSKAAEAVTG-------------NVINGWTLWHVEG---------------VGR----------------TLDDLRRELDSRTSR--------------------------------------------
      TR51_RS33030_Kitasatospora_griseola_763031682                           ---TLSLGEGA----ASAQA---APVTP-QGPLAGLMR-----------------------------ARLL-EPGAVLTF---RQRRAN--------RSGRAVVTA-DGQLVVD------------GHSSPFPSPSKAAEAVTG-------------NIINGWTLWRTS----------------DGS----------------TLDQLRQKLDAE-----------------------------------------------
      INTCA_RS15730_Intrasporangium_calvum_503259250                          ADEDEPTPWAA------------TRTQI-PGTVADLLA-----------------------------EGLL-HAGTELRCI-----RGG--------RQGQGAIGS-DGQIIVD------------GV-G-YSTPSLAAGVSLGATNS---------TGYGGWEMWHVGS--------------LTGP----------------TLADLRAQLPKRANR--------------------------------------------
      AMYBE_RS0132875_Amycolatopsis_benzoatilytica_522152436                  -------GESA-----------GSAPRRAPDALAALIE-----------------------------AGLI-EVGEQLVW--------G---------GHTATVRA-GGVLHDGG-----------GHEFAVATVTSLATHLAGY-------------TANGWHLWHRAR---------------DHR----------------PLSALRTELGTPQ----------------------------------------------
      OQ02_RS24335_Saccharothrix_sp_NRRL_B-16314_703488257                    LHVEATGIAPLPTGPTEGRS--LGGFGG-NGALADLLA-----------------------------AGLL-YEGEEFIW---DLPGRG--------ARHTARIRS-DGTLVLAD-----------G--RAYANPSGALTALAGS------------FHGNGWVQWKRTS---------------DGR----------------SLAELRAELRTRRGLTIG-----------------------------------------
      A37G_RS0107850_Dehalobacter_sp_FTH1_736354164                           ---------EA-----PPSP---KKPL-APGKLLPLVE-----------------------------AGLI-EEGDVLVH---ERPRKG--------DRFEATVTE-SGWLNAC------------G--VLYQYPSAALGNLVR-------------SQINGWLNWTHQP---------------SGK----------------TLRELELELEGVNKGGSN-----------------------------------------
      BLA_0918_Bifidobacterium_animalis_subsp_lactis_AD011_219621053          QGQSDRTGTSA------HRS--RRTSK--RVTVGEVVR-----------------------------AGLL-TVGDRFVW---NRPRKH--------EIWRITVTE-SGFRGED------------G--TEYATPTSAARAIGG-------------SSA-SLNVWKRES---------------DGR----------------ALSDIWKTYRTSM----------------------------------------------
      _Bradyrhizobium_japonicum_742488091                                     -VPPDHRSGFA------RAR--RRPGR--NVDLADLIN-----------------------------AGLL-QPGMSLVP---KRKK-F--------SHRVATLLA-DGRVEVD------------G--EAFANAREAATAIYG-------------KKTGGWWFFLTDP--------------ASGR----------------TLRAVRRDYIEAMAVD-------------------------------------------
      OP04_RS15495_Catenuloplanes_japonicus_703061045                         PGRSTRTTYLI-------------DGR--RVVIRDLID-----------------------------AGLL-VPNTELSF---NRPRMR--------ESHRATVTE-TGRIRLDS-----------G--EEFQSPSRAAVAAANT------------RVLDGWRAWVVE---------------PERR----------------SLDALRQEFLDKAVADTP-----------------------------------------
      L942_RS08340_Amycolatopsis_orientalis_739497435                         -------MHLI-------------GGR--RVTVSDLID-----------------------------AGLL-QAGERLRF---ARNRIG--------VAYDATVTA-EGRIRLASD----------G--EEFRSPSRAAMVAAGM------------RAVDGWRAWQVV---------------EQDR----------------LLDGLRQDFLDQALTGT------------------------------------------
      AMIS_RS01730_Actinoplanes_missouriensis_754222116                       ----------M-------------NGR--RVRILDLIK-----------------------------AGLL-NPGQELVF---ERPRIG--------EIHRAVVTD-NGRIRVAD-----------G--QEFASPSRAADDVSG-------------TGTDGWYAWRVG---------------DDGP----------------LLDQLRQELLKSAASQT------------------------------------------
      SBI_RS18605_Streptomyces_bingchenggensis_759777256                      ---MGRESYLL-------------EGR--RVTVGDLLE-----------------------------AGYL-NAGAKLTF---ERPRRG--------EKHHAELAA-NGKVQLSD-----------G--QLFRSPSRAAIAAVGG------------GSFDGWHAWTLD----------------DGR----------------TLDQLRQQLLDEAAQQA------------------------------------------
      VVMO6_RS10310_Vibrio_vulnificus_763144244                               EDIDNLVRPLL--------------DS---LTV----------------------------------KGVTFPPGTQFRA-----SYKG--------SLHHGVVE--SGNIVVN------------G--VRCKSPSKAAEAITG-------------NSVNGWKFWECKL----------------PS----------------CN-HWVAISSLRES---------------------------------------------
      SACT1_4469_Streptomyces_griseus_XylebKG-1_326658949                     SLNRYAERPRY-----------LTAGR--RVTMADLLD-----------------------------AGLI-TEGTQLTF---ER--AG--------ARYSARVTA-AGRLELVG-----------G--QQFPSPNRAAAAAVGE------------GTVDGWQSWALE----------------DGT----------------TLDRLRQRFLGTSTVASS-----------------------------------------
      C569_RS0103420_Micromonospora_sp_CNB394_648575907                       -MSNGEPRPRRTH---------LLDGR--RVRISDLMD-----------------------------ANLL-KAGDDLYF---QQRIGD--------PPHQATVTE-RGRLRLQD-----------G--REFSTPSKAGAVVARR------------RAVAGWSAWQVG---------------IGGR----------------TLHQLRLQLLRDVAAEVTANEEVPR---AETA---------------------------
      KUTG_05615_Kutzneria_sp_744_585093370                                   --LTGSRSPKA--PSARTKV--SAAER--ALTVADLID-----------------------------AGLL-STRRPLTA-----EWRR--------EQRQAELLS-DGSLRFN------------G--QNYKSLSSAGEAVKMDIAGLDLKEST--RATDGWEFWTAP-------------DPTSGEPE--------------RLKELRRRLADQS----------------------------------------------
      Isova_2668_Isoptericola_variabilis_225_334108481                        YGSYGSYDTGYPTYPHVP----TDDEE---DEDLVAL------------------------------AGRLGRPTALVWS----RPRRR--------QHFEGTLHP-DGTIELAD-----------G--RRYRNPDAAASAAAGTP-----------T-SDGWGVWRLG---------------AAGP----------------TLLEAYRQHFA------------------------------------------------
      XCEL_RS15100_Xylanimonas_cellulosilytica_502643352                      EPQPQQQAPMV-----------LDEDA---DADLAAL------------------------------AARFGVPTALVWE----RPRRG--------QRYDATLHP-DGTLELYG-----------G--GRYRHPDVAASAASGSY-----------T-ADGWTVWRVA---------------ATGE----------------TLAEAFRARFA------------------------------------------------
      H291_RS0125805_Promicromonospora_sukumoe_649278169                      GESIRASTPML-----------FEEED---DPDLEAL------------------------------ARSIGTPTRIVWS----RPRRN--------QHFEAMLLP-DGAIELAN-----------G--ARYRHPDSAATAASGSY-----------T-ADGWSVWRLG---------------DTGP----------------TLVEEFSRRFA------------------------------------------------
      CELF_RS02725_Cellulomonas_fimi_503535640                                DDDEPQDAPTF-----------LDAP----HPELATL------------------------------AKRRRAVTTLVWV----RERRG--------QRFEAMLRP-DGYVELED-----------G--SVHADPDVAAAAVIGAE-----------SSVDGWRAWRLG---------------DGGP----------------TLAEATGVDRA------------------------------------------------
      N866_19315_Actinotalea_ferrariae_CF5-4_601042837                        TGPVPVNHPIAVVPGASAGSPATGEHRQGPDPRLVEI------------------------------ASRLDGPGRLVWV----RRRRG--------ERYDAVLHP-DGVVETR------------G--RRFGDPDRAAAFASGG------------TVVDGWSVWRVG--------------HDDGP----------------SLGDLLRGSQLDGARR-------------------------------------------
      HMPREF1979_RS00220_Actinomyces_johnsonii_545335724                      RGSADALDPVAEALGGDPS-KSLVVQE---TAELAAV------------------------------AATLGEPTQLIWQ----RLRRG--------IYHEAMLSV-EGVITLSD-----------G--RSFTDPTSAANAAQDV------------TDADGWRVWRVG---------------VRGA----------------HLGDLRDDLADRSS---------------------------------------------
      W5W_RS0104910_Actinomyces_massiliensis_515761987                        DRNHRAAGPVL--GAQPTVP--SGGKS---AAALASV------------------------------ASRINTPATLVWQ----RVRRG--------IHHEAVLNA-DGIITLSN-----------G--MRFRDPSAAANAAQHT------------QDIDGWRVWRIG---------------AQGP----------------ALRDFIDDQG-------------------------------------------------
      HMPREF0045_01501_Actinomyces_graevenitzii_C83_365257398                 LSRRDSGQSTVSRSHP------SGEDS---RLALNAL------------------------------ASILSEPVQITWQ----SVTEG--------IFHTAQLRP-DGMIRVSD-----------G--TSFDEPGQAAHHCEPA------------KSVDGWDVWRFG---------------ADGP----------------SLYESLEELIAAAERSPRRPGRPVR---SRRQ--------------------G--R---
      HMPREF0574_0913_Mobiluncus_curtisii_subsp_curtisii_ATCC_35241_304326663 GAGQSDEFDLL-----------YRDAT---GLGIIAQ------------------------------VTGEDTPLVALIDF----NGTP--------AEVTAILAE-RGVIILE------------G--REFHDPSDAAREL-G-------------QDVDGWEFWHLG--------------FSEGP----------------TLAEAQAEINAEIQRN---R---------------------------------------
      AURANDRAFT_62586_Aureococcus_anophagefferens_676383955                  -----EPGKGA------------RAAK---ALTLGEMVA----------------------------RGLV-APGAGVIS---LAR-RP---------DVVADLL--DGGAIRH------------GA-ATYASPTAFATAVIGK------------SVRKGLKAVTYN-----------------GA----------------NLDELRGAAVRGTAAAPAP---P------------------------------------
      BGIM_RS0130265_Zavarzinella_formosa_521961820                           --------VLA----EGDLS-REARQRAAVIADDADLRLTPPREGTGEPKAFGGQTIATSQSLPQTRDRRL-PPAGTVLT----RAYQG--------RTIRVTVAA-DGF-EFD------------G--EMFGSLSAVAKSVTG-------------SHCNGFAFFKL------------------GG----------------KS---------------------------------------------------------
      _Nocardioides_sp_JGI_0001009-J09_655234456                              ---------LA----QGDLS-ERAKQRAAEIANDADLRLMPPVVAPGATPRPVPQPAASKSH-----DPRL-PPVGTILN----RPYKG--------RAVQVRVLT-DGF-ECD------------G--KVYSSLSSLAKEITG-------------SHCNGFAFFKLT----------------KGG----------------KE---------------------------------------------------------
      B038_RS0115505_Martelella_mediterranea_516723200                        -----------------SLA---ASPI--AEGRSWVGKGKS--------------------------AGLMLPHGTDLQM-----VYNG--------QHFTGHVD--NGSLVLE------------G--QRFSSPSGAADELCRTRDGKK-------TSLNGKELIQVRL----------------PG----------------ES-EWQL----------------------------------------------------
      RSMK_RS04070_Ralstonia_solanacearum_489363811                           -------ELAY-----GALPAGLRRHL---VEAGARLSKIKTATGRG--------------------SQLVLMPGTTLIR-----EWDE--------REYRVTVTP-DGLFELN------------G--QVFKSLSAAARHITG-------------TQWNGPRFFGLRD----------------GK----------------GGTR-------------------------------------------------------
      JL55_RS07420_Pseudomonas_chloritidismutans_757691291                    -------EQAY-----GPLPAGVRRYL---VERGAQFSKIQQA-GRG--------------------TECHLMPGTVLVR-----EWDE--------REYRVTVTA-DGLYDLN------------G--QRFKSLSAAARHITG-------------TQWSGPKFFGLKP----------------GK----------------GGKQ-------------------------------------------------------
      MMC1_RS13375_Magnetococcus_marinus_500033502                            -------ELAW-----GGLSETTKAKL---EAQAAEEKTMDRTPNPIRN------------------DGLP-VAGTRLVR-----EWKG--------VEHSCTVLD-DGF-EYQ------------G--RKFKSLSAAARAVTG-------------TRWNGKIFW-LGG----------------KK---------------------------------------------------------------------------
      HMPREF0731_0014_Roseomonas_cervicalis_ATCC_49957_296267968              -------ELAY-----GGLKPETVARL---EALGEQLDGGNVVLRRLRAG-----------------SDRP-VIGTRLIR-----EYQG--------VQHGVTVLA-DGF-EYE------------G--RPYRSLSAIARHITG-------------TRWNGWAFFGLKA----------------QR----------------GSA--------------------------------------------------------
      B038_RS0116660_Martelella_mediterranea_516723468                        --MLFK-EAQK-----IWLAPQRKDGF---VKTSEGLAGMTTEVR----------------------KGFP-KDGTKCDF-----TYGE--------TIYRGEIV--SGSIVLA------------GVAEQFGSFSAASKHITK-------------TSRNGWNDWYLDL----------------PN----------------GQ-RMLADHWRKSSEA-------------------------------------------
      IG10_RS0113630_Streptomyces_griseus_664177934                           EQLIVE-GPTA-----PTLVG-AEIQE---LRTKRGVA-----------------------------IHAV-YEGQRVDA------------------YYDPLSRV-VRIPSGP------------GR-GEYETPSGAAVAVVHVLNPHVN------PNRNGWNFWTVTA---------------TGR----------------LLQSIR-----------------------------------------------------
      FJSC11DRAFT_3600_Fischerella_sp_JSC-11_353541191                        SKQLHI-FESP-----TQEE-NAEHSQ---VRLSQLFD-----------------------------AGIT-KKGMSVRVKLKREVAKK----LQRDYINGLEISV-KGTIVYN------------G--EEFDKPSPLAAKING-------------GAINGWEYIEVKK----------------DDK-------------WVRLEELRKIWRKTNG---------------------------------------------
      HMPREF1650_RS09705_Corynebacterium_freneyi_737135649                    GVTSVPTVDSS-----AQLE--ERYDE---PTLAELVE-----------------------------KGLL-RPGALLDP-----VDPG--------WEVDAVIDD-DGTLVID------------GV-HQFDSLTEATHSLGV-------------TNMSGLSFWALET----------------SER-------------LVPLAELVASDTRIPR---------------------------------------------
      UPA14_RS00020_Ureaplasma_parvum_493739841                               ISAKIG-VVEN-----SFFD--QKPIR---VSMFEMIS-----------------------------DGYF-KLGEYFIN-------SN--------GEKAKLAKA-NGWLEYQ------------G---EINSMHEVAAKMIGRE-----------RRVNAFNYLFVER---------------DGE--------------IISINKIRENYRQHLIAKS------------------------------------------
      T403_RS0103085_Mycoplasma_collis_697093027                              TQKQIG-NVEN-----AVFD--IKPIK---VNFSSMVE-----------------------------KNFF-FLNEKFYH-------KN--------GQSAELVDP-KGKLKYK------------N---DISSMHEIAAKMMNRN-----------LKVNAYDYLFVKR----------------NEK-------------LISIAKIREDYRKILKAN-------------------------------------------
      TREVI0001_RS02010_Treponema_vincentii_493197799                         TKEMIG-NIEK-----ATFD--IKPIK---VDFIDLIK-----------------------------NNFL-LPDEKFFL-------KN--------SDSFAILKS-DGKIELP------------S--NIVTDIHKGAAILGNKKA----------ARVNGFDFWYVER----------------NNK-------------RKSIKDIREDYRKIIAG--------------------------------------------
      RB2501_15584_Robiginitalea_biformata_HTCC2501_88784593                  KEILTG--SYD-----KNFN-FFLSLR--AEAIFKVIKQEILDKKEEIQKEFYSIPQVDST------KEIA-IFGS-YYK-------KR---------IEARFNTK-TGSVFYN------------G--KLYETPSSAANQAKIDCGAHSG------ITSSGWTFWKFIN----------------ENDNT-----------EEQIDVLRKISEAY-----------------------------------------------
      WH82_RS16745_Geobacillus_thermoglucosidasius_755032699                  TEIIDG--AYD-----DFYI-AFLEKR--AEAIFKLVEKYIIAKEQKIVDLFYQPPK--TK------GNIK-IFAS-YYN-------KK---------VEAIFDIE-TQQVHYN------------G---EVLSVSAAADKAKYNLSGKDN------TSTNGWRFWKYIN----------------EQN-Q-----------ERYIDDFRK----------------------------------------------------
      BN613_00479_Cryptobacterium_sp_CAG:338_548076194                        NISERG-VRLA-----DKCV--ATWPI--PKQYQSLVEVSDVNSGRRSPFRFS--------------MVGL-SEGDVVTF------ADD--------SSKTAVITD-DSHVEYC------------G---EIFSLTALAMKLLKSS-----------HSVQGPFYFMYDG----------------E-----------------RLSELREQVESGMF---------------------------------------------
      consensus/100%                                                           .....................................................................................................................................................................h.................................................................................................
      consensus/95%                                                            ..........................................................................s............................h.............................s...h...h...................uh..h.................................................................................................
      consensus/90%                                                            ....................................h.....................................s..h.........................l...pu.h......................s.p.hs..h..................pua..h....................s..................l..h......................................................
      consensus/85%                                                            ....................................h................................hh...s..h.......................s.l...pu.l...............s......s.p.hu..h..................sGa..h....................s..................l..hb.....................................................
      consensus/80%                                                            ....................................h...............................shh...Gp.h.......................s.l...pG.l...............s....h.o.p.hu..h..................sGW..h....................s..................l..hb.....................................................
      consensus/75%                                                            ..........h...............p....sh..hhp..............................shl...Gp.h..........s.........p..s.l...sG.l...............s....h.o.p.hu..h.................ssGW..hp...................u..................lp.hb.....................................................
      consensus/70%                                                            ..........h..............sp...hsh..hhp..............................shl..sGp.h..........s.........p..s.l.s.sG.l...............s....a.oss.hu..h...............psssGWp.hph..................Gp.................Lsphb.p...................................................
      
      Back to Contents
    • Phyletic distribution and domain architectures of the RAMA domain

      The RAMA domain is a predicted DNA binding domain and is likely to distinguish Methyladenine. In addition to the several distinct fusions below there is a fusion of the RAMA to a Group2/Clade2/Cholorophyte type methylase, which seems to be an independent fusion, suggesting a strong linkage between this domain and the methylase.
      --Eukaryotic versions----
      GI           Architectures                                                                                  Pfam-archs        Gene name              Len   Taxonomy                                              Species                                 Genbank 
      # 174; MPND-like with fusions to JAB                                                                                                                                                                                                                                       
      167524112    RAMA+JAB                                                                                       AvrE               MONBRDRAFT_26083      1097  eukaryota>choanoflagellida                            Monosiga brevicollis MX1                hypothetical protein [Monosiga brevicollis MX1].
      391344637    RAMA+JAB                                                                                       Prok-JAB           LOC100902139          519   eukaryota>metazoa                                     Metaseiulus occidentalis                PREDICTED: MPN domain-containing protein-like [Metaseiulus occidentalis].
      761911769    RAMA+JAB                                                                                       Prok-JAB           LOC100641685          387   eukaryota>metazoa                                     Amphimedon queenslandica                PREDICTED: MPN domain-containing protein-like [Amphimedon queenslandica].
      241173425    RAMA+JAB                                                                                       JAB                IscW_ISCW016991       405   eukaryota>metazoa                                     Ixodes scapularis                       MPN domain-containing protein, putative, partial [Ixodes scapularis].
      443694149    RAMA+JAB                                                                                       JAB                CAPTEDRAFT_110378     411   eukaryota>metazoa>annelida                            Capitella teleta                        hypothetical protein CAPTEDRAFT_110378 [Capitella teleta].
      198425307    RAMA+JAB                                                                                       FAP                LOC100178477          470   eukaryota>metazoa>chordata                            Ciona intestinalis                      PREDICTED: MPN domain-containing protein-like [Ciona intestinalis].
      260806149    RAMA+JAB                                                                                       JAB                BRAFLDRAFT_221394     432   eukaryota>metazoa>chordata                            Branchiostoma floridae                  hypothetical protein BRAFLDRAFT_221394, partial [Branchiostoma floridae].
      612011725    RAMA+JAB                                                                                       Prok-JAB           MPND                  568   eukaryota>metazoa>chordata>vertebrata                 Monodelphis domestica                   PREDICTED: MPN domain-containing protein, partial [Monodelphis domestica].
      444509502    RAMA+JAB                                                                                       Prok-JAB           TREES_T100006078      567   eukaryota>metazoa>chordata>vertebrata                 Tupaia chinensis                        MPN domain-containing protein [Tupaia chinensis].
      586477103    RAMA+JAB                                                                                       Prok-JAB           MPND                  547   eukaryota>metazoa>chordata>vertebrata                 Chrysochloris asiatica                  PREDICTED: MPN domain-containing protein [Chrysochloris asiatica].
      635035272    RAMA+JAB                                                                                       GRP                MPND                  530   eukaryota>metazoa>chordata>vertebrata                 Chlorocebus sabaeus                     PREDICTED: MPN domain-containing protein [Chlorocebus sabaeus].
      741880389    RAMA+JAB                                                                                       Prok-JAB           MPND                  529   eukaryota>metazoa>chordata>vertebrata                 Bos taurus                              PREDICTED: MPN domain-containing protein isoform X1 [Bos taurus].
      513008759    RAMA+JAB                                                                                       GRP                Mpnd                  523   eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   PREDICTED: MPN domain-containing protein isoform X1 [Heterocephalus glaber].
      729729621    RAMA+JAB                                                                                       Prok-JAB           MPND                  523   eukaryota>metazoa>chordata>vertebrata                 Haliaeetus leucocephalus                PREDICTED: MPN domain-containing protein [Haliaeetus leucocephalus].
      543377065    RAMA+JAB                                                                                       Prok-JAB           MPND                  588   eukaryota>metazoa>chordata>vertebrata                 Pseudopodoces humilis                   PREDICTED: MPN domain-containing protein [Pseudopodoces humilis].
      507541562    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  511   eukaryota>metazoa>chordata>vertebrata                 Jaculus jaculus                         PREDICTED: MPN domain-containing protein [Jaculus jaculus].
      569001445    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  511   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            PREDICTED: MPN domain-containing protein isoform X1 [Mus musculus].
      426386710    RAMA+JAB                                                                                       DUF2763            MPND                  508   eukaryota>metazoa>chordata>vertebrata                 Gorilla gorilla gorilla                 PREDICTED: MPN domain-containing protein [Gorilla gorilla gorilla].
      632966309    RAMA+JAB                                                                                       Prok-JAB           mpnd                  507   eukaryota>metazoa>chordata>vertebrata                 Callorhinchus milii                     PREDICTED: MPN domain-containing protein [Callorhinchus milii].
      525025711    RAMA+JAB                                                                                       Adeno_terminal     MPND                  503   eukaryota>metazoa>chordata>vertebrata                 Ficedula albicollis                     PREDICTED: MPN domain-containing protein [Ficedula albicollis].
      344237591    RAMA+JAB                                                                                       Prok-JAB           I79_008167            502   eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      MPN domain-containing protein [Cricetulus griseus].
      564243961    RAMA+JAB                                                                                       Prok-JAB           MPND                  501   eukaryota>metazoa>chordata>vertebrata                 Alligator mississippiensis              PREDICTED: MPN domain-containing protein [Alligator mississippiensis].
      664805955    RAMA+JAB                                                                                       DUF2763            MPND                  501   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            MPN domain-containing protein isoform 3 [Homo sapiens].
      694972324    RAMA+JAB                                                                                       GRP                MPND                  501   eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                         PREDICTED: MPN domain-containing protein [Pan troglodytes].
      667305367    RAMA+JAB                                                                                       Prok-JAB           MPND                  499   eukaryota>metazoa>chordata>vertebrata                 Galeopterus variegatus                  PREDICTED: MPN domain-containing protein [Galeopterus variegatus].
      4581082      RAMA+JAB                                                                                       DUF2763            -                     498   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            R31167_1, partial protein, partial [Homo sapiens].
      676421995    RAMA+JAB                                                                                       Prok-JAB           N302_03640            295   eukaryota>metazoa>chordata>vertebrata                 Corvus brachyrhynchos                   MPN domain-containing protein, partial [Corvus brachyrhynchos].
      669290398    RAMA+JAB                                                                                       Prok-JAB           MPND                  322   eukaryota>metazoa>chordata>vertebrata                 Corvus brachyrhynchos                   PREDICTED: MPN domain-containing protein [Corvus brachyrhynchos].
      395518304    RAMA+JAB                                                                                       Prok-JAB           LOC100916797          333   eukaryota>metazoa>chordata>vertebrata                 Sarcophilus harrisii                    PREDICTED: MPN domain-containing protein-like, partial [Sarcophilus harrisii].
      677295082    RAMA+JAB                                                                                       Adeno_terminal     N310_04334            358   eukaryota>metazoa>chordata>vertebrata                 Acanthisitta chloris                    MPN domain-containing protein, partial [Acanthisitta chloris].
      146134497    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  487   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            MPN domain-containing protein [Mus musculus].
      507697284    RAMA+JAB                                                                                       Prok-JAB           MPND                  372   eukaryota>metazoa>chordata>vertebrata                 Echinops telfairi                       PREDICTED: MPN domain-containing protein [Echinops telfairi].
      664779050    RAMA                                                                                           -                  LOC103543755          376   eukaryota>metazoa>chordata>vertebrata                 Equus przewalskii                       PREDICTED: MPN domain-containing protein-like, partial [Equus przewalskii].
      683465952    RAMA+JAB                                                                                       Adeno_terminal     N321_06775            377   eukaryota>metazoa>chordata>vertebrata                 Caprimulgus carolinensis                MPN domain-containing protein, partial [Caprimulgus carolinensis].
      548473217    RAMA+JAB                                                                                       Prok-JAB           MPND                  377   eukaryota>metazoa>chordata>vertebrata                 Capra hircus                            PREDICTED: MPN domain-containing protein, partial [Capra hircus].
      675756747    RAMA+JAB                                                                                       Prok-JAB           LOC100402569          389   eukaryota>metazoa>chordata>vertebrata                 Callithrix jacchus                      PREDICTED: MPN domain-containing protein [Callithrix jacchus].
      759183571    RAMA                                                                                           -                  MPND                  393   eukaryota>metazoa>chordata>vertebrata                 Pteropus vampyrus                       PREDICTED: MPN domain-containing protein [Pteropus vampyrus].
      641763616    RAMA+JAB                                                                                       Prok-JAB           MPND                  395   eukaryota>metazoa>chordata>vertebrata                 Chrysemys picta bellii                  PREDICTED: MPN domain-containing protein isoform X2 [Chrysemys picta bellii].
      554528053    RAMA+JAB                                                                                       Prok-JAB           MPND                  399   eukaryota>metazoa>chordata>vertebrata                 Myotis brandtii                         PREDICTED: MPN domain-containing protein [Myotis brandtii].
      478534656    RAMA+JAB                                                                                       Prok-JAB           MPND                  399   eukaryota>metazoa>chordata>vertebrata                 Ceratotherium simum simum               PREDICTED: MPN domain-containing protein [Ceratotherium simum simum].
      640778776    RAMA+JAB                                                                                       Prok-JAB           MPND                  400   eukaryota>metazoa>chordata>vertebrata                 Tarsius syrichta                        PREDICTED: MPN domain-containing protein [Tarsius syrichta].
      585173801    RAMA+JAB                                                                                       Prok-JAB           MPND                  400   eukaryota>metazoa>chordata>vertebrata                 Leptonychotes weddellii                 PREDICTED: MPN domain-containing protein [Leptonychotes weddellii].
      560906146    RAMA+JAB                                                                                       Prok-JAB           MPND                  400   eukaryota>metazoa>chordata>vertebrata                 Camelus ferus                           PREDICTED: LOW QUALITY PROTEIN: MPN domain containing, partial [Camelus ferus].
      676589955    RAMA+JAB                                                                                       Adeno_terminal     N303_15020            404   eukaryota>metazoa>chordata>vertebrata                 Cuculus canorus                         MPN domain-containing protein, partial [Cuculus canorus].
      351711696    RAMA+JAB                                                                                       Prok-JAB           GW7_06760             592   eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   MPN domain-containing protein [Heterocephalus glaber].
      281349777    RAMA+JAB                                                                                       Prok-JAB           PANDA_018471          406   eukaryota>metazoa>chordata>vertebrata                 Ailuropoda melanoleuca                  hypothetical protein PANDA_018471, partial [Ailuropoda melanoleuca].
      543725811    RAMA+JAB                                                                                       Adeno_terminal     MPND                  407   eukaryota>metazoa>chordata>vertebrata                 Columba livia                           PREDICTED: MPN domain-containing protein, partial [Columba livia].
      511983578    RAMA+JAB                                                                                       Prok-JAB           MPND                  407   eukaryota>metazoa>chordata>vertebrata                 Mustela putorius furo                   PREDICTED: MPN domain-containing protein, partial [Mustela putorius furo].
      395831687    RAMA+JAB                                                                                       Prok-JAB           MPND                  407   eukaryota>metazoa>chordata>vertebrata                 Otolemur garnettii                      PREDICTED: MPN domain-containing protein [Otolemur garnettii].
      591296222    RAMA+JAB                                                                                       -                  MPND                  408   eukaryota>metazoa>chordata>vertebrata                 Panthera tigris altaica                 PREDICTED: MPN domain-containing protein, partial [Panthera tigris altaica].
      694857258    RAMA+JAB                                                                                       Adeno_terminal     MPND                  409   eukaryota>metazoa>chordata>vertebrata                 Nipponia nippon                         PREDICTED: MPN domain-containing protein [Nipponia nippon].
      355702999    RAMA+JAB                                                                                       Prok-JAB           EGK_09937             410   eukaryota>metazoa>chordata>vertebrata                 Macaca mulatta                          hypothetical protein EGK_09937, partial [Macaca mulatta].
      555967973    RAMA+JAB                                                                                       Prok-JAB           MPND                  411   eukaryota>metazoa>chordata>vertebrata                 Bos mutus                               PREDICTED: MPN domain-containing protein, partial [Bos mutus].
      532030006    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  601   eukaryota>metazoa>chordata>vertebrata                 Microtus ochrogaster                    PREDICTED: MPN domain-containing protein [Microtus ochrogaster].
      465951216    RAMA+JAB                                                                                       -                  UY3_18872             413   eukaryota>metazoa>chordata>vertebrata                 Chelonia mydas                          MPN domain-containing protein [Chelonia mydas].
      532093187    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  494   eukaryota>metazoa>chordata>vertebrata                 Ictidomys tridecemlineatus              PREDICTED: MPN domain-containing protein [Ictidomys tridecemlineatus].
      354479184    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  487   eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      PREDICTED: MPN domain-containing protein isoform X2 [Cricetulus griseus].
      524963049    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  487   eukaryota>metazoa>chordata>vertebrata                 Mesocricetus auratus                    PREDICTED: MPN domain-containing protein [Mesocricetus auratus].
      589939100    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  487   eukaryota>metazoa>chordata>vertebrata                 Peromyscus maniculatus bairdii          PREDICTED: MPN domain-containing protein isoform X2 [Peromyscus maniculatus bairdii].
      211826729    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  486   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            Mpnd protein, partial [Mus musculus].
      146134361    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  487   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                       MPN domain-containing protein [Rattus norvegicus].
      641763614    RAMA+JAB                                                                                       Prok-JAB           MPND                  484   eukaryota>metazoa>chordata>vertebrata                 Chrysemys picta bellii                  PREDICTED: MPN domain-containing protein isoform X1 [Chrysemys picta bellii].
      471381804    RAMA+JAB                                                                                       Prok-JAB           MPND                  482   eukaryota>metazoa>chordata>vertebrata                 Trichechus manatus latirostris          PREDICTED: MPN domain-containing protein [Trichechus manatus latirostris].
      557264364    RAMA+JAB                                                                                       Prok-JAB           MPND                  481   eukaryota>metazoa>chordata>vertebrata                 Alligator sinensis                      PREDICTED: MPN domain-containing protein [Alligator sinensis].
      556970552    RAMA+JAB                                                                                       JAB                MPND                  480   eukaryota>metazoa>chordata>vertebrata                 Latimeria chalumnae                     PREDICTED: MPN domain-containing protein [Latimeria chalumnae].
      602649789    RAMA+JAB                                                                                       Prok-JAB           MPND                  480   eukaryota>metazoa>chordata>vertebrata                 Python bivittatus                       PREDICTED: MPN domain-containing protein [Python bivittatus].
      589939098    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  488   eukaryota>metazoa>chordata>vertebrata                 Peromyscus maniculatus bairdii          PREDICTED: MPN domain-containing protein isoform X1 [Peromyscus maniculatus bairdii].
      533186386    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  476   eukaryota>metazoa>chordata>vertebrata                 Chinchilla lanigera                     PREDICTED: MPN domain-containing protein [Chinchilla lanigera].
      472351398    RAMA+JAB                                                                                       API5               MPND                  474   eukaryota>metazoa>chordata>vertebrata                 Odobenus rosmarus divergens             PREDICTED: MPN domain-containing protein [Odobenus rosmarus divergens].
      558220477    RAMA+JAB                                                                                       Prok-JAB           MPND                  474   eukaryota>metazoa>chordata>vertebrata                 Pelodiscus sinensis                     PREDICTED: MPN domain-containing protein [Pelodiscus sinensis].
      513008765    RAMA+JAB                                                                                       GRP                Mpnd                  473   eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   PREDICTED: MPN domain-containing protein isoform X4 [Heterocephalus glaber].
      578833691    RAMA+JAB                                                                                       DUF2763            MPND                  472   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            PREDICTED: MPN domain-containing protein isoform X1 [Homo sapiens].
      296485713    RAMA+JAB                                                                                       Prok-JAB           BOS_7727              585   eukaryota>metazoa>chordata>vertebrata                 Bos taurus                              TPA: CG4751-like [Bos taurus].
      544507859    RAMA+JAB                                                                                       Bacteriocin_IIc    MPND                  471   eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     PREDICTED: MPN domain-containing protein isoform X1 [Macaca fascicularis].
      513008761    RAMA+JAB                                                                                       GRP                Mpnd                  495   eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   PREDICTED: MPN domain-containing protein isoform X2 [Heterocephalus glaber].
      562889145    RAMA+JAB                                                                                       Prok-JAB           MPND                  718   eukaryota>metazoa>chordata>vertebrata                 Tupaia chinensis                        PREDICTED: MPN domain-containing protein [Tupaia chinensis].
      545535184    RAMA+JAB                                                                                       Prok-JAB           MPND                  470   eukaryota>metazoa>chordata>vertebrata                 Canis lupus familiaris                  PREDICTED: MPN domain-containing protein [Canis lupus familiaris].
      663260198    RAMA+JAB                                                                                       Prok-JAB           MPND                  489   eukaryota>metazoa>chordata>vertebrata                 Calypte anna                            PREDICTED: MPN domain-containing protein [Calypte anna].
      74183910     RAMA+JAB                                                                                       Prok-JAB           -                     467   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            unnamed protein product [Mus musculus].
      507966202    RAMA+JAB                                                                                       Prok-JAB           MPND                  467   eukaryota>metazoa>chordata>vertebrata                 Condylura cristata                      PREDICTED: MPN domain-containing protein [Condylura cristata].
      589939102    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  467   eukaryota>metazoa>chordata>vertebrata                 Peromyscus maniculatus bairdii          PREDICTED: MPN domain-containing protein isoform X3 [Peromyscus maniculatus bairdii].
      744621045    RAMA+JAB                                                                                       Prok-JAB           MPND                  490   eukaryota>metazoa>chordata>vertebrata                 Camelus dromedarius                     PREDICTED: MPN domain-containing protein [Camelus dromedarius].
      156717812    RAMA+JAB                                                                                       Prok-JAB           mpnd                  466   eukaryota>metazoa>chordata>vertebrata                 Xenopus (Silurana) tropicalis           MPN domain-containing protein [Xenopus (Silurana) tropicalis].
      634876687    RAMA+JAB                                                                                       GRP                MPND                  466   eukaryota>metazoa>chordata>vertebrata                 Orycteropus afer afer                   PREDICTED: MPN domain-containing protein isoform X1 [Orycteropus afer afer].
      625249800    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  465   eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      PREDICTED: MPN domain-containing protein isoform X3 [Cricetulus griseus].
      724844417    RAMA+JAB                                                                                       Bacteriocin_IIc    MPND                  465   eukaryota>metazoa>chordata>vertebrata                 Rhinopithecus roxellana                 PREDICTED: MPN domain-containing protein [Rhinopithecus roxellana].
      731512762    RAMA+JAB                                                                                       GRP                MPND                  465   eukaryota>metazoa>chordata>vertebrata                 Loxodonta africana                      PREDICTED: MPN domain-containing protein isoform X2 [Loxodonta africana].
      602697096    RAMA+JAB                                                                                       GRP                MPND                  463   eukaryota>metazoa>chordata>vertebrata                 Lipotes vexillifer                      PREDICTED: MPN domain-containing protein [Lipotes vexillifer].
      752433833    RAMA+JAB                                                                                       Prok-JAB           MPND                  463   eukaryota>metazoa>chordata>vertebrata                 Ailuropoda melanoleuca                  PREDICTED: MPN domain-containing protein [Ailuropoda melanoleuca].
      594035787    RAMA+JAB                                                                                       GRP                MPND                  461   eukaryota>metazoa>chordata>vertebrata                 Bubalus bubalis                         PREDICTED: MPN domain-containing protein isoform X2 [Bubalus bubalis].
      335282406    RAMA+JAB                                                                                       Prok-JAB           MPND                  460   eukaryota>metazoa>chordata>vertebrata                 Sus scrofa                              PREDICTED: MPN domain-containing protein isoform 2 [Sus scrofa].
      768397034    RAMA+JAB                                                                                       Prok-JAB           MPND                  491   eukaryota>metazoa>chordata>vertebrata                 Aquila chrysaetos canadensis            PREDICTED: MPN domain-containing protein [Aquila chrysaetos canadensis].
      742217815    RAMA+JAB                                                                                       Prok-JAB           MPND                  491   eukaryota>metazoa>chordata>vertebrata                 Bison bison bison                       PREDICTED: MPN domain-containing protein [Bison bison bison].
      675752711    RAMA+JAB                                                                                       Prok-JAB           MPND                  457   eukaryota>metazoa>chordata>vertebrata                 Pan paniscus                            PREDICTED: MPN domain-containing protein [Pan paniscus].
      699627942    RAMA+JAB                                                                                       Prok-JAB           MPND                  457   eukaryota>metazoa>chordata>vertebrata                 Picoides pubescens                      PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Picoides pubescens].
      594035785    RAMA+JAB                                                                                       GRP                MPND                  491   eukaryota>metazoa>chordata>vertebrata                 Bubalus bubalis                         PREDICTED: MPN domain-containing protein isoform X1 [Bubalus bubalis].
      358413003    RAMA+JAB                                                                                       Prok-JAB           MPND                  491   eukaryota>metazoa>chordata>vertebrata                 Bos taurus                              PREDICTED: MPN domain-containing protein isoform X2 [Bos taurus].
      743741068    RAMA+JAB                                                                                       Prok-JAB           MPND                  492   eukaryota>metazoa>chordata>vertebrata                 Camelus bactrianus                      PREDICTED: MPN domain-containing protein [Camelus bactrianus].
      696986250    RAMA+JAB                                                                                       Adeno_terminal     MPND                  492   eukaryota>metazoa>chordata>vertebrata                 Cuculus canorus                         PREDICTED: MPN domain-containing protein [Cuculus canorus].
      426230708    RAMA+JAB                                                                                       GRP                MPND                  497   eukaryota>metazoa>chordata>vertebrata                 Ovis aries                              PREDICTED: MPN domain-containing protein [Ovis aries].
      674057368    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  492   eukaryota>metazoa>chordata>vertebrata                 Nannospalax galili                      PREDICTED: MPN domain-containing protein isoform X2 [Nannospalax galili].
      586519820    RAMA+JAB                                                                                       Prok-JAB           MPND                  454   eukaryota>metazoa>chordata>vertebrata                 Pteropus alecto                         PREDICTED: MPN domain-containing protein [Pteropus alecto].
      672063995    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  493   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                       PREDICTED: MPN domain-containing protein isoform X1 [Rattus norvegicus].
      513008763    RAMA+JAB                                                                                       GRP                Mpnd                  493   eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   PREDICTED: MPN domain-containing protein isoform X3 [Heterocephalus glaber].
      14042888     RAMA+JAB                                                                                       DUF2763            -                     451   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            unnamed protein product [Homo sapiens].
      229577180    RAMA+JAB                                                                                       DUF2763            MPND                  451   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            MPN domain-containing protein isoform 2 [Homo sapiens].
      544507861    RAMA+JAB                                                                                       Bacteriocin_IIc    MPND                  451   eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     PREDICTED: MPN domain-containing protein isoform X2 [Macaca fascicularis].
      466045649    RAMA+JAB                                                                                       GRP                MPND                  493   eukaryota>metazoa>chordata>vertebrata                 Orcinus orca                            PREDICTED: MPN domain-containing protein [Orcinus orca].
      731512760    RAMA+JAB                                                                                       GRP                MPND                  494   eukaryota>metazoa>chordata>vertebrata                 Loxodonta africana                      PREDICTED: MPN domain-containing protein isoform X1 [Loxodonta africana].
      4581083      RAMA+JAB                                                                                       DUF2763            -                     448   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            R31167_2, partial protein, partial [Homo sapiens].
      530667319    RAMA+JAB                                                                                       Prok-JAB           CB1_000413045         448   eukaryota>metazoa>chordata>vertebrata                 Camelus ferus                           hypothetical protein CB1_000413045 [Camelus ferus].
      634876690    RAMA+JAB                                                                                       GRP                MPND                  446   eukaryota>metazoa>chordata>vertebrata                 Orycteropus afer afer                   PREDICTED: MPN domain-containing protein isoform X2 [Orycteropus afer afer].
      676274283    RAMA+JAB                                                                                       Prok-JAB           H920_09774            446   eukaryota>metazoa>chordata>vertebrata                 Fukomys damarensis                      MPN domain-containing protein [Fukomys damarensis].
      440905924    RAMA+JAB                                                                                       Prok-JAB           M91_12016             443   eukaryota>metazoa>chordata>vertebrata                 Bos mutus                               MPN domain-containing protein, partial [Bos mutus].
      505852230    RAMA+JAB                                                                                       Prok-JAB           MPND                  441   eukaryota>metazoa>chordata>vertebrata                 Sorex araneus                           PREDICTED: MPN domain-containing protein [Sorex araneus].
      591355969    RAMA+JAB                                                                                       -                  MPND                  441   eukaryota>metazoa>chordata>vertebrata                 Chelonia mydas                          PREDICTED: MPN domain-containing protein [Chelonia mydas].
      755695164    RAMA+JAB                                                                                       Prok-JAB           MPND                  441   eukaryota>metazoa>chordata>vertebrata                 Felis catus                             PREDICTED: MPN domain-containing protein, partial [Felis catus].
      449281945    RAMA+JAB                                                                                       Adeno_terminal     A306_02191            439   eukaryota>metazoa>chordata>vertebrata                 Columba livia                           MPN domain-containing protein, partial [Columba livia].
      512916480    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  438   eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   PREDICTED: MPN domain-containing protein, partial [Heterocephalus glaber].
      683931016    RAMA+JAB                                                                                       Prok-JAB           MPND                  437   eukaryota>metazoa>chordata>vertebrata                 Serinus canaria                         PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein, partial [Serinus canaria].
      75517321     RAMA+JAB                                                                                       Prok-JAB           Mpnd                  436   eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                       Mpnd protein, partial [Rattus norvegicus].
      556717255    RAMA+JAB                                                                                       Prok-JAB           MPND                  436   eukaryota>metazoa>chordata>vertebrata                 Pantholops hodgsonii                    PREDICTED: MPN domain-containing protein, partial [Pantholops hodgsonii].
      432116860    RAMA+JAB                                                                                       Prok-JAB           MDA_GLEAN10011104     435   eukaryota>metazoa>chordata>vertebrata                 Myotis davidii                          MPN domain-containing protein [Myotis davidii].
      529444158    RAMA+JAB                                                                                       JAB                MPND                  435   eukaryota>metazoa>chordata>vertebrata                 Falco peregrinus                        PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Falco peregrinus].
      584067554    RAMA+JAB                                                                                       Prok-JAB           MPND                  435   eukaryota>metazoa>chordata>vertebrata                 Myotis davidii                          PREDICTED: MPN domain-containing protein, partial [Myotis davidii].
      74147413     RAMA+JAB                                                                                       Prok-JAB           -                     434   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            unnamed protein product [Mus musculus].
      507652018    RAMA+JAB                                                                                       CD99L2             Mpnd                  498   eukaryota>metazoa>chordata>vertebrata                 Octodon degus                           PREDICTED: MPN domain-containing protein [Octodon degus].
      403296246    RAMA+JAB                                                                                       Prok-JAB           MPND                  433   eukaryota>metazoa>chordata>vertebrata                 Saimiri boliviensis boliviensis         PREDICTED: MPN domain-containing protein, partial [Saimiri boliviensis boliviensis].
      560968488    RAMA+JAB                                                                                       Prok-JAB           MPND                  433   eukaryota>metazoa>chordata>vertebrata                 Vicugna pacos                           PREDICTED: MPN domain-containing protein [Vicugna pacos].
      731239925    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  433   eukaryota>metazoa>chordata>vertebrata                 Fukomys damarensis                      PREDICTED: MPN domain-containing protein, partial [Fukomys damarensis].
      625196764    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  494   eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      PREDICTED: MPN domain-containing protein isoform X1 [Cricetulus griseus].
      431922315    RAMA+JAB                                                                                       Prok-JAB           PAL_GLEAN10006063     430   eukaryota>metazoa>chordata>vertebrata                 Pteropus alecto                         MPN domain-containing protein, partial [Pteropus alecto].
      594631561    RAMA+JAB                                                                                       Prok-JAB           MPND                  430   eukaryota>metazoa>chordata>vertebrata                 Balaenoptera acutorostrata scammoni     PREDICTED: MPN domain-containing protein, partial [Balaenoptera acutorostrata scammoni].
      641718859    RAMA+JAB                                                                                       Prok-JAB           MPND                  430   eukaryota>metazoa>chordata>vertebrata                 Eptesicus fuscus                        PREDICTED: MPN domain-containing protein [Eptesicus fuscus].
      511904656    RAMA+JAB                                                                                       Prok-JAB           MPND                  427   eukaryota>metazoa>chordata>vertebrata                 Mustela putorius furo                   PREDICTED: MPN domain-containing protein, partial [Mustela putorius furo].
      617610563    RAMA+JAB                                                                                       GRP                MPND                  494   eukaryota>metazoa>chordata>vertebrata                 Erinaceus europaeus                     PREDICTED: MPN domain-containing protein [Erinaceus europaeus].
      514442534    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  422   eukaryota>metazoa>chordata>vertebrata                 Cavia porcellus                         PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein, partial [Cavia porcellus].
      558208646    RAMA+JAB                                                                                       Prok-JAB           MPND                  422   eukaryota>metazoa>chordata>vertebrata                 Myotis lucifugus                        PREDICTED: MPN domain-containing protein, partial [Myotis lucifugus].
      593726834    RAMA+JAB                                                                                       Prok-JAB           MPND                  421   eukaryota>metazoa>chordata>vertebrata                 Physeter catodon                        PREDICTED: MPN domain-containing protein [Physeter catodon].
      297275824    RAMA+JAB                                                                                       Gly_rich           LOC721854             420   eukaryota>metazoa>chordata>vertebrata                 Macaca mulatta                          PREDICTED: MPN domain-containing protein-like [Macaca mulatta].
      74186431     RAMA+JAB                                                                                       Prok-JAB           -                     417   eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            unnamed protein product, partial [Mus musculus].
      677968406    RAMA+JAB                                                                                       Adeno_terminal     MPND                  417   eukaryota>metazoa>chordata>vertebrata                 Acanthisitta chloris                    PREDICTED: MPN domain-containing protein, partial [Acanthisitta chloris].
      704318525    RAMA+JAB                                                                                       Prok-JAB           MPND                  417   eukaryota>metazoa>chordata>vertebrata                 Caprimulgus carolinensis                PREDICTED: MPN domain-containing protein, partial [Caprimulgus carolinensis].
      674057366    RAMA+JAB                                                                                       Prok-JAB           Mpnd                  496   eukaryota>metazoa>chordata>vertebrata                 Nannospalax galili                      PREDICTED: MPN domain-containing protein isoform X1 [Nannospalax galili].
      31542699     RAMA+JAB                                                                                       DUF2763            MPND                  471   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            MPN domain-containing protein isoform 1 [Homo sapiens].
      551488218    RAMA+JAB                                                                                       JAB                LOC102231554          454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Xiphophorus maculatus                   PREDICTED: MPN domain-containing protein-like [Xiphophorus maculatus].
      642090869    RAMA+JAB                                                                                       JAB                GSONMT00029460001     469   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Oncorhynchus mykiss                     unnamed protein product [Oncorhynchus mykiss].
      47215175     RAMA+JAB                                                                                       Prok-JAB           GSTEN:00020207:G:001  466   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Tetraodon nigroviridis                  unnamed protein product, partial [Tetraodon nigroviridis].
      742103817    RAMA+JAB                                                                                       JAB                mpnd                  459   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Esox lucius                             PREDICTED: MPN domain-containing protein [Esox lucius].
      115497614    RAMA+JAB                                                                                       JAB                mpnd                  458   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                             MPN domain-containing protein [Danio rerio].
      410921370    RAMA+JAB                                                                                       Prok-JAB           mpnd                  454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Takifugu rubripes                       PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Takifugu rubripes].
      432853455    RAMA+JAB                                                                                       JAB                mpnd                  454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Oryzias latipes                         PREDICTED: MPN domain-containing protein [Oryzias latipes].
      498992415    RAMA+JAB                                                                                       JAB                mpnd                  454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Maylandia zebra                         PREDICTED: MPN domain-containing protein [Maylandia zebra].
      542212081    RAMA+JAB                                                                                       JAB                LOC100696290          454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Oreochromis niloticus                   PREDICTED: MPN domain-containing protein-like [Oreochromis niloticus].
      583990359    RAMA+JAB                                                                                       JAB                LOC102785695          454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Neolamprologus brichardi                PREDICTED: MPN domain-containing protein-like [Neolamprologus brichardi].
      657555408    RAMA+JAB                                                                                       JAB                mpnd                  454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Stegastes partitus                      PREDICTED: MPN domain-containing protein [Stegastes partitus].
      736173885    RAMA+JAB                                                                                       JAB                mpnd                  454   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Notothenia coriiceps                    PREDICTED: MPN domain-containing protein [Notothenia coriiceps].
      617421732    RAMA+JAB                                                                                       JAB                mpnd                  451   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Poecilia formosa                        PREDICTED: MPN domain-containing protein isoform X1 [Poecilia formosa].
      657740336    RAMA+JAB                                                                                       Prok-JAB           mpnd                  449   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Cynoglossus semilaevis                  PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Cynoglossus semilaevis].
      734603143    RAMA+JAB                                                                                       JAB                mpnd                  622   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Larimichthys crocea                     PREDICTED: MPN domain-containing protein [Larimichthys crocea].
      617421735    RAMA+JAB                                                                                       Prok-JAB           mpnd                  365   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Poecilia formosa                        PREDICTED: MPN domain-containing protein isoform X2 [Poecilia formosa].
      573904867    RAMA+JAB                                                                                       Prok-JAB           LOC102696675          485   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Lepisosteus oculatus                    PREDICTED: MPN domain-containing protein-like [Lepisosteus oculatus].
      156394960    RAMA+JAB                                                                                       Prok-JAB           NEMVEDRAFT_v1g22327   423   eukaryota>metazoa>cnidaria                            Nematostella vectensis                  predicted protein, partial [Nematostella vectensis].
      449665213    RAMA+JAB                                                                                       JAB                LOC100209039          529   eukaryota>metazoa>cnidaria                            Hydra vulgaris                          PREDICTED: MPN domain-containing protein-like [Hydra vulgaris].
      321479075    RAMA+JAB                                                                                       JAB                DAPPUDRAFT_23970      413   eukaryota>metazoa>crustacea                           Daphnia pulex                           hypothetical protein DAPPUDRAFT_23970, partial [Daphnia pulex].
      390342613    RAMA+JAB                                                                                       JAB                LOC587071             476   eukaryota>metazoa>echinodermata                       Strongylocentrotus purpuratus           PREDICTED: MPN domain-containing protein [Strongylocentrotus purpuratus].
      170038365    RAMA+JAB                                                                                       Prok-JAB           CpipJ_CPIJ005266      268   eukaryota>metazoa>hexapoda                            Culex quinquefasciatus                  MPN domain-containing protein [Culex quinquefasciatus].
      755873328    RAMA+JAB                                                                                       DUF755             LOC101887400          1459  eukaryota>metazoa>hexapoda                            Musca domestica                         PREDICTED: MPN domain-containing protein CG4751 [Musca domestica].
      195030430    RAMA+JAB                                                                                       E_Pc_C             Dgri_GH10765          1444  eukaryota>metazoa>hexapoda                            Drosophila grimshawi                    GH10765 [Drosophila grimshawi].
      198474706    RAMA+JAB                                                                                       DUF4557            Dpse_GA18404          1442  eukaryota>metazoa>hexapoda                            Drosophila pseudoobscura pseudoobscura  GA18404 [Drosophila pseudoobscura pseudoobscura].
      195118722    RAMA+JAB                                                                                       DUF2968            Dmoj_GI20608          1441  eukaryota>metazoa>hexapoda                            Drosophila mojavensis                   GI20608 [Drosophila mojavensis].
      194759065    RAMA+JAB                                                                                       DUF755             Dana_GF14762          1384  eukaryota>metazoa>hexapoda                            Drosophila ananassae                    GF14762 [Drosophila ananassae].
      195434072    RAMA+JAB                                                                                       Secretin_N_2       Dwil_GK14895          1432  eukaryota>metazoa>hexapoda                            Drosophila willistoni                   GK14895 [Drosophila willistoni].
      19921138     RAMA+JAB                                                                                       TSA                Dmel_CG4751           1412  eukaryota>metazoa>hexapoda                            Drosophila melanogaster                 CG4751 [Drosophila melanogaster].
      195340083    RAMA+JAB                                                                                       Atrophin-1         Dsec_GM18929          1410  eukaryota>metazoa>hexapoda                            Drosophila sechellia                    GM18929 [Drosophila sechellia].
      646722843    RAMA+JAB                                                                                       Vicilin_N          L798_11172            678   eukaryota>metazoa>hexapoda                            Zootermopsis nevadensis                 MPN domain-containing protein [Zootermopsis nevadensis].
      195472102    RAMA+JAB                                                                                       TSA                Dyak_GE18516          1410  eukaryota>metazoa>hexapoda                            Drosophila yakuba                       GE18516 [Drosophila yakuba].
      194861797    RAMA+JAB                                                                                       Med15              Dere_GG23709          1407  eukaryota>metazoa>hexapoda                            Drosophila erecta                       GG23709 [Drosophila erecta].
      478259688    RAMA+JAB                                                                                       JAB                YQE_03995             746   eukaryota>metazoa>hexapoda                            Dendroctonus ponderosae                 hypothetical protein YQE_03995, partial [Dendroctonus ponderosae].
      91083749     RAMA+JAB                                                                                       Prok-JAB           LOC659985             650   eukaryota>metazoa>hexapoda                            Tribolium castaneum                     PREDICTED: MPN domain-containing protein CG4751 isoform X1 [Tribolium castaneum].
      195578469    RAMA+JAB                                                                                       Prok-JAB           Dsim_GD23765          1393  eukaryota>metazoa>hexapoda                            Drosophila simulans                     GD23765 [Drosophila simulans].
      768415170    RAMA+JAB                                                                                       Prok-JAB           LOC105380298          900   eukaryota>metazoa>hexapoda                            Plutella xylostella                     PREDICTED: MPN domain-containing protein CG4751 [Plutella xylostella].
      751770382    RAMA+JAB                                                                                       JAB                LOC105223148          1201  eukaryota>metazoa>hexapoda                            Bactrocera dorsalis                     PREDICTED: MPN domain-containing protein CG4751 [Bactrocera dorsalis].
      751478045    RAMA+JAB                                                                                       JAB                LOC105219911          1189  eukaryota>metazoa>hexapoda                            Bactrocera cucurbitae                   PREDICTED: MPN domain-containing protein CG4751 [Bactrocera cucurbitae].
      357602867    RAMA+JAB                                                                                       Prok-JAB           KGM_00027             900   eukaryota>metazoa>hexapoda                            Danaus plexippus                        hypothetical protein KGM_00027 [Danaus plexippus].
      641676301    RAMA+JAB                                                                                       JAB                LOC100164705          539   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                     PREDICTED: MPN domain-containing protein isoform X1 [Acyrthosiphon pisum].
      498932490    RAMA+JAB                                                                                       JAB                LOC101449034          1174  eukaryota>metazoa>hexapoda                            Ceratitis capitata                      PREDICTED: MPN domain-containing protein CG4751 isoform X1 [Ceratitis capitata].
      568252643    RAMA+JAB                                                                                       T4SS               AND_006389            1600  eukaryota>metazoa>hexapoda                            Anopheles darlingi                      hypothetical protein AND_006389 [Anopheles darlingi].
      668454611    RAMA+JAB                                                                                       OEP                ZHAS_00010716         1593  eukaryota>metazoa>hexapoda                            Anopheles sinensis                      hypothetical protein ZHAS_00010716 [Anopheles sinensis].
      642924394    RAMA+JAB                                                                                       Prok-JAB           LOC659985             580   eukaryota>metazoa>hexapoda                            Tribolium castaneum                     PREDICTED: MPN domain-containing protein CG4751 isoform X2 [Tribolium castaneum].
      158299477    RAMA+JAB                                                                                       Gly-zipper_OmpA    AgaP_AGAP008858       1417  eukaryota>metazoa>hexapoda                            Anopheles gambiae str. PEST             AGAP008858-PA [Anopheles gambiae str. PEST].
      157117027    RAMA+JAB                                                                                       DUF3915            AaeL_AAEL007827       1313  eukaryota>metazoa>hexapoda                            Aedes aegypti                           AAEL007827-PA [Aedes aegypti].
      512926144    RAMA+JAB                                                                                       Prok-JAB           LOC101745255          934   eukaryota>metazoa>hexapoda                            Bombyx mori                             PREDICTED: MPN domain-containing protein CG4751 isoform X1 [Bombyx mori].
      641676305    RAMA+JAB                                                                                       JAB                LOC100164705          471   eukaryota>metazoa>hexapoda                            Acyrthosiphon pisum                     PREDICTED: MPN domain-containing protein isoform X2 [Acyrthosiphon pisum].
      195385142    RAMA+JAB                                                                                       Herpes_BLLF1       Dvir_GJ13201          1399  eukaryota>metazoa>hexapoda                            Drosophila virilis                      GJ13201 [Drosophila virilis].
      157141845    RAMA+JAB                                                                                       SprA-related       AaeL_AAEL015398       866   eukaryota>metazoa>hexapoda                            Aedes aegypti                           AAEL015398-PA, partial [Aedes aegypti].
      195148332    RAMA+JAB                                                                                       DUF4557            Dper_GL19545          1441  eukaryota>metazoa>hexapoda                            Drosophila persimilis                   GL19545 [Drosophila persimilis].
      676438501    RAMA+JAB                                                                                       JAB                LOTGIDRAFT_140609     434   eukaryota>metazoa>mollusca                            Lottia gigantea                         hypothetical protein LOTGIDRAFT_140609 [Lottia gigantea].
      524883584    RAMA+JAB                                                                                       Prok-JAB           LOC101862270          492   eukaryota>metazoa>mollusca                            Aplysia californica                     PREDICTED: MPN domain-containing protein-like [Aplysia californica].
      762155186    RAMA+JAB                                                                                       RNA_pol_3_Rpc31    LOC105319737          534   eukaryota>metazoa>mollusca                            Crassostrea gigas                       PREDICTED: MPN domain-containing protein-like isoform X1 [Crassostrea gigas].
      762155188    RAMA+JAB                                                                                       JAB                LOC105319737          523   eukaryota>metazoa>mollusca                            Crassostrea gigas                       PREDICTED: MPN domain-containing protein-like isoform X2 [Crassostrea gigas].
      196002159    RAMA+JAB                                                                                       Prok-JAB           TRIADDRAFT_54406      509   eukaryota>metazoa>placozoa                            Trichoplax adhaerens                    hypothetical protein TRIADDRAFT_54406 [Trichoplax adhaerens].
      255085490    RAMA+JAB                                                                                       JAB                MICPUN_68730          333   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                   predicted protein, partial [Micromonas sp. RCC299].
      693497512    RAMA+JAB                                                                                       Prok-JAB           OT_ostta12g00050      540   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                      Tryptophan synthase beta subunit-like PLP-dependent enzymes superfamily [Ostreococcus tauri].
      145352800    RAMA+JAB                                                                                       Prok-JAB           OSTLU_26664           458   eukaryota>viridiplantae>chlorophyta                   Ostreococcus lucimarinus CCE9901        predicted protein [Ostreococcus lucimarinus CCE9901].
      303282299    RAMA+JAB                                                                                       Prok-JAB           MICPUCDRAFT_60251     603   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545             predicted protein [Micromonas pusilla CCMP1545].
      302836718    RAMA+JAB                                                                                       Cnd2               VOLCADRAFT_117378     903   eukaryota>viridiplantae>chlorophyta                   Volvox carteri f. nagariensis           hypothetical protein VOLCADRAFT_117378, partial [Volvox carteri f. nagariensis].
      545371515    RAMA+JAB                                                                                       DUF2076            COCSUDRAFT_46372      733   eukaryota>viridiplantae>chlorophyta                   Coccomyxa subellipsoidea C-169          hypothetical protein COCSUDRAFT_46372 [Coccomyxa subellipsoidea C-169].
      541962068    RAMA+JAB                                                                                       Prok-JAB           FSD1                  871   eukaryota>metazoa>chordata>vertebrata                 Falco cherrug                           PREDICTED: fibronectin type III and SPRY domain-containing protein 1 [Falco cherrug].
      # 94; ANKRD31-like                                                                                                                                                                                                                                       
      670997993    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              ANKRD31               2017  eukaryota>metazoa>chordata>vertebrata                 Ursus maritimus                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ursus maritimus].
      676274142    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              H920_09890            2013  eukaryota>metazoa>chordata>vertebrata                 Fukomys damarensis                      Ankyrin repeat domain-containing protein 31 [Fukomys damarensis].
      545185913    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1993  eukaryota>metazoa>chordata>vertebrata                 Equus caballus                          PREDICTED: putative ankyrin repeat domain-containing protein 31 [Equus caballus].
      742156297    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1957  eukaryota>metazoa>chordata>vertebrata                 Bison bison bison                       PREDICTED: putative ankyrin repeat domain-containing protein 31 [Bison bison bison].
      594655164    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1942  eukaryota>metazoa>chordata>vertebrata                 Balaenoptera acutorostrata scammoni     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Balaenoptera acutorostrata scammoni].
      528961800    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1940  eukaryota>metazoa>chordata>vertebrata                 Bos taurus                              PREDICTED: putative ankyrin repeat domain-containing protein 31 [Bos taurus].
      741883924    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1940  eukaryota>metazoa>chordata>vertebrata                 Bos taurus                              PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Bos taurus].
      694910754    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1931  eukaryota>metazoa>chordata>vertebrata                 Pan troglodytes                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pan troglodytes].
      767935664    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1931  eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Homo sapiens].
      675645695    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1930  eukaryota>metazoa>chordata>vertebrata                 Callithrix jacchus                      PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Callithrix jacchus].
      767935666    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1930  eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Homo sapiens].
      675645697    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1929  eukaryota>metazoa>chordata>vertebrata                 Callithrix jacchus                      PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Callithrix jacchus].
      593725202    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1922  eukaryota>metazoa>chordata>vertebrata                 Physeter catodon                        PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Physeter catodon].
      426233803    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1920  eukaryota>metazoa>chordata>vertebrata                 Ovis aries                              PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ovis aries].
      532027299    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              Ankrd31               1913  eukaryota>metazoa>chordata>vertebrata                 Microtus ochrogaster                    PREDICTED: putative ankyrin repeat domain-containing protein 31 [Microtus ochrogaster].
      752395703    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1913  eukaryota>metazoa>chordata>vertebrata                 Ailuropoda melanoleuca                  PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ailuropoda melanoleuca].
      767935668    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1907  eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Homo sapiens].
      675645699    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1906  eukaryota>metazoa>chordata>vertebrata                 Callithrix jacchus                      PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Callithrix jacchus].
      431907833    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              PAL_GLEAN10024895     1894  eukaryota>metazoa>chordata>vertebrata                 Pteropus alecto                         Ankyrin repeat domain-containing protein 31 [Pteropus alecto].
      556748271    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1889  eukaryota>metazoa>chordata>vertebrata                 Pantholops hodgsonii                    PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Pantholops hodgsonii].
      731240079    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1883  eukaryota>metazoa>chordata>vertebrata                 Fukomys damarensis                      PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Fukomys damarensis].
      731240081    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1882  eukaryota>metazoa>chordata>vertebrata                 Fukomys damarensis                      PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Fukomys damarensis].
      426384418    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1881  eukaryota>metazoa>chordata>vertebrata                 Gorilla gorilla gorilla                 PREDICTED: putative ankyrin repeat domain-containing protein 31 [Gorilla gorilla gorilla].
      471368209    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              LOC101360661          1877  eukaryota>metazoa>chordata>vertebrata                 Trichechus manatus latirostris          PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Trichechus manatus latirostris].
      544437792    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1877  eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Macaca fascicularis].
      544437794    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1876  eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Macaca fascicularis].
      635028857    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1876  eukaryota>metazoa>chordata>vertebrata                 Chlorocebus sabaeus                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Chlorocebus sabaeus].
      297294549    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1875  eukaryota>metazoa>chordata>vertebrata                 Macaca mulatta                          PREDICTED: ankyrin repeat domain-containing protein 31-like [Macaca mulatta].
      544437796    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1875  eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Macaca fascicularis].
      635028859    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1875  eukaryota>metazoa>chordata>vertebrata                 Chlorocebus sabaeus                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Chlorocebus sabaeus].
      332233855    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1874  eukaryota>metazoa>chordata>vertebrata                 Nomascus leucogenys                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Nomascus leucogenys].
      512006126    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1874  eukaryota>metazoa>chordata>vertebrata                 Mustela putorius furo                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Mustela putorius furo].
      634855178    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1874  eukaryota>metazoa>chordata>vertebrata                 Orycteropus afer afer                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Orycteropus afer afer].
      767935670    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1874  eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X4 [Homo sapiens].
      256574792    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1873  eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            putative ankyrin repeat domain-containing protein 31 [Homo sapiens].
      397478346    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1873  eukaryota>metazoa>chordata>vertebrata                 Pan paniscus                            PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pan paniscus].
      472393840    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1873  eukaryota>metazoa>chordata>vertebrata                 Odobenus rosmarus divergens             PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Odobenus rosmarus divergens].
      532086706    ANK+ANK++ANK+ANK+ANK+SbcC+ANK+ANK+RAMA                                                         Ank_3              Ankrd31               1869  eukaryota>metazoa>chordata>vertebrata                 Ictidomys tridecemlineatus              PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Ictidomys tridecemlineatus].
      585166062    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1869  eukaryota>metazoa>chordata>vertebrata                 Leptonychotes weddellii                 PREDICTED: putative ankyrin repeat domain-containing protein 31 [Leptonychotes weddellii].
      351698287    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Apc3               GW7_02855             1868  eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   Ankyrin repeat domain-containing protein 31 [Heterocephalus glaber].
      403256456    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1868  eukaryota>metazoa>chordata>vertebrata                 Saimiri boliviensis boliviensis         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Saimiri boliviensis boliviensis].
      560950350    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1868  eukaryota>metazoa>chordata>vertebrata                 Vicugna pacos                           PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Vicugna pacos].
      560927924    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1867  eukaryota>metazoa>chordata>vertebrata                 Camelus ferus                           PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Camelus ferus].
      743708583    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1867  eukaryota>metazoa>chordata>vertebrata                 Camelus bactrianus                      PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Camelus bactrianus].
      744549800    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1867  eukaryota>metazoa>chordata>vertebrata                 Camelus dromedarius                     PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Camelus dromedarius].
      759163878    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA+RAMA                                                          Ank_3              ANKRD31               1865  eukaryota>metazoa>chordata>vertebrata                 Pteropus vampyrus                       PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pteropus vampyrus].
      586560310    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA+RAMA                                                          Ank_3              ANKRD31               1863  eukaryota>metazoa>chordata>vertebrata                 Pteropus alecto                         PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Pteropus alecto].
      344272374    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1861  eukaryota>metazoa>chordata>vertebrata                 Loxodonta africana                      PREDICTED: putative ankyrin repeat domain-containing protein 31 [Loxodonta africana].
      507934536    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA+RAMA                                                          Ank_3              ANKRD31               1859  eukaryota>metazoa>chordata>vertebrata                 Condylura cristata                      PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Condylura cristata].
      602699333    ANK+ANK+ANK+SbcC+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA                                                 Ank_3              ANKRD31               1859  eukaryota>metazoa>chordata>vertebrata                 Lipotes vexillifer                      PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Lipotes vexillifer].
      395825692    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1857  eukaryota>metazoa>chordata>vertebrata                 Otolemur garnettii                      PREDICTED: ankyrin repeat domain-containing protein 31 [Otolemur garnettii].
      466014300    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1857  eukaryota>metazoa>chordata>vertebrata                 Orcinus orca                            PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Orcinus orca].
      568899965    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1857  eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Mus musculus].
      568984809    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1857  eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Mus musculus].
      640824341    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1857  eukaryota>metazoa>chordata>vertebrata                 Tarsius syrichta                        PREDICTED: putative ankyrin repeat domain-containing protein 31 [Tarsius syrichta].
      568899967    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1856  eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Mus musculus].
      568984811    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1856  eukaryota>metazoa>chordata>vertebrata                 Mus musculus                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Mus musculus].
      655893862    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1856  eukaryota>metazoa>chordata>vertebrata                 Oryctolagus cuniculus                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Oryctolagus cuniculus].
      755691260    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1856  eukaryota>metazoa>chordata>vertebrata                 Felis catus                             PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Felis catus].
      478492470    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1855  eukaryota>metazoa>chordata>vertebrata                 Ceratotherium simum simum               PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ceratotherium simum simum].
      544437798    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1853  eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X4 [Macaca fascicularis].
      664736023    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1850  eukaryota>metazoa>chordata>vertebrata                 Equus przewalskii                       PREDICTED: putative ankyrin repeat domain-containing protein 31 [Equus przewalskii].
      345798574    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1848  eukaryota>metazoa>chordata>vertebrata                 Canis lupus familiaris                  PREDICTED: putative ankyrin repeat domain-containing protein 31 [Canis lupus familiaris].
      296483786    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              BOS_10089             1847  eukaryota>metazoa>chordata>vertebrata                 Bos taurus                              TPA: ankyrin repeat domain 31 [Bos taurus].
      511894632    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1845  eukaryota>metazoa>chordata>vertebrata                 Mustela putorius furo                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Mustela putorius furo].
      533143043    ANK+ANK+MED12+ANK+ANK+ANK+ANK+RAMA                                                             Ank_3              Ankrd31               1844  eukaryota>metazoa>chordata>vertebrata                 Chinchilla lanigera                     PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Chinchilla lanigera].
      505839474    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1841  eukaryota>metazoa>chordata>vertebrata                 Sorex araneus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 [Sorex araneus].
      554526760    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1838  eukaryota>metazoa>chordata>vertebrata                 Myotis brandtii                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Myotis brandtii].
      512830241    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1833  eukaryota>metazoa>chordata>vertebrata                 Heterocephalus glaber                   PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Heterocephalus glaber].
      584062428    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1833  eukaryota>metazoa>chordata>vertebrata                 Myotis davidii                          PREDICTED: putative ankyrin repeat domain-containing protein 31 [Myotis davidii].
      548481845    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1826  eukaryota>metazoa>chordata>vertebrata                 Capra hircus                            PREDICTED: putative ankyrin repeat domain-containing protein 31 [Capra hircus].
      555956017    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1825  eukaryota>metazoa>chordata>vertebrata                 Bos mutus                               PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Bos mutus].
      594102027    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1824  eukaryota>metazoa>chordata>vertebrata                 Bubalus bubalis                         PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Bubalus bubalis].
      585716479    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1822  eukaryota>metazoa>chordata>vertebrata                 Elephantulus edwardii                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Elephantulus edwardii].
      767935672    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1815  eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X5 [Homo sapiens].
      504147955    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                   Ank_3              ANKRD31               1813  eukaryota>metazoa>chordata>vertebrata                 Ochotona princeps                       PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Ochotona princeps].
      586487981    ANK+ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA                                                      Ank_3              ANKRD31               1811  eukaryota>metazoa>chordata>vertebrata                 Chrysochloris asiatica                  PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Chrysochloris asiatica].
      589946321    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1806  eukaryota>metazoa>chordata>vertebrata                 Peromyscus maniculatus bairdii          PREDICTED: putative ankyrin repeat domain-containing protein 31 [Peromyscus maniculatus bairdii].
      507697054    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1804  eukaryota>metazoa>chordata>vertebrata                 Octodon degus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 [Octodon degus].
      685546121    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1797  eukaryota>metazoa>chordata>vertebrata                 Papio anubis                            PREDICTED: putative ankyrin repeat domain-containing protein 31 [Papio anubis].
      617548503    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_5              ANKRD31               1795  eukaryota>metazoa>chordata>vertebrata                 Erinaceus europaeus                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Erinaceus europaeus].
      674044047    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+MED7+ANK+ANK+RAMA                                              Ank_3              Ankrd31               1777  eukaryota>metazoa>chordata>vertebrata                 Nannospalax galili                      PREDICTED: putative ankyrin repeat domain-containing protein 31 [Nannospalax galili].
      348551138    ANK+ANK+ANK+ANK+ANK +RAMA                                                                      Ank_3              Ankrd31               1776  eukaryota>metazoa>chordata>vertebrata                 Cavia porcellus                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Cavia porcellus].
      558097829    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1762  eukaryota>metazoa>chordata>vertebrata                 Myotis lucifugus                        PREDICTED: putative ankyrin repeat domain-containing protein 31 [Myotis lucifugus].
      625212140    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1753  eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 isoform X1 [Cricetulus griseus].
      507539698    ANK+ANK+ANK+ANK+ANK+ANK+EVH1+ANK+ANK+RAMA                                                      Ank_3              LOC101605591          1748  eukaryota>metazoa>chordata>vertebrata                 Jaculus jaculus                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Jaculus jaculus].
      524922599    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              Ankrd31               1744  eukaryota>metazoa>chordata>vertebrata                 Mesocricetus auratus                    PREDICTED: putative ankyrin repeat domain-containing protein 31 [Mesocricetus auratus].
      667286971    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1743  eukaryota>metazoa>chordata>vertebrata                 Galeopterus variegatus                  PREDICTED: putative ankyrin repeat domain-containing protein 31 [Galeopterus variegatus].
      724797638    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1740  eukaryota>metazoa>chordata>vertebrata                 Rhinopithecus roxellana                 PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Rhinopithecus roxellana].
      591306729    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                   Ank_3              ANKRD31               1703  eukaryota>metazoa>chordata>vertebrata                 Panthera tigris altaica                 PREDICTED: putative ankyrin repeat domain-containing protein 31 [Panthera tigris altaica].
      731240083    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              Ankrd31               1695  eukaryota>metazoa>chordata>vertebrata                 Fukomys damarensis                      PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Fukomys damarensis].
      297675470    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1693  eukaryota>metazoa>chordata>vertebrata                 Pongo abelii                            PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Pongo abelii].
      507534796    ANK+ANK+SWC3+ANK+ANK+RAMA                                                                      Ank_3              Ankrd31               1655  eukaryota>metazoa>chordata>vertebrata                 Jaculus jaculus                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Jaculus jaculus].
      625271191    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              Ankrd31               1345  eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 isoform X2, partial [Cricetulus griseus].
      700388987    ANK+ANK+ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA                                                  Ank_3              ANKRD31               1664  eukaryota>metazoa>chordata>vertebrata                 Opisthocomus hoazin                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Opisthocomus hoazin].
      529449538    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1653  eukaryota>metazoa>chordata>vertebrata                 Falco peregrinus                        PREDICTED: putative ankyrin repeat domain-containing protein 31 [Falco peregrinus].
      690452132    ANK+ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA                                                      Ank_3              ANKRD31               1625  eukaryota>metazoa>chordata>vertebrata                 Pygoscelis adeliae                      PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pygoscelis adeliae].
      768382274    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1611  eukaryota>metazoa>chordata>vertebrata                 Aquila chrysaetos canadensis            PREDICTED: putative ankyrin repeat domain-containing protein 31 [Aquila chrysaetos canadensis].
      543729927    ANK+ANK+ANK+ANK+ANK+ANK+ANK+Imm22+ANK+ANK+RAMA                                                 Ank_3              ANKRD31               1608  eukaryota>metazoa>chordata>vertebrata                 Columba livia                           PREDICTED: putative ankyrin repeat domain-containing protein 31 [Columba livia].
      729767935    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1593  eukaryota>metazoa>chordata>vertebrata                 Haliaeetus leucocephalus                PREDICTED: putative ankyrin repeat domain-containing protein 31 [Haliaeetus leucocephalus].
      541975648    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1578  eukaryota>metazoa>chordata>vertebrata                 Falco cherrug                           PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Falco cherrug].
      700426650    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1564  eukaryota>metazoa>chordata>vertebrata                 Leptosomus discolor                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Leptosomus discolor].
      696971827    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1552  eukaryota>metazoa>chordata>vertebrata                 Cuculus canorus                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Cuculus canorus].
      513229707    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1548  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X8 [Gallus gallus].
      513229710    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1547  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X9 [Gallus gallus].
      513229713    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1545  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X10 [Gallus gallus].
      527247521    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1544  eukaryota>metazoa>chordata>vertebrata                 Melopsittacus undulatus                 PREDICTED: putative ankyrin repeat domain-containing protein 31 [Melopsittacus undulatus].
      513229716    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1538  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X7 [Gallus gallus].
      686604346    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1520  eukaryota>metazoa>chordata>vertebrata                 Aptenodytes forsteri                    PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Aptenodytes forsteri].
      513229721    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1514  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X11 [Gallus gallus].
      701430760    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1492  eukaryota>metazoa>chordata>vertebrata                 Chaetura pelagica                       PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Chaetura pelagica].
      663291212    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1491  eukaryota>metazoa>chordata>vertebrata                 Calypte anna                            PREDICTED: putative ankyrin repeat domain-containing protein 31 [Calypte anna].
      675413736    ANK+ANK+ANK+Imm22+ANK+ANK+RAMA                                                                 Ank_3              ANKRD31               1470  eukaryota>metazoa>chordata>vertebrata                 Manacus vitellinus                      PREDICTED: putative ankyrin repeat domain-containing protein 31 [Manacus vitellinus].
      701288139    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1437  eukaryota>metazoa>chordata>vertebrata                 Nestor notabilis                        PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Nestor notabilis].
      525026971    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1415  eukaryota>metazoa>chordata>vertebrata                 Ficedula albicollis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ficedula albicollis].
      695155706    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1400  eukaryota>metazoa>chordata>vertebrata                 Phalacrocorax carbo                     PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Phalacrocorax carbo].
      705661395    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1400  eukaryota>metazoa>chordata>vertebrata                 Chlamydotis macqueenii                  PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Chlamydotis macqueenii].
      697830380    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1397  eukaryota>metazoa>chordata>vertebrata                 Egretta garzetta                        PREDICTED: putative ankyrin repeat domain-containing protein 31 [Egretta garzetta].
      542159298    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1389  eukaryota>metazoa>chordata>vertebrata                 Zonotrichia albicollis                  PREDICTED: putative ankyrin repeat domain-containing protein 31 [Zonotrichia albicollis].
      719764347    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1389  eukaryota>metazoa>chordata>vertebrata                 Tinamus guttatus                        PREDICTED: putative ankyrin repeat domain-containing protein 31 [Tinamus guttatus].
      727001773    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1378  eukaryota>metazoa>chordata>vertebrata                 Corvus cornix cornix                    PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X5 [Corvus cornix cornix].
      669287973    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1356  eukaryota>metazoa>chordata>vertebrata                 Corvus brachyrhynchos                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Corvus brachyrhynchos].
      683923285    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1353  eukaryota>metazoa>chordata>vertebrata                 Serinus canaria                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Serinus canaria].
      513229724    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1333  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X12 [Gallus gallus].
      513229727    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                   Ank_3              ANKRD31               1330  eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X13 [Gallus gallus].
      704245972    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1304  eukaryota>metazoa>chordata>vertebrata                 Eurypyga helias                         PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31, partial [Eurypyga helias].
      699659375    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1291  eukaryota>metazoa>chordata>vertebrata                 Picoides pubescens                      PREDICTED: putative ankyrin repeat domain-containing protein 31 [Picoides pubescens].
      543281441    ANK+ANK+ANK+ANK+FUNDEAMN+ANK+ANK+RAMA                                                          Ank                ANKRD31               1270  eukaryota>metazoa>chordata>vertebrata                 Geospiza fortis                         PREDICTED: putative ankyrin repeat domain-containing protein 31 [Geospiza fortis].
      727001756    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1250  eukaryota>metazoa>chordata>vertebrata                 Corvus cornix cornix                    PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Corvus cornix cornix].
      543357448    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               1246  eukaryota>metazoa>chordata>vertebrata                 Pseudopodoces humilis                   PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Pseudopodoces humilis].
      727001759    ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                                   Ank_3              ANKRD31               1198  eukaryota>metazoa>chordata>vertebrata                 Corvus cornix cornix                    PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Corvus cornix cornix].
      706133765    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                       Ank_3              ANKRD31               1141  eukaryota>metazoa>chordata>vertebrata                 Colius striatus                         PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Colius striatus].
      700345054    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank                ANKRD31               1078  eukaryota>metazoa>chordata>vertebrata                 Haliaeetus albicilla                    PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Haliaeetus albicilla].
      675626531    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              ANKRD31               1006  eukaryota>metazoa>chordata>vertebrata                 Merops nubicus                          PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Merops nubicus].
      697030258    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              ANKRD31               1002  eukaryota>metazoa>chordata>vertebrata                 Fulmarus glacialis                      PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Fulmarus glacialis].
      698383920    ANK+ANK+ANK+SbcC+ANK+RAMA                                                                      Ank_3              ANKRD31               998   eukaryota>metazoa>chordata>vertebrata                 Gavia stellata                          PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Gavia stellata].
      701383969    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              ANKRD31               858   eukaryota>metazoa>chordata>vertebrata                 Tyto alba                               PREDICTED: putative ankyrin repeat domain-containing protein 31 [Tyto alba].
      694641918    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                LOC104029345          789   eukaryota>metazoa>chordata>vertebrata                 Pelecanus crispus                       PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pelecanus crispus].
      678179736    ANK+ANK+ANK+SbcC+ANK+RAMA                                                                      Ank_3              N328_02606            744   eukaryota>metazoa>chordata>vertebrata                 Gavia stellata                          Putative ankyrin repeat domain-containing protein 31, partial [Gavia stellata].
      679141382    ANK+ANK+ANK+SbcC+ANK+RAMA                                                                      Ank_3              AS28_06212            743   eukaryota>metazoa>chordata>vertebrata                 Pygoscelis adeliae                      Putative ankyrin repeat domain-containing protein 31, partial [Pygoscelis adeliae].
      675312958    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              AS27_09918            742   eukaryota>metazoa>chordata>vertebrata                 Aptenodytes forsteri                    Putative ankyrin repeat domain-containing protein 31, partial [Aptenodytes forsteri].
      676819697    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              Z169_10261            741   eukaryota>metazoa>chordata>vertebrata                 Egretta garzetta                        Putative ankyrin repeat domain-containing protein 31, partial [Egretta garzetta].
      679004453    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                N327_04278            741   eukaryota>metazoa>chordata>vertebrata                 Fulmarus glacialis                      Putative ankyrin repeat domain-containing protein 31, partial [Fulmarus glacialis].
      676584541    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N303_00948            740   eukaryota>metazoa>chordata>vertebrata                 Cuculus canorus                         Putative ankyrin repeat domain-containing protein 31, partial [Cuculus canorus].
      678221323    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N308_15574            740   eukaryota>metazoa>chordata>vertebrata                 Struthio camelus australis              Putative ankyrin repeat domain-containing protein 31, partial [Struthio camelus australis].
      449278665    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              A306_04927            739   eukaryota>metazoa>chordata>vertebrata                 Columba livia                           Ankyrin repeat domain-containing protein 31, partial [Columba livia].
      697419024    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N309_15508            739   eukaryota>metazoa>chordata>vertebrata                 Tinamus guttatus                        Putative ankyrin repeat domain-containing protein 31, partial [Tinamus guttatus].
      678997602    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N326_09043            738   eukaryota>metazoa>chordata>vertebrata                 Eurypyga helias                         Putative ankyrin repeat domain-containing protein 31, partial [Eurypyga helias].
      677552062    ANK+ANK+ANK+SbcC+ANK+RAMA                                                                      Ank_3              N306_04945            737   eukaryota>metazoa>chordata>vertebrata                 Opisthocomus hoazin                     Putative ankyrin repeat domain-containing protein 31, partial [Opisthocomus hoazin].
      483522412    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              Anapl_02880           735   eukaryota>metazoa>chordata>vertebrata                 Anas platyrhynchos                      Ankyrin repeat domain-containing protein 31, partial [Anas platyrhynchos].
      677470756    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                N334_10293            735   eukaryota>metazoa>chordata>vertebrata                 Pelecanus crispus                       Putative ankyrin repeat domain-containing protein 31, partial [Pelecanus crispus].
      676420350    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N302_11259            734   eukaryota>metazoa>chordata>vertebrata                 Corvus brachyrhynchos                   Putative ankyrin repeat domain-containing protein 31, partial [Corvus brachyrhynchos].
      679188311    ANK+ANK+ANK+RAMA                                                                               Ank_3              N305_02338            734   eukaryota>metazoa>chordata>vertebrata                 Manacus vitellinus                      Putative ankyrin repeat domain-containing protein 31, partial [Manacus vitellinus].
      676781911    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N300_04350            732   eukaryota>metazoa>chordata>vertebrata                 Calypte anna                            Putative ankyrin repeat domain-containing protein 31, partial [Calypte anna].
      683462737    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                N338_06128            732   eukaryota>metazoa>chordata>vertebrata                 Podiceps cristatus                      Putative ankyrin repeat domain-containing protein 31, partial [Podiceps cristatus].
      704177536    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              ANKRD31               730   eukaryota>metazoa>chordata>vertebrata                 Buceros rhinoceros silvestris           PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Buceros rhinoceros silvestris].
      677395853    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N330_10098            729   eukaryota>metazoa>chordata>vertebrata                 Leptosomus discolor                     Putative ankyrin repeat domain-containing protein 31, partial [Leptosomus discolor].
      676701437    ANK+ANK+ANK+RAMA                                                                               Ank_3              N320_10473            728   eukaryota>metazoa>chordata>vertebrata                 Buceros rhinoceros silvestris           Putative ankyrin repeat domain-containing protein 31, partial [Buceros rhinoceros silvestris].
      697455688    ANK+ANK+Tox-REase-7+ANK+RAMA                                                                   Ank                N301_04126            723   eukaryota>metazoa>chordata>vertebrata                 Charadrius vociferus                    Putative ankyrin repeat domain-containing protein 31, partial [Charadrius vociferus].
      678199929    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N307_02015            717   eukaryota>metazoa>chordata>vertebrata                 Picoides pubescens                      Putative ankyrin repeat domain-containing protein 31, partial [Picoides pubescens].
      678120939    ANK+ANK+ANK+RAMA                                                                               Ank_3              M959_03623            714   eukaryota>metazoa>chordata>vertebrata                 Chaetura pelagica                       Ankyrin repeat domain-containing protein 31, partial [Chaetura pelagica].
      677066606    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              N325_11692            713   eukaryota>metazoa>chordata>vertebrata                 Colius striatus                         Putative ankyrin repeat domain-containing protein 31, partial [Colius striatus].
      698470854    ANK+ANK+ANK+RAMA                                                                               Ank_4              LOC104167505          704   eukaryota>metazoa>chordata>vertebrata                 Cariama cristata                        PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31, partial [Cariama cristata].
      677277442    ANK+ANK+ANK+RAMA                                                                               Ank_3              N322_03767            687   eukaryota>metazoa>chordata>vertebrata                 Cariama cristata                        Putative ankyrin repeat domain-containing protein 31, partial [Cariama cristata].
      677432816    ANK+ANK+ANK+ANK+POTRA+RAMA                                                                     Ank_3              N331_11356            673   eukaryota>metazoa>chordata>vertebrata                 Merops nubicus                          Putative ankyrin repeat domain-containing protein 31, partial [Merops nubicus].
      677380247    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                N329_03825            667   eukaryota>metazoa>chordata>vertebrata                 Haliaeetus albicilla                    Putative ankyrin repeat domain-containing protein 31, partial [Haliaeetus albicilla].
      677447578    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                N333_13417            663   eukaryota>metazoa>chordata>vertebrata                 Nestor notabilis                        Putative ankyrin repeat domain-containing protein 31, partial [Nestor notabilis].
      694858262    ANK+RAMA                                                                                       Ank_4              LOC104019302          649   eukaryota>metazoa>chordata>vertebrata                 Nipponia nippon                         PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Nipponia nippon].
      677544370    ANK+RAMA                                                                                       Ank_2              Y956_09939            644   eukaryota>metazoa>chordata>vertebrata                 Nipponia nippon                         Putative ankyrin repeat domain-containing protein 31, partial [Nipponia nippon].
      677115369    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              N324_07686            633   eukaryota>metazoa>chordata>vertebrata                 Chlamydotis macqueenii                  Putative ankyrin repeat domain-containing protein 31, partial [Chlamydotis macqueenii].
      677495871    ANK+ANK+ANK+SbcC+ANK+RAMA                                                                      Ank                N337_13309            632   eukaryota>metazoa>chordata>vertebrata                 Phoenicopterus ruber ruber              Putative ankyrin repeat domain-containing protein 31, partial [Phoenicopterus ruber ruber].
      677223564    ANK+RAMA                                                                                       Ank_2              N323_01666            624   eukaryota>metazoa>chordata>vertebrata                 Cathartes aura                          Putative ankyrin repeat domain-containing protein 31, partial [Cathartes aura].
      679210087    ANK+ANK+ANK                                                                                    Ank_3              N336_07707            589   eukaryota>metazoa>chordata>vertebrata                 Phalacrocorax carbo                     Putative ankyrin repeat domain-containing protein 31, partial [Phalacrocorax carbo].
      678131372    ANK+RAMA                                                                                       -                  N340_11409            575   eukaryota>metazoa>chordata>vertebrata                 Tauraco erythrolophus                   Putative ankyrin repeat domain-containing protein 31, partial [Tauraco erythrolophus].
      676240185    RAMA                                                                                           -                  N312_04000            537   eukaryota>metazoa>chordata>vertebrata                 Balearica regulorum gibbericeps         Putative ankyrin repeat domain-containing protein 31, partial [Balearica regulorum gibbericeps].
      677478221    RAMA                                                                                           -                  N335_11233            534   eukaryota>metazoa>chordata>vertebrata                 Phaethon lepturus                       Putative ankyrin repeat domain-containing protein 31, partial [Phaethon lepturus].
      733925440    RAMA                                                                                           -                  LOC104914976          462   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Meleagris gallopavo].
      440910685    ANK+ANK+ANK+RAMA                                                                               Ank_3              M91_07742             738   eukaryota>metazoa>chordata>vertebrata                 Bos mutus                               Ankyrin repeat domain-containing protein 31 [Bos mutus].
      15207865     ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              -                     733   eukaryota>metazoa>chordata>vertebrata                 Macaca fascicularis                     hypothetical protein [Macaca fascicularis].
      281339402    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              PANDA_005463          733   eukaryota>metazoa>chordata>vertebrata                 Ailuropoda melanoleuca                  hypothetical protein PANDA_005463, partial [Ailuropoda melanoleuca].
      355691398    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              EGK_16593             733   eukaryota>metazoa>chordata>vertebrata                 Macaca mulatta                          hypothetical protein EGK_16593 [Macaca mulatta].
      528757498    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              CB1_001305002         703   eukaryota>metazoa>chordata>vertebrata                 Camelus ferus                           Ankyrin repeat domain protein 11 (Ankyrin repeat-containing cofactor-1)-like protein [Camelus ferus].
      537215000    ANK+RAMA                                                                                       Ank_5              H671_2g8049           599   eukaryota>metazoa>chordata>vertebrata                 Cricetulus griseus                      ankyrin repeat domain-containing protein 31 [Cricetulus griseus].
      470622172    RAMA                                                                                           -                  ANKRD31               489   eukaryota>metazoa>chordata>vertebrata                 Tursiops truncatus                      PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Tursiops truncatus].
      637259745    ANK+ANK+ANK+RAMA                                                                               Ank_3              ankrd31               1122  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Anolis carolinensis].
      637259749    ANK+ANK+ANK+RAMA                                                                               Ank_3              ankrd31               1115  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Anolis carolinensis].
      637259753    ANK+ANK+ANK+RAMA                                                                               Ank_3              ankrd31               1085  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Anolis carolinensis].
      637259758    ANK+ANK+ANK+RAMA                                                                               Ank_3              ankrd31               1053  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X4 [Anolis carolinensis].
      637259762    ANK+ANK+ANK+RAMA                                                                               Ank_3              ankrd31               1038  eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X5 [Anolis carolinensis].
      637259766    ANK+ANK+ANK+RAMA                                                                               Ank_3              ankrd31               981   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X6 [Anolis carolinensis].
      499004945    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              LOC101480116          695   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Maylandia zebra                         PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Maylandia zebra].
      542224388    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              LOC102077672          679   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Oreochromis niloticus                   PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Oreochromis niloticus].
      768948668    ANK+ANK+ANK+RAMA                                                                               Ank_3              LOC101078418          621   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Takifugu rubripes                       PREDICTED: tankyrase-like [Takifugu rubripes].
      548357001    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              LOC102212411          510   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Pundamilia nyererei                     PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Pundamilia nyererei].
      554822723    ANK+ANK+ANK+ANK+RAMA                                                                           Ank_3              LOC102308989          510   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Haplochromis burtoni                    PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Haplochromis burtoni].
      632974189    ANK+ANK+ANK+ANK+ANK+RADICAL-SAM+ANK+ANK                                                        Ank_3              ankrd31               1518  eukaryota>metazoa>chordata>vertebrata                 Callorhinchus milii                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Callorhinchus milii].
      632974191    ANK+ANK+ANK+ANK+ANK+RADICAL-SAM+ANK+ANK                                                        Ank_3              ankrd31               1511  eukaryota>metazoa>chordata>vertebrata                 Callorhinchus milii                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Callorhinchus milii].
      632974193    ANK+ANK+ANK+ANK+ANK+RADICAL-SAM+ANK+ANK                                                        Ank_3              ankrd31               1425  eukaryota>metazoa>chordata>vertebrata                 Callorhinchus milii                     PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Callorhinchus milii].
      697522283    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1822  eukaryota>metazoa>chordata>vertebrata                 Struthio camelus australis              PREDICTED: putative ankyrin repeat domain-containing protein 31 [Struthio camelus australis].
      641759975    ANK+ANK+ANK+ANK+RAMA                                                                           Ank                LOC101940798          1434  eukaryota>metazoa>chordata>vertebrata                 Chrysemys picta bellii                  PREDICTED: putative ankyrin repeat domain-containing protein 31 [Chrysemys picta bellii].
      699661481    ANK+ANK+ANK+Tox-REase-7+ANK+RAMA                                                               Ank                ANKRD31               1406  eukaryota>metazoa>chordata>vertebrata                 Charadrius vociferus                    PREDICTED: putative ankyrin repeat domain-containing protein 31 [Charadrius vociferus].
      672015439    ANK+ANK+ANK+SFII-RAD3+ANK+RAMA                                                                 Ank_3              Ankrd31               1048  eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                       PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Rattus norvegicus].
      672041375    ANK+ANK+ANK+SFII-RAD3+ANK+RAMA                                                                 Ank_3              Ankrd31               1177  eukaryota>metazoa>chordata>vertebrata                 Rattus norvegicus                       PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 isoform X2 [Rattus norvegicus].
      734646682    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              LOC104937873          385   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Larimichthys crocea                     PREDICTED: ankyrin repeat domain-containing protein 11-like [Larimichthys crocea].
      602639254    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       TctB               ANKRD31               1273  eukaryota>metazoa>chordata>vertebrata                 Python bivittatus                       PREDICTED: putative ankyrin repeat domain-containing protein 31 [Python bivittatus].
      449514661    ANK+ANK+ANK+ANK+ANK+RAMA                                                                       Ank_3              ANKRD31               1062  eukaryota>metazoa>chordata>vertebrata                 Taeniopygia guttata                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Taeniopygia guttata].
      507649885    ANK+ANK+ANK+ANK+ANK+ANK+MND1+ANK+ANK+RAMA                                                      Ank_3              ANKRD31               1616  eukaryota>metazoa>chordata>vertebrata                 Echinops telfairi                       PREDICTED: putative ankyrin repeat domain-containing protein 31 [Echinops telfairi].
      558168115    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               2337  eukaryota>metazoa>chordata>vertebrata                 Pelodiscus sinensis                     PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pelodiscus sinensis].
      465967576    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              UY3_11445             1388  eukaryota>metazoa>chordata>vertebrata                 Chelonia mydas                          Ankyrin repeat domain-containing protein 31 [Chelonia mydas].
      521020823    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK                                                                Ank_3              D623_10035941         1209  eukaryota>metazoa>chordata>vertebrata                 Myotis brandtii                         Ankyrin repeat domain-containing protein 31 [Myotis brandtii].
      620941514    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                               Ank_3              ANKRD31               1693  eukaryota>metazoa>chordata>vertebrata                 Ornithorhynchus anatinus                PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ornithorhynchus anatinus].
      641695633    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_5              ANKRD31               1705  eukaryota>metazoa>chordata>vertebrata                 Eptesicus fuscus                        PREDICTED: putative ankyrin repeat domain-containing protein 31 [Eptesicus fuscus].
      557323251    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1726  eukaryota>metazoa>chordata>vertebrata                 Alligator sinensis                      PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Alligator sinensis].
      612003933    ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                           Ank_3              ANKRD31               2041  eukaryota>metazoa>chordata>vertebrata                 Monodelphis domestica                   PREDICTED: putative ankyrin repeat domain-containing protein 31 [Monodelphis domestica].
      591380064    ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA                                                               Ank_3              ANKRD31               1198  eukaryota>metazoa>chordata>vertebrata                 Chelonia mydas                          PREDICTED: putative ankyrin repeat domain-containing protein 31 [Chelonia mydas].
      688556915    ANK+ANK+ANK+ANK+ANK                                                                            Ank_3              LOC103911116          847   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Danio rerio                             PREDICTED: putative ankyrin repeat domain-containing protein 31 [Danio rerio].
      395510523    ANK+ANK+ANK+ANK                                                                                Ank_3              LOC100921662          1245  eukaryota>metazoa>chordata>vertebrata                 Sarcophilus harrisii                    PREDICTED: ankyrin repeat domain-containing protein 31-like [Sarcophilus harrisii].
      # 16;                                                                                                                                                                                                                                        
      695436758    RAMA+PHD                                                                                       DDRGK              PHYSODRAFT_347238     1408  eukaryota>stramenopiles                               Phytophthora sojae                      hypothetical protein PHYSODRAFT_347238 [Phytophthora sojae].
      566028887    RAMA+PHD                                                                                       DUF1675            F443_04913            1314  eukaryota>stramenopiles                               Phytophthora parasitica P1569           hypothetical protein F443_04913 [Phytophthora parasitica P1569].
      567966240    RAMA+PHD                                                                                       DUF1675            L915_04781            1314  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein L915_04781 [Phytophthora parasitica].
      567994957    RAMA+PHD                                                                                       DUF1675            L916_04727            1314  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein L916_04727 [Phytophthora parasitica].
      568024215    RAMA+PHD                                                                                       DUF1675            L917_04629            1314  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein L917_04629 [Phytophthora parasitica].
      568054639    RAMA+PHD                                                                                       DUF1675            L914_04731            1314  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein L914_04731 [Phytophthora parasitica].
      570991851    RAMA+PHD                                                                                       DUF1675            F442_04975            1314  eukaryota>stramenopiles                               Phytophthora parasitica P10297          hypothetical protein F442_04975 [Phytophthora parasitica P10297].
      675186757    RAMA+PHD                                                                                       DUF1675            PPTG_09144            1314  eukaryota>stramenopiles                               Phytophthora parasitica INRA-310        hypothetical protein PPTG_09144 [Phytophthora parasitica INRA-310].
      566028888    RAMA+PHD                                                                                       DUF4407            F443_04913            1307  eukaryota>stramenopiles                               Phytophthora parasitica P1569           hypothetical protein, variant 1 [Phytophthora parasitica P1569].
      567966241    RAMA+PHD                                                                                       DUF1675            L915_04781            1307  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein, variant 1 [Phytophthora parasitica].
      567994958    RAMA+PHD                                                                                       DUF1675            L916_04727            1307  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein, variant 1 [Phytophthora parasitica].
      568024216    RAMA+PHD                                                                                       DUF1675            L917_04629            1307  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein, variant 1 [Phytophthora parasitica].
      568054640    RAMA+PHD                                                                                       DUF4407            L914_04731            1307  eukaryota>stramenopiles                               Phytophthora parasitica                 hypothetical protein, variant 1 [Phytophthora parasitica].
      570991852    RAMA+PHD                                                                                       DUF1675            F442_04975            1307  eukaryota>stramenopiles                               Phytophthora parasitica P10297          hypothetical protein, variant 1 [Phytophthora parasitica P10297].
      675186763    RAMA+PHD                                                                                       OmpH               PPTG_09144            1307  eukaryota>stramenopiles                               Phytophthora parasitica INRA-310        hypothetical protein, variant 1 [Phytophthora parasitica INRA-310].
      301095381    RAMA+PHD                                                                                       Lipase_chap        PITG_17280            1304  eukaryota>stramenopiles                               Phytophthora infestans T30-4            conserved hypothetical protein [Phytophthora infestans T30-4].
      669167767    RAMA+PHD                                                                                       DUF605             SDRG_15055            763   eukaryota>stramenopiles                               Saprolegnia diclina VS20                hypothetical protein SDRG_15055 [Saprolegnia diclina VS20].
      641537267    RAMA+PHD                                                                                       PHD                SPRG_04269            758   eukaryota>stramenopiles                               Saprolegnia parasitica CBS 223.65       hypothetical protein SPRG_04269 [Saprolegnia parasitica CBS 223.65].
      698784133    RAMA+PHD                                                                                       MSP1_C             H257_02952            737   eukaryota>stramenopiles                               Aphanomyces astaci                      hypothetical protein H257_02952 [Aphanomyces astaci].
      698784135    RAMA+PHD                                                                                       MSP1_C             H257_02952            713   eukaryota>stramenopiles                               Aphanomyces astaci                      hypothetical protein, variant [Aphanomyces astaci].
      301090511    RAMA+PHD                                                                                       V_ATPase_I         PITG_20831            1037  eukaryota>stramenopiles                               Phytophthora infestans T30-4            conserved hypothetical protein [Phytophthora infestans T30-4].
      # 9;                                                                                                                                                                                                                                         
      470649823    RAMA                                                                                           -                  LOC101339899          237   eukaryota>metazoa>chordata>vertebrata                 Tursiops truncatus                      PREDICTED: MPN domain-containing protein-like [Tursiops truncatus].
      675365883    RAMA                                                                                           -                  X975_21913            212   eukaryota>metazoa                                     Stegodyphus mimosarum                   MPN domain-containing protein, partial [Stegodyphus mimosarum].
      705695653    RAMA                                                                                           Adeno_terminal     LOC104483765          212   eukaryota>metazoa>chordata>vertebrata                 Chlamydotis macqueenii                  PREDICTED: MPN domain-containing protein-like, partial [Chlamydotis macqueenii].
      685554980    RAMA                                                                                           DUF2763            LOC103886366          210   eukaryota>metazoa>chordata>vertebrata                 Papio anubis                            PREDICTED: MPN domain-containing protein-like isoform X1 [Papio anubis].
      637356646    RAMA                                                                                           -                  LOC103280867          205   eukaryota>metazoa>chordata>vertebrata                 Anolis carolinensis                     PREDICTED: MPN domain-containing protein-like, partial [Anolis carolinensis].
      685554982    RAMA                                                                                           DUF2763            LOC103886366          200   eukaryota>metazoa>chordata>vertebrata                 Papio anubis                            PREDICTED: MPN domain-containing protein-like isoform X2 [Papio anubis].
      675756739    RAMA                                                                                           API5               LOC100401478          193   eukaryota>metazoa>chordata>vertebrata                 Callithrix jacchus                      PREDICTED: MPN domain-containing protein-like [Callithrix jacchus].
      504183573    RAMA                                                                                           -                  LOC101526050          170   eukaryota>metazoa>chordata>vertebrata                 Ochotona princeps                       PREDICTED: MPN domain-containing protein-like, partial [Ochotona princeps].
      565301529    RAMA                                                                                           -                  L345_15667            127   eukaryota>metazoa>chordata>vertebrata                 Ophiophagus hannah                      MPN domain-containing protein, partial [Ophiophagus hannah].
       3;                                                                                                                                                                                                                                         
      308805448    RAMA                                                                                           -                  Ot06g04230            441   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                      26S proteasome regulatory complex, subunit RPN11 (ISS) [Ostreococcus tauri].
      693499678    RAMA                                                                                           -                  OT_ostta06g03960      409   eukaryota>viridiplantae>chlorophyta                   Ostreococcus tauri                      unnamed product [Ostreococcus tauri].
      145347713    RAMA                                                                                           -                  OSTLU_32332           341   eukaryota>viridiplantae>chlorophyta                   Ostreococcus lucimarinus CCE9901        predicted protein [Ostreococcus lucimarinus CCE9901].
      # 2;                                                                                                                                                                                                                                         
      765549348    RAMA+PHD+PHD+PHD+BROMO+PHD+SJA/FYR+SET                                                         Atrophin-1         CAOG_001362           1884  eukaryota                                             Capsaspora owczarzaki ATCC 30864        mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864].
      470324569    RAMA+PHD+PHD+PHD+BROMO+PHD+SJA/FYR+SET                                                         FYVE_2             CAOG_01362            1858  eukaryota                                             Capsaspora owczarzaki ATCC 30864        mixed-lineage leukemia protein, partial [Capsaspora owczarzaki ATCC 30864].
      514687557    RAMA+PHD+PHD+PHD+PHD+SJA/FYR+SET                                                               DUF2413            PTSG_07559            2027  eukaryota>choanoflagellida                            Salpingoeca rosetta                     mixed-lineage leukemia protein [Salpingoeca rosetta].
      # 2;                                                                                                                                                                                                                                         
      669313787    RAMA                                                                                           -                  M513_01753            410   eukaryota>metazoa>nematoda                            Trichuris suis                          hypothetical protein M513_01753 [Trichuris suis].
      669222357    RAMA                                                                                           -                  TTRE_0000461101       320   eukaryota>metazoa>nematoda                            Trichuris trichiura                     hypothetical protein TTRE_0000461101 [Trichuris trichiura].
      # 2;                                                                                                                                                                                                                                         
      751771263    RAMA                                                                                           -                  LOC105226825          197   eukaryota>metazoa>hexapoda                            Bactrocera dorsalis                     PREDICTED: MPN domain-containing protein CG4751-like [Bactrocera dorsalis].
      751479038    RAMA                                                                                           -                  LOC105220345          195   eukaryota>metazoa>hexapoda                            Bactrocera cucurbitae                   PREDICTED: MPN domain-containing protein CG4751-like [Bactrocera cucurbitae].
      # 1;                                                                                                                                                                                                                                         
      545357568    RAMA+ZFCW+BRIGHT                                                                               ARID               COCSUDRAFT_48713      1111  eukaryota>viridiplantae>chlorophyta                   Coccomyxa subellipsoidea C-169          hypothetical protein COCSUDRAFT_48713 [Coccomyxa subellipsoidea C-169].
      761967687    RAMA+ZFCW                                                                                      zf-CW              MNEG_8869             296   eukaryota>viridiplantae>chlorophyta                   Monoraphidium neglectum                 hypothetical protein MNEG_8869 [Monoraphidium neglectum].
      545371504    RAMA+ZFCW                                                                                      zf-CW              COCSUDRAFT_60779      229   eukaryota>viridiplantae>chlorophyta                   Coccomyxa subellipsoidea C-169          hypothetical protein COCSUDRAFT_60779 [Coccomyxa subellipsoidea C-169].
      # 1;                                                                                                                                                                                                                                         
      676429688    RAMA                                                                                           -                  LOTGIDRAFT_237598     1826  eukaryota>metazoa>mollusca                            Lottia gigantea                         hypothetical protein LOTGIDRAFT_237598 [Lottia gigantea].
      692170605    RING+RAMA+BRCT+BRCT                                                                            zf-RING_5          DI09_127p30           539   eukaryota>fungi>microsporidia                         Mitosporidium daphniae                  Rad18-like protein [Mitosporidium daphniae].
      281208107    RAMA+BRCT+BRCT                                                                                 Herpes_TAF50       PPL_04709             1110  eukaryota>amoebozoa>mycetozoa>dictyosteliida          Polysphondylium pallidum PN500          hypothetical protein PPL_04709 [Polysphondylium pallidum PN500].
      66800521     RAMA+BRCT+BRCT                                                                                 DUF4175            DDB_G0293300          1217  eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium discoideum AX4            hypothetical protein DDB_G0293300 [Dictyostelium discoideum AX4].
      330844042    RAMA+BRCT+BRCT                                                                                 DUF4175            DICPUDRAFT_158874     1093  eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium purpureum                 hypothetical protein DICPUDRAFT_158874 [Dictyostelium purpureum].
      672817350    RAMA+PHD+PHD+PHD                                                                               RAG2_PHD           MVEG_11466            1018  eukaryota>fungi                                       Mortierella verticillata NRRL 6337      hypothetical protein MVEG_11466 [Mortierella verticillata NRRL 6337].
      733923653    RAMA+RRM                                                                                       DUF3584            LOC100540096          891   eukaryota>metazoa>chordata>vertebrata                 Meleagris gallopavo                     PREDICTED: scaffold attachment factor B1-like [Meleagris gallopavo].
      735859753    RAMA                                                                                           VAR1               SAMD00019534_002340   904   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1            hypothetical protein SAMD00019534_002340 [Acytostelium subglobosum LB1].
      676383955    RAMA+PARPFIN+TUDOR+RING                                                                        DUF3584            AURANDRAFT_62586      1445  eukaryota>stramenopiles                               Aureococcus anophagefferens             hypothetical protein AURANDRAFT_62586 [Aureococcus anophagefferens].
      470238462    RAMA+BRCT+BRCT                                                                                 PTCB-BRCT          DFA_12254             890   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium fasciculatum              hypothetical protein DFA_12254 [Dictyostelium fasciculatum].
      513226956    RAMA+DUF4417                                                                                   -                  MPND                  629   eukaryota>metazoa>chordata>vertebrata                 Gallus gallus                           PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein-like [Gallus gallus].
      735848583    RAMA                                                                                           Rrn6               SAMD00019534_125030   487   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Acytostelium subglobosum LB1            hypothetical protein SAMD00019534_125030 [Acytostelium subglobosum LB1].
      612395248    RAMA                                                                                           Nucleoplasmin      Bathy05g04310         487   eukaryota>viridiplantae>chlorophyta                   Bathycoccus prasinos                    predicted protein [Bathycoccus prasinos].
      642092905    RAMA                                                                                           -                  GSONMT00016346001     373   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Oncorhynchus mykiss                     unnamed protein product [Oncorhynchus mykiss].
      159478713    RAMA                                                                                           FAM222A            CHLREDRAFT_150353     1245  eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii               predicted protein, partial [Chlamydomonas reinhardtii].
      620987738    RAMA                                                                                           Mito_fiss_reg      LOC100089729          371   eukaryota>metazoa>chordata>vertebrata                 Ornithorhynchus anatinus                PREDICTED: MPN domain-containing protein-like [Ornithorhynchus anatinus].
      303277185    RAMA                                                                                           -                  MICPUCDRAFT_47146     361   eukaryota>viridiplantae>chlorophyta                   Micromonas pusilla CCMP1545             predicted protein [Micromonas pusilla CCMP1545].
      339246505    RAMA                                                                                           -                  Tsp_04073             324   eukaryota>metazoa>nematoda                            Trichinella spiralis                    hypothetical protein Tsp_04073 [Trichinella spiralis].
      546679818    RAMA                                                                                           -                  D910_07564            308   eukaryota>metazoa>hexapoda                            Dendroctonus ponderosae                 hypothetical protein D910_07564 [Dendroctonus ponderosae].
      543287426    RAMA                                                                                           Adeno_terminal     MPND                  415   eukaryota>metazoa>chordata>vertebrata                 Geospiza fortis                         PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein, partial [Geospiza fortis].
      762153126    RAMA                                                                                           -                  LOC105318954          886   eukaryota>metazoa>mollusca                            Crassostrea gigas                       PREDICTED: uncharacterized protein LOC105318954 [Crassostrea gigas].
      119616158    RAMA                                                                                           -                  hCG_1647033           186   eukaryota>metazoa>chordata>vertebrata                 Homo sapiens                            hCG1647033, partial [Homo sapiens].
      761962926    RAMA                                                                                           -                  MNEG_12234            74    eukaryota>viridiplantae>chlorophyta                   Monoraphidium neglectum                 hypothetical protein MNEG_12234, partial [Monoraphidium neglectum].
      633909077    RAMA                                                                                           -                  H632_c1317p0          248   eukaryota>viridiplantae>chlorophyta                   Helicosporidium sp. ATCC 50920          hypothetical protein H632_c1317p0 [Helicosporidium sp. ATCC 50920].
      675890068    RAMA                                                                                           -                  HELRODRAFT_165788     847   eukaryota>metazoa>annelida                            Helobdella robusta                      hypothetical protein HELRODRAFT_165788 [Helobdella robusta].
      671037731    RAMA                                                                                           -                  MPND                  501   eukaryota>metazoa>chordata>vertebrata                 Ursus maritimus                         PREDICTED: MPN domain-containing protein [Ursus maritimus].
      545702290    RAMA                                                                                           Filament           Gasu_54090            805   eukaryota>rhodophyta                                  Galdieria sulphuraria                   hypothetical protein Gasu_54090 [Galdieria sulphuraria].
      255072729    RAMA                                                                                           -                  MICPUN_98973          247   eukaryota>viridiplantae>chlorophyta                   Micromonas sp. RCC299                   predicted protein [Micromonas sp. RCC299].
      66823477     RAMA                                                                                           DUF4175            DDB_G0272516          465   eukaryota>amoebozoa>mycetozoa>dictyosteliida          Dictyostelium discoideum AX4            hypothetical protein DDB_G0272516 [Dictyostelium discoideum AX4].
      612386108    RAMA                                                                                           Daxx               Bathy16g00130         620   eukaryota>viridiplantae>chlorophyta                   Bathycoccus prasinos                    predicted protein [Bathycoccus prasinos].
      488545531    RAMA                                                                                           -                  MPND                  241   eukaryota>metazoa>chordata>vertebrata                 Dasypus novemcinctus                    PREDICTED: MPN domain-containing protein, partial [Dasypus novemcinctus].
      760437079    RAMA                                                                                           TT_ORF1            F751_2934             306   eukaryota>viridiplantae>chlorophyta                   Auxenochlorella protothecoides          MPN domain-containing protein [Auxenochlorella protothecoides].
      552813497    RAMA                                                                                           -                  CHLNCDRAFT_139765     582   eukaryota>viridiplantae>chlorophyta                   Chlorella variabilis                    hypothetical protein CHLNCDRAFT_139765 [Chlorella variabilis].
      443691830    RAMA                                                                                           -                  CAPTEDRAFT_211270     576   eukaryota>metazoa>annelida                            Capitella teleta                        hypothetical protein CAPTEDRAFT_211270 [Capitella teleta].
      159471497    RAMA                                                                                           JAB                CHLREDRAFT_188109     545   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii               predicted protein [Chlamydomonas reinhardtii].
      159476834    RAMA                                                                                           -                  CHLREDRAFT_187079     540   eukaryota>viridiplantae>chlorophyta                   Chlamydomonas reinhardtii               hypothetical protein CHLREDRAFT_187079 [Chlamydomonas reinhardtii].
      584005556    RAMA                                                                                           -                  LOC102779856          243   eukaryota>metazoa>chordata>vertebrata>actinopterygii  Neolamprologus brichardi                PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Neolamprologus brichardi].
      405977144    RAMA                                                                                           -                  CGI_10025066          531   eukaryota>metazoa>mollusca                            Crassostrea gigas                       hypothetical protein CGI_10025066 [Crassostrea gigas].
      701427628    RAMA                                                                                           -                  MPND                  236   eukaryota>metazoa>chordata>vertebrata                 Chaetura pelagica                       PREDICTED: MPN domain-containing protein, partial [Chaetura pelagica].
      761967974    RAMA                                                                                           MAT1               MNEG_8657             1281  eukaryota>viridiplantae>chlorophyta                   Monoraphidium neglectum                 hypothetical protein MNEG_8657 [Monoraphidium neglectum].
      260823234    RAMA+DOUBLECORTIN+DOUBLECORTIN+STYKIN                                                          CCDC66             BRAFLDRAFT_71623      2268  eukaryota>metazoa>chordata                            Branchiostoma floridae                  hypothetical protein BRAFLDRAFT_71623 [Branchiostoma floridae].
      612396523    RAMA+N6-MTase+ZFCW                                                                             DUF1421            Bathy04g03050         1310  eukaryota>viridiplantae>chlorophyta                   Bathycoccus prasinos                    predicted protein [Bathycoccus prasinos].
      551676387    AT-hook+BRIGHT/ARID+RAMA+TAMMBD                                                                ARID               GUITHDRAFT_132396     1403  eukaryota>cryptophyta                                 Guillardia theta CCMP2712               hypothetical protein GUITHDRAFT_132396 [Guillardia theta CCMP2712].
      551621810    RAMA+CHROMO                                                                                    Chromo             EMIHUDRAFT_224016     613   eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516              hypothetical protein EMIHUDRAFT_224016 [Emiliania huxleyi CCMP1516].
      551589812    RAMA+CHROMO+LRR+LRR+LRR+LRR                                                                    LRR_4              EMIHUDRAFT_114853     1118  eukaryota>haptophyceae                                Emiliania huxleyi CCMP1516              hypothetical protein EMIHUDRAFT_114853 [Emiliania huxleyi CCMP1516].
      302672755    RAMA+BULBLECTIN                                                                                B_lectin           SCHCODRAFT_238794     296   eukaryota>fungi>basidiomycota                         Schizophyllum commune H4-8              hypothetical protein SCHCODRAFT_238794 [Schizophyllum commune H4-8].
      636755953    RAMA                                                                                           -                  GLAREA_06546          312   eukaryota>fungi>ascomycota                            Glarea lozoyensis ATCC 20868            hypothetical protein GLAREA_06546 [Glarea lozoyensis ATCC 20868].
      459177714    RAMA                                                                                           -                  LOC100181315          872   eukaryota>metazoa>chordata                            Ciona intestinalis                      PREDICTED: uncharacterized protein LOC100181315 [Ciona intestinalis].
      
      --A limited number of prokaryotic operons are presented here.
      GI           Gene neighborhoods                                  Architectures                 Pfam-archs                     Gene-name         Len   Taxonomy                                       Species                                         Genbank
      # 26;                                                                                                                                                                                                                   
      501352122    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  395  bacteria>proteobacteria>alphaproteobacteria    Beijerinckia indica                             restriction endonuclease subunit M [Beijerinckia indica].                              <-501352115_?<-501352116_?<-501352117_?<-501352118_?<-501352119_?||754154301_?->501352121_?-><-501352122_N6-MTase+RAMA*||501352123_?->501352124_?->501352125_?-><-501352126_?<-754154302_?<-501352128_?||754154303_?->
      499204035    -                                                   N6-MTase+RAMA                 N6_N4_Mtase                    -                  381  archaea>euryarchaeota                          Thermoplasma acidophilum                        DNA methyltransferase [Thermoplasma acidophilum].                                      
      739196468    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 SP+N6_N4_Mtase                 -                  376  bacteria>proteobacteria>alphaproteobacteria    Rhizobium leguminosarum                         restriction endonuclease subunit M [Rhizobium leguminosarum].                          739196456_?->739196457_?-><-739196458_?||739196461_?-><-739196566_?||739196463_?->739196466_?->739196468_N6-MTase+RAMA*-><-739196470_?<-739196471_?<-739196474_?||739196567_?->739196477_?->739196569_?->739196480_?->
      501142320    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  375  bacteria>chloroflexi                           Herpetosiphon aurantiacus                       restriction endonuclease subunit M [Herpetosiphon aurantiacus].                        501142313_?->501142314_?->501142315_?->752637577_?->501142317_?-><-501142318_?<-501142319_?<-501142320_N6-MTase+RAMA*<-501142321_?<-752637579_?<-501142323_?<-501142324_?||501142325_?-><-501142326_?||501142327_?->
      512724354    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  373  bacteria                                       Chthonomonas calidirosea                        restriction endonuclease subunit M [Chthonomonas calidirosea].                         <-512724347_?||769177975_?-><-512724349_?<-769179139_?<-512724351_?<-769179140_?||512724353_?-><-512724354_N6-MTase+RAMA*||769179142_?->512724356_?->512724357_?-><-512724358_?<-512724359_?<-512724360_?||512724362_?->
      493197799    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  372  bacteria>spirochaetes                          Treponema vincentii                             restriction endonuclease subunit M [Treponema vincentii].                              <-493197792_?<-493197793_?<-748667390_?<-493197795_?||748667391_?->493197797_?->493197798_?->493197799_N6-MTase+RAMA*->748667392_?->493197801_?-><-493197802_?<-493197803_?<-493197804_?||493197805_?-><-493197806_?
      501069455    REase->?->?->N6-MTase+RAMA*->                       N6-MTase+RAMA                 N6_N4_Mtase                    -                  370  bacteria>chloroflexi                           Roseiflexus castenholzii                        restriction endonuclease subunit M [Roseiflexus castenholzii].                         501069448_?-><-501069449_?<-501069450_?||501069451_?->501069452_REase->501069453_?->752683981_?->501069455_N6-MTase+RAMA*-><-501069456_?<-501069457_?<-501069458_?<-501069460_?||501069461_?->501069462_?->501069463_?->
      496651971    HNH-><-?<-?||?-><-?<-?<-?<-N6-MTase+RAMA*<-REase    N6-MTase+RAMA                 N6_N4_Mtase                    -                  369  bacteria>proteobacteria>epsilonproteobacteria  Campylobacter sp. 10_1_50                       restriction endonuclease subunit M [Campylobacter sp. 10_1_50].                        736902613_HNH-><-496651959_?<-496651961_?||496651963_?-><-496651965_?<-496651967_?<-496651969_?<-496651971_N6-MTase+RAMA*<-496651973_REase<-496651975_?<-736902809_?<-496651979_?<-496651981_?<-496651983_?<-489029481_?
      503325698    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  368  bacteria>chloroflexi                           Anaerolinea thermophila                         restriction endonuclease subunit M [Anaerolinea thermophila].                          503325692_?-><-752816319_?<-752816320_?<-752815681_?<-752815682_?<-503325696_?<-752815683_?||503325698_N6-MTase+RAMA*->752816321_?->503325700_?->503325701_?->503325702_?-><-752815684_?||503325704_?->503325705_?->
      550983501    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  366  bacteria>proteobacteria>alphaproteobacteria    Thalassospira lucentensis                       restriction endonuclease subunit M [Thalassospira lucentensis].                        <-655387197_?<-703179655_?<-550983496_?<-550983497_?<-703179657_?||655387198_?-><-550983500_?<-550983501_N6-MTase+RAMA*||550983502_?->703179660_?->550983504_?->550983505_?->550983506_?->550983507_?->550983508_?->
      696308956    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  365  bacteria>fusobacteria                          Fusobacterium nucleatum                         DNA methyltransferase [Fusobacterium nucleatum].                                       492647398_?->492647397_?->552907250_?->552907251_?->492647394_?->492647392_?->492647391_?->696308956_N6-MTase+RAMA*->552907253_?->552907254_?->696308958_?->552907257_?->552907258_?->492627342_?->495970163_?->
      493739841    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  363  bacteria>tenericutes                           Ureaplasma parvum                               restriction endonuclease subunit M [Ureaplasma parvum].                                493739899_?->493739841_N6-MTase+RAMA*->493739954_?->755116483_?->755116485_?->755116486_?->493739937_?->493739927_?-><-493739876_?
      497185310    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  361  bacteria>proteobacteria>gammaproteobacteria    Moraxella macacae                               DNA methylase N-4/N-6 domain-containing protein [Moraxella macacae].                   <-750342964_?<-750342967_?||497185298_?->497185301_?->497185304_?->497185306_?-><-497185308_?<-497185310_N6-MTase+RAMA*<-497185312_?<-497185314_?||497185316_?->750342968_?-><-497185321_?<-497185605_?<-497185606_?
      503763750    <-REase<-N6-MTase+RAMA*<-?<-?<-?<-?<-?<-?<-ParB     N6-MTase+RAMA                 N6_N4_Mtase                    -                  361  bacteria>bacteroidetes                         Capnocytophaga canimorsus                       DNA methyltransferase [Capnocytophaga canimorsus].                                     <-754503245_?<-503763744_?<-754502974_?<-503763746_?<-503763747_?<-503763748_?<-503763749_REase<-503763750_N6-MTase+RAMA*<-503763751_?<-503763754_?<-503763755_?<-503763756_?<-503763757_?<-503763758_?<-503763759_ParB
      518849236    N6-MTase+RAMA*->?->?-><-?<-HNH<-?||?->McrB->        N6-MTase+RAMA                 N6_N4_Mtase                    -                  360  bacteria>spirochaetes                          Brachyspira innocens                            DNA methyltransferase [Brachyspira innocens].                                          <-518849229_?||518849230_?-><-703420857_?<-518849232_?||703420859_?->518849234_?->518849235_?->518849236_N6-MTase+RAMA*->518849237_?->518849238_?-><-518849239_?<-518849240_HNH<-518849241_?||518849242_?->518849243_McrB->
      654345013    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase+PCMT               -                  360  bacteria>cyanobacteria                         Mastigocoleus testarum                          restriction endonuclease subunit M [Mastigocoleus testarum].                           654345007_?-><-654345008_?||654345009_?-><-654345010_?||654345011_?->738308971_?->654345012_?->654345013_N6-MTase+RAMA*-><-654345014_?||654345015_?-><-738308930_?<-738308933_?||738308975_?->654345018_?->654345019_?->
      697093027    REase->N6-MTase+RAMA*->                             N6-MTase+RAMA                 N6_N4_Mtase                    -                  360  bacteria>tenericutes                           Mycoplasma collis                               restriction endonuclease subunit M [Mycoplasma collis].                                697093022_?->697093023_?->697093024_?->697093036_?->697093025_?->697093026_REase->697093027_N6-MTase+RAMA*-><-697093028_?<-697093029_?||697093030_?->697093037_?->697093031_?->738479282_?->697093032_?->
      737625856    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  360  bacteria>proteobacteria>alphaproteobacteria    Hyphomonas polymorpha                           restriction endonuclease subunit M [Hyphomonas polymorpha].                            737625844_?-><-737625847_?||737626074_?->737625850_?-><-737625853_?||737626077_?-><-737626080_?||737625856_N6-MTase+RAMA*-><-737625859_?<-737626083_?<-737625862_?||737625865_?->737625868_?->737625870_?->737625873_?->
      748143394    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  360  bacteria>cyanobacteria                         Scytonema millei                                restriction endonuclease subunit M [Scytonema millei].                                 <-748143391_?<-748143392_?<-748143393_?<-748143596_?<-748143597_?<-748143598_?||748143599_?-><-748143394_N6-MTase+RAMA*<-748143395_?<-748143396_?<-748143600_?||748143601_?-><-748143602_?<-748143397_?<-748143398_?
      446268888    <-REase<-N6-MTase+RAMA*                             N6-MTase+RAMA                 N6_N4_Mtase                    -                  359  bacteria>proteobacteria>epsilonproteobacteria  Helicobacter pylori                             DNA methyltransferase [Helicobacter pylori].                                           <-447055814_?||487802840_?-><-446116267_?||446003551_?->446761496_?-><-658502684_?<-727092483_REase<-446268888_N6-MTase+RAMA*<-446833673_?<-446375435_?<-447064608_?<-446148357_?<-446875834_?<-727086548_?<-446836634_?
      308225152    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    OSCT_3182          357  bacteria>chloroflexi                           Oscillochloris trichoides DG-6                  DNA methylase N-4/N-6 domain-containing protein [Oscillochloris trichoides DG-6].      308225190_?-><-308225191_?<-308225192_?||308225193_?->308225194_?->308225150_?->308225151_?->308225152_N6-MTase+RAMA*->308225153_?-><-308225154_?<-308225155_?<-308225156_?||308225157_?->308225158_?->308225159_?->
      568205957    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  357  bacteria>proteobacteria>alphaproteobacteria    Magnetospirillum gryphiswaldense                restriction endonuclease subunit M [Magnetospirillum gryphiswaldense].                 <-753897375_?<-568205951_?<-753897377_?||753897379_?->753897381_?->568205955_?->753897383_?->568205957_N6-MTase+RAMA*-><-568205958_?<-568205959_?<-568205960_?||753897386_?-><-568205962_?||568205963_?->753897389_?->
      665867244    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    -                  353  bacteria>proteobacteria>alphaproteobacteria    Agrobacterium tumefaciens                       restriction endonuclease subunit M, partial [Agrobacterium tumefaciens].               665867244_N6-MTase+RAMA*->489604643_?->648671957_?-><-489604641_?<-523694408_?||523694409_?->489604638_?->489604637_?->
      38906136     N6-MTase+RAMA*->                                    N6-MTase+RAMA                 Methyltransf_26+N6_N4_Mtase    -                  350  bacteria>firmicutes                            Staphylococcus sp. L1                           adenine DNA methyltransferase [Staphylococcus sp. L1].                                 38906136_N6-MTase+RAMA*->
      737789152    N6-MTase+RAMA*->REase->                             N6-MTase+RAMA                 N6_N4_Mtase                    -                  349  bacteria>bacteroidetes                         Flexibacter roseolus                            hypothetical protein, partial [Flexibacter roseolus].                                  737789149_?-><-652631573_?||737789152_N6-MTase+RAMA*->652631574_REase-><-652631575_?||737789146_?->
      354959883    N6-MTase+RAMA*->                                    N6-MTase+RAMA                 N6_N4_Mtase                    BJ6T_73150         344  bacteria>proteobacteria>alphaproteobacteria    Bradyrhizobium japonicum USDA 6                 adenine DNA methyltransferase [Bradyrhizobium japonicum USDA 6].                       354959876_?-><-354959877_?<-354959878_?||354959879_?->354959880_?-><-354959881_?<-354959882_?||354959883_N6-MTase+RAMA*->354959884_?->354959885_?-><-354959886_?<-354959887_?<-354959888_?<-354959889_?<-354959890_?
      # 11;                                                                                                                                                                                                                   
      651265592    <-URI+RAMA*                                         URI+RAMA                      GIY-YIG+DUF4357                -                  305  bacteria>proteobacteria>alphaproteobacteria    Acetobacter nitrogenifigens                     hypothetical protein [Acetobacter nitrogenifigens].                                    651265585_?->651265586_?->651265587_?->651265588_?-><-737395337_?||651265590_?->651265591_?-><-651265592_URI+RAMA*<-651265593_?<-651265594_?<-651265595_?||737395338_?->651265596_?->737395339_?->737395340_?->
      772620269    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                -                  295  bacteria>proteobacteria>betaproteobacteria     Comamonas aquatica                              excinuclease ABC subunit C [Comamonas aquatica].                                       772620263_?->772620264_?->772620265_?->772622099_?->772620266_?->772620267_?->772620268_?->772620269_URI+RAMA*->772620270_?->772620271_?->772620272_?-><-772620273_?||772620274_?->772620275_?-><-759660784_?
      494162448    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                -                  294  bacteria>cyanobacteria                         Synechococcus sp. RS9917                        excinuclease ABC subunit C [Synechococcus sp. RS9917].                                 <-494162439_?<-494162440_?||494162441_?->494162442_?->494162443_?->494162444_?->494162445_?->494162448_URI+RAMA*->494162449_?->494162450_?-><-740157707_?||494162452_?->494162454_?-><-494162455_?<-740157709_?
      736707052    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                -                  293  bacteria>proteobacteria>betaproteobacteria     Acidovorax sp. JHL-9                            excinuclease ABC subunit C [Acidovorax sp. JHL-9].                                     <-651308342_?<-651308343_?<-651308344_?<-651308345_?<-736707050_?<-651308347_?||651308348_?->736707052_URI+RAMA*->651308350_?-><-651308351_?<-651308352_?||651308353_?->736707081_?->651308354_?->736707082_?->
      759571980    URI+RAMA*->                                         URI+RAMA                      DUF4357                        -                  287  bacteria>proteobacteria>betaproteobacteria     Burkholderia pseudomallei                       hypothetical protein [Burkholderia pseudomallei].                                      759571971_?->759571973_?-><-740943948_?||759571977_?->740943944_?->759572002_?->759571978_?->759571980_URI+RAMA*->759571982_?->759572005_?->759571985_?->759571988_?->759572008_?->759572010_?-><-759572013_?
      763304606    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                -                  287  bacteria>firmicutes                            Salinibacillus aidingensis                      hypothetical protein [Salinibacillus aidingensis].                                     763304595_?->763304597_?->763304599_?->763304601_?->763304630_?->763304603_?->763304604_?->763304606_URI+RAMA*->763304608_?->763304610_?->763304611_?->763304631_?->763304613_?-><-763304633_?
      504865027    <-URI+RAMA*<-?||?-><-?<-URI+RAMA                    URI+RAMA                      GIY-YIG+DUF4357                -                  285  archaea>euryarchaeota                          Methanolobus psychrophilus                      hypothetical protein [Methanolobus psychrophilus].                                     <-504865020_?<-504865021_?<-504865022_?||504865024_?->851281505_?->504865025_?-><-851281508_?<-504865027_URI+RAMA*<-851281511_?||504865029_?-><-851281508_?<-504865030_URI+RAMA||504865031_?->504865032_?-><-504865033_?
      750016631    <-URI+RAMA*                                         URI+RAMA                      GIY-YIG+DUF4357                -                  285  bacteria>planctomycetes                        Blastopirellula marina                          hypothetical protein [Blastopirellula marina].                                         750016625_?-><-750016626_?<-488730430_?<-750016628_?||488730433_?->750016629_?-><-488730435_?<-750016631_URI+RAMA*<-488730437_?<-488730438_?||750015383_?->488730440_?->488730441_?->750015384_?->488730444_?->
      695943706    <-URI+RAMA*                                         URI+RAMA                      GIY-YIG+DUF4357                LI82_07720         283  archaea>euryarchaeota                          Methanococcoides methylutens                    hypothetical protein LI82_07720 [Methanococcoides methylutens].                        <-695943703_?||695943704_?-><-695943705_?<-695943706_URI+RAMA*||695943707_?->695943708_?->695943709_?->695943710_?-><-695943711_?<-695943712_?<-695943713_?
      737312527    <-URI+RAMA*                                         URI+RAMA                      GIY-YIG+DUF4357                -                  281  bacteria>firmicutes                            Brevibacillus thermoruber                       hypothetical protein [Brevibacillus thermoruber].                                      737312520_?-><-737312521_?<-737312523_?<-737311946_?||517953542_?->737311949_?-><-737312525_?<-737312527_URI+RAMA*<-737312529_?<-737312530_?<-737312532_?<-737312534_?<-737312535_?<-737312537_?||737312884_?->
      501422597    URI+RAMA*->HNH->?->McrB->McrC->                     URI+RAMA                      GIY-YIG+DUF4357                -                  279  bacteria>firmicutes                            Natranaerobius thermophilus                     hypothetical protein [Natranaerobius thermophilus].                                    752720267_?->501422593_?->501422594_?->501422595_?->501422520_?->752719768_?->501422596_?->501422597_URI+RAMA*->501422598_HNH->501422599_?->752720268_McrB->501422601_McrC->752720270_?->501422603_?->501422604_?->
      # 5;                                                                                                                                                                                                                              
      648575907    RAMA+CBS*->                                         RAMA+CBS                      SP+DUF4357+CBS                 -                  467  bacteria>actinobacteria                        Micromonospora sp. CNB394                       hypothetical protein [Micromonospora sp. CNB394].                                      <-517614212_?<-517614213_?<-517614214_?<-517614215_?<-517614216_?<-517614217_?||517614218_?->648575907_RAMA+CBS*-><-738393323_?||517614221_?->517614222_?-><-517614223_?||517614224_?-><-517614227_?<-738393325_?
      759777256    RAMA+CBS*->                                         RAMA+CBS                      DUF4357+CBS                    -                  460  bacteria>actinobacteria                        Streptomyces bingchenggensis                    hypothetical protein [Streptomyces bingchenggensis].                                   <-503942394_?<-503942395_?<-759781450_?<-503942397_?<-503942398_?<-759781453_?<-503942400_?||759777256_RAMA+CBS*-><-503942402_?||759781456_?->503942404_?->503942406_?->503942407_?->503942408_?-><-503942409_?
      754222116    <-RAMA+CBS*                                         RAMA+CBS                      DUF4357+CBS                    -                  457  bacteria>actinobacteria                        Actinoplanes missouriensis                      hypothetical protein [Actinoplanes missouriensis].                                     504253353_?->504253354_?->754222113_?-><-754220794_?||504253357_?-><-504253358_?<-754222114_?<-754222116_RAMA+CBS*||504253361_?->504253362_?->504253363_?->504253364_?->504253365_?-><-504253366_?<-754220795_?
      703061045    <-RAMA+CBS*                                         RAMA+CBS                      CBS                            -                  456  bacteria>actinobacteria                        Catenuloplanes japonicus                        hypothetical protein [Catenuloplanes japonicus].                                       <-703061033_?<-703061035_?<-703061037_?||703061039_?->703061041_?->703061043_?->703061672_?-><-703061045_RAMA+CBS*<-703061047_?<-703061049_?<-703061052_?<-703061054_?<-703061678_?||703061681_?->703061056_?->
      739497435    RAMA+CBS*->                                         RAMA+CBS                      DUF4357+CBS                    -                  453  bacteria>actinobacteria                        Amycolatopsis orientalis                        hypothetical protein [Amycolatopsis orientalis].                                       739497424_?->739497426_?-><-739497428_?||739497482_?-><-739497430_?<-739497432_?||739497434_?->739497435_RAMA+CBS*->739497483_?-><-739497437_?||739497439_?->739497441_?-><-739497484_?<-739497443_?||739497445_?->
      # 4;                                                                                                                                                                                                                              
      640573949    <-URI+RAMA*                                         URI+RAMA                      GIY-YIG+DUF4357                -                  275  bacteria>bacteroidetes                         Porphyromonas macacae                           methionine sulfoxide reductase [Porphyromonas macacae].                                738988659_?->640573943_?->640573944_?->640573945_?->640573946_?->640573947_?-><-640573948_?<-640573949_URI+RAMA*<-738988661_?<-640573951_?<-640573952_?<-640573953_?<-517171461_?<-640573954_?<-640573955_?
      649528608    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                M091_1691          247  bacteria>bacteroidetes                         Parabacteroides distasonis str. 3776 D15 i      GIY-YIG catalytic domain protein [Parabacteroides distasonis str. 3776 D15 i].         <-649528276_?||649528560_?-><-649528583_?||649528351_?->649528575_?->649528432_?->649528598_?->649528608_URI+RAMA*->649528373_?-><-649528566_?||649528459_?->649528613_?-><-649528574_?||649528362_?->649528551_?->
      596132180    <-URI+RAMA*                                         URI+RAMA                      GIY-YIG+DUF4357                M068_1150          245  bacteria>bacteroidetes                         Bacteroides fragilis str. J38-1                 GIY-YIG catalytic domain protein [Bacteroides fragilis str. J38-1].                    596132234_?->596132210_?->596132254_?->596132135_?->596132046_?->596132118_?-><-596132198_?<-596132180_URI+RAMA*<-596132083_?<-596132050_?<-596132066_?<-596132205_?<-596132209_?<-596132185_?<-596132235_?
      325482299    <-URI+RAMA*                                         URI+RAMA                      DUF4357                        HMPREF9303_0585    235  bacteria>bacteroidetes                         Prevotella denticola CRIS 18C-A                 hypothetical protein HMPREF9303_0585 [Prevotella denticola CRIS 18C-A].                325482290_?->325482255_?->325482244_?->325482263_?->325482315_?-><-325482295_?<-325482292_?<-325482299_URI+RAMA*<-325482335_?<-325482243_?<-325482249_?<-325482334_?||325482258_?->325482287_?->325482242_?->
      # 4;                                                                                                                                                                                                                              
      296267968    <-RAMA*                                             RAMA                          DUF2924                        HMPREF0731_0014    158  bacteria>proteobacteria>alphaproteobacteria    Roseomonas cervicalis ATCC 49957                hypothetical protein HMPREF0731_0014 [Roseomonas cervicalis ATCC 49957].               <-296267968_RAMA*<-296267969_?
      489363811    S-resolvase<-RAMA*                                  RAMA                          DUF2924                        -                  154  bacteria>proteobacteria>betaproteobacteria     Ralstonia solanacearum                          hypothetical protein [Ralstonia solanacearum].                                         746510525_?->489357851_?->489357852_?->489357853_?->755831621_?-><-755831623_?<-489367078_S-resolvase<-489363811_RAMA*<-489367076_?<-489367075_?<-489367073_?<-746524612_?<-755654450_?<-755654448_?||755830982_?->
      757691291    RAMA*->                                             RAMA                          DUF2924                        -                  148  bacteria>proteobacteria>gammaproteobacteria    Pseudomonas chloritidismutans                   hypothetical protein [Pseudomonas chloritidismutans].                                  <-757691263_?<-757691286_?<-757691265_?<-757691287_?<-757691289_?<-757691267_?||757691269_?->757691291_RAMA*->757691292_?-><-757691271_?<-757691272_?<-757691273_?||757691275_?->757691276_?->757691294_?->
      500033502    RAMA*->S-resolvase                                  RAMA                          DUF2924                        -                  144  bacteria>proteobacteria>alphaproteobacteria    Magnetococcus marinus                           hypothetical protein [Magnetococcus marinus].                                          <-753915544_?<-500033496_?<-500033497_?<-500033498_?<-500033499_?<-500033500_?||500033501_?->500033502_RAMA*->500033503_S-resolvase->753915546_?->753915551_?-><-500033506_?<-500033507_?<-500033508_?<-500033509_?
      # 3;                                                                                                                                                                                                                              
      159894763    <-PARB+HNH+RAMA*                                    PARB+HNH+RAMA                 DUF262+DUF1524                 Haur_5215          690  bacteria>chloroflexi                           Herpetosiphon aurantiacus DSM 785               protein of unknown function DUF262 (plasmid) [Herpetosiphon aurantiacus DSM 785].      <-159894756_?<-159894757_?<-159894758_?||159894759_?-><-159894760_?<-159894761_?<-159894762_?<-159894763_PARB+HNH+RAMA*<-159894764_?<-159894765_?<-159894766_?||159894767_?->159894768_?->159894769_?->159894770_?->
      488784593    PARB+HNH+RAMA*->                                    PARB+HNH+RAMA                 SP+DUF262+DUF1524              -                  689  bacteria>bacteroidetes                         Microscilla marina                              hypothetical protein [Microscilla marina].                                             <-488784585_?<-488784586_?<-488784587_?<-488784588_?<-770187064_?||488784590_?->770186989_?->488784593_PARB+HNH+RAMA*->488784595_?-><-488784597_?||488784598_?->488784601_?-><-488784602_?<-488784603_?||488784605_?->
      755032699    <-PARB+HNH+RAMA*                                    PARB+HNH+RAMA                 DUF262+DUF1524                 -                  672  bacteria>firmicutes                            Geobacillus thermoglucosidasius                 hypothetical protein [Geobacillus thermoglucosidasius].                                <-755032684_?<-755032686_?<-755032688_?<-755032690_?||755032692_?-><-755032695_?<-755032696_?<-755032699_PARB+HNH+RAMA*||755032700_?-><-503642854_?<-503642853_?<-755032703_?
      # 3;                                                                                                                                                                                                                              
      763144244    DCM->HNH->SeqA+RAMA*->                              SeqA+RAMA                     DUF4357                        -                  137  bacteria>proteobacteria>gammaproteobacteria    Vibrio vulnificus                               hypothetical protein [Vibrio vulnificus].                                              763144243_?->503337475_?->503337476_?->763144312_?->686222949_?->763144313_DCM->763144314_HNH->763144244_RAMA*->503337482_?->499393456_?->503337483_?->503337484_?->503337485_?-><-499462739_?<-763144245_?
      775294809    RAMA*->                                             RAMA                          -                              Aam_125_009        134  bacteria>proteobacteria>alphaproteobacteria    Acidocella aminolytica 101 = DSM 11237          hypothetical protein Aam_125_009 [Acidocella aminolytica 101 = DSM 11237].             <-775294802_?<-775294803_?||775294804_?->775294805_?-><-775294806_?||775294807_?->775294808_?->775294809_RAMA*->775294810_?->775294811_?->775294812_?->775294813_?->
      494322155    RAMA*->                                             RAMA                          DUF2924                        -                  126  bacteria>proteobacteria>betaproteobacteria     Burkholderia sp. Ch1-1                          DUF2924 domain-containing protein [Burkholderia sp. Ch1-1].                            737545229_?->494322149_?->494322150_?->494322151_?->494322152_?->494322153_?->494322154_?->494322155_RAMA*->737545230_?->494322157_?->737544494_?->737545231_?-><-494322159_?||494322160_?-><-737545232_?
      # 2;                                                                                                                                                                                                                              
      402258132    <-PARB+HNH+RAMA*                                    PARB+HNH+RAMA                 DUF262+DUF1524+DUF4357         B437_00700         693  bacteria>fusobacteria                          Fusobacterium hwasookii ChDC F128               hypothetical protein B437_00700 [Fusobacterium hwasookii ChDC F128].                   <-402258125_?<-402258126_?<-402258127_?||402258128_?->402258129_?->402258130_?-><-402258131_?<-402258132_PARB+HNH+RAMA*<-402258133_?<-402258134_?<-402258135_?<-402258136_?||402258137_?->402258138_?->402258139_?->
      489467259    <-PARB+HNH+RAMA*                                    PARB+HNH+RAMA                 DUF262+DUF1524+DUF4357         -                  674  bacteria>firmicutes                            Clostridium botulinum                           hypothetical protein [Clostridium botulinum].                                          489465522_?-><-489464004_?<-489464013_?<-489464728_?||737813847_?-><-489465217_?<-489469295_?<-489467259_PARB+HNH+RAMA*<-489466793_?<-489464181_?<-489465116_?<-489469379_?<-489466191_?<-489464120_?<-489469029_?
      # 2;                                                                                                                                                                                                                              
      655526129    <-HSDRN+DUF4268+RAMA*<-HNH                          HSDRN+DUF4268+RAMA            HSDR_N_2+DUF4357+DUF4268       -                  509  bacteria>bacteroidetes                         Prevotella sp. P6B4                             hypothetical protein [Prevotella sp. P6B4].                                            655526122_?->655526123_?->655526124_?->655526125_?->655526126_?->655526127_?->655526128_?-><-655526129_HSDRN+DUF4268+RAMA*<-655526130_HNH<-739035040_?||655526131_?->697058486_?->655526132_?->655526133_?->
      697058363    HNH->HSDRN+DUF4268+RAMA*->                          HSDRN+DUF4268+RAMA            HSDR_N_2+DUF4357+DUF4268       -                  509  bacteria>bacteroidetes                         Prevotella sp. P6B1                             hypothetical protein [Prevotella sp. P6B1].                                            <-697058485_?<-697058359_?<-697058360_?<-697058486_?<-697058361_?||697058487_?->697058362_HNH->697058363_HSDRN+DUF4268+RAMA*-><-655526128_?<-697058364_?<-697058365_?||697058366_?-><-697058367_?<-697058368_?<-655526118_?
      # 2;                                                                                                                                                                                                                              
      500693945    -                                                   REase+RAMA                    PDDEXK_4                       -                  358  archaea>euryarchaeota                          Methanococcus maripaludis                       hypothetical protein [Methanococcus maripaludis].                                      
      651957888    <-REase+RAMA*                                       REase+RAMA                    DUF91                          -                  333  bacteria>firmicutes                            Bacillus megaterium                             hypothetical protein [Bacillus megaterium].                                            <-651957878_?||651957879_?->651957880_?->651957881_?-><-651957882_?||651957884_?-><-651957886_?<-651957888_REase+RAMA*||651957890_?-><-651957892_?<-651957894_?
      # 2;                                                                                                                                                                                                                              
      740127358    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                -                  277  bacteria>synergistetes                         Synergistes jonesii                             hypothetical protein [Synergistes jonesii].                                            <-740127346_?||740127347_?->740127351_?->740127353_?-><-740127355_?<-740127722_?||740127356_?->740127358_URI+RAMA*->740127360_?->740127366_?->740127368_?->740127371_?->740127725_?-><-740127728_?<-740127373_?
      517958401    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                -                  276  bacteria>actinobacteria                        Enorma massiliensis                             hypothetical protein [Enorma massiliensis].                                            517958393_?->517958394_?-><-737113754_?<-517958396_?||517958398_?->737113756_?->737113031_?->517958401_URI+RAMA*->517958402_?->517958403_?->517958404_?->737113758_?-><-737113760_?<-517958407_?<-517958409_?
      # 2;                                                                                                                                                                                                                              
      495798143    T5orf172+RAMA*->                                    T5orf172+RAMA                 DUF4357                        -                  236  bacteria>synergistetes                         Jonquetella anthropi                            hypothetical protein [Jonquetella anthropi].                                           <-495798139_?<-495798140_?<-495798141_?<-495798142_?<-495796199_?<-495796200_?<-495796202_?||495798143_T5orf172+RAMA*->495798146_?-><-495798147_?<-495798148_?||495798149_?-><-495798150_?||495798151_?->737819285_?->
      490555808    T5orf172+RAMA*->                                    T5orf172+RAMA                 T5orf172+DUF4357               -                  223  bacteria>tenericutes                           Mycoplasma bovigenitalium                       protein, helicase [Mycoplasma bovigenitalium].                                         <-490555796_?||750242421_?->490555798_?->490555801_?->490555803_?->490555805_?->490555807_?->490555808_T5orf172+RAMA*->750242422_?->
      # 2;                                                                                                                                                                                                                              
      521961820    RAMA*->S-resolvase                                  RAMA                          DUF2924                        -                  165  bacteria>planctomycetes                        Zavarzinella formosa                            hypothetical protein [Zavarzinella formosa].                                           750609086_?-><-657929650_?||750609087_?->521961815_?->750609088_?->521961818_?->657929652_?->521961820_RAMA*->521961821_S-resolvase->521961822_?->521961823_?->521961824_?->521961825_?->521961826_?->521961827_?->
      655234456    S-resolvase<-RAMA*                                  RAMA                          DUF2924                        -                  155  bacteria>actinobacteria                        Nocardioides sp. JGI 0001009-J09                hypothetical protein [Nocardioides sp. JGI 0001009-J09].                               <-655234449_?<-655234450_?<-655234451_?<-655234452_?<-655234453_?<-655234454_?<-655234455_S-resolvase<-655234456_RAMA*||655234457_?->655234458_?->655234459_?->655234460_?->
      # 2;                                                                                                                                                                                                                              
      499350070    <-RAMA*                                             RAMA                          DUF4357                        -                  137  bacteria>actinobacteria                        Streptomyces coelicolor                         hypothetical protein [Streptomyces coelicolor].                                        21234283_?-><-21234284_?||21234285_?->21234286_?-><-21234287_?||21234288_?-><-21234289_?<-499350070_RAMA*<-21234291_?<-21234292_?||21234293_?->21234294_?->21234295_?->21234296_?->21234297_?->
      763031682    RAMA*->                                             RAMA                          DUF4357                        -                  133  bacteria>actinobacteria                        Kitasatospora griseola                          hypothetical protein [Kitasatospora griseola].                                         763031679_?->763032620_?->763032621_?->763031680_?-><-763031681_?<-763032622_?<-763032623_?||763031682_RAMA*->763031683_?->763031684_?->763031685_?-><-763032624_?||763031686_?->763031687_?-><-763031688_?
      # 1;                                                                                                                                                                                                                              
      326658949    <-N6-MTase+RAMA*                                    N6-MTase+RAMA                 HsdM_N+N6_Mtase+DUF4357+CBS    SACT1_4469         867  bacteria>actinobacteria                        Streptomyces griseus XylebKG-1                  N-6 DNA methylase [Streptomyces griseus XylebKG-1].                                    <-326658942_?||326658943_?->326658944_?->326658945_?->326658946_?->326658947_?->326658948_?-><-326658949_N6-MTase+RAMA*||326658950_?-><-326658951_?||326658952_?->326658953_?->326658954_?->326658955_?->326658956_?->
      545335724    RAMA*->                                             RAMA                          -                              -                  731  bacteria>actinobacteria                        Actinomyces johnsonii                           hypothetical protein [Actinomyces johnsonii].                                          545335723_?->545335724_RAMA*->736449647_?->545335726_?->545333444_?-><-545335727_?<-496526014_?<-545335728_?||545335729_?->
      649278169    RAMA*->                                             RAMA                          DUF1707+MCPVI+DUF605           -                  678  bacteria>actinobacteria                        Promicromonospora sukumoe                       hypothetical protein [Promicromonospora sukumoe].                                      518862026_?-><-518862027_?||518862028_?-><-518862029_?<-518862030_?||518862031_?-><-649278167_?||649278169_RAMA*-><-649278171_?<-703011700_?<-518862035_?<-518862036_?<-703011703_?<-518862038_?||518862039_?->
      516723468    ParB->?->ParB->?-><-PARB+HNH+RAMA*                  PARB+HNH+RAMA                 DUF262+DUF1524                 -                  706  bacteria>proteobacteria>alphaproteobacteria    Martelella mediterranea                         hypothetical protein [Martelella mediterranea].                                        <-516723460_?<-516723461_?||516723462_?->516723463_ParB->516723464_?->516723465_ParB->516723466_?-><-516723468_PARB+HNH+RAMA*||516723471_?-><-648481907_?||503898728_?->648481908_?->516723474_?->516723475_?->516723476_?->
      737135649    PARB+HNH+RAMA*->                                    PARB+HNH+RAMA                 DUF262+DUF1524                 -                  702  bacteria>actinobacteria                        Corynebacterium freneyi                         hypothetical protein [Corynebacterium freneyi].                                        <-737135688_?<-737135647_?||737135648_?->737135649_PARB+HNH+RAMA*-><-737135689_?<-737135690_?<-737135691_?<-737135651_?<-737135653_?<-737135654_?||737135656_?->
      742488091    PARB+HNH+RAMA*->                                    PARB+HNH+RAMA                 DUF262+DUF1524                 -                  712  bacteria>proteobacteria>alphaproteobacteria    Bradyrhizobium japonicum                        hypothetical protein, partial [Bradyrhizobium japonicum].                              742488083_?->742488084_?-><-742488085_?<-742488086_?<-742488087_?<-742488089_?||742488090_?->742488091_PARB+HNH+RAMA*-><-654689942_?<-654689943_?||654689944_?->654689945_?->654689946_?->654689947_?->742488092_?->
      548076194    PARB+HNH+RAMA*->                                    PARB+HNH+RAMA                 DUF262+DUF1524                 -                  676  bacteria>actinobacteria                        Cryptobacterium sp. CAG:338                     hypothetical protein [Cryptobacterium sp. CAG:338].                                    548076163_?->548076167_?-><-548076172_?||548076176_?->548076179_?->548076184_?->548076189_?->548076194_PARB+HNH+RAMA*-><-548076199_?<-548076204_?||548076210_?-><-548076215_?||548076219_?->548076222_?->548076225_?->
      664177934    <-PARB+HNH+RAMA*                                    PARB+HNH+RAMA                 DUF262+DUF1524                 -                  641  bacteria>actinobacteria                        Streptomyces griseus                            hypothetical protein [Streptomyces griseus].                                           <-664177917_?<-664177919_?<-664177921_?<-664177924_?||664177926_?-><-664177929_?||664177931_?-><-664177934_PARB+HNH+RAMA*||664177937_?-><-664177940_?<-664177943_?<-664177946_?<-497744080_?<-664177949_?<-664177952_?
      739317412    <-PARB+HNH+RAMA*                                    PARB+HNH+RAMA                 DUF262+DUF4357                 -                  843  bacteria>actinobacteria                        Rhodococcus fascians                            hypothetical protein [Rhodococcus fascians].                                           694052417_?->694052418_?->694052332_?->694052333_?->694052334_?->694052335_?->694052419_?-><-739317412_PARB+HNH+RAMA*||694052421_?->694052422_?-><-694052423_?<-694052424_?||694052336_?-><-694052337_?<-694052338_?
      739980329    <-McrC<-RAMA+EVE+McrB*                              RAMA+EVE+McrB                 DUF4357+AAA_5                  -                  599  bacteria>actinobacteria                        Streptomyces sp. NRRL B-5680                    ATPase AAA, partial [Streptomyces sp. NRRL B-5680].                                    <-663270903_?<-663270906_?<-663271111_?<-663271115_?||663270908_?-><-663270911_?<-663270915_McrC<-739980329_AAA++RAMA*<-663270921_?<-663270926_?||663270932_?-><-663270935_?<-663270938_?<-663270958_?<-739980313_?
      334108481    InPase-><-?||?->?->?-><-?||?-><-RAMA*               RAMA                          -                              Isova_2668         578  bacteria>actinobacteria                        Isoptericola variabilis 225                     hypothetical protein Isova_2668 [Isoptericola variabilis 225].                         334108474_InPase-><-334108475_?||334108476_?->334108477_?->334108478_?-><-334108479_?||334108480_?-><-334108481_RAMA*||334108482_?-><-334108483_?||334108484_?->334108485_?-><-334108486_?<-334108487_?||334108488_?->
      452756494    <-X+RAMA*                                           X+RAMA                        DUF1571+DUF4357                G418_29712         558  bacteria>actinobacteria                        Rhodococcus qingshengii BKS 20-40               hypothetical protein G418_29712 [Rhodococcus qingshengii BKS 20-40].                   452756487_?->452756488_?-><-452756489_?||452756490_?->452756491_?->452756492_?-><-452756493_?<-452756494_X+RAMA*<-452756495_?<-452756496_?||452756497_?->452756498_?-><-452756499_?<-452756500_?||452756501_?->
      304326663    <-X+RAMA*                                           X+RAMA                        -                              HMPREF0574_0913    554  bacteria>actinobacteria                        Mobiluncus curtisii subsp. curtisii ATCC 35241  hypothetical protein HMPREF0574_0913 [Mobiluncus curtisii subsp. curtisii ATCC 35241]. <-304326656_?<-304326657_?||304326658_?-><-304326659_?||304326660_?->304326661_?->304326662_?-><-304326663_X+RAMA*<-304326664_?<-304326665_?<-304326666_?||304326667_?->304326668_?->304326669_?->304326670_?->
      503535640    X+RAMA*-><-?<-?<-?<-?<-?||?-><-InPase               X+RAMA                        Dicty_REP                      -                  532  bacteria>actinobacteria                        Cellulomonas fimi                               hypothetical protein [Cellulomonas fimi].                                              <-503535633_?<-503535634_?||753798085_?->503535636_?->503535637_?->753797675_?->503535639_?->503535640_X+RAMA*-><-503535641_?<-503535642_?<-503535643_?<-503535644_?<-753798086_?||503535646_?-><-503535647_InPase
      502643352    <-RAMA*                                             RAMA                          PAT1                           -                  521  bacteria>actinobacteria                        Xylanimonas cellulosilytica                     hypothetical protein [Xylanimonas cellulosilytica].                                    <-502643345_?||502643346_?->502643347_?->502643348_?-><-502643349_?||502643350_?->502643351_?-><-502643352_RAMA*<-502643353_?||502643354_?->502643355_?->502643356_?-><-502643357_?||502643358_?-><-502643359_?
      365257398    X+RAMA*-><-?||?->?->?->?-><-InPase                  X+RAMA                        Med3                           HMPREF0045_01501   490  bacteria>actinobacteria                        Actinomyces graevenitzii C83                    hypothetical protein HMPREF0045_01501 [Actinomyces graevenitzii C83].                  365257391_?->365257392_?->365257393_?->365257394_?-><-365257395_?||365257396_?->365257397_?->365257398_X+RAMA*-><-365257399_?||365257400_?->365257401_?->365257402_?->365257403_?-><-365257404_InPase<-365257405_?
      515761987    X+RAMA*->?->?-><-?<-?<-?||?-><-InPase               X+RAMA                        -                              -                  460  bacteria>actinobacteria                        Actinomyces massiliensis                        hypothetical protein [Actinomyces massiliensis].                                       515761982_?->515761983_?->496008053_?->515761985_?->515761986_?->657869467_?->657869468_?->515761987_X+RAMA*->515761988_?->515761989_?-><-496004859_?<-496004863_?<-515761991_?||515761992_?-><-496005322_InPase
      585093370    HSDRN+HSDR-REase+RAMA*->                            HSDRN+HSDR-REase+RAMA         -                              KUTG_05615         435  bacteria>actinobacteria                        Kutzneria sp. 744                               type I restriction enzyme R protein [Kutzneria sp. 744].                               585093363_?->585093364_?-><-585093365_?<-585093366_?<-585093367_?||585093368_?->585093369_?->585093370_HSDRN+HSDR-REase+RAMA*-><-585093371_?<-585093372_?<-585093373_?<-585093374_?<-585093375_?||585093376_?->585093377_?->
      759995468    RAMA*->                                             RAMA                          -                              -                  420  bacteria>actinobacteria                        Nocardia vulneris                               hypothetical protein [Nocardia vulneris].                                              <-759995508_?<-759995453_?||759995456_?->759995459_?->759995462_?->759995465_?->759995513_?->759995468_RAMA*->759995471_?-><-759995474_?<-759995477_?<-759995480_?<-759995483_?<-759995486_?<-759995489_?
      269787547    <-HSDRN+HSDR-REase+RAMA*                            HSDRN+HSDR-REase+RAMA         HSDR_N_2                       Sthe_2269          394  bacteria>chloroflexi                           Sphaerobacter thermophilus DSM 20745            protein of unknown function DUF450 [Sphaerobacter thermophilus DSM 20745].             269787540_?->269787541_?->269787542_?->269787543_?->269787544_?->269787545_?-><-269787546_?<-269787547_HSDRN+HSDR-REase+RAMA*<-269787548_?||269787549_?-><-269787550_?||269787551_?->269787552_?->269787553_?-><-269787554_?
      485098347    <-HNH+RAMA*<-?<-N6-MTase                            HNH+RAMA                      HNH                            SFUL_6650          394  bacteria>actinobacteria                        Streptomyces fulvissimus DSM 40593              hypothetical protein SFUL_6650 [Streptomyces fulvissimus DSM 40593].                   485098340_?->485098341_?-><-485098342_?||485098343_?-><-485098344_?<-485098345_?||485098346_?-><-485098347_HNH+RAMA*<-485098348_?<-485098349_N6-MTase<-485098350_?<-485098351_?<-485098352_?<-485098353_?<-485098354_?
      601042837    X+RAMA*->                                           X+RAMA                        -                              N866_19315         375  bacteria>actinobacteria                        Actinotalea ferrariae CF5-4                     oxidoreductase [Actinotalea ferrariae CF5-4].                                          <-601042830_?<-601042831_?||601042832_?->601042833_?->601042834_?-><-601042835_?||601042836_?->601042837_X+RAMA*-><-601042838_?<-601042839_?<-601042840_?<-601042841_?
      655991010    HSDRN+HSDR-REase+RAMA*->                            HSDRN+HSDR-REase+RAMA         HSDR_N                         -                  355  bacteria>proteobacteria>alphaproteobacteria    Salinarimonas rosea                             hypothetical protein [Salinarimonas rosea].                                            <-655991005_?<-759845134_?<-655991006_?<-759845135_?<-759845067_?||655991008_?-><-655991009_?||655991010_HSDRN+HSDR-REase+RAMA*-><-655991011_?||759845068_?->759845136_?->655991012_?->759845137_?->655991013_?-><-655991014_?
      343968291    URI+RAMA*->                                         URI+RAMA                      GIY-YIG+DUF4357                l13_07900          324  bacteria>proteobacteria>betaproteobacteria     Neisseria weaveri ATCC 51223                    hypothetical protein l13_07900 [Neisseria weaveri ATCC 51223].                         343968284_?->343968285_?-><-343968286_?||343968287_?->343968288_?->343968289_?->343968290_?->343968291_URI+RAMA*->343968292_?->343968293_?-><-343968294_?<-343968295_?||343968296_?->343968297_?->343968298_?->
      494634130    <-X+RAMA*<-?<-N6-MTase                              X+RAMA                        DUF4357                        -                  306  bacteria>firmicutes                            Megasphaera sp. UPII 135-E                      methionine sulfoxide reductase [Megasphaera sp. UPII 135-E].                           <-494634068_?||494634072_?->738304214_?-><-494634127_?<-494634108_?||738304246_?-><-738304216_?<-494634130_X+RAMA*<-494634144_?<-494634151_N6-MTase<-494634114_?<-494634162_?<-738304218_?<-738304248_?<-494634096_?
      703488257    <-RAMA*                                             RAMA                          DUF4357                        -                  298  bacteria>actinobacteria                        Saccharothrix sp. NRRL B-16314                  hypothetical protein [Saccharothrix sp. NRRL B-16314].                                 703488342_?->703488252_?->703488253_?->703488254_?->703488255_?->703488343_?-><-703488256_?<-703488257_RAMA*||703488344_?->703488345_?->703488346_?-><-703488258_?<-703488259_?<-703488347_?<-703488260_?
      522152436    RAMA*->                                             RAMA                          -                              -                  285  bacteria>actinobacteria                        Amycolatopsis benzoatilytica                    hypothetical protein [Amycolatopsis benzoatilytica].                                   <-654457843_?<-522152430_?||522152431_?->522152432_?->522152433_?->654457844_?->522152435_?->522152436_RAMA*->522152437_?->522152438_?->522152439_?->522152440_?->522152441_?-><-522152442_?||736166120_?->
      219621053    Bifidobacterium                                     -                             SP                             BLA_0918           280  bacteria>actinobacteria                        Bifidobacterium animalis subsp. lactis AD011    conserved hypothetical protein [Bifidobacterium animalis subsp. lactis AD011].         subsp.
      502483234    URI+RAMA*->                                         URI+RAMA                      DUF4357                        -                  277  bacteria>actinobacteria                        Cryptobacterium curtum                          hypothetical protein [Cryptobacterium curtum].                                         502483227_?->502483228_?->502483229_?->502483230_?->502483231_?->752554187_?->502483233_?->502483234_URI+RAMA*->502483235_?->502483236_?->502483237_?->502483239_?->502483240_?-><-752554188_?||752554189_?->
      380878131    <-URI+RAMA*                                         URI+RAMA                      DUF4357                        Thi970DRAFT_03846  275  bacteria>proteobacteria>gammaproteobacteria    Thiorhodovibrio sp. 970                         LOW QUALITY PROTEIN: hypothetical protein Thi970DRAFT_03846 [Thiorhodovibrio sp. 970]. 380878124_?-><-380878125_?<-380878126_?||380878127_?->380878128_?->380878129_?-><-380878130_?<-380878131_URI+RAMA*<-380878132_?||380878133_?-><-380878134_?<-380878135_?||380878136_?->380878137_?->380878138_?->
      503259250    <-URI+RAMA*<-?<-REase                               URI+RAMA                      GIY-YIG                        -                  273  bacteria>actinobacteria                        Intrasporangium calvum                          excinuclease ABC subunit C [Intrasporangium calvum].                                   503259243_?->503259244_?->503259245_?->503259246_?->503259247_?->752639935_?->752639937_?-><-503259250_URI+RAMA*<-503259251_?<-503259252_REase||752638883_?-><-503259255_?<-752639938_?||752638885_?->752639940_?->
      353541191    RAMA*->                                             RAMA                          -                              FJSC11DRAFT_3600   167  bacteria>cyanobacteria                         Fischerella sp. JSC-11                          hypothetical protein FJSC11DRAFT_3600 [Fischerella sp. JSC-11].                        353541184_?-><-353541185_?<-353541186_?<-353541187_?<-353541188_?<-353541189_?||353541190_?->353541191_RAMA*->353541192_?->353541193_?->353541194_?->353541195_?->353541196_?->353541197_?->353541198_?->
      516723200    <-RAMA*                                             RAMA                          -                              -                  156  bacteria>proteobacteria>alphaproteobacteria    Martelella mediterranea                         hypothetical protein [Martelella mediterranea].                                        516723193_?-><-516723194_?||516723195_?->516723196_?->516723197_?->703369724_?->516723199_?-><-516723200_RAMA*<-516723201_?||703369699_?-><-703369731_?<-516723204_?<-648481875_?<-516723207_?<-516723208_?
      736354164    <-RAMA*                                             RAMA                          SeqA                           -                  153  bacteria>firmicutes                            Dehalobacter sp. FTH1                           hypothetical protein [Dehalobacter sp. FTH1].                                          <-521977782_?<-521977783_?||521977784_?-><-736355130_?||521977786_?->648464136_?->521977788_?-><-736354164_RAMA*<-736355132_?||648464137_?->736355134_?->521977793_?-><-736355137_?<-736355139_?||521977797_?->
      
      
      
      Back to Contents
      li>

      Multiple sequence alignment of the PWI domain found in ASC1

      %*
      ppp1r14a_Xenopus_laevis_147898606                                       -------------DVEKW-ID-EQMEE-LYLGREVD-------MPDE-VNIDDLLDL----ETDEDRRRSLQVILK-----SCTNN-----TEVFIRELLLRLK-GLQKQTLLKKNGLEVSSEE------------------------------------------------------------
      gigyf2_Xenopus_laevis_148223517                                         ------GANKCQDDFTQW-CE-KTMHA-INTAHSLD-------VPTF-VSF--LREV----ESPYEVHDYVCAYLG-----DTPEA-----KD-FSKQFIER-R-TKQKTSQHRPQ----QDVA-W---VTCQTSQANSQPITLEAVQCAGRKKKKQ---------------------------
      AT5G42950_Arabidopsis_thaliana_15239132                                 ----AVTKLTEANGFRDW-CKSECLRL-LGSEDTSV-------LEFC-LKL-----------SRSEAETLLIENLG-----SRDPD-----HK-FIDKFLNY-K-DLLPSEVVEIA--F-QSKG---SGVGT----------------------------------------------------
      NEMVEDRAFT_v1g247227_Nematostella_vectensis_156360915                   ----A---AASADDFTQW-CE-TTLRS-M-KATGVD-------IPTF-IMF--LKEV----ESPYEIHDYVKSYVG-----DTKEA-----RD-FAKEFIEK-R-KKDKHRSAPT-----PSPP------------------------------------------------------------
      NEMVEDRAFT_v1g180626_Nematostella_vectensis_156399329                   -----------------W-CV-DELSK-I-TDCGEE-------ITDY------ILHM----DNIEDVKEYLGGFLG-----QENPK-----QIEFLNILVQR-L-NEINPEFERAG--T------W----------VRKEKLEKETSPTKGN--------------------------------
      GIGYF2_Homo_sapiens_156766045                                           ------GVNKAQDGFTQW-CE-QMLHA-LNTANNLD-------VPTF-VSF--LKEV----ESPYEVHDYIRAYLG-----DTSEA-----KE-FAKQFLER-R-AKQKAN-------------------------------------------------------------------------
      CHLREDRAFT_189453_Chlamydomonas_reinhardtii_159468295                   -------ANAAPAPVPSW-CR-GKMVE-FFGNDDLT-------LVAF-L----YSSC----TSRSEVADYCQEYMR-----GKPNV-----ST-FVAEFLKR-K-DADVAAR------------------------------------------------------------------------
      MONBRDRAFT_33655_Monosiga_brevicollis_MX1_167533317                     MQVGT-SLAQVQPSFLAW-CK-KELVK-I-VNTEVD-------PETF-VSF--LLDI----GSTSEVIDYLSEMSS-----KPDQV-----RK-FAHEFMKN-R-VSAMSGTVAKG----IVKP--------------KPPMDDS---------------------------------------
      PHYPADRAFT_167456_Physcomitrella_patens_168039262                       -------------ALRQW-CE-AQLKK-LSADEDVT-------NVDF-SIVDFCISL----PSISEASDYLSQYLGTLPGVTQQHI-----QA-FRKEFVRR-K-EQLPSDGNASS----SDNE-Y------------SDALFGSNGKKVDR--------------------------------
      CC1G_01669_Coprinopsis_cinerea_okayama7#130_169844537                   ----P---VSPSHEFLKW-LS-ESLKG-LNSSVNVE-------EIIS-MLL--SFSL----DPDPTTIEIISDTIY-ASS-TTLDG-----RR-FAAEFVSR-R-KADAANRKGPN----AAKG-------------PTKPISIAEVVKA----------------------------------
      trip4_Xenopus_(Silurana)_tropicalis_187607944                           ----------MAADLLGW-CV-EELEKRFGLGVSED-------VVKY------ILSI----DKEEEIDEYINDLVQ-----APEDT-----KSLFTRELKLR-W-HRIRQPPASRT----------------------TSAFQRKDG-------------------------------------
      SCRG_02797_Saccharomyces_cerevisiae_RM11-1a_190408675                   M-----NNVSPRQEFIKW-CK-SQMKL----NSGIT-------NNNV-LEL--LLSL----PTGPESKELIQETIY-ANS-DVMDG-----RR-FATEFIKR-R-VACEKQGDDPL--S------W------------------------------NEALALSGNDDDGWE--FQVVSKKKGR-
      LOC100158827_Acyrthosiphon_pisum_193643347                              ---------MSKQEFTQWICD-KLTAL-LQFEVQND-------MAEY------IISM----QAERDIDEYCNSLLD-I---KSQVH-----KQ-FLTDLKKK-H-RIFNQKPTSSV--Q-NTRP-------------PKKNTKSSENEDPQK--------------------------------
      TRIADDRAFT_57541_Trichoplax_adhaerens_196007084                         --S-N-TAYNPENALIQW-CE-RRLAG---IETTID-------IPTF-ARF--ISSL----GSPAEVRDYLKMYLG-----STEEV-----KN-FFNDYVRK-R-RECESQQFPKA--S-PGNM---KETTT----------------------------------------------------
      PPP1R14B_Homo_sapiens_20162550                                          -------------NLEEW-IL-EQLTR-LYDCQEEE-------IPELEIDVDELLDM----ESDDARAARVKELLV-----DCYKP-----TEAFISGLLDK-I-RGMQKLSTPQK--K-----------------------------------------------------------------
      PHATR_43835_Phaeodactylum_tricornutum_CCAP_1055/1_219113419             -------GAKMSPSFEKW-CK-DQVYK-LNGTDDLT-------LVAF------CMTL----QDPGEIRQYLSAYLG-----STPQV-----NS-FATEFINR-K-A------------------------------------------------------------------------------
      PHATRDRAFT_19937_Phaeodactylum_tricornutum_CCAP_1055/1_219118084        -------PSTNKTETIQW-CS-DALHD-LLGFADTA-------LASY------LVSV----AKKATQSSEIVQILV--DG-DVRDVTPERMER-FAEQLLSH-A-RPTPKQSHGGP----ASRQ---AKA------------------------------------------------------
      PHATRDRAFT_46184_Phaeodactylum_tricornutum_CCAP_1055/1_219119804        -------------------LN-SKFAQ-I-LGFEDG-------VDDV-VDH--LLTI----DSKEDLSDYLSQLLG---S-LSVEG-----KK-FVDDIEKF-K-RGVPIEPMILV--V-PDKA--------------EKSEQTHDLP------------------------------------
      AT1G32490_Arabidopsis_thaliana_22329903                                 ---------MASNDLKTW-VS-DKLMM-LLGYSQAA-------VVNY-L----IAMA----KKTKSPTELVGELVD-YGF-SSSGD-T---RS-FAEEIFAR----------------------------------------------------------------------------------
      Dmel_CG11710_Drosophila_melanogaster_24643521                           --------------MENF-LR-GTLSK---CLDCVI-------TDQM-LAA--ILNI----KDDYEFDNYFGNLLS---E-DNEEH-----RM-FLVNC----R-RMLLSGKQPRN----NGKY-L------------SPPLAPTSPNCPKQ--------------------------------
      PPL_07241_Polysphondylium_pallidum_PN500_281206218                      MAQ-V-NEPLTWESLERW-TT-EKLQK-MLGFPPDE-------IIRY------IFAA----ESNSDIDNYLVDLLG-----NTKKT-----RT-FIEQLTKK-I-NSLPRRIVKP---------------------------------------------------------------------
      PPL_04211_Polysphondylium_pallidum_PN500_281208347                      ----V-QAPQPKADFIKW-CH-QQLKP-LTNMDVAT-------VTEL------LCSL----KTENEIRECAKECLG-----YSSEV-----TN-FINDYLMA-R-GDEPGLQFEAS----------------------SP--------------------------------------------
      NAEGRDRAFT_57247_Naegleria_gruberi_strain_NEG-M_290995819               --F---GQADVSQEFKDW-FK-KGLKK-LNKSVDPS-------FMYF------LLSL----NSEKETIDYMSEYIG-----NSSAA-----QK-FAQEFIAN-K-SFENPNQGANK----KK--------------------------------------------------------------
      PITG_01508_Phytophthora_infestans_T30-4_301120876                       A-F---GSNNVSSEFMTW-AL-RHLKA-IDSNADVT-------LLEY------CATL----EDPGEVREYLAAYLG-----STPRV-----SA-FATEFIQR-K-KTQHSGKKSTG--N-QDAQ--------------QRASETGSSNKRGK--R-----------------------------
      SELMODRAFT_438614_Selaginella_moellendorffii_302760897                  --S-T-GPNAEAKAFRQW-CE-SQMKK-LTGNDDMT-------LLEF------CLSL----PSSAEAGEYLTQYLG-----STANV-----QA-FKSELLQR-K-ELLPVEALRSV---------F-------TV---SDAVISEDWKQASR------GQGQ----------------------
      SELMODRAFT_442631_Selaginella_moellendorffii_302786242                  -------------SFRSW-CE-TQIKE-LTGSSDMR-------QLDY------CVSL----PSVLEAERSLTQYLG-----ESSDA-----QA-FKGEFLRC-R-ELMTPQMLQVF----NSGP--------------KSDTKVEG--------------------------------------
      VOLCADRAFT_92322_Volvox_carteri_f_nagariensis_302840495                 ---------AAEREVRTW-VQ-DQLHR-LLGFADAN-------VAAF------LISI----ARKHTSADSLFTDLK-RSC-NLPNT-SDV-QS-FAAELLRR----------------------------------------------------------------------------------
      VOLCADRAFT_108378_Volvox_carteri_f_nagariensis_302854783                ------GEIQMSAEFRNW-CR-SKMVE-FFGNDDLS-------LVHF-L----LT-V----NSRSEVADYCQVYMR-----GKPNV-----ST-FVADFLKR-K-DAELARQ------------------------------------------------------------------------
      EAI_14891_Harpegnathos_saltator_307207345                               ------QNTAKTDEFTQW-CT-KALNG-L--QASVD-------IPTF-VGF--LRDI----ESAYEVKEYVRVYLG-----DTKQS-----TE-FAKQFLEK-R-SKWRSAQRPQA----QADD-L-CKP--------APAVNPNA--------------------------------------
      GSOID_T00013439001_Oikopleura_dioica_313227523                          ------------DPLLSW-AR-VELEL-IPNSNTVD-------VPTF-VSF--LREV----EHDYEVEDYSRQFFG-----NGKEV-----LN-FAHKFMEK-R-KQIRNEGPAKK----------------------SKGKKKNKNKATAD---------------------LLGFTPAAGDQ
      GSOID_T00030811001_Oikopleura_dioica_313240030                          --------------MRQW-IA-HQMEL-LLGFSDAG-------LISA------IENQ----KSESELRKYCGELLG--DQ-----------KS-FVDELVKR-K-FGNSASQNNNQ----GGNV---SSGNS----------------------------------------------------
      TRIP4_Homo_sapiens_32189376                                             MAV---AGAVSGEPLVHW-CT-QQLRKTFGLDVSEE-------IIQY------VLSI----ESAEEIREYVTDLLQ-----GNEGK-K---GQ-FIEELITK-WQKNDQELISDPL----QQCF--------------KKDEILDGQKSGDHL-------------------------------
      DICPUDRAFT_150853_Dictyostelium_purpureum_330797588                     -------PGQARPDFIKW-CH-QQLKA-LTNNDVTV-------ITEL------LCSL----KTESEIRECAKECLG-----YTSDV-----DN-FINDYLLA-R-SDEPGLTFESS----------------------SPVITIPIKKQPKSQTA------------------STTASKQKKKK
      LOC100637876_Amphimedon_queenslandica_340373604                         ----F-TGASPDEAFTDW-CK-RELDK---YNSDVD-------AVTF-VSL--LLEV----ESTYEVHDYIKSFLG-----ETDEV-----HS-FAKEFLER-R-QKIRNYKQTKT----QQQS--------------KKPSAGS---------------------------------------
      RTG_02889_Rhodotorula_glutinis_ATCC_204091_342319142                    ----------MPTSLETW-VS-DNLLV-LLGASDST-------TTAY------FITL----AQSSPSASALVQTLT--QN-GLSDS-AQT-RR-FAADLFDR-A-PRKTSKRAEQN----AADA--------------RRKAEKEKKQL-----------------------------------
      GIGYF2_Gallus_gallus_356460922                                          ------GVNKAQDGFTQW-CE-QMLHA-LNTANNLD-------VPTF-VSF--LKEV----ESPYEVHDYIRAYLG-----DTPEA-----KE-FAKQFLER-R-AKQKASQQRQQ----QQQQDS----------------------------------------------------------
      dZ221I3.1-001_Danio_rerio_37606133                                      --------------------------------------------------Y--ILSI----DNADEIVEYVGDLLQ-----GTEGN-K---QE-FVDELVQR-W-QKCQTQTSEGL----GGVL--------------RKEAVMEELDTAPK------------------------DTQKKSKR
      RO3G_00766_Rhizopus_delemar_RA_99-880_384483882                         ----------PSEDFRRW-CR-KALRG---LNSGVN-------EDEI-LDM--LLMF----PVDSSTAEIIEDVIY---A-NSVRI-DG--KR-FAQEFMRR-R-KADLAGRSD----------------------------------------------------------------------
      LOC100889908_Strongylocentrotus_purpuratus_390357703                    ----F-QSNPPSSEFGQW-CE-MELKR---MRPPVD-------VPTF-VSF--LQDI----DSPYEVHEYVKMYLG-----DTPES-----SN-FARQFLER-R-SRQRDQQRQQK--E-VESA-W--------L---GNKSSLP---------------------------------------
      LOC581973_Strongylocentrotus_purpuratus_390357928                       --------MAVSPSLVTW-CS-EELSL---LLASET-------TDEF-VSY--LLAI----RDDQELREYLSQLLD-----SSSKK-N---AE-FVEELMRR-R-NSGPALPQNFT--V------Y------------RKSTTQEESKK-----------------------------------
      CGI_10008333_Crassostrea_gigas_405951638                                --------AAASVSFEDW-LG-ERLTS---LNPEVD-------TEVF-VTY--ITGILETETDEEDTRESILGILG-EVL-EEEKE-----TV-MCDEIVQR-W-NQEHSKQAKAE----TDTS--------------TLATQLSEILENQK--------------------------------
      LOC101238576_Hydra_vulgaris_449668186                                   ----F-QAPTPSSDFINW-VE-RNLPA---VKKAID-------VPTV-VSF--LQDI----ESPYEIHDYMRSYLG-----DCNEQ-----KE-FVREFLER-R-KKTWQQQKQRS----PTVP------------------------------------------------------------
      CELE_C18H9.3_Caenorhabditis_elegans_453231778                           ----------ATDELQQW-FV-KRFQQ---FSTQVD-------SSTL-FDC--IMSL----ENPNEVEDIVMSYLD-----ESKTV-----KE-FVREFIKR-R-IAMRAAGGRPD----------------------ADDLTSARTAAAAP--------------------------------
      LOC100186532_Ciona_intestinalis_459185799                               -------MASGTMSLSQW-VN-KELVK-LLGIAETD-------VEDL-SSY--LVTI----DNPEELTTFVTELLS-ENG-TESGL-DGPKKQ-FIRDLLGR-W-ERVR---------------------------------------------------------------------------
      DFA_02695_Dictyostelium_fasciculatum_470247412                          ------------SDFIKW-CH-QQLKP-LTNMDVAT-------VTEL------LCSL----KKESEIRECARECLG-----FTTEV-----NN-FINDYLMA-R-SDEPTLAYESS----------------------SPFITVPKGKTPNV------------------------KNAPKGKK
      CAOG_06042_Capsaspora_owczarzaki_ATCC_30864_470297466                   ------------PDFMEW-CQ-HKLRT---FNAGID-------AESF-VSI--LNAF----DSKQDVTDYAHEFLG-----RSSEV-----AA-FAREFCER-R-RPFEVVAGKRT--A-HAAP---------AP---APAAAPTQGKKKAQ--------------------------------
      ACA1_074550_Acanthamoeba_castellanii_str_Neff_470519016                 -----------------W-VM-KRLKH-LLGFDEVD-------GL---VDN--LMKL----QSAEEIEKYIKDILG-----TERKA-V---QK-FTEDLIQK---RKDASPLFRTV----PIRK-L------------STAGPSGQGPSEPN--------------------------------
      GSTEN:00026621:G:001_Tetraodon_nigroviridis_47209798                    -------------DVEKW-ID-DSLDQ-LYSGQEDD-------MPEE-VNIDDLIDL----PSDEERVRKLRELLQ-----NCNNN-----TESFVTELVAR----------------------------------------------------------------------------------
      GSTEN:00018164:G:001_Tetraodon_nigroviridis_47230723                    ------------DALLKW-CV-DQLHYKFGLEASED-------IVQY------ILSI----EKAEEIEEYVGDLLQ-----GTDRK-K---EP-FIEELLIR-W-EKSRKQTTDNN--L------F-LFTEL-VP---SSEITKDAQKKSKR--------------------------------
      TRIUR3_01899_Triticum_urartu_474124787                                  -------MATSASTSGEW-LE-GALQE-LRGRTGSALELDDGLISGL-VS---FCEL----APPPDAADYLANIVG---A-EAAQD--------LIQEYLQR-R-GYIDPSKGAGS--S-QSSN---LQPYL------KPSADAATAQTKKQ--------------------------------
      TRIP4_Gallus_gallus_513200543                                           MA--------APGVLLDW-CV-RRLRGDFGLDVGEE-------VVRY------ILSI----TSEDEIREYVVDLLQ-----GTEGR-K---GR-FVEELLSR-W-QQSSQSPAEPL----PAYR--------------KKDETSESPRAGDQ--------------------------------
      CAOG_06042_Capsaspora_owczarzaki_ATCC_30864_514484474                   ------------PDFMEW-CQ-HKLRT---FNAGID-------AESF-VSI--LNAF----DSKQDVTDYAHEFLG-----RSSEV-----AA-FAREFCER-R-RPFEVVAGKRT--A-HAAP---------AP---APAAAPTQGKKKAQ--------------------------------
      PTSG_00883_Salpingoeca_rosetta_514701569                                -----------RKEFMGW-CK-KEIGA-LTSDVDAE----T--LVKV------LLEI----PQPEDVIDFVTEQLG-----ERARK--------FSKAFLQK-R-ATAFEGDAGQV----------------------LPTVDDDSWQP-----------------------------------
      SPOG_00659_Schizosaccharomyces_cryophilus_OY26_528316225                ---------MPSSALEQW-TK-DNVLK-LLPLDEES-AV-L--VAQT------ALAA----ENADDAKTHWISLLG-----ESQET-----IE-FVSEFNRR-R-FAASFSQKDAI----------------------SKKLESNRPSSYSA--------------------------------
      Gasu_19130_Galdieria_sulphuraria_545710223                              ----S-FGANIPEDLKKW-CE-EQFSE-LVSSQDVT-------LAEF------LASL----NTREEIREYAIIYLG-----NSKKT-----ED-FVEEFVRR---LQFEFESQKVT----PGSS------------G-SQAFKSGRRRKK----------------------------------
      GUITHDRAFT_122423_Guillardia_theta_CCMP2712_551630891                   ------------DSFSKW-CF-KELEK-LTGSDDTT-------LGEF------LMSL----HSSSDVQEYVEEYLG-----PKGRS--------FAEEFVLR-K-QMDSVEVVSSR--G-GSKE--------------EANLQAKKKKK-----------------------------------
      EAH_00020370_Eimeria_acervulina_557125805                               -------PKRAAAQLQEW----NCLLA--ECELPLD-VP----ILEY------LATM----ENPLEVEEFLLESFP-----SHKNL-----RL-FAENFVMT-N-DKYNKRPQDDK----GGPQ--------------ASAASWEAIQGKGR--------------------------------
      PVAR5_6530_Byssochlamys_spectabilis_No_5_557723410                      ---------MPATDLVAW-AA-PRLSQ-LLPLDDES-------LTQI-ITY--SASL-----SKDACAEHLKNLLG-----DSPAA-----LE-FISSFNSR-R-GGGETSTQSGI----ASPG--------------AGARDGTERQTNED-----------------------RNVQKKNKR
      LOC102564933_Alligator_mississippiensis_564240896                       -------------DTEKW-ID-GCLEE-LYRGREAE-------MPDE-VNIDELLEL----DTDEERARKLQGILR-----SCGNS-----TEDFVRELLLKLR-GLQKQQALQQPS---PDEQ------------------------------------------------------------
      L345_07652_Ophiophagus_hannah_565314478                                 ----------MAAALVSW-CT-GELRGTFGLDVSEE-------IIQY------ILSI----DDEEEIREYVTDLIQ-----GRDGQ-K---KY-FIDELVAR-W-KQSRSTTSDLL----LLYQ--------------KKDDILDTPRPGDQ--------------------------------
      AND_002481_Anopheles_darlingi_568257448                                 -------------SLQGW-IK-EELSK-CLCFEIPD----A--MISY------ICNI----REGCEIDEYFRTLLD-F---KNPEH-----VK-FLGELKRR-M-GTRPGANNQQA--S-------------------KQTAPAPSGKQKKQ--------------------------------
      LOC102654890_Apis_mellifera_571571334                                   --------------MEDW-IC-ENLSQ-ILDFPVTN----E--IVQY------MIQI----QNERDLDDYMRSFLD-Y---TNGKH-----RQ-FITDFKKQ-Q-VKAISALIKDQ--A-----------------------------------------------------------------
      BATDEDRAFT_36451_Batrachochytrium_dendrobatidis_JAM81_575473160         ------QATNGSAALVQW-CR-VALRG-VQRTSTTV----N--VDEF-ITM--LNSI-NV-KESATITMICDDTLG-----GSTAI-DP--RK-FADEYIRR-R-QAEASGTHWSA----QSSA---ES-------------------------------------------------------
      BATDEDRAFT_88748_Batrachochytrium_dendrobatidis_JAM81_575480014         ----------MSTKLEDW-AT-NEVIR-LLGHSLPR-NE----VHQL-ITY--SLTL----DTKDEAANYFQDLLG-----TTEES-----LE-FISEFLTK-R-FPPLQTVGAWS----STAT--------------ASRKEKQASIQREQ--------------------------------
      WALSEDRAFT_69827_Wallemia_sebi_CBS_63366_588260673                      -------PGAPSEEFIKW-CK-GALVG----LNGTT-------SDEL-LPI--LLSF----DIEKPDLELIQDMIY-ASS-STMDG-----RR-FAGEFAKK-R-K--EDSKGVSL--S------S------------------------------AADIVKQQQAPKTLQETFKVVQKKKKR-
      H779_YJM993P00176_Saccharomyces_cerevisiae_YJM993_628229990             ------ASVSKRQEFLRW-CR-SQLKL----NTGVQ-------PDNV-LEM--LLSL----PPGSESKEIIADTIY-SYS-STMDG-----RR-FATDFIKK-R-LECEEEINDPL--S------W------------------------------SEVLAMPEGSSEDWE--FQVVGKKKGKR
      _Danio_rerio_62960123                                                   -------------DVEKW-ID-EALDK-LYEGKVED-------MPEE-VNIDDLLDL----PSDEARTHRLQALLQ-----SCSSN-----TEAFIAELLQK----------------------------------------------------------------------------------
      gigyf1_Anolis_carolinensis_637369140                                    --------PPPQDGFTQW-CE-QMLHA-LNTSSNLD-------VPTA-VAF--LKEV----ESPYEVHDCIRSYLG-----DTLEA-----KE-FAKQFLER-R-AKQRANQQRQQ----QQEASW----------------------------------------------------------
      DDB_G0279309_Dictyostelium_discoideum_AX4_66815539                      -------PGQARPDFIKW-CH-QQLKA-LTSNDVSV-------ITEL------LCSM----KTESEIRECAQECLG-----YSSEV-----NI-FLNEYLLA-R-SDEPGLAFESS----------------------SPVITIPAKKTNKS--S------------------QTNPSKTKKKK
      DDB_G0269884_Dictyostelium_discoideum_AX4_66826047                      --------PMSYEEIEKW-TI-EKMDK-MLGVDSKE-------MAKY------VLSM----DTNSEIENYLADVLG-----NTKKV-----QT-FIEQLIKK----------------------------------------------------------------------------------
      BM_Bm1959a_Brugia_malayi_671413957                                      ----------MADFLEQW-VN-DELYT-LVGCSDRT-------AVQY------ILAL--A-RKSIDAEDLLGRLRS--TD-TMEDT-PAV-RK-FISELIAR---VPHAAAKREKV----IQPS--------------AAELRAKEI-------------------------------------
      BM_Bm2316_Brugia_malayi_671418067                                       ------SVTSSGNALTSW-MI-NRVKQ-LNPQVDAD----V--FAAF------IEGV----DNPNEVEDYIIGYLG-----ESRLV-----KE-LIREFLER-R-SQARHKKEPVD----KDDL--------------THPAQAADA-------------------------------------
      LOTGIDRAFT_162501_Lottia_gigantea_676463843                             ---------MAAPTTEEW-IC-QELAK-FGIETTPE-------NASY------ILSM----DNNQDLEDYMNDLLD---K-SDPKV-----RI-FVQELLRR----------------------------------------------------------------------------------
      OT_ostta09g03850_Ostreococcus_tauri_693498597                           ----F-PPLNNKQALRAW-CK-AQMSQ-LNNSDDMT-------LVDF------LLGL----PSAGEVQEYVALYLG-----KTPQA-----NA-FATELIRQ-K-RADPS--------------------------------------------------------------------------
      LOC100184186_Ciona_intestinalis_699243562                               ---------AQTDPFVSW-CD-TEIKK-LPSAVNLD-------IPTF-VAF--LRDV----ESPHEVKDYVASYLG-----ESKPA-----RD-FAEAFLQQ-R--------------------------------------------------------------------------------
      AFUA_6G02200_Aspergillus_fumigatus_Af293_70984701                       ---------MSNSNLVAW-AV-PRLAQ-LLPLDEES-------LTQI-ITY--SAGL-----PKEEGAEHLKNLLG-----DSPAA-----FE-FIASFSAR-R-DQTQAQTRSTV----PSPV--------------RGGEEQSAA-------------------------------------
      CELE_F55C10.5_Caenorhabditis_elegans_71995966                           --------------VENW-IE-TEVTK-LFNGNETN----N--VDID-LDV--IQDI----EDVTGKRKFAFEQLQ-KAH-CPCSM-DKI-IM-FLDELIIQ-L-NTL----------------------------------------------------------------------------
      GE21DRAFT_1337754_Neurospora_crassa_725976398                           --------------------Q-QQLSR-LLPLPDED-------LKQV-LDY--ASTL-----SKTEAIDHFTNLLG-----DSPAV-----ID-FISTFNAR-R-ADPKAPPAPSS--A-ARTP--------------SAPSSAQNS-------------------------------------
      RMATCC62417_11189_Rhizopus_microsporus_727141261                        -----------MSTLDTW-AQ-DKLSV-FLGFDPET----I--RSQV-LPY--LMST----QTPEAFGERLMEMVG-----LSEDA-----LK-FIEEFTER-R-FHPERQQNTTT--V-VASG---SN-------------------------------------------------------
      RMATCC62417_06585_Rhizopus_microsporus_727147058                        ----------PSEDFRRW-CR-KALRG---LNSGVN-------EDEI-LEM--LLSF----PVDGSSAEIIEDVIY---A-NSLRI-DG--KR-FAQEFMRR-R-KADIAGRTD----------------------------------------------------------------------
      NCU09657_Neurospora_crassa_OR74A_85091072                               ------AKNVAMEEFKKW-LH-RELSR---GLNGVN----D--IETF-AST--LLEL------PLDVSILSECVYG--FS-TTMDG-----RH-FAEEFVRR-R-KLADKGIVEKD----SNTG--------------AMSSSNGGWSEVAK--K-G---------------------------
      GIGYF1_Homo_sapiens_92087055                                            --------PRPQDGFTQW-CE-QMLHT-LSATGSLD-------VPMA-VAI--LKEV----ESPYDVHDYIRSCLG-----DTLEA-----KE-FAKQFLER-R-AKQKASQQRQQ----QQEA-WLSSASLQTA-------------------------------------------------
      consensus/100%                                                          ..............................................................................................h...h.....................................................................................
      consensus/95%                                                           .................W......h...............................h...........p.h...h...................F..ph.....................................................................................
      consensus/90%                                                           .................W.h....h............................h..h........p..p.h...l...................Fh.ph..p..................................................................................
      consensus/85%                                                           ..............h.pW.h....h..........p..........h......h..h.....p..p..p.h..hl.......p...........Fhpphh.p..................................................................................
      consensus/80%                                                           ..............h.pW.h..ppl.......s..s.........ph......l.ph.....s..-..chh..hl.......s...........Fhpphh.+.b................................................................................
      consensus/75%                                                           ..............h.pW.hp.pplp..h...ss.s.......h.ph......l.sl.....s..-h.chh.phLs.....ps.p......pp.Fhpchhb+.b................................................................................
      consensus/70%                                                           ..............h.pW.hp.ppLp..h...ss.s.......hsph......lhsl....ps..-hp-hl.phLs.....ss.ps.....pp.Fhp-hlb+.b................................................................................
      
      Back to Contents
    • 
      Back to Contents
      
    • 
      Back to Contents
      
    • 
      Back to Contents
      
    • Fasta sequences of entries with temporary ids and not found in Genbank

      >gi|Lgig1000010006|ref|jgi|Lotgi1|156824|fgenesh2_pg.C_sca_12000135
      MIPDTIIYIPNFISQEEEQKLIDHVYSAPKPKWTHLSNRRLQNWGGLPHPKGMVAEDIPQWLDLYCDKIGKLDLFEGKKPNHVLVNEYSPGQGIMPHEDGPLFYPTVTTISLGSSTVLDFYTHINQGKAEDKSEPCSENMSKKFEDRHVCSVYLEPRSLVIVKDDMYTKYLHGIKEQTEDMVDDRICNIKQCSDINIDNIKTRQTRISLTIRLVPKVLKTKLFFGKK
      >gi|Fcyl1000050013|ref|jgi|Fracy1|223976|fgenesh2_pm.2_#_126 ; gi|Fcyl1000106017|ref|jgi|Fracy1|180489|e_gw1.2.1691.1 ; gi|Fcyl1000088756|ref|jgi|Fracy1|163097|gw1.2.1691.1
      MNDESMKKKSRRYEKGHWDAVINLYKELFNRIRQQLAEHHLTDYYDDQESIHWLPCHAIDLKKDGELNAHVDSVRFSGDLVAGLSLLSPSIMRLIPCDDNDDDDNKNSENSTKDEEPYYVDMFLPPRSLYVLTGVGRYKYSHQLLPDGSIFHKTDTDIVVRRDHRLSVIFRDSKQPSS
      >gi|Ttra1000010051|ref|AMSG_11934T0 | AMSG_11934 | Thecamonas trahens ATCC 50062 hypothetical protein (935 aa)
      MTFRYTVPLFLDLILPLFIMSSIATSAPRSSSSSAASSASSASTSPAKSYNQRKVHNGKLYSGMRVGSTHRWNYSDAALEVSEMKAPAGDSSLGVLTPMQSQLAYEAVKSRLNAAPIGSGAAVNTIYHWFLVTELAFGDASDDGKVTSATLRGPKFKLAHKRPHWRKFSAEYSGNTPVPRKRIECLESLLGDAPPREAVTSAAKAAALPSYQASELLDGVAALKLGTCDTSWIETKVGPDEWEVAVNYGLTAAPRALAGARFLVVADQFATKLNANQYATQLAGAMYYLGDGSLATPAANALKIKIVSALTSALKSELPALKPAQPSPPPPKPLTAFWQAVGTPMPVMPPPKKVSRKRPREPEEKLVHPDVAVGPIVRPAELTKRAKTTSAGADADILARLKHELSPSAGILYAPGAVPADQADALFDAMATTVPWGAKRWRGSVLPRLVWHYQEGVVPALDMLLCSISSAFGVSAIKGAFCNLYRNGEDHTPYHADEYGADVLSISLGATRKFHFKPKGVTGAAATARRITYDLAHGDVLIFDESVNARYLHTVPKMKAVTAARINVTVFAVRDAPPPPPPALAHPSALEPTRPRSPPPPFVDDDALAAAAAAATSPLPTHETNMTANPALSDHPAAAPANNTTLGQPGFVSGYGSAHGNATGPVVGSYVAPSGPATVQVQEAEAFENVVVRVPRAEATIVRRGPPSVRTQVVPGPTFVKVVPEYVAGPVRTIKEVIPGPVTYRDEKVEVPGPVRERIERVEVPGPVVERVVHDYVPGAVVEKEVKVEIPGPVTYKDVKVQVPGPVIRRPVPKQVQVPGPVRVVKQRVPVPQPHIREVTVKRKVAVPKPVNVEIPGPTREILVHKRDNLLEAENARLRAEIAFLQSIPSAPHFEVLARREIKQQQQQFGGYPQQFGGYPQQQTYGSAYPGYGF
      >gi|Falb1000000067|ref|H696_00080T0 | H696_00080 | Fonticula alba ATCC 38817 (V2) hypothetical protein (377 aa)
      MLRAPIGLAPRTLGLALSSATRRSVSSSVAVSAHNREALAPVEVDGLAMFDVGAALLAEGCHPGAFLRGGALVRATLPAIRDLFLSNTPDATGRRSPSPPLVADGRWLEDHFGPTRALGALPAGSTAGGGISGLVGWDVALDGLLAAGELLRLQQASYSRSHFDSVIHHYRETIVRALGRRLDRGDADVAPGPDVSQRDRQLEALARIERIMHQTIVRSFAPGDSLAHMAEGNFLPLHTLDLRADGHIQPHVDNLSASGRLVAGVSLLSGRVVRFEQMYTDARPRESVKPSERRVLDVLLPPGSFYMQRDKLRYDYTHAILPVTPGQETVWPEGAGQRVVPLEPARRIVFLVRDRLNATGPGPGVGAGGKVLEGTY
      >gi|Sarc1000010122|ref|SARC_09892T0 | SARC_09892 | Sphaeroforma arctica JP610 hypothetical protein (129 aa)
      MQVWEFTIDLFAAAHNAHTAKFYTKEQNALLQKWADEINAWANPPWELVPDVLDKVIEKATSITICVPHYPNASWFPKLMSLLEQDPMIVKNMNNTFLQGGTTARGKTPWGVTLVAKIGTKSPYLTKK
      >gi|Ccor1000000123|ref|jgi|Conco1|67245|fgenesh1_pg.3_#_43
      MILNKNQSKKRLQSYQTFHKLNKISDYASDRNSLFSAEPTKYLVLTNIGYGGVGGIKPQELNTILNNLEINGFELICKNGKPFSYLIFNNIQNSIESYKKLNLIELKELNKLIYCEYLKFNPIKLSNTNDKQDNINGLVLINDFLTIEEELELVKNIEDDVTNNWSIVQNRFVKHYGFKFDYNTNSFGSSNNEMPIWSTKLLQKLYKITSDSEVINMDQLTVSKYPKGTGIPPHVDAHTPFGHTILSISLLSSTQMEFSNPETKLQYSTMLNPRSALVMSGESRYGWEHCIKERKFDLNEKGELVDRGERISLTYRRTNPTLDCNCQFGYLCNRK
      >gi|Psoj1000010133|ref|137293
      MDPLEAILASMAAATGNASPPKSNAAAAAPATTKSAGKKRERESESETPPVSSSTPPSGIWNPQQQQPLDAATLRKLEDAARARSVNPAMEIARQQTVRALCNKIRRASEDLGIGKLPNSAYETWQFTSQLKVKELDPLIPHAGSDYSGLFEELRKAGATKSGATKKCKELTRESERLLRKFGQQDFVAGKKKRVHVAAAEDGMRQLTYGNSTVKLSAAHFAKLREMYARKQGLGGDGSSMAPKDQRSFESALFCLLLRYDSLDGGGFQAALNEECFDVLLKEFDCKMECFASPLNCRYSRFCSAFLDTDCAFGSVGSFFDFSPRSGCFEANPPFIPKVIKRMADHMTALLNAADGPLAFIVIIPAWQDTEGWQQLNSSRYNQTHLLIPQKQHGYCEGKQQIRKTRWRIASFDTSVFFWQNSKACNKWPVTEKKLDSLKSAFKSKQADERDALGLRKSGKRVRSAKD
      >gi|Sarc1000000137|ref|SARC_00134T0 | SARC_00134 | Sphaeroforma arctica JP610 hypothetical protein (556 aa)
      MEGKKQFHGNKRPQNTPLWDSTVKTSYKNALGLSRVKSSSDKLAVMPLNEMERWLAVKKLRQSVRQACGVPALLAYERWCARSSLPAAFIKDDDDDDGESGAKEDVGSGAMATVVSYLVPNSDPEKGLQKDLMRLGGLSEEDAQEAAAKCVALGVRLNAKLNDTTVEGSGKGSETAVQGDDDMENEGVITDAATSTDTNGSDVVANKDTEPNNTPAKKKKADVKNKLRSIGVRVEGDDDDTHSTHQAQVEERGALLAYSLRTKKKPFFSLSRPHAAKLRSLYARTRGGKWVDKEGSDNDRFVDAVFCVLARYDALGGAGYQAALNEASFDVLKDKMRVDCECFASPLNCRYGQFCSAFPDTDSPFGSLGSFFDFYPSKGSFEMNPPFVPEVLCAAAEHANALLSLTKEPLSFVVVVPAWKEVRMWQVLSNSAYNKHEPLILTASNHGYCDGQQHQRRPSERYRVSSYDTAVFFLQNDAGAKKWPVSEAIRNELVESMHKAVGSAKTVQELEGRYRPNKEGDDPNMEGGDAGDGSRKRKGGRVFLLQKKRKANEDQ
      >gi|Smin1000020134|ref|symbB.v1.2.017755.t1|scaffold1389.1|size122275|4
      MERRILLEHTATVKSCASLVQWKQCLDILQQLKEEEQSPDVILLTSIVTACGRATEGSHAVDVFEELKRYRIEATVISYSAVINACSKCSDWWRALSYFDDAWDTMGDSTSKEAVSFLTTASISACGRAVQWLIALEILEEVSKRSMESTSIFAQNAAISSNHQAKPLPEWIQALELFGHANDSDVVTYTSVISACQKVARWQEAIELFVTMEKQMRPDLVCFGAVISALEKVGQWEKAFEFFRMLESQDVSLLPNLIIFNALMSACEKAGQWQRALHLLDLLKSTTTDVTAYDVVTYNALISACEKGQQWRKALEVLEEARSLLKIDVISGNAVLSACAAASRWVQSLQFLEVLLCNKLLLDLIGRTAIISSCAKVRGWQKALQFLGSCAATEVTYNAALSSMEGTVRWKESLEVVAAMRQCKMSPNILNYGIAIGACDEAMASPFHSTKWKLFIAAGVGAVCGFCLARRSPRGDGDEEEKNSILLGYCQHHPTKRGCGALHAASTMKERTGGRKMMPKEAPNGRREVICEDALEWIEKQGHFPSGSMVFTSLPDMSEVVEFAPRFEDWEDFFMKAVRHILTALPYGSVAAFYQTDVRLPTEGQVSKAFLVLKAAEAVPEARGFGGGCVKV
      >gi|Pram1000000142|ref|84939
      MLSRPLRSFAPLRRLLSSSAIAANEPTSLWQDVYNLDATHCHDPLVSEGDLQVVLDVITEDEEQVVADECSRILRRRRYEEDHWDNAIVKFKEMERSRWSTETQRILQKVREAAILPKELKYFPAVHVIELAEDGYIKPHVDSIKFSGRVVAGINLLSPSIMRFKEEHGDSIIDAYLQRRSMYMMTGRVRYHYTHEILPGAQVFKGQVPVNRTHRISIMLRDEFLEEHVTKYHTPYVKLDVVAQ
      >gi|Lgig1000020166|ref|jgi|Lotgi1|228010|estExt_fgenesh2_pg.C_sca_10395
      MYEYNEDITENDQFKPTYKYYKAKIPEPDFSNVLDFQAISESEKPDERVTNFQLQIPTGNINHINGFRPVHDWKAYSINDNPGFIFVLNPFKDGYQKYWVHRCLNDYPDAKNKANLDIQLDSEKRAKLWKNFVETDPDSIHKNDEIMGLRWTTLGYHYVWNNKEYHKERRTDMPEDLEELTKFVAQTIGYPNYCPEAGIVNYYHLDSTLGGHTDHAEFEQGAPLISFSFGQDCIFLLGGLTKETKPTAMFLRSGDICVMWKQSRLAYHGIPKILAPKLTHYPLSLNSEYVSESASDEAQYGNIHEINKKIEKTLQNLNWKPYKLYMSCSRINLNTRQVEGENKFFPEK
      >gi|Sarc1000000227|ref|SARC_00221T0 | SARC_00221 | Sphaeroforma arctica JP610 hypothetical protein (158 aa)
      MPTLSTQPLRTPMMMTCLDFVLTLVIFLTKMPRTSPPPPPLSTPLLPIDTPTNKSPPNFPTQDIQPPTKMTYDEYDRNNGHRSRKTTLRMALKDNVTLTHPPPPGVTTEDWMLNKNDWASAMTHLDFTPTIDRCASQTNSQLPRFAGPEGGEVNDFR
      >gi|Crev1000000227|ref|jgi|Coere1|78980|fgenesh1_kg.1_#_172_#_isotig04968
      MLNTVRNSCRIPHMVNTTVQRRHFIEAGRKSRVTKPSCGDIPQPKEPINLASPASAKPLIKYSDAFIKHGYSKGDIYLHPEFIDTEEHDLLVRSCNKKLRRLANNYEQGHFDKRIHNYRECTVSAWLPNKLGVAGHIAKAMGHHIDEDPIDLPDRAPTKKSSGWSSIGKHDGQIRHILERVWGLFPPHLAWLPPHILDLHEDGEILPHIDNPEYSGFVVGGLCLLGSAVSTFKHANDPSIRVDVLLPPGSLYFMTNHIRYQFTHEITANPEQRAWGGQPIPKAHRISLMFRDAKEPVGGWGSALTTGCATTTMASAKSDHGI
      >gi|Hrob1000010247|ref|jgi|Helro1|123161
      GLKPVNTWKGYTIANHPGLILLTDVMTSNLQKHLVKRCLNDLVKQPNKTNLDSYTKEEDIDNIFINFSNLLKKLRWITMGYHHNWDTKLYAEENKSEMPQDISELAECIAKVVGFTEFTAEAGIVNVYHKNSSLAGHVDNSEYDFSSPIISLSLGCTCIFLIGGPDKTSEPTALYLRSGDVVVMSGEARLCYHAVPCIITSSYTCLNDKSTTDCHEKCFDSNENNSKINSESYTMDNDFDNDWHLYENFLKESRININTRQV
      >gi|Bnat1000020289|ref|jgi|Bigna1|25145|gw1.85.31.1
      RFNFCLLNLYRNGSDYMGWHCDDEREMEGPIASVSLGETRDFVFKEKANRTSKHFLELESGSLLVMNEETQKLFLHSLPRRAKVQNCRINLTFR
      >gi|Fcyl1000040299|ref|jgi|Fracy1|258165|fgenesh2_pg.113_#_6
      MKAKRYGSDTKLSNVISFLRRFGWLHVTVFAVVTVSAFSGKTGTVGHRRNKKSPSVQQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKALETAVEGVKAASVISRLVSSDYLISSNNDNNNGKVWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHRQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDDVDNKHNNNLFTVASFVKQVKFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGNGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRFEP
      >gi|Fcyl1000110306|ref|jgi|Fracy1|184778|e_gw1.5.156.1 ; gi|Fcyl1000066930|ref|jgi|Fracy1|139687|gw1.5.156.1
      MSTSKNVNAAAKTKTKTKKNDASSKTTTKISVFKKACIRHKRRGNDPNRIDDLLVGSGGCADDSIIDVKWIHYLQQRSRRKRRKHPQQQHSTKKKDNDDDGNCDSDNDSKNTTSDNGGRFNKDLVFLIDHHKRSTSTTSTSTTAEDATDTDTTTPTPPRCYGFHEYPGVYIYPNALSEEVQLQLSYEAVTKYCEHSSTNSNSSSSSSPNASSPNAYRTNTDLLPPKTNEQINDTTNNNISETMWNLWKQEQEQSDSSLSSTTTTTTTNYYRRFSKLSWSTMGYHYDWNKRQYHPDQKLIVIPSLVTKISKYFASASLLYNNQNNIDNYSLTNAPLIPPPTGTGTGTGENNISFIPSASIVNYYTEKSNFGGHRDDLEHINAMDKPIVSISTGLPAIFILGGYTIKDEYNYEDNKNENDNDGNENENENDENHPQPHPVRAILIRPGDVLIMGGPSRLNYHAVARIVPYEAIIKYDNTLFGSDDENENNENNNGNNNENNTNDNAITDEKKYLKRYLKDHRININVRQVYPDQK
      >gi|Bcir1000010321|ref|jgi|Bacci1|200768|e_gw1.291.36.1
      MHPSHISKSLSVLPTIKRTLSTVSATPTERRPSVTQSEASSVQSDDTYVIDNRYTAQELLDIAEEHLFGRNGKAIDRAYAIACLQESAINMRCAGAQAVLGFCFEFGIGVPIHFEAAEQYYLMSIKTVLTGLDLQGEKSQTLSIDNVSLSSATLLGITRLAFLRKYGRPGVHINRIEAEYWESKIQQRGLEAIAWIQRAAIYDQCSASQYCLGVCYHDGIAVPKNEYKAFKWYRLSAEQGNCRGQGILGYCYGEGFGVEKSEATAIDWYRRAAAQGETVAIYNIGYCYEDGIGVEKDAVEAVKWYKLAAEKGNAFAQNSLGYCYEDGIGLQLNKNFAAYWYRRSAEQGYPWAQCNLGYCHQNGIGTEKDTIAGAYWYSRAARQGHARAQHNLGFCYQNGIGVNKNFKLAFEWYSKSAHQGNVFAFHSLGYCYQNGLGIEINHTEAVKWYLRSAEHKHAPAQLSLGFCYRNGMGVEKDEAKAAKWFELAAHQGNPLAQNSLGFCYEEGLGVSKNVKLAVHWYIKAAKQNNPWAQCNLGFCYASGIGVMEDTTKAVYWYRKAAQQNHARAQDKLGVHLQAGIGCRQNMGLAVRYFRLAANGGQVSAQYHLALCYEKGLGVEMNLQEALVWYERAASSGCRNSYEHLRQLLLRYCLENSSAVETLQRSDAHGETRNCSHRLGWISGFAAPAA
      >gi|Sarc1000000340|ref|SARC_00333T0 | SARC_00333 | Sphaeroforma arctica JP610 hypothetical protein (291 aa)
      MLHSRFYQEAERRFGKFTVDRFASAHNTHTDKYYTKIHNALQQKWAQEVNAWANPPWKLIPDILDEIIEEATSITICAPYYPNATWFPKFLSLLEEDPMLVEKTNNTFLKEGKKACGKTPWGVTLVVKIGVKCPYLDSTREQRAIHVAASEPVHSGDAPQATEYETRVNLIEQYHKLRIYTLEEPIHKLEAMDMVGLISNNMYTSSLTDAYHVSKIRYTKKEFHPLQSITATFPGDRVQVDTIGRATIGFQQRAHICVSMARCTLIRMSYYSDDGQMSGYYSTHASTNHV
      >gi|Sarc1000000358|ref|SARC_00350T0 | SARC_00350 | Sphaeroforma arctica JP610 hypothetical protein (335 aa)
      MKDKKAYEYLAKSGAEISSTPTRAVCLVNAGNQQGIAVDDFEAACSLLGKIAYLLNFPGKTYCVVIYEGLECAIDAHKALAGRACTLLKRTYSKEASIPLLVEYLIEDSIQLPVPATVQDARAVPGLVVVENFITAEEESTLKQCLDRYKWEPLSRRRVQHHGIRFNYSTNRHADDSMPGFPQEVQDIIIGKLRRMKSLNPVDEVSPVEPVPVPDQLTVNEYKPGAGISAHVDTHSPFRGAIISVSLCGRVVMEFKHRDGRAMSVLVPARSLVAFTGEVRYDWTHAITETRFDYVDNSFRERQFRISLTFRETKSTPCDCNYPHLCDSVGSTQL
      >gi|Crev1000000444|ref|jgi|Coere1|13168|estExt_Genemark1.C_20086
      MQQQTAFRRAEKKHKAQHQLPNLSDVIDVSAIDASDGAAVRRLCLTHDMCQPSVLPFQPFCQDRPPAYTLRDHPGLIVIPNPFTAEAQRWIARKCLCDCTRPPNHTNLDPFFDLPSQSLFSLASGPTNNKPAGALTRVEAQHMVASRVQSDSIDPSVSKPKGKIYMQSAPAADLLERLRWCTLGQQYNWTTKEYDLGTSVFDCELNALMRSIAEAITNPAYAACNRSEDWPPINKYEGSEFVSQAGIINYYHERASMAGHVDKTEESMDAPLISLSIGLSCIYLIGGPTRDTEPTPLLLRSGDILAMCGDSRLAFHGVPRVLSDTAPECLTLPNAGDNDSVAARYPNWHNFATYLSTHRINCNARKCM
      >gi|Lhya1000010458|ref|jgi|Lichy1|232292|estExt_fgenesh1_pm.C_2000009
      MIRKTGRITDPDPALYDTRILGGRGVQAYVYFMWICLQRCDPHNGELCLITPSQWLVLEFARHLRAWIWEHCELLQLFQLEPYKVWPRVQTDSLIFRLRMRGTRPPNLNTHTLFLRHTARRATLENILAAYATFNPHQQPPSSSTDIAYKYTPTHDRSRIQNSPNASFAFLSPSTSLTGELAHLTHSLSRLCDGPGAPLVFHRGPNTHPVYALVVRTQWARDYFGPHCCSRWLRPAFYWSGKAAGTHDPESIFWHLRDTQRLARKETSPAEAYAPFYAPDANYSLLLVDKEGADALESTLDKDDARLYEYLQAARLALQPTREERKVTWCHYNQCGTDVAVKIVHPINCGYFTKSQPRQRFFVDRHQLCVTNQCIILLWITQLEHGTILST
      >gi|Vcar1000000459|ref|jgi|Volca1|78621|estExt_Genewise1Plus.C_10079
      MDIEEPTRNPPTVDCSHCVLRSVFLRLQKASGRVFAIDASCNVRSDNSLCPIFACPDSFTSHNLSGQHIWCNAPADRAIPWLNNYSTFKQRTPDTTSAVILVPKCAHLEKEFQTRGWTLLKEFTKNSSIFSEPKPGGGRTRSPNCSGEFQAWLDPCQE
      >gi|Uram1000000474|ref|jgi|Umbra1|218622|fgenesh1_kg.2_#_546_#_combest_scaffold_2_37280
      MCEACFELASLDLIQDELPLVLDEEADHTYAASIEDYSHKPYLTREALSSTKFDDMKSNAVRLASYAPVEHQLFSLSFDSTYFDIPGRAPRWASHSGTDYHGTWLPQTVRRAILRHTRKDDRILSNFLGRGTDAIECFLLQRRCCGVDINPAAVSLSQRSCSFETPPGLTTAEHRPIIVQADSRKLTGALFADESYDHVLSHPPYKDCVAYSLHIEGDLSRYTNPLDFQEQYDKCVRESWRLLKMDRRLTLGIGDNREHCFYIPVGFQLIRLYINNGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKVPRNSNDKMDSFDASNTKSQMRITYTCREIPRSPIARKSVVMGTVWLFKPSSRHSFAQLCTSRMVERFGKDESNWEHVELEIISDNDDSSLKPSLANIATDSMVVEDEGLSISSYEIERQKRIDENRLALLQLVSSLSTPFIHINDDLTYLFCMAGVDIRPQ
      >gi|Lhya1000000544|ref|jgi|Lichy1|202942|estExt_Genemark1.C_30112
      MSDIDWEDLFGVSDHDNDDSNSEQHVDNAYSIPGLKLEKNALTHEQQMQLVYAIAEADYFQGGKLDQAMCFGDLGPFEWVESWIRNEYPNLIPSRILKRACLFDQAIINLYHKGQGIKSHVDLMRFDDGIVIISLLSSCVMVMKPVDSETLQRRGTIPILLRPGDVLALSGPARYDWEHGIEERMADEVNGEWIERGTRISVTLRKLLVK
      >gi|Sarc1000000559|ref|SARC_00546T0 | SARC_00546 | Sphaeroforma arctica JP610 hypothetical protein (181 aa)
      MSQPHNPSGITLIDRILQQRVSFAWPSELPDPDNTLPMPADCDDNSDNEQRFTLPVAGCVPASSEKRTQRLHPAWVRRVMEKWHGVAIDLFANRRNAQIPYYASDDPCGGTWGGDAFTIPLDEDFGWACPPRVSGFWFLHYLQSCDKASAAMVVPVWSGATWFPLFLSDVGGCASSSTAY
      >gi|Fcyl1000020581|ref|jgi|Fracy1|238447|fgenesh2_pg.5_#_951
      MSTSKNVNAAAKTKTKTKKNDASSKTTTKISVFKKACIRHKRRGNDPNRIDDLLVGSGGCADDSIIDVKWIHYLQQRSRRKRRKHPQQQHSTKKKDNDDDGNCDSDNDSKNTTSDNGGRFNKDLVFLIDHHKRSTSTTSTSTTAEDATDTDTTTPTPPRCYGFHEYPGVYIYPNALSEEVQLQLSYEAVTKYCEHSSTNSNSSSSSSPNASSPNAYRTNTDLLPPKTNEQINDTTNNNISETMWNLWKQEQEQSDSSLSSSSSPPPTKKKKTATTTTTPTTTATPIVVASTTTTTTTNYYRRFSKLSWSTMGYHYDWNKRQYHPDQKLIVIPSLVTKISKYFASASLLYNNQNNIDNYSLTNAPLIPPPTGTGTGTGENNISFIPSASIVNYYTEKSNFGGHRDDLEHINAMDKPIVSISTGLPAIFILGGYTIKDEYNYEDNKNENDNDGNENENENDENHPQPHPVRAILIRPGDVLIMGGPSRLNYHAVARIVPYEAIIKYDNTLFGSDDENENNENNNGNNNENNTNDNAITNSDEKKYLKRYLKDHRININVLQFNIVEAGDMLIQEALFGWTKDRAATGSN
      >gi|Fcyl1000110682|ref|jgi|Fracy1|185154|e_gw1.6.1329.1 ; gi|Fcyl1000087436|ref|jgi|Fracy1|161704|gw1.6.1329.1
      MVVSWELKNDNISNSTNKNINSGELFLDYATVTKKSKAKQQGEYEKGEASRPDCTSTTDHVYVPGLVVVENFLTEAEEELLVAILTGPQAPWAPQQSNMSQTGSVKRRVQHYGYVFDYRTADVLRRDVEEESGRLDPDANCPPLPSIPVEKGMEKQTNDNDLLMIFSDLNQMTLNQYKTGEGIGSHVDTPSAFGDGLISISLSSGIVMEFQKVTVGNDDDGNKVSPKNIKKLVYLPRRSLVLMSGACRYEWEHHIVTRRTDTHNGVVIPRGLRVSLTLR
      >gi|Fcyl1000030685|ref|jgi|Fracy1|248551|fgenesh2_pg.24_#_31
      MKEDAVTFNYVINDKSKGLHVKVFRRYQNQIQSDYWWNTILDNIDWYRVKYKSDRFQKNCETPCWTTFFGGRKEYTPYQDIPDWLQPLVNQVSSDLKVPTTKAFNAILVRLYFDGNDEIAWHTDGRTFLGNTPTIASLSFGSKANFQMRRMTNVWPSVNRNVVNNYWHHRVPKEKGRRPRININFRYINPGKDAERGQKTYYKYMVHGDDDKKSLKSYSYKSILAMRGGIMNFISSSSLRPNANNNIKKILTEFNDAATATAATNIGGVDGNNSNDCANKKRRVDDKRKNNDNNNNNNNNNGKPSSSSSSSNHNHNHNHKTGNNSYDVPTAGIVQKYNNGNDEVDVINDSMNSTDDATTTTTTTTSTSCSSSTTTQQYYYLSASENNNIDKSAFMALPDDIRKELINEWWKKKYISQRQGYAKTNTNTSTSTSTSTNTNMNTKLVVPNQPKVQQQDNTYNKKQKGESKEKIRSSAVAAARTDTLHSYFSTTKKK
      >gi|Bcir1000010688|ref|jgi|Bacci1|327341|estExt_fgenesh1_pm.C_3310003
      MEDIPTHPKYEWSSTQEKRVFNMLDEKTKEYRKQKEDPFSWQLPCSVKFQQDNPNTGETTTNKQKTSKFGNFMGKLLLKSSTNASKIKKSVPKPLLVEKKDSTEMPLTGYFRFMKRRTHHSNTSEDNGFPKINFKFSTARVLGNENTQIVGLNTQTNIKPKYRLINGKRTRWDVVSPVANNSKDEDSSLDLKNKEVVLPQDQSQEPTSSTEKSDMREKCIDFDFQYKEDFNDDVVDDGSSLGSLQSDDDVLGPWIELGMFDSNVSETKKSDHFSSFYSVDNQQNLKSLDLPEIQCLDCKLRTAYDKKPLDISNLCLSCQNKWSDTLSNVFRKFEFAVLTQTCKEVTEKPKSKKVKVTNDNKKASVEKNDVLHKEKTGGEKLLPQKLKKSPPAIGIPPTKSNYTRKTNKKMATGSSKKNDIPKKTNATAKPIVVATPETNDDDDPPFGFCSNPRGLVYKQVVEVLNINGHWYRGTLELMDKRKVKVKYIDWDDQEEWVIIGSKRLRTIQLEDKESDQQTDQQMKSKGESTADEVSKNNPIYFRKIAAAVKSKEPDDYVSSTLDKDPTQIFNDNEVFMTRRLAQELVDEHGFMPNSFGYRRNRAVAVTFYTSSKQRKQKEESVGYLREMHKNQVRVWYPDLHQSEWLLVGSRRLRILTEEEEESILFDSSIDLDRQEVPKIQEIAQIENKIDEIPIINPPPKRSRGRPKKTLPTEVVEIATEEDTNNVYEPEQVPQSTIILEKKVTEEHGQGDEAKVSNFLTTGAFATRRAMRQLTDQSGFVPNPYGYTNNQAVEVLNTRSGKKKFWEFGRLVEMKPGKVRVHYEGWSDLYDEWIMVGSRRIRVAQEQIPQKEDNDEVIAAPVPKTNDLLMTELNPEIRDEVKRNKKHKILSAKDYQELGLLVNIEELAAKELRKKKLHEKKTEEMGTTVKVKAVSKTKSKKSEIGGDKYEDEHDDEDIDEGDLDNDYQDTVVKKRLKSASKFKRKVKKSKTKIAKQTPCEHHSPSPPPANDTQVISLRLAQARASNSQSFVANVYGYDYMQHVTVLHLDKKFYEGRLVSMRKNKIKVHYCGWLDAFDEYITCGSRRLQVIENDHEVVCIEPNFKERYESMKSTGEPSLPEITPVNRIVRKRITLDDVCEEDSEGQREYHKEPSGEGEDEEELVEMDAWKVYCNQCNIVIKQFRYYCTYCETPSAGCDYHSFELCLRCFDQNFPFWHDHPRSSFAIQAVIDKEVGPMPIKGELVTVWEEDVLEESVNITNEDEEKNGEENIEPMFESKIDSVDASKVFSGDASITTDQGYKYLKRWKRRKVCAFCNDDDDTSNELGQFIGPFIIATFNKNGVEKKRSFWAHDSCARYSPEVFCTPEGKWYNVTLALRRGRGMRCYGCKEKGATIGCFESKCSKSFHLPCSQKPASYFKNGVIFWCQTHEAYYNKKDTYVNIFNCDGCSKKLEEETWFTCVQCATSYFSTFDLCVDCYEKFPADHRHGEEDFEETSLAILKEMEAQKATEAAREKEELRAANARKKKKSLFPRRRRKLPDGSTPVSCCYCGTYEAETWRKGYDGGVIMCNTCFELALLIDNDGDTNVTDMPLVVDNDGLQQRYVSSIEDYSHKPYFTREALSSTKFSDASTGRRLESYEPQPNQYFSLTFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTVKDERVLSNFLGRGTDAIECFLLQRRCCGIDINPAAVSLSQRNCCFEIPPGLTSAEYRPIVAQADARQLTGSLFGDESFHHVLSHPPYKDCVAYSTHIDGDLSRYTHIDDFKVEYNKVVKESWRLLKMSRRLTLGIGDNREHCFYIPVGFHLIRLYIDQGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIGTFKKIPLENIDRMLIKNEEEEDSAERASHVRLTSMQRGVPSSAILRKSVVMGTVWVFRPTESFRFSQLCTSRMVERFGKDDNNWEHIELDFSFQDQPRCEQIESCHAETEKDQSIDEEESPLSEYEQQRLRRIEENNKTLLKLGLISELSEESNDVIHYENMMDKAPLEDGKLVLMITAHQTLAPCQINLYRKTIVQLAKDATKKLAHHGMLIIGTQDIRNNTSGKLWPMTMLVLEDIERAVDQSTLKLKEMVVTVPDGYSKNRKQNMDEQPDTEHNEEEIDIETVDDYVPIVHAVYLVFQRL
      >gi|Caps1000010719|ref|jgi|Capca1|156565|estExt_Genewise1.C_3640053
      MTKAERRAFKKQMRSQHTLLRHENIISLLHQHLLIANGGLGNSVSRDMLEKVFKPCGSILDIVMVPGKPYSFVTFSDLSEAQSAVQSLQGTELPSSAASSEVPPVKLYLSFVKSVPGAEVASNILPAGLTLIQDFVSQEEEIELLKCIDWDYMDPQLKEDSKISLKHRRVKHFGFEFLYSTNNVDPDHPLDMGIPPECSPILQRMLSQQIILNLPDQLTVNQYQPGQGIPPHVDTHSAFEEELVSLSLGSQVVMDFKAPGGCHYPVFLPQRSLVVMRGESRYQLTHAIAPRKSDVVPLACLTKDNQMKLTLMARMERTSFTFRKVRNPPVCECDFPEQCDYQKLKKSNRPLTKPISTSPKELETEHVHQVYEEIATHFSDTRHTPWPRVAQFLNGLDPGSVVVDVGCGNGKYLGINPQLVMFGCDRSSGLTAISHERGHQVWVSDVLATPLRDGSVDAAICIAVIHHLSTQERRFQALVEMKRILRVGGKALIYVWARDQKRGNVASNYLKESSDLPRDEEIKSAVDAKDLKAELPTELSVHVNRTEFQQQDLLVPWKLKGKEEKQVFHRYYHVFEESELEELCQRMQDVEVNDVYYDQGNWCVIFTRIH
      >gi|Vcar1000010806|ref|jgi|Volca1|99194|fgenesh4_pg.C_scaffold_83000041
      MNGAGGTNDLGGSLDVAMASGSSGNPAGEAGAAAPPLAPVVPVPVAPVGQPVIPIAGAPAAGAIPNPAQPVAVPAAVAVAQLPDAQMVRNIPAPKLPVATPVEPDRIHAFVADVRDYFVLVGWQANIPAQKLFISGALEGFFKEWHITWTKSVPDYTPDQLLDAFLIRSAPEMYSRTHVARTTFYSATFKQELNELHLPLLLLWRHPLQYTLDLSRRRPSLPLDSRPTRVRAVVLEAVAGLVDVVVAQQAAWPDAMVQGVAPPRFVRRPDGNCLWSKYWQGLGRSVHESEVASIATTMGREFTLDACASDCGLSAVCNAFSCTARPFLDTNIAGHTVWMAPNAADLPAYVTHYRACKPLAPQSTAACILVPSGTEPSLLKGMKLVRRYPVGTSLFYVPDVQGSRALLPPITEVMEVWYDGPDSTEEIPACTAIGNAVPHLAVKISGSTFMAMLDSGATHSFVSEALVRLLHLHVLPSTFTYVRLADGGMSPIVGQPMLKVLSPLLGPYID
      >gi|Hrob1000010803|ref|jgi|Helro1|76748
      MKQGVAVFENFASVEEENSILSEVEPYLKRLKYEDSHWDDAIRGFRETEKKTWTAKNRPLIERIQKTSFTPSCSILPHVHVLDLLPSGVIKAHVDSVKFCGDTITGLCLLTSCVMRFVNVDDQSKYADVLLPRYSLYYMKNAARYKFTHEVLPNDLSKFKGLQVQRSRRIVVLLRNKPVIKGDGDVDGGESDDVGSSYGGYSKS
      >gi|Vcar1000010818|ref|jgi|Volca1|77763|estExt_Genewise1.C_830117
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFLDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Caps1000000823|ref|jgi|Capca1|163497|estExt_Genewise1.C_120131
      MGLLRFLSIPRHLGSLPIRLFTQSSLRSCPTLQTSNGLKPCETCITASDRATEEDIRGSFFVYKDFISEEEEQSLFDEVEPYLKRLHYEQDHWDDAIHGYRETERQQWTKKNRGIIQRVRDLAFPPGVPQLSYVHVLDLQKTGCVKAHIDSIKFCGSTIAGLSLLSNSVMRLVHDKTKSRVADIYAERRALYIMTGSSRYDYTHEILGESESFFEGKAVERDRRISIVCRNKP
      >gi|Fcyl1000110820|ref|jgi|Fracy1|185292|e_gw1.6.1356.1 ; gi|Fcyl1000110923|ref|jgi|Fracy1|185395|e_gw1.6.1355.1 ; gi|Fcyl1000088004|ref|jgi|Fracy1|162301|gw1.6.1355.1 ; gi|Fcyl1000088005|ref|jgi|Fracy1|162303|gw1.6.1356.1
      MVVSWELKNDNISNSTNKNINSGELFLDYATVTKKSKAKQQGEYEKGEASRPDCTSTTDHVYVPGLVVVENFLTEAEEELLVAILTGPQAPWAPQQSNMSQTGSVKRRVQHYGYVFDYRTADVLRRDVEEESGRLDPDANCPPLPSIPKMIFSDLNQMTLNQYKTGEGIGSHVDTPSAFGDGLISISLSSGIVMEFQKVTVGNDDDGNKVSPKNIKKLVYLPRRSLVLMSGACRYEWEHHIVTRRTDTHNGVVIPRGLRVSLTLRTAL
      >gi|Vcar1000010860|ref|jgi|Volca1|99520|fgenesh4_pg.C_scaffold_89000033
      MASPARNPQHAQSAEADHEHYQLCQPQRGTWQGAAPRGDDYKKLVLEKLDQMSRRMDVIEAHVQAQAAAPPPPSTPIIPAEEGLNVQPTAAELLAITAPVQPAQASDQAPQPVPPSIVAPAAPAANCAPPAACEPAAAQAGHDSGIVGCKDVQSVSVQNANHRSLKNFRQPPCFTGVPSTSVTKPEIWFDTTVDYMTRSGSNPIMGMRYYVMGKAAEWYHDLFTHKGSMLTIQGMCDDFVLLFSDECATDANGNNWFAFDTLINFAQDSVTSHDLNGQHIWCSTPADRVIPWLNNYSTFKQRTPDTTRAVILVPKCIHLEKEFQTRGWTLLKEFTKNSRIFSEPKPGGGHTSTKAVTLVDSGAWHVFVSAALVQKLGLQPLTSHVAICVLGDGANSVRLQGQAVMPDKPDANWGTASFTDGTWFKGNKIIVPNDPQLRKDILHAYHDTPLAGHHGGLITSLLQTPSGNTAICVFVDKFSKMVSLVPTTDQLTTIGFAKMLVNHIICKHGRITALLTDCDPPFTAQAMRNTAKQLGVKQLMSSNSTLFFIAGTAAVRQGAEIGTQSWGP
      >gi|Bcir1000010874|ref|jgi|Bacci1|251514|estExt_Genewise1Plus.C_890072
      MANKNQYNVLVDMEDAAYTQPATIESDGLEFQDFSGSSGMNNKSSYSQPAPPPPSNTTSFFDAPQPTNNNSNRGPIWSLDYYSRFFDVDTSQVIERCLKSMYPVGDFAADTLNNQPDLYGPFWISTTVVFSVFVCSSLAGSLAAYIAGQPHVYDFRTLSVAVFVIYMYGFFCPAAVWASTKYFGCQPSLLEIVNYYGYGLSIWIPVSLLCIIPNDIARWVFVGVAFTVTAVFLVKNLYIVISRADAKISRIILLAMLGAHVIFALILKQQKIWEKQKSENEKRKKQRKTYVNQEAFRSAERNFKSRNPPPDFSKVVDVTKEDQEHIIKVPLTQDLESLSRLFGDNQHSKTCQDAYVLKNVPGLIVIPNAMTAKAQRHLIKQCLSVYSLHPNISNLDTHYNIPESGLWSLFEKQKENTLKPEDALVYKKENKSLQQGGYSSDNDKDSDDSNSSAPPDELIKKLRWVTLGYQYDWLSKTYHPDKKYAFPQDIAELSKRVVKAIEGIGYTSEETSWRNEYKGSDFIAEAGLLNYYQYKDTLMGHVDRSEVNTEAPLVSLSLGNACIYLIGGPTKETVPIPLYLRSGDIIVMTGPCRKAYHGVPLIIEGTLPDYLDSQDGDADWAIFGEYMRTTRINLNIRQVNTSC
      >gi|Fcyl1000120875|ref|jgi|Fracy1|195347|e_gw1.24.318.1 ; gi|Fcyl1000088089|ref|jgi|Fracy1|162391|gw1.24.318.1
      MNNNNKNNIEEINDKSKGLHVKVFRRYQNQIQSDYWWNTILDNIDWYRVKYKSDRFQKNCETPCWTTFFGGRKEYTPYQDIPDWLQPLVNQVSSDLKVPTTKAFNAILVRLYFDGNDEIAWHTDGRTFLGNTPTIASLSFGSKANFQMR
      >gi|Vcar1000000870|ref|jgi|Volca1|102807|estExt_fgenesh4_pg.C_10481
      MASPGNTMRTQSAKADHERYQLRQPTAAAAEPARVTGPTAALQPQRGTRQDSFTSHNLSGQHIWCNAPADRAIPWLNNFSTFKQRAPDTTSAVILVPKCAHLEKEFQTRGWTLLKEFTKNSNIFSEPKPGGGRTRSPNCSGEFQAWLDPCQEKSKCSALEPLTENTAPLLPCNFTSTKAVALVDSGASHVFISATLVQKRGLQPLTSHVATCMLGDGANSARLQGQVHTSLRIHGFRCKIVAQVIPNYPPHSRRPEDGFAFKRGGM
      >gi|Vcar1000010947|ref|jgi|Volca1|106299|estExt_fgenesh4_pg.C_410059
      MLPPVANWSQFLSAFVIVIVIVIALLVIFVFVILQWLSSPQKPTATAPVHGLRNFRATRLSPPPSAPQWPNSLRPHGRPPNSPIGDLYAPLPTIAQPRPPHLNLLHYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIVLPDRPTAPWAPLIRHMTVVRRFPAGARIVCRRDPSDASRSLYPLSSAA
      >gi|Lgig1000000949|ref|jgi|Lotgi1|74562|gw1.8.349.1
      SDLRLKLKTAHPSKNSQRRRHSNENYDTMYSDSYEKISRYDRHKYQDYHLDELKKIHSGIQQRRLFTSAECAEVEKKIDDVVAKANRNEYKDNTVDRAPLRNKYFFGEGYTYGSQMERKGPGMERLYAKGEVDDVPKWIEKLVIKPLYDANIIPKDFINSAVINDYLPGGCIVSHIDPPHIFDRPIVSVSFFSDSALSFGCKFSFRPIRVSKPVLNLPIARGCVTLLSGYAADDITHCIRPQDVVSRRAVIILRR
      >gi|Psoj1000010940|ref|138234
      MSSKAERIQAKRHQHAADEHELDLFLNAPQRAPLPCNPNAAPTAENGAPEASGANDQVPPRGSTDTGALLPGLVILKGFLSPQEQQELVDDSRRMGMGEGGFYKPTYASGAKCRLHQMCLGRHWNVKTEKYEQRRSNHDNAPVPPLPESWKKCAQRSLEAAREIDPQVMGTCKHMTPDICVVNFYKKAGRNGMHVDKDESDEAMSMGSPVISFSIGCAAEFAYIDHYPEPHEAVPIVRLGSGDALVFGGPARKVVHALTRVYNNTQPKWLRMRSGRLNLTFREYKPSELAC
      >gi|Fcyl1000020973|ref|jgi|Fracy1|238839|fgenesh2_pg.6_#_241 ; gi|Fcyl1000111282|ref|jgi|Fracy1|185754|e_gw1.6.1166.1 ; gi|Fcyl1000111321|ref|jgi|Fracy1|185793|e_gw1.6.1362.1
      MPRHETTTTVATGLGSKVAKKQKVKAGKSNIWNPDVCYVSEKKECTTSSGKLVENFPDPTCSAEFMRMTSTNWLRKQLHKLRKSNKSAEDVEFMKPAVMAVERWFMRYSLDHPHRLAAGEDPVLPMPGADLEEIDPHFVDDLVKTGKDTQEAAVAVVKDFYALTRKAALDILKKEQALTKEEIVVVIHHRHSCDVMLAKNKEKLLKINPAHYEKLKTIYIAVQESSSLSLTSGKKKKKTKTTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAFPDTDACFGSLGSFFQFHPKHGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLKASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDTAASKWPLTEIAINELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKMAGGAKKYKRNT
      consensus/100% not found.
      >gi|Lhya1000001028|ref|jgi|Lichy1|233553|estExt_fgenesh1_pg.C_70030
      MRNIDYGTQIPGLIVIEDFVTEQEEVALVTEVDNRTWCGLGVSPNPELKRRTQQYGHLFSYRYRKVLEKYGPLPEFTHAVVARIKENKLMPKEPDHLLVNEYNAGQGIMPHTDAPALFGPAILSLSLLSACVMKFTHVENGNSIDILLPRRSMLVMTGDARYLYKHSISKDLVESSSEGVTVHRDRRVSFTFREIIAWEVPAENPSCSCTKSCSNNK
      >gi|Bnat1000001029|ref|jgi|Bigna1|88467|estExt_fgenesh1_pg.C_320094
      MEAEIIALTFALVVTFVALTLGWQFILGTPGTEEHVLGLCYEQGEGVEKDLKEAVWWYRKAAEKGLAKSQYQLGDCYDKGNGVEKDLKEAVRWYHKAAEQGHSAAQHQLGYCYKHGKGVEKDLKEAVKWYLKAVKWYRKAAEQGCAAAQNNLGDYYEHGKGVKKDLKEAVKWYRKAAEQGDAEAQNNLGHCYYVGEGVEEDLKEAVKWHRKAAEQGHAEAQNNLGHCYHVGEGVEKDLKEAVKWYRKAAEQGHAEAQNNLGHCYEHGKGVEKDLKEAVKWYRKAAEQGDAEAQNNLGHCYYVGEGVEEDLKEAVKWHRKAAEQGHAEAQNNLGHCYYVGEGVEEDLKEAVKWHRKAAEQGHAEAQNNLGHCYHVGEGVEKDLKEAVKWYRKAAEQGHAEAQNNLGHCYEHGKGVEKDLKEAVKWYRKAAEQGDAEAQNNLGCRYYVGKGVEGDWKEAVKWYRKAAEQGHAASQIELGWCYKYGEGVEKDLKEAVKWYREAAEQGNAEAQHNLGACYEHGNGVEKDWKEAVKWYRKAADQGHVEAQNNLGWCYKYGNGVEEDLKEAAKWYRKAAEQGHAASQTELGWCYKHGKGVEKDLKEAAKWYREAAEWYRENAERGCAEAQNNLGDCYRHGRGVEKDWKEAVKWYRKAADQGHVTAQKNLICRKSRATQTTMRW
      >gi|Smar1000001056|ref|SMAR012323-PA pep:novel scaffold:Smar1:JH432129:50962:52819:1 gene:SMAR012323 transcript:SMAR012323-RA
      MSLVETTAMTAAILTMQEQGIIKVVSPVFGQVLSPVFPVPKPDGSAPDPEASKIDAFAHFWSDFSYAFSPFSVISKVPWKVHQDKATILLLMPLWMTQPWFPRLLESLIATPVWFPAKDLLRLEHSPQEKPRLNDQLVLLGCKISGNPMLPKVFRKTLQSSSWTGGGQTLSPRTQREFEVEKLSGFTTDYCD
      >gi|Sarc1000001093|ref|SARC_01068T0 | SARC_01068 | Sphaeroforma arctica JP610 hypothetical protein (320 aa)
      MATTDNTYTFDTVVPKIRNHAINTSAVLRGPSLYGVASTRFQPRSNGTDTLKRPPIDARATRATQPAPSSSTTLPSYYTVQSLTLRATPQLDTALATTFSTALINSGADISIVTSTDHLTDLQSGTYTIELAGGSTTTAYTRGTILGLGPTLVLPTAAQPILSVRNLQCNGYTVTFPEINTPNTPGESHITYGTDTIQIVTEAAGRYQLNAFQAESLHLHHARFGHQSTRTTKAMALANNLPIKPPLTYCNDYQLAPDLFEDHVQKPFGPIDIDLFSSKHNKQVPTYCTEDFTDKDTHYHDAYKQNWHKPNKKLYGNQP
      >gi|Mcir1000011152|ref|jgi|Mucci2|115786|Genemark1.11549_g
      MQTATVNHQQQFKSKRQFKIWEKQQKENAAKRLNRALYANQSPFRYAERVYKSRVLSDADAREIVDFSNLSNNTTQVQSNIVQVPLKHDLGTLSNAFSADMSHNKSNYALIMKNVPGLIVIPDAFSPHAQRQLVKHCLRDYAKPPHTSNLDGHYHVSKDGIWPLYEQEKKGSLVPGDANYYVPIKTVPQDENDMYPTAIGDDDNNDAFSVASSMYSDISGKSAPPRAASSPTQMLKKQRWVTLGYQYHWGTKKYNLDDPIPIPSEIADLMKAVVTATEDIGCQDAEVPWKNQYKGADFRPEAGIINFYQLQSTLMGHVDQSEINMEAPLISMSLGHSCIYLIGGNTRDTKPIPLRLNSGDMLVMTAIARRAYHGVPRILENTLPEYMLPESVDDEDWKPFGDYMKTTRINLNIRQVFPK
      >gi|Lhya1000001164|ref|jgi|Lichy1|190365|gm1.1132_g
      MSNKNQYNVLVDMDEGGYTQPATIESDGLEFQDFSSNTGHHGSAPPPPPPPAASSTTNFFDTQESRRSGANKAIWSLDYYSQFFDVDTSQVIERCLKTLYPVGDFASDTLNNQPDLYGPFWIATTVVFAMFVCSSLAGSLAAYIADVPHQSDFRLLSYAVGVVYSYGFLCPALVWLATKYFGCQPSLLEIVNYYGYGLTVWIPVSILCVMPFDIARWVFVGVAALLTGYFLVKNLYIVISKTDAKTSRILLLAILEMQPSLANKHELMSSRRQRKILERQQLKAAQLKQDTKTYVNQSPFRYVERNFKSRVPPPDFSKVIDLHQHPNHDRVVPVGLACDLSCPLFEQKKPSRAYILHDIPGKHELISFMSNCSHLFNRQLIRECLSLYTRPPNTSNLDTHYAIPEQGLWNLCEKEHHGQLDPSFVVPRKTMEEWQLQTNDSEKHDPPPVESMPLLRPFELMHRMRWVTLGYQYHWPTKTYHFDKRYPMPALVDQLTSSIAYAVDGVGQEGVWKNTYRGQDFKAEAGVVNYYQYRDALMGHVDRSELNMDAPLISVSLGNSCIYVIGGTTQDTEPVPLALHSGDIVVMTKPCRKFFHGVPRIIEDTLPEYLSSPLSNTEEQDDDWELYSEFLKTSRININVRQVFPP
      >gi|Smar1000011252|ref|SMAR003465-PA pep:novel scaffold:Smar1:AFFK01018421:2875:3482:-1 gene:SMAR003465 transcript:SMAR003465-RA
      MDISVIHIPGKLNKIADFLSRDFTSSDGEWSLDTYTLNNLFSIFVTPEVDWFATRLNYKLPKFCAWGPDPMAWKLNTTNNPEDEQRPSRPADYRSPNLVCATAREPDRSTSTLCSKGQIVLGAQARGQAPITQQNDPAWMQDLREPLQAGGFSTHAADLYTASWRKGTVTSYTSGIKVAELS
      >gi|Vcar1000001335|ref|jgi|Volca1|89254|fgenesh4_pg.C_scaffold_12000015
      MVPFRRTATPTPTAAAPHDPVNVARAPPPAEQVSHSVPVHSAVQQSAEPDTPPPPFNVPRLTQITSKPCRLYAAASMMGVHRVDSDWMLSCTIFLDLYSQYGLFTVDACCDDFGINAHIIPFFSPSCSFLSAQVDGLWFAYSLWMFWAVR
      >gi|Vcar1000001334|ref|jgi|Volca1|104096|estExt_fgenesh4_pg.C_120014
      MAVQYMHEAITSAVRAAMAELPPAPWSTAPQPHTGPYAPLPTIAQPRPPYSNLLHVSRDFPPFDPKDIRADIGGWDITMRHNLDMAGVAEDSPEAAKITLSVLRGTMGDSLRRLNADPATRFTSYHAVLQAVTPLAPDLIDLTLTISTGRDAANRIRDDPTPPRPRDHRRNDPMDTSAALVRLPQGTRTINVPAQLRHQRQDAGRCLQCGSEFHDKLSCPDLIHLTAPTLAAMTSTAAPTQTAAARPVPVNVAFKPPPAEQVSHSLPVHFAVQRSAESDTPPPPPFNVSRLTQITTSTTDAHRVDSDWMLTRAIFLDLDSQYGPFTVDACCDGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIILPDRPTAPWAPLIRHMTVMRRFPAGAWIVCRRDPSDASRAPGARPLSGEPPTV
      >gi|Smin1000031348|ref|symbB.v1.2.027722.t1|scaffold2860.1|size68653|1
      MAISSDLAAIKVLCKNQRPTSESINHDLSPFLDELKSSPRFGTSLLNKLARRKKLRQLSLVLDALLLNRCDVNVFHYGVSISAFEKAEKWDRALLLLRQMDVVRVEPNTVTYSACISAMEKCGLWRQAVDLLAEMEYRRVEKNVITISAGISACSKAGHWWLALDVLDKMCKDQIDPDTVAFNACISSCSSQWQVALHLFQQMSGFKLQPDVISYSGAIAACEEGLQWETTLQLFTDLQSKEVHLDDFSYNALISSFSRGSAWQLCLHVLQLKQLASCASTASIASETNHSNEFEFATNDTNDTNDTTDICKLLVPSVPSVNFVKLVEGLDIFAVSATVSASLELQWAVGYPFLPAKEEQLTHGFYKYIAGMQALCARELLELVPHAQNIMDMFCGSGTVLVEALRTGKRAIGCDVSPLALLVATHHSDAARIDLYELFEVARELVASMEARNEGWHYLKSRISNLRSKNLRDALHFILLVSLSRVQDVTYLHSSSKVIKSSVPDHGLPPCMFLGVAQLYVARVRSLRARALESECEIYRCDARVLRLEPVDAIVTGPPYPGVYDYHSPANMCADLLGENILYDFCAPGYSIRGSKAPTNVEMAHEKSSTYAAGREIGQNRLWLEDSDFAEIWQSEQEAWLTSAFENLREGGTATLMIGDGDLHSAGDGGFDNLEPTIIAAEKVGFATIATATIRGKSKHPKQPKGMKRTEHVVHLKKPKL
      >gi|Bnat1000011351|ref|jgi|Bigna1|87291|estExt_fgenesh1_pg.C_180193
      MIQSDLEAKRPSSNPSTQDGGGKGGGTIVPLRLGDETALLKNQPTNVERKERLDSTKSIPAGKLGAGDTTLHVGFLSKEEANSALSAFQNGEVIFQQWYHMPDKRSPAGPLRKLRRIKAALCNPEKDGRIPLYRFPVNDQKRYGETIPMPPALEAIRKRAAKVTGFEYNHAVVLLYRNGDDCIGFHKDKTLDLDDKAPIISISLGAERPYVLRDNIRAPKIEQEFIFPHGSLLALGPETNANFYHSVRQLKKEEETGDVRISITFRKVATFRSEDGKIITGKGAGRGTNLWPEAINGAHRLDTKLDESLKQAESVEDDAKARTVREAHHQAHEKRMAERKAAKKQKQRSLDKTKIEEKLSGVNRDDGESLKDLKIRLMGLVRKDIKNFALPGVNIDDKGIEQELSSLVGKFLGAKVASSKNAEA
      >gi|Uram1000001377|ref|jgi|Umbra1|222849|fgenesh1_kg.5_#_213_#_combest_scaffold_5_101108
      MSAKRSNSQILHHFGRVFARLTVKDSSTPDLFKSNSSPSTHHMPSDPLASHLFQTANNHLFGTNGFAKDPKVAVTYLRQAASQGHAQAEGVLGFCYEFGLGNIATDFRQAEALYIRAARKSDGLAMARLAFLRKYGRPGVKIDRGEAELWTERLCNLGVESIQWIREAASQHNCPEAQYVLGVCYHDGVGPKANEAEAFRWYKTSAEQGNARGQGILGYCYGEGYGVKKDDVEAIRWYRLAADQGETVAIYNLGYCYEDGIGVERNVTEAVKWYKLAAEQGNAFAQNSLGYCYEDGIGINRDSQKAATWYQKSADQGYPWAQCNLGFCYQNGIGVEKNERLGAYWYRQAANQGHARAQHNLGFCYQNGIGVPKSATDAVHWYTKSAERGNSFAYHSLGYCYQNGVGVPVDGKRAVYWYYLSAKEDHAPAQLSLGYCYRNGIGVEKNETEAFKWFYKSAAQGNALAQNSLGFCYEEGIGTKKAPKLAVSWYSKSAKQGNSWAQCNLGFCYANGIGVNKNYQKAVFWYQQAAAQNHARAMDKLGMHLQSGQGIEQDVKLAFEMFKRSASLQHVAAQFHLANCFENGLGCDVDLAEATMWYERAALAGCRTSHERLRRLLMRACLDSAGGTGNDEDSPLGEGAYYSTFAYGHCAPAA
      >gi|Fcyl1000021378|ref|jgi|Fracy1|239244|fgenesh2_pg.6_#_646
      MTMLTDDNGINERRFRSVLPHEPLPIDGDSKGYAHLQGAFEPKRNKLTNNSNDKKSSTTTTTSSSSSLSYWKSPEALEEVLLLALRDDFERLKLPYPVLSVHVTDPNNLSKVRLEYNSPHEAIQIQYAFRDQRISPHDIIILTDEQQHVLFGSRPCQATMITDKPLPTDRCFWPRSNPPQFRRLLHDRGDDEKERSETRFVYVTGLIDNNITAELSDWWNNPYYVYQAMRQVFGTDVEIFLPKKINKRQQRIQSCQLGFRSAEEAQDAVQKFQGMVVSWELKNDNISNSTNKNINSGELFLDYATVTKKSKAKQQGEYEKGEASRPDCTSTTDHVYVPGLVVVENFLTEAEEELLVAILTGPQAPWAPQQSNMSQTGSVKRRVQHYGYVFDYRTADVLRRDVEEESGRLDPDANCPPLPSIPVEKGMEKQTNDNDLLVKEGKGWELLAQIIEKTRQHEFDVCSNKSNNSLNNNENADLIDPQPTKKMIFSDLNQMTLNQYKTGEGIGSHVDTPSAFGDGLISISLSSGIVMEFQKVTVGNDDDGNKVSPKNIKKLVYLPRRSLVLMSGACRYEWEHHIVTRRTDTHNGVVIPRGLRVSLTLRTALSSKGIPLPRFESNMFPPVWGINDEKRNGSTMDSNVLVTPNTERDHVHAVYDAIATQWHHTRGKRGVLWPSASLFVKDLPEGSIVADCGCGDGKYFPAVWEAGSYVIGTDISLPLLKTALLNDSASLSDGGKVPDTRRVSPHRESLQKRPAVAVADCMSIPFRNNSCDAAICIAVMHHLSTIDRRIRCIEELARIVKINGKIMIQAWAMGCHSRRRFAAPDVFVPFNAQPKYLDKVSENSKTTMQDFKAGTMNDSNTGVAATSVVPTINGPHASKSAAEIYSDAYDNADFDEQKGLVVFQRYCHMYREGELEDIVSQIPSVKLIDGGFETGNYFVILEVVAN
      >gi|Vcar1000011421|ref|jgi|Volca1|45994|gw1.116.24.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSCFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Uram1000001443|ref|jgi|Umbra1|223189|fgenesh1_kg.5_#_553_#_combest_scaffold_5_102347
      MHELNTDQLEELFGSDNSDFDDSFSDDQTRMDARLGSDRNRSPARMLTSEKFGRIPGLRLLRQGLSHSVQTKLLDTIISAKYFNGATNNQAMHFGDLPQQFQDIGQWAKRDTDLLESIDREPLFDQAILNLYRPGEGIRSHVDLQKFDDGILIVSLLSSCVMIMTPASEAQKHATSYSEEGNLDDGIPILLRPGDVLAMIGPARWDWAHCIPARLVDDVNGDIIQRGSRVSITLRKLNCTTADAIELGHRQ
      >gi|Caps1000011463|ref|jgi|Capca1|150134|estExt_Genewise1.C_2310033
      MICNGGNCIYINNMTSESRGSSKPCACKGIRSCLLCERDSPSTPRSVSVTSSWSADTHDVKDRKEATKQIYVYCHRCRRAHIAPWTATPSLKDVINHAELNCSPSDSDLPLQGIHLFENFISADEEKELCDRINCTAWVVSQSGRRKQDYGPKVNFKRKKVKLASFSGLPSYSEPFIQRMLQLPQLSDFTPVELCNLEYSRERGSAIDAHFDDFWLWGERLLTINMVSDTVYTMTNEGLPRTEILIHFPRFAFIVMMGEARYEWKHAIQRTHVAQRRMATTIRELTPEFLPGGKQEDVGREILDLALTYKGRAVGSAS
      >gi|Fcyl1000031479|ref|jgi|Fracy1|249345|fgenesh2_pg.26_#_236
      MAPPTIATFFSMAWSLMVAPECLSLATISVSQKKRVEVVEPGLVILRNFIDDEACQRIAAMAKDFGDEFYTVNKEGEKILNTGESRGRIYDAATRFPRDLIQLSNDAVSTSRAADTSMPAMQCTHVLLNLYTTSEGLVWHRDIYENDGKSDHPVVNLSIGATCVFGFKHLDTDEERTVELRSGDILLFGGPCRLIKHAVLEIKLDDAPEWMSYDPSRFSFTFRDSPEVLGREEEFKYFRVKEDLVGQDNFKVPTSSTDRKAFHGLPSYTTQQHVSMAS
      >gi|Ttra1000001477|ref|AMSG_01559T0 | AMSG_01559 | Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase family Oxidoreductase (253 aa)
      MADSNEMAPGRARREVKECVRYEVDVPAETIAASRRTVAALPSAVDRPRLAGVTDAGVPGFLYLPHYLSAEEQSAVLAAIEGDTSVDWSDAFETRLQKHWGWAFVYECGQIVGGEAAPPAPALLLEVLAERFVADGLVATPPNQVLANKYMPGNGIGYHVDRIDLFGDVIIGVNLVVPTKFTLKSVAEPEERVSVVMAPGSVYVLSGEARFGWRHGITRKARLYKRFRRAPELLPDEPWRVSLTFRDVLEAT
      >gi|Pbla1000001570|ref|jgi|Phybl1|23346|e_gw1.22.32.1
      MDSNVIEQEIMSLLCKKAIEEVHEPGFRSRIFTIPKKTGDLRPVLNLRPLNQYIPKQSFKMESIKHVCQLIQRGDYLTSVDLQDAFLHILIVKSSRKYLQFSWKGHIYQFRVLPFGLSLSPLVFTKMVRPVLRWARRQGIRLSAYLDDLIVIARSPTLSLQHTQRVVDKLQSLGFLIKATKSHLTPSTSIEHLGFVINSKDMTLIIPRSKLRDIRREASRLLHNPTITLRQLSSFIGKAQATTLAVLPARLQTRQLISIRNQALYRGLQWTSPIHLSSMARQELQWWIDQLKAWNGHSFLPEVPQVEVYTDASETGWGIVYDNTVLSGTWTTDQQTEHINYLELMTIAFATKLPRLQGKALRIYCDNMTTIAYVNHFGGTRSAKLMNLATDMWKQCLVTGTRVRLAYIPSPLNPADPPSRSLIQQLEWSICPTFFRHLDSLWGPHHIDCFASSLNTQLPTYMTWKWDPQAFAMDALSVPWTTWPRLYLCPPWNLLLHILQKLQREKVPATLITPNWSSALWYPLLLQLSSRPPIPIPRHLVLPAPGCAGHVLLKNPHWNMIAWDINYAG
      >gi|Sarc1000011591|ref|SARC_11325T0 | SARC_11325 | Sphaeroforma arctica JP610 hypothetical protein (77 aa)
      MEQVRSQGLNTKSRVTWWYPVVCDAKHRTNEYKAATFVFDKAERAWRPLTHDAFALPGNQQLPKYFSPSTDEGALT
      >gi|Ccor1000001613|ref|jgi|Conco1|3535|gm1.1775_g
      MSYNKEWRIDYEYGYIQEPRALSSKKEANNMQVGNYDTQSYPRNRDTRSPYKFASSSSDLINYSVSSHAGQSHQQFTPITPCFPLRSPNEEISIEKNPRLEINYLLSHQRLNETLSSPNIQHSDSMASLSSREAIKDDTARRASWPTKSQAQSMQNPVRAQGLKRPLEEEPELSTSDEYPIAVIRQIQEEIERIILDTHSNRSMTACAHINTSSHSPNTCQNCNENQFFQTQDLWKFLNNEKTENYSLVVSFYTVCRIWCEARMAAELSPREMASGILERYSIVKLDRNVMDDSRCEGWDWFYNKIKFQNSCQISYDTIFKHFVHLVRTNEELLNMNWIDNIYTQLFHQTNSSLKFFEGESQIQRDRGQFFTPIEVVDFMWNIAIPSWDDHIKHQIMSARIQDLVFDPCMGLGTSILYYIDRVIKQLKSNHCKVIWNNPMSLEVIYNKLRSGIFGIELDLVPFLVCLCRIEAHLFPIYIQISRLNRDLTEINNSPVRLFWNDTLQLLPREEFIKRPTDTIYRRWSEAQINKLRNFKYSFSIVLTNPPYLLRHQFSFPDPKLYSQAILGGKGHQAYLYFIWLGLHCISHEYGQLVMITPSQWLTCEFAQRFRDWMWGRIWMHRLFLFDPFKVFRKVQTNSLIFTLSWRDKDTQQKEHQIGFFKCKDKKLSLKSMLEHYHYWNCQLNNKCGMDQVLDPNSSCSIQLKLTTQLETKLTESPANPFYSLIPHNNLSHLMHQLAANLPTLCDSNGKTVEWDSSSPLIWNRGPNTNPVYGLVVRTEWAKQNFGEMIYSKYMVPCFYWNGCSNNPLRDEGVSGEGTKEIQFWNKRDPMRLSRKESSPAESYVVHQQYFKHYSLIMVDKLTAKDIIQKNNHAQYKIANPSICVFYQFLKEVREELQPQYADREIAYSNIQKCGNNMGQKIVTPINYGYFTTTQPRQRFFLEEESTSVTNQCIYFTIKQTDLYDKPILDADYYLALLNSAIIQYYMNIHCQYDQQGRLRLFRSNMAHIPFQYPESSAYIQILQLSRLMKSLKSVVYNLQTIFNPSVTGPSFLLEPLRRGLIDLSEPNRWQIVEDCCNEISQRNNINSFEWVQSRLRFVYRMVNFVQLKIDLLMFEIYQLPFEFVEELFQELDLMTQYENYLQEVDAINIKGGYNGWVNEVEIVLAEFDVWIKSIY
      >gi|Spun1000001767|ref|SPPG_01847T0 | SPPG_01847 | Spizellomyces punctatus DAOM BR117 hypothetical protein (259 aa)
      MGSMSCSHMVGRLRTFPFLSLQATRACKRSHVTTPSPKRNLGQLHPSNPLLDLSNHTSNLPPPSSLTIIPNFLSPAQQALITKSADKKLRRLCGREYWKGHFDGVIEKYRECSVGAWSRSGGDEPEIVGIVDEIKKVVEEVVEADGGRVGKWLDPHILDLGEGGEIRGHVDNVQASGTIITGLSLLSSAVIIFRQVKDPTQSFSALVEPGSLYIQRDALRFDYTHEIPDDPALRKFRGNTIPRGRRISIMMRNEREVV
      >gi|Fcyl1000111862|ref|jgi|Fracy1|186334|e_gw1.6.1306.1 ; gi|Fcyl1000084765|ref|jgi|Fracy1|158880|gw1.6.1166.1 ; gi|Fcyl1000088082|ref|jgi|Fracy1|162384|gw1.6.1362.1 ; gi|Fcyl1000086782|ref|jgi|Fracy1|161015|gw1.6.1306.1
      MLAKNKEKLLKINPAHYEKLKTIYIATTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAFPDTDACFGSLGSFFQFHPKHGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLKASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDTAASKWPLTEIAINELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKMAGGAKKYKRNT
      >gi|Lhya1000011955|ref|jgi|Lichy1|139241|e_gw1.1115.1.1
      KINELKISFPDISTDLLLEILLSCNGSVKQSKHLLYESIPNKKRKLNSDNINNTIYQSTLKDIFHFKSKEIEKKINSNIITLYDKNDVEKALSPYVTFHKNFLPQELSNSILKYILYENTIQDAFTNKEFYIFDKKCKSSHLTSIFSSNEDFITGKNKLFYYAKKSNNIKKYNNDLLIAQLLVEDIVNKEILKYSNKQIYPFLDKNQFSGEVAFINKYENEYQHLDWHSDILTYIGPHCVIASLSLGVKREFKFKKKCDEKNNLIYSIPLPHNTLCIMHAGCQEIFKHCITKSNLPINSHPISGKTRINITYRSYRKDFINNIPKCKCGIDMALKICYKNIKNRGRYFWSCEGTYQNNSCYDFYWADFNDKKLITKDYNKCSIWFENDNQKTINFN
      >gi|Fcyl1000122023|ref|jgi|Fracy1|196495|e_gw1.28.330.1 ; gi|Fcyl1000088062|ref|jgi|Fracy1|162364|gw1.28.330.1
      MVPTKQPFNAILVRLYFDGNDEIAWHTDGRTFLGTTPTIASLSFGSKANFQMRRMTNNNNNIKSSGIDYNTPQHDFIVGDGDMLVMLDETQKYWHHRVPKEKGRRPRININFRYINPGKD
      >gi|Fcyl1000032066|ref|jgi|Fracy1|249932|fgenesh2_pg.28_#_266
      MSVPAACSSSSSSSSSSSSSDMNNNNNKNNIEEINDKSKGLHVKVFRRYQDQIQSDYWWNTILNNINWYRVKYKSGRFQKNCETPCWTTFFGGRKEYTPYQDIPDWLQPLVNQVSSDLMVPTKQPFNAILVRLYFDGNDEIAWHTDGRTFLGTTPTIASLSFGSKANFQMRRMTNVWPSVNRNVVNNNVNNNNSSSRSSSSNKNNNNIKSSGIDYNTPQHDFIVGDGDMLVMLDETQKYWHHRVPKEKGRRPRININFRYINPGKDAERGQKTYYKYMVHGDDDEKSLKSYSYKSILAMRGGIMNFISSSGKPSSSSSSSSNHNHKTGSNNYDVPTAGMVQKYKNGNDEDDVINDGMNSTDDATTTTTTTTSTSTSSSTTTQQYYYLSASENNNIDKSAFMALPDDIRKELIHECKKGKAKRKYGHQQ
      >gi|Sarc1000002122|ref|SARC_02073T0 | SARC_02073 | Sphaeroforma arctica JP610 hypothetical protein (178 aa)
      MDTRRIKVFLGYRYDYGKTQTGPRQLLYDDVDTMSKLSLVALVREVLVRVLRNDMQGYLKLPGRNFFNQAVMVCYLKFNSGLGTHQDSKSLFQRPIISIRLLADAALSFNRVGRECRRTPQSFSVPQSVGTVTLLERFAADYATHCVLKKDLKVPSLSIVFRGVQQSAVEAMRSSQV
      >gi|Vcar1000012130|ref|jgi|Volca1|97056|fgenesh4_pg.C_scaffold_57000068
      MNISTEPDTPPPPFNVSRLNQIATSTTRAQRVDSDWMLARTVFLDLDSQYSPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSTPSTFISQYLESKSTNPRTSAVIVLPDRPTAPWTPLIRHMTVVRRFPAGARIVCHRDPSDASST
      >gi|Mver1000012212|ref|MVEG_12216T0 | MVEG_12216 | Mortierella verticillata NRRL 6337 hypothetical protein (214 aa)
      MIFFGSDSDISQIRSTYPIRKPTSDNGWGIVIGNRSWSGLWLMPQRHLHINTKELLMVFMAADLQECQGQMLNIICDNMTSIAYINHFGGTHSPELMHWATKLWDRCLKTGTRLKMTYISSSFNPANAPSHQMIAQLEWSIDPSFFLWMDKKWGPHKVDLFASEQNHQTTCFMTWKPCKMAMAWDALQQPWMTLGRVYCCPPWNLIPAVLQKV
      >gi|Wseb1000002244|ref|jgi|Walse1|68555|estExt_Genemark1.C_70130
      MRGGDFFYMNEFLKQNEANELYNQALELEFYRPTLKIYGKDVIQSRQVAVYAIEEKRAHMKYSNHDAKVNHPFPQLVNQIAGRLKEVTGVDFTHCMLNYYQDGSVYIGKHNDNFNNQVIATVSLGAERTIHLSPQTTKAALKVYPETDVPGREKSTLKLTNGSLFVMQGSTQRYWKHEIKKEPKVKTGRISLTYRQIVD
      >gi|Psoj1000002243|ref|128295
      MTAGLAVLVALSVLLFVGAIVMLLFFCRRIQVQRERESLVFIQEPADVYREELNDSFMEQALWRCGVCKFLNHPERKLCDLCQTLKGAERDVAGNKKHTSSGSRGGSFSRGSLSSQRMGRIGETEAFSSSQADDAIGGSFGRPSFEFSRKQKPQNKLSKLQLAASRRQQWKRMPTTDGLHRWVRQEDKSARGYNARNTQRDSIESDPGGRMSLNRYLDATERDSNLSVGYIRVRDSLGRLVLNESDVVATDFHYRINILDGTGSSELIMNLEGVHNLPFPDKIRWFSTEIHRLWLPWESGHAELVVRRDHLLQDSFELVAAMKPYQLRQRWRVVFDGEPALDAGGVMREWFTLLFAELFDPAFGLFVSTVGDERSYWINASSDLLIGEEDHLAYFEFAGRLVGKAILEEHLMPVHLALPFLKHVLGVPISFSDLQFLDDEIYNSALMVKKIDDIEPLCLDFTATRIVDGKPEIVELVEGGANIDVTRENRARYLDALFKYHVLGSVSEQLLSFLTALYDVVPEGLLKLFDYQELELLMCGVPSIDVEDWKKHTDFKFFTHNFPTELELNNIEWFWEVVEDMKNEDRVRLLQFATGTSRVPAQGFKGLISSDGRVRRFNVAFAGANQSFLFPKAHTCFNRLDLPIYNSKEVLSEYVKLIVQMDITGFTIDRMREDKQEQECHRDENGFADFKYPKGWCQPTKPPAWVPPVAKQAKRPQSSDGGPSKRAKLSPPPEQEFSLATVVANVKKAAIEHSHLPAEVASRLFEEDATISYLTTDHKSWVYHVPQWYKHVFEHVTPEELWEAMADDDDTESQVTWATLFEQAWEAHPKQHDTIMMFGKPAKLPRFQQLCGEMGSYRYSGKTFEAQKKYPRGLEHAVLHMQRMVEDPATHRTRLTGGLVNWYENGDHYIGPHADDERDMMACSPIVALSLGATRHFVFTKKTSKSAPQGDEAVARLELQIGDGDLMIMGGTTQRTHKHAVPKMARCREPRISITLRCFH
      >gi|Lhya1000002295|ref|jgi|Lichy1|204403|estExt_Genemark1.C_180068
      MQPPSHLTSRRQRKIWEQQQRDNAVKRQKKDIYENQTPFRHAERHFKLTRQLDFSNVVDFDLPPDAKDKRIVSVSLHNALPESLFGKATDTAYILKGVNGLIYIPNPFSPDKQRHYVKQCLSTYALPPNKSNLDPHYKIPAQGLWHLYTKEQKEEEEEEEEAKIVSINNHEQQVLPSDIMHKLRWITLGYQYDWTTKKYNDEAYLIADDLSELTKAIVAAIEGIGQQNEWKNTYSSDKFKAEAGVINYYQLKDTLMGHVDQSERNMDAPLISLSFGHACIYLLGGTTRDTEPIPIHLKSGDLLIMTGQCRKAYHGVPRIIENSLPEYLSPCIKDDDWKLYGSYMKTSRINLNIRQVE
      >gi|Mver1000002291|ref|MVEG_02285T0 | MVEG_02285 | Mortierella verticillata NRRL 6337 hypothetical protein (480 aa)
      MAQTSLPHNGTAEDWDFKMATLFSIFESTPEHILHQALTNAHGDLEQAIPIVLSGQQSASNTTANSHHPSRPKKKQRLVQPRLAAFLSSPSSSSSQSSSPSSCSLPTTTLAPSLSSSSNLPSLNDRLRWKDSIDDSTRPRERIKPLVLYNPEDVAKHCPCTLIHNVLDRDLASRLLQVMLVESETWNRNRWWLFERIVESPHKTSYYTEDSHDLAEVSGWTYNGKKQDPPRKFLKEMDEAKLVVRKIVNELRSARELHPYEEQGEWKCNVAAVNHYAHSKESVGWHADKLTYLGPRPTIGSLTLGATRFFRIRKVVPDVKNPDTAGQMISIALPHNSLCIMWPPMQEEWKHEIPAQATVTPHPIRHVDQRKRYGTARINITYRLSRPGFAPKDTPVSGSQEGKTCGEFHWVDMETKLGLVKLEEEGKDPSQDTIQDASSNSLKRRPSDILKDDDDPLDLASELLDDVSPIQAPEEGEAE
      >gi|Vcar1000012306|ref|jgi|Volca1|46001|gw1.61.90.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Sarc1000002310|ref|SARC_02258T0 | SARC_02258 | Sphaeroforma arctica JP610 hypothetical protein (167 aa)
      MSYPALFTKHITEAFDTVDIDLFADAKTAQAPIYCSLEPNAPLHDAFEQSWKQDRKYLYGNIPFHSKTTWRESLLKLERKNELITMIFPVIPGQRNYKQILLQLNSHVEIIRTNEHTFLQNDTNVTGPLSKWRYICLARVGNTKYSILPTDDLMSAEADVVAHPHA
      >gi|Vcar1000012324|ref|jgi|Volca1|91734|fgenesh4_pg.C_scaffold_22000129
      MASLFLKRNILLKKNTGRDAANRIRDDPTPPRPRDHRRNDPMDTSAALLRHQRQDAGRCLQCGSEFHDKLSCPDLIHLTAPTLAAMTSTAAPTQTAAARPVPVNVAFKPPPAEQIASKPARRVGCMRARVAATSTTEVHRVDSDWMLSCTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVRMVPRTSTPSTFISQYLESKTTNPRTSAIIILPDRPTAPWAPLIRHMTVVRRFPAGARIVCRRDPSDASRYVNDANKVVCRLPAYNEEV
      >gi|Lgig1000002422|ref|jgi|Lotgi1|103779|e_gw1.2.1002.1
      MDLFVVKTKRKRADSQSESSGSKKFKSYEKSECNVIDFKKFRLENLSCDYGILYSKREADNLLKECEKILCYNEGKLAKICLFGKWHNIPRKQVAHGDEGLSYKFSGNSIPARPWLPLLENIRDDITRQTGYKFNFVLINRYKDGSDYMGEHKDDEKDLEADHPIASLSLGQARDFIFKHQDSRGKNSTRRDIPPLKLVLQHGGLLMMNYPTNSYWYHSLPCRKKLFNVRINMTFRKMVVK
      >gi|Uram1000002459|ref|jgi|Umbra1|228021|fgenesh1_kg.10_#_156_#_combest_scaffold_10_5503
      METLQPPAHLKSRRQKEMWKRQLLQNVKERQKAQESQTPFRIAERSLQSKVATPDRPPVVDFTNLENNPPEVRAKLIRVELSHDLREICPLFGRTDDDWSKRRSIAYIHSDIEGLIFIPNPFTESAQRQLIYNCLQKYTQAPNSSSLDAHYEVPAEGVWNLYTKSRRCASNEDDSQLLISRKSRSNNGQKIPESADPYDAPSTSGQQSTSIEKDPDIADIKSEPSLHKPSELLHPSELIRKLRWVSLGYQYNWSEKTYFFDRPIPVPEEAAKLSIAIAKAVEGIGYRQDDGFRWQNNYKGDNYAPEAGIVNYYQLKDTLMAHVDRSELNMEAPLISMSLGHKCIYLIGEQTRDIKPHAILLQSGDVMAMTGASRSAFHGVPKILDDGPSFLQPGTIDENIPDWDVYGEYVSKARINLNIRQVNVDSSST
      >gi|Sarc1000002473|ref|SARC_02416T0 | SARC_02416 | Sphaeroforma arctica JP610 hypothetical protein (686 aa)
      MPELQLAGYQFYKEPCVSARAYRHVAPSIELVRAQKVQELTEALHAAVDSQYGGPSTQGRKRVHESSHYNDGSGSAVPILALERWMFNAKDLERDALRDISGKTGDEGGEKNQDYEKSVGRTEETDALLPDIINVAEKVEPVLVRDLVRGSHTQESAAEIATHVAELSHEFAVGINRLCKLGLSADLEPSSSMAPSCPQCGKIIEVRVRVTMHKHTIDVNTVRIRQRPDMKQKGTQVKYVQGVKEEGLEGGDGTNVSVGPPVCAVCVADGRTAPTRLLKLNHEHYAKLRALWDSAQAAESVADHAGNADGDGMGNSKANKKRNKNKKRNENRRKKKKAKGERDGNNNGTGAEHEEKTPTDTIKDIKNIEGKQGKKHSDTKPSSKTSAERSAEKDTSTNEAKKGDTAITGAIKRSNSSTVKRVTSDKHGKPIELDDKQHKNTEELFHNDLYSMLLRYNALSGMGFQAACSEHVFAALKSLFRTDFECFSSPLNTHHPLYCSAYIDTDFHFGSRGNFFDFYPASGSFQANPPFVTGVMERMAKHIETLLQKANTNSQPLSFIVVVPGWVNEVSYQTMLASPFMEGDGPLLISKDDHGFCDGAQHQRQDRYRNSPFDTAIFFLRTDAARNVYGERTFESDAILRKAFAEALPSDAAVARRARDARGFADLDRMGNGSSKKRKGFFKRY
      >gi|Ttra1000002481|ref|AMSG_02643T0 | AMSG_02643 | Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (292 aa)
      MASSPSKPSNVSAAAASPDLARAEAAIAAARGFSVHVLHDTADTFAYVAVGRGLLPLELADAAWAEASVPAHEAAQIVDGSLATSSGPMAWSQDEVTVGGRVVREPRLTAFLASAADVVYTYSGKANLGRAFPPAVSAVRAWLADALERGDDEYWNACLANCYIDGSQAVGWHSDAEDDLVPGSPIVTVSLGATRGLHFRSRAASDLFVAQNDARHARKHRPLTAAEAEVLATPTPTDLVLDLAHGDVLIMGGMTQSLFRHRLVRDADCHLPRIVLTFRAVVPSALAPPPL
      >gi|Crev1000002507|ref|jgi|Coere1|86268|fgenesh1_pg.10_#_86
      MSGPDRCHGADGPLSPLAAAFDYDSSGSLSSLDSFLDALSDCADDPLRPDSSLLPERQFNRSASATDAKSSQTVPLKQQEHKKEKQNLPSRNKRERTADVSSPKPRQTVNLQRQSMQTLVSSNNMITTGAFLTRTTLRKLGVADIVADPEASNAGLAVGSRIKVLNLDKHWYTAVVLAIDSGKALAHYPGWEHCYNEWVLIESRRLLYRGKADLGVSGSTKAMEQLEALYSGVEEPVLIGFDLAQAINDAFGISTGCNLKNGGADEAAANVFVNSVDNIDSGTLARKQSGVKESKEHRSRGRPAGTRNRRRAGHIKAKSSRKQRPTIQNDKAKKTLVGDQEESAAAECPSTPGVPAELRVPSVRLVRAAENPYARCHRSEDIFCGDSDENEATARESCLTAVRDVHNGGNEEPSGKKARIADDNDTATASSGNAIWHLTRGDYVTTGAFSTRRTIKALAHSGSTGGIMQDHHGYYPGQRVEIMNANQSWYQGRVIAYANKKFLIHYGGWDHANNEWIVAGSRRMRPASDINDMAVTETEEMARKACVVLVDEYNTYIDGVERKNAEKADAKRKARELRKTPVNVRVAKLAQSMSADSMGDDEAEEMTHACLPENEEEDDEDVDPISVEAGYTPVPQLLRVKDYVQLFRKGMQIAARDRNKLWWRATIAEIKTFRLRIHYTGFGSSWDEWVEMNTQRIMFEESAESSRCDAEMAGDPLHVSGENSSALSKEPNCMGQVISSSSHGKDAGQTDYGTVEESQETKGIPVPRRLGRPPGPETKSTPLSLRLALKALMSDREMFEQCHPEELDVFHLPKEHMSMRDYSTFLKVGDRVRIRDRDKQWYDCTIIDLRHGRIRICFNGHSDEFNQWIPVNSDRIRILRETIDGDKRLEKMEKESQIAQRRKQEKLRAQRRKRSQASIASLVRLAESLEYIVDCEDTFVSGHTQVGPQDGITELDAATEVQSRLEGSTEDDASGGDDKPLLQLMMESDIDNVDGMPLLTRILLAEHFKRQRFGALIRHGSMVAMQDSATWFVYCNQCNIVISTFRYYCLSCERPSDGYDYESYDLCLMCFSRQFPSDHPHSQASFARAAVGDAESIVKFTADALSRCRDHERLAAASAHMLDLFSGLIAVYEPDAFDTSYKPRTPGTSLWSKLAVGLHGTTTSTLDTSAVVGKIIGNTRRSRITSIINPDSEMLSSCNGHDADASSDKEDKDETDRQLCKADVDDLPPRCAFCSEDDQSQRDLLGTFAAEQPFVLSMVRDDGTVRRRRFWAHTACAKYSPEVLVTEAGQWFNVAAALRRARTIKCAECKRRGATIGCFHDRCQKSFHVACAGMSKSFFESGRIFWCPKHARMAAGVVEGNAGPEPVSLEARCANCNHELSGDLMWMECLECLAEPERQFSLCLTCYDSKDALADHPHKKRCFREHLSHTGGVSSNGQYLADIAAQDSRRRVGKGTTCCHYCRSRQSRRWRKGYAGVVMCEACFNTAHSLRGGAQAKQVQAGTVCDQDLFAEADNDSPGELEVVALNPFGRSLITGSDAQPLPPPQQQQQGALIEDYTQGIYFTREACIAPNRVGLPSVSQQPLGELSSYGPTDSMLFTLPVNTSYFDIPGRAPRWASHSGTDYHGTWLPQTVRRALLRYTQRGEHVLSNFLGRGTDAIECFLLNRKCVGVDINPSAVSLSQRNCSFTITPGCGMSIEFRPTIMQGDARDLRSDLWPGASYFAESESFDHILSHPPYKDCVLYSTNIDGDLSRFPGPDEFQREMEKVVTESWRLLKMGRHLTLGIGDNRAECFYIPVSYQLIRTYISSGFELEELVVKRQRYCQAFGLGTYLCVQFDFLMFTHEFIATLRKVPKDQIDSMHLADRHYAEDSEFGLQTVTVDKDPLDFRLVAISHRCLREVPASPIERKGVVMGSVWTFEHHPVHSFTHMCMSRMVERFGRDGSNWEQIDLALRPLEQGTTENAADGTNAASDIAAASDTQCTGDVIDDKCASLNNARNQAESDPELLDSDTEEGGYERARQRQIQQNREQLLQLGLVSELGEDSTDIAHYQKMIAMTPLPPTSSAPLALIVVPHILNTEFARCHVEPYRRTLVQITHDASHRLCPSGLLVLGVQDVRDEHGKLWPLGMLVLEDVQRAVGSIRLRLKEFIVVVENGYARKRDDVMSRETFVDEQCVVEVNTPDIHVPIVHAYYLVFMKLK
      >gi|Fcyl1000122509|ref|jgi|Fracy1|196981|e_gw1.31.61.1 ; gi|Fcyl1000122713|ref|jgi|Fracy1|197185|e_gw1.31.263.1 ; gi|Fcyl1000088412|ref|jgi|Fracy1|162734|gw1.31.263.1 ; gi|Fcyl1000066918|ref|jgi|Fracy1|139673|gw1.31.61.1
      LHTKLQNYFTPTSILENIAVLVTPKVDPSASLSSLALIRLSKQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKSLETAVEGVNAASVISRLMSSDYLLSSNNDNNNGKIWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHQQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDVDNKHNNNLFTVASFVKQVNFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGDGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRVSLVFKKTLGYSKERTKSRTKV
      >gi|Wseb1000002511|ref|jgi|Walse1|64082|gm1.2421_g
      MLKRTLVRLNKIERINLSNASNNRLFNFSNLKSDALSELDKEDFVIYPNYLNIEEQKVLLKQLLKKLDRVCGKPRRNTNLQRQHEEQYEEGNLQRAFCHKDMYRWQTSHFDNVITGYREANVRSMTVPNVVSEEGILGILKRLYGCLYDNSTELTKLQANDMKDERLEDDDLSVPKWIQSHILHLSPDGTIQAHVDNQEAMGSTIMGLSLGEERLVEFNNESKGSFLVRLPSGSVYIQKSKLRYEYKHSILQGNCRDQRLSLMLRDQPSPK
      >gi|Fcyl1000032662|ref|jgi|Fracy1|250528|fgenesh2_pg.31_#_127
      MGTVLYNVNHYLATITRDGERTMFTQSDDGDRSTTNHDKKFDTHNDKSKNEDKNGGKRNVLFDMTEKFGLEIRKKFIGNDQSPTKNSTIIDRKTTTEEQESKSEETYQDMGRMLMKLMLPSNSNDSKVESLNDVIEEVKGMSGRGDIQDNNTIVEVFNVAKRCHNMLDSQLNEFFGEKGSPPLYLTNLIYYIEREDEIKNPTWKRRKHYFFPGIDIAQMDDLNEKLKLTDLAYEDTIDEIRDRLDIEYNSELVYCSLESLPNKPAHFIAVKRDQSPRSKELEVLLVVCGTKRITDIITDLICDATVYREGFAHYGIRDSGQWIANEHSDLFEKLRVLANKKKIKLTLLGHSLGAGAASIAGIELNDNPFIDVKVVGFGCPAMMSSELSESYEDIITTVIGDNDCIPRMSMATMVNALLDITELDYTPFALRDFEETVDEMQRFLPSYVDDILENIAVLVTPKVDPSASLSSLALIRLSKQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKSLETAVEGVNAASVISRLMSSDYLLSSNNDNNNGKIWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHQQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDVDNKHNNNLFTVASFVKQVNFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGDGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRLEP
      >gi|Mcir1000002688|ref|jgi|Mucci2|106998|Genemark1.2761_g
      MPPTTSTNITFKSRRQQKIWERQRKETEAKKKTSTSYVNQAPFRYAERNFKSRVPPPDFSQVVDFEKMQDHSDIIVPVQLTDDLRRLSSVFGQCEAPCRDAYVLKNVPGLIIIPNAFTPAAQRSLIKQCLSVYPKPPNTSNLDTHYIIPDTGIWPLYEAQEKGTLKPTDPEYHVPKKVVVDGSSSTYSDDEDKEEEKEPPRMAPTACSDDFQPVIKDPKPDPLPAPGVPLLSPSEMVRKMRWITLGYQYHWPTKTYHLDRRYPFPADVADLTKAVVTAVENIGHGDWINQYKGEDFNAEAGVINYYQYRDTLMGHVDRSEMNMEAPLVSLSLGQSCIYLIGGLTRDTVPVPLLLRSGDIVVMTGPCRKAFHGVPLIMENTLPDYLSNNDQYEDAPDWKLFGDFMSTSRINLNIRQVYPRHQSEETVQ
      >gi|Mver1000002798|ref|MVEG_02791T0 | MVEG_02791 | Mortierella verticillata NRRL 6337 hypothetical protein (245 aa)
      MTRSIGDGRVPGAPDSIYYLPDFISAEEEQALISKVLTAPKPKWVYLKKRRLQNWGGIVMNNGMIAESLPTWLTNLHPRFQESGVFDGLHPTLNEPNHCLVNEYLAGQGILPHKDGPAYLPTVATISLSSHCILEFYKCPTGSDEPGMDKLSNSRSQEPEFSILVQPRSLLVLKSDVYKSYMHGIREITVDTLAESNILNLVEAMPGMDLSEARAKQLDRGTRISLTFRIVEKTKSGRKFLLGR
      >gi|Sarc1000012860|ref|SARC_12577T0 | SARC_12577 | Sphaeroforma arctica JP610 hypothetical protein (80 aa)
      MHHFIAQFNVTITHVPGELNKLPDMWSRVHQGTADINYPSVFYTSCWKLYPDFFDQVQKNLGPNDVDAFASSHNTQLDNT
      >gi|Wseb1000002876|ref|jgi|Walse1|60458|estExt_fgenesh1_kg.C_100073
      MPLSKQRFARPPTPPETDTAIRRSERFYKRKDIPLDLSYAFDWQRDEKDAIKIAEKCYTFEKHPGLIFLPEYLNEEEQKGLIRQSVKDIPTPPNRTSLDAHYYMPKEGLWYHYANQTKDDIALPRATKEEKREPPSYYAPSGTRPTINNEPSTFEILKQISRSNNPEIPPSTTVKPLNGERAMNKLRWTNIGHYYHWGLKQYDFSVRDPQTGGPIAVPAPVSDVCKSVVSSIPWERTSVADQASEWKKSYRPDAGIINYYNLNDTLMAHVDRSEVTATLPLVSISLGHSAILLIGDDIRESINPPTAIVLRSGDVIVMSGPTRRSYHGVPRILERTLPEHLKSQEDDEEWEPYARYLSKTRINVNVRQSGLTDEQITELVSV
      >gi|Chet1000002884|ref|jgi|CocheC5_1|114719|estExt_Genewise1Plus.C_320154
      MDAFVTRKRKRDERVVKPAVATTRVEPGEEEGKEEECTDFKLAVLASLHAGVEETALLEALLAADGSVEQALEYLAQSSRSPMRKRPAPATVGYQSSLTSYRIAPPNGALVNKSVVKKGKTLFLYSPEDIETHTPCSIIHNFLPAKQADALLLELLEESSTYQRFEFKLFDRVVQSPHTYAFYVNSLEEVERQKTEYVYDGRQMNNVRQSPPSLLAALPAVQTAVNTEIQRRIRTFYPNAQKLAHQSPQPWHPNTAFVNCYDGPHQSVGYHADQLTYLGPRPVIGSLSLGVAREFRVRRIVAQDDDARADGSRESTADAQGQISIHLPHNSLLVMHAEMQEEWKHSIAPAHAIDPHPLAGNKRINVTYRWYRESLHPKFTPRCRCGVPTVLRCAMRKKESRGRYMWMCHVGFVPGKEGCGFFVWAEFDADGEPPWAAGKVKKDEEQRSVGSV
      >gi|Sarc1000012919|ref|SARC_12635T0 | SARC_12635 | Sphaeroforma arctica JP610 hypothetical protein (292 aa)
      MASNWAKLQQQIVCNPKKKQKSTGVVQGKTIGNINNGSKKTAPFHKKTQSGIIRDTAASTPAINPDISSGKRKLKVDGNECSKGNKRKKIHQQTSFPTDVPTTQVNNQQNNTIQNGRGDATQITSSTRHIDKQLETSQKKSKKKIKHKQAQIQSRKNDKSRSDCTHIEVSSNSQLPISATKAPTPFTIQANTCHACRKFTTPQDCVCTKGSKKESVCVNAFKQCMAHSYRGFSRTSAEKIESSLHDSILDTLDDMVKKNYFHYDIVSAGKAVCRQYVFEFILYTKLHVNEG
      >gi|Sarc1000012918|ref|SARC_12634T0 | SARC_12634 | Sphaeroforma arctica JP610 hypothetical protein (212 aa)
      MTYHYQRLRIFALPWDDDGTGVYQADTAKCESGRPVSNLLALKQLNSTLTTNATELLLQYHKEGGDKHNQQIAQKGVVPRGSCKYSVSLINYMFCEKDSKVPLKNEAVYGMGPTSVSWHADSSLQNYSTIAVYQITGNGPIAGSQNQTKRSSKTSDTTKGSEDDWSVAMRVVADDVTPAVACRLKSGETYYMMDDFNHHHHHAGMQHSNGT
      >gi|Vcar1000012920|ref|jgi|Volca1|99783|fgenesh4_pg.C_scaffold_95000030
      MKLGELRFQVSTVGSGPNSLQCYIAEFKRLMADLPYRHEKDHVLFFAQGLQDDLRKEIFSRLRNPGSYIRLQDAAPTLAAMTSTAAPTPPAAVRLDPVNVAFKPPPAEQVSHSLPIASKPARRVGCMRARVAAASTTEAHRVDSDWMLSRTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAHVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIVLPDRPTAPWAPLIRHMTVVRRFPTGARIVCRRDPSDASSHATSSLSTPRGRGAFAMDVRQGAQRAGDGGGRVPANCMRYEPTHQVPIEVRPTLAGQGVRLRHPCDLRTEAYKAPPPRLLLNVTKGLSIKRMVTVWFLFICTP
      >gi|Pbla1000012921|ref|jgi|Phybl1|77051|estExt_fgeneshPB_pg.C_50358
      MTESTPTSSYPISKRQQKILERQKRQYAERKEKADTYVNQTPFRYVERNFKSRVPPPDLSHVVDFNNLDNNLKRINDEIVELHITNDLRSLSSLFGEHDTEWENRAHKAYGLKSSPGFIFIPNPFTPRAQRHLVKQCLSEYTLSPSTSNLDTHYVTPPNGFWNLYEREHLQDLKEGDADYFVAKKAGFTGSEQRYDSSDSEDENSNNKSNSISNFNNTFEKETVPTPTACSREFEPVQWQPKPDPPPAPSVPLLSPRELFRKMRWTTIGCHYHWPTKTYHLDRRFPVPEDVRDLTQAVAHAVERVGYEGDKSRSWKNEYLGADFKAEAGVVNYYQYKDTLMGHVDRSELNMDAPLVSLSLGHTCIYLLGGPTQDTVPVSIYLRSGDIVVMTKHCRQYFHGVPKIIEDTLPAYLSPQTAFTDTPDWEPFGTYMQTSRINLNVRQVFPKKNE
      >gi|Vcar1000012957|ref|jgi|Volca1|46013|gw1.105.40.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFAQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Sarc1000013043|ref|SARC_12758T0 | SARC_12758 | Sphaeroforma arctica JP610 hypothetical protein (243 aa)
      MVPYNSPTVEGGVFALDTWFGRSWYMNAPFSKLDAGVQRVKADRATATLVVPYYPETAWFKQMVPSLADTPIVIPPAHGVFLRYGREPMPPPPFVTLICHLSPRCEKFTSAEGFWAAVDAHRPVTSALVRALANRPDDLESCIVQDRVAPAVAGVVEKPVRCGHDPSLGGFALKPIEHVVGSCTFMTTTVKPDESVLEESVRRAEREYDVHRARGVAVMTRAQRLRASAAGEELTVEESVSLL
      >gi|Vcar1000003043|ref|jgi|Volca1|92007|fgenesh4_pg.C_scaffold_23000205
      MPFCHDTIPPRRHSGAGHHEEDVVFLVVIRQVSHKVFILHNVTLERVLPGTLRQSKHGKLCIQFVLNDRGQWGEGLEHGMVRCEVCSRVRMKTLQTVAHCAVKNRKGDFCCLRALHGLINVAREPPPAEQVSHSVPVCSAVQRSAEPDTPLPPPFNVSRLTQIATASTTGVHWVDSDWMLSCTIFLDLDSQYGPFTIDACCDDFGINVPFFSPPHSFLSAQVDGECVWMVLPTLNPSAFISQYLESKTTNPCTSAIIVLPDRPTAPWALLIHHMAIVRRFPAGVQIFCRRDPSDASSHPPL
      >gi|Vcar1000003045|ref|jgi|Volca1|92008|fgenesh4_pg.C_scaffold_23000206
      MYSCKENLWLTAYSYQLAGEFCNPRRIFTGKKKYDVVVFPKIVSIADFSDLYGILVTFLYDGGAVILGPTTYDINITTVFTQSMASEVSFYGDVGTAVTIDCGYAAVPVYNSSNRSAQWNASSAANTTTTTTTNTTGSSSAAILPSSSWPSLLSAGLASDEPTSYSANCGPLGGSLGGGVMYSDSSPMKATTVFSLPRGSGRFYYVGLDFDNDPRDGSWGQLLSAIAVQAASNPLSGNGSPPSPSPPPPPPATAPEEPGLYDINVNTSSSTTQWPLGRGRLSYTAAGLTATSALTAPPSPAASRTLYLCHDLPVVNGTGAAANGSSNSSGGGGAWLVIDLGTRRRVASVRIQALEAGYDNRSVIRGVPVRVGAVPPNDVTDPAAVNDPLCPALAHTIWVGAAVASGSAAVQVSCGANNSLVGRYVSLQLQDDIPGASWARMCGVDVMGVLPASRSLVSRNKPICVNAGASSFPPDMDPSLAVDGLYNDNRNGGGLCAMSALRQDPFFTVDLGAVMTIERVEVTKSWRNEYDKQLSNFSIVVGLTSCISVDNPLTNLTSSQAVVCASGLTLPPGTTGVYNCSGISGRYVTIATQGYNGYMRLCELDVYAAVPSYGAYRVSVHKRVTVAVDYLDSLLPGLAAATNKSVAGMLVDGVTPTRLRSPAAGGDPSAPLGRTCLVAATNYPSAASWGSNSPWLRVELGGRFLVQHVQLHFAAHFDDLLSGATTTTATTTATTSTSAATTVTPANYKYTLDVRLGDSAPPVMYTSGTENPACATGAGFFTGLESFTDLYGSGENVRRYGCGYYGSYVVVTVLRTTADGNTTLALPPLCEVEVFVVAEDGNGGAAAVGYYLSSLGHYAAASNGNGSWAAVNATRALRNPNYLEAAVREEDRCVVAAAAPGGYAAWLVDLGSPLEISMVEILSSEYSTATVNLTILNDTAAAAAGGGGGTLLLSPGSGTLLTGAANLSLPLSTVTHVLPAAAATASIAVGNASANGGNGSGTAGGGGSAPVGRLLMVTNTVPGAELRLCQVWVYGTVDGGGGGARPGIAVRKLSGRVAANFGSLPLDTGATGTTSNTSSAIRALLLHRRLQATTDTNDHNRLLTGPSDVIGASSSTEDISTSRTAVGTAAAAVDATDAVLGGSSGGAAALALRGKDISRRRSLLQTNSSSPSSSSSTSTTMTGAASLVLDLGLPRAVEVAVLVMESSLSYSGSTLELSGVVISVSNTSSGTPGSYCVYDVTIPLSQSRHVYGCGGAYGQYIVASRTVSYALPNGTSLEVYLTDPGTELLVSAHRTTSQAGGVTTGAGGSGNAVNGYYSQAALDAAAAAAAAAAAGAGVVSGLFFGSATAVSPSPWWAVDLGSALPLAFLEVTAHPAPTDGLGLLDFEIALTNYSVVTGTEGIAVLSGLSLPAGGTGRYPLGGAAARQLVLRQPGAVRSLQLAQVDVIADRATVTAGANALIGPKPLSYPLVYDMARTSGSYIDPAVSVGNATSSGELAPLAVDGYDSTLAASCARAVPATSSQLPWLLVDLGSSIRVDRVDFLKAADPSLAAELEGVDVLLGNNTSTSTPSSVVFPPPALSTSFHPVALANPVVLSKLSLPQPGAWSTFLLSPPASGRYLVVRGPQRGATMTLCELQAYGESAFEMELVPAPRPPQPPPMPFPSPQPPSPPPPPSPPPSSKSSSRSREWPVSKVSTNPTAVTAVTASIVGVSIGTTVTATMVSSAAATAAAAAAAAGGGAVAASSVPSAAAAGTAGTGAALAFVGHMQFFALTANAAANTSAGYQTTNGQLSWLNLQFNTLRSLDHLPEEEQRALSQALNIAIAYVGALVLHFLVVMLFSALRWVIALNRRGAAAAAKGATADGKSGGAAAASAAVAPILPSFLVFPFVEVFVIIFFITSAATAAGTLLSYGITAHRAVSIAVGIVILVLLVIFTAAVLWLLWRLYAAADRLGLSYVWTRKPPPADGSGGRLAHWLRLAERGYWERPEGVDVWLMEPHRTQYYLRQGFSLPQALRAADPNHHIDHTAAVLEGQEGQEGQKGQKGQEGEKHGAVAETVMAAAAAPGIQRANRSSSGSGGVCGSSTTSQVQPGGDVREGRKEKDGPLGGGGVATTAEAVAGVQPTSQNRNTKPTASRGKAWASFTTASDAAGGGGGTAVTGSRPGSAARTSAAAAPVAESAADNTIRAAAGGSGTAAVGDGGSARRLLPVAPLPSPPAAHNTSASAAAPRRTQALIVFEEPSDSDSDCILDPLTGGGLHTAPPPPPPPPPPPAPPLPPPPAPRPVLLPPLDPHNRDSLIKAIGLPREPAASNHLGSTTAGGGGCNGGARVSGGGGGGVRQDGEFVEEHTAWAAVICNLVLPAVLLGVQVGAQMEPHSSAARGTTVALLVCKALIAIYMALILPYNNIVVMSVELLCAWLETAVCACLVGLQWTHGSEPGISDAMLGCEITVFGLQYLGSKTTNLRTSAIIVLPDRPTAPWAPLIRHMTVVCRFPAGARIVCHRDPSNASRFSESNERKPRFSVYLPRAYSAVRWWYGGAVLVALVAMFAGIVMMPAKRRVGDQLVELQMVHADLHAQLTAIEQSDDYQKAASLLASDPLARLNKKKFSTAMGHAWDTYVSLKNQMASVSANIAQLEMTMEQPAMKVLVSDKSGRVMERPFKDLDYVNDSWVNKSGKEHKALKRENRELNQIYHAFRMRALRLGASPHSLYSFKSLPFACLAMAPAPSTYPPSSATNTAPTLAAMTSTAAPTQTAAARPVPVNVAFKPPPAEQVSHSLPIASKPARRVGCMRARVAATSTTEVHRVDSDWMLSCTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGEPPYSPLGSPHTPHDRRAPLSCWRADRLPPRPLRRLQHSSSTTVFANLHDHLSTKTHPLHRFLYPLSSAA
      >gi|Smin1000013068|ref|symbB.v1.2.011515.t1|scaffold777.1|size163462|6
      MPLRKKKKHLEGEEGGTRRGPPTATKSGFLSCRVANDEWQTTFGAWKAISSYFASYKKKSVWMPFYYDGLCADHLRSLGFQNVIHRQEDFFERVLDKDFLSRIDLIWDNPPYTAPEMKEKVLRALAECGKPFVMLLPISVLHVGFVRDIVDMQRVQAIIPRRVHVRKTGEEILPFKYLCWFCYRTELPRDLLFVNDVEQVSNPSPSEMHSAAPARKKIPKKAMKKSPRVKKKGSPFHRLPFPRVFCLVHEG
      >gi|Mcir1000003087|ref|jgi|Mucci2|107410|Genemark1.3173_g
      MNSSINYLELLTIWKHFGGTKLTCLMDLAAQIWRHCFATDTRLLLTYIPSKFNPVDPLYRNQLRQLEWSLSTATFNMLNSIWGPMQVDCFASQTNHKLPKYISWTWDLQAIGTDAMLIDWRKLRCLYLCPQWNLILPVVNKRNMVSNSLQAGNSSSHAYKSYGYHLRKRRRVVPSGGQQQLDFPGLEDPRIAERPRPSGVSSAVEALLAQAPAPHQLAQDSSSLYQMV
      >gi|Uram1000003236|ref|jgi|Umbra1|231739|fgenesh1_kg.14_#_327_#_combest_scaffold_14_20870
      MNPTLEPYKLPGIPDSAYYIPNFISAAEEEYLISKVQTAPAPKWVSLKARRLQNWGGTPKVEDGKMLQEPIPTWLKEPIFKKLQSIGVIFGDSPSTEPNHVLVNEYLPGQGIMPHQDGPLYNPIVATVTLNSHSILNFYPHGVEKATSEPEFSVLLEPRSLFVQTGQLYQTYLHGIAEVTEDDLAERPPINYHTIPNRTLLPRQTRISLTYRHVKRAIKNPFAKTLFKH
      >gi|Spun1000003257|ref|SPPG_03428T0 | SPPG_03428 | Spizellomyces punctatus DAOM BR117 hypothetical protein (341 aa)
      MWCDRCGRISRRVWVYKWVCEGCQHTIEIPIPIYTRTTIRHLPPSRTEQYGCRSFKPSSNIRGYLERIPNHVPRFGGWERMVYVFPEGGHVHHFLAGNREDVDGVYEGLQRGRLPFRRMGVKSRLNGLLTSNFVLNSGEAYAQASNLPARSMTENTRIELEAIQWLQDAVYEFGTVYNQSHKNLSLDGDEKEMHWHDDGEHTVHGPVSSLSLGSDALMLFRTKPTPTQKPKTVLSLAVRHGDVVLMVGKAVQRYYEHCVRVRGHRVSVTGRMVLNEGMRVEEGVWRVLEGVERGVGCARRRKGRLVFGKEVEEEKEVEVEKKGEEEEWDYPPAPPSSDIE
      >gi|Sarc1000013250|ref|SARC_12965T0 | SARC_12965 | Sphaeroforma arctica JP610 hypothetical protein (182 aa)
      MFAARQEFEWIRMFWLQGESHAQSHERYWLGCIQTLTHLWELSEAMLSHIVTVLGADVASEAGGARVCVSKDVRTYAMTIYILREIRFRRREYAKRCKSQAYHFLEARNKPVENKLKYNIPLDRAGLEAYKHLDLPQGVSLFSHTQACRKGARTCAFTLPKDLSPTIEAIEEQKKAIFGKV
      >gi|Vcar1000003269|ref|jgi|Volca1|92356|fgenesh4_pg.C_scaffold_25000154
      MAHICLHEHTFRGAPGKPQQGATASKLPNCWTYRLLYVYAEEQKTGPLVRHQSQLAMFALAQGSGPNSLQCYIAEFKHLMADLPYRHEKDHVLFFARGLQDDLRKEIFSRLCNLGSYIRLQDVIDLACTISMERDAAHRSPHSLPQGIRTINTAPTLAMMTSTAALTQPATALPIPVNVAFKPPPAEQVSHSMPIASKPARHVGCMHARVAAASTAEAHRVDSDWMLSCTVFLDLDSQYGPFTVDACCDDFGINTHVVPFFSPSRSFLSAQVDGECVWMVPPTSTLSTFISQYLESKTTNPCTAPISTLVII
      >gi|Spun1000003263|ref|SPPG_03435T0 | SPPG_03435 | Spizellomyces punctatus DAOM BR117 hypothetical protein (427 aa)
      MDRDAKLALLLSIFDDRSERNLLDALDFAQGRVEQAVEILLGNAEGHVSGGTNKRKHGATIDEFFGGSERGGKKMHLDRHVDVSDDASQPGTTTTSSPGSRNAFEVLCSGSLHAEDAKGVSLPPRTLTAKEVAEHIPCQLVLDVLPEELAASLLQKLMVEAESWKIKRFVMFDREVESPHTTSFYTDEPYTSSSSTPDVEYYYGGKKTTDVRTMFPELAQARSLIAERVNRALDERDHVQGGRHPAELVGIWEPNIVLANCYRGAAEGVGAHNDKMTYIGPRPTIGSLTLGATRPFRVRRIRRPGGPVPQTFNIMLPHNSLLIMLPPMQEEYKHEVPKCNPKLLIQHPISKDTRINLTFRVARPEYRDNIPVCRCGNPTELRVVIKKESNLGRYFYMCAGGGNETSGIESGSNCGFFEWLDLKGKS
      >gi|Pbla1000013272|ref|jgi|Phybl1|77820|estExt_fgeneshPB_pg.C_100091
      MPSDPFSWQLVRPTAEAEGVHPKPIKPPLDRWKGAVHGNCENSTQKHSFGTADTHKSQASDASLSSSTPIPVLEKPKQTNPIILTSKFGQFMGKMLLQSSNNRTKPVPKINEHSQQSNEPSYTGYFNIERKRRSSDGLKVVLRAVGQEERSESSTQKVVELKDSIPEDTIMDDTMDDTMDDMSIEEESIEDLPTSNSLLKQNLTVNTTIGEASTNNTPIEEVVLKNTPSPISPIGLNQPKDITLSDPRYHHKKGKRTRWDVGPILIPEDMECTQSFASLQLDDTSKALDYEPENLCDTLATGDCDACNIMNSQERGPLDQVALCESCKQTWLPNAKSLLDRLSKHSKDFVKPTQKQTSKQTLKQTSKLLSKQQKQPHQQDNQSQSQPQSKQKQTPKQTSKKTTKRLPPAKSVHLHSKKSSIQEAKNYYTTGAFLTRNTAKQLVDEAGFHPNPHGFTNMQKVKVLNINGHWYRGILTMMYGSKVKVHYLDWDDQEEWIVMGSRRLRGLTKDEEEEDEDANEEEDGEDAAIENENKDEENEEKDEDKDKEEEEEEEEDDILSIPAESKTTQPVKKGEHLSVSPKSHSRKSQSNNVSALNPKKHYTTPIDTDPTQIFNDNEIFMTRRMAHQLTDEHGFKPNSFGYRYNRAVAVSLRAEKGKRNRMEYNGLLREMRGNQVRVWYPSLRQSDWLIIGSRRLRVLTDQEASELDNLGTELVRTMDTRAKDSEISTKLPTETKTPSPSTTETLPEPQSESVPKHVEETVEDTVEDTVEGEVEGIAEEPVEAPVPEPAPGPIKRGRGRPRKVALPVDPSNPHIVTIPKTPTKTALKKRNDSIKNGIKETKKAAAAAAMVVEMGTKDTEDALDYLTTGAFATRRAMRQLKDEHGFVPNPYGYVYDQPIEILNTRSSKNKFWERGRLIGMCPGKVLVRYDGWGEVYDEWVMVGSRRIRPAAAQIESSGDQKSSTMASTENTAPTSTLNGGSASTKKRAKQAARNDLLVTEANPEVEDEARKKRQHRVLGPEDYERLGLLAGSEKVEKIERRGRKKMVRDVETPKETETMKDKAVLIEEEPKPIEPSVEAPIVQNEDQEMAEPESDLTKPNGAMTVPEGTQNDLPVKRKKQKAKTQQRKRKPAKATSPSPSVSSSTSLQQTQAQTELSTELSTETETETPTPISTSIVSVAATEADHDTSTLTSSYRRHIPSDESNHGFVANVYGYDYLQHVQVLHLDKKWYEGRLVSMERNRVRVHYCGWLDKFDENIAVGSRRIQVIENDHEVVCIEPTYSERLEKMQEEKEKKAVEPEDAQVVKPSKRREVAPTVVPAPEEPVHGTHDMVEYHMEAVDGMEVEENDTWKVYCNQCNIIIKQFRYYCTYCETPSEGHDYQSFELCLRCFDQNFPFWHEHPRSSFAVQAVIDADMGPMPIKGELVTVWEEDILEEIPDDTQDDLNDPDDMFSGTMEASEVFSGVAPLDEDQGYKFLKRWQRRKVCAFCNDDDDTSTELGKFIGPFVITSFNKNGTEKKRSFWAHDACARYSPEVFCTSEGKWYNVTIALRRGRGMKCYGCKEKGATIGCFESKCSKSFHLPCAQKPVSYFQSGVIFWCNTHEAYYKKKDTYVNIFNCDGCSKRLEDETWFTCIPCASSYFSSFDLCAECFHNFPQDHAHDEDQFEETSFAILKEVEAQKATEAAKAKEELRAANPKKKPLFPKRKRRLADGSVPLTCSYCGTEEAESWRKGYDGGVLMCTPCFELALFIDNDGNTASNESLVIDSEETHRYVMSIEDYTHKPYLTRDAVSATKFSDHRTGPRLASYGPQPNQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTNKDERVLSNFLGRGTDAIECFLLQRRCCGVDINPAAVALSQRNCCFEVPPGLTSAEYRPIIAQADSRQLTGALFADESFHHVLSHPPYKDCVAYSTHLEGDLSRFTSVEDFRAEYGRVVRESWRLLKMGRRLTLGIGDNREHCFYIPVGFHLLREYINHGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFVATFRKIPLECTDKMLPIDNSDCRDHVRVEHVTKAVPQSAISRKSVVMGTVWIFKPTDTHTFEQLCISRMLERFGKDDGNWEQVLLDFMSPESMMIQNNVQQQYQSSTSSQNVHKDKPEEQEHDRDLDLDQDQEQENNKEDSQNQLSDYEKLRLKRIEENNQTLLKLGLISEMSEDSDDVIHYESMMSKKPLENAPLVLVMVGHQPIEPRQIGLYRETIVQIALEAVKKLAPLGMLIIGTKDIRQKDNGKLWPMSMLVLEDIERAIDRSVLKLKEMVVTVPEGHSKDRQQKNLNTEVEEELEIVDEHLTIVHAIYLVFQRMNYSHNYN
      >gi|Uram1000003343|ref|jgi|Umbra1|232251|fgenesh1_kg.14_#_839_#_combest_scaffold_14_22505
      MQAIRVQNSFRSLGFAKARVGFGLKGRNGITVGRVRSVSSNAYASLRNSPAILDSCFELSTAFTDNASSKGYSINDFLVYPDFVTKEEKETLTSICEKKLRRSFGPKVEYFPIHPDSVIHQYRECSASHWGKQDEFMKEFINKKIYSMFPDQMEWLDPHVLDLDGDGEIKAHVDNIEYSGSVVAGLCLLSPAIMTMRHKDDSNIRFDVLLEPGTFYIQRDTIRYSFTHEIRLSNTTWKGNEVDRSRRISLLFRDKKI
      >gi|Vcar1000013369|ref|jgi|Volca1|45995|gw1.101.33.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADSFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Fcyl1000123364|ref|jgi|Fracy1|197836|e_gw1.35.136.1 ; gi|Fcyl1000138707|ref|jgi|Fracy1|213179|estExt_Genewise1Plus.C_350063 ; gi|Fcyl1000088052|ref|jgi|Fracy1|162352|gw1.35.136.1 ; gi|Fcyl1000100129|ref|jgi|Fracy1|174601|estExt_Genewise1.C_350073
      MSLSSIKNNKNKKKNNNRLPKISLSSSSDEGGSGGGDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRRRQQQQQQQQHQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCTKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFELEEGICYQANPPFCEGLILQLNDKITDILLSSQQQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLW
      >gi|Vcar1000013363|ref|jgi|Volca1|69777|e_gw1.101.32.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFVVPKDVFTFECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILADDWLERKKARMCWESHTLTVRKATGDQCSGRNGNLPKRKPAETSFGDSGEEVLVLRKCRN
      >gi|Fcyl1000123395|ref|jgi|Fracy1|197867|e_gw1.35.124.1 ; gi|Fcyl1000138708|ref|jgi|Fracy1|213180|estExt_Genewise1Plus.C_350064 ; gi|Fcyl1000086733|ref|jgi|Fracy1|160962|gw1.35.124.1 ; gi|Fcyl1000100130|ref|jgi|Fracy1|174602|estExt_Genewise1.C_350074
      MSLSSIKNNKNKKKNNNRLPKISLSSSSDEGGSGGGDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRRRQQQQQQQQHQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCTKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFELEGICYQANPPFCEGLILQLNDKITDILLSSQQQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAA
      >gi|Vcar1000013410|ref|jgi|Volca1|46011|gw1.111.24.1
      MFLRDEFRRVETELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCSISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Ttra1000003432|ref|AMSG_03671T0 | AMSG_03671 | Thecamonas trahens ATCC 50062 alkylated DNA repair protein alkB (269 aa)
      MAEAAAEAAAEAATKAAAEAGSEANDRVPVVLAQPCHAGLLAGWADGEVEPLDALDALGIRVVPEFVTADEEAALVECLEAGRFTSVGGRELQNLGGVPPVLDEGPGMVVEPLPEWLGDVVAALNAAGLYSGEAAVPNHCLANRYPPGTGIAPHCDGPRFAPLVADISLGADAVMVFSEVASREIVAEVELPARSLLVFARDAYHKHKHAIASSPTYEGSPTRYSLTLRRVKRVAEAGAAREILYTDEGLRAARAAKFAHLRLISEKE
      >gi|Pbla1000013531|ref|jgi|Phybl1|78334|estExt_fgeneshPB_pg.C_140177
      MTERASSNLEDQRFKSIDQSFIHITNDSCHCNPALDYSHLSHNCSANYYSQHLMSLNSSTSHYTPILLDTLTPPYETNNALMHGPHCTAALSNNDCIDLLPGIKRKRQDTIGDQSASSSIYANAYQPKGQKTSNTDQIINQTNGFQNEYRSPISLVNSETYYRNTPSDLVHDNRIYPSIKQACISESFEAVSINESEAKQVVCILGNMLSIVKVDVSKEVAKRINNTGKYFDNHAQLDFKLYLLEWSYMPDLALDYLFRSLESERVCKDMSALRETVNQQTIDIYCLMTGFMTVYLVFIELLFQDLIGNVNRSSFVEYSSLFAEQDDILKKSDDIFLWHHSYTRHHTTILSSLRSQMQAAGIKFHSLQLIDTIFSQFYTLHFLETSAQKHQKDHGQFYTPQSVVQFMWMRCMSKQTLRDTLKTGRVPRVLDPCMGIGTFLCEFLTRLVSQSTTTPLLWNDPTMLRTMLSQTIPDALWGVEIDPFACNLGKLNIILHLFPFYKRLVELGECLTPRMINRLRIFCNDTLKLTVESKPVTESGTEAQLWEKEQLEKLRDAALFKFDYVVTNPPYMIRKTGFIAVPDPELYDDSVLGRGSQAYMYFLWICLQRCEETNGQICFITPSQWMVLEFAEELRAWIWQHFEMLEIFQFEPYKVWPKVQTDSLIFNLRKRAPGRVPEQTLFLRHMSRKHNLESIIKSYNIFDRSRLEVQDPLIKYKLTSTEPVDFIHQIPHASFSFLSPTSSVSDQLMNLTKSLPRLCDGEMCLNTSSSCAPLVWNRGPNTNPVYALVVRTRWALGVFGKECCDQWLRPVFYWSGKSSSASRRRKTSGFQTVSKEATFWQDRDPLRLTKKENSPAEAYVPLCRSDPDDTRLSFYSMILVDKDGALQLEEEYKLYGNTANSSALYHYLLDARNALQTTKTDRNIAYCHYSKCGIETPVKIVHPINFGYFTRTQPRQRFFIDTDRRCVTNQCMYFTIKSEYPWQSADFFCGLLNSSTLQFFIRDTCYYDQQGRTRFFGKHMAKIPFTPPRSTIDVEIMAMFVRHLTTARQWIYGIIQLSNTLNVMEQVRGCTWHIPLVDQALLSLCEQRTPNWRIDLFRAPYDANDWINIIIQSESHTRSVGMYEELTRLLKVASLFQYCIDQLVYDIYSIPADLQKGLEQELGLILTETWKNILFEETSVKDHCLTWGLCMEEVAKKIFDKDS
      >gi|Vcar1000013547|ref|jgi|Volca1|40642|gw1.120.3.1
      MYLPDEFRNVENMLGRQFTFDAACNNSGDNSLCTRFASPSNSFLTSDVSGEFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARRMGLAIRPCSTSSVRVVNGSSELIEGLIHAKLRIASFHDTVKLLVLKQANAGVELILGAD
      >gi|Vcar1000003571|ref|jgi|Volca1|45982|gw1.10.241.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTHVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTTSTVRVANGTHELINGSITATLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Smar1000003623|ref|SMAR010103-PA pep:novel scaffold:Smar1:JH431960:112870:114093:-1 gene:SMAR010103 transcript:SMAR010103-RA
      MLFEILHSNLVSVRHLASVIGKLNATMLAVSSAPLHYRGLQMLSAGFLRGGSYESECSLNTETRTELAWWLRNLNICKGRSLLPKPLVLIIETDASNWGWGAFSKDFSIGGPWFKSFKAKHINILELTAVFWALKALARNYSNATIKLKMDNISAITAINKLGSPRSPQLTAVAQDLWTWAMDRDLTIVAEHIPGTENCLADAASRGRLFDSGDWKLDTAVFDRLGSVFGKFSMDLFAACHNCQIYPFFSWKPDPDAIAFDALTQAWKGGLLYAFPPFSLIAQCLTKLRNSDHPQEIVLIAPVWPSQAWFPLQWSASTVFPVLLPQYDQLIVNPAKEPHPLMIDRSMRLAAWKLSTPSITTRSFQKLLPSWLGPPGNPLLPSNMIYAGENGANGHMSNILTLFRSLP
      >gi|Bnat1000003625|ref|jgi|Bigna1|73330|fgenesh1_pg.23_#_199
      MELGSSFRRRDKASRDDASSDKGKPENKAGGAAATNLFRIAEKRYRLYKVNDTASPPSSAPRKKDDAQQVWRIPEVEGLYIIKNALSLQEQLFWAKRAIKEFSNCAHTNLSNLHGTQPDHWANAVECNDVGKDSEFYKLRWASLGYHYDWTQRLYHKYQHQNEKSDFPENLASMADALARRVGLKVKSEAAIVNFYPYDGSMGGGKSKDVEPTAIFVNSGDAVIMGGHSRLCFHGVPRILEGSCEELARFLHDSPSKEEKLIGKYLTRSRINMNVRQVTQLAKQTGIRGIC
      >gi|Smar1000013635|ref|SMAR001291-PA pep:novel scaffold:Smar1:AFFK01015044:37482:38168:-1 gene:SMAR001291 transcript:SMAR001291-RA
      MLKTKFLRVGNYESHCSLDHDARNELNCWVKNLDFCKGRSLIPKPVSLVITTDACNTGHRGWGAVCNNVKISEKFSSFELKKHINVLGLLGAYLALKSFARNSSDSSIKIRIDNTSAVAAINHLGSPLSPELTALAQDIWSWAMAHNLNIIAEHNPGAKNIKADAASRGSVKDSEDWKLDPVIFSRISNLWGPFSIDLFASHHNHQFCSFFSWRPNPSALAFIALIQD
      >gi|Rall1000003656|ref|jgi|Rozal1|3657|O9G_006133m.01
      MIPMYSKIVGPLEKLRCQSVIEWNEEYWAIYTKLRTILSSEIILSFPDFDVIFDVGTDASDKGIGAVLYQTINGKTKYINFAARSLHDGEEGYGATKRALLAFVFALQKFRPYLWGTKFNLYTDHRSLTYMYSQKHVNQMLNNWLKILLEYDFNIYHKPGFLNILPDAISRLYDADPEIEEIELKVWSTTVQKNIEWRLNPKIFEKLNKELGPFEIDAFASPINHHLPMYYAKDIDALTQNWKDKYLWIYPPENLIDKVIDKIYKDKAKAAILTPFDKTKSYFTKLEAIXNHISLNWKLLVENSL
      >gi|Hrob1000003787|ref|jgi|Helro1|185055
      MSDQRRNVVQGAWANVSTRSKKVDGSLASKKGMVTFDVRKTAEKNDNASSGSNLVKKSSFQFENVELPREPSFRTINQSGDYIISDEPNGQSRIALVTNFIDSSECGWMFDQLNDELSWKQHEVHRIGKTYMQPRMTAWIGNVPYSYSGITHKCQCWNPLLTMLKDTIENKTGHQYNSVLANLYRDGHDHVPWHCDDEKDFDTHPSIASLSFGDSRIFELRRKPYKNEHLKSLKNDALTDADYLEYIKIPLNAGSLLLMEGAVQLDWQHRIVREYHDRGPRINLTFRSIIDR
      >gi|Lgig1000003858|ref|jgi|Lotgi1|115378|e_gw1.21.357.1
      MGTAKGDNATAPDLKGPGPVERQDSITSVPSPTPSGSDPVHDLPPELLQAGWRKFWSKREGRPYFFNKLNNESLWEMPQLGFHGNMHDPMTDPLGINTPEGHAPLPAISKFIYKAKSFTINFCNIFYEFVLFCSPFWNFEIPTNCVINERAPVNIPGPYPDIEQLRAQLVTKLRQNYNELCYSREGIEAPRESFNRWLLERNVIDKGTDPMLPSQCIPEVSQSMYREIMNDIPVKLVKPKYSGDARKQLFRYAEAAKKMIESRGVASESRKIVKWNVEDTFTWLRRQQNASYEDYLERLAHLKRQCQPHLTEAAKTSVEGICKKIYNQSYEIVKKLSERHWEILKENNINKMDPLPEPTTPRKVLCYSIQVSVPTPRPVLVEHATENDITSLRYKSETVKINSSHFHKLEQLYKLNCRDDPRFDHFLCRVWCLLRRYQTYFGIHTNEGFGLQGALPVTVFECLHRVFGVTFECFASPLNCYFRQFCSAFTDTDGYFGSRGDILNFFPKSGSFEANPPFCEELMEAMVDHFENLLHESNEPLSFIVFIPEWRDPPTEALMRLESSRFKKKQITFPAYEHEYRNGFQHICPKNDMSVKSLHGTVAIFLQNDAGFSKWGPTPERIKELLLSSKPRDTVS
      >gi|Sarc1000003931|ref|SARC_03841T0 | SARC_03841 | Sphaeroforma arctica JP610 hypothetical protein (111 aa)
      MEPEDVVPLPTEDDFIDDEDALPVAGCVPVAHEKRTQRLNPEWFRRAMEKWPGVVINLFVNRYNAQLPDYASDEPRVGTWGGNPFYGPFGCHFWICLSAMAFSSAVLVVS
      >gi|Fcyl1000123964|ref|jgi|Fracy1|198436|e_gw1.39.264.1 ; gi|Fcyl1000084768|ref|jgi|Fracy1|158883|gw1.39.252.1 ; gi|Fcyl1000088083|ref|jgi|Fracy1|162385|gw1.39.270.1 ; gi|Fcyl1000086781|ref|jgi|Fracy1|161014|gw1.39.264.1
      MLAKNKEKLLKINPAHYEKLKTIYIATTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAYPDTDACFGSLGSFFQFHPKQGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLQASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDAAASKWPLTEIAISELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKKAGGAKKHKRNT
      >gi|Mver1000004016|ref|MVEG_04014T0 | MVEG_04014 | Mortierella verticillata NRRL 6337 hypothetical protein (426 aa)
      MPDPLISNRQRKIMERQAERNREMAAAKRSDTRTPFREAELQYLSRHPPPDYSQALDFRKPHEELVEDPKVKPVHLQKPLNEFCPLFGSEDYQGVGKSRYAFLHEDHPGLIYIPAGFTPAAQRTLVKACLKDYSRHPNKSNLDTHYTVPDSGLWDLHEDVFYDKREADDPAVLVPRKATTDVHAGGYGSDDDDEEEENNKKSKKSKISIRTLVPITDDTPSVPDNVPKTDPEPSAHVPILPPGQLVRKMRWITLGYQYHWPSKTYHFDQNAPFPPELCELSKAVVEAIHEVGSYPYASEDFVAEAGVVNYYQLKDRLMGHVDRSELNKDAPLVSFSFGHSCIYLLGGSTREKPPTPVLLQSGDILVMTGPCRAAFHGVPRIIEGTLPAYLQKSQDPDWDIYAEYLAEARINLNIRQVYPPKKGET
      >gi|Fcyl1000034107|ref|jgi|Fracy1|251973|fgenesh2_pg.39_#_31 ; gi|Fcyl1000123871|ref|jgi|Fracy1|198343|e_gw1.39.252.1 ; gi|Fcyl1000123890|ref|jgi|Fracy1|198362|e_gw1.39.270.1
      MPRHETTTTVATGLGSKVAKKQKVKAGKSNIWNPDVCYVSEKKECTTSSGKLVENFPDPTCSAEFMRMTSTNWLRKQLHKLRKSNKSAEDVEFMKPAVMAVERWFMRYSLDHPHRLAAGEDPVLPMPGADMEEIDPHFVDDLVKTGKDTPEAAVAVVKDFYALTRKAALDILKKEKALTKEETVVVIHHRHSCDVMLAKNKEKLLKINPAHYEKLKTIYIAVQESSSLSLTSGKKKKKTKTTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAYPDTDACFGSLGSFFQFHPKQGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLQASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDAAASKWPLTEIAISELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKKAGGAKKHKRNT
      >gi|Fcyl1000014177|ref|jgi|Fracy1|232043|fgenesh2_pg.1_#_241 ; gi|Fcyl1000104334|ref|jgi|Fracy1|178806|e_gw1.1.2000.1 ; gi|Fcyl1000088076|ref|jgi|Fracy1|162378|gw1.1.2000.1
      MPLWAAETRISIRSLASSGSIILPDPEQDAARMAALKRLRHKFVRLCSENNRSKPPVLAFERWLGRASLKRGISTSDGYDPIIPSDSVMDKGFAKDISRTLPSWAAANAVAEEMTKEATKQIRGMATQREEIDEHKDLGMLRKKIREEAALNESTTKAQQALGGANNSAVVGGGKVVLNGNRRDGIYDVMLCGPCGKPRRPYLTISSLHLSKLLRLWKLKNKEGNDDDDDGNQIEVIEVENPIDNMNALLEDDRIMFTKSLYCCLARYEGLKGAGYQCAVPGVAFDAAIACGLGSTIECFASPLNCRYQKFCSAFPDIESRFGSLGSFFDDEAFNPLTGTFEANPPFVPEIMVAMGTKLKRLLGDKSRGALSFLVVVPAWGAGIDFVTDLESSTYVRASSRIKASDHAFCDGAQHTKPLSNQADPNLRPSSWDTAVILLQNDSGALKWDVDDKKLEESFCNALKATIGQVPDKFTKLDDWERRGVGQGGGSAGSKGYQGNPNKRPFAKEGNRNIDRKRFARGYAIVLDWMCGDHQSNNYTGHFVIVARSQQTATFGVNNNSPRLKSGEVGIHSSAT
      >gi|Vcar1000014193|ref|jgi|Volca1|98959|fgenesh4_pg.C_scaffold_80000013
      MLMFVTRSELRTDATSTVYALQAKQRDAPTMIGMHVHSPVRQTPCKTLQLHSQYTAALFSPIRPHSVSPTHFGTFHNYRHFANAGSCFTRTSVFCIQVDADTSCTAVPEDNFPAQISMDTIRQGSGPNSLVSYVAAFGRLMGDLPHRHELDHVFHFAKGLNRDLREEVYARLPQYGTDVTLQIIIDLAMGIFSGRHNAHLIDYNTAPCSSRDVHRDDPMDLTANVYSTTPHHSTSPLPSSIKTIGVPYNLRLQRMAQHVCLQCGDESHAKEHCPELRRLLAATTISQPNSSRRHSFQGANNSRKPPRSPARSPARSPSRSSTSSASEDRRSRGHRQYDNHSHRGRSPSRAPVKSALRPSSAETLSPFFSPSRPFLSTDIEGECIWMVPPVDNVSTIVARYLDAKTANPKTSAIIVLPDRPQAPWAPLIRHMTIVHRFPAGAQIVCRPTTSDPSSPTALLKLHFRR
      >gi|Vcar1000014202|ref|jgi|Volca1|84221|estExt_Genewise1Plus.C_800030
      MFLPGEFRNVENMLGRQFTFDTACNNSGDNSLCTRFASLSSSFLTSDVSGEFPGFKPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQVERRRQGASGARTAGGKCVPPVCGKHLPQMPRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARNMGLAIRPCSTSCSEPPSSLRA
      >gi|Lgig1000004215|ref|jgi|Lotgi1|118068|e_gw1.28.427.1
      MKVYENFLTEKEEESLMLEIEPQIKRIRYQDTHWDDAIHGYRETERKVWNSVNQEILKKVRDISFAPGTPQIDFTHILDIKKDGYIKPHIDAVRFCGRIIAGLCLLSPCVMRLVLEKDKGKYADILLNRRALYVMKDRARYEYTHEVLAAEKSYFKGNIVPRDRRVSVICRNQPK
      >gi|Vcar1000014314|ref|jgi|Volca1|78281|estExt_Genewise1.C_1320001
      MNSHDFRIIDYELLRQLERRIGRTFSLDAAANDDGSNSVCKMFASPTRSFLNSDCSGHTIWMNPPMKLLSDFLRHYHRCKSYDPSISSWTAPWASTAWPRVNCLDVNCSMCRVLTFPAKSVLFNGLSLEGKTVELPGIPYPAELWYDAP
      >gi|Psoj1000014320|ref|142118
      MAAAYKASEKRWKRATDEDLRNDEALVDPRNLSKEQQAKVQRVGSWKWGPEDKERPVLAFDDFVASHRGFYVIPNAIDTKTQLQFAHACLTEFTEEPHVTNMHLQNQQETDIWRKARGSHPKDPAASPLLSKLCWAASGYHYDWTARKYYRHSFSAVPELLQQLGSRCAAACGMSLSAEAVIVNFYKSKSSMGGHLDDVEYTMDHPVVSLSLGSRCVFLMGGHTKDELPLEILLRSGDIAIMGGESRTCYHGVARVLPTPFDVPSDDFDALLQSEDDREEYEAVRTYLSTQRININVRQVYPVEPASTD
      >gi|Rall1000004359|ref|jgi|Rozal1|4360|O9G_000840m.01
      MTPKPNNIPVHPIHFVPSIKTYVTNPDVLPIKGMSYISDFLTVDEQKEIWDTVYSNPFSTVIHRRQQFYGETYYHTTPNISSLQPLDNQTQLDSQNDNVADLKVENGDMKRCWTGKEAPKALPLKLFDSLIEKLINAGFFTKEDPPTQILVNEYSGPMGISNHYDDDKAFGDTIATISLGKPIWLNLQLPERQNNVCKSILKQTKVFLEPGSLFVMQTDARYVWRHGISNAKWIRYPDNTPVRRDDDYVRVSLTIRKLLDGRKRVTKTTTDWLEIPDV
      >gi|Vcar1000014369|ref|jgi|Volca1|70135|e_gw1.112.21.1
      MFLPDEFRNVENMLGRQFTFDAACNNSGDNSLCTRFASPSNSFLTSDVSGEFFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARKMGLAIRPCSTSSVRVANGSSELIEGLIHAKLRIASFHDTVKLLVLKQANAGVELILGAD
      >gi|Smar1000004446|ref|SMAR009389-PA pep:novel scaffold:Smar1:JH431896:51121:52792:1 gene:SMAR009389 transcript:SMAR009389-RA
      MSLSETTAMTAAILAMQEQGIIKVVSPVFGQVLSPVFPVPKPDGSVRLILNLKQFNTNLEYKHFKMASVRDAIALMQKDFFMFKLDLKNAFYLVLVYENYKKYLRFLWLGILYEYQVMSMGLAHAPRIFTKLIAPVFAHLRVLGLCGPNRLFTSYWYIKPLEAFKTGQLIKNNNDYDAIVPLYPDIKSELQRWVGIDMFTPKSLISTVSTVDIFTDASKNEWGAVWGNESKFHINVQELLAVLFTWKSFFPFNGTKHLTFHIDNLAVVSLFKNHGSSKHLLHSFGRELWEEACRHKISLFFVHVSGKENTIADFLSRNFISADGEWSLHCSVFNKLTEEFVMPSIDLFASRLHFKLKIFILGLQTPKLQRLTPLLTFGQIFHTPFLLSVSSPRSYGRSIRTKQPFYYWCYFGRRNRGFHARWKA
      >gi|Bden1000004472|ref|BDEG_04459 | BDET_04473 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (226 aa)
      MDSSVGLDAADDPTRNHANLPSLHKHPFEPVISGLRLIPDFITQQEELDLIASIDAHPWSGYGIPPNPELKRHTQQYGFLFSFRTRTITECLGSLPAFSSFVIDRMLLPEFNVFPNDPPNHVLVNEYQPGQGIMPHVDSQDTFGDVVTSLSLWSSCVMSFGNKMTGEKVHLELPRRSLLILTGDARTHYTHAIPKEDMLFAGNECVDRGRRVSLTIRSILKSAIP
      >gi|Bnat1000014488|ref|jgi|Bigna1|25582|gw1.3.140.1
      KLFSLLRDQVPWQRETDDFGPQQRLSYYMGDPDCTFRYVGLSLEPNPWLREMENVRGVAEQDNPRILTGCLLNNYEEGGSFIPWHFDEVRAHGDSKVVMSLSLGGPRRFDLRRRRQSLHNISLLLQPGSVLLMAGNTQEHWLHQLPDITGEPKPHRISLTFRSIV
      >gi|Sarc1000004600|ref|SARC_04494T0 | SARC_04494 | Sphaeroforma arctica JP610 hypothetical protein (341 aa)
      MQVESPIIKNRVSFADPLVTVPDTEEYSLALKYFDKALQRVGPFDAEGFALPGNNLMVPYYSPTLEGGVFALDTWFGRWYMNSLFSKLDAVVQRVKSDRATATLVVPYHPEAAWFKQMVPLLADTPIVIPPAHGVFLKHGREPMSPPPFVTLICHLSPRCEKFTLAERFWATVDAHRPVTSALVGALTTRPDDLESCMVQDRLAPAVAGVVEKPVRCGRGPTLGGFALKPIEHVVGSCTFMATTVMSGELVLEESVSRAEREYEAHQARTVAVMTRAQRLRASAAEADVAASEEPLVEEPALLLSRAREQLAAPEAWLASLWATHREGVSRRCTVWIVWY
      >gi|Rall1000004614|ref|jgi|Rozal1|4615|O9G_005572m.01
      MVGVSGKILTSNVVLSSPNFDYPIFVATYASNKVLGAELYQEYDNRIDPPVEVIERILNKIVKDEAEGSIITPNDSTQPWFTKLKSISIRNPIIVTNMDGTFNKIDQNISETKPDYDTIIVWTFKGNHRNKEDCEFINLSETAFENINLPSERKQGDLRSQIIDIVTDSNLKEVPENERENLLNEAHLQGNF
      >gi|Smar1000004632|ref|SMAR009255-PA pep:novel scaffold:Smar1:JH431878:386467:389171:-1 gene:SMAR009255 transcript:SMAR009255-RA
      MGDSTHEGESSRSTRTSKSKIKYGILLLLCCCLLAGIGIGIGYMTAIQSSDYVNNAQDGKNKTCTHTSLLGDDPFPWWTIDMKSQYEISKVSFQGREDCCADRNHDLQVRVGNYLNKGWGTVLTLNGLCGEIGENKGQKMYEFDCPEMLIGQYVNINKAILNDNSAVVSLFKNHGSSKHLLHSFGRDLWEDACRHKISLSFVHVPGKENTIADFLSRNFISADGEWSLHCSVFNKLTEKFGKPSIDLFASRLNFKLKIFYSWAPDPEASKIDAFAHFWSDFSYAFPPFSPMEGPSGQGNHSIIGATLDDATVVSTPAEKPNSNSSVVSGQGLALPGTQPTGEAETKRSTRPAWLQNLGQSHVAQGFSKDAAELLMDGWRPNTVTTYSSGVTRWENFLGSQPTTAIEKPATPATLTANLVASMYNKGLSLSTVNTTLAAGSAAGFIIPETRATPAIKQIWDVGLPLAELRKMWPHEDLDWRDLQIKVIMLIALVIASRISTIQSLCLDDLTISADVAIFRPSAIQKTLQSGIHPVLRLAPYPAEPPICPHLALCDFLLRTQDWGLFLIQNSPFSGPSKYTISHWIKDFLAKSGVDKNVFGAHTTRSASTSKAAKQVRIQDILNAAGWTFEFTFARFYHRPIVDPNAFQNAGLATE
      >gi|Vcar1000014693|ref|jgi|Volca1|108458|estExt_fgenesh4_pg.C_1190017
      MDTIRQGSGPNSLVSYVAAFGRLMGDLPRRHELDHVFHFAKGLNRDLREEVYARLPQYGTDVTLQIIIDLAMGIFSGRHNAHLIDYNTAPCSSRDVRRDDPMDLTANVYSTTLHHSNPPLPSGVKTIGVPYNLRLQCMAQHVCLQCGDKSHAKEHCPELRRLLAATTISQPNSSRRHSFQGANNSRKPPLTAPTAAAPSTSATNPLPAPAPQAPVKSALRPSSAEPVSHSLPTSSAVHSSEPAFNVPCLAQIVRKPARRVGRKRAHAAAASTSQGACQASNDWKIAQLLFLEYDSQYGPFTVDAYCDDLGLTAQLSPFFSPSRPFLSTDIEGECVWMVPPVDNASTNIARYLDAKTANPNTSAIIVLPDRPQAPWGPLIRHMTIVRRFPAGAQIVCRPLSSDPSRPKAGYGDSRGKPASQPVPTKKVDDKPCKEGNPNDDPEELGSTEPCTQEAEQHFCSAKNSHRLFEVKPDELPNVLKEFNVMKARLAYYISLGRVTWEAAWVSLRPEAHLLPRMMRLAQVATILPTQTACVERGFSRHRIIKNRLTNRLKLETVDSLLRVGMLGPAADTGAKPLIDQAAGIYSRKGYTGIIHKLFSDVSKINLNPYGEDDNEEVATDPEIEVLVQATYPASDDEAFSDSDYETDVERDTDESSVDEDFVDEEEDAKE
      >gi|Spun1000004719|ref|SPPG_04959T0 | SPPG_04959 | Spizellomyces punctatus DAOM BR117 hypothetical protein (1591 aa)
      MPTKSRPSICEMARRYVCRRNGQSPLQAFERFCYPLALSVCRHAGLRFRTLHVKLPVTGVTGLCNVLCMPANSNTCSARICSILGNDAPSTRSARMSVFQTLPRSKENPNNHSRRMLRGMGQKRTAGCCCVVQEDRDPFDTQIPCTSCCQFFASCLPSFFSQDLVKKKKKVSRKSIKGRKSSSANERDDDFLEPVILPPIINSRKLALPANCPAFHIDAIVEVRDGAKTWWPGRVVTVQSGKVCVHYDGWGDQYDEWIDCESQRIRLATQMPADQCSERISQENEHGQTGEAGIGPDAVILVGRSVREKRKKSAAYTNQKNAKRKKANSRTNVTVVNPITSKSTESATPARPQGFNAQQPRSSSTSPVDNFGIFRAAANREAREAYISRRALANDATADDMKRLYGKGARVEVCCAGGERYMATVIKTRSWQVLVQYDGWDEAWNEWIDMNSSKMKLVEAASGENNSSEDGNSSAGECSSEEDDEQKWKIFCNRCEKRIRQYRYFCTYCEVPSEGFEYESFDLCLACFQQDFPLDHPHPIQSFAVEPLLDTDDPTRRKFKDGELVSTFVLDEFDTSYIAMGTNQSQIDAETVVTAIPMVPRCAFCHSERTDIVGPFIGPHPFRNTRISGRRMPLPSSEKKNSGKNRRVPIFWAHDACARFSPEVYFMKDSGKWHNVLKALARGRGVKCAACKERGATIGCFDVRCTRSYHVGCTRKPLSQFEEGVIFWCPRHESLVNKADNYKDVYNCDVCSNSLGLSDNDEQWHTCDECAQNHFNTFDLCKECFEGRFPETHDHGKDRFITTCMSQRKEIREMEQQLARELVAANASRKSLGQRRKKKLERASGIRCAYCWIDSSSRWRKGYNGIPMCEDCFQMASSAFASSKAPELCATPEAIPSPSPSESLSQSPVNAPTPVPVRIEADSPAPVLDPTSMERVYRTEIEAYSHEYYLTRGVVGKASGADEIGAAEVNKSSEFGILQSYAPTDDQLFTMGFDTSFYDIPGRAPRWATHSGGDYHGTWLPQIVRMSLLRYTSEGERVLSNFSGRGTDAIECFLLKRRCCSVDINPASVALSQRNVSFSVPPELGLTAAYRPVIVLADSRELIGSLFEDESYDHILSHPPYKDCVSYSAHIEGDLSHFPDMEDFQKEMEKIVAETWRLLKPNRRCTLGIGDNRRECFYQPVSFQTIRTYINDGFELEELIVKRQRYCQMAPLGTYLCTQYNFLMFTHEFIAILRKVDDRQHSGLFSYLKVDDDHDFHVNPTRILRVIPAAPIDRTSIVMGTVWTFRVTQKHSLARLAMSKLIERFGTDSAYWEEVSISEFRNKVIRAAVYDDLCAERDPPEEEDEEEENGTEVTEYERRRREQLSKNTRELLSMGLISELSPEGEDDAKHLETLLAMPPVQTIQHEGCPTMHPPSSPVIIFVPHINAPSTAILPHAWINEYRKFVIDCARDAAARLTDGGYFIIGVKDARIFLPPKTDPSSENEQPCGIQTKYVPLGLLVSEDLSRYFEGSEMRLKDFVVAVPEGYSRDKGIEFEEMKARIDEDEQEWKSEQEKEANGQSNVRRLLPIVQAYYFIYAKQAGNRKLSEPSS
      >gi|Mver1000004713|ref|MVEG_04708T0 | MVEG_04708 | Mortierella verticillata NRRL 6337 hypothetical protein (324 aa)
      MSRMNTWRVARTALPRALSSRIVARNDICMRKFMGAQQVMSTATALPSVQLSRKISSIPSAYATKPLISHYSTSAAEHSPTNSSTNGSFYATLSTPPGIVATHFDLSKIPTAEHAQITHDFIVVPNYLSPQEHDMMVEAATKKLKRALGKQVRYEDGHFDGVITRYRECSASDWGAPPSLSSQGAEQKERQTPSEVLQAIKLHYFPQDWKWVAPHILELEAGKGGIKPHVDHLEASGQVVAGICLGSTAVMELIHDKEPSKSFRVLLPKGCFYFQRDSVRYQYKHGIPIELEDHQFKGTVYPKEKRISVMLRNALEPVHHNGH
      >gi|Pram1000014720|ref|50629
      LLPGFVVLKGFLSPQEQQELVNDSRRMGLQEGGFYKPTYASGAKCRLHQMCLGRHWNVKTEKYEDQRSNFDDAPVPALPMSWKLYAQRSLEAAKAIDPQVMGTCKNMMPDVCVINFYKKAGRNGMHIDKDESDEAMTMGSPVISFSIGCAAEFAYIDHYPEPHEAVPIVRLESGDVLVFGGPARNVVHALTRVYNRTQPLWLRMRSGRLNLTFREY
      >gi|Fcyl1000104830|ref|jgi|Fracy1|179302|e_gw1.1.1865.1 ; gi|Fcyl1000086724|ref|jgi|Fracy1|160953|gw1.1.1865.1
      MTVLVERSFHVLVLALLLRYSALSGGQLLEDLRGGGMQGAIHSSVFDVLQSHFSKIPHNDKKSSSSTKQFWLEGFASPFNATLPRFASAFPDLDWHFGSVGRFLDCSFDGRLKYCEANPPFTPGIMLAMADHTTNVLQRADNDNTRLTFVVVVPSADNKKNTNKDEAVVKHEAQKSFRSMVSSVYCTKHIQLKAREHGYVEGAQHLRPTQYKQSSYDTSIIILQSPKAKKYGLDKTNMKQLEKDIRIAFASRHENEIMERKKMAASAM
      >gi|Mver1000004892|ref|MVEG_04886T0 | MVEG_04886 | Mortierella verticillata NRRL 6337 hypothetical protein (2160 aa)
      MDHDQQGHNRRPQPSPSRPQPPQPQQRHSERHPQYHPQHHDPQYYDPQYYDPQRQPQQQYPPSHSSIHTPSPYLDHHPAWTSASPRTQLQQHHHQHQQHQQQHQQHLHRSHFLPLVETPPSQPPHRSYGRASFSHHEEQPHFYQDAYRDMHDHGQMSSQLGADHARSHSPLLPALTSSHTSLHPRPRAASFSASLPSFASSFSSLVHPVPLPPSSSPPSRAFPEHTMPSTHHTRYDDFPQRKRGRPSHGGFGGDNNMDSGVFQSTGQAIGRTDLDSASPHSRDSRPSHIFRPEGDDQYQTYYHHTHAPDQTRPGHSPESHYTPHPSLQIQTDTQTHSPSKRPLILSPLLGPEYSEPLETHRTLSSSRRSISHHHNEHHHNEHHHQLHHQQHHQQQQQQLHEQRLSHGPTQSREQFLHDPMEHATAPLPRTNSPSDPMLRPSSSPSSFPTSSSASIPMPHFRGQLLHRAPDSAYQRRYFGASTSGSGSSPNPLSSFSPSTVPVPIHSDPLSKMPRKTSTSKRPQPNDTIGNTSTSGPFSSGRGSIANQDATLTSYLHTQLGRLLLSTKAHVRRLLESFLHQKAASTVHPRLESVLVELFNGMLMSHNALATREHISSQEQPMTLRPVLIQSLDHLAALMAHAPSVSSSSTPATLATATPGATRKKAPPHPPSRLSQSIYFSDTDRDTDSDLDLEGVAQGTDATRREATEGTTGRSSGPPRPESELLAIEIETVEMYTSTIEFLSLMTAFVSVVCWLMQARLKRDTSQPTTLGVDIEDVDMENRQPTDKDCFDRLHSFLLLFDDHDNVLKPIQDALHSEEQGGHSTTHPRGAMDDPTRQVGLFAWHFSLFSDDEDGPLQQEVNLIDRQALDSIHFDVILNDLYSTHVLAMTAKEHQKDHGQFYTPSNVVDFMWRRAIVGRENLLERFVANLGGAKGQGVQASMAPVESEASLVPTALDPCLGVSTFLSCYVRLLIQKARQDHTETIWNSPIASRLLLAQICENIWGIELDGFAFWMARCGILASLIPLVERVQKLQHQQQQGLQAYQAGRGETTKLTRLHLFRNDTLQLTVPDGVHPDKSWERACILQLRDPQLLRFDFIVTNPPYMIRKTGTFSAPDPEVYDWSILETGGSPTIITSNVSPSETGSRSKPRRGSISPNPPINEDVVSAAEEEDELEGSDSEATTPDSRSGSPRSSRVKASSSSWPTSSASASMRLGAKGMMQAYGYFIWFAAQRIKPYAGVSCMITASQWLTLEFATKLRAWLFENCLMDEFFQFEPFKVFAKVQTDSLIFKIRSMEPGRTRQDSSIEPSIPLYDRLLEIGAHRTVFLRHTDHHRPLDGILQDYMDFFAISPQEQSSSVNIMVSNKTREELSAVIAAAPQPSSSSTTVTAPTYSFAPMMPSSLLSTFLLSLTQDLPGICSAGTKRVNRLSAVEPLLWHRGPNTNPVYGLVVRMEYAEVMFGEVMKARWFRPAFYWNGKNSPEVGMMTKALHKEGQFWQGRDRLRLSKKEGSPAESYLVPTPGSHRLYGLCMVDKESVKVLREQMAQGVQGAAALWQYLTDVRNHFQPGLASKKRKVFLSGKQQMTDDEGVAYCSTNQCGSDVPEKLVHPINYGYFSKTQPRQRFFLDTSSLAVTNQCIYLTLNKLSHHYDAAQSPPLIYFLTLLNSSTLQFFVLHHCQYDQQGRMRLFRESMAKIPFQDRDVKSSPQRIQYAAQLGQLMIDLKGTLYKVVMEWHLTGSSSRTDLGAPRLSEPFIGSVGGNQGLLDWIRRGGDPPTGVLPKTRDQIWRMLQGHASAPTTRSPSAPSSIAQLSTSAPPALPALGAHFHRAESLSTQADIDTDTNTGTDTDTDTDDNFESGRRSRFDQEEDFEKPRREYQHPLQEPRASGFSPQQYNQQHSSWLKSSNDPTPSLTPLPPSLQPSTHNQHVSQTRHSLQNQDTECDAIMRALERAVTMVEMLQWAVDQYGYMLYGIQPRFQKLLELELKVAYGSRLDAVIVNMSSPVSEPLLLSHHHQHHHHQHQPVGPLDRDQPMRFGEEMSFDHVEDPLTVSEGSLLSTSAPPALYSTPLSAPIAPALRPILPRQGAVGPHSRAPLATPVSVHTPSNLEQDLGITVSKLMRWDKHEKDPTSIAVPSYAQSIMENAQAAVSSLEDLLRRYPPL
      >gi|Caps1000024925|ref|jgi|Capca1|112116|e_gw1.37312.2.1
      ITIYGKTLPIPRLQVWMGDADANYQYSGLELSPHPWNPTIRSIKQQLQPICGHNFNSVLINLYRNGQDSNGWHSDDEPELGENPIIASFSLGATRRFRLRHKYRKDLTPYTFDLMSGSLLVMAGSTQKYWQHCLTKTAKQVEPRINLTFRKTLRGLDD
      >gi|Vcar1000014935|ref|jgi|Volca1|100989|fgenesh4_pg.C_scaffold_273000001
      MRHNLDMAGVPEDSPEAAKIALSVLRGSMGDSLRRLNADPATRFTSYQAVLQAVTPLAPDLIDLALTISTGRDAANRIRDDPSPPRPRDHRRNDLMDTSAALDAGRCLQCGSEFHDKLSCPDLIHLTAPTLAAMTSTAVPTQPAAARPVPVNVAFKPPPAEQVSHSLPIASKPARRVGCMRARVAAASTTEAHRVDSDWMLSRTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIVLPDRPTAPWAPLIRHMTVVRRFPAGARIVCRRDPSDAS
      >gi|Psoj1000004970|ref|131422
      MATLMRISPNQSRVSLKAQHHPTLLLSMDFRQLLREERRRAREAAQSQTSTELRSGQEAATNNATEDEDKLAQWQDATLKVWAKRSKIDIEEFRRGPIPGVYYIPNWITQDEEKAILERVYAVPDDSDLWVRLKHRRLQMWGGEVKAPFDPKPLPRWLTQISQTLVEAGIFSEEKKPNHALINEYGVGDCIMPHEDGPAYYPFVSIISTGAECRVTFEPHRALEASSATVSEVVPHFDFQLERRSLLLFTGEAYTRYLHSIDNIEVGTRISLTVRHVDLR
      >gi|Fcyl1000014986|ref|jgi|Fracy1|232852|fgenesh2_pg.1_#_1050
      MPKKRRKTQDVSQLSNLSNLTGFGNAGFGASSSLDDCLAEMSNTTTREKQQPSWIRQAKISTAEGIDNWNRNFQIWAKGGIYHPGLLPNIDQEIARNFKVQELSTLLLSNEFVKGKDIRMTTFERWLLDSKHEEEEEDEEDNAGGGGGIVQGDPVLPLRSSPDSKASQRLLSELITCTSTNLKNKNNNKTSKKIPRQDAEKIIAKLCRTTNLTCQELLCQEDRYRKQSPLNKGDRINVSTKTTESSNVIVVSILYSRKRWKKPFCFKLNQNHYKLLKDRFMEIHAPSSTTLTNAPLFGGRNNNIMSSSDNTMTVLVERSFHVLVLALLLRYSALSGGQLLEDLRGGGMQGAIHSSVFDVLQSHFSKIPHNDKKSSSSTKQFWLEGFASPFNATLPRFASAFPDLDWHFGSVGRFLDCSFDGRLSNSGGGGGGDDEEEYCEANPPFTPGIMLAMADHTTNVLQRADNDNTRLTFVVVVPSADNKKNTNKDEAVVKHEAQKSFRSMVSSVYCTKHIQLKAREHGYVEGAQHLRPTQYKQSSYDTSIIILQSPKAKKYGLDKTNMKQLEKDIRIAFASRHENEIMERKKMAASAM
      >gi|Caps1000015030|ref|jgi|Capca1|35874|gw1.256.16.1
      REYGKGGKPKSKYDFMTIDEKQVEVDKIRTGIKQRCLFNDSACKEIEAKIEEVVRKAANGEYKKNTVDTAPLRNKYFFGEGYTYGSHMEARGPGMERLYPKEGEESVDPIPEWIHEMVVQPLLRAQLIPPDFINSAVINDYQPGGCIVSHIDPYHIFDRPIVTVSFFSSSALSFGCKFEFRPIRVSNPVLSVKLPRGGVTLISGFAADEITHCIRPQDTPHRRAVVILRR
      >gi|Hrob1000005052|ref|jgi|Helro1|69129
      IMVANGGLGNGVSREKLYLLFSGFGTIQDIQMKPKRSYSILNFDDVNDAQIVFNKFNKFCAIKDESEKDVHLCLAFIEKMPYLEDSNAMHYPQGLEIVNDFVSENEEKMFLDYCDWGEEDSISLKNRRVRHFGYEFNYKEKTISKLNGLLNRIPQFCCDLLKKLKQLNYVTEDFDQLTVNQYNPGKGIPNHVDTPEVFEKEIAIISLGSSIVMDFKHPDGTHVPIVLPRFSVAILKNESRYIWSHGIAPRLYDLIKSETGLTLARRSTRISFTFRKIKTKLLGNFLFTNVMKFPKLNCVAITVDNDKLAQVYNTIAPHFSRTRHSPWPSVVNFLNRLDPYAIVLDVGCGNGKYLNTRKSDLFMIGCDASPTLCQIGSSVGEVQICNVVALPYPDCTFDACICIAVLHHLASQERRSLAIKEIVRVLKVGGQALIYVWAMEQDFAGVKSKYLKSVKKITELTEFHKVDDNLSLPVHNSKTTFKYCDNLVPWTLQTEYPTKDDLNSVEVYKRYYHVFCKGELVELCRQFDSSLIVSNVFYEEGNWCVEIIKTNKIETILL
      >gi|Vcar1000015077|ref|jgi|Volca1|100923|fgenesh4_pg.C_scaffold_231000001
      MGQRQSLIPGRIDRECEYFARLTISPMNGAGGTNDLGGSLDVAMASGSSGNPAGEAGAAAPPLAPVVPVPVAPVGQPVIPIAGAPAAGAIPNPAQPVAVPAAVAVAQLPDAQMVRNIPAPKLPVATPVEPDRIHAFVADVRDYFVLVGWQANIPAQKLFISGALEGFFKEWHITWTKSVPDYTPDQLLDAFLIRSAPEMYSRTHVARTTFYSATFKQELNESVLTFAGRFEALLQDIPDMQEPTKLWHFREKLLPYLREECAVRPTDFQEHTVYAELLHLPLLLLWRHPLQYTLDLSRRRPSLPLDSRPTRVRAVVLEAVAGLVDVVVAQQAAWPDAMVQGVAPPRFVRRPDGNCLWSKYWQGLGRSVHESEVASIATTMGREFTLDACASDCGLSAVCNAFSCTARPFLDTNIAGHTVWMAPNAADLPAYVTHYRACKPLAPQSTAACILVPSGTEPSLLKGMKLVRRYPVGTSLFYVPDVQGSRALLPPITEVMEVWYDGPDSTEEIPACTAIGNAVPHLAVKISGSTFMAMLDSGATHSFVSEALVRLLHLHVLPSTFTYVRLADGGMSPIVGQPMLKYTLDLSRRRPSLPLDSRPTRVRAVVLEAVAGLVDVVVAQQAAWLDAMVQVGVADREAGVMLTQVEAVVPEEEPPQVAVQRLPVTMYKEAACFFLLLLKLWRYGMMVQTARKKSLPVQLLGMLFLT
      >gi|Ccor1000005156|ref|jgi|Conco1|17958|estExt_Genemark1.C_990013
      MKSLLYKSAFNSKRSILQLCKPSTIIRTYLSSTINNFEYYEHIDSENLKYIDPINVPSTPNEKFNHKDFLIIPDFITKEEEEMLVAKSNFKLRRMRHYEDSHFDGVIQNYKEVTVSDWAPFQNEVNPILERAKSLIKDPINWLPTHILDLGGKGGIDPHVDNIQASGDYILGLCLLSDSVIVFKSEDYHFEVLAKARSLYMQRKTIRYNFTHAIPTLPQDHKFKDVLLDKNRRISWMLRDMKPLSK
      >gi|Caps1000025183|ref|jgi|Capca1|219923|estExt_fgenesh1_pg.C_4020007
      MNNPSLPPPPQASGIPPGPQTPGIRPPPGMLSPAQPGDPVHDLPHDLLEAGWRRFWSRREGRPYFFNKVSNESRWEIPGQDAFTDSDPLGIDASPAPMPSRSLSIDTAMPSTSNGVGMKRRSSEDPNLMSPAKKISFSYSPYWNFDIPTNVMIWERKPCYLPPPHPEIELLRTQLMSKLRIQYKDLCRNREKIEPPRESFNRWLLERKVIDKGHDPVLPCLCTPEVSPSMYREIMHDIPMKVVKCKYFNETKRQLSKYAQAAKELMDSRSASAESRKLVKWQVEDVIQWLQRTENASYADYDSRLVHLKKQCQPHIMEVAKSSVEGICLKMYNTAVDHVKKIHERHWEILAEMKIYPSTTGPPPPRRNIPCYPIQMVIPAPRMSGIEHHIEGDVVSLRFKSEHLIKVNTSHFHKLEQLYRCNCRDDPKFENFLPRVWCLLRRYHTYFGLSADEGSGLQGALPVPVFECLHRVFNVTFECFASPLNCYFKQYCSAFVDTDGYFGSRGPLLDFSPTSGSFEANPPFGEELMEAMVDHFESLLSESNDPLSFIVFVPDWRDPPTEALMRLESSRFKRKQATVVAYEHEYRQGFQHIVNKADTNIRASHGTVIIFLQNEAGFNKWGPTRERLNELLLAYNPQIKDATAPSSAT
      >gi|Smar1000005290|ref|SMAR008745-PA pep:novel scaffold:Smar1:JH431850:1046606:1050059:-1 gene:SMAR008745 transcript:SMAR008745-RA
      MASTAVQSQTIPQESSSSQWEETSPATTLESNDGSRDSSPAAATPTGDFASPQHPLSLDPAHELPQELINQGWKKFWSKRENRPYFWNKMTNESLWEMPRMANSSQYDPMTDPLGIQCSPMPSESPATPVIAAKRRASDTDGSSSPSAKRFVLGGPWDLEVQTIAVMWERTPSLLPPPHPQIELLRANYVGRLRQHYQEMCHSREGIDAPKESFNRWLLERKVADTGSDPVLPSACFREISMSMYREIMNDIPIKLVRPKYSGDARKQLSKYAEAAKKMIESRNVSPRSRKIVKWNVEDAFQWLRKTQNATYDDYLERLAHLKHQCQPHLTEAAKSSVEGICLKIYHLSIDYAKKIHDKHWALLNEHGIAEIRSSLQVTNPKKVLCYPVQLALSSPRLPTVEFVQDKDMTILTFNGDSVRINSLYFQKLEQLYRWNCSDDRKFENFLSRVWCLLKRYQTYFGVSNNEGHCTQGALPVSVFQALQRNFDVSFECFASPLNCYFRQFCSAFPDTDGFFGSRGPILDFRPVSGSFEANPPFCEELMETVVEHFEKLLSHSNEPLSFIVFMAEWRDPTPVALAKLEASSFRRHHVVVSAMEHEYRHGFQHVCNRTEVNIKSAHGTVAVWLQNEAGYSRWGPTKERIEAFLDSYKLNRDREHQEIIAEVPVTDIAITTATAAITTTATVAITTTTTTTTATLAANVITTATTPCKSVTK
      >gi|Sarc1000005302|ref|SARC_05176T0 | SARC_05176 | Sphaeroforma arctica JP610 hypothetical protein (256 aa)
      MIRHQDDKPIIEGHLIEGAPPSLYCVFDFISADQETLFLSKIYDGSKKWTSLLNRRLQNWGGLPSHKGMVKEGLPQWLTEQCENLSAGSVFPSGQQPNHVLVNEYLPGQGILPHEDGPVFSPTIATVTLRSHCLLEFYPHRRQPDKDDPQAGAEAEDGTEAGKDDGNEPEFKIYLPPRSLFVVKDDCYRTYLHGISDTSADIVDEKVLNRTQSTHALAINEVLERGTRVSMTIRHVPKVMKESVQQALMARLLKR
      >gi|Bcir1000015312|ref|jgi|Bacci1|280182|fgenesh1_pg.351_#_5
      MRKPNDEDLLRLQDRLKVYDQIRGNNINDVTRACSSDTIIMQPSQHQGSLSAYSRDFEHSSGQVKPVEKAVIRSNNSEKDVQLYTEKHQLSKYWTWSTDTPALVVDALRQVWLPKGMYLYPPWKLMPQVLKKVQEQKSILVPNDLKDKTPTITNHLENKSETVSSHLAIINNSKLKDGLDEQRANRDYSSTHLSTLRSSITSAFSIIHSQKQPITDQLLIKEFFTA
      >gi|Mcir1000005378|ref|jgi|Mucci2|37740|Mucci1.e_gw1.4.913.1
      MSNNIDWEDLFGSDDDSDKEVEEREHAKPIITFEAIPGLKLIKQALSHQEQMSLTHALIDRNFFTGNANQAMLFGELPDFIKWVEPWISDNYPDLFGQDIMHRQPLFDQAILNMYKKGQGITSHVDLLRFEDGILIISLMSSCVMTMRPARKDATSYHAENTDDSEKHDILLCPGDVLALSQEARYDWEHGIPSRLVDEIEGRSIERGTRISVTLRKLKHGEQEMPSAATSER
      >gi|Fcyl1000015510|ref|jgi|Fracy1|233376|fgenesh2_pg.1_#_1574
      MYYHSLDIDDNDNDETVTIKKKKKKNDDGGGGGEEEGRTAIIDDDNSNSKSKSKSDADLNNSTNDNNCYYKYSGTSRLCLPCNRSRIDENFIINLCSAADQIITQLQMAAIVALKEEEQQQQQQQHQQQQKNNNDNKKFLENIRLRIIHQKGENHNDNNNDNDCDTIIIENKKEDHVGQKRKTKKGSRKRIYNSCLVNWYKPDHTIGLHSDDEPEMDTVTYPIVSLSLGGPRRFVLKSKQSQSLSKQHQQQQQSQSNNRTIQKNHEFILKDGDLFIMGGNCQKEYKHEIPKVRKTKDHQFGGITSNRISWTLRRMKIKTSKMKATTNNSTNSNSKSNSNSNNNNNNNNNDPKQQAARSLIRNPYASQQQQQQHKRQRR
      >gi|Spun1000005556|ref|SPPG_05863T0 | SPPG_05863 | Spizellomyces punctatus DAOM BR117 hypothetical protein (222 aa)
      MLTSNPLEQYILPNAPPTVYYIPSFLTPQESTTLLEKIYKTPKPKWVSLRNRRLQNWGVLAGHCPLSDPLPPWLTAIENRIAALGIWHQHSHANNCLINEYLPGQGILPHEDGPAFHPTIATLSLSSTCLLQLYPHGQTKASHTLFLEPGSLLVLDGEVYRGFLHGIPEQRVDVLEGVVNPVRDFGKEGGRIERGTRVSVTFRVVKKVAKVSVGRVFGKRV
      >gi|Ccor1000005557|ref|jgi|Conco1|7959|gm1.6199_g
      MVELSNRQKKLLKHQSTFRDPATFNNQTAFRSAERRYKNRFDNPDYSACYDFKYLDKNLPHIREQIVELDLGCNIASLCEYFGSYKPEVTKAYVIGTIPGLIVIPNPFSNRAQRGLISACTRVYSKPPYVSNLDTHFTVPANGVWDVYERGHLMQITTDDPDYYIPLKLDPSKVNDTIKSGSGLKNIETSTATEGFKYNGLNDRNVLIKHSNLKLLPPDQIIRRLRWITFGYQYHWPSKEYICDENHKVPVEFSNLCDAVVKSVYQVCNPISGEYISNYNPCDFKSEGGVINYYQLKDSLMGHVDKSEPNTTAPLISFSLGQSAVFLIGGPTKDTEPIPLILRSGDILIMGEQCRRNYHGVPRIIENTLPEYLTSSKTNPSLEEDELNYWDVIADYLQTTRVNINVRQVF
      >gi|Falb1000005586|ref|H696_05801T1 | H696_05801 | Fonticula alba ATCC 38817 (V2) hypothetical protein (411 aa)
      MPSTPASRKRPAAGGRPLVEDTPLWGALPVEGSTHAAPRTDTAFRQAERRLKQAPRAELLDPGLARTRHGVHDPHNSDDLAAIMRRAPGCFVAAGALSIADQARLAQCAVVSWTGGRTNFSSLTLGSLKALTEVDLLAESPAPVARAAMISNAEACMERSCSRHHCAAALSQRHGALVKNPPAAKAPESDPRAAALKLVDRITTAEGAAAGCPPRRSHPWREFALAGGFPRKRGVPAPPLRKLRWATLGWHYNWTNKTYVEGDHGPMPELLASLARGAISQSPMAEEEGRKWVPEAAIVNYYSPGDSLTGHADFSEQAPEQAPLVSIRYDSMPVRPCPGQGSVLWVLTLLSLACTVSAARPSSLLEALVDRWTRCRFSCVAATLSSSVGPLDWLSMAFHASWRRRLHQHC
      >gi|Falb1000005587|ref|H696_05801T0 | H696_05801 | Fonticula alba ATCC 38817 (V2) hypothetical protein (452 aa)
      MPSTPASRKRPAAGGRPLVEDTPLWGALPVEGSTHAAPRTDTAFRQAERRLKQAPRAELLDPGLARTRHGVHDPHNSDDLAAIMRRAPGCFVAAGALSIADQARLAQCAVVSWTGGRTNFSSLTLGSLKALTEVDLLAESPAPVARAAMISNAEACMERSCSRHHCAAALSQRHGALVKNPPAAKAPESDPRAAALKLVDRITTAEGAAAGCPPRRSHPWREFALAGGFPRKRGVPAPPLRKLRWATLGWHYNWTNKTYVEGDHGPMPELLASLARGAISQSPMAEEEGRKWVPEAAIVNYYSPGDSLTGHADFSEQAPEQAPLVSISLGCSAIFLIGGTCRSVDPVPVLLRSGDVVILSGPSRLAFHGVPRILEETTPPALLTYLAARGEDLFAEVGPGGQHKCPLLPSPPAEGEDFPSATTEGQLLAEYMRQSRLNFSVRQVFAAASPK
      >gi|Bnat1000005628|ref|jgi|Bigna1|77192|fgenesh1_pg.46_#_69
      MPRTSLQRTNTGALQALSHLPWRMYFRQPRNSQACQAEGRTSESTSTRMVAKMVLSRPRRAQRKRDTHTPPQASGPKYGGGNHHNSQHPKQRGGPLSRAVSLLQAATTPDEILTVIAQHGYDATFRTSHIVQGLLRLAKSFVPASASRARKEFVADDARLQELLQALLRRTKERDFRPVQATDTLLALALLYPKEVVNDGTGEHYNNNNNNKEEKEEEEEEGQCWRRQLQPQLLGIINRDQDLLTGFECTNLAWACRKLGIDTPEGVAARQEQLPFQHVQQAFTDIEMERLLEEIEFNVDEVGLNLGGKRIKERRMTAWQSDSNKPFEYAIYSGKIMEPMEMTPTIERLRDEIYRRTSVRYDCALINLYPDSRSGMRFHADPDQNTLWSTNSVVVSAGDTRLFIMRQMNDHSKRHQFYVSAGDIVIMYADCQERYQHSIRTEQSSNDDYSSTVAATQRPQQRQQQQESRRDGDFPMGPRVSIVFKQSLEEWERTNRVNHTATIHSFA
      >gi|Vcar1000005651|ref|jgi|Volca1|100148|fgenesh4_pg.C_scaffold_107000001
      MSLSNEFPFPPSFPAATDPAPAPVAPPTTSEAALPWDRISEVATHAAVRALENRSSNNQRGPAPRFNPTAKDADLASWANRLRLHFTVCGLHEDSAPAVAIALSAIEGPTLDTLLLLHQRTPFTSLTAVLQALTPLAPRPDRLLQAQISMDTIRQGSGPNSLVSYVAAFGRLMGDLPHRHELDHVFHFAKGLNRDLREEVYARLPQYGTDVTLQIIIDLAMGIFSGRHNAHLIDYNTAPRSSRDVRRDDPMDLTANVYSTTPHHSNSPLPSGVKTIGVPYNLRLQRMAQHAPVKSALRPSSAETVSHSLPTSSAVHSSETACNVQRLAQIVRKPARPVGRKRARAAAASTSKGACQASNDWKIARSLFLEYDSQYGPFTVDAYCDDLGLTAQLSPFFSPSRPFLSTDIEGECVWMVPPVDNASTIVARYLDAKTASPNTSAIIVLPDRPQAPWAPLIRHMTIVRRFPAGAQIVCRPTTSDPS
      >gi|Bcir1000005678|ref|jgi|Bacci1|183776|e_gw1.67.64.1
      MSPHFVSKRQQKIWEKQQRENAEKRSSYANQSAFRYAERVFKSNIPTSEFEEVVDFSNIDNNIQETRDNLVKVKLSNDLRQLTTAFGVPSESPQVDAIVMKNVPGLIVIPNPFTPEAQRTLVKHCLTDCAKPPHTSNLDGHYHTPLQGIWPLYKREQEGILKPGDPDYYVPIKTTEYQDQGIYTESVDDDDAVSTVNSMYSSIGKSAHIQSPTQLLKKQRWITLGYQYHWGSKKYNLDNPIPIPELISDLMKAVAIATDDIGCDEAIVKWKNEYNGKEYKPEAGIINYYQLQSTLMGHVDQSELNMDAPLISLSLGHSCIYLIGGFTKDAKPIPLRLNSGDIIVMTKICRKAYHGVPKIIKGSLPEYMVHCDDPEWPLYAKYMSTTRINLNIRQVFPNEP
      >gi|Fcyl1000025701|ref|jgi|Fracy1|243567|fgenesh2_pg.11_#_461 ; gi|Fcyl1000116478|ref|jgi|Fracy1|190950|e_gw1.11.582.1 ; gi|Fcyl1000084399|ref|jgi|Fracy1|158488|gw1.11.582.1
      MKGSKRKREGNPTREEQEPSSTSDPTVNQLYLTPEDGVYFQKHLQDHYKGFVHLPVDRLEPRDFHEEFKNSLERLRDAGYYQYDVVMAGGKYSSRTFVKRTLVGNPGITYKYLGLRLFAHAWSGPGCTPLMKSIGDMNQSMIKMTEQFPENERCDYNLTLINYMEPTTHTKVGFKDEANYGMGKVSVSWHADSSLVDNSSIGVYHCLPTQRNSKWDWKIALRPLAIESKDNTGSTKRDNKSTTPKPVVVNTKDGDAYFLLGNFNMKHQHCVLAGSQANRISSTHRVAVTIEDTYDYILKRVKIARKRFRLQMETIPPSLQSPVVGLDAKVIRYCQRILTEVEMEWIAQYWLQGDQHDKMRVWWQKPMKTLEAYWCALEVYTYRLFEFLLSRIASNDTVPIDVVKVLIIEFKTRQSFREQWDERRSDKIYQRRVSEEFRPVARPIFENNDEVKIDERRLPKDLTSAIQALSIFLEKDSKGTKSKKVEQVIDDHPLPPPSKSCQSKISSIVKEDIDTVTTKMSKQQPKEEHSSNAKKRKKKKRNKK
      >gi|Chet1000005718|ref|jgi|CocheC5_1|33815|estExt_fgenesh1_pg.C_410002
      MKRGAIDSFFTRPSPKKPKYQASKAKSSHASYPFAIPHLPEDFAQQLGFAPADEGKVINDQLDLDLVYYQPYIPSSIAGGVFEFLRQELPFYRIIYNITRGGVQTQINTPRFTTVFGVDDTCRFTPDGKIIDAKTSKPVEKSRYKCAPRPIPQCLDELRKVTEGTTGETFNFCLVNYYAHGKDSISYHSDDERFLGPNPAIASFSLGAKRDFLMKHKPIPPKDGEKIEEPKGLKLPLGSGDMILMRGTTQANWLHSIPKRAGPEAGKGRINITFRKAMVKGGTENYYQYNVGSGGVYRWDAKAEKMIQQEIEKGND
      >gi|Sarc1000005744|ref|SARC_05614T0 | SARC_05614 | Sphaeroforma arctica JP610 hypothetical protein (206 aa)
      MGRGNKRIGVFPWELISDVLDKVIEEATRITICVPYYPNTTWFPKFLSLLEQDPMIVENTNNTFLNEGTTVCGKTPLVGTLVAKIGTKSPCFDQKLAHPAIKIVSVQTDDESAELDTVTFEDRVTLISQYHSVGHYTVEETVNKLQLNGHTWSRQKEHVNQYIEGCVPCLKYKSTKRASTFTTNNSRISGRQNTSEHHRSLDQLE
      >gi|Chet1000005801|ref|jgi|CocheC5_1|108207|estExt_Genewise1Plus.C_130166
      MASKMADLMDLVSQNQPKFQTHQALLAIGLQNDFVLPDGRLPVNTSTGFLDRIQTLVPKFRELSGNVIWVQTLYETDRIATGADTGEGDALVVGGLIDARRKTLPKEIAKASVEEDDELFLLKSEKRTPACVPKTPGAEFIDFVTQQMELPADAVIRTTNYSAFQGTNLLITLRARLVTELFICGCITNVSVLATVIDAARHGIKICVIEDCLGFRKQTRHEMALKRMDDFFDAYLVNSEEILEKDPAELPQKPELSSNGANSDAKQNEKMVEDLLSGFSDRVRMSLSRGPKSEPGPEAKRGSAPSSKAASTISKQDDKEIQQSLAESGKSASMSQSESPAVEPIQNKIAPVTLATTPVEETAKVPTKSKSPKLKSLANLPVLGPGDEIAEGDSRIIHDFFPSDLRHPSDPSQPLKDIIFGQLYNEVRWQKMLHQQGEVPRLVCCQGAFGDDGSMPVYRHPADQTLPLLRFSPKVQIIRRQAEKLVEHPLNHVLIQLYRSGNDFISEHSDKTLDIVKGSSIVNVSFGSQRTMRIRRKKPQSKKDETLVEDSAVAQRETQRVPLPHNSMFVLGLESNKKWLHAIQPDKRLASERSEAETSHNGIRISLTFRNIGTFLDAKESTIWGQGATAREQRDAADVINGDEEEAKRVITAFSRENHDPDFDWDEWYGDGFDTLHLQTPPKDTPLLFASNNPIETRQAQIALAECKIHYSLLEAPATEETYEQDRQVTFRDADTHHTEIHTPFSILLYLDRYHPIDISPSSHPVIASAYPVMLITADVTKTWLSRANSPAEMVSEFDATLSSLLQRLEDGFEMHAGPYIAGVRFSVADCLAWPVIDALVDEWNGWSAEKFGSLDAWYRACWKKKACVKKVKEKLSA
      >gi|Rall1000005813|ref|jgi|Rozal1|5814|O9G_000964m.01
      MRLPVLAPGEIPAPEKYIPRKGVESRVPMNHTAFRIAERRYKGRKTPIDFSQVIDPRKGHELLEKMEYKVSKEYKWKDFGNRAFGAWSVKGMPGFIVIPEALNEQEQKELAKKCYTSYIEPPNLNSLDRIFSIPSKGLWKSMVENTPIYEYEKSENESSGYDTDVPYATQYKNDKRKKKETPVVDVENAIRRLRWIILGYPYNWYTKEYDFNSTFKAVPLELNLICKTLSEMLGFGPYEAEAGIVNFYQEKDTLMGHVDRSEKNMNAPLFSISLGQSAIFLIGGKTREDPVIPILLRSGDIAILSGESRWFYHGVPRIIENSLPEYFMDCCDCGNEENHSFDCWKLVSHYLKGTRLNINIRQVN
      >gi|Vcar1000005961|ref|jgi|Volca1|121547|estExt_fgenesh5_synt.C_620028
      MGNSICYGGQRGISESDENVAVGATAVPDREANNTATRQAAELENGGDNRSIGARSSSAEVTPSEEAGVRNMQLALLTQTNIEPHPDQPDRSQVPIPRSSVDQPESSGSQRQRKLMITTINMFPGPTLAQQNFILQTPEEMIEDVQVLSAIGAGGYAVVYRGVYQGGDVAIKVATINPDHAGWTEGFVACQLRHPHPQPGQLPAGRKVQQMDELPVGPSEDGLGDPRKTSHDRQGWKDVLARVGAAPNKGLLMLIQEYCDRGNLGKVIRSGIFKTAIGRTSSTTSAEKGQLLARRMLLRTATEICRGMIHLHNASVVHGDLKPANVLIQSSNKDRRGFTVKIADFGLARLLQKDTSSVESEASAGEAGAAAPPLAPVVPVPVAPVGQPVIPIAGAPAAGAMPDPVQPVAVPAAVAVAQLPDAQMVRNIPAPKLPVATPVEPDRIRAFVADVRDYFVLVGWQANLPAQKLFISGALEGFFKEWHVTWTKSVPDYTPDQLLDAFLVRFAPEMYSRTHVARTTFYSAAFKQELNESVLTFAGRFEALLQDIPDMQEPTKLWHFREKLLPYLREELHLPLLLLWRHPLQYTLDLSRRRPSLPLFSRPTWVRAVVLEAVAGVVDVVVAQQAAWPDAMVQEGVADREAGVMLTRVEAVVPEEEPPQVAVQRLPVAGIAPPRFVRRPDGNCLWSKYWQGLGRSVHESEVASIATIMDREFTLDACASDCGLSAVCNAFSCTARPFLDTNVAGHTVWMAPNAADLPAYVTHYRACKPLAPQSTAACILVPSGTEPSLLKGMKLVRRYPVGTSLFYVPDVQGSRALLPPITEVMEVWYDGPDSTEEIPACTAVGSAVPHLAVKISGSTFMAMLDSGATHSFVSEALVRLLHLNVLPSTFTYVRLADGGMSPIVGQTMLKYTLDLSRRRPSLPLFSRPTWVRAVVLEAVAGVVDVVVAQQAAWPDAMVQEGVADREAGVMLTRVEAVVPEEEPPQVAVQRLPVAVYAFGVVLWEMLCGTQPYESMPVGQVMLGVSFHNLRPPWPESHWPGLCALGRSCLAQLPEERPSFRELEKQLVALEEEVRVESLRHTQNIKRDNANQPNHPTASSSSSSRSTFTTPTPATSCPNTYNITSAATTSTTAAAGAATATASTEAVGPAANAPTAAAGTSTSPFAAAPAPTGRVLAMAAGEVTAGGGGGGGSTPGYVKHRSSLELRRAAAAGAAAASAAVAAVSSAAAAAAGNADWTPSSQPDSTAAVETAVVVKRFEDIEDKGEREDKAAASLVVHVDTPAQEASVAAAAAAAATAGSSAAGSEPQSPWLTTTQMSELMRMEEGRESEAAAETA
      >gi|Pram1000005973|ref|77822
      MHIKQVVVCGFRSYKDQVAVEPFSKQHNVVIGRNGTGKSNFFDAIRFGLLTSRFANLRPEERQALLHEGSGKHVMSAYVEIIFDNSDGRLPVDDVEVALRRTIGVKKDEFFLNRKHIPKSDVQGKVNALAVMRERERLELLKEVAGTKVYEDQRTKALKILHETQAQRDKIQEVVSYIEERLSELEEEKEELQEYQQLDREQRALEYTMYEKELQKVRAEIEALDRHRQEEGALATDLHEKLMHVRAEINRIESAHRSRDQDLAQLVEDRKSREDERNGLMEARYKLEMEMKELKEQIRSDGVQRSAVSKEVEVVKREIAAKRARLTNEILPALRQAEQTHDQVARNLQECRAQSEHLIAKQSRKSQFLTQQQRDDYLQREISDIESLVRRKESDTASLRHSTEGLARSIEGSDRTLQEQIEELREHRRRVDAVGAEMLRLKEQRNYLNEERKGKWREENQISYDVRELTKKLNDGENALQSTMAYDVRRGLQAVREMQGRIRGIYGPLIDLVRPVDERYCIAADEAAGGSLFHVVVDTDDTAAKIMRELDKKNLGRLTFLPLNRLKVKEHFDYPHNDDVVALVEKLEYPAEVRKGVMTAFGKKLLCRDLDACVRYAEQTNMDCLTLDGDMVHRRGALNGGFKDLRRSRTRAMMEVKQAQLDLESVTERARRVNTEAQQADQRVTGVISEIQKLEAEKNRSISAHKNLCDEISRRKNHIHSEKENLAQRERSCELQEREVKDLAAKITSLRSELLTPMQDTLTAVEQELLHSLSAKISLFEAEERDQRQRLEEIRSREEGIKTVLEENLVRRENELARQLGEGIEELAISEREEALKAKQIDLEDASRLVDDNNSSLKDIERKIATLQQEITNENVQVDALNGEDVSLSDQIEQEARRAEKVLNKRRRLLKKREDSTKDIRELGTLPISELEKFKVLSYQEVIKQFKRRNEKLKKYSHVNKKALDQFMSFNEQRETLLERKREIDDAYSSIEDLIDVLDKRKDEAIFRTFKGVAGFFSEVFRELVPTGEGKMILVGADTDQSNNGTDGGDEESNVDTYSGVQIKVNFRGEGDSYLMQQLSGGQKALVALAFIFAIQRVDPAPFYLFDEIDQALDSTHRAAVAALIHRQAHSKDSPAQFITSTFRPELVNIADKFYGIGYQNKISNVYSMAKEESLDFISNIMAEEEGVADWSTMADPLEAILASMAAATGNASPPTKSATKTKDAAAASTGKKRGREVSSTPPSGIWNPQEEMPLDSAALDTLQDAAKTWSVSPAMEITRQQTVRVLCNKIRRASEDLGIGKLPNSAYETWQLTSQLTVKEQDPLIPHASSDYSGLFEELRKAGATKSGATRKCKELTKEAERMLRKFGQQDFVAGKKKKVHVADAGDDVRQLTYGNSTVKLSASHFAKLREMYARKLGLSGNGSSMAPKDQRQFESALFCLLLRYDSLDGGGFQVREAVAALNEECFDVLLKEFDCKMECFASPLNCRYSRFCSAFLDTDFAFGSVGSFFDFSPRSGSFEANPPFIPKVIKRMADHMTELLNAADGPLAFIVIIPAWHETEVLIIALGWQQLNSSRFNQRHLLIPQKQHGYCEGKQQIRKTRWRIASFDTSVFFWQNSKACSKWPVSEKKLESLQRAFKSKQADERDALGLRKSGKRARVAKD
      >gi|Lgig1000006095|ref|jgi|Lotgi1|133518|e_gw1.85.31.1
      MNTKQKRSRVQGGWAAPKKQTARSDKDRVKPPNVPVWTSKNIDQVSATPQFLYQQPDEEIEYQPAPRQNCKGGVYDISDGPSGISRLRFFPNFINQSDANKYYDMLYQGLPWKQNSYVKNGVTHLNDRLTAWVGDLPYSYSGIVHPADPSWIPPLPTLKDKIEDLTGYKFNSLLGNLYRDDKDGVEWHCDDEPELGPQPIIASVSLGDVRNFELRQKPVPNTGDYTFSQIVRMPLTHGSLLIMEGGTQDDWQHRIPKEYHDRGPRINLTFRVIYPKP
      >gi|Fcyl1000016119|ref|jgi|Fracy1|233985|fgenesh2_pg.2_#_93
      MPLWAAETRISIRSLASSGSIILPDPEQDAARMAALKRLRHKFVRLCSENNRSKPPVLAFERWLGRASLKRGISTSDGYDPIIPADSVMDKGFAKDISRTLPSWAAANAVAEEMTKEATKQIRGMATQREEIDEHKDLRVLRKKIREEAALNESTTKAQQALGGANNSAVGGGGKVVLNGNRRDGIYDVMLCGPCGKPRRPYLTISSLHLSKLLRLWKLKNKEGNDDDDDGNQIEVIEVENPIDNMNALLEDDRIMFTKSLYCCLARYEGLKGAGYQCAVPGVAFDAAIACGLGSTIECFARRYKQISRYQQLVLSFQVPNSEGCRHTNNVQALAVLWYVNNREKSIISKTSI
      >gi|Vcar1000006131|ref|jgi|Volca1|62740|e_gw1.31.78.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDGNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Pram1000006143|ref|77618
      MAKQPKRSASPQLSGGSPTKRAKLSTLPSEEEFSLTSVMANVKKAAVEHAHLSEAVVSKLFDKDASVSYLTTDHRSWIYHVPRWYWHVFEHAKPEELQATATCSPVTWSTMFKQAWEAHPKEHDTIMMFGKPTKIPRFQLLCGEMPSYRYSGKTFEAQKRFPQGLEHAVQHMRRMVEDSVTQQTRLSGGLVNWYENGNHYIGPHADDERDMMACSPIVALSLGATRRFVLTKKTSKSAPQGDGAVTRLELQMEDGDLTIMGGTTQRTHKHAVPKMARCREPRISITLRCFH
      >gi|Mver1000006163|ref|MVEG_06150T0 | MVEG_06150 | Mortierella verticillata NRRL 6337 hypothetical protein (222 aa)
      MATIPGLEVILDFITEEEEQQLITELDAGHWAGRGIEPNPEMKRRHQHYGGVFSYRLRRVVGDMEKLPGMFDFITERLLQRRIYDRSPNSIIVNEYEAGQGIMPHVDAPKLFGKTITALSLLSACVMTFQHVKDPSQIYHIHLPQRSLVVMNGSSRYDFKHSISKDLIEHVDGLEIVRARRVSITYRDMLVEDRQQDRESDEAGSSCKELCGNGISSCTRS
      >gi|Spun1000006275|ref|SPPG_06620T0 | SPPG_06620 | Spizellomyces punctatus DAOM BR117 hypothetical protein (232 aa)
      MPKRLSATTNDTNATTKRPRTIPPSITPASMQYTLSHATGHLIHYPSFFSPTASPTLFTSLRTVLPWKTIPITIFGRQIDQPRQVCFIADNNTYYAYSGSGKMGTMLPDWPDALKEIKEKVEEVVGEKFNSVLCNLYRNGSDYIGFHSDNEKSLGPAPTIASVSFGASRKFVVKPNPGGPAREAGKLELVLADGSLVIMKGMMQKYWKHAVPKTSRKVGERINLTFRKVVS
      >gi|Fcyl1000106316|ref|jgi|Fracy1|180788|e_gw1.2.998.1 ; gi|Fcyl1000078436|ref|jgi|Fracy1|152064|gw1.2.998.1
      MATNLKRNLNLQFLTSISKKGVQVCSISSTTSTTISSNHARIPDPSFVNTLNAPFDFDASCAVVYNEYITSTEEDLVANDIKNKMKRRRYEKGHWDAVINLYKEVEIADDSFQEEGDEKNNYSEGIPKLFNRIRQQLAEHHLTDYYDDQESIHWLPCHAIDLKKDGELNAHVDSVRFSGDLVAGLSLLSPSIMRLIPCDDNDDDDNKNSENSTKDEEPYYVDMFLPPRSLYVLTGVGRYKYSHQLLPDGSIFHKTDTDIVVRRDHRLSVIFRDSKQPSS
      >gi|Fcyl1000016334|ref|jgi|Fracy1|234200|fgenesh2_pg.2_#_308
      MKYLLPRPMATNLKRNLNLQFLTSISKKGVQVCSISSTTSTTISSNHARIPDPSFVNTLNAPFDFDASCAVVYNEYITSTEEDLVANDIKNKMKRVWQFLFVDYCQLTSNGICSFCDSAALKMNDESMKKKSRRYEKGHWDAVINLYKEVEIADDSFQEEGDEKNNYSEGIPKLFNRIRQQLAEHHLTDYYDDQESIHWLPCHAIDLKKDGELNAHVDSVRFSGDLVAGLSLLSPSIMRLIPCDDNDDDDNKNSENSTKDEEPYYVDMFLPPRSLYVLTGVGRYKYSHQLLPDGSIFHKTDTDIVGCEMVLQVLTSNCTYFHCDSNYLPLGITLLHGVVLAVLVYYMATTPYS
      >gi|Sarc1000006366|ref|SARC_06218T0 | SARC_06218 | Sphaeroforma arctica JP610 hypothetical protein (1129 aa)
      MANKKLASQSVQAGDESHSHSKPKWKHAKDERLVKRKTIEYNKATDVAHDGLETGPVKKRRKKETSRPFSVETNLRHQQSSESAAKPSKIDKQSEKVKKAKKDKKNKKTKKFKEKDKKNKKSKKSKESDDDGGEGGMHSDKATDGSVTEKKQKERKAKKRHRKHETVNTHTEQPQSAESDTSKDHEVVDETAEATFTSEAPPLGCVWNPNTKLSLETWPQSSHQKNVSTTSRVDPNLEIYRQKAVNRLQYQLETDCRNNRIRFNNVFFENWIFNCERASKSNQCASEGIAPDPVIPSQPTADGLVADLVGAGMQKTVAQGVARKLCAAAARETERMARTTAETSARARSVDCVEVVITGKKAREALLERATTDLRAFDLTYNQMTVRINKAHHDKLRLLHSRNAPQSERGDDSALNSRVFSLLVRYHTLQGGHVQGGGMQAALIEDTFDALLRNFGVNFECFASPLNSRYGQYCSMFADTDGPFGSVGSFFDFYPLSGSFEANPPFEDGVIHRMAMHIDVLLDRSDRENKPLSFVVVIPAWAESSGWQRLNQSTHLKRLLTLSQRDHGFCEGTQHSRPTRYRISTYVVPIVVGLAMTGDFFAYWVISAVMVGMVLSLVRLFMEFLNFVILAIRHVRENWRRLLVVRFKPVPLSEVVKSYAFFAYEHKLLIEKRVTSWIFTYRLQKAFRYFGGSYWALSILIVIQFVFVGLFEDSQGWIFLCTINYLVIYVLIWRKAAPWSEVAALAETEEDSQSTGDANNKTTEVPQTSAENLSAKPSRLARMASRVAGQDRYSTDLEGCEYIYRFIPDYHVRNRKTWLSLLLPILFISQSIWLCVASVEHQGGVAFIVAYLTVFYGLFWGLFYCFIMALWLKSRKVVRIVERREIAAEYVRIIKAMNEYLIYILMFVSFIGVVCGIVYLTILAEGTFLDGFLLFLAVFLLFSVISTITTKAFRRAYPMEAFWFLFILFVTVGVVLSVFSYSLQQNDDNNDTSEPISLVSAPIAPSPTSFASCDVTFGPTAMSIMDMAYMSKVAYAPMEVVQRELNTWFNANQTDGTGWYIHYNATTDGYPYFYDIRHNTTGMAIISVRGTNSLYDVVMDMDLWLEIAVLQMTDFFLPTLSLWPVGMSL
      >gi|Fcyl1000026428|ref|jgi|Fracy1|244294|fgenesh2_pg.13_#_88
      MSSSSSTESPPLVLTPPDISRSVIGRNINSRWISFIQNDELSCYSFYLPHYDDDDTSSTVVNTVGNNSNKHNDDESTTRTNPDNPNPDNPNPNPNPDPNPKTTTTARTSSIFTSKQLDEWFYKLHPSRYNNIDNDDDVDNASSASSASASAWTSASYKNQLLLRKTAWYTFNKECVCEYGYSDTWQKQIQSKEMINVLQEITEVVSSNLGQEEEELNCVNLNYYPSAGGIGFHADDEYMFDGLNRNTKIISLSLCCTTTTTSTCLHKPNNNYKNKNWGARKFQIKKRDNNKSESDDDDDNDDKNVVHEIILRHGDIVTMEGMFQKYYLHSIWPGDDDDNSIDYNDDDLCQGERINLTWRTIVKHLGSGGSSSIEEVEDFHGITCPLSLSLSSSSS
      >gi|Pram1000006453|ref|77246
      MSDVGESAANAQPSLTDFVADEQPSKYLKLTMSRRIKDKAIAEPKLLPFLLQLLQIDPVPVVSSLDADSIVKRVLYGTGSRRVVLLELSDVSVAQKLREQLHEKPCELLGRRPMYAEFALPRKEQEARDRLLRAHNVQRNADPPGLRVPGLRFEAEFITKEQEAACVAFFERENGAHWANTIRARQVQHFGYEFNYDTRRCDPDEPMKEPIPEVLQPIMEKIAQSGIMDGDTPDQITINEYLPGQGIAFHLDTHSAFTTTIASLSICSEVVMDFRHPDGVRYEGVLLPARSLAVMSGASRYKWEHAIVPRTFDVIDGKQVNRQRRVSITFRKVRSGPCECPFPKYCDTPEREGQENTGDDDQEAITTAEASSLAPTALEQQFVHEFYETVAAHFSSTRHSPWPRVAEFVGSLPSGSMIADLGCGNGKYMKCVDAAQSFVVGGDRSSRLVTICRDRGLEAVVCDALAVPLRSNSCDAALSIAVLHHLSTLGHRLAAVKELLRVLRVGGRGIIYAWAHEQMKGSRRRFEEGRQDFMVPWNLDKRFVISNEDGSTTTETAEASQDVEPIEEGSQQDPSEDDGANRDDNTCDKSTAKVHERVLVQRYCHMFKQGELESLVGLAGNAEVEKSYYDESNWAVVLRRVS
      >gi|Psoj1000016465|ref|144652
      MLGSRYLRRSLAPLRRLMSSSAAAASPASSLWQDVYNLDATHCHDPLVSEGDLQVLLDVITEDEEMVVADECARILKRRRYEEGHWDNVIIKFKEMERSRWSAETQRVLQKVREAAILPKELNYFPAVHVIELAEDGYIKPHVDSIKFSGRVVAGINLLSPSIMRFKEEHGDSVIDAYLQRRSMYMMTGRVRYHYTHEILPGAQVFRGELPVNRTHRISIMLRDEFLEEHVAKYHTPFAKPDVETQ
      >gi|Psoj1000016470|ref|144657
      MSDSSSDEEDFFGRMESDDLFEESEVQQEKRREAQRYVEQYAERDWGLAARQCRAQGTNKDLVTESTLELRADKKVVFQEKQGQQAKVWDCALVLSKFLTNDAYFAPDFFVNKHVIELGCGIGVPGLAAAALGAKEVMLTDMDMAIPWIQVNIERNQTLGCISGDVRAEALMWGENAPLESHQFDVILCSDLVYGERKISEKLVQTIAKLSHPDTLVISAHEARFAGDRGGSFFELLSEQNFEVEQLAEDGYIKPHVDSIKFSGRVVAGINLLSPSIMRFKEEHGDSVIDAYLQRRSMYMMTGRVRYHYTHEILPGAQVFRGELPVNRTHRISIMLRDEFLEEHVAKYHTPFAKPDVETQ
      >gi|Spun1000006493|ref|SPPG_06848T0 | SPPG_06848 | Spizellomyces punctatus DAOM BR117 hypothetical protein (383 aa)
      MAADASGHSTGSLSNQKRILREAHKLARKRATQLDIFAKSENLPLSELVVDVENGQDPTRFIVILNAGSTGDVGGVSVDDLERAFGAFEGFLGVEMMLGKPYSYAVYNTPTAAFQAYTVLNNSPIPIPHSNSKPLLLTFTTRVSLDIALSDPKDVMEAVPGLFLLHDFVDCEEEQNLLACVKDNANSWVSLNKRRVQHYGYRFDYPLNTVDFDTSIIDPIPPWGQEILSRYCRTFPLHSTPDQLTVNEYWPSAGIAPHADRHSIFGDVVIALSLGSGVVMEFRRPEETPSTTVSNPCKPFSYHCINIHLPPRSLLVMSGNARYQWEHSIRPRRTDIVHGRAIERGTRVSLTFRNVKRVKGCECGWTHACDVGRDDTGRLDIG
      >gi|Fcyl1000046637|ref|jgi|Fracy1|264503|estExt_fgenesh2_pg.C_270122 ; gi|Fcyl1000031649|ref|jgi|Fracy1|249515|fgenesh2_pg.27_#_122 ; gi|Fcyl1000011937|ref|jgi|Fracy1|271662|estExt_fgenesh2_kg.C_270020 ; gi|Fcyl1000005027|ref|jgi|Fracy1|220849|fgenesh2_kg.27_#_20_#_0_0_CCUX4586.b1_CCUX_EXTA
      MTARNSDDAIASLMQEMGQQRGYDQYDQYDTTTATTATTTVVQQQATKLNNANGNTTATTTTDDDGDRQLLLRQQQQQQQQLLIPELVDYPNNQFMAGSQSIWNPLVMSSDGSGSSDNGSGDRYEYENYGLILPTNPQIELIRRRVYEKFRNEVTILLQDIEQKLLPSSLIGGGGGGGSFSKNKNKLPIPSMLDKWHMDSKLVEWNILKQQQGKHDDISGSISTINRMTTTSTVDILRELTGRKQQDGMTPVYDPILLNKQSPTIFVDSILKPNVEKLWEQYHYHNNGGSNNNNKGNNNNLNLPPKFNKKSKQIHKGLYRLACEANDSFELQLHQVVRQESSSSSSSTTTKNKKNNNRLPKISLSSSSDEGSSGGGSDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRQQQQQHQQRSQQQYQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCKKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFEVEVEVTPPSVAVTAVAVAAAATKDEKEEGICYQANPPFCEGLILQLNNKITDILLSSQQHQHQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLWNLQKTDQKQKVSSSSATCSTDDNNKSNKNNGDANDDILPLGNAILDELKNSFCQDPGSMQKEEQQQQQQQQETSRINKPRRSNLVTTNTKSTTSTNPSNVVTSSTHDNKPSENTTSLSSFSVVQRKRNKKATKLSPAKKERKRKWNQQEEGKAQLNLLESLGLSSSAATIATTTKSSQAGSRTVSAEDNNNSSKKLNKLRPGGKTNNKEKSLPRKMKKKRPHR
      >gi|Bcir1000016642|ref|jgi|Bacci1|264733|estExt_Genewise1Plus.C_7650003
      MDWKDLFGDEEEEEDNIEIKGLTHIKQALDHDQQMLLVNQLIEHGYFSNTNQAMCFGELPSYINWLIPWISTNHPTLFPREIMERQPLFDQAILNLYKKGQGIVSHVDLLRFEDGIIIISLLSSCVMTMRPANSNKKGYNQHQEQQDTLRRDILLRPGDILALSGEARYDWEHGIEEKLIDEIDGQIIERGTRISVTLRRLNE
      >gi|Smar1000006659|ref|SMAR007432-PA pep:novel scaffold:Smar1:JH431789:114970:119470:-1 gene:SMAR007432 transcript:SMAR007432-RA
      MLIFQDFISKEEENTLLMELEPYLKRMRYEFDHWDNAIHGYRETEKTNWNEENKNIIRRIKKLAFPEDTTTLQHIHVLDITKDGYIKPHIDSVRFCGNIITGISLLSTSVMRLIHEKNPEFKVDVLLPQRSLYIMKNLVRFEYTHQVLADKDSFHRGNHIPRERRISIMCRNEPSIS
      >gi|Vcar1000006689|ref|jgi|Volca1|45981|gw1.75.44.1
      MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAIPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVCRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA
      >gi|Pram1000006748|ref|76882
      MTAAYKASEKRWKRATDEDLQHDATLLDPRSLSEEQQAKVRCVGSWKWGLEDEERPVLAFDGFGASHRGFCVIPNALDGKMQLQFAHACLTEFAEEPHVTNMHLQKQQVSEIWRKASEAHPQNPAESPLLTKLCWAASGYHYDWTARRYYKDSFSPVPELLQQLGARCASACGMAMAAEAVIVNYYKQKSSMGGHLDDVEYTMDHPVVSLSLGCRCVFLMGGHTKDEPPLEVLLRSGDIAIMGGASRTCYHGVARVLPTPFSEEFDTLPQSDDDEREEYEAVRTYLSTQRININVRQVYPSEPASTD
      >gi|Smar1000006760|ref|SMAR013901-PA pep:novel scaffold:Smar1:JH431789:635068:636630:1 gene:SMAR013901 transcript:SMAR013901-RA
      MAAGVVDLRHKLRSNQRQTATALRRYEKDETNRYHRAYKDKAPESKRNESETDEYSEYDSRLKTLRKLHSGIQQRRLFTDAECQVIEQQIDEVVENGEKGFYRPQTVDRAPLRNKYFFGEGYTYGSQLSKRGPGMERLYPPGHVDVIPEWIERLVVKPIVKAKIVPEGFINSAVINVYQPGGCIVSHIDPIHIFDRPIVSVSFKSDSALCFGCRFSFKPIRCTKPVLSLPIPRGCVTVLSGFAADDITHCVRPQDTVKKRAVIILRRVRSDAPRLDPSEMSPLDTEKRSEASRKRRLKSAIVDCNVKDENDPVKDVAKIPEERKSNKKIKIKR
      >gi|Vcar1000006797|ref|jgi|Volca1|81146|estExt_Genewise1Plus.C_190101
      MFLPDEFRNVENMLGRQFTFDAACNNSGDNSLCTRFASPSNSFLTSDVSGEFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARKMGLAIRPCSTSSVCVANGSSELIEGLIHAKLRIASFHDTVKLLVLKQANAGVELILGAD
      >gi|Ccor1000006818|ref|jgi|Conco1|72317|fgenesh1_pg.165_#_2
      MELNKEKWKIGKPLVWANTIQELCEGLDYFKSYQSSLYTNNSELKGLLLCDSIMPNDYFAWDVFITHGGGGMKASKEPGASSGTFVYRLSRSQEEDDTLVIPYLKLYKSQGRIPVILNSTTYDTGSVVDIPKYSVLGYYFITHTWIELEEGLSERCSRYKFRFERQESSYPRWWSVDKLIAEPDEIIKNEKCLVCNHYSPILYEIGFICFNKSCPRFYLHHGQVLPDYLAIKLGVLKIQFNNSESLNLPSLIPKNRLADQTNTLTDSIECKAYYCQDCYKLSSREYWHKWICTNCNNTIDIINPTISDIYSIKNDSIEFIMESFEMGMLFWNSDYFSIKRYYKGEQCRAVYKYDKNNSIEHITCHSYNKAFMDKLFLFCQNSQSGLDFKRYEVRKSRIRTRLLGNQFQYSCGVLYQHLLRTETIPLEEAPPIIRAILNYMQFYEPTADYNQITSLLYCKGQKMGWHNDSEEELGDKIISLSLGNPAKMNFKFKENDEKALSLYLLHGDILIMNGPDIQKKFLHCIQPDGFRIAFAARYLTRPRVV
      >gi|Smar1000006848|ref|SMAR015629-PA pep:novel scaffold:Smar1:JH431783:5867:6133:1 gene:SMAR015629 transcript:SMAR015629-RA
      MAALSIYFTLLAESYGKRLASTKFSYPLCMCQEKKTPSLIFFLETLFQHCSVFNKLTEEFGTPSIDLFASRLNFKLKIFYSWAPDPEA
      >gi|Mver1000006934|ref|MVEG_06923T0 | MVEG_06923 | Mortierella verticillata NRRL 6337 hypothetical protein (281 aa)
      MTAQLEWSINNSFFQHLDSTWGPHSVDLFAMAENAKVPRYVSWLHEETAWKQDAFSCSWKDLGRAYICPPWSLLNRVLEKIRIDRVQATVITPQWPAMIWYPTIRAMSTNEPIPQRSSDRPTTNLALTKTPAFYLFRSPDPVSILSDSSDSSEQSGPSSPLTNPENPSPIMATPLIPETFLKGLRPDTFNGRYRDTRAEDWLVRFERYCNAAHIPETGQDRILCAGLLLTDGASCWYDQLGTIAATTVNGQDLSAYQVFKYKFRQCFVNANDAEDAFDLI
      >gi|Bcir1000016958|ref|jgi|Bacci1|300658|MIX930_10_37
      MPSSPIPDSSTLQGQPKRQRVAISEEQVKQIVDILGSMLIKIKNEIMDIKEKYAPTAAVNFLLRRMESSDAILTNDVYCLMTGFMTIYLVFIELLFKDLSNNSHNTKINPLEDFMQLFSENDTLFGSDNPFLWYYLYARTNTSILEDLDKQLTIHFTSLNHIDSIFSRFYTVYFLGEESNTQKKHQKDHGQYYTPHSVIRFMWDRCLLPSSSSLLRNGNIPRVFDPCLGIGSFLCEFLSRLVNACRFNLDIWNDPRRLYSILTTEIPESVYGIEIDPFAFQLCKINMLIHLFPFYQRLLELNVQLHQTSRIIQRIRLFCNDTLKLDKX
      >gi|Vcar1000006954|ref|jgi|Volca1|88423|fgenesh4_pg.C_scaffold_9000001
      MVILSRVPIVDPVPTTSDTAQYVPSLQEAITSAVRAAMAELPPAPWSTAQQPHTGPYAPLPTITQPRPPYSNLLHVSRDFPPLIPRISTQTATARPVPVNVAFKPPPAEQVSHSLPIASNPARHVGCMRARVAAASMTEAHQAQISMDTIRQGSGLNSLVSYMAAFGRLMGDLPHRHELDHVFHFVKGLNRDLPTHLFRVASKPSESPTTCASSAWPSTSAYSVAPVKSALRPSSAETLSPFFSPSRPFLSTDIEGECVWMVPPVDNASTIVARYLDAKTANPKTSAIIVLPDRPQASWAPLIRHMTIVRRFPAGAQIVCRPTSSDPSSVLPPSPTALFKLRFRR
      >gi|Ccor1000007017|ref|jgi|Conco1|72468|fgenesh1_pg.176_#_7
      MVEVNEFDYLPGATVIYDFISEAEELELVSNIDKSNWGGNGQYPNPELRRRTQHFGYLFSYRYRQIEKYLGDFPQFLKTINKKLVNIENGHIDLNSIIINEYEVGQGIMPHTDSAEIFGPVISSLSLLSPCNMEFTPTKQKTLELQKSDQKIPKINIHLPRRSLLIMKENCRYDYQHSISKNSIEYLPIINDGSLSSEEFTRDRRISLTFRTMLDTESFDNSNNN
      >gi|Fcyl1000037010|ref|jgi|Fracy1|254876|fgenesh2_pg.60_#_27 ; gi|Fcyl1000126553|ref|jgi|Fracy1|201025|e_gw1.60.71.1 ; gi|Fcyl1000121611|ref|jgi|Fracy1|196083|e_gw1.26.280.1 ; gi|Fcyl1000087922|ref|jgi|Fracy1|162208|gw1.60.71.1 ; gi|Fcyl1000087925|ref|jgi|Fracy1|162211|gw1.26.280.1
      MAPPTIATFFSMAWSLMAAPECLSLATISVSQKKRVEVVEPGLVILRNFIDDEACQRIAAMAKDFGDEFYTVNKEGEKILNTGESRGRIYDAATRFPRDLIQLSNDAVSTSRAADTSMPAMQCTHVLLNLYTTSEGLVWHRDIYENDGKSDHPVVNLSIGATCVFGFKHLDTDEERTVELRSGDILLFGGPCRLIKHAVLEIKLDDAPEWMSYDPSRFSFTFRDSPEVLGREEEFKYFRVKEDLVGQDNFKVPTSSTDRKAFHGLPSYTTQQHVSMAS
      >gi|Spun1000007058|ref|SPPG_07458T0 | SPPG_07458 | Spizellomyces punctatus DAOM BR117 alkylated DNA repair protein AlkB (379 aa)
      MGRKKQKVLPPDSPLWANQTAFRVVERSWKRKEHAPDLLKTLVDPSRNAAIAVEAGLLKPVALSDDPRRISPHFGFSVEPDTPLPPAYELVNVPGLVIIPNMFSPSAQRTIIKQCLKEYTRHPNVTNLDTHWKLPSAGIWNLHELVRKGEIASGAEETLLRMRHDGTVEGALKNGYDSDVNEAGIKIDPPTTDTHHLTHLTPQDALLRLRWTSLGFQYNWSTKEYHLDRRPSFPPLIGELSTAIVEAVQGVTGYEARQWKAEAGIINFYGLKDALMAHQDRSEENAAAPLVSFSFGHSCIFLIGTESRDDIPTSVVLRSGDVLVMHGQSRLAFHSVPRILENTLPEYLFPHVDEAPDWDLFAEYMAASRININVRQVK
      >gi|Fcyl1000047105|ref|jgi|Fracy1|264971|estExt_fgenesh2_pg.C_350095 ; gi|Fcyl1000033484|ref|jgi|Fracy1|251350|fgenesh2_pg.35_#_95 ; gi|Fcyl1000012287|ref|jgi|Fracy1|272012|estExt_fgenesh2_kg.C_350010 ; gi|Fcyl1000005423|ref|jgi|Fracy1|221245|fgenesh2_kg.35_#_10_#_0_0_CCUX4586.b1_CCUX_EXTA
      MTARNSDDAIASLMQEMGQQRGYDQNEQYDTTATTAVQQATKINNANAYDNSTTTTTTDDDGDLDYPNNQFMAGSQSIWNPLVMSCGNSSDSGSGGSGNGYEYDRYGYGLILPTNPQIEIIRRRVYEKFRNEVTLLLQDIEHKILPLGGGSGSFSKNKNKLPIPSMLDKWHMDSKLEEWNTLKQEQKEKQQQQQQETQKHDELSGSDSSTINMMTTTSTVDILRELTRRKQQDGSNTMTPVYDPILLDKQSPTIFVDSILKPNVEKLWEQYHHHNGGSNNNNKGNNSNNFLNLPPKFNKKSKQIHKGLYRLVCEANDSFEIQLHQVVRQESSSSMSLSSIKNNKNKKKNNNRLPKISLSSSSDEGGSGGGDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRRRQQQQQQQQHQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCTKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFELEVEVTPPPVAVATAAAAAAVVTKDEEEEGICYQANPPFCEGLILQLNDKITDILLSSQQQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLWNLQKTNQKQKASSSSTSCSTESYINNGDADDDVLPLGNAILDELKNAFCQDPGSMQKEEQQQQQETSRINKPRRSNFVTTNTKSTTITRSSTHDDKPSENTTHSSSFSVVQRKRNKKATKLSPAKKERKRKWNQQEEGKAQLNLLESLGLSSSAATTATTTESSQAGSRTVSAEDNNNSSKKMNKLRPGGKTNNKEKLLSRKMKKKRPHR
      >gi|Bden1000007191|ref|BDEG_07173 | BDET_07192 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (367 aa)
      MILKPWQVFKRKKKMLATLRKLHPEIQIITTEDYNTVEITNTISVDASCWLIALNASYDSKITGIMLADLFGRCSGFIRSEMFLGAMPHSFILFNSPQSAYNCYKLLDNTLISQWNNKLLLLAQLDKCCDPLPLFPQRPITTASDAALAIPGLYLVPDFISVCNSLDLVCHLKSHWTSCPISTDPAWKSLQRRRVLHFGYSFDYSRNEIDRTVVGSDHAQLPHMPEWSVSILDQYTKLFPQYPFPNQLTINHYFPGGGIAPHSDRHSSFISPIVIISLGSGLVMEFRRKSSLSDPTYTTVHVYLPPCSLMVLDGDARFAWEHAIRPRTMDLIDGNVVERSERWSLTFRNLRELHDVCQCGYTHLCN
      >gi|Lgig1000007221|ref|jgi|Lotgi1|142409|e_gw1.180.14.1
      MNSNKSCGCKGIRSCLLCEDNKDKIENVSEKKEKVTLKYCDLCKKAWAWGFHPNHTGDSRDFEGIYLHEEFIDEILEAELIDEIDKTVYSDSQSGRRKQDYGPKVNFKKQKVKYEVFTGLPKYSEVLYTTLTALPILKDFIPVELCNLEYIPERGSAIDPHYDDFWLWGERLVTINLISETVLYFINDQDSNIEVQVTLPPRSLVVVFGEARHKWKHGIHRDDIQNRRIAMTFRELSQEFEKDGKSSEIGEKLKDIALTFNGTAVGTIPKSS
      >gi|Spun1000007268|ref|SPPG_07680T0 | SPPG_07680 | Spizellomyces punctatus DAOM BR117 hypothetical protein (281 aa)
      MPRIDELFQPSSKRRRSADEKADDESAQAPASKLPKRDAPDVSPAINVALRRVIKKSPGLDLLYFKQFITKSMAKEFMTWCLESLNWYKVQYKIRGMEVNTPRFTTVFGIDETKSPASFYKQPPRPIPMPLQILKAHVESATGATYNFTLLNYYHDGSHSITYHADDESFLGPNPSIASLTLGGTRDFLMKHKEDKSKKEKFVLEDGDLLVMQGTTQKEWLHAIPKRATASPRINITFRKAINVAGTNNYYKYNVGDGASYRYINGKMVPSTDAMTGKAL
      >gi|Lgig1000007292|ref|jgi|Lotgi1|143038|e_gw1.192.7.1
      MESKKLTKSERKMNKKISRSRQLLLKHEKIETTEEVSNMLCVYNGGLIMGNSQEKIQDLFNPHGTIQDITMVPEKPYCFVCFERQVDAEKAQSILNGFSFMNGETEYKLCIFYVSQVPPSIGPTTALPPGLILIPDFIPETLEQELLESIDWSTGVDQGNVSLKHRKVKHYGYEFRYDINNVDPSCPLNEGIPSRCLKLLHQVNDLGIVNFIPDQLTVNQYQPGQGIPSHIDTREAFEDGLMSLSLGSQVVMEFRHPKGDHLSVLIPPRSLLIMTSDSRYVWSHGITPRKSDIIPASDGKFTLSQRGIRTSFTFRKLTENAGIYMNNNQPVQDKKEKKAVFSLPRNDEDAVKLEQQHVHQVYEEIADHFSSTRHSPWPQIVQFINSQPPGSIMADIGCGNGKYLGINENVYSIGSDRSYNLASICRFRNFQSLVSDVMCIPLRPDSFDVCICIAVIHHLSTWERRLGAIQELVRILRPGGQVLIYVWALEQQRHKEKSKYLKQDKNKLDSNENSEVENEYSANNMQINSQDSANNMQINSQDSDTGSSKNIGIHVNRTEFQEQDMFVPWQLKNKSVKNAATFHRFYHVFREGELEDLCEKISDCRIIKSYYDQGNWCIILQKL
      >gi|Sarc1000007323|ref|SARC_07150T0 | SARC_07150 | Sphaeroforma arctica JP610 hypothetical protein (147 aa)
      MSRVHEGTPESYEPEILCADSWKLFPDYIEQAQQLFGPNNVDVFVSPQNEQLLDFWIKDEQKGAFEHNLSLVDGCPPWQLIHKDGASATVCLPWLPRAPWFDLFKQLLQSEPVLVPRIPHTFLKFGKYACGKILGSHSDRKNRPAQ
      >gi|Spun1000007354|ref|SPPG_07764T0 | SPPG_07764 | Spizellomyces punctatus DAOM BR117 hypothetical protein (742 aa)
      MVGGGPAVWCETRQELCEGTWYFKAYQSGVYHRGGVVQGYMIDGFPGQRDAWSERVFISHGGGKNDPLRQRHSVRFEDPMRKEDLGEATLRASQEESDRSIRVLMNTWREKRPFVVVMGSEYTLAGFQASYRYCVLGWYVITQTWAEKEFPSVLPQEPNAPDYFLRYKFAFQFLPRNTIKPWFMTEECVRPSTRHTYNCALCGKDSPKVYTAGICLIPDCERFWKIQDGEYWSDAHELEIDPEFLCPVQLDSDLETLIPESIIPPPPPLNIERFTPCALWCRRCGRITCREIWDKWICASCGYLKTYPNTPVPCATLSSTSTQWKDHLGCLAAARDSGIKRYVDVIDQGICRGLIRATYVLPTGGRIEHILADPHHKLAKIGSTIFEQYQTGVVPLRRWRFSAHRVKGFLTQNFSHNAGEAYRHAISMPYSHFGSAPEPVRRARSILQAVSPHTQINELLSIFYMDGQKMSWHDDGEKEVQGPIIGWSLGSDSIMRFRRKMKKKKCIFDIGRAKAAKQRRSARPQENRLDSKPVQTESMEPIQKKQSEQSTANIENSLHTISGPVSAKIDEGETHIPRKTVLKLVTRHGDLVIMYGKAVQTHYEHAVEPQGLRLCVTGRTLRTLTPLSDEAYPADPPWSTSPGSVPLDLNLPKEGQPQSIDDNILGAARLMRLADPLWNSCSPLDMEFDEIFAQSDYEPDISSSDSERHDSVLDSETSDLERRFTCAGLLMRISRSLTVDN
      >gi|Ttra1000007412|ref|AMSG_08546T0 | AMSG_08546 | Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (330 aa)
      MGASAVDDEPWASNAPHTSGGPTIAEPVEAVPGLFFIADFVAPAESDALLDFLAACDRWKSMGTGRRVLHFGHMFDYATESVVPIPDPDCPDDVFMPAALAPVVASINALVTPDGSPLPGGPQAYDQVIVNGYEKSQGIHPHIDRTHCFGPVIAALSLGDDAVLTFLRDTQDGRVVYDVPVPARSLYIMTGDARYAWRHGLDARTNAAVRTGVERVSITWRTVVDSETAAAVTPSANSLPGVIIDALRPETDLVIEGELVGRVVGRRGATVRALSGRLGSQLVVGTLAGDPSVGLVRILDRGTTSLEALQTELDAILDPFRSSGAAAED
      >gi|Bnat1000017417|ref|jgi|Bigna1|21597|gw1.33.39.1
      ESLFRLLARYHALSGHGFQAALNETAFEVLRKHLSVNFECFASPLNAYFPRFCSAFPDTDVPFGSSLGSFFTLPEDGSVEGSFEANPPFISEVMTAMADKMHRMLAKAESSSRRLSFVVIVPGWTDEPSWKKMSSSSFLSKKLLVARDDHGFCDGAQHQRRDRYRESPYDTAIFVLQTTSAREAWSFTDDAERDLRAAMAEAHPTPGAKLRRK
      >gi|Ttra1000007497|ref|AMSG_08656T0 | AMSG_08656 | Thecamonas trahens ATCC 50062 hypothetical protein (512 aa)
      MHTHDPPTAKASALGLAWRRLQLGLTAAGCGFGSYEEVAAALKENADVWDEYVSLVGVDLPASKCLTAALEIMHAALATEPLAALVAEACGRDGSEPFAGWEPVIRATGSETKWTLSMPLPEALTDALHNSVWVPKRRKRKDRFVPTKSLAVAINVEHVTKLAARYTGPRPAEGDVDVELLFLLLFQYNALEGGGFQAALPDPVFDLLAAAPFHADTEAFASPLNVTLPRYHSALPAIDAAFGSSGSFFDAMPDAGVVEANPPFTEPFITRMLAHMHKCLAAASGPLTFVVIVPAWRQSPAWLALTTSPHASRTAVLPAAEHAYCEGKQHLRKTRFRLASNDTSIVFLQNELAAASLTITDAHVEAIAAAFAAPLETSRTAHATGADSRAFRANRAQERKDALSSHVRSAVATSNWASLSKSMKGKGKGKGKGKGKDKDKGKSKDKGKSKDKGKSKDKGKSKDKGKSKDKSKSKSKSKSKSMDNDKDRKRRRHGDDVDHASSRQSKRSKRS
      >gi|Sarc1000007502|ref|SARC_07328T0 | SARC_07328 | Sphaeroforma arctica JP610 hypothetical protein (346 aa)
      MPTATQNILSVQVLQRHNYVIYLPSEPNKFVPTECHILQNDELLHRNGQYTVANFTATSLENWHDRLGHRCDPRNTLTTSNTTPTTPAKPSHDDWKLNPTLVKTHVTDVFGEIGIDLFAQKHNAQVPVYCSLEPDAPLHDAFKQSWQQPNTHLYGNILSVSKHGEMHFSSLRRKMVLSLRGFRFNLTRATTTKRNFSYTHMWNSSKPPPTRSYRMALRIPDRLSGDESTLPPVGTVFHVDLHTGLPTSPTGYTCNIKYVDANSGQPFVYPLRHNDDASATLDVFYTDIGKDYLANLRELKCDRGGKFVSKEFNSVNTLQHVKVTCIPTNTPQLHGLVERTNEQLV
      >gi|Uram1000007539|ref|jgi|Umbra1|252871|fgenesh1_kg.52_#_84_#_combest_scaffold_52_106915
      MQNSIISQYSSIAPHAGASISFNAPKRKWSSVLSSSDSPTPFHDNFLSQIEQHQPRYDASESITPVYKNPSALIVPLRFPTLDKGSARSLSRSSSSSQVPTRDPSPSTHSQETNTSGDRSSAISFSLIVGVLASEMMSVKEAILKSISSKLLEPCGQKWPVDEKFSSFVYQQLQTHLKKETPQGQNSAQLVRSDDPDVRNAVEFYCLLVAFMTVYVSFLEVSTSLMASKESLGFLALPFETIMNSFDPFDNIFHDASEDMYFWYYVEQRYFLGSVTSSICKELQVLNFTVEKPHQAEAILSSFYTNHLLRFAAQRHQKDHGQFYTPTSVVDFMWTTCMKDDADWVSNVLHSYCPSVLDPCMGTGSFLSSYIERIVQCLQERSTSWDNSDALKTMINSMCSNIWGIEIDHFVVQLGKLNVMLHIFPLLCRWMSITGQPLDFRLPRLNLFCNDILTLSLPPATNHFNNWEFEQLKKLRDPEVLKFEYMVTNPPYMIRKTGFISVPDTSLYDMSLIGGRGTQAYMYFMWICLQRCHPLRGRLCFITPSQWILLEFAKNLRSWLWKNYILDVIYQFEPYKVWPKIQTDSLIFRLRPRSAAHLEEACTLFLRHEDRTLTLDSVLSDYQSFIFKDPTTHTRISYRLTPATIDALENVKDCSFSSLTPASPVSEQMKQLTQDFIKICGGKNDSSGIQSPLLWNRGPNTNPVYALLVRTEWALKTFGTDVVEKWMRPAIYWNGKRENSTTGKSESKEILFWKDRDRLRVSHKENSPAEAYVPFRFEDLANEDKHYSMILVDSANASILERDGTDQPLYQYLKDAREKLQPKQIDKEIAWCPFRQCGIESPIKIVHPINFGYFSRSQPRQRFFLDTRRQCVTNQCMYFTIQPTTAIQDPLFYLGLLNSSTVQFFITIHCCYDQQGRTRFFAKNMANIPYPPSPSFELVAIMVKLVSRISSVRSMVYMVARERRMRILVEKLRQGRWDLSCTACDKSNHTSSEHTSPVGQPVEEDIRICPNKNICSCLRVASLLQYGVDQLSYLLYGVPVDTQLAVESELSIGTFTDFETVFPRQDAIIPAWYDRIIQYSDLILSSVDL
      >gi|Bcir1000007546|ref|jgi|Bacci1|282602|fgenesh1_kg.13_#_14_#_Locus8775v1rpkm8.87
      MIFNKDYKSQVPGLILIEDFITEQEEACLVSYTEQGTWSGLGIGPNPELKRRTQQYGHLFSYRYRKVMEEYGPLPDFATLLVDRIMENQLMPNTPNHLLINEYNAGQGIMPHTDAPALFGPSILSLSLLSDCIMQFTCQDQAVDIVLPRRSIVVLTGDARYKYKHCISKDLIETTPSGMTIHRDRRISFTFREIVSWEDTKKCNP
      >gi|Uram1000007702|ref|jgi|Umbra1|253628|fgenesh1_kg.54_#_164_#_combest_scaffold_54_109393
      MQILPGLTVVEDFVSPEEETHLVQCCDERLWSGLGISPNPELKRRTQQYGHLFSYQYRKVLQELGPLPDFVTPVLDRIADQNLSPPPNHLLVNEYEPGQGIMPHTDAPSIFGPAILSLSLLSACVMKFTNAETGRTVDVLLPRRSMLVMTEEARYNYKHSISKDLIETLSDGTTIERSRRVSFTFRQIISFTGECSK
      >gi|Mcir1000007727|ref|jgi|Mucci2|85076|Mucci1.fgeneshMC_pg.8_#_475
      MRTIDLSDKIPGLVLIEEAVTESEEARLIESVNKETWSGLGIGPNPELKRRTQQYGHLFSYRYRKVLEEYGPLPDFTDFLVNRIMEYKWMPRKPNHLLANEYNPGQGIMPHVDAPALFGPAILSLSLLSECIMKFTCDEQSIDIVLPRRSLVILTGDARYKFKHGISKDLIETTDSGIVIERDKRISFTFREIIAWEVAENAPCCHGNKQ
      >gi|Fcyl1000027737|ref|jgi|Fracy1|245603|fgenesh2_pg.15_#_440 ; gi|Fcyl1000118033|ref|jgi|Fracy1|192505|e_gw1.15.550.1 ; gi|Fcyl1000089800|ref|jgi|Fracy1|164179|gw1.15.550.1 ; gi|Fcyl1000088409|ref|jgi|Fracy1|162731|gw1.15.516.1 ; gi|Fcyl1000117801|ref|jgi|Fracy1|192273|e_gw1.15.516.1
      MTRSSMEPIAELSVADSRFLGFCFNFSTIKEVEQCQRTLKEQYPTAAHIPIVYKFSSNNNNKEGWDEDQEPSDSVGPGIMKEIIKKKQQQQQQKKSDDDDGSSGDDENKNKNNLVVAVVRFWGDTLLGVTCGRLPQCYQSIARLVLHRYCYSTSTNSNNNKIMQPFELEILNNIENSIYGLGAGDCELIVNIVPDDNDDDLLLVDKVKMELNFEGFMGAAGEVLPRLQNLQADLTQNLIPIYRYPGNYSGDSWKTFEWSPTSLIIKEAVEDNLLLPQTMNHCVTNYYRDGTDFIGHHSDKDLDLNRDGAIVSVSLGDERIFELKRRKDPKDITRIVLPPRSMLVLGPITNKEFSHSILQNVDSNKTRISLTMREVKTFKDLNTNRLFGQGVRNKSLQQLRKRHLIENCALFSGFCTLSALMVSKIKINNAINNTNTCLLMTGIFATGTLSVRLLTNTWYRQQEEREARDFFSKSSMSGTKY
      >gi|Vcar1000007748|ref|jgi|Volca1|100029|fgenesh4_pg.C_scaffold_102000028
      MPPPVPHISTRPGSELFKDGIDVIVSNSGSITNYQETIAAAVRAAVADLPNSTPWPTLPQPHPGPSQAYAPVPSFAHPRSPYSTPNLLHIARDFPTFDPKEPRADIVGWDILMRHNLDMAGVPVDSPEAAKIALAVLRGTTGDSLRRLNSDPATRFHSYHAVLQALTPLAPVIQHKLDAQLAMFELTQGSGPNSLQRYIAEYKRLMADLPYRHEKDHVLFFARGLRDDLREEIFSRIRHLGSYVRLQDLIDLALTISTGRDAAHRIRSDVIPPRTPPTRIAMVSTVAPIHSAAAPLVPANVALGPPPAEQVSHSVPVRFAVQRSTEPDTPPPPFNVSRLNQIATSTTGAQRVDSDWMLARTIFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLGPSRRRYLESKSTNPRTSAVIVLPDRPTAPWTPLIRHMTVVRRFPAGARIVCHRDPSDAS
      >gi|Mver1000007812|ref|MVEG_07803T0 | MVEG_07803 | Mortierella verticillata NRRL 6337 hypothetical protein (408 aa)
      MLQRLGFLVNNTKSVLQPTQILKHLGFYINTCKMILTLPKDKVRELKREATKLWTSTMTTIRRLASFIGKAQAAMLAVLPARLQTRHLVACKDCALASGLNWSDSIQLSKEALKDISWWRNHLQSWTGQSFLPQIPEMDLFTDASDWGWGIVLPYKVISRAWPPQESQHSINWCKLRTVLHAVHLPEVQGKVIQIHSDSTTTLAYITKFGGTRSTLLMDLACQIWTHCLQTGTRVMTSFIKSEDNPADRLSRALWLQTEWTLDPTLFQQIDKMWGPHTVDLFASRSNTQLPRFISWKPDPQALAMNALTVPWTQENSFACPPWALISQCLMKIQREQLTLTLVTPYWPSAIWFPTLKNLALHPPLLIQSQDRLAHSPDGTLLDWTLAAWCISGSGSRRKVHQRLLSK
      >gi|Bcir1000007834|ref|jgi|Bacci1|265756|fgenesh1_pg.3_#_153
      MYEWKLPRQWMNKIQIHWHSLCIDALASRVGKRLLIFWSHRMDLGAAATDAFMQTWPKQELFFHSPWKLIPHGEISQFKIKNVVPSDNGLSIFIDQSKEGGIKFTHIGYQVEQLAEYCLVRTWKLFMYKTKYLQHGPDAFLFLYYIEHHNNKFRPAKAASWLKQILKDSWIYTTLFQAHSFNFPTQYEFFY
      >gi|Vcar1000007858|ref|jgi|Volca1|105574|estExt_fgenesh4_pg.C_300114
      MALVPTDANRDRSRSRSAELPSNVHESPFTTLTPIPPMVNPVPTVPIVDSVPTTSVTEPYVPSLQETIAAAVRAAVADLPNSTPWPTLPQPHPGPSQAYAPVPSFAHPRSPYSTPNLLHIARDFPTFDPKEPRADIVGWDILMRHNLDMAGVPVDSPEAAKIALAVLRGTTGRRSSRDTPSGSYVRLQDLIDLALTISTGRDAAHRIRSDVIPPRTPPTRVAMVSTVAPTHSATAPLVPTNVALGPPPAEQVSHSVPIASKPARRVGCMRARVAAASTTGAQRVDSDWMLARTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDALKQHNRLHQPPLPPVDENTPTSRSTTLDMVKQWATDVGGGDCNSHDDSKGHLKAQPFWT
      >gi|Vcar1000007918|ref|jgi|Volca1|58624|e_gw1.11.237.1
      GDNSLCTRFASPSNSFLTSDVSGEFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARKMGLAIRPCSTSSVRVANGSSELIEGLIHAKLRIASFHDTVKLLVLKQGNAGVELILGAD
      >gi|Smar1000007917|ref|SMAR006464-PA pep:novel scaffold:Smar1:JH431701:250462:251748:1 gene:SMAR006464 transcript:SMAR006464-RA
      MVTLPENRILNIKTSISKILKLDKVTVRDLASVIGKLNATTLAISAAPLHYRGIQLLKAKFLRKGHYESTCSLTSEVKNELIWWMRNLSLCKGRSLTPKPLKLVIASDASLEFGWGAVSNGVSIGFKWEKFMLSKHINILELIAAFWGLKSFALNLQDATVKLKIDNTTAVAAINRLGSPRSPEATAVAQDIWSWAFKRNLTLVAEHIPGSKNCLADAASRGAVKDSGDWKLDSIVFSCVLETWGPFSVDLFTNCRNYQFKLFFSWLPDPLATGFNALNQEWCEGLPYAFPPFALISLCLQRLRKSPSLQELVLITPVWPAQPWFPLLWQLSTQFPLLFPLFDELILDAQNAPHPLIINGALQLAAWRLSSPSSKTQEFQSKLPNWWRRLGSNHQGADMICIGKIGAFGQENNFWTLFNHLQLPSQNS
      >gi|Smar1000007923|ref|SMAR006470-PA pep:novel scaffold:Smar1:JH431701:276342:276938:1 gene:SMAR006470 transcript:SMAR006470-RA
      MTSREVDWFATRLNYKLPKFCAWGPDPMAWKVDAFAQNWSNIYGYAFPPFSLIPRIIQKMNRDQADLLIVVPLWPA
      >gi|Sarc1000007962|ref|SARC_07773T0 | SARC_07773 | Sphaeroforma arctica JP610 hypothetical protein (360 aa)
      MPPKKSTLEHFFKAPPSKRTRNAASPSSPSTTTKSSTDSLVPSSHPTYQFAIPQLPNSITESLEHAIPSLDNGKRVDDQPHLDLVHFHRYLPRGIEASIFDFLRENLFFYRVQYAKKFGNKEMQINTPRYTTVFGVDESSFFNAQPGSDSGPNVCVECMGTGACNHGVDTRTTQVIFDSKSKCPVPKDRYSCRPRPIPDGLMFLKKMTEASTAQTYNFCLVNYYADGNDSISYHSDDERFLGANPAIASFSLGTNRDFLMKHKPDKKKPAKTENVLKDKPLKLTLDSGDMLLMRGAKKAKWLHSIPKRKGGESGNGRINITFRKAMVPGGTENYYKYNVGEGGTFKWSRKDKNMIPWTS
      >gi|Hrob1000017985|ref|jgi|Helro1|86209
      LPLEVYYVPDYVSEICEGELLSQIYNVSKTRWTQLSHRRLQNWGGTPHIKGMVEEKLPNWLAEQCSRLASLGLYGGKTPNHVLINEYLPGQGIMPHTDGPLYYPTVCILSLASNLLIDFYRPHNHHHHLQQESISKRSKMADRRVGSLYLKRRSLLIFRSEAYTNYLHGIRDTTNDHIDDKVLNLGPDFYGKDLKRAEARISMTIRVVPKTLNASKLFGKIR
      >gi|Vcar1000008009|ref|jgi|Volca1|45734|gw1.11.234.1
      RSVFLRLQKASGRVFTFDATCNGGSDALCPKFACSSSPIISHDVSGQHVWCQPPPKCVNDWLDHYSACKQRSPESTSAIFVVPKCTQFEQTFQKRGWTLLKEFLSDAHIFSVPKSGGGRTRLRSNTLVTQAWLDPCQQ
      >gi|Bcir1000008046|ref|jgi|Bacci1|215245|estExt_Genewise1.C_420006
      MRQASLTNILHRSKQVFSKLNHKRHSQQPLDSTQLYKLGNRLLFGRHNETQNQPLAISYFLKAAQLGNARAQGVLGFCYEFGLGVETDFVKSEAYYLKAAKLDDGLSMARLAFLRKYGRPNVKIDRAEAEEWTEKVRHRPNAIQWIVEAASLTGDPAAQYVLGVCYHDGISVAKDEQAAFRWYKASADQGNARGQGILGYCYGEGFGVAKDEVEAMKWYRLAALQGETVAIYNVGYCYEDGIGVDKNVDEAVKWYKLSAEQGNAFAQNSLGYCYEDGIGVQQNFKEAAKWYKLSAEQGYPWAECNLGYCYQNGIGTAKDDSSGAYWYRKAALQGHARAQHNLGFCYQNGIGIEKNEKEAIKWYRRSAERGNIFAYHSLGYCYQNGIGVKVNERESVFWYYLSAEENHAPAQLSLGYCYRNGIGVPKNEGEAVKWFQRSAEQGNALAQNSLGFCYEEGLGITKDMTMAVHWYTKSAKQNNPWAQCNLGFCYANGIGLEQNNVKAVYWYRQAAAQNHARALDKLGTHLLNGVGVERDLKTAFELFQKAAQADHVAAQYHFANCFEKGLGCEVDLTQATHWFERAALAGCRNSHERLRRLIVRECLLSPANSPLLSNDDDFGYGGLTIGYSAPAA
      >gi|Fcyl1000088053|ref|jgi|Fracy1|162353|gw1.27.225.1 ; gi|Fcyl1000086727|ref|jgi|Fracy1|160956|gw1.27.207.1
      FEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCKKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFEKEEGICYQANPPFCEGLILQLNNKITDILLSSQQHQHQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLW
      >gi|Smar1000008099|ref|SMAR006255-PA pep:novel scaffold:Smar1:JH431682:46249:48654:-1 gene:SMAR006255 transcript:SMAR006255-RA
      MAVCFNLSKSGSPDLSRVIHFNLTDHDAQKYEFEVYPLNYEMKHADIEEIGLLFIVNPFTAAGQRYWIQRCYLDYPNSPNVTNLTGKKHIWSKNDVHTWKYLRWTTLGYHHNWDTKVYSLDHHTPFPADLASLNAFFAKILQFPPYFAEAAIVNYYHLDSTLAGHVDRSEFDLEAPLFSISFGSPAIFLLGGQTKSEEPSAMMLNSGDVVVMSGQSRLSYHAVPKILSSEATPWLELENHEQERPEWIHVKKVICQSRINLNVRQAQTETISKSLLTT
      >gi|Chet1000008148|ref|jgi|CocheC5_1|32311|estExt_fgenesh1_pg.C_180212
      MVFLDLVLIRSSFHSVRQHVLPRNAFVASLPSWNCQRMQLNAFFIQKEHWRYQSMRKADLENDPDIFDLSKRNEFSQDWKDIWRPAGVIPATQIEAACMAYAGGKPLSAPVQDAQIFEHRDFPGLQVISRLLPPETQVLFTSCLMHRDLADPGHKINLQADFDIPYPPKPTSEPSRFDSNFFLRDRAAETDCLIPKSPDKQKPLNNEQFLYSKLRWLTLGEQYDWPTRSYAKHATPFPDDLSRLVTGLFPHIRPESGVVLMYSAKDFMPVHRDVSEQCQRALASFSVGCDGIFIMAKGEDDGQGENAPRSVAIRVHSGDVVHLTGDARWAWHAMARCIPSTCPPHLASWPVGTPGATPAEEKAYKKWKGYMSTKRINVSCRQVWD
      >gi|Bcir1000008318|ref|jgi|Bacci1|335164|estExt_fgenesh1_pg.C_1200041
      MSLSHVVELNESSMPSSPIPDSSTLQGQPKRQRVAISEEQVKQIVDILGSMLIKIKNEIMDIKEKYAPTAAVNFLLRRMESSDAILTNDVYCLMTGFMTIYLVFIELLFKDLSNNSHNTKINPLEDFMQLFSENDTLFGSDNPFLWYYLYARTNTSILEDLDKQLTIHFTSLNHIDSIFSRFYTVYFLGEESNTQKKHQKDHGQYYTPHSVIRFMWDRCLLPSSSSLLRNGNIPRVFDPCLGIGSFLCEFLSRLVNACRFNLDIWNDPRRLYSILTTEIPESVYGIEIDPFAFQLCKINMLIHLFPFYQRLLELNVQLHQTSRIIQRIRLFCNDTLKLDKDSNAFTNNIDSFESDCLSQLRDSSKLKFHYIVTNPPYMIRKTGFITQPDPQLYNEHVLGGRGTQAYLYFMWICLQRCDDSLGQVCLITPSQWTVLEFAEHLRNWIWSNCRLLEMYEFEPYKVWPKVQTDSLIFRVCKRSCVLPHIDKTIYLRHTSPRRIPLKELLACYGNFNINQNNPDIMFKCTSTNDHLSTSLSTILPTTSVLDRLTALTEHLPRICDGEGRKNNNNSNTSPLVWNRGPNTNPVYSLVVRTEWALPTFGLKACARWLRPCFYWNGKSSLSESGGGKEGQFWRERDTLRLEKKEFSAAEAYWPFCNLTKSKVSYYSMIMVNKEDAETLQREYDQWGEQSDSASLYHYLQEARIALQPNKKDPLASCQYNKSGAEHAIKLVHPINCGYFTRSQPRQRFFLDKSQMVVTNQCIYFTIKPESKIQDADFFCGLLNCSLFQFFIKSTCYYDQQGRMRFFGRLMANIPYMPPDDSTITCCVSRFSQGITTCRTWLYAIIRPTSNTKGLMERIRNNEYKLSVAELDLIKNMDHIIEPDSSLPAPLDEGHFPWVHDFVLSKRVSMDKVFVILLKTTCLFQFAIDQLVYCLYKIPIDLQLGIEDELNLKNNRMKELPPILENDANAWSESVIDFAKKLTLISNEHLIL
      >gi|Fcyl1000018322|ref|jgi|Fracy1|236188|fgenesh2_pg.3_#_923 ; gi|Fcyl1000107847|ref|jgi|Fracy1|182319|e_gw1.3.1101.1 ; gi|Fcyl1000089018|ref|jgi|Fracy1|163370|gw1.3.1101.1
      MVTPNNRRRKITGKKQQKRLTSYFQDPSTTNSSSLMSSSSTTSFRKRQRASSSSTSSSSSNAAKFRAQNGGSSFGVCPICQSTIAWHILESHASECHGRSKETYENDDNNNSKKVIARPGQSTMIATATTTNNYNTASSRCYNNDVIIGSLKGESKSASTSASTIHQLSPSSSSSSSATQQQQQQQQQLPRPHTFEPIPGLFVYEDFITEEEESMILYGIDAVDTLPWKLSKFNGKHIGKRWGVHCNLRDRRVDAPENPLPDIIQQIVLPKLKRLLFQKGKTKNTTIPNEANSIDYRRKQGHWLQDHVDDRKLSKEVIFNLSLIGDCYMTFKNIAKHRNIAVPKQRVLLKRRCLQIITGKSRYDFTHGITNSDLLSDRRISITMRESPLTKSKPKPKPITITKSATTTITTQERIPE
      >gi|Bden1000008328|ref|BDEG_08304 | BDET_08329 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (371 aa)
      MSNRDKKRQRKQQQQQQPANSQQWSNQTPFRILERTFKRRQTLLKDLQPLLLDLDLESPSTSFKTVELSPPLILSEPQLDVPGTTRVIELTDIPGLFILKRAIPPSLQRQLVQECLEKHCKVPNLSNLDAHYLIPNIGIWKVYQQSKRLNHAELIHPRSIENTETMESTTADGAMKIDPPTDAMTAPKSMSVDAVIHRLRWVTLGHQYDWTRKQYHFDRLHAFPDTIATITHQILAFTQELTGYSSSQWRPEAGVINWYHPGDTLMGHQDRSEVDMTAPLVSLSVGLSCVFLISPCESKDITPTAIRLDSGDVLIMSKSARRVFHGVPLVIPDTCPDYLMHGTDSEWNAFAEWMDHSRLNINVRQVFPTI
      >gi|Ccor1000008322|ref|jgi|Conco1|123389|CE35304_1103
      MKCTKCRMMGATVGCFNAKCPRIYHLTCCDKNPKLFLQGYIFYCPKHEAIENKKQTYEEYYHCDHCKNSLPRANTLGPFPDYSPDEWFTCQACVEENFFSGFDLCTECFTHKFKSIKHNHKANRFIRTTKEKLEVLLDVLANNKLTLRNDKLKGRKSLKNDKLNSIENIKDIAKPEDSAPKKRKIIKKIVQYQPNIHCSYCWSTSSTIWRRGYMGVLLCSKCFMNTSTDKNLASIATQSTSEVNEDDQDSEEIIIDVVNDLPSPPAQQDKFGYHGNYEDYIHQPYHTRNLPQLNCLKYDPSQIAESSVMANKAIHLETYGPTIYQAFSLDYKSTYYDIPGSAPRWASHSGSDYHGTWLPQTVRRAITRYTKEGDMVLSNFLGRGTDAIECFLLKRKCIGIDINPVAVSLSQKNISFALPPSLLANSEFKYHRPTIIQGDARNLFNILTNESISHVLSHPPYKDCVEYSNNIDGDLSKFSTNMEFCKEMQNVVNETWRVLKMGGRCTLGIGDNRDQCFYQPVSFDLLLLYMETGFQVEEIIVKRQRQCRAFGLGTFLCVKYDFLMFTHEFIITLRKVPITSRGSMSRNMKKSKFFQQ
      >gi|Hrob1000018333|ref|jgi|Helro1|102536
      MRGLMVAKLRQNYQDLCSHREGIDAPMESFNRWLLERATVDKGIDPLLPSQCSVPLSSSMFREVINDVPIRLSRPRRCEDARRMLSKYAEAAKSLVMKRGCNVENRKTVAWHADDTFSFMQKRGNATYDDYMDRLAHLKRECQPHVIEAAKSSVEGICLKVYSLSCVYVKKIYDKHLSVLSAAGVDVKTLPGPTSSSSLLLPHSLTHHHHRYPCSPIALSTPLSPNVTVEHHDSGEVVTLRLKLAAAGVANNDVMKINSMHFRKLEQLYMLNCRDDPKMEHFLHRTWCLLKRYNTFFGTKENEGFGLQGALPVSVFQCLNRSFGVTFECFASPLNCYFKQFCSAFPDTDGYFGSRGSILDFYPISGSFEANPPFNEELMEAMVDHFESLLSETPLPLSFIIFLPDWKDPPTEALIKLESSRYKRQQMTIPAMEHEYRHGFQHICQRKDLNVRSLHGTLVIFLQNDAGANKWSVNNDNMRELLYAYQLNNNNNNNGPTN
      >gi|Sarc1000008354|ref|SARC_08162T0 | SARC_08162 | Sphaeroforma arctica JP610 hypothetical protein (611 aa)
      MAAIAVPMSDLTKKGAVFTWKTPQQAALQKLLTLIMKRIKIYEPLPHKQLVIMSDACNVGVSAAVFVQIPPPGTAEWDEEQKCIAACKQYREALGLIPLTLHPQTPRLVISDQERALYDTTDKPETIYNPIDDLRTLPGFQTATPRNAASDLDWAHSVDISHRNTNDQTVKLTREQSGTLVHSVEFQHKGQSGPLAQDADQQGKEQSGTLVRGGVPQEDEQSGSLVHMNQVQSGSFTHTLDMETVSAFTHTLDMETVSANSPRDLRITIGKYRFLYPVAFYSSKLNPAQRNYGATDREMEESSKNSRIIRALNTINEVVMDVRYVQGPLTVFGDYFSRSTNHTDKLLQQKQRGRVEFGKATGAAASDTQPACALSFAPQQRRVTWGHIKVNQEHNGREALIYSQSHFEFPESYSLNLTWVDLIDQLFGPFDVELFASHEDHVMDVYCTADGSGTWGTDAFGVDLQTPSAVWFPLLMRIAQGLPLVIPHRDDTFLHRGRTISGLPPWEYTLVVILQGGCPKPPLLPDTFWHKIGLQRAPAFLHSLGYPLKPCRTMGTGVPGIPAGAGVPVATKTWKWLSTKKAPTPLHYAGDDRTPQQWKFGPGKRSHLEH
      >gi|Caps1000008447|ref|jgi|Capca1|112863|e_gw1.16.157.1
      QVPAAIYYIPNFITEAEEESLIQYVNSAPIPKWTQLSNRRLQNWGGLPHPKGMVPEKIPEWLDSFGQRIGQLGVFDGQMPNHVLVNEYLPGQGIMPHTDGPLYFPTVSTITLGSHTLLDFYTPLNDRSSSFDDRHFASFLLERRSLVLVREEMYSRMLHGIKEVETDTLCEKVLNLDSSEHSLGDTLARNTRISLTIRVVPKVLKAKLFFGKKK
      >gi|Psoj1000018446|ref|125662
      MKEPIPEVLQPIIEKIARCGIMDGDEPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSICSEVVMDFRHPDGVRNEGVLLPARSLAVMSGASRYMWEHAIVPRTFDVIDGKQVNRQRRVSITFRKIRSGLCECPFPKYCDTPGRDGQEAAGDDEQGSVAEETTSLAPTALEQQYVHEFYETVAAHFSSTRHSPWPRVAQFVSSLPSGSMIADLGCGNGKYMKCVDAAQSFVVGGDRSSRLVKICRDRGLEAMVCDALAVPLRSNSCDAALSIAVLHHLSTLGHQLAAVKELLRVLRVGGRGIIYAWAHEQMKGSRRRFEEGRQDFMVPWNLDKRFAFSTEESSTETDAAETSQEDEQAEESPKDSSEEGDDANAGDRSSSKVQARVVMQRYCHMFKQGELESLVALAGNAEVEESYYDESNWAVILRR
      >gi|Ttra1000008560|ref|AMSG_10032T0 | AMSG_10032 | Thecamonas trahens ATCC 50062 FATSO protein (561 aa)
      MSNWAALQRELLGSGKSKDEAGKSKDKGKGKNKSKGKGKGKGKGKGKGKETLTGSSGQDGRGAESGKRARKRKRDSNGDGIGDARDKRCGSDGKSNSKKTKKQKRGGATSRSEATLHLHLAMALGLPYPIEALPSPRPLFCTPSSHPKLVEKLMATSYAGAVIEDGASSLAASDHAKFHAAMKVLKARGWFKFDFVQPGKIEELGPTYVRRVVVGEPGMTYKYLNARVFAVPWAGPALEAADAAVAGALGAIRELNEVMIAKGNAHLEAAGTPGSTLFNLTLINDMDVLTGEGEGKPPPFEGLGRAVVSWHQDTSLEDWSTIGVYQLTAENETAPWHVGLRVIYDEVTPKLALPLDHGAMYFLLGDFNHHHQHVVVCGSSHRYSSTHRVTTTEHNHFDRVFPRLTRAVRKAKKLTKSPAALRALLPRPGSTWDLLCESFLVLEFEWIRQYWWHGATHHKLQRGLAYWPERVAQLEKAWAQLLALFRDVATLTASRPQSKLAAAFASLQADVAREREAWRARFAAVDAKVTAGEWTGVDLPIDWRRCPVAHGASSLVGATD
      >gi|Sarc1000008574|ref|SARC_08375T0 | SARC_08375 | Sphaeroforma arctica JP610 hypothetical protein (91 aa)
      MLSRLTATVDPIMEQVGGQGLNPRRRGTWRYPVARDSKPRTNKYKVAGFVFNKAEREWGPHTHDAFALPGNQLLPKYFSPSTLGGAVAES
      >gi|Bnat1000018648|ref|jgi|Bigna1|43015|e_gw1.71.34.1
      MAASVGDSIAQYNLGTSYKKGSGVDKDFKQALEWYRKSADQGYSLASYAVALCYEKGEGVEKDWQQAVECYRKAATQGHSGAQYNLGVCYQNGRGIERDVKLAYWWYRKAADQGYVISQCNVALCYQRGVGVGIDLKQAFEWYLKAANQGHSGAQCTLGICYEKGRGVEKSWEQAVEWYRKSAKQGNRGAQNNIAFCYQKGLGVEQNWKQAVEWYRKAANQNDSGAQYYLAHSYQKGVGVEKDMRQAVEWYHKAADLGHSGAQYALGFRYKKGEGVEQDWKQAVEWYRKAALQGHSGAQCCVAACYKKGEGVQKDWQRAVSWYRKAAIKGHSGAQHELGLCYEKGRGVGKDWEQAVQWYRKAANQGHPGAQASLEKADGIALKEGCRTIRPNS
      >gi|Pbla1000008754|ref|jgi|Phybl1|68567|fgeneshPB_pg.23__178
      MSLTTLKKTFKYLTVAYNGNVDYINNLIIPMEPPAHLTSRRQRKIWEQQQKDNAAKRHKPHHDPFRTLERHFKAPHPSMEDVIDLHAPHDKLIAVPLAHPLTSDVFGPQITSQTAYIVKDIPGLIVIPNPFSPEAQRAMIAQCLTDFARPPNTSNHDAFFKIPETGLWPLYVAEQSGAKSIPTVPPKEKSQGAFLANSLLSPTDMLRKQRWVTLGYQYHWGTKEYDLERDIKVPSTVGEMAVDVVTAIQGIGGEGWTNSYQANDFAAEAGVINYYQIKDTLMAHVDKSELNMEAPLVSASFGLSCIYLLGGPTKETPPVALRLSSGDIIVMTGACRKAYHGVPRILEGTLPDYLGSDAFGDQLDGELLGKWMNSTRINLNIRKVFPSSTSLETP
      >gi|Spun1000008764|ref|SPPG_09218T0 | SPPG_09218 | Spizellomyces punctatus DAOM BR117 hypothetical protein (195 aa)
      MECSKVPGLFLIDDFISKEEEDQLLATLDGRAWGGKGQKPNEELRRRTQQYGYLFSFRTRQVEEHLGPLPAFVDGVVERMRAFGVFAKEPPEYLLVNEYERGQGIMPHVDASTFGSTVTSLSLLTPCVMTFSKRDSGESVDILLRPRSLLVLTGDSRYNFTHSISKNQVDHYCGEPIERGRRVSLTFRRRAESS
      >gi|Pbla1000008773|ref|jgi|Phybl1|68613|fgeneshPB_pg.23__224
      MEDDSLDWESLFGSDNDSVDWNSLFGSNDEKEDDDRYSTDIPGLELVREALDHSQQMKIIQAILDTNTFSDAGRVNQAMCFGKLPPHLDWLSRHIKNQLPHLLPRVLMQREPVFDQAILNLYRKGEGIVSHVDLARFEDGVVILSLMSSCVMTMRPVPKVSPADPSLEVDILLNPGDILALSGLARYEWEHGIKECEYDVVRGERIERGTRISVTLRKLGTTVETPTIETTATRTSI
      >gi|Caps1000018790|ref|jgi|Capca1|105332|e_gw1.669.6.1
      MSSDRRRRARVQGGWAAPVPAAAAAAAGKGKEKAKLASPSPPAWLGKNVDHGPAPQKFLFKEPQEVILKYKLYIKAGVYDISSEPSGVARVRLFPSFIEANQCEWMYEQLFSELPWRQRSDVKSGVSYLQPRLTAWFGDFPYSYSGVRHEGNKNWPPILAMLKEKLEENTGCKFNSVLANLYRNGHDHVPWHSDDESQLGNHPTIASLSFGDLRLFELRKKAPLELRANLPEDYQYTEYVRVPLDAGTLLIMEGACQEDWQHRIKREYHDRGPRINLTFRVIHPETE
      >gi|Fcyl1000078853|ref|jgi|Fracy1|152506|gw1.4.648.1
      MDEQYLIRDIEIPCVYHDSNFISKSDADDYYETLRTTIPWQKTAKINRWVRLYQAVDDVDESTIETTTSDGDEDGSDEGTSGYTYKDAPPVEDDKNKDNNVIGAGYPELIQSIRQQCQDWYAAANPHCKQDDNIPSFNICLLNFYEDGQQRIGWHSDREEIGRTTPIISVSLGASRQFLLRSQTDGRNDRCSLNLTSGSVTVMEPICQIKYLHSVPKESDVVNGRINLTFRCK
      >gi|Smar1000009144|ref|SMAR005333-PA pep:novel scaffold:Smar1:JH431599:48277:48814:-1 gene:SMAR005333 transcript:SMAR005333-RA
      MVFLLAERLGTLEVDLFASRLNFKFKKFCSWSPDPLAWKPHSENFIESSLGPSNGVISGASMDHTKLVRPPTRVPYSISLQLPSKRSFEARTSGGNKVSTRRKNDPPSMQNLRASLTDRGFSEDATALYTASWWDTTVASYTEGDAQWQEFCDKKQDS
      >gi|Bcir1000009260|ref|jgi|Bacci1|261852|estExt_Genewise1Plus.C_2670019
      METPPVFKSKRQEKIWLNQKRFNENKKKDQKTYVNQAPFRYCERNFKSKVPPPDFTNVIDFKKQTHNTIENQDRIVSVELKHDLADLSSLFGTSTRQAYVLKNIPGLIVIPNAFAPEQQRYLIKQCLLKYPQAPNTSNLHTHYEIPSQGIWPLFEQQRNGKLNEADPDFYVQKKKIDSQDASTYSDSEEEEEEEEEEVMACSSVTACSDDFSPIIDGPKIDPPPAPGVPLLSPSELIRKLRWVTLGYQYHWPTKTYHLDRRYPFPEDVSELTRAVVTAVEGIGYQDKWRNTYKGEDYKAEAGVVNYYQYRDALMGHVDRSELNMDAPLVSLSLGSSCIYVIGGETRDTEPAALYLRSGDMVVMTGPCRRVFHGVPLIIEDSLPDYLSTSNNNNNNDDDDYKLYSEFMKTARINLNIRQVNTINNTD
      >gi|Fcyl1000039366|ref|jgi|Fracy1|257232|fgenesh2_pg.88_#_37 ; gi|Fcyl1000079933|ref|jgi|Fracy1|153622|gw1.88.28.1
      MDEQYLIRDIEIPCVYHDSNFICKSDADDYYETLRTTIPWQKTAKINRWVRLYQAVDDVDESTIETIETIENGNQNENQPQQQKQTTASGYTYKDAPPVEDDKNKDVIGAGYPELIQSIRQQCQDWYAAANPHCKQDDNIPSLNICLLNFYEDGQQRIGWHSDREEIGRTTPIISVSLGASRQFLLRSQTDGRNDRCSIALTSGSVTVMEPICQIKYLHSVPKESDVVNGRINLTFRCKDFSSNNSKEDQTTEGEELHERRDTFIDRITNGIEPSTTPWTATTSKSSTINSTCRATADNTAFGSKPYLFGEEEEEESFDENNKDNILEPSNVQFLIKTNMGAERYCKAEIRERLFTLSCDNYMNIADHWKVITRPFNLDGYIAVVCNKSTTGDEDNDNDNDDNTGFDNDNQTTINDIRDTLLQLKTAHHVLKYHYHFHLKECKKFVHQYIEDGVDTNIDEMKVEQYPKETLYEHIKEELTNNIISFRPIQDLTNNPSKTSFRVTSDRVGGPHAWQTPEVEYEIGGAIAEAYEQYHWKPKMVDYDICIRADIIGPSCIIGTQLNIHDLSKGRHFTRFRNAVTIKTNLAYAMIRLANISEGSKIVDPFCGSGTLLLEALDIYNGRLKNCLGMDVSRRSAIGSRENADAEGYTEDIVRFVCSDARTLRRKVDGDNTVDAIVTNLPWGIMTGQNQSVSSLQTLYEVFLRNSWYVLKPGGRIVMLVLRGLQIMRIVRKLSGRFRLLHINVIRTTNNLPCIIVIEKLNDDIVRDSIKGQLAHLNQYVNVSPEIYKSIHCEEVDEDCEPTNNNYNNNKK
      >gi|Fcyl1000029493|ref|jgi|Fracy1|247359|fgenesh2_pg.20_#_150
      MISSFFLPRRSSSLSSSSSEVNNNNKGDDNNNNNNDDEDRQQQQQQKGQLIIPTLPLVSIEVITQPSSQQQQQQQHDDDNNNNLNENRYSQMYYHSLDNNNDKDDNDNDNDETVTIKSKKNDDGGGGGEKGRTAIIDDDNNSNSKSKSKSKSDPDLNNSTNDNNCYYKYSGTSRLCLPCNRSRIDENFTINLCSAADQIITQLQMAAIVALKEEEQQQQQQQQQQKNDNDNKKFLENIRLRIIHQKDNKNNDSDTIITNKKEDAENDTIKIRKSRRKRIYNSCLVNWYKPDHTIGLHSDDEPEMDTVTYPIVSLSLGGPRRFILKSKQKQQKQKSQSSQSKQHQQQQQQSQSNNRTIQKNHEFILKDGDLFIMGGNCQKEYKHEIPKPQQQQQ
      >gi|Fcyl1000099591|ref|jgi|Fracy1|174063|estExt_Genewise1.C_270102 ; gi|Fcyl1000099590|ref|jgi|Fracy1|174062|estExt_Genewise1.C_270101
      MIQGAGLQAGVPGSIMDILLKQFNCKKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFEKEEGICYQANPPFCEGLILQLNNKITDILLSSQQHQHQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLWNLQKTDQK
      >gi|Fcyl1000049607|ref|jgi|Fracy1|223570|fgenesh2_pm.1_#_442 ; gi|Fcyl1000103291|ref|jgi|Fracy1|177763|e_gw1.1.174.1 ; gi|Fcyl1000105119|ref|jgi|Fracy1|179591|e_gw1.1.2002.1 ; gi|Fcyl1000088119|ref|jgi|Fracy1|162422|gw1.1.2002.1 ; gi|Fcyl1000064982|ref|jgi|Fracy1|137554|gw1.1.174.1
      MPKKRRKTQDVSQLSNLSNLTGFGNAGFGASSSLDDCLAEMSNTVSATTTTTTTTTTTTISDCIKGQNEDKKVESSLHNKSRSKTTREKQQPSWIRQAKISTAEGIDNWNRNFQIWAKGGIYHPGLLPNIDQEIARNFKVQELSTLLLSNEFVKGKDIRMTTFERWLLDSKHEEEEEDEEDNAGGGGGIVQGDPVLPLRSSPDSKASQRLLSELITCTSTNLKNKNNNKTSKKIPRQDAEKIIAKLCRTTNLTCQELLCQEDRYRKQSPLNKGDRINVSTKTTESSNVIVVSILYSRKRWKKPFCFKLNQNHYKLLKDRFMEIHAPSSTTLTNAPLFGGRNNNIMSSSDNTMTVLVERSFHVLVLALLLRYSALSGGQLLEDLRGGGMQGAIHSSVFDVLQSHFSKIPHNDKKSSSSTKQFWLEGFASPFNATLPRFASAFPDLDWHFGSVGRFLDCSFDEEYCEANPPFTPGIMLAMADHTTNVLQRADNDNTRLTFVVVVPSADNKKNTNKDEAVVKHEAQKSFRSMVSSVYCTKHIQLKAREHGYVEGAQHLRPTQYKQSSYDTSIIILQSPKAKKYGLDKTNMKQLEKDIRIAFASRHENEIMERKKMAASAM
      >gi|Fcyl1000019629|ref|jgi|Fracy1|237495|fgenesh2_pg.4_#_1043
      MDEQYLIRDIEIPCVYHDSNFISKSDADDYYETLRTTIPWQKTAKINRWVRLYQAVDDVDESTIETTTSDGDEDGSDEGSASNQNQKQQQKQTTASGYTYKDAPPVEDDKNKDNNVIGAGYPELIQSIRQQCQDWYAAANPHCKQDDNIPSFNICLLNFYEDGQQRIGWHSDREEIGRTTPIISVSLGASRQFLLRSQTDGRNDRCSLNLTSGSVTVMEPICQIKYLHSVPKESDVVNGRINLTFRCKDFNNNNSSIEDQTTEGEELHERRDTFIDRITNGIEPSTTPWTAENAMSSTINSTCRAADNTAFGSKPYLFGEEEEESSFEDSDSNDILEPSNVQFLIKTNMGAERYCKAEIRERLFTLACDNYLNIADHWKVITRPFNLDGYIAVVCNKSTTGDNDNNDNNDDNTGFENDNQTTINDIRDTLLQLKTAHHVLKYHYHFHLKECKKFVPQYIEDGVDSNIDEMKVEQYPKETLYEHIKEQLTNNSISFRPSQDLTKNPSKTSFRVTSDRVGGPHAWQTPEVEYEIGGAIAEAYEHYNWKPKMVDYDICIRADIIGPSCIIGTQLNIHDLSKGRHFTRFRNAVTIKTNLAYAMIRLANITEGSKIVDPFCGSGTLLLEALDIYHGKLTNCLGMDVSRRSAIGSRENADAEGYTEDIVRFVCSDARTLRRKVDGDNTVDAIVTNLPWGIMTGQNQSVSSLQTLYEVFLRNSWYVLKPGGRIVMLVLRGLQIMRIVRKLSGRFRLLHINVIRTTNNLPCIIVIEKLNDDIVRDSIKGQLAHLNQYVNVSPEIYKSIHCEEVDEDCEPTNNNYNNNKK
      >gi|Pram1000009643|ref|73389
      MDVEDFRRGPIPGVFYIPNWITQDEEDAVLERVYAVPDDNELWVRLKHRRLQMWGGEVKDPFEPKPLPQWLMQISQTLMDAGFFSEEKTPNHALINEYGAGDCILPHEDGPAYFPFVAIISTGAECRVTFELHRDLASTDNQGVSAATELVPHFDFQLERRSLLMFTGEAYRRYLHSIDNVEVGTRISLTVRHVDLR
      >gi|Caps1000009657|ref|jgi|Capca1|191192|fgenesh1_pg.C_scaffold_335000028
      MEKFKESFKLYKRKKPPPDYSNVIDFTKIDAEDRQVCSLILEKQSCDVLDGMIPFTSWKAYQHRKIPGFYFISNPFTPDGQRKWIKRCLNDFPLKPNITNLDVNYSDLIRERNVWSMHLDADSTDEEKQLLHKLRWSTLGYHHNWDTKKYTADRYTPFPDDLSCLSRCIAHGIGFPHFKAEAAIVNYYHLDSTLSGHTDHSEFDHISPLISISFGQTAVFLLGGLTKDIDPIALYLHSGDICIMSGECRLAYHAVPKILRTPTSELPYHEGDDDTVNGRKVEDTFEPFESYLQSSRINMNVRQVLCDGQEFPKDES
      >gi|Sarc1000009775|ref|SARC_09551T0 | SARC_09551 | Sphaeroforma arctica JP610 hypothetical protein (395 aa)
      MEGATVNYLHTSAQHFLTDIEYGKFPLRLANNKTTIITTRGTLPNLGLAYLDHTLTQGIIAQSHLQDSGCSLSFPGDIMSCHLTYTGTKLELRRRNGGYYIYYTALQNLFHSPTIRNIPQESLDIHTAPNTTSNTIMIDPPAPPTHTTMTYDEFHRSEGHRIRRTTLRMAKQKHITLTHTPSKYVQCEAWMLDRSIFASSMIFLGYTPKMDLFAAKHNVQVPHFVSPEGGGIATDWRQIDFTNEPGLYGNPIWPDIKELIDKNTKAGTTLAIVVPLRPTATWWDSFLAHLQSQPYIIQNTTSIFRRHSATIVGKPSWLFTLVALLGPISSHVTPGDDLKSAIDYVRFNPEFPKNCRVCVEGKMKAKHVSHCDKYASITISQRTPLIRGETLAIY
      >gi|Ttra1000009841|ref|AMSG_11724T0 | AMSG_11724 | Thecamonas trahens ATCC 50062 hypothetical protein (232 aa)
      MSAEASTSTSRKRVRDEDGDGFTRDDGGRTGKAARGLDGEAVPGVPGLTVVTDAVSPEDELALLAQVDNGEWMTSLKRRVQHYGYRYDYRSRKVAPESYLGELPEWSKQVMARLGAAGDGEFDQLIVNEYEPGQGISAHIDCKPCFAGVIVSVSLGSGAVMRFAKDDVVCDVWLPARSAVVLTGPARWEWSHSIAGRKSDKVGGVRIKRTRRVSLTFRTVRLAGSEPATGE
      >gi|Fcyl1000129851|ref|jgi|Fracy1|204323|e_gw1.113.10.1 ; gi|Fcyl1000129857|ref|jgi|Fracy1|204329|e_gw1.113.39.1 ; gi|Fcyl1000088406|ref|jgi|Fracy1|162728|gw1.113.39.1 ; gi|Fcyl1000066919|ref|jgi|Fracy1|139674|gw1.113.10.1
      MKAKRYGSDTKLSNVISFLRRFGWLHVTVFAVVTVSAFSGKTGTVGHRRNKKSPSVQVQQQQQQQPSLHTKLQNCFTPTSILENIAVLVTPKVDPSASLSSLALIRLSKQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKALETAVEGVKAASVISRLVSSDYLISSNNDNNNGKVWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHRQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDDVDNKHNNNLFTVASFVKQVKFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGNGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRVSLVFKKTLGYSNYKRENKISN
      >gi|Fcyl1000029925|ref|jgi|Fracy1|247791|fgenesh2_pg.21_#_237
      MGKSRSKRKSKDKLSEENISQPQQLPSTTMTASLSLTNGDNNNSVHLPQAMPQHLPRPFGDNFLKDESPYRKSFREALTTSYEGFVFDDAATLMSQPTAINNVKEDVVQNSLESMSRGGIFRTDVTQPFGLGTKCAKTYVTRCLVGSPGTTYKYLGLRMFAHPWTTSTTTGEDNNNNSNMKRNDNERRVCHTVVTNDAQTIQELALALTNRTKKHLRDLDESRRQRQPMFGTRGRPGFDICLINRMESSSDLKPYNFSGDSSTSSSNGKSGNKNGVKTTVSWHADSSLEHFSSIAVYQTILGSQKDESNNNSNDKRKRQRTDCQKQKADEEEEGQWLVALRVAHHSEGPQASQQRRRGTNTETATVEETPPIAVNLPSGSCYYLLDDFNHHHQHTVLTTGNTSTDWIDRGKSAIRQFHKKGSRIWRSEQLLLTEIESEWIRQFYIQGMGHHQLLWESYWKDPIQELLSIWSRLEHRTEQTIELLRAAAEGKCGVGMNTEKAADKPTKAERKARDRRKKSLASIRELVSRINETPEEDGATAFTELYQPMAELLEERAEMRSKWEKREKDHVFHELPLDYRPMKVPFKFERTIDENNIGNEYVATSPLPNSPDKLKEIAAQLLQLGRAYRNGDAKQLPPPWKKEKPKQDALAEGDSTVDDHSKPLDWSGWNACAQLFGLELQHPWAAAIIDGEKVIETRSYSLPPSLIGGTKVMIIESSSGKAGVSSLCNHVDFSTSSRKKGTGNSKVIGWCTFISVKTYTTKQEFQAEENLHLVTPDSGYGWKDDGSTEKVYGWIVGERYRFDESSTTDEENFLYDSGVRRFRSLFQLHKKKSKDAYPNTSNKKNINKRNLERENKQNSNGKNKKRGRY
      >gi|Sarc1000009974|ref|SARC_09749T0 | SARC_09749 | Sphaeroforma arctica JP610 hypothetical protein (333 aa)
      MWHRDEFATDPRGRHQGRWQEVCRNIDVLLLHVRVVDNGPVDMLSRLTPKVDPIMGQVGGQGANPRRKVNWRYPVAREAQPRTNEYKAAAFAFNTAEKAWGAHTHDAFALPGNQVPKYFSQSLEGGASAESWVGRNMWVNPPWELIPRVLAKVVAEHLEITLVCPYMPKAKRWDPMTRMRASQSQYRCSMVFSCGMGMKLRVPTMGSDAALSDLVQQALPGVVGDCPTAPALVSRPERALLVPTPAESDQLAAAAEVEEREARQQRAARAKRRVAQKLAGKALLARQGQSPYALRGLTLSRPEVRNVVGAAHNVYMHTGAEKFLGKLEELYG
      >gi|Fcyl1000039992|ref|jgi|Fracy1|257858|fgenesh2_pg.102_#_45
      MGKSRSKRKSKDKLSEENISQPQQLPSTAMTASLSLTNGDNNNSVHLPQAMPQHLPRPCGDNFLKDESPYRKSFRKALTTSYEGFVFDDAATLMSQPTAINNVKEDVVQNSLESMSRGGIFRTDVTQPFGLGTKCAKTYVTRCLVGSPGTTYKYLGLRMFAHPWTTSTTTDEDNNNNIHIKRNDNERRVCHTVVTNDAQTIQELALALTNRTKKHLRDLDESRRQRQPMFGTRGRPGFDICLINRMESSADLKPYNFSGDSSTSSNGKSGNKNGVKTTVSWHADSSLEHFSSIAVYQTILGSHKDESNNNSNDERKRQRTDCQKQKADEEEEGQWLVALRVAHHSEGPQASQQRRRGTNTETATVEETPPIAVNLPSGSCYYLLDDFNHHHQHTVLTTGNTSTVRYSCTFRLLRDSHNIQDWIDRGKSAMRQFHKKGSRIWRSEQLLLTEIESEWIRQFYIQGTGHHQLLWESYWKDPIQELLSIWSRLEHRTEQTIELLRAAAEGKCGVGMNTEKAADKPTKAERKARDRRKKSLASIRELVSRINETPEEDGATAFTELYQPMAELLEERAEMRSKWEKREKDHVFHELPLDYRPMKVPFKFERTIDENNIGNEYVATSPLPNSPDKLKEIAAQLLQLGRAYRNGDAKQLPPPWKKEKPKQNALAESDSTVDDHSKPLNWSGWNACDQLFGLELQHPWAAAIIDGKKVIETRSYSLPPSLIGGTKIMIIESSSGKAGVSSLCNHVDFSTSSRKKGTGNSKVIGWCTFTSVKTYTTKQEFQAEENLHLVTPDSGYGWKDDGSTEKVYGWIVGERYRFDESSTTDEENIPYDSGVRRFRSLFQLHKKKSKDACPHTSNKKNINKRNLERKNKQNSNGKKKKRGRY
      >gi|Wseb1000004329|ref|jgi|Walse1|61194|estExt_fgenesh1_kg.C_210042
      MISSEGDALLRYHVMKRNNEEEYLAKLPEDKKQKRKVYGQRYSLTDEESIRNDYSMAYVNQLVRPQDQIINPHREGRFAEYPRQKQLLKLKDSLVEEYAYPPVYLPFDLVDAIPKSPTTPSSLFDAIDIGTKFDVIMIDPPLPNSRSSKLEGRQWTWDALATLPIRNLSADPSFVFVWVGSGGEDGLEKGRELLAKWGFRRAEDIIWVETTPEGHKEEDTSDVPDNLFKRTKQHCLMGIRGTVRRALDSHFVNCNIDVDTIIAESASRKPSELMALIETFCLGTRRLCLFGEPENARRGWLTVGMKGSNNETYSPPEGVDMFDSKQWSERFPNNKANLVPLTPEIDSLRPKSPNRSNNNSRNGTPKSDPRTPPFATRPVGSPYGTPPNMPYNPQFNAQMRYHMQQQMQQQMQQQVQLQIQMQLQMQAQQAHMQMFGGGNLYQGVQAVPQMYYPPVNTTGTIYPLSPQMPISLGFDGNTNNRRHAKTPSNSNQNNINRNRN
      >gi|Wseb1000002161|ref|jgi|Walse1|60123|estExt_fgenesh1_kg.C_70035
      MESAIMNILDNTSGYNINQLLRRLIFNYPDLGLTFNDLDKVSSKLIELGIDLPTTSYINNKDTLYDSYTRSVVPEDDYVKICSNTTRTECTEDCQKVHFQPIIRPYSDHALGHCSYLNTCYPFYNNAPPTLSNAFQPAKLNSPRLDRTCKYLHFQLESPSESAIEQADYQTKRRKKCRGDGLRQELDTILGSKRYPAQYINCDLRSFDYNTLGKFQIIVADPPWDIHMSLPYGTLTDDEMRKMPMSTLSEEGTLIFLWVTGRAMDLGRECLSIWGFKRVEEIAWVKINQLQRLIRTGRTGHWLNHTKEHCLVGMKVSDPDASDIQWPEWLNRGLDTDVIVSEVRETSRKPDELYGMIERCCPVGRKVELFGRRHNGRDGWLTLGNQIGEDEVYDPELSQRLNERYPEKGKLVVGR
      >gi|Uram1000008906|ref|jgi|Umbra1|259486|fgenesh1_kg.75_#_199_#_combest_scaffold_75_134233
      MSSREDTPISVDADDFALDNVTDSSLKALLQEETKLKAQLDALTTEIAKLESQINPAPKEEDDEEIDMEEFEAPQWCVPIKANVMNFDWDALAAETQFDVILADPPWQLATHAPTRGVAIAYQQLPDVCIEEIPIPKLSKNGFIFIWVINNKYAKAFELMEKWGYKYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGEDPPGCRHSVGSDVIFSERRGQSQKPEELYEMIEELVPNGRYLEIFGRKNNLRDYWVTVGNEL
      >gi|Uram1000004716|ref|jgi|Umbra1|238969|fgenesh1_kg.24_#_143_#_combest_scaffold_24_50604
      MADRSRRKRKSRNTVASNIHYVGYVEDEESVEAIMKKFEELDRIQKEFSAISVNSTPNTSSENKENEVESNGTPAEMEEKKSEASSQGLTEAQLQEIFKRTSAFTIRSAMMDSNDMDELNDVEIWQLQYHDGLTDEIYEEDDYIHLDMEDDDFWDMEFGGDAKQKKRMRRAPAPVREKAPSGGRRGIDRESIIAKYKIMQVRVQDRNGNFFMVKKKVSAIDPSLPTYVKIPGEPIPRSWVHTILHQQVPTEDIASTIGQKYLKKNILDMDLKELGTNFQAVYMDPPLLLPGEEPCPGKITIEQLASLDIPSIVPKGFLFIWLEKEWLPDIVRICEKWQFKYVENFCWIKKNLNNQISKKPYTYFNKSKLSLLIFRREGDVELRHQRNPDCVFDFSKPRVPGELTELKPRFLYQIIETLLPSSTFHPENNPDGNRLLELWAKHGQKRLGWTTVVDES
      >gi|Uram1000004450|ref|jgi|Umbra1|237704|fgenesh1_kg.22_#_168_#_combest_scaffold_22_46365
      MGQLDDSMDTLSSHAEDQGGYDFIVIDPPWPNKSAHRSKNYDTLDIYALFDIPMAKLLSSDALVAVWVTNRPKYKQFLIDKLFPSWNLELVGEWYWMKMTTMGQPVMPLDSTHRKPYELLLVARSKAGSSIIHVPEKLVFASVASQHSRKPPLNDLFSLFDSNHPDRRCLELFARCLAYNTTSWGNECLLFQHDSYFTKTSSAPDTP
      >gi|Uram1000001485|ref|jgi|Umbra1|223403|fgenesh1_kg.5_#_767_#_combest_scaffold_5_103034
      MEFNLTRFEKSTTEEKPPKKDLLSLSRKRRIGSEEDNSTKRQALKPSNGAVQGSKCNVTDHIADKLQDSHISKQPNKPYCDKACAHGEGSSKHHQIHWGFMMKDKPKVKKEENKKVDSDEEFWKKAAEMAPEDDVQSEDEDMIETLPGTPSFDDMPSPVVNTLLEQLEAQPEEEDDDTLMIPDDDTDTEKEVANDRRSPEPSEQDIPANDEEETEKSSVVRTKVPGENTFSHDCVKTWSPARIHAWEIRRLNPEAFYYRFVDPAQGQSNGSFTKTDHDNFMKRMEEWKEKGYRIGASWGIFSMGIPHRAGYQCSAYYRKLIQSKKIKDDAYAIVDGKLKMVDKNRTNAGEAATSALSSAWDLDEVKEIEREVDQWLKEYHNRSGSIVAKKPAAPRPKAAPRVHKSTATKSGDIALLVKQNIGVGNFRLLSDEEAFMLANEPSAITLKRRNWDLEWKESLQEYRKFVEGFQDPDVRRKYHELKKQWRKGLITTELNMSLRRQTQNTPTTTVAPTTSAPVAKKPTQLSLANFFSGVKKVKPKTDTPDDLISRYIIPNDLFSGVKQMRGLYNSKRASATEPKTAYWEMKDMNDCVSVLDEVMSQTEAASSQSPIEGMLIDPPWEFYVADGRNDGRCTWSLTDMMSLMENVLQHMSAGLIFMWTHKSTQADVVRMMYSLGCTYVENLVWFKKTLSNVLQDRPSPYFSSSKEILLIFKRGEGFELRHQRTADVIIDFERPTEQWIHEEYTELKPPGVYDMIETLLPKAGFDEALGRGRFLELWAKRQAPRRDGWLAFHHIKTEVDTDAKQIHQEESMDTANNETEVASMADQN
      >gi|Uram1000000276|ref|jgi|Umbra1|217644|fgenesh1_kg.1_#_1214_#_combest_scaffold_1_3769
      MADPSYRWFDEVPPRKKFKQQHSGIPNHNYPHESRQRVCPVALDELLQTETTSEEIQRLTWKSSLDGFKSHCEHGTKQECRRRSIENRTCDRLHFKHILNSHTDSQLGDCSYLNTCHRMGSCKYMHYCLDATAEELKPILRPTLSNPLMLQRKKLLPPQWIKCDIRKLDFDVIGKFTVILADPPWDIHMSLPYGTMTDDEMKDLKISKLQDRGLLFLWVTSRAMELGRECLMNWGYELVQELVWVKTNQLQRIIRTGRTGHWLNHSKEHCLIGRKGNDPFFNVGLDCDVLVSEVRETSRKPDELYDIINRLSPGTRKLEMFGREHNLQPGWLTLGNQLKDSRIYEPQLVERYNAKYANHPIRLFSLPHEYR
      >gi|Ttra1000008696|ref|AMSG_10222T0 | AMSG_10222 | Thecamonas trahens ATCC 50062 hypothetical protein (556 aa)
      MSTPSVASLYVGYAEDFESIESIMRKFEALESVKAKAAAEAAAQAEAAEVSAAEATAIGEVVADGGDEDRGVDTTRASMPAQSGSQELTQAQLEEVFRMTSLQPVTLLQTDFVFESGDVAADKAELDRFLEQNEAFFENSESDDDDAAFVAVLEDDGWWDLEYAAKPKKRKRKRSQSSKMEAKMEARRAAAALKKARRLEAKARKMEEARLRREMRAQKIRERKAARAAAKKAKASVPKPRTSFMRCELLDEFGRKFVFKKREKDVDPRQPQYIRLPEYPQLRTWCHRIRPAGQPPSSLPPDLMEASHLVGLSEHGRSPGTVRVADILRTPWDVFGEGYEGILLSPTLRLDSEPAFPNSISISQLEELRLSDALFPGGLIFVWTDKLTLPHVVRIFEAWGLKYVENLVWIVLNRNHKFHESPSSVFKSSKLTLLIFRKCGRSLDLRHQRNTDVIFDFMRDPELRKKPTQVYETIRTLLPYSRLLNVWAPPTALDQPIPTWEAIVDVSVVPPTHPMDTLDVIDVPPASVPLVASVLPPSRDTPLPMAVAPVATGAL
      >gi|Ttra1000006356|ref|AMSG_07188T0 | AMSG_07188 | Thecamonas trahens ATCC 50062 hypothetical protein (1237 aa)
      MPTVAEMVEEERGRRRRRGKVVVYNEEALLKSQWRDSEKMKKAMKVEEIEAKKVRRAHARRRAAEKKKKAAERRRKAKAAAAARRKKKAAEAAAAAAAAAAAAAAMDPGDAATLQDSVGSTVGGQKVDAGSGAQASAAGMGAGSSGAASVSASASAASTPAATPSRDSRNARHPRHPQHGYVIYDDYGLPIVVDQLPKRVRKRAPAAVTSAQAAAQNRPMVARKLLDVPSPPTTPGRRAGLARPDRNSPRLSAKASGSAPPSASKGKGKSKGKSKGKGKASVALGANGSGGPPMKDVLRRFKRWTKTYLTNAVDVLPQTAMPSTVIVGENAVVRHGSSVFLDRITAYNEAELKDDPVALALDKALSCFSEDNPLTTLPGFSPRSLWYPSLSEVLDAAFGGSVEQMARSLSAVDASTRAHQAALRKGLIRPRTLTATVSAPRSSAPEQALAPMDVVTAPPQRSQPPPVSTRTRAVQTEITHVQSKTARRWDDVQRVSSELDAGIIPKNVYADQLNPKRTFEHFISRSPTKTATRPTAPLATIAATVATAAPAAGEPKDSSVNTLTSAMNPAPSDALRNPAALSTGKRGRDEADEATGTAAGHGPGPVPKKAKPMEGPQLKLSAPMAAPAPGPVAETPAPSMTVAEMVAAATAAATTAAAVLGTGGPAGRSTVSATPAGAAVPPRACIASTASVSSAPPLAAESRPPPPPPPPPPSLPLPPPPPSVPTGVEAAMANEDGGPKMMLKGEERWTSGTYVNCDVRYFNLKCLGKFDMVYIDPPWRIRGNQVMPEDGHIFSNSQRRRNYDTLSNTDIYDLEIGELVDTGLIFMWTVTSQLKVALQCLEHWGFDFIDKITWVRLTHRDNVAMGLGYYFLHSSEICLVGAKSRPGARLEIIPKISNDLLFARVGKPSEKPVEMYDIMERMMPGGRKIEIFARNNNLRRGWLSVGNLLGPNFEYVKDRVMCDECAAGIPIGTRRFKSRSVPNRDVCAECFAGTGEPAHNYFELEHAIAVMVFHNNYTCDMCGMNPICGIRFSCSRCEYDLCEGCFDFSVTTWAAAAGAASPSAAEPSIFHTPDKKSTRVRSATHDPAHAFLAYEDMDDSGGLPKHHARCTSCFDFPLSATGSSASTANGSRSAKSASSAMRLLDRPRESCICPRCVIDVHATERALVRELPSVRVADRALVLAQRAASAAAADEIIHDAVSDVAQGVAADAVADAVAETMIDDAAAAVAAESMAG
      >gi|Ttra1000005250|ref|AMSG_05730T0 | AMSG_05730 | Thecamonas trahens ATCC 50062 hypothetical protein (304 aa)
      MTSRGEGEDGAGPNLEAGTVETMTTGQTMTTDMVTAGPEASMLVADEELKSLDVDTLARLLAQEEKEVEAIEGDIKRLNAQKRIPLATGKNKVEFLKSSFDEIDWESFEAPEHCVPIRADVRLFDWAALGAQVQFDVIVMDPPWQLATSAPTRGVALGYSQLRDEDIMRIPVPKLQENGLLFIWVINARYSFAFQLFKEWGYEYVDDVVWVKRTVNRRMAKGHGYYLQHAKETCLVGRKGADPPTLQSGVVSDVIWSERRGQSQKPEEMYEMVEKLVPNGRYLEIFGRKNNLRDYWVTVGNEI
      >gi|Spun1000006218|ref|SPPG_06562T0 | SPPG_06562 | Spizellomyces punctatus DAOM BR117 hypothetical protein (358 aa)
      MGPHMGCDLENEPELLELSANNPEKKEKTEDEPEKPAAQIIVRNDYMQNFINTSLRPQNYIREVHPLVRFTEYPKLQHLAQLKDAIVDAYATPPMYIKANLRREGLGTLLGGIRFDVILIDPPLREYCEWSPSTIINCPDPVRPYWTWEEIGALAIEDVAATPSFIFIWVGDCEGLDQGRSLLRKWGYRRCEDICWVKTNKTWPGTPLMAPPSVFQHTKEHCLMGIRGTVRRSTDGHFIHCNVDTDVIIAEEPEDLTTRKPEELYYLIEHFCMGRRRLELFGNDENIRRGWITVGLGLSTTNWDRDRYLSWLRGEGAAQGTPYVRGGLLVPTTPDIEAIRPKSPPLSVRKGPRRGII
      >gi|Spun1000005457|ref|SPPG_05756T0 | SPPG_05756 | Spizellomyces punctatus DAOM BR117 hypothetical protein (454 aa)
      MSSRRSTRKRKCNTADISSSWYVGYAEDGESVEAIMQKFQELERMQQELAAQGSSTPVSAPTPEASSVFASGTNSDADADMARAIALQEGQEESTFTQAQLEELFKRTSCFTVKQATLDLDPDDLDELELWRLEMEGGDDDDWEENDNHILDDDMWDDEFGPARSGRRGERIPRARSGGLRSKLDRESLIAKYKVMQIQMQDRNGNFFVMKKRVCTVDPRLPTYIRIPPVPIPRSWVKLITSYAPPSGDIEGCRYFEDDILHFNMKPLGNRFQVVHMNPPFLMPDEEPTSGKISMKQFEKLDIPAIIPFGFLFIWAEKELTPDILRATQSWGFRYVENFAWIKRERSNKIARQSSRYFNKSKTTCLILRKERKEGDVELRHQRSPDCEFDFIKPKLPEDLTEEKPNFVYDVIETLLPQAVYGPANQNGDRMLDLWAKPGRRRKGWTMVVQKRS
      >gi|Spun1000004502|ref|SPPG_04738T0 | SPPG_04738 | Spizellomyces punctatus DAOM BR117 hypothetical protein (418 aa)
      MSQIVYQSDFGWVLRPHLYTLGDAWHWPASAFDVLAPYTARTKQHVSKALPVEQGDAGVDAPNVRKRKRSKQVKSDHVNKQNGINELHSSICHWLSEAHASLVLHSGTFPLPTTAAVADTSSGASIDDLDFIKFRELADIADSSKHALESEETDQACGIVPVDDVSRELDISSLYHQFISNDSDECKTLPVLGHSFIVPPHSVFLMSDLSQAQLLRSLNPFDFILMDPPWPNKSVRRAGKYQEIDIYDLFRLPLKHLIKPGGCLAVWVTNKPKYQRFVRDKLFSACGLAYVAEWYWLKVTLKGEWVVDLDSLHRKPYEVLIIGRAQHASNSHPDPVASLPTRRAICSVPSKHHSRKPLLDDAIAPFLPRDAQKLEIFARNLLPGWTSWGNEVLRFNDVQYLTQTTEGYLAPPPTELG
      >gi|Spun1000003158|ref|SPPG_03319T0 | SPPG_03319 | Spizellomyces punctatus DAOM BR117 hypothetical protein (581 aa)
      MDGHTDVESSANADNHGLLSMSLKRRIAERKAKGVMLDASTSFEGRVRKTSLLRRPSPPRRIQSPGDTYTNKPNEDLLASALLLSQRDLEPLVQNFLNTSNCIEWLPMDSHTLLRKYAGVTVSITPTLIIMIEKFFITSCSLIEFTPTLAMAHYNWKNVQSLERTLRALESRDGEPYLQLDIALVGSGKRVAVTRVLSKSTCPATLYPPSTRHRQLEVDWVHEVFVSLHPEPPGIESIKELLATPSFKGTANGKIWNELHNLIHRPTAKQNLIQEKFKNREDVEFKEFCEWGLRTDCNKHQHSTPCPKLHFRRIIKPQTDVSLGDCSYLNTCHRLDQCKYVHYELDEVNVTIDIDRPLPVLQIGNPLPAQWISCDVRKFDLSILGKFTVIMADPPWDIHMNLPYGTMTDDEMKEMPIQDLQDEGFLFLWVTGRAMELGRDCMAIWGYTRIDELIWVKTMQLQKLIRTGRTGHWLNHSKEHCLVGVKGGCHLQTGGIDGDVLVAEVRETSRKPDEIYSLIDRLCPGTRKLEIFGRPHNTRDGWMTLGNQLDGVRICEKAVLERYNKLYPATPATLFVDRMS
      >gi|Spun1000000693|ref|SPPG_00725T0 | SPPG_00725 | Spizellomyces punctatus DAOM BR117 hypothetical protein (357 aa)
      MSKMPPSRGPMAISSDVDVLTSDSEVLNIVESDGDGDFAPSRPGKRSQRVSKIVSKTSVSKSRTKSKSLKSTSRLSRPEIVRASSVVSTGSSASGTASADGKEDAAKIGPESSLEELLKREEHLQLQIDILMEEIKVLRGGEKSNAVGDQAEEEEIDYSNFDAPEWCVPIKANVMTFEWDRLADACQFDVILMDPPWQLASHAPTRGVAIAYQQLPDACIEELPIQKLQKNGFIFIWVINNKYVKAFELMERWGYKYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGRKGNDPPGCRHGIASDVIFSERRGQSQKPEELYEMIEELVPNGKYLEIFGRKNNLRDFWVTVGNEL
      >gi|Smar1000012137|ref|SMAR002698-PA pep:novel scaffold:Smar1:JH431216:18104:19856:1 gene:SMAR002698 transcript:SMAR002698-RA
      MGDCLKLLKERSQKRRELLAQQLGASSVEKLSEVLGNENGKERNKEESSDKKESKDTGDRPKKRLKDDEDDEEYYASYELPEAEEDIYKDSSTFLKGTQSANPHNDYCQHFVDTGQRPQNFIRDVGLADRFEEYPKLRELIRLKDDLISTTATPPMYLKTDLETFDIRELGGKFDVILVEPPLEEYQRSAGVTNTQFWSWEQIMKLEIEEVAAQRSFVFLWCGSSEGLDLGRKCLRKWGFRRCEDICWIKTNIANPGHSKNLESRQVFQRTKEHCLMGIKGTVRRSTDSDFIHANVDIDLIISEEPQYGSLEKPVEIFHIIEHFCLGRRRLHVFGRDCTIRPGWLTVGPDLSNSNFNNETYNMYFNNGPTDYLTGCTERIEALRPKSPPPKMKGAGAGGGGGRGASARGNSGGRGRGGFSARGRGSHRGR
      >gi|Smar1000010256|ref|SMAR004352-PA pep:novel scaffold:Smar1:JH431477:154003:157599:1 gene:SMAR004352 transcript:SMAR004352-RA
      MGERKGVNKYYPPDYRPEKGSLNKYHGTHALRERARKLDQGILIIRFEMPYNIWCDGCGNHIGMGVRYNAEKRKVGMYYSTPIYQFRMKCHLCDSHFEMKTDPQNLDYVVVCGARRQERRWDPTQNSQVVPEDKDTTQKLVTDAMFRLEHGEADKRKGKEAAPSLARLEDMQNRWKDDYTSNKILRQIFRDKKKEKQEKEKNNKLLLIKSSLSIPLLDENPEDVKMAKLLKLTPLQSYEERQREKRIEIDSKPILPQLVKNLNQNQKDLKREELVRIKNCKNSTETRISLVASEYNDSGDTNSYEIKDIFFDIRSEYLMDTQALARVSTTQRKRRRKIERKNEFSIYHKEVCDLIEKCAQSWKPECLPGKPNLEEILLNNIFARTASKIIDDNCRLSQLCSRNMMKEEPLIHTITRNGHVEPNLKFVNKLITHDEDYACVIKWDSKDFLIPANSSFLLSNDLNILTKTNRRYDVIIIDPPWTNKSLKRKKCYNTSSEVNFLSLPIKDLASKNCLIGVWTTNNSYLINYVKSVMFPHWGVEYETDWHWLKVTRYGEFVVPLNNHTKKPYENLILGRMTGSMENITSNLIFVSVPSCFHSHKPLIRDLLKNFINKEVKGLEIFSRSLCSGWTSFGDECEHK
      >gi|Smar1000006443|ref|SMAR007641-PA pep:novel scaffold:Smar1:JH431796:151221:153694:-1 gene:SMAR007641 transcript:SMAR007641-RA
      MSDTWSDIQAHKFRQSSLREKIQKRKKEREEIVNSIANDLSPTVTNKANTVESRSNSPTPIITKAKDSSESDSKCDPDVESKLLLCLCDVALNLPTDSRALGSLVSKALNREASNKIVENLLQKFAAQELISLKDGFTPDGKSCLNVTSAEHTKLTAVSNDLIGIQSEETTKVGKKRKHDSLHDDETNVKIVKDNLKKDKKDESIESLLSMPSIREKENKKMGEEILDLLSKPTAKERSLAERFRSQGGAQVQEFCPHGIREECAKVSGNNEPCRRLHFKKIIQKHTDESLGDCSFLNTCFHMDTCKYVHYEVDYYGAQIGEKFRQERELVAPKNLCGEKDLATLHPPQWIQCDLRYFDMTILGKFAVVMADPPWDIHMELPYGTMSDDEMRQLNIPALQDDGLIFLWVTGRAMELGRECLKLWGYERCDEIIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNPKGINRGLDCDVIVAEVRATSHKPDEIYGMIERLSPGTRKIELFGRPHNIQPNWCTLGNQLDGVRLMDMELIHDFKKAYPDGNCMVPMAKS
      >gi|Sarc1000000133|ref|SARC_00130T0 | SARC_00130 | Sphaeroforma arctica JP610 hypothetical protein (361 aa)
      MLGSVADDTKEMDSYSDDNELSSCDASDSSEEEPELASDVDELIDEEIRTVQEMVLSKRKIKALKKARRVLTEDEVVTEESTTLCELNSGLSKEELLASRSARRVLRRNHEAKRDQTAEDETNLHNSNEPHATNNGAVATAENADVAGSCSESKKTSDSKSSTQNFTPPPFSVPISADVTSYDFKSLAQLTKFDVIHMDPPWRLANAKPTRGVALGYSQLCDNDIADMPVECLSDSGFIFIWVINNRFEVGLELMTKWGYKFVDNVDWVKQTVNRRLAKSHGYYLQHAKETCLVGFKGDLNYVKSTTHTDVIFAPRRGQSQKPEEIYHLIEALVPNGKYLEIFARKNNLRDYWVSIGNEL
      >gi|Psoj1000016503|ref|144700
      MEPKVSSVNHAALVNGYYMGRLCLREDSLQIPATAFIRPSIAQAPGTLSAAAARRHRKRETAARRRSERLGELTAQGKFVPLSDQVRDALQNAFERFGGPRFVLRDFLPAFQDPSDDIQKVLGSLPVLSDTEALDDGDLHCNDTDQVRVSISVSRAKRYDMFDHTELLKIDVPHLADSDECILAVWVTNRPRYMAYLREQALPAWGFTYHASWDWLKLSKKRLGEHKYF
      >gi|Pram1000002112|ref|82533
      MWLLNRIGLAAAGCSAALLSCGATASEPRCFPADFLFGSATASYQVEGAWSEDGRTPSIWDDFCREQPGFECANVADDFYHRYADDIKLMVETGLQSFRFSVSWSRVMNWDPETRRMQPNAPGLVFYHALLDKLVENGISPILTLYHWDLPIELHNELTPQGWLNPDIIDHFVQYSTLLYNEFGGKVDFWTTFNEPLSFVVYGYNTGLHPPGLHDSPTLVYEVAHKVLLSHAYAVQKFRELKSGGVIQPKARIGIVLNANQFYPLDASNPKDVEATERAMNFEFGWWLLPLTTGDYPPVMRERVGDRLPRFTVEQAAVLKGSYDVFMLNHYYSRAITDCDSETSNTPCSSLHVGFGQDKGVDDTHIVPGSRPGLQDSQGNNYCKSYTAFPPGYLQTIKWMHAKDPSAEILLTENGWCGNDEVENLDQLWFFQSYTEQVYKAVVEEKIPVIGYTAWSFLDNYEWGSYGPRFGLYYVNFTAQTGSVEGYEPKPTDLERIARPSAKWFSKLASTGCLDELSQADETAAIAMRATTRQELSVPVRTKPMEPKVSCVNHAALVNGYYAGQVHLREDSLQAPSSAFVRPTGSAQSLEKMSAAAARRQRKRETAARRRAERLDELTMQGKFVPLSNGARSVLQGAFDRFGGARFVLRDFLPSFGDDRSKTPDVVASVPEMSNATALDGGKVYCNEADHVRVAAVDNARVVLPAWCSFAQCDVRELHQLELARHKLIVMDPPWQNKSVSRGKRYDMFDHTELLKVDVPHIADLDECILAVWVTNRPRYMAYLREQALPSWGFTFHACWYWLKLCKDGELVTPLDSTHRLPVETLVVAYRAKDPHHEKLLRQRLGEQMRVVLSIPLRHSWKPPPECSFDEDIISRTDKKAELFARELRPCSSFNQAKCSTGAALSDAETSVVSPFTSKEKATQSVVDL
      >gi|Pisp1000009414|ref|jgi|PirE2_1|12713|gm1.11659_g
      YYEEDIFKFDFNKIGNDFQAIYMDPPLLLPGEEKTPGKITIEQFGSLKISRLVKKGFLFIWCEKEYIPNLIQICDKWGFKYVENFTWIKKYINNRIVRQPYTYFNKSKITCFIFRKEGDVELRHQRNPDCEFDYIKPKIPGELTEQKPEFIYKVIETLLPQARYSPTNNKGTRLLELFGRKNYKRRGWTSIVQKSD
      >gi|Pisp1000005866|ref|jgi|PirE2_1|53885|estExt_Genewise1Plus.C_1020015
      MDPPWQLATHAPTRGVAIAYQQLPDQFIEELPIEKLQKNGFIFIWVINNKYVKAFELMKKWGYTFVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGDDPVGCKHKISSDVIYSVRRGQSQKPEELYEMIEELIPNGKYLEIFGRKNNLRDYWVTIGNEL
      >gi|Pbla1000012755|ref|jgi|Phybl1|76718|estExt_fgeneshPB_pg.C_40189
      MSDRPRRARKQKPTVASNVHYVGYVEDAESVEAIMKKFEELERIQKEFSTMDVKEPEVDIVEEMEVESQPLTEEQLQEVFKRTSAFTVKTATMDSRAADDMDALELWQVENKDGNTDEIYEEEDYIHVDDDFWDLEFGELPRPKRARKGGPVVERVARRGVDRESILAKYKVMQVQVQDRNGNYLLLHRFQNSTIDPSLPTYVKIPGKPIPRSWAHSILCKSKSQLPKPPPIASRLHTVNNILSTDLSRYGKSFSAVYMDPPLLLPGESPVPGKIHIDDLAKLNVSSVVSAGFLFIWLEKEWLQRIVSMASQWGFKYVENFCWIKKNINNQIHKSPYKYFNKSKLSLLIFRKEGDIELRHQRNPDCVFDFVKPMLPDEISEKKPEFMYNVIETLLPNAVYHPEKNPEGNGLLELWAKRGQRRTGWTTLVEQPIKEGENMERHRDIEAQRHGEE
      >gi|Pbla1000007785|ref|jgi|Phybl1|66651|fgeneshPB_pg.16__43
      MSHNPQDSGDWQSEFLTFSLYRVSHSKMRKYLDIMDHILESDSTNPKSHVNFPLLLKPRVGKMSSRESTPSSILMDRDDFDETTVSDGTLKSLLKQESELHLQIDALQIEIATLEEKLGKQEKGDELDEQDLEEFEAPEWCVPIKANVMNFDWDSLAAEVQFDVIVTDPPWQLATHAPTRGVAIAYQQLPDICIEDIPVPKLQKNGFIFIWVINNKYAKAFELMEKWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGEDPPGCRHSVGSDVIFSERRGQSQKPEELYELIEELVPDGKYLEIFGRKNNLRDYWVTIGNEL
      >gi|Pbla1000005949|ref|jgi|Phybl1|63089|fgeneshPB_pg.6__310
      MHRESSPSSSKPGRSSSSSQSPLPFRLSLSKKPKKLGSESTAISTSSKQNSPDVSESSLNSTIKAKDEDDEDDIQFDSITPRINTLDTDGESMDTSENSSEDEYIEDSAKKNISNRIKPDISSAKPRPTVRDTNSESKSAQSTDLQPLFLGEDTFSEASVKEWQTPRIKAWESRRSTPETFYFRFVAPGEGQSNGKWSKEEHKCFMDRYEEWIASGRKMGHSWGLFSIKIPHRVGYQCMNYYRRLVRRGEIKDNAYDTTNGVLKHVGRERASVTISSTELGPEWETEHVKNIEKNVNNWIKEFHNSTGRKPSGKSKTEKLVVRRSVPIGDLIKAQPARKRKRVSEINPDEEDFMEMQEDEPERMNVSTHPIDWEEGWKERLENYKDFMKPFLDTETRENYWHAKQRWRQGLVTTEMLLAKKPIQRLVPQETTEPIIKPLANRMQASLSRFFAGVKKLKVDTPEELISNVRIPNDLFSGVRHILPLKMAIEKKTGRTKTIYYEVDDIMDSIKTIDEVLENVPEHLSHQSPLEGVLVDPPWEFYVNDGRNDGRCTWNLKNMAMLMDKILDKMTAGLIFIWTHKLIQADVVKLMSTLGCKYVENLVWFKKSVNNVQLDQPSPYISSAKEILLMFKKGEGFELRHQRSADVIIEFESPREEWIHDEYTEPKPNAVYEMIETLLPKAAYDETLGRGRFLELWSKRVSPKREGWISFHQKKYPLVMTADPQIQDTEMCIKDEFSNMSIKEYIKNEDRIMDEDMLEG
      >gi|Pbla1000000301|ref|jgi|Phybl1|7247|gw1.70.18.1
      EGFDCIVMDPPWPNKSVHRSSHYETQDIYDFFKIPLPSLLSEEHPSLVAVWVTNRPKYRRFVIDKLFKAWGVTWVTDWYWLKLTTKGEPVMPLDSPHRKPYEQLIIGRRIPESTGEIPSNINIPPRILASVPSNRHSRKPPLDSILAPYLPNKPKCLELFARCLTPGWTSWGNECLKFQHQHYF
      >gi|Mver1000006359|ref|MVEG_06347T0 | MVEG_06347 | Mortierella verticillata NRRL 6337 hypothetical protein (466 aa)
      MELAAGRNDTWLLVPQRTPVPGWCVRPGTFRVTHPYDRTLDKSTGANAPSNRALKKRKVGAQALLEGNDVNKAEQGIVAWIKSEWLGLLDQEDVTLLFQSMAPDYFYGSCVYAQADEDVALDFVQLQPMLKMLTSGFHNTGASAEVGEDYEPFGMLQLSPSREQDKLLTLDLGDIYETLVTNNSSDAMVVSMASDGSPLYLIPPRSGFVISDFGQIHRLKDIAQKHSGFDMIVMDPPWQNASVDRMSHYGTMDLYDLFKIPIPHLLSKDGVVAVWITNRAKVQKVVVEKLFPAWGLTWVAHWFWLKVTTHGEPVLSLECGHRKPYEGILIGTRIPSNNTDMSTTTADLPNVDKECSSPHSVKKKLLVSVPSQHSRKPSIAQLLEKEFLASSEGNSNPDKEPRRLELFARNLEEGFVSWGNEPIRYQYCGRGHANGKLDIQDGLDIQDGLDVQDGFLVPAPRPIVD
      >gi|Mver1000003113|ref|MVEG_03108T0 | MVEG_03108 | Mortierella verticillata NRRL 6337 mRNA (2'-O-methyladenosine-N6-)-methyltransferase (309 aa)
      MTLPDEVVMTDSGNDSESGSFQDGNNSSTSSASNNTTRRTVLLGNSSLGKNDSPDLDDNTASGRLTKLLRRERELIETLEALARDIEQLEKKPEEEGEGGKDEDEEGDDLEEFEAPEWCVPIKANVMTYDWDSLAAECQFDVILMDPPWQLATHAPTRGVAIAYQQLPDICIEELPVPKLSSNGFIFIWVINNKYARAFDLMRKWGYSYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGVDPPNCRHSIGSDVIFSERRGQSQKPEELYELIEELVPNGRYLEIFGRKNNLRDYWVTVGNEL
      >gi|Mver1000002542|ref|MVEG_02535T0 | MVEG_02535 | Mortierella verticillata NRRL 6337 hypothetical protein (471 aa)
      MSTRSRRKPRKAQTINNAHYVGYVEENESIEAIMKKFEELARMEEEFKKSANTNTNTDTNTNTNAIANTNGEGGSNGTINDNSSPLTDGELILGVERVQAHDEHQGFTDEQLQEVFKRTSGFTVRSMMRDTPEDIYEEDVWQANIADEDYLYDFEEEDDYLMAMDDHFWDEEVGGSRKRGRKEKEPRVPREPRVKGERKRHVSDRESILNRYKVMQVRLQDRNGNFFTVKKKISTMDPSLPTYVRIPPVPIPRSWIHPIKPFVDTTVEIPGSRYEETNNVLDMDLKRFGTDYQVIYMDPPLLRAGEEPGPNKITMEQLATLDIGSILPKGFLFVWIEKEFLPDIVRLAERWEFRYVENFCWIKRNVNNLIAREPSPYFNSSKLSCLIFRKEGDVELRHQRSPDCVFDFPKPVNAATLSEEKPKFMYELIETLLPQAVYSESNPNGDKMMELWARPGTRRKGWTSICQTKV
      >gi|Mver1000001135|ref|MVEG_01134T0 | MVEG_01134 | Mortierella verticillata NRRL 6337 hypothetical protein (786 aa)
      MSPTTPQTDDLTSSSSTLPASSSLTTTTSSTANPLQTLNSSLKRIRLNDSRNDSSTPGSDDSLEREWGGIDPAELDEAFDAIQEELPSPKKPRREIKDEDEDEDEDFVVSSKLSVKAAQAPPSSNFGNISNSTLSATKFTPMIQRPSLPAKSSKGKLAKPTRSNNGMEDVFTHEQVRTWSEVRIKTWEHRRTNTEGFYYRFVDPTEGQQNGAWGKKSQQEFLERLEEWKSRGIRIGTSWGVFSMGVSHKAGYQCSSYYRKLLENKTLTDPAYAWENGKLVMVQKSSGGEMAISGLSTRWETEEVKEIEANITSWIKEYHSNGASRPAAKVKRVETSISSSSGSGSTATGSKIMNGMFRPTTEAPRVKATTAAAVSKALSLRPSKQPIDESEMIPVVDIDDKLAEYSSFMKKSTSTSSGPNKIRQTISLTVPGTVHTPTTTPARGVVSIEVRKSPLATHTVKGQTGLSLFWKNIRPVRVECPDTLISKYIIPKDVEQRPTWAHRVQAVTDPEDDESVVAGSIHKEVHTMVDFDWASLSENFDDPVTDIQGIMADPPWNFIVEDGRNDGACRLTTKAFGDIMEKALELMPSGIVCVWTHKAILPEVVSIMHGLGCRYVENLVWYKIALNNTNLDRASPYFRTSKEILLMFKKGDGFDIRHQRTADVIMDFEKPTSAWIEEDYTEPKPTAVFEMMEILLPDARHMPEAGRGRLLEIWAKKDPEYRRPGWFSIHELKGETPDGVVSKAPLQVIEIDEDSDAKSDDLDLLLRDNPEFEASHRDIDMEMDA
      >gi|Mcir1000010337|ref|jgi|Mucci2|157306|fgenesh1_kg.09_#_17_#_987_1_CCIA_CCIB_EXTA
      MSERPKRKRRQAASRSARVSNAHYVGYVEDNESVEAIMKKFEELERIQQEFTSSPQPVNDTLQLEQDDRDQDTLIEDSLTQEQLEEVFRRTSAFTVKSASVDPDFIVDMDALDLLQAEYRHNNTAEFIEEDDYYYVGDDFDLDGQVDDDGKDEQYRDRYYRRSTPSKKKRLDRQSVLNKHKMLAAQTKDETGRTVPVKRQACDIDPSLPTYVRIPPRPIKASWAHSIKPLSSNATPPTNAAYHEVHRLVDQDLTQYGSQFQAIYMDPPLLMAGEPPTPGKISIEDLAKLNVPDVMDTGFLFIWLEKEWLHRIVKIATQWGFRYVENYCWIKKNINNTIYTGESNYFCKSKLNLLIFKREQSKIEIRHQRNADCLFDFVKPMKAGQFTEPKPAHVYHVIETLLPKSVPQSKLLELWSTRNYRRQGWTTVAEVVCND
      >gi|Mcir1000003212|ref|jgi|Mucci2|138359|e_gw1.02.942.1
      MYVFCPCVTPSHQLKTIQRLAETTQFDVILADPPWQLATNTPTRGVAIAYQQLPDVCIEELPIPKLQQNGFLFMWVINNKYAKAFELMEKWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGQDPPNCKHSLSSDVIYSERRGQSQKPEELYQMIEALVPNGRYLEIFGRKNNLRDYWVTIGNEL
      >gi|Mcir1000000489|ref|jgi|Mucci2|31991|Mucci1.e_gw1.1.979.1
      GADLIVMDPPWPNKSVQRSSHYETQDIYDLYSIPMQSMMCSKDTIVAIWVTNKPKFRNFIINKLFPSWKLQCVSEWVWLKMTTQGECIFPIDSEHKKPYEQLIIGRPIASNNMSNRISMPESHTIVSVPSTRHSRKPPLHDVLSAYLSDKKQDPVCVEMFSRCLYPGWISWGNECLKFQHLSFFDKEDAIKEPE
      >gi|Lhya1000007505|ref|jgi|Lichy1|132223|e_gw1.100.12.1
      MTAGLVFVWTHKLLQADVVRLMDELDCRYVENLVWFKKYVNNIPVDQPSPYISSSKEILLIFRKVSSNIIGDGFELRHQRSPDVIIDFEQPASQWIQHEFTEPKPAAVYDMIETLLPKAGYDEELGRGRLLDLWTKRDAPRRDGWIAFHQTKPSDAKASTMTAQDDTTHEINGIHHKAMNETNSKSIEDQDDIMDIL
      >gi|Lhya1000007007|ref|jgi|Lichy1|163792|estExt_Genewise1Plus.C_880061
      MSDRTTRSKRSKRNNAVSNVHYVGYVEDEESVEAIMKKFEELERIQSEIAGSSTPETPEITDNDVNMDSPDVNTAAARPLTEEQLEEVFKRTSAFTVKSAMIDTNDDDVDALELWQIEFQDGNTEEIFEEDDYMHVDDDFWDQEFGDAPQRPKRGRRAAIPRERPASTGRRGLDRDSIIAKYRIMQVQVQDQNGNFFMVKKRVSALDPGLPTYVKIPGAPIPRSWVHQIMEVPRIRKFSGSHLHEVDDILSLDLKQYGSDFKAIYMDPPLLLPGEEPTPGKIHVDDLLRLNIPEVIPCGFLFIWLEKEWLRRIVEIGEQWGFKYVENFCWIKKNINNQIHTSPGTYFKKSKLTLLVFRKEGDVELRHQRNPDCVFDFVKPMLPGECTESKPSFIYHVIETMLPGANYHPETNPDGKGLLQLWAKKGQQRTGWTTIIEKHN
      >gi|Lhya1000004692|ref|jgi|Lichy1|206348|estExt_Genemark1.C_470043
      MTILYSTHNVDVIDCQSAFSKELVLHSQALQLRRGEFNVHEPYYRSSTTALAGQKRKRENTADIDTENWHQEHARPFLIKCIDELPNNVFSHLDNTEDTSTATAKDDSGIDFTSLISLAQASSRFTGPMDHLELTEENHVITMEPLDVFYRILSNPSPMNSMQITIEDQNYIMPPGASFYMSDMSTGMKDLKAHARSIGFYDFVVMDPPWPNKSVHRSSHYETQDIYDLFKIPMKQLIATSGLLAVWVTNKPKFRRFVINKLFPTWNIKCVGVWYWLKVTTHGEPVVPLDSPHRKPYEQLVLGIAMQQQQQEDPGTREEIPDKHAIISIPSRRHSRKPPLQDVMAKYLPKDPKCLELFARCLTPHWTSWGNECLKFQHTQYYETVEEEETIKDRVES
      >gi|Lhya1000001414|ref|jgi|Lichy1|141189|estExt_Genewise1.C_100084
      MSREGTPSSDTLIDIDNFDENEVTDLGLKNLLKREIELQLLIDALQTEIAQLEEGINGKDKGDEEEELDDQDLEEYEAPEWCVPIKANVMNFDWDSLAKEVQFDVIVADPPWQLATHAPTRGVAIAYQQLPDICIEEIPIPKLQKNGFIFIWVINNKYAKAFELMERWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGEDPPNCRHSVGSDVIFSERRGQSQKPEELYELIEELVPDGKYLEIFGRKNNLRDYWVTIGNEL
      >gi|Lgig1000006160|ref|jgi|Lotgi1|134083|e_gw1.88.185.1
      MSDTWSDIQAVKTKQSSLRAKLAQRKKEREGLAKELNLGVSTSSSTVSTSVVDPEIEKKLPYVLTDINLEIPAESTVIKDFLTKSLERAVNQTVVDELLEKFAAQQLIRFESLNIEPFLTISASYTIITKIFFLYFQLSALTSDDKGRKRKREDGDEEDKRKDDKNSKQSKKSADILESLLTSQSAKEKENKKVNEEILQILSKPTAKEQYLVERFKSRGGVQLKEFCQVGTREECRKMNNTTEPCSKLHFRKIIHKHTDESLGDCSFLNTCFHMESCKYVHYEIDYPDKKPEPVKDIVKYKVPDVDSDVYMFPPQWIQCDLRIFDMTTIGKCAIVMADPPWDIHMELPYGTMSDDEMRRLDVPGLQDEGFIFLWVTGRAMELGRECLELWGYKRIDELIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNPKGANKGLDCDILVAEVRATSHKPDEIYGIIERLSPGTRKVELFGRPHNVQPNWITLGNQVDGVRLKDPDVVKLFKERYPDGNCMEPPKPR
      >gi|Lgig1000004573|ref|jgi|Lotgi1|121106|e_gw1.35.28.1
      VKIFGEKYVIPPYCKFLLSDWKQLLQLHTVDANKYDLIVIDPPWKNKSVKRKKSYDTLWEDTLLDLPVTKLVNPGCLIVIWVTNKTRLHAFIENTLVEKWQLQQCVQWHWLKITRGGELICDINSNKKPFETLFIGRYDNSLTNQYSTVPNHRTIISIPCSLHSKKPSLAEILKPYLPEKPECLELFARNLQSNWTSWGNEVLKHQHLEFFDVT
      >gi|Hrob1000012694|ref|jgi|Helro1|153237
      CHRIRTQIYYTSRKPDKIYGIIERLSTGSRKIELFGRLHNVRPNWVTITHQLPNIMIVDPKMKEAFSNSFPNGN
      >gi|Hrob1000012559|ref|jgi|Helro1|79167
      MFIYFQKESSKEDQLIESLLSTQSAKERETNRLTEEILILLAKPTAKELLLSERFRSQGGKQVKEFCGYGTREECQKNNIICDRLHFKKIIHKHTDESLGDCSFLNTCFHMDTCKYVHYQIDYLAVNNRVHIVNDQQIISTSNNNVEELALSKIDDATTTLFPPQWVQCDLRQFDMSVLGKFAIVMADPPWDIHMELPYGTMSDDEMRKLNVPVLQDDGYIFLWVTGRAMELGRECLTLWGYERVDELIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNLPVNRGLDCDVIVSEVRATSHKPDEIYGIIERLSPGTRKIELFGRPHNVQPNWVTLGNQVDGVKLLDPEMVKAFRKVYPDGNCIK
      >gi|Hrob1000012135|ref|jgi|Helro1|121421
      CNERHFNKIIHSNTDESLGDCQYQNMCFNMNFCKYIHYEKDEKDEVTFKMPSREKSYPPQWIQCDLRYFDFKILGKFSIVMADPPWDIHMDLPYGTMPDDVMKKLDVSSLQDDGYIFLWVTGRAMELGRECLTLWGYEKVDELIWVKTNQLQKVIRTGRTGHWLNHSKEHCLVGVKGNLTVNKGIDCDVIVSEIRDTSRKPDEIYGIIERFSPGTRKIELFGRLHNVRPNWLTIGNQLPSVMLIDSKMKEAFYNTYPNGIC
      >gi|Hrob1000005904|ref|jgi|Helro1|69400
      YIIPPRSSFLLSDFSEIDLLAKVETKFDFVVVDPPWQNKSVKRKKRYYETFSMLNLVKIPMPEWCNENCLVAVWVTNKIKYQDYVREELFPSWNLQFVATWYWLKVTKKLTPVYPLLLSTSSTKHPKKPYELLMLGRFRLEKYHLMSNSNIDCKIQDRMVIISVPSAIHSHKPYLREVFKDHLPEDAKCLELFARNLQSGWTSWGNEVRVCVWIYQQYIFISLSLLGMQSTLACSQDLINII
      >gi|Hrob1000005423|ref|jgi|Helro1|68987
      CYERHFKKIIHSNTDESLGDCQYLNMCFNMNFCKYVHYEKDEKDEETFKMPSREKSFPPQWIQCDLRYFDFKILGKFSIVMADPPWDIHMDLPYGTMPDDVMKKLDVSSLQDDGYIFLWVTGRAMELGRECLTLWGYEKVDELIWVKTNQLQKVIRTCRTGHWLNHSKEHCLVGVKGNLTVNKGIDCDVIVSEIRDTSRKHDEIYGIIERFSPGTRKTELFGRLHNVRPNWLTIGNQLSSVMLIDSKMKEAFYNTYPNGICPK
      >gi|Hrob1000005324|ref|jgi|Helro1|164465
      MYDVAWPRHQPIALNYWNWDEIEKLELEHVAALRSIIWLWCGGSNGLEASRKDGSEEKPVEIFHIIERFCFGRRRLHLFGRDTSVRPGWLTIGPKLTSTNYDRESYNANFVKNPVVTRLVALKRYQRLRPKSPPRKIGHQM
      >gi|Crev1000002253|ref|jgi|Coere1|80436|fgenesh1_kg.9_#_19_#_isotig04348
      MWQDEIYYEDDDAAMAAYHEATRASSNDDDDDDFAPEPPTGHRGRTKSQRVSVSKKKTESARRATTNSHKPRIADRFMSSKLDTTLPSSSGEDQSNIGDEDIDIESLGDSMGVRDSCASAVPTPDIGQLKVCSSVGGSPSPPSPQQLETASATAMDTDEGSNDPSTALLALRQREQTIRARMEQLESEIAELEKKCGVEDSSDKKGRSEQLDLSEFRAPEWSVPIRANVMNFEWEKLAASCQFDVILMDPPWQLASQAPTRGVAIAYQQLPDVCIESLPIQLLQTNGFIFIWVINNKYTKAFQLMKQWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGADPQTLQRSVASDVIFSERRGQSQKPEEMYEIIEQLIPGGNYLEIFGRKNNLRDYWVTIGNEL
      >gi|Crev1000000518|ref|jgi|Coere1|37216|e_gw1.2.97.1
      ISNQYYVGYVEDDESVDAIMKKFEELERIEKEFSAKKKSAADDSSTEHKSKEQPIESGMSLELLEEVFKRTSAFTVRGAMMDDFDLEVMDDIELWEAEAADNVLAEWEEEEDYVALHLDDDALDDEFGVVLPRRRYQRHDTAKKSGPKLSSRDQIIQRYKYMQVRVQDRHGHTFFVSRKVNAVDPSLPTYVRIPPNPISRSWVKHIRPLSYETESTPTPDWAQCIETDNTLEFDYSKFDRTFQCVYMDPPLLLPNETSKPGFFAIAKLATIPVNKLLVPGAFLFTWCEKELIPDMCEIAEKTWGLKYVENFCWVRQQTNNQIARLESPFFCRSKVTLLIFRGEGNLEMRHQRSPDSVFDFTKPRLPGDLNDRKPEFAYTVIETLLPRAICTKEEPDPNRLLQLWMPADSHRKNWTTIVQNTQ
      >gi|Cmer1000000996|ref|gnl|CMER|CMH026C similar to (N6-adenosine)-methyltransferase
      MKGERKRPQRHDRDADRELALRLHNEEKQELRGLRRQTRTSADKAVVEDATVMGRPCETVTGLDSFLSSCLNETGRHSRPRASRKRTVENAFEHFSVAKYTEKHAKLEPRKQGTSALPIDEIQTVPIKDLDQFPSCVGVAENQKTSSSLRRQIREQIRRLNDGYYINCDLRYFNLAYLRECVGNFDVVLIDPPWRIAGGQRASTPNGPMFTNNHWAVNYNTLSNEEILDLDIGCLSNSGLCFLWVVSSQLPTGMACLSRWGYEYIDKITWIKKRQGKLHVSHGYHFMHSSELCLIGVKRPCEFIGKVSNDLIFAEVREKSRKPDELYHVVETMLPGTAKIELFARNHNIRRGWLSLGNELGEQFCDWFNDFECDMCGARIHFGERRYKAKNRPNSDFCRDCYLEAISAGVTTETEWFELANDAADPVYHEYYECNHCNIYPLWGVRFHRDPDMDLCEQCYDELVANDAETEDASNNARTSSPDDWTAIESPICGGSLPVHRNIRCSSCLQCPIIGYRFSCTCCENLSLCQKCFFQQKCPKGHNADHDIVIIVDSEAALNALVRCDGCGIRPILGTRYRCNTCYAFDLCETCYKKVEQGEYDLQQVSQKKLAEHNRSHAFSAIPVSA
      >gi|Cmer1000000605|ref|gnl|CMER|CME116C similar to (N6-adenosine)-methyltransferase
      MNVLQTHLLQTVKAHLITRSRKWNTIEVNEDDTDSGWRQLDASCNRTHARHGAKTYKRHCLSGAAPRLVAMVESAPLGLLESTEELENRLLQSYVSLIKNLTQQNASSNPEDAHGGSENPAGAAAAPGATAAAEAAVARHEHPGGNETAKADALTDSVFSDLCYVPEDFMVPPHCIPVRADVRFADWDQIAAAANGNYDVILMDPPWQLATANPTRGVALGYNQLSDESILAIPLEKLQRCGLLLIWVINAKYRVALQMFERWGYRLVDEIVWVKLTVNRRLAKNHGFYLQHAKETCLVGVKGNDLSALSTAPGMPRPDVILSERRGQSQKPDELYEWIEALVPNGKYIEIFARKNNLRNFWVSIGNEVTGESFEQALPPELCKQLREELST
      >gi|Cmer1000000469|ref|gnl|CMER|CMD131C hypothetical protein
      MEALRITFWYLRYWNTIITDKGHQLCFGEKHRPRRLGIAQSLVQGIGAHCRTDLCHYFWRETVSIASVGISILHWVYLLFGACTGAAETLLCTDRVLAPGVDALGTMPRGTDATASDEREQEEQARATRPRRRAAARYGSQKRSVAVDPAYYLGYADDDETPEMIMRKFEALERLQAQKAANTQQSSGDANDSQTAKEEVTLTAAEQAELFRQTSFFSVDMLAGSTTQSLPGGVGESTDLIAVGALDDSSFERVLAEYGLDAVLFGAELSDSEPGSEYDDGEDLLWEEALESDRSARFPGRLTRSRPDRSESSRVALELWRARAALLMRLSAGGDALAAAQAAAMIPAALRARRRRLAAEGVLRETIPTEYPLPVSWARTIRPLREAVQEGQKLERNYECGRLLRFANVFQAAKTLHERIKGRHFRCVVLDPPCWNQPDFRVEQLRSLRIAEIVPFGFVFIWVPKTRIAETLRWFGEEMHQGGGGGFHFVESFCVVLKWTNNRVATLLSEPSDLLCLDQPVAVAEHAPSGVYACSHETCLVLRRPGPHIELAHQRNPDLCFDYVRLHQKRYQRRRPELFYHMIETMLPEVARDGDMLCVWADQSVCKRTCWVSVVEDQSLEAGESSS
      >gi|Chet1000009456|ref|jgi|CocheC5_1|33790|estExt_fgenesh1_pg.C_390006
      MFSTLPPPPPRASPTAQSAPHVPDPIIFQNADADITLVDIPASIVAAQGDRSDVLLSTAPLEEPIQLRQDYEPKTQKTRAQAAKVHHGDSTQQNEHDLQSLHLDDAGYKLLVEHALAQIRAHVSGPWCMRRQLMTQTSRSAQDGAMDLDSPSERNLELCMREWASRSQAKQDDMAFNLQQMMASLGAASEPADSAAAADKCILSYRHAPVIESNTQAADSVTQETQTVPWTCTFHNPNQHSLEATITDRTLQSASSAQDYRFAIPPRATLFLSDSTASDAFRWSFRRLTDEYSLARHFDLVLLDPPWPNRSAKRKGTYEQVGGMPYLKRMLLNMHLDMYLEHNALVGVWVTNKPSLKQHVLGPGGLFEAWNVGLMEEWIWIKTTTKGEPMFDIDSPMRKPYEMLLLGRAAPNSWSRMAHAPVVKRRVIAAVPDVHSRKPCLKSLLEPYLVDPTDYTALEVFSRYLVSGWTSWGNEVLKYNWDGYWVKAGSAEN
      >gi|Ccor1000008813|ref|jgi|Conco1|87489|estExt_fgenesh1_pg.C_3190005
      MSSRRSKRKSTTRGNYVSPNSSHYVGYVEDEESVEAIMKKFEELEKIQNQPPSIPQNTSTNNDDMLTGNEVDELSLLTSEQATISGNNNQALTESQLEELFKNTSVFTVESALKHNQTYFPLNPNDFDDDDLEYYGDDLFILNEDEEFEEYDDIDLDDSDWELEMGLVSRKRVKRSSASGPREKRQSSTQKREHYRNTHLRVLDDLGKSFVLVKKQLPQDPTQPSYVRIPQNPVPVSWAHRIITLKQANQLRGGKRLGIDFINSFDQLIGFSGIGSIECILMNPPLVSDNFMQTKEEYLAHREPITVSQLLALPIKKLLKSGFLMIFVPKPLISKVVQRISNEWKLRYVENVAWLQLTPNQEILKADLDESYIRTSKLTLLMFRTEGDIDIRHQRSPDCVFDFVQPDVDLSKKLPNHHREFPNFIYKMIETLLPNGKSGLDNGFKLIEMWGHPKSRRDGWLSIYHKNQ
      >gi|Ccor1000007140|ref|jgi|Conco1|109297|CE21212_12273
      MDPPWQLATHAPTRGVAIGYQQLPDLFIEQLPIPSLQQNGFIFIWVINNKYVKAFELMERWGYEYVDDICWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGKDPPNCRKGIGSDVIFSERRGQSQKPIELYELIEELVPNGKYLEIFGRKNNLRDYWVTIGNEL
      >gi|Ccor1000001300|ref|jgi|Conco1|3188|gm1.1428_g
      MEGLKSLQPLNLTNSQYLDQQPLNKFNHQVNKGVNFAQFDDLIKLIYKFNDECDIDRIELSDEEVNTLDIEHLFHHLISNSTDKVCKLSIANYTQTYAIPPNSKFIMSDLESFNDLLIYKQPASCIYDLILMDPPWFNHSIKRGNHYKGQDKYTLLNIEIPKLLHSRGILAIWITNQPSILKFATLKLFPKWKLKLVSTWYWVKITTQGDLVISLDGERERKPYEILLLATFESNQQFTEIPSNFIYSVPFQYHSRKPNLFPYLSELIDYKNGNDITKLELFGRYLTQGCTTWGNEVLKFQESYFLD
      >gi|Caps1000024491|ref|jgi|Capca1|192776|fgenesh1_pg.C_scaffold_975000002
      MADSQDTWKEIQAHKSRQLSLREKLAMRKKAREEVVAQVAEIIGEPAALGPPTAKAGKVESRAVEKELLIVLDEATLNLPVNLEALMVEMSKVQTNISPKLIENLLQKFSAQMLIRIVSAVSASLAGVPGQKAPRKRKHEGNESEEVGRKEAKKSHSEPSIEVGEVDVLTSLLLMQSAKERESKKLNEEIVQLLSQPTAKEQSLVEKFKSQGGAQVKEFCQFGTIGECQKINGVNQHCGKLHFKKIIHKHTDESLGDCSFLNTCFHMDTCKYVHYEIAYPEISPKAQPSTISKKTEGTILYPPQWVHCDLRNFDVSVLGKFSVIMADPPWDIHMELPYGTMQDNEMRNLQVPLLQDDGFIFLWVTGRAMELGRECLTLWGYDRVDELIWVKANQLQRIIRTGRTGHWLNHGKEHCLIGMKGNPTNINRGLDCDVIVAEVRATSHKPDEIYGIIERLAPGSRKLELFGRPHNVQPNWITLGNQLDGVKLIDPEVVEAFKKKYPNGYNIVKGKLPPPT
      >gi|Caps1000008038|ref|jgi|Capca1|218111|fgenesh1_pg.C_scaffold_355000005
      MKRMKTRKKSTQTPALSSKVPKAPILTTITVRTLLTPESALRITSGMLVRYMHCDLDTYDMRDLDSKFDVILIEPPLEEYQRYLGVSREKFWSWQDIENLQIESIAAQRSFIWIWCGFGEGLDAARRCLRKWGFRRCEDICWIKTNIKNPGHNKNLGPKAIFQRTKEHCLMGIKGTVRRSTDGDFIHANVDIDLIIEEEFPPGSDEKPVEIFHIIEHFCLGRRRLHVFGRDLSIRPGWMTIGPDVTNTNYNAKTYNAYFNKTPDGFLTGCTEEIERLRPKSPPAKMKDGKGSGRGGRGGGGNKGGGGARGGGGRGNFSGRGGFQGRGRGGQRSGPPR
      >gi|Caps1000003296|ref|jgi|Capca1|94532|e_gw1.295.23.1
      MPGRTNIKLADLVGVFAVNSADTSSIIPFSGINYIVPPRCSFLLSDVSNPHLLPPNVQYDLIVMDPPWENKSVKRKKNYQMVRDFELEDIPIGQLATDGCLVVTWVTNKQQQQQLVKETLFPKWGITPLATWYWLKVTTEGEPVYPMRSQHSKKPYEALILGCKSLSPPLKIPDHKVILSIPSCIHSHKPPLHDILQDFLPSSTPRCLEIFARSLHPRWTSWGNEVSSDNISKIE
      >gi|Bnat1000020351|ref|jgi|Bigna1|81922|fgenesh1_pg.85_#_83
      MRHNKTGNDEVYRMCGHQFVIPRGAKFLLADITGLGALLKAKPNPGFRLIVLDPPWPSMSVSRSHKYKTLNPRDLINLPIRKLLYYDDDYMEEEEEEEDVTKGKKKERKTAEFSPSWVMIWVTNDPDLHRLVRHKMLKKWRCE
      >gi|Bden1000006054|ref|BDEG_06038 | BDET_06055 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (183 aa)
      MGELASMAVSSRFADPTTLSTVQSITLSANSHSIGLADLFNIQISNRDGSLPVVLDICGHKYIIPEHSWFVMSDFSFLSTLCKCLLDQPLRFSLVLMDPPWENKSVKRAKQYASMDCYRLSNIPLGQLVLPNGIVAVWVTNKSKVREMVLNRLFPAWGVEFVTEWVWLKVTHHGDLVIDLER
      >gi|Bcir1000011661|ref|jgi|Bacci1|265856|fgenesh1_pg.4_#_30
      MSVSIIDTEKNFSTELYRLKPGDFDIKEPYFRPSKSAPTESNTKKRRKRNEPKQADIDTQKRHEELKPFLTACLAMLKWETKEILLHNKEEEDTIKEAIDFPTIQAMVQSAHLKFDQQDEEEEAPYHFETDCCKELDIFQIFNRVYINPTKKITLLEFNKDATYLLPPRSTFLMGSMQDSLKQLGSYVHSIGGADLIVMDPPWPNKSVYRSSQYGCQDIYDLFSIPIPDMLTPDSVVAIWVTNKPKFRHFILDKLLPAWKLKCVAEWIWLKVTVKGECVFPLDSSHKKPYEQLIIAKPVESESTIPKQHMMVSIPSKRHSRKPPLHDLLTKYVKKDPVCVELFARCLLPGWISWGNECLKFQQLDYFEKG
      >gi|Bcir1000007777|ref|jgi|Bacci1|290217|fgenesh1_pm.3_#_58
      MSSGIIFVWVHKLIQGDVIRLMYELDCRYVENLVWFKKSCNNNVLDRPSPYIASTKEILLLFKKGDSIDLRHQRSPDVVIDFEIAPEHWINEEYTEPKPAMVHNMIQTMLPGGIYNETLKRGKLLELWAKKSATRREGWISFHQVKHSLQMETE
      >gi|Bcir1000006779|ref|jgi|Bacci1|219299|estExt_Genewise1.C_720022
      MSDRPKRKRKERARVSNAHYVGYVEDEESVEAIMKKFEELERIQQEFSTKVDIKEDDVVEEEMEEEIAVDEVENKFTQEQLEEVFKRTSSFTVKSATIDTSFVDDLDALDLWQVEYHDNDTNEFYEEDEYTYMDDFFWDEEFGGDNNKKRTGGRQPRIPKEPRGTRKLDRESIIAKYRIMQVQVQDKHGNYFTVKKRVSNVDPSLPTYVKIPARPIPMSWAHKIKPIIRKQHISVPGSRYEVVDSILSTDLSSFGNSFSAIYMDPPLLLPGEKPTRGKISIDDFANLKISNIMKAGFLFIWLEKEWLRQIVAIAAKWGFKYVENFCWIKKDLNNQIHKSKYTYFNKSKLTLLIFRKEGDIELRHQRNPDCVFDYIRPMLPDEVTESKPPFMYHVIETLLPNAIYHPEKNPNNDKLLELWSKKDQRRQGWTTLCE
      >gi|Bcir1000003454|ref|jgi|Bacci1|330834|estExt_fgenesh1_pg.C_270053
      MSTVSSRESSPFSNVDDDVLDSSLKELLEEEKELQLEIEAVRVEISKLEERLGVSANDNLGDKELEEFEAPQWCIPIKANVMTYDWEALAAQEQFDVILTDPPWQLATHAPTRGVAIAYQQLPDICIEDLPIPKLSKNGFLFIWVINNKYAKAFELMEKWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGDDPPNCRHSIASDIIFSERRGQSQKPEELYELIEELVPNGKYLEIFGRKNNLRDYWVTIGNEL
      >gi|Aque1000027926|ref|Aqu1.228194
      MVREYQAEVKANGISDRYDSSRTWRSKTKHSARHKSFMKKVRNKRRHRQERIQRRTEEYEDEKVLLPSEIDPKLEQRVLYALLNPTLEIPTQGTKLLEMSGTDFENLPNLLQKLYGQDLICLQSDGYTITSINLESISRLIEMKNLSLRQSNIPRFKTDPEEIEYLLNAPSVRDKETKRVGSEIQELLTAKSYQEQLIKQKFQSAGGSQLREFCPQKTREDCRRVSRSGRACPRLHFRRIIQSHTDESLGDCSFLNTCFHMESCKFVHYEIDQTQETESRKGGIKPRPSLQSLGSKLVPPQWLNCDLRNFDTSVLGKFAVVMADPPWDIHMELPYGTMSDDEMRQLDIPSLQDDGFIFLWVTGRAMELGRECLTLWGYERIDELVWVKTNQLQRLIRTGRTGHWINHGKEHCLVGAKGNLQGVNRGIDTDVIVAEVRATSRKPDEIYGVIERLSPGTRKIELFGRQHNCQPNWLTLGNQLEGDNLHDPELRERFYSRYPEKLHAYPAVT
      >gi|Aque1000012323|ref|Aqu1.212591
      MAGVGPSRGRSGTPSAIPHLQSSSDRGDETGSSCISSLRQCRKLLRSLKKRSVLKRKIASRKSGMSHLRYILKLSSQREFEMNDNHKFLINPKRHRVTQSKPKIKESAATKENGEDQTFKGSDVFLKGTQSANPHNDYSQHFVDTGQRPQNFIRDTGMNQRFEEYPKLKELIRLKDKQIKDGAIPPVYYKVDLSSFDLTSLDAKFDVILIDPPLEEYQRRTTGITYPWQPWDFEEIMNLKIEDVSAPRSFVFLWCGSCEGLDLGRECLKKWGFRRCEDICWVKTNMNDPGNTTHLEQKSIFQHTKEHCLMGIKGTVRRNQDGHFIHANIDLDIIISEEPEMGNNDKPEEIFHIIEHFCLGRKRLHLFGNDATVRPGWLTLGPNLSSSNYHKESYLANFTDESGGPLLKFDETIETLRPKTPPPKGRGIGRGGLNVNPGMGGVGRGRGSYT
      >gi|Aque1000011545|ref|Aqu1.211813
      MRLVSLDSSTSFFFDDRTAANRRGRRGPHIPSIEDARKSVAYHRSISRYLRFSNRSLGPNPKRAKPKQSKGRMQMPGRAKSGDDSIFPPMPSKRYEVIYADPPWDYKGQLQHCGAGGGDSGGAMRHYPTVTLADLKTLPVDRIAADDSLLFLWATSPHLDQAIDLGKAWGFAWATVAFVWDKCKTNPGYYTLSQCELCLAFKKGKIPSPRGARNIRQLVSEPRQGHSRKPDEVRRRIESMFPDQSRIELFAREPAAGWKAWGLEAQWDAKRPRLTRPRIDR
      >gi|Adig1000021966|ref|adi_v1.16957
      MSLWDIKSLPVPQLVAPGALVGVWVTNKQKYLRFTRSELFPHWSVELVAEWFWAKVTRRGELVTELDSPHKKPYEPLLIGRFQPMMKLLRSESLNSGKDLKDIDSCPGMDLLLNYQKRRKISLSHNFENNGTISLDGTHRTVSSSTNESEMTNRESDVRCTECTKGKTDTSNALLEVRTAQSSNLHGLDGKELVDIVTGDVPANSRSKHVDKQSTRNQTKGKIVGQSAREISETGLRLNLEEGDRDNKPSKSQSEILQTGFDGCINLPYHQVICSVPCRIHSRKPPLNDILEKYVPPQPLCLEMFARNLTPNWTSWGNE
      >gi|Adig1000009633|ref|adi_v1.06363
      VLISEEKKEGKHTSQPDKEDADKAKDYEEEDVEEVVYKDSSHFLKILKLEIEKVAAQRSFLFLWCGSHEGLNEGRKPPLGSTEKPTEIFHIMEHFCLGRRRLHLFAGDDTLRPGWLSVGPSLSSSNFSADTYNSYFSDAPEQLLVGSTQEIETLRPKSPTGKHKGDGGGRGGAIAPSRGGSGPRARGGVRGGMMRGAGFQGKGGRGFISFSVR
      >gi|Adig1000003645|ref|adi_v1.21218
      MSLWDIKSLPVPHLVAPGALVGVWVTNKQKYLRFTKSELFPHWSVELVAEWFWAKVTRRGELVTELDSPHKKPYEPLLIGRFQPMMKLLKSESLNSGKDLKNIYSCPGMDSHLNSDKRRKISLSHNFENSGTLSLGGAHITVSSSTNESTMTNCASEVRCTECTKGKADRSNALLGVKTPKSSNLYGLDCEELVDIVRGDVPLKSTSKHVDKQRTTNQIQGKIVGLSAREISETGLRLNLEEGDRDNKPSESQSEILQTVFDGCIHLPYHQVICSVPCRIHSRKPPLNNILEKYVPPQSLCLEMFARNLTPNWTSWGNE
      >gi|Adig1000001851|ref|adi_v1.03360
      MDTCKYVHYEVDYSDVEMNKEDKGKDKKKDEKTSLANTVEGDNRVTLYPPQWIQCDMRYFDMTVLGKFSVVMADPPWDIHMELPYGTMSDDEMRKLDVPSLQDEGYIFLWVTGRAMELGRECLKLWGYERCDELIWVKTNQLQRLIRTGRTGHWLNHGKEHCLVGVKGDCKNFNRGLDCDVLVAEVRQTSHKPDEVYGVIERLAPGTRKIELFGRQHNVQPNWITLGNQLDGVRLLDPEVTARFKKRYPNGIVLNTSSTKI
      >gi|Abis1000008599|ref|jgi|Agabi_varbisH97_2|121782|Genemark.8250_g
      MISTAELLCEANDVLAAHASLLDHVRASQHQFRRHLHTLQSPPEELLKLPTVPSSPLLTPDDASPSPSPPLMQEDQERSDLPAPKKARLARYRNYVPEEETIRNDYSQRYVDGGEWPQDWVIGAEPEHRFEEYPKQQRLLTLKKNSVNSHATPPYYLPYHELSSLHPNKFDIILLDPPFSSSFSWEQLLELPIPNLAADPSFVFLWVGSGAGEGLERGREVLAKWGYRRCEDVVWVKTNKTSNQGPGTDPPTTSLFTRTKQHCLIGIRGTVRRSTDSWFVHCNVDTDVIIWEGDPADPTRKPPEMYTLIENFCLGIRRLEIFGRATSLRRGWVTVLTRGNDRQLAVSEDGSVHVEGEEGGLATTWRQETWDEQVKSLLTNGRAVVPMTPEIDALRPKSPVRHNQNISGGGGSAMSGGVAVGIPGGSNNNSMNTNSGGARFNSGNRPNAFINHGPAAMLPPNQMVNPNQMMVQQQNMMGMGVGVGGVNQFGMGMGVPVPMEEMMSAGGWNHHHMMNAGPMGGMGPPGIPGAPGHVGMNASNVNMGMSGVGPMGGVNMPLHHHHHHHHQQMMNQMGMGGGGFQGHAGVGFGANGMPVFNPAAMNNAGMGGWGDQGPMVNGMNMGGMNMNMNMNNMPHNMGMGGQWGNGGF
      >gi|Abis1000003455|ref|jgi|Agabi_varbisH97_2|67638|e_gw1.4.1246.1
      CDRVHFRPLIRPHTDPSLGHCSYLNTCYSEPTYAQSPSIPAYPGRGKEKAPCRYLHYEVDWDPTDAENEKTKERVAVKGKPHRLEIGLGPPGREATPLPPQWINCDLRKFDYSVLGKFHVIMADPPWDIHMSLPYGTMTDDEMRAMPIPALQDEGLLFLWVTGRAMEVGRECLRVWGYTRVDEVIWVKTNQLQRVIRTGRTGHWLNHTKEHMLVGVKTPSSPSDGPETELKFPKWVNRGVDTDVIVSEVRETSRKPDEVYGLIERMCPGGRKVEIFGRKHNARPGWLTLGNQLGPADQIWEEDLLERVRAK
      
      Back to Contents