>rifcsplowo2_01_scaffold_68480_2Parcubacteria QKDFII--KRKDKPDVVVKYFQTP--GS-FINAAIDLSKEQSFSSI----EDPIKEKERF AAKVISVVGTEVELAFPIELFGT-SLALLLDTVIGEVHNIAKFTGLKVLDIEFPKSWNKK YGGPVFGIEGIRKIVG-KKNPLLISPVKPCVGLSPDEFAERVKRCLMGGFDGVKDDELLL DPPYCPFKERVTKTMESIKEVQKKTGKKKIYFAHVGGDIDKIDTLVEFALSKGG-GIMFS LV-NGIDIIRKYK---GKVPIIAHNNLMYGMSRHPLNGISFYLLMKIQRLCGADMIICPA PRSFYKNVR--ACIVD-------------NGMRKTLPGLSGSQTPETLYNHYNLLGHDYA ICPGAAVYEHPMGIEAGATSFVEAVDALKSGVTLQKYAKNHEALRISMNHFKKKT >rifoxyc1_full_scaffold_13163_3Parcubacteria IIRRKD--ARPE---VVVKYFQTP--GN-FINAAVDLSKEQSFSYI----EDPIKEKKKF AAKVISVVKKEVELAFPIELFGD-SLVLLLDTVIGEVHNIAKFTGLKVLDIEFPRSWNKK YGGPVFGIEGIRKIVG-KKNPLLISPVKPCVGLSPDEFAGRVKRCLMGGFDGVKDDELLL DPPYCPFKERVTKTMRTVKEVEKKTGRKKIYFAHVGGDIDKIDGLVKFALSKGG-GIMFS PLVNGIDIIRKYK---GKVPIIAHNNLTYGMSRHPLNGVSFHLLMKIQRLCGADMVICPA PRSYVM-DIETHKKNVQACT---------NGMRKTLAGLSGSQTPETLYNHYKILGHDYA ICPGAAVYEHPMGIEAGATSFVEAVDALNNGILLKKYAKNHEALRVSMSHFVKKG >rifcsplowo2_12_scaffold_29845_1Parcubacteria -CTKKN--ISANKKDVVVKYFQTP--GN-FINAAVDLSKEQSFSYI----EDPIKEKKKF AAKVISVVKKEVELAFPIELFGD-SLVLLLDTVIGEVHNIAKFTGLKVLDIEFPRSWNKK YGGPVFGIEGIRKIVG-KKNPLLISPVKPCVGLSPDEFAERVKQCLTGGFDGVKDDELLL DPPYCPFKERVTKTMRTVKEVEKKTGRKKIYFAHVGGDIDKIDGLVKFALSKGG-GIMFS PLVNGIDIIRKYK---GKVPIIAHNNLTYGMSRHPLNGVSFHLLMKIQRLCGADMVICPA PRSYVETHKKNVQACT--MD---------NGMRKTLAGLSGSQTPETLYNHYKILGHDYA ICPGAAVYEHPMGIEAGATSFVEAVDALNNGILLKKYAKNHEALRVSMSHFVKKG >rifoxyb1_full_scaffold_10500_4Parcubacteria IIRRKD--ARPE---VVVKYFQTP--GN-FINAAVDLSKEQSFSYI----EDPIKEKKKF AAKVISVVKKEVELAFPIELFGD-SLMLLLDTVIGEVHNIAKFTGLKVLDIEFPRSWNKK YGGPVFGIEGIRKIVG-KKNPLLISPVKPCVGLSPDEFAERVKQCLTGGFDGVKDDELLL DPPYCPFKERVTKTMRTVKEVEKKTGRKKIYFAHVGGDIDKIDGLVKFALSKGG-GIMFS PLVNGIDIIRKYK---GKVPIIAHNNLTYGMSRHPLNGVSFHLLMKIQRLCGADMVICPA PRSYVM-DIKTHKKNVQACT---------NGMRKTLAGLSGSQTPETLYNHYKILGHDYA ICPGAAVYEHPMGIEAGATSFVEAVDALNNGILLKKYAKNHEALRVSMSHFVKKG >gwd1_scaffold_1521_5Parcubacteria IIGRKD--ARPE---VVVKYFQTPGNFI---NAAVDLSKEQSFSSI----EDPIKEKERF AAKVINIDGPEVELAFPIELFGD-SLVLLLDTVIGEVHNIAKFTGLKVLDIEFPRSWNKK YGGPVFGIEGIRKIVG-KKNPLLISPVKPCVGLSPDEFAGRVKRCLMGGFDGVKDDELLL DPPYCPFKERVTKTMRTVKEVEKKTGRKKIYFAHVGGDIDKIDGLVKFALSKGG-GIMFS PLVNGIDIIRKYK---GKVPIIAHNNLTYGMSRHPLNGVSFHLLMKIQRLCGADMVICPA PRSYVM-DIETHKKNVQACT---------NGMRKTLAGLSGSQTPETLYNHYKILGHDYA ICPGAAVYEHPMGIEAGATSFVEAVDALNNGILLKKYAKNHEALRVSMSHFVKKG >rifcsplowo2_02_scaffold_9609_9Parcubacteria EKDFII--RRKDRPEVVVKYFQTP--GS-FINAAVDLSKEQSFSSI----EDPIKEKERF AAKVISVVETEVELAFPIELFGD-SLVLLLDTVIGEVHNIAKFTSLKVLDIEFPRSWNKK YGGPVFGIEGIRKIVG-KKNPLLISPVKPCVGLSPDEFAERVKQCLTGGFDGVKDDELLL DPPYCPFKERVTKTMRTVKEVEKKTGRKKIYFAHVGGDIDKIDGLVKFALSKGG-GIMFS PLVNGIDIIRKYK---GKVPIIAHNNLTYGMSRHPLNGVSFHLLMKIQRLCGADMVICPA PRSYVETHKKNVQACT--MD---------NGMRKTLAGLSGSQTPETLYNHYKILGHDYA ICPGAAVYEHPMGIEAGATSFVEAVDALNNGILLKKYAKNHEALRVSMSHFVKKG >rifcsphigho2_12_scaffold_16902_10Micrarchaeota -MGATD--FIQKDS-VVVKYLITT----GPKTAAIKLCKEQSLSNA----LG--NDLNKF SAKFISIERVMVEVAFPPENCEN-SLSMMLSAVGGDTFNIKNLYPIKILSISLSKPFYKK YGGPRYGVDGLRKKLKAYSRPILVGPVKPCIGMNPGAFAQRAREALLGGTDIVKDDELIC NPPYNPLSSRTKLLSKTIRETERETGEKKMYFAFIGSGTTQIIEYAEIAKDNGDGYMISP AI-NGFEIVKELKDE-FQLPIIAHNALAYS-AYTPNHGMSFSVFSLLQRVCGADIVITPA KHGTF--DVMSAEEHKENIS---VLHSKIAGIKRAFPAFCGGQKPETIPLLKKDAGNDFI VVAGTSLYDHPDGPTAGAKRLRERF------------------------------ >rifcsphigho2_01_scaffold_24034_3Parcubacteria KSFFVC--DQPHKDDIVVTYLVTT--SLNCEEASIKLCKEQSLSN--ALGDDEEVIKKKY IPSVRHVNETKVDVAYPAENCEN-SIAMVLSAAGGDTYNIKNLYPIKILDIKLPPTFVGN YFGPTFGVEGLREDLGIYNRPILVGPVKPCVGMGPKAFAKRAFEALLGGTDIVKDDELIC SPSYSPLIQRVQAVSRSVKEAEQKTGEKKMYFAFIGSGSKEIMAQAGIAKLNGANGFMIS AI-NGLEIISDLT--DFGLPVIAHNALYAGHTKD--HGIVFSVFALFQRLCGADIVITPA PYGTFDSGAEHLENISAVSR-------DMDQIERSFPAFCGGQSVHTIPLLRRDVGSDFI IVAGTSLYDHPDGPGAGAKLLRESF------------------LSL--------- >PH2015_07_scaffold_0_448_PHAGEputative_phage --------MSKS---YRVTYHVESP---DIPRAADAIALGQSIGNP-DIR-NDYELEEAS AAKVVEIDGNRVVIDFQWRNLHSKDIGHMLCNIQGGQSDIALLEVCRVTDIEL------- --NQPFTKNPVTKTLKHRNRPLLGSIVKPKAGLTPSQLHEIVMMMCNGGIDFIKEDEILG DPAYLPLTQRLDIIQPIMENFPDT-----VYTYCVNGNPGHLIEQLDLVQKAKGDGVHIN FWSMGAYADSNAR----GLFTHFQRSGIRAYTDPRNYGIDWTVLVKLAIYQGIDSIHVGM IGGYYPGDHQEVRDAVALCQ-----------KHDVWATLSCGMTPEIAKDIQQEIGNHWL ANVGGWLHTGE-SIEKNIIKFRKSLEV---------------------------- >imgVR_3300006305_____Ga0068468_1000608_36_PHAGEputative_phage PFDIPD--TQTPRQQYTVTYEVDGP---DIPKIAHEIAIGQSIGNP-NIR-SEIENSQDF AANIVSYEDNIVKISFNRNAFIWPNINQLMCIIMGGHTDIMGVDKCRVIDIDIDVKNLK- ---PVLGMSGWKKRLDAENRPLFGAIIKPKSGLTEEQYIRIVKDMIYGGADFIKEDEIMA DNLYLPLTKRV----ETVEHIKNISGWKGYFAYCINADPMELVSNLKAIYYHG--GCHIN FW-SSLGAYTTAR--GFDIATHYQRSGIRILTDPSNYSVAWPVLVKLACLAGVDSIHVGM LGGYYPETEEETLEAINICK-------KY----DVIPSLSCGMNPVLAREIRERIGNDFM ASVGGWLHT-------GDSLIKK----VM---EMKESLL---------------- >imgVR_3300000117_____DelMOWin2010_c10004565_18_PHAGEputative_phage ------------MKKYTVTYEVDGP---DIAKIAHEIAIGQSIGNP-NIR-SEIENATDY IAQVKSIDKNIVVIEFPLGAFDWPNINQLMCIIMGGHTDILGVDRCRVIDIDIPIKTIE- ---PLLGMSGWKKRLNAENRPLFGAIVKPKSGLNKEQLLSLVKDMMYGGADFIKEDEIMA NNSYLPLQERI----DAIEHLKTISGWKGFYAYCINADPLELVDNCAAVKMAG--GVHIN FW-SGLGAYTTAR----------------------------------------------- ------------------------------------------------------------ -----------K------------------------------------------- >imgVR_3300017986_____Ga0181569_10001190_17_PHAGEputative_phage ------------MKKYTVTYEVDG--PD-IPKIAHEIAIGQSIGNP-NI----RSEIENV KEIAEDIQGKIVRIDFPIRAFDWPNINQLMCIIMGGHTDILGVDRCRVIDIDIPIKTLE- ---PVLGMSGWKKRLEAENRPLFGAIVKPKSGLNKEQLLSLVKDMMYGGADFIKEDEIMA NNSYLPLEERV----DAIEHLKSISGWKGFYAYCINADPFELVDNIKTVSTRGNIGVHIN FW-SGLGAYTSSR--KYSLATHYQRSGIRILTDPSNYSLSWPVLVKLGCMAGIDSMHVGM LGGY-PESEEETLKAIEICN-------EY----DVIPSLSCGMNPVLAREIRERIGNNFM ASVGGWLHTGD-GTAG---------NTYH------KVKE-------MSEAVAK-- >imgVR_3300001472_____JGI24004J15324_10000511_11_PHAGEputative_phage ------------MKKYVVTYVDGP---D-IPKIAHEIAIGQSIGNP-NI----RSEIENV KEIAEDIQGKIVKIAFPIRSFDWPNINQLMCIIMGGHTDILGVDRCRVIDIDLP-----T KPGPVLGMSGWKKRLDAEHRPLFGAIIKPKSGLNKEQLLSLVKDMIYGGADFIKEDEIMA DNSYLPLKERV----DMIEHLKSVSGWKGFYAYCINADPIELVDNLKKV---ADTGVHIN FW-SGLGAYTSAR--KYNLATHYQRSGIRILTDPSNYSLSWPVLVKLGCMAGIDSMHVGM LGGY-PESEEETLEAIQICK-------KY----NVIPSLSCGMNPVLAREIKERIGTKWM ASVGGWLHTGD-GTTN---------NTFH---KVREMSE-------AVK------ >imgVR_3300012953_____Ga0163179_10002609_6_PHAGEputative_phage IPVINS--WKPPPMKYTVTYEVDGPD---IPKIANEIAIGQSIGNP-NI----RSEIENV KEIAKDIQGN-VKIDFPIRAFEWPNINQLMCIIMGGHTDILGVDRCRVIDIDLPT----K PTGPVLGLSGWKKRLDAEHRPLFGAIIKPKSGLNKEQLLSLVKDMIYGGADFIKEDEIMA DNSYLPLKERV----DMIEHLKNISGWKGFYAYCINADPIELVDNLKTV---SNVGVHIN FW-SGLGAYTSAR--KYNLATHYQRSGIRILTDPSNYSLSWPVLVKLGCMAGIDSMHVGM LGGYPEGSEEETLEAIKICN-------EY----DVIPSLSCGMNPVLAREIKERIG-NWM ASVGGWLHTGD-GTPK---------NTFH---KVKEMSE-------AVK------ >imgVR_3300000116_____DelMOSpr2010_c10001115_19_PHAGEputative_phage ------------MSHYTVTYELDGP---DIAKAAWNLAIGQSIGNP-NIR----NEIENI KEEALSIEGNIVKIKFPLKAFAWPNLNQLMCIIQGGQSDIADVTRCRVLDIE-GLPYMN- --NPRLGMAAYKKRVGAENRPLFGGIIKPKSGLTKEQLLHIAKQMMDGGADFIKEDEIMA DNDYLPLEFRVNAVSNLIEQ----TGWKGVYAYCANADPFELCENLRTIRNHGGEACHIN FW-SGLGAYTTSH--KLHVVTHFQRSGIRTWTDPDNFSIAWTVIAKLAIMAGVDTMHVGM LGGYYPESEEETLEVIEACV-----------AEDRVPALSCGMNPVLAQEIRERIGNNWM ANVGGWLHTGE-NIYEKVYEMRK---------SFD-------------------- >imgVR_3300012953_____Ga0163179_10002555_5_PHAGEputative_phage ---------------YTVTYEVDG----DIQKAAWNIAIGQSIGNP-NIR-NEIENSPEM EAIIDSIEGK-VKIKFPLKAFKWPNLNQLLCIIQGGQSDIECVHRCRVIDIE-GLPYMN- --ESVLGMKSWKYRVDAENRPLFGGIIKPKSGLTADQLISITQQMMDGGADFIKEDEIMA DNDYLSLSERVDVISNAISKCNWK----GVYAYCINADPYDLCENLKTINEYGGEACHIN FW-SGMGAYTTS-NL-YHVVTHFQRSGIRTWTDPGNFSISWNVIVKLAIMAGVDSIHVGM LGGYYE-GESEEETLQAVSD---------LQNADRVAALSCGMNPEIARQIRGNIGNNWM ANIGGWLHTGE-SIYEKVYEMRK---------SLE-------------------- >imgVR_3300000117_____DelMOWin2010_c10001624_5_PHAGEputative_phage ------------------------------------------------------------ ---------------------------------------------MQ------------- --------------------------------------------------TFIKEDEIMA NNSYLPLMDRVDAVVDLIDRKKSN----VVYSYCVNADPQLMDNLCDVA-DGGGDCVHIN FW-SGLGSYNTSN--EAGLITHYQRSGIRILTDPDNFSIAWPVLVKLGVMCGIDTMHVGM LGGYYPESEEETLEAIQIMV-----------DNNRTPCLSCGMNPETAREIRGNIGNDWL ANIGGWLHTGD-SIYEKCYEMRK---------SLEDYKS---------------- >imgVR_3300000116_____DelMOSpr2010_c10000825_24_PHAGEputative_phage ---------------YTVTYKVKSK---DIPLAAHNIAIGQSIGNP-NVR----SEIENI KEEAIHIEGDIVKIDYPLKAFDWPNISQLLCIIQGGQSDIDLIQRCRVINIE-GLPYMN- --DPVLGMKAMKERVGAENRPLFGGIVKPKSGLTVNQLLDIVEQMIDGGADFVKEDEIMA NNSYLPLNERVREVTHLLNRKNSK----MVYSYCVNADPIELIDNLKTVKTLGGDCVHIN FW-SGLGAYTTSN--DIGLITHYQRSGIRILTDPDNFSVSWPVLVKLGVMAGIDTMHVGM LGGYYPESEEETLNAIDICE-----------TNDRVAALSCGMNPRLAKDIRDMIGDNWM ANIGGWLHTGE-SIYEKVYEMRK---------SLD-------------------- >imgVR_3300006802_____Ga0070749_10000052_38_PHAGEputative_phage --------MT-----YTVTYEVEGD---DIAKVAHNIAIGQSIGNP-NIR----SEIENI KEEAISIDGNIVKIKYPLKAWTWPNISQLLCVIQGGQSDIDLVKRCRVLDIE-GLPYMY- --EPMLGLKGMKERCDAQNRPLFGSIVKPKSGLTKEQLLNIVEQMIDGGTDFIKEDEIMA DNNYLPLSVRVRAVSELISKKNSK----VVYSYCVNADPIELINNLSDVSQHGGDCVHIN FW-SGLGAYTASR--QKGLITHYQRSGIRILTDPGNFSIAWPVLVKLGVMAGIDTMHVGM LGGYYPESEEETLKAMSICV-----------SNDRVPALSCGMNPQAAREIRGMIGNDWL ANIGGWLHTGE-SIYDKVYEMRKSLDTIM-------------------------- >imgVR_3300006027_____Ga0075462_10000321_14_PHAGEputative_phage --------------MYTVTYHVHSK-DIP--SAAHNIAIGQSIGNP-NIR-SEIETSAEM EAIVKSIEGN-VKIDYPLKAFTWPNISQLLCIIQGGQSDIEIVERCRVIDIE-GLPYMN- --APVLGMKAFKERVGAENRPLFGSIVKPKSGLTEQQLLSIVEQMIDGGSDFIKEDEIMA NNDYLPLTQRVQAVSSLIEKKNSK----VVYSYCVNADPYELVQNLEDVRSLGGDCVHIN FW-SGLGAYTASN--QAGLITHYQRSGIRILTDPGNFSISWPVLVKLGVMAGIDTIHVGM LGGYYEGSEEETMEALSICM-----------NNDRVAALSCGMNPDVAREIRGNIGNDWM ANIGGWLHTGE-NIYDKVYEMRK---------SLD-------------------- >imgVR_3300005805_____Ga0079957_1000004_32_PHAGEputative_phage TKSNIELIYKNRDGNFVTYI-VKS-----LHEAAKAIAIGQSIGNP-NKR-SEFETDKKF CCKIISVEGK-VVIFYPDTNFKEDGISHFLTTLMGGQLDIDIIEKCRLEDIEFSLHFRSF FSGPNLGLNEIRKLTNTYNKPLFGGIIKPKIGLNAEDYFEVFKIYADNGCNFIKEDEILS NQSFCSIRKRLERVANYIRTNNIK----LVYCPSITCDHKYLESRVEKIEKMGINGVHIN IH-SGYGAYKLVKDSFRNMYLHYQKSGDRLWTNPQHFSINETVLFEIASRCGCSTLHVGM IGGYLNSDSESLKRSIKKLV-------EL----NSVSALSCGLHPGLIDYITNELGHDWM GNVGGSISSHPMGAKAGVKAMVQAI----NKVYGEEYNK-------AIEVWGKK- >imgVR_3300013131_____Ga0172373_10003100_23_PHAGEputative_phage KTEIINIILISTNGN-IVTYDVKSKTNL---DAAYAIAVGQSVGNP-EKR-SEFESRETY CAKIV--DTN-INIYYPDNNFKEDGITHFLTTVMGGQLDIDIILKCRIVNIEFSREFNQI FNGPNLGLKEMREYCDVDEKPLFGGIIKPKIGLNVKDYVEVVKIYADNGCNFIKEDEILS NQSFCSIEKRLEKIGDYLNT----NNIKMVYAPSITCDHLYLEDRIRKIHNLGINGLHIN IH-SGYGAYKMVKDLRLNLYLHYQKSGDKMWTNPQHYSIDESVLFEIASRCGCSTLHTGM IGGYLNSDENSLVRTINKLV-----------SLNSVSALSCGMHPGLIQYIISKLGHNWM ANVGGAISSHPMGSAAGVRAMKQSI----SGNHGEEYFE-------AITKWGLKQ >RBG_13_scaffold_9498_7unknown IDDYL---FQSEEENIIATYSVRSK---NLVNAAKAIAIGQSIGNP-EVRTQRDSPEINL AKILCEIRGM-VQIAYPLVNFDIEGITQLLCTLMGGQMDIDAIESCRLVDVDFPKKVLEV YKGPKFGVANIRRRAKANGRPLLGGIIKPKTGITVPQLEEMVKEMLEGGVDFIKEDEILG NPYFCKFKERVKRVSDLVNDFSAKQGREVFYTPCINSDYPYFLERAQFASDNGAKALHLN FW-AGLSSYRALRDMDLKSAIFFQKSGDKVITDKRHFSIDWSVICKLARMSGCDFIHAGM WGGYLSDKKEDLMRVMESLR----GGDTY---KATVPSLSCGSHPGLVHTTIDNFGTNLM MNVGGAMQGHPLGTTAGAKAMRQAFECYQQKKPIYDFMKDKEELKEAIEKWGYVK >rifcsphigho2_12_scaffold_162_94_PHAGEputative_phage SPNEID--YNKY---FVVTYYLES--KTCLKDAALAIAIGQS---WTEMRAGKWESKEKY AAKVIGISGR-VNIVFPICNLSEDGISQLLCIIQGGQLDIKNIQTCLVVNIKFPDSVKKH FMGPKFGLKGLREKTKCFNKPLLMGICKPKLMDSPQMLLDMIKELVDGGVNIIKLDEINS SPPSCRFRDRIKLIADYL------RNKSVVYFDCITSDYPYVISRAIQANLEGISGIHVN CW-AGWGVYKAIRELDLDTFLFCQRSGKCMTDRIHRFHIRWSVLCKLVAMSGVDVVHAGM RFGYSNDDPEEVNKAIKVLR-------GF----DVAPTLSCGFTPELVEKVTNEVG-DYV IGAGGSIHSDANGTTSGALKFRK---------TIDNLYG---------------- >imgVR_3300002835_____B570J40625_100002082_7_PHAGEputative_phage LNLYYN--TVPD--DYTVTYYVES-----LAEAAEGIAIGQTIGNP-STRIPQWETPENY SAKILREAGK-ITLAFPSKNINRDGFSHLLCVLLGGQVDIDIIKKCHVIDID-DRNCIPK LK-PYYGLTGLRDYTGRYKRPLLGCIIKPKIGLSAKDYVSIVKEMIQGGADIIKEDEILG SPLFCSLEERLEMVNNLI------KNRPIVYLTAINGDADTVLDKAQTVHDYKVNGLHIN VW-SGLGTYRAVRKKNLPVVMHFQKSGDKTFTHPDNFRFDWSVICKIAAWSGVDTIHTGM WGSYLSDDPVVLKNNMDMLV-----------KHNVVPALSCGMNARLIPVITEKFGYDYL ANVGSACHSHPDGVYAGVKQLRAAIDAVS-------------------------- >imgVR_3300006484_____Ga0070744_10000432_2_PHAGEputative_phage REIYLN--AIPD---FVVEYYIES-----LGHACEAIAIGQSIGNP-SVR-NVFETVENH CAKIIYDKGT-VKIAFPIANVEEDGISHLVCNFMGGQVDIDFIKKCQVMDVD-IFDYYKT FE-PKFGITGIRKLTNRYEKPILGCIVKPKIGLKPKELASIVKDMIDGGADLIKEDEIMS NPSFCNYEDRLKYISDLI------EDKPVIYLATCNSDPHKLEERAKNIFRHGSNGIHVN LW-SGLGSYLSVRKLDLPMVLHYQKSGDKVITHKSNFMISWYVLCKLATWCGVDTIHAGM WGGYLSDKPKDLKRIMNLLQ-----------SNNTLPALSCGMNAELIPKVTKKFGYDYL ANVGGAVHSNPDGIKTAVMELRRAI----DEK----------------------- >imgVR_3300003277_____JGI25908J49247_10000163_14_PHAGEputative_phage MNIYIN--EIPT--DIKVTYQVTSTKNLS--IAAEAIAIGQSIGNP-SVR-NEFESPENH SAKILKNDGE-IIIGYPLVNIDEDGVSQLLCMIQGGQLDIDHIVKCRVTDIDINLPMLK- ---PKFGLSGIRELTKTYDRPLLGCILKPKTGLRPKELSLLVKEMIAGGANIIKEDEILG SPSYCNLESRLPYIRDII------QDKNVVYLTCINSNPDKLIEKSSTVRMLGVNGIHIN IW-SGLGSYAAVRNKNYPLVMHYQKSGDKILTNIHNYGIDWIVLCKLAIISGIDTIHAGM WGGYLSDDPVYLKNIMDTLT-----------SHNVVPALSCGMNATLIPQVTAKFGVDYL ANVGGACHTHPDGIKAAVRELRNAIDR---------------------------- >imgVR_3300009164_____Ga0114975_10000049_44_PHAGEputative_phage MNIYIN--EIPT--DIKVTYQVTSTKNLS--IAAEAIAIGQSIGNP-SVR-NEFESPENH SAKILKNDGE-IIIGYPLVNIDEDGVSQLLCMIQGGQLDIDHVVKCRVTDIDINLPMLK- ---PKFGLSGIRELTKTYDRPLLGCILKPKTGLRPKELSLLVKEMIDGGANIIKEDEILG SPSYCNLESRLPYIRDII------QDKNVVYLTCINSNPDKLIEKTNTVKMLGVNGIHIN IW-SGLGSYAAVRNKNYPLVMHYQKSGDKILTNIHNYGIDWIVLCKLAIISGIDTIHAGM WGGYLSDDPVYLKNIMDTLT-----------SHNVVPALSCGMNATLIPQVTDKFGVDYL ANVGGACHTHPDGIKAAVRELRNAIDR---------------------------- >imgVR_3300006129_____Ga0007834_1000020_33_PHAGEputative_phage NINIVQ--ETKNLEEFYATYEVES--TVSVYDAAWNIAVGQSIGNP-TKR-SVWETTEKY CAKILDNRGI-VEIGYPIANIKQDGISQLLCIVQGGQTDIATITKCRALKLEFPDFVART FNKPKFGISGFRDFTNTHGKPFFGGIVKPKTGISPSQLLDMTREMVEGGVNFIKEDEVMA NPDVCPLEVRVPLIAEYM------RDKPVVYCFCINSDPAYILDRARFVAANGGNGVHIN IW-SGLGAVKSVRDLDLPLYIHYQKSGDKVITHPANFGISWHLLCQLAAFAGVDTIHAGM WGGYLSDSEEDLRDVFEVLH-----------GNNVVPALSCGMEPSLVQPIVDKFGIDWM ANVGGYIHSDPLGTREGSKKMRAACDAIL---NLEK------------------- >imgVR_3300000439_____TBL_comb48_EPIDRAFT_1000912_4_PHAGEputative_phage --------MTIDKKDFYVKYFLKS--ATSLYDAAFDLAVGQSIGNP-SMR-SVWETPESY CAILRDLEGQ-VEIGFPLVNIDWTGIAQLLCTIAGGQVDIDRIKGCRAIGLEFSESFIEN FKKPKFGLSGFRALTGQYNKPLFGGIIKPKTGITPDQLLDMTKQLVDGGCDFIKEDEIMA NPAVCSLEDRVELISRYI------SNTKTVYCFCINADPAYIVPRAKFVAENGGRGVHIN VW-SGLGSYKSIRDLDLPLYIHYQKSGDRIFTSKYNFSISWQLCCQLAAWCGVDTIHAGM WGGYLSDPEDELRETLKILT-----------DRNVTPALSCGLTSESIRPIVDKFGVDWL GNSGGGIHSHPEGSQTGAAKIRAAV----DQI--------------G-------- >imgVR_3300000553_____TBL_comb47_HYPODRAFT_10006500_13_PHAGEputative_phage --------MTLNEKDFYVTYELAS--DISVYDAAFNVACGQSIGNPRSVWETDEMIEQKI VRTNFELHGT-VEIAFPYALIAEDGISQLLCIIAGGQSDIAAIKRCRVQHIEMKQSVVDY FHKPKYGITGMREFTGQYNKPLFGGIIKPKSGITPQVLLEMVKELVDGGCDFIKEDEILS SPAICRLEDRVELISNYL------HDKKVVYSYCINSDAAYILDRARFVAANGGNGVHVN VW-SGLGTYKNIRELDLPLHIHYQRSGMDFFASKHNFSISWHVLCQLAAWSGVDTIHAGM WGGYLSDDEDFLRTTLKILQ-----------AGNVVPALSCGLTAEHVAPIVEKFGIDWM ANAGGAIHGHPNGTKAGAAKIRAAVDAIG-------------------------- >imgVR_3300010334_____Ga0136644_10000002_223_PHAGEputative_phage --MSID--IVRDQKEFYVTYDVES-----LFDAAWAIAVGQSIGNP-NTR-SVWETPEAH CAKIL--RSN-VTIGYPAANIDNDGIAQLLCIIQGGQTDIDIIKKCRVVDLEIPQTIDSH FNKPRYGISGMRAYTGCFGKPFFGGIIKPKTGVTPHQLLDMVKELVDGGVNFIKEDEILS NPDICRLEDRVSLISDYI------SDKSVVYTFCINSDPLHVLDRARFVASNGGTGVHVN VW-SGLGVYKSIRDLDLDLFIHYQKSGDKVITHKKNFGIEWTVLCQLAAISGVDTIHAGM WGGYLSDPEDEITAYMKVLR-----------SGNVVPALSCGMTAELIPPIVEKFGVDWM ANVGGAIHGHPDGTLAGAQKIRRAIDS----------------LKV--------- >imgVR_3300009181_____Ga0114969_10000030_77_PHAGEputative_phage --MTID--IVRDEKEFYVTYEVES----SVYDAGWNIAVGQSIGNPRSVWETDDMIENKI MRKDFKK-GR-VTIAYPNANIETDGIAHLLCMIQGGQVDIDTIRKCRVVDLEIPQTIDTY FRKPKYGITGMREYTGNFGKPFFGGIVKPKTGITPQILLEMTKELVEGGVNFIKEDEILS NPNICRLEDRVELISNYL------QDKRVVYCFCINSDPAHIENRARFVSQNGGNGVHIN VW-SGLGSYRTIRNLDLPLFIHYQKSGDKVITGKRNFGIEWTVLCQLAAMSGVDTIHAGM WGGYLSDPEDEITAYMKVLH---------DG--NVVPALSCGMTAELIPPIVEKFGIDWM ANVGGAIHGHPDGTLAGARKIRN---------AIDTYQY---------------- >imgVR_3300006108_____Ga0007862_1000138_16_PHAGEputative_phage --MSID--IVRDEKEFYVTYEVES----SVYDAGWNIAVGQSIGNP-TKR-SVWETDEAY CAIIRDFKGH-VTIGYPSANIDQDGIAHLLCIIQGGQVDIDTIKKCRVVGLEIPQTIDTH FRKPKYGITGMREYTGNFGKPFFGGIVKPKTGITPQVLLEMTKQLVEGGVNFIKEDEILS NPAVCRLEDRVELISNYI------TGKNVVYCFCINSDPAHIENRARFVSQNGGNGVHIN VW-SGLGSYRTIRNLDLPLFIHYQKSGDKVITGKRNFGIEWTVLCQLAAMSGVDTIHAGM WGGYLSDPEDEITAYMQTLH---------DG--NVVPALSCGMTAELIPPIVEKFGIDWM ANVGGAIHGHPDGTLAGTQKIRR---------AIDDLQV---------------- >imgVR_3300013093_____Ga0164296_1006299_14_PHAGEputative_phage IDIVRD---EKELGTFYVTYEVES----SVYDAGWNIAVGQSIGNP-TKR-SVWETDEAY CAKIIKDEKK-VTIGYPSANIDQDGIAHLLCIIQGGQVDIDSIKKCRVVDLEIPQTIDTH FRKPKYGITGMREYTGNFGKPFFGGIVKPKTGITPQILLEMTKQLVEGGVNFIKEDEILS NPAVCRLEDRVELISNYI------TGKNVVYCFCINSDPAHIENRARFVSQNGGNGVHIN VW-SGLGSYRTIRNLDLPLFIHYQKSGDKVITGKRNFGIEWTVLCQLAAMSGVDTIHAGM WGGYLSDPEDEITAYMKTLH---------DG--NVVPALSCGMTAELIPPIVEKFGVDWM ANVGGAIHGHPDGTLAGTQKIRRAI--NL---------------KV--------- >CG_2015-01t_scaffold_2353_6_PHAGEputative_phage YRKTVD------NSKIIATYDVST--TATPEDAAWAIAVGQSIGNP-NAR-SNWETKQDH ACKILSIESD-IAIAYPIENIDLGGVSHLLCQLMGGQMDIDIITRCRLMDIEFPDSVYNH FKGPRYGISGIRDFTGVYDKPLFGGIVKPKIGLTPDKLLEVVREMVEGGVNFIKEDEILS NPNHCPIEDRVPLIMDYL------EGKNVIYAVSITADPLEVIRRVVQVYDLGGNAVHIN WW-AGLGVYKTIRDLDLPMFLFFQKSGDKVITHRSNFGIEWSVVCKLAAMSGVDFIHAGM WGGYMDTDSEELRRIMTGLI-----------NNNVLPSLSCGMTPDLVKPITDRFGVDYM ANTGGYIHSYNGGTKAGCLKMREAI--------------DNDD------------ >imgVR_3300005528_____Ga0068872_10000195_49_PHAGEputative_phage --------MSVNDIDFIVKYFLAG--KTSLRDAAWNLAIGQSIGNP-NNR-SVWETDQDH SCFVLENSGE-VDIAFPLENLEEDGISQILCHIAGGQVDILEIEQCHVLDVTLPAHIEQQ FTKPAYGIDGFRKFNGVEGKPFFGGIIKPKVGMSPEVLLEAVKEMVYGGVNFIKEDELLG SPAHCPLTKRVPLITNWLAN----NAPNVMYTFCINGDSPYALQRAQFVSDEGGLGVHIN VW-SGLGAYRAIRKQNPNLWIHFQKSGDKFFTDRRANHIYWPVICKIAGWSGADSIHAGM IGGYM--NQDDT-ELQDALK------VLW--NYNVIPALSCGMHPGLVQHINGLLDSNWM ANVGGAMHGHPMGTLAGGLAMRQAI----DGNHGAEYDA-------AVKKWGYKA >imgVR_3300017989_____Ga0180432_10000539_12_PHAGEputative_phage --------FKDRIADFVVRYYLEA-----LRDASWNLAIGQSIGNP-TNR-SEWETDENH SCFILEQDGE-VEIAFPLANLEEDGISQILCHIAGGQVDIEEVKQCHVLDIVLPQEVEDS FSKPAYGINGWRRFNGVDDKPFLGGIVKPKVGMSPEVLLEAVKEMVYGGVNFIKEDELLA NPSHCPLEQRVPLISSWLKENAPD----VIYCFCINGDSPYALERAKFVADNGGNGIHIN VW-SGLGVYRAIRKQNPNLWIHFQKSGDKFFTDRRSNHIYWPVLCKIAGWSGVDSIHAGM IGGYMNQDEEEIRDTLRVLW-------NY----NIVPALSCGMHPGLVQYINETISSDWM ANVGGAMHGHPQGTRAGALAMRQSIDQDY---DHPEYTI-------AIKKWGNKC >imgVR_3300006810_____Ga0070754_10002591_16_PHAGEputative_phage -MISI---FKNKTSQFIVKYFLEAKTSLR--EASWNLAIGQSIGNP-NNR-SEWETDENH SCFILEEEGE-IEIAFPLANLEEDGISQILCHIAGGQVDIEEVRQCHVLDISLPEDAERS FSNPAYGIDGWRKFNGIKQRPFLGGIVKPKVGMSPSVLLEAVKEMVYGGVNFIKEDELLA NPSHCPLEERVPLISSWLKENAPD----VIYCFCINGDSPYALERAKFVADNGGNGIHIN VW-SGLGIYRAIRKQNPNLWIHFQKSGDKFFTDRRANHIYWPVLCKIAGWSGVDSIHAGM IGGYM--SQDEQ-EIKDTLQ------TLW--NYNIVPALSCGMHPGLVEYINTTIGSDWM ANVGGAMHGHPQGTRAGALAMRQSI----DGDDEPEYKA-------AIEKWGNKC >imgVR_3300018423_____Ga0181593_10013560_4_PHAGEputative_phage -MISI---FKNKTSQFIVKYFLEAKTSLR--EASWNLAIGQSIGNP-NNR-SEWETDENH SCFILEEEGE-IEIAFPLANLEEDGISQILCHIAGGQVDIEEVRQCHVLDISLPEDAERS FSNPAYGIDGWRKFNGINQRPFLGGIVKPKVGMSPSVLLEAVKEMVYGGVNFIKEDELLA NPSHCPLEERVPLISSWLKENAPD----VIYCFCINGDSPYALERAKFVADNGGNGIHIN VW-SGLGIYRAIRKQNPNLWIHFQKSGDKFFTDRRANHIYWPVLCKIAGWSGVDSIHAGM IGGYM--SQDEQ-EIKDTLQ------TLW--NYNIVPALSCGMHPGLVEYINTAISSDWM ANVGGAMHGHPQGTRAGALAMRQSI----DGDDQPEYKA-------AVAKWGNRC >imgVR_3300006802_____Ga0070749_10000694_17_PHAGEputative_phage -MISI---FTKEVQKFVVRYFLEAKTSLR--DASWNLAIGQSIGNP-NNR-SIWETDQDH SCFVLEKSGE-VSIAFPLDNIEEDGISQILCHIAGGQVDIEEIVKCHILDIRLPDKVESD FSAPAYGIDGFRRFNNVIDKPFFGGIIKPKVGMSPGVLLEAVKEMVYGGVNFIKEDELLG NPSHCPFEERVPLISSWLKENAPD----VIYCFCINGDSPYALERAKFVADNGGNGIHIN VW-SGLGVYRAIRKQNPNLWIHFQKSGDKFFTDKRSFHIYWPVICKIAGWSGVDSIHAGM IGGYM--NQDDD-EIRDTLK------VLW--HYNIVPALSCGMHPGLVDYINESINSDWM ANVGGAMHGHPMGTRAGGLAMKQAINNDF---QEEEYVC-------AIKKWGKRT >imgVR_3300017818_____Ga0181565_10006269_7_PHAGEputative_phage -MISI---FTKEVQKFVVRYFLEAKTSLR--DASWNLAIGQSIGNP-NNR-SIWETDQDH SCFVLEKSGE-VSIAFPLDNIEEDGISQILCHIAGGQVDIEEIVKCHILDISLPDKVESD FSAPAYGIDGFRRFNNVIDKPFFGGIIKPKVGMSPEVLLEAVKEMVYGGVNFIKEDELLG NPSHCPFEERVPLISSWLKENAPD----VIYCFCINGDSPYALERANFVANNGGNGIHIN VW-SGLGVYRAIRKQNPNLWIHFQKSGDKFFTDKRAFHIYWPVICKIAGWSGVDSIHAGM IGGYM--NQDDD-EIRDTLK------VLW--HYNIVPALSCGMHPGLVDYINESIHSDWM ANVGGAMHGHPMGTRAGGLAMKQAINNDF---QEEEYVC-------AIKKWGKRT >imgVR_3300003410_____JGI26086J50260_1000284_21_PHAGEputative_phage ISIFVE----PEEKYFIVTYFLGSKRSLR--EASWNLAIGQSIGNP-NNR-SVWETDQDH SCFILEEEGI-VKIAFPLENIEEDGISQILCHIAGGQVDIEEIEKCNVLDITLPEMVEEN FRKPAYGIDGFRKFNSVYEKPFFGGIIKPKVGMSPEILLEAVKEMVYGGVNFIKEDELLA NPSHCPLEERVPLITSWLKENAPD----VIYCFCINGDSPYALERAKFVADNGGNGIHIN VW-SGLGVYRAIRKQNPNLWIHFQKSGDKFFTDRRAYHIYWPVICKIAGWSGVDSIHAGM VGGYMNQDEEEIRDALEVLW-------KY----NIVSALSCGMHPGLVEYINETLSSDWM ANVGGAMHGHPMGTRAGALAMKQSINGDL---EGEEYLA-------AIEKWGKRK >imgVR_3300005662_____Ga0078894_10001118_13_PHAGEputative_phage MNIFKN--TPPH---FVVEYYLESKTSLR--DAAWNLAIGQSIGNPRSLRETEEMYENHI LHEDYSLKEGIVRISFPLSNLKEDGISQILCHIAGGQVDILEVQKCHVLNIELPKEVEDS FRKPAYGIDGFRKFNGVIDKPFFGGIIKPKVGMSPDILLDAVKEMVHGGVNFIKEDELLG NPEHCPLVKRVPLISKWLNENAPD----VIYCFCVNGDSPYVLDRVNFIEQEGGNGIHVN VW-SGLGIYRAIRKQNPNMWIHFQKSGDKFFTDKRSYHIYWPVICKIAGWSGVDSIHAGM IGGYM--NQDDN-ELADALK------VLW--NYNIVPALSCGMHPGLVQYINETLDSDWM ANVGGAMHGHPMGTLSGGLAMKQAINKEF---DKVEYKQ-------AIEKWGNKE >imgVR_3300009158_____Ga0114977_10008854_1_PHAGEputative_phage ---------------LLKKSK--------------------------------------- -------AGN-VKIAFPLANLEEDGVSQILCHIAGGQVDILEIQKCHILDVVLPVEVEES FREPAYGIDGFRKFNGVENKPFFGGIIKPKVGMSPDILLEAVKEMVNGGVNFIKEDELLG NPEHCPLEVRVPLISGWLKENAPD----VIYCFCINGDSPYALERAKFVSDNGGNGIHIN VW-SGLGVYRAIRKQNPDLWIHFQKSGDKFFTDRRAFHIYWPVICKIAGWSGADSIHAGM IGGYM--NQDDD-ELVDALK------VLW--KYNIVPALSCGMHPGLVQYINQMLNSDWM ANVGGAMHGHPMGTLSGGLAMKQAINQEFDGI---EYKK-------AIEKWGSRE >CG09_land_8_20_14_0.10_scaffold_14070_3Beckwithbacteria MKWFRNVN----KEEWLADYWLES-----LKEASLSLAVGQSVGNP-TIR-SRRETESKH GAIILEAKGR-VRIAFPLENMQEDGVTQLLVQAMGGQLDINIIEKCHLLNLQFPKGF--K YNRSKFGINGVRRFTGVYNKPILGGIIKPKVGLTSQELLGMVKEMVEGGVNFIKEDEILA NPDCCPFEERVPLVMKYL------KGKKVIYAVCINSDYPYILERARRVYELGGNAVHVN WW-AGLGVYKAIRELNLPLFLFFQKSGDRILTNEKHFYIAWPVICRLAGLMGVDIIHAGM WGGYLTGLRKTLV-ILADCG--------------VLPSLSCGLHPGLIQAINRRFGTDYL ANAGGAIHGHPGGTKAGVKAFRQAI----DGKLGREYFA-------AVRLWGEVK >gwa2_scaffold_24083_5Beckwithbacteria MQWFRNVN----QKEWLADYWLES--TTDLREASFSLAIGQSVGNP-TIR-SRRETEKQH SAIILETKGR-VRIAFPLKNMQEDGVTQLLVQAMGGQLDIKIIKKCRLLNLDLPKAF--K FKGPRFGIKGVRQFTGVNDRPILGGIIKPKVGLSSQELLVVMKEMVEGGVNFIKEDEIMA NPGCCPLEKRVPLVMQYL------KGKKVIYAVCINSDYPYILERARRVYELGGNAVHVN WW-AGLGIYKAIRELDLPLFLFFQKSGDRIITDEKHFHIAWPVICQLAGLMGVDIIHAGM WGGYLAETAASLRKTLAVLA-----------AYNVLPSLSCGLHPGMIGAINKRFGVNYL ANAGGAIHGHPGGTKAGVQAFRQAI----DGKLGREYFA-------AVRLWGEVK >imgVR_3300013132_____Ga0172372_10011093_16_PHAGEputative_phage ------HIFNPDSSEFIVKYFLAS--NKNLREAAWDLALGQSVGNP-SMR-SIWETEQDH SIIILEQAGE-VAFAFPLKNIDLGGVSQILCQLMGGQMDINHILRCHLNDIIFPSVVEEY FRGPRFGISGTRTFTETWNKPLLGGIVKPKVANDVEVLKKIVGEMVEGGVNFIKEDEIMA NPSSCPLELRVKEISKMI------ENTKVVYCYCING--DDVVTRAKRVYDLGGRGVHVN FW-SGLGTYKRIRELNEPLFIHFQKSGDRVLTNPNHYHIKWNVICKLAGMMGVDSIHAGM VGGYMQ-QDENFLDTLNILR-------HY----NVIPALSCGMHPGLVDSITKQIGVDYM ANVGGALHGHPMGTKAGVMAMRQAI----YQQPGEEYNV-------AIQKWGKVS >imgVR_3300004691_____Ga0065176_1000054_16_PHAGEputative_phage -MSFDA--FRDKPQEVFVKYELDS-----VKKAAWDLAIGQSVGNP-NVR-NDWETDEKH SCLITDEENKLAEIAFPVENTKEDGISHLLCQIMGGQTDIDHIIKSRVIDIEIPECVKAW FREPRFGFKGYREYLNQYDKPLLGGIVKPKTGISPQTLLEMVKQMVEGGVDFIKEDEILS NPNFCPLWQRVPLIADYLAN----CGRKVAYHFCINSDPLYVLERAKYVAKHG---VHIN VW-SGLGVYNSIRKLNLPLFIHYQKSGEKTFTHPKNFGISWPVLCELAGLSGVDTIHAGM IGGYS--SDDPVMMEQAIAN---------LNKYGTVPSLSCGMHPGLVNHVTELFGSDYM ASVGGAIHGHPNGTLAGATAMRQAI----DKTYGPEYDV-------AIAKWGLVN >imgVR_3300007516_____Ga0105050_10000484_9_PHAGEputative_phage --------FDPFDPELIVTYHMSS--TISLKKAAYDLAIGQSMGNP-NER-NSWETDERH SCLVLTLEGF-VDIAFAVENTDWGGIAHLMCQISGGQADIDHITRSRVIDLKIPQSVRKH FHTPKYGISGLRAFNQQFDKPFMGGIVKPKTGLSPARLLEMVKQMVDGGVDFIKEDEILS NPQFCSLADRVPLISNYIQN----CGRAVKYCFSINGDPHVIESRVKFIASEGGNGVHIN FW-SGLGVYHSIRRLDLPIYLHYQKSGDKAITHHANFGISWYVMCQLAALAGVDSLHAGM YGGYLSDEETQLNILMKMLQ-----------TNNVIPGLSCGMHAGLVNHITEHVGNDYL ANTGGAIHGHPRGTTAGCKAMRQAI----DHTYGTEYDE-------AIKKWGLIN >imgVR_3300009182_____Ga0114959_10000174_59_PHAGEputative_phage MLKYDI--FKPESSFFIVTYLIGS--KTDLKDAAWNLAIGQSIGNP-SKR-SEFESQDNH CAIILEKEGI-VKIAFANINFKTDGVSQLLVQIMGGQCDIDIFEKCVIKDIKLTPNMENC LQGPKIGLKEMREYCG-VEKPLFGGIVKPKVGLSPQKNLDLVKKLIDGGCNFIKEDEILS DPDHCRIEDRVPPVMDYIKS----TNAKVYYAVSIHSDPAHILNRVKQVYELGGNAVHVN FH-CGLGVYKSIRELDLPILVHFQKSGDKILNCYDHFAIDQDVIFKLVGQSGCSTLHAGM IGGYMDNETQAVKKTISMLN-------DI----NCVPALSCGMHPGLIDYILDVVGHNWM ANVGGALTSHPSGTLAGTKAMRQAI----DKNYGDEYHQ-------AIEKWGKK- >imgVR_3300018790_____Ga0187842_1000009_46_PHAGEputative_phage ------KIFRNKINKVVATYRMSS--NQSLKAAAWNLAIGQSVGNP-NVR-NEWETDSNH SCIIFEKAGM-VEIGFPVINSDWNGISHLLCQLMGGHVDIDLITSCRLIDLLIPECVKQH FLGPKFGITGLRKLTNQFNKPLFGAIIKPKTGITSKVLLEMVKQVVEGGVDFIKEDEIMS NPACCRLEERVEVIANYLSTQKKK----IAFCHTINGDPHVVLDRVRKVSELGGNGVHIN VW-SGLGVYNSIRKLDLPVFLHFQKSGDRVFTDRSHFSIAWPVVCQLATLMGADTIQAGM VGGYSNDDEIELMQALKILR-----------EGNTVPALSCGMHPGLVDFCIGRVGNDFM ANVGGAVHGHPGGSRSGALAMRQAI----DKEHGKEYNQ-------AIEKWGIVS >imgVR_3300009154_____Ga0114963_10001892_12_PHAGEputative_phage NQSEID------KERFVVTYKLES-----LRDAAWNIAIGQSVGNP-NVR-NRWESEDNH SCLILEREGE-VKIAFPVINTETDGVSHLLCQIMGGHVDIDLVKKCRAIKIDFPNSVTKH FSGPKFGITGMRKFTGQYNKPLFGAIIKPKIGIGPETLLDMVKELVDGGVDFIKEDEIMS NPSFCSLDRRVEMISNYLSSQSRK----VVFCHTINCDPHVLVDRVRRIHSLGGNGVHIN VW-SGYGSYNSIRKLDLPIYCHFQSSGAKVVTSVNNFSISWSVICQLATMMGVDTIQTGM VGGYSNDDPEEILECIRILR-----------EGNTLPALSCGLHPGLIDKITSLVGNDYL GNAGGAVHGHPSGTLAGSRAMRQAI----DGNYQTEYHQ-------AIQKWGKEN >imgVR_3300003410_____JGI26086J50260_1002745_8_PHAGEputative_phage -MKF----FRNRSDKYIATYEMTSSDNL---EAAWALAIGQSVGNP-SVR-NEWETDENH SCIILEEDGL-VHIAFPVANT-EDGMAHMLCQLMGGHVDIAIITSCRLVNLELPETVTKH FLGPKFGLSGIREFTGQYNKPLLGGIVKPKIGITPTILLEMVKQMVDGGVDFIKEDEIMS NPAVCPLDERVDLISNYLAK----QSRKVVFCHTINCDPHIVSDRVKRVHELGGNGVHIN VF-SGYGVYNSIRKLDLPIFMHYQSSGKVTTDVNHRFSISWPVMCQLASLMGVDTIQTGM VGGYSNDDPVEIQKCLEILR-----------AGNTMPVLSCGFNPGLVEKVNKLAGTDYL ANAGGAIHGHPGGTVDGATAMRQAV----DKTYGSEYDV-------AIEKWGLTK >imgVR_3300004686_____Ga0065173_1000025_12_PHAGEputative_phage --------YRDRLSKYIATYKITS---K---DAAWNLAIGQSVGNP-SV-RNEWETDDNH SCIIL------VEIAFPVANTAT-GVAHMLCQLMGGHVDIDCFNSCRLIKLELPETVTQH FLGPKFGITGFRALTGQYGKPLFGSIVKPKIGITPEVLLEMVKQMVDGGVDFIKEDEIMV NPACAPLDRRVDIIANYLAK----QSRKIVFCHTINCDPHVLVDRVKRVHELGGTGVHIN VF-SGYGSYNSIRKLDLPLYLHYQSSGKVTTDVNHRFSISWPVMCQLATLMGADTIQTGM VGGS-NDDPEEIKECLDILR-----------AGNTVPALSCGFHPGLVEKVTEIAGQDYL ANAGGAVHGHPGGTVAGATAMRQAI----DKTYGPEYDQ-------AIAKWGLIK >imgVR_3300004770_____Ga0007804_1000749_8_PHAGEputative_phage RE-RLE--IDLE--KYIATYEMAS-ST-K--DAAWNLAIGQSVGNP-NV-RNEWETDDNN SCIIVEKEGI-VEIAFPVINT-D-GISHMLCQLMGGHVDIDIITKCRLVKLELPETVTKH FKGPKFGITGIRKFTKQYDKPIFGSIVKPKIGITPKVLLEMVKQMVDGGVDFIKEDEIMS NPAFCKLEKRVDLIANYLAKQSRK----VIFCHTINCDPHVLVDRVNMVHQLGGNGVHIN VF-SGYGAYNSIRKQNLPLFLHYQSSGAKVTTDVNHFSISWPVMCQLATLMGADTIQTGM VGGSND-DPEEIKQCLDILR-----------AGNTLPTLSCGFHPGLLNKVSSIAGNDYL ANVGGAVHGHPGGTRAGATAMRQAI----DKTYGPEYDE-------AIAKWGLA- >imgVR_3300006484_____Ga0070744_10000091_14_PHAGEputative_phage MKF-----FKNRDINFIATYDMKSSTT-K--EAAWNLAIGQSVGNP-NV-RNEWETDENH SCIILEYYGE-VAIAFPVANT-D-GISHLLCQLMGGHVDIDIIKKCRLIKLELPKVVTQH FLGPKFGLSGFREFTGQYNKPLLGGIVKPKIGVTPEILLEMVKQMVDGGVDFIKEDEIMS NPVCAPLERRVDIISNYLAKQSRK----VVFCHTINCDPHILVDRVKRVHSLGGNGVHIN VF-SGLGVYNSIRKLDLPLYLHYQSSGAKVFTDVNHFSISWPVMCQLATMMGVDTIQTGM VGGSND-DPEEIKCCLEILK-----------AGNTAPALSCGLHPGLIDKVTELAGVDYL GNAGGAIHGHPGGTIAGARAMRQSI----DKDYGNEYNQ-------AISKWGLDK >imgVR_3300006121_____Ga0007824_1000696_6_PHAGEputative_phage MKF-----FRTRNLNFIATYEMSSSAN-K--EAAWNLAIGQSVGNP-NV-RNEWETDENH SCIILEANGT-VEIAFPIANT-D-GISHLLCQLMGGHVDIDIVTKCRLVKLELPDTVTSH FLGPKFGLSGFREFTGQYNKPLFGGIVKPKIGVSPKVLLEMVKQMVDGGVDFIKEDEIMS NPACAPLERRVDLIANYLAKQSRK----VVFCHTINADPHVVVDRVKRVYELGGNGVHIN VF-SGLGVYNSIRKMDLPLFLHFQKSGDKVFTDKNHFSISWPVICQLATMMGVDTIQTGM MGGSND-DPIELQQAIDVLR-----------AGNTTPVLSCGFHPGLVDKITKLAGNDYM ANVGGAMHGHPGGTREGSAAMRQAI----DKTYGPEYDQ-------AIAKWGLIS >imgVR_3300005581_____Ga0049081_10002534_4_PHAGEputative_phage MKF-----FRTRDLNYIATFEMKSAST-K--DAAWNLAIGQSVGNP-NV-RNEWETDENH SCIIVEDVGN-VEIAFPVANT-D-GISHMLCQLMGGHVDIDIITKCRLIKLELPKTVTDH FLGPKFGLSGFREFTGQYNKPLLGSIVKPKIGITPEVLLEMVKQMVDGGVDFIKEDEIMS NPACAPLDRRVDIIANYLAKQSRK----VVFCHTINADPHVIVDRVNRVHELGGNGVHIN VF-SGLGVYNSIRKMNLPLFLHFQKSGDKVFTDKNHFSIAWTVICQLATMMGVDTIQTGM MGGSND-DPEEIKQCLEILR-----------AGNTTPALSCGFHPGLVEKINSLAGTDYL ANAGGAVHGHPGGTIAGAGAMRQAI----NREYSVEYHQ-------AISKWGLIT >imgVR_3300005683_____Ga0074432_100043_78_PHAGEputative_phage MTQFEV--FQPDTTDFVVTYKLSSTKSLE--NAAWNLAIGQSVGNP-NVR-NAWESDEKH SCIVLEQSGT-VKIAFPLSNIDFSGMSQFLCHIMGGQLDIDSIVKCHVLDVQFPDSVKNT FLGPKFGIQGIRKYTGVYDKPLLGGIVKPKVCMNEDILLDLVKALVDGGVNFIKEDEIMA NPSCCTLEKRLPKLAQYL------STQNVVYCVCINADPHNVLDRVKQVHEAGVNGVHVN FW-SGLGVYNSIRKLDLPLFLHFQKSGDKIFTSKKHFHIDWSVVCKLAGMMGADFIHNGM LGGYSSDDESELKKTVEILQ-----------FHGTMPALSCGMHPGLVEYIRGKLGNELL LNCGGAIHGHPGGTVSGARAMRQAI----DRQHGKEYDE-------AISTWGLVQ >imgVR_3300009068_____Ga0114973_10000306_11_PHAGEputative_phage ----------------MSLF-----------DAAENIAIGQSVGNP-KVR-NGWETPERA GCRIETESGD-VWIAFPLSNIDLTGVSQLLCHIMGGQLDINTITRCVVLDLDLPNDVVSY FLGPKYGIQKIREYTGAYNKPLLGGIVKPKVGINKETLLEIVKQLVEGGVNFIKEDEIMA NPEHCPFEERVPLICDYL------KGKNVVYCFCINADPDLVLERVKFVAAHGGNGVHVN FW-CGIGLYKRIRELDLPIFVHYQKSGISILSDVRNFHIEWAVFCKLAGLCGVDFIHAGM MGGYSDNDRDEMTRVLKELH-----------THGVMPALSCGMHPGLVDAICASIGVDWM ANCGGAIHGHPEGTLAGAKAMRQAI----DHTYGPEYDS-------AIAKWGKVE >imgVR_3300009182_____Ga0114959_10006311_8_PHAGEputative_phage KFFRVH--YSLA---TVSLF-----------DAAENIAIGQSVGNP-KVR-NGWETPERA GCRIETEEGN-VWIAFPLANIDLTGVSQLLCHIMGGQLDINTITRCVVLDLDIPKDVVSY FLGPKYGIQKVREFTGVYNKPLLGGIVKPKVGIYKETLLEIVKQLVEGGVNFIKEDEIMA NPEHCPLEERVPLICDYL------KGKNVVYCFCINGDPDIVLERVKFVAAHGGNGVHVN FW-CGIGLYKRIRELDLPVFVHYQKSGITILSDVRNFHIEWSVFCKLAGMCGVDFIHAGM MGGYSDNDRDEMTRVFNELR-------PY----GVVPALSCGMHPGLVDAICASIGFDWM ANCGGGIHGHPQGTLAGAKAMRQAI----DHTYGPEYDS-------AIQKWGKVD >imgVR_3300009058_____Ga0102854_1000016_27_PHAGEputative_phage -MIFVDID----SNKIKGYYFLESKKGVR--DAAWNLAIGQSVGNP-LVR-LERETDENH SCIKNDLSGR-VEIGFPVANISTDGVSQLLCFLMGGQLDIDTIETCWLENLIIPDWVIEY FKLPKFGITGAREFTKANDKPLLGGICKPKTGISPSILLDMVKEMVDGGVNFIKEDEIMS NPECCRLEERVSLISNYI------RDKNVVYCFCINSDPAHILDRVRFVHQEGGNGIHIN FW-SGMGVYKSVRELDLPLFMHFQKSGDKILTNKKHFHIDWSVICYLAGLMGVDFIHTGM WGGYASDDENDLRKTMGILH-----------SRNVVPALSCGMHPGIVNTITEKFG-NYL ANCGGALHGHPSGTVAGAQAMKQAI----DKTFEAEYLQ-------AINKWGLVE >imgVR_3300006030_____Ga0075470_10000039_31_PHAGEputative_phage MKFYRD--LDDSEKQIVVTYYVNPSHGR-LNDAAWALAIGQSVGNP-KQR-NSWETDDMS SCVIYDDEEK-VKIGFPKINSDNDGISHFLCQIMGGQLDIDVFKVCRVVDVEFPESVKKS FLGPKFGMSGIRELTGQKNKPLLGGIVKPKTGISVKQLGYIVKELIDGGVDFIKEDEILS NPTFCRLEERVEHIANIIAD----SGRKVVFAHCINADPHAILNRVKTVYENGGNGVHVN FW-SGFGAYNSIRKMDLPIYMHFQKSGDKILTNPSHFRIDWYVICKLAALMGVDTIHTGM WGGYLSDDEDNLKRSIDLLN-----------DNNVVPALSCGMHPGLVEAISRRFG-NYM ANVGGAIHGHSGGTLGGTMAMRQAI----DKSYGPEYIS-------AIEKWGIVN >imgVR_3300005805_____Ga0079957_1000649_25_PHAGEputative_phage MKFFRE--LTKDEKDIVATYYIESHHHI-LKDAAWSLAIGQSVGNP-KVR-NRWENDDLA SCVIYDIQGT-VKIGFPKSNTDWGGISHLLCQVMGGQLDIDIFKACRLKKIEFPADVEAE FLAPKNGISGIRKFVNRYNKPLSGAIVKPKTGISPITLGEMVKELLDGGVDFIKEDEILS NPSFCRLEDRVELISNIINN----CGRGVIYCFCINGDHHAVLDRAKFVAANGGNGVHIN FW-SGLGVYNSIRKLDLPLFIHYQKSGDKILTDKRNFSIDWSVLCNLAGLCGVDTIHAGM WGGYLSNDVEELRNIMDILH-----------KRNVLPALSCGMHPGIVNPTVDKFGIDFL ANCGGAIHGHPGGTLAGALAMRQAI----DKTPKEEFFA-------AIDKWGSN- >imgVR_3300006641_____Ga0075471_10002405_15_PHAGEputative_phage FRELTD--YERD---VVATYYIESDLGT-LRDAAWNLAIGQSVGNP-KVR-NRWESDELA SCVIYEQKGQ-VKIGFPKVNTDWGGISHLLCQLMGGQLDIDVFKVCRLQKLEFPADVEAQ FFGPKNGIDGIRRFVNRYDKPLSGAIVKPKTGISPQTLSEMVKELLDGGVDFIKEDEILS NPSFCRLEDRVELISNIVNN----CGRNVIYTFCINGDHHTILDRAKFVAENGGNGIHIN FW-SGLGVYNSVRKMDLPLFIHYQKSGDKILTDKRHFGIDWDVLCDLAGLCGVDTIHAGM WGGYLSDDEDELRQTMATLH-----------RRNVLPALSCGMHPGIVNTTAEKFGTDFL ANCGGAIHGHPEGTLAGALAMRQAI----DKNPGKEFRA-------AIDKWGYET >imgVR_3300000116_____DelMOSpr2010_c10002632_7_PHAGEputative_phage -MKF----FRKLKNRVIATFYIETSIGN-LRDAAWALAIGQSVGNP-KVR-NRWESDELS SCVIYEGCGE-VKIGFPKVNTDWGGISHLMCQLMGGQLDIDIFKTCRLKKLDFPIDVESH FLGPKNGIDGIRKFVNRYDKPLSGAIVKPKTGISPDTLSEMVKELLDGGVDFIKEDEILS NPSFCRLEDRVELISNIVNN----CGRGVIYCFCINGDHHTILDRARFVADNGGNGIHIN FW-SGLGVYNSVRKMDLPLFIHYQKSGDKILTDKRHFGIDWSVLCDLAGLCGVDTIHAGM WGGYLSDDEEELHTVMNTLH-----------RRNVLPALSCGMHPGIVNTTAEKFGTDFL ANCGGSVHGHPGGTLSGALAMRQAI----DKDPGPEFRA-------AIDTWGYET >imgVR_3300005605_____Ga0066850_10002422_3_PHAGEputative_phage MKFFRE--LTEQEKNVVATFYIETNIGN-LRDAAWALAIGQSVGNP-KVR-NRWETDELA SCVIYEESGE-VKIGFPKVNTDWGGISHLMCQLMGGHLDIDVFKTCRLRKLDFPVDVEAQ FLGPKYGIDGIRRFVNRYDKPLSGAIVKPKTGISPQTLSEMVKELLDGGVDFIKEDEILS NPSFCRLEDRVELISNLVND----CGRGVIYCFCINGDHHTILDRAKFVADNGGNGIHIN FW-SGLGVYNSVRKMDLPLFVHYQKSGDKILTDKRHFGIDWSVLCDLAGLCGVDTIHAGM WGGYLSDDEVELKTVMDTLH-----------HRNVLPALSCGMHPGIVNTTAEKFGTDFL ANCGGAVHGHPGGTLSGALAMRQAI----DKTPGVEFRA-------AIDKWGYET >imgVR_3300006026_____Ga0075478_10000002_49_PHAGEputative_phage MKFFRE--LTEQEKNVVATFYIETNIGN-LRDAAWALAIGQSVGNP-KVR-NRWESDDLA SCVIYEESGE-VKIGFPKVNTDWGGISHLMCQLMGGQLDIDVFKTCRLRKLDFPADVEAQ FLGPKYGIDGIRRFVNRYDKPLSGAIVKPKTGISPQTLSEMVKELLDGGVDFIKEDEILS NPSFCRLEDRVELISNLVNE----CGRGVIYCFCINGDHHTILDRAKFVANNGGNGIHIN FW-SGLGVYNSVRKMDLPMFVHYQKSGDKILTDKRHFGIDWSVLCDLAGLCGVDTIHAGM WGGYLSDDEAELHTVMDTLH-----------HRNVLPALSCGMHPGIVNTTAEKFGTDFL ANCGGAVHGHPGGTLSGALAMRQAI----DKNPGEEFRA-------AIDKWGYET >imgVR_3300009182_____Ga0114959_10000188_47_PHAGEputative_phage IIFDIN--IDPT--EVIATYFLEST---TLEKAAWELAIGQSVGNP-NVR-NEWETDDKY SC-KV-M-HDIVKIAFPIINTEEDGISHLLCQLMGGQMDIDNVIKCHLLKLDFPQKIVEY FKKPKYGIDGVRKYTN-VNKPLLGGIVKPKTGITPDVLLAMVKQMVEGGVNFIKEDEILS NPSFCEIKDRVPLIMNYLNQRVIDGYDPVVYAVCINADSPYLLDRVKQVYELGGNAVHIN FW-CGMGSYLSVRKLDREFLIHFQKSGDKILTNVNHYHIDWKVICQLAGLSGVDFIHAGM WGGYMSDDEEELRQVLEILH-------RHN----VIPALSCGMHPGVVNAIVKRFG-NFM ANCGGSIHGHPGGTISGATAMKQ----AIKTFG-NEYDE-----AI--QKWGLTE >imgVR_3300001450_____JGI24006J15134_10002570_9_PHAGEputative_phage MDFYNKID----QNKVVATYYVET----DLKKASWELAIGQSVGNP-HER-SQWETSEKH SCIILEAQGK-IKIAFPNINTSTDGITQMLCQVMGGQMDINSFSSCRLIDLKIPSAVKKY FLGPKHGIEGIRKFTGCIDKPLSGAIIKPKIGLDPSTLLQVVKDLYKGGVDFIKEDEIMS NPAVCPIEERVPLIMDWLNKQERK----IIYAVCINGDHDQVLYRSMQVRALGGNAVHIN HW-AGLGVYNAIRKLDSGLFIHFQKSGDKVFTDKSHFGIDWKVICKLATMSGVDTIHIGM LGGYADDSQEEIKDIYSNVV-----------EAGTMPALSCGMHPGLVHSTVQTVG-DFI ANVGGAIHSHPGGTLAGAKAMRQAI----DDEPGPEFVQ-------ATNKWGVQ- >imgVR_3300009155_____Ga0114968_10000900_19_PHAGEputative_phage MRIYRS------ECDFVVDYYLESKTTLA--EAAWNLAIGQSVGNP-NQR-NAWESDEKH SCIVLEAKGH-VSIAFPVVNTKEDGISHLLCQIMGGQLDIDIITKCHVDRIRFPKSIESQ FNKPTFGIDGIRSFTGVHGKPLLGGIVKPKTGVSADVLLEMVKQMVDGGVNFIKEDEILS NPAFCSIEERVPKIMKYL------DGKRVIYSVCINSDPAYSVKRAQLVHELGGNSVHIN FW-SGLGVYKSIRDLQLPLFIHFQKSGDKILTNQNHFHISWNVVCDLAGLMGVDFIHAGM SGGYS--TTSDLELRLAIDR---------LHKRNVMPALSCGMHPGLVQSINGKFGLDYM ANVGGAIHGHPMGTSGGARAMRQAI----DGCHGDEYKL-------AIEKWGLYE >imgVR_3300013093_____Ga0164296_1000036_79_PHAGEputative_phage DLIFLEIN----KEEIIATYELEGLNSLA--DAAWELAIGQSVGNP-NVR-NKWETDENY SAKVIRERGL-INIAFPVINTKEDGITQLMVQLMGGQLDIDNIKYCRLLKLEFPESVKSA FLGPKYGIKGIREYIGVQDRPILGGIIKPKTGITPDILLKMVQELVEGGVNFIKEDEILS NPDFCPISVRVPLIMDYIKK----SGKKVIYAVCINSDFPYVINRVKQVYELGGNAVHIN FW-NGLGVYKAVRELDLPIFVHFQKSGDKILTDKTHFSIDFSVICQLAGMMGVDFIHAGM WGGYSSTEKDELSNILSELY-----------KHNVMPALSCGMHPGLVGAIENQFSIEFM ANTGGAIHGHPGGSKNGTIAMRSAIDKNY---DCEQYKI-------AIEKWGLV- >imgVR_3300013093_____Ga0164296_1001304_19_PHAGEputative_phage DLVFLDID----KSRVIAFYDLEG----SLADAAWELAIGQSVGNP-NVR-NEWETDDTY SAKVI---GN-VQIAFPSVNTKEDGITQLMVQLMGGQLDIDNITYCRLLSLEFPDSVKSA FLGPKYGIKGIRDYIGIHDRPVLGGIIKPKTGITPEVLLKMVQELVEGGVNFIKEDEILS NPDFCPISVRVPLIMDYIKK----SGKNVIYAVCINSDFPYVIDRVKQVHELGGNAVHIN FW-NGLGVYKAVRELDLPIFVHFQKSGDKILTDKNHFSIDFSVICQLAGMMGVDFIHAGM WGGYST-EKDELTGILNELY-----------KHDVLPALSCGMHPGLVGAIENQFG-QFM ANTGGAIHGHPNGSKAGATAMRSAIDKNT---ACEQYKT-------AIQKWGIVE >imgVR_3300001450_____JGI24006J15134_10000144_30_PHAGEputative_phage -MQF----YNPNKSKVIATYFMKS----DLRKVSWDLAIGQSVGNP-NVR-NRWETEERS SCIVHDNEGK-VKIAFPIINTDWGGISHLLCQLMGGQMDIDTFDSCRLIDLEFPAEIKSK FLGPKYGISGMREYTGQYDKPFSGAIVKPKTGMDANTLLDMVKELVDGGCDFIKEDEIMS NPSFCPIEERVPLIADWMAKQSKK----VVYAVCINGDHDHILKRATKVSELGGNAVHVN FW-AGLGVYGAIRRLDLPLFIHFQKSGDKVITDTRHFGIDWNVICQLAGMQGVDTIHAGM WGGYLSDDEDDLKKTISTLH-----------NHNVVPALSCGMHPGLVQANVRQFGNDFI ANVGGAIHGHPGGTLAGAKAMRQSV----DKTGGDEYEQ-------AIAKWGLIK >imgVR_3300006754_____Ga0098044_1000148_36_PHAGEputative_phage MLFYNDIN----QENIIATYFIKSKNAD-LATCAWNLAIGQSVGNP-NVR-NQWETEEQS SCIVHKEQGK-VKIAFPIINTDWGGISHLLCQLMGGQVDIDTFDSCRLTQLKFPESVKEY FLGPTHGITGMRDYTKRYNKPLSGAIVKPKTGMPAETLLNMVKELVDGGCDFIKEDEIMS NPSFCPLEERVPLISNWLNS----QSKKVVYAVCINGDHHHILKRTKMVADMGGNAIHVN FW-SGFGVYNAIRKMNTGLFLHFQKSGDKVITDKRHFGIDWNVICQLAGMMGVDSIHAGM WGGYLSDDEDDLRETINVLH-----------DHNVVPALSCGMHPGLVQANIKQFGNDFI ANVGGAIHGHPMGTLAGAKAMRQSI----DKTHDIEYEQ-------AIAKWGFVK >imgVR_3300017967_____Ga0181590_10000756_38_PHAGEputative_phage -MNFNI--FRSAETKTIATYKLRS--STDLREAAWALAIGQSVGNP-NVR-NKWETKENH SCVMHWVEDE-VKIAFPIANTDWGGISHLLCQLMGGQMDIGIITGCRLVDLQIPKHIEDM FKGPKYGIDGFRKFTQTFDKPLLGGIVKPKIGVTPEVLLEMVKEMVEGGINFIKEDEVMA NPSICPIRERVPLIMDYL------KDKDVIYSVCINGDAPHALERAKLVSELGGNSVHIN VW-SGLGVYKSIRDLDLPLFIHFQKSGDKVFTEKLHYGIDWKVICDLAGLMGVDSIHAGM WGGYME-EDQDLGGVIATLH-----------NRNVVPALSCGMHPGLVNAITEQFGIDYM ANVGGALHGHPGGTKSGVKAMRQSI----DKTYGQEYGE-------AINKWGLVK >imgVR_3300006749_____Ga0098042_1000004_15_PHAGEputative_phage IFRNININ----KDKILCDYFLSS--NKTLRDAAWGLSIGQSVGNP-NVR-NHWETDENH SCFVIEEEGE-VTIAFPIANTKTDGMSHLLCQFMGGQMDIDIVKKCHLLKVSFPKHIENY FLKPKYGIKGIRKFTSTQDKPLFGGIVKPKIGVNSDILLEMVKEMVEGGINFIKEDEIMS NPACCPIEERVPKIMEYL------KDKDVIYAVCINCDPHHVIDRVKRVYELGANAVHIN FW-SGLGVYKSIRELDLPIFIHFQKSGDKVLTNKSHYHISWDVICDFAGLMGVDFIHAGM WGGYMSDNEDELRETLRVLH-----------NRGVMPALSCGMHAGLIEAINKRFGIDYM ANVGGAIHGHPGGSRSGAKAIKQSI----DREYGKEYEI-------AIKKWGIVE >imgVR_3300006810_____Ga0070754_10000068_66_PHAGEputative_phage -MKSEL--FKSRADKFFAKYYVECSKNLA--SAAWNIAIGQSVGNP-SSR-SVWETDENH SCIVGDMTGE-VTIAFPVINIKTDGVAHLLVNVMGGQMDIAEIKKCKLIDIDFPECVSDC FMGPKYGIDGIRKFTGVYNKPLLGGILKPKTGVSPQVMLEMVKEMVEGGVNFIKEDEILS SPSFCKIEDRVPLIMDYL------KDKNVVYCVSIHSDPAYILERVKQVYELGGNGIHVN FW-CGLGVYKNIRELDLPLFMHFQKSGDRVITNPDHFSISWPFMCKLAGMMGVDFIHSGM IGGYYPADEEEVLAAIEQSR-----------NQGTLPALSCGFHPGLVDQINEQVGVDYL ANVGGAMHGHPSGTKSGAIAMRQAI----DGERGKEYRE-------AIYKFVNS- >imgVR_3300006810_____Ga0070754_10000142_45_PHAGEputative_phage -MKSDL--FKLKSENFFAKYYVEC-----LASAAWNIAIGQSVGNP-SSR-SVWETEEDH SCIVGDMSGE-VIIAFPVVNIRTDGVSHLLVNVMGGQMDIEEIKSCKLLDIEFPDCVSEC FLGPKYGINGIREYTGVYDKPLLGGILKPKTGVSPRIMLEMVKEMVEGGVNFIKEDEILS SPSFCRIEDRVPLIMDYL------KDKRVVYCVSIHSDPAYILERVKQVHELGGNGVHVN FW-CGLGSYKNIRELDLPIFMHFQKSGDRVITNPDHFSISWPFMCKLAGMMGVDFIHSGM IGGYYPADEEEVLKAMEESR-----------KQGTLPALSCGFHPGLVNNINEQVG-DYL ANVGGAMHGHPSGTKSGAIAMRQAI----DGIVGKEFRE-------AVGKFGNPK >imgVR_3300005662_____Ga0078894_10000568_30_PHAGEputative_phage ------NIYKKIKENLLVKYYLKSTTNLN--EAAWALAIGQSVGNP-KVR-NKWETEEKY SCIILQNEGI-VYIAFPTENLKQNGISHLLVNLMGGQLDIDVVDECHVLDIEFPNSILSE FKGPKYGISGIRKFTNSYDKPLLGAIVKPKIGVTPQTLLEMVKELVEGGVNFIKEDEIMS EFSLCPIEERVPLIMEYL------KDKNVIYSVSIHCDPDKILDRVKLVHSLGGNSVHVN FW-CGLGVYRSIRELDLPIFIHFQKSGDKIFTNKSHFHIDWRVVCKIAGLSGVDFIHAGM IGGYYKWDEQEVIDSCKILT-----------ELNVMPAISCGFNAGLTDMVNARLGVDYM ANVGGGIHGHPSGTLAGAKSMRQSI----DKIAGTEYIQ-------AIEKWGYQN >imgVR_3300009183_____Ga0114974_10000026_102_PHAGEputative_phage ---MNI--YKKLKDDLIVKYYIKSTTNLN--DAAWALAIGQSVGNP-KV-RNKWETEEKY SCIILKKEGI-VDIAFPTVNIKNNGISHLLVNLMGGQLDIDVVDQCHVLDIEFPDSILSE FKGPKYGISGIRKFVNSYDKPVLGAIVKPKIGVTPQVLLEMVKELVDGGVNFIKEDEIMS EFDLCPLEERVPLIMDYL------KDKNVIYSVSIHCDPDKILERVKLVHSLGGNSVHVN FW-CGLGVYRSIRELDLPIFIHFQKSGDKIFTNKSHFHIDWRVICKIAGLSGVDFIHAGM IGGYYKWDEQEVIDSCKILT-----------ELNVMPAISCGFNAGLTEMVNSRLGVDYM ANVGGGIHGHPNGTLAGAKSMRQSI----DKEYGDEYHQ-------AIEKWGVIK >imgVR_3300002835_____B570J40625_100027887_6_PHAGEputative_phage IFENIEID----ISKYIVKYFLKSTSSLR--DAAWQLAIGQSVGNP-NIR-NEWETDENH SCIILEELGE-VNIAFPVINIETDGISHLLVNIMGGQLDIDIIEKCQVLDIKFPDSVDKY FKGPKFGISGIRNFTKVKDKPIFGSIIKPKTGISPNVLLEMVKQLVEGGVNFIKEDEILS DPNFCKIEDRIPLIMDYL------KDKNVIYAASIHSDSPYLLDRVRKIYELGGNGIHVN FW-CGLGSYKAIRELDLPIFIHFQKSGDKILTNKAHFHIDWKVICKLIGMMGVDFAHAGM IGGYYKWDEKEVIDSVNVLQ-------KY----NVMPALSCGFHSGLTQMVTDKLGVDYM ANVGGAIHGHSGGTVSGAKEIKASIERVI-------------------------- >imgVR_3300009164_____Ga0114975_10000143_17_PHAGEputative_phage FRKKIN------KEDFIVTYFLES--TTTLRDAAWNLAIGQSVGNP-NVR-NQWESDENH SCIILEQSGT-IEIAFPIINIKTDGISHLLVNIMGGQLDIDNIVKCQVLNIVFPESVEKL FLGPKFGIKGIREYTKCFDKPLFGAIVKPKTGISPQTLLEMVKELVEGGVNFIKEDEILS NPSFCTIEERVPLIMDYL------KDKNVIYAVSIHADYPYVIDRVKKVYELGGNAVHIN FW-CGLGVYKTVRELDLPIFIHFQKSGDKILTNKNHFHIDWNVISKIAGMMGVDFIHAGM IGGYYKWDESEVLNSVETLH-------KY----NVMPALSCGFHPGLTEWVTSKVGIDYM ANVGGAIHGHPDGTLAGAKAMRQSI----DGEQSSEYQI-------AIEKWGKI- >imgVR_3300006790_____Ga0098074_1002816_5_PHAGEputative_phage --------MIFREKEFIVEYFLESTTNLR--DASWNLAIGQSVGNP-NVR-NKWESDEKY SCMVLEESGV-VKIAFPVVNIKTDGISHLLVNIMGGQLDINIISKCHVLDIVFPKSVKEL FLGPKFGIDGIRKYTNTYGKPLLGAIVKPKTGITPKVLLEMVRELVEGGVNFIKEDEILS DPSFCPIEERVPLIMDYL------KDKNVIYSVSIHSDSPYILDRVKRIYELGGNSVHVN FW-CGLGIYKAIRELDLPIFIHFQKSGDKILTNKNHFHIDWRVICKLAGMMGVDFIHAGM IGGYYKWEESETLDSIKILK-------DY----NVMPALSCGFHPGLTEWVTGLVGTEYM ANVGGSIHGHPGGTVSGTKAMRQSI----DNIEGVEYNE-------AIKKWGKK- >imgVR_3300001346_____JGI20151J14362_10002242_5_PHAGEputative_phage ------LSLRNKLDKFLVKYYLES--KTTLEDAAWNLSIGQSVGNP-KVR-NQWETDENH SCIIIDPSGN-VEIAFPVINIKTDGIAHMLVNIMGGQMDIDDIEKCQVLDIIFPQSVKDC FLGPKFGITGIREFTGVKDKPLFGAIVKPKTGITPEVLLEMVKELVEGGCNFIKEDEILS DPIFCPIEKRVPLIMDYL------KDKNVIYCVSIHSDYPYILDRVKQVYELGGNGVHVN FW-CGLGVYKAIRKLDLPIFVHFQKSGDKILTNKNHFHIDWRVMCKLAGMMGVDFIHAGM IGGYYKWDEQEVIDSVNILH-------EY----NVMPALSCGFNSSLTQIVTDKVGSNYM ANVGGAIHGHKDGTLAGALKMKK---------SIEDLK----------------- >imgVR_3300009181_____Ga0114969_10002748_1_PHAGEputative_phage ------------------------------------------------------------ ---------------------ER-----------------------------V------- --------------------PLIM--------------------------DYLKDKN--- ----------------------------VIYCVSIHSDYPYIIDRVKKVYELGGNGVHVN FW-CGIGVYKAIRELDLPIFVHFQKSGDKIITNKNHFHIDWRVICKLAGMMGVDFIHAGM IGGYYKWDEIETIDAVKILN-----------EHNVMPALSCGFHPGLTKWVTDKVG-NYM ANVGGAIHGHPDGTTNGTKAMRQ----SIDGIYEKEYER-------AIDKWGKNN >imgVR_3300006793_____Ga0098055_1001775_7_PHAGEputative_phage MKLYRE--TIPE--DFSCEF-FLMSDST-LRQAAEALAIGQSIGNP-AVR-SKYETLENH SAKIIPDSGI-VEIAWPYRNIDWAGIAQLMCTVMGGQMDIDIIKQCHWIDVN-IDRGKSG LKMPSYGLSGFRNHVQSYEKPLLGTIVKPKTGLTADTLRDIVQQMVDGGVDFIKEDEIMS NPACLPLEERISIVQPILESYDTV------YCYCINSDPHTLMDKARAVSQSGGMGVHIN FW-SGMGSYKAIRDEDNGTFIHFQKSGDKVLTSKHNYRIEWSVMCKLAGLIGCDTIHAGM YGGYMDMGYEELKNIQNICL-----------EQNMVPAFSCGMTKELIPTIREKFGNDWM ANVGGAIHSHPSGIETAVKELRTVI----DNG----------------------- >imgVR_3300006193_____Ga0075445_10001210_10_PHAGEputative_phage MELYRT--TKPE--DFSCKFFLMSKTTLK--EAAEALAIGQSIGNP-SVR-SKYETPENH SAKVVDPTGV-VEIAWPYRNIDWAGIAQLMCTVMGGQMDIDIIQQCHWIDID-VDKTKTG WKGPSYGLTGFREHVQSFDKPLLGTIVKPKTGLTPEILKDIVQQMVDGGVDFIKEDEIMS NPACLTLNQRIDIVQPILDRSKTV------YCYCINSDPHTLMDKARTVSSRGGMGVHIN FW-SGMGAYKAIRDEDNGTFIHFQKSGDKVLTSKYNYRIEWSVLCKLAGLIGCDTIHAGM YGGYMDMGYEELKNIQNVCL-----------EQNMVPAFSCGMTKELIPGIREKFGNDWM ANIGGAIHTHPSGIEVAVKELRAVI----DNG----------------------- >GS605_3p0_scaffold_7_49_PHAGEputative_phage MELYRT--IKPE--DFSCNFFLMSKTNLK--EAAEALAIGQSIGNP-SVR-SKYETAENH SAIIADNAGV-VEIAWPYRNIDWAGIAHLMCTVMGGQMDIDIIQQCHWIDIH-IDKTKAD LQGPAYGISGFRDHVEQYDKPLLGTIVKPKTGLTPETLKDIVSQMVDGGVDFIKEDEIMS NPACLTLEQRIAIVQPIL------DGSRTVYCYCINSDPHTLMDKARSISGWGGMGVHIN FW-SGMGSYKAIRDEDNGTFIHFQKSGDKVLTSKYNYRIEWAVLCKLAGLIGCDTIHAGM YGGYMDMGYEELKHIQNVCL-----------EQNLTPAFSCGMTKELIPVIREKFGNDWM ANVGGAIHSHPDGIEKAVKELRAVI----DNG----------------------- >imgVR_3300006193_____Ga0075445_10002989_10_PHAGEputative_phage MEIYRA--NKPE--DFSCHFFLQS--KTTLKEAAEGLAIGQSIGNP-SVR-SKYETTENH SAKIIPDAGV-VEIAWPYRNIDWAGIAQLMCTVMGGQMDIDIIQQCHWTDIN-IDRVKAD LSAPAYGLTGFRDHVQQYDKPLLGTIVKPKTGLSPATLKDIVQQMVDGGTDFIKEDEIMS NPACLTLEERIGIVQPII------DGSKTVYCYCINSDPHTLMDKARTVSGWGGMGVHIN FW-SGMGAYKAIRDEDNGTFIHFQKSGDKVLTSKYNYRIEWAVMCKLAGLIGCDTIHAGM YGGYMDMGYEELKNIQNVCL-----------EQNLTPAFSCGMTKDLIPVIREKFGNDWM ANVGGAIHTHPKGIRAGVEELRKAI----DNG----------------------- >imgVR_3300001472_____JGI24004J15324_10000002_33_PHAGEputative_phage LDIYVD------AKEFTVTYKLEG--ND-LQRAAFGLAVGQSVGNP-SVR-NEYETPEQY AAKIIPEAGKIVKIAWPYRIIETDGIAQLMCVLMGGQMDIDYIHRCELLDLDIDFELCDA VK-PRYGLSGFRKLVGNYNKPLLGSIVKPKTGLTEKALIEIITAFVEGGVDFIKEDEIMA NPACLPLKTRIDVVQRAL------KGTDVVYCYCINSDPLHLLERAKMVVDGGGNGVHVN FW-SGHGAYKSINKLNLPLFVHYQKSGDKVLSSEKHYRISWYVLCKLASICGVDTIHTGM WGGYLSDDENELRRTMHMLS-----------NNNVVPALSCGMTAELIPEITRRFGVDYM ANVGGAIHEHPDGMKAGALKLRKAIDSTQ-------------------------- >imgVR_3300001589_____JGI24005J15628_10000085_3_PHAGEputative_phage LDIYVK--VDPK---FTVTYYLEG--ND-LDRAAFGLAVGQSVGNP-GV-RNDYETPEQY AAKIVKSAGT-VKIAFPYRLIDWTGIAQFLCVVMGGQMDIDYIHRCELLELDIDYALCDA VE-PRYGLSGMRALTGNYNKPLLGSIVKPKTGLTGRALQEIVSAFVEGGVDFIKEDEIMA NPSCLPLTQRIELAQEVL------KGSKVVYCYCINADPLHVLDRAKQVADGGGNGVHIN FW-SGHGVYKSINQMNLPLYIHYQKSGDKVLSSTKNYRIAWSVLCKLASISGVDTIHTGM WGGYLSDDENELRRTMQMLS-----------SNNTVPALSCGMTAELIPEVTRRFGVDYM ANVGGAIHEHPDGLAAGAKKLRAAI----DNT----------------------- >ABC22798_Rhodospirillum_rubrum_IV_DeepYk_REFreference --------MTDR---LRATYRVKATAAS-IEARAKGIAVEQSVEMPLAIDDGIVGVVEER GEDCF-----EVRLALSTATI--GGDAGQLFNM---LFGNTSLQDTVLLDIDLPDDLLAS FGGPNIGAAGLRARVG-ADRALTCSALKPQ-GLPPDRLADLARRMALGGLDFIKDDHGMA DQAYAPFASRVGAVAAAVDEVNRQTGGQTRYLPSLSGHLDQLRSQVRTGLDHGIDTFLIA PMIVGPSTFHAVVREFPGAAFFAHPTLAGPSRIAPP-----AHFGKLFRLLGADAVIFPN SGGRF-GSRDTCQAVAEAAL------GPWGGLHASLPVPAGGMSLARVPEMIATYGPDVI VLIGGNLLEARDRLTEETAAFVASVAGASRGCGLAP------------------- >YP_532057_Rhodopseudomonas_palustris_BisB18_IV_DeepYk_REFreference ----------PR---IVATYHIASDAER-IEQRALALAIEQSVECPLAIGANIVGRVEEL QPGRYAV---RIGLAAATAPAEPGQLLNM-------LFGNSSIQPIALADVELPAHYLTA FGGPRVGLAGIRTLTGAQSRALTASALKPQ-GLSPAALASIAHQLALGGVDLIKDDHGLA DQAFSPFAERAAAVGKAVREANAARGGRTLYAPNISGTLDDMRRQLGVIRDEGG-AVLVA PMIVGVSNFHAIVKEAAGLVVVAHPAMAGA------AKIAALLLGRLFRLFGADATVFPN YGGRFAYSTASCLALAQAAR------DPFGKLNACIPTPAGGIMLQRVNELLRFYGQDVM LLIGGSLLASRERLTEQASRFVN---------KVADYGQ-------R-------- >YP_782588_Rhodopseudomonas_palustris_BisA53_IV_DeepYk_REFreference --------SSPR---IIATYHLASDADL-VAERAKGLAIEQSVECPLAVGDTIVGRVEEL TPGRYAV---RIGLAAATAPAEPGQLINM-------LFGNSSMQQVTLVDVELPPSYLAA FGGPRLGLAGLRALTGTQDRALTASALKPQ-GLSPAALARIAGQLARGGVDVIKDDHGIA DQAYSPFAARVEAVGRAVQEANVARDGHSLYAPNLSGTLDDMRRQLDLAREQGIAAVMVA PMIVGLANFHAVVKAAAGMVVIAHPALSGVTRIAPP-----LLLGKLFRLFGADATIFPN AGGRF-GSTETCLQLARAGR------DPWGALEPCMPVPAGGLTADRVAELLQFYGHDVM LLIGGGLLSARERLTEEAARFVN---------EVAAHGK-------A-------- >CAE27610_Rhodopseudomonas_palustris_CGA009_IV_DeepYk_REFreference ----MD--MSER---IIVTYQVAAAPAE-IAARAEALAIEQSVECPATEQQIRDEIVGAI AP-----IGE-TRFSVRVSLASA-TAPAEPGQLLNMLFGNSSIQPVTLADVELPPAYPAA FGGPKLGISGLRAKLGAPRRALTGSALKPQ-GLAPEALAGLAHRLALGGVDLIKDDHGIA DQAFSPFAARVPAVARAMREACAVRGAAMLYAPHVSGSLDDMRRQLDIVRREGSVVMLMP MI-VGLANFHLIAKEAEGLIVLAHPSLAGAQRIAPD-----LLLGKLFRLLGADATIFPH YGGRFAYTPETCRALADAAR------RDWHDLKPCLPVPAGGIAIDRIKELLAFYGTDVM LLIGGSLLAA--GEQLTEHAARF--TA-----EVASHGQ---------------- >YP_569369_Rhodopseudomonas_palustris_BisB5_IV_DeepYk_REFreference --------MTDD---FIATYQVTSEPAR-IAERAQALAIEQSVECPLAI-GDPWVRDARV AG---EIEGRAVRIAL-----ASATAPAEPGQLINMLFGNASIQPVALTDVELPPLYLKA FGGPKLGSDGIRALAGAHGRALTASALKPQ-GLSPQALAAIAGRLAAGGVDLIKDDHGLA DQAYSPFAARIAAVWRAVRNSNDSSGRRTLYAPHVSGSLDDMRRQLDGIRAEGPALMMMP MI-VGLSNFHALAKEAEGLVVLAHPALSGATRIAPP-----LLLGRLFRLFGADATVFPH HGGRFASTPQTCRALADAAR------GAWGGLKPCLPAPAGGIAIDRVAELLSFYGEDVM LLIGGSLLAARE--QLSEQAARFAA-------EVAAHGR---------------- >YP_001002057_Halorhodospira_halophila_SL1_IV_DeepYk_REFreference --------MSAE---LRVTYYLTCRPGEDPHDKAKGIALEQSAELPC---IPEHVYDDTI QETAL--DGRRLVLDFPEAIT--GLEPTQLI---NNLFGNISLKSIRLADVEWTPNLLRA LGGPRYGTAGVREMLGIGERPISSTALKP-LGLDTATLAGFCADFARGGIDLIKDDHGLC DQDTSRFVDRVQACQRAVNEVNAETGGRSLYLPNVTGPRWELDKRLDAAQEAGCKAVLIC FL-TGLDALIWARER-YDMALMAHPAFAGAVAGAE-HGIDPLLLGEITRLFGADMVVYTN AEGRFP-TYDQALRINDRLR------RPLGDIRPALPTPGGGVDAARAPYWAERYGPDVV LLIGGSLYAQG-DRAAAARRLQDVV----EGQ----------------------- >YP_742007_Alkalilimnicola_ehrlichei_MLHE_1_IV_DeepYk_REFreference ------------MTDLTATYELYLAEGESPEGKARGIALEQTVEMPACLPDIAERMVGTL EPADHWV----LEIDYPLAAI--GELTQCLNLLFGN---ISLQSGIRLVQVAWPPSLLRR WGGPGLGVSGLRARLDVGARPLLCAALKP-MGLSAPALAARCAAFARGGVDLIKDDHGLA DQPDAPFAERLNACQDAVRQANRRSGGRSLYLPNVTAAPQALGERLAAARDAGCEMVLIS PWLTGLETLRWARDE-YGLALMAHPAMGGLFLPR--HGISPLLLGELFRIAGADAVIYPN VGGRFF-SADTCQAINHALR------RPLEGLASAWPTPGGGVDVKRAGHWKQAYGPDTI LLIGGSLYAQG-DIEAASRALMQ---------AIRD------------------- >CG_2015-09_scaffold_5489_2unknown MRFTIE---------LVTCLQGET-----IQEKALGIALEQSVELPVDAWAGKVESIKET GEKTY-----LVECSYAVSTIDG-DLTQFLNVLLGN---ISLKPGIQVINVDW-EPIQDW FPGPRFGVEGVRINMGIPNRPLSCSALKP-LGLSIDKLADLAFQFATGGIDIIKDDHGIA DQRHSPFTERVKACVEAVNHSYNLTGKQTWYFPNVTTNGSKLIDRFRQAEDLGAHGVLLC QL-CGLEMMADLASSDVNLPIMAHPAFSGTYLGAI-NGFSHFQYGSLWRAFGADFVIYPN NGGRFSFTLDECIGINQAAL------DPMVPYARTFPTPGGGVDRNKIPEWRAKYGNDIV FLMGASLYKHPEGIKAATIEVQEALNQ---------------------------- >ABH04879_Heliobacillus_mobilis_IV_DeepYk_REFreference ---MIS------GERFSVIYRLVGKEAI---KKALDICLEQTVEFP----EELVPRGMHV VGRIESVQASRVTISYAVE--TAGELTQLLNVIFGN---ISIKPGIVVEEIYLPESLLKS FRGPRFGRQGLRKLLGVPHRPLLFTALKP-MGLSTAELAKMAYECALGGIDIIKDDHGLT NQVFAPYEERVRLCTQAVAKANQETGFKAIYVPNVTAPYNQLFQRARFAKEVGAQGLMIS AL-AGLDTMRELADDDLALPIFSHPAFQGSYVTSAENGISHALFGQITRLAGADGVVYPN FGGRFSFSREECRRIAEGTE------TPMGHIQPIFPCPGGGMSLEAIPDMLQVYGRDVT FLIGGGLFRHGPDLVENCRYFREMV----EKM--------------V-------- >BAB53192_Mesorhizobium_loti_IV_Non_phot_REFreference ---------------ITLTYRIETSESIE--ALAAKIASDQSTGTFV---ALPGETEERV AARVLAISNG-VDIAFPFDAIGT-DLSALMTIAIGGTYSIKGLSGIRIVDMKLPDDYRGA HPGPQFGVAGSRKLTGVEGRPIIGTIVKPALGLRPYETAEMVGELIAAGVDFIKDDEKLM SPAYSPLAERVKAIMPLVRDHEQKTGKKVMYAFGISHADDEMMRNHDLVLKAGGNCAVIN INSIGFGGMAYLRKR-SGLVLHAHRNGWDILTRHPGLGMDFKVWQQFWRLLGVDQFQING IASYWE-PDASFIESFKAVTTPIFSPDDC-----ALPVAGSGQWGGQAPETYQRTGTDLL YLCGGGIVSHPDGPGAGVRAVRQAWQAAVDGIPLAKYARSHAELARSIEKFGDGK >CAC48779_Sinorhizobium_meliloti_1021_IV_Non_phot_REFreference ---------------LRITYRIET-PGD-VEFLAKKIASDQSTGTFV---PVPGETEERV AARVLAIAGSDAEIAFPLDAVGT-DLSALMTIAIGGVYSIKGMTGIRIVDMKLPPEFAAA HPGPQFGVVGSRRLTGVEGRPIIGTIVKPALGLRPEETAVLVGELLSSGVDFIKDDEKLM SPAYSPLSARIAAIMPRILDHEQKTGKKVMYAFGISHTDDEMMRNHDLVVAAGGNAAVVN INSVGMGGVAFLRKR-SGLVLHAHRNGWDILTRHGGLGMEFSVWQQFWRLLGVDQFQIGI RVKYWE-PDESFVKSFEAVS-----TPLFTKDDCPLPVVGSGQWGGQAPETYARTGTDLL YLCGGGIVSHPGGPGAGVRAVRQAWEAAVAGIPLADYARDHPELAQSLEKFADGK >CAE31534_Bordetella_bronchiseptica_RB50_IV_Non_phot_REFreference --------MSDR---FEATYLIETPHDVA--SVAQELAGEQSTATSS---RMPGETDADF GARVEEALGQ-LTLSWPLHNIGD-SLPMLLTTLLGNQTGMRRLSGIRLERVAMPQSFIAA QPRPAFGIAGTRRLTGVQGRPLIGSIVKPNIGLAPEQTAAMARQLAEGGVDFIKDDELLA NPPYSPVARRAALVLRALDEAAQRTGRRTMYAVNITDGLDEMRRHHDAVVQAGGTCIMVN LNSVGLSALLALRRH-SQLPIHGHRAGWAMMTRCPALGMEFQPYQMLHRLAGVDHLHVSG LGGKFWEHADSVLQAAHECL-TPLDTQAGAADDRALPVFSGGSTIFDVAPTYQGIGTDLI FASGGGIFGHPDGLAAGCASLRQAWEAAIAGQELRAYAQSRPELAAALARGKPVR >Mesorhizobium_sp._L2C085B00_REFreference --------MPDR---IRCTYLLESPIDIG--RAAELLASEQSIGSFTVAGETDEIIERRV DERVVDTSRCIATISFPVENFGP-NLPSLFAILLGNLFELQELTGVRLEDVDIPASFADR FPGPAFGISGTRRLSGV-VSPIIGTIVKPKLGLEPAAIADLVQMLGDAGIHFIKDDECMT DPPSAPFSKRVEAIAPVLDRLADRHGRKVMYAFNISDDGDRMCRNHDLVANSGGTCVMVS LNACGPAAIAQLRRH-ASLPIHAHRAGWGMLTRHPALGLSFTAYQKLWRLAGIDQIHIGG IQGKFSEVDATVTQSARACL------GPFGDHAPIMPVISSGQWGGQAPVTFKAFGSDVI YLAGGGILGHPLGPAAGVRAISEAWEAARLGIDLVDYARSHEPLASSIGFFGNLR >ZP_01056409_Roseobacter_sp_MED193_IV_Non_phot_REFreference ------------MNRITATYDIESPVGVA--RAAEVLAGEQSTGTFTLAAETDALRERRI ESKITSARGT-VTISWSLDSFGT-SLPTLMSTLAGNLFELAELSAIRLIDMQLPSAFATD HPGPQFGAQGTREFMG-GHQPVIGTIIKPSVGLTPSETAALVQTLVDAGVDFIKDDELQA NGPHCPFDERVKEVSRVLNAHADKMGKRVMYAFNLTDEIDQMWRNLELLETYGGTCAMVC MSSVGLTGLRALRNR-SSMTIHGHRAGWGIYSRSPDIGISFPVMQKLWRLSGADHLHVNG LANKFT-ETDDVI-AQSAIS-VQTAVADCGPAHLAMPVYSSGQTIWQIDPARSLLGNDFI FCAGGGILSHPSGAAAGVIALRQAAEAARSGTDIKQYAKEHPELQGAVETFQRSR >YP_511005_Jannaschia_sp_CCS1_IV_Non_phot_REFreference ------IRFEAD---LIETP-GDP----Q--AATETMAGEQSSGTFV---SVPGETAERA AARVERL---RVTLSWPMDSIGA-SLPNLMTTVGGNLFELRAFSGLRVEDIRLPAAFAAA GPGPRFGPDGTRRLAGV-EKPLIGTIVKPSVGLSPENTAGLVAEMAEAGLDFIKDDELQA DGPACPFDARVRAVMDVLNRHADRTGKKVMYAFNLTGEVDEMRRRHDLVRDLGGTCVMVS MLPVGLSGFLALSRH-ADLPIHAHRNGWGALSRHPALGWSYVAFSKLWRLAGADHLHVNG LSNKFCEPDESVTASARACL------SPLGHPMLAMPVFSSGQTAAQAAPTLAALGSDLI HAAGGGILGHPDGPRAGVAAMRSAW-AAMHG------------------------ >ZP_01438569__Fulvimarina_pelagi_HTCC2506_IV_Non_phot_REFreference --------MVER---LTVDYSLRTQDD--PRKVAETIAGEQSSGTFI---ATPGEDAARA AATVEILSGGGLRLSWPMTNFGP-SIPNLLATIAGNLFELKGVAGLKIQDIHLPESFKTA YLGPGFGIEGTRRVAGV-SRPIVGTIIKPSVGLDPDETASLVEALCEGGIDFIKDDELQA DGPHCPFDERLRAVMAVIDRHADRLGRKVMYAINLTGEIDEMLRRHDLVAETGGTCVMVS LNSVGPTGLTALRRH-SSLPIHAHRNGWGYLGRSPDNGWSYRAWHKIWRLAGADHMHVGI ANKFWE-PDDSVIESAKACL------DPLDRPDRVMPVFSSGQTVMTAHPTFERLGADLI FTAGGGIVAHPQGVAAGVRSLREAWEAAVAGVPLEEAAKNSAALSKAIETYS--- >BAD64310_Bacillus_clausii_KSM_K16_IV_Ykr_REFreference ------------MAQINAVYYVSDSNEF-IEEKAEKMATGLTAKPWQ---EMPEAEKQAY KGKVVSIEGSIVTISFPVAYEVP-DFPSILTTTYGR---LSYEPNVKLLDLQFSNDLVER FPGPLYGIEGIRDLVEVEGRPLAMSVAKGAIGRSIDSFHEQMLAHSYGGIDIIQDDERLF EHNWTPYEQRVPAGLAAIAEAAERTGRTPLYVVNLTGKTFELKERAREAIGLGAPALMLN VYAYGIDVLQGLREDPIDVPIFAHSSLTGMMTRSKQHGIASLLLGKLLRMAGADAVLFPS PYGRI-GNPEEAQRVKDQLT---------TQMKRAFPIPSAGIDFQTIATVRQDFGEDVI INLGGSVHRYKGGVEAGGKAFIEAL--------------NSAN------------ >AAU16474_Bacillus_cereus_E33L_IV_Ykr_REFreference -------------MSIIATY-LIHD-SHNLEKKAEQIALGLTIGSWTHLPHLLQEQLKQH KGNVIHVEHTIIKIEYPLLNFSP-DLPAILTT----TFGKLSLDGVKLIDLTFSDGLKKH FPGPKFGIDGIRNLLQVHDRPLLMSIFKGMIGRNIGYLKTQLRDQAIGGVDIVKDDEILF ENALTPLTKRIVSGKEVLQSVYETYGHKTLYAVNVTGRTFDLKENAKRAVQAGADILLFN VFAYGLDVLQSLAEDEIPVPIMAHPAVSGAYSASKLYGISSLLLGKLLRYAGADFSLFPS PYGSVALEKEEALAISKYLT---------VFFKKSFSVPSAGIHPGFVPFIIRDFGKDVV INAGGGIHGHPNGAQGGGKAFRTAIDATLQNKPLHEVDDI--NA---LQIWGNPS >AAU23062_Bacillus_licheniformis_ATCC_14580_IV_Ykr_REFreference ------------MSELLATYLADP--GCDAEKRAEQIAIGLTVGSWTDLPLLKQEQLKKH KGRVVNVEETTVTIAYPEANFTN-DIPAVLTT----VFGKLSLDGIKLADLEFSRSFKQS LPGPKFGVYGIRKKIGEFERPLLMSIFKGVIGRDMEDLKEQLRQQALGGVDLIKDDEILF ETGSAPFEKRITEGKKVLEEAFEETGRKTLYAVNLTGRTMELKAKARKAAELGADVLLLN VFAYGLDVLQSFAEDDIPLPIMAHPAVSGALTSSPHYGFSHLLLGKLNRYAGADFSLFPS PGSAL--PKRDALAIYDECT-----KEDV--FKPTFAVPSAGIHPGMVPLLMKDFGIDHI INAGGGIHGHPNGAAGGGRAFRAVIDAVLEAEPVEEKAKRSPDLKLALEKWGRVE >CAB13232_Bacillus_subtilis_subsp_subtilis_str_168_IV_Ykr_REFreference ------SG----MSELLATYLTEP--GADTEKKAEQIATGLTVGSWTDLPLVKQEQMQKH KGRVIKVEGT-ITIAYPEINFSQ-DIPALLTT----VFGKLSLDGIKLIDLHFSEAFKRA LPGPKFGVYGIRKLLGEFERPLLMSIFKGVIGRDLSDIKEQLRQQALGGVDLIKDDEIFF ETGLAPFETRIAEGKQILKETYEQTGHKTLYAVNLTGRTADLKDKARRAAELGADALLFN VFAYGLDVMQGLAEDPIPVPIMAHPAVSGAFTSSPFYGFSHLLLGKLNRYCGADFSLFPS PGSAL--PRADALAIHEECV------RE-DAFNQTFAVPSAGIHPGMVPLLMRDFGIDHI INAGGGVHGHPNGAQGGGRAFRAIIDAVLEAQPIDEKAEQCKDLKLALDKWGKAE >AAM72993_Chlorobium_tepidum_TLS1_IV_Phot_REFreference AEDVKG--ASREMEQLVLDYYLESG-DIE--TALAHFCSEQSTAQWV---GVDEDFRLVH AAKVIEVYPVRVTIAHPHCNFGP-KIPNLLTAVCGETYFTPGVPVVKLMDIHFPDTYLAD FEGPKFGIEGLRDILNAHGRPIFFGVVKPNIGLSPGEFAEIAYQSWLGGLDIAKDDEMLA DVTWSSIEERAAHLGKARRKAEAETGEPKIYLANITDEVDSLMEKHDVAVRNGANALLIN ALPVGLSAVRMLSNY-TQVPLIGHFPFIASFSRMEKYGIHSKVMTKLQRLAGLDAVIM-- PFGRMMTPEEEVLENVIECT------KPMGRIKPCLPVPGGSDSALTLQTVYEKVGNDFG FVPGRGVFGHPMGPKAGAKSIRQAWEAIEQGISIETWAETHPEMVD--QSLLKKQ >ABB28892_Chlorobium_chlorochromatii_CaD3_IV_Phot_REFreference INGF----FASKMADLILDYYLEC-VGD-IETALAHFCSEQSTAQW----KRVDYDEDKH AAKVISLHATRVTIAHPHTNFGT-KLPNLLSAVCGEAFFTPGVPVVKLLDIHFPDSYLQA FEGPKFGIDGIRDLLKAYNRPIFFGVVKPNIGLSPAEFGEIARQSWLGGLDIAKDDEMLA DVTWSSLADRSRELGEARRNAEQATGEPKVYLANITDEVDRLLEQHDVAVRNGANALLIN ALPVGLSAVRMLAKH-TKVPLIGHFPFIAAFSRLEKFGVHSRVMTKLQRLAGLDSIIMG- -FGRMMTTEEEVMANVQECL------QDFGHLRRSLPVPGGSDSALTLEGVYRKVGSDFG FVPGRGIFGHPMGPKAGAASIRQAWEAIEQGVPLETYAQGRPELQA-MVDGAHGK >BAB44150_Allochromatium_vinosum_IV_Phot_REFreference SDWAGF--FADEREALFLDYYLEC-SGD-PELAAAHFCSEQSTAQW----RRVGSDEDRF GARVVQIGSDRLRIAHPHGNFGP-RLPNLLSAICGETFFSPGAPIVKLLDIEFPESYLAC FQGPQFGVAGLRERLQVHDRPIFFGVIKPNIGLPPEAFSELGHESWLGGLDIAKDDEMLA DTDWCPLDRRAELLGEARRRAEAATGVPKIYLANITDEVDRLVELHDRAVERGANALLIN AMPTGLSAVRMLRKH-AQVPLMAHFPFIAPFARLERFGVHTRVFTKLQRLAGYDVIIM-P GFGRMHMTDDEVRACATACL------EPMGPIKPSLPVPGGSDWAGTLRPLYEKLGTDFG FVPGRGVFGHPMGPRAGAASIRQAWEAIVAGETLEERAKRHPELSAAIAAFGKPA >YP_530146_Rhodopseudomonas_palustris_BisB18_IV_Phot_REFreference LDDYIE---------LDFTFE---CAGD-PREAAAHLCSEQSTAQW----RRVGVDEDRF AAKVLAMTGPRVTIAHPHGNFGP-KLPNLLSAVCGEVFFSPGIPLIRLEDIRFPDGYLAA FQGPQFGVQGLRDRLQAFERPIFFGVIKPNIGLPAAPFAELGYQSWKGGLDIAKDDEMLA DVDWCPLSERAAALGEARLRAERETGVPKIYLANITDEVDRLVPLHDLAVANGANALLVN AMPVGLSAVRMLRKH-ATVPLIAHFPFIAAFSRLAAYGIHSRVFTRLQRLAGFDVVIMG- -FGRMMTPEHEVLDCVRACL------EPMGPIKPCLPVPGGSDSAATLEGVYRKVGSDFG FVPGRGVFGHPMGPAAGAASIRQAWDAIASGIAVADHAKTHPELAAALQAFGAKK >14_1009_16_20cm_scaffold_137301_2Parcubacteria --------MSTVAAPLTVTFYFETAAGK-PEEILTHMCHEQTTTGYDTLAPRLAPYQARL RMPSADIESGYAEVDFPWASMSHGALEDALTFVMGESSHVKGMLKLRMVDLNFPEDMMRS LPGPRHGVPGVRKRLGVYHRPLLMGPMRPEVGLAPAEYARISYEALVAGADIVKDDELLV DPPYCPIKERAPLCAKAAREAEQKTGERKMWVLHLGCDIDEFFKIGESA---GVDGYMVQ RLTPSLLTFVRGR---TELPIIAHYSM--------------LTLMPWS------------ ----------------------------W------------------------------- ------THWHS-AVRY--------------------------------------- >rifcsphigho2_01_scaffold_5232_2unknown ------FQFLSTKSDVIAEYAIEPQ-AVSFQDACRHVAAAS-IG------VSPEAFVRGD APAVVSLDKG-ALIRYPADLFEPGNADQISHSLTGH-LQLPSVKRATLLDMQFPRTLVKS FPGPRFGVKGIRKLIGT-NQALLEASL---LQGSSYEFSMNAYEAWT-GMDIVADAPSLT NLPSNRFKERVFQTHRRKKKAEQLNGGHKMYMPNVTG--SETVKRADFAHTLGCEAVVLN L--SGFSSLQNLREADVPVAIQLQRSV-----RMPH--MSSHACAKLCRLAGADIVHIGS LPGPMNA--DASVHVHDVID---SLAKPWHGVKSSMGLAC-RVGASHIPGLVQRLGNDLV IQFTDAVSGHSHGVRKGAMACRQALDSHLHHIPLQVYGLKHSELRESLASLQPRF >Clostridium_asparagiforme_REFreference ------------------------M-DV-VKKAETMAVGQT-VGTWVPVPGVTDEMRETH MGRVVNIQIG-IQIAYPTVNFGP-QFPMMFTTLLGN--DASTSAQVKLVDLQLPENFLSH FKGPKFGIQGLRELTGVWDRPLLLNMIKPCTGLTPEAGAKIFYETALGGVDFIKDDELLA NPDFCPAAKRVKAYNEAAKAAYEKTGKETIYICNITDSSARLKDTLNAVLEAGAKAVMLC FSTVGYSTFRAISEQ-IPIPVMGHYAASGISNEGLNSGLSSLSIGKFPRMAGADLVMMNT PYGGYPLTRQQYFKTAHQLS--------LCHLKPTMPICGGGVHPGMVQRFIDDFGTDII LAAGGAIQGHPQGPAAGVRAMSQAVAAALTHTPLTTAAAHHPELKTALSLFSPTP >NODE_4604_length_2333_cov_5.460550_2 KLPIESPNALPDPEKVIATYYFGAI-GLPIKDICEMAAVEQSTGTWVLVPGETPQMRKKY VAKVIGIPKD-VQVAFPWVNFGQ-QIPMLLSTVVGN---ISMGGRVKVLDLRFPKSWLKG FQGPKFGTQGVFKVLGVKGRPLLNNMIKPCTGYPPEVGAKLLYEAARGGADIIKDDELIA DPVFNRITDRLPLYMDAIDKANSEKGEKTLYTINITDRIPKLMENAEAALEHGANALMIN YLAVGYSAVRQVTDDSINVPVLGHMDIAGAMSYSPISGLSTLVIGKLPRLAGVDIEVFPA PYGKAPLLKQRYIEMARSMN--------YHHIKPTMPMPSGGITPGHVPMVVEDLGLEIM IGTGGGIHAFPGGPSAGARAFRQAIDATVKGISLKDYAEDHPELKVALDKWGTGK >rifcsphigho2_12_scaffold_331687_2Pacearchaeota CRMMRNLNCQPSAHDLVAEYRVSP-------KSVKQIALQF---------LEHRPQTKRL EPTIFYLDQK-IKIAFPKEGFEEGNAPALLSSLTGNFFGAN--NGIRLQDINLPLDFLHS FKGPRLGVEGIRKLTKIKTRPFLGSV-----ALQTENH---IFSSLCDGMDVIQDYDTLT KRE--SFESHIKQLFKQRDNAEKETGMKKLVFPNITAETMEMLRRAKIVRNYGGEFVMVD LMTTGLSGLQTLRDH-WDQGIYANRSMHNEKSD---IGISNLVHAKLARMIGVDCLRLGT SLNSKETKLVECELSNHIVK---VLSQAWESTPAVLPVVLGNLNEHHLPSVFQTLGNDVV A-----LTENPQKAKI----FRRAMEAALHQIPLEDYAIGHPE-----LRMNFR- >ABA23512_Anabaena_variabilis_ATCC_29413_I_REFreference -YYTPD--YTPKDTDILAAFRVTPQPGVPFEEAAAAVAAESSTGTWTTVWTDLLTDLDRY KGRCYDIPGEIAYIAYPLDLFEEGSITNVLTSIVGNVFGFKALRALRLEDIRFPVAYIKT FQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENIN SAPFQRWRDRFLFVSDAISKAQAETGEIKGHYLNVTAPTEEMLKRAEYAKELNQPIIMHD YLTAGFTANTTLARWDNGVLLHIHRAMHAVIDRQKNHGIHFRVLAKALRLSGGDHIHTGT VVGKLEGERGITMGFVDLLRRGIYFTQDWASLPGVMAVASGGIHVWHMPALVEIFGDDSV LQFGGGTLGHPWGNAPGATANRVALEAVQGNDVIREAAKWSPELAVACELWKEIK >Spinacia_oleracea_NP_05494_REFreference --------YTPEDTDILAAFRVSPQPGV-PPEAGAAVAAESSTGTWTTVWTDGLTNLDRY KGRCYHIAGEICYVAYPLDLFEEGSVTNMFTSIVGNVFGFKALRALRLEDLRIPVAYVKT FQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFLFCAEALYKAQAETGEIKGHYLNATAGTEDMMKRAVFARELGVPIVMHD YLTGGFTANTTLSHYDNGLLLHIHRAMHAVIDRQKNHGMHFRVLAKALRLSGGDHIHSGT VVGKLEGERDITLGFVDLLRRGIYFTQSWVSTPGVLPVASGGIHVWHMPALTEIFGDDSV LQFGGGTLGHPWGNAPGAVANRVALEAVQGNTIIREATKWSPELAAACEVWKEIK >P00876_Nicotiana_tabacum_I_REFreference --------YTPEDTDILAAFRVTPQPGV-PPEAGAAVAAESSTGTWTTVWTDGLTSLDRY KGRCYRIEKDIAYVAYPLDLFEEGSVTNMFTSIVGNVFGFKALRALRLEDLRIPPAYVKT FQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFLFCAEALYKAQAETGEIKGHYLNATAGTEEMIKRAVFARELGVPIVMHD YLTGGFTANTSLAHYDNGLLLHIHRAMHAVIDRQKNHGIHFRVLAKALRMSGGDHIHSGT VVGKLEGERDITLGFVDLLRRGIYFTQDWVSLPGVLPVASGGIHVWHMPALTEIFGDDSV LQFGGGTLGHPWGNAPGAVANRVALEAVKGNEIIREACKWSPELAAACEVWKEIV >Isoetes_savatieri_AAM3458_REFreference --------YTPDDTDILAAFRMTPQPGV-PPEAGAAVAAESSTGTWTTVWTDGLTSLDRY KGRCYDIAGEIAYVAYPLDLFEEGSVTNMFTSIVGNVFGFKALRALRLEDLRIPPAYSKT FQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFLFVAEALNKSQAETGEIKGHYLNATAGTEEMMKRAVFARELGAPIVMHD YLTGGFTANTSLAHYDNGLLLHIHRAMHAVIDRQRNHGIHFRVLAKALRMSGGDHIHSGT VVGKLEGEREVTLGFVDLLRRGIYFTQDWVSMPGVLPVASGGIHVWHMPALTEIFGDDSV LQFGGGTLGHPWGNAPGAVANRVASEAVQARNEGRDLAREGNE------------ >Picea_abies_CAA5320_REFreference --------YTPEDTDILAAFRVTPQPGVPPEEAGAAVAAESSTGTWTTVWTDGLTSLDRY KGRCYDIAGEIAFVAYPLDLFEEGSVTNLFTSIVGNVFGFKALRALRLEDLRIPPAYSKT FQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFVFCAEALNKAQAETGEIKGHYLNATAGTGEMMKRAVFARELGVPNVMHD YLTGGFTANTSLAHYDNGLLLHIHHAMHAVIDRQKNHGMHFRVLAKALRMSGGDHIHGGT VVGKLEGEREITLGFVDLLRRGIYFTQDWVSMPGVLPVASGGIHVWHMPALTEIFGDDSV LQFGGGTLGHPWGNAPGAVANRVALEAVQGNEVIREACKWSPELAAACEIWKEIK >Q51856_Hydrogenophilus_thermoluteolus_I_REFreference --------WEPTKDSFLAVFKIVPP-GVPREESAAAVAAESSTATWTTVWTDLLTDLYYY KGRAYAIVPGYAFIAYPMGLFEEGSVVNVFTSLVGNVFGFKAVRSLRLEDVRIPLWFVTT CPGPPHGIYVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFLFCQEAIEKAQAETGERKGHYMNVTGPTEEIYKRAEFAKEIGTPIIMID YLTVGWAATQSLSKWDNGMLLHVHRAMHAVIDRNPKHGINFRVLAKIMRLIGGDHLHSGT VVGKLEGDRAATLGWIDLMRRGIFFDQDWGQMPGMFPVASGGIHVWHMPALVSIFGDDSV LQFGGGTIGHPWGNAAGACANRVALEAVKRNEILTEAAKSCPELKVAMETWKEVK >2_Hydrogenovibrio_marinus_BAD1531_REFreference -YWTPD--YTPLDTDLLACFKVVPQEGV-PREAAAAVAAESSTGTW----TTVWTDLLEF YKRAYRIPGDYAFIAYPLDLFEEGSVVNVLTSLVGNVFGFKAVRSLRLEDLRFPIAFIKT CGGPPSGIQVERDKLNKYGRPMLGCTIKPKLGLSAKNYGRAVYECLRGGLDLTKDDENIN SQPFQRWRDRFEFVAEAVDKATAETGERKGHYLNVTAGTEEMMKRAEFAKELGQPIIMHD FLTAGFTANTTLANWENGMLLHIHRAMHAVIDRNPLHGIHFRVLAKCLRLSGGDHLHTGT VVGKLEGDRASTLGFVDQLRRGVFFDQDWGSMPGVMAVASGGIHVWHMPALVNIFGDDSV LQFGGGTQGHPGGNAAGAAANRVALEAVKGGDILREAARTSKELAVALETWKEIK >Thioalkalimicrobium_cyclicum_YP_00453780_REFreference -YWTPD--YTPLDTDLLACFKVIPQAGV-PREAAAAVAAESSTGTW----TTVWTDLLEF YKRCYRIPGNYAFIAYPLDLFEEGSVVNVLTSLVGNVFGFKAVRSLRLEDIRFPVAFIKT CGGPPSGIQVERDKLNKYGRPMLGCTIKPKLGLSAKNYGRAVYECLRGGLDLTKDDENIN SQPFQRWRDRFSFVADAINKAEAETGEVKGHYLNVTAATEDMMERAEYAKELGVRIVMHD FLTGGFTANTSLANWKNGMLLHIHRAMHAVIDRNPNHGIHFRVLAKCLRLSGGDHLHTGT VVGKLEGDRASTLGFVDQLRRGVFFDQDWGSMPGVMAVASGGIHVWHMPALVTIFGDDSV LQFGGGTQGHPGGNAAGAAANRVALEAVKGGDILRDAARHSPELAVALETWKEIK >Allochromatium_vinosum_YP_00344469_REFreference --------WTPDLDSLLACFKVTPA-KVSREEAAAAVAAESSTGTWTTVWSDLLTDLDYY KGRAYRIVPGYAFIAYPLDLFEEGSIVNVLTSLVGNVFGFKAVRALRLEDIRFPLHYVKT CGGPPNGIQVERDRMDKYGRPFLGATVKPKLGLSAKNYGRAVYEMLRGGLDFTKDDENVN SQPFMRWQNRFEFVSEAVRKAQEETGERKGHYLNVTAPTEEMFKRAEFAKECGAPIIMHD FLTGGFTANTSLANWDNGMLLHIHRAMHAVIDRNPKHGIHFRVLAKCLRLSGGDHLHTGT VVGKLEGDRQSTLGFVDQLRRGLFFDQDWGGMPGVMAVASGGIHVWHIPALVTIFGDDSV LQFGGGTQGHPWGNAAGAAANRVATEAVKRNEVLSDAARHSPELAVAMETWKEIK >Prochlorococcus_marinus_YP_00109080_REFreference -YWTPE--YVPLDTDLLACFKCTGQEGV-PREVAAAVAAESSTGTW----STVWSELLEF YKRCYRIPGDYAFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDIRFPIAFIKT CGGPPNGIVVERDRLNKYGRPLLGCTIKPKLGLSGKNYGRVVYECLRGGLDLTKDDENIN SQPFQRWRERFEFVAEAVKLAQRETGEVKGHYLNCTANTEELYERAEFAKELDMPIIMHD YITGGFTANTGLANWKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHTGT VVGKLEGDRQTTLGYIDNLRRGNFFDQDWGSMPGVFAVASGGIHVWHMPALLAIFGDDSC LQFGGGTHGHPWGSAAGAAANRVALEAVKSRDILMEAAKHSPELAIALETWKEIK >Synechococcus_sp_WH8102_NP_89780_REFreference -YWTPD--YVPLDTDLLACFKCTGQEGV-PKEVAAAVAAESSTGTW----STVWSELLDF YKRCYRIPGDYAFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDIRFPMAFIKS CYGPPNGIQVERDRMNKYGRPLLGCTIKPKLGLSGKNYGRVVYECLRGGLDFTKDDENIN SQPFQRWQNRFEFVAEAIKLSEQETGERKGHYLNVTANTEEMYERAEFAKELGMPIIMHD FITGGFTANTGLSKWKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHTGT VVGKLEGDRQTTLGYIDQLRRGNFFDQDWGSMPGVFAVASGGIHVWHMPALVAIFGDDSV LQFGGGTHGHPWGSAAGAAANRVALEAVKSRDILMEAGKHSPELAIALETWKEIK >uncultured_marine_typeA_Synechococcus_ABD9625_REFreference --------WTPDLDTLLACFKCTGE-GVPKEEVAAAVAAESSTGTWSTVWSELLTDLDFY KGRCYRIPGD-AFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDLRFPLAFIKT CYGPPNGIQVERDRMNKYGRPLLGCTIKPKLGLSGKNYGRVVYECLRGGLDFTKDDENIN SQPFQRWQNRFEFVAEAIKLSEQETGEKKGHYLNVTANTEEMYERAEFAKELGMPIIMHD FITGGFTANTGLSKWKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHTGT VVGKLEGDRQTTLGYIDQLRRGNFFDQDWGSMPGVFAVASGGIHVWHMPALVTIFGDDSV LQFGGGTHGHPWGSAAGAAANRVALEACVKARILMEAAKHSPELAIALETWKEIK >Nitrococcus_mobilis_ZP_0112568_REFreference -YWTPD--YVPLDTDLLACFKCTGQPGV-PREVAAAVAAESSTGTW----SSVWSELLEY YKRAYEVPGDYAFIAYPIDLFEEGSVVSVLTSLVGNVFGFKALKHLRLEDIRFPIAYVKT CMGPPSGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRTVYECLRGGLDLTKDDENVS SQPFMRWQNRFEFVAEAVLKAQAETGERKGHYLNVTSPDEQMYERAEFAKALGMPIIMHD FLTAGFTANTGLAKWKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDHLHAGT VVGKLEGDRSSTMGFVDQLRRGVFFDQDWGSMPGVFPVASGGIHVWHMPALVTIFGDDSC LQFGGGTQGHPWGNAAGAAANRVSLEAVKAADILSEAAARSPELAIAMETWKEIK >Thiocapsa_marina_ZP_0877298_REFreference -YWTPD--YVPLDTDLLACFKCTGQPGV-PREVAAAVAAESSTGTW----STVWSELLEF YKRAYRIPGDYAFIAYPIDLFEEGSVVNVLTSLVGNVFGFKALRHLRLEDIRFPIAYVKT CMGPPSGIQVERDKLNKYGRPLLGATIKPKLGLSAKNYGRAVYECLRGGLDLTKDDENVN SQPFMRWQNRFEFVAEAVSAAQAETGERKGHYLNVTAADEQMYERAEFAKELGQPIIMHD FLTGGFTANTGLAKWKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDHLHTGT VVGKLEGDRGSTLGFVDLLRRGIFFDQDWGSMPGVFAVASGGIHVWHMPALVTIFGDDSV LQFGGGTQGHPWGNAAGAAANRVALEAVKAREIMTGAAKHSPELAIAMETWKEIK >Thiorhodovibrio_sp_970_ZP_0894247_REFreference -YWTPD--YVPLDTDLLACFKCTGQPGV-PREVAAAVAAESSTGTW----STVWSELLEY YKRAYRIPGDYAFIAYPIDLFEEGSIVNVLTSLVGNVFGFKALRHLRLEDIRFPIAYVKT CMGPPNGIQVERDKMNKYGRPLLGATIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWQNRFEFVGEAIQSAQQETGERKGHYLNVTAATEDMYERAEFAKECGVPIIMHD FLTGGFTANTGLAKWKNGVLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDHLHTGT VVGKLEGDRASTLGYVDQLRRGIFFDQDWGSMPGVFAVASGGIHVWHMPALVTIFGDDSV LQFGGGTQGHPWGNAAGAAANRVALEAVKAREIMTDAARHSPELAIAMETWKEIK >Bradyrhizobium_sp_ORS_278_YP_00120434_REFreference --------WTPELDTLLAVFKIVAA-GVPREEAAAAVAAESSTGTWTTVWTDLLTDLDYY KGRAYRIVPGYAFIAYPIDLFEEGSVVNVLTSLVGNVFGFKAVRSLRLEDIRFPLAYVKT CGGPPNGIQLERDRLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENIN SQPFMRWQHRFEFVMEAVHKATSETGERKGHYLNVTAPTEEMYKRAEFAKSLGAPIIMHD FLTAGFTANTGLANWENGMLLHIHRAMHAVLDRNPMHGIHFRVLTKCLRLSGGDHLHSGT VVGKLEGDREATIGWVDLMRRGIFFDQDWGAMPGVMPVASGGIHVWHMPALTAIFGDDAC FQFGGGTLGHPWGNAAGAHANRVALEAVERNQILTEAAQHSPELKIAMETWKEIK >Cupriavidus_metallidurans_YP_58365_REFreference -YWTPE--YTPKDTDLLACFKVTPQPGVAREEVAAAVAAESSTGTWTTVWTDLLTDLDYY KGRAYRIPGDYAFVAYPIDLFEEGSIVNVLTSLVGNVFGFKALRALRLEDVRFPIAYVMT CGGPPHGIQVERDIMNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWKQRFDFVQEATEKAQRETGERKGHYLNVTAPTEEMYKRAEYAKEIGAPIIMHD YLTGGFCANTGLANWDNGMLLHIHRAMHAVLDRNPHHGIHFRVLTKCLRLSGGDHLHSGT VVGKLEGDRDATLGWIDIMRRGIMFDQDFGSMPGVMPVASGGIHVWHMPALVTIFGDDSV LQFGGGTLGHPWGNAAGAAANRVALEAVERNKILTEAATHSPELKIAMETWKEIK >Nitrosomonas_eutropha_YP_74703_REFreference -YWQPD--YVPLDTDILACFKITPQSGVDREEAAAAVAAESSCGTWTTVWTDLLTDLDYY KGRAYRIPGDYAFVAYPIDLFEEGSVVNVFTSLVGNVFGFKAIRALRLEDVRFPIAYVKT CGGPPSGIQVERDKMNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGSLDFTKDDENIN SQPFMRWRDRFEFVQEATLKAEAETGERKGHYLNVTAPTEEMYKRAEFAKEIGAPIIMHD YLAGGLCANAGLANWNNGMLLHVHRAMHAVLDRNPHHGIHFRVLTKILRLSGGDHLHTGT VVGKLEGDRASTLGWIDLLRRGIFFDQDWGSMPGAFAVASGGIHVWHMPALVAIFGDDSV LQFGGGTLGHPWGNAAGAHANRVALEAVQRNEILTAAAQHSPELKIAMETWKEIK >Q59613_Nitrobacter_vulgaris_I_REFreference -YWQPD--YMPLDTDILACFKITPQPGVDREEAAAAVAAESSCGTWTTVWTDLLTDLDYY KGRAYRLPGDYAFIAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRALRLEDLRFPIAYVKT CGGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRACYEALRGGLDFTKDDENIN SQPFMRWRDRFDFVMEAVQKAEHETGERKGHYLNVTAPTEEMYKRAEYAKEIRAPIIMHD YLAGGLCANAGLANWNNGMLLHIHRAMHAVIDRNPHHGIHFRVLTKILRLSGGDHLHTGT VVGKLEGDRASTLGWIDLLRRGIFFDQDWGSMPGGFAVASGGIHVWHMPALVTIFGDDSV LQFGGGTVGHPWGNAL-AHANRVALEAVQRGELLTAAAAHSPELKIAMETWKEIK >Halorhodospira_halophila_YP_00100262_REFreference -YWEPD--YKIKDSDLLAVFKVTPQPGVDREEAAAAVAAESSTGTWTTVWTDLLTDLEHY KGRAYKVPGDYAFIAYPIDLFEEGSIVNVFTSLVGNVFGFKAVRALRLEDVRFPLHFVMT CPGPPNGIQVERDKMNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFEFVMEAIQKAEEETGERKGHYLNVTAPTEEMYKRAEFAKELGAPIIMHD YITAGFCAHQGLANWDNGMLLHIHRAMHAVLDRNPNHGIHFRVLTKILRLMGGDQLHTGT VVGKLEGDRQSTLGWIDLLRRGLFFDQDWGAMPGAFAVASGGIHVWHMPALLSIFGDDAV FQFGGGTLGHPWGNAAGAAANRVALEAVKRNEILTEAAKSSPELKAAMETWKEIK >Acidimicrobium_ferrooxidans_YP_00310876_REFreference -YWAPD--YVPLDSDLLAVFKIVPQPGVDREEAAAAVAAESSTGTWTTVWTDLLTDLDYY KGRAYRIPGDYAFVAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRSLRLEDVRFPLAFVNT CNGPPHGIQVERDKMNKYGRPLLGCTIKPKLGLSAKNYGRAVYEVLRGGLDFTKDDENVN SQPFMRWRDRFLFVAEAIHQAEAETGERKGHYLNVTAPSEEMYERAEFAKELGMPIIMHD FLTGGFTANTGLARWRNGMLLHIHRAMHAVIDRNPYHGIHFRVLAKALRLSGGDHLHTGT VVGKLEGDRAATQGWVDLLRRGIFFDQDWGSMPGVFAVASGGIHVWHMPSLLTIFGDDAV FQFGGGTLGHPWGNAPGATANRVALEAVQRNEILQNAAKHSPELRVAMETWKEIK >Alkalilimnicola_ehrlichii_YP_74366_REFreference -YWEPD--YQVKDSDFLACFKVVPQAGVPREEAAAAVAAESSTGTWTTVWTDLLTDLDYY KGRAYKIGDD-AFIAYPIDLFEESSVVNVFTSLVGNVFGFKAVRSLRLEDVRIPLAYVMT CNGPPHGIQVERDKMDKYGRGLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN SQPFMRWRDRFLFVQEATEKAQQETGERKGHYLNVTAPSEEMYERAEFAKEIGAPIIMHD FLTGGFCANTGLARWKNGMLLHIHRAMHAVMDRNPRHGIHFRVLAKALRLSGGDHLHTGT VVGKLEGDRAATEGWIDLLRRGIFFDQDWGAMPGVFAVASGGIHVWHMPALVSIFGDDAV FQFGGGTLGHPWGNAAGAAANRVALEAVKGKEILQAAAQHSPELKIAMETWKEIK >WP_029927633_Nocardia_otitidiscaviaru_REFreference DRWDPG--YAPDDTDVLAVFRVTPQQGV-DPEASAAVAGESSTATWTVVWTDRLTAHDRY RGKCYRVPGEFAYVAYDLDLFEEGSITNLTSSVIGNVFGFKPLKALRLEDMRIPVAYVKT FQGPAHGIVMEREYLNKYGRPLLGATVKPKLGLSARNYGRVVYEACKGGLDFTKDDENIN SQPFMRWRDRYLFAMEGVNRAIAETGELKGHYLNVTAADEDMYERAEFAKSLGSVIIMMD -LTVGYTAMQSMAKWRNGMLLHLHRAGHSTFTRQKTHGVSFRVLAKWCRLIGVDHVHAGT VVGKLEGDPHTVRGFYDTLRNGIFFDQDWASLPGVMPVASGGIHAGQMHQLLDLFGDDVI LQFGGGTIGHPFGIAAGAEANRVALEAVVKARVLRQAAEHCRPLEVALATWGDVT >NP_043654_Odontella_sinensis_I_REFreference PYAKMG--WDASQDTVLALFRITPP-GVDPVEAAAAVAGESSTATWTVVWTDLLTACERY RAKAYRVVPNFAFIAYECDLFEEGSLANLTASIIGNVFGFKAVAALRLEDMRIPYAYLKT FQGPATGIVVERERLNKYGAPLLGATVKPKLGLSGKNYGRVVYEGLKGGLDFLKDDENIN SQPFMRWRERFLYCLEGINRASAATGEVKGSYLNITAATEEVYKRADYAKQIGSVIVIID -LVMGYTAIQSAAIWDNDMLLHLHRAGNSTYARQKNHGINFRVICKWMRMSGVDHIHAGT VVGKLEGDPLMIKGFYDVLRYGIFFDMSWASLRKCMPVASGGIHCGQMHQLIHYLGDDVV LQFGGGTIGHPDGIQAGATANRVALEAVLRNEILREAAKKCGPLQTALDLWKDIS >gi_518884741_ref_WP_020040616_ribulose_bisophosphate_carboxylase_Salipiger_mucosu_REFreference KKRYSAMGWEPDDTDIIALFRITPQDGV-DEEAAAAVAGESSTATWTVVWTDRLTACDKY RAKAYRVPGEFAYIAYDLDLFEPGSIANLTASIIGNVFGFKPLKALRLEDMRLPVAYVKT FQGPATGIVVERERLNAYGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENIN SQPFMHWRDRFLYCMEAVNRASAATGEVKGTYLNITAGTEEMYARAEFAKSLGSVIIMID LV-IGYTAIQSMAKWDNDMILHLHRAGHSTYTRQRSHGVSFRVIAKWMRLAGVDHLHAGT VVGKLEGDPATTKGYYDIFRNGVFFDQDWASLNKMMPVASGGIHAGQMHQLLTYLGEDVV LQFGGGTIGHPHGIEAGATANRVALEAVFGSDILRDAAQTCTPLKQALETWKDVT >AAB41464_Aurantimonas_manganoxydans_SI85_I_REFreference GYWEPD--YEPKETDVIACFRITPQDGV-DPEAAAAVAGESSTATWTVVWTDRLTAAEKY RAKAYRVDDQFAYIAYDLDLFENGSIANLTASIIGNVFGFKPLKGLRLEDMRLPTAYVKT FQGPATGIVVERERLDKFGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENIN SQPFMDWRERFLYCMEAVNKAQAATGEIKGTYLNVTAATEDMYERAEFARDLGSNIIMID LV-IGWTAMQSMAKWRNNMILHLHRAGHSTYTRQKTHGVSFRVIAKWARLAGVDHIHAGT VVGKLEGDPATTKGYYDICRNGIFFDQPWASLNKMMPVASGGIHAGQMHQLLDLLGDDTV LQFGGGTIGHPMGIAAGATANRVALECVLGPEILQEAARSCTPLQQALETWKDVT >emb_CDP50020_Devosia_sp_DBB00_REFreference KDRYKSMGWEPDDTDVIALFRITPQDGV-DPEAAAAVAGESSTATWTVVWTDRLTASEKY RAKAYRVPGQFAYIAYDLDLFEPGSIANLSASIIGNVFGFKPLKALRLEDMRFPVAYVKT FQGPATGIVVERERLDKFGRPLLGATIKPKLGLSGRNYGRVVYEALKGGLDFTKDDENIN SQPFMHWRDRFLYCMEAVNKAEAATGEIKGTYLNVTAGDEAMYERAEFAKELGSCIVMID LV-IGYTAIQSMAKWKNDMILHLHRAGHGTYTRQKSHGVSFRVIAKWMRLAGVDHIHAGT VVGKLEGDPNTTKGYYDICRHGVFFDQDWASLNKLMPVASGGIHAGQMHQLIHLLGEDTI LQFGGGTIGHPMGIQAGATANRVALEAIYGPQILQEAARHCLPLKQALDTWGDVT >WP_002717593_Afipia_feli_REFreference KDRYKSMGWEPDDTDVIALFRVTPQDGV-DPEASAAVAGESSTATWTVVWTDRLTAAEKY RAKCYRVPGQFAYIAYDLDLFEPGSIANLSASIIGNVFGFKPLKGLRLEDMRFPVAYVKT FQGPATGIVVERERLDKFGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENIN SQPFMHWRERFLYCMEAVNRAQAASGEVKGTYLNVTAATEDMYERAEFAKELGSCIVMID LV-IGYTAIQSMAKWKNDMILHLHRAGHSTYTRQKSHGVSFRVIAKWMRLAGVDHIHAGT VVGKLEGDPNTTRGYYDICRHGIFFDQNWASLNKLMPVASGGIHAGQMHQLLDLLGEDVV LQFGGGTIGHPMGIQAGAIANRVALEAILARNEGRDYVAEGPEKAAALEVWKDVT >Nitrobacter_hamburgensis_YP_57154_REFreference GYWEPD--YTPKDTDIICLFRVTPQDGV-DPEAAAAVAGESSTATWTVVWTDRLTAAEKY RAKCYRVEGQFAYIAYDLDLFEPGSISNLTASVIGNVFGFKPLKALRLEDMRLPVAYVKT FKGPPTGIVVERERLDKFGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENIN SQPFMHWRERFLYCMEAVNRAQAATGEIKGSYLNVTAATEDMYERAEFAKELGSVVVMID LV-IGYTAIQSMSNWKNDMILHLHRAGHSTYTRQRNHGVSFRVISKWMRLAGVDHIHAGT VVGKLEGDPLTTRGYYDICRHGIFFDQNWASLNKMMPVASGGIHAGQMHQLIQHLGEDVV LQFGGGTIGHPMGIQAGATANRVALEAILARNEGRDYVSEGPDKAAALEVWKDVT >WP_011316104_Nitrobacter_winogradsky_REFreference KDRYKSMGWEPDDTDVICLFRVTPQDGV-DPEASAAVAGESSTATWT---VVWTDRLTKY RAKCYRVEGQFAYIAYDLDLFEPGSISNLTASVIGNVFGFKPLKALRLEDMRLPVAYVKT FKGPPTGIVVERERLDKFGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENIN SQPFMHWRERFLYCMEAVNRAQAATGEIKGSYLNVTAATEDMYERAEFAKELGSVVVMID LV-IGYTAIQSMSNWKNDMILHLHRAGHSTYTRQRSHGVSFRVISKWMRLAGVDHIHAGT VVGKLEGDPLTTRGFYDICRHGIFFDQNWASLNKVMPVASGGIHAGQMHQLIQHLGEDVV LQFGGGTIGHPMGIQAGATANRVALEAILARNEGRDYVSEGPEKAAALEVWKDVT >ABA56859_Nitrosococcus_oceani_ATCC_19707_I_REFreference GYWEPD--YQPKDTDIIAMFRITPQPGVDPEEAAAAVAGESSTATWTVVWTDRLTDCELY RAKAYDLEGTFAYIAYDLDLFEPGSIANLTASIIGNVFGFKAVKALRLEDMRIPVAYLKT FQGPATGVVVERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEALKGGLDFVKDDENIN SQPFMHWRDRFLYCMEAVNKASAATGEVKGHYLNVTAATEDMYERAEFAKSLGSVIIMID LV-VGYTAIQSMAKWKNDMILHLHRAGNSTYSRQKNHGMNFRVICKWMRMAGVDHIHAGT VVGKLEGDPLMIKGFYDTLLHGLFFDQDWASLNKVMPVASGGIHAGQMHQLIQYLGEDVI LQFGGGTIGHPQGIQAGAVANRVALEAILARNEGRDYVKEGPQDAAALDTWKDVT >YP_411385_Nitrosospira_multiformis_ATCC_251966_I_REFreference GYWEPD--YVPKDTDIIAMFRITPQAGVEPEEAAAAVAGESSTATWTVVWTDRLTACELY RAKAFDPEGTFAYIAYDLDLFEPGSIANLTASIIGNVFGFKAVKALRLEDMRIPVAYLKT FQGPATGIIVERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEGLKGGLDFMKDDENIN SQPFMHWRDRFLYCMEAVNKASAATGEVKGHYLNVTAGTEEMYERAEFAKSLGSVIIMID LV-IGYTAIQSMAKWKNDMILHLHRAGNSTYSRQKNHGMNFRVICKWMRMAGVDHIHAGT VVGKLEGDPLMIKGFYDTLRHGLFFEQDWASLNKVMPVASGGIHAGQMHQLLDYLGEDVI LQFGGGTIGHPQGIQAGAVANRVALEAIMARNEGRDYVKEGPQEAAALDTWKDIT >Galdieria_Partita_1IW_REFreference PYAKMG--WNPDKDTVLALFRVTPP-GVDPIEAAAAVAGESSTATWTVVWTDLLTAADLY RAKAYKVVPNFAYIAYELDLFEEGSIANLTASIIGNVFGFKAVKALRLEDMRLPLAYLKT FQGPATGVILERERLDKFGRPLLGCTTKPKLGLSGKNYGRVVYEALKGGLDFVKDDENIN SQPFMRWRERYLFTMEAVNKASAATGEVKGHYLNVTAATEEMYARANFAKELGSVIIMID -LVIGYTAIQTMAKWDNDMILHLHRAGNSTYSRQKNHGMNFRVICKWMRMAGVDHIHAGT VVGKLEGDPIITRGFYKTLLEGLFFDMEWASLRKVMPVASGGIHAGQMHQLIHYLGEDVV LQFGGGTIGHPDGIQAGATANRVALEAILRNEILREAAKTCGALRTALDLWKDIT >Chondrophycus_papillosus_ABO3124_REFreference PYAKMG--WDPNKDTVLALFRVSPP-GVDPVEASAAVAGESSTATWTVVWTDLLTACDLY RAKAYKVVPNFAYIAYDIDLFEEGSIANLTASIIGNVFGFKAVKALRLEDMRIPVAYLKT FQGPATGIVVERERMDKFGRPFLGATVKPKLGLSGKNYGRVVYEGLKGGLDFLKDDENIN SQPFMRWKERFLYSMEAVNRSIAATGEVKGHYMNVTAATEDMYERAEFAKELGTVIIMID -LVIGYTAIQTMAIWKNDMILHLHRAGNSTYSRQKIHGMNFRVICKWMRMAGVDHIHAGT VVGKLEGDPLMIRGFYNTLLQGIFFEQDWASLRKVTPVASGGIHCGQMHQLLDYLGNDVV LQFGGGTIGHPDGIQAGATANRVALEAVIRNEILRDAAKTCGPLQTALDLWKDIT >Q08051_Pleurochrysis_carterae_I_REFreference PYAKMG--WDPEKETILALFRCTPP-GVDPVEAAAALAGESSTATWTVVWTDLLTACDLY RAKAYRVVPSFCYIAYDIDLFEEGSLANLTASIIGNIFGFKAVKALRLEDMRMPYALLKT YQGPATGLIVERERLDKFGRPLLGATVKPKLGLSGKNYGRVVFEGLKGGLDFLKDDENIN SQPFMRYRERFLYSMEGVNHAAAVSGEVKGHYLNATAATEDMYERAEFAKDLGSVIVMID -LVIGYTAIQSMAIWKTDMILHLHRAGNSTYSRQKSHGMNFRVICKWMRMSGVDHIHAGT VVGKLEGDPLMIKGFYNTLLQGLFFAQDWASLRKCVPVASGGIHCGQMHQLINYLGDDVV LQFGGGTIGHPDGIQAGATANRVALECVIRNEILRDAAKTCGPLQTALDLWKDIT >Pyropia_dentata_Q760S_REFreference PYAKMG--WDADKETILALFRITPP-GVDPIEASAAIAGESSTATWTVVWTDLLTACDLY RAKAYRVVPNFAYIAYDIDLFEEGSIANLTASIIGNVFGFKAVKALRLEDMRMPVAYLKT FQGPATGLIVERERMDKFGRPFLGATVKPKLGLSGKNYGRVVYEGLKGGLDFLKDDENIN SQPFMRWRERFLYSMEGVNKASASAGEIKGHYLNVTAATEDMYERAEFSKEVGSIICMID -LVIGYTAIQSMAIWKHDMILHLHRAGNSTYSRQKNHGMNFRVICKWMRMAGVDHIHAGT VVGKLEGDPLMIKGFYNTLLQGLFFAQNWASLRKVVPVASGGIHAGQMHQLLDYLGDDVV LQFGGGTIGHPDGIQAGATANRVALESVMRNEILRDAAKTCGPLQTALDLWKDIS >Burkholderia_xenovorans_YP_55288_REFreference GYWQPD--YAPKDTDVIALFRITPQPGVDPEEAAAAVAGESSTATWTVVWTDRLTACDIY RAKAYRVDPVFAYIAYELDLFEEGSVANLTASIIGNVFGFKPLKALRLEDMRIPVAYLKT FQGPPTGIVVERERLDKYGRPLLGATVKPKLGLSGKNYGRVVYEGLRGGLDFLKDDENIN SQAFMHWRDRFLFAMEAVNRAQAETGEVKGHYLNVTAGTEDMYERAEFAKELGSCIVMID LV-IGWTAIQSMARWRNDMILHLHRAGHSTYTRQRNHGISFRVIAKWLRMAGVDHAHAGT AVGKLEGDPLSVQGYYNVCRRGIFFDQPWAGLRKVMPVASGGIHAGQMHQLLDLFGDDAI LQFGGGTIGHPAGIQAGAVANRVALEAMVKARILEAAARWCTPLKQALDTWRDVT >Rubrivivax_benzoatilyticus_ZP_0840404_REFreference GYWDSD--YVPKPTDVVCLFRITPQEGV-DPEAAAAVAGESSTATWTVVWTDRLTACDSY RAKAYKVPGEFAWVAYDLILFEEGSIANMTASLIGNVFSFKPLKAARLEDIHIPVAYVKT FKGPPTGLVVERERLDKFGRPLLGATTKPKLGLSGRNYGRVIYEGLKGGLDFMKDDENIN SQPFMHWRDRFLYVMDGVNKASAATGEVKGSYLNITAATEDMYERAEFAKELGSVVVMVD LV-IGWTAIQSIANWKNDMIVHMHRAGHGTYTRQKNHGVSFRVIAKWLRLAGCDHLHTGT AVGKLEGDPLTVQGYYNVCRRGLFFDQDWADLRKVMPVASGGIHAGQMHQLIDLFGDDVI LQFGGGTIGHPAGIQAGAVANRVALEAVKGPEILQKAAQFCTPLKQALDTWKDVS >YP_001020675_Methylibium_petroleiphilum_PM1_PutativeType1_REFreference GYWMPD--YVPKDTDTICLFRITPQDGVDPVEAAAAVAGESSTATWTVVWTDRLTACDSY RAKAYKVPGQFAWVAYDIILFEEGSIANMTASLIGNVFSFKPLKAARLEDIRIPVAYVKT FKGPPTGLVVERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEGLKGGLDFMKDDENIN SQPFMHWRDRYLYVMDAVNKASAATGEIKGSYLNVTAATEDMYERAEFAKELGSVIVMID LV-IGYTAIQSMSNWRNDVILHLHRAGHGTYTRQKNHGVSFRVIAKWMRLAGVDHIHTGT AVGKLEGDPMTVQGYYNVCRRGLFFDQDWADLKKVMPVASGGIHAGQMHQLIDLFGDDVV LQFGGGTIGHPQGIQAGAVANRVALECMVKARILRKAAQFCAPLKQALDTWGEIS >YP_001416820_Xanthobacter_autotrophicus_Py2_PutativeType1_REFreference GYWDGD--YQPKDTDVLALFRITPQDGV-DAEAAAAVAGESSTATWTVVWTDRLTAADMY RAKAYKVPGQFCWVAYDLDLFEEGSIANLTASIIGNVFSFKPLKACRLEDMRLPVAYVKT FRGPPTGIVVERERLDKFGRPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFVKDDENIN SQPFMHWRDRFLYCMEAVNKAQAETGEVKGHYLNITAGTEEMYRRAEFAKELGSVVVMVD LI-VGWTAIQSISNWENDVLLHMHRAGHGTYTRQKGHGISFRVIAKWLRLAGVDHLHTGT AVGKLEGDPMTVQGYYNVCRRGIFFDQDWAGLRKVMPVASGGIHAGQMHQLIDLFGEDVV LQFGGGTIGHPDGIQAGAIANRVALETILGPEILIEAAKWCRPLRAALDTWGEVT >Azoarcus_sp_KH32C_BAL2711_REFreference GYWDGD--YEPKETDVLALFRITPQEGV-DPEAAAAVAGESSTATWTVVWTDRLTACDRY RAKAYRVPGQFCYVAYELDLFEEGSIANLTASIIGNVFSFKPIKAARLEDMRLPVAYVKT FRGPPTGIVVERERLNCFGRPLLGATTKPKLGLSGKNYGRVVYEALLGGLDFTKDDENIN SQPFMHWRDRFLYVMEGVNRASAATGEVKGHYLNVTAGTEEMYARAEFAKSLGSTIVMVD LI-IGWTAIQSMSNWANDMILHMHRAGHGTYTRQKNHGISFRVIAKWLRLAGVDHLHAGT AVGKLEGDPMTVQNVCREMKRGIFFEQDWASLRRVMPVASGGIHAGQMHQLLDLFGDDVV LQFGGGTIGHPMGIQAGATANRVALEAVLGSDILKAAARDCAPLRAALDTWGEVS >Paracoccus_sp_TRP_ZP_0866487_REFreference GYWDGD--YQPKDTDVLALFRITPQEGV-DPEAAAAVAGESSTATWTVVWTDRLTACDQY RAKAYKVPGQFCYVAYDLILFEEGSIANVTASIIGNVFSFKPLKAARLEDMRFPVAYMKT FAGPPTGIVVERERLDKFGRPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFMKDDENIN SQPFMHWRDRFLYCMEAVNKASAATGEVKGHYLNITAGTEEMYRRAELAKELGSVIVMVD LI-VGWTAIQSISNWQNDMLLHMHRAGHGTYTRQKNHGISFRVIAKWLRMAGVDHLHAGT AVGKLEGDPLTVQGYYNVCRRGIYFEQDWGNLKKVMPVASGGIHAGQMHQLLDLFGDDVV LQFGGGTIGHPMGIQAGATANRVALEAVLGPEVLRRAAKWCKPLEAALDTWGNIT >Stappia_aggregata_ZP_0154896_REFreference GYWDSD--YVPKDTDILALFRITPQDGV-DPEAAAAVAGESSTATWTVVWTDRLTACDSY RAKAYRVEGQFAYVAYDLILFEEGSIANLTASIIGNVFSFKPLRAARLEDMRLPVAYVKT FAGPPTGLIVERERLDKFGKPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFMKDDENIN SQPFMHWRDRFLYCMEAVNKASAETGEIKGHYLNITAGTEEMYRRAEFAKELGSVIVMVD LI-VGWTAIQSISNWDNNMILHMHRAGHGTYTRQKNHGISFRVIAKWLRLAGVDHLHCGT AVGKLEGDPLTVQGYYNVCRRGNFFEQDWADLKKVMPVASGGIHAGQMHQLLDLFGDDVV LQFGGGTIGHPMGIQAGATANRVALEAVLGPQILREAARWCKPLEAALDTWGNIT >Rhodobacter_sphaeroides_YP_35436_REFreference GYWDGD--YVPKDTDVLALFRITPQEGV-DPEAAAAVAGESSTATWTVVWTDRLTACDSY RAKAYRVPGQFCYVAYDLILFEEGSIANLTASIIGNVFSFKPLKAARLEDMRFPVAYVKT YKGPPTGIVGERERLDKFGKPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFMKDDENIN SQPFMHWRDRFLYVMEAVNLASAQTGEVKGHYLNITAGTEEMYRRAEFAKSLGSVIVMVD LI-IGYTAIQSISEWQNDMILHMHRAGHGTYTRQKNHGISFRVIAKWLRLAGVDHLHCGT AVGKLEGDPLTVQGYYNVCRRGIFFEQDWADLRKVMPVASGGIHAGQMHQLLSLFGDDVV LQFGGGTIGHPMGIQAGATANRVALEAVLGPEILRAAAKWCKPLEAALDTWGNIT >Sinorhizobium_medicae_YP_00131266_REFreference GYWNGD--YEPKDTDLIALFRITPQDGVDPIEAAAAVAGESSTATWTVVWTDRLTACDQY RAKAYRVDPV-CYVAYDLILFEEGSIANLTASIIGNVFSFKPLKAARLEDMRLPVAYVKT FKGPPTGIVVERERLDKFGKPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFMKDDENIN SQPFMHWRDRYLYCMEAVNHASAVTGEVKGHYLNVTAGTEEMYRRAEFAKELGSVIVMVD LI-IGWTAIQSMSEWQNDMILHMHRAGHGTYTRQKNHGISFRVIAKWLRLAGVDHLHAGT AVGKLEGDPLTVQGYYNVCRRGLFFEQDWADLKKVLPVASGGIHAGQMHQLLDLFGDDVV LQFGGGTIGHPMGIQAGATANRVALEAVLGPEILRAAAKWCKPLEAALDTWGNIS >emb_CDO21134_Mycobacterium_mageritense_DSM_4447_REFreference DRWNAG--WQPDDSDVLCAFRITPQDGVPPEEAGAAVAGESSTATWTVVWTDRLTTFEHY QAKCYSVPGQIALIAYDLDLFEEGSIANLTSSIIGNVFGFKPLKALRLEDMRIPTHYVKT FQGPAHGIVMEREYLGKFGRPLLGATTKPKLGLSARNYGRVVYEALRGGLDFTKDDENIN SQPFMRWRDRFLFCMEAVNRAQAATGEIKGHYLNITAGTEEMYERADFAAELGSVVVMID -LTIGYTAIQSMAKWDNGVILHLHRAGHGTYTRQKTHGVSFRVISKWMRLAGVDHIHAGT VVGKLEGDPNTTAGFYDTLRKGLYFDQEWVSMPGVMPVASGGIHAGQMHQLIHYLGEDVV LQFGGGTIGHPMGIAAGAEANRVALEAIKARNEGRDYYREGPDKAA-LATWGDIT >WP_013076789_Kyrpidia_tuscia_REFreference KRWASG--WQPDDTDVIAAFRVVPQEGIDPEEAAAAVAGESSTATWTVVWTDRLTTYEHY QGKAFRVPGTIAYIAYDIDLFEEGSIANLASSIIGNVFGFKALKSLRLEDMRIPLHYVKT FQGPAHGIVMEREMLNKYGRPLLGATTKPKLGLSARNYGRVVYEALRGGLDFVKDDENIN SQPFMRWRDRFLYAMEAVHRAMAETGEIKGHYLNVTGATEDIYERAEFAKELGSVIVMID -LTVGYSAIQSLAKWRNSVLLHLHRAGHSTFTRQKTHGVSFRVIAKWMRLAGVDHLHAGT VVGKLEGDPNITKGYYQTLRLGLLFEQDWGSMPAVMPVASGGIHAGQVHQLIDLFGEDVI FQFGGGTIGHPMGIAAGATANRVAIEAMIQARILEKAAKWSPELRAALEVWKDVT >WP_028963129_Sulfobacillus_thermosulfidooxidan_REFreference SRWASG--WRPDETDVICVFRITPQEGV-SPEAAAAVAGESSTATWTVVW---TDRLTNY QAKAYRVPGTFAYIAYSIDLFEEGSIANLASSIIGNVFGFKPLKALRLEDMRIPLHYIKT FQGPAHGIVVEREYLDKYGRPLLGATIKPKLGLSARNYGRVAYEALRGGLDFTKDDENIG SQPFMRWRDRFLYVMEGVNRAAAETGEVKGHYMNVTAATEDMYERAEFARDLGSVIIMID -LTVGYTAIQSMANWKNGMLLHLHRAGHATFTRQKTHGVSFRVISKWMRLAGVDHIHAGT IVGKLEGDPNMIHGYYKTLREGLFFDQDWGSMPGVMPVASGGIHAGQMHLLLHHLGEDVI LQFGGGTIGHPMGISAGATANRVALEAIKGPEILQKAARMSPALQAALDVWKDVT >AAA98748_Gonyaulax_polyedra_I_REFreference ---YADLT-EEDNGKVLVAYIMN-G-GYD-YATAAHVAAESSTGTNVNV-CTTDDFTKTV DALVYYIDPEEMKIAYPVPLFDRNMMCSVLTLSIGNNQGMGDVEYGKIYDIYFPPSYLRF FDGPACSILDMWRILGTDGGLVVGTIIKPKLGLQPKPFGEACYAFGQGG-DFIKNDEPQG NQVFCQMNECIPEVVTAMKACIKETGSEKLFSANITADDAEMIARGKYILGQENCAFLVD GYVAGGTAVTVARRNFPKQFFHYHRAGHGAVTSPQTRGYTAFVHTKISRVIGASGIHVGT MFGKMVG-DASDKGIAYMLQGGPYYHQKWEGVVQTTPIISGGMNALRLPAFFENLGHYVI LTAGGGTFGHKDGPKQGATSCRQAWKLWKAGTGVIEYAKTHEEAFLTYPGWKEKL >AAG37859_Symbiodinium_sp_I_REFreference ---YADLD-EATNGKVLVAYIMKPA-GYD-YATAAHFAAESSTGTNVNV-CTTDDFTKSV DALVYYIDPDEMKIAYPTLLFDRNMMCSFLTLAIGNNQGMGDVEYGKIYDFYLPPSFLRL YDGPAVNVEDMWRILGSNGGLVDGTIIKPKLGLQPNPFGEACYSFWQGG-DFIKNDEPQG NQVFCQMNECIPEVVKAMRACVKETGSSKLFSANITADDEEMIARGKYIMSQENCAFLVD GYVAGGTAVTCCRRNFPKQFLHYHRAGHGSVTSPQTRGYTAFVHTKISRVIGASGIHVGT MFGKMEG-DASDKNIAYMLQDGPYYRQEWQGMKETTPIISGGMNALRLPAFFENLGHNVI LTAGGGSFGHKDGPKIGAISCRQAWKQWKAGQGVIEYAKTHEEAFLTYPGWKEKL >Mariprofundus_ferrooxydans_ZP_0145121_REFreference ---YADLN-EADGGKLLVAYKLIPK-GYG-FEVAAHIAAESSTGTNVEV-STTDDFTRGV DALVYDIDETLMKVAYPVELFDPNNVSHMWSLILGNNQGMGDHEGLRMLDFMVPECMIRK FDGPSAGIADLWKVLGVDGGYISGTIIKPKLGLRPEPFAKACLDFWLGG-DFIKNDEPQA NQPFCPMKVVIPKVAEAMDRAQQETGQAKLFSANATADYGECVARGEYILSEKHVAFLID GFVTGPAGVTTARRAFPDTFLHFHRAGHGAVTSYKSMGMDPLCYMKLARLQGASGIHTGT MYGKMEG-HGKETVLAYMLEMGHYFNQKWYGMKPTAPIISGGMNALRLPGFFENLGHNVI NTCGGGSFGHIDSPAAGGKSLDQAYDCWKSGTDPIEYAKTHGEAFESYPGWRDKL >Q59462_Hydrogenovibrio_marinus_I_REFreference -YADLT--LTEEDGNLLVAYRLKPAAGYGFLEVAAHVAAESSTGTN----VEVSTTDDGV DALVYEIKGGLMKIAYPVDLFDPNNVSHMWSLILGNNQGMGDHEGLRMLDFLVPEKMVKR FDGPATDISDLWKVLGRPDGYIAGTIIKPKLGLRPEPFAKACYDFWLGG-DFIKNDEPQA NQNFCPMEVVIPKVAEAMDRAQQATGQAKLFSANVTADFEEMIKRGEYVAKY-GNAFLVD GFVTGPAGVTTSRRAFPDTYLHFHRAGHGAVTSYKSMGMDPLCYMKLARLMGASGIHTGT MYGKME-GHNDERVLAYMLEQGPYFYQKWYGMKPTTPIISGGMDALRLPGFFENLGHNVI NTCGGGSFGHIDSPAAGGISLGQAYACWKTGAAPREFARESFPDKI-FPGWREKL >YP_422059_Magnetospirillum_magnetotacticum_AMB_1_I_REFreference ------LGLREAGGRVLCAYRMRPRPGHGYVETAAHFAAESSTGTNVEV-CTTDDFTRGV DALVYEVDEA-MKIAYPVELFDRNMIASFLTLTVGNNQGMSDVENAKMEDFYVPPDFLTL FDGPARNIAHMWKVLGRPNGMVVGTIIKPKLGLRPKPFADACHQFWLGG-DFIKNDEPQG NQVFAPFKDTMRLVADSMRRAQDETGQAKLFSANITADDAEMIARGQFILDTGENAFLVD GFVAGPTAVTTCRRNFPDTFLHYHRAGHGAITSRQSRGYSVLVHMKMARLLGASGIHTGT MYGKMEGAPDEKV-VAYMLEEGPHYRQDWGDMRACTPIISGGMNALRLPGFFDNLGHNVI QTSGGGAFGHKDGPVAGALSLRQAHEAWMRGISLVEYAQGHPEAFE-YPGWRDRL >231173_pdb_5RUB_A_Rhodospirillum_Rubru_REFreference ---YVNLK-EEDGGEVLCAYIMKPA-GYG-YATAAHFAAESSTGTNVEV-CTTDDFTRGV DALVYEVDEALTKIAYPVALFDRNMIASFLTLTMGNNQGMGDVEYAKMHDFYVPEAYRAL FDGPSVNISALWKVLGVDGGLVVGTIIKPKLGLRPKPFAEACHAFWLGG-DFIKNDEPQG NQPFAPLRDTIALVADAMRRAQDETGEAKLFSANITADDFEIIARGEYVLETSHVALLVD GYVAGAAAITTARRRFPDNFLHYHRAGHGAVTSPQSRGYTAFVHCKMARLQGASGIHTGT MFGKMEG-ESSDRAIAYMLTQGPFYRQSWGGMKACTPIISGGMNALRMPGFFENLGNNVI LTAGGGAFGHIDGPVAGARSLRQAWQAWRDGVPVLDYAREHKEAFESYPGWRKAL >Desulfovibrio_aespoeensis_YP_00412229_REFreference ---YVNLK-EDQDGNVLCAYIYKPN-GYG-NEVAAHFAAESSTGTNVEV-CTTDDFTKGV DALVYEVSDALMKIAYPVELFDRNMLASFLTLCIGNNQGMGDVAYAKMHDFYVPRQYLEL FDGPSKNIADFWRMLGENGGMIAGTIVKPKLGLRPKPFADACYHFWLGG-DFVKNDEPQG NQVFAPLKETITAVADAMRRAQDETGEAKIFSANITADDREMIARGEFILNASRVAFLVD GYVAGPTAITTARRNFPNQFLHYHRAGHGAVTSPQARGYTAFVLSKMSRLQGASGIHTGT MFGKMEG-EMADKSMAYMIEQGPYFKQKWYGMKATTPMISGGMNALRLPGFFDNLGHNIC QTSGGGAFGHLDGPTAGALSLRQSHDAWIQGVNLLDYAKEHNEAFESYPTWRHQL >AAN52766_Rhodopseudomonas_palustris_I_REFreference ---YANLK-ESEGGRVLCAYIMKPA-GFGNFQTAAHFAAESSTGTNVEV-STTDDFTRGV DALVYEIDEALMKIAYPIELFDRNMIASFLTLTIGNNQGMGDVEYAKMYDFYVPPAYLKL FDGPSTTIRDLWRVLGVNGGFIVGTIIKPKLGLRPQPFANACYDFWLGG-DFIKNDEPQG NQVFAPFKDTVRAVADAMRRAQDKTGEAKLFSFNITADDYEMLARGEFILETDHIAFLVD GYVAGPAAVTTARRAFPKQYLHYHRAGHGAVTSPQSRGYTAFVLSKMARLQGASGIHTGT MFGKMEG-EAADRAIAYMITDGPYFHQEWLGLNPTTPIISGGMNALRMPGFFDNLGHNLI MTAGGGAFGHVDGGAAGAKSLRQAEQCWKQGADPVEFAKDHREAFESYPNWRAKL >P50922_Rhodobacter_capsulatus_ATCC11166_I_REFreference ------LK-EADGGRVLCAYIMKPA-GYGYLETAAHFAAESSTGTNVEVS-TTDDFTRGV DALVYEIDPE-MKIAYPVELFDRNMLCSFLTLTIGNNQGMGDVEYAKMHDFYVPPCYLRL FDGPSMNIADMWRVLGRPDGMVVGTIIKPKLGLRPKPFADACYEFWLGG-DFIKNDEPQG NQTFAPLKETIRLVADAMKRAQDETGEAKLFSANITADDYEMVARGEYILETGENAFLVD YV-TGPAAITTARRSFPRQFLHYHRAGHGAVTSPQSRGYTAFVLSKMSRLQGASGIHTGT MYGKMEGDASDKIMLNDEAAQGPFYHQDWLGMKATTPIISGGMNALRLPGFFDNLGHNVI QTSGGGAFGHLDGATAGAKSLRQSCDAWKAGVDLVTYAKSHREAFESYPGWRVAL >AAC38280_Riftia_pachyptila_endosymbiont_I_REFreference ---YSDLK-EDESGDVLCAYLMKPS-GYG-YEAAAHFAAESSTGTNVEV-STTDDFTKGV DALVYEIDEALMKIAYPVDLFDINMLASFLTLTIGNNQGMGDIEYAKMLDFYMPPKYLRL YDGPAVNIQDMWRILGENGGYIAGTIIKPKLGLRPEPFAEAAYQFWLGG-DFIKNDEPQG NQPFSPMKKTIPLVADAMRRAQDETGEAKLFSANITADDAEMIARGEFVLETSQVAFLVD GYVAGPTAVATARRNFPNQFLHFHRAGHGAVTSPQSRGYTAFVHIKMTRLLGASGMHVGT MYGKMEG-EASDKLIAYMIEDGPFYHQEWAGMKPTTPIISGGMNALRLPGFFENLGHNVI NTAGGGTYGHIDSPAAGAVSLRQAYECWKEGADPVEYAKEHKEAFESFPGWRDKL >ABB41020_Thiomicrospira_crunogena_XCL_2_II__REFreference SNRYADLK-EEDGQNILVAYTMEPA-GYG-YEVAAHIAAESSTGTNVEV-CTTDDFTKGV DAIVYDIDEA-MKVAYPFDLFDRNMIVSFLTLAIGNNQGMGDVKNLQMFDFWVPETKLHL FDGPAVDITNMWKMLG-NGGYIAGTIIKPKLGLRPEPFADAAYQFWLGG-DFIKNDEPQG NQTFCPMKKVIPLVADAMKRAQDETGETKLFSANITADDHEMCYRADYILETGADAFLVD YV-GGPGMITTARRNYPSQYLHYHRAGHGAITSPSARGYTAFVLAKISRLQGASGIHVGT MYGKMEG-DASDKNIAYMIEQGPAFYQKWNGMKPTTPIISGGMNALRLPGFFENLGHNVI NTSGGGSYGHIDSPAAGATSLRQSYECWKSGADPIEFAKDHKEAFESYPGWREKL >YP_522655_Rhodoferax_ferrireducens_T118_I_REFreference ---YADLQ-EAAGGQILCAYKMAPD-GLN-YEAAAHFAAESSTGTNVEV-CTTDDFTRDV DALVYYVNEA-MRIAYPLALFDRNMLVSFLTLAVGNNQGMGDIKHAKMIDFYVPERVIQM FDGPAKDISDLWRILG-DGGFIVGTIIKPKLGLRPEPFAQAAYQFWLGG-DFIKNDEPQG NQVFSPIKKTLPLVYDALKRAQDETGQAKLFSMNITADDFEMCARADFALETGADAFLVD FV-GGPGMVTTARRQYPNQYLHYHRGGHGMVTSPSSRGYTALVLAKMSRLQGASGIHVGT MHGKMEG-AGDDRVMAYMIEQGPVYFQKWYGIKPTTPIVSGGMNALRLPGFFDNLGHNII NTAGGGSYGHLDSPAAGAVSLRQAYECWKAGADPIEWAKEHREAFESFAGWRDKL >YP002220242_Acidithiobacillus_ferrooxidans_ATCC_5399_REFreference ---YADLK-EEDGGKILVAYKMKPA-GHG-YEAAAHFAAESSTGTNVEV-STTDDFTKGV DALVYHIDEA-MRIAFPLELFDRNMIVSFLTLVIGNNQGMGDIEHGQMIDFFMPPRAIQL FDGPAKDISDLWRILG-DGGYIAGTIIKPKLGLRPEPFAAAAYQFWLGG-DFIKNDEPQG NQVFAPVKKTIPLVYDAMKRAMDETGEAKLFSMNITADDYEMCARADFALETGPDAFLVD FV-GGPGMITTARRQYAGQYLHYHRAGHGMITSPSARGYTAFVLAKMSRLQGASGIHVGT MYGKMEG-GADDRNIAYMIEDGPVYHQEWYGMKPTTPIISGGMNALRLPGFFENLGHNVI NTAGGGSYGHIDSPAAGAVSLRQAYECWKAGADPIEYAREHKEAFESYPGWREKL >Halothiobacillus_neapolitanus_YP_00326297_REFreference -YADLS--LKEEGGKILVAYKMKPKAGHGYLEASAHFAAESSTGTN----VEVSTTDDGV DALVYYIDEADMRIAYPMDLFDRNMLVSVLTLIIGNNQGMGDIEHAKIHDIYFPERAIQL FDGPSKDISDMWRILGRPNGYIAGTIIKPKLGLRPEPFAAAAYQFWLGG-DFIKNDEPQG NQVFCPLKKVLPLVYDSMKRAQDETGQAKLFSMNITADDYEMMARADFGLETGPDAFLVD GFVGGPGMITTARRQYPNQYLHYHRAGHGMITSPSARGYTAFVLAKISRLQGASGIHVGT MYGKME-GEGDDRNIAYMIEQGPVYFQKWYGMKPTTPIISGGMNALRLPGFFENLGHNVI NTAGGGSYGHIDSPAAGAISLKQAYECWKAGADPIEFAKEHKEAFE-FPGWREKL >Thiobacillus_denitrificans_AAA9917_REFreference -YADLS--LKEEGGRILVAYKMKPKSGYGYLEAAAHFAAESSTGTN----VEVSTTDDGV DALVYYIDEADMRIAYPLELFDRNMLVSFLTLAIGNNQGMGDIEHAKMIDFYVPERCIQM FDGPATDISNLWRILGRPNGYIAGTIIKPKLGLRPEPFAKAAYQFWLGG-DFIKNDEPQG NQVFCPLKKVLPLVYDAMKRAQDDTGQAKLFSMNITADDYEMCARADYALEVGPDAFLVD GYVGGPGMVTTARRQYPGQYLHYHRAGHGAVTSPSARGYTAFVLAKMSRLQGASGIHVGT MYGKME-GEGDDKIIAYMIEQGPVYFQKWYGMKPTTPIISGGMNALRLPGFFENLGHNVI NTAGGGSYGHIDSPAAGAISLRQSYECWKQGADPIEFAKEHKEAFE-FPGWREKL >YP003522651_Sideroxydans_lithotrophicus_ES_REFreference -YSNLD--LKEAGGKILVAYKMKPKDGHGYLEAAAHFAAESSTGTNVEVS-TTDDFTKGV DALVYYIDEA-MRIAYPLELFDRNMLVSFLTLAIGNNQGMGDIEHAKMVDFYMPERAIQM FDGPATDISNLWRILG-KDGYISGTIIKPKLGLRPEPFAKAAYQFWLGG-DFIKNDEPQG NQTFCPLKKVLPLVYDALKRAQDETGQAKLFSMNITADDYEMCARADYALEVGPDAFLVD GYVGGPGMVTTARRQYPGQYLHYHRAGHGAVTSPSARGYTAFVLAKMSRLQGASGIHVGT MYGKME-GEGDDRNIAYMIEQGPVYFQKWYGMKPTTPIISGGMNALRLPGFFQNLGHNVI NTAGGGSYGHIDSPAAGAISLRQAYECWKSGADPIQYAKQHKEAFE-YPGWREKL >Methanocella_arvoryzae_MRE5_REFreference DKDLII--QVPLIMTVRTTYYVEADAPIA--KVAKEIAAEQSTGTW----TEVAAEKEKL GAHVVSAEGNTVVIDFPVEIFEPDNVPQILSVVAGNLFGLGGLKACRLMDVDFP--LTKY YNGPEFGIEEVRKILGVYDRPLVGTIIKPKVGLSPKRTAEVAEQAALGGLDLIKDDETLT DQKFCPLEERLTMVMDRLHKVEDRIGKPCFYAVNVTCGADMIVERAERAVELGANMVMVD ILTAGFSAVQALTDEKIGVPIHIHRTMHGALTRGK-YGIAMPVISKLTRMCGGTNLHTGT YAGKMERNVCEIDASRDILR------KPWAGYKRVWPVSSGGLYPQKVRENLDCYGIDVI LQAGGGIHGHPEGTTVGVKAMFQAVEAWQQQKTLEEYAKTHKELAGALKQWGPSQ >Methanocella_paludicol_REFreference --MFIM---------VRTTYHVEAETKP-LDVVAKEIAAEQSTGTW----TNVPAEREKV GARVVSAENNTVVIDFPEEIFEPTNIPQILSVVAGNLYGLGGLKALRLEDVDF-DPLVKY YKGPVFGIKEVRKYLGVYDRPLVGTIIKPKVGLSPKRTAEVAEQAALGGLDLIKDDETLT DQAFCPLEERLTAVMDKLDKVKKVVGKPCLYAVNVTTGADKIVERAERAKELGANMIMVD ILTAGFSAIQAINEANIDLPIHVHRTMHGALTRGP-FGISMPVISKLTRMVGGTNLHIGT YSGKMEHNVCDIDRSRDILR------QPWDNFKPMFPAVSGGIYPQLLKPNLDCYGIDCI LQAGGGIHGHPEGTVAGAKAMFQAVEAWKENVPLEQYAKTHHELAVALKKWGPSQ >Methanofollis_liminatans_DSM_414_REFreference --------MTKD---VVATYYFRPRSDTTPEAAAQAIVEEETTGTWTDI-TTTTDYVRRL DGEVLSLSGNVTRLRYPAEIFEAGNIPQYLSVVAGNLFGLGRLEAVRLLDVDFPEELV-P FTGPKFGMEGVRKLIGTTDRPHVGTIIKPKVGLNPKDTAEVAYKAAIGGVDLIKDDETLT DQTFCPMDERLQAVMAKLDEAKSETGQEVLYAVNISARADDIVERAEHAIDLGANMVMID VITCGFTALQALAEAPVSVPVHVHRTMHGAITRNPEHGIAMRPLARIVRMLGGDQLHTGT VSGKMSHDVSELRGDNAALT------DPYYGLKPTFPVASGGLHPGKVAAELKNLGTNIV LQAGGGIHGHPDGTEAGARAMRQAADAFMAGVSAEEYAKDHRELARALERWGNR- >Methanospirillum_hungatei_JF-_REFreference ---------MTD---VIATYYFRPREGVTPEWAAQAIAEEQTTGTWTDI-STRQNYVHYL DGVVDEISGGTCTIRYPSEIFEPGNIPQYLSVLAGNLFGLSRIAAVRLVDVEFSRDIV-P FKGPKFGIEGVRKLAGTVDRPHVGTIIKPKVGLNPKDTAAVAYEAAIGGVDLIKDDETLT DQAFCPLGERLPLIMEQLDRVKSETGRNVLYAVNISSAGDKIVQRAREAARMGANMLMID VIVCGFDAVRAVAEEPINLPIHVHRTMHAAITRNPEHGIAMRPICRLVRMLGGDQLHTGT VSGKMEHDVTELRGDNLALT------EPFFDLKPTFPVASGGLHPGGVHKEVSMLGRDII LQAGGGIHGHPDGTRVGATAMRQAVDAAVAGISPATYAEDHPELKRALDKWGIA- >gi|939574905|gb|LIHJ01000032.1|_68 GKTFFA---KPEEKHVIATYYVES--KLPLPEAGEQIAIEESIGTWTEV-TTTTAWIKKL PAKVFRWEGGLVSIAFPSELFDTGGIPNILSIVAGNLFGLSTLKNVRLLDIDLSKEIVSV FPGPKFGIERIRRFVG-TDRPHVGTIVKPKVGLDPKQTATVAYEAALGGVDLIKDDETLT NQRFCPLEERVIRVMEAIDKAKGETGKNVFYAVNITANVDKMMGLADTAIEHGANMLMVD ILTAGFSAVQMLAKDPINVPLHIHRTMHGAITRNPKHGIHMMVLAKLVRVAGGDQLHTGT AAGKMEKAVTEVKKVNDFLR------NEWYHLNNALPVASGGIHPGIVHPNIKKLGKDLV INAGGGIHGHPMGTRAGAMAMRQAIDAFMDNIPLGEHAKTHRELQLALETWDYRY >QMYC01000269.1_2 KEWYEE--FTSTDPDVVCTYYVTPREGVGIQSAAINIAAEQSIGTW----TKVVTMTDKL AGKVFQIDGGVVKIAFPLDLFDLAGVSHLLSIVAGNLFGIGALKHVRLLDISMPKEYVKV FKGPKFGIEGVRKIIGTDPRPHLGTIIKPKVGLTPKETARVAYEAAVGGVDFVKDDETLT SQKFNPIEDRVSNVMEALDRAREETGRNILYAVDVTADIGMLWKNVETALSNGANCIMVD VLCVGFPALRSLAEDPVKVPIHVHRCMHAAMTRSPVHGIHMLVLAKLVRLCGGDQLHTGT AKGKMEGGVEGIRYMNDFLR------SDWYDLKTVLPVASGGIHPALVPSNLRLLGYDIQ LNAGGGIHGHPKGTRAGAKAMLQAIEAFMKGIPLEEYAKDHIELQEALKHWGTKF >gi|1007509650|gb|LUCB01000003.1|_11 REWYEE--FTSTDPDVVCTYYVAPHEEVSIRSAAINIAAEQSIGTW----TKVATMTDEL AGKVFQIDGGVVKIAFPLDLFDLAGVSHLLSIVAGNLFGMGALKHVRLLDISMPREYVKA FKGPKFGIEGIREILDTDPRPHLGTIIKPKVGLTPKETAKVAYEAAVGGVDFVKDDETLT NQKFNPIEDRVSNVMEALDKAREETGRNILYAVDVTADIGMLWKNVETALSNGANCIMVD VLCVGFPVLRSLAEDPIKVPIHVHRCMHAAMTRSSIHGIHMLVLAKLVRLCGGDQLHTGT AKGKMEGKYSGVKYMNDFLR------GDWYGLKTVLPVASGGIHPALVPSNLRLLGYDIQ LNAGGGIHGHPKGTRAGAKAMLQAIEAFMKGIPLEEYAKDHVELREALEYWGMKF >QMYB01000005.1_3 REWYEEFTSTPEPDRVVCTYYVAPK-EVSIRSAAINIAAEQSIGTWTKVATMTDRVFE-L AGKVFQIVGD-VKIAFPLDLFDLDGVSHLLSIVAGNLFGMGALKHVRLLDISMPREYVKA FKGPKFGIEGIREILDTDPRPHLGTIIKPKVGLTPKETAKVAYEAAVGGVDFVKDDETLT NQKFNPIEDRVSNVMEALDKAREETGRNILYAVDVTADIGMLWKNVETALSNGANCIMVD VLCVGFPVLRSLAEDPIKVPIHVHRCMHAAMTRSSIHGIHMLVLAKLVRLCGGDQLHTGT AKGKMEGKYSGVKGIRYMND---FLRGDWYGLKTVLPVASGGIHPALVPSNLRLLGYDIQ LNAGGGIHGHPKGTRAGAKAMLQAIEAFMKGIPLEEYAKDHVEA---LEYWGMKF >gwc1_scaffold_1390_11Dojkabacteria HNGYIAKGWKPKDENYIITYDIELADGIKFEDVVAACAAESSTGTWTDVYSGRDSGVRKL RAVAFDLEPE-FKIAYKKELFELDNMSGILAGVVGNIDGMKMLKAFRCLDIRFPKDIIQS FQGPQFGIDGMRELLHV-EEPLLCTVPKPKVGRTAKEQAKLAEILFSAANHGIKDDENLT SLRFNTFDDRCELIHRVLREVERKTGQRKFYLCNVTHSNDVMIQRADKIKAQGGRWMMMD IVTTGFSAIQTMRMHNPGLAIHAHRAMHSLFDRESGFSMSMIVTAKIMRMLGVDSLHGGA PNTKMEGEPKLIRDALQLDISSMTLGQNWYGMKPVWHVASGGLHPGTIPEVLHQLGEDII IQTGGGVLGHPWGIEAGVEAVIQAKDVALGRGNMEQWIVEHPDAKA-AHHWGFGP >UBA2177contig_38184_3Dojkabacteria SKKYIAKGWKPKDRDYIITFDIELADGI-FETVVAACAAESSTGTWTEVYSGKDSGVEKL RATAFDLDPKTFKIAYKVELFELGNMSGLLAGIVGNVGGMKMIKALRCLDIRFPESMVKS FPGPQFGIGGIREMLQV-ERPLLLTVPKPKVGRTAKEQAELARILFTAANHGIKDDENLT NLFFNKFDDRCELVHKVRREIEEKSGKKKFYLCNITHSNDTMISRADKIKAQGGRWMMLD VVTTGFSALHTMRLKNPGLAIHAHRAMHSLFTRESGFSMSMVANAKILRLLGVDALHGGA PKTKME-NYGEPKLIRDALQSSVTLGQNWFGMKPVWHVASGGLHPGSIPEVIHQLGEDIM LQCGGGVLGHPWGIEAGVEAVVQAKELALGSGDLEKWLKENPDAKA-ADHWGFGP >UBA4813contig_29569_18Dojkabacteria SKKYIAKGWKPKDKDYIITFDIELADGV-FETVVAACAAESSTGTWTKVYSGKDSGVEKL RAIAFDLDPATFKIAYKVELFELGNMSGLLAGIVGNIGGMKMLKALRCLDIRFPEPMVKS FPGPQFGIDGVREMLQV-ERPLLLTVVKPKVGRTAKEQAELARILFTAANHGIKDDENLT NLYFNTFDERSELIHKVIKEVEEKSGKRKFYLCNITHSNDVMISRADKIKAQGGRWMMMD VVTTGFSALHTMRLKNPGLAIHAHRAMHSLFTRESGFSMSMVANAKILRLLGVDSLHGGA PKTKME-NYGEPKLIRDALQSSITLGQNWFGMKPVWHVASGGLHPGSVPEVLHQLGEDIM IQCGGGVLGHPWGIEAGVEAVVAAKELALGSGNLEKWIMENPDAKA-ADHWGFGP >gwf1_scaffold_4493_1Dojkabacteria HSSYVAKGWKPKDEDYIISFDIELADGI-FEDVAAACAAESSTGTWTGVYSGKNSGVKKM KAIAFDLEPETFKIAYKKELFESGNMSGILAGVVGNIDGMKMLKAFRCLDIRFPKAIVQS FPGPQFGISGVREQMQL-EEPLLCTVPKPKVGRTAKEQAELAKILFTAANHGIKDDENLT SLFFSKFEDRCDLVHAVQRDIEKKSGKKKFYLCNVTHSNDTMIERTDRIKAQGGRWMMLD VVVTGFSAVHTMRLKNPGLAIHAHRAMHGLFDRDSGFSLSMVVLAKLMRLLGVDSMHGGA PKTKME-NYGEPKLIRDTLQSLMTLGQNWYGMKPVWHVASGGLHPGTIGESIKQLGEDII IQCGGGVLGHPWGIEAGVEAVVQAKDIALGRGNIDDWILQNPDAKA-AQHWGFDE >rifoxyc3_full_scaffold_48488_1Dojkabacteria HSSYVAKGWKPKDEDYIISYDIELADGV-FEDVAAACAAESSTGTWTGVYSGKNSGVKKM KAVAFDLEPKTFKIAYKKELFELGNMSGLLAGVVGNIDGMKMLKAFRCLDIRFPKAIVQS FPGPQFGISGVREQMQL-EEPLLCTVPKPKVGRTAKEQAELAKILFTAANHGIKDDENLT SLFFNKFEERCDRVLEVQRDIEKKSGKKKFYLCNVTHSNDTMIERTDRIKAQGGRWMMLD VVVTGFTAVHTMRLKNPGLAIHAHRAMHGLFDRDSGFSMSMVVIAKLMRLLGVDSIHGGA PKTKME-NYGEPKLIRDALQSSMTLGQNWFGMKPVWHVASGGLHPGTIGESIKQLGEDII IQCGGGVLGHPWGIEAGVEAVVQAKDILGRGN---------------IHDWI--- >RIFOXYB1_FULL_GWC1_WS6_33_15_rifoxyb1_full_scaffold_291_15Dojkabacteria QLNYIAKGWEPPGRDYLITYDIELADGI-FRDAAASCAAESSTGTWTKVYAGKDSGIKKM KAVAYDLNPETFKIAYRSDLFEEGNMAGLLAGVAGNIDSMKMLKAFRLIDIKFPKNIVNS FPGPQFGIDGVRKFMEI-ERPMLCTVPKPKVGRTALEQAQLAKVLFTAAKQGIKDDENLT SLYFNTFEDRCKKVHAVQREIEQKSGRKKFYLCNITHSDDIMLERADMIKENGGRWLMMD AVTTGFTAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYGEPKLVQEALQTDMTLGQNWYGMKGVWHVASGGLHAGTIGDSITQLGENLI IQAGGGVLGHPWGIEAGVEAVVQARDLMAKG-DIKAWIIDNPEAKA-AQHWGFDP >gwd1_scaffold_9197_5Dojkabacteria QLNYIAKGWEPPGRDYLITYDIELADGI-FRDAAASCAAESSTGTWTKVYAGKDSGIKKM KAVAYDLNPETFKIAYRSDLFEEGNMAGLLAGVAGNIDSMKMLKAFRLIDIKFPKNIVNS LG---------------------------RTALEQAQLAKVLFTAAKGEFQGIKDDENLT SLYFNTFEDRCKKVHAVQREIEQKSGRKKFYLCNITHSDDIMLERADMIKENGGRWLMMD AVTTGFTAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYGEPKLVQEALQTDMTLGQNWYGMKGVWHVASGGLHAGTIGDSITQLGENLI IQAGGGVLGHPWGIEAGVEAVVQARDLMAKG-DIKAWIIDNPEAKA-AQHWGFDP >gwe2_scaffold_12031_6Dojkabacteria ------------------------------------------------------------ ----------------------------------------------------M------- ----------------------LCTVPKPKVGRTALEQAQLAKVLFTAAKQGIKDDENLT SLYFNTFEDRCKKVHAVQREIEQKSGRKKFYLCNITHSDDIMLERADMIKENGGRWLMMD AVTTGFTAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA X----X-XTSNRYDIRSKLV--------WRSMACSFWWFTCRNNRGLYNTI--------- -----------R------------------------------------------- >gwf2_scaffold_35647_6Dojkabacteria --------------------M--------------------------------------- -----EI--------------ERG------------------------------------ --------------------PMLCTVPKPKVGRTALEQAQLAKVLFTAAKQGIKDDENLT SLYFNTFEDRCKKVHAVQREIEQKSGRKKFYLCNITHSDDIMLERADMIKENGGRWLMMD AVTTGFTAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA X--------------------------------------------------------XXX XXXXXXXXXHQ------------------QQI----------------------- >UBA3081contig_986_14Dojkabacteria QFNYVAKGWKPKDRDYLITYDIELADGIKFEDAVASCAAESSTGTWTKVFSGKNSGVKKL KAIAYDLDPK-FKIAYKREIFEEGNMSGLLAGIAGNIDSMKMLKAFRLIDVRFPESIVKS FPGPQFGIDGVRKFMQI-ESPLMCTVAKPKIGRTAKEQAALAKILFTAAKQGIKDDENLT SLYFNQFENRCRRVHKVRRDIERKTGKRKMFLCNTTHSDDIMLERADMIKEQGGRWMMMD VVTTGFTAVQTVRNQNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQGPMTLGQNWFGMKGVWHTASGGLHSGSIGESIKKLGEDII LQAGGGVLGHPWGIEAGVEAMVQARDIAMKRLDIEVWIKNNPDAKA-SSHWGFGP >UBA5749contig_21275_3Dojkabacteria QFNYVAKGWKPKDEDLITELDIELADGIKFEDAAGSCAAESSTGTWTKVDEGKHSGTKKL KAIAYDLDPK-FKIAYKKEIFEPGNMAGILAGVAGNIDSMKMLKAFRLLDIRFPKSLVQS FPGPQFGITGVRDFMQI-EKPLMCTVPKPKIGRTAKEQAYLAKILFTAANQGIKDDENLT SLYFNKFEDRCKLVHEVQRDIEKKSGKRKMYLCNTTHSDDIMLERADMIKANGGRWMMMD VV-TGFSAVQTIRKHNPGLAIHAHRAMHGLLDRESGFSMSMVLIAKLMRLLGVDSLHGGA PKTKME-NYNEPKLIQEVLQSDMTLGQNWYGMKGVWHTASGGLHPGTIGESITKLGEDII VQAGGGVLGHPWGIEAGVEAMVQAKNIAMSRGDLKKWILDNPDAKA-ASFWGFDP >UBA2259contig_3196_71Dojkabacteria QFNYVAKGWKPKDEEYLITYDIELADGI-FEDAAGSCAAESSTGTWTKVDEGKHSGTKKL KAVAYDLDPATFKIAYKREIFEDGNISGLLAGIAGNIDSMKMLKAFRLIDVRFPKSIVKS FAGPQFGINGVREFMQI-ERPLMCTVPKPKIGRSAKEQAYLAKILFTAAKQGIKDDENLT SLYFNRFEDRCKLVHEVQREVEKKSGKRKMYLCNTTHSDDIMLERADMIKENGGRWMMMD VVTTGFAAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQSDMTLGQNWYGMKGVWHTASGGLHPGTIGDSITKLGENII VQAGGGVLGHPWGIEAGVEAMVQARDIAMEKGNIKKWIIDNPDAKA-SSFWGFDP >UBA2222contig_92676_9Dojkabacteria QFDYVAKGWKPKDEDYLITYDIELADGI-FEDAAGSCAAESSTGTWTKVDEGKHSGTKKL KAVAYDLDPKTFKIAYRKEIFEEGNMAGLFAGIAGNIDSMKMLKAFRLIDVRFPKSIVKS FQGPQFGIDGVRKFMQI-ERPLLCTVPKPKVGRTAKEQAYLARILFTAANQGIKDDENLT SLYFNKFEDRCKLVHEVQREIEKKSGKRKLYLCNTTHSDDIMLERADMIKENGGRWLMID VVTTGFAAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQGDMTLGQNWFGMKGVWHTASGGLHAGTMGDSITKLGEDII IQGGGGVLGHPWGIEAGVEAFVQARDIAMEKGDIKKWIIDHPDAKA-ASFWGFDP >gwd1_scaffold_806_10Dojkabacteria QFNYVAKGWQPKDKDYLITYDIELADGI-FEDAAGSCAAESSTGTWTKVDEGKHSGTKKL KAVAYDLDPKTFKIAYKKEIFEEGNMAGLFAGIAGNIDSMKMLKAFRLIDVRFPQSLVES FPGPQFGISGVREFMQI-ERPLLCTVPKPKVGRTAKEQAYLAKVLFTAAKQGIKDDENLT SLYFNKFEDRCKLVHEVQRDIEKKSGKRKLYFCNTTHSDDIMLERADLIKEHGGRWMMMD VVTTGFTAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKVMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQSDMTLGQDWFGMKGVWHTASGGLHPGTIGESLTKLGEDIV LQAGGGVLGHPWGIEAGVEAMVEARDIAMERKDLKKWIIDHPEAKA-SSHWGFDP >Ig5185_scaffold_2729_7Dojkabacteria QFNYVAKGWSPPDKDYLITYDIELADGI-FEDAAGSCAAESSTGTWTKVDEGKHSGTKKL KAVAYDLDPKTFKIAYKKEIFEEGNMAGLFAGIAGNIDSMKMLKAFRLIDVRFPESLVKS FPGPQFGITGVREFMQI-ERPLMCTVPKPKVGRTAKEQAYLAKILFTAAKQGIKDDENLT SLYFNKFEDRCKLVHEVQRDIEKKSGKRKLYFCNTTHSDDIMLERADLIKENGGRWMMMD VVTTGFSAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQSDMTLGQNWFGMKGVWHTASGGLHSGTIGDSILKLGEDII LQAGGGVLGHPWGIEAGVEAMVEARDIAMERKDLKQWIIDNPQAKA-ASHWGFDP >rifoxya1_full_scaffold_23855_2Dojkabacteria QFNYVAKGWQPKDKDYLITYDIELADGI-FEDAAGSCAAESSTGTWTKVDEGKHSGTKKL KAVAYDLDPKTFKIAYKKEIFEEGNMAGLFAGIAGNIDSMKMLKAFRLIDVRFPESLVKS FPGPQFGITGVREFMQI-ERPLMCTVPKPKVGRTAKEQAYLAKILFTAAKQGIKDDENLT SLYFNKFEDRCKLVHEVQRDIEKKSGKRKLYFCNTTHSDDIMLDRADMIKANGGRWMMMD VVTTGFSAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQSDMTLGQNWFGMKGVWHTASGGLHSGTIGDSILKLGEDII LQAGGGVLGHPWGIEAGVEAMVEARDIAMKRQDLKKWILDHPQAKA-SSFWGFDP >rifoxyd1_full_scaffold_4223_4Dojkabacteria ------------------------------------------------------------ ----------------MAGLF---------AGIAGNIDSMKMLKAFRLIDVRFPQSLVES FPGPQFGISGVREFMQI-ERPLLCTVPKPKVGRTAKEQAYLAKVLFTAAKQGIKDDENLT SLYFNKFEDRCKLVHEVQRDIEKKSGKRKLYFCNTTHSDDIMLDRADMIKANGGRWMMMD VVTTGFSAVQTVRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYNEPKLIQEALQSDMTLGQNWFGMKGVWHTASGGLHSGTIGDSILKLGEDII LQAGGGVLGHPWGIEAGVEAMVEARDIAMKRQDLKKWILDHPQAKA-SSFWGFDP >UBA2278contig_10362_8Dojkabacteria QFDYLAKGWKPKDKDYLITYDIELADGIKFEDAAASCAAESSTGTWTEVYAGKNSGVKKL KAVAYDLEPK-FKIAYKKELFEEGNMAGILAGIAGNIDGMKMLKAFRLIDVRFPESIVKS FPGPQFGISGVREFMGI-ESPLLCTVPKPKVGRTAKEQAELAKILFTAANHGIKDDENLT NLYFNKFEDRCKRVHAVRRDIEKKTGKKKFYLCNISHSDDIMMERAEMIKKEGGRWLMVD VVTTGFTAIQTLRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYGEPKLIQEALQSFITLGQNWYGMKGVWHTASGGLHAGTIGDAITQLGEDII IQAGGGILGHPWGIEAGVEAVVQAKEIAMQKGDIKKWILENPDAKA-AQHWGFDP >bjp_ig2158_scaffold_0_692Dojkabacteria QLDYLAKGWKPKDKDYLITYDIELADGI-FEDAAASCAAESSTGTWTEVYAGKNSGIKKL KAIAYDLDPKTFKIAYKKEIFEEGNMSGILAGIAGNIDSMKMLKAFRLIDVRFPESLVKS FPGPMFGIKGVRDFMEL-PSPMLCTVPKPKVGRTAKEQAELAKVLFTAANQGIKDDENLT SLYFNKFEDRCKRVHAVQRDIEKKSGKKKFYLCNVTHSDDIMLDRADMIKKEGGRWMMID VVTTGFTAVQTMRKHNPGLAIHAHRAMHGLLDRESGFSMSMVVIAKIMRLLGVDSLHGGA PKTKME-NYGEPKLIQEALQSHITLGQNWFGMKGVWHTASGGLHAGTIGESITQLGEDLV IQAGGGTLGHPWGIEAGVEAVVQARDIAMEKGDIKKWIVEHPEAKA-AQHWGFDP >tara_MHASMcontig_673921_7Pacearchaeota QLDYIAKGWKPTSNHLITLIKLELKIAIPFEEGAASVAAESSTGTWTKVYDGPGSGIPKY KALAFDFNNKMFKVAYPMDLFEPDNMSGLLAGIVGNIAGMKMVSGMRIFDVKFPKKMVKA FLGPKFGVEGVRKFLG-KKKCLLCTVPKPKIGRTDKEQALLARDLFSAGDDGIKDDENLT NLTFNPFYSRVKLVLDELKKAENKTGKKKFYLCDISHSNDEMEKRARHIKKHKGTFMMID VITTGFAAVDTIRRKDLGLAIHAHRAMHGFITRDNSFSISMIVLAKIFRLLGVDSIHGGS PLAKME-DYGEAEYIAKVLQRIPSLGQDWHHIKPVWMTASGGLHPGDFESVLSKLGNDII IQCGGGVLGHPKGVKAGVIAAQQARSIFYKKIPIRRFVRDCSELASAVEIWGYGP >rifcsphigho2_01_scaffold_1015_10Pacearchaeota QLDYLAKGWKPS-SDLVTLLKMEL----AFQEACASVAAESSTGTWTKVYDGKGSGIAKK KAVAFDLDYK-FKVAYPCDLFELDNVSGLLAGIVGNIAGMKMVSGMRVFDVQFPKKMIAV FPGPQFGISGVRKFLK-KKKCLLVTVPKPKIGRTDKEQSQLAWELFTAGDEGIKDDENLT NLYFNSFERRCRLVYDEIKRAERVTGKKKFYLCNITHSNEVMTKRAELIKKNGGKFLMMD VITTGFSAVDTMRRKNLGLAIHAHRAMHGFITRDNSFSISMIVFAKLFRLLGVDSLHGGS PLAKME-DYHEALYIQKVLQKIPSLGQEWYHIKPVWMTASGGLHPGDFETVLGQLGENIL IQCGGGVLGHPQGVKAGVIAAQQAR--FIHGIPLRTFVKDCSELAQAVAEWGYGP >tara_38885_13Pacearchaeota QLDYLAKGWKASSKNLVTLIKMELAKGSVFEEGCASVAAESSTGTWTKVFDGKGSGIPKK KAVAFDYRNHMFKVAYPIDLFELDNISGLLAGIVGNIAGMKMVSGMRIFDVRFPKKMVKA FLGPRFGVRGVRRFLG-KKKCLLATVPKPKIGRTAKEQALLARDLFSAGDEGIKDDENLT NLYFNKFDKRCRLVHNERKRAEKISARKKFYLCNVTHSNDKMVDRANLIKKNGGRFMMVD VVTTGFSAVDSLRRKNLGLAIHAHRAMHGFVTRDNSFSISMVVMAKLFRLLGVDSIHGGS PLAKME-DYGEAVYISKVLQKIPSLGQEWFHIKPVWMTASGGLHPGDFEKILKLLGEDII IQCGGGVLGHPQGVKPGVIAAQQARSLFENGISVRKFVKDCSELARAVGEWGYGP >rifcsplowo2_01_scaffold_11977_10Pacearchaeota QFDYLAKGWKPKEHKLITLLKVELSKEAIFEEAAASIAAESSTGTWTKVYDGKDSGIPRL RAMAYDLDYK-FKVAYPVQLFEKNNLSGLLAGIVGNIAGMKMVHAMRVFDLRFPKPMIKA FLGPKYGVSGVRKILKI-PRPLLATVPKPKIGRNAKEQAHLAELLFTSGKDGIKDDENLT SLYFNNFYDRTQRVLHILQKAEKKTGKKKFYLANATHSQDEMIKRAEYIKKHKGIYMMMD VVCTGITAVDSIRRKDLGLAIHAHRAMHGFMTRDNSFSISMIVLAKWFRLLGVDNLHGGS PLAKME-DYGEAKYIQEVLQKIPSLGQKWYHIKPVWMVASGGLHPGDFEEVIKILGEDIV IQCGGGLLGHPEGIARGVEAIEEARDSVMKKIPLKEYVKKNPDAAA-VKLWGYGP >rifcsplowo2_01_scaffold_118_122Pacearchaeota QLNYLAKRWKPRDENYLITFKIELKIKI-FEEAAASIAAESSTGTWTKVYAGKGSGILKE RALAFDLDYKMFKVAYPVTLFEPDNISGLLAGIVGNIAGMKMVRAMRMFDVRFPRKIIQA FPGPKYGINGIRKRLG-KPCPLLVTVPKPKIGRTAKEQANLARILFTSGRDGIKDDENLT SLSFNTFDERVRLVFKELKAAENKTKHKKFYLCNISHSNDVMEKHAKLLQKHDSVFMMMD VICTGFAAVDTMRRKNLSLAIHAHRAMHGFITRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYGEASYIQHVLQSIPSLGQKWFHIKPVWITASGGLHPGDVEEVVRVLGEDVI IQCGGGLLGHPWGVEAGVCALEQALELAVKKIPFKQWVKKHPNAAA-IKLWGYGP >rifoxya1_full_scaffold_39886_1Pacearchaeota QFNYLAKGWKPKDENIITLLKVELESAVKFEKAAATIASESSIGTWTKVYDGEDSGIPKY RALAYDYRKNMFKIAYPIELFELDNISGLLAGITGNIAGMKMLSGLRVYDVRLPKKMIEK FPGPMFGVPGIRKMLD-KPKPITITVPKPKIGRTDVEQAKLAYQLTTAGNGGIKDDENLT NLAFNSFEKRARLVLAQLDIAEKLTGNKKIYLCNLTHSNETMEQRAKLIKSLGGRYFMMD VVTTGFAALDTMRRKNTGLAIHAHRAMHAFMTRDNSFSVSMIFLAKIFRLLGVDNLHGGS PLAKME-DYGEAKYIREVLQKIPSLGQNWHHIKPVWMVTSGGLHPGDIEAVMKIIGEDIV IQLGGGVLGHPHGIDAGVKAVEEAIECYYRRIPLKKFAEQNPSASA-VNLWGYGP >rifcsphigho2_02_scaffold_900_36Pacearchaeota QFNYVAKGWKPQNEDVITLLKAELDSAIAFEEAAGTVAAESSTGTWTKVYDGKDSGIPRK KALAYDLDNEMFKIAYPLELFELDNISGLLAGIVGNIAGMKMLSGLRIYDIKFPKAMIAK FPGPAFGVKGVRKMLR-KPKPLTATVPKPKIGRTDVEQAKLARILFTSGNDGIKDDENLT NLSFNHFDKRISLVLKEAREAEKLTGNKKFYLANITHSNEIMKKHAELIKKNKGVYCMID VVTTGFTAIDTFRRMNTGLAIHAHRAMHGFMTRDNSFSISMIVLAKVFRLLGVDNMHGGS PLAKME-DYGEAKYIKDILQQIPSLGQNWHHIKPVWMVASGGLHPGDIEKVLDELGEDVI LQFGGGLLGHPQGIEAGVKAIEQARDAYMKKIPLRKFIQNNPNAAA-VRLWGYGP >rifcsplowo2_01_scaffold_352176_1Pacearchaeota QFNYVAKGWEPANENVITLLKIELDSAIAFEEAAGTVAAESSTGTWTKVYDGKDSGIPRK KALAYDLDSEMFKVAYPIELFELDNISGLLAGIAGNIAGMKMLSGLRVYDVKFPKVMVEK FPGPAFGVSGIRKILK-RPKPLVATVPKPKIGRTDAEQAKLARILFTAGNDGIKDDENLT NLNFNHFDKRVSLILKEAREAEKLNGNKKFYLANLTHSNNIMKKRAEIIKKNKEVYCMID VVTTGFSAIDTFRRLNIGLAIHAHRAMHGFMTRDNSFSISMIVLAKLFRLLGVDDLHGGS PLAKME-DYGEAKYIRDVLQQIPSLGQNWHHIKPVWMVASGGLHPGDIENVLDELGE--- -----------D------------------------------------------- >rifcsphigho2_02_scaffold_438502_1Pacearchaeota QMNYLAKGWKPSQDQLITLIKVELGKNL-FEEAAASVAAESSTGTWTKVYDGPGSGIPKE RAFAFDLDYKMFKVAYPISLFELNNISGLFAGIIGNIAGMKMVSAMRVYDVRYPKKMIEA FPGPQFGVPGLRKILN-KAKPLVATVPKPKIGRTAKEQAELAKILFTSGLDGIKDDENLT NLSFNKFEERTKLVLKAIKAAEKQTGHKKFYLCNISHSNETMRKHAKLIKDNGGIFMMLD VICTGFSAVDTMRRYNPGLAIHAHRAMHGFITRDNS------------------------ -PG--------------------------------------------------IHG---- -----------K------------------------------------------- >rifoxya2_sub10_scaffold_3412_1Pacearchaeota QLNYVAKDWKPS-KELITLIKVELETAISFEDAAASVAAESSTGTWTKVYDGEGSGMSKE RAMAFDLDYTMFKVAYPPFLFELDNISGLFAGIIGNIAGMKMVSAMRVYDVKFPEKMIKA FPGPRFGVKGIRKLLK-KPRPLVATVPKPKIGRTAKEQADLAKMLFTSGLDGIKDDENLT NLSFNKFDERAKCVLKALKAAEKQTGKKKFYLCNVSHSNEVMRKRAKLIKDNGGIFMMLD VVCTGFAAADTMRRYNPGLAIHAHRAMHGFITRDNSFSVSMIFLAKTFRLLGVDTLHGGS PLAKME-DYGEAKYIQQVLQKIPSLGQDWYHIKPVWMVASGGLHPGDFELILKELGEDII IQMGGTTSSFRSGLRSKDRVLMGAIEQTLGAILLAALVFQYFSLST-IGSIAILV >rifcsphigho2_02_sub10_scaffold_110_79Woesearchaeota QLNYVAKDWKPS-KELITLIKVELT-AISFEDAAASVAAESSTGTWTKVYDGEGSGMSKE RAMAFDLDYT-FKVAYPPFLFELDNISGLFAGIIGNIAGMKMVSAMRVYDVKFPEKMIKA FPGPRFGVKGIRKLLK-KPRPLVATVPKPKIGRTAKEQADLAKMLFTSGLDGIKDDENLT NLSFNKFDERAKCVLKALKAAEKQTGKKKFYLCNVSHSNEVMRKRATCN---AWFYYKRQ SW-SAWS-WKTL----WLLSIYDFPCKNISSTRCRY--LAWLSISK-------------- -NGRL-----------------------W------------------------------- ------------------------------------------------------- >CG10_big_fil_rev_8_21_14_0.10_scaffold_617_c_46Pacearchaeota EKAIRE--AKSSSEKLRKSL---------FEEAAATVAAESSTGTWTKVYDGKDSGIPKL RALVFDFKRNMFKVAYPIELLELDNISGILAGISGNIAGMKMVSAIRIYDVYFPKKMIEK FPGPKFGVPGIRKLLN-KKKCLVATVPKPKIGRTDKEQAELAEILFTSGKDGIKDDENLT HLKFNDFYKRCKLVLGKIREAEKKTGNKKFYLCNTTHSNNEMMNRANFIKKNGGIFMMMD VVTTGFSAVDTIRRNNPGLAIHAHRAMHGFLTRDNSFSVSMIFLAKLYRLMGVDSLHGGS PLAKME-DYGEAEYIQKLLQKIPTLGQDWGHIKPVWMVASGGLHPGDFEAVLTKLGENII LQCGGGLLGHPNGVEAGVIAIEEAREIYEKGISVRNFVKENPEAKA-IEHWGYGP >rifoxyc1_sub10_scaffold_612_2Pacearchaeota QLDYLASGWKPKDENFIITLKIDFEAGKTLFEAAASVAAESSTGTWTKVYDGPDSGIPEK RAMAYDLDYE-FKVAYPIELFELDNVSGLLAGIVGNIAGMKMVSAMRIFDVRFPRKMILK LPGPAFGVEGIRKILN-KPKPLVATVPKPKIGRTDIEQAELAKILFTSGNDGIKDDENLT SLPFNNFDKRCKLVLKEAKEAEKITGNKKFYLCNISHSNETMEKHAKTIKDNGGIFMMID VVTTGFTGIHSMRLKNTGLAIHAHRAMHGFITRDNSFSISMIFLAKLFRLLGVDTLHGGS PLAKME-DYGEAIYIKDVLQQIPSLGQNWYNVKPVWMVASGGLHPGDFETVLKGLGDDII IQCGGGLLGHPWGVEAGVKAIEQARDAWLKKIPLKKYVKDNPQTEA-IRLWGYGP >rifcsphigho2_01_scaffold_156249_3Pacearchaeota ------------------------------------------------------------ -----------------------------------------------FHDARFPEAMVKK FLGPKFGIEGVRKFLN-KPSCLVATVPKPKIGRSDVEQSRLAKTLFTSGHDGIKDDENLT SLFFNNFEKRARLVLDELKKAEKETGHKKFYLCNTTHSNDKMMEHAKTIKKFGGNFMMMD VVTTGFTAVDTIRRRDLGLAIHAHRAMHGFITRDNSFSISMIFLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWYHIKPVWMTASGGLHPGDFEAVLNSLGEDII IQCGGGLLGHPEGVERGVEAIEEARNAYENGVHLKDFVKDNPTAVA-VKLWGYGP >CG10_big_fil_rev_8_21_14_0.10_scaffold_9756_11Pacearchaeota QFNYLAKGWKPANADLITLIKVEFSKSAVFEDAAASVAAESSTGTWTKVYDGRDSGIQKY RALAYDLDYKMFKVAYPIELFELNNISGLLAGIVGNIAGMKMVSAMRIYDIKFPKQMVKS FLGPKFGVLGVRKMLK-KPKCLVVTVPKPKIGRSDKEQSHLARELFTSGHDGIKDDENLT NLYFNKFDKRAKLVLEEVRKAEKVTGNKKFYLCNTTHSNDEMERRVDLIKKNGGSFMMMD VVTTGFAAVDTIRRRNLGLAIHAHRAMHGFITRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYGEAKYIQRVLQKIPSLGQNWHHIKPVWMTASGGLHPGDFEAVLKELGEDII IQCGGGLLGHPGGVAQGVIAIEEARMIYEKGIPLRKWVKENQDAVA-VKLWGYSP >rifcsplowo2_01_scaffold_107583_2Pacearchaeota QFNYLAKGWKPKDKDLTTLIKIEL--AIKFEEAAATIAAESSTGTWTKVYSGKGSGISKK RAMAYSLDYK-FKIAYPIDLFELDNISGLLAGIVGNIAGMKMISAMRVYDVKFPEKMVKA FPGPRYGVQGIRKILK-KNKPFTCTVPKPKIGRTDVEQAKLAKILFTSGKDGIKDDENLT SLPFNKFDKRCSLVLNELKKAEIKTGNKKFYLCNTTHSDDEMLRRANIIKKNGGVYMMMD VITTGFAAVHTIRKRNLNLAIHAHRAMHGFITRDNSFSISMIVLAKLFRLIGVDSFHGGS PLAKME-DYGEAKYIQKVLQKIPSLGQNWYNIKSVLMTASGGLHPGDIEVILKELGEDII MQFGGGLLGHPGGIEAGVKSIEQALEAYRKKIPLNKFVEKNPNAAA-IKLWGYGP >CG_2015-10_scaffold_33734_2Pacearchaeota QFNYLAKGWKPKDENVITLLKIELDEKAQFEDAAATVAAESSTGTWTKVYSGSDSGILEK RAIAYDLDYDMFKIAYPIELFELNNLSGLLAGIVGNIAGMKMISAMRIYDIKLPKKIVES FPGPKFGVKGIRKLLN-KPKCLVITVPKPKIGRTDKEQAELAKILFNSGKDGIKDDENLT NLSFNKFEKRAKLVLDEAKIAEKKTKHKKFYLCNITHSNEEMINRAKIIKDNQGIFIMID VVTTGFAAVDTLRRKNLGLAIHAHRAMHGFMTRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYKEASYIRHVLQKIPSLGQKWRHINPVWMTASGGLHPGDFESILNELGQDTI IQCGGGLLGHPEGIEAGVIAIETAREIYEKGIPLKKFVEKNPDAVA-MKHWGYGP >CG11_big_fil_rev_8_21_14_0.20_scaffold_5804_7Pacearchaeota QFNYLAKGWKPKDENVITLLKIELDEKAQFEDAAATVAAESSTGTWTKVYSGSDSGILEK RAIAYDLDYDMFKIAYPIELFELNNLSGLLAGIVGNIAGMKMISAMRIYDIKLPKKIVES FPGPKFGVEGIRKLLN-KPKCLVITVPKPKIGRTDKEQAELAKILFNSGKDGIKDDENLT NLSFNKFEKRAKLVLDEAKIAEKKTKHKKFYLCNITHSNEEMINRAKIIKDNQGIFIMMD VVTTGFAAVDTLRRKNLGLAIHAHRAMHGFMTRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYKEASYIRHVLQKIPSLGQKWWRINPVWMTASGGLHPGDFESILNKLGQDTI IQCGGGLLGHPEGIEAGVIAIETAREIYEKGIPLKKFVEKNPDAVA-MKHWGYGP >scaffold_31901_2_REFreference QFNYLAKGWKPKDENVITLLKIELDEKAQFEDAAATVAAESSTGTWTKVYSGSDSGILEK RAIAYDYDRNMFKIAYPIELFELNNLSGLLAGIVGNIAGMKMISAMRIYDIKLPKKIVES FPGPKFGVKGIRKLLN-KPKCLVITVPKPKIGRTDKEQAELAKILFNSGKDGIKDDENLT NLSFNKFEKRAKLVLDEAKIAEKKTKHKKFYLCNITHSNEEMINRAKIIKDNQGIFIMMD VVTTGFAAVDTLRRKNLGLAIHAHRAMHGFMTRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYKEASYIRHVLQKIPSLGQKWWRINPVWMTASGGLHPGDFESILNKLGQDTI IQCGGGLLGHPEGIEAGVIAIETAREIYEKGIPLKKFVEKNPDAVA-VKHWGYGP >rifcsphigho2_01_scaffold_113321_3Pacearchaeota QFDYLARGWKPNDENVITLLKIELAVKERFEEATATVAAESSTGTWTKVYDGKDSGVPKK RAMAFDLDYK-FKIAYPIELFELNNVSGLLAGVVGNIAGMKMVSAMRMYDIRFPRKMIQA FPGPKFGVPGVRKLLK-KPRCLVATVPKPKIGRTDKEQAELARILFTSGKEGIKDDENLT NLSFNKFSKRAELVLKELREAERKTGHKKFYLCNITHSNEEMLRRARLIKDNGGIFMMVD VITTGFSAVDSLRRKNLDLAIHAHRAMHGFITRDNSFSISMIFLAKLFRLLGVDSIHGGS PLAKMEGEAEYIKDILQAKNQIPTLGQRWYHIKPVWMAASGGLHPGDFEAILDALGEDIL IQCGGGLLGHPKGVEAGVISIEQAREIYEKGISVRKFVKQNPKAAA-VKLWGFGP >rifcsplowo2_01_scaffold_369553_1Pacearchaeota ------------------------------------------------------------ ----------------------------------------------------FPRKMISA FPGPAFGIEGIRNVLG-KKKCLVCTVPKPKIGRTDVEQAKLARILFNSGNDGIKDDENLL SLPFNKFEKRCKLVLRETREAEKRTGNKKFYLCNVTHSNNEMLKRVRMIKENKGVFMMMD VVTTGFAAVDSVRRANTGLAIHAHRAMHGFITRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQKWWHIKPVWMTASGGLHPGDFETILEELGSDII IQCGGGLLGHPQGIEAGVKAIEQARDAYYKNIPLKKFVEKNPNAAA-VKLWGYGP >rifcsphigho2_01_scaffold_14826_13Pacearchaeota QFDYVARGWKPKNEDVITLLKIELEQEL-FEEAAATVAAESSTGTWTKVYDGKDSGIPAK RALAYDLDYEMFKVAYPIDLFEANNISGLFAGIIGNIAGMKMVSAMRVYDVRLPRKIIQA FPGPAFGIEGVRKMLG-KKKCLVCTVPKPKIGRTDVEQAKLAKILFNTGRDGIKDDENLL DFYFNKYDKRCKMVLAEARAAEKATGNKKFYLCNTTHSNNEMLRRAKLIKDNKGIFMMMD VVTTGFAAVDSVRRANTGLAIHAHRAMHGFITRDNSFSVSMIFLAKIFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQKWWNIKPVWMVASGGLHPGDFEAVLKALGDDII IQCGGGLLGHPHGVEAGVIAIEQARDSYYKGIHLKDYVKKNPNAAA-VKLWGYGP >rifcsphigho2_01_scaffold_44408_3Pacearchaeota QFDYVAKGWKPKDEDVITLLKIEIEAAVNFQKAAATVAAESSTGTWTRVYDGKDSGIPAK RALAYDLDYEMFKVAYPMDLFEANNISGLFAGIIGNIAGMKMVSAMRVYDVRLPRKIIQA FPGPAFGIEGVRKMLG-KKKCLVCTVPKPKIGRTDVEQAKLARILFNSGRDGIKDDENLL DFYFNRYDKRCKMVLAEARAAEKATGNKKFYLCNTTHSNNEMLRRAKLIKENKGYFMMMD VVTTGFAAVDSVRRANTGLAIHAHRAMHGFITRDNSFSVSMIFLAKIFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQIIPSLGQKWWHIKPVWMVASGGLHPGDFEAVLKTLGNDII IQCGGGLLGHPQGIEAGVKAIEQARDAYYKGIHLKDFVKKNPNAAA-VKLWGYGP >rifcsplowo2_01_scaffold_156696_4Pacearchaeota QLNYLANGWKPEDENVITMIKVELDEEASFEDAASTIAAESSTGTWTRVYSGKDSGIPKI KALAYDLDYE-FKVAYPIILFEMNNLSGLLAGIAGNIAGMKMISAMRIYDIRFPKKMIDA FPGPRFGVSGVRKLLK-KKKCLICTVPKPKIGRTDEEQARLAKILFTSGKEGIKDDENLT NLKFNMFEKRARLILKELKEAERKTGNRKFYLCNTTHSDNEMLKRAKVIKENGGIFMMMD VVTTGFAAVDTVRRNNPGLAIHAHRAMHGFMTRDNSFSISMIVLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDILQIIPSLGQNWHHIKPVWMTASGGLHPGDFETVLSELGDDII IQCGGGLLGHPDGIEAGVKAIEQARELWEKGISNKEFVEKNPDAKA-IKLWGYGP >CG10_big_fil_rev_8_21_14_0.10_scaffold_76659_1Pacearchaeota QLDYLAKGWKPKDNDLITLLKVDFKDAIKFEDAAASVAAESSTGTWTKVYDGENSGIKRL RAIAYDLDYK-FKVAYPIELFELNNMSGLLAGIVGNIAGMKMVKAMRFYDVKFPKKMVEA YPGPRFGIEGVRKLLN-KPKCLVATVPKPKIGRSDKEQSILAKELFTSGSDGIKDDENLT SLYFNDFDKRTKLVLKELKDAEKKTGNKKFYLCNTTHSNDEMLRRANLIKNNGGIFMMMD VITTGFAATDTMRRKNPGLAIHAHRAMHGFITRDNS------------------------ -----P------------------------------------------------------ -----GVHGSGE------------------------------------------- >rifcsplowo2_01_scaffold_36463_6Pacearchaeota QFDYLAEGWKPKDENVITLLKVELAVKEKFEEAAATIAAESSTGTWTKVYDGKGSGIPEK RALAFDLDYE-FKIAYPLELFELNNISGLLAGIVGNIAGMKMVSAMRVYDIRFPRKMINN FPGPAFGVEGVRKLLN-KPKCLVCTVPKPKIGRTDKEQAELAKILFTSGNEGIKDDENLT SLSFNQFDKRCKLVLKELKLAEKKTKHKKFYLCNTTHSDDEVLRRAKLIKENNGIFMMLD VVTTGFAAVDTIRRKNTGLAIHAHRAMHGFITRDNSFSISMIILAKLFRLMGVDSIHGGS PLAKME-DYGEARYIQQVLQIIPSLGQDWHNIKPVWMVASGGLHPGDFEAILNELGNDVI IQCGGGLLGHPNGIEAGVIAIEQARDIYYKKIPIKKFVKENPDAVA-VGYWGYGP >rifcsphigho2_01_scaffold_42348_2Pacearchaeota QFNYIARGWKPKDENVITLLKIELDETANFEDAAGTVAAESSTGTWTKVYDGKDSGILKI RAVAYDLDYEMFKIAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDVRFPKKMINA FPGPRFGVSGIRKLLK-KPICLVCTVPKPKIGRTAKEQAELARILFNSGNDGIKDDENLT NLFFNKFDERAKLVLSELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKENQGIFMMVD VVTTGFSAVDTIRRKNLGLAIHGHRAMHGFITRDNSFSVSMIFLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWYHIKSVWMTASGGLHPGDFEEVLNKLGDDII IQCGGGLLGHPSGIEAGVKAIEQARDIYYKKIPLKKFIKENPNASA-VKLWGFGP >rifcsplowo2_01_scaffold_142244_2Pacearchaeota QFNYVAKGWKPKDENVVTLLKGKPEKEL-FADAAGTVAAESSTGTWTKVYSGKDSGIKKF RALAYDLDYEMFKIAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMINA FPGPRFGVPGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFSSGNDGIKDDENLT NLFFNKFDERCKLVLSELKKAEKKNGNKKFYLCNVTHSDDEMLRRAKLIKDNEGIFMMID VVTTGFAAVDTIRRKNPGLAIHAHRAMHGFITRDNSFSVSMIFLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWYHVKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPDGVEAGVKAIEQARDIYYKKIPLKKFIKENPNASA-VKLWGFGP >rifcsphigho2_01_scaffold_125366_1Pacearchaeota --------IEFDETAKEAKYHLKKKIDKLFEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RAMAYELDYE-FKIAYPIELFELDNISGLLAGVVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVAGIRKLLK-KPKCLVCTVPKPKIGRTAKEQTELAKILFSSGNDGIKDDENLT NLVFNKFDERCKLILNELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMLD VVTTGFAAVDTIRRKNPELAIHAHRAMHGFITRDNSFSVSMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWYNIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPGGVEAGVKAIEQARDIYYKKIPLKKFVKENPNASA-VKIWGYGP >rifcsphigho2_02_scaffold_440038_1Pacearchaeota QFNYIAKGWKPKDDNVITLLKIELQ-AIEFEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RAMAYELDYE-FKIAYPIELFELDNISGLLAGVVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVAGIRKLLK-KPKCLVCTVPKPKIGRTAKEQTELAKILFSSGNDGIKDDENLT NLVFNKFDERCKLILNELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMLD VVTTGFAAVDTIRRKNPELAIHAHRAMHGFITRDNSFSVSMILLAKLFRLLGVDSLHGGS PLAKME-----------------------------------------------DYG---- -----------E-----------AI-----------YIK---------------- >rifcsplowo2_01_scaffold_97081_1Pacearchaeota --------------------EKEL-----FEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RALAFDLDYG-FKVAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPRKMIKM FPGPRFGVSGIRKLLK-KPRCLVCTVPKPKIGRTAKEQSELAKILFSSGNDGIKDDENLT NLFFNKFDERTKLVLNELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMLD VVTTGFAAVDTIRRKNLGLAIHAHRAMHGFITRDNSFSISMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWHHIKPIWMTASGGLHPGDFEEVLNKLGDDII IQCGGGLLGHPDGIEAGVKAIEQARDIYYKKIPLKKFVKENPNASA-VKLWGYGP >GWB1_scaffold_7111_3Pacearchaeota QFNYVARGWKPKDENVITLLKIELDETAHFEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RALAFDYEKKMFKVAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMINA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQAELAKILFNSENDGIKDDENLT NLFFNKFDERCRLVLNELKKAEQKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMLD VVTTGFAAVDTIRRKNLGLAIHAHRAMHGFITRDNSFSISMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWHHIKPIWMTASGGLHPGDFEEVLNKLGDDII IQCGGGLLGHPDGIEAGVKAIEQARDIYYKKIPLKKFVKENPNASA-VKLWGYGP >rifoxyd1_full_scaffold_35748_2Pacearchaeota --------------------K------------------EQS------------------ -----EL----AKILF-----NS-----------GN------------------------ --GT-------------Y--------------------------------DGIKDDENLT NLFFNKFDERCRLVLNELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKDNEGIFMMID VVTTGFAAVDTIRRKNPGLAIHAHRAMHGFITRDNSFSISMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKNVLQKIPSLGQNWHNIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPNGVEAGVKAIEQARDIYYKKIPLKKFIKENPKASA-VKLWGFGP >GWB1_scaffold_11255_2Pacearchaeota QFNYVAKGWKPQNSDVITLLKGKPEKEL-FEDAVGTIAAESSTGTWTKVYSGKDSGIKKF RAMAYDLNYEMFKVAYPIELFELNNVSGLLADIVGNIAGMKMVSAMRVYDIRFPKKMINA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFSSGNDGIKDDENLT NLVFNKFDERCKLVLNELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKDNEGIFMMID VVTTGFAAVDTIRRKNLGLAIHAHRAMHGFITRDNSFSISMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKNVLQKIPSLGQNWHNIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPDGVEAGVKAIEQARDIYYKKIPLKKFIKSNSELASAVKLWGFGP >rifoxya1_full_scaffold_21555_2Pacearchaeota QFNYVAKGWKPQNSDVITLLKGKPEKEL-FEDAVGTIAAESSTGTWTKVYSGKDSGIKKF RAMAYDYEKKMFKVAYPIELFELNNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMINA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFSSGNDGIKDDENLT NLVFNKFDERCKLVLNELKKAEKKTGNKKFYLCNTTHSNDEVLRRAKLIKDNEGIFMMID VVTTGFAAVDTIRRKNLGLAIHAHRAMHGFITRDNSFSISMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKNVLQKIPSLGQNWHNIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPDGVEAGVKAIEQARDI-----YYSKKIQ--------IQNWLLP- >rifcsplowo2_12_scaffold_228026_1Pacearchaeota QFNYVAKGWKPKNSDVITLLKGKPEKEL-FEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RAMAYDLDYEMFKVAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFNSGNDGIKDDENLT NLVFNKFDERCKLILNELKKAEKKTGNKKVYLCNTTHSNDEVLRRAKLIKENGGIFMMLD VVTTGFAAVDTIRRKNPELAIHAHRAMHGFITRDNSFSVSMIFLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKNVLQKIPSLGQNWHNIKPVFMTASGGLHPGDFEEVLNKLGDDII IQCGGGLLGHPDGIEAGVK------------------------------------ >rifoxyb1_full_scaffold_56378_1Pacearchaeota ---------------------IAE----------------------------------KF RAMAYDLDYEMFKVAYPIELFELNNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFNSGNDGIKDDENLT NLFFNKFDERCKLILNELKKAEQKTGNKKFYLCNTTHSDDEMMRRAKLIKENGGIFMMMD VVTTGFAAVDTIRRKNPGLAIHAHRAMHGFITRDNSFSVSMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWHHIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPDGVEAGVKAIEQARDIYYKKIPLKKFVKENPKASA-VKLWGYGP >rifoxyd1_full_scaffold_17021_2Pacearchaeota QFNYVAKGWKPKNSDVITLLKIELEKAIEFEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RAMAYDLDYE-FKVAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFNSGNDGIKDDENLT NLFFNKFDERCKLILNELKKAEQKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMMD VVTTGFAAVDTVRRKNPGLAIHAHRAMHGFITRDNSFSVSMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQKIPSLGQNWHHIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPDGVEAGVKAIEQARDIYYKKIPLKKFVKENPNASA-VKLWGYGP >rifcsphigho2_02_scaffold_332588_3Pacearchaeota QFNYVAKGWKPKNSDVITLLKIELEKAIEFEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RAMAYDLDYE-FKVAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFNSGNDGIKDDENLT NLFFNKFDERCKLILNELKKAEQKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMMD VVTTGFAAVDTVRRKNPGLAIHAHRAMHGFITRDNSFSVSMILLAKLFRLLGVDSLHGGS PLAKME-DYGEAIYIKDVLQ-----SKELHNKIPSLGHASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPNGVEAGVKAIEQARDIYYKKIPLKKFIKENPKASA-VKLWGFGP >rifcsphigho2_01_sub10_scaffold_15638_2Pacearchaeota QFNYVAKGWKPKNSDVITLLKGKPEKEL-FEDAAGTVAAESSTGTWTKVYSGKDSGIKKF RAMAYDYEKKMFKVAYPIELFELDNISGLLAGIVGNIAGMKMVSAMRVYDIRFPKKMIQA FPGPRFGVSGIRKLLK-KPKCLVCTVPKPKIGRTAKEQSELAKILFNSGNDGIKDDENLT NLFFNKFDERCKLILNELKKAEQKTGNKKFYLCNTTHSNDEVLRRAKLIKENGGIFMMMD VVTTGFAAVDTVRRKNPGLAIHAHRAMHGFITRDNSFSVSMILLAKLFRLLGVDSLHGGS PLAKME-DYREAIYIKDVLQKIPSLGQNWHHIKPVFMTASGGLHPGDFEEVLNKLGDDII IQFGGGLLGHPNGVEAGVKAIEQARDIYYKKIPLKKFIKENPNASA-VKLWGFGP >GWF1_WS6_37_7_1_127Dojkabacteria QIDYFAIGWKPKDEDLIAYFQIALKDGI-FKEVAATVAAESSTGTWTEVDESSNAGMDKY RAVVFDIEEDRFKIAYPVNLFEPDNMSGMLAGFAGNIGGMKALKGMRLLDVRFPKKIVQS FPGPKFGIEGMRDMLGYPKKPIMGTVPKPKVGRTAEQQSALARELWTAGDDFIKDDENLT SLEFNNFYDRARLVLDVQHDVENMSKNKRFSLLNITHSNEELIRRAETIRDQGGRFVMID VVTTGFGMLHSYRNEDLGLAIHAHRAMHSFWTRHNSFSVSMLVLAKIYRLLGVDSLHTGS PKAKME-DYGESDIIARTVSKNLTLGQNWFGMKETWPVASGGLHPGVLDKVVEKLGPNIF IQLGGGVLGHPEGIRRGVEAALEARKAIAAGEPISEFVKKHQDAVA-VKEWGTEP >rifcsplowo2_02_scaffold_12225_10Parcubacteria QLNYLAPDWQPPEADVIVLFKLQLAPGV-FAQAAASVAAESSTGTWTTVEHRPDSGMETY KAVAYDLSQTMFKVAYRVDLFEPGNISGFLAGPLGNVAGMKMIQGLRIFDIRFPRPWLES FPGPRFGIAGLRQTLG-HPQPLLGTVPKPKVGRTATEQASLARRLWTAGDDFIKDDENLT SLPFNKFEDRCRAVLKVQREVESVGTPKKLYLCNITHSDDVMRQRANLIAQEGGRAMMLD VVTTGFAALHTMRLKNPGLFIHAHRAMHGFITRESGFSVAMLALAKIFRLLGVDSLHIGS PKSKMQGNPDEVSGYDSYSQNFHTLGQKWYGLKPVWSVASGGLHPGVIDTVINKLGHDIF IQLGGGVLGHPEGADRGVEAALEARRAVMSGRSIKEYVKQNPRAAA-VAQWGTEP >RIFOXYC1_FULL_GWE2_OD1_42_9_rifoxyc1_full_scaffold_781_14Veblenbacteria QLNYLAAGWQPEKADVIVQFKLQLATGV-MFQAAASVAAESSTGTWTTVEDRADSGMKDY KAIVFDINEH-FKVAYRADLFEPGNISGFLAGPAGNIAGMKMVQGLRIFDIRFPRSLVES FPGPRFGIDGLRQLLGQTEKPILGTVPKPKVGRTAAEQAILAKRLWSAGDDFIKDDENLT SLPFNKFEDRCRAILKVQRDIEASGQSKKLYLCNVTHSDDIMLSRANLIAEEGGRVMMMD VVTTGFAAVHTMRQKNPGLFIHGHRAMHGFMTRESGFSVSMLTLAKIYRLLGVDSLHIGS PKSKMQGESELIDAAMNPAENFHTLGQNWYSLKPVWSVASGGLHPGVIDTVVNKLGRDIF IQLGGGVLGHPGGAERGVEAALEARQAVMRGQTIKEYVKTNPEAEA-VAKWGTEP >rifcsphigho2_01_scaffold_9117_11Gottesmanbacteria QFNYIAEGWKPEDEEVIVQFKIELADGV-FPEAAASVAAESSTGTWTKVEERADSGIREY KALAFDLDKKMFKVAYKVDLFEHDNMSGFLAGPVGNVGGMKMVKGVRVFDIRFPKPIVKS FPGPLYGIEGVRDLLN-NRKPILGTVPKPKVGRSAKEQADLARRLWLAGDDFLKDDENLT SLPFNTFEERVKLVHAVQKEAEKKTGKKKLYLSNITHSNDTMVSRANMIKDNGGRCMMID VITTGFAAVHTIRLKNPELIIHAHRAMHAFITRESGFSISMFVFAKIFRLLGVDSMHGGS PKSKME-DYGESVEIMKIFTRFYTLGQNWFGMKTVWPVASGGLHPGVMDTVVEVLGSDCY IQLGGGVLGHPEGAERGVESALEARSAIAEGITVKEYASKHPNAKA-VELWGTEP >RIFCSPHIGHO2_02_FULL_OP11_39_11_rifcsphigho2_02_scaffold_11740_29Gottesmanbacteria QLNYIAQGWVPDDENIIVQFKISLADFV-FLDGAASVAAESSTGTWTKVDEGPDSGIKEY KAVVFDIDQNMFKVAYKVDLFEPDNMSGFLAGPAGNIGGMKMVKGVRIFDMRFPKKMVQA FPGPRYGIEGIRNLLD-WNMPIMGTVPKPKVGRTAKEQAILARRMFSAGTDFIKDDENLT SLLFNRFEERAKLVNEAIADVEKQSGKKKLYLCNLTHSNDTMIKRAELIKETGGRCMMID VVTTGFAAVHTMRLKNPGLAIHAHRAMHAFITRESGFSISMATLAKLCRLLGVDSIHTGS PKAKME-DYGESEMIKDVLVKFKTLGQNWFGMKTVWPTASGGLHPGVMDVVVEKLGKDCY IQMGGGVLGHPQGIEKGVEAALEARRAIAQGISVKEYADKYPDAYA-IKLWGTEP >RBG_13_OP11_45_10_RBG_13_scaffold_565_32Gottesmanbacteria QLNYIAAGWQPEDENVIIQFKMQLADGI-FLDAAASVAAESSTGTWTKVEDRPDSGLAEF KAIVFDVDEGMFKVAYKTDLFEHDNMAGFLAGPVGNIGGMKMVKGLRLMDIRFPKAIVTA FPGPRYGIAGVRDLLN-DRKPIFGTVPKPKVGRNAEEQAALARRLFTAGDDFIKDDENLT SLPLNRFEDRAKLVLEAIAEVEQKTGIRKLYLCNITHSNDVMMKRADMIAEYGGRCMMLD VVASGVAAVHTMRLKNPNLIIHAHRAMHAFITRESGFSVSMFILAKMYRMLGVDSLHSGS PKAKME-DYGEAEEIGNILRSFHTLGQNWFGMKSVWPVASGGLHPGVLDVVIAKMTPDCY IQLGGGVLGHPEGAEKGVEAALEARKAVADGMTIKEFVAKNSNAIA-VGLWGTEP >RBG_13_scaffold_565_33Gottesmanbacteria QLNYIAAGWQPEDENVIIQFKMQLADGI-FLDAAASVAAESSTGTWTKVEDRPDSGLAEF KAIVFDVDEGMFKVAYKTDLFEHDNMAGFLAGPVGNIGGMKMVKGLRLMDIRFPKAIVTA FPGPRYGIAGVRDLLN-DRKPIFGTVPKPKVGRNAEEQAALARRLFTAGDDFIKDDENLT SLPLNRFEDRAKLVLEAIAEVEQKTGIRKLYLCNITHSNDVMMKRADMIAEYGGRCMMLD VVASGVAAVHTMRLKNPNLIIHAHRAMHAFITRESGFSVSMFILAKMYRMLGVDSLHSGS PKAKME-DYGEAEEIGNILRSFHTLGQNWFGMKSVWPVASGGLHPGVLDVVIAKMTPDCY IQLGD--HGPL-----PGIATRR-------------------------------- >rifcsplowo2_01_scaffold_312661_3Gottesmanbacteria ------------------------------------------------------------ ------------------------------------------------------------ ---------------------IIGTVPKPKVGRTAQEQAVLARRLWTAGDDFIKDDENLT SLSFNRFEDRAKLVLEAQRKAEKQTGFRKLYLCNVTHSNDVMMKRADMIKSHGGRCMMLD VVASGVGAVHTMRLKNPELIIHAHRAMHAFITRESGFSVSMFILAKIFRMLGVDSMHSGS PKSKME-DYGEAEEIGGILTKFHTLGQNWFGMKNVWPVASGGLHPGVMDVVIAKMTPDCY VQLGGGVLGHPEGGERGVEAALEARNAVYSGLTVKEFVSKNPNSKA-VELWGTEP >rifoxyc3_full_scaffold_6738_7Gottesmanbacteria ------------------------------------------------------------ ----------------------------------------------------M------- --------------------PIIGTVPKPKVGRTAQEQAVLARRLWTAGDDFIKDDENLT SLSFNRFEDRAKLVLGAQLKAEKQTGFKKLYLCNVTHSNDVMMKRADMIKSYGGRCMMID VVASGVGAVHTMRLKNPELIIHAHRAMHAFITRESGFSVSMFILAKIFRMLGVDSMHSGS PKSKMEGEAEEIGGILTQNKKFHTLGQNWFGMKNVWPVASGGLHPGVMDVVIAKMTPDCY VQLGGGVLGHPEGGERGVEAALEAR----NAV----YSG---------------- >gwa2_scaffold_143220_2Roizmanbacteria QFDYIAPGWQPKDENVIIQFKMQLADGI-MFKGAASVAAESSTGTWTKVEDRPDSGLKEF KAIVFDMDKKLFKVAYKTDLFEYDNMAGFLAGPVGNIGGMKMVKGLRLMDIRFPKAIVTA FPGPRYGIAGVRDLLQ-KRNPIIGTVPKPKVGRTAQEQAVLARRLWTAGDDFIKDDENLT SLSFNRFEDRAKLVLEAQRKAEKQTGFRKLYLCNVTHSNDVMMKRADMIKSYGGRCMMLD VVASGVGAVHTMRLKNPELIIHAHRAMHAFITRESGFSVSMFILAKIFRMLGVDSMHSGS PKSKME-DYGEAEEIGGILT----------------QNKTCKNDPGLLCPTWWRSATRRR RTWSGGGLGSK-----ERRLFRI---------NRKRICV---------------- >rifoxyb2_full_scaffold_37565_5Roizmanbacteria -MQLAD--------------GIDP--SM-FLKGAASVAAESSTGTWTKVEDRPDSGLKEF KAIVFDMDKKLFKVAYKTDLFEYDNMAGFLAGPVGNIGGMKMVKGLRLMDIRFPKAIVTA FPGPRYGIAGVRDLLQ-KRGPILGNVPKPKVGRTAQEQAVLARRLWTAGDDFVKDDENLT SLSFNRFEDRVKLVHQAQTEVEKQTGFKKLYLCNITHSNDVMMKRTDMIKSYGGRCMMLD VVASGVGAVHTMRLKNPELIIHAHRAMHAFMTRESGFSVSMFILAKIFLMLGVDSMHSGS PKSKME-DYGEAE----------------------------------------EIG---- -----------S------------------------------------------- >rifcsphigho2_02_scaffold_255720_1Gottesmanbacteria QLNYIAPGWQPDDENVLVAFKIQLAENI-FPEAAASVAAESSTGTWTKVEDRSDSGIKSY KAIVYDLDSEMFKVAYKVDLFEADNMSGFLAGPVGNIGGMKMVRGLRIFDIRFPKKMVEA FPGPRYGIEGVRDLLE-ERRPVVGTVPKPKVGRTAVEQAELARRLWTAGDDFIKDDENLT SLIFSKFEDRVRLIHQVQKEVEQKNGKKKLYLCNLTHSNDVMLKRADLIKEAGGRCLMID VVTTGMSAVHTMRLKNPELIIHAHRAMHAFMTRESGFSISMIALAKIYRLLGVDSLHSGS PKAKME-DYGESIEIANVLTNFRTLGQNWFGKKTVWPVASGGLHPGVVDRVIEGLGKDIY IQMGGGVLGHPEGAERGAEAAIEARNAVCDGITVKEYAV-------S-------- >rifcsplowo2_01_scaffold_1975_18Gottesmanbacteria --------MKNT------------------------------------------------ ----------------------------------GR---------CRLWE---------R FP---------------SXR-------------NSHEQAELARRLWTAGDDFIKDDENLT SLSFNKFEDRARLIHEVQRELEEKGGKKKLYLCNLTHSSDIMIERANLIKETGGRCMMID VVTTGPAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLSKIFRLLGVDSIHTGS PKAKME-DYGESEEIAKVLTKFHSLGQRWFGTKRVWPTASGGLHPGVIDTVIGKLGSDCY IQMGGGVLGHPQGIQRGVEAAIEAREAVYEGKSVKEYVSENPDAVA-VGLWGTEP >rifcsp10_1_full_scaffold_5396_9Gottesmanbacteria QLNYIAPGWKPEDQNVIVQYKIELNDFV-FLDAAASVAAESSTGTWTKVDEGADSGILLY KALVFDIDEPIFKVAYKKDLFEEDNMSGFLAGPAGNIGGMKMVSGLRMFDIRFPEAMVKA FPGPRYGIEGVRDLLE-YRKPIMGTVPKPKVGRNSHEQAELARRLWTAGDDFIKDDENLT SLSFNKFEDRARLIHEVQRELEEKGGKKKLYLCNLTHSSDIMIERANLIKETGGRCMMID VVTTGPAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLSKIFRLLGVDSIHTGS PKAKME-DYGESEEIAKVLTKFHSLGQRWFGTKRVWPTASGGLHPGVMDTVIGKLGSDCY IQMGGGVLGHPQGIQRGVEAAIEAREAVYEGKSVKEYVSENPDAVA-VGLWGTEP >RifSed_csp1_13ft_1_scaffold_2112_6Gottesmanbacteria QLNYIAPGWKPEDQNVIVQYKIELNDFV-FLDAAASVAAESSTGTWTKVDEGADSGILLY KALVFDIDEPIFKVAYKKDLFEEDNMSGFLAGPAGNIGGMKMVSGLRMFDIRFPEAMVKA FPGPRYGIEGVRDLLE-YRKPIMGTVPKPKVGRNSHEQAELARRLWTAGDDFIKDDENLT SLSFNKFEDRARLIHEVQRELEEK----------------------------GGM----- RL-----------KN-PELVIHAHRAMHAFITRESGFSVSMLTLSKIFRLLGVDSIHTGS PKAKME-DYGESEEIAKVLTKFHSLGQRWFGTKRVWPTASGGLHPGVIDTVIGKLGSDCY IQMGGGVLGHPQGIQRGVEAAIEAREAVYEGKSVKEYVSENPDAVA-VGLWGTEP >RifSed_csp1_13ft_4_scaffold_30323_4Gottesmanbacteria QLNYIAPGWKPEDQNVIVQYKIELNDFV-FLDAAASVAAESSTGTWTKVDEGADSGILLY KALVFDIDEPIFKVAYKKDLFEEDNMSGFLAGPAGNIGGMKMVSGLRMFDIE-------- ---------GVRDLLE-YRKPIMGTVPKPKVGRNSHEQAELARRLWTAGDDFIKDDENLT SLSFNKFEDRARLIHEVQRELEEKGGKKKLYLCNLTHSSDIMIERANLIKETGGRCMMID VVTTGPAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLSKIFRLLGVDSIHTGS PKAKME-DYGESEEIAKVLTKFHSLGQRWFGTKRVWPTASGGLHPGVIDTVIGKLGSDCY IQMGGGVLGHPQGIQRGVEAAIEAREAVYEGKSVKEYVSENPDAVA-VGLWGTEP >gwa2_scaffold_134_63Microgenomates QLNYIAPGWQPPDQDLLIQFKIELDKKM-FPDATASVAAESSTGTWTKVDEGPESGIKEM KAIVYEIDEAMFRVAYKTDLFETDNMSGFLAGPAGNIGGMKMVKGLRMFDIRFPEKMVKA FPGPRYGIEGVRDLLD-ERKPIIGTVPKPKVGRTAQEQAVLARRLWTAGDDFIKDDENLT SLFFNKFEDRARLVHQVQKELEEKGGKKKLYLCNLSHSNDTMLKRADLIKETGGRCMMID VIATGFAAVHSMRLKNPELIIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKMEGESEEIMRNLTQDEKFHTLGQKWFGTKKVWPVASGGLHPGVMDTVVSKMGHDCY VQMGGGVLGHPQGVERGVEAALEARRAIINGQTVQEYVEQNPDAAA-VGLWGTEP >mol-32-1605-051349_9Microgenomates QLNYIAAGWRPADQDLIIQFKIELDKKL-FAEAAASVAAESSTGTWTKVDEGPKSGIKEM KAIVFDVDEAMFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVAGLRMFDIRFPEKMVKA FPGPRWGIDGVRDILE-ERKPILGTVPKPKVGRTAEEQAELARRLWTAGDDFIKDDENLT SLFFNKFEDRARLVHQVQREIEERGGRKKLYLCNLTHSNDVMTSRANYIKETGGRCMMID VIATGFAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKMEGESEEIMRVLTSDEKFHTLGQKWFGTKRVWPVASGGLHPGVMDTVVAKLGHDCY IQMGGGVLGHPQGVERGVEAALEARRAIVNGQTVKEYVAQNPDAGA-VALWGTEP >16ft_4_scaffold_46755_1Gottesmanbacteria QLNYIAPGWQPEDQDLVIQFKIELADFV-FADAAASVAAESSTGTWTKVDEGPQSGIKDM KAIVFDVDEAMFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRYGIEGVRDLLE-DRKPILGTVPKPKVGRTAEEQAVLARRLWTAGDDFVKDDENLT SLFFNKFDDRARLVHQVQLEIEQKTGKKKLYLCNLSHSNDVMLKRADLIKETGGRCMMID VIATGFAAVHTMRLKNPELVIHAHRAMHAFITRESGEGIEG---------------H-GA P------------HLTDNQK---------------------GQTSGTAKRYLDTFH---- ---EQAFYQHLFGINL-------------QNLFSNSFYI----LKL--------- >rifcsp2_19_4_full_scaffold_90165_2Gottesmanbacteria QLNYIAPGWQPEDQDLVIQFKIELADFV-FADAAASVAAESSTGTWTKVDEGPQSGIKDM KAIVFDVDEAMFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRYGIEGVRDLLE-DRKPILGTVPKPKVGRTAEEQAVLARRLWTAGDDFVKDDENLT SLFFNKFDDRARLVHQVQLEIEQKTGKKKLYLCNLSHSNDVMLKRADLIKETGGRCMMID VIATGFAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKME-DYGESEEIMRVLTKFLSLGQKWFGTKRVWPVAS------LLPKI-----PQPW LQLSVSGAQNP------------NL--FI---DILTSSR---------------- >mol-32-1605-013658_7Gottesmanbacteria QLNYIAPGWQPEDQDLVIQFKIELADFV-FADAAASVAAESSTGTWTKVDEGPQSGIKGM KAIVFEVDEAMFKVAYKTDLFETDNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRFGIEGVRDLLE-DRKPILGTVPKPKVGRTAEEQAVLARRLWTAGDDFIKDDENLT SLFFNKFDDRARLVHQVQKEIEQKTGKKKLYLCNLSHSNDVMLKRADLIKETGGRCMMID VIATGFTAVHTMRLKNPELIIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKME-DYGESEEIMRVLTKFLSLGQKWYGTKRVWPVASGGLHPGVMDTVLAKFGHDCY IQMGGGVLGHPQGVERGVEAALEARQAFSQGLTVNDYVSQNPDASA-VALWGTEP >RifSed_csp2_13ft_2_scaffold_388165_1Gottesmanbacteria QLNYIAPGWQPEDQDLVIQFKIELADFVKFADAAASVAAESSTGTWTKVDEGPQSGIKDM KAIVFDVDEA-FKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRYGIEGVRDLLE-DRKPILGTVPKPKVGRTAEEQAVLARRLWTAGDDFVKDDENLT SLFFNKFDDRARLVHQVQLEIEQKTGKKKLYLCNLSHSNDVMLKRADLIKETGGRCMMID VIATGFAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKMEGESEEIMRVLTQDEKFLSLGQKWFGTKRVWPVASGGLHPGVMDTVLAKFGRDCY IQMGGGVLGHPQGVERGVEAALEARRAIVQGLTVKEFVTTDQESYRTDATFQKKM >RifSed_csp1_19ft_4_scaffold_94308_4Gottesmanbacteria QLNYIAPGWQPEDQDLVIQFKIELADFV-FADAAASVAAESSTGTWTKVDEGPQSGIKDM KAIVFDVDEAMFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRYGIEGVRDLLE-DRKPILGTVPKPKVGRTAEEQAVLARRLWTAGDDFVKDDENLT SLFFNKFDDRARLVHQVQLEIEQKTGKKKLYLCNLSHSNDVMLKRADLIKETGGRCMMID VIATGFAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS P--KA-------------------------------------------------FGRDCY IQMGGGVLGHPQGVERGVEAALEARRAIVQGLTVKEFVTKNPDASA-VSLWGTEP >rifcsplowo2_01_scaffold_1259_4Gottesmanbacteria QLNYIAPGWQPEDQDLVIQFKIELADFV-FADAAASVAAESSTGTWTKVDEGPQSGIKDM KAIVFDVDEAMFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRYGIEGVRDLLE-DRKPILGTVPKPKVGRTAEEQAVLARRLWTAGDDFVKDDENLT SLFFNKFDDRARLVHQVQLEIEQKTGKKKLYLCNLSHSNDVMLKRADLIKETGGRCMMID VIATGFAAVHTMRLKNPELVIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKME-DYGESEEIMRVLTKFLSLGQKWFGTKRVWPVASGGLHPGVMDTVLAKFGRDCY IQMGGGVLGHPQGVERGVEAALEARRAIVQGLTVKEFVTKNPDASA-VSLWGTEP >rifcsphigho2_02_scaffold_287381_2Gottesmanbacteria QLNYIAPGWQPGDQDLIIQFKIELDKKM-FREAAASVAAESSTGTWTKVDEGPQSGIKDM KAIVFDVDEALFKVAYKTDLFETDNMSGFLAGPVGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPRYGIEGVRDLLE-DRKPILGTVPKPKVG--------------DGSYDFIKDDENLT SLFFNKFDDRARLVHEVQREIEQKSGRKKLYLCNLTHSNDIMVQRANLIKETGGRCMMID VIATGFAAVHTMRLKNPELIIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKME-DYGESEEIMRVLTKFLSLGQKWYGTKRVWPVASGGLHPGVMDTVIAKMGHDCY IQMGGGVLGHPQGVERGVEAALEARRAIVQGLTVKEFVAKNPDASA-VSLWGTEP >rifcsphigho2_02_scaffold_83541_5Gottesmanbacteria QLNYIAPGWQPADQDLVIQFKIELDKKM-FREAAASVAAESSTGTWTKVDEGPQSGIKEM KAIVFDVDEAMFKVAYKTDLFEADNMSGFLAGPVGNIGGMKMVKGLRLFDVRFPEKMVKA FPGPRYGIEGVRDLLE-ERKPILGTVPKPKVGRTAEEQAVLARRLWMAGDDFIKDDENLT SLFFNKFDDRARLVHQVQLEIEQKSGRKKLYLCNLTHSNDIMLQRANLIKETGGRCMMID VIATGFAAVHTMRLKNPELIIHAHRAMHAFITRESGFSVSMLTLAKIFRLLGVDSLHTGS PKAKME-DYGESEEIMRVLTKFHTLGQKWFGTKRVWPVASGGLHPGVLDTVIGKMGHDCY IQMGGGVLGHPQGVERGVEAALEARRAIVQGLTVNDYVAKNPDASA-VSLWGTEP >rifcsphigho2_01_scaffold_206378_7Gottesmanbacteria QLNYIAPGWKPVDQNVIIQFKMELDKKM-FRDGAASVAAESSTGTWTKVDEGPESGIKSY KAIVFDIDEH-FKVAYKCDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDIRFPEAMVKA FPGPRFGIEGVRDLLS-ERKPIMGTVPKPKVGRNAQEQAHLARRLWTAGDDFIKDDENLT SLFFNKFEDRARLVHEVQRELEAKGGKKKLYLCNLTHSNDVMLERANLIKETGGRCMMID VIATGVAAVHTMRLRNPELVIHAHRAMHAFITRESGFSISMLILAKIFRLLGVDSIHTGS PKAKMEGESEEIMRNLTTDEKFHSLGQKWFGTKRVWPVASGGLHPGVMDTVVAKLGD--- ---GGGSFRTS-------------------------------------------- >RBG_16_OP11_37_8_RBG_16_scaffold_34109_9Gottesmanbacteria QLNYIADGWQPEDQNVIIQFKIELDKKV-FREAAASVAAESSTGTWTKVDEGSDSGMKEF KAIVFDIDEVRFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDIRFPEKIVKA FPGPRYGIEGVRDLLE-ERKPILGTVPKPKVGRNAQEQAILARRLWSAGDDFIKDDENLT SLFFNKFEDRARLVHQVQKEMEEKNGKKKLYLCNLTHSNDIMMERANLIKETGGRCMMID VVTTGMAAVHTMRLKNPELVIHAHRAMHAFVTRESGFSISMLTLAKIYRLLGVDSIHTGS PKAKMEGESEEIMRNLTSDEKFHSLGQKWFGMKRVWPTASGGLHPGVMDTVVAKLGYDSY IQMGGGVLGHPEGVERGVEAALEARKAIAEGMSVNEYVEKNPSSSA-VKLWGTEP >RBG_16_OP11_38_7b_RBG_16_scaffold_13163_2Gottesmanbacteria QLNYLAPGWQPDDQNVIVQYKIELDKKV-FREAAASVAAESSTGTWTKVDEGPDSGLSEF KAIVFDIDEVMFKVAYKTDLFEADNMSGFLAGPAGNIGGMKMVKGLRMFDVRFPEKMVKA FPGPKYGITGVRDLLE-QRKPVLGTVPKPKVGRTAEEQAVLARRLWTAGDDFIKDDENLT SLFFNRFEDRARLVHQVQREIEEKGGKKKLYLCNLTHSNDTMIERANLIKETGGRCMMID VVTTGIAAVHTLRLKNPELIIHAHRAMHAFITRESGFSVSMLTLAKIYRLLGVDSIHTGS PKAKMEGESEEIMTSDEAMPKFHSLGQKWFGMKRVWPTASGGLHPGVMDTVVAKLGNDCY IQMGGGVLGHPQGIERGVEAALEARKAVFEGMGVREYAEKYPDAAV-VAYWGMEP >07M_4_2014_scaffold_14364_2Peregrinibacteria QKDYIN--LNLKGAYLLTVFHLVPAAGSDILGIASEVAAESSTGSN-----VQVVDTKGL NAQVYKVDKKLVWIAYPWRIFDRGTIQNILTFIAGNIFGVADVKALKLLDVWFPKEMLKH YDGPKVSMPELRKYLKVYNRPVLGTIIKPKIGLKPHEYAHKAYLFWSGGGDFVKNDEPQS DQDFCPFDETVDKIREAMDKAEHETGHTKVHSFNISAADDTMKKRAEYVIKKGSYCFLVD GITAGWTALQTSRRLWPNVFLHFHRAGHGAMTRPENIGYTAGVLSKFARLAGASGVHTGT AIGKMAGTVEEDIFAAHSAYKSKYFEQDWHGMKGTVSIASGGLNPTRLAPYIKVIGHDFI TTMGGGVHAHPMGTRGGATALVQACKAWQKKVSIEKYAKNHKEAIE-GLAWGDKK >gwf2_scaffold_2732_6Peregrinibacteria QKEYINLK-NPESGELLSVFHLIPEKGRDILDAASEVAAESSTGSN-----VKVTATKSL DAQVYKVDRK-VWIAFPLRIFDRGNVQNILTYIAGNIFGMAQVQALRLLDVWFPKAMLKK YDGPKTSIKELRKYLNVYDRPILGTIIKPKIGLKTHEFAEKAYQFWAGGGDFVKHDEPQA DQDFCPYMKTVDAIRKAMNRAEDETGRHKMHSFNISSSDDTMKRRAEYIIKKGSFCFLVD GITAGWMALQTARRLWPNVFLHFHRAGHGAMTRLENIGYSVLVMTKFGRLAGSSGIHTGT AIGKMAGTKEEDVTAAHMALKGLFFEQDWYGMKGTVSIASGGLNPTKLKAYIEAIGHDFI TTMGGGVHAHPGGTKMGARALIQACEAWQKKVSIQKYAKDHLELAQAIEFYTKKG >CG08_land_8_20_14_0.20_scaffold_13131_3Peregrinibacteria QKEYIN--LNLKSGNLLCVFHLVPEKGMDLLDAASEVAAESSTGSN-----VRVTDTKGL NAQIYKVDKKLVWIAFPWRIFDRGNVQNILTYIAGNVFGMSNIIALKLLDVWFPKEMLKN YDGPSTSLPEARKYLKVYNRPILGTIIKPKIGLKPHEFAEKAYQFWSGGGDFVKHDEPQA DQDFCPYMKTVDKIRDAMNRAEDETGHTKLHSFNISSADDTMKRRAEYIIKKGSFCFLVD GITAGWTAVQTARRLWPKVFIHFHRAMHGAFTRDENIGFTAGVLAKFARLAGASGVHTGT AIGKMAGSKEDDIFAAYAAYKSEFFEQDWHGMKGCVSIASGGLNPLLLAPYIKAIGHDFI TTMGGGVHAHPMGTRGGATALVQACEAWQKGVTIQKYAKDHKELALAIERFGK-- >gwa2_scaffold_228_63Peregrinibacteria QKEYINLK-NPYSGDLLTVYHLVPEKGTDLLDAASEVAAESSTGSN-----VKVTTTKGL DAQVYKVDKK-VWIAFPWRIFDRGNVQNILTYIAGNVFGMSNIIALKLLDVWFPKEMLKK YDGPSTSLPEARKYLKVYNRPILGTIIKPKIGLKPEEFADKAYQFWAGGGDFVKHDEPQA DQDFCPYFKTVDKIRDAMNRAEDETGHAKLHSFNISGADDTMKKRAEYIIKKGSFCFLVD GITAGWTSVQTARRLWPKVFLHFHRAMHGAFTRAENIGFTPGVLSKFARLAGASGVHTGT AIGKMAGTAEEDIFAAYAAYKGEFFEQDWHGMKGCVSIASGGLNPLLLAPYIKAIGHDFI TTMGGGVHAHPMGTKGGATALVQACEAWQKGVSVQKYAKDHEELALAIGRYKK-- >gwf2_scaffold_583_77Peregrinibacteria QKEYIN--LNLKGAYLLTVYHLVPEKGTDLLDAASEVAAESSTGSN-----VKVVDTKGL DAQVYKVDKKLVWIAFPWRIFDRGNVQNILTYIAGNVFGMSNIIALKLLDVWFPKEMLKK YDGPSTSLPEARKYLKVYNRPILGTIIKPKIGLKPEEFANKAYQFWAGGGDFVKHDEPQA DQDFCPYFKTVDKIRDAMNRAEDETGHAKLHSFNISGADDTMKRRAEYIIKKGSFCFLVD GITAGWTSVQTARRLWPKVFLHFHRAMHGAFTRAENIGFTPGVLSKFARLAGASGVHTGT AIGKMAGSAEEDIFAAYAAYKGEFFEQDWHGMKGCVSIASGGLNPMLLAPYIKAIGHDFI TTMGGGVHAHPMGTKGGATALVQACEAWQKGVSIQKYAKDHEELALAIGRYQK-- >rifixya3_full_scaffold_35984_2Peregrinibacteria QKEYLKLGFDPIAGNMLVVFHLVPGEGRDLLDAASEVAAESSTGSN----LTIGTATESM DALVYKIDEA-VWIAYPVDIFDRGNVQNILTYIVGNVFGMADVKAIKALDCWFPPEMLKN YDGPYTTIGDMKKYLGIDARPVLGTIIKPKIGLKTDEFADVCYRFWKGGGDFVKFDEPQA DQVFCPFEDAVKAIAKKMEQVRKETGKNKVMSFNISAADMTMQKRAEIVMKYGSYAFLVD GLTAGWTAVQTARRMWPDVFLHFHRAGHGAMTREENIGYTVEVLTKFGRLAGASGMHTGT AIGKMD-GDTDVRAAHLALDSGPFFEQDWGDMKPMCPIASGGLNPVLLKPFADVIGTDFI TTMGGGVHSHPSGTEKGAMALVQACEAWKQKIDMNEYAKTHTELGQAVEFYKEHV >gwa2_scaffold_61538_7Peregrinibacteria QKEYIN--LKLNGGKMLAVFHLVPKPGEDFLSCASEVASESSTGSN----LRVGTATKNL NAIVYKIDKK-VWIAFPWKIFDRGNVQNILTYVVGNVFGMGDLSALKALDCWFPKEMLEH YDGPATTIHDLKKYLGVKGRPVLGTIVKPKIGLKPKQFAD--------VC---------- ------------------SKVEKETGKKKVMSINISAADMTMQKRAEYVIKKGSYAFLVD GLTAGWTAVQTARRMWPGVFLHFHRAGHGAMTRPENIGYTVPFMTKMGRLAGASGMHTGT AIGKMEGAKEDVMAAHHALFEGDFFDQDWYGMKPMCPIASGGLNPILLKPFADVVGTDFI TTMGGGVHSHPGGTEKGAMALVQACDAWKKGISIKEYAKNHKELAQAIGFYKEKV >gwf2_scaffold_150_90Peregrinibacteria QKEYINLKLNPLKGGMLAVFHLVPKPGEDFLSCASEVASESSTGSN----LRVGTATKNL NAIVYDKKKNLVWIAFPWKIFDRGNVQNILTYVVGNVFGMGDLSALKALDCWFPKEMLEH YDGPATTIHDLKKYLGVKGRPVLGTIVKPKIGLKPKQFADVCYKFWKGGGDFVKFDEPQA DQEFCPFKEAIDEIVKAMAKVEKETGKKKVMSINISAADMTMQKRAEYVIKKGSYAFLVD GLTAGWTAVQTARRMWPGVFLHFHRAGHGAMTRPENIGYTVPFMTKMGRLAGASGMHTGT AIGKMEGAKEDVMAAHHALFEGDFFDQDWYGMKPMCPIASGGLNPILLKPFADVVGTDFI TTMGGGVHSHPGGTEKGAMALVQACDAWKKGISIKEYAKNHKELAQAIGFYKEKV >Crystal_Geyser_CG15_big_fil_post_rev_8_21_14_0.20_scaffold_4700_5Peregrinibacteria QKEYLN--FDLDDGKMLTVFRMEPSDGEDFVGEATEVAAESSTGSN----LRVSTATADL DAIVYKVDEELVWMACPWRIFDRGNVQNILTYVIGNVFGMSTLKGLKALDCWFPKEMLEH YDGPATTIQDLKAYLGIKDRPVLGTIVKPKIGLKPDEFAEVCYQFWSGGGDFVKFDEPQA DQDFCPMKEVVDAIRKAMDRAEEVTGDKKVMSFNISSADATMKERAEYIKSVGSYAFLVD GITAGWSAVQTARREWPEVFLHFHRAGHGALTRPENFGCSVPFMTKFGRLAGASAMHTGT AIGKMAGTIDEDITAAHQALEGDFFEQDWYGMKGMCPIASGGLNPVLLKPFADAVGTDFI TTMGGGVHSHPGGTAKGATALRQACDAWVAGIDLQEYAKDHEELAQAIEFYGGKF >scaffold_1669_9_REFreference QEEYLN--FDLDDGKMLTVFRMEPSDGEDFVGEATEVAAESSTGSN----LRVSTATADL DAIVYKVDEELVWMACPWRIFDRGNVQNILTYVIGNVFGMSTLKGLKALDCWFPKEMLEH YDGPATTIQDLKAYLGIKDRPVLGTIVKPKIGLKPDEFAEVCYQFWSGGGDFVKFDEPQA DQDFCPMKEVVDAIRKAMDRAEEVTGDKKVMSFNISSADATMKERAEYIKSVGSYAFLVD GITAGWSAVQTARREWPEVFLHFHRAGHGALTRPENFGCSVPFMTKFGRLAGASAMHTGT AIGKMAGTIDEDITAAHQALEGGFFEQDWYGMKGMCPIASGGLNPVLLKPFADAVGTDFI TTMGGGVHSHPGGTAKGATALRQACDAWVAGIDLQEYAKDHEELAQAIEFYGGKF >rifoxyb2_full_scaffold_53823_1Peregrinibacteria --------------------HLVPKEGH-SLDYASEVASESSTGSN----LRVGTATENL NAVVYDEEKNLVWIAYPWKIFDRGNVQNILTYVVGNVFGMSGLSALKALDCWFPKEMLET YDGPSTTIKDMKAYLG-INRPVLGTIVKPKIGLKPAEFAEVCFQFWSGGGDFVKFDEPQA DQEFCPFKETVDEIRKAMDRAEEATGGHKVLSFNISSADMIMKDRAEYVRSKGSYAFLVD GLTSGWTAVQTARRLWPEVFIHFHRAGHGAMTRPENIGYTVPFMTKMGRLAGASGMHTGT AIGKMEGNAQEDITAAHQALEGEFFEQDWYGMKGMCPIASGGLNPVLLKPFADVVGTDFI TTMGGGVHSHPGGTAKGATALVQACEAWVAGIDIREYAKNHEELAQAIEFYSGAT >UBA5223contig_6548_20Dojkabacteria ----MDMEXXXX-XXXXXXXXXXXX-XX---------------XXXXXX----XXXXXXX XXXXXXXXXX-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXX-XXXLGTIIKPKIGLTSSEYAELCYDFWLGGGDFVKNDEPQA DQDFCPYEKMVVDVRHAMDKVEQETGKTKVHSFNISSADDTMIKRADYIKSVGSYAFLVD GITSGWTAVQTIRRHYPDVFLHFHRAGHGAFTREEAFGFTVPVLTKFARLAGASGIHTGT AVGKMAGSSKEDVMAINHALHDYSDIGYWRIMKKTTPIISGGLNPTLLGKFLDIAGTDFI TTMGGGVHSHPMGTKAGAKAVLQSYEAWEKKIPLDEYAKDKEERVA-IEFYGKK- >MW-5_scaffold_55805_2Gottesmanbacteria QLDYVAKGWKPSAEKITVLFKVELAKGV-FRDAAASVAAESSTGTWTEVYSGRHSGMKKY RALVYEIDRKMFKVAYPLDLFEPGNISGFLAGPAGNIAGMKMLAGLRLMDMRFPRKFVSS FPGPRHGIQGLRKMLRIFAEPVMGTVPKPKIGRTHSAQAV-------------------- -----------LARVSELKKAEKLTGHRKLYLANVSHSSNEMLRRASHIKRNGGRVMMLD VVVTGFAALHTMRMRNPGLFIHAHRAMHGFITRESGFSVSMLTLAKIYRLLGVDSLHIGS PKAKME-DYGEAAVIAEAITRECTLGQKWYGVKKVWPVASGGLHPGVVGKVVAALGPDIF IQL---------------------------------------------------- >rifcsphigho2_01_scaffold_387592_2Gottesmanbacteria QLDYMAKGWKPSAEKITVLFKIELARGM-FRDAAASVAAESSTGTWTEVYSGRNSGMKKY RALVYDIDQRMFKVAYPLDLFEPGNISGFLAGPAGNIAGMKMLAGLRLMDMRFPRKFVKS FPGPRHGIQGLRNILQVFAEPVMGTVPKPKIGRTHSEQAALARELFTAGDDFIKDDENLT SLKFNDFYRRTSMVMGEIKKAEKLTGHRKLYLANISHSSDEMMRRAAHIKRNGGRVMMLD VVVTGFAALHTMRMRNPDLFIHAHRA---------------------------------- ------------------------------------------------------------ ------------------------------------------------------- >UBA4787contig_1245_80Patescibacteria QKAYLKLN-NPQNGQMLAVFHLVPESGD-LLSAATEVAAESSTGSN----IEVGTATDSL DALVYKVDEQLVWIAYPWRLFDRGNIQNILTFIVGNVFGMKEAKALKLLDVWFPAEMLEQ YDGPARNIDNLRKYLDVYDRPILGAIIKPKMGLTASQYAEVCYDFWSGGGDFVKNDEPQA DQDFCPYEHMVKYVKQAMDKAVQETGRKKIHSFNISAADDAMIKRADLIKDTGSYAFLVD GITAGWTAVQTIRRRYPDVFLHFHRAMHGAFTREENIGFTNLVLAKFARLAGASGIHVGT SIGKMAGSPDEDLVASHALLSLNLNDELWRGIQKSTPILSGGLNPTKLKAIIDIMGTDFI TTMGAGVHAHPDGTRAGAMAVLQACEAYQKKIDINEYAKDHEELARAIKFFGA-- >UBA5232contig_5010_25Dojkabacteria QAQYIQ--LGNQNGQLLAVYRIRGKEGVALDDIASEVAAESSTGSN----VKIGSSTNSM NAVIYRIDESLAWIAYPWRIFDSGNVQNIITFLAGNVLGMSSVKECKLLDVYFPPQMLVH YDGPSYTIDNMREYLQIPKRPIFGTIIKPKIGLTSSEYAELCYDFWSGGGDFIKNDEPQA DQDFAPFERMVQDVRYAMDRAEQETGRTKVHSFNISAVDDTMLKRANLIQSIGSYAFLVD GITAGWTAVQTIRRHYPNVFLHFHRAGHGAFTREENIGFTVPVLTKFARLAGASGIHTGT AVGKMAGPKEDIMAARQALNESYKDFGYWRITKKMCPIISGGLNPTLIGKFLDIIGTDFI TTMGGGVHSHPMGTKVGATAVLQAYEAWEKKIPLDEYAKEKEELRVAIEFFGKKK >gwd2_scaffold_5982_7Dojkabacteria QKEYIQ--GNPNNGQMLAVFRLQGEEGMTLVDTASEVAAESSTGSF----VKIGTATASL DALVYRIDEK-VWIAYPWRIFDRGNVQNIMTFIAGNVFGMASVKVCKILDVYFPPQMLVQ YDGPEYTIDDMRKYLNIQERPIFGSIIKPKIGLTSSEYAELCYDFWSGGGDFVKNDEPQA DQDFCPYDKMVQDVRHAMDRVEQETGKTKVHSFNISSSDDTMIKKADYIQSMGSYAFLVD GITAGWMAIQTIRRKYPNVFLHFHRAGHGAFTRDENIGYTVPVLTKFARLAGASGIHTGT AIGKMS-PKEDVMAARHALKEDYSNLGYWRITKKMCPIISGGLNPLLIGKFIDTVGTDFI TTMGAGVHSHPMGTKAGATAVLQAYEAWKQKISLEDYAKDKEELRAAIQFYDKHE >gwf2_scaffold_5170_19Dojkabacteria QKEYIQ--GNPNNGQMLAVFRLQGEEGMSLVDTASEVAAESSTGSF----VKIGTATASL DALVYRIDEK-VWIAYPWRIFDRGNVQNIMTFIAGNVFGMASVKVCKILDVYFPPQMLVQ YDGPEYTIDDMRKYLNIQERPIFGSIIKPKIGLTSSEYAELCYDFWSGGGDFVKNDEPQA DQDFCPYDKMVQDVRHAMDRVEQETGKTKVHSFNISSSDDTMIKKADYIQSVGSYAFLVD GITAGWMAVQTIRRKYPNVFLHFHRAGHGAFTRDENIGYTVPVLTKFARLAGASGIHTGT AIGKMS-PKEDVMAANHSLREDYSNLGYWRITKKMCPIISGGLNPLLVGEFIDTVVTDFI TTMGAGVHSHPMGTKAGATAVLQAYEAWKQKISLEEYAKDKEELRVAIDFYTKHE >bjp_ig2158_scaffold_0_9Dojkabacteria QKEYL---FLGDNGRLLAVFHIMPVEGLKLADVATEIAAESSTGSS----ISIGSSTESN NAKVYKIDEDLVWIAYPWDIFDSGNVQNILTFIAGNIFGVSEVKACKLLDVYFPPEMLVQ YDGPSYTLDDMREYLGVWDSPILGTIIKPKIGLSSHQYAELAYDFWAGGGMFVKNDEPQA NQSFCPYEKMVDAIRIAMDRAEDETGKPKVHSFNVSAADDTMIRRCDYIISKGSFAFLID GVTAGWMAVQTLRKRYPNVFIHFHRAGHGAYTRTENIGYSVAVLTKFARLAGASGIHTGT AVGKMAGDPETDITAAHLALYYPNDPSRF--ILKMAPIISGGLNPMLLKEFIEVMGTDYI TTMGGGVHSHPEGTRGGAAALVQAYEAWKAGVEIEEYAKQNRELSLAIEFYTQNR >rifcsplowo2_01_scaffold_240_75Woesearchaeota NDKQKNLKKNPT-GTLLTVSHLIPK-GLNILQAAAEVAAESSTGTNFRV----KTETAEL NALVYKIDLS-VWIAYPWRIFDRGNVQNILTFIVGNVLGMKEISALKMLDIWFPAAMLEQ YAGPSYTLDDMRKYLQVD-RPILGTIIKPKIGLTASEYAEVCYDFWSGGGDFVKNDEPQA NQDFCEYRLMVKFVKKAMNKAVKETGHKKIHSFNVSSPDDEMIRRCELIRKVGSYAFLID GITAGWMAVQTLRRRYPDVFIHFHRASHGAYTRPENIGYSVLVLSKFARLAGASGIHTGT AIGKMEGSVEEDVTAAKEILHVILEEDSWRGMKKCCPIISGGLNPVKLKPFIEVMGENFI TTMGAGCHAHPKGTTAGAKALVQSCEAYLKGIDIYEYAKTHKEKEA-IDFFTNK- >DolZOral124_scaffold_14485_4unknown QRAYVNLK-DPKNGDMLAVFHFVPGKKLNMLQASCEVAAESSTGTN----FLVKTETKAM NALVYDLEKELVWIAYPWRLFDRGNVQNILTYVAGNVFGMKEAKALKLLDVWFPRVMLDQ YDGPSYTLDDMRTYLGVEDRPILGTIVKPKMGLTSSEYAEVCYDFWSGGGDFVKNDEPQA NQDFCPYDKMVHYVKQAMDKAVKETGHKKVHSFNVSAPDDTMIERCEMIVNAGSYAFLID GTTAGWMAVQTLRRKYPGVFIHFHRAGHGAYTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMAGTPGEDITSAKNIFHVIEEDDDWRSVKRCTPIVSGGLNPTLLKPFIDLMGSDFI TTMGAGCNAHPDGTKKGATALVQACEAYKQGIDIHDYAKDHEELAKAIEFFEKKK >OR1_SR1_3_148SR1 VKDLLPLP-NPKNGQMLIAAHFQPGKSMNILQAACEVAAESSTGTN----FLVETETAEM NALVYDVEKELIWIAYPWRLFDRGNIQNILTYIAGNVFGMKEVKALKILDVWFPPAMLEQ YDGPSYTLADMRKYLNVYNRPILGTIIKPKMGLTSAEYAEVCYDFWVGG-DFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKE-TQKKVHSFNVSAADDTMIERCEMIRNAGSYAFLID GTTAGWMAVQTLRRKYPDVFIHFHRAGHGAFTRPENIGYSVLVLSKFARLAGASGIHTGT AVGKMAGDKAEDVTAAEGIREEVLTDDSWRAIKTCCPIISGGLNPTLLKPFIDLMGNDFI TTMGAGCHAHPKGTQSGAKALVQACEAYQKGVDIHEYAKNKPELAEAIEFFEKPS >CG_2015-03_scaffold_6_523unknown QKAYIN--FEIPNGEMLCAFHLRPG-DLNILQAACETAAESSTGTN-FLVNTETPFAREM NALVYKLDLE-VWIAYPWRLFDRGNVQNILTYIVGNVLGMKEVSAIKLLDVWFPPAMLEQ YDGPSYTLDDMRKYLDVYDRPILGTIIKPKMGLTSAEYAEACYDFWVGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKETGHKKVHSFNVSAADDTMIERCEMIVNAGFEAFLID GTTAGWMAVQTLRRKYPDVFLHFHRAGHGAYTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMAGTPTEDITAATGILHVILEDDSWRGLKKCTPIISGGLNPTLLKPFIDLMGGDFI TTMGAGCHAHPKGTQAGAKALVQACEAYQKGISIEEYSKDRPELAEAIEFFTKKK >RAAC1_SR1_1_647SR1 QAAYINLK-DPRNGDMLAAFHLIPGGKLNILQAACEAAAESSTGTN-FLVQTETAYSKML NALVYDLEKNLVWIAYPWRLFDRGNVQNILTYIAGNVFGMKEVAAFKLLDVRFPAAMLEQ YDGPSYTLDDMRAYLNVYDRPILGTIVKPKMGLTSAEYAEVAYDFRVGGGDFVKNDEPQA DQDFCPYDKMVAHIKEAMDKAVKETGKKKVHSFNVSASDDTMIKRCELIVNSGEKAFLID GTTAGWMAVQTLRRKYPDVFIHFHRAGHGAFTRPENIGYTVLVLSKFARLAGASGIHTGT AVGKMAGTPEEDVTAAHSIFHLTKEDDTWRNLKKCCPIISGGLNPTLLKPFIDVMGDDFI TTMGAGCHAHPRGTKAGATALVQSCEAYKQGINIAEYAKDHVELAEAIEFFTKKE >cg2_3.0_scaffold_6121_5SR1 QAAYVNLK-DPKNGEMLCCFHLVAGGKLNLLQAACEVSAESSTGTN----FAVKTETPEM NSLVYDLEKSLVWIAYPRRLRDRGNVQNIMTYIAGNIF-MKEVSALKLLDVRYPAAMLEQ YDGPSYTLDDMREYLQVFDRPILGTIVKPKM-LTSAEYAEVAYDFRVGGGDFVKNDEPQA DQDFCPYEKMVIHVKEAMAKAVKETGHKKVHSFNVSAADDTMIARCEMVRNSGMEAFLID GTTAGWMAVQTLRRKYPDVFIHFHRAGHGAFTRPENIGYNVLVLSKFARLAGASGIHTGT AVGKMAGTPEEDVTAAHNILGKVLQDDSRRGIKKCCPIIS-GLNPTLLKPFIDVMGNDFI TTMGAGCHAHPGGTQKGATALVQACEA--KAVDIHEYAKDHEELAQAIEFFEKKE >scaffold_6121_5_REFreference QAAYVNLK-DPKNGEMLCCFHLVAGGKLNLLQAACEVSAESSTGTN----FAVKTETPEM NSLVYDLEKSLVWIAYPRRLRDRGNVQNIMTYIAGNIFWMKEVSALKLLDVRYPAAMLEQ YDGPSYTLDDMREYLQVFDRPILGTIVKPKMWLTSAEYAEVAYDFRVGGGDFVKND---A DQDFCPYEKMVIHVKEAMAKAVKETGHKKVHSFNVSAADDTMIARCEMVRNSGMEAFLID GTTAGWMAVQTLRRKYPDVFIHFHRAGHGAFTRPENIGYNVLVLSKFARLAGASGIHTGT AVGKMAGTPEEDVTAAHNILGKVLQDDSRRGIKKCCPIISWGLNPTLLKPFIDVMGNDFI TTMGAGCHAHPGGTQKGATALVQAYKAWV---DIHEYAKDHEELAQAIEFFEKKE >CG02_land_8_20_14_3.00_150_scaffold_3515_6SR1 QAAYVNLK-DPQNGEMLCVFHLVPGGKLNMLQAACEVSAESSTGTN----FAVKTETPAM NSLVYDIEKNLVWIAYPRRLFDRGNVQNIFTYIAGNIF-MKEIQALKLLDVRFPSSMLEQ YDGPGYTLDDMRKYLNIYDRPIL-TIVKPKMGLTSAEYAEVAYDFRVGGGDFVKNDEPQA DQDFCPYDKMVKHIKEAMAKAVKETGHKKVHSFNVSAADDTMIARCEMIVNSGSYAFLID GTTAGRMAVQTLRRKYPDVFIHFHRAGHGAFTRPENIGYSVLILSKFARLAGASGIHTGT AVGKMAGSPEEDVTAAHNILAHHTVLQDDRGIKKCAPIIS-GLNPTLLKPFIDVMNVDFI TTMGAGCHAHPGGTQKGAAALVQACEA--KKMDIHEYAKDHEELAQAIEFFEANK >CG06_land_8_20_14_3.00_150_scaffold_21999_1unknown --------------------K--------------------------------------- ----------------------------------------------------M------- ----------------V------------------------------------------- -----------KHIKEAMAKAVKETGHKKVHSFNVSAADDTMIARCEMIVNSGSYAFLID GTTAGRMAVQTLRRKYPDVFIHFHRAGHGAFTRPENIGYSVLILSKFARLAGASGIHTGT AVGKMAGSPEEDVTAAHNILHTVLQDDSRRGIKKCAPIIS-GLNPTLLKPFIDVMNVDFI TTMGAGCHAHPGGTQKGAAALVQACEATA--VDIHEYAKDHEELAQAIEFFEANK >CG10_big_fil_rev_8_21_14_0.10_scaffold_3250_c_3Pacearchaeota ISKMIKLK-NPKNGEMLCVFHLIP--GVNMLQAACEMAAESSTGTN----FLVKTETKEM NARVYDLKNNLVWVAYPWRLFDRGNIQNIITYVVGNVLGMKEARALKLLDIWFPPTMLEQ YDGPSYTIDDMRKYLGVYDRPILGTIIKPKMGLTSSEYAEVCYDFWTGGGDFVKNDEPQA NQDFCPYDKMVKYVKEAMDKAVRETGKKKVHSFNVSASDDTMIERCEMIRDAGSYAFLID GITAGWMAVQTLRRKYPEVFIHFHRASHGAYTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMQGSPKEDVTAAKNILHVTLEDDNWRSMKKCCPIISGGLNPTLLKPFIEVMGNDFI TTMGAGVHAHPKGTKKGATALVQSCEAYKKGISIEEYAKTHKELEEAIKFFTKKK >CG_2015-01t_scaffold_70_76unknown NEKQRRLEANPK-GEMLAVYHLIPG-DLNMLQAAAEVAAESSTGTNFTV----KTETDEM NALTYKVDLK-VWMAFPWRLFDRGNIQNILTYIVGNVLGMKEINALKLLDLWFPPSMLEQ YDGPSYTLEDMKKYLGID-RPVLGTIIKPKMGLTSSEYAEVCYDFWKGGGDFVKNDEPQA NQDFCPYDKMVKYVKEAMDKAVKETGNKKVHSFNVSASDDTMIERCEMIRAAGSYAFLID GLTAGWMAVQTLRRKYPDVFIHFHRAMHGAFTRPENIGFSVLILSKFARLAGVSGIHTGT AVGKMKGNANDDIIPAHNILHVILSDDSWRGMKKCAPIISGGLNPTKLKPFIDLMKSDFI TTMGAGVHAHPDGTQAGAKALVQAMEAYNKNISIDVYAKTHPEKKA-IEFFTKP- >rifoxyc1_full_scaffold_4459_6Pacearchaeota LKKFYKLK-DPKNGEMLCVFRLFPDKSMNILQSAAEIAAESSTGTNV---KTETPFSREM NALVYQLDEK-VWIAYPWRLFDRGNVQNILTYVVGNVLGMKEVKGLKLLDTWFPPSMLEQ FDGPSYTLDDMRKYLNVYDRPILGTIIKPKMGLTSAEYAEVCYDFWSGGGDFVKNDEPQA DQDFCPYDKMVLHVKKAMDKAVKETGKKKVHSFNVSASDDTMIKRCELIRSAGSYAFLID GITAGWMAVQTLRRRYPDVFIHFHRAAHGAFTREENFGFSVLVLSKFARLAGASGIHTGT AIGKMAGSPEEDVTAANNILHLILEDDEWRGMKKCCPIISGGLNPTLLKPFIEVMGNDFI TTMGAGCHAHPNGTKTGAAALVQSCEAYKKGISIQEYAKTHKELADAIKFFTKNK >CG_4_10_14_0.8_um_filter_scaffold_123569_1Dojkabacteria ------------------------------------------------------------ ------------------------NVQNIMTFIAGNIFGMAELRECKLLDVWFPSQMLDQ YDGPSVTIDDMRDYLQNFDRPILGSIIKPKIGLTSTEYAEVCYDFWAGGGDFVKNDEPQA DQNFSPFQKMVVSVREAMDRAEELTGHTKVHSFNVSASDDTMIKRADYISSVGSYAFLVD GITSGWTAVQTIRRHYPNVFLHFHRAGHGAFTRAENIGYSVLVLSKFARLAGASGIHTGT AVGKMAGVRDDVTAAHGILRTFLERKMSWRVINKTAPIISGGLNPVLLPQFLDVIGTDFI TTMGGGVHSHPSGTKAGATAVVQ-----------------------AYEAW---- >CG_4_9_14_3_um_filter_150_scaffold_194919_1Dojkabacteria ------------------------------------------------------------ ------------------------NVQNIMTFIAGNIFGMAELRECKLLDVWFPSQMLDQ YDGPSVTIDDMRDYLQNFDRPILGSIIKPKIGLTSTEYAEVCYDFWAGGGDFVKNDEPQA DQNFSPFQKMVVSVREAMDRAEELTGHTKVHSFNVSASDDTMIKRADYISSVGSYAFLVD GITSGWTAVQTIRRHYPNVFLHFHRAGHGAFTRAENIGYSVLVLSKFARLAGASGIHTGT AVGKMAGVRDDVTAAHGILRTFLERKMSWRVINKTAPIISGGLNPVLLPQFLDVIGTDFI TTMGGGVHSHPR--------LRR--------------------------PW---- >CG_2015-10_scaffold_113557_2Dojkabacteria QEQYLH--LNDKNGEMLAVFKMVPYGDIEIEDVATEVAAESSTGSN----LKVGTMTGSV DARVYKIDKELVYIAYPWVMFDRGNVQNIMTFIAGNIFGMAELRECKLLDVWFPSQMLDQ YDGPSVTLDDMRGYLQNFDRPVLGSIIKPKIGLTSTEYAEVCYDFWVGGGDFVKNDEPQA DQNFSPFQKMVVSVREAMDRAEEATGHTKVHSFNVSASDDTMIKRADYISSIGSYAFLVD GITSGWTAVQTIRRHYPNVFLHFHRAGHGAFTRAENIGYSVLVLSKFARLAGASGIHTGT AVGKMAGVKDDITAAQGILRKGHYFDQCW----------------SEIPE-SDIKG-KII FSL--------------------TY-------FLTSFAL---------------- >CG_4_10_14_0.2_scaffold_11904_c_5Dojkabacteria QEQYLH--LNDKNGEMLAVFKMVPYGDIEIEDVATEVAAESSTGSN----LKVGTMTGSV DARVYDKEKNLVYIAYPWVMFDRGNVQNIMTFIAGNIFGMAELRECKLLDVWFPSQMLDQ YDGPSVTLDDMRGYLQNFDRPVLGSIIKPKIGLTSTEYAEVCYDFWVGGGDFVKNDEPQA DQNFSPFQKMVVSVREAMDRAEEATGHTKVHSFNVSASDDTMIKRADYISSIGSYAFLVD GITSGWTAVQTIRRHYPNVFLHFHRAGHGAFTRAENIGYSVLVLSKFARLAGASGIHTGT AVGKMAGVKDDITAAQGILRTFLERKMSWRVISKTAPIISGGLNPVLLPQFLDVIGTDFI TTMGGGVHSHPSGTKAGATAVVQSYEAWKAEVSLEEYAKEHKELDEAIKFFNKHG >rifoxyb1_full_scaffold_364_31Woesearchaeota QRAYVNLK-APKNGEMLVVFHLIPGEKLNILQAASEVAAESSTGTN----FKVSTETAEL NALVYDLKRNLIWVAYPWRIFDRGNVQNILTFVVGNVLGMKEISALKMLDVWFPTEMLEH YDGPSYTLDDMRKYLGVYGRPILGTIIKPKIGLTAAEYAEVCYDFWSGGGDFVKNDEPQA NQDFCDYEKMVKFVKKAMDKAVKETGHKKVHSFNVSASDDEMIRRCELIRKTGFEAYLID GITAGWMAVQTLRRKYPNVFLHFHRAAHGAFTRPENIGFSVLVLSKFAKLAGASGIHTGT AVGKMKGSPDEDITAAKNILHVILNDDSWRGMKKCCPIISGGLNPTKLKSFIDIMGEDFI TTMGAGVHAHPRGTYYGAKALIQACEAYNKGINIRAYAKSHKELAEAINFFENRK >UBA2022contig_15642_11Dojkabacteria QEQYLG--LGDTNGQMLTVFKLVPE---SIESAATELAAESSSGSN----LKVSTATNNL DAIVYDIDKE-VYIAYPWLMFDRGNVQNILTFIAGNIFGMSNLKECKLLDVWFPPQMLVQ YDGPSYTLDDMRKYLGVFDRPILGTIIKPKIGLTSTEYAELCYDFWVGGGDFVKNDEPQA DQQFAPYEKMVDSVRMAMDKAELETGKRKIHSFNVSAADDTMIRRADYVRRVGSYAFLID GITAGWTSVQTLRRHYPDVFIHFHRAGHGAFTRKENIGFSVPVLTKFARLAGASGIHTGT AVGKMAGDAAEDI----------------------------------------------- ------------GAAK--------------------------------------- >gwf2_scaffold_9_343Dojkabacteria QEQYLG--LGDQNGQMLTVYKVVPYGEETIESAATELAAESSSGSN----LKVSTATNNL DAIVYDIDHDLVYIAYPWLMFDRGNIQNILTFVAGNIYGMGNLKECKLLDVWFPPQMLVQ YDGPSYTLDDMRKYLNVYDTPILGTIIKPKIGLTSTEYAELCYDFWVGGGHFVKNDEPQA DQQFAPFEKMVDSVRIAMDKAERETGHTKVHSFNVSAADDTMLRRTEYIRNVGSYSYLID GITAGWMAVQTLRRHYPDVFIHFHRAGHGAFTRTENIGFSVPVLTKFARLAGTSGVHTGT AVGKMD-GREDIGAAKQALKEVTEEHTMWRVIKKTSPIVSGGLNPVLLPEFLDVFGSDFI VTMGGGIHSHPQGTGAGIKAVFQAYEAWMANIPLEEYSRDKEELRVAMDFYNRYG >rifoxyb1_full_scaffold_3633_1Woesearchaeota QKAYVNLK-TPANGKLLAVFHLVPGEKLNILQAAAEVAAESSTGTN----FKVNTETISM NALIYDLKKSLVWIAYPWRIFDRGNIQNILTYVVGNVLGMKEVSALKLLDLWFPQAMLKK YDGPSYTLDDMRKYLGVYDRPILGTIIKPKIGLNADEYGKVCYDFWVGGGDFVKNDEPQA DQDFCAFEKMVMNVKKAMDNAVKETKRKKVHSFNVSAADDTMIKRCEIIRKAGSYAFLID GITAGWMAVQTLRRKYPNTFIHFH------------------------------------ ------------------------------------------------------------ ------------------------------------------------------- >rifcsphigho2_02_scaffold_6968_22Woesearchaeota QKAYLNLK-DPKNGELLTVYHLVP----NMLQAAAEVAAESSTGTN----FKVNTETAEM NALVYKVDTK-VWIAYPWRIFDRGNVQNILTYVVGNVLGMKEVKALKLLDVWFPSAMLEQ YDGPSYTLDDMRKYLNVYGRPILGTIIKPKIGLTSAEYAEVCYDFWVGGGDFVKNDEPQA DQDFCPYDKMVKHVKEAMDKAVKKTGKKKIHSFNVSAADDTMIKRCEMIRKTGSYAFLID GITAGWMSVQTLRRKYPDVFIHFHRASHGAYTRRENFGFSVLVLSKFARLAGASGIHTGT AVGKMSGSPGEDIIAAHNILHVVLDDDSWRGVKKCCPIISGGLNPTLLKPFIAVMGNDFI TTMGAGCHAHPWGTQAGAKALVQSCEAYKKKIDIKKYARNHKELAEAIKFFSNRK >rifcsplowo2_02_scaffold_6002_1Woesearchaeota QKAYVNLK-DPKNGEFLAVYHLVPNKLN-ILQAAAEVAAESSTGTN----FKVNTETAEM NALVYDTKKNLVWIAYLWRIFDRGNVQNILTYVVGNVLGMKELKALKLLDVWFPPEMLKK YDGPSYTLDDMRKYLNVYGRPILGTIIKPKIGLNDREYADVCYNFWVGGGDFVKNDEPQA DQDFCPFEKMVKNVKIAMDKAVKETGKKKVHSFNVSAADDTMIKRCELICKAGFEAFLID GITAGWMAVQTLRRKYPGVFIHFHRASHAAYTRPENFGFSVLVLSKFARLAGASGIHTGT AVGKMSGPGEDIIAAHNILNKGYYFDQDWKSVKKCCPIISGGLNPTLLKPFITLMGNDFI TTMGAGCHAHPLGTKSGARALVQSCEAYKKKIDIKKYSKDHKELAEAIKFFSKAP >rifoxyb2_full_scaffold_114_158Pacearchaeota QLRYTNLK-NPKNGEMLCVFHFVPGKGLNTLQAAAEIAAESSTGTN----FTVQTETDEM DALVYQVDYK-VWIAYPWRLFDRGNVQNILTYVVGNVLGMKQISALKLLDIWFPSAMLEQ YDGPSYTLDDMKKYLGIKGRPVLGTIVKPKMGLTSAEYAEVCYDFWVGGGDFVKNDEPQA NQDFCPYDKMVLHVKEAMDKAVRETGKKKVHSFNVSAADDTMIERCEMIRGAGSYAFLID GLTAGWMAVQTLRRKYPDVFIHFHRAMHGAFTRPENIGFSVLVLSKFARLSGASGIHTGT AVGKMAGTPEEDIVAAKGILHVILSDDSWRGIKKCAPIISGGLNPTLLKAFIDVAGTDFI TTMGAGCHAHPDGTRAGATALVQALEAYQKKKSIEDYAKTHKEVRA-IEFFGKKK >CG09_land_8_20_14_0.10_scaffold_8073_10Pacearchaeota ------------------------------------------------------------ ----------------------------------------------------M------- -------------------------------GLTSAEYAEVCYDFWVGGGDFVKNDESQA DQDFCPYDKMVRHVKEAMDKAVKETGRKKVHSFNVSAADDTMIKRCELIRQAGFEAFLID GITAGWMAVQTLRRKYPDVFIHFHRAGHSGYTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGSPEEDITAADGILHLILEDDSWRGIKKCAPIISGGLNPTLLKKFIEVAGTDFI TTMGAGCHAHPDGTKAGATALVQSLEAYEKKIPIEKYAKTHKELARAIEFFSKKK >cg1_0.2_scaffold_360_c_42Pacearchaeota QVSYIN--LNFKNGEMLAVFHMVPEKGL-NIQAACEIAAESSTGTN----FLVQTETSKM NALVYQFDEKLVWIAYPWRLFDRGNIQNILTYVVGNVLGMKQVSALKLIDIWFPSAMLEQ YDGPSYTLDDMKKYLGIKNRPVLGTIIKPKMGLTSAEYAEVCYEFWTGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKVTGKKKVHSFNVSASDDTMIKRCEMIREAGFEAFLID GITAGWMAVQTLRRRYPDVFIHFHRAAHGAFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGSPEEDITAADGILHLILEDDSWRGIKKCAPIISGGLNPTLLKKFIEVAGTDFI TTMGAGCHAHPDGTKAGATALVQSLEAYEKKIPIEKYAKTHKELARAIEFFFKKK >scaffold_360_42_REFreference QVSYIN--LNFKNGEMLAVFHMVPEKGLNILQAACEIAAESSTGTN----FLVQTETSKM NALVYDEKRNLVWIAYPWRLFDRGNIQNILTYVVGNVLGMKQVSALKLIDIWFPSAMLEQ YDGPSYTLDDMKKYLGIKNRPVLGTIIKPKMGLTSAEYAEVCYEFWTGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKVTGKKKVHSFNVSASDDTMIKRCEMIREAGFEAFLID GITAGWMAVQTLRRRYPDVFIHFHRAAHGAFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGSPEEDITAADGILHLILEDDSWRGIKKCAPIISGGLNPTLLKKFIEVAGTDFI TTMGAGCHAHPDGTKAGATALVQSLEAYEKKIPIEKYAKTHKELARAIEFFFKKK >rifoxyc1_full_scaffold_2432_3Pacearchaeota QVSYIN--LNFKNGEMLAVFHMVPEKGLNILQAACEIAAESSTGTN----FLVQTETSKM NALVYDEKRNLVWIAYPWRLFDRGNIQNILTYVVGNVLGMKQVSALKLIDIWFPSAMLEQ YDGPSYTLDDMKKYLGIKNRPVLGTIIKPKMGLTSAEYAEVCYDFWTGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKVTGKKKVHSFNVSASDDTMIKRCEMIREAGFEAFLID GITAGWMAVQTLRRRYPDVFIHFHRAAHGAFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGSPEEDITAANGILHLILEDDSWRGIKKCAPIISGGLNPTLLKKFIEVAGTNFI TTMGAGCHAHPDGTKAGATALVQSLEAYEKKIPIEKYAKTHKELARAIEFFSKKK >YP_004385218_Methanosaeta_concilii_GP6_Putative_IIandII_REFreference -YMRMD--PDPRNGELLAVFHLIPSGELNIMQAAAEVAAESSTGTN----FAVKTETPVM NALVYDIEKNLVWIAYPWRLFDRGNVQNIMTYIAGNALGMKEIKALKLLDIWFPPSMLEQ YDGPSYTLDDMRTYLNVHDRPILGTIIKPKMGLTSSEYAEVCYDFWVGGGDFVKNDEPQA DQDFCPYDKMVKYVKMAMDKAVKETGKKKVHSFNVSSADDTMIERCEMIREAGFEAFLID GITAGWMAVQTLRRRYPDVFLHFHRAGHGGFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMAGSPEEDVTAARNILHVVLEDDSWRGLKKCCPIISGGLNPTLLEPFIDVMGGDFI TTMGAGCHAHPRGTRAGAMALVQACEAYKNKIDIADYAKDHRELAEAIEFFSKKK >rifoxyd1_full_scaffold_48_74Pacearchaeota PKSLIKLK-NPKNGEMLAVYHLKPGKGLNILQASCEVAAESSTGTN----FKVNTETPEM NALVYDIKKNLVWIAFPWRLFDRGNVQNILTYIVGNVLGMKEVKALKLLDIWFPSTMLEQ YDGPSYTLDDMRKYLNVYDRPILGTIIKPKMGLTSSEYAEVCYDFWTGGGDFVKNDEPQA NQDFCPYDKMVVYVKKAMDKAVKETGKKKIHSFNVSAADDTMISRCEMIRNAGFEAFLID GITAGWMAVQTLRRKYPGVFIHYHRASHGAYTREENIGFSVLVLSKFARLAGASGIHTGT AVGKMAGSPKEDVTAAHNILHVILDDDSWRGMKKCCPIISGGLNPTLLKKFIDVMQGDFI TTMGAGCHAHPKGTRAGATALIQSCEAYIKKIDIHEYAKTHVELKEAIDFFEKKK >RIFCSPLOWO2_02_FULL_OP11_38_8_rifcsplowo2_02_scaffold_81990_5Gottesmanbacteria QQAYVN--LNLQNGELLSVFRLLPGNSLNILQAASEVAAESSTGTN----FKVNTETPEM NAIVYDLEKNLVWIAYPWRLFDRGNVQNVLTYIVGNVLGIKEVSGLKLLDVWFPPSMLEQ FDGPSYTLDDMRKYLNVYDRPILGTIVKPKMGLTSAEYAEVAYDFWVGGGDFVKNDEPQA DQDFCPYDKMVKHVKEAMDKAVRETGKKKVHSFNVSASDDTMITRCEMIRNSGFEAFLID GITAGWMAVQTLRRKYPDVFIHFHRAGHGGFTRTENFGFTVLVLSKFARLAGVSGIHTGT AVGKMAGPQEDVVAANNILRHVILEDDSWRGMKKCCPIISGGLNPTLLNPFIDVMGNDFI TTMGAGCHAHPGGTKVGATALVQACEAYLKKIDIHEYAKDHKELTEAIDFFGKKQ >rifcsplowo2_02_scaffold_81990_4Gottesmanbacteria QQAYVN--LNLQNGELLSVFRLLPGNSLNILQAASEVAAESSTGTN----FKVNTETPEM NAIVYDLEKNLVWIAYPWRLFDRGNVQNVLTYIVGN---------VD------------- --GPSYTLDDMRKYLNVYDRPILGTIVKPKMGLTSAEYAEVAYDFWVGGGDFVKNDEPQA DQDFCPYDKMVKHVKEAMDKAVRETGKKKVHSFNVSASDDTMITRCEMIRNSGFEAFLID GITAGWMAVQTLRRKYPDVFIHFHRAGHGGFTRTENFGFTVLVLSKFARLAGVSGIHTGT AVGKMAGPQEDVVAANNILRHVILEDDSWRGMKKCCPIISGGLNPTLLNPFIDVMGNDFI TTMGAGCHAHPGGTKVGATALVQACEAYLKKIDIHEYAKDHKELTEAIDFFGKKQ >YP_004615354_Methanosalsum_zhilinae_DSM_4017_Putative_IIandII_REFreference ---YVN--EDPENGELLGVFHLIPGGKMNILQAASEVAAESSTGTN----FKVNTETATM NALVYDLDKNLVWIAYPWRLFDRGNVQNILTYIVGNILGMKEISALKLLDVWFPPAMLEQ YDGPSYTLDYMRQYLGVYDRPILGTIVKPKMGLTSAEYAEVCYDFWTGGGDFVKNDEPQA NQDFCPYEKMVMHVKEAMDKAVRETGQKKVHSFNVSAADDIMIQRCEMIRNAGFEAFLID GITAGWMAVQTLRRRYPDVFIHYHRAGHGGFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGTPEEDVVAAHGILHIILEDDSWRGVKKCCPIVSGGLNPVRLKPFIDVMGNDFI TTMGSGVHSHPEGTKAGAKALVQACDAYLKGIDIEEYAKDHNELAQSLEYFSKAK >YP_003542093_Methanohalophilus_mahii_DSM_5219_Putative_IIandII_REFreference ---YVN--PDPTNGELLTVFRLVPGGEMNMLQAAAEIAAESSTGTN----FRVNTETKVM NALVYDLERELVWIAYPWRLFDRGNVQNILTYIVGNVLGMKEISALKLLDVWFPPSMLEQ YDGPGFTVDDMRSYLGVYDRPILGTIVKPKMGLTSAEYAEVCYDFWAGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKETGKKKVHSFNVSAPDDTMIERCEMIRNAGFEAFLID GITAGWMAVQTIRRRYPDVFLHFHRAAHGAFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGTPEEDVVAAHGIQHVILEKDSWRGMKKCCPIVSGGLNPVRLKPFIDVMGNDFI TTMGSGVHAHPEGTRSGAKALIQACDAYLQGIDIKDYAKNHRELEQAIEFFPEK- >YP_566926_Methanococcoides_burtonii_DSM_6242_IIandII_REFreference ---YVD--PDPTNGELLAVFHMIPGGDLNVLQAAAEIAAESSTGTN----IKVSTETATM NARVYDLERELVWIAYPWRLFDRGNVQNILTYIIGNILGMKEIQALKLMDIWFPPSMLEQ YDGPSYTVDDMRKYLDVYDRPILGTIVKPKMGLTSAEYAEVCYDFWVGGGDFVKNDEPQA NQDFCPYEKMVAHVKEAMDKAVKETGQKKVHSFNVSAADDTMIERCEMITNAGFEAFLID GITAGWMAVQTLRRRYPDVFLHFHRAAHGAFTRQENIGFSVLVLSKFARLAGASGIHTGT AIGKMKGTPAEDVVAAHSIQHVILEDDSWRAMKKCCPIVSGGLNPVKLKPFIDVMENDFI TTMGSGVHSHPGGTQSGAKALVQACDAYLQGMDIEEYAKDHKELAEAIEFYLNR- >YP006921907_Methanolobus_psychrophilus_R1_REFreference ---YVD--ADPRNGELLGVFHLVPEGRLNMLQTAAEVAAESSTGTN----FKVNTETPVM NALVYDLERDLVWIAYPWRLFDRGNVQNILTYIVGNILGMKEVKALKLMDVWFPPSMLEQ YDGPSYTVDDMRKYLGVYNRPILGTIVKPKMGLTSAEYAEVCYDFWVGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKETGKKKVHSFNVSAADDTMIKRCEMIVNAGFEAFLID GITAGWMAVQTLRRRYPGVFIHFHRAGHGAFTRPENLGFSVLVLSKFARLAGASGIHTGT AVGKMKGTPQEDVVAANGILHAILEDDSWRAMKKCCPIVSGGLNPIRLKPFIDVMGNDFI TTMGSGVHAHPGGTQGGAKALVQACEAYLKNMDIEQYAKDHEELAQAIEYFSKAS >YP007313659_Methanomethylovorans_hollandica_DSM1597_REFreference ---YVN--PDPYNGELLSVFHLVPEGKLNILQAAAETAAESSTGTN----FKVNTETPTM NALVYDLEQNLVWIAYPWRLFDRGNIQNILTYIVGNILGIKEIKALKLLDVWFPSSMLEQ YDGPSYTLDDMRKYLGVYDRPILGTIVKPKMGLTSAEYAEVCYDFWAGGGDFVKNDEPQA NQDFCPYDKMVKHVKEAMDKAVKETGRNKVHSFNVSAADDTMIERCEMIVNAGFEAFLID GITAGWMAVQTLRRRYPGVFIHFHRAAHGAFTRPENIGFSVLVLSKFARLAGVSGIHTGT AVGKMKGTPEEDVVAAHGILHVILEEDSWRGVKKCCPIVSGGLNPIRLKPFIDVMGNDFI TTMGSGVHAHPGGTKDGAKALVQACDAYLNKMDIAEYAREHSELAQAIDHFTKAQ >cg1_0.2_scaffold_107_c_58Micrarchaeota QKAYVDLK-NPRNGEMLTAFRLVPGRGLNFLQAAAEIAAESSTGTN----FKVQTETAEM NALVYDSKRNLVWIAYPWRLFDRGNVQNILTYLVGNVLGMKEVDGLKLLDVWFPPAMLEQ YDGPSYTLDDMRKYLKVYNRPILGTIVKPKMGLTSAEYAEVCYDFWVGGGDFVKNDEPQA DQDFCPYYKMVKHVKQAMDKAIKKTGKKKVHSFNVSAADDTMITRCEMIRNAGSYAFLID GIMAGWTAVQTLRRRYPDVFIHFHRAGHGAFTRPENFGFSVLVLSKFARLAGASGIHTGT AVGKMKGTPEEDVVAAHNILHVILEDDSWRGMKKCCPIVSGGLNPLLLKPFIDMMGNDFI TTMGAGCHAHPEGTKAGATALVQACDAQQKRIPLEKYAKTHKELATAIKFFGKKK >07M_4_2014_scaffold_3235_2unknown QKAYVNLK-NPKNGELLSVFHLIPGGDLNILQAASEVAAESSTGTNV---NTETPFSREM NALVYDLKRNLVWIAYPWRLFDRGNVQNILTYIVGNVLGMKEIKALKLLDVWFPPAMLEQ YDGPSYTIDDMRKYLNVYGRPILGTIIKPKMGLTSAEYAEVCYDFWVGGGDFVKNDEPQA NQDFCPYDKMVKYVKKAMDKAVKETGKKKVHSFNVSASDNTMIERCEMIRAAGSYAFLID GITAGWMAVQTLRRKYPGVFIHFHRAGRGGYTRPENIAFTVLVLSKFARLAGASGIHTGT AVGKMAGSPEEDVTAARNILHLILDNDSWRGMKKCCPIISGGLNPTLLKPFINIMGNDFI TTMGAGVHAHPQGTKAGAKALVQACEAHLKKIDINEYSKTHKELAEAIKFFPKRR >bjp_ig2599_sub10_scaffold_1302_18Micrarchaeota ------------------------------------------------------------ ---------------------QT----------------------LR------------- ------------------RR---------------------------------------- -------------------------------------------------------Y---- ----------------PDVFIHFHRAGHGGFTRPENIGFSVLVLSKFARLAGASGIHTGT AVGKMKGPEEDVVAAHNILHRVILEDDSWRGMKKCCPIISGGLNPTLLKPFIDIMGNDFI TTMGAGCHAHPKGTGAGAKALVQACEAYIKKTDIKEYAKNHRELAEAIKFFSKES >Ig5771_scaffold_652_7Pacearchaeota QQAYVN--PDPRNGELLTVFYLKSGGKLNILQAATEVAAESSTGTT----FKVNTETSEM NALVYDLKRNLVWIAYPWRLFDRGNVQNILTYIVGNVLGMKEVSALKLLDVWFPPAMLEQ YDGPSYTLEDMRKYLNVYKRPILGTIIKPKMGLTSAEYAEAAYDFWVGGGDFVKNDEPQA DQDFCPYDKMVKHVKEAMDKAVKETKKKKVHSFNVSAADDTMIKRCEMIRNAGFEAFLID GITAGWMAVQTLRRRYPDVFIHFHRAGHGGFTRPENLGFSVLVLSKFARLAGASGIHTGT AVGKMKGPEEDVVAAHNILHQAILEDDSWRGMKKCCPIISGGLDPTLLKPFIDIMGNDFI TTMGAGCHAHPKGTTAGAKALVQACEAYEKGIDITKYAKKHKELAEAIKFFSKGK >rifoxyc1_full_scaffold_336_27Pacearchaeota QQAYVN--PDPRNGELLAVFHLVSGGKLNILQAATEIAAESSTGTT----FKVNTETPEM NALVYDLKRNLVWVAYPWRLFDRGNVQNILTYIVGNVLGMKEVSALKLLDVWFPPAMLEQ YDGPSYTLDDMRKYLNVYKRPILGTIIKPKMGLTSAEYAEACYDFWVGGGDFVKNDEPQA DQDFCPYDKMVKHVKEAMNKAVKETKKKKIHSFNVSAADDTMIKRCEMIRNAGFEAFLID GITAGWMAVQTLRRRYPDVFIHFHRAGHGAFTRLENLGFSVLVLSKFARLAGASGIHTGT AVGKMQGPEEDVVAAHNILHQAILEDDSWRGMKKCCPIVSGGLDPTLLKPFMDIMGNDFI TTMGAGCHAHPGGTTAGAKALVQACEAYEKGIDIKKYAKKHKELAEAIKFFSKGK >rifcsphigho2_02_scaffold_33630_15Pacearchaeota ----MTLSRKPSKNDVVVDYKIRP--KISLEEASRLITGEF-PADF----VPRPRTPEKL KPVISSIDKK-VRIAYPNELFENENVSQILSLIAGRVFGLRALKSIRIEDVSFPLKLLKS FRGPKFGIDGIRKISKIKDRPLLATVLKPRFGLSSEIYTRETYASWLGGCDIVKDDESLA TTKTNNFDERLKKILIAKDKAEKETMQKKLYIPNISAETVEMTRRAEKVKALGGEFVMVD ISSVGWSAFHTIRKMDPDIIMHGQMSNISLFTRNPEHGVSARVLAKIARLVGADMVHINS AFGSVQESPAEIRDAERELEKGHILEQKWFGIKKSMPYVSGGIQPVHIPKLVKLLGKNIV ISLSSGIHRHPDGTIAGAIASRQAIEAVMKGIDLHKYAKTMPELKRVV------- >CG23_combo_of_CG06-09_8_20_14_all_150_scaffold_8060_3Parcubacteria KMKYLH--LGEELRSVVTEFYVES--DEPMIKVAEALASESSVGTWTDLSTMKEDSCSKL SARIFYVDDKIIKIAYPIALFEDGSIPQLLSDVTGNVFGLKEIKNLRVRDIVFPEEYVRT YNGPAFGIEGIRSVMQIYDRPLLGTIIKPKVGLNSDEHALVAYEAWVGGVDVVKDDENLT NQDFNPFEERVVKTMAMLQKAQDATGKRKMYVPNISASVTDMERRAEFVKAQGGTTAMMD ILTVGFAGVQHIRNQNYGLILHGHRAMHGAFTHGTRHGVAMLPIAKAARLSGIDQLHTGT IIGKMEGGREDVSEINESMR------RPWFGQKPTMPIASGGVHPGLLPKMIELIGNDVI INLGGGIHGHPDGTTKGAIAAVQAIESVKAGISLREYAKDHEELAKAIEKWGVYG >UBA1441contig_19953_11Patescibacteria --MEYSLDYQPDEKEVLVTYRIRPQ-GQSLEAAAEKVAAESSVGTWTPTVTLSDQIFHDL AARIYKI-NS-VQIAYPLGLFELGSLPQLGSGIMGNVFSMKDLEALRVEDIEFPETYINS FSGPAFGTAGIRQLLNIYSRPIIGSIIKPKIGLSPQDNAQLAYVVWSHGVDLVKDDENLT DLPINPFRERVVEVMKKQRLAEQESGQSKVYAFNVTAPVTQMLERGRWVAEAGGRCLMVD LVTTGWSGVQELRRAFPEMIIHGHRAGHSAFTRVPDHGLSMLVVAKMARLAGVDQLHTGT VVGKMEGSESEVVQINQFLS------SAWGQQKTVLPIASGGLHPGMVPDLVKVLGLDLI INFGGGIHGHPDGSAAGAEAAQIAVAAVSQGQSIVEAAKTSPALARALDYWSD-- >RBG_13_scaffold_14099_11Dojkabacteria SKNYSY--LHVGGRDVITTFYLEVKEGEDFMIIAEGVAAESSIGTWTDVAGLNQEVFDRL SAKVFEVSGV-VKIAYPLALFETDNIPQIIASIAGNVFGLKEIENLRVKDVSFPDEFLKG LRGPAFGINGIRDLFQVYDRPLIGTIIKPKLGLSTRQHAQAAYEAWIGGVDIVKDDENLS DQDFNPFYERVTQTLEARRRAEDETGEKKLYCPNISARVGEMYARAKYVREMGGRAVMVD IITVGFSGVQFIRDQSMNLIIHGHRAMHGAFTNNDKHGISMLTIAKFARAAGIDQLHTGT VIGKMEGTRSEVANIHDFLK------GHWGSLKPTMPIASGGLHPGHVHQLYEIMGKDVI INFGGGIHGHPDGTKAGAAAARQAVEAAVGGVSSREYAKNHGELSRALDKWGTGT >gwc2_scaffold_9210_4Gottesmanbacteria ETPYLTLREKVNRNHLIGKFKLTTE-NLNFKEIAGGIAAESSVGTWTKVTTQFSKVWERL HARVLEA-DK-LTIAYPLDLFEGGNIPQLIASVAGNIYGLKEVKYLRLLDLEMPEIYVKS FPGPGVGLVGIRKITQVENGPLVGSIIKPKEGLDFRQHSDVAMEVYAGGLNFVKDDENLT SQIFNPFDNRVRMITKKGKNKFND--KNRIYAFNISADTSTMRKRAEFVKSYEGNCIMVD ILTVGFSALQYIRNKNYGLVIHGHRAMHAALTRSPDEGLTMLVIAKLARLAGIDSLHTGT VVGKMEGGKDEVVEINDFLR------SEWYGMKKVLPVASGGLYPNLIPSLINVLGKDIL MNFGGGIHGHPGGSKAGAIAVVQGVEAVQNHMTLEKYGETHPELKTAIEHFA--- >mol-32-1605-030446_1_GottesmanbacteriaGottesmanbacteria PVYVLAEIYNPGPKDVVTEFILKPGEKVLFLQTAGGLGAESSVGTWT---EVSTQFQKIL HARILEADGTIIKIAYPLELFEPGNIAQLLSSVAGNIYGLKEVAHLKLLDLELPEIYVKS FPGPGFGIEGIRRISGVYGRPLLGTIIKPKLGLDYQEHAKAVIAAYEGGLDFVKDDENLT SQTFNPFEKRVTEVMKYLIDSNQL---SKICAFNVTAETGIMLKRAEFIKQKGGNCAMID ILTSGFAAVQSLRKQNLGLILHGHRAMHAALTRNPIEGISMLVLAKLARLAGIDSLHTGT VVGKMEGGQKEVVEINNFLR------SEWYGIKPTLPVASGGLYPNLIPDLMRILGKDML FNFGGGVHGHPDGTKAGVKAITQAVQAVMMGKSLQEYGKDHQELKVAIEHWNNI- >img_2698597071_3Microgenomates --PYRSLGEKVDKKNLIVHFALTVR-GHDFAQIAGGVAAESSVGTWTKVSTEIKEMFDSL HARVLEA-DK-LKIAYPLALFEEGNLSQLLSSVAGNVFGLKEVKNLKLLDLDFPEKYVRG FPGPAFGIEGVRRLTGIKSRPLIGCIIKPKLGLEPKIYGQISREVFEGGVDFVKDDENLT SQIFNLFPERVKVIT--KILKEEK--KNKIYALNVTAEVEVMLERAKMVETQGGNCVMID FLTAGFAGLQALRKKNFNLFLHGHRAMHAAFDRVPAHGISMLVLAKLARLAGLDSLHTGT VVGKMEGEKEEINKINQFLL------GDWHGLKKVLPVASGGLHPALVPSLMKILGKDVL FNFGGGIHGHPQGSAAGAAAVLEAVEATEKGLSLLEAAKTHEALAIALEHWEGR- >rifoxyb1_full_scaffold_633_14Gottesmanbacteria ANTYKN--YLSIDKTIIATFSLSVQGEP-FDKTAGGVAAESSVGTW----TDILEEKIKL HAKVITLDEPKITIAYPLELFEPGNIPQLLSSIAGNVFGLKEIVGLRLEDLEFPETYVRS FPGPALGIPGIRALTGVKDGPLLGSIIKPKLGLSAKNQAEAAVAVWNEGIHLVKDDENLT SVAGDNFYDRV---NEVVKRMEKT----KIYAFNITASYEEMTKRATHVRDANANCLMID VLTAGFSAVQGIRNRNFGLMIHGHRAMHAAFTRSKQFGISMLVIARLTRLAGVDSFHTGT VIGKMEGEKEEVLAINNFLR------SEWYGLKTVLPTASGGLHPGHISDLVKILGSDML LNLGGGIHGHPDGSAA-------------GGTPGPELAR-------ALEKWKV-- >rifoxyc1_full_scaffold_26262_2Gottesmanbacteria --------SIGEKRAIIATFSLRVS-GEPFDKTAGGVAAESSVGTWTDVGLGEKTIWDKL HAKVISLDEP-ITIAYPLELFEPGNIPQLLSSVTGNVFGLKEITGLRLEDLEFPQMYIKS FPGPALGIPGIRALTGVKDGPLLGSIIKPKLGLSAKNQAEAAVAVWNEGIHLVKDDENLT SVAGDNFYDRV---NEVVKRMEKT----KIYAFNITASYEEMTKRATHVRDANANCLMID VLTAGFSAVQGIRNRNFGLMIHGHRAMHAAFTRSKQFGISMLVIARLTRLAGVDSFHTGT VIGKMEGEKEEVLAINNFLR------SEWYGLKTVLPTASGGLHPGHISDLVKILGSDML LNFGGGIHGHPDGSAAGARAVVQ----ALAGTPGPELAR-------ALEKWKV-- >rifcsphigho2_01_scaffold_5271_63Gottesmanbacteria MNTYKNEKIDPA---ILVTFSLQA----PFDRTAGGVAAESSVGTWTDIRLEEKAIWDRL HAKVIKMSGR-IVVAYPLDLFEAGNLAQFLSSIAGNIFGLKEITKLRLENIEFPEIYVKA FPGPALGIDGVRRLTGVMDRPLLGSIIKPKLGLASAKHAEAAMAVWDNGLDLVKDDENST SMASDNFYTRVEVTRRMKEKGYLEINKAKIHAFNISAHYEEMMKRANYVRDSGANCLMID ILTAGFSATQGIRNRNFGLMIHGHRAMHAAFTRSREYGISMLVIAKLARLAGVDSLHTGT VVGKMEGGEEEVIAINNFLR------SEWYGLKTVLPTASGGLHPGMIPDLYRILGKDML LNFGGGIQLLP-----SPLLLQCLV--FP-------------------------- >rifcsplowo2_01_scaffold_7182_31Gottesmanbacteria MNTYKNEKIDPA--KILVTFSLQAT-GEPFDRTAGGVAAESSVGTWTDIRLEEKAIWDRL HAKVIKMDDT-IVVAYPLDLFEAGNLAQFLSSIAGNIFGLKEITKLRLENIEFPEIYVKA FPGPALGIDGVRRLTGVMDRPLLGSIIKPKLGLASAKHAEAAMAVWDNGLDLVKDDENST SMASDNFYTRVEVTRRMKEKGYLEINKAKIHAFNISAHYEEMMKRANYVRDSGANCLMID ILTAGFSATQGIRNRNFGLMIHGHRAMHAAFTRSREYGISMLVIAKLARLAGVDSLHTGT VVGKMEGGEEEVIAINNFLR------SEWYGLKTVLPTASGGLHPGMIPDLYRILGKDML LNFGGGIHGHPDGSAAGARAIAQSLTAAMSGIPLTEAKS-SPEA---LEKWGS-- >UBA2393contig_52368_1Gottesmanbacteria ANRYIH--LALNKTKIIATFTLHVE-GEPFDKTAGGVAAESSVGTWTDIGLEEKAIWDKL HAKVIDMDES-IVVAYPLDLFEAGSIPQLLSSITGNVFGLKEIVGLRLEDLDFPEVFVKS FPGPALGIEGIRALTGVKDAPLTGSIIKPKLGLTSVRHAEACMEVWDGGVNLVKDDENLA NMAFDNFYTRVEVTRRMKEKNYLEIGKAKIHAFNITASYEEMMKRANYVRDSGANCLMID VLTAGFAATQGIRNRNYGLMIHGHRAMHAAFTRSKVFGISMLVIAKLARLAGIDSFHTGT VVGKMEGGKEEVVTINNFLR------SEWYGLKTTLPTASGGLHPGHIPDLVSILGKDLL MNFGGGIHGHPDGTPAGARAIVQSLNATLAGIPLSEAKD-SPELQRALEKWQ--- >UBA927contig_123_73Gottesmanbacteria ANRYIH--YLALDKTIIATFTLKVEGEP-FDKTAGGVAAESSVGTWTDIGLEEKAIWDKL HAKVIDMTGR-ITVAYPLDLFEAGSIPQLLSSITGNVFGLKEIVGLRLEDLEFPEVFVKS FPGPALGIEGIRALTGVKDSPLTGSIIKPKLGLTSVRHAEACMEVWDGGVNLVKDDENLA NMTFDNFYTRVEVTRRMKEKNYLEIGKAKVHAFNITASYEEMMKRANYVRDSGANCLMID VLTAGFAATQGIRNRNYGLMIHGHRAMHAAFTRSRQYGISMLVIAKLARLAGIDSFHTGT VVGKMEGGKEEVVTINNFLR------SEWYGLKTTLPTASGGLHPGHIPDLVSILGKDLL MNFGGGIHGHPDGTPAGARAIVQSLKATLAGTPLADAKD-SPELQRALEKWMEK- >CG11_big_fil_rev_8_21_14_0.20_scaffold_1948_c_6Gottesmanbacteria PNTYNT--LSIGKNKIFASFHLGSE-GQPFDRTAGGVAAESSVGTWTDVFLQTKTSWDNL HAKVI--EKD-LKIAYPLDLFEAGNIPQLLSSIAGNIFGLLEISALRLEDIDFPEEYIKA FPGPALGITGVRNMAGVKEMPLLGSIIKPKLGLSSKDHIDAAMAVYDGGVNLVKDDENLT SQIYNNFYDRVEGTTRMKEKGYLGKGNEKIYAYNITASYEEMQKRAEFVVENGGNCHMID ILTAGFAAVCGMRNKNYGKMIHAHRAMHAAFTRSKQYGISMLVLAKLSRLAGVDSLHTGT VVGKMEGGKEEVTKIDNFLR------AEWYGLKTVLPVASGGLHPGHLPDVVKILGNDLL INFGGGIHGHPEGTYKGAIAAVQAREAVSQGIRLDEYAKNHFELAKALEKWGNGN >OIO15311.1_hypothetical_protein_AUJ73_00860_Candidatus_Gottesmanbacteria_bacterium_CG1_02_37_22_REFreference PNTYNT--YLSIDKNIFASFHLGSEGQP-FDRTAGGVAAESSVGTWTDVFLQTKTSWDNL HAKVIDESSGFLKIAYPLDLFEAGNIPQLLSSIAGNIFGLLEISALRLEDIDFPEEYIKA FPGPALGITGVRNMAGV-KEPLLGSIIKPKLGLSSKDHIDAAMAVYDGGVNLVKDDENLT SQIYNNFYDRVEGTTRMKEKGYLGKGNEKIYAYNITASYEEMQKRAEFVVENGGNCHMID ILTAGFAAVCGMRNKNYGKMIHAHRAMHAAFTRSKQYGISMLVLAKLSRLAGVDSLHTGT VVGKMEGGKEEVTKIDNFLR------AEWYGLKTVLPVASGGLHPGHLPDVVKILGNDLL INFGGGIHGHPEGTYKGAIAAVQAREAVSQGIRLDEYAKNHFELAKALEKWGNGN >gwa2_scaffold_47144_2Gottesmanbacteria TNPYKS--YLSIDKSILATFHLQSDAGG-FDQTAGGVAAESSVGTWT---DVSLQEKINL RGRVVELDEKIIRIAYPLELFEPGNIPQFLSSIAGNVFGLSEVSALRLMDIELPEVYVRS FPGPFLGIDGIRKITGISDRPFLGSIIKPKLGLSAQEHIAASMEVYDSGIDLVKDDENLT SQTFNNFYTRVEGTKRMKEAGYLKEGRQKIYAYNITASYEEMLKRAEYVVDRGGNCLMMD VLTAGFASVCGIRNRNFGKIIHAHRAMHAAFTRSKQYGISMLVIAKLARLAGVDSLHT-- ---------------------------------------------GLLPAVFT---RDTC LT---------W------------------------------------------- >RBG_13_scaffold_5391_5Gottesmanbacteria ANPYKS--LSLGLNDIIAKF-VLGSEGQPFDKTAGGVAAESSVGTWTDVSLQEKTLWDKL HGKVIEMSGK-LTIAYPLELFEPGNIPQLLSSIAGNIFGLAEISVLRLIDLEFPESYVRS FPGPFHGIEGIRKFTGIIRRPLIGSIIKPKLGLSAEDHMEAALEVYDGGGDLVKDDENLT SQVFNNFYKRVEGINRMKEKGYLNKGMEKIYCYNITASFDEMMKRADYVSEKGGNCHMID ILTAGFSAVQGIRNRNFGLMIHAHRAMHAAFTRSSQYGISMMVIAKLARLAGVDSLHTGT VVGKMQGGEAEVT----------------------------------------------- ------------GINEG-------------------------------------- >AR1-0.1_scaffold_1790_3Gottesmanbacteria YKSYLA--IGEKDKNILAAFHLQSE-GEPFNQTAGGVAAESSVGTWTDISLQAKTMWDRL HAKVAEMDEG-LIIAYPLELFEYGNLPQLLSSIAGNIFGLSEISALRLNDIEFPEVYVKA FPGPYHGIEGIRKFTKIEKRPLLGSIIKPKLGLSSKNHMEATIEVYDGGVDLVKDDENLT SQSFNLFYERVEITGKMKEKGYLGKDGEKIYAFNITASYEEMLKRAEYVDEKGGNCQMID ILTAGFSGVCAIRNRNFGKIIHAHRAMHAAFTRSRQYGISMLVIAKLARLAGVDSLHTGT IVGKMEGGKKEVLAINKFLR------SEWYGLKPVLPVASGGLHPGNIPSLVELLGPDML FNFGGGIHGHPDGSLAGSRACCQALEATEEGVALEEYAKTHQELAKALEKWG--- >UBA1558contig_16814_5Patescibacteria DFNYYD--KKIDREKIVATFFKAP--KLTFTQAVHAIAGESSIGTWTKISGLSAAQYKKL APLIWEAKHM-VSIAYPLALFELKNIPQLLSSIGGNVFSMKVVTQLRLLDIEFPKVYIDS FLGPQHGIVGIRKILKIKKRPLIGSIVKPKVGLTAKEHAQAAYTLWKNGVDIVKDDENLT DLSFNRFKVRVKEVIKMRKQVEKETGQLKLAVINITAPYNLALERAKYIKKMGGRCMMVD IVAMGWSAIQSLRQENLGLIIHGHRAGHSMFTRNEEHGMTMYVVAKLARLAGIDQLHTGT VVGKMDGTKTEVQKIDDFLKHFDNLQSDWSEIKPVMPIASGGLHPGLLPDVVKYIGQDVI INFGGGIFGHPDGVAAGAKAASQAAQAVMNQQTLEEFSQTHDELATALKFWVNN- >CG_2015-11_scaffold_25419_1Kuenenbacteria FGAYLSERWAELRREDKETEYLTVI---SFAARCAGVVKDDGTSPWELVESEQGEIISQT PIANVYLSDV-YSVRYELAAREVQRVARLIRNILGGIFNNRSVSGITLFDITFPEEYVNS FKGPQFGIAGIRKIMNIYNRPLIGCIAKPKLGLTATEHAKLAYRVFIGGVDILKDDENLT DLTFDHFEKRAHFTIHLAKKAERETGQKKMCVLNVTAPPQEMLKRTRLVKKLGGKAVMVD IVSVGLDNVQMLRSENLGLIIHGHRAGHSMFTKDPKHGMSMLVLAKLSRLAGIDQLHTGT VVGKMEGEQKEILAINRFLK------SKWYKIKPVLPIASGGLHPGMMPKLYRIFGQDVI FNFGGGIHGHPAGSLAGAQAARQALEAVLRKKTLKAYSKNHLELKQAMEHWE--- >cgr2_combo_scaffold_6030_4Kuenenbacteria EKFYRKPKYRPKRNELVASFYLEAI---DFQEAAGGVAAESSIGTWTEVGTLKKEIFRKL AAHAFNL-NA-FQVAYPLALMEKGNIPQFFSGIAGNIFSMKIVKNLRLFDITFPEEYVNS FKGPQFGIAGIRKIMNIYNRPLIGCIAKPKLGLTATEHAKLAYRVFIGGVDILKDDENLT DLTFDHFEKRAHFTIHLAKKAERETGQKKMCVLNVTAPPQEMLKRTRLVKKLGGKAVMVD IVSVGLDNVQMLRSENLGLIIHGHRAGHSMFTKDPKHGMSMLVLAKLSRLAGIDQLHTGT VVGKMEGEQKEILAINRFLK------SKWYKIKPVLPIASGGLHPGMMPKLYRIFGQDVI FNFGGGIHGHPAGSLAGAQAARQALEAVLRKKTLKAYSKNHLELKQAMEHWE--- >CG_2015-13_scaffold_124523_1Kuenenbacteria -RKFLK--YRPKRNELVASFYLEA--KIDFQEAAGGVAAESSIGTWTEVGTLKKEIFRKL AAHAFNAKKKTFQVAYPLALMEKGNIPQFFSGIAGNIFSMKIVKNLRLFDITFPEEYVNS FKGPQFGIAGIRKIMNIYNRPLIGCIAKPKLGLTATEHAKLAYRVFIGGVDILKDDENLT DLTFDHFEKRAHFTIHLAKKAERETGQKKMCVLNVTAPPPEMMKRARLVKRLGGKAVMVD IVSVGLDNVQMLRSAKLGLIIHGHRAGHSMFTKNPKHGMSMLVLAKLSRLAGVDQLHTGT VIGKMEGEQKEVLEIDQFLR------NKWYQMKSTMPIASGGLHPGMMPKLYKIFGKDII LNFGGGIHGHPAGSLAGAKAARQALEAVLKRKSLKNYAKNHLELFQAMEHWK--- >CG_4_9_14_0.2_um_filter_scaffold_23606_2Kuenenbacteria ------------------------------------------------------------ ----------------------------------MN---------IR------------- ------------------NRPLLGCIAKPKLGLSAKEHAQLAYQVFSGGVDVLKDDENLT DLTSDHFEKRAHFTIHLAKKAERETGQKKMCVLNVTAPPPEMMKRARLVKRLGGKAVMVD IVSVGLDNVQMLRSAKLGLIIHGHRAGHSMFTKNPKHGMSMLVLAKLSRLAGVDQLHTGT VIGKMEGEQKEVLEIDQFLR------NKWYQMKSTMPIASGGLHPGMMPKLYKIFGKDII LNFGGGIHGHPAGSLAGAKAARQALEAVLKRKSLKNYAKNHLELFQAMEHWK--- >CG_4_10_14_0.8_um_filter_scaffold_1033_22Parcubacteria GKFYLKPGYRPKNSELVASFYLEA--KIDFKEAAGGVASESSIGTWTEVGTLKEENLRKL AAHAFNVKKKTFQVAYPLALMEKGSIPQFFSGIAGNVFSMKIIRNLRLFDITFPKEYVNS FQGPAFGIEGIRKIMNIRNRPLLGCIAKPKLGLSAKEHAQLAYQVFSGGVDVLKDDENLT DLTSDHFEKRARATIHLAKKAEKETGEKKMCVLNVTAPPPEMMKRARLVKRLGGKAVMVD IVSVGLDNVQMLRSAKLGLIIHGHRAGHSMFTKNPKHGMSMLVLAKLSRLAGVDQLHTGT VIGKMEGEQKEVLEIDQFLR------NKWYQMKSTMPIASGGLHPGMMPKLYKIFGKDII LNFGGGIHGHPAGSLAGAKAARQALEAVLKRKSLKNYAKNHLELFQAMEHWK--- >bjp_ig2599_sub10_scaffold_364_11Parcubacteria NLSYLH--LGEKAKNIIATFRVES--NLPLEEAAGEIAAESSIGTW----TKVTLSEEKL GAKVFKIVGELVKIAYPLELFEMGNIPQLLSSVAGNIFSMKKIIDLRLEDLEFPEEYVKS FSGPAFGIDGVREITGIKDRPLIGSIIKPKMGLSAKEHAKVAYECFSGGVDLVKDDENLT DQKFNRFNSRVKETLELARKAEKETRDKKICAFNTTAETNEMIKRAKFIKKSGGSCAMAD IITLGFGAVQSLRNENLGLIIHGHRAMHSAFTRNPRHGISMLVIAKLARLAGVDQLHTGT VVGKMEGGEEDVLAINKFLL------SDWHGLKPVLPIASGGLHPALVPELVRILGKDVI INFGGGIHGHPEGTLAGARAARQAVEAAMKNIPLREYAKNHRDLAAALEKWE--- >07M_4_2014_scaffold_5663_3Parcubacteria NLSYLH--LGEKAKNIIATFRVES--NLPLEEAAGEIAAESSIGTWTKVG---TLSEEKL GAKVFKIVGELVKIAYPLELFEMGNIPQLLSSVAGNIFSMKKIIDLRLEDLEFPEEYVKS FSGPAFGIDGVREITGIKDRPLIGSIIKPKMGLSAKEHAKVAYECFSGGVDLVKDDENLT DQKFNRFNSRVKETLELARKAEKETRDKKICAFNTTAETNEMIKRAKFIKKSGGSCAMAD IITLGFGAVQSLRNENLGLIIHGHRAMHSAFTRNPRHGISMLVIAKLARLAGVDQLHTGT VVGKMEGGEEDVLAINKFLL------SDWHGLKPVLPIASGGLHPALVPELVRILIT--- ----------------GRKEIKERI------------------------------ >Ig5770_scaffold_3414_3Parcubacteria NLSYLH--LGEKAKNIIATFRVES--NLPLEEAAGEIAAESSIGTWTKVGTLSEETFKKL GAVFWKIVGELVKIAYPLELFEMGNIPQLLSSVAGNIFSMKKIIDLRLEDLEFPEEYVKS FSGPAFGIDGVREITGIKDRPLIGSIIKPKMGLSAKEHAKVAYECFSGGVDLVKDDENLT DQKFNRFNSRVKETLELARKAEKETRDKKICAFNTTAETNEMIKRAKFIKKSGGSCAMAD IITLGFGAVQSLRNENLGLIIHGHRAMHSAFTRNPRHGISMLVIAKLARLAGVDQLHTGT VVGKMEGGEEDVLAINKFLL------SDWHGLKPVLPIASGGLHPALVPELVRILGKDVI INFGGGIHGHPEGTLAGARAARQAVEAAMKNIPLRIRTGEIKE-RI--------- >bjp_ig2599_scaffold_5183_11Parcubacteria -MSNLSLGEKIDAKNIIATFRVESL---PLEEAAGEIAAESSIGTWTKVGTLSEETFKKL GAKVFWASGA-VKIAYPLELFEMGNIPQLLSSVAGNIFSMKKIIDLRLEDLEFPEEYVKS FSGPAFGIDGVREITGIKDRPLIGSIIKPKMGLSAKEHAKVAYECFSGGVDLVKDDENLT DQKFNRFNSRVKETLELARKAEKETRDKKICAFNTTAETNEMIKRAKFIKKSGGSCAMAD IITLGFGAVQSLRNENLGLIIHGHRAMHSAFTRNPRHGISMLVIAKLARLAGVDQLHTGT VVGKMEGGEEDVLAINKFLL------SDWHGLKPVLPIASGGLHPALVPELVRILGKDVI INFGGGIHGHPEGTLXXXLEQEERKLKKEFDAYYN-IAAGEEKIAEPEERLSEE- >bjp_ig3402_scaffold_5048_11Parcubacteria -MSNLSLGEKIDAKNIIATFRVESL---PLEEAAGEIAAESSIGTWTKVGTLSEETFKKL GAKVFWASGA-VKIAYPLELFEMGNIPQLLSSVAGNIFSMKKIIDLRLEDLEFPEEYVKS FSGPAFGIDGVREITGIKDRPLIGSIIKPKMGLSAKEHAKVAYECFSGGVDLVKDDENLT DQKFNRFNSRVKETLELARKAEKETRDKKICAFNTTAETNEMIKRAKFIKKSGGSCAMAD IITLGFGAVQSLRNENLGLIIHGHRAMHSAFTRNPRHGISMLVIAKLARLAGVDQLHTGT VVGKMEGGEEDVLAINKFLL------SDWHGLKPVLPIASGGLHPALVPELVRILGXX-- -------XXXXXXXXXXXLEQEERKLKKEFDAYYN-IAAGEEKIAEPEERLSEE- >RIFOXYC2_FULL_OD1_48_21_rifoxyc2_full_scaffold_977_4Falkowbacteria HVGFIDLTYHPNHKDVIAHFYVEPEKN--FEEAVNAIAGESSIGSWT---DLSTLKVSKL KARVFQMNEKLVKIAYPLDLFELGSIPQFLSSVGGNIYSMKAIKNLRLVDVEFPEKYIKS FPGPAFGIAGIRKILKNENRLILGSIIKPKVGLNASEQAELSYQIWKNGIDLVKDDENLT SMTFNNFYDRVEKVLKAKKKVELETGKKKFYSCNITAEPTEMLKRAKFIKKLGGEVAMID IISVGLAGVQFIRNQNLGLILHGHRAGHSTFSRSTKHGISMLVVAKLARLAGIDQLHTGT VVGKMDGTAEEVSTINDLLRHFHKLKENWSSLKPVMPIASGGLHPGLLPGLVKILGTDLI ANFGAGVHGHRNGSAAGAMACAQAAEAVVRGITLEEQAKIHPELRIALEQWM--- >rifoxyd2_full_scaffold_3430_5Falkowbacteria -MPHVGLTYHPNRAKLIAHFYVEP----NFEEAVNAIAGESSIGSWTDLSTLKVSVANKL KARVFQMNEK-VKIAYPLDLFELGSIPQFLSSVGGNIYSMKAIKNLRLVDVEFPEKYIKS FPGPAFGIAGIRKILKNENRLILGSIIKPKVGLSASEQADLSYEIWKNGIDLVKDDENLT SMTFNNFYDRVEKVLKAKKKVELETGKKKFYSANITAEPTEMLKRAKFVKKLGGEVAMID IISVGLAGVQFIRNQNLGLILHGHRAGHSTFSRSTKHGISMLVVAKLARLAGIDQLHTGT VVGKMDGTAEEVSTINDLLRHFHKLKENWSSLKPVMPIASGGLHPGLLPGLVKILGSPVF MAIETDPRPEPW------HALKR------------------------RRRW---- >gwa2_scaffold_5659_22Parcubacteria -MSHIGTMYQPNKKELVATFYLEPA---AFNEAAEAVAGESSIGSWTDLGTLNMKVANRL KATVFYL-NQ-IKIAYPLALFELASIPQFLSSVGGNIFSMKAVKNLRLMDVEFPEKYINS FQGPAFGIEGIRKYLKNQDRIILGSIIKPKVGLNPDEQAALSYDLWTNGIDLVKDDENLT SMVFNNFYERAKKVLAALKKAERKTDAKKIYSCNVTAPSDEMLKRALYVKEQGGKCVMVD IVSMGLDNVQYLRKQKLGLIIHGHRAGHSTFSRNPKHGISMLVLAKLSRLAGIDQLHTGT VVGKMEGSADEVLSINELLRYH-VLRENWAKVKPCLPIASGGMHPGLLPKLAEILGSNLV ANFGAGIHGHRDGSIAGAKACYQASVAVAKGITLKDYAVNHSELKVALEQWGGV- >gwc2_scaffold_11863_3Parcubacteria GFVQVG--YKPKDQEIIATFFVEPSAGVAFNEVAEAVAGESSIGSWTDLGTLKKSVAEKL KARVFDIKTKIIKIAYPLDLFELGSIPQFLSSIGGNIFSMKAVKHLRLLDVEFPKKYIGS FQGPAFGLEGVRKITGIQKRLILGSIIKPKVGLNPDEQAELSYTVWKNGIDLIKDDENLT SMSFNNFEERVRKVLKALKKVEQETGLVKLYSANITAPPDIMLERAKFVKSQGGKVVMID IVSTGLDSVQYLRKQNLGLIIHGHRAGHSTFTRYEKHGITMLVLAKLSRLAGVDQLHTGT VVGKMDGTREEVLSINQLLQDYHALREDWSKIKPTMPIASGGMHPGLLPKLVESLGSDLI ANFGGGIHGHVDGSGAGARGCFQAAEAVEKGIPLSVYAKDHDELRKALIQWSNVK >gwc2_scaffold_633_23Parcubacteria GFVQIG--YKPKDREIITTFYVEPSAGVAFNEVADAVAGESSIGSWTDLGTLKKSVAEKI QARVFDVKKKIMKIAYPLALFELGSIPQFLSSVGGNIFSMKAVKHLRLLDVEFPKKYIDS FQGPAFGLAGIRKITGIKKRLILGSIIKPKVGLNPDEQAELSYAVWRNGIDLIKDDENLT NMSFNNFEERVRKVLKALKKVEQETGLVKLYSANITAPPDIMLERAKFVKSQGGKVVMVD IVSTGLDSVQYLRKQNLGLIIHGHRAGHSTFTRYEKHGITMLVLAKLARLAGVDQLHTGT VVGKMDGTREEVLTINQLLQDYHALREDWSSIKPTMPIASGGMHPGLLPKLVESLGSDLI ANFGGGIHGHVDGSAAGARGCFQAAEAVEKGIPLAVYAKDHDELRKALIQWNNVK >CG08_land_8_20_14_0.20_scaffold_141789_1Kuenenbacteria RREYLKLNYKPDKNEIITTFYLES--NLSLKEAAVKVAEESSIGTWT---RLTTLSSRSL AARIFYLNKKLVKIAYPLALFEKGSIPQLLSSIGGNIFSMKDIKNLRLLDVEFPKKYITS FPGPRWGIEGVRKILGIYDRPLIGCIIKPKLGLTSVQNAWLAYGVFKNGVDLIKDDENLT DLSFNKFSQRVKKVLSLKRKVENQTGQKKIYVFNVTGPADIMLRRAKLVKKMGARCVMVD IIACGWSGVQYLRNQNLGLIIHGHRAGHAAFTRNKKHGISM------------------- ------------------------------------------------------------ ------------------------------------------------------- >UBA6264contig_60200_23Patescibacteria MKNYQP--YQPDKNEIIATFYGES--KLPLKELAVKVAEESSIGTWTKLSTLSDKVFNKL AAKVFQPRQGIFKIAYPLDLFEPRSIPQLLSSLAGNIFSMKIIKNLRLNDLEFPEKYINQ YPGPQWGIEGVRKITKIYNRPLIGCIIKPKMGLTAKENACLAADIFKNGVDLIKDDENLT DMTFNRFSERVKRVLSLKQKIEKTAKTKKIYVFNVTAPTEIMLSRAALVKKLGGKCVMVD IVTTGWAGVQSLRNQNLGLILHGHRAGHSAFTRNKKHGISMLVLANLARLAGIDQLHTGT VVGKMEGEEKEVLKINQLLKEIDKLRENWGGLKPVLPIASGGLHPGLTAKLIEILGDNLV INFGGGIHGHPQGSLAGARAAQQSIEASLKGISLKVYAQKHKELKQALDYWK--- >CG08_land_8_20_14_0.20_scaffold_311_25Parcubacteria MNNYLRLNYKPDKNDIIAVFYGESKLSLS--ELAIKVAEESSIGTWTKITTLSPKTFEKL AGRIFYVNSKVFKIAYPLALFEPKNIPQLLSSIGGNVFSMKAVENLRLEDIEFPQKYING FPGPKWGIAGIRKMLGVYNRPLVGCIMKPKVGLNSEQNAKLAAEVFTNGVDIIKDDENLT DLSFNRFEDRVIKVLKFKKQVEKQTKEKKVYVFNVTAPVEIMLKRAEFVKKMGGHCVMID IVSAGWAAVQSLRNRNLGLIIHGHRAGHSTFTKNKKHGISMLVLANLARLAGIDQLHTGT VVGKMEGSEKEVLGINELLKGFNKLRENWSGLKSVFPIASGGLHPGLTSKLVKILGQDVI INFGGGIHGHPQGSAAGARAARQAVEATIKKIPLKVYAQKHQELKQALDYWKV-- >ncbi_NATD01000013.1_9Parcubacteria KNSYLNLNYKPDNNEIVATFYGESKASLK--EMAQQVAGESSIGTWT---KLTTLSDKKL AARIFYLNEKIFKIAYPLSLFEPKSIPQLLSSIAGNIFSMKTVKNLRLNDIEFPKKYIRQ FDGPRWGIAGIRKTLNIRNRPIIGCIMKPKLGLTSTQNANLAAEIFKNGVDLIKDDENLT SLTFNKFEDRVRKVLVLKKQVEKETGRKKIYVFNVTAPADVMVKRAKLVKKLGGRCIMVD VISAGWAAVQELRSQNLDLIIHGHRAGHSTFTRNKKHGISMLVLANLCRLAGIDQLHTGT VVGKMEGGAKEVCQINDLLK------EDWGSIKPVLPIASGGLHPGLIPKLVKILGRDLV VNFGGGLHGHPQGVAAGAKAAVQSVEATIQGVPLNVYARNHKELSVALEHWKS-- >OQX71398.1_type_III_ribulose-bisphosphate_carboxylase_Candidatus_Parcubacteria_bacterium_4484_255_REFreference KNSYLNLNYKPDNNEIVATFYGES--KASLKEMAQQVAGESSIGTWTKLTTLSDKVFEKL AARIFYLNEKIFKIAYPLSLFEPKSIPQLLSSIAGNIFSMKTVKNLRLNDIEFPKKYIRQ FDGPRWGIAGIRKTLNIRNRPIIGCIMKPKLGLTSTQNANLAAEIFKNGVDLIKDDENLT SLTFNKFEDRVRKVLVLKKQVEKETGRKKIYVFNVTAPADVMVKRAKLVKKLGGRCIMVD VISAGWAAVQELRSQNLDLIIHGHRAGHSTFTRNKKHGISMLVLANLCRLAGIDQLHTGT VVGKMEGGAKEVCQINDLLK------EDWGSIKPVLPIASGGLHPGLIPKLVKILGRDLV VNFGGGLHGHPQGVAAGAKAAVQSVEATIQGVPLNVYARNHKELSVALEHWKS-- >ncbi_NATD01000022.1_13Parcubacteria KNSYLRLSYKPDNNEIIATFYGESKTSLK--ELSQQVAGESSIGTWT---KLTTLSDKKL AARIFYLNKKIFKIAYPLSLFEPKSIPQLLSSIAGNIFSMKAVKNLRLKDLEFPKKYINQ FDGPRWGITGIRKILNIHNRPIIGCIMKPKLGLTSTQNANLAAKIFKNGVDLIKDDENLT NLTFNKFEDRVKKVLFLKKQVEKETGQKKIYVFNVTAPVSTMIKRAKLVKKSGGKCVMVD IISAGWAAVQELRNQNLDLIIHGHRAGHSTFTRNKKHGISMLVLANLSRLAGVDQFHTGT VVGKMEGSAEEVCRINDLLKEFDKLREDWGSIKPVLPIASGGLHPGLIPKLVEILGKNLV INFGGGLHGHPQGSIAGAKAAVQSVEAAIQGIPLKVYARNHKELKVALEYWKS-- >OQX71191.1_type_III_ribulose-bisphosphate_carboxylase_Candidatus_Parcubacteria_bacterium_4484_255_REFreference KNSYLRLSYKPDNNEIIATFYGES--KTSLKELSQQVAGESSIGTWTKLTTLSDKVFEKL AARIFYLNKKIFKIAYPLSLFEPKSIPQLLSSIAGNIFSMKAVKNLRLKDLEFPKKYINQ FDGPRWGITGIRKILNIHNRPIIGCIMKPKLGLTSTQNANLAAKIFKNGVDLIKDDENLT NLTFNKFEDRVKKVLFLKKQVEKETGQKKIYVFNVTAPVSTMIKRAKLVKKSGGKCVMVD IISAGWAAVQELRNQNLDLIIHGHRAGHSTFTRNKKHGISMLVLANLSRLAGVDQFHTGT VVGKMEGSAEEVCRINDLLKEFDKLREDWGSIKPVLPIASGGLHPGLIPKLVEILGKNLV INFGGGLHGHPQGSIAGAKAAVQSVEAAIQGIPLKVYARNHKELKVALEYWKS-- >img_2264877716_46unknown KNSYLKLKHKPNDNEIIATFYGES--KTSLKELAQQVAGESSIGTWTKLTTLSNKIFEKL AARIFYLNKKIFKIAYPLSLFEPKSIPQLLSSIGGNIFSMKAVKNLRLKDIEFPKKYINQ FDGPRWGIEGIRNILKIHNRPIIGCIMKPKLGLTSTQNANLAAKIFQNGIDLIKDDENLT SLTFNKFEDRVKKVLFLKKKVEKETGQKKVYVFNVTAPVSIMIKRAKLVKKLGGRCIMID IISAGWAAVQELRNQNLDLIIHGHRAGHSTFTRNKRHGISMLVLANLSRLAGVDQFHTGT VVGKMEGTAEEVCKINDLLKEFDKLRENWGLIKPVLPIASGGLHPGLVSKLIEILGENLV INFGGGLHGHPQGSIAGARAAVQSVEATIKGIPLKVYARNHKELKIALEHWKS-- >RifSed_csp2_16ft_2_scaffold_214444_1Kuenenbacteria ------------------------------------------------------------ ----------------------------------------------------FPKKYLTA FPGPAFGIEGIRKYLNIYNRPIIGSIIKPKVGLSAKEQAELSYTIWKNGIDLIKDDENLT DMVFNRFEDRVKEVMGMKKLAEKETGQKKVYVFNVTGPADVMLKRAKLVKKYGGKVVMID LVATGLDNILYLRKQNLGLIIHGHRAGHSMFTKNPRHGMSMLLLAKLARLCGIDELHTGT VVGKMEGAAEEVTKINLKMRGINKETSDWSKIKPVLPVASGGLHPGFVPRLIKILGENIV VNFGGGLHGHPDGSAAGARACKQAVEATMKKIPLAKFAKSHNELKRALEYWK--- >rifcsphigho2_01_scaffold_155436_2Kuenenbacteria KPPYLKLGYQPNLKDLIATYYIESKKSLA--EAAENIAAESSIGTWTEIGTMKEKILKQL GPKIFEQKKKIAHIAYPLSLFELGNIAQLLSALAGNVFSLKIIDNLRLSDIQFPKKYLTA FPGPAFGIEGIRKYLNIYNRPIIGSIIKPKVGLSAKEQTELSYTIWKNGMDLIKDDENLT DMVFNRFEDRVKEVMGMKKLAEKETGQKKVYVFNVTGPADVMLKRAKLVKKYGGKVVMID LVATGLDNILYLRKQNLGLIIHGHRAGHSMFTKNPRHGMSMLLLAKLARLCGTDELHTGT VVGKMEGVAEEVTKINLKMRGINKETSDWSKIKPVLPVASGGLHPGFVPRLIKILGENIV VNFGGGLHGHPDGSAAGARACKQAVEATMKKIPLAKFAKSHNELKRALEYWK--- >cg1_0.2_scaffold_10592_2Kuenenbacteria RSPYLKLNTKPALQGLLTVYYLES--DLPLSEAAVKIAEESSIGTWTDLATLNKKTQAKL GPKIYETKNRIVKIAYPLDLFELGNIAQLLSALAGNIFSMKVIKNLRLLDIQFPQKYLNS FLGPAFGIQGVRNYLRIYNRPLIGSIIKPKVGLSAKEQAKLSYNIWLNGIDLVKDDENLT DMRFNRFSDRVREVLKLKKIAEAKTKSKKVYVFNITGPADLMLNRAKVVKKLGGRAVMID IVSCGLDNAQFLRKQNLGLILHGHRAGHSMFTKNPRHGMAMLVLAKLARLAGIDQLHTGT VVGKMEGTAEEVLSIDEIIKGVNNEPTNWARIKPILPVASGGLHPGLVHKLIKILGNDIV INFGGGLHGHPDGSIAGARACCQALEAVEKGVGLEIYSQEHIELKAALDYWR--- >CG03_land_8_20_14_0.80_scaffold_45401_2Parcubacteria ----------------LTVYYLES----PLSEAAVKIAEESSIGTWTDLATLNKKTQAKL GPKIYCLETK-VKIAYPLDLFELGNIAQLLSALAGNIFSMKVIKNLRLLDIQFPQKYLNS FLGPAFGIQGVRNYLRIYNRPLIGSIIKPKVGLSAKEQAKLSYNIWLNGIDLVKDDENLT DMRFNRFSDRVREVLKLKKIAEAKTKSKKVYVFNITGPADLMLNRAKVVKKLGGRAVMID IVSCGLDNAQFLRKQNLGLILHGHRAGHSMFTKNPRHGMAMLGLLELMASQGVSEQDM-- ----------------------------W---KTALVVNSYWFSDAYITIAIYMKNGEWK DVNSEEILGINYSSTQGYKSVSS----QI-------------------------- >CG10_big_fil_rev_8_21_14_0.10_scaffold_1094_21Kuenenbacteria IYLQLK--YKPRKDKIIVTFYLESLEGLK--KAAVEVAAESSIGTWTELSTMTDKIQRKL SAKIFYLDQKIIKIAYPLELFELGNIPQLLSSVAGNIFSMKVIKNLRLLDIEFPDKYINS FTGPKYGLPGIRKIFKIKKRPLVGAIMKPKVGLDSKTNAKYAYEMWLNGVDLIKDDENLT DQNFNKFKNRVTEVIKMQKKAEKETGNKKVYVFNVTAPSDEMLKRAKFAKQKGGDCVMVD IIATGLDNVQFLRDQNLSLIIHGHRAGHSLFTRDKRHGMTMLVMAKLARLAGIDELHTGT VVGKMEGDAVEVKAINSFLK------KSFGKLKPVMPIASGGLHPALVPKLVKILGNDLI INFGGGLHGHPDGSAAGARACFAAVEATTKKINLKKYSKNYPDLKMALDHWN--- >rifcsphigho2_01_scaffold_71747_2Kuenenbacteria KSSYLNLNYRPRNREIIATYHVES--ALPLPKAAVEIAKESSIGTWTELATLKEKTFNRL APQIFDLKQKTIKIAYPLALFEPENLPQLLSALAGNIFSMKAVSKLRLLNLEFPEKYLNA FPGPAFGISGIRKILKIKSRPIIGSIIKPKLGLSSTEHARLAYQVWKNGVDLVKDDENLT DMGFNRFDDRVKKVLALKRRAEKETGQIKIHAFNITAPPDVMLKRAKFVKKMGGKCVMVD IVSTGLDNVLFLRKQNLGLVIHGHRAGHSLFTRDAHHGMTMLVLAKLARLSGIDQLHTGT VVGKMEGTKKEVTNINEQLRHFNTLRENWSNLKPVMPIASGGLHPGLVPALVKILGNNLI INFGGGLHGHPHGSVAGAHACYDAVMATTKKIPLKTYAKNHPELKAALNYWK--- >gwa2_scaffold_40962_2Falkowbacteria KTDYLNLKYRPDKTEILAEYLVES--SLPLKTIAVEIAKESSIGTWT---KLTTLDKKRL APQVYLIDSKIIKIAYPLDLFEPNSIPQLLSSLAGNIFSMKIITNLRLLDLHFPEQYINS YQGPAFGIEGIRKTLKIKKRPLVGSIIKPKVGLTASEHAQLAYAIWKNGVDLVKDDENLT DLPFNRFTDRVKKVIRLRRMVEKETGQKKLHVFNVTGTADMMLKRAKYVKSQGGKCVMLD IVSTGLDNVQFLRKWNLGLIIHGHRAGHSMFTRNEKHGLTMLVLAKLARLAGVDQLHTGA VVGKMEGGENEVVNINQFLKHFNFLKDDWSKLKPVMPVASGGLHPGLVEKLVDTLGNNLI INFGGGLHGHPAGSAAGAKACYDAVMATQKNYSLKQYANNHPELKEALDYWK--- >cg1_0.2_scaffold_64959_1Falkowbacteria KTSYLNLKYKPKSNNIIAVYYVES--KLPLEKTANEIAAESSIGSWT---ELTTTTKKRL APKIFYLNKKIIKIAYPLALFELGNLPQLLSSLAGNIFSMKIVDNLRLLDLEFPNKYIDS FKGPRWGIEGIRKITNIKKRPLVGSIIKPKVGLTAKQHAKLAYDIWKNGIDVVKDDENLT DLSFNKFKQRVDEVIKLRHQVEKETGQKKIHVFNITAPADLMLERAKYVKNRGGRCVMVD LVATGLDNVQFLRNRNLGLIIHGHRAGHSMFTRNPKHGITMLVIAKLARLAGIDQLHTGT IVGKMEGEKSEVTAINQFLK------SSWSKLKPVMPIASGGLHPALVPKLMKHLGNDLI INFGGGLHGHPGGSIAGAIACKASIYASFKNVSLRHVAKNCPELDMALKHWK--- >CG22_combo_CG10-13_8_21_14_all_scaffold_27032_2_ParcubacteriaParcubacteria SGTIYVLNYKPDDSEIIAIYQIEA--TLPLKQAAIEVAKESSIGTWT---TLSQEGSKKL SPHIFYADKKVVKIAYPLVLFELGNIPQLLSALAGNIFSMKMIENLRLLDLELPKRYIDS FPGPKWGIDGIRKIMKIKNRPLIGSIIKPKVGLSAQAHAKLAYDIWKNGVDLVKDDENLT DLTFNKFKDRVDAVIKYRRQVEQETKLRKIHVFNVTGPADLMLKRARYVKKQGGRCVMVD LVACGLDNVQFLRNQKLDLIIHGHRAGHSLFTRNPKHGLTMLILAKLARLAGVDQLHTGT VVGKMEGGKKEVAGINDFLKQVNFLLSDWSGIKPIMPIASGGLHPGLMPSLIKNLGQELI INFGGGLHGHPDG------------------------------------------ >CG06_land_8_20_14_3.00_150_scaffold_71672_c_2Micrarchaeota --------------------L--------------------------------------- ----------------PFSAT---GLP-------GE-----------SQAXX-XSEFVNS FKGPVYGKDAIKKIFK-KKSPITSVVPKPKLGYTAREHAEVAYAIWKGGMDCVKDDENLT DQKFNQFDERVKWVAKFRDKAEKETGNVKDAFLNVTSPDRELERRIKLVHDNGFKYFMID VVVSGFTAVQTACARDYKMAIHGHRAMHAMFTRNESHGMSMLFLAKLMRLMGVDQLHIGT VVGKLTGSQREIVATKEMILSGLRMPQKWGKIKPMLPVASGGLHPGLLPEVFGIYGTDLV LQLGGGTQGHPMGIEAGARAAMQAIDAYKEGISLSEYAKKHRELAAALKKWNVMK >CG08_land_8_20_14_0.20_scaffold_7644_c_2Micrarchaeota SDWYNERHYIKKKGDLIALFRYRAG-GISSEAAIGRIASESSSGTWTTLTKLP-RLLPKV KAYAYKYDSR-VEVAYPPIIFENGSVPALMSAIGGNIFGMKALDSLRLQDAELPSEFVNS FKGPVYGKDAIKKIFKKKSGPITSVVPKPKLGYTAREHAEVAYAIWKGGMDCVKDDENLT DQKFNQFDERVKWVAKFRDKAEKETGNVKDAFLNVTSPDRELERRIKLVHDNGFKYFMID VVVSGFTAVQTACNIDYKMAIHGHRAMHAMFTRNESHGMSMLFLAKLMRLMGVDQLHIGT VVGKLTGSQREIVATKEMILGL-RMPQKWGKIKPMLPVASGGLHPGLLPEVFGIYGTDLV LQLGGGTQGHPMGIEAGARAAMQAIDAYKEGISLSEYAK-HRELAAALKKWNVM- >CG09_land_8_20_14_0.10_scaffold_6172_c_6Diapherotrites IEWYHEKKYKPKNNDVKALFKYKPD-GLTKEEVIGRVASESSSGTWTTLTNKP-KLLKKV MAYGYEYDKD-VKIAYPRILFEDGSVPCMLSGIAGNIFGMKAVEGLRLQDAELPVDYVNG FKGPKFGQTAIQKIFKRKHGPITSVVPKPKVGYTAKEHAYVAKQVWSGGIDCVKDDENLT NQKFNKFADRVKYLAKVRDQVMKDTGDVKDAFINCTSPNKEMEKRVKMVHDYGFNYFMLD LVVAGFTAVGTAVELDYKMAIHGHRAMHAAFTKNPDQGISMLFLAKLYRLIGVDNIHTGT VVGKLEGNEKEIMAMKDMI----------------------------------------- ------------------------------------------------------- >rifcsplowo2_01_scaffold_53096_8Woesearchaeota IEWYHETKYKPKANDMKALFHYECAEGI-TSDAIGRVASESSSGTW----TTLTELPKKT KAYAYWYDKKSVKIAYPRLILEDGSVPALMSGVAGNIFGMKAVKYLRLIDAELPEDYVRT FPGPKYGKNVIKEIWK-RKSPVTAVVPKPKLGFTAREHAEVGYHVWKGGIDCVKDDENLT SQKFNRFDERVKWLAKFRDKAEKETGDYKDAFINCTAPNKELERRVKLVHDNGFRYFMID LLTAGFGEVGTAVELDYKMAIHGHRAMHAAMTKNPHIGISYLFLAKLFRLMGVDQVHTGT VVGKLTGKAEDVQAINEMCTSGLRMAQKWGKIKPILPVSSGGLHPGILDKVFDIYGTDIG LQVGGGTQGHPDGIEAGAKAVMQSIEAYKQGISVEEYAKNHRELARALEKWGTMH >CAI49476_Natronomonas_pharaonis_DSM_2160_II_REFreference -FLDES--YEPSDDDLVCTFRLVPGEGISVADAAARVASESSNGTW-AA-LSPESDVRAL ACDIGDEHGTQVTVAYPSGLFEDGSLPQILSCIAGNIMGMKAVETIRLLDCEWPAVIARS FPGPQYGSDVRTELLDAGDRPPLATVPKPKVGLSTEEHVSVAESAWRGGVDLLKDDENLT DQTFNPFEQRVADSFAARDRLEEETGERKDYLVNITAETDEMVRRAEFVDDHGGSFVMVD IITCGWSGLQTVRRRDLDLAIHAHRAMHAAFDRLPQHGVSMRCLAQFARLAGVDHIHTGT ALGKL--ENEDTAGINEWLR------SDLHGHSDVLPVASGGLHPGIVDQLLDALGTNVM VQAGGGIHGHPDGTEAGARALRAAVDAYADGESLDSRAESVPALRTALDEWGTQN >WP006108853_Natrialba_asiatica_DSM_1227_REFreference ------RTYDPAATELVCTFRIEPAEEMTIEAAASRVASESSNGTWATLH-VDESELTHL GAVACEIDES-ITVAYPADLFEAGSMPQILSCIAGNIMGMKAVESIRLEDCEWPESIVSG FPGPQFGTGVAREKLDAGDRPILATVPKPKVGLSTAAHARVGEEAWRGGVDLLKDDENLT DQDFNPFEDRLAESLAARDRVEEDVGERKDYLVNVTGETTEMLERVDLVAEHGGGFVMVD VITCGWAAVQSVRERDHGLAIHAHRAMHAAFDRLDHHGVSMRVLAQIARLCGVDHIHTGT ALGKL--ANEDTPGINDWLH------GDCQGLKPVLPVASGGLHPGVVDRLIDALGTDIC IQAGGGIHGHPDGTHAGAKALRQSVDASLDGVPLDEYAQEHAELATALDKWGAET >WP006666558_Natrialba_aegyptia_DSM_1307_REFreference ------RAYEPAATELVCTFRIEPAEGMTIEAAASRVASESSNGTWATLH-VDESELTHL GAVACEIDES-ITVAYPTDLFEAGSMPQILSCIAGNIMGMKAVESIRLEDCDWPESIVSG FPGPQFGTGVAREKLDAGDRPILATVPKPKVGLSTAAHARVGEEAWRGGVDLLKDDENLT DQDFNPFEDRLAESLAARDRVEEDVGERKDYLVNVTGETTEMLDRVDLVAEHGGGFVMVD VITCGWAAVQSVRERDHGLAIHAHRAMHAAFDRLDHHGVSMRVLAQIARLCGVDHIHTGT ALGKL--ANEDTPGINDWLH------GDCHGLKPVLPVASGGLHPGVVDQLIDALGTDIC IQAGGGIHGHPDGTHAGAKALRQSVDASLDGVSLDEYAQEHAELATALEKWGSET >WP006826142_Natrialba_taiwanensis_DSM_1228_REFreference ---FLD--YDPAATELVCTFRIEPAEDMTIEAAASRVASESSNGTW----ATLHVDESHL GAVACEIDESEITVAYPADLFEAGSMPQILSCIAGNIMGMKAVESIRLEDCDWPESIVSG FPGPQFGTGVAREKLDAGERPILATVPKPKVGLSTAAHARVGEEAWRGGVDLLKDDENLT DQDFNPFEDRLAESLAARDRVEEDVGERKDYLVNVTGETTEMLDRVDLVAEHGGGFVMID VITCGWAAVQSVRERDHGLAIHAHRAMHAAFDRLDHHGVSMRVLAQIARLCGVDHIHTGT ALGKL--ANEDTPGINDWLH------GDCHGLKPVLPVASGGLHPGVVDQLIDALGTDIC IQAGGGIHGHPDGTHAGAKALRQSVEASLDGVSLDEYAQEHAELATALDKWGAET >WP004216495_Natrialba_magadii_ATCC_4309_REFreference -FLDRE--YNPDSTDLVCTFRIDPAEDMSMEAAASRVASESSNGTW----AALHVDEDHL GAVAYGIDGDEITVAYPAELFEAGSMPQILSCIAGNIMGMKAVDAIRLEDCKWPESITSG FPGPQFGTSVAREKLDAGDRPILATVPKPKVGLSTDAHVRVGEEAWRGGVDLLKDDENLT DQDFNPFEDRLAESLAARDRVEEDVGERKDYLVNVTAETNEMLERVDLVDEHGGGFVMVD IITCGWSAVQSVRDRKHGLAIHAHRAMHAAFDRMDHHGVSMRVIAQISRLCGVDHIHTGT ALGKL--ANEDTPGINDWLH------GNCHGLNPVLPVASGGLHPGVVDQLIDALGTDIC VQAGGGIHGHPDGTHAGAKALRQATDASLDGVPLEEYAQAHAELATALKKWGAET >WP006167759_Natrialba_chahannaoensis_JCM_1099_REFreference -FLNRE--YEPDSTDLVCTFSIDPAEDMSMEAAASRVASESSNGTW----AALHVDEDHL GAVACGMDGNEITVAYPAELFEVGSMPQILSCIAGNIMGMKAVDAIRLEDCEWPKSITSG FPGPQFGTSVAREKLDAGDRPILATVPKPKVGLSTDAHVRVGEEAWRGGVDLLKDDENLT DQDFNPFEDRLAESLAARDRVEEDVGERKDYLVNVTAETNEMLERVDLVAEHGGGFVMVD IITCGWSAVQSVRDRDHGLAIHAHRAMHAAFDRMDHHGVSMRVIAQISRLCGVDHIHTGT ALGKL--ANEDTPGINDWLH------GDCHGLNPVLPVASGGLHPGVVDRLIDALGTDIC IQAGGGIHGHPDGTHAGAKALRQATDASLEGVPLEEYAQDHAELATALDKWGAET >WP006651990_Natrialba_hulunbeirensis_JCM_1098_REFreference -FLDRE--YDPDSTDLVCTFTIDPAEDMSMEAAASRVASESSNGTW----AALHVDEDHL GAVACGIDGDEITVAYPAELFEAGSMPQILSCIAGNIMGMKAVDAIRLEDCEWPGSITSG FPGPQFGTSVAREKLDAGDRPILATVPKPKVGLSTDAHVRVGEEAWRGGVDLLKDDENLT DQEFNPFEDRLAESLAARDRVEEDVGERKDYLVNVTAETNEMVERVDLVAEHGGGFVMVD IITCGWSAVQSVRDRKHGLAIHAHRAMHAAFDRMDHHGVSMRVIAQIARLCGVDHIHTGT ALGKL--ANEDTPGINDWLH------GDCHGLDPVLPVASGGLHPGVVDRLIDALGTDIC IQAGGGIHGHPDGTHAGAKALRQATDASLEGVPLEEYAQDHAELRTALQKWGSET >WP008012454_Haloterrigena_limicola_JCM_1356_REFreference -FLDLE--YEPAATDLTCEFTIDPAADMSMEAAASRVASESSNGTW----AALHVDEDDL GAVACEIDGSEITVAYPAALFEAGSMPQILSCIAGNILGMKAVDSIRLEDCHWPESIVSG FLGPQFGTSVANEKLDAGERPVLATVPKPKVGLSTAAHAEIGEEAWLGGVDLLKDDENLT DQEFNPFEDRLTESLAARDRAQEATGERKDYLVNVTAETTEMLERVDLVAEHGGGFVMVD VITAGWAAVQSVRKREHGLAIHAHRAMHAAFDRLEHHGVSMRVIAQIARLCGVDHIHTGT ALGKL--ANEDTPGINEWLT------GECHGLTPVLPVASGGLHPGVVDQLLEALGTDII VQAGGGIHGHPDGTHAGAKALRQSVDASMAGESLEDYADDHPELATALEKWGAET >WP006065604_Natronorubrum_bangense_JCM_1063_REFreference ------LEYEPTDTDLVCEFTIDPE-GMSTEAAASRVASESSNGTW-AALHVDEDELTDL SAVACAIDGH-ITVAYPDALFEAGSMAQILSCIAGNIMGMKAVETIRLEDCYWPEPLVSG FPGPQFGTSVAHEKLDAGDRPILATVPKPKVGLSTDAHVQIGEDAWRGGIDLLKDDENLT DQAFNPFEDRLADSLAARDRVEEDVGERKDYLVNVTAETNEMLERVDLVAEHGGGFVMVD VITCGWSAVQTVRERRHDLAIHAHRAMHAAFDRLEHHGVSMRVLAQIARLCGVDHIHTGT ALGKL--ANEDTPGINEWLT------SDLYGMNPVLPVASGGLHPGVVDQLLDALGTNII VQAGGGIHGHPDGTHAGAKALRQSVEASLEGVSLEDAADDHDELATALEKWGAET >YP003178578_Halomicrobium_mukohataei_DSM_1228_REFreference ---FLDLSYDPADSDLVCEFYVEPAADQDAESAASRVASESSNGTWLQVDGGVTDLSAGI EATAGDREGYRVTVAYPDALFEGGNMPQILSCIAGNILGMKAVDRIRLLDCTWPEPLATS FAGPQFGTSVRSEIFDADDRPITATVPKPKVGLSTDQHAQIGYEAWTGGLDLLKDDENLT DQAFNPFEDRLTESLAQRDRAVEETGEPKSYLLNVTAGGTEMLERVDMAAEHGCEYVMVD VVTCGWAAVQQVRERRHGLAIHAHRAMHAAFDRLPSHGVSMRVIAQIARLCGVDQIHTGT ALGKL--ANEDTVGINEWLT------SDLYGLDDVLPMASGGLHPGLVPELVDRCGTNIG IQAGGGVHGHPDGTHEGAKALRAAVEAAAEGRSIEAAADDEPALATALDKWGTET >WP007979331_Haladaptatus_paucihalophilus_DX25_REFreference ---FLDLGYTPAESELVCTFHVEPAADMDMEDAASRVASESSNGTW-----AELQVEGDL SATTFSIEGNEVKVAYPDALFEDGSMPQVLSCIAGNIMGMKAVERIRLLDCDWPEPLTTS FPGPQFGSGVRNEIFDAGERPITATVPKPKVGLSTDQHAQIGYEAWVGGLDLLKDDENLT DQEFNPFADRLTQSLAMRDDAQDETGEKKSYLVNITADTNTMLERADMVAEQGCEYVMVD VITTGWGAVQSVRERELGLAIHAHRAMHAAFDRIPSHGVSMRVLAQIARICGVDQLHTGT ALGKL--ENEDTVGINEWLY------SDLYGMNDVLPVASGGLHPGLVAELIDREG---- ------------------------------------------------------- >WP009365917_Halogranum_salarium_B_REFreference ---FLDLSYTPTETDLVCTFRIAPAEGMSMEAAASRVASESSNGTW-----APLHIDDDM GATTFSIDGDTIRVAYPAGLFEAGNMPQVLSCIAGNIMGMKAVDTIRLEDCEWPEAIVEG YPGPQFGSGVRQEIFGVDDRPILATVPKPKVGLSTERHAEIGYEAWVGGLDLLKDDENLT DQSFNPYSDRLTESLAMRDKAEDETGEKKSYLINITAETNEMLERADEAASQGNEYVMVD VVTAGWGAVQTVRERDLGLAIHAHRAMHAAFDRVPDHGVSMRFIAQISRLCGVDQLHTGT VLGKL--ANEDTVGINEWMR------SDLYGLKDVLPTASGGLHPGLVPQLMDAAGGNLC IQAGGGIHGHPDGTRAGAMALRQSVDSVTEGVPIGEYAEDHPELATALEKWGTET >WP_006056824_Halogeometricum_borinquens_REFreference GITYEDLSYEPDESDLVCTFRIAPE-GMDVEAAASRVASESSNGTWAALPTGE-G-FTDM GATTFDI-GS-VDVAYPAGLFEPGSMPQILSCIAGNIMGMKAVDTIRLADCQWPEALVSS FAGPQFGSSVREEIFGIEDRPILATVPKPKVGLSTDRHAEVGYEAWLGGVDLLKDDENLT DQSFNPYHDRLVESLALRDDAEDETGEKKSYLINVTADTNTMLERVDLAAEEGCEYVMVD VITTGWAAVQTVRERDHGIAIHAHRAMHAAFDRLENHGVSMRVLAQISRLCGVDQLHTGT ALGKLAN--EDTVGINDWMR------SELYGLNDVLPVASGGLHPGLIPDLLDATGTNVC VQAGGGIHGHPDGTRAGAAALRAAVDGYAAGKSLEESAD-SPELATAIDEWGTE- >WP007539013_Haloferax_larsenii_JCM_1391_REFreference -FLDLD--YEPTDEDLVCTFRIDPATGM-TTEAASRVASESSNGTW----AALQTGADDM GATAFSIDGDTIGVAYPAGLFEPGNMPQILSCIAGNIMGMKAVDTIRLMDCEWPESIVSS FDGPLFGSSVREEIFGVDDRPITATVPKPKVGLSTKSHAQVGYDAWLGGVDLLKDDENLT DQDFNPFADRLTESLALRDDAEDETGEKKSYLINVTADTQTMLDRVDEVAEQGGEYVMVD IITAGWAGLQTVRERKHGMAIHAHRAMHAAFDRMPTHGVSMRVLAQISRLLGVDQLHTGT ALGKL--ANEDTVGINEWMR------SDLYGTTDVLPVASGGLHPGLLPDLLDATGTNVC VQLGGGIHGHPDGTRAGAVALRAAIDGYVDGKSIQEVADETPELAVALDKWGTET >WP008323546_Haloferax_elongans_ATCC_BAA151_REFreference -FLDLD--YEPTDEDLVCTFRIDPATGMTTEEAASRVASESSNGTW----AALQTGADDM GATAFSIDGDTIGVAYPAGLFEPGNMPQILSCIAGNIMGMKAVDTIRLMDCEWPEAIVSS FDGPLFGSSVREEIFGVDDRPITATVPKPKVGLSTKSHAQVGYDAWLGGVDLLKDDENLT DQAFNPFADRLTESLALRDDAEDETGEKKSYLINVTADTQTMLDRVDEVAEQGGEYVMVD IITAGWAGLQTVRERKHGMAIHAHRAMHAAFDRMPTHGVSMRVLAQISRLLGVDQLHTGT ALGKL--ANEDTVGINEWMR------SDLYGTTDVLPVASGGLHPGLLPDLLDATGTNVC IQLGGGIHGHPDGTRAGAVALRAAIDGYVDGKSIQEVADETPELAVALDKWGTET >WP008321381_Haloferax_mucosum_ATCC_BAA151_REFreference -FLDLD--YEPTSEDLVCTFRIDPATGMSTEAAASRVASESSNGTW----AALQTGADDM GATTFAIDGDSIQVAYPTGLFEPGNMPQVLSCIAGNIMGMKAVDSIRLMDCEWPESVVSS YPGPLYGSSVREEVFGVSDRPITATVPKPKVGLSTAAHAQVGYDAWVGGVDLLKDDENLT DQAFNPFADRLTESLSLRDDAEDETGEKKSYLINVTADTQTMLDRVDEVAAQGGEYVMVD IITAGWAGLQTVRERKHGLAIHAHRAMHAAFDRLPTHGVSMRVLAQISRLCGVDQLHTGT ALGKL--ANEDTVGINDWLR------RDLYGANDVLPVASGGLHPGLLPDLLDATGTNVC IQLGGGIHGHPDGTKAGAVALRSAIDAYVEGTSITEAADETPELAVALDKWGTET >WP004971045_Haloferax_denitrificans_ATCC_3596_REFreference -FLDLD--YEPTDEDLVCTFRIDPATGMSTEAAASRVASESSNGTW----AALQTGADDM GATTFDIDGDLIKVAYPAGLFEPGNIPQVLSCIAGNIMGMKAVDTIRLLDCEWPESVVSA YPGPLYGSSVREEIFGVTDRPITATVPKPKVGLSTAAHAQIGYDAWVGGVDLLKDDENLT DQSFNPFSDRLTESLSLRDDAEDETGETKSYLINVTADTQTMLDRVDEVAAQGGEYVMVD IITAGWAGLQTVRERKHGIAIHAHRAMHAAFDRLPTHGVSMRVLAQVSRLCGVDQLHTGT ALGKL--ANEDTVGINDWLA------SDLYGTTDVLPVASGGLHPGLLPDLLDATGTNVC VQLGGGIHGHPDGTRAGAVALRSAIDAYVEGRSISEAAEETPELAVALDKWGTET >WP004043974_Haloferax_volcanii_DS_REFreference -FLDLD--YEPTDEDLVCTFRIDPATGMSTEAAASRVASESSNGTW----AALQTGADDM GATTFDIDGDLIRVAYPAGLFEPGNMPQVLSCIAGNIMGMKAVDTIRLLDCEWPESVVSA YPGPLYGSSVREEIFGVTDRPITATVPKPKVGLSTAAHAQVGYDAWVGGVDLLKDDENLT DQAFNPFSDRLTESLSLRDDAEDETGEKKSYLINVTADTQTMLDRVDEVAAQGGEYVMVD IITAGWAGLQTVRERKHGLAIHAHRAMHAAFDRLPAHGVSMRVLAQVSRLCGVDQLHTGT ALGKL--ANEDTVGINEWLA------GDLHGATDVLPVASGGLHPGLLPDLLDATGTNVC VQLGGGIHGHPDGTRAGAVALRSAIDAYVEGRAITEAAEETPELAVALDKWGTET >WP004977657_Haloferax_gibbonsii_ATCC_3395_REFreference -FLDLE--YEPTDEDLVCTFRIDPATGM-STAAASRVASESSNGTW----AALQTGADDM GATTFDIDGDLIRVAYPAGLFEPGNMPQVLSCIAGNIMGMKAVDTIRLLDCEWPESVVSS YPGPLYGSSVREEVFGVTDRPITATVPKPKVGLSTAAHAQVGYDAWVGGVDLLKDDENLT DQAFNPFSDRLTESLSLRDDAEDETGEKKSYLINVTADTQTMLDRVDEVAAQGGEYVMVD IITAGWAGLQTVRERKHGIAIHAHRAMHAAFDRLPAHGVSMRVLAQVSRLCGVDQLHTGT ALGKL--ANEDTVGINEWLA------GDLYGMNDVLPVASGGLHPGLLPDLLDATGTNVC VQLGGGIHGHPDGTRAGAVALRSAIDAYVEGRSISEAAEETPELAVALDKWGTET >WP008092479_Haloferax_prahovense_DSM_1831_REFreference -FLDLE--YEPTDEDLVCTFRIDPATGM-STAAASRVASESSNGTW----AALQTGADDM GATTFDIDGDLIRVAYPAGLFEPGNMPQVLSCIAGNIMGMKAVDTIRLLDCEWPESVVSS YPGPLYGSSVREEVFGVTDRPITATVPKPKVGLSTAAHAQVGYDAWVGGVDLLKDDENLT DQAFNPFSDRLTESLSLRDDAEDETGEKKSYLINVTADTQTMLDRVDEVAAQGGEYVMVD IITAGWAGLQTVRERKHGIAIHAHRAMHAAFDRLPSHGVSMRVLAQISRLCGVDQLHTGT ALGKL--ANEDTVGINEWLA------GDLYGMNDVLPVASGGLHPGLLPDLLDATGTNVC VQLGGGIHGHPDGTRAGAVALRSAIDAYVEGRSISAAAEETPELAVALDKWGTET >ncbi_ASMR01000001.1_43Diapherotrites KEVYEQLHHQPGKNELVCLFYIEPK-GETIKRAAGAVAAESSVGTWTSVRGLGLKHVHKI AATVFEI-GN-IKVSYPLDNFELGNMSQVYSAIAGNILGMKAVNNIRLQDTQWPAKLLNS FPGPQFGIEKLRKKLKIKKRPFLACVPKPKIGMTSKEHAQIGWEIWTGGIDLLKDDENLT SQPFNRFSERVKLCMKLAEKAEKVTGERKAGLFNVTAPAKEMFKRASLVADYGNPYVMVD LLTTGWSAVQGLRDHDLGLAVYAHRAFHSAFTRNPKHGVSMLMVAKTGRLIGVDNVHIGT GIGKLAGAKPEVLVLKKQMQEH-VLFQNWGRIKNVVPVSSGGLHPGNIPELIGFLGTDIA IQIGGGVHGHPNGSHAGAKATRQAISATLSGVDLRDYSRTHPELKVALDTWGFK- >rifcsphigho2_01_scaffold_10007_6Diapherotrites KEVYENLSHKPGKDELVCLFYIEPMKGESIKRAAGAVAAESSVGTWT---EVKGLGLKKI AATVFDIQGNFIQVAYPLDNFEAGNISQIYSAIAGNIMGMKAVENIRVEDVKWPAKILNS FPGPQFGMKGVRKRMKIKKRPLLACVPKPKIGMTSKEHAKIGFDVWSGGVDLLKDDENLT SQPFNRFSDRVKWCMKEAEKAEKMTGEAKACLLNVTAPGKELYQRAHVVAEYGNPYVMVD LLTAGWSAVQGLRKEDLGLALYCHRAFHSAFTRNPRHGVSMYVIAKSARLIGADNIHVGT GIGKLAGTKPEVLAISNAMQSLRLLGQDWGKMKDVIPVSSGGLHPGNVPEVMNLLGSEIC IQIGGGIHGHPNGSHDGALAARQAVSAAMEKVSLSDYARFHPQLQAALDTWGQKH >07M_4_2014_scaffold_64364_1Pacearchaeota ------------STN-----K----------RAWGEVVAESSVGTWSKIDRNKYKYVNSV SAWVFEAKGDWIKIAYPEDHFEQGNLPQILASIAGNVFGMKAVESLRLEDIEFTKKIVDS FPGPKFGRNGIRKFMDVKKRPIMLSVAKPKVGMTTEEHTEVGRQIWSGGLDLLKDDENLA SQFFNPFEKRVKSCLKVRDRIEKETGEKKSYLINITGPTKEMEKRAKIIKEYGGEFAMID IITTGWSGVASMREIDLGLALHAHRAMHGAMTRNHLQGFSMTCVAKCSRLLGIDTLHIGT AIGKLVGSKEEVIHLEDEITLLHTLHQDWYDIKPVFPCSSGGLHPAIVDKVLNRLG---- ------------------------------------------------------- >07M_4_2014_scaffold_1566_11Pacearchaeota MAGYEDMDYKPDSHDVIASFRVKVPRWSSSKRAWGAVAAESSVGTWSPLKAMNYAHVKKV AAKVFEAKGEWIKIAYPEEHFEEGNMSQILASIAGNVFGMKAVGALRLEDMQFTKKIIKS FPGPQFGRNGVRNIFKEYKRPLMLSVAKPKVGLTTKEHVEVGRQIWEGGLDLLKDDENLA DQKFNPFKKRVVDSLKVRDKIEKNTGEKKSYLINITGPNKEMEKRAKFIKEQGGEYAMID IVTTGWAAVASVREIDLGLAIHAHRAMHGAMTRVPTHGFSMNCVAKCIRLLGVDQFHIGT AVGKLVGTVEETVHHEQQITKLHCLEQKWFGMKPVFPVSSGGLHPLLVDKVMDKLG-DIM LQIGGGIHGHPNGSYAGAKAMREAVEAYMAGTDLAVAAKKCEELKVALDYWGRKG >rifcsphigho2_01_scaffold_54027_7Pacearchaeota MADYDNLDYKPTKKDVIASFRVRVP-SWSEKRAWGAVAAESSVGTWSPLKAMNYAHVKKV AAKVYYAKGE-IKISYPEDHFEEGNMSQILASIAGNVFGMKAVAGLRLEDIKFTKKIVNS FPGPKYGRKGIRKIFKVYKRPLMLSVAKPKVGLTTKEHIEVGRQIWEGGLDLLKDDENLT NQKFNKFEDRVVGSLKVRDKIEKKTGEKKSYLINITGPTKEMERRAKFVKNQGGEYIMID IITTGWTALNSIREIDLGLAIHSHRAMHGAMTRVRTHGFSMMCVAKCARLIGVDQLHIGT AVGKLVGPKDEVFEIEKIIT-LHTLGQEWFGMNEVFPVSSGGLHPLLVDKVMDILGTDIM LQIGGGIHGHPKGSYAGAKAMREAVDAYINGESLITAAKKSKELGEALKYWGRKA >rifcsplowo2_01_scaffold_43_141Diapherotrites ---------------MIVSFRVGVP-GWETKRAWGAVASESSIGTWAKVVGLKYEHVKKV AAKVFGKDGW-IKIAYPVDHFEYGNMAQILASIAGNDFGMKAVSSLRLQDVKWPAKIVKA FPGPQFGIKGIRKIFGVKKRPLMLSVPKPKVGMTTKEHAQIGYDIWTGGLDLLKDDENLS NQKFNPFEKRVKLSLKLRDKAEKETGEKKSYLINVTAPTDEMLRRAKFVAREGGEYAMID VITSGWTSVRTLREHDLGLAIHAHRAFHGAFDRNPKHGFSNLAVGEVVRLLGVDQYHIGT HIGKMVSPKHEVLDVQEHITVEHCLAEDWLDTKPVFPVASGGLQPGLIPQIIDLLGTEIM LQLGGGVHGHPKGSHAGAIAMRAALEAKLEGKSLEEASASCKALQQAMDYWGYTR >CG10_big_fil_rev_8_21_14_0.10_scaffold_48964_c_1Woesearchaeota IAGYDDLNYKPKPTDIIASFKVKVK-WQKNKTAFGAVASESSVGTWTGLKALHYKHVQKV AAKVYETKDSWIKIAYPEKHFEPGNMPQILASIAGNVFGMKTVDGLRLQDIKWTKSAVKA FPGPQFGIKGVRKIFNEKKRPLMLSVAKPKVGMTTAEHSKVGWQIWTGGLDLLKDDENLS NQWFNPFQRRVTTCLKLRDKAEKLTGDKKSYLINVTAPTKEMENRAKFVAKMGGEYAMID ICTSGWTALHSLREIDLGLAIHCHRAFHGAFTRNPQHGMSMILMGEITRLLGGDQLHIGT AVGKMVGGKKEVLEIEDHVTEN-TLGQDWYGMKPVFPVA--------------------- ------------------------------------------------------- >rifcsphigho2_02_scaffold_221194_1Pacearchaeota ------------QKD-----T----------TALGAVASESSVGTWA---AVKALHYKKV AAKVYKKIGPWVKIAYPEEHFEPGNMSQIFASIAGNVFGMKAVDSLRLQDIQWTKKLIKS FPGPQFGINGVRKIFK-EKKPLMLSVAKPKVGMTTAEHARVGWQIWTGGLDLLKDDENLS NQWFNPFQRRVQTCLQLRDKAEKITGDKKSYLINVTAPTQEMEKRAKFVAKQGGEYVMID ICTGGWMGGSSLREIDLGLAIHAHRAFHGAFTRIPAHGFSMLNLAKSVRLLGVDNLHIGT AVGKLVGGAQEVKNIQEQITQEHSLEQNWYGTKPVFPVSSGGLHPILVEDVLDRMGTNIM LQIGGGVHGHPKGSYAGAKAMREAVEAYMDGVHVDDAAKTSKELRQALEYWGHTR >rifcsphigho2_02_scaffold_58627_1Pacearchaeota -------------------FHIKVS-WQKDTTALGAVASESSVGTWA-VKAVHYKHVQKV AAKVFEAPRNWVKIAYPEAHFEPGNMSQVLASIAGNVFGMKSVDSLRLQDIQWTKKLIKS FPGPQFGINGIRKIFKEKKRPLMLSVPKPKVGMTTAEHKDIGWQIWTGGLDLLKDDENLT NQWFNPFYRRVSETLAVRDKAEKVTGDVKSYLINVTAPTQEMEKRAKFVAKQGGEYVMID ICTGGWMAVSSLREIDLGLAIHAHRAFHGAYTRIPSHGFSMLNLAKTVRLLGVDNLHIGT AVGKLVGGAVEVKNIQQQITEH-SLEQNWYGTKS----SHSGRCP--------------- ------------------------------------------------------- >rifcsplowo2_02_scaffold_83872_4Pacearchaeota MAGYEDTKYKPKSTDLLVSFKVKIK-GVKKEKAIGAVASESSVGTWATVKGSEYSHVKKV AGTVYFVPTSWTKIAYPQDHFEPGNMSQILASIAGNVFGMKAVDGLRIEDIKWPAKIRDS FPGPQFGIDGIRKIFKVKKRPLLLSVPKPKVGMTTPEHCEIGRQIWSGGIDLLKDDENLS SQKFNPFEKRVRTAVKIRDRIEKETGDKKSYLINITHSSKEMEKRAKLIADLGWEYVMID IVTTGFSGVHSVRELDLGLAIHAHRAMHASFTRNPDHGFSMLALAETSRLLGVDNIHIGT AVGKLVGTSDQVLSLQDHITLR-TLGEDWGRIKPVFPVSSGGLHPGVLPDVMKKMGTNIM IQAGGGIHGHPGGSHAGAIAVRQAVEAYLDGETLDERARKVPELQQALTLWGKK- >rifcsplowo2_02_scaffold_22033_7Woesearchaeota MAGYEDLGYKPKPTDLLVSFRVHVS-WETPKRSFGAVASESSVGTWASVVALKYLHVQEV AGKVYAV-QSAIKIAYPQDHFEAGNMPQILASIAGNVFDMKAVASLRLEDIRWPKKIVKS FPGPQYGIAGIRNILKVKKRPLMLTVPKPKVGMTTTEHCEIGRQIWLGGLDLLKDDENLT NQTFNPFKKRVETALKIRDRIENQTGEKKSYLINATAPTKEMEQRARFIADQGGEYAMID IITSGWTALHSLRELDLKLAIHAHRAFHGAFTRVPFQGFSMLATAEVARLLGADQLHIGT AVGKMVGSAHEVHEIQDHIV-H-CLGENWGNTKPMFPVSSGGLHPILVPDVLERMGTNIM LQIGGGIHGHPGGSYAGARAMREAVEAYLDGKTMEEASQSCKELRDALKFWGHT- >rifcsphigho2_02_scaffold_55123_15Woesearchaeota MAGYEDLDYKPKKTDLIASFHVKAA-WSTMKRSCGAVASESSVGTWASVTGLKYEHVQKV AGKVFHI-NDWIKIAYPQDHFEMGNMSQILASIAGNIFGMKAVESLRLQDIQWPEKIKKS FPGPQFGISGIRKIFGVKKRPLMLSVAKPKVGMTTKEHCEIGRQIWTGGLDLLKDDENLT GQKFNPFERRVTEALRIRNNVERKTGEKKSYLINVTHSRKEMERRAKFVAKHGGEYVMLD IVTAGWQATHSLRELDLGLAIHAHRAFHGAFTRNPEHGFSMLAVGECARLVGVDQLHIGT AVGKLVGSKEEVITIQEHIALH-TLDEDWGNMKPVFPVSSGGLHPTILPTVLDRLGIDVM LQIGGGIHGHPKGSYAGACAMRAAVEAYMDDISMEEAAKKSVPLKQALDFWGYT- >RifSed_csp2_13ft_1_scaffold_647396_2Aenigmarchaeota ------------------------------------------------------------ ----------------------------------------------------L------- --------------------------------------------------DLLKDDENLT SQKFNQFDRRVKTALKVRNQCERATGDKKSYLINITHSNNEMVRRAKLVASLGWEYVMID IVTTGFTAVQSVRELDLGLAIHAHRAMHGAFTRHPLQGFSMLALGEVSRLLGVDQLHIGT AVGKLVGSKQEVIEIEKHIAKLHTLSEDWLNLKPVFPVSSGGIHPGIIPDICRRMGVDIQ LQIGGGIHGHPKGSYAGAKAMREAVEAYMDGISVDKKAKTSPELAQALKQWGHMR >rifcsplowo2_01_scaffold_223977_1Pacearchaeota DFVHLD--YKPSKTDLIVAFVRVPSWEK-KPRAIGAVASESSVGTWATVTGLKYAHVQRV AGRVFSISGRWIKIAYPQDHFELGNMSQILASIAGNVFAMKAVDSLRIEDIKWPKAIVKS FPGPQYGIDGVRKIFKVKERPLMLSVPKPKVGMTVEEHCEIGREIWTGGLDLLKDDENLS NQKFNQFDKRISLALKIRDKIERETGEKKSYLINVTHSNKEMERRAKLVASLGWEYAMID VVTTGFTAVHSIRELDLKLAIHAHRAMHGAFTRDPLQGFSMLALGEVVRWLGVDQLHIGT AVGKLVGTREEVVEIQKHIAKLHTLDEDWLDVKPVFPCSSGGLHPGVVADVCDRLGVDLL LQLGGGVHGFA-------------------------------------------- >RifSed_csp2_10ft_2_scaffold_141884_1Pacearchaeota PEWYHERSHKPDPDDIIVHYRYDAR-GFSPEECVGRVASESSVGTWTTLARLP-ARVRKI RATGYDY-KN-MLVSYPRELWEPGNLAQLWSGICGNVFGMKAVKNLRLIDVRLPRWYLKG FRGPQFGIAGVRRMLKVKDRPLTATVPKPKLGYSAKEHAEVGFQTWCGGVDLIKDDENLT NQSFNRFGDRIKRMAKMRDKAEKKTGERKSALLNITAETNEMVKRAKMLADLGWEYAMVD VVVAGTSALQTVRETDLGLAIHAHRAMHAMYTKNLQHGMAMPALVKFSRLIGVDNIHVGT VVGKLESPREEVMANIAQCRGF-LLDQDWGTMKPVWPVPSGGLHPGLMYDIIKIFGKDCI IQAGGGV------------------------------------------------ >RifSed_csp2_10ft_3_scaffold_402847_1Pacearchaeota TEWYDEAKVRPKPTDIVALFRFDCR-GISVREALGRIASESSVGTWTTLAELP-RRIPGM KAMAYKW-SH-AHVTYPAELWEPGNMPQLLSGVAGNIFGMKAVKRLRLLDLRLPAWYLKG FRGPLDGIAGLRRRLKVPKRPLTATVPKPKLGWSAAEHAHLGAEAWMGGMDLVKDDENLT SQSFNRFEKRVELLTRERRKAERATGEVKSALINVTAETKEATRRAKLLAREGWEYAMVD VVTTGWGALQTLRDEELDLAIHAHRAMHAMFTKDEAHGMAMPCLAKLLRCVGVDQLHAGT IVGKLVSPKHEVLASADTLREG-LLDQDWGRIKPAFPVASGGLHIGIVPDILRTFGLDQA IQLGGGVHGHPDGTRAGAKALLDVIHGTMDGERLADIGRSSPEVRRALEKWGRE- >YP001047237_Methanoculleus_marisnigri_JR_REFreference -FVDLN--HTPGPDELVALYYFEPAAGI-SKEAVGRIASESSTGTW----TTLFTLPPDL QAKAFEIEGNYVKIAYPLALWEEGNAAQLLSGIAGNVFGMKALDRLRLIDASLPAEYLRH FKGPHFGMEGIRDMMKIRGRPLTGAVPKPKVGFTAEEHAEVGYETWMGGFDFVKDDENLT SQSFNRFDDRVRAMAKMRDRAEQETGDVKSAFINITADTETMEKRAKMLADYGWNYAMID VVVAGTAAVATLRDYDLGLAIHAHRAMHAAFDRDEKHGITMQFLAKMMRLVGVAQIHTGT AVGKLVGTRAEATVLADMLRDHMALDQDWGNIKSAFPVSSGGLHPGLVPDVLDIYGSELV LLVSGGIHGHPKGTRAGAEAAMQAIEAWKEGETLEEKAKKAPALSEALEKWGRYK >YP006545805_Methanoculleus_bourgensis_MS_REFreference -FVDTD--HTPGSDELVALYYFEPAAGI-SREAVGRIASESSTGTW----TTLFTMPPDL QAKAFEIEGNYVKIAYPLALWEEGNAPQLLSGIAGNVFGMKALESLRLIDASLPAEYLRH FKGPHFGMEGIRDMMKVHGRPLTGAVPKPKVGFTAEEHAEVGYETWMGGFDFVKDDENLT SLAFNRFEDRVRAMTKMRDRAEQETGDVKSAFINITADTETMKKRAEMLADYGWNYAMID VVVTGTAAVATLRDHDLGLAIHAHRAMHAAFDRDEKHGITMQFLAKIMRLIGVAQIHTGT AVGKLVGTKAEATVLADMVRDHMALDQDWGAIKSAFPVSSGGLHPGLVPDVLDIYGSDLV LLVSGGIHGHPKGTRAGAMATMQAIEAWSEGETLEEKAKKAPELREALEKWGYYK >rifcsphigho2_12_scaffold_32189_9Aenigmarchaeota FEWYTDENYKAKKTDVVCLFYFEPDHVS-VKEAAGRIAAESSAGTW----TTLNRIPKKV MATSFDIRGKYVKIAYPIELWDEGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPPVYLKH YKGPRHGIQGLRKILKVKKRPITGAVPKPKIGFTAKEHAEIALETWMGGFDLTKDDENLT TTPFNKFEERVKLMTRMRQKAEKETGDVKSALLNITGPTHIMIKRAKMLHDLGWEYAMID VVTAGTAAVQTMREVDYGMAIHAHRAMHATFDRNPKHGLSMLFLGKIMRAIGVDQIHVGT VVGKLVGGKGEVTEIEREITPFDLLPQNWGKIKPVLPVSSGGVHPGIIPDILDILGNDIG LLVSGGIHGHPKGTRAGAVAALQAIDAHMHGIKLEDYARHHVELAQALEKWGRAR >rifcsplowo2_12_scaffold_67979_10Aenigmarchaeota FEWYTDENYKAKKTDVVCLFY--------FE----RIAAESSAGTW----TTLNRIPKKV MATSFDIRGKYVKIAYPIELWDEGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPPVYLKH YKGPRHGIQGLRKILKVKKRPITGAVPKPKIGFTAKEHAEIALETWMGGFDLTKDDENLT TTPFNKFEERVKLMTRMRQKAEKETGDVKSALLNITGPTHIMIKRAKMLHDLGWEYAMID VVTAGTAAVQTMREVDYGMAIHAHRAMHATFDRNPKHGLSMLFLGKIMRAIGVDQIHVGT VVGKLVGGKGEVTEIEREITPFDLLPQNWGKIKPVLPVSSGGVHPGIIPDILDILGNDIG LLVSGGIHGHPKGTRAGAVAALQAIDAHMHGIKLEDYARHHVELAQALEKWGRAR >rifcsphigho2_01_scaffold_412949_4Aenigmarchaeota FEWYTDTNYKPKKTDIKALFYFEPAKGISIKEAAGRVASESSAGTW----TTLSNMPKEV MATAYEIHGNYVKIAYPGCLWDKGNAPQLLSGIAGNIFGMKALSNLRLIDAEFPADYIKG FRGPQYGIQGLRRLLKVKKRPITGAVPKPKIGFSAAEHAKVGYETWMGGFELLKDDENLT STPFNRFEERIRLCAKMRDKAEKETGDRKSALLNITGPTHVMIKRAKMLHNLGWEYAMID VVTAGTAAVQTMREVDYHLAIHAHRAMHASFDRNPKHGISMLFLAKLMRLIGVDQIHVGT VVGKLVG----------------------------KPMFCGARRRKNLSPHLIRTG---- -----------------------AK------------------------------ >rifcsphigho2_02_scaffold_49666_4Aenigmarchaeota FEWYTDTNYKPKKTDIKALFYFEPAKGISIKEAAGRVASESSAGTW----TTLSNMPKEV MATAYEIHGNYVKIAYPGCLWDKGNAPQLLSGIAGNIFGMKALSNLRLIDAEFPADYIKG FRGPQYGIQGLRRLLKVKKRPITGAVPKPKIGFSAAEHAKVGYETWMGGFELLKDDENLT STPFNRFEERIRLCAKMRDKAEKETGDRKSALLNITGPTHVMIKRAKMLHNLGWEYAMID VVTAGTAAVQTMREVDYHLAIHAHRAMHASFDRNPKHGISMLFLAKLMRLIGVDQIHVGT VVGKLVGTKDEVMGIEKEITPFPTLNQNWGKIKPVLPVSSGGVHPGLIPDILGVLGTDIC LLVSGGIHGHPKGTRAGAKAAMQAIDAHMSHTSLEEYAKKHIELRQALEKWGHKH >rifcsphigho2_02_scaffold_58713_2Aenigmarchaeota FEWYTDTSYRPNKTDLKCLFYFEPK-GISVKEAVGRIASESSAGTWTTLARMP-KRIKGT MATAFEIGGH-AKVAYPIDLWDPGNVPQLLSGIAGNIFGMKALDNLRLVDVSLPQAYIRH FRGPQFGIDGLRKLLKVKKRPITGAVPKPKIGFSAREHAEVGYETWMGGFELLKDDENLT STSFNKFEERIRLCAKMRDKAEKETGDRKSALLNITGPTDVMIKRARMLHDLGWEYAMID VVTCGAAGVQTLREIDYHLAIHAHRAMHASFDRNPKHGISMLFLAKMMRLIGVDQLHIGT VVGKLVGTKGEVMNLEKELTSP-LLNQKWNHIKPVLPTSSGGVHPGLLPEIFDILGYDIL LLVSGGIHGHPKGTRAGAKAAMQAIEAHMVHTSLEEYAKKHIELRQALEKWGHK- >rifcsphigho2_02_scaffold_69512_7Aenigmarchaeota FEWYTDESYSPKSSDVVCLFYFEPAKGISVREAAGRIASESSAGTW----TTLARMPKGT MATSFEISGHYVKVAYPLDLWDPGNVPQLLSGIAGNIFGMKALENLRLLDVSLPAAYIKN FRGPQFGIQGLRKMLRVKKRPITGAVPKPKIGFSAREHAEVGYETWMGGFELLKDDENLT TTSFNRFEERIRLCAKMRDRAERETGDRKSALLNITGPTHVMVKRAKMLHNLGWEYAMID VVTAGSAAVQTLREIDYPLLLIAIQSM-------------------------------GY Q----------------------------------------------------CFS---- --------------------LRK-------------------------------- >rifcsplowo2_02_scaffold_215139_2Aenigmarchaeota ------------------------------------------------------------ ---------------------------------------MKALDNLRLVDVSLPPVYIKH FRGPQFGIDGLRKMLNVKKRPITGAVPKPKIGFSAREHAEVGYETWMGGFELLKDDENLT TTSFNRFEERIRLCARMRDKAEKETGDRKSALLNITGPTHTMIKRAKMLHDLGWEYAMID VVTAGSAGVQTLREVDYHLAIHAHRAMHASFDRNPKHGISMLFLAKMMRLVGVDQIHVGT VVGKLVGGKGEVMDIEREITSFPILPQNWGKIKPVLPVSSGGVHPGLIPDILDILGTDIC LLVSGGIHGHPDGTRAGAKAAMQAIEAHMSGVSLEEKAKKHKELAAAMQKWGHSK >rifcsphigho2_02_scaffold_22708_9Aenigmarchaeota FEWYTDESYRQKSSDVVCLFYFEPAKGISVKEAAGRIASESSAGTW----TTLARMPKGT MATAFDISGSYIKVAYPLDLWDPGNVPQLLSGIAGNIFGMKALENLRLVDVSLPPAYIRN FRGPQFGIDGLRKMLKVKKRPITGAVPKPKIGFSAREHAEVGYETWMGGFELLKDDENLT TTSFNRFEERIRLCARMRDKAEKETGERKSALLNITGPTHVMVKRAKMLHDLGWEYAMID VVTAGSAGVQTLREVDYHLAIHAHRAMHASFDRNPRHGMSMLFLAKMMRLIGVDQLHIGT VVGKLVGSKREVTDLEKELTEFPILDQKWGHIKPVLPTSSGGVHPGLLPEIFDILGYDIL LLVSGGIHGHPNGTRAGAKAAMQAIEAHMNKISLEEQAKKHKELAAALEKWGHSR >gwa2_scaffold_63_236Aenigmarchaeota FEWYTDESYKPKSSDIICLFYFEPAKGISVKEAAGRIASESSAGTW----TTLARMPKGT MATAFEINGHYVKVAYPLDLWDPGNVPQLLSGIAGNIFGMKALDNLRLVDVSLPPAYIKN FRGPQFGIDGLRKMLKVKKRPITGAVPKPKIGFSAREHAEVGYETWMGGFELLKDDENLT TTSFNRFEERIRLCAKMRDKAEKETGERKSALLNITGPTHVMAKRAKMLHDLGWEYAMID VVTAGAAGVQTLREIDYHLAIHAHRAMHASFDRNPRHGMSMLFLAKMMRLIGVDQIHVGT VVGKLVGTKGEVMDIEREITPFPVLRQNWGHIKPVLPVSSGGVHPGLVPDILDILGTDIC LLVSGGIHGHPNGTRAGAKAAMQAIEAHMNGISLEEQAKRHKELAAALQKWGHSR >rifcsplowo2_01_scaffold_126992_2Aenigmarchaeota FEWYTDESYKPKSSDIICLFYFEPAKGISVKEAAGRIASESSAGTW----TTLARMPKGT MATAFEKSGNYVKVAYPLDLWDPGNVPQLLSGIAGNIFGMKALDNLRLVDVSLPPAYIKN FRGPQFGIDGLRKMLKVKKRPITGAVPKPKIGFSAREHAEVGYETWMGGFELLKDDENLT TTSFNRFEERIRLCAKMRDKAEKETGERKSALLNITGPTHVMAKRAKMLHDLGWEYAMID VVTAGAAGVQTLREIDYHLAIHAHRAMHASFDRNPRHGVSMLFLAKMMRLIGVDQIHVGT VVGKLVGGKDEVMGIHKEIVSFPLLNQEWGRIKPVLPVSSGGVHPGLIPDILSILGTDIC LLVSGGIHGHPDGTRAGARAAMQAIEAHMSGISLEEQAKHNKELAAALQKWGHAK >rifcsplowo2_01_scaffold_13705_14Pacearchaeota TEWYHEERYAPHKSDLIVLFYYEPA-GVSNEEAVGRIASESSTGTWTTLFRLP-PRMRKL MATAFWM-GN-VKVAYPIGLWEMGNAPQLLSGIAGNIFGMKALRNLRLIDVTLPKEYIQH FKGPSHGIQGLRSLLKVKKRPLTGAVPKPKIGFSAAEHAGIAFETWMGGFDLVKDDENLT STTFNQFEERVKRMARMRDHAQHLTGNEKDALINITAETTEMILRAKLLHDHGFRYAMID VVTTGTAAVQTLREVDYNMAIHAHRAMHASFDRNPKHGITMQFLAKLMRLIGVDQIHAGT AVGKLVGGKHEVQSIANVLRTM-LLEQDWGSIKPAFPVSSGGLHPGLVPDVMRIFGNELV LLVSGGIHGHPKGTRAGAEATMQAIEAVERNVSLEEYAKTHGALNEALKKWGRL- >rifcsphigho2_02_scaffold_2812_33Micrarchaeota VEWYADEGHSPLKDDIVALFYFEPA-GISKKEAVGRIASESSTGTWTTLFTMP-PRMKKL MATAFEI-GN-VKVAYPIGLWEEGNAPQLLSGIAGNIFGMKAIENLRLVDASFPHAYVRN FKGPGCGMEGIRRMMKVKKRPLTGAVPKPKVGFSAAEHAAIGYETWMGGFDLVKDDENLT STHFNRFEERVKLMTRLREKAEKETGEEKDALINITAPVAQMEKRAKMLHENGWRYAMVD VVVAGTCAVQHMREVDYGMAMHAHRAMHASFDRNPKHGITMQFLAKMHRLIGVDQIHAGT AVGKLVGDRKEVQSVANVLRKM-LLRQEWGGIKGAFPVSSGGLHPALVPEVMDIFGNECV ILVSGGIHGHPKGTRAGAKASMDAMHAKMDGKTLEEAAKKSAELRAALEKWGYL- >rifcsplowo2_01_scaffold_95698_5Pacearchaeota TEWYTEENYTPSKNDLVALFYFEPAAGM-SKEAVGRIASESSTGTW----TTLFKLPPKL MAKAYEIRGNYVKIAYPLDLWEEGNAPQLLSGIAGNIFGMKALNNLRLLDVAFPSKYLAS FPGPRHGMTGIRSMMKVQKRPLFGAVPKPKVGFTAAEHAQIGYETWMGGFDLVKDDENLT SQNFNRFEERVKLMTRLRNRAERETGERKDALINITAETNEMQKRAKLLHDNGWRFAMID VVVAGTAATQTLREVDYGMAIHAHRAMHASFDRNPKHGMTMHFLAKLMRLIGVDQIHAGT AVGKLVGGVSEVQLNAAVLRANQLLAQDWGSIKPIFPVSSGGLHPGLIPEVMRILGNECV ILVSGGIHGHPRGTRAGAIASMQAKKA-----QRTGAGS---------------- >rifcsphigho2_01_scaffold_15374_6Pacearchaeota IEWYSESSYKPSRDELIALFYFEPAKGISVNEAMGRIASESSTGTW----TTLFKLPDKL KAHAYHKDGNFVKVAYPFDLWEPGNCPQLLSGIGGNIFGMKALDNLRLFDIQFPKKYLYS FKGPRHGIQGIRKMMKVYKRPMLGAVPKPKIGFSDVEHANIGYETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDNKDAYINITSETEKMKKRAKILHNYGFRHAMID VVVSGYSAVQTLRDVDYNMAIHAHRAMHSMFDRNVKHGMSMYMLAKLMRMIGVDQIHAGT VVGKLVGTADEVRNISDTLRKNLRFYQDWHGLKQAFPVTSGGLHPGLVPELIKLFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAVHEGESLKEHAKYNRELREALDKWGSMR >rifcsphigho2_02_scaffold_369452_2Pacearchaeota IEWYSESSYKPSRDELIALFYFEPAKGISVNEAMGRIASESSTGTW----TTLFKLPDKL KAHAYHKDGNFVKVAYPFDLWEPGNCPQLLSGIGGNIFGMKALDNLRLFDIQFPKKYLYS FKGPRHGIQGIRKMMKVYKRPMLGAVPKPKIGFSDVEHANIGYETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDNKDAYINITSETEKMKKRAKILHNYGFRHAMID VVVSGYSAVQTLRDVDYNMAIHAHRAMHSMFDRNVKHGMSMYMLAKLMRMIGVDQIHAGT AVGKLVGTADEVRNISDTLRKNLRFYQDWHGLKQAFPDKNANFS---------------- -----------------------SL------------------------------ >rifcsplowo2_01_scaffold_391963_1Pacearchaeota ------------------------------------------------------------ -----------VKVSYPLDLWEPGNAAQLLSGAAGNIFGMKALDNLRLLDINFPREYLKH FPGPRHGMEGVRKMMKVYHRPLLGAVPKPKVGFSDVEHAQIGFETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDLKDAYINITAETHEMERRAKLLHNYGFRYAMID VVTCGASSVQTLRETDYGMAIHAHRAMHASFDRNPKHGISMYFLAKLMRMIGVDQIHAGT AVGKLIGSADEVAGIRDVLRKNSRLSQDWHGIKRAFPVTSGGLHPGLVPDLLKLFGDECV MLVSGGIHGHPKGTRAGAMAVVQAIQACKEGETLENHAKYNPELREALEKWGHLR >GWB1_scaffold_10454_2Pacearchaeota VEWYLDSHYKPRKDELIALFYYEPANGV-SREAIGRIASESSTGTW----TTLFNMPPKL MAHSFSVDGNFVKVSYPLDLWEPGNAAQLLSGAAGNIFGMKALDNLRLIDIKFPREYLKH FPGPRYGMTGVRKMMKVYRRPLLGAVPKPKIGFSDVEHANIGFETWMGGFDFVKDDENLT STKFNNFDRRVKLIAKLRDKAEHATGDLKDAYINITAETHEMERRAKLLHNYGFRYAMID VVTCGVSSVQTLRETDYGMAIHAHRAMHASFDKNPKHGISMYFLAKLMKMIGVDQIHSGT AVGKLVGTADEVKGIADVLRKNVRLNQDWNGIKRAFPVTSGGLHPGLVPDLLKLFGDECV MLVSGGIHGHPRGTRAGAMATRQAIDAVHEGETLEQHAKYNKELREALEKWGHLR >rifcsphigho2_01_scaffold_464599_1Pacearchaeota --------------------R-------------------------------------KL MAHAYEVGGDFVKVSYPLDLWEPGNAAQLLSGAAGNIFGMKALDNLRLVDIHFPKEYLKH FPGPKHGMSGVRKMMDVYKRPLLGAVPKPKIGFSDVEHAQIGFETWMGGFDFVKDDENLT STKFNNFDKRVRLLAKLRDKAEHATGDNKDAYINITAETHEMERRAKLLHNYGFRYAMID VVTCGVSAVQTLRETDYGMAIHAHRAMHASFDKNLKHGISMYFLAKLMRAIGVDQIHSGT AVGKLVGTADEVKGIADVLRKNVRLPQDWHGIKRAFPVTSGGLHPGLVPDLLKLFGDECV MLVSGGIHGHPRGTRAGAMATRQAIDAVHEGETLEQHAKYNKELHQALEKWGHLR >gwa1_scaffold_8053_8Pacearchaeota VEWYLDSAYKPAKDELTALFYFEPAKGVSVNEAIGRIASESSTGTWTTLF-TMPPRMKKL MAHAYHREGNFVKVSYPFDLWEPGNCPQLLSGIAGNIFGMKALDNLRLIDIQLPKEYLKH FPGPRHGMAGVRKMMNVYNRPLLGAVPKPKIGFSDVEHAQIGYETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDTKDAYINITAETDEMKRRAKLLHDYGFKYAMID VVTCGFSSVQTLRETDYGMAIHAHRAMHASFDKNPKHGISMYLLAKLMRMIGVDQIHSGT AVGKLVGTSAEVKGIADVLRKNIRLSQDWNGIKRAFPVTSGGLHPGLVPDLLKLFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAVHEGETLEQHAKFNPELREALDKWAHLR >rifcsplowo2_01_scaffold_51844_6Pacearchaeota VEWYLDSKYKPSKNELIALFYFEPAKEVSIDESVGRIASESSTGTW----TTLFTMPAKL MAHAYHRDGNFVKVSYPFDLWEPGNAPQLLSGVAGNIFGMKALDNLRLIDIQLPKEYLKH FSGPRHGMAGVR-----------------------------------GGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEQATGDLKDAYINITAETHEMERRAKLLHDYGFRYAMID VVTCGFSAVQTLRETDYGMAIHAHRAMHASFDKNPKHGISMYLLAKLMRMIGVDQIHSGT AVGKLVGTASEVRGIADVLRKKVRLNQDWHGIKRAFPVTSGGLHPGLVPDLLKLFGDECV MLVSGGIHGHPRGTRAGAMAARQAIDAVHEGETLEQHAKFNPELAQALEKWGHLR >rifcsphigho2_01_sub10_scaffold_4260_8Pacearchaeota VEWYLDSSYKPARDELIALFYFEPAKGVSVNEAIGRIASESSTGTW----TTLFKMPPKL KAHAYYRSGNFVKVAYPLDLWEPGNAPQLLSGIAGNIFGMKALNNLRLLDIHFPKNYLKH FPGPRHGMHGIRKMMNVYHRPMLGAVPKPKIGFTDVEHAQIGYETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDTKDAYINITAETDEMKKRAKLLHDYGFKHAMID VVVSGYSAVQTLRETDYGMAIHAHRAMHAMFDRNPKHGMTMYMLAKLMRMIGVDQIHSGT AVGKLVGTFAEVKSISDTLRRNLRLTQDWHGLKQAFPVTSGGLHPGLVPYLMKEFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAVHEGETLREHAKYNLELAQALEKWGNLR >rifcsplowo2_01_scaffold_22357_10Pacearchaeota ------------------------------------------------------------ ----------------------------------MN------------------------ ----------------VYHRPMLGAVPKPKIGFTDVEHAQIGYETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDTKDAYINITAETDEMKKRAKLLHDYGFKHAMID VVVSGYSAVQTL-----------HRAMHAMFDRNPKHGMTMYMLAKLMRMIGVDQIHSGT AVGKLVGTFAEVKSISDTLRKNVRFAQDWHSLKQAFPVTSGGLHPGLVPYLMKEFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAVHEGETLREHAKYNLELAQALEKWGNLR >rifcsplowo2_02_scaffold_295114_1Pacearchaeota -----------------------------------RIASESSTGTWTTLF-KMPPRMNKL KAHAYYRSGNFVKVAYPLDLWEPGNAPQLLSGIAGNIFGMKALNNLRLLDIHFPKNYLKH FPGPRHGMHGIRKMMNVYHRPMLGAVPKPKIGFTDVEHAQIGYETWMGGFDFVKDDENLT STKFNNFDKRVKLLAKLRDKAEHATGDTKDAYINITAETDEMKKRAKLLHDYGFKHAMID VVVSGYSAVQTLRETDYGMAIHAHRAMHAMFDRNPKHGMTMYMLAKLMRMIGVDQIHSGT AVGKLVGTFAEVKSISDTLRKNVRFAQDWHSLKQAFPVTSGGLHPGLVPYLMKEFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAVHEGETLREHAKYNLELAQALEKWGNLR >rifcsphigho2_01_scaffold_6966_11Pacearchaeota VEWYLDSSYKPAKDELLALFYFEPS-GISVNEAMGRIASESSTGTWTTLFKMP-PRMNKL KAHAYSH-GN-VKVAYPLDLWEKGNCPQLLSGIGGNIFGMKALDNLRLLDIHFPKSYLRS FPGPRHGMQGIRKMMNVYNRPMLGAVPKPKIGFTDLEHAEIGFETWMGGFDFVKDDENLT STNFNNFDKRVKLIAKLRDKAEHATGDNKDAYINITAETHEMERRAKLLHDYGFKHAMID VVVTGYSGVQTLRNTDYGMAIHAHRAMHAMFDRNPKHGMSMYMLAKLMRMIGVDQIHAGT AVGKLTGTAGEVKNISDTLRNV-RFAQDWFGLKQAFPVTSGGLHPGLVPDLLKIFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAIHEGETLEEHAKYNLELREALEKWSHL- >rifcsplowo2_01_scaffold_412156_2Pacearchaeota ------------------------------------------------------------ -----------------------------------------------FSDVE-------- -------------------------------------HAEIGFETWMGGFDFVKDDENLT STSFNNFDKRVRLLAKLRDKAEHLTGDTKDAYINITAETHLMEKRAKLLHNYGFKHAMID VVVAGNSGVQTLRNTDYKMAIHAHRAMHAVFDKNPKHGMTMYMLAKLMRMIGVDQIHSGT AVGKLVGSKHEVQDIAYALRKHHLLPQEWHHIKPAFPVTSGGVHPGLIPDLLKIFGDECV MLVSGGIHGHPRGTRAGAMAARQAIDAVHEGETLEQHAKYNRELREALDKWSHLR >GWB1_scaffold_6749_11Pacearchaeota VEWYLDSSYKPSKDELIALFYFEPAKGI-SAEAMGRIASESSTGTW----TTLFTMPPKL MAHAYSREGSFVKVAYPLDLWEEGNAPQLLSGIAGNIFGMKALDNLRLIDIQFPKAYIKH YPGPTHGMHGIRKMMNVYNRPMLGAVPKPKIGFSDVEHAQIGFETWMGGFDFVKDDENLT STKFNNFDKRVRLLAKLRDKAEHATGDIKDAYINITAETHQMEKRAKILHNYGFRHAMID VVVAGNAAVQTLRNTDYKMAIHAHRAMHAVFDKNPKHGMTMYMLAKLMRMIGVDQIHSGT AVGKLVGTKHEVQDIAYTLRKHHLLPQEWHHIKPAFPVTSGGLHPGLVPDLLKIFGDECV MLVSGGIHGHPKGTRAGAMATRQAIDAVHEGETLEQHAKFNPELREALDKWSHLR >rifcsplowo2_02_scaffold_407950_1Pacearchaeota VEWYLDSSYKPAKDELIALFYFEPASGISVNEAMGRIASESSTGTW----TTLFTMPPKL MAHAYSREGSFVKVAYPLDLWEEGNAPQLLSGIAGNIFGMKALDNLRLIDIQFPKAYIKH YPGPTHGMHGIRKMMNVYNRPMLGAVPKPKIGFSDVEHAQIGFETWMGGFDFVKDDENLT STKFNNFDKRVRLLAKLRDKAEHATGDIKDAYINITAETHQMEKRAKILHNYGFKHAMID VVVTGYSGVQTLRNTDYGMAIHAHRAMHAVFDRNPKHGMSMYMLAKLMRMIGVDQIHAGT AVGKLT------------------------------------------------------ ------------GTA---------------------------------------- >RifSed_csp2_13ft_1_scaffold_36185_8Pacearchaeota VEWYLDLGYHPKKTDLKVLYYFEPK-NVSKNDAVGRIASESSTGTWTTLYQIP-KRMREL MAIAYKI-GN-AYVAYPLELWDAGNLPQLLSGIAGNIFGIKSLKNLRLIDVSFPEEYIKN FKGPNLGISGLRKYFKIYNRPIVGAVPKPKIGFSAEEHAKIGFETWLGGFDLVKDDENLT SQKFNDFNKRVKLLSKKRDLAEKLTGEVKDALINVTSETKEMEKRAKIVHGYGFKYVMID VVVSGFSAVQTLRNVDYGMAIHAHRAMHAAFDRNPKHGITMQFLAKTMRLVGVDEIHSGT SVGKLAGGREEVLSISNTLRKI-LLEQDWGKIKPAFPVASGGLHPGLVPDVMDIYGRDFV LLVSGGIHGHPQGTKAGAKATMQAIESVDKKISLEEYAKTHKELMEALKKWGRM- >CG10_big_fil_rev_8_21_14_0.10_scaffold_31956_2Micrarchaeota IEWYSEDGYKPKQDELVALFYFEPE-GVSAKEAAGRIASESSTGTWTTLFKMP-ARMRSL MATAYGI-GN-VRVAYPLDLWEAGNAAQLLSGIAGNIFGMKALKNLRLVDVSFPKEYLHG FPGPALGISGIRKALRVPKRPITGAVPKPKVGFSAAEHAGIAYETWMGGFDLVKDDENLS SQPFNRFNDRVNYMTKARDRAERETGERKDALINITAETEEMKRRAKLLHDNGWRYAMID VVVAGTAAVQTMRETDYGMAIHAHRAMHAAFTRNPKHGMTMLFLAKLMRMVGVDEIHAGT AVGKLEGKRADITHIADTLRNA-VLSQDWGNIKPAFPVASGGLHPGLVPDVMRLYGNECV ILVSGGIHGHPRGTRAGAKAAMQAIEATMEGVTLDEAAKTNVELAQALEKWGHL- >rifcsphigho2_01_scaffold_85698_3Pacearchaeota IEWYHDQSYKPKSTDLKALFYFEPT-GITKEDAIGRIASESSTGTWTTLFKLP-PRMKNL MATAYKV-GN-VSVAYPLELWEPGNSPQLLSGIAGNIFGMKALKNLRLVDVDFPQAYFKG FPGPRHGISGIRKFMKVYDRPLTGAVPKPKIGFSHQEHADIAFETWMGGFDITKDDENLS SLKFNDFYKRTLLMTKLRNKAEQLTGEKKDAFLNITAETKEMQKRAKFLHDHGWNYAMID VVVSGTAAVQTLRETDYGMAIHAHRAMHASFDRNPKHGMSMQFIAKLMRMIGVDQIHSGT AVGKLVGSKEEVRCVADVLRNI-LVNQDWGNIKSGFPVTSGGLHPGLVPDVMKIYGNEMV MLVSGGIHGHPKGTRAGAKASMQAIEATQQNISLEQYAKTHIELRQSLEKWGRM- >rifcsplowo2_01_scaffold_40614_6Pacearchaeota VEWYSSYNYKPKKTDLIALFYFESS-GMTAKEAVGRIASESSTGTWTTLFKMP-KRMEKL KATAFSV-KN-VKIAYPLDLWEKGNMPQLLSGAAGNIFGMKALKNLRLIDISFPKEYLKG FKGPNQGISGLRKFFKQNKRPLTGAVPKPKVGFSAKEHSEIAFETWTGGFDLTKDDENLT SQSFNKFDERVRRMTKMRDKAEKITGEKKDALLNITGETDEMKRRAKLLHDNGWNYAMID VVVAGTAAVQTLRNTDYGMAIHAHRAMHASFDRNLKHGITMQYLAKMMRIVGVDQIHSGT AVGKLISPRIEVETIANLMRPH-NLAQDFYNIKQGFPVASGGLHPGLVPDVMNILGNECV LLVSGGIHGHPKGTKSGAMAAMQAIEASMDGVSLYEKAKTSLQLKQALEKWGLM- >rifcsplowo2_01_sub10_scaffold_13483_3Pacearchaeota VEWYLDFSYKPKKSDLVVLFYVEPK-GVSVKEAIGRIASESSTGTWTTLFKLP-QRMRGL MATAFEI-GN-VKVAYPLELWEPGNAPQLLSGIAGNIFGMKALKNLRLVDATLPKRYIRS FSGPRQGIEGIRRFMKVKDRPLVGAVPKPKIGFSASEHAQIGFETWMGGFDFVKDDENLS STSFNNFYRRVELMAKLRDKAEKETGHIKDAFINITAETKEMMKRAKFLHDHGWRYAMID VVVAGTAAVQTLREVDYGMAIHAHRAMHASFDRNPKHGMSMPFLAKLMRLVGVDQIHSGT AVGKLVGDVEEVQCNANVLRRM-LLEQDWGRIKPAFPVASGGLHPGLIPEELKIFGKECV LLVSGGIHGHPQGTRAGAAAAMQAIEATLEGVSLEEAAKVNGELRSALGKWGRM- >rifcsphigho2_01_scaffold_319644_1Pacearchaeota ------------------------------------------------------------ ---------------------------------------------LK------------H FPGPTKGIEGIRKMMKVKDRPLTGAVPKPKIGFSAKEHAEIAYETWMGGFDLTKDDENLS STSFNNFYKRVDLMTKLRDKVERETGQVKDALLNITAETDEMKKRAKYLHDRDWKYAMID VVVTGHAAVQTLRETDYGMAIHAHRAMHATFDKNSKHGLTMNFLAKMMRMIGVDQIHSGT AVGKLVGTRHEVTDIATTLRFPHLLNQDWLHIKPAFPVASGGLHPGLIPDELKIFGKECT LLVSGGIHGHPRGTRAGAMAAMQAIEATMDGMSLTEAAKSNVELKEALDKWGFMH >rifcsplowo2_01_scaffold_159415_2Pacearchaeota VEWYKDESYKPNKNDIVVLFYFEPK-GITKEEAIGRIASESSTGTWTTLFKMP-KRMKSL MATAYEV-GN-VKVAYPYDLWEPGNAPQLLSGIAGNIFGMKALKNLRLVDATFPKSYLQS FKGPNHGIDGIRKFMKVKKRPLTGAVPKPKIGFSAKEHAEIAFETWMGGFDLTKDDENLS STKFNNFFERVKLMTKLRDKAEKETGEVKDALINITAETEEMKRRAKVLHDHGWKYAMID VVVAGNASVQTLRDTDYGMAIHAHRAMHASFDRNLKHGLTMQFLAKMMRMIGVDEIHCGT ALGKLVSPKHEVINIADTLRHH-LMPQEWGHIKPAFPVASGGLHPGLVPGVMDILGNECV ILVSGGIHGHPKGTRAGAKASMQAIDAKMNNISLPEYAKNHVELKEALNKWGFM- >rifcsplowo2_01_scaffold_401896_1Pacearchaeota --------YKPSNTDIVVLFYFEPTKGI-TKEAIGRIASESSTGTW----TTLFRMPKSL MATAFEVDGNYVKVAYPLDLWEPGNAPQLLSGIAGNIFGMKALDNLRLIDVTLPKEYIKH FKGPTQGIEGIRKMMKVKKRPLTGAVPKPKIGFSAKEHAEIAFETWMGGFDLTKDDENLS STSFNNFYERVRLMTKLRDRAEKETGEQKDALINITAETKEMMKRAKFLNDHGWRYAMID VVVTGHAAVQTLRDTDYKMAIHAHRAMHATFDRNEKHGLTMQFLAKMMRLIGVDQIHSGT AVGKLVSPRKEVEVIAETMRRHNLMYQDWGKIKPGFPVASGGLHPGLVPEVMEILGNECT MLVSGGIHGHPKGTRAGAKATMQAIEATM-------------------------- >rifcsphigho2_01_sub10_scaffold_18411_1Pacearchaeota TEWYLDKHYRPSKTDLVALFYFEPK-GVSKEEAIGRIASESSTGTWTTLFKLP-PRMKKL MATSFEV-GN-VKVAYPLDLWEKGNAPQLLSGVAGNIFGMKALKNLRLIDVSFPKEYIQN FKGPNLGISGLRKYFKVYDRPLTGAVPKPKVGFSYKEHADIAFETWMGGFDLVKDDENLS SQKFNPFSKRVKLMAKQRDKAEKLTGEVKDALLNITAETKEMERRAKLLHNYGWKYAMID VVVTGTSAVQTLRDTDYKMSIHAHRAMHASFDKNPKHGITMQFLAKLMRLIGVDQIHAGT AVGKLVGTKHEVQNIADILRNQ-ILYQDWDNIKPAFPVSSGGLHPGLVPDVMRIFGKDMI LLVSGGIHGHPKGTRAGAKAAIQAIEATNKKLSLEEYAGTHKELQQALEKWGHL- >RifSed_csp2_10ft_1_scaffold_178317_3Pacearchaeota VEWYLDENYKPSKSDLIALFYFEPD-GISREEAIGRIASESSTGTWTTLFKMP-PRMKKL MATAFEV-GN-VKVAYPLDLWESGNFPQLLSGAAGNIFGMKALDNLRLIDISLPKEYIKH FKGPSNGIQGLRKYFKVYDRPLTGAVPKPKVGFSAKEHADIGFETWMGGFDIVKDDENLT SQKFNRFEERVKLMAKLRYKAEKLTGEKKDAFINITAETEQMKKRAKMLHNYGWKYAMID VVVSGHAAVQTMRDVDYKMAIHAHRAMHASFDKNPKHGITMQFLAKSMRLIGVDGIHCGT AVGKLVGTRHEVENIADTLRSV-ILNQNWHDIKPAFPVASVG------------------ ------------------------------------------------------- >RifSed_csp2_10ft_3_scaffold_196980_2Pacearchaeota IEWYLDYNYKPSKSDLIALFYFEPS-GISKEEAMGRIASESSTGTWTTLFKMP-PRMKKL QAIAYEV-GN-LKVAYPLDLWEKGNFPQLLSGIAGNIFGMKALDNLRLIDVSFPKEYVQS FKGPSLGIKGLQSYFKAKNRPLTGAVPKPKLGFSAKEHADIGFETWMGGFDLVKDDENLT SQKFNHFEERVKLMAKLRNKAEKLTGEKKDALINITAETEEMKKRAKILNNYGWKYAMID VVVAGTAAVQTMRNVDYKMAIHAHRAMHASFDKNPKHGITMQFLAKSMRLIGVDQIHGGT AVGKLTGNKHEVQTIAETLRFS-LLNQNWHNIKPAFPVASGGLHPGLVPDVLKILGTDCG LLVSGGIHGHPRGTRAGAKATMQAIEAFKERISLEEKAKSSIELKQALDKWGHL- >RifSed_csp2_16ft_3_scaffold_46608_6Pacearchaeota VEWYLDENYKPSKTDLIALFYFEPSEGI-SKEAIGRIASESSTGTW----TTLFKMPDKI MATAFEIDGNFVKVAYPLELWELGNFPQLLSGIAGNIFGMKALDNLRLVDVSFPKEYVKS FKGPSLGIKGLQAYFKAKNRPITGAVPKPKLGFSAKEHADIGFETWMGGFDLVKDDENLT SQKFNRFEERVKLMTKLRDKAEKLTGEKKDALINITSETDEMKKRAKILHNYGWKYAMID VVVSGHAAVQTMRNVDYKMAIHAHRAMHATFDKNPKHGITMQFLAKSMRLIGVDQIHSGT AVGKLTGTSHEVRDIATTLRKFHLLNQNWHHIKPAFPVASGGLHPGLVPDVLKLLGTDCC LLVSGGIHGHPKGTRAGAKATMEAIDAFKKGISLEEKAKTSIELRQALEKWGHLH >rifcsphigho2_02_scaffold_16976_10Pacearchaeota VEWYLDKNYKPSKTDLQVLFYFEPA-GVSKEEAIGRIASESSTGTWTTLFKLP-PRMKNL MATAFEV-GN-VKVAYPLDLWEPGNAPQLLSGIAGNIFGMKAINNLRLVDVSLPKEYIKH FPGPTHGISGLRRIMKVNNRPMTGAVPKPKIGFSAAEHARIGFETWMGGFDLVKDDENLS SIKFNNFYERVKLMTKLRDKAEKLTGEQKDALINITAETDEMKKRAKFLHDYGWRYAMID VVVSGTSAVQTLRETDYKMAIHAHRAMHAAFDRSPKHGITMEFLAKLMRLVGVDQIHAGT AVGKLVGNRHEVLDITHILRNR-ILNQDWGNIRSAFPVSSGGLHPGLVPDVMNIFGNDCV ILVSGGIHGHPRGTRAGAKAAMQAIEATQEKISLEDYAKNHKELREALDKWGHL- >rifcsphigho2_02_scaffold_27808_2Pacearchaeota VEWYLDENYKPAKTDLVVLFYFEPAEGVSVKEAVGRIASESSTGTW----TTLYKLPGKL MATAFEIDGNFVKVAYPLDLWELGNAPQLLSGIAGNIYGMKALKNLRLVDASFPLKYSGS FKGPFYGINGLRKLMGVKGRPFTGAVPKPKIGFSAKEHADIGFETWMGGFDLVKDDENLT STKFNNFYERVKLMTKLRDKAEKLTGDSKDALINITAETDEMKKRARVLHNHGWRYAMID VVVTGTGAVETMRDTDYKMAIHAHRAMHAAFDKNSKHGMTMSFLAKMMRLIGVDQIHAGT AVGKLVGSRHEVMDIADVLRRHHLLSHDWGNIKPAFPVASGGLHPGLVPDEIKIFGNDMV LLVSGGIHGHPRGTRAGAKATMQAIEATQEGISLEEKAKKSKELREALEKWGHMR >rifcsphigho2_12_scaffold_224551_2Pacearchaeota VEWYLDNNYKPSKTDLVCLFYFEPK-GISKEEAIGRIASESSTGTWTTLFKLP-PRMKKL QATGFEV-GN-VKVAYPLDLWEKGNAPQLLSGIAGNIFGMKALNNLRLIDVSFPKEYLNA FKGPKHGTKGLRKLFKVNKRPLTGAVPKPKIGFSAAEHADIAFQTWTGGFDLTKDDENLT STKFNNFNKRVELMTRLRDKAEKETGEVKDALLNITGETNEMIKRAKLLHDNGWKYAMID VVVAGTASVQTLRNVDYGMAIHAHRAMHASFDRNPKHGVSMQFLAKLMRIIGVDQIHSGT AVGKLVGDKKEVLSIAETLRGF-LLNQDWRSIKPAFPVSSGGLHPGLVPDVMNLFGNEFV LLVSGGIHGHPKGTKAGAIATMQAIEATLDKITLEEKAKTSTELKQALEKWGRL- >gwa2_scaffold_43928_8Pacearchaeota VEWYLELDYKPAKDDLKVLFYFEPSKGI-SPEAAGRIASESSTGTW----TTLFTMPKAL EAKVFEIQGNYCKVAYPLDLWEKGNAPQLLSGIAGNIYGMKALENLRLVDASFPKEYLKG FKGPNLGIKGLRKYFGVKKRPLTGAVPKPKIGFSAKEHARIGYETWVGGFDLVKDDENLT STSFNKFEERVKLMTKLREKAEKETGSVKDALLNITAPVKIMQKRAKLLHENGWKYAMID VVTTGTSAVQEMREVDYKMAIHAHRAMHAAFDKNPKHGITMQFLAKLHRIIGVDQIHAGT AVGKLVGDRHEVHNTAEVLRKGHLLEQDWQGIKPVFPVSSGGLHPGLVPEVMRILGNETV ILVSGGIHGHPKGTRAGAKAAMQAIEAEMQGESLTEKAKSSVELRQALEKWGRMK >rifcsphigho2_02_scaffold_110943_6Pacearchaeota VEWYLELDYKPAKDDLKVLFYFEPSKGISPKEAAGRIASESSTGTWT------------- --TLF------TMLAYPLDLWEKGNAPQLLSGIAGNIYGMKALENLRLVDASFPKEYLKG FKGPNLGIKGLRKYFGVKKRPLTGAVPKPKIGFSAKEHARIGYETWVGGFDLVKDDENLT STSFNKFEERVKLMTKLREKAEKETGSVKDALLNITAPVKIMQKRAKLLHENGWKYAMID VVTTGTSAVQEMREVDYKMAIHAHRAMHAAFDKNPKHGITMQFLAKLHRIIGVDQIHAGT AVGKFVGDRHEVHNTAEVLRKGHLLEQDWQGIKPVFPVSSGGLHPGLVPEVMRILGNETV ILVSGGIHGHPKGTRAGAKAAMQAIEAEMQGESLTEKAKSSVELRQALEKWGRMK >rifcsplowo2_12_scaffold_1860_42Pacearchaeota VEWYHDLKYRPKKTDLKVLFYFEPSIGI-TKDAIGRIASESSTGTW----TTLHTLPKQI MAVAHKIEGNYVHIAYPIELWELGNAPQLLSGIAGNIFGMKALKNLRLIDVSLPENYLKS FRGPNLGIHGLRRYFRIYDRPLTGAVPKPKLGFSAEEHARMGMETWLGGFDLVKDDENLT SQSFNNFYKRVRLMSKMRDKAESETGEIKDALINITAETGEMKRRAKYLYDYGFKYAMID VVASGVSSVQTLRETDYKMAIHAHRAMHASFDKNPRHGITMQFLAKLMRMIGVEEIHSGT GVGKLVGSVDELKAVSSVLRKGFLLEQNWHKTKPAFPVSSGGLHPGLVPEELKIYGKEFV LLVSGGIHGHPQGTRAGPWKLQE------RIFPSKNIPKKNSE-----RHWKNGE >rifoxyc1_full_scaffold_38709_3Pacearchaeota VEWYHDLKYRPKKTDLKVLFYFEPI-GITKDDAIGRIASESSTGTWTTLHTLP-KRMKQI MAVAHKI-GN-VHIAYPIELWELGNAPQLLSGIAGNIFGMKALKNLRLIDVSLPENYLKS FRGPNLGIHGLRRYFRIYDRPLTGAVPKPKLGFSAEEHARMGMETWLGGFDLVKDDENLT SQSFNNFYKRVRLMSKMRDKAESETGEIKDALINITAETGEMKRRAKYLYDYGFKYAMID VVASGVSSVQTLRETDYKMAIHAHRAMHASFDKNPRHGITMQFLAKLMRMIGVEEIHSGT GVGKLVGSVDELKAVSSVLRGF-LLEQNWHKTKPAFPVSSGGLHPGLVPEELKIYGKEFV LLVSGGIHGHPQGTRAGAKAVIQALGPWKLQERIFPSKNIPKATKNSERHWKNG- >rifcsphigho2_02_scaffold_15332_15Pacearchaeota VEWYHDLKYRPKKTDLKVLFYFEPSIGI-TKDAIGRIASESSTGTW----TTLHTLPKQI MAVAHKIEGNYVHIAYPIELWELGNAPQLLSGIAGNIFGMKALKNLRLIDVSLPENYLKS FRGPNLGIHGLRRYFRIYDRPLTGAVPKPKLGFSAEEHARMGMETWLGGFDLVKDDENLT SQSFNNFYKRVRLMSKMRDKAESETGEIKDALINITAETGEMKRRAKYLYDYGFKYAMID VVASGVSSVQTLRETDYKMAIHAHRAMHASFDKNPRHGITMQFLAKLMRMIGVEEIHSGT GVGKLVGSVDELKAVSSVLRKGFLLEQNWHKTKPAFPVSSGGLHPGLVPEELKIYGKEFV LLVSGGIHGHPQGTRAGAKAVIQALEATRKNISLEEYSKSNKELREALEKWGRMK >rifoxyd1_full_scaffold_4719_16Pacearchaeota VEWYHDLKYRPKKTDLKVLFYFEPSIGI-TKDAIGRIASESSTGTW----TTLHTLPKQI MAVAHKIEGNYVHIAYPIELWELGNAPQLLSGIAGNIFGMKALKNLRLIDVSLPENYLKS FRGPNLGIHGLRRYFRIYDRPLTGAVPKPKLGFSAEEHARMGMETWLGGFDLVKDDENLT SQSFNNFYKRVRLMSKMRDKAESETGEIKDALINITAETGEMKRRAKYLYDYGFKYAMID VVASGVSSVQTLRETDYKMAIHAHRAMHASFDKNPRHGITMQFLAKLMRMIGVEEIHSGT GVGKLVGSVDELKAVSSVLRKGFLLEQNWHKTKPAFPVSSGGLHPGLVPEELKIYGKEFV LLVSGGIHGHPQGTRAGAKAVIQALEATRKNISLEEYSKSNKELREALEKWGRMK >gwc1_scaffold_4584_2Pacearchaeota ---------------LKVLFYFEPSIGI-TKDAIGRIASESSTGTW----TTLHTLPKQI MAVAHKIEGNYVHIAYPIELWELGNAPQLLSGIAGNIFGMKALKNLRLIDVSLPENYLKS FRGPNLGIHGLRRYFRIYDRPLTGAVPKPKLGFSAEEHARMGMETWLGGFDLVKDDENLT SQSFNNFYKRVRLMSKMRDKAESETGEIKDALINITAETGEMKRRAKYLYDYGFKYAMID VVASGVSSVQTLRETDYKMAIHAHRAMHASFDKNPRHGITMQFLAKLMRMIGVEEIHSGT GVGKLVGSVDELKAVSSVLRKGFLLEQNWHKTKPAFPVSSGAL----------------- -----------EATRK-------NI-------SLEEYSKSNKELREALEKWGRMK >CG10_big_fil_rev_8_21_14_0.10_scaffold_8652_c_5Pacearchaeota IEWYHDKSYKPKRTDLKVLYYFEPR-KTSKEDAVGRIASESSTGTWTTLHTMP-KRMKSL MATAYKI-GN-VHVAYPLELWEKESMPQLLSGIAGNIFGMKALNNLRLIDASLPKEYVKE FKGPSLGISGLRKYFKVYERPLTGAVPKPKVGFSSDEHAKIGFETWMGGFDLVKDDENLT SQKFNNFYKRVKLMAKMRDKAEKMTGEVKDALLNITSETKEMEKRAKFVHNHGFKYAMID VVTCGTASVQTLRETDYKMAIHAHRAMHAAFDREPKHGMTMHFLAKIMKLIGVDEIHTGT GVGKLVGTREEIKALADMLREL-MLEQDWGHIKPAFPVSSGGLHPGLVPDEIDIYGEDVV LLVSGGIHGHPKGTRAGAQAVMQALEATKRKITLEEYGKTHKELREALEKWGRL- >rifcsphigho2_01_scaffold_125633_2Pacearchaeota VEWYHDLKYKPSKTELEVLFYFEPT-GITRDDAIGRIASESSTGTWTTLFKLP-PRMKNL MATAYSV-GN-VKVAYPLDLWEPGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPREYIKS FKGPALGIEGLRRYFKVYDRPLTGAVPKPKIGFSAEEHAKIGYETWLGGFELVKDDENLS STKFNNFYKRVDLLTKLRDKAEKETGNMKDALLNITAETEEMKRRAKHLHNKGWKYAMID VVTCGTSAVQTLREVDYDMAIHAHRAMHASFDRNPRHGISMQFLAKIMRLIGVEQIHSGT AVGKLVGSREEVKSIANTLRNI-LLEQNWQNIKPAFPVSSGGVHPGLIPDEIDIYGKDFV LLVSGGIHGHPRGTRAGAMATMQAIEATNRGVSLEEFSKKNKELREALEKWGRL- >rifoxyc1_full_scaffold_14429_1Pacearchaeota VEWYHSRKYKPKKTDLKVLYYFEPK-NTSKEDAIGRIASESSTGTWTTLFKIP-KRMKSL MATAYKT-GN-VYIAYPLELWEKGSVPQLLSGIAGNIFGMKALNNLRLVDASLPGDYIKH FKGPNLGIKGLRKYFKIYDRPLTGAVPKPKVGFSSEEHARIGYETWLGGFDLVKDDENLT SQSFNNFYKRVKLMTKLRDKAERETGKVKDALLNITSETREMQKRAKFLHDYGWKYAMID VVTSGTSAVQTLRETDYDMAIHAHRAMHASFDRNPRHGITMQFLAKIMKLIGVEEIHSGT AVGKLVGSKEEVNAVADVLRGL-LLEQDWGKIKPAFPVSSGGLHPGLIPDEIGIYGKDVV LLVSGGIHGHPRGTRSGAQAVMQAIQAVKKKTPLEEYPRTNKELKEALEKWGRL- >RifSed_csp1_19ft_2_scaffold_187283_2Pacearchaeota VEWYEELNYKPKRSDLKVLFYFEPDKGISIKEAVGRIASESSTGTWTTLF-TMPPRMKNL MATAYKIEGNYVYVAYPIELWEKGNAPQLLSGIAGNIFGMKALKNLRLVDVSLPEEYIKS FSGPNLGINGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGYETWLGGFDLVKDDENLS SQSFNSFQKRVELMTKLRDKAEKETGEVKDALLNITAETREMEKRAKLLHDKGWKYAMID VVVCGDSAVQTLRKTDYGMAIHAHRAMHASYDRNPKHGMSMQFLAKRMRMIGVDQIHSGT AVGKLVGSKEEVQSIASVLRKGFLLEQSWGNTKPAFPVSSGGLHPGLVPDEMNIYGKDVA LLVSGGIHGHPRGTRAGAKATMQALEATKRKIKLEEYAKTHVELREALGKWGRMN >RifSed_csp2_10ft_3_scaffold_492694_2Pacearchaeota --------------------L--------------------------------------- ----------------------------------NN---------LRLVDVSLPKEYIKD YKGPALGIKGLRKYFKVYDRPLTGAVPKPKVGFSSEEHAKIGFETWLGGFDLVKDDENLT SQSFNNFYKRVKLMGKLRDKAEKITGNVKDALLNITAETNEMEKRAKFLHDHGWKYAMID VVTCGTAAVQTLREIDYGMAIHAHRAMHASFDRNPKHGISMQFLAKVMRLIGVEQIHSGT GVGKLVGTRDEVRAIADVLRKGLLLEQDWNGIKPSFPVSSGGLHPGLIPDELEIYGKDVV LLVSGGIHGHPHGTRKGAMAAMQAIDAVREKTSIEEYAKTHMELKEALMKWGRLR >RifSed_csp1_13ft_1_scaffold_35671_4Pacearchaeota VEWYDELSYKPKRTDLRVLFYFEPDSGISVKEAIGRIASESSTGTWTTLF-KLPPGMKKM MAIAYKIEGNLVHVAYPLELWEMGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPSEYLRG FKGPNLGIEGIRKYFKVYDRPLTGAVPKPKLGFSSEEHAKIGYETWIGGFDLVKDDENLT NQTFNNFNKRVALMTKLRDKAEKETGNVKDALINITAETKEMEKRAKILHDNDWKYAMID VVTCGTSSVQTLRKTDYGMAIHAHRAMHASFDRNQKHGISMQFLAKIMRVIGVEQIHSGT AVGKLVGSKEELSAISSTLRKGFLLEQSWNNVKPAFPVSSGGLHPGLIPDEMEIYGKQFV MLVSGGIHGHPRGTKAGAMAAMQAIEAVNKKVSLEEFAKKNKELKEALGKWGRLK >rifcsplowo2_01_scaffold_178776_2Pacearchaeota IDWYDELSYKPKKTDLRVLFYFEPNKGISVKEAVGRIASESSTGTWTTLF-TLPPRMKSL MATAYKIDGNLVHIAYPLELWEKGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPREYLNG FKGPSLGINGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAGIGFETWLGGFDLVKDDENLT SQSFNNFYKRVSLMTKLRDKAEKETGEVKDALINITAETKEMERRAKVLHDNGWKYAMID VVTCGTSAVQTLRNTDYGMAIHAHRAMHASFDRNPK------------------------ ------------------------------------------------------------ ------------------------------------------------------- >rifcsplowo2_01_scaffold_246509_2Pacearchaeota VEWYDELNYKPSKTDLKVLFYFEPATGMSVKEAIGRIASESSTGTW----TTLFKIPGKL MATAYKIEGNYIHVAYPLELWEKGNMPQLLSGIAGNIFGMKALKNLRIVDVSLPKEYLRG FRGPNLGINGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGYETWLGGFDLVKDDENLT SQSFNNFYKRVSLMTKLRDKAEKETGEVKDALLNITAETKEMEKRAKFLHDKGWKYAMID VVVCGTAATQTLRKTDYGMAIHAHRAMHASFDKNTKHGISMQFLAKTMRTIGVDQIHSGT AVGKLVGSREEVTSIASTLRKGFLLEQNWHGIKPAFPVSSGGVHPGLIPDEMDIYGKDFV LLVSGGIHGHPKGTRAGAKASMQAIEATKKKISLEEYAKTNKELREALEKWGRLK >rifcsplowo2_01_scaffold_102233_3Pacearchaeota VEWYDELSYKPKKTDLKVLFYFEPEKGMSVKEAIGRIASESSTGTW----TTLFTLPSGL MATAYKIEGNYVYVAYPLELWETGNMPQLLSGIAGNIFGMKALKNLRIVDVSLPKEYLRG FSGPNLGIGGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGYETWLGGFDLVKDDENLT SQGFNNFYKRVSLMTKLRDRAEKETGEVKDALLNITAETKEMEKRAKFLHEHGWKYAMID VVVCGASAVQTLRKTDYGMAIHAHRAMHASFDRNPKHGISMQFLAKTMRTIGVDQIHSGT AVGKLVGSKEEVTAIADTLRKGFLLNQDWGKIKPAFPVSSGGIHPGLIPDEMNIYGKDFV LLVSGGIHGHPRGTRAGAMATMQAIESTNKGISLEQFAKSNKELREALEKWGRLK >rifcsphigho2_01_scaffold_173637_10Pacearchaeota VEWYDELSYKPKKTDLKVLFYFEPEKGMSVKEAIGRIASESSTGTW----TTLFTLPSGL MATAYKIEGNFVYVAYPLGLWEAGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPKEYLKG FQGPNLGIQGLRKYFKVYDRPLTGAVPKPKVGFSAED----------------------- ---FNNFYKRVSLMTKLRDRAEKETGEVKDALLNITAETKEMEKRAKFLHEHGWKYAMID VVVCGTAATQTLRKTDYGMAIHAHRAMHASFDRNPRHGISMQFLAKTMRMIGVDQIHSGT AVGKLVGSREEVTAIADTLRKGFLLEQNWGNTKPAFPVSSGGVHPGLIPDEMDIYGKDIV LLVSGGIHGHPHGTRAGAKATMQALEATKKKISLEEFAKKNKELREALEKWGRLK >rifcsphigho2_12_scaffold_1445_63Pacearchaeota VEWYDELSYKPKKTDLKVLFYFEPEKGMSVKEAIGRIASESSTGTWTTLF-TLPSRMKGL MATAYKIEGNFVYVAYPLGLWEAGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPKEYLKG FQGPNLGIQGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGYETWLGGFDLVKDDENLT SQSFNNFYKRVSLMTKLRDRAEKETGEVKDALLNITAETKEMEKRAKFLHEHGWKYAMID VVVCGTAATQTLRKTDYGMAIHAHRAMHASFDRNPRHGISMQFLAKTMRMIGVDQIHSGT AVGKLVGSREEVTAIADTLRKGFLLEQNWGNTKPAFPVSSGGVHPGLIPDEMDIYGKDIV LLVSGGIHGHPHGTRAGAKATMQALEATKKKISLEEFAKKNKELREALEKWGRLK >GWB1_scaffold_16004_5Pacearchaeota IEWYHDLKYKPKRTDLKVLFYFESAKGI-SRDAIGRIASESSTGTW----TTLFKMPQSL MATAYKTEGNYVHVAYPLDLWERGNMPQLLSGIAGNIFGMKALDNLRLVDASLPEEYIKG FKGPNVGIEGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGYETWIGGFDLVKDDENLT SQSFNNFYKRVKLMTKLRDKAEKETGEVKDALLNITGETEEMKKRAKFLHNNGWKYAMID VVTCGAAAVQTMRNVDYDMAIHAHRAMHASFDKNPKHGITMQFLAKTMKLIGVDQIHSGT AVGKLVGTKAEVQAISSVLRKGFLLEQSWGKIKPAFPVSSGGLHPGLVPDEIDIYGKDVV LLVSGGIHGHPKGTRAGATAVMQALEATKKHTSLEAYAKSHVELKQALEKWGRMK >gwa1_scaffold_15687_9Pacearchaeota ---------------------LTL------------------------------------ ----------------------------------------------------L------- --CPR-------------NR--------------------------IGGFDLVKDDENLT SQSFNNFYKRVKLMTKLRDKAEKETGEVKDALLNITGETEEMKKRAKFLHNNGWKYAMID VVTCGAAAVQTMRNVDYDMAIHAHRAMHASFDKNPKHGITMQFLAKTMKLIGVDQIHSGT AVGKLVGTKAEVQAISSVLRKGFLLEQSWGKIKPAFPVSSGGLHPGLVPDEIDIYGKDVV LLVSGGIHGHPKGTRAGATAVMQALEATKKHTSLEAYAKSHVELKQALEKWGRMK >CG10_big_fil_rev_8_21_14_0.10_scaffold_2738_15Pacearchaeota VEWYHDRKYKPKKTDLKALFYFEPA-GISKDDAIGRIASESSTGTWTTLYELP-KRMKEI MAIAYKV-DN-VHIAYPLGLWEKGNIPQLLSGIAGNIFGMKAIKNLRLIDVSLPDEYIRS FKGPNLGIAGLRKYFKVYDRPLLGAVPKPKVGFDAEEHAKIGFETWMGGFDCVKDDENLT STNFNNFYKRVGFMSKMRDRAEKITGEIKDAFINITSETEEMKKRAKALHNYGFKYAMID VVTAGTASVQTMRNVDYGMAIHAHRAMHSAFDRNEKHGITMQFLAKIMKLAGVDQIHSGT AVGKLVGDKEEVVSIANVLRKI-ILEQDWGNIKPAFPVTSGGLHPGLVPDVMDIYGKEMV MLVSGGIHGHPNGTRAGAIATRQALEATKKKISLEEYAKTHKELKQALEKWGRL- >rifcsphigho2_02_scaffold_8518_20Pacearchaeota VEWYHDLKYKPKKTDLKVLFYFEPR-GITKQDAIGRVASESSTGTWTTLFKLP-EMMKKL MATAYKI-GN-VYVAYPLDLWEKGNLPQLLSGIAGNIFGMKALKNLRLIDASLPEEYIKN YKGPNLGIEGLRKYFRVYDRPLTGAVPKPKVGFSSEEHAKIGYETWLGGFDLVKDDENLT SQSFNNFYKRVKLMTKLRDKAEKETGEVKDALLNITAETKEMEKRAKFLHDYGWKYAMID VVVSGTSATQTLREKDYDMAIHAHRAMHASFDRNLKHGISMQFLAKIMRLIGVEQIHSGT GVGKLVGSISEVKSISSTLRGF-LLEQNWKNIKSSFPVSSGGVHPGIVPDEINIYGKDFV LLVSGGIHGHPQGTRAGAKATMQAIEAVKKKISLEEYSKNHNELAQALEKWGRL- >ncbi_ASMP01000003.1_19Nanoarchaeota IEWYHDLKYKPRKTDLKALYYFEPK-GITKEDAIGRVASESSTGTWTTLALIP-ARMKNL MATAYKV-GN-VYIAYPLELWEKGNAPQLLSGIAGNIFGMKALDNLRLIDVSLPEEYIRS FQGPNLGIQGLRKYFKVYHRPLTGAVPKPKLGFNAEEHAKIGFETWLGGFDLVKDDENLT SQSFNNFYKRVDLMTKMRDRAEKETGEVKDALINITAETEEMKKRAKHLHDKGWKYAMID VVVAGTAAVQTLRETDLGMAIHAHRAMHASFDRNSRHGMTMQFLAKIMKLVGVDQIHSGT AIGKLVGSKEEVLSIANVLRKV-LLEQNWHGIKPAFPVSSGGMHPGIIPEEIDIYGKDVV LLISGGIHGHPKGPRAGARAAMQALEATRKKISLEEFSKTHTELREALEKWGRL- >RifSed_csp1_19ft_1_scaffold_43_1Pacearchaeota IEWYHDLNYKPRKTDLKVLYYFEPAKGI-TRDAIGRIASESSTGTW----TTLFKLPPKI MATAYKVQGNFVHIAYPLELWEKGNAPQLLSGIAGNIFGMKALNNLRLIDVSLPEEYIRG FQGPNLGISGLRKYFKVYDRPLTGAVPKPKLGFSAEEHAQIGFETWLGGFDLVKDDENLT SQSFNNFDKRVELMGKARDKAERETGQVKDALINITSETKEMERRAKTLHQHGFKYAMID VVTCGTASVQTLRDVDYGMAIHAHRAMHAAFDRNPKHGITMQFLAKIMKLIGVDQIHSGT AVGKLVGSKEEVLSIADVLRKKILLEQDWHGIKPAFPVSSGGVHPGIIPDEMDIYGKDFV LLVSGGIHGHPRGTRAGAMASMQALEATRKKISLEEYAKMHRELREALEKWGRLN >RifSed_csp2_13ft_2_scaffold_701908_1Pacearchaeota VEWYRDLKYRPAKNDLNVLFYFEPNAGV-TRDAIGRIASESSTGTWTTLF-TMPPRMKKL MAAAYRIDGNFVHVAYPFDLWEKGNMPQLLSGIAGNIFGMKALKNLRLVDVSLPRDYIKN FKGPNLGISGLRKYFKVYNRPLTGAVPKPKIGFSAEEHAQIGFETWMGGFDCVKDDENLT STNFNNFYKRVEFMAKLRDKAEKMTGEVKDAFINITGEVEEMKKRAKFLHNHGFKYAMID VVTCGSASVQTMRNVDFGMAIHAHRAMHASFDRNPKHGMTMEFIAKIMKLIGVDQIHSGT SVGKLVGSREEVLSVANLLR---------TRVKPYK-----------------NIG-DEE KNLPRGIRENPC------------------------------------------- >rifcsphigho2_01_scaffold_293991_4Pacearchaeota ---------------------------------------ESSTGTW----TTLFKLPENL MATAYKIEGNFVHVAYPIYLWEKGNLPQLLSGIAGNIFGMKALDNLRLVDVSFPLDYIKY FPGPNLGIHGLRKYFKVYNRPITGAVPKPKIGFSALEHAQIGFETWMGGFDCVKDDENLT STKFNNFYKRVELLAKMRDKAEKETGEIKDAFINITGEVEEMKKRAKFLHNHGFKYAMID IVTCGAASVQTLRETDYGMAIHAHRAMHAAFDRNEKHGITMEFLAKIMKLAGVDQIHSGT AVGKLVGSREEVLSISNILRKKILLEQNWGKIKPAFPVTSGGLHPGLVPDIMEIYGKETV MLVSGGIHGHPKGTRAGAKATMQAIEAVNKKISLEEQSKNNTELKQALEKWGRMR >rifcsphigho2_02_scaffold_248456_1Pacearchaeota VEWYRELRYKPKKNDLKVLFYFQPN-GI-TRDAVGRIASESSTGTWTTLF-KLPERMKNL MATAYKIEGN-VHVAYPIYLWEKGNLPQLLSGIAGNIFGMKALDNLRLVDVSFPLDYIKY FPGPNLGIHGLRKYFKIYNRPITGAVPKPKIGFSALEHAQIGFETWMGGFDCVKDDENLT STKFNNFYKRVELLAKMRDKAEKETGEIKDAFINITGEVEEMKKRAKFLHNHGFKYAMID VVTCGPASVQTLRETDYGMAIHAHRAMHAAFDRNPKHGITMEFLAKIMKLAGVDQIHSGT AVGKLVGSREEVLSISNVLR---------------------------------------- -----------E-----------------NNV----------------------- >rifcsphigho2_01_scaffold_257738_1Pacearchaeota VEWYHELKYKPKKSDLKVLFYFQPR-GISQEDAIGRIASESSTGTWTTLFKLP-ARMKSL MATAYKI-GN-VHVAYPIDLWEKGNEPQLLSGIAGNIFGMKALDNLRLIDVSLPQEYIKY FPGPNLGISGLRKYFKVYNRPLTGAVPKPKIGFSAEEHAQIGFETWLGGFDCVKDDENLT STKFNEFYKRARLMSKMRDKAEKETGEIKDAFINITGEVEEMKRRARFLHEHDFKYAMID VVTCGVASVQTMREYDLKMAIHAHRAMHAAFDRNEKHGITMEFLAKIMKLAGVDQIHS-- ------------------------------------------------------------ ------------------------------------------------------- >rifcsplowo2_01_scaffold_331279_1Pacearchaeota VEWYHDLKYKPK-RNLKVLFYFQPK-GITKEDAIGRIASESSTGTWTTLF-KIPARMKKL MATAYKIEGN-VHVAYPFDLWEKKNEPQLLSGIAGNIFGMKALENLRLIDVSLPHDYIKY FPGPNLGISGLRKYFKIHDRPLTGAVPKPKIGFSAEEHSRIAFETWLGGFDCVKDDENLT STKFNEFYHRVNLMSKMRDKAEKETGEIKDAFINITGEVEEMKRRAKFLHEHGFKYAMID VVTCGVSAVQTLRETDMGMAIHAHRAMHAAFDRNEKHGITMEFLAKIMKLAGVDQIHS-- ------------------------------------------------------------ ------------------------------------------------------- >RifSed_csp1_19ft_4_scaffold_33260_6Pacearchaeota VEWYHDLKYKPKPDELKVLYYFEPDKGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKVDGNYVHVAYPLDLWERGNAPQLLSGIAGNIFGMKALKNLRLVDVSLPSEYLRS FPGPNLGIEGLRKYFKVYNRPLTGAVPKPKVGFDANEHAKIGYETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDKAEKETGDVKDAFINITAETKEMEKRAKMLHNHGFKYAMID VVVAGTSAVQTLRETDYGMAIHAHRAMHASFDKNEKHGISMWFLAKMMRMIGVDEIHGGT AVGKLVGGKHEVLDIANVLRERHLLEQNWENIKPAFPATSGGLHPGLVPDILKILGKDCV LLVSGGIHGHPKGTRAGAKATMQAINATNNGVSLEEYAKYHPELKQALDKWGRLR >RifSed_csp1_19ft_1_scaffold_47266_3Pacearchaeota VEWYHDLKYRPKQDELKVLYYFEPDKGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKVDGNYVHVAYPLDLWEKGNAPQLLSGIAGNIFGMKALKNLRLVDASLPSEYLRS FPGPNLGIEGLRKYFKVYSRPLTGAVPKPKVGFDASEHAKIGYETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDRAEKETGKVKDAFINITAETKEMEKRAKMLHNHGFKYAMID VVVAGTSAVQTLRETDYGMAIHAHRAMHASFDKNEKHGISMWFLAKMMRMIGVDEIHGGT AVGKLVGGRHEVLDIANVLRERHLLEQHWGNIKSAFPATSGGLHPGLVPDILKILGKDCV LLVSGGIHGHPKGTSAGAKATMQAINATNSGVSLEEYAKYHPELKQALEKWGRLK >RifSed_csp1_19ft_3_scaffold_56327_3Pacearchaeota VEWYHDLKYRPKQDELKVLYYFEPDKGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKVDGN---YAYPLDLWEKGNAPQLLSGIAGNIFGMKALKNLRLVDASLPSEYLRS FPGPNLGIEGLRKYFKVYSRPLTGAVPKPKVGFDASEHAKIGYETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDRAEKETGKVKDAFINITAETKEMEKRAKMLHNHGFKYAMID VVVAGTSAVQTLRETDYGMAIHAHRAMHASFDKNEKHGISMWFLAKMMRMIGVDEIHGGT AVGKLVGGRHEVLDIANVLRERHLLEQHWGNIKSAFPATSGGLHPGLVPDILKILGKDCV LLVSGGIHGHPKGTRAGAKATMQAINATNNGVSLEEYAKYHPELKQALEKWGRLK >RifSed_csp2_16ft_2_scaffold_361294_1Pacearchaeota ------------------------------------------------------------ ------------------------------------------------------------ -------------------------VPKPKVGFDANEHAKIGYETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDRAEKETGEIKDAFINITAETEEMKKRAKALHNHGFKYAMID VVVAGTSAVQTLRETDYDMAIHAHRAMHASFDKNEKHGISMWFLAKIMRMIGVDEIHGGT AVGKLVGGRHEVLDIANVLREKHLLEQNWEKIKPAFPVTSGGLHPGLIPDVLGILGKDCV LLVSGGIHGHPKGTRAGAKATMQAINATNNGVSLEEYAKYHLELRQALEKWGRLK >RifSed_csp1_19ft_4_scaffold_32161_5Pacearchaeota VEWYHDLKYKPKADELKVLYYFEPAAGE-TKDAIGRIASESSTGTW----TTLFKLPPSL MATAYKVDGNYVHVAYPMDLWEKGNAPQLLSGIAGNIFGMKALKNLRLVDVSLPREYIKS FPGPNLGIDGLRKYFKVYNRPLTGAVPKPKVGFNAEEHAKIGFETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDRAEKETGEVKDAFINITAETKEMEKRAKVLHNSGFKYAMID VVVAGTSAVQTLRQTDYGMAIHAHRAMHASFDKNEKHGISMWFLAKIMRMIGVDEIHGGT AVGKLVGGRHEVLDIANVLREKHLLEQDWGNVKPAFPATSGGLHPGLVPDILKILGKDCV LLVSGGIHGHPKGTRAGAKATMQALNATNNGVSLEEYAKYHPELKV---KW---- >RifSed_csp1_16ft_1_scaffold_25993_3Pacearchaeota VEWYHDLKYKPK-ADLKVLYYFEPA-GETKEDAIGRIASESSTGTWTTLF-KLPPRMKSL MATAYKVDGN-VHVAYPMDLWEKGNAPQLLSGIAGNIFGMKALKNLRLVDVSLPREYIKS FPGPNLGIDGLRKYFKVYNRPLTGAVPKPKVGFNAEEHAKIGFETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDRAEKETGEVKDAFINITAETKEMEKRAKVLHNSGFKYAMID VVVAGTSAVQTLRQTDYGMAIHAHRAMHASFDKNEKHGIS-------------------- ------------------------------------------------------------ ------------------KATMQALNATNNGVSLEEYAKYHPELRQALEKWGRLK >GWB1_scaffold_7630_9Pacearchaeota VEWYHDLKYKPK-ADLKVLYYFEPA-GETKEDAIGRIASESSTGTWTTLF-KLPPRMKSL MATAYKVDGN-VHVAYPMDLWEKGNAPQLLSGIAGNIFGMKALKNLRLVDVSLPREYLRS FPGPNLGIDGLRKYFKVYNRPLTGAVPKPKVGFNAEEHAKIGFETWMGGFDVVKDDENLT SQSFNNFYKRVSLMSKMRDRAEKETGEVKDAFINITAETKEMEKRAKVLHNSGFKYAMID VVVAGTSAVQTLRQTDYGMAIHAHRAMHASFDKNEKHGISMWFLAKIMRMIGVDEIHGGT AVGKLVGGRHEVLDIANVLR---LLEQDWGNVKPAFPATSGGLHPGLVPDILKILGKDCV LLVSGGIHGHPKGTRAGAKATMQALNATNNGVSLEEYAKYHPELRQALEKWGRLK >GWB1_scaffold_2416_28Pacearchaeota IEWYHDLKYKPKKTDLKTLFYFEPR-GITKEDAIGRIASESSTGTWTTLFKLP-PRMKKL MATAYKI-GN-VHVAYPFELWEKGNMPQLLSGIAGNIFGMKALDNLRLVDVSLPQEYIKH FPGPNLGIPGIRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGFETWMGGFDCVKDDENLT SQSFNNFYRRVKLMAKMRDKAEKLTGEIKDAFINITAETEEMKKRAKALHNHGFKYAMID VVTCGEA---------------------------------MEFMAKIMRLIGVDQIHSGT AVGKLVGSKEEVLSVANILRKV-LLEQDWGKIRPALPVTSGGLHPGLVPDVMDIYGKDLV LLVSGGIHGHPRGTRAGAEAVMQALEATKKKISLEEYAKTHRELREALEKWGRL- >gwa1_scaffold_1688_10Pacearchaeota IEWYHDLKYKPKASDLKVLFYFEPAKDI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKINGNWVHVAYPLDLWERGNMPQLLSGIAGNIFGMKALNNLRLVDVSLPGEYIRH FPGPNLGIQGLRKYFKVYDRPLTGAVPKPKVGFSAEEHAKIGYETWLGGFDCVKDDENLT SQTFNNFNKRVSLMAKMRDKAERETGEIKDAFINITAETEEMKKRAKILHNHGFKYAMID VVTCGEASDQTLRETDYGMAIHAHRAMHASFDRNPKHGITMQFLAKIMRLIGVDQIHSGT AVGKLVGSKEEVLAISNILRKNVLLEQNWGKIKPAFPVTSGGLHPGLVPDIIDIYGKDVV LLVSGGIHGHPRGTRAGAKATMQAIEATKKKISLEEYAKTHAELKQALEKWGRLR >RifSed_csp2_16ft_3_scaffold_83149_1Pacearchaeota --------------------R-----GI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKINGNWVHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALDNLRLVDISLPQEYIKH FPGPNLGIQGLRKYFKIYDRPLTGAVPKPKVGFSAKEHAQIGFETWMGGFDCVKDDENLT SQSFNNFNKRVSLMAKLRDKAEKETGEVKDAFINITAETEEMKKRAKILHNHGFKYAMID VVTCGEAADQTLRETDYDMAIHAHRAMHASFDKNDKHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGRIKPAFPVSSGGLHPGLVPDELKIYGNDVV LLVSGGIHGHPKGTIAGAKATMQALDATKKKISLEEYAKTHTELREALEKWGRMK >gwa1_scaffold_1760_30Pacearchaeota ---------------------LER-----YS----------------------------- ---------------------ER----------------QKALNNLRLIDVSLPQEYIKH FPGPNLGIPGLRKYFKIYNRPLTGAVPKPKVGFSAEEHAKIGYETWIGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDRAEKETGEVKDAFINITAETEEMKKRAKILHNYGFKYAMID VVTCGEASDQTLRETDYRMAIHAHRAMHASFDKNEKHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGKIKPAFPVSSGGLHPGLVPDELEIYGNDVV LLVSGGIHGHPRGTRAGAKATMQALDATKQKISLEEYAKTHTELREALGKWGRLK >19ft_2_nophage_noknown_scaffold_9330_1Pacearchaeota IEWYHDLKYKPKANDLKVLFFFEPARGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKINGNWVHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALNNLRLIDVSLPQEYIKH FPGPNLGIPGLRKYFKIYNRPFTGAVPKPKVGFSAKEHAQIGFETWMGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDRAEKETGEVKDAFINITAETEEMKKRAKILHNHGFKYAMID VVTCGEASDQTLRETDYGMAIHAHRAMHASFDKNEKHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGKIKPAFPVSSGGLHPGLVPDELEIYGNDVV LLVSGGIHGHPKGTRAGAKATMQAL------------------------------ >16ft_4_scaffold_39691_1Pacearchaeota --------MKKL---MATAYK--------------------INGNW-------------- -----------VHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALNNLRLIDVSLPQEYIKH FPGPNLGIPGLRKYFKIYNRPLTGAVPKPKVGFSAEEHAKIGYETWIGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDKAEKETGEVKDAFINITAETDEMKKRAKILHNHGFKYAMID VVTCGEASDQTLRETDMGMAIHAHRAMHASFDKNEKHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGKIKPAFPVSSGGLHPGLVPDELEIYGNDVV LLVSGGIHGHPKGTRAGAKATMQALDATKQKISLEEYAKTHTELREALGKWGRLK >RifSed_csp1_16ft_4_scaffold_172579_3Pacearchaeota IEWYHDLKYKPKANDLKVLFYFEPSRGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKINGNWVHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALNNLRLIDVSLPQEYIKH FPGPNLGIPGLRKYFKIYNRPLTGAVPKPKVGFSAEEHAKIGYETWIGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDKAEKETGEVKDAFINITAETDEMKKRAKILHNHGFKYAMID VVTCGEASDQTLRETDMGMAIHAHRAMHASFDKNEKHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGKIKPAFPVSSGGLHPGLVPDELEIYGNDVV LLVSGGIHGHRHGTRAGAKATMQALDATKQKISLEEYAKTHTEA---LGKWGRLK >rifcsp_13ft_1_scaffold_503499_2Pacearchaeota IEWYHDLKYKPKANDLKVLFFFEX----------XXXXXXXXXXXX-------XKLPPKL MATAYKINGNWVHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALNNLRLVDVSLPQEYIKH FPGPNLGIQGLRKYFKVYDRPLTGAVPKPKVGFSAGEHAKIGYETWLGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDKAEKETGEVKDAFINITAETDEMKKRAKILHNHGFKYAMID VVTCGEASDQTLRETDMGMAIHAHRAMHASFDKNERHGITMQFLAKIMRLIGVDQIHSGT GVGKLV------------------------------------------------------ ------------GTT---------------------------------------- >RifSed_csp2_13ft_2_scaffold_695229_1Pacearchaeota ------------------------------------------------------------ ------------------------------------------------------------ ------GIQGLRKYFKVYDRPLTGAVPKPKVGFSAGEHAKIGYETWLGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDKAEKATGEIKDAFINITAETDEMKKRAKILHNHGFKYAMID VVTCGEASDQTLREFDMGMAIHAHRAMHASFDKNERHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGKIKPAFPVSSGGLHPGLVPDELKIYGNDVV LLVSGGIHGHPRGTRAGAKATMQALDATKQKISLEEYAKTHTELREALGKWGRFK >RifSed_csp1_16ft_3_scaffold_247731_2Pacearchaeota IEWYHDLKYKPKANDLKVLFYFEPSRGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKINGNWVHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALNNLRLVDVSLPQEYIKH FPGPNLGIQGLRKYFKVYDRPLTGAVPKPKVGFSAGEHAKIGYETWLGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDKAEKETGEVKDAFINITAETDEMKKRAKILHNHGFKYAMID VVTCGEASDQTLREFDMGMAIHAHRAMHASF----------------------------- ------------------------------------------------------------ ------------------------------------------------------- >rifcsp2_19_4_full_scaffold_35462_2Pacearchaeota IEWYHDLKYKPKANDLKVLFFFEPARGI-TKDAIGRIASESSTGTW----TTLFKLPPKL MATAYKINGNWVHVAYPLDLWEKGNMPQLLSGIAGNIFGMKALNNLRLVDVSLPQEYIKH FPGPNLGIQGLRKYFKVYDRPLTGAVPKPKVGFSAGEHAKIGYETWLGGFDCVKDDENLT SQSFNNFNKRVSLMAKFRDKAEKETGEVKDAFINITAETDEMKKRAKILHNHGFKYAMID VVTCGEASDQTLRETDMGMAIHAHRAMHASFDKNEKHGITMQFLAKIMRLIGVDQIHSGT GVGKLVGTTHEVKDIANTLRSHHLLEQNWGKIKPAFPVSSGGLHPGLVPDELEIYGNDVV LLVSGGIHGHPKGTRAGAKATMQALDATKQKISLEEYAKTHTELREALGKWGRLK >rifcsplowo2_01_scaffold_211_122Aenigmarchaeota YDWYLDLNYKPSKTDVVCEFRVEPK-GFSMKEAAGRVASESSAGTWTTLFNLP-KRVRKI MAIAFDI-GN-VKVAYPIELWEPGNAPQLLSGIAGNIFGMKALKGLRLMDVSLPKEYLKH FKGPSHGVKGIRKMMKIKKRPITGAVPKPKIGYSAAEHTKIGYETWMGGFDIVKDDENLT STSFNKFEDRAKMMAKARERAEKLTGERKSAFLNITGETKEMIRKATLLHDLGWEFAMID VVTCGTAAVHTLRDVDLNLAIHAHRAMHAAFDRTPNHGMTMYFLGKIMRMIGVDEIHVGT AVGKLVGTRYEVKYVADMLR-K-ILEQKWYHIKPTLPVSSGGLHPGLIPEVMKILGTECA LLVSGGIHGHSKGTRQGAKAAMQAIDATMRGISLKDYAKKHVELKLALEKWGTA- >RifSed_csp2_13ft_2_scaffold_441897_1Levybacteria KDWYLDLNYKPSKTDLVCLFRVEPK-GITMKEAAGRVASESSAGTWTTLY-RLPPRLKKI MARVFEIKGN-VKVAYPIDLWEPGNAPQLLSGIAGNIFGMKALSSLRLVDVSLPKTYLKN FKGPSYGIDGYRKILRVKKRPITGAVPKPKIGYSAAEHAKIGFETWMGGFDIVKDDENLT STSFNKFENRVRLLSKMRDKAEKLTGERKSAFLNITHETKEMIKRARMLKNFGWEFAMVD VVTCGTSAVQTIRDEDLGLAIHAHRAMHASFDRSHVHGITMQFIAKLMRMIGVDSIHV-- ------------------------------------------------------------ ------------------------------------------------------- >rifcsplowo2_01_scaffold_17881_1Micrarchaeota IDWYDEFHYKPAKDDLVCLYYFEPE-GISAKEAIGRIASESSSGTWTTLYKLP-ARVAKI KARAFVV-GN-VKVAYPIDLWEEGNAPQLLSGIAGNIFGMKALKNLRLMDVTLPHAYLKH FKGPSLGIHGIRKKMRVAKRPLTGAVPKPKIGFSAQEHADIAFETWMGGFDIVKDDENLT TTNFNRFEDRVKLMTRLRDKAEKETGECKDAFLNITGETNEMIRRAKLLYDSGWRFAMID VVTAGTAAVQTLRDVELGLAIHAHRAMHAAFDRNPKHGMTMYFLAKLMRLIGADEIHVGT AVGKLVGTAQEVKSIADMLR-L-NLAQDWGNHKPMLPVSSGGLHPGLVPAVMKIFGNDLT LLVSGGIHGHPKGTRAGAKAAMQAIEATMNGVSLDEYAKKHVELAQALEKWGYY- >KCZ70371_Candidatus_Methanoperedens_nitroreducen_REFreference IDWYDEFSYTLAKDDLVCLYYFEPK-GISATEAIGRIASESSAGTWTTLNKLP-DRVSKI KARAFEL-SN-VKVAYPIELWEPGNAPQLLSGIAGNIFGMRALRNLRLVDASLPEDYIKH FRGPNFGIQGIRSLLRIKKRPVTGAVPKPKIGFSAEEHAQIAFETWMGGFDLVKDDENLT STSFNRFEDRVERMAKLRDKAQEQTGEQKDALLNITGETNEMIRKARLLHDSGFRFAMID VVTCGTAAVQTLRKEDLGIAIHAHRAMHAAFDRNPRHGISMYFLAKLMRLIGVDEIHVGT AVGKLVGSREEVIQIANMLR-M--LSQEWGLIKPMLPVSSGGLHPGLVPSVMRILGNDCT LLVSGGIHGHPEGTRAGAKAVMQAIEASMEGIDLREYAKKNKELQQALDKWGYL- >RBG_16_scaffold_853_33Levybacteria IDWYDELGYTPTKNDLICLYYFEPAKGMTAKEAAGRIASESSAGTW----TTLHKLPEKI KAQAFQIDGKYVKVAYPLDLWEPGNAPQLLSGIAGNIFGMKALANLKLIDASLPKKYIKS FKGPHQGIKGIRDLLKVKKRPVTGAVPKPKIGFSAAEHAKVAYETWMGGFDLVKDDENLT SPSFNRFEDRVKRMAKQKDRAERATGDQKDALLNITGETNLMIQRAKFLHDSGFRFAMID VVTCGTSAVQTLRDEDLGLAIHAHRAMHAAFDRNPRHGISMYFLAKLMRLIGVDEIHVGT AIGKLVGTRSEVIEIADMLR---------------------------------------- -----------SSRVKSSTMLEQ-------------------------------- >RifSed_csp2_13ft_1_scaffold_420248_2Levybacteria IDWYDELRYTPAKNDLICLYYFEPARGM-NAEAAGRIASESSAGTW----TTLHKLPEKI KAQAFQIDGKYVKVAYPLDLWEPGNAPQLLSGIAGNIFGMKALANLKLIDASLPKKYIKS FKGPHQGIKGIRDLLKVKKRPVTGAVPKPKIGFSASEHAKVAYETWMGGFDLVKDDENLT SPSFNRFEDRVKRMAKQKDRAERATGDQKDALLNITGETNVMIQRAKLLLESGFRFAMID VVTCGTSAVQTLRDEDLGLAIHAHRAMHAAFDRNPRHGISMYFLAKLMRLIGVDEIHVGT AIGKLVGTRSEVIEI-------------------------------LVPTVMKILGNDCT LLVSGGIHGHPKGTRAGACATMQAIEATLDDIDLKEYARDHKELQQALDKWEYFK >RifSed_csp2_19ft_2_scaffold_258162_1Micrarchaeota SDWYLDLNYKPDNNDLICLFRAEPR-GFSMKEVAGRVASESSVGTWTKLYRLP-RRIKSL MARVFEIKGN-IKVAYPPELFEPGNMPQIFSSIAGNIFGMKALNNLRLEDVEWPKKIMKS FEGPQFGIRGLRKKFKVHNRPLLATVPKPKLGMATAEYIKVAGQIWKGGMDFVKNDENMT SQNFVEFYETTRKIFELRDKVERQTGEKKMYLPNVSAETKEMIKRAEFVAENEGEIV--- --------------------------------RNPRHGVSMLVVADASRLIGVDTIHIGG -MGKLVSPAEEVYILKEEVEGH-VLQENWYNIKPVFPVTSGGLHPGILPRLIKLLGKELI LQVGGGVLGHPSGPLAGGRAVRQAVEATLHGIALKKYAKNHPELKQALELWGTK- >RIFCSPHIGHO2_01_FULL_OD1_39_14_rifcsphigho2_01_scaffold_71123_4Buchananbacteria MPGYEGLRYKPSEDDLIVDFTIEPK-GVSMKLAAGAVAGESSVGTWTELTTMR-KHIDAI KARCFEIKAG-IRVAYPSILFEAGNMPQIWSSIAGNIFGMKAVKNIRLESAEWPKTIRDS FPGPKFGIKGVRDILKVYDRPITASVPKPKIGMTTAEHAHTAYEIWMGGFDLVKDDENLS SQSFNRFKDRVIESMKMRDKAEQETGERKSYLINITAETREMLARAKFVRDFGNEYVMMD ILTAGWAGLQTVRDEDLNLAIHAHRAFHAAFTRNKKHGASMKFVAETARLLGVDQLHIGT VIGKLESPKEEVFGLNERLK------ENWGRIKDVLPVCSGGLHPGLLPPLIKWLSKDIA IQVGGGIHGHPQGTRAGAKAVMAAIQAGIEGESLTEAAKKSKELAVAIKKWGYY- >rifcsplowo2_01_scaffold_39879_1Pacearchaeota VSQYLHLSYTPSRSDLICVFCVRPK-GVSFKEAAGRVAAESSNGTWTELTTLK-PHIDRL RARCFKISGD-IWVAYPLELFELGNMPQIWSSVAGNIFGMKALSGLRLEDIQFPAALLKS FPGPQFGISGVRKLMNIKNRPLTATVPKPKIGMTTAEHADVLFNSWLGGIDFGKDDENLT SQVFNKFEPRVKTCLRLRDKAEKITGERKSYFINITADSRTMEKRAKFVANCGGEYVMVD ICTAGWAGLQHVRDVDYKLAIHAHRAFHAAFDRNPLHGLSMLTLAKCARLVGVDNLHIGT VVGKLVGSLSEVQRLHAEVAAH-LLGQNWGSVKSVLPTSSGGLHPGLVPAVMHILGPDIC LQAGGGIHGHPSGSLAGARALRQAIDATLHHIPLATFARTHSELASALFQWGVK- >rifcsplowo2_01_scaffold_94273_4Pacearchaeota MSQYLDYKYHPSKDDLICLFRVEPR-GMSFDEAVGRVAAESSNGTWTTLTTIN-ERIRRI RARAFDEKKK-VKTAYPIELFELGSMPQLWSSICGNIFGMKAVKNLRLLDADFPEIYIKS FSGPQFGLGGVRKFMKIPTRPMIATVPKPKVGMTTEEHAKVGYEAWVGGVDFLKDDENLT NQKFNRFDNRVKLCAKMRDKAEKETGDKKDYFVNVTSETNEMLKRTRLAHNYGFKYVMID MLTAGWAGLESLRQQDTKQAIHSHRAMHATFTRNPLHGISMLFVAKCAKLVGVDNIHIGT VIGKLVSPKSEVMALEHEIETG-VLEEHWFGMKPIIPCSSGGLHPGLVPYVLKLLGKDCL LQLGGGIHGHPHGTREGAMALRQAIDAYVKKENLEEYSKENKELAIALKTWGTV- >rifcsplowo2_01_scaffold_193789_1Pacearchaeota --------------------L--------------------------------------- -----------------------GNMPQIISSIAGNIFGMKGVNNLRLEDVKWPKQIIKS FRGPQFGVEGIRKFMKIKERPLLGSVPKPKVGMNTKEHCNTAYDIWYGGLDLVKDDENLS NQKFNRFEKRLKGCMKIRDKVEKEIGERKSYLINITSETREMIKRAKLVKDYGNEFVMVD ILTAGFSGFQTIRNEDLKLAIHAHRAFHSTFTRNPRHGVSMLVVADIARLIGADNLHIGT VFGKLISPEEEVINLEDEIQGKNRLSEDWYDKKRMFAVSSGGLYPSLIPKVIKILGKDII LQVGGGVHGHRYGTRAGARAVRQIVDATMEGINLKEYSKNNNELKVALRQWG--- >rifcsplowo2_01_scaffold_160892_4Pacearchaeota VNPYLDLNYKPRNSDLICLFRIEPAKGISLKEAAGMVASESSNGTW----TELTTLKEKM RGRVFSIRSDYVKIAYPIDLFELGNMPQIISSIAGNIFGMK------------------- --G------------------------------NTKEHCNTAYDIWYGGLDLLKDDENLS NQKFNRFEKRLNGCMKIRDRVEKETGERKSYLINITSETKEMIKRAKLVKDYGNEFVMVD ILTAGFSGFQTIRNEDLKLAIHAHRAFHSTFTRNPRHGVSMLVVADVARLIGADNLHIGT VFGKLVSPEEEVINLEDEIQGKNRLSENWYDKKRMFAVSSGGLYPSLIPKIIKILGKDII LQVGGGVHGHRYGTRAGARAVRQIVDATMKGINLKEYSKDNNELRVALRQWG--- >rifoxyd1_full_scaffold_56120_1Pacearchaeota ---------------------IEPAKGISLKEAAGMVASESSNGTWTEL-TTLKEHIRKM RGRVFSIRSDYVKIAYPIDLFELGNMPQIISSIAGNIFGMKGVNNLRLEDVKWPKQIIKS FRGPQFGVEGIRKFMKIKERPLLGSVPKPKVGMNTKEHCNTAYEIWYGGLDLVKDDENLS NQKFNRFEKRLNGCMKIRDRVEKETGERKSYLINITSETKEMIKRAKLVKDYGNEFVMVD ILTAGFSGFQTIRNEDLKLAIHAHRAFHSTFTRNPRHGVSMLVVADVARLIGADNLHIGT VFGKLVSPEEEVINLEDEIQGKNRLSENWYDKKRMFAVSSGGLYPSLIPKIIKILGKDII LQVGGGVHGHRYGTRAGARAVRQIVDATMKGINL--------------------- >CG10_big_fil_rev_8_21_14_0.10_scaffold_107_143Pacearchaeota MSQYLDLHYKPKKSDLIVLFRVEPK-GMSKREAIGRIAAESSNGTWTTLSTLK-PHIRKI RARAYEFKGP-VKVAYPLELFELGSIPQLMSSVAGNIFGMKAVDNLRIIDIQFPEKYIKS FKGPQFGIEKIRKYMHIKDRPMVATVPKPKVGLTTKEHTKVIYNSWLGGVDFAKDDENLT SQNFNKFSDRVKEAAKARDKAEKETGEKKDYFINVTAETKEMLKRAKIVKDHNFKYIMAD ILTIGWSGLQTLRDEDLKLAIHAHRAFHAAIDRNPKHGMTMLSLAKLARLIGVDNIHIGT VIGKLVGDKKEVLEIREGIV----LPQNWHNLKDTIPVSSGGLHPGLIPQIIKMLGKGVV LQAGGGIHGHPKGSMAGAKSIRQSIDASLKGVSLREYAKTHKELDEAINKWKME- >rifoxya1_full_scaffold_286_38Woesearchaeota MSQYLDLKYKPSSSDLVVLFRIEPAKGI-SKEAIGRVAAESSNGTW----TTLSTLKSKI RARAFEFDGNYVKVAYPIELFELGSIPQLMSSVAGNIFGMKAINNLRLEDIQFSKEYIKS FRGPQYGIEGIRKYMKIKERPMIATVPKPKVGLTTKEHTKVIFDSWVGGVDFAKDDENLT SQNFNKFENRVKAAAKARDHAEKITGEKKDYFINVSAETKEMLRRTKLANEYDFKYVMCD ILTVGWSGLQTLREEDLKQAIHAHRAFHAAFDRNLKHGMSMLTLAKLSRLIGVDNIHIGT VLGKLVGTKEEVIDIEKEIVKENILEQNWYNMRDVIPVSSGGLHPGLVPNIIDLLGKDIV IQLGGGIHGHPKGSKYGAMALRQAIDAKLNGISLENYSKNNHELKLALSKWGTSK >rifcsphigho2_01_scaffold_52038_9Pacearchaeota MSHYLDLKYKPSKDDLVCLFRFEPGKGISSKEAIGRIAAESSNGTWSETEFGAKEHIRNI RGRAFHISGDLVYVAYPLDLFELGSMPQLFSSVGGNIFGMKAMNNLRFEDIYFSEKYLKS FRGPQFGINGVRNFMKVKKRPMIACVPKPKVGMYTEEHADTAYKFWIGGGDFLKDDENLT DQKFNRFDARVKLCSKMRDKAEKETGEKKDYFINITAETNEMLRRAKVAYDNNFKYIMVD IVTAGWSGLQTVREFDHKLAIHAHRAMHATFTRNPKHGISMLTLAKCARLVGVDNIHIGT AVGKLVSPKEEVMAIASEITGINMLKQKWCGVKDTIPVSSGGLHPGILPFVMKMLGNDCV IQAGGGIHGHPGGTMEGSKAVRQAIEATLGGVKLKDYAKNKHELRLALEKWGELK >tara_45732_108Nanoarchaeota MSQYLDLHYKPSRSDLICKFRFESK-NISIKEAVGRIASESSNGTWTTLSTLK-PHIRKI RARAFEIKKP-VKITYPLELFELGNIPQLLSSIAGNIFGMKALKNLRLEDIQFSKKYVSS FKGPQFGIQGIRKLTQVKERPLIATVPKPKVGMTTKEFANVAYRLWTGGVDFVKTDENMT SQPFVNFYKTTREVLKMRDKAEKETGERKFFLANVTAETNEMVKRAHFVKKCGGEFIMVD FLTAGFSGFQTLRNEKLKLAIHLHRALHGSMTRNPKHGISMLTLAKLARLIGGDTLHIGT VIGKLVGKREDVINLKDSLN------RSLYHIKPTLPVSSGGLHPGIIPYILKMLGKDIM VQSGGGVLGNPLGVEAGAKALRQSIEATLQKVTLKIYSQKHKELSAALKKWGTS- >tara_37404_15Pacearchaeota MSQYLDPNYKPKSKDLICLFYLEPK-GISANEAIGRIAAESSNGTWTELTTLK-PHIRKI RARAYHRKGK-VKVAYPIELFEKGSMPQIYSSVVGNIFGMKAVNNLKLLDIDFPKTMIKS FPGPQFGLHGVRRFMKIPKRPMTATVPKPKVGMTTAEHTNVGYQAWMGGVDFLKDDENLT DQVFNRFRNRVKTCAKARDRAEKLTGEKKDYFINVTAETKEMLRRANIAADYDFKYVMCD IVTAGWSGLQSLREHDNKQAIHAHRAMHATFTRNPKHGLTMLALAKSARLVGVDNIHIGT VIGKLVSPRNEVIALEQEMEQG-ILQQSWHNLKPTIPCSSGGLHTGIVPDVLKLLGNDCL LQLGGGIHGHPQGTRYGAESLRQAIDASLEGISLKEAAQENKALAHALEHFGHE- >tara_83453_5Pacearchaeota MSQYLDYKYKPKPGDLVALYKIEPSKKISFNEAAGRVAAESSNGTWTEL-TTLKPHIRKI RARAFSKSGDLYKIAYPSDLFELGSMPQIYSALAGNIFGMKAVKNLRLLDINFPDKMIRS FKGPQFGLHGVRKFMKVSKRPLTATVPKPKVGMTTSEHAKVGLDSWLGGIDFLKDDENLT NQSFNKFQARAKLCAKMRDRAESQTGEKKDYFINVTAETNEMLKRAKIAKSFDFKYIMCD IVTTGWAGVQSLREFDSKQAIHAHRAMHSTFTRDLKHGVTMLTLAKSARLVGVDNIHIGT VVGKLVSPKDEVLELNKAMK------SSLSHIKSVIPTSSGGLHPGIVPDVIELLGSDCL LQLGGGIHGHPKGSKAGAAALRQAIDATLSKESLKEHARSNKELAQALEHFGKQH >rifcsphigho2_02_scaffold_53546_4Pacearchaeota MSQYLDLKYKPKKTDLICLFRFEPAPGISVEEAIGRIASESSNGTWTEL-TTLKEHIRKI RARAFHISGDLVKIAYPIELFELGSMPQIYSSVVGNIFGMKALKNLRIEDIEYPEIMLKS FLGPQFGISGVRKFMKVPKRPLTATVPKPKVGMTTEEHAKVGYDAWIGGVDFLKDDENLT DQKFNRFEARAKLCAKMRDKAEKFTGEKKDYFINVSAETNEMIRRAKFAQSYGFKYVMCD IVTVGWAALQTLRNHDSKQAIHAHRAMHATFTRNPKHGVSMQVLADSARLVGVDNIHIGT VVGKLVSPKDEVMTLENEMRREGTLHQDWHNIKSVLPVSSGGLHPGLVPDVLKLLGNNCA LQLGGGLSGHPKGIKEGAKAFRQAIDAFMEKKTLEDYAKKNRALAIALKHFGHSR >rifoxyd1_full_scaffold_47708_2Pacearchaeota VNPYLDLNYKPRNSDLICLFRIEPAKGINLKEAAGMVASESSNGTW----TELTTLKEKI RARAFHISGDLVKIAYPIELFELGSMPQIYSSVVGNIFGMKALKNLRIEDIEYPEIMLKS FLGPQFGISGVRKFMKVPKRPLTATVPKPKVGMTTEEHAKVGYDAWIGGVDFLKDDENLT DQKFNRFEARAKLCAKMRDKAEKFTGEKKDYFINVSAETNEMIRRAKFAQSYGFKYVMCD IVTVGWAGLQTLRNHDSKQAIHAHRAMHATFTRNPKHGVSMQVLADSARLVGVDNIHIGT VVGKLVSPKDEVMTLENEMRREGTLHQDWHNIKSVLPVSSG------------------- ------------------------------------------------------- >CG_2015-18_scaffold_146281_1Pacearchaeota --------------------LFNP------------------------------------ ----------------DLDAIER-------------DFIVNTLKGLN-----L------- --GKSHKI----------SRPWAKRLM-PAL---------RAKYILSG--DYKKDDENLT DQKFNRFKARAKACAKMRDKAEKKTGEIKDYFINVTAESKEMLKRAKIAKNYGFKYVMCD IVTAGWSGLQTLREHDSKQAIHAHRAMHATFTRNPKHGISMLTLAKSARLVGVDNIHIGT VIGKLVGTKDEVLNLEREMEKEGILEEDWKRIKSVFPCSSGGLHPGILPEIMDMMGKNIM VQLGGGIHGHPDGTKSGAMATRQAIDAYIRKVKIKEATLIYPELTRALNKWGHEK >CG_2015-09_scaffold_50799_3Pacearchaeota ----MK--YKPKKDDLICLFRIEP-NGLSFNDAIGRVAAESSNGTWTTLSTLK-PHIRKI RGRAFYRKGNLVKIAYPSELFELGNMAQVYSAIAGNIFGMKAVDNLRLLDIDFPDMMMKS FRGPQFGIEGVRKFMKVKGRPLTATVPKPKVGMTTREHAKVGYDAWMGGIDFLKDDENLT DQKFNRFKARAKACAKMRDKAEKKTGEIKDYFINVTAESKEMLKRAKIAKNYGFKYVMCD IVTAGWSGLQTLREHDSKQAIHAHRAMHATFTRNPKHGISMLTLAKSARLVGVDNIHIGT VIGKLVGTKDEVLNLEREMEEG-ILEEDWKRIKSVFPCSSGGLHPGILPEIMDMMGKNIM VQLGGGIHGHPDGTKSGDIQPGQIVEGDVKDIDVRVEYRGSTAQQKVQYPFGFT- >cg1_0.2_scaffold_5501_c_7Pacearchaeota MSQYLDMKYKPKKDDLICLFRIEP-NGLSFNDAIGRVAAESSNGTWTTL-STLKPHIRKI RGRAFYRKGNLVKIAYPSELFELGNMAQVYSAIAGNIFGMKAVDNLRLLDIDFPDMMMKS FRGPQFGIEGVRKFMKVKGRPLTATVPKPKVGMTTREHAKVGYDAWMGGIDFLKDDENLT DQKFNRFKARAKACAKMRDKAEKKTGEIKDYFINVTAESKEMLKRAKIAKNYGFKYVMCD IVTAGWSGLQTLREHDSKQAIHAHRAMHATFTRNPKHGISMLTLAKSARLVGVDNIHIGT VIGKLVGTKDEVLNLEREMEKEGILEEDWKRIKSVFPCSSGGLHPGILPEIMDMMGKNIM VQLGGGIHGHPDGTKSGAMATRQAIDAYIRKVKIKEATLIYPELTRALNKWGHEK >UBA92contig_3568_7Pacearchaeota MSQYLDYNYKPSKKDMVCLFRFEPRAGVSVKEVLGRIAAESSNGTWTKL-TTLKPHIRKI RGRAFYVRGNLVKIAYPEVLFEAGSMPQIYSAIAGNIFGMKAVNNLRLMDIDFPDSIMKS FKGPQFGIEGVRRFMKVKKRPLTATVPKPKVGMTTKEHAQVGYEAWAGGLDFLKDDENLT DQKFNRFEARAKACAKMRDRAEKETGEKKDYFINVTGEAKEMLIRAKIAADYDFKYVMCD IVTAGWSGLQTLRNFDTKQAIHAHRAMHATFDRNPRHGLSMLTLAKCARLVGVDNIHIGT VIGKLVGSKDEVLAIEDEMEKKGILEENWRNIKSVFPCSSGGLHPGIVPDIMKMLGNNIV IQAGGGVHGHPFGTKAGAEALRQAIDATMEGESLKEYAK-GPGLKVALEHFGHEK >cg1_0.2_scaffold_3141_c_19Pacearchaeota MSQYLDYNYKPSKKDMVCLFRFEPRKGVSVNEAIGRIAAESSNGTWTEL-TTLKPHIRRI RARAFYRRGNLVKIAYPDELFEAGSMPQIYSAIAGNIFGMKAINNLRLMDIDFPDSIMKS FRGPQFGIEGVRKFMKVKKRPMTATVPKPKVGMTTSEHAQVGYEAWTGGLDFLKDDENLT DQKFNRFSARAKSCAKMRDKAEKETGEKKDYFINVTGETKEMLKRAKLAANYDFRYVMCD IVTTGWAGLQSLRDADNKQAIHAHRAMHATFDRNQRHGITMLTLAKCARLIGVDNLHIGT VIGKLVGTKDEVLSIENEMERKGILQENWRQIKPVFPCSSGGLHPGIVPEIIRMLGRDIV IQAGGGVHGHPLGTKGGSEALRQAIDAEVEGEKLEEYAK-RPGLRIALEHFGHEK >rifcsplowo2_01_scaffold_45370_2Woesearchaeota ----LN--YKPSKDDIVCLFRFEPALGISVKEAVGRIASESSNGTWTDL-TTLKPHIRKI RARVFLIHGNFCKIAYPLELFELGSMPQLYSSVAGNIFGMKALKNLRLEDIDFPEKYIRS FKGPQFGIDGVRKFMNILDRPLTATVPKPKVGMYTSEYCKAAYEIWKGGIDIVKTDENMT SQKFVNFYKTTEKILNVRDRVEKETGERKTFLANVTAETKEMIKRAKFVKKCGGEFVMID FLTAGWAALQTLRNEDLKLAIHCHRAFHAAFTRNPKHGVSMLTLAKCARLVGVDNIHIGT AVGKLVSPIKEIMGIEKEINDGHILEQKWHNIKPVFPVSSGGLHPGLVPYILNKLGRNII LXXXX-------------------------------------------------- >rifcsplowo2_01_scaffold_277111_1Pacearchaeota ------------------------------------------------------------ -------------------------MPQLYSSVAGNIFGMKALKNLRLEDIDFSEKYVKS FKGPQFGIAGVRKFMNIYNRPLTATVPKPKLGMTTEEYCKTANDIWSGGLDIVKTDENMT SQKFVNFYKTTEKILNIRDRVEKETGERKTFLANITGETKEMIKRAKFVKKCGGEFIMID IVTAGWAGLQTIRNEDLKLAIHAHRAMHAAFTRNPKHGISMLTLAKCARLVGVDNIHIGT AVGKLVSPIKEVLGIEREITNGHILQQEWYNIKPVFPVSSGGLHPGLIPYIMRMLGKDII LQCGGGVTGNPLGTKAGAVGMRQAIEATLNNRPLKEYAKNHKELKAALDKWGYKK >rifcsplowo2_01_scaffold_37901_5Pacearchaeota ----LN--YKPKKDDIICLFRFEPASGISVKEAVGRVASESSNGTWTSL-STLKPHIRKI RARAFYIEGNFVKIAYPLELFELGSMPQLYSSVAGNIFGMKAVKNLRLEDIDFSEKYIKS FRGPQFGIDGVRKFMNIYDRPLTATVPKPKLGMTTDEYCKVAKDIWSGGLDIVKTDENMT SQKFVNFYKTTEKILKVRDRAEKETGERKTFLANVTAETIEMIKRAKFVKKCGGEFVMID FLTAGWAGLQTLRNEDLKLAIHAHRAMHAAFTRNPKHGISMLTLAKCARLIGVDNIHIGT AVGKLVSPINEVLGIEREITNGHILEQDWYGIKPVFPVSSGGLHPGLIPYIIKMLGKNII LQCGGGVTGNPLGVKAGAKGMRQAIDATLDNKDLKVYAKTHKELKAAMDKWGYTK >CG10_big_fil_rev_8_21_14_0.10_scaffold_5506_5Pacearchaeota -----------------------------MNEAIGRVASESSNGTW-STLSTLKPHIRKI RARAFEIKGNWVKVAYPIELFEKGNVPQLLSSFAGNIFGMKAVKNLRLEDVHFPDELLKS FRGPEYGIEGIRRMFKVKDRPLTASVLKPKVGMTTSEHCQVAGDIWKGGCDFLKDDENLT DQSFNRFENRANQCFRLRDRIEKETGERKGYFINVTAETMEMLSRAKLVYDLGGEFVMID VLTAGFSAFQTLREFDHKMAIHIHRAMHASMTRDQKHGISMLSLAKFVRLVGGDNLHIGT VVGKLVGKEDDVLLLEKEIENVHALNQKWGRIKPMFAVSSGGLHAGLVPYVVKLLGNDIL IQAGGGIHGHPNGSYAGAKSLRQAIDAVIYNISLKEYSKTHEELKTALDKWGSMR >rifcsphigho2_02_scaffold_489089_1Pacearchaeota MSQYLDTSYKPKSSDVICLFRVEPAYGMSKKEVIGRVASESSNGTWSSL-TTLKPHIRKI RAKAYEVKGNYVKIAYPIELFEMGSVPQLLSSFAGNIFGMKAVKNLRLEDIHFPDKLIKS FRGPEYGIEGIRRRFKVYKRPLTASVPKPKVGLTTAEFAKVAEDVWSGGVDFLKDDENLT DQLFNRFENRANHCFKIRDKVEKETGERKGYFCNVTAETMEMLARAKLVHDLGGEYVMID VLTAGFAGFQTLREFDHKMAIHCHRAFHSSFTRNPKHGMSMLAVAKLVRLTGGDSLHVGT VIGKLVGKKDEVLSIEHEIE---------HNI------------------------ADFL ----------------GLPVLDQ--KTTLQMI-FLDFQY---------------- >rifcsplowo2_01_scaffold_5191_1Woesearchaeota MSQYLDTSYNPKSSDIICLFRVEPY-GMSKKEVIGRVASESSNGTW------STLK-PKI RARAYEVKGD-VKIAYPIELFEMGSVPQLLSSFAGNIFGMKAVKNLRLEDIHFPDKLMKS FRGPEYGIEGIRRKFKIHKRPLTASVPKPKVGLTTVEFAKVAEDVWSGGVDFLKDDENLT DQLFNRFENRANHCFKVRDKVEKETGERKGYFCNVTAETMEMLARAKLVRDLGGEYVMID VLTAGFAGFQTLREFDHKMAIHCHRAFHSSFTRNPRHGMSMLAVAKLVRLTGGDSLHIGT VIGKLVGKKDEVLSIEHEIELP-VLDQKWGDIKPMFATSSGGLHPGLVPHIMNLLGNDII IQLGGGIHGHPGGSYRGAIALRQSI------------------------------ >rifcsphigho2_12_scaffold_356730_1Pacearchaeota --------------------Y--------------------------------------- -----------VKIAYPIELFEMGSVPQLLSSFAGNIFGMKAVKNLRLEDIHFPDKLIKS FRGPEYGIEGIRRRFKVYKRPLTASVPKPKVGLTTAEFAKVAEDVWSGGVDFLKDDENLT DQLFNRFENRANHCFKVRDKVEKETGERKGYFCNVTAETMEMLARAKLVRDLGGEYVMID VLTAGFAGFQTLREFDHKMAIHCHRAFHSSFTRNPRHGMSMLAVAKLVRLTGGDSLHIGT VIGKLVGKKDEVLSIEHEIEGLPVLDQKWGDIKPMFATSSGGLHPGLVPHIMNLLGNDII IQLGGGIHGHPGGSYRGAIALRQSIEATMHGIPLKEFAQTSPELKQAMEKWGHIR >rifoxyc1_full_scaffold_21863_2Pacearchaeota MSQYLDTSYKPKSSDVICLFRVEPAYGM-SKEVIGRVASESSNGTW----SSLTTLKPKI RAKAYEVKGNYVKIAYPIELFEMGSVPQLLSSFAGNIFGMKAVKNLRLEDIHFPDKLIKS FRGPEYGIEGIRRRFKVYKRPLTASVPKPKVGLTTAEFAKVAEDVWSGGVDFLKDDENLT DQLFNRFENRANHCFKIRDKVEKETGERKGYFCNVTAETMEMLARAKLVHDLGGEYVMID VLTAGFAGFQTLREFDHKMAIHCHRAFHSSFTRNPKHGMSMLAVAKLVRLTGGDSLHVGT VIGKLVGKKDEVLSIEHEIEGLPVLDQKWGDIKPMFATSSGGLHPGLVPHIMNLLGNDII IQLGGGIHGHPGGSYRGAIALRQSIEATMHGIPLKEFAQTSPELKQAMEKWGHIR >rifoxyd1_full_scaffold_41945_2Pacearchaeota MSQYLDTSYKPKSSDVICLFRVEPAYGM-SKEVIGRVASESSNGTW----SSLTTLKPKI RAKAYEVKGNYVKIAYPIELFEMGSVPQLLSSFAGNIFGMKAVKNLRLEDIHFPDKLIKS FRGPEYGIEGIRRRFKVYKRPLTASVPKPKVGLTTAEFAKVAEDVWSGGVDFLKDDENLT DQLFNRFENRANHCFKIRDKVEKETGERKGYFCNVTAETMEMLARAKLVHDLGGEYVMID VLTAGFAGFQTLREFDHKMAIHCHRAFHSSFTRNPKHGMSMLAVAKLVRLTGGDSLHVGT VIGKLVGKKDEVLSIEHEIEGLPVLDQKWGDIKPMFATSSGGLHPGLVPHIMNLLGNDII IQLGGGIHGHPGGSYRGAIALRQSIEATMHGIPLKEFAQTSPELKQAMEKWGHIR >gwa1_scaffold_6140_7Pacearchaeota MSHYLDTNYKPKSSDVVCLFRVEPATGM-SKEAIGRVASESSNGTW----SDLTTLKPKI RARAYEVKGNYVKIAYPIELFEMGSVPQLLSSFGGNVFGMKAVKNLRLEDIHFPEKLLNS FRGPEYGIEGIRKRFKIYKRPLTASVPKPKVGLTTSEFAKVAEDVWSGGIDLVKEDENLT DQFFNRFENRVNHCFRIRDKVEKETGERKGQLVNVTAETMEMLARAKLVYDLGGEFVMID VVTTGFGAFQTLREFDHKMAIWVHRAMHSMFTRNPKHGMSMLALAKLVRLVGGDSLHIGT ARGKLFGKKDEVLMLEREIEGVNVLNQKWGHIKPILAVSSGGLHAGSVPYIVKTLGKDIA IQVGGGCHGHPMGSKAGATSVRQAIDATMNGIHLSEYAKTHKELKSALDKWGYIV >rifoxya1_full_scaffold_42954_2Pacearchaeota MSHYLDTNYKPKSSDVVCLFRVEPATGM-SKAAIGRVASESSNGTW----SDLTTLKSKI RARAYEVKGNYVKIAYPIELFEMGSVPQLLSSFGGNIFGMKAVKNLRLEDIHFPEKLLNS FRGPEYGIEGIRKRFKIYKRPLTASVPKPKVGLTTSEFAKVAEDVWSGGIDLVKEDENLT DQFFNRFENRVNHCFKIRDKVEKETGERKGQLVNVTAETMEMLARAKLVYDLGGEFVMID VVTTGFGAFQTLREFDHKMAIWVHRAMHSMFTRNSKHGMSMLALAKLVRLVGGDSLHIGT ARGKLFGKKDEVLMLEREIE--------------------------------QEMG---- ----------------AYKTYFSCI------------------------------ >rifcsphigho2_01_scaffold_53648_5Pacearchaeota ---------------------LKT-----LK----------P-----HIR--------KI RGRTFEIKGNYARIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDVTIPKVMLNS FRGPEYGIHGIRKLFKIDKRPLTASVPKPKVGMTTEEHCKVANEIWMGGVDFLKDDENLT DQKFNRFEARANKCFKIRDKAEKETGERKGYYINVTAETSEMLARAKLVHDLGGEYVMID VLTAGFASFQTLREFDHKMAIHVHRAMHAMFTRNPKHGMSMLALSKFIRMVGGDTLHIGT VIGKLVGKKDEVLMLEHEIEKEHVLNQDWHSIKPMFAVSSGGLHPGLVPQIMKMLGYDVV IQLGGGIHGHPGGSYHGAMALRQAIDATMKGLTLREYAENHEELKSALEKWGYMR >rifcsplowo2_01_scaffold_53007_11Pacearchaeota MSQYLDLKYKPSSSDLICLFRFEPR-GISANECAGRIASESSNGTWSDLKTLK-PHIRKI RGRTFEIKGN-ARIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDVTIPKVMLNS FRGPEYGIHGIRKLFKIDKRPLTASVPKPKVGMTTEEHCKIADEIWMGGVDFLKDDENLT DQKFNRFEARANKCFKIRDKAEKETGERKGYHINVTAETSEMLARAKLVHDLGGEYVMID VLTAGFASFQTLREFDHKMAIHVHRAMHAMFTRNPKHGMSMLALSKFIRMVGGDTLHIGT VIGKLVGKKDEVLMLEHEIEEH-VLNQDWHSIKPMFAVSSGGLHPGLVPQIMKMLGYDVV IQLGGGIHGHPGGSYHGAMALRQAIDATMKGLTLREYAENHEELRSALEKWGYM- >rifcsplowo2_01_scaffold_38123_6Pacearchaeota ---------------------LST-----LK-----------------------PHIRKI RARAFEIKGNYVKIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDVHFPEVLLKS FRGPQYGIDGIRKLFKIHKRPLTASVPKPKVGMTTEEHCKVADDIWRGGVDFLKDDENLT DQKFNRFENRANKCFKIRDKVEIETGERKGYYINVTAETAEMLARAKLVHDLGGEYLMID VMTAGFSGFQTLREFDHKMAMHCHRAFHSSFTRNPRHGMSMVALAKFVRVVGGDSLHVGT VIGKLIGKKDEVLTIEREFEKQPVLSQKWYNIKPAFAASSGGLHPGLVPQIMHMLGNDIV IQAGGGIHGHPMGSKSGAIALRHSIDAVMNNIPLTEYAKKSSELSMALEKWGYVR >rifcsphigho2_01_scaffold_136210_1Pacearchaeota -TQYLNLKYKPNSSDVVCLFRFEPARGISVNECVGRIASESSNGTWTSL-STLKPHIRKI RGRAFEIKGDYVKIAYPIELFELGSIPQLLSSYGGNIFGMKAVNNLRLENMTFPKVLIKS FRGPEYGIHGLRKLFRVEKRPLTASVPKPKVGMTTEEHCQVAKNIWEGGVDFLKDDENLT NQKFNRFDKRAKKCFKIRDKVEKETGERKGYYINVTAETNEMIRRARLVRDLGGEYVMID VLTAGFAGFQTLREFDNHMAIHIHRAMHATMTRNPRHGISMLSLAKFVRLVGGDSLHIGT VIGKLVGKKDEVLTLEHEIEKEHVLNQNWYNIKPMFATSSGGLHPGLIPQIMNMLGNDII IQLGGGIHGHPMGSKYGAIALRHAIEATMNNIPLTEYAKKSSELSMALEKWGYTR >rifcsplowo2_01_scaffold_200222_1Pacearchaeota ------------------------------------IASESSNGTWSDL-STLKPHIRKI RARAFEIKGDYVKIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDVHFPKILIKS FRGPQYGIDGIRRLFRVEKRPLTASVPKPKVGMTADEHCKVADEIWRGGVDFLKDDENLT NQKFNRFDTRARRCFKIRDRVEMEIGERKGYFINVTAETNEMLRRAKLVRDLGGEYVMID VLTAGFSGFQTLREFDYKMAIHIHRAMHATMTRNPRHGISILSLAKFVRLVGGDSLHIGT VIGKLAGKKDEVLMLEHEIEKEHVLNQNWHDIKPMFAVSSGGLHPGLVPQVMHMLGNNIV LQLGGGIHGHPLGSKSGAIALRHAIDSVMHNIPLTEYAKKSSELSMALEKWGYNR >rifoxyc1_full_scaffold_36956_1Pacearchaeota MSQYLDLKYNPSKTDLVCLFRFESARGI-STECIGRIASESSNGTWSDL-STLKPHIRKI RARAFEIKGNYVKIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDVHFPKILIKS FRGPQYGIDGIRRLFRVEKRPLTASVPKPKVGMTADEHCKVADEIWRGGVDFLKDDENLT NQKFNRFDTRARRCFKIRDRVEMEIGERKGYFINVTAETNEMLRRAKLVRDLGGEYVMID VLTAGFSGFQTLREFDYKMAIHIHRAMHATMTRNPRHGISMLSLAKFVRLVGGDSLHIGT VIGKLAGKKDEVLMLEHEIEKEHVLNQNWHDIKPMFAVSSGGLHPGLVPQVMHMLGNNIV LQLGGGIHGHPLGSKSGAIALRHAVDSVMHDIPLIEYAKKSSELSMALEKWGYNR >rifcsphigho2_01_scaffold_10367_34Pacearchaeota ------LKYNPSKTDLVCLFRFEPK-GISTNECVGRIASESSNGTWSSLTTLK-PHIRKI RARAFEIKGT-VKIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDIHFPKVLIKS FRGPEYGIHGIRKLFKVEKRPLTASVPKPKVGMTTEEHCNVAKGIWEGGVDFLKDDENLT DQKFNRFDKRARKCFKIRDKIEKETGERKGYYINVTAETNEMIRRARLVRELGGEYVMID VLTAGFAGFQTLREFDNKMAIHCHRAQHSMMTRNPKHGMSMLTLAKFIRMVGGDSLHIGT VIGKLVGKKDEVLMLEHEIEKH-VLNQNWYNIKPIFATSSGGLHSGLIPQIMEMLGNDIV IQVGGGIAGHPDGITAGAKSIRQSIDATMRGISLKEYAKDHVELQRALDKFGYV- >rifcsplowo2_01_sub10_scaffold_35299_1Pacearchaeota ----LK--YNPSKTDLVCLFRFEPAKGI-STECVGRIASESSNGTW----SSLTTLKPKI RARAFEIKGTYVKIAYPIELFELGSVPQLLSSFAGNIFGMKAVNNLRLEDIHFPKVLIKS FRGPEYGIHGIRKLFKVEKRPLTASVPKPKVGMTTEEHCNVAKGIWEGGVDFLKDDENLT DQKFNRFDKRARKCFKIRDKIEKETGERKGYYINVTAETNEMIRRARLVRELGGEYVMID VLTAGFAGFQTLREFDNKMAIHCHRAQHSMMTRNPKHGMSMLTLAKFIRMVGGDSLHIGT VIGKLVGKKDEVLMLEHEIEKEHVLNQNWHDIKPMFAVSSGGLHPGLVPQVMHMLGNNIV LQLGGGIHGHPLGSKSGAIALRHAIDSVMHNIPLTEYAKKSSELSMALEKWGYNR >rifcsplowo2_01_scaffold_24480_6Pacearchaeota ------------------------------------------------------------ ---------------------------------------MKAVKNLKLEDVHFPKIMLDS FRGPEYGIQGIRKLFKIYDRPLTASVPKPKVGMNTNEFNDVAYKIWKGGLDFVKTDENMT SQKFVNFYKTTEKVLKTRDKIEKETRERKMFLANVTAETNEMIKRAKFVKNHGGEFVMLD VVTAGFAAFQTLREFDNHMAIWSHRAMHAMFTRNPKHGMSMLTLAKFVRLVGGDSLHVGT VKGKLVGKKDEVLRIEHELEKEHVLNQKWNSIKPVLVVSSGGLHAGSVPYIMRHMGNDIA IQIGGGCHGHPSGTEAGAKSVRQAIDATMNNISLEEYSKTHKELKEALGKWGHVI >rifcsplowo2_01_scaffold_256272_1Pacearchaeota ------------------LFRFEPAKGISIKEAIGRVASESSNGTW----TSLSTLKPKI RARAFEIKKDYVKIAYPIELFEMGSVPQLLSSFAGNIFGMKAINNLRLEDVHFPKVLIKS FRGPEYGIHGLRKLFKVEKRPLTASVPKPKVGMTTSEFNDVAYKIWKGGLDFVKTDENMT SQRFVNFYKTTTKVLKTRDKVERETGERKMFLANVTAETNEMIKRAKFVKEHGGEFVMLD VLTAGFAAFQTLREFDNKMAIWTHRAFHSAFTRNPKHGMSMLTVAKLVRLVGGDALHIGT VRGKLVGKKDEVLMLEREIEKEHVLNQKWSYIKPVLSISSGGLHAGSIPYIMKYLGKDIA IKVGGGCHGHPKGTEAGAMSVRQAIDAVMNNISLEEYAKTHKELKEALNKWGYII >rifcsplowo2_01_scaffold_157232_1Pacearchaeota MTHYLDLNYKPGKDDLVCLFYFEPAEGISIKEAIGRIAAESSNGTWSETEYGSKPHIRKI RARAYTINGNYVKIAYPLPLFELGNVPQLLSSVAGNIFGMKALKNLRLEDISFSKEYIKC FKGPKFGIEGIKKFMKIKERILTATVPKPKLGMVTNEYCSIAEKIWEGGVDIVKTDENMT SQKFVNFYKTTDKILKIRDKVEKKTGERKAFLANVTSETKEMLKRARFVKDCGGEFVMVD VVTAGFAGFQSLRNEDLGLAIHIHRAMHAAFTRNKKHGISMLVLAKLVRLIGGDTLHIGT IIGKLVGAKDEVLMIKEGLKKQHILPQEWYNIKPVMPVSSGGLHPGLIPYIVNIFGKDVM VQVGGGVLGNPLGAKAGAMALRQAIDATLNKIPLEKYAKTHKELKAALNKWGTER >rifcsplowo2_01_scaffold_143159_2Pacearchaeota ------------------LFRFEPAKGISTNECIGRIASESSNGTWL---TTLKPHIRKI RARAFEIKNNYVKIAYPIELFELGSVPQLLSSVAGNIFGMKALKNLRLEDIHFPKVYIKH FKGPQFGINEIRKFMKVYNRPFTATVPKPKLGMNTNEYCDVAYKIWRGGIDIVKTDENMT SQKFVNFYKTTEKILKIRDRVEKETGERKAFLANVTAETKEMLKRAKFVKEHGGEFVMID FLTAGFAGFQSLRNEDLKLAIHIHRAFHSLFSRNPKHGMSMLTLAKLVRLVGGDTLHIGT VYGKLVGTKDEVLMIEHEIEKQHVLNQNWYNFKPLIPVSSGGLHPGLIPYIMNMLGKDIM IQVGGGVLGNPLGVESGSRALRQSIDATLNNISLEKYAKTHKELKAALDKWGYVR >rifcsplowo2_01_scaffold_16673_1Woesearchaeota ------------------LFYFEPARGISVKEAVGRVAAESSNGTWSTLY-GSLPHIRKI RGRAFEIKGDYVKIAYPIELFELGSIPQLMSSVAGNIFGMKAINNLKLLDIEFSRKYIKS FRGPQFGINGIRKFMKIYNRPLTCTVPKPKLGMNIIEYCDAAYKIWKGGVDIVKTDENMT SQKFINFYKNTEKMLNIRNKVEKETGERKTFLANVTSETKEMLKRAKFVADNGGEFVMID FLTAGFAGFQTLRDEDLGLAIHVHRAFHAAFTRNPKHGVSMLTLAKLSRLVGGDTLHIGT VIGKLVGKKDEVLMIEHEIEKQHVLNQKWDNIKPVLPVSSGGLHPLLVSQIIKMLGNDIM VQCGGGVLGHPSGIEAGAMALRQAMDATSNNISLKEYAKTHIELKVALGKWGFSR >rifcsphigho2_01_scaffold_21619_3Pacearchaeota -----------------LLF---------WK-----------KGNF-------------- -----------VKVAYPIELFELGSIPQLLSSVAGNIFGMKALKNLRLEDIEFSKVYIKS FKGPQFGINGIRKFMKIKKRPLIATVPKPKLGMNTEEYCDAAYKIWKGGVDIVKTDENMT SQKFINFYRTTEKILKVRDKVEKETGERKTFLANVTSETKEMLKRAKFVKEKGGEFVMID FLTAGFAGFQSLRDEDLGLAIHVHRAFHSLFTRNEKHGMSMVTLAKLVRLVGGDTLHIGT VIGKLVGTKDEVITIEHEIEKQHILNQKWYNIKPVMPVSSGGLHPGLVPEIIKMLGKDIL IQAGGGVLGNPLGIEAGAKAFRQAIDATIEGINLKEYARNHIELKVALNRWGISR >rifcsphigho2_01_scaffold_37456_9Woesearchaeota -----------------------------VKEAVGRVASESSNGTW----TSLSTLKEKI RARAFEIKGNYVKIAYPIELFEFGNVPQLMSSVAGNIFGMKAIKNLKLLDIMFPKAYIKS FDGPGFGVHGVRKFMKIKDRPLTCTVPKPKVGMTTVEHAKVGMDAWLGGVDFLKDDENLS DQKFNRFYKRAELCFKVRDMVEKKTGERKSYFINVTAETKEMLKRAKFVADLNGEYVMID FLTAGFAGFQSLRDFDNKIAIHVHRAMHAAFDRNPKHGVSMLVLAKLVRLIGGDNMHIGT AVGKLVGSREDVMLYRDSISGKENLYQDWQGLKDTIPVSSGGLHPGLVPNIIGLLGNDCV LQLGGGIHGHPNGTISGAMAFRQALDATLKGIDLGDYAVEHKELRDALRMWGYSR >rifcsplowo2_01_scaffold_108589_3Pacearchaeota ------LKYRPGKDDLVCLFYFELAKGMAVKEAVGRIASESSNGTW----TSLSTLKEKI RARAFEINGNYVKIAYPIELFELGSIPQLMSSVAGNIFGMKAVANLKLLDISFPEKYLRS FKGPGFGMNGIRKVMAVSKRPLTCTVPKPKVGMTTNEHLNVARDAWTGGIDFLKDDENLT DQNFNKFYKRADLCFKLRDKIEKITGERKSYFINVTAETNEMVKRAKHVADLGGEYVMID FLTAGYSGFQTLRDFDHKLAIHVHRAMHAAFDRNPKHGISMLTLAKIVRLIGGDNLHIGT VVGKLVGSKSDVLMLKEGIVGQLNLSQKWHGMKDMIPVSSGGLHPGLVPYIMNILGNNCV LQLGGGIHGHPNGTKYGAIALRQAIDASLSKVNLTEYAKMHKELKIALGKWGFEK >rifcsplowo2_01_scaffold_283644_2Pacearchaeota ------LKYEPGKDDLVCLFYFEPAKGMAVKEAIGRIASESSNGTWTSL-STLKEHIRKI RGRAFEINGNYVKIAYPLELFELGSVPQLMSSVAGNIFGMKAVDNLKLLDISFPEKYIRS FKGPGFGMNGARKFMKIYNRPLTCTVPKPKVGMTTVEHLNVARDAWTGGIDFLKDDENLT DQNFNKFDKRAELCFKLRDKIERITGERKSYFINVTAETNEMVKRAKHVADLGGEYVMID FLTAGYSGFQTLRDFDNKLAIHVHRAMHAAFDRNPKHGISMLTLAKITRLIGGDNLHIGT VVGKL-------------------------------------------------VG---- ------------------------------------------------------- >rifcsplowo2_01_scaffold_45660_7Pacearchaeota MSHYLKLNYKPGKDDLICLFYFEPP-GMSTKEAIGRIAAESSNGTWTQLGTL--PHIMKI RARAFKIKGN-VHVAYPLDLFELGSIPQLLSSVAGNIFGMKAMKNLRLMDITFPKKYVQS FKGPQFGIEGIRKFMKIYNRPLTATVPKPKVGMTTEEHAKVGYEAWVGGVDLLKDDENLT DQNFNKFDARVKLCAKMRDKAEKETGEKKDYLINVTAETNEMLRRAKLAHDYGFKYIMAD IVTTGFAGLQTLRNFDTKQAIHAHRAMHATFTRNPKHGLSMLFLAKLARLVGVDNLHIGT VIGKLVGTKDEVLMLEHEIEEH-ILCQDWYNLKSVLATSSGGLHPGLIPQIIKMLGKDIC VQLGGGIHGHPNGTRIGAKALRQAIEATMKRIPLKEYSKSHKELETALQKWGSE- >rifcsphigho2_01_scaffold_111818_5Pacearchaeota LSQYLDLKYKPERDDLICLFYFEAK-GMPVKEAIGRIASESSNGTWTSLTTLK-PHIRKI RARAYEIKGN-VKIAYPIELFELGNMPQLYSSVAGNIFGMKAVNNLRLLDIQFSKRYIDS FKGPQFGIDGVRKFMKVKDRPLTATVPKPKVGMTTEEHAKVLYEAWTGGVDFGKDDENLT SQVFNKFENRVKVCAKMRDKAEKETGEVKEYFINITAETDEMKKRAKIIKDYNFKYVMVD ILTAGWSGLQTVRNEDLKLAIHAHRAMHAAFTRNPKHGVSMLTIAKSARLVGVDNIHIGT VIGKLVGTKDEVLNLECEIEEH-ILSQKWNNINPVFAVSSGGLHPGLVPYIIKMLGKDVV VQLGGGIHGHPNGTYSGAKALRQAIDSTLKGIKLENYALGHKELKIALEKWGSE- >rifcsphigho2_01_scaffold_7201_13Pacearchaeota ISQYLDLKYKQDKNDIICLFYFEPAKGISTKEAIGRIAAESSNGTW----TNLSTLKEKI RARAFEIKGNYVKIAYPLELFELGSMPQLYSSVGGNIFGMKAMKNLRLIDITFPKKYMES FKGPQYGINGVRTFMKTPRRPLTATVPKPKVGMYTEEHAKVGYEAWMGGVDFLKDDENLT DQPFNRFNDRVKLCAKYREKAEKETGEIKDYFVNITAETKEMIKRARTVKEYGFRYVMVD ILTAGWAGLQTIREEDLKLAIHAHRAMHATFTRNPKHGISMLTLAKCARLVGVDNIHIGT VVGKLVGKKDEVLMLEHEIEIVHSLSQDWEKIKPVFAVSSGGLHPGLVPDILNLLGNECV VQLGGGIHGHPNGTRKGAMALRQAIDATLDGISLDDYSKNHKELKEALKKWGHGK >rifcsplowo2_01_sub10_scaffold_17928_1Pacearchaeota VNPELSLRYKQDKNDIICLFYFEPK-GISTKEAIGRIAAESSNGTWTNLSTLK-EHIRKI RARAFEIKGN-VKIAYPLELFELGSMPQLYSSVGGNIFGMKAMKNLRLIDITFPKKYMES FKGPQYGINGVRTFMKTPRRPLTATVPKPKVGMYTEEHAKVGYEAWMGGVDFLKDDENLT DQPFNRFNDRVKLCAKYREKAEKETGEIKDYFVNITAETKEMIKRARTVKEYGFRYVMVD ILTAGWAGLQTIREEDLKLAIHAHRAMHATFTRNPKHGISMLTLAKCARLVGVDNIHIGT VVGKLVGKKDEVLMLEHEIE-H-SLSQDWEKIKPVFAVSSGGLHPGLVPDILNLLGNECV VQLGGGIHGHPNGTRKGAMALRQAIDATLDGISLDDYSKNHKELKEALKKWGHG- >YP_001012710_Hyperthermus_butylicus_DSM_5456_II_REFreference -FVDES--YKPGKDEVIAVFRVTPAQGISIKDAAGRIAAESSVGTWTTL-SVKPSWFEKL KAKAYRFDGSLVWVAYPVELFEEGSIPNFASSILGNIFGMKAIAGLRVEDVYFPPSYLET FPGPNKGIQGVREILGIKDRPILATVPKPKLGYTPEEYGRVAYEILIGGIDLVKDDENFA SQPFCRFEARLKEVMKAIDRAEKETGERKGYLANVTAPIREMEKRIKLVADYGNKFIMID FLTAGWAALQHARELEYDLAIHGHRAFHAAFTRNPKHGVSMFLVAKLARMAGVDHVHVGT PVGKMDAKTREVLEHTRIVRDLFHLEQPWSNIKPVFPVASGGLHPGTLPEVIRVMGKDII MQVGGGVLGHPDGPEAGARAVRQAVEAAMKGISLDEYAREHRELARALEKWGYVR >Ferroglobus_placidus_YP_00343593_REFreference ---YVD--YEPNKRDVIAVFKITPAEGYTIKECAGGVAAESSTGTWTTLY--PWYEEEDL SAKAYEFDGSIVKIAYPYHAFEERNLPALLASIAGNVFGMRRVKALRLEDLYLPEKLIRE FKGPSKGIEGVRKMLEIKDRPIYGVVPKPKVGYSAEEFESLAYDLLSSGADYIKDDENLA SPWYNRFEERARIVSRVIEKVESETGEKKTWFANITANVKEMERRLEILAEHNLKHAMVD VVICGWAVLEHIRDIDYNLAIHGHRAMHAAFTRNPQHGISMFVLAKLYRLIGIDQLHVGT AAGKLEGGKWEVIQNARILRDVFHLEQKFYSIKPAFPVSSGGLHPGNLAQVFEALGTDIV IQVGGGTVGHPDGPKAGAKAVRQAIDAIMQGIPLEEYAKKHKELARALEKWGTVT >YP002960117_Thermococcus_gammatolerans_EJ_REFreference ---YVD--YEPNKRDIIAVFRVTPAEGYTIEQAAGAVAAESSTGTWTTLY--PWYEQEDL SAKAYDFDGSIVKIAYPFHAFEEWNLPGLLASIAGNVFGMKRVKGLRLEDLYFPEIVLRN FSGPAFGIEGVRKMLEIYDRPLYGVVPKPKVGYSPEEFEKLAYELLSNGADYIKDDENLT SPWYNRFDERAEIVTRVIDKVENETGEKKTWFANITADIREMERRLEVLADLGLKHAMVD VVITGWGALEYIRDLDYGLAIHGHRAMHAAFTRNKYHGISMFVLAKLYRIIGIDQLHVGT AAGKLEGGKWDVIQNARILRDVFHLEQKFYGMKAAFPTSSGGLHPGNIEPVIEALGKDIV LQLGGGTLGHPDGPGAGARAVRQAIDAIMQGIPLDEYAKTHKELARALEKWGHVT >NP_070466_Archaeoglobus_fulgidus_DSM_4304_II_REFreference ---YVD--YEPQKDDIVAVFRITPAEGFTIEDAAGAVAAESSTGTWTSLH--PWYDEEGL SAKAYDFDGSIVRIAYPSELFEPHNMPGLLASIAGNVFGMKRVKGLRLEDLQLPKSFLKD FKGPSKGKEGVKKIFGVADRPIVGTVPKPKVGYSAEEVEKLAYELLSGGMDYIKDDENLT SPAYCRFEERAERIMKVIEKVEAETGEKKSWFANITADVREMERRLKLVAELGNPHVMVD VVITGWGALEYIRDLDYDLAIHGHRAMHAAFTRNAKHGISMFVLAKLYRIIGIDQLHIGT AAGKLEGQKWDTVQNARIFSDAFHLSQNFHHIKPAMPVSSGGLHPGNLEPVIDALGKEIV IQVGGGVLGHPMGAKAGAKAVRQALDAIISAIPLEEHAKQHPELQAALEKWGRVT >YP002428833_Desulfurococcus_kamchatkensis_1221_REFreference ---YID--YTPDSNDVIAVYRVKPAQGFTIEDAAGGVAAESSTGTWTSLY--NWYDVGRL SGKAYYFDGSIVKIAYPVELFEEGNIPGLLASIAGNIFGMKRVEGLRLEDIYLPKKFLES FKGPSKGLNGVREIFGVKDRPIVGTVPKPKEGYSPEEVEKLALELLSGGLDYIKDDENLT SPSFCRFEARAKAIMKVIDKVEKETGERKVWFANITSDIREMEKRLRLVADYGNPYIMVD VVITGWSALTYIRDLEYGLAIHGHRAMHAAFTRNPYHGISMYVLAKLYRIIGIDQLHIGT AAGKLEGGKLDVIRYAKILRDMYHLEQPMHHIKPAMPVSSGGLHPGNLPPVIEALGTNLV LQIGGGVIGHPDGPRAGALAVRQALEAIMNNIPLDEYAKTHRELARALEKWGFAK >ZP_09027197_Desulfurococcus_fermentans_DSM_16532_II_REFreference ---YID--YTPDSNDVIAVYRVKPAQGFTIEDAAGGVAAESSTGTWTSLY--NWYDVGRL SGKAYYFDGSIVKIAYPVELFEEGNIPGLLASIAGNIFGMKRVEGLRLEDIYLPKKFLES FKGPSKGLNGVREIFGIKDRPIVGTVPKPKEGYSPEEVEKLALELLSGGLDYIKDDENLT SPSFCRFEARAKAIMKVIDKVEKETGERKVWFANITSDIREMERRLRLVADYGNPYIMVD VVIAGWSALTYIRDLEYGLAIHGHRAMHAAFTRNPYHGISMYVLAKLYRIIGIDQLHIGT AAGKLEGGKLDVVRYAKILRDIYHLEQPMHHIKPAMPVSSGGLHPGNLPPVIEALGTNLV LQIGGGVIGHPDGPRAGALAVRQALEAIMNNIPLDEYAKTHRELARALEKWGFVK >YP001041057_Staphylothermus_marinus_F_REFreference ---YID--YVPDENDIIAVFRIKPAKGFTIEDAAGGVAAESSTGTWTTLY--PWYNTEKL SGKAYYFDGSIVRIAYPVELFEEANMPGLLASIAGNVFGMKRVEGLRLEDIYLPKKFLQY FKGPSKGVEGVKKIFRVTDRPIVGTVPKPKVGYSPEEVEKLAYELLVGGMDYIKDDENLT SPSFCRFSERAKHIMRAIDRAEKETGERKVWFANITSDIREMEKRLKLVADYDNPYVMVD VVVTGWSTLTYIRDLEYGLAIHAHRAMHAAFTRNPYHGISMYVLAKLYRIIGVDQLHIGT AVGKLEGGKIDVIRYARILRDVFHIEQEMYHIKPAMPVSSGGLHPGNLPGVIDALGTELV LQIGGGVLGHPDGPRAGAMAVRQSLEAILKGIPLDEYAKTHRELARALEKWGFAK >YP_003669400_Staphylothermus_hellenicus_DSM_12710_II_REFreference ---YVD--YVPDDNDIIAVFRIKPAKGFTIEDAAGGVAAESSTGTWTTLY--PWYDTEKL SGKAYYFDGSIVRIAYPAELFEEANMPGLLASIAGNVFGMKRVEGLRLEDIYLPKKFLQY FKGPSKGVDGVRKIFGINDRPIVGTVPKPKVGYSPEEVEKLAYELLVGGIDYIKDDENMT SPSFCRFSERAKHIMRAIDRAEKETGERKVWFANITSDIREMEKRLKLVADYGNPYVMVD VVVTGWSALTYIRDLEYGLAIHAHRAMHSAFTRNPYHGISMYVLAKLYRIIGVDQLHIGT AVGKLEGGKIEVIRYARILRDIFHLEQEMHHIKPAMPVSSGGLHPGNLPGVIEALGTELI LQIGGGVLGHPDGPRAGAMAVRQSLEAILKGVPLDEYAKTHRELARALEKWGFAK >YP_920628_Thermofilum_pendens_Hrk_5_II_REFreference -FVVKS--YLPDDKDVIVTFRVTPSEGFTIEDAAGGVAAESSVGTWTTLY--QWYDKSRL KGKAYYMDGSILRVAYPVELFEEGNMPAFLASVAGNIFGMRRVRSLRVEDIYLPEAFLKH FKGPSQGVEGVRGKLKIWGRPIIGTVPKPKVGYSPEEVEKLAYEILVGGMDFVKDDENLA GPSYCRFEERAKAIMKAIDRAEKETGERKAWLANITADVREMERRLKLVAELGNTHVMVD VVIAGWSSLTYVRDLDYKLAIHGHRAFHAAFTRNPYHGVSMFTLAKLYRIIGVDQLHVGT PVGKLEAKAVDVIRMARLLRDGLHMQQPFPGIKPAFPVSSGGLHPGTLPAVIKAMGVDTV IQVGGGVVGHPDGPRAGAAAARQAVEAYLEGVPLQEYAKTHRELARALEKWGQVI >Ignisphaera_aggregans_YP_00386030_REFreference -WVEKS--YTPDKSDVIVTFRITPAEGFTIEDVAGGVAAESSTGTWTTLY--PWYDENRL RGKAYAFDGSLVRIAYPVELFEEGNMPVFLASVAGNIFGMRRAKYLRVEDIYMPYDFIKY FKGPVKGIQGVRDTLKVYDRPIVGTVPKPKVGYTADEVEKLAYEILSGGMDFIKDDENLG GPSYCRFEARAKAIMKIIDKVEKETGERKVWLANITADVREMEKRLKLVADYGNPVIMVD VVIVGWASLGYIRDLEYKLYIHAHRAMHAAITRNPYHGISMFTLAKLFRIIGVDQLHIGT PVGKLEARTIDVIRNAKVLRDDFHLEQEFYHIKPALPTSSGGLHPGTLPEVVRVMGKDLV IQVGGGTIGHPDGPRAGAMAVRQALEAIAKGIPLDEYAKDHKELRRALEKWGYVK >YP004622994_Pyrococcus_yayanosii_CH_REFreference -FVDLD--YTPGRDELIVEYYFEP-NGVSPEEAAGRIASESSIGTW----TTLWKLPDRS MAKVFEKSGEIAKIAYPLTLFEEGNLVQLFSAIAGNIFGMKALKNLRLLDFHPPYEYLRH FKGPQYGVKGI---MGIEDRPLTATVPKPKMGWSVDEYAEIAYELWSGGIDLLKDDENFT SFPFNRFEERVRKLYAVRDRVEAETGETKEYLINITGPAHVMEKRAQLVAAEGGQYIMID IVVVGWSALQYMREVDLGLAIHAHRAMHAAFTRNPRHGISMFVLAKAARMVGVDQIHTGT AVGKMAGDYEEVKRINGFLL------SEWEHIKPIFPVASGGLHPGLMPELIRLFGRDLV IQVGGGVMGHPDGPRAGAKALRDAIEAAIEGVSLEEKAKESPELKKALEKWGYLK >BAA30036_Pyrococcus_horikoshii_OT3_II_REFreference -FVDLN--YEPGRDELIVEYYFEP-NGVSPEEAAGRIASESSIGTW----TTLWKLPERS MAKVFEKHGEIAKIAYPLTLFEEGSLVQLFSAVAGNVFGMKALKNLRLLDFHPPYEYLRH FKGPQFGVQGIREFMGVKDRPLTATVPKPKMGWSVEEYAEIAYELWSGGIDLLKDDENFT SFPFNRFEERVRKLYRVRDRVEAETGETKEYLINITGPVNIMEKRAEMVANEGGQYVMID IVVAGWSALQYMREVDLGLAIHAHRAMHAAFTRNPRHGITMLALAKAARMIGVDQIHTGT AVGKMAGNYEEIKRINDFLL------SKWEHIRPVFPVASGGLHPGLMPELIRLFGKDLV IQAGGGVMGHPDGPRAGAKALRDAIDAAIEGVDLDEKAKSSPELKKSLREVGLSK >YP004424539_Pyrococcus_sp_NA_REFreference -FVDLD--YTPGRDELIVEYYFEP-NGVSPEEAAGRIASESSIGTW----TTLWKLPERS IAKVFEKHGEIAKIAYPLTLFEEGSLVQLFSAIAGNVFGMKALKNLRLLDFHPPYEYLRH FKGPQFGVHGI---MGVKDRPLTATVPKPKMGWSVEEYAEIAYELWSGGIDLLKDDENFT SFPFNRFEERVRKLYRVRDKVEEETGETKEYLINITGPVHVMEKRAELVACEGGRYVMID IVVVGWSALQYMREIDLGLAIHAHRAMHAAFTRNPRHGITMLALAKAARMIGVDQIHTGT AVGKMAGDYEEIKRINDFLL------SKWEHIRPVFPVASGGLHPGLMPELIRLFGKDLV IQAGGGVMGHPDGPRAGAKALRDAIEAAVEGVDLEEKAKSSPELKKALEKWGYLK >YP004341517_Archaeoglobus_veneficus_SNP_REFreference -FVDLS--YEPEENEIICVFRVEP-DGISMEEAAGRVASESSVGTW----TTLAKLPERL MAKVFEIDGNVVKIAYPLDLFEEGSIPQLLSSVAGNVFGMKALKNLRLEDIEFPAEYCKH FSGPLLGIEGVRKLFGVYDRPLTATVPKPKVGFDADEYADIAYQGWSGGIDFIKDDENLT SQPFVRFEKRLEKVMKAREKAEKETGEKKVYLANVTAVGKEMLRRAKLVADYGNEYVMVD ILTAGFSAVQMLREEDLGLGIHAHRAMHAAFTRNPKHGISLDVLVKISRLAGVDNFHVGT GVGKMEGSKDMVKRLADICR------REW-YVKPVFPVSSGGLHPGLVPDIVQLFGKDVI IQAGGGVHGHPDGTHAGAKALRQAIDAVIKGISLDEHAKKHAELARALEKWGYTR >YP007907939_Archaeoglobus_sulfaticallidus_PM70_REFreference -FVDLS--YKPE-DEIVCYFKVKS--DLPIEESSGRVASESSVGTW----TTLSRLPDWL MAKVFRIDGERIAVAYKTDLFEEGNIPQFLSSVAGNIFGMRAIKGLRFEDFEVPASFAKH FKGPNFGIDGVRNIMRVHDRPLTATVPKPKVGFDADEYAEVGYRSWVGGIDILKDDENLT SQPFIRFEKRLAKVMKARERAENETGEKKGYLINITAEAREMERRAELVADYGNEFVMVD ILTAGFSAVQTVRNKELGLAIHAHRAMHGAFTRNEEHGISLKVLTKLARMAGVDHMHVGT GVGKMAGDKAEVMELRDVCR------KSWHNFKPVFPVSSGGLHPGLIPDIIELFGVDVI IQAGGGVHGHPDGSEKGAMALRQAISAVLEGVDLEDYAKSHSELARALERWGRVS >LSDeep1_scaffold_289_96Peregrinibacteria VTQQLQLKYKPKSTEVLVLYKPKVK-GFTIEYAAENLAAESSIGTWTALSTMNNKIAQQL KPNVYKIDKK-VYIAYPVKLFEIGNMSGILSSICGNIFGMDFLDGLRVCDIQFPKAMVTS FPGPYHGIAGTRKILKVKGRPITGTIIKPKVGLTSKQHAQVGYEAWVGGLDIVKDDENLT SLTFNQFDSRAKLTCKAKGIAEKVTGQKKLWLANITHSNDEMMRRDALLRKLGNEVTMLD VVTLGFNAVHTYRLRNTKQIIHAHRAMHGTITRTPGFSMSMLILAKVYRMLGVDLLHVGT ATGKMEGGAAETMVLVEAIQKE-ILGQDWYGMKPVVAVASGGLYPGAIPTVVKYMGRDIV CQMGGGCHGHPDGTRGGATGIVEAVDSVMLGQPIRQYAKNHEMIKKAIDKWGV-- >CG09_land_8_20_14_0.10_scaffold_28336_1Micrarchaeota GGMMAGARYAPKEDDVVCTFRMKNR-GVSTGYAVQQLVAESSIGTWTDIATMKPEIARKL GPKAFSLRGG-VKVAYPADDFEPGNASQIWSAVCGNIFGMKILDSLRVEDIELPKKLARS FKGPALGVPGIRRMLGVPKRPLVGTIVKPKVGLNEREHAHVAYEAWANGLDLVKDDENLC SMSFNRFEKRVVEVLKARERAERETGEKKLAVLNVSGPK--MLQRAEFIREHGGNAAMVD IISCGWAGLQMLRNAETGLVMHGHRAGHAAFTRNREHGIAMMVVAKTARWCGIDTLHIGT AIGKMEGGSDEVVAVRG------AIQRDEFGLKPVFAVASGGLHPALLPGVIERLGGDVV VQFGGGVHGHPRGTAAGARAVRQALDATVEGETLDAVAARHWELREALEKWRRN- >CG08_land_8_20_14_0.20_scaffold_15999_2Gottesmanbacteria -MDYLDLKYTPK-EDIVCTFILKGKEDKK--TLSSKIAGESSTGTW----TKVLYSKKEL NAKVFSINKDTIKIAYPLELFEKGNIPQLLSSVAGNIFGMRGIDKLILRDIDFPKKYVEA FKGPELGISEIRKKTGIKERPIVGTIFKPKLGMSPKQMAENAYLVFSAGLDFAKDDENLS NQDFCKFKERFNLITKVLDKVENERGRRPIYAINITAPYDEMVKRAEFAKENNGNCIMVD AITLGFSSLQSLLSKKWGMLIHCHRAMHGAITRSRDFGISMLVFAKLLRLTGVTELHTGT VVGKMEGSKEEVVEINNFLR------KEWFNIKPIFPTASGGLHPVLMPELIKILGKDLM ITSGGGIWGHEMGAASGAKAMLQAVDAGVKNIPINEYARTHKELESAIKSWKGFE >rifcsphigho2_01_scaffold_6465_9Woesearchaeota -MKYEDTGYRPGRDDIICLFRIIPAAGFTFEETAARIASESSNGTWAELD--VPQHVRKL SAVAFELKPPYAKIAYPLGLFELGSIPQALSSIAGNIFGMKAARSVRLEDISWPKAYLRS FRGPQFGIPGVRKLLGIPRRPILASVPKPKVGLTTREFSAMAAAAWRGGVDLLKDDENLT DQAFNRFERRLDACMKLRRTIERETGERKSYLLNITAETQEMLRRARLAAKLGNEYVMVD ILTAGWAAVQTVREEKLGLAIHAHRAFHAAIDRNPAQGMSMKILAEIARIQGCDQVHIGG -LGKLAGDRREVRGIWEKVASKEVLAQDWAGMKPLLGTCSGGLHPGIISRLVRLLSADIV IQAGGGIHGHPRGTEAGARAMRAALDAIADGHRVEDAARVHPELADALAFWGHGT >rifoxyc1_full_scaffold_6493_4Woesearchaeota IIKYSDFNYKPSSKDLICLFRINPGSGFSIKESAARVASESSNGTWTGLD--VPSHIPKI SAKCFKISRDYAWIAYPEELFELGSIPQVVSSIMGNIFGMKAVDGLRLEDVTWPKSIVNS FRGPKYGIKGVRKLLNIKDRPLLATVPKPKVGYYPDEHAKVGYNAWAGGVDLLKDDENLT NQSFNPFEKRLDKSMAMMHKAEKETGEKKGYLINVTAETNEMIRRARLAKKAGNDFVMID ILTAGWSGLQTLREEKLGLAIHAHRAFHSTFDRNPRHGITMRVIIEMARLIGVDSIHIGG -LGKLVGGKTEVSYIKASMNIESVLAQDWHDINPCISCCSGGLHPGIIERLTDLLGSDLI LQAGGGIHGHPGGTHSGAIAFRQAFDAIKEGISVKTYAKTHEELGQAINKWGSVT >rifoxyb1_full_scaffold_13298_4Pacearchaeota KAKYNDTELIEKNPTFLARYDYNLR-DARYQGTAVIIPELDFKAKQGDIKGCNFFYKTAL SNIIRELSNE-PMTQFISELFKLEDVNNTYEDSGFTNYPQRAINPKSWQHLRVKSYFLNI DLEVPFVVEGL--------------------------------------MDVVKDDENLT SQPFNNFYKRIPLTLQALKKAEKETGEKKVYLANCTAPADEMIKRIKFVEKCGGNYIMLD ILTLGWSALQLARKT-TKLPIHAHRAGHAMFDRSRDHGMKMEVIAQFARMIGVDTLHIGT AYGKMTGGKDEVLHIEDEIE-E-NLHQKWFHIKPVFAVASGGVYPGIVPKIIEFMGQDVV IQAGGGIHGHPNGTVAGAKAMRQAVNATKNKVSLKEYAKYHPELRLALEKWGD-- >LSDeep1_scaffold_252_71unknown VSAYNGTKYKPTSTDVVVQYKITPARGYSFRQVAEMTAGESSVGTW----TEVQTTNKRL APKVFYLNKK-VRIAYPLKLFEYSSVPNILSSIGGNVFGMKAAKGLLFEDITFPKKMIKK FKGPRFGIKGLRKYLGVKKRPLVGTIVKPKVGLNEHEHALVAYDSWIGGCDIVKDDENLG DQDFNRFKKRFLLTIKKCREAEKKTGEKKVYLVNCTAEYDEMLKRIDFVQKNGGNYIMLD ILTIGWGSLQSIRDH-IKLPIHAHRAGHAMYDRDPNHGMTMEVIAQFARLIGVDTLHIGT AYGKMSGGKKEIIHIEKEIETMEHLSQKWYGLKPIFAVASGGVHPRMLPKIIKFMGNDVV LQAGGGIHGHPDGTVAGAIAMRQSVDAAMNNIPLGEYAKTHEELKKALKKWGR-- >gwa1_scaffold_12779_4Pacearchaeota KVNYENLNYTPKNTDVICQFKITPAKGYDFREVCSITAGESSVGTWTDISTVNKKMQKKI APKVYYLKGNRCRIAYPIELFELGNLSGVLSSIGGNIYGMNSVNGLLWEDIKIPEKMLKS FRGPKFGIPGIRKYLKVYDRPLVGTIVKPKVGLT---------------CDIVKDDENLT SQSFNNFYKRITLTLEAMRKAEKETGEKKAYLANCTAPVDEMIKRIKFVEKNNGNYIMLD ILTLGWSALQKAREV-TNVPIHAHRAGHAMFDRTPNHGMKMEVIAQFARMVGVDSLHIGT AYGKMAGGKDEVMHIEKEMETKENLSQRWFGIKPVFGAASGGVYPGITDKIIEFMGKDVI LQAGGGIHGHPDGTIAGAKAMRQAVNAVLKGIPLKEYAKTNLELKKSIEKWG--- >GWB1_scaffold_22216_8Pacearchaeota KVNYENLNYTPKNTDVICQFKITPAKGYDFREVCSITAGESSVGTWTDISTVNKKMQKKI APKVYYLKGNRCRIAYPIELFELGNLSGVLSSIGGNIYGMNSVNGLLWEDIKIPEKMLKS FRGPKFGIPGIRKYLKVYDRPLVGTIVKPKVGLTSKEHAKVAYESWLGGCDIVKDDENLT SQSFNNFYKRITLTLEAMRKAEKETGEKKAYLANCTAPVDEMIKRIKFVEKNNGNYIMLD ILTLGWSALQKAREV-TNVPIHAHRAGHAMFDRTPNHGMKMEVIAQFARMVGVDSLHIGT AYGKMAGGKDEVMHIEKEMETKENLSQRWFGIKPVFGAASGGVYPGITDK---------- LQAGGGIHGHPDGTIAGAKAMRQAVNAVLKGIPLKEYAKTNLELKKSIEKWG--- >gwc1_scaffold_2334_8Pacearchaeota KVNYENLNYTPKNTDVICQFKITPAKGYDFREVCSITAGESSVGTWTDISTVNKKMQKKI APKVYYLKGNRCRIAYPIELFELGNLSGVLSSIGGNIYGMNSVNGLLWEDIKIPEKMLKS FRGPKFGIPGIRKYLKVYDRPLVGTIVKPKVGLTSKEHAKVAYESWLGGCDIVKDDENLT SQSFNNFYKRITLTLEAMRKAEKETGEKKAYLANCTAPVDEMIKRIKFVEKNNGNYIMLD ILTLGWSALQKAREV-TNVPIHAHRAGHAMFDRTPNHGMKMEVIAQFARMVGVDSLHIGT AYGKMAGGKDEVMHIEKEMETKENLSQRWFGIKPVFGAASGGVYPGITDKIIEFMGKDVI LQAGGGIHGHPDGTIAGAKAMRQAVNAVLKGIPLKEYAKTNLELKKSIEKWG--- >LSDeep1_scaffold_26_533unknown VKAYEGKKYKPKKTDVIVQYRVKPSKGYTFRQVAEMTAGESSVGTWTEV-STMKPKIQKL APKVFYLDKK-IRIAYPLKLFELGNLPEILSSIGGNIYGMKAAKELFWEDINIPKKMLES FKGPRFGMKGLRKYLGVWSRPLVGTIVKPKVGLNEHEHAIVAYDSWLGGCDVVKDDENLT SQDFNQFKKRFLLTIKKCKEAEKKTGEKKVYLINCTAETQEMIKRIKFVEAHGGNYIMLD ILTLGWAALQTARNI-TKLPIHAHRAGHAMFDRDAKFGMSMEVIAQFSRMIGVDTLHIGT AYGKMSGGKKEVLHIEKEIETKEYLSQKWWGVKPVFAVASGGVYPQLVPKIMKFMGKDVV IQAGGGVHGHPHGSVAGAKAMRQAVEASLKGVSLKKYSKDHKELAEALEKWG--- >rifoxyb2_full_scaffold_57680_2Pacearchaeota --------------------M--------INIIATILTLKS-SFIW-------------- -----EEKRK------PTVPIGK-KARNILVHKL--IMSLLAERD-------F------- ---SDFGEEEI-------------------------------------GNEILQNAKYLR EDYIKNERQRL---QTEMKKAEEETGEKKVYLVNCTAETEEMLRRIKFVEQNGGNYIMLD IITLGWAGLQTARNH-TKLPIHAHRAGHAMFDRNPEHGMSMEVIAQLARMVGIDTLHIGT AYGKMTGGKDEVLHIEQEIETKEHLSQKWFGVKPVFGVASGGVHPGIVDKIMQFMSKDVV IQAGGGIHGHPQGTIAGAKAMRQAVDATMNKISLKNYAKSHRELREALEKWVK-- >tara_37686_27Pacearchaeota KSGYKSLKYKPKKTDLICQFKVTPAKGYNFKDVVSMVAGESSVGTWTEVKTMNPEIGKTL TPKVYYLDTK-CRIAYPIELFELGNLPEIMSSIGGNVYGMKSAKGLRWEDIEIPKKMLKS FKGPRYGIKGIRKYLKIKKRPLVGTIVKPKVGLTESQHAKVAYESWLGGCDIVKDDENLT SQNFNKFKKRFLLTIKLLKKAEYETGEKKVYLINCTAETEEMLKRIKFVENNGGNYIMLD IITLGWGALQTARNF-TKLPIHAHRAGHAMFDRNPDHGMSMEVIAQLARMVGVDTLHIGT AYGKMSGGKKEVLHLEKEIETKENLKQKWYGIKPVFGVASGGVYPGIVSKIVKFMGNDVV LQAGGGIHWNPRGSKYGAMGMRQAVDAVMKNIPLKTYAKKHKELKEAIDKFGLG- >tara_50434_27Pacearchaeota KSGYKSLKYKPKKTDLICQFKVTPK-GYNFKDVVSMVAGESSVGTWTEVKTMNPKIGKTL TPKVYYLDTK-CRIAYPIELFELGNLPEIMSSIGGNIYGMKSAKGLRWEDIEIPKKMLKS FKGPRYGIKGIRKYLKIKKRPLVGTIVKPKVGLTESQHAKVAYESWLGGCDIVKDDENLT NQNFNKFKKRFLLTIKLLKKAEKETGEKKVYLINCTAETKEMLKRIKFVEDNGGNYIMLD IITLGWGALQTARNF-TKLPIHAHRAGHAMFDRDSRHGMAMEVIAQLARMIGVDTLHIGT AYGKMSGGKKEVLHIEQEIE---NLKQRWYGIKPIFGVASGGVYPQIVPQIIKFMGNEVV IQAGGGIHGHPKGSISGAKAMRQAVDAVIKNKSLNEYSKTHKELKEALNKWKK-- >rifoxya1_sub10_scaffold_6_49Pacearchaeota VTGYKSMKYRPKKTDVVCQFKVTPAKGYSFRDVVSMVAGESSVGTWTDIK-TMKPRIGKL LAKVYYLDAK-CRIAYPLELFELGNLPEVMSSIGGNIYGMKSAKGLLWEDIKIPKKMLKS FKGPQFGIKGIRKYLKVFDRPLVGTIVKPKVGLDEKEHAKVAYDSWIGGCDVVKDDENLT SQTFNKFKKRFLLTIKALKKAEEETGEKKVYLVNCTAETEEMLRRIKFVEQNGGNYIMLD IITLGWAGLQTARNH-TKLPIHAHRAGHAMFDRNPEHGMSMEVIAQLARMVGIDTLHIGT AYGKMTGGKDEVLHIEQEIETKEHLSQKWFGVKPVFGVASGGVHPGIVDKIMQFMGKDVV IQAGGGIHGHPQGTIAGAKAMRQAVDATMNKISLKNYAKSHRELREALEKWVK-- >rifoxyb2_full_scaffold_25016_1unknown ----------PK------------------------------IGK-------------KL APKVYYLDKERIRIAYPLDLFELGNLPCVMSSIGGNIYGMKSADGILWEDVRIPKKMLKS FKGPRYGIKGIRKYLKVKDRPLVGTIVKPKVGLTSKEHAKVAYESWLGGCDIVKDDENLT SQNFNKFKKRFLLTIKALKKAEKETGEKKVYLINCTAECEEMKRRIKFVEENGGNYIMLD ILTLGWSALQTARNF-TKLPIHAHRAGHAMFDRNPEHGMSMEIIAQCARMVGVDTLHIGT AYGKMTGDKTEVLHIEEEIETRNNLEQKWYGTKPVFAVASGGVYPRIVDKIIDFMGKDVV IQAGGGIHGHPQGTIVGARAMRQAVDATMKKISLDEYAKTHPELKVALEYWSNVK >gwa2_scaffold_8105_21Pacearchaeota KSGYRSLKYKPRDTDVLCQFKITPAKGYEFNDVASMVAGESSVGTWTDVKTMNPKIGKKL APKVYYLDKERIRIAYPLDLFELGNLPCVMSSIGGNIYGMKSADGILWEDVRIPKKMLKS FKGPRYGIKGIRKYLK-------------------------------------KDDENLT SQNFNKFKKRFLLTIKALKKAEKETGEKKVYLINCTAECEEMKRRIKFVEENGGNYIMLD ILTLGWSALQTARNF-TKLPIHAHRAGHAMFDRNPEHGMSMEIIAQWARMVGVDTLHIGT AYGKMTGDKNEVLHIEQEIETKNNLEQKWYGTKPVFAVASGGVYPRIVDKIIDFMGKDVV IQAGGGIHGHPQGTIVGARAMRQAVDATMKKISLDEYAKTHPELKTALEYWSNVR >gwd2_scaffold_498_27unknown KSGYRSLKYKPRDTDVLCQFKITPAKGYEFNDVASMVAGESSVGTWTDVKTMNPKIGKKL APKVYYLDKERIRIAYPLDLFELGNLPCVMSSIGGNIYGMKSADGILWEDVRIPKKMLKS FKGPRYGIKGIRKYLKIKDRPLVGTIVKPKVGLTSKEHAKVAYESWLGGCDVVKDDENLT SQNFNKFKKRFLLTIKALKKAEKETGEKKVYLINCTAECEEMKRRIKFVEENGGNYIMLD ILTLGWSALQTARNF-TKLPIHAHRAGHAMFDRNPEHGMSMEIIAQWARMVGVDTLHIGT AYGKMTGDKNEVLHIEQEIETKNNLEQKWYGTKPVFAVASGGVYPRIVDKIIDFMGKDVV IQAGGGIHGHPQGTIVGARAMRQAVDATMKKISLDEYAKTHPELKTALEYWSNVR >UBA96contig_21807_17Micrarchaeota YGGYSKLRMRVEPEKLTAVFYVEPK-GKSLVQAAEAVAAESSIGTWVKLSTMQDRVLQKL QAKVFEINRNIIKIAYPLEIFENGNAPQLLSDIAGNIFGMKEVENIRLLDFEAPKKYVQS FPGPAFGVEGIRKIAGTLRRPHIGTIVKPKVGLTPKENAAVAYEAWMGGCDFVKDDENLT SHSFSPFEERVVRVLEAADKAEAETGAKKMYAPNITASADVMLKRAEFVKAQGGNCIMVD VITAGFAGVQFIRGRNLGMCIHAHRAMHAAFGRNKRHGIAMKVIAKISRMCGTDQLHIGG IVGKMEGDEKEVREISREMR---KLGGCWRKMKPVFSVCSGGLSPLSIPFLYKTLGRDII IQAGGGVHGHPDGTRAGAIAMRQALDASMRGVSLKEYAKGHEELRKAVEKFGK-- >UBA93contig_1572_78Micrarchaeota HDVYSGEKIDPE---VLVDFSFEPA-GD-PRNDAQALAAESSIGTWTELTTMKPSIRKRL AARVFRLGKNRASVAYPLDIFELGSIPQFLSDAEGNIFGMSEISALRLLDIRFPRAYARS FKGPQLGLEGCRRIVG-TSRPHAGTIIKPKIGLGPREHAAVAYEAWAGGCDFVKDDENLT DQKFNPFKERVIRTLDALDKAESETGEKKIYAANITAETGEMLKRADFVKQHGGNCIMVD VFTAGFSALQTVRNRNYGMIIHGHRAMHAAFTRNKRHGISMLSLAKLLRLAGVDQLHTGA IVGKMEGNVREILEMNEWLR------SDFYGLKPVLPVASGGVDPTRVPRLLDLAGTELV INAGGGIHGHPCGTRAGARALRQSMDAWMAGKSLRDYAKTHVELGQALEEWGNR- >ncbi_AWOG01000022.1_5Nanoarchaeota NLNFID--YKPK-NELIAEYHVEP-YRIDFRKACNHIAGESSIGTWTDIGTLSKEMFKRL KPTVFSIKKDKIKIAYPLDLFEKGNMSQILSSIAGNIFGMKSVKHLRLIDIDFPKELVRS FKGPGMGLEDIRNFTKIKDRPIAGTIYKPKIGLTDKEQAKLAYRIYKAGIDFSKDDENLC SMRFNSFRDRTVRILEVIDKIKDEEGRNVIYVPNVTAPYEEMMKRTEFVYEHGGKAIMMD ILTTGFSAHEALSKDFRKMIIHGHRALHGALTRDKHEGISMLVIAKIARLLGVSSLHTGT VVGKMEGTKEEVSEINDFLK------SKWFGLKRVMPAASGGLHPGLIPLVMKYLGNDLI INCGGGLWGHPDGYEAGVKAIRQSIDATMEGISLRDYSKNHYELRRALEKWGS-- >RBG_16_scaffold_41955_19Micrarchaeota -MTYLDLSYRPR-NDIVCEFYLEPAKGLSVKEAAEHVAGESSVGTWTEV-ATSSPRIRKM AAKVFSVKGNHVKIAYPSELFEPGNMPQIMSSIAGNIFGMKIVENLRLVDIEFPKDMAKS FRGPEIGLEDLRNITGIKGRPILGTIYKPKLGLNPKEMEDLAYKVYSAGLDYTKDDENLT SMSFNRFEDRVSRILKVADRIKSEQGRTVVYAANITAPAQEMLKRAEFVKDHGGKCIMID IMXXXWSGLQFIREQNMGMIIHAHRAMHAAVTRNPKHGISMLAIAKAARLCGVTALHAGT VVGKMEGPKEEVVKIDSFLK------SEFHGLKKVMPIASGGLHPRLVPDLLNILGNDLV ITFGGGLWGHPGGPESGVRAIRQAADAAIQKIPLAEYAKTHKELADALKEWK--- >RBG_16_scaffold_32845_14Micrarchaeota -MPYLDLSYRPR-NDLVCEFYVEPAKGLSVKEAAEHIAGESSIGTWTEVS-TSTPRIRKM AAKVFSIKGNCVKIAYPAELFEPGNMPQIMSSIAGNIFGMKAVENLRLEDIQFPKEIAKS FRGPEVGLDDLRKITGIKGRPILGTIYKPKLGLNPKEMEELAYKVYSAGLDYTKDDENLT SMSFNKFEDRVQRILGVADRIRSGQARTVVYAANITAPSQEMLKRAQFVKDHGGKCIMID IMTAGWSGLQYIREQNLGMIIHAHRAMHAAFSRNPKHGISMLAIAKAARLCGVSALHTGT VVGKMEGPKEEVMKIDSFLK------SDFYGMKKVMPIASGGLHPRLVPDLLKILGNDLV ITFGGGLWGHPGGPESGVRAITQATEATIQKTPLEEYAKTHRELAEALKAWK--- >RifSed_csp2_16ft_2_scaffold_590394_1Micrarchaeota VMSYLDLSYRPK-SDLVCEFCVEPTKGLSVKDAAEHIAGESSIGTW----TEVATSTPKM AAKVFSIKGNHVKIAYPTELFEQGNMPQIMSSIAGNIFGMKAVENLRLEDIDFPKEIAKS FKGPEIGLDELRKITGIKGRPILGTIYKPKLGLNPNEMGELAYKVYSAGLDYTKDDENLT SMKFNSFEDRVSRVLGVADRIRSEQGRTVVYAANITSPAQEMLKRAQFVKDHGGKCIMID IMTAGWSALQYIREQNLGMIIHAHRAMHAAVTRNPKHGISMLAMAKAARLCGVTALHTGT VVGKMEGPKEEVVAIDNFLK------SDFYGMKKVMPIASGGLHPRLVPDLLKILGNDLV ITFGGGLWGHPRGPESGVRAIRQAAEAAVQ------------------------- >mol-32-15fa-040034_9Micrarchaeota IMSYVDLNYLPR-NDLVCEFYLEPAKGLSIKEAAEHIAGESSVGTW----TEVSTSTPKM AAKVFSIKGNHVKIAYPQELFEPGNMPQIMSSIAGNIFGMKAVENLRLKDINFPKDIAKS FKGPEVGLDDLRKITGIKGRPILGTIYKPKLGLNPKEMEELAYKVYSAGLDYTKDDENLT SMGFNRFEDRVHRILGVADRIRSEQGRTVIYAANITAPAQEMLKRAQFVKDHGGKCIMID IMTAGWSGLQYIREQNLGLIIHAHRAMHAAMTRNPKHGISMLAIAKAARLCGVTALHAGT VVGKMEGPKEEVIAIDNFLK------SDFYGMKKVMPIASGGLHPRLVPDLLKILGNDLV ITFGGGLWGHPGGPESGVRAIRQAAEAAVQRILLEEYAKTHTELAEALKQWN--- >rifcsplowo2_01_scaffold_4908_46Aenigmarchaeota -MQYDD--YVSLVNSVVCTFRIDP-----PTNAAAAVAAESSTGTW-----SDVPDTNKI SAFIFERRGQMFKIAYPITLFEKNNIPQLMSSVAGNVFDMKNVKNLRLEDIEFPRSYYTA FKGPAYGIKGIRKIMKIRKRPLIGTIIKPKLGLNTEQHAERAYQAWIGGCDIVKDDENLS NQKFNKFQKRGFNTLMMCQRAENETGFKKAYMPNITAETNEMIKRAKFVEDHGGKYIMID VVTTGWAGLQTVRDH-VKIPIHAHRAGHGAFTRMENHGIAMRVIAKICRMIGIDQLHTGT VIGKMQ-GGKGVFDSVHALQ------DRWRGIKSCFAVCSGGLCPTHAPELIKMFGRNII IQAGGGIHSHPKGTIHGAEAMMQAVEAGMKKISLTEYAENHTSLRLALEKWGLSK >rifcsphigho2_02_scaffold_173369_1Micrarchaeota ---------------LICLFRIRPAKGVSVRSAANQVALESSVGTWDKVEGLSKGIMDRY GGKVFSIKGNYVKIAYPAELFEFGNMPSILSGIAGNIFGMKSLDALRLEDISFPRKLRDS FNGPKYGIEGVRKILKIKNRPLIGTIVKPKVGLSPIGHAEYAYNAWKGGLDLVKSDENLT NQKFNRFEERCRKTIKMMKKIEGETGEKKLYVENVTAETKEMIRRAKFIQDNGGNCAMID IVVSGWSAFQTLRNEGLDLVIHCHRAGHGMFTENPEHGMSMLTVAKIARLIGGDGLHIGA VFGKMHGKKDEVTQIKEEIEHKLILKENWGKIKPMFSVCSGGIHPGTLPKLIKTMGNDII CQAGAGVSAHPLGVEAGAKAMKQALDASLKHINLKEYSKNHKELRVAIERWGYLK >CG_2015-01t_scaffold_89495_1Pacearchaeota ------------------------------------------------------------ -------------------LFETNNMSQILSSIAGNIFGMKAVKGLRLEDVQWPGKMMKG FKGPNFGIKGVQKLFKISNRPLIATVPKPKVGYYSEEHARIGYDAWTGGVDLLKDDENLS SQKFNPFEKRLKLCMKMRDRAEKETGEKKSYLINITAETNEMLKRLKMVKDYGNEYAMID TLTAGWAATQTICNKEEKIIMHFHRAFHSAFDRNPLHGVSMKVLCSIARLQGADQLHIGG -LGKLAGDKEEVRNNYEKCSKEDMLAQNWHGMKKTLSVGSGGLHVGILKPLMDLLGTNIC IQVGGGIHGHPNGTHDGAVAVRQSIDAYLKCKTLDEYAETHKELKIALEKWGRKV >CG_4_9_14_3_um_filter_150_scaffold_9172_2Micrarchaeota ANITRGPGYRPTGKDMVVVYRVEAG-RMPIGKVAEHLAAESSIGTWTDIATVNPASAARL RPHVYSIARRYVKIAYPEELFERGSIPQILSSVAGNIFGMKEVKRLRLVDLSFSNGMLKS LPGPAFGIDGVRSKAGVTDRPMVGTIVKPKVGLSAAEHAKVAYDAWVGGLDLVKDDENLT SQPFNTFQARMRETFKAVRTAEQITGQKKTYVPNVTAETGEMIKRAKLVKKLGGTTVMVD ILTAGWSGIESLRQANLGLVIHGHRAGHAALTRDPTHGISMLALAKLSRAAGIDQLHVGT AVGKMNGAADEVQRIQQAIS-I-YLKQDWGTMKPTLAIASGGLHPGQVDRLIDRMGTNIV AQFGGGCHGHPCGTVAGAKAIRQAVDARMSGVPLKDYARDHLELNQALQKWGN-- >CG10_big_fil_rev_8_21_14_0.10_scaffold_1834_23Micrarchaeota ---MHELDYKPSRSDVIAEFYVEP--KIKLNKAATHIAGESSIDTWSDIKTLSDSVIKRL APHVFYINHN-IRIAYPSELFEKGNICQILSSIAGNVYGMKAINHLRLQDIHFPKKIVTS FPGPRMGIIGLRKLMNVNKRPFVGTIVKPKLGLNPKQWADVAYQAWSGGLDIVKDDENLT SMTFNNFKKRIELVLKLLEKAEKETGERKIYFPNISAETFEMLRRMDYVRERGGKAVMVD LLTVGWAGTHTLRNKSQGLAIHCHRAGHAALTRTPRHGMSMLLIAKLARMIGVDSLHIGT AVGKMEGAPDEVLKIEQEIEHH-VLGQKWYRVKPTLAVASGGLHPGGIPLLLKRMGKDIA MMFGGGVHANKYGTKAGAMSVRQALYASLKNIPLEKYAKKHKELEFVVKKWGVP- >CG10_big_fil_rev_8_21_14_0.10_scaffold_3334_6Micrarchaeota -MQFINLKYKPSKEDLVVEFYVEP--NVSMEKAANAIAGESSIGTWTKVTTMERYIAENL RPTVFSIKGKYIRIAYPTKLFELNNMSGILSAIAGNIFGMKDLNNLKLIGIDWPDKIIKS FKGPKYGIDGIRKLLKVKKRPLIGTIIKPKIGLSYKKHAKVAYEAWIGGCDIVKDDENLT NQLFNPFNKRVVETLKMRDKAEKETGEVKVYIPNVTAEVDEMKKRAHFIKKHGGRYMMID VVTVGFAGLQTMRDENLNLVIHGHRAMHAALTRGR-MGISMFALAEIYRLIGVDQLHIGT IVGKMEGSREEVRDIKNELSKKHVLETRWGKIKPIFPVCSGGLHPGHVDKLIKNIGNNII IQMGGGIHWNPRGTKYGAMGARQAVEAIMQKISLKKYAKTHRELKEALDRFG--- >UBA119contig_8444_4Woesearchaeota MVLVVDKKYKPKQTELVCEYLVKPS-HITIEFAASQIAAESSIGTWTTIQTMKPRIAKRL APHVFSIERKLIKIAYPLDLFEIGNMPQLLSAIAGNIFGMKILDELRLEQVSFPKKYLKG FKGPEYGIQGVREVIKVEKRPLLGTIVKPKVGLNPEEHAQVAKEAWAGGLDIVKDDENLT DMNFNNFKKRIKKTLEKRDMVESLTGEKKIYMPNVTAEPLEMLRRAEFVREQGGEYVMID VLTAGFGAVQTLR--AKGLVIHAHRAMHAALTRHSGHGISMLALAQFLRLIGVDQLHIGT AVGKMSGKKKEVKTIFNEIE----MKQDWGNIKPVMAVCSGGLHPLLMPDLVKIFGNDAI FQFGGGCHGHPKGTRAGARALRQALDAVLLKIPLEKAAEERKELKQAFDKWGD-- >CG13_big_fil_rev_8_21_14_2.50_scaffold_6700_4CPR --MRQDLKYKPKKNEVIALYYLEPR-GVSFEEVANHLAGESSIDTWSDILTLSPSLASKL KPHVFFLNKKIIKVAYHLDLFEINSIPQILSALAGNIFSLETIKNLKLLDLSFPRELIKQ FKGPKFGLNGIRKLLRVKNRPLIGTIIKPKVGLTPEQQAKVAYEAWSGGCDLVKDDENST NQKFNQFKERAKLVLKARKKAEKETRERKMYLANITSPTEEMLKRAKFVKDLGGEYVMID IIPVGWTALQTLREADLNLVLHGHRCMHSVFTRDLKHGISMLVIA--------------- ------------------------------------------------------------ ------------------------------------------------------- >rifcsphigho2_01_scaffold_7865_10Woesearchaeota MSIYGNLKSKPKSTDVVVQYKATPVKGVPLKRLCEYIAGESSIGTWTKISTMNPKIAKTL KPHIFYVNEK-IKIAYPEQLFETGNMPGIMSSIAGNIFGMKEIKGLRFEDFALTKKHVKA FRGPQFGVQGIRKMTGVKKRPFVGTIVKPKVGLTSAQHAKVAYESWAGGLDIVKDDENLT SMTFNQFDKRMKLTFRARDQSEKETGEKKLYLANITAPYNEMIRRAKIVKKLGGEYIMYD VLTAGWSALHGVRDFDNKLAIHGHRAMHGALTRNHDMGISMLVLAKTYRLIGVDTLHIGT AIGKMHGSENEELIIEQEIESQNILAQDWHGLKPVLAVASGGLSPLQLPQVMKVMGNDIV MQGGGGVHGHPEGTMKGATAFRQAVDAAMENISLSEYAKTHKELAKAIEKWG--- >rifcsplowo2_01_scaffold_4893_79Micrarchaeota VNYFEK--YAPGKNDLVCEYRVEPK-GITLERAAQHIAAESSIGTWTGLATLTQGIIRRL KPKIFFIDKR-VRIAYPADLFEKGNMPQVLSSIAGNIFGMDVLKKLRLEDITFPKSIMGS FSGPEFGIGGIRKMTGVKKRPLVGTIVKPKLGLGSREHAKAAYNAWVGGCDIVKDDENLT SQSFNRFDERLKETLKMRDRAESETGEKKMYMINITAETGEMMKRAKKAKKAGNEYAMID MLTSGFSALQTLRASELGLVIHAHRAGHAAFTRGK-HGISMLTIAKIARLIGVDQLHIGT ALGKMEGSPEEVVGIEQEIEKGHVLEQDWFGIKPVFAVASGGLYAGAVPKLIRIMGSNII IQAGGGVHGHKGGSVSGAMSMRQAIDAAMNKIPLKEYAEKHRELRQAIEQWGMV- >CG10_big_fil_rev_8_21_14_0.10_scaffold_34346_3Micrarchaeota -MKYIDLSYKPNKNDLITSFYVEP----ATDEVFEAIASESSIGTWTFLSTLTPKVRKKL GAKVFFIDKK-VKIAYPSALFEKGNMPQILSSIAGNIFGMKAIKNLRLEDIKWPYVLIKS FSGPKYGIQGIRKILGIKKRPLVGSILKPKVGLSPKEQANNAYLAWKNGIDVIKEDENLG DLEFNRFKERVIETLKMKRKAEKETGEGKAYIPNITAETNEMIKRAKFVERAGGRHVMVD IITVGWSALQTLRNENLNLILHGHRSGHAAMTKGK-HGISMLVIADIARLIGIDQLHIGT VVGKMIGEKEEVVHIGEEIEKIHTLAENWYNVKPVMAICSGGLHPAHIPHLVKYLGNDII CQFGGGLWGHKMGGEAGARAIRQALEATLNEIPLKDYAKKHIELKTALEQWT--- >CG10_big_fil_rev_8_21_14_0.10_scaffold_87261_1Micrarchaeota MLEYIN--YKPSSTDVVVEYYVEPN-GISLEEAAENLAAESSIGTWTDISTMNQRVSTKL RPWVFDIDKEEIRIAYHGDLFEEGNMPGILSGIAGNIYGMKCVRRLRLQDIEFPKFLIKS FKGPAFGIDGVRKLLKIRDRPLTGTIIKPKMGLNYFEHANVAYDAWTGGLDVVKDDENLT SQKFNPFKERVEITLQKLDKAEQRTGEKKAYMPNITAETNEMLRRADFIKQCGGTYMMID VLTAGFAGVQTLREANKGLVIHAHRAGHAAVTRNTKHGITMLALAKMFRMIGVDQLHIGA AFGKMEGAPSEVRDICDEIEPRHILEQKWYDVKPVFAVCSGGLHPGKIGELVNYMGKNIV IQAGGGVHGHPGGTRKGAMAMRQAVDAAMKGIHQGVYARSHDELRKAIGLWGV-- >UBA153contig_35505_2Woesearchaeota IYGNLELKYKPKPTDVVLDYLVTPS-GITVARAAEMIAGESSIGTWTRISTMNPVIAQTL KPHIILADEKEVKIAYPETLFEPGNMPGILSSIAGNIYGMKGVAKLRLEDIHFTKKLVES FPGPGFGIPGIRKFTGVHNRPLLGTIVKPKVGLSAQQHADVAYQAWTGGLDVVKDDENLV SMSFNNFDERMKLTFKARDKAERETGEKKFYLANITAETDEMKRRARVVKKLGGEYIMID ILTAGWAAVQTVRNEKLGLAIHAHRAMHGALTRDHRHGMSMIAIAKLARMIGVDQLHITA AIGKMEERVDEALAIERVVEVH-MLNQEWYGVKPVLAVASGGLSPLSTEAVVKLMGKDLV VQYGGGCHGHPDGTFAGAKAIRQSWEAVAKGVSLEQYARTHHELARALEKWGSD- >mol-32-15ef-016606_7Micrarchaeota VLEYIDLRYKPGSHDLLCEYYLEPN-GMSIEKAAEHIAGESSIGTWTDLCTMNKEIAHRL KPRIYSIDAKLVKIAYPKELFEERNMPQILSSVAGNIFGMKAVSALRLEDISFPKSLVKS FPGPVYGIDGIRKLAGVKKRPLVGTIVKPKVGLNADQHAKVAFDAWMGGVDIVKDDENLT SQRFNVFEDRVKKTLEMRDEAEQITGERKFYMPNVTAETDEMITRAEFVKQHGGEYIMVD VLTVGWSGLQTLRNFDLKRVIHAHRAGHAALTRNKRHGISMLVLAKICRLIGVDQLHIGT IVGKMEGSQLEVEETEDEIESAHALEQDWLDIKPVFAVCSGGLHPGLLPQLVDMLGNNII CQFGGGLFGHPSGAFAGARAIRQSLDATMKKIPLKRYAESHPELRQALDYFSSP- >RBG_16_Archaea_36_9_RBG_16_scaffold_36_87Micrarchaeota -MSYIDLKYKSSKDDLICLFRVEPAGKT-IKEAAENIAAESSIGTWTDV-KTMKPGIKKL GAKVFEIKGKYVKIAYPLELFEPGNMPQILSSIAGNIFGMKSVKNLRLEDIDWPNNLISS FKGPLYGINGIRKILKIPKRPLCGTIIKPKLGLNEEEHAKVAYDAWVGGMDIVKDDENLS NQSFNHFTKRVEETLKMRNNAEQETGERKVYMPNISAETDTMLERANFVKEIGGEYAMVD ILTVGWSALQNLRNEDLKLVLHAHRAGHAAFTRNKKHGISMVVIGDIARLIGVDQLHIGT VIGKMEGIKEEILTTEDEIENNNCLAEDWLNIKPVFAVCSGGLHPGLVPYLVKTLSNDII IQAGGGIHGHKLGTTAGARAMRQAIDATLNKVTLKEYSKNHKELNIALKQWM--- >CG08_land_8_20_14_0.20_scaffold_9016_6Micrarchaeota -MSYINLKYKPKGNDLIAEFSVEP-----CNQVAEAIAGESSIGTWTSLTTLTPQIRRKL AAKVFSIDKK-IKIAYPVELFEPGNMPQILSSIAGNIFGMKLIKNLRLEDIIFPKELIKS FKGPEIGIKEIRKLLRVKKRPLVGTIIKPKLGLSSKEHAKVAYEAWTGGLDLVKCDENLT SQKFNLFETNIKETLKMLKKAEKITGEKKIYVPNVTAETKEMIKRAKFVKKSGGNCVMMD IITEGWSGLQTLRNKKLGLIIHAHRAGHGMFTRGK-HGMSMLTIAKIARLIGVDQIHIGT AVGKMESGEEETEEINLFLK------SKWYHIKPVFSICSGGLHPGSIPKLVKMLGKDII IQAGGGVHGHPQGTRAGAKAMRQALDATMQNVSLAEYAKKHPELKAALRKWVK-- >rifcsplowo2_01_scaffold_135759_3Micrarchaeota -MNYINLKYKPTKEDLVVEYRLET--RNPFEKTCEQIAAESSVGTW----TEVTTMKKRL KPNVFSINKKEIKIAYPIELFEKGNMPEILSSIAGNIFGMNAITNLRLQDIHFPRALIKS FKGPEFGINGIRKMLKVKNRPLLGTIIKPKLGLNPKQHAKVAYESWIGGCDIVKDDENLS SQSFNQFKSRIIETLNLKEKAEKETGEKKVYLPNITAETNEMVKRGEFVKKHGGNFLMLD ILTLGWAALQTVRNHEFKLPLHAHRAGHAILTRNKKHGISMLTIAKIARLIGVDTLHIGT AVGKMEGPMKEIEAIEEEIEQEHILEQKWYDKKEVFAVCSGGLHPSLIPKLVNMLGFNII IQLGGGLHGHPKGSFYGAQAARQAIEATMQGVSLKEYAKNHTSLKEALEHFKNK- >gwa2_scaffold_25721_1Micrarchaeota ----------------VCQYYIEP----SLEEAAEQIAAESSIGTWTELSTMKPEVASRL KPTIFSMKKNEIKIAYSADLFENGNMPQILSSIAGNIFGMKLVKNLRLEDISFPECIVKS FKGPEFGINGIRNLLKIKERQLVGTIVKPKVGLSSKEHAQVAYEAWTGGLDIVKDDENLT NQDFNPFKSRVMQTLQKRDKAEDITGEKKMYMPNITAETNEMLKRAEFVKDLGGEYVMID IITAGFSALQTLRDADLGLVIHAHRAMHAAITRTKQHGISMLAIAKISRLIGVDQLHIGT AVGKMEGKAIETEEIEHEIESGHILEQKWRDIKPVFAVSSGGLHPGLIPKVAHILGKNVI MQFGGGVHGHPLGTRKGAIAVRQALDAYLKEIPLDIYAKSHSELKIALDKWGM-- >rifcsplowo2_01_scaffold_304868_2Pacearchaeota VLEYIDLRYQPSNNDLICSFYVEPDNCS-LEKAARNVALESSIGTW----TEI-EIDKKL KPNVFSINRREIKIAYPSELFEPGNIPQILSSVAGNIFGMKIVKNLRLNDVAFPYKIIKS FKGPEFGIEGIRNLLKIKSRPLVGTIIKPKLGLVTKKHADVAYEAWLGGCDIVKDDENLT NQKFNPFRERVIKTLEKRDKVQELTGEKKIYMPNITAETNEMIKRAEFVKEHGGEYIMVD ILTVGWSALQTIRDLNLKLVIHAHRAMHAAITRNQKHGISMLTLAKISRLIGVDQLHIGT VVGKMEGTAEEVEEVEDEMEPTHILEQRWYHIKPVFAVCSGGLHPGHIPELVKILGKNII IQMGGGIHGHPSGTRIGAVAARQAVDAVMRRVPLSVFARDHYELKQALEHWLG-- >CG10_big_fil_rev_8_21_14_0.10_scaffold_13451_3Micrarchaeota KISYIDLKYKPKKSDLMCLFYVEPQ--VSMKEAAGAVAAESSVGTWTELTTETKRIAR-M RAKVFEI--KYIKVAYPAELFERENMPEILSSIAGNIFGMKIVKNLRLEDIEFPESIIKS FKGPAFGINGIRKLLNVKGRPLVGTIVKPKLGLNSKEHAKVAYDAWLGGCDIVKDDENLS SQDFNKFRERVMYTLIARDKAEKETGEKKAYLPNITAETEEMIDRANFVRHNNGNYLMVD IITMGWSALQTVR--NFKLPLHAHRAGYAALSRNKKHGISMLTIAKIARLIGVDSLHIGT AVGKMEGPTKEVAEIEEEIESH-ILEQKWYNIKPVMAVCSGGLYPTLVPSLIKMFGKNII IQAGGGIHGHELGTTAGATAMRQAVEASLKGISLREYAKNHKELYIALRQWH--- >07M_4_2014_scaffold_1143_16Micrarchaeota QLEYID--YKTSKKDLVVTYRAEPDKIS-LEDACEQIAAESSIGTWTDISTMSPEIGKRL KPHVFFIDKNIVKIAYSSDLFEEGNMPQIWSAVAGNIFGMKVVDNLRLVDIEFPRNIVKS FKGPVFGIEGIRKLLKAKYRPLCGTIIKPKVGLDEKNHARVAYEAWGGGLDIVKDDENLT SMVFNNFEKRILETLKLRDKAEAETGEKKVYMANVTSESKTMLKRARFVRDCGGEYVMVD ILTLGWSALQTLIDEGMGLVIHAHRAGHAALTRNKRHGISMKVIAKTARLIGADQLHIGT VVGKMEGEREEVLDIRDSLT------EDI-GIKTTFPVASGGLHPGHVPSLVKIFGNDII MQFGGGCHGHPQGTASGAKAIRQAISATGDKVSLGDYAKSHKELREALLKWKNA- >CG07_land_8_20_14_0.80_scaffold_42665_1Micrarchaeota GCADLK--YKPTKEDLICEYYVEPNKIS-LEQACENIAGESSIGTWTTIATMSPEIARRL KPHVFSIDKSEVKIAYPQELFEAGNMPEILSSIAGNIFGMKALKNLRLQDISFPKKIISS FKGPKFGIPGIRKLLKVKDRPLCGTIVKPKVGLTASQHARVAYEAWAGGLDLVKADENLG SMVFNNFYERIKETLKLKEKAERETGEKKLYMANVTAESREMLKRAQAVKKLGGESVMID ILTAGWSALQTLREADLGLIIHAHRAGHAAFTRDPRHGISMLTIAKIARLIGVDEIHIGA ILGKMFGPALEVKHIGEEIEAAHMLEQRWYGIKPIFSVCSGGLHPGILPPLIKIMGNNII CQCGGGCHGHPQGTRAGAAAIRQAVEATMKKIPLREYAKKYKELKLALEKWT--- >rifcsplowo2_12_scaffold_17178_2Micrarchaeota -MDYIDLKYKPKKEDLVCLYRVEPSKNISFERAANTIALESSIGTW----TDLITMNKKL RARVFEINKKMIKVAYPIELFEMGNMPGILSGIAGNIFGIKDVRNLRLEDVRFPKKLIKS FKGPKFGIKGIRKLLRIKKRPLVGSIIKPKIGLRTKEHAKVAYEVWKGGLDIVKDDENLV DLNFNRFKDRVIETLKMRDKAEKETGERKVYMPNITAETNEMIKRAKFVKDNNGRYVMVD IISLGWSSLQTIRNENLGLVIHAHRAGHAALTRNKKHGISMLSIAKIARLIGVDQLHIGT VVGKMEGGKKEVKDIEEEIEQGHILEQKWYNIKPVFAVASGGLHPGLVPKLVNILGNNII MQFGGGVHGNPLGTESGARAVRGSVEAVMKNISLKEYSKKNKELEIALKKWKNI- >rifcsplowo2_01_scaffold_93746_1Micrarchaeota QLKYLDLKYKPAKTDLVCEYYVEPSRMS-IEEACENIAAESSIGTWTDISTMSPYIARKL KPHIYSINKKEVKIAYTCDLFEKGNMAEILSSIAGNIFGMKAIKNLRLQDISFPEKLIRS FRGPKYGIEGIRKLLKVKERPLCGTIIKPKVGLNEKGHARVAYEAWTGGLDIVKDDENLS SMTFNNFYKRIAETLKMRDKAEKETGEKKMYMANITAETDEMLKRAKFIKAHGGEYMMVD IVTMGWSALQTVRNDNLNMVIHAHRAGHAMLTRNPKHGMSMMTVAKCARLIGVDQLHIGT AVGKMFGSRIEV-DIEKEIEKKNVLNQKWFDIKPVLAVASGGLHPGMIPEVVKVMGKDVV IQLGGGVHWHPKGTKYGAMAARQAIEAVMKNISLREYAKTHQELKAAINKFGITK >rifcsplowo2_01_scaffold_45645_1Micrarchaeota ------------EKN-----E--------------------------------------- -----------IKIAYPQELFEAGNMPQVYSAVAGNIFGMSLVKGLRLQDISFPWDIMKK FHGPKFGIDGIRKLLRVKNRPLIGTIIKPKVGLDARHHAKVAYDAWYGGLDIVKDDENLT SMSFNKFDDRIEETLRLRDKAEQDTGERKVYMANVTAETNEMIKRAEFVKKCGGEYVMID IITTGWSGLQTLRQANLGLVIHAHRAGHGAFTENPHHGISMLTIAKTARLIGVDQLHVGA IVGKMKGGKEEVRMIGEEIEAEHVLEQKWYNVKPVFAVCSGGLYPGTIPSVVKAMGNNVI IQAGGGVTGHPDGVFYGARAMRQAVEAIMQNIPLREYAKNNIELYKAINKWGIR- >rifcsphigho2_01_scaffold_346573_2Micrarchaeota -----------------VLFR----------------SNSCSVGTWTSLSTETKEVLKKI QARVFEIDGNYIKIAYPLELFELGNMPQIMSSIAGNIFGMKAVKNLRLQDVEFPKKMIHS FKGPKYGIEGIRKILKIKDRPICGTIIKPKLGLNAKQHAKVAYDAWVGGIEIVKDDENLS NAKFNNFKERVIETLKMRDKAEKETKEIKIYMPNITSETDEMLKRAEFVKKNNGEYVMLD IITLGWSALQTFRNKDFNLILHAHRAGYAALARNKKHGISMLTIAKIARLIGVDQLHIGT AVGKMEGSKKEVMDLEEEIESEHILEQKWYNIKPVFAVASGGLHPLLVPKLNKILGNNVI MQFGGGCHAHPQGTQAGAKAIRQAIDATMQRIPLKEYSKKHEELKMAIEKWG--- >rifcsplowo2_01_scaffold_49323_11Micrarchaeota -MKYKDYRYRPSKNDLICSFYIEPNKVS-FKEAAGAVASESSVGTWTSLSTETKEVLKKI QARVFEIDGNYIKIAYPLELFELGNMPQIMSSIAGNIFGMKAVKNLRLQDVEFPKKMINS FKGPKYGIEGIRKILKIKDRPICGTIIKPKLGLNAKQHAKVAYDAWVGGIEIVKDDENLS NAKFNNFKERVIETLKMRDKAEKETKEIKIYMPNITSETNEMLKRAEFVKKNNGEYVMLD IITLGWSALQTFRNKDFNLILHAHRAGYAALARNKKHGISMLTIAKIARLIGVDQLHIGT AVGKMEGSKKEVMDLEEEIESEHILEQKWYNIKPVFAVASGGLHPLLVPKLNKILGNNVI MQFGGGCHAHPQGTKAGAKAIRQAIDATMQRIPLKEYSKKHEELKMAIEKWG--- >rifoxyd1_full_scaffold_54939_2Micrarchaeota --LGIDLKYKPNNKDLVAEYYIEPK-GMAFEKAASNVALESSIGTWTAISTMNPGIARKL KPSVYYINKKSIKIAYPEELFEPGNMPEILSSIAGNIFGMKAVRNLRLQDINFPKKILDS FHGPLFGISGVRKLTGVKERPLVGTIVKPKVGLNEKQHAKVAYDAWAGGLDVVKDDENLS SMSFNNFRKRMYETFKLRDKAEKETGEKKIYMPNITAETMEMLKRADIVDECGGEYIMVD ILTIGWAGLQTVRNYKIKKVIHAHRAMHGALTRNPK------------------------ ------------------------------------------------------------ ------------------------------------------------------- >rifcsplowo2_01_scaffold_51891_2Micrarchaeota MLSYINLRYKPKKEDLVVEYYLEP----SLERAACNVAAESSIGTWTDVKTLNKKIIKEY KPNVFYIDEK-VKISYNSELFEKGNMPQILSSIAGNIFGMRIIKNLRLEDIHFPKSIIKS FKGPEYGIKGIRELLRIKNRPLTGTIIKPKLGLNAKQHADVALEAWLGGVDIVKDDENLA DMKFNKFEKRVEETLKARNRAEKETGEKKIYMPNVTAETNEMLRRAKYVKKLGGEYVMVD IITAGFSTLQTLRDAKLDLVIHAHRAGHAAFTRHK-HGISMLAIAKIARLIGVDQLHIGT AVGKMDSDVKEVTDIGDEIETGHILEQRWYNIKPVLAVASGGLHPRSVPKLLQRMGKNIV IQAGGGIHGHPDGTRAGATAIRQAVDATMRKISLREYAKTHIALKRALERWH--- >CG_2015-13_scaffold_41721_2Micrarchaeota LLNYIDLKYKPLKTDLICKFYLEPN-NITIEKAASHVALESSIGTWTDICTMNKRIAKTL KPSVFYINKK-IKIAYNHNLFEKNNMPQILSSIAGNIFGMSAVKNLRLIDISFPESIIRS FKGPKFGIKGIRKITSIKKRPLTGTIIKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMIKRAKFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRNEKHGISMLAIAKIARLIGVDQLHIGA IVGKMTGTKSEVKDIGEDIEGTHVLEQRWYNIKPTLAVCSGGLHPGCIPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKTYAKSHEELKKALDKFGIIS >cg1_0.2_scaffold_10457_2Woesearchaeota LLNYIDLKYKPLKTDLICKFYLEPN-NITIEKAASHVALESSIGTWTDICTMNKRIAKTL KPSVFYINKK-IKIAYNHNLFEKNNMPQILSSIAGNIFGMSAVKNLRLIDISFPESIIRS FKGPKFGIKGIRKITSIKKRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMLKRAKFVKANNGRYVMID IITSGWAALQTLRNADLGLVIHAHRAGHAAFTRNEKHGISMLAIAKIARLIGVDQLHIGA IVGKMTGTKSEVKDIGENIEGTNVLEQKWHNIKPTLAVCSGGLHPGCVPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKNYAKSHEELKKALDKFGIIS >CG_2015-19_scaffold_56610_2Micrarchaeota ----------------------------S------------SIGTWTDIGTMNKRIAKTL KPSVFYINKK-IKIAYNENLFEKGNMPEILSGIAGNIFGMSALNNLRLLDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMIKRAKFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRNEKHGISMLAIAKIARLIGVDQLHIGA IVGKMTGTKSEVKDIGEDIEGTHVLEQRWYNIKPTLAVCSGGLHPGCIPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKNYARDHKELKRALDKFGIIS >CG10_big_fil_rev_8_21_14_0.10_scaffold_133078_1Micrarchaeota ---------KPS------VFYINK-----KE----------------------------- -----NI----IKIAYNENLFEKGNMPEILSGIAGNIFGMSALNNLRLLDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMIKRAKFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRNEKHGISMLAIAKIARLIGVDQLHIGA IVGKMTGTKSEVKDIGENIEGTNVLEQKWHNIKPTLAVCSGGLHPGCVPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKNYAKSHEELKKALDKFGIIS >CG03_land_8_20_14_0.80_scaffold_60991_1Micrarchaeota LLNYIDLKYKPLKTDLICEFYLELNNIT-VEKAASHVALESSIGTWIDICTINKRIVKTL KPSVFYINKKIIKIAYNENLFEKGNMPEILSGIAGNIFGMSALNNLRLLDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMLKRAKFVKANNGRYVMID IITSGWAALQTLRNADLGLVIHAHRAGHAAFTRNEKHGISMLAIAKIARLIGVDQLHVGA IVGKMTGTKSEVKDIGEDIEGTHVLEQRWYNIKPTLAVCSGGLHPGCIPSLMRIMGNNIV MQFGGGCHG---------------------------------------------- >CG02_land_8_20_14_3.00_150_scaffold_91997_1Micrarchaeota -------------TDLICEFYLELNNIT-VEKAASHVALESSIGTWIDICTMNKRIVKTL KPSVFYINKKIIKIAYNENLFEKGNMPEILSGIAGNIFGMSALNNLRLLDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMLKRAKFVKANNGRYVMID IITSGWAALQTLRNADLGLVIHAHRAGHAAFTRNEKHGISMLAIAKIARLIGVDQLHIGA IVGKMTGTKSEVKDIGENIEGTNVLEQKWHNIKPTLAVCSGGLHPGCVPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKNYARDHKELKRALDKFGIIS >CG_4_10_14_0.2_scaffold_37340_1Micrarchaeota --------MNKR----IAKT-LKP------------------------------------ --SVFYINKKIIKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNITAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRDEKHGITMLVIAKIARLIGVDQLHIGA IVGKMTGTKHEVKDIGENIEGSHILEQKWYNIKPTLAVCSGGLHPGCVPSLMKTMGNNIV MQFGGGCHGHPDGTKAGAIAIRQAVDSAIKKIPLKNYARDHKELKRALDKFGIIS >CG08_land_8_20_14_0.20_scaffold_203273_1Micrarchaeota -------------TDLICEFYLELN-NITVEKAASHVALESSIGTWIDICTINKRIVKTL KPSVFYINKK-IKIAYNENLFEKGNMPEILSGIAGNIFGMSALNNLRLLDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMIKRAKFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRNEKHGIML------------------- -LQRL------------------------------------------------------- -----------------------------QDL----------------------- >CG11_big_fil_rev_8_21_14_0.20_scaffold_98546_1Micrarchaeota ------------------------------------------------------------ --------------------------PQILSSIAGNIFGMSAVKNLRLIDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMIKRAKFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRNEKHGITMLVIAKIARLIGVDQLHIGA IVGKMTGTKHEVKDIGENIEGSHILEQKWYNIKPTLAVCSGGLHPGCVPSLMKTMGNNIV MQFGGGCHGHPDGTKAGAIAIRQAVDSAIKKIPLKNYARDHKELKRALDKFGIIS >cg1_0.2_scaffold_44525_1Micrarchaeota LLNYIDLKYKPLKTDLICEFYLELNNIT-VEKAASHVALESSIGTWIDICTINKRIVKTL KPSVFYINKKIIKIAYNENLFEKGNMPEILSGIAGNIFGMSALNNLRLLDISFPESIIKS FKGPRFGIKGIRKITGIKNRPLTGTIIKPKLGLNEKEHAMVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNVTAETNEMIKRAKFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRNEKHGITMLVIAKIARLIGVDQLHIGA IVGKMTGTKHEVKDIGENIEGSHILEQKWYNIKPTLAVCSGGLHPGCVPSLMKTMGNNIV MQFGGGCHGHPDGTKAGAIAIRQAVDSAI-------------------------- >CG01_land_8_20_14_3.00_scaffold_28014_2Micrarchaeota LLNYIDLKYKPQKTDLVCEFYLEPNNTT-IENAASNVALESSIGTWTDIGTMNKRIAKTL KPSVFNKKENIIKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGQTKIYMPNVTAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRDEKHGITMLVIAKIARLVGVDQLHIGA IVGKMTGTKSEVKDIGEDIEGTNVLEQKWHNIKPTLAVCSGGLHPGCVPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKNYARDHKELKRALDKFGIIS >CG03_land_8_20_14_0.80_scaffold_21801_2Micrarchaeota LLNYIDLKYKPQKTDLVCEFYLEPNNTT-IENAASNVALESSIGTWTDIGTMNKRIAKTL KPSVFNKKENIIKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNITAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRDEKHGITMLVIAKIARLVGVDQLHIGA IVGKMTGTKHEIKDIREDIEGTYMLEQKWYNIKPTLAVCSGGLHPGCVPSLMRAMGNNIV MQFGGGCHGHPDGTKAGAIAIRQAVDSAIKKIPLKNYARDHKELKRALDKFGIIS >CG_2015-18_scaffold_160514_1Micrarchaeota ---------------------------------------ESSIGTWTDIGTMNKRIAKTL KPSVFYINKKIIKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNITAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRDEKHGISMLVIAKVARLIGVDQLHIGA IVGKMTGTKHEIKDIREDIEGTYMLEQKWYNIKPTLAVCSGGLHPGCIPPLMKVMGNDIV MQFGGGCHGHPDGTKAGAMAI---------------------------------- >CG_4_10_14_0.8_um_filter_scaffold_69571_1Micrarchaeota LLNYIDLKYKPLKTDLICEFYLELN-NITVEKAASHVALESSIGTWTDIGTMNKRIAKTL KPSVFYINKK-IKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGETKIYMPNITAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLIIHAHRAGHAAFTRDEKHGISMLAIAKIARLIGVDQLHIGA IVGKMTGTKHEVKDIGENIEGTHVLEQKWHNIKPTLAVCSGGLHPGCVPSLMRIMGNNIV MQFGGGCHGHPDGTKAGAMAIRQAVDSAIKKIPLKNYAKSHEELKKALDKFGIIS >CG_2015-13_scaffold_88435_1Micrarchaeota LLNYIDLKYKPLKTDLICEFYLELNNIT-VEKAASHVALESSIGTWTDIGTMNKRIAKTL KPSVFNKKENIIKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGQTKIYMPNVTAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLVIHAHRAGHAAFTRDEKHGITMLVIAKIARLIGVDQLQAFN TEGFEDMRRQQVGRIANVVRRHQYLGRQAAEVDELLELLQTRAHQRLDLEAVRFLLCRLD LRLEVRLLADD---------LLN--DE--SGEPLHQNARTALR------------ >CG_4_9_14_0.2_um_filter_scaffold_8702_1Micrarchaeota LLNYIDLKYKPLKTDLICEFYLELN-NITVEKAASHVALESSIGTWIDICTINKRIVKTL KPSVFYINKK-IKIAYNENLFEKGNMPEILSSIAGNIFGMSVVKNLRLLDISFPESIVKS FKGPRFGINGIRKITGIKNRPLIGTIVKPKLGLNEKEHAKVAYDAWTGGLDIVKDDENLT SMKFNNFNKRVIETLKMRNKAEKETGQTKIYMPNVTAETNEMLKRARFVKANGGRYVMID IITSGWSALQTLRNADLGLIIHAHRAGHAAFTRDEKHGISMLVIAKVAR----------- ------------------------------------------------------------ ------------------------------------------------------- >rifcsplowo2_01_scaffold_118545_1Amesbacteria QPQYLSEGYRPKPSELICTFRVENPSELTFREACGGVAAESSTGTWTKL-TTVKPYMDRL GAKVYRIRGNTIQIAYPPELFEAGSVPNILSSIAGNVFGLGGLKNLRLEDIYFPKSITKG FKGPKYGIAGIRKFMRLKKRPPVGTVIKPKLGLKTKDHLKFCEEAWQGGCDFLKDDENLS GQKFNPFYERALKGLDIADRVASETGEKKVFFLNVTAETNEMVKRAEFVEKHGGKYVMID ILTAGWASFQTLRDLNLKCAIHCHRAQHAAFDRNPKHGIAMLVLARLVRLIGGDQLHVGT AASRLQ-------------------------------------AAACIPAMCR----NCL KFSAMILSSRP-----AAESM--AI----QMERLRAPGR---------------- >CG08_land_8_20_14_0.20_scaffold_1314_c_7Diapherotrites YENFID--YKPKKTDLVCKFRLKPN-GFTFEKAAGGVAAESSIGTWTEL-TTEKEYMKKL AGIVFELKNPYFKVAYPIELFEEGNMPNTLSGIAGNLFGLKEIEKVRFEDIIFPKKLANS FKGPKYGIDGVRKLTRVKNRPLVGTIVKPKLGLNVKDHAKVAYDAWFGGCDVVKDDENLS SQAFNRFEPRLKETLRLKRKAEKETGEFKAYMINVTASGKEMLRRAKLVEDSGNEYLMVD ILTVGWAALQDLRDINVNLVMHAHRAGHAALTKDKEHGISMPVLAKCSRLIGVDQLHVGT AVGKMSEGKTEVLENIKACK------EELYGIKKVMPVASGGLHPGHVEDLYKIFGKDFI AQAGGGIHGHPQGTVKGAMAMRQAMDAVVHGYTLKEYAKKHNELRIALEKWD--- >AR1-0.1_scaffold_8297_1Pacearchaeota --------LKLG----ATVFYLKR-----EE-----------EGFL-------------- -----------CKVAYPNELFEYDNMPNILSSVAGNIFGMKEIKNLRLEDIHFPIEIINS YKGPKFGIDGIRRLLKVYDRPLLGTIVKPKIGLNSYDHAQVCYDAWVGGCDVVKDDENLS SQKFNDFKTRLRETLKMKEKAERKTGEKKIYMINITAETNEMIKRAELAKKAGNEYVMVD VLTVGFSALQTLRNEKLNLVIHAHRAMHAALTKNLKHGISMKVLVKIYRLIGVDQLHIGT GVGKMFETLEEVKENVKACT------EEMHGIKKVFPVCSGGLHPGHIPFLMKNIG-NEI IQMGGGIHGHPLGTVEGAKAARQAIDATIKNIPLEKYAEHHIQ----L------- >cg1_0.2_scaffold_115_c_6Pacearchaeota KKGYEDLNYKPKKDDLICEFYFEPNKED-YKRAAGAIAAESSIGTW-TFLTTTKPYMLKL AANVFYLKKRIAKVAYPKELFEYNNMPNILSSVAGNIFGMKEINNLRLNDIIFPKCIVKA YKGPKFGIDGVRKILNVYGRPLLGTIVKPKLGLNSRDHAQVCYDAWIGGCDIVKDDENLS SQSFNKFKKRLIKTLKMKRKAEFETGEKKIYMINITAETNEMLKRMELAKKYGNEYVMID VITVGWSALQTVRNEELNLVLHGHRAMHAILTKNLKHGISMKVLAKIYRIIGIDQLHIGT GIGKMFESLEEVKENVKACV------ENVYDLKQVFPVCSGGLHPGHVPFLIKNLGKDII IQMGGGIHGHPVGTVEGAKAARQSIDAAMKNIPLEKYAKTHIQLKEALQKYRIHK >CG10_big_fil_rev_8_21_14_0.10_scaffold_22081_3Pacearchaeota -MKYINLSYKPSKRDLVCTFSLET-----IKEVAGAVAAESSIGTWT---ATVKKYMEKL AATVFKKEGNEIKIAYPNELWELNNMPGILSGIAGNIFGMKEVDFLKLKDIEFPENIAKS FPGPKYGIEGIRKITNIQNRPLVGTIIKPKIGLNPKDHAKVAYEAWTGGCDVVKDDENLV GQNFNKFEARLKETVKLKKLAEKETGEKKVYMINVTAETKDMLKRAKLAEDYGNEYLMVD IITVGWAGLQTLRNENFKLIMHAHRAGHAALDRIENHGISMNVIARLTRLIGCDQLHIGA AVGKMFETREDVLENKKVLT------DNFYGIKKVMPVSSGGLHPGHVPDLYKIFGKNIV IQCGGGIHGNKLGTRLGAVAVRQAIDATMKGISLKEYSKTHFELRSTLNQWGTK- >UBA284contig_56696_5Pacearchaeota -MKYSDLSYKPSKKDLICEFYFES----DLKHAAGAVAAESSIGTWT---STVKEYMKNL AAVFYKINNKNIRVAYPNELFEESNLPNIMSSIAGNIFGMKEIVNLRLNNIIFPENIAKS FPGPKYGIEGIRKIVKI-KRPLIGTIVKPKLGLNTKDHAKVSYDAWIGGCDVVKDDENLS SQKFNKFEARLKETFKMKEKAEKETGEKKIYMINITAETKEMLRRAKLVQDYGNEYIMVD IITSGWSALQTLRNENFNLVLHAHRAGHAAFDKNIKHGINMKVIARLTRMIGLDQLHVGT AVGKMFETHEEVIENKNALI------DNFYGIKKVMPVSSGGLHPLMIPELYKMFGKDVV LQFGGGIHGHPDGTLAGAKTARQVLDATMKNISLKEYAKTHRELRIALEFFKNK- >CG06_land_8_20_14_3.00_150_scaffold_1834_c_8Pacearchaeota YEDYFDLSYKPAKKDLICEFYLES-NGD-LKKVSGGVAAESSIGTW----TETATMKKNL AAKVFRKIGKNIKIAYPIDLFEQGNVPDIMSSIAGNVFGMKDVLNLRLNDIKFPSEIVRS FKGPKFGIDGVRRITRVAKRPLIGTIVKPKLGLNSQDHAKVAYSAWLGGCDIVKDDENLS SQKFNRFEERLKETLKLRDKAERETGEKKSYMVNITAETGEMLHRARLAKEYGNEYAMVD IITAGWSSLQTLRNEKLKLVLHAHRAGHAALDKNLRHGISMKVLARLTRMIGLDQLHVGT AVGKMFETHEDVLENKKVLT------ENFYGLKRTMPVASGGLQPLMIPELLKIFGNDVI LQFGGGIHGHPRGTLSGARACRQALDAAMRRINLKEYSKNHAELREAVEFFG--- >CG_2015-17_scaffold_79772_2Pacearchaeota ---------------------LSD-----QTDALKKMQLYSA------------------ -----------MYGTY-----EQRNVPDIMSSIAGNVFGMKDVLNLRLNDIKFPSEIVRS FKGPKFGIDGVRRITRVAKRPLIGTIVKPKLGLNSQDHAKVAYSAWLGGCDIVKDDENLS SQKFNRFEERLKETLKLRDKAERETGEKKSYMVNITAETGEMLHRARLAKEYGNEYAMVD IITAGWSSLQTLRNEKLKLVLHAHRAGHAALDKNLRHGISMKVLARLTRMIGLDQLHVGT AVGKMFETHEDVLENKKVLT------ENFYGLKRTMPVASGGLQPLMIPELLKIFGNDVI LQFGGGIHGHPRGTLSGARACRQALDAAMRRINLKEYSKNHAELREAVEFFG--- >RifSed_csp2_19ft_2_scaffold_163929_2Pacearchaeota -MKYIDLNFHPRADDLICEFLVEP-LGIDIKLAAGALATTKS-------------YVEKL HATAFSIDGNSVRIAYPIELFESGNMPNILSSVAGNVFGLGELNNLRLNDIQLPAKLVKS FKGPKYGIDGIRKLLGIKKRCLVGTIIKPKLGLRTEDHAKVAYDAWVGGCDIVKDDENLS SQKFNKFEKRAIQTLKMRDKAEKLTGEKKVYMVNVTAETNEMIRRANLVEKLGGDYIMVD ALTVGFSALQTLRNEDLDLVIHAHRAMHAAITKNPKHGISMKVLAKLLRAVGVDQLHAGA VVGKMSESEEDVRKNCEALK------GDMFGLKKVMPVASGGLWAGSVPEILRIFGNDVV IQAGGGIHGI--GTVTGARAMRQAVDAAMQKIDLKEYSKLHKELRAALEKWKGKK >mol-32-1605-029005_8Pacearchaeota -MKYIDLEYSPEKDDLICEF-LVDALGMDMNTAAGAIAAESSIGTW----AETTTTRPKL RARVFSIEGDVAKIAYPIELFEGGNMPNILSSVAGNVFGLKELQNLRLNDITIPKKLAKS FDGPEFGIDGIRKITHVKKRCMIGTIVKPKLGLNVKEHAKISYDAWIGGCDIVKDDENLS SQDFNRFEDRIRQTLRMRDKAEKLTGERKMYMANVTAETGEMIKRAKFVKKLGGEYVMVD VLTAGFSALQSIREADLGMVLHAHRAMHAALTKNPKHGISMGVLSRLLRIVGVDQLHAGT AVGKMSETESEVRENIDALK------SELYGIKTVMPVASGGLWAGSVPDIIRIFGNDVI IQAGGGIHGI--GTRNGAMSMRQAADAAMEGVTLDDYSKSHKELRMAMEKWKR-- >CG23_combo_of_CG06-09_8_20_14_all_150_scaffold_130068_2Micrarchaeota ------------------------------------------------------------ ---------------------DS----------------------LKLQDIEFPKEIVDS FFGPKFGVQGIRKLLKVPKRPLIGTIVKPKLGLNESEHAEMAFKAWAGGCDIVKDDENLT SQSFNKFLIRVKETLKQRDRAEKLTGEKKIYMPNITAETEEMIRRAKFVKSLGGEYVMVD ILSCGWSAMETLRNHDLNLVIHAHRAGYAALSRTKDHGISMLVIAKLARLVGVDQLHIGT VVGKMDTPREEVVNIDQEMEGKHILEQNWYNIKPVFAVSSGGLHPGHIPYLVKTLGNNII IQCGGGIHGHPFGTIAGARAARQALEGTMKGISLSKYAKSRVELNTALNFFNKKV >RifSed_csp2_19ft_2_scaffold_254614_2Micrarchaeota -MKYVDLNHKPASEDLVCEFYVEPD-GISLKEAAGGVAAESSIGTWTELS-TMKKYVEKL HATVFDIHGNKVKISYPIELFEEGNMPNILSSVAGNVFGLKALKNLRLNDIHFPKKLVKS FKGPQFGIDGVRRILKVPDRPLVGTIIKPKLGLNTRDHAEVAYEAWVGGCDFVKDDENLA NQSFNRFEERLRETLAKRNRAEKETGERKMYLINVTAETEEMLRRSRQVCDQGGEYVMVD ILTCGLASVQTLRNHDLSLVLHAHRAGHAAFTKNPKHGISMKVIAKTARIIGLDQLHVGA VVGKMAESEEEVRQNIEALK------EEMYGLKQVLPIASGGLYPTLVPSLIEIFGQNLV IQAGGGIHGHKAGTRRGAKAMRQAVDAALLGKTLKEYATNHEE------------ >RifSed_csp2_16ft_2_scaffold_571319_1Pacearchaeota -MKYVDLNHKPA-SELVCEFYVEPD-GISLKEAAGGVAAESSIGTWTEL-STMKKYVEKI HATVFDIHGN-VKISYPIELFEEGNMPNILSSVAGNVFGLKALKNLRLNDIHFPKKLVKS FKGPQFGIDGVRRILKVPDRPLVGTIIKPKLGLNTRDHAEVAYEAWVGGCDFVKDDENLA NQSFNRFEERLRETLAKRNRAEKETGERKMYLINVTAETEEMLRRSRLVRDQGGEYVMVD ILTCGLASVQTLRNHDLSLALHAHRAGHAAFTKNPKHGISMKVIAKTARIIGLDQLHVGA VVGKMAESEEEVRQNIEALK------EEMYGLKQVLPIASGGLYPTLVPSLIEIFGQNLV IQ----------------------------------------------------- >RifSed_csp2_19ft_2_scaffold_12174_2Pacearchaeota -MRYVDLNYIPSLTDIICEFFIET-EGVNFKEAAGSVAAESSIGTW----THLSTMKEKL HATVFEMNGNTAKVAYPIELFEEGNMPNILSSVAGNVFGLKTLRNLRLNDIIFPEKLIKS FQGPKYGINGVRKILNVKKRPLVGTIIKPKLGLNTKDHSEVAYESWIGGCDFVKDDENLA SQSFNKFEARLKETLKKKDMAEKETGEHKMYLINITAETGEMLKRARLVSDLKGEYAMVD ILTCGYSSLQTLRLKDFDLVLHAHRAGHAAFTKNPKHGISMRFISKIVRIIGVDQLHVGA VVGKIAESKQEVSMNVSALK------CDMFGLNQVFPVASGGLHPRLIPSVIEIFGNDIV IQAGGGIHGHKDGTRSGAKAMRQAVDAVIAGKTLTEYAENHIELMSALETWL--- >RifSed_csp1_16ft_2_scaffold_53506_3Pacearchaeota -MRYVDLNYIPSLTDIICEFFIETE-GVNFKEAAGSVAAESSIGTWTHL-STMKEYVEKL HATVFEMNGNTAKVAYPIELFEEGNMPNILSSVAGNVFELKTLRNLRLNDIIFPEKLIKS FQGPKYGINGVRKILNVKKRPLVGTIIKPKLGLTTKDHSEVAYESWIGGWDFVKDDENLA SQSFNKFEARLKETLKKKDMAEKETCERKMYLVNITAETGEMLKRARLVSDLKCEYAMVD ILTCGYSSLQTLRLKDFDLVLHAHRAGHAAFTKNTKHGISMRFISKIVRIIGVDQLHVGA VVGKMAESKQEVSMNVSALK------CDMFGLNQVFPVASGGLHPRLIPSVIEIFGNDIV IQAGGGIHGHKD-----------------------------------CHSWKNLD >16ft_4_scaffold_166118_1Pacearchaeota -MRYVDLNYIPSLTDIICEFFIET-EGVNFKEAAGSVAAESSIGTWTHL-STMKEYVEKL HATVFEMNGNTAKVAYPIELFEEGNMPNILSSVAGNVFGLKTLRNLRLNDIIFPEKLIKS FQGPKYGINGVRKILNVKKRPLVGTIIKPKLGLNTKDHSEVAYESWIGGCDFVKDDENLA SQSFNKFEARLKETLKKKDMAEKETCERKMYLVNITAETGEMLKRARLVSDLKGEYAMID ILTCGYSSLQTLRLKDFDLVLHAHRAGHAAFTKNTKHGISMRFISKIVRIIGVDQLHVGA VVGKMAESKQEVSMNVSALK------CDMFGLNQVFPVASGG------------------ ------------------------------------------------------- >gi|1001835459|gb|LSCO01000005.1|_13 -MRYTDLSYTPKDTDLICDFYVEP----SMEFISGGVAAESSVGTWTEL-STEKPYMQKY AATVFDIQGNNIKIAYPVELFEPTNMPNILSSVAGNVFGLEDIANLRLNDIVFPDALITS FKGPRYGIDGVRRITGVTGRPLIGTIVKPKLGLMTKDHAKVAYDSWMGGCDIVKDDENLS NQKFNPFTERVLKTLEERDKAESVTGEKKVYLINVTAEVEEMKRRAQFVEDNGGRYMMID ILTTGWSSLQTMRNNGFNLIIHAHRAGHAAYTRSHKHGINMVVLAKVSRLIGVDQLHVGT AVGKMAETRQEVIANKNACV------EPFGEIKKVLPVASGGLHPGMVPKLVEYFGSDVI IQAGGGIHGHPDGTTKGATALRQAVDASMMDIPLQEYGKTHAELGKALTKWSVL- >mol-32-15fa-031464_9Pacearchaeota VMQYIDIGYRPKETDLICRFRLET----PFNVIAGGIAAESSVGTWTEL-STEKPYMRDM AAKIFRIDHS-IDVAYPFELFEPGNMPNILSSVAGNVFGLEDVKNLRLNDIVFPKELLKS FKGPKFGIAGVREVLRVRDRPLVGTIIKPKLGLIAKDHAKVAYEAWRGGCDVVKGDENLA SQRFNPFEERVLRTLEARDKAEAETGEQKAYMINVTAELEEMKRRAQYVEDHGGRYMMID ILTTGWSALQTMRDADFNLVIHAHRAGHAAYTRSKVHGINMIVLARVARLIGVDQLHVGT AVGKMSETEEEVKENIRACK------ETLYGVKQVLPVASGGLYPGLVPRLLKIFGRDFV IQAGGGIHGHPDGTMSGATAMRQALDAALLGVSLQEYAIEHIELSKAIEKWNDEQ >QMXL01000017.1_2 -MRYIDTKYRPKDTDLICEFHVEP----PLDVIAGGVAAESSIGTWTSL-TTEKPYIHDK AATVYEIKGNEIKIAYPEALFEPGNMPNILSSIAGNVFGLEDIKYLRLNNIHFPEEIAKS FKGPKYGIEGVRKLTGVTERPLVGTIIKPKLGLVTKDHAEVAYNAWKGGCDVVKDDENLS SQRFNPFDERVIETLEARDRAESETGEKKVYLINVTAPMEEMKRRADFVEDHGGRYMMID ILTTGWSSLQAMRDLDLKLLIHAHRAGHAAFTRSHRHGINMVVLAKVARLIGVDQLHVGT AVGKMAETREEVLMNKEACV------EPLYGIKPVLPIASGGLHPGMVPKLVEIFGKDTV IQAGGGIHGHPDGTVVGATALRQAVDAAMKGIPLKEYSKDHPALEKALEKWPVL- >NODE_5388_length_2116_cov_2.948014_2 ---------------------------------------------------TEKPYIHGK AATVFEINCNEIKIAYPIELFEPENMPNILSSVAGNVFGLEDIENLRLNDIHFPKELAKS FKGPKYGIEGVKNLTGVRDRPLVGTIIKPKLGLITKDHAEIAYEAWLGGCDVVKDDENLS SQRFNPFEDRVIQTLEMRDRAKSETGEKKVYLINVTASMEEMKRRAEFVEDHGGRYMMID ILTTGWSSLQAMRDLDLKLLIHAHRAGHAAFTRSHRHGINMVVIARVARLIGVDQLHVGT AVGKMAETKEEVLANIHACK------DPYYGLKPVLPIASGGLHPGMVPKLVEIFGKDTV IQAGGGIHGHPDGTIIGAKTVRQAVDATMKGIPLKEYAKTHLELAKALEKWPVL- >NODE_1993_length_3878_cov_5.275717_3 -MRYIDRKYKPKDTDMICSFHVEPH-NQPLDVIAGGVAAESSVGTWTDL-TTEKPYIHGK AATVFEINGN-IKIAYPIELFEPENMPNILSSVAGNVFGLEDIENLRLNDIHFPEELAKS FKGPKYGIEGVKDLTGVRDRPLVGTIIKPKLGLITKDHALVAYEAWLGGCDVVKDDENLS SQRFNPFEDRVIQTLEARDRAESETGEKKVYLINVTAGMEEMKRRAEFVEDHGGRYMMID ILTTGWSSLQAMRDLDLKLLIHAHRAGHAAFTRSHIHGINMVVIARVARLIGVDQLHVGT AVGKMAETKEEVLTNIHACK------EPLYGLKPVLPIASGGLHPGMVPKLVEIFGKDTV IQAGGGIHGHPDGTIIGAKTVRQAVDATMKGIPLTQYAKTHPELAKALEKWPVL- >gi|1083044429|gb|MEZI01000063.1|_2 -MRYIDRGYRPRDTDLLCRFRLEP--DQPFDVIAGGVAAESSVGTWTELSTEKP-YMMDK AAKVYRIEGD-IDVAYPEALFEPGNMPNILSSVAGNVFGLEDIRNLRLEDITFPRELAGS FKGPRHGIEGXXXXXXXXXXXXXXXXXXXXXXLNTRDHARVAYEAWSGGCDVVKDDENLS SQRFNPFEDRVLETLEAADRAESETGERKVYLVNVTAELEEMRRRAQYVEDHGGRCMMID ILTTGWSSLQTMRDAGYRLILHAHRAGXXXXTRSPVHGINMVVLARVARLIGVDQLHVGT AVGKMSETEEXXX--------X-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXHPDGTAKGATAMRQAVDAVLEGYTLEEYAEEHEELARALARWKTV- >mol-32-15fa-021564_1Pacearchaeota -MRYIDRGYRPRDTDLLCRFRLEP----PFDVIAGGVAAESSVGTWTEL-STEKPYMVDK AAKVYRIEGDLIDVAYPEALFEPGNMPNILSSVAGNVFGLEDIRNLRLEDITFPRELAGS FKGPRHGIEGVREITRTRGRPLVGTIIKPKLGLNTRDHAQVAYEAWSGGCDVVKDDENLS SQRFNPFEDRVVETLEAADRAESETGERKVYLVNVTAELEEMRRRAQYVEDHGGRCMMID ILTTGWSSLQTMRDAGYRLILHAHRAGHAAFTRSPVHGINMVVLARVARLIGVDQLHVGT AVGKMSETEEEVRANIAACK------EPMHGVKPVLPVASGGLHAGMVPRLVEIFG-DAV IQAGGGIHGHPDGTAKGATAMRQAVDAVLGGYTLEEYAEEHKELAQALAKWKTV- >NODE_3392_length_2798_cov_3.069827_2 -MRYIDHNYEPKETDLVCEFHVEPHDQP-LEVIAGGVAAESSVGTWTEL-STEKPYMQRL AAKVYSIEGNEVRIAYPLELFEPGNMPNILSSVAGNVFGLEDIRHLRLNDIHFPKELAKS FSGPRYGIQGVREVTGIRDRPLVGTIIKPKLGLVTADHARVTYEAWSGGCDVVKDDENLS SQSFNPFEDRVVETLEARDRAESETGERKVYLVNVTAETEEMKRRAQYVEDHGGRYMMID ILTTGWSSLQAMRDAGFNLVIHAHRAGHAAFTRSHVHGVNMVVLARVARLLGVDQLHVGT AVGKMAETEEEVRANIAACK------EPMHGVRPVLPVASGGLHPGTVPKLVEIFGNDTV IQAGGGIHGHPDGTVKGATAMRQAVDATL-GISLEDYAMDHVELSRALTKWPVV- >PBSW01000070.1_5 -MRYIDLDYEPSEKDLICGFHLET----PLNIVAGGVAAESSIGTWTEL-TTVKPYMAGL QATTFEIKENEISVAYPHELFEPGNMPNILSSVSGNVFGLEDIKSLRLNEIKFPAGLMEG FKGPKYGIDGIRKLLGINERPLVGTIIKPKLGLRTEDHAKIAYEAWRGGCDIVKDDENLS GQAFNPFEDRLRETLEARDRAEEETGETKVYMVNVTAETDEMIRRAGLIEDQGGRYMMVD ILTCGWSALQTLRDRDFDLVIHAHRAGHAAFTRSHSHGINMRVIAKVARIIGVDQLHIGT AVGKMAETATEVKGNMMTLL------EPLHGMKDVFPVASGGLYPGLIPALMKIFGKNFI IQAGGGIHGHPDGSISGARAMRQAVDAVMAERSLEDYADDHVELETALDTWSGS- >CG10_big_fil_rev_8_21_14_0.10_scaffold_92999_2Pacearchaeota -MKYIDSNYTPTKSDLVCEFFVETEGGFDIRTAAGGVAAESSVGTWTEL-TTEKEYMKNL AAKVFSISAN-IKIAYPAGLFEPANMSNILGSVAGNVFGLRELRNLRLNDIFLPKQLIKS FRGPKYGIGGIRKLLRVKSRPLVGTIIKPKLGLRTKDHAQVAYDAWTGGCDIVKDDENLG SQKFNPFRERFLQTIKLRDKAERETGEKKVYMANITAETEEMMRRADFVKKNGGEYMMVD ILTAGWSALQSLRNRNLNLVIHAHRAGHAAITKNPKHGISMKVIAKVARIIGVDQLHVGT VVGKMFETKEEVLENISALK-------ERNGLKQVFPVASGGLSPLGVPELIRIFGNDVI IQAGGGIHGHPRGSKAGATAMRQAVDAVMAGASLKEYAKAHSELKEALERFG--- >bjp_ig2599_scaffold_21435_3Pacearchaeota LEIYIKAKWHPEFEDLVCDFFVETG-GFDIRTAAGGVAAESSVGTWTEL-TTEKEYMKSL AAKVFSITNN-IRIAYPIRLFEPANMPNILSSVAGNVFGLRELKNLRLNDIHFPEKLLKS FKGPKYGIDGIRKLLQVKSRPLVGTIVKPKLGLNTKDHAKVAYDAWTGGCDIVKDDENLG SQKFNPFRERFLQTIKMRDKTEKETGEKKVYMVNVTAETDEMLRRAEFVARNGGEYLMVD ILTAGWSALQTLRNKNLNLVIHAHRAGHAAVTKNPKHGISMKVIAKVSRAIGVDQLHVGT VVGKME-TKEDVLANVSALK-------ERNGLKEVFPVASGGLSPLSVPDLVKIFGNDVI IQAGGGVHAHKLGTRAGARAMRDALYAAMKEIPLEEYAKQSKELKIALMQWGKSK >gi|1083045515|gb|MEZE01000052.1|_12 XXXXXXLDYTPKETDVVCTFYVEPK-GISLKEAAGXXXXXXXXXXX----XXXXXXXXKL AAHVFSINGNTVKIAYPMELFEFGNMPNILSSIAGNVFGLRTLRNLRLNDVEFPKEVVNS FRGPKYGIEGIRKLLKVYDRPLVGTIIKPKLGLKTTDHARVAYEAWVGGCDIVKDDENLS SQRFNPFEERLTKTLERRDQAENETGEKKVYMINVTAETSEMIRRAELVLKQGGEYVMVD ILTCGFAALQSLRDRNLDLVIHAHRAGHAAFTRNLKHGISMRVITKIARMIGVDQLHVGA VVGKMAETKREVSENVEALK------MKMGGLKVVLPVASGGLYPRLVPSLMDFFGKDFV IQAGGGIHGHSDGTVAGARAMRQAVDATLKGVSLNDYAKTHKELEAALQTWK--- >QMYO01000036.1_9 -MRYRDLTYRPSENDLVCDFSVET----DFVKMVGGVAAESSIGTWTEL-TTMKAYVKKL HAIVFDINGGSVRIAYPIDLFEPGNMPNILSSIAGNVFGLKDLKNLRLNDVHVPEELVAS FKGPKYGIAGIRKLVDVHDRPLIGTIIKPKLGLNPKDHAEVAFNAWVGGCDIVKDDENLS SQKFNPFDDRVVKTLEYRDKAEEQTGEKKIYMPNVTAETNEMIKRAQFVADQGGEYVMVD ILTCGFSALQTLRNQDLDLVLHAHRAGHAAFTKNPKHGIAMRVIAKLSRIIGVDQLHVGT AVGKMAETKEEVLINCEALR------GKMNGLRKVMPVASGGLHPALVPSLIGLFG-DFV IQAGGGIHGHKHGTVAGAKAMRQAVDATVEHIELEEYAETHSELQSALETWK--- >gi|1007508460|gb|LUCA01000056.1|_2 -MRYRDLGYRPRESDVVCRFHLEP----PLEEAAGAVAAESSIGTWTELR-TLKPYMVEL RARVFEMEGQAISVAYPLELFEPGNLSNLLSSVAGNIFGMSAISRLKLLDIMLPEALLRS YPGPRYGIEGVRNLLGIRGRPLVGTIIKPKLGLNVQDHARVAYEAWRGGCDIVKDDENLA DQGFNPFEDRVIETLEMRDRAEEETGERKAYLVNITAETGEMLRRAEYVEDHGGRYVMID ILTTGFAALQTLRKADLKLALHGHRAGHAAFTRPRRHGIAMRVIAKLARFAGIDQLHVGT AVGKMAEPRERVLENVEALK------TPMGRLKPALPVASGGLHPGLVPELLKIFGPDVI IQAGGGVHGHPGGTEAGARAMRQAVEAALEGIPLKEYAKGRPELAAALERWSRP- >rifcsphigho2_01_scaffold_440213_2Micrarchaeota -------------------------------EAAGGVAAESSIGTWTEL-TAKHKYVERL AARVFEINNN-IKIAYPAELFEPGNMPNILSSVAGNVFGLNALKNLRLNDINFPANIVKS FKGPKYGIEGIRKLLRVKKRPLVGTIIKPKLGLKTKDHAKVAYESWLK------------ ---FNSFEKRIRETLKARDKVERETGEKKVYLANITAETNEMLRRARLVKDLGGEYIMID ILTSGWSALQTLINADFGLVIHAHRASHATFTKNPKHGISMKVIAKIARIVGVDQLHVGT VVGKMFETKKDVL---ENCR---ALKEELWGMNQVMPVASGGLHPGLVPELIRIFGDDIV IQAGGGIHGHPDGTFAGAKAMRQAVESIVEDIDIKEYAAKHKELASALKHF---- >rifcsplowo2_01_scaffold_35063_9Pacearchaeota -MKYRDLNYKPAKTDLICEFYVEP-LGVSLKEAAGGVAAESSIGTWTEL-TAKHKYVERL AARVFEISNNNIKIAYPAELFEPGNMPNILSSVAGNVFGLNALKNLRLNDINFPANIVKS FKGPKYGIEGIRKLLRVKKRPLVGTIIKPKLGLKTKDHAKVAYESWLGGCDIVKDDENLA SQKFNSFEKRIRETLKARDKVERETGEKKVYLANITAETNEMLRRARLVKDLGGE----- -------XXQTLINADFGLVIHAHRASHATFTKNPKHGISMKVIAKIARIVGVDQLHVGT VVGKME-TKKDVLENCRALK-----EELW-EMNQVMPVASGGLHPGLVPELIRIFGDDIV IQAGGGIHGHPDGTFAGAKAMRQAVESIVEDIDIKEYAAKHKEA---LKHF---- >QMXQ01000154.1_1 -MRYLDLTYEPSETDVICDFHVEP-LGISLEEAAGGVAAESSIGTWTEL-TTIKPYVERL HATVFQIEGNEVRIAYPVELFEPGNMPNILSSVAGNVFGLGAIKRLRLNDIHLPEALVRS FRGPSYGIEGVRRLLRVEDRPLVGTIIKPKLGLRTRDHARVAYEAWIGGCDIVKDDENLS SQAFNPFEDRVVETLERRDRAEEETGERKAYMVNVTAETGEMIRRAEFVKDHGGRYAMVD IITCGFSALQAVREQDLGLVIHAHRAGHAAFTRMERHGISMRVIAKAARMVGVDQLHVGT AVGKMSEGREEVLGNVEALK-----------------------------------G---- ------------------------------------------------------- >RBG_16_scaffold_11079_2Pacearchaeota ------LKYKPKDYDLICTFKLDPE-GVDFKEAAGAIAAESSVGTWTEL-TTIKPYVEEL AAHVFQLEGDLARIAYPIELFEIENMPNILSSIAGNVFGLKALRNLRLIDVQLPEGLIRS FKGPKYGIQGIRKILGVKDRPLVGTIIKPKLGLKSEDHAKVAYEAWAGGCDIVKDDENLS SQKFNPFEKRLDATLEAKDRAENETGEKKVFITNITAETETMLERADMVIGHGGEYVMVD ILTCGWSALQTLRKQELKLVIHAHRAGHAAFTKNPLHGIAMRPIATIARIIGVDQLHVGT VVGKMSETKAEVLENIDACK------MELSGLKTIMPVASGGIHPRLVPALLETFGKDVV IQAGGGVHGHPMGTKTGALAMRHAVDAAIQGIILEEYAKKHIELASALDTWKA-- >MTLU01000004.1_18 -MKYKDLGYKPKETDIICTFYVEPE-GIDLSEAAGGVAAESSIGTWTEL-TTIKPYVKEL AAHVFGIHENYVKIAYPIELFENNNMPNILSSISGNVFGLKTIKNLRLNDVHFPHELIRS FKGPKYGIDGIRQLLKVHDRPLVGTIIKPKLGLKTSDHAQVAYQAWLGGCDIVKDDENLS SQKFNPFEERVVKTLESRDKAQEETGEKKVYMVNITAETEEMLRRAEFVLNHGGEYVMVD ILTCGFSALQTLRERDFDLVIHAHRAGHAAFTKNSKHGVSMRVIAKIARIIGVDQLHIGT VVGKMFETKEEVAENCRALK------EKIADLKPVFPVASGGLHPRLVPELIKFFGTDII IQAGGGIHGHREGTVAGAKAMRQAVDATLKGISLKEHAQTHRELQIALEMWR--- >MTLV01000011.1_15 -MKYKDLGYKPKETDIICTFYVEP-EGVDLSEAAGGXAAESSIGTWTEL-TTIKPYVKEL AAXVFGIHENYVKIAYPIELFENNNMPNIXSSISGNVFGLKTIKNLRLNDVHFPHELIRS FKGPKYGIDGIRQLLKVHGRPLVGTIIKPKLGLKTSDHAQVAYQAWLGGCDIVKDDENLS SQKFNPFEERVVKTLESRDKAQEETGEKKVYMVNITAETEEMLRRAEFVLNHGGEYVMVD ILTCGFSALQTLRERDFDLVIHAHRAGHAAFTKNSKHGVSMRVIAKIARIIGVDQLHIGT VVGKMFETKEEVAENCRALK------EKIADLKPVFPVASGGLHPRLVPELIKFFGTDII IQAGGGIHGHREGTVAGAKAMRQAVDATLKGISLKEHAQTHRELQIALEMWR--- >MTLR01000131.1_5 -MRYLDLSYEPKETDVVCVFYVEPD-GVSISEAAGGVAAESSIGTWTEL-TTIKPYVKEL AAHVFSIDGNTVKIAYPIELFELGNMPNILSSISGNVFGLKTIKHLRLNDVYFPSELIRS FKGPKYGVEGVRNLLGVYDRPLVGTIIKPKLGLKTADHAQVAYEAWVGGCDIVKDDENLS SQKFNPFEDRVIKTLESRDKAQEETGERKVYMVNITAETEEMLRRADFVLNHGGEYVMVD ILTCGFSALQSLREQDFNLVVHAHRAGHAAFTKNPRHGISMRVIAKISRIIGVDQLHVGT VVGKMFETREEVAENCKAIR------EEMKGLKPVLPVASGGLYPGLVPALIDFFG-DFV IQAGGGIHGHREGTVAGAKAMRQAVDATLRNIPLSKYAEAHRELKIALDMWGTGH >DAZI01000075.1_6 DLKYLDLSYIPKETDVICTFYVEP-DGVTIEEAAGGIAAESSVGTWTEL-TTIRPYIKEL VAHVFDINGNNVKIAYPIELFEPKNMPNILSSISGNIFGLKTIKHLRLSDVYFPSELIRS FKGPKYGIEGVRSLLRVDDRPLIGTIIKPKLGLKTPDHAQVAYEAWVGGCDIVKDDENLS NQKFNPFEERVIKTLESRDKAQEETGEKKVYMVNITAETEEMLRRAEFVLNHGGEYVMVD IITCGFSALQTLREQDFNLVIHAHRAGHAAFTKDPKHGISMRVIAKVARIIGVDQLHVGT VVGKMFETREEVAENCRALK------EEMASIKPVLPVASGGLHPGMVPVLIEFFGLDFV IQAGGGIHGHKEGTAAGARAMRQAVDATLKKISLKEYAESHKELKIALDLWSPSA >RifSed_csp2_19ft_3_scaffold_147407_1Pacearchaeota -LRYVDLKYEPKENDVICEFYVEPE-GISIKGAAGGVAAESSIGTWTEL-TTEKAYVKKL AARVFNIEGNYAKIAYPIELFEYGNMPNILSSVAGNVFGLRTLKNLRLNDINFPQRLVRS FKGPKFGVKGIRKLLKIFNRPLVGTIIKPKLGLKTVHHEKVAYDSWVGGCDIVKDDENLS SQKFNPFRERVIKTLEGRDKAEEETGERKVYMANITAETGEMLKRAEYVLDHGGEYVMID VLSCGFSSLQTLREQNLNLVIHAHRAGHATFTKNPKHGISMRVIAKMARLVGVDQLHVGT VVGKMFESREEVTENCEALK------KKMDILKPVLPVASGGLHPGLVPALIEFFG---- -----------------------------KDF----------------------- >CG_2015-19_scaffold_109081_1Pacearchaeota -LKYVDLEYKPKEADVVCTFIVEPD-GISMKQAAGAVAAESSIGTWTHL-TTIKPYVEKL AARVFSIEGKVAKIAYPIELFEQGNMPNILSSVAGNVFGLRTLKNLRLEDVVFPEKIVKS FKGPKYGIEGIRTLLKIYDRPLVGTIIKPKLGLKTSDHADVAYKAWAGGCDIVKDDENLS SQRFNPFEERLTKTLESRDKAEEETGERKVYMINVTAETKEMLRKAEMVLEQGGEYVMID ILTCGFAALQTFREQDFKLVVHAHRAGHAAVTKNPKHGISMQVLAKAARIIGVDQLHVGT VVGKMFETKEEVADNCKALK------MHIGDLKKALPVASGGLHPGLVPALIEFFGKDVV ------------------------------------------------------- >PEWI01000089.1_3 -MRYIDLNYTPKETDVICTFHIEPA-GISMKEAAGGVAAESSVGTW-TEFTTVKPYVDRL AACVFDIDGGLAKIAYPLELFELGNMPNILSSVAGNVFGLRALENLRLNDIDFPSKLVRS FKGPKFGIEGIRRLLKVYDRPLVGTIIKPKLGLKTADHAKVAYEAWVGGCDIVKDDENLG SQSFNPFDERVLKTLEARDRAQRKTGEKKVYMVNITAETGQMLKRAEFVLSHGGEYVMVD ILTCGFAALQTLRDQGFKLVIHAHRAGHAAFTKNPKHGISMKVIAKVARMIGVDQLHVGT VVGKMFETRDEVRENCEALK------TEMSGLKPVLPVASGGLHPGLVPSLMEFFGKDFV IQAGGGIHGHSDGTVAGAKAMRQAVEATLEGVPLKEYARAHVELETALRIWGQRP >rifcsp2_19_4_full_scaffold_215841_1Micrarchaeota ILKYVDPNYKPKETDLICTFAVEPE-GISLKEAAGGVAAESSVGTWTEL-TTIQPYVEKL AATVFSIKGNTIKIAYPIELFEAGNMPNILSSVAGNVFGLKALKNLRLLDIEFPKALLDS FKGPAYGIKGIRELVKVPKRPLVGTIIKPKLGLKTVDHAKVAYNAWAGGCDVVKDDENLS SQKFNPFEDRLVQTLESRDKAQKETGECKVYMVNITAETDIMLKRAQAVVDQGGEYVMVD ILTCGWSALQTLRNQNFPFVLHAHRAGHAAFTKNPLHGIAMKPIATVSRVIGVDQLHVGT VVGKMSETQQEVLENIDACK------SSMGNLRPVLPVASGGLHPRLVPALLKTFGNDVV LQAGGGIHGNPLGTVS--------------------------------------- >mol-32-1605-070610_1Micrarchaeota -LKYLDFSYKPKETDLLCTFYVEPE-GISLKEAAGGVAAESSVGTWTEL-TTEKPYVKKL AAHVYRIEGNIIKIAYPIELFEQGSMPNILSSVAGNVFGLKALKNLRLLDIEVPKALLSG FKGPQYGITGIRKLLKVPKRPLVGTIIKPKLGLNTKDHAKVAYDAWLGGCDVVKDDENLS SQKFNPFDDRLFETLEARDKAQDETGERKVYMINVTAETNLMLKRAQTVVDQGGEYVMVD ILTCGWSALQSLREQNFKVVIHAHRAGHAAFTKNPLHGISMKPIVSVARIIGVDQLHVGT VVGKMSETKPEVLENISACK------AELGDLAPVLPVASGGLYPQLVPSLLETFGNDVV LQAGGGIHGHPEGTVNGAKAMRQAVDAVLEGRPLDEYAKTHKELQLALQHWKV-- >PKYH01000122.1_5 -MKYLDSTYKPKETDLVCTFYLEPE-GISLNEAAGGVAAESSIGTWTEL-TTTQPYVTRL AAHVFSIEGNIAKIAYSIELFEPANMPNILSSVAGNVFGLKALKNLRLLDIQMPKDLVNS FKGPNYGIAGIRKLLKVPERPLVGTIIKPKLGLNTKDHAKVAYDAWSGGCDIVKDDENLS SQKFNPFEDRLSQTLESRDKAQEETGERKVYMVNITAETDTMLKRAQTVLDQGGEYVMVD ILTCGWSALQTLRNQNYKLVIHAHRAGHAAFTKNPKHGISMRPIATVARIIGVDQLHVGT VVGKMSEAKAEVLENIDACK------AELGGLKPVLPVASGGLHPRLVPALVETFGNDVV IQAGGGIHGHPDGTVAGAKAMRQAVDATLKGLSLEEYAKRHKELKTALELWKA-- >RBG_16_scaffold_1633_8Micrarchaeota -LRYLDLTYKPKETDLTCTFYVEPE-GISLKEAAGGIAAESSIGTWTEL-TTTQPYVTRL AAHVFSIEGTVVKIAYPIELFEPANMPNILSSVAGNVFGLKALKNLRLLDIQMPQGLINS FKGPLFGITGIRKLLKVPKRPLVGTIIKPKLGLKTKEHAKVSYDAWSGGCDIVKDDENLS SQKFNPFEERVTQTLECRDKAQQETGERKVYMVNITAETDTMLKRAQTVINQGGEYVMVD ILTCGWSALQTLRNQNFKLVIHAHRAGHAAFTKNPKHGIAMRPIATVSRVIGVDQLHVGT VVGKMSETKAEVIENIVACK------AELGGLKPVLPVASGGLHPRLVPALLETFGNDVV IQAGGGIHGHPDGTVAGAKAMRQAVNASLKGLSLEEYAKTHGELKAALSLWRA-- >PLLN01000104.1_1 ALKYLDLSYKPKETDLTCTFTVDPE-GISLKEAAGAVAAESSVGTWTEL-TTEKPYVKRL AAHVFSIEGSVVKIAYPKELFEPANMPNILSSVAGNVFGLKALRNLRLLDIEFPKNLAES FKGPAFGISGIRKLLKVPKRPLVGTI---------------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------- >RifSed_csp2_16ft_2_scaffold_362462_2Micrarchaeota RLKYLDLHYKAKETDLICTFQIEPE-GISLKEAAGGGAAESSVGTWTEL-TTEKPYVKKL AAHVYSIDGSVVRIAYPRELFESSNMPNILSSVAGNVFGLKALKNLRLLDIEFPNHLLAS FKGPAFGIAGIRKLLKVPKRPLVGTIIKPKLGLKTEDHAKVAYESWFGGCDVVKDDENLS SQSFNPFEARLTQTLEARDKAQSATGERKVYMINITAETQTMLKRAQAVVNQGGEYVMVD ILTCGWSALQTLRGQNFKLVIHAHRAGHAAFTKSQQHGIAMKSIAAVARIIGVDQLHVGT VVGKMSETKQEVVENISVCK------AEMGGLKPVLPVASGGLHPRLVPALVETFGNDFV IQAGGGIHGHSEGTVAGAKAMRQAVDATLECKTIDEYAKTHRELALALKQWKT-- >PMAZ01000301.1_9 -MKYLDLTYKPKETDLICTFYVEPE-GISLKEAAGGVAAESSVGTWTEL-TTEKPYVKRF AAHVFSIEGNVIKIAYPIELFEPSNMPNVLSSVAGNVFGLKALKNLRLLDIEFPNQLLNS FKGPAFGIAGIRKLLKIPKRPLVGTIIKPKLGLETKDHAKVAYEAWLGGCDVVKDDENLS SQSFNPFEERLAQTLESRDKAQEETNERKVYMINVTAETQMMLKRAQAVVDQGGEYIMVD ILTCGWSALQTLRDQNFKLVIHAHRAGHAAFTKNPLHGIAMKPIATVARMIGVDQLHVGT VVGKMSETKAEVIENIGACK------AEMGLLKPVLPVASGGLHPRLVPALMETFGNDFV IQAGGGIHGHPDGTVAGAKAMRQAVDATLYGKTLEEYAKTHKELALALKQWKI-- >PLNS01000004.1_27 -MKYLDLAYEPKETDLICTFYVEPE-GISLKEAAGGVAAESSVGTWTEL-TTEQPYVKRL AAHVYDIDGSVVKIAYPNELFEQANMPNILSSVAGNVFGLKALKNLRLLNIEFPRQLLTS FKGPAFGIAGIRQLLKVPKRPLVGTIIKPKLGLETKDHAKVAYEAWYGGCDIVKDDENLS SQKFNPFEERLTQTLESRDKAQEETGERKVYMINITAETDTMVKRAQTVVDQGGEYVMVD ILTCGWSALQTLREQNLKLVIHAHRAGHAAFTKNHVHGIAMKPIATVARVIGVDQLHVGT VVGKMSETKAEVMENIEVCK------AELGDLKPVLPVASGGLHPRLVPALMETFGNDFV IQAGGGIHGHPHGTICGAKAMRQAVDATLEGRTLDEYAKNHRELASALKQWKA-- >PLLN01000078.1_17 -MKYLDLAYEPKETDLICTFYVEP-EGISLKEAAGGVAAESSVGTWTEL-TTEQPYVKRL AAHVYDIDGSVVKIAYPNELFEQANMPNILSSVAGNVFGLKALKNLRLLNIEFPRQLLTS FKGPAFGIAGIRQLLKVPKRPLVGTIIKPKLGLETKDHAKVAYEAWYGGCDIVKDDENLS SQKFNPFEERLTQTLESRDKAQEETGERKVYMINITAETDTMVKRAQTVVDQGGEYVMVD ILTCGWSALQTLREQNLKLVIHAHRAGHAAFTKNHVHGIAMKPIATVARVIGVDQLHVGT VVGKME-TKAEVMENIEVCK------AELGDLKPVLPVASGGLHPRLVPALMETFGNDFV IQAGGGIHGHPHGTICGAKAMRQAVDATLEGRTLDEYAKNHRELASALKQWKA-- >PLYI01000055.1_62 SLKYLDLTYKPKESDLICTFLVEP-QGISLKEAAGGVAAESSVGTWTEL-TTEKPYVKRL AAHVYSIEGSEVKIAYPAELFEAANMPNILSSVAGNVFGLKALKNLRLLDLEFPKQLLAS FKGPAFGIAGIRKLLKIPKRPLVGTIIKPKLGLETKDHAKVAYEAWLGGCDIVKDDENLS SQKFNPFETRLTQTLESREKAQAETGERKVYMINITAETDTMLKRAQTVVDQGGEYVMVD ILTCGWSSLQTLRNQNLKLVIHAHRAGHAAFTKNPTHGIAMKPVATVARVIGVDQLHVGT VVGKME-TKAEVIENIDACK------TQMGDLRPVLPVASGGLHPRLIPSLMETFGNNFV IQAGGGIHGHPDGTVAGAKAMRQAVDATLERKTLEEYAKNHKELATALKQWKT-- >gi|921074160|gb|LFWU01000010.1|_5 ---------------MICDFYVEP-EGISLKEAAGGVAAESSVGTWTEL-TTIKPYVEKL AARVFSINGNHFRVAYSTELFESGNMPNILSSVAGNVFGLRALKNLRLNDIHFPKVLVGS FKGPKYGIAGIRKLLKVHDRPFVGTIIKPKLGLKTVDHAKVAYDAWVGGCDIVKDDENLS SQRFNPFNARIEATLEMRDRAEKKTGEKKVYMANITSETEEMLKRAQFVKDHGGRYIMID ILTCGFSALQTLRDHDFGLVIHAHRAGHAAFTKNPKHGISMKVIAKVVRLIGVDQLHVGT VVGKMSETKEEVSENREACT------IELGGLKKVLPVASGGLHPGLVPALMNFFGNDFV IQAGGGIHGHPDGTVLGAIAMRQAVDATLQGVSLKEYAKNHKELQEALNIWN--- >KON34323.1_ribulose_1 -MRYVDLSYKPKTTDLICDFYVEP-EGISLKEAAGGVAAESSVGTWTEL-TTIKPYVEKL AARVFSINGNHFRVAYSTELFESGNMPNILSSVAGNVFGLRALKNLRLNDIHFPKVLVGS FKGPKYGIAGIRKLLKVHDRPFVGTIIKPKLGLKTVDHAKVAYDAWVGGCDIVKDDENLS SQRFNPFNARIEATLEMRDRAEKKTGEKKVYMANITSETEEMLKRAQFVKDHGGRYIMID ILTCGFSALQTLRDHDFGLVIHAHRAGHAAFTKNPKHGISMKVIAKVVRLIGVDQLHVGT VVGKMSETKEEVSENREACT------IELGGLKKVLPVASGGLHPGLVPALMNFFGNDFV IQAGGGIHGHPDGTVLGAIAMRQAVDATLQGVSLKEYAKNHKELQEALNIWN--- >PIXT01000089.1_9 -MKYTDQSYEPKVTDLLCDFYIEPE-GISLKEAAGGVAAESSVGTWTEL-TTIKPYVEKL AARVFSIDNN-IRVAYPIELFEHGNMPNILSSVAGNVFGLRALKNLRLNDIQLPKELVHS FKGPKYGIAGIRELLNVKDRPFVGTIIKPKLGLKTKDHAKVAYDAWAGGCDVVKDDENLS SQRFNSFDDRVVATLEMRDRAEKETGEKKVYMVNITSETEEMTKRAQFVKDHGGRYLMID ILTCGYSALQTIREQDFGLVIHAHRAGHAAFTKNTKHGISMKVIAKVARIIGVDQLHVGT VVGKMSETKEEVSENCEALK------TDMYGLKDALPVASGGLYPSLVPALMKFFGNDLV IQAGGGIHGHTDGTVSGAIAMRQAVDATLQGVSLKEYAKSHKELHVALELWK--- >PIXS01000337.1_2 -MKYVDQSYTPKPTDTICSFYVEPE-GITLKEAAGGVAAESSVGTWTEL-TTIKPYVEKL AATVFNINGNNIQIAYPIELFEHGNMPNILSSVAGNVFGLRALKNLRLNDIQLPKDLVQS FKGPKFGISGIRELMGVKNRPLVGTIIKPKLGLKTEDHAKVAYDAWVGGCDVVKDDENLS SQRFNPFDERVIKTLEMRDRAEKETGEKKVYMVNITSETEEMLKRAQFVKDHGGRYLMID ILTCGFSALQTVREQDFGLVLHAHRAGHAAFTKNKKHGISMRVIAKVSRIIGVDQLHVGT VVGKMSETKEEVSENCEALK------TSMFGLKNVLPVASGGLYPRVVPALMSFFGNDLV IQAGGGIHGHVDGTVSGAKAMRQAVDAALSGKSLEDYAKTHKELRVALDVWK--- >NODE_4014_length_2540_cov_3.835566_3 -MKYVDNSYNPKTTDTICTFYVEP-KGISLKEAAGGVAAESSVGTW----TELTTIKPKL AAKVFSIEGNDIQIAYPIELFEPGNMPNILSSISGNVFGLRALKNLRLNDIQLPKELVKS FKGPKYGINGIRELLGVKDRPFVGTIIKPKLGLKTEDHAKVAYEAWIGGCDIVKDDENLS SQLFNPFTDRVIRTLEMRDLAEKETGEKKVYMINITSETEEMLKRAQFVKDHCGKYLMID ILTCGFSALQTVREKD--------------------FGL--------------------- ------------------------------------------------------------ ------------------------------------------------------- >gi|921072115|gb|LFWV01000001.1|_5 -MKYTDLKYEPAETDLICAFYVEPE-DISLKEATGGVAAESSIGTWTEL-TTTEPYMAKL AARVFAIVGNTANIAYPIELFEQGNMPNILSSLAGNVFGLKALKNLRLTDIKLPAELVKS FKGPKFGIQKIRSLLKVPERPLVGTIIKPKLGLKTKDHAKVAYEAWAGGCDIVKDDENLS SQRFNPFEERIVETLDGRDKAEEETGERKVYMANITGETEKMLKRAKCVLDHGGRYVMVD ILTCGWSALQTLRDQDFKLVIHAHRAGHASFTKNPKHGIAMRVIAKVARVIGVDQLHVGT IVGKMSETKDEVLENIDALK------MDMAGLKPVLPVASGGLHPQLIPALMEYFGKDFV IQAGGGIHGHTDGTFAGATAMRQAVEAIMQGKTLEAYAETHKELEVALKRWRE-- >KON32451.1_ribulose_1 -MKYTDLKYEPAETDLICAFYVEPEDIS-LKEATGGVAAESSIGTWTEL-TTTEPYMAKL AARVFAIVGNTANIAYPIELFEQGNMPNILSSLAGNVFGLKALKNLRLTDIKLPAELVKS FKGPKFGIQKIRSLLKVPERPLVGTIIKPKLGLKTKDHAKVAYEAWAGGCDIVKDDENLS SQRFNPFEERIVETLDGRDKAEEETGERKVYMANITGETEKMLKRAKCVLDHGGRYVMVD ILTCGWSALQTLRDQDFKLVIHAHRAGHASFTKNPKHGIAMRVIAKVARVIGVDQLHVGT IVGKMSETKDEVLENIDALK------MDMAGLKPVLPVASGGLHPQLIPALMEYFGKDFV IQAGGGIHGHTDGTFAGATAMRQAVEAIMQGKTLEAYAETHKELEVALKRWRE-- >NODE_69_length_33692_cov_19.525331_18 -MKYIDLNYKPALADSILTFYLEPE-GISIKEAAGGVAAESSIGTWTEL-TTMEPYMMKL AACVFSMEGNFVKIAYPVELFEQGNMPNILSSVAGNVFGLKALKNLRLVDIELPAILLKS FKGPKFGIKGIRSLLKVPKRPLVGTIIKPKLGLKTKDHAKVAYEAWVGGCDVVKDDENLS SQRFNPFEKRILKTLEGRDKAENETGEHKVYMANITAETETMLKRAEFVLDHGGRYVMVD ILTCGWSALQTLRDQNFKLVIHAHRAGHAAFTKNLKHGIAMRTIAKVSRIIGVDQLHVGT IVGKMSETKEEVLENIDALK------TEMAGLKPVLPVASGGLHPKLVPALMEYFGRDFV IQAGGGIHGHTDGTSAGATAMRQAVDATMQGKTLAAYSETHKELKLALDLWRD-- >NODE_1486_length_4611_cov_27.932730_4 -MKYIDLNYEPVETDLICTFYIEPE-GISLKEAAGGVAAESSIGTWTEL-TTTEPYMVKL AACVFSMEDNTAKIAYPIELFEQENMPNILSSVAGNVFGLKALKNLRLIDIKFPVRLLEG FKGPKFGIQGIRDLLKVPERPLVGTIIKPKLGLKTKDHAKVAYEAWAGGCDIVKDDENLS SQRFNPFEERIVKTLEGRDKAEEESGECKVYMANITAETETMLKRAEYVLDHGGRYVMVD ILTCGWSALQTLREQNFKLVIHAHRAGHAAFTKNPKHGLAMRTIAKVARIIGVDQLHVGT IVGKMSETKKEVLENIDALK------TEKVGLKPVLPVASGGLHPKLVPALMKYFGKDFV IQAGGGIHGHTDGTFAGATAMRQAVEATLQGKTLEMYAQTHKELKVALKLWKK-- >NODE_514_length_8638_cov_28.201729_2 GLKYIDLNYEPVETDLICTFYVEP-EGISLKEAAGGVAAESSIGTWTEL-TTTEPYMVKL AARVFNMENNTVKIAYPIELFEKENMPNILSSVAGNVFGLKALKNLRLIDIKFPVKLLKG FKGPKFGIQGIRNLLKVLERPLVGTIIKPKLGLKTKDHAKVAYEAWAGGCDIVKDDENLG SQLFNPFEERLVKTLESRDKAEEETGECKVYMANITAETETMLKRAEYVLDHGGKYVMVD ILTCGWSALQTLREQNFKLVIHAHRAGHAAFTKNPKHGITMRTIAKVARIIGVDQLHVGT IVGKMSESKKVVLENIDALK------TDMEGLKPVLPVASGGLHPKLVPALMKYFGKDFV IQAGGGIHGHTNGSFAGATAMRQAVEATLQGITLEIYAETHEELKIALKLWKK-- >PIXS01000241.1_4 -----------------------------------------T------------------ ---------------------------------------------LR------------- ------------------------------------------------------------ ------------------------------------------------------------ -------------EHDFGLVIHAHRAGHAAFTKNPLHGISMKVIAKVARIIGVDQLHVGT VVGKMSDTKEEVAESCEALT------ADMYGIKDVLPVASGGLYPGLVPALMGFFGKDFV IQAGGGIHGHPNGTVSGAIALRQAVDATLQGMPLKEYAKTHKELQEALNIWK--- >NODE_210_length_15982_cov_4.309965_25 ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -----------------------------------------RAIAKVARIIGVDQLHVGT IVGKMSETKEEVLANIDALK------MELAGLKPVLPVASGGLHPKLVPALMDYFGKDFI IQAGGGIHGHTEGTFAGAAAMRQAVDATLKGKTLGEYAETHEELKAALKLWTD-- >RifSed_csp2_13ft_1_scaffold_60832_1Micrarchaeota -LKYQDLKYKPKPSDLTCTFTLEPE-GISLKEAAGAVAAESSIGTWTEL-TTLKTYLEKL AAYVYSLEGNTAKIAYPIALFEAGNMPNILSSVAGNVFGLKALSNLRLKDIELPSDLTKT FHGPKYGISGIRKLLEVPERPLVGTIIKPKLGLKTMDHARVAYEAWKGGCDVVKDDENLS SQEFNPFEERVSETLDCRDQAEAETGERKVYMANITAETSGMLKRAEFVQDHGGEYVMVD ILTCGWSALQTLRDQDFKLVIHAHRAGHAAFTKNPKHGIAMRVTAKVARIIGVDQLHVGT VVGKMSETKEEVL----------------------------------------------- -------------------------EN---------------------------- >NODE_2094_length_3761_cov_7.127036_5 -MRYVDLKYKPAETDLICTFYVEPN-GIGLKEAAGGVAAESSVGTWTEL-TTVRPYVERL AALVFSIDGNNVKIAYPIELFESRNMPNILSSVAGNVFGLRALRNLRLNNIEFPEKLIIS FKGPQFGIDGVRRLLKVMNRPFVGTIIKPKLGLKTVDHAKVAYEAWAGGCDIVKDDENLS SQRFNPFEERVVKTLDMRDKAQSEMGERKVYMVNVTAETELMVKRADFVFAHGGEYVMVD ILTCGFSALQTLRDRGLKFVIHAHRAGHAAFTKNPRHGIAMRVIAKVARIIGVDQLHVGT VVGKMSETKQEVLENVETCK------MPMDGLKRVLPVASGGLHPRLVPALMATFGKDFV IQAGGGIHGHPNGTFAGATAMRQAVEATLEGKTLEDYAAAHKELKTALELWRD-- >rbg_19ft_combo_scaffold_2906_2Micrarchaeota -LRYVDLKYKPLETDLICSFFVEPD-GISLKEAAGGVAAESSVGTWTEL-TTVTPYVERL AARVFSIEGNVAKIAYPIELFEGGNMPNILSSVAGNVFGLRALRTLRLNNIEFPEKLLAS FKGPQFGINGIRKLLRVSGRPLVGTIIKPKLGLKTADHARVAYEAWAGGCDIVKDDENLS SQSFNPFEGRIVKTLESRDKAESETGERKVYMANITAETEIMLKRAEFVLEHGGEYVMVD ILTCGFSALQTLRDQDFKLVIHAHRAGHAAFTKNPKHGIAMRPIVKVARIIGIDQLHVGT VVGKMFETKQEVL--------------------------EAGL--SLLCQLLQ------- -----------A-------------DC--------------------THDWFQH- >RBG_16_scaffold_12677_9Micrarchaeota -VDFVDLKYQPSETDLVCTFYVEPD-GISLKEAAGGVAAESSVGTWTEL-TTEKPYVERL AARVFSIEGNTAKIAYPIELFEGGNMPNILSSVAGNVFGLRALGNLRLLDIE----LVKS FKGPRFGIGGIRKLLKVPIRPLVGTIIKPKLGLKTVDHAKVAYDAWAGGCDIVKDDENLS SQTFNPFEERVLKTLEMRDRAESETRERKVYMANITAETETMLKRAEFVLSHGGEYVMVD ILAAGFSALQTLRDQDFKLVIHAHRAGHAAFTKNPKHGIAMRVIAKVARVIGVDQLHVGT VVGKMAETKQEVLENVAACK------APLNGLEAVLPVASGGLHPRLVSALMETFGNDFV IQAGGGIHGHKNGTYAGATAMRQAVDATMEGKRLEEYAETHGELRLALELWKNN- >RifSed_csp2_16ft_2_scaffold_576471_1Micrarchaeota -VDFVDLKFQPSETDLICTFYVEP-DGISLKEAAGGVAAESSVGTWTEL-TTVKPYVERL AARVFSIEGNTAKIAYPIELFEAGNMPNILSSVAGNVFGLRALKNLRLLDIALPSKLVKG FRGPRFGIAGIRKLLNVPERPLVGTIIKPKLGLKTVDHAKVAYDAWAGGCDIVKDDEKQS SQTFNPFEERVVKTLESRDKAESETGERKVYMVNVTAETETMLKRAEFVLAHGGEYVMVD ILTCGFSALQTLRDKNYKLVIHAHRAGHAAFTKNPKHGIAMRPIVRVARIIGVDQLHVGT VVGKMF------------------------------------------------------ -------------------------ET---------------------------- >rifcsphigho2_01_scaffold_100504_5Micrarchaeota -MSYINFKYRPN-DDIVCEFYAEPAKDMTMKRAAENVAAESSTGTWTEV-ATSKLYMKKL AAKVFEIKGNKIKVAYPLELFELGSVPQLLSSVAGNIFGMKAVDNLRLEDIQFPKRYVKS FSGPKYGIRGIRKLLKIKKRPLVGTIIKPKLGLKTKDHAESAYNAWRGGCDIVKDDENLT SQNFNRFEDRIIKTLDKLDEAKAETGERKVYMPNVTAETEEMINRADFVKKHGGTYVMVD ILTCGWSSLQTLRNS-TKLVIHAHRAGHAALTRNRYHGISMLTIAKLCRLIGVDQLHIGT IVGKME-GGKEIIDIENEIAHDKVLKQHWFGMKPVFPVASGGLHPGHVPKLIEYFGKDVI IQMGGGIHGHPKGTFYGAKAARQAIDAAVQGKSLAEYAKTHKELKQAIEKWKADS >CG10_big_fil_rev_8_21_14_0.10_scaffold_160807_1Micrarchaeota YSGFVD--YRPKPTDLITELYIEPAKGIPFEQACSEAAAESSVGTWTDISTSTKRIEKQL KAKAFYIDKKIAKIAYPYSLFEEGNIPQALSSIAGNIYGMRSLRNLRMEDIDFPSKYMRS FKGPRYGIQGVRKITKVKKRPLVGTIVKPKLGLTEKQHAKVAYDAWKGGCDLVKDDENLS SLQFNKFGERVKETLKLRDKAEKETGERKFYMPNVTGETLQMLKRAEYVKKQGCEYAMVD IITCGWSALQTLRNENPGLILHAHRAGHGMFTENPKQGMSMLTVAKISRLIGVDQIHVGA VVGKMKGGRREIQ----------------------------------------LIG---- -----------E-----------NI----EKR----------------------- >CG10_big_fil_rev_8_21_14_0.10_scaffold_756_c_32Woesearchaeota -MKYLDLNYNPKSSDLICKFYLEA----NIKEAAGAVASESSTGTWTENV---PGEISNL SAKVFKIRGN-VYIAYPIELFEENNIPQILSSIAGNIFGMKILKNLRLEDIEIPKQIITK FKGPKFGISGIRKLLNVPSRPLVGTIIKPKLGLNAHEHSKRAFEAWLGGCDIVKDDENLG NQKFNSFEKRIKETLKLKKEAEKITGEKKVYMPNITAETEEMLKRARFVKQHGGTYVMVD VFSCGWSALQSLRKEKLNLVLHAHRAGHAALTRNEKHGISMLVIAKLCRLIGLDQIHIGT IIGKMEGGQ-EVRDIDQEIE-H-LLEQDWHNLKPIFAVCSGGLSPRDVPFLVKNLGKNII IQAGGGVHSHPSGTFAGAKAMRQAIEASTKNISLEIYAEKNKELETALKFFK--- >rifcsplowo2_01_scaffold_119419_3Micrarchaeota NDAYSGPKYVPR-ADLVAAFFMEPK-GVSFEEAAQAVASESSIGTWTDLSTLSERMKAEL HARVFSIDKR-VLIAYPLALFELGSIPQLLSSVAGNVFGMKEVKNLRLLDINFPNRYIRA FKGPRFGIEGVRRVLRVPKRPLLGTIVKPKLGLGPKQHALVAEQAWLGGCDLVKDDENLT SMRFNHFEKRVEETLKARANAEKVSGERKAYMPNVTAPFSEMMRRARFAKGQGCEYAMVD VLSVGWSALQDLR--DLGLILHAHRAGHAAFTRNPEHGISMLLVAKLCRLAGMDQLHVGA IVGKMEGGKREVKAIGEEIENH-ALSENWLHIKPVLAVCSGGLHPGKIPALVSAMGNDIV IQMGGGIHGHPLGTRHGAMAARQALEATVNGVSLNYAAMKHFELAIALRKWKG-- >rifcsplowo2_01_scaffold_58670_5Micrarchaeota ------------------------------------------------------------ -----------------------------MSSIAGNIFGMSLLNNLRLEDINFPNTYIRA FKGPKYGIPGIRKLLRVYKRPLVGTIIKPKLGLNEKEHSEVAAQAWLGGCDIVKDDENLS SMKFNKFYSRIAKTINLRDRCEKITGEKKIYMPNITAECDEMLRRAEFVKKVGCEYAMVD VLTVGWSALQKLRNENLKLVLHAHRAGHAAITRNPRHGISMLVIAKLCRLIGMDQLHIGA IVGKMEGGKREVQEIGEEIEAQHVLSENWLHIKPMFAVCSGGLHPGKVPALVDALGKDII IQMGGGIHGHPKGTFSGAMAARQAVEATMHGIALNNYAKYNKELALALKHWK--- >rifcsplowo2_01_scaffold_465272_1Micrarchaeota --------------------R--------------------------------------- -----------------------------------------------LEDINFPNSYIRA FRGPKYGIPGIRKLLRVYKRPLVGTIIKPKLGLNEREHSEVAAQAWLGGCDIVKDDENLS SMKFNKFYLRIAKTINLRDKCEKITGEKKIYMPNITAECDEMLRRAEFVKKVGCEYAMVD VLTVGWSALQKLRNEDLKLVLHGHRAGHAAITRNPRHGISMLVIAKLCRLIGLDQLHIGA IVGKMEGAKREVQEIGEEIEKQHVLAENWLHIKPMFAVCSGGLHPGKVPALMEALGKDIV IQLGGGIHGHPKGTFSGAMAARQAIEATMHNIPLKDYAKYNRELALALEHWK--- >rifcsphigho2_02_scaffold_38081_1Micrarchaeota -MDYTDLRYKPNGNDLIAEFRLEPARGVSFKEAAGAVAAESSVGTWT---QLTTITDKQI SAKVFSANEKIIKIAYPSELFEFGNIPQLMSSIAGNVFGMSLLNNLRLEDINFPNAYIRA FKGPKYGIPGIRKLLRVYKRPLLGTIIKPKLGLNEKEHAQVAYQAWLGGCDIVKDDENLS SMKFNKFYSRIAKTINLRDKCEKETGEKKIYMPNITAECDEMLKRAEFVKKVGCEYAMVD VLTVGWSALQKLRNEDLKLVLHAHRAGHAAITRNPRHGISMLVIAKLCRLIGLDQLHIGA IVGKMEGAKREVQEIGEEIEKQHVLAENWLHIKPMFAVCSGGLHPGKVPALMEALGKDIV IQLGGGIHGH--------------------------------------------- >rifcsplowo2_01_scaffold_68750_1Micrarchaeota --------------------QLTT-----IT----------------------DKRLKQI SAKVFNEKSGIIKIAYPSELFEFGNIPQLMSSIAGNVFGMSLLNNLRLEDINFPNAYIRA FKGPKYGIPGIRKLLRVYKRPLLGTIIKPKLGLDEKEHAQVAYQAWLGGCDIVKDDENLS SMKFNKFYSRIAKTINLRDKCEKETGEKKIYMPNITAECDEMLKRAEFVKKVGCEYAMVD VLTVGWSALQKLRNEDLKLVLHAHRAGHAAITRNPRHGISMLVIAKLCRLIGMDQLHIGA IVGKMEGGKKEVQGIGEEIEKQHVLAENWLHIKPMFAVCSGGLHPGKVPALVNALGKDII IQMGGGIHWNPRGSYYGALGARQALEATKGGF--------------G-------- >CG10_big_fil_rev_8_21_14_0.10_scaffold_63175_3Micrarchaeota ------LGYKPKSADLTAEFFLEPVKGISFSEACQAVASESSIGTWTDIATMSPAIRKKL SPKVFEANKKIIKIAYSPELFEKGNIPQLMSSIAGNIFGMKEVQNLRLEDINFPDAYIKS FRGPAFGIKGIRKILKVPKRPLLGTIVKPKLGLNAKQHAEVAFQAWIGGCDIVKDDENLS SMSFNKFEKRVKHTLRLRDKAEKITGERKAYVANVTAPYKEMVRRAKFLKKAGNEYAMVD VVTTGWSALQELRNENLGLILHAHRAGHAAFTRNPKHGISMLAIAKLCRLCGTDQLHVGA IFGKMTGPKKEVKAIREEIEKQHVLGENWLNIKPMFAVCSGGLHPGKVPGLVNAMGNDII IQMGGGIHGHPKGTIKGAMAARQSLESALQGISLKDAAEKNSELRIALKKWKE-- >gwa2_scaffold_48274_5Micrarchaeota NLAYSGIGYKPSPKDLVAEFFLEKGRGVSFREAAQAIASESSIGTWTEISTMKPEIKKSL SPKIFEMSGT-VKIAYPMELFEKGNIPQLLSSIAGNIFGMKEAKNLRLEDINFPDEYINA FKGPKFGIAGIRKTLNIYNRPLLGTIIKPKLGLNPEEHAKVAFEAWLGGCDLVKDDENLT SMAFNNFERRVKETLKMRAKAEKMTGERKAYMANVSAPYKEMVRRAKFLKKQGNEYAMVD IVSIGWSALQELRNEELGLILHAHRAGHAAFTRNPKHGISMLAIAKLCRLCGLDQLHIGA IFGKMTGARFEVQKIGEEIEGGHVLAENWLHLKPMLAVCSGGLHPAKVPGLVNALGKDIA IQMGGGIHGHPMGTMKGATAARQAIEAAVHKIPLHEFAKNHFELQAVLRQWK--- >cg1_0.2_scaffold_83715_1Micrarchaeota MQGYLNLGYEPTKEDLICEFYIEPNRIT-FEKACEEIAAESSIGTWTDICTMNRKIGEKL GPRIFDKKKKIAKIAYPGELFEPGNMAQIYSSIAGNIFGMKCLNNLRLIDIQFPEYLRNS FPGPKYGLHEIRKSMNIVNRPLLGTIIKPKVGLDPLMHAQVAYNAWVGGCDVVKDDENLT NQSFNPFEKRIVETFRRREMAERATGERKIYLANITAETREMIRRARFVKRHGGEFVMID VLTAGFSALQTIRNLDLGVAIHAHRAMHAAITRNPKHGITMLALAKSYRLVGVDSLHIGT AVGKMEGSKVDVAEIEEEVERKDVLVQDWGKIKPVLAVCSGGLYPNLMPSLIKNMGTDIL IQAGGGVHGHPDGTTAGATAMRQAIFAAQYGYSLKEYAKDHPELAKALKKWKDIK >rifcsplowo2_01_scaffold_269943_3Pacearchaeota --------FSALDEKGLVIHH----------RAMHAALTKKPNGIS-MLSLAKFARLSQL HIGTVKMGGSEIKNALEENLF---GLKSTLSVCSGG-LSTSNIPAIKFFGNNISGGGIHA HPGTFSGAKALRQSVD----SVLKGIP---IKEYSKNHSLFIFGLVLGTINSVTGDAIIT YKAFG------------MNELQQKRGSNPVTIIRIENTT----------IPAGG-YLRLE VD----PGIRGAR-----------------------KDIRIYKINELGKGIAKKGIGGGN AISNF--CSQGITKCFDTVKRPIRTGSNW---------KPGLYAVKIFDYYLNDYASEFT IE-----YTKP------------------KGF--------------T-------- >CG_2015-01_scaffold_106789_1Micrarchaeota VIEKSIIDVDIGNEDKLIKFLEKNNVELGSEKACEEIAAESSIGTWTDICTMNRKIGEKL GPRIFFIDKKIAKIAYPGELFEPGNMAQIYSSIAGNIFGMKCLNNLRLIDIQFPEYLRNS FPGPKYGLHEIRKSMNIVNRPLLGTIIKPKVGLDPLMHAQVAYNAWVGGCDVVKDDENLT NQSFNPFEKRIVETFRRREMAERATGERKIYLANITSETREMIRRARFVKRHGGEFVMID VLTAGF------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------- >rifcsphigho2_02_scaffold_50705_4Pacearchaeota -MAYKDLKFKPDKKGLICTYYIEP----SLQETAGGVAAESSTGTWIDVK-TKQNYVKKL DATVFSIKKQ-IKIYYPSGLFEKGNMPNILSSIAGNIFGMEEIKNIKLLDISFPNEIARS FAGPKYGIQGIRKIMKVHNRPLLGTIIKPKLGLKTGHHAKVAYESWLGGCDVVKDDENLS SQKFNRFEDRLKETLKMKERAEKETGSKKAYLVNVTAETKEMLRRAKLAENYGNEYIMID VLTAGFSALQTLRQENFDLIIHAHRAGHAAMTKNHKHGIGMNVLVKMLRLVGVDQLHVGA AVGKMF-ETKNFK----------------FKTSHACCIWRLGNK---------RYS-SCC QNFWKGC-CYPDGRGNSFTSARH----FCRG-KGSERSD---------------- >rifcsplowo2_01_scaffold_0_375Pacearchaeota -MAYKDLKFKPD-KKLICTYYIEPL-HLSLQETAGGVAAESSTGTWIDVK-TKQNYVKKL DATVFSIKKQ-IKIYYPSGLFEKGNMPNILSSIAGNIFGMEEIKNIKLLDISFPNEIARS FAGPKYGIQGIRKIMKVHNRPLLGTIIKPKLGLKTGHHAKVAYESWLGGCDVVKDDENLS SQKFNRFEDRLKETLKMKERAEKETGSKKAYLVNVTAETKEMLRRAKLAENYGNEYIMID VLTAGFSALQTLRQENFDLIIHAHRAGHAAMTKNHKHGIGMNVLVKMLRLVGVDQLHVGA AVGKME-TSAEVKENILAAK------EKTFNLKPVMPVASGGLAIKDIPAVVKIFGKDVV IQMGGGIHSHPQGTFAGAREAREAIENAV-------------------------- >tara_MHASMcontig_152320_17Pacearchaeota -MVKNTLKYKPNSEDLVCLFRVTPK--MSLKEAANTVALESSIGTWTHV-SSNKSYVNRL GAKVFSINKT-CKIAYPSVLFEDGNMPDLLSSIAGNVFGMKAVKKLRLEDVDFPSRYIKT YPGPRHGIGGVRKILGVKNRPLVGTIVKPKLGLKTKDHAKVAYDAWLGGCDVVKDDENLS SQKFNPFDKRAVQTLRMMDKAAEETGEKKVYLINVTAEALEMVRRADFAKEHGANYLMHD ILTAGFSSLQTLRRST-KLPIHAHRAMHGALTEDPKHGISMMSIADFARLCGVDTLHIGT GIGKMKGGWKEVEEIREEIE-G-RLKEDWVGMKPVMAVCSGGIYPGHIPFLINHYGKDIV IQGGGGVHWNPRGSKYGAMGMRQAVDAVMKGKTLKEYSKNHKELREALDHFGYK- >tara_MHASMcontig_158273_9Pacearchaeota VRAKFALNYKPKKTDLVCLFRVSPK-GLSLEKAANIVALESSTGTWTPV-SSSKRYVNKL AARVFSI-SK-VKIAYPGRLFEGDNAPNILSSVAGNIFGMKAIQGLRLEDIDFSNDLIKT FDGPRYGIEGIRKILKIKNRPLVGTIVKPKLGLKTEDHAKVARAAWVGGCDLVKDDENLA SQDFNEFDKRAARTLEMADLAEAETGEKKGYLINVTAETKDMLERADVVRELGGRFVMHD FITAGFAAFQTLRK-NTKLAIHAHRAMHGAFTENPKHGISMMVMADFARLIGADSLHIGT GIGKMRGGVEEVEDIREEIE-F-RLSEKWAGKKPVMAVCSGGLYPGHVPYLIKHFKKDII IQAGGGIHGHPQGTKAGAMAMRQAVDAVMKKKTLERWVEKEKALGTALEFWGRV- >rifcsphigho2_01_scaffold_12654_5Amesbacteria RFIHLG--YKPK-EDIVCLFRIEPEKGLKIDEAADTVALESSIGTWTDVN--MPKHVEKL GAKVFEIHGNLVKIAYPYELFEKGNVPNILSSIAGNIFGMKSVENIRLEDVSFPKKILKD FKGPEFGIKGIRKILRVWNRPLVGTIIKPKLGLTSKEHAKSAYESWRGGCDAVKDDENLS SQEFNKFESRLVNTLEMKDRAEHETGEKKAYLMNVTAEANEMIKRAERIKEQGGKFAMVD VLTVGWGALQSLRQANLKLALHGHRAMHAAFDRNSKHGINMIVLADFCRLIGIDSLHIGT GIGKLEGDIKDIIELEEEIETKYRLTQNWHNLKPTLAVCSGGLHPGHVPFLIRHLGKDIL IQAGGGIHWNPKGSYYGALGMRQAVESVMKGISLREYSRDHTELRMALDKFGYI- >rifcsplowo2_01_scaffold_5197_14Pacearchaeota -MKPEDLKYRPKEDDLICLFKVEPK-GITIKNAAANVALESSIGTWVEV-STEKQYMKKL AAKVFSI-KE-VKIAYPSELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKFPTGLLKS FPGPKFGIQGIRKVMKINSRPLIGTIIKPKLGLKPKDHALSAYNSWVGGLDLVKCDENLA SQKFNEFEKRLSITLEMADKAEEKTGERKGYLENVTAETKEMIKRAQLVENMGGKYIMVD ILTEGWGAVQTLREANFNLIIHAHRAFHAAFTRNKKHGMSMMVVADLVRIIGLDQIHIGT GIGKLQGDIRDIKEIEEEIE-D-RLNQDWGKIKPTLAVCSGGLQPAHIPFLINHLGKDIV IQLGGGVHGHKDGSYAGAKASRQALDATIKKISLRDYSKDHLELKTALEQWGYA- >CG10_big_fil_rev_8_21_14_0.10_scaffold_7288_19Amesbacteria EWNYLKIGYKPR-DDVVCLFRIEPGENLTLKEAAETVALESSTGTW-----TKIPGREKL NAKVFSIKGNFVKIAYPSALFERGNAPNILSSIAGNIFGMKAVRNLRLEDVSFPKNLVKS FPGPYYGIEGIRKLLKVYNRPLLGTIIKPKLGLPPKEHALSAYNSWRGGLDLVKCDENLV SQDFNRFEERLSKTLEMKDKAEEETGERKGYLENVTAETNEMIRRAELIKSLGGKFCMVD ILTEGWGAIQTLREANFKIAIHGHRASHATMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGDIKDVYELAEEIETKLRLKQDWYGKKPVLAVCSGGLTPLHFPYLIRHLGKDII LQCGGGIHGNKLGTISGAKAARQAVDATMQGIGLKEYAKTHSELKSTIEQWG--- >rifcsphigho2_01_scaffold_437666_1Pacearchaeota -------------KDLICLFRLEP----TIKEVAASVALESSVGTWTKL-NTEKEYMKKL GAKVFSIKGNMIKIAYPGDLFEPGNIPNILSSIAGNIFGMKDVKNLRLEDVRFPSYLLKS FPGPRYGIEGLRKLMKIKNRPFVGTIVKPKLGLNSKDHAEVAYEAWMGGCDFVKDDENLS SQSFNKFEKRLAKTLEVSDKAESITGEKKAYLVNVTAETKVMLKRAQLVEDQGGKYVMID ILTEGFGAVQTLRQEGFKLAIHGHRAMHAALTRNPKHGISMMTLADFSRIVGIDSLHIGT GIGKLEGNIKEIKEIEEEIEIENRLSQNWGKINSCIGVSSGGLHPGYVPFLMKNLGKDIV LQFGGGIHGHPKGTLRGAIAARQAVEATMNGISLQNYSK-------N-------- >rifcsphigho2_01_scaffold_97976_8Pacearchaeota PSDFVNLKYKPKSSDLICLFRVEPNKMS-IKEASATVALESSVGTW-----TDIKLNKKL KAQVFSIKNNYVKIAYPAILFEKGNTPNILSSIAGNIFGMKAVKNLRLEDIKIPKELLDS FYGPQFGINGIRKFMKIKNRPLIGTIIKPKLGLNPYEHAKSAYESWIGGCDLVKSDENLA SQKFNEFEERLARTLEYCSKAEEETGMKKGYIENVTAETKEMMKRAQLVEDLGGKYVMID MITAGFSALQSLREADFKMAIHAHRAMHAAMTRNPKHGISMMVLADLARLIGVDQLHIGT GVGKLEGNLNDIEELVEEIETKQRLTQDWGKIKPVLAVSSGGLCPLDIPVLIKNLGKDVA IQFGGGIHSHPSGTIAGARAARQALDAAMQGISLKKYAETHKELKGAIGKWGKKE >rifcsplowo2_12_scaffold_170419_2Amesbacteria -----------------------------ME---------------------------RL AAKVFSIRGNLVKIAYPAELFESGNAPNILSSIAGNIFGMKAVKNLRLEDISFPKSIIRG FKGPVLGMDGIKKLMKIKDRPLLGTIIKPKLGLKTKDHAKNAYESWIGGCDFVKDDENLA SQKFNEFEERVARTLEKASQAGEETGEKKAYLVNVTAETNEMLRRAQIVKYLGGKFIMVD MMTAGFSALQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGGISDVYELEEEIETQLRLSQNWRRIKPVLGVCSGGLHPRHVPYLIRHLGRNIL IQAGGGVHWNPRGSRYGAMGMRQAIDAVMKGISLREYAETHIELKEALEKFTEK- >rifcsplowo2_02_scaffold_55575_5Amesbacteria ---MGDIGYKPK-NDVVCLFRVEPAKGMRITTAAKTVALESSTGTWTELG-TKKKYMERL AAKVFSIRGNLVKIAYPAELFESGNAPNILSSIAGNIFGMKAVKNLRLEDISFPKSIIRG FKGPVLGMDGIKKLMKIKDRPLLGTIIKPKLGLKTKDHAKNAYES--------------- ----------VARTLEKASQAGEETGEKKAYLVNVTAETNEMLRRAQIVKDLGGKFIMVD MMTAGFSALQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGGISDVYELEEEIETQLRLSQNWRRIKPVLGVCSGGLHPRHVPYLIRHLGRNIL IQAGGGVHWNPRGSRYGAMGMRQAIDAVMKGISLREYAETHLELREALEKFAER- >gwa2_scaffold_13440_1Amesbacteria DYEYLNIGYKPK-NDVVCLFRVEPAKGMRITTAAKTVALESSTGTWTEL-GTKKKYMERL AAKVFSIRGNLVKIAYPAELFESGNAPNILSSIAGNIFGMKAVKNLRLEDISFPKSIIRG FKGPVLGMDGIKKLMKIKDRPLLGTIIKPKLGLKTKDHAKNAYESWIGGCDFVKDDENLA SQKFNEFEERVARTLEKASQAGEETGEKKAYLVNVTAETNEMLRRAQIVKDLGGKFIMVD MMTAGFSALQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGGISDVYELEEEIETQLRLSQNWRRIKPVLGVCSGGLHPRHVPYLIRHLGRNIL IQAGGGVHWNPRGSRYGAMGMRQAIDAVMKGISLREYAE-------T-------- >rifcsplowo2_01_scaffold_141933_1Amesbacteria DYEYLNIGYKPK-NDVVCLFRVEPAKGMRITTAAKTVALESSTGTWTEL-GTKKKYMERL AAKVFSIRGNLVKIAYPAELFESGNAPNILSSIAGNIFGMKAVKNLRLEDISFPKSIIRG FKGPVLGMDGIKKLMKIKDRPLLGTIIKPKLGLKTKDHAKNAYESWIGGCDFVKDDENLA SQKFNEFEERVARTLEKASQAGEETGEKKAYLVNVTAETNEMLRRAQIVKDLGGKFIMVD MMTAGFSALQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGGISDVYELEEEIETQLRLSQNWRRIKPVLGVCSGGLHPRHVPYLIRHLGRNIL IQAGGGVHWNPRGSRYGAMGMRQAIDAVMKGISLREYAETHLELREALEKFAER- >KKS31468.1_Ribulose_bisphosphate_carboxylase DYEYLNIGYKPK-NDVVCLFRVEPAKGMRITTAAKTVALESSTGTWTEL-GTKKKYMERL AAKVFSIRGNLVKIAYPAELFESGNAPNILSSIAGNIFGMKAVKNLRLEDISFPKSIIRG FKGPVLGMDGIKKLMKIKDRPLLGTIIKPKLGLKTKDHAKNAYESWIGGCDFVKDDENLA SQKFNEFEERVARTLEKASQAGEETGEKKAYLVNVTAETNEMLRRAQIVKDLGGKFIMVD MMTAGFSALQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGGISDVYELEEEIETQLRLSQNWRRIKPVLGVCSGGLHPRHVPYLIRHLGRNIL IQAGGGVHWNPRGSRYGAMGMRQAIDAVMKGISLREYAE-------T-------- >MW-5_scaffold_28295_1Amesbacteria DYEYLKLGYEPR-NNIVCLFRIEPAKGLSVIEAAKNVALESSTGTWTKV-GTEKSYMNAL AAKVFSIKGNLVKIAYPSALFERGNVPNILSSVAGNIFGMKAVKNLRLEDISFPKSIIKG FKGPVFGTKGIRKIMKIKNRPLLGTIIKPKLGLKTKDHAKSAYESWIGGCDFVKDDENLA SQKFNEFEERVARTLEKANKAEEETGERKAYLVNVTAETNEMLRRAQLVKELGGKFIMVD IITAGFAGLQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGSISDVHELEEEIETQLRLKQDWHRIKPVLGVCSGGLHPGHFPYLIRHLGKDII LQCGGGIHGHPSGSMAGARAARQAIEATMQGISLKEYSKNHLELREALEKFEKAD >rifcsphigho2_02_scaffold_363598_1Amesbacteria DYEYLKIGYEPR-NDIVCLFRIEPAKGLSVIEAAKNVALESSTGTWTKV-GTEKSYMKAL AAKVFSIKGNLVKIAYPSALFERGNVPNILSSVAGNIFGMKAVKNLRLEDISFPKSIIKG FKGPVFGTKGIRKIMKIKNRPLLGTIIKPKLGLKTKDHAKSAYESWMGGCDFVKDDENLA SQKFNEFEERVARTLENANKAEEETGEKKAYLINVTAETEEMLRRAQLVKELGGKFIMVD IITVGFAGLQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGSISDVHELEEEIETQLRLKQDWHRIKPVLGVCSGGLHPGHFPYLIRHLGKDII LQCGGGIHGHPSGSMAGAKAARQAIEATIQGINLK-------------------- >rifcsplowo2_01_scaffold_130404_3Amesbacteria DYEYLKLGYEPR-NNIVCLFRIEPAKGLSVIEAAKNVALESSTGTWTKV-GTEKSYMNAL AAKVFSIKGNLVKIAYPSALFERGNVPNILSSVAGNIFGMKAVKNLRLEDISFPKSIIKG FKGPVFGTKGIRKIMKIKNRPLLGTIIKPKLGLKTKDHAKSAYESWMGGCDFVKDDENLA SQKFNEFEERVARTLENANKAEEETGEKKAYLINVTAETEEMLRRAQLVKELGGKFIMVD IITAGFAGLQTLRNANLKLAIHAHRAMHAAMTRNPKHGIRMIVLADFARLIGVDSLHIGT GIGKLEGSISDVHELEEEIETQLRLKQDWHRIKPVLGVCSGGLHPGHFPYLIRHLGKDII LQCGGGIHGHPSGSMAGAKAARQAIEATMQGINLKEYSKNHLELREALEKFEQAE >rifcsphigho2_01_scaffold_66616_16Pacearchaeota MKRGLELKYKPNKKDLICLFRVEP--RISLKKAANTVALESSVGTWTDV-SSEKEYVKKL RAKVFSIEKN-VKIAYPSELFEKGNAPNILSSIAGNIFGMKAVKNLRLEDVSFPRTILSS FSGPKYGISGIRKMLKIYDRPLLGTIIKPKLGLKTEDHAKVAYGAWKGGCDLVKDDENLS SQKFNIFEERIARTLEMQNKAEEETGEKKAYLVNVTAETKEMLRRAELVEDLGGKYVMLD ILTAGFSALQTLRQANFKLAIHAHRAMHAAFTRNKKHGISMMTIADISRLIGVDSLHIGT VIGKLEGNLIEVSQLKDEIE-I-RLSQDWYNIKPVFAVCSGGLHPLHFPYLIRHLGRDII LQCGGGLHGHPSGTIWGARAARQAIEATTQGISLKEYAKGHKELKQAIDFWRK-- >rifcsplowo2_01_scaffold_42386_8Pacearchaeota DMNFINLKYKPKEKDLICLFRVEPSGNVSVKKAADTIALESSTGTW----TEVKTRKSKL GAKVFSIKGNLIKIAYPSELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDVKIPDEILKS FSGPKYGIEGIRKMMKIKDRPLLGTIVKPKLGLKTRDHAKVAYDAWAGGCDIVKDDENLA SQSFNTFEERVARTLEKANQAEEETGEKKAYLINITAETEEMLKRAELVEELGGKYVMLD IITAGFSALQTLREANFKLAIHAHRAMHAAFTRNKKHGIAMMVLADFARLIGVDQLHIGT VVGKLEGTLREVSELREEIETKDRLSQDWKKIKPVLAVSSGGLHPGHIPYLINHLGKDIV IQMGGGVHGNKLGTIAGARAARQAVDAVMQKIPLKKYAKDHLELKTALDQWGLVR >rifcsphigho2_01_scaffold_526185_2Pacearchaeota ------LRYKPSKEDLICLFRIEPSRGLTIKKVAETVALESSIGTW----VDVKTSKRRL AAKVFSIKGNFVKIAYPSELFEKNNVPNILSSIAGNIFGMKAIKNLRLEDVSFPKSILKS FKGPRFGISGIRDMMKVYNRPLVGTIVKPKLGLKTLDHAKIAYESWVGGCDLVKDDENLA SQRFNIFEERVARTLEKADKAEEETGERKAYLINITAETNEMIRRAELVEKQGGKFIMVD VVTEGFGALQTLRNANFKLAIHAHRAMHAAFTRNKKHGISMIVLADFLRLIGVDTLHIGT VVGKLEGGLEEVSNLKEEIESKIRLTQDWGKIKPIMAVSSGGL----------------- ------------------------------------------------------- >rifcsphigho2_01_scaffold_8061_16Pacearchaeota KFEFIDLNYKPR-DDIVCLFIVEPNKVS-IKKAANTIALESSIGTWTEV-STKKSYVNKL AAKVFSINGKKVKIAYPSELFEKGNVPNLLSSIAGNIFGMKIVKNLRLEDISIPKKILNS FSGPKYSINGIRKIMKIPKRPLIGTIIKPKLGLKTIDHAKIAYEAWIGGCDIVKDDENLA GQKFNDFEERIARTLERLNQAEKETGEKKAYLANVTAETKEMLKRAQLVEDLGGKFIMVD VVTEGFGALQTLREADFNLAIHAHRAMHAAFTRNKKHGISMMVLADLVRMIGVDTLHIGT VVGKLEGTLDEVSEIKEEIETKIRLEQNWGKIKPVLAVSSGGLTPLHFPYLIRHLGKDII LQCGGGIHGNWLGTKQGAIAARQSIDAIMRKVSLREYAKDHFELKSTIDQWKIKE >rifcsplowo2_01_scaffold_57929_6Pacearchaeota KFEFVNLHYKPKSSDLICLFRVEPAKRISVKEAANTIALESSIGTW----TDVSTKRKKL AAQVFSIKGKMIKIAYPSALFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPSEILSG FKGPKYGIDGIRKFMKIYNRPLLGTIIKPKLGLKTKDHAEVAYESWVGGCDVVKDDENLA SQRFNVFEERIARSLEAANKAEEKTGEKKAYLVNVTAETKEMLKRAQLVEDLGGKFIMVD VITEGFGALQTLREANFNLAIHAHRAMHAAFTRNKRHGISMMVLADIIRLIGVDTLHIGT VVGKLEGTLQEVSELEEEIETKDRLSQNWKNIKSVLAVSSGGLHPGHVPFLIKHLGKNLV IQMGGGIHGHPNGTLAGATAARQSIESALKKISLKRYAETHKELKTALKQWTNF- >rifcsplowo2_01_scaffold_294211_1Pacearchaeota ------LNYKPK-DDLICLFRIEP----SVKKAANNVALESSVGTWTDVK-TEKQYVSKL SAKVFSIRGREVKIAYPSALFEKGNAPNILSSIAGNIFGMKSVKNLRLEDISIPYKILFG FKGPKYGIKGIRKILRIKKRPLVGTIIKPKLGLKTHDHAQVAYEAWKGGCDLVKDDENLA SQKFNVFEERLARTLERKNIVEEETGEKKAYLINITAETREMMKRAQLVEDLGGNFVMLD ILTSGFSALQTLREADFKLAIHAHRAMHAAFTRNEKHGIAMMVLADIARLIGVDSLHIGT GIGKMEGGIKEVYELEEEIETHIRLSQNWGKIKPVMAVCSGGLHPGYVPYLIKNLGNDII IQAGGGIHGHPSGTIWGARAMRQ-------------------------------- >rifcsplowo2_01_scaffold_41118_9Pacearchaeota PSNFVNLKYKPKESDLVCLFRVEPNKMS-IKEASATVALESSVGTWV---ETEKEYVRKL AAKVFSINGNWVKIAYPFQLFEKGNAPNILSSIAGNIFGMKAVKNLRLEDIKIPKELLNS FYGPKFGIAGIRKFMKIKKRPLVGTIVKPKLGLKTKDHADVAYEAWLGGCDLVKDDENLS SQKFNEFEERIARTLEKANQAEEQTGEKKAYLVNVTAETKEMIKRAQIVQDLGGKFIMVD VVTEGFGALQTLREADFKMAIHAHRAMHAAFTRNKKHGISMMVLADLVRLIGVDTLHIGT VVGKLEGSLEEVSEIEEEIETKKRLEQNWGKIKPIMAVSSGGLHPGHVPFLVKHLGKDLV IQAGGGCHGHPLGTSAGAKALRQAVDAAMEGITLKRYAENHSELRIALEEWM--- >rifcsphigho2_01_scaffold_11863_1Pacearchaeota PIDFLNLKYKPKSSDLICLFRVEPNKVS-LKEASENVALESSVGTWV---STEKEYMKKL GAKVFSIKGNHVKIAYPSELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPDEILNS FSGPKYGIDGIRKIMRIYDRPLIGTIIKPKLGLKTKDHAKVAYEAWTGGCDLVKDDENLS SQKFNQFEERLARTLEKADKAEEETGEKKAYLVNITAETKEMMKRAELVEQLGGKFVMID VVTEGFGALQTLREADFKMAIHAHRAMHAAFTRNPKHGISMMVLADIVRLIGCDSLHIGT VVGKLEGSLNEVSEIEEEIETKIRLEQDWEKIKPVMAVSSGGLHPGHVPFLIKHLGKDLI IQFGGGIHGHPFGSSAGARAARQAINATIEGISLKEYEKDHSELRAALEKWVKNR >rifcsplowo2_01_scaffold_474979_2Pacearchaeota -----------------------------MK----------------------------- ---------------------------------------------VK------------- ------------------DRPLLGTIIKPKLGLKTKDHAKVAYDAWAGGCDIVKDDENLS SQKFNVFEERVARTLEKANKAEEETGEKKAYLVNVTAETKEMIKRAQIVEDLGGRFVMVD VVTEGFGALQTLREADFKLAIHAHRAMHAAFTRNKRHGISMMVLADLARLIGVDTLHIGT VVGKLEGSLEEVSEIEEEIETKNRLMQDWKNIKPVMAVSSGGLHPGHVPYLMKHLGKDLI IQMGGGIHGHPSGSMAGAMAARQAIEAAMQNISLKRYAKDHLELRGALEKFTKK- >rifcsplowo2_01_scaffold_224073_1Pacearchaeota KLEFINLNYKPKSSDLICLFRVEPNRIS-VKKAANTIALESSIGTW----VPVKTKKQKL AAKVFDISGKQIKIAYPSGLFEKGNAPNILSSIAGNIFGMKAVKNLRLEDISIPKDILKS FPGPKYGIKGIRKMLKIKKRPLLGTIIKPKLGLKTVDHAQVAYEAWLGGCDICKDDENLS SQRFNVFEERLARTLERQNQAEEETGEKKAYLINVTAETKEMMKRAQLVEDLGGKFIIVD AVTVGFSALQTLREADFKLAIHSHRAMHAA------------------------------ ------------------------------------------------------------ ------------------------------------------------------- >rifcsplowo2_01_scaffold_309117_2Pacearchaeota GLEFVDLKYKPKNTDLICLFRVEPAGKLKLKNVANTIALESSTGTWV---STEKRYVKDL RAKVFEIKGNFVKIAYPSDLFERGNAPNILSSIAGNIFGMKVIKNLRLEDIKIPKNILVG FKGPKYGIKGIRKFMKIKKRPLLGTIVKPKLGLRTADHAQVSYDAWVGGCDIVKDDENLS SQKFNIFEERAARTLEKANKAEEETGKKKAYLINITSETNEMLKRAQLVEDLGGKYVMID IITSGFSALQTLREQNFKLAIHAHRAMHAAFTRNKKHGINMMVLADLCRLIGVDQLHIGT VVGKLEGSIEEVSELKEEIETKIRLSQNWWKIKPVLAVSSGGLHPGHVPYLI-------- ------------------------------------------------------- >rifcsphigho2_01_scaffold_41251_2Pacearchaeota PKDFVDLNYRPTGKDLICLFRVEPGPGISLRKAAENIALESSVGTWTDVH--EKKYVRKL GAKVFSINGDMIKIAYPEELFEKGNAPNILSSIAGNIFGMKIVKNLRLQDIKIPKDILNS FPGPKYGIKGIRKFMKIKDRPLVGTIVKPKLGLRTKDHAEIAYESWQGGCDIVKDDENLS SQRFNQFEERLARTLEKAHKAEEQTGEKKAYLVNVTAETKEMMKRAQLVEELGGKYVMID IVAAGFSALQTLREADFKMAIHAHRAMHAAFTRDKKHGISMMVLADIARLIGVDQLHIGT VVGKLEGSLNEVSKLNEEIETRDRLSQNWGEIKPVLSVSSGGLHPGHIPYLVEHLGKDIV MQFGGGIHGHPNGTLRGAIAVRQAVDAVMQGISLKKFSENHVELKDALEFWK--- >rifcsplowo2_01_scaffold_51839_11Pacearchaeota PKDFINLKYKPNSDDLVCLFRVEPNKIS-VKEAAATVALESSVGTW----TDV-HEKEKL GAKVFWINGNMIKIAYPSELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPSGILNS FSGPKYGIDGIRKMLKIKTRPLVGTIVKPKLGLKTRDHAEISYEAWLGGCDVVKDDENLS SQKFNEFEERLARTLEKADLAESQTGEKKAYLVNVTAETKEMIKRAQLVEELGGKYVMID IVTAGFAALQTLREANFKMAIHAHRAMHAAFTRNKKHGISMMVLADIARLIGVDQLHIGT VVGKLEGSLKEVSEINEEIETKDRLSQNWGRIKPVLSVSSGGLHPRHIPYLIKHLGKDVV MQFGGGIHGHPKGTLRGAIAVRQAVDAVMQGISLKKFSENHVELKDALELWK--- >cg1_0.2_scaffold_82321_1Pacearchaeota ------------------------------------------------------------ -------------IAYPEILFEKGNAPNILSSIAGNIFGMKAIKNLRLEDISIPKNILNS FSGPRYGIQGLREMMKIKKRPFVGTIVKPKLGLKTKDHAEVAYEAWVGGCDFVKDDENLA SQKFNQFEERFARTLEKANKAEQETGEKKAYLVNVTAETKEMMKRAQLVEDLGGKYVMID IVTAGFSALQTLREANFKLAIHAHRAMHAAFTRDKKHGISMMVLADLARLIGVDTLHIGT VVGKLEGTLKEVYEIDEEIETHDRLKQDWGKIKPVMAVSSGGLHPGHVPYLIKHLGKDLV IQLGGGIHGHPNGTFRGAIAARQAVEATLRNISLEKYSKDHIELRDALETWEK-- >RBG_13_scaffold_1813_22Pacearchaeota PSEFINLKYQPKENDLICLFRIEPNKVS-LKEAAANVALESSIGTW----AEVKSEKDKL AAKVFSIKGNLVKIAYPSCLFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEILNS FLGPKYGIEGIRKIMKIKERPLLGTIIKPKLGLRTKDHAKIAYEAWVNGIDATKDDENLS GQRFNEFEERIARTLEMANKAEEQTGEKKAYLVNVTAETKEMMRRAQLVEDLGGKFVMVD VVTEGFGALQTLREADFKLAIHAHRAMHAAFTRNKKHGISMMVLADLCRLIGMDTLHIGT AVGKLEGSLKEVSEIEEEIETKDRLSQDWGKIKPTLAVSSGGLHPGNISFLVKHLGKDVL LQFGGGLHGHKLGTAAGARAVRQALNATLEGISLKQYSKDHSELRIALEQWSSVK >rifcsphigho2_01_scaffold_156759_4Pacearchaeota -MEFVNLKYKPKSSDLVCLFRVEPNKVS-MKEAAKNIALESSTGTW----AELKTEQKKL AAKVFSIKGNLVEIAYPSELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEILNS FKGPHFGISGIRKMMKVKNRPFLGTIIKPKLGLKTRDHVKVAYDAWLGGCDIVKDDENLA SQKFNQFEERLARSLEMANKAEEETGEKKAYLINVTAETKEMMKRAQLVEDLGGKFVMVD VVTEGFGALQTLREAEFKLAIHAHRAMHAAFTRNKKHGIAMMVLADIIRVIGVDTLHIGT VVGKLEGTLQEVSEIEEEIETKLRLEQNWQKIKPILAVSSGGLHPGHVPYLIKHLGKDLV IQMGGGIHGHPNGTLRGAIAARQAIEATLKGISLKEYSNKHLELRDALKLWQK-- >rifcsplowo2_01_scaffold_183449_2Pacearchaeota ESEFLNLKYKPRESDLICLFRVEPARGVGIKKASEDIALESSIGTW----AEVKTEKEKL AAKVFSICGKMVKIAYPSALFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPEEIINS FYGPQFGIPGIRKMLKVYNRPLLGAIIKPKLGLKTKDHAKIAYEAWLGGCDLVKDDENLA GQKFNGFEERLARSLEKADKAEEETGEKKAYLINVTAETKEMMKRAQLAEDLGGKFVMID VVTEGFGALQSLREADFKLAIHAHRAMHAAFTRNKKHGISMLVLADILRLIGVDSLHIGT IVGKMAGDKKEVEELEEEIETKDRLSQNWHNLKPVMAVSSGGLHPGKIPFLIKHLGKDII MQAGGGLHGHSKGTIAGAMAMRQALNAALKDIPLKEYAKNHSELRIALEQWKI-- >CG11_big_fil_rev_8_21_14_0.20_scaffold_64508_2Pacearchaeota KMKESELKYKPRESDLICLFRVEP--GVSIKKAAENIALESSTGTWAEVK-TEKKYMQNL AAKVFSIRGNLVKIAYPSALFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEIIKS FYGPQFGISGIRKMLKVYDRPLLGTIIKPKLGLKTREHAKVAYEAWLEGCDIVKDDENLA SQNFNEFEERLARSLEMANKVEEETSEKKAYLINVTAETKEMMKRAQLVEDLGGKFVMLD VVTEGFGALQSLREADFKLAIHAHRAMHAAFTRNKKHGISMLVLADILRLIGIDSLHIGT IVGKMEGDEKEVEELEEEIETKGRLSQNWGKIKPVMAVSSGGLHPGRVPYLIKHLGKNII IQAGGGIHGHKKGTIAGTRAMRQAIDATMEGTSLKKYAKDHIELKIALEQWKI-- >CG_4_9_14_0.2_um_filter_scaffold_45485_3Pacearchaeota ESEFLNLKYKPRESDLICLFRVEP-AGVSIKKAAENIALESSTGTW----AEVKTEKKNL AAKVFSIRGNLVKIAYPSALFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEIIKS FYGPQFGISGIRKMLKVYDRPLLGTIIKPKLGLKTREHAKVAYEAWLEGCDIVKDDENLA SQNFNEFEERLARSLEMANKVEEETSEKKAYLINVTAETKEMMKRAQLVEDLGGKFVMLD VVTEGFGALQSLREADFKLAIHAHRAMHAAFDRNPEQGISMMVLADFARLIGVDQIHIGT GIG--------------------------------------------------------- -----------K------------------------------------------- >CG23_combo_of_CG06-09_8_20_14_all_150_scaffold_26480_3Pacearchaeota ESEFLNLKYKPRESDLICLFRVEP-AGVSIKKAAENIALESSTGTW----AEVKTEKKNL AAKVFSIRGN-VKIAYPSALFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEIIKS FYGPQFGISGIRKMLKVYDRPLLGTIIKPKLGLKTREHAKVAYEAWLEGCDIVKDDENLA SQNFNEFEERLARSLEMANKVEEETSEKKAYLINVTAETKEMMKRAQLVEDLGGKFVMLD VVTEGFGALQSLREADFKLAIHAHRAMHAAFTRNKKHGISMLVLADILRLIGIDSLHIGT IVGKMEGDEKEVEELEEEIETKGRLSQNWGKIKPVMAVSSGGLHPGRVPYLIKHLGKNII IQAGGGIHGHKKGTIAGTNENTFNEEVMERKGDMIDLTKLGSGIPSEDKFWKKYN >rifcsphigho2_01_scaffold_65352_11Pacearchaeota PSEFVNLKYRPKESDLICLFRIEPNRVS-IKDAAANVALESSVGTW----TEVKSEKEKL AAKVFSIKGNMVKIAYPNELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEILNS FSGPKYGIEGIRKMVNIKNRPLLGTIIKPKLGLVTKDHAYVAYDAWRGGCDIVKDDENLS SQKFNEFEERLARTLESANLAEEATGEKKAYLVNVTAETKEMMKRAQLVEDLGGKFVMID VVTEGFGALQTLREADFKLAIHAHRAMHAAFTRNKKHGISMMVLADLCRLIGMDTLHIGT IVGKMEGDEEEVEEIEEEIETKDRLAQNWNKIKPTLAVSSGGLHPGHVPFLIKSLGKDLV IQMGGGIHGHPRGTLRGAIAARQAIEATMQKKSLKEYSKKHLELRDALELWKN-- >rifcsphigho2_02_scaffold_162351_1Pacearchaeota PSEFVNLKYRPKESDLICLFRIEPNRVS-IKDAAANVALESSVGTWTEV-KSEKEYVRKL AAKVFSIKGNMVKIAYPNELFEKGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEILNS FSGPKYGIEGIRKMVNIKNRPLLGTIIKPKLGLVTKDHAYVAYDAWRGGCDIVKDDENLS SQKFNEFEERLARTLESANLAEEATGEKKAYLVNVTAETKEMMKRAQLVEDLGGKFVMID VVTEGFGALQTLREADFKLAIHAHRAMHAAFTRNKKHGISMMVLADLCRLIGMDTLHIGT IVGKMEGDEEEVEEIEEEIETKDRLAQNWNKIKPTLAVSSGGLHPGHVPFLIKSLGKDLV IQMGGGIHGHPRGTLRGAIAARQAIEATMQKKSLKEYSKKQQNIRNESKQWKNGG >rifcsplowo2_01_scaffold_349272_1Pacearchaeota ------------------------------------------------------------ -----------------------------------------------------PDEILKS FKGPKYGIEGIRKIMKIKDRPLLGTIIKPKLGLRTKDHAYVAYDAWRGGCDVVKDDENLS SQKFNEFEERLARSLEAANLAEEATGEKKAYLVNVTAETKEMMKRTQLVEDLGGKFVMVD IFTAGFAALQSLREADFKLAIHAHRAMHAAFTRNKKHGISMMVLADLARLIGVDTLHIGS IVGKMEGEEIEVEEIDEEIETKDRLSQNWGKIKPVLGVSSGGLHPGHIPFLIKHLGKDIL IQAGGGCLGHRLGTLRGAIALRQAIEATMQNISLKTYAKNHIELKIALEQWKNIQ >RBG_13_Archaea_36_9_RBG_13_scaffold_1343_4Pacearchaeota PKDFLNLKYKPKESDLVCLFRVEPARGVSMKEAVENVALESSTGTW----AEVKTEKQKL AAQAFSIKKNYVKIAYPLPLFEGGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEILNS FSGPKYGIEGIRKMMKVKDRPLLGTIIKPKLGLRTKDHAYAAYDAWRGGCDFVKDDENLS SQKFNEFEERLARSLEAANLAEEATGEKKAYLINVTAETKEMMKRTQLVEELGGKFVMID IVTAGFAALQSLREADFKLAIHAHRAMHAAFTRNKKHGISMMVLADLARLIGVDTLHIGT VVGKLEGTLKEVSEIEEEIETKDRLSQNWGKIKPTMAVSSGGLHPGHVPYLIKHLGKDLV IQMGGGVHGHPKGTLRGAMGARQAIESVMKNIPLKEYAKXXXXXXXXXXLFSRQA >RBG_13_scaffold_1343_4Pacearchaeota PKDFLNLKYKPKESDLVCLFRVEPARGVSMKEAVENVALESSTGTW----AEVKTEKQKL AAQAFSIKKNYVKIAYPLPLFEGGNAPNILSSIAGNIFGMKIVKNLRLEDIKIPKEILNS FSGPKYGIEGIRKMMKVKDRPLLGTIIKPKLGLRTKDHAYAAYDAWRGGCDFVKDDENLS SQKFNEFEERLARSLEAANLAEEATGEKKAYLINVTAETKEMMKRTQLVEELGGKFVMID IVTAGFAALQSLREADFKLAIHAHRAMHAAFTRNKKHGISMMVLADLARLIGVDTLHIGT VVGKLEGTLKEVSEIEEEIETKDRLSQNWGKIKPTMAVSSGGLHPGHVPYLIKHLGKDLV IQMGGGVHGHPKGTLRGAMGARQAIESVMKNIPLKEYAKNHLELKEALKTWK--- >gwc1_scaffold_4706_5Pacearchaeota NLEFVDLKYRPHGKDLICLFKIHPAKGLSIEKASNIVALESSVGTW-----TKVPGQEKL RAKVFSIRKNIVAIAYPQELFEYSNVPNILSSIAGNIMGMNAVESIRLEDVSFPKSILNN FKGPRFGIEGIRKVMNVKHRPLVGTIIKPKLGLNTKHHAQSAYESWVGGCDLVKDDENLS SQRFNEFEERLARTLEKANQSEEETGEKKGYMVNVTAETNEMIKRAQLAQDLGSKYIMID IITAGWAALQTLRNANFKLIIHAHRAMHAAFDRNPEQGISMMVLADFARLIGVDQIHIGT GIGKLEGKIQDIKEIEEEIETKKRLEQNWGKIKSVLAVSSGGLHPGHVPFLIKNLGKDLA IQAGGGIHGHPDGSRAGAIAMRQAVDAVMKHKTLNEYSKFHKELRRALVCWGEK- >rifcsplowo2_01_scaffold_1373_9Pacearchaeota ILNYIVLNYKPK-DDLICLFKIKPARGLSVKQAANTVALESSVGTWTEVK--NENYVRRL RARVFSINGNWIKVAYPEEIFESDNVPNIFSSIAGNIMGMKSVDSIRLEDVSFPKKILRS FDGPRYGISGIRKMMKIKERPLIGTIIKPKLGLFTKDHAISAYESWSGGLDLVKDDENLA SQKFNVFEERIARTLEKVHKAEEETGEKKAYLVNVTAETKEMIKRAQLVEDLGGKYVMLD IITAGWAALQTLRDADFKMAIHAHRAMHAAFDRNPNHGVSMMVIADFARLIGVDQLHIGT GIGKLEGKIEDVEDLLEEIETYKRLNQKWMNIKPVLGVSSGGLHPGHIPFLVKHLGKDIV IQCGGGVHGHPSGTKAGAMAVRQAVDAVMKKQSLKDYAKTHEELKEALDKFGYSK >rifcsplowo2_01_scaffold_159277_2Pacearchaeota ELKFIDLGYKPK-DDLICLFKIKPARGLSVKQAANTVALESSVGTW-----TEVKNENKL RARVFSINGNWIKIAYPEEIFELDNVPNIFSSIAGNIMGMKSVDSIRLEDVSFPKKILKS FDGPRYGINGIRKMMKIKERPLIGTIIKPKLGLFTKDHAQVAYESWSGGCDLIKDDENLS SQKFNTFEERIARTLEKAHKAEEETGEKKAYLVNVTAETKEMIKRAQLVEDLGGKYVMLD IITAGWAALQTLREADFKMAIHAHRAMHAAFDRNPNHGVSMMVIADFARLIGVDQLHIGT GIGKLEGKIEDVEDLLEEIETHERLNQNWMNIKPVLGVSSGGLTPLHFPYLIKHLGKNIV LQCGGGIHGNWLGARMGAVAARQSIDAVMKGISLKEYAKEHEELKSTINQWGIPK >rifcsplowo2_01_scaffold_202_72Pacearchaeota KFEFIDLNYKPK-KDLICLFRVFPARGLSVKEAANIVALESSTGTWTDVP--GKEYVKNL RARVFSINGNWIKVAYPESLFEKDNVPNILSSVAGNIFGMKAVNSIRLEDVSFPKSILKS FFGPRYGIRGIRRMMKIQKRPLVGTIIKPKLGLITKHHAKSAYESWIGGCDIVKDDENLA SQKFNVFEKRLAETLEMADRAESETGEKKAYLVNVTAETKEMVKRTQLVENQGGKYIMID ILTSGWAALQTLCESDFKMAIHGHRAMHSAMTRNPKHGISMMVIADFARLIGVDQIHIGT GIGKLEGTIEEITEIKDEIETKKRLEQKWGSIKPVLAVSSGGLHPGHVPYLMKHLGKDLV IQAGGGIHWNPRGSKYGAMGLRQAVEATMKRISLKKYSKTHKELKEALDKFGYPR >CG10_big_fil_rev_8_21_14_0.10_scaffold_127377_1Pacearchaeota --------------------KVHPAKGLSIEKAANTIALESSVGTWTEIK--AQDYVKKL RAKVFSIKGNLIKIAYPQELFEYNNIPNILSSIAGNIFGMKAIRAMRLEDISFPKNILKS YKGPKYGIPGIRKFLKIKSRPLIGTIIKPKLGLITKHHAQSAYESWVGGCDIVKDDENLS SQKFNKFEERIARTLEKLHKAEGETGEKKGYLVNVTAETKEMMKRAQLVENLGGKYIMVD IITEGWGAVQTLRDGGFKLAIHAHRAMHAAFDRNSDHGISMMVLADFARLVGVDQLHIGT GIGKLEGNIEDIEDLSEEISTHKRLKQGWGKIKSVFPVSSGGLHPRYIPFLIKHLGKDLI IQAGGGIHGHPHGTMAGAAAMRQAVDATLKGKTLKQYSSKHDELKEALKKWN--- >CG23_combo_of_CG06-09_8_20_14_all_150_scaffold_29588_1Pacearchaeota KLDFVDLKYKPR-GDLICLLKIKPNRVS-MQEAANTVALESSVGTW-----TEVPGQKKL KARVFSIKGDFIKIAYPQELFESNNVPNILSSIAGNILGMKAVKTIRLEDISFPKKMINS FSGPKYGINGIRKIMKIQKRPLVGTIIKPKLGLNTKHHAQSAYESWIGGCDIVKDDENLA SQKFNVFEERIARTLEKAHKAEEETGEKKAYLVNVTAETKEMIKRAQTVEDLGGKYIMVD IITEGWGAVQTLREAGFKMAIHAHRAMHAAFDRNPNQGISMMVIADFARLIGVDQIHIGT GIGKLEGDIKDIKCLAEEISTRERLSQRWEKIKPVLPVSSGGLHPGHVPFLMKHLGNDLV IQAGGGIHGHPKGTSAGAKAMRQAVEAVMKHKTLKEYAKTHEELKTALKRWE--- >CG10_big_fil_rev_8_21_14_0.10_scaffold_28047_3Pacearchaeota KFEFVDLKYHPR-EDLICMFKIYPANGFDVTRAANTVALESSVGTW-----TDVPGKDKL KAKVFSIKKNSVKIAYPQELFEPNNVPNILSSIAGNIMGMKAVNAIRLEDVSFPKKILKS FKGPKYGIDGIRKIMKIKNRPLVGTIIKPKLGLKTKDHARSAYESWCGGCDIVKDDENLA SQKFNIFEERIARTLEKANQAESETGEKKAYLVNVTAETKEMLKRAQLVEDLGGKYVMID ILTAGWAALQTLKQANFKMAIHAHRAMHAAFDRNPHHGISMMVLADFARLIGVDQIHIGT GIGKLEGNIEDIEEVAEEILTKNRLKQKWGNIKPVMPVASGGLHPRYIPFLVKHLGKDLI IQAGGGIHGHPHGTRAGATAMRQAVDAVLKRKSIGEYARTHEELREALEKWKR-- >CG09_land_8_20_14_0.10_scaffold_57150_2Pacearchaeota NLPFVNLHYHPQ-DDLICLFKIIPNRIS-IEKAANTVALESSVGTW-----TRIPNTDKL KAKVFSIKEKYVKIAYPKELFEYDNAPNILSSIAGNIFGMKSVKAIRLEDISFPKVILRA FKGPRYGIKGIRKMMKIKSRPLVGTIIKPKLGLKTKDHAQSAYESWIGGCDIVKDDENLS SQKFNEFGLRLAKTLEMADKAQSETGEKKAYLVNVTAETMEMIKRAQLVQDLGGKYVMID ILTSGWAALQTLRQANFKLAIHAHRAMHAAFDRNPK----------------------ET S----------------------------------------------------------- ------------------------------------------------------- >CG10_big_fil_rev_8_21_14_0.10_scaffold_139632_1Pacearchaeota KLEFIDLKYKPK-NDLICLFKIIPNRMS-LKEAANAVALESSTGTWT---DVEKGKLDKL QAKVFSIKDNYVKIAYPQELFEYDNVPNILSSIAGNIFGMKDVRAIRLEDVSFPKSILKS FKGPKYGIDGIRKMMKIPTRPLIGTIIKPKLGLNTEHHAESAYESWIGGCDFVKDDENLA SQKFNLFEKRLAKTLEMADKAEKETGEKKAYLVNVTAETKEMIKRAQLVEKMGGKYVMVD ILTSGWASVQTLREANFKMAIHAHRAMHAALDRNPNHGIAMMVIADFARLIGIDTLHIGT GIGKLEGDIKDIKELSEEIETKNRLSQKWEKIKPVLSVSSGGLHPGHVPFLIKHLGKNLV IQAGGGIHAHPYGARAG------AI------------------------------ >RBG_13_Archaea_33_26_RBG_13_scaffold_69_17Pacearchaeota MTKKLNLKYKPK-NDLICLFKISPN-KISIEKAANTVALESSVGTWTKVA--GQEYVEKL KAKVFSIRGNYIKIAYPEALFEKDNVPNILSSIAGNIFGMKAVKAIRLEDVSFPKSILKS FKGPKYGIDGIRKMMKIKSRPLVGTIIKPKLGLNTEHHAESAYESWLGGCDLVKDDENLA SQKFNEFEKRLAKTLEMADKAESETGEKKAYLVNITAETKEMLKRAQLVEKQGGKFMMVD ILTAGWASVQTLKEANFKMAIHAHRAMHAAFDRNPEHGMDMMVIADFARLIGVDTLHIGT GIGKLEGNVKDIEELQEEIETKERLEQRWANIKSVLGVSSGGLHPRYVPFLIKHLGKDLV IQAGGGIHGHPFGTRAGAIAMRQAVDASLKKISLKEYARTHVELEEALRIWKK-- >RBG_13_scaffold_259_7Pacearchaeota KLDFVNLKYKPK-NDLICLFKISPNKMS-MEKAANTVALESSVGTW-----TKVAGQEKL KAKVFSIKKNYVKIAYPETLFEKDNVPNILSSIAGNIFGMKAVKAIRLEDVSFPKSILKS FKGPKYGISGIRKMMKVKERPLVGTIIKPKLGLNTEHHAKSAYESWSGGCDFVKDDENLA SQKFNEFEKRVAKTLEMADKAESETGEKKAYLVNVTAETKEMLKRAQLVENQGGKYIMMD ILTAGWASVQTLREANFKMAIHAHRAMHAAFDRNPEHGMDMMVIADFARLIGVDTLHIGT GIGKLEGNVRDIEELQEEIETKERLEQRWENIKPVLGVSSGGLHPRYVPFLIKHLGKDLV IQAGGGIHGHPFGTRAGAIAMRQAVDATLKKISLKEYAKTHVELEEALKLWGK-- >tara_MHASMcontig_668995_27Pacearchaeota RLDFINLKYKPR-DDLVCLFKIVPNRIS-LEKAANTVALESSVGTW-----TKIKERDKL KAKVFSIKRNWIKIAYPSELFEKDNIPNILSSIAGNIFGMKAVKSIRLEDISFPTKILGK FKGPRYGIKGVRRLLKVKNRPFIGTIIKPKLGLNPKHHAESAYESWKGGCDIVKDDENLA SQKFNVFEKRLAKTLEMADKTEKETGEKKVYLVNATAETKEMLKRAQLAEKMGSKYIMVD ILTSGWAAIQTLREANFKLAIHAHRAMHAAFDRNPEHGMSMKVIADFARLIGVDQIHIGT GIGKLEGKIKDIREIKEDISTEKMLTQDWRNIKPVMPVSSGGLHPGHVPFLIKHLGKDLV IQAGGGIHGHPFGTQAGAIAMRQAVDAVLKKKSLKEYARTHVELEEALNEWC--- >rifcsphigho2_01_scaffold_82070_3Pacearchaeota KLDFVDLKYKPDNTDLVCLFKIVPNKIS-LEKAANTVALESSVGTWT---NVERGKEDKI KARVFSIKKDWIKIAYPQELFEQDNVPNILSSVAGNIFGMKAVKTIRLEDVRFPKGILKS FKGPKYGIDGVRNLLKIKNRPLVGTIIKPKLGLNTKHHAESAYESWKGGCDIVKDDENLS SQKFNIFENRLAKTLEMADKAESETGEKKGYLVNVTAETKEMMKRAQLTEKMGGKYVMID ILTSGWAAVQTLREANFKLAIHAHRAMHAALDRNSEHGISMIVIADFARLIGVDQIHIGT GIGKLEGDIKNIIQIKEDIVTKKRLQQNWMNIKPVMPVSSGGLHPGHIPFLIKHLGRDLI IQAGGGVHGHPFGTEAGARAMRQAVDAVLKKKSLGEYAKTHIELEEAIKKWN--- >rifcsplowo2_01_scaffold_112877_5Pacearchaeota ----------------------------SIK-----------KDGW-------------- -----------IKIAYPQELFEQDNVPNILSSVAGNIFGMKAVKTIRLEDVRFPKGILKS FKGPKYGIDGVRNLLKIKNRPLVGTIIKPKLGLNTKHHAESAYESWKGGCDIVKDDENLS SQKFNIFENRLAKTLEMADKAESETGEKKGYLVNVTAETKEMMKRAQLTEKMGGKYVMID ILTSGWAAVQTLREANFKLAIHAHRAMHAALDRNSEHGISMIVIADFARLIGVDQIHIGT GIGKLEGDIKNIIQIKEDIVTKKRLQQNWMNIKPVMPVSSGGLHPGYIPFLIKHLGMDLI IQAGGGVHGHPFGTEAGARAMRQAVDAVLKKKSLGEYAKTHVELEEALKKWN--- >CG02_land_8_20_14_3.00_150_scaffold_129462_2Pacearchaeota KKKRLDLKYKPR-NDLICLFRINPARNLSVKECANTVALESSVGTWTDVAKSKLDYVEKL KARVFSIKKNKVKIAYPEELFEKDNVPNILSSIAGNIFGMKAVKTIRLEDVSFPKNILKS FSGPKYGIEGIRKMMKIKSRPLVGTIIKPKLGLNVKHHAQSAYESWKGGCDIVKDDENLS SQKFNEFEKRLSKTLEMADKAEQETGEKKAYLVNVTAETKEMLKRAQLVEKMGGKYIMID ILTAGWAALQTLREANFKTAIHAHRAMHAAFDRNPEHGISMMVLVDFARLIGVDQIHIGT IIGKM-YGKKEEV----------------------------------------------- ------------------------------------------------------- >CG23_combo_of_CG06-09_8_20_14_all_150_scaffold_90018_1Pacearchaeota KGKYKMLKYKPR-NDLICLFRINPARNLSVKECANTVALESSVGTWTDVAKSKLDYVEKL KARVFSIKKNKVKIAYPEELFEKDNVPNILSSIAGNIFGMKAVKTIRLEDVSFPKNILKS FSGPKYGIEGIRKMMKIKSRPLVGTIIKPKLGLNVKHHAQSAYESWKGGCDIVKDDENLS SQKFNEFEKRLSKTLEMADKAEQETGEKKAYLVNVTAETKEMLKRAQLVEKMGGKYIMID ILTAGWAALQTLREANFKTAIHAHRAMHAAFDRNPEHGISMMVLVDFARLIGVDQIHIGT GIGKLEGKIEDIEDLVEEISAHNRLKQRWEKIKSTLPVSSGGLHPGHVPFLIKNLGKNLV IQAGGGIHGHPLGSEAGAKAMRQAVDAVLKKVSLHEYAKTHVELEEALKKWKLPR