Richards et al. (2009) Supplemental Dataset 5A: Figure 3A unmasked (mase) sequence alignment. ;; saved by seaview on Fri Jan 2 02:13:07 2009 ;; saved by seaview on Sun Sep 7 09:16:17 2008 ;; saved by seaview on Tue Jul 15 09:38:42 2008 ;; saved by seaview on Thu Jul 3 15:42:56 2008 ;; saved by seaview on Mon Jun 9 11:01:59 2008 ;; saved by seaview on Mon Jun 9 10:57:32 2008 ;; saved by seaview on Mon Jun 2 20:09:19 2008 ;;# of segments=17 subset_mask ;; 431,434 436,443 577,595 606,613 617,648 658,671 689,702 708,722 750,795 ;; 802,820 822,826 833,835 838,845 850,874 882,900 917,923 927,942 ;no comment NFIA061740 ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -----------------------------------------------------------M DSPPID---WEQAL-QEGRSLHP----------LHRCRYPIVSS-----------FRTAK LYWVAIDSSIVHTQGE------YMQWIKKLAPDAN-------------IGQSSVLFPVHE LQLAK---VISEFPD----------------ARVLPGTIDNARPQISLRTLTLPNT---- -----TEVCVTLY---LKVQNLVRTMSEWSIMMGQQLRQVLPVIEAAS---AEFGGSLMM AREYAAAASP------------------SQHLGCIIRESSES--QAARTGDRIIVCATLS EN--------------VDVIWGD------RPDKKAILREYASKFLRAVLPPLFKYGLGLA VHRQNALLRLDPLSK----EIKGFVIRDLGFMRVHRPTFRKGTCFDIE----TPK--TSI YTETL----EEVYQYMFNPAIIGHLSSFVRALHP-------GLAGWKVIREELERIIPDN --------------DSIARSVWL--QSPEFPTKANITMEVHGYSGL-------------- --------------HPFVNVS-NPLYYCQLNEEPDHSVS------- ;no comment Semo407912 --------------------MEVSKFQTEAKNAVLARLLACLVNEKFVKAYLVL------ ----HAESLP-------------------------------------------------- ---------------------------------ARLAAPVPPKHRKHWMCISAASEAFEL LKHGSF------------------------------------------------------ -------------------------LVPLRELPPVQKPPQLGGEPAEILPVTLLDPDEIG FPVQQLQVDRPVEYVPA-------------DVRTVTTQLQEWN----------------- ---------------------PALSAALVEQITQEMENSVSHQAAAYSHHQGRAPPELEH TTDSTQ---WEQLI-VEGHATHP----------MHKSRFHVPPTLPIDPWNN-D-LSAPK IHFFAVPRSKLTVRGD------FEKLVAPLVQISS------TRANVFMDSCDELVMPVHH VQVPS---VLLHFPH-----------------ARHLPFTCKARAQASVRTVVVDEL--PA -----FNLKLPIG---VKISSATRTITPWTTHIGPAFKPVADKIIEDN---DL-LE-VAH ELASVVSNHPD-PD------V-------AKNLSCVIRQDPQHLATTRARGERVVLSAALT ER--YPSRSDDFVVNTL---FHLDS----LEKKELFFQNYARLFLRAFLPPLLKHGVSFE AHLQNVLARFRPAAVAGEWELVGFTVRDLGGIKAHQQTLFETTGLELDV---WDAENCSQ LAETI----DEVHNLAFHTIIQCHLHRLARALSVHY-----NAKGWEFVRAEVDAMLPPG ---------------CHARSLWL---GATVNWKAFITMKLQGLYRD-------------- --------------YLYIKVP-NLITYRPS---------------- ;no comment CC1G_07068 ---------------MIPRIYELRTGGAFENLDQFSRDVFSWLQDFL------------- ------------------------------------------------------------ ---------------------------------PQPGTSNSLGEIV-------------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------DPLYLWDKFAAIF----------------- ----------------------KLDKPIKESIAKELTSSFHYQYLAYIN----PPKCPSL KSPPIQ---WEQSL-VAGHPTHP----------MHRAR--LLPGSDIGYD-----WYRPC IRFVQVPRASLEIRGS------FMETSKTLVSIAA------GTHAHELDHERYVFMPAHE LQLSN---IVRLFPD---------------AVVLPPHVHLKAQAQTSIRHVAFLS-H--- Q----EICIDPVTQNRVKISSALRTISHFTADFGPRFSTEVVPKLRIDH--TI-FA-IET EPSSGIYRCED-PE------I-------RKHFTAVIREEYKA-----APYENVILIAALL ET---DHANLPPGVPAVTHVFGLNT----ERKRIQFLDRYIELSCKALLPALVYNGVAFE AHAQNLLLRVDRRTG----KITG------------------SLGIDFDF---LPK--HAI ATESL----QEVYPKFYHTYVHNHLQRLIRLLGLHH-----NGIGWELLREHMNAVIPID ---------------HELRDLWLSPKSHTVESKCLMRMRLQDSYRE-------------- --------------NVYHPFP-NLIQFRPEKIDRRRESHL------ ;no comment Labi297159 -----------------MSSGSGLPPRQHAEFAVMSRLISCLVTEKLLPAFYIPTKSSYP ASGIMVVISP-------------------------------------------------- ---------------------------------RKLLPSDPLNPNDIFVVVPLHHAPVLL KDDC-------------------------------------------------------- -------------------------LHTYGRRVGLVDPLDMLPVIYEPTERSPIEDINYL FP--VKTLACLTSHISNIDCSPTLNIAS--DPIALWRRCVEHI----------------- ----------------------AVEESLRQSIEKELQSSVEWQVLAYQN----PPICPSL QSPSIQ---WEQSL-LAGHPTHPVCCLLVEHLRGPDSFQKMHRARMVNNGPPDFDWYHPL IRFVRVPRSSVLLNGY------MEDISLSLAQKAA-----DYSGRCLPPTNASVFLPVHE LQVDN---ILAKFPD----------------VEVLQDICIPGLAQSSIRTVIVRH-L--P G----MALKLSVG---VKISSALRTISHFTADFGPRFSSQIVPKLSMNR--DI-LS-VEL EPASAIYHTPD-PD------I-------AKHLTVVIREEYQP-----AVGEVVIVCAALL ET---DHSGIPASMSAVQHVLNLRT----EQSCAKFLDRYIRIACEALLPALLYNGVAFE AHAQNLLARFDKKTG----ELLGFVIRDLGGLRIHPPTLRMSTGVDFQF---LPG--HCI ATETL----DEIYPKFYHTFVHNHIQRLIRVLGLHY-----SGMGWQILREHMESVIPDH ---------------HRLRKLWLHSDSKFVSSKCLLRMRIRDSYRDVRNLHS-------- --LLSTDFSNVSTQMVYSPYP-NLIQYDPVWNELKD---------- ;no comment Phybl77600 ----------------MPVSLTMSKNLQYAKFATTSRLISCLVMETLVSAYYV------- ----PANTPQ-------------------------------------------------- ---------------------------------GTPICLLVRPNCSSTQETPSTIRLSDV LAVI---------------------PLRGVPILSTETG---------------------- ---SCAVL-----------------NDIKCPRIDLIDPMDMLSHIYSIQGQEPSTLTISD DNLRNQVYDLLVSVCKPNIQLTAKKLIDAFDAVQLWKQFADDF----------------- ---------------------G-VDPKLTEEIGLELNSSIEHQNYAYDN----PKSLPTL KSSSIH---WEQSV-LEGHATHP----------MHKARKSYPPMPPLIPGKV-N-LEQPQ LRLVAVPVSSLKIRGD------FEKLSAPIVNAILSKSEGKSADEMRAAYPNHIFIPIHE LQVPN---VESKFPE---------------ATVLPKENSVTIQSLTSLRSVAVPDIL--P G----LSLKLCLG---IKISSALRTITPFSTHFGPGFSYDVVPKLTYDP--NV-LT-VER ELSSAVHVNAD-FD------I-------AKHCSCVLREAVEFPVDAEQCPDKVVVCAALV EKIQRPDTDETLLTHV----WKLDS----EAKRIAFLDRYIDLALQAFLPPCLINGVAFE AHGQNTLARFDKETG----ELKGFVARDFGGVKVHRETLRKSAGVDIDV---LPD--SCV VADTL----EEVYKLLYHTLIHSHLQRLIRVLDLHY-----NGVGWELLRKHMSQMIPRD ---------------HPMWKVFM--ETPKVPGKCLVRMKIEELYRD-------------- --------------YIYCPVP-NVIHYIPQNVEEIAVATA------ ;no comment RO3G_06864 ----------------MPVASSEYQNEHYASFATTSRLVTCLVSETLVPVFFVPVKSV-- ----DRNNQF-------------------------------------------------- ---------------------------------IGLCLLLRPTTVKQESELPTNITASDI LTVVPLRGLPILNNERV------------------------------------------- -------------------------ALFNGIRCPQIDLVDFLDMLPHIYSVESSGSLKSG DSLKQKTFDTLSAILDGNKTFDLVDGY---SAVQLWNHFAQDL----------------- ----------------------EINSKLREQIGQELGSSILFQKYTYDN----PKPLPTL NSSTIK---WEQSV-VEGHATHP----------MHKARKSFPPMPPLNPGSY-D-LDHPA VRLVGIPRENAILRGE------YEELSAPLVNALM--DAGGNHKDIRAQYQNYVFIAIHE LQLPN---IQEKFKD---------------AVIFSKEHQLNVEALASLRSVARPDIL--P G----LSVKLCLG---IKISSALRTVTPFTTYFGPGFSFNVVPKLTYDH--EV-LA-IER ELGTITYRHED-SD------V-------AKHCSSVIREALEY--DPKYQDDLFIPCGALV EKIQRPDTDETLVAHV----WNLDT----KEKRVEFLDRYVDFALRSFLPPCLINGVAFE AHGQNTLARFDRKTG----LLKGFVIRDFGGVKAHNETLKKSAGVELDI---LPD--SCV EAHSL----EEVFKLLYHTLFHCQLQRLIRVLDLHY-----SGEGWEIVRKYLTQYVPKD ---------------HVMWPMFM--ESSKVPGKCLVRMKIDELYRD-------------- --------------YIYRPVP-NMIKYEPQSVPEAI---------- ;no comment CC1G_07067 ---------MAKKTGLDYAALSSLDPKDRALFATSARLLSCLVTESMVDA---------- ----FFYSHP-------------------------------------------------- ---------------------------------TGPAAGFATIQLTGDSADISNILAVFP LHHEPV------------------------------------------------------ -------------------------LAPGKVDRYGARRISLLDPLDMLPSILTVQTSAVP DRAEVSPPTPVLRGLPKKNFFLAHDL----DPIRLWHRFALSV----------------- ----------------------NLDKSLVEDIELEFRSSVTWQEHAFNY----PPPSPSF DAPSVA---WEQSI-VQGHPTHP----------MHKTRRFLPPIPSFQPGQC-D-LLHPI LRFISVPRENLKITYN------FEELIAPLVKVAE-----KKASKPIISPKDHVIIPVHE LQVYH---IRDKFPE---------------ARIYPPEYSLPLEAQQSLRSVVVPNGY--R N----LHLKLAVG---MKLTSAVRTISPESAYLGPRFSAQVVPVIRMDR--NI-VT-VAK ELASVVHTHPD-GD------I-------AKHCAALVREYHEG--LCEDRQERMIVCTSLV EYGHGSRDGNIPPVIRL---FGLDT----EEKRIEWLSKFLDIFFRAFLPSVLHNGVAFE CHPQNCVARFDARTK----ELKGFIIRDFGGIRVHPPTLKATTGVDIDF---VEG--HSI IAPDL----DDVYTRMYHTVFHNHLQQLIRVLDLHY-----NGRGWELVRTHLKANIPAD ---------------HPLYTAWLSPERVTLPGKCFMRMRLSGMYR--------------- --------------HVSPEAL-WLI--------------------- ;no comment Labi297160 -----------------MSTPATPFPLERALFATTARLISCLVTESLTRAYYL------- ----PLHLAN-------------------------------------------------- ---------------------------------APIGVAVILAGNVHPDKTAFDAGDVIA VVAL-------------RHPPVFRGEVKSSRGRAIGLL---------------------- ---DPQDLHH---------------LVFLTSKTVIEGISEHEELCSSIAESLASAGWTIT EP--FQLHLCT-------------------DPISLWNTFATGI----------------- ----------------------NLDQGLLVDIASELSSAVKWQTHSYQH----PPIAPLF TSASID---WEQSI-VEGHPTHP----------MHKTRRFLPPLPDFLPGSY-D-LYHPK LRLISVPRENLKVTYN------FEALSQPILDAVT-----KSSGQHFSAPDGHVVVPVHE LQLLH---IEAKFPD---------------AIIYPAQFSLPLLAQQSLRSVLVPDAY--R D----LHLKLGVG---IKLTSAVRTISPESAYLGPRFSAKVVPVLALDP--NV-VT-VAK ELASVVHGHAD-GE------I-------AKHCSAIIRESPES--TSEERQERLIVCTALV ESGHAGIDGHLPSVIRV---FGLDN----DDKRAFWFEKFVQVFFQAFLPPMLQNGVAFE CHPQNCVARFDIHTK----ELKGFIIRDFGGLRVHRETLLATTGVELDF---LDG--HSI IAADL----DDVYTRMYHTVFHNHLQQLIRVLGLHY-----NGRGWKIVRDQLGRSIPQD ---------------HPLHKAWLCPDRTSLPGKCFLRMRMAGMYRF-------------- --------------HLHAPFP-NLIHYQGAEE-------------- ;no comment Mycgr14198 ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -----------------------------------------WLQVAATK------SRPTL DSRMIN---WENAV-FRGHHLHP----------VGRHCSP-------------------- ----------------------------PDLPRLL---------------PNEVVLPCLS QQLPA---IQRHFPS----------------TRVLLHDAFTAHAQASLRTVNIPSEM-R- FA---YNMKFALS---CTISSVLRTITPWTTCLGPEISAVIEDAVTEN------TW-VCK EVAAITGSQKD-FS------A-------AKNLSCILREDLEP--RALALGQTLIVVAALA EK---PVGSSECLAALT---FGLRS----SGQKKKWLRDYASKLIHAVLTPALESGVCLE AHGQNSLVRVDKRTK----AIVGFCFRDFGSVKCHTPTLR-NRGHQLLTV--LPA--CWI ETDVE----EEGWDTLQHTMIHNHLQLLIRGLNLH------PIEAWPVIRRQLDNFFEQH ---------TESESARRMHAYLT---RPIVRYKALLRMRMSA------------------ ---------------------------------------------- ;no comment CHG05341 ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -----------------------MARTIRGELENSADNQEQWLKFERSQ------PPPQL GSPLIV---WEQRL-FKGHPTHP------------------------------Y-LGTDA IRVG-------------------------------------------------------- ------------------------------------SSELYGLHQASMRTISIPDFP--- -----FHVKMALS---FTITSARRTMTPWTTRMCIEVSRLLQDIADPEL-----LW-IAL KRAAACSANRD-FE------A-------AKHLSVMLREDPEP--RARELGQCLVLPAALF EL--RSEDRVAPVVEL----FNLEPT---LEARKNWFRNYARLYLQAVLPPLLSYGICLE AHLQNVLARFDVETG----ALRGFVYRDMGGLRMHLPTMK-ARGIGIKSADLVPG--AVT LTDDL----EAVWINAYHNLVVNHMGGALRGMGLQW------DGGWDILKEELKQALEAS --------PDPDNKTAELLEFMV---RPTMKRKAFLRMQIAGIYRDSLRPPSKSFIHPSK GIGGLRELVGQRNLYMYSRLLAAQMLSAP----------------- ;no comment FGSG_11242 --------------------MEPTNYASLAKGEATKRLISQLVN---------------- ------------------------------------------------------------ ---------------------------------ERLVVISLPDGIDQPCAHISGPDTTAK WITL-------------------------------------------------------- -------------------------PVMEGLNPSTHFRPNDFGTPVKL--CSDDGEIIEE DP--GSIF----------------------TFTASWFACD-------------------- ---------------------EKTKLSIVDELRNSASMLEKWMELGSKA------PILDI NSSFID---WERSL-IAGHPTHP-----------------------------------PL LKLIGVPPPSTH--------------------------------------DDYTVVPCLS RHLPA---LLHFFPE----------------ATLVKSVDGRALAQASIRTVSVPSFD--- -----HDLKLSLA---CLITSGLRVLPCYSAEAAPAMTRLLKGLIPQD------LW-LCG EVAAVTGSQPNTEE--------------AQYITCILRENLEL--RAEENNESIILVAALL ER---PRCGTKTYAEIL---FNLET----TEDKVVWLKSYLRKLLSVALDPVTRHGVAFE FHAQNAVVRVCRRTK----EIKGFAIRDLGGVRMHRPTLE-SQGFDL-----KGM--DES LTDDV----HQVWDRTHHVLIQNHIGYLIYSLRLER-----EYGGWQLVFSELERALEG- ---------DGDSTSQKIFHYLV---KSTMPFKAFLRMRMESAMS--------------- ------------------VVN-DPPHG------------------- ;no comment Asni140614 ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ --------------------------------------IAKWLEIASTQ------PTLHL NSPLYE---WEQSV-VLGHPTHP-----------------------------------PL LESLGIPSP---------------------------------------ESSDRVVVPCFT RQLPS---ILPLFPD----------------ARLLGSVRKCCRAQISMRTISFLPDVG-- SL---LHLKLSLN---CQITSGPRTITPWTAALSPALSTALKNLLPPD------LW-IFE DAAAITGGQDD-FD------K-------ARHLTCIIRKSPEK--QAEELGETIIPVAGLF QK---PYKDDRTYMEIM---FGLDD----SKQKQAWLRKYLAKLFSLLLPPLVRHGIGLE SHAQNVLVRVNTTSK----EITGFVVRDFGGMKIHSPTFS-RTRIDLSSI--PPG--ASA FVDDI----HKVWHKVYHALIQMHVGHLLYMLDLE------SHGGWPIVREELERVLDPL ----------GDPDGRAVHEAFT---NKTMAFKCFMEMRLRN------------------ ---------------------------------------------- ;no comment Asni193539 ---------------MRADAYVETDGGRVVGFVRADDLLGPVLTQNLNGEIS-------- ------------------------------------------------------------ ---------------------------------EELDPGVICGVICRWRSRQDE------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ---------------------SDAVEILVKEVRNSANNQEKWLEMASTL------PILHL KSSLCD---WEQSV-VLGHPTHP------------------------------S-LLDPL LRSLGIPKPD--------------------------------------DTVNEIVVPCFS RQLPC---ITPLFPN----------------ARILGTVQNRCRAQLSMRTVSMTPDVGS- SP---LHIKLSLN---CQITSRLRTISPNAAALATAVGNILRDLLPPD------LW-IFE EAASITGAQED-FQ------K-------ACQLTCIIRKSPEL--RAADIGETLIPVAGLM QK---PLNDYRTYMEIM---FGLHS----LKEKQTWFRKYLIKLFSLILPPLVRHGIRIE AHSQNVLVRVNITNK----EIAGFAIRDFDGIRIHYPTFLRHPDNENALNDIPPG--ART LTDDL----HRMWHAVHHSLLQAHVGHLLYMLGLE------SKGGWRVVREELERALNPS ----------RDPDGRLLYDFWL---KETMLSRCFLEMRLRDAYAKVGLG---------- --------------FLFCVVIFRRSLSVG----------------- ;no comment AFL04155 MAWEAKDISLDHALGGGTPYLLILQGPQLAEIDAALKHFQCQQYDSSHMSIKADCSTSGS AQPMEALNPSTFPLPGLSLILRSVSSNLHSGYGFTLIRGVPVERYTREENMIIYVGISSH IGAMRGRQDHQFKGQPADVMLAHITDMRRPDGEQNYALAAYTDSEVVFHTDVGDIVSLFV LSEPANGGESLLASGWTVYNALAENRPDLVRVLAEDWPIPSAQEPGLIKYRPLLFYQPAS GSTPERVILQFSRRSFSGFGAHSQLNRLSPAQVEALDALHFLAEKFHVAMDLKRGDMQFI NN--LSMIHARNSYVDGPGSRWVKVGVTPGTVMEMKEDRVVSVIRPESLQPPVIVGETGN QELDPGAIFAVLSALLKDVADGTVLEAIVRELRNSATNQEKWLEISQDQ------PVFGL GDTSAK---WERAL-INGHPSHP----------YHRLCYAQEPLKPVGPGDIPG-MLTPT LAFVSVPCDNLRITGH------FERELQPLLKRLD----------IPQTTSDRVIVPCLA QQLPS---ILQRFPD----------------VVILKLAADCADAQASMRTITIRPELG-- FK---YHLKLSLA---CHITGALRTITPWTACGGPVQTELLEKFLPDD------LW-VFR EVAAVSGSQKDFNE--------------ARHLACILRDTLES--RAQANDEVLIMAMALT QK---PYGDSRTYAEIL---YNLET----VVQKKEWFQRYITVLFSLVLPPLVQYGIGLE GHGQNLVVRVCRQTG----QIKGFAVRDFGGVRMHVPTLN-NHGVKFDSL--PPG--GAT LTDNL----DNVWSKVHHSLLQNHVGFLLVALGLE------SHGGWAITLETLSTVLGGG ----------QDSPGAKLLEYFT---KDTMPFKCFLRMRMESKYRD-------------- --------------YIEREVP-NVILMDSPRWKSIIETYQPSLHAT ;no comment ZP019051Pp --------------MRALPTAAEVDHQAEARAARWIRRVLAFARGPLAELAPASELEPAA LARCWRGAALEIG----------------------------------------------- ---------------------------------ARVVLSSVREGLVPGEREPASVRWHDP ALGAPLV----------------------------------------------------- -------------------------LPLRRWGAFGLDLPELSGAL----------DPRLL DPAGLIALPCARADCGPG------------DLARVRAELDDSV----------------- -----------------------------DKLAWARLAQRLRPALAQRS---------PE DPRVRD---PEHLV-TDGHPWHP----------MTRTRVGLGAAANLRHAPE-L-LGRAW ISTVEVAASEVQRAGT------WDALAVRCFGAPR--------------DPGWVRIPVHP TQRRL---LPRLFPELVARGALRTADGRPFRAPPDPSQALAVRSLLSLRTVTLEDRA--L P----FHLKLATN---IHTTSAKRVVSAMSVTNGPRVSALLERIQAQDPKTQS-LE-IMV EPAAAGLDPAR-HS------R-------ASSLGAILRRAPS-----RSDGAQAWVCAAVA EP--WPLDPSQRVLERLASGYPGDA----KARLRALLGDWIALLVPPCLRLLTVHGVALE VHLQNTLARVAQGRL------VGFAVRDLGGIRIHTPRLT-RAGHTLSL---APE--SFI WTDDL----PEVRGKLEHTLFHAHLTHVFAVAEALSVP---AGESWARTRTVVEGCLTRW ----------GGERAREDLEAML---APTVRAKALLSMRLRERSSD-------------- --------------YDYTRVD-NPLSA-------------------