Supp. Mat. File 2. Alignments of misidentified conotoxin precursors clustered by the source of error

Abalde, S., Tenorio, M. J., Afonso, C. M. L., and Zardoya, R. (2020) Comparative transcriptomics of the venoms of continental and insular radiations of West African cones. Proceedings of the Royal Society B. https://doi.org/10.1098/rspb.2020.0794

##########################
##    1) Wrong frame    ##
##########################

[Superfamily R of Lavergne et al. 2013]
A_0855_174 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTASVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0885_178 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTASVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0875_152 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTASVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0055_229 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTASVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0039_212 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
V_1302_232 MRASTWLSGRMVITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRVVSPSLYSCFSRDTALALLLPMHVAFQPP
V_1278_129 MRASTWLSGRMVITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
V_1258_186 MRASTWLSGRMVITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRVVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0239_030 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0031_005 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0025_042 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0520_027 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTASVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_0048_016 MRASTWLSGRMDITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTASVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
A_1387_011 MRASTWLSGRMDITVLPFLRVSVAISTLSGVSLVRSRLLLSTLTARVRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
K_0010_076 MRASTWLSGRIVITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARARASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
ermineusAXL95348 MRASTWLSGRMVITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARDRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP
lenavatiJAI17797
MRASTWLSGRMVITVLPSLRVSVAISTLSGVSLVRSRLLLNTLTARDRASSRLVSPSLYSCFSRDTALALLLPMHVAFQPP marmoreusBAO02223 MRASTWLSGRMVITVLPSLRVSVAISTLSGVSLVRSRLLLSTLTARARASSRLVSPS------------------------ [real: [Proteasome subunit alpha type-4] A_0025_042 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0031_005 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0039_212 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0048_016 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0055_229 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0239_030 
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0855_174 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0520_027 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0875_152 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_0885_178 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMSILPDNQVEAL------------------------------- A_1387_011 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKNGKTVMSILPDNQVEAL------------------------------- V_1258_186 
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETT-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- V_1278_129 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- V_1302_232 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETT-LDEALTLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- K_0010_976 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALALAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- magusQFQ61176 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALALAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- marmoreusBAO02223 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------RSVPAEAGVQEGETS-LDEALALAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- lenavatiJAI17797 
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALSLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- ermineusAXL95348 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------YGGWKATCIGNNSANAVSLLKQEYKEGETS-LDEALSLAVKVLSKSLDLTKLTPDKVEMATLTRKDGKTVMTILPDNQVEAL------------------------------- LottiaXP_009059710 -----------------MARRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILANDGVLLAAERRNTNKLLDEVSFSEKIYKLYDDMACSVAGITADANVLTHDLRLIAQSFVMSYLYQEPIPCEQIVSSLCDIKQAYTQFGGKRPFGVSLLYMGWDRHYGFQLYQSDPSGNYGGWKATCIGNNSANAVSMLKQEYKEGETN-LQEALALSIKVLSKTLDMTKLTPDKVELATLTRENGKTKIRVLGFPEVDKLITVYNEEQAKIEAEKKKEAEKKKASS----- AplysiaXP_005092365 -----------------MARRYDTRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILAKDGILLAAERRNTNKLLDEVSYSEKIYRLYDDMACSVAGITADANVLTNELRLIAQR--HILQYQEAIPCEQVVSALCDLKQAYTQYGGKRPFGVSILYMGWDKHYGYQLYQSDPSGNYGGWKATCIGNNSANAISLLKQEYKSGDIN-LSASLDLAIKVLSKTLDMTKVTADKVEIATLTRKDNKTVIRILSDSEVEVHIKNFEEEEAKLEAEKKKEADKKKASEKPSDK BiomphalariaXP_013065013 -----------------------------------------MEAIGHAGTCLGIMAQDGILLAAEKRNTNKLLDEVSYSEKIYRLYDDMAISVAGITADANVLTNELRLIAQR--YILQYQEPIPCEQLVSTLCDIKQAYTQFGGKRPFGVSILYMGWDKHYGFQLYQSDPSGNYGGWKATCIGNNSANAISLLKQEYKDEDNFSLSAALDLSIKVLSKTLDMTKVTADKVEIATLTRKDNKTVMRILPDNEVEVHIKKYEEEEARLEAEKKKEAEKKKA-EK-SDK ElysiaRUS72144 MLADVLTKYICEATIVNQARRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGIMASDGILLATEKRNTNKLLDEVSYSEKIYRLYDDMACSVAGITADANVLTNELRLIAQR--YILQYQESIPCEQLVSTLCDVKQAYTQFGGKRPFGVSILYMGWDKHYGFQLYQSDPSGNYGGWKATCIGNNSANAISLLKQEYKDDNTINLSAALDLSIKVLAKTLDMTKLTPDK--------------------------------------------------------- ************************************************************************************************************************************************* [Formerly conotoxin] 
[Framework XXII] [ /----Signal----/---CC-----------C-------------C------C-C-------------C--------------C----------------C----] A_0875_147 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT A_0039_214 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT A_0055_259 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT V_1278_110 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMSLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT V_1258_171 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMSLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT V_1302_224 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMSLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT V_CG13_215 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMSLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT A_0885_182 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDGRRSWCPRSGDSPSGHVRTTSACVLRAT A_0855_173 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT A_0031_033 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT A_0025_010 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDAKRSWCPRSGDSPSGHVRTTSACVLRAT A_0239_021 MLSTCVSAYTLSTSSASTRCCPVLELTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT A_1387_362 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVPRAT A_0520_023 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVPRAT A_0048_318 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVPRAT K_0010_065 
MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGSPSCRCVPVSSMRLPSSKLCVVQSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT betulinusALM87514 MLSTCVSAYTLSTSSASTRCCPVLGLTGSRLVCVVPLESHRALWPGCTLGNPSCLCVPVSSMRLPSSKLCVVPSSSTQDARRSWCPRSGDSPSGHVRTTSACVLRAT [real: 60S ribosomal protein L10] A_0025_010 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0031_033 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0039_214 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0048_318 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0055_259 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0239_021 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0520_023 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0855_173 
-----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0875_147 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_0885_182 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- A_1387_362 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- K_0010_065 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWPREDYERMRAEGYL---------------------------- V_1258_171 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHESSIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- V_1278_110 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHESSIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- V_1302_224 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHESSIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- V_CG13_215 
-----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHESSIIEALRRAKFKYPGRQKIVVSKKWGFTKWAREDYERMRAEGYL---------------------------- betulinusALM87514.3 -----------------------------------------------------------------------------------AFHLRVRLHPFHIIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEASIIEALRRAKFKYPGRQKIVVSKKWGFTKWPREDYERMRAEGYL---------------------------- AplysiaXP_005108638 MGRRPARCYRYCKNKPYPKSRFCRGVPDPKIRIFDLGRKRARVDEFPLCIHMISDEYEQLSSEALEAGRICANKYLVKNCGKDAFHLRVRLHPFHVNRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHEPQVVEALRRAKFKYPGRQKIVISKKWGFTKWPREDYEKMRADGILVPDGVNCQYKPNHGPLDAWRSRQVARS- PomaceaXP_025085190 MGRRPARCYRYCKNKPYPKSRFCRGVPDAKIRIFDLGRKKARVDEFPLCVHLISDEYEQLSSEALEAGRICANKYLVKNCGKDAFHLRVRLHPFHVNRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRAREQHESAIIEALRRAKFKYPGRQKIVVSKKWGFTKWPKENYERMRQEGKLVQDGANVQYRPDHGPLSAWKQAQTLSRQ PhysellaAXN72710 MGRRPARCYRYCKNKPYPKSRFCRGVPDPKIRIFDLGRKKARVDEFPLCVHMISDEYEQLSSEALEAGRICANKYLVKNCGKDAFHLRVRLHPFHINRINKMLSCAGADRLQTGMRGAFGKPQGTVARVNIGQPIMSVRAREADETKVIEALRRAKFKYPGRQKIVVSKKWGFTKWPRTDYERMRAEGKLVPDGVGVQYKPNHGPLIDWKDRQTA-S- BiomphalariaXP_013094488 ------FSYRYCKNKPYPKSRFCRGVPDPKIRIFDLGRKKARVDEFPLCVHMISDEYEQLSSEALEAGRICANKYLVKNCGKDAFHLRVRLHPFHINRINKMLSCAGADRLQTGMRGAFGKPQGTVARVNIGQPIMSVRARENHEQQVIEALRRAKFKYPGRQKIVVSKKWGFTKFAKTDYEKMRAEGFLVPDGVGVQYKPNHGPLDAWKGRQTA-S- CrassostreaACO07302 MGRRPARCYRYCMNKPYPKSRFCRGVPDPKIRIFDLGRKKARVDEFSLCVHLVSDEYEQLSSEALEAGRICANKYLVKNCGKDAFHMRIRVHPFHVIRINKMLSCAGADRLQTGMRGAFGKPQGTVARVHIGQPIMSVRARENHQASVIEALRRAKFKFPGRQKIHISKKWGFTKWEKPQYEEMRADGRLIPDGVTVQYKPNKGPLKAWMDRQRV--- ************************************************************************************************************************************************* [Formerly conotoxin] A_0885_173 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0875_150 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA 
A_0039_309 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0055_334 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA V_CG13_211 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRIPPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA V_1302_225 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRIPPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA V_1258_173 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRIPPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA V_1278_181 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRIPPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0855_234 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA K_0010_051 MESTSSSSSVVFAVWLRNVWAVVVVVCVCSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTNTARCVA A_1387_347 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0520_329 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0239_001 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0048_325 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0031_443 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA A_0025_032 MESTSSSSSVVFAVWLRNVWAVVVVVCVFSVLTGCVRILPTSSSRSFWWIHSTRPSDVTPRPTGSATLSTSTARCVA betulinusALM87520 MESTSSSSSVVFAVWLRNVWAVVVVVCVCSTLTGCVRILPTSSSRSFWWIHSTKPSDVTPRPTGSATLSTSTARCVA [real: 60S ribosomal protein L15-like] A_0025_032 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0031_443 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0039_309 
---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0048_325 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0055_334 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0239_001 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0520_329 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0855_234 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0875_150 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_0885_173 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- A_1387_347 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- K_0010_051 
---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- V_1258_173 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- V_1278_181 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- V_1302_225 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- V_CG13_211 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLSSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- betulinusALM87520 ---------------------------------------------------------------------------------------GVNQLKFQRSLRSVAEERVGRRCGGLRVLNSYWVCEDSTYKFFEVILVDPFHKAIRRDPKANWICNPVHKHREMRGL---------------------------------------- PomaceaXP_025088085 MGAYKYMQELWRKKQSDVMRFLLRVRCWQYRQLSAVHRAPRPTRPDKARRLGYRAKQGYVIYRVRVRRGGRKRPVPKGCTYGKPVTHGVNQLKFARSLRSVAEERVGRRCGGLRVLNSYWVCEDSTYKFFEVILVDPFHKTIRRDPQANWICKPVHKHREMRGLTSAGKSSRGLGKGHLFNKTKGGSRRAYWKKQNSLSLRRKR BiomphalariaXP_013064049 MGAYKYIQELYRKKQSDIMRFLIRVRCWQYRQLSSIHRASRPTRPDKARRLGYRAKQGYVVYRVRIKRGGRKRPVPKGCTYGKPKTHGVNQLKFQRNQRSVAEERVGRKCGGLRVLNSYWVGQDSTYKFFEVILVDPFHKTIRRDPAIQWICNPVHKHRELRGLTSAGKSSRGLGKGHLFNKTIGGSRRRNWRKHNTVSMRRKR AplysiaXP_005105827 MGAYKYMQEIYRKKQSDIMRFLIRVRCWQFRQLSAIHRASRPMRPDKARRLGYRAKQGYVIYRVRIKRGGRKRPVPKGCTYGKPKTHGVNQLKFQRSHRNVAEERVGRKCGALRVLNSYWVAQDSTYKFFEIILVDPFHKTIRRDPAIQWICRSVHKHRELRGLTSAGKSSRGLGKGHKFNKTTGGSRRKNWKKNNAVSLRRKR LottiaXP_009052062 
MGAYKYMQEIYRKKQSDVMRFLLRVRCWQYRQLNPVHRCPRPTRPEKARQLGYKATQGFVIYRVRVRRGSRKKPVRKGITYGKPKRHCVVRIKFARNLRSVAEEKVGRRCGNLRVLNSYWVCQDSVFKFYEVILVDPFHKSIRRNPRIQWICNPVMKHRELRGLTNAGKNSRGLGKGHKFNKTTGGSRWANWKKHNFEQMHRKR HaliotisABU43069 -----------------------------YRQLSAIHRAPRPTRPDKARRLGYRAKQGYVIYRVRVRRGGRKRPVPKGCTYGKPVTHGVNQLKFARSLRSVAEERVGRKCGGLRVLNSYWVAEDSTYKFFEVIMVDPANKTIRRDPESNWLCRAVHKHRELRGLTSAGRSSRGLGKGHKFTKTNGGSRRAYMIKHNSLSLRRKR ElysiaRUS68828 --------------------------------------------VLLPSHLDIIFLSGYVIYRVRVKRGGRKRPVPKGATYGKPKTHGVNQLKFQRSHRSIAEERVGRKCGGLRVLNSYWVGQDSTYKFFEIILVDPFHKTIRRDPAIQWICKAVHKHRELRGLTSAGKSSRGLGKGHLFNKTIGGSRRANWKRNNTTSLRRKR #################################################################################################################################################################### ############################ ## 2) Chimeric assemblies ## ############################ [Formerly conotoxin] [No Framework] [similarity only in the putative mature peptide with only one episcopatus conotoxin precursor; the signal region of this precursor had similarity with the mature region of a precursor from imperialis] imperialisC7DQY0 MMFRLTSVSCFLLVIACLNLFQVVLTSRCFPPGIYCTPYLPCCWGICCGTCRNDNSSLTFLQFCLPFFFFLRPSHPLFLLLPAR---------------------------------------------------------------------- episcopatusBAS22909 ---------------------------------------------------MADNSSLTFLQFFLPVFFFLSPSHLLSLLLPAEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0031_242 -----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWKNQRGKKTLLSLTL------------------------------------- A_1387_258 -----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0520_258 -----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0239_246 
-----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0048_049 -----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0031_241 -----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0025_363 -----------------------------------------------------------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0855_225 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_CG13_310 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_1302_294 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_1278_154 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_1258_233 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0885_290 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0875_222 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0039_236 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0055_310 
---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ magusQFQ61189 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ PomaceaPVD36229 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLVKPQPRERVWRNQRGKKTLLSLTLSSGERGAIPSPLELSSRRHLAGVARAIRSEDRVRRGV LottiaXP_009053172 ---------------------------------------------------------------------MPRHVISDAHEWINEIPTVPIYYLAKPQPEERVRGCQRGKKTLLSLIRDWLGWARAQV--------------------------- schistocephalusVDM05491 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWQNQRGKKTLLSLTLVRLCEETWWV--------------------------- brugiaXP_001891758 ---------------------------------------------------------------------MPRHLIRDAHEWINEIPTVPIYYLAKPQPRERAWQTQRGKKTLLSLTLVRLCEES------------------------------ nematostellaXP_001618891 ---------------------------------------------------------------------MPRHLISDAHEWINEILTVPIYYLAKPQPRERAWQNQRGKKTLLSLTLV------------------------------------ PiceaABR16542 ---------------------------------------------------------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWQNQRGKKTLLSLTLVRLCEMT------------------------------ [Real: Transcription factor] A_0031_242 ----------------------------------EIPTVPIYYLAKPQPRERAWKNQRGKKTLLSLTL------------------------------------- A_1387_258 ----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0520_258 ----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0239_246 ----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0048_049 ----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0031_241 
----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0025_363 ----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ K_0010_408 ----------------------------------EIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0855_225 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_CG13_310 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_1302_294 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_1278_154 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ V_1258_233 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0885_290 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0875_222 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0039_236 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ A_0055_310 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ magusQFQ61189 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWRNQRGKKTLLSLTLVRLCEET------------------------------ PomaceaPVD36229 --------------------MPRHLISDAHEWINEIPTVPIYYLVKPQPRERVWRNQRGKKTLLSLTLSSGERGAIPSPLELSSRRHLAGVARAIRSEDRVRRGV LottiaXP_009053172 --------------------MPRHVISDAHEWINEIPTVPIYYLAKPQPEERVRGCQRGKKTLLSLIRDWLGWARAQV--------------------------- schistocephalusVDM05491 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWQNQRGKKTLLSLTLVRLCEETWWV--------------------------- brugiaXP_001891758 
--------------------MPRHLIRDAHEWINEIPTVPIYYLAKPQPRERAWQTQRGKKTLLSLTLVRLCEES------------------------------ nematostellaXP_001618891 --------------------MPRHLISDAHEWINEILTVPIYYLAKPQPRERAWQNQRGKKTLLSLTLV------------------------------------ PiceaABR16542 --------------------MPRHLISDAHEWINEIPTVPIYYLAKPQPRERAWQNQRGKKTLLSLTLVRLCEMT------------------------------ [Formerly conotoxin] [Framework XIV] [similarity only in the putative mature peptide with only one episcopatus superfamily T mature conotoxin; named Superfamily Cerm17 in Abalde et al. 2018] [Hypothetical protein Pmag05 as in Pardos-Blas et al. 2019] [ /---Signal---/-----C-------------------------C-C------------C--------------------------------------------] V_1302_292 --------------------------------------KFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIQTITHKPDIVQIAKVLSMHQINRFL V_CG13_265 --------------------------------------KFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIKTITHKPDIVQIAKVLSMHQINRFL A_0055_304 ------------------SCPRYLSSLIYKLRLNSWKTKFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIQTITHKPDIVQIAKVLSMHQINRFL V_1258_235 -----------------LSCPRYLSSLIYKLRLNSWKTKFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIQTITHKPDIVQIAKVLSMHQINRFL A_0855_177 -----------------LSCPRYLSSLIYKLHLNSWKTKFSQNVACDCSDLLSVDHILFHCSILTSLYKESNVDVTVHDSIKTITHKPDIVQIAKVLSMHQINRFL A_0875_210 -----------------LSCPRYLSSLIYKLHLNSWKTKFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIQTITHKPDIVQIAKVLSMHQINRFL A_0885_262 -----------------LSCPRYLSSLIYKLHLNSWKTKFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIKTITHKPEIVQIAKVLSMHQINRFL V_1278_118 -----------------LSCPRYLSSLIYKLHLNSWKTKFSQNVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIQTITHKPDIVQIAKVLSMHQINRFL A_0039_290 ------MREHLPEKYE-ILCPRYVSSLIYKLRLHSWKTKFSQSVACDCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSVKTITHKPDIVQIAKVLSMHQINRFL K_0010_210 ---------------------------IYKLRLNSWKTKFSQSVACGCSNLLSVDHILFHCSILTSLYKESNVDVTIHDSIKTITHKSEIVQIAKVLSMHQINRFL K_0010_211 
---------------------------IYKLRLNSWKTKFSQSVACGCSNLLSVDHILFHCSILTSLYKESNVDVTVHDSIKTITHKPEIVQIAKVLSMHQINRFL A_0055_241 ----------------METHPRHISSLIYKICLNFWKTKYSQNIACACSKPLSGEHILFHCPILNALYSKANIEISKSKPVHAISFS------------------- A_0855_187 ----------------METHPRHISSLIYKICLNFWKTKYSQNIACACSKPLSGEHILLYCPILNALYSKANIEISKSKPVHAISFS------------------- V_1302_291 --------------------------------------KYSQNIACACSKPLSGEHILFHCLILNALYSKANIEISKSKPVHAISFSSEIVEVAKIISQSQISIFL V_CG13_270 ----------------METHPRHISSLIYKIRLNSWKTKYSQNIACACSKPLSGEHILFHCLILNALYSKANIEISKSKPVHAISFSSEIVEVAKIISQSQISIFL ermineusAXL95377 ----------------METYPRHISSLIYKIRLNSWKTKYSQNIACACSKPLSGEHILFHCPILNALYSKANIEISKSKSVHAISFSAEIVEVAKIISQSQISIFL magusQFQ61211 ----------------METYPRHISSLIYKIRLNSWKTKYSQNIACACSKPLSSEHILFHCPISKALYSKANIEISKSKPAHAISFSSEIVEVAKIISQSQISIFL A_0039_218 ------VQEKLPAKTKALLCQRSLSSIIYKIRLNSWKTKFSKDVDCACGLPLTIQHILFECPVPTYLYYILKIDIASLGALHEILYSPKMIDIAKAISSSVIIHLL A_0855_167 -----MNNEKLPAKTKVLLCQRSLSSIIYKMRLNSWKTKFSKDVDCACGLPLSIQHILFECPVLTSLYNILKVDIASLGALQEILYSPKMIDIAKAISSSVIIHLL A_0885_172 --------------------QRSSSSIIYKMRLNSWKTKFSKDVDCACGLPLSIQHILFECPVLTSLYNILKIDIASLGALHEILYSPKMIDIAKAISSSVIIHLL V_1302_256 -----MNNEKLPAKTKALLCQRSLSSIIYKMRLNSWKTKFSKDVDCACGLPLSIQHILFECPVLTSLYNILKIDIASLGALHEILYSPKMIDIAKAISSSVIIHLL V_1278_127 -----MNNEKLPAKTKALLCQRSLSCIIYKMRLNSWKTKFSKDVDCACGLPLSIQHILFECPIL------------------------------------------ V_CG13_233 -----MNNEKLPAKTKVLLCQRSLSSIIYKMRLNSWKTKFSKDVDCACGLPLSIQHILFECPILTSLYNIQRIDIGSLGALQEILYSPKMIDIAKAISSSVIIHLL K_0010_220 -----MNDEKLPAKTKVLLCQRSLSSIIYKMRLNSWKTKFSKDVDCACGLPLSIQHILFECPILTSLYNIQRIDIGSLGALQEILYSPKMIDIAKAISSSVIIHLL A_0039_217 MLESRIMTEKLPSKSKILPCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPVSIEHILFDCPLLSLLYNISGIDVAKMRSLNDIIYSEEMIQIASVLSSTILANLL A_0055_330 MLESRIMTEKLPSKSKILPCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLYNISGIDVAKMRSLNDIIYSEEMIQIASVLSSTILANLL A_0875_196 
-------------------CSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLCNISGIDVAKMRSLNDIIYSEEMIQIASVLSSTILANLL V_1258_232 MLESSIMTEKLPSKSKILLCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLYNISGIDVAKMGSLNDIIYSEEMIQIASVVSSTILSNLL V_1278_117 MLESSIMTEKLPSKSKILLCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLYNISGIDVAKMGSLNDIIYSEEMIQIASVVSSTILSNLL V_1302_293 MLESSIMTEKLPSKSKILLCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLYNISGIDVAKMGSLNDIIYSEEMIQIASVVSSTILSNLL V_CG13_303 MLESSIMTEKLPSKSKILLCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLYNISGIDVAKMGSLNDIIYSEEMIQIASVVSSTILSNLL K_0010_026 MLESRIMTEKLPSKSKILPCSRSLSSLIYKIRLNSWKTKFSKNVECACGHPISIEHILFDCPLLSLLYNISGIDVAKMGSLNDIIYSEEMIQIASVLSSTILANLL ermineusAXL95414 ------------------PCSRSLSSVIYKIRLNSWKAKFSKNVDCACGHPISIEHILFDFPLLSILYNISGIDVAKMGSLNDIIYSEEMIQIASVLSSTLLVNLL [Formerly conotoxin] [New Framework] [similarity only in the putative mature peptide with only one episcopatus conotoxin precursor; named Superfamily Cerm19 in Abalde et al. 2018] [ /----------C----------C--------------------------CC----C----------C----C--C------------C---C----C--------------C----] episcopatusBAS23914 -------------------------------------MRCLPVFVILLLLIASAPSVDARPKTKDDIPQASFQDNAKRILQVSSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0055_269 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0885_185 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0875_214 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0855_168 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0039_277 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKWCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_1387_326 
ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0520_140 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0239_015 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0031_313 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0025_392 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_1258_174 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_1278_111 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_1302_227 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_CG13_235 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL ermineusAXL95402 ETSGRGPVASRCTSLPWRTL-WCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL magusQFQ61215 ETSGGGPVASRCTSLPWRTL-WCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL [Real: Hypothetical protein Pmag06 as in Pardos-Blas et al. 2019]
A_0055_269 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0885_185 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0875_214 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0855_168 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0039_277 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKWCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_1387_326 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0520_140 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0239_015 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0031_313 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL A_0025_392 ETSGGGPVASHCTSLPWRTLWWCWGGQGRPKSARRRKGQSSWGSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_1258_174 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_1278_111 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_1302_227 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL V_CG13_235 ETSGGGPVASRCTSLPWRTLWWCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL K_0010_077 ETSGGGPVASRCTSLPWRTL-WCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPLISSSSTCSRFCGTFFCGFRIFSFPSPSSLNCKFSSL ermineusAXL95402 
ETSGRGPVASRCTSLPWRTL-WCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL magusQFQ61215 ETSGGGPVASRCTSLPWRTL-WCWGGQGRPKSA-RRKGQSSWRSSQLKRCCRAALCTLGRTRYATSCPSGFCAPCRWHKPPISSSSTCSRFCGTFFCGFRIFSSPSPSSLNCKFSSL [Formerly conotoxin] [Framework XIV] [similarity only in the putative mature peptide with only one episcopatus superfamily T mature conotoxin; named Superfamily Cerm20 in Abalde et al. 2018 GBE 10:2643–2662] [ /------Signal--------/----------Propre----------/-----------------------C------C---------C--C---] episcopatusBAS23805 MRCLPVFVILLLLIASAPSVDARPKTKDDIPQASFQDNAKRILQVLESKRNCCRLQVCCGSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0855_224 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL V_CG13_309 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0875_221 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0039_281 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0055_328 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL V_1302_295 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL V_1278_155 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL V_1258_234 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0885_291 ------------------------------------------------------------STQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL V_1258_209 -----------------------------------------------------------------YGPEECDCPNKTKHCNGHHSVSTQCDLCPVL A_0239_317 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0239_318 ------------------------------------------------------------STQNWHGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0048_050 
-----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_1387_257 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0520_259 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0031_240 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL A_0025_362 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL K_0010_407 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL K_0010_050 --------------------------------------------------------------QNWYGPGESDCLITTKHCDGQHSVLTECDLYSVL ermineusAXL95399 -----------------------------------------------------------QSTQNWYGPGESDCLIKTKHCDGHHSVLTQCDFCPVL [real: 28S rRNA] A_0025_362 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0031_240 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0039_281 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0048_050 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0055_328 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0239_317 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0239_318 ---TCAACTCAGAACTGGCACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0520_259 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0855_224 
CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_0875_221 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTC--- A_0885_291 ---TCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA A_1387_257 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA K_0010_050 ---------CAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATCACAACAAAGCATTGTGATGGCCAGCACTCGGTGTTGACGGAATGTGATTTATACTCAGTGCTCTGA K_0010_407 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA V_1258_209 ------------------TACGGACCAGAGGAATGTGACTGTCCAAATAAAACAAAGCATTGTAATGGCCATCACTCGGTGTCGACGCAATGTGATTTGTGCCCAGTGCTCTGA V_1258_234 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA V_1278_155 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTC--- V_1302_295 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA V_CG13_309 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA ermineusMH360350 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA BuccinulumMK543279 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA PenioMG194426 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA KelletiaMH277544 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA IlyanassaAY145411 
CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA AustrofususMK543277 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA CominellaMH277542 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA CrepidulaJF509736 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA GibbulaAY145406 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA PleuroceraDQ256747 CAGTCAACTCAGAACTGGTACGGACCAGGGGAATCCGACTGTCTAATTAAAACAAAGCATTGCGATGGCCATCACTCGGTGTTGACGCAATGTGATTTCTGCCCAGTGCTCTGA [Formerly conotoxin] [No Framework] [similarity only in the putative mature peptide with only one episcopatus conotoxin precursor; named Superfamily Cerm21 in Abalde et al. 2018] episcopatusBAS25221 MRCLPVFVILLLLTASGPSVDARPMTKDDVPLSSFHDNTYYSLD----------FFFLLSFFFFFRWMLYAIFVLFADIGVPSRKDVFICLF episcopatusBAS23645 MRCLPVFVILLLLIASAPSVDARPKTKDDIPQASFQDNAKRILQVLERLLAGFFFFFFLSFFFF-RWMLYAIFVLFADIGVPSRKDVFICLF A_0855_194 ---------------------------------------------------------------IFRWIKYAIFILFADIKLPSRKDVFICLL--------- A_0039_227 ---------------------------------------------------------------IFRWIKYAIFILFADIKLPSRKDVFICLL--------- A_0875_213 ---------------------------------------------------------------IFRWIKYAIFILFADIELPSRKDVFICLL--------- A_0885_248 ---------------------------------------------------------------IFRWIKYAIFILFADMELPSRKDVFICLLWISMCGSKC V_CG13_240 ---------------------------------------------------------------FFRCIKYAIFILFADIGLPSRKDVFICLL--------- V_CG13_253 ---------------------------------------------------------------FFRWIKYAIFILFADIGLPSRKDVFICLL--------- V_1302_242 ---------------------------------------------------------------FFRWIKYAIFILFADIGLPSRKDVFICLL--------- V_1302_243 
---------------------------------------------------------------FFRLIKYAIFILFADIGLPSRKDVFICLL--------- V_1258_207 ---------------------------------------------------------------FFRWIKYAIFVLFADIGVPSRKDVFICLL--------- V_1258_206 ---------------------------------------------------------------SFRWIKYAIFVLFADIGVPSRKDVFICLL--------- A_0520_089 ---------------------------------------------------------------FFRWIKYAIFVLFADIGVPSRKDVFICLL--------- A_0048_170 ---------------------------------------------------------------FFRWIKYAIFVLFADIGVPSRKDVFICLL--------- ermineusAXL95655 -------------------------------------------------------FFFFFFFLFFRWIKYAIFVLLVDIGVPSRKDVFIGLL--------- [Real: Hypothetical protein] A_0855_194 --------IFRWIKYAIFILFADIKLPSRKDVFICLL--------- A_0039_227 --------IFRWIKYAIFILFADIKLPSRKDVFICLL--------- A_0875_213 --------IFRWIKYAIFILFADIELPSRKDVFICLL--------- A_0885_248 --------IFRWIKYAIFILFADMELPSRKDVFICLLWISMCGSKC V_CG13_240 --------FFRCIKYAIFILFADIGLPSRKDVFICLL--------- V_CG13_253 --------FFRWIKYAIFILFADIGLPSRKDVFICLL--------- V_1302_242 --------FFRWIKYAIFILFADIGLPSRKDVFICLL--------- V_1302_243 --------FFRLIKYAIFILFADIGLPSRKDVFICLL--------- V_1258_207 --------FFRWIKYAIFVLFADIGVPSRKDVFICLL--------- V_1258_206 --------SFRWIKYAIFVLFADIGVPSRKDVFICLL--------- A_0520_089 --------FFRWIKYAIFVLFADIGVPSRKDVFICLL--------- A_0048_170 --------FFRWIKYAIFVLFADIGVPSRKDVFICLL--------- K_0010_101 ----------RWIKYAIFILFADIELPSRKDVFICLL--------- ermineusAXL95655 FFFFFFFLFFRWIKYAIFVLLVDIGVPSRKDVFIGLL--------- [Formerly conotoxin] [No Framework] [similarity only in the putative mature peptide with only one episcopatus conotoxin precursor; named Superfamily Cerm22 in Abalde et al. 
2018] episcopatusBAS23734 MRCLPVFVILLLLIASAPSVDARPKTKDDIPQASFQDNAKRILQVLESKRNCCRLQVCCGCFHVVNDPFDLQGAISGLLPCENQFVTFVYTHAQCLLFYH-------- A_0039_268 -----------------------------------------------------------VCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLMRWV A_0855_169 -----------------------------------------------------------VCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLMRWV V_1258_211 -------------------------------------PIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLKGAISGLLPCKNRFLTFVYT----HTMLTLLPLMRWV V_1302_258 -------------------------------------PIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLKGAISGLLPCKNRFLTFVYT----HTMLTLLPLTRWV V_1278_164 -------------------------------------PIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLKGAISGLLPCKNQFLTFVYT----HTMLTLLPLMRWV A_0885_222 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLTRWV A_0520_179 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLMRWV A_0239_059 -----------------------------------MRPIELAAVYLLVRRLCACVRFVSVCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDTHTMPTLLPLMRWV A_0048_128 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVRFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLRRWV ermineusAXL95376 -------------------------------------PIKLAAVCLLVRRQCVCARCVCVCCHVVDHPFDLHEATSGLLPCKNQFLTFLDSHTHTH------------ A_0875_165 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTHTQCLLFYH------ A_0039_267 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTHTQCLLFYH------ A_0055_284 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTHTQCLLFYH------ A_0885_221 -----------------------------------MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTRTQCLLFYH------ [Real: Hypothetical protein] A_0039_268 ------------------------VCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLMRWV A_0855_169 
------------------------VCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLMRWV V_1258_211 --PIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLKGAISGLLPCKNRFLTFVYT----HTMLTLLPLMRWV V_1302_258 --PIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLKGAISGLLPCKNRFLTFVYT----HTMLTLLPLTRWV V_1278_164 --PIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLKGAISGLLPCKNQFLTFVYT----HTMLTLLPLMRWV A_0885_222 MRPIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLTRWV A_0520_179 MRPIELAAVYLLVRRLCACARFVSVCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLMRWV A_0239_059 MRPIELAAVYLLVRRLCACVRFVSVCFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDTHTMPTLLPLMRWV A_0048_128 MRPIELAAVYLLVRRLCACARFVSVRFHVVDLLFNLQGAISGLLPCKNQFLTFVYTHTDSHTMPTLLPLRRWV K_0010_348 ----------------------------VVDFLFNLQGAISGLLSCKNQFLTFVYTHT--QTMPTLLPLMRWV ermineusAXL95376 --PIKLAAVCLLVRRQCVCARCVCVCCHVVDHPFDLHEATSGLLPCKNQFLTFLDSHTHTH------------ A_0875_165 MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTHTQCLLFYH A_0039_267 MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTHTQCLLFYH A_0055_284 MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTHTQCLLFYH A_0885_221 MRPIELAAVYLLVRRLCACARFVSVFMLLIFYLTCKGPYLGCFHVKISSLLLCTHTQTRTQCLLFYH [Formerly conotoxin] [No Framework] A_0885_241 MGGRFVVTALIVMMVLSLIVTM----------RSHKRSLPRRGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRR A_0875_228 MGGRFVVTALIVMMVLSLIVTM----------RSHKRSLPRGGKRGYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRR A_0855_208 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRRGKRSYSFGDWSVRHALSTGYKGAGGNGRSADDKGTGGNRRR V_CG13_216 MGGRFVVTALIVMMVLSVIVTM----------RSHKRNLPRHGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ V_1258_202 MGGRFVVTALIVMMVLSVIVTM----------RSHKRNLPRHGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ V_1302_245 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNPPRYGKTSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ A_0039_299 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNPPRTGKRSYTFGDWIVRHALSAGYKGADGNGRSADDKGTGGNRRQ A_0055_329 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNPPRTGKRSYTFGGWSVRHALSAGYKGADGNGRSADDKGTGGNRRR A_0025_386 
MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRRGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ A_1387_220 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRRWKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ A_0520_180 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRVGKRSYTFGDWSVRHALSAGYKGADGNGRSADDKGTGGNRRQ A_0520_247 MGGRFVVTALIVMMVLSVMVTM----------RSHKRNLPRVGKRSYSFGDWSVRHALSAG---------------------- A_0048_221 MGGRFVVTALIVMMVLSLIVTM----------RSHKRSPPRVGKRSYSFGDWSVRHALSAGYKGADGNGRSADDKGTGGNRRQ A_1387_219 MVGRFVVTALIVMMVLSVLVTM----------RSHKRNLPRIGKRSYTFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ andremeneziATF27510 MGGRFVVTALIAMTLLSLVLTSVPDRRIFWVDESNRPNLQRIGQRSYTFGDWNIRRAVSTG------------HKRTDGNGRQ [Real: Hypothetical protein] A_0885_241 MGGRFVVTALIVMMVLSLIVTM----------RSHKRSLPRRGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRR A_0875_228 MGGRFVVTALIVMMVLSLIVTM----------RSHKRSLPRGGKRGYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRR A_0855_208 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRRGKRSYSFGDWSVRHALSTGYKGAGGNGRSADDKGTGGNRRR V_CG13_216 MGGRFVVTALIVMMVLSVIVTM----------RSHKRNLPRHGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ V_1258_202 MGGRFVVTALIVMMVLSVIVTM----------RSHKRNLPRHGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ V_1302_245 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNPPRYGKTSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ A_0039_299 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNPPRTGKRSYTFGDWIVRHALSAGYKGADGNGRSADDKGTGGNRRQ A_0055_329 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNPPRTGKRSYTFGGWSVRHALSAGYKGADGNGRSADDKGTGGNRRR A_0025_386 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRRGKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ A_1387_220 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRRWKRSYSFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ A_0520_180 MGGRFVVTALIVMMVLSLIVTM----------RSHKRNLPRVGKRSYTFGDWSVRHALSAGYKGADGNGRSADDKGTGGNRRQ A_0520_247 MGGRFVVTALIVMMVLSVMVTM----------RSHKRNLPRVGKRSYSFGDWSVRHALSAG---------------------- A_0048_221 MGGRFVVTALIVMMVLSLIVTM----------RSHKRSPPRVGKRSYSFGDWSVRHALSAGYKGADGNGRSADDKGTGGNRRQ A_1387_219 
MVGRFVVTALIVMMVLSVLVTM----------RSHKRNLPRIGKRSYTFGDWSVRHALSTGYKGADGNGRSADDKGTGGNRRQ andremeneziATF27510 MGGRFVVTALIAMTLLSLVLTSVPDRRIFWVDESNRPNLQRIGQRSYTFGDWNIRRAVSTG------------HKRTDGNGRQ