# This dataset was compiled from the SWISSPROT database (Release 52.4) # ID the accession number of SWISSPROT # Pos The position of mucin-type O-glycosylation residue in the corresponding sequence # Each site is presented by a 41 residue long fragment ID Pos 41 residue long fragment CD24_MOUSE 30 GLGLLLLALLLPTQIYCNQTSVAPFPGNQNISASPNPSNAT CGHB_HUMAN 141 GGPKDHPLTCDDPRFQDSSSSKAPPPSLPSPSRLPGPSDTP CGHB_HUMAN 147 PLTCDDPRFQDSSSSKAPPPSLPSPSRLPGPSDTPILPQOO CGHB_HUMAN 152 DPRFQDSSSSKAPPPSLPSPSRLPGPSDTPILPQOOOOOOO CGHB_HUMAN 158 SSSSKAPPPSLPSPSRLPGPSDTPILPQOOOOOOOOOOOOO CMGA_BOVIN 185 EDNQAPGEEEEAPSNAHPLASLPSPKYPGPQAKEDSEGPSQ CMGA_BOVIN 204 ASLPSPKYPGPQAKEDSEGPSQGPASREKGLSAEQGRQTER CSF2_HUMAN 22 WLQSLLLLGTVACSISAPARSPSPSTQPWEHVNAIQEARRL CSF2_HUMAN 24 QSLLLLGTVACSISAPARSPSPSTQPWEHVNAIQEARRLLN CSF2_HUMAN 26 LLLLGTVACSISAPARSPSPSTQPWEHVNAIQEARRLLNLS DLK_HUMAN 94 CICTDGWDGELCDRDVRACSSAPCANNGTCVSLDGGLYECS EPO_HUMAN 153 TTLLRALGAQKEAISPPDAASAAPLRTITADTFRKLFRVYS EPYC_BOVIN 95 QPEEAEEEEEEESTPRLIDGSSPQEPEFTGVLGPQTNEDFP FA12_HUMAN 308 EYCDLAQCQTPTQAAPPTPVSPRLHVPLMPAQPAPPKPQPT FETUA_BOVIN 271 FQTQPVIPQPQPDGAEAEAPSAVPDAAGPTPSAAGPPVASV FETUA_BOVIN 282 PDGAEAEAPSAVPDAAGPTPSAAGPPVASVVVGPSVVAVPL FETUA_HUMAN 346 LGSPSGEVSHPRKTRTVVQPSVGAAAGPVVPPCPGRIRHFK FGFP1_BOVIN 172 KLVNSTLIRIKKPSQELMEPSPMDTVEVTTSSSPEKTQTMA GLPA_HUMAN 21 MYGKIIFVLLLSAIVSISASSTTGVAMHTSTSSSVTKSYIS GLPA_HUMAN 30 LLSAIVSISASSTTGVAMHTSTSSSVTKSYISSQTNDTHKR GLPA_HUMAN 32 SAIVSISASSTTGVAMHTSTSSSVTKSYISSQTNDTHKRDT GLPA_HUMAN 38 SASSTTGVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPR GLPA_HUMAN 41 STTGVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHE GLPA_HUMAN 63 QTNDTHKRDTYAATPRAHEVSEISVRTVYPPEEETGERVQL GLPA_HUMAN 66 DTHKRDTYAATPRAHEVSEISVRTVYPPEEETGERVQLAHH GLPC_HUMAN 3 OOOOOOOOOOOOOOOOOOMWSTRSPNSTAWPLSLEPDPGMA GLPC_HUMAN 6 OOOOOOOOOOOOOOOMWSTRSPNSTAWPLSLEPDPGMASAS GLPC_HUMAN 9 OOOOOOOOOOOOMWSTRSPNSTAWPLSLEPDPGMASASTTM GLPC_HUMAN 15 OOOOOOMWSTRSPNSTAWPLSLEPDPGMASASTTMHTTTIA GLPC_HUMAN 24 TRSPNSTAWPLSLEPDPGMASASTTMHTTTIAEPDPGMSGW GLPC_HUMAN 26 SPNSTAWPLSLEPDPGMASASTTMHTTTIAEPDPGMSGWPD GLPC_HUMAN 42 MASASTTMHTTTIAEPDPGMSGWPDGRMETSTPTIMDIVVI GLP_CANFA 12 OOOOOOOOOQDVTEIIPHQISSKLPTQAGFISTEDPSFNTP GLP_CANFA 13 OOOOOOOOQDVTEIIPHQISSKLPTQAGFISTEDPSFNTPS GLP_CANFA 23 VTEIIPHQISSKLPTQAGFISTEDPSFNTPSTREDPSGTMY GLP_CANFA 39 AGFISTEDPSFNTPSTREDPSGTMYQHLPDGGQKOOOOOOO GLP_HORSE 7 OOOOOOOOOOOOOOQTIATGSPPIAGTSDLSTITSAATPTF GLP_HORSE 17 OOOOQTIATGSPPIAGTSDLSTITSAATPTFTTEQDGREQG GLP_HORSE 21 QTIATGSPPIAGTSDLSTITSAATPTFTTEQDGREQGDGLQ GLP_MACFU 2 OOOOOOOOOOOOOOOOOOOSSTTVPATHTSSSSLGPEQYVS GLP_MACFU 12 OOOOOOOOOSSTTVPATHTSSSSLGPEQYVSSQSNDKHTSD GLP_MACFU 14 OOOOOOOSSTTVPATHTSSSSLGPEQYVSSQSNDKHTSDSH GLP_MACFU 23 TTVPATHTSSSSLGPEQYVSSQSNDKHTSDSHPTPTSAHEV GLP_MACFU 48 KHTSDSHPTPTSAHEVTTEFSGRTHYPPEEDDRVQLVHEFS GLP_PIG 11 OOOOOOOOOOTETPVTGEQGSATPGNVSNATVTAGKPSATS GLP_PIG 31 SATPGNVSNATVTAGKPSATSPGVMTIKNTTAVVQKETGVP IACA_PIG 12 OOOOOOOOOTRKQPNCNVYRSHLFFCTRQMDPICGTNGKSY IACA_PIG 62 KGLRNQKFDFGHWGHCREYTSARSOOOOOOOOOOOOOOOOO IC1_HUMAN 64 KVATTVISKMLFVEPILEVSSLPTTNSTTNSATKITANTTD IGHA1_HUMAN 105 CHVKHYTNPSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHP IGHA1_HUMAN 111 TNPSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHPRLSLHR IGHA1_HUMAN 113 PSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHPRLSLHRPA IGHA1_HUMAN 119 VPCPVPSTPPTPSPSTPPTPSPSCCHPRLSLHRPALEDLLL IGHA1_HUMAN 121 CPVPSTPPTPSPSTPPTPSPSCCHPRLSLHRPALEDLLLGS IGHD_HUMAN 109 TASKSKKEIFRWPESPKAQASSVPTAQPQAEGSLAKATTAP IGHD_HUMAN 110 ASKSKKEIFRWPESPKAQASSVPTAQPQAEGSLAKATTAPA ITA2B_HUMAN 878 FPQPPVNPLKVDWGLPIPSPSPIHPAHHKRDRRQIFLPEPE ITIH2_HUMAN 673 GALYYGSKVVPDSTPSWANPSPTPVISMLAQGSQVLESTPP ITIH4_HUMAN 696 PDVPDHAAYHPFRRLAILPASAPPATSNPDPAVSRVMNIKI ITIH4_HUMAN 702 AAYHPFRRLAILPASAPPATSNPDPAVSRVMNIKIEETTMT KLK1_HUMAN 93 LWLGRHNLFDDENTAQFVHVSESFPHPGFNMSLLENHTRQA KLK1_HUMAN 104 ENTAQFVHVSESFPHPGFNMSLLENHTRQADEDYSHDLMLL KLK1_HUMAN 167 PEVGSTCLASGWGSIEPENFSFPDDLQCVDLKILPNDECEK KNG1_BOVIN 398 MKRPPGFSPFRSVQVMKTEGSTTVSLPHSAMSPVQDEERDS KNG1_BOVIN 406 PFRSVQVMKTEGSTTVSLPHSAMSPVQDEERDSGKEQGPTH KNG1_BOVIN 512 GKHYDWRTPYLASSYEDSTTSSAQTQEKTEETTLSSLAQPG KNG1_HUMAN 577 TVTFSDFQDSDLIATMMPPISPAPIQSDDDWIPDIQIDPNG KNG2_BOVIN 400 PGFSPFRSVQVMKTEGSTTVSLPHSAMSPVQDEERDSGKEQ LAMP1_HUMAN 206 GETRCEQDRPSPTTAPPAPPSPSPSPVPKSPSVDKYNVSGT LAMP1_HUMAN 208 TRCEQDRPSPTTAPPAPPSPSPSPVPKSPSVDKYNVSGTNG LAMP1_HUMAN 210 CEQDRPSPTTAPPAPPSPSPSPVPKSPSVDKYNVSGTNGTC LAMP2_HUMAN 195 AFVQNGTVSTNEFLCDKDKTSTVAPTIHTTVPSPTTTPTPK LCAT_HUMAN 433 HINAILLGAYRQGPPASPTASPEPPPPEOOOOOOOOOOOOO LEUK_HUMAN 29 GVLVVSPDALGSTTAVQTPTSGEPLVSTSEPLSSKMYTTSI LEUK_HUMAN 35 PDALGSTTAVQTPTSGEPLVSTSEPLSSKMYTTSITSDPKA LEUK_HUMAN 37 ALGSTTAVQTPTSGEPLVSTSEPLSSKMYTTSITSDPKADS LEUK_HUMAN 41 TTAVQTPTSGEPLVSTSEPLSSKMYTTSITSDPKADSTGDQ LEUK_HUMAN 42 TAVQTPTSGEPLVSTSEPLSSKMYTTSITSDPKADSTGDQT LEUK_HUMAN 48 TSGEPLVSTSEPLSSKMYTTSITSDPKADSTGDQTSALPPS LEUK_HUMAN 99 TSIGASTGSPLPEPTTYQEVSIKMSSVPQETPHATSHPAVP LEUK_HUMAN 103 ASTGSPLPEPTTYQEVSIKMSSVPQETPHATSHPAVPITAN LEUK_HUMAN 114 TYQEVSIKMSSVPQETPHATSHPAVPITANSLGSHTVTGGT LEUK_RAT 23 QVVSQENLPNTMTMLPFTPNSESPSTSEALSTYSSIATVPV LEUK_RAT 25 VSQENLPNTMTMLPFTPNSESPSTSEALSTYSSIATVPVTE LEUK_RAT 27 QENLPNTMTMLPFTPNSESPSTSEALSTYSSIATVPVTEDP LEUK_RAT 29 NLPNTMTMLPFTPNSESPSTSEALSTYSSIATVPVTEDPKE LEUK_RAT 33 TMTMLPFTPNSESPSTSEALSTYSSIATVPVTEDPKESISP LEUK_RAT 36 MLPFTPNSESPSTSEALSTYSSIATVPVTEDPKESISPWGQ LEUK_RAT 37 LPFTPNSESPSTSEALSTYSSIATVPVTEDPKESISPWGQT LEUK_RAT 108 PELTTSQEVSTEASLVLFPKSSGVASDPPVTITNPATSSAV LEUK_RAT 113 SQEVSTEASLVLFPKSSGVASDPPVTITNPATSSAVASTSL LEUK_RAT 125 FPKSSGVASDPPVTITNPATSSAVASTSLETFKGTSAPPVT LEUK_RAT 126 PKSSGVASDPPVTITNPATSSAVASTSLETFKGTSAPPVTV LEUK_RAT 176 FVATTVSSETSGPPVTMATGSLGPSKETHGLSATIATSSGE LEUK_RAT 180 TVSSETSGPPVTMATGSLGPSKETHGLSATIATSSGESSSV LEUK_RAT 187 GPPVTMATGSLGPSKETHGLSATIATSSGESSSVAGGTPVF LSHB_HORSE 138 TDCGVFRDQPLACAPQASSSSKDPPSQPLTSTSTPTPGASR LSHB_HORSE 143 FRDQPLACAPQASSSSKDPPSQPLTSTSTPTPGASRRSSHP LSHB_HORSE 148 LACAPQASSSSKDPPSQPLTSTSTPTPGASRRSSHPLPIKT LSHB_HORSE 150 CAPQASSSSKDPPSQPLTSTSTPTPGASRRSSHPLPIKTSO LSHB_HORSE 157 SSKDPPSQPLTSTSTPTPGASRRSSHPLPIKTSOOOOOOOO LSHB_HORSE 160 DPPSQPLTSTSTPTPGASRRSSHPLPIKTSOOOOOOOOOOO LSHB_HORSE 161 PPSQPLTSTSTPTPGASRRSSHPLPIKTSOOOOOOOOOOOO LSHB_HORSE 169 TSTPTPGASRRSSHPLPIKTSOOOOOOOOOOOOOOOOOOOO NID1_MOUSE 331 SPSHSPRRGYPDPHNVPRILSPGYEATERPRGVPTERTRSF NLGN1_RAT 683 STPVTSAFPTAKQDDPKQQPSPFSVDQRDYSTELSVTIAVG NLGN1_RAT 686 VTSAFPTAKQDDPKQQPSPFSVDQRDYSTELSVTIAVGASL PLMN_BOVIN 365 TNSEVRWEYCTIPSCESSPLSTERMDVPVPPEQTPVPQDCY PLMN_HUMAN 268 TDPNKRWELCDIPRCTTPPPSSGPTYQCLKGTGENYRGNVA PRG2_HUMAN 24 PLLLALLFGAVSALHLRSETSTFETPLGAKTLPEDEETPEQ RNBR_BOVIN 159 NPYVPVHFDGAVLLPATPVPSLPPPHRLLOOOOOOOOOOOO TPO_HUMAN 22 ELTELLLVVMLLLTARLTLSSPAPPACDLRVLSKLLRDSHV TPO_HUMAN 184 MLVGGSTLCVRRAPPTTAVPSRTSLVLTLNELPNRTSGLLE TPO_HUMAN 265 GYLNRIHELLNGTRGLFPGPSRRTLGAPDISSGTSDTGSLP TRFE_HUMAN 51 SEHEATKCQSFRDHMKSVIPSDGPSVACVKKASYLDCIRAI ZP4_PIG 293 SITRDSIFRLRVSCIYSVSSSALPVNIQVFTLPPPLPETHP