LOCUS NC_045512 29903 bp ss-RNA linear VRL 30-MAR-2020 DEFINITION Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1, complete genome. ACCESSION NC_045512 VERSION NC_045512.2 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Severe acute respiratory syndrome coronavirus 2 (SARS-CoV2) ORGANISM Severe acute respiratory syndrome coronavirus 2 Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Sarbecovirus. REFERENCE 1 (bases 13476 to 13503) AUTHORS Baranov,P.V., Henderson,C.M., Anderson,C.B., Gesteland,R.F., Atkins,J.F. and Howard,M.T. TITLE Programmed ribosomal frameshifting in decoding the SARS-CoV genome JOURNAL Virology 332 (2), 498-510 (2005) PUBMED 15680415 REFERENCE 2 (bases 29728 to 29768) AUTHORS Robertson,M.P., Igel,H., Baertsch,R., Haussler,D., Ares,M. Jr. and Scott,W.G. TITLE The structure of a rigorously conserved RNA element within the SARS virus genome JOURNAL PLoS Biol. 3 (1), e5 (2005) PUBMED 15630477 REFERENCE 3 (bases 29609 to 29657) AUTHORS Williams,G.D., Chang,R.Y. and Brian,D.A. TITLE A phylogenetically conserved hairpin-type 3' untranslated region pseudoknot functions in coronavirus RNA replication JOURNAL J. Virol. 73 (10), 8349-8355 (1999) PUBMED 10482585 REFERENCE 4 (bases 1 to 29903) AUTHORS Wu,F., Zhao,S., Yu,B., Chen,Y.-M., Wang,W., Hu,Y., Song,Z.-G., Tao,Z.-W., Tian,J.-H., Pei,Y.-Y., Yuan,M.L., Zhang,Y.-L., Dai,F.-H., Liu,Y., Wang,Q.-M., Zheng,J.-J., Xu,L., Holmes,E.C. and Zhang,Y.-Z. TITLE A novel coronavirus associated with a respiratory disease in Wuhan of Hubei province, China JOURNAL Unpublished REFERENCE 5 (bases 1 to 29903) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (17-JAN-2020) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 6 (bases 1 to 29903) AUTHORS Wu,F., Zhao,S., Yu,B., Chen,Y.-M., Wang,W., Hu,Y., Song,Z.-G., Tao,Z.-W., Tian,J.-H., Pei,Y.-Y., Yuan,M.L., Zhang,Y.-L., Dai,F.-H., Liu,Y., Wang,Q.-M., Zheng,J.-J., Xu,L., Holmes,E.C. and Zhang,Y.-Z. TITLE Direct Submission JOURNAL Submitted (05-JAN-2020) Shanghai Public Health Clinical Center & School of Public Health, Fudan University, Shanghai, China COMMENT ##Assembly-Data-START## Assembly Method :: Megahit v. V1.1.3 Sequencing Technology :: Illumina ##Assembly-Data-END## REVIEWED REFSEQ: This record has been curated by NCBI staff. The reference sequence is identical to MN908947. On Jan 17, 2020 this sequence version replaced NC_045512.1. Annotation was added using homology to SARSr-CoV NC_004718.3. ### Formerly called 'Wuhan seafood market pneumonia virus.' If you have questions or suggestions, please email us at info@ncbi.nlm.nih.gov and include the accession number NC_045512.### Protein structures can be found at https://www.ncbi.nlm.nih.gov/structure/?term=sars-cov-2.### Find all other Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequences at https://www.ncbi.nlm.nih.gov/genbank/sars-cov-2-seqs/ COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..29903 /organism="Severe acute respiratory syndrome coronavirus 2" /mol_type="genomic RNA" /isolate="Wuhan-Hu-1" /host="Homo sapiens" /db_xref="taxon:2697049" /country="China" /collection_date="Dec-2019" 5'UTR 1..265 gene 266..21555 /gene="ORF1ab" /locus_tag="GU280_gp01" /db_xref="GeneID:43740578" CDS join(266..13468,13468..21555) /gene="ORF1ab" /locus_tag="GU280_gp01" /ribosomal_slippage="" /note="pp1ab; translated by -1 ribosomal frameshift" /codon_start=1 /product="ORF1ab polyprotein" /protein_id="YP_009724389.1" /db_xref="GeneID:43740578" /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNT KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFI DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT FTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEF ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEE AKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQR KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP YIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT DPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE LKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVW HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN LDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN AQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVA IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLT FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF STFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAAC CHLAKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSC GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL TILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL ATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDD GARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMC VEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFR YMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLR VESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRA TLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQ RKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCV PLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISM DNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTT KGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGL NNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITN CVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQI PTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARL TPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFS NYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEG NCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAM RNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAES HVDTDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFN VLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLV YAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEG SSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINAN QVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISA KNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDV ENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVM CGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRL YECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVL YYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVD DIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYS VMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCY DHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVF GLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSY GIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDA VVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVA NYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLP IDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLS VVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRC PAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRN PAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFN VAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHP TQAPTHLSVDTKFKTEGLCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEA IRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSA KPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKY FVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSN HDLYCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHM VVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHS DKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDK SAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYR LYLDAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPV SIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANT VIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVL ITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQ EFKPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKE SPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVK VTIDYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGD SATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPT GTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFF TYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGC NYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQIN DMILSLLSKGRLIIRENNRVVISSDVLVNN" mat_peptide 266..805 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="leader protein" /note="nsp1; produced by both pp1a and pp1ab" /protein_id="YP_009725297.1" mat_peptide 806..2719 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp2" /note="produced by both pp1a and pp1ab" /protein_id="YP_009725298.1" mat_peptide 2720..8554 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp3" /note="former nsp1; conserved domains are: N-terminal acidic (Ac), predicted phosphoesterase, papain-like proteinase, Y-domain, transmembrane domain 1 (TM1), adenosine diphosphate-ribose 1''-phosphatase (ADRP); produced by both pp1a and pp1ab" /protein_id="YP_009725299.1" mat_peptide 8555..10054 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp4" /note="nsp4B_TM; contains transmembrane domain 2 (TM2); produced by both pp1a and pp1ab" /protein_id="YP_009725300.1" mat_peptide 10055..10972 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="3C-like proteinase" /note="nsp5A_3CLpro and nsp5B_3CLpro; main proteinase (Mpro); mediates cleavages downstream of nsp4. 3D structure of the SARSr-CoV homolog has been determined (Yang et al., 2003); produced by both pp1a and pp1ab" /protein_id="YP_009725301.1" mat_peptide 10973..11842 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp6" /note="nsp6_TM; putative transmembrane domain; produced by both pp1a and pp1ab" /protein_id="YP_009725302.1" mat_peptide 11843..12091 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp7" /note="produced by both pp1a and pp1ab" /protein_id="YP_009725303.1" mat_peptide 12092..12685 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp8" /note="produced by both pp1a and pp1ab" /protein_id="YP_009725304.1" mat_peptide 12686..13024 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp9" /note="ssRNA-binding protein; produced by both pp1a and pp1ab" /protein_id="YP_009725305.1" mat_peptide 13025..13441 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp10" /note="nsp10_CysHis; formerly known as growth-factor-like protein (GFL); produced by both pp1a and pp1ab" /protein_id="YP_009725306.1" mat_peptide join(13442..13468,13468..16236) /gene="ORF1ab" /locus_tag="GU280_gp01" /product="RNA-dependent RNA polymerase" /note="nsp12; NiRAN and RdRp; produced by pp1ab only" /protein_id="YP_009725307.1" mat_peptide 16237..18039 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="helicase" /note="nsp13_ZBD, nsp13_TB, and nsp_HEL1core; zinc-binding domain (ZD), NTPase/helicase domain (HEL), RNA 5'-triphosphatase; produced by pp1ab only" /protein_id="YP_009725308.1" mat_peptide 18040..19620 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="3'-to-5' exonuclease" /note="nsp14A2_ExoN and nsp14B_NMT; produced by pp1ab only" /protein_id="YP_009725309.1" mat_peptide 19621..20658 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="endoRNAse" /note="nsp15-A1 and nsp15B-NendoU; produced by pp1ab only" /protein_id="YP_009725310.1" mat_peptide 20659..21552 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="2'-O-ribose methyltransferase" /note="nsp16_OMT; 2'-o-MT; produced by pp1ab only" /protein_id="YP_009725311.1" CDS 266..13483 /gene="ORF1ab" /locus_tag="GU280_gp01" /note="pp1a" /codon_start=1 /product="ORF1a polyprotein" /protein_id="YP_009725295.1" /db_xref="GeneID:43740578" /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNT KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFI DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT FTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEF ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEE AKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQR KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP YIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT DPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE LKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVW HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN LDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN AQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVA IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLT FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF STFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAAC CHLAKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSC GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL TILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL ATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDD GARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMC VEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFR YMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLR VESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRA TLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQ RKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCV PLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISM DNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTT KGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGL NNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITN CVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQI PTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV" mat_peptide 266..805 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="leader protein" /note="nsp1; produced by both pp1a and pp1ab" /protein_id="YP_009742608.1" mat_peptide 806..2719 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp2" /note="produced by both pp1a and pp1ab" /protein_id="YP_009742609.1" mat_peptide 2720..8554 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp3" /note="former nsp1; conserved domains are: N-terminal acidic (Ac), predicted phosphoesterase, papain-like proteinase, Y-domain, transmembrane domain 1 (TM1), adenosine diphosphate-ribose 1''-phosphatase (ADRP); produced by both pp1a and pp1ab" /protein_id="YP_009742610.1" mat_peptide 8555..10054 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp4" /note="nsp4B_TM; contains transmembrane domain 2 (TM2); produced by both pp1a and pp1ab" /protein_id="YP_009742611.1" mat_peptide 10055..10972 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="3C-like proteinase" /note="nsp5A_3CLpro and nsp5B_3CLpro; main proteinase (Mpro); mediates cleavages downstream of nsp4. 3D structure of the SARSr-CoV homolog has been determined (Yang et al., 2003); produced by both pp1a and pp1ab" /protein_id="YP_009742612.1" mat_peptide 10973..11842 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp6" /note="nsp6_TM; putative transmembrane domain; produced by both pp1a and pp1ab" /protein_id="YP_009742613.1" mat_peptide 11843..12091 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp7" /note="produced by both pp1a and pp1ab" /protein_id="YP_009742614.1" mat_peptide 12092..12685 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp8" /note="produced by both pp1a and pp1ab" /protein_id="YP_009742615.1" mat_peptide 12686..13024 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp9" /note="ssRNA-binding protein; produced by both pp1a and pp1ab" /protein_id="YP_009742616.1" mat_peptide 13025..13441 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp10" /note="nsp10_CysHis; formerly known as growth-factor-like protein (GFL); produced by both pp1a and pp1ab" /protein_id="YP_009742617.1" mat_peptide 13442..13480 /gene="ORF1ab" /locus_tag="GU280_gp01" /product="nsp11" /note="produced by pp1a only" /protein_id="YP_009725312.1" stem_loop 13476..13503 /gene="ORF1ab" /locus_tag="GU280_gp01" /inference="COORDINATES: profile:Rfam-release-14.1:RF00507,Infernal:1.1.2" /function="Coronavirus frameshifting stimulation element stem-loop 1" stem_loop 13488..13542 /gene="ORF1ab" /locus_tag="GU280_gp01" /inference="COORDINATES: profile:profile:Rfam-release-14.1:RF00507,Infernal:1.1.2" /function="Coronavirus frameshifting stimulation element stem-loop 2" gene 21563..25384 /gene="S" /locus_tag="GU280_gp02" /gene_synonym="spike glycoprotein" /db_xref="GeneID:43740568" CDS 21563..25384 /gene="S" /locus_tag="GU280_gp02" /gene_synonym="spike glycoprotein" /note="structural protein; spike protein" /codon_start=1 /product="surface glycoprotein" /protein_id="YP_009724390.1" /db_xref="GeneID:43740568" /translation="MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRS SVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGW IFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSA NNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSA LEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNE NGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGE VFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADS FVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRK SNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFEL LHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAV RDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRV YSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIA YTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLL LQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKP SKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIA QYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSA IGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEA EVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYH LMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQR NFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDL GDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLI AIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT" gene 25393..26220 /gene="ORF3a" /locus_tag="GU280_gp03" /db_xref="GeneID:43740569" CDS 25393..26220 /gene="ORF3a" /locus_tag="GU280_gp03" /codon_start=1 /product="ORF3a protein" /protein_id="YP_009724391.1" /db_xref="GeneID:43740569" /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFGW LIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLEAP FLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNSV TSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQLSTD TGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL" gene 26245..26472 /gene="E" /locus_tag="GU280_gp04" /db_xref="GeneID:43740570" CDS 26245..26472 /gene="E" /locus_tag="GU280_gp04" /note="ORF4; structural protein; E protein" /codon_start=1 /product="envelope protein" /protein_id="YP_009724392.1" /db_xref="GeneID:43740570" /translation="MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN IVNVSLVKPSFYVYSRVKNLNSSRVPDLLV" gene 26523..27191 /gene="M" /locus_tag="GU280_gp05" /db_xref="GeneID:43740571" CDS 26523..27191 /gene="M" /locus_tag="GU280_gp05" /note="ORF5; structural protein" /codon_start=1 /product="membrane glycoprotein" /protein_id="YP_009724393.1" /db_xref="GeneID:43740571" /translation="MADSNGTITVEELKKLLEQWNLVIGFLFLTWICLLQFAYANRNRF LYIIKLIFLWLLWPVTLACFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFA RTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKD LPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ " gene 27202..27387 /gene="ORF6" /locus_tag="GU280_gp06" /db_xref="GeneID:43740572" CDS 27202..27387 /gene="ORF6" /locus_tag="GU280_gp06" /codon_start=1 /product="ORF6 protein" /protein_id="YP_009724394.1" /db_xref="GeneID:43740572" /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLT ENKYSQLDEEQPMEID" gene 27394..27759 /gene="ORF7a" /locus_tag="GU280_gp07" /db_xref="GeneID:43740573" CDS 27394..27759 /gene="ORF7a" /locus_tag="GU280_gp07" /codon_start=1 /product="ORF7a protein" /protein_id="YP_009724395.1" /db_xref="GeneID:43740573" /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSP FHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIV AAIVFITLCFTLKRKTE" gene 27756..27887 /gene="ORF7b" /locus_tag="GU280_gp08" /db_xref="GeneID:43740574" CDS 27756..27887 /gene="ORF7b" /locus_tag="GU280_gp08" /codon_start=1 /product="ORF7b" /protein_id="YP_009725318.1" /db_xref="GeneID:43740574" /translation="MIELSLIDFYLCFLAFLLFLVLIMLIIFWFSLELQDHNETCHA" gene 27894..28259 /gene="ORF8" /locus_tag="GU280_gp09" /db_xref="GeneID:43740577" CDS 27894..28259 /gene="ORF8" /locus_tag="GU280_gp09" /codon_start=1 /product="ORF8 protein" /protein_id="YP_009724396.1" /db_xref="GeneID:43740577" /translation="MKFLVFLGIITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKW YIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRCSF YEDFLEYHDVRVVLDFI" gene 28274..29533 /gene="N" /locus_tag="GU280_gp10" /db_xref="GeneID:43740575" CDS 28274..29533 /gene="N" /locus_tag="GU280_gp10" /note="ORF9; structural protein" /codon_start=1 /product="nucleocapsid phosphoprotein" /protein_id="YP_009724397.2" /db_xref="GeneID:43740575" /translation="MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGL PNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDL SPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQ GTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGDAALALL LLDRLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQ TQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLD DKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAAD LDDFSKQLQQSMSSADSTQA" gene 29558..29674 /gene="ORF10" /locus_tag="GU280_gp11" /db_xref="GeneID:43740576" CDS 29558..29674 /gene="ORF10" /locus_tag="GU280_gp11" /codon_start=1 /product="ORF10 protein" /protein_id="YP_009725255.1" /db_xref="GeneID:43740576" /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT" stem_loop 29609..29644 /gene="ORF10" /locus_tag="GU280_gp11" /inference="COORDINATES: profile::Rfam-release-14.1:RF00165,Infernal:1.1.2" /function="Coronavirus 3' UTR pseudoknot stem-loop 1" stem_loop 29629..29657 /gene="ORF10" /locus_tag="GU280_gp11" /inference="COORDINATES: profile::Rfam-release-14.1:RF00165,Infernal:1.1.2" /function="Coronavirus 3' UTR pseudoknot stem-loop 2" 3'UTR 29675..29903 stem_loop 29728..29768 /inference="COORDINATES: profile:Rfam-release-14.1:RF00164,Infernal:1.1.2" /note="basepair exception: alignment to the Rfam model implies coordinates 29740:29758 form a noncanonical C:T basepair, but the homologous positions form a highly conserved C:G basepair in other viruses, including SARS (NC_004718.3)" /function="Coronavirus 3' stem-loop II-like motif (s2m)" ORIGIN 1 attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct 61 gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact 121 cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc 181 ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt 241 cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac 301 acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg 361 agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg 421 cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa 481 acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact 541 cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg 601 cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg 661 tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga 721 tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga 781 actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg 841 ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc 901 atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg 961 tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca 1021 gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa 1081 ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa 1141 gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg 1201 caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca 1261 gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga 1321 aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc 1381 atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg 1441 cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc 1501 ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg 1561 ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga 1621 aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga 1681 gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa 1741 aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac 1801 aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc 1861 tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct 1921 tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg 1981 aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac 2041 taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg 2101 gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga 2161 agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat 2221 ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa 2281 ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc 2341 tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca 2401 ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc 2461 tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt 2521 aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga 2581 agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga 2641 aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac 2701 cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga 2761 agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt 2821 acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc 2881 ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc 2941 actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg 3001 tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga 3061 agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga 3121 agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga 3181 agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga 3241 cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt 3301 agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt 3361 aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt 3421 aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc 3481 aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc 3541 tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa 3601 acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa 3661 gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg 3721 tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa 3781 tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga 3841 aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa 3901 gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat 3961 caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa 4021 cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag 4081 tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca 4141 agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat 4201 gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca 4261 gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc 4321 cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc 4381 ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg 4441 tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca 4501 agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc 4561 gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta 4621 tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc 4681 agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc 4741 ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa 4801 agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga 4861 taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac 4921 ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac 4981 aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca 5041 acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc 5101 acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt 5161 tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca 5221 cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa 5281 caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc 5341 acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc 5401 acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat 5461 gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg 5521 taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg 5581 cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca 5641 agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc 5701 tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca 5761 gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt 5821 acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag 5881 ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat 5941 tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat 6001 tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg 6061 tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc 6121 aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta 6181 taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg 6241 gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg 6301 tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga 6361 cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt 6421 ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt 6481 aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca 6541 cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga 6601 attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag 6661 tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac 6721 aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt 6781 ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc 6841 atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga 6901 ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg 6961 gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt 7021 tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa 7081 ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct 7141 tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc 7201 atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat 7261 tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag 7321 ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt 7381 acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta 7441 tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg 7501 ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag 7561 gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg 7621 tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga 7681 cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga 7741 tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac 7801 ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac 7861 taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc 7921 atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact 7981 agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga 8041 tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact 8101 agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac 8161 ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt 8221 tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa 8281 ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat 8341 tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat 8401 atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc 8461 tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa 8521 tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca 8581 gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc 8641 tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat 8701 tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc 8761 tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc 8821 attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac 8881 gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt 8941 tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc 9001 ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata 9061 ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac 9121 acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc 9181 tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc 9241 agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag 9301 atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac 9361 accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat 9421 tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg 9481 tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact 9541 ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt 9601 gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt 9661 cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca 9721 tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt 9781 tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa 9841 gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa 9901 taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg 9961 tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc 10021 accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc 10081 atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg 10141 tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat 10201 gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca 10261 ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct 10321 taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg 10381 acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc 10441 tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg 10501 ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac 10561 tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca 10621 aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta 10681 cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga 10741 ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat 10801 actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa 10861 agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga 10921 tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt 10981 gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt 11041 agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt 11101 accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa 11161 gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat 11221 ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac 11281 tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact 11341 aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat 11401 gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc 11461 catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat 11521 gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac 11581 tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg 11641 ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga 11701 ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa 11761 gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg 11821 tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt 11881 actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt 11941 ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt 12001 ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga 12061 agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc 12121 atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga 12181 ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga 12241 ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat 12301 gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat 12361 gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc 12421 aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt 12481 tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc 12541 atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag 12601 tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag 12661 ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat 12721 gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta 12781 caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa 12841 atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc 12901 ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa 12961 aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct 13021 acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt 13081 tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac 13141 taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc 13201 ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg 13261 ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat 13321 acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt 13381 ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca 13441 gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca 13501 ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat 13561 aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac 13621 gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac 13681 caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac 13741 ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact 13801 aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac 13861 acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag 13921 gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa 13981 cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt 14041 attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt 14101 gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg 14161 ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac 14221 ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta 14281 aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac 14341 tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg 14401 ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt 14461 gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac 14521 ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg 14581 cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca 14641 cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat 14701 gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc 14761 ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta 14821 ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt 14881 gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa 14941 tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt 15001 tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact 15061 caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc 15121 tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc 15181 gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac 15241 atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct 15301 aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc 15361 aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct 15421 caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc 15481 tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc 15541 acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc 15601 cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac 15661 tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac 15721 gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag 15781 aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg 15841 actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt 15901 aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc 15961 ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg 16021 tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc 16081 tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta 16141 gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt 16201 tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc 16261 aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa 16321 tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat 16381 gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg 16441 agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa 16501 gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca 16561 attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa 16621 agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct 16681 tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa 16741 gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact 16801 aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct 16861 gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca 16921 tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga 16981 attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat 17041 tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag 17101 agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct 17161 tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat 17221 aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg 17281 aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca 17341 gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat 17401 gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca 17461 cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt 17521 atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt 17581 gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca 17641 gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt 17701 aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa 17761 gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta 17821 ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa 17881 accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca 17941 aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca 18001 agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc 18061 tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc 18121 agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag 18181 gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat 18241 ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt 18301 ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta 18361 cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca 18421 cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa 18481 cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta 18541 caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca 18601 catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt 18661 tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg 18721 catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg 18781 ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca 18841 catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt 18901 aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg 18961 gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca 19021 gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa 19081 tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc 19141 tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc 19201 aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct 19261 aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac 19321 acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac 19381 tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca 19441 ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat 19501 gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc 19561 ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag 19621 agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt 19681 gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta 19741 gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag 19801 cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct 19861 gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt 19921 gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact 19981 gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt 20041 gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct 20101 agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag 20161 aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta 20221 caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa 20281 ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt 20341 agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa 20401 tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata 20461 acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat 20521 gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg 20581 actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca 20641 ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt 20701 tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca 20761 acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta 20821 aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct 20881 gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg 20941 cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat 21001 tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct 21061 aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt 21121 gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat 21181 tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt 21241 actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa 21301 ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca 21361 aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta 21421 aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt 21481 cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt 21541 cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag 21601 tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac 21661 acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga 21721 cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac 21781 caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc 21841 ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa 21901 gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt 21961 tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat 22021 ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca 22081 gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt 22141 gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt 22201 gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat 22261 taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga 22321 ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag 22381 gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact 22441 tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta 22501 tcaaacttct aactttagag tccaaccaac agaatctatt gttagatttc ctaatattac 22561 aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg 22621 gaacaggaag agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc 22681 attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac 22741 taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg 22801 gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt 22861 tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta 22921 tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta 22981 tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca 23041 atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact 23101 ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt 23161 ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac 23221 tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac 23281 tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg 23341 tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca 23401 ggatgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg 23461 gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc 23521 tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag 23581 ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat 23641 tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctaata actctattgc 23701 catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa 23761 gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt 23821 gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga 23881 acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc 23941 aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag 24001 caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt 24061 catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca 24121 aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata 24181 cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc 24241 attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca 24301 gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa 24361 aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa 24421 ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat 24481 ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat 24541 tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat 24601 tagagctgca gaaatcagag cttctgctaa tcttgctgct actaaaatgt cagagtgtgt 24661 acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc 24721 tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa 24781 gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg 24841 tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca 24901 aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt 24961 caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga 25021 taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa 25081 tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt 25141 aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc 25201 atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat 25261 gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg 25321 ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac 25381 ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag 25441 caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg 25501 atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt 25561 cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt 25621 gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc 25681 gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag 25741 agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa 25801 aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat 25861 tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca 25921 agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga 25981 gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca 26041 actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt 26101 gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acggttcatc cggagttgtt 26161 aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa 26221 gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagac aggtacgtta 26281 atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc 26341 atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta 26401 aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat 26461 cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag 26521 ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat 26581 ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg 26641 ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag 26701 taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa 26761 ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt 26821 tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc 26881 tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa 26941 tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg 27001 acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaacgctt tcttattaca 27061 aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca 27121 ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc 27181 ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag 27241 atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata 27301 aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat 27361 gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg 27421 ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta 27481 cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta 27541 gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac 27601 ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga 27661 caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt 27721 ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact 27781 tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt 27841 ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat 27901 ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac 27961 agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt 28021 ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg 28081 atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct 28141 gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt 28201 cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa 28261 cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac 28321 gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg 28381 atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct 28441 cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac 28501 caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg 28561 tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg 28621 gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga 28681 gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc 28741 aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag 28801 cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa 28861 ttcaactcca ggcagcagta ggggaacttc tcctgctaga atggctggca atggcggtga 28921 tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg 28981 taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa 29041 gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag 29101 acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac 29161 tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg 29221 aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc 29281 catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca 29341 tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc 29401 tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc 29461 tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc 29521 aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc 29581 ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc 29641 acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta 29701 gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt 29761 acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat 29821 tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaatgac aaaaaaaaaa 29881 aaaaaaaaaa aaaaaaaaaa aaa // LOCUS NC_002645 27317 bp RNA linear VRL 13-AUG-2018 DEFINITION Human coronavirus 229E, complete genome. ACCESSION NC_002645 VERSION NC_002645.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human coronavirus 229E ORGANISM Human coronavirus 229E Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Alphacoronavirus; Duvinacovirus. REFERENCE 1 (bases 1 to 27317) AUTHORS Thiel,V., Herold,J., Schelle,B. and Siddell,S.G. TITLE Infectious RNA transcribed in vitro from a cDNA copy of the human coronavirus genome cloned in vaccinia virus JOURNAL J. Gen. Virol. 82 (Pt 6), 1273-1281 (2001) PUBMED 11369870 REFERENCE 2 (bases 1 to 27317) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (13-JAN-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 27317) AUTHORS Thiel,V., Herold,J. and Siddell,S.G. TITLE Direct Submission JOURNAL Submitted (11-SEP-2000) Virology, University of Wuerzburg, Versbacherstr.7, Wuerzburg 97078, Germany COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to AF304460. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..27317 /organism="Human coronavirus 229E" /mol_type="genomic RNA" /strain="229E" /db_xref="taxon:11137" /clone="inf-1" gene 293..20568 /locus_tag="HCoV229Egp1" /db_xref="GeneID:918764" CDS join(293..12520,12520..20568) /locus_tag="HCoV229Egp1" /note="translated by -1 ribosomal frameshift" /codon_start=1 /product="replicase polyprotein 1ab" /protein_id="NP_073549.1" /db_xref="GeneID:918764" /translation="MACNRVTLAVASDSEISANGCSTIAQAVRRYSEAASNGFRACRFV SLDLQDCIVGIADDTYVMGLHGNQTLFCNIMKFSDRPFMLHGWLVFSNSNYLLEEFDVV FGKRGGGNVTYTDQYLCGADGKPVMSEDLWQFVDHFGENEEIIINGHTYVCAWLTKRKP LDYKRQNNLAIEEIEYVHGDALHTLRNGSVLEMAKEVKTSSKVVLSDALDKLYKVFGSP VMTNGSNILEAFTKPVFISALVQCTCGTKSWSVGDWTGFKSSCCNVISNKLCVVPGNVK PGDAVITTQQAGAGIKYFCGMTLKFVANIEGVSVWRVIALQSVDCFVASSTFVEEEHVN RMDTFCFNVRNSVTDECRLAMLGAEMTSNVRRQVASGVIDISTGWFDVYDDIFAESKPW FVRKAEDIFGPCWSALASALKQLKVTTGELVRFVKSICNSAVAVVGGTIQILASVPEKF LNAFDVFVTAIQTVFDCAVETCTIAGKAFDKVFDYVLLDNALVKLVTTKLKGVRERGLN KVKYATVVVGSTEEVKSSRVERSTAVLTIANNYSKLFDEGYTVVIGDVAYFVSDGYFRL MASPNSVLTTAVYKPLFAFNVNVMGTRPEKFPTTVTCENLESAVLFVNDKITEFQLDYS IDVIDNEIIVKPNISLCVPLYVRDYVDKWDDFCRQYSNESWFEDDYRAFISVLDITDAA VKAAESKAFVDTIVPPCPSILKVIDGGKIWNGVIKNVNSVRDWLKSLKLNLTQQGLLGT CAKRFKRWLGILLEAYNAFLDTVVSTVKIGGLTFKTYAFDKPYIVIRDIVCKVENKTEA EWIELFPHNDRIKSFSTFESAYMPIADPTHFDIEEVELLDAEFVEPGCGGILAVIDEHV FYKKDGVYYPSNGTNILPVAFTKAAGGKVSFSDDVEVKDIEPVYRVKLCFEFEDEKLVD VCEKAIGKKIKHEGDWDSFCKTIQSALSVVSCYVNLPTYYIYDEEGGNDLSLPVMISEW PLSVQQAQQEATLPDIAEDVVDQVEEVNSIFDIETVDVKHDVSPFEMPFEELNGLKILK QLDNNCWVNSVMLQIQLTGILDGDYAMQFFKMGRVAKMIERCYTAEQCIRGAMGDVGLC MYRLLKDLHTGFMVMDYKCSCTSGRLEESGAVLFCTPTKKAFPYGTCLNCNAPRMCTIR QLQGTIIFVQQKPEPVNPVSFVVKPVCSSIFRGAVSCGHYQTNIYSQNLCVDGFGVNKI QPWTNDALNTICIKDADYNAKVEISVTPIKNTVDTTPKEEFVVKEKLNAFLVHDNVAFY QGDVDTVVNGVDFDFIVNAANENLAHGGGLAKALDVYTKGKLQRLSKEHIGLAGKVKVG TGVMVECDSLRIFNVVGPRKGKHERDLLIKAYNTINNEQGTPLTPILSCGIFGIKLETS LEVLLDVCNTKEVKVFVYTDTEVCKVKDFVSGLVNVQKVEQPKIEPKPVSVIKVAPKPY RVDGKFSYFTEDLLCVADDKPIVLFTDSMLTLDDRGLALDNALSGVLSAAIKDCVDINK AIPSGNLIKFDIGSVVVYMCVVPSEKDKHLDNNVQRCTRKLNRLMCDIVCTIPADYILP LVLSSLTCNVSFVGELKAAEAKVITIKVTEDGVNVHDVTVTTDKSFEQQVGVIADKDKD LSGAVPSDLNTSELLTKAIDVDWVEFYGFKDAVTFATVDHSAFAYESAVVNGIRVLKTS DNNCWVNAVCIALQYSKPHFISQGLDAAWNKFVLGDVEIFVAFVYYVARLMKGDKGDAE DTLTKLSKYLANEAQVQLEHYSSCVECDAKFKNSVASINSAIVCASVKRDGVQVGYCVH GIKYYSRVRSVRGRAIIVSVEQLEPCAQSRLLSGVAYTAFSGPVDKGHYTVYDTAKKSM YDGDRFVKHDLSLLSVTSVVMVGGYVAPVNTVKPKPVINQLDEKAQKFFDFGDFLIHNF VIFFTWLLSMFTLCKTAVTTGDVKIMAKAPQRTGVVLKRSLKYNLKASAAVLKSKWWLL AKFTKLLLLIYTLYSVVLLCVRFGPFNFCSETVNGYAKSNFVKDDYCDGSLGCKMCLFG YQELSQFSHLDVVWKHITDPLFSNMQPFIVMVLLLIFGDNYLRCFLLYFVAQMISTVGV FLGYKETNWFLHFIPFDVICDELLVTVIVIKVISFVRHVLFGCENPDCIACSKSARLKR FPVNTIVNGVQRSFYVNANGGSKFCKKHRFFCVDCDSYGYGSTFITPEVSRELGNITKT NVQPTGPAYVMIDKVEFENGFYRLYSCETFWRYNFDITESKYSCKEVFKNCNVLDDFIV FNNNGTNVTQVKNASVYFSQLLCRPIKLVDSELLSTLSVDFNGVLHKAYIDVLRNSFGK DLNANMSLAECKRALGLSISDHEFTSAISNAHRCDVLLSDLSFNNFVSSYAKPEEKLSA YDLACCMRAGAKVVNANVLTKDQTPIVWHAKDFNSLSAEGRKYIVKTSKAKGLTFLLTI NENQAVTQIPATSIVAKQGAGDAGHSLTWLWLLCGLVCLIQFYLCFFMPYFMYDIVSSF EGYDFKYIENGQLKNFEAPLKCVRNVFENFEDWHYAKFGFTPLNKQSCPIVVGVSEIVN TVAGIPSNVYLVGKTLIFTLQAAFGNAGVCYDIFGVTTPEKCIFTSACTRLEGLGGNNV YCYNTALMEGSLPYSSIQANAYYKYDNGNFIKLPEVIAQGFGFRTVRTIATKYCRVGEC VESNAGVCFGFDKWFVNDGRVANGYVCGTGLWNLVFNILSMFSSSFSVAAMSGQILLNC ALGAFAIFCCFLVTKFRRMFGDLSVGVCTVVVAVLLNNVSYIVTQNLVTMIAYAILYFF ATRSLRYAWIWCAAYLIAYISFAPWWLCAWYFLAMLTGLLPSLLKLKVSTNLFEGDKFV GTFESAAAGTFVIDMRSYEKLANSISPEKLKSYAASYNRYKYYSGNANEADYRCACYAY LAKAMLDFSRDHNDILYTPPTVSYGSTLQAGLRKMAQPSGFVEKCVVRVCYGNTVLNGL WLGDIVYCPRHVIASNTTSAIDYDHEYSIMRLHNFSIISGTAFLGVVGATMHGVTLKIK VSQTNMHTPRHSFRTLKSGEGFNILACYDGCAQGVFGVNMRTNWTIRGSFINGACGSPG YNLKNGEVEFVYMHQIELGSGSHVGSSFDGVMYGGFEDQPNLQVESANQMLTVNVVAFL YAAILNGCTWWLKGEKLFVEHYNEWAQANGFTAMNGEDAFSILAAKTGVCVERLLHAIQ VLNNGFGGKQILGYSSLNDEFSINEVVKQMFGVNLQSGKTTSMFKSISLFAGFFVMFWA ELFVYTTTIWVNPGFLTPFMILLVALSLCLTFVVKHKVLFLQVFLLPSIIVAAIQNCAW DYHVTKVLAEKFDYNVSVMQMDIQGFVNIFICLFVALLHTWRFAKERCTHWCTYLFSLI AVLYTALYSYDYVSLLVMLLCAISNEWYIGAIIFRICRFGVAFLPVEYVSYFDGVKTVL LFYMLLGFVSCMYYGLLYWINRFCKCTLGVYDFCVSPAEFKYMVANGLNAPNGPFDALF LSFKLMGIGGPRTIKVSTVQSKLTDLKCTNVVLMGILSNMNIASNSKEWAYCVEMHNKI NLCDDPETAQELLLALLAFFLSKHSDFGLGDLVDSYFENDSILQSVASSFVGMPSFVAY ETARQEYENAVANGSSPQIIKQLKKAMNVAKAEFDRESSVQKKINRMAEQAAAAMYKEA RAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILNMARNGVVPLSVIPATSAARLVVVVP DHDSFVKMMVDGFVHYAGVVWTLQEVKDNDGKNVHLKDVTKENQEILVWPLILTCERVV KLQNNEIMPGKMKVKATKGEGDGGITSEGNALYNNEGGRAFMYAYVTTKPGMKYVKWEH DSGVVTVELEPPCRFVIDTPTGPQIKYLYFVKNLNNLRRGAVLGYIGATVRLQAGKQTE FVSNSHLLTHCSFAVDPAAAYLDAVKQGAKPVGNCVKMLTNGSGSGQAITCTIDSNTTQ DTYGGASVCIYCRAHVAHPTMDGFCQYKGKWVQVPIGTNDPIRFCLENTVCKVCGCWLN HGCTCDRTAIQSFDNSYLNRVRGSSAARLEPCNGTDIDYCVRAFDVYNKDASFIGKNLK SNCVRFKNVDKDDAFYIVKRCIKSVMDHEQSMYNLLKGCNAVAKHDFFTWHEGRTIYGN VSRQDLTKYTMMDLCFALRNFDEKDCEVFKEILVLTGCCSTDYFEMKNWFDPIENEDIH RVYAALGKVVANAMLKCVAFCDEMVLKGVVGVLTLDNQDLNGNFYDFGDFVLCPPGMGI PYCTSYYSYMMPVMGMTNCLASECFMKSDIFGQDFKTFDLLKYDFTEHKEVLFNKYFKY WGQDYHPDCVDCHDEMCILHCSNFNTLFATTIPNTAFGPLCRKVFIDGVPVVATAGYHF KQLGLVWNKDVNTHSTRLTITELLQFVTDPTLIVASSPALVDKRTVCFSVAALSTGLTS QTVKPGHFNKEFYDFLRSQGFFDEGSELTLKHFFFTQKGDAAIKDFDYYRYNRPTMLDI GQARVAYQVAARYFDCYEGGCITSREVVVTNLNKSAGWPLNKFGKAGLYYESISYEEQD AIFSLTKRNILPTMTQLNLKYAISGKERARTVGGVSLLATMTTRQFHQKCLKSIVATRN ATVVIGTTKFYGGWDNMLKNLMADVDDPKLMGWDYPKCDRAMPSMIRMLSAMILGSKHV TCCTASDKFYRLSNELAQVLTEVVYSNGGFYFKPGGTTSGDATTAYANSVFNIFQAVSS NINCVLSVNSSNCNNFNVKKLQRQLYDNCYRNSNVDESFVDDFYGYLQKHFSMMILSDD SVVCYNKTYAGLGYIADISAFKATLYYQNGVFMSTAKCWTEEDLSIGPHEFCSQHTMQI VDENGKYYLPYPDPSRIISAGVFVDDITKTDAVILLERYVSLAIDAYPLSKHPKPEYRK VFYALLDWVKHLNKTLNEGVLESFSVTLLDEHESKFWDESFYASMYEKSTVLQAAGLCV VCGSQTVLRCGDCLRRPMLCTKCAYDHVFGTDHKFILAITPYVCNTSGCNVNDVTKLYL GGLNYYCVDHKPHLSFPLCSAGNVFGLYKSSALGSMDIDVFNKLSTSDWSDIRDYKLAN DAKESLRLFAAETVKAKEESVKSSYAYATLKEIVGPKELLLLWESGKAKPPLNRNSVFT CFQITKDSKFQVGEFVFEKVDYGSDTVTYKSTATTKLVPGMLFILTSHNVAPLRAPTMA NQEKYSTIYKLHPSFNVSDAYANLVPYYQLIGKQRITTIQGPPGSGKSHCSIGIGVYYP GARIVFTACSHAAVDSLCAKAVTAYSVDKCTRIIPARARVECYSGFKPNNNSAQYVFST VNALPEVNADIVVVDEVSMCTNYDLSVINQRISYKHIVYVGDPQQLPAPRVLISKGVME PIDYNVVTQRMCAIGPDVFLHKCYRCPAEIVNTVSELVYENKFVPVKEASKQCFKIFER GSVQVDNGSSINRRQLDVVKRFIHKNSTWSKAVFISPYNSQNYVAARLLGLQTQTVDSA QGSEYDYVIFAQTSDTAHACNANRFNVAITRAKKGIFCIMSDRTLFDALKFFEITMTDL QSESSCGLFKDCARNPIDLPPSHATTYLSLSDRFKTSGDLAVQIGNNNVCTYEHVISYM GFRFDVSMPGSHSLFCTRDFAMRHVRGWLGMDVEGAHVTGDNVGTNVPLQVGFSNGVDF VAQPEGCVLTNTGSVVKPVRARAPPGEQFTHIVPLLRKGQPWSVLRKRIVQMIADFLAG SSDVLVFVLWAGGLELTTMRYFVKIGAVKHCQCGTVATCYNSVSNDYCCFKHALGCDYV YNPYVIDIQQWGYVGSLSTNHHAICNVHRNEHVASGDAIMTRCLAVYDCFVKNVDWSIT YPMIANENAINKGGRTVQSHIMRAAIKLYNPKAIHDIGNPKGIRCAVTDAKWYCYDKNP INSNVKTLEYDYMTHGQMDGLCLFWNCNVDMYPEFSIVCRFDTRTRSTLNLEGVNGGSL YVNNHAFHTPAYDKRAMAKLKPAPFFYYDDGSCEVVHDQVNYVPLRATNCITKCNIGGA VCSKHANLYRAYVESYNIFTQAGFNIWVPTTFDCYNLWQTFTEVNLQGLENIAFNVVNK GSFVGADGELPVAISGDKVFVRDGNTDNLVFVNKTSLPTNIAFELFAKRKVGLTPPLSI LKNLGVVATYKFVLWDYEAERPLTSFTKSVCGYTDFAEDVCTCYDNSIQGSYERFTLST NAVLFSATAVKTGGKSLPAIKLNFGMLNGNAIATVKSEDGNIKNINWFVYVRKDGKPVD HYDGFYTQGRNLQDFLPRSTMEEDFLNMDIGVFIQKYGLEDFNFEHVVYGDVSKTTLGG LHLLISQVRLSKMGILKAEEFVAASDITLKCCTVTYLNDPSSKTVCTYMDLLLDDFVSV LKSLDLTVVSKVHEVIIDNKPWRWMLWCKDNAVATFYPQLQSAEWKCGYSMPGIYKTQR MCLEPCNLYNYGAGLKLPSGIMFNVVKYTQLCQYFNSTTLCVPHNMRVLHLGAGSDYGV APGTAVLKRWLPHDAIVVDNDVVDYVSDADFSVTGDCATVYLEDKFDLLISDMYDGRTK AIDGENVSKEGFFTYINGFICEKLAIGGSIAIKVTEYSWNKKLYELVQRFSFWTMFCTS VNTSSSEAFVVGINYLGDFAQGPFIDGNIIHANYVFWRNSTVMSLSYNSVLDLSKFNCK HKATVVVQLKDSDINEMVLSLVRSGKLLVRGNGKCLSFSNHLVSTK" CDS 293..12550 /locus_tag="HCoV229Egp1" /codon_start=1 /product="replicase polyprotein 1a" /protein_id="NP_073550.1" /db_xref="GeneID:918764" /translation="MACNRVTLAVASDSEISANGCSTIAQAVRRYSEAASNGFRACRFV SLDLQDCIVGIADDTYVMGLHGNQTLFCNIMKFSDRPFMLHGWLVFSNSNYLLEEFDVV FGKRGGGNVTYTDQYLCGADGKPVMSEDLWQFVDHFGENEEIIINGHTYVCAWLTKRKP LDYKRQNNLAIEEIEYVHGDALHTLRNGSVLEMAKEVKTSSKVVLSDALDKLYKVFGSP VMTNGSNILEAFTKPVFISALVQCTCGTKSWSVGDWTGFKSSCCNVISNKLCVVPGNVK PGDAVITTQQAGAGIKYFCGMTLKFVANIEGVSVWRVIALQSVDCFVASSTFVEEEHVN RMDTFCFNVRNSVTDECRLAMLGAEMTSNVRRQVASGVIDISTGWFDVYDDIFAESKPW FVRKAEDIFGPCWSALASALKQLKVTTGELVRFVKSICNSAVAVVGGTIQILASVPEKF LNAFDVFVTAIQTVFDCAVETCTIAGKAFDKVFDYVLLDNALVKLVTTKLKGVRERGLN KVKYATVVVGSTEEVKSSRVERSTAVLTIANNYSKLFDEGYTVVIGDVAYFVSDGYFRL MASPNSVLTTAVYKPLFAFNVNVMGTRPEKFPTTVTCENLESAVLFVNDKITEFQLDYS IDVIDNEIIVKPNISLCVPLYVRDYVDKWDDFCRQYSNESWFEDDYRAFISVLDITDAA VKAAESKAFVDTIVPPCPSILKVIDGGKIWNGVIKNVNSVRDWLKSLKLNLTQQGLLGT CAKRFKRWLGILLEAYNAFLDTVVSTVKIGGLTFKTYAFDKPYIVIRDIVCKVENKTEA EWIELFPHNDRIKSFSTFESAYMPIADPTHFDIEEVELLDAEFVEPGCGGILAVIDEHV FYKKDGVYYPSNGTNILPVAFTKAAGGKVSFSDDVEVKDIEPVYRVKLCFEFEDEKLVD VCEKAIGKKIKHEGDWDSFCKTIQSALSVVSCYVNLPTYYIYDEEGGNDLSLPVMISEW PLSVQQAQQEATLPDIAEDVVDQVEEVNSIFDIETVDVKHDVSPFEMPFEELNGLKILK QLDNNCWVNSVMLQIQLTGILDGDYAMQFFKMGRVAKMIERCYTAEQCIRGAMGDVGLC MYRLLKDLHTGFMVMDYKCSCTSGRLEESGAVLFCTPTKKAFPYGTCLNCNAPRMCTIR QLQGTIIFVQQKPEPVNPVSFVVKPVCSSIFRGAVSCGHYQTNIYSQNLCVDGFGVNKI QPWTNDALNTICIKDADYNAKVEISVTPIKNTVDTTPKEEFVVKEKLNAFLVHDNVAFY QGDVDTVVNGVDFDFIVNAANENLAHGGGLAKALDVYTKGKLQRLSKEHIGLAGKVKVG TGVMVECDSLRIFNVVGPRKGKHERDLLIKAYNTINNEQGTPLTPILSCGIFGIKLETS LEVLLDVCNTKEVKVFVYTDTEVCKVKDFVSGLVNVQKVEQPKIEPKPVSVIKVAPKPY RVDGKFSYFTEDLLCVADDKPIVLFTDSMLTLDDRGLALDNALSGVLSAAIKDCVDINK AIPSGNLIKFDIGSVVVYMCVVPSEKDKHLDNNVQRCTRKLNRLMCDIVCTIPADYILP LVLSSLTCNVSFVGELKAAEAKVITIKVTEDGVNVHDVTVTTDKSFEQQVGVIADKDKD LSGAVPSDLNTSELLTKAIDVDWVEFYGFKDAVTFATVDHSAFAYESAVVNGIRVLKTS DNNCWVNAVCIALQYSKPHFISQGLDAAWNKFVLGDVEIFVAFVYYVARLMKGDKGDAE DTLTKLSKYLANEAQVQLEHYSSCVECDAKFKNSVASINSAIVCASVKRDGVQVGYCVH GIKYYSRVRSVRGRAIIVSVEQLEPCAQSRLLSGVAYTAFSGPVDKGHYTVYDTAKKSM YDGDRFVKHDLSLLSVTSVVMVGGYVAPVNTVKPKPVINQLDEKAQKFFDFGDFLIHNF VIFFTWLLSMFTLCKTAVTTGDVKIMAKAPQRTGVVLKRSLKYNLKASAAVLKSKWWLL AKFTKLLLLIYTLYSVVLLCVRFGPFNFCSETVNGYAKSNFVKDDYCDGSLGCKMCLFG YQELSQFSHLDVVWKHITDPLFSNMQPFIVMVLLLIFGDNYLRCFLLYFVAQMISTVGV FLGYKETNWFLHFIPFDVICDELLVTVIVIKVISFVRHVLFGCENPDCIACSKSARLKR FPVNTIVNGVQRSFYVNANGGSKFCKKHRFFCVDCDSYGYGSTFITPEVSRELGNITKT NVQPTGPAYVMIDKVEFENGFYRLYSCETFWRYNFDITESKYSCKEVFKNCNVLDDFIV FNNNGTNVTQVKNASVYFSQLLCRPIKLVDSELLSTLSVDFNGVLHKAYIDVLRNSFGK DLNANMSLAECKRALGLSISDHEFTSAISNAHRCDVLLSDLSFNNFVSSYAKPEEKLSA YDLACCMRAGAKVVNANVLTKDQTPIVWHAKDFNSLSAEGRKYIVKTSKAKGLTFLLTI NENQAVTQIPATSIVAKQGAGDAGHSLTWLWLLCGLVCLIQFYLCFFMPYFMYDIVSSF EGYDFKYIENGQLKNFEAPLKCVRNVFENFEDWHYAKFGFTPLNKQSCPIVVGVSEIVN TVAGIPSNVYLVGKTLIFTLQAAFGNAGVCYDIFGVTTPEKCIFTSACTRLEGLGGNNV YCYNTALMEGSLPYSSIQANAYYKYDNGNFIKLPEVIAQGFGFRTVRTIATKYCRVGEC VESNAGVCFGFDKWFVNDGRVANGYVCGTGLWNLVFNILSMFSSSFSVAAMSGQILLNC ALGAFAIFCCFLVTKFRRMFGDLSVGVCTVVVAVLLNNVSYIVTQNLVTMIAYAILYFF ATRSLRYAWIWCAAYLIAYISFAPWWLCAWYFLAMLTGLLPSLLKLKVSTNLFEGDKFV GTFESAAAGTFVIDMRSYEKLANSISPEKLKSYAASYNRYKYYSGNANEADYRCACYAY LAKAMLDFSRDHNDILYTPPTVSYGSTLQAGLRKMAQPSGFVEKCVVRVCYGNTVLNGL WLGDIVYCPRHVIASNTTSAIDYDHEYSIMRLHNFSIISGTAFLGVVGATMHGVTLKIK VSQTNMHTPRHSFRTLKSGEGFNILACYDGCAQGVFGVNMRTNWTIRGSFINGACGSPG YNLKNGEVEFVYMHQIELGSGSHVGSSFDGVMYGGFEDQPNLQVESANQMLTVNVVAFL YAAILNGCTWWLKGEKLFVEHYNEWAQANGFTAMNGEDAFSILAAKTGVCVERLLHAIQ VLNNGFGGKQILGYSSLNDEFSINEVVKQMFGVNLQSGKTTSMFKSISLFAGFFVMFWA ELFVYTTTIWVNPGFLTPFMILLVALSLCLTFVVKHKVLFLQVFLLPSIIVAAIQNCAW DYHVTKVLAEKFDYNVSVMQMDIQGFVNIFICLFVALLHTWRFAKERCTHWCTYLFSLI AVLYTALYSYDYVSLLVMLLCAISNEWYIGAIIFRICRFGVAFLPVEYVSYFDGVKTVL LFYMLLGFVSCMYYGLLYWINRFCKCTLGVYDFCVSPAEFKYMVANGLNAPNGPFDALF LSFKLMGIGGPRTIKVSTVQSKLTDLKCTNVVLMGILSNMNIASNSKEWAYCVEMHNKI NLCDDPETAQELLLALLAFFLSKHSDFGLGDLVDSYFENDSILQSVASSFVGMPSFVAY ETARQEYENAVANGSSPQIIKQLKKAMNVAKAEFDRESSVQKKINRMAEQAAAAMYKEA RAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILNMARNGVVPLSVIPATSAARLVVVVP DHDSFVKMMVDGFVHYAGVVWTLQEVKDNDGKNVHLKDVTKENQEILVWPLILTCERVV KLQNNEIMPGKMKVKATKGEGDGGITSEGNALYNNEGGRAFMYAYVTTKPGMKYVKWEH DSGVVTVELEPPCRFVIDTPTGPQIKYLYFVKNLNNLRRGAVLGYIGATVRLQAGKQTE FVSNSHLLTHCSFAVDPAAAYLDAVKQGAKPVGNCVKMLTNGSGSGQAITCTIDSNTTQ DTYGGASVCIYCRAHVAHPTMDGFCQYKGKWVQVPIGTNDPIRFCLENTVCKVCGCWLN HGCTCDRTAIQSFDNSYLNESGALVPLD" gene 20570..24091 /gene="S" /locus_tag="HCoV229Egp2" /db_xref="GeneID:918758" CDS 20570..24091 /gene="S" /locus_tag="HCoV229Egp2" /note="structural protein" /codon_start=1 /product="surface glycoprotein" /protein_id="NP_073551.1" /db_xref="GeneID:918758" /translation="MFVLLVAYALLHIAGCQTTNGLNTSYSVCNGCVGYSENVFAVESG GYIPSDFAFNNWFLLTNTSSVVDGVVRSFQPLLLNCLWSVSGLRFTTGFVYFNGTGRGD CKGFSSDVLSDVIRYNLNFEENLRRGTILFKTSYGVVVFYCTNNTLVSGDAHIPFGTVL GNFYCFVNTTIGNETTSAFVGALPKTVREFVISRTGHFYINGYRYFTLGNVEAVNFNVT TAETTDFCTVALASYADVLVNVSQTSIANIIYCNSVINRLRCDQLSFDVPDGFYSTSPI QSVELPVSIVSLPVYHKHTFIVLYVDFKPQSGGGKCFNCYPAGVNITLANFNETKGPLC VDTSHFTTKYVAVYANVGRWSASINTGNCPFSFGKVNNFVKFGSVCFSLKDIPGGCAMP IVANWAYSKYYTIGSLYVSWSDGDGITGVPQPVEGVSSFMNVTLDKCTKYNIYDVSGVG VIRVSNDTFLNGITYTSTSGNLLGFKDVTKGTIYSITPCNPPDQLVVYQQAVVGAMLSE NFTSYGFSNVVELPKFFYASNGTYNCTDAVLTYSSFGVCADGSIIAVQPRNVSYDSVSA IVTANLSIPSNWTTSVQVEYLQITSTPIVVDCSTYVCNGNVRCVELLKQYTSACKTIED ALRNSARLESADVSEMLTFDKKAFTLANVSSFGDYNLSSVIPSLPTSGSRVAGRSAIED ILFSKLVTSGLGTVDADYKKCTKGLSIADLACAQYYNGIMVLPGVADAERMAMYTGSLI GGIALGGLTSAVSIPFSLAIQARLNYVALQTDVLQENQKILAASFNKAMTNIVDAFTGV NDAITQTSQALQTVATALNKIQDVVNQQGNSLNHLTSQLRQNFQAISSSIQAIYDRLDT IQADQQVDRLITGRLAALNVFVSHTLTKYTEVRASRQLAQQKVNECVKSQSKRYGFCGN GTHIFSIVNAAPEGLVFLHTVLLPTQYKDVEAWSGLCVDGTNGYVLRQPNLALYKEGNY YRITSRIMFEPRIPTMADFVQIENCNVTFVNISRSELQTIVPEYIDVNKTLQELSYKLP NYTVPDLVVEQYNQTILNLTSEISTLENKSAELNYTVQKLQTLIDNINSTLVDLKWLNR VETYIKWPWWVWLCISVVLIFVVSMLLLCCCSTGCCGFFSCFASSIRGCCESTKLPYYD VEKIHIQ" gene 24091..24492 /gene="4a" /locus_tag="HCoV229Egp3" /db_xref="GeneID:918759" CDS 24091..24492 /gene="4a" /locus_tag="HCoV229Egp3" /codon_start=1 /product="4a protein" /protein_id="NP_073552.1" /db_xref="GeneID:918759" /translation="MALGLFTLQLVSAVNQSLSNAKVSAEVSRQVIQDVKDGTVTFNLL AYTLMSLFVVYFALFKARSHRGRAALIVFKILILFVYVPLLYWSQAYIYATLIAVILLG RFFHTAWHCWLYKTWDFIVFNVTTLCYAR" gene 24482..24748 /gene="4b" /locus_tag="HCoV229Egp4" /db_xref="GeneID:918760" CDS 24482..24748 /gene="4b" /locus_tag="HCoV229Egp4" /codon_start=1 /product="4b protein" /protein_id="NP_073553.1" /db_xref="GeneID:918760" /translation="MQGKCWFLENKALKPFVCFYGGDQFLYIGDRIVSYFSTNDLYVAL RGRIDKDLSLSRKVELYNGECVYLFCEHPAVGIVNTDFKLEIH" gene 24750..24983 /gene="E" /locus_tag="HCoV229Egp5" /db_xref="GeneID:918761" CDS 24750..24983 /gene="E" /locus_tag="HCoV229Egp5" /note="structural protein; E protein" /codon_start=1 /product="envelope protein" /protein_id="NP_073554.1" /db_xref="GeneID:918761" /translation="MFLKLVDDHALVVNVLLWCVVLIVILLVCITIIKLIKLCFTCHMF CNRTVYGPIKNVYHIYQSYMHIDPFPKRVIDF" gene 24995..25672 /gene="M" /locus_tag="HCoV229Egp6" /db_xref="GeneID:918762" CDS 24995..25672 /gene="M" /locus_tag="HCoV229Egp6" /note="structural protein" /codon_start=1 /product="membrane protein" /protein_id="NP_073555.1" /db_xref="GeneID:918762" /translation="MSNDNCTGDIVTHLKNWNFGWNVILTIFIVILQFGHYKYSRLFYG LKMLVLWLLWPLVLALSIFDTWANWDSNWAFVAFSFFMAVSTLVMWVMYFANSFRLFRR ARTFWAWNPEVNAITVTTVLGQTYYQPIQQAPTGITVTLLSGVLYVDGHRLASGVQVHN LPEYMTVAVPSTTIIYSRVGRSVNSQNSTGWVFYVRVKHGDFSAVSSPMSNMTENERLL HFF" gene 25686..26855 /gene="N" /locus_tag="HCoV229Egp7" /db_xref="GeneID:918763" CDS 25686..26855 /gene="N" /locus_tag="HCoV229Egp7" /note="structural protein" /codon_start=1 /product="nucleocapsid protein" /protein_id="NP_073556.1" /db_xref="GeneID:918763" /translation="MATVKWADASEPQRGRQGRIPYSLYSPLLVDSEQPWKVIPRNLVP INKKDKNKLIGYWNVQKRFRTRKGKRVDLSPKLHFYYLGTGPHKDAKFRERVEGVVWVA VDGAKTEPTGYGVRRKNSEPEIPHFNQKLPNGVTVVEEPDSRAPSRSQSRSQSRGRGES KPQSRNPSSDRNHNSQDDIMKAVAAALKSLGFDKPQEKDKKSAKTGTPKPSRNQSPASS QTSAKSLARSQSSETKEQKHEMQKPRWKRQPNDDVTSNVTQCFGPRDLDHNFGSAGVVA NGVKAKGYPQFAELVPSTAAMLFDSHIVSKESGNTVVLTFTTRVTVPKDHPHLGKFLEE LNAFTREMQQHPLLNPSALEFNPSQTSPATAEPVRDEVSIETDIIDEVN" ORIGIN 1 acttaagtac cttatctatc tacagataga aaagttgctt tttagacttt gtgtctactt 61 ttctcaacta aacgaaattt ttgctatggc cggcatcttt gatgctggag tcgtagtgta 121 attgaaattt catttgggtt gcaacagttt ggaagcaagt gctgtgtgtc ctagtctaag 181 ggtttcgtgt tccgtcacga gattccattc tacaaacgcc ttactcgagg ttccgtctcg 241 tgtttgtgtg gaagcaaagt tctgtctttg tggaaaccag taactgttcc taatggcctg 301 caaccgtgtg acacttgccg tagcaagtga ttctgaaatt tctgcaaatg gctgttctac 361 tattgcgcaa gccgtccgcc gttatagcga ggccgctagc aatggtttta gggcatgccg 421 atttgtttca ttagatttgc aggattgcat cgttggcatt gcagacgata catatgttat 481 gggtctgcat ggcaatcaga cgttgttttg caacataatg aaattttctg accgtccttt 541 tatgcttcat gggtggttgg ttttttccaa ttcaaattac cttttggagg aatttgatgt 601 tgtcttcggt aagagaggtg gtggtaatgt gacatacact gaccagtatc tctgtggcgc 661 cgatggcaaa cctgttatga gtgaagattt atggcagttt gttgaccatt tcggtgagaa 721 cgaagaaatt atcatcaatg gtcatactta cgtttgtgct tggcttacta agcgtaagcc 781 cttagattac aaacgtcaga acaaccttgc cattgaagag attgaatatg tgcatggtga 841 tgctttgcat acactacgca atggttctgt tcttgaaatg gctaaggaag tgaagacatc 901 tagtaaagtt gtgttaagcg atgctcttga caaactttac aaagtctttg gttctcctgt 961 tatgacaaat ggttccaaca tcctagaggc ctttactaaa cctgtgttta ttagtgcatt 1021 agttcaatgt acttgtggta ccaagtcttg gtctgttggt gattggaccg gttttaaatc 1081 ctcttgttgc aacgtgatca gtaataaact gtgtgttgtt cccggtaatg ttaaacctgg 1141 tgatgctgtg attaccactc agcaagctgg tgctggtatt aagtattttt gtggcatgac 1201 tcttaagttt gttgcaaata ttgaaggtgt ctctgtttgg agagtgattg ctcttcagag 1261 tgtggattgc tttgttgctt cttccacttt tgtagaagag gaacatgtta atagaatgga 1321 tacattctgc ttcaatgtac gcaatagtgt tactgatgag tgtcgtctgg ccatgttggg 1381 tgctgaaatg actagtaatg tcagaagaca agttgcttca ggtgtcatag acattagtac 1441 cggttggttt gatgtttatg atgacatctt tgctgaaagc aaaccatggt ttgttcgcaa 1501 ggctgaagac atttttggcc cttgttggtc cgctcttgct tctgcactta aacaacttaa 1561 agtcactaca ggtgaacttg tgagatttgt taagtctatt tgcaattcag ctgttgctgt 1621 cgtgggtggt actatacaaa ttctcgctag tgtgcctgag aagtttttga atgcgtttga 1681 cgtgtttgtc acagctattc aaactgtctt tgactgtgct gttgaaactt gtactattgc 1741 cggtaaagca tttgacaagg tttttgacta tgttttgctt gataatgcgc ttgtaaaact 1801 tgtcaccaca aagcttaagg gtgttcgtga acgtggcctt aataaagtta agtatgcaac 1861 agttgttgtt ggttccactg aagaagttaa atcttcacgt gttgaacgta gcactgctgt 1921 acttacaatc gccaacaatt attccaaact ttttgatgaa gggtatactg ttgtaattgg 1981 cgatgtggcg tactttgtta gtgacggcta cttccgtctt atggccagtc caaatagtgt 2041 gttgactact gcagtctata aaccattgtt tgcttttaat gtgaatgtta tgggtactag 2101 acctgaaaaa tttccaacca ctgtgacttg tgaaaattta gagtctgctg ttttgtttgt 2161 taatgacaaa attactgaat tccaattgga ttacagtatt gatgtcattg ataatgaaat 2221 aattgtcaaa cctaatatca gcctatgtgt tccactttat gtgagagact atgttgacaa 2281 atgggatgat ttttgcagac aatatagtaa cgagtcttgg tttgaggatg attacagggc 2341 ttttatcagt gttttggaca tcactgatgc tgctgtgaaa gctgcagagt ctaaagcttt 2401 cgttgatact attgttccac cttgcccatc tattttgaaa gttatagatg gaggcaaaat 2461 atggaatggt gttattaaaa atgttaactc tgttagagac tggcttaagt ctttgaagtt 2521 aaatctcaca caacagggtt tgcttggaac atgtgcaaag cgttttaaac gttggcttgg 2581 cattttgcta gaggcctata atgcgttttt agacactgtg gtttctactg ttaaaattgg 2641 tggcttgacc tttaaaacat atgcttttga taaaccttac attgtgatac gtgatatcgt 2701 gtgtaaggtt gaaaataaaa cagaagcaga atggattgag ctttttccac ataatgacag 2761 gattaagtct tttagtactt tcgagagtgc ttacatgcca attgcagacc ctacacattt 2821 tgacattgaa gaagttgaac ttttagatgc agagtttgta gaaccaggct gtggtggtat 2881 tttggcagta atagatgagc acgtctttta taagaaggat ggtgtttatt atccatcaaa 2941 tggtactaac attctacctg ttgcatttac aaaagccgct ggtggtaaag tttcattttc 3001 tgatgacgtt gaagtaaaag acattgaacc tgtttacaga gtcaagcttt gctttgagtt 3061 tgaagatgaa aaacttgtag atgtttgtga aaaggcaatt ggcaagaaaa ttaaacatga 3121 aggtgactgg gatagctttt gtaagactat tcaatcagca ctttctgttg tttcttgcta 3181 tgtaaatcta cctacttatt acatttatga tgaagaaggc ggtaatgact tgagtttgcc 3241 cgttatgatt tctgaatggc ctctttctgt tcaacaagct caacaagaag ctactttacc 3301 tgatattgct gaggatgttg ttgaccaagt tgaagaagtc aatagcattt ttgacattga 3361 gacagtggat gttaaacatg atgtgagtcc ttttgaaatg ccatttgaag agttaaatgg 3421 tttaaagata ctcaaacaat tggataacaa ctgctgggtt aactcagtta tgttacaaat 3481 acaattaact ggtatacttg atggtgacta tgctatgcag ttttttaaaa tgggccgagt 3541 tgccaagatg attgaacgct gctacactgc tgagcaatgt atacgtggtg ctatgggtga 3601 tgttggtttg tgtatgtata gactgcttaa agacttacac actggtttta tggttatgga 3661 ttataaatgt agttgtacca gtggtaggct tgaagaatcg ggagctgttt tgttttgtac 3721 gcccactaag aaggcgtttc cttatggtac ttgtctaaat tgtaacgcac ctcgcatgtg 3781 tacaattagg cagttacaag gtaccataat atttgtgcaa caaaaaccag aacctgttaa 3841 tcctgtttct tttgttgtta aaccagtctg ctcatcaatt tttcgtggtg ctgtgtcttg 3901 tggtcattac cagactaaca tctattcaca aaatttgtgt gtggatggtt ttggtgttaa 3961 caagattcag ccctggacaa atgatgcact taatactatt tgtattaagg atgcagatta 4021 taatgcaaaa gttgaaatat ctgttacacc aattaaaaat acagttgata caacacctaa 4081 ggaagaattt gttgttaaag agaagttgaa cgccttcctc gttcatgaca atgtagcttt 4141 ctaccaaggt gatgttgata ctgttgttaa tggtgttgac tttgacttta ttgtaaatgc 4201 tgctaatgag aaccttgctc atggtggagg acttgccaaa gctttagatg tgtacactaa 4261 aggtaaactt caacgtttat ctaaagaaca cattggatta gcgggtaaag taaaagttgg 4321 tacaggagtt atggttgagt gtgatagcct tagaattttt aatgttgttg gtccacgcaa 4381 gggtaaacat gaacgtgatt tactcataaa agcttacaac actattaata atgaacaagg 4441 cacaccttta acaccaattt tgagctgtgg tatttttggt atcaaactcg aaacttcatt 4501 agaagttttg cttgatgttt gtaatacaaa agaagttaaa gtttttgttt atacagacac 4561 agaggtttgt aaggttaagg attttgtgtc tggtttagtg aatgttcaaa aagttgagca 4621 acctaaaata gaaccaaaac cagtgtccgt aattaaagtt gcacccaagc cttacagggt 4681 agatggtaaa tttagttact ttacagaaga cttgttgtgt gtcgctgatg acaaacccat 4741 tgttttgttt actgactcta tgcttacttt ggatgaccgt ggtttagctc tagacaatgc 4801 acttagtggt gtgcttagtg ctgctattaa ggattgtgtt gacataaata aagctatacc 4861 ttctggtaat cttattaagt ttgatatagg ttctgttgtt gtctacatgt gtgttgtgcc 4921 atccgaaaag gacaaacatt tagataataa tgttcaacga tgcacacgta agttgaatag 4981 acttatgtgt gatatagttt gtactatacc agctgactac atcttgccat tggtgttgtc 5041 tagtttgact tgtaatgttt cttttgtagg tgaacttaaa gctgctgaag ctaaagttat 5101 aactataaag gtgacagagg atggtgttaa tgttcatgat gtgaccgtga caacagacaa 5161 gtcatttgaa caacaagttg gtgttattgc tgataaggac aaagatcttt ctggtgcagt 5221 accaagtgat cttaacacat ctgaattgct tactaaagca atagatgttg attgggtcga 5281 attttatggc tttaaagatg ctgttacttt tgcaacagtt gatcatagtg cttttgccta 5341 tgaaagtgct gttgttaatg gtattagagt gttaaaaact agtgataata attgttgggt 5401 gaatgctgtt tgtattgcac tacagtattc gaaaccccat tttatttcac aaggtcttga 5461 tgctgcgtgg aataaatttg ttttaggcga tgttgaaatt tttgttgcat ttgtttacta 5521 tgttgcaaga ctaatgaaag gtgacaaggg tgatgctgaa gacactttga ctaagttgtc 5581 taagtatctt gctaatgaag ctcaagttca attagaacat tatagttctt gtgttgaatg 5641 tgatgctaaa tttaaaaact ctgttgcatc tatcaattct gctatagttt gtgctagtgt 5701 caaacgtgat ggtgtgcaag ttggttattg tgtccatggt attaagtact attcacgtgt 5761 tagaagtgtt agaggtagag ctattatagt cagtgtcgaa cagcttgaac cgtgtgctca 5821 gtctagactt ttgagtggtg ttgcttatac tgctttttct ggacctgttg acaaaggtca 5881 ttatactgtt tatgatactg caaagaaatc aatgtatgat ggtgatcgtt ttgttaaaca 5941 tgatctttct ctgctgtctg tcacatcagt tgttatggtt ggtggttatg ttgcacctgt 6001 taatacagtg aaacctaaac cagtcattaa tcaacttgat gaaaaggcac agaagttctt 6061 tgattttggt gattttttga ttcataattt tgttattttt ttcacatggt tattgagtat 6121 gtttactttg tgtaaaactg cagtaactac aggtgatgtt aaaataatgg ccaaagcacc 6181 acaaaggacg ggtgttgttt taaaacgtag tcttaaatat aacttaaaag cgtcagcagc 6241 tgttcttaaa tctaagtggt ggctgcttgc taagtttacg aaactactgt tactcatata 6301 tacattgtac tcagtagttt tgctttgtgt acgttttgga ccgtttaatt tttgtagtga 6361 gactgttaat ggttatgcta agtcaaactt tgtcaaggat gattactgtg atggttcatt 6421 gggctgcaag atgtgtcttt ttggttacca agagttaagt caatttagcc atttggatgt 6481 tgtgtggaag catataacag accctttgtt tagtaatatg caacctttca ttgtcatggt 6541 tttgctgctt atatttggtg acaattattt gagatgcttc ttgctgtatt ttgttgctca 6601 gatgataagc acagttggtg tttttctagg ttacaaggaa acaaattggt tcttgcactt 6661 tattccattt gatgttattt gtgatgaact gcttgtcact gttattgtta ttaaggttat 6721 ttcttttgtc agacatgtgc tttttggttg tgaaaaccca gattgtattg cgtgttctaa 6781 gagtgctaga cttaagagat tccctgttaa cacaattgtc aatggtgtgc aacgttcatt 6841 ttatgttaat gcaaatggtg gtagtaagtt ttgtaagaaa catagatttt tctgtgttga 6901 ttgtgactct tatggttatg gcagcacgtt tataacaccc gaagtttcta gagaacttgg 6961 taacattacc aaaacaaatg tgcaaccaac agggccggcc tatgtcatga ttgacaaagt 7021 ggagtttgaa aatggttttt acagattgta ttcctgtgaa acattttggc gttacaactt 7081 tgatataact gaaagcaagt attcttgcaa agaggttttt aaaaattgta atgttttgga 7141 tgatttcatc gtgtttaaca ataatgggac caatgtaacg caggttaaaa atgctagtgt 7201 ttacttttca cagttgttgt gtaggcccat taaattagtt gacagtgaac ttttgtccac 7261 tttgtcagtt gattttaatg gtgtcttaca caaggcatac attgatgtac tacgtaatag 7321 ctttggtaaa gatcttaatg ctaatatgtc tttagccgag tgcaagagag ctttaggcct 7381 gtctattagt gatcatgaat ttactagtgc tatttctaat gcacatcgtt gtgacgtgtt 7441 gttatctgat ttgtcattta acaactttgt cagttcgtat gctaaacctg aggaaaaatt 7501 atcagcttat gacttggcgt gttgtatgcg tgcaggtgct aaggttgtta atgccaatgt 7561 tctgacaaag gaccaaactc ctattgtttg gcatgcaaag gattttaaca gtctttctgc 7621 tgaaggtcgc aagtatattg taaaaactag caaagctaag ggtttgactt tcttgttgac 7681 aattaatgaa aaccaagctg tcacgcaaat acctgcaact agcattgttg ctaagcaagg 7741 tgctggtgat gctggccatt cattaacatg gctgtggcta ctgtgtggtc ttgtgtgttt 7801 gattcaattc tacttgtgct ttttcatgcc ctattttatg tacgatatcg tgagtagttt 7861 tgagggttat gattttaagt atatagaaaa tggtcagttg aagaattttg aagcgccact 7921 taaatgcgtc agaaacgttt ttgaaaactt tgaggactgg cattatgcta agtttggctt 7981 cacaccttta aacaagcaaa gctgtcctat tgtagttgga gtttctgaaa ttgttaatac 8041 tgtcgctggc attccatcta atgtgtatct tgttggtaaa actttaattt ttacactaca 8101 agctgctttt ggtaatgctg gtgtttgtta tgacattttt ggagtcacaa cacctgaaaa 8161 gtgcattttt acttctgctt gtactagatt agaaggtttg ggtggtaaca atgtttattg 8221 ttataacaca gcgcttatgg aaggttcttt gccttacagt tcaatacaag ctaatgcata 8281 ttataaatat gacaatggca attttattaa gttgccagaa gttattgcac aaggctttgg 8341 ttttagaaca gtgcgtacta ttgccaccaa atactgccgc gtaggtgaat gtgttgaatc 8401 caatgcaggt gtgtgttttg gctttgacaa gtggtttgtt aacgatggac gtgttgccaa 8461 tggttacgtt tgtggtactg gtttgtggaa ccttgtattt aacatacttt ccatgttttc 8521 atcttcattc tctgttgctg caatgtcagg tcaaatttta cttaattgtg cattaggtgc 8581 ttttgctatt ttttgttgtt ttcttgtgac aaagtttaga cgcatgtttg gtgacctttc 8641 tgtaggtgtt tgcactgttg ttgtggctgt tttgcttaac aatgtctctt acattgtaac 8701 tcagaattta gtaacaatga ttgcttatgc catattgtat ttctttgcta ctagaagctt 8761 acgctatgca tggatttggt gtgctgcata tttaattgcg tatatttctt ttgctccatg 8821 gtggttgtgt gcttggtact ttcttgctat gttgacaggt ttgttaccta gtttgctgaa 8881 gcttaaagtt tcgacaaatc ttttcgaagg tgacaaattt gtaggtacat ttgaaagtgc 8941 tgctgcagga acatttgtca ttgacatgcg ttcttatgag aaacttgcta atagcatctc 9001 tccagaaaag ttgaaaagtt atgctgctag ctataataga tataagtact atagtggtaa 9061 tgcaaatgaa gctgattacc gttgcgcttg ttatgcctat ttagcaaaag caatgttgga 9121 cttttcgcgt gatcataatg acatcttgta cacacctccg actgtcagtt atggttctac 9181 attacaggct ggtttgcgca aaatggcaca accatctggc tttgtggaga aatgtgttgt 9241 ccgtgtctgc tatggaaaca ctgtgttgaa tgggttgtgg cttggtgata ttgtttattg 9301 cccacgtcat gttatcgcat ctaacacaac ttctgctata gattatgatc acgaatatag 9361 tattatgcgg ttgcataatt tttctataat atctggtaca gcatttcttg gtgttgtagg 9421 tgctactatg catggagtaa ctcttaaaat taaggtttca cagactaaca tgcacacacc 9481 tagacattct tttagaacac taaaatctgg tgaaggtttt aacatcttag catgctatga 9541 tggttgtgct caaggtgttt ttggtgtgaa catgagaact aattggacta tccgtggttc 9601 atttattaat ggtgcgtgtg gttcccctgg ctacaatctt aaaaatggcg aggtggaatt 9661 tgtttatatg catcaaattg aactcggaag tggtagccat gtaggttcta gctttgatgg 9721 tgttatgtat ggtggttttg aagaccaacc taatcttcaa gttgaatctg caaaccagat 9781 gttaacagtt aatgtggttg catttcttta tgctgctata ttgaatggtt gcacatggtg 9841 gcttaaaggt gaaaaattgt ttgtggagca ttataatgag tgggcacagg ctaatggttt 9901 cacagctatg aatggtgaag acgctttttc cattcttgct gctaaaactg gtgtctgtgt 9961 ggaaagatta cttcatgcta ttcaagtttt gaataatggc tttggtggta aacaaatttt 10021 gggttattct agtctcaatg atgagttcag tattaatgaa gttgtcaaac aaatgtttgg 10081 tgttaacctg caaagtggta aaaccactag tatgtttaaa tccataagct tatttgctgg 10141 cttctttgtc atgttctggg ctgaattatt tgtttatacc accactattt gggttaaccc 10201 tggttttctt actccgttta tgattttgct tgttgctttg tcactctgtc ttacatttgt 10261 tgttaaacat aaggttttgt ttttgcaagt gtttttgttg ccttcaatta ttgtggctgc 10321 tattcaaaac tgtgcttggg actaccatgt tacaaaggtg ttggcagaga agtttgatta 10381 taatgtttct gttatgcaaa tggacatcca gggttttgtt aacattttta tttgtctttt 10441 tgttgcactg ttgcatactt ggcgctttgc taaagagcgt tgtacacatt ggtgcactta 10501 tttgttctca ctcattgctg ttttatacac tgcattgtat agttatgact acgttagttt 10561 gctggttatg ctactttgtg caatttctaa tgaatggtat attggtgcta ttatttttag 10621 aatttgtcgt tttggtgttg catttttacc agtggaatac gtgtcttact ttgatggtgt 10681 taaaactgtg ctgttgtttt acatgttgtt aggctttgtt agctgtatgt actatggttt 10741 gttgtactgg attaacaggt tctgtaagtg cacattaggt gtttatgatt tctgtgttag 10801 tccagccgaa tttaagtata tggttgctaa tggtttgaat gcaccaaatg gcccttttga 10861 tgcgctcttt ctgtctttta aactaatggg tattggcggt cctagaacca ttaaagtttc 10921 tactgtacag tctaaattga ctgatcttaa gtgcacaaac gtcgttctaa tgggcatttt 10981 gtctaacatg aacatagctt ctaattcaaa ggagtgggca tattgtgttg aaatgcacaa 11041 taaaataaac ttgtgtgacg accctgaaac tgctcaagag ttattgctgg cgttgttggc 11101 ctttttcttg tctaagcata gtgattttgg tcttggtgat cttgtcgatt cttattttga 11161 gaacgactcc attttgcaaa gtgttgcatc ttcttttgtt ggtatgccat cttttgttgc 11221 atatgaaaca gcaagacaag agtatgaaaa tgctgttgca aatggttcct caccacaaat 11281 aatcaaacaa ttgaagaagg ctatgaatgt tgcaaaagct gagtttgaca gggaatcatc 11341 tgttcaaaag aaaattaaca gaatggctga acaagctgct gcagctatgt acaaagaagc 11401 acgtgctgtt aatagaaaat caaaagttgt tagtgccatg catagtttac tctttggcat 11461 gctccgacgt ttggacatgt ctagtgttga cactatcctt aatatggcac gtaatggtgt 11521 tgtccctctt tccgttatcc ctgctacttc tgcagccagg ctcgtcgtcg tagtaccaga 11581 tcatgattca tttgtgaaaa tgatggtaga tggttttgtg cactacgctg gtgttgtttg 11641 gacattacag gaagttaagg ataatgatgg taagaatgtg catcttaaag atgttacaaa 11701 ggaaaaccag gaaatacttg tttggcctct gattttgact tgtgaacgtg tcgttaaatt 11761 gcagaacaat gaaataatgc cgggcaagat gaaggtcaag gccaccaaag gtgaaggtga 11821 tggaggcatt actagtgaag gtaatgctct atacaacaat gaaggtggac gtgcattcat 11881 gtatgcatat gtgactacga agcctggcat gaagtatgtt aaatgggaac atgactctgg 11941 tgtggttaca gttgaattgg aaccaccttg cagatttgtt atagacacac ctactggacc 12001 ccaaattaag tatctttatt ttgttaagaa tcttaacaat ttaaggagag gtgctgtttt 12061 gggttacatt ggtgccactg tgagattgca agctggcaaa cagactgagt ttgtttcaaa 12121 ctcccattta ttaacacatt gttcttttgc tgttgaccca gctgcagcct atcttgatgc 12181 tgttaaacaa ggcgcaaaac ctgttggcaa ttgtgtaaag atgttgacta atggttctgg 12241 tagcggtcag gctattactt gtaccattga ttccaacact acgcaggaca catatggtgg 12301 cgcgtctgtt tgtatttatt gcagagcaca tgttgcacat ccaaccatgg acggtttttg 12361 tcagtacaaa ggcaagtggg tacaagtgcc tataggtaca aatgacccta taagattttg 12421 tcttgaaaat actgtttgta aagtttgtgg ttgttggctt aatcatggct gtacatgtga 12481 ccggactgct atccaaagtt ttgataacag ttatttaaac gagtccgggg ctctagtgcc 12541 gctcgactag agccctgtaa tggtacagac atagattact gtgtccgtgc atttgacgtt 12601 tacaataaag atgcgtcttt tatcggaaaa aatctgaagt ccaattgtgt gcgcttcaag 12661 aatgtagata aggatgacgc gttctatatt gttaaacgtt gcattaagtc agttatggac 12721 cacgagcagt ccatgtataa cttacttaaa ggctgtaatg ctgttgctaa gcatgatttc 12781 tttacttggc atgagggcag aaccatttat ggtaatgtta gtagacagga tcttactaaa 12841 tacaccatga tggatttgtg cttcgctctg cgtaactttg atgaaaaaga ctgtgaagtt 12901 tttaaggaga tattggttct tactggttgt tgtagtactg attactttga aatgaagaat 12961 tggtttgacc ccatagaaaa tgaggacata caccgtgtgt atgctgcttt aggtaaggta 13021 gttgcaaatg caatgcttaa gtgtgttgct ttttgcgacg aaatggtgct caaaggagtt 13081 gttggtgttt tgaccttaga caaccaagat cttaatggga atttctatga cttcggtgac 13141 tttgtattgt gtcctcctgg aatgggaata ccctactgca cgtcatacta ttcttatatg 13201 atgcctgtta tgggtatgac taattgttta gctagtgagt gctttatgaa aagtgacatc 13261 tttggtcaag acttcaaaac ttttgatttg ttgaaatatg atttcacaga acataaggag 13321 gttttgttta acaagtactt taagtattgg ggacaggatt atcatcctga ttgtgttgat 13381 tgccatgacg agatgtgtat tttgcattgt tcaaatttta acacactctt cgcaaccaca 13441 attccaaaca cggcttttgg acctctatgc agaaaagtgt ttattgatgg tgtacccgta 13501 gttgctactg ctggttacca ctttaaacaa ttaggacttg tgtggaacaa agatgttaac 13561 actcattcta ccagacttac tattactgaa ctcttacagt ttgtgacaga tccaacgctt 13621 atagttgcgt catcgcctgc cttggtggat aaacgcactg tttgtttttc tgtcgctgct 13681 ttgagtacag gattaacatc ccaaacagta aaacctggcc attttaataa ggagttttat 13741 gacttcttac gttctcaggg gtttttcgat gagggttcag aattaacatt gaagcatttc 13801 ttttttacac aaaagggtga tgctgcaatt aaagattttg attattatcg ttacaacaga 13861 cctactatgc tggatattgg acaagctcgc gtagcatatc aagtggcagc tcgctatttt 13921 gactgttacg agggtggctg tattacatct agagaggttg ttgttacaaa ccttaataaa 13981 agcgctggtt ggccccttaa taagtttggt aaagctggtt tatattatga gtctattagt 14041 tatgaggaac aagatgctat tttttcatta acaaagcgta atattctccc tactatgact 14101 cagttaaatc ttaaatacgc catatctggt aaggaacgcg cacgtacagt gggtggcgtc 14161 tctttattag ctactatgac tacaagacag tttcatcaga aatgtctgaa atccatagta 14221 gctaccagaa atgccaccgt tgttatcggc actaccaagt tttatggcgg gtgggataat 14281 atgttaaaga acctgatggc cgatgttgat gatcctaaat tgatgggatg ggactatcct 14341 aagtgtgata gagctatgcc ctcaatgatt cgtatgttgt cggctatgat cttaggttct 14401 aagcatgtca catgttgtac ggctagtgat aaattttata gacttagtaa tgagcttgct 14461 caagttttga ccgaggttgt ttattcaaat ggtgggtttt attttaaacc tggtggtaca 14521 acttctggtg atgcaactac agcctacgcc aattctgtct ttaatatatt tcaggctgta 14581 agttctaaca ttaattgcgt tttgagcgtt aactcgtcaa attgcaataa ttttaatgtt 14641 aagaagttac agagacaact ttatgataat tgctatagaa atagtaatgt tgatgaatct 14701 tttgtggatg acttttatgg ttatttgcaa aagcattttt ctatgatgat tctttctgat 14761 gatagtgttg tgtgctataa taaaacttat gctggacttg gttacattgc tgatattagt 14821 gcttttaaag ccactttgta ttatcagaat ggtgtgttta tgagtacagc taagtgttgg 14881 actgaggaag atctttctat aggacctcat gaattttgct cacagcacac tatgcagatt 14941 gtagatgaaa atggtaagta ttatctacca tatccagatc ctagccgtat tatttctgct 15001 ggtgtttttg tggatgacat cactaagact gatgctgtca ttcttttgga acgctatgtt 15061 tctctggcta tagatgccta cccattgtct aagcatccta aacctgagta caggaaggtg 15121 ttttacgcat tgttagactg ggtcaaacat ctcaacaaga ctcttaacga aggtgttttg 15181 gagtcttttt ctgttacact tttagatgaa catgagtcta agttttggga tgaaagcttt 15241 tatgctagta tgtatgagaa gtctacagta ttacaagctg ctggtctttg tgtagtatgt 15301 ggttctcaaa cagttctaag atgcggtgat tgtttacgca gaccgatgtt gtgcactaag 15361 tgcgcctatg atcatgtgtt tggcactgat cataagttca ttttagctat tacaccatat 15421 gtgtgtaaca catctggctg caatgtaaat gacgttacaa aactgtatct tggaggtttg 15481 aattattact gtgtagacca caaaccacat ctttcattcc cactgtgttc agctggtaat 15541 gtctttggtt tgtacaaaag ttctgctttg ggttccatgg acattgatgt ctttaacaaa 15601 ctttctacct ctgattggtc tgacattcgc gactacaagc ttgctaatga tgcaaaagag 15661 tcactaaggt tgtttgcagc tgaaacggtc aaggctaaag aggaaagtgt taagtcatca 15721 tacgcttatg ctaccctaaa ggagattgta ggtcctaagg aacttttgct cttatgggaa 15781 agtggaaaag ccaaaccacc gttaaaccgt aattctgttt ttacatgctt ccaaattaca 15841 aaagactcca agtttcaagt tggtgagttt gtgtttgaga aagtagatta cggttctgat 15901 acggttactt acaaatccac tgctactact aagttagtac caggtatgtt gtttattttg 15961 acttctcata atgttgctcc acttagagcg ccaacaatgg caaaccagga gaaatattct 16021 accatttaca agttgcaccc atcatttaat gttagtgatg cttatgcaaa tcttgtacct 16081 tattaccaac ttattggcaa acagcgtata accacaatac agggtcctcc tggtagtgga 16141 aaatcgcatt gttctattgg tattggtgtg tattaccctg gagcgaggat cgtgttcacc 16201 gcttgttctc acgctgctgt tgattcgctc tgtgcaaaag ctgtcacagc ctatagtgtt 16261 gataagtgta cacgtattat tcctgcacgt gccagagttg agtgttatag tggttttaaa 16321 cctaacaata atagtgcaca atacgtgttt agtactgtta atgcgttacc tgaagttaat 16381 gcagacattg ttgtcgtgga tgaggtgtct atgtgcacta actatgactt gtctgtgatt 16441 aaccagcgta tatcatataa acacattgta tatgttggtg atcctcaaca gcttccagct 16501 cctagagttc ttatctctaa aggtgttatg gaaccaattg actataatgt tgtgacacaa 16561 cgtatgtgtg ctataggacc cgatgtcttt ttacacaagt gttacagatg tcctgctgaa 16621 atagttaaca ctgtttcaga gcttgtttat gaaaacaagt ttgtacctgt caaagaagct 16681 agtaagcagt gcttcaaaat ctttgaacgc ggtagtgttc aggtagacaa tggctccagt 16741 ataaataggc gtcaacttga tgttgttaag cgatttatac ataaaaactc cacatggagc 16801 aaggctgtgt ttatctcacc ttacaatagt caaaattatg tagctgccag gcttttaggc 16861 ttacaaactc agacagtgga ttctgctcaa ggtagtgaat atgactatgt tatattcgca 16921 cagacatcag atactgctca tgcctgtaat gccaatcgtt ttaacgttgc cattactaga 16981 gcaaagaaag gtattttctg tattatgtct gacagaactt tgtttgatgc acttaagttc 17041 tttgaaatca ctatgacaga tttacagtct gaaagtagtt gtggtttgtt taaggattgt 17101 gcacgtaacc ctattgattt accaccaagt catgccacta cttatttgtc attgtctgat 17161 agatttaaga ctagtggtga cttggctgtt caaataggta acaacaatgt ttgtacctat 17221 gaacatgtga tttcatatat gggtttcagg tttgatgtta gcatgcctgg tagtcatagt 17281 ttgttctgta ctagagactt tgccatgcgt catgtcagag gttggttagg aatggatgtg 17341 gaaggtgcac atgtcacagg tgacaatgtt ggcactaatg tacctctaca agttggtttt 17401 tccaatggtg ttgattttgt agctcaacct gaaggttgtg ttctaacaaa cactggcagt 17461 gttgtaaaac ctgttcgtgc tcgtgcacca cctggagaac aattcactca cattgtacct 17521 ctgttacgca agggacaacc ttggagtgtg ttgagaaaac gtattgttca aatgatagca 17581 gattttcttg ctggctcatc tgatgtactg gtgtttgtac tttgggctgg cggtttagag 17641 ttgaccacta tgcgttattt tgttaagatt ggagctgtta aacattgcca atgtggtact 17701 gttgcaacat gctacaattc tgttagtaat gactattgtt gctttaaaca tgcattgggc 17761 tgtgactatg tttataatcc atatgtcata gatattcaac aatggggtta tgttggttca 17821 ctctccacta atcaccatgc aatttgtaat gttcatagaa atgagcatgt tgcttctggt 17881 gatgctatta tgactagatg tttggctgtg tatgactgct ttgttaagaa tgtggattgg 17941 tcaattacct accctatgat agctaatgaa aatgccataa acaagggcgg tcgcactgtg 18001 cagagtcata ttatgcgtgc tgctattaaa ttgtacaacc ctaaagcaat ccatgacatt 18061 ggtaatccta agggtattcg ttgtgctgta actgatgcca agtggtattg ttatgacaag 18121 aaccctatta attctaatgt gaaaacattg gagtatgatt acatgacaca tggccaaatg 18181 gatggcttgt gtttgttttg gaattgtaat gtggatatgt accctgaatt ctcaattgtt 18241 tgcaggtttg acacacgtac acgatctaca ttgaaccttg aaggtgtaaa tggtgggtca 18301 ttgtatgtca ataatcatgc atttcacact cctgcttatg ataaacgtgc tatggctaaa 18361 ttgaaaccag caccgttttt ctactatgac gacggttcat gtgaggttgt tcacgatcaa 18421 gttaactatg ttcctttgag agccactaat tgcattacca agtgtaatat tggtggtgct 18481 gtatgttcta agcacgctaa tctctataga gcatatgttg agtcatataa catttttact 18541 caagctggtt ttaatatttg ggttcctacc acgtttgatt gttataattt gtggcagaca 18601 ttcacagagg tcaatttaca aggtttagag aacattgctt ttaacgttgt taataaaggt 18661 tcatttgttg gtgctgatgg tgaattacca gtagccatta gtggtgataa agtgttcgta 18721 cgtgatggta acactgataa tttagtcttt gttaacaaaa catcactgcc tacaaacata 18781 gcatttgaac tttttgctaa gaggaaggtt ggtttaacac cacctctcag tattctcaaa 18841 aaccttggtg ttgtcgccac atataagttt gtcttgtggg attatgaagc tgagcgtccc 18901 ttgacaagct ttactaagtc tgtttgtggt tatacagact ttgcagagga tgtttgtact 18961 tgttacgata atagtataca aggttcatac gaacgtttta ctctgtcaac taatgctgtg 19021 ttattctctg ctactgctgt gaaaacaggt ggtaagagtt tgccggctat taaattgaat 19081 tttggaatgc ttaatggtaa tgcaattgct actgtcaaat cagaagatgg taacataaaa 19141 aatattaact ggtttgttta cgtacgcaaa gatggcaaac ctgttgatca ttatgatggt 19201 ttttataccc aaggtcgtaa tttacaagac tttttgcctc gcagcacaat ggaagaagac 19261 tttttgaaca tggatatagg cgtgtttatt caaaagtatg gtctagagga tttcaacttc 19321 gagcacgttg tgtatggtga tgtttcaaaa actactctag gcggtttaca cttgttgatt 19381 tcacaagtac gtctgagtaa aatgggcatc ttaaaggcag aggagtttgt ggcagcatct 19441 gacataacac tcaaatgttg tactgtgact tatcttaatg atcctagttc taagactgtt 19501 tgtacttaca tggatttgtt gttggatgat tttgtttctg tattgaagtc tttggatttg 19561 actgttgtat ccaaggttca tgaggtcata attgacaaca aaccatggag atggatgcta 19621 tggtgtaaag ataatgccgt tgctacattc tatcctcagt tgcagagtgc agaatggaaa 19681 tgcgggtatt ctatgcctgg tatttataag acacaacgta tgtgcttaga accatgtaat 19741 ttgtataatt atggtgcagg tttgaagttg cccagtggca ttatgttcaa tgttgttaaa 19801 tacactcaat tgtgtcaata ttttaacagt accacgttat gtgttcctca taatatgaga 19861 gtgttacact tgggtgctgg ctctgattat ggtgttgcac caggaactgc tgttcttaaa 19921 aggtggttgc cgcacgacgc aattgttgtt gacaacgatg ttgttgacta tgtgagtgac 19981 gctgatttta gtgttactgg tgattgtgca accgtttatt tggaagacaa gtttgacttg 20041 ttaatctctg atatgtacga tggtaggaca aaggcaattg atggtgaaaa tgtttcgaaa 20101 gaaggatttt tcacttacat caatggtttc atttgtgaaa aacttgccat cggaggttcg 20161 attgctatta aagtaacaga gtatagctgg aataagaaat tgtatgaact tgtacaaaga 20221 ttttcttttt ggactatgtt ttgcacttct gttaatacgt catcatcaga agcctttgtt 20281 gtcggaatta actatcttgg tgatttcgca caaggacctt ttatagatgg taacataata 20341 cacgcaaatt atgtattttg gcgtaactcc actgttatga gtttgtccta caactctgtt 20401 ttagacctga gtaaatttaa ttgcaaacac aaagcgactg ttgttgtgca attaaaggat 20461 agtgatatta atgaaatggt gcttagtctt gttaggagtg gtaagttgct tgtaaggggt 20521 aatggcaagt gtttgagttt tagtaatcat ttagtctcaa ctaaataaaa tgtttgtttt 20581 gcttgttgca tatgccttgt tgcatattgc tggttgtcaa actacaaatg ggctgaacac 20641 tagttactct gtttgcaacg gctgtgttgg ttattcagaa aatgtatttg ctgttgagag 20701 tggtggttat ataccctccg actttgcatt caataattgg ttccttctaa ctaatacctc 20761 atctgttgta gatggtgttg tgaggagttt tcagcctttg ttgcttaatt gcttatggtc 20821 tgtttctggc ttgcggttta ctactggttt tgtctatttt aatggtactg ggagaggtga 20881 ttgtaaaggt ttttcctcag atgttttgtc tgatgtcata cgttacaacc tcaattttga 20941 agaaaacctt agacgtggaa ccattttgtt taaaacatct tatggtgttg ttgtgtttta 21001 ttgtaccaac aacactttag tttcaggtga tgctcacata ccatttggta cagttttggg 21061 caatttttat tgctttgtaa atactactat tggcaatgaa actacgtctg cttttgtggg 21121 tgcactacct aagacagttc gtgagtttgt tatttcacgc acaggacatt tttatattaa 21181 tggctatcgc tatttcactt taggtaatgt agaagccgtt aatttcaatg tcactactgc 21241 agaaaccact gatttttgta ctgttgcgtt agcttcttat gctgacgttt tggttaatgt 21301 gtcacaaacc tctattgcta atataattta ttgcaactct gttattaaca gactgagatg 21361 tgaccagttg tcctttgatg taccagatgg tttttattct acaagcccta ttcaatccgt 21421 tgagctacct gtgtctattg tgtcgctacc tgtttatcat aaacatacgt ttattgtgtt 21481 gtacgttgac ttcaaacctc agagtggcgg tggcaagtgc tttaactgtt atcctgctgg 21541 tgttaatatt acactggcca attttaatga aactaaaggg cctttgtgtg ttgacacatc 21601 acacttcact accaaatacg ttgctgttta tgccaatgtt ggtaggtgga gtgctagtat 21661 taacacggga aattgccctt tttcttttgg caaagttaat aactttgtta aatttggcag 21721 tgtatgtttt tcgctaaagg atatacccgg tggttgcgca atgcctatag tggctaattg 21781 ggcttatagt aagtactata ctataggctc attgtatgtt tcttggagtg atggtgatgg 21841 aattactggc gtcccacaac ctgttgaggg tgttagttcc tttatgaatg ttacattgga 21901 caaatgtact aaatataata tttatgatgt atctggtgtg ggtgttattc gcgttagcaa 21961 tgacaccttt cttaatggaa ttacgtacac atcaacttca ggtaaccttc tgggttttaa 22021 agatgttact aagggcacca tctactctat cactccttgt aacccaccag atcagcttgt 22081 tgtttatcag caagctgttg ttggtgctat gttgtctgaa aattttacta gttacggctt 22141 ttctaatgtt gtagaactgc cgaaattttt ctatgcgtcc aatggcactt ataattgcac 22201 agacgctgtt ttaacttatt ctagttttgg cgtttgtgca gatggttcta taattgctgt 22261 tcaaccacgt aatgtttcat atgatagtgt ttcagctatc gtcacagcta atttgtctat 22321 accttccaat tggaccactt cggtccaggt tgagtattta caaattacaa gtacacctat 22381 cgtagttgat tgctccactt atgtttgcaa tggtaatgtg cgctgtgttg aattgcttaa 22441 gcagtatact tctgcttgta aaactattga agacgcctta agaaatagcg ccaggctgga 22501 gtctgcagat gttagtgaga tgctcacttt tgacaagaaa gcgtttacac ttgctaatgt 22561 tagtagtttt ggtgactaca accttagcag cgtcatacct agcttgccca caagtggtag 22621 tagagtggct ggtcgcagtg ccatagaaga catacttttt agcaaacttg ttacttctgg 22681 acttggcact gtggacgcag actacaaaaa gtgcactaag ggtctttcca ttgctgactt 22741 ggcttgtgct caatattata atggcattat ggttttgcct ggcgtcgctg atgctgaacg 22801 aatggccatg tatacaggtt ctttaattgg tggaattgct ttaggaggtc taacatcagc 22861 cgtttcaata ccattttcat tagcaattca ggcacgttta aattatgttg cattgcagac 22921 tgatgtttta caagaaaatc agaaaattct tgctgcatct tttaacaaag caatgaccaa 22981 catagtagat gcctttactg gtgttaatga tgctattaca caaacttcac aagccctaca 23041 aacagttgct actgcactta acaagatcca ggatgttgtt aatcaacaag gcaactcatt 23101 gaaccattta acttctcagt tgaggcagaa ttttcaagct atctctagct ctattcaggc 23161 tatctatgac agacttgaca ctattcaggc tgatcaacaa gtagataggc tgattactgg 23221 tagattggct gctttgaatg tattcgtttc tcatacattg actaagtaca ctgaagttcg 23281 tgcttccaga cagcttgcac aacaaaaagt gaatgagtgt gtcaaatccc agtctaagcg 23341 ttatggcttc tgtggaaatg gcactcacat tttctcaatt gttaatgctg ctcctgaggg 23401 gcttgttttt ctccacactg tcttgttgcc gacacaatat aaggatgttg aagcgtggtc 23461 tgggttgtgc gttgatggta caaacggtta tgtgttgcga caacctaatc ttgctcttta 23521 caaagaaggc aattattata gaatcacatc tcgcataatg tttgaaccac gtattcctac 23581 catggcagat tttgttcaaa ttgaaaattg caatgtcaca tttgttaaca tttctcgctc 23641 tgagttgcaa accattgtgc cagagtatat tgatgttaat aagacgctgc aagaattaag 23701 ttacaaattg ccaaattaca ctgttccaga cctagttgtc gaacagtaca accagactat 23761 tttgaatttg accagtgaaa ttagcaccct tgaaaataaa tctgcggagc ttaattacac 23821 tgttcaaaaa ttgcaaactc tgattgacaa cataaatagc acattagtcg acttaaagtg 23881 gctcaaccgg gttgagactt acatcaagtg gccgtggtgg gtgtggttgt gcatttcagt 23941 cgtgctcatc tttgtggtga gtatgttgct attatgttgt tgttctactg gttgctgtgg 24001 cttctttagt tgttttgcat cttctattag aggttgttgt gaatcaacta aacttcctta 24061 ttacgacgtt gaaaagatcc acatacagta atggctctag gtttgttcac attgcaactt 24121 gtgtctgctg ttaatcaatc gcttagcaat gcgaaagtta gtgctgaagt ttcacgacag 24181 gttatccaag acgtgaaaga tggcactgtt accttcaact tgctagcgta tacactaatg 24241 agcctctttg ttgtgtattt tgctttattt aaagcaagat cacaccgtgg cagagctgct 24301 cttatagtgt ttaaaattct aatccttttc gtttatgtgc cattgctgta ttggtctcaa 24361 gcatatattt acgcaacttt gattgctgta attttgcttg gaagattttt ccatacagct 24421 tggcactgct ggctctacaa gacatgggat ttcattgtct tcaatgtaac cacactttgc 24481 tatgcaaggt aagtgttggt ttcttgaaaa taaggctctg aaaccattcg tttgttttta 24541 cggaggggat caattccttt acataggcga cagaattgtt tcttatttct caactaacga 24601 cttgtacgtt gctcttagag gacgtattga taaagacctc agcctttcta gaaaggttga 24661 gttatataac ggtgaatgtg tatacttgtt ttgtgaacac ccagctgttg gaatagtcaa 24721 cacagatttc aaattagaaa tccactaaga tgttccttaa gctagtggat gatcatgctt 24781 tggttgttaa tgtactactc tggtgtgtgg tgcttatagt gatactacta gtgtgtatta 24841 caataattaa actaattaag ctttgtttca cttgccatat gttttgtaat agaacagttt 24901 atggccccat taaaaatgtg taccacattt accaatcata tatgcacata gaccctttcc 24961 ctaaacgagt tattgatttc taaactaaac gacaatgtca aatgacaatt gtacgggtga 25021 cattgtcacc catttgaaga attggaattt tggttggaat gttattctaa ccatattcat 25081 tgttattctt cagtttggac actataaata ctccagattg ttttatggtt tgaagatgct 25141 tgtactgtgg cttctttggc cactcgtact tgctttgtca atctttgaca cctgggctaa 25201 ttgggattct aattgggcct ttgttgcatt tagctttttt atggccgtat caacactcgt 25261 tatgtgggtg atgtacttcg caaacagttt cagacttttc cgacgtgctc gaactttttg 25321 ggcatggaat cctgaggtta atgcaatcac tgtcacaacc gtgttgggac agacatacta 25381 tcaacccatt caacaagctc caacaggcat tactgtgacc ttgctgagcg gcgtgcttta 25441 cgttgacgga catagattgg cttcaggtgt tcaggttcat aacctacctg aatacatgac 25501 agttgccgtg ccgagcacta ctataattta tagtagagtc ggaaggtccg taaattcaca 25561 aaatagcaca ggctgggttt tctacgtacg agtaaaacac ggtgattttt ctgcagtgag 25621 ctctcccatg agcaacatga cagaaaacga aagattgctt cattttttct aaactgaacg 25681 aaaagatggc tacagtcaaa tgggctgatg catctgaacc acaacgtggt cgtcagggta 25741 gaatacctta ttctctttat agccctttgc ttgttgatag tgaacaacct tggaaggtga 25801 tacctcgtaa tttggtaccc atcaacaaga aagacaaaaa taagcttata ggctattgga 25861 atgttcaaaa acgtttcaga actagaaagg gcaaacgggt ggatttgtca cccaagctgc 25921 atttttatta tcttggcaca ggaccccata aagatgcaaa atttagagag cgtgttgaag 25981 gtgtcgtctg ggttgctgtt gatggtgcta aaactgaacc tacaggttac ggtgttaggc 26041 gcaagaattc agaaccagag ataccacact tcaatcaaaa gctcccaaat ggtgttactg 26101 ttgttgaaga acctgactcc cgtgctcctt cccggtctca gtcgaggtcg cagagtcgcg 26161 gtcgtggtga atccaaacct caatctcgga atccttcaag tgacagaaac cataacagtc 26221 aggatgacat catgaaggca gttgctgcgg ctcttaaatc tttaggtttt gacaagcctc 26281 aggaaaaaga taaaaagtca gcgaaaacgg gtactcctaa gccttctcgt aatcagagtc 26341 ctgcttcttc tcaaacttct gccaagagtc ttgctcgttc tcagagttct gaaacaaaag 26401 aacaaaagca tgaaatgcaa aagccacggt ggaaaagaca gcctaatgat gatgtgacat 26461 ctaatgtcac acaatgtttt ggccccagag accttgacca caactttgga agtgcaggtg 26521 ttgtggccaa tggtgttaaa gctaaaggct atccacaatt tgctgagctt gtgccgtcaa 26581 cagctgctat gctgtttgat agtcacattg tttccaaaga gtcaggcaac actgtggtct 26641 tgactttcac tactagagtg actgtgccca aagaccatcc acacttgggt aagtttcttg 26701 aggagttaaa tgcattcact agagaaatgc aacaacatcc tcttcttaac cctagtgcac 26761 tagaattcaa cccatctcaa acttcacctg caactgctga accagtgcgt gatgaagttt 26821 ctattgaaac tgacataatt gatgaagtaa actaaacatg ccactgtgtt gtttgaaatt 26881 caggctttag ttggaatttt gcttttgttc tttcttttat tatctttctt ttgcctgttt 26941 ttagagagat ttggcgcctt ggtgccgtag atgaatacat tgcttttctc tgatctatgt 27001 atgatggtac gatcagagct gcttttaatt aacatgatcc cttgctttgg cttgacaagg 27061 atctagtctt atacacaatg gtaagccagt ggtagtaaag gtataagaaa tttgctacta 27121 tgttactgaa cctaggtgaa cgctagtata actcattaca aatgtgctgg agtaatcaaa 27181 gatcgcattg acgagccaac aatggaagag ccagtcattt gtcttgagac ctatctagtt 27241 agtaactgct aatggaacgg tttcgatatg gatacacaaa aaaaaaaaaa aaaaaaaaaa 27301 aaaaaaaaaa aaaaaaa // LOCUS NC_001803 15191 bp RNA linear VRL 13-AUG-2018 DEFINITION Respiratory syncytial virus, complete genome. ACCESSION NC_001803 VERSION NC_001803.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Respiratory syncytial virus ORGANISM Respiratory syncytial virus Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; Mononegavirales; Pneumoviridae; unclassified Pneumoviridae. REFERENCE 1 (bases 1 to 15191) AUTHORS Tolley,K.P., Marriott,A.C., Simpson,A., Plows,D.J., Matthews,D.A., Longhurst,S.J., Evans,J.E., Johnson,J.L., Cane,P.A., Easton,A.J. and Pringle,C.R. TITLE Identification of mutations contributing to the reduced virulence of a modified strain of respiratory syncytial virus JOURNAL Vaccine 14 (17-18), 1637-1646 (1996) PUBMED 9032893 REFERENCE 2 (bases 1 to 15191) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (01-AUG-2000) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 15191) AUTHORS Easton,A.J. TITLE Direct Submission JOURNAL Submitted (30-OCT-1995) Andrew J. Easton, Biological Sciences, University of Warwick, Coventry CV4 7AL, UK COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to U39661. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..15191 /organism="Respiratory syncytial virus" /mol_type="genomic RNA" /strain="S2 ts1C" /db_xref="taxon:12814" gene 1..44 /locus_tag="Rsvgs01" /db_xref="GeneID:1724895" misc_RNA 1..44 /locus_tag="Rsvgs01" /note="leader" /db_xref="GeneID:1724895" gene 45..576 /gene="NS1" /locus_tag="Rsvgp01" /db_xref="GeneID:1494468" mRNA 45..576 /gene="NS1" /locus_tag="Rsvgp01" /db_xref="GeneID:1494468" CDS 99..518 /gene="NS1" /locus_tag="Rsvgp01" /codon_start=1 /product="non-structural protein 1 (1C)" /protein_id="NP_044589.1" /db_xref="GeneID:1494468" /translation="MGSNSLSMIKVRLQNLFDNDEVALLKITCYTDKLIHLTNALAKAV IHTIKLNGIVFVHVITSSDICPNNNIVVKSNFTTMPVLQNGGYIWEMMELTHCSQPNGL IDDNCEIKFSKKLSDSTMTNYMNQLSELLGFDLNP" gene 596..1097 /gene="NS2" /locus_tag="Rsvgp02" /db_xref="GeneID:1494469" mRNA 596..1097 /gene="NS2" /locus_tag="Rsvgp02" /db_xref="GeneID:1494469" CDS 628..1002 /gene="NS2" /locus_tag="Rsvgp02" /codon_start=1 /product="non-structural protein 2 (1B)" /protein_id="NP_044590.1" /db_xref="GeneID:1494469" /translation="MDTTHNDTTPQRLMITDMRPLSLETIIISLTRDIITHRFIYLINH ECIVRKLDERQATFTFLVNYEMKLLHKVGSTKYKKYTEYNTKYGTFPMPIFINHDGFLE CIGIKPTKHTPIIYKYDLNP" gene 1125..2329 /gene="N" /locus_tag="Rsvgp03" /db_xref="GeneID:1494470" mRNA 1125..2329 /gene="N" /locus_tag="Rsvgp03" /db_xref="GeneID:1494470" CDS 1140..2315 /gene="N" /locus_tag="Rsvgp03" /codon_start=1 /product="Nucleoprotein (N)" /protein_id="NP_044591.1" /db_xref="GeneID:1494470" /translation="MALSKVKLNDTLNKDQLLSSSKYTIQRSTGDSIDTPNYDVQKHIN KLCGMLLITEDANHKFTGLIGMLYAMSRLGREDTIKILRDAGYHVKANGVDVTTHRQDI NGKEMKFEVLTLSSLTTEIQINIEIESRKSYKKMLKEMGEVAPEYRHDSPDCGMIILCI AALVITKLAAGDRSGLTAVIRRANNVLKNEMKRYKGLLPKDIANSFYEVFEKYPHFIDV FVHFGIAQSSTRGGSRVEGIFAGLFMNAYGAGQVMLRWGVLAKSVKNIMLGHASVQAEM EQVVEVYEYAQKLGGEAGFYHILNNPKASLLSLTQFPHFSSVVLGNAAGLGIMGEYRGT PRNQDLYDAAKAYAEQLKENGVINYSVLDLTAEELEAIKHQLNPKDNDVEL" gene 2331..3220 /gene="P" /locus_tag="Rsvgp04" /db_xref="GeneID:1494471" mRNA 2331..3220 /gene="P" /locus_tag="Rsvgp04" /db_xref="GeneID:1494471" CDS 2348..3073 /gene="P" /locus_tag="Rsvgp04" /note="phosphoprotein" /codon_start=1 /product="Phosphoprotein (P)" /protein_id="NP_044592.1" /db_xref="GeneID:1494471" /translation="MEKFAPEFHGEDANNRATKFLESIKGKFTSPKDPKKKDSIISVNS IDIEVTKESPITSNSTIINPTNETDDTVGNKPNYQRKPLVSFKEDPTPSDNPFSKLYKE TIETFDNNEEESSYSYEEINDQTNDNITARLDRIDEKLSEILGMLHTLVVASAGPTSAR DGIRDAMVGLREDMIEKIRTEALMTNDRLEAMARLRNEESEKMAKDTSDEVSLNPTSEK LNNLLEGNDSDNDLSLDDF" gene 3224..4180 /gene="M" /locus_tag="Rsvgp05" /db_xref="GeneID:1494472" mRNA 3224..4180 /gene="M" /locus_tag="Rsvgp05" /db_xref="GeneID:1494472" CDS 3233..4003 /gene="M" /locus_tag="Rsvgp05" /note="matrix" /codon_start=1 /product="Matrix protein (M)" /protein_id="NP_044593.1" /db_xref="GeneID:1494472" /translation="METYVNKLHEGSTYTAAVQYNVLEKDDDPASLTIWVPMFQSSMPA DLLIKELANVNILVKQISTPNGPSLRVMINSRSAVLAQMPSKFTICANVSLDERSKLAY DVTTPCEIKACSLTCLKSKNMLTTVKDLTMKTLNPTHDIIALCEFENIVTSKKVIIPTY LRSISVRNKDLNTLENITTTEFKNAITNAKIIPYSGLLLVITVTDNKGAFKYIKPQSQF IVDLGAYLEKESIYYVTTNWKHTATRFAIKPMED" gene 4190..4599 /gene="SH" /locus_tag="Rsvgp06" /db_xref="GeneID:1494473" mRNA 4190..4599 /gene="SH" /locus_tag="Rsvgp06" /db_xref="GeneID:1494473" CDS 4274..4468 /gene="SH" /locus_tag="Rsvgp06" /codon_start=1 /product="Small hydrophobic protein (SH)" /protein_id="NP_044594.1" /db_xref="GeneID:1494473" /translation="MENTSITIEFSSKFWPYFTLIHMITTIISLLIIISIMIAILNKLC EYNVFHNKTFELPRARVNT" gene 4644..5565 /gene="G" /locus_tag="Rsvgp07" /db_xref="GeneID:1494474" mRNA 4644..5565 /gene="G" /locus_tag="Rsvgp07" /db_xref="GeneID:1494474" CDS 4659..5555 /gene="G" /locus_tag="Rsvgp07" /codon_start=1 /product="Attachment glycoprotein (G)" /protein_id="NP_044595.1" /db_xref="GeneID:1494474" /translation="MSKNKDQRTTKTLEKTWDTLNHLLFISSCLYKLNLKSIAQITLSI LAMIISTSLIIAAIIFIASANHKVTLTTAIIQDATSQIKNTTPTYLTQNPQLGISFSNL SETTSQTTTILASTTPSVKSTLQSTTVKTKNTTTTKIQPSKPTTKQRQNKPPNKPNNDF HFEVFNFVPCSICSNNPTCWAICKRIPNKKPGKKTTTKPTKKPTIKTTKKDLKPQTTKP KEVPTTKPTEKPTINTTKTNIRTTLLTNNTTGNPEHTSQKGTLHSTSSDGNPSPSQVYT TSEYLSQPPSPSNTTNQ" gene 5619..7521 /gene="F" /locus_tag="Rsvgp08" /db_xref="GeneID:1494475" mRNA 5619..7521 /gene="F" /locus_tag="Rsvgp08" /db_xref="GeneID:1494475" CDS 5632..7356 /gene="F" /locus_tag="Rsvgp08" /codon_start=1 /product="Fusion protein (F)" /protein_id="NP_044596.1" /db_xref="GeneID:1494475" /translation="MELPILKTNAITAILAAVTLCFASSQNITEEFYQTTCSAVSKGYL SALRTGWYTSVITIELSNIKENKCNGTDAKVKLIKQELDKYKSAVTELQLLMQSTPATN NRARRELPRFMNYTLNNTKNTNVTLSKKRKRRFLGFLLGVGSAIASGIAVSKVLHLEGE VNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQSCSISNIETVIE FQQKNNRLLEITREFSVNAGVTTPVSTYMLTNSELLSLINDMPITNDQKKLMSNNVQIV RQQSYSIMSIIKEEVLAYVVQLPLYGVIDTPCWKLHTSPLCTTNTKEGSNICLTRTDRG WYCDNAGSVSFFPLAETCKVQSNRVFCDTMNSLTLPSEVNLCNIDIFNPKYDCKIMTSK TDVSSSVITSLGAIVSCYGKTKCTASNKNRGIIKTFSNGCDYVSNKGVDTVSVGNTLYY VNKQEGKSLYVKGEPIINFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN AGKSTINIMITTIIIVIIVILLSLIAVGLLLYCKARSTPVTLSKDQLSGINNIAFSN" gene 7567..8527 /gene="M2" /locus_tag="Rsvgp09" /db_xref="GeneID:1494476" mRNA 7567..8527 /gene="M2" /locus_tag="Rsvgp09" /db_xref="GeneID:1494476" CDS 7576..8160 /gene="M2" /locus_tag="Rsvgp09" /codon_start=1 /product="matrix (M2/22K)" /protein_id="NP_044597.1" /db_xref="GeneID:1494476" /translation="MSRRNPCKFEIRGHCLNGKRCHFSHNYFEWPPHALLVRQNFMLNR ILKSMDKSIDTLSEISGAAELDRTEEYALGVVGVLESYIGSINNITKQSACVAMSKLLT ELNSDDIKKLRDNEEPNSPKIRVYNTVISYIESNRKNNKQTIHLLKRLPADVLKKTIKT TLDIHKSITINNPKESTVSDINDHAKNNDTT" gene 8460..15037 /gene="L" /locus_tag="Rsvgp10" /db_xref="GeneID:1494467" mRNA 8460..15037 /gene="L" /locus_tag="Rsvgp10" /db_xref="GeneID:1494467" misc_RNA 8460..8527 /gene="M2" /locus_tag="Rsvgp09" /note="overlap" /db_xref="GeneID:1494476" CDS 8468..14965 /gene="L" /locus_tag="Rsvgp10" /codon_start=1 /product="Polymerase (L)" /protein_id="NP_044598.1" /db_xref="GeneID:1494467" /translation="MDPIINGNSANVYLTDSYLKGVISFSECNALGSYIFNGPYLKNDY TNLISRQNPLIEHINLKKLNITQSLMSKYHKGEIKIEEPTYFQSLLMTYKSMTSLEQIT TTNLLKKIIRRAIEISDVKVYAILNKLGLKEKDKIKSNNGQDEDNSVITTIIKDDILLA VKDNQSHLKAVKNHSTKQKDTIKTTLLKKLMCSMQHPPSWLIHWFNLYTKLNNILTQYR SSEVKNHGFILIDNHTLNGFQFILNQYGCIVYHKELKRITVTTYNQFLTWKNISLSRLN VCLITWISNCLNTLNKSLGLRCGFNNVILTQLFLYGDCILKLFHNEGFYIIKEVEGFIM SLILNITEEDQFRKRFYNSMLNNITDAANKAQKNLLSRVCHTLLDKTVSDNIINGRWII LLSKFLKLIKLAGDNNLNNLSELYFLFRIFGHPMVDERQAMDAVKVNCNETKFYLLSSL SMLRGAFIYRIIKGFVNNYNRWPTLRNAIVLPLRWLTYYKLNTYPSLLELTERDLIVLS GLRFYREFRLPKKVDLEMIINDKAISPPKNLIWTSFPRNYMPSHIQNYIEHEKLKFSES DKSRRVLEYYLRDNKFNECDLYNCVVNQSYLNNPNHVVSLTGKERELSVGRMFAMQPGM FRQVQILAEKMIAENILQFFPESLTRYGDLELQKILELKAGISNKSNRYNDNYNNYISK CSIITDLSKFNQAFRYETSCICSDVLDELHGVQSLFFWLHLAIPHVTIICTYRHAPPYI RDHIVDLNNVDEQSGLYRYHMGGIEGWCQKLWTIEAISLLDLISLKGKFSITALINGDN QSIDISKPVRLMEGQTHAQADYLLALNSLKLLYKEYAGIGHKLKGTETYISRDMQFMSK TIQHNGVYYPASIKKVLRVGPWINTILDDFKVSLESIGSLTQELEYRGESLLCSLIFRN VWLYNQIALQLKNHALCNNKLYLDILKVLKHLKTFFNLDNIDTALTLYMNLPMLFGGGD PNLLYRSFYRRTPDFLTEAIVHSVFILSYYTNHDLKDKLQDLSDDRLNKFLTCIITFDK NPNAEFVTLMRDPQALGSERQAKITSEINRLAVTEVLSTAPNKIFSKSAQHYTTTEIDL NDIMQNIEPTYPHGLRVVYESLPFYKAEKIVNLISGTKSITNILEKTSAIDLTDIDRAT EMMRKNITLLIRIFPLDCNRDKREILSMENLSITELSKYVRERSWSLSNIVGVTSPSIM YTMDIKYTTSTIASGIIIEKYNVNSLTRGERGPTKPWVGSSTQEKKTMPVYNRQVLTKK QRDQIDLLAKLDWVYASIDNKDEFMEELSIGILGLTYEKAKKLFPQYLSVNYLHRLTVS SRPCEFPASIPAYRTTNYHFDTSPINRILTEKYGDEDIDIVFQNCISFGLSLMSVVEQF TNVCPNRIILIPKLNEIHLMKPPIFTGDVDIHKLKQVIQKQHMFLPDKISLTQYVELFL SNKTLKSGSHVNSNLILAHKISDYFHNTYILSTNLAGHWILIIQLMKDSKGIFEKDWGE GYITDHMFINLKVFFNAYKTYLLCFHKGYGRAKLECDMNTSDLLCVLELIDSSYWKSMS KVFLEQKVIKYILSQDASLHRVKGCHSFKLWFLKRLNVAEFTVCPWVVNIDYHPTHMKA ILTYIDLVRMGLINIDKIYIKNKHKFNDEFYTSNLFYINYNFSDNTHLLTKHIRIANSE LENNYNKLYHPTPETLENILTNPVKCNDKKTLNDYCIGKNVDSIMLPLLSNKKLIKSPT MIRTNYSKQDLYNLFPTVVIDKIIDHSGNTAKSNQLYTTTSHQIPLVHNSTSLYCMLPW HHINRFNFVFSSTGCKISIEYILKDLIIKDPNCIAFIGEGAGNLLLRTVVELHPDIRYI YRSLKDCNDHSLPIEFLRLYNGHINIDYGENLTIPATDATNNIHWSYLHIKFAEPISLF VCDAELPVTVNWSKIIIEWSKHVRKCKYCSSVNKCTLIVKYHAQDDIDFKLDNITILKT YVCLGSKLKGSEVYLVLTIGPANVFPVFNVVQNAKLILSRTKNFIMPKKADKESIDANI KSLIPFLCYPITKKGINTALSKLKSVVSGDILSYSIAGRNEVFSNKLINHKHMNILKWF NHVLNFRSTELNYNHLYMVESTYPYLSELLNSLTTNELKKLIKITGSLLYNFHNE" gene 15038..15191 /locus_tag="Rsvgs02" /db_xref="GeneID:1724894" misc_RNA 15038..15191 /locus_tag="Rsvgs02" /note="trailer" /db_xref="GeneID:1724894" ORIGIN 1 acgcgaaaaa atgcgtacaa caaacttgcg taaaccaaaa aaatggggca aataagaatt 61 tgataagtac cacttaaatt taactccctt ggttagagat gggcagcaat tcattgagta 121 tgataaaagt tagattacaa aatttgtttg acaatgatga agtagcattg ttaaaaataa 181 catgctatac tgacaaatta atacatttaa ctaatgcatt ggctaaggca gtgatacata 241 caatcaaatt gaatggcatt gtatttgtgc atgttattac aagtagtgat atttgcccta 301 ataataatat tgtagtgaaa tccaatttca caacaatgcc agtgttacaa aatggaggtt 361 atatatggga aatgatggaa ttaacacact gctctcaacc taatggccta atagatgaca 421 attgtgaaat taaattctcc aaaaaactaa gtgattcaac aatgaccaat tatatgaatc 481 aattatctga attacttgga tttgatctta atccataaat tataataaat atcaactagc 541 aaatcaatgt cactaacacc attagttaat ataaaacttg acagaagata aaaatggggc 601 aaataaatca attcagccga cccaaccatg gacacaacac acaatgacac cacaccacaa 661 agactgatga tcacagacat gagaccattg tcacttgaga ctataataat atcactaacc 721 agagacatca taacacacag atttatatac ttgataaatc atgaatgtat agtgagaaaa 781 cttgatgaaa gacaggccac atttacattc ctggtcaact atgaaatgaa actattgcac 841 aaagtgggaa gcactaaata caaaaaatat actgaataca acacaaaata tggcactttt 901 cctatgccaa tatttatcaa tcatgatggg ttcttagaat gcattggcat taagcctaca 961 aagcacactc ccataatata caagtatgat ctcaatccat gaatttcaac acaagagtca 1021 cacaatctga aataacaact tcatgcataa ccacactcca tagttcaaat ggagcctgaa 1081 aattatagta atttaaaatt aaggagagac ataagatgaa agatggggca aatacaaaaa 1141 tggctcttag caaagtcaag ttgaatgata cactcaacaa agatcaactt ctgtcatcca 1201 gcaaatacac catccaacgg agcacaggag atagtattga tactcctaat tatgatgtgc 1261 agaaacacat caataagtta tgtggcatgt tattaatcac agaagatgct aatcataaat 1321 tcactgggtt aataggtatg ttatatgcta tgtctagatt aggaagagaa gacaccataa 1381 aaatactcag agatgcggga tatcatgtaa aagcaaatgg agtggatgta acaacacatc 1441 gtcaagatat taatgggaaa gaaatgaaat ttgaagtgtt aacattgtca agcttaacaa 1501 ctgaaattca aatcaacatt gagatagaat ctagaaaatc ctacaaaaaa atgctaaaag 1561 aaatgggaga ggtagctcca gaatacaggc atgactctcc tgattgtggg atgataatat 1621 tatgtatagc ggcattagta ataaccaaat tagcagcagg ggatagatct ggtcttacag 1681 ctgtgattag gagggctaat aatgtcctaa aaaatgaaat gaaacgttat aaaggcttac 1741 tacccaagga tatagccaac agcttctatg aagtgtttga aaaatatcct cactttatag 1801 atgtttttgt tcattttggt atagcacaat cttctaccag aggtggcagt agagttgaag 1861 ggatttttgc tggattgttt atgaatgcct atggtgcagg gcaagtgatg ttacggtggg 1921 gggtcttagc aaaatcagtt aaaaatatta tgctaggaca cgctagtgtg caagcagaaa 1981 tggaacaagt tgtggaggtt tatgaatatg cccaaaaatt gggtggagaa gcagggttct 2041 accatatatt gaacaaccca aaagcatcat tattgtcttt gactcaattt cctcacttct 2101 ccagtgtagt attaggcaat gctgctggcc taggcataat gggagaatac agaggtacac 2161 caaggaatca agatctatat gatgctgcaa aagcatatgc tgaacaactc aaagaaaatg 2221 gtgtgattaa ctacagtgta ttagacttga cagcagaaga actagaggct atcaaacatc 2281 agcttaatcc aaaagataat gatgtagagc tttgagttaa taaaaaaaat ggggcaaata 2341 aaacatcatg gaaaagtttg ctcctgaatt ccatggagaa gatgcaaaca acagagctac 2401 caaattccta gaatcaataa agggcaaatt cacatcacct aaagatccca agaaaaaaga 2461 tagtatcata tctgtcaact caatagatat agaagtaacc aaagaaagcc ctataacatc 2521 aaattcaacc attataaacc caacaaatga gacagatgat actgtaggga acaagcccaa 2581 ttatcaaaga aaacctctag taagtttcaa agaagaccct acgccaagtg ataatccctt 2641 ttcaaaacta tacaaagaaa ccatagaaac atttgataac aatgaagaag aatctagcta 2701 ttcatatgaa gaaataaatg atcagacaaa cgataatata acagcaagat tagataggat 2761 tgatgaaaaa ttaagtgaaa tactaggaat gcttcacaca ttagtagtag cgagtgcagg 2821 acctacatct gctcgggatg gtataagaga tgccatggtt ggtttaagag aagacatgat 2881 agaaaaaatc agaactgaag cattaatgac caatgacaga ctagaagcta tggcaagact 2941 caggaatgag gaaagtgaaa agatggcaaa agacacatca gatgaagtgt ctctcaatcc 3001 aacatcagag aaattgaaca acctgttgga agggaatgat agtgacaatg atctatcact 3061 tgatgatttc tgatcagtta ccaatctgta catcaacaca caacaccaac agaagaccaa 3121 caaacaaacc aactcaccca tccaaccaaa catctatacg ccaatcagcc aatccaaaac 3181 tagccacccg gaaaaaatag atactatagt tacaaaaaaa gatggggcaa atatggaaac 3241 atacgtgaac aaacttcacg aaggctccac atacacagct gctgttcaat acaatgtctt 3301 agaaaaagac gatgaccctg catcacttac aatatgggtg cccatgttcc aatcatccat 3361 gccagcagat ttacttataa aagaactagc taatgtcaac atactagtga aacaaatatc 3421 cacacccaat ggaccttcat taagagtcat gataaactca agaagtgcag tgctagcaca 3481 aatgcccagc aaatttacca tatgtgccaa tgtgtccttg gatgaaagaa gcaagctggc 3541 atatgatgta accacaccct gtgaaatcaa ggcatgtagt ctaacatgcc taaaatcaaa 3601 aaatatgtta actacagtta aagatctcac tatgaaaaca ctcaacccaa cacatgacat 3661 cattgcttta tgtgaatttg aaaatatagt aacatcaaaa aaagtcataa taccaacata 3721 cctaagatcc atcagtgtca gaaataaaga tctgaacaca cttgaaaata taacaaccac 3781 tgaattcaaa aatgccatca caaatgcaaa aatcatccct tactcaggat tactgttagt 3841 catcacagtg actgacaaca aaggagcatt caaatacata aagccacaaa gtcaatttat 3901 agtagatctt ggagcttacc tagaaaaaga aagtatatat tatgttacaa caaattggaa 3961 gcacacagct acacgatttg caatcaaacc catggaagat taaccttttt cttctacatc 4021 agtgagttga ttcatacaaa ctttctacct acattcttca cttcaccatc ataatcacca 4081 accctctgtg gttcaactaa tcaaacaaaa cccatctgga gcctcagatc atcccaagtc 4141 attgttcatc agatctagta ctcaaataag ttaataaaaa tatccacatg gggcaaataa 4201 tcattggagg aaatccaact aatcacaata tctgtcaaca tagacaagtc aacacgccag 4261 gcaaaatcaa ccaatggaaa atacatccat aacaatagaa ttctcaagca aattctggcc 4321 ttactttaca ctaatacaca tgataacaac aataatctct ttgctaatca taatctccat 4381 catgattgca atactgaaca aactctgtga atataacgta ttccataaca aaacctttga 4441 gctaccaaga gctcgagtca atacatagca ttcaccaatc tgatggcaca aaacagtaac 4501 cttgcatttg taagtgaaca accctcacct ctttacaaaa ccacatcaac atctcaccat 4561 gcaagccatc atccatatta taaagtagtt aattaaaaat aatcataaca atgaactaag 4621 atattaagac taacaataac gttggggcaa atgcaaacat gtccaaaaac aaggaccaac 4681 gcaccaccaa gacactagaa aagacctggg acactctcaa tcatctatta ttcatatcat 4741 cgtgcttata caagttaaat cttaaatcta tagcacaaat cacattatcc attctggcaa 4801 tgataatctc aacttcactt ataattgcag ccatcatatt catagcctcg gcaaaccaca 4861 aagtcacact aacaactgca atcatacaag atgcaacaag ccagatcaag aacacaaccc 4921 caacatacct cacccagaat ccccagcttg gaatcagctt ctccaatctg tctgaaacta 4981 catcacaaac caccaccata ctagcttcaa caacaccaag tgtcaagtca accctgcaat 5041 ccacaacagt caagaccaaa aacacaacaa caaccaaaat acaacccagc aagcccacca 5101 caaaacaacg ccaaaacaaa ccaccaaaca aacccaataa tgattttcac tttgaagtgt 5161 tcaactttgt accttgcagc atatgcagca acaatccaac ctgctgggct atctgtaaaa 5221 gaataccaaa caaaaaacct ggaaagaaaa ccaccaccaa gcccacaaaa aaaccaacca 5281 tcaagacaac caaaaaagat ctcaaacctc aaaccacaaa accaaaggaa gtacctacca 5341 ccaagcccac agaaaagcca accatcaaca ccaccaaaac aaacatcaga actacactgc 5401 tcaccaacaa taccacagga aatccagaac acacaagtca aaagggaacc ctccactcaa 5461 cctcctccga tggcaatcca agcccttcac aagtctatac aacatccgag tacctatcac 5521 aacctccatc tccatccaac acaacaaacc agtagtcatt aaaaagcgta ttattgcaaa 5581 aagccatgac caaatcaacc agaatcaaaa tcaactctgg ggcaaataac aatggagttg 5641 ccaatcctca aaacaaatgc aattaccgca atccttgctg cagtcacact ctgttttgct 5701 tccagtcaaa acatcactga agaattttat caaacaacat gcagtgcagt cagcaaaggc 5761 tatcttagtg ctctaagaac tggttggtat actagtgtta taactataga attaagtaat 5821 atcaaggaaa ataagtgtaa tggaacagac gctaaggtaa aattgataaa acaagaatta 5881 gataaatata aaagtgctgt aacagaattg cagttgctca tgcaaagcac accggcaacc 5941 aacaatcgag ccagaagaga actaccaagg tttatgaatt atacactcaa caataccaaa 6001 aataccaatg taacattaag caagaaaagg aaaagaagat ttcttggctt tttgttaggt 6061 gttggatctg caatcgccag tggcattgct gtatctaagg tcctgcacct agaaggggaa 6121 gtgaacaaaa tcaaaagtgc tctactatcc acaaacaagg ctgtagtcag cttatcaaat 6181 ggagttagtg tcttaaccag caaagtgtta gacctcaaaa actatataga taaacagttg 6241 ttacctattg tgaacaagca aagctgtagc atatcaaaca ttgaaactgt gatagagttc 6301 caacaaaaga acaacagact actagagatt accagggaat ttagtgttaa tgcaggtgta 6361 actacacctg taagcactta tatgttaaca aatagtgaat tattatcatt aatcaatgat 6421 atgcctataa caaatgatca gaaaaagtta atgtccaaca atgttcaaat agttagacag 6481 caaagttact ctatcatgtc cataataaag gaggaagtct tagcatatgt agtacaatta 6541 ccactatatg gtgtaataga tacaccttgt tggaaactgc acacatcccc tctatgtaca 6601 accaacacaa aggaagggtc caacatctgt ttaacaagaa ccgacagagg atggtactgt 6661 gacaatgcag gatcagtatc tttcttccca ctagctgaaa catgtaaagt tcaatcgaat 6721 cgagtatttt gtgacacaat gaacagttta acattaccaa gtgaagtaaa tctctgcaac 6781 attgacatat tcaaccccaa atatgattgc aaaattatga cttcaaaaac agatgtaagc 6841 agctccgtta tcacatctct aggagccatt gtgtcatgct atggcaaaac taaatgtaca 6901 gcatccaata aaaatcgtgg aatcataaag acattttcta acgggtgcga ttatgtatca 6961 aataaggggg ttgacactgt gtctgtaggt aatacattat attatgtaaa taagcaagaa 7021 ggcaaaagtc tctatgtaaa aggtgaacca ataataaatt tctatgaccc attagtgttc 7081 ccctctgatg aatttgatgc atcaatatct caagtcaatg agaagattaa ccagagccta 7141 gcatttattc gtaaatccga tgaattatta cataatgtaa atgctggtaa atccaccata 7201 aatatcatga taactactat aattatagtg attatagtaa tattgttatc attaattgcc 7261 gttggactgc tcctatactg caaggccaga agcacaccag tcacactaag caaggatcaa 7321 ctgagtggta taaataatat tgcatttagt aactaaataa aaatagcacc taatcatgtt 7381 cttacaatgg tttcatatct gctcatagac aacccatcta tcattggatt ttcttaaaat 7441 ctgaacttca tcgaaactct catctataaa ccatctcact tacattattt aagtagattc 7501 ctagtttata gttatataaa acaattgaat accagattaa cttactattt gtaaaaaatg 7561 agaactgggg caaatatgtc acgaaggaat ccttgcaaat ttgaaattcg aggtcattgc 7621 ttgaatggta agaggtgtca ttttagtcat aattattttg aatggccacc ccatgcactg 7681 cttgtaagac aaaactttat gttaaacaga atacttaagt ctatggataa aagcatcgat 7741 actttatcag aaataagtgg agctgcagag ttggacagaa cagaagagta tgccctcggt 7801 gtagttggag tgctagagag ttatatagga tctataaata atataactaa acaatcagca 7861 tgtgttgcca tgagcaaact cctcactgaa ctcaacagtg atgacatcaa aaaactgagg 7921 gacaatgaag agccaaattc acccaagata agagtgtaca atactgtcat atcatatatt 7981 gaaagcaaca ggaaaaacaa taaacaaact atccatctgt taaaaagatt gccagcagac 8041 gtattgaaga aaaccataaa aaccacattg gatatccaca agagcataac catcaataac 8101 ccaaaagaat caactgttag tgatataaac gaccatgcca aaaataatga tactacctga 8161 caaatatcct tgtagtataa attccatact aataacaagt agttgtagag ttactatgta 8221 taatcaaaag aacacactat atttcaatca aaacaaccaa aataaccata tatactcacc 8281 gaatcaacca ttcaatgaaa tccattggac ctctcaagac ttgattgatg caattcaaaa 8341 ttttctacaa catctaggta ttactgatga tatatacaca atatatatat tagtgtcata 8401 acactcaatc ctaatgctta ccacatcatc aaactattaa ctcaaacaat tcaagccatg 8461 ggacaaaatg gatcccatta ttaatggaaa ttctgctaat gtgtatctaa ccgatagtta 8521 tttaaaaggt gttatttctt tctcagaatg taatgcttta ggaagttaca tattcaatgg 8581 tccttatctc aaaaatgatt ataccaactt aattagtaga caaaatccat taatagaaca 8641 cataaatcta aagaaactaa atataacaca gtccttaatg tctaagtatc ataaaggtga 8701 aataaaaata gaagaaccta cttattttca gtcattactt atgacataca agagtatgac 8761 ctcgttagaa cagattacta ccactaattt acttaaaaag ataataagaa gagctataga 8821 aattagtgat gtcaaagtct atgctatatt gaataaactg gggcttaaag aaaaagacaa 8881 gattaaatcc aacaatggac aagatgaaga caactcagtt attacaacca taatcaaaga 8941 tgatatactt ttagctgtta aggataatca atctcatctt aaagcagtca aaaatcactc 9001 tacaaaacaa aaagatacaa tcaaaacaac actcttgaag aaattaatgt gttcaatgca 9061 acatcctcca tcatggttaa tacattggtt taatttatac acaaaattaa acaacatatt 9121 aacacagtat cgatcaagtg aggtaaaaaa ccatggtttt atattgatag acaatcatac 9181 tctcaatgga ttccaattta ttttgaatca atatggttgt atagtttatc ataaggaact 9241 caaaagaatt actgtgacaa cctataatca attcttgaca tggaaaaata ttagccttag 9301 tagattaaat gtttgtttaa ttacatggat tagtaactgt ttgaacacat taaataaaag 9361 cttaggctta agatgcggat tcaataatgt tatcttgaca caactattcc tctatggaga 9421 ttgtatacta aaactattcc acaatgaggg gttctacata ataaaagagg tagagggatt 9481 tattatgtct ctaattttaa atataacaga agaagatcaa ttcagaaaac ggttttataa 9541 tagtatgctc aacaacatca cagatgctgc taataaagct cagaaaaatc tgctatcaag 9601 agtatgtcat acattattag ataagacagt atccgataat ataataaatg gcagatggat 9661 aattctatta agtaagttcc ttaaattaat taagcttgca ggtgacaata accttaacaa 9721 tctgagtgaa ttatattttt tgttcagaat atttggacac ccaatggtag atgaaagaca 9781 agccatggat gctgttaaag ttaattgcaa cgagaccaaa ttttacttgt taagcagttt 9841 gagtatgtta agaggtgcct ttatatatag aattataaaa ggatttgtaa ataattacaa 9901 cagatggcct actttaagga atgctattgt tttaccctta agatggttaa cttactataa 9961 actaaacact tatccttcct tgttggaact tacagaaaga gatttgattg ttttatcagg 10021 actacgtttc tatcgtgagt ttcggttgcc taaaaaagtg gatcttgaaa tgatcataaa 10081 tgataaggct atatcacctc ctaaaaattt gatatggact agtttcccta gaaattatat 10141 gccgtcacac atacaaaatt atatagaaca tgaaaaatta aaattttccg agagtgataa 10201 atcaagaaga gtattagagt actatttaag agataacaaa ttcaatgaat gtgatttata 10261 caactgtgta gttaatcaaa gttatcttaa caaccctaat catgtggtat ctttgacagg 10321 caaagaaaga gaactcagtg taggtagaat gtttgcaatg caaccaggaa tgttcagaca 10381 agttcaaata ttagcagaga aaatgatagc tgaaaacatt ttacaattct ttcctgaaag 10441 tcttacaaga tatggtgatc tagaactaca gaaaatatta gaattgaaag caggaataag 10501 taacaaatca aatcgttaca atgataatta caacaattac attagtaagt gctctatcat 10561 cacagatctc agcaaattca atcaagcatt tcgatatgaa acatcatgta tttgtagtga 10621 tgtactggat gaactgcatg gtgtacaatc tctatttttc tggttacatt tagctattcc 10681 tcatgtcaca ataatatgca catataggca tgcacccccc tatataagag atcatattgt 10741 agatcttaac aatgtagatg aacaaagtgg attatataga tatcatatgg gtggtatcga 10801 agggtggtgt caaaaactat ggaccataga agctatatca ctattggatc taatatctct 10861 caaagggaaa ttctcaatta ctgctttaat taatggtgac aatcaatcaa tagatataag 10921 taaaccagtc agactcatgg aaggtcaaac tcatgctcaa gcagattatt tgctagcatt 10981 aaatagtctt aaattactgt ataaagagta tgcaggcata ggccacaaat taaaaggaac 11041 tgagacttat atatcaagag atatgcaatt tatgagtaaa acaattcaac ataacggtgt 11101 atattaccca gctagtataa agaaagtcct aagagtggga ccgtggataa acactatact 11161 tgatgatttc aaagtgagtc tagaatctat aggtagtttg acacaagaat tagaatatag 11221 aggagaaagt ctattatgca gtttaatatt tagaaatgta tggttatata atcaaattgc 11281 tttacaacta aaaaatcatg cattatgtaa caataaatta tatttggaca tattaaaggt 11341 tctgaaacac ttaaaaacct tttttaatct tgataatatt gatacagcat taacattgta 11401 tatgaatttg cccatgttat ttggtggtgg tgatcccaac ttgttatatc gaagtttcta 11461 tagaagaact cctgatttcc tcacagaggc tatagttcac tctgtgttca tacttagtta 11521 ttatacaaac catgatttaa aagataaact tcaagatctg tcagatgata gattgaataa 11581 gttcttaaca tgcataatca catttgacaa aaaccctaat gctgaattcg taacattgat 11641 gagagatcct caagctttag ggtctgagag acaagctaaa attactagcg aaatcaatag 11701 actggcagtt actgaggttt tgagcacagc tccaaacaaa atattctcca aaagtgcaca 11761 acactatacc actacagaga tagatctaaa tgatattatg caaaatatag aacctacata 11821 tcctcacggg ctaagagttg tttatgaaag tttacccttt tataaagcag agaaaatagt 11881 aaatcttata tccggtacaa aatctataac taacatactg gaaaagactt ctgccataga 11941 cttaacagat attgatagag ccactgagat gatgaggaaa aacataactt tgcttataag 12001 gatatttcca ttagattgta acagagataa aagggaaata ttgagtatgg aaaacctaag 12061 tattactgaa ttaagcaaat atgttaggga aagatcttgg tctttatcca atatagttgg 12121 tgttacatca cctagtatca tgtatacaat ggacatcaaa tatacaacaa gcactatagc 12181 tagtggcata atcatagaga aatataatgt taacagttta acacgtggtg agagaggacc 12241 cactaaacca tgggttggtt catctacaca agagaaaaaa acaatgccag tttataatag 12301 acaagtttta accaaaaaac agagagatca aatagatcta ttagcaaaat tggattgggt 12361 gtatgcatct atagataaca aggatgaatt catggaagaa cttagcatag gaattcttgg 12421 gttaacatat gagaaagcca aaaaattatt tccacaatat ttaagtgtta actatttgca 12481 tcgccttaca gtcagtagta gaccatgtga attccctgca tcaataccag cttatagaac 12541 tacaaattat cactttgata ctagccctat taatcgcata ttaacagaaa agtatggtga 12601 tgaagatatt gatatagtat tccaaaactg tataagcttt ggccttagct taatgtcagt 12661 agtagaacaa tttactaatg tatgtcctaa cagaattatt cttataccta agcttaatga 12721 gatacattta atgaaacctc ccatattcac aggtgatgtt gatattcaca agttaaaaca 12781 agtgatacaa aaacagcata tgtttttacc agacaaaata agtttgactc aatatgtgga 12841 attattctta agtaataaaa cactcaaatc tggatctcat gttaattcta atttaatatt 12901 ggcgcataag atatctgact attttcataa tacttacatt ttaagtacta atttagctgg 12961 acattggatt ctgattatac aacttatgaa agattctaag ggtatttttg aaaaagattg 13021 gggagaggga tatataactg atcatatgtt cattaatttg aaagttttct tcaatgctta 13081 taagacctat ctcttgtgtt ttcataaagg ttacggcaga gcaaagctgg agtgtgatat 13141 gaatacttca gatctcctat gtgtattgga attaatagac agtagttatt ggaagtctat 13201 gtctaaggta tttttagaac aaaaagttat caaatacatt cttagccagg atgcaagttt 13261 acatagagta aaaggatgtc atagcttcaa actatggttt cttaaacgtc ttaatgtagc 13321 agaattcaca gtttgccctt gggttgttaa catagattat catccaacac atatgaaagc 13381 aatattaact tatatagatc ttgttagaat gggattgata aatatagata aaatatacat 13441 taaaaataaa cacaaattca atgatgaatt ttatacttct aatctctttt acattaatta 13501 taacttctca gataatactc atctattaac taaacatata aggattgcta attctgaatt 13561 agaaaataat tacaacaaat tatatcatcc tacaccagaa accctagaaa atatactaac 13621 caatccggtt aaatgtaatg acaaaaagac actgaatgac tattgtatag gtaaaaatgt 13681 tgactcaata atgttaccat tgttatctaa taagaagctt attaaatcgc ctacaatgat 13741 tagaaccaat tacagcaaac aagatttgta taatttattt cctacggttg tgattgataa 13801 aattatagat cattcaggta atacagccaa atctaaccaa ctttacacta ctacttctca 13861 tcaaatacct ttagtgcaca atagcacatc actttattgc atgcttcctt ggcatcatat 13921 taatagattc aattttgtat ttagttctac aggttgtaaa attagtatag agtatatttt 13981 aaaagacctt ataattaaag atcctaattg tatagcattc ataggtgaag gagcagggaa 14041 tttattattg cgtacagtag tggaacttca tcccgatata agatatattt acagaagtct 14101 gaaggattgc aatgatcata gtttacctat tgagttttta aggctgtaca atggacatat 14161 caacattgat tatggtgaaa atttgaccat tcctgctaca gatgcaacca acaacattca 14221 ttggtcttat ttacatataa agtttgctga acctatcagt ctttttgtct gtgatgctga 14281 attgcctgta acagtcaact ggagtaaaat tataatagag tggagcaagc atgtaagaaa 14341 atgcaagtac tgttcctcag ttaataaatg tacgttaata gtaaaatatc atgctcaaga 14401 tgatatcgat ttcaaattag acaatataac tatattaaaa acttatgtat gcttaggcag 14461 taagttaaag gggtctgaag tttacttagt ccttacaata ggtcctgcaa atgtgttccc 14521 agtatttaat gtagtacaaa atgctaaatt gatactatca agaaccaaaa atttcatcat 14581 gcctaagaag gctgataaag agtctattga tgcaaatatt aaaagtttga taccctttct 14641 ttgttaccct ataacaaaaa aaggaattaa tactgcattg tcaaaactaa agagtgttgt 14701 tagtggagat atactatcat attctatagc aggacgtaat gaagttttca gcaataaact 14761 tataaatcat aagcatatga acatcttaaa atggttcaat catgttttaa atttcagatc 14821 aacagaacta aactataatc atttatatat ggtagaatct acatatcctt atctaagtga 14881 attgttaaac agcttgacaa ctaatgaact taaaaaactg attaaaatca caggtagttt 14941 gttatacaac tttcataatg aataatgaat aaaaatctta tattaaaaat tcccatagct 15001 acacactaac actgtattca attatagtta tttaaaatta aaaattatat aattttttaa 15061 taacttttag tgaactaatc ctaaaattat cattttgatc taggaggaat aaatttaaat 15121 ccaaatctaa ttggtttata tgtatattaa ctaaactacg agatattagt ttttgacact 15181 ttttttctcg t // LOCUS NC_001796 15462 bp cRNA linear VRL 13-AUG-2018 DEFINITION Human parainfluenza virus 3, complete genome. ACCESSION NC_001796 VERSION NC_001796.2 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human respirovirus 3 ORGANISM Human respirovirus 3 Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; Mononegavirales; Paramyxoviridae; Orthoparamyxovirinae; Respirovirus. REFERENCE 1 AUTHORS Durbin,A.P., McAuliffe,J.M., Collins,P.L. and Murphy,B.R. TITLE Mutations in the C, D, and V open reading frames of human parainfluenza virus type 3 attenuate replication in rodents and primates JOURNAL Virology 261 (2), 319-330 (1999) PUBMED 10497117 REFERENCE 2 AUTHORS Ohsawa,K., Yamada,A., Takeuchi,K., Watanabe,Y., Miyata,H. and Sato,H. TITLE Genetic characterization of parainfluenza virus 3 derived from guinea pigs JOURNAL J. Vet. Med. Sci. 60 (8), 919-922 (1998) PUBMED 9764404 REFERENCE 3 (bases 1 to 15462) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (05-DEC-2000) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 4 (bases 1 to 15462) AUTHORS Ohsawa,K. TITLE Direct Submission JOURNAL Submitted (12-MAR-1998) Kazutaka Ohsawa, Laboratory Animal Center for Biomedical Research, Nagasaki University School of Medicine; 1-12-4 Sakamoto, Nagasaki 852-8523, Japan COMMENT VALIDATED REFSEQ: This record has undergone validation or preliminary review. The reference sequence was derived from AB012132. On Oct 20, 2000 this sequence version replaced NC_001796.1. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..15462 /organism="Human respirovirus 3" /mol_type="viral cRNA" /db_xref="taxon:11216" gene 111..1658 /locus_tag="HPIV3gp1" /db_xref="GeneID:911955" CDS 111..1658 /locus_tag="HPIV3gp1" /codon_start=1 /product="nucleocapsid protein" /protein_id="NP_067148.1" /db_xref="GeneID:911955" /translation="MLSLFDTFNARRQENITKSAGGAIIPGQKNTVSIFALGPTITDDN EKMTLALLFLSHSLDNEKQHAQRAGFLVSLLSMAYANPELYLTTNGSNADVKYVIYMIE KDLKRQKYGGFVVKTREMVYDKTTDWIFGSDLDCDQETMLQNGRNNSTIEDLVHTFGYP SCLGALIIQIWIVLVKAITSISGLRKGFFTRLEAFRQDGTVQAGLVLSGDTVDQIGSIM RSQQSLVTLMVETLITMNTSRNDLTTIEKNIQIVGNYIRDAGLASFFNTIRYGIETRMA ALTLSTLRPDINRLKALMELYLSKGPRAPFICILRDPIHGEFAPGNYPAIWSYAMGVAV VQNRAMQQYVTGRSYLDIDMFQLGQAVARDAEAQMSSTLEDELGVTHEAKESLKRHIRN INSSETSFHKPTGGSAIEMAIDEEPEQFEHRSDQERDGEPQSSIIQYAWAEGNRSDDRT EQDTESDNIKTEQQNIRDRLNKRLNEKKKQGSQPPTNPTNRTNQDEIDDLFNAFGSN" gene 1784..3595 /locus_tag="HPIV3gp2" /db_xref="GeneID:911956" CDS 1784..3595 /locus_tag="HPIV3gp2" /codon_start=1 /product="phosphoprotein" /protein_id="NP_067149.1" /db_xref="GeneID:911956" /translation="MESDAKNYQIMDSWEEESRDKSTNISSALNIIEFILSTDPQEDLS ENDTINTRTQQLSATIYQPKIKPTETSEKDSGSTDKNRQSGSSHECTTEAKDRTIDQET VQRGPGRRSSSDSRAETVVSGGISRSITNSKNGTQNTEDIDLNEIRKMDKDSIEGKVRQ SADVPSEISGSDVIFTTEQSRNSDHGRSLESISTPDTRSISVVTAATPDDEEEILMKNS RTKKSSSIHQEDDKRIKKGGKGKDWFKKSKDTDNQIPTSDYRSTSKGQKKISKTTTINT DTKGQTEIQTESSGTQSSSWNLTIDNNTDRTEQTNTTPPTTTSGSTYTKESIRTNSGSK PKTQKTNGKERKDTEESNRFTERAITLLQNLGVIQSTSKLDLYQDKRVVCVANVLNNVD TASKIDFLAGLVIGVSMDNDTKLTQIQNEMLNLKADLKKMDESHRRLIENQREQLSLIT SLISNLKIMTERGGKKDQNESNERVSMIKTKLKEEKIKKTRFDPLMETQGIDKNIPDLY RHAGNTLENDVQVKSEILSSYNESNATRLIPKKVSSTMRSLVAVISNSNLSQSTKQSYI NELKHCKNDEEVSELMDMFNEDVNNCQ" gene 1784..2903 /locus_tag="HPIV3gp3" /db_xref="GeneID:911958" CDS join(1784..2509,2508..2903) /locus_tag="HPIV3gp3" /exception="RNA editing" /codon_start=1 /product="D protein" /protein_id="NP_599250.1" /db_xref="GeneID:911958" /translation="MESDAKNYQIMDSWEEESRDKSTNISSALNIIEFILSTDPQEDLS ENDTINTRTQQLSATIYQPKIKPTETSEKDSGSTDKNRQSGSSHECTTEAKDRTIDQET VQRGPGRRSSSDSRAETVVSGGISRSITNSKNGTQNTEDIDLNEIRKMDKDSIEGKVRQ SADVPSEISGSDVIFTTEQSRNSDHGRSLESISTPDTRSISVVTAATPDDEEEILMKNS RTKKSSSIHQEDDKRIKKGGEKGKTGLRNQKILTTRYQHQTTDPHQKGRRKSQKQQPST PTQRGKQKYRQNHQEHNPHHGISPLITTQIEPNRQTQLPQQQPPDQLIQKNQSEQTLDP NPRHKRQMERKGRIQKRAIDLQRGQLLYCRILV" gene 1794..2393 /locus_tag="HPIV3gp4" /db_xref="GeneID:911959" CDS 1794..2393 /locus_tag="HPIV3gp4" /codon_start=1 /product="C protein" /protein_id="NP_599251.1" /db_xref="GeneID:911959" /translation="MLKTIKSWILGKRNQEINQLISPRPSISLNSYLAPTPKKTYRKTT QSTQEPSNSVPPSINQKSNQQKQVRKIVDQLTKIDSLGHHTNVQQKQKIELLIRKLYRE DLGEEAAQIVELRLWSLEESPEASQILKMEPRTRRILISMKLERWIRTLLRGKCDNLQM FQARYQEVMSYLQQNKVETVIMEEAWNLSVHLIQDQ" gene 3753..4814 /locus_tag="HPIV3gp5" /db_xref="GeneID:911954" CDS 3753..4814 /locus_tag="HPIV3gp5" /codon_start=1 /product="matrix potein" /protein_id="NP_067150.1" /db_xref="GeneID:911954" /translation="MSITNSAIYTFPESSFFENGHIEPLPLKVNEQRKAVPHIRVAKIG NPPKHGSRYLDVFLLGFFEMERIKDKYGSVNDLDSDPSYKVCGSGSLPIGLAKYTGNDQ ELLQAATKLDIEVRRTVKAKEMVVYTVQNIKPELYPWSNRLRKGMLFDANKVALAPQCL PLDRSIKFRVIFVNCTAIGSITLFKIPKSMASLSLPNTISINLQVHIKTGVQTDSKGIV QILNEKGEKSLNFMIHLGLIKRKIGRMYSVEYCKQKIEKMRLIFSLGLVGGISLHVNAT GSISKTLASQLVFKREICYPLMDLNPHLNLVIWASSVEITRVDAIFQPSLPGEFRYYPN IIAKGVGKIKQWN" gene 5072..6691 /locus_tag="HPIV3gp6" /db_xref="GeneID:911957" CDS 5072..6691 /locus_tag="HPIV3gp6" /codon_start=1 /product="fusion protein" /protein_id="NP_067151.1" /db_xref="GeneID:911957" /translation="MPTSTLLIITTIIMASFCQIDITKLQHVGVLVNSPKGMKISQNFE TRYLILSLIPKIEDSNSCGDQQIKQYKRLLDRLIIPLYDGLRLQKDVIVTNQESNENTN PRTKRFFGGVIGTIALGVATSAQITAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQS SIGNLIVAIKSVQDYVNKEIVPSIARLGCEAAGLQLGIALTQHYSELTNIFGDNIGSLQ EKGIKLQGIASLYRTNITEIFTTSTVDKYDIYDLLFTESIKVRVIDVDLNDYSITLQVR LPLLTRLLNTQIYKVDSISYNIHNREWYIPLPSHIMTKGAFLGGADVKECIEAFSSYIC PSDPGFVLNHEMESCLSGNISQCPRTTITSDIVPRYAFVNGGVVANCITTTCTCNGIGN RINQPPNQGVKIITHKECSTIGINGMLFNTNKEGTLAFYTPDDITLNNSVALDPIDISI ELNKAKSDLEESKEWIRKSNQKLDSIGNWHQSSTTIIIILMMIIILFIINITIITIAIK YYRIQKRNQMDQNDKPYVLTNK" gene 6806..8530 /locus_tag="HPIV3gp7" /db_xref="GeneID:911961" CDS 6806..8530 /locus_tag="HPIV3gp7" /codon_start=1 /product="hemagglutinin-neuraminidase" /protein_id="NP_067152.1" /db_xref="GeneID:911961" /translation="MEYWKHTNHGKDAGNELETSMATHNNKLTNKIIYILWTIILVLLS IVFIIVLINSINSEKVHNSLLQEINNEFMEITEKIQMASDNTNDLIQSGVNTRLLTIQS HVQNYIPISLTQQMSDLRKFISEITIRNDNQEVPQQRITHDVGIKPLNPDDFWRCTSGL PFLMRNPKIRLMPGPGLLAMPTTVDGCVRTPSLIINDLIYAYTSNLITRGCQDIGKSYQ VLQVGIITVNSDLVPDLNPRFSHTFNINDNRKSCSLALLNTDVYQLCSTPKVDERSDYA SSGIEDIVLDIVNYDGSISTTRFKNNNISFDQPYAALYPSVGPGIYYKGKIIFLGYGGL EHPINENVICNTTECPGKTQRDCNQASYSPWFSDRRMVNSIIVVDKGLNSIPKLKVWTI SMRQNYWGSEGRLILLGNKIYIYTRSTSWHSKLQLGIIDITDYSDIRIKWTWHNVLSRP GNDECPWGHSCPNGCITGVYTDAYPLNPTGSIVSSVILDSQKSRVNPVITYSTATERVN ELAIRNRTLSAGYTTTSCITHYDKGYCFHIVEINQKSSNTFQPMLFKTEIPKSCSQS" gene 8646..15347 /locus_tag="HPIV3gp8" /db_xref="GeneID:911960" CDS 8646..15347 /locus_tag="HPIV3gp8" /codon_start=1 /product="large protein" /protein_id="NP_067153.2" /db_xref="GeneID:911960" /translation="MDTESNNGTVSDILYPECHLNSPIVKGKIAQLHTIMSLPQPYDMD DDSILVITRQKIKLNKLDKRQRSIRRLKLILTEKVNDLGKYTFIRYPEMSKEMFKLYIP GINSKVTELLLKADRTYSQMTDGLRDLWINVLSKLASKNDGSNYDLNEEINNISKVHTT YKSDKWYNPFKTWFTIKYDMRRLQKARNEITFNVGKDYNLLEDQKNFLLIHPELVLILD KQNYNGYLITPELVLMYCDVVEGRWNISACAKLDPKLQSMYQKGNNLWEVIDKLFPIMG EKTFDVISLLEPLALSLIQTHDPVKQLRGAFLNHVLSEMELIFESGESIREFLSVDYID KILDIFNESTIDEIAEIFSFFRTFGHPPLEASIAAEKVRKYMYIEKQLKFDTVNKCHAI FCTIIINGYRERHGGQWPPVTLPDHAHEFIINAYGSNSAISYENAVDYYQSFIGIKFNK FIEPQLDEDLTIYMKDKALSPKKSNWDTVYPASNLLYRTNASNESRRLVEVFIADSKFD PHQILDYVESGDWLDDPEFNISYSLKEKEIKQEGRLFAKMTYKMRATQVLSETLLANNI GKFFQENGMVKGEIELLKRLTTISISGVPRYNEVYNNSKSHTDDLKTYNKISNLNLSSN QKSKKFEFKSTDIYNDGYETVSCFLTTDLKKYCLNWRYESTALFGETCNQIFGLNKLFN WLHPRLEGSTIYVGDPYCPPSDKEHISLEDHPDSGFYVHNPRGGIEGFCQKLWTLISIS AIHLAAVRIGVRVTAMVQGDNQAIAVTTRVPNNYDYRIKKEIVYKDVVRFFDSLREVMD DLGHELKLNETIISSKMFIYSKRIYYDGRILPQALKALSRCVFWSETVIDETRSASSNL ATSFAKAIENGYSPVLGYACSIFKNIQQLYIALGMNINPTITQNIKDQYFKNSNWMQYA SLIPASVGGFNYMAMSRCFVRNIGDPSVAALADIKRFIKANLLDRSVLYRIMNQEPGES SFLDWASDPYSCNLPQSQNITTMIKNITARNVLQDSPNPLLSGLFTNTMIEEDEELAEF LMDRKVILPRVAHDILDNSLTGIRNAIAGMLDTTKSLIRVGINRGGLTYSLLRKISNYD LVQYETLSRTLRLIVSDKIRYEDMCSVDLAIALRQKMWIHLSGGRMISGLETPDPLELL SGVVITGSEHCKICYSSDGTNPYTWMYLPGNIKIGSAETGVSSLRVPYFGSVTDERSEA QLGYIKNLSKPAKAAIRIAMIYTWAFGNDEISWMEASQIAQTRANFTLDSLKILTPVAT STNLSHRLKDTATQMKFSSTSLIRVSRFITMSNDNMSIKEANETKDTNLIYQQIMLTGL SVFEYLFRLKETTGHNPIVMHLHIEDECCIKESFNDEHINPESTLELIRYPESNEFIYD KDPLKDVDLSKLMVIKDHSYTIDMNYWDDTDIIHAISICTAITIADTMSQLDRDNLKEI IVIANDDDINSLITEFLTLDILVFLKTFGGLLVNQFAYTLYSLKIEGRDLIWDYIMRTL RDTSHSILKVLSNALSHPKVFKRFWDCGVLNPIYGPNTASQDQIKLALSICEYALDLFM REWLNGVSLEIYICDSDMEVANDRKQAFISRHLSFVCCLAEIASFGPNLLNLTYLERLD LLKQYLELNIKEDPTLKYVQISGLLIKSFPSTVTYVRKTAIKYLRIRGISPPEVIDDWD PIEDENMLDNIVKTINDNCNKDNKGNKINNFWGLALKNYQVLKIRSITSDSDDNDRLDA STSGLTLPQGGNYLSHQLRLFGINSTSCLKALELSQILMKEVNKDKDRLFLGEGAGAML ACYDATLGPAINYYNSGLNITDVIGQRELKIFPSEVSLVGKKLGNVTQILNRVKVLFNG NPNSTWIGNMECESLIWSELNDKSIGLVHCDMEGAIGKSEETVLHEHYSVIRITYLIGD DDVVLVSKIIPTITPNWSRILYLYKLYWKDVSIISLKTSNPASTELYLISKDAYCTIME PSEVVLSKLKRLSLLEENNLLKWIILSKKRNNEWLHHEIKEGERDYGVMRPYHMALQIF GFQINLNHLAKEFLSTPDLTNINNIIQSFQRTIKDVLFEWINITHDDKRHKLGGRYNIF PLKNKGKLRLLSRRLVLSWISLSLSTRLLTGRFPDEKFEHRAQTGYVSLADTDLESLKL LSKNIIKNYRECIGSISYWFLTKEVKILMKLIGGAKLLGIPRQYKEPEEQLLENYNQHD EFDID" ORIGIN 1 accaaacaag agaagagact tgtttggaaa tataaattta aattaaaatt aacttaggat 61 taaagacatt gactagaagg tcaagaaaag ggaactctat aatttcaaaa atgttgagcc 121 tatttgatac atttaatgca cgtaggcaag aaaacataac aaaatcagcc ggtggagcta 181 tcattcctgg acagaaaaat actgtctcta tatttgctct tggaccgaca ataactgatg 241 ataatgagaa aatgacatta gctcttcttt ttttatctca ttcactggat aatgagaaac 301 aacatgcaca aagggcaggg ttcctggtgt ccttattatc aatggcttat gccaatccag 361 agctctacct aacaacaaat ggaagtaatg cagatgtcaa gtatgtcata tacatgatcg 421 agaaagatct aaaacggcaa aagtatggag gatttgtggt taagacgaga gagatggtat 481 atgacaagac aactgattgg atatttggaa gtgacctgga ttgtgatcag gaaactatgt 541 tgcagaacgg aagaaacaat tcaacaattg aagaccttgt ccacacattt gggtatccat 601 catgtttagg agctcttata atacagatct ggatagttct ggtcaaagct atcactagta 661 tctcagggtt aagaaaaggc tttttcaccc gattggaagc tttcagacaa gatggaacag 721 tgcaggcagg gctggtattg agcggtgaca cagtggatca gattgggtca attatgcggt 781 ctcaacagag cttggtaact cttatggttg aaacattaat aacaatgaat actagcagaa 841 atgacctcac aaccatagaa aagaatatac aaattgttgg taactatata agagatgcag 901 gtctcgcttc attcttcaat acaatcagat atggaattga gactagaatg gcagctttga 961 ctctatccac tctcagacca gacatcaata gattaaaagc tttgatggaa ctgtatttat 1021 caaagggacc acgcgctcct ttcatctgta tcctcagaga tcccatacat ggtgagttcg 1081 caccaggcaa ctatcctgcc atatggagct atgcaatggg agtggcagtt gtacaaaata 1141 gagccatgca acagtatgtg acgggaagat catatctgga tattgatatg ttccagctag 1201 gacaagcagt agcacgtgat gctgaagctc agatgagctc aacactagaa gatgaacttg 1261 gagtgacaca cgaagctaaa gaaagcttga agagacatat aaggaacata aatagttcag 1321 agacatcttt ccacaaacca acaggtggat cagccataga gatggcaata gatgaagagc 1381 cagaacaatt cgaacataga tcagatcaag aacgagatgg agaacctcaa tcatctataa 1441 ttcaatatgc ttgggcagaa ggaaacagaa gcgatgatcg gactgagcaa gatacagaat 1501 ctgacaatat caagactgaa caacaaaaca tcagagacag actaaacaag agactcaacg 1561 aaaagaagaa acaaggcagt caaccaccta ccaatcccac aaacagaacg aatcaggacg 1621 aaatagatga tctgtttaat gcatttggaa gcaactaatc gaatcaacat tttgatccaa 1681 attaataata aataagaaaa acttaggatt aaagaatcct attataccag aatatagggt 1741 ggtaaattta gagtttgctt gcaactcaat caatagagag ttgatggaaa gcgatgctaa 1801 aaactatcaa atcatggatt cttgggaaga ggaatcaaga gataaatcaa ctaatatctc 1861 ctcggccctc aatatcattg aattcatact tagcaccgac ccccaagaag acctatcgga 1921 aaacgacaca atcaacacaa gaacccagca actcagtgcc accatctatc aaccaaaaat 1981 caaaccaaca gaaacaagtg agaaagatag tggatcaact gacaaaaata gacagtctgg 2041 gtcatcacac gaatgtacaa cagaagcaaa agatagaact attgatcagg aaactgtaca 2101 gagaggacct gggagaagaa gcagctcaga tagtagagct gagactgtgg tctctggagg 2161 aatctccaga agcatcacaa attctaaaaa tggaacccag aacacggagg atattgatct 2221 caatgaaatt agaaagatgg ataaggactc tattgagggg aaagtgcgac aatctgcaga 2281 tgttccaagc gagatatcag gaagtgatgt catatttaca acagaacaaa gtagaaacag 2341 tgatcatgga agaagcttgg aatctatcag tacacctgat acaagatcaa taagtgttgt 2401 tactgctgca acaccagatg atgaagaaga aatactaatg aaaaatagta ggacaaagaa 2461 aagttcttca atacaccaag aagatgacaa aagaattaaa aaagggggaa aagggaaaga 2521 ctggtttaag aaatcaaaag atactgacaa ccagatacca acatcagact acagatccac 2581 atcaaaaggg cagaagaaaa tctcaaaaac aacaaccatc aacaccgaca caaaggggca 2641 aacagaaata cagacagaat catcaggaac acaatcctca tcatggaatc tcaccattga 2701 taacaacaca gatcgaaccg aacagacaaa cacaactccc ccaacaacaa cctccggatc 2761 aacttataca aaagaatcaa tccgaacaaa ctctggatcc aaacccaaga cacaaaagac 2821 aaatggaaag gaaaggaagg atacagaaga gagcaatcga tttacagaga gggcaattac 2881 tctattgcag aatcttggtg taattcaatc cacatcaaaa ctagatttat atcaagacaa 2941 acgagttgta tgtgtagcaa atgtactaaa caatgtagat actgcatcaa agatagactt 3001 cttggcagga ttagtcatag gggtttcaat ggacaacgac acaaaattaa cacagataca 3061 aaatgaaatg ctaaacctca aagcagatct aaagaaaatg gatgaatcac atagaagatt 3121 gatagaaaat caaagagaac aactgtcatt gatcacgtca ttaatttcaa atcttaaaat 3181 tatgactgag agaggaggaa agaaagacca aaatgaatcc aatgagagag tatccatgat 3241 caaaacaaaa ttgaaagaag aaaagatcaa gaaaactagg tttgacccac ttatggagac 3301 acaagggatt gacaagaata tacctgatct atatcgacat gcaggaaata cactagagaa 3361 cgatgtgcaa gttaaatcag agatattaag ttcatacaat gaatcaaatg caacaagact 3421 aatacccaaa aaagtgagca gtacaatgag atcactagtt gcagttataa gcaacagcaa 3481 tctctcacaa agcacaaaac aatcatatat aaacgaactc aaacattgca aaaatgatga 3541 agaagtatct gaattaatgg acatgttcaa tgaagatgtc aataattgcc aatgatccaa 3601 caaagaaacg acaccgaaca aacagacaag aaacaacagt agatcaaaat ctatcaacac 3661 acacaaaatc aagcagaatg aaacaataga tatcaatcaa tatacaaata agaaaaactt 3721 aggattaaag aataaattaa tccttatcca aaatgagtat aactaactct gcaatataca 3781 cattcccaga atcatcattc tttgaaaatg gtcatataga accattacca ctcaaagtca 3841 atgaacagag aaaagcagta ccccacatta gagttgccaa aatcggaaat ccaccaaaac 3901 atggatctcg gtatttagat gtcttcctac taggcttctt cgaaatggaa cgaatcaaag 3961 acaaatacgg gagtgtaaat gatctcgaca gtgacccgag ttacaaagtt tgtggctctg 4021 gatcattacc aatcggattg gctaaataca ctgggaatga ccaggaatta ttacaagctg 4081 caaccaaact ggacatagaa gtgagaagaa cagtcaaagc gaaagagatg gttgtgtata 4141 cagtacaaaa tataaaacca gaactatacc catggtctaa tagactaaga aaaggaatgc 4201 tgttcgatgc caacaaagtt gctcttgctc ctcaatgtct tccactagat aggagcataa 4261 aatttagagt aatcttcgtg aattgtacgg caattggatc aataaccttg ttcaaaattc 4321 ctaagtcaat ggcatcatta tctctaccca acacaatatc aatcaatctg caggtacata 4381 tcaaaacagg ggttcagact gattccaaag ggatagttca aattttgaat gagaagggcg 4441 aaaaatcact gaatttcatg atccatctcg gattgatcaa gagaaaaata ggcagaatgt 4501 actctgttga atactgtaaa cagaaaatcg agaaaatgag attgatattt tctttaggat 4561 tagttggagg aatcagtctt catgtcaatg caactggatc catatcaaaa acactagcaa 4621 gtcagctggt attcaaaagg gagatttgtt atcctttaat ggatctaaat ccgcatctta 4681 atctggttat ctgggcttca tcagtagaga ttacaagagt ggatgcaatt ttccaacctt 4741 ctttacctgg cgaattcaga tactatccta atattatcgc aaaaggagtt gggaaaatca 4801 aacaatggaa ctaataatct ctacttcaat ctggacgcat ccattaagcc gaagcaaata 4861 agggataatc aaaaacttag gacaaaagag gtcaacacca acaactatta gcagtcatac 4921 tcgcaggaac aagaaagaag ggacaaaaaa agcaaaatag gagaaatcaa aacaaaaagt 4981 acagaacacc aaaacaacaa aaccaaaaaa tctgatctat taaaaataaa aatcccaaaa 5041 gagactggca acacagcaaa cactaaacac aatgccaact tcaacactgc taattattac 5101 aaccataatt atggcatctt tctgccaaat agatatcaca aaactacagc atgtaggtgt 5161 attggtcaac agtcccaaag ggatgaagat atcacaaaac ttcgaaacaa gatatttaat 5221 tttgagcctc ataccaaaaa tagaagactc taactcttgt ggtgaccaac agatcaagca 5281 atacaagagg ttattggata gactgatcat ccctctatat gatgggttaa gattacagaa 5341 agatgtgata gtaaccaatc aggaatccaa tgaaaacact aatcccagaa caaaacgatt 5401 ctttggaggg gtaattggaa ccattgctct gggagtagca acctcagcac aaatcacagc 5461 ggcggttgct ctggttgaag ccaagcaggc aagatcagat attgagaaac tcaaagaagc 5521 aatcagggac acaaacaaag cagtgcagtc agttcagagc tccataggga atctaatagt 5581 agcaatcaaa tcagtccaag attatgtcaa caaagaaatc gtgccatcaa ttgcgaggct 5641 aggttgtgaa gcagcaggac ttcaattagg aattgcatta acgcagcatt actcagaatt 5701 aacaaacata tttggtgata acataggatc attacaagaa aaaggaataa aattacaagg 5761 tatagcatca ttataccgca caaatatcac agaaatattc acaacatcaa cagttgataa 5821 atatgatatc tatgatctat tatttacaga atcaataaag gtgagagtta tagatgttga 5881 tttgaatgat tactcaatca ccctccaagt cagactccct ttattaacta ggctgctgaa 5941 cactcagatc tacaaagtag attccatatc atataatatc cacaatagag aatggtatat 6001 tcctcttccc agccatatca tgacgaaagg ggcatttcta ggtggagcag atgtcaaaga 6061 atgtatagaa gcattcagca gttatatatg tccttctgat ccaggatttg tactaaacca 6121 tgaaatggag agctgcttat caggaaacat atcccaatgt ccaagaacca cgatcacatc 6181 agacattgtt ccaagatatg catttgtcaa tggaggagtg gttgcaaact gtataacaac 6241 cacttgtaca tgcaacggaa tcggcaatag aatcaatcaa ccacctaatc aaggagtaaa 6301 aattataaca cataaagaat gtagtacaat aggtatcaac ggaatgctgt tcaatacaaa 6361 caaagaagga actcttgcat tctacacacc agatgatata acattaaaca attctgttgc 6421 acttgatcca attgacatat caatcgagct caacaaggcc aaatcagatc tagaagaatc 6481 aaaagaatgg ataagaaagt caaatcaaaa actagattct attgggaatt ggcatcaatc 6541 tagcaccaca atcataatta ttttgatgat gatcattata ttgtttataa tcaatataac 6601 gataattaca attgcaatta agtattacag aattcaaaag agaaatcaaa tggatcaaaa 6661 tgacaagcca tatgtactaa caaacaaata acatatctat agatcatcag atattaagat 6721 tataaaaaac ttaggagtaa agttacgcaa tccaactcta ctcatacaat tgagaaagaa 6781 cccaatagac aaatccaaat tcgagatgga atactggaag cataccaatc acggaaagga 6841 tgctggtaat gagctggaga cgtccatggc tactcataac aacaagctca ctaataagat 6901 aatatatata ttatggacaa taatcctggt gttattatca atagttttca tcatagtgct 6961 aattaattcc atcaatagtg aaaaggtcca taattcattg ctacaagaaa taaataatga 7021 gtttatggag attacagaaa agatccaaat ggcatcggat aataccaatg atctaataca 7081 gtcaggagtg aacacaaggc ttcttacaat tcagagtcat gtccagaatt atataccaat 7141 atcattgaca caacagatgt cggatcttag gaaattcatt agtgaaatta caattagaaa 7201 tgataatcaa gaagtgccgc aacaaagaat aacacatgat gtaggtataa aacctttaaa 7261 tccagatgat ttttggaggt gcacgtctgg tcttccattt ttaatgagaa acccaaaaat 7321 aaggttaatg ccagggccgg gattattagc catgccaacg actgttgatg gctgtgtcag 7381 aactccgtcc ttaattataa atgatctgat ttatgcttat acctcaaatc taattactcg 7441 aggttgtcag gatataggaa aatcatacca agtcttacag gtagggataa taactgtaaa 7501 ctcagacttg gtacctgact taaatcccag gttctctcat actttcaaca taaatgacaa 7561 taggaagtca tgttctctag cactcctaaa tacagatgta tatcaactgt gttcaactcc 7621 caaagttgat gaaagatcag attatgcatc atcaggcata gaagatattg tacttgatat 7681 tgtcaattat gatggctcaa tctcaacaac aagatttaag aataataaca taagctttga 7741 tcaaccctat gctgcactat acccatctgt tggaccaggg atatattaca aaggcaaaat 7801 aatatttctc gggtatggag gtcttgaaca tccaataaat gagaatgtaa tctgcaacac 7861 aactgagtgt cccgggaaaa cacagagaga ctgcaatcag gcatcttata gtccatggtt 7921 ttcagatagg aggatggtca actccatcat tgttgttgac aaaggcttaa actcaattcc 7981 aaaattaaag gtatggacaa tatctatgag acaaaattac tgggggtcag aaggaaggtt 8041 gattctacta ggtaacaaga tctatatata tacaagatct acaagttggc atagcaaatt 8101 acaattagga ataattgata ttactgatta cagtgatata aggataaaat ggacatggca 8161 taatgtgcta tcaagaccag gaaacgatga atgtccatgg ggacattcat gtccaaatgg 8221 atgtataaca ggagtatata ctgatgcgta tccactaaat cccacaggga gcattgtgtc 8281 atctgtcata ctagactcac agaaatcgag agtgaaccca gtcataactt actcaacagc 8341 aactgaaaga gtaaacgagc tggccatcag aaacagaaca ctctcagctg gatatacaac 8401 aacaagctgc attacacact atgacaaagg atattgtttt catatagtag aaataaatca 8461 gaaaagttca aacacatttc aacctatgtt gttcaaaaca gagattccaa aaagctgcag 8521 tcaatcataa ttaaccataa tatgcattaa cctatctata atacaagtat atgataagta 8581 atcagcaatc agacaatgga caaaaggaaa atataaaaaa cttaggagca aagcgtgctc 8641 ggaaaatgga cactgaatct aacaatggca ctgtatctga catactctat cctgagtgtc 8701 accttaactc tcctatcgtt aaaggtaaaa tagcacaatt acacactatt atgagtctac 8761 ctcagcctta tgatatggat gacgactcaa tactagttat cactagacag aaaataaaac 8821 tcaataaatt agataaaaga caacgatcta ttagaagatt aaaattaata ttaactgaga 8881 aagtgaatga cttaggaaaa tacacattta tcagatatcc agaaatgtca aaagaaatgt 8941 tcaaattata tatacctggt attaacagta aagtgactga attattactt aaagcagata 9001 gaacatatag tcaaatgact gatggattaa gagatctatg gattaacgtg ctatcaaaat 9061 tagcctcaaa aaatgatgga agcaattatg atcttaatga agaaattaat aatatatcaa 9121 aagttcacac aacttataaa tcagataaat ggtataatcc attcaaaaca tggtttacta 9181 tcaagtatga tatgagaaga ttacaaaaag ctcgaaatga gatcactttt aatgttggga 9241 aagattataa cttgttagaa gaccagaaga atttcttatt gatacatcca gaattggttt 9301 tgatattaga taaacaaaac tataatggtt atctaattac tcctgaatta gtactgatgt 9361 attgtgacgt agtcgaaggc cgatggaata taagtgcatg tgctaagcta gatccaaaat 9421 tacaatctat gtatcaaaaa ggtaataacc tatgggaagt gatagataaa ttgtttccaa 9481 ttatgggaga aaagacattt gatgtgatat cattattaga accacttgca ttatccttaa 9541 ttcaaactca tgatcctgtt aaacaactaa gaggagcttt tttaaatcat gtgttatccg 9601 aaatggaatt gatatttgaa tctggagaat cgattaggga atttctgagt gtagattaca 9661 ttgataaaat cttagatatt tttaatgaat ctacaataga tgaaatagca gagattttct 9721 ctttttttag aacatttggg catcctccat tagaggctag tattgcagca gaaaaggtta 9781 gaaaatatat gtatattgag aaacaattaa aatttgacac tgttaataaa tgccatgcta 9841 tcttctgtac aataataatt aacggatata gagagaggca tggtggacag tggcctcctg 9901 tgacattacc tgaccatgca cacgaattca tcataaatgc ttacggttca aactctgcga 9961 tatcatatga aaacgctgtt gattattacc agagcttcat aggaataaaa tttaataaat 10021 tcatagagcc tcagttagat gaggatttga caatttatat gaaagataaa gcattatctc 10081 caaaaaaatc aaattgggac acagtttatc ctgcatctaa tttactgtac cgtactaacg 10141 catccaacga atcacgaaga ttagttgaag tatttatagc agatagtaaa tttgatcctc 10201 atcagatatt ggattatgta gaatctgggg actggttaga tgatccagag tttaatattt 10261 cttatagtct caaagaaaaa gagattaaac aagaaggtag actctttgca aaaatgacat 10321 acaaaatgag agctacacaa gttttatcag agacactact tgcaaataat ataggaaaat 10381 tctttcaaga gaatgggatg gtgaagggag agattgaatt acttaagaga ttaacgacca 10441 tatcaatatc aggagttcca cggtataatg aagtgtacaa taattctaaa agccatacag 10501 atgaccttaa aacctacaat aaaataagta atcttaattt gtcttccaat cagaaatcta 10561 agaaatttga attcaagtca acggatatct acaatgatgg atatgagact gtgagctgtt 10621 tcctaacaac agatctcaaa aaatactgtc ttaattggag atatgaatca acagctctat 10681 ttggagaaac ttgcaaccaa atatttggat taaataaatt gtttaattgg ttacaccctc 10741 gtcttgaagg aagtacaatc tatgtaggtg atccttactg ccctccatca gataaagaac 10801 atatatcatt agaggatcac cctgattctg gtttttatgt tcataaccca agagggggta 10861 tagaaggatt ttgtcaaaaa ttatggacac tcatatctat aagtgcaata catctagcag 10921 ctgttagaat aggcgtgagg gtgactgcaa tggttcaagg agacaatcaa gctatagctg 10981 taaccacaag agtaccaaac aattatgatt acagaattaa gaaggagata gtttataaag 11041 atgtagtgag attttttgat tcattaagag aggtgatgga tgatctaggt catgaactta 11101 aattaaatga aacgattata agtagcaaga tgttcatata tagcaaaaga atctattatg 11161 atgggagaat tcttcctcaa gctctgaaag cattatctag atgtgtcttt tggtcagaga 11221 cagtaataga cgaaacaaga tcagcatctt caaacttggc aacatcattt gcaaaagcaa 11281 ttgagaatgg ttattcacct gttctaggat atgcatgctc aatttttaag aatattcaac 11341 aactatatat tgcccttggg atgaatatca atccaactat aacacagaat atcaaagatc 11401 agtattttaa gaattcaaat tggatgcaat atgcctcttt aatacctgcc agtgttgggg 11461 gattcaatta catggctatg tcaagatgtt ttgtaaggaa tattggtgat ccatcagttg 11521 ctgcattagc tgatattaaa agatttatca aggcgaatct attagaccga agtgttcttt 11581 ataggattat gaaccaagaa ccaggtgagt catctttttt ggactgggct tcagacccat 11641 attcatgcaa tttaccacaa tctcaaaata taaccaccat gataaaaaat ataacagcaa 11701 ggaatgtatt acaagattca ccaaatccat tattatctgg attattcaca aatacaatga 11761 tagaggaaga tgaagaatta gctgagttcc tgatggacag gaaggtaatt ctccctagag 11821 ttgcacatga tattctagat aattctctca caggaattag aaatgctata gctggaatgt 11881 tagatacgac aaaatcatta attcgggttg gcataaatag aggaggactg acatatagtt 11941 tgttgagaaa aatcagtaat tatgatctag tgcaatatga aacactaagt aggactttgc 12001 gattaattgt aagtgataaa atcagatatg aagatatgtg ttcagtagac cttgccatag 12061 cattgcgaca aaagatgtgg attcatttat caggaggaag gatgataagt ggacttgaaa 12121 cacctgaccc attagaatta ttatctgggg tggtaataac aggatcagaa cattgtaaaa 12181 tatgttattc ttcagatggc acaaacccat atacttggat gtatttaccg ggtaatatca 12241 aaatagggtc agcagaaaca ggtgtgtcat cattaagagt tccttatttt ggatcagtca 12301 ctgatgaaag atctgaggca caattaggat atatcaagaa tcttagtaaa cctgcaaaag 12361 ccgcaataag aatagcaatg atatatacat gggcatttgg taatgatgag atatcttgga 12421 tggaagcctc acagatagca caaacacgtg caaattttac actagatagt ctcaaaattt 12481 taacaccggt agctacatca acaaatttat cacacagatt aaaagatact gcaactcaga 12541 tgaaattctc cagtacatca ttaatcagag tcagcagatt cataacaatg tccaatgata 12601 acatgtctat taaagaagct aatgaaacca aagataccaa tcttatttac caacaaataa 12661 tgttaacagg attaagtgtt ttcgaatatt tatttagatt aaaagaaacc acaggacaca 12721 atcctatagt tatgcatctg cacattgaag atgaatgttg tattaaagaa agttttaatg 12781 atgaacatat taatccagag tctacattag aattaattcg atatcctgaa agtaatgaat 12841 ttatttatga taaagaccca ctcaaagatg tggatttgtc aaaacttatg gttattaaag 12901 accattctta tacaattgat atgaattatt gggatgatac tgacatcata catgcaattt 12961 caatatgtac tgcaattaca atagcagata ctatgtcaca attagatcga gataatttaa 13021 aagagataat agtcattgca aatgacgatg atattaatag cttaatcact gaatttttga 13081 ctcttgacat acttgtattt cttaagacat ttggtggatt attagtaaat caatttgcat 13141 acactcttta tagtttaaaa atagaaggta gagatctcat ttgggattat ataatgagaa 13201 cactgagaga tacttcacat tcaatattaa aagtattatc taatgcatta tctcatccta 13261 aagtatttaa gaggttctgg gattgtggag ttttaaaccc tatttatggt cctaatactg 13321 ctagtcaaga ccagataaaa cttgctctct ctatatgtga atatgcacta gatctattta 13381 tgagagaatg gttgaatggt gtatcacttg aaatatacat ttgtgacagt gatatggaag 13441 ttgcaaatga taggaaacaa gcctttattt ctagacatct ttcatttgtt tgttgtttag 13501 cagaaattgc atcttttgga cctaacctgt taaacttaac atacttagag agacttgatc 13561 tattgaaaca atatcttgaa ttaaatatta aagaagaccc tactcttaaa tatgtacaaa 13621 tatctggatt attaattaaa tcgttcccat caactgtaac atacgtaaga aagactgcaa 13681 tcaaatattt aaggattcgc ggtattagtc cacctgaagt aattgatgat tgggatccga 13741 tagaagatga aaatatgctg gataacattg tcaaaactat aaatgataac tgtaataaag 13801 ataataaagg gaataaaatt aacaatttct ggggactagc acttaagaac tatcaagtcc 13861 ttaaaatcag atctataaca agtgattctg atgataatga tagactagat gctagtacaa 13921 gtggtttgac acttcctcaa ggagggaatt atctatcgca tcaattgaga ttattcggaa 13981 tcaacagcac tagttgtttg aaagctcttg agttatcaca aattttaatg aaggaagtta 14041 ataaagacaa ggacaggctc ttcctgggag aaggagcagg agctatgcta gcatgttatg 14101 atgccacatt aggacctgca attaattatt ataattcagg tttgaatata acagatgtaa 14161 ttggtcaacg agaattgaaa atattccctt cagaggtatc attagtaggt aaaaaattag 14221 gaaatgtgac acagattctt aacagggtta aagtactgtt caatgggaat cccaattcaa 14281 catggatagg aaatatggaa tgtgagagct taatatggag tgaattaaat gataagtcta 14341 ttggattagt acattgtgat atggaaggag ctatcggtaa atcagaagaa actgttctac 14401 atgaacatta tagtgttata agaattacat acttgattgg agatgatgat gttgttttag 14461 tttccaaaat tatacctaca atcactccga attggtctag aatactttat ctatataaat 14521 tatattggaa agatgtaagt ataatatcac ttaaaacttc taatcctgca tcaacagaat 14581 tatatctaat ttcgaaagat gcatattgta ctataatgga acctagtgaa gttgttttat 14641 caaaacttaa aagattgtca ctcttggaag aaaataatct attaaaatgg atcattttat 14701 caaagaagag gaataatgaa tggttacatc atgaaatcaa agaaggagaa agagattatg 14761 gagttatgag accatatcat atggcattac aaatctttgg atttcaaatc aacttaaatc 14821 atctggcgaa agaattttta tcaactccag atctgaccaa tatcaacaat ataatccaaa 14881 gttttcagcg aacaatcaaa gatgtcttgt ttgaatggat taatataact catgatgaca 14941 agagacataa attaggcggg agatataaca tattcccact aaaaaataag ggaaagttaa 15001 gactgttatc gagaagacta gtattaagtt ggatttcatt atcattatcg actcgattgc 15061 ttacaggccg ctttcctgat gaaaaatttg aacatagagc acagactgga tatgtatcat 15121 tagctgatac tgatttagaa tcattaaaat tattgtcgaa aaacatcatt aagaattata 15181 gagagtgtat aggatcaata tcatattggt ttctaaccaa agaagttaaa atacttatga 15241 aattgattgg tggtgctaaa ttattaggaa ttcccagaca atataaagaa cccgaagaac 15301 agttattaga aaactacaat caacatgatg aatttgatat cgattaagac ataaatacaa 15361 taaagatata tcttaacctt tatctttgag cccaaggata gacaaaaagt aagaaaaaca 15421 tgtaatatat atataccaag cagagttctt ctcttgtttg gt // LOCUS NC_001498 15894 bp cRNA linear VRL 13-AUG-2018 DEFINITION Measles virus, complete genome. ACCESSION NC_001498 VERSION NC_001498.1 DBLINK Project: 15025 BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Measles morbillivirus ORGANISM Measles morbillivirus Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; Mononegavirales; Paramyxoviridae; Orthoparamyxovirinae; Morbillivirus. REFERENCE 1 AUTHORS Rima,B.K. and Duprex,W.P. TITLE The measles virus replication cycle JOURNAL Curr. Top. Microbiol. Immunol. 329, 77-102 (2009) PUBMED 19198563 REFERENCE 2 AUTHORS Takeuchi,K., Miyajima,N., Kobune,F. and Tashiro,M. TITLE Comparative nucleotide sequence analyses of the entire genomes of B95a cell-isolated and vero cell-isolated measles viruses from the same patient JOURNAL Virus Genes 20 (3), 253-257 (2000) PUBMED 10949953 REFERENCE 3 (bases 1807 to 2705) AUTHORS Wardrop,E.A. and Briedis,D.J. TITLE Characterization of V protein in measles virus-infected cells JOURNAL J. Virol. 65 (7), 3421-3428 (1991) PUBMED 2041073 REFERENCE 4 (bases 1748 to 3402) AUTHORS Cattaneo,R., Kaelin,K., Baczko,K. and Billeter,M.A. TITLE Measles virus editing provides an additional cysteine-rich protein JOURNAL Cell 56 (5), 759-764 (1989) PUBMED 2924348 REFERENCE 5 (bases 1748 to 3402) AUTHORS Bellini,W.J., Englund,G., Richardson,C.D., Rozenblatt,S. and Lazzarini,R.A. TITLE Matrix genes of measles virus and canine distemper virus: cloning, nucleotide sequences, and deduced amino acid sequences JOURNAL J. Virol. 58 (2), 408-416 (1986) PUBMED 3754588 REFERENCE 6 (bases 1748 to 3402) AUTHORS Bellini,W.J., Englund,G., Rozenblatt,S., Arnheiter,H. and Richardson,C.D. TITLE Measles virus P gene codes for two proteins JOURNAL J. Virol. 53 (3), 908-919 (1985) PUBMED 3882996 REFERENCE 7 (bases 1 to 15894) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (01-AUG-2000) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 8 (bases 1 to 15894) AUTHORS Takeuchi,K., Tanabayashi,K. and Tashiro,M. TITLE Direct Submission JOURNAL Submitted (10-JUL-1998) Kaoru Takeuchi, National Institute of Infectious Diseases, Viral Disease and Vaccine Contorol; 4-7-1 Gakuen, Musashi-murayama, Tokyo 208-0011, Japan (E-mail:ktake@nih.go.jp, Tel:81-42-561-0771(ex.530), Fax:81-42-567-5631) COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The reference sequence was derived from AB016162. Sequence updated (21-Jul-1998) Sequence updated (11-Dec-1998). COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..15894 /organism="Measles morbillivirus" /mol_type="viral cRNA" /strain="Ichinose-B95a" /db_xref="taxon:11234" gene 56..1744 /gene="N" /locus_tag="MeVgp1" /db_xref="GeneID:1489804" mRNA 56..1744 /gene="N" /locus_tag="MeVgp1" /product="gene N" /db_xref="GeneID:1489804" CDS 108..1685 /gene="N" /locus_tag="MeVgp1" /codon_start=1 /product="nucleocapsid protein" /protein_id="NP_056918.1" /db_xref="GeneID:1489804" /translation="MATLLRSLALFKRNKDKPPITSGSGGAIRGIKHIIIVPIPGDSSI TTRSRLLDRLVRLIGNPDVSGPKLTGALIGILSLFVESPGQLIQRITDDPDVSIRLLEV VQSDQSQSGLTFASRGTNMEDEADQYFSHDDPSSSDQSRSGWFENKEISDIEVQDPEGF NMILGTILAQIWVLLAKAVTAPDTAADSELRRWIKYTQQRRVVGEFRLERKWLDVVRNR IAEDLSLRRFMVALILDIKRTPGNKPRIAEMICDIDTYIVEAGLASFILTIKFGIETMY PALGLHEFAGELSTLESLMNLYQQMGETAPYMVILENSIQNKFSAGSYPLLWSYAMGVG VELENSMGGLNFGRSYFDPAYFRLGQEMVRRSAGKVSSTLASELGITAEDARLVSEIAM HTTEDRISRAVGPRQAQVSFLHGDQSENELPGLGGKEDRRVKQGRGEARESYRETGSSR ASDARAAHPPTSMPLDIDTASESGQDPQDSRRSADALLRLQAMAGILEEQGSDTDTPRV YNDRDLLD" gene 1748..3402 /gene="P/V/C" /locus_tag="MeVgp2" /gene_synonym="P" /db_xref="GeneID:1489805" mRNA 1748..3402 /gene="P/V/C" /locus_tag="MeVgp2" /gene_synonym="P" /product="P/V/C" /experiment="cDNA sequencing" /note="three proteins are translated from this mRNA; translation of phosphoprotein is initiated from an alternative site; V protein is translated from mRNA modified by the addition of a non-templated G" /citation=[4] /db_xref="GeneID:1489805" CDS 1807..3330 /gene="P/V/C" /locus_tag="MeVgp2" /gene_synonym="P" /note="subunit of RNA dependent RNA polymerase (RdRp); chaperone for Measles virus nucleocapsid protein" /codon_start=1 /product="phosphoprotein" /protein_id="NP_056919.1" /db_xref="GeneID:1489805" /translation="MAEEQARHVKNGLECIRALKAEPIGSLAVEEAMAAWSEISDNPGQ DRATCKEEEAGSSGLSKPCLSAIGSTEGGAPRIRGQGSGESDDDAETLGIPSRNLQASS TGLQCYHVYDHSGEAVKGIQDADSIMVQSGLDGDSTLSGGDDESENSDVDIGEPDTEGY AITDRGSAPISMGFRASDVETAEGGEIHELLKLQSRGNNFPKLGKTLNVPPPPNPSRAS TSETPIKKGTDARLASFGTEIASLLTGGATQCARKSPSEPSGPGAPAGNVPECVSNAAL IQEWTPESGTTISPRSQNNEEGGDYYDDELFSDVQDIKTALAKIHEDNQKIISKLESLL LLKGEVESIKKQINRQNISISTLEGHLSSIMIAIPGLGKDPNDPTADVELNPDLKPIIG RDSGRALAEVLKKPVASRQLQGMTNGRTSSRGQLLKEFQLKPIGKKVSSAVGFVPDTGP ASRSVIRSIIKSSRLEEDRKRYLMTLLDDIKGANDLAKFHQMLMKIIMK" CDS join(1807..2499,2499..2705) /gene="P/V/C" /locus_tag="MeVgp2" /gene_synonym="P" /experiment="antibody studies of V protein in tissue culture cells" /experiment="in vitro translation of synthetic RNA" /exception="RNA editing" /note="prevents interferon induced transcriptional responses in Measles virus; translated from P/V/C mRNA modified by co-transcriptional insertion of non-templated G at genome position 2499 nt" /citation=[3] /citation=[4] /codon_start=1 /product="V protein" /protein_id="YP_003873249.2" /db_xref="GeneID:1489805" /translation="MAEEQARHVKNGLECIRALKAEPIGSLAVEEAMAAWSEISDNPGQ DRATCKEEEAGSSGLSKPCLSAIGSTEGGAPRIRGQGSGESDDDAETLGIPSRNLQASS TGLQCYHVYDHSGEAVKGIQDADSIMVQSGLDGDSTLSGGDDESENSDVDIGEPDTEGY AITDRGSAPISMGFRASDVETAEGGEIHELLKLQSRGNNFPKLGKTLNVPPPPNPSRAS TSETPIKKGHRREIGLIWNGDRVFIDRWCNPMCSKVTLGTIRARCTCGECPRVCEQCRT DTGVDTRIWYHNLPEIPE" CDS 1829..2389 /gene="P/V/C" /locus_tag="MeVgp2" /gene_synonym="P" /experiment="antibody studies of C protein in tissue culture cells" /note="prevents interferon-induced transcriptional responses; acts as infectivity factor; inhibits transcription; translated from overlapping reading frame in P/V/C mRNA using alternative start site" /citation=[6] /codon_start=1 /product="C protein" /protein_id="NP_056920.1" /db_xref="GeneID:1489805" /translation="MSKTDWNASGLSRPSPSAHWPSRKPWQHGQKYQTTQDRTEPPARK RRQAVRVSANHASQQLDQLKAVHLASAVRDLEKAMTTLKLWESPQEISRHQALGYSVIM FMITAVKRLRESKMLTLSWFNQALMVIAPSQEETMNLKTAMWILANLIPRDMLSLTGDL LPSLWGSGLLMLKLQKEGRSTSS" gene 3406..4872 /gene="M" /locus_tag="MeVgp3" /db_xref="GeneID:1489803" mRNA 3406..4872 /gene="M" /locus_tag="MeVgp3" /product="M gene" /db_xref="GeneID:1489803" CDS 3438..4445 /gene="M" /locus_tag="MeVgp3" /note="hydrophobic protein on inner leaflet of membrane; inhibits transcription" /codon_start=1 /product="matrix protein" /protein_id="NP_056921.1" /db_xref="GeneID:1489803" /translation="MTEIYDFDKSAWDIKGSIAPIQPTTYSDGRLVPQVRVIDPGLGDR KDECFMYMFLLGVVEDSDPLGPPIGRAFGSLPLGVGRSTAKPEELLKEATELDIVVRRT AGLNEKLVFYNNTPLTLLTPWRKVLTTGSVFNANQVCNAVNLIPLDTPQRFRVVYMSIT RLSDNGYYTVPRRMLEFRSVNAVAFNLLVTLRIDKAIGPGKIIDNAEQLPEATFMVHIG NFRRKKSEVYSADYCKMKIEKMGLVFALGGIGGTSLHIRSTGKMSKTLHAQLGFKKTLC YPLMDINEDLNRLLWRSRCKIVRIQAVLQPSVPQEFRIYDDVIINDDQGLFKVL" gene 4876..7247 /gene="F" /locus_tag="MeVgp4" /db_xref="GeneID:1489800" mRNA 4876..7247 /gene="F" /locus_tag="MeVgp4" /product="F gene" /db_xref="GeneID:1489800" CDS 5458..7110 /gene="F" /locus_tag="MeVgp4" /note="type I transmembrane glycoprotein; synthesized as precurser peptide F0; cleaved to a disulphide linked F1-F2 complex by furin-like protease; acylated" /codon_start=1 /product="fusion protein" /protein_id="NP_056922.1" /db_xref="GeneID:1489800" /translation="MGLKVNVSAIFMAVLLTLQTPTGQIHWGNLSKIGVVGIGSASYKV MTRSSHQSLVIKLMPNITLLNNCTRVEIAEYRRLLRTVLEPIRDALNAMTQNIRPVQSV ASSRRHKRFAGVVLAGAALGVATAAQITAGIALHQSMLNSQAIDNLRASLETTNQAIEA IRQAGQEMILAVQGVQDYINNELIPSMNQLSCDLIGQKLGLKLLRYYTEILSLFGPSLR DPISAEISIQALSYALGGDINKVLEKLGYSGGDLLGILESRGIKARITHVDTESYFIVL SIAYPTLSEIKGVIVHRLEGVSYNIGSQEWYTTVPKYVATQGYLISNFDESSCTFMPEG TVCSQNALYPMSPLLQECLRGSTKSCARTLVSGSFGNRFILSQGNLIANCASILCKCYT TGTIINQDPDKILTYIAADHCPVVEVNGVTIQVGSRRYPDAVYLHRIDLGPPISLERLD VGTNLGNAIAKLEDAKELLESSDQILRSMKGLSSTSIVYILIAVCLGGLIGIPALICCC RGRCNKKGEQVGMSRPGLKPDLTGTSKSYVRSL" gene 7251..9208 /gene="H" /locus_tag="MeVgp5" /db_xref="GeneID:1489801" mRNA 7251..9208 /gene="H" /locus_tag="MeVgp5" /product="H gene" /db_xref="GeneID:1489801" CDS 7271..9124 /gene="H" /locus_tag="MeVgp5" /note="type II transmembrane glycoprotein; attachment glycoprotein that interacts directly with entry receptors" /codon_start=1 /product="hemagglutinin protein" /protein_id="NP_056923.1" /db_xref="GeneID:1489801" /translation="MSPQRDRINAFYKDNPHPKGSRIVINREHLMIDRPYVLLAVLFVM FLSLIGLLAIAGIRLHRAAIYTAEIHKSLSTNLDVTNSIEHQVKDVLTPLFKIIGDEVG LRTPQRFTDLVKFISDKIKFLNPDREYDFRDLTWCINPPERIKLDYDQYCADVAAEELM NALVNSTLLEARATNQFLAVSKGNCSGPTTIRGQFSNMSLSLLDLYLSRGYNVSSIVTM TSQGMYGGTYLVGKPNLSSKGSELSQLSMHRVFEVGVIRNPGLGAPVFHMTNYFEQPVS NDFSNCMVALGELKFAALCHREDSITIPYQGSGKGVSFQLVKLGVWKSPTDMRSWVPLS TDDPVIDRLYLSSHRGVIADNQAKWAVPTTRTDDKLRMETCFQQACKGKNQALCENPEW APLKDNRIPSYGVLSVNLSLTVELKIKIASGFGPLITHGSGMDLYKTNHNNVYWLTIPP MKNLALGVINTLEWIPRFKVSPNLFTVPIKEAGEDCHAPTYLPAEVDGDVKLSSNLVIL PGQDLQYVLATYDTSRVEHAVVYYVYSPSRSFSYFYPFRLPIKGVPIELQVECFTWDKK LWCRHFCVLADSESGGHITHSGMVGMGVSCTVTREDGTNRR" gene 9212..15854 /gene="L" /locus_tag="MeVgp6" /db_xref="GeneID:1489802" mRNA 9212..15854 /gene="L" /locus_tag="MeVgp6" /product="L gene" /db_xref="GeneID:1489802" CDS 9234..15785 /gene="L" /locus_tag="MeVgp6" /note="large subunit of RNA dependent RNA polymerase (RdRp); RNA synthesis, capping, and polyadenylation" /codon_start=1 /product="large polymerase protein" /protein_id="NP_056924.1" /db_xref="GeneID:1489802" /translation="MDSLSVNQILYPEVHLDSPIVTNKIVAILEYARVPHAYSLEDPTL CQNIKHRLKNGFSNQMIINNVEVGNVIKSKLRSYPAHSHIPYPNCNQDLFNIEDKESTR KIRELLKKGNSLYSKVSDKVFQCLRDTNSRLGLGSELREDIKEKIINLGVYMHSSQWFE PFLFWFTVKTEMRSVIKSQTHTCHRRRHTPVFFTGSSVELLISRDLVAIISKESQHVYY LTFELVLMYCDVIEGRLMTETAMTIDARYAELLGRVRYMWKLIDGFFPALGNPTYQIVA MLEPLSLAYLQLRDITVELRGAFLNHCFTEIHDVLDQNGFSDEGTYHELIEALDYIFIT DDIHLTGEIFSFFRSFGHPRLEAVTAAENVRKYMNQPKVIVYETLMKGHAIFCGIIING YRDRHGGSWPPLTLPLHAADTIRNAQASGEGLTHEQCVDNWKSFAGVRFGCFMPLSLDS DLTMYLKDKALAALQREWDSVYPKEFLRYDPPKGTGSRRLVDVFLNDSSFDPYDMIMYV VSGAYLHDPEFNLSYSLKEKEIKETGRLFAKMTYKMRACQVIAENLISNGIGKYFKDNG MAKDEHDLTKALHTLAVSGVPKDLKESHRGGPVLKTYSRSPVHTSTRNVKAEKGFVGFP HVIRQNQDTDHPENIETYETVSAFITTDLKKYCLNWRYETISLFAQRLNEIYGLPSFFQ WLHKRLETSVLYVSDPHCPPDLDAHVPLCKVPNDQIFIKYPMGGIEGYCQKLWTISTIP YLYLAAYESGVRIASLVQGDNQTIAVTKRVPSTWPYNLKKREAARVTRDYFVILRQRLH DIGHHLKANETIVSSHFFVYSKGIYYDGLLVSQSLKSIARCVFWSETIVDETRAACSNI ATTMAKSIERGYDRYLAYSLNVLKVIQQILISLGFTINSTMTRDVVIPLLTNNDLLIRM ALLPAPIGGMNYLNMSRLFVRNIGDPVTSSIADLKRMILASLMPEETLHQVMTQQPGDS SFLDWASDPYSANLVCVQSITRLLKNITARFVLIHSPNPMLKGLFHDDSKEEDERLAAF LMDRHIIVPRAAHEILDHSVTGARESIAGMLDTTKGLIRASMRKGGLTSRVITRLSNYD YEQFRAGMVLLTGRKRNVLIDKESCSVQLARALRSHMWARLARGRPIYGLEVPDVLESM RGHLIRRHETCVICECGSVNYGWFFVPSGCQLDDIDKETSSLRVPYIGSTTDERTDMKL AFVRAPSRSLRSAVRIATVYSWAYGDDDSSWNEAWLLARQRANVSLEELRVITPISTST NLAHRLRDRSTQVKYSGTSLVRVARYTTISNDNLSFVISDKKVDTNFIYQQGMLLGLGV LETLFRLEKDTGSSNTVLHLHVETDCCVIPMIDHPRIPSSRKLELRAELCTNPLIYDNA PLIDRDATRLYTQSHRRHLVEFVTWSTPQLYHILAKSTALSMIDLVTKFEKDHMNEISA LIGDDDINSFITEFLLIEPRLFTIYLGQCAAINWAFDVHYHRPSGKYQMGELLSSFLSR MSKGVFKVLVNALSHPKIYKKFWHCGIIEPIHGPSLDAQNLHTTVCNMVYTCYMTYLDL LLNEELEEFTFLLCESDEDVVPDRFDNIQAKHLCVLADLYCQPGTCPPIRGLRPVEKCA VLTDHIKAEARLSPAGSSWNINPIIVDHYSCSLTYLRRGSIKQIRLRVDPGFIFDALAE VNVSQPKVGSNNISNMSIKDFRPPHDDVAKLLKDINTSKHNLPISGGSLANYEIHAFRR IGLNSSACYKAVEISTLIRRCLEPGEDGLFLGEGSGSMLITYKEILKLNKCFYNSGVSA NSRSGQRELAPYPSEVGLVEHRMGVGNIVKVLFNGRPEVTWVGSIDCFNFIVSNIPTSS VGFIHSDIETLPNKDTIEKLEELAAILSMALLLGKIGSILVIKLMPFSGDFVQGFISYV GSHYREVNLVYPRYSNFISTESYLVMTDLKANRLMNPEKIKQQIIESSVRTSPGLIGHI LSIKQLSCIQAIVGGAVSRGDINPILKKLTPIEQVLISCGLAINGPKLCKELIHHDVAS GQDGLLNSILILYRELARFKDNQRSQQGMFHAYPVLVSSRQRELVSRITRKFWGHILLY SGNRKLINRFIQNLKSGYLVLDLHQNIFVKNLSKSEKQIIMTGGLKREWVFKVTVKETK EWYKLVGYSALIKD" ORIGIN 1 accaaacaaa gttgggtaag gatagatcaa tcaatgatca tattctagta cacttaggat 61 tcaagatcct attatcaggg acaagagcag gattagggat atccgagatg gccacacttt 121 tgaggagctt agcattgttc aaaagaaaca aggacaaacc acccattaca tcaggatccg 181 gtggagccat cagaggaatc aaacacatta ttatagtacc aattcctgga gattcctcaa 241 ttaccactcg atccagacta ctggaccggt tggtcaggtt aattggaaac ccggatgtga 301 gcgggcccaa actaacaggg gcactaatag gtatattatc cttatttgtg gagtctccag 361 gtcaattgat tcagaggatc accgatgacc ctgacgttag catcaggctg ttagaggttg 421 ttcagagtga ccagtcacaa tctggcctta ccttcgcatc aagaggtacc aacatggagg 481 atgaggcgga ccaatacttt tcacatgatg atccaagcag tagtgatcaa tccaggtccg 541 gatggttcga gaacaaggaa atctcagata ttgaagtgca agaccctgag ggattcaaca 601 tgattctggg taccattcta gcccagatct gggtcttgct cgcaaaggcg gttacggccc 661 cagacacggc agctgattcg gagctaagaa ggtggataaa gtacacccaa caaagaaggg 721 tagttggtga atttagattg gagagaaaat ggttggatgt ggtgaggaac aggattgccg 781 aggacctctc tttacgccga ttcatggtgg ctctaatcct ggatatcaag aggacacccg 841 ggaacaaacc taggattgct gaaatgatat gtgacattga tacatatatc gtagaggcag 901 gattagccag ttttatcctg actattaagt ttgggataga aactatgtat cctgctcttg 961 gactgcatga atttgctggt gagttatcca cacttgagtc cttgatgaat ctttaccagc 1021 aaatgggaga aactgcaccc tacatggtaa tcctagagaa ctcaattcag aacaagttca 1081 gtgcaggatc ataccctctg ctctggagct atgccatggg agtaggagtg gaacttgaaa 1141 actccatggg aggtttgaac tttggtcgat cttactttga tccagcatat tttagattag 1201 ggcaagagat ggtgaggagg tcagctggaa aggtcagttc cacattggca tccgaactcg 1261 gtatcactgc cgaggatgca aggcttgttt cagagattgc aatgcatact actgaggaca 1321 ggatcagtag agcggtcgga cccagacaag cccaagtgtc atttctacac ggtgatcaaa 1381 gtgagaatga gctaccagga ttggggggca aggaagatag gagggtcaaa cagggtcggg 1441 gagaagccag ggagagctac agagaaaccg ggtccagcag agcaagtgat gcgagagctg 1501 cccatcctcc aaccagcatg cccctagaca ttgacactgc atcggagtca ggccaagatc 1561 cgcaggacag tcgaaggtca gctgacgccc tgctcaggct gcaagccatg gcaggaatct 1621 tggaagaaca aggctcagac acggacaccc ctagggtata caatgacaga gatcttctag 1681 actaggtgcg agaggccgag gaccagaaca acatccgcct accctccatc attgttataa 1741 aaaacttagg aaccaggtcc acacagccgc cagccaacca accatccact cccacgactg 1801 gagccgatgg cagaagagca ggcacgccat gtcaaaaacg gactggaatg catccgggct 1861 ctcaaggccg agcccatcgg ctcactggcc gtcgaggaag ccatggcagc atggtcagaa 1921 atatcagaca acccaggaca ggaccgagcc acctgcaagg aagaggaggc aggcagttcg 1981 ggtctcagca aaccatgcct ctcagcaatt ggatcaactg aaggcggtgc acctcgcatc 2041 cgcggtcagg gatctggaga aagcgatgac gacgctgaaa ctttgggaat cccctcaaga 2101 aatctccagg catcaagcac tgggttacag tgttatcatg tttatgatca cagcggtgaa 2161 gcggttaagg gaatccaaga tgctgactct atcatggttc aatcaggcct tgatggtgat 2221 agcaccctct caggaggaga cgatgaatct gaaaacagcg atgtggatat tggcgaacct 2281 gataccgagg gatatgctat cactgaccgg ggatctgctc ccatctctat ggggttcagg 2341 gcttctgatg ttgaaactgc agaaggaggg gagatccacg agctcctgaa actccaatcc 2401 agaggcaaca actttccgaa gcttgggaaa actctcaatg ttcctccgcc cccgaacccc 2461 agtagggcca gcacttccga gacacccatt aaaaagggca cagacgcgag attggcctca 2521 tttggaacgg agatcgcgtc tttattgaca ggtggtgcaa cccaatgtgc tcgaaagtca 2581 ccctcggaac catcagggcc aggtgcacct gcggggaatg tccccgagtg tgtgagcaat 2641 gccgcactga tacaggagtg gacacccgaa tctggtacca caatctcccc gagatcccag 2701 aataatgaag aagggggaga ctattatgat gatgagctgt tctccgatgt ccaagacatc 2761 aaaacagcct tggccaaaat acacgaggat aatcagaaga taatctccaa gctagaatca 2821 ttgctgttat tgaagggaga agttgagtca attaagaagc agatcaacag gcaaaatatc 2881 agcatatcca ccctggaagg acacctctca agcatcatga ttgccattcc tggacttggg 2941 aaggatccca acgaccccac tgcagatgtc gaactcaatc ccgacctgaa acccatcata 3001 ggcagagatt caggccgagc actggccgaa gttctcaaga agcccgttgc cagccgacaa 3061 ctccagggaa tgactaatgg acggaccagt tccagaggac agctgctgaa ggaatttcaa 3121 ctaaagccga tcgggaaaaa ggtgagctca gccgtcgggt ttgttcctga caccggccct 3181 gcatcacgca gtgtaatccg ctccattata aaatccagcc ggctagagga ggatcggaag 3241 cgttacctga tgactctcct tgatgatatc aaaggagcca acgatcttgc caagttccac 3301 cagatgctga tgaagataat aatgaagtag ctacagctca acttacctgc caaccccatg 3361 ccagtcgacc taattagtac aacctaaatc cattataaaa aacttaggag caaagtgatt 3421 gcctcctaag ttccacaatg acagagatct acgatttcga caagtcggca tgggacatca 3481 aagggtcgat cgctccgata caacctacca cctacagtga tggcaggctg gtgccccagg 3541 tcagagtcat agatcctggt ctaggtgata ggaaggatga atgctttatg tacatgtttc 3601 tgctgggggt tgttgaggac agcgatcccc tagggcctcc aatcgggcga gcattcgggt 3661 ccctgccctt aggtgttggt agatccacag caaaacccga ggaactcctc aaagaggcca 3721 ctgagcttga catagttgtt agacgtacag cagggctcaa tgaaaaactg gtgttctaca 3781 acaacacccc actaaccctc ctcacacctt ggagaaaggt cctaacaaca gggagtgtct 3841 tcaatgcaaa ccaagtgtgc aatgcggtta atctaatacc gctggacacc ccgcagaggt 3901 tccgtgttgt ttatatgagc atcacccgtc tttcggataa cgggtattac accgttccca 3961 gaagaatgct ggaattcaga tcggtcaatg cagtggcctt caacctgcta gtgaccctta 4021 ggattgacaa ggcgattggc cctgggaaga tcatcgacaa tgcagagcaa cttcctgagg 4081 caacatttat ggtccacatc gggaacttca ggagaaagaa gagtgaagtc tactctgccg 4141 attattgcaa aatgaaaatc gaaaagatgg gcctggtttt tgcacttggt gggatagggg 4201 gcaccagtct tcacattaga agcacaggca aaatgagcaa gactctccat gcacaactcg 4261 ggttcaagaa gaccttatgt tacccactga tggatatcaa tgaagacctt aatcggttac 4321 tctggaggag cagatgcaag atagtaagaa tccaggcagt tttgcagcca tcagttcctc 4381 aagaattccg catttacgac gacgtgatca taaatgatga ccaaggacta ttcaaagttc 4441 tgtagaccgc agtgcccagc aatacccgaa aacgaccccc ctcataatga cagccagaag 4501 gcccggacaa aaaagccccc tccagaagac tccacggacc aagcgagagg ccagccagca 4561 gccgacagca agtgtggaca ccaggcggcc caagcacaga acagccccga cacaaggcca 4621 ccaccagcca tcccaatctg cgtcctcctc gtgggacccc cgaggaccaa ccccgaaggt 4681 cgctccgaac acagaccacc aaccgcatcc ccacagctcc cgggaaagga acccccagca 4741 actggaaggc ccctcccccc ctcccccaac gcaagaaccc cacaaccgaa ccgcacaagc 4801 gaccgaggtg acccaaccgc aggcatccga ctccttagac agatcctctc cccccggcat 4861 actaaacaaa acttagggcc aaggaacaca cacactcgac agaacccaga ccccggcccg 4921 cggcaccgcg cccccacccc ccgaaaacca gagggagccc ccaaccaaac ccgccggccc 4981 ccccggtgcc cacaggtagg cacaccaacc cccgaccaga cccagcaccc agccaccgac 5041 aatccaagac ggggggcccc ccccaaaaaa aggcccccag gggccgacag ccagcatcgc 5101 gaggaagcac acccacccca cacacgacca cggcaaccga accagagtcc agaccaccct 5161 gggccaccag ctcccagact cggccatcac cccgcaaaaa ggaaaggcca caacccgcgc 5221 accccagccc cgatccggcg ggcagccact caacccgaac cagcacccaa gagcgatccc 5281 tgggggaccc ccaaaccgca aaagacatca gtatcccaca gcctctccaa gtcccccggt 5341 ctcctcctct tctcgaaggg accaaaagat caatccacca catccgacga cactcaattc 5401 cccaccccca aaggagacac cgggaatccc agaatcaaga ctcatccagt gtccatcatg 5461 ggtctcaagg tgaacgtctc tgccatattc atggcagtac tgttaactct ccaaacaccc 5521 accggtcaaa tccattgggg caatctctct aagatagggg tggtagggat aggaagtgca 5581 agctacaaag ttatgactcg ttccagccat caatcattgg tcataaaatt aatgcccaat 5641 ataactctcc tcaataactg cacgagggta gagatcgcag aatacaggag actactgaga 5701 acagttttgg aaccaattag agatgcactt aatgcaatga cccagaatat aagaccggtt 5761 cagagtgtag cttcaagtag gagacacaag agatttgcgg gagttgtcct ggcaggtgcg 5821 gccctaggcg ttgccacagc tgctcagata acagccggca ttgcacttca ccagtccatg 5881 ctgaactctc aagccatcga caatctgaga gcaagcctgg aaactactaa tcaggcaatt 5941 gaggcaatca gacaagcagg gcaggagatg atattggctg ttcagggtgt ccaagactac 6001 atcaataatg agctgatacc gtctatgaac caactatctt gtgatttaat cggccagaag 6061 ctagggctca aattgctcag atactataca gaaatcctgt cattatttgg ccccagctta 6121 cgggacccca tatctgcgga gatatccatc caggctttga gctatgcgct tggaggagat 6181 atcaataagg tattagaaaa gctcggatac agtggaggtg atttactggg catcttagag 6241 agcagaggaa taaaggcccg gataactcac gtcgacacag agtcctactt cattgtactc 6301 agtatagcct atccgacgct gtccgagatt aagggggtga ttgtccaccg gctagagggg 6361 gtctcgtaca atataggctc tcaagagtgg tataccactg tgcccaagta tgttgcaacc 6421 caagggtacc ttatctcgaa ttttgatgag tcatcgtgta ctttcatgcc agaggggact 6481 gtgtgcagcc aaaatgcctt gtacccgatg agtcctctgc tccaagaatg cctccggggg 6541 tccaccaagt cctgtgctcg tacactcgta tccgggtctt ttgggaaccg gttcatttta 6601 tcacaaggga acctaatagc caattgtgca tcaatcctct gcaagtgtta cacaacagga 6661 acgatcatta atcaagaccc tgacaagatc ctaacataca ttgctgccga tcactgcccg 6721 gtggtcgagg tgaacggtgt gaccatccaa gtcgggagca ggaggtatcc ggacgcggtg 6781 tacctgcaca gaattgacct cggtcctccc atatcattgg agaggttgga cgtagggaca 6841 aatctgggga atgcaattgc taagttggag gatgccaagg aattgttgga gtcatcggac 6901 cagatattga ggagtatgaa aggtttatcg agcactagca tagtttacat cctgattgca 6961 gtgtgtcttg gagggttgat agggatcccc gctttaatat gttgctgcag ggggcgctgt 7021 aacaaaaagg gagaacaagt tggtatgtca agaccaggcc taaagcctga tcttacaggg 7081 acatcaaaat cctatgtaag gtcgctctga tcctctacaa ctcttgaaac acagatttcc 7141 cacaagtctc ctcttcgtca tcaagcaacc accgcatcca gcatcaagcc cacctgaaat 7201 tgtctccggc ttccctctgg ccgaacgata tcggtagtta attaaaactt agggtgcaag 7261 atcatccaca atgtcaccac aacgagaccg aataaatgcc ttctacaaag acaacccaca 7321 tcctaaggga agtaggatag ttattaacag agaacatctt atgattgata gaccttatgt 7381 tttgctggct gttctattcg tcatgtttct gagcttgatc gggttgctag ccattgcagg 7441 cattagactc catcgtgcag ccatctacac cgcagagatc cataagagcc tcagcaccaa 7501 tctagatgta actaactcga tcgagcatca ggtcaaggac gtgctgacac cactcttcaa 7561 gatcattggt gatgaagtgg gcctgaggac acctcagaga ttcactgacc tagtgaaatt 7621 catctctgac aaaattaaat tccttaatcc ggatagggag tacgacttca gagatctcac 7681 ttggtgtatc aacccgccag agagaatcaa attggattat gatcaatact gtgcagatgt 7741 ggctgctgaa gaactcatga atgcattggt gaactcaact ctactggagg ccagggcaac 7801 caatcagttc ctagctgtct caaagggaaa ctgctcaggg cccactacaa tcagaggtca 7861 attctcaaac atgtcgctgt ccctgttgga cttgtattta agtcgaggtt acaatgtgtc 7921 atctatagtc actatgacat cccagggaat gtacggggga acttacctag tgggaaagcc 7981 taatctgagc agtaaagggt cagagttgtc acaactgagc atgcaccgag tgtttgaagt 8041 aggggttatc agaaatccgg gtttgggggc tccggtgttc catatgacaa actattttga 8101 gcaaccagtc agtaatgatt tcagcaactg catggtggct ttgggggagc ttaaattcgc 8161 agccctctgt cacagggaag attctatcac aattccctat caggggtcag ggaaaggtgt 8221 cagcttccag ctcgtcaagc taggtgtctg gaaatcccca accgacatgc gatcctgggt 8281 ccccctatca acggatgatc cagtgataga taggctttac ctctcatctc acagaggtgt 8341 tatcgctgac aatcaagcaa aatgggctgt cccgacaaca cggacagatg acaagttgcg 8401 aatggagaca tgcttccagc aggcgtgtaa gggtaaaaac caagcactct gcgagaatcc 8461 cgagtgggca ccattgaagg ataacaggat tccttcatac ggggtcttgt ctgttaatct 8521 gagtctgaca gttgagctta aaatcaaaat tgcttcagga ttcgggccat tgatcacaca 8581 cggttcaggg atggacctat acaaaaccaa ccacaacaat gtgtattggc tgactatccc 8641 gccaatgaag aacctagcct taggtgtaat caacacattg gagtggatac cgagattcaa 8701 ggttagtccc aacctcttca ctgttccaat caaggaagca ggcgaggact gccatgcccc 8761 aacataccta cctgcggagg tggatggtga tgtcaaactc agttccaatc tggtaattct 8821 acctggtcag gatctccaat atgttttggc aacctacgat acttccaggg ttgaacatgc 8881 tgtggtttat tatgtttaca gcccaagccg ctcattttct tacttttatc cttttaggtt 8941 gcctataaag ggggtcccaa tcgaattaca agtggaatgc ttcacatggg acaaaaaact 9001 ctggtgccgt cacttctgtg tgcttgcgga ctcagaatct ggtggacata tcactcactc 9061 tgggatggtg ggcatgggag tcagctgcac agtcactcgg gaagatggaa ccaatcgcag 9121 atagggctgc cagtgaaccg atcacatgat gtcacccaga catcaggcat acccactagt 9181 gtgaaataga catcagaatt aagaaaaacg tagggtccaa gtggtttccc gttatggact 9241 cgctatctgt caaccagatc ttataccctg aagttcacct agatagcccg atagttacca 9301 ataagatagt agctatcctg gagtatgctc gagtccctca cgcttacagc ctggaggacc 9361 ctacactgtg tcagaacatc aagcaccgcc taaaaaacgg attctccaac caaatgatta 9421 taaacaatgt ggaagttggg aatgtcatca agtccaagct taggagttat ccggcccact 9481 ctcatattcc atatccaaat tgtaatcagg atttatttaa catagaagac aaagagtcaa 9541 caaggaagat ccgtgagctc ctaaaaaagg gaaattcgct gtactccaaa gtcagtgata 9601 aggttttcca atgcctgagg gacactaact cacggcttgg cctaggctcc gaattgaggg 9661 aggacatcaa ggagaaaatt attaacttgg gagtttacat gcacagctcc caatggtttg 9721 agccctttct gttttggttt acagtcaaga ctgagatgag gtcagtgatt aaatcacaaa 9781 cccatacttg ccataggagg agacacacac ctgtattctt cactggtagt tcagttgagc 9841 tgttaatctc tcgtgacctt gttgctataa tcagtaagga gtctcaacat gtatattacc 9901 tgacgtttga actggttttg atgtattgtg atgtcataga ggggaggtta atgacagaga 9961 ccgctatgac cattgatgct aggtatgcag aacttctagg aagagtcaga tacatgtgga 10021 aactgataga tggtttcttc cctgcactcg ggaatccaac ttatcaaatt gtagccatgc 10081 tggagccact ttcacttgct tacctgcaac tgagggatat aacagtagaa ctcagaggtg 10141 ctttccttaa ccactgcttt actgaaatac atgatgttct tgaccaaaac gggttttctg 10201 atgaaggtac ttatcatgag ttaattgaag ccctagatta cattttcata actgatgaca 10261 tacatctgac aggggagatt ttctcatttt tcagaagttt cggccacccc agacttgaag 10321 cagtaacggc tgctgaaaat gtcaggaaat acatgaatca gcctaaagtc attgtgtatg 10381 agactctgat gaaaggtcat gccatatttt gtggaatcat aatcaacggc tatcgtgaca 10441 ggcacggagg cagttggcca cccctgaccc tccccctgca tgctgcagac acaatccgga 10501 atgctcaagc ttcaggtgaa gggttaacac atgagcagtg cgttgataac tggaaatcat 10561 ttgctggagt gagatttggc tgttttatgc ctcttagcct ggacagtgat ctgacaatgt 10621 acctaaagga caaggcactt gctgctctcc aaagggaatg ggattcagtt tacccgaaag 10681 agttcctgcg ttacgatcct cccaagggaa ccgggtcacg gaggcttgta gatgttttcc 10741 ttaatgattc gagctttgac ccatatgata tgataatgta tgtcgtaagt ggagcctacc 10801 tccatgaccc tgagttcaac ctgtcttaca gcctgaaaga aaaggagatc aaggaaacag 10861 gtagactttt cgctaaaatg acttacaaaa tgagggcatg ccaagtgatc gctgaaaatc 10921 taatctcaaa cgggattggc aagtatttta aggacaatgg gatggccaag gatgagcacg 10981 atttgactaa ggcactccac actctggctg tctcaggagt ccccaaagat ctcaaagaaa 11041 gtcacagggg ggggccagtc ttaaaaacct actcccgaag cccagtccac acaagtacca 11101 ggaacgttaa agcagaaaaa gggtttgtag gattccctca tgtaattcgg cagaatcaag 11161 acactgatca tccggagaat atagaaacct acgagacagt cagcgcattt atcacgactg 11221 atctcaagaa gtactgcctt aattggagat atgagaccat cagcttattt gcacagaggc 11281 taaatgagat ttacggatta ccctcatttt ttcagtggct gcataagagg cttgaaacct 11341 ctgtcctcta tgtaagtgac cctcattgcc cccccgacct tgacgcccat gtcccgttat 11401 gcaaagtccc caatgaccaa atcttcatca agtaccctat gggaggtata gaagggtatt 11461 gtcagaagct gtggaccatc agcaccattc cctacttata cctggctgct tatgagagcg 11521 gggtaaggat tgcttcgtta gtgcaagggg acaatcagac catagccgta acaaaaaggg 11581 tacccagcac atggccttac aaccttaaga aacgggaagc tgctagagta actagagatt 11641 actttgtaat tcttaggcaa aggctacatg acattggcca tcacctcaag gcaaatgaga 11701 caattgtttc atcacatttt tttgtctatt caaaaggaat atattatgat gggctacttg 11761 tgtcccaatc actcaagagc atcgcaagat gtgtattctg gtcagagact atagttgatg 11821 aaacaagggc agcatgcagt aatattgcta caacaatggc taaaagcatc gagagaggtt 11881 atgaccgtta tcttgcatat tccctgaacg tcctaaaagt gatacagcaa attttgatct 11941 ctcttggctt cacaatcaat tcaaccatga cccgagatgt agtcataccc ctcctcacaa 12001 acaacgatct cttaataagg atggcactgt tgcccgctcc tattgggggg atgaattatc 12061 tgaatatgag caggctgttt gtcagaaaca tcggtgatcc agtaacatca tcaattgctg 12121 atctcaagag aatgattctc gcatcactaa tgcctgaaga gaccctccat caagtaatga 12181 cacaacaacc gggggactct tcattcctag actgggctag cgacccttac tcagcaaatc 12241 ttgtatgcgt ccagagcatc actagactcc tcaagaacat aactgcaagg tttgtcctaa 12301 tccatagtcc aaacccaatg ttaaaagggt tattccatga tgacagtaaa gaagaggacg 12361 agagactggc ggcattcctc atggacaggc atattatagt acctagggca gctcatgaaa 12421 tcctggatca tagtgtcaca ggggcaagag agtctattgc aggcatgcta gataccacaa 12481 aaggcctgat tcgagccagc atgaggaagg gggggttaac ctctcgagtg ataaccagat 12541 tgtccaatta tgactatgaa caatttagag cagggatggt gctattgaca ggaagaaaga 12601 gaaatgtcct cattgacaaa gagtcatgtt cagtgcagct ggctagagcc ctaagaagcc 12661 atatgtgggc aagactagct cgaggacggc ctatttacgg ccttgaggtc cctgatgtac 12721 tagaatctat gcgaggccac cttattcggc gtcatgagac atgtgtcatc tgcgagtgtg 12781 gatcagtcaa ctacggatgg ttttttgtcc cctcgggttg ccaactggat gatattgaca 12841 aggaaacatc atccttgaga gtcccatata ttggttctac cactgatgag agaacagaca 12901 tgaagctcgc cttcgtaaga gccccaagta gatccttgcg atctgccgtt agaatagcaa 12961 cagtgtactc atgggcttac ggtgatgatg atagctcttg gaacgaagcc tggttgttgg 13021 caaggcaaag ggccaatgtg agcctggagg agctaagggt gatcactccc atctcgactt 13081 cgactaattt agcgcatagg ttgagggatc gtagcactca agtgaaatac tcaggtacat 13141 cccttgtccg agtggcaagg tataccacaa tctccaacga caatctctca tttgtcatat 13201 cagataagaa ggttgatact aactttatat accaacaagg aatgcttcta gggttgggtg 13261 ttttagaaac attgtttcga ctcgagaaag atactggatc atctaacacg gtattacatc 13321 ttcacgtcga aacagattgt tgcgtgatcc cgatgataga tcatcccagg atacccagct 13381 cccgcaagct agagctgagg gcagagctat gtaccaaccc attgatatat gataatgcac 13441 ctttaattga cagagatgca acaaggctat acacccagag ccataggagg caccttgtgg 13501 aatttgttac atggtccaca ccccaactat atcacattct agctaagtcc acagcactat 13561 ctatgattga cctggtaaca aaatttgaga aggaccatat gaatgaaatt tcagctctca 13621 taggggatga cgatatcaat agtttcataa ctgagtttct gcttatagag ccaagattat 13681 tcaccatcta cttgggccag tgtgcagcca tcaattgggc atttgatgta cattatcata 13741 gaccatcagg gaaatatcag atgggtgagc tgttgtcttc gttcctttct agaatgagca 13801 aaggagtgtt taaggtgctt gtcaatgctc taagccaccc aaagatctac aagaaattct 13861 ggcattgtgg tattatagag cctatccatg gtccttcact tgatgctcaa aacttgcaca 13921 caactgtgtg caacatggtt tacacatgct atatgaccta cctcgacctg ttgttgaatg 13981 aagagttaga agagttcaca tttcttttgt gtgaaagcga tgaggatgta gtaccggaca 14041 gattcgacaa catccaggca aaacacttgt gtgttctggc agatttgtac tgtcaaccag 14101 ggacctgccc accgattcga ggtctaaggc cggtagagaa atgtgcagtt ctaaccgatc 14161 atatcaaggc agaggctagg ttatctccag caggatcttc gtggaacata aatccaatta 14221 ttgtagacca ttactcatgc tctctgactt atctccgtcg aggatctatc aaacagataa 14281 gattgagagt tgatccagga ttcatttttg acgccctcgc tgaggtaaat gtcagtcagc 14341 caaaggtcgg cagcaacaac atctcaaata tgagcatcaa ggatttcaga cctccacacg 14401 atgatgttgc aaaattgctc aaagatatca acacaagcaa gcacaatctt cccatttcag 14461 ggggtagtct cgccaattat gaaatccatg ctttccgcag aatcgggtta aactcatctg 14521 cttgctacaa agctgttgag atatcaacat taattaggag atgccttgag ccaggggaag 14581 acggcttgtt cttgggtgag gggtcgggtt ctatgttgat cacttataag gagatactaa 14641 aactaaacaa gtgcttctat aatagtgggg tttccgccaa ttctagatct ggtcaaaggg 14701 aattagcacc ctatccctcc gaagttggcc ttgtcgaaca cagaatggga gtaggtaata 14761 ttgtcaaggt gctctttaac gggaggcccg aagtcacgtg ggtaggcagt atagattgct 14821 tcaatttcat agtcagtaat atccctacct ctagtgtggg gtttatccat tcagatatag 14881 agaccttacc taacaaagat actatagaga agctagagga attggcagcc atcttatcga 14941 tggctctact ccttggcaaa ataggatcaa tactggtgat taagcttatg cctttcagcg 15001 gggattttgt tcagggattt ataagctatg tagggtctca ttatagagaa gtgaaccttg 15061 tctaccctag gtacagcaac ttcatatcta ctgaatctta tttagttatg acagatctca 15121 aagctaaccg gctaatgaat cctgaaaaga tcaagcagca gataattgaa tcatctgtgc 15181 ggacttcacc tggacttata ggtcacatcc tatccattaa gcaactaagc tgcatacaag 15241 caattgtggg aggcgcagtt agtagaggtg atatcaaccc tattctgaaa aaacttacac 15301 ctatagagca ggtgctgatc agttgcgggt tggcaattaa cggacctaaa ctgtgcaaag 15361 aattaatcca ccatgatgtt gcctcagggc aagatggatt gcttaactct atactcatcc 15421 tctacaggga gttggcaaga ttcaaagaca accaaagaag tcaacaaggg atgttccacg 15481 cttaccccgt attggtaagt agtaggcaac gagaacttgt atctaggatc actcgcaaat 15541 tttgggggca tattcttctt tactccggga acagaaagtt gataaatcgg tttatccaga 15601 atctcaagtc cggttatcta gtactagact tacaccagaa tatcttcgtt aagaatctat 15661 ccaagtcaga gaaacagatt attatgacgg ggggtttaaa acgtgagtgg gtttttaagg 15721 taacagtcaa ggagaccaaa gaatggtaca agttagtcgg atacagcgct ctgattaagg 15781 attaattggt tgaactccgg aaccctaatc ctgccctagg tagttaggca ttatttgcaa 15841 tatattaaag aaaactttga aaatacgaag tttctattcc cagctttgtc tggt // LOCUS NC_006577 29926 bp ss-RNA linear VRL 13-AUG-2018 DEFINITION Human coronavirus HKU1, complete genome. ACCESSION NC_006577 VERSION NC_006577.2 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human coronavirus HKU1 (HCoV-HKU1) ORGANISM Human coronavirus HKU1 Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Embecovirus. REFERENCE 1 (bases 1 to 29926) AUTHORS Woo,P.C., Lau,S.K., Chu,C.M., Chan,K.H., Tsoi,H.W., Huang,Y., Wong,B.H., Poon,R.W., Cai,J.J., Luk,W.K., Poon,L.L., Wong,S.S., Guan,Y., Peiris,J.S. and Yuen,K.Y. TITLE Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia JOURNAL J. Virol. 79 (2), 884-895 (2005) PUBMED 15613317 REFERENCE 2 (bases 1 to 29926) AUTHORS Woo,P.C.Y., Lau,S.K.P., Chan,K.H., Tsoi,H.W., Huang,Y., Wong,B.H.L., Cai,J.J., Wong,S.S.Y., Peiris,J.S.M., Chu,C.M. and Yuen,K.Y. TITLE Direct Submission JOURNAL Submitted (18-JAN-2006) Department of Microbiology, The University of Hong Kong, University Pathology Building, Queen Mary Hospital, Hong Kong, China REMARK Sequence update by submitter REFERENCE 3 (bases 1 to 29926) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (28-DEC-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 4 (bases 1 to 29926) AUTHORS Woo,P.C.Y., Lau,S.K.P., Chan,K.H., Tsoi,H.W., Huang,Y., Wong,B.H.L., Cai,J.J., Wong,S.S.Y., Peiris,J.S.M., Chu,C.M. and Yuen,K.Y. TITLE Direct Submission JOURNAL Submitted (11-APR-2004) Department of Microbiology, The University of Hong Kong, University Pathology Building, Queen Mary Hospital, Hong Kong, China COMMENT VALIDATED REFSEQ: This record has undergone validation or preliminary review. The reference sequence was derived from AY597011. On Jan 23, 2006 this sequence version replaced NC_006577.1. The mature peptides were designated nsp1 through nsp16 following the nomenclature used in the MHV-A59 and SARS-CoV RefSeqs (NC_001846 and NC_004718, respectively). COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..29926 /organism="Human coronavirus HKU1" /mol_type="genomic RNA" /isolate="HKU1" /isolation_source="patient with pneumonia" /host="Homo sapiens" /db_xref="taxon:290028" /genotype="A" gene 206..21753 /gene="orf1ab" /locus_tag="HCHV1gp1" /db_xref="GeneID:3200429" CDS join(206..13600,13600..21753) /gene="orf1ab" /locus_tag="HCHV1gp1" /ribosomal_slippage="" /note="translated via -1 ribosomal frameshift" /codon_start=1 /product="orf1ab polyprotein" /protein_id="YP_173236.1" /db_xref="GeneID:3200429" /translation="MIKTSKYGLGFKWAPEFRWLLPDAAEELASPMKSDEGGLCPSTGQ AMESVGFVYDNHVKIDCRCILGQEWHVQSNLIRDIFVHEDLHVVEVLTKTAVKSGTAIL IKSPLHSLGGFPKGYVMGLFRSYKTKRYVVHHLSMTTSTTNFGEDFLGWIVPFGFMPSY VHKWFQFCRLYIEESDLIISNFKFDDYDFSVEDAYAEVHAEPKGKYSQKAYALLRQYRG IKPVLFVDQYGCDYSGKLADCLQAYGHYSLQDMRQKQSVWLANCDFDIVVAWHVVRDSR FVMRLQTIATICGIKYVAQPTEDVVDGDVVIREPVHLLSADAIVLKLPSLMKVMTHMDD FSIKSIYNVDLCDCGFVMQYGYVDCFNDNCDFYGWVSGNMMDGFSCPLCCTVYDSSEVK AQSSGVIPENPVLFTNSTDTVNHDSFNLYGYSVTPFGSCIYWSPRPGLWIPIIKSSVKS YDDLVYSGVVGCKSIVKETALITHALYLDYVQCKCGNLEQNHILGVNNSWCRQLLLNRG DYNMLLKNIDLFVKRRADFACKFAVCGDGFVPFLLDGLIPRSYYLIQSGIFFTSLMSQF SQEVSDMCLKMCILFMDRVSVATFYIEHYVNRLVTQFKLLGTTLVNKMVNWFNTMLDAS APATGWLLYQLLNGLFVVSQANFNFVALIPDYAKILVNKFYTFFKLLLECVTVDVLKDM PVLKTINGLVCIVGNKFYNVSTGLIPGFVLPCNAQEQQIYFFEGVAESVIVEDDVIENV KSSLSSYEYCQPPKSVEKICIIDNMYMGKCGDKFFPIVMNDKNICLLDQAWRFPCAGRK VNFNEKPVVMEIPSLMTVKVMFDLDSTFDDILGKVCSEFEVEKGVTVDDFVAVVCDAIE NALNSCKEHPVVGYQVRAFLNKLNENVVYLFDEAGDEAMASRMYCTFAIEDVEDVISSE AVEDTIDGVVEDTINDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDED VVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNDDE DVVTGDNDDEDVVTGDNDDEDVVTGDNDDEDVVTGDNNDEEIVTGDNDDQIVVTGDDVD DIESIYDFDTYKALLVFNDVYNDALFVSYGSSVETETYFKVNGLWSPTITHTNCWLRSV LLVMQKLPFKFKDLAIENMWLSYKVGYNQSFVDYLLTTIPKAIVLPQGGFVADFAYWFL NQFDINAYANWCCLKCGFSFDLNGLDALFFYGDIVSHVCKCGHNMTLIAADLPCTLHFS LFDDNFCAFCTPKKIFIAACAVDVNVCHSVAVIGDEQIDGKFVTKFSGDKFDFIVGYGM SFSMSSFELPQLYGLCITPNVCFVKGDIINVARLVKADVIVNPANGHMLHGGGVAKAIA VAAGKKFSKETAAMVKSKGVCQVGDCYVSTGGKLCKTILNIVGPDARQDGRQSYVLLAR AYKHLNNYDCCLSTLISAGIFSVPADVSLTYLLGVVDKQVILVSNNKEDFDIIQKCQIT SVVGTKALAVRLTANVGRVIKFETDAYKLFLSGDDCFVSNSSVIQEVLLLRHDIQLNND VRDYLLSKMTSLPKDWRLINKFDVINGVKTVKYFECPNSIYICSQGKDFGYVCDGSFYK ATVNQVCVLLAKKIDVLLTVDGVNFKSISLTVGEVFGKILGNVFCDGIDVTKLKCSDFY ADKILYQYENLSLADISAVQSSFGFDQQQLLAYYNFLTVCKWSVVVNGPFFSFEQSHNN CYVNVACLMLQHINLKFNKWQWQEAWYEFRAGRPHRLVALVLAKGHFKFDEPSDATDFI RVVLKQADLSGAICELELICDCGIKQESRVGVDAVMHFGTLAKTDLFNGYKIGCNCAGR IVHCTKLNVPFLICSNTPLSKDLPDDVVAANMFMGVGVGHYTHLKCGSPYQHYDACSVK KYTGVSGCLTDCLYLKNLTQTFTSMLTNYFLDDVEMVAYNPDLSQYYCDNGKYYTKPII KAQFKPFAKVDGVYTNFKLVGHDICAQLNDKLGFNVDLPFVEYKVTVWPVATGDVVLAS DDLYVKRYFKGCETFGKPVIWFCHDEASLNSLTYFNKPSFKSENRYSVLSVDSVSEESQ GNVVTSVMESQISTKEVKLKGVRKTVKIEDAIIVNDENSSIKVVKSLSLVDVWDMYLTG CDYVVWVANELSRLVKSPTVREYIRYGIKPITIPIDLLCLRDDNQTLLVPKIFKARAIE FYGFLKWLFIYVFSLLHFTNDKTIFYTTEIASKFTFNLFCLALKNAFQTFRWSIFIKGF LVVATVFLFWFNFLYINVIFSDFYLPNISVFPIFVGRIVMWIKATFGLVTICDFYSKLG VGFTSHFCNGSFICELCHSGFDMLDTYAAIDFVQYEVDRRVLFDYVSLVKLIVELVIGY SLYTVWFYPLFCLIGLQLFTTWLPDLFMLETMHWLIRFIVFVANMLPAFVLLRFYIVVT AMYKVVGFIRHIVYGCNKAGCLFCYKRNCSVRVKCSTIVGGVIRYYDITANGGTGFCVK HQWNCFNCHSFKPGNTFITVEAAIELSKELKRPVNPTDASHYVVTDIKQVGCMMRLFYD RDGQRVYDDVDASLFVDINNLLHSKVKVVPNLYVVVVESDADRANFLNAVVFYAQSLYR PILLVDKKLITTACNGISVTQTMFDVYVDTFMSHFDVDRKSFNNFVNIAHASLREGVQL EKVLDTFVGCVRKCCSIDSDVETRFITKSMISAVAAGLEFTDENYNNLVPTYLKSDNIV AADLGVLIQNGAKHVQGNVAKAANISCIWFIDAFNQLTADLQHKLKKACVKTGLKLKLT FNKQEASVPILTTPFSLKGGVVLSNLLYILFFVSLICFILLWALLPTYSVYKSDIHLPA YASFKVIDNGVVRDISVNDLCFANKFFQFDQWYESTFGSVYYHNSMDCPIVVAVMDEDI GSTMFNVPTKVLRHGFHVLHFLTYAFASDSVQCYTPHIQISYNDFYASGCVLSSLCTMF KRGDGTPHPYCYSDGVMKNASLYTSLVPHTRYSLANSNGFIRFPDVISEGIVRIVRTRS MTYCRVGACEYAEEGICFNFNSSWVLNNDYYRSMPGTFCGRDLFDLFYQFFSSLIRPID FFSLTASSIFGAILAIVVVLVFYYLIKLKRAFGDYTSVVVINVVVWCINFLMLFVFQVY PICACVYACFYFYVTLYFPSEISVIMHLQWIVMYGAIMPFWFCVTYVAMVIANHVLWLF SYCRKIGVNVCSDSTFEETSLTTFMITKDSYCRLKNSVSDVAYNRYLSLYNKYRYYSGK MDTAAYREAACSQLAKAMETFNHNNGNDVLYQPPTASVSTSFLQSGIVKMVSPTSKIEP CIVSVTYGSMTLNGLWLDDKVYCPRHVICSSSNMNEPDYSALLCRVTLGDFTIMSGRMS LTVVSYQMQGCQLVLTVSLQNPYTPKYTFGNVKPGETFTVLAAYNGRPQGAFHVTMRSS YTIKGSFLCGSCGSVGYVLTGDSVKFVYMHQLELSTGCHTGTDFTGNFYGPYRDAQVVQ LPVKDYVQTVNVIAWLYAAILNNCAWFVQNDVCSTEDFNVWAMANGFSQVKADLVLDAL ASMTGVSIETLLAAIKRLYMGFQGRQILGSCTFEDELAPSDVYQQLAGVKLQSKTKRFI KETIYWILISTFLFSCIISAFVKWTIFMYINTHMIGVTLCVLCFVSFMMLLVKHKHFYL TMYIIPVLCTLFYVNYLVVYKEGFRGFTYVWLSYFVPAVNFTYVYEVFYGCILCVFAIF ITMHSINHDIFSLMFLVGRIVTLISMWYFGSNLEEDVLLFITAFLGTYTWTTILSLAIA KIVANWLSVNIFYFTDVPYIKLILLSYLFIGYILSCYWGFFSLLNSVFRMPMGVYNYKI SVQELRYMNANGLRPPRNSFEAILLNLKLLGIGGVPVIEVSQIQSKLTDVKCANVVLLN CLQHLHVASNSKLWQYCSVLHNEILSTSDLSVAFDKLAQLLIVLFANPAAVDTKCLASI DEVSDDYVQDSTVLQALQSEFVNMASFVEYEVAKKNLADAKNSGSVNQQQIKQLEKACN IAKSVYERDKAVARKLERMADLALTNMYKEARINDKKSKVVSALQTMLFSMVRKLDNQA LNSILDNAVKGCVPLSAIPALAANTLTIVIPDKQVFDKVVDNVYVTYAGSVWHIQTVQD ADGINKQLTDISVDSNWPLVIIANRYNEVANAVMQNNELMPHKLKIQVVNSGSDMNCNI PTQCYYNNGSSGRIVYAVLSDVDGLKYTKIMKDDGNCVVLELDPPCKFSIQDVKGLKIK YLYFIKGCNTLARGWVVGTLSSTIRLQAGVATEYAANSSILSLCAFSVDPKKTYLDYIQ QGGVPIINCVKMLCDHAGTGMAITIKPEATINQDSYGGASVCIYCRARVEHPDVDGICK LRGKFVQVPLGIKDPILYVLTHDVCQVCGFWRDGSCSCVGSSVAVQSKDLNFLNRVRGT SVNARLVPCASGLSTDVQLRAFDICNTNRAGIGLYYKVNCCRFQRIDDDGNKLDKFFVV KRTNLEVYNKEKTYYELTKSCGVVAEHDFFTFDIDGSRVPHIVRRNLSKYTMLDLCYAL RHFDRNDCSILCEILCEYADCKESYFSKKDWYDFVENPDIINIYKKLGPIFNRALLNTV IFADTLVEVGLVGVLTLDNQDLYGQWYDFGDFIQTAPGFGVAVADSYYSYMMPMLTMCH VLDCELFVNDSYRQFDLVQYDFTDYKLELFNKYFKYWGMKYHPNTVDCDNDRCIIHCAN FNILFSMVLPNTCFGPLVRQIFVDGVPFVVSIGYHYKELGVVMNLDVDTHRYRLSLKDL LLYAADPAMHVASASALLDLRTCCFSVAAITSGIKFQTVKPGNFNQDFYEFVKSKGLFK EGSTVDLKHFFFTQDGNAAITDYNYYKYNLPTMVDIKQLLFVLEVVYKYFEIYDGGCIP ASQVIVNNYDKSAGYPFNKFGKARLYYEALSFEEQNEIYAYTKRNVLPTLTQMNLKYAI SAKNRARTVAGVSILSTMTGRMFHQKCLKSIAATRGVPVVIGTTKFYGGWDDMLRHLIK DVDNPVLMGWDYPKCDRAMPNILRIVSSLVLARKHEFCCSHGDRFYRLANECAQVLSEI VMCGGCYYVKPGGTSSGDATTAFANSVFNICQAVTANVCSLMACNGHKIEDLSIRNLQK RLYSNVYRTDYVDYTFVNEYYEFLCKHFSMMILSDDGVVCYNSDYASKGYIANISVFQQ VLYYQNNVFMSESKCWVENDITNGPHEFCSQHTMLVKIDGDYVYLPYPDPSRILGAGCF VDDLLKTDSVLLIERFVSLAIDAYPLVHHENEEYQKVFRVYLEYIKKLYNDLGTQILDS YSVILSTCDGLKFTEESFYKNMYLKSAVMQSVGACVVCSSQTSLRCGSCIRKPLLCCKC CYDHVMATNHKYVLSVSPYVCNAPNCDVSDVTKLYLGGMSYYCENHKPHYSFKLVMNGM VFGLYKQSCTGSPYIDDFNKIASCKWTEVDDYVLANECIERLKLFAAETQKATEEAFKQ SYASATIQEIVSDREVILCWETGKVKPPLNKNYVFTGYHFTSTGKTVLGEYVFDKSELT NGVYYRATTTYKLSIGDVFVLTSHSVASLSAPTLVPQENYASIRFSSVYSVPLVFQNNV ANYQHIGMKRYCTVQGPPGTGKSHLAIGLAVYYYTARVVYTAASHAAVDALCEKAYKFL NINDCTRIIPAKVRVDCYDKFKINDTTCKYVFTTINALPELVTDIVVVDEVSMLTNYEL SVINARIKAKHYVYIGDPAQLPAPRVLLSKGSLEPRHFNSITKIMCCLGPDIFLGNCYR CPKEIVETVSALVYDNKLKAKNDNSSLCFKVYFKGQTTHESSSAVNIQQIYLISKFLKA NPVWNSAVFISPYNSQNYVAKRVLGVQTQTVDSAQGSEYDYVIYSQTAETAHSVNVNRF NVAITRAKKGIFCVMSNMQLFESLNFITLPLDKIQNQTLPRLHCTTNLFKDCSKSCLGY HPAHAPSFLAVDDKYKVNENLAVNLNICEPVLTYSRLISLMGFKLDLTLDGYSKLFITK DEAIKRVRGWVGFDVEGAHATRENIGTNFPLQIGFSTGVDFVVEATGLFAERDCYTFKK TVAKAPPGEKFKHLIPLMSKGQKWDIVRIRIVQMLSDYLLDLSDSVVFITWSASFELTC LRYFAKLGRELNCNVCSNRATCYNSRTGYYGCWRHSYTCDYVYNPLIVDIQQWGYTGSL TSNHDIICNVHKGAHVASADAIMTRCLAIYDCFCKSVNWNLEYPIISNEVSINTSCRLL QRVMLKAAMLCNRYNLCYDIGNPKGLACVKDYEFKFYDAFPVAKSVKQLFYVYDVHKDN FKDGLCMFWNCNVDKYPSNSIVCRFDTRVLNKLNLPGCNGGSLYVNKHAFHTNPFTRTV FENLKPMPFFYYSDTPCVYVDGLESKQVDYVPLRSATCITRCNLGGAVCSKHAEEYCNY LESYNIVTTAGFTFWVYKNFDFYNLWNTFTTLQSLENVIYNLVNVGHYDGRTGELPCAI MNDKVVVKINNVDTVIFKNNTSFPTNIAVELFTKRSIRHHPELKILRNLNIDICWKHVL WDYVKDSLFCSSTYGVCKYTDLKFIENLNILFDGRDTGALEAFRKARNGVFISTEKLSR LSMIKGPQRADLNGVIVDKVGELKVEFWFAMRKDGDDVIFSRTDSLCSSHYWSPQGNLG GNCAGNVIGNDALTRFTIFTQSRVLSSFEPRSDLERDFIDMDDNLFIAKYGLEDYAFDH IVYGSFNHKVIGGLHLLIGLFRRKKKSNLLIQEFLQYDSSIHSYFITDQECGSSKSVCT VIDLLLDDFVSIVKSLNLSCVSKVVNINVDFKDFQFMLWCNDNKIMTFYPKMQATNDWK PGYSMPVLYKYLNVPLERVSLWNYGKPINLPTGCMMNVAKYTQLCQYLNTTTLAVPVNM RVLHLGAGSDKEVAPGSAVLRQWLPSGSILVDNDLNPFVSDSLVTYFGDCMTLPFDCHW DLIISDMYDPLTKNIGDYNVSKDGFFTYICHLIRDKLSLGGSVAIKITEFSWNADLYKL MSCFAFWTVFCTNVNASSSEGFLIGINYLGKSSFEIDGNVMHANYLFWRNSTTWNGGAY SLFDMTKFSLKLAGTAVVNLRPDQLNDLVYSLIERGKLLVRDTRKEIFVGDSLVNTC" mat_peptide 206..871 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="Leader protein" /note="PL1-PRO cleavage product; nsp1" /protein_id="YP_460018.1" mat_peptide 872..2632 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp2" /note="PL1-PRO cleavage product" /protein_id="YP_459934.1" mat_peptide 2633..8719 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp3" /inference="similar to AA sequence:RefSeq:NP_740609.2" /note="acidic tandem repeats (ATR), papain-like proteinase 1 domain (PL1pro), Appr-1'-p processing enzyme, papain-like proteinase 2 domain (PL2pro) and a hydrophobic domain (HD)" /protein_id="YP_460024.1" misc_feature 3038..3517 /gene="orf1ab" /locus_tag="HCHV1gp1" /note="Region: contains 14 acidic tandem repeats (ATR)" mat_peptide 8720..10207 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp4 (TM2)" /inference="similar to AA sequence:RefSeq:NP_001012459.1" /note="transmembrane domain" /protein_id="YP_459935.1" mat_peptide 10208..11116 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp5" /note="3C-like proteinase; 3CL-PRO cleavage product" /protein_id="YP_459936.1" mat_peptide 11117..11977 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp6 (hydrophobic domain)" /note="3CL-PRO cleavage product" /protein_id="YP_460019.1" mat_peptide 11978..12253 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp7" /inference="similar to AA sequence:RefSeq:NP_740612.1" /note="3CL-PRO cleavage product" /protein_id="YP_459938.1" mat_peptide 12254..12835 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp8" /inference="similar to AA sequence:RefSeq:NP_740613.1" /note="3CL-PRO cleavage product" /protein_id="YP_460020.1" mat_peptide 12836..13165 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp9" /inference="similar to AA sequence:RefSeq:NP_828867.1" /note="3CL-PRO cleavage product" /protein_id="YP_459943.1" mat_peptide 13166..13576 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp10" /inference="similar to AA sequence:RefSeq:NP_740615.1" /note="growth factor like protein; 3CL-PRO cleavage product" /protein_id="YP_459939.1" mat_peptide join(13577..13600,13600..16359) /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp12" /inference="similar to AA sequence:RefSeq:NP_740616.1" /note="RNA-dependent RNA polymerase; RdRp; pol; 3CL-PRO cleavage" /protein_id="YP_459941.1" mat_peptide 16360..18168 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp13" /inference="similar to AA sequence:RefSeq:NP_740617.1" /note="NTPase/helicase domain; 3CL-PRO cleavage product" /protein_id="YP_459942.1" mat_peptide 18169..19731 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp14" /inference="similar to AA sequence:RefSeq:NP_740618.1" /note="nuclease ExoN homolog; 3CL-PRO cleavage product" /protein_id="YP_460021.1" mat_peptide 19732..20853 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp15" /inference="similar to AA sequence:RefSeq:NP_740619.1" /note="NendoU; 3CL-PRO cleavage product" /protein_id="YP_460022.1" mat_peptide 20854..21750 /gene="orf1ab" /locus_tag="HCHV1gp1" /product="nsp16" /inference="similar to AA sequence:RefSeq:NP_740620.1" /note="2'-0-ribose methyltransferase; 3CL-PRO cleavage product" /protein_id="YP_460023.1" gene 21773..22933 /gene="HE" /locus_tag="HCHV1gp2" /db_xref="GeneID:3200425" CDS 21773..22933 /gene="HE" /locus_tag="HCHV1gp2" /codon_start=1 /product="hemagglutinin-esterase glycoprotein" /protein_id="YP_173237.1" /db_xref="GeneID:3200425" /translation="MLIIFLFFYFCYGFNEPLNVVSHLNHDWFLFGDSRSDCNHINNLK IKNFDYLDIHPSLCNNGKISSSAGDSIFKSFHFTRFYNYTGEGDQIIFYEGVNFNPYHR FKCFPNGSNDVWLLNKVRFYRALYSNMAFFRYLTFVDIPYNVSLSKFNSCKSDILSLNN PIFINYSKEVYFTLLGCSLYLVPLCLFKSNFSQYYYNIDTGSVYGFSNVVYPDLDCIYI SLKPGSYKVSTTAPFLSLPTKALCFDKSKQFVPVQVVDSRWNNERASDISLSVACQLPY CYFRNSSANYVGKYDINHGDSGFISILSGLLYNVSCISYYGVFLYDNFTSIWPYYSFGR CPTSSIIKHPICVYDFLPIILQGILLCLALLFVVFLLFLLYNDKSH" gene 22942..27012 /gene="S" /locus_tag="HCHV1gp3" /db_xref="GeneID:3200426" CDS 22942..27012 /gene="S" /locus_tag="HCHV1gp3" /codon_start=1 /product="spike glycoprotein" /protein_id="YP_173238.1" /db_xref="GeneID:3200426" /translation="MLLIIFILPTTLAVIGDFNCTNFAINDLNTTVPRISEYVVDVSYG LGTYYILDRVYLNTTILFTGYFPKSGANFRDLSLKGTTYLSTLWYQKPFLSDFNNGIFS RVKNTKLYVNKTLYSEFSTIVIGSVFINNSYTIVVQPHNGVLEITACQYTMCEYPHTIC KSKGSSRNESWHFDKSEPLCLFKKNFTYNVSTDWLYFHFYQERGTFYAYYADSGMPTTF LFSLYLGTLLSHYYVLPLTCNAISSNTDNETLQYWVTPLSKRQYLLKFDNRGVITNAVD CSSSFFSEIQCKTKSLLPNTGVYDLSGFTVKPVATVHRRIPDLPDCDIDKWLNNFNVPS PLNWERKIFSNCNFNLSTLLRLVHTDSFSCNNFDESKIYGSCFKSIVLDKFAIPNSRRS DLQLGSSGFLQSSNYKIDTTSSSCQLYYSLPAINVTINNYNPSSWNRRYGFNNFNLSSH SVVYSRYCFSVNNTFCPCAKPSFASSCKSHKPPSASCPIGTNYRSCESTTVLDHTDWCR CSCLPDPITAYDPRSCSQKKSLVGVGEHCAGFGVDEEKCGVLDGSYNVSCLCSTDAFLG WSYDTCVSNNRCNIFSNFILNGINSGTTCSNDLLQPNTEVFTDVCVDYDLYGITGQGIF KEVSAVYYNSWQNLLYDSNGNIIGFKDFVTNKTYNIFPCYAGRVSAAFHQNASSLALLY RNLKCSYVLNNISLTTQPYFDSYLGCVFNADNLTDYSVSSCALRMGSGFCVDYNSPSSS SSRRKRRSISASYRFVTFEPFNVSFVNDSIESVGGLYEIKIPTNFTIVGQEEFIQTNSP KVTIDCSLFVCSNYAACHDLLSEYGTFCDNINSILDEVNGLLDTTQLHVADTLMQGVTL SSNLNTNLHFDVDNINFKSLVGCLGPHCGSSSRSFFEDLLFDKVKLSDVGFVEAYNNCT GGSEIRDLLCVQSFNGIKVLPPILSESQISGYTTAATVAAMFPPWSAAAGIPFSLNVQY RINGLGVTMDVLNKNQKLIATAFNNALLSIQNGFSATNSALAKIQSVVNSNAQALNSLL QQLFNKFGAISSSLQEILSRLDALEAQVQIDRLINGRLTALNAYVSQQLSDISLVKFGA ALAMEKVNECVKSQSPRINFCGNGNHILSLVQNAPYGLLFMHFSYKPISFKTVLVSPGL CISGDVGIAPKQGYFIKHNDHWMFTGSSYYYPEPISDKNVVFMNTCSVNFTKAPLVYLN HSVPKLSDFESELSHWFKNQTSIAPNLTLNLHTINATFLDLYYEMNLIQESIKSLNNSY INLKDIGTYEMYVKWPWYVWLLISFSFIIFLVLLFFICCCTGCGSACFSKCHNCCDEYG GHHDFVIKTSHDD" gene 27051..27380 /gene="orf4" /locus_tag="HCHV1gp4" /db_xref="GeneID:3200427" CDS 27051..27380 /gene="orf4" /locus_tag="HCHV1gp4" /codon_start=1 /product="non-structural protein" /protein_id="YP_173239.1" /db_xref="GeneID:3200427" /translation="MDVWRPSYTHSLVIREFGVTNLEDLCLKYNYCQPIVGYCIVPLNV WCRKFGKFASHFTLRSHDISHSNNFGVVTSFTTYGNTVSEAVSRLVESASEFIVWRAEA LNKYG" gene 27373..27621 /gene="E" /locus_tag="HCHV1gp5" /db_xref="GeneID:3200430" CDS 27373..27621 /gene="E" /locus_tag="HCHV1gp5" /codon_start=1 /product="small membrane protein" /protein_id="YP_173240.1" /db_xref="GeneID:3200430" /translation="MVDLFFNDTAWYIGQILVLVLFCLISLIFVVAFLATIKLCMQLCG FCNFFIISPSAYVYKRGMQLYKSYSEQVIPPTSDYLI" gene 27633..28304 /gene="M" /locus_tag="HCHV1gp6" /db_xref="GeneID:3200428" CDS 27633..28304 /gene="M" /locus_tag="HCHV1gp6" /codon_start=1 /product="membrane glycoprotein" /protein_id="YP_173241.1" /db_xref="GeneID:3200428" /translation="MNKSFLPQFTSDQAVTFLKEWNFSLGVILLFITIILQFGYTSRSM FVYLIKMIILWLMWPLTITLTIFNCFYALNNAFLAFSIVFTIISIVIWILYFVNSIRLF IRTGSWWSFNPETNNLMCIDMKGKMFVRPVIEDYHTLTATVIRGHLYIQGVKLGTGYTL SDLPVYVTVAKVQVLCTYKRAFLDKLDVNSGFAVFVKSKVGNYRLPSSKPSGMDTALLR A" gene 28320..29645 /gene="N" /locus_tag="HCHV1gp7" /db_xref="GeneID:3200423" CDS 28320..29645 /gene="N" /locus_tag="HCHV1gp7" /codon_start=1 /product="nucleocapsid phosphoprotein" /protein_id="YP_173242.1" /db_xref="GeneID:3200423" /translation="MSYTPGHYAGSRSSSGNRSGILKKTSWADQSERNYQTFNRGRKTQ PKFTVSTQPQGNTIPHYSWFSGITQFQKGRDFKFSDGQGVPIAFGVPPSEAKGYWYRHS RRSFKTADGQQKQLLPRWYFYYLGTGPYANASYGESLEGVFWVANHQADTSTPSDVSSR DPTTQEAIPTRFPPGTILPQGYYVEGSGRSASNSRPGSRSQSRGPNNRSLSRSNSNFRH SDSIVKPDMADEIANLVLAKLGKDSKPQQVTKQNAKEIRHKILTKPRQKRTPNKHCNVQ QCFGKRGPSQNFGNAEMLKLGTNDPQFPILAELAPTPGAFFFGSKLDLVKRDSEADSPV KDVFELHYSGSIRFDSTLPGFETIMKVLEENLNAYVNSNQNTDSDSLSSKPQRKRGVKQ LPEQFDSLNLSAGTQHISNDFTPEDHSLLATLDDPYVEDSVA" gene 28342..28959 /gene="N2" /locus_tag="HCHV1gp8" /db_xref="GeneID:3200424" CDS 28342..28959 /gene="N2" /locus_tag="HCHV1gp8" /codon_start=1 /product="nucleocapsid phosphoprotein 2" /protein_id="YP_173243.1" /db_xref="GeneID:3200424" /translation="MLEVEAPLEIVQESSRKLLGLTNLSEITKPLIEAEKPNLNSLCLL NHKEILSHIIPGSPGSLNFKKVETLNFQMVKEFPLLSEYPLLKQKDIGIDTAGVLLKQL MVNKSSCYRDGISTISVPAHMPMHPMVNPSKGSSGLLITKLTLLLPPMFRQGILLLKKL SLLGFRLVRFCLKAIMLKAQEGLLLIVDQVHVLNHVDPIIVH" ORIGIN 1 gagtttgagc gattgacgtt cgtaccgtct atcagcttac gatctcttgt cagatctcat 61 taaatctaaa ctttttaaac aagattccct gttatccatg cttgtgagtg tggtttaatc 121 ataatcttgt attttacttt ccacactttt catctctctg ccagtgacgt gttggttgtc 181 ctcagcgtcc ctcccatagg tcgcaatgat taaaaccagc aaatacggtc tcggcttcaa 241 gtgggcgcca gaatttcgtt ggctgcttcc ggatgcagcg gaggagttgg ctagtcctat 301 gaagtcagat gagggtgggt tatgcccctc tactggtcaa gcgatggaaa gtgttggatt 361 cgtttatgat aatcatgtga agatagattg tcgctgcatt cttggacaag aatggcatgt 421 gcagtcaaat cttatccgtg atatttttgt tcatgaagat ctacatgttg tagaagttct 481 aactaaaaca gccgtaaagt ccggtacggc aattttaatt aaatcacctt tgcatagctt 541 gggtggtttt cctaaagggt atgttatggg cttgttccgt tcatacaaga ctaaacgtta 601 tgttgtacat catctttcta tgactacatc tactactaat tttggtgaag attttttggg 661 ttggattgta ccttttggtt ttatgccatc ttatgttcac aaatggtttc aattctgtag 721 gttgtatatt gaagagagtg atttaataat ttcaaatttt aaatttgatg attatgattt 781 tagtgtagaa gatgcttatg ctgaggttca tgctgagcct aaaggtaaat attcacaaaa 841 agcttatgct ttacttagac aatatcgtgg tattaaaccc gtactttttg tagaccagta 901 tggttgtgac tattctggta aattagcaga ttgtcttcaa gcttatggtc attattcttt 961 gcaagatatg agacaaaagc agtctgtatg gcttgccaat tgtgactttg atattgtagt 1021 ggcttggcat gtagttcgtg attcacgatt tgttatgcgc ctgcagacta tagctactat 1081 ttgtggtatt aaatatgttg cacaacctac agaagatgta gtagatggag atgtagttat 1141 acgtgaacct gtacatttat tatctgctga tgcaatagtt ttaaagcttc ctagtttgat 1201 gaaagttatg actcatatgg atgatttttc tattaaatct atatataatg ttgatttgtg 1261 tgattgtggt tttgttatgc agtatggtta tgtagattgt tttaatgata attgtgattt 1321 ttatggttgg gtttcaggta atatgatgga tggtttttct tgtccattgt gttgtacagt 1381 ttatgactct agcgaagtta aagcccaatc atctggtgtt attcctgaaa atcctgtgtt 1441 atttactaat agtactgata ctgttaacca tgattctttt aatttgtatg gttattctgt 1501 cacaccattt ggttcttgta tatattggtc gccgcgtcct ggattgtgga ttcctataat 1561 taaatcttca gtcaagtctt atgatgattt ggtttattca ggtgtagtag gttgtaaatc 1621 tattgttaaa gaaactgctc ttattactca tgcactttac ttagattatg ttcaatgtaa 1681 gtgtggtaat cttgaacaaa atcatattct tggcgttaat aattcttggt gtaggcaact 1741 gttgcttaat agaggtgatt ataatatgct tctaaaaaat attgacttgt ttgttaagcg 1801 tcgtgctgat tttgcttgca agtttgcagt ttgtggagat ggttttgtac cttttttact 1861 agatggttta attccccgta gttattatct aattcagagt ggtattttct ttacatcttt 1921 gatgtctcaa ttttcacaag aagtttctga tatgtgttta aaaatgtgta ttttgtttat 1981 ggacagagtt tcagttgcta cattttatat agagcattat gttaataggt tggttactca 2041 atttaagtta ttgggtacta cacttgttaa taaaatggtt aattggttta ataccatgtt 2101 agatgctagt gcacctgcta caggctggct tctttaccaa ttattgaatg gtctttttgt 2161 agtatctcaa gccaacttta attttgttgc tttaatacct gattatgcta aaattttagt 2221 taataaattt tacacttttt ttaagttatt attagagtgt gttacagttg atgttttaaa 2281 agatatgcct gttcttaaaa ctattaatgg tttagtttgt attgtaggca ataagtttta 2341 taacgttagt acagggttaa ttcctggttt tgttttacca tgtaatgcac aggaacaaca 2401 aatttatttt tttgaaggcg ttgcagaatc tgttatagta gaagatgatg ttattgagaa 2461 tgtcaaatct tctttatcat cttatgagta ttgtcaacca cctaaatctg tagaaaaaat 2521 ttgtattata gataatatgt acatgggtaa gtgtggtgat aaatttttcc ctattgtcat 2581 gaatgataaa aatatttgtc ttttagatca ggcttggcgt tttccatgtg caggtagaaa 2641 agttaatttt aacgagaaac ctgttgttat ggagattccg tctttgatga cagttaaggt 2701 tatgtttgat ttagattcta cttttgatga tattttaggt aaagtttgtt cagaatttga 2761 agtagaaaag ggtgttactg tagatgattt tgttgctgtt gtttgtgatg ctatagagaa 2821 tgctttaaac tcttgtaaag agcatccagt ggttggttat caagttcgtg catttttaaa 2881 taaacttaat gagaatgttg tttatttatt tgatgaggct ggtgatgaag caatggcctc 2941 tcgtatgtat tgtacttttg ctattgagga tgttgaagac gttatcagta gtgaagctgt 3001 cgaagatact attgatggtg tcgttgaaga cactattaat gacgatgaag atgttgttac 3061 tggtgacaat gacgatgaag atgttgttac tggtgacaat gacgatgaag atgttgttac 3121 tggtgacaat gacgatgaag atgttgttac tggtgacaat gacgatgaag atgttgttac 3181 tggtgacaat gacgatgaag atgttgttac tggtgacaat gacgatgaag atgttgttac 3241 tggtgacaat gacgatgaag atgttgttac tggtgacaat gacgatgaag atgttgttac 3301 tggtgacaat gacgatgaag atgttgttac tggtgacaat gacgatgaag atgttgttac 3361 tggtgacaat gacgatgaag atgttgttac tggtgacaat gacgatgaag atgttgttac 3421 tggtgacaat gacgatgaag atgttgttac tggtgacaat aacgatgaag agattgttac 3481 tggtgacaat gatgaccaaa ttgttgttac tggtgatgat gtagatgata ttgaaagtat 3541 ttatgacttt gatacttata aagctctttt agtttttaat gatgtctata atgatgcttt 3601 gtttgttagt tatggttcta gtgttgaaac agaaacatat tttaaagtta atggtttatg 3661 gtcacctact attacacata ctaattgttg gttgcgttct gtgttacttg taatgcagaa 3721 attacctttt aagtttaagg atttagctat tgaaaatatg tggttatctt ataaggtggg 3781 ttataatcaa agttttgttg attatttact gaccactatt cctaaagcta ttgttttgcc 3841 tcaaggtggt tttgtagctg attttgctta ttggttttta aaccagtttg atattaatgc 3901 gtatgctaat tggtgttgtt taaaatgtgg tttttctttt gatttaaatg gtttggatgc 3961 tttgtttttt tatggagata ttgtgtctca tgtttgtaag tgtggacata atatgactct 4021 aatagcagcg gacttacctt gtacattaca tttttcatta tttgatgaca atttttgtgc 4081 tttttgcacc cctaaaaaaa tttttattgc tgcatgtgct gtggatgtaa acgtttgtca 4141 ttctgtagct gttataggtg atgaacaaat agatggtaag tttgttacta aatttagtgg 4201 tgataaattt gattttatag taggttatgg aatgtcattt agtatgtctt cttttgagtt 4261 acctcaattg tatggtttgt gtataacacc taatgtatgt tttgttaaag gtgatattat 4321 aaatgttgct agacttgtta aagctgatgt tattgttaat cctgctaatg ggcatatgct 4381 ccatggtggt ggagttgcaa aagctatagc tgtagctgca ggtaaaaaat tttctaaaga 4441 aactgctgct atggttaaat ctaaaggtgt ttgccaagta ggagattgtt atgtttctac 4501 cggtggtaaa ttatgtaaaa caattcttaa tattgtaggc cctgatgcta gacaagatgg 4561 aagacaatct tatgttttgt tagcacgtgc ttataagcat cttaataatt atgattgttg 4621 tttgtctact ctcatatcgg ctggtatatt tagtgttcct gctgatgtgt cattaactta 4681 ccttctaggt gttgttgata aacaagttat ccttgttagt aataataaag aagattttga 4741 tattattcaa aaatgtcaaa ttacttcagt tgttggtact aaagcattgg ctgttagatt 4801 aactgctaat gtaggccgtg ttattaaatt tgagacagat gcatacaaac tttttttgag 4861 tggtgatgat tgttttgttt caaattcttc tgttatacaa gaagttttat tgcttcgtca 4921 tgatatacaa ttgaataatg acgttcgtga ttatttgttg tctaagatga ctagtcttcc 4981 taaagattgg cgtcttatca ataaatttga tgttattaac ggtgttaaaa ctgttaagta 5041 ttttgagtgt cctaattcta tttatatatg tagtcagggt aaagactttg gttatgtatg 5101 tgatggttct ttttataaag caactgttaa tcaagtttgt gttttattag ctaagaagat 5161 agatgttttg cttactgtag atggtgttaa ttttaaatct atttctctta ctgtaggtga 5221 agtttttggt aaaatacttg gtaatgtttt ctgtgatggc attgatgtta ctaagttaaa 5281 gtgtagtgat ttttatgccg ataaaatttt atatcagtat gaaaatttgt ctttagctga 5341 tatttctgct gtacaaagtt catttgggtt tgatcagcaa caattgcttg cttattataa 5401 ttttttaaca gtatgtaaat ggtctgtagt tgttaacggt ccattttttt cttttgaaca 5461 gtctcataat aattgttatg tgaatgtagc ttgtcttatg ttgcagcata ttaatcttaa 5521 atttaataaa tggcagtggc aggaagcatg gtatgaattt cgtgctggca gaccacatag 5581 gttagttgct cttgttttag ctaaaggtca ttttaaattt gatgaaccat cagatgctac 5641 tgattttatt cgtgttgttt tgaaacaagc tgatttatca ggtgcaattt gtgaattaga 5701 acttatttgt gattgtggta ttaaacaaga aagtcgtgtt ggtgttgatg ctgttatgca 5761 ttttggtaca ttagcaaaga ctgatctttt taatggttat aagattggct gtaattgtgc 5821 aggtagaatt gtccattgta ctaaattgaa tgtaccattt ttgatttgtt ctaatactcc 5881 tctgagtaag gatttacctg atgatgttgt tgcagctaac atgtttatgg gtgtaggtgt 5941 aggccattat acacatttga aatgtggttc accttaccaa cattatgatg cttgtagtgt 6001 taaaaaatat acaggtgtta gtggttgttt aactgactgc ttgtatctta aaaatttaac 6061 ccagactttt acatctatgt tgactaatta ttttttggat gatgttgaaa tggttgctta 6121 taaccctgat ctttcacaat attattgtga taatggtaag tattatacaa aacctattat 6181 aaaggctcag tttaaaccat ttgctaaagt tgacggtgtt tatactaact ttaagttagt 6241 tggacatgat atttgtgctc aattgaatga taagttaggt tttaatgtag atttgccgtt 6301 tgttgagtac aaagtaacag tctggcctgt agctactggt gatgttgttt tggcatctga 6361 tgatttatat gtgaaacgtt attttaaagg atgtgaaact tttggtaagc ctgttatttg 6421 gttttgtcat gatgaagcat cattgaattc tcttacttat tttaataaac ctagttttaa 6481 atctgaaaat agatatagtg ttttgtctgt tgattctgta tctgaggagt cacaaggtaa 6541 tgtggttact tctgttatgg aatcgcagat tagtactaaa gaggttaagt taaagggtgt 6601 tagaaagact gttaaaatag aagatgctat tattgttaat gatgaaaata gttctattaa 6661 ggttgttaaa agtttatctt tagttgatgt ttgggatatg tatttgacag gttgtgatta 6721 tgttgtttgg gttgctaatg aattgtcacg cctagttaaa tcaccaacag ttagggaata 6781 tatacgatat ggtattaaac ctattactat acctatagat ttgttatgtt taagagatga 6841 taatcaaact cttttagttc ctaaaatttt taaagcaaga gctatagaat tttatggttt 6901 tttgaagtgg ttgtttattt atgtttttag tttattacat tttacaaatg ataaaaccat 6961 tttttatact acagaaatag cttctaagtt tacttttaat ttgttttgtt tggctcttaa 7021 aaatgctttt cagacattta gatggagtat atttataaaa ggttttcttg ttgtagccac 7081 tgtgtttttg ttttggttta attttttgta tataaatgtt atttttagtg acttttatct 7141 tcctaatatt agtgtttttc ctatttttgt gggaagaatt gttatgtgga taaaggctac 7201 ttttggtttg gttacaattt gtgattttta ttctaagtta ggtgtaggtt ttacaagtca 7261 tttttgtaat ggtagtttta tatgtgaatt gtgtcattct ggttttgata tgttggatac 7321 atatgcagct atagattttg ttcagtatga agtagataga cgtgttttat ttgattatgt 7381 tagtttagtc aaattaattg ttgaactcgt tattggttat tcattataca cagtatggtt 7441 ttatccatta ttttgtctta ttggtttaca attatttact acatggttgc ctgatttgtt 7501 tatgttagaa actatgcatt ggttgattag atttattgta tttgtagcta atatgttacc 7561 tgcttttgtc ttgttgcggt tttatatagt tgttactgct atgtataaag tagttggttt 7621 tattaggcat attgtctatg gttgtaataa agctggttgt ttattttgtt ataaacgaaa 7681 ttgtagtgtt cgtgttaagt gtagtactat tgttggtggt gtaattcgtt attatgatat 7741 tactgctaat ggtggtactg gtttttgtgt taaacatcaa tggaattgtt ttaattgcca 7801 ttcttttaaa ccaggtaaca cttttataac tgtagaagct gctatagaac tttctaaaga 7861 gcttaaacga cctgtaaatc caactgatgc ttcacattat gtagttactg atattaagca 7921 agttggttgt atgatgcgtt tgttctatga tagagatgga cagcgtgttt acgatgatgt 7981 tgatgctagt ttatttgtag atattaataa tctgttacat tctaaagtta aagttgttcc 8041 taatttgtat gtagttgtag tagagagtga tgctgataga gctaattttc tgaatgctgt 8101 tgtgttttat gcacaatcat tgtataggcc tatattactt gtagacaaaa agttaattac 8161 tacagcttgt aatggtatct ctgtaaccca gactatgttt gatgtttatg ttgatacttt 8221 tatgtctcat tttgatgttg atagaaagag ttttaataat tttgttaaca ttgctcatgc 8281 ttctcttaga gagggtgtgc aattagaaaa ggttttagat acttttgtgg gatgtgtacg 8341 taaatgttgt tccattgatt cagatgttga aacaagattt attactaaat ctatgatatc 8401 tgcagtagct gctggtttgg aatttactga tgaaaattat aacaatttgg tacctacata 8461 tttaaagagt gataatattg tagctgctga tttaggtgtt cttatacaga atggtgctaa 8521 gcatgtacag ggtaatgttg ctaaggcagc taatatttct tgtatatggt ttattgatgc 8581 ttttaatcaa cttactgctg atttacagca taaattaaaa aaagcatgtg ttaaaactgg 8641 cttgaagtta aaattgactt ttaataagca agaggcaagt gtccctattc ttacaacacc 8701 cttttcactt aaaggaggtg ttgtattgag taatttgtta tatatattat tttttgttag 8761 tttaatctgt tttatattat tgtgggcttt attgcctaca tatagtgttt ataagtctga 8821 tattcatttg cctgcttatg ctagttttaa agttattgat aatggtgttg ttagagatat 8881 ttcagttaat gatttatgtt ttgctaataa atttttccaa tttgatcaat ggtatgagtc 8941 cacttttggg tctgtttact atcataattc tatggattgc cctattgtag tggcagttat 9001 ggatgaagat atcggttcta ctatgtttaa tgttcctact aaagttttga gacatggctt 9061 tcatgtttta cattttttaa cttatgcatt tgctagtgat agtgttcagt gctatacacc 9121 acatattcag atttcttata atgattttta tgctagtggt tgtgttttat catctttgtg 9181 tactatgttt aaaagaggtg atggtacacc acatccttat tgttattcag atggtgttat 9241 gaagaatgct tctttgtata catctttggt tccacataca cgttatagcc ttgctaattc 9301 taatggtttt ataagatttc ctgatgttat tagtgaaggt attgtacgta ttgtaagaac 9361 gcgctctatg acttattgta gagtgggtgc atgtgaatac gccgaagagg gtatatgttt 9421 taattttaat agttcctggg ttttgaataa tgattattat agaagtatgc ctggaacttt 9481 ttgtggtaga gatctttttg atttgtttta tcaatttttt agtagtttaa ttcgtcctat 9541 agatttcttt tctcttactg ctagttctat ttttggagct atattggcta tagttgttgt 9601 cttggttttt tattatttaa taaaacttaa gcgtgctttt ggagattata ctagtgttgt 9661 agttataaat gttgttgttt ggtgtattaa ttttcttatg ctttttgttt ttcaagttta 9721 tcctatttgt gcatgtgttt atgcttgttt ttatttttat gtaacattgt attttccttc 9781 tgaaattagt gtaattatgc atttgcaatg gattgttatg tatggtgcta taatgccttt 9841 ttggttttgt gtcacatatg tagctatggt tattgcaaac catgttttat ggttattttc 9901 atattgtagg aaaattggtg ttaatgtatg tagtgatagt acatttgaag aaacatctct 9961 tactactttt atgattacta aagattctta ttgtagatta aagaattctg tttctgatgt 10021 tgcctacaat agatatttga gtttgtataa taagtatcgt tactatagtg gtaaaatgga 10081 tactgctgcc tatagagaag cggcgtgttc tcagttagct aaagctatgg aaacatttaa 10141 tcacaataat ggtaatgatg tcttatacca acctcctaca gcatctgttt ctacatcttt 10201 tttgcaatca ggtattgtaa agatggtatc tcctacgtca aaaattgaac cttgtattgt 10261 tagtgttact tatggtagta tgactttgaa tggtttatgg ttagatgaca aagtttattg 10321 tcctcgtcat gttatatgtt catcctctaa tatgaacgaa cctgattatt ctgccttatt 10381 gtgtagagtt actctaggtg attttactat aatgtctggt cggatgagtt taacagttgt 10441 gtcttaccag atgcagggct gtcaacttgt tttgacagtc tctttacaaa atccttacac 10501 tccaaaatat acttttggta atgttaaacc tggtgaaact tttactgttt tagctgcgta 10561 taatggccga ccacaagggg catttcatgt tactatgcgt agtagttata ctattaaagg 10621 ttcttttttg tgtgggtcat gtggatctgt tggttatgta ttaacaggtg atagtgttaa 10681 gtttgtatat atgcatcaat tagagctcag tactggttgt cacactggca ctgattttac 10741 tggtaatttt tatggtccat atagagatgc tcaagttgta cagttgccag ttaaggacta 10801 cgtccagact gttaatgtta ttgcttggct ctatgcagct atacttaata attgtgcttg 10861 gtttgtacaa aatgatgttt gttctactga agattttaat gtttgggcta tggcaaatgg 10921 ttttagccaa gtaaaagcag atcttgtctt agatgctttg gcttcaatga caggtgtttc 10981 tattgaaact ttattggctg ctattaagcg tctatatatg ggatttcaag gtcgtcaaat 11041 actaggaagt tgtacttttg aagatgaatt ggcaccttct gacgtttatc aacaattggc 11101 tggtgttaaa ttgcaatcta aaacaaaaag atttattaaa gaaacaattt attggatttt 11161 gatatctaca tttttgttta gttgtataat ttctgcattt gttaaatgga ctatatttat 11221 gtatattaat acacatatga ttggtgttac attatgtgta ctttgttttg ttagttttat 11281 gatgttacta gttaaacata agcattttta tttgactatg tatataattc ctgtactctg 11341 taccttgttt tatgtaaatt atttagttgt ttataaggaa ggttttagag gttttactta 11401 tgtctggctc tcatattttg ttcctgctgt gaattttact tatgtttatg aagtatttta 11461 tggttgtatt ttatgtgttt ttgctatttt tataactatg catagtatta atcatgacat 11521 tttttctttg atgtttttgg ttggtagaat agttacttta atttctatgt ggtattttgg 11581 gtcgaattta gaagaggatg ttttgttatt tattacagcc tttttaggta cttatacatg 11641 gaccactatt ttgtcattag ctatagcaaa aattgttgct aattggttgt ctgttaatat 11701 attttatttt acagatgtac cttatattaa attgattctc ttgagttact tatttatagg 11761 gtatatttta tcttgttatt ggggattttt ctctctttta aacagtgttt ttagaatgcc 11821 tatgggtgtt tataattata aaatttctgt tcaagaattg cgttatatga atgctaatgg 11881 cttacgtcca cctcgtaata gttttgaggc tattttgtta aatttaaaac tgcttggaat 11941 aggtggcgtg ccagttattg aagtctccca aattcaatca aaattgactg atgtgaaatg 12001 tgctaatgtt gttttgttaa attgtttaca gcatttgcat gttgcttcta attctaagtt 12061 gtggcagtat tgtagtgttt tacataatga aatactatct acttcagatt tgagtgtagc 12121 ttttgataag cttgctcaat tattgattgt tttattcgcc aatcctgctg cagttgatac 12181 taagtgtctt gcaagtatag atgaagttag cgatgattat gttcaagata gtaccgtttt 12241 gcaggctttg caaagtgagt ttgtaaatat ggctagtttt gttgaatatg aagtcgcaaa 12301 gaaaaatttg gctgatgcta aaaatagtgg ttctgttaat caacaacaga taaaacagtt 12361 agaaaaagca tgtaatatag ctaagtctgt gtatgaacgt gataaagctg tagctcgcaa 12421 acttgaacgt atggcagacc tagcacttac taacatgtat aaagaggctc ggattaatga 12481 taagaagagt aaagttgttt ccgctttgca gacaatgctt tttagcatgg ttcgtaaatt 12541 ggataatcag gctttaaatt ctattctgga taatgctgtt aaaggttgtg tacctttgag 12601 tgctattcca gcattggctg ctaatacttt aactatagta ataccagata aacaagtttt 12661 tgataaagtt gttgataatg tttatgttac atatgctggt agtgtatggc atatacagac 12721 tgttcaagat gctgatggta ttaataaaca gttaactgat attagtgttg attctaattg 12781 gcctcttgtt atcattgcga acaggtataa tgaagttgct aatgctgtta tgcagaataa 12841 tgagttgatg cctcataaat taaaaataca agttgttaat agtggttctg atatgaattg 12901 taatattcct actcaatgtt attataataa tggtagtagt ggtagaatag tttatgctgt 12961 tcttagtgat gttgatggtc ttaagtatac taagataatg aaagatgatg gaaattgtgt 13021 tgttttagag cttgatcctc cttgtaaatt ttctatacaa gatgttaagg gacttaaaat 13081 taagtatctt tattttatta aaggatgtaa cactttagct agagggtggg ttgttggtac 13141 tttatcttca acaattagat tgcaggctgg tgttgctact gagtatgcag ctaattcttc 13201 tatactttca ttatgtgcat tttctgtaga tcctaagaaa acttatttag attatataca 13261 acaaggtggt gtacctataa ttaattgtgt taaaatgctc tgtgatcatg ctggtactgg 13321 tatggccatt actattaaac ctgaggctac tattaaccaa gattcttatg gtggtgcctc 13381 agtttgtatt tattgccgtg cacgtgtaga gcatccagat gtagatggta tatgtaaatt 13441 acgtggtaaa tttgtacaag tccctttggg tataaaagat cctattcttt atgtgttaac 13501 acatgatgtt tgtcaagtct gtggtttttg gagagatggc agttgttcct gtgtaggttc 13561 aagtgtcgct gttcaatcta aagatttaaa ttttttaaac gggttcgggg tactagtgtg 13621 aatgcccggc tagtaccctg tgctagtggt ttatctactg atgttcaatt aagggcattt 13681 gacatttgta ataccaatag agctggtata ggtttatatt ataaagtgaa ttgttgccgt 13741 tttcagcgta tagatgacga cggtaataaa ttggataagt tctttgttgt caaaagaact 13801 aatttagaag tttataataa agagaaaact tattatgagt tgactaaaag ttgtggtgtt 13861 gtggctgaac atgatttctt tacatttgat attgatggta gtcgcgtgcc acatatagtt 13921 cgtaggaatc tttcaaagta tactatgtta gatctttgct atgcattgcg tcattttgat 13981 cgtaatgatt gttcaatatt gtgtgaaatt ctttgtgagt atgctgattg taaagaatcc 14041 tacttttcta agaaagattg gtatgatttt gttgaaaatc ctgatattat taatatatat 14101 aaaaaattag gccctatttt taatagagct ttacttaata ctgtcatttt tgcagacacc 14161 ttagttgaag taggtttagt tggtgtttta actttagata accaagattt gtatggtcaa 14221 tggtatgatt ttggtgattt tatacaaaca gccccagggt ttggtgtggc agttgcagat 14281 tcttactatt cttatatgat gcctatgttg actatgtgtc atgtattaga ttgtgaatta 14341 tttgttaatg atagttatag acaattcgat cttgtacagt atgattttac tgattacaag 14401 ttagagttgt ttaataagta ttttaagtat tggggtatga agtatcatcc taatactgtg 14461 gattgtgata atgataggtg tattattcat tgtgctaatt ttaatatact atttagtatg 14521 gttttaccta atacttgttt tggtcccctt gttagacaaa tttttgtaga tggtgtaccg 14581 tttgttgttt ctattggtta ccattacaaa gagttaggtg tagttatgaa cttagatgtt 14641 gacacacacc gttatcgttt gtctcttaaa gatttacttc tttatgcagc agatcctgct 14701 atgcacgttg catctgctag tgctctgctt gatttacgaa cttgttgttt tagtgtagct 14761 gccattacaa gtggtataaa atttcaaact gtaaaaccag gtaactttaa ccaagacttt 14821 tacgagtttg ttaaaagtaa aggcttgttt aaagagggta gtacagttga tttgaaacat 14881 tttttcttta ctcaagatgg taatgctgca attactgatt ataattatta taagtataat 14941 ttacctacta tggttgatat taagcagtta ttgtttgtat tagaagttgt ttataaatat 15001 tttgaaattt atgatggtgg ttgtatacca gcatcacaag ttattgttaa taattatgat 15061 aaaagtgctg gttatccatt taataaattt ggtaaagcca gactttatta tgaggcatta 15121 tcatttgagg aacagaatga aatttatgca tatactaaac gtaatgttct gcccacctta 15181 actcaaatga atttaaaata tgctatcagt gctaagaata gagctcgcac tgtagcaggt 15241 gtttctattc ttagtactat gacaggccga atgttccatc aaaaatgttt gaagagtata 15301 gcagctaccc gaggtgttcc tgttgttata ggaaccacta aattttatgg tggttgggac 15361 gatatgttac gtcatcttat aaaggatgtt gacaaccctg ttcttatggg ttgggattat 15421 cctaaatgtg atcgtgctat gccaaatatt ttgcgtattg ttagtagttt agttttggcc 15481 cgcaaacatg aattttgttg ttcacatggt gatagatttt atcgccttgc gaatgaatgt 15541 gctcaagttt tgagtgaaat agttatgtgt ggcggttgct attatgttaa gcctggtggt 15601 actagcagtg gtgatgcaac tactgctttt gctaattctg tttttaatat atgtcaggct 15661 gttactgcta atgtttgttc tcttatggcc tgtaatggcc ataagattga agatttaagt 15721 atacgcaatt tacaaaaacg cttatactct aatgtttatc gtacagatta tgttgattat 15781 acatttgtta atgagtatta tgaattttta tgtaagcatt ttagtatgat gattttgagt 15841 gatgatggtg ttgtctgtta taactctgat tatgctagta agggttatat agctaatata 15901 agtgtttttc aacaagtttt gtactatcag aataatgtct ttatgtctga atctaaatgt 15961 tgggttgaaa atgatattac taatggtcct catgaatttt gttcccaaca tactatgtta 16021 gttaagatag atggtgatta tgtttattta ccatatccag atccttctag aattttagga 16081 gctggttgtt ttgttgatga tttattgaag actgacagtg ttcttttgat agagcgcttt 16141 gtaagtctag ctatagatgc ttacccttta gtacatcatg aaaatgaaga ataccaaaaa 16201 gtctttcgtg tatatttaga atatataaaa aaactgtata atgatcttgg tactcagatc 16261 ttagatagtt atagtgttat tttaagtact tgtgatggtt taaagtttac tgaagaatca 16321 ttttacaaga atatgtattt aaaaagtgcc gtgatgcaga gtgtaggtgc atgcgttgtt 16381 tgttcatcac aaacttcttt gcgttgtggc agttgtatac gtaagccttt gttatgttgt 16441 aaatgttgtt atgaccatgt tatggcaact aatcataaat atgttttgag tgtctcacct 16501 tacgtttgta atgcacctaa ctgtgatgtg agtgatgtca ccaaattata tttgggcggt 16561 atgtcttact attgtgaaaa ccataaaccc cattattcat ttaagttagt tatgaatggt 16621 atggtctttg gtttgtataa acaatcttgc acgggttcac cttatataga tgattttaat 16681 aagatagcta gttgtaaatg gacagaagtt gatgattatg ttctggcaaa tgagtgtatt 16741 gaacgtttaa agttatttgc tgcagaaact caaaaggcaa ctgaagaggc ttttaaacaa 16801 agctatgctt ctgctaccat tcaagagatt gttagtgata gagaagttat tttgtgttgg 16861 gagacaggta aagttaaacc accacttaat aaaaattatg ttttcacagg ctaccatttt 16921 actagtactg gtaagacagt tttaggtgag tatgtttttg ataaaagtga attaactaac 16981 ggtgtgtatt accgcgctac aactacttat aaactttcta taggtgatgt ttttgtttta 17041 acatcacatt ctgtagctag tttaagtgca cctacacttg tcccacaaga gaactatgct 17101 agtataagat tttctagtgt ttatagtgtt ccattggtgt ttcaaaataa tgttgctaat 17161 tatcagcaca ttggaatgaa acgttattgc actgttcaag gtccccctgg tacgggaaag 17221 tctcatcttg ctataggtct agctgtttat tactacacag cacgtgtagt ttatactgct 17281 gctagtcatg ctgctgtaga tgcattgtgt gaaaaagctt ataagttttt aaatattaac 17341 gattgtacac gtattattcc tgctaaagtt cgtgtagatt gttatgataa gtttaaaatt 17401 aatgatacca cttgtaagta tgtttttacc acaataaatg cattaccaga gttggttaca 17461 gatattgttg ttgttgatga agttagtatg cttactaatt atgaattgtc tgttataaat 17521 gctcgtatta aagctaaaca ttatgtatat attggagatc ctgctcaatt acctgcacca 17581 cgtgtgctgt tgagcaaggg ttctttagaa cctaggcact tcaattctat tactaaaata 17641 atgtgttgtt taggtcctga tatctttttg ggaaattgtt ataggtgtcc taaagaaatt 17701 gtagaaactg tttcagcatt ggtttatgat aataaactca aggctaaaaa tgataatagt 17761 tcattatgtt ttaaagtata ttttaaggga cagacaacac atgagagttc aagtgctgta 17821 aatattcaac agatatatct aattagtaaa tttttaaaag ctaatccagt ttggaatagt 17881 gctgttttta ttagtcctta taatagtcag aattatgttg ctaagcgtgt tttaggtgtt 17941 caaacacaaa ctgtagattc tgctcaaggt tcggaatatg attatgttat atattcacaa 18001 acagcagaaa cagcccattc tgttaatgtt aatcgattta atgttgccat aactagagcc 18061 aagaagggca ttttttgtgt tatgagtaat atgcaattat ttgaatctct taattttatt 18121 actctacctt tagataaaat tcaaaatcaa actttacctc gtttgcattg cacaactaat 18181 ctttttaaag attgtagtaa aagttgctta ggttatcatc cagcgcatgc cccctcattt 18241 ttagcagttg atgataaata taaggttaat gaaaatttgg ctgtaaattt aaatatttgt 18301 gaacctgttt taacatattc tcgtttaata tctcttatgg gttttaaatt agatttgact 18361 cttgatggtt attctaaatt gtttattact aaagatgaag ccattaaacg tgttagaggt 18421 tgggttggtt ttgatgttga gggcgctcat gctactcgcg aaaacattgg aacaaacttt 18481 ccactgcaaa taggtttttc aactggtgtg gattttgtag ttgaagctac tggcttattt 18541 gctgagagag attgttatac ttttaaaaaa actgtagcta aagctcctcc tggtgaaaaa 18601 tttaaacatt taatacccct tatgtcaaaa ggtcaaaagt gggatattgt tagaattaga 18661 attgttcaaa tgttatctga ttatctttta gacctttctg atagtgtagt atttattact 18721 tggtctgcca gttttgaact tacttgttta aggtattttg ctaaattagg cagagagctt 18781 aattgtaatg tgtgttctaa tcgtgctaca tgctacaatt ctagaactgg ttattatggt 18841 tgttggcgcc atagttatac ttgtgattat gtgtataatc cacttattgt agatatacaa 18901 cagtggggtt atacaggttc tttaactagt aatcacgata taatttgtaa tgtacataaa 18961 ggtgcacatg ttgcgtcagc tgatgcaatt atgactcgtt gtttagcaat ctatgattgt 19021 ttttgtaaat ctgttaattg gaatttagag tatccaataa tttctaatga ggtcagtata 19081 aatacatctt gtaggttatt gcagcgtgtc atgcttaaag ctgccatgct atgtaataga 19141 tacaacttat gttatgacat aggcaatcct aaaggtttag cttgtgtcaa agattatgaa 19201 tttaaatttt atgatgcttt tcctgtagcc aagtctgtta aacagttatt ttatgtctat 19261 gatgtgcata aagataattt taaagatggt ttatgtatgt tttggaattg taatgttgat 19321 aaatatccat ctaattcaat tgtttgtaga tttgacactc gagtgttaaa taaattaaac 19381 cttcctggat gtaatggtgg tagtttgtat gttaataaac atgcattcca tactaatcct 19441 tttactagaa ctgtttttga aaatcttaag cctatgcctt ttttctatta ttcagatacg 19501 ccttgtgtgt acgtagatgg tttagaatct aaacaagttg attacgttcc tttaagaagc 19561 gccacttgta tcacacggtg taatctaggt ggagctgttt gttcaaagca tgctgaagaa 19621 tattgtaact accttgagtc ttataatata gttactacag caggctttac tttttgggtt 19681 tataagaatt ttgattttta taatttatgg aacactttta ctacgttaca gagtttagaa 19741 aacgtaatat ataacttggt taatgttggt cattatgatg gacgtacagg tgaattacct 19801 tgtgctatta tgaatgacaa agttgttgtt aagattaata atgtagatac tgttattttt 19861 aaaaataata catcatttcc tactaatata gctgttgaat tgtttacaaa acgtagtatc 19921 cggcaccacc ctgaacttaa gattcttaga aatttgaaca ttgatatttg ttggaagcat 19981 gtcctgtggg attatgttaa agatagtttg ttttgtagtt ccacttatgg tgtttgtaaa 20041 tacacagatt tgaagttcat cgaaaatttg aatatacttt ttgatggtcg tgacactggc 20101 gctttagaag cttttagaaa agcaagaaat ggtgttttta ttagtactga aaaattaagt 20161 aggttatcaa tgattaaagg tccgcaacga gctgatttaa atggtgtgat tgtggataaa 20221 gttggagaac tcaaagttga gttttggttc gctatgagaa aagatggtga cgatgttatc 20281 ttcagccgaa cagacagcct atgctcaagc cattactgga gcccacaagg taatctaggt 20341 ggtaattgcg cgggtaatgt cattggtaat gatgctctaa cacgttttac tatctttact 20401 cagagtcgtg tattgtcaag ttttgaacct cgctcagatt tagaacggga ttttattgat 20461 atggatgata atctgtttat tgctaaatat ggtttagaag actatgcatt tgatcatata 20521 gtttatggta gttttaacca taaagttata ggaggtttgc atttgcttat aggcttattt 20581 cgtaggaaaa aaaaatctaa tttgttaatt caagagtttt tacagtatga ttctagtatt 20641 cattcatatt ttattactga tcaggagtgt ggtagtagta agagtgtttg tacagttatt 20701 gatttattat tagatgattt tgtttctatt gttaagtcat taaatttgag ttgtgttagt 20761 aaagttgtta atattaatgt tgattttaag gattttcaat ttatgttgtg gtgtaatgat 20821 aataaaatta tgacttttta tcctaaaatg caagccacta atgattggaa acctggctat 20881 tctatgcctg ttttgtataa gtatttgaat gttccattag agagagtctc tttatggaat 20941 tatggtaaac ctattaattt gcctacaggc tgtatgatga atgttgctaa gtacactcaa 21001 ttatgtcagt atttgaatac tacaacatta gctgttcctg ttaatatgcg tgttttacat 21061 ttaggtgcag ggtctgataa agaagtagct ccaggttctg ctgttttaag acagtggtta 21121 ccatctggta gtattcttgt agataatgat ttaaacccat ttgttagcga tagtttagtt 21181 acttattttg gagattgtat gactttacca tttgattgtc attgggattt gataatatct 21241 gatatgtatg atcctcttac taaaaatatt ggtgattata atgtgagtaa ggatgggttt 21301 tttacttaca tttgtcattt aattcgtgat aaattatctt tgggtggtag tgtagctata 21361 aaaattacag agttttcttg gaatgctgat ttatataaat taatgagttg ttttgcattt 21421 tggacagttt tttgtactaa tgtaaatgct tcttctagtg aagggttttt aataggtata 21481 aattacctgg gtaaatcttc ttttgaaata gatggcaatg ttatgcatgc taactatttg 21541 ttttggagaa atagtacaac atggaatggc ggtgcttata gtttatttga tatgactaaa 21601 ttttctttga aattggctgg cactgctgtt gttaatttaa gaccagatca attaaatgat 21661 ttagtttatt ctcttattga aagaggtaaa ttattagttc gcgatacgcg taaagagatt 21721 tttgttggtg atagtcttgt aaatacttgt tagatctcat taaatctaaa ctatgttaat 21781 tattttttta tttttttatt tctgttatgg ttttaatgaa cctcttaatg ttgtgtctca 21841 tttaaaccat gactggtttt tatttggtga tagtcgttct gattgtaacc atattaataa 21901 tttaaaaatt aaaaattttg attatttgga tattcaccct agtttgtgca acaatggtaa 21961 gatttcatct agtgccggtg attctatttt taagagtttt catttcactc gattttataa 22021 ttacactggc gaaggtgatc aaattatttt ttatgagggt gttaatttta atccttatca 22081 tagatttaag tgttttccta atggtagtaa tgatgtatgg cttcttaaca aggtaagatt 22141 ttatcgtgcc ttatattcta atatggcctt ttttcgttat cttacttttg ttgatattcc 22201 ttataatgtt tctctttcta agtttaattc ttgtaaaagt gatattttat cacttaacaa 22261 tcctattttt attaattatt ctaaggaagt ttattttact ttattaggtt gttctcttta 22321 tttagtaccg ctttgccttt ttaaatctaa ctttagtcag tactattata acatagatac 22381 tggctctgtt tatggttttt ctaatgttgt ttatcctgat ttagactgta tttatatttc 22441 tcttaaacca ggttcttata aagtttccac cactgcacct tttttatcct tacctactaa 22501 agctctctgt tttgataaat ctaaacaatt tgtacctgta caggttgttg attctagatg 22561 gaacaacgag cgtgcctcag atatttcttt atctgttgca tgtcaattgc catattgtta 22621 ttttcgcaat tcttctgcta attatgttgg caagtatgat attaaccacg gtgatagtgg 22681 ttttatttct attttatctg gtcttttata taatgtttct tgtatttcat attatggtgt 22741 atttttatat gataatttta catccatttg gccctattat tcttttggta ggtgtcctac 22801 atcttctatt attaaacatc caatttgtgt ttatgatttt ttgcctatta ttttacaagg 22861 tattttatta tgtttagctt tactttttgt tgtttttcta ttatttttgt tatataacga 22921 taaatctcat taaatctaaa catgttatta attattttta ttttgcctac aacattagct 22981 gttataggtg attttaattg tactaatttt gctattaatg atttaaacac cacagttcct 23041 cgcataagtg agtatgttgt ggatgtttct tatggtttgg gtacatatta tatacttgat 23101 cgtgtttatt taaatactac tatattattt actggttatt tccctaaatc tggtgccaat 23161 tttagggatc tatctttaaa aggtactaca tatttgagta ctctttggta tcagaaaccc 23221 tttttatctg attttaataa tggtattttt tctagagtta agaatactaa gttgtatgtt 23281 aataaaactt tgtatagtga gtttagtact atagttatag gtagtgtttt tattaacaac 23341 tcttatacta ttgttgttca acctcataat ggtgttttgg agattacagc ttgtcaatac 23401 actatgtgtg agtatcctca tactatttgt aaatctaaag gtagttctcg taatgaatct 23461 tggcattttg ataaatctga acctttgtgt ctgttcaaga aaaattttac ttataatgtt 23521 tctacagatt ggttgtattt tcatttttat caagaacgtg gcacttttta tgcttattat 23581 gctgattctg gcatgcctac tactttttta tttagtttgt atcttggtac tcttttatct 23641 cattattatg ttttgccttt gacttgtaat gctatatctt ctaatactga taatgagact 23701 ttacaatatt gggtcacacc tttgtctaaa cgccaatatc ttcttaaatt tgacaaccgt 23761 ggtgttatta ctaatgctgt tgattgttct agtagtttct ttagcgagat tcaatgtaaa 23821 actaaatctt tattacctaa tactggtgtt tatgacttat ctggttttac tgttaagcct 23881 gttgcaactg tacatcgtcg tattcctgat ttacctgatt gtgacattga taaatggctt 23941 aacaatttta atgtaccctc acctcttaat tgggaacgta aaattttttc taattgcaac 24001 tttaatttga gtactttgct tcgtttagtt catactgatt ctttttcttg taataatttt 24061 gatgaatcta agatatatgg tagttgtttt aagagtattg ttttagataa atttgccata 24121 cccaactcca gacgatctga tttgcagttg ggcagttctg gttttctgca atcttctaat 24181 tataaaattg acactacttc tagttcttgt caattgtatt atagtttgcc tgcaattaat 24241 gttactatta ataattataa tccttcttct tggaatagaa ggtatggttt taataatttt 24301 aatttgagct ctcatagtgt tgtttactca cgttattgtt tttctgttaa taatactttt 24361 tgtccttgtg ctaaaccttc ttttgcttca agttgcaaga gtcataaacc accttctgct 24421 tcctgtccta ttggtactaa ttatcgttct tgtgagagta ctactgtact cgaccacact 24481 gactggtgta ggtgttcttg tttacctgat cctataactg cttatgaccc taggtcttgt 24541 tctcaaaaaa agtctctggt tggtgttggt gaacattgtg cagggttcgg tgttgatgaa 24601 gaaaagtgtg gtgtattgga tggatcatat aatgtttctt gtctttgtag tactgatgcc 24661 tttctaggtt ggtcttatga cacttgcgtc agtaacaacc gttgtaatat tttttctaat 24721 tttattttaa atggtatcaa tagtggtacc acttgttcta atgatttatt gcagcctaat 24781 actgaagttt ttactgatgt ttgtgttgat tacgaccttt atggtattac aggacaaggt 24841 atttttaaag aagtttctgc tgtttattat aatagttggc aaaatctttt gtatgattct 24901 aatggcaaca ttattggttt taaagatttt gttactaata aaacatataa tattttccct 24961 tgttatgcag gaagagtttc tgctgctttt catcaaaatg cttcctcttt ggctttactt 25021 tatcgtaatt taaaatgtag ctatgttttg aataatattt ctttaactac tcagccatat 25081 tttgatagtt atcttggttg cgtttttaat gctgataatt taactgatta ttctgtttct 25141 tcttgtgctc ttcgcatggg tagtggtttt tgtgttgatt ataactcacc ttcttcttcc 25201 tcttcgcgtc gtaaacgtag aagtatttct gcttcttatc gttttgttac ttttgaaccc 25261 tttaatgtca gttttgttaa tgacagtatt gagtctgtgg gtggtcttta tgagatcaaa 25321 attcccacta actttactat agttggtcaa gaggaattta ttcaaactaa ttctcctaaa 25381 gttactattg attgttcttt atttgtctgt tctaattatg cagcttgcca tgacttattg 25441 tcagagtatg gcactttttg tgataatatt aatagtattt tagatgaagt taatggttta 25501 cttgatacta ctcaattgca tgtagctgat actcttatgc aaggtgtcac acttagctcc 25561 aatcttaata ctaatttgca ttttgatgtt gataatatta attttaaatc cctagttgga 25621 tgtttaggtc cacactgcgg ttcttcttct cgttcttttt ttgaagattt attgtttgac 25681 aaagttaaac tttcagatgt tggttttgtt gaagcttata acaattgtac tggtggtagt 25741 gaaattagag atcttctttg tgtacaatcc tttaatggta ttaaagtttt gcctcctatt 25801 ttgtctgaat ctcaaatttc tggttacacc acagccgcta ctgttgctgc tatgtttcca 25861 ccatggtcag cagcagctgg cataccattt tctcttaatg tacaatatag aattaatggt 25921 ttgggtgtta ctatggatgt tcttaataaa aatcaaaagt tgatagctac tgcttttaat 25981 aatgctcttc tttctattca gaatggtttt agtgctacca actctgcact tgctaaaata 26041 caaagtgttg ttaattctaa tgctcaagca cttaatagtt tgttacagca attatttaat 26101 aaatttggtg caattagttc ttctttacaa gaaattttat ctcgtctcga tgctttagag 26161 gctcaggttc agattgatag gcttattaat ggtcgtttaa ctgctttaaa tgcttatgtc 26221 tctcaacagc ttagtgatat ttctcttgta aaatttggtg ctgctttagc tatggagaag 26281 gttaatgagt gtgttaaaag tcaatctcct cgtattaatt tttgtggtaa tggtaatcat 26341 attttgtcat tagttcaaaa tgctccttat ggtttgttgt ttatgcattt tagttataaa 26401 cctatttctt ttaaaactgt tttagtaagt cctggtttgt gtatatcagg tgatgtaggt 26461 attgcaccta aacaagggta ttttattaaa cataatgatc attggatgtt cactggtagt 26521 tcttactatt atcctgaacc aatttcagat aaaaatgttg tttttatgaa tacttgttct 26581 gttaatttta ctaaagcgcc tcttgtttat ttgaatcatt ctgtaccaaa attgtctgat 26641 tttgaatctg agttatctca ttggtttaaa aatcaaacat ccattgcgcc taatttgact 26701 ttaaatcttc atactattaa tgctactttt ttagatttgt attatgagat gaatcttatt 26761 caagagtcta ttaagtcttt gaataatagt tatatcaatc ttaaagatat aggtacatat 26821 gaaatgtatg taaaatggcc ttggtatgtt tggctactaa tttctttttc atttataata 26881 ttccttgtat tgctcttttt tatatgttgt tgtactggtt gtggttctgc atgttttagt 26941 aaatgtcata attgttgtga tgagtatggt ggtcatcatg attttgttat caaaacatct 27001 catgatgatt agaatctctt gtcagatctc attaaatcta aactttattt atggacgttt 27061 ggagacctag ctacacacat tctcttgtta ttagagaatt tggtgttaca aaccttgaag 27121 atttgtgtct aaagtataat tactgtcaac ctattgttgg ttactgtatt gtacctttaa 27181 atgtttggtg tcgcaagttt ggcaaatttg cttctcactt tacattacgt agtcacgata 27241 tttcccatag taataatttt ggtgttgtaa ctagttttac tacttatggt aatactgttt 27301 ctgaggctgt gtctagatta gttgaatcag cttctgaatt tattgtttgg cgtgcagagg 27361 cacttaataa gtatggttga tttatttttc aatgatactg cttggtacat aggacagatt 27421 ttagttttag ttttattttg tcttatttct ttaatctttg ttgttgcttt tttagcaact 27481 attaagcttt gtatgcaact ttgtggtttt tgtaatttct ttattatttc accttcggct 27541 tacgtttata aaagaggtat gcagttgtat aagtcttata gtgaacaagt tataccaccc 27601 acttcagatt atttaatcta aatctaaaca ttatgaataa atcttttctt cctcaattta 27661 cttctgatca agctgttaca ttcttaaaag aatggaattt ctctttgggt gtaatactac 27721 tttttattac tatcatattg cagttcggtt atacgagccg tagtatgttt gtttatctta 27781 tcaagatgat tattctttgg cttatgtggc cattgactat caccttgact atatttaatt 27841 gtttttatgc tttgaataat gcttttcttg cattttctat agtgtttact attatttcta 27901 ttgttatatg gattctttat tttgttaata gtattcggct ttttattaga actggcagtt 27961 ggtggagttt taatccagag accaataatc ttatgtgtat tgatatgaaa ggcaagatgt 28021 ttgttaggcc agttattgag gactatcaca cattaactgc tactgttatt cgtggtcatc 28081 tttatataca gggtgtcaaa cttggcactg gttatactct ttcagatttg cccgtatatg 28141 ttactgtagc taaggtgcaa gtactttgta cctataaacg tgccttttta gataagttag 28201 atgttaatag tggttttgct gtttttgtta agtctaaagt tggtaactat cgtttaccgt 28261 ctagtaaacc tagtggtatg gatactgcct tgttaagagc ttaaatctaa actattagga 28321 tgtcttatac tcccggtcat tatgctggaa gtagaagctc ctctggaaat cgttcaggaa 28381 tcctcaagaa aacttcttgg gctgaccaat ctgagcgaaa ttaccaaacc tttaatagag 28441 gcagaaaaac ccaacctaaa ttcactgtgt ctactcaacc acaaggaaat actatcccac 28501 attattcctg gttctccggg atcactcaat ttcaaaaagg tagagacttt aaattttcag 28561 atggtcaagg agttcccatt gctttcggag tacccccttc tgaagcaaaa ggatattggt 28621 atagacacag ccggcgttct tttaaaacag ctgatggtca acaaaagcag ttgttaccga 28681 gatggtattt ctactatctc ggtaccggcc catatgccaa tgcatcctat ggtgaatccc 28741 tcgaaggggt cttctgggtt gctaatcacc aagctgacac ttctactccc tccgatgttt 28801 cgtcaaggga tcctactact caagaagcta tccctactag gtttccgcct ggtacgattt 28861 tgcctcaagg ctattatgtt gaaggctcag gaaggtctgc ttctaatagt cgaccaggtt 28921 cacgttctca atcacgtgga cccaataatc gttcattaag tagaagtaat tctaatttta 28981 gacattcaga ttctatagta aaacctgata tggctgatga gatcgctaat cttgttttag 29041 ccaagcttgg taaagattct aaacctcagc aagtcactaa gcaaaatgcc aaggaaatca 29101 ggcataaaat tttaacaaaa cctcgccaaa agcgaactcc taataaacat tgtaatgttc 29161 aacagtgttt tggtaaaaga ggaccttctc aaaattttgg taatgctgaa atgttaaagc 29221 ttggtactaa tgatcctcag tttcctattc ttgcagaatt agctcctaca ccaggtgctt 29281 ttttctttgg ttctaaatta gacttggtta aaagagattc cgaggctgac tcacctgtta 29341 aagatgtttt tgaacttcat tattctggtt ctattaggtt tgatagtact ttaccaggct 29401 ttgagacaat tatgaaagtt cttgaagaga atttaaatgc ttacgttaat tctaatcaga 29461 acactgattc tgattcgttg agttctaaac ctcagcgtaa aagaggtgtt aaacaattac 29521 cagaacagtt tgactctctt aatttaagtg ctggtactca gcacatttca aatgatttta 29581 ctcctgagga tcatagttta cttgctactc ttgatgatcc ttatgtagaa gactctgttg 29641 cttaatgaga atgaatccta attcgacact aggtggtaac ccctcgctat tattcggaat 29701 aggacactct ctatcagaat gaattcttgc tgtaataaca gatagagtag gttgttacag 29761 actatatatt aattagtaga aattttatat ttagacattt gattgttaga gtagttataa 29821 ggtttagctg tagtataaac gcctccggga agagctatca attgtagtgt ttaatatata 29881 tattagtata tgattgaaat taattatagc cttttggagg aattac // LOCUS NC_005831 27553 bp ss-RNA linear VRL 13-AUG-2018 DEFINITION Human Coronavirus NL63, complete genome. ACCESSION NC_005831 VERSION NC_005831.2 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human coronavirus NL63 (HCoV-NL63) ORGANISM Human coronavirus NL63 Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Alphacoronavirus; Setracovirus. REFERENCE 1 (bases 1 to 27553) AUTHORS van der Hoek,L., Pyrc,K., Jebbink,M.F., Vermeulen-Oost,W., Berkhout,R.J., Wolthers,K.C., Wertheim-van Dillen,P.M., Kaandorp,J., Spaargaren,J. and Berkhout,B. TITLE Identification of a new human coronavirus JOURNAL Nat. Med. 10 (4), 368-373 (2004) PUBMED 15034574 REFERENCE 2 (bases 1 to 27553) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (24-JUN-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 27553) AUTHORS van der Hoek,L., Pyrc,K., Jebbink,M.F., Vermeulen-Oost,W., Berkhout,R.J.M., Wolthers,K.C., Wertheim-van Dillen,P.M.E., Kaandorp,J., Spaargaren,J. and Berkhout,B. TITLE Direct Submission JOURNAL Submitted (22-JUN-2004) Department of Human Retrovirology, University of Amsterdam, Academic Medical Center, Meibergdreef 15, Amsterdam 1105 AZ, The Netherlands REMARK Sequence update by submitter REFERENCE 4 (bases 1 to 27553) AUTHORS van der Hoek,L., Pyrc,K., Jebbink,M.F., Vermeulen-Oost,W., Berkhout,R.J.M., Wolthers,K.C., Wertheim-van Dillen,P.M.E., Kaandorp,J., Spaargaren,J. and Berkhout,B. TITLE Direct Submission JOURNAL Submitted (05-MAR-2004) Department of Human Retrovirology, University of Amsterdam, Academic Medical Center, Meibergdreef 15, Amsterdam 1105 AZ, The Netherlands COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to AY567487. On Jun 24, 2004 this sequence version replaced NC_005831.1. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..27553 /organism="Human coronavirus NL63" /mol_type="genomic RNA" /strain="Amsterdam I" /db_xref="taxon:277944" 5'UTR 1..286 /inference="non-experimental evidence, no additional details recorded" gene 287..20475 /locus_tag="HCNV63gp1" /db_xref="GeneID:2943501" CDS join(287..12439,12439..20475) /locus_tag="HCNV63gp1" /inference="non-experimental evidence, no additional details recorded" /ribosomal_slippage="" /note="ORF1a/1b; translated via -1 ribosomal frameshift" /codon_start=1 /product="replicase polyprotein 1ab" /protein_id="YP_003766.2" /db_xref="GeneID:2943501" /translation="MFYNQVTLAVASDSEISGFGFAIPSVAVRTYSEAAAQGFQACRFV AFGLQDCVTGINDDDYVIALTGTNQLCAKILPFSDRPLNLRGWLIFSNSNYVLQDFDVV FGHGAGSVVFVDKYMCGFDGKPVLPKNMWEFRDYFNNNTDSIVIGGVTYQLAWDVIRKD LSYEQQNVLAIESIHYLGTTGHTLKSGCKLTNAKPPKYSSKVVLSGEWNAVYRAFGSPF ITNGMSLLDIIVKPVFFNAFVKCNCGSESWSVGAWDGYLSSCCGTPAKKLCVVPGNVVP GDVIITSTSAGCGVKYYAGLVVKHITNITGVSLWRVTAVHSDGMFVASSSYDALLHRNS LDPFCFDVNTLLSNQLRLAFLGASVTEDVKFAASTGVIDISAGMFGLYDDILTNNKPWF VRKASGLFDAIWDAFVAAIKLVPTTTGVLVRFVKSIASTVLTVSNGVIIMCADVPDAFQ SVYRTFTQAICAAFDFSLDVFKIGDVKFKRLGDYVLTENALVRLTTEVVRGVRDARIKK AMFTKVVVGPTTEVKFSVIELATVNLRLVDCAPVVCPKGKIVVIAGQAFFYSGGFYRFM VDPTTVLNDPVFTGDLFYTIKFSGFKLDGFNHQFVTASSATDAIIAVELLLLDFKTAVF VYTCVVDGCSVIVRRDATFATHVCFKDCYNVWEQFCIDNCGEPWFLTDYNAILQSNNPQ CAIVQASESKVLLERFLPKCPEILLSIDDGHLWNLFVEKFNFVTDWLKTLKLTLTSNGL LGNCAKRFRRVLVKLLDVYNGFLETVCSVAYTAGVCIKYYAVNVPYVVISGFVSRVIRR ERCDMTFPCVSCVTFFYEFLDTCFGVSKPNAIDVEHLELKETVFVEPKDGGQFFVSGDY LWYVVDDIYYPASCNGVLPVAFTKLAGGKISFSDDVIVHDVEPTHKVKLIFEFEDDVVT SLCKKSFGKSIIYTGDWEGLHEVLTSAMNVIGQHIKLPQFYIYDEEGGYDVSKPVMISQ WPISNDSNGCVVEASTDFHQLECIVDDSVREEVDIIEQPFEEVEHVLSIKQPFSFSFRD ELGVRVLDQSDNNCWISTTLVQLQLTKLLDDSIEMQLFKVGKVDSIVQKCYELSHLISG SLGDSGKLLSELLKEKYTCSITFEMSCDCGKKFDDQVGCLFWIMPYTKLFQKGECCICH KMQTYKLVSMKGTGVFVQDPAPIDIDAFPVKPICSSVYLGVKGSGHYQTNLYSFNKAID GFGVFDIKNSSVNTVCFVDVDFHSVEIEAGEVKPFAVYKNVKFYLGDISHLVNCVSFDF VVNAANENLLHGGGVARAIDILTEGQLQSLSKDYISSNGPLKVGAGVMLECEKFNVFNV VGPRTGKHEHSLLVEAYNSILFENGIPLMPLLSCGIFGVRIENSLKALFSCDINKPLQV FVYSSNEEQAVLKFLDGLDLTPVIDDVDVVKPFRVEGNFSFFDCGVNALDGDIYLLFTN SILMLDKQGQLLDTKLNGILQQAALDYLATVKTVPAGNLVKLFVESCTIYMCVVPSIND LSFDKNLGRCVRKLNRLKTCVIANVPAIDVLKKLLSSLTLTVKFVVESNVMDVNDCFKN DNVVLKITEDGINVKDVVVESSKSLGKQLGVVSDGVDSFEGVLPINTDTVLSVAPEVDW VAFYGFEKAALFASLDVKPYGYPNDFVGGFRVLGTTDNNCWVNATCIILQYLKPTFKSK GLNVLWNKFVTGDVGPFVSFIYFITMSSKGQKGDAEEALSKLSEYLISDSIVTLEQYST CDICKSTVVEVKSAIVCASVLKDGCDVGFCPHRHKLRSRVKFVNGRVVITNVGEPIISQ PSKLLNGIAYTTFSGSFDNGHYVVYDAANNAVYDGARLFSSDLSTLAVTAIVVVGGCVT SNVPTIVSEKISVMDKLDTGAQKFFQFGDFVMNNIVLFLTWLLSMFSLLRTSIMKHDIK VIAKAPKRTGVILTRSFKYNIRSALFVIKQKWCVIVTLFKFLLLLYAIYALVFMIVQFS PFNSLLCGDIVSGYEKSTFNKDIYCGNSMVCKMCLFSYQEFNDLDHTSLVWKHIRDPIL ISLQPFVILVILLIFGNMYLRFGLLYFVAQFISTFGSFLGFHQKQWFLHFVPFDVLCNE FLATFIVCKIVLFVRHIIVGCNNADCVACSKSARLKRVPLQTIINGMHKSFYVNANGGT CFCNKHNFFCVNCDSFGPGNTFINGDIARELGNVVKTAVQPTAPAYVIIDKVDFVNGFY RLYSGDTFWRYDFDITESKYSCKEVLKNCNVLENFIVYNNSGSNITQIKNACVYFSQLL CEPIKLVNSELLSTLSVDFNGVLHKAYVDVLCNSFFKELTANMSMAECKATLGLTVSDD DFVSAVANAHRYDVLLSDLSFNNFFISYAKPEDKLSVYDIACCMRAGSKVVNHNVLIKE SIPIVWGVKDFNTLSQEGKKYLVKTTKAKGLTFLLTFNDNQAITQVPATSIVAKQGAGF KRTYNFLWYVCLFVVALFIGVSFIDYTTTVTSFHGYDFKYIENGQLKVFEAPLHCVRNV FDNFNQWHEAKFGVVTTNSDKCPIVVGVSERINVVPGVPTNVYLVGKTLVFTLQAAFGN TGVCYDFDGVTTSDKCIFNSACTRLEGLGGDNVYCYNTDLIEGSKPYSTLQPNAYYKYD AKNYVRFPEILARGFGLRTIRTLATRYCRVGECRDSHKGVCFGFDKWYVNDGRVDDGYI CGDGLIDLLVNVLSIFSSSFSVVAMSGHMLFNFLFAAFITFLCFLVTKFKRVFGDLSYG VFTVVCATLINNISYVVTQNLFFMLLYAILYFVFTRTVRYAWIWHIAYIVAYFLLIPWW LLTWFSFAAFLELLPNVFKLKISTQLFEGDKFIGTFESAAAGTFVLDMRSYERLINTIS PEKLKNYAASYNKYKYYSGSASEADYRCACYAHLAKAMLDYAKDHNDMLYSPPTISYNS TLQSGLKKMAQPSGCVERCVVRVCYGSTVLNGVWLGDTVTCPRHVIAPSTTVLIDYDHA YSTMRLHNFSVSHNGVFLGVVGVTMHGSVLRIKVSQSNVHTPKHVFKTLKPGDSFNILA CYEGIASGVFGVNLRTNFTIKGSFINGACGSPGYNVRNDGTVEFCYLHQIELGSGAHVG SDFTGSVYGNFDDQPSLQVESANLMLSDNVVAFLYAALLNGCRWWLCSTRVNVDGFNEW AMANGYTSVSSVECYSILAAKTGVSVEQLLASIQHLHEGFGGKNILGYSSLCDEFTLAE VVKQMYGVNLQSGKVIFGLKTMFLFSVFFTMFWAELFIYTNTIWINPVILTPIFCLLLF LSLVLTMFLKHKFLFLQVFLLPTVIATALYNCVLDYYIVKFLADHFNYNVSVLQMDVQG LVNVLVCLFVVFLHTWRFSKERFTHWFTYVCSLIAVAYTYFYSGDFLSLLVMFLCAISS DWYIGAIVFRLSRLIVFFSPESVFSVFGDVKLTLVVYLICGYLVCTYWGILYWFNRFFK CTMGVYDFKVSAAEFKYMVANGLHAPHGPFDALWLSFKLLGIGGDRCIKISTVQSKLTD LKCTNVVLLGCLSSMNIAANSSEWAYCVDLHNKINLCDDPEKAQSMLLALLAFFLSKHS DFGLDGLIDSYFDNSSTLQSVASSFVSMPSYIAYENARQAYEDAIANGSSSQLIKQLKR AMNIAKSEFDHEISVQKKINRMAEQAATQMYKEARSVNRKSKVISAMHSLLFGMLRRLD MSSVETVLNLARDGVVPLSVIPATSASKLTIVSPDLESYSKIVCDGSVHYAGVVWTLND VKDNDGRPVHVKEITKENVETLTWPLILNCERVVKLQNNEIMPGKLKQKPMKAEGDGGV LGDGNALYNTEGGKTFMYAYISNKADLKFVKWEYEGGCNTIELDSPCRFMVETPNGPQV KYLYFVKNLNTLRRGAVLGFIGATIRLQAGKQTELAVNSGLLTACAFSVDPATTYLEAV KHGAKPVSNCIKMLSNGAGNGQAITTSVDANTNQDSYGGASICLYCRAHVPHPSMDGYC KFKGKCVQVPIGCLDPIRFCLENNVCNVCGCWLGHGCACDRTTIQSVDISYLNRARGSS AARLEPCNGTDIDKCVRAFDIYNKNVSFLGKCLKMNCVRFKNADLKDGYFVIKRCTKSV MEHEQSMYNLLNFSGALAEHDFFTWKDGRVIYGNVSRHNLTKYTMMDLVYAMRNFDEQN CDVLKEVLVLTGCCDNSYFDSKGWYDPVENEDIHRVYASLGKIVARAMLKCVALCDAMV AKGVVGVLTLDNQDLNGNFYDFGDFVVSLPNMGVPCCTSYYSYMMPIMGLTNCLASECF VKSDIFGSDFKTFDLLKYDFTEHKENLFNKYFKHWSFDYHPNCSDCYDDMCVIHCANFN TLFATTIPGTAFGPLCRKVFIDGVPLVTTAGYHFKQLGLVWNKDVNTHSVRLTITELLQ FVTDPSLIIASSPALVDQRTICFSVAALSTGLTNQVVKPGHFNEEFYNFLRLRGFFDEG SELTLKHFFFAQNGDAAVKDFDFYRYNKPTILDICQARVTYKIVSRYFDIYEGGCIKAC EVVVTNLNKSAGWPLNKFGKASLYYESISYEEQDALFALTKRNVLPTMTQLNLKYAISG KERARTVGGVSLLSTMTTRQYHQKHLKSIVNTRNATVVIGTTKFYGGWNNMLRTLIDGV ENPMLMGWDYPKCDRALPNMIRMISAMVLGSKHVNCCTATDRFYRLGNELAQVLTEVVY SNGGFYFKPGGTTSGDASTAYANSIFNIFQAVSSNINRLLSVPSDSCNNVNVRDLQRRL YDNCYRLTSVEESFIDDYYGYLRKHFSMMILSDDGVVCYNKDYAELGYIADISAFKATL YYQNNVFMSTSKCWVEEDLTKGPHEFCSQHTMQIVDKDGTYYLPYPDPSRILSAGVFVD DVVKTDAVVLLERYVSLAIDAYPLSKHPNSEYRKVFYVLLDWVKHLNKNLNEGVLESFS VTLLDNQEDKFWCEDFYASMYENSTILQAAGLCVVCGSQTVLRCGDCLRKPMLCTKCAY DHVFGTDHKFILAITPYVCNASGCGVSDVKKLYLGGLNYYCTNHKPQLSFPLCSAGNIF GLYKNSATGSLDVEVFNRLATSDWTDVRDYKLANDVKDTLRLFAAETIKAKEESVKSSY AFATLKEVVGPKELLLSWESGKVKPPLNRNSVFTCFQISKDSKFQIGEFIFEKVEYGSD TVTYKSTVTTKLVPGMIFVLTSHNVQPLRAPTIANQEKYSSIYKLHPAFNVSDAYANLV PYYQLIGKQKITTIQGPPGSGKSHCSIGLGLYYPGARIVFVACAHAAVDSLCAKAMTVY SIDKCTRIIPARARVECYSGFKPNNTSAQYIFSTVNALPECNADIVVVDEVSMCTNYDL SVINQRLSYKHIVYVGDPQQLPAPRVMITKGVMEPVDYNVVTQRMCAIGPDVFLHKCYR CPAEIVNTVSELVYENKFVPVKPASKQCFKVFFKGNVQVDNGSSINRKQLEIVKLFLVK NPSWSKAVFISPYNSQNYVASRFLGLQIQTVDSSQGSEYDYVIYAQTSDTAHACNVNRF NVAITRAKKGIFCVMCDKTLFDSLKFFEIKHADLHSSQVCGLFKNCTRTPLNLPPTHAH TFLSLSDQFKTTGDLAVQIGSNNVCTYEHVISFMGFRFDISIPGSHSLFCTRDFAIRNV RGWLGMDVESAHVCGDNIGTNVPLQVGFSNGVNFVVQTEGCVSTNFGDVIKPVCAKSPP GEQFRHLIPLLRKGQPWLIVRRRIVQMISDYLSNLSDILVFVLWAGSLELTTMRYFVKI GPIKYCYCGNSATCYNSVSNEYCCFKHALGCDYVYNPYAFDIQQWGYVGSLSQNHHTFC NIHRNEHDASGDAVMTRCLAVHDCFVKNVDWTVTYPFIANEKFINGCGRNVQGHVVRAA LKLYKPSVIHDIGNPKGVRCAVTDAKWYCYDKQPVNSNVKLLDYDYATHGQLDGLCLFW NCNVDMYPEFSIVCRFDTRTRSVFNLEGVNGGSLYVNKHAFHTPAYDKRAFVKLKPMPF FYFDDSDCDVVQEQVNYVPLRASSCVTRCNIGGAVCSKHANLYQKYVEAYNTFTQAGFN IWVPHSFDVYNLWQIFIETNLQSLENIAFNVVKKGCFTGVDGELPVAVVNDKVFVRYGD VDNLVFTNKTTLPTNVAFELFAKRKMGLTPPLSILKNLGVVATYKFVLWDYEAERPFTS YTKSVCKYTDFNEDVCVCFDNSIQGSYERFTLTTNAVLFSTVVIKNLTPIKLNFGMLNG MPVSSIKGDKGVEKLVNWYIYVRKNGQFQDHYDGFYTQGRNLSDFTPRSDMEYDFLNMD MGVFINKYGLEDFNFEHVVYGDVSKTTLGGLHLLISQFRLSKMGVLKADDFVTASDTTL RCCTVTYLNELSSKVVCTYMDLLLDDFVTILKSLDLGVISKVHEVIIDNKPYRWMLWCK DNHLSTFYPQLQSAEWKCGYAMPQIYKLQRMCLEPCNLYNYGAGIKLPSGIMLNVVKYT QLCQYLNSTTMCVPHNMRVLHYGAGSDKGVAPGTTVLKRWLPPDAIIIDNDINDYVSDA DFSITGDCATVYLEDKFDLLISDMYDGRIKFCDGENVSKDGFFTYLNGVIREKLAIGGS VAIKITEYSWNKYLYELIQRFAFWTLFCTSVNTSSSEAFLIGINYLGDFIQGPFIAGNT VHANYIFWRNSTIMSLSYNSVLDLSKFECKHKATVVVTLKDSDVNDMVLSLIKSGRLLL RNNGRFGGFSNHLVSTK" gene 20472..24542 /locus_tag="HCNV63gp2" /db_xref="GeneID:2943499" CDS 20472..24542 /locus_tag="HCNV63gp2" /inference="non-experimental evidence, no additional details recorded" /note="ORF2" /codon_start=1 /product="spike protein" /protein_id="YP_003767.1" /db_xref="GeneID:2943499" /translation="MKLFLILLVLPLASCFFTCNSNANLSMLQLGVPDNSSTIVTGLLP THWFCANQSTSVYSANGFFYIDVGNHRSAFALHTGYYDANQYYIYVTNEIGLNASVTLK ICKFSRNTTFDFLSNASSSFDCIVNLLFTEQLGAPLGITISGETVRLHLYNVTRTFYVP AAYKLTKLSVKCYFNYSCVFSVVNATVTVNVTTHNGRVVNYTVCDDCNGYTDNIFSVQQ DGRIPNGFPFNNWFLLTNGSTLVDGVSRLYQPLRLTCLWPVPGLKSSTGFVYFNATGSD VNCNGYQHNSVVDVMRYNLNFSANSLDNLKSGVIVFKTLQYDVLFYCSNSSSGVLDTTI PFGPSSQPYYCFINSTINTTHVSTFVGILPPTVREIVVARTGQFYINGFKYFDLGFIEA VNFNVTTASATDFWTVAFATFVDVLVNVSATNIQNLLYCDSPFEKLQCEHLQFGLQDGF YSANFLDDNVLPETYVALPIYYQHTDINFTATASFGGSCYVCKPHQVNISLNGNTSVCV RTSHFSIRYIYNRVKSGSPGDSSWHIYLKSGTCPFSFSKLNNFQKFKTICFSTVEVPGS CNFPLEATWHYTSYTIVGALYVTWSEGNSITGVPYPVSGIREFSNLVLNNCTKYNIYDY VGTGIIRSSNQSLAGGITYVSNSGNLLGFKNVSTGNIFIVTPCNQPDQVAVYQQSIIGA MTAVNESRYGLQNLLQLPNFYYVSNGGNNCTTAVMTYSNFGICADGSLIPVRPRNSSDN GISAIITANLSIPSNWTTSVQVEYLQITSTPIVVDCATYVCNGNPRCKNLLKQYTSACK TIEDALRLSAHLETNDVSSMLTFDSNAFSLANVTSFGDYNLSSVLPQRNIRSSRIAGRS ALEDLLFSKVVTSGLGTVDVDYKSCTKGLSIADLACAQYYNGIMVLPGVADAERMAMYT GSLIGGMVLGGLTSAAAIPFSLALQARLNYVALQTDVLQENQKILAASFNKAINNIVAS FSSVNDAITQTAEAIHTVTIALNKIQDVVNQQGSALNHLTSQLRHNFQAISNSIQAIYD RLDSIQADQQVDRLITGRLAALNAFVSQVLNKYTEVRGSRRLAQQKINECVKSQSNRYG FCGNGTHIFSIVNSAPDGLLFLHTVLLPTDYKNVKAWSGICVDGIYGYVLRQPNLVLYS DNGVFRVTSRVMFQPRLPVLSDFVQIYNCNVTFVNISRVELHTVIPDYVDVNKTLQEFA QNLPKYVKPNFDLTPFNLTYLNLSSELKQLEAKTASLFQTTVELQGLIDQINSTYVDLK LLNRFENYIKWPWWVWLIISVVFVVLLSLLVFCCLSTGCCGCCNCLTSSMRGCCDCGST KLPYYEFEKVHVQ" gene 24542..25219 /locus_tag="HCNV63gp3" /db_xref="GeneID:2943500" CDS 24542..25219 /locus_tag="HCNV63gp3" /inference="non-experimental evidence, no additional details recorded" /note="ORF3" /codon_start=1 /product="protein 3" /protein_id="YP_003768.1" /db_xref="GeneID:2943500" /translation="MPFGGLFQLTLESTINKSVANLKLPPHDVTVLRDNLKPVTTLSTI TAYLLVSLFVTYFALFKPLTARGRVACFVLKLLTLFVYVPLLVLFGMYLDSFIIFSTLL FRFIHVGYYAYLYKNFSFVLFNVTKLCFVSGKCWYLEQSFYENRFAAIYGGDHYVVLGG ETITFVSFDDLYVAIRGSCEKNLQLMRKVDLYNGAVIYIFAEEPVVGIVYSSQLYEDVP SIN" gene 25200..25433 /locus_tag="HCNV63gp4" /db_xref="GeneID:2943502" CDS 25200..25433 /locus_tag="HCNV63gp4" /inference="non-experimental evidence, no additional details recorded" /note="ORF4; small membrane protein" /codon_start=1 /product="envelope protein" /protein_id="YP_003769.1" /db_xref="GeneID:2943502" /translation="MFLRLIDDNGIVLNSILWLLVMIFFFVLAMTFIKLIQLCFTCHYF FSRTLYQPVYKIFLAYQDYMQIAPVPAEVLNV" gene 25442..26122 /locus_tag="HCNV63gp5" /db_xref="GeneID:2943503" CDS 25442..26122 /locus_tag="HCNV63gp5" /inference="non-experimental evidence, no additional details recorded" /note="ORF5" /codon_start=1 /product="membrane protein" /protein_id="YP_003770.1" /db_xref="GeneID:2943503" /translation="MSNSSVPLLEVYVHLRNWNFSWNLILTLFIVVLQYGHYKYSRLLY GLKMSVLWCLWPLVLALSIFDCFVNFNVDWVFFGFSILMSIITLCLWVMYFVNSFRLWR RVKTFWAFNPETNAIISLQVYGHNYYLPVMAAPTGVTLTLLSGVLLVDGHKIATRVQVG QLPKYVIVATPSTTIVCDRVGRSVNETSQTGWAFYVRAKHGDFSGVASQEGVLSEREKL LHLI" gene 26133..27266 /locus_tag="HCNV63gp6" /db_xref="GeneID:2943504" CDS 26133..27266 /locus_tag="HCNV63gp6" /inference="non-experimental evidence, no additional details recorded" /note="ORF6" /codon_start=1 /product="nucleocapsid protein" /protein_id="YP_003771.1" /db_xref="GeneID:2943504" /translation="MASVNWADDRAARKKFPPPSFYMPLLVSSDKAPYRVIPRNLVPIG KGNKDEQIGYWNVQERWRMRRGQRVDLPPKVHFYYLGTGPHKDLKFRQRSDGVVWVAKE GAKTVNTSLGNRKRNQKPLEPKFSIALPPELSVVEFEDRSNNSSRASSRSSTRNNSRDS SRSTSRQQSRTRSDSNQSSSDLVAAVTLALKNLGFDNQSKSPSSSGTSTPKKPNKPLSQ PRADKPSQLKKPRWKRVPTREENVIQCFGPRDFNHNMGDSDLVQNGVDAKGFPQLAELI PNQAALFFDSEVSTDEVGDNVQITYTYKMLVAKDNKNLPKFIEQISAFTKPSSIKEMQS QSSHVAQNTVLNASIPESKPLADDDSAIIEIVNEVLH" 3'UTR 27267..27553 /inference="non-experimental evidence, no additional details recorded" ORIGIN 1 cttaaagaat ttttctatct atagatagag aattttctta tttagacttt gtgtctactc 61 ttctcaacta aacgaaattt ttctagtgct gtcatttgtt atggcagtcc tagtgtaatt 121 gaaatttcgt caagtttgta aactggttag gcaagtgttg tattttctgt gtctaagcac 181 tggtgattct gttcactagt gcatacattg atatttaagt ggtgttccgt cactgcttat 241 tgtggaagca acgttctgtc gttgtggaaa ccaataactg ctaaccatgt tttacaatca 301 agtgacactt gctgttgcaa gtgattcgga aatttcaggt tttggttttg ccattccttc 361 tgtagccgtt cgcacctata gcgaagccgc tgcacaaggt tttcaggcat gccgttttgt 421 tgcttttggc ttacaggatt gtgtaaccgg tattaatgat gatgattatg tcattgcatt 481 gactggtact aatcagctct gtgccaaaat tttacctttt tctgatagac cccttaattt 541 gcgaggttgg ctcatttttt ctaacagcaa ttatgttctt caggactttg atgttgtttt 601 tggccatggt gcaggaagtg tggtttttgt ggataagtac atgtgtggtt ttgatggtaa 661 acctgtgtta cctaaaaaca tgtgggaatt tagggattac tttaataata atactgatag 721 tattgttatt ggtggtgtca cttatcaact agcatgggat gttatacgta aagacctttc 781 ttatgaacag caaaatgttt tagccattga gagcattcat taccttggta ctacaggtca 841 tactttgaag tctggttgca aacttactaa tgctaagccg cctaaatatt cttctaaggt 901 tgttttgagt ggtgaatgga atgctgtgta tagggcgttt ggttcaccat ttattacaaa 961 tggtatgtca ttgctagata taattgttaa accagttttc tttaatgctt ttgttaaatg 1021 caattgtggt tctgagagtt ggagtgttgg tgcatgggat ggttacttat cttcttgttg 1081 tggcacacct gctaagaaac tttgtgttgt tcctggtaat gtcgttcctg gtgatgtgat 1141 catcacctca actagtgctg gttgtggtgt taaatactat gctggcttag ttgttaaaca 1201 tattactaac attactggtg tgtctttatg gcgtgttaca gctgttcatt ctgatggaat 1261 gtttgtggca tcatcttctt atgatgcact cttgcataga aattcattag accctttttg 1321 ctttgatgtt aacactttac tttctaatca attacgtcta gcttttcttg gtgcttctgt 1381 tacagaagat gttaaatttg ctgctagcac tggtgttatt gacattagtg ctggtatgtt 1441 tggtctttac gatgacatat tgacaaacaa taaaccttgg tttgtacgca aagcttctgg 1501 gctttttgat gcaatctggg atgcttttgt tgccgctatt aagcttgtac caactactac 1561 tggtgttttg gttaggtttg ttaagtctat tgcttcaact gttttaactg tctctaatgg 1621 tgttattatt atgtgtgcag atgttccaga tgcttttcaa tcagtttatc gcacatttac 1681 acaagctatt tgtgctgcat ttgatttttc tttagatgta tttaaaattg gtgatgttaa 1741 atttaaacga cttggtgatt atgttcttac tgaaaacgct cttgttcgtt tgactactga 1801 agttgttcgt ggtgttcgtg atgctcgcat aaagaaagcc atgtttacta aagtagttgt 1861 aggtcctaca actgaagtta agttttctgt tattgaactt gccactgtta atttgcgtct 1921 tgttgattgt gcacctgtag tttgccctaa aggtaagatt gttgttattg ctggacaagc 1981 ttttttctat agtggtggtt tttatcgttt tatggttgat cctacaactg tattaaatga 2041 tcctgttttt actggtgatt tattctacac tattaagttt agtggtttta agcttgatgg 2101 ttttaaccat cagtttgtta ctgctagttc tgctacagat gccattattg ctgttgagct 2161 gttgttattg gattttaaaa ctgcagtttt tgtgtacaca tgtgtggttg atggctgtag 2221 tgtcattgtt agacgtgatg ctacattcgc tacacatgtg tgttttaagg actgttataa 2281 tgtttgggag caattctgca ttgataattg tggtgagcca tggtttttga ctgattataa 2341 tgctatcttg cagagtaata accctcaatg tgctattgtt caagcatcag agtctaaagt 2401 tttgcttgag aggtttttac ctaagtgtcc tgaaatactg ttgagtattg atgatggcca 2461 tttatggaat ctttttgttg aaaagtttaa ttttgttaca gattggttaa aaactcttaa 2521 gcttacactt acttctaatg gtcttttagg taattgtgcc aaacgtttta gacgtgtttt 2581 ggtaaaattg cttgatgtct ataatggttt tcttgaaact gtctgtagtg tcgcatacac 2641 tgctggtgtt tgcatcaaat attatgctgt taatgttcca tatgtagtta ttagtggttt 2701 tgtaagtcgt gtaattcgta gagaaaggtg tgacatgact tttccttgtg ttagttgtgt 2761 cacctttttc tatgaatttt tagacacttg ttttggtgtt agtaaaccta atgccattga 2821 tgttgaacat ttagagctta aagaaactgt ttttgttgaa cctaaggatg gtggtcaatt 2881 ttttgtttct ggtgattatc tttggtatgt tgtagatgac atttattatc cagcttcatg 2941 taatggtgta ttgcctgttg cttttacaaa attagctggt ggtaaaatat ctttttctga 3001 tgatgttata gttcatgatg ttgaacctac ccataaagtc aagctcatat ttgagtttga 3061 agatgatgtt gttaccagtc tttgtaagaa gagttttggt aagtccatta tttatacagg 3121 tgattgggaa ggtctacatg aagttcttac atctgcaatg aatgtcattg ggcaacatat 3181 taagttgcca caattttata tttatgatga agagggtggt tatgatgttt ctaaaccagt 3241 tatgatttca caatggccta ttagtaatga tagtaatggt tgtgttgttg aagcgagcac 3301 tgattttcat caattagaat gtattgttga tgactctgtt agagaagagg ttgatataat 3361 tgaacaacct tttgaagaag ttgaacatgt gctctcaatt aagcaacctt tttctttttc 3421 ttttagagat gaattgggtg ttcgtgtttt agatcaatct gataataatt gttggattag 3481 taccacactt gtacagttgc aacttacaaa gcttttggat gattctattg agatgcaatt 3541 gtttaaagtt ggtaaagttg attcaattgt ccaaaagtgt tatgagttgt ctcatttaat 3601 tagtggttca cttggtgata gtggtaaact tcttagtgaa cttcttaaag aaaaatatac 3661 atgttctata acttttgaga tgtcttgtga ttgtggtaaa aagtttgatg atcaggttgg 3721 ttgtttgttt tggattatgc cttacacaaa actttttcaa aaaggtgagt gttgtatttg 3781 tcataaaatg cagacttata agcttgttag tatgaaaggt actggtgtgt ttgtacagga 3841 tccagcacct attgacattg atgctttccc tgtgaaacct atatgttcat ctgtatattt 3901 aggtgttaag ggttctggtc attatcaaac aaatttatac agttttaaca aagctattga 3961 tggttttggt gtctttgaca ttaaaaatag tagtgttaat actgtttgtt ttgttgatgt 4021 tgattttcat agtgtagaaa tagaagctgg tgaagttaaa ccttttgctg tatataaaaa 4081 tgttaaattt tatttaggtg atatttcaca ccttgtaaac tgtgtttctt ttgactttgt 4141 tgtcaatgct gctaatgaaa atctcttgca tggaggcggt gttgcacgtg ctattgatat 4201 tttgactgaa ggtcaacttc agtcattatc taaagattac attagtagta atggtccact 4261 taaggttgga gcaggtgtta tgttggagtg tgaaaaattc aacgtattta atgttgttgg 4321 tccgcgaact ggtaaacatg agcattcatt acttgttgaa gcttataatt ctattttatt 4381 tgaaaatggt attccactta tgcctcttct tagttgtggt atttttggtg taaggattga 4441 aaattctctt aaagctttgt ttagttgtga cattaataaa ccattgcaag tttttgttta 4501 ttcttcaaat gaagaacaag ctgttcttaa gtttttagat ggtttagatt taacaccagt 4561 cattgatgat gttgatgttg ttaaaccttt tagagttgaa ggtaattttt cattctttga 4621 ttgtggtgtc aatgccttgg atggtgatat ttacttatta tttactaact ctattttaat 4681 gttggataaa caaggacaat tattggacac aaaacttaat ggtattttgc aacaggcagc 4741 tcttgattat cttgctacag ttaaaactgt accagctggt aatttggtta aactttttgt 4801 tgagagttgt accatttata tgtgtgttgt accatcgata aatgatcttt cttttgataa 4861 aaatcttggt cgttgtgtgc gtaaacttaa tagattgaaa acttgtgtta ttgccaatgt 4921 tcctgctatt gatgttttga aaaagcttct ttcaagtttg actttaactg ttaaatttgt 4981 tgtagagagt aatgttatgg atgttaacga ctgttttaag aatgataatg tagttttgaa 5041 aattactgaa gatggtatta atgttaaaga tgttgttgtt gagtcttcta agtcacttgg 5101 taaacaattg ggtgttgtga gtgatggtgt tgactctttt gaaggtgttt tacctattaa 5161 tactgatact gtcttatctg tagctccaga agttgactgg gttgcttttt acggttttga 5221 aaaggcagca ctttttgctt ctttggatgt aaagccatat ggttacccta atgattttgt 5281 tggtggtttt agagttcttg ggaccaccga caataattgt tgggttaatg caacttgtat 5341 aattttacag tatcttaagc ctacttttaa atctaagggt ttaaatgttc tttggaacaa 5401 atttgttaca ggtgatgttg gaccttttgt tagttttatt tattttataa ctatgtcttc 5461 aaagggtcaa aagggtgatg ctgaagaggc attatctaaa ttgtcagagt atttgattag 5521 tgattctatt gttactcttg aacaatattc aacttgtgac atttgtaaaa gtactgtagt 5581 tgaagttaaa agtgctattg tctgtgctag tgtgcttaaa gatggttgtg atgttggttt 5641 ttgtccacac agacataaat tgcgttcacg tgttaagttt gttaatggac gtgttgttat 5701 taccaatgtt ggtgaaccta taatttcaca accttctaag ttgcttaatg gtattgctta 5761 tacaacattt tcaggttctt ttgataacgg tcactatgta gtttatgatg ctgctaataa 5821 tgctgtctat gatggtgctc gtttattttc ttcagatttg tctactttag ctgttacagc 5881 tattgttgta gtaggtggtt gtgtaacatc taatgttcca acaattgtta gtgagaaaat 5941 ttctgttatg gataaacttg atactggtgc acaaaaattt ttccaatttg gtgattttgt 6001 tatgaataac attgttctgt ttttaacttg gttgcttagt atgtttagtc ttttacgtac 6061 ttctattatg aagcatgata ttaaagttat tgccaaggct cctaaacgta caggtgttat 6121 tttgacacgt agttttaagt ataacattag atctgctttg tttgttataa agcagaagtg 6181 gtgtgttatt gttactttgt ttaagttctt attattatta tatgctattt atgcacttgt 6241 ttttatgatt gtgcaattta gtccttttaa tagtctttta tgtggtgaca ttgtaagtgg 6301 ttatgaaaaa tccactttta ataaggatat ttattgtggt aattctatgg tttgtaagat 6361 gtgtttgttc agttatcaag agtttaatga tttggatcat actagtcttg tttggaagca 6421 cattcgtgat cctatattaa tcagtttaca accatttgtt atacttgtta ttttgttaat 6481 ttttggtaat atgtatttgc gttttggact tttatatttt gttgcacaat ttattagtac 6541 ttttggttct ttcttaggct ttcatcagaa acagtggttt ttacattttg tgccgtttga 6601 tgttttatgt aatgagtttt tagctacatt tattgtctgc aaaatcgttt tatttgttag 6661 acatattatt gttggctgta ataatgctga ctgtgtagct tgttctaaaa gtgctagact 6721 taaacgtgta ccacttcaaa ctattattaa tggtatgcat aaatcattct atgttaatgc 6781 taatggtggt acttgtttct gtaataaaca taacttcttt tgtgttaatt gtgattcttt 6841 tgggcctggt aatactttta ttaatggtga tattgcaaga gagcttggta atgttgttaa 6901 aacagctgtt caacccacag ctcctgcata tgttattatt gataaggtag attttgttaa 6961 tggattttat cgtctttata gtggtgacac tttttggcgg tatgactttg acattactga 7021 atctaagtat agttgtaaag aggttctgaa gaattgtaat gttttagaaa attttattgt 7081 ttacaataat agtggtagta acattacaca gattaaaaat gcttgtgttt atttttctca 7141 attgttgtgt gaacctataa agttggtaaa ttcagagttg ttgtcaactt tatctgttga 7201 ttttaatggt gttttgcata aggcatatgt tgatgttttg tgtaatagtt tttttaagga 7261 gttaactgct aacatgtcca tggctgaatg taaagctaca cttggtttga ctgtttctga 7321 tgatgatttt gtttcagctg ttgccaatgc acataggtat gacgttttgc tttcagattt 7381 gtcatttaat aattttttta tttcttatgc taaacctgaa gataagttgt ccgtttatga 7441 cattgcttgt tgtatgcgtg ccggttctaa ggttgttaac cataatgttt taattaaaga 7501 gtcaatacct attgtttggg gtgtcaagga ctttaatact ctttctcaag aaggtaagaa 7561 gtaccttgtt aaaacaacta aagcaaaggg tttgactttt ttattaactt ttaatgataa 7621 ccaagcaatt acacaagttc ctgctactag tatagttgca aaacagggtg ctggttttaa 7681 acgtacttat aattttctgt ggtatgtatg tttatttgtt gttgcattgt ttattggtgt 7741 ctcatttatt gattatacaa ccactgtaac tagctttcat ggttatgatt ttaagtacat 7801 tgagaatggt cagttgaagg tgtttgaagc acctttacac tgtgttcgta atgtttttga 7861 taattttaat caatggcatg aggctaagtt tggtgttgtt actactaata gtgataaatg 7921 tcctatagtt gttggtgttt cagagcgtat taatgttgtt cctggtgttc caacaaatgt 7981 atatttggta ggaaagactc ttgtttttac attacaggct gcttttggaa acacaggtgt 8041 ttgttatgac tttgatggtg ttaccactag tgataagtgt atttttaatt ctgcttgtac 8101 taggttggaa ggtttgggtg gtgacaatgt ttattgttac aacactgatc ttattgaagg 8161 ttctaaacct tatagtactt tacagcccaa tgcgtattat aagtatgatg ctaaaaatta 8221 tgtacgtttt ccagaaattt tagctagagg ttttggctta cgtactatta gaactttggc 8281 tacacgttat tgtagagttg gtgaatgccg tgactcacat aaaggtgttt gttttggttt 8341 tgataaatgg tatgttaatg atggacgtgt tgatgacggt tacatttgtg gtgatggtct 8401 tatagacctt cttgttaatg tactctcaat ctttagttca tcttttagcg ttgtggctat 8461 gtctggacat atgttgttta attttctttt tgcagcattt attacatttt tgtgcttttt 8521 agttactaaa tttaaacgtg tttttggtga tctttcttat ggtgttttta ctgttgtttg 8581 tgcaactttg attaataaca tttcttatgt tgttactcaa aatttatttt ttatgttgct 8641 ttatgctatt ttgtattttg tttttactag gacagtgcgt tatgcttgga tttggcatat 8701 tgcatacatt gttgcatact tcttgttaat accatggtgg cttctcacat ggtttagttt 8761 tgctgcattt ttagagcttt tacctaatgt ttttaagtta aaaatctcta ctcaattgtt 8821 tgaaggtgat aagtttatag gtacttttga gagtgctgct gcaggtacat ttgttcttga 8881 catgcgttct tatgaaaggc tgataaatac tatttcacct gagaaactta agaattatgc 8941 tgcaagttat aataaatata aatattatag tggtagtgct agtgaggctg attatcgttg 9001 tgcttgttat gctcatttag ccaaggctat gttagattat gcaaaagatc ataatgacat 9061 gttatattct ccacctacta ttagctacaa ttccacctta caatctggtc ttaagaagat 9121 ggcacaacca tctggttgtg ttgagagatg tgtggttcgc gtctgttatg gtagtactgt 9181 gcttaatgga gtttggttag gtgacactgt tacttgtcct agacatgtca tagcaccatc 9241 aaccactgtt cttattgatt atgatcatgc atatagtact atgcgtttgc ataatttttc 9301 agtgtctcat aatggtgtct tcttgggagt tgtcggtgtt acaatgcatg gttctgtgtt 9361 gcgtattaag gtttcacaat ctaatgtaca tacacctaaa catgttttta aaacgttgaa 9421 acctggtgat tcttttaata ttttagcatg ttatgaaggt attgcatctg gtgtttttgg 9481 tgttaattta cgtacaaact ttactattaa aggttctttt ataaatggag cttgtggttc 9541 tcctggttat aatgttagaa atgatggtac tgttgagttt tgttatttac accaaattga 9601 gttaggtagt ggtgctcatg ttggttctga ttttactggt agtgtttatg gtaattttga 9661 tgaccaacct agtttgcaag ttgagagtgc caaccttatg ctatcagata atgttgttgc 9721 ctttttgtat gctgctttgt tgaatggttg taggtggtgg ttgtgttcaa ctagagttaa 9781 tgttgatggt tttaatgaat gggctatggc taatggttat acaagtgttt ctagtgttga 9841 gtgctattct attttggcag caaaaactgg tgttagtgtt gaacaattgt tagcttccat 9901 tcaacatctt catgaaggtt ttggtggtaa aaacatactt ggttattcta gtttatgtga 9961 tgagttcaca ctagctgaag ttgtgaagca gatgtatggt gttaacttgc aaagtggtaa 10021 ggttattttt ggtttaaaaa caatgttttt atttagcgtt ttcttcacaa tgttttgggc 10081 agaactcttt atttatacaa acactatatg gataaaccct gtgatactta cacctatatt 10141 ttgtctactt ttgtttttgt cattagtttt aactatgttt cttaaacata agtttttgtt 10201 tttgcaagta tttttattac ctactgttat tgcaactgct ttatataatt gtgttttgga 10261 ttattacata gtaaaatttt tggctgacca ttttaactat aatgtttcag tattacaaat 10321 ggatgttcag ggtttagtta atgttttggt ctgtttattt gttgtatttt tacacacatg 10381 gcgcttttct aaagaacgtt ttacacattg gtttacatat gtgtgttctc ttatagcagt 10441 tgcttacact tatttttata gtggtgactt tttgagtttg cttgttatgt ttttatgtgc 10501 tatatctagt gattggtaca ttggtgccat tgtttttagg ttgtcacgtt tgattgtatt 10561 tttttcacct gaaagtgtat ttagtgtttt tggtgatgtg aaacttactt tagttgttta 10621 tttaatttgt ggttatttag tttgtactta ttggggcatt ttgtattggt tcaataggtt 10681 ttttaaatgt actatgggtg tttatgattt taaggtgagt gctgctgaat ttaaatacat 10741 ggttgctaat ggacttcatg caccacatgg accttttgat gcactttggt tatcattcaa 10801 actacttggt attggtggtg accgttgtat aaaaatttca actgtccaat ccaaactgac 10861 tgatttgaag tgtactaatg ttgtgttatt gggttgtttg tctagtatga acattgcagc 10921 taattctagt gaatgggctt attgtgttga tttacacaat aagattaatc tttgtgatga 10981 ccctgaaaaa gctcaaagta tgttgttagc actccttgcg ttctttctaa gtaaacatag 11041 tgattttggt cttgatggcc ttattgattc ttattttgat aatagtagca cccttcagag 11101 tgttgcttca tcatttgtta gtatgccatc atatattgct tatgaaaatg ctagacaagc 11161 ttatgaggat gctattgcta atggatcttc ttctcaactt attaaacaat tgaagcgtgc 11221 catgaatatc gcaaagtctg aatttgatca tgagatatct gttcagaaga aaattaatag 11281 aatggctgaa caagctgcta ctcagatgta taaagaagca cgctctgtta atagaaaatc 11341 taaagttatt agtgctatgc actctttact ttttggaatg ttaagacgtt tggatatgtc 11401 tagtgttgaa actgttttga atttagcacg tgatggtgtt gtgccattgt cagttatacc 11461 tgcaacttca gcttctaaac taactattgt tagtccagat cttgaatctt attctaagat 11521 tgtttgtgat ggttctgttc attatgctgg agttgtttgg acacttaatg atgttaaaga 11581 caatgatggt agacctgttc atgttaaaga gattacaaag gaaaatgttg aaactttgac 11641 atggcctctt atccttaatt gtgaacgtgt tgttaaactt caaaataatg aaattatgcc 11701 tggtaaactt aagcaaaaac ctatgaaagc tgagggtgat ggtggtgttt taggtgatgg 11761 taatgccttg tataatactg agggtggtaa aacttttatg tacgcttata tttctaataa 11821 agctgacctt aaatttgtta agtgggagta tgagggtggt tgcaacacaa tcgagttaga 11881 ctctccttgt cgatttatgg tcgaaacacc taatggtcct caagtgaagt atttgtattt 11941 tgttaaaaat ttaaatacct tacgtagagg tgccgttctt ggttttatag gtgccacaat 12001 tcgtctacaa gctggtaaac aaactgaatt ggctgttaat tctggacttt taactgcttg 12061 tgctttttct gttgatccag caactactta cttggaagct gttaaacatg gtgcaaaacc 12121 tgtaagtaat tgtattaaga tgttatctaa tggtgctggt aatggtcaag ctataacaac 12181 tagtgtagat gctaacacca atcaagattc ttatggtgga gcgtctattt gtttgtattg 12241 tcgggcccac gttcctcacc ctagtatgga tggttactgt aagtttaagg gtaaatgtgt 12301 tcaggttcct attggttgtt tggatcctat taggttttgt ttagaaaata atgtgtgtaa 12361 tgtttgtggt tgttggttgg gacacgggtg tgcttgtgac cgtacaacta ttcaaagtgt 12421 tgacatttct tatttaaacg agcaaggggt tctagtgcag ctcgactaga accctgcaat 12481 ggcacggaca tcgataagtg tgttcgtgct tttgacattt ataataaaaa tgtttcattc 12541 ttgggtaagt gtttgaagat gaactgtgtt cgttttaaaa atgctgatct taaggatggt 12601 tattttgtta taaagaggtg tactaagtcg gttatggaac acgagcaatc catgtataac 12661 ctacttaact tttctggtgc tttggctgag catgatttct ttacttggaa agatggcaga 12721 gtcatttatg gtaatgttag tagacataat cttactaaat atactatgat ggacttggtc 12781 tatgctatgc gtaactttga tgaacaaaat tgtgatgttc taaaagaagt attagtttta 12841 actggttgtt gtgacaattc ttattttgat agtaagggtt ggtatgaccc agttgaaaat 12901 gaagatatac atagagttta tgcatctctt ggcaaaattg tagctagagc tatgcttaaa 12961 tgcgttgctc tatgcgatgc gatggttgct aaaggtgttg ttggtgtttt aacattagat 13021 aaccaagatc ttaatggtaa cttttatgat tttggtgatt ttgttgttag cttacctaat 13081 atgggtgttc cctgttgtac atcatattat tcttatatga tgcctattat gggtttaact 13141 aattgtttag ctagtgagtg ttttgtcaag agtgatattt ttggtagtga ttttaaaact 13201 tttgatttgc ttaagtatga tttcactgaa cataaagaaa atttattcaa taagtacttt 13261 aagcattgga gttttgatta tcatcctaat tgtagtgact gttatgatga tatgtgtgtt 13321 atacattgtg ctaattttaa tacactattt gccacaacta taccaggtac tgcttttggt 13381 ccactatgtc gtaaagtttt tatagatggt gttccacttg ttacaactgc tggttatcat 13441 tttaagcaat taggtttggt ttggaataaa gatgttaaca cacactcagt taggttgaca 13501 attactgaac ttttgcaatt tgtcaccgac ccttccttga taatagcttc ttccccagca 13561 ctcgttgatc aacgcactat ttgtttttct gttgcagcat tgagtactgg tttgacaaat 13621 caagttgtta agccaggtca ttttaatgaa gagttttata actttcttcg tttaagaggt 13681 ttctttgatg aaggttctga acttacatta aaacatttct tcttcgcaca gaatggtgat 13741 gctgctgtta aagattttga cttttaccgt tataataagc ctaccatttt agatatttgt 13801 caagctagag ttacatataa gatagtctct cgttattttg acatttatga aggtggctgt 13861 attaaggcat gtgaagttgt tgtaacaaat cttaataaga gtgctggttg gccattaaat 13921 aagtttggta aagctagttt gtattatgaa tctatatctt atgaagaaca ggatgctttg 13981 tttgctttga caaagcgtaa tgtcctccct actatgacac agctgaatct taagtatgct 14041 attagtggta aagaacgtgc tagaactgtt ggtggtgttt ctctgttgtc tacaatgacc 14101 acaagacaat accatcaaaa acatcttaaa tccattgtta atacacgcaa tgccactgtt 14161 gttattggta ctaccaaatt ttatggtggt tggaataata tgttgcgtac tttaattgat 14221 ggtgttgaaa accctatgct tatgggttgg gattatccca aatgtgatag agctttgcct 14281 aacatgatac gtatgatttc agccatggtg ttgggctcta agcatgttaa ttgttgtact 14341 gcaacagata ggttttatag gcttggtaat gagttggcac aagttttaac agaagttgtt 14401 tattctaatg gtggttttta ttttaagcca ggtggtacga cttctggtga cgctagtaca 14461 gcttatgcta attctatttt taacattttt caagccgtga gttctaacat taacaggttg 14521 cttagtgtcc catcagattc atgtaataat gttaatgtta gggatctaca acgacgtctg 14581 tatgataatt gttataggtt aactagtgtt gaagagtcat tcattgatga ttattatggt 14641 tatcttagga aacatttttc aatgatgatt ctctctgatg acggtgttgt ctgttataac 14701 aaggattatg ctgagttagg ttatatagca gacattagtg cttttaaagc cactttgtat 14761 taccagaata atgtctttat gagtacttct aaatgttggg ttgaagaaga tttaactaag 14821 ggaccacatg agttttgttc ccagcatact atgcaaatag ttgacaaaga tggtacctat 14881 tatttgcctt acccagatcc tagtaggatc ttgtcagctg gtgtttttgt tgatgatgtt 14941 gttaagacag atgctgttgt tttgttagaa cgttatgtgt ctttagctat tgatgcatac 15001 cctctttcaa aacaccctaa ttccgaatat cgtaaggttt tttacgtatt acttgattgg 15061 gttaagcatc ttaacaaaaa tttgaatgag ggtgttcttg aatctttttc tgttacactt 15121 cttgataatc aagaagataa gttttggtgt gaagattttt atgctagtat gtatgaaaat 15181 tctacaatat tgcaagctgc tggtttatgt gttgtttgtg gttcacaaac tgtacttcgt 15241 tgtggtgatt gtctgcgtaa gcctatgttg tgcactaaat gcgcatatga tcatgtattt 15301 ggtaccgacc acaagtttat tttggctata acaccgtatg tatgtaatgc atcaggttgt 15361 ggtgttagtg atgtcaaaaa attgtatctt ggtggtttga attactattg tacaaatcat 15421 aaaccacagt tgtcttttcc attatgttca gctggtaata tatttggttt atataaaaat 15481 tcagcaactg gttccttaga tgttgaagtt tttaataggc ttgcaacgtc tgattggact 15541 gatgttaggg actataaact tgctaatgat gttaaagata cacttagact ctttgcggct 15601 gaaactatta aagctaaaga agagagtgtt aagtcttctt atgcttttgc aactcttaaa 15661 gaggttgttg gacctaaaga attgcttctt agttgggaaa gtggtaaagt taaaccacct 15721 ttgaatcgta attctgtttt cacttgtttt caaataagta aggactcaaa attccaaata 15781 ggtgagttca tctttgagaa ggttgaatat ggttctgata ctgttacgta taagtctact 15841 gtaactacta agttagttcc tggtatgatt tttgtcttaa catctcacaa tgtccaacct 15901 ttacgtgcac caactattgc aaaccaagag aagtattcta gcatttataa attgcaccct 15961 gcttttaatg tcagtgatgc atatgctaat ttggttccat attaccaact tattggtaaa 16021 caaaagataa ctacaataca gggtcctcct ggtagtggta agtcacattg ttccattgga 16081 cttggattgt actacccagg tgcgcgtatt gtttttgttg cttgtgccca tgctgctgtt 16141 gattccttat gtgcaaaagc tatgactgtt tatagcattg ataagtgtac taggattata 16201 cctgcaagag ctcgggttga gtgttatagt ggctttaaac caaataacac tagtgcacaa 16261 tacatattta gcactgttaa cgcattacct gagtgtaatg ctgatatcgt tgttgtagat 16321 gaagtttcaa tgtgtacaaa ttatgacctt tctgttatta accagcgttt atcatataaa 16381 catattgttt atgttggtga tccacaacaa cttcctgcac ctagagtaat gattactaaa 16441 ggtgttatgg agcctgttga ttataacgtt gttactcaac gtatgtgtgc tataggccct 16501 gatgtttttc ttcataaatg ttatagatgt cctgctgaaa tagttaatac agtttctgaa 16561 cttgtttatg agaacaagtt tgtccctgtt aaacctgcta gtaaacagtg ttttaaagtc 16621 ttttttaagg gtaatgtaca ggttgacaat ggttctagta ttaacagaaa gcagcttgaa 16681 atagttaagc tgtttttagt taaaaatcca agttggagta aggctgtgtt tatttctcct 16741 tataatagtc agaattatgt tgctagtaga tttttaggac ttcaaattca aactgttgat 16801 tcttctcaag gtagtgagta tgattatgta atctatgcac aaacttctga cactgcacat 16861 gcttgcaatg taaaccgttt taatgttgct ataacacgtg ctaagaaggg tatattttgt 16921 gtaatgtgtg ataaaacttt gtttgattca cttaagtttt ttgagattaa acatgcagat 16981 ttacactcta gccaggtttg tggcttgttt aaaaattgta cacgcactcc tcttaattta 17041 ccaccaactc atgcacacac tttcttgtcg ttgtcagatc agtttaagac tacaggtgat 17101 ttagctgttc aaataggttc aaataacgtt tgtacttatg aacatgttat atcatttatg 17161 ggttttaggt ttgatattag tattcctggt agtcatagtt tgttttgtac acgtgacttt 17221 gctattcgta atgtgcgtgg ttggttgggt atggatgttg aaagtgctca tgtttgtggc 17281 gataacatag gtactaatgt tcctttacag gttggttttt caaatggtgt taattttgtt 17341 gtgcaaactg aaggttgtgt gtctaccaat tttggtgatg ttattaaacc tgtttgtgca 17401 aaatctccac caggtgaaca atttagacac cttattcctc ttttacgtaa aggacaacct 17461 tggttaattg ttcgtagacg cattgtgcaa atgatatctg attatttgtc caatttgtct 17521 gacattcttg tctttgtttt gtgggcaggt agtttggaat taactacaat gcgttacttt 17581 gtaaaaatag ggccaattaa atattgttat tgtggtaatt ctgccacttg ttataattca 17641 gttagtaatg aatattgttg ttttaaacat gcattgggtt gtgattatgt ttacaatccg 17701 tatgcttttg atatacaaca gtggggttat gttggttcct tgagccaaaa ccaccacaca 17761 ttctgtaaca ttcatagaaa cgagcatgat gcctctggtg atgctgttat gacacgttgt 17821 ttggcagtac atgattgttt tgtcaaaaat gttgattgga ctgtaacgta cccctttatt 17881 gcaaatgaga aatttatcaa tggctgtggg cgtaatgtcc agggacatgt tgttcgtgca 17941 gccttgaaat tgtataaacc tagtgttatt catgacattg gtaatcctaa aggtgtacgt 18001 tgtgctgtta ctgatgccaa atggtactgt tatgacaagc aacctgttaa tagtaatgtc 18061 aagttgttgg attatgatta tgcaacccat ggtcaacttg atggtctttg tttattctgg 18121 aattgtaatg ttgatatgta tccagaattt tcaattgtgt gtcgttttga cacacgtact 18181 cgttctgttt ttaatttaga aggtgttaat ggtggttctc tttatgttaa caaacatgcg 18241 tttcatacac cagcatatga taaacgtgct tttgttaaat taaaacctat gccctttttt 18301 tactttgatg acagtgattg tgatgttgtg caagaacaag ttaattatgt accccttcgc 18361 gctagtagtt gtgttactcg ttgtaatata ggtggtgctg tttgttcaaa acatgcaaat 18421 ttgtatcaaa aatatgttga ggcatataat acatttacac aggcaggttt taacatttgg 18481 gtaccacata gttttgatgt ttataatttg tggcaaattt ttattgaaac taatttacaa 18541 agtcttgaaa atatagcatt taatgttgta aaaaaagggt gttttactgg tgttgatggt 18601 gagttacctg ttgcagttgt taacgacaaa gtttttgttc gctatggcga tgttgacaac 18661 ttggttttta caaataaaac aacattgcct actaatgttg cttttgaatt gtttgcaaaa 18721 cgaaaaatgg gtttaacacc accattgtct attctcaaaa atctcggtgt tgttgctaca 18781 tataaatttg ttttatggga ttatgaagct gaaagacctt ttacctcata tactaagagt 18841 gtatgtaaat acactgattt taatgaggat gtttgtgttt gttttgacaa tagtattcag 18901 ggttcgtatg agcgttttac gcttactacg aacgctgttt tattttctac tgttgtcatt 18961 aaaaatttaa cacctataaa gttgaatttt ggtatgttga atggtatgcc agtttcttct 19021 attaagggtg ataaaggtgt tgaaaaatta gttaattggt acatatatgt tcgtaaaaat 19081 ggtcaatttc aagatcacta tgatggtttt tacactcaag gtaggaattt atcagacttt 19141 acaccaagaa gtgatatgga gtatgatttt cttaacatgg atatgggtgt ttttattaat 19201 aaatatggtc ttgaggattt taattttgaa catgttgtat atggtgatgt ttcaaaaact 19261 acattaggag gtcttcattt gttgatatca cagtttaggc ttagtaaaat gggtgttttg 19321 aaagctgatg attttgtcac tgcttctgac acaactttga ggtgctgtac tgttacttat 19381 cttaatgaac ttagttcaaa agttgtttgt acttatatgg atttgttgtt ggacgacttt 19441 gttactatac taaagagttt agatcttggt gtaatatcta aagttcatga agttattata 19501 gataataaac cttataggtg gatgttgtgg tgtaaagata accacttgtc cactttttat 19561 ccacagttgc agtctgctga atggaagtgt ggttatgcta tgccacaaat ttataagctt 19621 caacgtatgt gtttggaacc ttgtaattta tataattatg gtgctggtat taagttgcct 19681 agtggtataa tgttaaatgt tgttaaatac actcagcttt gtcaatacct aaatagcact 19741 acaatgtgcg tacctcataa tatgcgtgtt ttgcactatg gtgctggttc tgacaaaggt 19801 gtggcacctg gtacaactgt tttaaaacgt tggctaccac ccgatgcaat aatcattgat 19861 aatgatatca atgattatgt tagtgatgca gattttagca ttacaggtga ttgtgctact 19921 gtttatcttg aagataagtt tgacttactt atttctgata tgtatgatgg tagaattaaa 19981 ttttgtgatg gtgaaaatgt ctctaaagat gggtttttta cttatcttaa tggtgttatt 20041 agagaaaaat tagctattgg tggtagtgtt gccattaaga ttacagaata tagttggaat 20101 aagtatcttt atgaattaat acaaagattt gctttttgga ctttgttttg cacgtctgtt 20161 aatacatcct cttcagaagc ttttcttatt ggtattaatt atttaggtga ctttattcaa 20221 ggtcctttta tagctggtaa cactgttcat gctaattata tattttggcg taattctact 20281 attatgtctt tgtcatacaa ttcagtttta gatttaagta agtttgaatg taaacataaa 20341 gccactgttg ttgttacact taaagatagt gatgtaaatg atatggtttt gagtttgatt 20401 aagagtggta ggttgttgtt acgcaataat ggtcgttttg gtggttttag taatcattta 20461 gtctcaacta aatgaaactt ttcttgattt tgcttgtttt gcccctggcc tcttgctttt 20521 tcacatgtaa tagtaatgct aatctctcta tgttacaatt aggtgttcct gacaattctt 20581 caactattgt tacgggttta ttgccaactc attggttttg tgctaatcag agtacatctg 20641 tttactcagc caatggtttc ttttatattg atgttggtaa tcaccgtagt gcttttgcgc 20701 tccatactgg ttattatgat gctaatcagt attatattta tgttactaat gaaataggct 20761 taaatgcttc tgttactctt aagatttgta agtttagtag aaacactact tttgattttt 20821 taagtaatgc ttctagttct tttgactgta tagttaattt gttatttaca gaacagttag 20881 gtgcgccttt gggcataact atatctggtg aaactgtgcg tctgcattta tataatgtaa 20941 ctcgtacttt ttatgtgcca gcagcttata aacttactaa acttagtgtt aaatgttact 21001 ttaactattc ctgtgttttt agtgttgtca acgccaccgt tactgtgaat gtcaccacac 21061 ataatggccg tgtagttaac tacactgttt gtgatgattg taatggttat actgataaca 21121 tattttctgt tcaacaggat ggccgcattc ctaatggttt cccttttaat aattggtttt 21181 tgttaactaa tggttccaca ctagtggacg gggtctctag actttatcaa ccactccgtt 21241 taacttgttt atggcctgta cctggtctta aatcttcaac tggttttgtt tattttaatg 21301 ccactggttc tgatgttaat tgtaacggct atcaacataa ttctgttgtt gatgttatgc 21361 gttacaatct taacttcagt gctaattctt tggacaatct caagagtggt gttatagttt 21421 ttaaaacttt acagtacgat gttttgtttt attgtagtaa ttcttcctca ggtgttcttg 21481 acaccacaat accttttggc ccgtcctctc aaccttatta ctgttttata aacagcacta 21541 tcaacactac tcatgttagc acttttgtgg gtattttacc acccactgtg cgtgaaattg 21601 ttgttgctag aactggccag ttttatatta atggttttaa gtatttcgat ttgggtttca 21661 tagaagctgt caattttaat gtcacgactg ctagcgccac agatttttgg acggttgcat 21721 ttgctacttt tgttgatgtt ttggttaatg ttagtgcaac taacattcaa aacttacttt 21781 attgcgattc tccatttgaa aagttgcagt gtgagcactt gcagtttgga ttgcaggatg 21841 gtttttattc tgcaaatttt cttgatgata atgttttgcc tgagacttat gttgcactcc 21901 ccatttatta tcaacacacg gacataaatt ttactgcaac tgcatctttt ggtggttctt 21961 gttatgtttg taaaccacac caggttaata tatctcttaa tggtaacact tcagtgtgtg 22021 ttagaacatc tcatttttca attaggtata tttataaccg cgttaagagt ggttcaccag 22081 gtgactcttc atggcacatt tatttaaaga gtggcacttg tccattttct ttttctaagt 22141 taaataattt tcaaaagttc aagactattt gtttctcaac cgtcgaagtg cctggtagtt 22201 gtaattttcc gcttgaagcc acctggcatt acacttctta tactattgtt ggtgctttgt 22261 atgttacttg gtctgaaggt aattctatta ctggtgtacc ttatcctgtc tctggtattc 22321 gtgagtttag taatttagtt ttaaataatt gtaccaaata taatatttat gattatgttg 22381 gtactggaat tatacgttct tcaaaccagt cacttgctgg tggtattaca tatgtttcta 22441 actctggtaa tttacttggt tttaaaaatg tttccactgg taacattttt attgtgacac 22501 catgtaacca accagaccaa gtagctgttt atcaacaaag cattattggt gccatgaccg 22561 ctgttaatga gtctagatat ggcttgcaaa acttactaca gttacctaac ttttattatg 22621 ttagtaatgg tggtaacaat tgcactacgg ccgttatgac ttattctaat tttggtattt 22681 gtgctgatgg ttctttgatt cctgttcgtc cgcgtaattc tagtgataat ggtatttcag 22741 ccataatcac tgctaattta tccattcctt ctaactggac tacttcagtt caagttgagt 22801 acctccaaat tactagtact ccaatagttg ttgattgtgc tacttatgtg tgtaatggta 22861 accctcgctg taagaatcta cttaagcagt atacttctgc ttgtaaaact attgaagatg 22921 ccttacgact tagtgctcat ttggaaacta atgatgttag tagtatgcta actttcgata 22981 gcaatgcttt tagtttggct aatgttacta gttttggaga ttataacctt tctagtgttt 23041 tacctcagag aaacattcgt tcaagccgta tagcaggacg tagtgctttg gaagatttgt 23101 tgtttagcaa agttgttaca tctggtttgg gtactgttga tgttgactat aagtcttgta 23161 ctaaaggtct ttctattgct gaccttgctt gtgctcagta ctacaatggc ataatggttt 23221 tgccaggtgt tgctgatgct gaacgtatgg ccatgtacac aggttctctt ataggtggca 23281 tggtgctcgg aggtcttaca tcagcagccg ccataccttt ttctttggca ctgcaagcac 23341 gacttaacta tgttgcttta caaactgatg tgcttcaaga aaatcagaaa attttggctg 23401 catcatttaa taaggctatt aataatattg ttgcttcttt tagtagcgtt aatgatgcta 23461 ttacacaaac tgcagaggct atacatactg ttactattgc acttaataag attcaggatg 23521 ttgttaatca acagggtagt gctcttaacc atctcacttc acaattgaga cataattttc 23581 aggccatttc taattcaatt caggctattt atgaccggct tgattcaatt caagccgatc 23641 aacaagttga cagattaatt actggacggc ttgcagcttt gaatgcattt gtttcccaag 23701 ttttgaataa atatactgaa gttcgtggtt caagacgctt agcacagcag aagattaatg 23761 aatgtgtcaa gtcacaatct aatagatatg gtttttgtgg caatggcact cacatctttt 23821 caatcgtcaa ctcagctcca gatggtttgc tttttcttca tactgttttg ctgccaactg 23881 attacaagaa tgtaaaggcg tggtctggta tctgtgttga tggcatttat ggctatgttc 23941 tgcgtcaacc taacttggtt ctttattctg ataatggtgt ctttcgtgta acttccaggg 24001 tcatgtttca acctcgctta cctgttttgt ctgattttgt gcaaatatat aattgtaatg 24061 ttacttttgt taacatatct cgtgttgagt tacatactgt catacctgac tacgttgatg 24121 ttaataaaac attacaagag tttgcacaaa acttaccaaa gtatgttaag cctaattttg 24181 acttgactcc ttttaattta acatatctta atttgagttc tgagttgaag caactcgaag 24241 ctaaaactgc tagtcttttt caaactactg ttgaattaca aggtcttatt gatcagatta 24301 acagtacata tgttgatttg aagttgctta ataggtttga aaattatatc aaatggcctt 24361 ggtgggtttg gctcattatt tctgttgttt ttgttgtatt gttgagtctt cttgtgtttt 24421 gttgtctttc tacaggttgt tgtggttgtt gcaattgttt aacttcatca atgcgaggct 24481 gttgtgattg tggttcaact aaacttcctt attacgaatt tgaaaaggtc cacgttcaat 24541 aatgcctttt ggtggcctat ttcaacttac tcttgaaagt actattaata agagtgtggc 24601 taatctcaaa ttaccacctc atgatgttac tgtcttgcgt gacaatctta aacctgttac 24661 tacacttagt actattactg cttatttgtt agttagtttg tttgtcactt actttgcttt 24721 attcaaacct cttactgcta gaggtcgtgt tgcttgtttt gttttaaaac tattgacact 24781 atttgtctat gtgcctttat tggttctttt tggtatgtat cttgacagtt ttataatttt 24841 ttctacgctg ttgtttcgat tcatacatgt tggctattat gcctatctct ataaaaattt 24901 ttcatttgtt ttgttcaatg ttactaaact atgcttcgtt tcaggcaagt gttggtatct 24961 tgaacaatca ttttatgaaa atcgttttgc tgctatttat ggtggtgacc actatgtcgt 25021 tttaggtggt gaaactatta cttttgtttc ttttgatgac ctttatgttg ctattagagg 25081 ttcttgtgaa aagaacctac aacttatgcg taaggttgac ttgtataatg gtgctgtcat 25141 ttacattttt gccgaagagc ctgttgttgg tatagtctac tcttctcaac tatacgaaga 25201 tgttccttcg attaattgat gacaatggta ttgtcctcaa ttccatttta tggctccttg 25261 ttatgatatt tttctttgtg ttggcaatga cctttattaa actgattcaa ttgtgtttta 25321 cttgtcatta tttttttagt aggacattat atcaaccagt ttataaaatt tttcttgctt 25381 accaagatta tatgcaaata gcacctgttc cagctgaagt actaaatgtc taaactaaac 25441 gatgtctaat agtagtgtgc ctcttttaga ggtttatgtc catttacgta actggaactt 25501 tagttggaat ttaattctaa cgctttttat agttgtgttg cagtatgggc attataagta 25561 tagcagactt ctttatggtt taaagatgtc tgttttatgg tgtttatggc cacttgttct 25621 agctttgtct atttttgact gttttgtcaa ttttaatgtg gactgggtct tttttggttt 25681 tagtattctt atgtctatta ttacactttg tttatgggtt atgtattttg ttaatagttt 25741 cagactttgg cgccgtgtta aaactttttg ggcttttaat cctgaaacta atgcaatcat 25801 ctctctccag gtttacggac ataattatta cttaccggtg atggctgcac ctacaggtgt 25861 tacattaaca cttcttagtg gtgtacttct tgttgatggc cataagattg ctactcgtgt 25921 tcaagtgggt cagttgccta aatatgtaat agttgctacg cctagtacca caattgtttg 25981 tgaccgtgtt ggtcgctctg ttaatgaaac aagccagact ggttgggcat tctacgtccg 26041 tgctaaacat ggtgattttt ctggtgttgc ctctcaggag ggtgttttgt cagaaagaga 26101 gaagttgctt catttaatct aaactaaaca aaatggctag tgtaaattgg gccgatgaca 26161 gagctgctag gaagaaattt cctcctcctt cattttacat gcctcttttg gttagttctg 26221 ataaggcacc atatagggtc attcccagga atcttgtccc tattggtaag ggtaataaag 26281 atgagcagat tggttattgg aatgttcaag agcgttggcg tatgcgcagg gggcaacgtg 26341 ttgatttgcc tcctaaagtt catttttatt acctaggtac tggacctcat aaggacctta 26401 aattcagaca acgttctgat ggtgttgttt gggttgctaa ggaaggtgct aaaactgtta 26461 ataccagtct tggtaatcgc aaacgtaatc agaaaccttt ggaaccaaag ttctctattg 26521 ctttgcctcc agagctctct gttgttgagt ttgaggatcg ctctaataac tcatctcgtg 26581 ctagcagtcg ttcttcaact cgtaacaact cacgagactc ttctcgtagc acttcaagac 26641 aacagtctcg cactcgttct gattctaacc agtcttcttc agatcttgtt gctgctgtta 26701 ctttggcttt aaagaactta ggttttgata accagtcgaa gtcacctagt tcttctggta 26761 cttccactcc taagaaacct aataagcctc tttctcaacc cagggctgat aagccttctc 26821 agttgaagaa acctcgttgg aagcgtgttc ctaccagaga ggaaaatgtt attcagtgct 26881 ttggtcctcg tgattttaat cacaatatgg gggattcaga tcttgttcag aatggtgttg 26941 atgccaaagg ttttccacag cttgctgaat tgattcctaa tcaggctgcg ttattctttg 27001 atagtgaggt tagcactgat gaagtgggtg ataatgttca gattacctac acctacaaaa 27061 tgcttgtagc taaggataat aagaaccttc ctaagttcat tgagcagatt agtgctttta 27121 ctaaacccag ttctatcaaa gaaatgcagt cacaatcatc tcatgttgct cagaacacag 27181 tacttaatgc ttctattcca gaatctaaac cattggctga tgatgattca gccattatag 27241 aaattgtcaa cgaggttttg cattaaattg ttttgtaatt ccagttgaat gtttattatt 27301 attagttgca accccatgcg tttagcgcat gataagggtt tagtcttaca cacaatggta 27361 ggccagtgat agtaaagtgt aagtaatttg ctatcatatt aacatgtcta gaggaaagtc 27421 agaacttttt ctgtttgtgt tgttggagta cttaaagatc gcataggcgc gccaacaatg 27481 gaagagccaa caacatatct aaaaatgttt tgtctggtac ttgttaatga tattgttttt 27541 gatatggata cac // LOCUS NC_003443 15646 bp ss-RNA linear VRL 13-AUG-2018 DEFINITION Human rubulavirus 2, complete genome. ACCESSION NC_003443 VERSION NC_003443.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq; F gene; fusion protein; haemagglutinin neuraminidase; HN gene; L gene; large protein; M-gene; matrix protein; NP gene; nucleocapsid protein; p gene; parainfluenza virus; phosphoprotein. SOURCE Human rubulavirus 2 ORGANISM Human rubulavirus 2 Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; Mononegavirales; Paramyxoviridae; Rubulavirus. REFERENCE 1 (bases 8743 to 15646) AUTHORS Kawano,M., Okamoto,K., Bando,H., Kondo,K., Tsurudome,M., Komada,H., Nishio,M. and Ito,Y. TITLE Characterizations of the human parainfluenza type 2 virus gene encoding the L protein and the intergenic sequences JOURNAL Nucleic Acids Res. 19 (10), 2739-2746 (1991) PUBMED 1645865 REFERENCE 2 AUTHORS Ohgimoto,S., Bando,H., Kawano,M., Okamoto,K., Kondo,K., Tsurudome,M., Nishio,M. and Ito,Y. TITLE Sequence analysis of P gene of human parainfluenza type 2 virus: P and cysteine-rich proteins are translated by two mRNAs that differ by two nontemplated G residues JOURNAL Virology 177 (1), 116-123 (1990) PUBMED 2162103 REFERENCE 3 (bases 1 to 15646) AUTHORS Kawano,M., Bando,H., Yuasa,T., Kondo,K., Tsurudome,M., Komada,H., Nishio,M. and Ito,Y. TITLE Sequence determination of the hemagglutinin-neuraminidase (HN) gene of human parainfluenza type 2 virus and the construction of a phylogenetic tree for HN proteins of all the paramyxoviruses that are infectious to humans JOURNAL Virology 174 (1), 308-313 (1990) PUBMED 2152995 REFERENCE 4 (bases 1 to 15646) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (16-MAR-2002) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 5 (bases 1 to 15646) AUTHORS Kawano,M. TITLE Direct Submission JOURNAL Submitted (04-MAR-1991) M. Kawano, Dept of Microbiology, Mie University School of Medicine, 2-174 Edobashi, Tsu-Shi Mie Prefecture 514, Japan COMMENT VALIDATED REFSEQ: This record has undergone validation or preliminary review. The reference sequence was derived from X57559. Sequence overlaps with M38200 & M37751. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..15646 /organism="Human rubulavirus 2" /mol_type="genomic RNA" /db_xref="taxon:1979160" gene 71..1919 /gene="NP" /locus_tag="HPIV2gp1" /db_xref="GeneID:935191" mRNA 71..1919 /gene="NP" /locus_tag="HPIV2gp1" /db_xref="GeneID:935191" CDS 157..1785 /gene="NP" /locus_tag="HPIV2gp1" /codon_start=1 /product="nucleocapsid protein" /protein_id="NP_598401.1" /db_xref="UniProtKB/Swiss-Prot:P21737" /db_xref="GeneID:935191" /translation="MSSVLKTFERFTIQQELQEQSDDTPVPLETIKPTIRVFVINNNDP VVRSRLLFFNLRIIMSNTAREGHRAGALLSLLSLPSAAMSNHIKLAMHSPEASIDRVEI TGFENNSFRVIPDARSTMSRGEVLAFEALAEDIPDTLNHQTPFVNNDVEDDIFDETEKF LDVCYSVLMQAWIVTCKCMTAPDQPPVSVAKMAKYQQQGRINARYVLQPEAQRLIQNAI RKSMVVRHFMTYELQLSQSRSLLANRYYAMVGDIGKYIEHSGMGGFFLTLKYGLGTRWP TLALAAFSGELQKLKALMLHYQSLGPMAKYMALLESPKLMDFVPSEYPLDYSYAMGIGT VLDTNMRNYAYGRSYLNQQYFQLGVETARKQQGAVDNRTAEDLGMTAADKADLTATISK LSLSQLPRGRQPISDPFAGANDREMGGQANDTPVYNFNPIDTRRYDNYDSDGEDRIDND QDQAIRENRGEPGQPNNQTSDNQQRFNPPIPQRTSGMSSEEFQHSMNQYIRAMHEQYRG SQDDDANDATDGNDISLELVGDFDS" gene 1924..3365 /gene="V" /locus_tag="HPIV2gp2" /db_xref="GeneID:935190" mRNA 1924..3365 /gene="V" /locus_tag="HPIV2gp2" /db_xref="GeneID:935190" CDS join(1993..2481,2480..3178) /gene="V" /locus_tag="HPIV2gp2" /exception="RNA editing" /codon_start=1 /product="P protein" /protein_id="NP_599019.1" /db_xref="GeneID:935190" /translation="MAEEPTYTTEQVDELIHAGLGTVDFFLSRPIDAQSSLGKGSIPPG VTAVLTSAAETKSKPVAAGPVKPRRKKVISNTTPYTIADNIPPEKLPINTPIPNPLLPL ARPHGKMTDIDIVTGNITEGSYKGVELAKLGKQTLLTRFTSNEPVSSAGSAQDPNFKRG GELIEKEQEATIGENGVLHGSEIRSKSSSGVIPGVPQSRPQLASSPAHADPAPASAENV KEIIELLKGLDLRLQTVEGKVDKILATSATIINLKNEMTSLKASVATMEGMITTIKIMD PSTPTNVPVEEIRKSLHNVPVVIAGPTSGGFTAEQVILISMDELARPTLSSTKRITRKP ESKKDLTGIKLTLMQLANDCISRPDTKTEFVTKIQAATTESQLNEIKRSIIRSAI" CDS 1993..2670 /gene="V" /locus_tag="HPIV2gp2" /codon_start=1 /product="phospho-protein" /protein_id="NP_598402.1" /db_xref="UniProtKB/Swiss-Prot:P23057" /db_xref="GeneID:935190" /translation="MAEEPTYTTEQVDELIHAGLGTVDFFLSRPIDAQSSLGKGSIPPG VTAVLTSAAETKSKPVAAGPVKPRRKKVISNTTPYTIADNIPPEKLPINTPIPNPLLPL ARPHGKMTDIDIVTGNITEGSYKGVELAKLGKQTLLTRFTSNEPVSSAGSAQDPNFKRG GANRERARGNHRREWSIAWVGDQVKVFEWCNPRCAPVTASARKFTCTCGSCPSICGECE GDH" gene 3411..4742 /gene="M" /locus_tag="HPIV2gp3" /db_xref="GeneID:935187" mRNA 3411..4742 /gene="M" /locus_tag="HPIV2gp3" /db_xref="GeneID:935187" CDS 3479..4612 /gene="M" /locus_tag="HPIV2gp3" /codon_start=1 /product="matrix protein" /protein_id="NP_598403.1" /db_xref="UniProtKB/Swiss-Prot:P24266" /db_xref="GeneID:935187" /translation="MPIISLPADPTSPSQSLTPFPIQLDTKDGKAGKLLKQIRIRYLNE PNSRHTPITFINTYGFVYARDTSGGIHSEISSDLAAGSITACMMKLGPGPNIQNANLVL RSLNEFYVKVKKTSSQREEAVFELVNIPTLLREHALCKRKMLVCSAEKFLKNPSKLQAG FEYVYIPTFVSITYSPRNLNYQVARPILKFRSRFVYSIHLELILRLLCKSDSPLMKSYN ADRTGRGCLASVWIHVCNILKNKSIKQQGRESYFIAKCMSMQLQVSIADLWGPTIIIKS LGHIPKTALPFFSKDGIACHPLQDVSPNLAKSLWSVGCEIESAKLILQESDLNELMGHQ DLITDKIAIRSGQRTFERSKFSPFKKYASIPNLEAIN" gene 4771..6630 /gene="F" /locus_tag="HPIV2gp4" /db_xref="GeneID:935186" mRNA 4771..6630 /gene="F" /locus_tag="HPIV2gp4" /db_xref="GeneID:935186" CDS 4789..6444 /gene="F" /locus_tag="HPIV2gp4" /codon_start=1 /product="fusion protein" /protein_id="NP_598404.1" /db_xref="UniProtKB/Swiss-Prot:P26629" /db_xref="GeneID:935186" /translation="MHHLHPMIVCIFVMYTGIVGSDAIAGDQLLNIGVIQSKIRSLMYY TDGGASFIVVKLLPNLPPSNGTCNITSLDAYNVTLFKLLTPLIENLSKISTVTDTKTRQ KRFAGVVVGLAALGVATAAQITAAVAIVKANANAAAINNLASSIQSTNKAVSDVIDASR TIATAVQAIQDRINGAIVNGITSASCRAHDALIGSILNLYLTELTTIFHNQITNPALTP LSIQALRILLGSTLPIVIESKLNTNFNTAELLSSGLLTGQIISISPMYMQMLIQINVPT FIMQPGAKVIDLIAISANHKLQEVVVQVPNRILEYANELQNYPANDCVVTPNSVFCRYN EGSPIPESQYQCLRGNLNSCTFTPIIGNFLKRFAFANGVLYANCKSLLCRCADPPHVVS QDDTQGISIIDIKRCSEMMLDTFSFRITSTFNATYVTDFSMINANIVHLSPLDLSNQIN SINKSLKSAEDWIADSNFFANQARTAKTLYSLSAIALILSVITLVVVGLLIAYIIKLVS QIHQFRSLAATTMFHRENPAFFSKNNHGNIYGIS" gene 6639..8742 /gene="HN" /locus_tag="HPIV2gp5" /db_xref="GeneID:935188" mRNA 6639..8742 /gene="HN" /locus_tag="HPIV2gp5" /db_xref="GeneID:935188" CDS 6817..8532 /gene="HN" /locus_tag="HPIV2gp5" /codon_start=1 /product="hemagglutinin-neuraminidase" /protein_id="NP_598405.1" /db_xref="UniProtKB/Swiss-Prot:P25466" /db_xref="GeneID:935188" /translation="MEDYSNLSLKSIPKRTCRIIFRTATILGICTLIVLCSSILHEIIH LDVSSGLMDSDDSQQGIIQPIIESLKSLIALANQILYNVAIIIPLKIDSIETVIFSALK DMHTGSMSNTNCTPGNLLLHDAAYINGINKFLVLKSYNGTPKYGPLLNIPSFIPSATSP NGCTRIPSFSLIKTHWCYTHNVMLGDCLDFTTSNQYLAMGIIQQSAAAFPIFRTMKTIY LSDGINRKSCSVTAIPGGCVLYCYVATRSEKEDYATTDLAELRLAFYYYNDTFIERVIS LPNTTGQWATINPAVGSGIYHLGFILFPVYGGLISGTPSYNKQSSRYFIPKHPNITCAG NSSEQAAAARSSYVIRYHSNRLIQSAVLICPLSDMHTARCNLVMFNNSQVMMGAEGRLY VIDNNLYYYQRSSSWWSASLFYRINTDFSKGIPPIIEAQWVPSYQVPRPGVMPCNATSF CPANCITGVYADVWPLNDPEPTSQNALNPNYRFAGAFLRNESNRTNPTFYTASASALLN TTGFNNTNHKAAYTSSTCFKNTGTQKIYCLIIIEMGSSLLGEFQIIPFLRELIP" gene 8785..15625 /gene="L" /locus_tag="HPIV2gp6" /db_xref="GeneID:935189" mRNA 8785..15625 /gene="L" /locus_tag="HPIV2gp6" /db_xref="GeneID:935189" CDS 8793..15581 /gene="L" /locus_tag="HPIV2gp6" /codon_start=1 /product="Large protein" /protein_id="NP_598406.1" /db_xref="UniProtKB/Swiss-Prot:P26676" /db_xref="GeneID:935189" /translation="MAASSEILLPEVHLNSPIVKHKLIYYLLLGHFPHDLDISEISPLH NNDWDQIAREESNLAERLGVAKSELIKRVPAFRATRWRSHAAVLIWPSCIPFLVKFLPH SKLQPVEQWYKLINASCNTISDSIDRCMENISIKLTGKNNLFSRSRGTAGAGKNSKITL NDIQSIWESNKWQPNVSLWLTIKYQMRQLIMHQSSRQPTDLVHIVDTRSGLIVITPELV ICFDRLNSVLMYFTFEMTLMVSDMFEGRMNVTALCTISHYLSPLGPRIDRLFSIVDELA QLLGDTVYKVIASLESLVYGCLQLKDPVVELAGSFHSFITQEIIDILIGSKALDKDESI TVTTQLLDIFSNLSPDLIAEMLCLMRLWGHPTLTAAQVGKVRESMCAGKLLDFPTIMKT LAFFHTILINGYRRKKNGMWPPLILPKNASKSLIEFQHDNAEISYEYTLKHWKEISLIE FRKCFDFDPGEELSIFMKDKAISAPRSDWMSVFRRSLIKQRHQRHHIPMPNPFNRRLLL NFLEDDSFDPVAELRYVTGGEYLQDDTFCASYSLKEKEIKPDGRIFAKLTNRMRSCQVI AEAILANHAGTLMKENGVVLNQLSLTKSLLTMSQIGIISEKAKRYTRDNISSQGFHTIK TDSKNKRKSKTASSYLTDPDDTFELSACFITTDLAKYCLQWRYQTIIHFARTLNRMYGV PHLFEWIHLRLIRSTLYVGDPFNPPAATDAFDLDKVLNGDIFIVSKGGIEGLCQKMWTM ISISVIILSSAESKTRVMSMVQGDNQAIAVTTRVPRSLPSIQKKELAYAASKLFFERLR ANNYGLGHQLKAQETIISSTFFIYSKRVFYQGRILTQALKNASKLCLTADVLGECTQAS CSNSATTIMRLTENGVEKDTCYKLNIYQSIRQLTYDLIFPQYSIPGETISEIFLQHPRL ISRIVLLPSQLGGLNYLACSRLFNRNIGDPLGTAVADLKRLIKCGALESWILYNLLARK PGKGSWATLAADPYSLNQEYLYPPTTILKRHTQNTLMEICRNPMLKGVFTDNAKEEENL LAKFLLDRDIVLPRVAHIIIDQSSIGRKKQIQGFFDTTRTIMRRSFEIKPLSTKKTLSV IEYNTNYLSYNYPVILNPLPIPGYLNYITDQTCSIDISRSLRKLSWSSLLNGRTLEGLE TPDPIEVVNGFLIVGTGDCDFCMQGDDKFTWFFLPMGIIIDGNPETNPPIRVPYIGSRT EERRVASMAYIKGATHSLKAALRGAGVYIWAFGDTVVNWNDALDIANTRVKISLEQLQT LTPLPTSANITHRLDDGATTLKFTPASSYAFSSYTHISNDQQYLEIDQRVVDSNIIYQQ LMITGLGIIETYHNPPIRTSTQEITLHLHTSSSCCVRSVDGCLICESNGEVPQITVPYT NTFVYDPDPLADYEIAHLDYLSYQAKIGSTDYYSLTDKIDLLAHLTAKQMINSIIGLDE TVSIVNDAVILSDYTNNWISECSYTKIDLVFKLMAWNFLLELAFQMYYLRISSWTNIFD YTYMTLRRIPGTALNNIAATISHPKLLRRAMNLDIITPIHAPYLASLDYVKLSIDAIQW GVKQVLADLSNGIDLEILILSEDSMEISDRAMNLIARKLTLLALVKGENYTFPKIKGMP PEEKCLVLTEYLAMCYQNTHHLDPDLQKYLYNLTNPKLTAFPSNNFYLTRKILNQIRES DEGQYIITSYYESFEQLETDIILHSTLTAPYDNSENSNKVRFIPFDIFPHPESLEKYPL PVDHDSQSAISTLIPGPPSHHVLRPLGVSSTAWYKGISYCRYLETQKIQTGDHLYLAEG SGASMSLLELLFPGDTVYYNSLFSSGENPPQRNYAPLPTQFVQSVPYKLWQADLADDSN LIKDFVPLWNGNGAVTDLSTKDAVAFIIHKVGAEKASLVHIDLESTANINQQTLSRSQI HSLIIATTVLKRGGILIYKTSWLPFSRFSQLAGLLWCFFDRIHLIRSSYSDPHSHEVYL VCRLAADFRTIGFSAALVTATTLHNDGFTTIHPDVVCSYWQHHLENVGRVGKVIDEILD GLATNFFAGDNGLILRCGGTPSSRKWLEIDQLASFDLVQDALVTLITIHLKEIIEVQSS HTEDYTSLLFTPYNIGAAGKVRTIIKLILERSLMYTVRNWLVLPSSIRDSVRQDLELGS FRLMSILSEQTFLKKTPTKKYLLDQLTRTYISTFFNSHSVLPLHRPYQKQIWKALGSVI YCSETVDIPLIKDIQIEDINDFEDIERGIDGEEL" ORIGIN 1 accaagggga gaatcagatg gcatcgttat atgacgaatt gcaaaaagat tacgtaggtc 61 cggaaccact agattcggtg ccggtaacga ttccagtttt atactatctg atcattctct 121 atctctatta aggatatttc tagtctaaag ttcaaaatgt caagtgtttt aaagacattt 181 gaaagattta ctatacaaca ggagcttcag gagcaatctg atgacactcc agtacctctt 241 gagacaatca aacctacaat cagggtattt gtcatcaata ataatgatcc tgtcgtaaga 301 tctagacttt tattctttaa tctacgaatc attatgagta acactgcaag agagggacat 361 agagctggtg ctctcctcag tcttttatca ctaccttctg cagctatgag taatcacatc 421 aaattagcca tgcattcacc agaagccagc atagatagag tagagataac agggtttgag 481 aataattcat tccgagtcat tccagatgct cgatcaacta tgtccagagg agaggtgctg 541 gcttttgaag cattagctga ggacattcct gataccctta atcaccaaac tccatttgta 601 aataatgatg tagaagatga catatttgat gaaacagaga aattcttaga tgtttgctac 661 agtgtgctta tgcaggcatg gatagtaaca tgcaagtgta tgactgctcc tgatcaacca 721 ccagtatcag tagcaaagat ggctaaatat caacaacaag ggagaatcaa tgctaggtat 781 gtactacaac ctgaagcaca aagactaatt cagaatgcca tccgcaagtc aatggtagta 841 aggcatttca tgacttatga gcttcaactt tcacaatcaa gatctttgct agcaaaccgc 901 tactatgcta tggtgggaga cattggcaag tacattgaac acagcggaat gggaggattt 961 ttcttaacac ttaaatatgg acttggaaca agatggccta cattggctct tgcagcattt 1021 tctggggaac tccagaaatt aaaagctctc atgctacatt atcagagtct aggacccatg 1081 gccaagtaca tggctctatt agaatcacca aaactgatgg attttgtccc atctgaatat 1141 ccattagatt atagctatgc aatgggtatt ggaactgtcc ttgatacaaa tatgagaaat 1201 tatgcatacg gtagatcata tttaaatcag caatattttc agctaggagt agaaacagca 1261 aggaaacagc agggagctgt tgacaacagg acagcagagg acctcggcat gactgctgca 1321 gacaaagcag acctcactgc aaccatatca aagctatcct tgtcccaatt acctaggggt 1381 agacaaccaa tatctgaccc atttgctgga gcaaatgaca gagaaatggg aggacaagca 1441 aatgatacac ctgtgtataa cttcaatcca atcgatactc ggaggtatga caactatgac 1501 agtgatggtg aggacagaat tgacaacgat caagatcaag ctatcagaga gaatagagga 1561 gagcctggac aaccaaacaa ccagacaagt gacaaccagc agagattcaa cccccccata 1621 ccgcaaagaa catcaggtat gagcagtgaa gagttccaac attcaatgaa tcagtacatc 1681 cgtgctatgc atgagcaata cagaggctcc caggatgatg atgccaatga tgccacagat 1741 gggaatgaca tttctcttga gctagttgga gattttgatt cctaactctc aatgtcatac 1801 aaccagatat acacatccac atcactcaga gatacagctg ccactcacac actcatccag 1861 acaaatcaaa ctagactcac atcattcgga aacaattctc tcataattta aagaaaaaat 1921 cataggccgg acgggttaga aatccggtgc ttgttcgtga tcagataacc tccacaccag 1981 aatcatacaa tcatggccga ggaaccaaca tacaccactg agcaagttga tgaattaatc 2041 catgctggac tgggaacagt agatttcttc ctatctagac ccatagatgc tcagtcttct 2101 ttaggcaaag gcagcatccc accaggtgtc acagctgttc taactagtgc agcggagaca 2161 aaatccaaac cagttgctgc tggtccagtt aaacccaggc ggaagaaagt gatcagcaat 2221 actactccat acactattgc agacaatatt ccacctgaga agctaccgat caacactcca 2281 atacccaatc cattacttcc actggcacgc cctcacggaa agatgacaga cattgacatt 2341 gtcactggga acattacaga aggatcgtac aaaggtgtgg agcttgctaa attagggaag 2401 cagacactac tcacaaggtt cacctcgaat gagccagtct cctcagctgg atccgcccaa 2461 gaccccaact ttaagagggg gggagctaat agagaaagag caagaggcaa ccataggaga 2521 gaatggagta ttgcatgggt cggagatcag gtcaaagtct tcgagtggtg taatcccagg 2581 tgtgccccag tcacggcctc agctcgcaag ttcacctgca catgcggatc ctgccccagc 2641 atctgcggag aatgtgaagg agatcattga gctcttaaag ggacttgatc ttcgccttca 2701 gactgtagaa gggaaagtag ataaaattct tgcaacttct gcaactataa tcaatcttaa 2761 aaatgaaatg actagtctca aggcgagtgt tgcaactatg gaaggtatga taacaacaat 2821 taaaatcatg gatcccagta caccaactaa tgtccctgta gaggagatca gaaagagttt 2881 acacaatgtt ccagtagtaa ttgccggtcc aactagtgga ggcttcacag ccgaacaggt 2941 gatattgatt tcaatggatg aactagctag acctacactc tcatcaacaa aaaggatcac 3001 acgaaagcct gaatccaaga aagatttaac aggcataaaa ctaactttga tgcagcttgc 3061 aaatgactgc atctcgcgtc cagataccaa gactgagttc gtgactaaga ttcaggcagc 3121 aaccacagaa tcacagctta acgaaattaa acggtcaata atacgctctg caatataaaa 3181 tgaggtgcag tcacacaaga gacactcaac atgcatccaa tcaagatcca gactccatcc 3241 atccaaaaac acgcccacaa ttgtcaacac caagaaacaa ccacagccga accatgctca 3301 accaaaagac ccaaacaaca cctcacatca atagaaggct ggacatgata aatttaataa 3361 aaaaagaaaa gaagttaagt aaaatttaaa ggacacaata gagaaaatct aggtccgaaa 3421 gcttgcctct cagacagatc ccaaaatcat agtccaaacc ccaaacacag cagcagacat 3481 gcctataata tcattaccag cagatccaac ttcacccagt caatccctta ctccgtttcc 3541 aatacaactt gacaccaaag atggcaaggc agggaaactc cttaaacaga ttcgaattag 3601 gtatctaaat gagcctaatt ctcgccatac accaataact ttcatcaata cgtatggatt 3661 tgtttatgct cgagacactt cagggggcat tcacagtgag atcagcagtg acctagctgc 3721 agggtccata acagcatgca tgatgaagct aggacctggt ccaaatattc agaatgcaaa 3781 tctagtgcta agatctctga atgaattcta cgtaaaagtc aagaagacat caagccagag 3841 agaggaagca gtgtttgaat tagttaacat tccaacttta ttgagagaac atgctctttg 3901 caaacgcaaa atgttagtat gctctgcaga aaaattcctc aagaacccgt caaagctaca 3961 agctggattt gagtatgtat acataccaac ttttgtctcc attacatact caccacgaaa 4021 tctgaattac caagtggcca gacctatcct taagttcaga tcacgctttg tgtatagcat 4081 tcatttggaa ttaatcctga gattgctatg caaatctgac tcccccttga tgaaatccta 4141 caatgcagac agaacaggtc ggggatgcct cgcatcagtc tggatccatg tatgtaacat 4201 tctgaaaaac aaaagcatca agcaacaagg cagagaatca tatttcatag ctaagtgcat 4261 gagcatgcag ctgcaggtgt ccattgcaga tctttgggga ccaacaatca taatcaaatc 4321 attgggtcac atccccaaga ctgcacttcc ttttttcagc aaagatggga ttgcctgtca 4381 tccattacaa gatgtttccc ctaatctagc aaaatcactg tggtcagttg gatgtgagat 4441 agaatctgcc aagttgatac ttcaagaatc tgatcttaat gagctaatgg gccaccagga 4501 ccttatcact gataagattg ccattagatc aggtcaacgg acatttgaga ggtccaaatt 4561 cagcccattc aaaaaatatg catcaattcc aaacttggaa gccatcaact gaatgctcca 4621 gcatctgaga atagaaccac aatcaagtca tactactagt cactatacaa taatcaacaa 4681 ttttagtcaa ctgattacca agatgttatc ataggtccga actgatcaat ctaacaaaaa 4741 aactaaacgt tccacaataa atcaacgttc aggccaaaat attcagccat gcatcacctg 4801 catccaatga tagtatgcat ctttgttatg tacactggaa ttgtaggttc agatgccatt 4861 gctggagatc aactacttaa tataggggtc attcaatcaa agataagatc actcatgtac 4921 tatactgatg gtggtgctag ctttattgtt gtaaaattgc tacctaatct tcccccaagc 4981 aatggaacat gcaacatcac cagtctagat gcatataatg ttaccctatt taagttacta 5041 acacccctga ttgagaacct gagtaaaatt tccactgtta cagataccaa aacccgccaa 5101 aaacgatttg caggagtagt tgttggactt gctgcattag gagtagccac agccgcacaa 5161 ataactgcag ctgtagcaat agtgaaagct aatgcaaatg ctgctgcgat aaacaatctt 5221 gcatcttcaa ttcaatccac caacaaggca gtatccgatg tgatagatgc atcaagaaca 5281 attgcaaccg cagttcaagc aattcaggat cgcatcaatg gagctattgt taatgggata 5341 acatctgcat catgccgtgc ccatgatgca ctcattgggt caatattaaa tctttatctc 5401 actgagctta ccacaatatt tcataatcaa ataacaaacc ctgcgctgac accactctcc 5461 atccaagctt taagaatcct cctcggtagc accttgccaa ttgtcattga gtccaaactc 5521 aacacaaact tcaacacagc agagctgctc agttccggac tgttaactgg tcaaataatt 5581 tccatttccc caatgtacat gcaaatgcta attcaaatca atgttccgac atttataatg 5641 caacccggtg cgaaggtaat tgatctaatt gctatctccg caaaccataa attgcaagaa 5701 gtggttgtac aagttccgaa taggattcta gagtatgcaa atgaactaca aaattaccca 5761 gccaatgact gtgtcgtgac accgaactct gtattttgta gatacaatga gggttcccct 5821 atccctgaat cacaatatca atgcttgagg gggaatctta attcttgcac ttttacccct 5881 attatcggga actttcttaa gcgattcgca tttgctaatg gtgtgctcta tgccaactgc 5941 aaatctttgc tatgtaggtg tgccgacccc ccccatgttg tatcccagga tgatacccaa 6001 ggcatcagca taattgatat taagagatgc tctgagatga tgcttgacac tttttcattt 6061 aggatcacat ctactttcaa tgctacgtac gtgacagact tctcaatgat taatgcaaat 6121 attgtacatc taagtcctct agatttgtca aatcaaatca attcaataaa caaatctctt 6181 aaaagtgctg aggattggat tgcagatagc aacttctttg ctaatcaagc caggacagcc 6241 aagacacttt attcactaag tgcaatagca ttaatactat cagtgattac tttggttgtc 6301 gtgggattgc tgattgccta catcatcaag ctggtttctc aaatccatca attcagatcg 6361 ctagctgcta caacaatgtt ccacagggaa aatcctgcct tcttttccaa gaataaccat 6421 ggaaacatat atgggatatc ttaagaaatc tatcacaagt ctatatatgt ccacaattga 6481 cccttaagaa ccaacttcca acgattatcc gttaaattta agtataatag tttaaaaatt 6541 aacattaagc ctccagatac caatgaatat gaatatatct cttagaaaac ctgattatta 6601 tgtgatagcg tagtacaatt taagaaaaaa cctaaaataa gcacgaaccc ttaaggtgtc 6661 gtaacgtctc gtgacaccgg gttcagttca aatatcgacc tctaacccaa tttaacaccc 6721 attcttatat aagaacacag tataatttaa tcacaaaaga cctcaaaaac tgacacagct 6781 tgatccactc aacatataat tgtaagatta ataataatgg aagattacag caatctatct 6841 cttaaatcaa ttcctaaaag gacatgtaga atcattttcc gaactgccac aattcttgga 6901 atatgcacat tgattgttct atgttcaagt attcttcatg agataattca tcttgatgtt 6961 tcctctggtc tcatggattc cgatgattca cagcaaggca ttattcagcc tattatagaa 7021 tcattaaaat cattaattgc tttggctaac cagattctgt acaatgttgc aataataatt 7081 cctcttaaaa ttgacagtat cgagactgta atattctctg ctttaaagga tatgcatact 7141 gggagcatgt ccaacaccaa ctgtacaccc ggaaatctgc ttctgcatga tgcagcgtac 7201 atcaatggaa taaacaaatt ccttgtactt aaatcataca atgggacgcc taaatatgga 7261 cctctcctaa atattcccag ctttatcccc tcagcaacat ctcccaacgg gtgcactaga 7321 ataccatcat tttcactcat taagacccat tggtgttaca ctcacaatgt aatgcttgga 7381 gattgcctcg atttcacgac atctaatcag tatttagcaa tggggataat acaacaatct 7441 gctgcagcat ttccaatctt caggactatg aaaaccattt acctaagtga tggaatcaat 7501 cgcaaaagct gttcagtcac tgctatacca ggaggttgtg tcttgtattg ctatgtagct 7561 acaagatctg agaaagaaga ttatgccaca actgatctag ctgaactgag acttgctttc 7621 tattattata atgatacctt tattgaaaga gtcatatctc ttccaaatac aacagggcaa 7681 tgggccacaa tcaatcctgc agttggaagc gggatctatc atctaggctt tatcttattt 7741 cctgtatatg gtggtctcat aagtgggact ccttcctaca acaagcagtc ctcacgctat 7801 tttatcccaa aacatcccaa cataacctgt gccggtaact ccagcgaaca ggctgcagca 7861 gcacggagtt cctatgtaat ccgttatcac tcaaacaggt tgattcagag tgctgttctt 7921 atttgcccat tgtctgacat gcacacagca aggtgtaatc tagttatgtt taacaattct 7981 caagtcatga tgggtgcaga aggtaggctc tatgttattg acaataattt gtattattat 8041 caacgtagtt cctcttggtg gtctgcatcg cttttttaca ggatcaatac agatttttct 8101 aaaggaattc ctcctatcat tgaggctcaa tgggtaccgt cctatcaagt tccccgtcct 8161 ggagtcatgc catgcaatgc aacaagtttt tgccctgcta attgcatcac aggggtgtac 8221 gcagatgtgt ggccgcttaa cgatccagaa cccacatcac aaaatgctct gaatcccaac 8281 tatcgatttg ctggagcctt tctcagaaat gagtccaacc gaaccaatcc cacattctac 8341 actgcatcag ccagcgccct actaaatact accggattca acaacaccaa tcacaaagca 8401 gcatatacgt cttcaacctg ctttaagaat actggaactc aaaagattta ttgtttgata 8461 ataattgaaa tgggctcatc tcttttaggg gagttccaaa taataccatt tctaagggaa 8521 ctaatacctt aatactattg aatgaagact ccagattcaa taataattga aaggctctct 8581 atcttatgca atagttatac gttttggctg tattagaatg ttatagattc tgctgttttt 8641 cccatatgaa gcaatccttc aacaccgact taggttcaat tttctcatca tttactgttg 8701 taattcaatc ttactaaagt tattccgata tttaagaaaa aataaccttt atataatgta 8761 acaatactat taagattatg atataggcca gaatggcggc ctcttctgag atactccttc 8821 ctgaagtcca cttgaactca ccaatagtca aacacaaact catatactac ttattactag 8881 ggcacttccc gcatgatctt gacatttctg aaataagccc ccttcacaat aatgattggg 8941 atcaaattgc cagagaagaa tccaatcttg ctgaacgact tggagtagct aaatctgaat 9001 taattaaacg tgtgcccgca tttagagcaa ctagatggcg tagtcatgca gccgtcctta 9061 tatggccttc ttgtatacca tttcttgtta aattcctacc tcattctaag cttcaaccag 9121 tagaacaatg gtacaagttg atcaatgctt catgtaatac tatatctgac tcaattgata 9181 gatgtatgga gaatatttct attaagctta ctgggaaaaa caatctattc tctcgatcca 9241 gaggaactgc aggtgcaggt aaaaacagta aaatcaccct caatgatatc caatctattt 9301 gggaatcaaa caagtggcaa cctaatgtat ctttatggct tacaattaaa taccaaatgc 9361 gacaacttat aatgcatcaa agttctcgtc agccgactga tttagttcac attgttgaca 9421 cacgatctgg tctaatagtt atcacccctg aacttgttat ttgttttgat cggttaaata 9481 gtgttttaat gtattttaca tttgagatga ctttaatggt aagtgacatg tttgagggaa 9541 ggatgaatgt caccgctctc tgcactatta gtcattactt atctccacta gggccaagga 9601 tagatagatt gttttccatt gtagatgaat tagcacaact attaggtgac actgtatata 9661 aagttattgc atctcttgaa tctttagtat atgggtgtct acaacttaaa gatccagtag 9721 tggaattagc agggtcattt cattccttta ttacacaaga gattatagat atcctaattg 9781 gttcaaaagc ccttgataag gatgaatcaa taactgttac tacacaattg ttagatatat 9841 tttccaacct ttctccagat ttaattgctg agatgttgtg tctcatgaga ctttggggtc 9901 atcccactct tactgctgcg caagtgggta aagtgagaga atctatgtgt gcaggtaagt 9961 tacttgattt ccctacaata atgaaaactc ttgctttttt ccacacaatt ttaattaatg 10021 gttaccgtag aaagaaaaat ggaatgtggc ctccacttat acttcctaaa aatgcatcaa 10081 aaagcttaat agaatttcaa catgataatg ctgaaatatc ttacgaatat acactcaagc 10141 attggaaaga gatatctctc atagaattta gaaagtgctt tgactttgat cctggtgagg 10201 agctaagcat ttttatgaaa gacaaggcaa taagtgctcc aagaagtgat tggatgagtg 10261 tatttcgtag aagtctaata aaacaacgac atcagagaca tcatattcct atgcccaatc 10321 catttaatag acgtctatta ctcaatttct tagaagatga cagttttgat ccagttgccg 10381 agcttcgata tgttaccggt ggtgaatatc tccaagatga cacattttgt gcatcttact 10441 cattaaaaga gaaagaaata aaaccagatg gaaggatatt tgctaagctt actaatagaa 10501 tgcggtcctg tcaagtaatt gcggaagcaa ttctcgcaaa tcatgcaggt actctaatga 10561 aggaaaacgg agttgtcttg aatcaattat cactgactaa atcattgctt actatgagtc 10621 aaattggcat aatatcagaa aaggcgaaga gatatacgcg agataacatc tcatcccaag 10681 gtttccatac aatcaagact gattctaaaa ataagaggaa aagcaaaact gcatcatcat 10741 acctcacaga tcctgatgat acatttgaac ttagtgcatg ttttataact actgatcttg 10801 ctaaatactg tcttcaatgg agatatcaga ccataatcca ttttgctcga acattaaaca 10861 gaatgtatgg agttccacat ttatttgaat ggattcatct tcgtttaatt agatctacat 10921 tatatgttgg tgatccattc aatcctcctg ccgcaactga tgctttcgat ctagataaag 10981 tattaaatgg tgatatcttt atagtctcca agggaggtat tgaaggccta tgtcagaaaa 11041 tgtggacaat gatctctatt tctgtgatca tcctctcttc agccgaatcc aaaacaagag 11101 taatgagcat ggttcaagga gataatcagg cgattgcagt tacaacaaga gttcctagat 11161 cattacctag tattcagaaa aaggagttag cctatgcagc aagcaagtta ttttttgaaa 11221 gacttagggc aaataattat gggttgggtc atcagctaaa ggctcaagaa actataataa 11281 gttccacgtt cttcatatat agtaaacggg tattttatca aggacgtata ctaacacagg 11341 cactcaaaaa tgctagcaag ttatgtctta ctgcagatgt attaggtgaa tgtactcaag 11401 cttcctgttc aaattctgct actaccatca tgagattaac agaaaatggg gttgagaaag 11461 atacatgtta taagcttaat atttatcagt ccattcgtca actcacatat gatctaatat 11521 ttccccaata ctccatacca ggtgaaacta taagtgagat tttcctacag catccaagac 11581 taatctcacg tattgttctg ctcccttcac agctaggtgg tcttaattac ctcgcatgta 11641 gcagattatt taaccgcaat atcggagatc ctcttggtac agctgtggca gatctcaaga 11701 ggttaattaa atgtggtgct cttgaatcat ggatactgta taatttacta gcaagaaaac 11761 cagggaaagg ttcatgggca actttagcag ccgatccata ctcattgaat caagaatatc 11821 tttatcctcc tactactata cttaaaagac atactcaaaa tactttaatg gagatatgtc 11881 ggaatcctat gttaaaggga gtttttacag ataatgcaaa agaggaggaa aatctccttg 11941 caaaatttct tcttgatcgt gatatagtat tgccaagagt tgcacacatt ataatagatc 12001 aatctagcat cggaaggaag aaacagatac aaggattttt tgacaccaca aggaccataa 12061 tgagacgatc atttgaaatc aaaccactct caactaagaa gactctttca gtcatagaat 12121 ataatactaa ttacttatct tataactacc ctgtcatact taatccttta cctattcctg 12181 gatatttaaa ttatattact gaccaaactt gcagtattga tatatctaga agtttaagaa 12241 aattatcatg gtcttcttta ttgaatggaa gaactttaga aggattagaa actccagatc 12301 caattgaagt tgtcaatggt ttcttgattg taggtacagg agattgtgat ttttgtatgc 12361 agggtgacga caaatttact tggttctttt tacctatggg gataattatt gatggaaatc 12421 ctgaaactaa tccacccatc agagttccat acattgggtc tagaacagag gaaagaagag 12481 ttgcatcaat ggcatatatt aaaggtgcca cacacagttt gaaggctgct cttagaggcg 12541 caggggtata tatttgggca ttcggggata ctgtagtgaa ctggaatgat gcacttgata 12601 tcgcaaatac tagggttaag atatccctag agcaacttca gacccttaca cctcttccta 12661 catctgcaaa cattacacac cgtttagatg atggagccac aacacttaaa ttcactccag 12721 ctagttccta tgcattttct agttatactc atatatcaaa tgatcaacaa tatttagaaa 12781 tagatcagag agtagtcgat tctaatatta tttatcaaca attaatgata acaggacttg 12841 ggattattga gacctaccat aacccaccta taaggacttc tacacaagaa atcactctcc 12901 atttgcacac tagctcatct tgttgtgtta gaagtgtaga tggttgcctt atatgtgaga 12961 gcaatggaga ggttcctcag atcactgttc cctatactaa tacatttgta tatgatcctg 13021 atccactagc agattatgag attgcacacc tagattatct ctcctaccaa gctaaaattg 13081 gaagtacaga ttactactca ctcactgata aaattgacct attagcacat ttaactgcaa 13141 aacaaatgat aaactcaata attgggttag atgaaacagt atcaattgtc aatgatgcgg 13201 ttatcctatc tgactatact aataactgga ttagtgaatg ttcttatact aagatagatt 13261 tagtttttaa attaatggca tggaatttcc ttcttgagct tgcattccag atgtactact 13321 taaggatatc atcttggaca aatatatttg actatactta tatgactttg cgcaggatac 13381 ccggaactgc tctaaataat attgcagcta ctattagcca tccaaaatta ttaagacgtg 13441 caatgaatct tgatattatc actcctatac atgcaccgta tttagcttca ttagattatg 13501 tcaaattaag tattgatgca attcagtggg gagttaaaca agttcttgct gatttatcaa 13561 atggaattga tcttgaaatc ttgattcttt cagaggattc aatggaaatt agtgataggg 13621 caatgaatct cattgctaga aaactaactc tccttgcact tgttaaaggt gagaactata 13681 cttttccaaa aattaaaggg atgccaccag aagaaaagtg tttagtctta actgaatatc 13741 tagcaatgtg ttatcaaaat actcatcact tagatccaga tcttcaaaag tatttatata 13801 atctaactaa tccaaaattg actgcatttc ccagtaacaa cttctactta actagaaaaa 13861 tccttaatca aattagagaa tcagacgaag gacaatatat tatcacctca tattatgaat 13921 ccttcgaaca attagaaaca gatataattc ttcactctac tttaactgct ccttatgata 13981 attcagaaaa ctctaacaaa gttcgattta tccctttcga catctttcca catccagaat 14041 ctctcgagaa atatcctctt ccagttgatc atgactctca atctgcaatt tcaacactaa 14101 ttccaggccc tccttctcat catgtattac gaccactagg agtgtcatcc acagcttggt 14161 ataaagggat aagttattgt agatacctag aaacacaaaa gatacagact ggtgatcatc 14221 tttatttagc cgaaggaagc ggtgcttcaa tgtcacttct agaactctta tttccaggag 14281 atactgtcta ttataatagt ctttttagta gtggagagaa tcctccacag agaaactatg 14341 cccctcttcc aactcaattt gtacagagtg ttccatataa attgtggcaa gctgatcttg 14401 ctgatgatag caatttgata aaagattttg tcccattatg gaatggaaac ggtgcagtta 14461 cagacttatc aacaaaggat gcagttgcat tcataataca taaagtagga gcagagaaag 14521 catcccttgt ccatatagat ctcgaatcaa ctgctaatat aaatcagcaa actctgtcca 14581 gatcccagat tcattcatta attatagcaa ctactgttct taagaggggt gggatattaa 14641 tttataaaac atcatggctt ccgttttcta ggtttagtca actagcaggt ctactttggt 14701 gcttctttga ccggatccat ctaatacgta gtagctattc tgatcctcac agtcatgagg 14761 tttatcttgt atgtagactt gccgcagatt ttagaactat cggtttcagt gcagctctag 14821 taactgctac tactcttcac aatgacggat tcacaacaat acatcctgat gttgtttgta 14881 gttattggca acaccatctt gaaaatgttg ggagagtcgg aaaagtaatt gatgagatac 14941 ttgatggttt agccaccaac ttcttcgcag gagataatgg gcttattcta agatgtggag 15001 gaactccaag ctccagaaaa tggttagaga ttgaccagtt agcatcattt gatttggttc 15061 aagatgctct ggttacactt atcactatac acctaaagga aattatagaa gtgcagtcat 15121 cacatacaga ggattataca tctctcctct tcacacctta taatattggt gcagcaggga 15181 aagtcagaac tatcatcaaa ttaattctag aacgatcttt aatgtataca gtccgaaatt 15241 ggttagtttt acccagttcc atccgggatt ctgtacgaca agatttagaa ttagggtcat 15301 ttagattaat gtctatttta agtgaacaga catttcttaa aaagacaccc acaaaaaaat 15361 acttacttga tcagcttaca aggacatata tatcaacctt ctttaactct cactcagtcc 15421 ttcccctcca ccgtccatat caaaaacaaa tatggaaagc cttaggtagt gtaatatatt 15481 gttcggagac agttgatata cctctaatta aagacattca gatagaagat attaatgatt 15541 ttgaagatat cgagaggggt atcgatggcg aagaattatg acaacaatga ttataagaac 15601 tcatgatagt tttatttaag aaaaacatat tgattttccc cttggt // LOCUS NC_006213 30741 bp RNA linear VRL 21-FEB-2019 DEFINITION Human coronavirus OC43 strain ATCC VR-759, complete genome. ACCESSION NC_006213 VERSION NC_006213.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human coronavirus OC43 (HCoV-OC43) ORGANISM Human coronavirus OC43 Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Embecovirus. REFERENCE 1 (bases 1 to 30741) AUTHORS St-Jean,J.R., Jacomy,H., Desforges,M., Vabret,A., Freymuth,F. and Talbot,P.J. TITLE Human Respiratory Coronavirus OC43: Genetic Stability and Neuroinvasion JOURNAL J. Virol. 78 (16), 8824-8834 (2004) PUBMED 15280490 REFERENCE 2 (bases 1 to 30741) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (21-FEB-2019) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 30741) AUTHORS St-Jean,J.R. and Talbot,P.J. TITLE Direct Submission JOURNAL Submitted (29-MAR-2004) INRS-Institut Armand-Frappier, 531 boul. des Prairies, Laval, Qc H7V 1B7, Canada COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to AY585228. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..30741 /organism="Human coronavirus OC43" /mol_type="genomic RNA" /strain="ATCC VR-759" /serotype="OC43" /db_xref="ATCC:VR-759" /db_xref="taxon:31631" /country="USA" gene 210..21496 /gene="orf1ab" /locus_tag="EYW02_gp1" /db_xref="GeneID:39105222" CDS join(210..13340,13340..21496) /gene="orf1ab" /locus_tag="EYW02_gp1" /ribosomal_slippage="" /note="replicase polyprotein; translated via a -1 ribosomal frameshift; ORF 1a/1b" /codon_start=1 /product="Orf1ab" /protein_id="YP_009555238.1" /db_xref="GeneID:39105222" /translation="MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDMICSTTAQ KLETDGICPENHVMVDCRRLLKQECCVQSSLIREIVMNASPYDLEVLLQDALQSREAVL VTTPLGMSLEACYVRGCNPKGWTMGLFRRRSVCNTGRCTVNKHVAYQLYMIDPAGVCLG AGQFVGWVIPLAFMPVQSRKFIVPWVMYLRKRGEKGAYNKDHGRGGFGHVYDFKVEDAY DQVHDEPKGKFSKKAYALIRGYRGVKPLLYVDQYGCDYTGSLADGLEAYADKTLQEMKA LFPTWSQELLFDVIVAWHVVRDPRYVMRLQSAATIRSVAYVANPTEDLCDGSVVIKEPV HVYADDSIILRQYNLVDIMSHFYMEADTVVNAFYGVALKDCGFVMQFGYIDCEQDSCDF KGWIPGNMIDGFACTTCGHVYEVGDLMAQSSGVLPVNPVLHTKSAAGYGGFGCKDSFTL YGQTVVYFGGCVYWSPARNIWIPILKSSVKSYDSLVYTGVLGCKAIVKETNLICKALYL DYVQHKCGNLHQRELLGVSDVWHKQLLLNRGVYKPLLENIDYFNMRRAKFSLETFTVCA DGFMPFLLDDLVPRAYYLAVSGQAFCDYADKLCHAVVSKSKELLDVSLDSLGAAIHYLN SKIVDLAQHFSDFGTSFVSKIVHFFKTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVK NIPRYASAVAQAFQSVAKVVLDSLRVTFIDGLSCFKIGRRRICLSGRKIYEVERGLLHS SQLPLDVYDLTMPSQVQKAKQKPIYLKGSGSDFSLADSVVEVVTTSLTPCGYSEPPKVA AKICIVDNVYMAKAGDKYYPVVVDDHVGLLDQAWRVPCAGRRVTFKEQPTVKEIISMPK IIKVFYELDNDFNTILNTACGVFEVDDTVDMEEFYAVVIDAIEEKLSPCKELEGVGAKV SAFLQKLEDNPLFLFDEAGEEVLAPKLYCAFTAPEDDDFLEESDVEEDDVEGEETDLTV TSAGQPCVASEQEESSEVLEDTLDDGPSVETSDSQVEEDVEMSDFVDLESVIQDYENVC FEFYTTEPEFVKVLGLYVPKATRNNCWLRSVLAVMQKLPCQFKDKNLQDLWVLYKQQYS QLFVDTLVNKIPANIVLPQGGYVADFAYWFLTLCDWQCVAYWKCIKCDLALKLKGLDAM FFYGDVVSHICKCGESMVLIDVDVPFTAHFALKDKLFCAFITKRIVYKAACVVDVNDSH SMAVVDGKQIDDHRITSITSDKFDFIIGHGMSFSMTTFEIAQLYGSCITPNVCFVKGDI IKVSKLVKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGVCATGDCYV STGGKLCKTVLNVVGPDARTQGKQSYVLLERVYKHLNNYDCVVTTLISAGIFSVPSDVS LTYLLGTAKKQVVLVSNNQEDFDLISKCQITAVEGTKKLAARLSFNVGRSIVYETDANK LILINDVAFVSTFNVLQDVLSLRHDIALDDDARTFVQSNVDVVPEGWRVVNKFYQINGV RTVKYFECTGGIDICSQDKVFGYVQQGIFNKATVAQIKALFLDKVDILLTVDGVNFTNR FVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDNLSSEDLKAVRSSFNFDQK ELLAYYNMLVNCFKWQVVVNGKYFTFKQANNNCFVNVSCLMLQSLHLTFKIVQWQEAWL EFRSGRPARFVALVLAKGGFKFGDPADSRDFLRVVFSQVDLTGAICDFEIACKCGVKQE QRTGLDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDVPFLICSNTPASVKLPKGV GSANIFIGDKVGHYVHVKCEQSYQLYDASNVKKVTDVTGKLSDCLYLKNLKQTFKSVLT TYYLDDVKKIEYKPDLSQYYCDGGKYYTQRIIKAQFKTFEKVDGVYTNFKLIGHTVCDS LNAKLGFDSSKEFVEYKITEWPTATGDVVLATDDLYVKRYERGCITFGKPVIWLSHEKA SLNSLTYFNRPSLVDDNKFDVLKVDDVDDGGDSSESGAKETKEINIIKLSGVKKPFKVE DSVIVNDDTSETKYVKSLSIVDVYDMWLTGCKYVVRTANALSRAVNVPTIRKFIKFGMT LVSIPIDLLNLREIKPAVNVVKAVRNKISVCFNFIKWLFVLLFGWIKISADNKVIYTTE IASKLTCKLVALAFKNAFLTFKWSMVARGACIIATIFLLWFNFIYANVIFSDFYLPKIG FLPTFVGKIAQWIKNTFSLVTICDLYSIQDVGFKNQYCNGSIACQFCLAGFDMLDNYKA IDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWLPELFML STLHWSFRLLVALANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNR SLRVKCSTIVGGMIRYYDVMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKE LKRPIQPTDVAYHTVTDVKQVGCSMRLFYDRDGQRTYDDVNASLFVDYSNLLHSKVKSV PNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVYVD TFLSMFDVDKKSLNALIATAHSSIKQGTQIYKVLDTFLSCARKSCSIDSDVDTKCLADS VMSAVSAGLELTDESCNNLVPTYLKSDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIW SVDAFNQFSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKGGAVFSYFVYV CFVLSLVCFIGLWCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQF DQWYESTFGLSYYSNSMACPIVVAVIDQDFGSTVFNVPTKVLRYGYHVLHFITHALSAD GVQCYTPHSQISYSNFYASGCVLSSACTMFTMADGSPQPYCYTEGLMQNASLYSSLVPH VRYNLANAKGFIRFPEVLREGLVRIVRTRSMSYCRVGLCEEADEGICFNFNGSWVLNND YYRSLPGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLK RAFGDYTSVVFVNVIVWCVNFMMLFVFQVYPILSCVYAICYFYATLYFPSEISVIMHLQ WLVMYGTIMPLWFCLLYIAVVVSNHAFWVFSYCRKLGTSVRSDGTFEEMALTTFMITKD SYCKLKNSLSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFTNNNGSDV LYQPPTASVSTSFLQSGIVKMVNPTSKVEPCVVSVTYGNMTLNGLWLDDKVYCPRHVIC SASDMTNPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMRGCMLVLTVTLQNSRTPKYTF GVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVGYVIMGDCVKFVYM HQLELSTGCHTGTDFNGDFYGPYKDAQVVQLLIQDYIQSVNFVAWLYAAILNNCNWFVQ SDKCSVEDFNVWALSNGFSQVKSDLVIDALASMTGVSLETLLAAIKRLKNGFQGRQIMG SCSFEDELTPSDVYQQLAGIKLQSKRTRLFKGTVCWIMASTFLFSCIITAFVKWTMFMY VTTNMFSITFCALCVISLAMLLVKHKHLYLTMYITPVLFTLLYNNYLVVYKHTFRGYVY AWLSYYVPSVEYTYTDEVIYGMLLLVGMVFVTLRSINHDLFSFIMFVGRLISVFSLWYK GSNLEEEILLMLASLFGTYTWTTVLSMAVAKVIAKWVAVNVLYFTDIPQIKIVLLCYLF IGYIISCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKL LGIGGVPIIEVSQFQSKLTDVKCANVVLLNCLQHLHVASNSKLWHYCSTLHNEILATSD LSVAFEKLAQLLIVLFANPAAVDSKCLTSIEEVCDDYAKDNTVLQALQSEFVNMASFVE YEVAKKNLDEARFSGSANQQQLKQLEKACNIAKSAYERDRAVAKKLERMADLALTNMYK EARINDKKSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLNII VPDKSVYDQVVDNVYVTYAGNVWQIQTIQDSDGTNKQLNEISDDCNWPLVIIANRYNEV SATVLQNNELMPAKLKIQVVNSGPDQTCNTPTQCYYNNSNNGKIVYAILSDVDGLKYTK ILKDDGNFVVLELDPPCKFTVQDAKGLKIKYLYFVKGCNTLARGWVVGTISSTVRLQAG TATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDA TTSQDSYGGASVCIYCRARVEHPDVDGLCKLRGKFVQVPVGIKDPVSYVLTHDVCRVCG FWRDGSCSCVSTDTTVQSKDTNFLNRVRGTSVDARLVPCASGLSTDVQLRAFDIYNASV AGIGLHLKVNCCRFQRVDENGDKLDQFFVVKRTDLTIYNREMKCYERVKDCKFVAEHDF FTFDVEGSRVPHIVRKDLTKYTMLDLCYALRHFDRNDCMLLCDILSIYAGCEQSYFTKK DWYDFVENPDIINVYKKLGPIFNRALVSATEFADKLVEVGLVGVLTLDNQDLNGKWYDF GDYVIAAPGCGVAIADSYYSYIMPMLTMCHALDCELYVNNAYRLFDLVQYDFTDYKLEL FNKYFKHWSMPYHPNTVDCQDDRCIIHCANFNILFSMVLPNTCFGPLVRQIFVDGVPFV VSIGYHYKELGIVMNMDVDTHRYRLSLKDLLLYAADPALHVASASALYDLRTCCFSVAA ITSGVKFQTVKPGNFNQDFYDFVLSKGLLKEGSSVDLKHFFFTQDGNAAITDYNYYKYN LPTMVDIKQLLFVLEVVYKYFEIYDGGCIPASQVIVNNYDKSAGYPFNKFGKARLYYEA LSFEEQDEIYAYTKRNVLPTLTQMNLKYAISAKNRARTVAGVSILSTMTGRMFHQKCLK SIAATRGVPVVIGTTKFYGGWDDMLRRLIKDVDNPVLMGWDYPKCDRAMPNLLRIVSSL VLARKHETCCSQSDRFYRLANECAQVLSEIVMCGGCYYVKPGGTSSGDATTAFANSVFN ICQAVSANVCALMSCNGNKIEDLSIRALQKRLYSHVYRSDKVDSTFVTEYYEFLNKHFS MMILSDDGVVCYNSDYASKGYIANISAFQQVLYYQNNVFMSESKCWVEHDINNGPHEFC SQHTMLVKMDGDDVYLPYPNPSRILGAGCFVDDLLKTDSVLLIERFVSLAIDAYPLVYH ENEEYQKVFRVYLAYIKKLYNDLGNQILDSYSVILSTCDGQKFTDESFYKNMYLRSAVM QSVGACVVCSSQTSLRCGSCIRKPLLCCKCCYDHVMATDHKYVLSVSPYVCNAPGCDVN DVTKLYLGGMSYYCEDHKPQYSFKLVMNGLVFGLYKQSCTGSPYIDDFNRIASCKWTDV DDYILANECTERLKLFAAETQKATEEAFKQSYASATIQEIVSERELILSWEIGKVKPPL NKNYVFTGYHFTKNGKTVLGEYVFDKSELTNGVYYRATTTYKLSVGDVFVLTSHSVANL SAPTLVPQENYSSIRFASVYSVLETFQNNVVNYQHIGMKRYCTVQGPPGTGKSHLAIGL AVFYCTARVVYTAASHAAVDALCEKAYKFLNINDCTRIVPAKVRVECYDKFKINDTTRK YVFTTINALPEMVTDIVVVDEVSMLTNYELSVINARIRAKHYVYIGDPAQLPAPRVLLS KGTLEPKYFNTVTKLMCCLGPDIFLGTCYRCPKEIVDTVSALVYENKLKAKNESSSLCF KVYYKGVTTHESSSAVNMQQIYLINKFLKANPLWHKAVFISPYNSQNFAAKRVLGLQTQ TVDSAQGSEYDYVIYSQTAETAHSVNVNRFNVAITRAKKGILCVMSNMQLFEALQFTTL TLDKVPQAVETKVQCSTNLFKDCSKSYSGYHPAHAPSFLAVDDKYKATGDLAVCLGIGD SAVTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNF PLQLGFSTGIDFVVEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGHRWDVVRP RIVQMFADHLIDLSDCVVLVTWAANFELTCLRYFAKVGREISCNVCTKRATVYNSRTGY YGCWRHSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAV YDCFCNNINWNVEYPIISNELSINTSCRVLQRVILKAAMLCNRYTLCYDIGNPKAIACV KDFDFKFYDAQPIVKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRV LNNLNLPGCNGGSLYVNKHAFHTKPFARAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVD YVPLKSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTF TKLQSLENVVYNLVKTGHYTGQAGEMPCAIINDKVVAKIDKEDVVIFINNTTYPTNVAV ELFAKRSVRHHPELKLFRNLNIDVCWKHVIWDYARESIFCSNTYGVCMYTDLKFIDKLN VLFDGRDNGALEAFKRSNNGVYISTTKVKSLSMIRGPPRAELNGVVVDKVGDTDCVFYF AVRKEGQDVIFSQFDSLGVSSNQSPQGNLGSNGKPGNVGGNDALSISTIFTQSRVISSF TCRTDMEKDFIALDQDVFIQKYGLEDYAFEHIVYGNFNQKIIGGLHLLIGLYRRQQTSN LVVQEFVSYDSSIHSYFITDEKSGGSKSVCTVIDILLDDFVALVKSLNLNCVSKVVNVN VDFKDFQFMLWCNDEKVMTFYPRLQAASDWKPGYSMPVLYKYLNSPMERVSLWNYGKPV TLPTGCMMNVAKYTQLCQYLNTTTLAVPVNMRVLHLGAGSEKGVAPGSAVLRQWLPAGT ILVDNDLYPFVSDSVATYFGDCITLPFDCQWDLIISDMYDPITKNIGEYNVSKDGFFTY ICHMIRDKLALGGSVAIKITEFSWNAELYKLMGYFAFWTVFCTNANASSSEGFLIGINY LCKPKVEIDGNVMHANYLFWRNSTVWNGGAYSLFDMAKFPLKLAGTAVINLRADQINDM VYSLLEKGKLLIRDTNKEVFVGDSLVNVI" mat_peptide 210..947 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="leader protein" /note="PL1-PRO cleavage product" /protein_id="YP_009555246.1" mat_peptide 948..2762 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="MHV p65-like protein" /note="PL1-PRO cleavage product" /protein_id="YP_009555247.1" mat_peptide 2763..8459 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp1" /note="PL1-PRO; PL2-PRO; HD; contains papain-like proteinase 1 domain (PL1-PRO), X-domain, papain-like proteinase 2 domain (PL2-PRO) and hydrophobic domain (HD)" /protein_id="YP_009555248.1" mat_peptide 8460..9947 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="transmembrane domain 2" /protein_id="YP_009555249.1" mat_peptide 9948..10856 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp2" /note="3CL-PRO cleavage product" /protein_id="YP_009555250.1" mat_peptide 10857..11717 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp3" /note="HD2; hydrophobic domain" /protein_id="YP_009555258.1" mat_peptide 11718..11984 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp4" /protein_id="YP_009555259.1" mat_peptide 11985..12575 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp5" /protein_id="YP_009555251.1" mat_peptide 12576..12905 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp6" /protein_id="YP_009555252.1" mat_peptide 12906..13316 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp7" /note="GFL; growth-factor-like protein" /protein_id="YP_009555253.1" mat_peptide join(13317..13340,13340..16099) /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp9" /note="RNA-dependent RNA polymerase" /protein_id="YP_009555260.1" mat_peptide 16100..17908 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp10" /note="MB; NTPase/HEL; metal-binding domain; NTPase/helicase domain" /protein_id="YP_009555254.1" mat_peptide 17909..19471 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp11" /protein_id="YP_009555255.1" mat_peptide 19472..20596 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp12" /protein_id="YP_009555256.1" mat_peptide 20597..21493 /gene="orf1ab" /locus_tag="EYW02_gp1" /product="nsp13" /protein_id="YP_009555257.1" gene 21506..22342 /gene="ns2" /locus_tag="EYW02_gp2" /db_xref="GeneID:39105215" CDS 21506..22342 /gene="ns2" /locus_tag="EYW02_gp2" /note="non-structural protein" /codon_start=1 /product="ns2" /protein_id="YP_009555239.1" /db_xref="GeneID:39105215" /translation="MAVAYADKPNHFINFPLTHFQGFVLNYKGLQFQILDEGVDCKIQT APHISLTMLDIQPEDYKSVDVAIQEVIDDMHWGDGFQIKFENPHILGRCIVLDVKGVEE LHDDLVNYIRDKGCVADQSRKWIGHCTIAQLTDAALSIKENVDFINSMQFNYKITINPS SPARLEIVKLGAEKKDGFYETIVSHWMGIRFEYTSPTDKLAMIMGYCCLDVVRKELEEG DLPENDDDAWFKLSYHYENNSWFFRHVYRKSFHFRKACQNLDCNCLGFYESSVEEY" gene 22354..23628 /locus_tag="EYW02_gp3" /db_xref="GeneID:39105216" CDS 22354..23628 /locus_tag="EYW02_gp3" /codon_start=1 /product="hemagglutinin-esterase" /protein_id="YP_009555240.1" /db_xref="GeneID:39105216" /translation="MFLLPRFILVSCIIGSLGFYNPPTNVVSHVNGDWFLFGDSRSDCN HIVNINPHNYSYMDLNPVLCDSGKISSKAGNSIFRSFHFTDFYNYTGEGQQIIFYEGVN FTPYHAFKCNRSGSNDIWMQNKGLFYTQVYKNMAVYRSLTFVNVPYVYNGSAQATALCK SGSLVLNNPAYIAPQANSGDYYYKVEADFYLSGCDEYIVPLCIFNGKFLSNTKYYDDSQ YYFNKDTGVIYGLNSTETITTGFDLNCYYLVLPSGNYLAISNELLLTVPTKAICLNKRK DFTPVQVVDSRWNNARQSDNMTAVACQPPYCYFRNSTTNYVGVYDINHGDAGFTSILSG LLYNSPCFSQQGVFRYDNVSSVWPLYPYGRCPTAADINIPDLPICVYDPLPVILLGILL GVAIVIIVVLLLYFMVDNVTRLHDA" gene 23643..27704 /locus_tag="EYW02_gp4" /db_xref="GeneID:39105218" CDS 23643..27704 /locus_tag="EYW02_gp4" /codon_start=1 /product="spike surface glycoprotein" /protein_id="YP_009555241.1" /db_xref="GeneID:39105218" /translation="MFLILLISLPTAFAVIGDLKCTSDNINDKDTGPPPISTDTVDVTN GLGTYYVLDRVYLNTTLFLNGYYPTSGSTYRNMALKGSVLLSRLWFKPPFLSDFINGIF AKVKNTKVIKDRVMYSEFPAITIGSTFVNTSYSVVVQPRTINSTQDGDNKLQGLLEVSV CQYNMCEYPQTICHPNLGNHRKELWHLDTGVVSCLYKRNFTYDVNADYLYFHFYQEGGT FYAYFTDTGVVTKFLFNVYLGMALSHYYVMPLTCNSKLTLEYWVTPLTSRQYLLAFNQD GIIFNAVDCMSDFMSEIKCKTQSIAPPTGVYELNGYTVQPIADVYRRKPNLPNCNIEAW LNDKSVPSPLNWERKTFSNCNFNMSSLMSFIQADSFTCNNIDAAKIYGMCFSSITIDKF AIPNGRKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAANVSVSRFNPSTWNKRFGF IEDSVFKPRPAGVLTNHDVVYAQHCFKAPKNFCPCKLNGSCVGSGPGKNNGIGTCPAGT NYLTCDNLCTPDPITFTGTYKCPQTKSLVGIGEHCSGLAVKSDYCGGNSCTCRPQAFLG WSADSCLQGDKCNIFANFILHDVNSGLTCSTDLQKANTDIILGVCVNYDLYGILGQGIF VEVNATYYNSWQNLLYDSNGNLYGFRDYITNRTFMIRSCYSGRVSAAFHANSSEPALLF RNIKCNYVFNNSLTRQLQPINYFDSYLGCVVNAYNSTAISVQTCDLTVGSGYCVDYSKN RRSRGAITTGYRFTNFEPFTVNSVNDSLEPVGGLYEIQIPSEFTIGNMVEFIQTSSPKV TIDCAAFVCGDYAACKSQLVEYGSFCDNINAILTEVNELLDTTQLQVANSLMNGVTLST KLKDGVNFNVDDINFSPVLGCLGSECSKASSRSAIEDLLFDKVKLSDVGFVEAYNNCTG GAEIRDLICVQSYKGIKVLPPLLSENQISGYTLAATSASLFPPWTAAAGVPFYLNVQYR INGLGVTMDVLSQNQKLIANAFNNALYAIQEGFDATNSALVKIQAVVNANAEALNNLLQ QLSNRFGAISASLQEILSRLDALEAEAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAA QAMEKVNECVKSQSSRINFCGNGNHIISLVQNAPYGLYFIHFSYVPTKYVTARVSPGLC IAGDRGIAPKSGYFVNVNNTWMYTGSGYYYPEPITENNVVVMSTCAVNYTKAPYVMLNT SIPNLPDFKEELDQWFKNQTSVAPDLSLDYINVTFLDLQVEMNRLQEAIKVLNQSYINL KDIGTYEYYVKWPWYVWLLICLAGVAMLVLLFFICCCTGCGTSCFKKCGGCCDDYTGYQ ELVIKTSHDD" gene 27792..28121 /gene="ns12.9" /locus_tag="EYW02_gp5" /db_xref="GeneID:39105219" CDS 27792..28121 /gene="ns12.9" /locus_tag="EYW02_gp5" /note="non-structural protein" /codon_start=1 /product="ns12.9" /protein_id="YP_009555242.1" /db_xref="GeneID:39105219" /translation="MDIWRPEKKYLRYINGFNVSELEDACFKFNYQFPKVGYCRVPSHA WCRNQGRFCATFTLYGKSKHYDKYFGVINGFTAFANTVEDAVNKLVFLAVDFITWRRQE LNVYG" gene 28108..28362 /locus_tag="EYW02_gp6" /db_xref="GeneID:39105217" CDS 28108..28362 /codon_start=1 /product="envelope protein" /protein_id="YP_009555243.1" /translation="MFMADAYLADTVWYVGQIIFIVAICLLVTIVVVAFLATFKLCIQL CGMCNTLVLSPSIYVFNRGRQFYEFYNDVKPPVLDVDDV" gene 28377..29069 /locus_tag="EYW02_gp7" /db_xref="GeneID:39105220" CDS 28377..29069 /locus_tag="EYW02_gp7" /codon_start=1 /product="membrane protein" /protein_id="YP_009555244.1" /db_xref="GeneID:39105220" /translation="MSSKTTPAPVYIWTADEAIKFLKEWNFSLGIILLFITIILQFGYT SRSMFVYVIKMIILWLMWPLTIILTIFNCVYALNNVYLGLSIVFTIVAIIMWIVYFVNS IRLFIRTGSFWSFNPETNNLMCIDMKGTMYVRPIIEDYHTLTVTIIRGHLYIQGIKLGT GYSLADLPAYMTVAKVTHLCTYKRGFLDRISDTSGFAVYVKSKVGNYRLPSTQKGSGMD TALLRNNI" gene 29079..30425 /locus_tag="EYW02_gp8" /db_xref="GeneID:39105221" CDS 29079..30425 /locus_tag="EYW02_gp8" /codon_start=1 /product="nucleocapsid protein" /protein_id="YP_009555245.1" /db_xref="GeneID:39105221" /translation="MSFTPGKQSSSRASSGNRSGNGILKWADQSDQFRNVQTRGRRAQP KQTATSQQPSGGNVVPYYSWFSGITQFQKGKEFEFVEGQGVPIAPGVPATEAKGYWYRH NRRSFKTADGNQRQLLPRWYFYYLGTGPHAKDQYGTDIDGVYWVASNQADVNTPADIVD RDPSSDEAIPTRFPPGTVLPQGYYIEGSGRSAPNSRSTSRTSSRASSAGSRSRANSGNR TPTSGVTPDMADQIASLVLAKLGKDATKPQQVTKHTAKEVRQKILNKPRQKRSPNKQCT VQQCFGKRGPNQNFGGGEMLKLGTSDPQFPILAELAPTAGAFFFGSRLELAKVQNLSGN PDEPQKDVYELRYNGAIRFDSTLSGFETIMKVLNENLNAYQQQDGMMNMSPKPQRQRGH KNGQGENDNISVAVPKSRVQQNKSRELTAEDISLLKKMDEPYTEDTSEI" ORIGIN 1 attgtgagcg atttgcgtgc gtgcatcccg cttcactgat ctcttgttag atctttttgt 61 aatctaaact ttataaaaac atccactccc tgtaatctat gcttgtgggc gtagattttt 121 catagtggtg tttatattca tttctgctgt taacagcttt cagccaggga cgtgttgtat 181 cctaggcagt ggcccgccca taggtcacaa tgtcgaagat caacaaatac ggtctcgaac 241 tacactgggc tccagaattt ccatggatgt ttgaggacgc agaggagaag ttggataacc 301 ctagtagttc agaggtggat atgatttgct ccaccactgc gcaaaagctg gaaacagacg 361 gaatttgtcc tgaaaatcat gtgatggtgg attgtcgccg acttcttaaa caagagtgtt 421 gtgtgcagtc tagcctaata cgtgaaattg ttatgaatgc aagtccatat gatttggagg 481 tgctacttca agatgctttg cagtcccgtg aagcagtttt ggttacaacc cccttaggta 541 tgtctttaga ggcatgctat gtgagaggtt gtaatcctaa aggatggacc atgggtttgt 601 ttcggcgtag aagtgtgtgt aacactggtc gttgcactgt taataagcat gtggcctatc 661 agttatatat gattgatcct gcaggtgtct gtcttggtgc aggtcaattc gtgggttggg 721 tcataccctt agcctttatg cctgtgcaat cccggaaatt tattgttcca tgggttatgt 781 acttgcgtaa gcgtggcgaa aagggtgctt acaataaaga tcatggacgt ggcggttttg 841 gacatgttta tgattttaaa gttgaagatg cttatgacca ggtgcatgat gagcctaagg 901 gtaagttttc taagaaggct tatgctttaa ttagagggta tcgtggtgtt aaaccacttc 961 tctatgtaga ccagtatggt tgtgattata ctggtagtct tgcagatggc ttagaggctt 1021 atgctgataa gacattgcaa gaaatgaagg cattatttcc tacttggagt caggaactcc 1081 tttttgatgt aattgtggca tggcatgttg tgcgtgatcc acgttatgtt atgagattgc 1141 agagtgctgc tactatacgt agtgttgcat atgttgctaa tcctactgaa gacttgtgtg 1201 atggttctgt tgttataaaa gaacctgtgc atgtttatgc agatgactct attattttac 1261 gtcaatataa tttagttgac attatgagtc atttttatat ggaggcagat acagttgtaa 1321 atgcttttta tggtgttgct ttgaaagatt gcggttttgt tatgcagttt ggttacattg 1381 attgcgaaca agactcgtgt gattttaaag gttggattcc tggtaacatg atagatggtt 1441 ttgcttgcac cacttgtggt catgtttatg aagtaggtga tttgatggca caatcttcag 1501 gtgttttgcc tgttaaccct gtattgcata ctaagagtgc agcaggctat ggtggttttg 1561 gttgtaaaga ttcttttact ctgtatggcc aaactgtagt ttattttgga ggttgtgtgt 1621 attggagtcc agcacgtaat atatggattc ctatattaaa atcctctgtt aagtcatatg 1681 acagtttggt ttatactgga gttttaggtt gcaaggctat tgtaaaggaa acaaatctca 1741 tttgcaaagc tttgtacctt gattatgttc aacacaagtg tggcaattta caccaacggg 1801 agttgctagg tgtttcagat gtgtggcata aacaattgct attaaataga ggtgtttata 1861 aacctctgtt agagaatatt gattatttta atatgcggcg cgctaaattt agtttagaaa 1921 cttttactgt ttgtgcagat ggctttatgc cttttctttt agatgattta gttccacgcg 1981 catattattt ggcagtaagt ggtcaagcat tttgtgatta tgcagataaa ctttgccatg 2041 ccgttgtgtc taagagtaaa gagttacttg atgtgtctct ggattcttta ggtgcagcta 2101 tacattattt gaattctaag attgttgatt tggctcaaca ttttagtgat tttggaacaa 2161 gtttcgtttc taaaattgtt catttcttta agacttttac tactagcact gctcttgcat 2221 ttgcatgggt tttatttcat gttttgcatg gtgcttatat agtagtggag agtgatatat 2281 attttgttaa aaacattcct cgttatgcta gtgctgttgc acaagcattt cagagtgttg 2341 ctaaagttgt actggactct ttaagagtta cttttattga tggcctttct tgttttaaga 2401 ttggacgtag aagaatttgt ctttcaggca gaaaaattta tgaagttgag cgtggcttgt 2461 tacattcatc ccaattgcca ttagatgttt atgatttaac catgcctagt caagttcaga 2521 aagccaagca aaaacctatt tatttaaaag gttctggttc tgatttttca ttagcggata 2581 gtgtagttga agttgttaca acttcactta caccatgtgg ttattctgaa ccacctaaag 2641 ttgcagctaa aatttgcatt gtggataatg tttatatggc caaggctggt gacaaatatt 2701 accctgttgt ggttgatgat catgttggac tcttggatca agcatggaga gttccttgtg 2761 ctggaaggcg tgttacattt aaggaacagc ctacagtaaa ggagattata agcatgccta 2821 agattattaa ggttttttat gagcttgaca acgattttaa tactatttta aatactgcgt 2881 gtggagtgtt tgaagtggat gatactgttg atatggagga attttatgct gtggtgattg 2941 atgccataga agagaaactt tctccatgta aggagcttga aggtgtaggt gctaaagtta 3001 gtgccttttt acagaaatta gaggataatc ccctattttt atttgatgag gctggcgagg 3061 aagttcttgc tcctaaattg tattgtgcct ttacagctcc tgaagatgat gactttcttg 3121 aggaaagtga tgttgaagaa gatgatgtag aaggtgagga aactgattta actgtcacaa 3181 gtgctggaca gccttgtgtt gctagtgaac aggaggagtc ttctgaagtc ttagaggaca 3241 ctttggatga tggtccaagt gtggagacat ctgattcaca agttgaagaa gatgtagaaa 3301 tgtcggattt tgttgatctt gaatctgtga ttcaggatta tgaaaatgtt tgttttgagt 3361 tttatactac agagccagaa tttgttaaag ttttgggtct gtatgtgcct aaagcaactc 3421 gcaacaattg ctggttgcga tcagttttgg cagtgatgca gaaattgccc tgtcaattta 3481 aagataaaaa tttgcaggat ctttgggtgt tatacaagca acagtatagt cagttgtttg 3541 ttgatacctt ggttaataag atacctgcta atattgtact tccacaaggt ggttatgttg 3601 ctgattttgc atattggttt ttaaccttat gtgattggca gtgtgttgca tactggaaat 3661 gcattaaatg tgatttagct cttaagctta aaggcttgga tgctatgttc ttttatggtg 3721 atgttgtttc acatatatgc aagtgtggtg agtctatggt acttattgat gttgatgtgc 3781 catttacagc ccactttgct cttaaagata agttgttttg tgcatttatt actaagcgta 3841 ttgtgtataa agcagcttgt gttgtggatg ttaatgatag tcattctatg gctgttgttg 3901 atggtaaaca aattgatgat catcgtatca ctagtattac tagtgataag tttgatttta 3961 ttattgggca tggtatgtca ttttcaatga ctacttttga aattgcccaa ttgtatggtt 4021 cttgtataac acctaatgtg tgttttgtta aaggtgatat aattaaagta tctaagcttg 4081 ttaaagcaga agttgttgta aaccctgcta atggccatat ggcacatggt ggtggtgttg 4141 caaaagctat tgcagtagca gctggacagc agtttgttaa agagactacc gatatggtta 4201 agtctaaagg agtttgtgct actggagatt gttatgtctc tacagggggc aaattatgta 4261 aaactgtgct taatgttgtt ggacctgatg cgagaacaca gggtaaacaa agttatgtat 4321 tgttagagcg tgtttataaa catcttaaca actatgactg tgttgttaca actttgatct 4381 cagctggtat atttagtgtg ccttctgatg tgtctttaac atatctactt ggtactgcta 4441 agaaacaagt tgttcttgtt agcaataatc aagaggattt tgatcttatt tctaagtgtc 4501 agataactgc tgttgagggc actaagaaat tggcagcgcg tctttctttt aatgttggac 4561 gttccattgt ttacgaaaca gatgctaata agttgatttt aatcaatgac gttgcatttg 4621 tttcgacatt taatgtttta caggatgttt tatccttaag acatgatata gcacttgatg 4681 atgatgcacg aaccttcgtt cagagcaatg ttgatgttgt acctgagggt tggcgtgttg 4741 tcaataagtt ttatcaaatt aatggtgtta gaaccgttaa gtattttgag tgtactggag 4801 gcatagatat atgcagccag gataaagttt ttggttatgt acagcagggt atttttaata 4861 aggctactgt tgctcaaatt aaagccttgt ttttggataa agtggacatc ttgctaactg 4921 ttgatggtgt taatttcact aataggtttg tgcctgttgg tgaaagtttt ggtaagagtc 4981 taggaaatgt gttttgtgat ggagttaatg tcacgaagca taagtgtgat ataaattata 5041 aaggtaaagt ctttttccag tttgataatc tttctagtga agatttaaag gctgtaagaa 5101 gttcctttaa ttttgatcag aaggaattgc ttgcctatta caacatgctt gttaattgtt 5161 ttaagtggca ggttgttgtt aatggtaagt atttcacttt taagcaagct aataacaatt 5221 gttttgttaa tgtttcttgc ttaatgctcc agagtttgca tctgacattt aaaattgttc 5281 aatggcaaga ggcatggctt gaatttcgtt ctggccgccc tgctagattt gtagctttgg 5341 ttttggccaa aggtgggttt aaatttggag atcctgctga ttctagagat ttcttgcgtg 5401 ttgtgtttag tcaagttgat ttgactgggg caatatgtga ttttgaaatt gcatgtaaat 5461 gtggtgtaaa gcaggaacag cgtactggtc tggacgctgt tatgcatttt ggtacattga 5521 gtcgtgaaga tcttgagatt ggttataccg tggactgttc ttgcggtaaa aagctaattc 5581 attgtgtacg atttgatgta ccatttttaa tttgcagtaa tacacctgct agtgtaaaat 5641 tacctaaggg tgtaggaagt gcaaatattt ttataggtga taaggttggt cattatgttc 5701 atgttaagtg tgaacaatct tatcagcttt atgatgcttc taatgttaag aaggttacag 5761 atgttactgg caagttgtca gattgtctgt atcttaaaaa tttgaaacaa acttttaaat 5821 cggtgttaac cacctattat ttggatgatg ttaagaaaat tgagtataaa cctgacttgt 5881 cacaatatta ttgtgacgga ggtaagtatt atactcagcg tattattaaa gcccaattta 5941 aaacattcga gaaagtagat ggtgtgtata ctaattttaa attgatagga cacaccgtct 6001 gtgacagtct taatgctaag ttgggttttg atagctctaa agagtttgtt gaatataaga 6061 ttactgagtg gccaacagct acaggtgatg tggtgttggc tactgatgat ttgtatgtta 6121 agagatatga gaggggttgt attacttttg gtaaacctgt tatatggtta agccatgaga 6181 aagcttccct caattcttta acatatttta atagaccttc attggttgat gataataaat 6241 ttgatgtttt aaaagtggat gatgttgacg atggtggtga cagctcagag agtggtgcca 6301 aagaaaccaa agaaatcaac attattaagt taagtggtgt taaaaaacca tttaaggttg 6361 aagatagtgt cattgttaat gatgatacta gtgaaaccaa atatgttaag agtttgtcta 6421 ttgttgatgt gtatgatatg tggcttacag gttgtaagta tgttgttaga actgctaatg 6481 ctttgagcag agcagttaac gtacctacaa tacgtaagtt tataaaattt ggtatgactc 6541 ttgttagtat accaattgat ttgttaaatt taagagagat taagcctgct gttaatgtgg 6601 ttaaagctgt gcgaaataaa atttctgtat gctttaattt tattaaatgg ctttttgtct 6661 tattatttgg ctggattaaa atatccgctg ataataaagt aatctacacc acagaaattg 6721 catcaaagct tacgtgtaag cttgtagctt tagcttttaa aaatgcattt ttgacattta 6781 agtggagtat ggttgctaga ggtgcttgca ttatagcgac tatatttcta ttgtggttta 6841 attttatata tgccaatgta atttttagtg atttttattt gcctaaaatc ggtttcttgc 6901 cgacttttgt tggtaagatt gcacagtgga ttaagaacac ttttagtctt gtaactattt 6961 gtgatctata ttccattcag gatgtgggtt ttaagaatca gtattgtaat ggaagtattg 7021 catgtcagtt ctgcttggca ggatttgata tgttagataa ttataaagcc attgatgtag 7081 tacagtatga agctgatagg agagcatttg ttgattatac aggtgtgtta aagattgtca 7141 ttgaattgat agttagttac gccctgtata cggcatggtt ttatccattg tttgccctta 7201 tcagtattca gatcttgacc acttggctgc ctgagctttt tatgcttagt acattacatt 7261 ggagttttag gttgctggtg gctttagcta atatgttacc agcacatgtg tttatgaggt 7321 tttatattat tattgcctct tttattaagc tctttagctt gtttaggcat gttgcctatg 7381 gttgtagtaa atctggttgt ttgttttgtt acaagaggaa tcgtagtcta cgtgttaaat 7441 gtagtactat cgttggtggc atgatacgct attacgatgt tatggctaat ggtggcactg 7501 gcttttgttc aaaacatcaa tggaattgca ttgattgtga ttcttataaa ccaggtaata 7561 cttttattac tgttgaggcc gctcttgatc tatctaagga attgaaacgg cccattcagc 7621 ctacagatgt tgcttatcat acggttactg atgttaagca agttggttgt tctatgcgct 7681 tgttctatga tcgtgatgga cagcgcacat atgatgatgt taatgctagt ttgtttgtgg 7741 attatagtaa tttgctacat tctaaggtta agagtgtgcc taatatgcat gttgtggtag 7801 tggaaaatga tgctgataaa gccaattttc tgaatgctgc tgtattttat gcacagtctt 7861 tgtttagacc tattttaatg gttgataaaa atctgataac tactgctaac actggtacgt 7921 ctgttacaga aactatgttt gatgtttatg tggatacatt tttgtctatg tttgatgtgg 7981 ataaaaagag tcttaatgct ttaatagcaa ctgcgcattc ttctataaaa cagggtacgc 8041 agatttataa agttttggat acctttttaa gctgtgctcg taaaagttgt tctattgatt 8101 cagatgttga tactaagtgt ttagctgatt ctgtcatgtc tgctgtatcg gcaggtcttg 8161 aattgacgga tgaaagttgt aataacttgg tgccaacata tttgaagagt gacaacattg 8221 tggcagctga tttaggtgtt ctgattcaaa attctgcaaa gcatgtgcag ggtaatgttg 8281 ctaaaatagc tggtgtttcc tgtatatggt ctgtggatgc ttttaatcag tttagttctg 8341 atttccagca taaattgaag aaagcatgtt gtaaaactgg tttgaaactg aagcttactt 8401 ataataagca gatggctaat gtctctgttt taactacacc ctttagtctt aaagggggtg 8461 cagtttttag ttattttgtt tatgtgtgtt ttgtgttgag tttggtctgt tttattggac 8521 tgtggtgctt aatgcccact tacacagtac acaaatcaga ttttcagctt cccgtttatg 8581 ccagttataa agttttagat aatggtgtta ttagagatgt tagcgttgaa gatgtttgtt 8641 tcgctaacaa atttgaacaa tttgatcaat ggtatgagtc tacatttggt ctaagttatt 8701 atagtaacag tatggcttgt cccattgttg ttgctgtaat agatcaggat tttggctcta 8761 cagtgtttaa tgtccctacc aaagtgttac gatatggtta tcatgtgttg cactttatta 8821 cacatgcact ttctgctgat ggagtgcagt gttatacgcc acatagtcaa atatcgtatt 8881 ctaattttta tgctagtggc tgtgtgcttt cctctgcttg cactatgttt acaatggccg 8941 atggtagtcc acaaccttat tgttatacag aggggcttat gcaaaatgct tctctgtata 9001 gttcattggt acctcacgtg cggtataatc ttgctaatgc taaaggtttt atccgttttc 9061 cagaagtgtt gcgagaaggg cttgtacgta tcgtgcgtac tcgttctatg tcgtattgca 9121 gagttggatt atgtgaggaa gctgatgagg gtatatgctt taattttaat ggttcttggg 9181 tgcttaataa tgattattat agatcattgc ctgggacctt ttgtggtaga gatgtttttg 9241 atttaattta tcagctattt aaaggtttag cacagcctgt ggattttttg gcattgactg 9301 ctagttccat tgctggtgct atactcgctg taattgttgt tttggtgttt tattacctaa 9361 taaagcttaa acgtgctttt ggtgattaca ccagtgttgt ttttgttaac gtgattgtgt 9421 ggtgtgtaaa ttttatgatg ctttttgtgt ttcaagttta ccccatactt tcttgtgtat 9481 atgctatttg ttatttttat gccacgcttt atttcccttc ggagataagt gtgataatgc 9541 acttacaatg gctagttatg tatggcacta ttatgccttt atggttttgt ttgctatata 9601 tagctgttgt tgtttcaaat catgcttttt gggtattttc ttactgcaga aagcttggta 9661 cttctgttcg tagtgatggt acatttgaag aaatggctct cactactttt atgattacaa 9721 aagattctta ttgtaagctt aagaattctt tgtctgatgt tgcttttaat agatatttga 9781 gtttgtataa taaatatagg tattacagcg gtaaaatgga tactgctgca tatagggagg 9841 ctgcttgctc tcagttggct aaagcaatgg acacatttac caataataat ggtagtgatg 9901 tgctttacca accgcctact gcttccgtct caacttcatt cttgcaatct ggtattgtga 9961 aaatggtaaa tcctacttct aaggtagaac catgtgttgt cagtgttacc tatggtaata 10021 tgacattgaa tggtttatgg ttggatgaca aggtctactg tcccagacat gtaatatgtt 10081 ctgcttcaga tatgactaat ccagattata caaatttgtt gtgtagagta acatcaagtg 10141 attttactgt attgtttgat cgtctaagcc ttacagtgat gtcttatcaa atgcggggtt 10201 gtatgcttgt tcttacagtg accctgcaaa attctcgtac gccaaaatat acatttggtg 10261 tggttaaacc tggtgagact tttactgttt tagctgctta taacggcaaa ccacaaggag 10321 cctttcatgt aactatgcgt agtagttata ccattaaggg ttccttttta tgcggatctt 10381 gtggatctgt tggttatgta ataatgggtg attgtgttaa atttgtttat atgcatcaat 10441 tggagcttag tactggttgt catactggta ctgacttcaa tggggatttt tatggtcctt 10501 ataaggatgc tcaggttgtt cagttgctca ttcaggatta tatacaatct gttaattttg 10561 tagcatggct ttatgctgct atacttaaca attgtaattg gtttgtacaa agtgataagt 10621 gttctgtaga agattttaat gtgtgggctc tgtccaatgg atttagccaa gttaaatctg 10681 accttgttat agatgcttta gcttctatga ctggtgtgtc tttggaaaca ctgttggctg 10741 ctattaagcg tcttaagaat ggtttccaag gacgtcagat tatgggtagt tgctcttttg 10801 aggatgaatt gacacctagc gatgtttatc aacaactcgc tggtatcaag ttacaatcaa 10861 aacgcactag attgtttaaa ggcactgttt gttggattat ggcttctaca tttttgttta 10921 gttgcataat tacagcattt gtgaaatgga ctatgtttat gtatgtaact actaatatgt 10981 ttagtattac gttttgtgca ctttgtgtta taagtttggc catgttgttg gttaagcata 11041 agcatcttta tttgactatg tatataactc ctgtgctttt tacactgttg tataacaact 11101 atttggttgt gtacaagcat acatttagag gctatgtcta tgcatggcta tcatattatg 11161 ttccatcagt tgagtacact tatactgatg aagttattta tggcatgtta ttgcttgtag 11221 gaatggtctt tgttacatta cgtagcatta accatgattt gttttctttt ataatgtttg 11281 ttggtcgttt gatttctgtt ttctctttgt ggtacaaggg ttctaactta gaggaagaaa 11341 ttcttcttat gttggcttcc ctttttggta cttacacatg gacaacagtt ttatctatgg 11401 ctgtagcaaa ggttattgct aagtgggttg ctgtgaatgt cttgtatttc acagatatac 11461 ctcaaattaa gatagtgctt ttgtgctatt tgtttattgg ttatattatt agctgttatt 11521 ggggcttgtt ttccttgatg aacagtttgt ttagaatgcc tttgggtgtt tataattata 11581 aaatttcagt acaggaatta agatatatga atgctaatgg attgcgccct cctaagaata 11641 gttttgaagc ccttatgctt aattttaagc tgttgggtat tggaggtgtt ccaatcattg 11701 aagtatctca atttcaatca aaattgactg atgtcaaatg tgctaatgtc gtcttgctta 11761 attgcttgca acatttgcat gttgcttcta attctaagtt gtggcattat tgtagcactt 11821 tgcacaatga aatacttgcc acttcggatc tgagtgttgc ttttgaaaag cttgctcagt 11881 tattaattgt tttgtttgct aatccagctg ctgtggatag caagtgcctg actagtattg 11941 aagaagtttg cgatgattac gcaaaggaca atactgtttt gcaggcttta cagagtgaat 12001 ttgttaatat ggctagcttc gttgaatatg aagttgctaa gaaaaatctt gatgaggcgc 12061 gttttagtgg ttctgctaat caacagcagt taaaacagct agagaaagcc tgtaatattg 12121 ctaaatctgc ttatgaacgc gaccgtgctg tagcaaaaaa gttggagcgt atggctgatt 12181 tggctctcac taatatgtat aaagaagcta gaattaatga taagaagagt aaggttgttt 12241 ctgccttgca aactatgctt tttagtatgg tgcgtaagtt agataatcaa gctctgaatt 12301 caatattaga taacgctgtg aagggttgtg taccattgaa tgcaatacct tcattggcag 12361 caaatactct gaatataatt gtaccagata aaagtgttta tgaccaggta gttgataatg 12421 tctatgttac ctatgcgggt aatgtatggc agattcaaac tatccaggat tcagatggta 12481 caaataagca gttgaatgag atatctgatg attgtaactg gccactagtt attattgcaa 12541 atcggtataa tgaggtatct gctactgttt tgcaaaataa tgaattaatg cctgctaagt 12601 tgaaaattca ggttgttaat agtggtccag atcagacttg taatacacct actcaatgtt 12661 actataataa tagtaacaat gggaagattg tttatgctat acttagtgat gttgatggtc 12721 ttaagtatac aaaaattctt aaagatgatg gcaattttgt tgttttggag ttagatcctc 12781 cttgtaaatt tactgttcaa gatgctaaag gtcttaaaat taagtacctt tattttgtaa 12841 aaggttgtaa cacactagca agaggctggg ttgttggtac aatttcttct acagttagat 12901 tgcaagctgg aactgctact gaatatgctt ccaactcatc tatattgtct ttatgtgcgt 12961 tttctgtaga tcctaagaaa acgtatttag attttataca acaaggagga acacctattg 13021 ccaattgtgt taaaatgttg tgtgaccatg ctggtaccgg tatggccatt actgttaaac 13081 ccgatgctac cactagtcag gattcatatg gtggtgcgtc tgtttgtata tattgccgcg 13141 cacgagttga acacccagat gttgatgggt tgtgcaaatt acgcggcaag tttgtacaag 13201 tgcctgtagg tataaaagat cctgtgtctt atgttttgac acatgatgtt tgtcgagttt 13261 gtggattttg gcgggatgga agttgttcat gtgttagcac tgacactact gttcaatcaa 13321 aagatactaa ttttttaaac gggttcgggg tacgagtgta gatgcccgtc tcgtaccctg 13381 cgccagtggt ttatctactg atgtacaatt aagggcattt gatatttaca atgctagtgt 13441 tgctggcatt ggtttacatt taaaagttaa ttgttgccgt tttcagcgtg ttgatgagaa 13501 cggtgataaa ttagatcagt tctttgttgt taagaggaca gatctgacta tatataatag 13561 agagatgaaa tgctatgagc gtgtaaaaga ttgtaagttt gtggctgaac acgatttctt 13621 tacatttgat gtagaaggta gtcgtgtgcc acacattgta cgcaaggatt taacaaagta 13681 tactatgttg gatctttgct atgcattgcg acattttgat cgcaatgatt gcatgctgct 13741 ttgtgacatt ctctctatat atgctggttg tgaacaatcc tactttacta agaaggattg 13801 gtatgatttt gttgaaaatc ctgatattat taatgtgtat aaaaagctag gacctatttt 13861 taatagagcc ctagttagcg ctactgagtt tgcggacaaa ttggtggagg taggcttagt 13921 aggcgtttta acacttgata atcaagattt aaatggtaaa tggtatgatt ttggtgacta 13981 tgttattgca gccccaggat gtggtgttgc tatagcagat tcttattatt cttatatcat 14041 gcctatgctg accatgtgtc atgcattgga ttgcgaattg tatgtgaata atgcttatag 14101 actatttgat cttgtacagt atgattttac tgattacaag cttgaattgt ttaataagta 14161 ttttaagcac tggagtatgc catatcatcc taacactgtt gattgtcagg atgatcggtg 14221 tattatacat tgtgctaatt ttaacatact ttttagtatg gttttaccta atacatgttt 14281 tgggcctctt gttaggcaaa tttttgtgga tggtgtgcct tttgttgttt caattggcta 14341 ccattataaa gaacttggta ttgtgatgaa tatggatgtg gatacacatc gttatcgctt 14401 gtctttaaaa gacttgcttt tatatgctgc tgatccagct ttgcatgtag cttctgctag 14461 tgcattgtat gatttacgca cttgctgttt tagtgttgcc gctataacaa gcggtgtaaa 14521 atttcaaaca gttaaacctg gtaattttaa tcaggatttt tatgattttg ttttaagtaa 14581 aggcctgctt aaagagggta gctcagttga tctgaagcac tttttcttta cacaggatgg 14641 taatgctgct attactgatt ataattatta taagtataat ttgcccacca tggtggacat 14701 taagcagttg ttgtttgttt tggaagttgt ttataagtat tttgagattt atgatggtgg 14761 gtgtataccg gcatcacaag tcattgttaa taattatgat aagagtgctg gctatccatt 14821 taacaaattt ggaaaagcca ggctctatta tgaagcatta tcatttgagg aacaggatga 14881 aatttacgct tatactaagc gtaatgtcct gccaacactt actcaaatga atttgaaata 14941 tgctattagt gctaagaata gagcccgcac tgttgctggt gtttccatac ttagtactat 15001 gactggcaga atgtttcatc aaaaatgttt gaaaagtata gcagctacac gtggtgttcc 15061 tgtagttata ggcaccacta aattttatgg tggctgggat gatatgttac gccgccttat 15121 taaagatgtt gacaatcctg tacttatggg ttgggattat cctaagtgtg atcgtgctat 15181 gccaaaccta ctacgtattg ttagtagttt ggtattagcc cgaaaacatg agacatgttg 15241 ttcgcaaagc gataggtttt atcgacttgc gaatgaatgc gcacaagttt tgagtgaaat 15301 tgttatgtgt ggtggctgtt attatgttaa gcctggtggc actagtagtg gtgatgcaac 15361 tactgctttt gctaattcag tctttaacat atgtcaagct gtttcagcca atgtatgtgc 15421 cttaatgtca tgcaatggca ataagattga agatcttagt atacgtgctc ttcagaagcg 15481 cttatactca catgtgtata gaagtgataa ggttgattca acctttgtca cagaatatta 15541 tgaattttta aataagcatt ttagtatgat gattttgagt gatgatgggg ttgtgtgtta 15601 taattctgat tatgcgtcca aagggtatat tgctaatata agtgcctttc aacaggtatt 15661 atattatcaa aataacgttt ttatgtcaga atccaaatgt tgggttgaac atgacataaa 15721 taatggacct catgaattct gttcacaaca cacaatgctt gtaaagatgg atggtgacga 15781 tgtctacctt ccatatccta atcctagtcg tatattagga gctggatgtt ttgtagatga 15841 tttgttaaag actgatagtg ttcttttaat agaacgattt gtaagtcttg caatagatgc 15901 ttatccactt gtgtatcatg aaaatgaaga ataccaaaag gtttttcgtg tttatttggc 15961 gtatataaag aagttgtaca atgacctggg taatcagatc ttggatagct acagtgttat 16021 tttaagtact tgtgatggac aaaagttcac tgatgagtcc ttttacaaga acatgtattt 16081 aagaagtgca gttatgcaga gtgttggagc ttgcgtggtc tgctcttctc aaacatcatt 16141 acgttgtggc agttgcatca gaaagcctct tctttgctgc aagtgttgtt atgatcatgt 16201 tatggcgact gatcataaat atgtcttgag tgtttcacca tatgtgtgta atgcaccagg 16261 atgtgatgta aatgatgtta ccaaattgta tctaggtggt atgtcatatt attgtgaaga 16321 ccataagcca caatattcat tcaagttggt aatgaatggt ctggtttttg gtctatataa 16381 acaatcttgt acaggatctc cgtacataga cgattttaat cgtatagcta gttgtaaatg 16441 gaccgatgtg gatgattaca tactagctaa tgaatgtaca gagcgcttga aattgtttgc 16501 tgcagaaacg caaaaggcaa ccgaggaagc ctttaagcag agttatgcat cagcaacaat 16561 acaagagatt gttagtgagc gcgaattgat tctctcttgg gagattggaa aagttaagcc 16621 accacttaat aaaaattatg tttttactgg ctaccatttt actaaaaatg gtaagacagt 16681 tttaggtgag tatgtttttg ataagagtga gttgactaat ggtgtgtatt atcgcgccac 16741 aaccacttat aagctatctg taggagatgt ttttgtttta acctctcatt cagtagctaa 16801 tttaagtgct cctacgcttg ttccgcagga gaattatagt agtattagat ttgctagtgt 16861 ttatagtgtg cttgagacgt ttcagaacaa tgttgttaat tatcaacaca ttggtatgaa 16921 acgttactgc accgtgcaag gacctcctgg tacagggaag tcacatcttg ctattggtct 16981 tgctgtattc tattgtacag cacgtgttgt atacacagcg gccagccatg cagctgttga 17041 cgcattgtgt gaaaaagcat ataaattttt gaatataaat gattgcactc gtattgttcc 17101 ggccaaggtc agggtggagt gctatgataa gtttaaaatt aatgacacca ctcgtaagta 17161 tgtgtttact accataaatg cattacctga gatggtgact gatattgttg ttgtagatga 17221 agttagtatg cttaccaatt atgagctttc tgttattaat gctcgtattc gcgctaagca 17281 ttatgtttat attggtgatc ctgctcaatt gccagcacca cgtgtgttat tgagcaaggg 17341 tacacttgaa cctaaatatt ttaacactgt tactaagctc atgtgttgct tagggccaga 17401 catttttctt ggtacatgtt atagatgtcc taaggaaatc gttgatacag tgtccgcctt 17461 ggtttatgaa aataagctta aggctaagaa tgagagtagt tcattgtgtt ttaaggtcta 17521 ttataagggc gttacaacac atgaaagttc tagtgctgta aatatgcagc agatttattt 17581 gattaataag tttttgaagg ctaacccttt gtggcataaa gctgttttta ttagcccata 17641 taatagtcag aactttgcag ctaagcgtgt tttgggttta caaacccaaa ccgtggattc 17701 tgctcaaggt tctgaatatg attatgttat atattcacag actgcagaaa cagcgcattc 17761 tgtaaatgtt aatcgcttca atgttgctat tactcgagcc aagaaaggta ttctttgtgt 17821 tatgagtaat atgcagttgt ttgaagcatt acagtttact acattgacct tagataaagt 17881 gccacaggcc gtcgaaacta aagttcaatg tagtactaat ttatttaaag attgtagcaa 17941 gagttatagc ggttatcacc cagctcatgc tccttcattt ttggcagtag atgacaaata 18001 taaggcaact ggcgatttag ccgtgtgtct tggtattggt gattctgctg ttacatattc 18061 aagattaata tcactcatgg gttttaaatt ggatgttacc cttgatgggt attgtaagct 18121 ttttataact aaagaagaag ctgttaaacg cgtgcgtgcc tgggttggct ttgatgctga 18181 aggtgctcat gccacgcgtg atagcattgg gacaaatttc ccacttcaat taggattttc 18241 cacaggaatt gattttgttg tggaagccac tggtttgttt gctgatagag atggttacag 18301 ctttaaaaag gctgtggcga aagctcctcc tggtgaacaa tttaagcacc tcatcccttt 18361 gatgacgaga ggtcatcgct gggatgttgt tagacctaga atagtacaaa tgtttgcaga 18421 tcatttaatt gatctgtctg attgtgttgt gctagttaca tgggcagcca actttgagct 18481 cacttgtctc cgctactttg caaaagtagg gcgtgagatt tcttgtaatg tatgcactaa 18541 acgtgccaca gtttacaatt ctagaactgg ttactatggt tgttggcgcc atagtgttac 18601 atgtgattac ttgtataatc cacttattgt tgatattcaa cagtggggat atattggttc 18661 tttatcaagt aatcatgatt tatattgtag tgtccataaa ggagcacatg ttgcttcctc 18721 tgatgctata atgacacggt gtttggccgt ttatgattgc ttttgcaata atattaattg 18781 gaatgtggag tatcccatca tttcaaatga gttaagtatt aatacctctt gtagggtctt 18841 gcagcgtgtg attcttaaag ctgccatgct ctgcaacaga tatactttgt gttatgatat 18901 tggcaaccca aaagcgattg cctgtgtcaa agattttgat tttaagttct atgatgccca 18961 accaattgtt aagtctgtta agactctttt gtattctttt gaggcacata aggactcttt 19021 taaagacggt ttgtgtatgt tttggaactg taatgtggat aagtatccac cgaatgcagt 19081 tgtatgtaga tttgacacta gagtgttgaa taatttaaat cttcctggct gtaatggagg 19141 tagtttgtat gttaataaac atgcattcca cactaaaccc tttgctaggg cagcctttga 19201 gcatttgaag cctatgccat tcttctatta ttcagatacg ccttgtgtgt atatggatgg 19261 catggatgct aagcaggttg attatgtacc tttgaaatct gccacgtgca tcacaagatg 19321 caatttaggt ggtgcagttt gtttaaaaca tgctgaagag tatcgtgagt acttagagtc 19381 ttacaataca gctactacag caggttttac tttttgggtc tataagacat ttgattttta 19441 taatttgtgg aatacgttca ccaagctaca aagcttggag aatgttgtat ataatttagt 19501 caagactggt cattatacag gacaggctgg tgaaatgcct tgtgccatta taaatgataa 19561 agttgtggct aagatcgata aggaggatgt tgtcattttt attaataata caacataccc 19621 tactaatgtg gccgttgaat tatttgccaa gcgcagtgtt cgacaccacc cagagcttaa 19681 gctctttaga aatttaaata tagacgtgtg ttggaagcac gtcatttggg attatgctag 19741 agaaagtata ttttgcagta atacctatgg tgtctgcatg tatacagatt taaagttcat 19801 tgataaattg aatgtccttt ttgatggtcg tgataatggt gctcttgaag cttttaaacg 19861 ttctaataat ggcgtttaca tttccacgac aaaagttaag agtctttcga tgataagagg 19921 tccaccgcgt gctgaattaa atggcgtagt ggtggacaag gttggagaca ctgattgtgt 19981 gttttatttt gctgtgcgta aagaaggtca ggatgtcatc ttcagccaat tcgacagcct 20041 gggagtcagc tctaaccaga gcccacaagg taatctgggg agtaatggta aacccggtaa 20101 tgtcggtggt aatgatgctc tgtcaatctc tactatcttt acacaaagcc gtgttattag 20161 ctcttttaca tgtcgtactg atatggaaaa agattttata gctttagatc aagatgtgtt 20221 tattcagaag tatggtttgg aggactatgc ctttgaacac attgtttatg gtaacttcaa 20281 ccagaagatt attggtggtt tgcatttgtt aataggcttg taccgaagac agcaaacttc 20341 caatctggtt gttcaggagt ttgtttcata tgactccagc atacactctt attttatcac 20401 tgacgagaag agtggtggta gtaagagtgt ttgcactgtt atagatattt tgttggatga 20461 ttttgtggct cttgttaagt cacttaatct taattgtgtg agtaaggttg ttaatgttaa 20521 tgttgatttt aaagattttc agtttatgct ttggtgtaac gatgagaaag ttatgacttt 20581 ctatcctcgt ttgcaagctg catctgactg gaagcctggt tattctatgc ctgtattata 20641 taagtatttg aattctccaa tggaaagagt tagtctctgg aattatggga agccagttac 20701 tttgcctaca ggctgtatga tgaatgttgc taagtatact cagttatgtc aatatctgaa 20761 tactacaaca ttagctgtac ctgttaatat gcgagttttg catttaggtg caggttcaga 20821 aaaaggagta gcaccgggtt ctgcagttct taggcagtgg ttgcctgctg gtactattct 20881 tgtagataac gatttatacc catttgttag tgacagtgtc gctacatatt ttggggattg 20941 tataacttta ccctttgatt gtcaatggga tttgataatt tctgatatgt atgaccctat 21001 tactaagaac ataggggagt acaatgtgag taaagatggt ttctttacat acatttgtca 21061 tatgattcga gacaagttag ctctgggtgg cagtgttgct ataaaaataa cagagttttc 21121 ttggaatgca gaattatata agttaatggg gtattttgca ttttggactg tgttttgcac 21181 aaatgcaaat gcttcttcta gtgaaggatt tttaattggc ataaattatt tgtgtaagcc 21241 caaggttgag atagatggaa atgttatgca tgccaattat ttgttttgga gaaattccac 21301 agtttggaac gggggtgctt atagcctgtt tgatatggct aaattcccgc ttaagttggc 21361 tggtactgcc gtaataaatt taagagcaga ccagattaat gatatggttt attcccttct 21421 tgaaaagggt aaactactta ttagagatac aaataaagaa gttttcgttg gtgacagttt 21481 ggttaatgta atctaaactt taaaaatggc tgtcgcttat gcagacaagc ctaatcattt 21541 tatcaatttt ccacttaccc attttcaggg ttttgtgtta aattataaag gtttacaatt 21601 tcaaattctc gatgaaggag tggattgtaa aatacaaaca gcgccacaca ttagtcttac 21661 tatgctggac atacagcctg aagactataa aagtgttgat gtcgctattc aagaagttat 21721 tgatgatatg cattggggtg atggttttca gattaaattt gagaatcctc acatcctagg 21781 aagatgcata gttttagatg ttaaaggtgt agaagaattg catgacgatt tagttaatta 21841 cattcgtgat aaaggttgtg ttgctgacca atccaggaaa tggattggcc attgcaccat 21901 agctcaactc acggatgcag cactgtccat taaggaaaat gttgatttta taaacagcat 21961 gcaattcaat tataaaatca ccatcaaccc ctcatcaccg gctagacttg aaatagttaa 22021 gctcggtgct gaaaagaaag atggttttta tgaaaccata gttagtcact ggatgggaat 22081 tcgttttgaa tacacatcac ccactgataa gctagctatg attatgggtt attgttgttt 22141 agatgtggta cgtaaagagc tagaagaagg cgatcttccc gagaatgatg atgatgcttg 22201 gtttaagcta tcgtaccatt atgaaaacaa ttcttggttc ttccgacatg tctacaggaa 22261 aagttttcat ttccgtaagg cttgtcaaaa tttagattgt aattgtttgg ggttttatga 22321 atcttcagtt gaagaatatt aaactcagtg aaaatgtttt tgcttcctag atttattcta 22381 gttagctgca taattggtag cttaggtttt tacaaccctc ctaccaatgt tgtttcgcat 22441 gtaaatggag attggttttt atttggtgac agtcgttcag attgtaatca tattgttaat 22501 atcaaccccc ataattattc ttatatggac cttaatcctg ttctgtgtga ttctggtaaa 22561 atatcatcta aagctggcaa ctccattttt aggagttttc actttaccga tttttataat 22621 tacacaggcg aaggtcaaca aattattttt tatgagggtg ttaattttac gccttatcat 22681 gcctttaaat gcaaccgttc tggtagtaat gatatttgga tgcagaataa aggcttgttt 22741 tatactcagg tttataagaa tatggctgtg tatcgcagcc ttacttttgt taatgtacca 22801 tatgtttata atggctccgc acaagctaca gctctttgta aatctggtag tttagtcctt 22861 aataaccctg catatatagc tcctcaagct aactctgggg attattatta taaggttgaa 22921 gctgattttt atttgtcagg ttgtgacgag tatatcgtac cactttgtat ttttaacggc 22981 aagtttttgt cgaatacaaa gtattatgat gatagtcaat attattttaa taaagacact 23041 ggtgttattt atggtctcaa ttctacagaa accattacca ctggttttga tcttaattgt 23101 tattatttag ttttaccctc tggtaattat ttagccattt caaatgagct attgttaact 23161 gttcctacga aagcaatctg tcttaataag cgtaaggatt ttacgcctgt acaggttgtt 23221 gattcgcggt ggaacaatgc caggcagtct gataacatga cggcggttgc ttgtcaacct 23281 ccgtactgtt attttcgtaa ttctactacc aactatgttg gtgtttatga tattaatcat 23341 ggagatgctg gttttactag catacttagt ggtttgttat ataattcacc ttgtttttcg 23401 cagcaaggcg tttttaggta tgataatgtt agcagtgtct ggcctctcta cccctatggc 23461 agatgtccca ctgctgctga tattaatatc cctgatttac ccatttgtgt gtatgatccg 23521 ctaccagtta ttttgcttgg cattcttttg ggcgttgcga ttgtaattat tgtagttttg 23581 ttgttatatt ttatggtgga taatgttact aggctgcatg atgcttagac cataatctaa 23641 acatgttttt gatactttta atttccttac caacggcttt tgctgttata ggagatttaa 23701 agtgtacttc agataatatt aatgataaag acaccggtcc tcctcctata agtactgata 23761 ctgttgatgt tactaatggt ttgggtactt attatgtttt agatcgtgtg tatttaaata 23821 ctacgttgtt tcttaatggt tattacccta cttcaggttc cacatatcgt aatatggcac 23881 tgaagggaag tgtactattg agcagactat ggtttaaacc accatttctt tctgatttta 23941 ttaatggtat ttttgctaag gtcaaaaata ccaaggttat taaagatcgt gtaatgtata 24001 gtgagttccc tgctataact ataggtagta cttttgtaaa tacatcctat agtgtggtag 24061 tacaaccacg tacaatcaat tcaacacagg atggtgataa taaattacaa ggtcttttag 24121 aggtctctgt ttgccagtat aatatgtgcg agtacccaca aacgatttgt catcctaacc 24181 tgggtaatca tcgcaaagaa ctatggcatt tggatacagg tgttgtttcc tgtttatata 24241 agcgtaattt cacatatgat gtgaatgctg attatttgta ttttcatttt tatcaagaag 24301 gtggtacttt ttatgcatat tttacagaca ctggtgttgt tactaagttt ttgtttaatg 24361 tttatttagg catggcgctt tcacactatt atgtcatgcc tctgacttgt aatagtaagc 24421 ttactttaga atattgggtt acacctctca cttctagaca atatttactc gctttcaatc 24481 aagatggtat tatttttaat gctgttgatt gtatgagtga ttttatgagt gagattaagt 24541 gtaaaacaca atctatagca ccacctactg gtgtttatga attaaacggt tacactgttc 24601 agccaatcgc agatgtttac cgacgtaaac ctaatcttcc caattgcaat atagaagctt 24661 ggcttaatga taagtcggtg ccctctccat taaattggga acgtaagaca ttttcaaatt 24721 gtaattttaa tatgagcagc ctgatgtctt ttattcaggc agactcattt acttgtaata 24781 atattgatgc tgctaagata tatggtatgt gtttttccag cataactata gataagtttg 24841 ctatacccaa tggcaggaag gttgacctac aattgggtaa tttgggctat ttgcagtcat 24901 ttaactatag aattgatact actgcaacaa gttgtcagtt gtattataat ttacctgctg 24961 ctaatgtttc tgttagcagg tttaatcctt ctacttggaa taagagattt ggttttatag 25021 aagattctgt ttttaagcct cgacctgcag gtgttcttac taatcatgat gtagtttatg 25081 cacaacactg tttcaaagct cctaaaaatt tctgtccgtg taaattgaat ggttcgtgtg 25141 taggtagtgg tcctggtaaa aataatggta taggcacttg tcctgcaggt actaattatt 25201 taacttgtga taatttgtgc actcctgatc ctattacatt tacaggtact tataagtgcc 25261 cccaaactaa atctttagtt ggcataggtg agcactgttc gggtcttgct gttaaaagtg 25321 attattgtgg aggcaattct tgtacttgcc gaccacaagc atttttgggt tggtctgcag 25381 actcttgttt acaaggagac aagtgtaata tttttgctaa ttttattttg catgatgtta 25441 atagtggtct tacttgttct actgatttac aaaaagctaa cacagacata attcttggtg 25501 tttgtgttaa ttatgacctc tatggtattt taggccaagg catttttgtt gaggttaatg 25561 cgacttatta taatagttgg cagaaccttt tatatgattc taatggtaat ctctacggtt 25621 ttagagacta cataacaaac agaactttta tgattcgtag ttgctatagc ggtcgtgttt 25681 ctgcggcctt tcacgctaac tcttccgaac cagcattgct atttcggaat attaaatgca 25741 actacgtttt taataatagt cttacacgac agctgcaacc cattaactat tttgatagtt 25801 atcttggttg tgttgtcaat gcttataata gtactgctat ttctgttcaa acatgtgatc 25861 tcacagtagg tagtggttac tgtgtggatt actctaaaaa cagacgaagt cgtggagcga 25921 ttaccactgg ttatcggttt actaattttg agccatttac tgttaattca gtaaacgata 25981 gtttagaacc tgtaggtggt ttgtatgaaa ttcaaatacc ttcagagttt actataggta 26041 atatggtgga gtttattcaa acaagctctc ctaaagttac tattgattgt gctgcatttg 26101 tctgtggtga ttatgcagca tgtaaatcac agttggttga atatggtagt ttctgtgata 26161 acattaatgc catactcaca gaagtaaatg aactacttga cactacacag ttgcaagtag 26221 ctaatagttt aatgaatggt gttactctta gcactaagct taaagatggc gttaatttca 26281 atgtagacga catcaatttt tcccctgtat taggttgtct aggcagcgaa tgtagtaaag 26341 cttccagtag atctgctata gaggatttac tttttgataa agtaaagtta tctgatgtcg 26401 gttttgttga ggcttataat aattgtacag gaggtgccga aattagggac ctcatttgtg 26461 tgcaaagtta taaaggcatc aaagtgttgc ctccactgct ctcagaaaat cagatcagtg 26521 gatacacttt ggctgccacc tctgctagtc tatttcctcc ttggacagca gcagcaggtg 26581 taccatttta tttaaatgtt cagtatcgca ttaatgggct tggtgtcacc atggatgtgc 26641 taagtcaaaa tcaaaagctt attgctaatg catttaacaa tgccctttat gctattcagg 26701 aagggttcga tgcaactaat tctgctttag ttaaaattca agctgttgtt aatgcaaatg 26761 ctgaagctct taataactta ttgcaacaac tctctaatag atttggtgct ataagtgctt 26821 ctttacaaga aattctatct agacttgatg ctcttgaagc ggaagctcag atagatagac 26881 ttattaatgg tcgtcttacc gctcttaatg cttatgtttc tcaacagctt agtgattcta 26941 cactggtaaa atttagtgca gcacaagcta tggagaaggt taatgaatgt gtcaaaagcc 27001 aatcatctag gataaatttc tgtggtaatg gtaatcatat tatatcatta gtgcagaatg 27061 ctccatatgg tttgtatttt atccacttta gttatgtccc tactaagtat gtcacagcga 27121 gggttagtcc tggtctgtgc attgctggtg atagaggtat agctcctaag agtggttatt 27181 ttgttaatgt aaataatact tggatgtaca ctggtagtgg ttactactac cctgaaccta 27241 taactgaaaa taatgttgtt gttatgagta cctgcgctgt taattatact aaagcgccgt 27301 atgtaatgct gaacacttca atacccaacc ttcctgattt taaggaagag ttggatcaat 27361 ggtttaaaaa tcaaacatca gtggcaccag atttgtcact tgattatata aatgttacat 27421 tcttggacct acaagttgaa atgaataggt tacaggaggc aataaaagtc ttaaatcaga 27481 gctacatcaa tctcaaggac attggtacat atgaatatta tgtaaaatgg ccttggtatg 27541 tatggctttt aatctgcctt gctggtgtag ctatgcttgt tttactattc ttcatatgct 27601 gttgtacagg atgtgggact agttgtttta agaaatgtgg tggttgttgt gatgattata 27661 ctggatacca ggagttagta atcaaaactt cacatgacga ctaagttcgt ctttgattca 27721 ttgcactgat ctcttgttag atctttttgc aatctagcat ttgttaaagt tcttaaggcc 27781 acgccctatt aatggacatt tggagacctg agaagaaata tctccgttat attaacggtt 27841 ttaatgtctc agaattagaa gatgcttgtt ttaaatttaa ctatcaattt cctaaagtag 27901 gatattgtag agttcctagt catgcttggt gccgtaatca aggtagattt tgtgctacat 27961 tcactcttta tggtaaatcc aaacattatg ataaatattt tggagtaata aatggtttca 28021 cagcattcgc taatactgta gaggatgctg ttaacaaact ggttttctta gctgttgact 28081 ttattacctg gcgcagacag gagttaaatg tttatggctg atgcttatct tgcagacact 28141 gtgtggtatg tggggcaaat aatttttata gttgccattt gtttattggt tacaatagtt 28201 gtagtggcat ttttggcaac ttttaaattg tgtattcaac tttgcggtat gtgtaatacc 28261 ttagtactgt ccccttctat ttatgtgttt aatagaggta ggcagtttta tgagttttac 28321 aatgatgtaa aaccaccagt ccttgatgtg gatgacgttt aggtaatcca aacattatga 28381 gtagtaaaac tacaccagca ccagtttata tctggactgc tgatgaagct attaaattcc 28441 taaaggaatg gaatttttct ttgggtatta tactactttt tattacaatc atattgcaat 28501 ttggatatac aagtcgcagt atgtttgttt atgttattaa gatgattatt ttgtggctta 28561 tgtggcccct tactataatc ttaactattt tcaattgcgt atacgcattg aataatgtgt 28621 atcttggcct ttctatagtt tttaccatag tggccattat tatgtggatt gtgtattttg 28681 tgaatagtat caggttgttt attagaactg gaagtttttg gagtttcaac ccagaaacaa 28741 acaacttgat gtgtatagat atgaaaggaa caatgtatgt taggccgata attgaggact 28801 atcatactct gacggtcaca ataatacgcg gccatcttta cattcaaggt ataaaactag 28861 gtactggcta ttctttggca gatttgccag cttatatgac tgttgctaag gttacacacc 28921 tgtgcacata taagcgtggt tttcttgaca ggataagcga tactagtggt tttgctgttt 28981 atgttaagtc caaagtcggt aattaccgac tgccatcaac ccaaaagggt tctggcatgg 29041 acaccgcatt gttgagaaat aatatctaaa ttttaaggat gtcttttact cctggtaagc 29101 aatccagtag tagagcgtcc tctggaaatc gttctggtaa tggcatcctc aagtgggccg 29161 atcagtccga ccagtttaga aatgttcaaa ccaggggtag aagagctcaa cccaagcaaa 29221 ctgctacctc tcagcaacca tcaggaggga atgttgtacc ctactattct tggttctctg 29281 gaattactca gtttcaaaag ggaaaggagt ttgagtttgt agaaggacaa ggtgtgccta 29341 ttgcaccagg agtcccagct actgaagcta aggggtactg gtacagacac aacagacgtt 29401 cttttaaaac agccgatggc aaccagcgtc aactgctgcc acgatggtat ttttactatc 29461 tgggaacagg accgcatgct aaagaccagt acggcaccga tattgacgga gtctactggg 29521 tcgctagcaa ccaggctgat gtcaataccc cggctgacat tgtcgatcgg gacccaagta 29581 gcgatgaggc tattccgact aggtttccgc ctggcacggt actccctcag ggttactata 29641 ttgaaggctc aggaaggtct gctcctaatt ccagatctac ttcgcgcaca tccagcagag 29701 cctctagtgc aggatcgcgt agtagagcca attctggcaa tagaacccct acctctggtg 29761 taacacctga catggctgat caaattgcta gtcttgttct ggcaaaactt ggcaaggatg 29821 ccactaaacc tcagcaagta actaagcata ctgccaaaga agtcagacag aaaattttga 29881 ataagccccg ccagaagagg agccccaata aacaatgcac tgttcagcag tgttttggta 29941 agagaggccc taatcagaat tttggtggtg gagaaatgtt aaaacttgga actagtgacc 30001 cacagttccc cattcttgca gaactcgcac ccacagctgg tgcgtttttc tttggatcaa 30061 gattagagtt ggccaaagtg cagaatttat ctgggaatcc tgacgagccc cagaaggatg 30121 tttatgaatt gcgctataac ggcgcaatta ggtttgacag tacactttca ggttttgaga 30181 ccataatgaa ggtgctgaat gagaatttga atgcctatca acaacaagat ggtatgatga 30241 atatgagtcc aaaaccacag cgtcagcgtg gtcataagaa tggacaagga gaaaatgata 30301 atataagtgt tgcagtgccc aaaagccgcg tgcagcaaaa taagagtaga gagttgactg 30361 cagaggacat cagccttctt aagaagatgg atgagcccta tactgaagac acctcagaaa 30421 tataagagaa tgaaccttat gtcggcatct ggtggtaacc cctcgcagaa aagtcgagat 30481 aaggcactct ctatcagaat ggatgtcttg ctgctataat agatagagaa ggttatagca 30541 gactatagat taattagttg aaagttttgt gttgtaatgt atagtgttgg agaaagtgaa 30601 agacttgcgg aagtaattgc cgacaagtgc ccaagggaag agccagcatg ttaagttacc 30661 acccagtaat tagtaaatga atgaagttaa ttatggccaa ttggaagaat cacaaaaaaa 30721 aaaaaaaaaa aaaaaaaaaa a // LOCUS NC_039199 13350 bp RNA linear VRL 25-AUG-2018 DEFINITION Human metapneumovirus isolate 00-1, complete genome. ACCESSION NC_039199 VERSION NC_039199.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human metapneumovirus (HMPV) ORGANISM Human metapneumovirus Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; Mononegavirales; Pneumoviridae; Metapneumovirus. REFERENCE 1 (bases 1 to 13350) AUTHORS van den Hoogen,B.G., de Jong,J.C., Groen,J., Kuiken,T., de Groot,R., Fouchier,R.A. and Osterhaus,A.D. TITLE A newly discovered human pneumovirus isolated from young children with respiratory tract disease JOURNAL Nat. Med. 7 (6), 719-724 (2001) PUBMED 11385510 REFERENCE 2 (bases 1 to 13350) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (25-AUG-2018) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 13350) AUTHORS van den Hoogen,B.G., de Jong,J.C., Groen,J., Kuiken,T., de Groot,R., Fouchier,R.A. and Osterhaus,A.D. TITLE Direct Submission JOURNAL Submitted (25-SEP-2001) Dept of Virology, Erasmus University, Dr. Molewaterplein 50, Rotterdam 3015 GE, The Netherlands REMARK Sequence update by submitter REFERENCE 4 (bases 1 to 13350) AUTHORS van den Hoogen,B.G., de Jong,J.C., Groen,J., Kuiken,T., de Groot,R., Fouchier,R.A. and Osterhaus,A.D. TITLE Direct Submission JOURNAL Submitted (18-APR-2001) Dept of Virology, Erasmus University, Dr. Molewaterplein 50, Rotterdam 3015 GE, The Netherlands COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to AF371337. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..13350 /organism="Human metapneumovirus" /mol_type="genomic RNA" /isolate="00-1" /db_xref="taxon:162145" gene 40..1224 /gene="N" /locus_tag="D1Y22_gp1" /db_xref="GeneID:37626968" CDS 40..1224 /gene="N" /locus_tag="D1Y22_gp1" /codon_start=1 /product="nucleoprotein" /protein_id="YP_009513265.1" /db_xref="GeneID:37626968" /translation="MSLQGIHLSDLSYKHAILKESQYTIKRDVGTTTAVTPSSLQQEIT LLCGEILYAKHADYKYAAEIGIQYISTALGSERVQQILRNSGSEVQVVLTRTYSLGKIK NNKGEDLQMLDIHGVEKSWVEEIDKEARKTMATLLKESSGNIPQNQRPSAPDTPIILLC VGALIFTKLASTIEVGLETTVRRANRVLSDALKRYPRMDIPKIARSFYDLFEQKVYHRS LFIEYGKALGSSSTGSKAESLFVNIFMQAYGAGQTMLRWGVIARSSNNIMLGHVSVQAE LKQVTEVYDLVREMGPESGLLHLRQSPKAGLLSLANCPNFASVVLGNASGLGIIGMYRG RVPNTELFSAAESYAKSLKESNKINFSSLGLTDEEKEAAEHFLNVSDDSQNDYE" gene 1248..2132 /gene="P" /locus_tag="D1Y22_gp2" /db_xref="GeneID:37626964" CDS 1248..2132 /gene="P" /locus_tag="D1Y22_gp2" /codon_start=1 /product="phosphoprotein" /protein_id="YP_009513266.1" /db_xref="GeneID:37626964" /translation="MSFPEGKDILFMGNEAAKLAEAFQKSLRKPGHKRSQSIIGEKVNT VSETLELPTISRPAKPTIPSEPKLAWTDKGGATKTEIKQAIKVMDPIEEEESTEKKVLP SSDGKTPAEKKLKPSTNTKKKVSFTPNEPGKYTKLEKDALDLLSDNEEEDAESSILTFE ERDTSSLSIEARLESIEEKLSMILGLLRTLNIATAGPTAARDGIRDAMIGVREELIADI IKEAKGKAAEMMEEEMSQRSKIGNGSVKLTEKAKELNKIVEDESTSGESEEEEEPKDTQ DNSQEDDIYQLIM" gene 2165..2929 /gene="M" /locus_tag="D1Y22_gp3" /db_xref="GeneID:37626965" CDS 2165..2929 /gene="M" /locus_tag="D1Y22_gp3" /codon_start=1 /product="matrix protein" /protein_id="YP_009513267.1" /db_xref="GeneID:37626965" /translation="MESYLVDTYQGIPYTAAVQVDLIEKDLLPASLTIWFPLFQANTPP AVLLDQLKTLTITTLYAASQNGPILKVNASAQGAAMSVLPKKFEVNATVALDEYSKLEF DKLTVCEVKTVYLTTMKPYGMVSKFVSSAKSVGKKTHDLIALCDFMDLEKNTPVTIPAF IKSVSIKESESATVEAAISSEADQALTQAKIAPYAGLIMIMTMNNPKGIFKKLGAGTQV IVELGAYVQAESISKICKTWSHQGTRYVLKSR" gene 3052..4671 /gene="F" /locus_tag="D1Y22_gp4" /db_xref="GeneID:37626966" CDS 3052..4671 /gene="F" /locus_tag="D1Y22_gp4" /codon_start=1 /product="fusion protein" /protein_id="YP_009513268.1" /db_xref="GeneID:37626966" /translation="MSWKVVIIFSLLITPQHGLKESYLEESCSTITEGYLSVLRTGWYT NVFTLEVGDVENLTCADGPSLIKTELDLTKSALRELRTVSADQLAREEQIENPRQSRFV LGAIALGVATAAAVTAGVAIAKTIRLESEVTAIKNALKKTNEAVSTLGNGVRVLATAVR ELKDFVSKNLTRAINKNKCDIADLKMAVSFSQFNRRFLNVVRQFSDNAGITPAISLDLM TDAELARAVSNMPTSAGQIKLMLENRAMVRRKGFGFLIGVYGSSVIYMVQLPIFGVIDT PCWIVKAAPSCSGKKGNYACLLREDQGWYCQNAGSTVYYPNEKDCETRGDHVFCDTAAG INVAEQSKECNINISTTNYPCKVSTGRHPISMVALSPLGALVACYKGVSCSIGSNRVGI IKQLNKGCSYITNQDADTVTIDNTVYQLSKVEGEQHVIKGRPVSSSFDPVKFPEDQFNV ALDQVFESIENSQALVDQSNRILSSAEKGNTGFIIVIILIAVLGSTMILVSVFIIIKKT KKPTGAPPELSGVTNNGFIPHN" gene 4737..5463 /gene="M2" /locus_tag="D1Y22_gp5" /db_xref="GeneID:37626967" CDS 4737..5300 /gene="M2" /locus_tag="D1Y22_gp5" /codon_start=1 /product="matrix protein 2-1" /protein_id="YP_009513269.1" /db_xref="GeneID:37626967" /translation="MSRKAPCKYEVRGKCNRGSECKFNHNYWSWPDRYLLIRSNYLLNQ LLRNTDRADGLSIISGAGREDRTQDFVLGSTNVVQGYIDDNQSITKAAACYSLHNIIKQ LQEVEVRQARDNKLSDSKHVALHNLVLSYMEMSKTPASLINNLKRLPREKLKKLAKLII DLSAGAENDSSYALQDSESTNQVQ" CDS 5248..5463 /gene="M2" /locus_tag="D1Y22_gp5" /codon_start=1 /product="matrix protein 2-2" /protein_id="YP_009513270.1" /db_xref="GeneID:37626967" /translation="MTLHMPCKTVKALIKCSEHGPVFITIEVDDMIWTHKDLKEALSDG IVKSHTNIYNCYLENIEIIYVKAYLS" gene 5494..6045 /gene="SH" /locus_tag="D1Y22_gp6" /db_xref="GeneID:37626961" CDS 5494..6045 /gene="SH" /locus_tag="D1Y22_gp6" /codon_start=1 /product="small hydrophobic protein" /protein_id="YP_009513271.1" /db_xref="GeneID:37626961" /translation="MITLDVIKSDGSSKTCTHLKKIIKDHSGKVLIVLKLILALLTFLT VTITINYIKVENNLQICQSKTESDKKDSSSNTTSVTTKTTLNHDITQYFKSLIQRYTNS AINSDTCWKINRNQCTNITTYKFLCFKSEDTKTNNCDKLTDLCRNKPKPAVGVYHIVEC HCIYTVKWKCYHYPTDETQS" gene 6247..6957 /gene="G" /locus_tag="D1Y22_gp7" /db_xref="GeneID:37626962" CDS 6247..6957 /gene="G" /locus_tag="D1Y22_gp7" /codon_start=1 /product="attachment glycoprotein" /protein_id="YP_009513272.1" /db_xref="GeneID:37626962" /translation="MEVKVENIRTIDMLKARVKNRVARSKCFKNASLVLIGITTLSIAL NIYLIINYKMQKNTSESEHHTSSSPMESSRETPTVPTDNSDTNSSPQHPTQQSTEGSTL YFAASASSPETEPTSTPDTTNRPPFVDTHTTPPSASRTKTSPAVHTKNNPRTSSRTHSP PRATTRTARRTTTLRTSSTRKRPSTASVQPDISATTHKNEEASPASPQTSASTTRIQRK SVEANTSTTYNQTS" gene 7167..13184 /gene="L" /locus_tag="D1Y22_gp8" /db_xref="GeneID:37626963" CDS 7167..13184 /gene="L" /locus_tag="D1Y22_gp8" /codon_start=1 /product="RNA-dependent RNA polymerase" /protein_id="YP_009513273.1" /db_xref="GeneID:37626963" /translation="MDPLNESTVNVYLPDSYLKGVISFSETNAIGSCLLKRPYLKNDNT AKVAIENPVIEHVRLKNAVNSKMKISDYKIVEPVNMQHEIMKNVHSCELTLLKQFLTRS KNISTLKLNMICDWLQLKSTSDDTSILSFIDVEFIPSWVSNWFSNWYNLNKLILEFRKE EVIRTGSILCRSLGKLVFVVSSYGCIVKSNKSKRVSFFTYNQLLTWKDVMLSRFNANFC IWVSNSLNENQEGLGLRSNLQGILTNKLYETVDYMLSLCCNEGFSLVKEFEGFIMSEIL RITEHAQFSTRFRNTLLNGLTDQLTKLKNKNRLRVHGTVLENNDYPMYEVVLKLLGDTL RCIKLLINKNLENAAELYYIFRIFGHPMVDERDAMDAVKLNNEITKILRWESLTELRGA FILRIIKGFVDNNKRWPKIKNLKVLSKRWTMYFKAKSYPSQLELSEQDFLELAAIQFEQ EFSVPEKTNLEMVLNDKAISPPKRLIWSVYPKNYLPEKIKNRYLEETFNASDSLKTRRV LEYYLKDNKFDQKELKSYVVKQEYLNDKDHIVSLTGKERELSVGRMFAMQPGKQRQIQI LAEKLLADNIVPFFPETLTKYGDLDLQRIMEIKSELSSIKTRRNDSYNNYIARASIVTD LSKFNQAFRYETTAICADVADELHGTQSLFCWLHLIVPMTTMICAYRHAPPETKGEYDI DKIEEQSGLYRYHMGGIEGWCQKLWTMEAISLLDVVSVKTRCQMTSLLNGDNQSIDVSK PVKLSEGLDEVKADYSLAVKMLKEIRDAYRNIGHKLKEGETYISRDLQFISKVIQSEGV MHPTPIKKILRVGPWINTILDDIKTSAESIGSLCQELEFRGESIIVSLILRNFWLYNLY MHESKQHPLAGKQLFKQLNKTLTSVQRFFEIKKENEVVDLWMNIPMQFGGGDPVVFYRS FYRRTPDFLTEAISHVDILLRISANIRNEAKISFFKALLSIEKNERATLTTLMRDPQAV GSERQAKVTSDINRTAVTSILSLSPNQLFSDSAIHYSRNEEEVGIIADNITPVYPHGLR VLYESLPFHKAEKVVNMISGTKSITNLLQRTSAINGEDIDRAVSMMLENLGLLSRILSV VVDSIEIPTKSNGRLICCQISRTLRETSWNNMEIVGVTSPSITTCMDVIYATSSHLKGI IIEKFSTDRTTRGQRGPKSPWVGSSTQEKKLVPVYNRQILSKQQREQLEAIGKMRWVYK GTPGLRRLLNKICLGSLGISYKCVKPLLPRFMSVNFLHRLSVSSRPMEFPASVPAYRTT NYHFDTSPINQALSERFGNEDINLVFQNAISCGISIMSVVEQLTGRSPKQLVLIPQLEE IDIMPPPVFQGKFNYKLVDKITSDQHIFSPDKIDMLTLGKMLMPTIKGQKTDQFLNKRE NYFHGNNLIESLSAALACHWCGILTEQCIENNIFKKDWGDGFISDHAFMDFKIFLCVFK TKLLCSWGSQGKNIKDEDIVDESIDKLLRIDNTFWRMFSKVMFESKVKKRIMLYDVKFL SLVGYIGFKNWFIEQLRSAELHEVPWIVNAEGDLVEIKSIKIYLQLIEQSLFLRITVLN YTDMAHALTRLIRKKLMCDNALLTPIPSPMVNLTQVIDPTEQLAYFPKITFERLKNYDT SSNYAKGKLTRNYMILLPWQHVNRYNFVFSSTGCKVSLKTCIGKLMKDLNPKVLYFIGE GAGNWMARTACEYPDIKFVYRSLKDDLDHHYPLEYQRVIGELSRIIDSGEGLSMETTDA TQKTHWDLIHRVSKDALLITLCDAEFKDRDDFFKMVILWRKHVLSCRICTTYGTDLYLF AKYHAKDCNVKLPFFVRSVATFIMQGSKLSGSECYILLTLGHHNNLPCHGEIQNSKMKI AVCNDFYAAKKLDNKSIEANCKSLLSGLRIPINKKELNRQRRLLTLQSNHSSVATVGGS KVIESKWLTNKANTIIDWLEHILNSPKGELNYDFFEALENTYPNMIKLIDNLGNAEIKK LIKVTGYMLVSKK" ORIGIN 1 gtataaatta gattccaaaa aaatatggga caagtgaaaa tgtctcttca agggattcac 61 ctgagtgatt tatcatacaa gcatgctata ttaaaagagt ctcagtacac aataaaaaga 121 gatgtgggta caacaactgc agtgacaccc tcatcattgc aacaagaaat aacactgttg 181 tgtggagaaa ttctgtatgc taaacatgct gactacaaat atgctgcaga aataggaata 241 caatatatta gcacagcttt aggatcagag agagtgcagc agattctgag gaactcaggc 301 agtgaagtcc aagtggtctt aaccagaacg tactctctgg ggaaaattaa aaacaataaa 361 ggagaagatt tacagatgtt agacatacac ggggtagaga agagctgggt agaagagata 421 gacaaagaag caaggaaaac aatggcaacc ttgcttaagg aatcatcagg taatatccca 481 caaaatcaga ggccctcagc accagacaca cccataatct tattatgtgt aggtgcctta 541 atattcacta aactagcatc aaccatagaa gtgggactag agaccacagt cagaagggct 601 aaccgtgtac taagtgatgc actcaagaga taccctagaa tggacatacc aaagattgcc 661 agatccttct atgacttatt tgaacaaaaa gtgtatcaca gaagtttgtt cattgagtat 721 ggcaaagcat taggctcatc atctacaggc agcaaagcag aaagtctatt tgttaatata 781 ttcatgcaag cttatggggc cggtcaaaca atgctaaggt ggggggtcat tgccaggtca 841 tccaacaata taatgttagg acatgtatcc gtccaagctg agttaaaaca ggtcacagaa 901 gtctatgact tggtgcgaga aatgggccct gaatctggac ttctacattt aaggcaaagc 961 ccaaaagctg gactgttatc actagccaac tgtcccaact ttgcaagtgt tgttctcgga 1021 aatgcctcag gcttaggcat aatcggtatg tatcgaggga gagtaccaaa cacagaatta 1081 ttttcagcag ctgaaagtta tgccaaaagt ttgaaagaaa gcaataaaat aaatttctct 1141 tcattaggac ttacagatga agagaaagag gctgcagaac atttcttaaa tgtgagtgac 1201 gacagtcaaa atgattatga gtaattaaaa aagtgggaca agtcaaaatg tcattccctg 1261 aaggaaaaga tattcttttc atgggtaatg aagcagcaaa attagcagaa gctttccaga 1321 aatcattaag aaaaccaggt cataaaagat ctcaatctat tataggagaa aaagtgaata 1381 ctgtatcaga aacattggaa ttacctacta tcagtagacc tgcaaaacca accataccgt 1441 cagaaccaaa gttagcatgg acagataaag gtggggcaac caaaactgaa ataaagcaag 1501 caatcaaagt catggatccc attgaagaag aagagtctac cgagaagaag gtgctaccct 1561 ccagtgatgg gaaaacccct gcagaaaaga aactgaaacc atcaactaac accaaaaaga 1621 aggtttcatt tacaccaaat gaaccaggga aatatacaaa gttggaaaaa gatgctctag 1681 atttgctctc agataatgaa gaagaagatg cagaatcttc aatcttaacc tttgaagaaa 1741 gagatacttc atcattaagc attgaggcca gattggaatc aatagaggag aaattaagca 1801 tgatattagg gctattaaga acactcaaca ttgctacagc aggacccaca gcagcaagag 1861 atgggatcag agatgcaatg attggcgtaa gagaggaatt aatagcagac ataataaagg 1921 aagctaaagg gaaagcagca gaaatgatgg aagaggaaat gagtcaacga tcaaaaatag 1981 gaaatggtag tgtaaaatta acagaaaaag caaaagagct caacaaaatt gttgaagatg 2041 aaagcacaag tggagaatcc gaagaagaag aagaaccaaa agacacacaa gacaatagtc 2101 aagaagatga catttaccag ttaattatgt agtttaataa aaataaacaa tgggacaagt 2161 aaaaatggag tcctacctag tagacaccta tcaaggcatt ccttacacag cagctgttca 2221 agttgatcta atagaaaagg acctgttacc tgcaagccta acaatatggt tccctttgtt 2281 tcaggccaac acaccaccag cagtgctgct cgatcagcta aaaaccctga caataaccac 2341 tctgtatgct gcatcacaaa atggtccaat actcaaagtg aatgcatcag cccaaggtgc 2401 agcaatgtct gtacttccca aaaaatttga agtcaatgcg actgtagcac tcgatgaata 2461 tagcaaactg gaatttgaca aactcacagt ctgtgaagta aaaacagttt acttaacaac 2521 catgaaacca tacgggatgg tatcaaaatt tgtgagctca gccaaatcag ttggcaaaaa 2581 aacacatgat ctaatcgcac tatgtgattt tatggatcta gaaaagaaca cacctgttac 2641 aataccagca ttcatcaaat cagtttcaat caaagagagt gagtcagcta ctgttgaagc 2701 tgctataagc agtgaagcag accaagctct aacacaggcc aaaattgcac cttatgcggg 2761 attaattatg atcatgacta tgaacaatcc caaaggcata ttcaaaaagc ttggagctgg 2821 gactcaagtc atagtagaac taggagcata tgtccaggct gaaagcataa gcaaaatatg 2881 caagacttgg agccatcaag ggacaagata tgtcttgaag tccagataac aaccaagcac 2941 cttggccaag agctactaac cctatctcat agatcataaa gtcaccattc tagttatata 3001 aaaatcaagt tagaacaaga attaaatcaa tcaagaacgg gacaaataaa aatgtcttgg 3061 aaagtggtga tcattttttc attgttaata acacctcaac acggtcttaa agagagctac 3121 ttagaagagt catgtagcac tataactgaa ggatatctca gtgttctgag gacaggttgg 3181 tacaccaatg tttttacact ggaggtaggc gatgtagaga accttacatg tgccgatgga 3241 cccagcttaa taaaaacaga attagacctg accaaaagtg cactaagaga gctcagaaca 3301 gtttctgctg atcaactggc aagagaggag caaattgaaa atcccagaca atctagattc 3361 gttctaggag caatagcact cggtgttgca actgcagctg cagttacagc aggtgttgca 3421 attgccaaaa ccatccggct tgaaagtgaa gtaacagcaa ttaagaatgc cctcaaaaag 3481 accaatgaag cagtatctac attggggaat ggagttcgtg tgttggcaac tgcagtgaga 3541 gagctgaaag attttgtgag caagaatcta acacgtgcaa tcaacaaaaa caagtgcgac 3601 attgctgacc tgaaaatggc cgttagcttc agtcaattca acagaaggtt cctaaatgtt 3661 gtgcggcaat tttcagacaa cgctggaata acaccagcaa tatctttgga cttaatgaca 3721 gatgctgaac tagccagagc tgtttccaac atgccaacat ctgcaggaca aataaaactg 3781 atgttggaga accgtgcaat ggtaagaaga aaagggttcg gattcctgat aggagtttac 3841 ggaagctccg taatttacat ggtgcaactg ccaatctttg gggttataga cacgccttgc 3901 tggatagtaa aagcagcccc ttcttgttca ggaaaaaagg gaaactatgc ttgcctctta 3961 agagaagacc aaggatggta ttgtcaaaat gcagggtcaa ctgtttacta cccaaatgaa 4021 aaagactgtg aaacaagagg agaccatgtc ttttgcgaca cagcagcagg aatcaatgtt 4081 gctgagcagt caaaggagtg caacataaac atatctacta ctaattaccc atgcaaagtt 4141 agcacaggaa gacatcctat cagtatggtt gcactatctc ctcttggggc tttggttgct 4201 tgctacaagg gagtgagctg ttccattggc agcaacagag tagggatcat caagcaactg 4261 aacaaaggct gctcttatat aaccaaccaa gacgcagaca cagtgacaat agacaacact 4321 gtataccagc taagcaaagt tgaaggcgaa cagcatgtta taaaaggaag gccagtgtca 4381 agcagctttg acccagtcaa gtttcctgaa gatcaattca atgttgcact tgaccaagtt 4441 ttcgagagca ttgagaacag tcaggccttg gtggatcaat caaacagaat cctaagcagt 4501 gcagagaaag gaaacactgg cttcatcatt gtaataattc taattgctgt ccttggctct 4561 accatgatcc tagtgagtgt ttttatcata ataaagaaaa caaagaaacc cacaggagca 4621 cctccagagc tgagtggtgt cacaaacaat ggcttcatac cacataatta gttaattaaa 4681 aataaagtaa attaaaataa attaaaatta aaaataaaaa tttgggacaa atcataatgt 4741 ctcgcaaggc tccgtgcaaa tatgaagtgc ggggcaaatg caatagagga agtgagtgca 4801 agtttaacca caattactgg agttggccag atagatactt attaataaga tcaaattatt 4861 tattaaatca acttttaagg aacactgata gagctgatgg cttatcaata atatcaggag 4921 caggcagaga agataggaca caagattttg tcctaggttc caccaatgtg gttcaaggtt 4981 atattgatga taaccaaagc ataacaaaag ctgcagcctg ttacagtcta cataatataa 5041 tcaaacaact acaagaagtt gaagttaggc aggctagaga taacaaacta tctgacagca 5101 aacatgtagc acttcacaac ttagtcctat cttatatgga gatgagcaaa actcctgcat 5161 ctttaatcaa caatctcaag agactgccga gagagaaact gaaaaaatta gcaaagctca 5221 taattgactt atcagcaggt gctgaaaatg actcttcata tgccttgcaa gacagtgaaa 5281 gcactaatca agtgcagtga gcatggtcca gttttcatta ctatagaggt tgatgacatg 5341 atatggactc acaaggactt aaaagaagct ttatctgatg ggatagtgaa gtctcatact 5401 aacatttaca attgttattt agaaaacata gaaattatat atgtcaaggc ttacttaagt 5461 tagtaaaaac acatcagagt gggataaatg acaatgataa cattagatgt cattaaaagt 5521 gatgggtctt caaaaacatg tactcacctc aaaaaaataa ttaaagacca ctctggtaaa 5581 gtgcttattg tacttaagtt aatattagct ttactaacat ttctcacagt aacaatcacc 5641 atcaattata taaaagtgga aaacaatctg caaatatgcc agtcaaaaac tgaatcagac 5701 aaaaaggact catcatcaaa taccacatca gtcacaacca agactactct aaatcatgat 5761 atcacacagt attttaaaag tttgattcaa aggtatacaa actctgcaat aaacagtgac 5821 acatgctgga aaataaacag aaatcaatgc acaaatataa caacatacaa atttttatgt 5881 tttaaatctg aagacacaaa aaccaacaat tgtgataaac tgacagattt atgcagaaac 5941 aaaccaaaac cagcagttgg agtgtatcac atagtagaat gccattgtat atacacagtt 6001 aaatggaagt gctatcatta cccaaccgat gaaacccaat cctaaatgtt aacaccagat 6061 taggatccat ccaagtctgt tagttcaaca atttagttat ttaaaaatat tttgaaaaca 6121 agtaagtttc tatgatactt cataataata agtaataatt aattgcttaa tcatcatcac 6181 aacattattc gaaaccataa ctattcaatt taaaaagtaa aaaacaataa catgggacaa 6241 gtagttatgg aggtgaaagt ggagaacatt cgaacaatag atatgctcaa agcaagagta 6301 aaaaatcgtg tggcacgcag caaatgcttt aaaaatgcct ctttggtcct cataggaata 6361 actacattga gtattgccct caatatctat ctgatcataa actataaaat gcaaaaaaac 6421 acatctgaat cagaacatca caccagctca tcacccatgg aatccagcag agaaactcca 6481 acggtcccca cagacaactc agacaccaac tcaagcccac agcatccaac tcaacagtcc 6541 acagaaggct ccacactcta ctttgcagcc tcagcaagct caccagagac agaaccaaca 6601 tcaacaccag atacaacaaa ccgcccgccc ttcgtcgaca cacacacaac accaccaagc 6661 gcaagcagaa caaagacaag tccggcagtc cacacaaaaa acaacccaag gacaagctct 6721 agaacacatt ctccaccacg ggcaacgaca aggacggcac gcagaaccac cactctccgc 6781 acaagcagca caagaaagag accgtccaca gcatcagtcc aacctgacat cagcgcaaca 6841 acccacaaaa acgaagaagc aagtccagcg agcccacaaa catctgcaag cacaacaaga 6901 atacaaagga aaagcgtgga ggccaacaca tcaacaacat acaaccaaac tagttaacaa 6961 aaaatacaaa ataactctaa gataaaccat gcagacacca acaatggaga agccaaaaga 7021 caattcacaa tctccccaaa aaggcaacaa caccatatta gctctgccca aatctccctg 7081 gaaaaaacac tcgcccatat accaaaaata ccacaaccac cccaagaaaa aaactgggca 7141 aaacaacacc caagagacaa ataacaatgg atcctctcaa tgaatccact gttaatgtct 7201 atcttcctga ctcatatctt aaaggagtga tttcctttag tgagactaat gcaattggtt 7261 catgtctctt aaaaagacct tacctaaaaa atgacaacac tgcaaaagtt gccatagaga 7321 atcctgttat cgagcatgtt agactcaaaa atgcagtcaa ttctaagatg aaaatatcag 7381 attacaagat agtagagcca gtaaacatgc aacatgaaat tatgaagaat gtacacagtt 7441 gtgagctcac attattaaaa cagtttttaa caaggagtaa aaatattagc actctcaaat 7501 taaatatgat atgtgattgg ctgcagttaa agtctacatc agatgatacc tcaatcctaa 7561 gttttataga tgtagaattt atacctagct gggtaagcaa ttggtttagt aattggtaca 7621 atctcaacaa gttgattctg gaattcagga aagaagaagt aataagaact ggttcaatct 7681 tgtgtaggtc attgggtaaa ttagtttttg ttgtatcatc atatggatgt atagtcaaga 7741 gcaacaaaag caaaagagtg agcttcttca catacaatca actgttaaca tggaaagatg 7801 tgatgttaag tagattcaat gcaaattttt gtatatgggt aagcaacagt ctgaatgaaa 7861 atcaagaagg gctagggttg agaagtaatc tgcaaggcat attaactaat aagctatatg 7921 aaactgtaga ttatatgctt agtttatgtt gcaatgaagg tttctcactt gtgaaagagt 7981 tcgaaggctt tattatgagt gaaattctta ggattactga acatgctcaa ttcagtacta 8041 gatttagaaa tactttatta aatggattaa ctgatcaatt aacaaaatta aaaaataaaa 8101 acagactcag agttcatggt accgtgttag aaaataatga ttatccaatg tacgaagttg 8161 tacttaagtt attaggagat actttgagat gtattaaatt attaatcaat aaaaacttag 8221 agaatgctgc tgaattatac tatatattta gaatattcgg tcacccaatg gtagatgaaa 8281 gagatgcaat ggatgctgtc aaattaaaca atgaaatcac aaaaatcctt aggtgggaga 8341 gcttgacaga actaagaggg gcattcatat taaggattat caaaggattt gtagacaaca 8401 acaaaagatg gcccaaaatt aaaaacttaa aagtgcttag taagagatgg actatgtact 8461 tcaaagcaaa aagttacccc agtcaacttg aattaagcga acaagatttt ttagagcttg 8521 ctgcaataca gtttgaacaa gagttttctg tccctgaaaa aaccaacctt gagatggtat 8581 taaatgataa agctatatca cctcctaaaa gattaatatg gtctgtgtat ccaaaaaatt 8641 acttacctga gaaaataaaa aatcgatatc tagaagagac tttcaatgca agtgatagtc 8701 tcaaaacaag aagagtacta gagtactatt tgaaagataa taaattcgac caaaaagaac 8761 ttaaaagtta tgttgttaaa caagaatatt taaatgataa ggatcatatt gtctcgctaa 8821 ctggaaaaga aagagaatta agtgtaggta gaatgtttgc tatgcaacca ggaaaacagc 8881 gacaaataca aatattggct gaaaaattgt tagctgataa tattgtacct tttttcccag 8941 aaaccttaac aaagtatggt gatctagatc ttcagagaat aatggaaatc aaatcggaac 9001 tttcttctat taaaactaga agaaatgata gttataataa ttacattgca agagcatcca 9061 tagtaacaga tttaagtaag ttcaaccaag cctttaggta tgaaactaca gcgatctgtg 9121 cggatgtagc agatgaacta catggaacac aaagcctatt ctgttggtta catcttatcg 9181 tccctatgac aacaatgata tgtgcctata gacatgcacc accagaaaca aaaggtgaat 9241 atgatataga taagatagaa gagcaaagtg gtttatatag atatcatatg ggtggtattg 9301 aaggatggtg tcaaaaactc tggacaatgg aagctatatc tctattagat gttgtatctg 9361 taaaaacacg atgtcaaatg acatctttat taaacggtga caaccaatca atagatgtaa 9421 gtaaaccagt taagttatct gagggtttag atgaagtgaa agcagattat agcttggctg 9481 taaaaatgtt aaaagaaata agagatgcat acagaaatat aggccataaa cttaaagaag 9541 gggaaacata tatatcaaga gatcttcagt ttataagtaa ggtgattcaa tctgaaggag 9601 taatgcatcc tacccctata aaaaagatct taagagtggg accatggata aacacaatat 9661 tagatgacat taaaaccagt gcagagtcaa tagggagtct atgtcaggaa ttagaattta 9721 ggggggaaag cataatagtt agtctgatat taaggaattt ttggctgtat aatttataca 9781 tgcatgaatc aaagcaacac cccctagcag ggaagcagtt attcaaacaa ctaaataaaa 9841 cattaacatc agtgcagaga ttttttgaaa taaaaaagga aaatgaagta gtagatctat 9901 ggatgaacat accaatgcag tttggaggag gagatccagt agtcttctat agatctttct 9961 atagaaggac ccctgatttt ttaactgaag caatcagtca tgtggatatt ctgttaagaa 10021 tatcagccaa cataagaaat gaagcgaaaa taagtttctt caaagcctta ctgtcaatag 10081 aaaaaaatga acgtgctaca ctgacaacac taatgagaga tcctcaagct gttggctcag 10141 agcgacaagc aaaagtaaca agtgatatca atagaacagc agttaccagc atcttaagtc 10201 tttctccaaa tcaacttttc agcgatagtg ctatacacta cagtagaaat gaagaagagg 10261 tcggaatcat tgctgacaac ataacacctg tttatcctca tggactgaga gttttgtatg 10321 aatcattacc ttttcataaa gctgaaaaag ttgtgaatat gatatcagga acgaaatcca 10381 taaccaactt attacagaga acatctgcta ttaatggtga agatattgac agagctgtat 10441 ccatgatgct ggagaaccta ggattattat ctagaatatt gtcagtagtt gttgatagta 10501 tagaaattcc aaccaaatct aatggtaggc tgatatgttg tcagatatct agaaccctaa 10561 gggagacatc atggaataat atggaaatag ttggagtaac atcccctagc atcactacat 10621 gcatggatgt catatatgca actagctctc atttgaaagg gataatcatt gaaaagttca 10681 gcactgacag aactacaaga ggtcaaagag gtccaaagag cccttgggta gggtcgagca 10741 ctcaagagaa aaaattagtt cctgtttata acagacaaat tctttcaaaa caacaaagag 10801 aacagctaga agcaattgga aaaatgagat gggtatataa agggacacca ggtttaagac 10861 gattactcaa taagatttgt cttggaagtt taggcattag ttacaaatgt gtaaaacctt 10921 tattacctag gtttatgagt gtaaatttcc tacacaggtt atctgtcagt agtagaccta 10981 tggaattccc agcatcagtt ccagcttata gaacaacaaa ttaccatttt gacactagtc 11041 ctattaatca agcactaagt gagagatttg ggaatgaaga tattaatttg gtcttccaaa 11101 atgcaatcag ctgtggaatt agcataatga gtgtagtaga acaattaact ggtaggagtc 11161 caaaacagtt agttttaata cctcaattag aagaaataga cattatgcca ccaccagtgt 11221 ttcaagggaa attcaattat aagctagtag ataagataac ttctgatcaa catatcttca 11281 gtccagacaa aatagatatg ttaacactgg ggaaaatgct catgcccact ataaaaggtc 11341 agaaaacaga tcagttcctg aacaagagag agaattattt ccatgggaat aatcttattg 11401 agtctttgtc agcagcgtta gcatgtcatt ggtgtgggat attaacagag caatgtatag 11461 aaaataatat tttcaagaaa gactggggtg acgggttcat atcggatcat gcttttatgg 11521 acttcaaaat attcctatgt gtctttaaaa ctaaactttt atgtagttgg gggtcccaag 11581 ggaaaaacat taaagatgaa gatatagtag atgaatcaat agataaactg ttaaggattg 11641 ataatacttt ttggagaatg ttcagcaagg ttatgtttga atcaaaggtt aagaaaagga 11701 taatgttata tgatgtaaaa tttctatcat tagtaggtta tatagggttt aagaattggt 11761 ttatagaaca gttgagatca gctgagttgc atgaggtacc ttggattgtc aatgccgaag 11821 gtgatctggt tgagatcaag tcaattaaaa tctatttgca actgatagag caaagtttat 11881 ttttaagaat aactgttttg aactatacag atatggcaca tgctctcaca agattaatca 11941 gaaagaagtt gatgtgtgat aatgcactat taactccgat tccatcccca atggttaatt 12001 taactcaagt tattgatcct acagaacaat tagcttattt ccctaagata acatttgaaa 12061 ggctaaaaaa ttatgacact agttcaaatt atgctaaagg aaagctaaca aggaattaca 12121 tgatactgtt gccatggcaa catgttaata gatataactt tgtctttagt tctactggat 12181 gtaaagttag tctaaaaaca tgcattggaa aacttatgaa agatctaaac cctaaagttc 12241 tgtactttat tggagaaggg gcaggaaatt ggatggccag aacagcatgt gaatatcctg 12301 acatcaaatt tgtatacaga agtttaaaag atgaccttga tcatcattat cctttggaat 12361 accagagagt tataggagaa ttaagcagga taatagatag cggtgaaggg ctttcaatgg 12421 aaacaacaga tgcaactcaa aaaactcatt gggatttgat acacagagta agcaaagatg 12481 ctttattaat aactttatgt gatgcagaat ttaaggacag agatgatttt tttaagatgg 12541 taattctatg gaggaaacat gtattatcat gcagaatttg cactacttat gggacagacc 12601 tctatttatt cgcaaagtat catgctaaag actgcaatgt aaaattacct ttttttgtga 12661 gatcagtagc cacctttatt atgcaaggta gtaaactgtc aggctcagaa tgctacatac 12721 tcttaacact aggccaccac aacaatttac cctgccatgg agaaatacaa aattctaaga 12781 tgaaaatagc agtgtgtaat gatttttatg ctgcaaaaaa acttgacaat aaatctattg 12841 aagccaactg taaatcactt ttatcagggc taagaatacc gataaataag aaagaattaa 12901 atagacagag aaggttatta acactacaaa gcaaccattc ttctgtagca acagttggag 12961 gtagcaaggt catagagtct aaatggttaa caaacaaggc aaacacaata attgattggt 13021 tagaacatat tttaaattct ccaaaaggtg aattaaatta tgattttttt gaagcattag 13081 aaaatactta ccctaatatg attaaactaa tagataatct agggaatgca gagataaaaa 13141 aactgatcaa agtaactgga tatatgcttg taagtaaaaa atgaaaaatg ataaaaatga 13201 taaaataggt gacaacttca tactattcca aagtaatcat ttgattatgc aattatgtaa 13261 tagttaatta aaaactaaaa atcaaaagtt agaaactaac aactgtcatt aagtttatta 13321 aaaataagaa attataattg gatgtatacg // LOCUS NC_038311 7137 bp ss-RNA linear VRL 24-AUG-2018 DEFINITION Human rhinovirus 1 strain ATCC VR-1559, complete genome. ACCESSION NC_038311 VERSION NC_038311.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human rhinovirus A1 (HRV-A1) ORGANISM Human rhinovirus A1 Viruses; Riboviria; Picornavirales; Picornaviridae; Enterovirus; Rhinovirus A. REFERENCE 1 (bases 1 to 7137) AUTHORS Palmenberg,A.C., Spiro,D., Kuzmickas,R., Wang,S., Djikeng,A., Rathe,J.A., Fraser-Liggett,C.M. and Liggett,S.B. TITLE Sequencing and analyses of all known human rhinovirus genomes reveal structure and evolution JOURNAL Science 324 (5923), 55-59 (2009) PUBMED 19213880 REFERENCE 2 (bases 1 to 7137) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (24-AUG-2018) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 7137) AUTHORS Spiro,D., Kuzmickas,R., Palmenberg,A.C., Wang,S., Djikeng,A., Rathe,J.A., Overton,L., Tsitrin,T., Gallagher,T., Liu,J., Pushparaj,V., Tallon,L.J., Fraser-Liggett,C.M. and Liggett,S.B. TITLE Direct Submission JOURNAL Submitted (10-NOV-2008) University of Maryland School of Medicine, 20 Penn St, Baltimore, MD 21201, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to FJ445111. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..7137 /organism="Human rhinovirus A1" /mol_type="genomic RNA" /strain="ATCC VR-1559" /culture_collection="ATCC:VR-1559" /db_xref="taxon:573824" /note="HRV01; hrv-01" gene 627..7100 /locus_tag="D1P36_gp1" /db_xref="GeneID:37616484" CDS 627..7100 /locus_tag="D1P36_gp1" /codon_start=1 /product="polyprotein" /protein_id="YP_009505608.1" /db_xref="GeneID:37616484" /translation="MGAQVSRQNVGTHSTQNSVSNGSSLNYFNINYFKDAASSGASRLD FSQDPSKFTDPVKDVLEKGIPTLQSPSVEACGYSDRIMQITRGDSTITSQDVANAVVGY GVWPHYLTPQDATAIDKPTQPDTSSNRFYTLESKHWNGSSKGWWWKLPDALKDMGIFGE NMYYHFLGRSGYTVHVQCNASKFHQGTLLVAMIPEHQLASAKHGSVTAGYKLTHPGEAG RDVSQERDASLRQPSDDSWLNFDGTLLGNLLIFPHQFINLRSNNSATLIVPYVNAVPMD SMLRHNNWSLVIIPISPLRSETTSSNIVPITVSISPMCAEFSGARAKNIKQGLPVYITP GSGQFMTTDDMQSPCALPWYHPTKEISIPGEVKNLIEMCQVDTLIPVNNVGNNVGNVSM YTVQLGNQTGMAQKVFSIKVDITSQPLATTLIGEIASYYTHWTGSLRFSFMFCGTANTT LKLLLAYTPPGIDEPTTRKDAMLGTHVVWDVGLQSTISLVVPWVSASHFRLTADNKYSM AGYITCWYQTNLVVPPSTPQTADMLCFVSACKDFCLRMARDTDLHIQSGPIEQNPVENY IDEVLNEVLVVPNIKESHHTTSNSAPLLDAAETGHTSNVQPEDAIETRYVITSQTRDEM SIESFLGRSGCVHISRIKVDYTDYNGQDINFTKWKITLQEMAQIRRKFELFTYVRFDSE ITLVPCIAGRGDDIGHIVMQYMYVPPGAPIPSKRNDFSWQSGTNMSIFWQHGQPFPRFS LPFLSIASAYYMFYDGYDGDNTSSKYGSVVTNDMGTICSRIVTEKQKHSVVITTHIYHK AKHTKAWCPRPPRAVPYTHSHVTNYMPETGDVTTAIVRRNTITTAGPSDLYVHVGNLIY RNLHLFNSEMHDSILISYSSDLIIYRTNTIGDDYIPNCNCTEATYYCRHKNRYYPIKVT PHDWYEIQESEYYPKHIQYNLLIGEGPCEPGDCGGKLLCRHGVIGIITAGGEGHVAFID LRQFHCAEEQGITDYIHMLGEAFGNGFVDSVKEQINAINPINNISKKVIKWLLRIISAM VIIIRNSSDPQTIIATLTLIGCNGSPWRFLKEKFCKWTQLTYIHKESDSWLKKFTEMCN AARGLEWIGNKISKFIDWMKSMLPQAQLKVKYLNEIKKLSLLEKQIENLRAADSATQEK IKCEIDTLHDLSCKFLPLYAHEAKRIKVLYNKCSNIIKQRKRSEPVAVMIHGPPGTGKS ITTNFLARMITNESDVYSLPPDPKYFDGYDNQSVVIMDDIMQNPDGEDMTLFCQMVSSV TFIPPMADLPDKGKPFDSRFILCSTNHSLLAPPTISSLPAMNRRFFFDLDIVVHDNYKD TQGKLDVSKAFRPCNVNTKIGNAKCCPFVCGKAVXFKDRSTCSTYTLAQVYNHILEEDK RRRQVVDVMSAIFQGPISLDXPPPPAIXDLLQSVRTPEVIKYCQDNKWVIPAECQVERD LNIANSIIAIIANIISIAGIIFVIYKLFCSLQGPYSGEPKPKTKVPERRVVAQGPEEEF GRSILKNNTCVITTGNGKFTGLGIHDRILIIPTHADPGREVQVNGVHTKVLDSYDLYNR DGVKLEITVIQLDRNEKFRDIRKYIPETEDDYPECNLALSANQDEPTIIKVGDVVSYGN ILLSGNQTARMLKYNYPTKSGYCGGVLYKIGQILGIHVGGNGRDGFSAMLLRSYFTDTQ GQIKVNKHATECGLPTIHTPSKTKLQPSVFYDVFPGSKEPAVLTDNDPRLEVNFKEALF SKYKGNVECNLNEHMEIAIAHYSAQLMTLDIDSRPIALEDSVFGIEGLEALDLNTSAGF PYVTMGIKKRDLINNKTKDISRLKEALDKYGVDLPMITFLKDELRKKEKISTGKTRVIE ASSINDTILFRTTFGNLFSKFHLNPGVVTGSAVGCDPETFWSKIPVMLDGDCIMAFDYT NYDGSIHPVWFQALKKVLENLSFQSNLIDRLCYSKHLFKSTYYEVAGGVPSGCSGTSIF NTMINNIIIRTLVLDAYKNIDLDKLKIIAYGDDVIFSYKYTLDMEAIANEGKKYGLTIT PADKSNEFKKLDYSNVTFLKRGFKQDERHTFLIHPTFPVEEIHESIRWTKKPSQMQEHV LSLCHLMWHNGRKVYEDFSSKIRSVSAGRALYIPPYDLLKHEWYEKF" mat_peptide 627..833 /locus_tag="D1P36_gp1" /product="1A" /note="VP4" /protein_id="YP_009508974.1" mat_peptide 834..1622 /locus_tag="D1P36_gp1" /product="1B" /note="VP2" /protein_id="YP_009508975.1" mat_peptide 1623..2336 /locus_tag="D1P36_gp1" /product="1C" /note="VP3" /protein_id="YP_009508976.1" mat_peptide 2337..3197 /locus_tag="D1P36_gp1" /product="1D" /note="VP1" /protein_id="YP_009508977.1" mat_peptide 3198..3623 /locus_tag="D1P36_gp1" /product="2A" /note="P2-A" /protein_id="YP_009508978.1" mat_peptide 3624..3908 /locus_tag="D1P36_gp1" /product="2B" /note="P2-B" /protein_id="YP_009508979.1" mat_peptide 3909..4874 /locus_tag="D1P36_gp1" /product="2C" /note="P-2C" /protein_id="YP_009508980.1" mat_peptide 4875..5105 /locus_tag="D1P36_gp1" /product="3A" /note="P3-A" /protein_id="YP_009508981.1" mat_peptide 5106..5168 /locus_tag="D1P36_gp1" /product="3B" /note="VPg" /protein_id="YP_009508982.1" mat_peptide 5169..5717 /locus_tag="D1P36_gp1" /product="3C" /note="P3-C" /protein_id="YP_009508983.1" mat_peptide 5718..7097 /locus_tag="D1P36_gp1" /product="3D" /note="polymerase" /protein_id="YP_009508984.1" ORIGIN 1 ttaaaactgg gtgtgggttg ttcccaccca caccacccaa tgggtgttgt actctgttat 61 tccggtaact ttgtacgcca gtttttccct cccctcccca tccttttacg taacttagaa 121 gttttaaata caagaccaat agtaggcaac tctccaggtt gtctaaggtc aagcacttct 181 gtttccccgg ttgatgttga tatgctccaa cagggcaaaa acaacagata ccgttatccg 241 caaagtgcct acacagagct tagtaggatt ctgaaagatc tttggttggt cgttcagctg 301 catacccagc agtagacctt gcagatgagg ctggacattc cccactggta acagtggtcc 361 agcctgcgtg gctgcctgcg cacctctcat gaggtgtgaa gccaaagatc ggacagggtg 421 tgaagagccg cgtgtgctca ctttgagtcc tccggcccct gaatgcggct aaccttaaac 481 ctgcagccat ggctcataag ccaatgagtt tatggtcgta acgagtaatt gcgggatggg 541 accgactact ttgggtgtcc gtgtttcact ttttccttta ttaattgctt atggtgacaa 601 tatatatatt gatatatatt ggcatcatgg gcgcccaggt atctagacaa aatgttggta 661 cacactcaac ccaaaattca gtgtcaaatg gatcaagttt aaattacttt aatataaatt 721 acttcaagga tgctgcctca agtggtgcat ctagattaga tttctctcaa gatccaagca 781 aattcactga cccagttaaa gatgtcttag aaaaggggat cccaacacta caatcaccat 841 ctgttgaggc ttgtggctat tcagacagga ttatgcaaat aaccagagga gattcaacaa 901 tcacatctca agatgtagca aatgctgtgg ttgggtatgg ggtctggccg cattacttaa 961 caccacaaga tgccactgct atagacaaac caacacaacc tgatacatca tcaaatagat 1021 tttatacact agagagtaaa cattggaatg gtagttcaaa aggatggtgg tggaaattac 1081 cagatgctct taaagacatg ggtatttttg gagaaaatat gtattatcat ttcctgggta 1141 gaagtggata tacagttcat gtgcagtgta atgctagtaa attccatcag ggtaccttgt 1201 tagttgcaat gataccagaa caccagctag caagtgcaaa acacggaagt gtgactgctg 1261 gttacaaact cacacaccca ggtgaggctg gcagagatgt aagtcaagaa cgtgatgcaa 1321 gtttaagaca acctagtgat gatagttggc ttaattttga tggcaccctt cttggaaatt 1381 tattaatttt cccacatcaa tttataaacc ttaggagtaa taattctgca actctaatag 1441 taccatatgt aaatgctgtg ccaatggatt caatgcttcg acataataac tggagcctgg 1501 tcatcatacc aattagtcca ttacgtagtg aaactacatc ttctaatata gtgccaatca 1561 ctgtatcaat aagtcccatg tgtgctgaat tttctggtgc aagagcaaaa aacattaaac 1621 aaggattacc tgtatatata actccaggat ctgggcaatt catgactact gatgatatgc 1681 aatcaccttg tgcactaccc tggtaccatc ctactaaaga aatatctatt ccgggtgaag 1741 ttaaaaacct tatagaaatg tgtcaagttg ataccttgat tccagtcaat aatgtgggta 1801 acaatgttgg aaatgtcagt atgtacactg tacaactagg gaaccaaaca ggcatggcac 1861 aaaaagtctt ttcaataaaa gtagacatta catcacagcc tttggctaca actttaattg 1921 gggagattgc aagctattac acccattgga ctggcagtct gcgatttagt tttatgtttt 1981 gtgggactgc aaacacaaca cttaaattat tacttgcata cacaccacct ggtattgatg 2041 aaccaacaac tagaaaggat gcaatgctag ggacacatgt tgtgtgggat gttggattgc 2101 agtctactat atctcttgtt gtaccatggg tgagtgccag ccacttcagg ttaactgcag 2161 ataataaata ctccatggct ggttatatca catgttggta ccaaactaat ttagtagtgc 2221 ccccaagtac gccacagact gctgatatgc tgtgttttgt ttctgcatgt aaagattttt 2281 gtctacgaat ggcaagggat acagatttac acatacaaag tggtccaata gagcaaaatc 2341 cagtagaaaa ctacattgat gaagttttaa atgaagtttt agtagtgccg aatataaaag 2401 aaagtcatca cactacatca aactctgccc cacttttaga tgctgcagag acgggacaca 2461 ccagtaatgt tcaaccagaa gatgctatag agacaaggta tgttataaca tcacaaacaa 2521 gagatgagat gagtatagaa agtttccttg gtagatctgg ttgtgtccac atctcaagaa 2581 taaaggttga ttacactgac tataatggac aggacataaa tttcacaaaa tggaaaatca 2641 cactacagga aatggcacag attaggagaa aatttgaatt gtttacatat gtcaggtttg 2701 actcagaaat aaccttggtg ccttgtattg ctggtagagg agacgacatt ggacatattg 2761 taatgcaata tatgtatgtt cctccaggag ctccaattcc ttcaaaaaga aacgatttct 2821 catggcaatc aggcaccaat atgtcaatat tctggcaaca tggacagcca tttcctagat 2881 tttctttacc atttcttagc attgcatcag cttattatat gttttatgat ggatatgatg 2941 gagacaacac ttcttccaag tatggtagcg tagttactaa tgatatgggt actatatgct 3001 caagaatagt tacagaaaaa cagaaacatt ctgttgtcat cacaacacac atatatcata 3061 aagctaaaca cacaaaagct tggtgtccta ggccccctag agctgtccct tacacacata 3121 gtcatgtgac taattatatg ccagaaacag gtgacgtgac aacagccata gtccgcagaa 3181 acactataac aactgctggg cccagtgatc tatatgtgca tgtaggtaac ttaatatata 3241 gaaacttaca tctgttcaat tctgaaatgc atgattcaat tttgatttca tactcttctg 3301 atttaatcat ataccgcaca aacactatag gtgatgatta tattcccaat tgtaactgca 3361 ctgaggctac ttattattgt agacacaaaa ataggtatta cccaataaaa gttactccac 3421 atgattggta tgaaatacaa gagagtgaat attaccccaa acacatccaa tacaacctat 3481 taattggtga aggaccatgt gaacctggtg attgtggtgg aaaacttctt tgtagacatg 3541 gtgtcattgg cataatcaca gcaggtggtg aaggtcatgt agcatttata gatcttagac 3601 aatttcactg tgctgaggaa caaggcataa ctgattacat acacatgttg ggagaggctt 3661 ttggcaatgg ttttgtagat agtgttaaag aacaaataaa tgcaataaat ccaatcaata 3721 acattagtaa gaaggttatt aagtggctac ttagaataat ctcggctatg gttattataa 3781 tcagaaactc ctctgaccct caaacgatca tagcaacctt gacactaatt ggttgcaatg 3841 gttcaccatg gagatttctc aaagaaaagt tttgcaaatg gacccaatta acttatatcc 3901 acaaagagtc tgattcatgg cttaagaaat tcactgaaat gtgtaatgct gcacgtggtc 3961 ttgaatggat tggtaataaa atttcaaaat ttatagattg gatgaaatct atgctacccc 4021 aggcccaatt gaaagttaaa tacttgaatg aaataaagaa actcagtttg cttgaaaaac 4081 agattgaaaa tctacgtgcg gcagatagtg caacacaaga gaaaatcaaa tgtgaaattg 4141 acaccctaca tgatctatcg tgcaaatttc ttcctttgta tgcacatgag gcaaaaagaa 4201 tcaaagtgct ttataataaa tgttccaata taattaaaca aagaaagaga agtgaaccgg 4261 tggcggtgat gatacatgga ccacccggta ctggtaaatc tataacaact aacttcttgg 4321 ctagaatgat aacaaatgaa agtgatgtgt actcattacc tccagatccc aaatattttg 4381 atggttatga caatcagagt gttgtaatca tggatgatat tatgcaaaat ccagatggag 4441 aagacatgac actattttgc caaatggttt caagtgttac atttatacca cccatggctg 4501 atttgcctga caagggtaaa ccatttgatt caagatttat cttatgtagt actaaccact 4561 cgcttttagc cccacctact atatcttcat tacccgcaat gaatagaaga tttttctttg 4621 acttagatat tgtagttcat gacaattata aagatacaca agggaaatta gatgtatcca 4681 aagcttttcg accttgtaat gttaacacca aaattggcaa tgcaaaatgt tgtccatttg 4741 tatgtggtaa ggcagtgwca ttcaaagatc gcagcacttg ctcaacatac accttagctc 4801 aagtttacaa tcacattttg gaagaagaca aaagaaggag acaggtggtg gatgtcatgt 4861 ctgcaatttt ccaaggacca atttctttag acgytccacc accaccagct atagyagatc 4921 tgttacaatc agttagaaca cctgaggtaa tcaagtactg tcaagataat aaatgggtca 4981 ttccagcaga gtgccaagtg gaaagagact taaatatagc caatagcata atagctatta 5041 tagcaaatat aataagtata gctggcatta tatttgtaat ttataaattg ttttgttcat 5101 tacaaggacc atactcaggt gaacctaaac ctaaaaccaa agtacctgaa agaagagtag 5161 ttgctcaagg tccagaagaa gaatttggaa ggtcaattct caaaaacaat acttgtgtga 5221 ttactacagg taatggaaaa tttacaggtc ttggtataca tgacagaatt ctaatcatcc 5281 caacacatgc tgatccaggt agagaggtcc aagttaatgg tgtccacact aaggttctag 5341 actcatatga tctttataat agagatggag ttaaacttga aataacggtc atacaattag 5401 atagaaatga aaaatttagg gacattagaa agtatatacc tgaaacagaa gacgattatc 5461 cagaatgcaa tttggcactt tcagctaatc aagatgaacc aactataatt aaagtaggag 5521 atgtagtgtc ctatggcaat attttgctta gtggaaatca aacagccaga atgcttaaat 5581 ataattaccc cacaaaatca gggtattgtg gaggggtact atataaaatt ggtcaaattc 5641 taggtattca tgtgggtgga aatggaaggg atggtttttc agctatgtta cttagatcat 5701 actttacaga tactcagggc caaattaaag tcaataagca tgctactgaa tgtggtcttc 5761 caactataca cactcctagc aaaaccaaac ttcagcctag tgtattttat gatgtcttcc 5821 caggctctaa ggaaccagct gtgctcacag ataatgaccc tagattggaa gttaatttta 5881 aagaagcttt attttctaaa tataaaggta atgtggaatg taatttgaat gaacatatgg 5941 aaattgctat tgcccattac tcagcacaat taatgacact agatattgat tccaggccaa 6001 tagcattgga agatagcgtg tttgggatag aaggacttga ggctttagat ctaaacacca 6061 gtgcagggtt tccttatgtc acaatgggta ttaaaaagag agatttaata aataacaaaa 6121 caaaagatat atccaggctt aaagaggctc tagataaata tggagttgac ttacccatga 6181 tcactttctt aaaagacgag cttaggaaaa aggagaaaat ttcaacaggt aaaactagag 6241 ttatagaagc aagtagtata aatgacacaa tattatttag aactactttt ggcaatttgt 6301 tctctaagtt tcatttaaac ccaggcgttg ttactggctc tgcagtaggg tgtgaccctg 6361 agactttctg gtctaaaatc ccagttatgc ttgatggaga ttgtataatg gcttttgact 6421 atacaaatta tgatggtagt atacaccctg tctggtttca agctctgaaa aaagttcttg 6481 aaaatttatc tttccaatct aatttaattg atagattgtg ttattccaag cacttgttta 6541 aatcaacata ttatgaagtg gcaggtggag ttccttctgg gtgttctgga accagtatat 6601 tcaatactat gattaataac attataataa gaacactagt tctagatgca tacaaaaata 6661 ttgatctgga caagcttaaa ataattgcat atggtgatga tgtgattttc tcttataaat 6721 atactctaga tatggaagct attgctaatg aaggaaagaa atatggactt acaataacac 6781 cagcagataa gtccaatgaa ttcaagaaac ttgattatag taatgtgact tttcttaaac 6841 gtggttttaa gcaagatgaa agacatacat tccttattca tcctacattc ccagtggaag 6901 agatacatga atcaattaga tggaccaaga aaccttcaca gatgcaagaa catgtgctat 6961 cattatgtca cctgatgtgg cacaatggac gtaaggtgta tgaagatttc tctagtaaga 7021 tacgcagtgt cagcgctgga cgtgcactgt atatcccacc ttatgatctg ttgaaacatg 7081 aatggtatga aaaattttag atatagaaat aatgaatgaa tgattcttta attctat // LOCUS NC_004718 29751 bp ss-RNA linear VRL 13-AUG-2018 DEFINITION SARS coronavirus, complete genome. ACCESSION NC_004718 VERSION NC_004718.3 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Severe acute respiratory syndrome-related coronavirus ORGANISM Severe acute respiratory syndrome-related coronavirus Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Sarbecovirus. REFERENCE 1 (bases 1 to 29751) AUTHORS He,R., Dobie,F., Ballantine,M., Leeson,A., Li,Y., Bastien,N., Cutts,T., Andonov,A., Cao,J., Booth,T.F., Plummer,F.A., Tyler,S., Baker,L. and Li,X. CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease Control and National Microbiology Laboratory Canada TITLE Analysis of multimerization of the SARS coronavirus nucleocapsid protein JOURNAL Biochem. Biophys. Res. Commun. 316 (2), 476-483 (2004) PUBMED 15020242 REFERENCE 2 (bases 1 to 29751) AUTHORS Snijder,E.J., Bredenbeek,P.J., Dobbe,J.C., Thiel,V., Ziebuhr,J., Poon,L.L., Guan,Y., Rozanov,M., Spaan,W.J. and Gorbalenya,A.E. TITLE Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage JOURNAL J. Mol. Biol. 331 (5), 991-1004 (2003) PUBMED 12927536 REFERENCE 3 (bases 1 to 29751) AUTHORS Marra,M.A., Jones,S.J., Astell,C.R., Holt,R.A., Brooks-Wilson,A., Butterfield,Y.S., Khattra,J., Asano,J.K., Barber,S.A., Chan,S.Y., Cloutier,A., Coughlin,S.M., Freeman,D., Girn,N., Griffith,O.L., Leach,S.R., Mayo,M., McDonald,H., Montgomery,S.B., Pandoh,P.K., Petrescu,A.S., Robertson,A.G., Schein,J.E., Siddiqui,A., Smailus,D.E., Stott,J.M., Yang,G.S., Plummer,F., Andonov,A., Artsob,H., Bastien,N., Bernard,K., Booth,T.F., Bowness,D., Czub,M., Drebot,M., Fernando,L., Flick,R., Garbutt,M., Gray,M., Grolla,A., Jones,S., Feldmann,H., Meyers,A., Kabani,A., Li,Y., Normand,S., Stroher,U., Tipples,G.A., Tyler,S., Vogrig,R., Ward,D., Watson,B., Brunham,R.C., Krajden,M., Petric,M., Skowronski,D.M., Upton,C. and Roper,R.L. TITLE The Genome sequence of the SARS-associated coronavirus JOURNAL Science 300 (5624), 1399-1404 (2003) PUBMED 12730501 REFERENCE 4 (bases 1 to 29751) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (31-AUG-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 5 (bases 1 to 29751) CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease Control and National Microbiology Laboratory Canada TITLE Direct Submission JOURNAL Submitted (30-APR-2003) Genome Sciences Centre, British Columbia Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z 4E6, Canada REMARK Sequence update by submitter REFERENCE 6 (bases 1 to 29751) CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease Control and National Microbiology Laboratory Canada TITLE Direct Submission JOURNAL Submitted (23-APR-2003) Genome Sciences Centre, British Columbia Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z 4E6, Canada REMARK Sequence update by submitter REFERENCE 7 (bases 1 to 29751) CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease Control and National Microbiology Laboratory Canada TITLE Direct Submission JOURNAL Submitted (13-APR-2003) Genome Sciences Centre, British Columbia Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z 4E6, Canada COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The reference sequence was derived from AY274119. On or before Mar 28, 2016 this sequence version replaced NC_028858.1, NC_028866.1, NC_028884.1, NC_028893.1, NC_028873.1, NC_028845.1, NC_009696.1, NC_009695.1, NC_013664.1, NC_009693.1, NC_009694.1, NC_004718.2. The annotation in based mainly on the sequence analysis described by Snijder et al. (2003). Annotation of transcription regulatory sequences was copied from virtually identical (except the very 3' end) AY291315 (Frankfurt 1). Designations of the 3'-adjacent genes do not coincide with those provided by Marra et al. (2003). COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..29751 /organism="Severe acute respiratory syndrome-related coronavirus" /mol_type="genomic RNA" /isolate="Tor2" /isolation_source="patient #2 with severe acute respiratory syndrome (SARS)" /db_xref="taxon:694009" /country="Canada: Toronto" 5'UTR 1..264 /inference="non-experimental evidence, no additional details recorded" misc_feature 67..72 /note="transcription regulatory sequence leader TRS" gene 265..21485 /gene="orf1ab" /locus_tag="sars1" /db_xref="GeneID:1489680" CDS join(265..13398,13398..21485) /gene="orf1ab" /locus_tag="sars1" /ribosomal_slippage="" /note="It was assumed that the SARS orf1ab polyprotein processing map should be similar to that of murine hepatitis virus; however, of the two MHV papain-like proteinases, only PL2-PRO is well conserved for SARS coronavirus. The mature peptides located downstream from nsp4-pp1a/pp1ab are cleaved from the polyprotein by the nsp5-pp1a/pp1ab proteinase 3CL-PRO. The orf1a/orf1b translational frameshift, the predicted processing map, and both proteinase activities have been supported by in vitro expression and mutagenesis experiments (Thiel et al., 2003); -1 frameshift" /codon_start=1 /product="orf1ab polyprotein (pp1ab)" /protein_id="NP_828849.2" /db_xref="GeneID:1489680" /translation="MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEAREH LKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGITL GVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQNWNT KHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQLDYI ESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPLNSKV KVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFLKAT CEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSNIETRLRK GGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRER VNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGNYKVTKGKPV KGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQ SLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKLRPIFEWIEAKL SAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVVNKALEMCIDQVT IAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTS EEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQYCALSPGLLATNNV FRLKGGAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKCSVYTVESGTEVTE FACVVAEAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDE EEEDDAECEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQS EIEPEPEPTPEEPVNQFTGYLKLTDNVAIKCVDIVKEAQSANPMVIVNAANIHLKHGGG VAGALNKATNGAMQKESDDYIKLNGPLTVGGSCLLSGHNLAKKCLHVVGPNLNAGEDIQ LLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVNDKALYEQVVM DYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDVKPKIKACIDEVTTTLEETKFL TNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYMVGDVITSGDITCVVIPSKKAGG TTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEI LGTVSWNLREMLAHAEETRKLMPICMDVRAIMATIQRKYKGIKIQEGIVDYGVRFFFYT SKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLEEAARCMRSLKAPAVVSVSSPDAVTT YNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSGQRTELGVEFLKRGDKIVYHTLESPVE FHLDGEVLSLDKLKSLLSLREVKTIKVFTTVDNTNLHTQLVDMSMTYGQQFGPTYLDGA DVTKIKPHVNHEGKTFFVLPSDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQV GGLTSIKWADNNCYLSSVLLALQQLEVKFNAPALQEAYYRARAGDAANFCALILAYSNK TVGELGDVRETMTHLLQHANLESAKRVLNVVCKHCGQKTTTLTGVEAVMYMGTLSYDNL KTGVSIPCVCGRDATQYLVQQESSFVMMSAPPAEYKLQQGTFLCANEYTGNYQCGHYTH ITAKETLYRIDGAHLTKMSEYKGPVTDVFYKETSYTTTIKPVSYKLDGVTYTEIEPKLD GYYKKDNAYYTEQPIDLVPTQPLPNASFDNFKLTCSNTKFADDLNQMTGFTKPASRELS VTFFPDLNGDVVAIDYRHYSASFKKGAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWS TKPVDTSNSFEVLAVEDTQGMDNLACESQQPTSEEVVENPTIQKEVIECDVKTTEVVGN VILKPSDEGVKVTQELGHEDLMAAYVENTSITIKKPNELSLALGLKTIATHGIAAINSV PWSKILAYVKPFLGQAAITTSNCAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRA SLPTTIAKNSVKSVAKLCLDAGINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFG VLLSNFGAPSYCNGVRELYLNSSNVTTMDFCEGSFPCSICLSGLDSLDSYPALETIQVT ISSYKLDLTILGLAAEWVLAYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSWLMWFII SIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVN GMKRSFYVYANGGRGFCKTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQS SYIVDSVAVKNGALHLYFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGK SKCDESASKSASVYYSQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVP MEKLKALVATAHSELAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDLE VTGDSCNNFMLTYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSE QLRKQIRSAAKKNNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVL AALVCYIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQR GGSYKNDKSCPVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPS KLIEYSDFATSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDG SIIQFPNTYLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVF CGVDAMNLIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHV VAANALLFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIV PFWITAIYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRS ETLLPLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQ TSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDML NPNYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPG QTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELP TGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTL NDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTI LEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFFVYE NAFLPFTLGIMAIAACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWL ELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKVYYGNA LDQAISMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCIMLVYCF LGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDAFKLNIKL LGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKD TTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPSYAAYATAQE AYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDK RAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYGTY KNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPLIVTALRANSAVKL QNNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPK SDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNA TEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANM DQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLRNTVCTVCGM WKGYGCSCDQLREPLMQSADASTFLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNEKV AGFAKFLKTNCCRFQEKDEEGNLLDSYFVVKRHTMSNYQHEETIYNLVKDCPAVAVHDF FKFRVDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKK DWYDFVENPDILRVYANLGERVRQSLLKTVQFCDAMRDAGIVGVLTLDNQDLNGNWYDF GDFVQVAPGCGVPIVDSYYSLLMPILTLTRALAAESHMDADLAKPLIKWDLLKYDFTEE RLCLFDRYFKYWDQTYHPNCINCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDG VPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCF SVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDY YRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARL YYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQ KLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVETPHLMGWDYPKCDRAMPNMLRI MASLVLARKHNTCCNLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYAN SVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDHEFVDEFYAYLR KHFSMMILSDDAVVCYNSNYAAQGLVASIKNFKAVLYYQNNVFMSEAKCWTETDLTKGP HEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYP LTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTP HTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPG CDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCD WTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKP RPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHT VMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSH FAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVN STLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAP RTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDK SAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKIL GLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKIGILCIMSDRDLYDKL QFTSLEIPRRNVATLQAENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGLCVDIP GIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVG TNLPLQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNV VRIKIVQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTS SDTYACWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRC LAVHECFVKRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAI KCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPANAI VCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHG KQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFD TYNLWNTFTRLQSLENVAYNVVNKGHFDGHAGEAPVSIINNAVYTKVDGIDVEIFENKT TLPVNVAFELWAKRNIKPVPEIKILNNLGVDIAANTVIWDYKREAPAHVSTIGVCTMTD IAKKPTESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGV TLIGESVKTQFNYFKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQR YKLEGYAFEHIVYGDFSHGQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDA QTGSSKCVCSVIDLLLDDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVETFY PKLQASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLN TLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGD CATVHTANKWDLIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLALGGSIAVKITE HSWNADLYKLMGHFSWWTAFVTNVNASSSEAFLIGANYLGKPKEQIDGYTMHANYIFWR NTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRLIIRENNRVVVS SDILVNN" mat_peptide 265..804 /gene="orf1ab" /locus_tag="sars1" /product="leader protein" /experiment="experimental evidence, no additional details recorded" /note="PL2-PRO cleavage product; nsp1-pp1a/pp1ab" /protein_id="NP_828860.2" mat_peptide 805..2718 /gene="orf1ab" /locus_tag="sars1" /product="counterpart of MHV p65" /experiment="experimental evidence, no additional details recorded" /note="PL2-PRO cleavage product; nsp2-pp1a/pp1ab" /protein_id="NP_828861.2" mat_peptide 2719..8484 /gene="orf1ab" /locus_tag="sars1" /product="nsp3-pp1a/pp1ab" /note="PL2-PRO cleavage product; former nsp1; conserved domains are: N-terminal acidic (Ac), predicted phosphoesterase (similar to the Appr-1'-p; processing enzyme) formerly known as 'X-domain', papain-like proteinase similar to that of MHV PL2-PRO, Y-domain; transmembrane domain 1 (TM1); adenosine diphosphate-ribose 1''-phosphatase (ADPR)" /protein_id="NP_828862.2" mat_peptide 8485..9984 /gene="orf1ab" /locus_tag="sars1" /product="nsp4-pp1a/pp1ab" /experiment="experimental evidence, no additional details recorded" /note="cleaved from polyprotein by the PL2-PRO at the N-terminus and by 3CL-PRO at the C-terminus; contains transmembrane domain 2 (TM2)" /protein_id="NP_904322.1" mat_peptide 9985..10902 /gene="orf1ab" /locus_tag="sars1" /product="3C-like proteinase" /experiment="experimental evidence, no additional details recorded" /note="mediates cleavages downstream from nsp4-pp1a/pp1ab. 3D structure has been determined (Yang et al., 2003); main proteinase (Mpro); nsp5-pp1a/pp1ab (3CL-PRO)" /protein_id="NP_828863.1" mat_peptide 10903..11772 /gene="orf1ab" /locus_tag="sars1" /product="nsp6-pp1a/pp1ab (TM3)" /note="putative transmembrane domain" /protein_id="NP_828864.1" mat_peptide 11773..12021 /gene="orf1ab" /locus_tag="sars1" /product="nsp7-pp1a/pp1ab" /inference="non-experimental evidence, no additional details recorded" /protein_id="NP_828865.1" mat_peptide 12022..12615 /gene="orf1ab" /locus_tag="sars1" /product="nsp8-pp1a/pp1ab" /inference="non-experimental evidence, no additional details recorded" /protein_id="NP_828866.1" mat_peptide 12616..12954 /gene="orf1ab" /locus_tag="sars1" /product="nsp9-pp1a/pp1ab" /experiment="experimental evidence, no additional details recorded" /note="ssRNA-binding protein" /protein_id="NP_828867.1" mat_peptide 12955..13371 /gene="orf1ab" /locus_tag="sars1" /product="formerly known as growth-factor-like protein (GFL)" /inference="non-experimental evidence, no additional details recorded" /note="nsp10-pp1a/pp1ab" /protein_id="NP_828868.1" mat_peptide join(13372..13398,13398..16166) /gene="orf1ab" /locus_tag="sars1" /product="RNA-dependent RNA polymerase" /inference="non-experimental evidence, no additional details recorded" /note="nsp12-pp1ab (RdRp)" /protein_id="NP_828869.1" mat_peptide 16167..17969 /gene="orf1ab" /locus_tag="sars1" /product="nsp13-pp1ab (ZD, NTPase/HEL; RNA 5'-triphosphatase)" /experiment="experimental evidence, no additional details recorded" /note="zinc-binding domain (ZD), NTPase/helicase domain. RNA-stimulated ATPase and dsDNA helicase activities have been confirmed (Thiel et al., 2003)" /protein_id="NP_828870.1" mat_peptide 17970..19550 /gene="orf1ab" /locus_tag="sars1" /product="3'-to-5' exonuclease" /inference="non-experimental evidence, no additional details recorded" /note="nsp14-pp1ab (nuclease ExoN homolog)" /protein_id="NP_828871.1" mat_peptide 19551..20588 /gene="orf1ab" /locus_tag="sars1" /product="endoRNAse" /experiment="experimental evidence, no additional details recorded" /note="the C-terminal domain is a homolog of endoRNase XendoU and is conserved through the order Nidovirales; nsp15-pp1ab; uridylate-specific endoribonuclease NendoU" /protein_id="NP_828872.1" mat_peptide 20589..21482 /gene="orf1ab" /locus_tag="sars1" /product="2'-O-ribose methyltransferase (2'-o-MT)" /inference="non-experimental evidence, no additional details recorded" /note="nsp16-pp1ab" /protein_id="NP_828873.2" CDS 265..13413 /gene="orf1ab" /locus_tag="sars1" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="orf1a polyprotein (pp1a)" /protein_id="NP_828850.1" /db_xref="GeneID:1489680" /translation="MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEAREH LKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGITL GVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQNWNT KHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQLDYI ESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPLNSKV KVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFLKAT CEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSNIETRLRK GGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRER VNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGNYKVTKGKPV KGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQ SLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKLRPIFEWIEAKL SAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVVNKALEMCIDQVT IAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTS EEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQYCALSPGLLATNNV FRLKGGAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKCSVYTVESGTEVTE FACVVAEAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDE EEEDDAECEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQS EIEPEPEPTPEEPVNQFTGYLKLTDNVAIKCVDIVKEAQSANPMVIVNAANIHLKHGGG VAGALNKATNGAMQKESDDYIKLNGPLTVGGSCLLSGHNLAKKCLHVVGPNLNAGEDIQ LLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVNDKALYEQVVM DYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDVKPKIKACIDEVTTTLEETKFL TNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYMVGDVITSGDITCVVIPSKKAGG TTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEI LGTVSWNLREMLAHAEETRKLMPICMDVRAIMATIQRKYKGIKIQEGIVDYGVRFFFYT SKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLEEAARCMRSLKAPAVVSVSSPDAVTT YNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSGQRTELGVEFLKRGDKIVYHTLESPVE FHLDGEVLSLDKLKSLLSLREVKTIKVFTTVDNTNLHTQLVDMSMTYGQQFGPTYLDGA DVTKIKPHVNHEGKTFFVLPSDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQV GGLTSIKWADNNCYLSSVLLALQQLEVKFNAPALQEAYYRARAGDAANFCALILAYSNK TVGELGDVRETMTHLLQHANLESAKRVLNVVCKHCGQKTTTLTGVEAVMYMGTLSYDNL KTGVSIPCVCGRDATQYLVQQESSFVMMSAPPAEYKLQQGTFLCANEYTGNYQCGHYTH ITAKETLYRIDGAHLTKMSEYKGPVTDVFYKETSYTTTIKPVSYKLDGVTYTEIEPKLD GYYKKDNAYYTEQPIDLVPTQPLPNASFDNFKLTCSNTKFADDLNQMTGFTKPASRELS VTFFPDLNGDVVAIDYRHYSASFKKGAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWS TKPVDTSNSFEVLAVEDTQGMDNLACESQQPTSEEVVENPTIQKEVIECDVKTTEVVGN VILKPSDEGVKVTQELGHEDLMAAYVENTSITIKKPNELSLALGLKTIATHGIAAINSV PWSKILAYVKPFLGQAAITTSNCAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRA SLPTTIAKNSVKSVAKLCLDAGINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFG VLLSNFGAPSYCNGVRELYLNSSNVTTMDFCEGSFPCSICLSGLDSLDSYPALETIQVT ISSYKLDLTILGLAAEWVLAYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSWLMWFII SIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVN GMKRSFYVYANGGRGFCKTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQS SYIVDSVAVKNGALHLYFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGK SKCDESASKSASVYYSQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVP MEKLKALVATAHSELAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDLE VTGDSCNNFMLTYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSE QLRKQIRSAAKKNNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVL AALVCYIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQR GGSYKNDKSCPVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPS KLIEYSDFATSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDG SIIQFPNTYLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVF CGVDAMNLIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHV VAANALLFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIV PFWITAIYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRS ETLLPLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQ TSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDML NPNYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPG QTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELP TGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTL NDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTI LEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFFVYE NAFLPFTLGIMAIAACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWL ELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKVYYGNA LDQAISMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCIMLVYCF LGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDAFKLNIKL LGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKD TTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPSYAAYATAQE AYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDK RAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYGTY KNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPLIVTALRANSAVKL QNNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPK SDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNA TEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANM DQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLRNTVCTVCGM WKGYGCSCDQLREPLMQSADASTFLNGFAV" mat_peptide 13372..13410 /gene="orf1ab" /locus_tag="sars1" /product="nsp11-pp1a" /note="putative C-terminal cleavage product of pp1a" /protein_id="NP_904321.1" misc_feature 13392..13472 /gene="orf1ab" /locus_tag="sars1" /note="Region: potential ribosome slippery sequence followed by stimulatory RNA pseudoknot" misc_feature 21486..21491 /note="transcription regulatory sequence for mRNA2" gene 21492..25259 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /db_xref="GeneID:1489668" CDS 21492..25259 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /experiment="experimental evidence, no additional details recorded" /note="As established by Krokhin et al. (2003), the glycosylated spike protein (as well as the nucleocapsid protein) can be detected in infected cell culture supernatants with antisera from SARS patients; spike glycoprotein" /codon_start=1 /product="E2 glycoprotein precursor" /protein_id="NP_828851.1" /db_xref="GeneID:1489668" /translation="MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDE IFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRGWVFG STMNNKSQSVIIINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEY ISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLKPIFKL PLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAVDCSQN PLAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVRFPNITNLCPFGEVFNATKFPSVYAW ERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAP GQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNV PFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPATVCGPKLST DLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSEILDISPC AFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGC LIGAEHVDTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTI AIPTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQLNRALSGIA AEQDRNTREVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKVTLAD AGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTDDMIAAYTAALVSGTATAGWTFG AGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQIQESLTTTSTALGKL QDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTY VTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVT YVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGN CDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEID RLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWLGFIAGLIAIVMVTILLCCMTSCCSC LKGACSCGSCCKFDEDDSEPVLKGVKLHYT" misc_feature 21843..21845 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /experiment="experimental evidence, no additional details recorded" /note="second glycosylation site" misc_feature 21846..21848 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /experiment="experimental evidence, no additional details recorded" /note="first glycosylation site" misc_feature 22170..22172 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /experiment="experimental evidence, no additional details recorded" /note="glycosylation site" misc_feature 22296..22298 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /experiment="experimental evidence, no additional details recorded" /note="glycosylation site" misc_feature 23838..23840 /gene="S" /locus_tag="sars2" /gene_synonym="E2" /experiment="experimental evidence, no additional details recorded" /note="glycosylation site" misc_feature 25260..25265 /note="transcription regulatory sequence for mRNA3" gene 25268..26092 /locus_tag="sars3a" /db_xref="GeneID:1489669" CDS 25268..26092 /locus_tag="sars3a" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars3a" /protein_id="NP_828852.2" /db_xref="GeneID:1489669" /translation="MDLFMRFFTLRSITAQPVKIDNASPASTVHATATIPLQASLPFGW LVIGVAFLAVFQSATKIIALNKRWQLALYKGFQFICNLLLLFVTIYSHLLLVAAGMEAQ FLYLYALIYFLQCINACRIIMRCWLCWKCKSKNPLLYDANYFVCWHTHNYDYCIPYNSV TDTIVVTEGDGISTPKLKEDYQIGGYSEDRHSGVKDYVVVHGYFTEVYYQLESTQITTD TGIENATFFIFNKLVKDPPNVQIHTIDGSSGVANPAMDPIYDEPTTTTSVPL" gene 25689..26153 /locus_tag="sars3b" /db_xref="GeneID:1489670" CDS 25689..26153 /locus_tag="sars3b" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars3b" /protein_id="NP_828853.1" /db_xref="GeneID:1489670" /translation="MMPTTLFAGTHITMTTVYHITVSQIQLSLLKVTAFQHQNSKKTTK LVVILRIGTQVLKTMSLYMAISPKFTTSLSLHKLLQTLVLKMLHSSSLTSLLKTHRMCK YTQSTALQELLIQQWIQFMMSRRRLLACLCKHKKVSTNLCTHSFRKKQVR" misc_feature 26109..26114 /locus_tag="sars3b" /note="transcription regulatory sequence for mRNA4" gene 26117..26347 /gene="E" /locus_tag="sars4" /db_xref="GeneID:1489671" CDS 26117..26347 /gene="E" /locus_tag="sars4" /experiment="experimental evidence, no additional details recorded" /note="E. coli expression reported by Shen et al. (2003); protein sM; small envelope protein" /codon_start=1 /product="protein E" /protein_id="NP_828854.1" /db_xref="GeneID:1489671" /translation="MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN IVNVSLVKPTVYVYSRVKNLNSSEGVPDLLV" misc_feature 26348..26353 /note="transcription regulatory sequence for mRNA5" gene 26398..27063 /gene="M" /locus_tag="sars5" /db_xref="GeneID:1489672" CDS 26398..27063 /gene="M" /locus_tag="sars5" /note="E. coli expression reported by Zhang et al. (2003)" /codon_start=1 /product="matrix protein" /protein_id="NP_828855.1" /db_xref="GeneID:1489672" /translation="MADNGTITVEELKQLLEQWNLVIGFLFLAWIMLLQFAYSNRNRFL YIIKLVFLWLLWPVTLACFVLAAVYRINWVTGGIAIAMACIVGLMWLSYFVASFRLFAR TRSMWSFNPETNILLNVPLRGTIVTRPLMESELVIGAVIIRGHLRMAGHSLGRCDIKDL PKEITVATSRTLSYYKLGASQRVGTDSGFAAYNRYRIGNYKLNTDHAGSNDNIALLVQ" gene 26913..27265 /locus_tag="sars6" /db_xref="GeneID:1489673" misc_feature 26913..26918 /locus_tag="sars6" /note="transcription regulatory sequence for mRNA6" CDS 27074..27265 /locus_tag="sars6" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars6" /protein_id="NP_828856.1" /db_xref="GeneID:1489673" /translation="MFHLVDFQVTIAEILIIIMRTFRIAIWNLDVIISSIVRQLFKPLT KKNYSELDDEEPMELDYP" misc_feature 27267..27272 /note="transcription regulatory sequence for mRNA7" gene 27273..27641 /locus_tag="sars7a" /db_xref="GeneID:1489674" CDS 27273..27641 /locus_tag="sars7a" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars7a" /protein_id="NP_828857.1" /db_xref="GeneID:1489674" /translation="MKIILFLTLIVFTSCELYHYQECVRGTTVLLKEPCPSGTYEGNSP FHPLADNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKLFIRQEEVQQELYSPLFLI VAALVFLILCFTIKRKTE" gene 27638..27772 /locus_tag="sars7b" /db_xref="GeneID:1489675" CDS 27638..27772 /locus_tag="sars7b" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars7b" /protein_id="NP_849175.1" /db_xref="GeneID:1489675" /translation="MNELTLIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV" misc_feature 27773..27778 /note="transcription regulatory sequence for mRNA8" gene 27779..27898 /locus_tag="sars8a" /db_xref="GeneID:1489676" CDS 27779..27898 /locus_tag="sars8a" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars8a" /protein_id="NP_849176.1" /db_xref="GeneID:1489676" /translation="MKLLIVLTCISLCSCICTVVQRCASNKPHVLEDPCKVQH" gene 27864..28118 /locus_tag="sars8b" /db_xref="GeneID:1489677" CDS 27864..28118 /locus_tag="sars8b" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars8b" /protein_id="NP_849177.1" /db_xref="GeneID:1489677" /translation="MCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTMVQTCTPNVT INCQDPAGGALIARCWYLHEGHQTAAFRDVLVVLNKRTN" misc_feature 28106..28111 /locus_tag="sars8b" /note="transcription regulatory sequence for mRNA9" gene 28120..29388 /gene="N" /locus_tag="sars9a" /db_xref="GeneID:1489678" CDS 28120..29388 /gene="N" /locus_tag="sars9a" /experiment="inhibits the activity of cyclin-CDK complex and blocks S phase progression in mammalian cells" /note="As established by Krokhin et al. (2003), the N-terminal methionine is removed, all other methionines are oxidized, and the resulting N-terminal serine is acetylated" /codon_start=1 /product="nucleocapsid protein" /protein_id="NP_828858.1" /db_xref="GeneID:1489678" /translation="MSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQG LPNNTASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKMKE LSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNNAATVLQLP QGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALAL LLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFGRRGPE QTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYHGAIKL DDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPTVTLLPAA DMDDFSRQLQNSMSGASADSTQA" misc_feature 28123..28125 /gene="N" /locus_tag="sars9a" /experiment="experimental evidence, no additional details recorded" /note="acetylation site" gene 28130..28426 /locus_tag="sars9b" /db_xref="GeneID:1489679" CDS 28130..28426 /locus_tag="sars9b" /inference="non-experimental evidence, no additional details recorded" /codon_start=1 /product="hypothetical protein sars9b" /protein_id="NP_828859.1" /db_xref="GeneID:1489679" /translation="MDPNQTNVVPPALHLVDPQIQLTITRMEDAMGQGQNSADPKVYPI ILRLGSQLSLSMARRNLDSLEARAFQSTPIVVQMTKLATTEELPDEFVVVTAK" 3'UTR 29389..29751 ORIGIN 1 atattaggtt tttacctacc caggaaaagc caaccaacct cgatctcttg tagatctgtt 61 ctctaaacga actttaaaat ctgtgtagct gtcgctcggc tgcatgccta gtgcacctac 121 gcagtataaa caataataaa ttttactgtc gttgacaaga aacgagtaac tcgtccctct 181 tctgcagact gcttacggtt tcgtccgtgt tgcagtcgat catcagcata cctaggtttc 241 gtccgggtgt gaccgaaagg taagatggag agccttgttc ttggtgtcaa cgagaaaaca 301 cacgtccaac tcagtttgcc tgtccttcag gttagagacg tgctagtgcg tggcttcggg 361 gactctgtgg aagaggccct atcggaggca cgtgaacacc tcaaaaatgg cacttgtggt 421 ctagtagagc tggaaaaagg cgtactgccc cagcttgaac agccctatgt gttcattaaa 481 cgttctgatg ccttaagcac caatcacggc cacaaggtcg ttgagctggt tgcagaaatg 541 gacggcattc agtacggtcg tagcggtata acactgggag tactcgtgcc acatgtgggc 601 gaaaccccaa ttgcataccg caatgttctt cttcgtaaga acggtaataa gggagccggt 661 ggtcatagct atggcatcga tctaaagtct tatgacttag gtgacgagct tggcactgat 721 cccattgaag attatgaaca aaactggaac actaagcatg gcagtggtgc actccgtgaa 781 ctcactcgtg agctcaatgg aggtgcagtc actcgctatg tcgacaacaa tttctgtggc 841 ccagatgggt accctcttga ttgcatcaaa gattttctcg cacgcgcggg caagtcaatg 901 tgcactcttt ccgaacaact tgattacatc gagtcgaaga gaggtgtcta ctgctgccgt 961 gaccatgagc atgaaattgc ctggttcact gagcgctctg ataagagcta cgagcaccag 1021 acacccttcg aaattaagag tgccaagaaa tttgacactt tcaaagggga atgcccaaag 1081 tttgtgtttc ctcttaactc aaaagtcaaa gtcattcaac cacgtgttga aaagaaaaag 1141 actgagggtt tcatggggcg tatacgctct gtgtaccctg ttgcatctcc acaggagtgt 1201 aacaatatgc acttgtctac cttgatgaaa tgtaatcatt gcgatgaagt ttcatggcag 1261 acgtgcgact ttctgaaagc cacttgtgaa cattgtggca ctgaaaattt agttattgaa 1321 ggacctacta catgtgggta cctacctact aatgctgtag tgaaaatgcc atgtcctgcc 1381 tgtcaagacc cagagattgg acctgagcat agtgttgcag attatcacaa ccactcaaac 1441 attgaaactc gactccgcaa gggaggtagg actagatgtt ttggaggctg tgtgtttgcc 1501 tatgttggct gctataataa gcgtgcctac tgggttcctc gtgctagtgc tgatattggc 1561 tcaggccata ctggcattac tggtgacaat gtggagacct tgaatgagga tctccttgag 1621 atactgagtc gtgaacgtgt taacattaac attgttggcg attttcattt gaatgaagag 1681 gttgccatca ttttggcatc tttctctgct tctacaagtg cctttattga cactataaag 1741 agtcttgatt acaagtcttt caaaaccatt gttgagtcct gcggtaacta taaagttacc 1801 aagggaaagc ccgtaaaagg tgcttggaac attggacaac agagatcagt tttaacacca 1861 ctgtgtggtt ttccctcaca ggctgctggt gttatcagat caatttttgc gcgcacactt 1921 gatgcagcaa accactcaat tcctgatttg caaagagcag ctgtcaccat acttgatggt 1981 atttctgaac agtcattacg tcttgtcgac gccatggttt atacttcaga cctgctcacc 2041 aacagtgtca ttattatggc atatgtaact ggtggtcttg tacaacagac ttctcagtgg 2101 ttgtctaatc ttttgggcac tactgttgaa aaactcaggc ctatctttga atggattgag 2161 gcgaaactta gtgcaggagt tgaatttctc aaggatgctt gggagattct caaatttctc 2221 attacaggtg tttttgacat cgtcaagggt caaatacagg ttgcttcaga taacatcaag 2281 gattgtgtaa aatgcttcat tgatgttgtt aacaaggcac tcgaaatgtg cattgatcaa 2341 gtcactatcg ctggcgcaaa gttgcgatca ctcaacttag gtgaagtctt catcgctcaa 2401 agcaagggac tttaccgtca gtgtatacgt ggcaaggagc agctgcaact actcatgcct 2461 cttaaggcac caaaagaagt aacctttctt gaaggtgatt cacatgacac agtacttacc 2521 tctgaggagg ttgttctcaa gaacggtgaa ctcgaagcac tcgagacgcc cgttgatagc 2581 ttcacaaatg gagctatcgt tggcacacca gtctgtgtaa atggcctcat gctcttagag 2641 attaaggaca aagaacaata ctgcgcattg tctcctggtt tactggctac aaacaatgtc 2701 tttcgcttaa aagggggtgc accaattaaa ggtgtaacct ttggagaaga tactgtttgg 2761 gaagttcaag gttacaagaa tgtgagaatc acatttgagc ttgatgaacg tgttgacaaa 2821 gtgcttaatg aaaagtgctc tgtctacact gttgaatccg gtaccgaagt tactgagttt 2881 gcatgtgttg tagcagaggc tgttgtgaag actttacaac cagtttctga tctccttacc 2941 aacatgggta ttgatcttga tgagtggagt gtagctacat tctacttatt tgatgatgct 3001 ggtgaagaaa acttttcatc acgtatgtat tgttcctttt accctccaga tgaggaagaa 3061 gaggacgatg cagagtgtga ggaagaagaa attgatgaaa cctgtgaaca tgagtacggt 3121 acagaggatg attatcaagg tctccctctg gaatttggtg cctcagctga aacagttcga 3181 gttgaggaag aagaagagga agactggctg gatgatacta ctgagcaatc agagattgag 3241 ccagaaccag aacctacacc tgaagaacca gttaatcagt ttactggtta tttaaaactt 3301 actgacaatg ttgccattaa atgtgttgac atcgttaagg aggcacaaag tgctaatcct 3361 atggtgattg taaatgctgc taacatacac ctgaaacatg gtggtggtgt agcaggtgca 3421 ctcaacaagg caaccaatgg tgccatgcaa aaggagagtg atgattacat taagctaaat 3481 ggccctctta cagtaggagg gtcttgtttg ctttctggac ataatcttgc taagaagtgt 3541 ctgcatgttg ttggacctaa cctaaatgca ggtgaggaca tccagcttct taaggcagca 3601 tatgaaaatt tcaattcaca ggacatctta cttgcaccat tgttgtcagc aggcatattt 3661 ggtgctaaac cacttcagtc tttacaagtg tgcgtgcaga cggttcgtac acaggtttat 3721 attgcagtca atgacaaagc tctttatgag caggttgtca tggattatct tgataacctg 3781 aagcctagag tggaagcacc taaacaagag gagccaccaa acacagaaga ttccaaaact 3841 gaggagaaat ctgtcgtaca gaagcctgtc gatgtgaagc caaaaattaa ggcctgcatt 3901 gatgaggtta ccacaacact ggaagaaact aagtttctta ccaataagtt actcttgttt 3961 gctgatatca atggtaagct ttaccatgat tctcagaaca tgcttagagg tgaagatatg 4021 tctttccttg agaaggatgc accttacatg gtaggtgatg ttatcactag tggtgatatc 4081 acttgtgttg taataccctc caaaaaggct ggtggcacta ctgagatgct ctcaagagct 4141 ttgaagaaag tgccagttga tgagtatata accacgtacc ctggacaagg atgtgctggt 4201 tatacacttg aggaagctaa gactgctctt aagaaatgca aatctgcatt ttatgtacta 4261 ccttcagaag cacctaatgc taaggaagag attctaggaa ctgtatcctg gaatttgaga 4321 gaaatgcttg ctcatgctga agagacaaga aaattaatgc ctatatgcat ggatgttaga 4381 gccataatgg caaccatcca acgtaagtat aaaggaatta aaattcaaga gggcatcgtt 4441 gactatggtg tccgattctt cttttatact agtaaagagc ctgtagcttc tattattacg 4501 aagctgaact ctctaaatga gccgcttgtc acaatgccaa ttggttatgt gacacatggt 4561 tttaatcttg aagaggctgc gcgctgtatg cgttctctta aagctcctgc cgtagtgtca 4621 gtatcatcac cagatgctgt tactacatat aatggatacc tcacttcgtc atcaaagaca 4681 tctgaggagc actttgtaga aacagtttct ttggctggct cttacagaga ttggtcctat 4741 tcaggacagc gtacagagtt aggtgttgaa tttcttaagc gtggtgacaa aattgtgtac 4801 cacactctgg agagccccgt cgagtttcat cttgacggtg aggttctttc acttgacaaa 4861 ctaaagagtc tcttatccct gcgggaggtt aagactataa aagtgttcac aactgtggac 4921 aacactaatc tccacacaca gcttgtggat atgtctatga catatggaca gcagtttggt 4981 ccaacatact tggatggtgc tgatgttaca aaaattaaac ctcatgtaaa tcatgagggt 5041 aagactttct ttgtactacc tagtgatgac acactacgta gtgaagcttt cgagtactac 5101 catactcttg atgagagttt tcttggtagg tacatgtctg ctttaaacca cacaaagaaa 5161 tggaaatttc ctcaagttgg tggtttaact tcaattaaat gggctgataa caattgttat 5221 ttgtctagtg ttttattagc acttcaacag cttgaagtca aattcaatgc accagcactt 5281 caagaggctt attatagagc ccgtgctggt gatgctgcta acttttgtgc actcatactc 5341 gcttacagta ataaaactgt tggcgagctt ggtgatgtca gagaaactat gacccatctt 5401 ctacagcatg ctaatttgga atctgcaaag cgagttctta atgtggtgtg taaacattgt 5461 ggtcagaaaa ctactacctt aacgggtgta gaagctgtga tgtatatggg tactctatct 5521 tatgataatc ttaagacagg tgtttccatt ccatgtgtgt gtggtcgtga tgctacacaa 5581 tatctagtac aacaagagtc ttcttttgtt atgatgtctg caccacctgc tgagtataaa 5641 ttacagcaag gtacattctt atgtgcgaat gagtacactg gtaactatca gtgtggtcat 5701 tacactcata taactgctaa ggagaccctc tatcgtattg acggagctca ccttacaaag 5761 atgtcagagt acaaaggacc agtgactgat gttttctaca aggaaacatc ttacactaca 5821 accatcaagc ctgtgtcgta taaactcgat ggagttactt acacagagat tgaaccaaaa 5881 ttggatgggt attataaaaa ggataatgct tactatacag agcagcctat agaccttgta 5941 ccaactcaac cattaccaaa tgcgagtttt gataatttca aactcacatg ttctaacaca 6001 aaatttgctg atgatttaaa tcaaatgaca ggcttcacaa agccagcttc acgagagcta 6061 tctgtcacat tcttcccaga cttgaatggc gatgtagtgg ctattgacta tagacactat 6121 tcagcgagtt tcaagaaagg tgctaaatta ctgcataagc caattgtttg gcacattaac 6181 caggctacaa ccaagacaac gttcaaacca aacacttggt gtttacgttg tctttggagt 6241 acaaagccag tagatacttc aaattcattt gaagttctgg cagtagaaga cacacaagga 6301 atggacaatc ttgcttgtga aagtcaacaa cccacctctg aagaagtagt ggaaaatcct 6361 accatacaga aggaagtcat agagtgtgac gtgaaaacta ccgaagttgt aggcaatgtc 6421 atacttaaac catcagatga aggtgttaaa gtaacacaag agttaggtca tgaggatctt 6481 atggctgctt atgtggaaaa cacaagcatt accattaaga aacctaatga gctttcacta 6541 gccttaggtt taaaaacaat tgccactcat ggtattgctg caattaatag tgttccttgg 6601 agtaaaattt tggcttatgt caaaccattc ttaggacaag cagcaattac aacatcaaat 6661 tgcgctaaga gattagcaca acgtgtgttt aacaattata tgccttatgt gtttacatta 6721 ttgttccaat tgtgtacttt tactaaaagt accaattcta gaattagagc ttcactacct 6781 acaactattg ctaaaaatag tgttaagagt gttgctaaat tatgtttgga tgccggcatt 6841 aattatgtga agtcacccaa attttctaaa ttgttcacaa tcgctatgtg gctattgttg 6901 ttaagtattt gcttaggttc tctaatctgt gtaactgctg cttttggtgt actcttatct 6961 aattttggtg ctccttctta ttgtaatggc gttagagaat tgtatcttaa ttcgtctaac 7021 gttactacta tggatttctg tgaaggttct tttccttgca gcatttgttt aagtggatta 7081 gactcccttg attcttatcc agctcttgaa accattcagg tgacgatttc atcgtacaag 7141 ctagacttga caattttagg tctggccgct gagtgggttt tggcatatat gttgttcaca 7201 aaattctttt atttattagg tctttcagct ataatgcagg tgttctttgg ctattttgct 7261 agtcatttca tcagcaattc ttggctcatg tggtttatca ttagtattgt acaaatggca 7321 cccgtttctg caatggttag gatgtacatc ttctttgctt ctttctacta catatggaag 7381 agctatgttc atatcatgga tggttgcacc tcttcgactt gcatgatgtg ctataagcgc 7441 aatcgtgcca cacgcgttga gtgtacaact attgttaatg gcatgaagag atctttctat 7501 gtctatgcaa atggaggccg tggcttctgc aagactcaca attggaattg tctcaattgt 7561 gacacatttt gcactggtag tacattcatt agtgatgaag ttgctcgtga tttgtcactc 7621 cagtttaaaa gaccaatcaa ccctactgac cagtcatcgt atattgttga tagtgttgct 7681 gtgaaaaatg gcgcgcttca cctctacttt gacaaggctg gtcaaaagac ctatgagaga 7741 catccgctct cccattttgt caatttagac aatttgagag ctaacaacac taaaggttca 7801 ctgcctatta atgtcatagt ttttgatggc aagtccaaat gcgacgagtc tgcttctaag 7861 tctgcttctg tgtactacag tcagctgatg tgccaaccta ttctgttgct tgaccaagct 7921 cttgtatcag acgttggaga tagtactgaa gtttccgtta agatgtttga tgcttatgtc 7981 gacacctttt cagcaacttt tagtgttcct atggaaaaac ttaaggcact tgttgctaca 8041 gctcacagcg agttagcaaa gggtgtagct ttagatggtg tcctttctac attcgtgtca 8101 gctgcccgac aaggtgttgt tgataccgat gttgacacaa aggatgttat tgaatgtctc 8161 aaactttcac atcactctga cttagaagtg acaggtgaca gttgtaacaa tttcatgctc 8221 acctataata aggttgaaaa catgacgccc agagatcttg gcgcatgtat tgactgtaat 8281 gcaaggcata tcaatgccca agtagcaaaa agtcacaatg tttcactcat ctggaatgta 8341 aaagactaca tgtctttatc tgaacagctg cgtaaacaaa ttcgtagtgc tgccaagaag 8401 aacaacatac cttttagact aacttgtgct acaactagac aggttgtcaa tgtcataact 8461 actaaaatct cactcaaggg tggtaagatt gttagtactt gttttaaact tatgcttaag 8521 gccacattat tgtgcgttct tgctgcattg gtttgttata tcgttatgcc agtacataca 8581 ttgtcaatcc atgatggtta cacaaatgaa atcattggtt acaaagccat tcaggatggt 8641 gtcactcgtg acatcatttc tactgatgat tgttttgcaa ataaacatgc tggttttgac 8701 gcatggttta gccagcgtgg tggttcatac aaaaatgaca aaagctgccc tgtagtagct 8761 gctatcatta caagagagat tggtttcata gtgcctggct taccgggtac tgtgctgaga 8821 gcaatcaatg gtgacttctt gcattttcta cctcgtgttt ttagtgctgt tggcaacatt 8881 tgctacacac cttccaaact cattgagtat agtgattttg ctacctctgc ttgcgttctt 8941 gctgctgagt gtacaatttt taaggatgct atgggcaaac ctgtgccata ttgttatgac 9001 actaatttgc tagagggttc tatttcttat agtgagcttc gtccagacac tcgttatgtg 9061 cttatggatg gttccatcat acagtttcct aacacttacc tggagggttc tgttagagta 9121 gtaacaactt ttgatgctga gtactgtaga catggtacat gcgaaaggtc agaagtaggt 9181 atttgcctat ctaccagtgg tagatgggtt cttaataatg agcattacag agctctatca 9241 ggagttttct gtggtgttga tgcgatgaat ctcatagcta acatctttac tcctcttgtg 9301 caacctgtgg gtgctttaga tgtgtctgct tcagtagtgg ctggtggtat tattgccata 9361 ttggtgactt gtgctgccta ctactttatg aaattcagac gtgtttttgg tgagtacaac 9421 catgttgttg ctgctaatgc acttttgttt ttgatgtctt tcactatact ctgtctggta 9481 ccagcttaca gctttctgcc gggagtctac tcagtctttt acttgtactt gacattctat 9541 ttcaccaatg atgtttcatt cttggctcac cttcaatggt ttgccatgtt ttctcctatt 9601 gtgccttttt ggataacagc aatctatgta ttctgtattt ctctgaagca ctgccattgg 9661 ttctttaaca actatcttag gaaaagagtc atgtttaatg gagttacatt tagtaccttc 9721 gaggaggctg ctttgtgtac ctttttgctc aacaaggaaa tgtacctaaa attgcgtagc 9781 gagacactgt tgccacttac acagtataac aggtatcttg ctctatataa caagtacaag 9841 tatttcagtg gagccttaga tactaccagc tatcgtgaag cagcttgctg ccacttagca 9901 aaggctctaa atgactttag caactcaggt gctgatgttc tctaccaacc accacagaca 9961 tcaatcactt ctgctgttct gcagagtggt tttaggaaaa tggcattccc gtcaggcaaa 10021 gttgaagggt gcatggtaca agtaacctgt ggaactacaa ctcttaatgg attgtggttg 10081 gatgacacag tatactgtcc aagacatgtc atttgcacag cagaagacat gcttaatcct 10141 aactatgaag atctgctcat tcgcaaatcc aaccatagct ttcttgttca ggctggcaat 10201 gttcaacttc gtgttattgg ccattctatg caaaattgtc tgcttaggct taaagttgat 10261 acttctaacc ctaagacacc caagtataaa tttgtccgta tccaacctgg tcaaacattt 10321 tcagttctag catgctacaa tggttcacca tctggtgttt atcagtgtgc catgagacct 10381 aatcatacca ttaaaggttc tttccttaat ggatcatgtg gtagtgttgg ttttaacatt 10441 gattatgatt gcgtgtcttt ctgctatatg catcatatgg agcttccaac aggagtacac 10501 gctggtactg acttagaagg taaattctat ggtccatttg ttgacagaca aactgcacag 10561 gctgcaggta cagacacaac cataacatta aatgttttgg catggctgta tgctgctgtt 10621 atcaatggtg ataggtggtt tcttaataga ttcaccacta ctttgaatga ctttaacctt 10681 gtggcaatga agtacaacta tgaacctttg acacaagatc atgttgacat attgggacct 10741 ctttctgctc aaacaggaat tgccgtctta gatatgtgtg ctgctttgaa agagctgctg 10801 cagaatggta tgaatggtcg tactatcctt ggtagcacta ttttagaaga tgagtttaca 10861 ccatttgatg ttgttagaca atgctctggt gttaccttcc aaggtaagtt caagaaaatt 10921 gttaagggca ctcatcattg gatgctttta actttcttga catcactatt gattcttgtt 10981 caaagtacac agtggtcact gtttttcttt gtttacgaga atgctttctt gccatttact 11041 cttggtatta tggcaattgc tgcatgtgct atgctgcttg ttaagcataa gcacgcattc 11101 ttgtgcttgt ttctgttacc ttctcttgca acagttgctt actttaatat ggtctacatg 11161 cctgctagct gggtgatgcg tatcatgaca tggcttgaat tggctgacac tagcttgtct 11221 ggttataggc ttaaggattg tgttatgtat gcttcagctt tagttttgct tattctcatg 11281 acagctcgca ctgtttatga tgatgctgct agacgtgttt ggacactgat gaatgtcatt 11341 acacttgttt acaaagtcta ctatggtaat gctttagatc aagctatttc catgtgggcc 11401 ttagttattt ctgtaacctc taactattct ggtgtcgtta cgactatcat gtttttagct 11461 agagctatag tgtttgtgtg tgttgagtat tacccattgt tatttattac tggcaacacc 11521 ttacagtgta tcatgcttgt ttattgtttc ttaggctatt gttgctgctg ctactttggc 11581 cttttctgtt tactcaaccg ttacttcagg cttactcttg gtgtttatga ctacttggtc 11641 tctacacaag aatttaggta tatgaactcc caggggcttt tgcctcctaa gagtagtatt 11701 gatgctttca agcttaacat taagttgttg ggtattggag gtaaaccatg tatcaaggtt 11761 gctactgtac agtctaaaat gtctgacgta aagtgcacat ctgtggtact gctctcggtt 11821 cttcaacaac ttagagtaga gtcatcttct aaattgtggg cacaatgtgt acaactccac 11881 aatgatattc ttcttgcaaa agacacaact gaagctttcg agaagatggt ttctcttttg 11941 tctgttttgc tatccatgca gggtgctgta gacattaata ggttgtgcga ggaaatgctc 12001 gataaccgtg ctactcttca ggctattgct tcagaattta gttctttacc atcatatgcc 12061 gcttatgcca ctgcccagga ggcctatgag caggctgtag ctaatggtga ttctgaagtc 12121 gttctcaaaa agttaaagaa atctttgaat gtggctaaat ctgagtttga ccgtgatgct 12181 gccatgcaac gcaagttgga aaagatggca gatcaggcta tgacccaaat gtacaaacag 12241 gcaagatctg aggacaagag ggcaaaagta actagtgcta tgcaaacaat gctcttcact 12301 atgcttagga agcttgataa tgatgcactt aacaacatta tcaacaatgc gcgtgatggt 12361 tgtgttccac tcaacatcat accattgact acagcagcca aactcatggt tgttgtccct 12421 gattatggta cctacaagaa cacttgtgat ggtaacacct ttacatatgc atctgcactc 12481 tgggaaatcc agcaagttgt tgatgcggat agcaagattg ttcaacttag tgaaattaac 12541 atggacaatt caccaaattt ggcttggcct cttattgtta cagctctaag agccaactca 12601 gctgttaaac tacagaataa tgaactgagt ccagtagcac tacgacagat gtcctgtgcg 12661 gctggtacca cacaaacagc ttgtactgat gacaatgcac ttgcctacta taacaattcg 12721 aagggaggta ggtttgtgct ggcattacta tcagaccacc aagatctcaa atgggctaga 12781 ttccctaaga gtgatggtac aggtacaatt tacacagaac tggaaccacc ttgtaggttt 12841 gttacagaca caccaaaagg gcctaaagtg aaatacttgt acttcatcaa aggcttaaac 12901 aacctaaata gaggtatggt gctgggcagt ttagctgcta cagtacgtct tcaggctgga 12961 aatgctacag aagtacctgc caattcaact gtgctttcct tctgtgcttt tgcagtagac 13021 cctgctaaag catataagga ttacctagca agtggaggac aaccaatcac caactgtgtg 13081 aagatgttgt gtacacacac tggtacagga caggcaatta ctgtaacacc agaagctaac 13141 atggaccaag agtcctttgg tggtgcttca tgttgtctgt attgtagatg ccacattgac 13201 catccaaatc ctaaaggatt ctgtgacttg aaaggtaagt acgtccaaat acctaccact 13261 tgtgctaatg acccagtggg ttttacactt agaaacacag tctgtaccgt ctgcggaatg 13321 tggaaaggtt atggctgtag ttgtgaccaa ctccgcgaac ccttgatgca gtctgcggat 13381 gcatcaacgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca ccgtgcggca 13441 caggcactag tactgatgtc gtctacaggg cttttgatat ttacaacgaa aaagttgctg 13501 gttttgcaaa gttcctaaaa actaattgct gtcgcttcca ggagaaggat gaggaaggca 13561 atttattaga ctcttacttt gtagttaaga ggcatactat gtctaactac caacatgaag 13621 agactattta taacttggtt aaagattgtc cagcggttgc tgtccatgac tttttcaagt 13681 ttagagtaga tggtgacatg gtaccacata tatcacgtca gcgtctaact aaatacacaa 13741 tggctgattt agtctatgct ctacgtcatt ttgatgaggg taattgtgat acattaaaag 13801 aaatactcgt cacatacaat tgctgtgatg atgattattt caataagaag gattggtatg 13861 acttcgtaga gaatcctgac atcttacgcg tatatgctaa cttaggtgag cgtgtacgcc 13921 aatcattatt aaagactgta caattctgcg atgctatgcg tgatgcaggc attgtaggcg 13981 tactgacatt agataatcag gatcttaatg ggaactggta cgatttcggt gatttcgtac 14041 aagtagcacc aggctgcgga gttcctattg tggattcata ttactcattg ctgatgccca 14101 tcctcacttt gactagggca ttggctgctg agtcccatat ggatgctgat ctcgcaaaac 14161 cacttattaa gtgggatttg ctgaaatatg attttacgga agagagactt tgtctcttcg 14221 accgttattt taaatattgg gaccagacat accatcccaa ttgtattaac tgtttggatg 14281 ataggtgtat ccttcattgt gcaaacttta atgtgttatt ttctactgtg tttccaccta 14341 caagttttgg accactagta agaaaaatat ttgtagatgg tgttcctttt gttgtttcaa 14401 ctggatacca ttttcgtgag ttaggagtcg tacataatca ggatgtaaac ttacatagct 14461 cgcgtctcag tttcaaggaa cttttagtgt atgctgctga tccagctatg catgcagctt 14521 ctggcaattt attgctagat aaacgcacta catgcttttc agtagctgca ctaacaaaca 14581 atgttgcttt tcaaactgtc aaacccggta attttaataa agacttttat gactttgctg 14641 tgtctaaagg tttctttaag gaaggaagtt ctgttgaact aaaacacttc ttctttgctc 14701 aggatggcaa cgctgctatc agtgattatg actattatcg ttataatctg ccaacaatgt 14761 gtgatatcag acaactccta ttcgtagttg aagttgttga taaatacttt gattgttacg 14821 atggtggctg tattaatgcc aaccaagtaa tcgttaacaa tctggataaa tcagctggtt 14881 tcccatttaa taaatggggt aaggctagac tttattatga ctcaatgagt tatgaggatc 14941 aagatgcact tttcgcgtat actaagcgta atgtcatccc tactataact caaatgaatc 15001 ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc tctatctgta 15061 gtactatgac aaatagacag tttcatcaga aattattgaa gtcaatagcc gccactagag 15121 gagctactgt ggtaattgga acaagcaagt tttacggtgg ctggcataat atgttaaaaa 15181 ctgtttacag tgatgtagaa actccacacc ttatgggttg ggattatcca aaatgtgaca 15241 gagccatgcc taacatgctt aggataatgg cctctcttgt tcttgctcgc aaacataaca 15301 cttgctgtaa cttatcacac cgtttctaca ggttagctaa cgagtgtgcg caagtattaa 15361 gtgagatggt catgtgtggc ggctcactat atgttaaacc aggtggaaca tcatccggtg 15421 atgctacaac tgcttatgct aatagtgtct ttaacatttg tcaagctgtt acagccaatg 15481 taaatgcact tctttcaact gatggtaata agatagctga caagtatgtc cgcaatctac 15541 aacacaggct ctatgagtgt ctctatagaa atagggatgt tgatcatgaa ttcgtggatg 15601 agttttacgc ttacctgcgt aaacatttct ccatgatgat tctttctgat gatgccgttg 15661 tgtgctataa cagtaactat gcggctcaag gtttagtagc tagcattaag aactttaagg 15721 cagttcttta ttatcaaaat aatgtgttca tgtctgaggc aaaatgttgg actgagactg 15781 accttactaa aggacctcac gaattttgct cacagcatac aatgctagtt aaacaaggag 15841 atgattacgt gtacctgcct tacccagatc catcaagaat attaggcgca ggctgttttg 15901 tcgatgatat tgtcaaaaca gatggtacac ttatgattga aaggttcgtg tcactggcta 15961 ttgatgctta cccacttaca aaacatccta atcaggagta tgctgatgtc tttcacttgt 16021 atttacaata cattagaaag ttacatgatg agcttactgg ccacatgttg gacatgtatt 16081 ccgtaatgct aactaatgat aacacctcac ggtactggga acctgagttt tatgaggcta 16141 tgtacacacc acatacagtc ttgcaggctg taggtgcttg tgtattgtgc aattcacaga 16201 cttcacttcg ttgcggtgcc tgtattagga gaccattcct atgttgcaag tgctgctatg 16261 accatgtcat ttcaacatca cacaaattag tgttgtctgt taatccctat gtttgcaatg 16321 ccccaggttg tgatgtcact gatgtgacac aactgtatct aggaggtatg agctattatt 16381 gcaagtcaca taagcctccc attagttttc cattatgtgc taatggtcag gtttttggtt 16441 tatacaaaaa cacatgtgta ggcagtgaca atgtcactga cttcaatgcg atagcaacat 16501 gtgattggac taatgctggc gattacatac ttgccaacac ttgtactgag agactcaagc 16561 ttttcgcagc agaaacgctc aaagccactg aggaaacatt taagctgtca tatggtattg 16621 ccactgtacg cgaagtactc tctgacagag aattgcatct ttcatgggag gttggaaaac 16681 ctagaccacc attgaacaga aactatgtct ttactggtta ccgtgtaact aaaaatagta 16741 aagtacagat tggagagtac acctttgaaa aaggtgacta tggtgatgct gttgtgtaca 16801 gaggtactac gacatacaag ttgaatgttg gtgattactt tgtgttgaca tctcacactg 16861 taatgccact tagtgcacct actctagtgc cacaagagca ctatgtgaga attactggct 16921 tgtacccaac actcaacatc tcagatgagt tttctagcaa tgttgcaaat tatcaaaagg 16981 tcggcatgca aaagtactct acactccaag gaccacctgg tactggtaag agtcattttg 17041 ccatcggact tgctctctat tacccatctg ctcgcatagt gtatacggca tgctctcatg 17101 cagctgttga tgccctatgt gaaaaggcat taaaatattt gcccatagat aaatgtagta 17161 gaatcatacc tgcgcgtgcg cgcgtagagt gttttgataa attcaaagtg aattcaacac 17221 tagaacagta tgttttctgc actgtaaatg cattgccaga aacaactgct gacattgtag 17281 tctttgatga aatctctatg gctactaatt atgacttgag tgttgtcaat gctagacttc 17341 gtgcaaaaca ctacgtctat attggcgatc ctgctcaatt accagccccc cgcacattgc 17401 tgactaaagg cacactagaa ccagaatatt ttaattcagt gtgcagactt atgaaaacaa 17461 taggtccaga catgttcctt ggaacttgtc gccgttgtcc tgctgaaatt gttgacactg 17521 tgagtgcttt agtttatgac aataagctaa aagcacacaa ggataagtca gctcaatgct 17581 tcaaaatgtt ctacaaaggt gttattacac atgatgtttc atctgcaatc aacagacctc 17641 aaataggcgt tgtaagagaa tttcttacac gcaatcctgc ttggagaaaa gctgttttta 17701 tctcacctta taattcacag aacgctgtag cttcaaaaat cttaggattg cctacgcaga 17761 ctgttgattc atcacagggt tctgaatatg actatgtcat attcacacaa actactgaaa 17821 cagcacactc ttgtaatgtc aaccgcttca atgtggctat cacaagggca aaaattggca 17881 ttttgtgcat aatgtctgat agagatcttt atgacaaact gcaatttaca agtctagaaa 17941 taccacgtcg caatgtggct acattacaag cagaaaatgt aactggactt tttaaggact 18001 gtagtaagat cattactggt cttcatccta cacaggcacc tacacacctc agcgttgata 18061 taaagttcaa gactgaagga ttatgtgttg acataccagg cataccaaag gacatgacct 18121 accgtagact catctctatg atgggtttca aaatgaatta ccaagtcaat ggttacccta 18181 atatgtttat cacccgcgaa gaagctattc gtcacgttcg tgcgtggatt ggctttgatg 18241 tagagggctg tcatgcaact agagatgctg tgggtactaa cctacctctc cagctaggat 18301 tttctacagg tgttaactta gtagctgtac cgactggtta tgttgacact gaaaataaca 18361 cagaattcac cagagttaat gcaaaacctc caccaggtga ccagtttaaa catcttatac 18421 cactcatgta taaaggcttg ccctggaatg tagtgcgtat taagatagta caaatgctca 18481 gtgatacact gaaaggattg tcagacagag tcgtgttcgt cctttgggcg catggctttg 18541 agcttacatc aatgaagtac tttgtcaaga ttggacctga aagaacgtgt tgtctgtgtg 18601 acaaacgtgc aacttgcttt tctacttcat cagatactta tgcctgctgg aatcattctg 18661 tgggttttga ctatgtctat aacccattta tgattgatgt tcagcagtgg ggctttacgg 18721 gtaaccttca gagtaaccat gaccaacatt gccaggtaca tggaaatgca catgtggcta 18781 gttgtgatgc tatcatgact agatgtttag cagtccatga gtgctttgtt aagcgcgttg 18841 attggtctgt tgaataccct attataggag atgaactgag ggttaattct gcttgcagaa 18901 aagtacaaca catggttgtg aagtctgcat tgcttgctga taagtttcca gttcttcatg 18961 acattggaaa tccaaaggct atcaagtgtg tgcctcaggc tgaagtagaa tggaagttct 19021 acgatgctca gccatgtagt gacaaagctt acaaaataga ggaactcttc tattcttatg 19081 ctacacatca cgataaattc actgatggtg tttgtttgtt ttggaattgt aacgttgatc 19141 gttacccagc caatgcaatt gtgtgtaggt ttgacacaag agtcttgtca aacttgaact 19201 taccaggctg tgatggtggt agtttgtatg tgaataagca tgcattccac actccagctt 19261 tcgataaaag tgcatttact aatttaaagc aattgccttt cttttactat tctgatagtc 19321 cttgtgagtc tcatggcaaa caagtagtgt cggatattga ttatgttcca ctcaaatctg 19381 ctacgtgtat tacacgatgc aatttaggtg gtgctgtttg cagacaccat gcaaatgagt 19441 accgacagta cttggatgca tataatatga tgatttctgc tggatttagc ctatggattt 19501 acaaacaatt tgatacttat aacctgtgga atacatttac caggttacag agtttagaaa 19561 atgtggctta taatgttgtt aataaaggac actttgatgg acacgccggc gaagcacctg 19621 tttccatcat taataatgct gtttacacaa aggtagatgg tattgatgtg gagatctttg 19681 aaaataagac aacacttcct gttaatgttg catttgagct ttgggctaag cgtaacatta 19741 aaccagtgcc agagattaag atactcaata atttgggtgt tgatatcgct gctaatactg 19801 taatctggga ctacaaaaga gaagccccag cacatgtatc tacaataggt gtctgcacaa 19861 tgactgacat tgccaagaaa cctactgaga gtgcttgttc ttcacttact gtcttgtttg 19921 atggtagagt ggaaggacag gtagaccttt ttagaaacgc ccgtaatggt gttttaataa 19981 cagaaggttc agtcaaaggt ctaacacctt caaagggacc agcacaagct agcgtcaatg 20041 gagtcacatt aattggagaa tcagtaaaaa cacagtttaa ctactttaag aaagtagacg 20101 gcattattca acagttgcct gaaacctact ttactcagag cagagactta gaggatttta 20161 agcccagatc acaaatggaa actgactttc tcgagctcgc tatggatgaa ttcatacagc 20221 gatataagct cgagggctat gccttcgaac acatcgttta tggagatttc agtcatggac 20281 aacttggcgg tcttcattta atgataggct tagccaagcg ctcacaagat tcaccactta 20341 aattagagga ttttatccct atggacagca cagtgaaaaa ttacttcata acagatgcgc 20401 aaacaggttc atcaaaatgt gtgtgttctg tgattgatct tttacttgat gactttgtcg 20461 agataataaa gtcacaagat ttgtcagtga tttcaaaagt ggtcaaggtt acaattgact 20521 atgctgaaat ttcattcatg ctttggtgta aggatggaca tgttgaaacc ttctacccaa 20581 aactacaagc aagtcaagcg tggcaaccag gtgttgcgat gcctaacttg tacaagatgc 20641 aaagaatgct tcttgaaaag tgtgaccttc agaattatgg tgaaaatgct gttataccaa 20701 aaggaataat gatgaatgtc gcaaagtata ctcaactgtg tcaatactta aatacactta 20761 ctttagctgt accctacaac atgagagtta ttcactttgg tgctggctct gataaaggag 20821 ttgcaccagg tacagctgtg ctcagacaat ggttgccaac tggcacacta cttgtcgatt 20881 cagatcttaa tgacttcgtc tccgacgcag attctacttt aattggagac tgtgcaacag 20941 tacatacggc taataaatgg gaccttatta ttagcgatat gtatgaccct aggaccaaac 21001 atgtgacaaa agagaatgac tctaaagaag ggtttttcac ttatctgtgt ggatttataa 21061 agcaaaaact agccctgggt ggttctatag ctgtaaagat aacagagcat tcttggaatg 21121 ctgaccttta caagcttatg ggccatttct catggtggac agcttttgtt acaaatgtaa 21181 atgcatcatc atcggaagca tttttaattg gggctaacta tcttggcaag ccgaaggaac 21241 aaattgatgg ctataccatg catgctaact acattttctg gaggaacaca aatcctatcc 21301 agttgtcttc ctattcactc tttgacatga gcaaatttcc tcttaaatta agaggaactg 21361 ctgtaatgtc tcttaaggag aatcaaatca atgatatgat ttattctctt ctggaaaaag 21421 gtaggcttat cattagagaa aacaacagag ttgtggtttc aagtgatatt cttgttaaca 21481 actaaacgaa catgtttatt ttcttattat ttcttactct cactagtggt agtgaccttg 21541 accggtgcac cacttttgat gatgttcaag ctcctaatta cactcaacat acttcatcta 21601 tgaggggggt ttactatcct gatgaaattt ttagatcaga cactctttat ttaactcagg 21661 atttatttct tccattttat tctaatgtta cagggtttca tactattaat catacgtttg 21721 gcaaccctgt catacctttt aaggatggta tttattttgc tgccacagag aaatcaaatg 21781 ttgtccgtgg ttgggttttt ggttctacca tgaacaacaa gtcacagtcg gtgattatta 21841 ttaacaattc tactaatgtt gttatacgag catgtaactt tgaattgtgt gacaaccctt 21901 tctttgctgt ttctaaaccc atgggtacac agacacatac tatgatattc gataatgcat 21961 ttaattgcac tttcgagtac atatctgatg ccttttcgct tgatgtttca gaaaagtcag 22021 gtaattttaa acacttacga gagtttgtgt ttaaaaataa agatgggttt ctctatgttt 22081 ataagggcta tcaacctata gatgtagttc gtgatctacc ttctggtttt aacactttga 22141 aacctatttt taagttgcct cttggtatta acattacaaa ttttagagcc attcttacag 22201 ccttttcacc tgctcaagac atttggggca cgtcagctgc agcctatttt gttggctatt 22261 taaagccaac tacatttatg ctcaagtatg atgaaaatgg tacaatcaca gatgctgttg 22321 attgttctca aaatccactt gctgaactca aatgctctgt taagagcttt gagattgaca 22381 aaggaattta ccagacctct aatttcaggg ttgttccctc aggagatgtt gtgagattcc 22441 ctaatattac aaacttgtgt ccttttggag aggtttttaa tgctactaaa ttcccttctg 22501 tctatgcatg ggagagaaaa aaaatttcta attgtgttgc tgattactct gtgctctaca 22561 actcaacatt tttttcaacc tttaagtgct atggcgtttc tgccactaag ttgaatgatc 22621 tttgcttctc caatgtctat gcagattctt ttgtagtcaa gggagatgat gtaagacaaa 22681 tagcgccagg acaaactggt gttattgctg attataatta taaattgcca gatgatttca 22741 tgggttgtgt ccttgcttgg aatactagga acattgatgc tacttcaact ggtaattata 22801 attataaata taggtatctt agacatggca agcttaggcc ctttgagaga gacatatcta 22861 atgtgccttt ctcccctgat ggcaaacctt gcaccccacc tgctcttaat tgttattggc 22921 cattaaatga ttatggtttt tacaccacta ctggcattgg ctaccaacct tacagagttg 22981 tagtactttc ttttgaactt ttaaatgcac cggccacggt ttgtggacca aaattatcca 23041 ctgaccttat taagaaccag tgtgtcaatt ttaattttaa tggactcact ggtactggtg 23101 tgttaactcc ttcttcaaag agatttcaac catttcaaca atttggccgt gatgtttctg 23161 atttcactga ttccgttcga gatcctaaaa catctgaaat attagacatt tcaccttgcg 23221 cttttggggg tgtaagtgta attacacctg gaacaaatgc ttcatctgaa gttgctgttc 23281 tatatcaaga tgttaactgc actgatgttt ctacagcaat tcatgcagat caactcacac 23341 cagcttggcg catatattct actggaaaca atgtattcca gactcaagca ggctgtctta 23401 taggagctga gcatgtcgac acttcttatg agtgcgacat tcctattgga gctggcattt 23461 gtgctagtta ccatacagtt tctttattac gtagtactag ccaaaaatct attgtggctt 23521 atactatgtc tttaggtgct gatagttcaa ttgcttactc taataacacc attgctatac 23581 ctactaactt ttcaattagc attactacag aagtaatgcc tgtttctatg gctaaaacct 23641 ccgtagattg taatatgtac atctgcggag attctactga atgtgctaat ttgcttctcc 23701 aatatggtag cttttgcaca caactaaatc gtgcactctc aggtattgct gctgaacagg 23761 atcgcaacac acgtgaagtg ttcgctcaag tcaaacaaat gtacaaaacc ccaactttga 23821 aatattttgg tggttttaat ttttcacaaa tattacctga ccctctaaag ccaactaaga 23881 ggtcttttat tgaggacttg ctctttaata aggtgacact cgctgatgct ggcttcatga 23941 agcaatatgg cgaatgccta ggtgatatta atgctagaga tctcatttgt gcgcagaagt 24001 tcaatggact tacagtgttg ccacctctgc tcactgatga tatgattgct gcctacactg 24061 ctgctctagt tagtggtact gccactgctg gatggacatt tggtgctggc gctgctcttc 24121 aaataccttt tgctatgcaa atggcatata ggttcaatgg cattggagtt acccaaaatg 24181 ttctctatga gaaccaaaaa caaatcgcca accaatttaa caaggcgatt agtcaaattc 24241 aagaatcact tacaacaaca tcaactgcat tgggcaagct gcaagacgtt gttaaccaga 24301 atgctcaagc attaaacaca cttgttaaac aacttagctc taattttggt gcaatttcaa 24361 gtgtgctaaa tgatatcctt tcgcgacttg ataaagtcga ggcggaggta caaattgaca 24421 ggttaattac aggcagactt caaagccttc aaacctatgt aacacaacaa ctaatcaggg 24481 ctgctgaaat cagggcttct gctaatcttg ctgctactaa aatgtctgag tgtgttcttg 24541 gacaatcaaa aagagttgac ttttgtggaa agggctacca ccttatgtcc ttcccacaag 24601 cagccccgca tggtgttgtc ttcctacatg tcacgtatgt gccatcccag gagaggaact 24661 tcaccacagc gccagcaatt tgtcatgaag gcaaagcata cttccctcgt gaaggtgttt 24721 ttgtgtttaa tggcacttct tggtttatta cacagaggaa cttcttttct ccacaaataa 24781 ttactacaga caatacattt gtctcaggaa attgtgatgt cgttattggc atcattaaca 24841 acacagttta tgatcctctg caacctgagc ttgactcatt caaagaagag ctggacaagt 24901 acttcaaaaa tcatacatca ccagatgttg atcttggcga catttcaggc attaacgctt 24961 ctgtcgtcaa cattcaaaaa gaaattgacc gcctcaatga ggtcgctaaa aatttaaatg 25021 aatcactcat tgaccttcaa gaattgggaa aatatgagca atatattaaa tggccttggt 25081 atgtttggct cggcttcatt gctggactaa ttgccatcgt catggttaca atcttgcttt 25141 gttgcatgac tagttgttgc agttgcctca agggtgcatg ctcttgtggt tcttgctgca 25201 agtttgatga ggatgactct gagccagttc tcaagggtgt caaattacat tacacataaa 25261 cgaacttatg gatttgttta tgagattttt tactcttaga tcaattactg cacagccagt 25321 aaaaattgac aatgcttctc ctgcaagtac tgttcatgct acagcaacga taccgctaca 25381 agcctcactc cctttcggat ggcttgttat tggcgttgca tttcttgctg tttttcagag 25441 cgctaccaaa ataattgcgc tcaataaaag atggcagcta gccctttata agggcttcca 25501 gttcatttgc aatttactgc tgctatttgt taccatctat tcacatcttt tgcttgtcgc 25561 tgcaggtatg gaggcgcaat ttttgtacct ctatgccttg atatattttc tacaatgcat 25621 caacgcatgt agaattatta tgagatgttg gctttgttgg aagtgcaaat ccaagaaccc 25681 attactttat gatgccaact actttgtttg ctggcacaca cataactatg actactgtat 25741 accatataac agtgtcacag atacaattgt cgttactgaa ggtgacggca tttcaacacc 25801 aaaactcaaa gaagactacc aaattggtgg ttattctgag gataggcact caggtgttaa 25861 agactatgtc gttgtacatg gctatttcac cgaagtttac taccagcttg agtctacaca 25921 aattactaca gacactggta ttgaaaatgc tacattcttc atctttaaca agcttgttaa 25981 agacccaccg aatgtgcaaa tacacacaat cgacggctct tcaggagttg ctaatccagc 26041 aatggatcca atttatgatg agccgacgac gactactagc gtgcctttgt aagcacaaga 26101 aagtgagtac gaacttatgt actcattcgt ttcggaagaa acaggtacgt taatagttaa 26161 tagcgtactt ctttttcttg ctttcgtggt attcttgcta gtcacactag ccatccttac 26221 tgcgcttcga ttgtgtgcgt actgctgcaa tattgttaac gtgagtttag taaaaccaac 26281 ggtttacgtc tactcgcgtg ttaaaaatct gaactcttct gaaggagttc ctgatcttct 26341 ggtctaaacg aactaactat tattattatt ctgtttggaa ctttaacatt gcttatcatg 26401 gcagacaacg gtactattac cgttgaggag cttaaacaac tcctggaaca atggaaccta 26461 gtaataggtt tcctattcct agcctggatt atgttactac aatttgccta ttctaatcgg 26521 aacaggtttt tgtacataat aaagcttgtt ttcctctggc tcttgtggcc agtaacactt 26581 gcttgttttg tgcttgctgc tgtctacaga attaattggg tgactggcgg gattgcgatt 26641 gcaatggctt gtattgtagg cttgatgtgg cttagctact tcgttgcttc cttcaggctg 26701 tttgctcgta cccgctcaat gtggtcattc aacccagaaa caaacattct tctcaatgtg 26761 cctctccggg ggacaattgt gaccagaccg ctcatggaaa gtgaacttgt cattggtgct 26821 gtgatcattc gtggtcactt gcgaatggcc ggacactccc tagggcgctg tgacattaag 26881 gacctgccaa aagagatcac tgtggctaca tcacgaacgc tttcttatta caaattagga 26941 gcgtcgcagc gtgtaggcac tgattcaggt tttgctgcat acaaccgcta ccgtattgga 27001 aactataaat taaatacaga ccacgccggt agcaacgaca atattgcttt gctagtacag 27061 taagtgacaa cagatgtttc atcttgttga cttccaggtt acaatagcag agatattgat 27121 tatcattatg aggactttca ggattgctat ttggaatctt gacgttataa taagttcaat 27181 agtgagacaa ttatttaagc ctctaactaa gaagaattat tcggagttag atgatgaaga 27241 acctatggag ttagattatc cataaaacga acatgaaaat tattctcttc ctgacattga 27301 ttgtatttac atcttgcgag ctatatcact atcaggagtg tgttagaggt acgactgtac 27361 tactaaaaga accttgccca tcaggaacat acgagggcaa ttcaccattt caccctcttg 27421 ctgacaataa atttgcacta acttgcacta gcacacactt tgcttttgct tgtgctgacg 27481 gtactcgaca tacctatcag ctgcgtgcaa gatcagtttc accaaaactt ttcatcagac 27541 aagaggaggt tcaacaagag ctctactcgc cactttttct cattgttgct gctctagtat 27601 ttttaatact ttgcttcacc attaagagaa agacagaatg aatgagctca ctttaattga 27661 cttctatttg tgctttttag cctttctgct attccttgtt ttaataatgc ttattatatt 27721 ttggttttca ctcgaaatcc aggatctaga agaaccttgt accaaagtct aaacgaacat 27781 gaaacttctc attgttttga cttgtatttc tctatgcagt tgcatatgca ctgtagtaca 27841 gcgctgtgca tctaataaac ctcatgtgct tgaagatcct tgtaaggtac aacactaggg 27901 gtaatactta tagcactgct tggctttgtg ctctaggaaa ggttttacct tttcatagat 27961 ggcacactat ggttcaaaca tgcacaccta atgttactat caactgtcaa gatccagctg 28021 gtggtgcgct tatagctagg tgttggtacc ttcatgaagg tcaccaaact gctgcattta 28081 gagacgtact tgttgtttta aataaacgaa caaattaaaa tgtctgataa tggaccccaa 28141 tcaaaccaac gtagtgcccc ccgcattaca tttggtggac ccacagattc aactgacaat 28201 aaccagaatg gaggacgcaa tggggcaagg ccaaaacagc gccgacccca aggtttaccc 28261 aataatactg cgtcttggtt cacagctctc actcagcatg gcaaggagga acttagattc 28321 cctcgaggcc agggcgttcc aatcaacacc aatagtggtc cagatgacca aattggctac 28381 taccgaagag ctacccgacg agttcgtggt ggtgacggca aaatgaaaga gctcagcccc 28441 agatggtact tctattacct aggaactggc ccagaagctt cacttcccta cggcgctaac 28501 aaagaaggca tcgtatgggt tgcaactgag ggagccttga atacacccaa agaccacatt 28561 ggcacccgca atcctaataa caatgctgcc accgtgctac aacttcctca aggaacaaca 28621 ttgccaaaag gcttctacgc agagggaagc agaggcggca gtcaagcctc ttctcgctcc 28681 tcatcacgta gtcgcggtaa ttcaagaaat tcaactcctg gcagcagtag gggaaattct 28741 cctgctcgaa tggctagcgg aggtggtgaa actgccctcg cgctattgct gctagacaga 28801 ttgaaccagc ttgagagcaa agtttctggt aaaggccaac aacaacaagg ccaaactgtc 28861 actaagaaat ctgctgctga ggcatctaaa aagcctcgcc aaaaacgtac tgccacaaaa 28921 cagtacaacg tcactcaagc atttgggaga cgtggtccag aacaaaccca aggaaatttc 28981 ggggaccaag acctaatcag acaaggaact gattacaaac attggccgca aattgcacaa 29041 tttgctccaa gtgcctctgc attctttgga atgtcacgca ttggcatgga agtcacacct 29101 tcgggaacat ggctgactta tcatggagcc attaaattgg atgacaaaga tccacaattc 29161 aaagacaacg tcatactgct gaacaagcac attgacgcat acaaaacatt cccaccaaca 29221 gagcctaaaa aggacaaaaa gaaaaagact gatgaagctc agcctttgcc gcagagacaa 29281 aagaagcagc ccactgtgac tcttcttcct gcggctgaca tggatgattt ctccagacaa 29341 cttcaaaatt ccatgagtgg agcttctgct gattcaactc aggcataaac actcatgatg 29401 accacacaag gcagatgggc tatgtaaacg ttttcgcaat tccgtttacg atacatagtc 29461 tactcttgtg cagaatgaat tctcgtaact aaacagcaca agtaggttta gttaacttta 29521 atctcacata gcaatcttta atcaatgtgt aacattaggg aggacttgaa agagccacca 29581 cattttcatc gaggccacgc ggagtacgat cgagggtaca gtgaataatg ctagggagag 29641 ctgcctatat ggaagagccc taatgtgtaa aattaatttt agtagtgcta tccccatgtg 29701 attttaatag cttcttagga gaatgacaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS NC_003461 15600 bp ss-RNA linear VRL 13-AUG-2018 DEFINITION Human parainfluenza virus 1, complete genome. ACCESSION NC_003461 VERSION NC_003461.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Human respirovirus 1 ORGANISM Human respirovirus 1 Viruses; Riboviria; Negarnaviricota; Haploviricotina; Monjiviricetes; Mononegavirales; Paramyxoviridae; Orthoparamyxovirinae; Respirovirus. REFERENCE 1 (bases 1 to 15600) AUTHORS Newman,J.T., Surman,S.R., Riggs,J.M., Hansen,C.T., Collins,P.L., Murphy,B.R. and Skiadopoulos,M.H. TITLE Sequence analysis of the Washington/1964 strain of human parainfluenza virus type 1 (HPIV1) and recovery and characterization of wild-type recombinant HPIV1 produced by reverse genetics JOURNAL Virus Genes 24 (1), 77-92 (2002) PUBMED 11928991 REFERENCE 2 (bases 1 to 15600) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (05-MAY-2009) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 15600) AUTHORS Newman,J.T., Surman,S.R., Riggs,J.M., Hansen,C.T., Collins,P.L., Murphy,B.R. and Skiadopoulos,M.H. TITLE Direct Submission JOURNAL Submitted (11-DEC-2001) National Institute of Allergy and Infectious Diseases, National Institutes of Health, 7 Center Drive Building 7, Bethesda, MD 20892, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from AF457102. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..15600 /organism="Human respirovirus 1" /mol_type="genomic RNA" /strain="Washington 1964" /db_xref="taxon:12730" gene 56..1737 /gene="N" /locus_tag="Hpv1gp01" /db_xref="GeneID:935258" mRNA 56..1737 /gene="N" /locus_tag="Hpv1gp01" /product="nucleoprotein" /db_xref="GeneID:935258" CDS 120..1694 /gene="N" /locus_tag="Hpv1gp01" /note="nucleocapsid protein" /codon_start=1 /product="nucleoprotein" /protein_id="NP_604433.1" /db_xref="GeneID:935258" /translation="MAGLLSTFDTFSSRRSESINKSGGGAIIPGQRSTVSVFTLGPSVT DDADKLLIATTFLAHSLDTDKQHSQRGGFLVSLLAMAYSSPELYLTTNGVNADVKYVIY NIERDPKRTKTDGFIVKTRDMEYERTTEWLFGPMINKNPLFQGQRENADLEALLQTYGY PACLGAIIVQVWIVLVKAITSSAGLRKGFFNRLEAFRQDGTVKSALVFTGDTVEGIGAV MRSQQSLVSLMVETLVTMNTSRSDLTTLEKNIQIVGNYIRDAGLASFMNTIKYGVETKM AALTLSNLRPDINKLRSLVDIYLSKGARAPFICILRDPVHGDFAPGNYPALWSYAMGVA VVQNKAMQQYVTGRTYLDMEMFLLGQAVAKDADSKISSALEEELGVTDTAKERLRHHLT NLSGGDGAYHKPTGGGAIEVAIDHTDITFGVEDTADRDNKNWTNDSNERWMNHSISNHT ITIRGAEELEEETNDEDITDIENKIARRLADRKQRLSQANNKRDTSSDADYENDDDATA AAGIGGI" regulatory 1726..1737 /regulatory_class="terminator" /gene="N" /locus_tag="Hpv1gp01" gene 1741..3663 /gene="P" /locus_tag="Hpv1gp02" /db_xref="GeneID:935260" mRNA 1741..3633 /gene="P" /locus_tag="Hpv1gp02" /product="phosphoprotein" /note="also transcript for C' protein, C protein, Y1 protein, and putative Y2 protein" /db_xref="GeneID:935260" gene 1809..2468 /gene="C'" /locus_tag="Hpv1gp03" /db_xref="GeneID:935259" CDS 1809..2468 /gene="C'" /locus_tag="Hpv1gp03" /experiment="experimental evidence, no additional details recorded" /codon_start=1 /transl_except=(pos:1809..1811,aa:Met) /product="C' protein" /protein_id="NP_604434.1" /db_xref="GeneID:935259" /translation="MDTSASKTLLPEWIRMPSFLRGILKPKERHHENKNHSQMSSDSLT SSYPTSPQKLEKTEAGSMVSSTTQKKTSHHAKPTITTKTEQSQRRPKIIDQVRGVESLG EQVSQKQRHMLESLINKVYTGPLGEELVQTLYLRIWAMKETPESTKILQMREDIRDQYL RMKTERWLRTLIRGKKTKLRDFQKRYEEVHPYLMMERVEQIIMEEAWKLAAHIVQE" CDS 1844..3550 /gene="P" /locus_tag="Hpv1gp02" /codon_start=1 /product="phosphoprotein" /protein_id="NP_604435.1" /db_xref="GeneID:935260" /translation="MDQDAFFFERDPEAEGEAPRKQESLSDVIGLLDVVLSYKPTEIGE DRSWLHGIIDNPKENKPSCKADDNNKDRAISTSTQDHRSSEGSGISRRTSESKTETHAR ILDQQGIHRASRRGTSPNPLPENMGNERNTRIDEDSPNERRHQRSVLTDEDRKMAENSN KREEDQVEGFPEEVRRSTPLSDDGEGRTNNNGRSMETSSTHSTRITDVITNPSPELEDA VLQRNKRRPTTIKRNQTRSERTQSSELHKSTSENSSNLEDHNTKTSPKVPPSKNEESAA TPKNNHNHRKTRYTTNNANNNTKSPPTPEHDATANEEETSNTSVDEMAKLLVSLGVMKS QHEFELSRSASHVFAKRMLKSANYKEMTFNLCGMLISVEKSLENKVEENRTLLKQIQEE INSSRDLHKRFSEYQKEQNSLMMANLSTLHIITDRGGKTGNPSDTTRSPSVFTKGKDNK VKKTRFDPSMEALGGQEFKPDLIREDELRDDIKNPVLEENNNEPQASNASRLIPSTEKH TLHSLKLVIENSPLSRVEKKAYIKSLYKCRTNQEVKNVMELFEEDIDSLTN" gene 1854..2468 /gene="C" /locus_tag="Hpv1gp04" /db_xref="GeneID:935261" CDS 1854..2468 /gene="C" /locus_tag="Hpv1gp04" /codon_start=1 /product="C protein" /protein_id="NP_604436.1" /db_xref="GeneID:935261" /translation="MPSFLRGILKPKERHHENKNHSQMSSDSLTSSYPTSPQKLEKTEA GSMVSSTTQKKTSHHAKPTITTKTEQSQRRPKIIDQVRGVESLGEQVSQKQRHMLESLI NKVYTGPLGEELVQTLYLRIWAMKETPESTKILQMREDIRDQYLRMKTERWLRTLIRGK KTKLRDFQKRYEEVHPYLMMERVEQIIMEEAWKLAAHIVQE" gene 1923..2468 /gene="Y1" /locus_tag="Hpv1gp05" /db_xref="GeneID:935262" CDS 1923..2468 /gene="Y1" /locus_tag="Hpv1gp05" /note="expressed via a ribosomal shunting mechanism" /codon_start=1 /product="Y1 protein" /protein_id="NP_604437.1" /db_xref="GeneID:935262" /translation="MSSDSLTSSYPTSPQKLEKTEAGSMVSSTTQKKTSHHAKPTITTK TEQSQRRPKIIDQVRGVESLGEQVSQKQRHMLESLINKVYTGPLGEELVQTLYLRIWAM KETPESTKILQMREDIRDQYLRMKTERWLRTLIRGKKTKLRDFQKRYEEVHPYLMMERV EQIIMEEAWKLAAHIVQE" gene 1941..2468 /gene="Y2" /locus_tag="Hpv1gp06" /db_xref="GeneID:935263" CDS 1941..2468 /gene="Y2" /locus_tag="Hpv1gp06" /inference="non-experimental evidence, no additional details recorded" /note="expressed via a ribosomal shunting mechanism" /codon_start=1 /transl_except=(pos:1941..1943,aa:Met) /product="putative Y2 protein" /protein_id="NP_604438.1" /db_xref="GeneID:935263" /translation="MSSYPTSPQKLEKTEAGSMVSSTTQKKTSHHAKPTITTKTEQSQR RPKIIDQVRGVESLGEQVSQKQRHMLESLINKVYTGPLGEELVQTLYLRIWAMKETPES TKILQMREDIRDQYLRMKTERWLRTLIRGKKTKLRDFQKRYEEVHPYLMMERVEQIIME EAWKLAAHIVQE" regulatory 3622..3633 /regulatory_class="terminator" /gene="P" /locus_tag="Hpv1gp02" gene 3637..4809 /gene="M" /locus_tag="Hpv1gp07" /db_xref="GeneID:935264" mRNA 3637..4809 /gene="M" /locus_tag="Hpv1gp07" /product="matrix protein" /db_xref="GeneID:935264" CDS 3669..4715 /gene="M" /locus_tag="Hpv1gp07" /codon_start=1 /product="matrix protein" /protein_id="NP_604439.1" /db_xref="GeneID:935264" /translation="MAETYRFPRFSHEENGTVEPLPLKTGPDKKAIPHIRIVKVGDPPK HGVRYLDVLLLGFFETPKQGPLSGSISDLTESTSYSICGSGSLPIGIAKYYGTDQELLK ACIDLKITVRRTVRSGEMIVYMVDSIHAPLLPWSSRLRQGMIYNANKVALAPQCLPVDK DIRFRVVFVNGTSLGTITIAKVPKTLADLALPNSISVNLLVTLRAGVSTEQKGILPVLD DDGEKKLNFMVHLGIIRRKVGKIYSVEYCKNKIEKMKLIFSLGLVGGISFHVHATGTLS KTLMSQLAWKKAVCYPLMDVNPHMNLVIWAASVEITSVDAVFQPAIPKEFRYYPNVVAK SIGKIRRI" regulatory 4798..4809 /regulatory_class="terminator" /gene="M" /locus_tag="Hpv1gp07" gene 4813..6843 /gene="F" /locus_tag="Hpv1gp08" /db_xref="GeneID:935265" mRNA 4813..6843 /gene="F" /locus_tag="Hpv1gp08" /product="F glycoprotein" /db_xref="GeneID:935265" CDS 5088..6755 /gene="F" /locus_tag="Hpv1gp08" /note="fusion glycoprotein" /codon_start=1 /product="F glycoprotein" /protein_id="NP_604440.1" /db_xref="GeneID:935265" /translation="MQKSEILFLVYSSLLLSSSLCQIPVEKLSNVGVIINEGKLLKIAG SYESRYIVLSLVPSIDLQDGCGTTQIIQYKNLLNRLLIPLKDALDLQESLITITNDTTV TNDNPQTRFFGAVIGTIALGVATAAQITAGIALAEAREARKDIALIKDSIVKTHNSVEL IQRGIGEQIIALKTLQDFVNDEIRPAIGELRCETTALKLGIKLTQHYSELATAFSSNLG TIGEKSLTLQALSSLYSANITEILSTTKKDKSDIYDIIYTEQVKGTVIDVDLEKYMVTL LVKIPILSEIPGVLIYRASSISYNIEGEEWHVAIPNYIINKASSLGGADVTNCIESKLA YICPRDPTQLIPDNQQKCILGDVSKCPVTKVINNLVPKFAFINGGVVANCIASTCTCGT NRIPVNQDRSRGVTFLTYTNCGLIGINGIELYANKRGRDTTWGNQIIKVGPAVSIRPVD ISLNLASATNFLEESKTELMKARAIISAVGGWHNTESTQIIMIIIVCILIIIICGILYY LYRVRRLLVMINSTHNSPVNAYTLESRMRNPYMGNNSN" regulatory 6832..6843 /regulatory_class="terminator" /gene="F" /locus_tag="Hpv1gp08" gene 6847..8740 /gene="HN" /locus_tag="Hpv1gp09" /db_xref="GeneID:935266" mRNA 6847..8740 /gene="HN" /locus_tag="Hpv1gp09" /product="HN glycoprotein" /db_xref="GeneID:935266" CDS 6903..8630 /gene="HN" /locus_tag="Hpv1gp09" /note="hemagglutinin-neuraminidase glycoprotein; major antigenic determinant" /codon_start=1 /product="HN glycoprotein" /protein_id="NP_604441.1" /db_xref="GeneID:935266" /translation="MAEKGKTNSSYWSTTRNDNSTVNTHINTPAGRTHIWLLIATTMHT VLSFIIMILCIDLIIKQDTCMKTNIMTVSSMNESAKIIKETITELIRQEVISRTINIQS SVQSGIPILLNKQSRDLTQLIEKSCNRQELAQICENTIAIHHADGISPLDPHDFWRCPV GEPLLSNNPNISLLPGPSLLSGSTTISGCVRLPSLSIGDAIYAYSSNLITQGCADIGKS YQVLQLGYISLNSDMYPDLNPVISHTYDINDNRKSCSVIAAGTRGYQLCSLPTVNETTD YSSEGIEDLVFDILDLKGKTKSHRYKNEDITFDHPFSAMYPSVGSGIKIENTLIFLGYG GLTTPLQGDTKCVINRCTNVNQSVCNDALKITWLKKRQVVNVLIRINNYLSDRPKIVVE TIPITQNYLGAEGRLLKLGKKIYIYTRSSGWHSNLQIGSLDINNPMTIKWAPHEVLSRP GNQDCNWYNRCPRECISGVYTDAYPLSPDAVNVATTTLYANTSRVNPTIMYSNTSEIIN MLRLKNVQLEAAYTTTSCITHFGKGYCFHIVEINQASLNTLQPMLFKTSIPKICKITS" regulatory 8729..8740 /regulatory_class="terminator" /gene="HN" /locus_tag="Hpv1gp09" gene 8744..15543 /gene="L" /locus_tag="Hpv1gp10" /db_xref="GeneID:935267" mRNA 8744..15543 /gene="L" /locus_tag="Hpv1gp10" /product="L polymerase protein" /db_xref="GeneID:935267" CDS 8772..15443 /gene="L" /locus_tag="Hpv1gp10" /note="RNA polymerase" /codon_start=1 /product="L polymerase protein" /protein_id="NP_604442.1" /db_xref="GeneID:935267" /translation="MDKQESTQNSSDILYPECHLNSPIVKSKIAQLHVLLDINQPYDLK DNSIINITKYKIRNGGLSPRQIKIRSLGKILKQEIKDIDRYTFEPYPIFSLELLRLDIP EICDKIRSIFSVSDRLIRELSSGFQELWLNILRQLGCVEGKEGFDSLKDVDIIPDITDK YNKNTWYRPFLTWFSIKYDMRWMQKNKSGNHLDVSNSHNFLDCKSYILIIYRDLVIIIN KLKLTGYVLTPELVLMYCDVVEGRWNMSSAGRLDKRSSKITCKGEELWELIDSLFPNLG EDVYNIISLLEPLSLALIQLDDPVTNLKGAFMRHVLTELHTILIKDNIYTDSEADSIME SLIKIFRETSIDEKAEIFSFFRTFGHPSLEAITAADKVRTHMYSSKKIILKTLYECHAI FCAIIINGYRERHGGQWPPCEFPNHVCLELKNAQGSNSAISYECAVDNYSSFIGFKFLK FIEPQLDEDLTIYMKDKALSPRKAAWDSVYPDSNLYYKVPESEETRRLIEVFINDNNFN PADIINYVESGEWLNDDSFNISYSLKEKEIKQEGRLFAKMTYKMRAVQVLAETLLAKGV GELFSENGMVKGEIDLLKRLTTLSVSGVPRSNSVYNNPILHEKLIKNMNKCNSNGYWDE RKKSKNEFKAADSSTEGYETLSCFLTTDLKKYCLNWRFESTALFGQRCNEIFGFKTFFN WMHPILEKSTIYVGDPYCPVPDRMHKELQDHDDTGIFIHNPRGGIEGYCQKLWTLISIS AIHLAAVKVGVRVSAMVQGDNQAIAVTSRVPVTQTYKQKKTHVYEEITRYFGALREVMF DIGHELKLNETIISSKMFVYSKRIYYDGKILPQCLKALTRCVFWSETLVDENRSACSNI ATSIAKAIENGYSPILGYCIALFKTCQQVCISLGMTINPTITSTIKDQYFKGKNWLRCA ILIPANIGGFNYMSTARCFVRNIGDPAVAALADLKRFIKAGLLDKQVLYRVMNQEPGDS SFLDWASDPYSCNLPHSQSITTIIKNVTARSVLQESPNPLLSGLFSESSSEEDLNLASF LMDRKAILPRVAHEILDNSLTGVREAIAGMLDTTKSLVRASVRRGGLSYSILRRLINYD LLQYETLTRTLRKPVKDNIEYEYMCSVELAIGLRQKMWFHLTYGRPIHGLETPDPLELL RGSFIEGSEICKFCRSEGNNPMYTWFYLPDNIDLDTLSNGSPAIRIPYFGSATDERSEA QLGYVKNLSKPAKAAIRIAMVYTWAYGTDEISWMEAALIAQTRANLSLENLKLLTPVST STNLSHRLRDTATQMKFSSATLVRASRFITISNDNMALKEAGESKDTNLVYQQIMLTGL SLFEFNMRYKQGSLSKPMILHLHLNNKCCIIESPQELNIPPRSTLDLEITQENNKLIYD PDPLKDIDLELFSKVRDVVHTIDMNYWSDDEIIRATSICTAMTIADTMSQLDRDNLKEM IALINDDDINSLITEFMVIDIPLFCSTFGGILINQFAYSLYGLNVRGRDEIWGYVIRII KDTSHAVLKVLSNALSHPKIFKRFWDAGVVEPVYGPNLSNQDKILLAISVCEYSVDLFM RDWQEGIPLEIFICDNDPNIAEMRKLSFLARHLAYLCSLAEIAKEGPKLESMTSLERLE SLKEYLELTFLDDPILRYSQLTGLVIKIFPSTLTYIRKSSIKVLRVRGIGIPEVLEDWD PDADSMLLDNITAEVQHNIPLKKNERTPFWGLRVSKSQVLRLRGYEEIKREERGRSGVG LTLPFDGRYLSHQLRLFGINSTSCLKALELTYLLNPLVNKDKDRLYLGEGAGAMLSCYD ATLGPCMNYYNSGVNSCDLNGQRELNIYPSEVALVGKKLNNVTSLCQRVKVLFNGNPGS TWIGNDECETLIWNELQNNSIGFIHCDMEGGEHKCDQVVLHEHYSVIRIAYLVGDKDVI LVSKIAPRLGTDWTKQLSLYLRYWRDVSLIVLKTSNPASTEMYLISKDPKSDIIEDSNT VLANLLPLSKEDSIKIEKWILVEKAKVHDWIVRELKEGSASSGMLRPYHQALQIFGFEP NLNKLCRDFLSTLNIVDTKNCIITFDRVLRDTIFEWTRIKDADKKLRLTGKYDLYPLRD SGKLKVISRRLVISWIALSMSTRLVTGSFPDIKFESRLQLGIVSISSREIKNLRVISKI VIDKFEDIIHSVTYRFLTKEIKILMKILGAVKLFGARQSTSADITNIDTSDSIQ" regulatory 15532..15543 /regulatory_class="terminator" /gene="L" /locus_tag="Hpv1gp10" ORIGIN 1 accaaacaag aggaaaaact tgtttggaat atataataat attaaatagt attttagggt 61 taaagtaata ctttaaaggg acaagtcaca gacatttgat cttagtataa atttttataa 121 tggccgggct actaagtact tttgacacat ttagttccag gagaagtgag agcatcaata 181 agtctggcgg aggagcaatt atacctggtc aaagaagtac cgtttctgtc ttcacattag 241 gcccgagtgt gacagatgat gcagataaat tattaatagc aaccactttc ttagcccact 301 cattggacac agataaacaa cactctcaaa gaggaggatt tttagtatca ctccttgcaa 361 tggcctacag tagtccggag ttatatctca ctacaaacgg tgtcaatgct gatgtcaagt 421 atgtgatata caatatagag agagatccta aaagaacaaa aacagatggg ttcattgtca 481 aaacaagaga catggagtat gaaagaacaa cagagtggtt gtttggacct atgatcaaca 541 agaacccatt gttccaaggg caaagagaga atgcggatct agaggcattg cttcagacat 601 atggatatcc tgcatgtctc ggagctataa tagttcaagt ttggatagtg ttggttaaag 661 ccataacaag tagtgctggt ctaagaaaag gattcttcaa tagattagaa gcattcagac 721 aggatggaac cgttaaaagt gccctggtct tcacaggaga cacagttgaa ggtattggtg 781 cagtgatgag gtcacaacaa agcttagtat ctcttatggt agaaactctg gtgactatga 841 acacatccag gtcagattta actacattag agaagaacat tcagattgta ggaaattaca 901 taagagatgc aggattagca tctttcatga acaccatcaa gtatggtgta gaaacgaaga 961 tggccgccct gacactatca aatctgagac cagatataaa caaattgaga agccttgttg 1021 atatctatct atcaaaggga gcccgagccc cttttatatg tatactcaga gacccagttc 1081 atggagactt tgcccctgga aactatccag cactgtggag ctacgcaatg ggcgttgctg 1141 tggtacaaaa caaagctatg caacagtatg taactggaag aacatatttg gacatggaaa 1201 tgttcctact tggacaagct gtagctaaag atgctgattc caaaatcagc agtgctctgg 1261 aggaagaact aggtgtgaca gatacagcaa aagagagact aagacaccat ctgacaaacc 1321 tttcaggagg ggatggtgcg taccacaagc ctacaggtgg tggagctata gaagtggcaa 1381 ttgatcatac agacataaca tttggagtcg aggacactgc tgatcgggac aacaagaact 1441 ggacaaatga cagcaatgaa agatggatga atcactcgat cagcaaccac acaatcacga 1501 ttcgtggtgc agaagaactt gaagaagaga caaatgatga agacatcact gatatagaaa 1561 acaagattgc acgaaggctg gccgacagaa aacagagact aagccaggca aacaataaac 1621 gagacaccag cagtgatgct gactatgaga atgatgatga tgctacagcg gctgcaggga 1681 taggaggaat ttaacaggat acttggacaa tagaagccag atcaaaagta agaaaaactt 1741 agggtgaatg acaattcaca gatcagctca accagacacc accagcatac acgaaaccaa 1801 ccttcacagt ggatacctca gcatccaaaa ctctccttcc cgaatggatc aggatgcctt 1861 cttttttgag agggatcctg aagccgaagg agaggcacca cgaaaacaag aatcactctc 1921 agatgtcatc ggactccttg acgtcgtctt atcctacaag cccacagaaa ttggagaaga 1981 cagaagctgg ctccatggta tcatcgacaa cccaaaagaa aacaagccat catgcaaagc 2041 cgacgataac aacaaagaca gagcaatctc aacgtcgacc caagatcata gatcaagtga 2101 ggggagtgga atctctagga gaacaagtga gtcaaaaaca gagacacatg ctagaatcct 2161 tgatcaacaa ggtatacaca gggcctctag gcgaggaact agtccaaacc ctctacctga 2221 gaatatgggc aatgaaagaa acaccagaat cgacgaagat tctccaaatg agagaagaca 2281 tcagagatca gtacttacgg atgaagacag aaagatggct gagaactcta ataagaggga 2341 agaagaccaa gttgagggat ttccagaaga ggtacgaaga agtacaccct tatctgatga 2401 tggagagggt agaacaaata ataatggaag aagcatggaa actagcagca cacatagtac 2461 aagaataact gatgtcatta ccaacccaag tccagagctt gaagatgccg ttctacaaag 2521 gaacaaaaga cggccgacga ccatcaagcg taaccaaaca agatcagaga gaacacagag 2581 ttcagaactc cacaaatcaa caagtgaaaa tagctccaac ctcgaagacc acaacaccaa 2641 aaccagccca aaagttccac cgtcaaagaa cgaagagtca gcagccactc caaagaacaa 2701 ccacaaccac agaaaaacaa gatacacaac aaacaatgca aacaacaaca caaaaagtcc 2761 accaactccc gaacacgacg caaccgcaaa tgaagaggaa accagcaaca catcggtcga 2821 tgagatggcc aagttattag taagtcttgg tgtaatgaaa tcacaacatg aatttgaatt 2881 atctaggagt gcaagtcatg tatttgctaa gcgcatgtta aaatctgcaa attacaaaga 2941 aatgacattt aatctctgtg gtatgcttat atcagttgaa aaatcacttg agaataaagt 3001 agaagaaaat agaacattac ttaaacaaat tcaagaggaa ataaattcat ccagggatct 3061 tcacaaacgg ttctcggaat accaaaaaga acagaactca ctcatgatgg ccaatctatc 3121 cacactccat ataattacag atagaggcgg gaaaacggga aatcccagtg atactacaag 3181 gtcaccatca gtcttcacaa aagggaaaga caataaggtc aaaaagacaa ggtttgaccc 3241 ctctatggaa gctctaggag gtcaagagtt caagcctgac ttgataagag aggatgaact 3301 gagagatgac atcaaaaatc cggtactaga agaaaacaac aatgagcctc aagcatccaa 3361 tgcatcacgc ctgattccgt ccactgaaaa acacactctg cactcactca aactagttat 3421 cgaaaacagt cctctaagca gagtagagaa gaaggcttac atcaaatccc tttataagtg 3481 tcggacaaac caagaggtta aaaatgtaat ggagctattc gaggaagaca tagattcact 3541 aactaactaa acatgaatct acaatttcaa ccagcaatca aaatcaatat ccagagccaa 3601 ctcaaaaagc tccctcaaaa caattaagaa aaacttaggg tcaaagaaat tttgcccgga 3661 gaaaggaaat ggctgaaaca tacaggttcc ccagattctc acacgaagaa aatgggacag 3721 tagaacctct ccctctcaaa acaggtcctg acaaaaaagc aatccctcac atcagaatag 3781 tcaaggtagg agatcctcca aaacatggag tcaggtatct tgatgtgcta ctattgggat 3841 tctttgaaac acctaagcaa ggacctctat ctggcagcat atctgatctc acagaatcaa 3901 ccagttattc aatctgtgga tccggatcct taccaattgg catagccaag tattacggca 3961 cagatcaaga attattaaaa gcctgcattg acctcaaaat aactgtacga agaacagtta 4021 gatctggaga aatgatagta tacatggtag attcgatcca tgctcctcta ctaccatggt 4081 ccagccgact gagacaaggg atgatatata atgccaataa agtagctcta gcacctcaat 4141 gtctcccagt cgacaaagac atcagattca gggttgtatt tgtcaatgga acatcactag 4201 gtacaattac aattgctaag gtcccaaaaa ctcttgcaga tcttgcatta ccgaactcaa 4261 tatcagtgaa tctgctggtt acacttaggg caggagtatc aacggaacaa aaaggaatcc 4321 tccccgttct agacgatgat ggagaaaaga agctcaactt catggtacac ctaggaatca 4381 taagaagaaa agttgggaag atatattcag ttgaatactg caaaaataaa attgagaaga 4441 tgaagctaat attctctctc gggcttgtag gtggaataag tttccatgta catgcaacag 4501 gcacattatc caaaactcta atgagccaac ttgcatggaa aaaagcagtt tgctatcctt 4561 taatggatgt aaatccacat atgaatctag tcatctgggc agcttcagta gaaatcacaa 4621 gtgtcgatgc tgtgttccaa cctgcaattc cgaaagaatt tcgctattac ccaaatgttg 4681 ttgcaaaaag catcgggaaa atcaggagga tataagtcta cactcctcaa taatgacacc 4741 cattagctct aaatcgtacc attaatcaaa tacagatcaa ttcgatacaa tcagttcaaa 4801 taagaaaaac gtagggacaa agtcctctac caacatcaag gaagacaaga gtctcaaaaa 4861 gctcagccta agcagagaga aaaacaacaa cacaaagaaa gaaaaggaca agatcacaaa 4921 caagaacaaa agcaaaaaca aaaacaagaa caaaaaaggg aagaaaaaca aaagtataca 4981 caaaaaccaa aaaagaaaaa aggccagaga caaaaacgga ggcaagaaca aaaatttaaa 5041 caaaaacaga atttaaattc ataataaaca ccaagataga gacaaaaatg caaaaatcag 5101 agatcctctt cttagtatac tcaagcttgc tattatcttc atcattatgt caaattccgg 5161 tagaaaaact ttcaaatgta ggggttataa tcaatgaggg caaattactt aaaatagcag 5221 gatcttatga atctagatac atagtgttaa gcttggtacc ttcaattgac ctacaagatg 5281 gatgtggaac aactcaaatt attcaataca agaatttatt aaatagactt ctaattcctc 5341 tgaaggatgc cttggatctt caggaatccc tgataacaat aactaatgac accactgtga 5401 caaatgataa tccacaaact agattctttg gtgctgtcat tggtaccata gcactaggag 5461 tagccacagc tgctcaaata actgcgggca ttgcattagc tgaagcacga gaagccagga 5521 aggacatagc actaataaaa gattccatag tcaagacaca caattctgta gaactcattc 5581 aaagaggtat aggagaacag ataattgcat taaagacatt acaagatttt gtaaatgacg 5641 agataagacc tgcaatagga gaactaaggt gtgagactac ggcattgaaa ctagggatca 5701 agctcaccca acactactct gaattagcaa cagcattcag ctccaatctt gggactatag 5761 gagaaaaaag tcttaccttg caggcattat catctctcta ctctgctaat ataacagaaa 5821 ttctaagtac aactaaaaag gataaatcag atatatatga catcatttac actgaacagg 5881 ttaagggaac tgtgatagat gttgatttgg aaaaatacat ggttaccctc ttagttaaaa 5941 taccaatttt atcagaaata ccaggcgtgt tgatatacag agcttcatct atatcttata 6001 atattgaagg agaagaatgg catgtcgcaa tcccaaatta cataatcaat aaggcatcat 6061 ccttaggagg tgcagatgtc acaaactgta tagaatcaaa attggcatat atatgtccta 6121 gagatcctac acaattaata cctgataacc aacagaagtg tatactcggg gatgtatcaa 6181 agtgccctgt gactaaagta ataaacaatc tagtaccaaa gttcgcattc atcaatggtg 6241 gtgtagtggc taattgcatt gcatccacat gtacatgcgg gacaaacaga ataccagtga 6301 atcaagatcg ctcaagagga gttacattct tgacctatac caattgcggt ttaataggta 6361 taaatggaat agaactatat gccaataaaa ggggacgaga cactacttgg gggaatcaaa 6421 tcatcaaagt gggtccagca gtctccatta gacctgtaga catttcttta aatcttgcat 6481 ctgccacaaa tttcctagag gaatccaaga cagagctcat gaaggcaagg gcaatcatat 6541 cagcagttgg aggatggcac aacacagaga gtactcagat aatcatgata ataattgtgt 6601 gcatacttat aataatcata tgtggtatat tatactatct atacagggtt agaagactat 6661 tagtaatgat taattcaact cataattcac ctgttaatgc ttatactctg gagtcaagaa 6721 tgagaaatcc ctacatgggt aacaactcca attaaaaaat cagatcaagt acattgtagc 6781 atacatacaa caatcaaatc tatccacaac ttcaccaatc aggtgtacaa caagtaagaa 6841 aaacttaggg ttaaagacaa tccagtcaac ctataaggca acagcatccg attatacaaa 6901 cgatggctga aaaagggaaa acaaatagtt catattggtc tacaacccga aatgacaatt 6961 ccacggtaaa cacacacatt aatacaccag caggaaggac acacatctgg ctactgattg 7021 caacaacaat gcatacagta ttgtccttca ttatcatgat cctatgcatt gacctaatta 7081 taaaacaaga cacttgtatg aagacaaaca tcatgacagt atcctccatg aacgaaagtg 7141 ccaaaataat caaagagaca atcacagaat taatcagaca agaagtaata tcaaggacca 7201 taaacataca aagttcagta caaagcggga tcccaatatt gttaaacaag caaagcagag 7261 atctcacaca attaatagag aagtcatgca acagacagga attggctcag atatgcgaaa 7321 acaccattgc tattcaccat gcagacggca tatctcctct ggacccacac gatttctgga 7381 gatgtcccgt aggggaaccc ctactgagca acaaccccaa tatctcatta ttacctggac 7441 caagtctact ttctggatcc accacaattt caggatgtgt tagactacct tcattatcaa 7501 ttggtgatgc aatatatgcg tattcatcaa acttaatcac tcaaggatgt gcagatatag 7561 ggaagtcata tcaggtttta caattaggtt acatatcctt aaattcagat atgtatcctg 7621 atttaaaccc ggtaatttct catacctatg acatcaacga caacaggaaa tcatgttctg 7681 taatagctgc aggaacaagg ggttatcagt tatgctcctt gcccactgtg aatgagacta 7741 cagactactc gagtgaaggt atagaagatt tagtatttga catattagat ctcaagggaa 7801 agaccaaatc tcatcgatac aaaaatgaag atataacttt tgaccatcct ttttctgcaa 7861 tgtatccgag tgtaggaagt gggataaaaa ttgaaaatac actcattttc ctagggtacg 7921 gtggcttaac aactccgctc caaggcgaca ctaagtgtgt gataaacaga tgtaccaatg 7981 ttaatcagag tgtttgcaat gatgctctta agataacttg gctaaagaaa agacaagttg 8041 tcaatgtctt aattcgtatc aataattatt tatctgatag gccaaagatt gttgtcgaga 8101 caattccaat aactcaaaat tacttaggtg ccgaaggtag gctacttaaa ctaggtaaaa 8161 agatctacat atatactaga tcttcaggtt ggcactccaa cctgcaaata ggatcattag 8221 atatcaacaa ccccatgacc attaaatggg cgcctcatga agtcctgtct cgaccaggaa 8281 accaagactg caactggtac aacagatgtc cgagagaatg catatcaggt gtatatactg 8341 atgcatatcc actatctcct gatgcagtca atgttgctac aaccacactg tacgcaaaca 8401 catcacgtgt taatcccacc ataatgtact caaatacctc agaaattatc aacatgctaa 8461 gactcaagaa tgtacaacta gaggcagcat acactactac atcatgtatc actcatttcg 8521 ggaagggcta ctgcttccac attgttgaaa tcaaccaagc cagccttaat accttacaac 8581 ctatgttgtt caagacaagt atccctaaaa tatgtaaaat cacatcttga gcagatcaag 8641 acccaacact atatcaatta tgtgaaaacc agatatgatg tataaaaatt taaaaacaaa 8701 gcatgaatag acatttatat gacaaataga ataagaaaaa cttagggtta atgcctgcct 8761 atttgtcaaa tatggataaa caggagtcaa ctcagaattc ctcagacatc ttatatccag 8821 aatgtcactt gaactctccg attgtaaaaa gcaagattgc tcaacttcac gttttgctag 8881 atatcaatca accctatgat ttaaaagata acagtataat aaatatcacc aaatacaaaa 8941 tcagaaatgg aggtttatcg ccccggcaga tcaaaatcag atcgctaggc aaaatcctta 9001 aacaagaaat taaggatatt gatcgttaca cttttgaacc ttatccgatt ttctcattag 9061 agttactcag actggatatc ccagaaatat gtgacaaaat aagatccatt ttttcagtct 9121 ctgatagatt aataagagaa ctatcatctg gatttcaaga attgtggtta aatattctta 9181 gacaattagg ctgtgttgaa gggaaagagg gatttgactc attaaaggat gtagatatca 9241 tcccagatat aactgataaa tataataaaa acacatggta tcgcccattc ttaacatggt 9301 ttagcatcaa atatgatatg agatggatgc aaaagaataa gtcggggaac catttagatg 9361 tctcaaattc tcacaatttt cttgactgta aatcatatat tttgattata tatagagatt 9421 tagtgataat aataaataaa ttaaaattaa ccggttatgt ccttacacct gaattagtat 9481 taatgtattg tgatgttgtc gaaggaagat ggaatatgtc ttcagctgga cgactcgata 9541 aaaggtcatc aaaaataaca tgtaaggggg aagaattatg ggagcttatc gactctttat 9601 ttcccaatct tggtgaggat gtatataata ttatatcact actagaacct ttatcacttg 9661 ctttaataca gttggatgac cctgtaacta atttaaaagg agctttcatg agacatgttt 9721 tgactgagct acatacaatt ttaataaaag ataatatata cacagattca gaagcagaca 9781 gcataatgga atcattgata aagattttca gagagacatc aattgatgaa aaagcagaaa 9841 ttttctcctt ttttagaacg tttggacatc ctagcttaga agcaataact gctgccgata 9901 aagtaaggac acatatgtat tcctccaaaa aaatcatact aaagacacta tatgagtgtc 9961 atgcaatctt ctgtgcaatt ataataaacg gatatagaga aagacacggt ggtcaatggc 10021 cgccatgcga attccccaat catgtatgtc ttgaactcaa gaatgcacaa ggatccaact 10081 ctgcaatttc gtatgaatgt gccgtagaca attatagtag ttttatagga tttaaatttt 10141 taaaatttat tgagcctcaa ttagatgaag atttgacaat ttatatgaag gataaggctc 10201 tatcacctag gaaagcagca tgggattcag tatatcccga cagtaattta tattacaaag 10261 tccctgaatc agaagagact cgtaggttaa tcgaggtttt tataaatgat aataatttta 10321 accctgcgga tattattaat tatgtagagt caggagaatg gttaaatgac gatagcttca 10381 acatatctta cagtctcaaa gaaaaagaaa ttaaacaaga gggtcgactc tttgccaaga 10441 tgacatataa gatgagagca gtccaggtat tagcagaaac actactagca aaaggagtag 10501 gtgagttatt cagtgaaaat gggatggtaa agggagaaat tgacctacta aagagactga 10561 ctacattatc tgtctcaggt gttccaagat ccaactcagt ttacaataat cccatattac 10621 atgagaaatt gatcaaaaat atgaataagt gcaattcaaa tgggtattgg gatgaaagaa 10681 agaaatctaa aaatgaattc aaagctgcag actcatcaac cgaggggtat gagactctga 10741 gctgtttttt aaccaccgat ttgaaaaaat actgtctcaa ctggagattt gaaagtacag 10801 cgttgttcgg tcaaagatgt aatgagatat tcgggtttaa aactttcttt aactggatgc 10861 accctattct agaaaaaagt acaatttatg taggagatcc ttactgtcca gtacctgata 10921 gaatgcacaa agaactccaa gatcatgatg ataccggaat ctttatccat aatccaagag 10981 ggggaataga gggttattgc cagaaattat ggacactaat ctctattagt gcaatccatc 11041 ttgcagctgt taaagttggt gtcagagtgt cagcaatggt acaaggagac aatcaagcta 11101 tagcagtgac atccagagtt cctgtcacac aaacctataa gcaaaaaaag actcacgtct 11161 atgaagaaat cacaagatat ttcggtgcct tgagagaagt tatgtttgat attggacatg 11221 aattaaaatt aaatgagacc attataagta gcaaaatgtt tgtatacagc aaacggatat 11281 attatgatgg gaaaatcctc ccacagtgcc tcaaagcttt aacaagatgt gtattttggt 11341 cagagactct tgtagatgaa aacaggtcag catgctcaaa cattgcaaca tctatagcca 11401 aagctattga gaatggatat tcacctatct taggctattg tattgctctt tttaaaactt 11461 gccaacaggt atgtatatca ttaggaatga ccattaatcc tactattacg tcaactatca 11521 aagatcaata ttttaaaggg aaaaattggt taagatgtgc aatattgatc ccagctaaca 11581 taggagggtt caactatatg tctacagcta gatgttttgt cagaaatata ggtgatccag 11641 cagttgcagc tctagcagac ttaaagagat tcatcaaagc aggtctgtta gataaacagg 11701 tattatatcg tgtgatgaat caagaaccag gagactcaag cttcttagat tgggcatcag 11761 acccttattc atgcaatctc ccacactcac aaagtataac aactataatc aaaaatgtaa 11821 cagctagatc agtattgcag gaatcaccta atcctctcct atcaggtctc ttttcagaat 11881 caagtagtga agaagatctc aacttagcat catttttgat ggataggaaa gccatattgc 11941 ccagagtagc tcacgagatc ttagataact cacttacagg tgtaagagaa gctatagccg 12001 ggatgcttga tacaacgaaa tctctagtaa gagctagtgt caggagagga ggattatcat 12061 atagtatctt aagaagactt ataaattatg atctattaca atatgagacc ttaacaagga 12121 cactcagaaa accggttaag gataatatag aatatgagta tatgtgttca gtagaattgg 12181 caataggatt gaggcaaaaa atgtggtttc atctaactta tggaagacca atccacggtt 12241 tagaaactcc agacccgtta gaattattaa gaggatcatt tattgaaggc tcagaaatat 12301 gtaaattttg tagatcagaa gggaataacc ctatgtatac ttggttctat cttcctgaca 12361 acatcgactt agatacactt agcaatggaa gtcctgccat acgtatccct tattttggtt 12421 ctgctactga tgaaagatca gaggctcaac taggttatgt taagaactta agcaagccgg 12481 caaaagcagc aataagaatc gcaatggttt acacttgggc ttatggaact gatgaaatat 12541 catggatgga agcagcactt atagctcaaa ccagggctaa cttaagttta gagaatttga 12601 agttactcac ccctgtatcg acttctacaa atttgtccca cagattgaga gatactgcta 12661 cacagatgaa attttcaagt gctactttag ttcgagcgag tcgatttatt accatatcta 12721 atgataatat ggcattaaaa gaggcaggag agtctaaaga tactaattta gtttatcaac 12781 aaattatgtt aaccggattg agcttatttg aattcaatat gaggtataaa caaggatcat 12841 tatctaaacc tatgatatta cacttacatt tgaataataa atgctgtatc atagaatctc 12901 ctcaagaatt gaatattcct cctagatcta cattggactt agagatcact caggaaaata 12961 acaagttaat ctatgatcct gatcctctca aggacataga tctagagtta tttagtaagg 13021 ttagggatgt agtacacaca attgatatga attattggtc tgatgatgaa ataattagag 13081 caactagtat atgtacagct atgactattg cagacacaat gtctcaatta gatagagaca 13141 atcttaaaga aatgatagca ctgataaatg atgatgatat aaatagttta atcaccgaat 13201 ttatggttat tgatataccc ttattttgtt ccactttcgg gggtattcta atcaatcaat 13261 ttgcatattc actttacggg ttaaacgtca gagggaggga tgaaatatgg ggatatgtga 13321 tacgcataat taaagacaca tcacatgcag tcctaaaagt actgtccaat gcattatcac 13381 atcctaaaat attcaaacga ttctgggatg caggagttgt agagcctgtt tatggaccta 13441 acttgtccaa tcaagacaag atactgttag ccatttcagt atgtgaatac tctgttgacc 13501 tcttcatgcg tgattggcaa gagggcatac cgcttgaaat atttatttgt gataacgacc 13561 caaatatagc agaaatgaga aaactttcat ttttagctag acatctagca tacttgtgta 13621 gtttggcaga gatagctaaa gagggaccaa aattggaatc tatgacatct ctcgaacgac 13681 tcgaatcatt gaaagagtat ctagaactta cttttttaga cgatcctata ttaagatata 13741 gtcaattgac aggcttagtt attaagatat tcccttcaac gttaacttac atcaggaaat 13801 cttcaattaa ggtgttgaga gtaagaggta tagggatacc agaagtctta gaggactggg 13861 atcctgatgc cgatagtatg ctactagata atataactgc tgaggttcaa cacaatatac 13921 ctttaaagaa gaacgaaaga actcccttct gggggttaag ggtatcaaaa tcacaagttc 13981 tgcgacttag aggttatgaa gagataaaaa gggaagaaag aggaagatca ggtgtaggat 14041 taactctacc ttttgatggg cgatatttat cacaccaatt gagacttttc gggattaata 14101 gcaccagttg tttgaaagca ttggaactta cctatttact gaatcctcta gtcaataagg 14161 ataaagatag attatatctc ggagaaggtg caggtgcaat gctgtcttgt tatgatgcta 14221 cattaggacc ctgcatgaac tattataatt caggtgttaa ttcttgtgat ctcaacggac 14281 aaagagaatt aaatatttat ccttcagaag tggcactggt agggaagaaa ttgaataatg 14341 tcacgagttt atgtcaaaga gttaaggttt tattcaatgg gaatcctgga tcaacttgga 14401 tagggaatga tgaatgtgaa acactaatct ggaatgaatt acagaataat tcaatagggt 14461 ttattcattg tgacatggaa ggtggagaac acaaatgtga tcaggtggtc ttacatgaac 14521 attatagtgt gatcaggatt gcataccttg ttggggataa ggacgttatc ttagtaagca 14581 aaattgcacc aagattaggt acagactgga caaaacaatt aagtttgtat ttaagatact 14641 ggagagatgt cagcttaata gtgttgaaaa catctaaccc agcctctaca gaaatgtatc 14701 tgatatcaaa agatcctaaa tctgatatta tagaggatag taatacagta ttggcaaacc 14761 ttcttccatt atctaaagag gatagtatta agatagaaaa atggattcta gttgagaaag 14821 ccaaagttca tgattggata gttagagaat taaaggaagg gagtgcatcg tcaggtatgc 14881 taagacctta ccatcaagca ttacaaatct tcggatttga gcctaattta aacaaattat 14941 gtagagattt cttatctaca ctaaatatag tagacacaaa aaattgtatt atcacatttg 15001 atagagtatt aagagataca atctttgagt ggactcggat aaaagacgca gataagaagc 15061 taagacttac aggtaaatat gatctatatc ctcttagaga ttcaggtaag ttaaaagtta 15121 tttctagaag gcttgtaata tcttggatag cattgtctat gtctacaaga ctagtaacag 15181 ggtcatttcc agacattaaa tttgaatcaa gactccaatt aggtatagta tcaatatcct 15241 ctcgtgaaat caaaaatctt agggttatat caaagattgt cattgacaaa tttgaagata 15301 ttatacatag tgtgacctat aggttcttga ctaaagaaat aaaaatattg atgaaaattt 15361 tgggagcagt caaattattt ggggcaagac agagcacatc tgctgatatc actaatatcg 15421 atacatcgga ctccatacaa tgatcttata tcttctcatc tttattatct aatttgttta 15481 aagagatgag ttaacaagat aagaaatccc tttaactgac tcataaaaac atagtaagaa 15541 aaacttacaa cagacaagag tattaataat atatcgatat ttcttaaact cttgtctggt // LOCUS NC_019843 30119 bp RNA linear VRL 13-AUG-2018 DEFINITION Middle East respiratory syndrome coronavirus, complete genome. ACCESSION NC_019843 VERSION NC_019843.3 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Middle East respiratory syndrome-related coronavirus (MERS-CoV) ORGANISM Middle East respiratory syndrome-related coronavirus Viruses; Riboviria; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Merbecovirus. REFERENCE 1 (bases 1 to 30119) AUTHORS van Boheemen,S., de Graaf,M., Lauber,C., Bestebroer,T.M., Raj,V.S., Zaki,A.M., Osterhaus,A.D., Haagmans,B.L., Gorbalenya,A.E., Snijder,E.J. and Fouchier,R.A. TITLE Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans JOURNAL MBio 3 (6), e00473-12 (2012) PUBMED 23170002 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 30119) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (22-JUL-2014) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 30119) AUTHORS van Boheemen,S., de Graaf,M., Lauber,C., Bestebroer,T.M., Victor,S.R., Zaki,A.M., Osterhaus,A.D.M.E., Haagmans,B.L., Gorbalenya,A.E., Snijder,E.J. and Fouchier,R.A.M. TITLE Direct Submission JOURNAL Submitted (16-OCT-2012) Viroscience Lab, Erasmus MC, Dr. Molewaterplein 50, Rotterdam 3015 GE, The Netherlands REMARK Sequence update by submitter REFERENCE 4 (bases 1 to 30119) AUTHORS van Boheemen,S., de Graaf,M., Lauber,C., Bestebroer,T.M., Victor,S.R., Zaki,A.M., Osterhaus,A.D.M.E., Haagmans,B.L., Gorbalenya,A.E., Snijder,E.J. and Fouchier,R.A.M. TITLE Direct Submission JOURNAL Submitted (26-SEP-2012) Viroscience Lab, Erasmus MC, Dr. Molewaterplein 50, Rotterdam 3015 GE, The Netherlands COMMENT ##Assembly-Data-START## Assembly Method :: CLC Genomics Workbench v. 5.5.1 Sequencing Technology :: 454; Sanger dideoxy sequencing ##Assembly-Data-END## REVIEWED REFSEQ: This record has been curated by NCBI staff. The reference sequence is identical to JX869059. On Jul 23, 2014 this sequence version replaced NC_019843.2. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..30119 /organism="Middle East respiratory syndrome-related coronavirus" /mol_type="genomic RNA" /strain="HCoV-EMC" /host="Homo sapiens" /db_xref="taxon:1335626" /collection_date="13-Jun-2012" gene 279..21514 /gene="orf1ab" /locus_tag="G128_gp01" /db_xref="GeneID:14254602" CDS join(279..13433,13433..21514) /gene="orf1ab" /locus_tag="G128_gp01" /ribosomal_slippage="" /note="ORF1ab product; pp1ab" /codon_start=1 /product="1AB polyprotein" /protein_id="YP_009047202.1" /db_xref="GeneID:14254602" /translation="MSFVAGVTAQGARGTYRAALNSEKHQDHVSLTVPLCGSGNLVEKL SPWFMDGENAYEVVKAMLLKKEPLLYVPIRLAGHTRHLPGPRVYLVERLIACENPFMVN QLAYSSSANGSLVGTTLQGKPIGMFFPYDIELVTGKQNILLRKYGRGGYHYTPFHYERD NTSCPEWMDDFEADPKGKYAQNLLKKLIGGDVTPVDQYMCGVDGKPISAYAFLMAKDGI TKLADVEADVAARADDEGFITLKNNLYRLVWHVERKDVPYPKQSIFTINSVVQKDGVEN TPPHYFTLGCKILTLTPRNKWSGVSDLSLKQKLLYTFYGKESLENPTYIYHSAFIECGS CGNDSWLTGNAIQGFACGCGASYTANDVEVQSSGMIKPNALLCATCPFAKGDSCSSNCK HSVAQLVSYLSERCNVIADSKSFTLIFGGVAYAYFGCEEGTMYFVPRAKSVVSRIGDSI FTGCTGSWNKVTQIANMFLEQTQHSLNFVGEFVVNDVVLAILSGTTTNVDKIRQLLKGV TLDKLRDYLADYDVAVTAGPFMDNAINVGGTGLQYAAITAPYVVLTGLGESFKKVATIP YKVCNSVKDTLAYYAHSVLYRVFPYDMDSGVSSFSELLFDCVDLSVASTYFLVRILQDK TGDFMSTIITSCQTAVSKLLDTCFEATEATFNFLLDLAGLFRIFLRNAYVYTSQGFVVV NGKVSTLVKQVLDLLNKGMQLLHTKVSWAGSKIIAVIYSGRESLIFPSGTYYCVTTKAK SVQQDLDVILPGEFSKKQLGLLQPTDNSTTVSVTVSSNMVETVVGQLEQTNMHSPDVIV GDYVIISEKLFVRSKEEDGFAFYPACTNGHAVPTLFRLKGGAPVKKVAFGGDQVHEVAA VRSVTVEYNIHAVLDTLLASSSLRTFVVDKSLSIEEFADVVKEQVSDLLVKLLRGMPIP DFDLDDFIDAPCYCFNAEGDASWSSTMIFSLHPVECDEECSEVEASDLEEGESECISET STEQVDVSHETSDDEWAAAVDEAFPLDEAEDVTESVQEEAQPVEVPVEDIAQVVIADTL QETPVVPDTVEVPPQVVKLPSAPQTIQPEVKEVAPVYEADTEQTQNVTVKPKRLRKKRN VDPLSNFEHKVITECVTIVLGDAIQVAKCYGESVLVNAANTHLKHGGGIAGAINAASKG AVQKESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDARAKQDVSLLSKCYKAMNA YPLVVTPLVSAGIFGVKPAVSFDYLIREAKTRVLVVVNSQDVYKSLTIVDIPQSLTFSY DGLRGAIRKAKDYGFTVFVCTDNSANTKVLRNKGVDYTKKFLTVDGVQYYCYTSKDTLD DILQQANKSVGIISMPLGYVSHGLDLMQAGSVVRRVNVPYVCLLANKEQEAILMSEDVK LNPSEDFIKHVRTNGGYNSWHLVEGELLVQDLRLNKLLHWSDQTICYKDSVFYVVKNST AFPFETLSACRAYLDSRTTQQLTIEVLVTVDGVNFRTVVLNNKNTYRSQLGCVFFNGAD ISDTIPDEKQNGHSLYLADNLTADETKALKELYGPVDPTFLHRFYSLKAAVHGWKMVVC DKVRSLKLSDNNCYLNAVIMTLDLLKDIKFVIPALQHAFMKHKGGDSTDFIALIMAYGN CTFGAPDDASRLLHTVLAKAELCCSARMVWREWCNVCGIKDVVLQGLKACCYVGVQTVE DLRARMTYVCQCGGERHRQLVEHTTPWLLLSGTPNEKLVTTSTAPDFVAFNVFQGIETA VGHYVHARLKGGLILKFDSGTVSKTSDWKCKVTDVLFPGQKYSSDCNVVRYSLDGNFRT EVDPDLSAFYVKDGKYFTSEPPVTYSPATILAGSVYTNSCLVSSDGQPGGDAISLSFNN LLGFDSSKPVTKKYTYSFLPKEDGDVLLAEFDTYDPIYKNGAMYKGKPILWVNKASYDT NLNKFNRASLRQIFDVAPIELENKFTPLSVESTPVEPPTVDVVALQQEMTIVKCKGLNK PFVKDNVSFVADDSGTPVVEYLSKEDLHTLYVDPKYQVIVLKDNVLSSMLRLHTVESGD INVVAASGSLTRKVKLLFRASFYFKEFATRTFTATTAVGSCIKSVVRHLGVTKGILTGC FSFAKMLFMLPLAYFSDSKLGTTEVKVSALKTAGVVTGNVVKQCCTAAVDLSMDKLRRV DWKSTLRLLLMLCTTMVLLSSVYHLYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGISS ACDGLASAYRANSFDVPTFCANRSAMCNWCLISQDSITHYPALKMVQTHLSHYVLNIDW LWFAFETGLAYMLYTSAFNWLLLAGTLHYFFAQTSIFVDWRSYNYAVSSAFWLFTHIPM AGLVRMYNLLACLWLLRKFYQHVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYI TANGGISFCRRHNWNCVDCDTAGVGNTFICEEVANDLTTALRRPINATDRSHYYVDSVT VKETVVQFNYRRDGQPFYERFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQ ESLARSACVYYSQVLCKSILLVDSSLVTSVGDSSEIATKMFDSFVNSFVSLYNVTRDKL EKLISTARDGVRRGDNFHSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQITN ESYNNYVPSYVKPDSVSTSDLGSLIDCNAASVNQIVLRNSNGACIWNAAAYMKLSDALK RQIRIACRKCNLAFRLTTSKLRANDNILSVRFTANKIVGGAPTWFNALRDFTLKGYVLA TIIVFLCAVLMYLCLPTFSMAPVEFYEDRILDFKVLDNGIIRDVNPDDKCFANKHRSFT QWYHEHVGGVYDNSITCPLTVAVIAGVAGARIPDVPTTLAWVNNQIIFFVSRVFANTGS VCYTPIDEIPYKSFSDSGCILPSECTMFRDAEGRMTPYCHDPTVLPGAFAYSQMRPHVR YDLYDGNMFIKFPEVVFESTLRITRTLSTQYCRFGSCEYAQEGVCITTNGSWAIFNDHH LNRPGVYCGSDFIDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRA FADYTQCAVIAVVAAVLNSLCICFVTSIPLCIVPYTALYYYATFYFTNEPAFIMHVSWY IMFGPIVPIWMTCVYTVAMCFRHFFWVLAYFSKKHVEVFTDGKLNCSFQDAASNIFVIN KDTYAALRNSLTNDAYSRFLGLFNKYKYFSGAMETAAYREAAACHLAKALQTYSETGSD LLYQPPNCSITSGVLQSGLVKMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVM CPADQLSDPNYDALLISMTNHSFSVQKHIGAPANLRVVGHAMQGTLLKLTVDVANPSTP AYTFTTVKPGAAFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVIN FCYMHQMELANGTHTGSAFDGTMYGAFMDKQVHQVQLTDKYCSVNVVAWLYAAILNGCA WFVKPNRTSVVSFNEWALANQFTEFVGTQSVDMLAVKTGVAIEQLLYAIQQLYTGFQGK QILGSTMLEDEFTPEDVNMQIMGVVMQSGVRKVTYGTAHWLFATLVSTYVIILQATKFT LWNYLFETIPTQLFPLLFVTMAFVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPTTPI SSALIAVANWLAPTNAYMRTTHTDIGVYISMSLVLVIVVKRLYNPSLSNFALALCSGVM WLYTYSIGEASSPIAYLVFVTTLTSDYTITVFVTVNLAKVCTYAIFAYSPQLTLVFPEV KMILLLYTCLGFMCTCYFGVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAPRNSW EAMALNFKLIGIGGTPCIKVAAMQSKLTDLKCTSVVLLSVLQQLHLEANSRAWAFCVKC HNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASDIFDTPSVLQATLSEFSHLAT FAELEAAQKAYQEAMDSGDTSPQVLKALQKAVNIAKNAYEKDKAVARKLERMADQAMTS MYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNARNGCIPLSVIPLCASNKL RVVIPDFTVWNQVVTYPSLNYAGALWDITVINNVDNEIVKSSDVVDSNENLTWPLVLEC TRASTSAVKLQNNEIKPSGLKTMVVSAGQEQTNCNTSSLAYYEPVQGRKMLMALLSDNA YLKWARVEGKDGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLNNLHRGQVLGHIAATV RLQAGSNTEFASNSSVLSLVNFTVDPQKAYLDFVNAGGAPLTNCVKMLTPKTGTGIAIS VKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPAQCVRDPVGFCLSNT PCNVCQYWIGYGCNCDSLRQAALPQSKDSNFLNRVRGSIVNARIEPCSSGLSTDVVFRA FDICNYKAKVAGIGKYYKTNTCRFVELDDQGHHLDSYFVVKRHTMENYELEKHCYDLLR DCDAVAPHDFFIFDVDKVKTPHIVRQRLTEYTMMDLVYALRHFDQNSEVLKAILVKYGC CDVTYFENKLWFDFVENPSVIGVYHKLGERVRQAILNTVKFCDHMVKAGLVGVLTLDNQ DLNGKWYDFGDFVITQPGSGVAIVDSYYSYLMPVLSMTDCLAAETHRDCDFNKPLIEWP LTEYDFTDYKVQLFEKYFKYWDQTYHANCVNCTDDRCVLHCANFNVLFAMTMPKTCFGP IVRKIFVDGVPFVVSCGYHYKELGLVMNMDVSLHRHRLSLKELMMYAADPAMHIASSNA FLDLRTSCFSVAALTTGLTFQTVRPGNFNQDFYDFVVSKGFFKEGSSVTLKHFFFAQDG NAAITDYNYYSYNLPTMCDIKQMLFCMEVVNKYFEIYDGGCLNASEVVVNNLDKSAGHP FNKFGKARVYYESMSYQEQDELFAMTKRNVIPTMTQMNLKYAISAKNRARTVAGVSILS TMTNRQYHQKMLKSMAATRGATCVIGTTKFYGGWDFMLKTLYKDVDNPHLMGWDYPKCD RAMPNMCRIFASLILARKHGTCCTTRDRFYRLANECAQVLSEYVLCGGGYYVKPGGTSS GDATTAYANSVFNILQATTANVSALMGANGNKIVDKEVKDMQFDLYVNVYRSTSPDPKF VDKYYAFLNKHFSMMILSDDGVVCYNSDYAAKGYIAGIQNFKETLYYQNNVFMSEAKCW VETDLKKGPHEFCSQHTLYIKDGDDGYFLPYPDPSRILSAGCFVDDIVKTDGTLMVERF VSLAIDAYPLTKHEDIEYQNVFWVYLQYIEKLYKDLTGHMLDSYSVMLCGDNSAKFWEE AFYRDLYSSPTTLQAVGSCVVCHSQTSLRCGTCIRRPFLCCKCCYDHVIATPHKMVLSV SPYVCNAPGCGVSDVTKLYLGGMSYFCVDHRPVCSFPLCANGLVFGLYKNMCTGSPSIV EFNRLATCDWTESGDYTLANTTTEPLKLFAAETLRATEEASKQSYAIATIKEIVGERQL LLVWEAGKSKPPLNRNYVFTGYHITKNSKVQLGEYIFERIDYSDAVSYKSSTTYKLTVG DIFVLTSHSVATLTAPTIVNQERYVKITGLYPTITVPEEFASHVANFQKSGYSKYVTVQ GPPGTGKSHFAIGLAIYYPTARVVYTACSHAAVDALCEKAFKYLNIAKCSRIIPAKARV ECYDRFKVNETNSQYLFSTINALPETSADILVVDEVSMCTNYDLSIINARIKAKHIVYV GDPAQLPAPRTLLTRGTLEPENFNSVTRLMCNLGPDIFLSMCYRCPKEIVSTVSALVYN NKLLAKKELSGQCFKILYKGNVTHDASSAINRPQLTFVKNFITANPAWSKAVFISPYNS QNAVSRSMLGLTTQTVDSSQGSEYQYVIFCQTADTAHANNINRFNVAITRAQKGILCVM TSQALFESLEFTELSFTNYKLQSQIVTGLFKDCSRETSGLSPAYAPTYVSVDDKYKTSD ELCVNLNLPANVPYSRVISRMGFKLDATVPGYPKLFITREEAVRQVRSWIGFDVEGAHA SRNACGTNVPLQLGFSTGVNFVVQPVGVVDTEWGNMLTGIAARPPPGEQFKHLVPLMHK GAAWPIVRRRIVQMLSDTLDKLSDYCTFVCWAHGFELTSASYFCKIGKEQKCCMCNRRA AAYSSPLQSYACWTHSCGYDYVYNPFFVDVQQWGYVGNLATNHDRYCSVHQGAHVASND AIMTRCLAIHSCFIERVDWDIEYPYISHEKKLNSCCRIVERNVVRAALLAGSFDKVYDI GNPKGIPIVDDPVVDWHYFDAQPLTRKVQQLFYTEDMASRFADGLCLFWNCNVPKYPNN AIVCRFDTRVHSEFNLPGCDGGSLYVNKHAFHTPAYDVSAFRDLKPLPFFYYSTTPCEV HGNGSMIEDIDYVPLKSAVCITACNLGGAVCRKHATEYREYMEAYNLVSASGFRLWCYK TFDIYNLWSTFTKVQGLENIAFNVVKQGHFIGVEGELPVAVVNDKIFTKSGVNDICMFE NKTTLPTNIAFELYAKRAVRSHPDFKLLHNLQADICYKFVLWDYERSNIYGTATIGVCK YTDIDVNSALNICFDIRDNCSLEKFMSTPNAIFISDRKIKKYPCMVGPDYAYFNGAIIR DSDVVKQPVKFYLYKKVNNEFIDPTECIYTQSRSCSDFLPLSDMEKDFLSFDSDVFIKK YGLENYAFEHVVYGDFSHTTLGGLHLLIGLYKKQQEGHIIMEEMLKGSSTIHNYFITET NTAAFKAVCSVIDLKLDDFVMILKSQDLGVVSKVVKVPIDLTMIEFMLWCKDGQVQTFY PRLQASADWKPGHAMPSLFKVQNVNLERCELANYKQSIPMPRGVHMNIAKYMQLCQYLN TCTLAVPANMRVIHFGAGSDKGIAPGTSVLRQWLPTDAIIIDNDLNEFVSDADITLFGD CVTVRVGQQVDLVISDMYDPTTKNVTGSNESKALFFTYLCNLINNNLALGGSVAIKITE HSWSVELYELMGKFAWWTVFCTNANASSSEGFLLGINYLGTIKENIDGGAMHANYIFWR NSTPMNLSTYSLFDLSKFQLKLKGTPVLQLKESQINELVISLLSQGKLLIRDNDTLSVS TDVLVNTYRKLR" mat_peptide 279..857 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp1 protein" /protein_id="YP_009047213.1" mat_peptide 858..2837 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp2 protein" /protein_id="YP_009047214.1" mat_peptide 2838..8498 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp3 protein" /note="putative ADP-ribose 1'-phosphatase, putative papain-like proteinase 2, putative transmembrane domain 1" /protein_id="YP_009047215.1" mat_peptide 8499..10019 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp4 protein" /note="putative transmembrane domain 2" /protein_id="YP_009047216.1" mat_peptide 10020..10937 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp5 protein" /note="putative 3C-like cysteine proteinase" /protein_id="YP_009047217.1" mat_peptide 10938..11813 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp6 protein" /note="putative transmembrane protein 3" /protein_id="YP_009047218.1" mat_peptide 11814..12062 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp7 protein" /protein_id="YP_009047219.1" mat_peptide 12063..12659 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp8 protein" /note="putative primase" /protein_id="YP_009047220.1" mat_peptide 12660..12989 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp9 protein" /protein_id="YP_009047221.1" mat_peptide 12990..13409 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp10 protein" /protein_id="YP_009047222.1" mat_peptide join(13410..13433,13433..16207) /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp12 protein" /note="putative RNA-dependent RNA polymerase" /protein_id="YP_009047223.1" mat_peptide 16208..18001 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp13 protein" /note="putative zinc-binding domain, putative superfamily 1 helicase" /protein_id="YP_009047224.1" mat_peptide 18002..19573 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp14 protein" /note="putative 3' to 5' exonuclease, N7-methyltransferase" /protein_id="YP_009047225.1" mat_peptide 19574..20602 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp15 protein" /note="putative nidoviral endoribonuclease specific for U" /protein_id="YP_009047226.1" mat_peptide 20603..21511 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp16 protein" /note="putative S-adenosylmethionine-dependent ribose 2'-O-methiltransferase" /protein_id="YP_009047227.1" CDS 279..13454 /gene="orf1ab" /locus_tag="G128_gp01" /note="ORF1a product; pp1a" /codon_start=1 /product="1A polyprotein" /protein_id="YP_009047203.1" /db_xref="GeneID:14254602" /translation="MSFVAGVTAQGARGTYRAALNSEKHQDHVSLTVPLCGSGNLVEKL SPWFMDGENAYEVVKAMLLKKEPLLYVPIRLAGHTRHLPGPRVYLVERLIACENPFMVN QLAYSSSANGSLVGTTLQGKPIGMFFPYDIELVTGKQNILLRKYGRGGYHYTPFHYERD NTSCPEWMDDFEADPKGKYAQNLLKKLIGGDVTPVDQYMCGVDGKPISAYAFLMAKDGI TKLADVEADVAARADDEGFITLKNNLYRLVWHVERKDVPYPKQSIFTINSVVQKDGVEN TPPHYFTLGCKILTLTPRNKWSGVSDLSLKQKLLYTFYGKESLENPTYIYHSAFIECGS CGNDSWLTGNAIQGFACGCGASYTANDVEVQSSGMIKPNALLCATCPFAKGDSCSSNCK HSVAQLVSYLSERCNVIADSKSFTLIFGGVAYAYFGCEEGTMYFVPRAKSVVSRIGDSI FTGCTGSWNKVTQIANMFLEQTQHSLNFVGEFVVNDVVLAILSGTTTNVDKIRQLLKGV TLDKLRDYLADYDVAVTAGPFMDNAINVGGTGLQYAAITAPYVVLTGLGESFKKVATIP YKVCNSVKDTLAYYAHSVLYRVFPYDMDSGVSSFSELLFDCVDLSVASTYFLVRILQDK TGDFMSTIITSCQTAVSKLLDTCFEATEATFNFLLDLAGLFRIFLRNAYVYTSQGFVVV NGKVSTLVKQVLDLLNKGMQLLHTKVSWAGSKIIAVIYSGRESLIFPSGTYYCVTTKAK SVQQDLDVILPGEFSKKQLGLLQPTDNSTTVSVTVSSNMVETVVGQLEQTNMHSPDVIV GDYVIISEKLFVRSKEEDGFAFYPACTNGHAVPTLFRLKGGAPVKKVAFGGDQVHEVAA VRSVTVEYNIHAVLDTLLASSSLRTFVVDKSLSIEEFADVVKEQVSDLLVKLLRGMPIP DFDLDDFIDAPCYCFNAEGDASWSSTMIFSLHPVECDEECSEVEASDLEEGESECISET STEQVDVSHETSDDEWAAAVDEAFPLDEAEDVTESVQEEAQPVEVPVEDIAQVVIADTL QETPVVPDTVEVPPQVVKLPSAPQTIQPEVKEVAPVYEADTEQTQNVTVKPKRLRKKRN VDPLSNFEHKVITECVTIVLGDAIQVAKCYGESVLVNAANTHLKHGGGIAGAINAASKG AVQKESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDARAKQDVSLLSKCYKAMNA YPLVVTPLVSAGIFGVKPAVSFDYLIREAKTRVLVVVNSQDVYKSLTIVDIPQSLTFSY DGLRGAIRKAKDYGFTVFVCTDNSANTKVLRNKGVDYTKKFLTVDGVQYYCYTSKDTLD DILQQANKSVGIISMPLGYVSHGLDLMQAGSVVRRVNVPYVCLLANKEQEAILMSEDVK LNPSEDFIKHVRTNGGYNSWHLVEGELLVQDLRLNKLLHWSDQTICYKDSVFYVVKNST AFPFETLSACRAYLDSRTTQQLTIEVLVTVDGVNFRTVVLNNKNTYRSQLGCVFFNGAD ISDTIPDEKQNGHSLYLADNLTADETKALKELYGPVDPTFLHRFYSLKAAVHGWKMVVC DKVRSLKLSDNNCYLNAVIMTLDLLKDIKFVIPALQHAFMKHKGGDSTDFIALIMAYGN CTFGAPDDASRLLHTVLAKAELCCSARMVWREWCNVCGIKDVVLQGLKACCYVGVQTVE DLRARMTYVCQCGGERHRQLVEHTTPWLLLSGTPNEKLVTTSTAPDFVAFNVFQGIETA VGHYVHARLKGGLILKFDSGTVSKTSDWKCKVTDVLFPGQKYSSDCNVVRYSLDGNFRT EVDPDLSAFYVKDGKYFTSEPPVTYSPATILAGSVYTNSCLVSSDGQPGGDAISLSFNN LLGFDSSKPVTKKYTYSFLPKEDGDVLLAEFDTYDPIYKNGAMYKGKPILWVNKASYDT NLNKFNRASLRQIFDVAPIELENKFTPLSVESTPVEPPTVDVVALQQEMTIVKCKGLNK PFVKDNVSFVADDSGTPVVEYLSKEDLHTLYVDPKYQVIVLKDNVLSSMLRLHTVESGD INVVAASGSLTRKVKLLFRASFYFKEFATRTFTATTAVGSCIKSVVRHLGVTKGILTGC FSFAKMLFMLPLAYFSDSKLGTTEVKVSALKTAGVVTGNVVKQCCTAAVDLSMDKLRRV DWKSTLRLLLMLCTTMVLLSSVYHLYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGISS ACDGLASAYRANSFDVPTFCANRSAMCNWCLISQDSITHYPALKMVQTHLSHYVLNIDW LWFAFETGLAYMLYTSAFNWLLLAGTLHYFFAQTSIFVDWRSYNYAVSSAFWLFTHIPM AGLVRMYNLLACLWLLRKFYQHVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYI TANGGISFCRRHNWNCVDCDTAGVGNTFICEEVANDLTTALRRPINATDRSHYYVDSVT VKETVVQFNYRRDGQPFYERFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQ ESLARSACVYYSQVLCKSILLVDSSLVTSVGDSSEIATKMFDSFVNSFVSLYNVTRDKL EKLISTARDGVRRGDNFHSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQITN ESYNNYVPSYVKPDSVSTSDLGSLIDCNAASVNQIVLRNSNGACIWNAAAYMKLSDALK RQIRIACRKCNLAFRLTTSKLRANDNILSVRFTANKIVGGAPTWFNALRDFTLKGYVLA TIIVFLCAVLMYLCLPTFSMAPVEFYEDRILDFKVLDNGIIRDVNPDDKCFANKHRSFT QWYHEHVGGVYDNSITCPLTVAVIAGVAGARIPDVPTTLAWVNNQIIFFVSRVFANTGS VCYTPIDEIPYKSFSDSGCILPSECTMFRDAEGRMTPYCHDPTVLPGAFAYSQMRPHVR YDLYDGNMFIKFPEVVFESTLRITRTLSTQYCRFGSCEYAQEGVCITTNGSWAIFNDHH LNRPGVYCGSDFIDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRA FADYTQCAVIAVVAAVLNSLCICFVTSIPLCIVPYTALYYYATFYFTNEPAFIMHVSWY IMFGPIVPIWMTCVYTVAMCFRHFFWVLAYFSKKHVEVFTDGKLNCSFQDAASNIFVIN KDTYAALRNSLTNDAYSRFLGLFNKYKYFSGAMETAAYREAAACHLAKALQTYSETGSD LLYQPPNCSITSGVLQSGLVKMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVM CPADQLSDPNYDALLISMTNHSFSVQKHIGAPANLRVVGHAMQGTLLKLTVDVANPSTP AYTFTTVKPGAAFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVIN FCYMHQMELANGTHTGSAFDGTMYGAFMDKQVHQVQLTDKYCSVNVVAWLYAAILNGCA WFVKPNRTSVVSFNEWALANQFTEFVGTQSVDMLAVKTGVAIEQLLYAIQQLYTGFQGK QILGSTMLEDEFTPEDVNMQIMGVVMQSGVRKVTYGTAHWLFATLVSTYVIILQATKFT LWNYLFETIPTQLFPLLFVTMAFVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPTTPI SSALIAVANWLAPTNAYMRTTHTDIGVYISMSLVLVIVVKRLYNPSLSNFALALCSGVM WLYTYSIGEASSPIAYLVFVTTLTSDYTITVFVTVNLAKVCTYAIFAYSPQLTLVFPEV KMILLLYTCLGFMCTCYFGVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAPRNSW EAMALNFKLIGIGGTPCIKVAAMQSKLTDLKCTSVVLLSVLQQLHLEANSRAWAFCVKC HNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASDIFDTPSVLQATLSEFSHLAT FAELEAAQKAYQEAMDSGDTSPQVLKALQKAVNIAKNAYEKDKAVARKLERMADQAMTS MYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNARNGCIPLSVIPLCASNKL RVVIPDFTVWNQVVTYPSLNYAGALWDITVINNVDNEIVKSSDVVDSNENLTWPLVLEC TRASTSAVKLQNNEIKPSGLKTMVVSAGQEQTNCNTSSLAYYEPVQGRKMLMALLSDNA YLKWARVEGKDGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLNNLHRGQVLGHIAATV RLQAGSNTEFASNSSVLSLVNFTVDPQKAYLDFVNAGGAPLTNCVKMLTPKTGTGIAIS VKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPAQCVRDPVGFCLSNT PCNVCQYWIGYGCNCDSLRQAALPQSKDSNFLNESGVLL" mat_peptide 279..857 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp1 protein" /protein_id="YP_009047229.1" mat_peptide 858..2837 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp2 protein" /protein_id="YP_009047230.1" mat_peptide 2838..8498 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp3 protein" /note="putative ADP-ribose 1'-phosphatase, putative papain-like proteinase 2, putative transmembrane domain 1" /protein_id="YP_009047231.1" mat_peptide 8499..10019 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp4 protein" /note="putative transmembrane domain 2" /protein_id="YP_009047232.1" mat_peptide 10020..10937 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp5 protein" /note="putative 3C-like cysteine proteinase" /protein_id="YP_009047233.1" mat_peptide 10938..11813 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp6 protein" /note="putative transmembrane domain 3" /protein_id="YP_009047234.1" mat_peptide 11814..12062 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp7 protein" /protein_id="YP_009047235.1" mat_peptide 12063..12659 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp8 protein" /note="putative primase" /protein_id="YP_009047236.1" mat_peptide 12660..12989 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp9 protein" /protein_id="YP_009047237.1" mat_peptide 12990..13409 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp10 protein" /protein_id="YP_009047238.1" mat_peptide 13410..13451 /gene="orf1ab" /locus_tag="G128_gp01" /product="nsp11 protein" /protein_id="YP_009047228.1" gene 21456..25517 /gene="S" /locus_tag="G128_gp02" /db_xref="GeneID:14254594" CDS 21456..25517 /gene="S" /locus_tag="G128_gp02" /note="S protein" /codon_start=1 /product="spike glycoprotein" /protein_id="YP_009047204.1" /db_xref="GeneID:14254594" /translation="MIHSVFLLMFLLTPTESYVDVGPDSVKSACIEVDIQQTFFDKTWP RPIDVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTPQKLFVAN YSQDVKQFANGFVVRIGAAANSTGTVIISPSTSATIRKIYPAFMLGSSVGNFSDGKMGR FFNHTLVLLPDGCGTLLRAFYCILEPRSGNHCPAGNSYTSFATYHTPATDCSDGNYNRN ASLNSFKEYFNLRNCTFMYTYNITEDEILEWFGITQTAQGVHLFSSRYVDLYGGNMFQF ATLPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCG FNDLSQLHCSYESFDVESGVYSVSSFEAKPSGSVVEQAEGVECDFSPLLSGTPPQVYNF KRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSV SSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQL VNANQYSPCVSIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQ YGTDTNSVCPKLEFANDTKIASQLGNCVEYSLYGVSGRGVFQNCTAVGVRQQRFVYDAY QNLVGYYSDDGNYYCLRACVSVPVSVIYDKETKTHATLFGSVACEHISSTMSQYSRSTR SMLKRRDSTYGPLQTPVGCVLGLVNSSLFVEDCKLPLGQSLCALPDTPSTLTPRSVRSV PGEMRLASIAFNHPIQVDQLNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCN GFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSSPIIPGFGGDFNL TLLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPASARDLICAQYVA GYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQV LSENQKLIANKFNQALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAIS ASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECV KAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIA PVNGYFIKTNNTRIVDEWSYTGSSFYAPEPITSLNTKYVAPQVTYQNISTNLPPPLLGN STGIDFQDELDEFFKNVSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLK ELGNYTYYNKWPWYIWLGFIAGLVALALCVFFILCCTGCGTNCMGKLKCNRCCDRYEEY DLEPHKVHVH" gene 25532..25843 /gene="orf3" /locus_tag="G128_gp03" /db_xref="GeneID:14254595" CDS 25532..25843 /gene="orf3" /locus_tag="G128_gp03" /note="ORF3 protein" /codon_start=1 /product="NS3 protein" /protein_id="YP_009047205.1" /db_xref="GeneID:14254595" /translation="MRVQRPPTLLLVFSLSLLVTASSKPLYVPEHCQNYSGCMLRACIK TAQADTAGLYTNFRIDVPSAESTGTQSVSVDLESTSTHDGPTEHVTSVNLFDVGYSVN" gene 25852..26181 /gene="orf4a" /locus_tag="G128_gp04" /db_xref="GeneID:14254596" CDS 25852..26181 /gene="orf4a" /locus_tag="G128_gp04" /note="ORF4a protein" /codon_start=1 /product="NS4A protein" /protein_id="YP_009047206.1" /db_xref="GeneID:14254596" /translation="MDYVSLLNQIWQKYLNSPYTTCLYIPKPTAKYTPLVGTSLHPVLW NCQLSFAGYTESAVNSTKALAKQDAAQRIAWLLHKDGGIPDGCSLYLRHSSLFAQSEEE EPFSN" gene 26093..26833 /gene="orf4b" /locus_tag="G128_gp05" /db_xref="GeneID:14254597" CDS 26093..26833 /gene="orf4b" /locus_tag="G128_gp05" /note="ORF4b protein" /codon_start=1 /product="NS4B protein" /protein_id="YP_009047207.1" /db_xref="GeneID:14254597" /translation="MEESLMDVPSTSGTQVYSRKARKRSHSPTKKLRYVKRRFSLLRHE DLSVIVQPTHYVRVTFSDPNMWYLRSGHHLHSVHNWLKPYGGQPVSEYHITLALLNLTD EDLARDFSPIALFLRNVRFELHEFALLRKTLVLNASEIYCANIHRFKPVYRVNTAIPTI KDWLLVQGFSLYHSGLPLHMSISKLHALDDVTRNYIITMPCFRTYPQQMFVTPLAVDVV SIRSSNQGNKQIVHSYPILHHPGF" gene 26840..27514 /gene="orf5" /locus_tag="G128_gp06" /db_xref="GeneID:14254598" CDS 26840..27514 /gene="orf5" /locus_tag="G128_gp06" /note="ORF5 protein" /codon_start=1 /product="NS5 protein" /protein_id="YP_009047208.1" /db_xref="GeneID:14254598" /translation="MAFSASLFKPVQLVPVSPAFHRIESTDSIVFTYIPASGYVAALAV NVCLIPLLLLLRQDTCRRSIIRTMVLYFLVLYNFLLAIVLVNGVHYPTGSCLIAFLVIL IILWFVDRIRFCLMLNSYIPLFDMRSHFIRVSTVSSHGMVPVIHTKPLFIRNFDQRCSC SRCFYLHSSTYIECTYISRFSKISLVSVTDFSLNGNVSTVFVPATRDSVPLHIIAPSSL IV" gene 27590..27838 /gene="E" /locus_tag="G128_gp07" /db_xref="GeneID:14254599" CDS 27590..27838 /gene="E" /locus_tag="G128_gp07" /note="E protein" /codon_start=1 /product="envelope protein" /protein_id="YP_009047209.1" /db_xref="GeneID:14254599" /translation="MLPFVQERIGLFIVNFFIFTVVCAITLLVCMAFLTATRLCVQCMT GFNTLLVQPALYLYNTGRSVYVKFQDSKPPLPPDEWV" gene 27853..28512 /gene="M" /locus_tag="G128_gp08" /db_xref="GeneID:14254600" CDS 27853..28512 /gene="M" /locus_tag="G128_gp08" /note="M protein" /codon_start=1 /product="membrane protein" /protein_id="YP_009047210.1" /db_xref="GeneID:14254600" /translation="MSNMTQLTEAQIIAIIKDWNFAWSLIFLLITIVLQYGYPSRSMTV YVFKMFVLWLLWPSSMALSIFSAVYPIDLASQIISGIVAAVSAMMWISYFVQSIRLFMR TGSWWSFNPETNCLLNVPFGGTTVVRPLVEDSTSVTAVVTNGHLKMAGMHFGACDYDRL PNEVTVAKPNVLIALKMVKRQSYGTNSGVAIYHRYKAGNYRSPPITADIELALLRA" gene 28566..29807 /gene="N" /locus_tag="G128_gp09" /db_xref="GeneID:14254601" CDS 28566..29807 /gene="N" /locus_tag="G128_gp09" /note="N protein" /codon_start=1 /product="nucleoprotein" /protein_id="YP_009047211.1" /db_xref="GeneID:14254601" /translation="MASPAAPRAVSFADNNDITNTNLSRGRGRNPKPRAAPNNTVSWYT GLTQHGKVPLTFPPGQGVPLNANSTPAQNAGYWRRQDRKINTGNGIKQLAPRWYFYYTG TGPEAALPFRAVKDGIVWVHEDGATDAPSTFGTRNPNNDSAIVTQFAPGTKLPKNFHIE GTGGNSQSSSRASSLSRNSSRSSSQGSRSGNSTRGTSPGPSGIGAVGGDLLYLDLLNRL QALESGKVKQSQPKVITKKDAAAAKNKMRHKRTSTKSFNMVQAFGLRGPGDLQGNFGDL QLNKLGTEDPRWPQIAELAPTASAFMGMSQFKLTHQNNDDHGNPVYFLRYSGAIKLDPK NPNYNKWLELLEQNIDAYKTFPKKEKKQKAPKEESTDQMSEPPKEQRVQGSITQRTRTR PSVQPGPMIDVNTD" gene 28762..29100 /gene="orf8b" /locus_tag="G128_gp10" /db_xref="GeneID:19910005" CDS 28762..29100 /gene="orf8b" /locus_tag="G128_gp10" /codon_start=1 /product="ORF8b protein" /protein_id="YP_009047212.1" /db_xref="GeneID:19910005" /translation="MPILPLRKMLGIGGDRTEKLIPGMELSNWLPGGTSTTLELDPKQH SHSGLLRMASFGSMKMAPLMLLQLLGRGTLTMIQLLLHNSRPVLSFLKTSTLRGLEAIV NHLQEPLA" ORIGIN 1 gatttaagtg aatagcttgg ctatctcact tcccctcgtt ctcttgcaga actttgattt 61 taacgaactt aaataaaagc cctgttgttt agcgtatcgt tgcacttgtc tggtgggatt 121 gtggcattaa tttgcctgct catctaggca gtggacatat gctcaacact gggtataatt 181 ctaattgaat actatttttc agttagagcg tcgtgtctct tgtacgtctc ggtcacaata 241 cacggtttcg tccggtgcgt ggcaattcgg ggcacatcat gtctttcgtg gctggtgtga 301 ccgcgcaagg tgcgcgcggt acgtatcgag cagcgctcaa ctctgaaaaa catcaagacc 361 atgtgtctct aactgtgcca ctctgtggtt caggaaacct ggttgaaaaa ctttcaccat 421 ggttcatgga tggcgaaaat gcctatgaag tggtgaaggc catgttactt aaaaaggagc 481 cacttctcta tgtgcccatc cggctggctg gacacactag acacctccca ggtcctcgtg 541 tgtacctggt tgagaggctc attgcttgtg aaaatccatt catggttaac caattggctt 601 atagctctag tgcaaatggc agcctggttg gcacaacttt gcagggcaag cctattggta 661 tgttcttccc ttatgacatc gaacttgtca caggaaagca aaatattctc ctgcgcaagt 721 atggccgtgg tggttatcac tacaccccat tccactatga gcgagacaac acctcttgcc 781 ctgagtggat ggacgatttt gaggcggatc ctaaaggcaa atatgcccag aatctgctta 841 agaagttgat tggcggtgat gtcactccag ttgaccaata catgtgtggc gttgatggaa 901 aacccattag tgcctacgca tttttaatgg ccaaggatgg aataaccaaa ctggctgatg 961 ttgaagcgga cgtcgcagca cgtgctgatg acgaaggctt catcacatta aagaacaatc 1021 tatatagatt ggtttggcat gttgagcgta aagacgttcc atatcctaag caatctattt 1081 ttactattaa tagtgtggtc caaaaggatg gtgttgaaaa cactcctcct cactatttta 1141 ctcttggatg caaaatttta acgctcaccc cacgcaacaa gtggagtggc gtttctgact 1201 tgtccctcaa acaaaaactc ctttacacct tctatggtaa ggagtcactt gagaacccaa 1261 cctacattta ccactccgca ttcattgagt gtggaagttg tggtaatgat tcctggctta 1321 cagggaatgc tatccaaggg tttgcctgtg gatgtggggc atcatataca gctaatgatg 1381 tcgaagtcca atcatctggc atgattaagc caaatgctct tctttgtgct acttgcccct 1441 ttgctaaggg tgatagctgt tcttctaatt gcaaacattc agttgctcag ttggttagtt 1501 acctttctga acgctgtaat gttattgctg attctaagtc cttcacactt atctttggtg 1561 gcgtagctta cgcctacttt ggatgtgagg aaggtactat gtactttgtg cctagagcta 1621 agtctgttgt ctcaaggatt ggagactcca tctttacagg ctgtactggc tcttggaaca 1681 aggtcactca aattgctaac atgttcttgg aacagactca gcattccctt aactttgtgg 1741 gagagttcgt tgtcaacgat gttgtcctcg caattctctc tggaaccaca actaatgttg 1801 acaaaatacg ccagcttctc aaaggtgtca cccttgacaa gttgcgtgat tatttagctg 1861 actatgacgt agcagtcact gccggcccat tcatggataa tgctattaat gttggtggta 1921 caggattaca gtatgccgcc attactgcac cttatgtagt tctcactggc ttaggtgagt 1981 cctttaagaa agttgcaacc ataccgtata aggtttgcaa ctctgttaag gatactctgg 2041 cttattatgc tcacagcgtg ttgtacagag tttttcctta tgacatggat tctggtgtgt 2101 catcctttag tgaactactt tttgattgcg ttgatctttc agtagcttct acctattttt 2161 tagtccgcat cttgcaagat aagactggcg actttatgtc tacaattatt acttcctgcc 2221 aaactgctgt tagtaagctt ctagatacat gttttgaagc tacagaagca acatttaact 2281 tcttgttaga tttggcagga ttgttcagaa tctttctccg caatgcctat gtgtacactt 2341 cacaagggtt tgtggtggtc aatggcaaag tttctacact tgtcaaacaa gtgttagact 2401 tgcttaataa gggtatgcaa cttttgcata caaaggtctc ctgggctggt tctaaaatca 2461 ttgctgttat ctacagcggc agggagtctc taatattccc atcgggaacc tattactgtg 2521 tcaccactaa ggctaagtcc gttcaacaag atcttgacgt tattttgcct ggtgagtttt 2581 ccaagaagca gttaggactg ctccaaccta ctgacaattc tacaactgtt agtgttactg 2641 tatccagtaa catggttgaa actgttgtgg gtcaacttga gcaaactaat atgcatagtc 2701 ctgatgttat agtaggtgac tatgtcatta ttagtgaaaa attgtttgtg cgtagtaagg 2761 aagaagacgg atttgccttc taccctgctt gcactaatgg tcatgctgta ccgactctct 2821 ttagacttaa gggaggtgca cctgtaaaaa aagtagcctt tggcggtgat caagtacatg 2881 aggttgctgc tgtaagaagt gttactgtcg agtacaacat tcatgctgta ttagacacac 2941 tacttgcttc ttctagtctt agaacctttg ttgtagataa gtctttgtca attgaggagt 3001 ttgctgacgt agtaaaggaa caagtctcag acttgcttgt taaattactg cgtggaatgc 3061 cgattccaga ttttgattta gacgatttta ttgacgcacc atgctattgc tttaacgctg 3121 agggtgatgc atcctggtct tctactatga tcttctctct tcaccccgtc gagtgtgacg 3181 aggagtgttc tgaagtagag gcttcagatt tagaagaagg tgaatcagag tgcatttctg 3241 agacttcaac tgaacaagtt gacgtttctc atgagacttc tgacgacgag tgggctgctg 3301 cagttgatga agcgttccct ctcgatgaag cagaagatgt tactgaatct gtgcaagaag 3361 aagcacaacc agtagaagta cctgttgaag atattgcgca ggttgtcata gctgacacct 3421 tacaggaaac tcctgttgtg cctgatactg ttgaagtccc accgcaagtg gtgaaacttc 3481 cgtctgcacc tcagactatc cagcccgagg taaaagaagt tgcacctgtc tatgaggctg 3541 ataccgaaca gacacagaat gttactgtta aacctaagag gttacgcaaa aagcgtaatg 3601 ttgacccttt gtccaatttt gaacataagg ttattacaga gtgcgttacc atagttttag 3661 gtgacgcaat tcaagtagcc aagtgctatg gggagtctgt gttagttaat gctgctaaca 3721 cacatcttaa gcatggcggt ggtatcgctg gtgctattaa tgcggcttca aaaggggctg 3781 tccaaaaaga gtcagatgag tatattctgg ctaaagggcc gttacaagta ggagattcag 3841 ttctcttgca aggccattct ctagctaaga atatcctgca tgtcgtaggc ccagatgccc 3901 gcgctaaaca ggatgtttct ctccttagta agtgctataa ggctatgaat gcatatcctc 3961 ttgtagtcac tcctcttgtt tcagcaggca tatttggtgt aaaaccagct gtgtcttttg 4021 attatcttat tagggaggct aagactagag ttttagtcgt cgttaattcc caagatgtct 4081 ataagagtct taccatagtt gacattccac agagtttgac tttttcatat gatgggttac 4141 gtggcgcaat acgtaaagct aaagattatg gttttactgt ttttgtgtgc acagacaact 4201 ctgctaacac taaagttctt aggaacaagg gtgttgatta tactaagaag tttcttacag 4261 ttgacggtgt gcaatattat tgctacacgt ctaaggacac tttagatgat atcttacaac 4321 aggctaataa gtctgttggt attatatcta tgcctttggg atatgtgtct catggtttag 4381 acttaatgca agcagggagt gtcgtgcgta gagttaacgt gccctacgtg tgtctcctag 4441 ctaataaaga gcaagaagct attttgatgt ctgaagacgt taagttaaac ccttcagaag 4501 attttataaa gcacgtccgc actaatggtg gttacaattc ttggcattta gtcgagggtg 4561 aactattggt gcaagactta cgcttaaata agctcctgca ttggtctgat caaaccatat 4621 gctacaagga tagtgtgttt tatgttgtaa agaatagtac agcttttcca tttgaaacac 4681 tttcagcatg tcgtgcgtat ttggattcac gcacgacaca gcagttaaca atcgaagtct 4741 tagtgactgt cgatggtgta aattttagaa cagtcgttct aaataataag aacacttata 4801 gatcacagct tggatgcgtt ttctttaatg gtgctgatat ttctgacacc attcctgatg 4861 agaaacagaa tggtcacagt ttatatctag cagacaattt gactgctgat gaaacaaagg 4921 cgcttaaaga gttatatggc cccgttgatc ctactttctt acacagattc tattcactta 4981 aggctgcagt ccatgggtgg aagatggttg tgtgtgataa ggtacgttct ctcaaattga 5041 gtgataataa ttgttatctt aatgcagtta ttatgacact tgatttattg aaggacatta 5101 aatttgttat acctgctcta cagcatgcat ttatgaaaca taagggcggt gattcaactg 5161 acttcatagc cctcattatg gcttatggca attgcacatt tggtgctcca gatgatgcct 5221 ctcggttact tcataccgtg cttgcaaagg ctgagttatg ctgttctgca cgcatggttt 5281 ggagagagtg gtgcaatgtc tgtggcataa aagatgttgt tctacaaggc ttaaaagctt 5341 gttgttacgt gggtgtgcaa actgttgaag atctgcgtgc tcgcatgaca tatgtatgcc 5401 agtgtggtgg tgaacgtcat cggcaattag tcgaacacac caccccctgg ttgctgctct 5461 caggcacacc aaatgaaaaa ttggtgacaa cctccacggc gcctgatttt gtagcattta 5521 atgtctttca gggcattgaa acggctgttg gccattatgt tcatgctcgc ctgaagggtg 5581 gtcttatttt aaagtttgac tctggcaccg ttagcaagac ttcagactgg aagtgcaagg 5641 tgacagatgt acttttcccc ggccaaaaat acagtagcga ttgtaatgtc gtacggtatt 5701 ctttggacgg taatttcaga acagaggttg atcccgacct atctgctttc tatgttaagg 5761 atggtaaata ctttacaagt gaaccacccg taacatattc accagctaca attttagctg 5821 gtagtgtcta cactaatagc tgccttgtat cgtctgatgg acaacctggc ggtgatgcta 5881 ttagtttgag ttttaataac cttttagggt ttgattctag taaaccagtc actaagaaat 5941 acacttactc cttcttgcct aaagaagacg gcgatgtgtt gttggctgag tttgacactt 6001 atgaccctat ttataagaat ggtgccatgt ataaaggcaa accaattctt tgggtcaata 6061 aagcatctta tgatactaat cttaataagt tcaatagagc tagtttgcgt caaatttttg 6121 acgtagcccc cattgaactc gaaaataaat tcacaccttt gagtgtggag tctacaccag 6181 ttgaacctcc aactgtagat gtggtagcac ttcaacagga aatgacaatt gtcaaatgta 6241 agggtttaaa taaacctttc gtgaaggaca atgtcagttt cgttgctgat gattcaggta 6301 ctcccgttgt tgagtatctg tctaaagaag acctacatac attgtatgta gaccctaagt 6361 atcaagtcat tgtcttaaaa gacaatgtac tttcttctat gcttagattg cacaccgttg 6421 agtcaggtga tattaacgtt gttgcagctt ccggatcttt gacacgtaaa gtgaagttac 6481 tatttagggc ttcattttat ttcaaagaat ttgctacccg cactttcact gctaccactg 6541 ctgtaggtag ttgtataaag agtgtagtgc ggcatctagg tgttactaaa ggcatattga 6601 caggctgttt tagttttgcc aagatgttat ttatgcttcc actagcttac tttagtgatt 6661 caaaactcgg caccacagag gttaaagtga gtgctttgaa aacagccggc gttgtgacag 6721 gtaatgttgt aaaacagtgt tgcactgctg ctgttgattt aagtatggat aagttgcgcc 6781 gtgtggattg gaaatcaacc ctacggttgt tacttatgtt atgcacaact atggtattgt 6841 tgtcttctgt gtatcacttg tatgtcttca atcaggtctt atcaagtgat gttatgtttg 6901 aagatgccca aggtttgaaa aagttctaca aagaagttag agcttaccta ggaatctctt 6961 ctgcttgtga cggtcttgct tcagcttata gggcgaattc ctttgatgta cctacattct 7021 gcgcaaaccg ttctgcaatg tgtaattggt gcttgattag ccaagattcc ataactcact 7081 acccagctct taagatggtt caaacacatc ttagccacta tgttcttaac atagattggt 7141 tgtggtttgc atttgagact ggtttggcat acatgctcta tacctcggcc ttcaactggt 7201 tgttgttggc aggtacattg cattatttct ttgcacagac ttccatattt gtagactggc 7261 ggtcatacaa ttatgctgtg tctagtgcct tctggttatt cacccacatt ccaatggcgg 7321 gtttggtacg aatgtataat ttgttagcat gcctttggct tttacgcaag ttttatcagc 7381 atgtaatcaa tggttgcaaa gatacggcat gcttgctctg ctataagagg aaccgactta 7441 ctagagttga agcttctacc gttgtctgtg gtggaaaacg tacgttttat atcacagcaa 7501 atggcggtat ttcattctgt cgtaggcata attggaattg tgtggattgt gacactgcag 7561 gtgtggggaa taccttcatc tgtgaagaag tcgcaaatga cctcactacc gccctacgca 7621 ggcctattaa cgctacggat agatcacatt attatgtgga ttccgttaca gttaaagaga 7681 ctgttgttca gtttaattat cgtagagacg gtcaaccatt ctacgagcgg tttcccctct 7741 gcgcttttac aaatctagat aagttgaagt tcaaagaggt ctgtaaaact actactggta 7801 tacctgaata caactttatc atctacgact catcagatcg tggccaggaa agtttagcta 7861 ggtctgcatg tgtttattat tctcaagtct tgtgtaaatc aattcttttg gttgactcaa 7921 gtttggttac ttctgttggt gattctagtg aaatcgccac taaaatgttt gattcctttg 7981 ttaatagttt cgtctcgctg tataatgtca cacgcgataa gttggaaaaa cttatctcta 8041 ctgctcgtga tggcgtaagg cgaggcgata acttccatag tgtcttaaca acattcattg 8101 acgcagcacg aggccccgca ggtgtggagt ctgatgttga gaccaatgaa attgttgact 8161 ctgtgcagta tgctcataaa catgacatac aaattactaa tgagagctac aataattatg 8221 taccctcata tgttaaacct gatagtgtgt ctaccagcga tttaggtagt ctcattgatt 8281 gtaatgcggc ttcagttaac caaattgtct tgcgtaattc taatggtgct tgcatttgga 8341 acgctgctgc atatatgaaa ctctcggatg cacttaaacg acagattcgc attgcatgcc 8401 gtaagtgtaa tttagctttc cggttaacca cctcaaagct acgcgctaat gataatatct 8461 tatcagttag attcactgct aacaaaattg ttggtggtgc tcctacatgg tttaatgcgt 8521 tgcgtgactt tacgttaaag ggttatgttc ttgctaccat tattgtgttt ctgtgtgctg 8581 tactgatgta tttgtgttta cctacatttt ctatggcacc tgttgaattt tatgaagacc 8641 gcatcttgga ctttaaagtt cttgataatg gtatcattag ggatgtaaat cctgatgata 8701 agtgctttgc taataagcac cggtccttca cacaatggta tcatgagcat gttggtggtg 8761 tctatgacaa ctctatcaca tgcccattga cagttgcagt aattgctgga gttgctggtg 8821 ctcgcattcc agacgtacct actacattgg cttgggtgaa caatcagata attttctttg 8881 tttctcgagt ctttgctaat acaggcagtg tttgctacac tcctatagat gagataccct 8941 ataagagttt ctctgatagt ggttgcattc ttccatctga gtgcactatg tttagggatg 9001 cagagggccg tatgacacca tactgccatg atcctactgt tttgcctggg gcttttgcgt 9061 acagtcagat gaggcctcat gttcgttacg acttgtatga tggtaacatg tttattaaat 9121 ttcctgaagt agtatttgaa agtacactta ggattactag aactctgtca actcagtact 9181 gccggttcgg tagttgtgag tatgcacaag agggtgtttg tattaccaca aatggctcgt 9241 gggccatttt taatgaccac catcttaata gacctggtgt ctattgtggc tctgatttta 9301 ttgacattgt caggcggtta gcagtatcac tgttccagcc tattacttat ttccaattga 9361 ctacctcatt ggtcttgggt ataggtttgt gtgcgttcct gactttgctc ttctattata 9421 ttaataaagt aaaacgtgct tttgcagatt acacccagtg tgctgtaatt gctgttgttg 9481 ctgctgttct taatagcttg tgcatctgct ttgttacctc tataccattg tgtatagtac 9541 cttacactgc attgtactat tatgctacat tctattttac taatgagcct gcatttatta 9601 tgcatgtttc ttggtacatt atgttcgggc ctatcgttcc catatggatg acctgcgtct 9661 atacagttgc aatgtgcttt agacacttct tctgggtttt agcttatttt agtaagaaac 9721 atgtagaagt ttttactgat ggtaagctta attgtagttt ccaggacgct gcctctaata 9781 tctttgttat taacaaggac acttatgcag ctcttagaaa ctctttaact aatgatgcct 9841 attcacgatt tttggggttg tttaacaagt ataagtactt ctctggtgct atggaaacag 9901 ccgcttatcg tgaagctgca gcatgtcatc ttgctaaagc cttacaaaca tacagcgaga 9961 ctggtagtga tcttctttac caaccaccca actgtagcat aacctctggc gtgttgcaaa 10021 gcggtttggt gaaaatgtca catcccagtg gagatgttga ggcttgtatg gttcaggtta 10081 cctgcggtag catgactctt aatggtcttt ggcttgacaa cacagtctgg tgcccacgac 10141 acgtaatgtg cccggctgac cagttgtctg atcctaatta tgatgccttg ttgatttcta 10201 tgactaatca tagtttcagt gtgcaaaaac acattggcgc tccagcaaac ttgcgtgttg 10261 ttggtcatgc catgcaaggc actcttttga agttgactgt cgatgttgct aaccctagca 10321 ctccagccta cacttttaca acagtgaaac ctggcgcagc atttagtgtg ttagcatgct 10381 ataatggtcg tccgactggt acattcactg ttgtaatgcg ccctaactac acaattaagg 10441 gttcctttct gtgtggttct tgtggtagtg ttggttacac caaggagggt agtgtgatca 10501 atttctgtta catgcatcaa atggaacttg ctaatggtac acataccggt tcagcatttg 10561 atggtactat gtatggtgcc tttatggata aacaagtgca ccaagttcag ttaacagaca 10621 aatactgcag tgttaatgta gtagcttggc tttacgcagc aatacttaat ggttgcgctt 10681 ggtttgtaaa acctaatcgc actagtgttg tttcttttaa tgaatgggct cttgccaacc 10741 aattcactga atttgttggc actcaatccg ttgacatgtt agctgtcaaa acaggcgttg 10801 ctattgaaca gctgctttat gcgatccaac aactgtatac tgggttccag ggaaagcaaa 10861 tccttggcag taccatgttg gaagatgaat tcacacctga ggatgttaat atgcagatta 10921 tgggtgtggt tatgcagagt ggtgtgagaa aagttacata tggtactgcg cattggttgt 10981 ttgcgaccct tgtctcaacc tatgtgataa tcttacaagc cactaaattt actttgtgga 11041 actacttgtt tgagactatt cccacacagt tgttcccact cttatttgtg actatggcct 11101 tcgttatgtt gttggttaaa cacaaacaca cctttttgac acttttcttg ttgcctgtgg 11161 ctatttgttt gacttatgca aacatagtct acgagcccac tactcccatt tcgtcagcgc 11221 tgattgcagt tgcaaattgg cttgccccca ctaatgctta tatgcgcact acacatactg 11281 atattggtgt ctacattagt atgtcacttg tattagtcat tgtagtgaag agattgtaca 11341 acccatcact ttctaacttt gcgttagcat tgtgcagtgg tgtaatgtgg ttgtacactt 11401 atagcattgg agaagcctca agccccattg cctatctggt ttttgtcact acactcacta 11461 gtgattatac gattacagtc tttgttactg tcaaccttgc aaaagtttgc acttatgcca 11521 tctttgctta ctcaccacag cttacacttg tgtttccgga agtgaagatg atacttttat 11581 tatacacatg tttaggtttc atgtgtactt gctattttgg tgtcttctct cttttgaacc 11641 ttaagcttag agcacctatg ggtgtctatg actttaaggt ctcaacacaa gagttcagat 11701 tcatgactgc taacaatcta actgcaccta gaaattcttg ggaggctatg gctctgaact 11761 ttaagttaat aggtattggc ggtacacctt gtataaaggt tgctgctatg cagtctaaac 11821 ttacagatct taaatgcaca tctgtggttc tcctctctgt gctccaacag ttacacttag 11881 aggctaatag tagggcctgg gctttctgtg ttaaatgcca taatgatata ttggcagcaa 11941 cagaccccag tgaggctttc gagaaattcg taagtctctt tgctacttta atgacttttt 12001 ctggtaatgt agatcttgat gcgttagcta gtgatatttt tgacactcct agcgtacttc 12061 aagctactct ttctgagttt tcacacttag ctacctttgc tgagttggaa gctgcgcaga 12121 aagcctatca ggaagctatg gactctggtg acacctcacc acaagttctt aaggctttgc 12181 agaaggctgt taatatagct aaaaacgcct atgagaagga taaggcagtg gcccgtaagt 12241 tagaacgtat ggctgatcag gctatgactt ctatgtataa gcaagcacgt gctgaagaca 12301 agaaagcaaa aattgtcagt gctatgcaaa ctatgttgtt tggtatgatt aagaagctcg 12361 acaacgatgt tcttaatggt atcatttcta acgctaggaa tggttgtata cctcttagtg 12421 tcatcccact gtgtgcttca aataaacttc gcgttgtaat tcctgacttc accgtctgga 12481 atcaggtagt cacatatccc tcgcttaact acgctggggc tttgtgggac attacagtta 12541 taaacaatgt ggacaatgaa attgttaagt cttcagatgt tgtagacagc aatgaaaatt 12601 taacatggcc acttgtttta gaatgcacta gggcatccac ttctgccgtt aagttgcaaa 12661 ataatgagat caaaccttca ggtctaaaaa ccatggttgt gtctgcgggt caagagcaaa 12721 ctaactgtaa tactagttcc ttagcttatt acgaacctgt gcagggtcgt aaaatgctga 12781 tggctcttct ttctgataat gcctatctca aatgggcgcg tgttgaaggt aaggacggat 12841 ttgtcagtgt agagctacaa cctccttgca aattcttgat tgcgggacca aaaggacctg 12901 aaatccgata tctctatttt gttaaaaatc ttaacaacct tcatcgcggg caagtgttag 12961 ggcacattgc tgcgactgtt agattgcaag ctggttctaa caccgagttt gcctctaatt 13021 cctcggtgtt gtcacttgtt aacttcaccg ttgatcctca aaaagcttat ctcgatttcg 13081 tcaatgcggg aggtgcccca ttgacaaatt gtgttaagat gcttactcct aaaactggta 13141 caggtatagc tatatctgtt aaaccagaga gtacagctga tcaagagact tatggtggag 13201 cttcagtgtg tctctattgc cgtgcgcata tagaacatcc tgatgtctct ggtgtttgta 13261 aatataaggg taagtttgtc caaatccctg ctcagtgtgt ccgtgaccct gtgggatttt 13321 gtttgtcaaa taccccctgt aatgtctgtc aatattggat tggatatggg tgcaattgtg 13381 actcgcttag gcaagcagca ctgccccaat ctaaagattc caatttttta aacgagtccg 13441 gggttctatt gtaaatgccc gaatagaacc ctgttcaagt ggtttgtcca ctgatgtcgt 13501 ctttagggca tttgacatct gcaactataa ggctaaggtt gctggtattg gaaaatacta 13561 caagactaat acttgtaggt ttgtagaatt agatgaccaa gggcatcatt tagactccta 13621 ttttgtcgtt aagaggcata ctatggagaa ttatgaacta gagaagcact gttacgactt 13681 gttacgtgac tgtgatgctg tagctcccca tgatttcttc atctttgatg tagacaaagt 13741 taaaacacct catattgtac gtcagcgttt aactgagtac actatgatgg atcttgtata 13801 tgccctgagg cactttgatc aaaatagcga agtgcttaag gctatcttag tgaagtatgg 13861 ttgctgtgat gttacctact ttgaaaataa actctggttt gattttgttg aaaatcccag 13921 tgttattggt gtttatcata aacttggaga acgtgtacgc caagctatct taaacactgt 13981 taaattttgt gaccacatgg tcaaggctgg tttagtcggt gtgctcacac tagacaacca 14041 ggaccttaat ggcaagtggt atgattttgg tgacttcgta atcactcaac ctggttcagg 14101 agtagctata gttgatagct actattctta tttgatgcct gtgctctcaa tgaccgattg 14161 tctggccgct gagacacata gggattgtga ttttaataaa ccactcattg agtggccact 14221 tactgagtat gattttactg attataaggt acaactcttt gagaagtact ttaaatattg 14281 ggatcagacg tatcacgcaa attgcgttaa ttgtactgat gaccgttgtg tgttacattg 14341 tgctaatttc aatgtattgt ttgctatgac catgcctaag acttgtttcg gacccatagt 14401 ccgaaagatc tttgttgatg gcgtgccatt tgtagtatct tgtggttatc actacaaaga 14461 attaggttta gtcatgaata tggatgttag tctccataga cataggctct ctcttaagga 14521 gttgatgatg tatgccgctg atccagccat gcacattgcc tcctctaacg cttttcttga 14581 tttgaggaca tcatgtttta gtgtcgctgc acttacaact ggtttgactt ttcaaactgt 14641 gcggcctggc aattttaacc aagacttcta tgatttcgtg gtatctaaag gtttctttaa 14701 ggagggctct tcagtgacgc tcaaacattt tttctttgct caagatggta atgctgctat 14761 tacagattat aattactatt cttataatct gcctactatg tgtgacatca aacaaatgtt 14821 gttctgcatg gaagttgtaa acaagtactt cgaaatctat gacggtggtt gtcttaatgc 14881 ttctgaagtg gttgttaata atttagacaa gagtgctggc catcctttta ataagtttgg 14941 caaagctcgt gtctattatg agagcatgtc ttaccaggag caagatgaac tttttgccat 15001 gacaaagcgt aacgtcattc ctaccatgac tcaaatgaat ctaaaatatg ctattagtgc 15061 taagaataga gctcgcactg ttgcaggcgt gtccatactt agcacaatga ctaatcgcca 15121 gtaccatcag aaaatgctta agtccatggc tgcaactcgt ggagcgactt gcgtcattgg 15181 tactacaaag ttctacggtg gctgggattt catgcttaaa acattgtaca aagatgttga 15241 taatccgcat cttatgggtt gggattaccc taagtgtgat agagctatgc ctaatatgtg 15301 tagaatcttc gcttcactca tattagctcg taaacatggc acttgttgta ctacaaggga 15361 cagattttat cgcttggcaa atgagtgtgc tcaggtgcta agcgaatatg ttctatgtgg 15421 tggtggttac tacgtcaaac ctggaggtac cagtagcgga gatgccacca ctgcatatgc 15481 caatagtgtc tttaacattt tgcaggcgac aactgctaat gtcagtgcac ttatgggtgc 15541 taatggcaac aagattgttg acaaagaagt taaagacatg cagtttgatt tgtatgtcaa 15601 tgtttacagg agcactagcc cagaccccaa atttgttgat aaatactatg cttttcttaa 15661 taagcacttt tctatgatga tactgtctga tgacggtgtc gtttgctata atagtgatta 15721 tgcagctaag ggttacattg ctggaataca gaattttaag gaaacgctgt attatcagaa 15781 caatgtcttt atgtctgaag ctaaatgctg ggtggaaacc gatctgaaga aagggccaca 15841 tgaattctgt tcacagcata cgctttatat taaggatggc gacgatggtt acttccttcc 15901 ttatccagac ccttcaagaa ttttgtctgc cggttgcttt gtagatgata tcgttaagac 15961 tgacggtaca ctcatggtag agcggtttgt gtctttggct atagatgctt accctctcac 16021 aaagcatgaa gatatagaat accagaatgt attctgggtc tacttacagt atatagaaaa 16081 actgtataaa gaccttacag gacacatgct tgacagttat tctgtcatgc tatgtggtga 16141 taattctgct aagttttggg aagaggcatt ctatagagat ctctatagtt cgcctaccac 16201 tttgcaggct gtcggttcat gcgttgtatg ccattcacag acttccctac gctgtgggac 16261 atgcatccgt agaccatttc tctgctgtaa atgctgctat gatcatgtta tagcaactcc 16321 acataagatg gttttgtctg tttctcctta cgtttgtaat gcccctggtt gtggcgtttc 16381 agacgttact aagctatatt taggtggtat gagctacttt tgtgtagatc atagacctgt 16441 gtgtagtttt ccactttgcg ctaatggtct tgtattcggc ttatacaaga atatgtgcac 16501 aggtagtcct tctatagttg aatttaatag gttggctacc tgtgactgga ctgaaagtgg 16561 tgattacacc cttgccaata ctacaacaga accactcaaa ctttttgctg ctgagacttt 16621 acgtgccact gaagaggcgt ctaagcagtc ttatgctatt gccaccatca aagaaattgt 16681 tggtgagcgc caactattac ttgtgtggga ggctggcaag tccaaaccac cactcaatcg 16741 taattatgtt tttactggtt atcatataac caaaaatagt aaagtgcagc tcggtgagta 16801 cattttcgag cgcattgatt atagtgatgc tgtatcctac aagtctagta caacgtataa 16861 actgactgta ggtgacatct tcgtacttac ctctcactct gtggctacct tgacggcgcc 16921 cacaattgtg aatcaagaga ggtatgttaa aattactggg ttgtacccaa ccattacggt 16981 acctgaagag ttcgcaagtc atgttgccaa cttccaaaaa tcaggttata gtaaatatgt 17041 cactgttcag ggaccacctg gcactggcaa aagtcatttt gctatagggt tagcgattta 17101 ctaccctaca gcacgtgttg tttatacagc atgttcacac gcagctgttg atgctttgtg 17161 tgaaaaagct tttaaatatt tgaacattgc taaatgttcc cgtatcattc ctgcaaaggc 17221 acgtgttgag tgctatgaca ggtttaaagt taatgagaca aattctcaat atttgtttag 17281 tactattaat gctctaccag aaacttctgc cgatattctg gtggttgatg aggttagtat 17341 gtgcactaat tatgatcttt caattattaa tgcacgtatt aaagctaagc acattgtcta 17401 tgtaggagat ccagcacagt tgccagctcc taggactttg ttgactagag gcacattgga 17461 accagaaaat ttcaatagtg tcactagatt gatgtgtaac ttaggtcctg acatattttt 17521 aagtatgtgc tacaggtgtc ctaaggaaat agtaagcact gtgagcgctc ttgtctacaa 17581 taataaattg ttagccaaga aggagctttc aggccagtgc tttaaaatac tctataaggg 17641 caatgtgacg catgatgcta gctctgccat taatagacca caactcacat ttgtgaagaa 17701 ttttattact gccaatccgg catggagtaa ggcagtcttt atttcgcctt acaattcaca 17761 gaatgctgtg tctcgttcaa tgctgggtct taccactcag actgttgatt cctcacaggg 17821 ttcagaatac cagtacgtta tcttctgtca aacagcagat acggcacatg ctaacaacat 17881 taacagattt aatgttgcaa tcactcgtgc ccaaaaaggt attctttgtg ttatgacatc 17941 tcaggcactc tttgagtcct tagagtttac tgaattgtct tttactaatt acaagctcca 18001 gtctcagatt gtaactggcc tttttaaaga ttgctctaga gaaacttctg gcctctcacc 18061 tgcttatgca ccaacatatg ttagtgttga tgacaagtat aagacgagtg atgagctttg 18121 cgtgaatctt aatttacccg caaatgtccc atactctcgt gttatttcca ggatgggctt 18181 taaactcgat gcaacagttc ctggatatcc taagcttttc attactcgtg aagaggctgt 18241 aaggcaagtt cgaagctgga taggcttcga tgttgagggt gctcatgctt cccgtaatgc 18301 atgtggcacc aatgtgcctc tacaattagg attttcaact ggtgtgaact ttgttgttca 18361 gccagttggt gttgtagaca ctgagtgggg taacatgtta acgggcattg ctgcacgtcc 18421 tccaccaggt gaacagttta agcacctcgt gcctcttatg cataaggggg ctgcgtggcc 18481 tattgttaga cgacgtatag tgcaaatgtt gtcagacact ttagacaaat tgtctgatta 18541 ctgtacgttt gtttgttggg ctcatggctt tgaattaacg tctgcatcat acttttgcaa 18601 gataggtaag gaacagaagt gttgcatgtg caatagacgc gctgcagcgt actcttcacc 18661 tctgcaatct tatgcctgct ggactcattc ctgcggttat gattatgtct acaacccttt 18721 ctttgtcgat gttcaacagt ggggttatgt aggcaatctt gctactaatc acgatcgtta 18781 ttgctctgtc catcaaggag ctcatgtggc ttctaatgat gcaataatga ctcgttgttt 18841 agctattcat tcttgtttta tagaacgtgt ggattgggat atagagtatc cttatatctc 18901 acatgaaaag aaattgaatt cctgttgtag aatcgttgag cgcaacgtcg tacgtgctgc 18961 tcttcttgcc ggttcatttg acaaagtcta tgatattggc aatcctaaag gaattcctat 19021 tgttgatgac cctgtggttg attggcatta ttttgatgca cagcccttga ccaggaaggt 19081 acaacagctt ttctatacag aggacatggc ctcaagattt gctgatgggc tctgcttatt 19141 ttggaactgt aatgtaccaa aatatcctaa taatgcaatt gtatgcaggt ttgacacacg 19201 tgtgcattct gagttcaatt tgccaggttg tgatggcggt agtttgtatg ttaacaagca 19261 cgcttttcat acaccagcat atgatgtgag tgcattccgt gatctgaaac ctttaccatt 19321 cttttattat tctactacac catgtgaagt gcatggtaat ggtagtatga tagaggatat 19381 tgattatgta cccctaaaat ctgcagtctg tattacagct tgtaatttag ggggcgctgt 19441 ttgtaggaag catgctacag agtacagaga gtatatggaa gcatataatc ttgtctctgc 19501 atcaggtttc cgcctttggt gttataagac ctttgatatt tataatctct ggtctacttt 19561 tacaaaagtt caaggtttgg aaaacattgc ttttaatgtt gttaaacaag gccattttat 19621 tggtgttgag ggtgaactac ctgtagctgt agtcaatgat aagatcttca ccaagagtgg 19681 cgttaatgac atttgtatgt ttgagaataa aaccactttg cctactaata tagcttttga 19741 actctatgct aagcgtgctg tacgctcgca tcccgatttc aaattgctac acaatttaca 19801 agcagacatt tgctacaagt tcgtcctttg ggattatgaa cgtagcaata tttatggtac 19861 tgctactatt ggtgtatgta agtacactga tattgatgtt aattcagctt tgaatatatg 19921 ttttgacata cgcgataatt gttcattgga gaagttcatg tctactccca atgccatctt 19981 tatttctgat agaaaaatca agaaataccc ttgtatggta ggtcctgatt atgcttactt 20041 caatggtgct atcatccgtg atagtgatgt tgttaaacaa ccagtgaagt tctacttgta 20101 taagaaagtc aataatgagt ttattgatcc tactgagtgt atttacactc agagtcgctc 20161 ttgtagtgac ttcctacccc tttctgacat ggagaaagac tttctatctt ttgatagtga 20221 tgttttcatt aagaagtatg gcttggaaaa ctatgctttt gagcacgtag tctatggaga 20281 cttctctcat actacgttag gcggtcttca cttgcttatt ggtttataca agaagcaaca 20341 ggaaggtcat attattatgg aagaaatgct aaaaggtagc tcaactattc ataactattt 20401 tattactgag actaacacag cggcttttaa ggcggtgtgt tctgttatag atttaaagct 20461 tgacgacttt gttatgattt taaagagtca agaccttggc gtagtatcca aggttgtcaa 20521 ggttcctatt gacttaacaa tgattgagtt tatgttatgg tgtaaggatg gacaggttca 20581 aaccttctac cctcgactcc aggcttctgc agattggaaa cctggtcatg caatgccatc 20641 cctctttaaa gttcaaaatg taaaccttga acgttgtgag cttgctaatt acaagcaatc 20701 tattcctatg cctcgcggtg tgcacatgaa catcgctaaa tatatgcaat tgtgccagta 20761 tttaaatact tgcacattag ccgtgcctgc caatatgcgt gttatacatt ttggcgctgg 20821 ttctgataaa ggtatcgctc ctggtacctc agttttacga cagtggcttc ctacagatgc 20881 cattattata gataatgatt taaatgagtt cgtgtcagat gctgacataa ctttatttgg 20941 agattgtgta actgtacgtg tcggccaaca agtggatctt gttatttccg acatgtatga 21001 tcctactact aagaatgtaa caggtagtaa tgagtcaaag gctttattct ttacttacct 21061 gtgtaacctc attaataata atcttgctct tggtgggtct gttgctatta aaataacaga 21121 acactcttgg agcgttgaac tttatgaact tatgggaaaa tttgcttggt ggactgtttt 21181 ctgcaccaat gcaaatgcat cctcatctga aggattcctc ttaggtatta attacttggg 21241 tactattaaa gaaaatatag atggtggtgc tatgcacgcc aactatatat tttggagaaa 21301 ttccactcct atgaatctga gtacttactc actttttgat ttatccaagt ttcaattaaa 21361 attaaaagga acaccagttc ttcaattaaa ggagagtcaa attaacgaac tcgtaatatc 21421 tctcctgtcg cagggtaagt tacttatccg tgacaatgat acactcagtg tttctactga 21481 tgttcttgtt aacacctaca gaaagttacg ttgatgtagg gccagattct gttaagtctg 21541 cttgtattga ggttgatata caacagactt tctttgataa aacttggcct aggccaattg 21601 atgtttctaa ggctgacggt attatatacc ctcaaggccg tacatattct aacataacta 21661 tcacttatca aggtcttttt ccctatcagg gagaccatgg tgatatgtat gtttactctg 21721 caggacatgc tacaggcaca actccacaaa agttgtttgt agctaactat tctcaggacg 21781 tcaaacagtt tgctaatggg tttgtcgtcc gtataggagc agctgccaat tccactggca 21841 ctgttattat tagcccatct accagcgcta ctatacgaaa aatttaccct gcttttatgc 21901 tgggttcttc agttggtaat ttctcagatg gtaaaatggg ccgcttcttc aatcatactc 21961 tagttctttt gcccgatgga tgtggcactt tacttagagc tttttattgt attctagagc 22021 ctcgctctgg aaatcattgt cctgctggca attcctatac ttcttttgcc acttatcaca 22081 ctcctgcaac agattgttct gatggcaatt acaatcgtaa tgccagtctg aactctttta 22141 aggagtattt taatttacgt aactgcacct ttatgtacac ttataacatt accgaagatg 22201 agattttaga gtggtttggc attacacaaa ctgctcaagg tgttcacctc ttctcatctc 22261 ggtatgttga tttgtacggc ggcaatatgt ttcaatttgc caccttgcct gtttatgata 22321 ctattaagta ttattctatc attcctcaca gtattcgttc tatccaaagt gatagaaaag 22381 cttgggctgc cttctacgta tataaacttc aaccgttaac tttcctgttg gatttttctg 22441 ttgatggtta tatacgcaga gctatagact gtggttttaa tgatttgtca caactccact 22501 gctcatatga atccttcgat gttgaatctg gagtttattc agtttcgtct ttcgaagcaa 22561 aaccttctgg ctcagttgtg gaacaggctg aaggtgttga atgtgatttt tcacctcttc 22621 tgtctggcac acctcctcag gtttataatt tcaagcgttt ggtttttacc aattgcaatt 22681 ataatcttac caaattgctt tcactttttt ctgtgaatga ttttacttgt agtcaaatat 22741 ctccagcagc aattgctagc aactgttatt cttcactgat tttggattac ttttcatacc 22801 cacttagtat gaaatccgat ctcagtgtta gttctgctgg tccaatatcc cagtttaatt 22861 ataaacagtc cttttctaat cccacatgtt tgattttagc gactgttcct cataacctta 22921 ctactattac taagcctctt aagtacagct atattaacaa gtgctctcgt cttctttctg 22981 atgatcgtac tgaagtacct cagttagtga acgctaatca atactcaccc tgtgtatcca 23041 ttgtcccatc cactgtgtgg gaagacggtg attattatag gaaacaacta tctccacttg 23101 aaggtggtgg ctggcttgtt gctagtggct caactgttgc catgactgag caattacaga 23161 tgggctttgg tattacagtt caatatggta cagacaccaa tagtgtttgc cccaagcttg 23221 aatttgctaa tgacacaaaa attgcctctc aattaggcaa ttgcgtggaa tattccctct 23281 atggtgtttc gggccgtggt gtttttcaga attgcacagc tgtaggtgtt cgacagcagc 23341 gctttgttta tgatgcgtac cagaatttag ttggctatta ttctgatgat ggcaactact 23401 actgtttgcg tgcttgtgtt agtgttcctg tttctgtcat ctatgataaa gaaactaaaa 23461 cccacgctac tctatttggt agtgttgcat gtgaacacat ttcttctacc atgtctcaat 23521 actcccgttc tacgcgatca atgcttaaac ggcgagattc tacatatggc ccccttcaga 23581 cacctgttgg ttgtgtccta ggacttgtta attcctcttt gttcgtagag gactgcaagt 23641 tgcctcttgg tcaatctctc tgtgctcttc ctgacacacc tagtactctc acacctcgca 23701 gtgtgcgctc tgttccaggt gaaatgcgct tggcatccat tgcttttaat catcctattc 23761 aggttgatca acttaatagt agttatttta aattaagtat acccactaat ttttcctttg 23821 gtgtgactca ggagtacatt cagacaacca ttcagaaagt tactgttgat tgtaaacagt 23881 acgtttgcaa tggtttccag aagtgtgagc aattactgcg cgagtatggc cagttttgtt 23941 ccaaaataaa ccaggctctc catggtgcca atttacgcca ggatgattct gtacgtaatt 24001 tgtttgcgag cgtgaaaagc tctcaatcat ctcctatcat accaggtttt ggaggtgact 24061 ttaatttgac acttctagaa cctgtttcta tatctactgg cagtcgtagt gcacgtagtg 24121 ctattgagga tttgctattt gacaaagtca ctatagctga tcctggttat atgcaaggtt 24181 acgatgattg catgcagcaa ggtccagcat cagctcgtga tcttatttgt gctcaatatg 24241 tggctggtta caaagtatta cctcctctta tggatgttaa tatggaagcc gcgtatactt 24301 catctttgct tggcagcata gcaggtgttg gctggactgc tggcttatcc tcctttgctg 24361 ctattccatt tgcacagagt atcttttata ggttaaacgg tgttggcatt actcaacagg 24421 ttctttcaga gaaccaaaag cttattgcca ataagtttaa tcaggctctg ggagctatgc 24481 aaacaggctt cactacaact aatgaagctt ttcagaaggt tcaggatgct gtgaacaaca 24541 atgcacaggc tctatccaaa ttagctagcg agctatctaa tacttttggt gctatttccg 24601 cctctattgg agacatcata caacgtcttg atgttctcga acaggacgcc caaatagaca 24661 gacttattaa tggccgtttg acaacactaa atgcttttgt tgcacagcag cttgttcgtt 24721 ccgaatcagc tgctctttcc gctcaattgg ctaaagataa agtcaatgag tgtgtcaagg 24781 cacaatccaa gcgttctgga ttttgcggtc aaggcacaca tatagtgtcc tttgttgtaa 24841 atgcccctaa tggcctttac ttcatgcatg ttggttatta ccctagcaac cacattgagg 24901 ttgtttctgc ttatggtctt tgcgatgcag ctaaccctac taattgtata gcccctgtta 24961 atggctactt tattaaaact aataacacta ggattgttga tgagtggtca tatactggct 25021 cgtccttcta tgcacctgag cccattacct cccttaatac taagtatgtt gcaccacagg 25081 tgacatacca aaacatttct actaacctcc ctcctcctct tctcggcaat tccaccggga 25141 ttgacttcca agatgagttg gatgagtttt tcaaaaatgt tagcaccagt atacctaatt 25201 ttggttccct aacacagatt aatactacat tactcgatct tacctacgag atgttgtctc 25261 ttcaacaagt tgttaaagcc cttaatgagt cttacataga ccttaaagag cttggcaatt 25321 atacttatta caacaaatgg ccgtggtaca tttggcttgg tttcattgct gggcttgttg 25381 ccttagctct atgcgtcttc ttcatactgt gctgcactgg ttgtggcaca aactgtatgg 25441 gaaaacttaa gtgtaatcgt tgttgtgata gatacgagga atacgacctc gagccgcata 25501 aggttcatgt tcactaatta acgaactatt aatgagagtt caaagaccac ccactctctt 25561 gttagtgttt tcactctctc ttttggtcac tgcatcctca aaacctctct atgtacctga 25621 gcattgtcag aattattctg gttgcatgct tagggcttgt attaaaactg cccaagctga 25681 tacagctggt ctttatacaa attttcgaat tgacgtccca tctgcagaat caactggtac 25741 tcaatcagtt tctgtcgatc ttgagtcaac ttcaactcat gatggtccta ccgaacatgt 25801 tactagtgtg aatctttttg acgttggtta ctcagttaat taacgaactc tatggattac 25861 gtgtctctgc ttaatcaaat ttggcagaag taccttaact caccgtatac tacttgtttg 25921 tacatcccta aacccacagc taagtataca cctttagttg gcacttcatt gcaccctgtg 25981 ctgtggaact gtcagctatc ctttgctggt tatactgaat ctgctgttaa ttctacaaaa 26041 gctttggcca aacaggacgc agctcagcga atcgcttggt tgctacataa ggatggagga 26101 atccctgatg gatgttccct ctacctccgg cactcaagtt tattcgcgca aagcgaggaa 26161 gaggagccat tctccaacta agaaactgcg ctacgttaag cgtagatttt ctcttctgcg 26221 ccatgaagac cttagtgtta ttgtccaacc aacacactat gtcagggtta cattttcaga 26281 ccccaacatg tggtatctac gttcgggtca tcatttacac tcagttcaca attggcttaa 26341 accttatggc ggccaacctg tttctgagta ccatattact ctagctttgc taaatctcac 26401 tgatgaagat ttagctagag atttttcacc cattgcgctc tttttgcgca atgtcagatt 26461 tgagctacat gagttcgcct tgctgcgcaa aactcttgtt cttaatgcat cagagatcta 26521 ctgtgctaac atacatagat ttaagcctgt gtatagagtt aacacggcaa tccctactat 26581 taaggattgg cttctcgttc agggattttc cctttaccat agtggcctcc ctttacatat 26641 gtcaatctct aaattgcatg cactggatga tgttactcgc aattacatca ttacaatgcc 26701 atgctttaga acttaccctc aacaaatgtt tgttactcct ttggccgtag atgttgtctc 26761 catacggtct tccaatcagg gtaataaaca aattgttcat tcttatccca ttttacatca 26821 tccaggattt taacgaacta tggctttctc ggcgtcttta tttaaacccg tccagctagt 26881 cccagtttct cctgcatttc atcgcattga gtctactgac tctattgttt tcacatacat 26941 tcctgctagc ggctatgtag ctgctttagc tgtcaatgtg tgtctcattc ccctattatt 27001 actgctacgt caagatactt gtcgtcgcag cattatcaga actatggttc tctatttcct 27061 tgttctgtat aactttttat tagccattgt actagtcaat ggtgtacatt atccaactgg 27121 aagttgcctg atagccttct tagttatcct cataatactt tggtttgtag atagaattcg 27181 tttctgtctc atgctgaatt cctacattcc actgtttgac atgcgttccc actttattcg 27241 tgttagtaca gtttcttctc atggtatggt ccctgtaata cacaccaaac cattatttat 27301 tagaaacttc gatcagcgtt gcagctgttc tcgttgtttt tatttgcact cttccactta 27361 tatagagtgc acttatatta gccgttttag taagattagc ctagtttctg taactgactt 27421 ctccttaaac ggcaatgttt ccactgtttt cgtgcctgca acgcgcgatt cagttcctct 27481 tcacataatc gccccgagct cgcttatcgt ttaagcagct ctgcgctact atgggtcccg 27541 tgtagaggct aatccattag tctctctttg gacatatgga aaacgaacta tgttaccctt 27601 tgtccaagaa cgaatagggt tgttcatagt aaactttttc atttttaccg tagtatgtgc 27661 tataacactc ttggtgtgta tggctttcct tacggctact agattatgtg tgcaatgtat 27721 gacaggcttc aataccctgt tagttcagcc cgcattatac ttgtataata ctggacgttc 27781 agtctatgta aaattccagg atagtaaacc ccctctacca cctgacgagt gggtttaacg 27841 aactccttca taatgtctaa tatgacgcaa ctcactgagg cgcagattat tgccattatt 27901 aaagactgga actttgcatg gtccctgatc tttctcttaa ttactatcgt actacagtat 27961 ggatacccat cccgtagtat gactgtctat gtctttaaaa tgtttgtttt atggctccta 28021 tggccatctt ccatggcgct atcaatattt agcgccgttt atccaattga tctagcttcc 28081 cagataatct ctggcattgt agcagctgtt tcagctatga tgtggatttc ctactttgtg 28141 cagagtatcc ggctgtttat gagaactgga tcatggtggt cattcaatcc tgagactaat 28201 tgccttttga acgttccatt tggtggtaca actgtcgtac gtccactcgt agaggactct 28261 accagtgtaa ctgctgttgt aaccaatggc cacctcaaaa tggctggcat gcatttcggt 28321 gcttgtgact acgacagact tcctaatgaa gtcaccgtgg ccaaacccaa tgtgctgatt 28381 gctttaaaaa tggtgaagcg gcaaagctac ggaactaatt ccggcgttgc catttaccat 28441 agatataagg caggtaatta caggagtccg cctattacgg cggatattga acttgcattg 28501 cttcgagctt aggctcttta gtaagagtat cttaattgat tttaacgaat ctcaatttca 28561 ttgttatggc atcccctgct gcacctcgtg ctgtttcctt tgccgataac aatgatataa 28621 caaatacaaa cctatctcga ggtagaggac gtaatccaaa accacgagct gcaccaaata 28681 acactgtctc ttggtacact gggcttaccc aacacgggaa agtccctctt acctttccac 28741 ctgggcaggg tgtacctctt aatgccaatt ctacccctgc gcaaaatgct gggtattggc 28801 ggagacagga cagaaaaatt aataccggga atggaattaa gcaactggct cccaggtggt 28861 acttctacta cactggaact ggacccgaag cagcactccc attccgggct gttaaggatg 28921 gcatcgtttg ggtccatgaa gatggcgcca ctgatgctcc ttcaactttt gggacgcgga 28981 accctaacaa tgattcagct attgttacac aattcgcgcc cggtactaag cttcctaaaa 29041 acttccacat tgaggggact ggaggcaata gtcaatcatc ttcaagagcc tctagcttaa 29101 gcagaaactc ttccagatct agttcacaag gttcaagatc aggaaactct acccgcggca 29161 cttctccagg tccatctgga atcggagcag taggaggtga tctactttac cttgatcttc 29221 tgaacagact acaagccctt gagtctggca aagtaaagca atcgcagcca aaagtaatca 29281 ctaagaaaga tgctgctgct gctaaaaata agatgcgcca caagcgcact tccaccaaaa 29341 gtttcaacat ggtgcaagct tttggtcttc gcggaccagg agacctccag ggaaactttg 29401 gtgatcttca attgaataaa ctcggcactg aggacccacg ttggccccaa attgctgagc 29461 ttgctcctac agccagtgct tttatgggta tgtcgcaatt taaacttacc catcagaaca 29521 atgatgatca tggcaaccct gtgtacttcc ttcggtacag tggagccatt aaacttgacc 29581 caaagaatcc caactacaat aagtggttgg agcttcttga gcaaaatatt gatgcctaca 29641 aaaccttccc taagaaggaa aagaaacaaa aggcaccaaa agaagaatca acagaccaaa 29701 tgtctgaacc tccaaaggag cagcgtgtgc aaggtagcat cactcagcgc actcgcaccc 29761 gtccaagtgt tcagcctggt ccaatgattg atgttaacac tgattagtgt cactcaaagt 29821 aacaagatcg cggcaatcgt ttgtgtttgg caaccccatc tcaccatcgc ttgtccactc 29881 ttgcacagaa tggaatcatg ttgtaattac agtgcaataa ggtaattata acccatttaa 29941 ttgatagcta tgctttatta aagtgtgtag ctgtagagag aatgttaaag actgtcacct 30001 ctgcttgatt gcaagtgaac agtgcccccc gggaagagct ctacagtgtg aaatgtaaat 30061 aaaaaatagc tattattcaa ttagattagg ctaattagat gatttgcaaa aaaaaaaaa //