Gene list
Applied filters:
COG category: Unclassified
Organism: Mycobacterium ulcerans, AGY99
Gene type: CDS
Number of genes found: 58
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Mycobacterium ulcerans, AGY99 >MUP002c MUP002c, hypothetical protein MTNAPEFKRQCATAGSAEQEALRRKRRQGSENRNMAGRLQVRFDTTKLRE LEAAAGTKLPALIRQCGDLLTELVPVAEQQGIDPVELIRRTAASLLDRTE LLSA >MUP003 MUP003, hypothetical protein MSRKWRRTLLTYPLLRKNGTVSSPSEPSAYTLADAGKQLGVTAQRVSQML SSGELTGPQYPQQRVPKNAVRVWKWSLDQELARRRSTPPPRHSRRQQPDP GADQGYLAQWWSRREARVNAAAHELKVAADLARQHASDERRKARQLMSKL ARLTIQQAELIERLKDDLAVESERTEQIIDAYSDALTQLLAPDFGPPP >MUP004c MUP004c, hypothetical protein MSKLTPQELPEADLADLTKPKSRGAGLADLVNAAQPRPASAPEQAAVEVP VEDRPVAATPSKTSDVSAAKVPARKPGRPRTRSKPTTAVYVSPGVKSRFE KYRHNVKSTNLHVVLEAISIKYQAGELAEIIEKSRYSTAPTHDLFPADPS AVRYLGGGSAQIAFSLTAEQEEVIDTIGSKLGFDTRSTWIAPVLNAFLPG RKDA >MUP006c MUP006c, hypothetical protein MSTQLKQLRAFLAAAGYPDPAAVTQERTIWAAVQHLTPTSRICSASSMRQ AATVIDPEILKAVQTVYSTDLGLPEEWTPAQRMEFLTAEADKISSMAASM AETLWEQAITAWTRSRNGQTPNHATKVALLGDARAQAVTRVLNSELYELI ETEETDETDMSPPQSGQPIPHRNQVDWQQRWTNPIYQSDPTPDLKALIDR LWPASDFSAPRRWWPPCARSLTSLRPPVVRRWEWEELAGDDRPHGKAK >MUP007c MUP007c, conserved hypothetical protein MNAIQHVGHDDRCRGQMMGRRRHRFEPQHVNTTIRQTNQSIGILESQETP PGVGMAAQARRPTVGQPARACGA >MUP009 MUP009, hypothetical protein MRLGFVDGCGLAGLASHSTRSSRSVAIRSTILYISSGVHGSSRMAQLRTA QRPLTTAQQAPPLGVAGDAVKGGWAEARRIATAVVGPWETDPTLVHYAQF DSGVVKGPGAVDFRLGSPIGDALKGHNVIAGFSSAQDRHQPL >MUP010 MUP010, hypothetical protein MVLELPNADEAVETVRAMADNSAALTFPFSAEPFPTEPFSIPRHPDTSAV TFHHDVPAPEPVGGERFSVVAFTAHGPFVLAQWADAAESADKSAQLIATT LELQEPRIDTFTPTPPDRISDLPLDPSGMLSRVLPPEEENKTVDDGVYDH LGIVHFGGDPVRRQAMFKSAGLQQAAYTVTADVYEAGDAESAQRIVAELA AEAIDDGLMAATGVQGMPKARCLEGDLAYWCVAAADRYAYTVQGHEGAVH QMVAAQYRILTGK >MUP012c MUP012c, hypothetical protein MSHSELLDPALPKSPFGKQGYRYDDVDVFLRLIADELQGSDLSESDIHSI TFRCAAPLTRGYDTESVDRFLDRIAETIARGHGDEGQ >MUP013c MUP013c, possible conserved membrane protein MRTSRVLVATLILLVGTTLVVTIVALFQPGSATHEAMPLWWFPTFTSAGL LVSVVPGVAMALALLTSKRDWAHANEYRVVLPTVAVGSLFGSLPRCLTGT SGCWAPRCCLRLGLFGSYADCHATIAAQFGLRCVVARTHSGAAATRTLSS TR >MUP014c MUP014c, putative integral membrane protein MLISGLWARLVVAPWWVRVLANGLALAGIAVVFTIGFVAHFVSQTNWAWA TAAAGATGVGYGALITSVRGPIHLRFAATVAGLTVQQRGQALTALRRGDV PSDPRVVAAASRLGALMLAYGRRTHPLQWAFLGAFAAVAVVLRITHSQPG FFCLLLVYLAATPTLIWQRRLLKRLPDRLARLHAAASPAIWAAQEQPDPS TMAQQRLLSSALTGFVAGAFAVAALVGTQYVSQPQAAAGDAQGAALLLAP VTATVGR >MUP015c MUP015c, possible secreted protein MTDNRSGRAQLVTAMLAVLVMIAITLTAYAVGARWPAHPRQHRLAPGLAA LSGERLAELLPVQTDLPPGWTRYDDPYRRTPAAGFGYHRYHSNGVDWGYQ PAECIDVRYGNGHVTPSPAAEFAQYAPGHPTEVHQVADLQIQISREFNPA LFTDMLAQVSRCRRFIARQPFGTARFTVRVLEDSHPLHGPHRFRYAVTVT YSDDDLLPSETRYVYYARTSRLVVAASDSSGNPQLLDTVFDKALHQICTR ATESRQRC >MUP016c MUP016c, hypothetical protein MMMAASPDPRSTAVRRRGSQRGLAAVALATVGVCGLLLLMVPPPSVNSLL SRSNWELATALPAIDAFPADWNYDLWSDIVQPTPAATTGSGPLTTRGSVS QSVPAGCGDVPTLVGLYDGVSRSAAMVHVDLRTDEIAKAILASSDEPEPN ARFVIWRKPNGAQLIADYLGWIGRCGSYRVTSPPPDNHTTTVTTTVSVES VTGTDAAVIATRSSISSEDSQTYHVMFYALRGVVLECATNLTGDQVDMVR RLADQTLHRLHTL >MUP017c MUP017c, possible conserved transmembrane protein MSHPYTNAQPHHYPYPNPQHPHPYPYPQRPAYPPPPQVGYYPPHAAAPWT PQPTAPSVLAASDHQPLFSVRVTKHTGLVVAFYQQSYTVSGTFAQCETAL REAQQHNLVVGWWGVASLVLWNWIAITNNRSARKALHRAAADRGYNTGGT >MUP019 MUP019, probable conserved membrane protein MLGITFGLAPRRMITMSVDTEPTMPSHTWDARPSTGPTPVDPWGPPITTM GQPRPTNPQTPEARYSWAPPPMAFQPGFGRAPGWPYPPATPNTAPHRASA RIAAAVVGAALAVIALIVLITTVGGSTKPGAALTHTTPTVSAQPPSTTTT PPPPPPIAPAALAGLLLDTDTINALINSGELAVDPKHTTTKLFTDTADHP ACGGVLVNASKQVYDGSGWVAAQTQALRDNTTRQHVVYESVISYPSARTA TKMVAQEAHNWQRCNGRSITTTATSYPAQTWFVATVDNHDGMLTALMNQE GARGWACQHALTARNNIVIDIEVCGADITHQATAIAKKVAQNVH >MUP020 MUP020, conserved hypothetical protein MSTDSQSLGNDQHLERKPTEPRAGKGCRPALQPQTDVIVMAVLSLPVVPI TPLFQGWCSL >MUP021 MUP021, possible transcriptional regulatory protein MTAANPVKLGVGLCVYDDEDKWFSGYSSNRKAAAICANCPIIVSCAERAL RLQVTDGVWVTVVMPGSRHTDALEQARARLRRVIEHY >MUP023c MUP023c, hypothetical protein MSPRPKPKAPRTPPEQLIREPDDNIDYSGTLWRVHRTEGEHILPWNTLRT FGPLPSMRWDPHPGPQPSSHADGVLYAAADVATSLAEVYQTTRVIDTRAN APTLTAWQPQRRLRLLDLSGTWLLRNTASAALLAAPRSICRRWARAIYTT WPELDGLYVPSTMTGRPNIVLWNAAADSIPTMPSFTRPLTHPLVWSIGQA AAAEIGYRIQ >MUP024c MUP024c, hypothetical protein MGGSTAPALGALLARHQIDLTVEEVLDELDSGFAAIPGAATLSTTEVDFL RANAGPGTAAVIDAWSASNERPARARIALQQLTGALSGSVSIKEAATMLG VDRSRVSRRITAKALWAFDLQGNRRIPRWQFLSNELLPGLDVIVPAIARG TTPAVLDVFMHTPQPDFDDRTPIEHLAAGGDPALVAGFIADLARW >MUP025 MUP025, putative transposase MAKPMAMHVARTPSAHVDKAGNARRYEAVLVRRSYRDGKKVRHQTLANLS KLPAHVIDVVEASLKGQALVAPESVCEITRSLPHGHVAAVGAMARTLGLP ALLGPRCRSRDLVLGLIISRVLRPASKLATLAWWADTTLGEDLNATNAST GEIYEAMDWLLARQDAIEKQLAAKHLAASVNPSRMALFDLSSSWMTGQCC DLAARGYSRDGKKGLPQIEYGLLTDPAGRPVAIRVFAGNTADPAAFTDIV EVVRERFGLDRLVLVGDRGMITSARIAALRELNNDPDTATGFGWITALRA PAIAKLAGDDGPLQLSLFDTQDLATITHPDYPSERLIACRNPLLATQRAR KRAELLTVTEAALAPIIAAVACGRLAGAGRIGVKVGKVLAKFKMAKHFHL DITDTTLTVTRDQTKIDAEAALDGIYVLRTSVTANELDPAAVVVSYKNLA NVERDFRSIKTDDLDLRPIHHRLDDRVKAHILIAMLACYLVWHLRKAWAP MTYTDENPPARENPVTPAQRSAAAKTKAARHQDANGATLRSFSGLLEHLA TLTRNDVNFTHTTNPIPMLATPPRPTTRL >MUP028c MUP028c, putative transposase MAKPMAMHVARTPSAHVDKAGNARRYEAVLVRRSYRDGKKVRHQTLANLS KLPAHVIDVVEASLKGQALVAPESVCEITRSLPHGHVAAVGAMARTLGLP ALLGPRCRSRDLVLGLIISRVLRPASKLATLAWWADTTLGEDLNATNAST GEIYEAMDWLLARQDAIEKQLAAKHLAASVNPSRMALFDLSSSWMTGQCC DLAARGYSRDGKKGLPQIEYGLLTDPAGRPVAIRVFAGNTADPAAFTDIV EVVRERFGLDRLVLVGDRGMITSARIAALRELNNDPDTATGFGWITALRA PAIAKLAGDDGPLQLSLFDTQDLATITHPDYPSERLIACRNPLLATQRAR KRAELLTVTEAALAPIIAAVACGRLAGAGRIGVKVGKVLAKFKMAKHFHL DITDTTLTVTRDQTKIDAEAALDGIYVLRTSVTANELDPAAVVVSYKNLA NVERDFRSIKTDDLDLRPIHHRLDDRVKAHILIAMLACYLVWHLRKAWAP MTYTDENPPARENPVTPAQRSAAAKTKAARHQDANGATLRSFSGLLEHLA TLTRNDVNFTHTTNPIPMLATPPRPTTRL >MUP029c MUP029c, probable transposase for the insertion element IS2404 (fragment) MTATDQHSVEVVYAICSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTF DEDRHRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTANRRL DLLNPQFPSSQAC >MUP030c MUP030c, probable transposase for the insertion element IS2404 (fragment) MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQVGVRFRRPSEKTFRA VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA THLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPDNLRWLVTVDAM HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR GHGRVKTRTLQIITTARGIGFPYAKQIIRITRER >MUP033c MUP033c, putative transposase MVLMVSVVPLADPGVCWGRSWRIIVTLVCIGDLEGLGRGQRVGKLHRRRP RPGDKWHLAEVLVKVNGITRCLWRAIDQDGNVSDVLVRSRRNAKRHLLSA IDSRTEMADRFAVGYEVTVLDAAA >MUP034c MUP034c, putative transposase MAMDFQFDFTTDSKAVKIASMVDEHIRESLLNIVERSITAERLIAGLEWV FAAAGGPPKVLRMDNGPELISQALRQFCDRKVGLCYIQPRTPWNNATSNR STTGYEGTASTAAAGPTHSKTAWSWTISTTNTIIGIVTPHWVTCPPRSRL PDAATLIAPWPIALWPVRSNDICVKGTRL >MUP035 MUP035, putative transposase MGRSSRAIIGGVDTHAATHHGAVIDSRGRLLADAEYPASGRGYAAMLTWM RSKGNLTKVGVEGTGAYGAGLARYLHEQGVEVLEVPRPDRRIRRQRGKSD PIDAEAAARTVLAGRASGASKLVDGPIEAIRMLRVARSDAVKAKTAAVNA LRAMLITTHQDANGATLRSFSGLLEHLATLTRNDVNFTHTTNPIPMLATP PPTNNAPLTSSEPPSPSPAPRSHQRQPTKPQNPQVNS >MUP037 MUP037, putative transposase MNATNASTGEIYEAMDWLLARQDAIEKQLAAKHLAASVNPSRMALFDLSS SWMTGQCCDLAARGYSRDGKKGLPQIEYGLLTDPAGRPVAIRVFAGNTAD PAAFTDIVEVVRERFGLDRLVLVGDRGMITSARIAALRELNNDPDTATGF GWITALRAPAIAKLAGDDGPLQLSLFDTQDLATITHPDYPSERLIACRNP LLATQRARKRAELLTVTEAALAPIIAAVACGRLAGAGRIGVKVGKVLAKF KMAKHFHLDITDTTLTVTRDQTKIDAEAALDGIYVLRTSVTANELDPAAV VVSYKNLANVERDFRSIKTDDLDLRPIHHRLDDRVKAHILIAMLACYLVW HLRKAWAPMTYTDENPPARENPVTPAEGTTSPGSTPRKRDTTLFANSKPW ATTSPSTERPDRSPSKRTSRQSSRQVNKFTDASSAKRNVSTPAVGAQSGN L >MUP041c MUP041c, putative transposase MVLMVSVVPLADPGVCWGRSWRIIVTLVCIGDLEGLGRGQRVGKLHRRRP RPGDKWHLAEVLVKVNGITRCLWRAIDQDGNVSDVLVRSRRNAKRHLLSA IDSRTEMADRFAVGYEVTVLDAAA >MUP042c MUP042c, putative transposase MAMDFQFDFTTDSKAVKIASMVDEHIRESLLNIVERSITAERLIAGLEWV FAAAGGPPKVLRMDNGPELISQALRQFCDRKVGLCYIQPRTPWNNATSNR STTGYEGTASTAAAGPTHSKTAWSWTISTTNTIIGIVTPHWVTCPPRSRL PDAATLIAPWPIALWPVRSNDICVKGTRL >MUP044c MUP044c, putative truncated transposase MDVDAGKELKELREQNTRLKRLPAETELVKDALREVAKDAPIDVKC >MUP046 MUP046, possible membrane protein MGWRWWLRGAGEGDGGSDEVKGALLVGGGVASIGVGLVVWVKLTSLRVRV ARCSSRPLKLRSVAPLASWWRAALVLAAAERWAGATGFSRAGGFSSV >MUP047 MUP047, probable transposase for the insertion element IS2404 (fragment) MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQVGVRFRRPSEKTFRA VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA THLVSVFAHRARLVLGQLAVAEKSNEIPCVRALLTLLPDNLRWLVTVDAM HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR GHGRVETRTLQIITTARGIGFPYAKQIIRITRER >MUP048 MUP048, probable transposase for the insertion element IS2404 (fragment) MTATDQHSVEVVYAICSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTF DEDRHRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTANRRL DLLNPQFPSSQAC >MUP051 MUP051, putative transposase MAGRKRYSAEDIVRKLRRADELAAAGSSGEQIAAELGVSAATLYNWRRAY GGMDLDAAKELKELREQNGRLKRLLADAELEKDALREVAKVKF >MUP054c MUP054c, possible integrase fragment MMAAQPRQSWSSCDLSGPDTPGPIRPLTPLPRQSLENCRNRLTPVDVALF RGLGSHRDRAIVLAMLLGGLRAGEVRRLLLADVDQGRRQLRVVGKGGRER VVPVDDAFFAELASYLSHQQQCPQSTAVIPCGYRRQRCDATGTDDADLIS RSTRRTMQLMYLPKEAS >MUP056c MUP056c, hypothetical protein MHQPADEHTAAAEVEFATTTLQNLLHQIEVGNANAAAYPRPGRYTYLVSH AWADRTTIHLVYTAPPSDITWGLVRDTRESLIDPGGWNSVDDAPRYYYLL DLDENWPGHALRQAGDDPHAIRWRGD >MUP057c MUP057c, possible lipoprotein MNLPTRRGRPRIHARTAIGSTVAALAMATVGCSSDSDVVATSMAPSSSST GVSARQSTTASVPAQDRPTGYVSRKTWTDGPWPLIIDEAVLDCQGDSLVT ITANESTYALNSAAYSQTELPDYAPAIGAHDPDKPGSYLDAGPLIERGLA LCGTPATTSTPISGSNRPAGLVERKTWTDGRWPFTVDSATLFCTKPAGPQ SERVTVVANHEMYALNGTAQDANLWPAFDPIWRDDPIAPGMKINIGPMIE RGLALCEG >MUP059c MUP059c, probable transposase for the insertion element IS2404 MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKIFRA VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA THLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPDNLRWLVTVDAM HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR GHGRVETRTLQIITAARGIGFPYAKQIIRITRERLITATDQRSVEVVYAI CSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTFDEDRHRAHTGNGAQV LATLRNTAINLHRLNGADNIAEACRITALTANRRLDLLNPQFPSSQAC >MUP062 MUP062, probable transposase for the insertion element IS2404 (fragment) MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQVGVRFRRPSEKTFRA VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA THLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPDNLRWLVTVDAM HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR GHGRVKTRTLQIITTARGIGFPYAKQIIRITRER >MUP063 MUP063, probable transposase for the insertion element IS2404 (fragment) MTATDQRSVEVVYAICSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTF DEDRHRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTANRRL DLLNPQFPSSQAC >MUP064c MUP064c, possible conserved membrane protein MTTPPPPPGWYPDPSDPEKRIYWDGAAWSRPASAATEVDKANKSKKTAIT IGVCVLVVIGLVMSMQSVSLMTGSGPVWTGVAVVAAGTAIAFFLRAARWV RVVAALILALALANAGLRRC >MUP066c MUP066c, conserved hypothetical protein MPTAVGIALWVDPVLHANESRRFDSFVVRGHYERDCDFFTGAIGTDGYGR FYISRGKELCVRPHRYALARSRGALRSHELALHECDNPLCVKISDASASS QHVVVGTQADNMRRMARMRRGGGRKPIAGDGLRVRRERSVALRAVLRERG WDRDAVEAALLGDMPTLW >MUP067c MUP067c, conserved hypothetical protein MYPGCVAPGLPQRPSPPARSKPSAWVHHCQPACRVGNEVQRSLPGVELFA ACGRVSQRQFDIDVIDYASTVVAGGVNGASLSGQQFYAGVGSAAVAGVDA SSIAAAQLARFALFSVVNVDVGQRQWLRQSHFGDRVGQCVVGGMSGAADA ALRLGEHVVALPRRSRRGYGGDHLLGLGCDPFVIDAFDLASRPGKDRASA TDTVQRSHAG >MUP068c MUP068c, conserved membrane protein MPLKLASKTQVSGHFFVRRRLAFALLRRSVSMEINPVRWHRTLLVLSAVL GVVLVVGAFVYGWFRPAGTIDASSKIVTDRSSGALFVVVGGRLHPALNLM SAQLIAGSPDRPTFVSSAQIAEWPKGPAVGIAGAPVQTPTVLSPQVSRWA VCDAAAETLRGVPVVTGIDGQLALGQGAAELGGAEALLLSYGQQVYVVAN GVRMPVDVSEPALAGPLGISAGAAPTAMSEALFDALPAGDRLVVPVVSGA GGAPGVDLGPRVVVGAVVASRDVAADTDRFYVVLADGVQEISPVVASMLR QHDSFGLPTPPRVSPDRLAKVPRPIGTTAPIAGAPPARVAAVAAPPALSA AACAASAGDSTNIIGQSLTLSETHPV >MUP070c MUP070c, conserved hypothetical protein MSRYPRVSICVTSCSGPSTGANLIRRERKKDGFRDTLIWYTTQHIAVADR DCEIWLVSTNHRDFGDKSQAVDHEACPYPLHPHLLEDLDTADLSGRVSYV RTLGRLVQHLAGKYDSQPESQREALISQLNQDKFEIALAASVDRFRLNPA AAALPLKAAYGVVHAFDRDVGSLEFVDVAMRGGGEWTAQFKQPIYATVDL TDRAAETSNVDKTLNVAGRLTVGANGHVRSMIVTSIEALPDDPMLRAWRM AWDPMSDTGFAQNIRQQIDPLSNPNTAKRIHESPMRPPEPDEPQKLASPS LDELAANDVASPSGEDSDPDENNGTTRDCRGKGVSGLIGPGVSGPERSHD DQDCRGCAAITKPRR >MUP071c MUP071c, conserved hypothetical protein MLRSPDWMELIEKAPQWDLRFAVPEVCLLEAVADVPREWRKRRSEVAKLA VGEFGLTQSQNEWLDVIDRKIDGYEEALGARLADIEADIVPIPEGVNLRD IVQRAIDRRKPYQEGEEKRRLSRHADLVYDTTHRGCGS >MUP072c MUP072c, conserved hypothetical protein MTVTPAGAGVATGSGTAHAPAMMLPPPGLGAPAAPVAASGAAGGAAAVTP AGSSATPSGSAGPAGPTGGSPAGSGAAMVVPASVVSAGTTNRSRAESPEL AAAKELVRRLRRDSDMVNYACIEWAVGVFRSEANGTTECVAMSNEGFGYI PWGVFLPRTARLLAADKLADDQFRQRWFGAADPAEVMDEYARLRASRGAH LVAMAVTADSPFGPPPGVEHAVCGRDLAGDGYVRPTLDDMGMHRLEATHP DLFARIQRLTGTEDQARLVENQVVLPLAMQMIDPVQTTSVQTPPELRQMW NELGTGDSIGSDAWQKYNIAATVFFVNVSANRPGPDGEVAVREQYRGQWV AARAMELLQGWERRPIATADMVYAAATAYPGDFAAKLERLLRGPEDGG >MUP073c MUP073c, conserved hypothetical protein MADVAAPSIAANVIITPDMVLPPAEPLEAQAAQYKTLADQVSDLAQQLRA ANAMREDAAQSPGWDVGHEKNRQLAADYGCMAGIYTAAAQYFTGVAGVYR DAQRAQRSVVNRANQELSQAKNAVQQQAIVARWHAHARALTTSAVGAATA RATVFQETAGTDITALTSRLGGTPMPRDPVLPHSGGSGIAVPPGKEKPLG LGDEPEPQEADPTGISNSNDASDAGVTHGHRPAGPREHVAGTEGTATGSP MPTPALGTPFPPGATQLPAGGGSGVRSVGFPPGLSSPGGLGSMLGTGGSS GGLGGLPASTGQLPGTQAAGLGGLPVSAPAAGGQAAQAAQLGRRFLEACR QGRGWDHYRRQPESEPRQPHRRARRRRLDWPQAALRLRGWPRPARHR >MUP074c MUP074c, possible membrane protein MSVVLHPPHVPEAAGPPPVSPPPQLLRGVRPGRGIVWGSAAALLAAAVAL VIAAMGWVAPSGGAVSTVVVPWSPPSPSAAEVAAARTQACKLWGITASAM DDASNLVAHTPGDWNAPDMQEALANEARVIAVEGAYLRRALPDHTPAAIR SGIEDYLAASFDMENATTHRQGTSRNAAIDRANAAEDRVNAACR >MUP075c MUP075c, hypothetical protein MQADETLQVTPPAAESPNVLTGAATVEFAARVPEQSALATAQGAADAVHA ASMWAPSVSASAGVSQQVPAAAAALAPHGGAVAQCNSQSLAEDLETDTQN AQDLTPHPAVSI >MUP076c MUP076c, possible membrane protein MNPQPVGQHGPPGGPGGWPAPGNGAPNNAAVLAPPAAQVPPPPGPGRLPV SRPRRRGWIAVGVMLTLAVLMALTAVVMSTIKLTAPAPTATTTTVMAPPP PPTSFSPDQLAAAKKEACHASESAATTINTAQEKYLIAARDRQSPTYGPA LANFQLVAILETQYMQQHLPPATPNNVVDSTNGYITAILALADAHTRGLS EDEAQQFVVAARKAGRQLDEACV >MUP077c MUP077c, conserved hypothetical MTVYEYPRYQADKDLTASVVTRLERVSSMLSKVLEEIDGIDVEAIGGDGD VVLSVNAHGQLTSLSLAQGCTTRYTHGGLAELINTTLGEAVNAAAAETSA VGEGQDPAALDHAVQAFIDPDSQVWKPQSR >MUP078c MUP078c, conserved hypothetical MLHVDTAGLQSVAADLGSAASALAGLAAQPLVHPPLATDAVSMSAAARLS GHGAVVASRALDGAAVLEAGAQAITQAALGYAAMDEANRAVVSLQGSPGA PTPTLVSAVTADVVAPGVPIAAPAAQPAETTAAMIEAGRPAAGDGFVSGC AALSKGFREGAVSARSAAIAVSEHLSGQAGSRISAALNRYADWAQEMAGY AEAVGQHAGDHKSRFGEVKHATPTTSEFTHRHRELQNAIQLNSSFPSPGS AAAVSQAHANLVQLTNRTHVVAAGYHTSEIPAAPSGPPPPVSPIVEPGGG QGDTTTPVPTTQSEQEPNPTESGPDNEGHDERDVDADSDDLDELGVDGEL AADPLGAGGAGSLPGMASAVPAMLTGALGAGVGMAGQIPQKLGQQLQGLA QEATQGMTGLASGLTGAEGVDIDSEGLDGALGGFGAGSGGGGGLTEPAGA GGVGDVAPAGSAPSGDGFADGIGVAAVDWCRWRLRCRGACGCRIRGNAAD VHAADGRWYGRRWRGRDPQRQRARQIH >MUP079c MUP079c, conserved hypothetical protein MFEARLTDYDTVVFEAASHDESIVVAVGRGGNALGVELQAPAMALTDAEL ANRIVKLNTLAHLRSQAALRHEWEAQHVNVSATLPTEDQVAGYEALIDF >MUP080c MUP080c, conserved hypothetical protein MRWAITQEHAADRCASARAENPHAIATAESWGPLFAEARRATVDAVNARE ATLREQEEQHRAMARQLRIAAARMEEMDAENRAALTISTD >MUP081c MUP081c, conserved hypothetical protein MTGWADVVLSEVFGGVSEVLGSPFPQTPSPHDAVSSVPITSTDQLTPLDR AKLTYMGIDPDSAGLDQINRALGRDQPLSVPPPAAPPPRPTTPAATPPNP DSPELRGAAAEAAKRLDEALARNHSAINDADDQLADAVLKASSSSAEGHQ RLQRLQQEIIDEIDKLGSSLDTPAGQQQLAEFLQGKTGDILNVLKNAGLD SDSQARVLDGLSARYRGLHHDAGSDETPASDGQPQDATTSSGDAPPSTGD GGEDGLPAADPLLDGLASDPLLSGLGIMAGPAMGALGSLPGALGSAIPSL GGGGLGGAGLGDLGSALGSALHDGAAPSELDSDEHVEPLTETPQSNPEDD QSPDPTQPAGLADHGRDHPQLGDSDDAAAPAQPLEASTQVELPDQSVRTA GSSALATAGRAVLAGENIDDAYAAAQVQLPPPGAPISAPISPGRLQFGDV GQYTDHRVMALDKDHVWLNGQVTPIDQLETGPNFLGWTRPSTTTSTLTSV TTQPAPLTPATEPS >MUP040c mlsA1, Type I modular polyketide synthase MIFGDAHQNCRGGRVLGDAVAVVGMSCRVPGASDPDALWALLRDGISVVD EIPSARWNLDGLVAHRLTDEQRSALRHGAFLDDVEGFDAAFFGINPSEAG SMDPQQRLMLELTWAALEDARIVPEHLSGSSSGVFTGAMSDDYTTAVTYR AAMTAHTFAGTHRSLIANRVSYTLGLRGPSLVIDTGQSSSLVAVHVAMES LRREETSLAIAGGIHLNLSLAAALSAAHFGALSPDGRCYTFDARANGYVR GEGGGVVVLKRLNDALADGNHIYCVIRGSSVNNDGATQDLTAPGVDGQRQ ALLQAYERAEIDPSEVQYVELHGTGTRLGDPTEAHSLHSVFGTSTVPRSP LLVGSIKTNIGHLEGAAGILGLIKTALAVHHRQLPPSLNYTVPNPKIPLE QLGLRVQTTLSEWPDLDKPLTAGVSSFSMGGTNAHLILQQPPTPDTTQTP NPTTGSDPAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDP IDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALANNGTH PLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAA ALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHA GIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTML ALQASEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQ DRRTTRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIA RHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVLTQAIT DTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQAR PLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFT GRISPRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHT PLLLAGHDTADLQITVTDTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWV LHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGP TFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALHPLLAL TQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADA ITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDT TTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPV PVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIV TRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTL TTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGTVLITG GTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVT ITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQ VLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALD ALADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATS HGLALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRR AASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAF KDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGI GALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVG NFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREA RAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSD DAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSL RNNESQLALAGGVTVMSTPAVFTDFSRQRGLAPDGRCKAFAATADGTGWG EGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRV INQALANAGLTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHTPDQPL WLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSS GTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPN PTTGSDPAVGSDSAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAH PDLDPIDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALA NNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHAL DEVAAALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHR LFTHAGIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTP GGTMLALQASEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGE HFITQDRRTTRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNL TGQIARHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVL TQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVL YCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENR GWVFTGRISPRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNE LIVHTPLLLAGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDE QPEWVLHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQG YNYGPTFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALH PLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTR TGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWP PHPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADV VVWPVPVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGT RLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNT NSDTLTTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGT VLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDL GAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTG DQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAA NTALDALADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLM PIATSHGLALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALIT TPRRRAASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESIS PATAFKDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLE QIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAG RDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGI SPREARAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAGAQSYG ATNSDDAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHL ACQSLRNNESQLALAGGVTVMSTPAIFTEFSRQRGLAPDGRCKAFAATAD GTGWGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGP SQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHT PDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPH IDWSSGTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDT TQTPNPTTGSDPAVGSDSAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQ HLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAA LHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPV FAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERVDVVQPVLFSVMVSL AALWRWAGITPDAVIGHSQGEIAAAHVAGALTLPEAAAVVALRSRVLTDL AGAGAMASVLSPEEPLTQLLARWDGKITVAAVNGPASAVVSGDTTAITEL LITCEHENIDARAIPVDYPSHSPYMEHIRHQFLDELPELTPRPSTIAMYS TVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTVAALLGAGEQVFLELS PHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISP SWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATE LAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGY SSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNT TTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDD LAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALF DAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLR VRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLL HLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTST TEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHP DTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLD TDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTRTAVLTPPDSGPWR LDTTGKGDLANLALLPTAHTALASGQIRIDVRAAGLNFHDVVVALGLIPD DGFGGEAAGVISEIGPDVYGFAVGDAVTGMTVSGAFAPSTVADHRMVMTI PARWSFPQAASIPVVFLTAYIALAEISGLSRGQRVLIHAGTGGVGMAAIQ LAHHLGAEVFATASAAKWSTLEALGVPRDHIASSRTLDFSNAFLDATNGA GVDVVLNCLSGEFVEASLALLPRGGHFVEIGKTDIRDTEVIAATHPGVIY RALDLLSVSPDHIQRTLAQLSPLFATDTLKPLPTTNYSIYQAISALRDMS QARHTGKIVLTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLL TSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHR LTAVVHTAVVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAF IMFSSMAGMIGSPGLGNYAAANTALDALADYRHRLGLPATSLAWGYWQTR TGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINT HTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTL ATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLN LSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGM ACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKT YTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIP AHTLAGTSTGVFVGAGAQSYGATNSDGAEGYAMTGGAISVMSGRIAYTLG LEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAIFTE FSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAI VAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTG TTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKM IQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVS SFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDSAVGSDPAVGVL VWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRAT ITTSIEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFV FPGQGSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPE APSLERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAG ALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITV AAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIR HQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTV RFHDTVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKD RPDAVAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPT AGDFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESA VLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDT DDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPL TPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIY AEVELPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQ VRLPYAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSL ITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYRVIAEP TQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVS SRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHA AVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDT IHIPRLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGV RHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSV PTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEH NLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWG YWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIP APINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQ QQQTLATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTH NTGLDLPPTLIFDHPTPTALTQHLHTRLTTGALVPAPVVIAAGRTEEPVA VVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDA VGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALET AGIPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGAISVMSGRIA YTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPA VFTDFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHP VLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEA HGTGTTLGDPIEAGALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAG VVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRT AAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVW PLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATIT TSIEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFP GQGSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAP SLERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGAL TLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAA VNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQ FLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRF HDTVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRP DAVAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAG DFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVL FPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDD MGRQSLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTP VPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAE VELPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVR LPYAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLIT RPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQ QLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSR IHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAV WGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIH IPRLTRTAVLTPPDSGPWRLDTTGKGDLANLALLPTAHTALASGQIRIDV RAAGLNFHDVVVALGLIPDDGFGGEAAGVISEIGPDVYGFAVGDAVTGMT VSGAFAPSTVADHRMVMTIPARWSFPQAASIPVVFLTAYIALAEISGLSR GQRVLIHAGTGGVGMAAIQLAHHLGAEVFATASAAKWSTLEALGVPRDHI ASSRTLDFSNAFLDATNGAGVDVVLNCLSGEFVEASLALLPRGGHFVEIG KTDIRDTEVIAATHPGVIYRALDLLSVSPDHIQRTLAQLSPLFATDTLKP LPTTNYSIYQAISALRDMSQARHTGKIVLTAPVVVDPEGTVLITGGTGTL GALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACD ISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPK IDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADY RHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLAL FDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRRAASAA TDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKDLGI DSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVP APVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPAD RGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDP QQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAGAQSYGATNSDDAEGY AMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNES QLALAGGVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGFGEGAAV LVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQAL ANAGLTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHTPDQPLWLGSI KSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRL LTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPNPTQTPEDCSPA QSPCATITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPVVDAID LGWSLIATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVGRARAA GETVMVFPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRLPLRQV MWGDDEGLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSVGELAA AQVAGVLSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPLLVEGV DIAALNAPGSVVISGDQAAVRLIANRLADRGYRAHELAVSHAFHSSLMEP MLEEFARLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIRRPVRF ADSVASLEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALSTDKPE SVAVLRAAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFWLDANR IGQGDPASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTVLPALS SWRRAERTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSELAKTDP VIGCAAALEAHGALVTIITIFEPDFNRSLMGASLKDIGSHISGVISFLGI HGSEFSDSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLIRCSSA ALVWGLGRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTDEDQFA IRPSGVFLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHVARWLA HKYESVDLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDRTSVEA AIAGKSLDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLSDITSN LTLSAFVMFSSVAGIWGGKSQGAYAAANAFLDSLAEKRRTLGLPATSVAW GLWAGGGMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATLTVASV NWDRFYPTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSKLAGLT ATEQRAVTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTALELRDH LQTATGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTE EPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDP DPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWE ALETAGIPAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTSVMS GRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVM STPAIFTEFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARR NNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVD AVEAHGTGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAA GAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTD HPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVG VLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHR ATITTSIEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTV FVFPGQGSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDE PEAPSLERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHV AGALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKI TVAAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEH IRHQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRN TVRFHDTVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALR KDRPDAVAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLL PTAGDFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVE SAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVT DTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHL PLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDV IYAEVELPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTG DQVRLPYAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIID SLITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIA EPTQQLPRYLHDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIH TLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWG LIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIP RLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLL LTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQH RLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSA FIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQT HTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPIN THTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQT LATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGL DLPPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVG MACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGK TYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGI PAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTSVMSGRIAYTL GLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFT EFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLA IVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGT GTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVK MIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAV SSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLS ARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSI EHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQG SQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLE RVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTLP EAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVNG PASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFLD ELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDT VAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAV AFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFS GANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPG TGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGR QSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPW PPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVEL PEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPY AFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPL TTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLP RYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHT LTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGL IRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPR LTRTAVLTPPDSGPWRLDTTGKGDLANLALLPTAHTALASGQIRIDVRAA GLNFHDVVVALGLIPDDGFGGEAAGVISEIGPDVYGFAVGDAVTGMTVSG AFAPSTVADHRMVMTIPARWSFPQAASIPVVFLTAYIALAEISGLSRGQR VLIHAGTGGVGMAAIQLAHHLGAEVFATASAAKWSTLEALGVPRDHIASS RTLDFSNAFLDATNGAGVDVVLNCLSGEFVEASLALLPRGGHFVEIGKTD IRDTEVIAATHPGVIYRALDLLSVSPDHIQRTLAQLSPLFATDTLKPLPT TNYSIYQAISALRDMSQARHTGKIVLTAPVVVDPEGTVLITGGTGTLGAL FAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACDISD PEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDA AWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRHR LGLPATSLAWGYWQTRTGVTAHLTDVDLARMTRLGLMPIATSHGLALFDA ALATGQPVSIPAPINTHTLARHARDNTLTPILSALITTPRRRAASAATDL AARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKDLGIDSL TALELRNTLTHNTGLDLPPTLIFDHPTPTALTQHLHTRLTTGALVPAPVV IAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWD VAGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRL LLEVCWEALETAGIPAHTLAGTSTGVFVGAGAQSYGATNSDDAEGYAMTG GAISVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLAL AGGVTVMSTPAVFTDFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLE RLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAG LTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHTPDQPLWLGSIKSNI GHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEP IQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNPTTGSDPAV GSDSAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVA HSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALANNGTHPLLS RGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAAALNP HLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHAGIHP DYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTMLALQA SEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQDRRT TRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIARHDQ LASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVLTQAITDTVE QAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQARPLTL PTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFTGRIS PRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHTPLLL AGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHAS AVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQG VQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALHPLLALTQPP TNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADAITVH TSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDT DTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPV PSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTR HGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTT ALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGTVLITGGT GTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTIT ACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVL APKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDAL ADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHG LALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRRAA SAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKD LGIDSLTALELRNTLTHNTGLDLPPTLIFDHPTPTALTQHLHTRLTQIES PNSEDSMLNLKNLDRIESYIFRNSGEDRAHVIANRLRSILSKWDGTRSPE LPAELHLESATDDELFSLANMFRTPTSEISPTLEGGRGVN >MUP032c mlsB, Type I modular polyketide synthase MIFGDAHQNCRGGRVLGDAVAVVGMSCRVPGASDPDALWALLRDGISVVD EIPSARWNLDGLVAHRLTDEQRSALRHGAFLDDVEGFDAAFFGINPSEAG SMDPQQRLMLELTWAALEDARIVPEHLSGSSSGVFTGAMSDDYTTAVTYR AAMTAHTFAGTHRSLIANRVSYTLGLRGPSLVIDTGQSSSLVAVHVAMES LRREETSLAIAGGIHLNLSLAAALSAAHFGALSPDGRCYTFDARANGYVR GEGGGVVVLKRLNDALADGNHIYCVIRGSSVNNDGATQDLTAPGVDGQRQ ALLQAYERAEIDPSEVQYVELHGTGTRLGDPTEAHSLHSVFGTSTVPRSP LLVGSIKTNIGHLEGAAGILGLIKTALAVHHRQLPPSLNYTVPNPKIPLE QLGLRVQTTLSEWPDLDKPLTAGVSSFSMGGTNAHLILQQPPTPDTTQTP NPTTGSDPAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDP IDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALANNGTH PLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAA ALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHA GIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTML ALQASEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQ DRRTTRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIA RHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVLTQAIT DTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQAR PLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFT GRISPRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHT PLLLAGHDTADLQITVTDTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWV LHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGP TFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALHPLLAL TQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADA ITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDT TTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPV PVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIV TRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTL TTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGTVLITG GTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVT ITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQ VLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALD ALADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATS HGLALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRR AASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAF KDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGI GALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVG NFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREA RAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSD DAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSL RNNESQLALAGGVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGWG EGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRV INQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHHHTPDQPL WLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSS GTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPNPTQTPE DCSPAQSPCATITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPV VDAIDLGWSLIATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVG RARAAGETVMVFPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRL PLRQVMWGDDEGLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSV GELAAAQVAGVLSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPL LVEGVDIAALNAPGSVVISGDQAAVRLIANRLADRGYRAHELAVSHAFHS SLMEPMLEEFARLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIR RPVRFADSVASLEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALS TDKPESVAVLRAAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFW LDANRIGQGDPASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTV LPALSSWRRAERTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSEL AKTDPVIGCAAALEAHGALVTIITIFEPDFNRSLMGASLKDIGSHISGVI SFLGIHGSEFSDSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLI RCSSAALVWGLGRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTD EDQFAIRPSGVFLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHV ARWLAHKYESVDLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDR TSVEAAIAGKSLDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLS DITSNLTLSAFVMFSSVAGIWGGKSQGAYAAANAFLDSLAEKRRTLGLPA TSVAWGLWAGGGMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATL TVASVNWDRFYPTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSK LAGLTATEQRAVTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTAL ELRDHLQTATGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIA AGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVE GLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLL EVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGA TSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAG GVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGWGEGAAVLVLERL SEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLT HDQVDAVEAHGTGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGH TQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQ WPNTDHPRTAAVSSFGISGTNAHLILQQPPTPNPTQTPEDCSPAQSPCAT ITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPVVDAIDLGWSLI ATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVGRARAAGETVMV FPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRLPLRQVMWGDDE GLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSVGELAAAQVAGV LSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPLLVEGVDIAALN APGSVVISGDQAAVRLIANRLADRGYRAHELAVSHAFHSSLMEPMLEEFA RLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIRRPVRFADSVAS LEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALSTDKPESVAVLR AAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFWLDANRIGQGDP ASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTVLPALSSWRRAE RTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSELAKTDPVIGCAA ALEAHGALVTIITIFEPDFNRSLMGASLKDIGSHISGVISFLGIHGSEFS DSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLIRCSSAALVWGL GRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTDEDQFAIRPSGV FLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHVARWLAHKYESV DLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDRTSVEAAIAGKS LDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLSDITSNLTLSAF VMFSSVAGIWGGKSQGAYAAANAFLDSLAEKRRTLGLPATSVAWGLWAGG GMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATLTVASVNWDRFY PTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSKLAGLTATEQRA VTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTALELRDHLQTATG LNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVV GMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVG KTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAG IPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGAISVMSGRIAYT LGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALTGGVTVMSTPAIF TEFSRQRGLAPDGRCKAFAATADGTGWGEGAAVLVLERLSEARRNNHPVL AIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHG TGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVV KMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAA VSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPL SARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTS IEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQ GSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSL ERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTL PEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVN GPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFL DELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHD TVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDA VAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDF SGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFP GTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMG RQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVP WPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVE LPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLP YAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRP LTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQL PRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIH TLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWG LIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIP RLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLL LTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQH RLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSA FIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQT HTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPIN THTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQT LATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGL DLPPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVG MACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGK TYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGI PAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTSVMSGRIAYTL GLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFT EFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLA IVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGT GTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVK MIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAV SSFGISGTNAHLILQQPPTPDTTQTPNPTTGSDPAVGSDPAVGVLVWPLS ARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSI EHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQG SQYPGMGADLYRQFPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQ LLDQTFYAQPALFALGTALHRLFTHAGIHPDYLLGHSIGELTAAYAAGVL SLQDAATLVTSRGRLMQSCTPGGTMLALQASEAEVQPLLEGLDHAVSIAA INGATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPHMDPILEQ FRQIAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDT VAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAV AFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFS GANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPG TGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGR QSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPW PPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVEL PEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPY AFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPL TTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLP RYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHT LTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGL IRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPR LTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLL TSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHR LTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAF IMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTH TGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINT HTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTL ATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLD LPPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGM ACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKT YTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIP AHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLG LEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTE FSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAI VAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTG TTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKM IQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVS SFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLSA RSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIE HHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGS QYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLER VDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTLPE AAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVNGP ASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFLDE LPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTV AALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVA FAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSG ANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGT GFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQ SLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWP PPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELP EDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYA FTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLT TATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYRVIAEPTQQLPR YLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTL TRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLI RSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRL TRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLT SRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRL TAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFI MFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTHT GLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTH TLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLA TLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDL PPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMA CRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTY TRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIPA HTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLGL EGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEF SRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIV AGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGT TLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMI QAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVSS FGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLSAR SAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEH HSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQ YPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERV DVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTLPEA AAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVNGPA SAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFLDEL PELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTVA ALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAF AAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGA NTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTG FVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQS LNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPP PGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPE DTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAF TGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTT ATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRY LHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLT RQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIR SAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLT RHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTS RRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLT AVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFIM FSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTHTG LTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTHT LARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLAT LVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLP PTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMAC RFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTYT RYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIPAH TLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLGLE GPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEFS RQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIVA GSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTT LGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQ AITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVSSF GISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLSARS APGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHH SENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQY PGMGADLYRQFPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQLLD QTFYAQPALFALGTALHRLFTHAGIHPDYLLGHSIGELTAAYAAGVLSLQ DAATLVTSRGRLMQSCTPGGTMLALQASEAEVQPLLEGLDHAVSIAAING ATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPHMDPILEQFRQ IAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDTVAA LLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFA AALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGAN THAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTGF VELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQSL NIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPPP GTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPED TDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFT GISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTA TGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYL HDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLTR QTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRS AQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTR HSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSR RGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTA VVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMF SSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTHTGL TAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTHTL ARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLATL VAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPP TLIFDHPTPHALTQHLHTRLTQSHTPVGPIASLLSHAIDEGKFRAGADLL MAASNLNQSFSNMAELNQLPAVTDIADASPDGLLTLICISTSENEYARLA AANIHSLTFAEIAAPGFYDAQLPNSIETSAEALATAITGAYANTSIVLVA HSIVCELAQATMTRLQDADIDLVGLVLLDPLEGTNSTEDYVETVLTRIEH INAPRVGVDGYLAALGRYLQFHEDRRIPIPETRHMTLHSDTKIDRAQTPM NLLQDEAALTALKIGNWMNDVGVALSVNLE >MUP001 rep, probable replication protein Rep MPAPSVFVGLELDTNSYTGVPCWSAGPAHWAHNTVAVAYDLRYQQIRSLM CDGGIARKTLIVIAAAMARHADWSTGRNCRPTNNQLEAATGFHQRTIQRA HECLRLLGVATEVLRGRQRTYIERMASWRMGDRHRGWASVWALHDNGEVA RVVHSLSPHLERSSVTTHTSPKTSLFTTHPGATRARESGATRRNSPDARG RRLAAQWRADPHAPPWTRRYSPTSWSAMLAAPAAAGWTARDLTALVQDWL GTGHRIPSTPARPIALLGTLLAWHTSHNSIQHRPAALEEAREAADLAAVR KRIQKQTAEHHANLAAREAGRAALGGPGHQAARIAAAAAARNAARRRTKM VAAEVASVDAAIRRARGR