GenColors

Gene list

Applied filters:

Organism: Mycobacterium ulcerans, AGY99
Gene type: CDS

Number of genes found: 81

# Mycobacterium ulcerans, AGY99

>MUP002c MUP002c, hypothetical protein
MTNAPEFKRQCATAGSAEQEALRRKRRQGSENRNMAGRLQVRFDTTKLRE
LEAAAGTKLPALIRQCGDLLTELVPVAEQQGIDPVELIRRTAASLLDRTE
LLSA
>MUP003 MUP003, hypothetical protein
MSRKWRRTLLTYPLLRKNGTVSSPSEPSAYTLADAGKQLGVTAQRVSQML
SSGELTGPQYPQQRVPKNAVRVWKWSLDQELARRRSTPPPRHSRRQQPDP
GADQGYLAQWWSRREARVNAAAHELKVAADLARQHASDERRKARQLMSKL
ARLTIQQAELIERLKDDLAVESERTEQIIDAYSDALTQLLAPDFGPPP
>MUP004c MUP004c, hypothetical protein
MSKLTPQELPEADLADLTKPKSRGAGLADLVNAAQPRPASAPEQAAVEVP
VEDRPVAATPSKTSDVSAAKVPARKPGRPRTRSKPTTAVYVSPGVKSRFE
KYRHNVKSTNLHVVLEAISIKYQAGELAEIIEKSRYSTAPTHDLFPADPS
AVRYLGGGSAQIAFSLTAEQEEVIDTIGSKLGFDTRSTWIAPVLNAFLPG
RKDA
>MUP006c MUP006c, hypothetical protein
MSTQLKQLRAFLAAAGYPDPAAVTQERTIWAAVQHLTPTSRICSASSMRQ
AATVIDPEILKAVQTVYSTDLGLPEEWTPAQRMEFLTAEADKISSMAASM
AETLWEQAITAWTRSRNGQTPNHATKVALLGDARAQAVTRVLNSELYELI
ETEETDETDMSPPQSGQPIPHRNQVDWQQRWTNPIYQSDPTPDLKALIDR
LWPASDFSAPRRWWPPCARSLTSLRPPVVRRWEWEELAGDDRPHGKAK
>MUP007c MUP007c, conserved hypothetical protein
MNAIQHVGHDDRCRGQMMGRRRHRFEPQHVNTTIRQTNQSIGILESQETP
PGVGMAAQARRPTVGQPARACGA
>MUP008c MUP008c, possible nucleic acid binding protein
MGPTAALFSQAKMPQGDSAVRQLAHSAMISPNPIPAALSARVPELAQQLV
DIVAIARIPGVRAKVAVRSRVAGINPVSVCIGWGGLRIADVEKGLGGERI
HVVAYHVDPATYVINVLGCPGTGTGAAAAAEERSRRIRVRVEAHDYPRTV
GKAGQNVRLASKLTGQKIEILVEKSCTADCFA
>MUP009 MUP009, hypothetical protein
MRLGFVDGCGLAGLASHSTRSSRSVAIRSTILYISSGVHGSSRMAQLRTA
QRPLTTAQQAPPLGVAGDAVKGGWAEARRIATAVVGPWETDPTLVHYAQF
DSGVVKGPGAVDFRLGSPIGDALKGHNVIAGFSSAQDRHQPL
>MUP010 MUP010, hypothetical protein
MVLELPNADEAVETVRAMADNSAALTFPFSAEPFPTEPFSIPRHPDTSAV
TFHHDVPAPEPVGGERFSVVAFTAHGPFVLAQWADAAESADKSAQLIATT
LELQEPRIDTFTPTPPDRISDLPLDPSGMLSRVLPPEEENKTVDDGVYDH
LGIVHFGGDPVRRQAMFKSAGLQQAAYTVTADVYEAGDAESAQRIVAELA
AEAIDDGLMAATGVQGMPKARCLEGDLAYWCVAAADRYAYTVQGHEGAVH
QMVAAQYRILTGK
>MUP012c MUP012c, hypothetical protein
MSHSELLDPALPKSPFGKQGYRYDDVDVFLRLIADELQGSDLSESDIHSI
TFRCAAPLTRGYDTESVDRFLDRIAETIARGHGDEGQ
>MUP013c MUP013c, possible conserved membrane protein
MRTSRVLVATLILLVGTTLVVTIVALFQPGSATHEAMPLWWFPTFTSAGL
LVSVVPGVAMALALLTSKRDWAHANEYRVVLPTVAVGSLFGSLPRCLTGT
SGCWAPRCCLRLGLFGSYADCHATIAAQFGLRCVVARTHSGAAATRTLSS
TR
>MUP014c MUP014c, putative integral membrane protein
MLISGLWARLVVAPWWVRVLANGLALAGIAVVFTIGFVAHFVSQTNWAWA
TAAAGATGVGYGALITSVRGPIHLRFAATVAGLTVQQRGQALTALRRGDV
PSDPRVVAAASRLGALMLAYGRRTHPLQWAFLGAFAAVAVVLRITHSQPG
FFCLLLVYLAATPTLIWQRRLLKRLPDRLARLHAAASPAIWAAQEQPDPS
TMAQQRLLSSALTGFVAGAFAVAALVGTQYVSQPQAAAGDAQGAALLLAP
VTATVGR
>MUP015c MUP015c, possible secreted protein
MTDNRSGRAQLVTAMLAVLVMIAITLTAYAVGARWPAHPRQHRLAPGLAA
LSGERLAELLPVQTDLPPGWTRYDDPYRRTPAAGFGYHRYHSNGVDWGYQ
PAECIDVRYGNGHVTPSPAAEFAQYAPGHPTEVHQVADLQIQISREFNPA
LFTDMLAQVSRCRRFIARQPFGTARFTVRVLEDSHPLHGPHRFRYAVTVT
YSDDDLLPSETRYVYYARTSRLVVAASDSSGNPQLLDTVFDKALHQICTR
ATESRQRC
>MUP016c MUP016c, hypothetical protein
MMMAASPDPRSTAVRRRGSQRGLAAVALATVGVCGLLLLMVPPPSVNSLL
SRSNWELATALPAIDAFPADWNYDLWSDIVQPTPAATTGSGPLTTRGSVS
QSVPAGCGDVPTLVGLYDGVSRSAAMVHVDLRTDEIAKAILASSDEPEPN
ARFVIWRKPNGAQLIADYLGWIGRCGSYRVTSPPPDNHTTTVTTTVSVES
VTGTDAAVIATRSSISSEDSQTYHVMFYALRGVVLECATNLTGDQVDMVR
RLADQTLHRLHTL
>MUP017c MUP017c, possible conserved transmembrane protein
MSHPYTNAQPHHYPYPNPQHPHPYPYPQRPAYPPPPQVGYYPPHAAAPWT
PQPTAPSVLAASDHQPLFSVRVTKHTGLVVAFYQQSYTVSGTFAQCETAL
REAQQHNLVVGWWGVASLVLWNWIAITNNRSARKALHRAAADRGYNTGGT
>MUP018c MUP018c, probable forkhead-associated protein
MQQPTEHTTPMDSLAPPALVIKTPHQAFTAHAAQGPVVIGRDAPAQVRIP
DERISRAHVRVTYTSTGWVAVDMSSNGTFVNGVQQATIPITDGLTIHLGH
PLEGIAISFSYTIAPPLAVDDATEHIEDTSDPKMVRVGAEIAARREELGL
PKRELDRRGVVGQATMTDIEKGRRFPRPFTRDKIEAVLGWPRGHIMWLYE
QDIQPDEERTHVLTATTPVPEITGATKMALKGFTAAADSLPPEDDPAFTD
RVGALLADLRELESVAASATHSAHGTSAAALVLSDVRKLYKALMLRAAGA
PGATLGQRFYAARYRAELTTQDAAAAAGLPVAIIEAAEADAPLDPAATAA
VQALLTSLLSQR
>MUP019 MUP019, probable conserved membrane protein
MLGITFGLAPRRMITMSVDTEPTMPSHTWDARPSTGPTPVDPWGPPITTM
GQPRPTNPQTPEARYSWAPPPMAFQPGFGRAPGWPYPPATPNTAPHRASA
RIAAAVVGAALAVIALIVLITTVGGSTKPGAALTHTTPTVSAQPPSTTTT
PPPPPPIAPAALAGLLLDTDTINALINSGELAVDPKHTTTKLFTDTADHP
ACGGVLVNASKQVYDGSGWVAAQTQALRDNTTRQHVVYESVISYPSARTA
TKMVAQEAHNWQRCNGRSITTTATSYPAQTWFVATVDNHDGMLTALMNQE
GARGWACQHALTARNNIVIDIEVCGADITHQATAIAKKVAQNVH
>MUP020 MUP020, conserved hypothetical protein
MSTDSQSLGNDQHLERKPTEPRAGKGCRPALQPQTDVIVMAVLSLPVVPI
TPLFQGWCSL
>MUP021 MUP021, possible transcriptional regulatory protein
MTAANPVKLGVGLCVYDDEDKWFSGYSSNRKAAAICANCPIIVSCAERAL
RLQVTDGVWVTVVMPGSRHTDALEQARARLRRVIEHY
>MUP022 MUP022, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVLSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIEIALDEELSEHLGYDRHDRAGYGSGNSRNGTR
SKTVLTDACGSIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVVLSLYARG
LTTGDISAHFAHIYDAAVSKDTVSRITDCVLEEMTAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRT
AWNEFIPFLDYDTEIRKVICSTNAIESLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP023c MUP023c, hypothetical protein
MSPRPKPKAPRTPPEQLIREPDDNIDYSGTLWRVHRTEGEHILPWNTLRT
FGPLPSMRWDPHPGPQPSSHADGVLYAAADVATSLAEVYQTTRVIDTRAN
APTLTAWQPQRRLRLLDLSGTWLLRNTASAALLAAPRSICRRWARAIYTT
WPELDGLYVPSTMTGRPNIVLWNAAADSIPTMPSFTRPLTHPLVWSIGQA
AAAEIGYRIQ
>MUP024c MUP024c, hypothetical protein
MGGSTAPALGALLARHQIDLTVEEVLDELDSGFAAIPGAATLSTTEVDFL
RANAGPGTAAVIDAWSASNERPARARIALQQLTGALSGSVSIKEAATMLG
VDRSRVSRRITAKALWAFDLQGNRRIPRWQFLSNELLPGLDVIVPAIARG
TTPAVLDVFMHTPQPDFDDRTPIEHLAAGGDPALVAGFIADLARW
>MUP025 MUP025, putative transposase
MAKPMAMHVARTPSAHVDKAGNARRYEAVLVRRSYRDGKKVRHQTLANLS
KLPAHVIDVVEASLKGQALVAPESVCEITRSLPHGHVAAVGAMARTLGLP
ALLGPRCRSRDLVLGLIISRVLRPASKLATLAWWADTTLGEDLNATNAST
GEIYEAMDWLLARQDAIEKQLAAKHLAASVNPSRMALFDLSSSWMTGQCC
DLAARGYSRDGKKGLPQIEYGLLTDPAGRPVAIRVFAGNTADPAAFTDIV
EVVRERFGLDRLVLVGDRGMITSARIAALRELNNDPDTATGFGWITALRA
PAIAKLAGDDGPLQLSLFDTQDLATITHPDYPSERLIACRNPLLATQRAR
KRAELLTVTEAALAPIIAAVACGRLAGAGRIGVKVGKVLAKFKMAKHFHL
DITDTTLTVTRDQTKIDAEAALDGIYVLRTSVTANELDPAAVVVSYKNLA
NVERDFRSIKTDDLDLRPIHHRLDDRVKAHILIAMLACYLVWHLRKAWAP
MTYTDENPPARENPVTPAQRSAAAKTKAARHQDANGATLRSFSGLLEHLA
TLTRNDVNFTHTTNPIPMLATPPRPTTRL
>MUP026 MUP026, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVLSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIEIALDEELSEHLGYDRHDRAGYGSGNSRNGTR
SKTVLTDACGSIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVVLSLYARG
LTTGEISAHFAHIYDAAVSKDTVSRITDCVLEEMTAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRT
AWNEFIPFLDYDTEIRKVICSTNAIESLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP027c MUP027c, putative transposase
MTRERTREIQRLEKLLEDAGIKLSSVAADLNGKSSRAMLEALLAGETDTA
VMADLAEKQLRRKIPQLTEALYGRFTAHHAFLARMHLNLIDQHTAAIEAL
TERIEVVMQPFRGFRDLICTIPGIGGFTADVVVAETGADMTKFPTAQHLA
SRAGKTPGNNESAGKVKSRRTRPGNPYLQGALGTAAMSISHTRDTYLAAK
YRRVTARRGPLRANVAVQRALLVAIWNIATTNTGYHDPGGDYFTRLNPQK
ARHNAVRQLEAMGYHVTLDRVS
>MUP028c MUP028c, putative transposase
MAKPMAMHVARTPSAHVDKAGNARRYEAVLVRRSYRDGKKVRHQTLANLS
KLPAHVIDVVEASLKGQALVAPESVCEITRSLPHGHVAAVGAMARTLGLP
ALLGPRCRSRDLVLGLIISRVLRPASKLATLAWWADTTLGEDLNATNAST
GEIYEAMDWLLARQDAIEKQLAAKHLAASVNPSRMALFDLSSSWMTGQCC
DLAARGYSRDGKKGLPQIEYGLLTDPAGRPVAIRVFAGNTADPAAFTDIV
EVVRERFGLDRLVLVGDRGMITSARIAALRELNNDPDTATGFGWITALRA
PAIAKLAGDDGPLQLSLFDTQDLATITHPDYPSERLIACRNPLLATQRAR
KRAELLTVTEAALAPIIAAVACGRLAGAGRIGVKVGKVLAKFKMAKHFHL
DITDTTLTVTRDQTKIDAEAALDGIYVLRTSVTANELDPAAVVVSYKNLA
NVERDFRSIKTDDLDLRPIHHRLDDRVKAHILIAMLACYLVWHLRKAWAP
MTYTDENPPARENPVTPAQRSAAAKTKAARHQDANGATLRSFSGLLEHLA
TLTRNDVNFTHTTNPIPMLATPPRPTTRL
>MUP029c MUP029c, probable transposase for the insertion element IS2404 (fragment)
MTATDQHSVEVVYAICSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTF
DEDRHRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTANRRL
DLLNPQFPSSQAC
>MUP030c MUP030c, probable transposase for the insertion element IS2404 (fragment)
MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQVGVRFRRPSEKTFRA
VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA
THLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPDNLRWLVTVDAM
HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR
GHGRVKTRTLQIITTARGIGFPYAKQIIRITRER
>MUP031c MUP031c, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVPSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIETALDEELSEHLGYDRHDRAGYGNGNSRNGTR
SKTVLTDACGSIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVVLSLYARG
LTTGEISAHFAHIYDAAVSKDTVSRITDRVLEEMTAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRN
AWNEFIPFLDYDTEIRKVICSTNAIEWLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP033c MUP033c, putative transposase
MVLMVSVVPLADPGVCWGRSWRIIVTLVCIGDLEGLGRGQRVGKLHRRRP
RPGDKWHLAEVLVKVNGITRCLWRAIDQDGNVSDVLVRSRRNAKRHLLSA
IDSRTEMADRFAVGYEVTVLDAAA
>MUP034c MUP034c, putative transposase
MAMDFQFDFTTDSKAVKIASMVDEHIRESLLNIVERSITAERLIAGLEWV
FAAAGGPPKVLRMDNGPELISQALRQFCDRKVGLCYIQPRTPWNNATSNR
STTGYEGTASTAAAGPTHSKTAWSWTISTTNTIIGIVTPHWVTCPPRSRL
PDAATLIAPWPIALWPVRSNDICVKGTRL
>MUP035 MUP035, putative transposase
MGRSSRAIIGGVDTHAATHHGAVIDSRGRLLADAEYPASGRGYAAMLTWM
RSKGNLTKVGVEGTGAYGAGLARYLHEQGVEVLEVPRPDRRIRRQRGKSD
PIDAEAAARTVLAGRASGASKLVDGPIEAIRMLRVARSDAVKAKTAAVNA
LRAMLITTHQDANGATLRSFSGLLEHLATLTRNDVNFTHTTNPIPMLATP
PPTNNAPLTSSEPPSPSPAPRSHQRQPTKPQNPQVNS
>MUP036c MUP036c, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVPSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIEIALDEELSEHLGYDRHDRAGYGSGNSRNGTR
SKTVLTDACGSIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVVLSLYARG
LTTGEISAHFAHIYDAAVSKDTVSRIADCVLEEMTAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRN
AWNEFIPFLDYDTEIRKVICSTNAIESLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP037 MUP037, putative transposase
MNATNASTGEIYEAMDWLLARQDAIEKQLAAKHLAASVNPSRMALFDLSS
SWMTGQCCDLAARGYSRDGKKGLPQIEYGLLTDPAGRPVAIRVFAGNTAD
PAAFTDIVEVVRERFGLDRLVLVGDRGMITSARIAALRELNNDPDTATGF
GWITALRAPAIAKLAGDDGPLQLSLFDTQDLATITHPDYPSERLIACRNP
LLATQRARKRAELLTVTEAALAPIIAAVACGRLAGAGRIGVKVGKVLAKF
KMAKHFHLDITDTTLTVTRDQTKIDAEAALDGIYVLRTSVTANELDPAAV
VVSYKNLANVERDFRSIKTDDLDLRPIHHRLDDRVKAHILIAMLACYLVW
HLRKAWAPMTYTDENPPARENPVTPAEGTTSPGSTPRKRDTTLFANSKPW
ATTSPSTERPDRSPSKRTSRQSSRQVNKFTDASSAKRNVSTPAVGAQSGN
L
>MUP038c MUP038c, possible thioesterase
MIVWPEVVSTVVDVDGVAMSALVAEPDQEPKAVILALHGGATNARYFDCP
GHRALSLLHTGAAAGFTVVALDRPGYGSSAGDPDAMNRPHQRAALAYGAL
DRILAQRPRGAGVFIMGHSNGCELAMWMATETRGAELLGIELAGTGWHYQ
PEAREILTTATGEHRWVGLYDLLWHPQRLYPPEVLNAAIISSSAPAYEEQ
MMADWTRRTFLELVPAVRVPVHFSIAQHEKVWQRDSSALDEIAVLFSGAP
RFILHEQPEAGHNISLGHTAGDYHTTVLSFVQQCLAERLANAQQDVDLAA
E
>MUP041c MUP041c, putative transposase
MVLMVSVVPLADPGVCWGRSWRIIVTLVCIGDLEGLGRGQRVGKLHRRRP
RPGDKWHLAEVLVKVNGITRCLWRAIDQDGNVSDVLVRSRRNAKRHLLSA
IDSRTEMADRFAVGYEVTVLDAAA
>MUP042c MUP042c, putative transposase
MAMDFQFDFTTDSKAVKIASMVDEHIRESLLNIVERSITAERLIAGLEWV
FAAAGGPPKVLRMDNGPELISQALRQFCDRKVGLCYIQPRTPWNNATSNR
STTGYEGTASTAAAGPTHSKTAWSWTISTTNTIIGIVTPHWVTCPPRSRL
PDAATLIAPWPIALWPVRSNDICVKGTRL
>MUP043 MUP043, putative transposase
MGRSSRAIIGGVDTHAATHHGAVIDSRGRLLADAEYPASGRGYAAMLTWM
RSKGNLTKVGVEGTGAYGAGLARYLHEQGVEVLEVPRPDRRIRRQRGKSD
PIDAEAAARTVLAGRASGASKLVDGPIEAIRMLRVARSDAVKAKTAAVNA
LRAMLITTPDTLRSQLHGLSSAQLVAAYTKLRPDAANLTDPVQAAKAALR
SIATRTRQLELESQALRTQLDGLIKSVAPATSAVFGLGPDTASALLVTIG
DNPDRLRSEAAFARLCGVAPIPASSGKTHRHRLHRGGDRTGNHALHIAAV
VRLRYDPRSRAYADRRTSEGLSKPEIIRCQKRYLAREVFDALRTDFTKLN
T
>MUP044c MUP044c, putative truncated transposase
MDVDAGKELKELREQNTRLKRLPAETELVKDALREVAKDAPIDVKC
>MUP045 MUP045, probable beta-ketoacyl synthase-like protein
MIWNDIYISGTGRFIPSMRPINDIQVDGVPNDHTIVQSDYISFTEADEPA
TVMATRAATEALTTSELVSADVGVLIYAAIIGDAHHFAPVCHVQRVLRAP
DALAFELSAASNGGTQGIAVAANLMTADASVKAALVCTAYRHPIDIISRW
SSGMVFGDGAAAAVLSRDGGMVRLISGYHGSLPELEVLARNRSNERLGFV
LPDVGLGKYLTAIARMYQAVIAQVLEEAQTSIAEIDYFGLIGIGIPSLTA
TILEPNGIPVNKTSWGLLRQMGHVGACDPLLSLNHLFEQNVLKRGDKVLL
LGGGVGYRLTCIVAEIAMNPGVPGHSTS
>MUP046 MUP046, possible membrane protein
MGWRWWLRGAGEGDGGSDEVKGALLVGGGVASIGVGLVVWVKLTSLRVRV
ARCSSRPLKLRSVAPLASWWRAALVLAAAERWAGATGFSRAGGFSSV
>MUP047 MUP047, probable transposase for the insertion element IS2404 (fragment)
MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQVGVRFRRPSEKTFRA
VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA
THLVSVFAHRARLVLGQLAVAEKSNEIPCVRALLTLLPDNLRWLVTVDAM
HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR
GHGRVETRTLQIITTARGIGFPYAKQIIRITRER
>MUP048 MUP048, probable transposase for the insertion element IS2404 (fragment)
MTATDQHSVEVVYAICSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTF
DEDRHRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTANRRL
DLLNPQFPSSQAC
>MUP049c MUP049c, putative transposase
MAAQPRQSWSSCDLSGPDTPGPIRPLTPLPRQSLPEIRRRFDPEFREGAV
RIVRETGKPIAQVARELGVNAGTLGNWVAKDRAEREGSQGLSTDGVAELK
RLRSEVAELRMERDVLKRSVVLWVKEATR
>MUP050 MUP050, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVLSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIETALDEELSEHLGYDRHDRAGYGSGNSRNGTR
SKTVLTDACGFIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVVLSLYARG
LTTGEISAHFAHIYDAAVSKDTVSRITDCVLEEMTAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YPGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRN
AWNEFIPFLDYDTEIRKVICSTNAIESLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP051 MUP051, putative transposase
MAGRKRYSAEDIVRKLRRADELAAAGSSGEQIAAELGVSAATLYNWRRAY
GGMDLDAAKELKELREQNGRLKRLLADAELEKDALREVAKVKF
>MUP052 MUP052, putative transposase
MFKKVLSISERLACKAVGLARSTYRRVRIVDTPANPDAQLRAWLRSYATK
HPCHGFRRAWSALRYDEHHQVNKKKIHRLWCQEGLQVAARSPRKRGGVAS
VPPIIADAPRMVWALDFQFDSTIDGAAIKIASMVDEHTRESLLDMTQRSI
TAEALVDELERVFTTAGGPPKVLRMDNGPELISQALQQFCHGKVGVTYIP
PGQPWNNGYVESFNSRLRKECLNRNYWTDLLEAKVVIGDFKHEHHHRHRH
SSLGYRTPAEYAAQCRHTHHPMACDIN
>MUP054c MUP054c, possible integrase fragment
MMAAQPRQSWSSCDLSGPDTPGPIRPLTPLPRQSLENCRNRLTPVDVALF
RGLGSHRDRAIVLAMLLGGLRAGEVRRLLLADVDQGRRQLRVVGKGGRER
VVPVDDAFFAELASYLSHQQQCPQSTAVIPCGYRRQRCDATGTDDADLIS
RSTRRTMQLMYLPKEAS
>MUP055 MUP055, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVPSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIETALDEELSEHLGYDRHDRAGYGNGNSRNGTR
SKTVLTDACGSIEIDVPRDRAGTFEPKTVEKRQRRLTDVDEVVLSLYARG
LTTGEISAHFAHIYDAAVSKDTVSRITDCVLEEMTAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRN
AWNEFIPFLDYDTEIRKVICSTNAIESLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP056c MUP056c, hypothetical protein
MHQPADEHTAAAEVEFATTTLQNLLHQIEVGNANAAAYPRPGRYTYLVSH
AWADRTTIHLVYTAPPSDITWGLVRDTRESLIDPGGWNSVDDAPRYYYLL
DLDENWPGHALRQAGDDPHAIRWRGD
>MUP057c MUP057c, possible lipoprotein
MNLPTRRGRPRIHARTAIGSTVAALAMATVGCSSDSDVVATSMAPSSSST
GVSARQSTTASVPAQDRPTGYVSRKTWTDGPWPLIIDEAVLDCQGDSLVT
ITANESTYALNSAAYSQTELPDYAPAIGAHDPDKPGSYLDAGPLIERGLA
LCGTPATTSTPISGSNRPAGLVERKTWTDGRWPFTVDSATLFCTKPAGPQ
SERVTVVANHEMYALNGTAQDANLWPAFDPIWRDDPIAPGMKINIGPMIE
RGLALCEG
>MUP058c MUP058c, possible site-specific recombinase
MLETNSRDASPALVHPGWFGEFLADRAIRKPSPHTTKAYRQDFEAVALLL
AGGADAISTLPTNALNKESLRAAFAVYAETHSAASIRRCWSTWNTLCTFL
YTANLLDANPMPAIGRPKVPKALPKSYTADAVNNLIAAIDTDDGSARRSN
WPERDRAIIFTALLTGLRADELIRANIGDIRCNADGGVLHVSGKGNKDRR
IPCDTKLIEMLQRYLQTREDRIPHLRKRSASPDVMGRFSPTAPLFVGSDG
ERITRGTLQYRILRGFKKAGINNDRAAGALVHGLRHTFATELANANVSVY
ALMKLLGHESMVTSQRYVDGAAADTRRAAERNPLYSLLNIETESQ
>MUP059c MUP059c, probable transposase for the insertion element IS2404
MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKIFRA
VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA
THLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPDNLRWLVTVDAM
HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR
GHGRVETRTLQIITAARGIGFPYAKQIIRITRERLITATDQRSVEVVYAI
CSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTFDEDRHRAHTGNGAQV
LATLRNTAINLHRLNGADNIAEACRITALTANRRLDLLNPQFPSSQAC
>MUP060 MUP060, probable transposase for the insertion element IS2606 (fragment)
MMTKTVVAVQPSQNHEDEAGAVAAIDVPSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIEIALDEELSEHLGYDRHDRAGYGSGNSRNGTR
SKTVLTDGCGFIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVGVVAVCPR
VDHRGDQRPFRPHLRRGGVQGHR
>MUP061 MUP061, probable transposase for the insertion element IS2606 (fragment)
MLSLYARGLTTGEISAHFAHIYDAAVSKDTVSRITDCVLEEMTAWHTRPL
ERVYAAVFIDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDG
ESAKYWLAVLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCII
HLIRGTFRYAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYP
AAIRLWRNAWNEFIPFLDYDTEIRKVICSTNAIEWLNARYRRAIRARGHF
PTEQSALKCLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSE
TT
>MUP062 MUP062, probable transposase for the insertion element IS2404 (fragment)
MALLAIAVLATAAGMRGYAGFATWAATASDDVLAQVGVRFRRPSEKTFRA
VLSRLDPADLNARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRAKATA
THLVSVFAHRARLVLGQLAVAEKSNEIPCVCALLTLLPDNLRWLVTVDAM
HTQVVTAKLICATLKSHYLMIVKSNQAKILARITALPWAEVPAAATDDSR
GHGRVKTRTLQIITTARGIGFPYAKQIIRITRER
>MUP063 MUP063, probable transposase for the insertion element IS2404 (fragment)
MTATDQRSVEVVYAICSLPFEHARPTAIMTWMRQHCGIENSLHWIRDVTF
DEDRHRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACRITALTANRRL
DLLNPQFPSSQAC
>MUP064c MUP064c, possible conserved membrane protein
MTTPPPPPGWYPDPSDPEKRIYWDGAAWSRPASAATEVDKANKSKKTAIT
IGVCVLVVIGLVMSMQSVSLMTGSGPVWTGVAVVAAGTAIAFFLRAARWV
RVVAALILALALANAGLRRC
>MUP065c MUP065c, conserved hypothetical protein
MYLRGGLRRTGDAVSSLWIHQGDVLRAYAQKHQDTADIALELPTGTGKTL
PGLLIAEWVRRKAAGPVLYATPTKQLARQVLATAHAEGVPARLLVGSHLH
WNVTDHSDVDGGEAIAITTYSSIFNSSPKLPVPNLIVFDDAHAGEQFVGE
HYSVEIRRYEDEPAYLAVLDALRPFMSGLQIQRLQGVPDPGAHHQVRLIL
PAVKPGAPARLDTTLSNLGDSYKFDFAMIWYHASTKPSPFCGNPSNRRLV
LLPFSWIIFIIHEKALGIDRSPGEPRPLAAWATTRRDASQARG
>MUP066c MUP066c, conserved hypothetical protein
MPTAVGIALWVDPVLHANESRRFDSFVVRGHYERDCDFFTGAIGTDGYGR
FYISRGKELCVRPHRYALARSRGALRSHELALHECDNPLCVKISDASASS
QHVVVGTQADNMRRMARMRRGGGRKPIAGDGLRVRRERSVALRAVLRERG
WDRDAVEAALLGDMPTLW
>MUP067c MUP067c, conserved hypothetical protein
MYPGCVAPGLPQRPSPPARSKPSAWVHHCQPACRVGNEVQRSLPGVELFA
ACGRVSQRQFDIDVIDYASTVVAGGVNGASLSGQQFYAGVGSAAVAGVDA
SSIAAAQLARFALFSVVNVDVGQRQWLRQSHFGDRVGQCVVGGMSGAADA
ALRLGEHVVALPRRSRRGYGGDHLLGLGCDPFVIDAFDLASRPGKDRASA
TDTVQRSHAG
>MUP068c MUP068c, conserved membrane protein
MPLKLASKTQVSGHFFVRRRLAFALLRRSVSMEINPVRWHRTLLVLSAVL
GVVLVVGAFVYGWFRPAGTIDASSKIVTDRSSGALFVVVGGRLHPALNLM
SAQLIAGSPDRPTFVSSAQIAEWPKGPAVGIAGAPVQTPTVLSPQVSRWA
VCDAAAETLRGVPVVTGIDGQLALGQGAAELGGAEALLLSYGQQVYVVAN
GVRMPVDVSEPALAGPLGISAGAAPTAMSEALFDALPAGDRLVVPVVSGA
GGAPGVDLGPRVVVGAVVASRDVAADTDRFYVVLADGVQEISPVVASMLR
QHDSFGLPTPPRVSPDRLAKVPRPIGTTAPIAGAPPARVAAVAAPPALSA
AACAASAGDSTNIIGQSLTLSETHPV
>MUP069c MUP069c, probable transposase for the insertion element IS2606
MMTKTVVAVQPSQNHEDEAGAVAAIDVPSSRDVDELEVARELVRQAREAG
VGLTGPGGLLKAMTKTVIETALDEELSEHLGYDRHDRAGYGSGNSRNGTR
SKTVLTDACGSIEIDVPRDRAGTFEPKIVEKRQRRLTDVDEVVLSLYARG
LTTGEISAHFAHIYDAAVSKNTVSRITDCVLEEITAWHTRPLERVYAAVF
IDALHVKIRDGQVGPRPVYAAVGVDLAGHRDVLGMWAGEGDGESAKYWLA
VLTELKNRGVADIFFLVCDGLKGLPDSVSAVFPDTVVQTCIIHLIRGTFR
YAGRQHHTAIARALKPIYTAVNAAAAAEALDAFDTEWGHRYPAAIRLWRN
AWNESIPFLDYDTEIRKVICSTNAIESLNARYRRAIRARGHFPTEQSALK
CLYLVTRSLDPTGTGQKRWTMRWKPALNAFAITFADRMPGSETT
>MUP070c MUP070c, conserved hypothetical protein
MSRYPRVSICVTSCSGPSTGANLIRRERKKDGFRDTLIWYTTQHIAVADR
DCEIWLVSTNHRDFGDKSQAVDHEACPYPLHPHLLEDLDTADLSGRVSYV
RTLGRLVQHLAGKYDSQPESQREALISQLNQDKFEIALAASVDRFRLNPA
AAALPLKAAYGVVHAFDRDVGSLEFVDVAMRGGGEWTAQFKQPIYATVDL
TDRAAETSNVDKTLNVAGRLTVGANGHVRSMIVTSIEALPDDPMLRAWRM
AWDPMSDTGFAQNIRQQIDPLSNPNTAKRIHESPMRPPEPDEPQKLASPS
LDELAANDVASPSGEDSDPDENNGTTRDCRGKGVSGLIGPGVSGPERSHD
DQDCRGCAAITKPRR
>MUP071c MUP071c, conserved hypothetical protein
MLRSPDWMELIEKAPQWDLRFAVPEVCLLEAVADVPREWRKRRSEVAKLA
VGEFGLTQSQNEWLDVIDRKIDGYEEALGARLADIEADIVPIPEGVNLRD
IVQRAIDRRKPYQEGEEKRRLSRHADLVYDTTHRGCGS
>MUP072c MUP072c, conserved hypothetical protein
MTVTPAGAGVATGSGTAHAPAMMLPPPGLGAPAAPVAASGAAGGAAAVTP
AGSSATPSGSAGPAGPTGGSPAGSGAAMVVPASVVSAGTTNRSRAESPEL
AAAKELVRRLRRDSDMVNYACIEWAVGVFRSEANGTTECVAMSNEGFGYI
PWGVFLPRTARLLAADKLADDQFRQRWFGAADPAEVMDEYARLRASRGAH
LVAMAVTADSPFGPPPGVEHAVCGRDLAGDGYVRPTLDDMGMHRLEATHP
DLFARIQRLTGTEDQARLVENQVVLPLAMQMIDPVQTTSVQTPPELRQMW
NELGTGDSIGSDAWQKYNIAATVFFVNVSANRPGPDGEVAVREQYRGQWV
AARAMELLQGWERRPIATADMVYAAATAYPGDFAAKLERLLRGPEDGG
>MUP073c MUP073c, conserved hypothetical protein
MADVAAPSIAANVIITPDMVLPPAEPLEAQAAQYKTLADQVSDLAQQLRA
ANAMREDAAQSPGWDVGHEKNRQLAADYGCMAGIYTAAAQYFTGVAGVYR
DAQRAQRSVVNRANQELSQAKNAVQQQAIVARWHAHARALTTSAVGAATA
RATVFQETAGTDITALTSRLGGTPMPRDPVLPHSGGSGIAVPPGKEKPLG
LGDEPEPQEADPTGISNSNDASDAGVTHGHRPAGPREHVAGTEGTATGSP
MPTPALGTPFPPGATQLPAGGGSGVRSVGFPPGLSSPGGLGSMLGTGGSS
GGLGGLPASTGQLPGTQAAGLGGLPVSAPAAGGQAAQAAQLGRRFLEACR
QGRGWDHYRRQPESEPRQPHRRARRRRLDWPQAALRLRGWPRPARHR
>MUP074c MUP074c, possible membrane protein
MSVVLHPPHVPEAAGPPPVSPPPQLLRGVRPGRGIVWGSAAALLAAAVAL
VIAAMGWVAPSGGAVSTVVVPWSPPSPSAAEVAAARTQACKLWGITASAM
DDASNLVAHTPGDWNAPDMQEALANEARVIAVEGAYLRRALPDHTPAAIR
SGIEDYLAASFDMENATTHRQGTSRNAAIDRANAAEDRVNAACR
>MUP075c MUP075c, hypothetical protein
MQADETLQVTPPAAESPNVLTGAATVEFAARVPEQSALATAQGAADAVHA
ASMWAPSVSASAGVSQQVPAAAAALAPHGGAVAQCNSQSLAEDLETDTQN
AQDLTPHPAVSI
>MUP076c MUP076c, possible membrane protein
MNPQPVGQHGPPGGPGGWPAPGNGAPNNAAVLAPPAAQVPPPPGPGRLPV
SRPRRRGWIAVGVMLTLAVLMALTAVVMSTIKLTAPAPTATTTTVMAPPP
PPTSFSPDQLAAAKKEACHASESAATTINTAQEKYLIAARDRQSPTYGPA
LANFQLVAILETQYMQQHLPPATPNNVVDSTNGYITAILALADAHTRGLS
EDEAQQFVVAARKAGRQLDEACV
>MUP077c MUP077c, conserved hypothetical
MTVYEYPRYQADKDLTASVVTRLERVSSMLSKVLEEIDGIDVEAIGGDGD
VVLSVNAHGQLTSLSLAQGCTTRYTHGGLAELINTTLGEAVNAAAAETSA
VGEGQDPAALDHAVQAFIDPDSQVWKPQSR
>MUP078c MUP078c, conserved hypothetical
MLHVDTAGLQSVAADLGSAASALAGLAAQPLVHPPLATDAVSMSAAARLS
GHGAVVASRALDGAAVLEAGAQAITQAALGYAAMDEANRAVVSLQGSPGA
PTPTLVSAVTADVVAPGVPIAAPAAQPAETTAAMIEAGRPAAGDGFVSGC
AALSKGFREGAVSARSAAIAVSEHLSGQAGSRISAALNRYADWAQEMAGY
AEAVGQHAGDHKSRFGEVKHATPTTSEFTHRHRELQNAIQLNSSFPSPGS
AAAVSQAHANLVQLTNRTHVVAAGYHTSEIPAAPSGPPPPVSPIVEPGGG
QGDTTTPVPTTQSEQEPNPTESGPDNEGHDERDVDADSDDLDELGVDGEL
AADPLGAGGAGSLPGMASAVPAMLTGALGAGVGMAGQIPQKLGQQLQGLA
QEATQGMTGLASGLTGAEGVDIDSEGLDGALGGFGAGSGGGGGLTEPAGA
GGVGDVAPAGSAPSGDGFADGIGVAAVDWCRWRLRCRGACGCRIRGNAAD
VHAADGRWYGRRWRGRDPQRQRARQIH
>MUP079c MUP079c, conserved hypothetical protein
MFEARLTDYDTVVFEAASHDESIVVAVGRGGNALGVELQAPAMALTDAEL
ANRIVKLNTLAHLRSQAALRHEWEAQHVNVSATLPTEDQVAGYEALIDF
>MUP080c MUP080c, conserved hypothetical protein
MRWAITQEHAADRCASARAENPHAIATAESWGPLFAEARRATVDAVNARE
ATLREQEEQHRAMARQLRIAAARMEEMDAENRAALTISTD
>MUP081c MUP081c, conserved hypothetical protein
MTGWADVVLSEVFGGVSEVLGSPFPQTPSPHDAVSSVPITSTDQLTPLDR
AKLTYMGIDPDSAGLDQINRALGRDQPLSVPPPAAPPPRPTTPAATPPNP
DSPELRGAAAEAAKRLDEALARNHSAINDADDQLADAVLKASSSSAEGHQ
RLQRLQQEIIDEIDKLGSSLDTPAGQQQLAEFLQGKTGDILNVLKNAGLD
SDSQARVLDGLSARYRGLHHDAGSDETPASDGQPQDATTSSGDAPPSTGD
GGEDGLPAADPLLDGLASDPLLSGLGIMAGPAMGALGSLPGALGSAIPSL
GGGGLGGAGLGDLGSALGSALHDGAAPSELDSDEHVEPLTETPQSNPEDD
QSPDPTQPAGLADHGRDHPQLGDSDDAAAPAQPLEASTQVELPDQSVRTA
GSSALATAGRAVLAGENIDDAYAAAQVQLPPPGAPISAPISPGRLQFGDV
GQYTDHRVMALDKDHVWLNGQVTPIDQLETGPNFLGWTRPSTTTSTLTSV
TTQPAPLTPATEPS
>MUP053c cyp150, probable cytochrome p450 150 cyp150
MRQRLNWIAAHGLLRGTARLAARLGDVQSRLVADPMVMANPAPFCDELRA
IGPVVSSYGTHLVVSHAIAHELLRSEDFEVVSLGSNLPAPMRWLERRTRD
DTPHLLLPPSLLAVEPPNHTRYRKAVSSVFTPKAVAGLRDHVEETASALL
DQLTDQASAVDIIARYCSQLPVAVICDILGVPSRDRNRVLKFGQLAGPCL
DFGLTWRQHQQVRQGLQGLHFWITEHLEELRSNPGDDLMSQMIHASENGS
SETHLHATEVRMIGLVLGASFATTMDLLGNGIQVLLDAPELRDALSQRPQ
LWPNAVEEILRLEPPVQLAGRMARKDTEVAGTAIKRGQLVAIYLGAVNRD
PSVFADPHRFDITRANANRHLAFSGGRHFCLGAALARVEGEVGLRMLFER
FPDVRAAGPGNRRDTRTLRGWSQLPVQLGAARSMAIR
>MUP040c mlsA1, Type I modular polyketide synthase
MIFGDAHQNCRGGRVLGDAVAVVGMSCRVPGASDPDALWALLRDGISVVD
EIPSARWNLDGLVAHRLTDEQRSALRHGAFLDDVEGFDAAFFGINPSEAG
SMDPQQRLMLELTWAALEDARIVPEHLSGSSSGVFTGAMSDDYTTAVTYR
AAMTAHTFAGTHRSLIANRVSYTLGLRGPSLVIDTGQSSSLVAVHVAMES
LRREETSLAIAGGIHLNLSLAAALSAAHFGALSPDGRCYTFDARANGYVR
GEGGGVVVLKRLNDALADGNHIYCVIRGSSVNNDGATQDLTAPGVDGQRQ
ALLQAYERAEIDPSEVQYVELHGTGTRLGDPTEAHSLHSVFGTSTVPRSP
LLVGSIKTNIGHLEGAAGILGLIKTALAVHHRQLPPSLNYTVPNPKIPLE
QLGLRVQTTLSEWPDLDKPLTAGVSSFSMGGTNAHLILQQPPTPDTTQTP
NPTTGSDPAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDP
IDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALANNGTH
PLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAA
ALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHA
GIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTML
ALQASEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQ
DRRTTRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIA
RHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVLTQAIT
DTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQAR
PLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFT
GRISPRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHT
PLLLAGHDTADLQITVTDTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWV
LHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGP
TFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALHPLLAL
TQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADA
ITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDT
TTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPV
PVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIV
TRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTL
TTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGTVLITG
GTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVT
ITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQ
VLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALD
ALADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATS
HGLALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRR
AASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAF
KDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGI
GALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVG
NFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREA
RAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSD
DAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSL
RNNESQLALAGGVTVMSTPAVFTDFSRQRGLAPDGRCKAFAATADGTGWG
EGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRV
INQALANAGLTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHTPDQPL
WLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSS
GTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPN
PTTGSDPAVGSDSAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAH
PDLDPIDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALA
NNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHAL
DEVAAALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHR
LFTHAGIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTP
GGTMLALQASEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGE
HFITQDRRTTRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNL
TGQIARHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVL
TQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVL
YCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENR
GWVFTGRISPRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNE
LIVHTPLLLAGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDE
QPEWVLHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQG
YNYGPTFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALH
PLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTR
TGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWP
PHPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADV
VVWPVPVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGT
RLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNT
NSDTLTTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGT
VLITGGTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDL
GAHVTITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTG
DQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAA
NTALDALADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLM
PIATSHGLALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALIT
TPRRRAASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESIS
PATAFKDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLE
QIPGIGALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAG
RDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGI
SPREARAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAGAQSYG
ATNSDDAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHL
ACQSLRNNESQLALAGGVTVMSTPAIFTEFSRQRGLAPDGRCKAFAATAD
GTGWGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGP
SQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHT
PDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPH
IDWSSGTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDT
TQTPNPTTGSDPAVGSDSAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQ
HLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAA
LHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPV
FAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERVDVVQPVLFSVMVSL
AALWRWAGITPDAVIGHSQGEIAAAHVAGALTLPEAAAVVALRSRVLTDL
AGAGAMASVLSPEEPLTQLLARWDGKITVAAVNGPASAVVSGDTTAITEL
LITCEHENIDARAIPVDYPSHSPYMEHIRHQFLDELPELTPRPSTIAMYS
TVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTVAALLGAGEQVFLELS
PHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISP
SWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATE
LAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTGFVELALHVADRAGY
SSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNT
TTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDD
LAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALF
DAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLR
VRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLL
HLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTST
TEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHP
DTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLD
TDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTRTAVLTPPDSGPWR
LDTTGKGDLANLALLPTAHTALASGQIRIDVRAAGLNFHDVVVALGLIPD
DGFGGEAAGVISEIGPDVYGFAVGDAVTGMTVSGAFAPSTVADHRMVMTI
PARWSFPQAASIPVVFLTAYIALAEISGLSRGQRVLIHAGTGGVGMAAIQ
LAHHLGAEVFATASAAKWSTLEALGVPRDHIASSRTLDFSNAFLDATNGA
GVDVVLNCLSGEFVEASLALLPRGGHFVEIGKTDIRDTEVIAATHPGVIY
RALDLLSVSPDHIQRTLAQLSPLFATDTLKPLPTTNYSIYQAISALRDMS
QARHTGKIVLTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLL
TSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHR
LTAVVHTAVVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAF
IMFSSMAGMIGSPGLGNYAAANTALDALADYRHRLGLPATSLAWGYWQTR
TGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINT
HTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTL
ATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLN
LSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGM
ACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKT
YTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIP
AHTLAGTSTGVFVGAGAQSYGATNSDGAEGYAMTGGAISVMSGRIAYTLG
LEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAIFTE
FSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAI
VAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTG
TTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKM
IQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVS
SFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDSAVGSDPAVGVL
VWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRAT
ITTSIEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFV
FPGQGSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPE
APSLERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAG
ALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITV
AAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIR
HQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTV
RFHDTVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKD
RPDAVAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPT
AGDFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESA
VLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDT
DDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPL
TPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIY
AEVELPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQ
VRLPYAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSL
ITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYRVIAEP
TQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVS
SRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHA
AVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDT
IHIPRLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGV
RHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSV
PTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEH
NLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWG
YWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIP
APINTHTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQ
QQQTLATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTH
NTGLDLPPTLIFDHPTPTALTQHLHTRLTTGALVPAPVVIAAGRTEEPVA
VVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDA
VGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALET
AGIPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGAISVMSGRIA
YTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPA
VFTDFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHP
VLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEA
HGTGTTLGDPIEAGALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAG
VVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRT
AAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVW
PLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATIT
TSIEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFP
GQGSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAP
SLERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGAL
TLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAA
VNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQ
FLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRF
HDTVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRP
DAVAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAG
DFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVL
FPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDD
MGRQSLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTP
VPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAE
VELPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVR
LPYAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLIT
RPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQ
QLPRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSR
IHTLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAV
WGLIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIH
IPRLTRTAVLTPPDSGPWRLDTTGKGDLANLALLPTAHTALASGQIRIDV
RAAGLNFHDVVVALGLIPDDGFGGEAAGVISEIGPDVYGFAVGDAVTGMT
VSGAFAPSTVADHRMVMTIPARWSFPQAASIPVVFLTAYIALAEISGLSR
GQRVLIHAGTGGVGMAAIQLAHHLGAEVFATASAAKWSTLEALGVPRDHI
ASSRTLDFSNAFLDATNGAGVDVVLNCLSGEFVEASLALLPRGGHFVEIG
KTDIRDTEVIAATHPGVIYRALDLLSVSPDHIQRTLAQLSPLFATDTLKP
LPTTNYSIYQAISALRDMSQARHTGKIVLTAPVVVDPEGTVLITGGTGTL
GALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACD
ISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPK
IDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADY
RHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHGLAL
FDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRRAASAA
TDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKDLGI
DSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVP
APVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPAD
RGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDP
QQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAGAQSYGATNSDDAEGY
AMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNES
QLALAGGVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGFGEGAAV
LVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQAL
ANAGLTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHTPDQPLWLGSI
KSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRL
LTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPNPTQTPEDCSPA
QSPCATITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPVVDAID
LGWSLIATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVGRARAA
GETVMVFPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRLPLRQV
MWGDDEGLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSVGELAA
AQVAGVLSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPLLVEGV
DIAALNAPGSVVISGDQAAVRLIANRLADRGYRAHELAVSHAFHSSLMEP
MLEEFARLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIRRPVRF
ADSVASLEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALSTDKPE
SVAVLRAAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFWLDANR
IGQGDPASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTVLPALS
SWRRAERTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSELAKTDP
VIGCAAALEAHGALVTIITIFEPDFNRSLMGASLKDIGSHISGVISFLGI
HGSEFSDSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLIRCSSA
ALVWGLGRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTDEDQFA
IRPSGVFLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHVARWLA
HKYESVDLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDRTSVEA
AIAGKSLDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLSDITSN
LTLSAFVMFSSVAGIWGGKSQGAYAAANAFLDSLAEKRRTLGLPATSVAW
GLWAGGGMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATLTVASV
NWDRFYPTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSKLAGLT
ATEQRAVTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTALELRDH
LQTATGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTE
EPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDP
DPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWE
ALETAGIPAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTSVMS
GRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVM
STPAIFTEFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARR
NNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVD
AVEAHGTGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAA
GAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTD
HPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVG
VLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHR
ATITTSIEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTV
FVFPGQGSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDE
PEAPSLERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHV
AGALTLPEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKI
TVAAVNGPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEH
IRHQFLDELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRN
TVRFHDTVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALR
KDRPDAVAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLL
PTAGDFSGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVE
SAVLFPGTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVT
DTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHL
PLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDV
IYAEVELPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTG
DQVRLPYAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIID
SLITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIA
EPTQQLPRYLHDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIH
TLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWG
LIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIP
RLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLL
LTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQH
RLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSA
FIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQT
HTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPIN
THTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQT
LATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGL
DLPPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVG
MACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGK
TYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGI
PAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTSVMSGRIAYTL
GLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFT
EFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLA
IVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGT
GTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVK
MIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAV
SSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLS
ARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSI
EHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQG
SQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLE
RVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTLP
EAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVNG
PASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFLD
ELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDT
VAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAV
AFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFS
GANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPG
TGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGR
QSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPW
PPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVEL
PEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPY
AFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPL
TTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLP
RYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHT
LTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGL
IRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPR
LTRTAVLTPPDSGPWRLDTTGKGDLANLALLPTAHTALASGQIRIDVRAA
GLNFHDVVVALGLIPDDGFGGEAAGVISEIGPDVYGFAVGDAVTGMTVSG
AFAPSTVADHRMVMTIPARWSFPQAASIPVVFLTAYIALAEISGLSRGQR
VLIHAGTGGVGMAAIQLAHHLGAEVFATASAAKWSTLEALGVPRDHIASS
RTLDFSNAFLDATNGAGVDVVLNCLSGEFVEASLALLPRGGHFVEIGKTD
IRDTEVIAATHPGVIYRALDLLSVSPDHIQRTLAQLSPLFATDTLKPLPT
TNYSIYQAISALRDMSQARHTGKIVLTAPVVVDPEGTVLITGGTGTLGAL
FAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACDISD
PEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDA
AWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRHR
LGLPATSLAWGYWQTRTGVTAHLTDVDLARMTRLGLMPIATSHGLALFDA
ALATGQPVSIPAPINTHTLARHARDNTLTPILSALITTPRRRAASAATDL
AARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKDLGIDSL
TALELRNTLTHNTGLDLPPTLIFDHPTPTALTQHLHTRLTTGALVPAPVV
IAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWD
VAGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRL
LLEVCWEALETAGIPAHTLAGTSTGVFVGAGAQSYGATNSDDAEGYAMTG
GAISVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLAL
AGGVTVMSTPAVFTDFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLE
RLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAG
LTHDQVDAVEAHGTGTTLGDPIEAGALHATYGHHHTPDQPLWLGSIKSNI
GHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEP
IQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPDTTQTPNPTTGSDPAV
GSDSAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDPIDVA
HSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALANNGTHPLLS
RGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAAALNP
HLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHAGIHP
DYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTMLALQA
SEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQDRRT
TRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIARHDQ
LASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVLTQAITDTVE
QAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQARPLTL
PTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFTGRIS
PRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHTPLLL
AGHDTADLQITVTDTDDMGRQSLNIHSRPHIGHDNTTTGDEQPEWVLHAS
AVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGPTFQG
VQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALHPLLALTQPP
TNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADAITVH
TSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDTTTDT
DTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPVPV
PSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIVTR
HGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTLTT
ALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGTVLITGGT
GTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTIT
ACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVL
APKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDAL
ADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATSHG
LALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRRAA
SAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKD
LGIDSLTALELRNTLTHNTGLDLPPTLIFDHPTPTALTQHLHTRLTQIES
PNSEDSMLNLKNLDRIESYIFRNSGEDRAHVIANRLRSILSKWDGTRSPE
LPAELHLESATDDELFSLANMFRTPTSEISPTLEGGRGVN
>MUP039c mlsA2, Type I modular polyketide synthase
MVSTEENLRVYLKQVITDLHQMQARLRKIEKQRSERVAVVGMACRFPGGV
ASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTYTRYGAFL
DDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIPAHTLAGTS
TGVFVGAWAQSYGATNSDGAEGYAMTGGSTSVMSGRIAYTLGLEGPAITV
DTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEFSRQRGLA
PDGRCKAFAATADGTGWGEGAAVLVLERLSEARRNNHPVLAIVAGSAINQ
DGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTTLGDPIE
ASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQAITHAT
LPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVSSFGISGTN
AHLILQQPPTPDTTQTPNTTTGSDPAVGSDSAVGSDPAVGVLVWPLSARS
APGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHH
SENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQY
PGMGADLYRQFPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQLLD
QTFYAQPALFALGTALHRLFTHAGIHPDYLLGHSIGELTAAYAAGVLSLQ
DAATLVTSRGRLMQSCTPGGTMLALQASEAEVQPLLEGLDHAVSIAAING
ATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPHMDPILEQFRQ
IAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDTVAA
LLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFA
AALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGAN
THAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTGF
VELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQSL
NIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPPP
GTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPED
TDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFT
GISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTA
TGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYL
HDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLTR
QTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRS
AQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTR
TAVLTPPDSGPWRLDTTGKGDLANLALLPTAHTALASGQIRIDVRAAGLN
FHDVVVALGLIPDDGFGGEAAGVISEIGPDVYGFAVGDAVTGMTVSGAFA
PSTVADHRMVMTIPARWSFPQAASIPVVFLTAYIALAEISGLSRGQRVLI
HAGTGGVGMAAIQLAHHLGAEVFATASAAKWSTLEALGVPRDHIASSRTL
DFSNAFLDATNGAGVDVVLNCLSGEFVEASLALLPRGGHFVEIGKTDIRD
TEVIAATHPGVIYRALDLLSVSPDHIQRTLAQLSPLFATDTLKPLPTTNY
SIYQAISALRDMSQARHTGKIVLTAPVVVDPEGTVLITGGTGTLGALFAE
HLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEA
LAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQ
LHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGL
PATSLAWGYWQTRTGVTAHLTDVDLARMTRLGLMPIATSHGLALFDAALA
TGQPVSIPAPINTHTLARHARDNTLTPILSALITTPRRRAASAATDLAAR
LNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAFKDLGIDSLTAL
ELRNTLTHNTGLDLPPTLIFDHPTPHALTQHLHTRLTQSHTPVGPIASLL
SHAIDEGKFRAGADLLMAASNLNQSFSNMAELNQLPAVTDIADASPDGLL
TLICISTSENEYARLAAANIHSLTFAEIAAPGFYDAQLPNSIETSAEALA
TAITGAYANTSIVLVAHSIVCELAQATMTRLQDADIDLVGLVLLDPLEGT
NSTEDYVETVLTRIEHINAPRVGVDGYLAALGRYLQFHEDRRIPIPETRH
MTLHSDTKIDRAQTPMNLLQDEAALTALKIGNWMNDTGSIAVTLRDGPVF
LGRARSVNMR
>MUP032c mlsB, Type I modular polyketide synthase
MIFGDAHQNCRGGRVLGDAVAVVGMSCRVPGASDPDALWALLRDGISVVD
EIPSARWNLDGLVAHRLTDEQRSALRHGAFLDDVEGFDAAFFGINPSEAG
SMDPQQRLMLELTWAALEDARIVPEHLSGSSSGVFTGAMSDDYTTAVTYR
AAMTAHTFAGTHRSLIANRVSYTLGLRGPSLVIDTGQSSSLVAVHVAMES
LRREETSLAIAGGIHLNLSLAAALSAAHFGALSPDGRCYTFDARANGYVR
GEGGGVVVLKRLNDALADGNHIYCVIRGSSVNNDGATQDLTAPGVDGQRQ
ALLQAYERAEIDPSEVQYVELHGTGTRLGDPTEAHSLHSVFGTSTVPRSP
LLVGSIKTNIGHLEGAAGILGLIKTALAVHHRQLPPSLNYTVPNPKIPLE
QLGLRVQTTLSEWPDLDKPLTAGVSSFSMGGTNAHLILQQPPTPDTTQTP
NPTTGSDPAVGSDPAVGVLVWPLSARSAPGLSAQAARLYQHLSAHPDLDP
IDVAHSLATTRSHHPHRATITTSIEHHSENNHDTTDALAALHALANNGTH
PLLSRGLLTPQGPGKTVFVFPGQGSQYPGMGADLYRQFPVFAHALDEVAA
ALNPHLDVALLEVMFSQQDTAMAQLLDQTFYAQPALFALGTALHRLFTHA
GIHPDYLLGHSIGELTAAYAAGVLSLQDAATLVTSRGRLMQSCTPGGTML
ALQASEAEVQPLLEGLDHAVSIAAINGATSIVLSGDHDSLEQIGEHFITQ
DRRTTRLQVSHAFHSPHMDPILEQFRQIAAQLTFSAPTLPILSNLTGQIA
RHDQLASPDYWTQQLRNTVRFHDTVAALLGAGEQVFLELSPHPVLTQAIT
DTVEQAGGGGAAVPALRKDRPDAVAFAAALGQLHCHGISPSWNVLYCQAR
PLTLPTYAFQHQRYWLLPTAGDFSGANTHAMHPLLDTATELAENRGWVFT
GRISPRTQPWLNEHAVESAVLFPNTGFVELALHVADRAGYSSVNELIVHT
PLLLAGHDTADLQITVTDTDDMGRQSLNIHSHPHIGHDNTTTGDEQPEWV
LHASAVLTAQTTDHNHLPLTPVPWPPPGTAAIEVDDFYDDLAAQGYNYGP
TFQGVQRIWRDHATPDVIYAEVELPEDTDIDGYGIHPALFDAALHPLLAL
TQPPTNDTDDTNTADTGDQVRLPYAFTGISLHATHATRLRVRLTRTGADA
ITVHTSDTTGAPVAIIDSLITRPLTTATGSAPATTAAGLLHLSWPPHPDT
TTDTDTDTDALRYQVIAEPTQQLPRYLHDLHTSTDLHTSTTEADVVVWPV
PVPSNEELQAHQASDTAVSSRIHTLTRQTLTVVQDWLTHPDTTGTRLVIV
TRHGVSTSAHDPVPDLAHAAVWGLIRSAQNEHPGRFTLLDTDDNTNSDTL
TTALTLPTRENQLAIRRDTIHIPRLTRHSSDGALTAPVVVDPEGTVLITG
GTGTLGALFAEHLVSAHGVRHLLLTSRRGPQAHGATDLQQRLTDLGAHVT
ITACDISDPEALAALVNSVPTQHRLTAVVHTAAVLADTPVTELTGDQLDQ
VLAPKIDAAWQLHQLTYEHNLSAFIMFSSMAGMIGSPGQGNYAAANTALD
ALADYRHRLGLPATSLAWGYWQTHTGLTAHLTDVDLARMTRLGLMPIATS
HGLALFDAALATGQPVSIPAPINTHTLARHARDNTLAPILSALITTPRRR
AASAATDLAARLNGLSPQQQQQTLATLVAAATATVLGHHTPESISPATAF
KDLGIDSLTALELRNTLTHNTGLNLSSTLIFDHPTPHAVAEHLLEQIPGI
GALVPAPVVIAAGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVG
NFPADRGWDVEGLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREA
RAMDPQQRLLLEVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSD
DAEGYAMTGGATSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSL
RNNESQLALAGGVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGWG
EGAAVLVLERLSEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRV
INQALANAGLTHDQVDAVEAHGTGTTLGDPIEASALHATYGHHHTPDQPL
WLGSIKSNIGHTQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSS
GTVRLLTEPIQWPNTDHPRTAAVSSFGISGTNAHLILQQPPTPNPTQTPE
DCSPAQSPCATITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPV
VDAIDLGWSLIATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVG
RARAAGETVMVFPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRL
PLRQVMWGDDEGLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSV
GELAAAQVAGVLSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPL
LVEGVDIAALNAPGSVVISGDQAAVRLIANRLADRGYRAHELAVSHAFHS
SLMEPMLEEFARLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIR
RPVRFADSVASLEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALS
TDKPESVAVLRAAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFW
LDANRIGQGDPASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTV
LPALSSWRRAERTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSEL
AKTDPVIGCAAALEAHGALVTIITIFEPDFNRSLMGASLKDIGSHISGVI
SFLGIHGSEFSDSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLI
RCSSAALVWGLGRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTD
EDQFAIRPSGVFLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHV
ARWLAHKYESVDLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDR
TSVEAAIAGKSLDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLS
DITSNLTLSAFVMFSSVAGIWGGKSQGAYAAANAFLDSLAEKRRTLGLPA
TSVAWGLWAGGGMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATL
TVASVNWDRFYPTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSK
LAGLTATEQRAVTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTAL
ELRDHLQTATGLNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIA
AGRTEEPVAVVGMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVE
GLFDPDPDAVGKTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLL
EVCWEALETAGIPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGA
TSVMSGRIAYTLGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAG
GVTVMSTPAVFTEFSRQRGLAPDGRCKAFAATADGTGWGEGAAVLVLERL
SEARRNNHPVLAIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLT
HDQVDAVEAHGTGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGH
TQAAAGAAGVVKMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQ
WPNTDHPRTAAVSSFGISGTNAHLILQQPPTPNPTQTPEDCSPAQSPCAT
ITDAGTGLSFVPWVISAKSAEALSAQASRLLTRLDDDPVVDAIDLGWSLI
ATRSMFEHRAVVVGADRHQLQRGLAELASGNLGADVVVGRARAAGETVMV
FPGQGSQRLGMGAQLYEQFPVFAAAFDDVVDALDQYLRLPLRQVMWGDDE
GLLNSTEFAQPSLFAVEVALFALLRFWGVVPDYVIGHSVGELAAAQVAGV
LSLQDAAKLVSARGRLMQALPAGGAMVAVAASQHEVEPLLVEGVDIAALN
APGSVVISGDQAAVRLIANRLADRGYRAHELAVSHAFHSSLMEPMLEEFA
RLASEIVVEQPQIPLISNVTGQLANADYGSAGYWVDHIRRPVRFADSVAS
LEAMGASCFIEVGPASGLGAAIEQSLKSAEPTVSVSALSTDKPESVAVLR
AAARLSTSGIPVDWQSVFDGRSTQTVNLPTYAFQRQRFWLDANRIGQGDP
ASQPQAQNVESRFWEAVEREDVDGLADSIGVTASAMQTVLPALSSWRRAE
RTQSELDSWRYQVTWLSSPATPSSITLSGIWLLIVPSELAKTDPVIGCAA
ALEAHGALVTIITIFEPDFNRSLMGASLKDIGSHISGVISFLGIHGSEFS
DSGAVKTLNLVQAMGDVHLDVPLWCLTQGAVSISADDLIRCSSAALVWGL
GRVVALEHPGSWGGLVDLPESPDDAAWERLCALLAQPTDEDQFAIRPSGV
FLRRLIHAPATTTSKSSTAWAPRGTVLITGGTGALGAHVARWLAHKYESV
DLLLTSRRGMAADGATELVDDLRTAGASVTVHACDVTDRTSVEAAIAGKS
LDAVFHLAGRHQPTLLTELEDESFSDELAPKVHGAQVLSDITSNLTLSAF
VMFSSVAGIWGGKSQGAYAAANAFLDSLAEKRRTLGLPATSVAWGLWAGG
GMGDRPSASGLNLIGLKSMSADLAVQALSDAIDRPQATLTVASVNWDRFY
PTFALARPRPFLHEITEVMAYRESMRSSSASTATLLTSKLAGLTATEQRA
VTRKLVLDQAASVLGYASTESLDTHESFKDLGFDSLTALELRDHLQTATG
LNLSSTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVV
GMACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVG
KTYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAG
IPAHTLAGTSTGVFVGAWAQSYGATNSDDAEGYAMTGGAISVMSGRIAYT
LGLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALTGGVTVMSTPAIF
TEFSRQRGLAPDGRCKAFAATADGTGWGEGAAVLVLERLSEARRNNHPVL
AIVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHG
TGTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVV
KMIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAA
VSSFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPL
SARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTS
IEHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQ
GSQYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSL
ERVDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTL
PEAAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVN
GPASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFL
DELPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHD
TVAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDA
VAFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDF
SGANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFP
GTGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMG
RQSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVP
WPPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVE
LPEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLP
YAFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRP
LTTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQL
PRYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIH
TLTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWG
LIRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIP
RLTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLL
LTSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQH
RLTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSA
FIMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQT
HTGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPIN
THTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQT
LATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGL
DLPPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVG
MACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGK
TYTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGI
PAHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGSTSVMSGRIAYTL
GLEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFT
EFSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLA
IVAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGT
GTTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVK
MIQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAV
SSFGISGTNAHLILQQPPTPDTTQTPNPTTGSDPAVGSDPAVGVLVWPLS
ARSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSI
EHHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQG
SQYPGMGADLYRQFPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQ
LLDQTFYAQPALFALGTALHRLFTHAGIHPDYLLGHSIGELTAAYAAGVL
SLQDAATLVTSRGRLMQSCTPGGTMLALQASEAEVQPLLEGLDHAVSIAA
INGATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPHMDPILEQ
FRQIAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDT
VAALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAV
AFAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFS
GANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPG
TGFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGR
QSLNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPW
PPPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVEL
PEDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPY
AFTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPL
TTATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLP
RYLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHT
LTRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGL
IRSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPR
LTRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLL
TSRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHR
LTAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAF
IMFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTH
TGLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINT
HTLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTL
ATLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLD
LPPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGM
ACRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKT
YTRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIP
AHTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLG
LEGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTE
FSRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAI
VAGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTG
TTLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKM
IQAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVS
SFGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLSA
RSAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIE
HHSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGS
QYPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLER
VDVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTLPE
AAAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVNGP
ASAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFLDE
LPELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTV
AALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVA
FAAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSG
ANTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGT
GFVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQ
SLNIHSHPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWP
PPGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELP
EDTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYA
FTGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLT
TATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYRVIAEPTQQLPR
YLHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTL
TRQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLI
RSAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRL
TRHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLT
SRRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRL
TAVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFI
MFSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTHT
GLTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTH
TLARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLA
TLVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDL
PPTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMA
CRFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTY
TRYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIPA
HTLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLGL
EGPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEF
SRQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIV
AGSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGT
TLGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMI
QAITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVSS
FGISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLSAR
SAPGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEH
HSENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQ
YPGMGADLYRQFPVFAHALDACDAALQPFTGWSVLAVLHDEPEAPSLERV
DVVQPVLFSVMVSLAALWRWAGITPDAVIGHSQGEIAAAHVAGALTLPEA
AAVVALRSRVLTDLAGAGAMASVLSPEEPLTQLLARWDGKITVAAVNGPA
SAVVSGDTTAITELLITCEHENIDARAIPVDYPSHSPYMEHIRHQFLDEL
PELTPRPSTIAMYSTVDGEPHDTAYDTTTMTADYWYRNIRNTVRFHDTVA
ALLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAF
AAALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGA
NTHAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTG
FVELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQS
LNIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPP
PGTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPE
DTDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAF
TGISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTT
ATGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRY
LHDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLT
RQTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIR
SAQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLT
RHSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTS
RRGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLT
AVVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFIM
FSSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTHTG
LTAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTHT
LARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLAT
LVAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLP
PTLIFDHPTPHAVAEHLLEQIPGIGALVPAPVVIAAGRTEEPVAVVGMAC
RFPGGVASADQLWDLVIAGRDVVGNFPADRGWDVEGLFDPDPDAVGKTYT
RYGAFLDDAAGFDAGFFGISPREARAMDPQQRLLLEVCWEALETAGIPAH
TLAGTSTGVFAGAWAQSYGATNSDDAEGYAMTGGATSVMSGRIAYTLGLE
GPAITVDTACSSSLVAIHLACQSLRNNESQLALAGGVTVMSTPAVFTEFS
RQRGLAPDGRCKAFAATADGTGFGEGAAVLVLERLSEARRNNHPVLAIVA
GSAINQDGASNGLTAPHGPSQQRVINQALANAGLTHDQVDAVEAHGTGTT
LGDPIEASALHATYGHHHTPDQPLWLGSIKSNIGHTQAAAGAAGVVKMIQ
AITHATLPATLHVDQPSPHIDWSSGTVRLLTEPIQWPNTDHPRTAAVSSF
GISGTNAHLILQQPPTPDTTQTPNTTTGSDPAVGSDPAVGVLVWPLSARS
APGLSAQAARLYQHLSAHPDLDPIDVAHSLATTRSHHPHRATITTSIEHH
SENNHDTTDALAALHALANNGTHPLLSRGLLTPQGPGKTVFVFPGQGSQY
PGMGADLYRQFPVFAHALDEVAAALNPHLDVALLEVMFSQQDTAMAQLLD
QTFYAQPALFALGTALHRLFTHAGIHPDYLLGHSIGELTAAYAAGVLSLQ
DAATLVTSRGRLMQSCTPGGTMLALQASEAEVQPLLEGLDHAVSIAAING
ATSIVLSGDHDSLEQIGEHFITQDRRTTRLQVSHAFHSPHMDPILEQFRQ
IAAQLTFSAPTLPILSNLTGQIARHDQLASPDYWTQQLRNTVRFHDTVAA
LLGAGEQVFLELSPHPVLTQAITDTVEQAGGGGAAVPALRKDRPDAVAFA
AALGQLHCHGISPSWNVLYCQARPLTLPTYAFQHQRYWLLPTAGDFSGAN
THAMHPLLDTATELAENRGWVFTGRISPRTQPWLNEHAVESAVLFPGTGF
VELALHVADRAGYSSVNELIVHTPLLLAGHDTADLQITVTDTDDMGRQSL
NIHSRPHIGHDNTTTGDEQPEWVLHASAVLTAQTTDHNHLPLTPVPWPPP
GTAAIEVDDFYDDLAAQGYNYGPTFQGVQRIWRDHATPDVIYAEVELPED
TDIDGYGIHPALFDAALHPLLALTQPPTNDTDDTNTADTGDQVRLPYAFT
GISLHATHATRLRVRLTRTGADAITVHTSDTTGAPVAIIDSLITRPLTTA
TGSAPATTAAGLLHLSWPPHPDTTTDTDTDTDALRYQVIAEPTQQLPRYL
HDLHTSTDLHTSTTEADVVVWPVPVPSNEELQAHQASDTAVSSRIHTLTR
QTLTVVQDWLTHPDTTGTRLVIVTRHGVSTSAHDPVPDLAHAAVWGLIRS
AQNEHPGRFTLLDTDDNTNSDTLTTALTLPTRENQLAIRRDTIHIPRLTR
HSSDGALTAPVVVDPEGTVLITGGTGTLGALFAEHLVSAHGVRHLLLTSR
RGPQAHGATDLQQRLTDLGAHVTITACDISDPEALAALVNSVPTQHRLTA
VVHTAAVLADTPVTELTGDQLDQVLAPKIDAAWQLHQLTYEHNLSAFIMF
SSMAGMIGSPGQGNYAAANTALDALADYRHRLGLPATSLAWGYWQTHTGL
TAHLTDVDLARMTRLGLMPIATSHGLALFDAALATGQPVSIPAPINTHTL
ARHARDNTLAPILSALITTPRRRAASAATDLAARLNGLSPQQQQQTLATL
VAAATATVLGHHTPESISPATAFKDLGIDSLTALELRNTLTHNTGLDLPP
TLIFDHPTPHALTQHLHTRLTQSHTPVGPIASLLSHAIDEGKFRAGADLL
MAASNLNQSFSNMAELNQLPAVTDIADASPDGLLTLICISTSENEYARLA
AANIHSLTFAEIAAPGFYDAQLPNSIETSAEALATAITGAYANTSIVLVA
HSIVCELAQATMTRLQDADIDLVGLVLLDPLEGTNSTEDYVETVLTRIEH
INAPRVGVDGYLAALGRYLQFHEDRRIPIPETRHMTLHSDTKIDRAQTPM
NLLQDEAALTALKIGNWMNDVGVALSVNLE
>MUP005c parA, possible chromosome partitioning protein ParA
MSRSVAVWDDAGLVEPAVNWSKLGNVYLFANGKGGVGKSTCSTHSAALVA
SDGARALLVDLNGQGNVANMLGFANTEADDKGRNLYSAITAGAALTPIPD
VRPGLDVVPGGPFVRRIAPVMAAEMGNPQTAKQVLMSLALALAQISDHYG
VIFIDSPPENQLLLQAALCASRFVVVPMKTDDLSRTGLRELAGDLRAMRE
HNPSVVLLGCFVFASGTSSTRIREEMKKNVAADLGQNDDVMLDSFIRHSE
AVGRDVPKFGRLAHELEQEIVNNPSRSAIRQGRADATTVVSTTSVSVAED
FAKLTGEILTRGSKIRQDLINEGVWP
>MUP011 pknQ, probable transmembrane serine/threonine-protein kinase PknQ
MALPLGTVVAGYVIEGVLGSGGMGTVYLARHPTLPRSDALKILSAELSQD
EQFRVRFIREADLAATLSHPNIVTVFNRGETDDGQLWIAMQYVEGTTASD
LHATVLTPARVAAIITDVGAALDYAHSRRVLHRDIKPSNFLVSADHERVL
LADFGIARAFDDTTLTAIGSLVGTASYAAPEAIQGGSVDQRADVYSLGCA
LFRLLTGRAPYEDLRGPAMLMAHVLQPIPRPSHIVVGLPPAIDDVIAVAM
AKDPAARFPTAGALAGAARAALSGQPLPQAPPGGPKTRIWAAPPLSYPTT
RPPGISGGAISPAGFAGAAHPGLAGAASSSDERGGPRAPRRRKRGIIAAA
MGLVTIVAAAVLAGVLLTEHRGATLAPYQPQSMTGTLGTVELHHRPVAVA
ALGPGDADAVLSLGVQPVAIGGTHGQTPSWLAPMVKSSPAMLPTADPATL
AETRPDLIIDTGSLDKATYNQLAAIAPTLTRPADTTQEWNWQNQLTWIAT
SLGRTTTATTLLNNAVAEQTQIKSDNPAFSGKTITVVNLSDTTTTVATKV
SPPTAYLEGLGFAYNVYFKRGPNDPPEVEVDEDSFDWGRAKMTDVMIVIR
TDRAAGGGGFGGLPSKFALFNEPLVIVDDLATITALNSGGPAATTYLDTT
LVNKLAHQIH
>MUP001 rep, probable replication protein Rep
MPAPSVFVGLELDTNSYTGVPCWSAGPAHWAHNTVAVAYDLRYQQIRSLM
CDGGIARKTLIVIAAAMARHADWSTGRNCRPTNNQLEAATGFHQRTIQRA
HECLRLLGVATEVLRGRQRTYIERMASWRMGDRHRGWASVWALHDNGEVA
RVVHSLSPHLERSSVTTHTSPKTSLFTTHPGATRARESGATRRNSPDARG
RRLAAQWRADPHAPPWTRRYSPTSWSAMLAAPAAAGWTARDLTALVQDWL
GTGHRIPSTPARPIALLGTLLAWHTSHNSIQHRPAALEEAREAADLAAVR
KRIQKQTAEHHANLAAREAGRAALGGPGHQAARIAAAAAARNAARRRTKM
VAAEVASVDAAIRRARGR