TitleGenColors Logo

Gene list

Applied filters:

COG category: Secondary metabolites biosynthesis, transport and catabolism
Organism: Mycobacterium tuberculosis CDC1551, CDC1551
Gene type: CDS

Number of genes found: 266

Free access
Sort by:

 



# Mycobacterium tuberculosis CDC1551, CDC1551

>MT2016 conserved hypothetical protein
MVIVADKAAGRVADPVLRPVGALGDFFAMTLDTSVCMFKPPFAWREYLLQ
CWFVARVSTLPGVLMTIPWAVISGFLFNVLLTDIGAADFSGTGCAIFTVN
QSAPIVTVLVVAGAGATAMCADLGARTIREELDALRVMGINPIQALAAPR
VLAATTVSLALNSVVTATGLIGAFFCSVFLMHVSAGAWVTGLTTLTHTVD
VVISMIKATLFGLMAGLIACYKGMSVGGGPAGVGRAVNETVVFAFIVLFV
INIVVTAVGIPFMVS
>MT2020 virulence factor mce family protein
MRRRAQGQRQGRPAGVHQAGRSGQRAVRAEMKSFAERNRLAIGTVGIVVV
AAVALAALQYQRLPFFNQGTRVSAYFADAGGLRTGNTVEVSGYPVGKVSS
ISLDGPGVLVEFKVDTDVRLGNRTEVAIKTKGLLGSKFLDVTPRGDGRLD
SPIPIERTTSPYQLPDALGDLAATISGLHTERLSESLATLAQTFADTPAH
FRNAIHGVARLAQTLDERDNQLRSLLANAAKATGVLANRTDQIVGLVRDT
NVVLAQLRTQSAALDRIWANISAVAEQLRGFIAENRQQLRPALDKLNGVL
AIVENRKERVRQAIPLINTYVMSLGESLSSGPFFKAYVVNLLPGQFVQPF
ISAAFSDLGLDPATLLPSQLTDPPTGQPGTPPLPMPYPRTGQGGEPRLTL
PDAITGNPGDPRYPYRPEPPAPPPGGPPPGPPAQQPGDQP
>MT3202 conserved hypothetical protein
MSPSPSALLADHPDRIRWNAKYECADPTEAVFAPISWLGDVLQFGVPEGP
VLELACGRSGTALGLAAAGRCVTAIDVSDTALVQLELEATRRELADRLTL
VHADLCSWQSGDGRFALVLCRLFWHPPTFRQACEAVAPGGVVAWEAWRRP
IDVARDTRRAEWCLKPGQPESELPAGFTVIRVVDTDGSEPSRRIIAQRSL
>MT1770 conserved hypothetical protein
MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEPLVRAVG
LDLFTKMMDGELDMSTIADVSPAVAQAMVYGNAVRTKYFDDYLLNATAGG
IRQVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPS
AIRRAVPIDLRADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFD
NITALSAPGSMVATEFVTGIADFSAERARTISNPFRCHGVDVDLASLVYT
GPRNHVLDYLAAKGWQPEGVSLAELFRRSGLDVRAADDDTIFISGCLTDH
SSISPPTAAGWR
>MT1439 P450 heme-thiolate protein
MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGSDITRFR
CAGRRFVSISHPDYVDHVLHEARLKYVKSDEYGPIRATAGLNLLTDEGDS
WARHRGALNSTFARRHLRGLVGLMIDPIADVTAALVPGAQFDMHQSMVET
TLRVVANALFSQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYD
TLIWCIYSGVHLPPPLREMQEITLTLDRAINSVIDRRLAEPTNSADLLNV
LLSADGGIWPRQRVRDEALTFMLAGHETTANAMSWFWYLMALNPQARDHM
LTELDDVLGMRRPTADDLGKLAWTTACLQESQRYFSSVWIIAREAVDDDI
IDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPTDRPRCAYL
PFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRP
KHGVHVIGRRR
>MT2336 P450 heme-thiolate protein
MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYAL
CTQVLEDRRFSMKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMK
AITPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVL
GIPQEDGPKLFRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITT
GLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGSFLTTALISLI
QRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVR
KGELVLVLLEGANFDPEHFPNPGSIELDRPNPTSHLAFGRGQHFCPGSAL
GRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW
>MT3802 conserved hypothetical protein
MGAMTDEVMDWDSAYREQGAFEGPPPWNIGEPQPELATLIAAGKVRSDVL
DAGCGYAELSLALAADGYTVVGIDLTPTAVAAATKAAEERGLTTASFVQA
DITEFAAYPAGSAGRFSTVIDSTLFHSLPVDSRDRYLSSVHRAAAPGASY
YVLVFAKGAFPAELEVKPNEVDEDELRAAVSKYWKIDEIRPAFIHVNPVT
IPPQLAGAPVEFPPYDHDEKGRVKFPAYLLTAHKAG
>MT1929 P450 heme-thiolate protein
MHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPFYDEVRSHGALVRNRA
NYLTVDHRLAHDLLRSDDFRVVSFGENLPPPLRWLERRTRGDQLHPLREP
SLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLLDRFAEQPGI
VDVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSLDIGIPWRQY
LRVQQGIRGFDCWLEGHLQQLRHAPGDDLMSQLIQIAESGDNETQLDETE
LRAIAGLVLVAGFETTVNLLGNGIRMLLDTPEHLATLRQHPELWPNTVEE
ILRLDSPVQLTARVACRDVEVAGVRIKRGEVVVIYLAAANRDPAVFPDPH
RFDIERPNAGRHLAFSTGRHFCLGAALARAEGEVGLRTFFDRFPDVRAAG
AGSRRDTRVLRGWSTLPVTLGPARSMVSP
>MT3600 virulence factor mce family protein
MGRVAMLTGSRGLRYATVIALVAALVGGVYVLSSTGNKRTIVGYFTSAVG
LYPGDQVRVLGVPVGEIDMIEPRSSDVKITMSVSKDVKVPVDVQAVIMSP
NLVAARFIQLTPVYTGGAVLPDNGRIDLDRTAVPVEWDEVKEGLTRLAAD
LSPAAGELQGPLGAAINQAADTLDGNGDSLHNALRELAQVAGRLGDSRGD
IFGTVKNLQVLVDALSESDEQIVQFAGHVASVSQVLADSSANLDQTLGTL
NQALSDIRGFLRENNSTLIETVNQLNDFAQTLSDQSENIEQVLHVAGPGI
TNFYNIYDPAQGTLNGLLSIPNFANPVQFICGGSFDTAAGPSAPDYYRRA
EICRERLGPVLRRLTVNYPPIMFHPLNTITAYKGQIIYDTPATEAKSETP
VPELTWVPAGGGAPVGNPADLQSLLVPPAPGPAPAPPAPGAGPGEHGGGG
>MT1222 acyl-CoA synthase
MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEVFRRTRI
VAHEVRRHCTTGDRAVILAPQGLAYIAAFLGSMQAGAIAVPLSVPQIGSH
DERVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLT
GNSPSFRVKDLPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFG
DRNGVAPPDTTIVSWLPFYHDMGLVLGIIAPILGGYRSELTSPLAFLQRP
ARWLHSLANGSPSWSAAPNFAFELAVRKTTDADIEGLDLGNVLGITSGAE
RVHPNTLSRFCNRFAPYNFREDMIRPSYGLAEATLYVASRNSGDKPEVVY
FEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCIECPAGTIG
EIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLS
EDEMFIVGRMKDMLIVYGRNHYPEDIESTVQEITGGRVAAISVPVDHTEK
LVTVIELKLLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGS
IPTTTSGKIRRAACVEQYRLQQFTRLDG
>MT3937 oxidoreductase, putative
MTGYDAIVIGAGHNGLTAAVLLQRAGLRTACLDAKRYAGGMASTVELFDG
YRFEIAGSVQFPTSSAVSSELGLDSLPTVDLEVMSVALRGVGDDPVVQFT
DPTKMLTHLHRVHGADAVTGMAGLLAWSQAPTRALGRFEAGTLPKSFDEM
YACATNEFERSAIDDMLFGSVTDVLDRHFPDREKHGALRGSMTVLAVNTL
YRGPATPGSAAALAFGLGVPEGDFVRWKKLRGGIGALTTHLSQLLERTGG
EVRLRSKVTEIVVDNSRSSARVRGVRTAAGDTLTSPIVVSAIAPDVTINE
LIDPAVLPSEIRDRYLRIDHRGSYLQMHFALAQPPAFAAPYQALNDPSMQ
ASMGIFCTPEQVQQQWEDCRRGIVPADPTVVLQIPSLHDPSLAPAGKQAA
SAFAMWFPIEGGSKYGGYGRAKVEMGQNVIDKITRLAPNFKGSILRYTTF
TPKHMGVMFGAPGGDYCHALLHSDQIGPNRPGPKGFIGQPIPIAGLYLGS
AGCHGGPGITFIPGYNAARQALADRRAANCCVLSGR
>MT0283 substrate--CoA ligase
MPNLTDLPGQAVSKLQKSIGQYVARGTAELHYLRKIIESGAIGLEPPLNY
AALAADIRKWGEVGMLPSHNARRAPNRAAVIDEEGTLTFSELDEAAHAVA
NGLLAKGVRAGDGVAILARNHRWFVIANYGAARVGARIILLNSEFSGPQI
KEVSDREGAKVIIYDDEYTKAVSLAQPPLGKLRALGVNPDDDKPSGSSDE
TLAELIAHSSTAPAPKASRRASIIILTSGTTGTPKGANRNTPPTLAPIGG
ILSHVPFKAGEVTLLPSPMFHALGYMHAALAMFLGSTLVLRRRFKPALVL
EDIEKHKATSMVVVPVMLSRILDQLEKTEPKPDLSSLKIVFVSGSQLGAE
LATRALGDLGPVIYNMYGSTEVAFATIAGPKDLQFNPSTVGPVVKGVTVK
ILDENGNEVPQGAVGRIFVGNAFPFEGYTGGGGKQIIDGLLSSGDVGYFD
ERGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEAAAIGVDDKEF
GARLRAFVVKKPGADLDEDTIKQYVRDHLARYKVPREVIFLDELPRNPTG
KVLKRELRKL
>MT1703 polyketide synthase, putative
MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHV
GKVVLTMPDAWAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGE
HTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAG
VLDDAVITGLTPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGI
VGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQASAMTEHLGE
RDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELP
ALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLTPEQRHRELTELVCSNA
AIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY
PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPD
DKTRLIKRLQAILTDCTAPPASSGPSTTHDDEDITTATESQLFAILDDEL
GP
>MT1180 hypothetical protein
MTSGAAASASRVDHPLFARIWPVVAAHEAEAIRALRRENLAGLSGRVLEV
GAGVGTNFAYYPVAVEQVIAMEPEPRLAAKARIAAADAPVPIVVTDKTVE
EFRDTETFDAVVCSLVLCSVSDPGAVLAHLRSLLRRGGELRYLEHVASAG
ARGRVQRFVDATFWPRLAGNCHTHRHTERAILDAGFVVDSSRREWAFPAW
VPLPVSELALGRAHRT
>MT3311 isochorismate synthase, putative
MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRSGTAPIL
LGALPFDVSRPAALMVPDGVLRARKLPDWPTGPLPKVRVAAALPPPADYL
TRIGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTA
YGYLVDLTSAGNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADP
KLDAANAAALASSAKNRHEHQLVVDTMRVALEPLCEDLTIPAQPQLNRTA
AVWHLCTAITGRLRNISTTAIDLALALHPTPAVGGVPTKAATELIAELEG
DRGFYAGAVGWCDGRGDGHWVVSIRCAQLSADRRAALAHAGGGIVAESDP
DDELEETTTKFATILTALGVEQ
>MT3599 virulence factor mce family protein
MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSVTVEMAD
VATLPQNSPVMVDDVTVGSVAGIVAVQRPDGSFYAAVKLDLDKNVLLPAN
AVAKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEV
FSALGVVVNKGNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNR
QVHDIIDALDGLNRVSAILARDKDNLGRALDTLPDAVRVLNQNRDHIVDA
FAALKRLTMVTSHVLAETKVDFGEDLKDLYSIVKALNDDRKDFVTSLQLL
LTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRIGETFFTTAYFDPNMAHMD
EILNPPDFLIGELANLSGQAADPFKIPPGTASGQ
>MT3004 polyketide synthase
MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIGCRFPGN
VTGPESFWQLLADGVDTIEQVPPDRWDADAFYDPDPSASGRMTTKWGGFV
SDVDAFDADFFGITPREAVAMDPQHRMLLEVAWEALEHAGIPPDSLSGTR
TGVMMGLSSWDYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPA
VAVDTACSSSLVAIHLACQSLRLRETDVALAGGVQLTLSPFTAIALSKWS
ALSPTGRCNSFDANADGFVRGEGCGVVVLKRLADAVRDQDRVLAVVRGSA
TNSDGRSNGMTAPNALAQRDVITSALKLADVTPDSVNYVETHGTGTVLGD
PIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAGFIKAVLAV
QRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGL
SGTNAHVVVEQAPDTAVAAAGGMPYVSALNVSGKTAARVASAAAVLADWM
SGPGAAAPLADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPR
VGVVDCDQHAGGPGRVFVYSGQGSQWASMGQQLLANEPAFAKAVAELDPI
FVDQVGFSLQQTLIDGDEVVGIDRIQPVLVGMQLALTELWRSYGVIPDAV
IGHSMGEVSAAVVAGALTPEQGLRVITTRSRLMARLSGQGAMALLELDAD
AAEALIAGYPQVTLAVHASPRQTVIAGPPEQVDTVIAAVATQNRLARRVE
VDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADADYW
SANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVM
STMNRELDQTLYFHAQLAAVGVAASEHTTGRLVDLPPTPWHHQRFWVTDR
SAMSELAATHPLLGAHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIM
PAAGFAEIALAAASEALGTAADAVAPNIVINQFEVEQMLPLDGHTPLTTQ
LIRGGDSQIRVEIYSRTRGGEFCRHATAKVEQSPRECAHAHPEAQGPATG
TTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAETEISIPDEAPRH
PGYRLHPVVLDAALQSVGAAIPDGEIAGSAEASYLPVSFETIRVYRDIGR
HVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLP
LEQKIFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKS
MADDFVQQWRSPMRRVHTADIHDESAVLAAFAETAGDPEHPPVGVVVFVG
GASSRLDDELAAARDTVWSITTVVRAVVGTWHGRSPRLWLVTGGGLSVAD
DEPGTPAAASLKGLVRVLAFEHPDMRTTLVDLDITQDPLTALSAELRNAG
SGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVRQGASYVVTGGLGGL
GLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRGDVAS
PGVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKAT
GALRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRR
ASGLPAAVINWGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIR
TGVARLRADRALVAFPEIRSISYFTQVVEELDSAGDLGDWGGPDALADLD
PGEARRAVTERMCARIAAVMGYTDQSTVEPAVPLDKPLTELGLDSLMAVR
IRNGARADFGVEPPVALILQGASLHDLTADLMRQLGLNDPDPALNNADTI
RDRARQRAAARHGAAMRRRPKPEVQGG
>MT0341 hypothetical protein
MVATDFSDVAVAQLRRSAQARGVSARVQPIVHDLRQPLPVKTGSIDGAFA
HMALCMALSTSEIHAVVAEVGRVLRPGGKFIYTVRHTGDAHYGAGQAHGD
DIFECAGFAVHFFRRELVARLATGWVLEEVHDFEEGELPRRLWRVTVTKP
A
>MT2058 oxidoreductase, short-chain dehydrogenase/reductase family
MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEGKAVAAE
LADAARYVHLDVTQPAQWTAAVDTAVTAFGGLHVLVNNAGILNIGTIEDY
ALTEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVAC
HGYTATKFAVRGLTKSTALELGPGGIRVNSIHPGLVKTPMTDWVPEDIFQ
TALGRAAEPVEVSNLVVYLASDESSYSTGAEFVVDGGTVAGLAHNDFGAV
EVSSQPEWVT
>MT0616 conserved hypothetical protein
MTTHAVIITYLRDQTQPAVDAIGGFYRTCVLTGKALVRRPFHWREAIEQG
WFITSVSLLPTLAVSIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQ
LGPLTTVLVIAGAGATAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV
VAATIVAALLNGAVITIGLVGGFVFSVFIQHVSAGAYVGTLTLVTGLPEV
IISVVKSATFGLIAGLVGCYRGLTTKGGPKGVGTAVNETLVLCVIALFAT
NVVLTTIGVRFGTGH
>MT1417 chalcone/stilbene synthase family protein
MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPRRVVNQS
DAADRVAELFLDPGQRERIPRVYQKSRITTRRMAVDPLDAKFDVFRREPA
TIRDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPG
VDVAIVKELGLSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVV
CIELCSVNAVFADDINDVVIHSLFGDGCAALVIGASQVQEKLEPGKVVVR
SSFSQLLDNTEDGIVLGVNHNGITCELSENLPGYIFSGVAPVVTEMLWDN
GLQISDIDLWAIHPGGPKIIEQSVRSLGISAELAAQSWDVLARFGNMLSV
SLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIRR
>MT2449 polyketide synthase, putative
MWGGVMAPKQLPDGRVAVLLSAHAEELIGPDARAIADYLERFPATTVTEV
ARQLRKTRRVRRHRAVLRAADRLELAEGLRALAAGREHPLIARSSLGSAP
RQAFVFPGQGGHWPGMGAVAYRELPTYRTATDTCAAAFAAAGVDSPLPYL
IAPPGTDERQAFCEIEIEGAQFVHAVALAEVWRSCGVLPDLTVGHSLGEV
AAAYLAGSITLSDAVAVVAARANVVGRLPGRYAVAALGIGEQDASALIAT
TGGWLELSVVNASSTVAVSGERQAVAAIVDTVRSSGHFARGITVGFPVHT
SVLESLRDELCEQLPDSEFMEAPVQFIGGTTGDVVAPGTTFGDYWYANLR
HTVRFDRAVESAIRCGARAFIEISAHPALLFAIGQNCEGAANLPDGPAVL
VGSARRGERFVDALSANIVSAAVADPGYPWGDLGGDPLDGDVDLSGFPNA
PMRAVPMWAHPEPLPPVSGLTIAVERWERMVPSTPVAGRHRHLAVLDLGA
HRALAQTLCAAIDSHPDTELSAARDAELILVIAPDFEHTDAVRAAGALAD
LVGAGLLDYPMHIGARCQSVCLVTVGAEQVDAADAVPSAGQAALAAMHRS
IGFEHPEQTFSHLDLPSWDLDPVLGVSVITAVLRGFGETALRGSVNGYTL
FERTLADAPAVPNWSLDSGVLDDVVVTGGAGAIGMHYARYLAEHGARRIV
LLSRRAADQATVAMLRKQHGTVIVSPPCDITDPTQLSAIAAEYGGVGASL
IVHAAGSVISGTAPGVTSAAVVDNFAAKVLGLAQMIELWPLRPDVRTLLC
SSVMGVWGGHGVVAYSAANRLLDVMAAQLRAQGRHCVAVKWGLWQAPKAG
EPARGIADAVTIARVERSGLRQMAPQQAIEASLHEFTVDPLVFAADAARL
QMLLDSRQFERYEGPTDPNLTIVDAVRTQLAAVLGIPQAGEVNLQESLFD
LGVDSMLALDLRNRLKRSIGATVSLATLMGDITGDGLVAKLEDADERSHT
AQKVDISRD
>MT3009 conserved hypothetical protein
MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFDALLETH
PVLASHLEQSSDGGWNLVADDLLHSGICVIDGTAATNGSPSGNAELRLDQ
SVSLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTG
DPGPITPQPTPLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPA
VLAHPGLPQAVPVTRLWLSKQQTSDLMAFGREHRLSLNAVVAAAILLTEW
QLRNTPHVPIPYVYPVDLRFVLAPPVAPTEATNLLGAASYLAEIGPNTDI
VDLASDIVATLRADLANGVIQQSGLHFGTAFEGTPPGLPPLVFCTDATSF
PTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEHHGHIAEPG
KSLEAIRSLLCTVPSEYGWIME
>MT1554 hypothetical protein
MRFDQLVRIVNAADPFSINDLGCGYGALLDYLDARGFKTDYTGIDVSPEM
VRAAALRFEGRANADFICAARIDREADYSVASGIFNVRLKSLDTEWCAHI
EATLDMLNAASRRGFSFNCLTSYSDASKMRDDLYYADPCALFDLCKRRYS
KSVALLHDYGLYEFTILVRKAS
>MT3940 hypothetical protein
MFWAMAMNLLHRRHCSSAGWEKAVANQLLPWALQHVELGPRTLEIGPGYG
ATLQALLGLTASLTAVEVDNSMVERLNRRYGQRARIIRGDGTQTGLPDDH
FTSVVCFTMLHHVASAQMQDQLFAEAYRVLQPGGVFAGSDGVPSLPFRLI
HIADTYTPIAPADLPGRLRAVGFTDIHVDVAGARLRWRATKPVAA
>MT3507 conserved hypothetical protein
MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMTRTDNDT
WDLASSVGATATMIATARALASRAENPLINDPFAEPLVRAVGIDLFTRLA
SGELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAA
GLDTRAYRLPWPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVD
LRNDWPTALKNAGFDPARPTAFSAEGLLSYLPPQGQDRLLDAITALSAPD
SRLATQSPLVLDLAEEDEKKMRMKSAAEAWRERGFDLDLTELIYFDQRND
VADYLAGSGWQVTTSTGKELFAAQGLPPFEDDHITRFADRRYISAVLK
>MT3498 oxidoreductase, short-chain dehydrogenase/reductase family
MRYVVTGGTGFIGRHVVSRLLDGRPEARLWALVRRQSLSRFERLAGQWGD
RVRPLVGDLTELELSERTIAELGDIDHVLHCAAVHDTTWADATRAVIELA
ARLDATFHHVSSIAVAGDFAGHYTEADFDVGQRLPTPYHRMTFEAERLVR
STPGLRYRIYRPAVVVGDSRTGEMDTIDGPYYLFGVLAKLAVLPSFTPML
LPDIGRTNIVPVDYVADALVALMHADGRDGQTFHLTAPTAIGLRGIYRGI
AGAAGLPPLLGTLPGFVAAPVLNARGRAKVLRNMAATQLGIPAEIFDVVG
CAPTFTSDTTREALRGTGIHVPEFATYAPGLWRYWAEHLDPDRARRNDPL
LGRHVIITGASSGIGRASAIAVAKRGATVFALARNGNALDELVTEIRAHG
GQAHAFTCDVTDSASVEHTVKDILGRFDHVDYLVNNAGRSIRRSVVNSTD
RLHDYERVMAVNYFGAVRMVLALLPHWRERRFGHVVNVSSAGVQARNPKY
SSYLPTKAALDAFADVVASETLSDHITFTNIHMPLVATPMIVPSRRLNPV
RAISAERAAAMVIRGLVEKPARIDTPLGTLAEAGNYVAPRLSRRILHQLY
LGYPDSAAAQGISRPDADRPPAPRRPRRSARAGVPRPLRRLGRLVPGVHW
>MT1834 P450 heme-thiolate protein
MVGARPRAILARSAPLGYVCDLRQERRHERLERMTTPGEDHAGSFYLPRL
EYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYLTRREDVLAALRNPKVFS
SRKALQPPGNPLPVVPLAFDPPEHTRYRRILQPYFSPAALSKALPSLRRH
TVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRLIGWKDAV
IAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPL
SEIEVLGLSHLLILAGLDTVTAAVGFSLLELARRPQLRAMLRDNPKQIRV
FIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDA
MSTDELVMDGKVHRHWGFGGGPHRCLGSHLARLELTLLVGEWLNQIPDFE
LAPDYAPEIRFPSKSFALKNLPLRWS
>MT2023 virulence factor mce family protein
MLHLPRRVIVQLAVFTVIAVGVLAITFLHFVRLPAMLFGVGRYTVTMELV
EAGGLYRTGNVTYRGFEVGRVAAVRLTDTGVQAVLALKSGIDIPSDLKAE
VHSHTAIGETYVELLPRNAASPPLKNGDVIALADTSVPPDINDLLSAANT
ALEAIPHENLQTVIDESYTAVAGLGLELSRLIKGSAELAIDARANLDPLV
ALIDRAGPVLDSQTHTSDAIAAWAAQLAAVTGQLQTHDSAVGDLIDRGGP
ALGETRQLLERLQPTVPILLANLVSVGQVALTYHNDIEQLLVVFPMAIAA
EQAGILANLNTKQAYRGQYLSFNLNLNLPPPCTTGFLPAQQRRIPTFEDY
PDRPAGDLYCRVPQDSPFNVRGARNIPCETVPGKRAPTVKLCESDEPYLP
LNDGYNWKGDPNATVPGLGSGQDIPQTWQTMLLPPGS
>MT2330 P450 heme-thiolate protein
MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTRRRASSG
GIPVATSMWRDAQTVRTYGPAVAKALALRVAGKARSRLTGRHCRKFMQLT
DFDPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNH
DTLSSARGVTFSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMV
DQLARELVGGLLTQTPADVVSTVAAPMPMRAITSVLGVDGPDEAAFCRLS
NQAVRITDVALSASGLISLVQGFAGFRRLRALFTHRRDNGLLRECTVLGK
LATHAEQGRLSDDELFFFAVLLLVAGYESTAHMISTLFLTLADYPDQLTL
LAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPAGSLVLLAW
GAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILR
EIVANIDRIEVVEPPTWTTNANLRGLTRLRVAVTPRVAP
>MT0851 conserved hypothetical protein
MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAPLVRAVG
MDVYTRLVDWQIPVEGDSEFDPQRMATGMACRTRFFDQFFLDATHSGIGQ
FVILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATER
RTVAVDLRDDWATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNIT
ALSAPGSRLAFEFVPDTAIFADERWRNYHNRMSELGFDIDLNELVYHGQR
GHVLDYLTRDGWQTSALTVTQLYEANGFAYPDDELATAFADLTYSSATLM
R
>MT3584 hypothetical protein
MPPTKEWPVSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPD
LLRQLVDESKASEVVEPWSLRLSHPELLYHPTQSWVVDDNFDLDYHVRRS
ALASPGDERELGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMH
HSLIDGYTGQKMLARSLSTDPHDTTHPLFFNIPTPGRSPADTQDSVGGGL
IAGAGNVLDGLGDVVRGLGGLVSGVGSVLGSVAGAGRSTFELTKALVNAQ
LRSDHEYRNLVGSVQAPHCILNTRISRNRRFATQQYPLDRLKAIGAQYDA
TINDVALAIIGGGLRRFLDELGELPNKSLIVVLPVNVRPKDDEGGGNAVA
TILATLGTDVADPVQRLAAVTASTRAAKAQLRSMDKDAILAYSAALMAPY
GVQLASTLSGVKPPWPYTFNLCVSNVPGPEDVLYVRGSRMEASYPVSLVA
HSQALNVTLQSYAGTLNFGFIGCRDTLPHLQRLAVYTGEALDQLAAADGA
AGLGS
>MT2925 oxidoreductase, short-chain dehydrogenase/reductase family
MMDLSQRLAGRVAVITGGGSGIGLAAGRRMRAEGATIVVGDVDVEAGGAA
ADELSGLFVPTDVCDEDAVNGLFDGAAETYGRIDIAFNNAGISPPEDNLI
ENTELAAWQRVQDVNLKSVYLCCRAALRHMVLAGKGSIVNTASFVAVMGS
ATSQISYTASKGGVLAMSRELGVQFARQGIRVNALCPGPVNTPLLQELFA
KNPERAARRMVHVPLGRFAEPDEIAAAVAFLASDDASFITASTFLVDGGI
SSAYVTPL
>MT0180 virulence factor mce family protein
MRIGLMGIVVALLVVAVGQSFTSVPMLFAKPSYYGQFTDSGGLHKGDRVR
IAGLGVGTVEGLKIDGDHIVVKFSIGTNTIGTESRLAIRTDTILGRKVLE
IEPRGAQALPPGGVLPVGQSTTPYQIYDAFFDVTKAASGWDIETVKRSLN
VLSETVDQTYPHLSAALDGVAKFSDTIGKRDEQITHLLAQANQVASILGD
RSEQVDRLLVNAKTLIAAFNERGRAVDALLGNISAFSAQVQNLINDNPNL
NHVLEQLRILTDLLVDRKEDLAETLTILGRFSASFGETFASGPYFKVLLA
NLVPGQILQPFVDAAFKKRGISPEDFWRSAGLPAYRWPDPNGTRFPNGAP
PPPPPVLEGTPEHPGPAVPPGSPCSYTPPADGLPRPWDPLPCANLTQGPF
GGPDFPAPLDVATSPPNPDGPPPAPGLPIAGRPGEVPPNVPGTPVPIPQE
APPGARTLPLGPAPGPAPPPTAPGPPAPPGPGPQLPAPFINPGGTGGSGV
TGGSEN
>MT0156 oxidoreductase, short-chain dehydrogenase/reductase family
MIAERSLMPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARD
GTGAGSAMADEVVAEIRDKGGRAVANYDSVATEDGAANIIKTALDEFGAV
HGVVSNAGILRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSY
GRVVVATSTSGLFGNFGQTNYGAAKLGLVGLINTLALEGAKYNIHANALA
PIAATRMTQDILPPEVLEKLTPEFVAPVVAYLCTEECADNASVYVVGGGK
VQRVALFGNDGANFDKPPSVQDVAARWAEITDLSGAKIAGFKL
>MT0622 virulence factor mce family protein
MSTIFDIRSLRLPKLSAKVVVVGGLVVVLAVVAAAAGARLYRKLTTTTVV
AYFSEALALYPGDKVQIMGVRVGSIDKIEPAGDKMRVTLHYSNKYQVPAT
ATASILNPSLVASRTIQLSPPYTGGPVLQDGAVIPIERTQVPVEWDQLRD
SINGILRQLGPTERQPKGPFGDLIESAADNLAGKGRQLNETLNSLSQALT
ALNEGRGDFVAITRSLALFVSALYQNDQQFVALNENLAEFTDWFTKSDHD
LADTVERIDDVLGTVRKFVSDNRSVLAADVNNLADATTTLVQPEPRDGLE
TALHVLPTYASNFNNLYYPLHSSLVGQFVFPNFANPIQLICSAIQAGSRL
GYQESAELCAQYLAPVLDALKFNYLPFGSNPFSSAATLPKEVAYSEERLR
PPPGYKDTTVPGIFSRDTPFSHGNHEPGWVVAPGMQGMQVQPFTANMLTP
ESLAELLGGPDIAPPPPGTNLPGPPNAYDESNPLPPPWYPQPASLPAAGA
TGQPGPGQ
>MT0624 virulence factor mce family protein
MSATRRTRMTRRADRWWKGLSEEMLTRAIKTQLVLLTVLAVIAVVVLGWY
FLRIPSLVGIGRYTLYAELPRSGGLYRTANVTYRGITIGKVTGVEPTERG
ARATMSIDNGYQIPTDASANVHSVSAVGEQFVDLVSTRTSGPYLRHGQTI
TTTTVPSQIGPALDAANRGLAVLPKDRVASVLHEASEAVGGLGSSLNRLI
EATQAIAHDVRGSLEDIDDIIERSAPIIDSQVNSGNEIARWAANLNTLAA
QTAQTDPAVRSILANAAPTADQVNATFSDVRESLPQTLANLEVVIDMLKR
YHNGVEQALVFLPQSGAIAQSVTTEFPGQAGLGVGGLALNQPPPCLTGFL
PASEWRSPADTSTAPLPKGTYCRIPMDASNVVRGARNNPCVDVPGKRAAT
PRECRSNEAYVPGGTNPWYGDPNQMLSCPAPAARCDQPVKPGQVIPAPSV
NNGINPLPADQLPGTPPPVNDPLQRPGSGTVQCNGQQPNPCVYTPSTFPT
TIYDVQSGKVVAPDGVVYSVEASTHAGADGWKVMLAPTG
>MT2496 hypothetical protein
MDNLPIESAESTRLAKAAMTRRFYTRSVVKGEITLPAVPSMIDEYVTMCA
GLFAGVGRKFSDEELAHLRAVLQGQLAEAYAASQRSTIVISYNAPMGPTL
HYQVRAQWRTVAQEYENWIATREPPLFGTEPDARVWALANEAADPTTHRV
LEIGAGTGRNALALARRGHPVDVVEMTPKFADIIRSDAERDSLDVRVIMR
DVFSTMDDLRQDYQLMVLSEVVPDFRTTQQLRNLFELAAQCLAPGARLVF
NAFLANGDYAPDQAAREFGQQMYTGMCTRAEMSAAAAGLPLELVADDSVY
DYEKTHLPPGAWPPTSWYADWIRGLDVFTTNVESCPIEMRWLVFQRRR
>MT3869 metallo-beta-lactamase superfamily protein
MHRSSGNVVPMEHKPPTAVIQAAHGEHSLPLHDTTDFDDADRGFIAALSP
CVIKAADGRVVWDNDAYSFLDGAAPTSVHPSLWRQSQLTAKQGLYQVVPG
IYQVRGFDISNISFVEGDTGLIVIDPLVSTEVAAAALDLYRAHRGADRPV
VAVIYTHSHVDHFGGVLGVTTQADVDAGKVAVLAPEGFTAHAVQENIYAG
SAMMRRAGYMYGTVLARGLRGHVGCGLGQTLSTGEVSLVVPTVDITETGE
THTIDGVEIEFQMAPGTEAPAEMHFYFPRFRALCMAENATHNLHNLLTLR
GALVRDPRAWSGYLTEAIDTFADRTDVVFASHHWPTWGREKIVEFLSQQR
DMYSYLHDQTLRLLNQGYTGVEIAEMFQLPPALQRAWHTHGYYGSVSHNV
KAIYQRYMGWFDGNPGWLWPHPPEALAPRYVDALGGIDRVLELAREAFDA
GDFRWAATLLDHAVFADSEHAAARGLYADTLEQLAYGAECATWRNFFLTG
AAELRDGNPGSSGQVPAPTFFAQLTPDQIFDVLAISINGPRAWDLDLAID
FTFTEPDVNYRLTLRNGVLIHRKLPADPATANATVTVGDKVRLVAAALGD
ISSPGFEVFGDRTVLQTFLSVLDRPDSAFNIVTP
>MT2022 virulence factor mce family protein
MRIGLTLVMIAAVVASCGWRGLNSLPLPGTQGNGPGSFAVQAQLPDVNNI
QPNSRVRVADVTVGHVTKIERQGWHALVTMRLDGDVDLPANATAKIGTTS
LLGSYHIELAPPKGEARQGKLRDGSLIALSHGSAYPSTEQTLAALSLVLN
GGGLGQVQDITEALSTAFAGREHDLRGLIGQLDTFTAYLNNQSGDIIAAT
DSLNRLVGKFADQQPVFDRALATIPDALAVLADERDTLVEAAEQLSKFSA
LTVDSVNKTTANLVTELRQLGPVLESLANSGPALTRSLSLLATFPFPNET
FQNFQRGEYANLTAIVDLTLSRIDQGLLTGTRWECHLTQLELQWGRTIGQ
FPSPCTAGYRGTPGNPLTIAYRWDQGP
>MT3907 polyketide synthase
MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDESVPMVE
LGLSSRDAVAMAADIEDLTGVTLSVAVAFAHPTIESLATRIIEGEPETDL
AGDDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGIT
DLPDGRWSEFLEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADN
IDPQQRMALELTWEALEHARIPASSLRGQAVGVYIGSSTNDYSFLAVSDP
TVAHPYAITGTSSSIIANRVSYFYDFHGPSVTIDTACSSSLVAIHQGVQA
LRNGEADVVVAGGVNALITPMVTLGFDEIGAVLAPDGRIKSFSADADGYT
RSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLIAPNQDAQA
DVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADR
PALLGAVKTNVGHLESAAGAASMAKVVLALQHDKLPPSINFAGPSPYIDF
DAMRLKMITTPTDWPRYGGYALAGVSSFGFGGANAHVVVREVLPRDVVEK
EPEPEPEPKAAAEPAEAPTLAGHALRFDEFGNIITDSAVAEEPEPELPGV
TEEALRLKEAALEELAAQEVTAPLVPLAVSAFLTSRKKAAAAELADWMQS
PEGQASSLESIGRSLSRRNHGRSRAVVLAHDHDEAIKGLRAVAAGKQAPN
VFSVDGPVTTGPVWVLAGFGAQHRKMGKSLYLRNEVFAAWIEKVDALVQD
ELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVIGQ
SLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEY
SADEIREVFSDFPDLEVCVYAAPTQTVIGGPPEQVDAILARAEAEGKFAR
KFATKGASHTSQMDPLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGG
EPIHDVEYWKKGLRHSVYFTHGIRNAVDSGHTTFLELAPNPVALMQVALT
TADAGLHDAQLIPTLARKQDEVSSMVSTMAQLYVYGHDLDIRTLFSRASG
PQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVALPDGRHVWEYAPR
DGNVDLAALVRAAAAHVLPDAQLTAAEQRAVPGDGARLVTTMTRHPGGAS
VQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPET
PAEDADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEP
EDLPWEVPLIELGLDSLMAVRIKNRVEYDFDLPPIQLTAVRDANLYNVEK
LIEYAVEHRDEVQQLHEHQKTQTAEEIARAQAELLHGKVGKTEPVDSEAG
VALPSPQNGEQPNPTGPALNVDVPPRDAAERVTFATWAIVTGKSPGGIFN
ELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNIEALADKVRTYLEAG
QIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPADTPMYGF
ERVEGSIEERAQQYVPKLIEMQGDGPYVLVGWSLGGVLAYACAIGLRRLG
KDVRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYE
QLEELDDEGQVRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQP
YDGHVTLYMADRYHDDAIMFEPRYAVRQPDGGWGEYVSDLEVVPIGGEHI
QAIDEPIIAKVGEHMSRALGQIEADRTSEVGKQ
>MT1701 polyketide synthase
MNSTPEDLVKALRRSLKQNERLKRENRDLLARTTEPVAVVGMGCRYPGGV
DSPETLWELVAHGRDAVSEFPADRGWDVAGLFDPDPDAVGKSYTRCGGFL
TDVAGFDAEFFGIAPSEALAMDPQQRLLLEVSWEALERAGIDPITLRGSQ
TGVFAGVFHGSYGGQGRVPGDLERYGLRGSTLSVASGRVAYVLGLQGPAV
SVDTACSSSLVALHLAVQSLRLGECDLALVGGVTVMATPAMFIEFSRQRA
LSADGRCKAYAGAADGTAFAEGAGVLVLARLADARRLGHPVLALVRGSAV
NQDGASNGLATPNGPAQQRVITAALASARLGVADVDVVEGHGTGTTLGDP
IEAQAILATYGQRPADRPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHG
VLPKTLHVDVPTPHVDWSAGAVSLLTEPRPWHVPGRPRRAGVSSFGISGT
NAHVILEEAPAVEPVGAAHGNDPVAVPWVLSARSAQALTNQARRLLAWVG
ADENVRPLDVGWSLVNTRSLFDHRAVVVGADRTQLMEGLTGLAAGVPGAD
VVAGRAQTVGKTAFVFPGQGAQWLGMGAQLCATAPVFAEHIHRCERALRE
HVEWSLLDVLRGAPGAPGLDRVDVVQPALWAVMVSLAELWRSVGVVPDAV
IGHSQGEIAAAYVAGALSLRDAAAVVALRSRLLVRLGGAGGMVSLACGQP
QAEKLASQWGDRLNIAAVNGVSSVVLAGETDAVTELMQRCEAEGIRARRI
DVDYASHSAQVDAIREELIAALRGIEPRTSTVAFFSTVTGELMDTAGVNA
EYWYRSIRQPVQFERAVRNAFDGGYRVFVESSPHPVLIAGIEETLVDCDR
GATGEPIVIPTLGRDDGGVGRFWLSAGQAHVAGVGVDWRAAFADLGGRRV
ELPTYAFARQRFWLDGLGAVGGDLGGVGLVGAEHGLLAAVVQRPDSGGVV
LTGRISVVAAPWLADHAVGPVVLFPGTGFVELALRAGDEVGCSVLQELTL
QAPLVLPADGVRVQVVVGGVEQSGTRNVWVYSAAGQADSSPGWTLHAQGV
LGVGSVQPAAELSVWPPVGARAMDVADGYQVLAARGYGYGPAFRGLQALW
RRGAEVFADVTLPEGVPIRGFGIHPAVLDAALHAWGIVEGEQQTMLPFSW
QGVCLHASGAARVRVRLAPVGRGAVSVELADPQGLPVLSVRQLMVRPVSA
AALSRSTAGDRGLLEMIWTPVPLEGGDIGDDAVVWELPPHAGAQAGGDVL
AAVYRGVHEVLEVLQSWLASDATGLGVVVTRGAVGPVDDDVTDLAGAAVW
GLVRSAQAEHPGRVVLVDTDGSVAVEDAVGFGARSGEPQLVVRRGRVYAA
RLAPVAAGLTLPSASAGGWRLVAGGGGTLADVVVAPVAPVELATGQVRVA
VGAVGVNFRDVLVALGMYPGGGELGVDGAGVVVEVGPGVTGLAVGDRVMG
LLGLVGSEAVVDARLVTMVPAGWSLVEAAAVPVAFLTAFYGLSVLAEVAA
GQKVLVHAGTGGVGMAAVSLARYWGAEVFVTASRAKWDTLRAMGFDDIHI
SDSRSLEFEEAFLRATEGSGVDVVLNSLAGEFTDASLRLLPSGGRFIELG
KTDIRDGQTVAERHRGVRYRAFDLVEAGPDRIAAMLSEVVGLLAAGVLAR
LPVKTFDARCAPAAYRFVSQARHIGKVVLTIPDGPGGQSGLAGGTVVVTG
GTGMAGSAVATHLVRRHGVANLVLVSRSGEQADRAAEVAALLREGGAQVA
VVSCDVADRDALAALLAGLDPRYPLKGVFHAAGVLDDAVITGLTPDRVDT
VLRAKVDGAWNLHELTEDMDLSAFVVFSSMAGIVGTPAQGNYAAANAFLD
GLVAYRRSRGLAGLSVAWGLWEQASAMTRHLGERDRARMTQAGLAPLTTE
QALGFLDTALQADRAVVVAARLDRAALAGAGAALPALFSQLAAGPTRRRI
DAADTAVSMSGLVSRLHALTPERRQRELTDLVISNAAAVLGRSSSVDINA
HKAFQDLGFDSLTAVELRNRLKTATGLTLSPTLIFDYPTPATLAEHLDSR
LVTASGSDQQSLSDRVDDITRELVVLLDQPDLSANVKAHLRTRLQTMLTS
LTTEDDDIAAATESQLFAILDEELGS
>MT1937 hypothetical protein
MPRTNNDAWDLATSVGATATMVAAARAVATRADNPLIDDPFAEPLVRAVG
IDFFTRWAAGNIKATDVDDPDGTWGLQRLADLLAARTRYFDAFFRDATSA
GIRQAVILASGLDARAYR
>MT3021 substrate--CoA ligase
MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLGEVLRNR
GLSSGDRVLLCLPDSPDLVQLLLACLARGVMAFLANPELHRDDHALAARN
TEPALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYA
TYTSGTTGPPKAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAY
GLGNSVWFPLATGGSAVINSAPVTPEAAAILSARFGPSVLYGVPNFFARV
IDSCSPDSFRSLRCVVSAGEALELGLAERLMEFFGGIPILDGIGSTEVGQ
TFVSNRVDEWRLGTLGRVLPPYEIRVVAPDGTTAGPGVEGDLWVRGPAIA
KGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEVIGGVNVDP
REVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDL
HRGLLNRLSAFKVPHRFAVVDRLPRTPNGKLVRGALRKQSPTKPIWELSL
TEPGSGVRAQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAV
CAEAAKMLGEPDPWSVDQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPET
VGWDYGSISGLAQYLEAELAGGHGRLKSAGPVNSGATGLWAIEEQLNKVE
ELVAVIADGEKQRVADRLRALLGTIAGSEAGLGKLIQAASTPDEIFQLID
SELGK
>MT3653 oxidoreductase, short-chain dehydrogenase/reductase family
MTLAEAADAINFGLAGRVVLVTGGVRGVGAGISSVFAEQGATVITCARRA
VDGQPYEFHRCDIRDEDSVKRLVGEIGERHGRLDMLVNNAGGSPYALAAE
ATHNFHRKIVELNVLAPLLVSQHANVLMQAQPNGGSIVNICSVSGRRPTP
GTAAYGAAKAGLENLTTTLAVEWAPKVRVNAVVVGMVETERSELFYGDAE
SIARVAATVPLGRLARPADIGWAAAFLASDAASYISGATLEVHGGGEPPP
YLGASSANK
>MT1449 conserved hypothetical protein
MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAAAGIGPG
VRVLDVAAGSGNISLPAAKTGATVISTDLTPELLQRSQARAAQQGLTLQY
QEANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVI
SWTCEGFFGRMLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLK
TARGLLEVKRFDTAQAVHDYFKNNYGPTIEAYAHIGDNAVLAAELDRQLV
ELAAQYLSDGVMEWEYLLLTAEKR
>MT1563 conserved hypothetical protein
MRLARRARNILRRNGIEVSRYFAELDWERNFLRQLQSHRVSAVLDVGANS
GQYARGLRGAGFAGRIVSFEPLPGPFAVLQRSASTDPLWECRRCALGDVD
GTISINVAGNEGASSSVLPMLKRHQDAFPPANYVGAQRVPIHRLDSVAAD
VLRPNDIAFLKIDVQGFEKQVIAGGDSTVHDRCVGMQLELSFQPLYEGGM
LIREALDLVDSLGFTLSGLQPGFTDPRNGRMLQADGIFFRGSD
>MT1991 oxidoreductase
MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGDAADAAAT
KIGCGAAACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLID
TTVEDFDRVIAINLRGAWLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGG
TGAYGMSKAGIIQLSRITAAELRSSGIRSNTLLPAFVDTPMQQTAMAMFD
GALGAGGARSMIARLQGRMAAPEEMAGIVVFLLSDDASMITGTTQIADGG
TIAALW
>MT2344 hypothetical protein
MTTVDFHFDPLCPFAYQTSVWIRDVRAQLGITINWRFFSLEEINLVAGKK
HPWERDWSYGWSLMRIGALLRRTNMSLLDRWYAAIGHELHTLGGKPHDPA
VARRLLCDVGVNAAILDAALDDPTTHDDVRADHQRVVAAGGYGVPTLFLD
GQCLFGPVLVDPPAGPAALNLWSVVTGMAGLPHVYELQRPKSPADVELIA
QQLRPYLDGRDWVSINRGEIVDIDRLAGRS
>MT2447 peptide synthetase
MGPVAVTRADARGAIDDVMALSPLQQGLFSRATLVAAESGSEAAEADPYV
IAMAADAAGPLDIALLRDCAAAMLTRHPNLRASFLHGNLSRPVQVIPSSA
EVLWRHVRAHPSEVGALAAEERRRRFDVGRGPLIRFLLIELPDECWHLVI
VAHHIVIDGWSLPLFVSELLALYRAGGHVAALPAAPRPYRDYIGWLAGRD
QTASRAMWADHLNGLDGPTLLSPALADTPVQPGIPGRTEVRLDREATAEL
ADAARTRGVTISTLVQMAWATTLSAFTGRGDVTFGVTVSGRPSELSGVET
MIGLFINTVPLRVRLDARATVGGQCAVLQRQFAMLRDHSYLGFNEFRAIA
GIGEMFDTLLVYENFPPGEVVGTAEFVANGVTFRPVALESLSHFPVTVAA
HRSTGELTLLVEVLDGALGTMAPESLGRRVLAVLQRLVSRWDRPLRDVDI
LLDGEHDPTAPGLPDVTTSAPAVHTRFAEIAAAQPDSVAVSWADGQLTYR
ELDALADRLATGLRRADVSRETPVAVALSRGPRYVAAMLAVLKAGGMIVP
LDPAMPGERVAEILRQTSAPVVIDEGVFAASVGADILEDDRAITVPVDQA
AYVIFTSGTTGTPKGVIGTHRALSAYADDHIERVLRPAAQRLGRPLRIAH
AWSFTFDAAWQPLVALLDGHAVHIVDDHRQRDAGALVEAIDRFGLDMIDT
TPSMFAQLHNAGLLDRAPLAVLALGGEALGAATWRMIQQNCARTAMTAFN
CYGPTETTVEAVVAAVAEHARPVIGRPTCTTRAYVMDSWLRPVPDGVAGE
LYLAGAQLTRGYLGRPAETAARFVAEPNGRGSRMYRTGDVVRRLPDGGLE
FLGRSDDQVKIRGFRVEPGEIAAVLNGHHAVHGCHVTARGHASGPRLTAY
VAGGPQPPPVAELRAMLLERLPRYLVPHHIVVLDELPLTPHGKIDENALA
AINVTEGPATPPQTPTELVLAEAFADVMETSNVDVTAGFLQMGLDSIVAL
SVVQAARRRGIALRARLMVECDTIRELAAAIDSDAAWQAPANDAGEPIPV
LPNTHWLYEYGDPRRLAQTEVIRLPDRITRERLDAVLAAVVDGHEVLRCR
FDRDAMALVAQPKTDILSEVWVSGELVTAVAEQTLGALASLDPQAGRLLS
AVWLREPDGPGVLVLTAHVLAMDPASWRIVLGELDAGLHALAAGRAPSPA
RENTSYRQWSRLLAQRAKALDSVDFWVAELEGADPPLGARRVAPQTDRVG
ELAITMSISDADLTARLLSTGRSMTDLLATAAARMVTAWRRQRGQQTPAP
LLALETHGRADVHVDKTADTSDTVGLLSAIYPLRIHCDGATDFARIPGSG
IDYGLLRYLRADTAERLRAHREPQLLLNYLGSLHVGVGDLAVDRALLADV
GQLPEPEQPVRHELTVLAALLGPADAPVLATRWRTLPDILSADDVATLQS
LWQGALAEITA
>MT2106 hypothetical protein
MRIAALVAVSLLIAGCSREVGGDVGQSQTIAPPAPAPSAAPSTPPAAGAP
ITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDA
RHTSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGP
FVYGNGPELANGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPF
GCLKPAPPPDGVGVAFGC
>MT2998 thioesterase
MSMLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDYVAFS
REFSADVKRIAVQYPGQHDRSGLPPLESIPTLADEIFAMMKPSARIDDPV
AFFGHSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSD
REMLDLFTRMTGMNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSC
PIYAFIGDKDWIATQDDMDPWRDRTTEEFSIRVFPGDHFYLNDNLPELVS
DIEDKTLQWHDRA
>MT0183 virulence factor mce family protein
MLTRFIRRQLILFAIVSVVAIVVLGWYYLRIPSLVGIGQYTLKADLPASG
GLYPTANVTYRGITIGKVTAVEPTDQGARVTMSIASNYKIPVDASANVHS
VSAVGEQYIDLVSTGAPGKYFSSGQTITKGTVPSEIGPALDNSNRGLAAL
PTEKIGLLLDETAQAVGGLGPALQRLVDSTQAIVGDFKTNIGDVNDIIEN
SGPILDSQVNTGDQIERWARKLNNLAAQTATRDQNVRSILSQAAPTADEV
NAVFSGVRDSLPQTLANLEVVFDMLKRYHAGVEQLLVFLPQGAAIAQTVL
TPTPGAAQLPLAPAINYPPPCLTGFLPASEWRSPADTSPRPLPSGTYCKI
PQDAQLQVRGARNIPCVDVPGKRAATPKECRSKDPYVPLGTNPWFGDPNQ
ILTCPAPGARCDQPVKPGLVIPAPSINTGLNPAPADQVQGTPPPVSDPLQ
RPGSGTVQCNGQQPNPCVYTPTSGPSAVYSPASGELVGPDGVKYAVANSS
TTGDDGWKEMLAPAS
>MT1476 P49 protein
MTTAVVVGAGPNGLAAAIHLARHGVDVQVLEARDTIGGGARSGELTVPGV
IHDHCSAFHPLGVGSPFWAAIDLQRYGLTWKWPDVDCAHPLDDGTAGVLY
RSIEATAAGLGPDGKRWQRAVGDLAAGFDELAEDLLRPVLNMPRHPIRLA
RFGPRAALPATAMARRFHTERARALFGGAAAHVYTRLDRPLTASLGLMIL
ASGHRHGWPVARGGSGSITKALAAALDAYGGTVATGVTVTSRRDIPDADI
VMLDLSPAAVLGIYGDVMPTRINRSYRRYRAGSSAFKVDFAIEGDVGWTN
PDCRRAGTVHLGGTFAEIADTERQRAQGTMVQRPFVLVGQQYLADPSRSV
GNINPIWAYAHVPFGYTGDATAAVIDQIERFAPGFRDRIVATVSTSTTEL
QTYNRNFIGGDIIGGANDRLQVIFRPRVAVDPYAIGVPGVYLCSQSAPPG
AGIHGLCGYHAAESALRWLRKRR
>MT3030 conserved hypothetical protein
MKSLKLARFIARSAAFEVSRRYSERDLKHQFVKQLKSRRVDVVFDVGANS
GQYAAGLRRAAYKGRIVSFEPLSGPFTILESKASTDPLWDCRQHALGDSD
GTVTINIAGNAGQSSSVLPMLKSHQNAFPPANYVGTQEASIHRLDSVAPE
FLGMNGVAFLKVDVQGFEKQVLAGGKSTIDDHCVGMQLELSFLPLYEGGM
LIPEALDLVYSLGFTLTGLLPCFIDANNGRMLQADGIFFREDD
>MT1947 conserved hypothetical protein
MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLVQDEYAK
HFITASADPYLEGLLANPRTSEDGTAFPRLYGVQTRFFDDFFNCADEAGI
RQAVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKA
HRVAVPADLRTDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFAR
IDELCAPGSRVALGALGSRLDHEQLAALETAHPGVNMSGDVNFSALTYDD
KTDPVEWLVEHGWAVDPVRSTLELQVGYGLTPPDVDVKIDSFMRSQYITA
VRA
>MT0224 substrate--CoA ligase, putative
MPRGELYKRFRLVMGGIAPCGSGRRAATYPRRMQIRPYIGADKPAVILYP
SGTVISFDELEARANRLAHWFRQAGLREDDVVAILMENNEHVHAVMWAAR
RSGLYYVPINTHLTASEAAYIVDNSGAKAIVGSAALRETCHGLAEHLPGG
LPDLLMLAGGGLVGWMTYPECVADQPDTPIEDEREGDLLQYSSGTTGRPK
GIKRELPHVSPDAAPGMMPALLDFWMDADSVYLSPAPMYHTAPSVWTMSA
LAAGVTTVVMEKFDAEGALDAIQRYRVTHAQFVPAMFVRMLKLPEAVRNS
YDMSSLRRVIHAAAPCPVQIKEQMIHWWGPIIDEYYASSEASGSTLITAE
DWLTHPGSVGKPIQGGVHIVGADGSELPPNQPGEIYFEGGYPFEYLNDPA
KTAASRNKHGWVTVGDVGYLDDDGYLFLTGRRHHMIISGGVNIYPQEAEN
LLVAHPKVLDAAVFGVPDDEMGQRVMAAVQTVDSADANDQFAGELLAWLR
DRLSHFKCPRSIAFEPQLPRTDTGKLYKSGLVEKYSV
>MT3145 P450 heme-thiolate protein
MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKKLAEPPP
GSGLKPVVGDAGLPILGHMIEMLRGGPDYLMFLYKTKGPVVFGDSAVLPG
VAALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRR
IMQEAFVRSRLAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDI
ASMVFMGHEPGTDHELVTKVNKAFTITTRAGNAVIRTSVPPFTWWRGLRA
RELLENYFTARVKERREASGNDLLTVLCQTEDDDGNRFSDADIVNHMIFL
MMAAHDTSTSTATTMAYQLAAHPEWQQRCRDESDRHGDGPLDIESLEQLE
SLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIAYPGMNHRL
PEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFDQLEIK
TILHRLLRRYRLELSRPDYQPRWDYSAMPIPMDGMPIVLRPR
>MT1914 oxidoreductase, short-chain dehydrogenase/reductase family
MPGRTSIGVKIRDKVQDKVIAITGGARGIGLATAAALHNLGAKVAIGDID
EAMAKESGADLDLDMYGKLDVTDPDSFSGFLDAVERQLGPIDVLVNNAGI
MPVGRIVDEPDPVTRRILDINVYGVILGSKLAAQRMVPRGRGHVINVASL
AGEIYAVGVATYCASKHAVVAFTDSARLEYRSAGVKFSMVLPSFVNTELI
AGTGGIKGFKNAEPADIADAIVGLIVHPKPRVRVTKAAGSMIVAQRFMPR
QVSEGLNRLLGGEHVFTDDVDMEKRRTYEARARGEE
>MT0789 oxidoreductase, short-chain dehydrogenase/reductase family
MPRFEPHPARRTTVVAGASSGIGAATATELAGRGFPVALGARRMDKLAEL
VDKIRADGGEAVAFPLDVTDPESVKSFVAQTVEALGEVELLVSSAGDMLP
GQLHEVSTEAFAEQVQIHLVGANRLATAVLPAMVARRRGDLIFVGSDVGL
RQRPHMGAYGAAKAGLAAMVTNLQMELEGTGVRASIVHPGPTLTGMGWQL
SAEQVGPMLADWAKWGQARHNYFLRPSDLARAIAFVAETPRGCVVVNMEI
QPEAPLRDAPAHRQKLVLGEEGMPG
>MT1827 P450 heme-thiolate protein
MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAESVQDPY
PLYERMRAAGSVHRIANSDFYAVCGWDAVNEAIGRPEDFSSNLTATMTYT
AEGTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQF
TVQAADRLWVDGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKW
GYAATQLLEGLVENDQLVAAGVALMELSGYIFEQFDRAAADPRDNLLGEL
ATACASGELDTLTAQVMMVTLFAAGGESTAALLGSAVWILATRPDIQQQV
RANPELLGAFIEETLRYEPPFRGHYRHVRNATTLDGTELPADSHLLLLWG
AANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALARLEARIVLR
LLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ
>MT2450 polyketide synthase
MPMSDNDPVVIVGLAIEAPGGVETADDYWTLLSEQREGLGPFPTDRGWAL
RELFDGSRRNGFKPIHNLGGFLSSATTFDPEFFRISPREATAMDPQQRVG
LRVAWRTLENSGINPDDLAGHDVGCYVGASALEYGPALTEFSHHSGHLIT
GTSLGVISGRIAYTLDLAGPALTVDTSCSSALAAFHTAVQAIRAGDCDLA
LAGGVCVMGTPGYFVEFSKQHALSDDGHCRPYSAHASGTAWAEGAAMFLL
QRRSRATADRRRVLAEVRASCLNSDGLSDGLTAPSGDAQTRLLRRAIAQA
AVVPADVGMVEGHGTATRLGDRTELRSLAASYGTAPAGRGPLLGSVKSNI
GHAQAAAGGLGLVKVILAAQHAAIPPTLHVDEPSREIDWEKQGLRLADKL
TPWRAVDGWRTAAVSAFGMSGTNSHVIVSMPDTVSAPERGPECGEV
>MT1472 conserved hypothetical protein
MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRA
SITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDILLSGLVRAHR
LGHARFLEVAMQYVSLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEH
EHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWV
DSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSP
APTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSW
LRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAA
FRVQMALEVCRWMAPAVLRAKQ
>MT0455 oxidoreductase, short-chain dehydrogenase/reductase family
MQEAPHRVMPLVRPCPRPSDPGEYVRPMTANDNKTRKWSAADVPDQSGRV
VVVTGANTGIGYHTAAVFADRGAHVVLAVRNLEKGNAARARIMAARPGAH
VTLQQLDLCSLDSVRAAADALRTAYPRIDVLINNAGVMWTPKQVTKDGFE
LQFGTNHLGHFALTGLVLDHMLPVPGSRVVTVSSQGHRIHAAIHFDDLQW
ERRYNRVAAYGQAKLANLLFTYELQRRLGEAGKSTIAVAAHPGGSNTELT
RNLPRLIRPVATVLGPLLFQSPEMGALPTLRAATDPTTQGGQYYGPDGFG
EQRGHPKVVQSSAQSHDKDLQRRLWTVSEELTGVSFGV
>MT3603 virulence factor mce family protein
MSGGGSRRTSVRVAAALLAGLMVGSAVLTYLSYTAAFTSTDTVTVSSPRA
GLVMEKGAKVKYRGIQVGKVTDISYSGNQARLKLAIDSGEMGFIPSNATV
RIAGNTIFGAKSVEFIPPKTPSPKPLSPNAHVAASQVQLEVNTLFQSLID
LLHKIDPLETNATLSALSEGLRGHGDDLGALLSGLNTLTRQANPKLPALQ
EDFRKAAVVANVYADAAGDLNTVFDNLPTINKTIVDQKDNLNDTLLATIG
LSNNAYETLAPAEQNFIDAINRLRAPLKVTSDYSPVFGCLFKGIARGVKE
FAPLIGVRKAGLFTSSSFVLGAPSYTYPESLPIVNASGGPNCRGLPDIPT
KQTGGSFYRAPFLVTDNALIPYQPFTELQVDAPSTLQFLFNGAFAERDDF
>MT1080 oxidoreductase, short-chain dehydrogenase/reductase family
MMARQRFRDQVVLITGASSGIGEATAKAFAREGAVVALAARREGALRRVA
REIEAAGGRAMVAPLDVSSSESVRAMVADVVGEFGRIDVVFNNAGVSLVG
PVDAETFLDDTREMLEIDYLGTVRVVREVLPIMKQQRSGRIMNMSSVVGR
KAFARFAGYSSAMHAIAGFSDALRQELRGSGIAVSVIHPALTQTPLLANV
DPADMPPPFRSLTPIPVHWVAAAVLDGVARRRARVVVPFQPRLLMVGDAF
SPRYGDRVVRLLESKIFGRLIGSYRGSVYRHQPTESAKAQAAQPERGYSS
AR
>MT3445 methyltransferase, putative
MSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLDLGAGTGKLTTRLV
ERGLDVVAVDPIPEMLDVLRAALPQTVALLGTAEEIPLDDNSVDAVLVAQ
AWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGEIIGRDGD
PVRDRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCITSPAQVRT
KTLDRVRQLLATHPALANSNGLALPYVTVCVRATLA
>MT0707 hypothetical protein
MVEKPLRADRATHSRLATFALALAAAALPLAGCSSTANPPAATTTPATAT
TTTATSGPTAAPTVTTGESTTASIQIGDMLTYGSIGTTATLDCADGKSLN
VAGSDNTLTVNGTCETVTVGGANNKIAFDRIDERLVVVGLDNTVTYKNGD
PTIDNLGAGNRINKE
>MT3895 conserved hypothetical protein
MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEPLVRAVG
VEFLTRWATGELDAADVDDPDAAWGLQRMTTELVVRTRYFDQFFLDAAAA
GVRQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQP
TADLRMVPADLRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLL
DHVTDLSAPGSRLALEAFLGSADRDSARVEEMIRTATRGWREHGFHLDIW
ALNYAGPRHEVSGYLDNHGWRSVGTTTAQLLAAHDLPAAPALPAGLADRP
NYWTCVLG
>MT0874 oxidoreductase, short-chain dehydrogenase/reductase family
MRAVDGFPGRGAVITGGASGIGLATGTEFARRGARVVLGDVDKPGLRQAV
NHLRAEGFDVHSVMCDVRHREEVTHLADEAFRLLGHVDVVFSNAGIVVGG
PIVEMTHDDWRWVIDVDLWGSIHTVEAFLPRLLEQGTGGHVVFTASFAGL
VPNAGLGAYGVAKYGVVGLAETLAREVTADGIGVSVLCPMVVETNLVANS
ERIRGAACAQSSTTGSPGPLPLQDDNLGVDDIAQLTADAILANRLYVLPH
AASRASIRRRFERIDRTFDEQAAEGWRH
>MT1704 polyketide synthase
MQPTGIAIIGLACRFPTVVSPGDLWDLLRDGREAAGSIDNVADFDADFFN
LSPREASAMDPRQRLALELTWELLEDAFVVPETLRGQPIAVYLGAMNDDY
AVLTLAADRVDHHAFAGTSRAIIANRVSFAFGLRGPSVTIDSGQSSSLVA
VHLACESVRTGEAPLAIAGGVHLNLARETAMLEQEFGAVSPSGHTYAFDE
RADGYVPGDGGGLVLLKPVQAALDDGDRIHAIIRGSAVGNAGHSATGLTV
PSVAGQVDVIRRAMSGAGVDCHQVHYVEAHGTGTKIGDPIEARALGEIFA
ARQRRPVSVGSVKTNIGHTGGAAGIAGLLKAVLAIENAVIPPSLNYVGAA
IDLDSLGLRVDTALTPWPVADEPRRAGVSSFGMGGTNAHVILEQGPTQSP
EIVESVAAAGSNAPVAVPWVLAARSPQALTNQAGRLLAHLTADDGLTALD
VGWSLVSTRSVFDHRAVVVGADRGRLMAGLAGLAAGEPGAGVVVGRARSV
GKTVFVFPGQGSQWLGMGRQLYGRYSVFARAFDEVVAVLDGQLRLSVRQV
MWGADAGLLESTEFAQPALFVVQVALAALLQDWGVLPDLVMGHSVGEIAA
AYVAGALSLVDAARVVAARGRLMQALPAGGVMVAVAASEDEVAPLLTEGV
CIAAVNAPESVVISGEQAAVGVVVDRLVGLGRRVRRLAVSHAFHSVLMDP
MVEEFSKVLADVCVRAPRIGLVSNVTGQLAGAGYGSPAYWVEHVRKPVRF
FDGVGLAESLGARVFVEVGPGAGLEASVALLARDRPEVESVLAGVGRLFA
EGVAVDWSSVFAGLGGRRVELPTYGFARQRFWLGDNGELSVDQTGKDAGA
IARLQSLAPPELQRQLVELVCFHAAIVLGRKSSHDIDPECAFQDLGFDSM
SGVELRNRLQMAIGLPGLSLPRTLIFDYPTASALAECLGQLLGGQHESSD
DESIWQLLKNIPIHQLRRTGLLDKLLLLAGQPEESLAGRTVSDEVIDSLS
PEALIGLALDEDENDIR
>MT0106 dioxygenase, putative
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPR
EFIKLGRIIGQIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDY
MFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVST
HDPRRHIKIRPSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYI
CATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFIHTQHYQVGDI
ILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
>MT0623 virulence factor mce family protein
MRCGVSAGSANGKPNRWTLRCGVSAGHRGSVFLLAVLLAPVVLTSCTWRG
IANVPLPVGRGMGPDRMTIYVQMPDTLALNTNSRVRVADVWVGTVRDISL
RNWIATLTLELEPTVRLPANATAKIGQTSLLGTQHVELAAPPIPSPQPLK
SGDTIGLKNSSAYPTVERTLASVALILTGGGIVNLDVIQTEILNILDGHA
GQIREFLERLATFTAELNNQRGDLTRAIDSTNQLLTIIANRNDTLDRVLT
DVPPLIEHFADTGQLFADATESLGRFSEVANRALAATRPNLHQTLQSLQR
PLRQLERASPYVVGALKLGLTAPFNIDEVPNVIRGDYVNVSATFDVTLSA
LDNALLSGTGISGMLRALEQAWGRDPDTMIPDVRYTPNPNDAPGGPLVER
AE
>MT1244 very-long-chain acyl-CoA synthetase, putative
MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPNSKASIG
TVFQDRAARYGDRVFLKFGDQQLTYRDANATANRYAAVLAARGVGPGDVV
GIMLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIA
ESDLVSAVAECGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKD
TAFYIFTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKGSDTLYSCLPL
YHNNALTVAVSSVINSGATLALGKSFSASRFWDEVIANRATAFVYIGEIC
RYLLNQPAKPTDRAHQVRVICGNGLRPEIWDEFTTRFGVARVCEFYAASE
GNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRVRRVPDGEP
GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMG
HAAFVDRLGDTFRWKGENVATTQVEAALASDQTVEECTVYGVQIPRTGGR
AGMAAITLRAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKS
RKVELRNQAYGADIEDPLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG
>MT3005 polyketide synthase
MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQELRDAGV
SDKTLADPAYVRRAPLLDGIDEFDAGFFGFPPLAAQVLDPQHRLFLQCAW
HALEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFD
QFSLFLQNDKDFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLS
GECDMALAGGSSLCIPHRVGYFTSPGSMVSAVGHCRPFDVRADGTVFGSG
VGLVVLKPLAAAIDAGDRIHAVIRGSAINNDGSAKMGYAAPNPAAQADVI
AEAHAVSGIDSSTVSYVECHGTGTPLGDPIEIQGLRAAFEVSQTSRSAPC
VLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSPNPELRLDQ
SPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHA
EPAGPQVILLSAQTAAALGESRTALAAALETQDGPRLSDVAYTLARRRKH
NVTMAAVVHDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRV
VFLFPGQGAQHVGMAKGLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFD
GTATDLERIDRSQPALFTVEYALAKLVDTFGVRAGAYIGYSTGEYIAATL
AGVFDLQTAIKTVSLRARLMHESPPGAMVAVALGPDDVTQYLPPEVELSA
VNDPGNCVVAGPKDQIRALRQRLTEAGIPVRRVRATHAFHTSAMDPMLGQ
FQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFADE
LDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDD
RDTFLRALGELWSAGVEVDWTPRRPAVPHLVSLPGYPFARQRHWVEPNHT
VWAQAPGANNGSPAGTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVS
SVDRNANFFDLGGDSLMAISIAMAAANEGLTITPQDLYEYPTLASLTAAV
DASFASSGLAKPPEAQANPAVPPNVTYFLDRGLRDTGRCRVPLILRLDPK
IGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAPAEFTGLSNRSVP
NGVAAGSPEERAAVLGILAELLEDQTDPNAPLAAVHIAAAHGGPHYLCLA
IHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAAL
ATHPAALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLS
VEQTSELDDGRRRFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSV
LRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIG
YGLLRYVYAPTGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLP
VREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAEALERTFPLAL
SALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG
>MT2108 polyketide synthase
MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGV
DSPEGLWQMVADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFV
DGVADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSA
TGVFAGLIVGGYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVD
TACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPTVFVEFSRHRGLAP
DGRCKPYAGRADGVGWSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQD
GASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEA
QALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHEL
LPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTN
AHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDD
GLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRG
TATAAGKTVFVFPGQGSQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDW
SLVDVLRGAPGAPGLDRVDVVQPVLFAVMVSLAELWKSVAVHPDAVIGHS
QGEIAAAYVAGALSLRDAARVVTLRSKLLAGLAGPGGMVSIACGADQARD
LLAPFGDRVSIAVVNGPSAVVVSGEVGALEELIAVCSTKELRTRRIEVDY
ASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADYWY
RNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSE
AIVVPTLGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAF
DKRRFWLSAEGSGADVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPN
VQPWLADHAVSDVVLFPGTGFVELAIRAGDEVGCSVLDELTLAAPLLLPA
TGSVAVQVVVDAGRDSNSRGVSIFSRADAQAGWLLHAEGILRPGSVEPGA
DLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTAMWARGEEIFAEV
RLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFAWQGVSLHATG
ASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSG
SGPDRLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQAL
AAVQSWLTDHESGVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHP
GRIVLVDSDAATDDAAIAMALATGEPQVVLRGGQVYTARVRGSRAADAIL
VPPGDGPWRLGLGSAGTFENLRLEPVPNADAPLGPGQVRVAMRAIAANFR
DIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSVFGFFPDGSGTL
VAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQRVLIHA
GTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEF
EDKFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPG
VIAQQYPGVRYRAFDLFEPGRPRMHQYMLELATLFGDGVLRPLPVTTFDV
RRAPAALRYLSQARHTGKVVMLMPGSWAAGTVLITGGTGMAGSAVARHVV
ARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVACDAADRAALAK
VIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHE
LTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAI
SLGWGLWDQASAMTGGLDAADLARLGREGVLALSTAEALELFDTAMIVDE
PFLAPARIDLTALRAHAVAVPPMFSDLASAPTRRQVDDSVAAAKSKSALA
HRLHGLPEAEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLT
AVEMRNRLKSATGLSLSPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPA
VRTTSEDPIAIVGMACRYPGGVNSPDDMWDMLIQGRDVLSEFPADRGWDL
AGLYNPDPDAAGACYTRTGGFVDGVGDFDPAFFGVGPSEALAMDPQQRML
LELSWEALERAGIDPTGLRGSATGVFAGVMTQGYGMFAAEPVEGFRLTGQ
LSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLALA
GGVTVNATPTVFVEFSRHRGLAPDGRCKPYAGRADGVGWSEGGGMLVLQR
LSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGL
SAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNM
GHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAP
RVWPAGARTRRAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVV
SAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGD
RDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMGMGLH
AGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNSTEFAQPALFAV
EVALFRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRL
MQALPAGGAMVAVQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAA
VADQLRADGRRVHQLAVSHAFHSPLMDPMIDEFAAVAAGIAIGRPTIGVI
SNVTGQLAGDDFGSAAYWRRHIRQAVRFADSVRFAQAAGGSRFLEVGPSG
GLVASIEESLPDVAVTTMSALRKDRPEPATLTNAVAQGFVTGMDLDWRAV
VGEAQFVELPTYAFQRRRFWLSGDGVAADAAGLGLAASEHALLGAVIDLP
ASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGV
VDELTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLH
AEGALRAGSAEPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGL
TAMWRRGDEVFAEVALPADAGVSVTGFGVHPVLLDAALHAVVLSAESAER
GQGSVLVPFSWQGVSLHAAGASAVRARIAPVGPSAVSIELADGLGLPVLS
VASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQPSAAVEPLPVCAWGTT
EDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAGVLVVMTRG
AVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVV
TTGEPQVLWRRGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENL
RLELIPDADAPLGPGQVRVAVSAIAANFRDVMIALGLYPDPDAVMGVEAC
GVVIETSLNKGSFAVGDRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAA
ATTSVVFATAHYALVDLAAARSGQRVLIHAGTGGVGMAAVQLARHLGLEV
FATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSL
AGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEAG
PDRIAQILAELATLFGDGVLRPLPVTTFDVRCAPAALRYLSQARHTGKVV
MLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPG
AAELVAELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGA
LDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLV
GSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQASAMTGGLATV
DFKRFARDGIVAMSSADALQLFDTAMIVDEPFMLPAHIDFAALKVKFDGG
TLPPMFVDLINAPTRRQVDDSLAAAKSKSALLQRLEGLPEDEQHAVLLDL
VRSHIATVLGSASPEAIDPDRAFQELGFDSLTAVEMRNRLKSATGLALSP
TLIFDYPNSAALAGYMRRELLGSSPQDTSAVAAGEAELQRIVASIPVKRL
RQAGVLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDD
E
>MT0621 virulence factor mce family protein
MHTAMRTLTEFNRGRVGMMGAVVTVLVVGVAQSFTSVPMLFATPTYYAQF
ADTGGINTGDKVEIAGVNVGLVRSLAIRGNRVLIGFSLPGKTIGMQSRAA
IRTDTILGRKNLEIEPRGSEPLKPNGFLPLAQTTTPYQIYDAFVDVTKAA
TGWDIDAVKRSLNVLSETFDQTAPHLSAALEGVKAFSDTVGRRGEQIEQL
LANANRIARVLGDRSEQVNGLLVNAKTLLAAFKQRSQALRILLTNVSEAS
AQVSGLITDNPNLNHVLAQLRTVSEELVKRKNELADVAVLLGRYTAALTE
AVGSGPFFKAMVVNLLPYQILQPWVDAAFKKRGIDPENFWRSAGLPEFRW
PDPNGTRFPNGAAGGATGAGGYTQASGTGRPAGNAVLLHTGGGRVATARH
PTTLRGRHRWPVRWTRLPGTARCPAVAALIPMGRRRRRAS
>MT1565 hypothetical protein
MAQARRRDAEPQGARGCVALPAPTRLSSLTMSTNPGPAEGANQVMAQEHS
AGAVQFTAHNVRLDDGTLTIPESSRTLDESSWFISARGILETVFPGDKSH
LRLADVGCLEGGYAVGFARMGFQVLGIEVRELNMAACNYIKSKTNLPNLR
FVHDNALNIANHGLFDTVFCCGLFYHLENPKQYLETLSSVTNKLLILQTH
FSIINRSDKWLRLPTTARQLTDRLLRRPAPVKFMLSAPTEHEGLPGRWFT
EFSDDRSFGQRDTAKWASWDNRRSFWIQREHLLQAIKDVGVDLVMEEYDN
LEPSIAESLLGGSYAANLRGTFIGIKTR
>MT1088 medium-chain-fatty-acid--CoA ligase, putative
MYGTMQDFPLTITAIMRHGCGVHGRRTVTTATGEGYRHSSYRDVGQRAGQ
LANALRRLGVTGDQRVATFMWNNTEHLVTYFAVPSMGAVLHTLNIRLFPE
QIAYVTNEAEDRVILVDLSLARLLAPVLPKLDTVHTVIAVGEGDTTPLRE
AGKTVLRFAELIDAESPDFGWPQIDENSAAAMCYTSGTTGNPKGVVYSHR
SSFLHTMAACTTNGIGVGSSDKVLPIVPMFHANGWGLPYAALMAGADLVL
PDRHLDARSLIHMVETLKPTLAGAVPTIWNDVMHYLEKDPDHDMSSLRLV
ACGGSAVPESLMRTFEDKHDVQIRQLWGMTETSPLATMAWPPPGTPDDQH
WAFRITQGQPVCGVETRIVDDDGQVLPNDGNAVGEVEVRGPWIAGSYYGG
RDESKFDSGWLRTGDVGRIDEQGFITLTDRAKDVIKSGGEWISSVELENC
LIAHPDVLEAAVVGVPDERWQERPLAVVVVREGATVSAGDLRAFLADKVV
RWWLPERWAFVDEIPRTSVGKYDKKAIRSRYAEGAYQITEVHT
>MT2981 hypothetical protein
MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGVVATVAA
GALDETGCPEVVDAAGKWVVPGFIDVHTHYDAEVLLDPGLRESVRHGVTT
VLLGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYI
EAIDALPLGPNVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLL
DEALEAGMLGMSGMDAAIDKLDGDRFRSRALPSTFATWRERRKLISVLRH
RGRILQSAPDVDNPVSALLFFLASSRIFNRRKGVRMSMLVSADAKSMPLA
VHVFGLGTRVLNKLLGSQVRFQHLPVPFELYSDGIDLPVFEEFGAGTAAL
HLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDAVIVECPDK
SLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNK
LAAEPSVHMGFSDAGAHLRNMAFYNFGLRLLKRARDADRAGQPFLSIERA
VYRLTGELAEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYY
GGLRRMVNRNDATVVATGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRA
GELGAALSRSA
>MT3908 acyl-CoA synthase, putative
MAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKLAYRFLDFSTERDGV
ARDILWSDFSARNRAVGARLQQVTQPGDRVAIXCPQNLDYLISFFGALYS
GRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVRKFIRARSA
KERPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRIPSGVQITH
LNLPTNVVQVLNALEGQEGDRGVSWLPFFHDMGLITVLLASVLGHSFTFM
TPAAFVRRPGRWIRELARKPGETGGTFSAAPNFAFEHAAVRGVPRDDEPP
LDLSNVKGILNGSEPVSPASMRKFFEAFAPYGLKQTAVKPSYGLAEATLF
VSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPNAVAQVSAGKVGVSEWA
VIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFKNILKSRIS
ESRAEGAPDDALWVRTGDYGTYFKDHLYIAGRIKDLVIIDGRNHYPQDLE
CTAQESTKALRVGYAAAFSVPANQLPQTVFDDSHAGLKFDPEDTSEQLVI
VGERAAGTHKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGTIPRTSSGK
IGRRACRAAYLDGSLRSGVGSPTVFATSD
>MT0921 oxidoreductase, putative
MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGGAAVSIQ
AFDGVEVALSRYSYLVSLLPSRIVADLGAPVRLARRPFSSYTPAPATAGR
SGLLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPL
RTREQARRDIVEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATD
ALIGTFARMHEPSLMQNICFLYHLVGGGTGVWHVPIGGMGSVTSALATAA
ARHGAEIVTGADVFALDPDGTVRYHSDGSDGAEHLVRGRFVLVGVTPAVL
ASLLGEPVAALAPGAQVKVNMVVRRLPRLRDDSVTPQQAFAGTFHVNETW
SQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAGAQTLTVFG
LHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIET
TTTLDLQRTLGMTGGNIFHGALSWPFADNDDPLDTPARQWGVATDHERIM
LCGSGARRGGAVSGIGGHNAAMAVLACLASRRKSP
>MT3011 acyl-CoA synthase
MSVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQLYRRTLN
VAQELSRCGSTGDRVVISAPQGLEYVVAFLGALQAGRIAVPLSVPQGGVT
DERSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLD
APNGYTFKEDEYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYF
ADTDGIPPPNSALVSWLPFYHDMGLVIGICAPILGGYPAVLTSPVSFLQR
PARWMHLMASDFHAFSAAPNFAFELAARRTTDDDMAGRDLGNILTILSGS
ERVQAATIKRFADRFARFNLQERVIRPSYGLAEATVYVATSKPGQPPETV
DFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTCIECPDGTV
GEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFV
TDGKMFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAISVPGDRST
EKLVAIIELKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAP
GSIPITTSGKVRRGACVEQYRQDQFARLDA
>MT3652 oxidoreductase, short-chain dehydrogenase/reductase family
MPTSDEQRSNGVMGLVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDI
GVGLDGSPASGGSAAQDVVDEILAAGGQAVADGSDISDWDQAANLIQAAV
ETYGGVDVLVNNAGIVRDRMIANTSEEEFDAVIAVHLKGHFATMRHAASH
WRGLSKAGKAPKDIDARIINTSSGAGLQGSVGQGNYSAAKAGIAALTLVG
AAEMRRYGVTVNAIAPAARTRMTETVFAEMMAKPQEGFDAMAPENVSPLV
VWLGSAESRDVTGKVFEVEGGIIRVAEGWAHGPQVDKGVKWDPAELGPVV
SDLLAKSRPPVPVYGA
>MT1187 O-methyltransferase, putative
MSAHKPAKQRVALTGVSETALLTLNARAAEARRRDAIIDDPMAVALVESI
DFDFAKFGPTGQGFALRARAFDMAAQHYLDQHPAATVVALAEGLQTSFWR
LDVAIPGGQFRWLTVDLPPIVDLRTRLLPSSPRVSVCAQSALDYSWMDSV
DPAGGVFITAEGLLMYLQPEQALGLIAQCAQTFPGGQMLFDLPPRWFAGW
SRLGLRTSLRYKVPRMPFSMSVAQAADLVNKVPGVVAVRDLRVPPGRGLW
VNMALSTVYRLPVFDPLRPCLTLLEFSRPARG
>MT1041 polyketide synthase, putative
MFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAGGLAAAGVGLGDVV
GVLAGFPVEIAPTAQALWMRGASLTMLHQPTPRTDLAVWAEDTMTVIGMI
EAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVGEDDLALM
QLTSGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSWLPCFHDM
GMVGFLTIPMFFGAELVKVTPMDFLRDTLLWAKLIDKYQGTMTAAPNFAY
ALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDLLDAGKPFGLRP
SAILPAYGMAETTLAVSFSECNAGLVVDEVDADLLAALRRAVPATKGNTR
RLATLGPLLQDLEARIIDEQGDVMPARGVGVIELRGESLTPGYLTMGGFI
PAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPTDIERAAG
RVDGVRPGCAVAVRLDAGHSRESFAVAVESNAFEDPAEVRRIEHQVAHEV
VAEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT
>MT0684 ABC transporter, ATP-binding protein
MGVSIEVNGLTKSFGSSRIWEDVTLTIPAGEVSVLLGPSGTGKSVFLKSL
IGLLRPERGSIIIDGTDIIECSAKELYEIRTLFGVLFQDGALFGSMNLYD
NTAFPLREHTKKKESEIRDIVMEKLALVGLGGDEKKFPGEISGGMRKRAG
LARALVLDPQIILCDEPDSGLDPVRTAYLSQLIMDINAQIDATILIVTHN
INIARTVPDNMGMLFRKHLVMFGPREVLLTSDEPVVRQFLNGRRIGPIGM
SEEKDEATMAEEQALLDAGHHAGGVEEIEGVPPQISATPGMPERKAVARR
QARVREMLHTLPKKAQAAILDDLEGTHKYAVHEIGQ
>MT1583 hypothetical protein
MSDPLTAQEQHKRRQAVRELMPRTPFIGGLGIVFERYEPDDVVIRLPFRT
DLTNDGTYFHGGVIASVMDTAGAAAAWSNHDFDRGTRAATVAMSIQYTGA
AKRCDLLCHARTARRRKELTFTEITATDPDGNIVAHAVQTYRIV
>MT1572 acyl-CoA synthase
MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWSQLYRRT
LNLAAQLREHGSTGDRALILAPQSLDYVVSFIASLQAGIVAVPLSIPQGG
AHDERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLD
LDARPSSGSRSAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIM
TSYYGVYGKVAPPGSTVVSWLPFYHDMGFVLGLILPILAGIPAVLTSPIG
FLQRPARWIQMLASNTLAFTAAPNFAFDLASRKTKDEDMEGLDLGGVHGI
LNGSERVQPVTLKRFIDRFAPFNLDPKAIRPSYGMAEATVYVATRKAGQP
PKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVDPDTGIERP
AGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGD
SGFLSEGELFIMGRIKDLLIVYGRNHSPDDIEATIQTISPGRCAAIAVSE
HGAEKLVAIIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVL
VSPGSIPITTSGKIRRAQCVELYRQDEFTRLDA
>MT3000 polyketide synthase
MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSRDAVVLS
GELSELLGRTVSPIDFWEHPTINALAAYLAAPEPSPDSDAAVKRGARNSL
DEPIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPP
EVAAALARTTRWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWE
ALEHAGIPPGTLRRSATGVFAGACLSEYGAMASADLSQVDGWSNSGGAMS
IIANRLSYFLDLRGPSVAVDTACSSSLVAIHLACQSLRTQDCHLAIAAGV
NLLLSPAVFRGFDQVGALSPTGQCRAFDATADGFVRGEGAGVVVLKRLTD
AQRDGDRVLAVICGSAVNQDGRSNGLMAPNPAAQMAVLRAAYTNAGMQPS
EVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTE
AAAGIAGFIKTVLAVQHGQIPPNQHFETANPHIPFTDLRMKVVDTQTEWP
ATGHPRRAGVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGK
TMQRVSATAGMLADWMEGPGADVALADVAHTLNHHRSRQPKFGTVVARDR
TQAIAGLRALAAGQHAPGVVNPAEGSPGPGTVFVYSGRGSQWAGMGRQLL
ADEPAFAAAVAELEPVFVEQAGFSLHDVLANGEELVGIEQIQLGLIGMQL
ALTELWCSYGVRPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRSRLMA
PLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQIDE
LIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGII
STTYADLHTQPVFDAEHWATNMRNPVHFQQAIASAGSGADGAYHTFIEIS
AHPLLTQAIIDTLHSAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHP
PHTPHPPEPHPPIPTTPWQHTRHWITTKYPAGSVGSAPRAGTLLGQHTTV
ATVSASPPSHLWQARLAPDAKPYQGGHRFHQVEVVPASVVLHTILSAATE
LGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSPAAGTPSDRWTRH
VTAQLSSSPSDSASSLNEHHRANGQPPERAHRDLIPDLAELLAMRGIDGL
PFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADS
RLYVPASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPS
LSMRSLRYRALDFGLDVGRAQPPASTGPVEAYCDATNFVHTIDWQPQTVP
DATHPGAEQVTHPGPVAIIGDDSAALCETLEGAGYQPAVMSDGVSQARYV
VYVADSDPAGADETDVDFAVRICTEITGLVRTLAERDADKPAALWILTRG
VHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDLAINDDLGEFGPALA
ELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYLITGGLG
ALGLLMADWLADRGAHRLVLTGRTPLPPRRDWQLDTLDTELRRRIDAIRA
LEMRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQL
VTSMTGDAVRQVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQ
GSYAAANSYLDALARARRQQGCHTMSLDWVAWRGLGLAADAQLVSEELAR
MGSRDITPSEAFTAWEFVDGYDVAQAVVVPMPAPAGADGSGANAYLLPAR
NWSVMAATEVRSELEQGLRRIIAAELRVPEKELDTDRPFAELGLNSLMAM
AIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHDVSQDNQISAL
SSSAGSVLDSLFDRIESAPPEAERSV
>MT0144 P450 heme-thiolate protein
MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVRRYGKAF
TANILMYGRVVVVADPQLARQVFTSSPEELGNIQPNLSRMFGSGSVFALD
GDDHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSM
MHITLNAILRAIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRL
SPWGRLAEWRRQYDTVIDKLIEAERADPNFADRTDVLALMLRSTYDDGSI
MSRKDIGDELLTLLAAGHETTAATLGWAFERLSRHPDVLAALVEEVDNGG
HELRQAAILEVQRARTVIDFAARRVNPPVYQLGEWVIPRGYSIIINIAQI
HGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAAFANMEMDV
VLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR
>MT1895 conserved hypothetical protein
MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLLQLTGVV
HGGVYCAMIESIASMAAFAWLNSHGEGGSVVGVNNNTDFVRSISSGMVYG
TAEPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP
>MT1595 oxidoreductase, short-chain dehydrogenase/reductase family
MQGRNLNDAVKGKVVLITGGSSGIGAAAAKKIAEAGGTVVLVARTLENLE
NVANDIRAIRGNGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNA
GRSIRRSLELSYDRIHDYQRTMQLNYLGAVQLILKFIPGMRERHFGHIVN
VSSVGVQTRAPRFGAYIASKAALDSLCDALQAETVHDNVRFTTVHMALVR
TPMISPTTIYDKFPTLTPDQAAGVITDAIVHRPRRASSPFGQFAAVADAV
NPAVMDRVRNRAFNMFGDSSAAKGSESQTDTSELDKRSETFVRATRGIHW
>MT2863 EntD-related protein
MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARSVAKRRN
EFITVRHCARIALDQLGVPPAPILKGDKGEPCWPDGVVGSLTHCAGYRGA
VVGRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDR
ILFCAKEATYKAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGS
TLSGPPLTTLRGRWSVERGLVLTAIVL
>MT0586 conserved hypothetical protein
MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQ
PELAALIVQGKFRGDVLDVGCGEAAISLALAERGHTTVGLDLSPAAVELA
RHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYL
QSIVRAAAPGASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIK
PARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHLG
>MT1223 hypothetical protein
MRIAGVGLGQLLLALDATVVSLVDAPRGLDLPVASTALIDSDDVRLGLAA
AAGSADVFFLIGVTDDEAVRWVDDQARQRAPVAIFVKHPSDSVVAGAVRA
GSAVVAVEPRARWERLYHLVNHVLEHHGDRADPTDDSGTDLFGLAQSLAD
RIHGMISIEDAQSHVLAYSASNDEADELRRLSILGRAGPPEHLQWIGQWG
IFDALRAGREVVRVAERPELGLRPRLAIGIHQPGVGALRPPVFAGTIWVQ
QGSQPLADDAEEMLRGAAVLAARIMSRLATQPNTHALRVQQLLGLAELNA
TTAPVDVSTIARELGVAAEGNATLIGFDTAENRDTAVRHVRLVDVMALSA
SAFRHDAQVAANGSRIYVLLPQTTTGRAVTSWVRGTISALRAELGVALRA
AIAGPVAGLAEVNPARVEVDRVLESAERHPILGQVTSLAEARTTVLLDEI
VTLVGTDQRLVDPRIRDLGAQDPVLAQTLRAYLDAFGDIGAAARSLQVHP
NTVRYRIRRIEQLLSTSLGDPDVRLLFSLGLRAMERTA
>MT1550 mmcH protein, putative
MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMYRVQERI
LTEIGKERLARAGELGVLRLMMKYDPHFFTFLEIPEVLSIVDRVLSETAI
LHLQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFT
RDTGATLVVPGSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAG
RNTSGKDRLAINHQFTRSFFKQQIDYVRALGDAVVLEQPARTQQLLGWYS
RVVTNLDEYYQPPDKRLYRKGQG
>MT0418 polyketide synthase
MTDGSVTADKLQKWFREYLSTHIECHPNEVSLDVPIRDLGLKSIDVLAIP
GDLGDRFGFCIPDLAVWDNPSANDLIDSLLNQRSADSLRESHGHADRNTQ
GRGSINEPVAVIGVGCRFPGDIDGPERLWDFLTEKKCAITAYPDRGFTNA
GTFAESGGFLKDVAGFDNRFFDIPPDEALRMDPQQRLLLEVSWEALEHAG
IIPESLRLSRTGVFVGVSSTDYVRLVSASAQQKSTIWDNTGGSSSIIANR
ISYFLDIQGPSIVIDTACSSSLVAVHLACRSLSTWDCDIALVGGTNVLIS
PEPWGGFREAGILSQTGCCHAFDKSADGMVRGEGCGVIVLQRLSDARLEG
RRILAILTGSAVNQDGKSNGIMAPNPSAQIGVLENACKSARVDPLEIGYV
EAHGTGTSLGDRIEAHALGMVFGRKRPGSGPLMIGSIKPNIGHLEGAAGI
AGLIKAVLMVERGSLLPSGGFTEPNPAIPFTELGLRVVDELQEWPVVAGR
PRRAGVSSFGFGGTNAHVIVEEAGSVGADTVSGRADVGGSGGGVVAWVIS
GKTASALAAQAGRLGRYVRARPALDVVDVGYSLVSTRSVFDHRAVVVGQT
RDELLAGLAGVVAGRPEAGVVCGVGKPAGKTAFVFAGQGSQWLGMGSELY
AAYPVFAEALDAVVDELDRHLRYPLRDVIWGHDQDLLNTTEFAQPALFAV
EVALYRLLMSWGVRPGLVLGHSVGELAAAHVAGALCLPDAAMLVAARGRL
MQALPAGGAMFAVQAREDEVAPMLGHDVSIAAVNGPASVVISGAHDAVSA
IADRLRGQGRRVHRLAVSHAFHSALMEPMIAEFTAVAAELSVGLPTIPVI
SNVTGQLVADDFASADYWARHIRAVVRFGDSVRSAHCAGASRFIEVGPGG
GLTSLIEASLADAQIVSVPTLRKDRPEPVSVMTAAAQGFVSGMGLDWASV
FSGYRPKRVELPTYAFQHQKFWLAPAPSVSDPTAAGQIGASDGGAELLAS
SGFAARLAGRSADEQLAAAIEVVCEHAAAVLGRDGAAGLDAGQAFADSGF
NSLSAVELRNRLTAVTAVTLPATAIFDHPTPTELAQYLITQIDGHGSSAA
AAANPAERIDALTDVFLQACDAGRDADGWKMVALASNTRERMSSPVRNNV
SKNVALLADGISDVVVICIPTLTVLSDQREYRDIANAMTGRHSVYSLTLP
GFDSSDALPQNADMIVETVSNAIIDVVGGSCRFVLSGYSSGGVLAYALCS
HLSVKHQRNPLGVALIDTYLPSQIANPSMNEGFSPNDTGKGLSREVIRVA
RMLNRLTATRLTAAATYAAIFQAWEPGRSMAPVLNIVAKDRIATVENLRE
ERINRWRTAAAEAAYSVAEVPGDHFGMMSTSSEAIATEIHDWISGLVRGP
HP
>MT1177 oxidoreductase, short-chain dehydrogenase/reductase family
MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVGGLGDRA
RFAQADVTDEAAVSNALELADSLGPVRVVVNCAGTGNAIRVLSRDGVFPL
AAFRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQ
IGQAAYSASKGGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAE
AKASLGQQVPHPSRLGNPDEYGALVLHIIENPMLNGEVIRLDGAIRMAPR
>MT0572 oxidoreductase, short-chain dehydrogenase/reductase family
MSKRPLRWLTEQITLAGMRPPISPQLLINRPAMQPVDLTGKRILLTGASS
GIGAAATKQFGLHRAVVVAVARRKDLLDAVADRITGDGGTAMSLPCDLSD
MEAIDALVEDVEKRIGGIDILINNAGRSIRRPLAESLERWHDVERTMVLN
YYAPLRLIRGLAPGMLERGDGHIINVATWGVLSEASPLFSVYNASKAALS
AVSRIIETEWGSQGVHSTTLYYPLVATPMIAPTKAYDGLPALTAAEAAEW
MVTAARTRPVRIAPRVAVAVNALDSIGPRWVNALMQRRNEQLNP
>MT1705 chalcone/stilbene synthase family protein
MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAAAKVNGR
HLVLPLQQYPSLTDFGDANEIFIEKAVDLGVEALLGALDDANLRPSDIDM
IATATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRD
YLRGAPDDVAVLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGD
RRAEQVRAGGPDILDSRSSLYPDSLHIMGWDVGSHGLRLRLSPDLTNLIE
RYLANDVTTFLDAHRLTKDDIGAWVSHPGGPKVIDAVATSLALPPEALEL
TWRSLGEIGNLSSASILHILRDTIEKRPPSGSAGLMLAMGPGFCTELVLL
RWR
>MT0110 peptide synthetase, putative
MASRAGGCVHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARF
LAALHATVLDNPVQLCVLENSGADYPDLVPRLRFGDIVRVGSADEHLQST
WCSGILGKPLVRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARY
LTTDPAGETPSVGAGLAKLREAHRRETAKVEESRGRLSAVVQRELADEAY
HGGHGHSVSDAPGTAAKGVLHESATICGNAFDAILTLSEAQRVPLNVLVA
AAAVAVDASLRQNTETLLVHTVDNRFGDSDLNVATCLVNSVAQTVRFPPF
ASVSDVVRTLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVEALTLNFIR
EPCAPGLRPFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLP
ACKTHPKVAERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPA
RQPSTPAWFLDSARGVHQFLGRRRFVYPWVAWLVQRGAAPGDVLVFTDDD
TDKTIDLLIACHLAGCGYSVCDTADEISVRTNAITEHGDGILVTVVDVAA
TQLAVVGHDELRKVVDERVTQVTHDALLATKTAYIMPTSGTTGQPKLVRI
SHGSLAVFCDAISRAYGWGAHDTVLQCAPLTSDISVEEIFGGAACGARLV
RSAAMKTGDLAALVDDLVARETTIVDLPTAVWQLLCADGDAIDAIGRSRL
RQIVIGGEAIRCSAVDKWLESAASQGISLLSSYGPTEATVVATFLPIVCD
QTTMDGALLRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVT
AADGSRRRAFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRI
AEDPAVSDVAVELHSGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVS
SFFVVGVPNIPRKPNGKIDSDNLPRLPQWSAAGLNTAETGQRAAGLSQIW
SRQLGRAIGPDSSLLGEGIGSLDLIRILPETRRYLGWRLSLLDLIGADTA
ANLADYAPTPDAPTGEDRFRPLVAAQRPAAIPLSFAQRRLWFLDQLQRPA
PVYNMAVALRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPRQLVI
EARRADLGCDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFRIA
DDEHVLVAVAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYV
DYTLWQREILGDLDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVA
DQRGASLVVDWPASVQQQVRRIARQHNATSFMVVAAGLAVLLSKLSGSPD
VAVGFPIAGRSDPALDNLVGFFVNTLVLRVNLAGDPSFAELLGQVRARSL
AAYENQDVPFEVLVDRLKPTRALTHHPLIQVMLAWQDNPVGQLNLGDLQA
TPMPIDTRTARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEAQAIDVL
IERLRKVLVAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPV
SIPQMLAAQVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPG
ECVALLFERCAPAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVA
AVTTAGLRSRLAGHDLPIIDVVDALAAYPGTPPPMPAAVNLAYILYTSGT
TGEPKGVGITHRNVTRLFASLPARLSAAQVWSQCHSYGFDASAWEIWGAL
LGGGRLVIVPESVAASPNDFHGLLVAEHVSVLTQTPAAVAMLPTQGLESV
ALVVAGEACPAALVDRWAPGRVMLNAYGPTETTICAAISAPLRPGSGMPP
IGVPVSGAALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRF
VACPFGGSGARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVA
TALAELAGVGQAVVIAREDRPGDKRLVGYATEIAPGAVDPAGLRAQLAQR
LPGYLVPAAVVVIDALPLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKT
VAGIFARVLGLERVGVDDSFFELGGDSLAAMRVIAAINTTLNADLPVRAL
LHASSTRGLSQLLGRDARPTSDPRLVSVHGDNPTEVHASDLTLDRFIDAD
TLATAVNLPGPSPELRTVLLTGATGFLGRYLVLELLRRLDVDGRLICLVR
AESDEDARRRLEKTFDSGDPELLRHFKELAADRLEVVAGDKSEPDLGLDQ
PMWRRLAETVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKLKP
FTYVSTADVGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEV
LLREANDLCALPVAVFRCGMILADTSYAGQLNMSDWVTRMVLSLMATGIA
PRSFYEPDSEGNRQRAHFDGLPVTFVAEAIAVLGARVAGSSLAGFATYHV
MNPHDDGIGLDEYVDWLIEAGYPIRRIDDFAEWLQRFEASLGALPDRQRR
HSVLPMLLASNSQRLQPLKPTRGCSAPTDRFRAAVRAAKVGSDKDNPDIP
HVSAPTIINYVTNLQLLGLL
>MT2446 hypothetical protein
MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGANWQASG
GWTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATA
SFAEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWAL
CTHETTVQADALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAE
RVAVIGGGETAASMLNELFRHRVSTITVISPQVTLFTRGEGFFENSLFSD
PTDWAALTFDERRDALARTDRGVFSATVQEALLADDRIHHLRGRVAHAVG
RQGQIRLTLSTNRGSENFETVHGFDLVIDGSGADPLWFTSLFSQHTLDLL
ELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGPGFPNLSCL
GLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR
>MT3610 4-coumarate-CoA ligase, putative
MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAAALRERL
DPARPPHVGVLLQNTPFFSATLVAGALSGIVPVGLNPVRRGAALAGDIAK
ADCQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLAD
LFMLIFTSGTSGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLF
HSNAVLVGWAVAAACQGSMALRRKFSASQFLADVRRYGATYANYVGKPLS
YVLATPELPDDADNPLRAVYGNEGVPGDIDRVGRRFGCVVMDGFGSTEGG
VAITRTLDTPAGALGPLPGGIQIVDPDTGEPCPTGVVGELVNTAGPGGFE
GYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWMRVDGENLG
TAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRA
FLTEQPDLGHKQWPSYVRVSAGLPRTMTFKVIKRQLSAEGVACADPVWPI
RR
>MT1470 substrate--CoA ligase, putative
MRIRQAFGLIATMRRAGLIAPLRPDRYLRIVAAMRREGMGFTAGFAGAAR
RCPDRPGLIDELGTLTWRQLDERGNALAAALQALPAGPPRVVGIMCRNHR
GFVDALLAVNRIGAHILLLNTSFAGPALAEVVTREGVDTVVYDEEFSATV
DRALAEKPQATRIVAWTDEDHDLTVEKLVAAHAGRRPEHTGSHGKVILLT
SGTTGTPKGARHSGGGIGTLKAILDRTPWRAEEVTVIVAPMFHAWGFSQL
VLASSLACTIVTRRRFDPEATLDLIDRHHATGLVVVPVMFDRIMDLPAEI
RNRYDGRSLRFAAASGSRMRPDVVIAFMDQFGDVIYNNYNATEAGMIATA
TPADLRTAPDTAGRPAEGTEIRILDQQFTEVPTGEVGTIYVRNDSQFDGY
TSGAAKDFHAGFMSSGDVGYLDENGRLFVVGRDDEMIVSGGENIYPIEVE
KTLATHPDVAEAAVIGVDDQQYGQRLAAFVVLKPGVSATPETLKQHVRDN
LANYKVPRDIAVLDELPRGITGKILRTELQSRVGS
>MT0919 conserved hypothetical protein
MTAPPIAVERNTRSKVRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAA
DALFLWAETPTRPLHVGALAVLSQPDNGTGRYLRKVFSAAVARQQVAPWW
RRRPHRSLTSLGQWSWRTETEVDLDYHVRLSALPPRAGTAELWALVSELH
AGMLDRSRPLWQVDLIEGLPGGRCAVYVKVHHALADGVSVMRLLQRIVTA
DPHQRQMPTLWEVPAQASVAKHTAPRGSSRPLTLAKGVLGQARGVPGMVR
VVADTTWRAAQCRSGPLTLAAPHTPLNEPIAGARSVAGCSFPIERLRQVA
EHADATINDVVLAMCGGALRAYLISRGALPGAPLIAMVPVSLRDTAVIDV
FGQGPGNKIGTLMCSLATHLASPVERLSAIRASMRDGKAAIAGRSRNQAL
AMSALGAAPLALAMALGRVPAPLRPPNVTISNVPGPQGALYWNGARLDAL
YLLSAPVDGAALNITCSGTNEQITFGLTGCRRAVPALSILTDQLAHELEL
LVGVSEAGPGTRLRRIAGRR
>MT3029 hypothetical protein
MDLVQFQDVRLMRVVVCRRLGPAKGQRRWRPLDLGTTGCFENLGAQRPTY
RMRAIRMLECAMPNRLVRSLQRWRPFGLPPHRWRLAPWYWRGLQVTLEPG
SAIAWIVRLTGGFEETEIDIAAALYSALYPDRCILDVGANVGIHSLAWAR
LAPVVALEPAPGTHSRLEANVAANGLQDRIRTLRTAAGDAVGEVDFFVAA
DSAFSSLNDTGRIRIRERTRVPCTTLDALAAELPLPVGLLKIDVEGLERA
VIAGAAELLRRDRPVLLVEIYGGAASNPDPERTIADIRAYGYEPFVYADD
AGLQPYQRHRDDRYCYFFIPSRKG
>MT3514 dioxygenase, putative
MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKVVFFRGQ
HQLDDAEQLAFAGLLGTPIGHPAAIALADDAPIITPINSEFGKANRWHTD
VTFAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWA
LHTNRYDYVTTKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLA
GDFVRSFVGLDSHESRVLFEVLQRRITMPENTIRWNWAPGDVAIWDNRAT
QHRAIDDYDDQHRLMHRVTLMGDVPVDVYGQASRVISGAPMEIAG
>MT2328 P450 heme-thiolate protein
MGLNTAIATRVNGTPPPEVPIAGIELGSLDFWALDDDVRDGAFATLRREA
PISFWPTIELPGFVAGNGHWALTKNDDVFYASRHPDIFSSYPNITINDQT
PELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVS
SMIANNPDRQADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGF
GDPDLATDFDEFMQVSADIGAYATALAEDRRVNHHDDLTSSLVEAEVDGE
RLSSREIASFFILLVVAGNETTRNAITHGVLALSRYPEQRDRWWSDFDGL
APTAVEEIVRWASPVVYMRRTLTQDIELRGTKMAAGDKVSLWYCSANRDE
SKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVAFDELRRQM
PDVVATEEPARLLSQFIHGIKTLPVTWS
>MT3928 conserved hypothetical protein
MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQAQHLRR
YRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDTYHSWFEF
DNAEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFG
IIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPA
GRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHFPLPLGDLSV
PHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAALAERELTNCE
TFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSAARVAQISFD
SGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAPLSTVANSD
LNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMK
SIYIRTADGTLATLKPGT
>MT1702 polyketide synthase
MSGTTTHVDYLKRLTADLRRTRRRLSDLEAKLSEPVAVVGMGCRYPGGVD
SPETLWELVAQGRDAVSDFPADRGWDVDGLFDPDPDACGKMYTRRGTFLE
HAGDFDAGFFGIGPSEALAMDPQQRLLLEVSWEALERTGIDPTKLRGSAT
GVFAGVIHAGYGGQLSGELEGYGLTGSTLSVASGRVAYVLGLEGPAVSVD
TACSSSLVALHLAVQSLRSGECDLALAGGVTVMATPAAFVEFSRQRALAR
DGRCKVYAGAADGTAWSEGAGVLVVERLVDARRLGHPVLALVRGSAVNQD
GASNGLTAPNGPSQQRVIRAALASARLRAVEVDVVEGHGTGTMLGDPIEA
QALLATYGQDRVEPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHGVMPK
TLHVDVPTPHVDWSVGAVSLLTQPRAWSVHGRPRRAGVSSFGISGTNAHV
ILEQAPVVESVVPEVASPTAASAVPWVLSARSEQALAGQAQRLLAFVAAN
PDLDPIDVGWSLVKTRAMFEHRAVVVGADRGALLAGLAALAAGESGAGVA
VGRARSVGKTVFVFPGQGAQWVGMGAQLYAELPLFALAFDAVAEELDRHL
RLPLRNVLWEGDEALLTSTEFAQPALFAIEVALATLLQHWGISPDFLIGH
SVGEIAAAHLAGVLSLTDAAGLVAARGRLMAELPAGGVMVVVAASEEEVL
PVLVDGANLAAVNAPHSVVVSGCEAAVSDIADHFARRGRRVHRLAVSHAF
HSLLMEPMLAEFTRIAAGISVSKPRIPLVSNVTGQMAGAGYGDGQYWVEH
ARRPVRFAEGVQLLNAVGATRFVEVGPGGGLTALVEQSLPLGEALSVAMM
RREHPEVSSVLGAVATLFTAGAQMDWPAVFGSPGRRIELPTYAFQRQRYW
LPPTSAGSADISGVGLLAARHGLLGAVVEQPDSDVVVLTGRLSVGEQRWL
ADHVIAGVVLLAGAAFVELALRAADQVDCGVVEELTVVTPLVLPTVGGVQ
LQVVVGVGEMGQRPVSIYSRNAESDSGWVLHARGVLGAKAVAPAADLSVW
PPLGAAPVDVDGAYQRFAELGYEYGRAFQGLTAMWRRESELFADVAVPDD
VDVTLSGFGIHPLVLDAALHAMGMVGEQAATMLPFSWQGVSLHAAGASRV
RARIAPAGDGTVSVELADQAGLPVLSVQALVMRSVSSQLLSAAVAAADAA
GRGLLEVAWLPVELAHNDISADLVVWELESFQDGVGPVYSATHRVLVALQ
SWLAQERAGRLVVLTQGSVGQDATNLAGAAVWGLVRSAQAEHPGRVMLVD
SDGSMDVGDVIGCGEEQLMIRNGTAYAARLAQLRPQPILQLPDTNSGWRL
VAGGAGTLEDLTLASCPAKELAPGQVRIEVRALGVNFRDVLVALGIYPGA
AELGAEGAGVVTEVGPGVTGLAVGDPVMGLLGVAGSEAVVDARLVVKLPN
RWPLTDAAGVPVVFLTAYYALRVLAQVQPGESVLVHAAAGGVGMAAVQLA
RLWGLEVFATASRGKWDTLHTMGCDNTHVADSRTLAFEETFWLTTEGRGV
DVVLNSLAGEFTDASLRLLPRGGRFIEMGKTEFGTPRSLPRTILGWPTGL
ST
>MT3899 oxidoreductase, short-chain dehydrogenase/reductase family
MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPDDPRRED
AAAAMKQAGARSVELIDFDALDTDSHPKMIEAAFSGGDVDVAIVAFGLLG
DAEELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAG
ERVRRANFVYGSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAH
LKEAPLTVDKEYVANLAVTASAKGKELVWAPAAFRYVMMVLRHIPRSIFR
KLPI
>MT1931 oxidoreductase, short-chain dehydrogenase/reductase family
MKAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQLGAERL
WARAVDVTDKAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYE
AAVRVVDVNFKAVLTGAYAALPYLKKAPGSLMFSTSSSSGTYGMPRIAVY
SATKHAVKGLTEALSVEWQRHGVRVADVLPGLIDTAILTSTRQHSDEGPY
TISAEQIRAAAPKKGMFRLMPSSSVAEAAWRAYQHPTRLHWYVPRSIRWI
DRLKGVSPEFVRRHIAKSLATLEPKRK
>MT0098 methyltransferase, putative
MDQPWNANIHYDALLDAMVPLGTQCVLDVGCGDGLLAARLARRIPYVTAV
DIDAPVLRRAQTRFANAPIRWLHADIMTAELPNAGFDAVVSNAALHHIED
TRTALSRLGGLVTPGGTLAVVTFVTPSLRNGLWHLTSWVACGMANRVKGK
WEHSAPIKWPPPQTLHELRSHVRALLPGACIRRLLYGRVLVTWRAPV
>MT0175 substrate--CoA ligase
MMQPDAPALRFVGNTMTWADLRRRVAALAGALSGRGVGFGDRVMILMLNR
TEFVESVLAANMIGAIAVPLNFRLTPTEIAVLVEDCVAHVMLTEAALAPV
AIGVRNIQPLLSVIVVAGGSSQDSVFGYEDLLNEAGDVHEPVDIPNDSPA
LIMYTSGTTGRPKGAVLTHANLTGQAMTALYTSGANINSDVGFVGVPLFH
IAGIGNMLTGLLLGLPTVIYPLGAFDPGQLLDVLEAEKVTGIFLVPAQWQ
AVCTEQQARPRDLRLRVLSWGAAPAPDALLRQMSATFPETQILAAFGQTE
MSPVTCMLLGEDAIAKRGSVGRVIPTVAARVVDQNMNDVPVGEVGEIVYR
APTLMSCYWNNPEATAEAFAGGWFHSGDLVRMDSDGYVWVVDRKKDMIIS
GGENIYCAELENVLASHPDIAEVAVIGRADEKWGEVPIAVAAVTNDDLRI
EDLGEFLTDRLARYKHPKALEIVDALPRNPAGKVLKTELRLRYGACVNVE
RRSASAGFTERRENRQKL
>MT0954 oxidoreductase, short-chain dehydrogenase/reductase family
MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTSSELDAV
AEQIRAAGRRAHTVAADLAHPEVTAQLAGQAVGAFGKLDIVVNNVGGTMP
NTLLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMG
RLAARGFAAYGTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVV
AANDELRAPMEQATPLRRLGDPVDIAAAAVYLASPAGSFLTGKTLEVDGG
LTFPNLDLPIPDL
>MT3023 acyl-CoA synthase
MSESSLADLLQKAASQYPNRAAYKFIDYDTDPAGFTETVTWWQVHRRAMI
VAEELWIYASSGDRVAILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIH
DERISSALRDSAPSIILTTSSVIDEVTTYAPHACAAQGQSAPIVVAVDAL
DLSSSRALDPTRFERPSTAYLQYTSGSTRAPAGVVLSHKNVITNCVQLMS
DYIGDSEKVPSTPVSWLPFYHDMGLMLGIILPMINQDTAVLMSPMAFLQR
PARWMQLLAKHRAQISSAPNFGFELAVRRTSDDDMAGLDLGHVRTIVTGA
ERVNVATLRRFTERFAPFNLSETAIRPSYGLAEATVYVATAGPGRAPKSV
CFDYQQLSVGQAKRAENGSEGANLVSYGAPRASTVRIVDPETRMENPAGT
VGEIWVQGDNVGLGYWRNPQQTEATFRARLVTPSPGTSEGPWLRTGDLGV
IFEGELFITGRIKELLVVDGANHYPEDIEATIQEITGGRVVAIAVPDDRT
EKLVTIIELMKRGRTDEEEKNRLRTVKREVASAISRSHRLRVADVVMVAP
GSIPVTTSGKVRRSASVERYLHHEFSRLDAMA
>MT1447 methyltransferase, putative
MTVYTPTSERQAPATTHRQMWALGDYAAIAEELLAPLGPILVSTSGIRRG
DRVLDVAAGSGNVSIPAAMAGAHVTASDLTPELLRRAQARAAAAGLELGW
REANAEALPFSAGEFDAVLSTIGVMFAPRHQRTADELARVCRRGGKISTL
NWTPEGFYGKLLSTIRPYRPTLPAGAPHEVWWGSEDYVSGLFRDHVSDIR
TRRGSLTVDRFGCPDECRDYFKNFYGPAINAYRSIADSPECVATLDAEIT
ELCREYLCDGVMQWEYLIFTARKC
>MT2127 conserved hypothetical protein
MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSHRILWPD
REYRPDLDILIAGCGTNQAAIFAFTNRAAKVVAIDISRPALDHQQYLKDK
HGLANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLR
RDGVVAAMLYGKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTY
HPLRNYLTKARDLLSDSALVDTFLHGRQRSYTVEECVDLVTSAGLVFQGW
FHKAPYYPHDFFVPNSEFYAAVNTLPEVKAWSVMERLKTLNATHLFMACR
RDRPKEQYTIDFSTVAALDYVPLMRTRCGVSGTDMFWPGWRMAPSPAQLA
FLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQSLWRLDFVA
VALPASG
>MT3429 hypothetical protein
MGCFCVCSAQVQEVAKNSLRGVPESVVMSYSYFVELPRLEDIEPGAHTDV
LIANSRVDQGRIRAAVEAVFDAHPALGTVFEPRVDTLTSRPGGGGWGWGV
EPPGAAVAEVIARHSASFDMYTGRLFAVSLLPGSPDRLVLTASRLCVDDA
SWQTVVEDLVRQYDESVLVPAR
>MT0181 virulence factor mce family protein
MSTIFDIRNLRLPQLSRASVVIGSLVVVLALAAGIVGVRLYQKLTNNTVV
AYFTQANALYVGDKVQIMGLPVGSIDKIEPAGDKMKVTFHYQNKYKVPAN
ASAVILNPTLVASRNIQLEPPYRGGPVLADNAVIPVERTQVPTEWDELRD
SVSHIIDELGPTPEQPKGPFGEVIEAFADGLAGKGKQINTTLNSLSQALN
ALNEGRGDFFAVVRSLALFVNALHQDDQQFVALNKNLAEFTDRLTHSDAD
LSNAIQQFDSLLAVARPFFAKNREVLTHDVNNLATVTTTLLQPDPLDGLE
TVLHIFPTLAANINQLYHPTHGGVVSLSAFTNFANPMEFICSSIQAGSRL
GYQESAELCAQYLAPVLDAIKFNYFPFGLNVASTASTLPKEIAYSEPRLQ
PPNGYKDTTVPGIWVPDTPLSHRNTQPGWVVAPGMQGVQVGPITQGLLTP
ESLAELMGGPDIAPPSSGLQTPPGPPNAYDEYPVLPPIGLQAPQVPIPPP
PPGPDVIPGPVPPTPAPVGAPLPAEAGGGQ
>MT0417 acyl-CoA synthase
MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSRVTAVSA
YLISYGRHADRRRTAAISAPQGLDYVAGFLGALCAGWTPVPLPEPLGSLR
DKRTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEP
SGDNCDLDSQLSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYF
RHEGGAPRLPSSVVSWLPLYHDMGLMVGLFIPLFVGCPVILTSPEAFIRK
PARWMQLLAKHQAPFSAAPNFAFDLAVAKTSEEDMAGLDLGHVNTIINGA
EQVQPNTITKFLRRFRPYNLMPAAVKPSYGMAEAVVYLATTKAGSPPTST
EFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDPDSNIELGP
GRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGD
LGFIVGDEFYIVGRMKDLIIQDGVNHYPDDIETTVKEFTGGRVAAFSVSD
DGVEHLVIAAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLL
VPPGALPKTTSGKISRAACAKQYGANKLQRVATFP
>MT0700 hydrolase/esterase, putative
MLRRVAILLAAVLAFAGCSGGTRLAAGFGNGNSVHTLDVDGAGRSYRLYK
PVGLPSSAPLVVMLHGGFGSAKQAERSYGWDELADSEKFLVAYPDGYHRA
WNANGGGCCGRPAREGVDDIGFVRAVVADIANNVSIDPARVYVTGMSNGA
IMSYTLACNTSIFAAIGVVSGTQLDPCQSPRPVSVIHIHGTADPLVRYHG
GPGAGFARIDGPPVPDLNAFWREVNRCGALDTTTEGPVTTSGATCADNRR
VVLLTVDDAGHRWPSFATQTLWRFFAAHFR
>MT0316 oxidoreductase, short-chain dehydrogenase/reductase family
MNTGTAVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRAAMEELG
EPNRCSVLEVDLASVRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGIAF
TDDGVEMTFGVNHLGHFALVTGILDWLARPARIVVVSSGTHDPSKHTGMP
DPRYTCAADLAHPPTDQNTPAEGRRRYTTSKLCNVLFTYELDRRLDHGEQ
GVMVNAFDPGLMPGSGLARDYPPILRLAYRLLSPMLRVLPFVHSTRVSGE
HLAALAVDPRFAGVTGQYFAGAKAIRSSAESYDRAKALDLWETSERLLAQ
VT
>MT0594 P450 heme-thiolate protein
MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFTLHVAGF
GHMVYLSDPAAIKTVFAGNPSVFHAGEANSMLAGLLGDSSLLLIDDDVHR
DRRRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITL
EVILRTVIGASDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSR
LRRRIEEADALLYAEIADRRADPDLAARTDTLAMLVRAADEDGRTMTERE
LRDQLITLLVAGHDTTATGLSWALERLTRHPVTLAKAVQAADASAAGDPA
GDEYLDAVAKETLRIRPVVYDVGRVLTEAVEVAGYRLPAGVMVVPAIGLV
HASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGATFAMVEMRV
VLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQA
TAQGAGCPAARGGGPSRAVGSQ
>MT0790 P450 heme-thiolate protein
MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNFWAVSRH
HDVLQGFRDSTALSNAYGVSLDPSSRTSEAYRVMSMLAMDDPAHLRMRTL
VSKGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVIS
ELIGVPDTDRARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIA
EFRRRPANNLTSALLAAELDGDRLSDQEIMAFLFLMVIAGNETTTKLLAN
AVYWAAHHPGQLARVFADHSRIPMWVEETLRYDTSSQILARTVAHDLTLY
DTTIPEGEVLLLLPGSANRDDRVFDDPDDYRIGREIGCKLVSFGSGAHFC
LGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGFAHLPISVQ
AR
>MT2270 oxidoreductase, short-chain dehydrogenase/reductase family
MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVLWDGVVP
LLAERFRIVRYDNRGVGRSSVPKPISAYTMAHFADDFDAVIGELSPGEPV
HVLAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRP
WRPRTFLRAISQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIP
VDQIHHSETLARDAAHSVKTYPANYFRSFSSSRRGRAIPIVDVPVQLIVN
SQDPYVRPYGYDQTARWVPRLWRRDIKAGHFSPMSHPQVMAAAVHDFADL
ADGKQPSRALLRAQVGRPRGYFGDTLVSVTGAGSGIGRETALAFAREGAE
IVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFAERVSAEHG
VPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVER
GTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLT
TICPGVIDTNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVAD
AIVSAVKKKKPIRPVAPEAYALYGISRVLPQALRSTARLRVI
>MT2187 oxidoreductase, short-chain dehydrogenase/reductase family
MPCSGWTCSRRGGTFSAMTSLQGKVVFITGAARGIGAEVARRLHNKGAKL
VLTDLSKSELAVMGAELGGDDRLLTVVADVRDLPAMQAAAETAVERFGGI
DVVVANAGIASYGSVLKVDPQAFRRVLDVNLLGNFHTVRATLPALIDRRG
YVLIVSSLAAFAAPPGMAPYNMSKAGNEHFANALRLEVAHLGVSVGSAHM
SWIDTALVRDTKADLPAFAELLARLPWPLNKTTSVNKCAAAFVNGIEGRK
DRVYCPGWVALFRWLKPLLSTRVGQRPIRNTVAKLMPQMDAEVAALGRFA
SAYTESLENS
>MT3170 oxidoreductase, short-chain dehydrogenase/reductase family
MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLAKTVRLA
QALGAQVKSDRLDVAEREAVLAHADAVVAHFGTVHQVYNNAGIAYNGNVD
KSEFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVP
GQSAYNAAKFAVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATV
ADGEDQQTFAEFFDRRLALHSPEMAAKTIVNGVAKGQARVVVGLEAKAVD
VLARIMGSSYQRLVAAGVAKFFPWAK
>MT1976 acyl-CoA synthase
MNDGSRQELRVRSGLLQIEDCLDADGGIALPAGTTLISLIERNIKYVGDL
VAYRYLDHARSAAGCALEVTWTQFGMRLAAIGAHVQRFAGPGDRVAILAP
QGIDYVCGFYAAIKAGTVAVPLFAPELPGHAERLDTALRDSEPAVILTTA
AAKNAVEGFLNNVPRLRKPTVLVIDQIPDREGELFVPVELDIDAVSHLQY
TSGSTRPPVGVEITHRAVGTNLVQMILSIDLLNRNTHGVSWLPLYHDMGL
SMIGFPAVYGGHSTLMSPTAFVRRPLRWIQALSEGSRTGRVVTAAPNFAY
EWAAQRGLPAQGDDVDLSNVVLIIGSEPVSIDAVTTFNKAFAPYGLPRTA
FKPSYGIAEATLLVATIDHAAEPTVVYLDPEQLGAGHATRVAPDAPNAVV
HVSCGHVARSLWAVIVDPDTGPEAGAELPDGEIGEVWLQGDNVARGYWGR
PEETRMTFGARLQSPLAEGSHADGSAIDDTWLRTGDLGVYLDGELYITGR
IADLLTIDGRNHYPQDIEATAAEASPMVRRGYITAFTVPASDGDDRNQRL
VIIAERAAGTSRSDPRPALDAIRAAVCNRHGLSVADLSFLPAGAIPRTTS
GKLARQACRAQYLSGRLGVH
>MT0127 coenzyme A synthetase, putative
MLIVPNPHTEHMEGAFAMASDFGPRIADLVEVAATRLPEAPALVVTADRI
AISHRDLARLVDELAGQLTRSGLLPGDRVALRMGSNAEFVVALLAASRAD
LVVVPLDPALPITEQRVRSQAAGARVVLIDADGPHDRAEPTTRWWPLTVN
VGGDSGPSGGTLSVHLDAATEPNPATSTPEGLRPDDAMIMFTGGTTGLPK
MVPWTHANIASSVRAIITGYRLSPRDATVAVMPLYHGHGLIASLLATLAS
GGAVSLPARGRFSAHTFWDDIKAVGATWYTAVPTIHQILLERSATEPSGR
KPAALRFIRSCSAPLTAQAALALQTEFAAPVVCAFGMTEATHQVTTTQIE
GIDQTETPVVSTGLVGRSTGAQIRIVGSDGLPLPAGAVGEIWLRGTTVVR
GYLGDPTITAANFTDGWLRTGDLGSLSAAGDLSIRGRIKELINRGGEKIS
PERVEGVLASHPNVMEAAVFGVPHQLYGEAVAAVIVPRESAPPTREELVQ
FCRERLAAFEIPASFQEASGLPHTAKGSLDRRAVAERFGHSV
>MT3321 oxidoreductase, short-chain dehydrogenase/reductase family
MSLNGKTMFISGASRGIGLAIAKRAARDGANIALIAKTAEPHPKLPGTVF
TAAKELEEAGGQALPIVGDIRDPDAVASAVATTVEQFGGIDICVNNASAI
NLGSITEVPMKRFDLMNGIQVRGTYAVSQACIPHMKGRENPHILTLSPPI
LLEKKWLRPTAYMMAKYGMTLCALGIAEEMRADGIASNTLWPRTMVATAA
VQNLLGGDEAMARSRKPEVYADAAYVIVNKPATEYTGKTLLCEDVLVESG
VTDLSVYDCVPGATLGVDLWVEDANPPGYLPA
>MT0802 P450 heme-thiolate protein
MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEHTPDGEG
FWSVATYAETLEVLRDPVTYSSVTGGQRRFGGTVLQDLPVAGQVLNMMDD
PRHTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIA
AELPMQMICILLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGS
RLYTYALELIAGKRAEPADDMLSVVANATIDDPDAPALSDAELYLFFHLL
FSAGAETTRNSIAGGLLALAENPDQLQTLRSDFELLPTAIEEIVRWTSPS
PSKRRTASRAVSLGGQPIEAGQKVVVWEGSANRDPSVFDRADEFDITRKP
NPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEPAEWTRSNR
HTGIRHLVVELRGG
>MT3602 virulence factor mce family protein
MAGSGVPSHRSMVIKVSVFAVVMLLVAAGLVVVFGDFRFGPTTVYHATFT
DASRLKAGQKVRIAGVPVGSVKAVKLNPDHSIDVAFAIDRSYTLYSSTRA
VIRYENLVGDRFLEITSGPGELRKLPPGGTINVAHTQPALDLDALLGGLR
PVLKGFDADKINTITSAVIELLQGQGGPLANVLADTGAFSAALGARDQLI
GEVITNLNAVLATVDAKSAQFSASVDQLQQLVSGLAKNRDPIAGAISPLA
STTTDLTELLRNSRRPLQGILENARPLATELDNRKAEVNNDIEQLGEDYL
RLSALGSYGAFFNIYFCSVTIKINGPAGSDILLPIGGQPDPSKGRCAFAK
>MT1574 methyltransferase, putative
MGSAGVPAADAGGRDAASEQIARWTQTCTVVLVCGHGPAKWAFRSWCTSR
SCDTLPVALRYRLQSNPLVGKLTTKYFLPLGTRQVGDHVVFFNFGYEEDP
PMALPLSESDEPNRYCIQLYHQTASQVDLTGKEVLEVSCGAGGGASYIAR
NLGPASYTGLDLNPASIDLCRAKHRLPGLQFVQGDAQNLPFPDESFDAVV
NVEASHQYPDFRGFLAEVARVLRPGGHFLYTDSRRNPVVAEWEAALADAP
LRTISQRDIGAQAKRGLDANTARSQEAIGRRAPVLLAGLTRCAVRVLDWD
LRRGGGFSYRIYLFAKD
>MT3619 P450 heme-thiolate protein
MRANQPVFRDRNGLAAASTYQAVIDAERQPELFSNAGGIRPDQPALPMMI
DMDDPAHLLRRKLVNAGFTRKRVKDKEASIAALCDTLIDAVCERGECDFV
RDLAAPLPMAVIGDMLGVRPEQRDMFLRWSDDLVTFLSSHVSQEDFQITM
DAFAAYNDFTRATIAARRADPTDDLVSVLVSSEVDGERLSDDELVMETLL
ILIGGDETTRHTLSGGTEQLLRNRDQWDLLQRDPSLLPGAIEEMLRWTAP
VKNMCRVLTADTEFHGTALCAGEKMMLLFESANFDEAVFCEPEKFDVQRN
PNSHLAFGFGTHFCLGNQLARLELSLMTERVLRRLPDLRLVADDSVLPLR
PANFVSGLESMPVVFTPSPPLG
>MT2999 acyl-CoA synthase
MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWSQVYSRA
CIIAEELKLCGLPGDRVAVLAPQGLEYVLAFLGALQAGFIAVPLSTPQYG
IHDDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLD
LDSPRQMPAFSRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYG
YFGDPAKIPTGTVVSWLPLYHDMGLILGICAPLVARRRAMLMSPMSFLRR
PARWMQLLATSGRCFSAAPNFAFELAVRRTSDQDMAGLDLRDVVGIVSGS
ERIHVATVRRFIERFAPYNLSPTAIRPSYGLAEATLYVAAPEAGAAPKTV
RFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPETMVENPPG
VVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLG
VISDGELFIMGRIKDLLIVDGRNHYPDDIEATIQEITGGRAAAIAVPDDI
TEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVS
PGSIPITTSGKIRRSACVERYRSDGFKRLDVAV
>MT2017 conserved hypothetical protein
MTAAKALVSEWNRMGSQMRFFVGTLAGIPDALMHYRGELLRVIAQMGLGT
GVLAVIGGTVAIVGFLAMTTGAIVAVQGYNQFASVGVEALTGFASAFFNT
REIQPGTVMVALAATVGAGTTAALGAMRINEEIDALEVIGIRSISYLAST
RVLAGVVVAVPLFCVGLMTAYLAARVGTTAIYGQGSGVYDHYFNTFLRPT
DVLWSSVEVVVVALMIMLVCTYYGYAAHGGPAGVGEAVGRAVRASMVVAS
IAILVMTLAIYGQSPNFHLAT
>MT3932 conserved hypothetical protein
MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPPSYVQAR
QIRSFSEQAARGLDHSRLLIASVEVFGHCDLRAMTYVINAHLRRHDTYRS
WFELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWD
CFSFGVIQRADSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPI
GLSEAGSYVDFCVRQHEYTSALTVDSPEVRAWIDFAEINNGTFPEFPLPL
GDPSVRCGGDLLSMMLMDEQQTQRFESACMAANARFIGGILACIAIAIHE
LTGADTYFGITPKDIRTPADLMTQGWFTGQIPVTVPVAGLSFNEIARIAQ
TSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVGPLSAVTKL
FEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRA
IRSVCMRIANGGTAERVGNVVALSPGRRNNIERMTWRSCRAGDFIDICNL
KVANVTVDREA
>MT0109 pp-binding family protein
MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQ
SELPSRLAANPSIAGWLRELEAVCTEFG
>MT0372 conserved hypothetical protein
MHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQDLAVAADPGDA
VWDEAAERAAALVELLSPFEADEGKAPAGRTPGLPGMGSLLLPPWTVTRY
GTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRPISRTA
FLHVDYRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLAEGNGL
MVRLLPGQP
>MT0330 muconolactone isomerase, putative
MEFLVTMTTRVPDSMPADAVERVRAREAARSRELAAQGKLLRLWRPPLRP
GEWRTLGLFAADDNGELEQLLASMPPRSWRTDDVTPLGAHPNDPVGQGIT
IAPGKGPEFLIATTIMVPPGTPAQVVDDTVAREARRAPELAGRGHLVRLW
ALPDGPDGQRTLGLWRARDPGELMAILESLPLAGWMTIETTPLSPHPDDP
IRMP
>MT2030 hypothetical protein
MRTRDVERGRAAMGEANIREQAIATMPRGGPDASWLDRRFQTDALEYLDR
DDVPDEVKQKIIGVLDRVGTLTNLHEKYARIALKLVSDIPNPRILELGAG
HGKLSAKILELHPTATVTISDLDPTSVANIAAGELGTHPRARTQVIDATA
IDGHDHSYDLAVFALAFHHLPPTVACKAIAEATRVGKRFLIIDLKRQKPL
SFTLSSVLLLPLHLLLLPWSSMRSSMHDGFISALRAYSPSALQTLARAAD
PGMQVEILPAPTRLFPPSLAVVFSRSSSAPTESSECSADRQPGE
>MT3143 oxidoreductase, short-chain dehydrogenase/reductase family
MLQRGAGQYFAGKRCFVTGAASGIGRATALRLAAQGAELYLTDRDRDGLA
QTVCDARALGAQVPEHRVLDVSDYQDVAAFAADIHARHPSMDVVLNIAGV
SAWGTVDQLTHDQWSRMVAINLMGPIHVIETLVPPMVAAGRGGHLVNVSS
AAGLVGLPWHAAYSASKYGLRGLSEVLRFDLARHGIGVSVVVPGAVKTPL
VNTVEIAGVDRDDPRVNRWVERFSGHAVTPEKAADKILAGVTRNRYLVYT
SADIRALYAFKRYAWWPYTLVMRRVNVFFTRALRPGP
>MT0756 conserved hypothetical protein
MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVNDQFAEPL
VRAVGVDFFVRMASGELDPDELAEDEANGLRRFADAMAIRTHYFDNFFLD
ATRAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGL
GAAPTTDRRTVAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQ
DRLLDQITAQSVPGSQFATEVLRDINRLNEEELRGRMRRLAERFRRHGLD
LDMSGLVYFGDRTDARTYLADHGWRTASASTTDLLAEHGLPPIDGDDAPF
GEVIYVSAELKQKHQDTR
>MT1230 substrate--CoA ligase
MLLASLNPAVVSAADIADAVRIDGDVLSRSDLVGAATSVAERVAGAHRVA
VLATPTASTVLAITGCLIAGVPVVPVPADVGVTERRHMLTDSGVQAWLGP
LPDDPAGLPHIPVRTHARSWHRYPEPSPGAIAMVVYTSGTTGPPKGVQLS
RRAIAADLDALAEAWQWTAEDVLVHGLPLYHVHGLVLGLLGSLRFGNRFV
HTGKPTPAGYAQACYEAHGTLFFGVPTVWSRVAADQAAAGALKPARLLVS
GSAALPVPVFDKLVQLTGHRPVERYGASESLITLSTRADGERRPGWVGLP
LAGVQTRLVDDDGGEVPHDGETVGKLQVRGPTLFDGYLNQPDATAAAFDA
DSWYRTGDVAVVDGSGMHRIVGRESVDLIKSGGYRVGAGEIETVLLGHPD
VAEAAVVGVPDDDLGQRIVAYVVGSANVDADGLINFVAQQLSVHKRPREV
RIVDALPRNALGKVLKKQLLSEG
>MT3026 methyltransferase, putative
MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINWAYEEDP
PMDLPLEASDEPNRAHINLYHRTATQVDLGGKQVLEVSCGHGGGASYLTR
TLHPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVL
NVEASHCYPHFRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATP
LRQLSQRQINAEVLRGIGNNSQKSRDLVDRHLPAFLRFAGREFIGVQGTQ
LSRYLEGGELSYRMYCFTKD
>MT1546 methyltransferase
MLDVGCGSGRMALPLTGYLNSEGRYAGFDISQKAIAWCQEHITSAHPNFQ
FEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPPDVEH
YLDEISRVLKPGGRCLSTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIH
KKRPEEAIGLPETFVRDVYGKFGLAVHEPLHYGSWSGREPHLSFQDIVIA
TKTAS
>MT3075 P49 protein
MDVTVVGSGPNGLATAVICARAGLNVQVVEAQATFGGGARSAADFEFPEV
LHDVCSAVHPLALASPFFAEFDLPARGVTLTVPDIAYANPLPGRPAAIAY
HDLAHTCAKLDDGASWRRLLGPLVAHSETVVEFMLSDKRSLPTALGSVLR
LGLRMLAQGTPAWRSLAGEDARALFTGVAAHAISPLPSLVSAGAGLMLAT
LAHSVGWPIPVGGTQAIADALIADLRAHGGRLAAGVEITEPQRSVVVFDT
APTALLRVYRDKLPHRYAKALRRYRFRAGIAKVDFVLSDEIPWSDPRLRR
AATLHLGGTRDQMARAEADVAAGRHADWPMVLAACPHVADPGRIDETGRR
PFWTYAHVPSGSTLDATETVTSVLERFAPGFRDIVVAARAVPAARMADHN
ANYVGGDITVGANSTWRAIAGPTPRLNPWRTPIPKVYLCSAATPPGAGVH
GMCGWYAARTLLRTEFGITRMPPLGHELRP
>MT2319 methyltransferase-related protein
MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLPPATSME
IAEAAGLEERYVREWLGGMTTGQIVEYDAGSSTYSLPAHRAGMLTRAAGP
DNLAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVF
DAALIDVVLPLVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTG
IDFSDEAVAAGTEEAARLGLANATFERHDLAELDKVGAYDVITVFDAIHD
QAQPARVLQNIYRALRPGGVLLMVDIKASSQLEDNVGVPLSTYLYTTSLM
HCMTVSLALDGAGLGTVWGRQLATSMLADAGFTDVTVAEIESDVLNNYYI
ARK
>MT1580 acyl-CoA synthase
MVASSIPTALRERASVHPNGAAITYIDYEQDWAGVAETLTWSQLYRRMLN
VAEPLRHVGATGDRAVILAPQGIEYVVGFLGALQAGRIAVPLPVPHAGAH
DERTISVLSDTSPAVILTTSGAVDDVRECAQPQPGQSAPSIVELDLLDLD
SRQRSRSPGARPTGRDTPETAYLQYTSGSTRTPAGVMVSNKNVFANFEQI
VADFFAPEGGVVPPDLTVVSWLPLYHDMGLLLGAIMPILAGVPTVLTSPV
GFLQRPARWIQLLARNGRTISAGPNFAFELAVRKTSDDDMDGLDLAGVHT
ILNGSERVHPATLKRFAERFGRFNFAAAALRPAYGMAEATVYIATRNVNE
PPEIVDFESEKLPAGQAIRCPSGSGTPLVSYGVPRSQLVRIVDPDTCIEC
PQGSVGEIWVQGGNVASGYWHKPEESKRTFGARIVTPSAGTPEAPWLRTG
DSGFVSGGELFIIGRIKDLLIVYGRNHAPDDIEATIQEITSGRCAAIAVP
DHGTEKLVAIIELKKRGDSDEDVADRLRIVKRDVAAAIFDSHGLSVADLV
LVSPGSIPITTSGKIRRAQCVQLYRRREFTRLDA
>MT2021 virulence factor mce family protein
MTTKLRRARSVLATALVLVAGVILAMRTADAAARTTVVAYFDNSNGVFAG
DDVLIRGVPVGKIVKIEPQPLRAKISFWFDRKYRVPADAAAAILSPQLVT
GRAIQLTPPYAGGPTMADGTVIPQERTVVPVEWDDLRAQLQRLTALLQPT
RPGGVSTLGALINTAADNLRGQGATIRDTIIKLSQAISALGDHSKDIFST
VTNLSTLVTALHDSADLLERLNHNLAAVTSLLADGPDKIGQAAEDLNAVV
ADVGSFAAEHREAIGTASDKLASITTALVDSLDDIKQTLHISPTVLQNFN
NIFEPANGALTGALAGNNMANPIAFLCGAIQAASRLGGEQAAKLCVQYLA
PIVKNRQYNYPPLGANLFVGAQARPNEVTYSEDWLRPDYVAPVADTPPDP
AAAVTVDPATGLRGMMMPPGGGS
>MT3028 conserved hypothetical protein
MLEVGAGIGDHTQFFLDRGCKVLCTEPRGENLDVIRQRFGSNPNVTVDHL
DLDGDLPAEAHQYDVVYCYGVLYHLSRPAEALAWMCDRAVDLLLLETCVS
YSGEDEPFLVSERASSPSQAITGTGCRPSRVWVMNRLREKMPHVYVTATQ
PRHRQFPLDWRANGPIASTGLARAVFVASRAPLNLPTLVEELPMVQRRC
>MT3604 conserved hypothetical protein
MSYDVTIRFRRFFSRLQRPVDNFGEQALFYGETMRYVPNAITRYRKETVR
LVAEMTLGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGDIGIEALT
GFLSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAV
HSVSYLVSTRLIAGLVAIIPLYSLSVLAAFFAARFTTVFVNGQSAGLYDH
YFNTFLIPSDLLWSFMQAIAMSIAVMLVHTYYGYNASGGSVGVGVAVGQA
VRTSLIVVVVITLFISLAVYGASGNFNLSG
>MT3352 conserved hypothetical protein
MTGRVGNPKDHAVVIGASIAGLCAARVLSDFYSTVTVFERDELPEAPANR
ATVPQDRHLHMLMARGAQEFDSLFPGLLHDMVAAGVPMLENRPDCIYLGA
AGHVLGTGHTLRKEFTAYVPSRPHLEWQLRRRVLQLSNVQIVRRLVTEPQ
FERRQQRVVGVLLDSPGSGQDREREEFIAADLVVDAAGRGTRLPVWLTQW
GYRRPAEDTVDIGISYASHQFRIPDGLIAEKVVVAGASHDQSLGLGMLCY
EDGTWVLTTFGVADAKPPPTFDEMRALADKLLPARFTAALAQAQPIGCPA
FHAFPASRWRRYDKLERFPRGIVPFGDAVASFNPTFGQGMTMTSLQAGHL
RRALKARNSAMKGDLAAELNRATAKTTYPVWMMNAIGDISFHHATAEPLP
RWWRPAGSLFDQFLGAAETDPVLAEWFLRRFSLLDSLYMVPSVPIIGRAI
AHNLRLWLKEQRERRQPVTTRRSP
>MT0938 dioxygenase, putative
MDITIVGKYLSTLPEDDDHPYRTGPWRPQTTEWDADDLTTVTGEVPADLD
GIYLRNTENPLHPAFATYHPFDGDGMIHVVGFRDGKAFYRNRFIRTDGFL
AENEAGGPLWPGLAEPVQLAKREHGWGARGLMKDASSTDVIVHRGIALTS
FYQCGDLYRIDPYSANTLGKESWHGRFPFDWGVSAHPKVDNKTGELLFFN
YSKQEPYMRYGVVDQNNELVHYVDVPLPGPRLPHDMAFTENYVILNDFPL
FWDPRLLERDVHLPRFYPEIPSRFAVVARRGNDIRWFEADPTFVLHFTNA
YEQGDEIVLDGFYEGDPQPLDTGGTKWEKLFRFLALDRLQSRLHRWRLNM
VTGAVHEEQLSESITEFGTINADYAASSYRYTYAATGKPSWFLFDGLVKH
DLLTGNHECYSFGDGVYGSETAMAPRVGSSAEDDGYLVTLTTDMNDDASY
CLVFDAARPGDGPICKLALPERISSGTHSAWVPGAELRRWDHAESPAAAV
GL
>MT2059 conserved hypothetical protein
MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADCCWNQLA
VTPDTRMPASSAAGRDAAAYDAWYDSPTGRPILATEVAALRPLIEVFAQP
RLEIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVS
RHFGAVLMAFTLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLY
ALRAARGQPGYRDARFYTAAELEQLLADSGFRVIARRCTLHQPPGLARYD
IEAAHDGIQAGAGFVAISAVDQAHEPKDDHPLESE
>MT0542 methyltransferase-related protein
MGGCSITCLNISEVPNETNRKKNRQAGLDRSIRVIHGSFDDIPEPDSGYD
VVWSQDAILHAPDRRKVLEEAFRVLRPGGELIFTDPMQADDVPDGVLQPV
YDRLNLRDLGSMRFYA
>MT1295 P450 heme-thiolate protein
MTSVMSHEFQLATAETWPNPWPMYRALRDHDPVHHVVPPQRPEYDYYVLS
RHADVWSAARDHQTFSSAQGLTVNYGELEMIGLHDTPPMVMQDPPVHTEF
RKLVSRGFTPRQVETVEPTVRKFVVERLEKLRANGGGDIVTELFKPLPSM
VVAHYLGVPEEDWTQFDGWTQAIVAANAVDGATTGALDAVGSMMAYFTGL
IERRRTEPADDAISHLVAAGVGADGDTAGTLSILAFTFTMVTGGNDTVTG
MLGGSMPLLHRRPDQRRLLLDDPEGIPDAVEELLRLTSPVQGLARTTTRD
VTIGDTTIPAGRRVLLLYGSANRDERQYGPDAAELDVTRCPRNILTFSHG
AHHCLGAAAARMQCRVALTELLARCPDFEVAESRIVWSGGSYVRRPLSVP
FRVTS
>MT3071 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase, putative
MTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNYADHIAE
MGGRPPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACK
DVPAAQAVDNILGYTIGNDVSARDQQQSDGQWTRAKGHDTFCPVGPWIVT
DLAPFDPADLELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGD
LILTGTPAGVGPIEDGDTVSITIEGIGTLTNPVVRKGKP
>MT0344 MitM-related protein
MRLTHPARRYLSSQAARPTGAFGRLLGRIWRAETADVNRIAVELLAPGPG
ERVCEIGFGPGRTLGLLAAAGAQVSGVEVSTTMIAIAAHHNAKAIAAGLI
SLYHGDGVTLPVADHSLDKVLGVHNFYFWPDPRASLCDIARALRPGGRLV
LTSISDDQPLAARFDPAIYRVPPTLDTAAWLGAAGFIDVGIKRSADHPAT
VWFTATAT
>MT3397 esterase, putative
MPWARMLSLIVLMVCLAGCGGDQLLARHASSVATFQFGGLTRSYRLHVPP
AEPSGLVISLHGGGGTGAGQEALTDFDAVADAADLLVVYPDGYDKSWADG
RGASPADRRHLDDVGFLVALAAKLVHDFDIAPGHVFATGMSNGGFMSNRL
ACDRADIFAAVAPVAGTLGVGVTCNPSRPVSVLEAHGTADPLVPFNGGAV
RGRGGLSHSISVASLVDRWRAVDGCQGDPSAAELPDVGDGTMVHLFDSSS
CAAGTEVISYQIDNGGHTWPGGRQYLPKAVIGATTRAFDGSQVIAQFFAT
HGRD
>MT1375 conserved hypothetical protein
MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGAVDCRGG
APGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVA
MDSGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVG
VGARAGALKGGVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADL
VGEFALRAPPAEQIAALAQLSSPLGAFNTPFNTTIGVIACDAALSPAACR
RIAIAAHDGLARTIRPAHTPLDGDTVFALATGAVAVPPEAGVPAALSPET
QLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTYRDMFPGAFGS
>MT3021.1 polyketide synthase
MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQRATEPVA
VVGIGCRFPGGVDGPDGLWDVVSAGRDVVSEFPTDRGWDVEGLYDPDPDA
EGKTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEH
AGIDPLSLRGSATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVS
YVLGLQGPAVSVDTACSSSLVAIHWAMSSLRSGECDLALAGGVTVMGLPS
IFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVVVLERLSDARRLGHS
VLAVVRGSAVNQDGASNGLTAPNGLAQQRVIQAALANAGLSAADVDVVEA
HGTATTLGDPIEAQALLSTYGQGGPAEQPLWVGSIKSNMGHTQAAAGVAG
VIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRR
AAVSSFGISGTNAHLILEEAPVPAPAEAPVEASESTGGRGRRWCRG
>MT3203 P450 heme-thiolate protein
MTSTSIPTFPFDRPVPTEPSPMLSELRNSCPVAPIELPSGHTAWLVTRFD
DVKGVLSDKRFSCRAAAHPSSPPFVPFVQLCPSLLSIDGPQHTAARRLLA
QGLNPGFIARMRPVVQQIVDNALDDLAAAEPPVDFQEIVSVPIGEQLMAK
LLGVEPETVHELAAHVDAAMSVCEIGDEEVSRRWSALCTMVIDILHRKLA
EPGDDLLSTIAQANRQQSTMTDEQVVGMLLTVVIGGVDTPIAVITNGLAS
LLHHRDQYERLVEDPGRVARAVEEIVRFNPATEIEHLRVVTEDVVIAGTA
LSAGSPAFTSITSANRDSDQFLDPDEFDVERNPNEHIAFGYGPHACPASA
YSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT
>MT0861 methyltransferase, UbiE/COQ5 family
MNDKRRAIYTHGYHESVLRSHRRRTAENSAGYLLPYLVPGLSVLDVGCGP
GTITVDLAARVVPGSVTGVEPTDDALSLARAEAQLHRLSNISFTTSDVHK
LDFPDDAFDVVHAHQVLQHVADPVRALQEMRRVCTPGGIVAARDADYSGF
IWFPKLPALDRWLDLYERAARANGGEPDAGRRLLSWARAAGFDDVTPTAS
VWCFATASAREWWGLVWADRILQSDLAHQLVDSGLATAAQLEEISTAWRE
WAAAPDGWLAIPHGEILCRA
>MT1706 P450 heme-thiolate protein
MRTYRTVRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFAN
ADAFSWSQTFESLVPVDGPTALIVSDGADHRRRRSVVAPGLRHHHVQRYV
ATMVSNIDTVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFL
GEQLQPLLDLTRRPPQVMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADA
RTAPRPDDHMLTTLISGCSEEGTTLSDNEIRDSIVSLITAGYETTSGALA
WAIYALLTVPGTWESAASEVARVLGGRVPAADDLSALTYLNGVVHETLRL
YSPGVISARRVLRDLWFDGHRIRAGRLLIFSAYVTHRLPEIWPEPTEFRP
LRWDPNAADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVARAM
LQLPAQRTHRIRAANFAALRPWPGLTVEIRKSAPAQ
>MT1421 conserved hypothetical protein
MSPSPTYTPPKLASMPGIDFDALYRGESPGEGLPPITTPPWDTKAPKDNV
IGWHTGGWVHGDVLDIGCGLGDNAIYLARNGYQVTGLDISPTALTTAKRR
ASDAGVDVKFAVGDATKLTGYTGAFDTVIDCGMFHCLDDDGKRSYAASVH
RATRPGATLLLSCFSNAMPPDEEWPRSTVSEQTLRDVLGGAGWDIESLEP
ATVRRELDGTEVEMAFWNVRAQRRGS
>MT3934 acyl-CoA synthase
MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQVYRRTLN
VAAEVRRHAAIGDRAVILAPQGLDYIVAFLGALQAGLIAVPLSAPLGGAS
DERVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLD
SPIRSNIVDDSLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYF
ADTGAVPPLDLFIMSWLPFYHDMGLVLGVCAPIIVGCGAVLTSPVAFLQR
PARWLQLMAREGQAFSAAPNFAFELTAAKAIDDDLAGLDLGRIKTILCGS
ERVHPATLKRFVDRFSRFNLREFAIRPAYGLAEATVYVATSQAGQPPEIR
YFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTNTECPPGTI
GEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFV
SEDKFFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAIAVPSNGVE
KLVAIVELNNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIP
ITTSGKVRRAECVKLYRHNEFTRLDAKPLQASDL
>MT1698 chalcone/stilbene synthase family protein
MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHASAKVNSR
HLVLPLEKYPKLTDFGEANKIFIEKAVDLGVQALAGALDESGLRPEDLDV
LITATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHD
YLRGAPDGVAALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGV
KRAQDIGADGPDILDSRSHLYPDSLRTMGYDVGSAGFELVLSRDLAAVVE
QYLGNDVTTFLASHGLSTTDVGAWVTHPGGPKIINAITETLDLSPQALEL
TWRSLGEIGNLSSASVLHVLRDTIAKPPPSGSPGLMIAMGPGFCSELVLL
RWH
>MT0619 virulence factor mce family protein
MGTPIRREHDQPMKTTGTTIKLGIVWLVLSVFTVMIIVVFGQVRFHHTTG
YSAVFTHVSGLRAGQFVRAAGVEVGKVAKVTLIDGDKQVLVDFTVDRSLS
LDQATTASIRYLNLIGDRYLELGRGHSGQRVAPGATIPLEHTHPALDLDA
LLGGFRPLFQTLDPDKVNSIASSIITVFQGQGATINDILDQTASLTATLA
DRDHAIGEVVNNLNTVLATTVKHQTEFDRTVDKLEVLITGLKNRADPLAA
AAAHISSAAGTLADLLGRIVHCCTAASGTSRASSSRS
>MT1633 hypothetical protein
MARTFEDLVAEAASASVGGWDFSWLDGRATEERPSWGYQRQLSQRLANAT
AALDLETGGGEVLAGAGNFPPTMVATEAWPPNAAMATRRLHPLGAVVVIT
GDKPPLPFADAAFDLVTSRHPSTRWWTEIARVLRAGGSYFAQHVGPATLW
DLREHFLGPREHNGADQYAQVVRTCITDAGLEIVDLQMERLRVEFFDVGA
VIYFLRKVIWFLPDFTVEGYHDRLRALHERIQAEGPFVTYSTRALIEARK
PS
>MT3018 polyketide synthase
MVPWVISARSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRA
VVVGASREQLIAGLAGLAAGEPGAGVAVGQPGSVGKTVVVFPGQGAQRIG
MGRELYGELPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEFAQ
PALFAVEVASFAVLRDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLV
VARGRLMQALPAGGAMVAVAASEDEVEPLLGEGVGIAAINAPESVVISGA
QAAANAIADRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQARE
PQLGLVSNVTGELAGPDFGSAQYWVDHVRRPVRFADSARHLQTLGATHFI
EAGPGSGLTGSIEQSLAPAEAMVVSMLGKDRPELASALGAAGQVFTTGVP
VQWSAVFAGSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHA
LLGAVVERPDSDEVVLTGRLSLADQPWLADHVVNGVVLFPGAGFVELVIR
AGDEVGCALIEELVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYSRG
DQSQGWLLNAEGMLGVAAAETPMDLSVWPPEGAESVDISDGYAQLAERGY
AYGPAFQGLVAIWRRGSELFAEVVAPGEAGVAVDRMGMHPAVLDAVLHAL
GLAVEKTQASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCD
ATGLPVLTVRSLVTRPITAEQLRAAVTAAGGASDQGPLEVVWSPISVVSG
GANGSAPPAPVSWADFCAGSDGDASVVVWELESAGGQASSVVGSVYAATH
TALEVLQSWLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQA
ENPGRIVLIDTDAAVDASVLAGVGEPQLLVRGGTVHAPRLSPAPALLALP
AAESAWRLAAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVGVNFRDV
VAALGMYPGQAPPLGAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAV
VDQQLVTRVPQGWSFAQAAAVPVVFLTAWYGLADLAEIKAGESVLIHAGT
GGVGMAAVQLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEE
KFLAVTEGRGVDVVLDSLAGEFVDASLRLLVRGGRFLEMGKTDIRDAQEI
AANYPGVQYRAFDLSEAGPARMQEMLAEVRELFDTRELHRLPVTTWDVRC
APAAFRFMSQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARH
LVGAYGVRHLVLASRRGDRAEGAAELAADLTEAGAKVQVVACDVADRAAV
AGLFAQLSREYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAAWN
LHQATSDLDLSMFALCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGL
AGISLAWGLWEQPGGMTAHLSSRDLARMSRSGLAPMSPAEAVELFDAALA
IDHPLAVATLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDATSS
KSALAQRLHGLAADEQLELLVGLVCLQAAAVLGRPSAEDVDPDTEFGDLG
FDSLTAVELRNRLKTATGLTLPPTVIFDHPTPTAVAEYVAQQMSGSRPTE
SGDPTSQVVEPAAAEVSVHA
>MT0234 conserved hypothetical protein
MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAMIGDLWL
ATHSEPPVGRTLLDVGGGPGYFATAFSDAGVGYIGVEPDPDEMHAAGPAF
TGRPGMFVRASGMALPFADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKP
GGLVVLSYTVWLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSS
LFAVSAAEGLRWAAGTGAALAVFPRYHPRWAWWLTSVPVLREFLVSNLVL
VLTP
>MT1793 very-long-chain acyl-CoA synthetase, putative
MTDTIQSLLRQHVSDPTIAVKYGGLQWTWSQYLAESAARAAALITIADPQ
RPTHIGSLLGNTPEMLAQLAAAGLGGYVLCGLNTTRRGDALAADVRRADC
QIVVTDADHRALLDGLDLAGARILDTSTPRWAELVAGDGAFVPYREVDTM
DPFMMIFTSGTSGNPKAVPVSHLMATFAGRSLTERFGLTEQDTCYVSMPL
FHSNAVVAGWAPAVVSGAAIAPATFSATGFLDDVRRYHATYMNYVGKPLA
YILATPERDDDADNPLRVAFGNEANDKDIEEFSRRFGVQVEDGFGSTENA
VIVIREPGTPPGSIGRGAHGVAVYNGETVTECAVARFDAHGALTNADEAI
GELVNTTGSGFFTGYYNDPEANAERMRHGMYWSGDLAYRDSEGWIYLAGR
TADWMRVDGENLTAAPIERILLRYKAINRVAVYAVPDEYVGDQVMAALVL
RAGDTFDPDAFEAFLDAQPDLSTKARPRYIRIAADLPSTATHKVLKRQLI
DEGTAVGKADTLWVREPRGSAYHHASGPAKAI
>MT0179 virulence factor mce family protein
MKITGTVVKLGIVSVVLLFFTVMIIVIFGQMRFDRTNGYTAEFSNVSGLR
QGQFVRASGVEIGKVKALHLVDGGRRVRVEFNIDRSVPLYQSTTAQIRYS
DLIGNRYVELKRGEGKGANDLLPPGGLIPLSRTSPALDLDALIGGFKPVF
RALDPAKVNNIANALITVFQGQGGTINDILDQTAQLTSQIAERDQAIGEV
VKNLNIVLDTTVKHRKEFDETVNNLENLITGLRNHSDQLAGGLAHISNGA
GTVADLLAENRTLVRKAVSYLDAIQQPVIDQRVELDDLLHKTPTALTALG
RANGTYGDFQNFYLCDLQIKWNGFQAGGPVRTVKLFSQPTGRCTPQ
>MT0080 conserved hypothetical protein
MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAG
SACPDMNLVDLGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRA
RRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLT
RSGGHCWFLGGVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPW
QPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAGVDGIEHCTFL
SEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRR
LIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSRHGFTSTEVLTGATAAA
AASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL
QASAVGYNTPS
>MT3122 conserved hypothetical protein
MRARFGDRAPWLVETTLLRRRAAGKLGELCPNVGVSQWLFTDEALQQATA
APVARHRARRLAGRVVHDATCSIGTELAALRELAVRAVGSDIDPVRLAMA
RHNLAALGMEADLCRADVLHPVTRDAVVVIDPARRSNGRRRFHLADYQPG
LGPLLDRYRGRDVVVKCAPGIDFEEVGRLGFEGEIEVISYRGGVREACLW
SAGLAGSGIRRRASILDSGEQIGDDEPDDCGVRPAGKWIVDPDGAVVRAG
LVRNYGARHGLWQLDPQIAYLSGDRLPPALRGFEVLEQLAFDERRLRQVL
SALDCGAAEILVRGVAIDPDALRRRLRLRGSRPLAVVITRIGAGSLSHVT
AYVCRPSR
>MT0182 virulence factor mce family protein
MSVLARMRVMRHRAWQGLVLLVLALLLSSCGWRGISNVAIPGGPGTGPGS
YTIYVQMPDTLAINGNSRVMVADVWVGSIRAIKLKNWVATLTLSLKKDVT
LPKNATAKIGQTSLLGSQHVELAAPPDPSPVPLKDGDTIPLKRSSAYPTT
EQTLASIATLLRGGGLVNLEGIQQEINAIVTGRADQIRAFLGKLDTFTDE
LNQQRDDITRAIDSTNRLLAYVGGRSEVLNRVLTDLPPLIKHFADKQELL
INASDAVGRLSQSADQYLSAARGDLHQDLQALQCPLKELRRAAPYLVGAL
KLILTQPFDVDTVPQLVRGDYMNLSLTLDLTYSAIDNAFLTGTGFSGALR
ALEQSFGRDPETMIPDIRYTPNPNDAPGGPLVERGNRQC
>MT3649 P450 heme-thiolate protein
MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAA
PIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRF
KNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQER
AQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEM
TGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDG
EKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPE
TAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEV
FQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD
LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH
>MT1387 AMP-binding family protein
MGLVGEPTVELVAAIQGAWLAGAAVSILPGPVRGANDQRWADATLTRFLG
IGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVASEGPAVL
QGTAGSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWLPLYHDM
GLAFVLSAALAGAPLWLAPTTAFTASPFRWLSWLSDSGATMTAAPNFAYN
LIGKYARRVSEVDLGALRVTLNGGEPVDCDGLTRFAEAMAPFGFDAGAVL
PSYGLAESTCAVTVPVPGIGLLADRVIDGSGAHKHAVLGNPIPGMEVRIS
CGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPDDWFATGDLGYLGAG
GLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGTGDRSTR
PGLVVAAEFRGPDEANARAELIQRVASECGIVPSDVVFVSPGSLPRTSSG
KLRRLAVRRSLEMAD
>MT1231 conserved hypothetical protein
MAWQQPSPRIRELIREGARIALNPSPEWIEELDRATIAANPAIANDPVLA
KVVQTANRANLVYWAAANLRDPGARVPANLGTEPLRMARDLVRRGLDTVA
FNIYRTGEHIGWRFWMGIAFELTSDPQELRELLDVSARSVNDFIEATLTG
IAAQVQSEHDELTRSTHAERLEVVGLILDGAPISPERAEAKLGYPLSRAH
TAAIIWSDELDGDHSYLDRAADLFCHAVGSTRPLTVVAGAASRWAWVTDA
DGLDIDTVQAAVDNAPGARIAIGTTANGVEGFRRSHLEALITQRTLSRLR
STQRVAFFADVKMVALISQNPDAASEFITSTLGDLESASPDLQTALLTFI
NEQCNASRAAKRLHTHRNTFLRRLESAQRLLPRPLDHTSVHVAVALEALQ
WRGNKAHALSSPGRRSNSVPA
>MT2835 carboxymethylenebutenolidase, putative
MPKTTDTAATPDGTCAVRLFTPDGPGRWPGVVMFPDAGGVRDTFDRMAAK
LAGFGYVVLLPDVYYREGDWAPFDMKTAFGDPQERARIMFMIGTLTPDRV
TRDADALLNYLASRPEVIGDRFGVCGYCMGGRMSVVVAGRLPDRVAAAAA
FHPGGLVANSPDSPHLLADRISATVYIGGAENDPSFTADHAEKLDKAFSA
AGVPHRIECYPAAHGFAVPDNPSYDAAADERHWAAMTETFGAALN
>MT0715 oxidoreductase, short-chain dehydrogenase/reductase family
MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICAPVSGSV
TYPPATSEDLGETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGR
LDIVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAG
NGGSIVVVSSSAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNS
IHPYSVDTPMIEPEAMIQTFAKHPGYVHSFPPMPLQPKGFMTPDEISDVV
VWLAGDGSGALSGNQIPVDKGALKY
>MT3664 oxidoreductase
MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGADVVISD
HHERRLGETAAELSALGLGRVEHVVCDVTSTAQVDALIDSTTARMGRLDV
LVNNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGG
VIVNNASVLGWRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSP
SIARHKFLDKTASAELLDRLAAGEAFGRAAEPWEVAATIAFLASDYSSYL
TGEVISVSCQHP
>MT3003 polyketide synthase
MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGCRFPGGV
NNPEQFWDLLCAGRSGIVRVPAQRWDADAYYCDDHTVPGTICSTEGGFLT
SWQPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGT
QTSVFVGVTAYDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGAR
GPAVVIDTACSSSLVAVHLACQSLRGRESDMALVGGTNLLLSPGPSIACS
RWGMLSPEGRCKTFDASADGYVRGEGAAVVVLKRLDDAVRDGNRILAVVR
GSAVNQDGASSGVTVPNGPAQQALLAKALTSSKLTAADIDYVEAHGTGTP
LGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVAGLMKAVLA
VHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFG
VSGTNAHVVIEQAPDPMAAAGTEPQRGPVPAVSTLVVFGKTAPRVAATAS
VLADWLDGPGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRAL
AAGQSAPGVVAPREGSIGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAI
AELEPEFVAQGGFSLRDVIAGGKELVGIEQIQLGLIGMQLALTALWRSYG
VTPDAVIGHSMGEVAAAVVAGALTPAQGLRVTAVRSRLMAPLSGQGTMAL
LELDAEATEALIADYPEVSLGIYASPRQTVISGPPLLIDELIDKVRQQNG
FATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLGIS
LGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSIS
DTLRASYDVDNYLRIGTLQRDAHDTLEFHTNLNTTHTTHPPQTPHPPEPH
PVLPTTPWQHTQHWITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELD
PDLLWLADHVIDDLVVLPGAAYAEIALAAATDTFAVEQDQPWMISELDLR
QMLHVTPGTVLVTTLTGDEQRCQVEIRTRSGSSGWTTHATATVARAEPLA
PLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGPAFQGIVGLAVTQ
AGVARAQVRLPASARTGSREFMLHPVMMDIALQTLGATRTATDLAGGQDA
RQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTD
ANGQPLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGAL
LLIGDPAAGDPLLPALQSSLRDRITDLELASAADEATLRAAISRTSWDGI
VVVCPPRANDESMPDEAQLELARTRTLLVASVVETVTRMGARKSPRLWIV
TRGAAQFDAGESVTLAQTGLRGIARVLTFEHSELNTTLVDIEPDGTGSLA
ALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSGDLAAEARHQVVNLD
SSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAAGLNFSD
VLKAMGVYPGLDGAAPVIGGECVGYVTAIGDEVDGVEVGQRVIAFGPGTF
GTHLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVL
IHSATGGVGMAAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSV
DFADEILELTDGYGVDVVLNSLAGEAIQRGVQILAPGGRFIELGKKDVYA
DASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVL
PVTAFSLHDAADAFRLMASGKHTGKIVISIPQHGSIEAIAAPPPLPLVSR
DGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVAAAIAELNA
SGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNM
TDSAARRVFAPKVTGSWRLHVATAARDVDWWLTFSSAAALLGTPGQGAYA
AANSWVDGLVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQG
LAAMQAVLTADRGRTGVFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSG
QRRGGGAIRAQLDALDAAERPGHLASAIADEIRAVLRSGDPIDHHRPLET
LGLDSLMGLELRNRLEASLGITLPVALVWAYPTISDLATALCERMDYATP
AAAQEISDTEPELSDEEMDLLADLVDASELEAATRGES
>MT1393 oxidoreductase, short-chain dehydrogenase/reductase family
MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEATEVAAKR
LGGDDVALAVRCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMR
TMTEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMV
GQTNYSAAKAGIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRI
WDQKLAEVPMGRAGEPSEVASVAVFLASDLSSYMTGTVLDVTGGRFI
>MT0340 hypothetical protein
MGPKGSLRLVKRQPELLVAQHEHWQDTYRAHPVLYGTRPSEPGVYAAEVF
NADGVQRVLELAAGHGRDTLYFAG
>MT2045.1 hypothetical protein
MGRLEGKVAFITGVARGQGRSHAVRLADGQARALGKVDVEACGALVGEVE
VWGRDVRDDRRVFVESPADEFGACRRVARQGIRVVGLPVSQRELVEPEAG
CAARRSAAGSQ
>MT3598 virulence factor mce family protein
MIDRLAKIQLSIFAVITVITLSVMAIFYLRLPATFGIGTYGVSADFVAGG
GLYKNANVTYRGVAVGRVESVGLNPNGVTAHMRLNSGTAIPSNVTATVRS
VSAIGEQYIDLVPPENPSSTKLRNGFRIQRQNTRIGQDVADLLRQAETLL
GSLGDTRLRELLHEAFIATNGAGPELARLIESARLLVDEANANYPQVSQL
IDQAGPFLQAQIRAGGDIKSLADGLARFTWQLRAADPRLRDTLADAPDAI
DEANTAFSGIRPSFPALAASLANLGRVGVIYHKSIEQLLVVFPALFAAII
TSAGGVPQDEGAKLDFKIDLHDPPPCMTGFLPPPLVRSPADESVREIPRD
MYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPRGYVPVGTNPWR
GPPIPYGTEVTDGRNILPPNKFPYIPPGADPDPGVPIVGPPPPGQVAGPG
PAPHQPAQPAPPPNDNGPPPPFTSWMPPGYPPEPPQVPYPATIPPPPPPE
GTGPPPGPAPGPQPQASGPAYTIYDQLSGAFADPAGGTGIFAPGMTGASS
AENWVDLMRDPRQL
>MT2667 substrate--CoA ligase, putative
MSINDQRLTRRVEDLYASDAQFAAASPNEAITQAIDQPGVALPQLIRMVM
EGYADRPALGQRALRFVTDPDSGRTMVELLPRFETITYRELWARAGTLAT
ALSAEPAIRPGDRVCVLGFNSVDYTTIDIALIRLGAVSVPLQTSAPVTGL
RPIVTETEPTMIATSIDNLGDAVEVLAGHAPARLVVFDYHGKVDTHREAV
EAARARLAGSVTIDTLAELIERGRALPATPIADSADDALALLIYTSGSTG
APKGAMYRESQVMSFWRKSSGWFEPSGYPSITLNFMPMSHVGGRQVLYGT
LSNGGTAYFVAKSDLSTLFEDLALVRPTELCFVPRIWDMVFAEFHSEVDR
RLVDGADRAALEAQVKAELRENVLGGRFVMALTGSAPISAEMTAWVESLL
ADVHLVEGYGSTEAGMVLNDGMVRRPAVIDYKLVDVPELGYFGTDQPYPR
GELLVKTQTMFPGYYQRPDVTAEVFDPDGFYRTGDIMAKVGPDQFVYLDR
RNNVLKLSQGEFIAVSKLEAVFGDSPLVRQIFIYGNSARAYPLAVVVPSG
DALSRHGIENLKPVISESLQEVARAAGLQSYEIPRDFIIETTPFTLENGL
LTGIRKLARPQLKKFYGERLERLYTELADSQSNELRELRQSGPDAPVLPT
LCRAAAALLGSTAADVRPDAHFADLGGDSLSALSLANLLHEIFGVDVPVG
VIVSPASDLRALADHIEAARTGVRRPSFASIHGRSATEVHASDLTLDKFI
DAATLAAAPNLPAPSAQVRTVLLTGATGFLGRYLALEWLDRMDLVNGKLI
CLVRARSDEEAQARLDATFDSGDPYLVRHYRELGAGRLEVLAGDKGEADL
GLDRVTWQRLADTVDLIVDPAALVNHVLPYSQLFGPNAAGTAELLRLALT
GKRKPYIYTSTIAVGEQIRPEAFTEDADIRAISPTRRIDDSYANGYANSK
WAGEVLLREAHEQCGLPVTVFRCDMILADTSYTGQLNLPDMFTRLMLSLA
ATGIAPGSFYELDAHGNRQRAHYDGLPVEFVAEAICTLGTHSPDRFVTYH
VMNPYDDGIGLDEFVDWLNSPTSGSGCTIQRIADYGEWLQRFETSLRALP
DRQRHASLLPLLHNYREPAKPICGSIAPTDQFRAAVQEAKIGPDKDIPHL
TAAIIAKYISNLRLLGLL
>MT0176 conserved hypothetical protein
MTTSTTLGGYVRDQLQTPLTLVGGFFRMCVLTGKALFRWPFQWREFILQC
WFIMRVGFLPTIMVSIPLTVLLIFTLNILLAQFGAADISGSGAAIGAVTQ
LGPLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV
LASMLVATLLNGLVITVGLVGGFLFGVYLQNVSGGAYLATLTLITGLPEV
VIATIKAATFGLIAGLVGCYRGLTVRGGSKGLGTAVNETVVLCVIALFAV
NVILTTIGVRFGTGR
>MT0788 P450 heme-thiolate protein
MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQLAGKQV
VLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEML
HNAALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSA
CLIGKKFRDQLDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNG
LVALVADIMNGRIANPPTDKSDRDMLDVLIAVKAETGTPRFSADEITGMF
ISMMFAGHHTSSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHA
LRQIPQLENVLKETLRLHPPLIILMRVAKGEFEVQGHRIHEGDLVAASPA
ISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRHRCVGAAFA
IMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTG
V
>MT2836 oxidoreductase, short-chain dehydrogenase/reductase family
MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVVLTARRQEAADEAAA
QVGDRALGVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLL
EQDHARFAKIFDVNLWAPLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSP
AMGMYNATKAALIHVTKQLALELSPRIRVNAICPGVVRTRLAEALWKDHE
DPLAATIALGRIGEPADIASAVAFLVSDAASWITGETMIIDGGLLLGNAL
GFRAAPSTEH
>MT0074 oxidoreductase, short-chain dehydrogenase/reductase family
MTKWTAADIPDQTGRTAVITGANTGLGFETAAALAAHGAHVVLAVRNLDK
GKQAAARITEATPGAEVELQELDLTSLASVRAAAAQLKSDHQRIDLLINN
AGVMYTPRQTTADGFEMQFGTNHLGHFALTGLLIDRLLPVAGSRVVTISS
VGHRIRAAIHFDDLQWERRYRRVAAYGQAKLANLLFTYELQRRLAPGGTT
IAVASHPGVSNTELVRNMPRPLVAVAAILAPLMQDAELGALPTLRAATDP
AVRGGQYFGPDGFGEIRGYPKVVASSAQSHDEQLQRRLWAVSEELTGVVY
PVG
>MT3589 oxidoreductase, short-chain dehydrogenase/reductase family
MVQNGENLFQFRREGPQVQLSFQDRTYLVTGGGSGIGKGVAAGLVAAGAA
VMIVGRNPDKLAAAVKDIEALKTGAIGYEPADITDEEQTLRVVDAATAWH
GRLHGVVHCAGGSQTIGPITQIDSQAWRRTVDLNVNGTMYVLKHAARELV
RGGGGSFVGISSIAASNTHRWFGAYGVTKSAVDHMMKLAADELGPSWVRV
NSIRPGLIRTDLVVPVTESPELSADYRVCTPLPRVGEVEDVANLAMFLLS
DAASWITGQVINVDGGHMLRRGPDFSPMLEPVFGADGLRGVVG
>MT0153 conserved hypothetical protein
MTELDDVSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAATNPLIR
DEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVHRVACDYQAVRTHFF
DEYFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKA
GILQSHGAVPTARRHAVAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPY
LPGDAADRLFDMVTALSAPGSQVAVEAFTMNTKGNTQRWNRMRERLGLDI
DVQALTYHEPDRSDAAQWLATHGWQVHSVSNREEMARLGRAIPQDLVDET
VRTTLLRGRLVTPAQPA
>MT3123 hypothetical protein
MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENYDEKWSI
SYDQRCVDYARGRFDAIVPDEVIAQLPYDRALELGCGTGFFLLNLIQAGV
ARRGSVTDLSPGMVKVATRNGQALGLDIDGRVADAEGIPYDDDAFDLVVG
HAVLHHIPDVELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWR
VVTNATKLPGLRGWRRPQGELDESSRAAALEALVDLHTFTPQDLQRIAHN
AGAVEVQTATEEFTAAMLGWPLRTFECTVPPGRLGWGWARFAFTSWKTLG
WVDANVWRHVVPKGWFYNVMITGVKPS
>MT2821 conserved hypothetical protein, truncation
MARNPAAQTAFGPMVLAAVEQNEPPGRRLVDDDLADLFLPRPLRWLAGAT
RSAVLRRLLISASEWSGRGLWANLACRKRFIGDKLDEALGDIDAVVILGA
GLDTRAYRLTRRVRMPVFEVDLPVNIARKAKTVRRVLGELPLSVRLVALD
FEHDDLLTALAEHGYRTEYRVFFVCEGVTQYLTERAVRRTLEGLRAAAPG
SRMVFTYVRRDFIDGTNRYGTRTLYHTVRQRRQLWHFGLDPEEVAGFLAD
YGWRLTEQAGPEELVQRYVEPTGRNLNASQIEWSAYAEKSEPVTPR
>MT3616 substrate--CoA ligase, putative
MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLAHHLIDQ
GVQKDDKVGLYCRNRIEIVIAMLGIVKAGAILVNVNFRYVEGELRYLFDN
SDMVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYS
AIAAGSPERDFGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTD
FATGEFVKDEYDLAKAAAANPPMIRYPIPPMIHGATQSATWMALFSGQTT
VLAPEFNADEVWRTIHKHKVNLLFFTGDAMARPLVDALVKGNDYDLSSLF
LLASTAALFSPSIKEKLLELLPNRVITDSIGSSETGFGGTSVVAAGQAHG
GGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYYKDEKKTAE
TFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAA
LKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAELDSFVRSEIA
GYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG
>MT3423 hypothetical protein
MSVQTDPALREHPNRVDWNARYERAGSAHAPFAPVPWLADVLRAGVPDGP
VLELASGRSGTALALAAHGRQVTAIDVSDVALLQLDSEAVRRGVADRLNL
VQADLGCWEPGETRFALVLSRLFWDAAIFHRACEAVMPGGVLAWESLALS
GAEAGTASAKRRVKPGEPACLLPADFTVVHEGQGNCDSAPSRIMIARRSP
LPGA
>MT0751 conserved hypothetical protein
MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLINDQFAEPL
VRAVGVDVLTRLASGELTASDIDDPERPNASMVRMAEHHAVRTKFFDEFF
MDATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLA
ELGATPTADRRVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPE
AQDRLLDNVTALSVPDSRFATESIRNFKPHHEERMRERMTILANRWRAYG
FDLDMNELVYFGDRNEPASYLSDNGWLLTEIKSQDLLTANGFQPFEDEEV
PLPDFFYVSARLQRKHRQYPAHRKPAPSWRHTACPVNELSKSAAYTMTRS
DAHQASTTAPPPPGLTG
>MT2749 conserved hypothetical protein
MTAQFDPADPTRFEEMYRDDRVAHGLPAATPWDIGGPQPVVQQLVALGAI
RGEVLDPGTGPGHHAIYYAAKGYAATGIDGSVAAIERARDNARKAGVSVN
FQVGDATTLDGLDGRFDTVVDCAFYHTFSTAPELQRCYVRALRRASKPGA
RLYMFEFGEHNVNGFSMPRSLSEDDFRQVLPVGGWEITYLGTTTYQVNLS
VEALELMAARNPDMADQVRCVLERFRAIKPWLVGGRVHAPFWEVHATRVD
>MT0108 substrate--CoA ligase, putative
MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRYRELVAE
VGGLAADLRAQSVSRGSRVLVISDNGPETYLSVLACAKLGAIAVMADGNL
PIAAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRE
SEHSLDAASLAGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDI
LQKEGLNWVTWVVGETTYSPLPATHIGGLWWILTCLMHGGLCVTGGENTT
SLLEILTTNAVATTCLVPTLLSKLVSELKSANATVPSLRLVGYGGSRAIA
ADVRFIEATGVRTAQVYGLSETGCTALCLPTDDGSIVKIEAGAVGRPYPG
VDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNPERTAEVLI
DGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGV
REAACYEIPDEEFGALVGLAVVASAELDESAARALKHTIAARFRRESEPM
ARPSTIVIVTDIPRTQSGKVMRASLAAAATADKARVVVRG
>MT0040 AMP-binding family protein
MTAALLSPAIAWQQISACTDRTLTITCEDSEVISYQDLIARAAACIPPLR
RLDLKRGEPVLITAHTNLEFLSCFLGLMLHGAVPVPIPPREALKTTERFM
TRLGPLLRHHRVLICTPAEHDEIRAAASTDCQISRFTALAEAGDEQFGRA
TAQQLADTATADWPLCTLDDDAYVQYTSGSTAAPRGVVITYRNLLSNMRA
MAVGSQFQHGDVMGSWLPLHHDMGLVGSLFAALFNSVSAVFTTPHRFLYD
PLGFLRLLTSSGATHTFMPNFALEWLINAYHRRGADIEGIDLHKMRRLII
ASEPVHAEGMRRFAATFAGVGLAPTALGSGYGLAEATVAVSMSAPNTGFR
TETHAAAEVVTGGRVLPGYEVRIDAAPGARAGTIKLRGDSVAAKAYVGGK
KLDALDEEGFCDTHDLGFLVDDEIVILGRQDEVFIVHGENRFPYDIEFII
RGESEQHRTKVACFGVNERVVVVLESPLDSIIDKAEADRLRCQVVAATGL
QLDELITVRRGAIPTTTSGKLKRRAVAQAYRDGTLPRLATHAWTADPDSA
PKTTRSSLEGAH
>MT1283 oxidoreductase, short-chain dehydrogenase/reductase family
MEGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLADTEHRL
KAISTPVKTDRLDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIE
VSQFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVINISSVFGLFSAP
GQAAYNSAKFAVRGFTEALRQEMALAGHPVKVTTVHPGGVKTAIARNATA
AEGLDQAELAETFDKRVAHLSPQRAAQIILTGVAKNKARVLVGVDAKVLD
LVVRLTGSGYQRIFPIITGRLIPRPR
>MT2019 virulence factor mce family protein
MRENLGGVVVRLGVFLAVCLLTAFLLIAVFGEVRFGDGKTYYAEFANVSN
LRTGKLVRIAGVEVGKVTRISINPDATVRVQFTADNSVTLTRGTRAVIRY
DNLFGDRYLALEEGAGGLAVLRPGHTIPLARTQPALDLDALIGGFKPLFR
ALNPEQVNALSEQLLHAFAGQGPTIGSLLAQSAAVTNTLADRDRLIGQVI
TNLNVVLGSLGAHTDRLDQAVTSLSALIHRLAQRKTDISNAVAYTNAAAG
SVADLLSQARAPLAKVVRETDRVAGIAAADHDYLDNLLNTLPDKYQALVR
QGMYGDFFAFYLCDVVLKVNGKGGQPVYIKLAGQDSGRCAPK
>MT3874 conserved hypothetical protein
MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARIFVDAAG
DGIWSMYTNRTLLAGATDLDPDLRAPIQQMIDFMAARTAFFDEYFLATAD
AGVRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQ
PASQLVNVPIDLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLL
FERIDALSRPGSWLASNVPGAGFLDPERMRRQRADMRRMRAAAAKLVETE
ISDVDDLWYAEQRTAVAEWLRERGWDVSTATLPELLARYGRSIPHSGEDS
IPPNLFVSAQRATS
>MT2323 oxidoreductase, short-chain dehydrogenase/reductase family
MAKDLVATVPDLSGKLAIITGANSGLGFGLARRLSAAGADVIMAIRNRAK
GEAAVEEIRTAVPDAKLTIKALDLSSLASVAALGEQLMADGRPIDLLINN
AGVMTPPERVTTADGFELQFGSNHLGHFALTAHLLPLLRAAQRARVVSLS
SLAARRGRIHFDDLQFERSYAPMTAYGQSKLAVLMFARELDRRSRAAGWG
IISNAAHPGLTKTNLQIAGPSHGRDKPALMERLYKTSWRFAPFLWQEIEE
GILPALYAAATPQADGGAFYGPRGRYEVAGGGVREAKVPAAARNDADSKR
LWEVSEQLTGVSYPKSR
>MT2697 conserved hypothetical protein
MANKRGNAGQPLPLSDRDDDHMQGHWLLARLGKRVLRPGGVELTRTLLAR
AEVTDADVLELAPGLGRTAAEILARNPRSYVGAESDPNAANLVRHVLAGR
GDVRVTDAADTGLSDASADVVIGEAMLTMQGNAAKHTIVAEAARVLRPGG
RYAIHELALVPDDVAEQVRTDLRQSLARALKVNARPLTVAEWSHLLAGHG
LVVEHVVTASMALLQPRRVIADEGLLGALRFAGNLLIHRAARRRVLLMRH
TFRRHRERLTAVAIVAHKPHVDS
>MT2820 oxidoreductase, short-chain dehydrogenase/reductase family
MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQIASVPYP
LSTADDLAATVELVEDAGGGIVARQGDVRDRASLSVALQAGLDEFGRLDI
VVANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVL
ISSAAGLVGIGSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHP
CGVDTPMINNEFFQQWLTTADMDAPHNLGNALPVELVQPTDIANAVAWLA
SEEARYVTGVTLPVDAGFVNKR
>MT3605 conserved hypothetical protein
MSMDTARAAFRRPFQFREFLDQTWMVARVSLVPTLLVSIPFTVLVAFTLN
ILLREIGAADLSGAGTAFGTITQLGPVVTVLVVAGAGATAICADLGARTI
REEIDAMRVLGIDPIQRLVVPRVLASTLVALLLNGLVCAIGLSGGYAFSV
FLQGVNPGAFINGLTVLTGLRELILAEIKALLFGVMAGLVGCYRGLTVKG
GPKGVGNAVNETVVYAFICLFVINVVMTAIGVRISAQ
>MT0154 conserved hypothetical protein
MSAMRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLLVTN
AGAGAIWEAMLDPTLVAKAAAIDAETAAIVAYLRSYQAVRTNFFDTYFAS
AVAAGIRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAEN
GVTPSAGRREVPADLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQ
DRLFTQVGAVSVAGSRIAAETAPVHGEERRAEMRARFKKVADVLGIEQTI
DVQELVYHDQDRASVADWLTDHGWRARSQRAPDEMRRVGRWVEGVPMADD
PTAFAEFVTAERL
>MT2302 conserved hypothetical protein
MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSAMQERLP
FFADLEASQRASVALVVQTAVVNFVEWMHDPHSDVGYTAQAFELVPQDLT
RRIALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFT
AATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAP
ATVLVGTPAPGPNGSNSDGDSERASQDVRDTAARHGRAALTDVHGTWLVA
IVSGQLSPTEKFLKDLLAAFADAPVVIGPTAPMLTAAHRSASEAISGMNA
VAGWRGAPRPVLARELLPERALMGDASAIVALHTDVMRPLADAGPTLIET
LDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQPRDAYVLR
VAATVGQLNYPTPH
>MT0577 substrate--CoA ligase
MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELLRSPTHN
GHLLVGALKRHQNKPVLFLGDTRLTGGQLADRISQYIQAFEALGAGTGVA
VGLLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLI
IDPNPMFVERALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQP
QPLVAADLPPDQVIGLTYTGGTTGKPKGVIGTAQSIATMTSIQLAEWEWP
ANPRFLMCTPLSHAGAAFFTPTVIKGGEMIVLAKFDPAEVLRIIEEQRIT
ATMLVPSMLYALLDHPDSHTRDLSSLETVYYGASAINPVRLAEAIRRFGP
IFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVALLDEHGKPV
KQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYY
IVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVVGAPDEKWGEAVTA
VVVLRSNAARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGL
GKPDKKAVRARFWEGAGRAVG
>MT0231 hypothetical protein
MHTLKVAVIELDSDRQEFGVDAFREVIAGRLHKLEPLGYQLVDVPLKFHH
PMWREHCQVDLNYHIRPWRLRAPGGRRELDEAVGEIASTPLNRDHPLWEM
YFVEGLANHRIAVVAKIHHALADGVASANMMARGMDLLPGPEVGRYVPDP
APTKRQLLSAAFIDHLRHLGRIPATIRYTTQGLGRVRRSSRKLSPALTMP
FTPPPTFMNHRLTPERRFATATLALIDVKATAKLLGATINDMVLAMSTGA
LRTLLLRYDGKAEPLLASVPVSYDFSPERISGNRFTGMLVALPADSDDPL
QRVRVCHENAVSAKESHQLLGPELISRWAAYWPPAGAEALFRWLSERDGQ
NKVLNLNISNVPGPRERGRVGAALVTEIYSVGPLTAGSGLNITVWSYVDQ
LNISVLTDGSTVQDPHEVTAGMIADFIEIRRAAGLSVELTVVESAMAQA
>MT3787 P450 heme-thiolate protein
MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGLPAPRGF
RAAFAAAYAVAYLAGGERRMLRLIRRYGPIMTMPILSLGDVAIVSDSALA
KEVFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPEHLRRRKLLTPPLHG
AALDRYVPIIENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDD
PEEVRRLGRPFERLLNLGVSEQLTVRYALRRLGALRVWPARARANTEIDD
VVMALIAQRRADPRLGERHDVLSLLVSARGESGEQLSDSEIRDDLITLVL
AGHETTATTLAWAFDLLLHHPDALRRVRAEAVGGGEAFTTAVINETLRVR
PPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYEHPHEFRPE
RFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDD
EPERIVRRSIMLVPRRGTRVRFRPAR
>MT0342 P450 heme-thiolate protein
MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFSLRVPPY
ADNLVVYTRPEHIKEIFAADPRSLHAGEGNHILGFVMGEHSVLMTDEAEH
ARMRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALT
LDIILRVVFGVTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWK
RFFHNQTKIDEILYREIASRRIDSDLTARTDVLSRLLQTKDTPTKPLTDA
ELRDQLITLLLAGHETTAAALSWTLWELAHAPEIQSQVVWAAVGGDDGFL
EAVLKEGMRRHTVIASTARKVTAPAEIGGWRLPAGTVVNTSILLAHASEV
SHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTEGAVILQEI
FRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP
>MT3114 methyltransferase, putative
MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENYWFRRHQ
VVYQRLAPRCTARDVLEAGCGEGYGADLIACVARQVIAVDYDETAVAHVR
SRYPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLR
GSGLLMVSTPNRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAM
CGLFHGPRLRDMDARHGGSIIDAQIMRAVAGAPWPPELAADVAAVTTADF
EMVAAGHDRDIDDSLDLIAIAVRP
>MT3002 polyketide synthase
MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCRFPGDVD
GPESFWDFLVAGRNAISTVPADRWDAEAFYHPDPLTPGRMTTKWGGFVPD
VAGFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTA
VMMGVYFNEYQSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVA
VDTACSSSLVAVHLACQSLRLRETDLALAGGVSITLRPETQIAISAWGLL
SPQGRCAAFDAAADGFVRGEGAGVVVLKRLTDAVRDGDQVLAVVRGSAVN
QDGRSNGVTAPNTAAQCDVIADALRSGDVAPDSVNYVEAHGTGTVLGDPI
EFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATLAVQRATIP
PNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAH
VIIEQGSELAPVSEGGEDTGVSTLVVTGKTAQRMAATAQVLADWMEGPGA
EVAVADVAHTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVS
HQDGSPGPGTVFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQA
GFSLRDVIATGKELVGIEQIQLGLIGMQLTLTELWRSYGVQPDLVIGHSM
GEVAAAVVAGALTPAEGLRVTATRARLMAPLSGQGGMALLGLDAAATEAL
IADYPQVTVGIYNSPRQTVIAGPTEQIDELIARVRAQNRFASRVNIEVAP
HNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWATN
MRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTK
SAAKYLSIGTLQRDADDTVTFRTNLYTADIAHPPHTCHPPEPHPTIPTTP
WQHTHHWIATTHPSTAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPI
RPAVSADPPSTAAWLVVADNELCHELARAADSRVDSLSPPALAAGSDPAA
LLDALRGVDNVLYAPPVPGELLDIESAYQVFHATRRLAAAMVASSATAIS
PPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEHPEIWGGIIDLDD
SMPAELAVRHVLTAAHGTDGEDQVVYRSGARHVPRLQRRTLPGKPVTLNA
DASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATG
TDLIAVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTD
DDVTTMFRPKLDALALLHRRSLKSPVRHFVLFSSVSGLLGSRWLAHYTAT
SAFLDSFAGARRTMGLPATVVDWGLWKSLADVQKDATQISAESGLQPMAD
EVAIGALPLVMNPDAAVATVVVAADWPLLAAAYRTRGALRIVDDLLPAPE
DVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATVMGMPPTEPLDPSAG
FFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYLATVLPE
LLEIGATAVATQQATDSYHELTEAELLEQLSERLRGTQ
>MT3174 substrate--CoA ligase
MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCADVLTALG
IAKGDRVALLMPNSVEFCCLFYGAAKLGAVAVPINTRLAAPEVSFILSDS
GSKVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPA
VECGGDDNLFIMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRL
LLPLPMFHVAALTTVIFSAMRGVTLISMPQFDATKVWSLIVEERVCIGGA
VPAILNFMRQVPEFAELDAPDFRYFITGGAPMPEALIKIYAAKNIEVVQG
YALTESCGGGTLLLSEDALRKAGSAGRATMFTDVAVRGDDGVIREHGEGE
VVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYLYIKDRLKD
MIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQN
EVSEQQIVEYCGTRLARYKLPKKVIFAEAIPRNPTGKILKTVLREQYSAT
VPK
>MT3666 substrate--CoA ligase
MPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGAAAALIALGVEPAD
RVAIWSPNTWHWVVACLAIHHAGAAVVPLNTRYTATEATDILDRAGAPVL
FAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIATGAGALD
AVAARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASASWAANGKI
TSDDRYLCINPFFHNFGYKAGILACLQTGATLIPHVTFDPLHALRAIERH
RITVLPGPPTIYQSLLDHPARKDFDLSSLRFAVTGAATVPVVLVERMQSE
LDIDIVLTAYGLTEANGMGTMCRPEDDAVTVATTCGRPFADFELRIADDG
EVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDIGAVDQAGNLRITDRL
KDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVGRAFVVAR
PGTGLDEASVIAYTREHLANFKTPRSVRFVDVLPRNAAGKVSKPQLRELG
>MT2114 carboxymethylenebutenolidase, putative
MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKLISERIA
RAGYVVLTPNMYARGGRARCITRVFRELLTKRGRALDDILAARDHLLAMP
ECSGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACP
IVASFGTRDPLGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQP
LVRIAGFGYNEAATEDAWRRVFEFFGQHLRAGSPGEP
>MT1219 hypothetical protein
MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVPVSYMQA
QHIRGYCEQKAKGLDYSRLMVVSCQQPGQCDIRAANYVINAHLRRHDTYR
SWFQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQW
GCFRFGIVQGCDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAP
LELPPAGSYDDFCRRQHTFSSTLTVESPQVRAWTKFAEGTNGSFPDFPLP
LGDPSKPSDADIVTVMMLDEEQTAQFESVCTAAGARFIGGVLACCGLAEH
ELTGTTTYYGLTPRDTRRTPADAMTQGWFTGLIPITVPIAGSAFGDAARA
AQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAGAAPLSVLL
TAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVAR
YLATLKSVFQRVAESGQQQNVA
>MT2133 oxidoreductase, short-chain dehydrogenase/reductase family
MDDTGAAPVVIFGGRSQIGGELARRLAAGATMVLAARNADQLADQAAALR
AAGAIAVHTREFDADDLAAHGPLVASLVAEHGPIGTAVLAFGILGDQARA
ETDAAHAVAIVHTDYVAQVSLLTHLAAAMRTAGRGSLVVFSSVAGIRVRR
ANYVYGSAKAGLDGFASGLADALHGTGVRLLIARPGFVIGRMTEGMTPAP
LSVTPERVAAATACALVNGKRVVWIPWALRPMFVALRLLPRFVWRRMPR
>MT0793 oxidoreductase, short-chain dehydrogenase/reductase family
MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAVAKQIVA
DGGTAIHVPVDVSDEDSAKAMVDRAVGAFGGIDYLVNNAAIYGGMKLDLL
LTVPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYS
NFYGLAKVGVNGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELV
KNMVQTIPLSRMGTPEDLVGMCLFLLSDSASWITGQIFNVDGGQIIRS
>MT0683 dioxygenase, putative
MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRYLRNGPN
PVAEVDPATYHWFTGDAMVHGVALRDGKARWYRNRWVRTPAVCAALGEPI
SARPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCD
FDGTLHGGYTAHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTV
DIEVAGSPMMHSFSLTDNYVVIYDLPVTFDPMQVVPASVPRWLQRPARLV
IQSVLGRVRIPDPIAALGNRMQGHSDRLPYAWNPSYPARVGVMPREGGNE
DVRWFDIEPCYVYHPLNAYSECRNGAEVLVLDVVRYSRMFDRDRRGPGGD
SRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHRFAYTVGIE
GGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAE
DDGILMGYGWHRGRDEGQLLLLDAQTLESIATVHLPQRVPMGFHGNWAPT
T
>MT3633 oxidoreductase, short-chain dehydrogenase/reductase family
MTGMLKRKVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERLDDVAKQ
IIDTGRRAVAVRTDITDDDDVSNLVQATLAAYGKADVLINNAFRVPSMKP
LAGTTFEHIRDAIELSALGTLRLIQAFTPALAQSHGAIVNVNSMVIRHSQ
PKYGTYKMAKSVLLAMSHSLATELGEQGIRVNSVAPGYIWGDTLKSYFDH
QAGKYGTTVDQIYQATAANSDLKRLPTEDEVASAILFLASDLASGITGQT
LDVNCGEYHT
>MT2580 substrate--CoA ligase
MAAAEVVDPNRLSYDRGPSAPSLLESTIGANLAATAARYGHREALVDMVA
RRRFNYSELLTDVHRLATGLVRAGIGPGDRVGIWAPNRWEWVLVQYATAE
IGAILVTINPAYRVREVEYALRQSGVAMVIAVASFKDADYAAMLAEVGPR
CPDLADVILLESDRWDALAGAEPDLPALQQTAARLDGSDPVNIQYTSGTT
AYPKGVTLSHRNILNNGYLVGELLGYTAQDRICIPVPFYHCFGMVMGNLA
ATSHGAAMVIPAPGFDPAATLRAVQDERCTSLYGVPTMFIAELGLPDFTD
YELGSLRTGIMAGAACPVEVMRKVISRMHMPGVSICYGMTETSPVSTQTR
ADDSVDRRVGTVGRVGPHLEIKVVDPATGETVPRGVVGEFCTRGYSVMAG
YWNDPQKTAEVIDADGWMHTGDLAEMDPSGYVRIAGRIKDLVVRGGENIS
PREIEELLHTHPDIVDGHVIGVPDAKYGEELMAVVKLRNDAPELTIERLR
EYCMGRIARFKIPRYLWIVDEFPMTVTGKVRKVEMRQQALEYLRGQQ
>MT1557 hypothetical protein
MFALSNNLNRVNACMDGFLARIRSHVDAHAPELRSLFDTMAAEARFARDW
LSEDLARLPVGAALLEVGGGVLLLSCQLAAEGFDITAIEPTGEGFGKFRQ
LGDIVLELAAARPTIAPCKAEDFISEKRFDFAFSLNVMEHIDLPDEAVRR
VSEVLKPGASYHFLCPNYVFPYEPHFNIPTFFTKELTCRVMRHRIEGNTG
MDDPKGVWRSLNWITVPKVKRFAAKDATLTLRFHRAMLVWMLERALTDKE
FAGRRAQWMVAAIRSAVKLRVHHLAGYVPATLQPIMDVRLTKR
>MT3615.2 fatty-acid-CoA ligase-related protein
MAASLSENLSCHSSNMCRLSGNAATNLERPGEEPPGDRCTRRQAVRPART
LAKKGNIPVGYYKDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGR
GSVSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQA
RPGCRPSLAELDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQ
TEARPADDVHAGHVTSGS
>MT3263 oxidoreductase
MTSLAERTVLVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAIDVSDPR
VIPLQLDVTDAVSVAEAADLATDVGILINNAGISRASSVLDKDTSALRGE
LETNLFGPLALASAFADRIAERSGAIVNVSSVLAWLPLGMSYGVSKAAMW
SATESMRIELAPRGVQVVGVYVGLVDTDMGRFADAPKSDPADVVRQVLDG
IEAGKEDVLADEMSRQVRASLNVPARERIARLMGN
>MT0328 hypothetical protein
MSNTVVITGHCTSLTVSGMRNSVTVDSVDTIEAAGFNNEVTYHSGSPKIS
NAGGSNSVQQG
>MT0869 copper-binding protein, putative
MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGAAGMTAA
IDAAEAARPHSGRTVTATLTPQPARIDLGGPIVSTLTYGNTIPGPLIRAT
VGDEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRF
SVPDPGTYWAHPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTD
GIGKSPQQLYGELTDPNKPTMQNTTGMPEGEGVDSNLLGGDGGDIAYPYY
LINGRIPVAATSFKAKPGQRIRIRIINSAADTAFRIALAGHSMTVTHTDG
YPVIPTEVDALLIGMAERYDVMVTAAGGVFPLVALAEGKNALARALLSTG
AGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLPVTLGGTMA
KYDWTINGEPYSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIK
ADGSPGARKDTVIVLPKQKMRAVLVADNPGVWVMHCHNNYHQVAGMATRL
DYIL
>MT3601 virulence factor mce family protein
MLNRKPSSKHERDPLRTGIFGLVLVICVVLIAFGYSGLPFWPQGKTYDAY
FTDAGGITPGNSVYVSGLKVGAVSAVSLAGNSAKVTFSVDRSIVVGDQSL
AAIRTDTILGERSIAVSPAGSGKSTTIPLSRTTTPYTLNGVLQDLGRNAN
DLNRPQFEQALNVFTQALHDATPQVRGAVDGLTSLSRALNRRDEALQGLL
AHAKSVTSVLSERAEQVNKLVEDGNQLFAALDARRAALSALISGIDDVAA
QISGFVADNRKEFGPALSKLNLVLANLNERRDYITEALKRLPTYATTLGE
VVGSGPGFNVNVYSVLPGPLVATVFDLVFQPGKLPDSLADYLRGFIQERW
IIRPKSP
>MT0971 oxidoreductase, short-chain dehydrogenase/reductase family
MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRLTELKAE
LSQRYPDIKIAVAELDVNDHERVPKVFAELSDEIGGIDRVIVNAGIGKGA
RLGSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVK
GVPGVKAAYAASKAGVRSLGESLRAEYAQRPIRVTVLEPGYIESEMTAKS
ASTMLMVDNATGVKALVAAIEREPGRAAVPWWPWAPLVRLMWVLPPRLTR
RFA
>MT0293 conserved hypothetical protein
MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAFCRAVGG
SWADVLDGKLPDHKLKSTDFGEHFVNFQGARTKYFDEYFRRAAAAGARQV
VILAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRR
EIAVDLRDDWPQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDA
LAGRRSHVAVEDGAPMGPDEYAAKVEEERAAIAEGAEEHPFFQLVYNERC
APAAEWFGERGWTAVATLLNDYLEAVGRPVPGPESEAGPMFARNTLVSAA
RV
>MT1753.1 oxidoreductase, short-chain dehydrogenase/reductase family
MEEMALAQQVPNLGLARFSVQDKSILITGATGSLGRVAARALADAGARLT
LAGGNSAGLAELVNGAGIDDAAVVTCRPDSLADAQQMVEAALGRYGRLDG
VLVASGSNHVAPITEMAVEDFDAVMDANVRGAWLVCRAAGRVLLEQGQGG
SVVLVSSVRGGLGNAAGYSAYCPSKAGTDLLAKTLAAEWGGHGIRVNALA
PTVFRSAVTEWMFTDDPKGRATREAMLARIPLRRFAEPEDFVGALIYLLS
DASSFYTGQVMYLDGGYTAC
>MT2439 conserved hypothetical protein
MVLPKPTPRGRELIRQAAKVALHPTPEWLDELDRATLAAHPSIAADPALA
TVVSRANRSHLIHFATANLRKPGQPVPANLGPDPLRMARDLVRRGLDASA
LDVYRVGQNVAWQRWTEIAFGLTTDPQELHELLTLPFRSASEFIDATLAG
LAAQMQLEYDELTRDVHAEHRRIVELILDGAPISRQSAEAKLGYPLDRSH
TAAIIWYDDPDDNQNHLDHTARAFGRALGCPQPLIAVASAATRWVWVSDA
ATLDTDRIHQVLDHAPHARIAVGTTARGIDGFRRSHRDALATQRMLARLR
SQQRLAFFADIHMIAVLTENPDSAADFITSTLGDLESASPQLLTTVLTYI
NEQCNASRAAHVLHTHRNTLLRRLETAQRLLPRPLDHTIIQVAVAISALQ
WRGSQTSDPVETPVEGITSPPPESLGRRRSRLAQLER
>MT1904 oxidoreductase, short-chain dehydrogenase/reductase family
MAVEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAKELDVDA
VVCDTTDPTSLTEARGLFPRHLDTIVNVPAPSWDAGDPRAYSVSDTANAW
RNALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALS
NWIAGQAAVFGTRGITINTVACGRSVQTGYEGLSRTPAPVAAEIARLALF
LTTPAARHITGQTLHVSHGALAHFG
>MT2448 peptide synthetase
MSIPCWHWTCVTDSNDQSARRCRWPRSWATSPVMDLSRNSKMPTSAHTPH
RKWTFRVTNTADIGARLDEARLELLRRRLADRGLSSAAQDIGPHTDDRLS
DGQARMWFVQMADPSGALLNICVSYRITGDIDLARLRDAVNAVARRHRIL
RTTYPVGDDGVAQPTVHADLRPGWTQYDLTDLSQRAQRLRLEVLAQREFC
APFELSRDAPLRITVVRTAADEHVLLLVAHHIAWDDGSWRVFFTDLTQAY
SRADLGADLGPEHRPSAASGPDTTEADLNYWRAIMADPPEPLELPGPAGT
CVPTSWRAARATLRLPADTAARVATMAKNTGCTPYMVLLAAFGALVHRYT
HSDDFLVAAPVLNRGAGTEDAIGYFGNTVAMRLRPQSAMSFRELLTATRD
IASGAFAHQRINLDRVVRELNPDRRHGAERMTRVSFGFREPDGGGFNPPG
IECERYDLRSNITQLPLGFMVEFDRAGVLVEAEHLVEILEPALAKQMLRH
FGVLLDNALAAPDNTLSGLALMDERDAARLREVSRGERFDTPVKTLVDLV
NEQTTRTPDATAVVYEGQHFTYHDLNEASNRLGHWLIEQGIGSEDRVAVL
LDKSPDLIVTALGVVKSGAVYVPVDPSYPQDRLDFILADCDAKLVLRTPV
RELAGYRSDDPTDADRIRPLRPDNTAYLIYTSGTTGLPKGVAVPHRPVAE
YFVWFKGEYDVDDTDRLLQVASPSFDVSIAEIFGTLACGARMVIPRPGGL
TDIGYLTALLRDEGITAMHFVPSLLGLFLSLPGVSQWRTLQRVPIGGEPL
PGEVADKFHATFDALLHNFYGPTETVINASRFKVVGPQGTRIVPIGRPKI
NTTMHLLDDSLQPVPTGVIGEIYIGGTHVAYGYHRRAGLTAERFVADPFN
PGSRMYRSGDLARRNADGDIEFVGRADEQVKIRGFRIELGDVAAAIAVDP
TVGQAVVVVSDLPRLGKSLVGYVTPAAGGDGPADVGVDLDRIRARVAAAL
PEYMLPAAYVVLDEIPITAHGKIDRAALPEPQIASDTEFRAPQTATERRL
AQLFGELLGRDRVGADDSFFDLGGHSLLATKLVAAVRNAFGVDVGVREIF
EFATVTALAGHIDTLDSDSARPRLTRVDHDGPVRLSSSQMRSWFNYRFDG
PNAVNNIPFAAALHGPCDTNAFAAAITDVVARHEILRTVYREIGGVPHQI
IQPPAEVPVRCAAGSDAAWLRAELNNERGYVFDLETDWPIRAALLSTPEQ
TVLSLVVHHIAGDHWSAGVLFTDLLTAYRARSTGQRPSWAPLPVQYADYS
VWQSALLDDGAGIVGPQRDYWIRQLGGLAGETGLRPDFPRPALLSGAGDA
VEFRLGAAIRDKLAAVSRDLGVTEFMLLQAAVAVVLHKAGGGVDVPIGAP
VAGRSEANLDQLIGFFINIVVLRNDLRGNPTLREVLQRTRQMALAAYAHQ
DLPFDQVVEAVNPQRSLSRNPLFDIVVHVREQMPQDHVIDTGPDGDTTLR
VLEPTFDAAQADLSVNFFACGDEYRGHVIYRTELYERATAQRFADWLVRV
VEAFADRPDQPLREVEMVSAQARRRILDRSNAGAGTARVYLLDDALKPVP
VGVVGDVYYGGGPAVGARLARPSETATRFVADPFAAQPGSRLYRNGERGV
WKADGQLELLAEIERLPTAQAAPVPAEPADTETERALAAILADVLEVGEV
GRYDDFFNLGGDSILATQVAARARDGGIPLTARMVFEHPVLCELAAAVDA
KPHVEAEPDDKHHAPMSTSGLSPDELSALTASWDQWP
>MT0917 conserved hypothetical protein
MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVFCRAAGG
EWADVLDGKLPDHYLTTGDFGEHFVNFQGARTRYFDEYFSRATAAGMKQV
VILAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRR
SVAVDLRDEWQIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDT
LASPGSHVAVEEATPLDPCEFAAKLERERAANAQGDPRRFFQMVYNERWA
RATEWFDERGWRATATPLAEYLRRVGRAVPEADTEAAPMVTAITFVSAVR
TGLVADPARTSPSSTSIGFKRFEAD
>MT0617 conserved hypothetical protein
MVESSTASAAAVLRARYPRTAASLDRYGGGTARRLERTGTFARFTRISVV
QIGWALRRYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAI
QGFASLGNIGVEAFTGFFAALANTRVAAPIVSGVALAATVGAGATAQLGA
MRISEEIDALEVMGIKSISFLVSTRILGGLVVIMPLYALALDMAFTSGQV
VTTVFYGQSNGTYEHYFRTFLRPEDVGWSVVEVVIIAVVVMIIHCYYGYT
ASGGPVGVGQAVGRSMRFSLVSVVVVVLLAELALYGVDPNFNLTV
>MT3735 conserved hypothetical protein
MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERELPTVIA
NSTTVVKGLARPGHVPVDRVDHDWVRIDNLLLHGTRYEALPVHPKLLPVI
EGVLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIA
LCDFTADNGATQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWH
TAAANRTDAPRPALTINFCVGFVRQQVNQQLSIPRELVRCFEPRLQELIG
YGLYAGKMGRIDWRPPADYLDADRHPFLDAVADRLQTSVRL
>MT0038 acp-1, acyl carrier protein
MKEAINATIQRILRTDRGITANQVLVDDLGFDSLKLFQLITELEDEFDIA
ISFRDAQNIKTVGDVYTSVAVWFPETAKPAPLGKGTA
>MT1385 acp-2, acyl carrier protein
MWRYPLSTRLALPNTPGVASFAMTSSPSTVSTTLLSILRDDLNIDLTRVT
PDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIA
AKYRDE
>MT2304 acp-3, acyl carrier protein
MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSMVEIAVQ
TEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENP
DAVANVQARLEAESK
>MT2452 entE, 2,3-dihydroxybenzoate-AMP ligase
MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDTVLSDAA
RRWPDRLAVADAGDRPGHGGLSYAELDQRADRAAAALHGLGITPGDRVLL
QLPNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVAD
VASGFDYRPMARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAP
PADPGSPALLLVSGGTTGMPKLIPRTHDDYVFNATASAALCRLSADDVYL
VVLAAGHNFPLACPGLLGAMTVGATAVFAPDPSPEAAFAAIERHGVTVTA
LVPALAKLWAQSCEWEPVTPKSLRLLQVGGSKLEPEDARRVRTALTPGLQ
QVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNADGEPVGPGE
EGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTG
RVKDVICRAGETIAASDLEEQLLSHPAIFSAAAVGLPDQYLGEKICAAVV
FAGAPITLAELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVR
QLGIATGPVTTQRCH
>MT2305 fabF-1, 3-oxoacyl-(acyl-carrier-protein) synthase II
MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIHALEDEF
VTKWDLAVKIGGHLKDPVDSHMGRLDMRRMSYVQRMGKLLGGQLWESAGS
PEVDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGA
AAVIGLQLGARAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEG
PIEALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFGEAGALMLIET
EEHAKARGAKPLARLLGAGITSDAFHMVAPAADGVRAGRAMTRSLELAGL
SPADIDHVNAHGTATPIGDAAEANAIRVAGCDQAAVYAPKSALGHSIGAV
GALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYGDYRYAVNN
SFGFGGHNVALAFGRY
>MT2306 fabF-2, 3-oxoacyl-(acyl-carrier-protein) synthase II
MTELVTGKAFPYVVVTGIAMTTALATDAETTWKLLLDRQSGIRTLDDPFV
EEFDLPVRIGGHLLEEFDHQLTRIELRRMGYLQRMSTVLSRRLWENAGSP
EVDTNRLMVSIGTGLGSAEELVFSYDDMRARGMKAVSPLTVQKYMPNGAA
AAVGLERHAKAGVMTPVSACASGAEAIARAWQQIVLGEADAAICGGVETR
IEAVPIAGFAQMRIVMSTNNDDPAGACRPFDRDRDGFVFGEGGALLLIET
EEHAKARGANILARIMGASITSDGFHMVAPDPNGERAGHAITRAIQLAGL
APGDIDHVNAHATGTQVGDLAEGRAINNALGGNRPAVYAPKSALGHSVGA
VGAVESILTVLALRDQVIPPTLNLVNLDPEIDLDVVAGEPRPGNYRYAIN
NSFGFGGHNVAIAFGRY
>MT0256 fabG-1, 3-oxoacyl-(acyl-carrier-protein) reductase
MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPLTGSLLI
GGAGRVVEPLRAALEKDYDLVGNNLGGRWADSFGGLVFDATGITEPAGLK
GLHEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGK
ELRRGATTALVYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADD
STPPADWEKPLDGKVAIVTGAARGIGATIAEVFARDGAHVVAIDVESAAE
NLAETASKVGGTALWLDVTADDAVDKISEHLRDHHGGKADILVNNAGITR
DKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNGSIGEGGRVIGLSSIAG
IAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGFIETQMTAA
IPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQA
MIGA
>MT1530 fabG-2, 3-oxoacyl-(acyl-carrier-protein) reductase
MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSG
APKGLFGVECDVTDSDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRM
TEEKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQ
ANYAASKAGVIGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQ
GALQFIPAKRVGTPAEVAGVVSFLASEDASYISGAVIPVDGGMGMGH
>MT3606 fabG-3, 3-oxoacyl-(acyl-carrier-protein) reductase
MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVN
DVASALDASDVVDEIGAAAADAGAKAVAVAGDISQRATADELLASAVGLG
GLDIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDK
AKDAEGGSVFGRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALG
RYGVCANVICPRARTAMTADVFGAAPDVEAGQIDPLSPQHVVSLVQFLAS
PAAAEVNGQVFIVYGPQVTLVSPPHMERRFSADGTSLGSHRAHRDAAGLL
CWSGSGTELFGDRSDASVTRGYRRPIIGIGVRITTPT
>MT1218 mas-1, mycocerosic acid synthase
MGFGSIHPRLVQGDCVVRTATATSVAVIGMACRLPGGIDSPQRLWEALLR
GDDLVGEIPADRWDANVYYDPEPGVPGRSVSRWGAFLDDVGGFDCDFFGL
TEREATAIDPQHRLLLEVSWEAIEHAGVDPATLAESQTGVFVGLTHGDYE
LLSADCGAAEGPYGFTGTSNSFASGRVAYTLGLHGPAVTVDTACSSGLTA
VHQACRSLDDGESDLALAGGVVVTLEPRKSVSGSLQGMLSPTGRCHAFDE
AADGFVSGEGCVVLLLKRLPDAVRDGDRVLAIVRGTAANQDGRTVNIAAP
SAQAQIAVYQQALAAAGVEASTVGMVEAHGTGTPVGDPVEYASLAAVYGT
EGPCALTSVKTNFGHLQSASGPLGLMKTILALRHGVVPQNLHFCRLPDQL
AEIDTELFVPQANTSWPDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEP
AASGPELTPEAGGLALFPVSATSAEQLHVTAARLADWVDQNGNAGSRVSM
RDLGYTLSCRRAHRPVRTVVTASSFDELSAALRDVAGDQIPYQPAVGHDD
RGPVWVFSGQGSQWPGMGTELLVAEPVFAATVAAMEPVIARESGFSVTEA
MSAPQTVSGIDRVQPTIFAVQVALAAALKSYGVRPGAIIGHSLGEAAAAV
VAGALSLHDGLRVICRRSRLMSRIAGSGAMASVELPGQQVLSELAIRGIS
DVVLSVVASPTSTVVGGATQSIRDLVAAWEQQDVLAREVAVDVASHTPQV
DPILDELLEVLAEVDPTAPEIPYYSATLWDPRERPSFTGEYWVENLRYTV
RFAAAVQAALKDGYRVFGELAPHPLLTYAVEQNAASLDMPIATLAAMRRG
EQLPFGLRGFVADVHNAGAKVDFSVQYPDGRLVDAPLPSWTHRTLMLSRE
DSHRSHTGAVQAVHPLLGAHVHLLEEPERHVWQAGVGTGAHPWLGDHRIH
NVAAFPGAAYCEMALAAARTTLGELSEVRDIKFEQTLLLDEQTVVSSAAT
IAAPGILQFAVESHQEGEPARRASAMLHALEEMPQPPGYDTNALTAAHES
SMSGEELRKMFNSLGIQYGPAFSGLVAVHTARGDVTTVLAEVALPGAIRS
QQSAYASHPALLDACFQSVLVHPEVQKATVGGLMLPVGVRRLRNYHSTRS
AHYCLARVTSSSRAGECEADLDVFDQAGTVLLTVEGLRLAAGISEHERAN
RVFDERLLTIEWERGELPEVPQIDAGSWLLLSASEADPLTAQLADALNAV
GAQSTSVASASDVAQLRSLLGGRLTGVVVVTGPPTGGLTQCGRDYVSQLV
GIARELAELPGEPPRLFVVTRSAASVLPSDLANLEQAGLRGLMRVIDSEH
PHLGATAIDVDNDETVAALVASQLQSGSQEDETAWRNGIWYTARLRPGPL
RPAERRTAVVEYRRDGMRLQIRTPGDLESLEFVTFDRVAPGPGEIEVAVT
ASSVNFADVLVAFGRYPTFEGYRQQLGIDFAGVVTAVGPDVTEHRIGDHV
GGMSANGCWSTFVRCDARLAVTLPPELPVAAAAAVPTASATAWYALHDLA
RICSDDKVLIHSGTGGVGQAAIAIARAAGCEIFATAGSAQRRQLLHDMGV
EHVYDSRSTEFAEQIRGDTDGYGVDVVLNSLPGAAQRAGIELLAFGGRFV
EIGKRDIYGDTRLGLFPFRRNLSLYAVDLALLTHSHPHTVRRLLKTVYQH
TVEGTLPVPQTTHYPIHDAAVAIRLVGGAGHTGKVVLDVPRTGEGVAVVP
PEQVRTSRPDGAYLVTGGLGGLGLFLAGELAAAGCGRIVLNSRSTPSPHA
TRVIERLRAAGADIQVECGDIADAATAHRVVAVATASGLPVRGVLHAAAV
VEDATLANVTDELIDRCWAPKVHGAWNIHRATAAQPLEWFCLFSSAAALV
GSPGQGAYAAANSWLDAFAHWRRAQGLPATSIAWGAWAEIGRATALAEGT
GAAIAPAEGARAFQTLLRYGRAYSGYAPIMGTPWLTAFAQRSRFAEAFHA
TGQNQPATGKFLAELGSLPREEWPRTVRRLVSDQISLLLRRTIDPDRPLS
DYGLDSLGNLELRTRIETETGIRVSPTKITTVRGLAEHVCDELAAAQSAP
V
>MT3010 mas-2, mycocerosic acid synthase
MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPDRWDADD
YYDPEPGVPGRSVSRWGGFLDDVAGFDAEFFGISEREATSIDPQQRLLLE
TSWEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTG
LNNSVASGRIAHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLAL
AGGCAVLLEPHASVAASAQGMLSSTGRCHSFDADADGFVRSEGCAMVLLK
RLPDALRDGNRIFAVVRGTATNQDGRTETLTMPSEDAQVAVYRAALAAAG
VQPETVGVVEAHGTGTPIGDPIEYRSLARVYGAGTPCALGSAKSNMGHST
ASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFVPQAVTPWP
NGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLF
MLSSTSSDALRQTARQLATWVEEHQDCVAASDLAYTLARGRAHRPVRTAV
VAANLPELVEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQ
LLASEPVFAATIAKLEPVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAV
QVALAATMEQTYGVRPGAVVGHSMGESAAAVVAGALSLEDAARVICRRSK
LMTRIAGAGAMGSVELPAKQVNSELMARGIDDVVVSVVASPQSTVIGGTS
DTVRDLIARWEQRDVMAREVAVDVASHSPQVDPILDDLAAALADIAPMTP
KVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVFAE
LSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGA
ALDYSALYPAGRLVDAPLPAWTHARLFIDDDGQEQRAQGACTITVHPLLG
SHVRLTEEPERHVWQGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAA
AEVFGEAAEVRDITFEQMLLLDEQTPIDAVASIDAPGVVNFTVETNRDGE
TTRHATAALRAAEDDCPPPGYDITALLQAHPHAVNGTAMRESFAERGVTL
GAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYRIHPALLDACFQS
VGAGVQAGTATGGLLLPLGVRSLRAYGPTRNARYCYTRLTKAFNDGTRGG
EADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRAL
PEVGDGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWS
VQDTPPNDQAGLEKLGSQLRGRDGVVIVYGPRVGDPDEHSLLAGREQVRH
LVRITRELAEFEGELPRLFVVTRQAQIVKPHDSGERANLEQAGLRGLLRV
ISSEHPMLRTTLIDVDEHTDVERVAQQLLSGSEEDETAWRNGDWYVARLT
PSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEFVASDRVPPGPGQIE
VAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEGVTGHQV
GDRVGGFSEGGCWRTFLTCDANLAVTLPPGLTDEQAITAATAHATAWYGL
NDLAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLR
DMGVEHVYDSRSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFG
GRFVEIGKADVYGNTRLGLFPFRRGLTFYYLDLALMSVTQPDRVRELLAT
VFKLTADGVLTAPQCTHYPLAEAADAIRAMSNAEHTGKLVLDVPRSGRRS
VAVTPEQAPLYRRDGSYIITGGLGGLGLFFASKLAAAGCGRIVLTARSQP
NPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATATGLPLRGVLH
SAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSG
AALLGSPGQGAYAAANSWVDVFAHWRRAQGLPVSAIAWGAWGEVGRATFL
AEGGEIMITPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGE
MFASTGQRSRGPSKFRMELLSLPQDEWAGRLRRLLVEQASVILRRTIDAD
RSFIEYGLDSLGMLEMRTHVETETGIRLTPKVIATNNTARALAQYLADTL
AEEQAAAPAAS
>MT3933 mas-3, mycocerosic acid synthase
MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPELLWKAL
LRGDDLITEVPPDRWDCDEFYDPQPGVPGRTVCKWGGFLDNPADFDCEFF
GIGEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGD
YTMVAADAKQLEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGL
TAVHMACRSLHEGESDVALAGGVALMLEPRKAAAGSALGMLSPTGRCRAF
DVAADGFVSGEGCAVVVLKRLPDALADGDRILAVIRGTSANQDGHTVNIA
TPSQPAQVAAYRAALAAGGVDAATVGMVEAHGPGTPIGDPIEYASVSEVY
GVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRNLHFTRLPD
EIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQ
TEAQPHAASTPPTGTPALFTLSASSADALRQTAQRLTDWIQQHADSLVLS
DLAYTLARRRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDR
GPVWLFSGQGSQWAAMGADLLTNESVFAATVAELEPLIAAESGFSVTEAM
TAPETVTGIDRVQPTIFAMQVALAATMAAYGVRPGAVIGHSMGESAAAVV
AGVLSAEDGVRVICRRSKLMATIAGSAAMASVELPALAVQSELTALGIDD
VVVAVVTAPQSTVIAGGTESVRKLVDIWERRDVLARAVAVDVASHSPQVD
PILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHTVR
FSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQ
PLPLGLRRLLTDLHNAGAAVDFSVLCPQGRLVDAPLPAWSHRFLFYDREG
VDNRSPGGSTVAVHPLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIH
NVAALPGAAYCEMALSAARAVLGEQSEVRDMRFEAMLLLDDQTPVSTVAT
VTSPGVVDFAVEALQEGVGHHLRRASAVLQQVSGECEPPAYDMASLLEAH
PCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATATMLAEVALPGSI
RSQQGLYAIHPALLDACFQSVGAHPDSQSVGSGLLVPLGVRRVRAYAPVR
TARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHN
RVLNERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFRE
HSAACTTMRWPLHDDQLAAADQLRDQVGSDEFSGVVVLTGSNTGTPHQGS
ADRGAEYVRRLVGIARELSDLPGAVPRMYVVTRGAQRVLADDCVNLEQGG
LRGLLRTIGAEHPHLRATQIDVDEQTGVEQLARQLLATSEEDETAWRDNE
WYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPGDMQTIELAAFHRVP
PGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGVVTAVGP
GVTDHKVGDHVGGMSPNGCWGTFVTCDARLAATLPPGLGDAQAAAVTTAH
ATAWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTP
QRRELLRNMGIEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAG
LKLLAFRGRFVEIGKRDIYGDTKLGLFPFRRNLSFYAVDLGLLSATHPEE
LRDLLGTVYRLTAAGELPMPQSTHYPLVEAATAIRVMGNAEHTGKLVLHI
PQTGKSLVTLPPEQAQVFRPDGSYIITGGLGGLGLFLAEKMAAAGCGRIV
LNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAERLVATAVATGL
PVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDW
FCLFSSAAALTGSPGQSAYSAANSWLDAFAHWRQAQGLPATAIAWGAWSD
IGQLGWWSASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPV
IGAPWLVAFAERSRFFEVFSSSNGSGTSKFRVELNELPRDEWPARLRQLV
AEQVSLILRRTVDPDRPLPEYGLDSLGALELRTRIETETGIRLAPKNVSA
TVRGLADHLYEQLAPDDAPAAALSSQ
>MT0178 mce-1, virulence factor
MSFGPSWRPSSSLRSSWSATATTGTPPVEAPSVSARPSADRCVSRWSRCR
SLSCLQRWRSTVSTRTSISRCSRMTTPGKLNKARVPPYKTAGLGLVLVFA
LVVALVYLQFRGEFTPKTQLTMLSARAGLVMDPGSKVTYNGVEIGRVDTI
SEVTRDGESAAKFILDVDPRYIHLIPANVNADIKATTVFGGKYVSLTTPK
NPTKRRITPKDVIDVRSVTTEINTLFQTLTSIAEKVDPVKLNLTLSAAAE
ALTGLGDKFGESIVNANTVLDDLNSRMPQSRHDIQQLAALGDVYADAAPD
LFDFLDSSVTTARTINAQQAELDSALLAAAGFGNTTADVFDRGGPYLQRG
VADLVPTATLLDTYSPELFCTIRNFYDADPLAKAASGGGNGYSLRTNSEI
LSGIGISLLSPLALATNGAAIGIGLVAGLIAPPLAVAANLAGALPGIVGG
APNPYTYPENLPRVNARGGPGGAPGCWQPITRDLWPAPYLVMDTGASLAP
YNHMEVGSPYAVEYVWGRQVGDNTINP
>MT0618 mce-2, virulence factor
MPTLVTRKNRRAWLYVEGVVLLLVGALVLVLVYKQFRGEFTPKTELTMVA
SRAGLVMEAGSKVTYNGVEIGRVGSISEIERDGRPAAKLVLDVNPRYISL
IPVNVVADIEAATLFGNKYVALSAPKIPQQQRISSHDVIDVGSVTTEFNT
LFETITSIAEKVDPIELNATLSAVAQALDGLGGKFGESIVNGNQILAQLN
PRLPQLGYDVRRLADLGEVYVDASPDLWSFLQNALTTARTLTSQQRDLDA
ALLAATGAGNTGEDVFARGGPYLARAAADLVPTATLLDTYSPELFCMIRN
FHDAAPKVADAVGGNGYSLAAAGTILGAPNPYVYPDNLPRVNAHGGPGGR
PGCWQTITRELWPAPYLVMDTGASLAPYNHVELGQPMFTEYVWGRQYGEN
TINP
>MT2018 mce-3, virulence factor
MRRGPGRHRLHDAWWTLILFAVIGVAVLVTAVSFTGSLRSTVPVTLAADR
SGLVMDSGAKVMMRGVQVGRVAQIGRIEWAQNGASLRLEIDPDQIRYIPA
NVEAQISATTAFGAKFVDLVMPQNPSRARLSAGAVLHSKNVSTEINTVFE
NVVDLLNMIDPLKLNAVLTAVADAVRGQGERIGQATTDLNEVLEALNARG
DTIGGNWRSLKNFTDTYDAAAQDILTILNAASTTSATVVNHSTQLDALLL
NAIGLSNAGTNLLGSSRDNLVGAADILAPTTSLLFKYNPEYTCFLQGAKW
YLDNGGYAAWGGADGRTLQLDVALLFGNDPYVYPDNLPVVAAKGGPGGRP
GCGPLPDATHNFPVRQLVTNTGWGTGLDIRPNPGIGHPCWANYFPVTRAV
PEPPSIRQCIPGPAIGPNPAAGEQP
>MT0567 menE, o-succinylbenzoate--CoA ligase
MLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTGPPKGAM
LTAAALTASASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVE
LNVSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTDPAATAALAELD
AVLIGGGPAPRPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLR
VLAGGRIAIGGATLAKGYRNPVSPDPFAEPGWFHTDDLGALESGDSGVLT
VLGRADEAISTGGFTVLPQPVEAALGTHPAVRDCAVFGLADDRLGQRVVA
AIVVGDGCPPPTLEALRAHVARTLDVTAAPRELHVVNVLPRRGIGKVDRA
ALVRRFAGEADQ
>MT3640 mhpD, 2-keto-4-pentenoate hydratase
MLRDATRDELAADLAQAERSRDPIGQLTAAHPEIDVVDAYEIQLINIRQR
VAEGARVVGHKVGLSSPIMQQMMGVDEPDYGHLLDDMQVFEDTPVQASRY
LSPRVEVEVGFILAADLPGAGCTEDDVLAATEALVPAIELIDTRIKDWQI
KICDTIADNASAAGFVLGAARVPPADLDVRAIDAKLTRNGEVVAEGRSDA
VLGNPATAVAWLAGKVESFGVRLRKGDIVLPGSCTFAVEARAGDEFVADF
TGLGLVRLSFE
>MT3639 mhpF, acetaldehyde dehydrogenase (acetylating)
MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGLARAAKL
GLETTHEGVDWLLAQPDKPDLVFEATSAYVHRDAAPKYAEAGIRAIDLTP
AAVGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEI
VASVASVSAGPGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPM
IMRDTIFCAIPTDADREAIAASIHDVVKEVQTYVPGYRLLNEPQFDEPSI
NSGGQALVTTFVEVEGAGDYLPPYAGNLDIMTAAATKVGEEIAKETLVVG
GAR
>MT3671 nhoA, N-hydroxyarylamine O-acetyltransferase
MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPLLGVPVD
DLSPQALADKLVLRRRGGYCFEHNGLMGYVLAELGYRVRRFAARVVWKLA
PDAPLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTT
HEPYRLEDRVDGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHP
ASKFVTGLTAAVITDDARWNLSGRDLAVHRAGGTEKIRLADAAAVVDTLS
ERFGINVADIGERGALETRIDELLARQPGADAP
>MT2451 pchE, dihydroaeruginoic acid synthetase
MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMSLVGRWR
RKGIAVDFATLAATPTIEAWSQLVSAGTGVAPTAVAAPGDAGLSQEGEPF
PVAPMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALR
HPMLRVQFLPDGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDA
KSHQQLDGAVFELALTLLPGERTRLHVDLDMQAADAMSYRILLADLAALY
DGREPPALGYTYREYRQAIEAEETLPQPVRDADRDWWAQRIPQLPDPPAL
PTRAGGERDRRRSTRRWHWLDPQTRDALFARARARGITPAMTLAAAFANV
LARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVDLTGARTAA
ARAQAVQEALRTAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGD
LFCPDVTEQFGTPGWIISQGPQVLLDAQVTEFDGGVLVNWDVREGVFAPG
VIDAMFTHQVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPS
TEALHDGFFRQAQQQPDAPAVFASSGDLSYAQLRDQASAVAAALRAAGLR
VGDTVAVLGPKTGEQVAAVLGILAAGGVYLPIGVDQPRDRAERILATGSV
NLALVCGPPCQVRVPVPTLLLADVLAAAPAEFVPGPSDPTALAYVLFTSG
STGEPKGVEVAHDAAMNTVETFIRHFELGAADRWLALATLECDMSVLDIF
AALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEVGG
GRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFE
VQDAANLPPDWASVPYGVPFPNNACRVVADSGDDCPDWVAGELWVSGRGI
ARGYRGRPELTAERFVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKI
SGYRVELGEIEAALQRLPGVHAAAATVLPGGSDVLAAAVCVDDAGVTAES
IRQQLADLVPAHMIPRHVTLLDRIPFTDSGKIDRAEVGALLAAEVERSGD
RSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFFALGGDSVLATQV
VAGIRRWLDSPSLMVADMFAARTIAALAQLLTGREANADRLELVAEVYLE
IANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAA
AAYRWLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDW
HLTAPLTLFGHCMGAIVAFEFARLAERNGVPVRALWASSGQAPSTVAASG
PLPTADRDVLADMVDLGGTDPVLLEDEEFVELLVPAVKADYRALSGYSCP
PDVRIRANIHAVGGNRDHRISREMLTSWETHTSGRFTLSHFDGGHFYLND
HLDAVARMVSADVR
>MT2103 pncA, pyrazinamidase/nicotinamidase
MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHVVATKDF
HIDPGDHFSGTPDYSSSWPPHCVSGTPGADFHPSLDTSAIEAVFYKGAYT
GAYSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNG
LATRVLVDLTAGVSADTTVAALEEMRTASVELVCSS
>MT0593 tcmO, tetracenpmycin polyketide synthesis 8-o-methyltransferase
MVELSPDRIMAIGGGYGPSKVLLTAVGLGLFTELGDEAMTAEAIADRLGL
LKRPAIDFLDALVSLDLLARDGDGPGSHYRNTPETAHFLDEARPTYAGGL
LKIWNERNYRFWADLTEALKTGKAQSEVKQTGRPFFEALYADPRRLEAFM
AAMDAASRRNIELLAKRFPFERYRRLCDVGCADGLLSRIVAAAHPHLQCV
SFDLPAVTEIARRKLTAEGLGERVQACAGDFLADPLPAADVITMGQILHD
WNLDRKQQLVAKAYEALSKEGAFIVIETLIDDARRENTTGLMMSLNMLIE
FGDAFDYSAADFRGWCGEAGFRSFEVIPLAGGSSAAVAYK