Gene list
Applied filters:
COG category: Secondary metabolites biosynthesis, transport and catabolism
Organism: Mycobacterium tuberculosis CDC1551, CDC1551
Gene type: CDS
Number of genes found: 266
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Mycobacterium tuberculosis CDC1551, CDC1551 >MT2016 conserved hypothetical protein MVIVADKAAGRVADPVLRPVGALGDFFAMTLDTSVCMFKPPFAWREYLLQ CWFVARVSTLPGVLMTIPWAVISGFLFNVLLTDIGAADFSGTGCAIFTVN QSAPIVTVLVVAGAGATAMCADLGARTIREELDALRVMGINPIQALAAPR VLAATTVSLALNSVVTATGLIGAFFCSVFLMHVSAGAWVTGLTTLTHTVD VVISMIKATLFGLMAGLIACYKGMSVGGGPAGVGRAVNETVVFAFIVLFV INIVVTAVGIPFMVS >MT2020 virulence factor mce family protein MRRRAQGQRQGRPAGVHQAGRSGQRAVRAEMKSFAERNRLAIGTVGIVVV AAVALAALQYQRLPFFNQGTRVSAYFADAGGLRTGNTVEVSGYPVGKVSS ISLDGPGVLVEFKVDTDVRLGNRTEVAIKTKGLLGSKFLDVTPRGDGRLD SPIPIERTTSPYQLPDALGDLAATISGLHTERLSESLATLAQTFADTPAH FRNAIHGVARLAQTLDERDNQLRSLLANAAKATGVLANRTDQIVGLVRDT NVVLAQLRTQSAALDRIWANISAVAEQLRGFIAENRQQLRPALDKLNGVL AIVENRKERVRQAIPLINTYVMSLGESLSSGPFFKAYVVNLLPGQFVQPF ISAAFSDLGLDPATLLPSQLTDPPTGQPGTPPLPMPYPRTGQGGEPRLTL PDAITGNPGDPRYPYRPEPPAPPPGGPPPGPPAQQPGDQP >MT3202 conserved hypothetical protein MSPSPSALLADHPDRIRWNAKYECADPTEAVFAPISWLGDVLQFGVPEGP VLELACGRSGTALGLAAAGRCVTAIDVSDTALVQLELEATRRELADRLTL VHADLCSWQSGDGRFALVLCRLFWHPPTFRQACEAVAPGGVVAWEAWRRP IDVARDTRRAEWCLKPGQPESELPAGFTVIRVVDTDGSEPSRRIIAQRSL >MT1770 conserved hypothetical protein MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEPLVRAVG LDLFTKMMDGELDMSTIADVSPAVAQAMVYGNAVRTKYFDDYLLNATAGG IRQVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPS AIRRAVPIDLRADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFD NITALSAPGSMVATEFVTGIADFSAERARTISNPFRCHGVDVDLASLVYT GPRNHVLDYLAAKGWQPEGVSLAELFRRSGLDVRAADDDTIFISGCLTDH SSISPPTAAGWR >MT1439 P450 heme-thiolate protein MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGSDITRFR CAGRRFVSISHPDYVDHVLHEARLKYVKSDEYGPIRATAGLNLLTDEGDS WARHRGALNSTFARRHLRGLVGLMIDPIADVTAALVPGAQFDMHQSMVET TLRVVANALFSQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYD TLIWCIYSGVHLPPPLREMQEITLTLDRAINSVIDRRLAEPTNSADLLNV LLSADGGIWPRQRVRDEALTFMLAGHETTANAMSWFWYLMALNPQARDHM LTELDDVLGMRRPTADDLGKLAWTTACLQESQRYFSSVWIIAREAVDDDI IDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPTDRPRCAYL PFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRP KHGVHVIGRRR >MT2336 P450 heme-thiolate protein MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYAL CTQVLEDRRFSMKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMK AITPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVL GIPQEDGPKLFRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITT GLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGSFLTTALISLI QRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVR KGELVLVLLEGANFDPEHFPNPGSIELDRPNPTSHLAFGRGQHFCPGSAL GRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW >MT3802 conserved hypothetical protein MGAMTDEVMDWDSAYREQGAFEGPPPWNIGEPQPELATLIAAGKVRSDVL DAGCGYAELSLALAADGYTVVGIDLTPTAVAAATKAAEERGLTTASFVQA DITEFAAYPAGSAGRFSTVIDSTLFHSLPVDSRDRYLSSVHRAAAPGASY YVLVFAKGAFPAELEVKPNEVDEDELRAAVSKYWKIDEIRPAFIHVNPVT IPPQLAGAPVEFPPYDHDEKGRVKFPAYLLTAHKAG >MT1929 P450 heme-thiolate protein MHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPFYDEVRSHGALVRNRA NYLTVDHRLAHDLLRSDDFRVVSFGENLPPPLRWLERRTRGDQLHPLREP SLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLLDRFAEQPGI VDVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSLDIGIPWRQY LRVQQGIRGFDCWLEGHLQQLRHAPGDDLMSQLIQIAESGDNETQLDETE LRAIAGLVLVAGFETTVNLLGNGIRMLLDTPEHLATLRQHPELWPNTVEE ILRLDSPVQLTARVACRDVEVAGVRIKRGEVVVIYLAAANRDPAVFPDPH RFDIERPNAGRHLAFSTGRHFCLGAALARAEGEVGLRTFFDRFPDVRAAG AGSRRDTRVLRGWSTLPVTLGPARSMVSP >MT3600 virulence factor mce family protein MGRVAMLTGSRGLRYATVIALVAALVGGVYVLSSTGNKRTIVGYFTSAVG LYPGDQVRVLGVPVGEIDMIEPRSSDVKITMSVSKDVKVPVDVQAVIMSP NLVAARFIQLTPVYTGGAVLPDNGRIDLDRTAVPVEWDEVKEGLTRLAAD LSPAAGELQGPLGAAINQAADTLDGNGDSLHNALRELAQVAGRLGDSRGD IFGTVKNLQVLVDALSESDEQIVQFAGHVASVSQVLADSSANLDQTLGTL NQALSDIRGFLRENNSTLIETVNQLNDFAQTLSDQSENIEQVLHVAGPGI TNFYNIYDPAQGTLNGLLSIPNFANPVQFICGGSFDTAAGPSAPDYYRRA EICRERLGPVLRRLTVNYPPIMFHPLNTITAYKGQIIYDTPATEAKSETP VPELTWVPAGGGAPVGNPADLQSLLVPPAPGPAPAPPAPGAGPGEHGGGG >MT1222 acyl-CoA synthase MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEVFRRTRI VAHEVRRHCTTGDRAVILAPQGLAYIAAFLGSMQAGAIAVPLSVPQIGSH DERVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLT GNSPSFRVKDLPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFG DRNGVAPPDTTIVSWLPFYHDMGLVLGIIAPILGGYRSELTSPLAFLQRP ARWLHSLANGSPSWSAAPNFAFELAVRKTTDADIEGLDLGNVLGITSGAE RVHPNTLSRFCNRFAPYNFREDMIRPSYGLAEATLYVASRNSGDKPEVVY FEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCIECPAGTIG EIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLS EDEMFIVGRMKDMLIVYGRNHYPEDIESTVQEITGGRVAAISVPVDHTEK LVTVIELKLLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGS IPTTTSGKIRRAACVEQYRLQQFTRLDG >MT3937 oxidoreductase, putative MTGYDAIVIGAGHNGLTAAVLLQRAGLRTACLDAKRYAGGMASTVELFDG YRFEIAGSVQFPTSSAVSSELGLDSLPTVDLEVMSVALRGVGDDPVVQFT DPTKMLTHLHRVHGADAVTGMAGLLAWSQAPTRALGRFEAGTLPKSFDEM YACATNEFERSAIDDMLFGSVTDVLDRHFPDREKHGALRGSMTVLAVNTL YRGPATPGSAAALAFGLGVPEGDFVRWKKLRGGIGALTTHLSQLLERTGG EVRLRSKVTEIVVDNSRSSARVRGVRTAAGDTLTSPIVVSAIAPDVTINE LIDPAVLPSEIRDRYLRIDHRGSYLQMHFALAQPPAFAAPYQALNDPSMQ ASMGIFCTPEQVQQQWEDCRRGIVPADPTVVLQIPSLHDPSLAPAGKQAA SAFAMWFPIEGGSKYGGYGRAKVEMGQNVIDKITRLAPNFKGSILRYTTF TPKHMGVMFGAPGGDYCHALLHSDQIGPNRPGPKGFIGQPIPIAGLYLGS AGCHGGPGITFIPGYNAARQALADRRAANCCVLSGR >MT0283 substrate--CoA ligase MPNLTDLPGQAVSKLQKSIGQYVARGTAELHYLRKIIESGAIGLEPPLNY AALAADIRKWGEVGMLPSHNARRAPNRAAVIDEEGTLTFSELDEAAHAVA NGLLAKGVRAGDGVAILARNHRWFVIANYGAARVGARIILLNSEFSGPQI KEVSDREGAKVIIYDDEYTKAVSLAQPPLGKLRALGVNPDDDKPSGSSDE TLAELIAHSSTAPAPKASRRASIIILTSGTTGTPKGANRNTPPTLAPIGG ILSHVPFKAGEVTLLPSPMFHALGYMHAALAMFLGSTLVLRRRFKPALVL EDIEKHKATSMVVVPVMLSRILDQLEKTEPKPDLSSLKIVFVSGSQLGAE LATRALGDLGPVIYNMYGSTEVAFATIAGPKDLQFNPSTVGPVVKGVTVK ILDENGNEVPQGAVGRIFVGNAFPFEGYTGGGGKQIIDGLLSSGDVGYFD ERGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEAAAIGVDDKEF GARLRAFVVKKPGADLDEDTIKQYVRDHLARYKVPREVIFLDELPRNPTG KVLKRELRKL >MT1703 polyketide synthase, putative MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHV GKVVLTMPDAWAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGE HTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAG VLDDAVITGLTPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGI VGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQASAMTEHLGE RDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELP ALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLTPEQRHRELTELVCSNA AIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPD DKTRLIKRLQAILTDCTAPPASSGPSTTHDDEDITTATESQLFAILDDEL GP >MT1180 hypothetical protein MTSGAAASASRVDHPLFARIWPVVAAHEAEAIRALRRENLAGLSGRVLEV GAGVGTNFAYYPVAVEQVIAMEPEPRLAAKARIAAADAPVPIVVTDKTVE EFRDTETFDAVVCSLVLCSVSDPGAVLAHLRSLLRRGGELRYLEHVASAG ARGRVQRFVDATFWPRLAGNCHTHRHTERAILDAGFVVDSSRREWAFPAW VPLPVSELALGRAHRT >MT3311 isochorismate synthase, putative MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRSGTAPIL LGALPFDVSRPAALMVPDGVLRARKLPDWPTGPLPKVRVAAALPPPADYL TRIGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTA YGYLVDLTSAGNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADP KLDAANAAALASSAKNRHEHQLVVDTMRVALEPLCEDLTIPAQPQLNRTA AVWHLCTAITGRLRNISTTAIDLALALHPTPAVGGVPTKAATELIAELEG DRGFYAGAVGWCDGRGDGHWVVSIRCAQLSADRRAALAHAGGGIVAESDP DDELEETTTKFATILTALGVEQ >MT3599 virulence factor mce family protein MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSVTVEMAD VATLPQNSPVMVDDVTVGSVAGIVAVQRPDGSFYAAVKLDLDKNVLLPAN AVAKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEV FSALGVVVNKGNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNR QVHDIIDALDGLNRVSAILARDKDNLGRALDTLPDAVRVLNQNRDHIVDA FAALKRLTMVTSHVLAETKVDFGEDLKDLYSIVKALNDDRKDFVTSLQLL LTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRIGETFFTTAYFDPNMAHMD EILNPPDFLIGELANLSGQAADPFKIPPGTASGQ >MT3004 polyketide synthase MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIGCRFPGN VTGPESFWQLLADGVDTIEQVPPDRWDADAFYDPDPSASGRMTTKWGGFV SDVDAFDADFFGITPREAVAMDPQHRMLLEVAWEALEHAGIPPDSLSGTR TGVMMGLSSWDYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPA VAVDTACSSSLVAIHLACQSLRLRETDVALAGGVQLTLSPFTAIALSKWS ALSPTGRCNSFDANADGFVRGEGCGVVVLKRLADAVRDQDRVLAVVRGSA TNSDGRSNGMTAPNALAQRDVITSALKLADVTPDSVNYVETHGTGTVLGD PIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAGFIKAVLAV QRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGL SGTNAHVVVEQAPDTAVAAAGGMPYVSALNVSGKTAARVASAAAVLADWM SGPGAAAPLADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPR VGVVDCDQHAGGPGRVFVYSGQGSQWASMGQQLLANEPAFAKAVAELDPI FVDQVGFSLQQTLIDGDEVVGIDRIQPVLVGMQLALTELWRSYGVIPDAV IGHSMGEVSAAVVAGALTPEQGLRVITTRSRLMARLSGQGAMALLELDAD AAEALIAGYPQVTLAVHASPRQTVIAGPPEQVDTVIAAVATQNRLARRVE VDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADADYW SANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVM STMNRELDQTLYFHAQLAAVGVAASEHTTGRLVDLPPTPWHHQRFWVTDR SAMSELAATHPLLGAHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIM PAAGFAEIALAAASEALGTAADAVAPNIVINQFEVEQMLPLDGHTPLTTQ LIRGGDSQIRVEIYSRTRGGEFCRHATAKVEQSPRECAHAHPEAQGPATG TTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAETEISIPDEAPRH PGYRLHPVVLDAALQSVGAAIPDGEIAGSAEASYLPVSFETIRVYRDIGR HVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLP LEQKIFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKS MADDFVQQWRSPMRRVHTADIHDESAVLAAFAETAGDPEHPPVGVVVFVG GASSRLDDELAAARDTVWSITTVVRAVVGTWHGRSPRLWLVTGGGLSVAD DEPGTPAAASLKGLVRVLAFEHPDMRTTLVDLDITQDPLTALSAELRNAG SGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVRQGASYVVTGGLGGL GLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRGDVAS PGVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKAT GALRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRR ASGLPAAVINWGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIR TGVARLRADRALVAFPEIRSISYFTQVVEELDSAGDLGDWGGPDALADLD PGEARRAVTERMCARIAAVMGYTDQSTVEPAVPLDKPLTELGLDSLMAVR IRNGARADFGVEPPVALILQGASLHDLTADLMRQLGLNDPDPALNNADTI RDRARQRAAARHGAAMRRRPKPEVQGG >MT0341 hypothetical protein MVATDFSDVAVAQLRRSAQARGVSARVQPIVHDLRQPLPVKTGSIDGAFA HMALCMALSTSEIHAVVAEVGRVLRPGGKFIYTVRHTGDAHYGAGQAHGD DIFECAGFAVHFFRRELVARLATGWVLEEVHDFEEGELPRRLWRVTVTKP A >MT2058 oxidoreductase, short-chain dehydrogenase/reductase family MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEGKAVAAE LADAARYVHLDVTQPAQWTAAVDTAVTAFGGLHVLVNNAGILNIGTIEDY ALTEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVAC HGYTATKFAVRGLTKSTALELGPGGIRVNSIHPGLVKTPMTDWVPEDIFQ TALGRAAEPVEVSNLVVYLASDESSYSTGAEFVVDGGTVAGLAHNDFGAV EVSSQPEWVT >MT0616 conserved hypothetical protein MTTHAVIITYLRDQTQPAVDAIGGFYRTCVLTGKALVRRPFHWREAIEQG WFITSVSLLPTLAVSIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQ LGPLTTVLVIAGAGATAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV VAATIVAALLNGAVITIGLVGGFVFSVFIQHVSAGAYVGTLTLVTGLPEV IISVVKSATFGLIAGLVGCYRGLTTKGGPKGVGTAVNETLVLCVIALFAT NVVLTTIGVRFGTGH >MT1417 chalcone/stilbene synthase family protein MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPRRVVNQS DAADRVAELFLDPGQRERIPRVYQKSRITTRRMAVDPLDAKFDVFRREPA TIRDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPG VDVAIVKELGLSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVV CIELCSVNAVFADDINDVVIHSLFGDGCAALVIGASQVQEKLEPGKVVVR SSFSQLLDNTEDGIVLGVNHNGITCELSENLPGYIFSGVAPVVTEMLWDN GLQISDIDLWAIHPGGPKIIEQSVRSLGISAELAAQSWDVLARFGNMLSV SLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIRR >MT2449 polyketide synthase, putative MWGGVMAPKQLPDGRVAVLLSAHAEELIGPDARAIADYLERFPATTVTEV ARQLRKTRRVRRHRAVLRAADRLELAEGLRALAAGREHPLIARSSLGSAP RQAFVFPGQGGHWPGMGAVAYRELPTYRTATDTCAAAFAAAGVDSPLPYL IAPPGTDERQAFCEIEIEGAQFVHAVALAEVWRSCGVLPDLTVGHSLGEV AAAYLAGSITLSDAVAVVAARANVVGRLPGRYAVAALGIGEQDASALIAT TGGWLELSVVNASSTVAVSGERQAVAAIVDTVRSSGHFARGITVGFPVHT SVLESLRDELCEQLPDSEFMEAPVQFIGGTTGDVVAPGTTFGDYWYANLR HTVRFDRAVESAIRCGARAFIEISAHPALLFAIGQNCEGAANLPDGPAVL VGSARRGERFVDALSANIVSAAVADPGYPWGDLGGDPLDGDVDLSGFPNA PMRAVPMWAHPEPLPPVSGLTIAVERWERMVPSTPVAGRHRHLAVLDLGA HRALAQTLCAAIDSHPDTELSAARDAELILVIAPDFEHTDAVRAAGALAD LVGAGLLDYPMHIGARCQSVCLVTVGAEQVDAADAVPSAGQAALAAMHRS IGFEHPEQTFSHLDLPSWDLDPVLGVSVITAVLRGFGETALRGSVNGYTL FERTLADAPAVPNWSLDSGVLDDVVVTGGAGAIGMHYARYLAEHGARRIV LLSRRAADQATVAMLRKQHGTVIVSPPCDITDPTQLSAIAAEYGGVGASL IVHAAGSVISGTAPGVTSAAVVDNFAAKVLGLAQMIELWPLRPDVRTLLC SSVMGVWGGHGVVAYSAANRLLDVMAAQLRAQGRHCVAVKWGLWQAPKAG EPARGIADAVTIARVERSGLRQMAPQQAIEASLHEFTVDPLVFAADAARL QMLLDSRQFERYEGPTDPNLTIVDAVRTQLAAVLGIPQAGEVNLQESLFD LGVDSMLALDLRNRLKRSIGATVSLATLMGDITGDGLVAKLEDADERSHT AQKVDISRD >MT3009 conserved hypothetical protein MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFDALLETH PVLASHLEQSSDGGWNLVADDLLHSGICVIDGTAATNGSPSGNAELRLDQ SVSLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTG DPGPITPQPTPLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPA VLAHPGLPQAVPVTRLWLSKQQTSDLMAFGREHRLSLNAVVAAAILLTEW QLRNTPHVPIPYVYPVDLRFVLAPPVAPTEATNLLGAASYLAEIGPNTDI VDLASDIVATLRADLANGVIQQSGLHFGTAFEGTPPGLPPLVFCTDATSF PTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEHHGHIAEPG KSLEAIRSLLCTVPSEYGWIME >MT1554 hypothetical protein MRFDQLVRIVNAADPFSINDLGCGYGALLDYLDARGFKTDYTGIDVSPEM VRAAALRFEGRANADFICAARIDREADYSVASGIFNVRLKSLDTEWCAHI EATLDMLNAASRRGFSFNCLTSYSDASKMRDDLYYADPCALFDLCKRRYS KSVALLHDYGLYEFTILVRKAS >MT3940 hypothetical protein MFWAMAMNLLHRRHCSSAGWEKAVANQLLPWALQHVELGPRTLEIGPGYG ATLQALLGLTASLTAVEVDNSMVERLNRRYGQRARIIRGDGTQTGLPDDH FTSVVCFTMLHHVASAQMQDQLFAEAYRVLQPGGVFAGSDGVPSLPFRLI HIADTYTPIAPADLPGRLRAVGFTDIHVDVAGARLRWRATKPVAA >MT3507 conserved hypothetical protein MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMTRTDNDT WDLASSVGATATMIATARALASRAENPLINDPFAEPLVRAVGIDLFTRLA SGELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAA GLDTRAYRLPWPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVD LRNDWPTALKNAGFDPARPTAFSAEGLLSYLPPQGQDRLLDAITALSAPD SRLATQSPLVLDLAEEDEKKMRMKSAAEAWRERGFDLDLTELIYFDQRND VADYLAGSGWQVTTSTGKELFAAQGLPPFEDDHITRFADRRYISAVLK >MT3498 oxidoreductase, short-chain dehydrogenase/reductase family MRYVVTGGTGFIGRHVVSRLLDGRPEARLWALVRRQSLSRFERLAGQWGD RVRPLVGDLTELELSERTIAELGDIDHVLHCAAVHDTTWADATRAVIELA ARLDATFHHVSSIAVAGDFAGHYTEADFDVGQRLPTPYHRMTFEAERLVR STPGLRYRIYRPAVVVGDSRTGEMDTIDGPYYLFGVLAKLAVLPSFTPML LPDIGRTNIVPVDYVADALVALMHADGRDGQTFHLTAPTAIGLRGIYRGI AGAAGLPPLLGTLPGFVAAPVLNARGRAKVLRNMAATQLGIPAEIFDVVG CAPTFTSDTTREALRGTGIHVPEFATYAPGLWRYWAEHLDPDRARRNDPL LGRHVIITGASSGIGRASAIAVAKRGATVFALARNGNALDELVTEIRAHG GQAHAFTCDVTDSASVEHTVKDILGRFDHVDYLVNNAGRSIRRSVVNSTD RLHDYERVMAVNYFGAVRMVLALLPHWRERRFGHVVNVSSAGVQARNPKY SSYLPTKAALDAFADVVASETLSDHITFTNIHMPLVATPMIVPSRRLNPV RAISAERAAAMVIRGLVEKPARIDTPLGTLAEAGNYVAPRLSRRILHQLY LGYPDSAAAQGISRPDADRPPAPRRPRRSARAGVPRPLRRLGRLVPGVHW >MT1834 P450 heme-thiolate protein MVGARPRAILARSAPLGYVCDLRQERRHERLERMTTPGEDHAGSFYLPRL EYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYLTRREDVLAALRNPKVFS SRKALQPPGNPLPVVPLAFDPPEHTRYRRILQPYFSPAALSKALPSLRRH TVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRLIGWKDAV IAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPL SEIEVLGLSHLLILAGLDTVTAAVGFSLLELARRPQLRAMLRDNPKQIRV FIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDA MSTDELVMDGKVHRHWGFGGGPHRCLGSHLARLELTLLVGEWLNQIPDFE LAPDYAPEIRFPSKSFALKNLPLRWS >MT2023 virulence factor mce family protein MLHLPRRVIVQLAVFTVIAVGVLAITFLHFVRLPAMLFGVGRYTVTMELV EAGGLYRTGNVTYRGFEVGRVAAVRLTDTGVQAVLALKSGIDIPSDLKAE VHSHTAIGETYVELLPRNAASPPLKNGDVIALADTSVPPDINDLLSAANT ALEAIPHENLQTVIDESYTAVAGLGLELSRLIKGSAELAIDARANLDPLV ALIDRAGPVLDSQTHTSDAIAAWAAQLAAVTGQLQTHDSAVGDLIDRGGP ALGETRQLLERLQPTVPILLANLVSVGQVALTYHNDIEQLLVVFPMAIAA EQAGILANLNTKQAYRGQYLSFNLNLNLPPPCTTGFLPAQQRRIPTFEDY PDRPAGDLYCRVPQDSPFNVRGARNIPCETVPGKRAPTVKLCESDEPYLP LNDGYNWKGDPNATVPGLGSGQDIPQTWQTMLLPPGS >MT2330 P450 heme-thiolate protein MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTRRRASSG GIPVATSMWRDAQTVRTYGPAVAKALALRVAGKARSRLTGRHCRKFMQLT DFDPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNH DTLSSARGVTFSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMV DQLARELVGGLLTQTPADVVSTVAAPMPMRAITSVLGVDGPDEAAFCRLS NQAVRITDVALSASGLISLVQGFAGFRRLRALFTHRRDNGLLRECTVLGK LATHAEQGRLSDDELFFFAVLLLVAGYESTAHMISTLFLTLADYPDQLTL LAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPAGSLVLLAW GAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILR EIVANIDRIEVVEPPTWTTNANLRGLTRLRVAVTPRVAP >MT0851 conserved hypothetical protein MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAPLVRAVG MDVYTRLVDWQIPVEGDSEFDPQRMATGMACRTRFFDQFFLDATHSGIGQ FVILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATER RTVAVDLRDDWATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNIT ALSAPGSRLAFEFVPDTAIFADERWRNYHNRMSELGFDIDLNELVYHGQR GHVLDYLTRDGWQTSALTVTQLYEANGFAYPDDELATAFADLTYSSATLM R >MT3584 hypothetical protein MPPTKEWPVSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPD LLRQLVDESKASEVVEPWSLRLSHPELLYHPTQSWVVDDNFDLDYHVRRS ALASPGDERELGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMH HSLIDGYTGQKMLARSLSTDPHDTTHPLFFNIPTPGRSPADTQDSVGGGL IAGAGNVLDGLGDVVRGLGGLVSGVGSVLGSVAGAGRSTFELTKALVNAQ LRSDHEYRNLVGSVQAPHCILNTRISRNRRFATQQYPLDRLKAIGAQYDA TINDVALAIIGGGLRRFLDELGELPNKSLIVVLPVNVRPKDDEGGGNAVA TILATLGTDVADPVQRLAAVTASTRAAKAQLRSMDKDAILAYSAALMAPY GVQLASTLSGVKPPWPYTFNLCVSNVPGPEDVLYVRGSRMEASYPVSLVA HSQALNVTLQSYAGTLNFGFIGCRDTLPHLQRLAVYTGEALDQLAAADGA AGLGS >MT2925 oxidoreductase, short-chain dehydrogenase/reductase family MMDLSQRLAGRVAVITGGGSGIGLAAGRRMRAEGATIVVGDVDVEAGGAA ADELSGLFVPTDVCDEDAVNGLFDGAAETYGRIDIAFNNAGISPPEDNLI ENTELAAWQRVQDVNLKSVYLCCRAALRHMVLAGKGSIVNTASFVAVMGS ATSQISYTASKGGVLAMSRELGVQFARQGIRVNALCPGPVNTPLLQELFA KNPERAARRMVHVPLGRFAEPDEIAAAVAFLASDDASFITASTFLVDGGI SSAYVTPL >MT0180 virulence factor mce family protein MRIGLMGIVVALLVVAVGQSFTSVPMLFAKPSYYGQFTDSGGLHKGDRVR IAGLGVGTVEGLKIDGDHIVVKFSIGTNTIGTESRLAIRTDTILGRKVLE IEPRGAQALPPGGVLPVGQSTTPYQIYDAFFDVTKAASGWDIETVKRSLN VLSETVDQTYPHLSAALDGVAKFSDTIGKRDEQITHLLAQANQVASILGD RSEQVDRLLVNAKTLIAAFNERGRAVDALLGNISAFSAQVQNLINDNPNL NHVLEQLRILTDLLVDRKEDLAETLTILGRFSASFGETFASGPYFKVLLA NLVPGQILQPFVDAAFKKRGISPEDFWRSAGLPAYRWPDPNGTRFPNGAP PPPPPVLEGTPEHPGPAVPPGSPCSYTPPADGLPRPWDPLPCANLTQGPF GGPDFPAPLDVATSPPNPDGPPPAPGLPIAGRPGEVPPNVPGTPVPIPQE APPGARTLPLGPAPGPAPPPTAPGPPAPPGPGPQLPAPFINPGGTGGSGV TGGSEN >MT0156 oxidoreductase, short-chain dehydrogenase/reductase family MIAERSLMPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARD GTGAGSAMADEVVAEIRDKGGRAVANYDSVATEDGAANIIKTALDEFGAV HGVVSNAGILRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSY GRVVVATSTSGLFGNFGQTNYGAAKLGLVGLINTLALEGAKYNIHANALA PIAATRMTQDILPPEVLEKLTPEFVAPVVAYLCTEECADNASVYVVGGGK VQRVALFGNDGANFDKPPSVQDVAARWAEITDLSGAKIAGFKL >MT0622 virulence factor mce family protein MSTIFDIRSLRLPKLSAKVVVVGGLVVVLAVVAAAAGARLYRKLTTTTVV AYFSEALALYPGDKVQIMGVRVGSIDKIEPAGDKMRVTLHYSNKYQVPAT ATASILNPSLVASRTIQLSPPYTGGPVLQDGAVIPIERTQVPVEWDQLRD SINGILRQLGPTERQPKGPFGDLIESAADNLAGKGRQLNETLNSLSQALT ALNEGRGDFVAITRSLALFVSALYQNDQQFVALNENLAEFTDWFTKSDHD LADTVERIDDVLGTVRKFVSDNRSVLAADVNNLADATTTLVQPEPRDGLE TALHVLPTYASNFNNLYYPLHSSLVGQFVFPNFANPIQLICSAIQAGSRL GYQESAELCAQYLAPVLDALKFNYLPFGSNPFSSAATLPKEVAYSEERLR PPPGYKDTTVPGIFSRDTPFSHGNHEPGWVVAPGMQGMQVQPFTANMLTP ESLAELLGGPDIAPPPPGTNLPGPPNAYDESNPLPPPWYPQPASLPAAGA TGQPGPGQ >MT0624 virulence factor mce family protein MSATRRTRMTRRADRWWKGLSEEMLTRAIKTQLVLLTVLAVIAVVVLGWY FLRIPSLVGIGRYTLYAELPRSGGLYRTANVTYRGITIGKVTGVEPTERG ARATMSIDNGYQIPTDASANVHSVSAVGEQFVDLVSTRTSGPYLRHGQTI TTTTVPSQIGPALDAANRGLAVLPKDRVASVLHEASEAVGGLGSSLNRLI EATQAIAHDVRGSLEDIDDIIERSAPIIDSQVNSGNEIARWAANLNTLAA QTAQTDPAVRSILANAAPTADQVNATFSDVRESLPQTLANLEVVIDMLKR YHNGVEQALVFLPQSGAIAQSVTTEFPGQAGLGVGGLALNQPPPCLTGFL PASEWRSPADTSTAPLPKGTYCRIPMDASNVVRGARNNPCVDVPGKRAAT PRECRSNEAYVPGGTNPWYGDPNQMLSCPAPAARCDQPVKPGQVIPAPSV NNGINPLPADQLPGTPPPVNDPLQRPGSGTVQCNGQQPNPCVYTPSTFPT TIYDVQSGKVVAPDGVVYSVEASTHAGADGWKVMLAPTG >MT2496 hypothetical protein MDNLPIESAESTRLAKAAMTRRFYTRSVVKGEITLPAVPSMIDEYVTMCA GLFAGVGRKFSDEELAHLRAVLQGQLAEAYAASQRSTIVISYNAPMGPTL HYQVRAQWRTVAQEYENWIATREPPLFGTEPDARVWALANEAADPTTHRV LEIGAGTGRNALALARRGHPVDVVEMTPKFADIIRSDAERDSLDVRVIMR DVFSTMDDLRQDYQLMVLSEVVPDFRTTQQLRNLFELAAQCLAPGARLVF NAFLANGDYAPDQAAREFGQQMYTGMCTRAEMSAAAAGLPLELVADDSVY DYEKTHLPPGAWPPTSWYADWIRGLDVFTTNVESCPIEMRWLVFQRRR >MT3869 metallo-beta-lactamase superfamily protein MHRSSGNVVPMEHKPPTAVIQAAHGEHSLPLHDTTDFDDADRGFIAALSP CVIKAADGRVVWDNDAYSFLDGAAPTSVHPSLWRQSQLTAKQGLYQVVPG IYQVRGFDISNISFVEGDTGLIVIDPLVSTEVAAAALDLYRAHRGADRPV VAVIYTHSHVDHFGGVLGVTTQADVDAGKVAVLAPEGFTAHAVQENIYAG SAMMRRAGYMYGTVLARGLRGHVGCGLGQTLSTGEVSLVVPTVDITETGE THTIDGVEIEFQMAPGTEAPAEMHFYFPRFRALCMAENATHNLHNLLTLR GALVRDPRAWSGYLTEAIDTFADRTDVVFASHHWPTWGREKIVEFLSQQR DMYSYLHDQTLRLLNQGYTGVEIAEMFQLPPALQRAWHTHGYYGSVSHNV KAIYQRYMGWFDGNPGWLWPHPPEALAPRYVDALGGIDRVLELAREAFDA GDFRWAATLLDHAVFADSEHAAARGLYADTLEQLAYGAECATWRNFFLTG AAELRDGNPGSSGQVPAPTFFAQLTPDQIFDVLAISINGPRAWDLDLAID FTFTEPDVNYRLTLRNGVLIHRKLPADPATANATVTVGDKVRLVAAALGD ISSPGFEVFGDRTVLQTFLSVLDRPDSAFNIVTP >MT2022 virulence factor mce family protein MRIGLTLVMIAAVVASCGWRGLNSLPLPGTQGNGPGSFAVQAQLPDVNNI QPNSRVRVADVTVGHVTKIERQGWHALVTMRLDGDVDLPANATAKIGTTS LLGSYHIELAPPKGEARQGKLRDGSLIALSHGSAYPSTEQTLAALSLVLN GGGLGQVQDITEALSTAFAGREHDLRGLIGQLDTFTAYLNNQSGDIIAAT DSLNRLVGKFADQQPVFDRALATIPDALAVLADERDTLVEAAEQLSKFSA LTVDSVNKTTANLVTELRQLGPVLESLANSGPALTRSLSLLATFPFPNET FQNFQRGEYANLTAIVDLTLSRIDQGLLTGTRWECHLTQLELQWGRTIGQ FPSPCTAGYRGTPGNPLTIAYRWDQGP >MT3907 polyketide synthase MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDESVPMVE LGLSSRDAVAMAADIEDLTGVTLSVAVAFAHPTIESLATRIIEGEPETDL AGDDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGIT DLPDGRWSEFLEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADN IDPQQRMALELTWEALEHARIPASSLRGQAVGVYIGSSTNDYSFLAVSDP TVAHPYAITGTSSSIIANRVSYFYDFHGPSVTIDTACSSSLVAIHQGVQA LRNGEADVVVAGGVNALITPMVTLGFDEIGAVLAPDGRIKSFSADADGYT RSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLIAPNQDAQA DVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADR PALLGAVKTNVGHLESAAGAASMAKVVLALQHDKLPPSINFAGPSPYIDF DAMRLKMITTPTDWPRYGGYALAGVSSFGFGGANAHVVVREVLPRDVVEK EPEPEPEPKAAAEPAEAPTLAGHALRFDEFGNIITDSAVAEEPEPELPGV TEEALRLKEAALEELAAQEVTAPLVPLAVSAFLTSRKKAAAAELADWMQS PEGQASSLESIGRSLSRRNHGRSRAVVLAHDHDEAIKGLRAVAAGKQAPN VFSVDGPVTTGPVWVLAGFGAQHRKMGKSLYLRNEVFAAWIEKVDALVQD ELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVIGQ SLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEY SADEIREVFSDFPDLEVCVYAAPTQTVIGGPPEQVDAILARAEAEGKFAR KFATKGASHTSQMDPLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGG EPIHDVEYWKKGLRHSVYFTHGIRNAVDSGHTTFLELAPNPVALMQVALT TADAGLHDAQLIPTLARKQDEVSSMVSTMAQLYVYGHDLDIRTLFSRASG PQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVALPDGRHVWEYAPR DGNVDLAALVRAAAAHVLPDAQLTAAEQRAVPGDGARLVTTMTRHPGGAS VQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPET PAEDADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEP EDLPWEVPLIELGLDSLMAVRIKNRVEYDFDLPPIQLTAVRDANLYNVEK LIEYAVEHRDEVQQLHEHQKTQTAEEIARAQAELLHGKVGKTEPVDSEAG VALPSPQNGEQPNPTGPALNVDVPPRDAAERVTFATWAIVTGKSPGGIFN ELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNIEALADKVRTYLEAG QIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPADTPMYGF ERVEGSIEERAQQYVPKLIEMQGDGPYVLVGWSLGGVLAYACAIGLRRLG KDVRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYE QLEELDDEGQVRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQP YDGHVTLYMADRYHDDAIMFEPRYAVRQPDGGWGEYVSDLEVVPIGGEHI QAIDEPIIAKVGEHMSRALGQIEADRTSEVGKQ >MT1701 polyketide synthase MNSTPEDLVKALRRSLKQNERLKRENRDLLARTTEPVAVVGMGCRYPGGV DSPETLWELVAHGRDAVSEFPADRGWDVAGLFDPDPDAVGKSYTRCGGFL TDVAGFDAEFFGIAPSEALAMDPQQRLLLEVSWEALERAGIDPITLRGSQ TGVFAGVFHGSYGGQGRVPGDLERYGLRGSTLSVASGRVAYVLGLQGPAV SVDTACSSSLVALHLAVQSLRLGECDLALVGGVTVMATPAMFIEFSRQRA LSADGRCKAYAGAADGTAFAEGAGVLVLARLADARRLGHPVLALVRGSAV NQDGASNGLATPNGPAQQRVITAALASARLGVADVDVVEGHGTGTTLGDP IEAQAILATYGQRPADRPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHG VLPKTLHVDVPTPHVDWSAGAVSLLTEPRPWHVPGRPRRAGVSSFGISGT NAHVILEEAPAVEPVGAAHGNDPVAVPWVLSARSAQALTNQARRLLAWVG ADENVRPLDVGWSLVNTRSLFDHRAVVVGADRTQLMEGLTGLAAGVPGAD VVAGRAQTVGKTAFVFPGQGAQWLGMGAQLCATAPVFAEHIHRCERALRE HVEWSLLDVLRGAPGAPGLDRVDVVQPALWAVMVSLAELWRSVGVVPDAV IGHSQGEIAAAYVAGALSLRDAAAVVALRSRLLVRLGGAGGMVSLACGQP QAEKLASQWGDRLNIAAVNGVSSVVLAGETDAVTELMQRCEAEGIRARRI DVDYASHSAQVDAIREELIAALRGIEPRTSTVAFFSTVTGELMDTAGVNA EYWYRSIRQPVQFERAVRNAFDGGYRVFVESSPHPVLIAGIEETLVDCDR GATGEPIVIPTLGRDDGGVGRFWLSAGQAHVAGVGVDWRAAFADLGGRRV ELPTYAFARQRFWLDGLGAVGGDLGGVGLVGAEHGLLAAVVQRPDSGGVV LTGRISVVAAPWLADHAVGPVVLFPGTGFVELALRAGDEVGCSVLQELTL QAPLVLPADGVRVQVVVGGVEQSGTRNVWVYSAAGQADSSPGWTLHAQGV LGVGSVQPAAELSVWPPVGARAMDVADGYQVLAARGYGYGPAFRGLQALW RRGAEVFADVTLPEGVPIRGFGIHPAVLDAALHAWGIVEGEQQTMLPFSW QGVCLHASGAARVRVRLAPVGRGAVSVELADPQGLPVLSVRQLMVRPVSA AALSRSTAGDRGLLEMIWTPVPLEGGDIGDDAVVWELPPHAGAQAGGDVL AAVYRGVHEVLEVLQSWLASDATGLGVVVTRGAVGPVDDDVTDLAGAAVW GLVRSAQAEHPGRVVLVDTDGSVAVEDAVGFGARSGEPQLVVRRGRVYAA RLAPVAAGLTLPSASAGGWRLVAGGGGTLADVVVAPVAPVELATGQVRVA VGAVGVNFRDVLVALGMYPGGGELGVDGAGVVVEVGPGVTGLAVGDRVMG LLGLVGSEAVVDARLVTMVPAGWSLVEAAAVPVAFLTAFYGLSVLAEVAA GQKVLVHAGTGGVGMAAVSLARYWGAEVFVTASRAKWDTLRAMGFDDIHI SDSRSLEFEEAFLRATEGSGVDVVLNSLAGEFTDASLRLLPSGGRFIELG KTDIRDGQTVAERHRGVRYRAFDLVEAGPDRIAAMLSEVVGLLAAGVLAR LPVKTFDARCAPAAYRFVSQARHIGKVVLTIPDGPGGQSGLAGGTVVVTG GTGMAGSAVATHLVRRHGVANLVLVSRSGEQADRAAEVAALLREGGAQVA VVSCDVADRDALAALLAGLDPRYPLKGVFHAAGVLDDAVITGLTPDRVDT VLRAKVDGAWNLHELTEDMDLSAFVVFSSMAGIVGTPAQGNYAAANAFLD GLVAYRRSRGLAGLSVAWGLWEQASAMTRHLGERDRARMTQAGLAPLTTE QALGFLDTALQADRAVVVAARLDRAALAGAGAALPALFSQLAAGPTRRRI DAADTAVSMSGLVSRLHALTPERRQRELTDLVISNAAAVLGRSSSVDINA HKAFQDLGFDSLTAVELRNRLKTATGLTLSPTLIFDYPTPATLAEHLDSR LVTASGSDQQSLSDRVDDITRELVVLLDQPDLSANVKAHLRTRLQTMLTS LTTEDDDIAAATESQLFAILDEELGS >MT1937 hypothetical protein MPRTNNDAWDLATSVGATATMVAAARAVATRADNPLIDDPFAEPLVRAVG IDFFTRWAAGNIKATDVDDPDGTWGLQRLADLLAARTRYFDAFFRDATSA GIRQAVILASGLDARAYR >MT3021 substrate--CoA ligase MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLGEVLRNR GLSSGDRVLLCLPDSPDLVQLLLACLARGVMAFLANPELHRDDHALAARN TEPALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYA TYTSGTTGPPKAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAY GLGNSVWFPLATGGSAVINSAPVTPEAAAILSARFGPSVLYGVPNFFARV IDSCSPDSFRSLRCVVSAGEALELGLAERLMEFFGGIPILDGIGSTEVGQ TFVSNRVDEWRLGTLGRVLPPYEIRVVAPDGTTAGPGVEGDLWVRGPAIA KGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEVIGGVNVDP REVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDL HRGLLNRLSAFKVPHRFAVVDRLPRTPNGKLVRGALRKQSPTKPIWELSL TEPGSGVRAQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAV CAEAAKMLGEPDPWSVDQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPET VGWDYGSISGLAQYLEAELAGGHGRLKSAGPVNSGATGLWAIEEQLNKVE ELVAVIADGEKQRVADRLRALLGTIAGSEAGLGKLIQAASTPDEIFQLID SELGK >MT3653 oxidoreductase, short-chain dehydrogenase/reductase family MTLAEAADAINFGLAGRVVLVTGGVRGVGAGISSVFAEQGATVITCARRA VDGQPYEFHRCDIRDEDSVKRLVGEIGERHGRLDMLVNNAGGSPYALAAE ATHNFHRKIVELNVLAPLLVSQHANVLMQAQPNGGSIVNICSVSGRRPTP GTAAYGAAKAGLENLTTTLAVEWAPKVRVNAVVVGMVETERSELFYGDAE SIARVAATVPLGRLARPADIGWAAAFLASDAASYISGATLEVHGGGEPPP YLGASSANK >MT1449 conserved hypothetical protein MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAAAGIGPG VRVLDVAAGSGNISLPAAKTGATVISTDLTPELLQRSQARAAQQGLTLQY QEANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVI SWTCEGFFGRMLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLK TARGLLEVKRFDTAQAVHDYFKNNYGPTIEAYAHIGDNAVLAAELDRQLV ELAAQYLSDGVMEWEYLLLTAEKR >MT1563 conserved hypothetical protein MRLARRARNILRRNGIEVSRYFAELDWERNFLRQLQSHRVSAVLDVGANS GQYARGLRGAGFAGRIVSFEPLPGPFAVLQRSASTDPLWECRRCALGDVD GTISINVAGNEGASSSVLPMLKRHQDAFPPANYVGAQRVPIHRLDSVAAD VLRPNDIAFLKIDVQGFEKQVIAGGDSTVHDRCVGMQLELSFQPLYEGGM LIREALDLVDSLGFTLSGLQPGFTDPRNGRMLQADGIFFRGSD >MT1991 oxidoreductase MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGDAADAAAT KIGCGAAACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLID TTVEDFDRVIAINLRGAWLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGG TGAYGMSKAGIIQLSRITAAELRSSGIRSNTLLPAFVDTPMQQTAMAMFD GALGAGGARSMIARLQGRMAAPEEMAGIVVFLLSDDASMITGTTQIADGG TIAALW >MT2344 hypothetical protein MTTVDFHFDPLCPFAYQTSVWIRDVRAQLGITINWRFFSLEEINLVAGKK HPWERDWSYGWSLMRIGALLRRTNMSLLDRWYAAIGHELHTLGGKPHDPA VARRLLCDVGVNAAILDAALDDPTTHDDVRADHQRVVAAGGYGVPTLFLD GQCLFGPVLVDPPAGPAALNLWSVVTGMAGLPHVYELQRPKSPADVELIA QQLRPYLDGRDWVSINRGEIVDIDRLAGRS >MT2447 peptide synthetase MGPVAVTRADARGAIDDVMALSPLQQGLFSRATLVAAESGSEAAEADPYV IAMAADAAGPLDIALLRDCAAAMLTRHPNLRASFLHGNLSRPVQVIPSSA EVLWRHVRAHPSEVGALAAEERRRRFDVGRGPLIRFLLIELPDECWHLVI VAHHIVIDGWSLPLFVSELLALYRAGGHVAALPAAPRPYRDYIGWLAGRD QTASRAMWADHLNGLDGPTLLSPALADTPVQPGIPGRTEVRLDREATAEL ADAARTRGVTISTLVQMAWATTLSAFTGRGDVTFGVTVSGRPSELSGVET MIGLFINTVPLRVRLDARATVGGQCAVLQRQFAMLRDHSYLGFNEFRAIA GIGEMFDTLLVYENFPPGEVVGTAEFVANGVTFRPVALESLSHFPVTVAA HRSTGELTLLVEVLDGALGTMAPESLGRRVLAVLQRLVSRWDRPLRDVDI LLDGEHDPTAPGLPDVTTSAPAVHTRFAEIAAAQPDSVAVSWADGQLTYR ELDALADRLATGLRRADVSRETPVAVALSRGPRYVAAMLAVLKAGGMIVP LDPAMPGERVAEILRQTSAPVVIDEGVFAASVGADILEDDRAITVPVDQA AYVIFTSGTTGTPKGVIGTHRALSAYADDHIERVLRPAAQRLGRPLRIAH AWSFTFDAAWQPLVALLDGHAVHIVDDHRQRDAGALVEAIDRFGLDMIDT TPSMFAQLHNAGLLDRAPLAVLALGGEALGAATWRMIQQNCARTAMTAFN CYGPTETTVEAVVAAVAEHARPVIGRPTCTTRAYVMDSWLRPVPDGVAGE LYLAGAQLTRGYLGRPAETAARFVAEPNGRGSRMYRTGDVVRRLPDGGLE FLGRSDDQVKIRGFRVEPGEIAAVLNGHHAVHGCHVTARGHASGPRLTAY VAGGPQPPPVAELRAMLLERLPRYLVPHHIVVLDELPLTPHGKIDENALA AINVTEGPATPPQTPTELVLAEAFADVMETSNVDVTAGFLQMGLDSIVAL SVVQAARRRGIALRARLMVECDTIRELAAAIDSDAAWQAPANDAGEPIPV LPNTHWLYEYGDPRRLAQTEVIRLPDRITRERLDAVLAAVVDGHEVLRCR FDRDAMALVAQPKTDILSEVWVSGELVTAVAEQTLGALASLDPQAGRLLS AVWLREPDGPGVLVLTAHVLAMDPASWRIVLGELDAGLHALAAGRAPSPA RENTSYRQWSRLLAQRAKALDSVDFWVAELEGADPPLGARRVAPQTDRVG ELAITMSISDADLTARLLSTGRSMTDLLATAAARMVTAWRRQRGQQTPAP LLALETHGRADVHVDKTADTSDTVGLLSAIYPLRIHCDGATDFARIPGSG IDYGLLRYLRADTAERLRAHREPQLLLNYLGSLHVGVGDLAVDRALLADV GQLPEPEQPVRHELTVLAALLGPADAPVLATRWRTLPDILSADDVATLQS LWQGALAEITA >MT2106 hypothetical protein MRIAALVAVSLLIAGCSREVGGDVGQSQTIAPPAPAPSAAPSTPPAAGAP ITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDA RHTSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGP FVYGNGPELANGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPF GCLKPAPPPDGVGVAFGC >MT2998 thioesterase MSMLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDYVAFS REFSADVKRIAVQYPGQHDRSGLPPLESIPTLADEIFAMMKPSARIDDPV AFFGHSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSD REMLDLFTRMTGMNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSC PIYAFIGDKDWIATQDDMDPWRDRTTEEFSIRVFPGDHFYLNDNLPELVS DIEDKTLQWHDRA >MT0183 virulence factor mce family protein MLTRFIRRQLILFAIVSVVAIVVLGWYYLRIPSLVGIGQYTLKADLPASG GLYPTANVTYRGITIGKVTAVEPTDQGARVTMSIASNYKIPVDASANVHS VSAVGEQYIDLVSTGAPGKYFSSGQTITKGTVPSEIGPALDNSNRGLAAL PTEKIGLLLDETAQAVGGLGPALQRLVDSTQAIVGDFKTNIGDVNDIIEN SGPILDSQVNTGDQIERWARKLNNLAAQTATRDQNVRSILSQAAPTADEV NAVFSGVRDSLPQTLANLEVVFDMLKRYHAGVEQLLVFLPQGAAIAQTVL TPTPGAAQLPLAPAINYPPPCLTGFLPASEWRSPADTSPRPLPSGTYCKI PQDAQLQVRGARNIPCVDVPGKRAATPKECRSKDPYVPLGTNPWFGDPNQ ILTCPAPGARCDQPVKPGLVIPAPSINTGLNPAPADQVQGTPPPVSDPLQ RPGSGTVQCNGQQPNPCVYTPTSGPSAVYSPASGELVGPDGVKYAVANSS TTGDDGWKEMLAPAS >MT1476 P49 protein MTTAVVVGAGPNGLAAAIHLARHGVDVQVLEARDTIGGGARSGELTVPGV IHDHCSAFHPLGVGSPFWAAIDLQRYGLTWKWPDVDCAHPLDDGTAGVLY RSIEATAAGLGPDGKRWQRAVGDLAAGFDELAEDLLRPVLNMPRHPIRLA RFGPRAALPATAMARRFHTERARALFGGAAAHVYTRLDRPLTASLGLMIL ASGHRHGWPVARGGSGSITKALAAALDAYGGTVATGVTVTSRRDIPDADI VMLDLSPAAVLGIYGDVMPTRINRSYRRYRAGSSAFKVDFAIEGDVGWTN PDCRRAGTVHLGGTFAEIADTERQRAQGTMVQRPFVLVGQQYLADPSRSV GNINPIWAYAHVPFGYTGDATAAVIDQIERFAPGFRDRIVATVSTSTTEL QTYNRNFIGGDIIGGANDRLQVIFRPRVAVDPYAIGVPGVYLCSQSAPPG AGIHGLCGYHAAESALRWLRKRR >MT3030 conserved hypothetical protein MKSLKLARFIARSAAFEVSRRYSERDLKHQFVKQLKSRRVDVVFDVGANS GQYAAGLRRAAYKGRIVSFEPLSGPFTILESKASTDPLWDCRQHALGDSD GTVTINIAGNAGQSSSVLPMLKSHQNAFPPANYVGTQEASIHRLDSVAPE FLGMNGVAFLKVDVQGFEKQVLAGGKSTIDDHCVGMQLELSFLPLYEGGM LIPEALDLVYSLGFTLTGLLPCFIDANNGRMLQADGIFFREDD >MT1947 conserved hypothetical protein MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLVQDEYAK HFITASADPYLEGLLANPRTSEDGTAFPRLYGVQTRFFDDFFNCADEAGI RQAVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKA HRVAVPADLRTDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFAR IDELCAPGSRVALGALGSRLDHEQLAALETAHPGVNMSGDVNFSALTYDD KTDPVEWLVEHGWAVDPVRSTLELQVGYGLTPPDVDVKIDSFMRSQYITA VRA >MT0224 substrate--CoA ligase, putative MPRGELYKRFRLVMGGIAPCGSGRRAATYPRRMQIRPYIGADKPAVILYP SGTVISFDELEARANRLAHWFRQAGLREDDVVAILMENNEHVHAVMWAAR RSGLYYVPINTHLTASEAAYIVDNSGAKAIVGSAALRETCHGLAEHLPGG LPDLLMLAGGGLVGWMTYPECVADQPDTPIEDEREGDLLQYSSGTTGRPK GIKRELPHVSPDAAPGMMPALLDFWMDADSVYLSPAPMYHTAPSVWTMSA LAAGVTTVVMEKFDAEGALDAIQRYRVTHAQFVPAMFVRMLKLPEAVRNS YDMSSLRRVIHAAAPCPVQIKEQMIHWWGPIIDEYYASSEASGSTLITAE DWLTHPGSVGKPIQGGVHIVGADGSELPPNQPGEIYFEGGYPFEYLNDPA KTAASRNKHGWVTVGDVGYLDDDGYLFLTGRRHHMIISGGVNIYPQEAEN LLVAHPKVLDAAVFGVPDDEMGQRVMAAVQTVDSADANDQFAGELLAWLR DRLSHFKCPRSIAFEPQLPRTDTGKLYKSGLVEKYSV >MT3145 P450 heme-thiolate protein MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKKLAEPPP GSGLKPVVGDAGLPILGHMIEMLRGGPDYLMFLYKTKGPVVFGDSAVLPG VAALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRR IMQEAFVRSRLAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDI ASMVFMGHEPGTDHELVTKVNKAFTITTRAGNAVIRTSVPPFTWWRGLRA RELLENYFTARVKERREASGNDLLTVLCQTEDDDGNRFSDADIVNHMIFL MMAAHDTSTSTATTMAYQLAAHPEWQQRCRDESDRHGDGPLDIESLEQLE SLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIAYPGMNHRL PEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFDQLEIK TILHRLLRRYRLELSRPDYQPRWDYSAMPIPMDGMPIVLRPR >MT1914 oxidoreductase, short-chain dehydrogenase/reductase family MPGRTSIGVKIRDKVQDKVIAITGGARGIGLATAAALHNLGAKVAIGDID EAMAKESGADLDLDMYGKLDVTDPDSFSGFLDAVERQLGPIDVLVNNAGI MPVGRIVDEPDPVTRRILDINVYGVILGSKLAAQRMVPRGRGHVINVASL AGEIYAVGVATYCASKHAVVAFTDSARLEYRSAGVKFSMVLPSFVNTELI AGTGGIKGFKNAEPADIADAIVGLIVHPKPRVRVTKAAGSMIVAQRFMPR QVSEGLNRLLGGEHVFTDDVDMEKRRTYEARARGEE >MT0789 oxidoreductase, short-chain dehydrogenase/reductase family MPRFEPHPARRTTVVAGASSGIGAATATELAGRGFPVALGARRMDKLAEL VDKIRADGGEAVAFPLDVTDPESVKSFVAQTVEALGEVELLVSSAGDMLP GQLHEVSTEAFAEQVQIHLVGANRLATAVLPAMVARRRGDLIFVGSDVGL RQRPHMGAYGAAKAGLAAMVTNLQMELEGTGVRASIVHPGPTLTGMGWQL SAEQVGPMLADWAKWGQARHNYFLRPSDLARAIAFVAETPRGCVVVNMEI QPEAPLRDAPAHRQKLVLGEEGMPG >MT1827 P450 heme-thiolate protein MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAESVQDPY PLYERMRAAGSVHRIANSDFYAVCGWDAVNEAIGRPEDFSSNLTATMTYT AEGTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQF TVQAADRLWVDGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKW GYAATQLLEGLVENDQLVAAGVALMELSGYIFEQFDRAAADPRDNLLGEL ATACASGELDTLTAQVMMVTLFAAGGESTAALLGSAVWILATRPDIQQQV RANPELLGAFIEETLRYEPPFRGHYRHVRNATTLDGTELPADSHLLLLWG AANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALARLEARIVLR LLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ >MT2450 polyketide synthase MPMSDNDPVVIVGLAIEAPGGVETADDYWTLLSEQREGLGPFPTDRGWAL RELFDGSRRNGFKPIHNLGGFLSSATTFDPEFFRISPREATAMDPQQRVG LRVAWRTLENSGINPDDLAGHDVGCYVGASALEYGPALTEFSHHSGHLIT GTSLGVISGRIAYTLDLAGPALTVDTSCSSALAAFHTAVQAIRAGDCDLA LAGGVCVMGTPGYFVEFSKQHALSDDGHCRPYSAHASGTAWAEGAAMFLL QRRSRATADRRRVLAEVRASCLNSDGLSDGLTAPSGDAQTRLLRRAIAQA AVVPADVGMVEGHGTATRLGDRTELRSLAASYGTAPAGRGPLLGSVKSNI GHAQAAAGGLGLVKVILAAQHAAIPPTLHVDEPSREIDWEKQGLRLADKL TPWRAVDGWRTAAVSAFGMSGTNSHVIVSMPDTVSAPERGPECGEV >MT1472 conserved hypothetical protein MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRA SITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDILLSGLVRAHR LGHARFLEVAMQYVSLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEH EHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWV DSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSP APTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSW LRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAA FRVQMALEVCRWMAPAVLRAKQ >MT0455 oxidoreductase, short-chain dehydrogenase/reductase family MQEAPHRVMPLVRPCPRPSDPGEYVRPMTANDNKTRKWSAADVPDQSGRV VVVTGANTGIGYHTAAVFADRGAHVVLAVRNLEKGNAARARIMAARPGAH VTLQQLDLCSLDSVRAAADALRTAYPRIDVLINNAGVMWTPKQVTKDGFE LQFGTNHLGHFALTGLVLDHMLPVPGSRVVTVSSQGHRIHAAIHFDDLQW ERRYNRVAAYGQAKLANLLFTYELQRRLGEAGKSTIAVAAHPGGSNTELT RNLPRLIRPVATVLGPLLFQSPEMGALPTLRAATDPTTQGGQYYGPDGFG EQRGHPKVVQSSAQSHDKDLQRRLWTVSEELTGVSFGV >MT3603 virulence factor mce family protein MSGGGSRRTSVRVAAALLAGLMVGSAVLTYLSYTAAFTSTDTVTVSSPRA GLVMEKGAKVKYRGIQVGKVTDISYSGNQARLKLAIDSGEMGFIPSNATV RIAGNTIFGAKSVEFIPPKTPSPKPLSPNAHVAASQVQLEVNTLFQSLID LLHKIDPLETNATLSALSEGLRGHGDDLGALLSGLNTLTRQANPKLPALQ EDFRKAAVVANVYADAAGDLNTVFDNLPTINKTIVDQKDNLNDTLLATIG LSNNAYETLAPAEQNFIDAINRLRAPLKVTSDYSPVFGCLFKGIARGVKE FAPLIGVRKAGLFTSSSFVLGAPSYTYPESLPIVNASGGPNCRGLPDIPT KQTGGSFYRAPFLVTDNALIPYQPFTELQVDAPSTLQFLFNGAFAERDDF >MT1080 oxidoreductase, short-chain dehydrogenase/reductase family MMARQRFRDQVVLITGASSGIGEATAKAFAREGAVVALAARREGALRRVA REIEAAGGRAMVAPLDVSSSESVRAMVADVVGEFGRIDVVFNNAGVSLVG PVDAETFLDDTREMLEIDYLGTVRVVREVLPIMKQQRSGRIMNMSSVVGR KAFARFAGYSSAMHAIAGFSDALRQELRGSGIAVSVIHPALTQTPLLANV DPADMPPPFRSLTPIPVHWVAAAVLDGVARRRARVVVPFQPRLLMVGDAF SPRYGDRVVRLLESKIFGRLIGSYRGSVYRHQPTESAKAQAAQPERGYSS AR >MT3445 methyltransferase, putative MSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLDLGAGTGKLTTRLV ERGLDVVAVDPIPEMLDVLRAALPQTVALLGTAEEIPLDDNSVDAVLVAQ AWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGEIIGRDGD PVRDRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCITSPAQVRT KTLDRVRQLLATHPALANSNGLALPYVTVCVRATLA >MT0707 hypothetical protein MVEKPLRADRATHSRLATFALALAAAALPLAGCSSTANPPAATTTPATAT TTTATSGPTAAPTVTTGESTTASIQIGDMLTYGSIGTTATLDCADGKSLN VAGSDNTLTVNGTCETVTVGGANNKIAFDRIDERLVVVGLDNTVTYKNGD PTIDNLGAGNRINKE >MT3895 conserved hypothetical protein MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEPLVRAVG VEFLTRWATGELDAADVDDPDAAWGLQRMTTELVVRTRYFDQFFLDAAAA GVRQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQP TADLRMVPADLRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLL DHVTDLSAPGSRLALEAFLGSADRDSARVEEMIRTATRGWREHGFHLDIW ALNYAGPRHEVSGYLDNHGWRSVGTTTAQLLAAHDLPAAPALPAGLADRP NYWTCVLG >MT0874 oxidoreductase, short-chain dehydrogenase/reductase family MRAVDGFPGRGAVITGGASGIGLATGTEFARRGARVVLGDVDKPGLRQAV NHLRAEGFDVHSVMCDVRHREEVTHLADEAFRLLGHVDVVFSNAGIVVGG PIVEMTHDDWRWVIDVDLWGSIHTVEAFLPRLLEQGTGGHVVFTASFAGL VPNAGLGAYGVAKYGVVGLAETLAREVTADGIGVSVLCPMVVETNLVANS ERIRGAACAQSSTTGSPGPLPLQDDNLGVDDIAQLTADAILANRLYVLPH AASRASIRRRFERIDRTFDEQAAEGWRH >MT1704 polyketide synthase MQPTGIAIIGLACRFPTVVSPGDLWDLLRDGREAAGSIDNVADFDADFFN LSPREASAMDPRQRLALELTWELLEDAFVVPETLRGQPIAVYLGAMNDDY AVLTLAADRVDHHAFAGTSRAIIANRVSFAFGLRGPSVTIDSGQSSSLVA VHLACESVRTGEAPLAIAGGVHLNLARETAMLEQEFGAVSPSGHTYAFDE RADGYVPGDGGGLVLLKPVQAALDDGDRIHAIIRGSAVGNAGHSATGLTV PSVAGQVDVIRRAMSGAGVDCHQVHYVEAHGTGTKIGDPIEARALGEIFA ARQRRPVSVGSVKTNIGHTGGAAGIAGLLKAVLAIENAVIPPSLNYVGAA IDLDSLGLRVDTALTPWPVADEPRRAGVSSFGMGGTNAHVILEQGPTQSP EIVESVAAAGSNAPVAVPWVLAARSPQALTNQAGRLLAHLTADDGLTALD VGWSLVSTRSVFDHRAVVVGADRGRLMAGLAGLAAGEPGAGVVVGRARSV GKTVFVFPGQGSQWLGMGRQLYGRYSVFARAFDEVVAVLDGQLRLSVRQV MWGADAGLLESTEFAQPALFVVQVALAALLQDWGVLPDLVMGHSVGEIAA AYVAGALSLVDAARVVAARGRLMQALPAGGVMVAVAASEDEVAPLLTEGV CIAAVNAPESVVISGEQAAVGVVVDRLVGLGRRVRRLAVSHAFHSVLMDP MVEEFSKVLADVCVRAPRIGLVSNVTGQLAGAGYGSPAYWVEHVRKPVRF FDGVGLAESLGARVFVEVGPGAGLEASVALLARDRPEVESVLAGVGRLFA EGVAVDWSSVFAGLGGRRVELPTYGFARQRFWLGDNGELSVDQTGKDAGA IARLQSLAPPELQRQLVELVCFHAAIVLGRKSSHDIDPECAFQDLGFDSM SGVELRNRLQMAIGLPGLSLPRTLIFDYPTASALAECLGQLLGGQHESSD DESIWQLLKNIPIHQLRRTGLLDKLLLLAGQPEESLAGRTVSDEVIDSLS PEALIGLALDEDENDIR >MT0106 dioxygenase, putative MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPR EFIKLGRIIGQIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDY MFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVST HDPRRHIKIRPSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYI CATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFIHTQHYQVGDI ILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA >MT0623 virulence factor mce family protein MRCGVSAGSANGKPNRWTLRCGVSAGHRGSVFLLAVLLAPVVLTSCTWRG IANVPLPVGRGMGPDRMTIYVQMPDTLALNTNSRVRVADVWVGTVRDISL RNWIATLTLELEPTVRLPANATAKIGQTSLLGTQHVELAAPPIPSPQPLK SGDTIGLKNSSAYPTVERTLASVALILTGGGIVNLDVIQTEILNILDGHA GQIREFLERLATFTAELNNQRGDLTRAIDSTNQLLTIIANRNDTLDRVLT DVPPLIEHFADTGQLFADATESLGRFSEVANRALAATRPNLHQTLQSLQR PLRQLERASPYVVGALKLGLTAPFNIDEVPNVIRGDYVNVSATFDVTLSA LDNALLSGTGISGMLRALEQAWGRDPDTMIPDVRYTPNPNDAPGGPLVER AE >MT1244 very-long-chain acyl-CoA synthetase, putative MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPNSKASIG TVFQDRAARYGDRVFLKFGDQQLTYRDANATANRYAAVLAARGVGPGDVV GIMLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIA ESDLVSAVAECGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKD TAFYIFTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKGSDTLYSCLPL YHNNALTVAVSSVINSGATLALGKSFSASRFWDEVIANRATAFVYIGEIC RYLLNQPAKPTDRAHQVRVICGNGLRPEIWDEFTTRFGVARVCEFYAASE GNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRVRRVPDGEP GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMG HAAFVDRLGDTFRWKGENVATTQVEAALASDQTVEECTVYGVQIPRTGGR AGMAAITLRAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKS RKVELRNQAYGADIEDPLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG >MT3005 polyketide synthase MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQELRDAGV SDKTLADPAYVRRAPLLDGIDEFDAGFFGFPPLAAQVLDPQHRLFLQCAW HALEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFD QFSLFLQNDKDFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLS GECDMALAGGSSLCIPHRVGYFTSPGSMVSAVGHCRPFDVRADGTVFGSG VGLVVLKPLAAAIDAGDRIHAVIRGSAINNDGSAKMGYAAPNPAAQADVI AEAHAVSGIDSSTVSYVECHGTGTPLGDPIEIQGLRAAFEVSQTSRSAPC VLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSPNPELRLDQ SPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHA EPAGPQVILLSAQTAAALGESRTALAAALETQDGPRLSDVAYTLARRRKH NVTMAAVVHDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRV VFLFPGQGAQHVGMAKGLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFD GTATDLERIDRSQPALFTVEYALAKLVDTFGVRAGAYIGYSTGEYIAATL AGVFDLQTAIKTVSLRARLMHESPPGAMVAVALGPDDVTQYLPPEVELSA VNDPGNCVVAGPKDQIRALRQRLTEAGIPVRRVRATHAFHTSAMDPMLGQ FQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFADE LDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDD RDTFLRALGELWSAGVEVDWTPRRPAVPHLVSLPGYPFARQRHWVEPNHT VWAQAPGANNGSPAGTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVS SVDRNANFFDLGGDSLMAISIAMAAANEGLTITPQDLYEYPTLASLTAAV DASFASSGLAKPPEAQANPAVPPNVTYFLDRGLRDTGRCRVPLILRLDPK IGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAPAEFTGLSNRSVP NGVAAGSPEERAAVLGILAELLEDQTDPNAPLAAVHIAAAHGGPHYLCLA IHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAAL ATHPAALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLS VEQTSELDDGRRRFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSV LRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIG YGLLRYVYAPTGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLP VREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAEALERTFPLAL SALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG >MT2108 polyketide synthase MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGV DSPEGLWQMVADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFV DGVADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSA TGVFAGLIVGGYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVD TACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPTVFVEFSRHRGLAP DGRCKPYAGRADGVGWSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQD GASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEA QALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHEL LPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTN AHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDD GLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRG TATAAGKTVFVFPGQGSQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDW SLVDVLRGAPGAPGLDRVDVVQPVLFAVMVSLAELWKSVAVHPDAVIGHS QGEIAAAYVAGALSLRDAARVVTLRSKLLAGLAGPGGMVSIACGADQARD LLAPFGDRVSIAVVNGPSAVVVSGEVGALEELIAVCSTKELRTRRIEVDY ASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADYWY RNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSE AIVVPTLGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAF DKRRFWLSAEGSGADVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPN VQPWLADHAVSDVVLFPGTGFVELAIRAGDEVGCSVLDELTLAAPLLLPA TGSVAVQVVVDAGRDSNSRGVSIFSRADAQAGWLLHAEGILRPGSVEPGA DLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTAMWARGEEIFAEV RLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFAWQGVSLHATG ASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSG SGPDRLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQAL AAVQSWLTDHESGVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHP GRIVLVDSDAATDDAAIAMALATGEPQVVLRGGQVYTARVRGSRAADAIL VPPGDGPWRLGLGSAGTFENLRLEPVPNADAPLGPGQVRVAMRAIAANFR DIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSVFGFFPDGSGTL VAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQRVLIHA GTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEF EDKFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPG VIAQQYPGVRYRAFDLFEPGRPRMHQYMLELATLFGDGVLRPLPVTTFDV RRAPAALRYLSQARHTGKVVMLMPGSWAAGTVLITGGTGMAGSAVARHVV ARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVACDAADRAALAK VIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHE LTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAI SLGWGLWDQASAMTGGLDAADLARLGREGVLALSTAEALELFDTAMIVDE PFLAPARIDLTALRAHAVAVPPMFSDLASAPTRRQVDDSVAAAKSKSALA HRLHGLPEAEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLT AVEMRNRLKSATGLSLSPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPA VRTTSEDPIAIVGMACRYPGGVNSPDDMWDMLIQGRDVLSEFPADRGWDL AGLYNPDPDAAGACYTRTGGFVDGVGDFDPAFFGVGPSEALAMDPQQRML LELSWEALERAGIDPTGLRGSATGVFAGVMTQGYGMFAAEPVEGFRLTGQ LSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLALA GGVTVNATPTVFVEFSRHRGLAPDGRCKPYAGRADGVGWSEGGGMLVLQR LSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGL SAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNM GHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAP RVWPAGARTRRAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVV SAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGD RDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMGMGLH AGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNSTEFAQPALFAV EVALFRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRL MQALPAGGAMVAVQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAA VADQLRADGRRVHQLAVSHAFHSPLMDPMIDEFAAVAAGIAIGRPTIGVI SNVTGQLAGDDFGSAAYWRRHIRQAVRFADSVRFAQAAGGSRFLEVGPSG GLVASIEESLPDVAVTTMSALRKDRPEPATLTNAVAQGFVTGMDLDWRAV VGEAQFVELPTYAFQRRRFWLSGDGVAADAAGLGLAASEHALLGAVIDLP ASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGV VDELTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLH AEGALRAGSAEPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGL TAMWRRGDEVFAEVALPADAGVSVTGFGVHPVLLDAALHAVVLSAESAER GQGSVLVPFSWQGVSLHAAGASAVRARIAPVGPSAVSIELADGLGLPVLS VASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQPSAAVEPLPVCAWGTT EDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAGVLVVMTRG AVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVV TTGEPQVLWRRGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENL RLELIPDADAPLGPGQVRVAVSAIAANFRDVMIALGLYPDPDAVMGVEAC GVVIETSLNKGSFAVGDRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAA ATTSVVFATAHYALVDLAAARSGQRVLIHAGTGGVGMAAVQLARHLGLEV FATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSL AGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEAG PDRIAQILAELATLFGDGVLRPLPVTTFDVRCAPAALRYLSQARHTGKVV MLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPG AAELVAELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGA LDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLV GSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQASAMTGGLATV DFKRFARDGIVAMSSADALQLFDTAMIVDEPFMLPAHIDFAALKVKFDGG TLPPMFVDLINAPTRRQVDDSLAAAKSKSALLQRLEGLPEDEQHAVLLDL VRSHIATVLGSASPEAIDPDRAFQELGFDSLTAVEMRNRLKSATGLALSP TLIFDYPNSAALAGYMRRELLGSSPQDTSAVAAGEAELQRIVASIPVKRL RQAGVLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDD E >MT0621 virulence factor mce family protein MHTAMRTLTEFNRGRVGMMGAVVTVLVVGVAQSFTSVPMLFATPTYYAQF ADTGGINTGDKVEIAGVNVGLVRSLAIRGNRVLIGFSLPGKTIGMQSRAA IRTDTILGRKNLEIEPRGSEPLKPNGFLPLAQTTTPYQIYDAFVDVTKAA TGWDIDAVKRSLNVLSETFDQTAPHLSAALEGVKAFSDTVGRRGEQIEQL LANANRIARVLGDRSEQVNGLLVNAKTLLAAFKQRSQALRILLTNVSEAS AQVSGLITDNPNLNHVLAQLRTVSEELVKRKNELADVAVLLGRYTAALTE AVGSGPFFKAMVVNLLPYQILQPWVDAAFKKRGIDPENFWRSAGLPEFRW PDPNGTRFPNGAAGGATGAGGYTQASGTGRPAGNAVLLHTGGGRVATARH PTTLRGRHRWPVRWTRLPGTARCPAVAALIPMGRRRRRAS >MT1565 hypothetical protein MAQARRRDAEPQGARGCVALPAPTRLSSLTMSTNPGPAEGANQVMAQEHS AGAVQFTAHNVRLDDGTLTIPESSRTLDESSWFISARGILETVFPGDKSH LRLADVGCLEGGYAVGFARMGFQVLGIEVRELNMAACNYIKSKTNLPNLR FVHDNALNIANHGLFDTVFCCGLFYHLENPKQYLETLSSVTNKLLILQTH FSIINRSDKWLRLPTTARQLTDRLLRRPAPVKFMLSAPTEHEGLPGRWFT EFSDDRSFGQRDTAKWASWDNRRSFWIQREHLLQAIKDVGVDLVMEEYDN LEPSIAESLLGGSYAANLRGTFIGIKTR >MT1088 medium-chain-fatty-acid--CoA ligase, putative MYGTMQDFPLTITAIMRHGCGVHGRRTVTTATGEGYRHSSYRDVGQRAGQ LANALRRLGVTGDQRVATFMWNNTEHLVTYFAVPSMGAVLHTLNIRLFPE QIAYVTNEAEDRVILVDLSLARLLAPVLPKLDTVHTVIAVGEGDTTPLRE AGKTVLRFAELIDAESPDFGWPQIDENSAAAMCYTSGTTGNPKGVVYSHR SSFLHTMAACTTNGIGVGSSDKVLPIVPMFHANGWGLPYAALMAGADLVL PDRHLDARSLIHMVETLKPTLAGAVPTIWNDVMHYLEKDPDHDMSSLRLV ACGGSAVPESLMRTFEDKHDVQIRQLWGMTETSPLATMAWPPPGTPDDQH WAFRITQGQPVCGVETRIVDDDGQVLPNDGNAVGEVEVRGPWIAGSYYGG RDESKFDSGWLRTGDVGRIDEQGFITLTDRAKDVIKSGGEWISSVELENC LIAHPDVLEAAVVGVPDERWQERPLAVVVVREGATVSAGDLRAFLADKVV RWWLPERWAFVDEIPRTSVGKYDKKAIRSRYAEGAYQITEVHT >MT2981 hypothetical protein MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGVVATVAA GALDETGCPEVVDAAGKWVVPGFIDVHTHYDAEVLLDPGLRESVRHGVTT VLLGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYI EAIDALPLGPNVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLL DEALEAGMLGMSGMDAAIDKLDGDRFRSRALPSTFATWRERRKLISVLRH RGRILQSAPDVDNPVSALLFFLASSRIFNRRKGVRMSMLVSADAKSMPLA VHVFGLGTRVLNKLLGSQVRFQHLPVPFELYSDGIDLPVFEEFGAGTAAL HLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDAVIVECPDK SLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNK LAAEPSVHMGFSDAGAHLRNMAFYNFGLRLLKRARDADRAGQPFLSIERA VYRLTGELAEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYY GGLRRMVNRNDATVVATGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRA GELGAALSRSA >MT3908 acyl-CoA synthase, putative MAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKLAYRFLDFSTERDGV ARDILWSDFSARNRAVGARLQQVTQPGDRVAIXCPQNLDYLISFFGALYS GRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVRKFIRARSA KERPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRIPSGVQITH LNLPTNVVQVLNALEGQEGDRGVSWLPFFHDMGLITVLLASVLGHSFTFM TPAAFVRRPGRWIRELARKPGETGGTFSAAPNFAFEHAAVRGVPRDDEPP LDLSNVKGILNGSEPVSPASMRKFFEAFAPYGLKQTAVKPSYGLAEATLF VSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPNAVAQVSAGKVGVSEWA VIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFKNILKSRIS ESRAEGAPDDALWVRTGDYGTYFKDHLYIAGRIKDLVIIDGRNHYPQDLE CTAQESTKALRVGYAAAFSVPANQLPQTVFDDSHAGLKFDPEDTSEQLVI VGERAAGTHKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGTIPRTSSGK IGRRACRAAYLDGSLRSGVGSPTVFATSD >MT0921 oxidoreductase, putative MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGGAAVSIQ AFDGVEVALSRYSYLVSLLPSRIVADLGAPVRLARRPFSSYTPAPATAGR SGLLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPL RTREQARRDIVEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATD ALIGTFARMHEPSLMQNICFLYHLVGGGTGVWHVPIGGMGSVTSALATAA ARHGAEIVTGADVFALDPDGTVRYHSDGSDGAEHLVRGRFVLVGVTPAVL ASLLGEPVAALAPGAQVKVNMVVRRLPRLRDDSVTPQQAFAGTFHVNETW SQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAGAQTLTVFG LHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIET TTTLDLQRTLGMTGGNIFHGALSWPFADNDDPLDTPARQWGVATDHERIM LCGSGARRGGAVSGIGGHNAAMAVLACLASRRKSP >MT3011 acyl-CoA synthase MSVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQLYRRTLN VAQELSRCGSTGDRVVISAPQGLEYVVAFLGALQAGRIAVPLSVPQGGVT DERSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLD APNGYTFKEDEYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYF ADTDGIPPPNSALVSWLPFYHDMGLVIGICAPILGGYPAVLTSPVSFLQR PARWMHLMASDFHAFSAAPNFAFELAARRTTDDDMAGRDLGNILTILSGS ERVQAATIKRFADRFARFNLQERVIRPSYGLAEATVYVATSKPGQPPETV DFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTCIECPDGTV GEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFV TDGKMFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAISVPGDRST EKLVAIIELKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAP GSIPITTSGKVRRGACVEQYRQDQFARLDA >MT3652 oxidoreductase, short-chain dehydrogenase/reductase family MPTSDEQRSNGVMGLVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDI GVGLDGSPASGGSAAQDVVDEILAAGGQAVADGSDISDWDQAANLIQAAV ETYGGVDVLVNNAGIVRDRMIANTSEEEFDAVIAVHLKGHFATMRHAASH WRGLSKAGKAPKDIDARIINTSSGAGLQGSVGQGNYSAAKAGIAALTLVG AAEMRRYGVTVNAIAPAARTRMTETVFAEMMAKPQEGFDAMAPENVSPLV VWLGSAESRDVTGKVFEVEGGIIRVAEGWAHGPQVDKGVKWDPAELGPVV SDLLAKSRPPVPVYGA >MT1187 O-methyltransferase, putative MSAHKPAKQRVALTGVSETALLTLNARAAEARRRDAIIDDPMAVALVESI DFDFAKFGPTGQGFALRARAFDMAAQHYLDQHPAATVVALAEGLQTSFWR LDVAIPGGQFRWLTVDLPPIVDLRTRLLPSSPRVSVCAQSALDYSWMDSV DPAGGVFITAEGLLMYLQPEQALGLIAQCAQTFPGGQMLFDLPPRWFAGW SRLGLRTSLRYKVPRMPFSMSVAQAADLVNKVPGVVAVRDLRVPPGRGLW VNMALSTVYRLPVFDPLRPCLTLLEFSRPARG >MT1041 polyketide synthase, putative MFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAGGLAAAGVGLGDVV GVLAGFPVEIAPTAQALWMRGASLTMLHQPTPRTDLAVWAEDTMTVIGMI EAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVGEDDLALM QLTSGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSWLPCFHDM GMVGFLTIPMFFGAELVKVTPMDFLRDTLLWAKLIDKYQGTMTAAPNFAY ALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDLLDAGKPFGLRP SAILPAYGMAETTLAVSFSECNAGLVVDEVDADLLAALRRAVPATKGNTR RLATLGPLLQDLEARIIDEQGDVMPARGVGVIELRGESLTPGYLTMGGFI PAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPTDIERAAG RVDGVRPGCAVAVRLDAGHSRESFAVAVESNAFEDPAEVRRIEHQVAHEV VAEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT >MT0684 ABC transporter, ATP-binding protein MGVSIEVNGLTKSFGSSRIWEDVTLTIPAGEVSVLLGPSGTGKSVFLKSL IGLLRPERGSIIIDGTDIIECSAKELYEIRTLFGVLFQDGALFGSMNLYD NTAFPLREHTKKKESEIRDIVMEKLALVGLGGDEKKFPGEISGGMRKRAG LARALVLDPQIILCDEPDSGLDPVRTAYLSQLIMDINAQIDATILIVTHN INIARTVPDNMGMLFRKHLVMFGPREVLLTSDEPVVRQFLNGRRIGPIGM SEEKDEATMAEEQALLDAGHHAGGVEEIEGVPPQISATPGMPERKAVARR QARVREMLHTLPKKAQAAILDDLEGTHKYAVHEIGQ >MT1583 hypothetical protein MSDPLTAQEQHKRRQAVRELMPRTPFIGGLGIVFERYEPDDVVIRLPFRT DLTNDGTYFHGGVIASVMDTAGAAAAWSNHDFDRGTRAATVAMSIQYTGA AKRCDLLCHARTARRRKELTFTEITATDPDGNIVAHAVQTYRIV >MT1572 acyl-CoA synthase MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWSQLYRRT LNLAAQLREHGSTGDRALILAPQSLDYVVSFIASLQAGIVAVPLSIPQGG AHDERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLD LDARPSSGSRSAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIM TSYYGVYGKVAPPGSTVVSWLPFYHDMGFVLGLILPILAGIPAVLTSPIG FLQRPARWIQMLASNTLAFTAAPNFAFDLASRKTKDEDMEGLDLGGVHGI LNGSERVQPVTLKRFIDRFAPFNLDPKAIRPSYGMAEATVYVATRKAGQP PKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVDPDTGIERP AGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGD SGFLSEGELFIMGRIKDLLIVYGRNHSPDDIEATIQTISPGRCAAIAVSE HGAEKLVAIIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVL VSPGSIPITTSGKIRRAQCVELYRQDEFTRLDA >MT3000 polyketide synthase MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSRDAVVLS GELSELLGRTVSPIDFWEHPTINALAAYLAAPEPSPDSDAAVKRGARNSL DEPIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPP EVAAALARTTRWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWE ALEHAGIPPGTLRRSATGVFAGACLSEYGAMASADLSQVDGWSNSGGAMS IIANRLSYFLDLRGPSVAVDTACSSSLVAIHLACQSLRTQDCHLAIAAGV NLLLSPAVFRGFDQVGALSPTGQCRAFDATADGFVRGEGAGVVVLKRLTD AQRDGDRVLAVICGSAVNQDGRSNGLMAPNPAAQMAVLRAAYTNAGMQPS EVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTE AAAGIAGFIKTVLAVQHGQIPPNQHFETANPHIPFTDLRMKVVDTQTEWP ATGHPRRAGVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGK TMQRVSATAGMLADWMEGPGADVALADVAHTLNHHRSRQPKFGTVVARDR TQAIAGLRALAAGQHAPGVVNPAEGSPGPGTVFVYSGRGSQWAGMGRQLL ADEPAFAAAVAELEPVFVEQAGFSLHDVLANGEELVGIEQIQLGLIGMQL ALTELWCSYGVRPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRSRLMA PLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQIDE LIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGII STTYADLHTQPVFDAEHWATNMRNPVHFQQAIASAGSGADGAYHTFIEIS AHPLLTQAIIDTLHSAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHP PHTPHPPEPHPPIPTTPWQHTRHWITTKYPAGSVGSAPRAGTLLGQHTTV ATVSASPPSHLWQARLAPDAKPYQGGHRFHQVEVVPASVVLHTILSAATE LGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSPAAGTPSDRWTRH VTAQLSSSPSDSASSLNEHHRANGQPPERAHRDLIPDLAELLAMRGIDGL PFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADS RLYVPASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPS LSMRSLRYRALDFGLDVGRAQPPASTGPVEAYCDATNFVHTIDWQPQTVP DATHPGAEQVTHPGPVAIIGDDSAALCETLEGAGYQPAVMSDGVSQARYV VYVADSDPAGADETDVDFAVRICTEITGLVRTLAERDADKPAALWILTRG VHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDLAINDDLGEFGPALA ELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYLITGGLG ALGLLMADWLADRGAHRLVLTGRTPLPPRRDWQLDTLDTELRRRIDAIRA LEMRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQL VTSMTGDAVRQVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQ GSYAAANSYLDALARARRQQGCHTMSLDWVAWRGLGLAADAQLVSEELAR MGSRDITPSEAFTAWEFVDGYDVAQAVVVPMPAPAGADGSGANAYLLPAR NWSVMAATEVRSELEQGLRRIIAAELRVPEKELDTDRPFAELGLNSLMAM AIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHDVSQDNQISAL SSSAGSVLDSLFDRIESAPPEAERSV >MT0144 P450 heme-thiolate protein MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVRRYGKAF TANILMYGRVVVVADPQLARQVFTSSPEELGNIQPNLSRMFGSGSVFALD GDDHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSM MHITLNAILRAIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRL SPWGRLAEWRRQYDTVIDKLIEAERADPNFADRTDVLALMLRSTYDDGSI MSRKDIGDELLTLLAAGHETTAATLGWAFERLSRHPDVLAALVEEVDNGG HELRQAAILEVQRARTVIDFAARRVNPPVYQLGEWVIPRGYSIIINIAQI HGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAAFANMEMDV VLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR >MT1895 conserved hypothetical protein MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLLQLTGVV HGGVYCAMIESIASMAAFAWLNSHGEGGSVVGVNNNTDFVRSISSGMVYG TAEPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP >MT1595 oxidoreductase, short-chain dehydrogenase/reductase family MQGRNLNDAVKGKVVLITGGSSGIGAAAAKKIAEAGGTVVLVARTLENLE NVANDIRAIRGNGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNA GRSIRRSLELSYDRIHDYQRTMQLNYLGAVQLILKFIPGMRERHFGHIVN VSSVGVQTRAPRFGAYIASKAALDSLCDALQAETVHDNVRFTTVHMALVR TPMISPTTIYDKFPTLTPDQAAGVITDAIVHRPRRASSPFGQFAAVADAV NPAVMDRVRNRAFNMFGDSSAAKGSESQTDTSELDKRSETFVRATRGIHW >MT2863 EntD-related protein MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARSVAKRRN EFITVRHCARIALDQLGVPPAPILKGDKGEPCWPDGVVGSLTHCAGYRGA VVGRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDR ILFCAKEATYKAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGS TLSGPPLTTLRGRWSVERGLVLTAIVL >MT0586 conserved hypothetical protein MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQ PELAALIVQGKFRGDVLDVGCGEAAISLALAERGHTTVGLDLSPAAVELA RHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYL QSIVRAAAPGASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIK PARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHLG >MT1223 hypothetical protein MRIAGVGLGQLLLALDATVVSLVDAPRGLDLPVASTALIDSDDVRLGLAA AAGSADVFFLIGVTDDEAVRWVDDQARQRAPVAIFVKHPSDSVVAGAVRA GSAVVAVEPRARWERLYHLVNHVLEHHGDRADPTDDSGTDLFGLAQSLAD RIHGMISIEDAQSHVLAYSASNDEADELRRLSILGRAGPPEHLQWIGQWG IFDALRAGREVVRVAERPELGLRPRLAIGIHQPGVGALRPPVFAGTIWVQ QGSQPLADDAEEMLRGAAVLAARIMSRLATQPNTHALRVQQLLGLAELNA TTAPVDVSTIARELGVAAEGNATLIGFDTAENRDTAVRHVRLVDVMALSA SAFRHDAQVAANGSRIYVLLPQTTTGRAVTSWVRGTISALRAELGVALRA AIAGPVAGLAEVNPARVEVDRVLESAERHPILGQVTSLAEARTTVLLDEI VTLVGTDQRLVDPRIRDLGAQDPVLAQTLRAYLDAFGDIGAAARSLQVHP NTVRYRIRRIEQLLSTSLGDPDVRLLFSLGLRAMERTA >MT1550 mmcH protein, putative MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMYRVQERI LTEIGKERLARAGELGVLRLMMKYDPHFFTFLEIPEVLSIVDRVLSETAI LHLQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFT RDTGATLVVPGSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAG RNTSGKDRLAINHQFTRSFFKQQIDYVRALGDAVVLEQPARTQQLLGWYS RVVTNLDEYYQPPDKRLYRKGQG >MT0418 polyketide synthase MTDGSVTADKLQKWFREYLSTHIECHPNEVSLDVPIRDLGLKSIDVLAIP GDLGDRFGFCIPDLAVWDNPSANDLIDSLLNQRSADSLRESHGHADRNTQ GRGSINEPVAVIGVGCRFPGDIDGPERLWDFLTEKKCAITAYPDRGFTNA GTFAESGGFLKDVAGFDNRFFDIPPDEALRMDPQQRLLLEVSWEALEHAG IIPESLRLSRTGVFVGVSSTDYVRLVSASAQQKSTIWDNTGGSSSIIANR ISYFLDIQGPSIVIDTACSSSLVAVHLACRSLSTWDCDIALVGGTNVLIS PEPWGGFREAGILSQTGCCHAFDKSADGMVRGEGCGVIVLQRLSDARLEG RRILAILTGSAVNQDGKSNGIMAPNPSAQIGVLENACKSARVDPLEIGYV EAHGTGTSLGDRIEAHALGMVFGRKRPGSGPLMIGSIKPNIGHLEGAAGI AGLIKAVLMVERGSLLPSGGFTEPNPAIPFTELGLRVVDELQEWPVVAGR PRRAGVSSFGFGGTNAHVIVEEAGSVGADTVSGRADVGGSGGGVVAWVIS GKTASALAAQAGRLGRYVRARPALDVVDVGYSLVSTRSVFDHRAVVVGQT RDELLAGLAGVVAGRPEAGVVCGVGKPAGKTAFVFAGQGSQWLGMGSELY AAYPVFAEALDAVVDELDRHLRYPLRDVIWGHDQDLLNTTEFAQPALFAV EVALYRLLMSWGVRPGLVLGHSVGELAAAHVAGALCLPDAAMLVAARGRL MQALPAGGAMFAVQAREDEVAPMLGHDVSIAAVNGPASVVISGAHDAVSA IADRLRGQGRRVHRLAVSHAFHSALMEPMIAEFTAVAAELSVGLPTIPVI SNVTGQLVADDFASADYWARHIRAVVRFGDSVRSAHCAGASRFIEVGPGG GLTSLIEASLADAQIVSVPTLRKDRPEPVSVMTAAAQGFVSGMGLDWASV FSGYRPKRVELPTYAFQHQKFWLAPAPSVSDPTAAGQIGASDGGAELLAS SGFAARLAGRSADEQLAAAIEVVCEHAAAVLGRDGAAGLDAGQAFADSGF NSLSAVELRNRLTAVTAVTLPATAIFDHPTPTELAQYLITQIDGHGSSAA AAANPAERIDALTDVFLQACDAGRDADGWKMVALASNTRERMSSPVRNNV SKNVALLADGISDVVVICIPTLTVLSDQREYRDIANAMTGRHSVYSLTLP GFDSSDALPQNADMIVETVSNAIIDVVGGSCRFVLSGYSSGGVLAYALCS HLSVKHQRNPLGVALIDTYLPSQIANPSMNEGFSPNDTGKGLSREVIRVA RMLNRLTATRLTAAATYAAIFQAWEPGRSMAPVLNIVAKDRIATVENLRE ERINRWRTAAAEAAYSVAEVPGDHFGMMSTSSEAIATEIHDWISGLVRGP HP >MT1177 oxidoreductase, short-chain dehydrogenase/reductase family MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVGGLGDRA RFAQADVTDEAAVSNALELADSLGPVRVVVNCAGTGNAIRVLSRDGVFPL AAFRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQ IGQAAYSASKGGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAE AKASLGQQVPHPSRLGNPDEYGALVLHIIENPMLNGEVIRLDGAIRMAPR >MT0572 oxidoreductase, short-chain dehydrogenase/reductase family MSKRPLRWLTEQITLAGMRPPISPQLLINRPAMQPVDLTGKRILLTGASS GIGAAATKQFGLHRAVVVAVARRKDLLDAVADRITGDGGTAMSLPCDLSD MEAIDALVEDVEKRIGGIDILINNAGRSIRRPLAESLERWHDVERTMVLN YYAPLRLIRGLAPGMLERGDGHIINVATWGVLSEASPLFSVYNASKAALS AVSRIIETEWGSQGVHSTTLYYPLVATPMIAPTKAYDGLPALTAAEAAEW MVTAARTRPVRIAPRVAVAVNALDSIGPRWVNALMQRRNEQLNP >MT1705 chalcone/stilbene synthase family protein MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAAAKVNGR HLVLPLQQYPSLTDFGDANEIFIEKAVDLGVEALLGALDDANLRPSDIDM IATATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRD YLRGAPDDVAVLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGD RRAEQVRAGGPDILDSRSSLYPDSLHIMGWDVGSHGLRLRLSPDLTNLIE RYLANDVTTFLDAHRLTKDDIGAWVSHPGGPKVIDAVATSLALPPEALEL TWRSLGEIGNLSSASILHILRDTIEKRPPSGSAGLMLAMGPGFCTELVLL RWR >MT0110 peptide synthetase, putative MASRAGGCVHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARF LAALHATVLDNPVQLCVLENSGADYPDLVPRLRFGDIVRVGSADEHLQST WCSGILGKPLVRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARY LTTDPAGETPSVGAGLAKLREAHRRETAKVEESRGRLSAVVQRELADEAY HGGHGHSVSDAPGTAAKGVLHESATICGNAFDAILTLSEAQRVPLNVLVA AAAVAVDASLRQNTETLLVHTVDNRFGDSDLNVATCLVNSVAQTVRFPPF ASVSDVVRTLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVEALTLNFIR EPCAPGLRPFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLP ACKTHPKVAERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPA RQPSTPAWFLDSARGVHQFLGRRRFVYPWVAWLVQRGAAPGDVLVFTDDD TDKTIDLLIACHLAGCGYSVCDTADEISVRTNAITEHGDGILVTVVDVAA TQLAVVGHDELRKVVDERVTQVTHDALLATKTAYIMPTSGTTGQPKLVRI SHGSLAVFCDAISRAYGWGAHDTVLQCAPLTSDISVEEIFGGAACGARLV RSAAMKTGDLAALVDDLVARETTIVDLPTAVWQLLCADGDAIDAIGRSRL RQIVIGGEAIRCSAVDKWLESAASQGISLLSSYGPTEATVVATFLPIVCD QTTMDGALLRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVT AADGSRRRAFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRI AEDPAVSDVAVELHSGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVS SFFVVGVPNIPRKPNGKIDSDNLPRLPQWSAAGLNTAETGQRAAGLSQIW SRQLGRAIGPDSSLLGEGIGSLDLIRILPETRRYLGWRLSLLDLIGADTA ANLADYAPTPDAPTGEDRFRPLVAAQRPAAIPLSFAQRRLWFLDQLQRPA PVYNMAVALRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPRQLVI EARRADLGCDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFRIA DDEHVLVAVAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYV DYTLWQREILGDLDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVA DQRGASLVVDWPASVQQQVRRIARQHNATSFMVVAAGLAVLLSKLSGSPD VAVGFPIAGRSDPALDNLVGFFVNTLVLRVNLAGDPSFAELLGQVRARSL AAYENQDVPFEVLVDRLKPTRALTHHPLIQVMLAWQDNPVGQLNLGDLQA TPMPIDTRTARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEAQAIDVL IERLRKVLVAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPV SIPQMLAAQVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPG ECVALLFERCAPAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVA AVTTAGLRSRLAGHDLPIIDVVDALAAYPGTPPPMPAAVNLAYILYTSGT TGEPKGVGITHRNVTRLFASLPARLSAAQVWSQCHSYGFDASAWEIWGAL LGGGRLVIVPESVAASPNDFHGLLVAEHVSVLTQTPAAVAMLPTQGLESV ALVVAGEACPAALVDRWAPGRVMLNAYGPTETTICAAISAPLRPGSGMPP IGVPVSGAALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRF VACPFGGSGARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVA TALAELAGVGQAVVIAREDRPGDKRLVGYATEIAPGAVDPAGLRAQLAQR LPGYLVPAAVVVIDALPLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKT VAGIFARVLGLERVGVDDSFFELGGDSLAAMRVIAAINTTLNADLPVRAL LHASSTRGLSQLLGRDARPTSDPRLVSVHGDNPTEVHASDLTLDRFIDAD TLATAVNLPGPSPELRTVLLTGATGFLGRYLVLELLRRLDVDGRLICLVR AESDEDARRRLEKTFDSGDPELLRHFKELAADRLEVVAGDKSEPDLGLDQ PMWRRLAETVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKLKP FTYVSTADVGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEV LLREANDLCALPVAVFRCGMILADTSYAGQLNMSDWVTRMVLSLMATGIA PRSFYEPDSEGNRQRAHFDGLPVTFVAEAIAVLGARVAGSSLAGFATYHV MNPHDDGIGLDEYVDWLIEAGYPIRRIDDFAEWLQRFEASLGALPDRQRR HSVLPMLLASNSQRLQPLKPTRGCSAPTDRFRAAVRAAKVGSDKDNPDIP HVSAPTIINYVTNLQLLGLL >MT2446 hypothetical protein MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGANWQASG GWTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATA SFAEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWAL CTHETTVQADALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAE RVAVIGGGETAASMLNELFRHRVSTITVISPQVTLFTRGEGFFENSLFSD PTDWAALTFDERRDALARTDRGVFSATVQEALLADDRIHHLRGRVAHAVG RQGQIRLTLSTNRGSENFETVHGFDLVIDGSGADPLWFTSLFSQHTLDLL ELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGPGFPNLSCL GLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR >MT3610 4-coumarate-CoA ligase, putative MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAAALRERL DPARPPHVGVLLQNTPFFSATLVAGALSGIVPVGLNPVRRGAALAGDIAK ADCQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLAD LFMLIFTSGTSGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLF HSNAVLVGWAVAAACQGSMALRRKFSASQFLADVRRYGATYANYVGKPLS YVLATPELPDDADNPLRAVYGNEGVPGDIDRVGRRFGCVVMDGFGSTEGG VAITRTLDTPAGALGPLPGGIQIVDPDTGEPCPTGVVGELVNTAGPGGFE GYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWMRVDGENLG TAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRA FLTEQPDLGHKQWPSYVRVSAGLPRTMTFKVIKRQLSAEGVACADPVWPI RR >MT1470 substrate--CoA ligase, putative MRIRQAFGLIATMRRAGLIAPLRPDRYLRIVAAMRREGMGFTAGFAGAAR RCPDRPGLIDELGTLTWRQLDERGNALAAALQALPAGPPRVVGIMCRNHR GFVDALLAVNRIGAHILLLNTSFAGPALAEVVTREGVDTVVYDEEFSATV DRALAEKPQATRIVAWTDEDHDLTVEKLVAAHAGRRPEHTGSHGKVILLT SGTTGTPKGARHSGGGIGTLKAILDRTPWRAEEVTVIVAPMFHAWGFSQL VLASSLACTIVTRRRFDPEATLDLIDRHHATGLVVVPVMFDRIMDLPAEI RNRYDGRSLRFAAASGSRMRPDVVIAFMDQFGDVIYNNYNATEAGMIATA TPADLRTAPDTAGRPAEGTEIRILDQQFTEVPTGEVGTIYVRNDSQFDGY TSGAAKDFHAGFMSSGDVGYLDENGRLFVVGRDDEMIVSGGENIYPIEVE KTLATHPDVAEAAVIGVDDQQYGQRLAAFVVLKPGVSATPETLKQHVRDN LANYKVPRDIAVLDELPRGITGKILRTELQSRVGS >MT0919 conserved hypothetical protein MTAPPIAVERNTRSKVRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAA DALFLWAETPTRPLHVGALAVLSQPDNGTGRYLRKVFSAAVARQQVAPWW RRRPHRSLTSLGQWSWRTETEVDLDYHVRLSALPPRAGTAELWALVSELH AGMLDRSRPLWQVDLIEGLPGGRCAVYVKVHHALADGVSVMRLLQRIVTA DPHQRQMPTLWEVPAQASVAKHTAPRGSSRPLTLAKGVLGQARGVPGMVR VVADTTWRAAQCRSGPLTLAAPHTPLNEPIAGARSVAGCSFPIERLRQVA EHADATINDVVLAMCGGALRAYLISRGALPGAPLIAMVPVSLRDTAVIDV FGQGPGNKIGTLMCSLATHLASPVERLSAIRASMRDGKAAIAGRSRNQAL AMSALGAAPLALAMALGRVPAPLRPPNVTISNVPGPQGALYWNGARLDAL YLLSAPVDGAALNITCSGTNEQITFGLTGCRRAVPALSILTDQLAHELEL LVGVSEAGPGTRLRRIAGRR >MT3029 hypothetical protein MDLVQFQDVRLMRVVVCRRLGPAKGQRRWRPLDLGTTGCFENLGAQRPTY RMRAIRMLECAMPNRLVRSLQRWRPFGLPPHRWRLAPWYWRGLQVTLEPG SAIAWIVRLTGGFEETEIDIAAALYSALYPDRCILDVGANVGIHSLAWAR LAPVVALEPAPGTHSRLEANVAANGLQDRIRTLRTAAGDAVGEVDFFVAA DSAFSSLNDTGRIRIRERTRVPCTTLDALAAELPLPVGLLKIDVEGLERA VIAGAAELLRRDRPVLLVEIYGGAASNPDPERTIADIRAYGYEPFVYADD AGLQPYQRHRDDRYCYFFIPSRKG >MT3514 dioxygenase, putative MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKVVFFRGQ HQLDDAEQLAFAGLLGTPIGHPAAIALADDAPIITPINSEFGKANRWHTD VTFAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWA LHTNRYDYVTTKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLA GDFVRSFVGLDSHESRVLFEVLQRRITMPENTIRWNWAPGDVAIWDNRAT QHRAIDDYDDQHRLMHRVTLMGDVPVDVYGQASRVISGAPMEIAG >MT2328 P450 heme-thiolate protein MGLNTAIATRVNGTPPPEVPIAGIELGSLDFWALDDDVRDGAFATLRREA PISFWPTIELPGFVAGNGHWALTKNDDVFYASRHPDIFSSYPNITINDQT PELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVS SMIANNPDRQADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGF GDPDLATDFDEFMQVSADIGAYATALAEDRRVNHHDDLTSSLVEAEVDGE RLSSREIASFFILLVVAGNETTRNAITHGVLALSRYPEQRDRWWSDFDGL APTAVEEIVRWASPVVYMRRTLTQDIELRGTKMAAGDKVSLWYCSANRDE SKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVAFDELRRQM PDVVATEEPARLLSQFIHGIKTLPVTWS >MT3928 conserved hypothetical protein MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQAQHLRR YRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDTYHSWFEF DNAEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFG IIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPA GRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHFPLPLGDLSV PHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAALAERELTNCE TFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSAARVAQISFD SGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAPLSTVANSD LNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMK SIYIRTADGTLATLKPGT >MT1702 polyketide synthase MSGTTTHVDYLKRLTADLRRTRRRLSDLEAKLSEPVAVVGMGCRYPGGVD SPETLWELVAQGRDAVSDFPADRGWDVDGLFDPDPDACGKMYTRRGTFLE HAGDFDAGFFGIGPSEALAMDPQQRLLLEVSWEALERTGIDPTKLRGSAT GVFAGVIHAGYGGQLSGELEGYGLTGSTLSVASGRVAYVLGLEGPAVSVD TACSSSLVALHLAVQSLRSGECDLALAGGVTVMATPAAFVEFSRQRALAR DGRCKVYAGAADGTAWSEGAGVLVVERLVDARRLGHPVLALVRGSAVNQD GASNGLTAPNGPSQQRVIRAALASARLRAVEVDVVEGHGTGTMLGDPIEA QALLATYGQDRVEPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHGVMPK TLHVDVPTPHVDWSVGAVSLLTQPRAWSVHGRPRRAGVSSFGISGTNAHV ILEQAPVVESVVPEVASPTAASAVPWVLSARSEQALAGQAQRLLAFVAAN PDLDPIDVGWSLVKTRAMFEHRAVVVGADRGALLAGLAALAAGESGAGVA VGRARSVGKTVFVFPGQGAQWVGMGAQLYAELPLFALAFDAVAEELDRHL RLPLRNVLWEGDEALLTSTEFAQPALFAIEVALATLLQHWGISPDFLIGH SVGEIAAAHLAGVLSLTDAAGLVAARGRLMAELPAGGVMVVVAASEEEVL PVLVDGANLAAVNAPHSVVVSGCEAAVSDIADHFARRGRRVHRLAVSHAF HSLLMEPMLAEFTRIAAGISVSKPRIPLVSNVTGQMAGAGYGDGQYWVEH ARRPVRFAEGVQLLNAVGATRFVEVGPGGGLTALVEQSLPLGEALSVAMM RREHPEVSSVLGAVATLFTAGAQMDWPAVFGSPGRRIELPTYAFQRQRYW LPPTSAGSADISGVGLLAARHGLLGAVVEQPDSDVVVLTGRLSVGEQRWL ADHVIAGVVLLAGAAFVELALRAADQVDCGVVEELTVVTPLVLPTVGGVQ LQVVVGVGEMGQRPVSIYSRNAESDSGWVLHARGVLGAKAVAPAADLSVW PPLGAAPVDVDGAYQRFAELGYEYGRAFQGLTAMWRRESELFADVAVPDD VDVTLSGFGIHPLVLDAALHAMGMVGEQAATMLPFSWQGVSLHAAGASRV RARIAPAGDGTVSVELADQAGLPVLSVQALVMRSVSSQLLSAAVAAADAA GRGLLEVAWLPVELAHNDISADLVVWELESFQDGVGPVYSATHRVLVALQ SWLAQERAGRLVVLTQGSVGQDATNLAGAAVWGLVRSAQAEHPGRVMLVD SDGSMDVGDVIGCGEEQLMIRNGTAYAARLAQLRPQPILQLPDTNSGWRL VAGGAGTLEDLTLASCPAKELAPGQVRIEVRALGVNFRDVLVALGIYPGA AELGAEGAGVVTEVGPGVTGLAVGDPVMGLLGVAGSEAVVDARLVVKLPN RWPLTDAAGVPVVFLTAYYALRVLAQVQPGESVLVHAAAGGVGMAAVQLA RLWGLEVFATASRGKWDTLHTMGCDNTHVADSRTLAFEETFWLTTEGRGV DVVLNSLAGEFTDASLRLLPRGGRFIEMGKTEFGTPRSLPRTILGWPTGL ST >MT3899 oxidoreductase, short-chain dehydrogenase/reductase family MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPDDPRRED AAAAMKQAGARSVELIDFDALDTDSHPKMIEAAFSGGDVDVAIVAFGLLG DAEELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAG ERVRRANFVYGSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAH LKEAPLTVDKEYVANLAVTASAKGKELVWAPAAFRYVMMVLRHIPRSIFR KLPI >MT1931 oxidoreductase, short-chain dehydrogenase/reductase family MKAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQLGAERL WARAVDVTDKAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYE AAVRVVDVNFKAVLTGAYAALPYLKKAPGSLMFSTSSSSGTYGMPRIAVY SATKHAVKGLTEALSVEWQRHGVRVADVLPGLIDTAILTSTRQHSDEGPY TISAEQIRAAAPKKGMFRLMPSSSVAEAAWRAYQHPTRLHWYVPRSIRWI DRLKGVSPEFVRRHIAKSLATLEPKRK >MT0098 methyltransferase, putative MDQPWNANIHYDALLDAMVPLGTQCVLDVGCGDGLLAARLARRIPYVTAV DIDAPVLRRAQTRFANAPIRWLHADIMTAELPNAGFDAVVSNAALHHIED TRTALSRLGGLVTPGGTLAVVTFVTPSLRNGLWHLTSWVACGMANRVKGK WEHSAPIKWPPPQTLHELRSHVRALLPGACIRRLLYGRVLVTWRAPV >MT0175 substrate--CoA ligase MMQPDAPALRFVGNTMTWADLRRRVAALAGALSGRGVGFGDRVMILMLNR TEFVESVLAANMIGAIAVPLNFRLTPTEIAVLVEDCVAHVMLTEAALAPV AIGVRNIQPLLSVIVVAGGSSQDSVFGYEDLLNEAGDVHEPVDIPNDSPA LIMYTSGTTGRPKGAVLTHANLTGQAMTALYTSGANINSDVGFVGVPLFH IAGIGNMLTGLLLGLPTVIYPLGAFDPGQLLDVLEAEKVTGIFLVPAQWQ AVCTEQQARPRDLRLRVLSWGAAPAPDALLRQMSATFPETQILAAFGQTE MSPVTCMLLGEDAIAKRGSVGRVIPTVAARVVDQNMNDVPVGEVGEIVYR APTLMSCYWNNPEATAEAFAGGWFHSGDLVRMDSDGYVWVVDRKKDMIIS GGENIYCAELENVLASHPDIAEVAVIGRADEKWGEVPIAVAAVTNDDLRI EDLGEFLTDRLARYKHPKALEIVDALPRNPAGKVLKTELRLRYGACVNVE RRSASAGFTERRENRQKL >MT0954 oxidoreductase, short-chain dehydrogenase/reductase family MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTSSELDAV AEQIRAAGRRAHTVAADLAHPEVTAQLAGQAVGAFGKLDIVVNNVGGTMP NTLLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMG RLAARGFAAYGTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVV AANDELRAPMEQATPLRRLGDPVDIAAAAVYLASPAGSFLTGKTLEVDGG LTFPNLDLPIPDL >MT3023 acyl-CoA synthase MSESSLADLLQKAASQYPNRAAYKFIDYDTDPAGFTETVTWWQVHRRAMI VAEELWIYASSGDRVAILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIH DERISSALRDSAPSIILTTSSVIDEVTTYAPHACAAQGQSAPIVVAVDAL DLSSSRALDPTRFERPSTAYLQYTSGSTRAPAGVVLSHKNVITNCVQLMS DYIGDSEKVPSTPVSWLPFYHDMGLMLGIILPMINQDTAVLMSPMAFLQR PARWMQLLAKHRAQISSAPNFGFELAVRRTSDDDMAGLDLGHVRTIVTGA ERVNVATLRRFTERFAPFNLSETAIRPSYGLAEATVYVATAGPGRAPKSV CFDYQQLSVGQAKRAENGSEGANLVSYGAPRASTVRIVDPETRMENPAGT VGEIWVQGDNVGLGYWRNPQQTEATFRARLVTPSPGTSEGPWLRTGDLGV IFEGELFITGRIKELLVVDGANHYPEDIEATIQEITGGRVVAIAVPDDRT EKLVTIIELMKRGRTDEEEKNRLRTVKREVASAISRSHRLRVADVVMVAP GSIPVTTSGKVRRSASVERYLHHEFSRLDAMA >MT1447 methyltransferase, putative MTVYTPTSERQAPATTHRQMWALGDYAAIAEELLAPLGPILVSTSGIRRG DRVLDVAAGSGNVSIPAAMAGAHVTASDLTPELLRRAQARAAAAGLELGW REANAEALPFSAGEFDAVLSTIGVMFAPRHQRTADELARVCRRGGKISTL NWTPEGFYGKLLSTIRPYRPTLPAGAPHEVWWGSEDYVSGLFRDHVSDIR TRRGSLTVDRFGCPDECRDYFKNFYGPAINAYRSIADSPECVATLDAEIT ELCREYLCDGVMQWEYLIFTARKC >MT2127 conserved hypothetical protein MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSHRILWPD REYRPDLDILIAGCGTNQAAIFAFTNRAAKVVAIDISRPALDHQQYLKDK HGLANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLR RDGVVAAMLYGKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTY HPLRNYLTKARDLLSDSALVDTFLHGRQRSYTVEECVDLVTSAGLVFQGW FHKAPYYPHDFFVPNSEFYAAVNTLPEVKAWSVMERLKTLNATHLFMACR RDRPKEQYTIDFSTVAALDYVPLMRTRCGVSGTDMFWPGWRMAPSPAQLA FLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQSLWRLDFVA VALPASG >MT3429 hypothetical protein MGCFCVCSAQVQEVAKNSLRGVPESVVMSYSYFVELPRLEDIEPGAHTDV LIANSRVDQGRIRAAVEAVFDAHPALGTVFEPRVDTLTSRPGGGGWGWGV EPPGAAVAEVIARHSASFDMYTGRLFAVSLLPGSPDRLVLTASRLCVDDA SWQTVVEDLVRQYDESVLVPAR >MT0181 virulence factor mce family protein MSTIFDIRNLRLPQLSRASVVIGSLVVVLALAAGIVGVRLYQKLTNNTVV AYFTQANALYVGDKVQIMGLPVGSIDKIEPAGDKMKVTFHYQNKYKVPAN ASAVILNPTLVASRNIQLEPPYRGGPVLADNAVIPVERTQVPTEWDELRD SVSHIIDELGPTPEQPKGPFGEVIEAFADGLAGKGKQINTTLNSLSQALN ALNEGRGDFFAVVRSLALFVNALHQDDQQFVALNKNLAEFTDRLTHSDAD LSNAIQQFDSLLAVARPFFAKNREVLTHDVNNLATVTTTLLQPDPLDGLE TVLHIFPTLAANINQLYHPTHGGVVSLSAFTNFANPMEFICSSIQAGSRL GYQESAELCAQYLAPVLDAIKFNYFPFGLNVASTASTLPKEIAYSEPRLQ PPNGYKDTTVPGIWVPDTPLSHRNTQPGWVVAPGMQGVQVGPITQGLLTP ESLAELMGGPDIAPPSSGLQTPPGPPNAYDEYPVLPPIGLQAPQVPIPPP PPGPDVIPGPVPPTPAPVGAPLPAEAGGGQ >MT0417 acyl-CoA synthase MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSRVTAVSA YLISYGRHADRRRTAAISAPQGLDYVAGFLGALCAGWTPVPLPEPLGSLR DKRTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEP SGDNCDLDSQLSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYF RHEGGAPRLPSSVVSWLPLYHDMGLMVGLFIPLFVGCPVILTSPEAFIRK PARWMQLLAKHQAPFSAAPNFAFDLAVAKTSEEDMAGLDLGHVNTIINGA EQVQPNTITKFLRRFRPYNLMPAAVKPSYGMAEAVVYLATTKAGSPPTST EFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDPDSNIELGP GRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGD LGFIVGDEFYIVGRMKDLIIQDGVNHYPDDIETTVKEFTGGRVAAFSVSD DGVEHLVIAAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLL VPPGALPKTTSGKISRAACAKQYGANKLQRVATFP >MT0700 hydrolase/esterase, putative MLRRVAILLAAVLAFAGCSGGTRLAAGFGNGNSVHTLDVDGAGRSYRLYK PVGLPSSAPLVVMLHGGFGSAKQAERSYGWDELADSEKFLVAYPDGYHRA WNANGGGCCGRPAREGVDDIGFVRAVVADIANNVSIDPARVYVTGMSNGA IMSYTLACNTSIFAAIGVVSGTQLDPCQSPRPVSVIHIHGTADPLVRYHG GPGAGFARIDGPPVPDLNAFWREVNRCGALDTTTEGPVTTSGATCADNRR VVLLTVDDAGHRWPSFATQTLWRFFAAHFR >MT0316 oxidoreductase, short-chain dehydrogenase/reductase family MNTGTAVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRAAMEELG EPNRCSVLEVDLASVRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGIAF TDDGVEMTFGVNHLGHFALVTGILDWLARPARIVVVSSGTHDPSKHTGMP DPRYTCAADLAHPPTDQNTPAEGRRRYTTSKLCNVLFTYELDRRLDHGEQ GVMVNAFDPGLMPGSGLARDYPPILRLAYRLLSPMLRVLPFVHSTRVSGE HLAALAVDPRFAGVTGQYFAGAKAIRSSAESYDRAKALDLWETSERLLAQ VT >MT0594 P450 heme-thiolate protein MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFTLHVAGF GHMVYLSDPAAIKTVFAGNPSVFHAGEANSMLAGLLGDSSLLLIDDDVHR DRRRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITL EVILRTVIGASDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSR LRRRIEEADALLYAEIADRRADPDLAARTDTLAMLVRAADEDGRTMTERE LRDQLITLLVAGHDTTATGLSWALERLTRHPVTLAKAVQAADASAAGDPA GDEYLDAVAKETLRIRPVVYDVGRVLTEAVEVAGYRLPAGVMVVPAIGLV HASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGATFAMVEMRV VLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQA TAQGAGCPAARGGGPSRAVGSQ >MT0790 P450 heme-thiolate protein MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNFWAVSRH HDVLQGFRDSTALSNAYGVSLDPSSRTSEAYRVMSMLAMDDPAHLRMRTL VSKGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVIS ELIGVPDTDRARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIA EFRRRPANNLTSALLAAELDGDRLSDQEIMAFLFLMVIAGNETTTKLLAN AVYWAAHHPGQLARVFADHSRIPMWVEETLRYDTSSQILARTVAHDLTLY DTTIPEGEVLLLLPGSANRDDRVFDDPDDYRIGREIGCKLVSFGSGAHFC LGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGFAHLPISVQ AR >MT2270 oxidoreductase, short-chain dehydrogenase/reductase family MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVLWDGVVP LLAERFRIVRYDNRGVGRSSVPKPISAYTMAHFADDFDAVIGELSPGEPV HVLAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRP WRPRTFLRAISQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIP VDQIHHSETLARDAAHSVKTYPANYFRSFSSSRRGRAIPIVDVPVQLIVN SQDPYVRPYGYDQTARWVPRLWRRDIKAGHFSPMSHPQVMAAAVHDFADL ADGKQPSRALLRAQVGRPRGYFGDTLVSVTGAGSGIGRETALAFAREGAE IVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFAERVSAEHG VPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVER GTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLT TICPGVIDTNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVAD AIVSAVKKKKPIRPVAPEAYALYGISRVLPQALRSTARLRVI >MT2187 oxidoreductase, short-chain dehydrogenase/reductase family MPCSGWTCSRRGGTFSAMTSLQGKVVFITGAARGIGAEVARRLHNKGAKL VLTDLSKSELAVMGAELGGDDRLLTVVADVRDLPAMQAAAETAVERFGGI DVVVANAGIASYGSVLKVDPQAFRRVLDVNLLGNFHTVRATLPALIDRRG YVLIVSSLAAFAAPPGMAPYNMSKAGNEHFANALRLEVAHLGVSVGSAHM SWIDTALVRDTKADLPAFAELLARLPWPLNKTTSVNKCAAAFVNGIEGRK DRVYCPGWVALFRWLKPLLSTRVGQRPIRNTVAKLMPQMDAEVAALGRFA SAYTESLENS >MT3170 oxidoreductase, short-chain dehydrogenase/reductase family MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLAKTVRLA QALGAQVKSDRLDVAEREAVLAHADAVVAHFGTVHQVYNNAGIAYNGNVD KSEFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVP GQSAYNAAKFAVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATV ADGEDQQTFAEFFDRRLALHSPEMAAKTIVNGVAKGQARVVVGLEAKAVD VLARIMGSSYQRLVAAGVAKFFPWAK >MT1976 acyl-CoA synthase MNDGSRQELRVRSGLLQIEDCLDADGGIALPAGTTLISLIERNIKYVGDL VAYRYLDHARSAAGCALEVTWTQFGMRLAAIGAHVQRFAGPGDRVAILAP QGIDYVCGFYAAIKAGTVAVPLFAPELPGHAERLDTALRDSEPAVILTTA AAKNAVEGFLNNVPRLRKPTVLVIDQIPDREGELFVPVELDIDAVSHLQY TSGSTRPPVGVEITHRAVGTNLVQMILSIDLLNRNTHGVSWLPLYHDMGL SMIGFPAVYGGHSTLMSPTAFVRRPLRWIQALSEGSRTGRVVTAAPNFAY EWAAQRGLPAQGDDVDLSNVVLIIGSEPVSIDAVTTFNKAFAPYGLPRTA FKPSYGIAEATLLVATIDHAAEPTVVYLDPEQLGAGHATRVAPDAPNAVV HVSCGHVARSLWAVIVDPDTGPEAGAELPDGEIGEVWLQGDNVARGYWGR PEETRMTFGARLQSPLAEGSHADGSAIDDTWLRTGDLGVYLDGELYITGR IADLLTIDGRNHYPQDIEATAAEASPMVRRGYITAFTVPASDGDDRNQRL VIIAERAAGTSRSDPRPALDAIRAAVCNRHGLSVADLSFLPAGAIPRTTS GKLARQACRAQYLSGRLGVH >MT0127 coenzyme A synthetase, putative MLIVPNPHTEHMEGAFAMASDFGPRIADLVEVAATRLPEAPALVVTADRI AISHRDLARLVDELAGQLTRSGLLPGDRVALRMGSNAEFVVALLAASRAD LVVVPLDPALPITEQRVRSQAAGARVVLIDADGPHDRAEPTTRWWPLTVN VGGDSGPSGGTLSVHLDAATEPNPATSTPEGLRPDDAMIMFTGGTTGLPK MVPWTHANIASSVRAIITGYRLSPRDATVAVMPLYHGHGLIASLLATLAS GGAVSLPARGRFSAHTFWDDIKAVGATWYTAVPTIHQILLERSATEPSGR KPAALRFIRSCSAPLTAQAALALQTEFAAPVVCAFGMTEATHQVTTTQIE GIDQTETPVVSTGLVGRSTGAQIRIVGSDGLPLPAGAVGEIWLRGTTVVR GYLGDPTITAANFTDGWLRTGDLGSLSAAGDLSIRGRIKELINRGGEKIS PERVEGVLASHPNVMEAAVFGVPHQLYGEAVAAVIVPRESAPPTREELVQ FCRERLAAFEIPASFQEASGLPHTAKGSLDRRAVAERFGHSV >MT3321 oxidoreductase, short-chain dehydrogenase/reductase family MSLNGKTMFISGASRGIGLAIAKRAARDGANIALIAKTAEPHPKLPGTVF TAAKELEEAGGQALPIVGDIRDPDAVASAVATTVEQFGGIDICVNNASAI NLGSITEVPMKRFDLMNGIQVRGTYAVSQACIPHMKGRENPHILTLSPPI LLEKKWLRPTAYMMAKYGMTLCALGIAEEMRADGIASNTLWPRTMVATAA VQNLLGGDEAMARSRKPEVYADAAYVIVNKPATEYTGKTLLCEDVLVESG VTDLSVYDCVPGATLGVDLWVEDANPPGYLPA >MT0802 P450 heme-thiolate protein MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEHTPDGEG FWSVATYAETLEVLRDPVTYSSVTGGQRRFGGTVLQDLPVAGQVLNMMDD PRHTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIA AELPMQMICILLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGS RLYTYALELIAGKRAEPADDMLSVVANATIDDPDAPALSDAELYLFFHLL FSAGAETTRNSIAGGLLALAENPDQLQTLRSDFELLPTAIEEIVRWTSPS PSKRRTASRAVSLGGQPIEAGQKVVVWEGSANRDPSVFDRADEFDITRKP NPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEPAEWTRSNR HTGIRHLVVELRGG >MT3602 virulence factor mce family protein MAGSGVPSHRSMVIKVSVFAVVMLLVAAGLVVVFGDFRFGPTTVYHATFT DASRLKAGQKVRIAGVPVGSVKAVKLNPDHSIDVAFAIDRSYTLYSSTRA VIRYENLVGDRFLEITSGPGELRKLPPGGTINVAHTQPALDLDALLGGLR PVLKGFDADKINTITSAVIELLQGQGGPLANVLADTGAFSAALGARDQLI GEVITNLNAVLATVDAKSAQFSASVDQLQQLVSGLAKNRDPIAGAISPLA STTTDLTELLRNSRRPLQGILENARPLATELDNRKAEVNNDIEQLGEDYL RLSALGSYGAFFNIYFCSVTIKINGPAGSDILLPIGGQPDPSKGRCAFAK >MT1574 methyltransferase, putative MGSAGVPAADAGGRDAASEQIARWTQTCTVVLVCGHGPAKWAFRSWCTSR SCDTLPVALRYRLQSNPLVGKLTTKYFLPLGTRQVGDHVVFFNFGYEEDP PMALPLSESDEPNRYCIQLYHQTASQVDLTGKEVLEVSCGAGGGASYIAR NLGPASYTGLDLNPASIDLCRAKHRLPGLQFVQGDAQNLPFPDESFDAVV NVEASHQYPDFRGFLAEVARVLRPGGHFLYTDSRRNPVVAEWEAALADAP LRTISQRDIGAQAKRGLDANTARSQEAIGRRAPVLLAGLTRCAVRVLDWD LRRGGGFSYRIYLFAKD >MT3619 P450 heme-thiolate protein MRANQPVFRDRNGLAAASTYQAVIDAERQPELFSNAGGIRPDQPALPMMI DMDDPAHLLRRKLVNAGFTRKRVKDKEASIAALCDTLIDAVCERGECDFV RDLAAPLPMAVIGDMLGVRPEQRDMFLRWSDDLVTFLSSHVSQEDFQITM DAFAAYNDFTRATIAARRADPTDDLVSVLVSSEVDGERLSDDELVMETLL ILIGGDETTRHTLSGGTEQLLRNRDQWDLLQRDPSLLPGAIEEMLRWTAP VKNMCRVLTADTEFHGTALCAGEKMMLLFESANFDEAVFCEPEKFDVQRN PNSHLAFGFGTHFCLGNQLARLELSLMTERVLRRLPDLRLVADDSVLPLR PANFVSGLESMPVVFTPSPPLG >MT2999 acyl-CoA synthase MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWSQVYSRA CIIAEELKLCGLPGDRVAVLAPQGLEYVLAFLGALQAGFIAVPLSTPQYG IHDDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLD LDSPRQMPAFSRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYG YFGDPAKIPTGTVVSWLPLYHDMGLILGICAPLVARRRAMLMSPMSFLRR PARWMQLLATSGRCFSAAPNFAFELAVRRTSDQDMAGLDLRDVVGIVSGS ERIHVATVRRFIERFAPYNLSPTAIRPSYGLAEATLYVAAPEAGAAPKTV RFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPETMVENPPG VVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLG VISDGELFIMGRIKDLLIVDGRNHYPDDIEATIQEITGGRAAAIAVPDDI TEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVS PGSIPITTSGKIRRSACVERYRSDGFKRLDVAV >MT2017 conserved hypothetical protein MTAAKALVSEWNRMGSQMRFFVGTLAGIPDALMHYRGELLRVIAQMGLGT GVLAVIGGTVAIVGFLAMTTGAIVAVQGYNQFASVGVEALTGFASAFFNT REIQPGTVMVALAATVGAGTTAALGAMRINEEIDALEVIGIRSISYLAST RVLAGVVVAVPLFCVGLMTAYLAARVGTTAIYGQGSGVYDHYFNTFLRPT DVLWSSVEVVVVALMIMLVCTYYGYAAHGGPAGVGEAVGRAVRASMVVAS IAILVMTLAIYGQSPNFHLAT >MT3932 conserved hypothetical protein MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPPSYVQAR QIRSFSEQAARGLDHSRLLIASVEVFGHCDLRAMTYVINAHLRRHDTYRS WFELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWD CFSFGVIQRADSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPI GLSEAGSYVDFCVRQHEYTSALTVDSPEVRAWIDFAEINNGTFPEFPLPL GDPSVRCGGDLLSMMLMDEQQTQRFESACMAANARFIGGILACIAIAIHE LTGADTYFGITPKDIRTPADLMTQGWFTGQIPVTVPVAGLSFNEIARIAQ TSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVGPLSAVTKL FEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRA IRSVCMRIANGGTAERVGNVVALSPGRRNNIERMTWRSCRAGDFIDICNL KVANVTVDREA >MT0109 pp-binding family protein MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQ SELPSRLAANPSIAGWLRELEAVCTEFG >MT0372 conserved hypothetical protein MHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQDLAVAADPGDA VWDEAAERAAALVELLSPFEADEGKAPAGRTPGLPGMGSLLLPPWTVTRY GTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRPISRTA FLHVDYRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLAEGNGL MVRLLPGQP >MT0330 muconolactone isomerase, putative MEFLVTMTTRVPDSMPADAVERVRAREAARSRELAAQGKLLRLWRPPLRP GEWRTLGLFAADDNGELEQLLASMPPRSWRTDDVTPLGAHPNDPVGQGIT IAPGKGPEFLIATTIMVPPGTPAQVVDDTVAREARRAPELAGRGHLVRLW ALPDGPDGQRTLGLWRARDPGELMAILESLPLAGWMTIETTPLSPHPDDP IRMP >MT2030 hypothetical protein MRTRDVERGRAAMGEANIREQAIATMPRGGPDASWLDRRFQTDALEYLDR DDVPDEVKQKIIGVLDRVGTLTNLHEKYARIALKLVSDIPNPRILELGAG HGKLSAKILELHPTATVTISDLDPTSVANIAAGELGTHPRARTQVIDATA IDGHDHSYDLAVFALAFHHLPPTVACKAIAEATRVGKRFLIIDLKRQKPL SFTLSSVLLLPLHLLLLPWSSMRSSMHDGFISALRAYSPSALQTLARAAD PGMQVEILPAPTRLFPPSLAVVFSRSSSAPTESSECSADRQPGE >MT3143 oxidoreductase, short-chain dehydrogenase/reductase family MLQRGAGQYFAGKRCFVTGAASGIGRATALRLAAQGAELYLTDRDRDGLA QTVCDARALGAQVPEHRVLDVSDYQDVAAFAADIHARHPSMDVVLNIAGV SAWGTVDQLTHDQWSRMVAINLMGPIHVIETLVPPMVAAGRGGHLVNVSS AAGLVGLPWHAAYSASKYGLRGLSEVLRFDLARHGIGVSVVVPGAVKTPL VNTVEIAGVDRDDPRVNRWVERFSGHAVTPEKAADKILAGVTRNRYLVYT SADIRALYAFKRYAWWPYTLVMRRVNVFFTRALRPGP >MT0756 conserved hypothetical protein MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVNDQFAEPL VRAVGVDFFVRMASGELDPDELAEDEANGLRRFADAMAIRTHYFDNFFLD ATRAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGL GAAPTTDRRTVAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQ DRLLDQITAQSVPGSQFATEVLRDINRLNEEELRGRMRRLAERFRRHGLD LDMSGLVYFGDRTDARTYLADHGWRTASASTTDLLAEHGLPPIDGDDAPF GEVIYVSAELKQKHQDTR >MT1230 substrate--CoA ligase MLLASLNPAVVSAADIADAVRIDGDVLSRSDLVGAATSVAERVAGAHRVA VLATPTASTVLAITGCLIAGVPVVPVPADVGVTERRHMLTDSGVQAWLGP LPDDPAGLPHIPVRTHARSWHRYPEPSPGAIAMVVYTSGTTGPPKGVQLS RRAIAADLDALAEAWQWTAEDVLVHGLPLYHVHGLVLGLLGSLRFGNRFV HTGKPTPAGYAQACYEAHGTLFFGVPTVWSRVAADQAAAGALKPARLLVS GSAALPVPVFDKLVQLTGHRPVERYGASESLITLSTRADGERRPGWVGLP LAGVQTRLVDDDGGEVPHDGETVGKLQVRGPTLFDGYLNQPDATAAAFDA DSWYRTGDVAVVDGSGMHRIVGRESVDLIKSGGYRVGAGEIETVLLGHPD VAEAAVVGVPDDDLGQRIVAYVVGSANVDADGLINFVAQQLSVHKRPREV RIVDALPRNALGKVLKKQLLSEG >MT3026 methyltransferase, putative MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINWAYEEDP PMDLPLEASDEPNRAHINLYHRTATQVDLGGKQVLEVSCGHGGGASYLTR TLHPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVL NVEASHCYPHFRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATP LRQLSQRQINAEVLRGIGNNSQKSRDLVDRHLPAFLRFAGREFIGVQGTQ LSRYLEGGELSYRMYCFTKD >MT1546 methyltransferase MLDVGCGSGRMALPLTGYLNSEGRYAGFDISQKAIAWCQEHITSAHPNFQ FEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPPDVEH YLDEISRVLKPGGRCLSTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIH KKRPEEAIGLPETFVRDVYGKFGLAVHEPLHYGSWSGREPHLSFQDIVIA TKTAS >MT3075 P49 protein MDVTVVGSGPNGLATAVICARAGLNVQVVEAQATFGGGARSAADFEFPEV LHDVCSAVHPLALASPFFAEFDLPARGVTLTVPDIAYANPLPGRPAAIAY HDLAHTCAKLDDGASWRRLLGPLVAHSETVVEFMLSDKRSLPTALGSVLR LGLRMLAQGTPAWRSLAGEDARALFTGVAAHAISPLPSLVSAGAGLMLAT LAHSVGWPIPVGGTQAIADALIADLRAHGGRLAAGVEITEPQRSVVVFDT APTALLRVYRDKLPHRYAKALRRYRFRAGIAKVDFVLSDEIPWSDPRLRR AATLHLGGTRDQMARAEADVAAGRHADWPMVLAACPHVADPGRIDETGRR PFWTYAHVPSGSTLDATETVTSVLERFAPGFRDIVVAARAVPAARMADHN ANYVGGDITVGANSTWRAIAGPTPRLNPWRTPIPKVYLCSAATPPGAGVH GMCGWYAARTLLRTEFGITRMPPLGHELRP >MT2319 methyltransferase-related protein MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLPPATSME IAEAAGLEERYVREWLGGMTTGQIVEYDAGSSTYSLPAHRAGMLTRAAGP DNLAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVF DAALIDVVLPLVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTG IDFSDEAVAAGTEEAARLGLANATFERHDLAELDKVGAYDVITVFDAIHD QAQPARVLQNIYRALRPGGVLLMVDIKASSQLEDNVGVPLSTYLYTTSLM HCMTVSLALDGAGLGTVWGRQLATSMLADAGFTDVTVAEIESDVLNNYYI ARK >MT1580 acyl-CoA synthase MVASSIPTALRERASVHPNGAAITYIDYEQDWAGVAETLTWSQLYRRMLN VAEPLRHVGATGDRAVILAPQGIEYVVGFLGALQAGRIAVPLPVPHAGAH DERTISVLSDTSPAVILTTSGAVDDVRECAQPQPGQSAPSIVELDLLDLD SRQRSRSPGARPTGRDTPETAYLQYTSGSTRTPAGVMVSNKNVFANFEQI VADFFAPEGGVVPPDLTVVSWLPLYHDMGLLLGAIMPILAGVPTVLTSPV GFLQRPARWIQLLARNGRTISAGPNFAFELAVRKTSDDDMDGLDLAGVHT ILNGSERVHPATLKRFAERFGRFNFAAAALRPAYGMAEATVYIATRNVNE PPEIVDFESEKLPAGQAIRCPSGSGTPLVSYGVPRSQLVRIVDPDTCIEC PQGSVGEIWVQGGNVASGYWHKPEESKRTFGARIVTPSAGTPEAPWLRTG DSGFVSGGELFIIGRIKDLLIVYGRNHAPDDIEATIQEITSGRCAAIAVP DHGTEKLVAIIELKKRGDSDEDVADRLRIVKRDVAAAIFDSHGLSVADLV LVSPGSIPITTSGKIRRAQCVQLYRRREFTRLDA >MT2021 virulence factor mce family protein MTTKLRRARSVLATALVLVAGVILAMRTADAAARTTVVAYFDNSNGVFAG DDVLIRGVPVGKIVKIEPQPLRAKISFWFDRKYRVPADAAAAILSPQLVT GRAIQLTPPYAGGPTMADGTVIPQERTVVPVEWDDLRAQLQRLTALLQPT RPGGVSTLGALINTAADNLRGQGATIRDTIIKLSQAISALGDHSKDIFST VTNLSTLVTALHDSADLLERLNHNLAAVTSLLADGPDKIGQAAEDLNAVV ADVGSFAAEHREAIGTASDKLASITTALVDSLDDIKQTLHISPTVLQNFN NIFEPANGALTGALAGNNMANPIAFLCGAIQAASRLGGEQAAKLCVQYLA PIVKNRQYNYPPLGANLFVGAQARPNEVTYSEDWLRPDYVAPVADTPPDP AAAVTVDPATGLRGMMMPPGGGS >MT3028 conserved hypothetical protein MLEVGAGIGDHTQFFLDRGCKVLCTEPRGENLDVIRQRFGSNPNVTVDHL DLDGDLPAEAHQYDVVYCYGVLYHLSRPAEALAWMCDRAVDLLLLETCVS YSGEDEPFLVSERASSPSQAITGTGCRPSRVWVMNRLREKMPHVYVTATQ PRHRQFPLDWRANGPIASTGLARAVFVASRAPLNLPTLVEELPMVQRRC >MT3604 conserved hypothetical protein MSYDVTIRFRRFFSRLQRPVDNFGEQALFYGETMRYVPNAITRYRKETVR LVAEMTLGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGDIGIEALT GFLSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAV HSVSYLVSTRLIAGLVAIIPLYSLSVLAAFFAARFTTVFVNGQSAGLYDH YFNTFLIPSDLLWSFMQAIAMSIAVMLVHTYYGYNASGGSVGVGVAVGQA VRTSLIVVVVITLFISLAVYGASGNFNLSG >MT3352 conserved hypothetical protein MTGRVGNPKDHAVVIGASIAGLCAARVLSDFYSTVTVFERDELPEAPANR ATVPQDRHLHMLMARGAQEFDSLFPGLLHDMVAAGVPMLENRPDCIYLGA AGHVLGTGHTLRKEFTAYVPSRPHLEWQLRRRVLQLSNVQIVRRLVTEPQ FERRQQRVVGVLLDSPGSGQDREREEFIAADLVVDAAGRGTRLPVWLTQW GYRRPAEDTVDIGISYASHQFRIPDGLIAEKVVVAGASHDQSLGLGMLCY EDGTWVLTTFGVADAKPPPTFDEMRALADKLLPARFTAALAQAQPIGCPA FHAFPASRWRRYDKLERFPRGIVPFGDAVASFNPTFGQGMTMTSLQAGHL RRALKARNSAMKGDLAAELNRATAKTTYPVWMMNAIGDISFHHATAEPLP RWWRPAGSLFDQFLGAAETDPVLAEWFLRRFSLLDSLYMVPSVPIIGRAI AHNLRLWLKEQRERRQPVTTRRSP >MT0938 dioxygenase, putative MDITIVGKYLSTLPEDDDHPYRTGPWRPQTTEWDADDLTTVTGEVPADLD GIYLRNTENPLHPAFATYHPFDGDGMIHVVGFRDGKAFYRNRFIRTDGFL AENEAGGPLWPGLAEPVQLAKREHGWGARGLMKDASSTDVIVHRGIALTS FYQCGDLYRIDPYSANTLGKESWHGRFPFDWGVSAHPKVDNKTGELLFFN YSKQEPYMRYGVVDQNNELVHYVDVPLPGPRLPHDMAFTENYVILNDFPL FWDPRLLERDVHLPRFYPEIPSRFAVVARRGNDIRWFEADPTFVLHFTNA YEQGDEIVLDGFYEGDPQPLDTGGTKWEKLFRFLALDRLQSRLHRWRLNM VTGAVHEEQLSESITEFGTINADYAASSYRYTYAATGKPSWFLFDGLVKH DLLTGNHECYSFGDGVYGSETAMAPRVGSSAEDDGYLVTLTTDMNDDASY CLVFDAARPGDGPICKLALPERISSGTHSAWVPGAELRRWDHAESPAAAV GL >MT2059 conserved hypothetical protein MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADCCWNQLA VTPDTRMPASSAAGRDAAAYDAWYDSPTGRPILATEVAALRPLIEVFAQP RLEIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVS RHFGAVLMAFTLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLY ALRAARGQPGYRDARFYTAAELEQLLADSGFRVIARRCTLHQPPGLARYD IEAAHDGIQAGAGFVAISAVDQAHEPKDDHPLESE >MT0542 methyltransferase-related protein MGGCSITCLNISEVPNETNRKKNRQAGLDRSIRVIHGSFDDIPEPDSGYD VVWSQDAILHAPDRRKVLEEAFRVLRPGGELIFTDPMQADDVPDGVLQPV YDRLNLRDLGSMRFYA >MT1295 P450 heme-thiolate protein MTSVMSHEFQLATAETWPNPWPMYRALRDHDPVHHVVPPQRPEYDYYVLS RHADVWSAARDHQTFSSAQGLTVNYGELEMIGLHDTPPMVMQDPPVHTEF RKLVSRGFTPRQVETVEPTVRKFVVERLEKLRANGGGDIVTELFKPLPSM VVAHYLGVPEEDWTQFDGWTQAIVAANAVDGATTGALDAVGSMMAYFTGL IERRRTEPADDAISHLVAAGVGADGDTAGTLSILAFTFTMVTGGNDTVTG MLGGSMPLLHRRPDQRRLLLDDPEGIPDAVEELLRLTSPVQGLARTTTRD VTIGDTTIPAGRRVLLLYGSANRDERQYGPDAAELDVTRCPRNILTFSHG AHHCLGAAAARMQCRVALTELLARCPDFEVAESRIVWSGGSYVRRPLSVP FRVTS >MT3071 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase, putative MTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNYADHIAE MGGRPPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACK DVPAAQAVDNILGYTIGNDVSARDQQQSDGQWTRAKGHDTFCPVGPWIVT DLAPFDPADLELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGD LILTGTPAGVGPIEDGDTVSITIEGIGTLTNPVVRKGKP >MT0344 MitM-related protein MRLTHPARRYLSSQAARPTGAFGRLLGRIWRAETADVNRIAVELLAPGPG ERVCEIGFGPGRTLGLLAAAGAQVSGVEVSTTMIAIAAHHNAKAIAAGLI SLYHGDGVTLPVADHSLDKVLGVHNFYFWPDPRASLCDIARALRPGGRLV LTSISDDQPLAARFDPAIYRVPPTLDTAAWLGAAGFIDVGIKRSADHPAT VWFTATAT >MT3397 esterase, putative MPWARMLSLIVLMVCLAGCGGDQLLARHASSVATFQFGGLTRSYRLHVPP AEPSGLVISLHGGGGTGAGQEALTDFDAVADAADLLVVYPDGYDKSWADG RGASPADRRHLDDVGFLVALAAKLVHDFDIAPGHVFATGMSNGGFMSNRL ACDRADIFAAVAPVAGTLGVGVTCNPSRPVSVLEAHGTADPLVPFNGGAV RGRGGLSHSISVASLVDRWRAVDGCQGDPSAAELPDVGDGTMVHLFDSSS CAAGTEVISYQIDNGGHTWPGGRQYLPKAVIGATTRAFDGSQVIAQFFAT HGRD >MT1375 conserved hypothetical protein MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGAVDCRGG APGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVA MDSGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVG VGARAGALKGGVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADL VGEFALRAPPAEQIAALAQLSSPLGAFNTPFNTTIGVIACDAALSPAACR RIAIAAHDGLARTIRPAHTPLDGDTVFALATGAVAVPPEAGVPAALSPET QLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTYRDMFPGAFGS >MT3021.1 polyketide synthase MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQRATEPVA VVGIGCRFPGGVDGPDGLWDVVSAGRDVVSEFPTDRGWDVEGLYDPDPDA EGKTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEH AGIDPLSLRGSATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVS YVLGLQGPAVSVDTACSSSLVAIHWAMSSLRSGECDLALAGGVTVMGLPS IFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVVVLERLSDARRLGHS VLAVVRGSAVNQDGASNGLTAPNGLAQQRVIQAALANAGLSAADVDVVEA HGTATTLGDPIEAQALLSTYGQGGPAEQPLWVGSIKSNMGHTQAAAGVAG VIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRR AAVSSFGISGTNAHLILEEAPVPAPAEAPVEASESTGGRGRRWCRG >MT3203 P450 heme-thiolate protein MTSTSIPTFPFDRPVPTEPSPMLSELRNSCPVAPIELPSGHTAWLVTRFD DVKGVLSDKRFSCRAAAHPSSPPFVPFVQLCPSLLSIDGPQHTAARRLLA QGLNPGFIARMRPVVQQIVDNALDDLAAAEPPVDFQEIVSVPIGEQLMAK LLGVEPETVHELAAHVDAAMSVCEIGDEEVSRRWSALCTMVIDILHRKLA EPGDDLLSTIAQANRQQSTMTDEQVVGMLLTVVIGGVDTPIAVITNGLAS LLHHRDQYERLVEDPGRVARAVEEIVRFNPATEIEHLRVVTEDVVIAGTA LSAGSPAFTSITSANRDSDQFLDPDEFDVERNPNEHIAFGYGPHACPASA YSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT >MT0861 methyltransferase, UbiE/COQ5 family MNDKRRAIYTHGYHESVLRSHRRRTAENSAGYLLPYLVPGLSVLDVGCGP GTITVDLAARVVPGSVTGVEPTDDALSLARAEAQLHRLSNISFTTSDVHK LDFPDDAFDVVHAHQVLQHVADPVRALQEMRRVCTPGGIVAARDADYSGF IWFPKLPALDRWLDLYERAARANGGEPDAGRRLLSWARAAGFDDVTPTAS VWCFATASAREWWGLVWADRILQSDLAHQLVDSGLATAAQLEEISTAWRE WAAAPDGWLAIPHGEILCRA >MT1706 P450 heme-thiolate protein MRTYRTVRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFAN ADAFSWSQTFESLVPVDGPTALIVSDGADHRRRRSVVAPGLRHHHVQRYV ATMVSNIDTVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFL GEQLQPLLDLTRRPPQVMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADA RTAPRPDDHMLTTLISGCSEEGTTLSDNEIRDSIVSLITAGYETTSGALA WAIYALLTVPGTWESAASEVARVLGGRVPAADDLSALTYLNGVVHETLRL YSPGVISARRVLRDLWFDGHRIRAGRLLIFSAYVTHRLPEIWPEPTEFRP LRWDPNAADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVARAM LQLPAQRTHRIRAANFAALRPWPGLTVEIRKSAPAQ >MT1421 conserved hypothetical protein MSPSPTYTPPKLASMPGIDFDALYRGESPGEGLPPITTPPWDTKAPKDNV IGWHTGGWVHGDVLDIGCGLGDNAIYLARNGYQVTGLDISPTALTTAKRR ASDAGVDVKFAVGDATKLTGYTGAFDTVIDCGMFHCLDDDGKRSYAASVH RATRPGATLLLSCFSNAMPPDEEWPRSTVSEQTLRDVLGGAGWDIESLEP ATVRRELDGTEVEMAFWNVRAQRRGS >MT3934 acyl-CoA synthase MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQVYRRTLN VAAEVRRHAAIGDRAVILAPQGLDYIVAFLGALQAGLIAVPLSAPLGGAS DERVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLD SPIRSNIVDDSLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYF ADTGAVPPLDLFIMSWLPFYHDMGLVLGVCAPIIVGCGAVLTSPVAFLQR PARWLQLMAREGQAFSAAPNFAFELTAAKAIDDDLAGLDLGRIKTILCGS ERVHPATLKRFVDRFSRFNLREFAIRPAYGLAEATVYVATSQAGQPPEIR YFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTNTECPPGTI GEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFV SEDKFFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAIAVPSNGVE KLVAIVELNNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIP ITTSGKVRRAECVKLYRHNEFTRLDAKPLQASDL >MT1698 chalcone/stilbene synthase family protein MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHASAKVNSR HLVLPLEKYPKLTDFGEANKIFIEKAVDLGVQALAGALDESGLRPEDLDV LITATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHD YLRGAPDGVAALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGV KRAQDIGADGPDILDSRSHLYPDSLRTMGYDVGSAGFELVLSRDLAAVVE QYLGNDVTTFLASHGLSTTDVGAWVTHPGGPKIINAITETLDLSPQALEL TWRSLGEIGNLSSASVLHVLRDTIAKPPPSGSPGLMIAMGPGFCSELVLL RWH >MT0619 virulence factor mce family protein MGTPIRREHDQPMKTTGTTIKLGIVWLVLSVFTVMIIVVFGQVRFHHTTG YSAVFTHVSGLRAGQFVRAAGVEVGKVAKVTLIDGDKQVLVDFTVDRSLS LDQATTASIRYLNLIGDRYLELGRGHSGQRVAPGATIPLEHTHPALDLDA LLGGFRPLFQTLDPDKVNSIASSIITVFQGQGATINDILDQTASLTATLA DRDHAIGEVVNNLNTVLATTVKHQTEFDRTVDKLEVLITGLKNRADPLAA AAAHISSAAGTLADLLGRIVHCCTAASGTSRASSSRS >MT1633 hypothetical protein MARTFEDLVAEAASASVGGWDFSWLDGRATEERPSWGYQRQLSQRLANAT AALDLETGGGEVLAGAGNFPPTMVATEAWPPNAAMATRRLHPLGAVVVIT GDKPPLPFADAAFDLVTSRHPSTRWWTEIARVLRAGGSYFAQHVGPATLW DLREHFLGPREHNGADQYAQVVRTCITDAGLEIVDLQMERLRVEFFDVGA VIYFLRKVIWFLPDFTVEGYHDRLRALHERIQAEGPFVTYSTRALIEARK PS >MT3018 polyketide synthase MVPWVISARSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRA VVVGASREQLIAGLAGLAAGEPGAGVAVGQPGSVGKTVVVFPGQGAQRIG MGRELYGELPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEFAQ PALFAVEVASFAVLRDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLV VARGRLMQALPAGGAMVAVAASEDEVEPLLGEGVGIAAINAPESVVISGA QAAANAIADRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQARE PQLGLVSNVTGELAGPDFGSAQYWVDHVRRPVRFADSARHLQTLGATHFI EAGPGSGLTGSIEQSLAPAEAMVVSMLGKDRPELASALGAAGQVFTTGVP VQWSAVFAGSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHA LLGAVVERPDSDEVVLTGRLSLADQPWLADHVVNGVVLFPGAGFVELVIR AGDEVGCALIEELVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYSRG DQSQGWLLNAEGMLGVAAAETPMDLSVWPPEGAESVDISDGYAQLAERGY AYGPAFQGLVAIWRRGSELFAEVVAPGEAGVAVDRMGMHPAVLDAVLHAL GLAVEKTQASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCD ATGLPVLTVRSLVTRPITAEQLRAAVTAAGGASDQGPLEVVWSPISVVSG GANGSAPPAPVSWADFCAGSDGDASVVVWELESAGGQASSVVGSVYAATH TALEVLQSWLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQA ENPGRIVLIDTDAAVDASVLAGVGEPQLLVRGGTVHAPRLSPAPALLALP AAESAWRLAAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVGVNFRDV VAALGMYPGQAPPLGAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAV VDQQLVTRVPQGWSFAQAAAVPVVFLTAWYGLADLAEIKAGESVLIHAGT GGVGMAAVQLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEE KFLAVTEGRGVDVVLDSLAGEFVDASLRLLVRGGRFLEMGKTDIRDAQEI AANYPGVQYRAFDLSEAGPARMQEMLAEVRELFDTRELHRLPVTTWDVRC APAAFRFMSQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARH LVGAYGVRHLVLASRRGDRAEGAAELAADLTEAGAKVQVVACDVADRAAV AGLFAQLSREYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAAWN LHQATSDLDLSMFALCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGL AGISLAWGLWEQPGGMTAHLSSRDLARMSRSGLAPMSPAEAVELFDAALA IDHPLAVATLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDATSS KSALAQRLHGLAADEQLELLVGLVCLQAAAVLGRPSAEDVDPDTEFGDLG FDSLTAVELRNRLKTATGLTLPPTVIFDHPTPTAVAEYVAQQMSGSRPTE SGDPTSQVVEPAAAEVSVHA >MT0234 conserved hypothetical protein MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAMIGDLWL ATHSEPPVGRTLLDVGGGPGYFATAFSDAGVGYIGVEPDPDEMHAAGPAF TGRPGMFVRASGMALPFADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKP GGLVVLSYTVWLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSS LFAVSAAEGLRWAAGTGAALAVFPRYHPRWAWWLTSVPVLREFLVSNLVL VLTP >MT1793 very-long-chain acyl-CoA synthetase, putative MTDTIQSLLRQHVSDPTIAVKYGGLQWTWSQYLAESAARAAALITIADPQ RPTHIGSLLGNTPEMLAQLAAAGLGGYVLCGLNTTRRGDALAADVRRADC QIVVTDADHRALLDGLDLAGARILDTSTPRWAELVAGDGAFVPYREVDTM DPFMMIFTSGTSGNPKAVPVSHLMATFAGRSLTERFGLTEQDTCYVSMPL FHSNAVVAGWAPAVVSGAAIAPATFSATGFLDDVRRYHATYMNYVGKPLA YILATPERDDDADNPLRVAFGNEANDKDIEEFSRRFGVQVEDGFGSTENA VIVIREPGTPPGSIGRGAHGVAVYNGETVTECAVARFDAHGALTNADEAI GELVNTTGSGFFTGYYNDPEANAERMRHGMYWSGDLAYRDSEGWIYLAGR TADWMRVDGENLTAAPIERILLRYKAINRVAVYAVPDEYVGDQVMAALVL RAGDTFDPDAFEAFLDAQPDLSTKARPRYIRIAADLPSTATHKVLKRQLI DEGTAVGKADTLWVREPRGSAYHHASGPAKAI >MT0179 virulence factor mce family protein MKITGTVVKLGIVSVVLLFFTVMIIVIFGQMRFDRTNGYTAEFSNVSGLR QGQFVRASGVEIGKVKALHLVDGGRRVRVEFNIDRSVPLYQSTTAQIRYS DLIGNRYVELKRGEGKGANDLLPPGGLIPLSRTSPALDLDALIGGFKPVF RALDPAKVNNIANALITVFQGQGGTINDILDQTAQLTSQIAERDQAIGEV VKNLNIVLDTTVKHRKEFDETVNNLENLITGLRNHSDQLAGGLAHISNGA GTVADLLAENRTLVRKAVSYLDAIQQPVIDQRVELDDLLHKTPTALTALG RANGTYGDFQNFYLCDLQIKWNGFQAGGPVRTVKLFSQPTGRCTPQ >MT0080 conserved hypothetical protein MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAG SACPDMNLVDLGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRA RRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLT RSGGHCWFLGGVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPW QPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAGVDGIEHCTFL SEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRR LIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSRHGFTSTEVLTGATAAA AASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL QASAVGYNTPS >MT3122 conserved hypothetical protein MRARFGDRAPWLVETTLLRRRAAGKLGELCPNVGVSQWLFTDEALQQATA APVARHRARRLAGRVVHDATCSIGTELAALRELAVRAVGSDIDPVRLAMA RHNLAALGMEADLCRADVLHPVTRDAVVVIDPARRSNGRRRFHLADYQPG LGPLLDRYRGRDVVVKCAPGIDFEEVGRLGFEGEIEVISYRGGVREACLW SAGLAGSGIRRRASILDSGEQIGDDEPDDCGVRPAGKWIVDPDGAVVRAG LVRNYGARHGLWQLDPQIAYLSGDRLPPALRGFEVLEQLAFDERRLRQVL SALDCGAAEILVRGVAIDPDALRRRLRLRGSRPLAVVITRIGAGSLSHVT AYVCRPSR >MT0182 virulence factor mce family protein MSVLARMRVMRHRAWQGLVLLVLALLLSSCGWRGISNVAIPGGPGTGPGS YTIYVQMPDTLAINGNSRVMVADVWVGSIRAIKLKNWVATLTLSLKKDVT LPKNATAKIGQTSLLGSQHVELAAPPDPSPVPLKDGDTIPLKRSSAYPTT EQTLASIATLLRGGGLVNLEGIQQEINAIVTGRADQIRAFLGKLDTFTDE LNQQRDDITRAIDSTNRLLAYVGGRSEVLNRVLTDLPPLIKHFADKQELL INASDAVGRLSQSADQYLSAARGDLHQDLQALQCPLKELRRAAPYLVGAL KLILTQPFDVDTVPQLVRGDYMNLSLTLDLTYSAIDNAFLTGTGFSGALR ALEQSFGRDPETMIPDIRYTPNPNDAPGGPLVERGNRQC >MT3649 P450 heme-thiolate protein MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAA PIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRF KNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQER AQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEM TGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDG EKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPE TAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEV FQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH >MT1387 AMP-binding family protein MGLVGEPTVELVAAIQGAWLAGAAVSILPGPVRGANDQRWADATLTRFLG IGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVASEGPAVL QGTAGSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWLPLYHDM GLAFVLSAALAGAPLWLAPTTAFTASPFRWLSWLSDSGATMTAAPNFAYN LIGKYARRVSEVDLGALRVTLNGGEPVDCDGLTRFAEAMAPFGFDAGAVL PSYGLAESTCAVTVPVPGIGLLADRVIDGSGAHKHAVLGNPIPGMEVRIS CGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPDDWFATGDLGYLGAG GLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGTGDRSTR PGLVVAAEFRGPDEANARAELIQRVASECGIVPSDVVFVSPGSLPRTSSG KLRRLAVRRSLEMAD >MT1231 conserved hypothetical protein MAWQQPSPRIRELIREGARIALNPSPEWIEELDRATIAANPAIANDPVLA KVVQTANRANLVYWAAANLRDPGARVPANLGTEPLRMARDLVRRGLDTVA FNIYRTGEHIGWRFWMGIAFELTSDPQELRELLDVSARSVNDFIEATLTG IAAQVQSEHDELTRSTHAERLEVVGLILDGAPISPERAEAKLGYPLSRAH TAAIIWSDELDGDHSYLDRAADLFCHAVGSTRPLTVVAGAASRWAWVTDA DGLDIDTVQAAVDNAPGARIAIGTTANGVEGFRRSHLEALITQRTLSRLR STQRVAFFADVKMVALISQNPDAASEFITSTLGDLESASPDLQTALLTFI NEQCNASRAAKRLHTHRNTFLRRLESAQRLLPRPLDHTSVHVAVALEALQ WRGNKAHALSSPGRRSNSVPA >MT2835 carboxymethylenebutenolidase, putative MPKTTDTAATPDGTCAVRLFTPDGPGRWPGVVMFPDAGGVRDTFDRMAAK LAGFGYVVLLPDVYYREGDWAPFDMKTAFGDPQERARIMFMIGTLTPDRV TRDADALLNYLASRPEVIGDRFGVCGYCMGGRMSVVVAGRLPDRVAAAAA FHPGGLVANSPDSPHLLADRISATVYIGGAENDPSFTADHAEKLDKAFSA AGVPHRIECYPAAHGFAVPDNPSYDAAADERHWAAMTETFGAALN >MT0715 oxidoreductase, short-chain dehydrogenase/reductase family MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICAPVSGSV TYPPATSEDLGETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGR LDIVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAG NGGSIVVVSSSAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNS IHPYSVDTPMIEPEAMIQTFAKHPGYVHSFPPMPLQPKGFMTPDEISDVV VWLAGDGSGALSGNQIPVDKGALKY >MT3664 oxidoreductase MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGADVVISD HHERRLGETAAELSALGLGRVEHVVCDVTSTAQVDALIDSTTARMGRLDV LVNNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGG VIVNNASVLGWRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSP SIARHKFLDKTASAELLDRLAAGEAFGRAAEPWEVAATIAFLASDYSSYL TGEVISVSCQHP >MT3003 polyketide synthase MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGCRFPGGV NNPEQFWDLLCAGRSGIVRVPAQRWDADAYYCDDHTVPGTICSTEGGFLT SWQPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGT QTSVFVGVTAYDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGAR GPAVVIDTACSSSLVAVHLACQSLRGRESDMALVGGTNLLLSPGPSIACS RWGMLSPEGRCKTFDASADGYVRGEGAAVVVLKRLDDAVRDGNRILAVVR GSAVNQDGASSGVTVPNGPAQQALLAKALTSSKLTAADIDYVEAHGTGTP LGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVAGLMKAVLA VHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFG VSGTNAHVVIEQAPDPMAAAGTEPQRGPVPAVSTLVVFGKTAPRVAATAS VLADWLDGPGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRAL AAGQSAPGVVAPREGSIGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAI AELEPEFVAQGGFSLRDVIAGGKELVGIEQIQLGLIGMQLALTALWRSYG VTPDAVIGHSMGEVAAAVVAGALTPAQGLRVTAVRSRLMAPLSGQGTMAL LELDAEATEALIADYPEVSLGIYASPRQTVISGPPLLIDELIDKVRQQNG FATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLGIS LGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSIS DTLRASYDVDNYLRIGTLQRDAHDTLEFHTNLNTTHTTHPPQTPHPPEPH PVLPTTPWQHTQHWITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELD PDLLWLADHVIDDLVVLPGAAYAEIALAAATDTFAVEQDQPWMISELDLR QMLHVTPGTVLVTTLTGDEQRCQVEIRTRSGSSGWTTHATATVARAEPLA PLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGPAFQGIVGLAVTQ AGVARAQVRLPASARTGSREFMLHPVMMDIALQTLGATRTATDLAGGQDA RQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTD ANGQPLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGAL LLIGDPAAGDPLLPALQSSLRDRITDLELASAADEATLRAAISRTSWDGI VVVCPPRANDESMPDEAQLELARTRTLLVASVVETVTRMGARKSPRLWIV TRGAAQFDAGESVTLAQTGLRGIARVLTFEHSELNTTLVDIEPDGTGSLA ALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSGDLAAEARHQVVNLD SSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAAGLNFSD VLKAMGVYPGLDGAAPVIGGECVGYVTAIGDEVDGVEVGQRVIAFGPGTF GTHLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVL IHSATGGVGMAAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSV DFADEILELTDGYGVDVVLNSLAGEAIQRGVQILAPGGRFIELGKKDVYA DASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVL PVTAFSLHDAADAFRLMASGKHTGKIVISIPQHGSIEAIAAPPPLPLVSR DGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVAAAIAELNA SGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNM TDSAARRVFAPKVTGSWRLHVATAARDVDWWLTFSSAAALLGTPGQGAYA AANSWVDGLVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQG LAAMQAVLTADRGRTGVFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSG QRRGGGAIRAQLDALDAAERPGHLASAIADEIRAVLRSGDPIDHHRPLET LGLDSLMGLELRNRLEASLGITLPVALVWAYPTISDLATALCERMDYATP AAAQEISDTEPELSDEEMDLLADLVDASELEAATRGES >MT1393 oxidoreductase, short-chain dehydrogenase/reductase family MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEATEVAAKR LGGDDVALAVRCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMR TMTEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMV GQTNYSAAKAGIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRI WDQKLAEVPMGRAGEPSEVASVAVFLASDLSSYMTGTVLDVTGGRFI >MT0340 hypothetical protein MGPKGSLRLVKRQPELLVAQHEHWQDTYRAHPVLYGTRPSEPGVYAAEVF NADGVQRVLELAAGHGRDTLYFAG >MT2045.1 hypothetical protein MGRLEGKVAFITGVARGQGRSHAVRLADGQARALGKVDVEACGALVGEVE VWGRDVRDDRRVFVESPADEFGACRRVARQGIRVVGLPVSQRELVEPEAG CAARRSAAGSQ >MT3598 virulence factor mce family protein MIDRLAKIQLSIFAVITVITLSVMAIFYLRLPATFGIGTYGVSADFVAGG GLYKNANVTYRGVAVGRVESVGLNPNGVTAHMRLNSGTAIPSNVTATVRS VSAIGEQYIDLVPPENPSSTKLRNGFRIQRQNTRIGQDVADLLRQAETLL GSLGDTRLRELLHEAFIATNGAGPELARLIESARLLVDEANANYPQVSQL IDQAGPFLQAQIRAGGDIKSLADGLARFTWQLRAADPRLRDTLADAPDAI DEANTAFSGIRPSFPALAASLANLGRVGVIYHKSIEQLLVVFPALFAAII TSAGGVPQDEGAKLDFKIDLHDPPPCMTGFLPPPLVRSPADESVREIPRD MYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPRGYVPVGTNPWR GPPIPYGTEVTDGRNILPPNKFPYIPPGADPDPGVPIVGPPPPGQVAGPG PAPHQPAQPAPPPNDNGPPPPFTSWMPPGYPPEPPQVPYPATIPPPPPPE GTGPPPGPAPGPQPQASGPAYTIYDQLSGAFADPAGGTGIFAPGMTGASS AENWVDLMRDPRQL >MT2667 substrate--CoA ligase, putative MSINDQRLTRRVEDLYASDAQFAAASPNEAITQAIDQPGVALPQLIRMVM EGYADRPALGQRALRFVTDPDSGRTMVELLPRFETITYRELWARAGTLAT ALSAEPAIRPGDRVCVLGFNSVDYTTIDIALIRLGAVSVPLQTSAPVTGL RPIVTETEPTMIATSIDNLGDAVEVLAGHAPARLVVFDYHGKVDTHREAV EAARARLAGSVTIDTLAELIERGRALPATPIADSADDALALLIYTSGSTG APKGAMYRESQVMSFWRKSSGWFEPSGYPSITLNFMPMSHVGGRQVLYGT LSNGGTAYFVAKSDLSTLFEDLALVRPTELCFVPRIWDMVFAEFHSEVDR RLVDGADRAALEAQVKAELRENVLGGRFVMALTGSAPISAEMTAWVESLL ADVHLVEGYGSTEAGMVLNDGMVRRPAVIDYKLVDVPELGYFGTDQPYPR GELLVKTQTMFPGYYQRPDVTAEVFDPDGFYRTGDIMAKVGPDQFVYLDR RNNVLKLSQGEFIAVSKLEAVFGDSPLVRQIFIYGNSARAYPLAVVVPSG DALSRHGIENLKPVISESLQEVARAAGLQSYEIPRDFIIETTPFTLENGL LTGIRKLARPQLKKFYGERLERLYTELADSQSNELRELRQSGPDAPVLPT LCRAAAALLGSTAADVRPDAHFADLGGDSLSALSLANLLHEIFGVDVPVG VIVSPASDLRALADHIEAARTGVRRPSFASIHGRSATEVHASDLTLDKFI DAATLAAAPNLPAPSAQVRTVLLTGATGFLGRYLALEWLDRMDLVNGKLI CLVRARSDEEAQARLDATFDSGDPYLVRHYRELGAGRLEVLAGDKGEADL GLDRVTWQRLADTVDLIVDPAALVNHVLPYSQLFGPNAAGTAELLRLALT GKRKPYIYTSTIAVGEQIRPEAFTEDADIRAISPTRRIDDSYANGYANSK WAGEVLLREAHEQCGLPVTVFRCDMILADTSYTGQLNLPDMFTRLMLSLA ATGIAPGSFYELDAHGNRQRAHYDGLPVEFVAEAICTLGTHSPDRFVTYH VMNPYDDGIGLDEFVDWLNSPTSGSGCTIQRIADYGEWLQRFETSLRALP DRQRHASLLPLLHNYREPAKPICGSIAPTDQFRAAVQEAKIGPDKDIPHL TAAIIAKYISNLRLLGLL >MT0176 conserved hypothetical protein MTTSTTLGGYVRDQLQTPLTLVGGFFRMCVLTGKALFRWPFQWREFILQC WFIMRVGFLPTIMVSIPLTVLLIFTLNILLAQFGAADISGSGAAIGAVTQ LGPLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV LASMLVATLLNGLVITVGLVGGFLFGVYLQNVSGGAYLATLTLITGLPEV VIATIKAATFGLIAGLVGCYRGLTVRGGSKGLGTAVNETVVLCVIALFAV NVILTTIGVRFGTGR >MT0788 P450 heme-thiolate protein MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQLAGKQV VLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEML HNAALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSA CLIGKKFRDQLDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNG LVALVADIMNGRIANPPTDKSDRDMLDVLIAVKAETGTPRFSADEITGMF ISMMFAGHHTSSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHA LRQIPQLENVLKETLRLHPPLIILMRVAKGEFEVQGHRIHEGDLVAASPA ISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRHRCVGAAFA IMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTG V >MT2836 oxidoreductase, short-chain dehydrogenase/reductase family MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVVLTARRQEAADEAAA QVGDRALGVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLL EQDHARFAKIFDVNLWAPLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSP AMGMYNATKAALIHVTKQLALELSPRIRVNAICPGVVRTRLAEALWKDHE DPLAATIALGRIGEPADIASAVAFLVSDAASWITGETMIIDGGLLLGNAL GFRAAPSTEH >MT0074 oxidoreductase, short-chain dehydrogenase/reductase family MTKWTAADIPDQTGRTAVITGANTGLGFETAAALAAHGAHVVLAVRNLDK GKQAAARITEATPGAEVELQELDLTSLASVRAAAAQLKSDHQRIDLLINN AGVMYTPRQTTADGFEMQFGTNHLGHFALTGLLIDRLLPVAGSRVVTISS VGHRIRAAIHFDDLQWERRYRRVAAYGQAKLANLLFTYELQRRLAPGGTT IAVASHPGVSNTELVRNMPRPLVAVAAILAPLMQDAELGALPTLRAATDP AVRGGQYFGPDGFGEIRGYPKVVASSAQSHDEQLQRRLWAVSEELTGVVY PVG >MT3589 oxidoreductase, short-chain dehydrogenase/reductase family MVQNGENLFQFRREGPQVQLSFQDRTYLVTGGGSGIGKGVAAGLVAAGAA VMIVGRNPDKLAAAVKDIEALKTGAIGYEPADITDEEQTLRVVDAATAWH GRLHGVVHCAGGSQTIGPITQIDSQAWRRTVDLNVNGTMYVLKHAARELV RGGGGSFVGISSIAASNTHRWFGAYGVTKSAVDHMMKLAADELGPSWVRV NSIRPGLIRTDLVVPVTESPELSADYRVCTPLPRVGEVEDVANLAMFLLS DAASWITGQVINVDGGHMLRRGPDFSPMLEPVFGADGLRGVVG >MT0153 conserved hypothetical protein MTELDDVSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAATNPLIR DEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVHRVACDYQAVRTHFF DEYFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKA GILQSHGAVPTARRHAVAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPY LPGDAADRLFDMVTALSAPGSQVAVEAFTMNTKGNTQRWNRMRERLGLDI DVQALTYHEPDRSDAAQWLATHGWQVHSVSNREEMARLGRAIPQDLVDET VRTTLLRGRLVTPAQPA >MT3123 hypothetical protein MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENYDEKWSI SYDQRCVDYARGRFDAIVPDEVIAQLPYDRALELGCGTGFFLLNLIQAGV ARRGSVTDLSPGMVKVATRNGQALGLDIDGRVADAEGIPYDDDAFDLVVG HAVLHHIPDVELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWR VVTNATKLPGLRGWRRPQGELDESSRAAALEALVDLHTFTPQDLQRIAHN AGAVEVQTATEEFTAAMLGWPLRTFECTVPPGRLGWGWARFAFTSWKTLG WVDANVWRHVVPKGWFYNVMITGVKPS >MT2821 conserved hypothetical protein, truncation MARNPAAQTAFGPMVLAAVEQNEPPGRRLVDDDLADLFLPRPLRWLAGAT RSAVLRRLLISASEWSGRGLWANLACRKRFIGDKLDEALGDIDAVVILGA GLDTRAYRLTRRVRMPVFEVDLPVNIARKAKTVRRVLGELPLSVRLVALD FEHDDLLTALAEHGYRTEYRVFFVCEGVTQYLTERAVRRTLEGLRAAAPG SRMVFTYVRRDFIDGTNRYGTRTLYHTVRQRRQLWHFGLDPEEVAGFLAD YGWRLTEQAGPEELVQRYVEPTGRNLNASQIEWSAYAEKSEPVTPR >MT3616 substrate--CoA ligase, putative MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLAHHLIDQ GVQKDDKVGLYCRNRIEIVIAMLGIVKAGAILVNVNFRYVEGELRYLFDN SDMVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYS AIAAGSPERDFGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTD FATGEFVKDEYDLAKAAAANPPMIRYPIPPMIHGATQSATWMALFSGQTT VLAPEFNADEVWRTIHKHKVNLLFFTGDAMARPLVDALVKGNDYDLSSLF LLASTAALFSPSIKEKLLELLPNRVITDSIGSSETGFGGTSVVAAGQAHG GGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYYKDEKKTAE TFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAA LKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAELDSFVRSEIA GYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG >MT3423 hypothetical protein MSVQTDPALREHPNRVDWNARYERAGSAHAPFAPVPWLADVLRAGVPDGP VLELASGRSGTALALAAHGRQVTAIDVSDVALLQLDSEAVRRGVADRLNL VQADLGCWEPGETRFALVLSRLFWDAAIFHRACEAVMPGGVLAWESLALS GAEAGTASAKRRVKPGEPACLLPADFTVVHEGQGNCDSAPSRIMIARRSP LPGA >MT0751 conserved hypothetical protein MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLINDQFAEPL VRAVGVDVLTRLASGELTASDIDDPERPNASMVRMAEHHAVRTKFFDEFF MDATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLA ELGATPTADRRVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPE AQDRLLDNVTALSVPDSRFATESIRNFKPHHEERMRERMTILANRWRAYG FDLDMNELVYFGDRNEPASYLSDNGWLLTEIKSQDLLTANGFQPFEDEEV PLPDFFYVSARLQRKHRQYPAHRKPAPSWRHTACPVNELSKSAAYTMTRS DAHQASTTAPPPPGLTG >MT2749 conserved hypothetical protein MTAQFDPADPTRFEEMYRDDRVAHGLPAATPWDIGGPQPVVQQLVALGAI RGEVLDPGTGPGHHAIYYAAKGYAATGIDGSVAAIERARDNARKAGVSVN FQVGDATTLDGLDGRFDTVVDCAFYHTFSTAPELQRCYVRALRRASKPGA RLYMFEFGEHNVNGFSMPRSLSEDDFRQVLPVGGWEITYLGTTTYQVNLS VEALELMAARNPDMADQVRCVLERFRAIKPWLVGGRVHAPFWEVHATRVD >MT0108 substrate--CoA ligase, putative MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRYRELVAE VGGLAADLRAQSVSRGSRVLVISDNGPETYLSVLACAKLGAIAVMADGNL PIAAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRE SEHSLDAASLAGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDI LQKEGLNWVTWVVGETTYSPLPATHIGGLWWILTCLMHGGLCVTGGENTT SLLEILTTNAVATTCLVPTLLSKLVSELKSANATVPSLRLVGYGGSRAIA ADVRFIEATGVRTAQVYGLSETGCTALCLPTDDGSIVKIEAGAVGRPYPG VDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNPERTAEVLI DGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGV REAACYEIPDEEFGALVGLAVVASAELDESAARALKHTIAARFRRESEPM ARPSTIVIVTDIPRTQSGKVMRASLAAAATADKARVVVRG >MT0040 AMP-binding family protein MTAALLSPAIAWQQISACTDRTLTITCEDSEVISYQDLIARAAACIPPLR RLDLKRGEPVLITAHTNLEFLSCFLGLMLHGAVPVPIPPREALKTTERFM TRLGPLLRHHRVLICTPAEHDEIRAAASTDCQISRFTALAEAGDEQFGRA TAQQLADTATADWPLCTLDDDAYVQYTSGSTAAPRGVVITYRNLLSNMRA MAVGSQFQHGDVMGSWLPLHHDMGLVGSLFAALFNSVSAVFTTPHRFLYD PLGFLRLLTSSGATHTFMPNFALEWLINAYHRRGADIEGIDLHKMRRLII ASEPVHAEGMRRFAATFAGVGLAPTALGSGYGLAEATVAVSMSAPNTGFR TETHAAAEVVTGGRVLPGYEVRIDAAPGARAGTIKLRGDSVAAKAYVGGK KLDALDEEGFCDTHDLGFLVDDEIVILGRQDEVFIVHGENRFPYDIEFII RGESEQHRTKVACFGVNERVVVVLESPLDSIIDKAEADRLRCQVVAATGL QLDELITVRRGAIPTTTSGKLKRRAVAQAYRDGTLPRLATHAWTADPDSA PKTTRSSLEGAH >MT1283 oxidoreductase, short-chain dehydrogenase/reductase family MEGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLADTEHRL KAISTPVKTDRLDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIE VSQFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVINISSVFGLFSAP GQAAYNSAKFAVRGFTEALRQEMALAGHPVKVTTVHPGGVKTAIARNATA AEGLDQAELAETFDKRVAHLSPQRAAQIILTGVAKNKARVLVGVDAKVLD LVVRLTGSGYQRIFPIITGRLIPRPR >MT2019 virulence factor mce family protein MRENLGGVVVRLGVFLAVCLLTAFLLIAVFGEVRFGDGKTYYAEFANVSN LRTGKLVRIAGVEVGKVTRISINPDATVRVQFTADNSVTLTRGTRAVIRY DNLFGDRYLALEEGAGGLAVLRPGHTIPLARTQPALDLDALIGGFKPLFR ALNPEQVNALSEQLLHAFAGQGPTIGSLLAQSAAVTNTLADRDRLIGQVI TNLNVVLGSLGAHTDRLDQAVTSLSALIHRLAQRKTDISNAVAYTNAAAG SVADLLSQARAPLAKVVRETDRVAGIAAADHDYLDNLLNTLPDKYQALVR QGMYGDFFAFYLCDVVLKVNGKGGQPVYIKLAGQDSGRCAPK >MT3874 conserved hypothetical protein MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARIFVDAAG DGIWSMYTNRTLLAGATDLDPDLRAPIQQMIDFMAARTAFFDEYFLATAD AGVRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQ PASQLVNVPIDLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLL FERIDALSRPGSWLASNVPGAGFLDPERMRRQRADMRRMRAAAAKLVETE ISDVDDLWYAEQRTAVAEWLRERGWDVSTATLPELLARYGRSIPHSGEDS IPPNLFVSAQRATS >MT2323 oxidoreductase, short-chain dehydrogenase/reductase family MAKDLVATVPDLSGKLAIITGANSGLGFGLARRLSAAGADVIMAIRNRAK GEAAVEEIRTAVPDAKLTIKALDLSSLASVAALGEQLMADGRPIDLLINN AGVMTPPERVTTADGFELQFGSNHLGHFALTAHLLPLLRAAQRARVVSLS SLAARRGRIHFDDLQFERSYAPMTAYGQSKLAVLMFARELDRRSRAAGWG IISNAAHPGLTKTNLQIAGPSHGRDKPALMERLYKTSWRFAPFLWQEIEE GILPALYAAATPQADGGAFYGPRGRYEVAGGGVREAKVPAAARNDADSKR LWEVSEQLTGVSYPKSR >MT2697 conserved hypothetical protein MANKRGNAGQPLPLSDRDDDHMQGHWLLARLGKRVLRPGGVELTRTLLAR AEVTDADVLELAPGLGRTAAEILARNPRSYVGAESDPNAANLVRHVLAGR GDVRVTDAADTGLSDASADVVIGEAMLTMQGNAAKHTIVAEAARVLRPGG RYAIHELALVPDDVAEQVRTDLRQSLARALKVNARPLTVAEWSHLLAGHG LVVEHVVTASMALLQPRRVIADEGLLGALRFAGNLLIHRAARRRVLLMRH TFRRHRERLTAVAIVAHKPHVDS >MT2820 oxidoreductase, short-chain dehydrogenase/reductase family MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQIASVPYP LSTADDLAATVELVEDAGGGIVARQGDVRDRASLSVALQAGLDEFGRLDI VVANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVL ISSAAGLVGIGSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHP CGVDTPMINNEFFQQWLTTADMDAPHNLGNALPVELVQPTDIANAVAWLA SEEARYVTGVTLPVDAGFVNKR >MT3605 conserved hypothetical protein MSMDTARAAFRRPFQFREFLDQTWMVARVSLVPTLLVSIPFTVLVAFTLN ILLREIGAADLSGAGTAFGTITQLGPVVTVLVVAGAGATAICADLGARTI REEIDAMRVLGIDPIQRLVVPRVLASTLVALLLNGLVCAIGLSGGYAFSV FLQGVNPGAFINGLTVLTGLRELILAEIKALLFGVMAGLVGCYRGLTVKG GPKGVGNAVNETVVYAFICLFVINVVMTAIGVRISAQ >MT0154 conserved hypothetical protein MSAMRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLLVTN AGAGAIWEAMLDPTLVAKAAAIDAETAAIVAYLRSYQAVRTNFFDTYFAS AVAAGIRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAEN GVTPSAGRREVPADLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQ DRLFTQVGAVSVAGSRIAAETAPVHGEERRAEMRARFKKVADVLGIEQTI DVQELVYHDQDRASVADWLTDHGWRARSQRAPDEMRRVGRWVEGVPMADD PTAFAEFVTAERL >MT2302 conserved hypothetical protein MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSAMQERLP FFADLEASQRASVALVVQTAVVNFVEWMHDPHSDVGYTAQAFELVPQDLT RRIALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFT AATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAP ATVLVGTPAPGPNGSNSDGDSERASQDVRDTAARHGRAALTDVHGTWLVA IVSGQLSPTEKFLKDLLAAFADAPVVIGPTAPMLTAAHRSASEAISGMNA VAGWRGAPRPVLARELLPERALMGDASAIVALHTDVMRPLADAGPTLIET LDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQPRDAYVLR VAATVGQLNYPTPH >MT0577 substrate--CoA ligase MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELLRSPTHN GHLLVGALKRHQNKPVLFLGDTRLTGGQLADRISQYIQAFEALGAGTGVA VGLLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLI IDPNPMFVERALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQP QPLVAADLPPDQVIGLTYTGGTTGKPKGVIGTAQSIATMTSIQLAEWEWP ANPRFLMCTPLSHAGAAFFTPTVIKGGEMIVLAKFDPAEVLRIIEEQRIT ATMLVPSMLYALLDHPDSHTRDLSSLETVYYGASAINPVRLAEAIRRFGP IFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVALLDEHGKPV KQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYY IVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVVGAPDEKWGEAVTA VVVLRSNAARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGL GKPDKKAVRARFWEGAGRAVG >MT0231 hypothetical protein MHTLKVAVIELDSDRQEFGVDAFREVIAGRLHKLEPLGYQLVDVPLKFHH PMWREHCQVDLNYHIRPWRLRAPGGRRELDEAVGEIASTPLNRDHPLWEM YFVEGLANHRIAVVAKIHHALADGVASANMMARGMDLLPGPEVGRYVPDP APTKRQLLSAAFIDHLRHLGRIPATIRYTTQGLGRVRRSSRKLSPALTMP FTPPPTFMNHRLTPERRFATATLALIDVKATAKLLGATINDMVLAMSTGA LRTLLLRYDGKAEPLLASVPVSYDFSPERISGNRFTGMLVALPADSDDPL QRVRVCHENAVSAKESHQLLGPELISRWAAYWPPAGAEALFRWLSERDGQ NKVLNLNISNVPGPRERGRVGAALVTEIYSVGPLTAGSGLNITVWSYVDQ LNISVLTDGSTVQDPHEVTAGMIADFIEIRRAAGLSVELTVVESAMAQA >MT3787 P450 heme-thiolate protein MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGLPAPRGF RAAFAAAYAVAYLAGGERRMLRLIRRYGPIMTMPILSLGDVAIVSDSALA KEVFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPEHLRRRKLLTPPLHG AALDRYVPIIENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDD PEEVRRLGRPFERLLNLGVSEQLTVRYALRRLGALRVWPARARANTEIDD VVMALIAQRRADPRLGERHDVLSLLVSARGESGEQLSDSEIRDDLITLVL AGHETTATTLAWAFDLLLHHPDALRRVRAEAVGGGEAFTTAVINETLRVR PPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYEHPHEFRPE RFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDD EPERIVRRSIMLVPRRGTRVRFRPAR >MT0342 P450 heme-thiolate protein MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFSLRVPPY ADNLVVYTRPEHIKEIFAADPRSLHAGEGNHILGFVMGEHSVLMTDEAEH ARMRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALT LDIILRVVFGVTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWK RFFHNQTKIDEILYREIASRRIDSDLTARTDVLSRLLQTKDTPTKPLTDA ELRDQLITLLLAGHETTAAALSWTLWELAHAPEIQSQVVWAAVGGDDGFL EAVLKEGMRRHTVIASTARKVTAPAEIGGWRLPAGTVVNTSILLAHASEV SHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTEGAVILQEI FRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP >MT3114 methyltransferase, putative MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENYWFRRHQ VVYQRLAPRCTARDVLEAGCGEGYGADLIACVARQVIAVDYDETAVAHVR SRYPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLR GSGLLMVSTPNRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAM CGLFHGPRLRDMDARHGGSIIDAQIMRAVAGAPWPPELAADVAAVTTADF EMVAAGHDRDIDDSLDLIAIAVRP >MT3002 polyketide synthase MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCRFPGDVD GPESFWDFLVAGRNAISTVPADRWDAEAFYHPDPLTPGRMTTKWGGFVPD VAGFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTA VMMGVYFNEYQSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVA VDTACSSSLVAVHLACQSLRLRETDLALAGGVSITLRPETQIAISAWGLL SPQGRCAAFDAAADGFVRGEGAGVVVLKRLTDAVRDGDQVLAVVRGSAVN QDGRSNGVTAPNTAAQCDVIADALRSGDVAPDSVNYVEAHGTGTVLGDPI EFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATLAVQRATIP PNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAH VIIEQGSELAPVSEGGEDTGVSTLVVTGKTAQRMAATAQVLADWMEGPGA EVAVADVAHTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVS HQDGSPGPGTVFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQA GFSLRDVIATGKELVGIEQIQLGLIGMQLTLTELWRSYGVQPDLVIGHSM GEVAAAVVAGALTPAEGLRVTATRARLMAPLSGQGGMALLGLDAAATEAL IADYPQVTVGIYNSPRQTVIAGPTEQIDELIARVRAQNRFASRVNIEVAP HNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWATN MRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTK SAAKYLSIGTLQRDADDTVTFRTNLYTADIAHPPHTCHPPEPHPTIPTTP WQHTHHWIATTHPSTAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPI RPAVSADPPSTAAWLVVADNELCHELARAADSRVDSLSPPALAAGSDPAA LLDALRGVDNVLYAPPVPGELLDIESAYQVFHATRRLAAAMVASSATAIS PPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEHPEIWGGIIDLDD SMPAELAVRHVLTAAHGTDGEDQVVYRSGARHVPRLQRRTLPGKPVTLNA DASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATG TDLIAVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTD DDVTTMFRPKLDALALLHRRSLKSPVRHFVLFSSVSGLLGSRWLAHYTAT SAFLDSFAGARRTMGLPATVVDWGLWKSLADVQKDATQISAESGLQPMAD EVAIGALPLVMNPDAAVATVVVAADWPLLAAAYRTRGALRIVDDLLPAPE DVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATVMGMPPTEPLDPSAG FFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYLATVLPE LLEIGATAVATQQATDSYHELTEAELLEQLSERLRGTQ >MT3174 substrate--CoA ligase MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCADVLTALG IAKGDRVALLMPNSVEFCCLFYGAAKLGAVAVPINTRLAAPEVSFILSDS GSKVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPA VECGGDDNLFIMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRL LLPLPMFHVAALTTVIFSAMRGVTLISMPQFDATKVWSLIVEERVCIGGA VPAILNFMRQVPEFAELDAPDFRYFITGGAPMPEALIKIYAAKNIEVVQG YALTESCGGGTLLLSEDALRKAGSAGRATMFTDVAVRGDDGVIREHGEGE VVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYLYIKDRLKD MIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQN EVSEQQIVEYCGTRLARYKLPKKVIFAEAIPRNPTGKILKTVLREQYSAT VPK >MT3666 substrate--CoA ligase MPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGAAAALIALGVEPAD RVAIWSPNTWHWVVACLAIHHAGAAVVPLNTRYTATEATDILDRAGAPVL FAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIATGAGALD AVAARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASASWAANGKI TSDDRYLCINPFFHNFGYKAGILACLQTGATLIPHVTFDPLHALRAIERH RITVLPGPPTIYQSLLDHPARKDFDLSSLRFAVTGAATVPVVLVERMQSE LDIDIVLTAYGLTEANGMGTMCRPEDDAVTVATTCGRPFADFELRIADDG EVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDIGAVDQAGNLRITDRL KDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVGRAFVVAR PGTGLDEASVIAYTREHLANFKTPRSVRFVDVLPRNAAGKVSKPQLRELG >MT2114 carboxymethylenebutenolidase, putative MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKLISERIA RAGYVVLTPNMYARGGRARCITRVFRELLTKRGRALDDILAARDHLLAMP ECSGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACP IVASFGTRDPLGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQP LVRIAGFGYNEAATEDAWRRVFEFFGQHLRAGSPGEP >MT1219 hypothetical protein MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVPVSYMQA QHIRGYCEQKAKGLDYSRLMVVSCQQPGQCDIRAANYVINAHLRRHDTYR SWFQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQW GCFRFGIVQGCDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAP LELPPAGSYDDFCRRQHTFSSTLTVESPQVRAWTKFAEGTNGSFPDFPLP LGDPSKPSDADIVTVMMLDEEQTAQFESVCTAAGARFIGGVLACCGLAEH ELTGTTTYYGLTPRDTRRTPADAMTQGWFTGLIPITVPIAGSAFGDAARA AQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAGAAPLSVLL TAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVAR YLATLKSVFQRVAESGQQQNVA >MT2133 oxidoreductase, short-chain dehydrogenase/reductase family MDDTGAAPVVIFGGRSQIGGELARRLAAGATMVLAARNADQLADQAAALR AAGAIAVHTREFDADDLAAHGPLVASLVAEHGPIGTAVLAFGILGDQARA ETDAAHAVAIVHTDYVAQVSLLTHLAAAMRTAGRGSLVVFSSVAGIRVRR ANYVYGSAKAGLDGFASGLADALHGTGVRLLIARPGFVIGRMTEGMTPAP LSVTPERVAAATACALVNGKRVVWIPWALRPMFVALRLLPRFVWRRMPR >MT0793 oxidoreductase, short-chain dehydrogenase/reductase family MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAVAKQIVA DGGTAIHVPVDVSDEDSAKAMVDRAVGAFGGIDYLVNNAAIYGGMKLDLL LTVPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYS NFYGLAKVGVNGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELV KNMVQTIPLSRMGTPEDLVGMCLFLLSDSASWITGQIFNVDGGQIIRS >MT0683 dioxygenase, putative MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRYLRNGPN PVAEVDPATYHWFTGDAMVHGVALRDGKARWYRNRWVRTPAVCAALGEPI SARPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCD FDGTLHGGYTAHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTV DIEVAGSPMMHSFSLTDNYVVIYDLPVTFDPMQVVPASVPRWLQRPARLV IQSVLGRVRIPDPIAALGNRMQGHSDRLPYAWNPSYPARVGVMPREGGNE DVRWFDIEPCYVYHPLNAYSECRNGAEVLVLDVVRYSRMFDRDRRGPGGD SRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHRFAYTVGIE GGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAE DDGILMGYGWHRGRDEGQLLLLDAQTLESIATVHLPQRVPMGFHGNWAPT T >MT3633 oxidoreductase, short-chain dehydrogenase/reductase family MTGMLKRKVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERLDDVAKQ IIDTGRRAVAVRTDITDDDDVSNLVQATLAAYGKADVLINNAFRVPSMKP LAGTTFEHIRDAIELSALGTLRLIQAFTPALAQSHGAIVNVNSMVIRHSQ PKYGTYKMAKSVLLAMSHSLATELGEQGIRVNSVAPGYIWGDTLKSYFDH QAGKYGTTVDQIYQATAANSDLKRLPTEDEVASAILFLASDLASGITGQT LDVNCGEYHT >MT2580 substrate--CoA ligase MAAAEVVDPNRLSYDRGPSAPSLLESTIGANLAATAARYGHREALVDMVA RRRFNYSELLTDVHRLATGLVRAGIGPGDRVGIWAPNRWEWVLVQYATAE IGAILVTINPAYRVREVEYALRQSGVAMVIAVASFKDADYAAMLAEVGPR CPDLADVILLESDRWDALAGAEPDLPALQQTAARLDGSDPVNIQYTSGTT AYPKGVTLSHRNILNNGYLVGELLGYTAQDRICIPVPFYHCFGMVMGNLA ATSHGAAMVIPAPGFDPAATLRAVQDERCTSLYGVPTMFIAELGLPDFTD YELGSLRTGIMAGAACPVEVMRKVISRMHMPGVSICYGMTETSPVSTQTR ADDSVDRRVGTVGRVGPHLEIKVVDPATGETVPRGVVGEFCTRGYSVMAG YWNDPQKTAEVIDADGWMHTGDLAEMDPSGYVRIAGRIKDLVVRGGENIS PREIEELLHTHPDIVDGHVIGVPDAKYGEELMAVVKLRNDAPELTIERLR EYCMGRIARFKIPRYLWIVDEFPMTVTGKVRKVEMRQQALEYLRGQQ >MT1557 hypothetical protein MFALSNNLNRVNACMDGFLARIRSHVDAHAPELRSLFDTMAAEARFARDW LSEDLARLPVGAALLEVGGGVLLLSCQLAAEGFDITAIEPTGEGFGKFRQ LGDIVLELAAARPTIAPCKAEDFISEKRFDFAFSLNVMEHIDLPDEAVRR VSEVLKPGASYHFLCPNYVFPYEPHFNIPTFFTKELTCRVMRHRIEGNTG MDDPKGVWRSLNWITVPKVKRFAAKDATLTLRFHRAMLVWMLERALTDKE FAGRRAQWMVAAIRSAVKLRVHHLAGYVPATLQPIMDVRLTKR >MT3615.2 fatty-acid-CoA ligase-related protein MAASLSENLSCHSSNMCRLSGNAATNLERPGEEPPGDRCTRRQAVRPART LAKKGNIPVGYYKDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGR GSVSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQA RPGCRPSLAELDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQ TEARPADDVHAGHVTSGS >MT3263 oxidoreductase MTSLAERTVLVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAIDVSDPR VIPLQLDVTDAVSVAEAADLATDVGILINNAGISRASSVLDKDTSALRGE LETNLFGPLALASAFADRIAERSGAIVNVSSVLAWLPLGMSYGVSKAAMW SATESMRIELAPRGVQVVGVYVGLVDTDMGRFADAPKSDPADVVRQVLDG IEAGKEDVLADEMSRQVRASLNVPARERIARLMGN >MT0328 hypothetical protein MSNTVVITGHCTSLTVSGMRNSVTVDSVDTIEAAGFNNEVTYHSGSPKIS NAGGSNSVQQG >MT0869 copper-binding protein, putative MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGAAGMTAA IDAAEAARPHSGRTVTATLTPQPARIDLGGPIVSTLTYGNTIPGPLIRAT VGDEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRF SVPDPGTYWAHPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTD GIGKSPQQLYGELTDPNKPTMQNTTGMPEGEGVDSNLLGGDGGDIAYPYY LINGRIPVAATSFKAKPGQRIRIRIINSAADTAFRIALAGHSMTVTHTDG YPVIPTEVDALLIGMAERYDVMVTAAGGVFPLVALAEGKNALARALLSTG AGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLPVTLGGTMA KYDWTINGEPYSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIK ADGSPGARKDTVIVLPKQKMRAVLVADNPGVWVMHCHNNYHQVAGMATRL DYIL >MT3601 virulence factor mce family protein MLNRKPSSKHERDPLRTGIFGLVLVICVVLIAFGYSGLPFWPQGKTYDAY FTDAGGITPGNSVYVSGLKVGAVSAVSLAGNSAKVTFSVDRSIVVGDQSL AAIRTDTILGERSIAVSPAGSGKSTTIPLSRTTTPYTLNGVLQDLGRNAN DLNRPQFEQALNVFTQALHDATPQVRGAVDGLTSLSRALNRRDEALQGLL AHAKSVTSVLSERAEQVNKLVEDGNQLFAALDARRAALSALISGIDDVAA QISGFVADNRKEFGPALSKLNLVLANLNERRDYITEALKRLPTYATTLGE VVGSGPGFNVNVYSVLPGPLVATVFDLVFQPGKLPDSLADYLRGFIQERW IIRPKSP >MT0971 oxidoreductase, short-chain dehydrogenase/reductase family MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRLTELKAE LSQRYPDIKIAVAELDVNDHERVPKVFAELSDEIGGIDRVIVNAGIGKGA RLGSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVK GVPGVKAAYAASKAGVRSLGESLRAEYAQRPIRVTVLEPGYIESEMTAKS ASTMLMVDNATGVKALVAAIEREPGRAAVPWWPWAPLVRLMWVLPPRLTR RFA >MT0293 conserved hypothetical protein MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAFCRAVGG SWADVLDGKLPDHKLKSTDFGEHFVNFQGARTKYFDEYFRRAAAAGARQV VILAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRR EIAVDLRDDWPQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDA LAGRRSHVAVEDGAPMGPDEYAAKVEEERAAIAEGAEEHPFFQLVYNERC APAAEWFGERGWTAVATLLNDYLEAVGRPVPGPESEAGPMFARNTLVSAA RV >MT1753.1 oxidoreductase, short-chain dehydrogenase/reductase family MEEMALAQQVPNLGLARFSVQDKSILITGATGSLGRVAARALADAGARLT LAGGNSAGLAELVNGAGIDDAAVVTCRPDSLADAQQMVEAALGRYGRLDG VLVASGSNHVAPITEMAVEDFDAVMDANVRGAWLVCRAAGRVLLEQGQGG SVVLVSSVRGGLGNAAGYSAYCPSKAGTDLLAKTLAAEWGGHGIRVNALA PTVFRSAVTEWMFTDDPKGRATREAMLARIPLRRFAEPEDFVGALIYLLS DASSFYTGQVMYLDGGYTAC >MT2439 conserved hypothetical protein MVLPKPTPRGRELIRQAAKVALHPTPEWLDELDRATLAAHPSIAADPALA TVVSRANRSHLIHFATANLRKPGQPVPANLGPDPLRMARDLVRRGLDASA LDVYRVGQNVAWQRWTEIAFGLTTDPQELHELLTLPFRSASEFIDATLAG LAAQMQLEYDELTRDVHAEHRRIVELILDGAPISRQSAEAKLGYPLDRSH TAAIIWYDDPDDNQNHLDHTARAFGRALGCPQPLIAVASAATRWVWVSDA ATLDTDRIHQVLDHAPHARIAVGTTARGIDGFRRSHRDALATQRMLARLR SQQRLAFFADIHMIAVLTENPDSAADFITSTLGDLESASPQLLTTVLTYI NEQCNASRAAHVLHTHRNTLLRRLETAQRLLPRPLDHTIIQVAVAISALQ WRGSQTSDPVETPVEGITSPPPESLGRRRSRLAQLER >MT1904 oxidoreductase, short-chain dehydrogenase/reductase family MAVEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAKELDVDA VVCDTTDPTSLTEARGLFPRHLDTIVNVPAPSWDAGDPRAYSVSDTANAW RNALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALS NWIAGQAAVFGTRGITINTVACGRSVQTGYEGLSRTPAPVAAEIARLALF LTTPAARHITGQTLHVSHGALAHFG >MT2448 peptide synthetase MSIPCWHWTCVTDSNDQSARRCRWPRSWATSPVMDLSRNSKMPTSAHTPH RKWTFRVTNTADIGARLDEARLELLRRRLADRGLSSAAQDIGPHTDDRLS DGQARMWFVQMADPSGALLNICVSYRITGDIDLARLRDAVNAVARRHRIL RTTYPVGDDGVAQPTVHADLRPGWTQYDLTDLSQRAQRLRLEVLAQREFC APFELSRDAPLRITVVRTAADEHVLLLVAHHIAWDDGSWRVFFTDLTQAY SRADLGADLGPEHRPSAASGPDTTEADLNYWRAIMADPPEPLELPGPAGT CVPTSWRAARATLRLPADTAARVATMAKNTGCTPYMVLLAAFGALVHRYT HSDDFLVAAPVLNRGAGTEDAIGYFGNTVAMRLRPQSAMSFRELLTATRD IASGAFAHQRINLDRVVRELNPDRRHGAERMTRVSFGFREPDGGGFNPPG IECERYDLRSNITQLPLGFMVEFDRAGVLVEAEHLVEILEPALAKQMLRH FGVLLDNALAAPDNTLSGLALMDERDAARLREVSRGERFDTPVKTLVDLV NEQTTRTPDATAVVYEGQHFTYHDLNEASNRLGHWLIEQGIGSEDRVAVL LDKSPDLIVTALGVVKSGAVYVPVDPSYPQDRLDFILADCDAKLVLRTPV RELAGYRSDDPTDADRIRPLRPDNTAYLIYTSGTTGLPKGVAVPHRPVAE YFVWFKGEYDVDDTDRLLQVASPSFDVSIAEIFGTLACGARMVIPRPGGL TDIGYLTALLRDEGITAMHFVPSLLGLFLSLPGVSQWRTLQRVPIGGEPL PGEVADKFHATFDALLHNFYGPTETVINASRFKVVGPQGTRIVPIGRPKI NTTMHLLDDSLQPVPTGVIGEIYIGGTHVAYGYHRRAGLTAERFVADPFN PGSRMYRSGDLARRNADGDIEFVGRADEQVKIRGFRIELGDVAAAIAVDP TVGQAVVVVSDLPRLGKSLVGYVTPAAGGDGPADVGVDLDRIRARVAAAL PEYMLPAAYVVLDEIPITAHGKIDRAALPEPQIASDTEFRAPQTATERRL AQLFGELLGRDRVGADDSFFDLGGHSLLATKLVAAVRNAFGVDVGVREIF EFATVTALAGHIDTLDSDSARPRLTRVDHDGPVRLSSSQMRSWFNYRFDG PNAVNNIPFAAALHGPCDTNAFAAAITDVVARHEILRTVYREIGGVPHQI IQPPAEVPVRCAAGSDAAWLRAELNNERGYVFDLETDWPIRAALLSTPEQ TVLSLVVHHIAGDHWSAGVLFTDLLTAYRARSTGQRPSWAPLPVQYADYS VWQSALLDDGAGIVGPQRDYWIRQLGGLAGETGLRPDFPRPALLSGAGDA VEFRLGAAIRDKLAAVSRDLGVTEFMLLQAAVAVVLHKAGGGVDVPIGAP VAGRSEANLDQLIGFFINIVVLRNDLRGNPTLREVLQRTRQMALAAYAHQ DLPFDQVVEAVNPQRSLSRNPLFDIVVHVREQMPQDHVIDTGPDGDTTLR VLEPTFDAAQADLSVNFFACGDEYRGHVIYRTELYERATAQRFADWLVRV VEAFADRPDQPLREVEMVSAQARRRILDRSNAGAGTARVYLLDDALKPVP VGVVGDVYYGGGPAVGARLARPSETATRFVADPFAAQPGSRLYRNGERGV WKADGQLELLAEIERLPTAQAAPVPAEPADTETERALAAILADVLEVGEV GRYDDFFNLGGDSILATQVAARARDGGIPLTARMVFEHPVLCELAAAVDA KPHVEAEPDDKHHAPMSTSGLSPDELSALTASWDQWP >MT0917 conserved hypothetical protein MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVFCRAAGG EWADVLDGKLPDHYLTTGDFGEHFVNFQGARTRYFDEYFSRATAAGMKQV VILAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRR SVAVDLRDEWQIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDT LASPGSHVAVEEATPLDPCEFAAKLERERAANAQGDPRRFFQMVYNERWA RATEWFDERGWRATATPLAEYLRRVGRAVPEADTEAAPMVTAITFVSAVR TGLVADPARTSPSSTSIGFKRFEAD >MT0617 conserved hypothetical protein MVESSTASAAAVLRARYPRTAASLDRYGGGTARRLERTGTFARFTRISVV QIGWALRRYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAI QGFASLGNIGVEAFTGFFAALANTRVAAPIVSGVALAATVGAGATAQLGA MRISEEIDALEVMGIKSISFLVSTRILGGLVVIMPLYALALDMAFTSGQV VTTVFYGQSNGTYEHYFRTFLRPEDVGWSVVEVVIIAVVVMIIHCYYGYT ASGGPVGVGQAVGRSMRFSLVSVVVVVLLAELALYGVDPNFNLTV >MT3735 conserved hypothetical protein MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERELPTVIA NSTTVVKGLARPGHVPVDRVDHDWVRIDNLLLHGTRYEALPVHPKLLPVI EGVLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIA LCDFTADNGATQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWH TAAANRTDAPRPALTINFCVGFVRQQVNQQLSIPRELVRCFEPRLQELIG YGLYAGKMGRIDWRPPADYLDADRHPFLDAVADRLQTSVRL >MT0038 acp-1, acyl carrier protein MKEAINATIQRILRTDRGITANQVLVDDLGFDSLKLFQLITELEDEFDIA ISFRDAQNIKTVGDVYTSVAVWFPETAKPAPLGKGTA >MT1385 acp-2, acyl carrier protein MWRYPLSTRLALPNTPGVASFAMTSSPSTVSTTLLSILRDDLNIDLTRVT PDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIA AKYRDE >MT2304 acp-3, acyl carrier protein MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSMVEIAVQ TEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENP DAVANVQARLEAESK >MT2452 entE, 2,3-dihydroxybenzoate-AMP ligase MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDTVLSDAA RRWPDRLAVADAGDRPGHGGLSYAELDQRADRAAAALHGLGITPGDRVLL QLPNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVAD VASGFDYRPMARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAP PADPGSPALLLVSGGTTGMPKLIPRTHDDYVFNATASAALCRLSADDVYL VVLAAGHNFPLACPGLLGAMTVGATAVFAPDPSPEAAFAAIERHGVTVTA LVPALAKLWAQSCEWEPVTPKSLRLLQVGGSKLEPEDARRVRTALTPGLQ QVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNADGEPVGPGE EGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTG RVKDVICRAGETIAASDLEEQLLSHPAIFSAAAVGLPDQYLGEKICAAVV FAGAPITLAELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVR QLGIATGPVTTQRCH >MT2305 fabF-1, 3-oxoacyl-(acyl-carrier-protein) synthase II MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIHALEDEF VTKWDLAVKIGGHLKDPVDSHMGRLDMRRMSYVQRMGKLLGGQLWESAGS PEVDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGA AAVIGLQLGARAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEG PIEALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFGEAGALMLIET EEHAKARGAKPLARLLGAGITSDAFHMVAPAADGVRAGRAMTRSLELAGL SPADIDHVNAHGTATPIGDAAEANAIRVAGCDQAAVYAPKSALGHSIGAV GALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYGDYRYAVNN SFGFGGHNVALAFGRY >MT2306 fabF-2, 3-oxoacyl-(acyl-carrier-protein) synthase II MTELVTGKAFPYVVVTGIAMTTALATDAETTWKLLLDRQSGIRTLDDPFV EEFDLPVRIGGHLLEEFDHQLTRIELRRMGYLQRMSTVLSRRLWENAGSP EVDTNRLMVSIGTGLGSAEELVFSYDDMRARGMKAVSPLTVQKYMPNGAA AAVGLERHAKAGVMTPVSACASGAEAIARAWQQIVLGEADAAICGGVETR IEAVPIAGFAQMRIVMSTNNDDPAGACRPFDRDRDGFVFGEGGALLLIET EEHAKARGANILARIMGASITSDGFHMVAPDPNGERAGHAITRAIQLAGL APGDIDHVNAHATGTQVGDLAEGRAINNALGGNRPAVYAPKSALGHSVGA VGAVESILTVLALRDQVIPPTLNLVNLDPEIDLDVVAGEPRPGNYRYAIN NSFGFGGHNVAIAFGRY >MT0256 fabG-1, 3-oxoacyl-(acyl-carrier-protein) reductase MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPLTGSLLI GGAGRVVEPLRAALEKDYDLVGNNLGGRWADSFGGLVFDATGITEPAGLK GLHEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGK ELRRGATTALVYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADD STPPADWEKPLDGKVAIVTGAARGIGATIAEVFARDGAHVVAIDVESAAE NLAETASKVGGTALWLDVTADDAVDKISEHLRDHHGGKADILVNNAGITR DKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNGSIGEGGRVIGLSSIAG IAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGFIETQMTAA IPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQA MIGA >MT1530 fabG-2, 3-oxoacyl-(acyl-carrier-protein) reductase MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSG APKGLFGVECDVTDSDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRM TEEKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQ ANYAASKAGVIGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQ GALQFIPAKRVGTPAEVAGVVSFLASEDASYISGAVIPVDGGMGMGH >MT3606 fabG-3, 3-oxoacyl-(acyl-carrier-protein) reductase MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVN DVASALDASDVVDEIGAAAADAGAKAVAVAGDISQRATADELLASAVGLG GLDIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDK AKDAEGGSVFGRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALG RYGVCANVICPRARTAMTADVFGAAPDVEAGQIDPLSPQHVVSLVQFLAS PAAAEVNGQVFIVYGPQVTLVSPPHMERRFSADGTSLGSHRAHRDAAGLL CWSGSGTELFGDRSDASVTRGYRRPIIGIGVRITTPT >MT1218 mas-1, mycocerosic acid synthase MGFGSIHPRLVQGDCVVRTATATSVAVIGMACRLPGGIDSPQRLWEALLR GDDLVGEIPADRWDANVYYDPEPGVPGRSVSRWGAFLDDVGGFDCDFFGL TEREATAIDPQHRLLLEVSWEAIEHAGVDPATLAESQTGVFVGLTHGDYE LLSADCGAAEGPYGFTGTSNSFASGRVAYTLGLHGPAVTVDTACSSGLTA VHQACRSLDDGESDLALAGGVVVTLEPRKSVSGSLQGMLSPTGRCHAFDE AADGFVSGEGCVVLLLKRLPDAVRDGDRVLAIVRGTAANQDGRTVNIAAP SAQAQIAVYQQALAAAGVEASTVGMVEAHGTGTPVGDPVEYASLAAVYGT EGPCALTSVKTNFGHLQSASGPLGLMKTILALRHGVVPQNLHFCRLPDQL AEIDTELFVPQANTSWPDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEP AASGPELTPEAGGLALFPVSATSAEQLHVTAARLADWVDQNGNAGSRVSM RDLGYTLSCRRAHRPVRTVVTASSFDELSAALRDVAGDQIPYQPAVGHDD RGPVWVFSGQGSQWPGMGTELLVAEPVFAATVAAMEPVIARESGFSVTEA MSAPQTVSGIDRVQPTIFAVQVALAAALKSYGVRPGAIIGHSLGEAAAAV VAGALSLHDGLRVICRRSRLMSRIAGSGAMASVELPGQQVLSELAIRGIS DVVLSVVASPTSTVVGGATQSIRDLVAAWEQQDVLAREVAVDVASHTPQV DPILDELLEVLAEVDPTAPEIPYYSATLWDPRERPSFTGEYWVENLRYTV RFAAAVQAALKDGYRVFGELAPHPLLTYAVEQNAASLDMPIATLAAMRRG EQLPFGLRGFVADVHNAGAKVDFSVQYPDGRLVDAPLPSWTHRTLMLSRE DSHRSHTGAVQAVHPLLGAHVHLLEEPERHVWQAGVGTGAHPWLGDHRIH NVAAFPGAAYCEMALAAARTTLGELSEVRDIKFEQTLLLDEQTVVSSAAT IAAPGILQFAVESHQEGEPARRASAMLHALEEMPQPPGYDTNALTAAHES SMSGEELRKMFNSLGIQYGPAFSGLVAVHTARGDVTTVLAEVALPGAIRS QQSAYASHPALLDACFQSVLVHPEVQKATVGGLMLPVGVRRLRNYHSTRS AHYCLARVTSSSRAGECEADLDVFDQAGTVLLTVEGLRLAAGISEHERAN RVFDERLLTIEWERGELPEVPQIDAGSWLLLSASEADPLTAQLADALNAV GAQSTSVASASDVAQLRSLLGGRLTGVVVVTGPPTGGLTQCGRDYVSQLV GIARELAELPGEPPRLFVVTRSAASVLPSDLANLEQAGLRGLMRVIDSEH PHLGATAIDVDNDETVAALVASQLQSGSQEDETAWRNGIWYTARLRPGPL RPAERRTAVVEYRRDGMRLQIRTPGDLESLEFVTFDRVAPGPGEIEVAVT ASSVNFADVLVAFGRYPTFEGYRQQLGIDFAGVVTAVGPDVTEHRIGDHV GGMSANGCWSTFVRCDARLAVTLPPELPVAAAAAVPTASATAWYALHDLA RICSDDKVLIHSGTGGVGQAAIAIARAAGCEIFATAGSAQRRQLLHDMGV EHVYDSRSTEFAEQIRGDTDGYGVDVVLNSLPGAAQRAGIELLAFGGRFV EIGKRDIYGDTRLGLFPFRRNLSLYAVDLALLTHSHPHTVRRLLKTVYQH TVEGTLPVPQTTHYPIHDAAVAIRLVGGAGHTGKVVLDVPRTGEGVAVVP PEQVRTSRPDGAYLVTGGLGGLGLFLAGELAAAGCGRIVLNSRSTPSPHA TRVIERLRAAGADIQVECGDIADAATAHRVVAVATASGLPVRGVLHAAAV VEDATLANVTDELIDRCWAPKVHGAWNIHRATAAQPLEWFCLFSSAAALV GSPGQGAYAAANSWLDAFAHWRRAQGLPATSIAWGAWAEIGRATALAEGT GAAIAPAEGARAFQTLLRYGRAYSGYAPIMGTPWLTAFAQRSRFAEAFHA TGQNQPATGKFLAELGSLPREEWPRTVRRLVSDQISLLLRRTIDPDRPLS DYGLDSLGNLELRTRIETETGIRVSPTKITTVRGLAEHVCDELAAAQSAP V >MT3010 mas-2, mycocerosic acid synthase MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPDRWDADD YYDPEPGVPGRSVSRWGGFLDDVAGFDAEFFGISEREATSIDPQQRLLLE TSWEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTG LNNSVASGRIAHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLAL AGGCAVLLEPHASVAASAQGMLSSTGRCHSFDADADGFVRSEGCAMVLLK RLPDALRDGNRIFAVVRGTATNQDGRTETLTMPSEDAQVAVYRAALAAAG VQPETVGVVEAHGTGTPIGDPIEYRSLARVYGAGTPCALGSAKSNMGHST ASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFVPQAVTPWP NGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLF MLSSTSSDALRQTARQLATWVEEHQDCVAASDLAYTLARGRAHRPVRTAV VAANLPELVEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQ LLASEPVFAATIAKLEPVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAV QVALAATMEQTYGVRPGAVVGHSMGESAAAVVAGALSLEDAARVICRRSK LMTRIAGAGAMGSVELPAKQVNSELMARGIDDVVVSVVASPQSTVIGGTS DTVRDLIARWEQRDVMAREVAVDVASHSPQVDPILDDLAAALADIAPMTP KVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVFAE LSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGA ALDYSALYPAGRLVDAPLPAWTHARLFIDDDGQEQRAQGACTITVHPLLG SHVRLTEEPERHVWQGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAA AEVFGEAAEVRDITFEQMLLLDEQTPIDAVASIDAPGVVNFTVETNRDGE TTRHATAALRAAEDDCPPPGYDITALLQAHPHAVNGTAMRESFAERGVTL GAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYRIHPALLDACFQS VGAGVQAGTATGGLLLPLGVRSLRAYGPTRNARYCYTRLTKAFNDGTRGG EADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRAL PEVGDGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWS VQDTPPNDQAGLEKLGSQLRGRDGVVIVYGPRVGDPDEHSLLAGREQVRH LVRITRELAEFEGELPRLFVVTRQAQIVKPHDSGERANLEQAGLRGLLRV ISSEHPMLRTTLIDVDEHTDVERVAQQLLSGSEEDETAWRNGDWYVARLT PSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEFVASDRVPPGPGQIE VAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEGVTGHQV GDRVGGFSEGGCWRTFLTCDANLAVTLPPGLTDEQAITAATAHATAWYGL NDLAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLR DMGVEHVYDSRSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFG GRFVEIGKADVYGNTRLGLFPFRRGLTFYYLDLALMSVTQPDRVRELLAT VFKLTADGVLTAPQCTHYPLAEAADAIRAMSNAEHTGKLVLDVPRSGRRS VAVTPEQAPLYRRDGSYIITGGLGGLGLFFASKLAAAGCGRIVLTARSQP NPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATATGLPLRGVLH SAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSG AALLGSPGQGAYAAANSWVDVFAHWRRAQGLPVSAIAWGAWGEVGRATFL AEGGEIMITPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGE MFASTGQRSRGPSKFRMELLSLPQDEWAGRLRRLLVEQASVILRRTIDAD RSFIEYGLDSLGMLEMRTHVETETGIRLTPKVIATNNTARALAQYLADTL AEEQAAAPAAS >MT3933 mas-3, mycocerosic acid synthase MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPELLWKAL LRGDDLITEVPPDRWDCDEFYDPQPGVPGRTVCKWGGFLDNPADFDCEFF GIGEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGD YTMVAADAKQLEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGL TAVHMACRSLHEGESDVALAGGVALMLEPRKAAAGSALGMLSPTGRCRAF DVAADGFVSGEGCAVVVLKRLPDALADGDRILAVIRGTSANQDGHTVNIA TPSQPAQVAAYRAALAAGGVDAATVGMVEAHGPGTPIGDPIEYASVSEVY GVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRNLHFTRLPD EIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQ TEAQPHAASTPPTGTPALFTLSASSADALRQTAQRLTDWIQQHADSLVLS DLAYTLARRRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDR GPVWLFSGQGSQWAAMGADLLTNESVFAATVAELEPLIAAESGFSVTEAM TAPETVTGIDRVQPTIFAMQVALAATMAAYGVRPGAVIGHSMGESAAAVV AGVLSAEDGVRVICRRSKLMATIAGSAAMASVELPALAVQSELTALGIDD VVVAVVTAPQSTVIAGGTESVRKLVDIWERRDVLARAVAVDVASHSPQVD PILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHTVR FSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQ PLPLGLRRLLTDLHNAGAAVDFSVLCPQGRLVDAPLPAWSHRFLFYDREG VDNRSPGGSTVAVHPLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIH NVAALPGAAYCEMALSAARAVLGEQSEVRDMRFEAMLLLDDQTPVSTVAT VTSPGVVDFAVEALQEGVGHHLRRASAVLQQVSGECEPPAYDMASLLEAH PCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATATMLAEVALPGSI RSQQGLYAIHPALLDACFQSVGAHPDSQSVGSGLLVPLGVRRVRAYAPVR TARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHN RVLNERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFRE HSAACTTMRWPLHDDQLAAADQLRDQVGSDEFSGVVVLTGSNTGTPHQGS ADRGAEYVRRLVGIARELSDLPGAVPRMYVVTRGAQRVLADDCVNLEQGG LRGLLRTIGAEHPHLRATQIDVDEQTGVEQLARQLLATSEEDETAWRDNE WYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPGDMQTIELAAFHRVP PGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGVVTAVGP GVTDHKVGDHVGGMSPNGCWGTFVTCDARLAATLPPGLGDAQAAAVTTAH ATAWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTP QRRELLRNMGIEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAG LKLLAFRGRFVEIGKRDIYGDTKLGLFPFRRNLSFYAVDLGLLSATHPEE LRDLLGTVYRLTAAGELPMPQSTHYPLVEAATAIRVMGNAEHTGKLVLHI PQTGKSLVTLPPEQAQVFRPDGSYIITGGLGGLGLFLAEKMAAAGCGRIV LNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAERLVATAVATGL PVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDW FCLFSSAAALTGSPGQSAYSAANSWLDAFAHWRQAQGLPATAIAWGAWSD IGQLGWWSASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPV IGAPWLVAFAERSRFFEVFSSSNGSGTSKFRVELNELPRDEWPARLRQLV AEQVSLILRRTVDPDRPLPEYGLDSLGALELRTRIETETGIRLAPKNVSA TVRGLADHLYEQLAPDDAPAAALSSQ >MT0178 mce-1, virulence factor MSFGPSWRPSSSLRSSWSATATTGTPPVEAPSVSARPSADRCVSRWSRCR SLSCLQRWRSTVSTRTSISRCSRMTTPGKLNKARVPPYKTAGLGLVLVFA LVVALVYLQFRGEFTPKTQLTMLSARAGLVMDPGSKVTYNGVEIGRVDTI SEVTRDGESAAKFILDVDPRYIHLIPANVNADIKATTVFGGKYVSLTTPK NPTKRRITPKDVIDVRSVTTEINTLFQTLTSIAEKVDPVKLNLTLSAAAE ALTGLGDKFGESIVNANTVLDDLNSRMPQSRHDIQQLAALGDVYADAAPD LFDFLDSSVTTARTINAQQAELDSALLAAAGFGNTTADVFDRGGPYLQRG VADLVPTATLLDTYSPELFCTIRNFYDADPLAKAASGGGNGYSLRTNSEI LSGIGISLLSPLALATNGAAIGIGLVAGLIAPPLAVAANLAGALPGIVGG APNPYTYPENLPRVNARGGPGGAPGCWQPITRDLWPAPYLVMDTGASLAP YNHMEVGSPYAVEYVWGRQVGDNTINP >MT0618 mce-2, virulence factor MPTLVTRKNRRAWLYVEGVVLLLVGALVLVLVYKQFRGEFTPKTELTMVA SRAGLVMEAGSKVTYNGVEIGRVGSISEIERDGRPAAKLVLDVNPRYISL IPVNVVADIEAATLFGNKYVALSAPKIPQQQRISSHDVIDVGSVTTEFNT LFETITSIAEKVDPIELNATLSAVAQALDGLGGKFGESIVNGNQILAQLN PRLPQLGYDVRRLADLGEVYVDASPDLWSFLQNALTTARTLTSQQRDLDA ALLAATGAGNTGEDVFARGGPYLARAAADLVPTATLLDTYSPELFCMIRN FHDAAPKVADAVGGNGYSLAAAGTILGAPNPYVYPDNLPRVNAHGGPGGR PGCWQTITRELWPAPYLVMDTGASLAPYNHVELGQPMFTEYVWGRQYGEN TINP >MT2018 mce-3, virulence factor MRRGPGRHRLHDAWWTLILFAVIGVAVLVTAVSFTGSLRSTVPVTLAADR SGLVMDSGAKVMMRGVQVGRVAQIGRIEWAQNGASLRLEIDPDQIRYIPA NVEAQISATTAFGAKFVDLVMPQNPSRARLSAGAVLHSKNVSTEINTVFE NVVDLLNMIDPLKLNAVLTAVADAVRGQGERIGQATTDLNEVLEALNARG DTIGGNWRSLKNFTDTYDAAAQDILTILNAASTTSATVVNHSTQLDALLL NAIGLSNAGTNLLGSSRDNLVGAADILAPTTSLLFKYNPEYTCFLQGAKW YLDNGGYAAWGGADGRTLQLDVALLFGNDPYVYPDNLPVVAAKGGPGGRP GCGPLPDATHNFPVRQLVTNTGWGTGLDIRPNPGIGHPCWANYFPVTRAV PEPPSIRQCIPGPAIGPNPAAGEQP >MT0567 menE, o-succinylbenzoate--CoA ligase MLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTGPPKGAM LTAAALTASASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVE LNVSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTDPAATAALAELD AVLIGGGPAPRPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLR VLAGGRIAIGGATLAKGYRNPVSPDPFAEPGWFHTDDLGALESGDSGVLT VLGRADEAISTGGFTVLPQPVEAALGTHPAVRDCAVFGLADDRLGQRVVA AIVVGDGCPPPTLEALRAHVARTLDVTAAPRELHVVNVLPRRGIGKVDRA ALVRRFAGEADQ >MT3640 mhpD, 2-keto-4-pentenoate hydratase MLRDATRDELAADLAQAERSRDPIGQLTAAHPEIDVVDAYEIQLINIRQR VAEGARVVGHKVGLSSPIMQQMMGVDEPDYGHLLDDMQVFEDTPVQASRY LSPRVEVEVGFILAADLPGAGCTEDDVLAATEALVPAIELIDTRIKDWQI KICDTIADNASAAGFVLGAARVPPADLDVRAIDAKLTRNGEVVAEGRSDA VLGNPATAVAWLAGKVESFGVRLRKGDIVLPGSCTFAVEARAGDEFVADF TGLGLVRLSFE >MT3639 mhpF, acetaldehyde dehydrogenase (acetylating) MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGLARAAKL GLETTHEGVDWLLAQPDKPDLVFEATSAYVHRDAAPKYAEAGIRAIDLTP AAVGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEI VASVASVSAGPGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPM IMRDTIFCAIPTDADREAIAASIHDVVKEVQTYVPGYRLLNEPQFDEPSI NSGGQALVTTFVEVEGAGDYLPPYAGNLDIMTAAATKVGEEIAKETLVVG GAR >MT3671 nhoA, N-hydroxyarylamine O-acetyltransferase MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPLLGVPVD DLSPQALADKLVLRRRGGYCFEHNGLMGYVLAELGYRVRRFAARVVWKLA PDAPLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTT HEPYRLEDRVDGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHP ASKFVTGLTAAVITDDARWNLSGRDLAVHRAGGTEKIRLADAAAVVDTLS ERFGINVADIGERGALETRIDELLARQPGADAP >MT2451 pchE, dihydroaeruginoic acid synthetase MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMSLVGRWR RKGIAVDFATLAATPTIEAWSQLVSAGTGVAPTAVAAPGDAGLSQEGEPF PVAPMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALR HPMLRVQFLPDGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDA KSHQQLDGAVFELALTLLPGERTRLHVDLDMQAADAMSYRILLADLAALY DGREPPALGYTYREYRQAIEAEETLPQPVRDADRDWWAQRIPQLPDPPAL PTRAGGERDRRRSTRRWHWLDPQTRDALFARARARGITPAMTLAAAFANV LARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVDLTGARTAA ARAQAVQEALRTAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGD LFCPDVTEQFGTPGWIISQGPQVLLDAQVTEFDGGVLVNWDVREGVFAPG VIDAMFTHQVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPS TEALHDGFFRQAQQQPDAPAVFASSGDLSYAQLRDQASAVAAALRAAGLR VGDTVAVLGPKTGEQVAAVLGILAAGGVYLPIGVDQPRDRAERILATGSV NLALVCGPPCQVRVPVPTLLLADVLAAAPAEFVPGPSDPTALAYVLFTSG STGEPKGVEVAHDAAMNTVETFIRHFELGAADRWLALATLECDMSVLDIF AALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEVGG GRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFE VQDAANLPPDWASVPYGVPFPNNACRVVADSGDDCPDWVAGELWVSGRGI ARGYRGRPELTAERFVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKI SGYRVELGEIEAALQRLPGVHAAAATVLPGGSDVLAAAVCVDDAGVTAES IRQQLADLVPAHMIPRHVTLLDRIPFTDSGKIDRAEVGALLAAEVERSGD RSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFFALGGDSVLATQV VAGIRRWLDSPSLMVADMFAARTIAALAQLLTGREANADRLELVAEVYLE IANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAA AAYRWLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDW HLTAPLTLFGHCMGAIVAFEFARLAERNGVPVRALWASSGQAPSTVAASG PLPTADRDVLADMVDLGGTDPVLLEDEEFVELLVPAVKADYRALSGYSCP PDVRIRANIHAVGGNRDHRISREMLTSWETHTSGRFTLSHFDGGHFYLND HLDAVARMVSADVR >MT2103 pncA, pyrazinamidase/nicotinamidase MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHVVATKDF HIDPGDHFSGTPDYSSSWPPHCVSGTPGADFHPSLDTSAIEAVFYKGAYT GAYSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNG LATRVLVDLTAGVSADTTVAALEEMRTASVELVCSS >MT0593 tcmO, tetracenpmycin polyketide synthesis 8-o-methyltransferase MVELSPDRIMAIGGGYGPSKVLLTAVGLGLFTELGDEAMTAEAIADRLGL LKRPAIDFLDALVSLDLLARDGDGPGSHYRNTPETAHFLDEARPTYAGGL LKIWNERNYRFWADLTEALKTGKAQSEVKQTGRPFFEALYADPRRLEAFM AAMDAASRRNIELLAKRFPFERYRRLCDVGCADGLLSRIVAAAHPHLQCV SFDLPAVTEIARRKLTAEGLGERVQACAGDFLADPLPAADVITMGQILHD WNLDRKQQLVAKAYEALSKEGAFIVIETLIDDARRENTTGLMMSLNMLIE FGDAFDYSAADFRGWCGEAGFRSFEVIPLAGGSSAAVAYK