TitleGenColors Logo

Gene list

Applied filters:

COG category: Secondary metabolites biosynthesis, transport and catabolism
Gene type: CDS
Genomic element: chromosome

Number of genes found: 276

Free access
Sort by:

 



# Mycobacterium tuberculosis H37Rv, H37Rv

>Rv2675c CONSERVED HYPOTHETICAL PROTEIN
MTAQFDPADPTRFEEMYRDDRVAHGLPAATPWDIGGPQPVVQQLVALGAI
RGEVLDPGTGPGHHAIYYAAKGYAATGIDGSVAAIERARDNARKAGVSVN
FQVGDATTLDGLDGRFDTVVDCAFYHTFSTAPELQRCYVRALRRASKPGA
RLYMFEFGEHNVNGFSMPRSLSEDDFRQVLPVGGWEITYLGTTTYQVNLS
VEALELMAARNPDMADQVRCVLERFRAIKPWLVGGRVHAPFWEVHATRVD
>Rv1856c POSSIBLE OXIDOREDUCTASE
MAVEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAKELDVDA
VVCDTTDPTSLTEARGLFPRHLDTIVNVPAPSWDAGDPRAYSVSDTANAW
RNALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALS
NWIAGQAAVFGTRGITINTVACGRSVQTGYEGLSRTPAPVAAEIARLALF
LTTPAARHITGQTLHVSHGALAHFG
>Rv2073c Probable shortchain dehydrogenase
MDDTGAAPVVIFGGRSQIGGELARRLAAGATMVLAARNADQLADQAAALR
AAGAIAVHTREFDADDLAAHGPLVASLVAEHGPIGTAVLAFGILGDQARA
ETDAAHAVAIVHTDYVAQVSLLTHLAAAMRTAGRGSLVVFSSVAGIRVRR
ANYVYGSAKAGLDGFASGLADALHGTGVRLLIARPGFVIGRMTEGMTPAP
LSVTPERVAAATARALVNGKRVVWIPWALRPMFVALRLLPRFVWRRMPR
>Rv3829c PROBABLE DEHYDROGENASE
MTGYDAIVIGAGHNGLTAAVLLQRAGLRTACLDAKRYAGGMASTVELFDG
YRFEIAGSVQFPTSSAVSSELGLDSLPTVDLEVMSVALRGVGDDPVVQFT
DPTKMLTHLHRVHGADAVTGMAGLLAWSQAPTRALGRFEAGTLPKSFDEM
YACATNEFERSAIDDMLFGSVTDVLDRHFPDREKHGALRGSMTVLAVNTL
YRGPATPGSAAALAFGLGVPEGDFVRWKKLRGGIGALTTHLSQLLERTGG
EVRLRSKVTEIVVDNSRSSARVRGVRTAAGDTLTSPIVVSAIAPDVTINE
LIDPAVLPSEIRDRYLRIDHRGSYLQMHFALAQPPAFAAPYQALNDPSMQ
ASMGIFCTPEQVQQQWEDCRRGIVPADPTVVLQIPSLHDPSLAPAGKQAA
SAFAMWFPIEGGSKYGGYGRAKVEMGQNVIDKITRLAPNFKGSILRYTTF
TPKHMGVMFGAPGGDYCHALLHSDQIGPNRPGPKGFIGQPIPIAGLYLGS
AGCHGGPGITFIPGYNAARQALADRRAANCCVLSGR
>Rv3085 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLAKTVRLA
QALGAQVKSDRLDVAEREAVLAHADAVVAHFGTVHQVYNNAGIAYNGNVD
KSEFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVP
GQSAYNAAKFAVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATV
ADGEDQQTFAEFFDRRLALHSPEMAAKTIVNGVAKGQARVVVGLEAKAVD
VLARIMGSSYQRLVAAGVAKFFPWAK
>Rv0221 CONSERVED HYPOTHETICAL PROTEIN
MKRLSGWDAVLLYSETPNVHMHTLKVAVIELDSDRQEFGVDAFREVIAGR
LHKLEPLGYQLVDVPLKFHHPMWREHCQVDLNYHIRPWRLRAPGGRRELD
EAVGEIASTPLNRDHPLWEMYFVEGLANHRIAVVAKIHHALADGVASANM
MARGMDLLPGPEVGRYVPDPAPTKRQLLSAAFIDHLRHLGRIPATIRYTT
QGLGRVRRSSRKLSPALTMPFTPPPTFMNHRLTPERRFATATLALIDVKA
TAKLLGATINDMVLAMSTGALRTLLLRYDGKAEPLLASVPVSYDFSPERI
SGNRFTGMLVALPADSDDPLQRVRVCHENAVSAKESHQLLGPELISRWAA
YWPPAGAEALFRWLSERDGQNKVLNLNISNVPGPRERGRVGAALVTEIYS
VGPLTAGSGLNITVWSYVDQLNISVLTDGSTVQDPHEVTAGMIADFIEIR
RAAGLSVELTVVESAMAQA
>Rv1928c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MSVLDLFDLHGKRALITGASTGIGKRVALAYVEAGAQVAIAARHLDALEK
LADEIGTSGGKVVPVCCDVSQHQQVTSMLDQVTAELGGIDIAVCNAGIIT
VTPMLDMPLEEFQRLQNTNVTGVFLTAQAAAKAMVKQGQGGVIINTASMS
GHIINVPQQVSHYCASKAAVIHLTKAMAVELAPHKIRVNSVSPGYILTEL
VEPYTEYQPLWEPKIPLGRLGRPEELAGLYLYLASEASSYMTGSDIVIDG
GYTCP
>Rv2750 PROBABLE DEHYDROGENASE
MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQIASVPYP
LSTADDLAATVELVEDAGGGIVARQGDVRDRASLSVALQAGLDEFGRLDI
VVANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVL
ISSAAGLVGIGSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHP
CGVDTPMINNEFFQQWLTTADMDAPHNLGNALPVELVQPTDIANAVAWLA
SEEARYVTGVTLPVDAGFVNKR
>Rv1513 CONSERVED HYPOTHETICAL PROTEIN
MRLARRARNILRRNGIEVSRYFAELDWERNFLRQLQSHRVSAVLDVGANS
GQYARGLRGAGFAGRIVSFEPLPGPFAVLQRSASTDPLWECRRCALGDVD
GTISINVAGNEGASSSVLPMLKRHQDAFPPANYVGAQRVPIHRLDSVAAD
VLRPNDIAFLKIDVQGFEKQVIAGGDSTVHDRCVGMQLELSFQPLYEGGM
LIREALDLVDSLGFTLSGLQPGFTDPRNGRMLQADGIFFRGSD
>Rv3548c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MGLVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDIGVGLDGSPASGG
SAAQDVVDEILAAGGQAVADGSDISDWDQAANLIQAAVETYGGVDVLVNN
AGIVRDRMIANTSEEEFDAVIAVHLKGHFATMRHAASHWRGLSKAGKAPK
DIDARIINTSSGAGLQGSVGQGNYSAAKAGIAALTLVGAAEMRRYGVTVN
AIAPAARTRMTETVFAEMMAKPQEGFDAMAPENVSPLVVWLGSAESRDVT
GKVFEVEGGIIRVAEGWAHGPQVDKGVKWDPAELGPVVSDLLAKSRPPVP
VYGA
>Rv0314c POSSIBLE CONSERVED MEMBRANE PROTEIN
MIVVWEHLCMNPEDDPEARIRELERPLADVARASELGGSQSGGYTYPPGP
PPPPYSYGGPFGGPSPRSSSGNRAWWILAAVVVVGVLVLVGGIAAFSAQR
LSQGNFVVLSPTPSVSRAVPTPTAQPATTLPPAGASLSVSGVNVNRTIAC
NDSIVSVSGMSNTVVITGHCTSLTVSGMRNSVTVDSVDTIEAAGFNNEVT
YHSGSPKISNAGGSNSVQQG
>Rv1501 CONSERVED HYPOTHETICAL PROTEIN
MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMYRVQERI
LTEIGKERLARAGELGVLRLMMKYDPHFFTFLEIPEVLSIVDRVLSETAI
LHLQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFT
RDTGATLVVPGSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAG
RNTSGKDRLAINHQFTRSFFKQQIDYVRALGDAVVLEQPARTQQLLGWYS
RVVTNLDEYYQPPDKRLYRKGQG
>Rv0851c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MDGFPGRGAVITGGASGIGLATGTEFARRGARVVLGDVDKPGLRQAVNHL
RAEGFDVHSVMCDVRHREEVTHLADEAFRLLGHVDVVFSNAGIVVGGPIV
EMTHDDWRWVIDVDLWGSIHTVEAFLPRLLEQGTGGHVVFTASFAGLVPN
AGLGAYGVAKYGVVGLAETLAREVTADGIGVSVLCPMVVETNLVANSERI
RGAACAQSSTTGSPGPLPLQDDNLGVDDIAQLTADAILANRLYVLPHAAS
RASIRRRFERIDRTFDEQAAEGWRH
>Rv1523 Probable methyltransferase
MTITALTVTLPLLWRRLTTAGVKYADQGHFVGSAGVPAADAGGRDAASEQ
IARWTQTCTVVLVCGHGPAKWAFRSWCTSRSCDTLPVALRYRLQSNPLVG
KLTTKYFLPLGTRQVGDHVVFFNFGYEEDPPMALPLSESDEPNRYCIQLY
HQTASQVDLTGKEVLEVSCGAGGGASYIARNLGPASYTGLDLNPASIDLC
RAKHRLPGLQFVQGDAQNLPFPDESFDAVVNVEASHQYPDFRGFLAEVAR
VLRPGGHFLYTDSRRNPVVAEWEAALADAPLRTISQRDIGAQAKRGLDAN
TARSQEAIGRRAPVLLAGLTRCAVRVLDWDLRRGGGFSYRIYLFAKD
>Rv3535c PROBABLE ACETALDEHYDE DEHYDROGENASE (ACETALDEHYDE DEHYDROGENASE [ACETYLATING])
MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGLARAAKL
GLETTHEGVDWLLAQPDKPDLVFEATSAYVHRDAAPKYAEAGIRAIDLTP
AAVGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEI
VASVASVSAGPGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPM
IMRDTIFCAIPTDADREAIAASIHDVVKEVQTYVPGYRLLNEPQFDEPSI
NSGGQALVTTFVEVEGAGDYLPPYAGNLDIMTAAATKVGEEIAKETLVVG
GAR
>Rv2913c POSSIBLE D-AMINO ACID AMINOHYDROLASE (D-AMINO ACID HYDROLASE)
MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGVVATVAA
GALDETGCPEVVDAAGKWVVPGFIDVHTHYDAEVLLDPGLRESVRHGVTT
VLLGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYI
EAIDALPLGPNVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLL
DEALEAGMLGMSGMDAAIDKLDGDRFRSRALPSTFATWRERRKLISVLRH
RGRILQSAPDVDNPVSALLFFLASSRIFNRRKGVRMSMLVSADAKSMPLA
VHVFGLGTRVLNKLLGSQVRFQHLPVPFELYSDGIDLPVFEEFGAGTAAL
HLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDAVIVECPDK
SLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNK
LAAEPSVHMGFSDAGAHLRNMAFYNFGLRLLKRARDADRAGQPFLSIERA
VYRLTGELAEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYY
GGLRRMVNRNDATVVATGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRA
GELGAALSRSA
>Rv0146 CONSERVED HYPOTHETICAL PROTEIN
MRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLLVTNAGA
GAIWEAMLDPTLVAKAAAIDAETAAIVAYLRSYQAVRTNFFDTYFASAVA
AGIRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAENGVT
PSAGRREVPADLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQDRL
FTQVGAVSVAGSRIAAETAPVHGEERRAEMRARFKKVADVLGIEQTIDVQ
ELVYHDQDRASVADWLTDHGWRARSQRAPDEMRRVGRWVEGVPMADDPTA
FAEFVTAERL
>Rv2954c HYPOTHETICAL PROTEIN
MRLPGMLRPTAERHFHSIFYLRHNARRQEHLATLGLDLGNKSVLEVGAGI
GDHTQFFLDRGCKVLCTEPRGENLDVIRQRFGSNPNVTVDHLDLDGDLPA
EAHQYDVVYCYGVLYHLSRPAEALAWMCDRAVDLLLLETCVSYSGEDEPF
LVSERASSPSQAITGTGCRPSRVWVMNRLREKMPHVYVTATQPRHRQFPL
DWRANGPIASTGLARAVFVASRAPLNLPTLVEELPMVQRRC
>Rv1245c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MEGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLADTEHRL
KAISTPVKTDRLDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIE
VSQFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVINISSVFGLFSAP
GQAAYNSAKFAVRGFTEALRQEMALAGHPVKVTTVHPGGVKTAIARNATA
AEGLDQAELAETFDKRVAHLSPQRAAQIILTGVAKNKARVLVGVDAKVLD
LVVRLTGSGYQRIFPIITGRLIPRPR
>Rv0895 CONSERVED HYPOTHETICAL PROTEIN
MRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAADALFLWAETPTRPLH
VGALAVLSQPDNGTGRYLRKVFSAAVARQQVAPWWRRRPHRSLTSLGQWS
WRTETEVDLDYHVRLSALPPRAGTAELWALVSELHAGMLDRSRPLWQVDL
IEGLPGGRCAVYVKVHHALADGVSVMRLLQRIVTADPHQRQMPTLWEVPA
QASVAKHTAPRGSSRPLTLAKGVLGQARGVPGMVRVVADTTWRAAQCRSG
PLTLAAPHTPLNEPIAGARSVAGCSFPIERLRQVAEHADATINDVVLAMC
GGALRAYLISRGALPGAPLIAMVPVSLRDTAVIDVFGQGPGNKIGTLMCS
LATHLASPVERLSAIRASMRDGKAAIAGRSRNQALAMSALGAAPLALAMA
LGRVPAPLRPPNVTISNVPGPQGALYWNGARLDALYLLSAPVDGAALNIT
CSGTNEQITFGLTGCRRAVPALSILTDQLAHELELLVGVSEAGPGTRLRR
IAGRR
>Rv1432 PROBABLE DEHYDROGENASE
MTTAVVVGAGPNGLAAAIHLARHGVDVQVLEARDTIGGGARSGELTVPGV
IHDHCSAFHPLGVGSPFWAAIDLQRYGLTWKWPDVDCAHPLDDGTAGVLY
RSIEATAAGLGPDGKRWQRAVGDLAAGFDELAEDLLRPVLNMPRHPIRLA
RFGPRAALPATAMARRFHTERARALFGGAAAHVYTRLDRPLTASLGLMIL
ASGHRHGWPVARGGSGSITKALAAALDAYGGTVATGVTVTSRRDIPDADI
VMLDLSPAAVLGIYGDVMPTRINRSYRRYRAGSSAFKVDFAIEGDVGWTN
PDCRRAGTVHLGGTFAEIADTERQRAQGTMVQRPFVLVGQQYLADPSRSV
GNINPIWAYAHVPFGYTGDATAAVIDQIERFAPGFRDRIVATVSTSTTEL
QTYNRNFIGGDIIGGANDRLQVIFRPRVAVDPYAIGVPGVYLCSQSAPPG
AGIHGLCGYHAAESALRWLRKRR
>Rv3536c PROBABLE HYDRATASE
MLRDATRDELAADLAQAERSRDPIGQLTAAHPEIDVVDAYEIQLINIRQR
VAEGARVVGHKVGLSSPIMQQMMGVDEPDYGHLLDDMQVFEDTPVQASRY
LSPRVEVEVGFILAADLPGAGCTEDDVLAATEALVPAIELIDTRIKDWQI
KICDTIADNASAAGFVLGAARVPPADLDVRAIDAKLTRNGEVVAEGRSDA
VLGNPATAVAWLAGKVESFGVRLRKGDIVLPGSCTFAVEARAGDEFVADF
TGLGLVRLSFE
>Rv2766c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVVLTARRQEAADEAAA
QVGDRALGVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLL
EQDHARFAKIFDVNLWAPLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSP
AMGMYNATKAALIHVTKQLALELSPRIRVNAICPGVVRTRLAEALWKDHE
DPLAATIALGRIGEPADIASAVAFLVSDAASWITGETMIIDGGLLLGNAL
GFRAAPSTEH
>Rv3322c POSSIBLE METHYLTRANSFERASE
MSVQTDPALREHPNRVDWNARYERAGSAHAPFAPVPWLADVLRAGVPDGP
VLELASGRSGTALALAAHGRQVTAIDVSDVALLQLDSEAVRRGVADRLNL
VQADLGCWEPGETRFALVLSRLFWDAAIFHRACEAVMPGGVLAWESLALS
GAEAGTASAKRRVKPGEPACLLPADFTVVHEGQGNCDSAPSRIMIARRSP
LPGA
>Rv2956 CONSERVED HYPOTHETICAL PROTEIN
MKSLKLARFIARSAAFEVSRRYSERDLKHQFVKQLKSRRVDVVFDVGANS
GQYAAGLRRAAYKGRIVSFEPLSGPFTILESKASTDPLWDCRQHALGDSD
GTVTINIAGNAGQSSSVLPMLKSHQNAFPPANYVGTQEASIHRLDSVAPE
FLGMNGVAFLKVDVQGFEKQVLAGGKSTIDDHCVGMQLELSFLPLYEGGM
LIPEALDLVYSLGFTLTGLLPCFIDANNGRMLQADGIFFREDD
>Rv1896c CONSERVED HYPOTHETICAL PROTEIN
MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLVQDEYAK
HFITASADPYLEGLLANPRTSEDGTAFPRLYGVQTRFFDDFFNCADEAGI
RQAVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKA
HRVAVPADLRTDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFAR
IDELCAPGSRVALGALGSRLDHEQLAALETAHPGVNMSGDVNFSALTYDD
KTDPVEWLVEHGWAVDPVRSTLELQVGYGLTPPDVDVKIDSFMRSQYITA
VRA
>Rv0325 HYPOTHETICAL PROTEIN
MGPKGSLRLVKRQPELLVAQHEHWQDTYRAHPVLYGTRPSEPGVYAAEVF
NADGVQRVLELAAGHGRDTLYFAG
>Rv2952 POSSIBLE METHYLTRANSFERASE (METHYLASE)
MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINWAYEEDP
PMDLPLEASDEPNRAHINLYHRTATQVDLGGKQVLEVSCGHGGGASYLTR
TLHPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVL
NVEASHCYPHFRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATP
LRQLSQRQINAEVLRGIGNNSQKSRDLVDRHLPAFLRFAGREFIGVQGTQ
LSRYLEGGELSYRMYCFTKD
>Rv2258c Possible transcriptional regulatory protein
MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLPPATSME
IAEAAGLEERYVREWLGGMTTGQIVEYDAGSSTYSLPAHRAGMLTRAAGP
DNLAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVF
DAALIDVVLPLVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTG
IDFSDEAVAAGTEEAARLGLANATFERHDLAELDKVGAYDVITVFDAIHD
QAQPARVLQNIYRALRPGGVLLMVDIKASSQLEDNVGVPLSTYLYTTSLM
HCMTVSLALDGAGLGTVWGRQLATSMLADAGFTDVTVAEIESDVLNNYYI
ARK
>Rv3120 CONSERVED HYPOTHETICAL PROTEIN
MSPSPSALLADHPDRIRWNAKYECADPTEAVFAPISWLGDVLQFGVPEGP
VLELACGRSGTALGLAAAGRCVTAIDVSDTALVQLELEATRRELADRLTL
VHADLCSWQSGDGRFALVLCRLFWHPPTFRQACEAVAPGGVVAWEAWRRP
IDVARDTRRAEWCLKPGQPESELPAGFTVIRVVDTDGSEPSRRIIAQRSL
>Rv2129c Probable oxidoreductase
MTSLQGKVVFITGAARGIGAEVARRLHNKGAKLVLTDLSKSELAVMGAEL
GGDDRLLTVVADVRDLPAMQAAAETAVERFGGIDVVVANAGIASYGSVLK
VDPQAFRRVLDVNLLGNFHTVRATLPALIDRRGYVLIVSSLAAFAAPPGM
APYNMSKAGNEHFANALRLEVAHLGVSVGSAHMSWIDTALVRDTKADLPA
FAELLARLPWPLNKTTSVNKCAAAFVNGIEGRKDRVYCPGWVALFRWLKP
LLSTRVGQRPIRNTVAKLMPQMDAEVAALGRFASAYTESLENS
>Rv1865c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE
MPGRTSIGVKIRDKVQDKVIAITGGARGIGLATAAALHNLGAKVAIGDID
EAMAKESGADLDLDMYGKLDVTDPDSFSGFLDAVERQLGPIDVLVNNAGI
MPVGRIVDEPDPVTRRILDINVYGVILGSKLAAQRMVPRGRGHVINVASL
AGEIYAVGVATYCASKHAVVAFTDSARLEYRSAGVKFSMVLPSFVNTELI
AGTGGIKGFKNAEPADIADAIVGLIVHPKPRVRVTKAAGSMIVAQRFMPR
QVSEGLNRLLGGEHVFTDDVDMEKRRTYEARARGEE
>Rv0769 PROBABLE DEHYDROGENASE/REDUCTASE
MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAVAKQIVA
DGGTAIHVPVDVSDEDSAKAMVDRAVGAFGGIDYLVNNAAIYGGMKLDLL
LTVPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYS
NFYGLAKVGVNGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELV
KNMVQTIPLSRMGTPEDLVGMCLFLLSDSASWITGQIFNVDGGQIIRS
>Rv1144 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVGGLGDRA
RFAQADVTDEAAVSNALELADSLGPVRVVVNCAGTGNAIRVLSRDGVFPL
AAFRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQ
IGQAAYSASKGGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAE
AKASLGQQVPHPSRLGNPDEYGALVLHIIENPMLNGEVIRLDGAIRMAPR
>Rv3038c CONSERVED HYPOTHETICAL PROTEIN
MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENYDEKWSI
SYDQRCVDYARGRFDAIVPDEVIAQLPYDRALELGCGTGFFLLNLIQAGV
ARRGSVTDLSPGMVKVATRNGQALGLDIDGRVADAEGIPYDDDAFDLVVG
HAVLHHIPDVELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWR
VVTNATKLPGLRGWRRPQGELDESSRAAALEALVDLHTFTPQDLQRIAHN
AGAVEVQTATEEFTAAMLGWPLRTFECTVPPGRLGWGWARFAFTSWKTLG
WVDANVWRHVVPKGWFYNVMITGVKPS
>Rv0731c CONSERVED HYPOTHETICAL PROTEIN
MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVNDQFAEPL
VRAVGVDFFVRMASGELDPDELAEDEANGLRRFADAMAIRTHYFDNFFLD
ATRAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGL
GAAPTTDRRTVAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQ
DRLLDQITAQSVPGSQFATEVLRDINRLNEEELRGRMRRLAERFRRHGLD
LDMSGLVYFGDRTDARTYLADHGWRTASASTTDLLAEHGLPPIDGDDAPF
GEVIYVSAELKQKHQDTR
>Rv1344 PROBABLE ACYL CARRIER PROTEIN (ACP)
MWRYPLSTRLALPNTPGVASFAMTSSPSTVSTTLLSILRDDLNIDLTRVT
PDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIA
AKYRDE
>Rv2740 CONSERVED HYPOTHETICAL PROTEIN
MAELTETSPETPETTEAIRAVEAFLNALQNEDFDTVDAALGDDLVYENVG
FSRIRGGRRTATLLRRMQGRVGFEVKIHRIGADGAAVLTERTDALIIGPL
RVQFWVCGVFEVDDGRITLWRDYFDVYDMFKGLLRGLVALVVPSLKATL
>Rv3502c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVN
DVASALDASDVVDEIGAAAADAGAKAVAVAGDISQRATADELLASAVGLG
GLDIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDK
AKDAEGGSVFGRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALG
RYGVCANVICPRARTAMTADVFGAAPDVEAGQIDPLSPQHVVSLVQFLAS
PAAAEVNGQVFIVYGPQVTLVSPPHMERRFSADGTSWDPTELTATLRDYF
AGRDPEQSFSATDLMRQ
>Rv2915c CONSERVED HYPOTHETICAL PROTEIN
MKRVDTIRPRSRAVRLHVRGLGLPDETAIQLWIVDGRISTEPVAGADTVF
DGGWILPGLVDAHCHVGLGKHGNVELDEAIAQAETERDVGALLLRDCGSP
TDTRGLDDHEDLPRIIRAGRHLARPKRYIAGFAVELEDESQLPAAVAEQA
RRGDGWVKLVGDWIDRQIGDLAPLWSDDVLKAAIDTAHAQGARVTAHVFS
EDALPGLINAGIDCIEHGTGLTDDTIALMLEHGTALVPTLINLENFPGIA
DAAGRYPTYAAHMRDLYARGYGRVAAAREAGVPVYAGTDAGSTIEHGRIA
DEVAALQRIGMTAHEALGAACWDARRWLGRPGLDDRASADLLCYAQDPRQ
GPGVLQHPDLVILRGRTFGP
>Rv2997 POSSIBLE ALANINE RICH DEHYDROGENASE
MDVTVVGSGPNGLATAVICARAGLNVQVVEAQATFGGGARSAADFEFPEV
LHDVCSAVHPLALASPFFAEFDLPARGVTLTVPDIAYANPLPGRPAAIAY
HDLAHTCAKLDDGASWRRLLGPLVAHSETVVEFMLSDKRSLPTALGSVLR
LGLRMLAQGTPAWRSLAGEDARALFTGVAAHAISPLPSLVSAGAGLMLAT
LAHSVGWPIPVGGTQAIADALIADLRAHGGRLAAGVEITEPQRSVVVFDT
APTALLRVYRDKLPHRYAKALRRYRFRAGIAKVDFVLSDEIPWSDPRLRR
AATLHLGGTRDQMARAEADVAAGRHADWPMVLAACPHVADPGRIDETGRR
PFWTYAHVPSGSTLDATETVTSVLERFAPGFRDIVVAARAVPAARMADHN
ANYVGGDITVGANSTWRAIAGPTPRLNPWRTPIPKVYLCSAATPPGAGVH
GMCGWYAARTLLRTEFGITRMPPLGHELRP
>Rv1405c PUTATIVE METHYLTRANSFERASE
MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAAAGIGPG
VRVLDVAAGSGNISLPAAKTGATVISTDLTPELLQRSQARAAQQGLTLQY
QEANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVI
SWTCEGFFGRMLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLK
TARGLLEVKRFDTAQAVHDYFKNNYGPTIEAYAHIGDNAVLAAELDRQLV
ELAAQYLSDGVMEWEYLLLTAEKR
>Rv2466c CONSERVED HYPOTHETICAL PROTEIN
MLEKAPQKSVADFWFDPLCPWCWITSRWILEVAKVRDIEVNFHVMSLAIL
NENRDDLPEQYREGMARAWGPVRVAIAAEQAHGAKVLDPLYTAMGNRIHN
QGNHELDEVITQSLADAGLPAELAKAATSDAYDNALRKSHHAGMDAVGED
VGTPTIHVNGVAFFGPVLSKIPRGEEAGKLWDASVTFASYPHFFELKRTR
TEPPQFD
>Rv0089 POSSIBLE METHYLTRANSFERASE/METHYLASE
MDQPWNANIHYDALLDAMVPLGTQCVLDVGCGDGLLAARLARRIPYVTAV
DIDAPVLRRAQTRFANAPIRWLHADIMTAELPNAGFDAVVSNAALHHIED
TRTALSRLGGLVTPGGTLAVVTFVTPSLRNGLWHLTSWVACGMANRVKGK
WEHSAPIKWPPPQTLHELRSHVRALLPGACIRRLLYGRVLVTWRAPV
>Rv2622 POSSIBLE METHYLTRANSFERASE (METHYLASE)
MANKRGNAGQPLPLSDRDDDHMQGHWLLARLGKRVLRPGGVELTRTLLAR
AEVTDADVLELAPGLGRTAAEILARNPRSYVGAESDPNAANLVRHVLAGR
GDVRVTDAADTGLSDASADVVIGEAMLTMQGNAAKHTIVAEAARVLRPGG
RYAIHELALVPDDVAEQVRTDLRQSLARALKVNARPLTVAEWSHLLAGHG
LVVEHVVTASMALLQPRRVIADEGLLGALRFAGNLLIHRAARRRVLLMRH
TFRRHRERLTAVAIVAHKPHVDS
>Rv1194c CONSERVED HYPOTHETICAL PROTEIN
MAWQQPSPRIRELIREGARIALNPSPEWIEELDRATIAANPAIANDPVLA
KVVQTANRANLVYWAAANLRDPGARVPANLGTEPLRMARDLVRRGLDTVA
FNIYRTGEHIGWRFWMGIAFELTSDPQELRELLDVSARSVNDFIEATLTG
IAAQVQSEHDELTRSTHAERLEVVGLILDGAPISPERAEAKLGYPLSRAH
TAAIIWSDELDGDHSYLDRAADLFCHAVGSTRPLTVVAGAASRWAWVTDA
DGLDIDTVQAAVDNAPGARIAIGTTANGVEGFRRSHLEALITQRTLSRLR
STQRVAFFADVKMVALISQNPDAASEFITSTLGDLESASPDLQTALLTFI
NEQCNASRAAKRLHTHRNTFLRRLESAQRLLPRPLDHTSVHVAVALEALQ
WRGNKAHALSSPGRRSNSVPA
>Rv0068 PROBABLE OXIDOREDUCTASE
MTKWTAADIPDQTGRTAVITGANTGLGFETAAALAAHGAHVVLAVRNLDK
GKQAAARITEATPGAEVELQELDLTSLASVRAAAAQLKSDHQRIDLLINN
AGVMYTPRQTTADGFEMQFGTNHLGHFALTGLLIDRLLPVAGSRVVTISS
VGHRIRAAIHFDDLQWERRYRRVAAYGQAKLANLLFTYELQRRLAPGGTT
IAVASHPGVSNTEVVRNMPRPLVAVAAILAPLMQDAELGALPTLRAATDP
AVRGGQYFGPDGFGEIRGYPKVVASSAQSHDEQLQRRLWAVSEELTGVVY
PVG
>Rv3633 CONSERVED HYPOTHETICAL PROTEIN
MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERELPTVIA
NSTTVVKGLARPGHVPVDRVDHDWVRIDNLLLHGTRYEALPVHPKLLPVI
EGVLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIA
LCDFTADNGATQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWH
TAAANRTDAPRPALTINFCVGFVRQQVNQQLSIPRELVRCFEPRLQELIG
YGLYAGKMGRIDWRPPADYLDADRHPFLDAVADRLQTSVRL
>Rv3519 HYPOTHETICAL PROTEIN
MPVSQHTIAGTVLTMPVRIRTANLHSAMFSVPADPAQRLIDYSGLRVCEY
LPGKAIVMQMLVRYVDGDLGRYHEYGTAIMVNPPGTQRRGPRALTRAAAF
IHHLPVDQVFTLEAGRTIWGFPKIMADFNVTDGRRFGFDVSADGRLIAGI
EFSTGLPVPTLGWQMLKTYSHHDGVTREIPWEMKVSGLRARLGGARLRLG
DHPYAKELASLGLPKRALLSQSAANVEMTFGDGHPI
>Rv2751 CONSERVED HYPOTHETICAL PROTEIN
MARNPAAQTAFGPMVLAAVEQNEPPGRRLVDDDLADLFLPRPLRWLAGAT
RSAVLRRLLISASEWSGRGLWANLACRKRFIGDKLDEALGDIDAVVILGA
GLDTRAYRLTRRVRMPVFEVDLPVNIARKAKTVRRVLGELPLSVRLVALD
FEHDDLLTALAEHGYRTEYRVFFVCEGVTQYLTERAVRRTLEGLRAAAPG
SRMVFTYVRRDFIDGTNRYGTRTLYHTVRQRRQLWHFGLDPEEVAGFLAD
YGWRLTEQAGPEELVQRYVEPTGRNLNASQIEWSAYAEKSEPVTPR
>Rv2423 HYPOTHETICAL PROTEIN
MDNLPIESAESTRLAKAAMTRRFYTRSVVKGEITLPAVPSMIDEYVTMCA
GLFAGVGRKFSDEELAHLRAVLQGQLAEAYAASQRSTIVISYNAPMGPTL
HYQVRAQWRTVAQEYENWIATREPPLFGTEPDARVWALANEAADPTTHRV
LEIGAGTGRNALALARRGHPVDVVEMTPKFADIIRSDAERDSLDVRVIMR
DVFSTMDDLRQDYQLMVLSEVVPDFRTTQQLRNLFELAAQCLAPGARLVF
NAFLANGDYAPDQAAREFGQQMYTGMCTRAEMSAAAAGLPLELVADDSVY
DYEKTHLPPGAWPPTSWYADWIRGLDVFTTNVESCPIEMRWLVFQRRR
>Rv0830 CONSERVED HYPOTHETICAL PROTEIN
MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAPLVRAVG
MDVYTRLVDWQIPVEGDSEFDPQRMATGMACRTRFFDQFFLDATHSGIGQ
FVILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATER
RTVAVDLRDDWATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNIT
ALSAPGSRLAFEFVPDTAIFADERWRNYHNRMSELGFDIDLNELVYHGQR
GHVLDYLTRDGWQTSALTVTQLYEANGFAYPDDELATAFADLTYSSATLM
R
>Rv1506c HYPOTHETICAL PROTEIN
MRIVNAADPFSINDLGCGYGALLDYLDARGFKTDYTGIDVSPEMVRAAAL
RFEGRANADFICAARIDREADYSVASGIFNVRLKSLDTEWCAHIEATLDM
LNAASRRGFSFNCLTSYSDASKMRDDLYYADPCALFDLCKRRYSKSVALL
HDYGLYEFTILVRKAS
>Rv2003c CONSERVED HYPOTHETICAL PROTEIN
MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADCCWNQLA
VTPDTRMPASSAAGRDAAAYDAWYDSPTGRPILATEVAALRPLIEVFAQP
RLEIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVS
RHFGAVLMAFTLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLY
ALRAARGQPGYRDARFYTAAELEQLLADSGFRVIARRCTLHQPPGLARYD
IEAAHDGIQAGAGFVAISAVDQAHEPKDDHPLESE
>Rv0316 POSSIBLE MUCONOLACTONE ISOMERASE
MEFLVTMTTRVPDSMPADAVERVRAREAARSRELAAQGKLLRLWRPPLRP
GEWRTLGLFAADDNGELEQLLASMPPRSWRTDDVTPLGAHPNDPVGQGIT
IAPGKGPEFLIATTIMVPPGTPAQVVDDTVAREARRAPELAGRGHLVRLW
ALPDGPDGQRTLGLWRARDPGELMAILESLPLAGWMTIETTPLSPHPDDP
IRMP
>Rv2370c CONSERVED HYPOTHETICAL PROTEIN
MVLPKPTPRGRELIRQAAKVALHPTPEWLDELDRATLAAHPSIAADPALA
TVVSRANRSHLIHFATANLRKPGQPVPANLGPDPLRMARDLVRRGLDASA
LDVYRVGQNVAWQRWTEIAFGLTTDPQELHELLTLPFRSASEFIDATLAG
LAAQMQLEYDELTRDVHAEHRRIVELILDGAPISRQSAEAKLGYPLDRSH
TAAIIWYDDPDDNQNHLDHTARAFGRALGCPQPLIAVASAATRWVWVSDA
ATLDTDRIHQVLDHAPHARIAVGTTARGIDGFRRSHRDALATQRMLARLR
SQQRLAFFADIHMIAVLTENPDSAADFITSTLGDLESASPQLLTTVLTYI
NEQCNASRAAHVLHTHRNTLLRRLETAQRLLPRPLDHTIIQVAVAISALQ
WRGSQTSDPVETPVEGITSPPPESLGRRRSRLAQLER
>Rv1498c PROBABLE METHYLTRANSFERASE
MLDVGCGSGRMALPLTGYLNSEGRYAGFDISQKAIAWCQEHITSAHPNFQ
FEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPPDVEH
YLDEISRVLKPGGRCLCTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIH
KKRPEEAIGLPETFVRDVYGKFGLAVHEPLHYGSWSGREPRLSFQDIVIA
TKTAS
>Rv1714 Probable oxidoreductase
MEEMALAQQVPNLGLARFSVQDKSILITGATGSLGRVAARALADAGARLT
LAGGNSAGLAELVNGAGIDDAAVVTCRPDSLADAQQMVEAALGRYGRLDG
VLVASGSNHVAPITEMAVEDFDAVMDANVRGAWLVCRAAGRVLLEQGQGG
SVVLVSSVRGGLGNAAGYSAYCPSKAGTDLLAKTLAAEWGGHGIRVNALA
PTVFRSAVTEWMFTDDPKGRATREAMLARIPLRRFAEPEDFVGALIYLLS
DASSFYTGQVMYLDGGYTAC
>Rv0074 CONSERVED HYPOTHETICAL PROTEIN
MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAG
SACPDMNLVDLGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRA
RRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLT
RSGGHCWFLGGVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPW
QPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAGVDGIEHCTFL
SEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRR
LIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSRHGFTSTEVLTGATAAA
AASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL
QASAVGYNTPS
>Rv1429 CONSERVED HYPOTHETICAL PROTEIN
MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRA
SITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHR
LGHARFLEVAMQYVSLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEH
EHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWV
DSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSP
APTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSW
LRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAA
FRVQMALEVCRWMAPAVLRAKQ
>Rv2263 Possible oxidoreductase
MAKDLVATVPDLSGKLAIITGANSGLGFGLARRLSAAGADVIMAIRNRAK
GEAAVEEIRTAVPDAKLTIKALDLSSLASVAALGEQLMADGRPIDLLINN
AGVMTPPERVTTADGFELQFGSNHLGHFALTAHLLPLLRAAQRARVVSLS
SLAARRGRIHFDDLQFERSYAPMTAYGQSKLAVLMFARELDRRSRAAGWG
IISNAAHPGLTKTNLQIAGPSHGRDKPALMERLYKTSWRFAPFLWQEIEE
GILPALYAAATPQADGGAFYGPRGRYEVAGGGVREAKVPAAARNDADSKR
LWEVSEQLTGVSYPKSR
>Rv1847 CONSERVED HYPOTHETICAL PROTEIN
MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLLQLTGVV
HGGVYCAMIESIASMAAFAWLNSHGEGGSVVGVNNNTDFVRSISSGMVYG
TAEPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP
>Rv0329c CONSERVED HYPOTHETICAL PROTEIN
MRLTHPARRYLSSQAARPTGAFGRLLGRIWRAETADVNRIAVELLAPGPG
ERVCEIGFGPGRTLGLLAAAGAQVSGVEVSTTMIAIAAHHNAKAIAAGLI
SLYHGDGVTLPVADHSLDKVLGVHNFYFWPDPRASLCDIARALRPGGRLV
LTSISDDQPLAARFDPAIYRVPPTLDTAAWLGAAGFIDVGIKRSADHPAT
VWFTATAT
>Rv0846c PROBABLE OXIDASE
MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGAAGMTAA
IDAAEAARPHSGRTVTATLTPQPARIDLGGPIVSTLTYGNTIPGPLIRAT
VGDEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRF
SVPDPGTYWAHPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTD
GIGKSPQQLYGELTDPNKPTMQNTTGMPEGEGVDSNLLGGDGGDIAYPYY
LINGRIPVAATSFKAKPGQRIRIRIINSAADTAFRIALAGHSMTVTHTDG
YPVIPTEVDALLIGMAERYDVMVTAAGGVFPLVALAEGKNALARALLSTG
AGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLPVTLGGTMA
KYDWTINGEPYSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIK
ADGSPGARKDTVIVLPKQKMRAVLVADNPGVWVMHCHNNYHQVAGMATRL
DYIL
>Rv0725c CONSERVED HYPOTHETICAL PROTEIN
MPRAHDDNWDLASSVGATATMVAAGRALATKDPRGLINDPFAEPLVRAVG
LDFFTKLIDGELDIATTGNLSPGRAQAMIDGIAVRTKYFDDYFRTATDGG
VRQVVILAAGLDARAYRLPWPAGTVVYEIDQPQVIDFKTTTLAGIGAKPT
AIRRTVYIDLRADWPAALQAAGLDSTAPTAWLAEGMLIYLPPDPRTGCST
TAPNSVLRAARSLPNLSRALWISTQAGYEKWRIRFASTAWTSTWRRWCIP
ANAATSSTTCAPRAGTLRAQCGPTYSGAMVCPFPPHTTTIRSAKSSSSAV
V
>Rv3762c POSSIBLE HYDROLASE
MPMEHKPPTAVIQAAHGEHSLPLHDTTDFDDADRGFIAALSPCVIKAADG
RVVWDNDAYSFLDGAAPTSVHPSLWRQSQLTAKQGLYQVVPGIYQVRGFD
ISNISFVEGDTGLIVIDPLVSTEVAAAALDLYRAHRGADRPVVAVIYTHS
HVDHFGGVLGVTTQADVDAGKVAVLAPEGFTAHAVQENIYAGSAMMRRAG
YMYGTVLARGLRGHVGCGLGQTLSTGEVSLVVPTVDITETGETHTIDGVE
IEFQMAPGTEAPAEMHFYFPRFRALCMAENATHNLHNLLTLRGALVRDPR
AWSGYLTEAIDTFADRTDVVFASHHWPTWGREKIVEFLSQQRDMYSYLHD
QTLRLLNQGYTGVEIAEMFQLPPALQRAWHTHGYYGSVSHNVKAIYQRYM
GWFDGNPGWLWPHPPEALAPRYVDALGGIDRVLELAREAFDAGDFRWAAT
LLDHAVFADSEHAAARGLYADTLEQLAYGAECATWRNFFLTGAAELRDGN
PGSSGQVPAPTFFAQLTPDQIFDVLAISINGPRAWDLDLAIDFTFTEPDV
NYRLTLRNGVLIHRKLPADPATANATVTVGDKVRLVAAALGDISSPGFEV
FGDRTVLQTFLSVLDRPDSAFNIVTP
>Rv0897c PROBABLE OXIDOREDUCTASE
MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGGAAVSIQ
AFDGVEVALSRYSYLVSLLPSRIVADLGAPVRLARRPFSSYTPAPATAGR
SGLLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPL
RTREQARRDIVEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATD
ALIGTFARMHEPSLMQNICFLYHLVGGGTGVWHVPIGGMGSVTSALATAA
ARHGAEIVTGADVFALDPDGTVRYHSDGSDGAEHLVRGRFVLVGVTPAVL
ASLLGEPVAALAPGAQVKVNMVVRRLPRLRDDSVTPQQAFAGTFHVNETW
SQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAGAQTLTVFG
LHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIET
TTTLDLQRTLGMTGGNIFHGALSWPFADNDDPLDTPARQWGVATDHERIM
LCGSGARRGGAVSGIGGHNAAMAVLACLASRRKSP
>Rv1941 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGDAADAAAT
KIGCGAAACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLID
TTVEDFDRVIAINLRGAWLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGG
TGAYGMSKAGIIQLSRITAAELRSSGIRSNTLLPAFVDTPMQQTAMAMFD
GALGAGGARSMIARLQGRMAAPEEMAGIVVFLLSDDASMITGTTQIADGG
TIAALW
>Rv2286c CONSERVED HYPOTHETICAL PROTEIN
MTTVDFHFDPLCPFAYQTSVWIRDVRAQLGITINWRFFSLEEINLVAGKK
HPWERDWSYGWSLMRIGALLRRTNMSLLDRWYAAIGHELHTLGGKPHDPA
VARRLLCDVGVNAAILDAALDDPTTHDDVRADHQRVVAAGGYGVPTLFLD
GQCLFGPVLVDPPAGPAALNLWSVVTGMAGLPHVYELQRPKSPADVELIA
QQLRPYLDGRDWVSINRGEIVDIDRLAGRS
>Rv3767c CONSERVED HYPOTHETICAL PROTEIN
MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARIFVDAAG
DGIWSMYTNRTLLAGATDLDPDLRAPIQQMIDFMAARTAFFDEYFLATAD
AGVRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQ
PASQLVNVPIDLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLL
FERIDALSRPGSWLASNVPGAGFLDPERMRRQRADMRRMRAAAAKLVETE
ISDVDDLWYAEQRTAVAEWLRERGWDVSTATLPELLARYGRSIPHSGEDS
IPPNLFVSAQRATS
>Rv0913c POSSIBLE DIOXYGENASE
MDITIVGKYLSTLPEDDDHPYRTGPWRPQTTEWDADDLTTVTGEVPADLD
GIYLRNTENPLHPAFATYHPFDGDGMIHVVGFRDGKAFYRNRFIRTDGFL
AENEAGGPLWPGLAEPVQLAKREHGWGARGLMKDASSTDVIVHRGIALTS
FYQCGDLYRIDPYSANTLGKESWHGRFPFDWGVSAHPKVDNKTGELLFFN
YSKQEPYMRYGVVDQNNELVHYVDVPLPGPRLPHDMAFTENYVILNDFPL
FWDPRLLERDVHLPRFYPEIPSRFAVVARRGNDIRWFEADPTFVLHFTNA
YEQGDEIVLDGFYEGDPQPLDTGGTKWEKLFRFLALDRLQSRLHRWRLNM
VTGAVHEEQLSESITEFGTINADYAASSYRYTYAATGKPSWFLFDGLVKH
DLLTGNHECYSFGDGVYGSETAMAPRVGSSAEDDGYLVTLTTDMNDDASY
CLVFDAARPGDGPICKLALPERISSGTHSAWVPGAELRRWDHAESPAAAV
GL
>Rv2993c POSSIBLE 2-HYDROXYHEPTA-2,4-DIENE-1,7-DIOATE ISOMERASE (HHDD ISOMERASE)
MTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNYADHIAE
MGGRPPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACK
DVPAAQAVDNILGYTIGNDVSARDQQQSDGQWTRAKGHDTFCPVGPWIVT
DLAPFDPADLELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGD
LILTGTPAGVGPIEDGDTVSITIEGIGTLTNPVVRKGKP
>Rv2242 CONSERVED HYPOTHETICAL PROTEIN
MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSAMQERLP
FFADLEASQRASVALVVQTAVVNFVEWMHDPHSDVGYTAQAFELVPQDLT
RRIALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFT
AATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAP
ATVLVGTPAPGPNGSNSDGDSERASQDVRDTAARHGRAALTDVHGTWLVA
IVSGQLSPTEKFLKDLLAAFADAPVVIGPTAPMLTAAHRSASEAISGMNA
VAGWRGAPRPVLARELLPERALMGDASAIVALHTDVMRPLADAGPTLIET
LDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQPRDAYVLR
VAATVGQLNYPTPH
>Rv0687 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICAPVSGSV
TYPPATSEDLGETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGR
LDIVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAG
NGGSIVVVSSSAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNS
IHPYSVDTPMIEPEAMIQTFAKHPGYVHSFPPMPLQPKGFMTPDEISDVV
VWLAGDGSGALSGNQIPVDKGALKY
>Rv0224c POSSIBLE METHYLTRANSFERASE (METHYLASE)
MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAMIGDLWL
ATHSEPPVGRTLLDVGGGPGYFATAFSDAGVGYIGVEPDPDEMHAAGPAF
TGRPGMFVRASGMALPFADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKP
GGLVVLSYTVWLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSS
LFAVSAAEGLRWAAGTGAALAVFPRYHPRWAWWLTSVPVLREFLVSNLVL
VLTP
>Rv0726c CONSERVED HYPOTHETICAL PROTEIN
MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLINDQFAEPL
VRAVGVDVLTRLASGELTASDIDDPERPNASMVRMAEHHAVRTKFFDEFF
MDATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLA
ELGATPTADRRVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPE
AQDRLLDNVTALSVPDSRFATESIRNFKPHHEERMRERMTILANRWRAYG
FDLDMNELVYFGDRNEPASYLSDNGWLLTEIKSQDLLTANGFQPFEDEEV
PLPDFFYVSARLQRKHRQYPAHRKPAPSWRHTACPVNELSKSAAYTMTRS
DAHQASTTAPPPPGLTG
>Rv2857c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MMDLSQRLAGRVAVITGGGSGIGLAAGRRMRAEGATIVVGDVDVEAGGAA
ADELSGLFVPTDVCDEDAVNGLFDGAAETYGRIDIAFNNAGISPPEDNLI
ENTELAAWQRVQDVNLKSVYLCCRAALRHMVLAGKGSIVNTASFVAVMGS
ATSQISYTASKGGVLAMSRELGVQFARQGIRVNALCPGPVNTPLLQELFA
KNPERAARRMVHVPLGRFAEPDEIAAAVAFLASDDASFITASTFLVDGGI
SSAYVTPL
>Rv3787c CONSERVED HYPOTHETICAL PROTEIN
MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEPLVRAVG
VEFLTRWATGELDAADVDDPDAAWGLQRMTTELVVRTRYFDQFFLDAAAA
GVRQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQP
TADLRMVPADLRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLL
DHVTDLSAPGSRLALEAFLGSADRDSARVEEMIRTATRGWREHGFHLDIW
ALNYAGPRHEVSGYLDNHGWRSVGTTTAQLLAAHDLPAAPALPAGLADRP
NYWTCVLG
>Rv0547c POSSIBLE OXIDOREDUCTASE
MSKRPLRWLTEQITLAGMRPPISPQLLINRPAMQPVDLTGKRILLTGASS
GIGAAATKQFGLHRAVVVAVARRKDLLDAVADRITGDGGTAMSLPCDLSD
MEAIDALVEDVEKRIGGIDILINNAGRSIRRPLAESLERWHDVERTMVLN
YYAPLRLIRGLAPGMLERGDGHIINVATWGVLSEASPLFSVYNASKAALS
AVSRIIETEWGSQGVHSTTLYYPLVATPMIAPTKAYDGLPALTAAEAAEW
MVTAARTRPVRIAPRVAVAVNALDSIGPRWVNALMQRRNEQLNP
>Rv0927c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTSSELDAV
AEQIRAAGRRAHTVAADLAHPEVTAQLAGQAVGAFGKLDIVVNNVGGTMP
NTLLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMG
RLAARGFAAYGTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVV
AANDELRAPMEQATPLRRLGDPVDIAAAAVYLASPAGSFLTGKTLEVDGG
LTFPNLDLPIPDL
>Rv0145 CONSERVED HYPOTHETICAL PROTEIN
MTELDDVSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAATNPLIR
DEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVHRVACDYQAVRTHFF
DEYFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKA
GILQSHGAVPTARRHAVAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPY
LPGDAADRLFDMVTALSAPGSQVAVEAFTMNTKGNTQRWNRMRERLGLDI
DVQALTYHEPDRSDAAQWLATHGWQVHSVSNREEMARLGRAIPQDLVDET
VRTTLLRGRLVTPAQPA
>Rv0356c CONSERVED HYPOTHETICAL PROTEIN
MTDASVHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQDLAVAA
DPGDAVWDEAAERAAALVELLSPFEADEGKAPAGRTPGLPGMGSLLLPPW
TVTRYGTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRP
ISRTAFLHVDYRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLA
EGNGLMVRLLPGQP
>Rv3174 PROBABLE SHORT-CHAIN DEHYDROGENASE/REDUCTASE
MTSLAERTVLVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAIDVSDPR
VIPLQLDVTDAVSVAEAADLATDVGILINNAGISRASSVLDKDTSALRGE
LETNLFGPLALASAFADRIAERSGAIVNVSSVLAWLPLGMSYGVSKAAMW
SATESMRIELAPRGVQVVGVYVGLVDTDMGRFADAPKSDPADVVRQVLDG
IEAGKEDVLADEMSRQVRASLNVPARERIARLMGN
>Rv0654 PROBABLE DIOXYGENASE
MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRYLRNGPN
PVAEVDPATYHWFTGDAMVHGVALRDGKARWYRNRWVRTPAVCAALGEPI
SARPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCD
FDGTLHGGYTAHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTV
DIEVAGSPMMHSFSLTDNYVVIYDLPVTFDPMQVVPASVPRWLQRPARLV
IQSVLGRVRIPDPIAALGNRMQGHSDRLPYAWNPSYPARVGVMPREGGNE
DVRWFDIEPCYVYHPLNAYSECRNGAEVLVLDVVRYSRMFDRDRRGPGGD
SRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHRFAYTVGIE
GGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAE
DDGILMGYGWHRGRDEGQLLLLDAQTLESIATVHLPQRVPMGFHGNWAPT
T
>Rv3530c POSSIBLE OXIDOREDUCTASE
MTGMLKRKVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERLDDVAKQ
IIDTGRRAVAVRTDITDDDDVSNLVQATLAAYGKADVLINNAFRVPSMKP
LAGTTFEHIRDAIELSALGTLRLIQAFTPALAQSHGAIVNVNSMVIRHSQ
PKYGTYKMAKSVLLAMSHSLATELGEQGIRVNSVAPGYIWGDTLKSYFDH
QAGKYGTTVDQIYQATAANSDLKRLPTEDEVASAILFLASDLASGITGQT
LDVNCGEYHT
>Rv1050 PROBABLE OXIDOREDUCTASE
MARQRFRDQVVLITGASSGIGEATAKAFAREGAVVALAARREGALRRVAR
EIEAAGGRAMVAPLDVSSSESVRAMVADVVGEFGRIDVVFNNAGVSLVGP
VDAETFLDDTREMLEIDYLGTVRVVREVLPIMKQQRSGRIMNMSSVVGRK
AFARFAGYSSAMHAIAGFSDALRQELRGSGIAVSVIHPALTQTPLLANVD
PADMPPPFRSLTPIPVHWVAAAVLDGVARRRARVVVPFQPRLLMVGDAFS
PRYGDRVVRLLESKIFGRLIGSYRGSVYRHQPTESAKAQAAQPERGYSSA
R
>Rv3342 POSSIBLE METHYLTRANSFERASE (METHYLASE)
MTCSRRDMSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLDLGAGTG
KLTTRLVERGLDVVAVDPIPEMLDVLRAALPQTVALLGTAEEIPLDDNSV
DAVLVAQAWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGE
IIGRDGDPVRDRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCIT
SPAQVRTKTLDRVRQLLATHPALANSNGLALPYVTVCVRATLA
>Rv3030 CONSERVED HYPOTHETICAL PROTEIN
MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENYWFRRHQ
VVYQRLAPRCTARDVLEAGCGEGYGADLIACVARQVIAVDYDETAVAHVR
SRYPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLR
GSGLLMVSTPNRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAM
CGLFHGPRLRDMDARHGGSIIDAQIMRAVAGAPWPPELAADVAAVTTADF
EMVAAGHDRDIDDSLDLIAIAVRP
>Rv1453 POSSIBLE TRANSCRIPTIONAL ACTIVATOR PROTEIN
MALRETSPRIHELIREAARIALNPTQEWLDEFDRAILAANPSIAADPALA
TVVKRSNRAHLIHFAAANLRNPGAPVPANLGPEPLRMARDLVRVGLDALA
LDIYRIGQNVAWRRWTDIAFGLTSDPDELHELLDVPFRTANEFVDTTLAG
ITTEMQLERDKLTRDVPAERRKIVQLLIDGAPISREHAEARLGYPLDRSH
TAAVIWGDQAQGDHSHLDRVADAFGHAGGCPHPLVVVAGAATRWVWVKDA
PGFDIDLIHEVLHDIPDARIAIGATAPGIEGFRRSHRDALTTARMIIRLE
SPHRVAFFTDVEMVALLTENAEGADDFIQRTLGNLESASPALKTTLLTFI
NQQCNASRAARLLFTHRNTLMNRLETAQRLLPRPLADTTIHVAVALEAQQ
WREKPTSDPPAKKESNGTKMR
>Rv3057c PROBABLE SHORT CHAIN ALCOHOL DEHYDROGENASE/REDUCTASE
MLQRGAGQYFAGKRCFVTGAASGIGRATALRLAAQGAELYLTDRDRDGLA
QTVCDARALGAQVPEHRVLDVSDYQDVAAFAADIHARHPSMDVVLNIAGV
SAWGTVDQLTHDQWSRMVAINLMGPIHVIETLVPPMVAAGRGGHLVNVSS
AAGLVGLPWHAAYSASKYGLRGLSEVLRFDLARHGIGVSVVVPGAVKTPL
VNTVEIAGVDRDDPRVNRWVERFSGHAVTPEKAADKILAGVTRNRYLVYT
SADIRALYAFKRYAWWPYTLVMRRVNVFFTRALRPGP
>Rv1333 PROBABLE HYDROLASE
MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGAVDCRGG
APGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVA
MDSGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVG
VGARAGALKGGVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADL
VGEFALRAPPAEQIAALAQLSSPLGAFNTPFNTTIGVIACDAALSPAACR
RIAIAAHDGLARTIRPAHTPLDGDTVFALATGAVAVPPEAGVPAALSPET
QLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTYRDMFPGAFGS
>Rv0326 HYPOTHETICAL PROTEIN
MVATDFSDVAVAQLRRSAQARGVSARVQPIVHDLRQPLPVKTGSIDGAFA
HMALCMALSTSEIHAVVAEVGRVLRPGGKFIYTVRHTGDAHYGAGQAHGD
DIFECAGFAVHFFRRELVARLATGWVLEEVHDFEEGELPRRLWRVTVTKP
A
>Rv2067c CONSERVED HYPOTHETICAL PROTEIN
MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSHRILWPD
REYRPDLDILIAGCGTNQAAIFAFTNRAAKVVAIDISRPALDHQQYLKDK
HGLANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLR
RDGVVAAMLYGKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTY
HPLRNYLTKARDLLSDSALVDTFLHGRQRSYTVEECVDLVTSAGLVFQGW
FHKAPYYPHDFFVPNSEFYAAVNTLPEVKAWSVMERLETLNATHLFMACR
RDRPKEQYTIDFSTVAALDYVPLMRTRCGVSGTDMFWPGWRMAPSPAQLA
FLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQSLWRLDFVA
VALPASG
>Rv1543 POSSIBLE FATTY ACYL-CoA REDUCTASE
MNLGDLTNFVEKPLAAVSNIVNTPNSAGRYRPFYLRNLLDAVQGRNLNDA
VKGKVVLITGGSSGIGAAAAKKIAEAGGTVVLVARTLENLENVANDIRAI
RGNGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNAGRSIRRSLE
LSYDRIHDYQRTMQLNYLGAVQLILKFIPGMRERHFGHIVNVSSVGVQTR
APRFGAYIASKAALDSLCDALQAETVHDNVRFTTVHMALVRTPMISPTTI
YDKFPTLTPDQAAGVITDAIVHRPRRASSPFGQFAAVADAVNPAVMDRVR
NRAFNMFGDSSAAKGSESQTDTSELDKRSETFVRATRGIHW
>Rv0839 CONSERVED HYPOTHETICAL PROTEIN
MNDKRRAIYTHGYHESVLRSHRRRTAENSAGYLLPYLVPGLSVLDVGCGP
GTITVDLAARVVPGSVTGVEPTDDALSLARAEAQLHRLSNISFTTSDVHK
LDFPDDAFDVVHAHQVLQHVADPVRALQEMRRVCTPGGIVAARDADYSGF
IWFPKLPALDRWLDLYERAARANGGEPDAGRRLLSWARAAGFDDVTPTAS
VWCFATASAREWWGLVWADRILQSDLAHQLVDSGLATAAQLEEISTAWRE
WAAAPDGWLAIPHGEILCRA
>Rv2794c CONSERVED HYPOTHETICAL PROTEIN
MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARSVAKRRN
EFITVRHCARIALDQLGVPPAPILKGDKGEPCWPDGMVGSLTHCAGYRGA
VVGRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDR
ILFCAKEATYKAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGS
TLSGPPLTTLRGRWSVERGLVLTAIVL
>Rv1597 HYPOTHETICAL PROTEIN
MARTFEDLVAEAASASVGGWGFSWLDGRATEERPSWGYQRQLSQRLANAT
AALDLETGGGEVLAGAGNFPPTMVATEAWPPNAAMATRRLHPLGAVVVIT
GDKPPLPFADAAFDLVTSRHPSTRWWTEIARVLRAGGSYFAQHVGPATLW
DLREHFLGPREHNGADQYAQVVRTCITDAGLEIVDLQMERLRVEFFDVGA
VIYFLRKVIWFLPDFTVEGYHDRLRALHERIQAEGPFVTYSTRALIEARK
PS
>Rv0560c POSSIBLE BENZOQUINONE METHYLTRANSFERASE (METHYLASE)
MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQ
PELAALIVQGKFRGDVLDVGCGEAAISLALAERGHTTVGLDLSPAAVELA
RHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYL
QSIVRAAAPGASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIK
PARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHLG
>Rv3224 POSSIBLE IRON-REGULATED SHORT-CHAIN DEHYDROGENASE/REDUCTASE
MSLNGKTMFISGASRGIGLAIAKRAARDGANIALIAKTAEPHPKLPGTVF
TAAKELEEAGGQALPIVGDIRDPDAVASAVATTVEQFGGIDICVNNASAI
NLGSITEVPMKRFDLMNGIQVRGTYAVSQACIPHMKGRENPHILTLSPPI
LLEKKWLRPTAYMMAKYGMTLCALGIAEEMRADGIASNTLWPRTMVATAA
VQNLLGGDEAMARSRKPEVYADAAYVIVNKPATEYTGKTLLCEDVLVESG
VTDLSVYDCVPGATLGVDLWVEDANPPGYLPA
>Rv3549c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MTLAEAADAINFGLAGRVVLVTGGVRGVGAGISSVFAEQGATVITCARRA
VDGQPYEFHRCDIRDEDSVKRLVGEIGERHGRLDMLVNNAGGSPYALAAE
ATHNFHRKIVELNVLAPLLVSQHANVLMQAQPNGGSIVNICSVSGRRPTP
GTAAYGAAKAGLENLTTTLAVEWAPKVRVNAVVVGMVETERSELFYGDAE
SIARVAATVPLGRLARPADIGWAAAFLASDAASYISGATLEVHGGGEPPP
YLGASSANK
>Rv0281 CONSERVED HYPOTHETICAL PROTEIN
MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAFCRAVGG
SWADVLDGKLPDHKLKSTDFGEHFVNFQGARTKYFDEYFRRAAAAGARQV
VILAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRR
EIAVDLRDDWPQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDA
LAGRRSHVAVEDGAPMGPDEYAAKVEEERAAIAEGAEEHPFFQLVYNERC
APAAEWFGERGWTAVATLLNDYLEAVGRPVPGPESEAGPMFARNTLVSAA
RV
>Rv0100 CONSERVED HYPOTHETICAL PROTEIN
MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQ
SELPSRLAANPSIAGWLRELEAVCTEFG
>Rv3485c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MNSRAPRNLAVSSPSAQVTGRMVQNGENLFQFRREGPQVQLSFQDRTYLV
TGGGSGIGKGVAAGLVAAGAAVMIVGRNPDKLAAAVKDIEALKTGAIGYE
PADITDEEQTLRVVDAATAWHGRLHGVVHCAGGSQTIGPITQIDSQAWRR
TVDLNVNGTMYVLKHAARELVRGGGGSFVGISSIAASNTHRWFGAYGVTK
SAVDHMMKLAADELGPSWVRVNSIRPGLIRTDLVVPVTESPELSADYRVC
TPLPRVGEVEDVANLAMFLLSDAASWITGQVINVDGGHMLRRGPDFSPML
EPVFGADGLRGVVG
>Rv1403c PUTATIVE METHYLTRANSFERASE
MTVYTPTSERQAPATTHRQMWALGDYAAIAEELLAPLGPILVSTSGIRRG
DRVLDVAAGSGNVSIPAAMAGAHVTASDLTPELLRRAQARAAAAGLELGW
REANAEALPFSAGEFDAVLSTIGVMFAPRHQRTADELARVCRRGGKISTL
NWTPEGFYGKLLSTIRPYRPTLPAGAPHEVWWGSEDYVSGLFRDHVSDIR
TRRGSLTVDRFGCPDECRDYFKNFYGPAINAYRSIADSPECVATLDAEIT
ELCREYLCDGVMQWEYLIFTARKC
>Rv3037c CONSERVED HYPOTHETICAL PROTEIN
MRARFGDRAPWLVETTLLRRRAAGKLGELCPNVGVSQWLFTDEALQQATA
APVARHRARRLAGRVVHDATCSIGTELAALRELAVRAVGSDIDPVRLAMA
RHNLAALGMEADLCRADVLHPVTRDAVVVIDPARRSNGRRRFHLADYQPG
LGPLLDRYRGRDVVVKCAPGIDFEEVGRLGFEGEIEVISYRGGVREACLW
SAGLAGSGIRRRASILDSGEQIGDDEPDDCGVRPAGKWIVDPDGAVVRAG
LVRNYGARHGLWQLDPQIAYLSGDRLPPALRGFEVLEQLAFDERRLRQVL
SALDCGAAEILVRGVAIDPDALRRRLRLRGSRPLAVVITRIGAGSLSHVT
AYVCRPSR
>Rv0765c PROBABLE OXIDOREDUCTASE
MPRFEPHPARRTTVVAGASSGIGAATATELAGRGFPVALGARRMDKLAEL
VDKIRADGGEAVAFPLDVTDPESVKSFVAQTVEALGEVELLVSSAGDMLP
GQLHEVSTEAFAEQVQIHLVGANRLATAVLPAMVARRRGDLIFVGSDVGL
RQRPHMGAYGAAKAGLAAMVTNLQMELEGTGVRASIVHPGPTLTGMGWQL
SAEQVGPMLADWAKWGQARHNYFLRPSDLARAIAFVAETPRGCVVVNMEI
QPEAPLRDAPAHRQKLVLGEEGMPG
>Rv2054 CONSERVED HYPOTHETICAL PROTEIN
MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKLISERIA
RAGYVVLTPNMYARGGRARCITRVFRELLTKRGRALDDILAARDHLLAMP
ECSGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACP
IVASFGTRDPLGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQP
LVRIAGFGYNEAATEDAWRRVFEFFGQHLRAGSPGEP
>Rv1515c CONSERVED HYPOTHETICAL PROTEIN
MSTNPGPAEGANQVMAQEHSAGAVQFTAHNVRLDDGTLTIPESSRTLDES
SWFISARGILETVFPGDKSHLRLADVGCLEGGYAVGFARMGFQVLGIEVR
ELNMAACNYIKSKTNLPNLRFVHDNALNIANHGLFDTVFCCGLFYHLENP
KQYLETLSSVTNKLLILQTHFSIINRSDKWLRLPTTARQLTDRLLRRPAP
VKFMLSAPTEHEGLPGRWFTEFSDDRSFGQRDTAKWASWDNRRSFWIQRE
HLLQAIKDVGVDLVMEEYDNLEPSIAESLLGGSYAANLRGTFIGIKTR
>Rv0945 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRLTELKAE
LSQRYPDIKIAVAELDVNDHERVPKVFAELSDEIGGIDRVIVNAGIGKGA
RLGSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVK
GVPGVKAAYAASKAGVRSLGESLRAEYAQRPIRVTVLEPGYIESEMTAKS
ASTMLMVDNATGVKALVAAIEREPGRAAVPWWPWAPLVRLMWVLPPRLTR
RFA
>Rv3559c PROBABLE OXIDOREDUCTASE
MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGADVVISD
HHERRLGETAAELSALGLGRVEHVVCDVTSTAQVDALIDSTTARMGRLDV
LVNNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGG
VIVNNASVLGWRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSP
SIARHKFLDKTASAELLDRLAAGEAFGRAAEPWEVAATIAFLASDYSSYL
TGEVISVSCQHP
>Rv0148 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARDGTGAGSA
MADEVVAEIRDKGGRAVANYDSVATEDGAANIIKTALDEFGAVHGVVSNA
GILRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSYGRVVVAT
STSGLFGNFGQTNYGAAKLGLVGLINTLALEGAKYNIHANALAPIAATRM
TQDILPPEVLEKLTPEFVAPVVAYLCTEECADNASVYVVGGGKVQRVALF
GNDGANFDKPPSVQDVAARWAEITDLSGAKIAGFKL
>Rv0303 PROBABLE DEHYDROGENASE/REDUCTASE
MNTGTAVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRAAMEELG
EPNRCSVLEVDLASVRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGIAF
TDDGVEMTFGVNHLGHFALVTGILDWLARPARIVVVSSGTHDPSKHTGMP
DPRYTCAADLAHPPTDQNTPAEGRRRYTTSKLCNVLFTYELDRRLDHGEQ
GVMVNAFDPGLMPGSGLARDYPPILRLAYRLLSPMLRVLPFVHSTRVSGE
HLAALAVDPRFAGVTGQYFAGAKAIRSSAESYDRAKALDLWETSERLLAQ
VT
>Rv1889c CONSERVED HYPOTHETICAL PROTEIN
MPRTNNDAWDLATSVGATATMVAAARAVATRADNPLIDDPFAEPLVRAVG
IDFFTRWAAGNIKATDVDDPDGTWGLQRLADLLAARTRYFDAFFRDATSA
GIRQAVILASGLDARAYR
>Rv0893c CONSERVED HYPOTHETICAL PROTEIN
MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVFCRAAGG
EWADVLDGKLPDHYLTTGDFGEHFVNFQGARTRYFDEYFSRATAAGMKQV
VILAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRR
SVAVDLRDEWQIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDT
LASPGSHVAVEEATPLDPCEFAAKLERERAANAQGDPRRFFQMVYNERWA
RATEWFDERGWRATATPLAEYLRRVGRAVPEADTEAAPMVTAITFVSAVR
TGLVADPARTSPSSTSIGFKRFEAD
>Rv1882c PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MKAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQLGAERL
WARAVDVTDKAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYE
AAVRVVDVNFKAVLTGAYAALPYLKKAPGSLMFSTSSSSGTYGMPRIAVY
SATKHAVKGLTEALSVEWQRHGVRVADVLPGLIDTAILTSTRQHSDEGPY
TISAEQIRAAAPKKGMFRLMPSSSVAEAAWRAYQHPTRLHWYVPRSIRWI
DRLKGVSPEFVRRHIAKSLATLEPKRK
>Rv1683 Possible long-chain acyl-CoA synthase
MVDLNFSMVTRPIERLVATAQNGLEVLRLGGLETGSVPSPSQIVESVPMY
KLRRYFPPDNRPGQPPVGPPVLMVHPMMMSADMWDVTREDGAVGILHASG
LDPWVIDFGSPDEVEGGMRRNLADHIVALSEAVDTVKDATGHDVHFVGYS
QGGMFCYQAAAYRRSKDIASVVAFGSPVDTLAALPMGIPANMGAAVADFM
ADHVFNRLDIPSWMARMGFQMMDPLKTAKARVDFVRQLHDREALLPREQQ
RRFLESEGWIAWSGPAISELLKQFIAHNRMMTGGFAISGQMVTLTDITCP
ILAFVGEVDDIGQPASVRGIRRAAPNSEVYECLIRAGHFGLVVGSRAAQQ
SWPTVADWVRWISGDGTKPENIHLMADQPAEHTDSGVAFSSRVAHGIGEV
SEAALALARGAADAVVAANRSVRTLAVETVRTLPRLARLGQLNDHTRISL
GRIIDEQAHDAPKGEFLLFDGRVHTYEAVNRRINNVVRGLIAVGVRQGDR
VGVLMETRPSALVAIAALSRLGAVAVVMRPDTDLSASVRLGRVTEILTDP
TNLDAARQLPGQVLVLGGGESRDLDLPADALEQGQVIDMEKIDPDAVELP
AWYRPNPGLARDLAFIAFSSADGDLVAKQITNYRWAVSAFGTASTAALGR
RDTVYCLTPLHHESALLVSLGGAVVGGTRIALSRGLRPDRFVAEVRQYGV
TVVSYTWAMLRDVVDDPAFVLHGNHPVRLFIGSGMPTGLWERVVEAFAPA
HVVEFFATTDGQAVLANVAGAKIGSKGRPLPGAGRVELGAYDAEHDLILE
NDRGFVQVAGVNQVGVLLAQSRGPIDPTASVKRGVFAPADTWISTDYLFW
RDDDGDYWLAGGRGSVVRTARGMVYTEPVTNALGLITGVDLAVTYGVLVR
GRHVAVSAVTLLPGATITAADLTEAVASMPVGLGPDIVHVVPQLTLSGTY
RPTVSALRANGIPKAGRQAWYFNSGGNEYRRLTPAVRTELTGQHRRGNA
>Rv3832c CONSERVED HYPOTHETICAL PROTEIN
MAMNLLHRRHCSSAGWEKAVANQLLPWALQHVELGPRTLEIGPGYGATLQ
ALLGLTASLTAVEVDNSMVERLNRRYGQRARIIRGDGTQTGLPDDHFTSV
VCFTMLHHVASAQLQDQLFAEAYRVLQPGGVFAGSDGVPSLPFRLIHIAD
TYTPIAPADLPGRLRAVGFTDIHVDVAGARLRWRATKPVAA
>Rv1372 CONSERVED HYPOTHETICAL PROTEIN
MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPRRVVNQS
DAADRVAELFLDPGQRERIPRVYQKSRITTRRMAVDPLDAKFDVFRREPA
TIRDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPG
VDVAIVKELGLSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVV
CIELCSVNAVFADDINDVVIHSLFGDGCAALVIGASQVQEKLEPGKVVVR
SSFSQLLDNTEDGIVLGVNHNGITCELSENLPGYIFSGVAPVVTEMLWDN
GLQISDIDLWAIHPGGPKIIEQSVRSLGISAELAAQSWDVLARFGNMLSV
SLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIRR
>Rv0520 POSSIBLE METHYLTRANSFERASE/METHYLASE (FRAGMENT)
MGGCSITCLNISEVPNETNRKKNRQAGLDRSIRVIHGSFDDIPEPDSGYD
VVWSQDAILHAPDRRKVLEEAFRVLRPGGELIFTDPMQADDVPDGVLQPV
YDRLNLRDLGSMRFYA
>Rv0567 PROBABLE METHYLTRANSFERASE/METHYLASE
MELSPDRIMAIGGGYGPSKVLLTAVGLGLFTELGDEAMTAEAIADRLGLL
KRPAIDFLDALVSLDLLARDGDGPGSHYRNTPETAHFLDEARPTYAGGLL
KIWNERNYRFWADLTEALKTGKAQSEVKQTGRPFFEALYADPRRLEAFMA
AMDAASRRNIELLAKRFPFERYRRLCDVGCADGLLSRIVAAAHPHLQCVS
FDLPAVTEIARRKLTAEGLGERVQACAGDFLADPLPAADVITMGQILHDW
NLDRKQQLVAKAYEALSKEGAFIVIETLIDDARRENTTGLMMSLNMLIEF
GDAFDYSAADFRGWCGEAGFRSFEVIPLAGGSSAAVAYK
>Rv3406 PROBABLE DIOXYGENASE
MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKVVFFRGQ
HQLDDAEQLAFAGLLGTPIGHPAAIALADDAPIITPINSEFGKANRWHTD
VTFAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWA
LHTNRYDYVTTKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLA
GDFVRSFVGLDSHESRVLFEVLQRRITMPENTIRWNWAPGDVAIWDNRAT
QHRAIDDYDDQHRLMHRVTLMGDVPVDVYGQASRVISGAPMEIAG
>Rv1888A CONSERVED HYPOTHETICAL PROTEIN
MVPVDLRRDWPTPLRQAGFDPNQPSAWLAEGLLAFLPPDAQDRLLDNITA
LSAPGSR
>Rv1978 CONSERVED HYPOTHETICAL PROTEIN
MGEANIREQAIATMPRGGPDASWLDRRFQTDALEYLDRDDVPDEVKQKII
GVLDRVGTLTNLHEKYARIALKLVSDIPNPRILELGAGHGKLSAKILELH
PTATVTISDLDPTSVANIAAGELGTHPRARTQVIDATAIDGHDHSYDLAV
FALAFHHLPPTVACKAIAEATRVGKRFLIIDLKRQKPLSFTLSSVLLLPL
HLLLLPWSSMRSSMHDGFISALRAYSPSALQTLARAADPGMQVEILPAPT
RLFPPSLAVVFSRSSSAPTESSECSADRQPGE
>Rv1186c CONSERVED HYPOTHETICAL PROTEIN
MRIAGVGLGQLLLALDATVVSLVDAPRGLDLPVASTALIDSDDVRLGLAA
AAGSADVFFLIGVTDDEAVRWVDDQARQRAPVAIFVKHPSDSVVAGAVRA
GSAVVAVEPRARWERLYHLVNHVLEHHGDRADPTDDSGTDLFGLAQSLAD
RIHGMISIEDAQSHVLAYSASNDEADELRRLSILGRAGPPEHLQWIGQWG
IFDALRPGREVVRVAERPELGLRPRLAIGIHQPGVGALRPPVFAGTIWVQ
QGSQPLADDAEEMLRGAAVLAARIMSRLATQPNTHALRVQQLLGLAELNA
TTAPVDVSTIARELGVAAEGNATLIGFDTAENRDTAVRHVRLVDVMALSA
SAFRHDAQVAANGSRIYVLLPQTTTGRAVTSWVRGTISALRAELGVALRA
AIAGPVAGLAEVNPARVEVDRVLESAERHPILGQVTSLAEARTTVLLDEI
VTLVGTDQRLVDPRIRDLGAQDPVLAQTLRAYLDAFGDIGAAARSLQVHP
NTVRYRIRRIEQLLSTSLGDPDVRLLFSLGLRAMERTA
>Rv3480c CONSERVED HYPOTHETICAL PROTEIN
MSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPDLLRQLVDE
SKASEVVEPWSLRLSHPELLYHPTQSWVVDDNFDLDYHVRRSALASPGDE
RELGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMHHSLIDGYT
GQKMLARSLSTDPHDTTHPLFFNIPTPGRSPADTQDSVGGGLIAGAGNVL
DGLGDVVRGLGGLVSGVGSVLGSVAGAGRSTFELTKALVNAQLRSDHEYR
NLVGSVQAPHCILNTRISRNRRFATQQYPLDRLKAIGAQYDATINDVALA
IIGGGLRRFLDELGELPNKSLIVVLPVNVRPKDDEGGGNAVATILATLGT
DVADPVQRLAAVTASTRAAKAQLRSMDKDAILAYSAALMAPYGVQLASTL
SGVKPPWPYTFNLCVSNVPGPEDVLYLRGSRMEASYPVSLVAHSQALNVT
LQSYAGTLNFGFIGCRDTLPHLQRLAVYTGEALDQLAAADGAAGLGS
>Rv1532c CONSERVED HYPOTHETICAL PROTEIN
MSDPLTAQEQHKRRQAVRELMPRTPFIGGLGIVFERYEPDDVVIRLPFRT
DLTNDGTYFHGGVIASVMDTAGAAAAWSNHDFDRGTRAATVAMSIQYTGA
AKRCDLLCHARTARRRKELTFTEITATDPDGNIVAHAVQTYRIV
>Rv0679c CONSERVED HYPOTHETICAL THREONINE RICH PROTEIN
MVEKPLRADRATHSRLATFALALAAAALPLAGCSSTANPPAATTTPATAT
TTTATSGPTAAPTVTTGESTTASIQIGDMLTYGSIGTTATLDCADGKSLN
VAGSDNTLTVNGTCETVTVGGANNKIAFDRIDERLVVVGLDNTVTYKNGD
PTIDNLGAGNRINKE
>Rv1377c PUTATIVE TRANSFERASE
MPGIDFDALYRGESPGEGLPPITTPPWDTKAPKDNVIGWHTGGWVHGDVL
DIGCGLGDNAIYLARNGYQVTGLDISPTALTTAKRRASDAGVDVKFAVGD
ATKLTGYTGAFDTVIDCGMFHCLDDDGKRSYAASVHRATRPGATLLLSCF
SNAMPPDEEWPRSTVSEQTLRDVLGGAGWDIESLEPATVRRELDGTEVEM
AFWNVRAQRRGS
>Rv3699 CONSERVED HYPOTHETICAL PROTEIN
MTDEVMDWDSAYREQGAFEGPPPWNIGEPQPELATLIAAGKVRSDVLDAG
CGYAELSLALAADGYTVVGIDLTPTAVAAATKAAEERGLTTASFVQADIT
EFAAYPAGSAGRFSTVIDSTLFHSLPVDSRDRYLSSVHRAAAPGASYYVL
VFAKGAFPAELEVKPNEVDEDELRAAVSKYWKIDEIRPAFIHVNPVTIPP
QLAGAPVEFPPYDHDEKGRVKFPAYLLTAHKAG
>Rv1147 CONSERVED HYPOTHETICAL PROTEIN
MTSGAAASASRVDHPLFARIWPVVAAHEAEAIRALRRENLAGLSGRVLEV
GAGVGTNFAYYPVAVEQVIAMEPEPRLAAKARIAAADAPVPIVVTDKTVE
EFRDTETFDAVVCSLVLCSVSDPGAVLAHLRSLLRRGGELRYLEHVASAG
ARGRVQRFVDATFWPRLAGNCHTHRHTERAILDAGFVVDSSRREWAFPAW
VPLPVSELALGRAHRT
>Rv1990A POSSIBLE DEHYDROGENASE (FRAGMENT)
MGRLEGKVAFITGVARGQGRSHAVRLADGQARALGKVDVEACGALVGEVE
VWGRDVRDDRRVFVESPADEFGACRRVARQGIRVVGLPVSQRELVEPEAG
CAARRSAAGSQ
>Rv0439c PROBABLE DEHYDROGENASE/REDUCTASE
MTANDNKTRKWSAADVPDQSGRVVVVTGANTGIGYHTAAVFADRGAHVVL
AVRNLEKGNAARARIMAARPGAHVTLQQLDLCSLDSVRAAADALRTAYPR
IDVLINNAGVMWTPKQVTKDGFELQFGTNHLGHFALTGLVLDHMLPVPGS
RVVTVSSQGHRIHAAIHFDDLQWERRYNRVAAYGQAKLANLLFTYELQRR
LGEAGKSTIAVAAHPGGSNTELTRNLPRLIRPVATVLGPLLFQSPEMGAL
PTLRAATDPTTQGGQYYGPDGFGEQRGHPKVVQSSAQSHDKDLQRRLWTV
SEELTGVSFGV
>Rv0097 POSSIBLE OXIDOREDUCTASE
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPR
EFIKLGRIIGQIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDY
MFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVST
HDPRRHIKIRPSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYI
CATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFIHTQHYQVGDI
ILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
>Rv1729c CONSERVED HYPOTHETICAL PROTEIN
MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEPLVRAVG
LDLFTKMMDGELDMSTIADVSPAVAQAMVYGNAVRTKYFDDYLLNATAGG
IRQVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPS
AIRRAVPIDLRADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFD
NITALSAPGSMVATEFVTGIADFSAERARTISNPFRCHGVDVDLASLVYT
GPRNHVLDYLAAKGWQPEGVSLAELFRRSGLDVRAADDDTIFISGCLTDH
SSISPPTAAGWR
>Rv3399 CONSERVED HYPOTHETICAL PROTEIN
MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMTRTDNDT
WDLASSVGATATMIATARALASRAENPLINDPFAEPLVRAVGIDLFTRLA
SGELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAA
GLDTRAYRLPWPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVD
LRNDWPTALKNAGFDPARPTAFSAEGLLSYLPPQGQDRLLDAITALSAPD
SRLATQSPLVLDLAEEDEKKMRMKSAAEAWRERGFDLDLTELIYFDQRND
VADYLAGSGWQVTTSTGKELFAAQGLPPFADDHITRFADRRYISAVLK
>Rv2955c CONSERVED HYPOTHETICAL PROTEIN
MQFQDVRLMRVVVCRRLGPAKGQRRWRPLDLGTTGCFENLGAQRPTYRMR
AIRMLECAMPNRLVRSLQRWRPFGLPPHRWRLAPWYWRGLQVTLEPGSAI
AWIVRLTGGFEETEIDIAAALYSALYPDRCILDVGANVGIHSLAWARLAP
VVALEPAPGTHSRLEANVAANGLQDRIRTLRTAAGDAVGEVDFFVAADSA
FSSLNDTGRIRIRERTRVPCTTLDALAAELPLPVGLLKIDVEGLERAVIA
GAAELLRRDRPVLLVEIYGGAASNPDPERTIADIRAYGYEPFVYADDAGL
QPYQRHRDDRYCYFFIPSRKG
>Rv3791 PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPDDPRRED
AAAAMKQAGARSVELIDFDALDTDSHPKMIEAAFSGGDVDVAIVAFGLLG
DAEELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAG
ERVRRANFVYGSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAH
LKEAPLTVDKEYVANLAVTASAKGKELVWAPAAFRYVMMVLRHIPRSIFR
KLPI
>Rv1509 HYPOTHETICAL PROTEIN
MFALSNNLNRVNACMDGFLARIRSHVDAHAPELRSLFDTMAAEARFARDW
LSEDLARLPVGAALLEVGGGVLLLSCQLAAEGFDITAIEPTGEGFGKFRQ
LGDIVLELAAARPTIAPCKAEDFISEKRFDFAFSLNVMEHIDLPDEAVRR
VSEVLKPGASYHFLCPNYVFPYEPHFNIPTFFTKELTCRVMRHRIEGNTG
MDDPKGVWRSLNWITVPKVKRFAAKDATLTLRFHRAMLVWMLERALTDKE
FAGRRAQWMVAAIRSAVKLRVHHLAGYVPATLQPIMDVRLTKR
>Rv2765 PROBABLE ALANINE RICH HYDROLASE
MPKTTDTAATPDGTCAVRLFTPDGPGRWPGVVMFPDAGGVRDTFDRMAAK
LAGFGYVVLLPDVYYREGDWAPFDMKTAFGDPQERARIMFMIGTLTPDRV
TRDADALLNYLASRPEVIGDRFGVCGYCMGGRMSVVVAGRLPDRVAAAAA
FHPGGLVANSPDSPHLLADRISATVYIGGAENDPSFTADHAEKLDKAFSA
AGVPHRIECYPAAHGFAVPDNPSYDAAADERHWAAMTETFGAALN
>Rv0033 acpA, PROBABLE ACYL CARRIER PROTEIN ACPA (ACP)
MKEAINATIQRILRTDRGITANQVLVDDLGFDSLKLFQLITELEDEFDIA
ISFRDAQNIKTVGDVYTSVAVWFPETAKPAPLGKGTA
>Rv2244 acpM, MEROMYCOLATE EXTENSION ACYL CARRIER PROTEIN ACPM
MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSMVEIAVQ
TEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENP
DAVANVQARLEAESK
>Rv3391 acrA1, POSSIBLE MULTI-FUNCTIONAL ENZYME WITH ACYL-CoA-REDUCTASE ACTIVITY ACRA1
MRYVVTGGTGFIGRHVVSRLLDGRPEARLWALVRRQSLSRFERLAGQWGD
RVRPLVGDLTELELSERTIAELGDIDHVLHCAAVHDTTWADATRAVIELA
ARLDATFHHVSSIAVAGDFAGHYTEADFDVGQRLPTPYHRMTFEAERLVR
STPGLRYRIYRPAVVVGDSRTGEMDTIDGPYYLFGVLAKLAVLPSFTPML
LPDIGRTNIVPVDYVADALVALMHADGRDGQTFHLTAPTAIGLRGIYRGI
AGAAGLPPLLGTLPGFVAAPVLNARGRAKVLRNMAATQLGIPAEIFDVVG
CAPTFTSDTTREALRGTGIHVPEFATYAPGLWRYWAEHLDPDRARRNDPL
LGRHVIITGASSGIGRASAIAVAKRGATVFALARNGNALDELVTEIRAHG
GQAHAFTCDVTDSASVEHTVKDILGRFDHVDYLVNNAGRSIRRSVVNSTD
RLHDYERVMAVNYFGAVRMVLALLPHWRERRFGHVVNVSSAGVQARNPKY
SSYLPTKAALDAFADVVASETLSDHITFTNIHMPLVATPMIVPSRRLNPV
RAISAERAAAMVIRGLVEKPARIDTPLGTLAEAGNYVAPRLSRRILHQLY
LGYPDSAAAQGISRPDADRPPAPRRPRRSARAGVPRPLRRLGRLVPGVHW
>Rv2276 cyp121, CYTOCHROME P450 121 CYP121
MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYAL
CTQVLEDRRFSMKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMK
AITPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVL
GIPQEDGPKLFRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITT
GLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGSFLTTALISLI
QRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVR
KGELVLVLLEGANFDPEHFPNPGSIELDRPNPTSHLAFGRGQHFCPGSAL
GRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW
>Rv0766c cyp123, PROBABLE CYTOCHROME P450 123 CYP123
MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNFWAVSRH
HDVLQGFRDSTALSNAYGVSLDPSSRTSEAYRVMSMLAMDDPAHLRMRTL
VSKGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVIS
ELIGVPDTDRARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIA
EFRRRPANNLTSALLAAELDGDRLSDQEIMAFLFLMVIAGNETTTKLLAN
AVYWAAHHPGQLARVFADHSRIPMWVEETLRYDTSSQILARTVAHDLTLY
DTTIPEGEVLLLLPGSANRDDRVFDDPDDYRIGREIGCKLVSFGSGAHFC
LGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGFAHLPISVQ
AR
>Rv2266 cyp124, Probable cytochrome P450 124 CYP124
MGLNTAIATRVNGTPPPEVPIADIELGSLDFWALDDDVRDGAFATLRREA
PISFWPTIELPGFVAGNGHWALTKYDDVFYASRHPDIFSSYPNITINDQT
PELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVS
SMIANNPDRQADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGF
GDPDLATDFDEFMQVSADIGAYATALAEDRRVNHHDDLTSSLVEAEVDGE
RLSSREIASFFILLVVAGNETTRNAITHGVLALSRYPEQRDRWWSDFDGL
APTAVEEIVRWASPVVYMRRTLTQDIELRGTKMAAGDKVSLWYCSANRDE
SKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVAFDELRRQM
PDVVATEEPARLLSQFIHGIKTLPVTWS
>Rv3545c cyp125, PROBABLE CYTOCHROME P450 125 CYP125
MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAA
PIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRF
KNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQER
AQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEM
TGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDG
EKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPE
TAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEV
FQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD
LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH
>Rv0778 cyp126, POSSIBLE CYTOCHROME P450 126 CYP126
MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEHTPDGEG
FWSVATYAETLEVLRDPVTYSSVTGGQRRFGGTVLQDLPVAGQVLNMMDD
PRHTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIA
AELPMQMICILLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGS
RLYTYALELIAGKRAEPADDMLSVVANATIDDPDAPALSDAELYLFFHLL
FSAGAETTRNSIAGGLLALAENPDQLQTLRSDFELLPTAIEEIVRWTSPS
PSKRRTASRAVSLGGQPIEAGQKVVVWEGSANRDPSVFDRADEFDITRKP
NPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEPAEWTRSNR
HTGIRHLVVELRGG
>Rv2268c cyp128, PROBABLE CYTOCHROME P450 128 CYP128
MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTRRRASSG
GIPVATSMWRDAQTVRTYGPAVAKALALRVAGKARSRLTGRHCRKFMQLT
DFDPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNH
DTLSSARGVTFSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMV
DQLARELVGGLLTQTPADVVSTVAAPMPMRAITSVLGVDGPDEAAFCRLS
NQAVRITDVALSASGLISLVQGFAGFRRLRALFTHRRDNGLLRECTVLGK
LATHAEQGRLSDDELFFFAVLLLVAGYESTAHMISTLFLTLADYPDQLTL
LAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPAGSLVLLAW
GAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILR
EIVANIDRIEVVEPPTWTTNANLRGLTRLRVAVTPRVAP
>Rv1256c cyp130, PROBA BLE CYTOCHROME P450 130 CYP130
MTSVMSHEFQLATAETWPNPWPMYRALRDHDPVHHVVPPQRPEYDYYVLS
RHADVWSAARDHQTFSSAQGLTVNYGELEMIGLHDTPPMVMQDPPVHTEF
RKLVSRGFTPRQVETVEPTVRKFVVERLEKLRANGGGDIVTELFKPLPSM
VVAHYLGVPEEDWTQFDGWTQAIVAANAVDGATTGALDAVGSMMAYFTGL
IERRRTEPADDAISHLVAAGVGADGDTAGTLSILAFTFTMVTGGNDTVTG
MLGGSMPLLHRRPDQRRLLLDDPEGIPDAVEELLRLTSPVQGLARTTTRD
VTIGDTTIPAGRRVLLLYGSANRDERQYGPDAAELDVTRCPRNILTFSHG
AHHCLGAAAARMQCRVALTELLARCPDFEVAESRIVWSGGSYVRRPLSVP
FRVTS
>Rv1394c cyp132, PROBABLE CYTOCHROME P450 132 CYP132
MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGSDITRFR
CAGRRFVSISHPDYVDHVLHEARLKYVKSDEYGPIRATAGLNLLTDEGDS
WARHRGALNSTFARRHLRGLVGLMIDPIADVTAARVPGAQFDMHQSMVET
TLRVVANALFSQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYD
TLIWCIYSGVHLPPPLREMQEITLTLDRAINSVIDRRLAEPTNSADLLNV
LLSADGGIWPRQRVRDEALTFMLAGHETTANAMSWFWYLMALNPQARDHM
LTELDDVLGMRRPTADDLGKLAWTTACLQESQRYFSSVWIIAREAVDDDI
IDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPTDRPRCAYL
PFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRP
KHGVHVIGRRR
>Rv0327c cyp135A1, POSSIBLE CYTOCHROME P450 135A1 CYP135A1
MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFSLRVPPY
ADNLVVYTRPEHIKEIFAADPRSLHAGEGNHILGFVMGEHSVLMTDEAEH
ARMRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALT
LDIILRVVFGVTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWK
RFFHNQTKIDEILYREIASRRIDSDLTARTDVLSRLLQTKDTPTKPLTDA
ELRDQLITLLLAGHETTAAALSWTLWELAHAPEIQSQVVWAAVGGDDGFL
EAVLKEGMRRHTVIASTARKVTAPAEIGGWRLPAGTVVNTSILLAHASEV
SHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTEGAVILQEI
FRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP
>Rv0568 cyp135B1, POSSIBLE CYTOCHROME P450 135B1 CYP135B1
MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFTLHVAGF
GHMVYLSDPAAIKTVFAGNPSVFHAGEANSMLAGLLGDSSLLLIDDDVHR
DRRRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITL
EVILRTVIGASDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSR
LRRRIEEADALLYAEIADRRADPDLAARTDTLAMLVRAADEDGRTMTERE
LRDQLITLLVAGHDTTATGLSWALERLTRHPVTLAKAVQAADASAAGDPA
GDEYLDAVAKETLRIRPVVYDVGRVLTEAVEVAGYRLPAGVMVVPAIGLV
HASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGATFAMVEMRV
VLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQA
TAQGAGCPAARGGGPSRAVGSQ
>Rv3059 cyp136, PROBABLE CYTOCHROME P450 136 CYP136
MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKKLAEPPP
GSGLKPVVGDAGLPILGHMIEMLRGGPDYLMFLYKTKGPVVFGDSAVLPG
VAALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRR
IMQEAFVRSRLAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDI
ASMVFMGHEPGTDHELVTKVNKAFTITTRAGNAVIRTSVPPFTWWRGLRA
RELLENYFTARVKERREASGNDLLTVLCQTEDDDGNRFSDADIVNHMIFL
MMAAHDTSTSTATTMAYQLAAHPEWQQRCRDESDRHGDGPLDIESLEQLE
SLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIAYPGMNHRL
PEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFDQLEIK
TILHRLLRRYRLELSRPDYQPRWDYSAMPIPMDGMPIVLRPR
>Rv3685c cyp137, PROBABLE CYTOCHROME P450 137 CYP137
MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGLPAPRGF
RAAFAAAYAVAYLAGGERRMLRLIRRYGPIMTMPILSLGDVAIVSDSALA
KEVFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPEHLRRRKLLTPPLHG
AALDRYVPIIENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDD
PEEVRRLGRPFERLLNLGVSEQLTVRYALRRLGALRVWPARARANTEIDD
VVMALIAQRRADPRLGERHDVLSLLVSARGESGEQLSDSEIRDDLITLVL
AGHETTATTLAWAFDLLLHHPDALRRVRAEAVGGGEAFTTAVINETLRVR
PPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYEHPHEFRPE
RFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDD
EPERIVRRSIMLVPRRGTRVRFRPAR
>Rv0136 cyp138, PROBABLE CYTOCHROME P450 138 CYP138
MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVRRYGKAF
TANILMYGRVVVVADPQLARQVFTSSPEELGNIQPNLSRMFGSGSVFALD
GDDHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSM
MHITLNAILRAIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRL
SPWGRLAEWRRQYDTVIDKLIEAERADPNFADRTDVLALMLRSTYDDGSI
MSRKDIGDELLTLLAAGHETTAATLGWAFERLSRHPDVLAALVEEVDNGG
HELRQAAILEVQRARTVIDFAARRVNPPVYQLGEWVIPRGYSIIINIAQI
HGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAAFANMEMDV
VLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR
>Rv1666c cyp139, Probable cytochrome P450 139 CYP139
MRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFANADAFSW
SQTFESLVPVDGPTALIVSDGADHRRRRSVVAPGLRHHHVQRYVATMVSN
IDTVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFLGEQLQP
LLDLTRRPPQVMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADARTAPRP
DDHMLTTLISGCSEEGTTLSDNEIRDSIVSLITAGYETTSGALAWAIYAL
LTVPGTWESAASEVARVLGGRVPAADDLSALTYLNGVVHETLRLYSPGVI
SARRVLRDLWFDGHRIRAGRLLIFSAYVTHRLPEIWPEPTEFRPLRWDPN
AADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVARAMLQLPAQ
RTHRIRAANFAALRPWPGLTVEIRKSAPAQ
>Rv1880c cyp140, Probable cytochrome p450 140 CYP140
MKDKLHWLAMHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPFYDEVRS
HGALVRNRANYLTVDHRLAHDLLRSDDFRVVSFGENLPPPLRWLERRTRG
DQLHPLREPSLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLL
DRFAEQPGIVDVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSL
DIGIPWRQYLRVQQGIRGFDCWLEGHLQQLRHAPGDDLMSQLIQIAESGD
NETQLDETELRAIAGLVLVAGFETTVNLLGNGIRMLLDTPEHLATLRQHP
ELWPNTVEEILRLDSPVQLTARVACRDVEVAGVRIKRGEVVVIYLAAANR
DPAVFPDPHRFDIERPNAGRHLAFSTGRHFCLGAALARAEGEVGLRTFFD
RFPDVRAAGAGSRRDTRVLRGWSTLPVTLGPARSMVSP
>Rv3121 cyp141, PROBABLE CYTOCHROME P450 141 CYP141
MTSTSIPTFPFDRPVPTEPSPMLSELRNSCPVAPIELPSGHTAWLVTRFD
DVKGVLSDKRFSCRAAAHPSSPPFVPFVQLCPSLLSIDGPQHTAARRLLA
QGLNPGFIARMRPVVQQIVDNALDDLAAAEPPVDFQEIVSVPIGEQLMAK
LLGVEPKTVHELAAHVDAAMSVCEIGDEEVSRRWSALCTMVIDILHRKLA
EPGDDLLSTIAQANRQQSTMTDEQVVGMLLTVVIGGVDTPIAVITNGLAS
LLHHRDQYERLVEDPGRVARAVEEIVRFNPATEIEHLRVVTEDVVIAGTA
LSAGSPAFTSITSANRDSDQFLDPDEFDVERNPNEHIAFGYGPHACPASA
YSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT
>Rv3518c cyp142, PROBABLE CYTOCHROME P450 MONOOXYGENASE 142 CYP142
MTEAPDVDLADGNFYASREARAAYRWMRANQPVFRDRNGLAAASTYQAVI
DAERQPELFSNAGGIRPDQPALPMMIDMDDPAHLLRRKLVNAGFTRKRVK
DKEASIAALCDTLIDAVCERGECDFVRDLAAPLPMAVIGDMLGVRPEQRD
MFLRWSDDLVTFLSSHVSQEDFQITMDAFAAYNDFTRATIAARRADPTDD
LVSVLVSSEVDGERLSDDELVMETLLILIGGDETTRHTLSGGTEQLLRNR
DQWDLLQRDPSLLPGAIEEMLRWTAPVKNMCRVLTADTEFHGTALCAGEK
MMLLFESANFDEAVFCEPEKFDVQRNPNSHLAFGFGTHFCLGNQLARLEL
SLMTERVLRRLPDLRLVADDSVLPLRPANFVSGLESMPVVFTPSPPLG
>Rv1785c cyp143, PROBABLE CYTOCHROME P450 143 CYP143
MTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYL
TRREDVLAALRNPKVFSSRKALQPPGNPLPVVPLAFDPPEHTRYRRILQP
YFSPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLY
GLPLEDRDRLIGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRR
NPGPDVLSQVQIGEDPLSEIEVLGLSHLLILAGLDTVTAAVGFSLLELAR
RPQLRAMLRDNPKQIRVFIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAG
SPVRLCMAAVNRDGSDAMSTDELVMDGKVHRHWGFGGGPHRCLGSHLARL
ELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRWS
>Rv1777 cyp144, Probable cytochrome p450 144 CYP144
MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAESVQDPY
PLYERMRAAGSVHRIANSDFYAVCGWDAVNEAIGRPEDFSSNLTATMTYT
AEGTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQF
TVQAADRLWVDGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKW
GYAATQLLEGLVENDQLVAAGVALMELSGYIFEQFDRAAADPRDNLLGEL
ATACASGELDTLTAQVMMVTLFAAGGESTAALLGSAVWILATRPDIQQQV
RANPELLGAFIEETLRYEPPFRGHYRHVRNATTLDGTELPADSHLLLLWG
AANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALARLEARIVLR
LLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ
>Rv0764c cyp51, CYTOCHROME P450 51 CYP51 (CYPL1) (P450-L1A1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 14-ALPHA DEMETHYLASE) (P450-14DM)
MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQLAGKQV
VLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEML
HNAALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSA
CLIGKKFRDQLDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNG
LVALVADIMNGRIANPPTDKSDRDMLDVLIAVKAETGTPRFSADEITGMF
ISMMFAGHHTSSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHA
LRQIPQLENVLKETLRLHPPLIILMRVAKGEFEVQGHRIHEGDLVAASPA
ISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRHRCVGAAFA
IMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTG
V
>Rv3215 entC, PROBABLE ISOCHORISMATE SYNTHASE ENTC (ISOCHORISMATE HYDROXYMUTASE) (ENTEROCHELIN BIOSYNTHESIS)
MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRSGTAPIL
LGALPFDVSRPAALMVPDGVLRARKLPDWPTGPLPKVRVAAALPPPADYL
TRIGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTA
YGYLVDLTSAGNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADP
KLDAANAAALASSAKNRHEHQLVVDTMRVALEPLCEDLTIPAQPQLNRTA
AVWHLCTAITGRLRNISTTAIDLALALHPTPAVGGVPTKAATELIAELEG
DRGFYAGAVGWCDGRGDGHWVVSIRCAQLSADRRAALAHAGGGIVAESDP
DDELEETTTKFATILTALGVEQ
>Rv2214c ephD, Possible short-chain dehydrogenase EphD
MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVLWDGVVP
LLAERFRIVRYDNRGVGRSSVPKPISAYTMAHFADDFDAVIGELSPGEPV
HVLAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRP
WRPRTFLRAISQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIP
VDQIHHSETLARDAAHSVKTYPANYFRSFSSSRRGRAIPIVDVPVQLIVN
SQDPYVRPYGYDQTARWVPRLWRRDIKAGHFSPMSHPQVMAAAVHDFADL
ADGKQPSRALLRAQVGRPRGYFGDTLVSVTGAGSGIGRETALAFAREGAE
IVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFAERVSAEHG
VPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVER
GTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLT
TICPGVIDTNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVAD
AIVSAVKKKKPIRPVAPEAYALYGISRVLPQALRSTARLRVI
>Rv1483 fabG1, 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG1 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE) (MYCOLIC ACID BIOSYNTHESIS A PROTEIN)
MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSG
APKGLFGVECDVTDSDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRM
TEEKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQ
ANYAASKAGVIGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQ
GALQFIPAKRVGTPAEVAGVVSFLASEDASYISGAVIPVDGGMGMGH
>Rv1350 fabG2, PROBABLE 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG2 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE)
MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEATEVAAKR
LGGDDVALAVRCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMR
TMTEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMV
GQTNYSAAKAGIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRI
WDQKLAEVPMGRAGEPSEVASVAVFLASDLSSYMTGTVLDVTGGRFI
>Rv2002 fabG3, POSSIBLE 20-BETA-HYDROXYSTEROID DEHYDROGENASE FABG3 (Cortisone reductase) ((R)-20-hydroxysteroid dehydrogenase)
MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEGKAVAAE
LADAARYVHLDVTQPAQWTAAVDTAVTAFGGLHVLVNNAGILNIGTIEDY
ALTEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVAC
HGYTATKFAVRGLTKSTALELGPSGIRVNSIHPGLVKTPMTDWVPEDIFQ
TALGRAAEPVEVSNLVVYLASDESSYSTGAEFVVDGGTVAGLAHNDFGAV
EVSSQPEWVT
>Rv0242c fabG4, PROBABLE 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG4 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE)
MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPLTGSLLI
GGAGRVVEPLRAALEKDYDLVGNNLGGRWADSFGGLVFDATGITEPAGLK
GLHEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGK
ELRRGATTALVYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADD
STPPADWEKPLDGKVAIVTGAARGIGATIAEVFARDGAHVVAIDVESAAE
NLAETASKVGGTALWLDVTADDAVDKISEHLRDHHGGKADILVNNAGITR
DKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNGSIGEGGRVIGLSSIAG
IAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGFIETQMTAA
IPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQA
MIGA
>Rv1750c fadD1, POSSIBLE FATTY-ACID-CoA LIGASE FADD1 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MTDTIQSLLRQHVSDPTIAVKYGGLQWTWSQYLAESAARAAALITIADPQ
RPTHIGSLLGNTPEMLAQLAAAGLGGYVLCGLNTTRRGDALAADVRRADC
QIVVTDADHRALLDGLDLAGARILDTSTPRWAELVAGDGAFVPYREVDTM
DPFMMIFTSGTSGNPKAVPVSHLMATFAGRSLTERFGLTEQDTCYVSMPL
FHSNAVVAGWAPAVVSGAAIAPATFSATGFLDDVRRYHATYMNYVGKPLA
YILATPERDDDADNPLRVAFGNEANDKDIEEFSRRFGVQVEDGFGSTENA
VIVIREPGTPPGSIGRGAHGVAVYNGETVTECAVARFDAHGALTNADEAI
GELVNTTGSGFFTGYYNDPEANAERMRHGMYWSGDLAYRDSEGWIYLAGR
TADWMRVDGENLTAAPIERILLRYKAINRVAVYAVPDEYVGDQVMAALVL
RAGDTFDPDAFEAFLDAQPDLSTKARPRYIRIAADLPSTATHKVLKRQLI
DEGTAVGKADTLWVREPRGSAYHHASGPAKAI
>Rv0099 fadD10, POSSIBLE FATTY-ACID-CoA LIGASE FADD10 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRYRELVAE
VGGLAADLRAQSVSRGSRVLVISDNGPETYLSVLACAKLGAIAVMADGNL
PIAAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRE
SEHSLDAASLAGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDI
LQKEGLNWVTWVVGETTYSPLPATHIGGLWWILTCLMHGGLCVTGGENTT
SLLEILTTNAVATTCLVPTLLSKLVSELKSANATVPSLRLVGYGGSRAIA
ADVRFIEATGVRTAQVYGLSETGCTALCLPTDDGSIVKIEAGAVGRPYPG
VDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNPERTAEVLI
DGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGV
REAACYEIPDEEFGALVGLAVVASAELDESAARALKHTIAARFRRESEPM
ARPSTIVIVTDIPRTQSGKVMRASLAAAATADKARVVVRG
>Rv1427c fadD12, POSSIBLE LONG-CHAIN-FATTY-ACID--CoA LIGASE FADD12 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MRIRQAFGLIATMRRAGLIAPLRPDRYLRIVAAMRREGMGFTAGFAGAAR
RCPDRPGLIDELGTLTWRQLDERGNALAAALQALPAGPPRVVGIMCRNHR
GFVDALLAVNRIGAHILLLNTSFAGPALAEVVTREGVDTVVYDEEFSATV
DRALAEKPQATRIVAWTDEDHDLTVEKLVAAHAGRRPEHTGSHGKVILLT
SGTTGTPKGARHSGGGIGTLKAILDRTPWRAEEVTVIVAPMFHAWGFSQL
VLASSLACTIVTRRRFDPEATLDLIDRHHATGLVVVPVMFDRIMDLPAEI
RNRYDGRSLRFAAASGSRMRPDVVIAFMDQFGDVIYNNYNATEAGMIATA
TPADLRTAPDTAGRPAEGTEIRILDQQFTEVPTGEVGTIYVRNDSQFDGY
TSGAAKDFHAGFMSSGDVGYLDENGRLFVVGRDDEMIVSGGENIYPIEVE
KTLATHPDVAEAAVIGVDDQQYGQRLAAFVVLKPGVSATPETLKQHVRDN
LANYKVPRDIAVLDELPRGITGKILRTELQSRVGS
>Rv3089 fadD13, PROBABLE CHAIN-FATTY-ACID-CoA LIGASE FADD13 (FATTY-ACYL-CoA SYNTHETASE)
MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCADVLTALG
IAKGDRVALLMPNSVEFCCLFYGAAKLGAVAVPINTRLAAPEVSFILSDS
GSKVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPA
VECGGDDNLFIMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRL
LLPLPMFHVAALTTVIFSAMRGVTLISMPQFDATKVWSLIVEERVCIGGA
VPAILNFMRQVPEFAELDAPDFRYFITGGAPMPEALIKIYAAKNIEVVQG
YALTESCGGGTLLLSEDALRKAGSAGRATMFTDVAVRGDDGVIREHGEGE
VVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYLYIKDRLKD
MIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQN
EVSEQQIVEYCGTRLARYKLPKKVIFAEAIPRNPTGKILKTVLREQYSAT
VPK
>Rv1058 fadD14, PROBABLE MEDIUM CHAIN FATTY-ACID-CoA LIGASE FADD14 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MYGTMQDFPLTITAIMRHGCGVHGRRTVTTATGEGYRHSSYRDVGQRAGQ
LANALRRLGVTGDQRVATFMWNNTEHLVTYFAVPSMGAVLHTLNIRLFPE
QIAYVTNEAEDRVILVDLSLARLLAPVLPKLDTVHTVIAVGEGDTTPLRE
AGKTVLRFAELIDAESPDFGWPQIDENSAAAMCYTSGTTGNPKGVVYSHR
SSFLHTMAACTTNGIGVGSSDKVLPIVPMFHANGWGLPYAALMAGADLVL
PDRHLDARSLIHMVETLKPTLAGAVPTIWNDVMHYLEKDPDHDMSSLRLV
ACGGSAVPESLMRTFEDKHDVQIRQLWGMTETSPLATMAWPPPGTPDDQH
WAFRITQGQPVCGVETRIVDDDGQVLPNDGNAVGEVEVRGPWIAGSYYGG
RDESKFDSGWLRTGDVGRIDEQGFITLTDRAKDVIKSGGEWISSVELENC
LIAHPDVLEAAVVGVPDERWQERPLAVVVVREGATVSAGDLRAFLADKVV
RWWLPERWAFVDEIPRTSVGKYDKKAIRSRYAEGAYQITEVHT
>Rv3506 fadD17, POSSIBLE FATTY-ACID-CoA SYNTHETASE FADD17 (FATTY-ACID-CoA SYNTHASE) (FATTY-ACID-CoA LIGASE)
MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAAALRERL
DPARPPHVGVLLQNTPFFSATLVAGALSGIVPVGLNPVRRGAALAGDIAK
ADCQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLAD
LFMLIFTSGTSGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLF
HSNAVLVGWAVAAACQGSMALRRKFSASQFLADVRRYGATYANYVGKPLS
YVLATPELPDDADNPLRAVYGNEGVPGDIDRFGRRFGCVVMDGFGSTEGG
VAITRTLDTPAGALGPLPGGIQIVDPDTGEPCPTGVVGELVNTAGPGGFE
GYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWMRVDGENLG
TAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRA
FLTEQPDLGHKQWPSYVRVSAGLPRTMTFKVIKRQLSAEGVACADPVWPI
RR
>Rv3513c fadD18, PROBABLE FATTY-ACID-CoA LIGASE FADD18 (FRAGMENT) (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MAASLSENLSCHSSNMCRLSGNAATNLERPGEEPPGDRCTRRQAVRPART
LAKKGNIPVGYYKDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGR
GSVSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQA
RPGCRPSLAELDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQ
TEARPADDVHAGHVTSGS
>Rv3515c fadD19, PROBABLE FATTY-ACID-CoA LIGASE FADD19 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLAHHLIDQ
GVQKDDKVGLYCRNRIEIVIAMLGIVKAGAILVNVNFRYVEGELRYLFDN
SDMVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYS
AIAAGSPERDFGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTD
FATGEFVKDEYDLAKAAAANPPMIRYPIPPMIHGATQSATWMALFSGQTT
VLAPEFNADEVWRTIHKHKVNLLFFTGDAMARPLVDALVKGNDYDLSSLF
LLASTAALFSPSIKEKLLELLPNRVITDSIGSSETGFGGTSVVAAGQAHG
GGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYYKDEKKTAE
TFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAA
LKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAELDSFVRSEIA
GYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG
>Rv0270 fadD2, PROBABLE FATTY-ACID-CoA LIGASE FADD2 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MPNLTDLPGQAVSKLQKSIGQYVARGTAELHYLRKIIESGAIGLEPPLNY
AALAADIRKWGEVGMLPSHNARRAPNRAAVIDEEGTLTFSELDEAAHAVA
NGLLAKGVRAGDGVAILARNHRWFVIANYGAARVGARIILLNSEFSGPQI
KEVSDREGAKVIIYDDEYTKAVSLAQPPLGKLRALGVNPDDDKPSGSSDE
TLAELIAHSSTAPAPKASRRASIIILTSGTTGTPKGANRNTPPTLAPIGG
ILSHVPFKAGEVTLLPSPMFHALGYMHAALAMFLGSTLVLRRRFKPALVL
EDIEKHKATSMVVVPVMLSRILDQLEKTEPKPDLSSLKIVFVSGSQLGAE
LATRALGDLGPVIYNMYGSTEVAFATIAGPKDLQFNPSTVGPVVKGVTVK
ILDENGNEVPQGAVGRIFVGNAFPFEGYTGGGGKQIIDGLLSSGDVGYFD
ERGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEAAAIGVDDKEF
GARLRAFVVKKPGADLDEDTIKQYVRDHLARYKVPREVIFLDELPRNPTG
KVLKRELRKL
>Rv1185c fadD21, PROBABLE FATTY-ACID--CoA LIGASE FADD21 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEVFRRTRI
VAHEVRRHCTTGDRAVILAPQGLAYIAAFLGSMQAGAIAVPLSVPQIGSH
DERVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLT
GNSPSFRVKDLPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFG
DRNGVAPPDTTIVSWLPFYHDMGLVLGIIAPILGGYRSELTSPLAFLQRP
ARWLHSLANGSPSWSAAPNFAFELAVRKTTDADIEGLDLGNVLGITSGAE
RVHPNTLSRFCNRFAPYNFREDMIRPSYGLAEATLYVASRNSGDKPEVVY
FEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCIECPAGTIG
EIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLS
EDEMFIVGRMKDMLIVYGRNHYPEDIESTVQEITGGRVAAISVPVDHTEK
LVTVIELKLLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGS
IPTTTSGKIRRAACVEQYRLQQFTRLDG
>Rv2948c fadD22, PROBABLE FATTY-ACID-CoA LIGASE FADD22 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLGEVLRNR
GLSSGDRVLLCLPDSPDLVQLLLACLARGVMAFLANPELHRDDHALAARN
TEPALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYA
TYTSGTTGPPKAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAY
GLGNSVWFPLATGGSAVINSAPVTPEAAAILSARFGPSVLYGVPNFFARV
IDSCSPDSFRSLRCVVSAGEALELGLAERLMEFFGGIPILDGIGSTEVGQ
TFVSNRVDEWRLGTLGRVLPPYEIRVVAPDGTTAGPGVEGDLWVRGPAIA
KGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEVIGGVNVDP
REVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDL
HRGLLNRLSAFKVPHRFAVVDRLPRTPNGKLVRGALRKQSPTKPIWELSL
TEPGSGVRAQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAV
CAEAAKMLGEPDPWSVDQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPET
VGWDYGSISGLAQYLEAELAGGHGRLKSAGPVNSGATGLWAIEEQLNKVE
ELVAVIADGEKQRVADRLRALLGTIAGSEAGLGKLIQAASTPDEIFQLID
SELGK
>Rv3826 fadD23, PROBABLE FATTY-ACID-CoA LIGASE FADD23 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQVYRRTLN
VAAEVRRHAAIGDRAVILAPQGLDYIVAFLGALQAGLIAVPLSAPLGGAS
DERVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLD
SPIRSNIVDDSLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYF
ADTGAVPPLDLFIMSWLPFYHDMGLVLGVCAPIIVGCGAVLTSPVAFLQR
PARWLQLMAREGQAFSAAPNFAFELTAAKAIDDDLAGLDLGRIKTILCGS
ERVHPATLKRFVDRFSRFNLREFAIRPAYGLAEATVYVATSQAGQPPEIR
YFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTNTECPPGTI
GEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFV
SEDKFFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAIAVPSNGVE
KLVAIVELNNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIP
ITTSGKVRRAECVKLYRHNEFTRLDAKPLQASDL
>Rv1529 fadD24, PROBABLE FATTY-ACID-CoA LIGASE FADD24 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MVASSIPTALRERASVHPNGAAITYIDYEQDWAGVAETLTWSQLYRRMLN
VAEPLRHVGATGDRAVILAPQGIEYVVGFLGALQAGRIAVPLPVPHAGAH
DERTISVLSDTSPAVILTTSGAVDDVRECAQPQPGQSAPSIVELDLLDLD
SRQRSRSPGARPTGRDTPETAYLQYTSGSTRTPAGVMVSNKNVFANFEQI
VADFFAPEGGVVPPDLTVVSWLPLYHDMGLLLGAIMPILAGVPTVLTSPV
GFLQRPARWIQLLARNGRTISAGPNFAFELAVRKTSDDDMDGLDLAGVHT
ILNGSERVHPATLKRFAERFGRFNFAAAALRPAYGMAEATVYIATRNVNE
PPEIVDFESEKLPAGQAIRCPSGSGTPLVSYGVPRSQLVRIVDPDTCIEC
PQGSVGEIWVQGGNVASGYWHKPEESKRTFGARIVTPSAGTPEAPWLRTG
DSGFVSGGELFIIGRIKDLLIVYGRNHAPDDIEATIQEITSGRCAAIAVP
DHGTEKLVAIIELKKRGDSDEDVADRLRIVKRDVAAAIFDSHGLSVADLV
LVSPGSIPITTSGKIRRAQCVQLYRRREFTRLDA
>Rv1521 fadD25, PROBABLE FATTY-ACID-CoA LIGASE FADD25 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWSQLYRRT
LNLAAQLREHGSTGDRALILAPQSLDYVVSFIASLQAGIVAVPLSIPQGG
AHDERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLD
LDARPSSGSRSAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIM
TSYYGVYGKVAPPGSTVVSWLPFYHDMGFVLGLILPILAGIPAVLTSPIG
FLQRPARWIQMLASNTLAFTAAPNFAFDLASRKTKDEDMEGLDLGGVHGI
LNGSERVQPVTLKRFIDRFAPFNLDPKAIRPSYGMAEATVYVATRKAGQP
PKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVDPDTGIERP
AGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGD
SGFLSEGELFIMGRIKDLLIVYGRNHSPDDIEATIQTISPGRCAAIAVSE
HGAEKLVAIIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVL
VSPGSIPITTSGKIRRAQCVELYRQDEFTRLDA
>Rv2930 fadD26, FATTY-ACID-CoA LIGASE FADD26 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWSQVYSRA
CIIAEELKLCGLPGDRVAVLAPQGLEYVLAFLGALQAGFIAVPLSTPQYG
IHDDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLD
LDSPRQMPAFSRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYG
YFGDPAKIPTGTVVSWLPLYHDMGLILGICAPLVARRRAMLMSPMSFLRR
PARWMQLLATSGRCFSAAPNFAFELAVRRTSDQDMAGLDLRDVVGIVSGS
ERIHVATVRRFIERFAPYNLSPTAIRPSYGLAEATLYVAAPEAGAAPKTV
RFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPETMVENPPG
VVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLG
VISDGELFIMGRIKDLLIVDGRNHYPDDIEATIQEITGGRAAAIAVPDDI
TEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVS
PGSIPITTSGKIRRSACVERYRSDGFKRLDVAV
>Rv2941 fadD28, FATTY-ACID-CoA LIGASE FADD28 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQLYRRTLN
VAQELSRCGSTGDRVVISAPQGLEYVVAFLGALQAGRIAVPLSVPQGGVT
DERSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLD
APNGYTFKEDEYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYF
ADTDGIPPPNSALVSWLPFYHDMGLVIGICAPILGGYPAVLTSPVSFLQR
PARWMHLMASDFHAFSAAPNFAFELAARRTTDDDMAGRDLGNILTILSGS
ERVQAATIKRFADRFARFNLQERVIRPSYGLAEATVYVATSKPGQPPETV
DFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTCIECPDGTV
GEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFV
TDGKMFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAISVPGDRST
EKLVAIIELKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAP
GSIPITTSGKVRRGACVEQYRQDQFARLDA
>Rv2950c fadD29, PROBABLE FATTY-ACID-CoA LIGASE FADD29 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MKTNSSFHAAGEVATQPAWGTGEQAAQPLNGSTSRFAMSESSLADLLQKA
ASQYPNRAAYKFIDYDTDPAGFTETVTWWQVHRRAMIVAEELWIYASSGD
RVAILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIHDERISSALRDSAP
SIILTTSSVIDEVTTYAPHACAAQGQSAPIVVAVDALDLSSSRALDPTRF
ERPSTAYLQYTSGSTRAPAGVVLSHKNVITNCVQLMSDYIGDSEKVPSTP
VSWLPFYHDMGLMLGIILPMINQDTAVLMSPMAFLQRPARWMQLLAKHRA
QISSAPNFGFELAVRRTSDDDMAGLDLGHVRTIVTGAERVNVATLRRFTE
RFAPFNLSETAIRPSYGLAEATVYVATAGPGRAPKSVCFDYQQLSVGQAK
RAENGSEGANLVSYGAPRASTVRIVDPETRMENPAGTVGEIWVQGDNVGL
GYWRNPQQTEATFRARLVTPSPGTSEGPWLRTGDLGVIFEGELFITGRIK
ELLVVDGANHYPEDIEATIQEITGGRVVAIAVPDDRTEKLVTIIELMKRG
RTDEEEKNRLRTVKREVASAISRSHRLRVADVVMVAPGSIPVTTSGKVRR
SASVERYLHHEFSRLDAMA
>Rv3561 fadD3, PROBABLE FATTY-ACID-CoA LIGASE FADD3 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MINDLRTVPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGAAAALIA
LGVEPADRVAIWSPNTWHWVVACLAIHHAGAAVVPLNTRYTATEATDILD
RAGAPVLFAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIA
TGAGALDAVAARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASAS
WAANGKITSDDRYLCINPFFHNFGYKAGILACLQTGATLIPHVTFDPLHA
LRAIERHRITVLPGPPTIYQSLLDHPARKDFDLSSLRFAVTGAATVPVVL
VERMQSELDIDIVLTAYGLTEANGMGTMCRPEDDAVTVATTCGRPFADFE
LRIADDGEVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDIGAVDQAGN
LRITDRLKDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVG
RAFVVARPGTGLDEASVIAYTREHLANFKTPRSVRFVDVLPRNAAGKVSK
PQLRELG
>Rv0404 fadD30, PROBABLE FATTY-ACID-CoA LIGASE FADD30 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSRVTAVSA
YLISYGRHADRRRTAAISAPQGLDYVAGFLGALCAGWTPVPLPEPLGSLR
DKRTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEP
SGDNCDLDSQLSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYF
RHEGGAPRLPSSVVSWLPLYHDMGLMVGLFIPLFVGCPVILTSPEAFIRK
PARWMQLLAKHQAPFSAAPNFAFDLAVAKTSEEDMAGLDLGHVNTIINGA
EQVQPNTITKFLRRFRPYNLMPAAVKPSYGMAEAVVYLATTKAGSPPTST
EFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDPDSNIELGP
GRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGD
LGFIVGDEFYIVGRMKDLIIQDGVNHYPDDIETTVKEFTGGRVAAFSVSD
DGVEHLVIAAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLL
VPPGALPKTTSGKISRAACAKQYGANKLQRVATFP
>Rv1925 fadD31, PROBABLE ACYL-CoA LIGASE FADD31 (ACYL-COA SYNTHETASE) (ACYL-CoA SYNTHASE)
MNDGSRQELRVRSGLLQIEDCLDADGGIALPAGTTLISLIERNIKYVGDL
VAYRYLDHARSAAGCALEVTWTQFGMRLAAIGAHVQRFAGPGDRVAILAP
QGIDYVCGFYAAIKAGTVAVPLFAPELPGHAERLDTALRDSEPAVILTTA
AAKNAVEGFLNNVPRLRKPTVLVIDQIPDREGELFVPVEMDIDAVSHLQY
TSGSTRPPVGVEITHRAVGTNLVQMILSIDLLNRNTHGVSWLPLYHDMGL
SMIGFPAVYGGHSTLMSPTAFVRRPLRWIQALSEGSRTGRVVTAAPNFAY
EWAAQRGLPAQGDDVDLSNVVLIIGSEPVSIDAVTTFNKAFAPYGLPRTA
FKPSYGIAEATLLVATIDHAAEPTVVYLDPEQLGAGHATRVAPDAPNAVV
HVSCGHVARSLWAVIVDPDTGPEAGAELPDGEIGEVWLQGDNVARGYWGR
PEETRMTFGARLQSPLAEGSHADGSAIDDTWLRTGDLGVYLDGELYITGR
IADLLTIDGRNHYPQDIEATAAEASPMVRRGYITAFTVPASDGDDRNQRL
VIIAERAAGTSRSDPRPALDAIRAAVCNRHGLSVADLSFLPAGAIPRTTS
GKLARQACRAQYLSGRLGVH
>Rv3801c fadD32, PROBABLE FATTY-ACID-CoA LIGASE FADD32 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE
MFVTGESGMAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKLAYRFLD
FSTERDGVARDILWSDFSARNRAVGARLQQVTQPGDRVAILCPQNLDYLI
SFFGALYSGRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVR
KFIRARSAKERPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRI
PSGVQITHLNLPTNVVQVLNALEGQEGDRGVSWLPFFHDMGLITVLLASV
LGHSFTFMTPAAFVRRPGRWIRELARKPGETGGTFSAAPNFAFEHAAVRG
VPRDDEPPLDLSNVKGILNGSEPVSPASMRKFFEAFAPYGLKQTAVKPSY
GLAEATLFVSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPNAVAQVSAG
KVGVSEWAVIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFK
NILKSRISESRAEGAPDDALWVRTGDYGTYFKDHLYIAGRIKDLVIIDGR
NHYPQDLECTAQESTKALRVGYAAAFSVPANQLPQTVFDDSHAGLKFDPE
DTSEQLVIVGERAAGTHKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGT
IPRTSSGKIGRRACRAAYLDGSLRSGVGSPTVFATSD
>Rv1345 fadD33, POSSIBLE POLYKETIDE SYNTHASE FADD33
MSELAAVLTRSMQASAGDLMVLDRETSLWCRHPWPEVHGLAESVAAWLLD
HDRPAAVGLVGEPTVELVAAIQGAWLAGAAVSILPGPVRGANDQRWADAT
LTRFLGIGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVAS
EGPAVLQGTAGSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWL
PLYHDMGLAFVLSAALAGAPLWLAPTTAFTASPFRWLSWLSDSGATMTAA
PNFAYNLIGKYARRVSEVDLGALRVTLNGGEPVDCDGLTRFAEAMAPFGF
DAGAVLPSYGLAESTCAVTVPVPGIGLLADRVIDGSGAHKHAVLGNPIPG
MEVRISCGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPDDWFATGDL
GYLGAGGLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGT
GDRSTRPGLVVAAEFRGPDEANARAELIQRVASECGIVPSDVVFVSPGSL
PRTSSGKLRRLAVRRSLEMAD
>Rv0035 fadD34, PROBABLE FATTY-ACID-CoA LIGASE FADD34 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MTAALLSPAIAWQQISACTDRTLTITCEDSEVISYQDLIARAAACIPPLR
RLDLKRGEPVLITAHTNLEFLSCFLGLMLHGAVPVPIPPREALKTTERFM
TRLGPLLRHHRVLICTPAEHDEIRAAASTDCQISRFTALAEAGDEQFGRA
TAQQLADTATADWPLCTLDDDAYVQYTSGSTAAPRGVVITYRNLLSNMRA
MAVGSQFQHGDVMGSWLPLHHDMGLVGSLFAALFNSVSAVFTTPHRFLYD
PLGFLRLLTSSGATHTFMPNFALEWLINAYHRRGADIEGIDLHKMRRLII
ASEPVHAEGMRRFAATFAGVGLAPTALGSGYGLAEATVAVSMSAPNTGFR
TETHAAAEVVTGGRVLPGYEVRIDAAPGARAGTIKLRGDSVAAKAYVGGK
KLDALDEEGFCDTHDLGFLVDDEIVILGRQDEVFIVHGENRFPYDIEFII
RGESEQHRTKVACFGVNERVVVVLESPLDSIIDKAEADRLRCQVVAATGL
QLDELITVRRGAIPTTTSGKLKRRAVAQAYRDGTLPRLATHAWTADPDSA
PKTTRSSLEGAH
>Rv2505c fadD35, PROBABLE FATTY-ACID-CoA LIGASE FADD35 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MAAAEVVDPNRLSYDRGPSAPSLLESTIGANLAATAARYGHREALVDMVA
RRRFNYSELLTDVHRLATGLVRAGIGPGDRVGIWAPNRWEWVLVQYATAE
IGAILVTINPAYRVREVEYALRQSGVAMVIAVASFKDADYAAMLAEVGPR
CPDLADVILLESDRWDALAGAEPDLPALQQTAARLDGSDPVNIQYTSGTT
AYPKGVTLSHRNILNNGYLVGELLGYTAQDRICIPVPFYHCFGMVMGNLA
ATSHGAAMVIPAPGFDPAATLRAVQDERCTSLYGVPTMFIAELGLPDFTD
YELGSLRTGIMAGAACPVEVMRKVISRMHMPGVSICYGMTETSPVSTQTR
ADDSVDRRVGTVGRVGPHLEIKVVDPATGETVPRGVVGEFCTRGYSVMAG
YWNDPQKTAEVIDADGWMHTGDLAEMDPSGYVRIAGRIKDLVVRGGENIS
PREIEELLHTHPDIVDGHVIGVPDAKYGEELMAVVKLRNDAPELTIERLR
EYCMGRIARFKIPRYLWIVDEFPMTVTGKVRKVEMRQQALEYLRGQQ
>Rv1193 fadD36, PROBABLE FATTY-ACID-CoA LIGASE FADD36 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MLLASLNPAVVSAADIADAVRIDGDVLSRSDLVGAATSVAERVAGAHRVA
VLATPTASTVLAITGCLIAGVPVVPVPADVGVTERRHMLTDSGVQAWLGP
LPDDPAGLPHIPVRTHARSWHRYPEPSPGAIAMVVYTSGTTGPPKGVQLS
RRAIAADLDALAEAWQWTAEDVLVHGLPLYHVHGLVLGLLGSLRFGNRFV
HTGKPTPAGYAQACYEAHGTLFFGVPTVWSRVAADQAAAGALKPARLLVS
GSAALPVPVFDKLVQLTGHRPVERYGASESLITLSTRADGERRPGWVGLP
LAGVQTRLVDDDGGEVPHDGETVGKLQVRGPTLFDGYLNQPDATAAAFDA
DSWYRTGDVAVVDGSGMHRIVGRESVDLIKSGGYRVGAGEIETVLLGHPD
VAEAAVVGVPDDDLGQRIVAYVVGSANVDADGLINFVAQQLSVHKRPREV
RIVDALPRNALGKVLKKQLLSEG
>Rv0214 fadD4, PROBABLE FATTY-ACID-CoA LIGASE FADD4 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MPRGELYKRFRLVMGGIAPCGSGRRAATYPRRMQIRPYIGADKPAVILYP
SGTVISFDELEARANRLAHWFRQAGLREDDVVAILMENNEHVHAVMWAAR
RSGLYYVPINTHLTASEAAYIVDNSGAKAIVGSAALRETCHGLAEHLPGG
LPDLLMLAGGGLVGWMTYPECVADQPDTPIEDEREGDLLQYSSGTTGRPK
GIKRELPHVSPDAAPGMMPALLDFWMDADSVYLSPAPMYHTAPSVWTMSA
LAAGVTTVVMEKFDAEGALDAIQRYRVTHAQFVPAMFVRMLKLPEAVRNS
YDMSSLRRVIHAAAPCPVQIKEQMIHWWGPIIDEYYASSEASGSTLITAE
DWLTHPGSVGKPIQGGVHIVGADGSELPPNQPGEIYFEGGYPFEYLNDPA
KTAASRNKHGWVTVGDVGYLDDDGYLFLTGRRHHMIISGGVNIYPQEAEN
LLVAHPKVLDAAVFGVPDDEMGQRVMAAVQTVDSADANDQFAGELLAWLR
DRLSHFKCPRSIAFEPQLPRTDTGKLYKSGLVEKYSV
>Rv0166 fadD5, PROBABLE FATTY-ACID-CoA LIGASE FADD5 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MTAQLASHLTRALTLAQQQPYLARRQNWVNQLERHAMMQPDAPALRFVGN
TMTWADLRRRVAALAGALSGRGVGFGDRVMILMLNRTEFVESVLAANMIG
AIAVPLNFRLTPTEIAVLVEDCVAHVMLTEAALAPVAIGVRNIQPLLSVI
VVAGGSSQDSVFGYEDLLNEAGDVHEPVDIPNDSPALIMYTSGTTGRPKG
AVLTHANLTGQAMTALYTSGANINSDVGFVGVPLFHIAGIGNMLTGLLLG
LPTVIYPLGAFDPGQLLDVLEAEKVTGIFLVPAQWQAVCTEQQARPRDLR
LRVLSWGAAPAPDALLRQMSATFPETQILAAFGQTEMSPVTCMLLGEDAI
AKRGSVGRVIPTVAARVVDQNMNDVPVGEVGEIVYRAPTLMSCYWNNPEA
TAEAFAGGWFHSGDLVRMDSDGYVWVVDRKKDMIISGGENIYCAELENVL
ASHPDIAEVAVIGRADEKWGEVPIAVAAVTNDDLRIEDLGEFLTDRLARY
KHPKALEIVDALPRNPAGKVLKTELRLRYGACVNVERRSASAGFTERREN
RQKL
>Rv1206 fadD6, PROBABLE FATTY-ACID-CoA LIGASE FADD6 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPNSKASIG
TVFQDRAARYGDRVFLKFGDQQLTYRDANATANRYAAVLAARGVGPGDVV
GIMLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIA
ESDLVSAVAECGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKD
TAFYIFTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKGSDTLYSCLPL
YHNNALTVAVSSVINSGATLALGKSFSASRFWDEVIANRATAFVYIGEIC
RYLLNQPAKPTDRAHQVRVICGNGLRPEIWDEFTTRFGVARVCEFYAASE
GNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRVRRVPDGEP
GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMG
HAAFVDRLGDTFRWKGENVATTQVEAALASDQTVEECTVYGVQIPRTGGR
AGMAAITLRAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKS
RKVELRNQAYGADIEDPLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG
>Rv0119 fadD7, PROBABLE FATTY-ACID-CoA LIGASE FADD7 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MASDFGPRIADLVEVAATRLPEAPALVVTADRIAISHRDLARLVDELAGQ
LTRSGLLPGDRVALRMGSNAEFVVALLAASRADLVVVPLDPALPITEQRV
RSQAAGARVVLIDADGPHDRAEPTTRWWPLTVNVGGDSGPSGGTLSVHLD
AATEPNPATSTPEGLRPDDAMIMFTGGTTGLPKMVPWTHANIASSVRAII
TGYRLSPRDATVAVMPLYHGHGLIASLLATLASGGAVSLPARGRFSAHTF
WDDIKAVGATWYTAVPTIHQILLERSATEPSGRKPAALRFIRSCSAPLTA
QAALALQTEFAAPVVCAFGMTEATHQVTTTQIEGIDQTETPVVSTGLVGR
STGAQIRIVGSDGLPLPAGAVGEIWLRGTTVVRGYLGDPTITAANFTDGW
LRTGDLGSLSAAGDLSIRGRIKELINRGGEKISPERVEGVLASHPNVMEA
AVFGVPHQLYGEAVAAVIVPRESAPPTREELVQFCRERLAAFEIPASFQE
ASGLPHTAKGSLDRRAVAERFGHSV
>Rv0551c fadD8, PROBABLE FATTY-ACID-CoA LIGASE FADD8 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELLRSPTHN
GHLLVGALKRHQNKPVLFLGDTRLTGGQLADRISQYIQAFEALGAGTGVA
VGLLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLI
IDPNPMFVERALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQP
QPLVAADLPPDQVIGLTYTGGTTGKPKGVIGTAQSIATMTSIQLAEWEWP
ANPRFLMCTPLSHAGAAFFTPTVIKGGEMIVLAKFDPAEVLRIIEEQRIT
ATMLVPSMLYALLDHPDSHTRDLSSLETVYYGASAINPVRLAEAIRRFGP
IFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVALLDEHGKPV
KQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYY
IVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVVGAPDEKWGEAVTA
VVVLRSNAARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGL
GKPDKKAVRARFWEGAGRAVG
>Rv2245 kasA, 3-OXOACYL-[ACYL-CARRIER PROTEIN] SYNTHASE 1 KASA (BETA-KETOACYL-ACP SYNTHASE) (KAS I)
MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIHALEDEF
VTKWDLAVKIGGHLKDPVDSHMGRLDMRRMSYVQRMGKLLGGQLWESAGS
PEVDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGA
AAVIGLQLGARAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEG
PIEALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFGEAGALMLIET
EEHAKARGAKPLARLLGAGITSDAFHMVAPAADGVRAGRAMTRSLELAGL
SPADIDHVNAHGTATPIGDAAEANAIRVAGCDQAAVYAPKSALGHSIGAV
GALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYGDYRYAVNN
SFGFGGHNVALAFGRY
>Rv2246 kasB, 3-OXOACYL-[ACYL-CARRIER PROTEIN] SYNTHASE 2 KASB (BETA-KETOACYL-ACP SYNTHASE) (KAS I)
MGVPPLAGASRTDMEGTFARPMTELVTGKAFPYVVVTGIAMTTALATDAE
TTWKLLLDRQSGIRTLDDPFVEEFDLPVRIGGHLLEEFDHQLTRIELRRM
GYLQRMSTVLSRRLWENAGSPEVDTNRLMVSIGTGLGSAEELVFSYDDMR
ARGMKAVSPLTVQKYMPNGAAAAVGLERHAKAGVMTPVSACASGAEAIAR
AWQQIVLGEADAAICGGVETRIEAVPIAGFAQMRIVMSTNNDDPAGACRP
FDRDRDGFVFGEGGALLLIETEEHAKARGANILARIMGASITSDGFHMVA
PDPNGERAGHAITRAIQLAGLAPGDIDHVNAHATGTQVGDLAEGRAINNA
LGGNRPAVYAPKSALGHSVGAVGAVESILTVLALRDQVIPPTLNLVNLDP
EIDLDVVAGEPRPGNYRYAINNSFGFGGHNVAIAFGRY
>Rv2046 lppI, Probable lipoprotein lppI
MRIAALVAVSLLIAGCSREVGGDVGQSQTIAPPAPAPSAAPSTPPAAGAP
ITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDA
RHTSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGP
FVYGNGPELANGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPF
GCLKPAPPPDGVGVAFGC
>Rv3298c lpqC, POSSIBLE ESTERASE LIPOPROTEIN LPQC
MPWARMLSLIVLMVCLAGCGGDQLLARHASSVATFQFGGLTRSYRLHVPP
AEPSGLVISLHGGGGTGAGQEALTDFDAVADAADLLVVYPDGYDKSWADG
RGASPADRRHLDDVGFLVALAAKLVHDFDIAPGHVFATGMSNGGFMSNRL
ACDRADIFAAVAPVAGTLGVGVTCNPSRPVSVLEAHGTADPLVPFNGGAV
RGRGGLSHSISVASLVDRWRAVDGCQGDPSAAELPDVGDGTMVHLFDSSS
CAAGTEVISYQIDNGGHTWPGGRQYLPKAVIGATTRAFDGSQVIAQFFAT
HGRD
>Rv0671 lpqP, POSSIBLE CONSERVED LIPOPROTEIN LPQP
MLRRVAILLAAVLAFAGCSGGTRLAAGFGNGNSVHTLDVDGAGRSYRLYK
PVGLPSSAPLVVMLHGGFGSAKQAERSYGWDELADSEKFLVAYPDGYHRA
WNANGGGCCGRPAREGVDDIGFVRAVVADIANNVSIDPARVYVTGMSNGA
IMSYTLACNTSIFAAIGVVSGTQLDPCQSPRPVSVIHIHGTADPLVRYHG
GPGAGFARIDGPPVPDLNAFWREVNRCGALDTTTEGPVTTSGATCADNRR
VVLLTVDDAGHRWPSFATQTLWRFFAAHFR
>Rv0173 lprK, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRK (MCE-FAMILY LIPOPROTEIN MCE1E)
MMSVLARMRVMRHRAWQGLVLLVLALLLSSCGWRGISNVAIPGGPGTGPG
SYTIYVQMPDTLAINGNSRVMVADVWVGSIRAIKLKNWVATLTLSLKKDV
TLPKNATAKIGQTSLLGSQHVELAAPPDPSPVPLKDGDTIPLKRSSAYPT
TEQTLASIATLLRGGGLVNLEGIQQEINAIVTGRADQIRAFLGKLDTFTD
ELNQQRDDITRAIDSTNRLLAYVGGRSEVLNRVLTDLPPLIKHFADKQEL
LINASDAVGRLSQSADQYLSAARGDLHQDLQALQCPLKELRRAAPYLVGA
LKLILTQPFDVDTVPQLVRGDYMNLSLTLDLTYSAIDNAFLTGTGFSGAL
RALEQSFGRDPETMIPDIRYTPNPNDAPGGPLVERGNRQC
>Rv0593 lprL, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRL (MCE-FAMILY LIPOPROTEIN MCE2E)
MRCGVSAGSANGKPNRWTLRCGVSAGHRGSVFLLAVLLAPVVLTSCTWRG
IANVPLPVGRGMGPDRMTIYVQMPDTLALNTNSRVRVADVWVGTVRDISL
RNWIATLTLELEPTVRLPANATAKIGQTSLLGTQHVELAAPPIPSPQPLK
SGDTIGLKNSSAYPTVERTLASVALILTGGGIVNLDVIQTEILNILDGHA
GQIREFLERLATFTAELNNQRGDLTRAIDSTNQLLTIIANRNDTLDRVLT
DVPPLIEHFADTGQLFADATESLGRFSEVANRALAATRPNLHQTLQSLQR
PLRQLERASPYVVGALKLGLTAPFNIDEVPNVIRGDYVNVSATFDVTLSA
LDNALLSGTGISGMLRALEQAWGRDPDTMIPDVRYTPNPNDAPGGPLVER
AE
>Rv1970 lprM, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRM (MCE-FAMILY LIPOPROTEIN MCE3E)
MRIGLTLVMIAAVVASCGWRGLNSLPLPGTQGNGPGSFAVQAQLPDVNNI
QPNSRVRVADVTVGHVTKIERQGWHALVTMRLDGDVDLPANATAKIGTTS
LLGSYHIELAPPKGEARQGKLRDGSLIALSHGSAYPSTEQTLAALSLVLN
GGGLGQVQDITEALSTAFAGREHDLRGLIGQLDTFTAYLNNQSGDIIAAT
DSLNRLVGKFADQQPVFDRALATIPDALAVLADERDTLVEAAEQLSKFSA
LTVDSVNKTTANLVTELRQLGPVLESLANSGPALTRSLSLLATFPFPNET
FQNFQRGEYANLTAIVDLTLSRIDQGLLTGTRWECHLTQLELQWGRTIGQ
FPSPCTAGYRGTPGNPLTIAYRWDQGP
>Rv3495c lprN, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRN (MCE-FAMILY LIPOPROTEIN MCE4E)
MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSVTVEMAD
VATLPQNSPVMVDDVTVGSVAGIVAVQRPDGSFYAAVKLDLDKNVLLPAN
AVAKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEV
FSALGVVVNKGNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNR
QVHDIIDALDGLNRVSAILARDKDNLGRALDTLPDAVRVLNQNRDHIVDA
FAALKRLTMVTSHVLAETKVDFGEDLKDLYSIVKALNDDRKDFVTSLQLL
LTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRIGETFFTTAYFDPNMAHMD
EILNPPDFLIGELANLSGQAADPFKIPPGTASGQ
>Rv2940c mas, PROBABLE MULTIFUNCTIONAL MYCOCEROSIC ACID SYNTHASE MEMBRANE-ASSOCIATED MAS
MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPDRWDADD
YYDPEPGVPGRSVSRWGGFLDDVAGFDAEFFGISEREATSIDPQQRLLLE
TSWEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTG
LNNSVASGRIAHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLAL
AGGCAVLLEPHASVAASAQGMLSSTGRCHSFDADADGFVRSEGCAMVLLK
RLPDALRDGNRIFAVVRGTATNQDGRTETLTMPSEDAQVAVYRAALAAAG
VQPETVGVVEAHGTGTPIGDPIEYRSLARVYGAGTPCALGSAKSNMGHST
ASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFVPQAVTPWP
NGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLF
MLSSTSSDALRQTARQLATWVEEHQDCVAASDLAYTLARGRAHRPVRTAV
VAANLPELVEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQ
LLASEPVFAATIAKLEPVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAV
QVALAATMEQTYGVRPGAVVGHSMGESAAAVVAGALSLEDAARVICRRSK
LMTRIAGAGAMGSVELPAKQVNSELMARGIDDVVVSVVASPQSTVIGGTS
DTVRDLIARWEQRDVMAREVAVDVASHSPQVDPILDDLAAALADIAPMTP
KVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVFAE
LSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGA
ALDYSALYPAGRLVDAPLPAWTHARLFIDDDGQEQRAQGACTITVHPLLG
SHVRLTEEPERHVWQGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAA
AEVFGEAAEVRDITFEQMLLLDEQTPIDAVASIDAPGVVNFTVETNRDGE
TTRHATAALRAAEDDCPPPGYDITALLQAHPHAVNGTAMRESFAERGVTL
GAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYRIHPALLDACFQS
VGAGVQAGTATGGLLLPLGVRSLRAYGPTRNARYCYTRLTKAFNDGTRGG
EADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRAL
PEVGDGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWS
VQDTPPNDQAGLEKLGSQLRGRDGVVIVYGPRVGDPDEHSLLAGREQVRH
LVRITRELAEFEGELPRLFVVTRQAQIVKPHDSGERANLEQAGLRGLLRV
ISSEHPMLRTTLIDVDEHTDVERVAQQLLSGSEEDETAWRNGDWYVARLT
PSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEFVASDRVPPGPGQIE
VAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEGVTGHQV
GDRVGGFSEGGCWRTFLTCDANLAVTLPPGLTDEQAITAATAHATAWYGL
NDLAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLR
DMGVEHVYDSRSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFG
GRFVEIGKADVYGNTRLGLFPFRRGLTFYYLDLALMSVTQPDRVRELLAT
VFKLTADGVLTAPQCTHYPLAEAADAIRAMSNAEHTGKLVLDVPRSGRRS
VAVTPEQAPLYRRDGSYIITGGLGGLGLFFASKLAAAGCGRIVLTARSQP
NPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATATGLPLRGVLH
SAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSG
AALLGSPGQGAYAAANSWVDVFAHWRRAQGLPVSAIAWGAWGEVGRATFL
AEGGEIMITPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGE
MFASTGQRSRGPSKFRMELLSLPQDEWAGRLRRLLVEQASVILRRTIDAD
RSFIEYGLDSLGMLEMRTHVETETGIRLTPKVIATNNTARALAQYLADTL
AEEQAAAPAAS
>Rv2384 mbtA, BIFUNCTIONAL ENZYME MBTA: SALICYL-AMP LIGASE (SAL-AMP LIGASE) + SALICYL-S-ArCP SYNTHETASE
MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDTVLSDAA
RRWPDRLAVADAGDRPGHGGLSYAELDQRADRAAAALHGLGITPGDRVLL
QLPNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVAD
VASGFDYRPMARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAP
PADPGSPALLLVSGGTTGMPKLIPRTHDDYVFNATASAALCRLSADDVYL
VVLAAGHNFPLACPGLLGAMTVGATAVFAPDPSPEAAFAAIERHGVTVTA
LVPALAKLWAQSCEWEPVTPKSLRLLQVGGSKLEPEDARRVRTALTPGLQ
QVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNADGEPVGPGE
EGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTG
RVKDVICRAGETIAASDLEEQLLSHPAIFSAAAVGLPDQYLGEKICAAVV
FAGAPITLAELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVR
QLGIATGPVTTQRCH
>Rv2383c mbtB, PHENYLOXAZOLINE SYNTHASE MBTB (PHENYLOXAZOLINE SYNTHETASE)
MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMSLVGRWR
RKGIAVDFATLAATPTIEAWSQLVSAGTGVAPTAVAAPGDAGLSQEGEPF
PLAPMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALR
HPMLRVQFLPDGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDA
KSHQQLDGAVFELALTLLPGERTRLHVDLDMQAADAMSYRILLADLAALY
DGREPPALGYTYREYRQAIEAEETLPQPVRDADRDWWAQRIPQLPDPPAL
PTRAGGERDRRRSTRRWHWLDPQTRDALFARARARGITPAMTLAAAFANV
LARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVDLTGARTAA
ARAQAVQEALRSAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGD
LFCPDVTEQFGTPGWIISQGPQVLLDAQVTEFDGGVLVNWDVREGVFAPG
VIDAMFTHQVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPS
TEALHDGFFRQAQQQPDAPAVFASSGDLSYAQLRDQASAVAAALRAAGLR
VGDTVAVLGPKTGEQVAAVLGILAAGGVYLPIGVDQPRDRAERILATGSV
NLALVCGPPCQVRVPVPTLLLADVLAAAPAEFVPGPSDPTALAYVLFTSG
STGEPKGVEVAHDAAMNTVETFIRHFELGAADRWLALATLECDMSVLDIF
AALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEVGG
GRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFE
VQDAANLPPDWASVPYGVPFPNNACRVVADSGDDCPDWVAGELWVSGRGI
ARGYRGRPELTAERFVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKI
SGYRVELGEIEAALQRLPGVHAAAATVLPGGSDVLAAAVCVDDAGVTAES
IRQQLADLVPAHMIPRHVTLLDRIPFTDSGKIDRAEVGALLAAEVERSGD
RSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFFALGGDSVLATQV
VAGIRRWLDSPSLMVADMFAARTIAALAQLLTGREANADRLELVAEVYLE
IANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAA
AAYRWLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDW
HLTAPLTLFGHCMGAIVAFEFARLAERNGVPVRALWASSGQAPSTVAASG
PLPTADRDVLADMVDLGGTDPVLLEDEEFVELLVPAVKADYRALSGYSCP
PDVRIRANIHAVGGNRDHRISREMLTSWETHTSGRFTLSHFDGGHFYLND
HLDAVARMVSADVR
>Rv2382c mbtC, POLYKETIDE SYNTHETASE MBTC (POLYKETIDE SYNTHASE)
MSDNDPVVIVGLAIEAPGGVETADDYWTLLSEQREGLGPFPTDRGWALRE
LFDGSRRNGFKPIHNLGGFLSSATTFDPEFFRISPREATAMDPQQRVGLR
VAWRTLENSGINPDDLAGHDVGCYVGASALEYGPALTEFSHHSGHLITGT
SLGVISGRIAYTLDLAGPALTVDTSCSSALAAFHTAVQAIRAGDCDLALA
GGVCVMGTPGYFVEFSKQHALSDDGHCRPYSAHASGTAWAEGAAMFLLQR
RSRATADRRRVLAEVRASCLNSDGLSDGLTAPSGDAQTRLLRRAIAQAAV
VPADVGMVEGHGTATRLGDRTELRSLAASYGTAPAGRGPLLGSVKSNIGH
AQAAAGGLGLVKVILAAQHAAIPPTLHVDEPSREIDWEKQGLRLADKLTP
WRAVDGWRTAAVSAFGMSGTNSHVIVSMPDTVSAPERGPECGEV
>Rv2381c mbtD, POLYKETIDE SYNTHETASE MBTD (POLYKETIDE SYNTHASE)
MAPKQLPDGRVAVLLSAHAEELIGPDARAIADYLERFPATTVTEVARQLR
KTRRVRRHRAVLRAADRLELAEGLRALAAGREHPLIARSSLGSAPRQAFV
FPGQGGHWPGMGAVAYRELPTYRTATDTCAAAFAAAGVDSPLPYLIAPPG
TDERQAFCEIEIEGAQFVHAVALAEVWRSCGVLPDLTVGHSLGEVAAAYL
AGSITLSDAVAVVAARANVVGRLPGRYAVAALGIGEQDASALIATTGGWL
ELSVVNASSTVAVSGERQAVAAIVDTVRSSGHFARGITVGFPVHTSVLES
LRDELCEQLPDSEFMEAPVQFIGGTTGDVVAPGTTFGDYWYANLRHTVRF
DRAVESAIRCGARAFIEISAHPALLFAIGQNCEGAANLPDGPAVLVGSAR
RGERFVDALSANIVSAAVADPGYPWGDLGGDPLDGDVDLSGFPNAPMRAV
PMWAHPEPLPPVSGLTIAVERWERMVPSTPVAGRHRHLAVLDLGAHRALA
QTLCAAIDSHPDTELSAARDAELILVIAPDFEHTDAVRAAGALADLVGAG
LLDYPMHIGARCQSVCLVTVGAEQVDAADAVPSAGQAALAAMHRSIGFEH
PEQTFSHLDLPSWDLDPVLGVSVITAVLRGFGETALRGSVNGYTLFERTL
ADAPAVPNWSLDSGVLDDVVVTGGAGAIGMHYARYLAEHGARRIVLLSRR
AADQATVAMLRKQHGTVIVSPPCDITDPTQLSAIAAEYGGVGASLIVHAA
GSVISGTAPGVTSAAVVDNFAAKVLGLAQMIELWPLRPDVRTLLCSSVMG
VWGGHGVVAYSAANRLLDVMAAQLRAQGRHCVAVKWGLWQAPKAGEPARG
IADAVTIARVERSGLRQMAPQQAIEASLHEFTVDPLVFAADAARLQMLLD
SRQFERYEGPTDPNLTIVDAVRTQLAAVLGIPQAGEVNLQESLFDLGVDS
MLALDLRNRLKRSIGATVSLATLMGDITGDGLVAKLEDADERSHTAQKVD
ISRD
>Rv2380c mbtE, PEPTIDE SYNTHETASE MBTE (PEPTIDE SYNTHASE)
MWFVQMADPSGALLNICVSYRITGDIDLARLRDAVNAVARRHRILRTTYP
VGDDGVAQPTVHADLRPGWTQYDLTDLSQRAQRLRLEVLAQREFCAPFEL
SRDAPLRITVVRTAADEHVLLLVAHHIAWDDGSWRVFFTDLTQAYSRADL
GADLGPEHRPSAASGPDTTEADLNYWRAIMADPPEPLELPGPAGTCVPTS
WRAARATLRLPADTAARVATMAKNTGCTPYMVLLAAFGALVHRYTHSDDF
LVAAPVLNRGAGTEDAIGYFGNTVAMRLRPQSAMSFRELLTATRDIASGA
FAHQRINLDRVVRELNPDRRHGAERMTRVSFGFREPDGGGFNPPGIECER
YDLRSNITQLPLGFMVEFDRAGVLVEAEHLVEILEPALAKQMLRHFGVLL
DNALAAPDNTLSGLALMDERDAARLREVSRGERFDTPVKTLVDLVNEQTT
RTPDATAVVYEGQHFTYHDLNEASNRLGHWLIEQGIGSEDRVAVLLDKSP
DLIVTALGVVKSGAVYVPVDPSYPQDRLDFILADCDAKLVLRTPVRELAG
YRSDDPTDADRIRPLRPDNTAYLIYTSGTTGLPKGVAVPHRPVAEYFVWF
KGEYDVDDTDRLLQVASPSFDVSIAEIFGTLACGARMVIPRPGGLTDIGY
LTALLRDEGITAMHFVPSLLGLFLSLPGVSQWRTLQRVPIGGEPLPGEVA
DKFHATFDALLHNFYGPTETVINASRFKVVGPQGTRIVPIGRPKINTTMH
LLDDSLQPVPTGVIGEIYIGGTHVAYGYHRRAGLTAERFVADPFNPGSRM
YRSGDLARRNADGDIEFVGRADEQVKIRGFRIELGDVAAAIAVDPTVGQA
VVVVSDLPRLGKSLVGYVTPAAGGDGPADVGVDLDRIRARVAAALPEYML
PAAYVVLDEIPITAHGKIDRAALPEPQIASDTEFRAPQTATERRLAQLFG
ELLGRDRVGADDSFFDLGGHSLLATKLVAAVRNAFGVDVGVREIFEFATV
TALAGHIDTLDSDSARPRLTRVDHDGPVRLSSSQMRSWFNYRFDGPNAVN
NIPFAAALHGPCDTNAFAAAITDVVARHEILRTVYREIGGVPHQIIQPPA
EVPVRCAAGSDAAWLRAELNNERGYVFDLETDWPIRAALLSTPEQTVLSL
VVHHIAGDHWSAGVLFTDLLTAYRARSTGQRPSWAPLPVQYADYSVWQSA
LLDDGAGIVGPQRDYWIRQLGGLAGETGLRPDFPRPALLSGAGDAVEFRL
GAAIRDKLAAVSRDLGVTEFMLLQAAVAVVLHKAGGGVDVPIGAPVAGRS
EANLDQLIGFFINIVVLRNDLRGNPTLREVLQRTRQMALAAYAHQDLPFD
QVVEAVNPQRSLSRNPLFDIVVHVREQMPQDHVIDTGPDGDTTLRVLEPT
FDAAQADLSVNFFACGDEYRGHVIYRTELYERATAQRFADWLVRVVEAFA
DRPDQPLREVEMVSAQARRRILDRSNAGAGTARVYLLDDALKPVPVGVVG
DVYYGGGPAVGARLARPSETATRFVADPFAAQPGSRLYRNGERGVWKADG
QLELLAEIERLPTAQAAPVPAEPADTETERALAAILADVLEVGEVGRYDD
FFNLGGDSILATQVAARARDGGIPLTARMVFEHPVLCELAAAVDAKPHVE
AEPDDKHHAPMSTSGLSPDELSALTASWDQWP
>Rv2379c mbtF, PEPTIDE SYNTHETASE MBTF (PEPTIDE SYNTHASE)
MGPVAVTRADARGAIDDVMALSPLQQGLFSRATLVAAESGSEAAEADPYV
IAMAADAAGPLDIALLRDCAAAMLTRHPNLRASFLHGNLSRPVQVIPSSA
EVLWRHVRAHPSEVGALAAEERRRRFDVGRGPLIRFLLIELPDECWHLVI
VAHHIVIDGWSLPLFVSELLALYRAGGHVAALPAAPRPYRDYIGWLAGRD
QTASRAMWADHLNGLDGPTLLSPALADTPVQPGIPGRTEVRLDREATAEL
ADAARTRGVTISTLVQMAWATTLSAFTGRGDVTFGVTVSGRPSELSGVET
MIGLFINTVPLRVRLDARATVGGQCAVLQRQFAMLRDHSYLGFNEFRAIA
GIGEMFDTLLVYENFPPGEVVGTAEFVANGVTFRPVALESLSHFPVTVAA
HRSTGELTLLVEVLDGALGTMAPESLGRRVLAVLQRLVSRWDRPLRDVDI
LLDGEHDPTAPGLPDVTTSAPAVHTRFAEIAAAQPDSVAVSWADGQLTYR
ELDALADRLATGLRRADVSRETPVAVALSRGPRYVAAMLAVLKAGGMIVP
LDPAMPGERVAEILRQTSAPVVIDEGVFAASVGADILEEDRAITVPVDQA
AYVIFTSGTTGTPKGVIGTHRALSAYADDHIERVLRPAAQRLGRPLRIAH
AWSFTFDAAWQPLVALLDGHAVHIVDDHRQRDAGALVEAIDRFGLDMIDT
TPSMFAQLHNAGLLDRAPLAVLALGGEALGAATWRMIQQNCARTAMTAFN
CYGPTETTVEAVVAAVAEHARPVIGRPTCTTRAYVMDSWLRPVPDGVAGE
LYLAGAQLTRGYLGRPAETAARFVAEPNGRGSRMYRTGDVVRRLPDGGLE
FLGRSDDQVKIRGFRVEPGEIAAVLNGHHAVHGCHVTARGHASGPRLTAY
VAGGPQPPPVAELRAMLLERLPRYLVPHHIVVLDELPLTPHGKIDENALA
AINVTEGPATPPQTPTELVLAEAFADVMETSNVDVTAGFLQMGLDSIVAL
SVVQAARRRGIALRARLMVECDTIRELAAAIDSDAAWQAPANDAGEPIPV
LPNTHWLYEYGDPRRLAQTEVIRLPDRITRERLDAVLAAVVDGHEVLRCR
FDRDAMALVAQPKTDILSEVWVSGELVTAVAEQTLGALASLDPQAGRLLS
AVWLREPDGPGVLVLTAHVLAMDPASWRIVLGELDAGLHALAAGRAPSPA
RENTSYRQWSRLLAQRAKALDSVDFWVAELEGADPPLGARRVAPQTDRVG
ELAITMSISDADLTARLLSTGRSMTDLLATAAARMVTAWRRQRGQQTPAP
LLALETHGRADVHVDKTADTSDTVGLLSAIYPLRIHCDGATDFARIPGSG
IDYGLLRYLRADTAERLRAHREPQLLLNYLGSLHVGVGDLAVDRALLADV
GQLPEPEQPVRHELTVLAALLGPADAPVLATRWRTLPDILSADDVATLQS
LWQGALAEITA
>Rv2378c mbtG, LYSINE-N-OXYGENASE MBTG (L-LYSINE 6-MONOOXYGENASE) (LYSINE N6-HYDROXYLASE)
MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGANWQASG
GWTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATA
SFAEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWAL
CTHETTVQADALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAE
RVAVIGGGETAASMLNELFRHRVSTITVISPQVTLFTRGEGFFENSLFSD
PTDWAALTFDERRDALARTDRGVFSATVQEALLADDRIHHLRGRVAHAVG
RQGQIRLTLSTNRGSENFETVHGFDLVIDGSGADPLWFTSLFSQHTLDLL
ELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGPGFPNLSCL
GLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR
>Rv0169 mce1A, MCE-FAMILY PROTEIN MCE1A
MTTPGKLNKARVPPYKTAGLGLVLVFALVVALVYLQFRGEFTPKTQLTML
SARAGLVMDPGSKVTYNGVEIGRVDTISEVTRDGESAAKFILDVDPRYIH
LIPANVNADIKATTVFGGKYVSLTTPKNPTKRRITPKDVIDVRSVTTEIN
TLFQTLTSIAEKVDPVKLNLTLSAAAEALTGLGDKFGESIVNANTVLDDL
NSRMPQSRHDIQQLAALGDVYADAAPDLFDFLDSSVTTARTINAQQAELD
SALLAAAGFGNTTADVFDRGGPYLQRGVADLVPTATLLDTYSPELFCTIR
NFYDADPLAKAASGGGNGYSLRTNSEILSGIGISLLSPLALATNGAAIGI
GLVAGLIAPPLAVAANLAGALPGIVGGAPNPYTYPENLPRVNARGGPGGA
PGCWQPITRDLWPAPYLVMDTGASLAPYNHMEVGSPYAVEYVWGRQVGDN
TINP
>Rv0170 mce1B, MCE-FAMILY PROTEIN MCE1B
MKITGTVVKLGIVSVVLLFFTVMIIVIFGQMRFDRTNGYTAEFSNVSGLR
QGQFVRASGVEIGKVKALHLVDGGRRVRVEFNIDRSVPLYQSTTAQIRYS
DLIGNRYVELKRGEGKGANDLLPPGGLIPLSRTSPALDLDALIGGFKPVF
RALDPAKVNNIANALITVFQGQGGTINDILDQTAQLTSQIAERDQAIGEV
VKNLNIVLDTTVKHRKEFDETVNNLENLITGLRNHSDQLAGGLAHISNGA
GTVADLLAENRTLVRKAVSYLDAIQQPVIDQRVELDDLLHKTPTALTALG
RANGTYGDFQNFYLCDLQIKWNGFQAGGPVRTVKLFSQPTGRCTPQ
>Rv0171 mce1C, MCE-FAMILY PROTEIN MCE1C
MRTLEPPNRMRIGLMGIVVALLVVAVGQSFTSVPMLFAKPSYYGQFTDSG
GLHKGDRVRIAGLGVGTVEGLKIDGDHIVVKFSIGTNTIGTESRLAIRTD
TILGRKVLEIEPRGAQALPPGGVLPVGQSTTPYQIYDAFFDVTKAASGWD
IETVKRSLNVLSETVDQTYPHLSAALDGVAKFSDTIGKRDEQITHLLAQA
NQVASILGDRSEQVDRLLVNAKTLIAAFNERGRAVDALLGNISAFSAQVQ
NLINDNPNLNHVLEQLRILTDLLVDRKEDLAETLTILGRFSASFGETFAS
GPYFKVLLANLVPGQILQPFVDAAFKKRGISPEDFWRSAGLPAYRWPDPN
GTRFPNGAPPPPPPVLEGTPEHPGPAVPPGSPCSYTPPADGLPRPWDPLP
CANLTQGPFGGPDFPAPLDVATSPPNPDGPPPAPGLPIAGRPGEVPPNVP
GTPVPIPQEAPPGARTLPLGPAPGPAPPPAAPGPPAPPGPGPQLPAPFIN
PGGTGGSGVTGGSEN
>Rv0172 mce1D, MCE-FAMILY PROTEIN MCE1D
MSTIFDIRNLRLPQLSRASVVIGSLVVVLALAAGIVGVRLYQKLTNNTVV
AYFTQANALYVGDKVQIMGLPVGSIDKIEPAGDKMKVTFHYQNKYKVPAN
ASAVILNPTLVASRNIQLEPPYRGGPVLADNAVIPVERTQVPTEWDELRD
SVSHIIDELGPTPEQPKGPFGEVIEAFADGLAGKGKQINTTLNSLSQALN
ALNEGRGDFFAVVRSLALFVNALHQDDQQFVALNKNLAEFTDRLTHSDAD
LSNAIQQFDSLLAVARPFFAKNREVLTHDVNNLATVTTTLLQPDPLDGLE
TVLHIFPTLAANINQLYHPTHGGVVSLSAFTNFANPMEFICSSIQAGSRL
GYQESAELCAQYLAPVLDAIKFNYFPFGLNVASTASTLPKEIAYSEPRLQ
PPNGYKDTTVPGIWVPDTPLSHRNTQPGWVVAPGMQGVQVGPITQGLLTP
ESLAELMGGPDIAPPSSGLQTPPGPPNAYDEYPVLPPIGLQAPQVPIPPP
PPGPDVIPGPVPPTPAPVGAPLPAEAGGGQ
>Rv0174 mce1F, MCE-FAMILY PROTEIN MCE1F
MLTRFIRRQLILFAIVSVVAIVVLGWYYLRIPSLVGIGQYTLKADLPASG
GLYPTANVTYRGITIGKVTAVEPTDQGARVTMSIASNYKIPVDASANVHS
VSAVGEQYIDLVSTGAPGKYFSSGQTITKGTVPSEIGPALDNSNRGLAAL
PTEKIGLLLDETAQAVGGLGPALQRLVDSTQAIVGDFKTNIGDVNDIIEN
SGPILDSQVNTGDQIERWARKLNNLAAQTATRDQNVRSILSQAAPTADEV
NAVFSGVRDSLPQTLANLEVVFDMLKRYHAGVEQLLVFLPQGAAIAQTVL
TPTPGAAQLPLAPAINYPPPCLTGFLPASEWRSPADTSPRPLPSGTYCKI
PQDAQLQVRGARNIPCVDVLGKRAATPKECRSKDPYVPLGTNPWFGDPNQ
ILTCPAPGARCDQPVKPGLVIPAPSINTGLNPAPADQVQGTPPPVSDPLQ
RPGSGTVQCNGQQPNPCVYTPTSGPSAVYSPASGELVGPDGVKYAVANSS
TTGDDGWKEMLAPAS
>Rv0589 mce2A, MCE-FAMILY PROTEIN MCE2A
MPTLVTRKNRRAWLYVEGVVLLLVGALVLVLVYKQFRGEFTPKTELTMVA
FRAGLVMEAGSKVTYNGVEIGRVGSISEIERDGRPAAKLVLDVNPRYISL
IPVNVVADIEAATLFGNKYVALSAPKIPQQQRISSHDVIDVGSVTTEFNT
LFETITSIAEKVDPIELNATLSAVAQALDGLGGKFGESIVNGNQILAQLN
PRLPQLGYDVRRLADLGEVYVDASPDLWSFLQNALTTARTLTSQQRDLDA
ALLAATGAGNTGEDVFARGGPYLARAAADLVPTATLLDTYSPELFCMIRN
FHDAAPKVADAVGGNGYSLAAAGTILGAPNPYVYPDNLPRVNAHGGPGGR
PGCWQTITRELWPAPYLVMDTGASLAPYNHVELGQPMFTEYVWGRQYGEN
TINP
>Rv0590 mce2B, MCE-FAMILY PROTEIN MCE2B
MKTTGTTIKLGIVWLVLSVFTVMIIVVFGQVRFHHTTGYSAVFTHVSGLR
AGQFVRAAGVEVGKVAKVTLIDGDKQVLVDFTVDRSLSLDQATTASIRYL
NLIGDRYLELGRGHSGQRLAPGATIPLEHTHPALDLDALLGGFRPLFQTL
DPDKVNSIASSIITVFQGQGATINDILDQTASLTATLADRDHAIGEVVNN
LNTVLATTVKHQTEFDRTVDKLEVLITGLKNRADPLAAAAAHISSAAGTL
ADLLGRIVHCCTAASGTSRASSSRS
>Rv0591 mce2C, MCE-FAMILY PROTEIN MCE2C
MRTLTEFNRGRVGMMGAVVTVLVVGVAQSFTSVPMLFATPTYYAQFADTG
GINTGDKVEIAGVNVGLVRSLAIRGNRVLIGFSLPGKTIGMQSRAAIRTD
TILGRKNLEIEPRGSEPLKPNGFLPLAQTTTPYQIYDAFVDVTKAATGWD
IDAVKRSLNVLSETFDQTAPHLSAALEGVKAFSDTVGRRGEQIEQLLANA
NRIARVLGDRSEQVNGLLVNAKTLLAAFKQRSQALRILLTNVSEASAQVS
GLITDNPNLNHVLAQLRTVSEELVKRKNELADVAVLLGRYTAALTEAVGS
GPFFKAMVVNLLPYQILQPWVDAAFKKRGIDPENFWRSAGLPEFRWPDPN
GTRFPNGAPPAAPPVREGTPKHPGPAVPPGTPCSYTPAAGALPRPDTPLP
CAGATVGPFGGPDFPAPLDVQPSPPNPDGPPPTPGILSAGRPGEPAPAVP
GIPMPLPPNAPPGARTQPLEPFPDGTGGSNQ
>Rv0592 mce2D, MCE-FAMILY PROTEIN MCE2D
MSTIFDIRSLRLPKLSAKVVVVGGLVVVLAVVAAAAGARLYRKLTTTTVV
AYFSEALALYPGDKVQIMGVRVGSIDKIEPAGDKMRVTLHYSNKYQVPAT
ATASILNPSLVASRTIQLSPPYTGGPVLQDGAVIPIERTQVPVEWDQLRD
SINGILRQLGPTERQPKGPFGDLIESAADNLAGKGRQLNETLNSLSQALT
ALNEGRGDFVAITRSLALFVSALYQNDQQFVALNENLAEFTDWFTKSDHD
LADTVERIDDVLGTVRKFVSDNRSVLAADVNNLADATTTLVQPEPRDGLE
TALHVLPTYASNFNNLYYPLHSSLVGQFVFPNFANPIQLICSAIQAGSRL
GYQESAELCAQYLAPVLDALKFNYLPFGSNPFSSAATLPKEVAYSEERLR
PPPGYKDTTVPGIFSRDTPFSHGNHEPGWVVAPGMQGMQVQPFTANMLTP
ESLAELLGGPDIAPPPPGTNLPGPPNAYDESNPLPPPWYPQPASLPAAGA
TGQPGPGQ
>Rv0594 mce2F, MCE-FAMILY PROTEIN MCE2F
MLTRAIKTQLVLLTVLAVIAVVVLGWYFLRIPSLVGIGRYTLYAELPRSG
GLYRTANVTYRGITIGKVTGVEPTERGARATMSIDNGYQIPTDASANVHS
VSAVGEQFVDLVSTRTSGPYLRHGQTITTTTVPSQIGPALDAANRGLAVL
PKDRVASVLHEASEAVGGLGSSLNRLIEATQAIAHDVRGSLEDIDDIIER
SAPIIDSQVNSGNEIARWAANLNTLAAQTAQTDPAVRSILANAAPTADQV
NATFSDVRESLPQTLANLEVVIDMLKRYHNGVEQALVFLPQSGAIAQSVT
TEFPGQAGLGVGGLALNQPPPCLTGFLPASEWRSPADTSTAPLPKGTYCR
IPMDASNVVRGARNNPCVDVPGKRAATPRECRSNEAYVPGGTNPWYGDPN
QMLSCPAPAARCDQPVKPGQVIPAPSVNNGINPLPADQLPGTPPPVNDPL
QRPGSGTVQCNGQQPNPCVYTPSTFPTTIYDVQSGKVVAPDGVVYSVEAS
THAGADGWKVMLAPTG
>Rv1966 mce3A, MCE-FAMILY PROTEIN MCE3A
MRRGPGRHRLHDAWWTLILFAVIGVAVLVTAVSFTGSLRSTVPVTLAADR
SGLVMDSGAKVMMRGVQVGRVAQIGRIEWAQNGASLRLEIDPDQIRYIPA
NVEAQISATTAFGAKFVDLVMPQNPSRARLSAGAVLHSKNVSTEINTVFE
NVVDLLNMIDPLKLNAVLTAVADAVRGQGERIGQATTDLNEVLEALNARG
DTIGGNWRSLKNFTDTYDAAAQDILTILNAASTTSATVVNHSTQLDALLL
NAIGLSNAGTNLLGSSRDNLVGAADILAPTTSLLFKYNPEYTCFLQGAKW
YLDNGGYAAWGGADGRTLQLDVALLFGNDPYVYPDNLPVVAAKGGPGGRP
GCGPLPDATHNFPVRQLVTNTGWGTGLDIRPNPGIGHPCWANYFPVTRAV
PEPPSIRQCIPGPAIGPNPAAGEQP
>Rv1967 mce3B, MCE-FAMILY PROTEIN MCE3B
MRENLGGVVVRLGVFLAVCLLTAFLLIAVFGEVRFGDGKTYYAEFANVSN
LRTGKLVRIAGVEVGKVTRISINPDATVRVQFTADNSVTLTRGTRAVIRY
DNLFGDRYLALEEGAGGLAVLRPGHTIPLARTQPALDLDALIGGFKPLFR
ALNPEQVNALSEQLLHAFAGQGPTIGSLLAQSAAVTNTLADRDRLIGQVI
TNLNVVLGSLGAHTDRLDQAVTSLSALIHRLAQRKTDISNAVAYTNAAAG
SVADLLSQARAPLAKVVRETDRVAGIAAADHDYLDNLLNTLPDKYQALVR
QGMYGDFFAFYLCDVVLKVNGKGGQPVYIKLAGQDSGRCAPK
>Rv1968 mce3C, MCE-FAMILY PROTEIN MCE3C
MKSFAERNRLAIGTVGIVVVAAVALAALQYQRLPFFNQGTRVSAYFADAG
GLRTGNTVEVSGYPVGKVSSISLDGPGVLVEFKVDTDVRLGNRTEVAIKT
KGLLGSKFLDVTPRGDGRLDSPIPIERTTSPYQLPDALGDLAATISGLHT
ERLSESLATLAQTFADTPAHFRNAIHGVARLAQTLDERDNQLRSLLANAA
KATGVLANRTDQIVGLVRDTNVVLAQLRTQSAALDRIWANISAVAEQLRG
FIAENRQQLRPALDKLNGVLAIVENRKERVRQAIPLINTYVMSLGESLSS
GPFFKAYVVNLLPGQFVQPFISAAFSDLGLDPATLLPSQLTDPPTGQPGT
PPLPMPYPRTGQGGEPRLTLPDAITGNPGDPRYPYRPEPPAPPPGGPPPG
PPAQQPGDQP
>Rv1969 mce3D, MCE-FAMILY PROTEIN MCE3D
MTTKLRRARSVLATALVLVAGVILAMRTADAAARTTVVAYFDNSNGVFAG
DDVLIRGVPVGKIVKIEPQPLRAKISFWFDRKYRVPADAAAAILSPQLVT
GRAIQLTPPYAGGPTMADGTVIPQERTVVPVEWDDLRAQLQRLTALLQPT
RPGGVSTLGALINTAADNLRGQGATIRDTIIKLSQAISALGDHSKDIFST
VTNLSTLVTALHDSADLLERLNHNLAAVTSLLADGPDKIGQAAEDLNAVV
ADVGSFAAEHREAIGTASDKLASITTALVDSLDDIKQTLHISPTVLQNFN
NIFEPANGALTGALAGNNMANPIAFLCGAIQAASRLGGEQAAKLCVQYLA
PIVKNRQYNYPPLGANLFVGAQARPNEVTYSEDWLRPDYVAPVADTPPDP
AAAVTVDPATGLRGMMMPPGGGS
>Rv1971 mce3F, MCE-FAMILY PROTEIN MCE3F
MLHLPRRVIVQLAVFTVIAVGVLAITFLHFVRLPAMLFGVGRYTVTMELV
EAGGLYRTGNVTYRGFEVGRVAAVRLTDTGVQAVLALKSGIDIPSDLKAE
VHSHTAIGETYVELLPRNAASPPLKNGDVIALADTSVPPDINDLLSAANT
ALEAIPHENLQTVIDESYTAVAGLGLELSRLIKGSAELAIDARANLDPLV
ALIDRAGPVLDSQTHTSDAIAAWAAQLAAVTGQLQTHDSAVGDLIDRGGP
ALGETRQLLERLQPTVPILLANLVSVGQVALTYHNDIEQLLVVFPMAIAA
EQAGILANLNTKQAYRGQYLSFNLNLNLPPPCTTGFLPAQQRRIPTFEDY
PDRPAGDLYCRVPQDSPFNVRGARNIPCETVPGKRAPTVKLCESDAPYLP
LNDGYNWKGDPNATVPGLGSGQDIPQTWQTMLLPPGS
>Rv3499c mce4A, MCE-FAMILY PROTEIN MCE4A
MSGGGSRRTSVRVAAALLAGLMVGSAVLTYLSYTAAFTSTDTVTVSSPRA
GLVMEKGAKVKYRGIQVGKVTDISYSGNQARLKLAIDSGEMGFIPSNATV
RIAGNTIFGAKSVEFIPPKTPSPKPLSPNAHVAASQVQLEVNTLFQSLID
LLHKIDPLETNATLSALSEGLRGHGDDLGALLSGLNTLTRQANPKLPALQ
EDFRKAAVVANVYADAAGDLNTVFDNLPTINKTIVDQKDNLNDTLLATIG
LSNNAYETLAPAEQNFIDAINRLRAPLKVTSDYSPVFGCLFKGIARGVKE
FAPLIGVRKAGLFTSSSFVLGAPSYTYPESLPIVNASGGPNCRGLPDIPT
KQTGGSFYRAPFLVTDNALIPYQPFTELQVDAPSTLQFLFNGAFAERDDF
>Rv3498c mce4B, MCE-FAMILY PROTEIN MCE4B
MAGSGVPSHRSMVIKVSVFAVVMLLVAAGLVVVFGDFRFGPTTVYHATFT
DASRLKAGQKVRIAGVPVGSVKAVKLNPDHSIDVAFAIDRSYTLYSSTRA
VIRYENLVGDRFLEITSGPGELRKLPPGGTINVAHTQPALDLDALLGGLR
PVLKGFDADKINTITSAVIELLQGQGGPLANVLADTGAFSAALGARDQLI
GEVITNLNAVLATVDAKSAQFSASVDQLQQLVSGLAKNRDPIAGAISPLA
STTTDLTELLRNSRRPLQGILENARPLATELDNRKAEVNNDIEQLGEDYL
RLSALGSYGAFFNIYFCSVTIKINGPAGSDILLPIGGQPDPSKGRCAFAK
>Rv3497c mce4C, MCE-FAMILY PROTEIN MCE4C
MLNRKPSSKHERDPLRTGIFGLVLVICVVLIAFGYSGLPFWPQGKTYDAY
FTDAGGITPGNSVYVSGLKVGAVSAVSLAGNSAKVTFSVDRSIVVGDQSL
AAIRTDTILGERSIAVSPAGSGKSTTIPLSRTTTPYTLNGVLQDLGRNAN
DLNRPQFEQALNVFTQALHDATPQVRGAVDGLTSLSRALNRRDEALQGLL
AHAKSVTSVLSERAEQVNKLVEDGNQLFAALDARRAALSALISGIDDVAA
QISGFVADNRKEFGPALSKLNLVLANLNERRDYITEALKRLPTYATTLGE
VVGSGPGFNVNVYSVLPGPLVATVFDLVFQPGKLPDSLADYLRGFIQERW
IIRPKSP
>Rv3496c mce4D, MCE-FAMILY PROTEIN MCE4D
MMGRVAMLTGSRGLRYATVIALVAALVGGVYVLSSTGNKRTIVGYFTSAV
GLYPGDQVRVLGVPVGEIDMIEPRSSDVKITMSVSKDVKVPVDVQAVIMS
PNLVAARFIQLTPVYTGGAVLPDNGRIDLDRTAVPVEWDEVKEGLTRLAA
DLSPAAGELQGPLGAAINQAADTLDGNGDSLHNALRELAQVAGRLGDSRG
DIFGTVKNLQVLVDALSESDEQIVQFAGHVASVSQVLADSSANLDQTLGT
LNQALSDIRGFLRENNSTLIETVNQLNDFAQTLSDQSENIEQVLHVAGPG
ITNFYNIYDPAQGTLNGLLSIPNFANPVQFICGGSFDTAAGPSAPDYYRR
AEICRERLGPVLRRLTVNYPPIMFHPLNTITAYKGQIIYDTPATEAKSET
PVPELTWVPAGGGAPVGNPADLQSLLVPPAPGPAPAPPAPGAGPGEHGGG
G
>Rv3494c mce4F, MCE-FAMILY PROTEIN MCE4F
MIDRLAKIQLSIFAVITVITLSVMAIFYLRLPATFGIGTYGVSADFVAGG
GLYKNANVTYRGVAVGRVESVGLNPNGVTAHMRLNSGTAIPSNVTATVRS
VSAIGEQYIDLVPPENPSSTKLRNGFRIQRQNTRIGQDVADLLRQAETLL
GSLGDTRLRELLHEAFIATNGAGPELARLIESARLLVDEANANYPQVSQL
IDQAGPFLQAQIRAGGDIKSLADGLARFTWQLRAADPRLRDTLADAPDAI
DEANTAFSGIRPSFPALAASLANLGRVGVIYHKSIEQLLVVFPALFAAII
TSAGGVPQDEGAKLDFKIDLHDPPPCMTGFLPPPLVRSPADESVREIPRD
MYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPRGYVPVGTNPWR
GPPIPYGTEVTDGRNILPPNKFPYIPPGADPDPGVPIVGPPPPGQVAGPG
PAPHQPAQPAPPPNDNGPPPPFTSWMPPGYPPEPPQVPYPATIPPPPPPE
GTGPPPGPAPGPQPQASGPAYTIYDQLSGAFADPAGGTGIFAPGMTGASS
AENWVDLMRDPRQL
>Rv0542c menE, POSSIBLE O-SUCCINYLBENZOIC ACID--CoA LIGASE MENE (OSB-CoA SYNTHETASE) (O-SUCCINYLBENZOATE-CoA SYNTHASE)
MLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTGPPKGAM
LTAAALTASASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVE
LNVSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTDPAATAALAELD
AVLIGGGPAPRPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLR
VLAGGRIAIGGATLAKGYRNPVSPDPFAEPGWFHTDDLGALESGDSGVLT
VLGRADEAISTGGFTVLPQPVEAALGTHPAVRDCAVFGLADDRLGQRVVA
AIVVGDGCPPPTLEALRAHVARTLDVTAAPRELHVVNVLPRRGIGKVDRA
ALVRRFAGEADQ
>Rv0655 mkl, POSSIBLE RIBONUCLEOTIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER MKL
MRYSDSYHTTGRWQPRASTEGFPMGVSIEVNGLTKSFGSSRIWEDVTLTI
PAGEVSVLLGPSGTGKSVFLKSLIGLLRPERGSIIIDGTDIIECSAKELY
EIRTLFGVLFQDGALFGSMNLYDNTAFPLREHTKKKESEIRDIVMEKLAL
VGLGGDEKKFPGEISGGMRKRAGLARALVLDPQIILCDEPDSGLDPVRTA
YLSQLIMDINAQIDATILIVTHNINIARTVPDNMGMLFRKHLVMFGPREV
LLTSDEPVVRQFLNGRRIGPIGMSEEKDEATMAEEQALLDAGHHAGGVEE
IEGVPPQISATPGMPERKAVARRQARVREMLHTLPKKAQAAILDDLEGTH
KYAVHEIGQ
>Rv3566c nat, ARYLAMINE N-ACETYLTRANSFERASE NAT (ARYLAMINE ACETYLASE)
MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPLLGVPVD
DLSPQALADKLVLRRRGGYCFEHNGLMGYVLAELGYRVRRFAARVVWKLA
PDAPLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTT
HEPYRLEDRVDGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHP
ASKFVTGLTAAVITDDARWNLSGRDLAVHRAGGTEKIRLADAAAVVDTLS
ERFGINVADIGERGALETRIDELLARQPGADAP
>Rv0101 nrp, PROBABLE PEPTIDE SYNTHETASE NRP (PEPTIDE SYNTHASE)
MHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARFLAALHATV
LDNPVQLCVLENSGADYPDLVPRLRFGDIVRVGSADEHLQSTWCSGILGK
PLVRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARYLTTDPAGE
TPSVGAGLAKLREAHRRETAKVEESRGRLSAVVQRELADEAYHGGHGHSV
SDAPGTAAKGVLHESATICGNAFDAILTLSEAQRVPLNVLVAAAAVAVDA
SLRQNTETLLVHTVDNRFGDSDLNVATCLVNSVAQTVRFPPFASVSDVVR
TLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVEALTLNFIREPCAPGLR
PFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLPACKTHPKV
AERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPARQPSTPAW
FLDSARGVHQFLGRRRFVYPWVAWLVQRGAAPGDVLVFTDDDTDKTIDLL
IACHLAGCGYSVCDTADEISVRTNAITEHGDGILVTVVDVAATQLAVVGH
DELRKVVDERVTQVTHDALLATKTAYIMPTSGTTGQPKLVRISHGSLAVF
CDAISRAYGWGAHDTVLQCAPLTSDISVEEIFGGAACGARLVRSAAMKTG
DLAALVDDLVARETTIVDLPTAVWQLLCADGDAIDAIGRSRLRQIVIGGE
AIRCSAVDKWLESAASQGISLLSSYGPTEATVVATFLPIVCDQTTMDGAL
LRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVTAADGSRRR
AFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRIAEDPAVSD
VAVELHSGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVSSFFVVGVP
NIPRKPNGKIDSDNLPRLPQWSAAGLNTAETGQRAAGLSQIWSRQLGRAI
GPDSSLLGEGIGSLDLIRILPETRRYLGWRLSLLDLIGADTAANLADYAP
TPDAPTGEDRFRPLVAAQRPAAIPLSFAQRRLWFLDQLQRPAPVYNMAVA
LRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPRQLVIEARRADLG
CDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFRIADDEHVLVA
VAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYVDYTLWQRE
ILGDLDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVADQRGASLV
VDWPASVQQQVRRIARQHNATSFMVVAAGLAVLLSKLSGSPDVAVGFPIA
GRSDPALDNLVGFFVNTLVLRVNLAGDPSFAELLGQVRARSLAAYENQDV
PFEVLVDRLKPTRALTHHPLIQVMLAWQDNPVGQLNLGDLQATPMPIDTR
TARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEAQAIDVLIERLRKVL
VAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPVSIPQMLAA
QVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPGECVALLFE
RCAPAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVAAVTTAGLR
SRLAGHDLPIIDVVDALAAYPGTPPPMPAAVNLAYILYTSGTTGEPKGVG
ITHRNVTRLFASLPARLSAAQVWSQCHSYGFDASAWEIWGALLGGGRLVI
VPESVAASPNDFHGLLVAEHVSVLTQTPAAVAMLPTQGLESVALVVAGEA
CPAALVDRWAPGRVMLNAYGPTETTICAAISAPLRPGSGMPPIGVPVSGA
ALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRFVACPFGGS
GARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVATALAELAG
VGQAVVIAREDRPGDKRLVGYATEIAPGAVDPAGLRAQLAQRLPGYLVPA
AVVVIDALPLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKTVAGIFARV
LGLERVGVDDSFFELGGDSLAAMRVIAAINTTLNADLPVRALLHASSTRG
LSQLLGRDARPTSDPRLVSVHGDNPTEVHASDLTLDRFIDADTLATAVNL
PGPSPELRTVLLTGATGFLGRYLVLELLRRLDVDGRLICLVRAESDEDAR
RRLEKTFDSGDPELLRHFKELAADRLEVVAGDKSEPDLGLDQPMWRRLAE
TVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKLKPFTYVSTAD
VGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEVLLREANDL
CALPVAVFRCGMILADTSYAGQLNMSDWVTRMVLSLMATGIAPRSFYEPD
SEGNRQRAHFDGLPVTFVAEAIAVLGARVAGSSLAGFATYHVMNPHDDGI
GLDEYVDWLIEAGYPIRRIDDFAEWLQRFEASLGALPDRQRRHSVLPMLL
ASNSQRLQPLKPTRGCSAPTDRFRAAVRAAKVGSDKDNPDIPHVSAPTII
NYVTNLQLLGLL
>Rv1153c omt, PROBABLE O-METHYLTRANSFERASE OMT
MSAHKPAKQRVALTGVSETALLTLNARAAEARRRDAIIDDPMAVALVESI
DFDFAKFGPTGQGFALRARAFDMAAQHYLDQHPAATVVALAEGLQTSFWR
LDVAIPGGQFRWLTVDLPPIVDLRTRLLPSSPRVSVCAQSALDYSWMDSV
DPAGGVFITAEGLLMYLQPEQALGLIAQCAQTFPGGQMLFDLPPRWFAGW
SRLGLRTSLRYKVPRMPFSMSVAQAADLVNKVPGVVAVRDLRVPPGRGLW
VNMALSTVYRLPVFDPLRPCLTLLEFSRPARG
>Rv0266c oplA, PROBABLE 5-OXOPROLINASE OPLA (5-OXO-L-PROLINASE) (PYROGLUTAMASE) (5-OPASE)
MVGAGWHFWVDRGGTFTDVVARRPDGRLLTHKLLSDNPARYRDAAVAGIR
ALLANGEAGTRVDAVRMGTTVATNALLERTGERTLLVITRGFGDALRIAY
QNRPRIFDRRIVLPEMLYERVVEVDERVTADGRVLRAPDLEALGEKMRQA
HADGIRAVAVVCLHSYLYPGHEREIGTLAQRIGFAQISLSSEVSPLMKLV
PRGDTTVVDAYLSPVLRRYINQVADQMRGVRLMFMQSNGGLAQAGHFRGK
DAILSGPAGGIVGMVRMSALAGFDHVIGFDMGGTSTDVSHYAGEYERVFT
TQVAGVRLRAPMLDIHTVAAGGGSILHFDGSRYRVGPDSAGADPGPACYR
GGGPLCVTDANVMLGRIQPTHFPSVFGPSGDQPLDAGTVRRGFTDLAADI
AARTGDDRSPEQVAEGYLRIAVANMANAVKKISVQKGHDVTRYALTTFGG
AGGQHACAVADALGIRTVLIPPMAGVLSALGIGLADTTAMREQSVEIPLG
PAAPQRLASVAESLERAARAELLDEGVPGERIRVVRRVHLRYEGTDTAIP
VQLAEIETMATAFESSHRALYTFLLDRPLIAEAISVEATGLTDQPDLSQL
GDQANDTTGSSETVRIYSNGLWRDAPLRRREAMRPGDVLTGPAIIAEANA
TTVVDDGWQATMTETGHLLAQRVVTPPRPDAATRAGFEAGFEADPVLLEI
FNNLFMSIAEQMGFRLEATAQSVNIRERLDFSCALFDPDGNLVANAPHIP
VHLGSMGTTVKEVIRRRLSGMKPGDVYAVNDPYHGGTHLPDITVITPVFN
TGGEDVLFFVASRGHHAEIGGITPGSMPADSREIHEEGVLFDNWLLAENG
RFREAETRRLLTEAPFGSRNPDTNLADLRAQIAANQKGVDEVGKMIDHFG
RDVVAAYMRHVQDNAEEAVRRVIDRLDNGAYRYRMDSGATIAVRITVDRA
ARSATIDFTGTSAQLDTNFNAPTSVVNAAVLYVFRTLVADDIPLNDGCLR
PLRIVVPEGSMLAPTHPAAVVAGNVETSQAITGALFAALGVQAEGSGTMN
NVTFGNERHQYYETVGSGSGAGDGYHGASVVQTHMTNSRLTDPEVLEWRY
PVLLREFAVRQGSGGAGRWRGGDGAVRRLEFTEPMTVSTLSGHRRVRPYG
MAGGSPGELGRNRVERADGSTVELAGCGSTHVEPGDTLVIETPGGGGYGP
ASTSARRRR
>Rv3824c papA1, PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA1
MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPPSYVQAR
QIRSFSEQAARGLDHSRLLIASVEVFGHCDLRAMTYVINAHLRRHDTYRS
WFELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWD
CFSFGVIQRADSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPI
GLSEAGSYVDFCVRQHEYTSALTVDSPEVRAWIDFAEINNGTFPEFPLPL
GDPSVRCGGDLLSMMLMDEQQTQRFESACMAANARFIGGMLACIAIAIHE
LTGADTYFGITPKDIRTPADLMTQGWFTGQIPVTVPVAGLSFNEIARIAQ
TSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVGPLSAVTKL
FEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRA
IRSVCMRIANGGTAERVGNVVALSPGRRNNIERMTWRSCRAGDFIDICNL
KVANVTVDREA
>Rv3820c papA2, POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA2
MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQAQHLRR
YRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDTYHSWFEF
DNAEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFG
IIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPA
GRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHFPLPLGDLSV
PHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAALAERELTNCE
TFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSAARVAQISFD
SGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAPLSTVANSD
LNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMK
SIYIRTADGTLATLKPGT
>Rv1182 papA3, PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA3
MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVPVSYMQA
QHIRGYCEQKAKGLDYSRLMVVSCQQPGQCDIRAANYVINAHLRRHDTYR
SWFQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQW
GCFRFGIVQGCDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAP
LELPPAGSYDDFCRRQHTFSSTLTVESPQVRAWTKFAEGTNGSFPDFPLP
LGDPSKPSDADIVTVMMLDEEQTAQFESVCTAAGARFIGGVLACCGLAEH
ELTGTTTYYGLTPRDTRRTPADAMTQGWFTGLIPITVPIAGSAFGDAARA
AQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAGAAPLSVLL
TAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVAR
YLATLKSVFQRVAESGQQQNVA
>Rv2939 papA5, POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA5
MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFDALLETH
PVLASHLEQSSDGGWNLVADDLLHSGICVIDGTAATNGSPSGNAELRLDQ
SVSLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTG
DPGPITPQPTPLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPA
VLAHPGLPQAVPVTRLWLSKQQTSDLMAFGREHRLSLNAVVAAAILLTEW
QLRNTPHVPIPYVYPVDLRFVLAPPVAPTEATNLLGAASYLAEIGPNTDI
VDLASDIVATLRADLANGVIQQSGLHFGTAFEGTPPGLPPLVFCTDATSF
PTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEHHGHIAEPG
KSLEAIRSLLCTVPSEYGWIME
>Rv2946c pks1, PROBABLE POLYKETIDE SYNTHASE PKS1
MISARSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRAVVVG
ASREQLIAGLAGLAAGEPGAGVAVGQPGSVGKTVVVFPGQGAQRIGMGRE
LYGELPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEFAQPALF
AVEVASFAVLRDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLVVARG
RLMQALPAGGAMVAVAASEDEVEPLLGEGVGIAAINAPESVVISGAQAAA
NAIADRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQAREPQLG
LVSNVTGELAGPDFGSAQYWVDHVRRPVRFADSARHLQTLGATHFIEAGP
GSGLTGSIEQSLAPAEAMVVSMLGKDRPELASALGAAGQVFTTGVPVQWS
AVFAGSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHALLGA
VVERPDSDEVVLTGRLSLADQPWLADHVVNGVVLFPGAGFVELVIRAGDE
VGCALIEELVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYSRGDQSQ
GWLLNAEGMLGVAAAETPMDLSVWPPEGAESVDISDGYAQLAERGYAYGP
AFQGLVAIWRRGSELFAEVVAPGEAGVAVDRMGMHPAVLDAVLHALGLAV
EKTQASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCDATGL
PVLTVRSLVTRPITAEQLRAAVTAAGGASDQGPLEVVWSPISVVSGGANG
SAPPAPVSWADFCAGSDGDASVVVWELESAGGQASSVVGSVYAATHTALE
VLQSWLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQAENPG
RIVLIDTDAAVDASVLAGVGEPQLLVRGGTVHAPRLSPAPALLALPAAES
AWRLAAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVGVNFRDVVAAL
GMYPGQAPPLGAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAVVDQQ
LVTRVPQGWSFAQAAAVPVVFLTAWYGLADLAEIKAGESVLIHAGTGGVG
MAAVQLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEEKFLA
VTEGRGVDVVLDSLAGEFVDASLRLLVRGGRFLEMGKTDIRDAQEIAANY
PGVQYRAFDLSEAGPARMQEMLAEVRELFDTRELHRLPVTTWDVRCAPAA
FRFMSQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARHLVGA
YGVRHLVLASRRGDRAEGAAELAADLTEAGAKVQVVACDVADRAAVAGLF
AQLSREYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAAWNLHQA
TSDLDLSMFALCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGLAGIS
LAWGLWEQPGGMTAHLSSRDLARMSRSGLAPMSPAEAVELFDAALAIDHP
LAVATLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDATSSKSAL
AQRLHGLAADEQLELLVGLVCLQAAAVLGRPSAEDVDPDTEFGDLGFDSL
TAVELRNRLKTATGLTLPPTVIFDHPTPTAVAEYVAQQMSGSRPTESGDP
TSQVVEPAAAEVSVHA
>Rv1660 pks10, Possible chalcone synthase pks10
MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHASAKVNSR
HLVLPLEKYPKLTDFGEANKIFIEKAVDLGVQALAGALDESGLRPEDLDV
LITATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHD
YLRGAPDGVAALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGV
KRAQDIGADGPDILDSRSHLYPDSLRTMGYDVGSAGFELVLSRDLAAVVE
QYLGNDVTTFLASHGLSTTDVGAWVTHPGGPKIINAITETLDLSPQALEL
TWRSLGEIGNLSSASVLHVLRDTIAKPPPSGSPGLMIAMGPGFCSELVLL
RWH
>Rv1665 pks11, Possible chalcone synthase pks11
MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAAAKVNGR
HLVLPLQQYPSLTDFGDANEIFIEKAVDLGVEALLGALDDANLRPSDIDM
IATATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRD
YLRGAPDDVAVLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGD
RRAEQVRAGGPDILDSRSSLYPDSLHIMGWDVGSHGLRLRLSPDLTNLIE
RYLANDVTTFLDAHRLTKDDIGAWVSHPGGPKVIDAVATSLALPPEALEL
TWRSLGEIGNLSSASILHILRDTIEKRPPSGSAGLMLAMGPGFCTELVLL
RWR
>Rv2048c pks12, Probable polyketide synthase pks12
MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGV
DSPEGLWQMVADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFV
DGVADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSA
TGVFAGLIVGGYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVD
TACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPTVFVEFSRHRGLAP
DGRCKPYAGRADGVGWSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQD
GASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEA
QALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHEL
LPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTN
AHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDD
GLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRG
TATAAGKTVFVFPGQGSQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDW
SLVDVLRGAPGAPGLDRVDVVQPVLFAVMVSLAELWKSVAVHPDAVIGHS
QGEIAAAYVAGALSLRDAARVVTLRSKLLAGLAGPGGMVSIACGADQARD
LLAPFGDRVSIAVVNGPSAVVVSGEVGALEELIAVCSTKELRTRRIEVDY
ASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADYWY
RNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSE
AIVVPTLGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAF
DKRRFWLSAEGSGADVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPN
VQPWLADHAVSDVVLFPGTGFVELAIRAGDEVGCSVLDELTLAAPLLLPA
TGSVAVQVVVDAGRDSNSRGVSIFSRADAQAGWLLHAEGILRPGSVEPGA
DLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTAMWARGEEIFAEV
RLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFAWQGVSLHATG
ASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSG
SGPDRLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQAL
AAVQSWLTDHESGVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHP
GRIVLVDSDAATDDAAIAMALATGEPQVVLRGGQVYTARVRGSRAADAIL
VPPGDGPWRLGLGSAGTFENLRLEPVPNADAPLGPGQVRVAMRAIAANFR
DIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSVFGFFPDGSGTL
VAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQRVLIHA
GTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEF
EDKFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPG
VIAQQYPGVRYRAFDLFEPGRPRMHQYMLELATLFGDGVLRPLPVTTFDV
RRAPAALRYLSQARHTGKVVMLMPGSWAAGTVLITGGTGMAGSAVARHVV
ARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVACDAADRAALAK
VIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHE
LTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAI
SLGWGLWDQASAMTGGLDAADLARLGREGVLALSTAEALELFDTAMIVDE
PFLAPARIDLTALRAHAVAVPPMFSDLASAPTRRQVDDSVAAAKSKSALA
HRLHGLPEAEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLT
AVEMRNRLKSATGLSLSPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPA
VRTTSEDPIAIVGMACRYPGGVNSPDDMWDMLIQGRDVLSEFPADRGWDL
AGLYNPDPDAAGACYTRTGGFVDGVGDFDPAFFGVGPSEALAMDPQHRML
LELSWEALERAGIDPTGLRGSATGVFAGVMTQGYGMFAAEPVEGFRLTGQ
LSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLALA
GGVTVNATPDIFVEFSRWRGLSPDGRCKAFAAAADGTGFSEGGGMLVLQR
LSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGL
SAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNM
GHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAP
RVWPAGARTRRAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVV
SAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGD
RDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMGMGLH
AGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNSTEFAQPALFAV
EVALFRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRL
MQALPAGGAMVAVQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAA
VADQLRADGRRVHQLAVSHAFHSPLMDPMIDEFAAVAAGIAIGRPTIGVI
SNVTGQLAGDDFGSAAYWRRHIRQAVRFADSVRFAQAAGGSRFLEVGPSG
GLVASIEESLPDVAVTTMSALRKDRPEPATLTNAVAQGFVTGMDLDWRAV
VGEAQFVELPTYAFQRRRFWLSGDGVAADAAGLGLAASEHALLGAVIDLP
ASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGV
VDESTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLH
AEGALRAGSAEPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGL
TAMWRRGDEVFAEVALPADAGVSVTGFGVHPVLLDAALHAVVLSAESAER
GQGSVLVPFSWQGVSLHAAGASAVRARIAPVGPSAVSIELADGLGLPVLS
VASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQPSAAVEPLPVCAWGTT
EDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAGVLVVMTRG
AVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVV
TTGEPQVLWRRGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENL
RLELIPDADAPLGPGQVRVAVSAIAANFRDVMIALGLYPDPDAVMGVEAC
GVVIETSLNKGSFAVGDRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAA
ATTSVVFATAHYALVDLAAARSGQRVLIHAGTGGVGMAAVQLARHLGLEV
FATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSL
AGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEPG
PDRIAQILAELATLFGDGVLRPLPVTTFDVRCAPAALRYLSQARHTGKVV
MLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPG
AAELVAELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGA
LDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLV
GSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQASAMTGGLATV
DFKRFARDGIVAMSSADALQLFDTAMIVDEPFMLPAHIDFAALKVKFDGG
TLPPMFVDLINAPTRRQVDDSLAAAKSKSALLQRLEGLPEDEQHAVLLDL
VRSHIATVLGSASPEAIDPDRAFQELGFDSLTAVEMRNRLKSATGLALSP
TLIFDYPNSAALAGYMRRELLGSSPQDTSAVAAGEAELQRIVASIPVKRL
RQAGVLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDD
E
>Rv3800c pks13, POLYKETIDE SYNTHASE PKS13
MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDESVPMVE
LGLSSRDAVAMAADIEDLTGVTLSVAVAFAHPTIESLATRIIEGEPETDL
AGDDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGIT
DLPDGRWSEFLEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADN
IDPQQRMALELTWEALEHARIPASSLRGQAVGVYIGSSTNDYSFLAVSDP
TVAHPYAITGTSSSIIANRVSYFYDFHGPSVTIDTACSSSLVAIHQGVQA
LRNGEADVVVAGGVNALITPMVTLGFDEIGAVLAPDGRIKSFSADADGYT
RSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLIAPNQDAQA
DVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADR
PALLGAVKTNVGHLESAAGAASMAKVVLALQHDKLPPSINFAGPSPYIDF
DAMRLKMITTPTDWPRYGGYALAGVSSFGFGGANAHVVVREVLPRDVVEK
EPEPEPEPKAAAEPAEAPTLAGHALRFDEFGNIITDSAVAEEPEPELPGV
TEEALRLKEAALEELAAQEVTAPLVPLAVSAFLTSRKKAAAAELADWMQS
PEGQASSLESIGRSLSRRNHGRSRAVVLAHDHDEAIKGLRAVAAGKQAPN
VFSVDGPVTTGPVWVLAGFGAQHRKMGKSLYLRNEVFAAWIEKVDALVQD
ELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVIGQ
SLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEY
SADEIREVFSDFPDLEVCVYAAPTQTVIGGPPEQVDAILARAEAEGKFAR
KFATKGASHTSQMDPLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGG
EPIHDVEYWKKGLRHSVYFTHGIRNAVDSGHTTFLELAPNPVALMQVALT
TADAGLHDAQLIPTLARKQDEVSSMVSTMAQLYVYGHDLDIRTLFSRASG
PQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVALPDGRHVWEYAPR
DGNVDLAALVRAAAAHVLPDAQLTAAEQRAVPGDGARLVTTMTRHPGGAS
VQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPET
PAEDADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEP
EDLPWEVPLIELGLDSLMAVRIKNRVEYDFDLPPIQLTAVRDANLYNVEK
LIEYAVEHRDEVQQLHEHQKTQTAEEIARAQAELLHGKVGKTEPVDSEAG
VALPSPQNGEQPNPTGPALNVDVPPRDAAERVTFATWAIVTGKSPGGIFN
ELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNIEALADKVRTYLEAG
QIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPADTPMYGF
ERVEGSIEERAQQYVPKLIEMQGDGPYVLVGWSLGGVLAYACAIGLRRLG
KDVRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYE
QLEELDDEGQVRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQP
YDGHVTLYMADRYHDDAIMFEPRYAVRQPDGGWGEYVSDLEVVPIGGEHI
QAIDEPIIAKVGEHMSRALGQIEADRTSEVGKQ
>Rv2947c pks15, PROBABLE POLYKETIDE SYNTHASE PKS15
MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQRATEPVA
VVGIGCRFPGGVDGPDGLWDVVSAGRDVVSEFPTDRGWDVEGLYDPDPDA
EGKTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEH
AGIDPLSLRGSATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVS
YVLGLQGPAVSVDTACSSSLVAIHWAMSSLRSGECDLALAGGVTVMGLPS
IFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVVVLERLSDARRLGHS
VLAVVRGSAVNQDGASNGLTAPNGLAQQRVIQVALANAGLSAADVDVVEA
HGTATTLGDPIEAQALLSTYGQGGPAEQPLWVGSIKSNMGHTQAAAGVAG
VIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRR
AAVSSFGISGTNAHLILEEAPVPAPAEAPVEASESTGGRGRRWCRG
>Rv1013 pks16, PUTATIVE POLYKETIDE SYNTHASE PKS16
MSRFTEKMFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAGGLAAAG
VGLGDVVGVLAGFPVEIAPTAQALWMRGASLTMLHQPTPRTDLAVWAEDT
MTVIGMIEAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVG
EDDLALMQLTSGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSW
LPCFHDMGMVGFLTIPMFFGAELVKVTPMDFLRDTLLWAKLIDKYQGTMT
AAPNFAYALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDLLDAG
KPFGLRPSAILPAYGMAETTLAVSFSECNAGLVVDEVDADLLAALRRAVP
ATKGNTRRLATLGPLLQDLEARIIDEQGDVMPARGVGVIELRGESLTPGY
LTMGGFIPAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPT
DIERAAGRVDGVRPGCAVAVRLDAGHSRESFAVAVESNAFEDPAEVRRIE
HQVAHEVVAEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT
>Rv1663 pks17, Probable polyketide synthase pks17
MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHV
GKVVLTMPDAWAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGE
HTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAG
VLDDAVITGLTPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGI
VGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQASAMTEHLGE
RDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELP
ALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLTPEQRHRELTELVCSNA
AIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY
PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPD
DKTRLIKRLQAILTDCTAPPASSGPSTTHDDEDITTATESQLFAILDDEL
GP
>Rv3825c pks2, PROBABLE POLYKETIDE SYNTHASE PKS2
MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPELLWKAL
LRGDDLITEVPPDRWDCDEFYDPQPGVPGRTVCKWGGFLDNPADFDCEFF
GIGEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGD
YTMVAADAKQLEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGL
TAVHMACRSLHEGESDVALAGGVALMLEPRKAAAGSALGMLSPTGRCRAF
DVAADGFVSGEGCAVVVLKRLPDALADGDRILAVIRGTSANQDGHTVNIA
TPSQPAQVAAYRAALAAGGVDAATVGMVEAHGPGTPIGDPIEYASVSEVY
GVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRNLHFTRLPD
EIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQ
TEAQPHAASTPPTGTPALFTLSASSADALRQTAQRLTDWIQQHADSLVLS
DLAYTLARRRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDR
GPVWLFSGQGSQWAAMGADLLTNESVFAATVAELEPLIAAESGFSVTEAM
TAPETVTGIDRVQPTIFAMQVALAATMAAYGVRPGAVIGHSMGESAAAVV
AGVLSAEDGVRVICRRSKLMATIAGSAAMASVELPALAVQSELTALGIDD
VVVAVVTAPQSTVIAGGTESVRKLVDIWERRDVLARAVAVDVASHSPQVD
PILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHTVR
FSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQ
PLPLGLRRLLTDLHNAGAAVDFSVLCPQGRLVDAPLPAWSHRFLFYDREG
VDNRSPGGSTVAVHPLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIH
NVAALPGAAYCEMALSAARAVLGEQSEVRDMRFEAMLLLDDQTPVSTVAT
VTSPGVVDFAVEALQEGVGHHLRRASAVLQQVSGECEPPAYDMASLLEAH
PCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATATMLAEVALPGSI
RSQQGLYAIHPALLDACFQSVGAHPDSQSVGSGLLVPLGVRRVRAYAPVR
TARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHN
RVLNERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFRE
HSAACTTMRWPLHDDQLAAADQLRDQVGSDEFSGVVVLTGSNTGTPHQGS
ADRGAEYVRRLVGIARELSDLPGAVPRMYVVTRGAQRVLADDCVNLEQGG
LRGLLRTIGAEHPHLRATQIDVDEQTGVEQLARQLLATSEEDETAWRDNE
WYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPGDMQTIELAAFHRVP
PGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGVVTAVGP
GVTDHKVGDHVGGMSPNGCWGTFVTCDARLAATLPPGLGDAQAAAVTTAH
ATAWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTP
QRRELLRNMGIEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAG
LKLLAFRGRFVEIGKRDIYGDTKLGLFPFRRNLSFYAVDLGLLSATHPEE
LRDLLGTVYRLTAAGELPMPQSTHYPLVEAATAIRVMGNAEHTGKLVLHI
PQTGKSLVTLPPEQAQVFRPDGSYIITGGLGGLGLFLAEKMAAAGCGRIV
LNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAERLVATAVATGL
PVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDW
FCLFSSAAALTGSPGQSAYSAANSWLDAFAHWRQAQGLPATAIAWGAWSD
IGQLGWWSASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPV
IGAPWLVAFAERSRFFEVFSSSNGSGTSKFRVELNELPRDEWPARLRQLV
AEQVSLILRRTVDPDRPLPEYGLDSLGALELRTRIETETGIRLAPKNVSA
TVRGLADHLYEQLAPDDAPAAALSSQ
>Rv1180 pks3, PROBABLE POLYKETIDE BETA-KETOACYL SYNTHASE PKS3
MRTATATSVAVIGMACRLPGGIDSPQRLWEALLRGDDLVGEIPADRWDAN
VYYDPEPGVPGRSVSRWGAFLDDVGGFDCDFFGLTEREATAIDPQHRLLL
EVSWEAIEHAGVDPATLAESQTGVFVGLTHGDYELLSADCGAAEGPYGFT
GTSNSFASGRVAYTLGLHGPAVTVDTACSSGLTAVHQACRSLDDGESDLA
LAGGVVVTLEPRKSVSGSLQGMLSPTGRCHAFDEAADGFVSGEGCVVLLL
KRLPDAVRDGDRVLAIVRGTAANQDGRTVNIAAPSAQAQIAVYQQALAAA
GVEASTVGMVEAHGTGTPVGDPVEYASLAAVYGTEGPCALTSVKTNFGHL
QSASGPLGLMKTILALRHGVVPQNLHFCRLPDQLAEIDTELFVPQANTSW
PDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEPAASGPELTPEAGGLAL
FPVSATSAEQLHVTAARLADWVDQNGNAGSRVSMRDLG
>Rv1181 pks4, PROBABLE POLYKETIDE BETA-KETOACYL SYNTHASE PKS4
MTASSFDELSAALRDVAGDQIPYQPAVGHDDRGPVWVFSGQGSQWPGMGT
ELLVAEPVFAATVAAMEPVIARESGFSVTEAMSAPQTVSGIDRVQPTIFA
VQVALAAALKSYGVRPGAIIGHSLGEAAAAVVAGALSLHDGLRVICRRSR
LMSRIAGSGAMASVELPGQQVLSELAIRGISDVVLSVVASPTSTVVGGAT
QSIRDLVAAWEQQDVLAREVAVDVASHTPQVDPILDELLEVLAEVDPTAP
EIPYYSATLWDPRERPSFTGEYWVENLRYTVRFAAAVQAALKDGYRVFGE
LAPHPLLTYAVEQNAASLDMPIATLAAMRRGEQLPFGLRGFVADVHNAGA
KVDFSVQYPDGRLVDAPLPSWTHRTLMLSREDSHRSHTGAVQAVHPLLGA
HVHLLEEPERHVWQAGVGTGAHPWLGDHRIHNVAAFPGAAYCEMALAAAR
TTLGELSEVRDIKFEQTLLLDEQTVVSSAATIAAPGILQFAVESHQEGEP
ARRASAMLHALEEMPQPPGYDTNALTAAHESSMSGEELRKMFNSLGIQYG
PAFSGLVAVHTARGDVTTVLAEVALPGAIRSQQSAYASHPALLDACFQSV
LVHPEVQKATVGGLMLPVGVRRLRNYHSTRSAHYCLARVTSSSRAGECEA
DLDVFDQAGTVLLTVEGLRLAAGISEHERANRVFDERLLTIEWERGELPE
VPQIDAGSWLLLSASEADPLTAQLADALNAVGAQSTSVASASDVAQLRSL
LGGRLTGVVVVTGPPTGGLTQCGRDYVSQLVGIARELAELPGEPPRLFVV
TRSAASVLPSDLANLEQAGLRGLMRVIDSEHPHLGATAIDVDNDETVAAL
VASQLQSGSQEDETAWRNGIWYTARLRPGPLRPAERRTAVVEYRRDGMRL
QIRTPGDLESLEFVTFDRVAPGPGEIEVAVTASSVNFADVLVAFGRYPTF
EGYRQQLGIDFAGVVTAVGPDVTEHRIGDHVGGMSANGCWSTFVRCDARL
AVTLPPELPVAAAAAVPTASATAWYALHDLARICSDDKVLIHSGTGGVGQ
AAIAIARAAGCEIFATAGSAQRRQLLHDMGVEHVYDSRSTEFAEQIRGDT
DGYGVDVVLNSLPGAAQRAGIELLAFGGRFVEIGKRDIYGDTRLGLFPFR
RNLSLYAVDLALLTHSHPHTVRRLLKTVYQHTVEGTLPVPQTTHYPIHDA
AVAIRLVGGAGHTGKVVLDVPRTGEGVAVVPPEQVRTSRPDGAYLVTGGL
GGLGLFLAGELAAAGCGRIVLNSRSTPSPHATRVIERLRAAGADIQVECG
DIADAATAHRVVAVATASGLPVRGVLHAAAVVEDATLANVTDELIDRCWA
PKVHGAWNIHRATAAQPLEWFCLFSSAAALVGSPGQGAYAAANSWLDAFA
HWRRAQGLPATSIAWGAWAEIGRATALAEGTGAAIAPAEGARAFQTLLRY
GRAYSGYAPIMGTPWLTAFAQRSRFAEAFHATGQNQPATGKFLAELGSLP
REEWPRTVRRLVSDQISLLLRRTIDPDRPLSDYGLDSLGNLELRTRIETE
TGIRVSPTKITTVRGLAEHVCDELAAAQSAPV
>Rv1527c pks5, Probable polyketide synthase pks5
MGKERTKTVDRTRVTPVAVIGMGCRLPGGIDSPDRLWEALLRGDDLVTEI
PADRWDIDEYYDPEPGVPGRTDCKWGAYLDNVGDFDPEFFGIGEKEAIAI
DPQHRLLLETSWEAMEHGGLTPNQMASRTGVFVGLVHTDYILVHADNQTF
EGPYGNTGTNACFASGRVAYAMGLQGPAITVDTACSSGLTAIHLACRSLH
DGESDIALAGGVYVMLEPRRFASGSALGMLSATGRCHAFDVSADGFVSGE
GCVMLALKRLPDALADGDRILAVIRGTAANQDGHTVNIATPSRSAQVAAY
REALDVAGVDPATVGMVEAHGPGTPVGDPIEYASLAEVYGNDGPCALASV
KTNFGHTQSAAGALGLMKAVLALQHGVVPQNLHFTALPDKLAAIETNLFV
PQEITPWPGADQETPRRAAVSSYGMTGTNVHAIVEQAPVPAPESGAPGDT
PATPGIDGALLFALSASSQDALRQTAARLADWVDAQGPELAPADLAYTLA
RRRGHRPVRTAVLAATTAELTEALREVATGEPPYPPAVGQDDRGPVWVFS
GQGSQWAGMGADLLATEPVFAATIAAIEPLIAAESGFSVTEAMTAPEVVT
GIDRVQPTLFAMQVALAATMKSYGVAPGAVIGHSLGESAAAVVAGALCLE
DGVRVICRRSALMTRIAGAGAMASVELPAQQVLSELMARGVNDAVVAVVA
SPQSTVIGGATQTVRDLVAAWEQRDVLAREVAVDVASHSPQVDPILDELA
EALAEISPLQPEIPYYSATSFDPREEPYCDAYYWVDNLRHTVRFAAAVQA
ALEDGYRVFTELTPHPLLTHAVDQTARSLDMSAAALAGMRREQPLPHGLR
ALAGDLYAAGAAVDFAVLYPTGRLINAPLPTWNHRRLLLDDTTRRIAHAN
TVAVHPLLGSHVRLPEEPERHVWQGEVGTVTQPWLADHQIHGAAALPGAA
YCEMALAAARAVLGEASEVRDIRFEQMLLLDDETPIGVTATVEAPGVVPL
TVETSHDGRYTRQLAAVLHVVREADDAPDQPPQKNIAELLASHPHKVDGA
EVRQWLDKRGHRLGPAFAGLVDAYIAEGAGDTVLAEVNLPGPLRSQVKAY
GVHPVLLDACFQSVAAHPAVQGMADGGLLLPLGVRRLRSYGSARHARYCC
TTVTACGVGVEADLDVLDEHGAVVLAVRGLQLGTGASQASERARVLGERL
LSIEWHERELPENSHAEPGAWLLISTCDATDLVAAQLTDALKVHDAQCTT
MSWPQRADHAAQAARLRDQLGTGGFTGVFVLTAPQTGDPDAESPVRGGEL
VKHVVRIAREIPEITAQEPRLYVLTHNAQAVLSGDRPNLEQGGMRGLLRV
IGAEHPHLKASYVDVDEQTGAESVARQLLAASGEDETAWRNDQWYTARLC
PAPLRPEERQTTVVDHAEAGMRLQIRTPGDLQTLEFAAFDRVPPGPGEIE
VAVTASSINFADVLVTFGRYQTLDGRQPQLGTDFAGVVSAVGPGVSELKV
GDRVGGMSPNGCWATFVTCDARLATRLPEGLTDAQAAAVTTASATAWYGL
QDLARIKAGDKVLIHSATGGVGQAAIAIARAAGAQIYATAGNEKRRDLLR
DMGIEHVYDSRSVEFAEQIRRDTAGYGVDIVLNSVTGAAQLAGLKLLALG
GRFIEIGKRDIYSNTRLELLPFRRNLAFYGLDLGLMSVSHPAAVRELLST
VYRLTVEGVLPMPQSTHYPLAEAATAIRVMGAAEHTGKLILDVPHAGRSS
VVLPPEQARVFRSDGSYIITGGLGGLGLFLAEKMANAGAGRIVLSSRSQP
SQKALETIELVRAIGSDVVVECGDIAQPDTADRLVTAATATGLPLRGVLH
AAAVVEDATLANITDELIERDWAPKAYGAWQLHRATADQPLDWFCSFSSA
AALVGSPGQGAYAAANSWLDTFTHWRRAQDLPATSIAWGAWGQIGRAIAF
AEQTGDAIAPEEGAYAFETLLRHNRAYSGYAPVIGSPWLTAFAQHSPFAE
KFQSLGQNRSGTSKFLAELVDLPREEWPDRLRRLLSKQVGLILRRTIDTD
RLLSEYGLDSLSSQELRARVEAETGIRISATEINTTVRGLADLMCDKLAA
DRDAPAPA
>Rv0405 pks6, PROBABLE MEMBRANE BOUND POLYKETIDE SYNTHASE PKS6
MTDGSVTADKLQKWFREYLSTHIECHPNEVSLDVPIRDLGLKSIDVLAIP
GDLGDRFGFCIPDLAVWDNPSANDLIDSLLNQRSADSLRESHGHADRNTQ
GRGSINEPVAVIGVGCRFPGDIDGPERLWDFLTEKKCAITAYPDRGFTNA
GTFAESGGFLKDVAGFDNRFFDIPPDEALRMDPQQRLLLEVSWEALEHAG
IIPESLRLSRTGVFVGVSSTDYVRLVSASAQQKSTIWDNTGGSSSIIANR
ISYFLDIQGPSIVIDTACSSSLVAVHLACRSLSTWDCDIALVGGTNVLIS
PEPWGGFREAGILSQTGCCHAFDKSADGMVRGEGCGVIVLQRLSDARLEG
RRILAILTGSAVNQDGKSNGIMAPNPSAQIGVLENACKSARVDPLEIGYV
EAHGTGTSLGDRIEAHALGMVFGRKRPGSGPLMIGSIKPNIGHLEGAAGI
AGLIKAVLMVERGSLLPSGGFTEPNPAIPFTELGLRVVDELQEWPVVAGR
PRRAGVSSFGFGGTNAHVIVEEAGSVGADTVSGRADVGGSGGGVVAWVIS
GKTASALAAQAGRLGRYVRARPALDVVDVGYSLVSTRSVFDHRAVVVGQT
RDELLAGLAGVVAGRPEAGVVCGVGKPAGKTAFVFAGQGSQWLGMGSELY
AAYPVFAEALDAVVDELDRHLRYPLRDVIWGHDQDLLNTTEFAQPALFAV
EVALYRLLMSWGVRPGLVLGHSVGELAAAHVAGALCLPDAAMLVAARGRL
MQALPAGGAMFAVQAREDEVAPMLGHDVSIAAVNGPASVVISGAHDAVSA
IADRLRGQGRRVHRLAVSHAFHSALMEPMIAEFTAVAAELSVGLPTIPVI
SNVTGQLVADDFASADYWARHIRAVVRFGDSVRSAHCAGASRFIEVGPGG
GLTSLIEASLADAQIVSVPTLRKDRPEPVSVMTAAAQGFVSGMGLDWASV
FSGYRPKRVELPTYAFQHQKFWLAPAPSVSDPTAAGQIGASDGGAELLAS
SGFAARLAGRSADEQLAAAIEVVCEHAAAVLGRDGAAGLDAGQAFADSGF
NSLSAVELRNRLTAVTAVTLPATAIFDHPTPTELAQYLITQIDGHGSSAA
AAANPAERIDALTDLFLQACDAGRDADGWKMVALASNTRERMSSPVRNNV
SKNVALLADGISDVVVICIPTLTVLSDQREYRDIANAMTGRHSVYSLTLP
GFDSSDALPQNADMIVETVSNAIIDVVGGSCRFVLSGYSSGGVLAYALCS
HLSVKHQRNPLGVALIDTYLPSQIANPSMNEGFSPNDTGKGLSREVIRVA
RMLNRLTATRLTAAATYAAIFQAWEPGRSMAPVLNIVAKDRIATVENLRE
ERINRWRTAAAEAAYSVAEVPGDHFGMMSTSSEAIATEIHDWISGLVRGP
HR
>Rv1661 pks7, Probable polyketide synthase pks7
MNSTPEDLVKALRRSLKQNERLKRENRDLLARTTEPVAVVGMGCRYPGGV
DSPETLWELVAHGRDAVSEFPADRGWDVAGLFDPDPDAVGKSYTRCGGFL
TDVAGFDAEFFGIAPSEALAMDPQQRLLLEVSWEALERAGIDPITLRGSQ
TGVFAGVFHGSYGGQGRVPGDLERYGLRGSTLSVASGRVAYVLGLQGPAV
SVDTACSSSLVALHLAVQSLRLGECDLALVGGVTVMATPAMFIEFSRQRA
LSADGRCKAYAGAADGTAFAEGAGVLVLARLADARRLGHPVLALVRGSAV
NQDGASNGLATPNGPAQQRVITAALASARLGVADVDVVEGHGTGTTLGDP
IEAQAILATYGQRPADRPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHG
VLPKTLHVDVPTPHVDWSAGAVSLLTEPRPWHVPGRPRRAGVSSFGISGT
NAHVILEEAPAVEPVGAAHGNDPVAVPWVLSARSAQALTNQARRLLAWVG
ADENVRPLDVGWSLVNTRSLFDHRAVVVGADRTQLMEGLTGLAAGVPGAD
VVAGRAQTVGKTAFVFPGQGAQWLGMGAQLCATAPVFAEHIHRCERALRE
HVEWSLLDVLRGAPGAPGLDRVDVVQPALWAVMVSLAELWRSVGVVPDAV
IGHSQGEIAAAYVAGALSLRDAAAVVALRSRLLVRLGGAGGMVSLACGQP
QAEKLASQWGDRLNIAAVNGVSSVVLAGETDAVTELMQRCEAEGIRARRI
DVDYASHSAQVDAIREELIAALRGIEPRTSTVAFFSTVTGELMDTAGVNA
EYWYRSIRQPVQFERAVRNAFDGGYRVFVESSPHPVLIAGIEETLVDCDR
GATGEPIVIPTLGRDDGGVGRFWLSAGQAHVAGVGVDWRAAFADLGGRRV
ELPTYAFARQRFWLDGLGAVGGDLGGVGLVGAEHGLLAAVVQRPDSGGVV
LTGRISVVAAPWLADHAVGPVVLFPGTGFVELALRAGDEVGCSVLQELTL
QAPLVLPADGVRVQVVVGGVEQSGTRNVWVYSAAGQADSSPGWTLHAQGV
LGVGSVQPAAELSVWPPVGARAMDVADGYQVLAARGYGYGPAFRGLQALW
RRGAEVFADVTLPEGVPIRGFGIHPAVLDAALHAWGIVEGEQQTMLPFSW
QGVCLHASGAARVRVRLAPVGRGAVSVELADPQGLPVLSVRQLMVRPVSA
AALSRSTAGDRGLLEMIWTPVPLEGGDIGDDAVVWELPPHAGAQAGGDVL
AAVYRGVHEVLEVLQSWLASDATGLGVVVTRGAVGPVDDDVTDLAGAAVW
GLVRSAQAEHPGRVVLVDTDGSVAVEDAVGFGARSGEPQLVVRRGRVYAA
RLAPVAAGLTLPSASAGGWRLVAGGGGTLADVVVAPVAPVELATGQVRVA
VGAVGVNFRDVLVALGMYPGGGELGVDGAGVVVEVGPGVTGLAVGDRVMG
LLGLVGSEAVVDARLVTMVPAGWSLVEAAAVPVAFLTAFYGLSVLAEVAA
GQKVLVHAGTGGVGMAAVSLARYWGAEVFVTASRAKWDTLRAMGFDDIHI
SDSRSLEFEEAFLRATEGSGVDVVLNSLAGEFTDASLRLLPSGGRFIELG
KTDIRDGQTVAERHRGVRYRAFDLVEAGPDRIAAMLSEVVGLLAAGVLAR
LPVKTFDARCAPAAYRFVSQARHIGKVVLTIPDGPGGQSGLAGGTVVVTG
GTGMAGSAVATHLVRRHGVANLVLVSRSGEQADRAAEVAALLREGGAQVA
VVSCDVADRDALAALLAGLDPRYPLKGVFHAAGVLDDAVITGLTPDRVDT
VLRAKVDGAWNLHELTEDMDLSAFVVFSSMAGIVGTPAQGNYAAANAFLD
GLVAYRRSRGLAGLSVAWGLWEQASAMTRHLGERDRARMTQAGLAPLTTE
QALGFLDTALQADRAVVVAARLDRAALAGAGAALPALFSQLAAGPTRRRI
DAADTAVSMSGLVSRLHALTPERRQRELTDLVISNAAAVLGRSSSVDINA
HKAFQDLGFDSLTAVELRNRLKTATGLTLSPTLIFDYPTPATLAEHLDSR
LVTASGSDQQSLSDRVDDITRELVVLLDQPDLSANVKAHLRTRLQTMLTS
LTTEDDDIAAATESQLFAILDEELGS
>Rv1662 pks8, Probable polyketide synthase pks8
MSGTTTHVDYLKRLTADLRRTRRRLSDLEAKLSEPVAVVGMGCRYPGGVD
SPETLWELVAQGRDAVSDFPADRGWDVDGLFDPDPDACGKMYTRRGTFLE
HAGDFDAGFFGIGPSEALAMDPQQRLLLEVSWEALERTGIDPTKLRGSAT
GVFAGVIHAGYGGQLSGELEGYGLTGSTLSVASGRVAYVLGLEGPAVSVD
TACSSSLVALHLAVQSLRSGECDLALAGGVTVMATPAAFVEFSRQRALAR
DGRCKVYAGAADGTAWSEGAGVLVVERLVDARRLGHPVLALVRGSAVNQD
GASNGLTAPNGPSQQRVIRAALASARLRAVEVDVVEGHGTGTMLGDPIEA
QALLATYGQDRVEPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHGVMPK
TLHVDVPTPHVDWSVGAVSLLTQPRAWSVHGRPRRAGVSSFGISGTNAHV
ILEQAPVVESVVPEVASPTAASAVPWVLSARSEQALAGQAQRLLAFVAAN
PDLDPIDVGWSLVKTRAMFEHRAVVVGADRGALLAGLAALAAGESGAGVA
VGRARSVGKTVFVFPGQGAQWVGMGAQLYAELPLFALAFDAVAEELDRHL
RLPLRNVLWEGDEALLTSTEFAQPALFAIEVALATLLQHWGISPDFLIGH
SVGEIAAAHLAGVLSLTDAAGLVAARGRLMAELPAGGVMVVVAASEEEVL
PVLVDGANLAAVNAPHSVVVSGCEAAVSDIADHFARRGRRVHRLAVSHAF
HSLLMEPMLAEFTRIAAGISVSKPRIPLVSNVTGQMAGAGYGDGQYWVEH
ARRPVRFAEGVQLLNAVGATRFVEVGPGGGLTALVEQSLPLGEALSVAMM
RREHPEVSSVLGAVATLFTAGAQMDWPAVFGSPGRRIELPTYAFQRQRYW
LPPTSAGSADISGVGLLAARHGLLGAVVEQPDSDVVVLTGRLSVGEQRWL
ADHVIAGVVLLAGAAFVELALRAADQVDCGVVEELTVVTPLVLPTVGGVQ
LQVVVGVGEMGQRPVSIYSRNAESDSGWVLHARGVLGAKAVAPAADLSVW
PPLGAAPVDVDGAYQRFAELGYEYGRAFQGLTAMWRRESELFADVAVPDD
VDVTLSGFGIHPLVLDAALHAMGMVGEQAATMLPFSWQGVSLHAAGASRV
RARIAPAGDGTVSVELADQAGLPVLSVQALVMRSVSSQLLSAAVAAADAA
GRGLLEVAWLPVELAHNDISADLVVWELESFQDGVGPVYSATHRVLVALQ
SWLAQERAGRLVVLTQGSVGQDATNLAGAAVWGLVRSAQAEHPGRVMLVD
SDGSMDVGDVIGCGEEQLMIRNGTAYAARLAQLRPQPILQLPDTNSGWRL
VAGGAGALEDLTLASCPAKELAPGQVRIEVRALGVNFRDVLVALGIYPGA
AELGAEGAGVVTEVGPGVTGLAVGDPVMGLLGVAGSEAVVDARLVVKLPN
RWPLTDAAGVPVVFLTAYYALRVLAQVQPGESVLVHAAAGGVGMAAVQLA
RLWGLEVFATASRGKWDTLHTMGCDNTHVADSRTLAFEETFWLTTEGRGV
DVVLNSLAGEFTDASLRLLPRGGRFIEMGKTEFGTPRSLPRTILGWPTGL
ST
>Rv1664 pks9, Probable polyketide synthase pks9
MQPTGIAIIGLACRFPTVVSPGDLWDLLRDGREAAGSIDNVADFDADFFN
LSPREASAMDPRQRLALELTWELLEDAFVVPETLRGQPIAVYLGAMNDDY
AVLTLAADRVDHHAFAGTSRAIIANRVSFAFGLRGPSVTIDSGQSSSLVA
VHLACESVRTGEAPLAIAGGVHLNLARETAMLEQEFGAVSPSGHTYAFDE
RADGYVPGDGGGLVLLKPVQAALDDGDRIHAIIRGSAVGNAGHSATGLTV
PSVAGQVDVIRRAMSGAGVDCHQVHYVEAHGTGTKIGDPIEARALGEIFA
ARQRRPVSVGSVKTNIGHTGGAAGIAGLLKAVLAIENAVIPPSLNYVGAA
IDLDSLGLRVDTALTPWPVADEPRRAGVSSFGMGGTNAHVILEQGPTQSP
EIVESVAAAGSNAPVAVPWVLAARSPQALTNQAGRLLAHLTADDGLTALD
VGWSLVSTRSVFDHRAVVVGADRGRLMAGLAGLAAGEPGAGVVVGRARSV
GKTVFVFPGQGSQWLGMGRQLYGRYSVFARAFDEVVAVLDGQLRLSVRQV
MWGADAGLLESTEFAQPALFVVQVALAALLQDWGVLPDLVMGHSVGEIAA
AYVAGALSLVDAARVVAARGRLMQALPAGGVMVAVAASEDEVAPLLTEGV
CIAAVNAPESVVISGEQAAVGVVVDRLVGLGRRVRRLAVSHAFHSVLMDP
MVEEFSKVLADVCVRAPRIGLVSNVTGQLAGAGYGSPAYWVEHVRKPVRF
FDGVGLAESLGARVFVEVGPGAGLEASVALLARDRPEVESVLAGVGRLFA
EGVAVDWSSVFAGLGGRRVELPTYGFARQRFWLGDNGELSVDQTGKDAGA
IARLQSLAPPELQRQLVELVCFHAAIVLGRKSSHDIDPECAFQDLGFDSM
SGVELRNRLQMAIGLPGLSLPRTLIFDYPTASALAECLGQLLGGQHESSD
DESIWQLLKNIPIHQLRRTGLLDKLLLLAGQPEESLAGRTVSDEVIDSLS
PEALIGLALDEDENDIR
>Rv2043c pncA, PYRAZINAMIDASE/NICOTINAMIDAS PNCA (PZase)
MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHVVATKDF
HIDPGDHFSGTPDYSSSWPPHCVSGTPGADFHPSLDTSAIEAVFYKGAYT
GAYSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNG
LATRVLVDLTAGVSADTTVAALEEMRTASVELVCSS
>Rv2931 ppsA, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSA
MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSRDAVVLS
GELSELLGRTVSPIDFWEHPTINALAAYLAAPEPSPDSDAAVKRGARNSL
DEPIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPP
EVAAALARTTRWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWE
ALEHAGIPPGTLRRSATGVFAGACLSEYGAMASADLSQVDGWSNSGGAMS
IIANRLSYFLDLRGPSVAVDTACSSSLVAIHLACQSLRTQDCHLAIAAGV
NLLLSPAVFRGFDQVGALSPTGQCRAFDATADGFVRGEGAGVVVLKRLTD
AQRDGDRVLAVICGSAVNQDGRSNGLMAPNPAAQMAVLRAAYTNAGMQPS
EVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTE
AAAGIAGFIKTVLAVQHGQIPPNQHFETANPHIPFTDLRMKVVDTQTEWP
ATGHPRRAGVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGK
TMQRVSATAGMLADWMEGPGADVALADVAHTLNHHRSRQPKFGTVVARDR
TQAIAGLRALAAGQHAPGVVNPADGSPGPGTVFVYSGRGSQWAGMGRQLL
ADEPAFAAAVAELEPVFVEQAGFSLHDVLANGEELVGIEQIQLGLIGMQL
ALTELWCSYGVRPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRSRLMA
PLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQIDE
LIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGII
STTYADLHTQPVFDAEHWATNMRNPVRFQQAIASAGSGADGAYHTFIEIS
AHPLLTQAIIDTLHSAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHP
PHTPHPPEPHPPIPTTPWQHTRHWITTKYPAGSVGSAPRAGTLLGQHTTV
ATVSASPPSHLWQARLAPDAKPYQGGHRFHQVEVVPASVVLHTILSAATE
LGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSPAAGTPSDRWTRH
VTAQLSSSPSDSASSLNEHHRANGQPPERAHRDLIPDLAELLAMRGIDGL
PFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADS
RLYVPASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPS
LSMRSLRYRALDFGLDVGRAQPPASTGPVEAYCDATNFVHTIDWQPQTVP
DATHPGAEQVTHPGPVAIIGDDGAALCETLEGAGYQPAVMSDGVSQARYV
VYVADSDPAGADETDVDFAVRICTEITGLVRTLAERDADKPAALWILTRG
VHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDLAINDDLGEFGPALA
ELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYLITGGLG
ALGLLMADWLADRGAHRLVLTGRTPLPPRRDWQLDTLDTELRRRIDAIRA
LEMRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQL
VTSMTGDAVRQVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQ
GSYAAANSYLDALARARRQQGCHTMSLDWVAWRGLGLAADAQLVSEELAR
MGSRDITPSEAFTAWEFVDGYDVAQAVVVPMPAPAGADGSGANAYLLPAR
NWSVMAATEVRSELEQGLRRIIAAELRVPEKELDTDRPFAELGLNSLMAM
AIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHDVSQDNQISAL
SSSAGSVLDSLFDRIESAPPEAERSV
>Rv2932 ppsB, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSB
MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCRFPGDVD
GPESFWDFLVAGRNAISTVPADRWDAEAFYHPDPLTPGRMTTKWGGFVPD
VAGFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTA
VMMGVYFNEYQSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVA
VDTACSSSLVAVHLACQSLRLRETDLALAGGVSITLRPETQIAISAWGLL
SPQGRCAAFDAAADGFVRGEGAGVVVLKRLTDAVRDGDQVLAVVRGSAVN
QDGRSNGVTAPNTAAQCDVIADALRSGDVAPDSVNYVEAHGTGTVLGDPI
EFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATLAVQRATIP
PNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAH
VIIEQGSELAPVSEGGEDTGVSTLVVTGKTAQRMAATAQVLADWMEGPGA
EVAVADVAHTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVS
HQDGSPGPGTVFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQA
GFSLRDVIATGKELVGIEQIQLGLIGMQLTLTELWRSYGVQPDLVIGHSM
GEVAAAVVAGALTPAEGLRVTATRARLMAPLSGQGGMALLGLDAAATEAL
IADYPQVTVGIYNSPRQTVIAGPTEQIDELIARVRAQNRFASRVNIEVAP
HNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWATN
MRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTK
SAAKYLSIGTLQRDADDTVTFRTNLYTADIAHPPHTCHPPEPHPTIPTTP
WQHTHHWIATTHPSTAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPI
RPAVSADPPSTAAWLVVADNELCHELARAADSRVDSLSPPALAAGSDPAA
LLDALRGVDNVLYAPPVPGELLDIESAYQVFHATRRLAAAMVASSATAIS
PPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEHPEIWGGIIDLDD
SMPAELAVRHVLTAAHGTDGEDQVVYRSGARHVPRLQRRTLPGKPVTLNA
DASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATG
TDLIAVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTD
DDVTTMFRPKLDALALLHRRSLKSPVRHFVLFSSVSGLLGSRWLAHYTAT
SAFLDSFAGARRTMGLPATVVDWGLWKSLADVQKDATQISAESGLQPMAD
EVAIGALPLVMNPDAAVATVVVAADWPLLAAAYRTRGALRIVDDLLPAPE
DVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATVMGMPPTEPLDPSAG
FFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYLATVLPE
LLEIGATAVATQQATDSYHELTEAELLEQLSERLRGTQ
>Rv2933 ppsC, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSC
MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGCRFPGGV
NNPEQFWDLLCAGRSGIVRVPAQRWDADAYYCDDHTVPGTICSTEGGFLT
SWQPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGT
QTSVFVGVTAYDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGAR
GPAVVIDTACSSSLVAVHLACQSLRGRESDMALVGGTNLLLSPGPSIACS
RWGMLSPEGRCKTFDASADGYVRGEGAAVVVLKRLDDAVRDGNRILAVVR
GSAVNQDGASSGVTVPNGPAQQALLAKALTSSKLTAADIDYVEAHGTGTP
LGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVAGLMKAVLA
VHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFG
VSGTNAHVVIEQAPDPMAAAGTEPQRGPVPAVSTLVVFGKTAPRVAATAS
VLADWLDGPGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRAL
AAGQSAPGVVAPREGSIGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAI
AELEPEFVAQGGFSLRDVIAGGKELVGIEQIQLGLIGMQLALTALWRSYG
VTPDAVIGHSMGEVAAAVVAGALTPAQGLRVTAVRSRLMAPLSGQGTMAL
LELDAEATEALIADYPEVSLGIYASPRQTVISGPPLLIDELIDKVRQQNG
FATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLGIS
LGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSIS
DTLRASYDVDNYLSIGTLQRDAHDTLEFHTNLNTTHTTHPPQTPHPPEPH
PVLPTTPWQHTQHWITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELD
PDLLWLADHVIDDLVVLPGAAYAEIALAAATDTFAVEQDQPWMISELDLR
QMLHVTPGTVLVTTLTGDEQRCQVEIRTRSGSSGWTTHATATVARAEPLA
PLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGPAFQGIVGLAVTQ
AGVARAQVRLPASARTGSREFMLHPVMMDIALQTLGATRTATDLAGGQDA
RQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTD
ANGQPLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGAL
LLIGDPAAGDPLLPALQSSLRDRITDLELASAADEATLRAAISRTSWDGI
VVVCPPRANDESMPDEAQLELARTRTLLVASVVETVTRMGARKSPRLWIV
TRGAAQFDAGESVTLAQTGLRGIARVLTFEHSELNTTLVDIEPDGTGSLA
ALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSGDLAAEARHQVVNLD
SSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAAGLNFSD
VLKAMGVYPGLDGAAPVIGGECVGYVTAIGDEVDGVEVGQRVIAFGPGTF
GTHLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVL
IHSATGGVGMAAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSV
DFADEILELTDGYGVDVVLNSLAGEAIQRGVQILAPGGRFIELGKKDVYA
DASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVL
PVTAFSLHDAADAFRLMASGKHTGKIVISIPQHGSIEAIAAPPPLPLVSR
DGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVAAAIAELNA
SGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNM
TDSAARRVFAPKVTGSWRLHVATAARDVDWWLTFSSAAALLGTPGQGAYA
AANSWVDGLVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQG
LAAMQAVLTADRGRTGVFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSG
QRRGGGAIRAQLDALDAAERPGHLASAIADEIRAVLRSGDPIDHHRPLET
LGLDSLMGLELRNRLEASLGITLPVALVWAYPTISDLATALCERMDYATP
AAAQEISDTEPELSDEEMDLLADLVDASELEAATRGES
>Rv2934 ppsD, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSD
MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIGCRFPGN
VTGPESFWQLLADGVDTIEQVPPDRWDADAFYDPDPSASGRMTTKWGGFV
SDVDAFDADFFGITPREAVAMDPQHRMLLEVAWEALEHAGIPPDSLSGTR
TGVMMGLSSWDYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPA
VAVDTACSSSLVAIHLACQSLRLRETDVALAGGVQLTLSPFTAIALSKWS
ALSPTGRCNSFDANADGFVRGEGCGVVVLKRLADAVRDQDRVLAVVRGSA
TNSDGRSNGMTAPNALAQRDVITSALKLADVTPDSVNYVETHGTGTVLGD
PIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAGFIKAVLAV
QRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGL
SGTNAHVVVEQAPDTAVAAAGGMPYVSALNVSGKTAARVASAAAVLADWM
SGPGAAAPLADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPR
VGVVDCDQHAGGPGRVFVYSGQGSQWASMGQQLLANEPAFAKAVAELDPI
FVDQVGFSLQQTLIDGDEVVGIDRIQPVLVGMQLALTELWRSYGVIPDAV
IGHSMGEVSAAVVAGALTPEQGLRVITTRSRLMARLSGQGAMALLELDAD
AAEALIAGYPQVTLAVHASPRQTVIAGPPEQVDTVIAAVATQNRLARRVE
VDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADADYW
SANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVM
STMNRELDQTLYFHAQLAAVGVAASEHTTGRLVDLPPTPWHHQRFWVTDR
SAMSELAATHPLLGAHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIM
PAAGFAEIALAAASEALGTAADAVAPNIVINQFEVEQMLPLDGHTPLTTQ
LIRGGDSQIRVEIYSRTRGGEFCRHATAKVEQSPRECAHAHPEAQGPATG
TTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAETEISIPDEAPRH
PGYRLHPVVLDAALQSVGAAIPDGEIAGSAEASYLPVSFETIRVYRDIGR
HVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLP
LEQKIFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKS
MADDFVQQWRSPMRRVHTADIHDESAVLAAFAETAGDPEHPPVGVVVFVG
GASSRLDDELAAARDTVWSITTVVRAVVGTWHGRSPRLWLVTGGGLSVAD
DEPGTPAAASLKGLVRVLAFEHPDMRTTLVDLDITQDPLTALSAELRNAG
SGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVRQGASYVVTGGLGGL
GLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRGDVAS
PGVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKAT
GALRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRR
ASGLPAAVINWGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIR
TGVARLRADRALVAFPEIRSISYFTQVVEELDSAGDLGDWGGPDALADLD
PGEARRAVTERMCARIAAVMGYTDQSTVEPAVPLDKPLTELGLDSLMAVR
IRNGARADFGVEPPVALILQGASLHDLTADLMRQLGLNDPDPALNNADTI
RDRARQRAAARHGAAMRRRPKPEVQGG
>Rv2935 ppsE, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSE
MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQELRDAGV
SDKTLADPAYVRRAPLLDGIDEFDAGFFGFPPLAAQVLDPQHRLFLQCAW
HALEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFD
QFSLFLQNDKDFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLS
GECDMALAGGSSLCIPHRVGYFTSPGSMVSAVGHCRPFDVRADGTVFGSG
VGLVVLKPLAAAIDAGDRIHAVIRGSAINNDGSAKMGYAAPNPAAQADVI
AEAHAVSGIDSSTVSYVECHGTGTPLGDPIEIQGLRAAFEVSQTSRSAPC
VLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSPNPELRLDQ
SPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHA
EPAGPQVILLSAQTAAALGESRTALAAALETQDGPRLSDVAYTLARRRKH
NVTMAAVVHDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRV
VFLFPGQGAQHVGMAKGLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFD
GTATDLERIDRSQPALFTVEYALAKLVDTFGVRAGAYIGYSTGEYIAATL
AGVFDLQTAIKTVSLRARLMHESPPGAMVAVALGPDDVTQYLPPEVELSA
VNDPGNCVVAGPKDQIRALRQRLTEAGIPVRRVRATHAFHTSAMDPMLGQ
FQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFADE
LDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDD
RDTFLRALGELWSAGVEVDWTPRRPAVPHLVSLPGYPFARQRHWVEPNHT
VWAQAPGANNGSPAGTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVS
SVDRNANFFDLGGDSLMAISIAMAAANEGLTITPQDLYEYPTLASLTAAV
DASFASSGLAKPPEAQANPAVPPNVTYFLDRGLRDTGRCRVPLILRLDPK
IGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAPAEFTGLSNRSVP
NGVAAGSPEERAAVLGILAELLEDQTDPNAPLAAVHIAAAHGGPHYLCLA
IHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAAL
ATHPAALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLS
VEQTSELDDGRRRFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSV
LRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIG
YGLLRYVYAPTGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLP
VREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAEALERTFPLAL
SALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG
>Rv2928 tesA, PROBABLE THIOESTERASE TESA
MLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDYVAFSRE
FSADVKRIAVQYPGQHDRSGLPPLESIPTLADEIFAMMKPSARIDDPVAF
FGHSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSDRE
MLDLFTRMTGMNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSCPI
YAFIGDKDWIATQDDMDPWRDRTTEEFSIRVFPGDHFYLNDNLPELVSDI
EDKTLQWHDRA
>Rv0167 yrbE1A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE1A
MTTSTTLGGYVRDQLQTPLTLVGGFFRMCVLTGKALFRWPFQWREFILQC
WFIMRVGFLPTIMVSIPLTVLLIFTLNILLAQFGAADISGSGAAIGAVTQ
LGPLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV
LASMLVATLLNGLVITVGLVGGFLFGVYLQNVSGGAYLATLTLITGLPEV
VIATIKAATFGLIAGLVGCYRGLTVRGGSKGLGTAVNETVVLCVIALFAV
NVILTTIGVRFGTGR
>Rv0168 yrbE1B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE1B
MSTAAVLRARFPRAVANLRQYGGAAARGLDEAGQLTWFALTSIGQIAHAL
RYYRKETLRLIAQIGMGTGAMAVVGGTVAIVGFVTLSGSSLVAIQGFASL
GNIGVEAFTGFFAALINVRIAGPVVTGVALAATVGAGATAELGAMRISEE
IDALEVMGIKSISFLASTRIMAGLVVIIPLYALAMIMSFLSPQITTTVLY
GQSNGTYEHYFQTFLRPDDVFWSFLEALIITAIVMVSHCYYGYAAGGGPV
GVGEAVGRSMRFSLVSVQVVVLFAALALYGVDPNFNLTV
>Rv0587 yrbE2A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE2A
MTTHAVIITYLRDQTQPAVDAIGGFYRTCVLTGKALVRRPFHWREAIEQG
WFITSVSLLPTLAVSIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQ
LGPLTTVLVIAGAGATAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV
VAATIVAALLNGAVITIGLVGGFVFSVFIQHVSAGAYVGTLTLVTGLPEV
IISVVKSATFGLIAGLVGCYRGLTTKGGPKGVGTAVNETLVLCVIALFAT
NVVLTTIGVRFGTGH
>Rv0588 yrbE2B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE2B
MVESSTASAAAVLRARYPRTAASLDRYGGGTARRLERTGTFARFTRISVV
QIGWALRRYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAI
QGFASLGNIGVEAFTGFFAALANTRVAAPIVSGVALAATVGAGATAQLGA
MRISEEIDALEVMGIKSISFLVSTRILGGLVVIMPLYALALDMAFTSGQV
VTTVFYGQSNGTYEHYFRTFLRPEDVGWSVVEVVIIAVVVMITHCYYGYT
ASGGPVGVGQAVGRSMRFSLVSVVVVVLLAELALYGVDPNFNLTV
>Rv1964 yrbE3A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE3A
MVIVADKAAGRVADPVLRPVGALGDFFAMTLDTSVCMFKPPFAWREYLLQ
CWFVARVSTLPGVLMTIPWAVISGFLFNVLLTDIGAADFSGTGCAIFTVN
QSAPIVTVLVVAGAGATAMCADLGARTIREELDALRVMGINPIQALAAPR
VLAATTVSLALNSVVTATGLIGAFFCSVFLMHVSAGAWVTGLTTLTHTVD
VVISMIKATLFGLMAGLIACYKGMSVGGGPAGVGRAVNETVVFAFIVLFV
INIVVTAVGIPFMVS
>Rv1965 yrbE3B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE3B
MTAAKALVSEWNRMGSQMRFFVGTLAGIPDALMHYRGELLRVIAQMGLGT
GVLAVIGGTVAIVGFLAMTTGAIVAVQGYNQFASVGVEALTGFASAFFNT
REIQPGTVMVALAATVGAGTTAALGAMRINEEIDALEVIGIRSISYLAST
RVLAGVVVAVPLFCVGLMTAYLAARVGTTAIYGQGSGVYDHYFNTFLRPT
DVLWSSVEVVVVALMIMLVCTYYGYAAHGGPAGVGEAVGRAVRASMVVAS
IAILVMTLAIYGQSPNFHLAT
>Rv3501c yrbE4A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE4A
MIQQLAVPARAVGGFFEMSMDTARAAFRRPFQFREFLDQTWMVARVSLVP
TLLVSIPFTVLVAFTLNILLREIGAADLSGAGTAFGTITQLGPVVTVLVV
AGAGATAICADLGARTIREEIDAMRVLGIDPIQRLVVPRVLASTLVALLL
NGLVCAIGLSGGYAFSVFLQGVNPGAFINGLTVLTGLRELILAEIKALLF
GVMAGLVGCYRGLTVKGGPKGVGNAVNETVVYAFICLFVINVVMTAIGVR
ISAQ
>Rv3500c yrbE4B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE4B
MSYDVTIRFRRFFSRLQRPVDNFGEQALFYGETMRYVPNAITRYRKETVR
LVAEMTLGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGDIGIEALT
GFLSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAV
HSVSYLVSTRLIAGLVAIIPLYSLSVLAAFFAARFTTVFVNGQSAGLYDH
YFNTFLIPSDLLWSFMQAIAMSIAVMLVHTYYGYNASGGSVGVGVAVGQA
VRTSLIVVVVITLFISLAVYGASGNFNLSG