GenColors

Gene list

Applied filters:

COG category: Amino acid transport and metabolism
Gene type: CDS
Genomic element: chromosome

Number of genes found: 252

# Mycobacterium tuberculosis H37Rv, H37Rv

>Rv1673c CONSERVED HYPOTHETICAL PROTEIN
MTITDPAVSAHADATIGLFEITDHITIDSTQGAHTVEMWCPVIGDGAFQR
VLDVEVTSEDPYDLTREPEFGNLMLYSRLRLATAASWSIRYVVERRAIGH
APDPARARPLATAQLFSRALIPEAHVDVDERTRTLAQDVVGPETNPLEQA
RRIYDYVTGAMDYDATKQSFLGSTEHALTCSVGNCNDIHALFVSLCRSVD
IPARFVLGQALELPQPGAQDCEVCGYHCWAEFFVAGLGWLPADASCATKY
GTHGLFANLQANHIAWSIGRDILLAPPQRAGRSLFFAGPYAEIDGETHPA
QRQIRFTAMT
>Rv3856c CONSERVED HYPOTHETICAL PROTEIN
MDPVTALRQIAYYKDRNRHDPRRVMAYRNAADIIEGLDDAARQRHGQANS
WQSLAGIGPKTAKVIAQAWSGREPDLLAELRADAEDLGGGAIRAALRGDL
HLHSNWSDGSAPIEEMMATAAALGHQYCALTDHSPRLTIANGLSPDRLRK
QLDVIDELREKFAPLRILTGIEVDILEDGSLDQEPEMLDRLDIVVASVHS
KLSMDSAAMTRRMVRAVANGHTDVLGHCTGRLIAGNRGIRPESKFDAEAV
FTACREHGTAVEINSRPERRDPPTRLLHLARDIGCVFSIDTDAHAPGQLD
FLGYGAQRALDAEVPADRIVNTWPADTLLAWTGSH
>Rv2209 Probable conserved integral membrane protein
MPASRLVRQVSAPRNLFGRLVAQGGFYTAGLQLGSGAVVLPVICAHQGLT
WAAGLLYPAFCIGAILGNSLSPLILQRAGQLRHLLMAAISATAAALVVCN
AAVPWTGVGVAAVFLATTGAGGVVTGVSSVAYTDMISSMLPAVRRGELLL
TQGAAGSVLATGVTLVIVPMLAHGNEMARYHDLLWLGAAGLVCSGIAALF
VGPMRSVSVTTATRMPLREIYWMGFAIARSQPWFRRYMTTYLLFVPISLG
TTFFSLRAAQSNGSLHVLVILSSIGLVVGSMLWRQINRLFGVRGLLLGSA
LLNAAAALLCMVAESCGQWVHAWAYGTAFLLATVAAQTVVAASISWISVL
APERYRATLICVGSTLAAVEATVLGVALGGIAQKHATIWPVVVVLTLAVI
AAVASLRAPTRIGVTADTSPQAATLQAYRPATPNPIHSDERSTPPDHLSV
RRGQLRHVWDSRRPAPPLNRPSCRRAARRPAPGKPAAALPQPRHPAVGVR
EGAPLDAGQRIA
>Rv2600 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MVATVLYFLVGAAVLVAGFLMVNLLTPGDLRRLVFIDRRPNAVVLAATMY
VALAIVTIAAIYASSNQLAQGLIGVAVYGIVGVALQGVALVILEIAVPGR
FREHIDAPALHPAVFATAVMLLAVAGVIAAALS
>Rv0394c POSSIBLE SECRETED PROTEIN
MTEPRPVFAVVISAGLSAIPMVGGPLQTVFDAIEERTRHRAETTTREICE
SVGGADTVLSRIDKNPELEPLLSQAIEAATRTSMEAKRRLLAQAAAAALE
DDQKVEPASLIVATLSQLEPVHIHALVRLAKAAKSSPDQDEIQRREVMRA
ASKVEPVPVLAALIQTGVAIATTTVWHGNGTGTPAEESGHILIHDVSDFG
HRLLAYLRAADAGAELLILPSGGSAPTGDHPTPHPSTSR
>Rv1258c PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MRNSNRGPAFLILFATLMAAAGDGVSIVAFPWLVLQREGSAGQASIVASA
TMLPLLFATLVAGTAVDYFGRRRVSMVADALSGAAVAGVPLVAWGYGGDA
VNVLVLAVLAALAAAFGPAGMTARDSMLPEAAARAGWSLDRINGAYEAIL
NLAFIVGPAIGGLMIATVGGITTMWITATAFGLSILAIAALQLEGAGKPH
HTSRPQGLVSGIAEGLRFVWNLRVLRTLGMIDLTVTALYLPMESVLFPKY
FTDHQQPVQLGWALMAIAGGGLVGALGYAVLAIRVPRRVTMSTAVLTLGL
ASMVIAFLPPLPVIMVLCAVVGLVYGPIQPIYNYVIQTRAAQHLRGRVVG
VMTSLAYAAGPLGLLLAGPLTDAAGLHATFLALALPIVCTGLVAIRLPAL
RELDLAPQADIDRPVGSAQ
>Rv3700c CONSERVED HYPOTHETICAL PROTEIN
MRRSGANSPAGDSLADRWRAARPPVAGLHLDSAACSRQSFAALDAAAQHA
RHEAEVGGYVAAEAAAAVLDAGRAAVAALSGLPDAEVVFTTGSLHALDLL
LGSWPGENRTLACLPGEYGPNLAVMAAHGFDVRPLPTLQDGRVALDDAAF
MLADDPPDLVHLTVVASHRGVAQPLAMVAQLCTELKLPLVVDAAQGLGHV
DCAVGADVTYASSRKWIAGPRGVGVLAVRPELMERLRARLPAPDWMPPLT
VAQQLGFGEANVAARVGFSVALGEHLACGPQAIRARLAELGDIARTVLAD
VSGWRVVEAVDEPSAITTLAPIDGADPAAVRAWLLSQRRIVTTYAGVERA
PLELPAPVLRISPHVDNTADDLDAFAEALVAATAATSGER
>Rv1188 PROBABLE PROLINE DEHYDROGENASE
MAGWFAHTLRPAMLAAGRSDRLGRIVERSPLTRGVVRRFVPGDTLDDVVD
IVTALRDSGRYLSIDYLGENVTDADDAAAAVRAYLGLLDVLGRRGDIACD
GVRPLEVSLKLSALGQALDRDGQKIALDNARAICERAERVGAWVTVDAED
HTTTDSTLSISGDLRVDFPWLGTVVQAYLRRTLADCAELAAVGARVRLCK
GAYDEPASVAYRDAAQVTDSYLRCLRVLTAGRGYPMVATHDPVIIAAVPG
ITRESGRSQGDFEYQMLYGVRDDEQRRLTGAGNHVRVYVPFGTRWYGYFL
RRLAERPANLAFFLRALTDRRRARGCAER
>Rv3722c CONSERVED HYPOTHETICAL PROTEIN
MSFDSLSPQELAALHARHQQDYAALQGMKLALDLTRGKPSAEQLDLSNQL
LSLPGDDYRDPEGTDTRNYGGQHGLPGLRAIFAELLGIAVPNLIAGNNSS
LELMHDIVAFSMLYGGVDSPRPWIQEQDGIKFLCPVPGYDRHFAITETMG
IEMIPIPMLQDGPDVDLIEELVAVDPAIKGMWTVPVFGNPSGVTYSWETV
RRLVQMRTAAPDFRLFWDNAYAVHTLTLDFPRQVDVLGLAAKAGNPNRPY
VFASTSKITFAGGGVSFFGGSLGNIAWYLQYAGKKSIGPDKVNQLRHLRF
FGDADGVRLHMLRHQQILAPKFALVAEVLDQRLSESKIASWTEPKGGYFI
SLDVLPGTARRTVALAKDVGIAVTEAGASFPYRKDPDDKNIRIAPSFPSV
PDLRNAVDGLATCALLAATETLLNQGLASSAPNVR
>Rv2566 LONG CONSERVED HYPOTHETICAL PROTEIN
MPLRPTQVSGTGRTRCAGRSGVISSAAMSIKVALEHRTSYTFDRLVRVYP
HIVRLRPAPHSRTSIEAYSLRIEPADHFINWQQDALGNFLARLVFPNPMR
QLRITVGLIADLKVINPFDFFIEDWAEIWPCAGMAYPKALADDLRPYLRP
VDEDGDGSGPGELTQAWVRNFTVPDGTRTIDFLVALNRAINADVGYCVRM
EPGVQTPDFTLRTGVGSCRDSAWLLVSILRQFGLAARFVSGYLVQLASDI
EALDGPSGPAADFTDLHAWAEAYIPGAGWIGLDPTSGLLAGEGHIPLAAT
PHPASAAPISGGTDVCDTVLEFSNTVTRVHEDPRVTLPYTDESWKTICEV
GQRVDERLAAADVRLTVGGEPTFVSVDNQVAEEWRTAADGPHKRERASDL
AARLKAVWAPQGLIHRGQGRWYPGEPLPRWQIALYWRTDGRPLWTNDALL
ADPWGAPPADPVDDDAAYRVLAGIADGLGLPISQVRPAYEDPLSRLAAAV
RMPAGDPVESGDDLGCDTNPDTPTGRAALLARLDEAITSPAAYVLPLHRR
DDGQGWASANWRLRRGRIVLLEGDSPAGLRLPLDSISWRPPRASFDADPV
AVRSTLPAELHTDRAVVEDPETAPTTALVAEVRGGLVHIFLPPTDALEHF
IDLVARVEAAATTANCPVVIEGYGPPPDPRLTSTTITPDPGVIEVNIAPT
ASFAEQRQQLETLYQQARLARLTTEAFDVDGTHGGTGGGNHITLGGVTPA
DSPLLRRPDLLVSLLTYWQRHPSLSYLFAGRFVGTTSQAPRVDEGRAEAL
YELEIAFAEILRLSPSSGGGRPQPWVTDRALRHLLTDITGNTHRAEFCID
KLYSPDSARGRLGLLELRGFEMPPHLHMAMVQSLLVRSLVAWFWDQPLRA
PLIRHGANLHGRYLLPHFLIHDIADVAADLRAHGIAFETSWLDPFTEFRF
PRIGTAVFDGIEIELRGAIEPWHTLGEEATAAGTARYVDSSVERIQVRII
GADRHRYVVTCNGYPMPLLATDNPDIHVGGVRFKAWQPPSALHPTITVDG
PLRFELIDIATATSCGGCTYHVAHPGGRAYDEPPVNAVEAEARRARRFEA
TGFTPGKLDLSDIREKQARISTDIGAPGILDLRRVRTVQQ
>Rv0264c CONSERVED HYPOTHETICAL PROTEIN
MDAALACTVLDYGDHALMLQCDSTADAMAWTDALRAAALPGVVDIVAASR
TVLVKLDAPRYQGVTRQRLRRLRVTPEAVAAADHRCDLVIDVVYDGPDLA
EVARCTGLTTAAVINAHTATGWRAGFSGSAPGFAYLIDGDPSLRVPRRPE
RRTSMPPGSVALADGFSAIYPSQAPSDWQIIGHTDAVLWDVDRPQPALLT
PGMWVQFRAA
>Rv1250 PROBABLE DRUG-TRANSPORT INTEGRAL MEMBRANE PROTEIN
MTTAIRRAAGSSYFRNPWPALWAMMVGFFMIMLDSTVVAIANPTIMAQLR
IGYATVVWVTSAYLLAYAVPMLVAGRLGDRFGPKNLYLIGLGVFTVASLG
CGLSSGAGMLIAARVVQGVGAGLLTPQTLSTITRIFPAHRRGVALGAWGT
VASVASLVGPLAGGALVDSMGWEWIFFVNVPVGVIGLILAAYLIPALPHH
PHRFDWFGVGLSGAGMFLIVFGLQQGQSANWQPWIWAVIVGGIGFMSLFV
YWQARNAREPLIPLEVFNDRNFSLSNLRIAIIAFAGTGMMLPVTFYAQAV
CGLSPTHTAVLFAPTAIVGGVLAPFVGMIIDRSHPLCVLGFGFSVLAIAM
TWLLCEMAPGTPIWRLVLPFIALGVAGAFVWSPLTVTATRNLRPHLAGAS
SGVFNAVRQLGAVLGSASMAAFMTSRIAAEMPGGVDALTGPAGQDATVLQ
LPEFVREPFAAAMSQSMLLPAFVALFGIVAALFLVDFTGAAVAKEPLPES
DGDADDDDYVEYILRREPEEDCDTQPLRASRPAAAAASRSGAGGPLAVSW
STSAQGMPPGPPGRRAWQADTESTAPSAL
>Rv1676 HYPOTHETICAL PROTEIN
MACPEWEISRSKRTRKPVLRPRHSVSTLTNRFLAEFCHRYGIGVPTRLAR
GATVPTRRLQDINDQPVDVPAATGRTHLQFRRFAACPICHLHLRSFANRH
QEVADSGITEVVFFHSAADALRGYQSLLPFAVIADPDRVQYREFGVEKSL
GAITHPRALWAAVRGSAAMLHRNDPERAGVGFGDGTTHLGLPADFLLDAD
GTVAAVHYGRHADDQWSVDQLIDINRSLGGKGTQ
>Rv2522c CONSERVED HYPOTHETICAL PROTEIN
MSASRRRIASKSGFSCDSASARELVERVREVLPSVRCDLEELVRIESVWA
DPDRRDEVHRSARAVADLLSQAGFDDVRIVSERGAPAVIARYPAPPGAPT
VLLYAHHDVQPEGDRGQWVSPPFEPTERGGRLYGRGTADDKAGIATHVAA
FWAHGGRPPVGVTVFVEGEEESGSPSLGRLLAAHRDALAADVIVIADSDN
WSTDIPALTVSLRGMADCVVEVATLDHGLHSGLWGGVVPDALTVLVRLLA
SLHDDDGNVAVAGMHESTAARVDYPAGRVRAESGLLDGVSEIGTGSVPQR
LWAKPAITVIGIDTTSVAAASNTLIPRARAKISIRVAPGGDATAHLDAVE
AHLRRHAPWGAQVTVTRGEVGQPYAIEASGPVYDAARSAFRQAWGADPID
MGMGGSIPFIAEFAAAFPQATILVTGVEDPGTQAHSVNESLHLGVLERAA
TAEALLLAKLAAIPTGRAEA
>Rv1178 PROBABLE AMINOTRANSFERASE
MSASLPVFPWDTLADAKALAGAHPDGIVDLSVGTPVDPVAPLIQEALAAA
SAAPGYPATAGTARLRESVVAALARRYGITRLTEAAVLPVIGTKELIAWL
PTLLGLGGADLVVVPELAYPTYDVGARLAGTRVLRADALTQLGPQSPALL
YLNSPSNPTGRVLGVDHLRKVVEWARGRGVLVVSDECYLGLGWDAEPVSV
LHPSVCDGDHTGLLAVHSLSKSSSLAGYRAGFVVGDLEIVAELLAVRKHA
GMMVPAPVQAAMVAALDDDAHERQQRERYAQRRAALLPALGSAGFAVDYS
DAGLYLWATRGEPCRDSAAWLAQRGILVAPGDFYGPGGAQHVRVALTATD
ERVAAAVGRLTC
>Rv0854 CONSERVED HYPOTHETICAL PROTEIN
MAIKESRDIVIEASPEEILDVIADFEAMTEWSPAHQSVEILETGDDGRPS
KVKMKVKTAGITDEQVVAYSWTDRSVRWTLVSSTQQRSQDGKYELTPKGD
NTLVQFEITVDPQVPLPGFVLKRAIKGTIDTATEALRSQVLKVKKGQ
>Rv0263c CONSERVED HYPOTHETICAL PROTEIN
MTTLEILRSGPLALVEDLGRAGLAHLGVGRSGAADRRSHTLANRLVANPD
DWATVEVTFGGFSARVRGGDVDIAVTGADTDPTVNGIMVGTNSIHHVRDG
QVISLGTPRAGLRTYLAVRGGVCVEPVLGSRSYDVMSAIGPSPLRAGDVL
PVGEHTDDYPELDQAPVAAIEEHLVELRVVPGPRDDWLVDPDALVHTIWM
ASNRSDRVGMRLQGRPLQHRWPDRQLPGEGVTRGAIQVPPNGLPVILGPD
HPITGSYPVVGVITDEDIDKVAQIRPGQYVRLHWARPRSRLPGQGVTQAW
>Rv1322A CONSERVED HYPOTHETICAL PROTEIN
MMTTDQVHARHMLATSLVTGLDHVGIAVADLDVAIEWYHDHLGMILVHEE
INDDQGIREALLAVPGSAAQIQLMAPLDESSVIAKFLDKRGPGIQQLACR
VSDLDAMCRRLRSQGVRLVYETARRGTANSRINFIHPKDAGGVLIELVEP
AP
>Rv2690c PROBABLE CONSERVED INTEGRAL MEMBRANE ALANINE AND VALINE AND LEUCINE RICH PROTEIN
MSKLSTAARRLLIGRPFRSDRLSHTLLPKRIALPVFASDAMSSIAYAPEE
IFLVLSVAGLAAYSMAPLIGLAVAAVLLVVVSSYRQNVHAYPSGGGDYEV
VTTNLGATGGLVVASALMVDYVLTVAVSISSAASNIGSVSPFVYEHKVLF
AVGAIVLIMAMNLRGVRESGLAFAIPTYAFIAGIGTMLVWGLFRIFVLGN
PVRAESAAFEMHAEHGQIVGFALVFLVARSFSSGCAALTGVEAISNGVPA
FQKPKSRNAATTLLMLGIIAVSMFMGMIVLAVETGVQVVDDPDTQLTGAP
PGYQQKTLVAQLAQAVFGGFYLGFLLIAAVTALILVLAANTAFNGFPVLG
SVLAQHSYLPRQLHTRGDRLAFSNGILFLAAAAIGAVVAFRAELTALIQL
YIVGVFISFTMSQVGMVRHWTRLLSAETDPRARRAMLRSRAVNTVGFVST
GTVLLIVLVTKFLAGAWIAIVAMGGFFMMMKLIHRHYDAVNRELAEQAEE
AEITLPSRNHAVVLVSKLHLPTLRALTYARATRPDVLEAVTVNVDDAETR
ELVRQWQDSDVSVPLKVIASPYREITRPVLDYVKRVSKESPRTVVTVFIP
EYVVGRWWEQLLHNQSALRLKGRLLFMPGVMVTSVPWQLTSSERIKTLQP
HAAPGDT
>Rv2323c CONSERVED HYPOTHETICAL PROTEIN
MENTQRPSFDCEIRAKYRWFMTDSYVAAARLGSPARRTPRTRRYAMTPPA
FFAVAYAINPWMDVTAPVDVQVAQAQWEHLHQTYLRLGHSVDLIEPISGL
PDMVYTANGGFIAHDIAVVARFRFPERAGESRAYASWMSSVGYRPVTTRH
VNEGQGDLLMVGERVLAGYGFRTDQRAHAEIAAVLGLPVVSLELVDPRFY
HLDTALAVLDDHTIAYYPPAFSTAAQEQLSALFPDAIVVGSADAFVFGLN
AVSDGLNVVLPVAAMGFAAQLRAAGFEPVGVDLSELLKGGGSVKCCTLEI
HP
>Rv3778c POSSIBLE AMINOTRANSFERASE
MAYDVARVRGLHPSLGDGWVHFDAPAGMLIPDSVATTVSTAFRRSGASTV
GAHPSARRSAAVLDAAREAVADLVNADPGGVVLGADRAVLLSLLAEASSS
RAGLGYEVIVSRLDDEANIAPWLRAAHRYGAKVKWAEVDIETGELPTWQW
ESLISKSTRLVAVNSASGTLGGVTDLRAMTKLVHDVGALVVVDHSAAAPY
RLLDIRETDADVVTVNAHAWGGPPIGAMVFRDPSVMNSFGSVSTNPYATG
PARLEIGVHQFGLLAGVVASIEYLAALDESARGSRRERLAVSMQSADAYL
NRVFDYLMVSLRSLPLVMLIGRPEAQIPVVSFAVHKVPADRVVQRLADNG
ILAIANTGSRVLDVLGVNDVGGAVTVGLAHYSTMAEVDQLVRALASLG
>Rv2000 HYPOTHETICAL PROTEIN
MRPGFVGLGFGQWPVYVVRWPKLHLTPRQRKRVLHRRRLLTDRPISLSQI
PIRTGGPMNDPWPRPTQGPAKTIETDYLVIGAGAMGMAFTDTLITESGAR
VVMIDRACQPGGHWTTAYPFVRLHQPSAYYGVNSRALGNNTIDLVGWNQG
LNELAPVGEICAYFDAVLQQQLLPTGRVDYFPMSEYLGDGRFRTLAGTEY
VVTVNRRIVDATYLRAVVPSMRPAPYSVAPGVDCVAPNELPKLGTRDRYV
VVGAGKTGMDVCLWLLRNDVCPDKLTWIMPRDSWLIDRATLQPGPTFVRQ
FRESYGATLEAIGAATSTDDLFDRLETAGTLLRIDPSVRPSMYRCATVSH
LELEQLRRIRDIVRMGHVQRIEPTTIVLDGGSVPATPTALYIDCTADGAP
QRPAKPVFDADHLTLQAVRGCQQVFSAAFIAHVEFAYEDDAVKNELCTPI
PHPDCDLDWMRLMHSDLGNFQRWLNDPDLTDWLSSARLNLLADLLPPLSH
KPRVRERVVSMFQKRLGTAGDQLAKLLDAATATTEQR
>Rv1333 PROBABLE HYDROLASE
MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGAVDCRGG
APGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVA
MDSGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVG
VGARAGALKGGVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADL
VGEFALRAPPAEQIAALAQLSSPLGAFNTPFNTTIGVIACDAALSPAACR
RIAIAAHDGLARTIRPAHTPLDGDTVFALATGAVAVPPEAGVPAALSPET
QLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTYRDMFPGAFGS
>Rv3661 CONSERVED HYPOTHETICAL PROTEIN
MTVSDSPAQRQTPPQTPGGTAPRARTAAFFDLDKTIIAKSSTLAFSKPFF
AQGLLNRRAVLKSSYAQFIFLLSGADHDQMDRMRTHLTNMCAGWDVAQVR
SIVNETLHDIVTPLVFAEAADLIAAHKLCGRDVVVVSASGEEIVGPIARA
LGATHAMATRMIVEDGKYTGEVAFYCYGEGKAQAIRELAASEGYPLEHCY
AYSDSITDLPMLEAVGHASVVNPDRGLRKEASVRGWPVLSFSRPVSLRDR
IPAPSAAAIATTAAVGISALAAGAVTYALLRRFAFQP
>Rv1413 CONSERVED HYPOTHETICAL PROTEIN
MATIGEVEVFVDHGADDVFITYPLWIGTRQADRLRQLADRARIAVGAGTA
EGASNTGARLADAAGAIDVLIEIDSGHHRSGVRAEQVLEVAHAVGEAGLH
LVGVFTFPGHSYAPGKPGEAGEQERRALNDAANALVAVGFPISCRSGGST
PTALLTAADGASETSRRLCAR
>Rv1885c CONSERVED HYPOTHETICAL PROTEIN
MLTRPREIYLATAVSIGILLSLIAPLGPPLARADGTSQLAELVDAAAERL
EVADPVAAFKWRAQLPIEDSGRVEQQLAKLGEDARSQHIDPDYVTRVFDD
QIRATEAIEYSRFSDWKLNPASAPPEPPDLSASRSAIDSLNNRMLSQIWS
HWSLLSAPSCAAQLDRAKRDIVRSRHLDSLYQRALTTATQSYCQALPPA
>Rv2531c PROBABLE AMINO ACID DECARBOXYLASE
MNPNSVRPRRLHVSALAAVANPSYTRLDTWNLLDDACRHLAEVDLAGLDT
THDVARAKRLMDRIGAYERYWLYPGAQNLATFRAHLDSHSTVRLTEEVSL
AVRLLSEYGDRTALFDTSASLAEQELVAQAKQQQFYTVLLADDSPATAPD
SLAECLRQLRNPADEVQFELLVVASIEDAITAVALNGEIQAAIIRHDLPL
RSRDRVPLMTTLLGTDGDEAVANETHDWVECAEWIRELRPHIDLYLLTDE
SIAAETQDEPDVYDRTFYRLNDVTDLHSTVLAGLRNRYATPFFDALRAYA
AAPVGQFHALPVARGASIFNSKSLHDMGEFYGRNIFMAETSTTSGGLDSL
LDPHGNIKTAMDKAAVTWNANQTYFVTNGTSTANKIVVQALTRPGDIVLI
DRNCHKSHHYGLVLAGAYPMYLDAYPLPQYAIYGAVPLRTIKQALLDLEA
AGQLHRVRMLLLTNCTFDGVVYNPRRVMEEVLAIKPDICFLWDEAWYAFA
TAVPWARQRTAMIAAERLEQMLSTAEYAEEYRNWCASMDGVDRSEWVDHR
LLPDPNRARVRVYATHSTHKSLSALRQASMIHVRDQDFKALTRDAFGEAF
LTHTSTSPNQQLLASLDLARRQVDIEGFELVRHVYNMALVFRHRVRKDRL
ISKWFRILDESDLVPDAFRSSTVSSYRQVRQGALADWNEAWRSDQFVLDP
TRLTLFIGATGMNGYDFREKILMERFGIQINKTSINSVLLIFTIGVTWSS
VHYLLDVLRRVAIDLDRSQKAASGADLALHRRHVEEITQDLPHLPDFSEF
DLAFRPDDASSFGDMRSAFYAGYEEADREYVQIGLAGRRLAEGKTLVSTT
FVVPYPPGFPVLVPGQLVSKEIIYFLAQLDVKEIHGYNPDLGLSVFTQAA
LARMEAARNAVATVGAALPAFEVPRDASALNGTVNGDSVLQGVAEDA
>Rv1490 PROBABLE MEMBRANE PROTEIN
MSQCFAVKGIGGADQATLGSAEILVKYAQLADKRARVYVLVSTWLVVWGI
WHVYFVEAVFPNAILWLHYYAASYEFGFVRRGLGGELIRMLTGDHFFAGA
YTVLWTSITVWLIALAVVVWLILSTGNRSERRIMLALLVPVLPFAFSYAI
YNPHPELFGMTALVAFSIFLTRAHTSRTRVILSTLYGLTMAVLALIHEAI
PLEFALGAVLAIIVLSKNATGATRRICTALAIGPGTVSVLLLAVVGRRDI
ADQLCAHIPHGMVENPWAVATTPQRVLDYIFGRVESHADYHDWVCEHVTP
WFNLDWITSAKLVAVVGFRALFGAFLLGLLFFVATTSMIRYVSAVPVRTF
FAELRGNLALPVLASALLVPLFITAVDWTRWWVMITLDVAIVYILYAIDR
PEIEQPPSRRNVQVFVCVVLVLAVIPTGSANNIGR
>Rv2294 Probable aminotransferase
MIPNPLEELTLEQLRSQRTSMKWRAHPADVLPLWVAEMDVKLPPTVADAL
RRAIDDGDTGYPYGTEYAEAVREFACQRWQWHDLEVSRTAIVPDVMLGIV
EVLRLITDRGDPVIVNSPVYAPFYAFVSHDGRRVIPAPLRGDGRIDLDAL
QEAFSSARASSGSSGNVAYLLCNPHNPTGSVHTADELRGIAERAQRFGVR
VVSDEIHAPLIPSGARFTPYLSVPGAENAFALMSASKAWNLGGLKAALAI
AGREAAADLARMPEEVGHGPSHLGVIAHTAAFRTGGNWLDALLRGLDHNR
TLLGALVDEHLPGVQYRWPQGTYLAWLDCRELGFDDAASDEMTEGLAVVS
DLSGPARWFLDHARVALSSGHVFGIGGAGHVRINFATSRAILIEAVSRMS
RSLLERR
>Rv1342c CONSERVED MEMBRANE PROTEIN
MTAPETPAAQHAEPAIAVERIRTALLGYRIMAWTTGLWLIALCYEIVVRY
VVKVDNPPTWIGVVHGWVYFTYLLLTLNLAVKVRWPLGKTAGVLLAGTIP
LLGIVVEHFQTKEIKARFGL
>Rv0697 PROBABLE DEHYDROGENASE
MTAAVRHSDVLVVGAGSAGSVVAERLSMDSSCVVTVLEAGPGLADPGLLA
QTANGLQLPIGAGSPLVERYRTRLTDRPVRHLPIVRGATVGGSGAINGGY
FCRGLPSDFDRASIPGWAWSDVLEHFRAIETDLDFETPVHGRSGPIPVRR
THEMTGITESFMAAAEDAGFAWIADLNDVGPEMPSGVGAVPLNIVNGVRT
SSAVGYLMPALGRPNLTLLARTRAVRLRFSATTAVGVDAIGPGGPVSLSA
DRIVLCAGAIQSAHLLMLSGVGEEEVLRSAGVKVLMALPVGMGCSDHPEW
VMPTNWAVAVDRPVLEVLLSTHDGIEIRPYTGGFVAMTGDGTAGHRDWPH
IGVALMQPRARGRITLVSSDPQIPVRIEHRYDSEPADVAALRQGSALAHE
LCGAATRIGPAVWATSQHLCGSAPMGTDDDPRAVVDPRCRVRGIENLWVI
DGSVLPSITSRGPHATIVMLGHRAAEFVQ
>Rv1200 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MKRVALACLVGSAIEFYDFLIYGTAAALVFPTVFFPHLDPTVAAVASMGT
FAVAFLSRPFGAAVFGYFGDRLGRKKTLVATLLIMGLATVTVGLVPTTVA
IGAAAPLILTTMRLLQGFAVGGEWAGSALLSAEYAPASKRGWYGMFTVVG
GGIALVLTSLTFLGVNYTIGESSPTFMQWGWRIPFLVSAALIAVALYVRF
NIDETPVFARERADEKTRLGPAETPIAQVLRRQRREIVLAAGSAVCCFGF
VYLASTYLASYAQTRLGYSRGSILFDSVLGGLLCIVFTALSSALCDQLGR
RRVLLAGWAVALPWSLLVMPLIDSGSPSLFAVAVVGMYAIGGFGFGPTAS
FIPELFATSYRYTGSALAANLAGVAGGALPPVIAGALVATYGSWAIGVML
AILALISLVCTYRLPETAGSALVSR
>Rv2515c CONSERVED HYPOTHETICAL PROTEIN
MGIGHPMWVGWCIIIAMRSIPASVESSVLRWARESCGLTEVAAARKLGLP
DDRVAAWEVGEVVPTIAQLRKAAEVYKRSLAVFFLSEPPEGFDTLRDFRR
LDGAASGQWTPGLHEEFRRAHTQRDFALELADAEDREIPGAWRLPLSGDE
ADADIAARIRKALIEVSPLPIPVASVDPYEHLNAWVSAIETSGVLVLATR
GGKVAIDEMRGMCLYFDELPVIVLNGSDHPRPRLFSLLHEFVHVVLHTEG
LCDVIADAHPSTQDRSLEARCNAIAAAVLMPADVVRARPEVIVRSETPSS
WDYESLRPVAAHFGVSAEAFLRRLSTLGIVPVEVYRQRRAEFIAAHEDEA
ERARSAGGGNWYRNTVRDLGKGYVRAVTDAHRRRVIDSNTAAIYLDAKVS
QIPKLAESAELRSVV
>Rv3726 POSSIBLE DEHYDROGENASE
MKAVTCTNAKLEVVDRPSPAPAKGQLLLDVLRCGICGSDLHARLHCDELA
DVMAESGYHAFMRSNQQVVFGHEFCGEVVDYGPGTRRTPRRGTPVVAMPL
LRRGNKEVHGIGLSTMAPGAYAERLVVEQSLTFPVPNGLAPEIAALTEPM
AVGWHAVRRGEVGKGDVAIVIGCGPIGLAVICMLKSRGVHTVIASDFSPG
RRALATACGADSVVDPVQDSPYAVAAGLGQGNRHLQSILDAFDLAVGTVE
RLQRLRLPWWHLWRAAEAAGAATPKRPVIFECVGVPGIIDGIIASAPLFS
RVVVVGVCMGSDHIRPAMAINKEINLRFVLGYTPLEFRDTLHMLADGKVN
AAPLITGTVGLPGVAAAFDALGDPEAHAKIMIDPKSNAASPQPFRVE
>Rv0787 HYPOTHETICAL PROTEIN
MHRPPWLAQLRRRLRIGVQLGSRVVLEQGRQPRDVYVIGVLVGDQDRGQT
GDSLEAVRESTGIEEQAGLTELSEEAGMAEMRELHVYDCALMGAFPMRLI
LATMLVAGRLLATLMAAPSAQAEPETCPPICDQIPATAWISTHAVPLNSQ
YRWPAMAGAAVAVTRATPRFGFEQVCATPAFPHDSRDWAVAGRVTVVHPD
GQWQLQAQVLHWRGDTARGGQIAASVFGTAVAALRACQLGAPLQSPSVTD
DEPTRMAAVISGPVIMYTYLVAHVSSSTISELTLWSSGPPQVPWPTVADS
AVLDALTAPLCEAYIGSCP
>Rv0537c PROBABLE INTEGRAL MEMBRANE PROTEIN
MGLSSDDTRRREVVRDLAAGALLIGALFFPWNLYFGFRIPDSSKTVFGLL
LAVTSLSLASLAVTFAGRRSQLRLGLNVPYLLLVLAFVVFDAIQTIRLGG
TVHVPGGVGPGGWLGITGALLSAQPALTGATTDEGSHSRWLRATQFLGYA
SMLGAALSTGFNLSWRVRYALEPAAGASGFGKQNLAVIDTAVVYGVVALA
AVLVASRWLLRPTAAEALSTVALGGSTLIAGSIVWSLPIGREIDAFHGIA
QNTSTAGVGYEGYLVWAAAAAMCAPLTLFRSPNAPPIDKTVWRAASRNGL
LLIAVWCLGSVAMRLTDLVVAVLLNYPFSRYDSMALAAFDLATAVLAIWL
RFNMATEALPARLISSLCGLLCTFTVSRVIVGVVLAPRFQASSGGSAHPV
YGNDLAQQITSTFDVVLCGLALSILAAAIVIGRLRQLPQPPHTPALSRPA
GSPRIFRSAGSTHPVRPKIYRPPDHSS
>Rv3254 CONSERVED HYPOTHETICAL PROTEIN
MVIGASIAGLCAARVLSDFYSTVTVFERDELPEAPANRATVPQDRHLHML
MARGAQEFDSLFPGLLHDMVAAGVPMLENRPDCIYLGAAGHVLGTGHTLR
KEFTAYVPSRPHLEWQLRRRVLQLSNVQIVRRLVTEPQFERRQQRVVGVL
LDSPGSGQDREREEFIAADLVVDAAGRGTRLPVWLTQWGYRRPAEDTVDI
GISYASHQFRIPDGLIAEKVVVAGASHDQSLGLGMLCYEDGTWVLTTFGV
ADAKPPPTFDEMRALADKLLPARFTAALAQAQPIGCPAFHAFPASRWRRY
DKLERFPRGIVPFGDAVASFNPTFGQGMTMTSLQAGHLRRALKARNSAMK
GDLAAELNRATAKTTYPVWMMNAIGDISFHHATAEPLPRWWRPAGSLFDQ
FLGAAETDPVLAEWFLRRFSLLDSLYMVPSVPIIGRAIAHNLRLWLKEQR
ERRQPVTTRRSP
>Rv0849 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MGARAIFRGFNRPSRVLMINQFGINIGFYMLMPYLADYLAGPLGLAAWAV
GLVMGVRNFSQQGMFFVGGTLADRFGYKPLIIAGCLIRTGGFALLVVAQS
LPSVLIAAAATGFAGALFNPAVRGYLAAEAGERKIEAFAMFNVFYQSGIL
LGPLVGLVLLALDFRITVLAAAGVFGLLTVAQLVALPQHRADSEREKTSI
LQDWRVVVRNRPFLTLAAAMTGCYALSFQIYLALPMQASILMPRNQYLLI
AAMFAVSGLVAVGGQLRITRWFAVRWGAERSLVVGATILAASFIPVAVIP
NGQRFGVAVAVMALVLSASLLAVASAALFPFEMRAVVALSGDRLVATHYG
FYSTIVGVGVLVGNLAIGSLMSAARRLNTDEIVWGGLILVGIVAVAGLRR
LDTFTSGSQNMTGRWAAPR
>Rv2585c POSSIBLE CONSERVED LIPOPROTEIN
MAPRRRRHTRIAGLRVVGTATLVAATTLTACSGSAAAQIDYVVDGALVTY
NTNTVIGAASAGAQAFARTLTGFGYHGPDGQVVADRDFGTVSVVEGSPLI
LDYQISDDAVYSDGRPVTCDDLVLAWAAQSGRFPGFDAATQAGYVDIANI
ECTAGQKKARVSFIPDRSVVDHSQLFTATSLMPSHVIADQLHIDVTAALL
SNNVSAVEQIARLWNSTWDLKPGRSHDEVRSRFPSSGPYKIESVLDDGAV
VLVANDRWWGTKAITKRITVWPQGADIQDRVNNRSVDVVDVAAGSSGSLV
TPDSYQRTDYPSAGIEQLIFAPQGSLAQSRTRRALALCVPRDAIARDAGV
PIANSRLSPATDDALTDADGAAEARQFGRVDPAAARDALGGTPLTVRIGY
GRPNARLAATIGTIADACAPAGITVSDVTVDTPGPQALRDGKIDVLLAST
GGATGSGSSGSCAMDAYDLHSGNGNNLSGYANAQIDGIISALAVSADPAE
RARLLAEAAPVLWDEMPTLPLYRQQRTLLMSTKMYAVSRNPTRWGAGWNM
DRWALAR
>Rv2729c PROBABLE CONSERVED INTEGRAL MEMBRANE ALANINE VALINE AND LEUCINE RICH PROTEIN
MASVEFATILALGAALLAGIGYVTLQRSARQVTAEEYVGHFTLFHLSLRH
ALWWLGSLAAVASFTLQAIALTMGSVVLVQSLQATALLFALLIDARLTHH
RCTPREWMWAVLLAGAVAVIVMSGNPAAGTTRAPFSTWAVVAVVVVPAVV
LCVVGARIASGSLSAVLLAVASSATLAVFTVLTKGVVTELGEGFATLIRT
PALYAWILVLPIGLMLQQSSLRVGALTASLPTITVARPVIASVLGITVLD
EVLHTGRVALVALVAAVVVVVVATVALARDEVAMMTVSAGELGAAGQLAV
R
>Rv2333c PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MNRTQLLTLIATGLGLFMIFLDALIVNVALPDIQRSFAVGEDGLQWVVAS
YSLGMAVFIMSAATLADLDGRRRWYLIGVSLFTLGSIACGLAPSIAVLTT
ARGAQGLGAAAVSVTSLALVSAAFPEAKEKARAIGIWTAIASIGTTTGPT
LGGLLVDQWGWRSIFYVNLPMGALVLFLTLCYVEESCNERARRFDLSGQL
LFIVAVGALVYAVIEGPQIGWTSVQTIVMLWTAAVGCALFVWLERRSSNP
MMDLTLFRDTSYALAIATICTVFFAVYGMLLLTTQFLQNVRGYTPSVTGL
MILPFSAAVAIVSPLVGHLVGRIGARVPILAGLCMLMLGLLMLIFSEHRS
SALVLVGLGLCGSGVALCLTPITTVAMTAVPAERAGMASGIMSAQRAIGS
TIGFAVLGSVLAAWLSATLEPHLERAVPDPVQRHVLAEIIIDSANPRAHV
GGIVPRRHIEHRDPVAIAEEDFIEGIRVALLVATATLAVVFLAGWRWFPR
DVHTAGSDLSERLPTAMTVECAVSHMPGATWCRLWPA
>Rv1075c CONSERVED EXPORTED PROTEIN
MPRRSTIALATAGALASTGTAYLGARNLLVGQATHARTVIPKSFDAPPRA
DGVYTRGGGPVQRWRREVPFDVHLMIFGDSTATGYGCASAEEVPGVLIAR
GLAEQTGKRIRLSTKAIVGATSKGVCGQVDAMFVVGPPPDAAVIMIGAND
ITALNGIGPSAQRLADCVRRLRTRGAVVVVGTCPDLGVITAIPQPLRALA
HTRGVRLARAQTAAVKAAGGVPVPLGHLLAPKFRAMPELMFSADRYHPSA
PAYALAADLLFLALRDALTEKLDIPIHETPSRPGTATLEPGHTRHSMMSR
LRRPRPARAVPTGG
>Rv2017 POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN
MNGLGDVLAVARKARGLTQIELAELVGLTQPAINRYESGDRDPDQHIVAK
LAEILGVTDDLLIHGNRFRGALAVDAHMRRHKTTKASAWRQLEARLNLLR
VHASFLFEEVAINSEQHVPAFDPEFTAAEDAARLVRAQWRMPMGPVVNLT
RWMEAAGCLVFEEDFATQRIDGLSQWVDDYPVMLINANAAPDRKRLTLAH
ELGHLVLHSTNPTENMETEATAFAAEFLMPESEIRPELRRLDLGKLLELK
REWGVSMQALLARAYRMGLVSAEARTKLYKAMNARGWKTKEPGIESIVRE
KPSLPAHIGMTLRSRGFTDQQAAAIAGYANPADNPFRPEGGRLHAI
>Rv3534c PROBABLE 4-HYDROXY-2-OXOVALERATE ALDOLASE (HOA)
MTDMWDVRITDTSLRDGSHHKRHQFTKDEVGAIVAALDAAGVPVIEVTHG
DGLGGSSFNYGFSKTPEQELIKLAAATAKEARIAFLMLPGVGTKDDIKEA
RDNGGSICRIATHCTEADVSIQHFGLARELGLETVGFLMMAHTIAPEKLA
AQARIMADAGCQCVYVVDSAGALVLDGVADRVSALVAELGEDAQVGFHGH
ENLGLGVANSVAAVRAGAKQIDGSCRRFGAGAGNAPVEALIGVFDKIGVK
TGIDFFDIADAAEDVVRPAMPAECLLDRNALIMGYSGVYSSFLKHAVRQA
ERYGVPASALLHRAGQRKLIGGQEDQLIDIALEIKRELDSGAAVTH
>Rv3221A POSSIBLE ANTI-SIGMA FACTOR
MSENCGPTDAHADHDDSHGGMGCAEVIAEVWTLLDGECTPETRERLRRHL
EACPGCLRHYGLEERIKALIGTKCRGDRAPEGLRERLRLEIRRTTIIRGG
P
>Rv1672c PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MATIAASPTHNALGKAARRLLPLLFVLYVINFVDRANISVAALAMNADLR
LSATAYGTAAGVFFLGYVLFQVPANAALARFGAGRTLTAVVLAWGVCSAA
TALVTSAHTLYLARFALGVAEGGFFPGVIAYLTVWFPCAQRARAVATFLL
AIPVANTVGLPLSGLIVGHVHMAGLPGWRAMFVIEALPALLLAPLLRRLL
PDNPQRASWLTPEERAELSARLTEDTPAPTGRSSGAGWDLVLFAVVYGGL
YFALYALQFFLPQLVASLAHGTATLTAATLAALPYGVAALAMLAWSHRSI
DRSGAQAGHITLPTTAAGSAALGAALSPMSPIVTLSWLTIAVAGILAAMP
AFWSRCTAALAGPRVAVAIATVNAVASLASFAGPYATGHLKDATGTYHLA
LLTVAAVLAAAAACSLLLRHAGRTVCANDSEIMLHPSPATPFV
>Rv2337c HYPOTHETICAL PROTEIN
MRAGRWGPGMTGLDPAEFLSLVEAAALAPSADNRREVQLEHAGRRVRLWG
DQTWRSAPEHRRIMSLVAIGAAVENVKLRAGRLGFETKVCWFPDSGNPGL
VAEIDVDRLPQTRVDPIEGAIERRRTNRRVRFRGPPLSQGELGALSAEAT
GIDGIQLHWFDSPETRKQILRLVRLAETERFRSRELHEELFSAVRFDIGW
TASSDDGLPPGSLEVEAWMRPMFRGLRHWRVLRLLRTVGMHHALGLRAAY
LPCRLAPHVGALTTSLDLASGALTAGAVFERIWLRTTLLGAELQPFAASA
VLSLPACEWVAPHVRAALVGGWNLLAPGHWPMMVFRIGHARAPSVRTMRQ
SVEAYCYAPAERSGSDSESRFA
>Rv1999c PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MRRPLDPRDIPDELRRRLGLLDAVVIGLGSMIGAGIFAALAPAAYAAGSG
LLLGLAVAAVVAYCNAISSARLAARYPASGGTYVYGRMRLGDFWGYLAGW
GFVVGKTASCAAMALTVGFYVWPAQAHAVAVAVVVALTAVNYAGIQKSAW
LTRSIVAVVLVVLTAVVVAAYGSGAADPARLDIGVDAHVWGMLQAAGLLF
FAFAGYARIATLGEEVRDPARTIPRAIPLALGITLAVYALVAVAVIAVLG
PQRLARAAAPLSEAMRVAGVNWLIPVVQIGAAVAALGSLLALILGVSRTT
LAMARDRHLPRWLAAVHPRFKVPFRAELVVGAVVAALAATADIRGAIGFS
SFGVLVYYAIANASALTLGLDEGRPRRLIPLVGLIGCVVLAFALPLSSVA
AGAAVLGVGVAAYGVRRIITRRARQTDSGDTQRSGHPSAT
>Rv0075 PROBABLE AMINOTRANSFERASE
MQDSIFNLLTEEQLRGRNTLKWNYFGPDVVPLWLAEMDFPTAPAVLDGVR
ACVDNEEFGYPPLGEDSLPRATADWCRQRYGWCPRPDWVRVVPDVLKGME
VVVEFLTRPESPVALPVPAYMPFFDVLHVTGRQRVEVPMVQQDSGRYLLD
LDALQAAFVRGAGSVIICNPNNPLGTAFTEAELRAIVDIAARHGARVIAD
EIWAPVVYGSRHVAAASVSEAAAEVVVTLVSASKGWNLPGLMCAQVILSN
RRDAHDWDRINMLHRMGASTVGIRANIAAYHHGESWLDELLPYLRANRDH
LARALPELAPGVEVNAPDGTYLSWVDFRALALPSEPAEYLLSKAKVALSP
GIPFGAAVGSGFARLNFATTRAILDRAIEAIAAALRDIID
>Rv3877 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MSAPAVAAGPTAAGATAARPATTRVTILTGRRMTDLVLPAAVPMETYIDD
TVAVLSEVLEDTPADVLGGFDFTAQGVWAFARPGSPPLKLDQSLDDAGVV
DGSLLTLVSVSRTERYRPLVEDVIDAIAVLDESPEFDRTALNRFVGAAIP
LLTAPVIGMAMRAWWETGRSLWWPLAIGILGIAVLVGSFVANRFYQSGHL
AECLLVTTYLLIATAAALAVPLPRGVNSLGAPQVAGAATAVLFLTLMTRG
GPRKRHELASFAVITAIAVIAAAAAFGYGYQDWVPAGGIAFGLFIVTNAA
KLTVAVARIALPPIPVPGETVDNEELLDPVATPEATSEETPTWQAIIASV
PASAVRLTERSKLAKQLLIGYVTSGTLILAAGAIAVVVRGHFFVHSLVVA
GLITTVCGFRSRLYAERWCAWALLAATVAIPTGLTAKLIIWYPHYAWLLL
SVYLTVALVALVVVGSMAHVRRVSPVVKRTLELIDGAMIAAIIPMLLWIT
GVYDTVRNIRF
>Rv3253c POSSIBLE CATIONIC AMINO ACID TRANSPORT INTEGRAL MEMBRANE PROTEIN
MAGRRRMKSVEQSIADTDEPTTRLRKDLTWWDLVVFGVSVVIGAGIFTVT
ASTAGDITGPAIWISFLIAAATCALAALCYAEFASTLPVAGSAYTFSYAT
FGEFLAWVIGWNLVLELAMGAAVVAKGWSSYLGTVFGFGNGTGHLGSLQL
DWGALVIVTLVATLIALGTKLSSRFSAVVTAIKVSVVVLVVVVGAFYIRA
ANYSPFIPEPEVQHHGGGLDQSVFSLLTGAQGSHYGWYGVLAGASIVFFA
FIGFDIVATMAEETKRPQRDVPRGILASLGVVTLLYVAVSVVLSGMVPYT
QLRTVPGRGPANLATAFQANGVYWASGIISVGALAGLTTVVMVLMLGQCR
VLFAMARDGLVPRQLAKTGSRGTPVRVTVLVAVLVATTASVFPITKLEEM
VNVGTLFAFILVSAGVVVLRRTRPDLQRGFTAPWVPLLPIAAVCACLWLM
LNLTALTWIRFGIWLVAGTAIYVGYGRRHSAQGLRQARESATRRC
>Rv3067 CONSERVED HYPOTHETICAL PROTEIN
MLTVGVGIGAAILLGWFTLAHRHPDQPGAAATPPPAGLTTRSAPTAAPPS
TLQSPDLDSVFLGNLHDRGISFTNPDAAVYNGKMVCTNLGGGMTVQQVVE
ALQSSSPALGDRTTAYVAVSIRTYCPKYDAVLPPGS
>Rv2197c Probable conserved transmembrane protein
MVSRYSAYRRGPDVISPDVIDRILVGACAAVWLVFTGVSVAAAVALMDLG
RGFHEMAGNPHTTWVLYAVIVVSALVIVGAIPVLLRARRMAEAEPATRPT
GASVRGGRSIGSGHPAKRAVAESAPVQHADAFEVAAEWSSEAVDRIWLRG
TVVLTSAIGIALIAVAAATYLMAVGHDGPSWISYGLAGVVTAGMPVIEWL
YARQLRRVVAPQSS
>Rv1706A CONSERVED HYPOTHETICAL PROTEIN
MGSLAAFKLGWLLSAMAPNVVLLTAFRVPQGLTMLTVFATGQAGQHRCRT
FHVTP
>Rv0876c POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MAPTPGRRTRNGSVNGHPGMANYPPDDANYRRSRRPPPMPSANRYLPPLG
EQPEPERSRVPPRTTRAGERITVTRAAAMRSREMGSRMYLLVHRAATADG
ADKSGLTALTWPVMANFAVDSAMAVALANTLFFAAASGESKSRVALYLLI
TIAPFAVIAPLIGPALDRLQHGRRVALALSFGLRTALAVVLIMNYDGATG
SFPSWVLYPCALAMMVFSKSFSVLRSAVTPRVMPPTIDLVRVNSRLTVFG
LLGGTIAGGAIAAGVEFVCTHLFQLPGALFVVVAITIAGASLSMRIPRWV
EVTSGEVPATLSYHRDRGRLRRRWPEEVKNLGGTLRQPLGRNIITSLWGN
CTIKVMVGFLFLYPAFVAKAHEANGWVQLGMLGLIGAAAAVGNFAGNFTS
ARLQLGRPAVLVVRCTVLVTVLAIAAAVAGSLAATAIATLITAGSSAIAK
ASLDASLQHDLPEESRASGFGRSESTLQLAWVLGGAVGVLVYTELWVGFT
AVSALLILGLAQTIVSFRGDSLIPGLGGNRPVMAEQETTRRGAAVAPQ
>Rv1414 CONSERVED HYPOTHETICAL PROTEIN
MLGDAQQLELGRCAPADIALTVAATVVSRQDCRSGLRRIVLDCGSKILGS
DRPAWATGFGRLIDHADARIAALSEHHATVVWPDDAPLPPVGTRLRVIPN
HVCLTTNLVDDVAVVRDATLIDRWKVAARGKNH
>Rv2805 CONSERVED HYPOTHETICAL PROTEIN
MGRGNGKILDPVVATTGMGRSTARQMLTGPRLPGPAEQVDGRSLRPRGFS
DEARALLEHVWALMGMPCGKYLVVMHDLWLPLLTAAGDLDKPLVTEASVA
ELKATALPGANRMPHWAAGTLPDGFPARAVRTRT
>Rv1125 CONSERVED HYPOTHETICAL PROTEIN
MAGHRMAAVDAQFYWMSAKVPNDQFLLYAFDGEPTDLERAVAQVYRRARG
CPGLGMRVQDRGALAYPQWVPTPVQRDQLVCHDLADRSWQGCLAAVVGLA
SKQLDMRRMPWRLHVFTPVHDVPGVSGLGTVAVMQFAHALGDGARASAMA
AWLFGRPAAVPEIARSRAGFLPWRAAHAARAHLRLVRDTNAGLVAPGVGS
RPPLSTNARPEGVRAVRTLLRRRSQLAGPTVTVTVLAAVSTGLLGLLGGD
VDTLGAEVPMAKPGVPRSYNHFGNVVVGLYPRLEPDERVRRIATDLANAR
RRFEHPAMLSADRAFAAVPAALLRWGVSQFDAEVRPVRVAGNTVVSSVYR
GAADLSFGDAPVVLTAGYPALSPAMGLTHGVHGIGDTVAISVHAAESAVS
DIDAYMRLLDAALQ
>Rv2456c PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MSGTVVAVPPRVARALDLLNFSLADVRDGLGPYLSIYLLLIHDWDQASIG
FVMAVGGIAAIVAQTPIGALVDRTTAKRALVVAGAVLVTAAAVAMPLFAG
LYSISVLQAVTGIASSVFAPALAAITLGAVGPQFFARRIGRNEAFNHAGN
ASAAGATGALAYFFGPVVVFWVLAGMALISVLATLRIPPDAVDHDLARGM
DHAPGEPHPQPSRFTVLAHNRELVIFGAAVVAFHFANAAMLPLVGELLAL
HNRDEGTALMSSCIVAAQVVMVPVAYVVGTRADAWGRKPIFLVGFAVLTA
RGFLYTLSDNSYWLVGVQLLDGIGAGIFGALFPLVVQDVTHGTGHFNISL
GAVTTATGIGAALSNLVAGWIVVVAGYDAAFMSLGALAGAGFLLYLVAMP
ETVDSDVRVRSRPTLGGK
>Rv2569c CONSERVED HYPOTHETICAL PROTEIN
MSADSSLSLPLSGTHRYRVTHRTEYRYSDVVTSSYGRGFLTPRNSLRQRC
VAHRLTIDPAPADRSTSRDGYGNISSYFHVTEPHRTLTITSDSIVDVSPP
PPGLYTSGPALQPWEAARPAGLPGSLATEFTLDLNPPEITDAVREYAAPS
FLPKRPLVEVLRDLASRIYTDFTYRSGSTTISTGVNEVLLAREGVCQDFA
RLAIACLRANGLAACYVSGYLATDPPPGKDRMIGIDATHAWASVWTPQQP
GRFEWLGLDPTNDQLVDQRYIVVGRGRDYADVPPLRGIIYTNSENSVIDV
SVDVVPFEGDALHA
>Rv3684 PROBABLE LYASE
MIEADARRSADTHLLRYPLPAAWCTDVDVELYLKDETTHITGSLKHRLAR
SLFLYALCNGWINENTTVVEASSGSTAVSEAYFAALLGLPFIAVMPAATS
ASKIALIESQGGRCHFVQNSSQVYAEAERVAKETGGHYLDQFTNAERATD
WRGNNNIAESIYVQMREEKHPTPEWIVVGAGTGGTSATIGRYIRYRRHAT
RLCVVDPENSAFFPAYSEGRYDIVMPTSSRIEGIGRPRVEPSFLPGVVDR
MVAVPDAASIAAARHVSAVLGRRVGPSTGTNLWGAFGLLAEMVKQGRSGS
VVTLLADSGDRYADTYFSDEWVSAQGLDPAGPAAALVEFERSCRWT
>Rv2409c CONSERVED HYPOTHETICAL PROTEIN
MWRTRVVHTTGYVYQSPVTASYNEARLTPRSSSRQNLVLNRVETIPATRS
YRYIDYWGTAVTAFDLHAPHTELTVTSSSVVETERPEPLAAKATWADLQS
TAVIDRFDEVLRPTPHTPASARVDAVGRRIRKCHEPSEAVVAAARWARSE
LDYIPGTTSVHSSGLDALEQGKGVCQDFVHLSLMVLRSMGIPCRYVSGYL
HPKRDAVVGKTVDGRSHAWVQAWTGGWWHYDPTNDNEITEQYISVGVGRD
YTDVSPLKGIYSGEGVTDLDVVVEITRLA
>Rv2265 Possible conserved integral membrane protein
MGANGDVALSRIGATRPALSAWRFVTVFGVVGLLADVVYEGARSITGPLL
ASLGATGLVVGVVTGVGEAAALGLRLVSGPLADRSRRFWAWTIAGYTLTV
VTVPLLGIAGALWVACALVIAERVGKAVRGPAKDTLLSHAASVTGRGRGF
AVHEALDQVGAMIGPLTVAGMLAITGNAYAPALGVLTLPGGAALALLLWL
QRRVPRPESYEDCPVVLGNPSAPRPWALPAQFWLYCGFTAITMLGFGTFG
LLSFHMVSHGVLAAAMVPVVYAAAMAADALTALASGFSYDRYGAKTLAVL
PILSILVVLFAFTDNVTMVVIGTLVWGAAVGIQESTLRGVVADLVASPRR
ASAYGVFAAGLGAATAGGGALIGWLYDISIGTLVVVVIALELMALVMMFA
IRLPRVAPS
>Rv0842 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MRYTGPERCSGDGQVRAAGDRYSTVIWLLGGNLLVRSAGFGYPFLAYHVA
GRGHGAGAVGAVVAAYGLGWAVGQLLCGWLVDRVGARVTLVSTMLVAAAV
LVLMAGLHTVPGLLVGAMIAGLVCDAPRPVLGAVIAELVADPQRRAQLDG
WRYGWVLNIGAAITGGVGGVVAGWLDTPVLYWINGIGCAIFAGLAGRCIP
ADVCRRTESGLRACTAMSKVGYRQALSDKRLVLLAVSGLATLTTLMGFFA
AVPMLMSASGLGVGAYGWVQLINALAVVAVTPLLTPWLSKQLALGPRPDI
LAGAGVWVTLCMAAAGLARTTVGFSVAAAACSPGEIAWFVVAAGIVHRIA
PPAHGGRYHGIWSMAVAASSVAAPILAAFNLANGGRLVLAATTVTVGFFG
AALCLPLARVLAAASCGPLSSKEPSRDSYQ
>Rv2561 CONSERVED HYPOTHETICAL PROTEIN
MGIQRAVLLIADIGGYTNYMHWNRKHLAHAQWTVAQLLESVIDAAKGMKL
AKLEGDAAFFWAPGGQHQCPGMRPAPADAPEVPHAARADQKRPSLRL
>Rv0037c PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MPRVEVGLVIHSRMHARAPVDVWRSVRSLPDFWRLLQVRVASQFGDGLFQ
AGLAGALLFNPDRAADPMAIAGAFAVLFLPYSLLGPFAGALMDRWDRRWV
LVGANTGRLALIAGVGTILAVGAGDVPLLVGALVANGLARFVASGLSAAL
PHVVPREQVVTMNSVAIASGAVSAFLGANFMLLPRWLLGSGDEGASAIVF
LVAIPVSIALLWSLRFGPRVLGPDDTERAIHGSAVYAVVTGWLHGARTVV
QLPTVAAGLSGLAAHRMVVGINSLLILLLVRHVTARAVGGLGTALLFFAA
TGLGAFLANVLTPTAIRRWGRYATANGALAAAATIQVAAAGLLVPVMVVC
GFLLGVAGQVVKLCADSAMQMDVDDALRGHVFAVQDALFWVSYILSITVA
AALIPEHGHAPVFVLFGSAIYLAGLVVHTIVGRRGQPVIGR
>Rv1410c AMINOGLYCOSIDES/TETRACYCLINE-TRANSPORT INTEGRAL MEMBRANE PROTEIN
MRAGRRVAISAGSLAVLLGALDTYVVVTIMRDIMNSVGIPINQLHRITWI
VTMYLLGYIAAMPLLGRASDRFGRKLMLQVSLAGFIIGSVVTALAGHFGD
FHMLIAGRTIQGVASGALLPITLALGADLWSQRNRAGVLGGIGAAQELGS
VLGPLYGIFIVWLLHDWRDVFWINVPLTAIAMVMIHFSLPSHDRSTEPER
VDLVGGLLLALALGLAVIGLYNPNPDGKHVLPDYGAPLLVGALVAAVAFF
GWERFARTRLIDPAGVHFRPFLSALGASVAAGAALMVTLVDVELFGQGVL
QMDQAQAAGMLLWFLIALPIGAVTGGWIATRAGDRAVAFAGLLIAAYGYW
LISHWPVDLLADRHNILGLFTVPAMHTDLVVAGLGLGLVIGPLSSATLRV
VPSAQHGIASAAVVVARMTGMLIGVAALSAWGLYRFNQILAGLSAAIPPN
ASLLERAAAIGARYQQAFALMYGEIFTITAIVCVFGAVLGLLISGRKEHA
DEPEVQEQPTLAPQVEPL
>Rv1634 Possible drug efflux membrane protein
MTETASETGSWRELLSRYLGTSIVLAGGVALYATNEFLTISLLPSTIADI
GGSRLYAWVTTLYLVGSVVAATTVNTMLLRVGARSSYLMGLAVFGLASLV
CAAAPSMQILVAGRTLQGIAGGLLAGLGYALINSTLPKSLWTRGSALVSA
MWGVATLIGPATGGLFAQLGLWRWAFGVMTLLTALMAMLVPVALGAGGVG
PGGETPVGSTHKVPVWSLLLMGAAALAISVAALPNYLVQTAGLLAAAALL
VAVFVVVDWRIHAAVLPPSVFGSGPLKWIYLTMSVQMIAAMVDTYVPLFG
QRLGHLTPVAAGFLGAALAVGWTVGEVASASLNSARVIGHVVAAAPLVMA
SGLALGAVTQRADAPVGIIALWALALLIIGTGIGIAWPHLTVRAMDSVAD
PAESSAAAAAINVVQLISGAFGAGLAGVVVNTAKGGEVAAARGLYMAFTV
LAAAGVIASYQATHRDRRLPR
>Rv0841 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MVAASIVHHSAAPANRGRYHGIWSMTPVVASVVVPIMASYGPIHGAHLLA
AVVVGSAGAALCLPLARALRRPTPSAMTTD
>Rv3098c HYPOTHETICAL PROTEIN
MASLRIAEVDPVDRSPNHHASGSVETSSSRSRSASVRACLIHTSRSSSCS
ARRMTSLLRSPLRIAALMICSSFSVGRKPMVAVMSTTIADVAQSYSNCST
HSGTPTPAFAASFLLDAINAPRVIAGRFASESVRFPAAAPHGSVPSRLPV
>Rv1895 POSSIBLE DEHYDROGENASE
MRAVVIDGAGSVRVNTQPDPALPGPDGVVVAVTAAGICGSDLHFYEGEYP
FTEPVALGHEAVGTIVEAGPQVRTVGVGDLVMVSSVAGCGVCPGCETHDP
VMCFSGPMIFGAGVLGGAQADLLAVPAADFQVLKIPEGITTEQALLLTDN
LATGWAAAQRADISFGSAVAVIGLGAVGLCALRSAFIHGAATVFAVDRVK
GRLQRAATWGATPIPSPAAETILAATRGRGADSVIDAVGTDASMSDALNA
VRPGGTVSVVGVHDLQPFPVPALTCLLRSITLRMTMAPVQRTWPELIPLL
QSGRLDVDGIFTTTLPLDEAAKGYATARARSGEELRFCLRPDSRDVLGAH
ETVDLYVHVRRCQSVADLQLEGAADGVDGPSMLN
>Rv0790c HYPOTHETICAL PROTEIN
MTLANNGTGMDHFLTPTEYLDAGHPLVRTTAATLIRDAVSDTERVRRIYY
YVRDVPYDVLASFRYLAQGHHRASDVIGHGVAFCMGKASSFVALCRAAGV
PARIAFQTIDAPDKEFLSPQVRALWGGRTGRPFPWHSLGEAYLGRRWVKL
DATIDAPTAARLGKPYRQEFDGATPIPTVEGTILRENGSYADYPSAVAQW
YERIAQSVLKALQSTEVHALVAADEELWTGPPVELADATHRL
>Rv2994 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MSRDPTGVGARWAIMIVSLGVTASSFLFINGVAFLIPRLENARGTPLSHA
GLLASMPSWGLVVTMFAWGYLLDHVGERMVMAVGSALTAAAAYAAASVHS
LLWIGVFLFLGGMAAGGCNSAGGRLVSGWFPPQQRGLAMGIRQTAQPLGI
ASGALVIPELAERGVHAGLMFPAVVCTLAAVASVLGIVDPPRKSRTKASE
QELASPYRGSSILWRIHAASALLMMPQTVTVTFMLVWLINHHGWSVAQAG
VLVTISQLLGALGRVAVGRWSDHVGSRMRPVRLIAAAAAATLFLLAAVDN
EGSRYDVLLMIAISVIAVLDNGLEATAITEYAGPYWSGRALGIQNTTQRL
MAAAGPPLFGSLITTAAYPTAWALCGVFPLAAVPLVPVRLLPPGLETRAR
RQSVRRHRWWQAVRCHAWPNGPRRPGPPGQPRRVRQGGTAITPPT
>Rv2747 POSSIBLE TRANSFERASE
MTERPRDCRPVVRRARTSDVPAIKQLVDTYAGKILLEKNLVTLYEAVQEF
WVAEHPDLYGKVVGCGALHVLWSDLGEIRTVAVDPAMTGHGIGHAIVDRL
LQVARDLQLQRVFVLTFETEFFARHGFTEIEGTPVTAEVFDEMCRSYDIG
VAEFLDLSYVKPNILGNSRMLLVL
>Rv0460 CONSERVED HYDROPHOBIC PROTEIN
MLVGNAIGLLAGVACSVLVHARIRPDIVIAMVVGIPSAIGLLVILFSGRR
WVTMLGAFILALAPGWFGVLVAIQVASSG
>Rv0457c PROBABLE PEPTIDASE
MTFEPAPDGADPYLWLEDVTGAEALDWVRARNKPTTAAFCDAEFERMRVE
ALEVLDTDARIPYVNRRGNYLYNFWRDAANPRGLWRRTTLDSYRTDSPGW
DVLIDVDELGRADDQKWVWGGAGVIEPDYTRALIGLSPGGSDASIVREFD
MLTREFVEDGFQLPPAKSQITWEDPDTVLLGTDFGGDSLTTSGYPRVIKR
WRRGKPLADAETIFEGAGTDVRVNASADRTPGFERTLLGRALDFWNEEVY
ELRGSELIRIEAPTDASVSIHRDWLLIELRTDWTVATTRYTAGSLLAAEY
DEFLAGSAELQVVFEPDEHTALYQYAWTRDRLLIVTLADVASRVEIATPG
SWRREPLSGIPAATNTVIVSADSHGDEFFLDSSGFDTPSRLMRGTDDGRL
AEIKSAPAFFDAENMAVTQYFATSDDGTSIPYFVVRRTDADNPGPTLLNG
YGGFETSRTPTYDGVLGRLWLARGGTYALANIRGGGEYGPGWHTQAMREG
RDKVAQDFAAVATDLVTRGITTAEQLGARGGSNGGLLMGIMLTGYPEKFG
ALVCDVPLLDMKRYHLLLAGASWMAEYGDPDNPDDWKFISEYSPYQNISA
NRKYPPVLMTTSTRDDRVHPGHARKMTAALQAAGHPVWYYENIEGGHAGA
ADNAQIAFKSALSFAFLWRMLAG
>Rv1201c PROBABLE TRANSFERASE
MSTVTGAAGIGLATLAADGSVLDTWFPAPELTESGTSATSRLAVSDVPVE
LAALIGRDDDRRTETIAVRTVIGSLDDVAADPYDAYLRLHLLSHRLVAPH
GLNAGGLFGVLTNVVWTNHGPCAIDGFEAVRARLRRRGPVTVYGVDKFPR
MVDYVVPTGVRIADADRVRLGAHLAPGTTVMHEGFVNYNAGTLGASMVEG
RISAGVVVGDGSDVGGGASIMGTLSGGGTHVISIGKRCLLGANSGLGISL
GDDCVVEAGLYVTAGTRVTMPDSNSVKARELSGSSNLLFRRNSVSGAVEV
LARDGQGIALNEDLHAN
>Rv1924c HYPOTHETICAL PROTEIN
MDPADVINPTSTRDAALARVLAYRQRVRARPLLIRATLAVVGGGLFVVSL
PMIVLLPELGIPALLVAFRLLAVEAQWAVRAYAWTDWRFTQLREWFHRQV
LVTRAAILVGLFLAAVALVWLLVYEF
>Rv0812 PROBABLE AMINO ACID AMINOTRANSFERASE
MVVTLDGEILQPGMPLLHADDLAAVRGDGVFETLLVRDGRACLVEAHLQR
LTQSARLMDLPEPDLPRWRRAVEVATQRWVASTADEGALRLIYSRGREGG
SAPTAYVMVSPVPARVIGARRDGVSAITLDRGLPADGGDAMPWLIASAKT
LSYAVNMAVLRHAARQGAGDVIFVSTDGYVLEGPRSTVVIATDGDQGGGN
PCLLTPPPWYPILRGTTQQALFEVARAKGYDCDYRALRVADLFDSQGIWL
VSSMTLAARVHTLDGRRLPRTPIAEVFAELVDAAIVSDR
>Rv1279 PROBABLE DEHYDROGENASE FAD flavoprotein GMC oxidoreductase
MDTQSDYVVVGTGSAGAVVASRLSTDPATTVVALEAGPRDKNRFIGVPAA
FSKLFRSEIDWDYLTEPQPELDGREIYWPRGKVLGGSSSMNAMMWVRGFA
SDYDEWAARAGPRWSYADVLGYFRRIENVTAAWHFVSGDDSGVTGPLHIS
RQRSPRSVTAAWLAAARECGFAAARPNSPRPEGFCETVVTQRRGARFSTA
DAYLKPAMRRKNLRVLTGATATRVVIDGDRAVGVEYQSDGQTRIVYARRE
VVLCAGAVNSPQLLMLSGIGDRDHLAEHDIDTVYHAPEVGCNLLDHLVTV
LGFDVEKDSLFAAEKPGQLISYLLRRRGMLTSNVGEAYGFVRSRPELKLP
DLELIFAPAPFYDEALVPPAGHGVVFGPILVAPQSRGQITLRSADPHAKP
VIEPRYLSDLGGVDRAAMMAGLRICARIAQARPLRDLLGSIARPRNSTEL
DEATLELALATCSHTLYHPMGTCRMGSDEASVVDPQLRVRGVDGLRVADA
SVMPSTVRGHTHAPSVLIGEKAADLIRS
>Rv1979c POSSIBLE CONSERVED PERMEASE
MVGPRTRGYAIHKLGFCSVVMLGINSIIGAGIFLTPGEVIGLAGPFAPMA
YVLAGIFAGVVAIVFATAARYVRTNGASYAYTTAAFGRRIGIYVGVTHAI
TASIAWGVLASFFVSTLLRVAFPDKAWADAEQLFSVKTLTFLGFIGVLLA
INLFGNRAIKWANGTSTVGKAFALSAFIVGGLWIITTQHVNNYATAWSAY
SATPYSLLGVAEIGKGTFSSMALATIVALYAFTGFESIANAAEEMDAPDR
NLPRAIPIAIFSVGAIYLLTLTVAMLLGSNKIAASDDTVKLAAAIGNATF
RTIIVVGALISMFGINVAASFGAPRLWTALADSGVLPTRLSRKNQYDVPM
VSFAITASLALAFPLALRFDNLHLTGLAVIARFVQFIIVPIALIALARSQ
AVEHAAVRRNAFTDKVLPLVAIVVSVGLAVSYDYRCIFLVRGGPNYFSIA
LIVITFVVVPAMAYLHYYRIIRRVGDRPSTR
>Rv1496 Possible transport system kinase
MMAASHDDDTVDGLATAVRGGDRAALPRAITLVESTRPDHREQAQQLLLR
LLPDSGNAHRVGITGVPGVGKSTAIEALGMHLIERGHRVAVLAVDPSSTR
TGGSILGDKTRMARLAVHPNAYIRPSPTSGTLGGVTRATRETVVLLEAAG
FDVILIETVGVGQSEVAVANMVDTFVLLTLARTGDQLQGIKKGVLELADI
VVVNKADGEHHKEARLAARELSAAIRLIYPREALWRPPVLTMSAVEGRGL
AELWDTVERHRQVLTGAGEFDARRRDQQVDWTWQLVRDAVLDRVWSNPTV
RKVRSELERRVRAGELTPALAAQQILEIANLTDR
>Rv0546c CONSERVED HYPOTHETICAL PROTEIN
MEILASRMLLRPADYQRSLSFYRDQIGLAIAREYGAGTVFFAGQSLLELA
GYGEPDHSRGPFPGALWLQVRDLEATQTELVSRGVSIAREPRREPWGLHE
MHVTDPDGITLIFVEVPEGHPLRTDTRA
>Rv0699 HYPOTHETICAL PROTEIN
MGDRRVDLLAAKDSEIRRSMGAVPVGAGSSQVATSWASDRCIRCRAAILS
ADCANLARANSRGGLAVGGSAVS
>Rv0858c PROBABLE AMINOTRANSFERASE
MTVSRLRPYATTVFAEMSALATRIGAVNLGQGFPDEDGPPKMLQAAQDAI
AGGVNQYPPGPGSAPLRRAIAAQRRRHFGVDYDPETEVLVTVGATEAIAA
AVLGLVEPGSEVLLIEPFYDSYSPVVAMAGAHRVTVPLVPDGRGFALDAD
ALRRAVTPRTRALIINSPHNPTGAVLSATELAAIAEIAVAANLVVITDEV
YEHLVFDHARHLPLAGFDGMAERTITISSAAKMFNCTGWKIGWACGPAEL
IAGVRAAKQYLSYVGGAPFQPAVALALDTEDAWVAALRNSLRARRDRLAA
GLTEIGFAVHDSYGTYFLCADPRPLGYDDSTEFCAALPEKVGVAAIPMSA
FCDPAAGQASQQADVWNHLVRFTFCKRDDTLDEAIRRLSVLAERPAT
>Rv0274 CONSERVED HYPOTHETICAL PROTEIN
MIKPHNTNTEFELGGINHVALVCSDMARTVDFYSNILGMPLIKALDLPGG
QGQHFFFDAGNGDCVAFFWFADAPDRVPGLSSPVAIPGIGDITSAVSTMN
HLAFHVPAERFDAYRQRLKDKGVRVGPVLNHDDSETQVSAVVHPGVYVRS
FYFQDPDGITLEFACWTKEFTTSDAQAVPKTAADRRPPVAADR
>Rv1769 CONSERVED HYPOTHETICAL PROTEIN
MHEVAAREQRSDGPMRLDAQGRLQRYEEAFADYDAPFAFVDLDAMWGNAD
QLLARAGDKPIRVASKSLRCRPLQREILDASERFDGLLTFTLTETLWLAG
QGFSNLLLAYPPTDRAALRALGELTAKDPDGAPIVMVDSVEHLDLIERTT
DKPVRLCLDFDAGYWRAGGRIKIGSKRSPLHTPEQARALAVEIARRPALT
LAALMCYEAHIAGLGDNVAGKRVHNAIIRRMQRMSFEELRERRARAVELV
REVADIKIVNAGGTGDLQLVAQEPLITEATAGSGFYAPTLFDSYSTFTLQ
PAAMFALPVCRRPGAKTVTALGGGYLASGVGAKDRMPTPYLPVGLKLNAL
EGTGEVQTPLSGDAARRLKLGDKVYFRHTKAGELCERFDHLHLVRGAEVV
DTVPTYRGEGRTFL
>Rv1159 CONSERVED TRANSMEMBRANE PROTEIN
MCRTLIDGPVRSAIAKVRQIDTTSSTPAAARRVTSPPARETRAAVLLLVL
SVGARLAWTYLAPNGANFVDLHVYVSGAASLDHPGTLYGYVYADQTPDFP
LPFTYPPFAAVVFYPLHLVPFGLIALLWQVVTMAALYGAVRISQRLMGGT
AETGHFAAMLWTAIAIWIEPLRSTFDYGQINVLLMLAALWAVYTPRWWLS
GLLVGVASGVKLTPAITAVYLVGVRRLHAAAFSVVVFLATVGVSLLVVGD
EARYYFTDLLGDAGRVGPIATSFNQSWRGAISRILGHDAGFGPLVLAAIA
STAVLAILAWRALDRSDRLGKLLVVELFGLLLSPISWTHHWVWLVPLMIW
LIDGPARERPGARILGWGWLVLTIVGVPWLLSFAQPSIWQIGRPWYLAWA
GLVYVVATLATLGWIAASERYVRIRPRRMAN
>Rv2459 PROBABLE CONSERVED INTEGRAL MEMBRANE TRANSPORT PROTEIN
MTPRQRLTVLATGLGIFMVFVDVNIVNVALPSIQKVFHTGEQGLQWAVAG
YSLGMAAVLMSCALLGDRYGRRRSFVFGVTLFVVSSIVCVLPVSLAVFTV
ARVIQGLGAAFISVLSLALLSHSFPNPRMKARAISNWMAIGMVGAASAPA
LGGLMVDGLGWRSVFLVNVPLGAIVWLLTLVGVDESQDPEPTQLDWVGQL
TLIPAVALIAYTIIEAPRFDRQSAGFVAALLLAAGVLLWLFVRHEHRAAF
PLVDLKLFAEPLYRSVLIVYFVVMSCFFGTLMVITQHFQNVRDLSPLHAG
LMMLPVPAGFGVASLLAGRAVNKWGPQLPVLTCLAAMFIGLAIFAISMDH
AHPVALVGLTIFGAGAGGCATPLLHLGMTKVDDGRAGMAAGMLNLQRSLG
GIFGVAFLGTIVAAWLGAALPNTMADEIPDPIARAIVVDVIVDSANPHAH
AAFIGPGHRITAAQEDEIVLAADAVFVSGIKLALGGAAVLLTGAFVLGWT
RFPRTPAS
>Rv2141c CONSERVED HYPOTHETICAL PROTEIN
MTDETGASSDHSDDVAQVVSRLIRFDTTNSGEPGTTKGEAECARWVAEQL
AEVGYQPEYVESGAPGRGNVFARLAGADSSRGALLIHGHLDVVPAEPAEW
SVHPFSGAIEDGYVWGRGAVDMKDMVGMMIVVARHLRQAAIVPPRDLVFA
FVADEEHGGKYGSHWLVDNRPDLFDGITEAIGEVGGFSLTVPRHDGGERR
LYLIETAEKGIQWMRLTARGRAGHGSMVHDQNAVTAVCEAVARLGRHQFP
LVCTDTVAQFLAVVGEETGLAFDLDSPDLAGTIDKLGPMARMLKAVLHDT
ANPTMLKAGYKANVVPATAEAVVDCRVLPGRRAAFEAEVDALIGPDVTRE
WVSDLPSYETTFDGDLVAAMNAAVLAVDPDGRTVPYMLSGGTDAKAFARL
GIRCFGFSPLRLPPDLDFTSLFHGVDERVPIDGLRFGTEVLTHLLTHC
>Rv1877 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MAGPTAPTTAPTAIRAGGPLLSPVRRNIIFTALVFGVLVAATGQTIVVPA
LPTIVAELGSTVDQSWAVTSYLLGGTVVVVVAGKLGDLLGRNRVLLGSVV
VFVVGSVLCGLSQTMTMLAISRALQGVGAGAISVTAYALAAEVVPLRDRG
RYQGVLGAVFGVNTVTGPLLGGWLTDYLSWRWAFWINVPVSIAVLTVAAT
AVPALARPPKPVIDYLGILVIAVATTALIMATSWGGTTYAWGSATIVGLL
IGAAVALGFFVWLEGRAAAAILPPRLFGSPVFAVCCVLSFVVGFAMLGAL
TFVPIYLGYVDGASATASGLRTLPMVIGLLIASTGTGVLVGRTGRYKIFP
VAGMALMAVAFLLMSQMDEWTPPLLQSLYLVVLGAGIGLSMQVLVLIVQN
TSSFEDLGVATSGVTFFRVVGASFGTATFGALFVNFLDRRLGSALTSGAV
PVPAVPSPAVLHQLPQSMAAPIVRAYAESLTQVFLCAVSVTVVGFILALL
LREVPLTDIHDDADDLGDGFGVPRAESPEDVLEIAVRRMLPNGVRLRDIA
TQPGCGLGVAELWALLRIYQYQRLFEAVRLTDIGRHLHVPYQVFEPVFDR
LVQTGYAARDGDILTLTPSGHRQVDSLAVLIRQWLLDHLAVAPGLKRQPD
HQFEAALQHVTDAVLVQRDWYEDLGDLSESRQLAATT
>Rv0492c PROBABLE OXIDOREDUCTASE GMC-TYPE
MSRLADRAKSYPLASFGAALLPPELGGPLPAQFVQRVDRYVTRLPATSRF
AVRAGLASLAAASYLTTGRSLPRLHPDERARVLHRIAALSPEVAAAVEGL
KAIVLLANGADTYAHELLARAQEHDAARPDAELTVILSADSPSVTRADAV
VVGSGAGGAMVARTLARAGLDVVVLEEGRRWTVEEFRSTHPVDRYAGLYR
GAGATVALGRPAVVLPMGRAVGGTTVVNSGTCFRPSLAVQRRWRDEFGLG
LADPDQLGRRLDDAEQTLRVAPVPLEIMGRNGRLLLQAAKSLGWRAAPIP
RNAPGCRGCCQCAIGCPSNAKFGVHLNALPQACAAGARIISWARVERILH
RAGRAYGVRARRPDGTTLDVLADAVVVAAGATETPGLLRRSGLGGHPRLG
HNLALHPATMLAGLFDDDVFAWRGVLQSAAVHEFHESDGVLIEATSTPPG
MGSMVFPGYGAELLRWLDRAPQIATFGAMVADRGVGTVRSVRGETVVRYD
IAPGEIAKLRVALQAIGRLLFAAGAVEVLTGIPGAPPMRSLPELQDVLRR
ANPRSLHLAAFHPTGTAAAGADEQLCPVDATGRLRGVEGVWVADASILPS
CPEVNPQLSIMAMALAVADQTVAKVVGVR
>Rv0948c CONSERVED HYPOTHETICAL PROTEIN
MRPEPPHHENAELAAMNLEMLESQPVPEIDTLREEIDRLDAEILALVKRR
AEVSKAIGKARMASGGTRLVHSREMKVIERYSELGPDGKDLAILLLRLGR
GRLGH
>Rv2328 PE23, PE FAMILY PROTEIN
MQFLSVIPEQVESAAQDLAGIRSALSASYAAAAGPTTAVVSAAEDEVSTA
IASIFGAYGRQCQVLSAQASAFHDEFVNLLKTGATAYRNTEFANAQSNVL
NAVNAPARSLLGHPSAAESVQNSAPTLGGGHSTVTAGLAAQAGRAVATVE
QQAAAAVAPLPSAGAGLAQVVNGVVTAGQGSAAKLATALQSAAPWLAKSG
GEFIVAGQSALTGVALLQPAVVGVVQAGGTFLTAGTSAATGLGLLTLAGV
EFSQGVGNLALASGTAATGLGLLGSAGVQLFSPAFLLAVPTALGGVGSLA
IAVVQLVQGVQHLSLVVPNVVAGIAALQTAGAQFAQGVNHTMLAAQLGAP
GIAVLQTAGGHFAQGIGHLTTAGNAAVTVLIS
>Rv1905c aao, PROBABLE D-AMINO ACID OXIDASE AAO
MAIGEQQVIVIGAGVSGLTSAICLAEAGWPVRVWAAALPQQTTSAVAGAV
WGPRPKEPVAKVRGWIEQSLHVFRDLAKDPATGVRMTPALSVGDRIETGA
MPPGLELIPDVRPADPADVPGGFRAGFHATLPMIDMPQYLDCLTQRLAAT
GCEIETRPLRSLAEAAEAAPIVINCAGLGARELAGDATVWPRFGQHVVLT
NPGLEQLFIERTGGSEWICYFAHPQRVVCGGISIPGRWDPTPEPEITERI
LQRCRRIQPRLAEAAVIETITGLRPDRPSVRVEAEPIGRALCIHNYGHGG
DGVTLSWGCAREVVNLVGGG
>Rv1530 adh, Probable alcohol dehydrogenase adh
MSDGAVVRALVLEAPRRLVVRQYRLPRIGDDDALVRVEACGLCGTDHEQY
TGELAGGFAFVPGHETVGTIAAIGPRAEQRWGVSAGDRVAVEVFQSCRQC
ANCRGGEYRRCVRHGLADMYGFIPVDREPGLWGGYAEYQYLAPDSMVLRV
AGDLSPEVATLFNPLGAGIRWGVTIPETKPGDVVAVLGPGIRGLCAAAAA
KGAGAGFVMVTGLGPRDADRLALAAQFGADLAVDVAIDDPVAALTEQTGG
LADVVVDVTAKAPAAFAQAIALARPAGTVVVAGTRGVGSGAPGFSPDVVV
FKELRVLGALGVDATAYRAALDLLVSGRYPFASLPRRCVRLEGAEDLLAT
MAGERDGVPPIHGVLTP
>Rv2780 ald, SECRETED L-ALANINE DEHYDROGENASE ALD (40 KDA ANTIGEN) (TB43)
MRVGIPTETKNNEFRVAITPAGVAELTRRGHEVLIQAGAGEGSAITDADF
KAAGAQLVGTADQVWADADLLLKVKEPIAAEYGRLRHGQILFTFLHLAAS
RACTDALLDSGTTSIAYETVQTADGALPLLAPMSEVAGRLAAQVGAYHLM
RTQGGRGVLMGGVPGVEPADVVVIGAGTAGYNAARIANGMGATVTVLDIN
IDKLRQLDAEFCGRIHTRYSSAYELEGAVKRADLVIGAVLVPGAKAPKLV
SNSLVAHMKPGAVLVDIAIDQGGCFEGSRPTTYDHPTFAVHDTLFYCVAN
MPASVPKTSTYALTNATMPYVLELADHGWRAACRSNPALAKGLSTHEGAL
LSERVATDLGVPFTEPASVLA
>Rv1538c ansA, Probable L-asparaginase ansA
MGANHVRNDPIMARLTVITTGGTISTTAGPDGVLRPTHCGATLIAGLDMD
SDIEVVDLMALDSSKLTPADWDRIGAAVQEAFRGGADGVVITHGTDTLEE
TALWLDLTYAGSRPVVLTGAMLSADAPGADGPANLRDALAVAADPAARDL
GVLVSFGGRVLQPLGLHKVANPDLCGFAGESLGFTSGGVRLTRTKTRPYL
GDLGAAVAPRVDIVAVYPGSDAVAMDACVAAGARAVVLEALGSGNAGAAV
IEGVRRHCRDGSDPVVIAVSTRVAGARVGAGYGPGHDLVEAGAVMVPRLP
PSQARVLLMAALAANSPVADVIDRWG
>Rv2127 ansP1, Probable L-asparagine permease ansP1
MSAASQRVGAFGEEAGYHKGLKPRQLQMIGIGGAIGTGLFLGAGGRLAKA
GPGLFLVYGVCGVFVFLILRALGELVLHRPSSGSFVSYAREFFGEKAAYA
VGWMYFLHWAMTSIVDTTAIATYLQRWTIFTVVPQWILALIALTVVLSMN
LISVEWFGELEFWAALIKVLALMAFLVVGTVFLAGRYPVDGHSTGLSLWN
NHGGLFPTSWLPLLIVTSGVVFAYSAVELVGTAAGETAEPEKIMPRAINS
VVARIAIFYVGSVALLALLLPYTAYKAGESPFVTFFSKIGFHGAGDLMNI
VVLTAALSSLNAGLYSTGRVMHSIAMSGSAPRFTARMSKSGVPYGGIVLT
AVITLFGVALNAFKPGEAFEIVLNMSALGIIAGWATIVLCQLRLHKLANA
GIMQRPRFRMPFSPYSGYLTLLFLLVVLVTMASDKPIGTWTVATLIIVIP
ALTAGWYLVRKRVMAVARERLGHTGPFPAVANPPVRSRD
>Rv0346c ansP2, POSSIBLE L-ASPARAGINE PERMEASE ANSP2 (L-ASPARAGINE TRANSPORT PROTEIN)
MPPLDITDERLTREDTGYHKGLHSRQLQMIALGGAIGTGLFLGAGGRLAS
AGPGLFLVYGICGIFVFLILRALGELVLHRPSSGSFVSYAREFYGEKVAF
VAGWMYFLNWAMTGIVDTTAIAHYCHYWRAFQPIPQWTLALIALLVVLSM
NLISVRLFGELEFWASLIKVIALVTFLIVGTVFLAGRYKIDGQETGVSLW
SSHGGIVPTGLLPIVLVTSGVVFAYAAIELVGIAAGETAEPAKIMPRAIN
SVVLRIACFYVGSTVLLALLLPYTAYKEHVSPFVTFFSKIGIDAAGSVMN
LVVLTAALSSLNAGLYSTGRILRSMAINGSGPRFTAPMSKTGVPYGGILL
TAGIGLLGIILNAIKPSQAFEIVLHIAATGVIAAWATIVACQLRLHRMAN
AGQLQRPKFRMPLSPFSGYLTLAFLAGVLILMYFDEQHGPWMIAATVIGV
PALIGGWYLVRNRVTAVAHHAIDHTKSVAVVHSADPI
>Rv3170 aofH, PROBABLE FLAVIN-CONTAINING MONOAMINE OXIDASE AOFH (AMINE OXIDASE) (MAO)
MTNPPWTVDVVVVGAGFAGLAAARELTRQGHEVLVFEGRDRVGGRSLTGR
VAGVPADMGGSFIGPTQDAVLALATELGIPTTPTHRDGRNVIQWRGSARS
YRGTIPKLSLTGLIDIGRLRWQFERIARGVPVAAPWDARRARELDDVSLG
EWLRLVRATSSSRNLMAIMTRVTWGCEPDDVSMLHAARYVRAAGGLDRLL
DVKNGAQQDRVPGGTQQIAQAAAAQLGARVLLNAAVRRIDRHGAGVTVTS
DQGQAEAGFVIVAIPPAHRVAIEFDPPLPPEYQQLAHHWPQGRLSKAYAA
YSTPFWRASGYSGQALSDEAPVFITFDVSPHADGPGILMGFVDARGFDSL
PIEERRRDALRCFASLFGDEALDPLDYVDYRWGTEEFAPGGPTAAVPPGS
WTKYGHWLREPVGPIHWASTETADEWTGYFDGAVRSGQRAAAEVAALL
>Rv1001 arcA, PROBABLE ARGININE DEIMINASE ARCA (ADI) (AD) (ARGININE DIHYDROLASE)
MGVELGSNSEVGALRVVILHRPGAELRRLTPRNTDQLLFDGLPWVSRAQD
EHDEFAELLASRGAEVLLLSDLLTEALHHSGAARMQGIAAAVDAPRLGLP
LAQELSAYLRSLDPGRLAHVLTAGMTFNELPSDTRTDVSLVLRMHHGGDF
VIEPLPNLVFTRDSSIWIGPRVVIPSLALRARVREASLTDLIYAHHPRFT
GVRRAYESRTAPVEGGDVLLLAPGVVAVGVGERTTPAGAEALARSLFDDD
LAHTVLAVPIAQQRAQMHLDTVCTMVDTDTMVMYANVVDTLEAFTIQRTP
DGVTIGDAAPFAEAAAKAMGIDKLRVIHTGMDPVVAEREQWDDGNNTLAL
APGVVVAYERNVQTNARLQDAGIEVLTIAGSELGTGRGGPRCMSCPAARD
PL
>Rv1654 argB, Probable Acetylglutamate kinase argB
MSRIEALPTHIKAQVLAEALPWLKQLHGKVVVVKYGGNAMTDDTLRRAFA
ADMAFLRNCGIHPVVVHGGGPQITAMLRRLGIEGDFKGGFRVTTPEVLDV
ARMVLFGQVGRELVNLINAHGPYAVGITGEDAQLFTAVRRSVTVDGVATD
IGLVGDVDQVNTAAMLDLVAAGRIPVVSTLAPDADGVVHNINADTAAAAV
AEALGAEKLLMLTDIDGLYTRWPDRDSLVSEIDTGTLAQLLPTLESGMVP
KVEACLRAVIGGVPSAHIIDGRVTHCVLVELFTDAGTGTKVVRG
>Rv1652 argC, PROBABLE N-ACETYL-GAMMA-GLUTAMYL-PHOSPHATE REDUCTASE ARGC
MQNRQVANATKVAVAGASGYAGGEILRLLLGHPAYADGRLRIGALTAATS
AGSTLGEHHPHLTPLAHRVVEPTEAAVLGGHDAVFLALPHGHSAVLAQQL
SPETLIIDCGADFRLTDAAVWERFYGSSHAGSWPYGLPELPGARDQLRGT
RRIAVPGCYPTAALLALFPALAADLIEPAVTVVAVSGTSGAGRAATTDLL
GAEVIGSARAYNIAGVHRHTPEIAQGLRAVTDRDVSVSFTPVLIPASRGI
LATCTARTRSPLSQLRAAYEKAYHAEPFIYLMPEGQLPRTGAVIGSNAAH
IAVAVDEDAQTFVAIAAIDNLVKGTAGAAVQSMNLALGWPETDGLSVVGV
AP
>Rv1655 argD, Probable Acetylornithine aminotransferase argD
MTGASTTTATMRQRWQAVMMNNYGTPPIALASGDGAVVTDVDGRTYIDLL
GGIAVNVLGHRHPAVIEAVTRQMSTLGHTSNLYATEPGIALAEELVALLG
ADQRTRVFFCNSGAEANEAAFKLSRLTGRTKLVAAHDAFHGRTMGSLALT
GQPAKQTPFAPLPGDVTHVGYGDVDALAAAVDDHTAAVFLEPIMGESGVV
VPPAGYLAAARDITARRGALLVLDEVQTGMGRTGAFFAHQHDGITPDVVT
LAKGLGGGLPIGACLAVGPAAELLTPGLHGSTFGGNPVCAAAALAVLRVL
ASDGLVRRAEVLGKSLRHGIEALGHPLIDHVRGRGLLLGIALTAPHAKDA
EATARDAGYLVNAAAPDVIRLAPPLIIAEAQLDGFVAALPAILDRAVGAP
>Rv1656 argF, Probable Ornithine carbamoyltransferase, anabolic ArgF
MIRHFLRDDDLSPAEQAEVLELAAELKKDPVSRRPLQGPRGVAVIFDKNS
TRTRFSFELGIAQLGGHAVVVDSGSTQLGRDETLQDTAKVLSRYVDAIVW
RTFGQERLDAMASVATVPVINALSDEFHPCQVLADLQTIAERKGALRGLR
LSYFGDGANNMAHSLLLGGVTAGIHVTVAAPEGFLPDPSVRAAAERRAQD
TGASVTVTADAHAAAAGADVLVTDTWTSMGQENDGLDRVKPFRPFQLNSR
LLALADSDAIVLHCLPAHRGDEITDAVMDGPASAVWDEAENRLHAQKALL
VWLLERS
>Rv1658 argG, Probable Argininosuccinate synthase argG
MSERVILAYSGGLDTSVAISWIGKETGREVVAVAIDLGQGGEHMDVIRQR
ALDCGAVEAVVVDARDEFAEGYCLPTVLNNALYMDRYPLVSAISRPLIVK
HLVAAAREHGGGIVAHGCTGKGNDQVRFEVGFASLAPDLEVLAPVRDYAW
TREKAIAFAEENAIPINVTKRSPFSIDQNVWGRAVETGFLEHLWNAPTKD
IYAYTEDPTINWGVPDEVIVGFERGVPVSVDGKPVSMLAAIEELNRRAGA
QGVGRLDVVEDRLVGIKSREIYEAPGAMVLITAHTELEHVTLERELGRFK
RQTDQRWAELVYDGLWYSPLKAALEAFVAKTQEHVSGEVRLVLHGGHIAV
NGRRSAESLYDFNLATYDEGDSFDQSAARGFVYVHGLSSKLAARRDLR
>Rv1659 argH, Probable Argininosuccinate lyase argH
MSTNEGSLWGGRFAGGPSDALAALSKSTHFDWVLAPYDLTASRAHTMVLF
RAGLLTEEQRDGLLAGLDSLAQDVADGSFGPLVTDEDVHAALERGLIDRV
GPDLGGRLRAGRSRNDQVAALFRMWLRDAVRRVATGVLDVVGALAEQAAA
HPSAIMPGKTHLQSAQPILLAHHLLAHAHPLLRDLDRIVDFDKRAAVSPY
GSGALAGSSLGLDPDAIAADLGFSAAADNSVDATAARDFAAEAAFVFAMI
AVDLSRLAEDIIVWSSTEFGYVTLHDSWSTGSSIMPQKKNPDIAELARGK
SGRLIGNLAGLLATLKAQPLAYNRDLQEDKEPVFDSVAQLELLLPAMAGL
VASLTFNVQRMAELAPAGYTLATDLAEWLVRQGVPFRSAHEAAGAAVRAA
EQRGVGLQELTDDELAAISPELTPQVREVLTIEGSVSARDCRGGTAPGRV
AEQLNAIGEAAERLRRQLVR
>Rv1653 argJ, Probable Glutamate n-acetyltransferase argJ
MTDLAGTTRLLRAQGVTAPAGFRAAGVAAGIKASGALDLALVFNEGPDYA
AAGVFTRNQVKAAPVLWTQQVLTTGRLRAVILNSGGANACTGPAGFADTH
ATAEAVAAALSDWGTETGAIEVAVCSTGLIGDRLPMDKLLAGVAHVVHEM
HGGLVGGDEAAHAIMTTDNVPKQVALHHHDNWTVGGMAKGAGMLAPSLAT
MLCVLTTDAAAEPAALERALRRAAAATFDRLDIDGSCSTNDTVLLLSSGA
SEIPPAQADLDEAVLRVCDDLCAQLQADAEGVTKRVTVTVTGAATEDDAL
VAARQIARDSLVKTALFGSDPNWGRVLAAVGMAPITLDPDRISVSFNGAA
VCVHGVGAPGAREVDLSDADIDITVDLGVGDGQARIRTTDLSHAYVEENS
AYSS
>Rv3227 aroA, 3-PHOSPHOSHIKIMATE 1-CARBOXYVINYLTRANSFERASE AROA (5-ENOLPYRUVYLSHIKIMATE-3-PHOSPHATE SYNTHASE) (EPSP SYNTHASE) (EPSPS)
MKTWPAPTAPTPVRATVTVPGSKSQTNRALVLAALAAAQGRGASTISGAL
RSRDTELMLDALQTLGLRVDGVGSELTVSGRIEPGPGARVDCGLAGTVLR
FVPPLAALGSVPVTFDGDQQARGRPIAPLLDALRELGVAVDGTGLPFRVR
GNGSLAGGTVAIDASASSQFVSGLLLSAASFTDGLTVQHTGSSLPSAPHI
AMTAAMLRQAGVDIDDSTPNRWQVRPGPVAARRWDIEPDLTNAVAFLSAA
VVSGGTVRITGWPRVSVQPADHILAILRQLNAVVIHADSSLEVRGPTGYD
GFDVDLRAVGELTPSVAALAALASPGSVSRLSGIAHLRGHETDRLAALST
EINRLGGTCRETPDGLVITATPLRPGIWRAYADHRMAMAGAIIGLRVAGV
EVDDIAATTKTLPEFPRLWAEMVGPGQGWGYPQPRSGQRARRATGQGSGG
>Rv2538c aroB, 3-DEHYDROQUINATE SYNTHASE AROB
MTDIGAPVTVQVAVDPPYPVVIGTGLLDELEDLLADRHKVAVVHQPGLAE
TAEEIRKRLAGKGVDAHRIEIPDAEAGKDLPVVGFIWEVLGRIGIGRKDA
LVSLGGGAATDVAGFAAATWLRGVSIVHLPTTLLGMVDAAVGGKTGINTD
AGKNLVGAFHQPLAVLVDLATLQTLPRDEMICGMAEVVKAGFIADPVILD
LIEADPQAALDPAGDVLPELIRRAITVKAEVVAADEKESELREILNYGHT
LGHAIERRERYRWRHGAAVSVGLVFAAELARLAGRLDDATAQRHRTILSS
LGLPVSYDPDALPQLLEIMAGDKKTRAGVLRFVVLDGLAKPGRMVGPDPG
LLVTAYAGVCAP
>Rv2537c aroD, 3-DEHYDROQUINATE DEHYDRATASE AROD (AROQ) (3-DEHYDROQUINASE) (TYPE II DHQASE)
MSELIVNVINGPNLGRLGRREPAVYGGTTHDELVALIEREAAELGLKAVV
RQSDSEAQLLDWIHQAADAAEPVILNAGGLTHTSVALRDACAELSAPLIE
VHISNVHAREEFRRHSYLSPIATGVIVGLGIQGYLLALRYLAEHVGT
>Rv2552c aroE, PROBABLE SHIKIMATE 5-DEHYDROGENASE AROE (5-DEHYDROSHIKIMATE REDUCTASE)
MSEGPKKAGVLGSPIAHSRSPQLHLAAYRALGLHDWTYERIECGAAELPV
VVGGFGPEWVGVSVTMPGKFAALRFADERTARADLVGSANTLVRTPHGWR
ADNTDIDGVAGALGAAAGHALVLGSGGTAPAAVVGLAELGVTDITVVARN
SDKAARLVDLGTRVGVATRFCAFDSGGLADAVAAAEVLVSTIPAEVAAGY
AGTLAAIPVLLDAIYDPWPTPLAAAVGSAGGRVISGLQMLLHQAFAQVEQ
FTGLPAPREAMTCALAALD
>Rv2540c aroF, PROBABLE CHORISMATE SYNTHASE AROF (5-ENOLPYRUVYLSHIKIMATE-3-PHOSPHATE PHOSPHOLYASE)
MLRWITAGESHGRALVAVVEGMVAGVHVTSADIADQLARRRLGYGRGARM
TFERDAVTVLSGIRHGSTLGGPIAIEIGNTEWPKWETVMAADPVDPAELA
DVARNAPLTRPRPGHADYAGMLKYGFDDARPVLERASARETAARVAAGTV
ARAFLRQALGVEVLSHVISIGASAPYEGPPPRAEDLPAIDASPVRAYDKA
AEADMIAQIEAAKKDGDTLGGVVEAVALGLPVGLGSFTSGDHRLDSQLAA
AVMGIQAIKGVEIGDGFQTARRRGSRAHDEMYPGPDGVVRSTNRAGGLEG
GMTNGQPLRVRAAMKPISTVPRALATVDLATGDEAVAIHQRSDVCAVPAA
GVVVETMVALVLARAALEKFGGDSLAETQRNIAAYQRSVADREAPAARVS
G
>Rv2178c aroG, Probable 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase AroG (DAHP synthetase, phenylalanine-repressible)
MNWTVDIPIDQLPSLPPLPTDLRTRLDAALAKPAAQQPTWPADQALAMRT
VLESVPPVTVPSEIVRLQEQLAQVAKGEAFLLQGGDCAETFMDNTEPHIR
GNVRALLQMAVVLTYGASMPVVKVARIAGQYAKPRSADIDALGLRSYRGD
MINGFAPDAAAREHDPSRLVRAYANASAAMNLVRALTSSGLASLHLVHDW
NREFVRTSPAGARYEALATEIDRGLRFMSACGVADRNLQTAEIYASHEAL
VLDYERAMLRLSDGDDGEPQLFDLSAHTVWIGERTRQIDGAHIAFAQVIA
NPVGVKLGPNMTPELAVEYVERLDPHNKPGRLTLVSRMGNHKVRDLLPPI
VEKVQATGHQVIWQCDPMHGNTHESSTGFKTRHFDRIVDEVQGFFEVHRA
LGTHPGGIHVEITGENVTECLGGAQDISETDLAGRYETACDPRLNTQQSL
ELAFLVAEMLRD
>Rv2539c aroK, SHIKIMATE KINASE AROK (SK)
MAPKAVLVGLPGSGKSTIGRRLAKALGVGLLDTDVAIEQRTGRSIADIFA
TDGEQEFRRIEEDVVRAALADHDGVLSLGGGAVTSPGVRAALAGHTVVYL
EISAAEGVRRTGGNTVRPLLAGPDRAEKYRALMAKRAPLYRRVATMRVDT
NRRNPGAVVRHILSRLQVPSPSEAAT
>Rv3708c asd, ASPARTATE-SEMIALDEHYDE DEHYDROGENASE ASD (ASA DEHYDROGENASE) (ASADH) (ASPARTIC SEMIALDEHYDE DEHYDROGENASE) (L-ASPARTATE-BETA-SEMIALDEHYDE DEHYDROGENAS
MGLSIGIVGATGQVGQVMRTLLDERDFPASAVRFFASARSQGRKLAFRGQ
EIEVEDAETADPSGLDIALFSAGSAMSKVQAPRFAAAGVTVIDNSSAWRK
DPDVPLVVSEVNFERDAHRRPKGIIANPNCTTMAAMPVLKVLHDEARLVR
LVVSSYQAVSGSGLAGVAELAEQARAVIGGAEQLVYDGGALEFPPPNTYV
APIAFNVVPLAGSLVDDGSGETDEDQKLRFESRKILGIPDLLVSGTCVRV
PVFTGHSLSINAEFAQPLSPERARELLDGATGVQLVDVPTPLAAAGVDES
LVGRIRRDPGVPDGRGLALFVSGDNLRKGAALNTIQIAELLTADL
>Rv3709c ask, ASPARTOKINASE ASK (ASPARTATE KINASE) [CONTAINS: ASPARTOKINASE ALPHA SUBUNIT (ASK-ALPHA); AND ASPARTOKINASE BETA SUBUNIT (ASK-BETA)]
MALVVQKYGGSSVADAERIRRVAERIVATKKQGNDVVVVVSAMGDTTDDL
LDLAQQVCPAPPPRELDMLLTAGERISNALVAMAIESLGAHARSFTGSQA
GVITTGTHGNAKIIDVTPGRLQTALEEGRVVLVAGFQGVSQDTKDVTTLG
RGGSDTTAVAMAAALGADVCEIYTDVDGIFSADPRIVRNARKLDTVTFEE
MLEMAACGAKVLMLRCVEYARRHNIPVHVRSSYSDRPGTVVVGSIKDVPM
EDPILTGVAHDRSEAKVTIVGLPDIPGYAAKVFRAVADADVNIDMVLQNV
SKVEDGKTDITFTCSRDVGPAAVEKLDSLRNEIGFSQLLYDDHIGKVSLI
GAGMRSHPGVTATFCEALAAVGVNIELISTSEIRISVLCRDTELDKAVVA
LHEAFGLGGDEEATVYAGTGR
>Rv2201 asnB, Probable asparagine synthetase AsnB
MCGLLAFVAAPAGAAGPEGADAASAIARASHLMRHRGPDESGTWHAVDGA
SGGVVFGFNRLSIIDIAHSHQPLRWGPPEAPDRYVLVFNGEIYNYLELRD
ELRTQHGAVFATDGDGEAILAGYHHWGTEVLQRLRGMFAFALWDTVTREL
FCARDPFGIKPLFIATGAGGTAVASEKKCLLDLVELVGFDTEIDHRALQH
YTVLQYVPEPETLHRGVRRLESGCFARIRADQLAPVITRYFVPRFAASPI
TNDNDQARYDEITAVLEDSVAKHMRADVTVGAFLSGGIDSTAIAALAIRH
NPRLITFTTGFEREGFSEIDVAVASAEAIGARHIAKVVSADEFVAALPEI
VWYLDEPVADPALVPLFFVAREARKHVKVVLSGEGADELFGGYTIYREPL
SLRPFDYLPKPLRRSMGKVSKPLPEGMRGKSLLHRGSLTLEERYYGNARS
FSGAQLREVLPGFRPDWTHTDVTAPVYAESAGWDPVARMQHIDLFTWLRG
DILVKADKITMANSLELRVPFLDPEVFAVASRLPAGAKITRTTTKYALRR
ALEPIVPAHVLHRPKLGFPVPIRHWLRAGELLEWAYATVGSSQAGHLVDI
AAVYRMLDEHRCGSSDHSRRLWTMLIFMLWHAIFVEHSVVPQISEPQYPV
QL
>Rv3565 aspB, POSSIBLE ASPARTATE AMINOTRANSFERASE ASPB (TRANSAMINASE A) (ASPAT) (GLUTAMIC--OXALOACETIC TRANSAMINASE) (GLUTAMIC--ASPARTIC TRANSAMINASE)
MTDRVALRAGVPPFYVMDVWLAAAERQRTHGDLVNLSAGQPSAGAPEPVR
AAAAAALHLNQLGYSVALGIPELRDAIAADYQRRHGITVEPDAVVITTGS
SGGFLLAFLACFDAGDRVAMASPGYPCYRNILSALGCEVVEIPCGPQTRF
QPTAQMLAEIDPPLRGVVVASPANPTGTVIPPEELAAIASWCDASDVRLI
SDEVYHGLVYQGAPQTSCAWQTSRNAVVVNSFSKYYAMTGWRLGWLLVPT
VLRRAVDCLTGNFTICPPVLSQIAAVSAFTPEATAEADGNLASYAINRSL
LLDGLRRIGIDRLAPTDGAFYVYADVSDFTSDSLAFCSKLLADTGVAIAP
GIDFDTARGGSFVRISFAGPSGDIEEALRRIGSWLPSQ
>Rv0337c aspC, PROBABLE ASPARTATE AMINOTRANSFERASE ASPC (TRANSAMINASE A) (ASPAT)
MDNDGTIVDVTTHQLPWHTASHQRQRAFAQSAKLQDVLYEIRGPVHQHAA
RLEAEGHRILKLNIGNPAPFGFEAPDVIMRDIIQALPYAQGYSDSQGILS
ARRAVVTRYELVPGFPRFDVDDVYLGNGVSELITMTLQALLDNGDQVLIP
SPDYPLWTASTSLAGGTPVHYLCDETQGWQPDIADLESKITERTKALVVI
NPNNPTGAVYSCEILTQMVDLARKHQLLLLADEIYDKILYDDAKHISLAS
IAPDMLCLTFNGLSKAYRVAGYRAGWLAITGPKEHASSFIEGIGLLANMR
LCPNVPAQHAIQVALGGHQSIEDLVLPGGRLLEQRDIAWTKLNEIPGVSC
VKPAGALYAFPRLDPEVYDIDDDEQLVLDLLLSEKILVTQGTGFNWPAPD
HLRLVTLPWSRDLAAAIERLGNFLVSYRQ
>Rv3568c bphC, PROBABLE BIPHENYL-2,3-DIOL 1,2-DIOXYGENASE BPHC (23OHBP OXYGENASE) (2,3-DIHYDROXYBIPHENYL DIOXYGENASE) (2,3-DIHYDROXYBIPHENYL 1,2-DIOXYGENASE) (DHBD)
MSIRSLGYLRIEATDMAAWREYGLKVLGMVEGKGAPEGALYLRMDDFPAR
LVVVPGEHDRLLEAGWECANAEGLQEIRNRLDLEGTPYKEATAAELADRR
VDEMIRFADPSGNCLEVFHGTALEHRRVVSPYGHRFVTGEQGMGHVVLST
RDDAEALHFYRDVLGFRLRDSMRLPPQMVGRPADGPPAWLRFFGCNPRHH
SLAFLPMPTSSGIVHLMVEVEQADDVGLCLDRALRRKVPMSATLGRHVND
LMLSFYMKTPGGFDIEFGCEGRQVDDRDWIARESTAVSLWGHDFTVGARG
>Rv2641 cadI, CADMIUM INDUCIBLE PROTEIN CADI
MSRVQLALNVDDLEAAITFYSRLFNAEPAKRKPGYANFAIADPPLKLVLL
ENPGTGGTLNHLGVEVGSSNTVHAEIARLTEAGLVTEKEIGTTCCFATQD
KVWVTGPGGERWEVYTVLADSETFGSGPRHNDTSDGEASMCCDGQVAVGA
SG
>Rv1383 carA, PROBABLE CARBAMOYL-PHOSPHATE SYNTHASE SMALL CHAIN CARA (Carbamoyl-phosphate synthetase glutamine chain)
MSKAVLVLEDGRVFTGRPFGATGQALGEAVFSTGMSGYQETLTDPSYHRQ
IVVATAPQIGNTGWNGEDSESRGERIWVAGYAVRDPSPRASNWRATGTLE
DELIRQRIVGIAGIDTRAVVRHLRSRGSMKAGVFSDGALAEPADLIARVR
AQQSMLGADLAGEVSTAEPYVVEPDGPPGVSRFTVAALDLGIKTNTPRNF
ARRGIRCHVLPASTTFEQIAELNPHGVFLSNGPGDPATADHVVALTREVL
GAGIPLFGICFGNQILGRALGLSTYKMVFGHRGINIPVVDHATGRVAVTA
QNHGFALQGEAGQSFATPFGPAVVSHTCANDGVVEGVKLVDGRAFSVQYH
PEAAAGPHDAEYLFDQFVELMAGEGR
>Rv1384 carB, PROBABLE CARBAMOYL-PHOSPHATE SYNTHASE LARGE CHAIN CARB (Carbamoyl-phosphate synthetase ammonia chain)
MPRRTDLHHVLVIGSGPIVIGQACEFDYSGTQACRVLRAEGLQVSLVNSN
PATIMTDPEFADHTYVEPITPAFVERVIAQQAERGNKIDALLATLGGQTA
LNTAVALYESGVLEKYGVELIGADFDAIQRGEDRQRFKDIVAKAGGESAR
SRVCFTMAEVRETVAELGLPVVVRPSFTMGGLGSGIAYSTDEVDRMAGAG
LAASPSANVLIEESIYGWKEFELELMRDGHDNVVVVCSIENVDPMGVHTG
DSVTVAPAMTLTDREYQRMRDLGIAILREVGVDTGGCNIQFAVNPRDGRL
IVIEMNPRVSRSSALASKATGFPIAKIAAKLAIGYTLDEIVNDITGETPA
CFEPTLDYVVVKAPRFAFEKFPGADPTLTTTMKSVGEAMSLGRNFVEALG
KVMRSLETTRAGFWTAPDPDGGIEEALTRLRTPAEGRLYDIELALRLGAT
VERVAEASGVDPWFIAQINELVNLRNELVAAPVLNAELLRRAKHSGLSDH
QIASLRPELAGEAGVRSLRVRLGIHPVYKTVDTCAAEFEAQTPYHYSSYE
LDPAAETEVAPQTERPKVLILGSGPNRIGQGIEFDYSCVHAATTLSQAGF
ETVMVNCNPETVSTDYDTADRLYFEPLTFEDVLEVYHAEMESGSGGPGVA
GVIVQLGGQTPLGLAHRLADAGVPIVGTPPEAIDLAEDRGAFGDLLSAAG
LPAPKYGTATTFAQARRIAEEIGYPVLVRPSYVLGGRGMEIVYDEETLQG
YITRATQLSPEHPVLVDRFLEDAVEIDVDALCDGAEVYIGGIMEHIEEAG
IHSGDSACALPPVTLGRSDIAKVRKATEAIAHGIGVVGLLNVQYALKDDV
LYVLEANPRASRTVPFVSKATAVPLAKACARIMLGATIAQLRAEGLLAVT
GDGAHAARNAPIAVKEAVLPFHRFRRADGAAIDSLLGPEMKSTGEVMGID
RDFGSAFAKSQTAAYGSLPAQGTVFVSVANRDKRSLVFPVKRLADLGFRV
LATEGTAEMLRRNGIPCDDVRKHFEPAQPGRPTMSAVDAIRAGEVNMVIN
TPYGNSGPRIDGYEIRSAAVAGNIPCITTVQGASAAVQGIEAGIRGDIGV
RSLQELHRVIGGVER
>Rv3409c choD, PROBABLE CHOLESTEROL OXIDASE PRECURSOR CHOD (CHOLESTEROL-O2 OXIDOREDUCTASE)
MKPDYDVLIIGSGFGGSVTALRLTEKGYRVGVLEAGRRFSDEEFAKTSWD
LRKFLWAPRLGCYGIQRIHPLRNVMILAGAGVGGGSLNYANTLYVPPEPF
FADQQWSHITDWRGELMPHYQQAQRMLGVVQNPTFTDADRIVKEVADEMG
FGDTWVPTPVGVFFGPDGTKTPGKTVPDPYFGGAGPARTGCLECGCCMTG
CRHGAKNTLVKNYLGLAESAGAQVIPMTTVKGFERRSDGLWEVRTVRTGS
WLRRDRRTFTATQLVLAAGTWGTQHLLFKMRDRGRLPGLSKRLGVLTRTN
SESIVGAATLKVNPDLDLTHGVAITSSIHPTADTHIEPVRYGKGSNAMGL
LQTLMTDGSGPQGTDVPRWRQLLQTASQDPRGTIRMLNPRQWSERTVIAL
VMQHLDNSITTFTKRGKLGIRWYSSKQGHGEPNPTWIPIGNQVTRRIAAK
IDGVAGGTWGELFNIPLTAHFLGGAVIGDDPEHGVIDPYHRVYGYPTLYV
VDGAAISANLGVNPSLSIAAQAERAASLWPNKGETDRRPPQGEPYRRLAP
IQPAHPVVPADAPGALRWLPIDPVSNAG
>Rv2231c cobC, Possible aminotransferase CobC
MLWILGPHTGPLLFDAVASLDTSPLAAARYHGDQDVAPGVLDFAVNVRHD
RPPEWLVRQLAALLPELARYPSTDDVHRAQDAVAERHGRTRDEVLPLVGA
AEGFALLHNLSPVRAAIVVPAFTEPAIALSAAGITAHHVVLKPPFVLDTA
HVPDDADLVVVGNPTNPTSVLHLREQLLELRRPGRILVVDEAFADWVPGE
PQSLADDSLPDVLVLRSLTKTWSLAGLRVGYALGSPDVLARLTVQRAHWP
LGTLQLTAIAACCAPRAVAAAAADAVRLTALRAEMVAGLRSVGAEVVDGA
APFVLFNIADADGLRNYLQSKGIAVRRGDTFVGLDARYLRAAVRPEWPVL
VAAIAEWAKRGGRR
>Rv1464 csd, PROBABLE CYSTEINE DESULFURASE CSD
MTASVNSLDLAAIRADFPILKRIMRGGNPLAYLDSGATSQRPLQVLDAER
EFLTASNGAVHRGAHQLMEEATDAYEQGRADIALFVGADTDELVFTKNAT
EALNLVSYVLGDSRFERAVGPGDVIVTTELEHHANLIPWQELARRTGATL
RWYGVTDDGRIDLDSLYLDDRVKVVAFTHHSNVTGVLTPVSELVSRAHQS
GALTVLDACQSVPHQPVDLHELGVDFAAFSGHKMLGPNGIGVLYGRRELL
AQMPPFLTGGSMIETVTMEGATYAPAPQRFEAGTPMTSQVVGLAAAARYL
GAIGMAAVEAHERELVAAAIEGLSGIDGVRILGPTSMRDRGSPVAFVVEG
VHAHDVGQVLDDGGVAVRVGHHCALPLHRRFGLAATARASFAVYNTADEV
DRLVAGVRRSRHFFGRA
>Rv1704c cycA, PROBABLE D-SERINE/ALANINE/GLYCINE TRANSPORTER PROTEIN CYCA
MPDDIAAADPTDTQPHLRRDLANRHIQLIAIGGAIGTGLFMGSGRTISLA
GPAVMVVYGIIGFFVFFVLRAMGELLLSNLNYKSFVDFAADLRGPAAGFF
VGWSYWFAWVVTGIADLVAITGYARFWWPGLPIWVPALVTVALILAVNLF
SVRHFGELEFWFALIKVAAIVCLIAVGAILVATNFVSPHGVHATIENLWN
DNGFFPTGFLGVVSGFQIAFFAYIGVELVGTAAAETADPRRTLPRAINAV
PLRVAVFYIGALLAILAVVPWRQFASGESPFVTMFSLAGLAAAASVVNFV
VVTAAASSANSGFFSTGRMLFGLADEGHAPAAFHQLNRGGVPAPALLLTA
PLLLTSIPLLYAGRSVIGAFTLVTTVSSLLFMFVWAMIIISYLVYRRRHP
QRHTDSVYKMPGGVVMCWAVLVFFAFVIWTLTTETETATALAWFPLWFVL
LAVGWLVTQRRQSRRSFGFHCQVVGVRQQLGRGMARLAMKIHARPKLRSA
VVVEPVSAGEPGARRSAKSVRKLASDDSQSAHCPVAVVGLADGGRDPQYH
HDGPDR
>Rv1285 cysD, PROBABLE SULFATE ADENYLYLTRANSFERASE SUBUNIT 2 CYSD
MAITINMVNPTGFIRYEDVEQEAMTSDVTVGPAPGQYQLSHLRLLEAEAI
HVIREVAAEFERPVLLFSGGKDSIVMLHLALKAFRPGRLPFPVMHVDTGH
NFDEVIATRDELVAAAGVRLVVASVQDDIDAGRVVETIPSRNPIQTVTLL
RAIRENQFDAAFGGARRDEEKARAKERVFSFRDEFGQWDPKAQRPELWNL
YNGRHHKGEHIRVFPLSNWTEFDIWSYIGAEQVRLPSIYFAHRRKVFQRD
GMLLAVHRHMQPRADEPVFEATVRFRTVGDVTCTGCVESSASTVAEVIAE
TAVARLTERGATRADDRISEAGMEDRKRQGYF
>Rv2335 cysE, PROBABLE SERINE ACETYLTRANSFERASE CYSE (SAT)
MLTAMRGDIRAARERDPAAPTALEVIFCYPGVHAVWGHRLAHWLWQRGAR
LLARAAAEFTRILTGVDIHPGAVIGARVFIDHATGVVIGETAEVGDDVTI
YHGVTLGGSGMVGGKRHPTVGDRVIIGAGAKVLGPIKIGEDSRIGANAVV
VKPVPPSAVVVGVPGQVIGQSQPSPGGPFDWRLPDLVGASLDSLLTRVAR
LEALGGGPQAAGVIRPPEAGIWHGEDFSI
>Rv2392 cysH, PROBABLE 3'-PHOSPHOADENOSINE 5'-PHOSPHOSULFATE REDUCTASE CYSH (PAPS REDUCTASE, THIOREDOXIN DEP.) (PADOPS REDUCTASE) (3'-PHOSPHOADENYLYLSULFATE REDUCTA
MSGETTRLTEPQLRELAARGAAELDGATATDMLRWTDETFGDIGGAGGGV
SGHRGWTTCNYVVASNMADAVLVDLAAKVRPGVPVIFLDTGYHFVETIGT
RDAIESVYDVRVLNVTPEHTVAEQDELLGKDLFARNPHECCRLRKVVPLG
KTLRGYSAWVTGLRRVDAPTRANAPLVSFDETFKLVKVNPLAAWTDQDVQ
EYIADNDVLVNPLVREGYPSIGCAPCTAKPAEGADPRSGRWQGLAKTECG
LHAS
>Rv2334 cysK1, PROBABLE CYSTEINE SYNTHASE A CYSK1 (O-ACETYLSERINE SULFHYDRYLASE A) (O-ACETYLSERINE (THIOL)-LYASE A) (CSASE A)
MSIAEDITQLIGRTPLVRLRRVTDGAVADIVAKLEFFNPANSVKDRIGVA
MLQAAEQAGLIKPDTIILEPTSGNTGIALAMVCAARGYRCVLTMPETMSL
ERRMLLRAYGAELILTPGADGMSGAIAKAEELAKTDQRYFVPQQFENPAN
PAIHRVTTAEEVWRDTDGKVDIVVAGVGTGGTITGVAQVIKERKPSARFV
AVEPAASPVLSGGQKGPHPIQGIGAGFVPPVLDQDLVDEIITVGNEDALN
VARRLAREEGLLVGISSGAATVAALQVARRPENAGKLIVVVLPDFGERYL
STPLFADVAD
>Rv0848 cysK2, POSSIBLE CYSTEINE SYNTHASE A CYSK2 (O-ACETYLSERINE SULFHYDRYLASE) (O-ACETYLSERINE (THIOL)-LYASE) (CSASE)
MRSRQTRDRYRLLPEGYQVTPGRNRHPGTMVGNTPVLWIPELSGTSDPDR
GFWAKLEGFNPGGMKDRPALYMVECARARGDIAPGAAIVESTGGTLGLGL
ALAGKVYRHPVTLVTDPGLEPIIARMLTAYGAGVDMVTQPHPVGGWQQAR
KDRVAQLMAEYPGAWNPNQYGNPDNVGAYRSLALELVAQLGRIDVLVCSV
GTGGHSAGVARVLREFNPDMRLIGVDTIGSTIFGQPASNRLMRGLGSSIY
PRNVDYRAFDEVHWVAPPEAVWACRSLAATHYASGGWSVGAVALVAGWAA
RNLPADTTIAAVFPDGPQRYFDTIYNDAYCNEHELLGGQPPTEPDEIASP
LDAVVTRWTRSTTVIDPTQVVS
>Rv1336 cysM, PROBABLE CYSTEINE SYNTHASE B CYSM (CSASE B) (O-acetylserine sulfhydrylase B) (O-acetylserine (Thiol)-lyase B)
MTRYDSLLQALGNTPLVGLQRLSPRWDDGRDGPHVRLWAKLEDRNPTGSI
KDRPAVRMIEQAEADGLLRPGATILEPTSGNTGISLAMAARLKGYRLICV
MPENTSVERRQLLELYGAQIIFSAAEGGSNTAVATAKELAATNPSWVMLY
QYGNPANTDSHYCGTGPELLADLPEITHFVAGLGTTGTLMGTGRFLREHV
ANVKIVAAEPRYGEGVYALRNMDEGFVPELYDPEILTARYSVGAVDAVRR
TRELVHTEGIFAGISTGAVLHAALGVGAGALAAGERADIALVVADAGWKY
LSTGAYAGSLDDAETALEGQLWA
>Rv2753c dapA, PROBABLE DIHYDRODIPICOLINATE SYNTHASE DAPA (DHDPS) (DIHYDRODIPICOLINATE SYNTHETASE)
MTTVGFDVAARLGTLLTAMVTPFSGDGSLDTATAARLANHLVDQGCDGLV
VSGTTGESPTTTDGEKIELLRAVLEAVGDRARVIAGAGTYDTAHSIRLAK
ACAAEGAHGLLVVTPYYSKPPQRGLQAHFTAVADATELPMLLYDIPGRSA
VPIEPDTIRALASHPNIVGVKDAKADLHSGAQIMADTGLAYYSGDDALNL
PWLAMGATGFISVIAHLAAGQLRELLSAFGSGDIATARKINIAVAPLCNA
MSRLGGVTLSKAGLRLQGIDVGDPRLPQVAATPEQIDALAADMRAASVLR
>Rv2773c dapB, DIHYDRODIPICOLINATE REDUCTASE DAPB (DHPR)
MRVGVLGAKGKVGATMVRAVAAADDLTLSAELDAGDPLSLLTDGNTEVVI
DFTHPDVVMGNLEFLIDNGIHAVVGTTGFTAERFQQVESWLVAKPNTSVL
IAPNFAIGAVLSMHFAKQAARFFDSAEVIELHHPHKADAPSGTAARTAKL
IAEARKGLPPNPDATSTSLPGARGADVDGIPVHAVRLAGLVAHQEVLFGT
EGETLTIRHDSLDRTSFVPGVLLAVRRIAERPGLTVGLEPLLDLH
>Rv1202 dapE, PROBABLE SUCCINYL-DIAMINOPIMELATE DESUCCINYLASE DAPE
MLDLRGDPIELTAALIDIPSESRKEARIADEVEAALRAQASGFEIIRNGN
AVLARTKLNRSSRVLLAGHLDTVPVAGNLPSRRENDQLHGCGAADMKSGD
AVFLHLAATLAEPTHDLTLVFYDCEEIDSAANGLGRIQRELPDWLSADVA
ILGEPTAGCIEAGCQGTLRVVLSVTGTRAHSARSWLGDNAIHKLGAVLDR
LAVYRARSVDIDGCTYREGLSAVRVAGGVAGNVIPDAASVTINYRFAPDR
SVAAALQHVHDVFDGLDVQIEQTDAAAGALPGLSEPAAKALVEAAGGQVR
AKYGWTDVSRFAALGIPAVNYGPGDPNLAHCRDERVPVGNITAAVDLLRR
YLGG
>Rv2726c dapF, PROBABLE DIAMINOPIMELATE EPIMERASE DAPF (DAP EPIMERASE)
MIFAKGHGTQNDFVLLPDVDAELVLTAARVAALCDRRKGLGADGVLRVTT
AGAAQAVGVLDSLPEGVRVTDWYMDYRNADGSAAQMCGNGVRVFAHYLRA
SGLEVRDEFVVGSLAGPRPVTCHHVEAAYADVSVDMGKANRLGAGEAVVG
GRRFHGLAVDVGNPHLACVDSQLTVDGLAALDVGAPVSFDGAQFPDGVNV
EVLTAPVDGAVWMRVHERGVGETRSCGTGTVAAAVAALAAVGSPTGTLTV
HVPGGEVVVTVTDATSFLRGPSVLVARGDLADDWWNAMG
>Rv3666c dppA, PROBABLE PERIPLASMIC DIPEPTIDE-BINDING LIPOPROTEIN DPPA
MVRQMRAALAALATGLLVLAPVAGCGGGVLSPDVVLVNGGEPPNPLIPTG
TNDSNGGRIIDRLFAGLMSYDAVGKPSLEVAQSIESADNVNYRITVKPGW
KFTDGSPVTAHSFVDAWNYGALSTNAQLQQHFFSPIEGFDDVAGAPGDKS
RTTMSGLRVVNDLEFTVRLKAPTIDFTLRLGHSSFYPLPDSAFRDMAAFG
RNPIGNGPYKLADGPAGPAWEHNVRIDLVPNPDYHGNRKPRNKGLRFEFY
ANLDTAYADLLSGNLDVLDTIPPSALTVYQRDLGDHATSGPAAINQTLDT
PLRLPHFGGEEGRLRRLALSAAINRPQICQQIFAGTRSPARDFTARSLPG
FDPNLPGNEVLDYDPQRARRLWAQADAISPWSGRYAIAYNADAGHRDWVD
AVANSIKNVLGIDAVAAPQPTFAGFRTQITNRAIDSAFRAGWRGDYPSMI
EFLAPLFTAGAGSNDVGYINPEFDAALAAAEAAPTLTESHELVNDAQRIL
FHDMPVVPLWDYISVVGWSSQVSNVTVTWNGLPDYENIVKA
>Rv3665c dppB, PROBABLE DIPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER DPPB
MGWYVARRVAVMVPVFLGATLLIYGMVFLLPGDPVAALAGDRPLTPAVAA
QLRSHYHLDDPFLVQYLRYLGGILHGDLGRAYSGLPVSAVLAHAFPVTIR
LALIALAVEAVLGIGFGVIAGLRQGGIFDSAVLVTGLVIIAIPIFVLGFL
AQFLFGVQLEIAPVTVGERASVGRLLLPGIVLGAMSFAYVVRLTRSAVAA
NAHADYVRTATAKGLSRPRVVTVHILRNSLIPVVTFLGADLGALMGGAIV
TEGIFNIHGVGGVLYQAVTRQETPTVVSIVTVLVLIYLITNLLVDLLYAA
LDPRIRYG
>Rv3664c dppC, PROBABLE DIPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER DPPC
MIAAALILLILVVAAFPSLFTAADPTYADPSQSMLAPSAAHWFGTDLQGH
DIYSRTVYGARASVTVGLGATLAVFVVGGALGALAGFYGSWIDAVVSRVT
DVFLGLPLLLAAIVLMQVMHHRTVWTVIAILALFGWPQVARIARGAVLEV
RASDYVLAAKALGLNRFQILLRHALPNAVGPVIAVATVALGIFIVTEATL
SYLGVGLPTSVVSWGGDINVAQTRLRSGSPILFYPAGALAITVLAFMMMG
DALRDALDPASRAWRA
>Rv2846c efpA, POSSIBLE INTEGRAL MEMBRANE EFFLUX PROTEIN EFPA
MTALNDTERAVRNWTAGRPHRPAPMRPPRSEETASERPSRYYPTWLPSRS
FIAAVIAIGGMQLLATMDSTVAIVALPKIQNELSLSDAGRSWVITAYVLT
FGGLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEATLVIARLS
QGVGSAIASPTGLALVATTFPKGPARNAATAVFAAMTAIGSVMGLVVGGA
LTEVSWRWAFLVNVPIGLVMIYLARTALRETNKERMKLDATGAILATLAC
TAAVFAFSIGPEKGWMSGITIGSGLVALAAAVAFVIVERTAENPVVPFHL
FRDRNRLVTFSAILLAGGVMFSLTVCIGLYVQDILGYSALRAGVGFIPFV
IAMGIGLGVSSQLVSRFSPRVLTIGGGYLLFGAMLYGSFFMHRGVPYFPN
LVMPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAIALMLQSLGGPLV
LAVIQAVITSRTLYLGGTTGPVKFMNDVQLAALDHAYTYGLLWVAGAAII
VGGMALFIGYTPQQVAHAQEVKEAIDAGEL
>Rv0783c emrB, POSSIBLE MULTIDRUG RESISTANCE INTEGRAL MEMBRANE EFFLUX PROTEIN EMRB
MLGNAMVEACPAEGDAPVPITPAGRPRSGQRSYPDRLDVGLLRTAGVCVL
ASVMAHVDVTVVSVAQRTFVADFGSTQAVVAWTMTGYMLALATVIPTAGW
AADRFGTRRLFMGSVLAFTLGSLLCAVAPNILLLIIFRVVQGFGGGMLTP
VSFAILAREAGPKRLGRVMAVVGIPMLLGPVGGPILGGWLIGAYGWRWIF
LVNLPVGLSALVLAAIVFPRDRPAASENFDYMGLLLLSPGLATFLFGVSS
SPARGTMADRHVLIPAITGLALIAAFVAHSWYRTEHPLIDMRLFQNRAVA
QANMTMTVLSLGLFGSFLLLPSYLQQVLHQSPMQSGVHIIPQGLGAMLAM
PIAGAMMDRRGPAKIVLVGIMLIAAGLGTFAFGVARQADYLPILPTGLAI
MGMGMGCSMMPLSGAAVQTLAPHQIARGSTLISVNQQVGGSIGTALMSVL
LTYQFNHSEIIATAKKVALTPESGAGRGAAVDPSSLPRQTNFAAQLLHDL
SHAYAVVFVIATALVVSTLIPAAFLPKQQASHRRAPLLSA
>Rv3106 fprA, NADPH:ADRENODOXIN OXIDOREDUCTASE FPRA (NADPH-FERREDOXIN REDUCTASE)
MRPYYIAIVGSGPSAFFAAASLLKAADTTEDLDMAVDMLEMLPTPWGLVR
SGVAPDHPKIKSISKQFEKTAEDPRFRFFGNVVVGEHVQPGELSERYDAV
IYAVGAQSDRMLNIPGEDLPGSIAAVDFVGWYNAHPHFEQVSPDLSGARA
VVIGNGNVALDVARILLTDPDVLARTDIADHALESLRPRGIQEVVIVGRR
GPLQAAFTTLELRELADLDGVDVVIDPAELDGITDEDAAAVGKVCKQNIK
VLRGYADREPRPGHRRMVFRFLTSPIEIKGKRKVERIVLGRNELVSDGSG
RVAAKDTGEREELPAQLVVRSVGYRGVPTPGLPFDDQSGTIPNVGGRING
SPNEYVVGWIKRGPTGVIGTNKKDAQDTVDTLIKNLGNAKEGAECKSFPE
DHADQVADWLAARQPKLVTSAHWQVIDAFERAAGEPHGRPRVKLASLAEL
LRIGLG
>Rv0522 gabP, PROBABLE GABA PERMEASE GABP (4-AMINO BUTYRATE TRANSPORT CARRIER) (GAMMA-AMINOBUTYRATE PERMEASE)
MIAIGGVIGAGLFVGSGVVIRATGPAAFLTYALCGALIVLVMRMLGEMAA
ANPSTGAFADYAAKALGGWAGFSVGWLYWYFWVIVVGFEAVAGGKVLTYW
IDAPLWLASLCLMMMMTATNLVSVSSFGEFEFWFAGVKVATIVGFLVLGT
AFAFGLLPGHGMDFSNLSAHGGFFPDGVGAVFAAIVVAIFSMTGTEVVTI
AAAEAPDPQRAVQRAMSTVVARIVIFFVGSVFLLTVILPWNSLELGASPY
VAALRHMGIGGADQIMNAVVLTAVLSCLNSGLYTASRMLFVLAARQEAPA
QLVKVNRRGVPTFAIMGSSVVGFLCVIMAWVSPATVFVFLLNSSGAVILF
VYLLIALSQIVLRRQTSGQNLGVRMWLFPGLSIVTVTGIVAVLARMAFDY
AARSQLWLSLLSWAVVVGCYLVTTLVRRPLNRPW
>Rv2589 gabT, 4-AMINOBUTYRATE AMINOTRANSFERASE GABT (GAMMA-AMINO-N-BUTYRATE TRANSAMINASE) (GABA TRANSAMINASE) (GLUTAMATE:SUCCINIC SEMIALDEHYDE TRANSAMINASE) (GABA A
MASLQQSRRLVTEIPGPASQALTHRRAAAVSSGVGVTLPVFVARAGGGIV
EDVDGNRLIDLGSGIAVTTIGNSSPRVVDAVRTQVAEFTHTCFMVTPYEG
YVAVAEQLNRITPGSGPKRSVLFNSGAEAVENAVKIARSYTGKPAVVAFD
HAYHGRTNLTMALTAKSMPYKSGFGPFAPEIYRAPLSYPYRDGLLDKQLA
TNGELAAARAIGVIDKQVGANNLAALVIEPIQGEGGFIVPAEGFLPALLD
WCRKNHVVFIADEVQTGFARTGAMFACEHEGPDGLEPDLICTAKGIADGL
PLSAVTGRAEIMNAPHVGGLGGTFGGNPVACAAALATIATIESDGLIERA
RQIERLVTDRLTTLQAVDDRIGDVRGRGAMIAVELVKSGTTEPDAGLTER
LATAAHAAGVIILTCGMFGNIIRLLPPLTIGDELLSEGLDIVCAILADL
>Rv3432c gadB, PROBABLE GLUTAMATE DECARBOXYLASE GADB
MSRSHPSVPAHSIAPAYTGRMFTAPVPALRMPDESMDPEAAYRFIHDELM
LDGSSRLNLATFVTTWMDPEAEKLMAETFDKNMIDKDEYPATAAIEARCV
SMVADLFHAEGLRDHDPTSATGVSTIGSSEAVMLGGLALKWRWRQRVGSW
KGRMPNLVMGSNVQVVWEKFCRYFDVEPRYLPMERGRYVITPEQVLAAVD
ENTIGVVAILGTTYTGELEPIAEICAALDKLAAGGGVDVPVHVDAASGGF
VVPFLHPDLVWDFRLPRVVSINVSGHKYGLTYPGVGFVVWRGPEHLPEDL
VFRVNYLGGDMPTFTLNFSRPGNQVVGQYYNFLRLGRDGYTKVMQALSHT
ARWLGDQLREVDHCEVISDGSAIPVVSFRLAGDRGYTEFDVSHELRTFGW
QVPAYTMPDNATDVAVLRIVVREGLSADLARALHDDAVTALAALDKVKPG
GHFDAQHFAH
>Rv1832 gcvB, Probable glycine dehydrogenase gcvB (Glycine decarboxylase) (Glycine cleavage system P-protein)
MSDHSTFADRHIGLDSQAVATMLAVIGVDSLDDLAVKAVPAGILDTLTDT
GAAPGLDSLPPAASEAEALAELRALADANTVAVSMIGQGYYDTHTPPVLL
RNIIENPAWYTAYTPYQPEISQGRLEALLNFQTLVTDLTGLEIANASMLD
EGTAAAEAMTLMHRAARGPVKRVVVDADVFTQTAAVLATRAKPLGIEIVT
ADLRAGLPDGEFFGVIAQLPGASGRITDWSALVQQAHDRGALVAVGADLL
ALTLIAPPGEIGADVAFGTTQRFGVPMGFGGPHAGYLAVHAKHARQLPGR
LVGVSVDSDGTPAYRLALQTREQHIRRDKATSNICTAQVLLAVLAAMYAS
YHGAGGLTAIARRVHAHAEAIAGALGDALVHDKYFDTVLARVPGRADEVL
ARAKANGINLWRVDADHVSVACDEATTDTHVAVVLDAFGVAAAAPAHTDI
ATRTSEFLTHPAFTQYRTETSMMRYLRALADKDIALDRSMIPLGSCTMKL
NAAAEMESITWPEFGRQHPFAPASDTAGLRQLVADLQSWLVLITGYDAVS
LQPNAGSQGEYAGLLAIHEYHASRGEPHRDICLIPSSAHGTNAASAALAG
MRVVVVDCHDNGDVDLDDLRAKVGEHAERLSALMITYPSTHGVYEHDIAE
ICAAVHDAGGQVYVDGANLNALVGLARPGKFGGDVSHLNLHKTFCIPHGG
GGPGVGPVAVRAHLAPFLPGHPFAPELPKGYPVSSAPYGSASILPITWAY
IRMMGAEGLRAASLTAITSANYIARRLDEYYPVLYTGENGMVAHECILDL
RGITKLTGITVDDVAKRLADYGFHAPTMSFPVAGTLMVEPTESESLAEVD
AFCEAMIGIRAEIDKVGAGEWPVDDNPLRGAPHTAQCLLASDWDHPYTRE
QAAYPLGTAFRPKVWPAVRRIDGAYGDRNLVCSCPPVEAFA
>Rv1826 gcvH, PROBABLE GLYCINE CLEAVAGE SYSTEM H PROTEIN GCVH
MSDIPSDLHYTAEHEWIRRSGDDTVRVGITDYAQSALGDVVFVQLPVIGT
AVTAGETFGEVESTKSVSDLYAPISGKVSEVNSDLDGTPQLVNSDPYGAG
WLLDIQVDSSDVAALESALTTLLDAEAYRGTLTE
>Rv2211c gcvT, Probable aminomethyltransferase GcvT (Glycine cleavage system T protein)
MCQQGRPLGWDAVSDVPELIHGPLEDRHRELGASFAEFGGWLMPVSYAGT
VSEHNATRTAVGLFDVSHLGKALVRGPGAAQFVNSALTNDLGRIGPGKAQ
YTLCCTESGGVIDDLIAYYVSDDEIFLVPNAANTAAVVGALQAAAPGGLS
ITNLHRSYAVLAVQGPCSTDVLTALGLPTEMDYMGYADASYSGVPVRVCR
TGYTGEHGYELLPPWESAGVVFDALLAAVSAAGGEPAGLGARDTLRTEMG
YPLHGHELSLDISPLQARCGWAVGWRKDAFFGRAALLAEKAAGPRRLLRG
LRMVGRGVLRPGLAVLVGDETVGVTTSGTFSPTLQVGIGLALIDSDAGIE
DGQQINVDVRGRAVECQVVCPPFVAVKTR
>Rv2476c gdh, PROBABLE NAD-DEPENDENT GLUTAMATE DEHYDROGENASE GDH (NAD-GDH) (NAD-DEPENDENT GLUTAMIC DEHYDROGENASE)
MTIDPGAKQDVEAWTTFTASADIPDWISKAYIDSYRGPRDDSSEATKAAE
ASWLPASLLTPAMLGAHYRLGRHRAAGESCVAVYRADDPAGFGPALQVVA
EHGGMLMDSVTVLLHRLGIAYAAILTPVFDVHRSPTGELLRIEPKAEGTS
PHLGEAWMHVALSPAVDHKGLAEVERLLPKVLADVQRVATDATALIATLS
ELAGEVESNAGGRFSAPDRQDVGELLRWLGDGNFLLLGYQRCRVADGMVY
GEGSSGMGVLRGRTGSRPRLTDDDKLLVLAQARVGSYLRYGAYPYAIAVR
EYVDGSVVEHRFVGLFSVAAMNADVLEIPTISRRVREALAMAESDPSHPG
QLLLDVIQTVPRPELFTLSAQRLLTMARAVVDLGSQRQALLFLRADRLQY
FVSCLVYMPRDRYTTAVRMQFEDILVREFGGTRLEFTARVSESPWALMHF
MVRLPEVGVAGEGAAAPPVDVSEANRIRIQGLLTEAARTWADRLIGAAAA
AGSVGQADAMHYAAAFSEAYKQAVTPADAIGDIAVITELTDDSVKLVFSE
RDEQGVAQLTWFLGGRTASLSQLLPMLQSMGVVVLEERPFSVTRPDGLPV
WIYQFKISPHPTIPLAPTVAERAATAHRFAEAVTAIWHGRVEIDRFNELV
MRAGLTWQQVVLLRAYAKYLRQAGFPYSQSYIESVLNEHPATVRSLVDLF
EALFVPVPSGSASNRDAQAAAAAVAADIDALVSLDTDRILRAFASLVQAT
LRTNYFVTRQGSARCRDVLALKLNAQLIDELPLPRPRYEIFVYSPRVEGV
HLRFGPVARGGLRWSDRRDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVV
KRPPLPTGDPAADRDATRAEGVACYQLFISGLLDVTDNVDHATASVNPPP
EVVRRDGDDAYLVVAADKGTATFSDIANDVAKSYGFWLGDAFASGGSVGY
DHKAMGITARGAWEAVKRHFREIGIDTQTQDFTVVGIGDMSGDVFGNGML
LSKHIRLIAAFDHRHIFLDPNPDAAVSWAERRRMFELPRSSWSDYDRSLI
SEGGGVYSREQKAIPLSAQVRAVLGIDGSVDGGAAEMAPPNLIRAILRAP
VDLLFNGGIGTYIKAESESDADVGDRANDPVRVNANQVRAKVIGEGGNLG
VTALGRVEFDLSGGRINTDALDNSAGVDCSDHEVNIKILIDSLVSAGTVK
ADERTQLLESMTDEVAQLVLADNEDQNDLMGTSRANAASLLPVHAMQIKY
LVAERGVNRELEALPSEKEIARRSEAGIGLTSPELATLMAHVKLGLKEEV
LATELPDQDVFASRLPRYFPTALRERFTPEIRSHQLRREIVTTMLINDLV
DTAGITYAFRIAEDVGVTPIDAVRTYVATDAIFGVGHIWRRIRAANLPIA
LSDRLTLDTRRLIDRAGRWLLNYRPQPLAVGAEINRFAAMVKALTPRMSE
WLRGDDKAIVEKTAAEFASQGVPEDLAYRVSTGLYRYSLLDIIDIADIAD
IDAAEVADTYFALMDRLGTDGLLTAVSQLPRHDRWHSLARLAIRDDIYGA
LRSLCFDVLAVGEPGESSEQKIAEWEHLSASRVARARRTLDDIRASGQKD
LATLSVAARQIRRMTRTSGRGISG
>Rv0773c ggtA, PROBABLE BIFUNCTIONAL ACYLASE GGTA: CEPHALOSPORIN ACYLASE (GL-7ACA ACYLASE) + GAMMA-GLUTAMYLTRANSPEPTIDASE (GGT)
MPILATNVVCTSQPLAAQAGLRMLADGGNAVDAAVATAITLTVVEPVSNG
IGSDAFSIVWDGQKLHGLNASGRSPSAWTPEYFGGNAVPVLGWNSVTVPG
AVSAWVELHARFGRLPFETLFEPAISYGRNGFLVSPTVAAQWAAQVPLFA
SQPGFADAFMPGGRAPKPGELFTFPDHAATLEKIAATNGEEFYRGELAAK
LEAHSAANGGVMRADDLAAHRVDWVDTITGTYRGYTIHQIPPNGQGIVAL
IALGILEHFDMSSWSVDSAESVHVQIEALKLAFADAQACVADIDYMPVHP
KRLLDKEYLRQRATLIDPKRAMPAATGIPRGGTVYLAAADAAGMMVSMIQ
SNYLGFGSGVVVPGTGISLHNRGSDFTVVPRHPNRVGPRKRPYHTIIPGF
VTRDGAPVMSFGVMGGMMQPQGHVQVLVRIADYGQNPQAACDGPRFRWVN
GMRVSFENGFPDSTLDELRQRGHDLVAVADYSQFGSCQAIWRLDDGYLAA
SDPRRDGQAAAC
>Rv2394 ggtB, PROBABLE GAMMA-GLUTAMYLTRANSPEPTIDASE PRECURSOR GGTB (GAMMA-GLUTAMYLTRANSFERASE) (GLUTAMYL TRANSPEPTIDASE)
MSVWLRAGALVAAVMLSLSGCGGFHAGAPSTAGPCEIVPNGTPAPKTPPA
TVPSSRNLATNPEIATGYRRDMTVVRTAHYAAATANPLATQVACRVLRDG
GTAADAVVAAQAVLGLVEPQSSGIGGGGYLVYFDARTGSVQAYDGREVAP
AAATENYLRWVSDVDRSAPRPNARASGRSIGVPGILRMLEMVHNEHGRTP
WRDLFGPAVTLADGGFDISARMGAAISDAAPQLRDDPEARKYFLNPDGSP
KPAGTRLTNPAYSKTLSAIASAGANAFYSGDIAHDIVAAASDTSNGRTPG
LLTIEDLAGYLAKRRQPLCTTYRGREICGMPSSGGVAVAATLGILEHFPM
SDYAPSKVDLNGGRPTVMGVHLIAEAERLAYADRDQYIADVDFVRLPGGS
LTTLVDPGYLAARAALISPQHSMGSARPGDFGAPTAVAPPVPEHGTSHLS
VVDSYGNAATLTTTVESSFGSYHLVDGFILNNQLSDFSAEPHATDGSPVA
NRVEPGKRPRSSMAPTLVFDHSSAGRGALYAVLGSPGGSMIIQFVVKTLV
AMLDWGLNPQQAVSLVDFGAANSPHTNLGGENPEINTSDDGDHDPLVQGL
RALGHRVNLAEQSSGLSAITRSEAGWAGGADPRREGAVMGDDA
>Rv2220 glnA1, GLUTAMINE SYNTHETASE GLNA1 (GLUTAMINE SYNTHASE) (GS-I)
MTEKTPDDVFKLAKDEKVEYVDVRFCDLPGIMQHFTIPASAFDKSVFDDG
LAFDGSSIRGFQSIHESDMLLLPDPETARIDPFRAAKTLNINFFVHDPFT
LEPYSRDPRNIARKAENYLISTGIADTAYFGAEAEFYIFDSVSFDSRANG
SFYEVDAISGWWNTGAATEADGSPNRGYKVRHKGGYFPVAPNDQYVDLRD
KMLTNLINSGFILEKGHHEVGSGGQAEINYQFNSLLHAADDMQLYKYIIK
NTAWQNGKTVTFMPKPLFGDNGSGMHCHQSLWKDGAPLMYDETGYAGLSD
TARHYIGGLLHHAPSLLAFTNPTVNSYKRLVPGYEAPINLVYSQRNRSAC
VRIPITGSNPKAKRLEFRSPDSSGNPYLAFSAMLMAGLDGIKNKIEPQAP
VDKDLYELPPEEAASIPQTPTQLSDVIDRLEADHEYLTEGGVFTNDLIET
WISFKRENEIEPVNIRPHPYEFALYYDV
>Rv2222c glnA2, PROBABLE GLUTAMINE SYNTHETASE GLNA2 (GLUTAMINE SYNTHASE) (GS-II)
MDRQKEFVLRTLEERDIRFVRLWFTDVLGFLKSVAIAPAELEGAFEEGIG
FDGSSIEGFARVSESDTVAHPDPSTFQVLPWATSSGHHHSARMFCDITMP
DGSPSWADPRHVLRRQLTKAGELGFSCYVHPEIEFFLLKPGPEDGSVPVP
VDNAGYFDQAVHDSALNFRRHAIDALEFMGISVEFSHHEGAPGQQEIDLR
FADALSMADNVMTFRYVIKEVALEEGARASFMPKPFGQHPGSAMHTHMSL
FEGDVNAFHSADDPLQLSEVGKSFIAGILEHACEISAVTNQWVNSYKRLV
QGGEAPTAASWGAANRSALVRVPMYTPHKTSSRRVEVRSPDSACNPYLTF
AVLLAAGLRGVEKGYVLGPQAEDNVWDLTPEERRAMGYRELPSSLDSALR
AMEASELVAEALGEHVFDFFLRNKRTEWANYRSHVTPYELRTYLSL
>Rv1878 glnA3, PROBABLE GLUTAMINE SYNTHETASE GLNA3 (GLUTAMINE SYNTHASE) (GS-I)
MTATPLAAAAIAQLEAEGVDTVIGTVVNPAGLTQAKTVPIRRTNTFANPG
LGASPVWHTFCIDQCSIAFTADISVVGDQRLRIDLSALRIIGDGLAWAPA
GFFEQDGTPVPACSRGTLSRIEAALADAGIDAVIGHEVEFLLVDADGQRL
PSTLWAQYGVAGVLEHEAFVRDVNAAATAAGIAIEQFHPEYGANQFEISL
APQPPVAAADQLVLTRLIIGRTARRHGLRVSLSPAPFAGSIGSGAHQHFS
LTMSEGMLFSGGTGAAGMTSAGEAAVAGVLRGLPDAQGILCGSIVSGLRM
RPGNWAGIYACWGTENREAAVRFVKGGAGSAYGGNVEVKVVDPSANPYLA
SAAILGLALDGMKTKAVLPSETTVDPTQLSDVDRDRAGILRLAADQADAI
AVLDSSKLLRCILGDPVVDAVVAVRQLEHERYGDLDPAQLADKFRMAWSV
>Rv2860c glnA4, PROBABLE GLUTAMINE SYNTHETASE GLNA4 (GLUTAMINE SYNTHASE) (GS-II)
MTGPGSPPLAWTELERLVAAGDVDTVIVAFTDMQGRLAGKRISGRHFVDD
IATRGVECCSYLLAVDVDLNTVPGYAMASWDTGYGDMVMTPDLSTLRLIP
WLPGTALVIADLVWADGSEVAVSPRSILRRQLDRLKARGLVADVATELEF
IVFDQPYRQAWASGYRGLTPASDYNIDYAILASSRMEPLLRDIRLGMAGA
GLRFEAVKGECNMGQQEIGFRYDEALVTCDNHAIYKNGAKEIADQHGKSL
TFMAKYDEREGNSCHIHVSLRGTDGSAVFADSNGPHGMSSMFRSFVAGQL
ATLREFTLCYAPTINSYKRFADSSFAPTALAWGLDNRTCALRVVGHGQNI
RVECRVPGGDVNQYLAVAALIAGGLYGIERGLQLPEPCVGNAYQGADVER
LPVTLADAAVLFEDSALVREAFGEDVVAHYLNNARVELAAFNAAVTDWER
IRGFERL
>Rv2919c glnB, PROBABLE NITROGEN REGULATORY PROTEIN P-II GLNB
MKLITAIVKPFTLDDVKTSLEDAGVLGMTVSEIQGYGRQKGHTEVYRGAE
YSVDFVPKVRIEVVVDDSIVDKVVDSIVRAARTGKIGDGKVWVSPVDTIV
RVRTGERGHDAL
>Rv0411c glnH, PROBABLE GLUTAMINE-BINDING LIPOPROTEIN GLNH (GLNBP)
MTRRALLARAAAPLAPLALAMVLASCGHSETLGVEATPTLPLPTPVGMEI
MPPQPPLPPDSSSQDCDPTASLRPFATKAEADAAVADIRARGRLIVGLDI
GSNLFSFRDPITGEITGFDVDIAGEVARDIFGVPSHVEYRILSAAERVTA
LQKSQVDIVVKTMSITCERRKLVNFSTVYLDANQRILAPRDSPITKVSDL
SGKRVCVARGTTSLRRIREIAPPPVIVSVVNWADCLVALQQREIDAVSTD
DTILAGLVEEDPYLHIVGPDMADQPYGVGINLDNTGLVRFVNGTLERIRN
DGTWNTLYRKWLTVLGPAPAPPTPRYVD
>Rv3859c gltB, PROBABLE FERREDOXIN-DEPENDENT GLUTAMATE SYNTHASE [NADPH] (LARGE SUBUNIT) GLTB (L-GLUTAMATE SYNTHASE) (L-GLUTAMATE SYNTHETASE) (NADH-GLUTAMATE SYNTHASE
MTPKRVGLYNPAFEHDSCGVAMVVDMHGRRSRDIVDKAITALLNLEHRGA
QGAEPRSGDGAGILIQVPDEFLREAVDFELPAPGSYATGIAFLPQSSKDA
AAACAAVQKIAEAEGLQVLGWRSVPTDDSSLGALSRDAMPTFRQVFLAGA
SGMALERRCYVVRKRAEHELGTKGPGQDGPGRETVYFPSLSGQTLVYKGM
LTTPQLKAFYLDLQDERLTSALGIVHSRFSTNTFPSWPLAHPFRRIAHNG
EINTVTGNENWMRAREALIKTDIFGSAADVEKLFPICTPGASDTARFDEV
LELLHLGGRSLAHAVLMMIPEAWERHESMDPARRAFYQYHASLMEPWDGP
ASMTFTDGTVVGAVLDRNGLRPSRIWVTDDGLVVMASEAGVLDLHPSTVV
RRMRLQPGRMFLVDTAQGRIVSDEEIKADLAAEHPYQEWLDNGLVPLDEL
PEGKDVRMPHHRIVMRQLAFGYTYEELNLLVAPMARLGAEPIGSMGTDTP
VAVLSQRPRMLYDYFHQLFAQVTNPPLDAIREEVVTSLQGTTGGERDLLN
PDQNSCHQIVLPQPILRNHELAKLVSLDPNDKVNGRPHGLRSKVIRCLYR
VSEGGAGLAAALEEVRGAAAAAIADGARIIILSDRESDEEMAPIPSLLAV
AGVHHHLVRERTRTQVGLVVESGDAREVHHMAALVGFGAAAINPYLVFES
IEDMLDRGVIEGIDRTAALNNYIKAAGKGVLKVMSKMGISTLASYTGAQL
FQAVGISEQVLDEYFTGLTCPTGGITLDDIAADVAARHRLAYLDRPDERA
HRELEVGGEYQWRREGEYHLFNPETVFKLQHSTRTGQYKIFKEYTRLVDD
QSERMASLRGLLKFRTGVRPPVPLDEVEPASEIVKRFSTGAMSYGSISAE
AHETLAIAMNRLGARSNCGEGGEDVKRFDRDPNGDWRRSAIKQVASARFG
VTSHYLTNCTDLQIKMAQGAKPGEGGQLPGHKVYPWVAEVRHSTPGVGLI
SPPPHHDIYSIEDLAQLIHDLKNANPSARVHVKLVSENGVGTVAAGVSKA
HADVVLISGHDGGTGATPLTSMKHAGAPWELGLAETQQTLLLNGLRDRIV
VQVDGQLKTGRDVMIATLLGAEEFGFATAPLVVAGCIMMRVCHLDTCPVG
VATQNPLLRERFTGKPEFVENFFMFIAEEVREYLAQLGFRTVNEAVGQAG
ALDTTLARAHWKAHKLDLAPVLHEPESAFMNQDLYCSSRQDHGLDKALDQ
QLIVMSREALDSGKPVRFSTTIGNVNRTVGTMLGHELTKAYGGQGLPDGT
IDITFDGSAGNSFGAFVPKGITLRVYGDANDYVGKGLSGGRIVVRPSDDA
PQDYVAEDNIIGGNVILFGATSGEVYLRGVVGERFAVRNSGAHAVVEGVG
DHGCEYMTGGRVVILGRTGRNFAAGMSGGVAYVYDPDGELPANLNSEMVE
LETLDEDDADWLHGTIQVHVDATDSAVGQRILSDWSGQQRHFVKVMPRDY
KRVLQAIALAERDGVDVDKAIMAAAHG
>Rv3858c gltD, PROBABLE NADH-DEPENDENT GLUTAMATE SYNTHASE (SMALL SUBUNIT) GLTD (L-GLUTAMATE SYNTHASE) (L-GLUTAMATE SYNTHETASE) (NADH-GLUTAMATE SYNTHASE) (GLUTAMATE S
MADPGGFLKYTHRKLPKRRPVPLRLRDWREVYEEFDNESLRQQATRCMDC
GIPFCHNGCPLGNLIPEWNDLVRRGRWRDAIERLHATNNFPDFTGRLCPA
PCEPACVLGINQDPVTIKQIELEIIDKAFDEGWVQPRPPRKLTGQTVAVV
GSGPAGLAAAQQLTRAGHTVTVFEREDRIGGLLRYGIPEFKMEKRHLDRR
LDQMRSEGTEFRPGVNVGVDISAEKLRADFDAVVLAGGATAWRELPIPGR
ELEGVHQAMEFLPWANRVQEGDDVLDEDGQPPITAKGKKVVIIGGGDTGA
DCLGTVHRQGAIAVHQFEIMPRPPDARAESTPWPTYPLMYRVSAAHEEGG
ERVFSVNTEAFVGTDGRVSALRAHEVTMLDGKFVKVEGSDFELEADLVLL
AMGFVGPERAGLLTDLGVKFTERGNVARGDDFDTSVPGVFVAGDMGRGQS
LIVWAIAEGRAAAAAVDRYLMGSSALPAPVKPTAAPLQ
>Rv1093 glyA1, Probable Serine hydroxymethyltransferase 1 glyA1
MSAPLAEVDPDIAELLAKELGRQRDTLEMIASENFVPRAVLQAQGSVLTN
KYAEGLPGRRYYGGCEHVDVVENLARDRAKALFGAEFANVQPHSGAQANA
AVLHALMSPGERLLGLDLANGGHLTHGMRLNFSGKLYENGFYGVDPATHL
IDMDAVRATALEFRPKVIIAGWSAYPRVLDFAAFRSIADEVGAKLLVDMA
HFAGLVAAGLHPSPVPHADVVSTTVHKTLGGGRSGLIVGKQQYAKAINSA
VFPGQQGGPLMHVIAGKAVALKIAATPEFADRQRRTLSGARIIADRLMAP
DVAKAGVSVVSGGTDVHLVLVDLRDSPLDGQAAEDLLHEVGITVNRNAVP
NDPRPPMVTSGLRIGTPALATRGFGDTEFTEVADIIATALATGSSVDVSA
LKDRATRLARAFPLYDGLEEWSLVGR
>Rv0070c glyA2, PROBABLE SERINE HYDROXYMETHYLTRANSFERASE GLYA2 (SERINE METHYLASE 2) (SHMT 2)
MNTLNDSLTAFDPDIAALIDGELRRQESGLEMIASENYAPLAVMQAQGSV
LTNKYAEGYPGRRYYGGCEFVDGVEQLAIDRVKALFGAEYANVQPHSGAT
ANAATMHALLNPGDTILGLSLAHGGHLTHGMRINFSGKLYHATAYEVSKE
DYLVDMDAVAEAARTHRPKMIIAGWSAYPRQLDFARFRAIADEVDAVLMV
DMAHFAGLVAAGVHPSPVPHAHVVTSTTHKTLGGPRGGIILCNDPAIAKK
INSAVFPGQQGGPLEHVIAAKATAFKMAAQPEFAQRQQRCLDGARILAGR
LTQPDVAERGIAVLTGGTDVHLVLVDLRDAELDGQQAEDRLAAVDITVNR
NAVPFDPRPPMITSGLRIGTPALAARGFSHNDFRAVADLIAAALTATNDD
QLGPLRAQVQRLAARYPLYPELHRT
>Rv0114 gmhB, POSSIBLE D-ALPHA,BETA-D-HEPTOSE-1,7-BISPHOSPHATE PHOSPHATASE GMHB (D-GLYCERO-D-MANNO-HEPTOSE 7-PHOSPHATE KINASE)
MVAERAGHQWCLFLDRDGVINRQVVGDYVRNWRQFEWLPGAARALKKLRA
WAPYIVVVTNQQGVGAGLMSAVDVMVIHRHLQMQLASDGVLIDGFQVCPH
HRSQRCGCRKPRPGLVLDWLGRHPDSEPLLSIVVGDSLSDLELAHNVAAA
AGACASVQIGGASSGGVADASFDSLWEFAVAVGHARGERG
>Rv1603 hisA, PROBABLE PHOSPHORIBOSYLFORMIMINO-5-AMINOIMIDAZOLE CARBOXAMIDE RIBOTIDE ISOMERASE HISA
MMPLILLPAVDVVEGRAVRLVQGKAGSQTEYGSAVDAALGWQRDGAEWIH
LVDLDAAFGRGSNHELLAEVVGKLDVQVELSGGIRDDESLAAALATGCAR
VNVGTAALENPQWCARVIGEHGDQVAVGLDVQIIDGEHRLRGRGWETDGG
DLWDVLERLDSEGCSRFVVTDITKDGTLGGPNLDLLAGVADRTDAPVIAS
GGVSSLDDLRAIATLTHRGVEGAIVGKALYARRFTLPQALAAVRD
>Rv1601 hisB, Probable imidazole glycerol-phosphate dehydratase hisB
MTTTQTAKASRRARIERRTRESDIVIELDLDGTGQVAVDTGVPFYDHMLT
ALGSHASFDLTVRATGDVEIEAHHTIEDTAIALGTALGQALGDKRGIRRF
GDAFIPMDETLAHAAVDLSGRPYCVHTGEPDHLQHTTIAGSSVPYHTVIN
RHVFESLAANARIALHVRVLYGRDPHHITEAQYKAVARALRQAVEPDPRV
SGVPSTKGAL
>Rv1600 hisC1, Probable histidinol-phosphate aminotransferase hisC1
MTRSGHPVTLDDLPLRADLRGKAPYGAPQLAVPVRLNTNENPHPPTRALV
DDVVRSVREAAIDLHRYPDRDAVALRADLAGYLTAQTGIQLGVENIWAAN
GSNEILQQLLQAFGGPGRSAIGFVPSYSMHPIISDGTHTEWIEASRANDF
GLDVDVAVAAVVDRKPDVVFIASPNNPSGQSVSLPDLCKLLDVAPGIAIV
DEAYGEFSSQPSAVSLVEEYPSKLVVTRTMSKAFAFAGGRLGYLIATPAV
IDAMLLVRLPYHLSSVTQAAARAALRHSDDTLSSVAALIAERERVTTSLN
DMGFRVIPSDANFVLFGEFADAPAAWRRYLEAGILIRDVGIPGYLRATTG
LAEENDAFLRASARIATDLVPVTRSPVGAP
>Rv3772 hisC2, PROBABLE HISTIDINOL-PHOSPHATE AMINOTRANSFERASE HISC2 (IMIDAZOLE ACETOL-PHOSPHATE TRANSAMINASE) (IMIDAZOLYLACETOLPHOSPHATE AMINOTRANSFERASE)
MTARLRPELAGLPVYVPGKTVPGAIKLASNETVFGPLPSVRAAIDRATDT
VNRYPDNGCVQLKAALARHLGPDFAPEHVAVGCGSVSLCQQLVQVTASVG
DEVVFGWRSFELYPPQVRVAGAIPIQVPLTDHTFDLYAMLATVTDRTRLI
FVCNPNNPTSTVVGPDALARFVEAVPAHILIAIDEAYVEYIRDGMRPDSL
GLVRAHNNVVVLRTFSKAYGLAGLRIGYAIGHPDVITALDKVYVPFTVSS
IGQAAAIASLDAADELLARTDTVVAERARVSAELRAAGFTLPPSQANFVW
LPLGSRTQDFVEQAADARIVVRPYGTDGVRVTVAAPEENDAFLRFARRWR
SDQ
>Rv1599 hisD, Probable histidinol dehydrogenase HisD (HDH)
MLTRIDLRGAELTAAELRAALPRGGADVEAVLPTVRPIVAAVAERGAEAA
LDFGASFDGVRPHAIRVPDAALDAALAGLDCDVCEALQVMVERTRAVHSG
QRRTDVTTTLGPGATVTERWVPVERVGLYVPGGNAVYPSSVVMNVVPAQA
AGVDSLVVASPPQAQWDGMPHPTILAAARLLGVDEVWAVGGAQAVALLAY
GGTDTDGAALTPVDMITGPGNIYVTAAKRLCRSRVGIDAEAGPTEIAILA
DHTADPVHVAADLISQAEHDELAASVLVTPSEDLADATDAELAGQLQTTV
HRERVTAALTGRQSAIVLVDDVDAAVLVVNAYAAEHLEIQTADAPQVASR
IRSAGAIFVGPWSPVSLGDYCAGSNHVLPTAGCARHSSGLSVQTFLRGIH
VVEYTEAALKDVSGHVITLATAEDLPAHGEAVRRRFER
>Rv2122c hisE, Probable phosphoribosyl-AMP pyrophosphatase HisE
MQQSLAVKTFEDLFAELGDRARTRPADSTTVAALDGGVHALGKKLLEEAG
EVWLAAEHESNDALAEEISQLLYWTQVLMISRGLSLDDVYRKL
>Rv1605 hisF, Probable cyclase hisF
MYADRDLPGAGGLAVRVIPCLDVDDGRVVKGVNFENLRDAGDPVELAAVY
DAEGADELTFLDVTASSSGRATMLEVVRRTAEQVFIPLTVGGGVRTVADV
DSLLRAGADKVAVNTAAIACPDLLADMARQFGSQCIVLSVDARTVPVGSA
PTPSGWEVTTHGGRRGTGMDAVQWAARGADLGVGEILLNSMDADGTKAGF
DLALLRAVRAAVTVPVIASGGAGAVEHFAPAVAAGADAVLAASVFHFREL
TIGQVKAALAAEGITVR
>Rv2121c hisG, Probable ATP phosphoribosyltransferase HisG
MLRVAVPNKGALSEPATEILAEAGYRRRTDSKDLTVIDPVNNVEFFFLRP
KDIAIYVGSGELDFGITGRDLVCDSGAQVRERLALGFGSSSFRYAAPAGR
NWTTADLAGMRIATAYPNLVRKDLATKGIEATVIRLDGAVEISVQLGVAD
AIADVVGSGRTLSQHDLVAFGEPLCDSEAVLIERAGTDGQDQTEARDQLV
ARVQGVVFGQQYLMLDYDCPRSALKKATAITPGLESPTIAPLADPDWVAI
RALVPRRDVNGIMDELAAIGAKAILASDIRFCRF
>Rv1602 hisH, Probable amidotransferase hisH
MTAKSVVVLDYGSGNLRSAQRALQRVGAEVEVTADTDAAMTADGLVVPGV
GAFAACMAGLRKISGERIIAERVAAGRPVLGVCVGMQILFACGVEFGVQT
PGCGHWPGAVIRLEAPVIPHMGWNVVDSAAGSALFKGLDVDARFYFVHSY
AAQRWEGSPDALLTWATYRAPFLAAVEDGALAATQFHPEKSGDAGAAVLS
SWVDGL
>Rv1606 hisI, Probable phosphoribosyl-AMP 1,6 cyclohydrolase hisI
MTLDPKIAARLKRNADGLVTAVVQERGSGDVLMVAWMNDEALARTLQTRE
ATYYSRSRAEQWVKGATSGHTQHVHSVRLDCDGDAVLLTVDQVGGACHTG
DHSCFDAAVLLEPDD
>Rv1559 ilvA, Probable threonine dehydratase ilvA
MSAELSQSPSSSPLFSLSGADIDRAAKRIAPVVTPTPLQPSDRLSAITGA
TVYLKREDLQTVRSYKLRGAYNLLVQLSDEELAAGVVCSSAGNHAQGFAY
ACRCLGVHGRVYVPAKTPKQKRDRIRYHGGEFIDLIVGGSTYDLAAAAAL
EDVERTGATLVPPFDDLRTIAGQGTIAVEVLGQLEDEPDLVVVPVGGGGC
IAGITTYLAERTTNTAVLGVEPAGAAAMMAALAAGEPVTLDHVDQFVDGA
AVNRAGTLTYAALAAAGDMVSLTTVDEGAVCTAMLDLYQNEGIIAEPAGA
LSVAGLLEADIEPGSTVVCLISGGNNDVSRYGEVLERSLVHLGLKHYFLV
DFPQEPGALRRFLDDVLGPNDDITLFEYVKRNNRETGEALVGIELGSAAD
LDGLLARMRATDIHVEALEPGSPAYRYLL
>Rv3003c ilvB1, PROBABLE ACETOLACTATE SYNTHASE (LARGE SUBUNIT) ILVB1 (ACETOHYDROXY-ACID SYNTHASE)
MSAPTKPHSPTFKPEPHSAANEPKHPAARPKHVALQQLTGAQAVIRSLEE
LGVDVIFGIPGGAVLPVYDPLFDSKKLRHVLVRHEQGAGHAASGYAHVTG
RVGVCMATSGPGATNLVTPLADAQMDSIPVVAITGQVGRGLIGTDAFQEA
DISGITMPITKHNFLVRSGDDIPRVLAEAFHIAASGRPGAVLVDIPKDVL
QGQCTFSWPPRMELPGYKPNTKPHSRQVREAAKLIAAARKPVLYVGGGVI
RGEATEQLRELAELTGIPVVTTLMARGAFPDSHRQNLGMPGMHGTVAAVA
ALQRSDLLIALGTRFDDRVTGKLDSFAPEAKVIHADIDPAEIGKNRHADV
PIVGDVKAVITELIAMLRHHHIPGTIEMADWWAYLNGVRKTYPLSYGPQS
DGSLSPEYVIEKLGEIAGPDAVFVAGVGQHQMWAAQFIRYEKPRSWLNSG
GLGTMGFAIPAAMGAKIALPGTEVWAIDGDGCFQMTNQELATCAVEGIPV
KVALINNGNLGMVRQWQSLFYAERYSQTDLATHSHRIPDFVKLAEALGCV
GLRCEREEDVVDVINQARAINDCPVVIDFIVGADAQVWPMVAAGTSNDEI
QAARGIRPLFDDITEGHA
>Rv3470c ilvB2, PROBABLE ACETOLACTATE SYNTHASE (LARGE SUBUNIT) ILVB2 (AHAS) (ACETOHYDROXY-ACID SYNTHASE LARGE SUBUNIT) (ALS)
MTVGDHLVARMRAAGISVVCGLPTSRLDSLLVRLSRDAGFQIVLARHEGG
AGYLADGFARASGKSAAVFVAGPGATNVISAVANASVNQVPMLILTGEVA
VGEFGLHSQQDTSDDGLGLGATFRRFCRCSVSIESIANARSKIDSAFRAL
ASIPRGPVHIALPRDLVDERLPAHQLGTAAAGLGGLRTLAPCGPDVADEV
IGRLDRSRAPMLVLGNGCRLDGIGEQIVAFCEKAGLPFATTPNGRGIVAE
THPLSLGVLGIFGDGRADEYLFDTPCDLLIAVGVSFGGLVTRSFSPRWRG
LKADVVHVDPDPSAVGRFVATSLGITTSGRAFVNALNCGRPPRFCRRVGV
RPPAPAALPGTPQARGESIHPLELMHELDRELAPNATICADVGTCISWTF
RGIPVRRPGRFFATVDFSPMGCGIAGAIGVALARPEEHVICIAGDGAFLM
HGTEISTAVAHGIRVTWAVLNDGQMSASAGPVSGRMDPSPVARIGANDLA
AMARALGAEGIRVDTRCELRAGVQKALAATGPCVLDIAIDPEINKPDIGL
GR
>Rv3001c ilvC, PROBABLE KETOL-ACID REDUCTOISOMERASE ILVC (Acetohydroxy-acid isomeroreductase) (Alpha-keto-beta-hydroxylacyl reductoisomerase)
MFYDDDADLSIIQGRKVGVIGYGSQGHAHSLSLRDSGVQVRVGLKQGSRS
RPKVEEQGLDVDTPAEVAKWADVVMVLAPDTAQAEIFAGDIEPNLKPGDA
LFFGHGLNVHFGLIKPPADVAVAMVAPKGPGHLVRRQFVDGKGVPCLVAV
EQDPRGDGLALALSYAKAIGGTRAGVIKTTFKDETETDLFGEQTVLCGGT
EELVKAGFEVMVEAGYPAELAYFEVLHELKLIVDLMYEGGLARMYYSVSD
TAEFGGYLSGPRVIDAGTKERMRDILREIQDGSFVHKLVADVEGGNKQLE
ELRRQNAEHPIEVVGKKLRDLMSWVDRPITETA
>Rv0189c ilvD, PROBABLE DIHYDROXY-ACID DEHYDRATASE ILVD (DAD)
MPQTTDEAASVSTVADIKPRSRDVTDGLEKAAARGMLRAVGMDDEDFAKP
QIGVASSWNEITPCNLSLDRLANAVKEGVFSAGGYPLEFGTISVSDGISM
GHEGMHFSLVSREVIADSVEVVMQAERLDGSVLLAGCDKSLPGMLMAAAR
LDLAAVFLYAGSILPGRAKLSDGSERDVTIIDAFEAVGACSRGLMSRADV
DAIERAICPGEGACGGMYTANTMASAAEALGMSLPGSAAPPATDRRRDGF
ARRSGQAVVELLRRGITARDILTKEAFENAIAVVMAFGGSTNAVLHLLAI
AHEANVALSLQDFSRIGSGVPHLADVKPFGRHVMSDVDHIGGVPVVMKAL
LDAGLLHGDCLTVTGHTMAENLAAITPPDPDGKVLRALANPIHPSGGITI
LHGSLAPEGAVVKTAGFDSDVFEGTARVFDGERAALDALEDGTITVGDAV
VIRYEGPKGGPGMREMLAITGAIKGAGLGKDVLLLTDGRFSGGTTGLCVG
HIAPEAVDGGPIALLRNGDRIRLDVAGRVLDVLADPAEFASRQQDFSPPP
PRYTTGVLSKYVKLVSSAAVGAVCG
>Rv2210c ilvE, PROBABLE BRANCHED-CHAIN AMINO ACID TRANSAMINASE ILVE
MTSGSLQFTVLRAVNPATDAQRESMLREPGFGKYHTDHMVSIDYAEGRGW
HNARVIPYGPIELDPSAIVLHYAQEVFEGLKAYRWADGSIVSFRADANAA
RLRSSARRLAIPELPDAVFIESLRQLIAVDKAWVPGAGGEEALYLRPFIF
ATEPGLGVRPATQYRYLLIASPAGAYFKGGIAPVSVWVSTEYVRACPGGT
GAAKFGGNYAASLLAQAEAAENGCDQVVWLDAVERRYIEEMGGMNIFFVL
GSGGSARLVTPELSGSLLPGITRDSLLQLAIDAGFAVEERRIDIDEWQKK
AAAGEITEVFACGTAAVITPVARVRHGASEFRIADGQPGEVTMALRDTLT
GIQRGTFADTHGWMARLG
>Rv1820 ilvG, Probable Acetolactate synthase ilvG (Acetohydroxy-acid synthase) (ALS)
MSTDTAPAQTMHAGRLIARRLKASGIDTVFTLSGGHLFSIYDGCREEGIR
LIDTRHEQTAAFAAEGWSKVTRVPGVAALTAGPGITNGMSAMAAAQQNQS
PLVVLGGRAPALRWGMGSLQEIDHVPFVAPVARFAATAQSAENAGLLVDQ
ALQAAVSAPSGVAFVDFPMDHAFSMSSDNGRPGALTELPAGPTPAGDALD
RAAGLLSTAQRPVIMAGTNVWWGHAEAALLRLVEERHIPVLMNGMARGVV
PADHRLAFSRARSKALGEADVALIVGVPMDFRLGFGGVFGSTTQLIVADR
VEPAREHPRPVAAGLYGDLTATLSALAGSGGTDHQGWIEELATAETMARD
LEKAELVDDRIPLHPMRVYAELAALLERDALVVIDAGDFGSYAGRMIDSY
LPGCWLDSGPFGCLGSGPGYALAAKLARPQRQVVLLQGDGAFGFSGMEWD
TLVRHNVAVVSVIGNNGIWGLEKHPMEALYGYSVVAELRPGTRYDEVVRA
LGGHGELVSVPAELRPALERAFASGLPAVVNVLTDPSVAYPRRSNLA
>Rv3002c ilvN, PROBABLE ACETOLACTATE SYNTHASE (SMALL SUBUNIT) ILVN (ACETOHYDROXY-ACID SYNTHASE) (AHAS) (ALS)
MSPKTHTLSVLVEDKPGVLARVAALFSRRGFNIESLAVGATECKDRSRMT
IVVSAEDTPLEQITKQLNKLINVIKIVEQDDEHSVSRELALIKVQADAGS
RSQVIEAVNLFRANVIDVSPESLTVEATGNRGKLEALLRVLEPFGIREIA
QSGMVSLSRGPRGIGTAK
>Rv3509c ilvX, PROBABLE ACETOHYDROXYACID SYNTHASE ILVX (ACETOLACTATE SYNTHASE)
MNGAQALINTLVDGGVDVCFANPGTSEMHFVAALDAVPRMRGMLTLFEGV
ATGAADGYARIAGRPAAVLLHLGPGLGNGLANLHNARRARVPMVVVVGDH
ATYHKKYDAPLESDIDAVAGTVSGWVRRTEAAADVGADAEAAIAASRSGS
QIATLILPADVCWSDGAHAAAGVPAQAAAAPVDVGPVAGVLRSGEPAMML
IGGDATRGPGLTAAARIVQATGARWLCETFPTCLERGAGIPAVERLAYFA
EGAAAQLDGVKHLVLAGARSPVSFFAYPGMPSDLVPAGCEVHVLAEPGGA
ADALAALADEVAPGTVAPVAGASRPQLPTGDLTSVSAADVVGALLPERAI
VVDESNTCGVLLPQATAGAPAHDWLTLTGGAIGYGIPAAVGAAVAAPDRP
VLCLESDGSAMYTISGLWSQARENLDVTTVIYNNGAYDILRIELQRVGAG
SDPGPKALDLLDISRPTMDFVKIAEGMGVPARRVTTCEEFADALRAAFAE
PGPHLIDVVVPSLVG
>Rv3025c iscS, PROBABLE CYSTEINE DESULFURASE ISCS (NIFS PROTEIN HOMOLOG) (NITROGENASE METALLOCLUSTERS BIOSYNTHESIS PROTEIN NIFS)
MAYLDHAATTPMHPAAIEAMAAVQRTIGNASSLHTSGRSARRRIEEAREL
IADKLGARPSEVIFTAGGTESDNLAVKGIYWARRDAEPHRRRIVTTEVEH
HAVLDSVNWLVEHEGAHVTWLPTAADGSVSATALREALQSHDDVALVSVM
WANNEVGTILPIAEMSVVAMEFGVPMHSDAIQAVGQLPLDFGASGLSAMS
VAGHKFGGPPGVGALLLRRDVTCVPLMHGGGQERDIRSGTPDVASAVGMA
TAAQIAVDGLEENSARLRLLRDRLVEGVLAEIDDVCLNGADDPMRLAGNA
HFTFRGCEGDALLMLLDANGIECSTGSACTAGVAQPSHVLIAMGVDAASA
RGSLRLSLGHTSVEADVDAALEVLPGAVARARRAALAAAGASR
>Rv3476c kgtP, PROBABLE DICARBOXYLIC ACID TRANSPORT INTEGRAL MEMBRANE PROTEIN KGTP (DICARBOXYLATE TRANSPORTER)
MTVSIAPPSRPSQAETRRAIWNTIRGSSGNLVEWYDVYVYTVFATYFEDQ
FFDRADRNSTVYVYAIFAVTFVTRPVGSWFLGRFADRRGRRAALTFSVSL
MAACSLIVALVPSRSSIGVAAPILLILCRLVQGFATGGEYGTSATYMSEA
ATRERRGYFSSFQYVTLVGGHVLAQFTLLVILAVFTREQVHEFGWRIGFA
VGGGAAIVVFWLRRTMDESLSQERLTAIKAGRDHDSGSLRELATHYWKPL
LLCFLVTLGGTVAFYTYSVNAPAIVKSVYGSQAMTATWINLVGLILLMML
QPIGGMISDKIGRKPLLLWFGVGGLIYTYVLVTYLPETRSPTMSFLLVAV
GYVILTGYCSINALVKSELFPAHVRALGVGVGYALANSVFGGTAPLIYQA
LKERDQVPMFIAYVTACIAVSLIVYVFFIKNKADTYLDREQGFAFYGHA
>Rv3290c lat, PROBABLE L-LYSINE-EPSILON AMINOTRANSFERASE LAT (L-LYSINE AMINOTRANSFERASE) (LYSINE 6-AMINOTRANSFERASE)
MAAVVKSVALAGRPTTPDRVHEVLGRSMLVDGLDIVLDLTRSGGSYLVDA
ITGRRYLDMFTFVASSALGMNPPALVDDREFHAELMQAALNKPSNSDVYS
VAMARFVETFARVLGDPALPHLFFVEGGALAVENALKAAFDWKSRHNQAH
GIDPALGTQVLHLRGAFHGRSGYTLSLTNTKPTITARFPKFDWPRIDAPY
MRPGLDEPAMAALEAEALRQARAAFETRPHDIACFVAEPIQGEGGDRHFR
PEFFAAMRELCDEFDALLIFDEVQTGCGLTGTAWAYQQLDVAPDIVAFGK
KTQVCGVMAGRRVDEVADNVFAVPSRLNSTWGGNLTDMVRARRILEVIEA
EGLFERAVQHGKYLRARLDELAADFPAVVLDPRGRGLMCAFSLPTTADRD
ELIRQLWQRAVIVLPAGADTVRFRPPLTVSTAEIDAAIAAVRSALPVVT
>Rv3710 leuA, 2-ISOPROPYLMALATE SYNTHASE LEUA (ALPHA-ISOPROPYLMALATE SYNTHASE) (ALPHA-IPM SYNTHETASE)
MTTSESPDAYTESFGAHTIVKPAGPPRVGQPSWNPQRASSMPVNRYRPFA
EEVEPIRLRNRTWPDRVIDRAPLWCAVDLRDGNQALIDPMSPARKRRMFD
LLVRMGYKEIEVGFPSASQTDFDFVREIIEQGAIPDDVTIQVLTQCRPEL
IERTFQACSGAPRAIVHFYNSTSILQRRVVFRANRAEVQAIATDGARKCV
EQAAKYPGTQWRFEYSPESYTGTELEYAKQVCDAVGEVIAPTPERPIIFN
LPATVEMTTPNVYADSIEWMSRNLANRESVILSLHPHNDRGTAVAAAELG
FAAGADRIEGCLFGNGERTGNVCLVTLGLNLFSRGVDPQIDFSNIDEIRR
TVEYCNQLPVHERHPYGGDLVYTAFSGSHQDAINKGLDAMKLDADAADCD
VDDMLWQVPYLPIDPRDVGRTYEAVIRVNSQSGKGGVAYIMKTDHGLSLP
RRLQIEFSQVIQKIAEGTAGEGGEVSPKEMWDAFAEEYLAPVRPLERIRQ
HVDAADDDGGTTSITATVKINGVETEISGSGNGPLAAFVHALADVGFDVA
VLDYYEHAMSAGDDAQAAAYVEASVTIASPAQPGEAGRHASDPVTIASPA
QPGEAGRHASDPVTSKTVWGVGIAPSITTASLRAVVSAVNRAAR
>Rv2995c leuB, PROBABLE 3-ISOPROPYLMALATE DEHYDROGENASE LEUB (BETA-IPM DEHYDROGENASE) (IMDH) (3-IPM-DH)
MKLAIIAGDGIGPEVTAEAVKVLDAVVPGVQKTSYDLGARRFHATGEVLP
DSVVAELRNHDAILLGAIGDPSVPSGVLERGLLLRLRFELDHHINLRPAR
LYPGVASPLSGNPGIDFVVVREGTEGPYTGNGGAIRVGTPNEVATEVSVN
TAFGVRRVVADAFERARRRRKHLTLVHKTNVLTFAGGLWLRTVDEVGECY
PDVEVAYQHVDAATIHMITDPGRFDVIVTDNLFGDIITDLAAAVCGGIGL
AASGNIDATRANPSMFEPVHGSAPDIAGQGIADPTAAIMSVALLLSHLGE
HDAAARVDRAVEAHLATRGSERLATSDVGERIAAAL
>Rv2988c leuC, PROBABLE 3-ISOPROPYLMALATE DEHYDRATASE (LARGE SUBUNIT) LEUC (ISOPROPYLMALATE ISOMERASE) (ALPHA-IPM ISOMERASE) (IPMI)
MALQTGEPRTLAEKIWDDHIVVSGGGCAPDLIYIDLHLVHEVTSPQAFDG
LRLAGRRVRRPELTLATEDHNVPTVDIDQPIADPVSRTQVETLRRNCAEF
GIRLHSMGDIEQGIVHVVGPQLGLTQPGMTIVCGDSHTSTHGAFGALAMG
IGTSEVEHVLATQTLPLRPFKTMAVNVDGRLPDGVSAKDIILALIAKIGT
GGGQGHVIEYRGSAIESLSMEGRMTICNMSIEAGARAGMVAPDETTYAFL
RGRPHAPTGAQWDTALVYWQRLRTDVGAVFDTEVYLDAASLSPFVTWGTN
PGQGVPLAAAVPDPQLMTDDAERQAAEKALAYMDLRPGTAMRDIAVDAVF
VGSCTNGRIEDLRVVAEVLRGRKVADGVRMLIVPGSMRVRAQAEAEGLGE
IFTDAGAQWRQAGCSMCLGMNPDQLASGERCAATSNRNFEGRQGAGGRTH
LVSPAVAAATAVRGTLSSPADLN
>Rv2987c leuD, PROBABLE 3-ISOPROPYLMALATE DEHYDRATASE (SMALL SUBUNIT) LEUD (ISOPROPYLMALATE ISOMERASE) (ALPHA-IPM ISOMERASE) (IPMI)
MEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFAGWRSDP
AFVLNLSPFDRGSVLVAGPDFGTGSSREHAVWALMDYGFRVVISSRFGDI
FRGNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVV
LPFKIDDHSAWRLLEGLDDIALTLRKLDEIEAFEGACAYWKPRTLPAP
>Rv2046 lppI, Probable lipoprotein lppI
MRIAALVAVSLLIAGCSREVGGDVGQSQTIAPPAPAPSAAPSTPPAAGAP
ITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDA
RHTSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGP
FVYGNGPELANGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPF
GCLKPAPPPDGVGVAFGC
>Rv1166 lpqW, PROBABLE CONSERVED LIPOPROTEIN LPQW
MGVPSPVRRVCVTVGALVALACMVLAGCTVSPPPAPQSTDTPRSTPPPPR
RPTQIIMGIDWIGPGFNPHLLSDLSPVNAAISALVLPSAFRPIPDPNTPT
GSRWEMDPTLLVSADVTNNHPFTVTYKIRPEAQWTDNAPIAADDFWYLWQ
QMVTQPGVVDPAGYHLITSVQSLEGGKQAVVTFAQPYPAWRELFTDILPA
HIVKDIPGGFASGLARALPVTGGQFRVENIDPQRDEILIARNDRYWGPPS
KPGIILFRRAGAPAALADSVRNGDTQVAQVHGGSAAFAQLSAIPDVRTAR
IVTPRVMQFTLRANVPKLADTQVRKAILGLLDVDLLAAVGAGTDNTVTLD
QAQIRSPSDPGYVPTAPPAMSSAAALGLLEASGFQVDTNTSVSPAPSVPD
STTTSVSTGPPEVIRGRISKDGEQLTLVIGVAANDPTSVAVANTAADQLR
DVGIAATVLALDPVTLYHDALNDNRVDAIVGWRQAGGNLATLLASRYGCP
ALQATTVPAANAPTTAPSAPIGPTPSAAPDTATPPPTAPRRPSDPGALVK
APSNLTGICDRSIQSNIDAALNGTKNINDVITAVEPRLWNMSTVLPILQD
TTIVAAGPSVQNVSLSGAVPVGIVGDAGQWVKTGQ
>Rv0962c lprP, POSSIBLE LIPOPROTEIN LPRP
MKRTSRSLTAALLGIAALLAGCIKPNTFDPYANPGRGELDRRQKIVNGRP
DLETVQQQLANLDATIRAMIAKYSPQTRFSTGVTVSHLTNGCNDPFTRTI
GRQEASELFFGRPAPTPQQWLQIVTELAPVFKAAGFRPNNSVPGDPPQPL
GAPNYSQIRDDGVTINLVNGDNRGPLGYSYNTGCHPPAAWRTAPPPLNMR
PANDPDVHYPYLYGSPGGRTRDAY
>Rv1293 lysA, PROBABLE DIAMINOPIMELATE DECARBOXYLASE LYSA (DAP DECARBOXYLASE)
MNELLHLAPNVWPRNTTRDEVGVVCIAGIPLTQLAQEYGTPLFVIDEDDF
RSRCRETAAAFGSGANVHYAAKAFLCSEVARWISEEGLCLDVCTGGELAV
ALHASFPPERITLHGNNKSVSELTAAVKAGVGHIVVDSMTEIERLDAIAG
EAGIVQDVLVRLTVGVEAHTHEFISTAHEDQKFGLSVASGAAMAAVRRVF
ATDHLRLVGLHSHIGSQIFDVDGFELAAHRVIGLLRDVVGEFGPEKTAQI
ATVDLGGGLGISYLPSDDPPPIAELAAKLGTIVSDESTAVGLPTPKLVVE
PGRAIAGPGTITLYEVGTVKDVDVSATAHRRYVSVDGGMSDNIRTALYGA
QYDVRLVSRVSDAPPVPARLVGKHCESGDIIVRDTWVPDDIRPGDLVAVA
ATGAYCYSLSSRYNMVGRPAVVAVHAGNARLVLRRETVDDLLSLEVR
>Rv2386c mbtI, PUTATIVE ISOCHORISMATE SYNTHASE MBTI
MSELSVATGAVSTASSSIPMPAGVNPADLAAELAAVVTESVDEDYLLYEC
DGQWVLAAGVQAMVELDSDELRVIRDGVTRRQQWSGRPGAALGEAVDRLL
LETDQAFGWVAFEFGVHRYGLQQRLAPHTPLARVFSPRTRIMVSEKEIRL
FDAGIRHREAIDRLLATGVREVPQSRSVDVSDDPSGFRRRVAVAVDEIAA
GRYHKVILSRCVEVPFAIDFPLTYRLGRRHNTPVRSFLLQLGGIRALGYS
PELVTAVRADGVVITEPLAGTRALGRGPAIDRLARDDLESNSKEIVEHAI
SVRSSLEEITDIAEPGSAAVIDFMTVRERGSVQHLGSTIRARLDPSSDRM
AALEALFPAVTASGIPKAAGVEAIFRLDECPRGLYSGAVVMLSADGGLDA
ALTLRAAYQVGGRTWLRAGAGIIEESEPEREFEETCEKLSTLTPYLVARQ
>Rv3341 metA, PROBABLE HOMOSERINE O-ACETYLTRANSFERASE META (HOMOSERINE O-TRANS-ACETYLASE) (HOMOSERINE TRANSACETYLASE) (HTA)
MTISDVPTQTLPAEGEIGLIDVGSLQLESGAVIDDVCIAVQRWGKLSPAR
DNVVVVLHALTGDSHITGPAGPGHPTPGWWDGVAGPGAPIDTTRWCAVAT
NVLGGCRGSTGPSSLARDGKPWGSRFPLISIRDQVQADVAALAALGITEV
AAVVGGSMGGARALEWVVGYPDRVRAGLLLAVGARATADQIGTQTTQIAA
IKADPDWQSGDYHETGRAPDAGLRLARRFAHLTYRGEIELDTRFANHNQG
NEDPTAGGRYAVQSYLEHQGDKLLSRFDAGSYVILTEALNSHDVGRGRGG
VSAALRACPVPVVVGGITSDRLYPLRLQQELADLLPGCAGLRVVESVYGH
DGFLVETEAVGELIRQTLGLADREGACRR
>Rv1079 metB, PROBABLE CYSTATHIONINE GAMMA-SYNTHASE METB (CGS) (O-SUCCINYLHOMOSERINE [THIOL]-LYASE)
MSEDRTGHQGISGPATRAIHAGYRPDPATGAVNVPIYASSTFAQDGVGGL
RGGFEYARTGNPTRAALEASLAAVEEGAFARAFSSGMAATDCALRAMLRP
GDHVVIPDDAYGGTFRLIDKVFTRWDVQYTPVRLADLDAVGAAITPRTRL
IWVETPTNPLLSIADITAIAELGTDRSAKVLVDNTFASPALQQPLRLGAD
VVLHSTTKYIGGHSDVVGGALVTNDEELDEEFAFLQNGAGAVPGPFDAYL
TMRGLKTLVLRMQRHSENACAVAEFLADHPSVSSVLYPGLPSHPGHEIAA
RQMRGFGGMVSVRMRAGRRAAQDLCAKTRVFILAESLGGVESLIEHPSAM
THASTAGSQLEVPDDLVRLSVGIEDIADLLGDLEQALG
>Rv3340 metC, PROBABLE O-ACETYLHOMOSERINE SULFHYDRYLASE METC (HOMOCYSTEINE SYNTHASE) (O-ACETYLHOMOSERINE (THIOL)-LYASE) (OAH SULFHYDRYLASE) (O-ACETYL-L-HOMOSERINE S
MSADSNSTDADPTAHWSFETKQIHAGQHPDPTTNARALPIYATTSYTFDD
TAHAAALFGLEIPGNIYTRIGNPTTDVVEQRIAALEGGVAALFLSSGQAA
ETFAILNLAGAGDHIVSSPRLYGGTYNLFHYSLAKLGIEVSFVDDPDDLD
TWQAAVRPNTKAFFAETISNPQIDLLDTPAVSEVAHRNGVPLIVDNTIAT
PYLIQPLAQGADIVVHSATKYLGGHGAAIAGVIVDGGNFDWTQGRFPGFT
TPDPSYHGVVFAELGPPAFALKARVQLLRDYGSAASPFNAFLVAQGLETL
SLRIERHVANAQRVAEFLAARDDVLSVNYAGLPSSPWHERAKRLAPKGTG
AVLSFELAGGIEAGKAFVNALKLHSHVANIGDVRSLVIHPASTTHAQLSP
AEQLATGVSPGLVRLAVGIEGIDDILADLELGFAAARRFSADPQSVAAF
>Rv1133c metE, PROBABLE 5-METHYLTETRAHYDROPTEROYLTRIGLUTAMATE--HOMOCYSTEINE METHYLTRANSFERASE METE (methionine synthase, vitamin-B12 independent isozyme)
MTQPVRRQPFTATITGSPRIGPRRELKRATEGYWAGRTSRSELEAVAATL
RRDTWSALAAAGLDSVPVNTFSYYDQMLDTAVLLGALPPRVSPVSDGLDR
YFAAARGTDQIAPLEMTKWFDTNYHYLVPEIGPSTTFTLHPGKVLAELKE
ALGQGIPARPVIIGPITFLLLSKAVDGAGAPIERLEELVPVYSELLSLLA
DGGAQWVQFDEPALVTDLSPDAPALAEAVYTALCSVSNRPAIYVATYFGD
PGAALPALARTPVEAIGVDLVAGADTSVAGVPELAGKTLVAGVVDGRNVW
RTDLEAALGTLATLLGSAATVAVSTSCSTLHVPYSLEPETDLDDALRSWL
AFGAEKVREVVVLARALRDGHDAVADEIASSRAAIASRKRDPRLHNGQIR
ARIEAIVASGAHRGNAAQRRASQDARLHLPPLPTTTIGSYPQTSAIRVAR
AALRAGEIDEAEYVRRMRQEITEVIALQERLGLDVLVHGEPERNDMVQYF
AEQLAGFFATQNGWVQSYGSRCVRPPILYGDVSRPRAMTVEWITYAQSLT
DKPVKGMLTGPVTILAWSFVRDDQPLADTANQVALAIRDETVDLQSAGIA
VIQVDEPALRELLPLRRADQAEYLRWAVGAFRLATSGVSDATQIHTHLCY
SEFGEVIGAIADLDADVTSIEAARSHMEVLDDLNAIGFANGVGPGVYDIH
SPRVPSAEEMADSLRAALRAVPAERLWVNPDCGLKTRNVDEVTASLHNMV
AAAREVRAG
>Rv2124c metH, Probable 5-methyltetrahydrofolate--homocysteine methyltransferase MetH (Methionine synthase, vitamin-B12 dependent isozyme) (MS)
MTAADKHLYDTDLLDVLSQRVMVGDGAMGTQLQAADLTLDDFRGLEGCNE
ILNETRPDVLETIHRNYFEAGADAVETNTFGCNLSNLGDYDIADRIRDLS
QKGTAIARRVADELGSPDRKRYVLGSMGPGTKLPTLGHTEYAVIRDAYTE
AALGMLDGGADAILVETCQDLLQLKAAVLGSRRAMTRAGRHIPVFAHVTV
ETTGTMLLGSEIGAALTAVEPLGVDMIGLNCATGPAEMSEHLRHLSRHAR
IPVSVMPNAGLPVLGAKGAEYPLLPDELAEALAGFIAEFGLSLVGGCCGT
TPAHIREVAAAVANIKRPERQVSYEPSVSSLYTAIPFAQDASVLVIGERT
NANGSKGFREAMIAEDYQKCLDIAKDQTRDGAHLLDLCVDYVGRDGVADM
KALASRLATSSTLPIMLDSTETAVLQAGLEHLGGRCAINSVNYEDGDGPE
SRFAKTMALVAEHGAAVVALTIDEEGQARTAQKKVEIAERLINDITGNWG
VDESSILIDTLTFTIATGQEESRRDGIETIEAIRELKKRHPDVQTTLGLS
NISFGLNPAARQVLNSVFLHECQEAGLDSAIVHASKILPMNRIPEEQRNV
ALDLVYDRRREDYDPLQELMRLFEGVSAASSKEDRLAELAGLPLFERLAQ
RIVDGERNGLDADLDEAMTQKPPLQIINEHLLAGMKTVGELFGSGQMQLP
FVLQSAEVMKAAVAYLEPHMERSDDDSGKGRIVLATVKGDVHDIGKNLVD
IILSNNGYEVVNIGIKQPIATILEVAEDKSADVVGMSGLLVKSTVVMKEN
LEEMNTRGVAEKFPVLLGGAALTRSYVENDLAEIYQGEVHYARDAFEGLK
LMDTIMSAKRGEAPDENSPEAIKAREKEAERKARHQRSKRIAAQRKAAEE
PVEVPERSDVAADIEVPAPPFWGSRIVKGLAVADYTGLLDERALFLGQWG
LRGQRGGEGPSYEDLVETEGRPRLRYWLDRLSTDGILAHAAVVYGYFPAV
SEGNDIVVLTEPKPDAPVRYRFHFPRQQRGRFLCIADFIRSRELAAERGE
VDVLPFQLVTMGQPIADFANELFASNAYRDYLEVHGIGVQLTEALAEYWH
RRIREELKFSGDRAMAAEDPEAKEDYFKLGYRGARFAFGYGACPDLEDRA
KMMALLEPERIGVTLSEELQLHPEQSTDAFVLHHPEAKYFNV
>Rv0391 metZ, PROBABLE O-SUCCINYLHOMOSERINE SULFHYDRYLASE METZ (OSH SULFHYDRYLASE)
MTDESSVRTPKALPDGVSQATVGVRGGMLRSGFEETAEAMYLTSGYVYGS
AAVAEKSFAGELDHYVYSRYGNPTVSVFEERLRLIEGAPAAFATASGMAA
VFTSLGALLGAGDRLVAARSLFGSCFVVCSEILPRWGVQTVFVDGDDLSQ
WERALSVPTQAVFFETPSNPMQSLVDIAAVTELAHAAGAKVVLDNVFATP
LLQQGFPLGVDVVVYSGTKHIDGQGRVLGGAILGDREYIDGPVQKLMRHT
GPAMSAFNAWVLLKGLETLAIRVQHSNASAQRIAEFLNGHPSVRWVRYPY
LPSHPQYDLAKRQMSGGGTVVTFALDCPEDVAKQRAFEVLDKMRLIDISN
NLGDAKSLVTHPATTTHRAMGPEGRAAIGLGDGVVRISVGLEDTDDLIAD
IDRALS
>Rv3469c mhpE, PROBABLE 4-HYDROXY-2-OXOVALERATE ALDOLASE MHPE (HOA)
MLMTATHREPIVLDTTVRDGSYAVNFQYTDDDVRRIVGDLDAAGIPYIEI
GHGVTIGAAAAQGPAAHTDEEYFRAARSVVRNARLGAVIVPALARIETVD
LAGDYLDFLRICVIATEFELVMPFVERAQSKGLEVSIQLVKSHLFEPDVL
AAAGKRARDVGVRIVYVVDTTGTFLPEDARRYVEALRGASDVSVGFHGHN
NLAMAVANTLEAFDAGADFLDGTLMGFGRGAGNCQIECLVAALQRRGHLA
AVDLDRIFDAARSDMLGRSPQSYGIDPWEISFGFHGLDSLQVEHLRAAAQ
QAGLSVSHVIRQTAKSHAGQWLSPQDIDRVVVGMRA
>Rv2458 mmuM, PROBABLE HOMOCYSTEINE S-METHYLTRANSFERASE MMUM (S-METHYLMETHIONINE:HOMOCYSTEINE METHYLTRANSFERASE) (CYSTEINE METHYLTRANSFERASE)
MELVSDSVLISDGGLATELEARGHDLSDPLWSARLLVDAPHAITAVHTAY
FRAGAQIATTASYQASFEGFAARGIGHDDATVLLRRSVELAQAARDEVGV
GGLSVAASVGPYGAALADGSEYRGYYGLSVAALMKWHLPRLEVLVDAGAD
MLALETIPDIDEAEALVNLVRRLATPAWLSYTINGTRTRAGQPLTDAFAV
AAGVPEIVAVGVNCCAPDDVLPAIAFAVAHTGKPVIVYPNSGEGWDGRRR
AWVGPRRFSGSSGQLAREWVAAGARIVGGCCRVRPIDIAEIGRALTTAPP
RG
>Rv1859 modC, PROBABLE MOLYBDENUM-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER MODC
MSKLQLRAVVADRRLDVEFSVSAGEVLAVLGPNGAGKSTALHVIAGLLRP
DAGLVRLGDRVLTDTEAGVNVATHDRRVGLLLQDPLLFPHLSVAKNVAFG
PQCRRGMFGSGRARTRASALRWLREVNAEQFADRKPRQLSGGQAQRVAIA
RALAAEPDVLLLDEPLTGLDVAAAAGIRSVLRSVVARSGCAVVLTTHDLL
DVFTLADRVLVLESGTIAEIGPVADVLTAPRSRFGARIAGVNLVNGTIGP
DGSLRTQSGAHWYGTPVQDLPTGHEAIAVFPPTAVAVYPEPPHGSPRNIV
GLTVAEVDTRGPTVLVRGHDQPGGAPGLAACITVDAATELRVAPGSRVWF
SVKAQEVALHPAPHQHASS
>Rv1902c nanT, PROBABLE SIALIC ACID-TRANSPORT INTEGRAL MEMBRANE PROTEIN NANT
MAAPRLTGDQRNAFMASFLGWTMDAFDYFLVVLVYADIATTFHHTKTDVA
FLTTATLAMRPVGALLFGLWADRVGRRVPLMVDVSFYSVIGFLCAFAPNF
TVLVILRLLYGIGMGGEWGLGAALSMEKVPAERRGVFSGLLQEGYAFGYL
LASVAALVVMNWLGLSWRWLFGLSIIPALISLIIRYRVKESEVWEAAQDR
MRLTKTRIRDVLGNPAIVRRFVYLVLLMTAFNWMSHGTQDVYPTFLTATT
DHGAGLSSLTARWIVVIYNIGAIIGGLAFGTLSQRFSRRYTIVFCAALGL
PIVPLFAYSRTAAMLCLGSFLMQVFVQGAWGVIPAHLTEMSPDAIRGVYP
GVTYQLGNLLAAFNLPIQERLAESHGYPFALAATIVPVLLVVAVLTAIGK
DATGIRFGTTETAFLVRHRNRH
>Rv0266c oplA, PROBABLE 5-OXOPROLINASE OPLA (5-OXO-L-PROLINASE) (PYROGLUTAMASE) (5-OPASE)
MVGAGWHFWVDRGGTFTDVVARRPDGRLLTHKLLSDNPARYRDAAVAGIR
ALLANGEAGTRVDAVRMGTTVATNALLERTGERTLLVITRGFGDALRIAY
QNRPRIFDRRIVLPEMLYERVVEVDERVTADGRVLRAPDLEALGEKMRQA
HADGIRAVAVVCLHSYLYPGHEREIGTLAQRIGFAQISLSSEVSPLMKLV
PRGDTTVVDAYLSPVLRRYINQVADQMRGVRLMFMQSNGGLAQAGHFRGK
DAILSGPAGGIVGMVRMSALAGFDHVIGFDMGGTSTDVSHYAGEYERVFT
TQVAGVRLRAPMLDIHTVAAGGGSILHFDGSRYRVGPDSAGADPGPACYR
GGGPLCVTDANVMLGRIQPTHFPSVFGPSGDQPLDAGTVRRGFTDLAADI
AARTGDDRSPEQVAEGYLRIAVANMANAVKKISVQKGHDVTRYALTTFGG
AGGQHACAVADALGIRTVLIPPMAGVLSALGIGLADTTAMREQSVEIPLG
PAAPQRLASVAESLERAARAELLDEGVPGERIRVVRRVHLRYEGTDTAIP
VQLAEIETMATAFESSHRALYTFLLDRPLIAEAISVEATGLTDQPDLSQL
GDQANDTTGSSETVRIYSNGLWRDAPLRRREAMRPGDVLTGPAIIAEANA
TTVVDDGWQATMTETGHLLAQRVVTPPRPDAATRAGFEAGFEADPVLLEI
FNNLFMSIAEQMGFRLEATAQSVNIRERLDFSCALFDPDGNLVANAPHIP
VHLGSMGTTVKEVIRRRLSGMKPGDVYAVNDPYHGGTHLPDITVITPVFN
TGGEDVLFFVASRGHHAEIGGITPGSMPADSREIHEEGVLFDNWLLAENG
RFREAETRRLLTEAPFGSRNPDTNLADLRAQIAANQKGVDEVGKMIDHFG
RDVVAAYMRHVQDNAEEAVRRVIDRLDNGAYRYRMDSGATIAVRITVDRA
ARSATIDFTGTSAQLDTNFNAPTSVVNAAVLYVFRTLVADDIPLNDGCLR
PLRIVVPEGSMLAPTHPAAVVAGNVETSQAITGALFAALGVQAEGSGTMN
NVTFGNERHQYYETVGSGSGAGDGYHGASVVQTHMTNSRLTDPEVLEWRY
PVLLREFAVRQGSGGAGRWRGGDGAVRRLEFTEPMTVSTLSGHRRVRPYG
MAGGSPGELGRNRVERADGSTVELAGCGSTHVEPGDTLVIETPGGGGYGP
ASTSARRRR
>Rv1280c oppA, PROBABLE PERIPLASMIC OLIGOPEPTIDE-BINDING LIPOPROTEIN OPPA
MADRGQRRGCAPGIASALRASFQGKSRPWTQTRYWAFALLTPLVVAMVLT
GCSASGTQLELAPTADRRAAVGTTSDINQQDPATLQDGGNLRLSLTDFPP
NFNILHIDGNNAEVAAMMKATLPRAFIIGPDGSTTVDTNYFTSIELTRTA
PQVVTYTINPEAVWSDGTPITWRDIASQIHAISGADKAFEIASSSGAERV
ASVTRGVDDRQAVVTFAKPYAEWRGMFAGNGMLLPASMTATPEAFNKGQL
DGPGPSAGPFVVSALDRTAQRIVLTRNPRWWGARPRLDSITYLVLDDAAR
LPALQNNTIDATGVGTLDQLTIAARTKGISIRRAPGPSWYHFTLNGAPGS
ILADKALRLAIAKGIDRYTIARVAQYGLTSDPVPLNNHVFVAGQDGYQDN
SGVVAYNPEQAKRELDALGWRRSGAFREKDGRQLVIRDLFYDAQSTRQFA
QIAQHTLAQIGVKLELQAKSGSGFFSDYVNVGAFDIAQFGWVGDAFPLSS
LTQIYASDGESNFGKIGSPQIDAAIERTLAELDPGKARALANQVDELIWA
EGFSLPLTQSPGTVAVRSTLANFGATGLADLDYTAIGFMRR
>Rv1283c oppB, PROBABLE OLIGOPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER OPPB
MTRYLARRLLNYLVLLALASFLTYCLTSLAFSPLESLMQRSPRPPQAVID
AKAHDLGLDRPILARYANWVSHAVRGDFGTTITGQPVGTELGRRIGVSLR
LLVVGSVFGTVAGVVIGAWGAIRQYRLSDRVMTTLALLVLSTPTFVVANL
LILGALRVNWAVGIQLFDYTGETSPGVAGGVWDRLGDRLQHLILPSLTLA
LAAAAGFSRYQRNAMLDVLGQDFIRTARAKGLTRRRALLKHGLRTALIPM
ATLFAYGVAGLVTGAVFVEKIFGWHGMGEWMVRGISTQDTNIVAAITVFS
GAVVLLAGLLSDVIYAALDPRVRVS
>Rv1282c oppC, PROBABLE OLIGOPEPTIDE-TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER OPPC
MTEFASRRTLVVRRFLRNRAAVASLAALLLLFVSAYALPPLLPYSYDDLD
FNALLQPPGTKHWLGTNALGQDLLAQTLRGMQKSMLIGVCVAVISTGIAA
TVGAISGYFGGWRDRTLMWVVDLLLVVPSFILIAIVTPRTKNSANIMFLV
LLLAGFGWMISSRMVRGMTMSLREREFIRAARYMGVSSRRIIVGHVVPNV
ASILIIDAALNVAAAILAETGLSFLGFGIQPPDVSLGTLIADGTASATAF
PWVFLFPASILVLILVCANLTGDGLRDALDPASRSLRRGVR
>Rv0118c oxcA, PROBABLE OXALYL-CoA DECARBOXYLASE OXCA
MTTRSASPCTVLTDGCHLVVDALKANDVDTIYGVVGIPITDLARAAQASG
IRYIGFRHEASAGNAAAAAGFLTARPGVCLTTSGPGFLNGLPALANATTN
CFPMIQISGSSSRPMVDLQRGDYQDLDQLNAARPFVKAAYRIGQVQDIGR
GVARAIRTATSGRPGGVYLDIPGDVLGQAVEASAASGAIWRPVDPAPRLL
PAPEAIDRALDVLAQAQRPLLVLSKGAAYAQADNVIREFVEHTGIPFLPM
SMAKGLLPDSHPQSAAAARSLAMARADVVLLVGARLNWLLGNGESPQWSA
DAKFIQVDIEASEFDSNRPIVAPLTGDIGSVMSALLEAAADRSSVASAAW
TGELADRKARNSAKMRRRLADDHHPMRFYNALGAIRSVLQRNPDVYVVNE
GANALDLARNIIDMHLPRHRLDSGTWGVMGIGMGYAIAAAVETGRPVVAI
EGDSAFGFSGMEFETICRYRLPVTVVILNNGGVYRGDEATIFRSAAPVWR
HDPAPTVLNAHARHELIAEAFGGKGYHVSTPTELESALTDALASNGPSLI
DCELDPADGVESGHLAKLNTTSAATPAISGDG
>Rv1005c pabB, Probable para-aminobenzoate synthase component I PABD
MNLAWELSTRTKSPRSHLRCENPQFCQARTVRIDRLGDLGGAPAVLRAVG
RATSRLDLPPPAALTGEWFGALAVIAPSVSIQPVSGDDVFSGPPGTGGPD
ATGAVGGGWVGYLSYPDAGADGRPHRIPEAAGGWTDCVLRRDRDGQWWYE
SLSGAPIADWLASALATTRASVARPAPACRIDWEPADRAAHRDGVLACLE
AIGAGEVYQACVCTQFAGTVTGSPLDFFIDGFGRTAPSRSAFVAGPWGAV
ASLSPELFLRRRGSVVTSSPIKGTLPLDAPPSALRASAKEVAENIMIVDL
VRNDLGRVAVTGTVTVPELLVVRPAPGVWHLVSTVSARVPLEEPMSALLD
AAFPPASVTGTPKLRARQLISQWERYRRGIYCGTVGLASPVAGCELNVAI
RTVEFDTAGNAVLGVGGGITADSDPDAEWAECLHKAAPIVGLPAATRTTP
ARLASKVR
>Rv2213 pepB, Probable aminopeptidase PepB
MTTEPGYLSPSVAVATSMPKRGVGAAVLIVPVVSTGEEDRPGAVVASAEP
FLRADTVAEIEAGLRALDATGASDQVHRLAVPSLPVGSVLTVGLGKPRRE
WPADTIRCAAGVAARALNSSEAVITTLAELPGDGICSATVEGLILGSYRF
SAFRSDKTAPKDAGLRKITVLCCAKDAKKRALHGAAVATAVATARDLVNT
PPSHLFPAEFAKRAKTLSESVGLDVEVIDEKALKKAGYGGVIGVGQGSSR
PPRLVRLIHRGSRLAKNPQKAKKVALVGKGITFDTGGISIKPAASMHHMT
SDMGGAAAVIATVTLAARLRLPIDVIATVPMAENMPSATAQRPGDVLTQY
GGTTVEVLNTDAEGRLILADAIVRACEDKPDYLIETSTLTGAQTVALGTR
IPGVMGSDEFRDRVAAISQRVGENGWPMPLPDDLKDDLKSTVADLANVSG
QRFAGMLVAGVFLREFVAESVDWAHIDVAGPAYNTGSAWGYTPKGATGVP
TRTMFAVLEDIAKNG
>Rv0800 pepC, PROBABLE AMINOPEPTIDASE PEPC
MAATAHGLCEFIDASPSPFHVCATVAGRLLGAGYRELREADRWPDKPGRY
FTVRAGSLVAWNAEQSGHTQVPFRIVGAHTDSPNLRVKQHPDRLVAGWHV
VALQPYGGVWLHSWLDRDLGISGRLSVRDGTGVSHRLVLIDDPILRVPQL
AIHLAEDRKSLTLDPQRHINAVWGVGERVESFVGYVAQRAGVAAADVLAA
DLMTHDLTPSALIGASVNGTASLLSAPRLDNQASCYAGMEALLAVDVDSA
SSGFVPVLAIFDHEEVGSASGHGAQSDLLSSVLERIVLAAGGTREDFLRR
LTTSMLASADMAHATHPNYPDRHEPSHPIEVNAGPVLKVHPNLRYATDGR
TAAAFALACQRAGVPMQRYEHRADLPCGSTIGPLAAARTGIPTVDVGAAQ
LAMHSARELMGAHDVAAYSAALQAFLSAELSEA
>Rv2089c pepE, Probable dipeptidase PepE
MGSRRFDAEVYARRLALAAAATADAGLAGLVITPGYDLCYLIGSRAETFE
RLTALVLPAAGAPAVVLPRLELAALKQSAAAELGLRVCDWVDGDDPYGLV
SAVLGGAPVATAVTDSMPALHMLPLADALGVLPVLATDVLRRLRMVKEET
EIDALRKAGAAIDRVHARVPEFLVPGRTEADVAADIAEAIVAEGHSEVAF
VIVGSGPHGADPHHGYSDRELREGDIVVVDIGGTYGPGYHSDSTRTYSIG
EPDSDVAQSYSMLQRAQRAAFEAIRPGVTAEQVDAAARDVLAEAGLAEYF
VHRTGHGIGLCVHEEPYIVAGNDLVLVPGMAFSIEPGIYFPGRWGARIED
IVIVTEDGAVSVNNCPHELIVVPVS
>Rv2467 pepN, PROBABLE AMINOPEPTIDASE N PEPN (LYSYL AMINOPEPTIDASE) (LYS-AP) (ALANINE AMINOPEPTIDASE)
MALPNLTRDQAVERAALITVDSYQIILDVTDGNGAPGERTFRSTTTVVFD
ALPGADTVIDISAHTVRRASLNDQDLDVSGYDEAAGIPLRGLAQRNVVVV
DADCHYSNTGEGLHRFVDPVDGETYLYSQFETADAKRMFACFDQPDLKAT
FDVRVTAPAHWKVISNGAPLAAANGVHTFATTPRMSTYLVALIAGPYAAW
TDTYIDDHGEIPLGIYCRASLAEYMDAERLFTQTKQGFGFYHKHFGLPYA
FGKYDQLFVPEFNAGAMENAGAVTFLEDYVFRSKVTRASYERRAETVLHE
MAHMWFGDLVTMTWWDDLWLNESFATFASVLCQSEATEFTEAWTTFATVE
KSWAYRQDQLPSTHPIAADIPDLAAVEVNFDGITYAKGASVLKQLVAYVG
LERFLAGLRDYFRTHAFGNASFDDLLAALEKASGRDLSNWGEQWLKTTGL
NTLRPDFEVDAEGRFTRFAVTQSGAAPGAGETRVHRLAVGIYDDDGSKSS
GKLVRVHREELDVSGPITNVPALVGVSRGKLILVNDDDLTYCSLRLDERS
LQTALDRIADIAEPLPRTLVWSAAWEMTREAELRARDFVSLVSGGVHAET
EVGVAQRLLLQAQTALGCYAEPGWARERGWPQFADRLLELAREAEPGSDH
QLAYINSLCSSVLSPRHVQTLGALLEGEPAACGLAGLAVDTDLRWRIVTA
LATAGAIDADGPETPRIDAEVQRDPTAAGKRHAAQARAARPQFVVKDEAF
TTVVEDDTLANATGRAMIAGIAAPGQGELLKPFARRYFQAIPGVWARRSS
EVAQSVVIGLYPHWDISEQGITAAEEFLSDPEVPPALRRLVLEGQAAVQR
SLRARNFDADG
>Rv2535c pepQ, PROBABLE CYTOPLASMIC PEPTIDASE PEPQ
MTHSQRRDKLKAQIAASGLDAMLISDLINVRYLSGFSGSNGALLVFADER
DAVLATDGRYRTQAASQAPDLEVAIERAVGRYLAGRAGEAGVGKLGFESH
VVTVDGLDALAGALEGKNTELVRASGTVESLREVKDAGELALLRLACEAA
DAALTDLVARGGLRPGRTERQVSRELEALMLDHGADAVSFETIVAAGANS
AIPHHRPTDAVLQVGDFVKIDFGALVAGYHSDMTRTFVLGKAADWQLEIY
QLVAEAQQAGRQALLPGAELRGVDAAARQLIADAGYGEHFGHGLGHGVGL
QIHEAPGIGVTSAGTLLAGSVVTVEPGVYLPGRGGVRIEDTLVVAGGTPK
MPETAGQTPELLTRFPKELAIL
>Rv3838c pheA, POSSIBLE PREPHENATE DEHYDRATASE PHEA
MVRIAYLGPEGTFTEAALVRMVAAGLVPETGPDALQRMPVESAPAALAAV
RDGGADYACVPIENSIDGSVLPTLDSLAIGVRLQVFAETTLDVTFSIVVK
PGRNAADVRTLAAFPVAAAQVRQWLAAHLPAADLRPAYSNADAARQVADG
LVDAAVTSPLAAARWGLAALADGVVDESNARTRFVLVGRPGPPPARTGAD
RTSAVLRIDNQPGALVAALAEFGIRGIDLTRIESRPTRTELGTYLFFVDC
VGHIDDEAVAEALKAVHRRCADVRYLGSWPTGPAAGAQPPLVDEASRWLA
RLRAGKPEQTLVRPDDQGAQA
>Rv2483c plsC, POSSIBLE TRANSMEMBRANE PHOSPHOLIPID BIOSYNTHESIS BIFUNCTIONAL ENZYME PLSC: PUTATIVE L-3-PHOSPHOSERINE PHOSPHATASE (O-PHOSPHOSERINE PHOSPHOHYDROLASE)
MSAADEQGEERATRKSAPDLRLPGSVAEILASPAGPKVGAFFDLDGTLVA
GFTAVILTQERLRRRDMGVGELLGMVQAGLNHTLGRIEFEDLIGKAAAAL
AGRLLTDLEEIGERLFAQRIESRIYPEMRELVRAHVARGHTVVLSSSALT
IQVGPVARFLGINNMLTNKFETNEDGILTGGVLKPILWCPGKATAVQRFA
AEHDIDLKDSYFYADGDEDVALMYLVGNPRPTNPEGKMAAVAKRRGWPIL
KFNSRGGVGIRRQLRTLAGLSTIVPVAAGAVGIGVLTGSRRRGVNFFTST
FSQLLLATSGVHLNVIGKENLTAQRPAVFIFNHRNQVDPVIAGALVRDNW
VGVGKKELASDPIMGTLGKLLDGVFIDRDDPVAAVETLHTVEERARNGLS
IVIAPEGTRLDTTEVGSFKKGPFRIAMAAKIPIVPIVIRNAEIVASRNST
TINPGTVDVAVFPPIPVDDWTLDALPDRIAEVRQLYLDTLADWPVDGLPA
VDLYAEQKAARKARAQVAKATAKRVPAKKAPAKSAANKGAAATKAATKKA
SPKAKPSESKIAGKDGEASASPSSSAKGRS
>Rv2427c proA, PROBABLE GAMMA-GLUTAMYL PHOSPHATE REDUCTASE PROTEIN PROA (GPR) (GLUTAMATE-5-SEMIALDEHYDE DEHYDROGENASE) (GLUTAMYL-GAMMA-SEMIALDEHYDE DEHYDROGENASE)
MTVPAPSQLDLRQEVHDAARRARVAARRLASLPTTVKDRALHAAADELLA
HRDQILAANAEDLNAAREADTPAAMLDRLSLNPQRVDGIAAGLRQVAGLR
DPVGEVLRGYTLPNGLQLRQQRVPLGVVGMIYEGRPNVTVDAFGLTLKSG
NAALLRGSSSAAKSNEALVAVLRTALVGLELPADAVQLLSAADRATVTHL
IQARGLVDVVIPRGGAGLIEAVVRDAQVPTIETGVGNCHVYVHQAADLDV
AERILLNSKTRRPSVCNAAETLLVDAAIAETALPRLLAALQHAGVTVHLD
PDEADLRREYLSLDIAVAVVDGVDAAIAHINEYGTGHTEAIVTTNLDAAQ
RFTEQIDAAAVMVNASTAFTDGEQFGFGAEIGISTQKLHARGPMGLPELT
STKWIAWGAGHTRPA
>Rv2439c proB, PROBABLE GLUTAMATE 5-KINASE PROTEIN PROB (GAMMA-GLUTAMYL KINASE) (GK)
MRSPHRDAIRTARGLVVKVGTTALTTPSGMFDAGRLAGLAEAVERRMKAG
SDVVIVSSGAIAAGIEPLGLSRRPKDLATKQAAASVGQVALVNSWSAAFA
RYGRTVGQVLLTAHDISMRVQHTNAQRTLDRLRALHAVAIVNENDTVATN
EIRFGDNDRLSALVAHLVGADALVLLSDIDGLYDCDPRKTADATFIPEVS
GPADLDGVVAGRSSHLGTGGMASKVAAALLAADAGVPVLLAPAADAATAL
ADASVGTVFAARPARLSARRFWVRYAAEATGALTLDAGAVRAVVRQRRSL
LAAGITAVSGRFCGGDVVELRAPDAAMVARGVVAYDASELATMVGRSTSE
LPGELRRPVVHADDLVAVSAKQAKQV
>Rv0500 proC, PROBABLE PYRROLINE-5-CARBOXYLATE REDUCTASE PROC (P5CR) (P5C REDUCTASE)
MLFGMARIAIIGGGSIGEALLSGLLRAGRQVKDLVVAERMPDRANYLAQT
YSVLVTSAADAVENATFVVVAVKPADVEPVIADLANATAAAENDSAEQVF
VTVVAGITIAYFESKLPAGTPVVRAMPNAAALVGAGVTALAKGRFVTPQQ
LEEVSALFDAVGGVLTVPESQLDAVTAVSGSGPAYFFLLVEALVDAGVGV
GLSRQVATDLAAQTMAGSAAMLLERMEQDQGGANGELMGLRVDLTASRLR
AAVTSPGGTTAAALRELERGGFRMAVDAAVQAAKSRSEQLRITPE
>Rv3758c proV, POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER PROV
MICFDDVSKVYAHGATAVDRLTLEVPNGMLTVFVGPSGCGKTTALRMINR
MVDPTSGTITVDGTDVSTVNAVKLRLGIGYVIQNAGLMPHQRVIDNVATV
PVLKGQPRRAARKAGYEVLERVGLDPKVATRYPAQLSGGEQQRVGVARAL
AADPPILLMDEPFSAVDPVVRHELQNEILRLQAELHKTIVFVTHDIDEAL
KLADLVAVFAPGGALAQYDETARLLSSPANDFVSKFIGLGRGYRWLQLFD
AAGLPVRDIEQVSVNGLSDARDRQVRDGWVLVVDGAGAPLGWIDADGRRR
HRGGAALSDAMTVGGSVFRPNGNLSQALDAALSSPSGVGVAVDGGGKVIG
GILAADVLAEFQKGKKAGGGAKPCTT
>Rv3757c proW, POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER PROW
MHYLMTHPGAAWALTVVHLRLSLLPVLIGLMSAVPLGLLVQRAPLLRRLT
TATASVIFTIPSLALFVVLPLIIGTRILDEANVIVALAAYTTALLVRAVL
EALDAVPAQVHDAATAIGYSRIAQMLKVELPLSIPVLVAGLRVVAVTNIA
MVSVGSVIGIGGLGTWFTAGYQTNKSDQIVAGVVAMFLLAIVVDVVINLA
GRLATPWERAPRAARRRRQVAAPITGGAR
>Rv3756c proZ, POSSIBLE OSMOPROTECTANT (GLYCINE BETAINE/CARNITINE/CHOLINE/L-PROLINE) TRANSPORT INTEGRAL MEMBRANE PROTEIN ABC TRANSPORTER PROZ
MNFLQQALSYLLTASNWTGPVGLAVRTCEHLEYTAVAVAASALIAVPVGL
LIGHTGRGTLLVVGAVNGLRALPTLGVLLLGVLLFGLGLGPPLVALMLLG
IPSLLASTYAGIASVDPLVVDAARAMGMTESQVLLRVEVPNALPLMLGGL
RSATLQVVATATVAAYASLGGLGGYLIDGIKERRFHIALVGAMMVAALAL
TLDGLLALAGWVSVPGTGRMRKLAAVVDKPAAGGGHALR
>Rv1017c prsA, PROBABLE RIBOSE-PHOSPHATE PYROPHOSPHOKINASE PRSA (Phosphoribosyl pyrophosphate synthetase) (PRPP synthetase)
MSHDWTDNRKNLMLFAGRAHPELAEQVAKELDVHVTSQDAREFANGEIFV
RFHESVRGCDAFVLQSCPAPVNRWLMEQLIMIDALKRGSAKRITAVMPFY
PYARQDKKHRGREPISARLIADLLKTAGADRIVTVDLHTDQIQGFFDGPV
DHMRGQNLLTGYIRDNYPDGNMVVVSPDSGRVRIAEKWADALGGVPLAFI
HKTRDPRVPNQVVSNRVVGDVAGRTCVLIDDMIDTGGTIAGAVALLHNDG
AGDVIIAATHGVLSDPAAQRLASCGAREVIVTNTLPIGEDKRFPQLTVLS
IAPLLASTIRAVFENGSVTGLFDGDA
>Rv0781 ptrBa, PROBABLE PROTEASE II PTRBA [FIRST PART] (OLIGOPEPTIDASE B)
MMHRTALPSPPVAKRVQTRREHHGDVFVDPYEWLRDKDSPEVIAYLEAEN
DYTERTTAHLEPLRQKIFHEIKARTKETDLSVPTRRGNWWYYARTFEGKQ
YGVHCRCPVTDPDDWNPPEFDERTEIPGEQLLLDENVEADGHDFFALGAA
SVSLDDNLLAYSVDVVGDERYTLRFKDLRTGEQYPDEIAGIGAGVTWAAD
NHCLLHHRGRGLASGHSVAIPTRVRRIVGAGLPRSR
>Rv0782 ptrBb, PROBABLE PROTEASE II PTRBB [SECOND PART] (OLIGOPEPTIDASE B)
MTNDIPCGSRIYAPENSTRTRSPGSERESPGQLTTTVYYTTVDAAWRPDT
VWRYRLGSGESSERVYHEADDRFWLAVGRTRSNAYLLIAAGSSITSEVRY
AHAADPTAQFSVVLPRRDGVEYSVEHAVIAGQDRFLILHNDGAVNFTLVE
APVEDPARQRTLIAHRDDVRLDAVDALAGHLVVSYRREALPRVQLWPIGP
DGNYGEPEEISFDSELMSAGLGPNPNWDSPKLRVGAGSFVTPVRIYDIDL
VTGERTLLKEQPVLGGYRREDYVERRDWAYGDDGTRIPVSIVHRADIEFP
APALIYGYGAYEICEDPRFSIARLSLLDRGMVFVVAHVRGGGEMGRLWYE
NGKLLDKKNTFTDFIAVARHLVDTGLTSQQQLVALGGSAGGLLMGAVANM
APDLFAGILAQVPFVDPLTTILDPSLPLTVTEWDEWGNPLNDSDVYAYVK
SYSPYENVTAQKYPAILAMTSLNDTRVYYVEPAKWVAALRHAKTDGNSVL
LKTQMHAGHGGISGRYERWKETAFQYGWLLATADSDRYGGGQGNDLDGAA
PA
>Rv2322c rocD1, PROBABLE ORNITHINE AMINOTRANSFERASE (N-terminus part) ROCD1 (ORNITHINE--OXO-ACID AMINOTRANSFERASE)
MTNLADATQATMALVERHAAHNYSPLPVVAASAEGAWIADIDGLRYLDWL
AAYSAVNLGHRNPASTATAHAQVDTVTLLNRALHADRLGPLGAALAQLCG
KDVVLPMNSDAEAVESGLRVARKWGADVNGLPAGRHDIILANNNFHGHTS
SVVSFSSDPAAGSGVEPSTPGLRSVPFGDAAAPAQTIDDNTVADLLEPIP
GQAGIIVPADDYLPAASSTTC
>Rv2321c rocD2, PROBABLE ORNITHINE AMINOTRANSFERASE (C-terminus part) ROCD2 (ORNITHINE--OXO-ACID AMINOTRANSFERASE)
MIADEIQSGLACTGYPFACDHGGVLPDIYLLGKTLGGGAVPLSAMVADRE
IFGVVHPGEHGSTFGGNPLAAAIGTPVVSMVVWGECQARSAKLGAHLHQR
LADLIGDGAVALRGLGWWADVDIERALAIGTDMSMRLADRGVLLKDTYGA
ALRFAPPLVITAQEIDCAVRRFADALWEAGS
>Rv2320c rocE, PROBABLE CATIONIC AMINO ACID TRANSPORT INTEGRAL MEMBRANE PROTEIN ROCE
MPTTSMSLRELMLRRRPVSGAPVASGASGNLKRSFGTFQLTMFGVGATIG
TGIFFVLAQAVPEAGPGVIVSFIIAGIAAGLAAICYAELASAVPISGSAY
SYAYTTLGEAVAMVVAACLLLEYGVATAAVAVGWSGYVNKLLSNLFGFQM
PHVLSAAPWDTHPGWVNLPAVILIGLCALLLIRGASESARVNAIMVLIKL
GVLGMFMIIAFSAYSADHLKDFVPFGVAGIGSAAGTIFFSYIGLDAVSTA
GDEVKDPQKTMPRALIAALVVVTGVYVLVALAALGTQPWQDFAEQETAGL
AIILDNVTHGEWASTILAAGAVVSIFTVTLVTMYGQTRILFAMGRDGLLP
ARFAKVNPRTMTPVHNTVIVAIFASTLAAFIPLDSLADMVSIGTLTAFSV
VAVGVIVLRVREPDLPRGFKVPGYPVTPVLSVLACGYILASLHWYTWLAF
SGWVAVAVIFYLMWGRHHSALNEEVP
>Rv0069c sdaA, PROBABLE L-SERINE DEHYDRATASE SDAA (L-SERINE DEAMINASE) (SDH) (L-SD)
MTISVFDLFTIGIGPSSSHTVGPMRAANQFVVALRRRGHLDDLEAMRVDL
FGSLAATGAGHGTMSAILLGLEGCQPETITTEHKERRLAEIAASGVTRIG
GVIPVPLTERDIDLHPDIVLPTHPNGMTFTAAGPHGRVLATETYFSVGGG
FIVTEQTSGNSGQHPCSVALPYVSAQELLDICDRLDVSISEAALRNETCC
RTENEVRAALLHLRDVMVECEQRSIAREGLLPGGLRVRRRAKVWYDRLNA
EDPTRKPEFAEDWVNLVALAVNEENASGGRVVTAPTNGAAGIVPAVLHYA
IHYTSAGAGDPDDVTVRFLLTAGAIGSLFKERASISGAEVGCQGEVGSAA
AMAAAGLAEILGGTPRQVENAAEIAMEHSLGLTCDPIAGLVQIPCIERNA
ISAGKAINAARMALRGDGIHRVTLDQVIDTMRATGADMHTKYKETSAGGL
AINVAVNIVEC
>Rv2996c serA1, PROBABLE D-3-PHOSPHOGLYCERATE DEHYDROGENASE SERA1 (PGDH)
MSLPVVLIADKLAPSTVAALGDQVEVRWVDGPDRDKLLAAVPEADALLVR
SATTVDAEVLAAAPKLKIVARAGVGLDNVDVDAATARGVLVVNAPTSNIH
SAAEHALALLLAASRQIPAADASLREHTWKRSSFSGTEIFGKTVGVVGLG
RIGQLVAQRIAAFGAYVVAYDPYVSPARAAQLGIELLSLDDLLARADFIS
VHLPKTPETAGLIDKEALAKTKPGVIIVNAARGGLVDEAALADAITGGHV
RAAGLDVFATEPCTDSPLFELAQVVVTPHLGASTAEAQDRAGTDVAESVR
LALAGEFVPDAVNVGGGVVNEEVAPWLDLVRKLGVLAGVLSDELPVSLSV
QVRGELAAEEVEVLRLSALRGLFSAVIEDAVTFVNAPALAAERGVTAEIC
KASESPNHRSVVDVRAVGADGSVVTVSGTLYGPQLSQKIVQINGRHFDLR
AQGINLIIHYVDRPGALGKIGTLLGTAGVNIQAAQLSEDAEGPGATILLR
LDQDVPDDVRTAIAAAVDAYKLEVVDLS
>Rv0728c serA2, POSSIBLE D-3-PHOSPHOGLYCERATE DEHYDROGENASE SERA2 (PHOSPHOGLYCERATE DEHYDROGENASE) (PGDH)
MTPRPRALVTAPLRGPGFAQLRRLADVVYDPWIDQRPLRIYSAEQLADRI
TAVAADVLVVESDSVGGPVFERGLRVVAATRGDPSNVDIPGATAAGIPVL
HTPARNADAVAEMTVALLLAVARHLIPADADVRSGNIFRDGTIPYQRFRG
AEIAGLTAGLVGLGAVGRAVRWRLSGLGLRVIAHDPYRDDAGHSLDELLA
EADIVSMHAAVTDDTIGMIGAQQFAAMRDGAVFLNTARSQLRDTDALVDA
LRGGKLAAAGLDHFTGEWLPTDHPLVSMPNVVLTPHIGGATWNTEARQAR
MVADDLGALLSGNRPAHVVNPEVLGS
>Rv0505c serB1, POSSIBLE PHOSPHOSERINE PHOSPHATASE SERB1 (PSP) (O-PHOSPHOSERINE PHOSPHOHYDROLASE) (PSPASE)
MGLTCWPRTAAGRVHDESRCGLANFDTALGLQINPRQPRAPPRICRIGLI
TAAASATGQAPRLGVMMVSSHLGSPDQAGHVDLASPADPPPPDASASHSP
VDMPAPVAAAGSDRQPPIDLTAAAFFDVDNTLVQGSSAVHFGRGLAARHY
FTYRDVLGFLYAQAKFQLLGKENSNDVAAGRRKALAFIEGRSVAELVALG
EEIYDEIIADKIWDGTRELTQMHLDAGQQVWLITATPYELAATIARRLGL
TGALGTVAESVDGIFTGRLVGEILHGTGKAHAVRSLAIREGLNLKRCTAY
SDSYNDVPMLSLVGTAVAINPDARLRSLARERGWEIRDFRIARKAARIGV
PSALALGAAGGALAALASRRQSR
>Rv3042c serB2, PROBABLE PHOSPHOSERINE PHOSPHATASE SERB2 (PSP) (O-PHOSPHOSERINE PHOSPHOHYDROLASE) (PSPASE)
MPAKVSVLITVTGMDQPGVTSALFEVLAQHGVELLNVEQVVIRGRLTLGV
LVSCPLDVADGTALRDDVAAAIHGVGLDVAIERSDDLPIIRQPSTHTIFV
LGRPITAGAFSAVARGVAALGVNIDFIRGISDYPVTGLELRVSVPPGCVG
PLQIALTKVAAEEHVDVAVEDYGLAWRTKRLIVFDVDSTLVQGEVIEMLA
ARAGAQGQVAAITEAAMRGELDFAESLQRRVATLAGLPATVIDDVAEQLE
LMPGARTTIRTLRRLGFRCGVVSGGFRRIIEPLARELMLDFVASNELEIV
DGILTGRVVGPIVDRPGKAKALRDFASQYGVPMEQTVAVGDGANDIDMLG
AAGLGIAFNAKPALREVADASLSHPYLDTVLFLLGVTRGEIEAADAGDCG
VRRVEIPAD
>Rv0884c serC, POSSIBLE PHOSPHOSERINE AMINOTRANSFERASE SERC (PSAT)
MADQLTPHLEIPTAIKPRDGRFGSGPSKVRLEQLQTLTTTAAALFGTSHR
QAPVKNLVGRVRSGLAELFSLPDGYEVILGNGGATAFWDAAAFGLIDKRS
LHLTYGEFSAKFASAVSKNPFVGEPIIITSDPGSAPEPQTDPSVDVIAWA
HNETSTGVAVAVRRPEGSDDALVVIDATSGAGGLPVDIAETDAYYFAPQK
NFASDGGLWLAIMSPAALSRIEAIAATGRWVPDFLSLPIAVENSLKNQTY
NTPAIATLALLAEQIDWLVGNGGLDWAVKRTADSSQRLYSWAQERPYTTP
FVTDPGLRSQVVGTIDFVDDVDAGTVAKILRANGIVDTEPYRKLGRNQLR
VAMFPAVEPDDVSALTECVDWVVERL
>Rv3331 sugI, PROBABLE SUGAR-TRANSPORT INTEGRAL MEMBRANE PROTEIN SUGI
MTTLWQPHRNDYSPIPGRGVHARRGARRPRPRGGRAERPGTGQLTRSGRR
ALLVGLTAASVGVLYGYDLSAIAGALLSLSEEFELTTREQELLTTTAVLG
QIAGALGGGILANAIGRKKSVVLIVAGYAVFALLGATSVSVPMLVVARLL
LGVTIGLSVVVVPVYVAESAPAAVRGSLVTAYQLATLSGIVVGYLVGYLL
AGSHGWRAMFGLAAAPATLLLPLLWRMPDTARWYLLKGRIADARSALRRI
QPEADIDAELADMAAAVDERGGGIGEMVRRPYLRATLFVIALGFLVQITG
INAIIYYSPRLFAAMGFAGYFAMLALPAMVQVAGLAAVCASLFLVDRLGR
RPILLSGIATMITADAVLITVFANDSDGGTGLVLGFAGVLLFIIGFNFGF
GSLVWVYAAESFPSRLRSMGSSPMLTSTLTANAIVAAFSLTMLRVLGGAG
VFAVFGTFAVVAFVVVYRFAPETKGRKLEEIRHFWENGGRWPAERSPAAD
EP
>Rv0415 thiO, POSSIBLE THIAMINE BIOSYNTHESIS OXIDOREDUCTASE THIO
MASDLHTGSLAVIGGGVIGLSVARRAAQAGWPVRVHRSDERGASWVAGGM
LAPHSEGWPGEERLLRLGLQSLRLWREGSFLDGLGPQLVTAHESLVVAVD
RADVADLRTVADWLSAQGHPVIWESAARDVEPLLAQGIRHGFRAPTELAV
DNRALLDALCRDCERLGVRWSSQVSSLSDVDAHTVVIANGIDAPALWPGL
PIRPVKGEVLRLRWRPGCMPLPQRVIRARVRGRQVYLVPRSDGVVVGATQ
YEHGRDTAPVVSGVRDLLDDACTVLPALGEYELAECEAGLRPMTPDNLPL
VQRLDSRTLVAAGHGRSGFLLAPWTAEQIVSELVSVGAAS
>Rv1294 thrA, PROBABLE HOMOSERINE DEHYDROGENASE THRA
MPGDEKPVGVAVLGLGNVGSEVVRIIENSAEDLAARVGAPLVLRGIGVRR
VTTDRGVPIELLTDDIEELVAREDVDIVVEVMGPVEPSRKAILGALERGK
SVVTANKALLATSTGELAQAAESAHVDLYFEAAVAGAIPVIRPLTQSLAG
DTVLRVAGIVNGTTNYILSAMDSTGADYASALADASALGYAEADPTADVE
GYDAAAKAAILASIAFHTRVTADDVYREGITKVTPADFGSAHALGCTIKL
LSICERITTDEGSQRVSARVYPALVPLSHPLAAVNGAFNAVVVEAEAAGR
LMFYGQGAGGAPTASAVTGDLVMAARNRVLGSRGPRESKYAQLPVAPMGF
IETRYYVSMNVADKPGVLSAVAAEFAKREVSIAEVRQEGVVDEGGRRVGA
RIVVVTHLATDAALSETVDALDDLDVVQGVSSVIRLEGTGL
>Rv1296 thrB, PROBABLE HOMOSERINE KINASE THRB
MVTQALLPSGLVASAVVAASSANLGPGFDSVGLALSLYDEIIVETTDSGL
TVTVDGEGGDQVPLGPEHLVVRAVQHGLQAAGVSAAGLAVRCRNAIPHSR
GLGSSAAAVVGGLAAVNGLVVQTDSSPSSDAELIQLASEFEGHPDNAAAA
VLGGAVVSWTDHSGDRPNYSAVSLRLHPDIRLFTAIPEQRSSTAETRVLL
PAQVSHDDARFNVSRAALLVVALTERPDLLMAATEDLLHQPQRAAAMTAS
AEYLRLLRRHNVAAALSGAGPSLIALSTDSELPTDAVEFGAAKGFAVTEL
TVGEAVRWSPTVRVPG
>Rv1295 thrC, PROBABLE THREONINE SYNTHASE THRC
MTVPPTATHQPWPGVIAAYRDRLPVGDDWTPVTLLEGGTPLIAATNLSKQ
TGCTIHLKVEGLNPTGSFKDRGMTMAVTDALAHGQRAVLCASTGNTSASA
AAYAARAGITCAVLIPQGKIAMGKLAQAVMHGAKIIQIDGNFDDCLELAR
KMAADFPTISLVNSVNPVRIEGQKTAAFEIVDVLGTAPDVHALPVGNAGN
ITAYWKGYTEYHQLGLIDKLPRMLGTQAAGAAPLVLGEPVSHPETIATAI
RIGSPASWTSAVEAQQQSKGRFLAASDEEILAAYHLVARVEGVFVEPASA
ASIAGLLKAIDDGWVARGSTVVCTVTGNGLKDPDTALKDMPSVSPVPVDP
VAVVEKLGLA
>Rv1613 trpA, Probable tryptophan synthase, alpha subunit trpA
MVAVEQSEASRLGPVFDSCRANNRAALIGYLPTGYPDVPASVAAMTALVE
SGCDIIEVGVPYSDPGMDGPTIARATEAALRGGVRVRDTLAAVEAISIAG
GRAVVMTYWNPVLRYGVDAFARDLAAAGGLGLITPDLIPDEAQQWLAASE
EHRLDRIFLVAPSSTPERLAATVEASRGFVYAASTMGVTGARDAVSQAAP
ELVGRVKAVSDIPVGVGLGVRSRAQAAQIAQYADGVIVGSALVTALTEGL
PRLRALTGELAAGVRLGMSA
>Rv1612 trpB, Probable tryptophan synthase, beta subunit trpB
MSAAIAEPTSHDPDSGGHFGGPSGWGGRYVPEALMAVIEEVTAAYQKERV
SQDFLDDLDRLQANYAGRPSPLYEATRLSQHAGSARIFLKREDLNHTGSH
KINNVLGQALLARRMGKTRVIAETGAGQHGVATATACALLGLDCVIYMGG
IDTARQALNVARMRLLGAEVVAVQTGSKTLKDAINEAFRDWVANADNTYY
CFGTAAGPHPFPTMVRDFQRIIGMEARVQIQGQAGRLPDAVVACVGGGSN
AIGIFHAFLDDPGVRLVGFEAAGDGVETGRHAATFTAGSPGAFHGSFSYL
LQDEDGQTIESHSISAGLDYPGVGPEHAWLKEAGRVDYRPITDSEAMDAF
GLLCRMEGIIPAIESAHAVAGALKLGVELGRGAVIVVNLSGRGDKDVETA
AKWFGLLGND
>Rv1611 trpC, Probable indole-3-glycerol phosphate synthase trpC
MSPATVLDSILEGVRADVAAREASVSLSEIKAAAAAAPPPLDVMAALREP
GIGVIAEVKRASPSAGALATIADPAKLAQAYQDGGARIVSVVTEQRRFQG
SLDDLDAVRASVSIPVLRKDFVVQPYQIHEARAHGADMLLLIVAALEQSV
LVSMLDRTESLGMTALVEVHTEQEADRALKAGAKVIGVNARDLMTLDVDR
DCFARIAPGLPSSVIRIAESGVRGTADLLAYAGAGADAVLVGEGLVTSGD
PRAAVADLVTAGTHPSCPKPAR
>Rv2192c trpD, Probable anthranilate phosphoribosyltransferase TrpD
MALSAEGSSGGSRGGSPKAEAASVPSWPQILGRLTDNRDLARGQAAWAMD
QIMTGNARPAQIAAFAVAMTMKAPTADEVGELAGVMLSHAHPLPADTVPD
DAVDVVGTGGDGVNTVNLSTMAAIVVAAAGVPVVKHGNRAASSLSGGADT
LEALGVRIDLGPDLVARSLAEVGIGFCFAPRFHPSYRHAAAVRREIGVPT
VFNLLGPLTNPARPRAGLIGCAFADLAEVMAGVFAARRSSVLVVHGDDGL
DELTTTTTSTIWRVAAGSVDKLTFDPAGFGFARAQLDQLAGGDAQANAAA
VRAVLGGARGPVRDAVVLNAAGAIVAHAGLSSRAEWLPAWEEGLRRASAA
IDTGAAEQLLARWVRFGRQI
>Rv1609 trpE, Probable anthranilate synthase component I trpE (GLUTAMINE AMIDOTRANSFERASE)
MHADLAATTSREDFRLLAAEHRVVPVTRKVLADSETPLSAYRKLAANRPG
TFLLESAENGRSWSRWSFIGAGAPTALTVREGQAVWLGAVPKDAPTGGDP
LRALQVTLELLATADRQSEPGLPPLSGGMVGFFAYDMVRRLERLPERAVD
DLCLPDMLLLLATDVAAVDHHEGTITLIANAVNWNGTDERVDWAYDDAVA
RLDVMTAALGQPLPSTVATFSRPEPRHRAQRTVEEYGAIVEYLVDQIAAG
EAFQVVPSQRFEMDTDVDPIDVYRILRVTNPSPYMYLLQVPNSDGAVDFS
IVGSSPEALVTVHEGWATTHPIAGTRWRGRTDDEDVLLEKELLADDKERA
EHLMLVDLGRNDLGRVCTPGTVRVEDYSHIERYSHVMHLVSTVTGKLGEG
RTALDAVTACFPAGTLSGAPKVRAMELIEEVEKTRRGLYGGVVGYLDFAG
NADFAIAIRTALMRNGTAYVQAGGGVVADSNGSYEYNEARNKARAVLNAI
AAAETLAAPGANRSGC
>Rv0013 trpG, POSSIBLE ANTHRANILATE SYNTHASE COMPONENT II TRPG (GLUTAMINE AMIDOTRANSFERASE)
MRILVVDNYDSFVFNLVQYLGQLGIEAEVWRNDDHRLSDEAAVAGQFDGV
LLSPGPGTPERAGASVSIVHACAAAHTPLLGVCLGHQAIGVAFGATVDRA
PELLHGKTSSVFHTNVGVLQGLPDPFTATRYHSLTILPKSLPAVLRVTAR
TSSGVIMAVQHTGLPIHGVQFHPESILTEGGHRILANWLTCCGWTQDDTL
VRRLENEVLTAISPHFPTSTASAGEATGRTSA
>Rv3754 tyrA, PREPHENATE DEHYDROGENASE TYRA (PDH) (HYDROXYPHENYLPYRUVATE SYNTHASE)
MRAAAAAGREVFGYNRSVEGAHGARSDGFDAITDLNQTLTRAAATEALIV
LAVPMPALPGMLAHIRKSAPGCPLTDVTSVKCAVLDEVTAAGLQARYVGG
HPMTGTAHSGWTAGHGGLFNRAPWVVSVDDHVDPTVWSMVMTLALDCGAM
VVPAKSDEHDAAAAAVSHLPHLLAEALAVTAAEVPLAFALAAGSFRDATR
VAATAPDLVRAMCEANTGQLAPAADRIIDLLSRARDSLQSHGSIADLADA
GHAARTRYDSFPRSDIVTVVIGADKWREQLAAAGRAGGVITSALPSLDSP
Q
>Rv1848 ureA, Urease gamma subunit ureA (Urea amidohydrolase)
MRLTPHEQERLLLSYAAELARRRRARGLRLNHPEAIAVIADHILEGARDG
RTVAELMASGREVLGRDDVMEGVPEMLAEVQVEATFPDGTKLVTVHQPIA
>Rv1849 ureB, Urease beta subunit ureB
MIPGEIFYGSGDIEMNAAALSRLQMRIINAGDRPVQVGSHVHLPQANRAL
SFDRATAHGYRLDIPAATAVRFEPGIPQIVGLVPLGGRREVPGLTLNPPG
RLDR
>Rv1850 ureC, Urease alpha subunit ureC (Urea amidohydrolase)
MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFG
GGKVLRESMGQGRASRADGAPDTVITGAVIIDYWGIIKADIGIRDGRIVG
IGKAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQII
VEALAAGTTTIIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFAL
LGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCLAVADVAGVQV
ALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNV
LPSSTNPTRPHTVNTLDEHLDMLMVCHHLNPRIPEDLAFAESRIRPSTIA
AEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPS
GSQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFF
GVRPHVVLKGGAIAWAAMGDANASIPTPQPVLPRPMFGAAAATAAATSVH
FVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDT
FTVRIDGQVWQPQPAAELPMTQRYFLF