TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 988

Free access
Sort by:

 



# Mycobacterium tuberculosis H37Rv, H37Rv

>Rv1888c POSSIBLE TRANSMEMBRANE PROTEIN
MQPDAYPVRVRGDLDPALSRWQWLVKWFLAIPHYIVLFFLHVAAVVVTVI
AFFAILFTGRYPRTLFDFNVGVMRWRWRVAFYALSALGTDRYPPFSLQTK
AEYPADLEVDYPERLSRGLVLIKWWLLAIPHYLILAVFLSSGWRVFLIDP
HDRVGIMWPSLLVILLLVAVVALLFTGRYPIGLYNL
>Rv2901c CONSERVED HYPOTHETICAL PROTEIN
MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSVEMVPRN
TDGEVYFELRLADAWVWDMYRPARFVKQVRVVTFKDVNIEEVEKPELRLP
E
>Rv2668 POSSIBLE EXPORTED ALANINE AND VALINE RICH PROTEIN
MRHWLIVLATLLVAAAGVAAANDVPRAWAGDAPIGHIGDTLRVDTGTYVA
DVTVSSVVPVDPPPGFGYTRSGVPVKSFPDSSVTRADVTVRAVRVPNSFI
LATNFSFTGVTPFADAYKPRPCDASDWLDAALGNAPQGSIVRGGVYWDAY
RDPVSVVVLLDEKTGQHLAQWNL
>Rv0504c CONSERVED HYPOTHETICAL PROTEIN
MTVPEEAQTLIGKHYRAPDHFLVGREKIREFAVAVKDDHPTHYSEPDAAA
AGYPALVAPLTFLAIAGRRVQLEIFTKFNIPINIARVFHRDQKFRFHRPI
LANDKLYFDTYLDSVIESHGTVLAEIRSEVTDAEGKPVVTSVVTMLGEAA
HHEADADATVAAIASI
>Rv0965c CONSERVED HYPOTHETICAL PROTEIN
MRVNRPQCARVPYSAESLVRVEASWYGRTLRAIPEVLSQVGYQQADHGES
LLTSHHCCLGAAEGARPGWVGSSAGALSGLLDSWAEASTAHAARIGDHSY
GMHLAAVGFAEMEEHNAAALAAVYPTGGGSARCDGVDVS
>Rv2735c CONSERVED HYPOTHETICAL PROTEIN
MAREWSYWTRNKLEILAGYLPAFNRASQTSRERIYLDLMAGQPENIDRDM
GEKFDGSSLIAMKADPPFTRLRFCELNPLASELDVALRTRFPGDGRYRVV
AGDSNVTIDETLAELGPWRWAPTFAFIDQQAAEVHWETINKVAAFRQNPR
NLKTELWMLMSPTMIARGVKGTNAELFIEQVTRMYGDADWKRIQAARWRH
HLTAPAYRAEMVNLMRVKLEYELGYKYSHRIPMQMHNKVTIFDMVFATDH
WAGDAIMCHLYNRAAQKEPEMMRQAKSAKQQKESEDRGEMGLFSVGELAV
QDSNAGQILWAPSPTWDPRARGWWSEDPGF
>Rv1241 CONSERVED HYPOTHETICAL PROTEIN
MRTTLTLDDDVVRLVEDAVHRERRPMKQVINDALRRALAPPVKRQEQYRL
EPHESAVRSGLDLAGFNKLADELEDEALLDATRRAR
>Rv0258c CONSERVED HYPOTHETICAL PROTEIN
MARSQEPSRGLLDPVAKMLRLPFGTPDFIEKIVTGSVNQVGRRTLYVLIT
TWDAAGGGPFAASAIATTGLAKTAEIVQSMFIGPVFNPLLKMLGADKIAI
RASLCAAQLVGLGIMRYGVRSEPLHSMSVEMLVDAIGPTMQRYLVGDIGR
G
>Rv2307B HYPOTHETICAL GLYCINE RICH PROTEIN
MEEVPTGPPAMGHRACGGQKAAFPTRMNSGVEKMYKNSIAIAIGTLTMAV
EFSMVSANAEPAPPPGQDPHMPNSAMGYCPGGGFGGITGWGYCDGIRYPD
GSYWHQVRVPAPFVGTTLTLSCVIDDGSPVPPLAAPGSCGGGA
>Rv0027 HYPOTHETICAL PROTEIN
MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSEL
RDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNI
IDGSR
>Rv3438 CONSERVED HYPOTHETICAL PROTEIN
MPRIRKLVAALHRRGPHRVLRGDLAFAGLPGVVYTPEAGLHLPGVAFGHD
WLTGTSRYSGLLEHLASWGIVAAAPDSERGLAPSVLNLAFDLGVALDIVA
GVRLGPGKISVHPAKLGLVGHGFGGSAAVFAAAGLTGTHVKSVAAIFPTV
TNPAAEQPAATLDVPGLILTAPGDPKTLTSNALGLSRAWDKATLRIVSKA
RAGGLVEGRRLTKVLGLPGPHRRTQRSVRALLTGYLLYTLGGDKTYRRFA
DPDLQLPKTDPIDPEAPPITPGEKIVTLLK
>Rv0048c POSSIBLE MEMBRANE PROTEIN
MAKWLGAPLARGVSTATRAKDSDRQDACRILDDALRDGELSMEEHRERVS
AATKAVTLGDLQRLVADLQVESAPAQMPALKSRAKRTELGLLAAAFVASV
LLGVGIGWGVYGNTRSPLDFTSDPGAKPDGIAPVVLTPPRQLHSLGGLTG
LLEQTRKRFGDTMGYRLVIYPEYASLDRVDPADDRRVLAYTYRGGWGDAT
SSAKSIADVSVVDLSKFDAKTAVGIMRGAPETLGLKQSDVKSMYLIVEPV
KDPTTPAALSLSLYVSSDYGGGYLVFAGDGTIKHVSYPS
>Rv3599c HYPOTHETICAL SHORT PROTEIN
MPASSLGTGSPAADRLDATHERRREVI
>Rv2254c Probable integral membrane protein
MRYRDLETVAAPTINVLRVWPEIVGAIVLLVIAAMGIGHGLRPSPEPVPA
PQKQLGCVRFALIFGLTAINPATFVYFTAVAVTLARALRATTAIAVVVGV
ALASLLWQLLLVSAGAFLRSRATARVRRMTVLAGNAVIAAFGAVLVVHAF
A
>Rv2525c CONSERVED HYPOTHETICAL PROTEIN
MSVSRRDVLKFAAATPGVLGLGVVASSLRAAPASAGSLGTLLDYAAGVIP
ASQIRAAGAVGAIRYVSDRRPGGAWMLGKPIQLSEARDLSGNGLKIVSCY
QYGKGSTADWLGGASAGVQHARRGSELHAAAGGPTSAPIYASIDDNPSYE
QYKNQIVPYLRSWESVIGHQRTGVYANSKTIDWAVNDGLGSYFWQHNWGS
PKGYTHPAAHLHQVEIDKRKVGGVGVDVNQILKPQFGQWA
>Rv0521 POSSIBLE METHYLTRANSFERASE/METHYLASE (FRAGMENT)
MREAAQALGFEVLDQRDLVRNLRTHYSRVFEELEARRLELEGKSSQEYLD
KMRVGLKNWVEAADNGHSRVGHPTFPRTRLTPICQLPTAAIDSTAGRRRY
R
>Rv0246 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MAKTSHRVSSADGMSKRILRLIIAQSGFYSAALQLGNVSIVLPFVVAELD
AELWIAALIFPAFTAGGAIGNVVAPPAVAAVPRRHRLFIIVSCLAVLAGV
NALCATIGKGSVAGILLVVNVTLIGVVSAISFVAFADLVAAMPSGTARAR
ILLTEVGVGAALTAVVAATLSFVPDQHPLSRNIHLLWTAAVAMAISAAIC
RALPHRIVPRVHAAPGLHKLVYVGWTAIRTNGWYRRYLLVQVLFGSVVLG
SSFHSIRVAAVPGDQPDEVVAVVLFVCVGLLGGIALWNRVRERFGLVGLF
VGSALVSIAAAVLSIAFDLAGAWPNVVAIGLVIALVSIANQSVFTAGQLW
IARDAEPGLRTSLISFGQLVINAGLVGMGLALGLIAQDHDAVWPVMIVLL
LNLTAAYSATRFAPAKSVDVRGLPQVSRTSRPKTGG
>Rv2663 HYPOTHETICAL PROTEIN
MEVRASARKHGINDDAMLHAYRNALRYVELEYHGEVQLLVIGPDQTGRLL
ELVIPADEPPRIIHANVLRPKFYDYLR
>Rv2077c POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MLATLSQIRAWSTEHLIDAAGYWTETADRWEDVFLQMRNQAHAIAWNGAG
GDGLRQRTRADFSTVSGIADQLRRAATIARNGAGTIDAAQRRVMYAVEDA
QDAGFNVGEDLSVTDTKTTQPAAVQAARLAQAQALAGDIRLRVGQLVAAE
NEVSGQLAATTGDVGNVRFAGAPVVAHSAVQLVDFFKQDGPTPPPPGAPH
PSGGADGPYSDPITSMMLPPAGTEAPVSDATKRWVDNMVNELAARPPDDP
IAVEARRLAFQALHRPCNSAEWTAAVAGFAGSSAGVVGTALAIPAGPADW
ALLGAALLGVGGSGAAVVNCATK
>Rv1048c HYPOTHETICAL PROTEIN
MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRS
EVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILA
APTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAP
LDARIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEK
NGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDP
TAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD
LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYAD
LRTAGVRGEDAAEHLREAMTK
>Rv3647c CONSERVED HYPOTHETICAL PROTEIN
MSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASA
LAEMIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPR
WLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPT
LIGTRGTRPALRISGRRRLSRLVENVGEPPDGAEAWVQWPRT
>Rv0963c CONSERVED HYPOTHETICAL PROTEIN
MLQRELTRLQNGWLSRDGVWHTDTDKLADLRALRDTLAAHPGTSLILLDT
ASDPRKVLAAVGVGDVDNAERVGVTMGGLNTRVSSSVGDMVKEAGIQRAK
AAELRERAGWPNYDAVASIAWLGYDAPDGLKDVMHDWSARDAAGPLNRFD
KGLAATTNVSDQHITAFGHSYGSLVTSLALQQGAPVSDVVLYGSPGTELT
HASQLGVEPGHAFYMIGVNDHVANTIPEFGAFGSAPQDVPGMTQLSVNTG
LAPGPLLGDGQLHERA
>Rv3880c CONSERVED HYPOTHETICAL PROTEIN
MSMDELDPHVARALTLAARFQSALDGTLNQMNNGSFRATDEAETVEVTIN
GHQWLTGLRIEDGLLKKLGAEAVAQRVNEALHNAQAAASAYNDAAGEQLT
AALSAMSRAMNEGMA
>Rv3234c CONSERVED HYPOTHETICAL PROTEIN
MVTRLSASDASFYQLENTATPMYVGLLLILRRPRAGLSYEALLETVEQRL
PQIPRYRQKVQEVKLGLARPVWIDDRDFDITYHVRRSALPSPGSDEQLHE
LIARLAARPLDKSRPLWEMYLVEGLEKNRIALYTKSHQALINGVTALAIG
HVIADRTRRPPAFPEDIWVPERDPGTTRLLLRAVGDWLVRPGAQLQAVGS
AVAGLVTNSGQLVETGRKVLDIARTVARGTAPSSPLNATVSRNRRFTVAR
ASLDDYRTVRARYDCDSTTWC
>Rv2133c CONSERVED HYPOTHETICAL PROTEIN
MLADGELTVLGRIRSASNATFLCESTLGLRSLHCVYKPVSGERPLWDFPD
GTLAGRELSAYLVSTQLGWNLVPHTIIRDGPAGIGMLQLWVQQPGDAVDS
DPLPGPDLVDLFPAHRPRPGYLPVLRAYDYAGDEVVLMHADDIRLRRMAV
FDVLINNADRKGGHILCGIDGQVYGVDHGLCLHVENKLRTVLWGWAGKPI
DDQILQAVAGLADALGGPLAEALAGRIAAAEIGALRRRAQSLLDQPVMPG
PNGHRPIPWPAF
>Rv3748 CONSERVED HYPOTHETICAL PROTEIN
MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDD
PDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRW
VLVVTGGAGTISLPLIVTG
>Rv0320 POSSIBLE CONSERVED EXPORTED PROTEIN
MGRHELARDRRKSSAVLAAVLAPAAVFFATGGDVSTLAARADANPVLGDD
APCCVQIVPVAPLAFSSQISGGEIGTGLAASQFASASRWRIVSRYLPVGV
APEQGLQVKTVLTARSISAAFPEIREIGGVRPDALRWHPNGLALDVMVPN
PGTAEGIALGNEIVAFVLKNATRFGMQDVIWRGAYYTPNGARTTGAGHYD
HIHITTVGGGYPTGEELYIR
>Rv1744c PROBABLE MEMBRANE PROTEIN
MVINRSIASIDSIAVAGSAATTGAVAVAGSVATAGSVAVAGSVATAGSVA
IAGAAATAGSVGIIGSLLTVLCVAVRQCVACLACITCTRCVACIGCVRCT
DCVGCLWCVNCSGLRNVVGARNLRVGNLGRVSN
>Rv1118c CONSERVED HYPOTHETICAL PROTEIN
MQSGPHLVGRVGTSFPLIARHQGATRDDAGDTGQPDPLPHVAHPDRLYPP
MVHGVDPSTLALDRALNETRTGDLWLFRGRSRPDRAIQTLTNAPVNHVGM
TVAIDDLPPLIWHAELGDKLLDVWTGTNHRGVQLNDARQVVQQWAGRYRQ
RCWLRQLTPHANRDQEDKLLRVIARMNGTPFPTTARLTGRWLRGRLPTLN
DWLRGIPVLDRKVREQTQRRKQQQRTMGLATAYCAETVAITYEEMGLLVT
DKDAHWFDPGKFWSGDSLPLAPGYRLGHEIAVDVGG
>Rv2730 HYPOTHETICAL PROTEIN
MMMNWRQTNITTKRCAQTRASSSASEFCGIFAAPGLMRNCHHGGSAPSAV
GGSAVQLTVAYGPQRFHGRCASNSSVRPLTTGGSWTPTSISSTDGGKAQG
HDTHDRQISRRTVCQAASILASILLETVAGPGEGIGPTTSVPLRAADARH
TREGLQGR
>Rv2042c CONSERVED HYPOTHETICAL PROTEIN
MAPPNRDELLAAVERSPQAAAAHDRAGWVGLFTGDARVEDPVGSQPQVGH
EAIGRFYDTFIGPRDITFHRDLDIVSGTVVLRDLELEVAMDSAVTVFIPA
FLRYDLRPVTGEWQIAALRAYWELPAMMLQFLRTGSGATRPALQLSRALL
GNQGLGGTAGFLTGFRRAGRRHKKLVETFLNAASRADKSAAYHALSRTAT
MTLGEDELLDIVELFEQLRGASWTKVTGAGSTVAVSLASDHRRGIMFADV
PWRGNRINRIRYFPA
>Rv3750c POSSIBLE EXCISIONASE
MTSLLEVLGAPEVSVCGNAGQPMTLPEPVRDALYNVVLALSQGKGISLVP
RHLKLTTQEAADLLNISRPTLVRLLEDGRIPFEKPGRHRRVSLDALLEYQ
QETRSNRRAALGELSRDALGELQAALAEKK
>Rv0559c POSSIBLE CONSERVED SECRETED PROTEIN
MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAW
LAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCP
EHVGVLQRAGTR
>Rv1648 Probable transmembrane protein
MIYRVACLLARIRFTVGYVAALASVSTTILMHGPQVHAQVIRHASTNLHN
LAHGHLGTLWNSAFVIDEGPLYFWLPCLACLLAVAELQLRSLRLTVAFVV
GHIGATLLVAAVLAGAIEIGWLPWSISRVSDVGMSYGALAALGALTAAIP
GRWRPAWIGWWVSLGLATATIGGGFTDAGHTVALLLGMLVTACFTRPARW
TLGRCALLAVASGFCLVLLAHSWWSLVSGSALGLLGALGAAGFARWTRAR
ATSLPPGALAIPQPALSR
>Rv3096 CONSERVED HYPOTHETICAL PROTEIN
MHRRTALKLPLLLAAGTVLGQAPRAAAEEPGRWSADRAHRWYQAHGWLVG
ANYITSNAINQLEMFQPGTYDPRRIDNELGLARFHGFNTVRVFLHDLLWA
QDAPGFQTRLAQFVAIAARYHIKPLFVLFDSCWDPLPRPGRQRAPRAGVH
NSGWVQSPGAERLDDRRYASTLYNYVTGVLGQFRNDDRVLGWDLWNEPDN
PARVYRKVERKDKLERVAELLPQVFRWARTVDPVQPLTSGVWQGNWGDPG
RRSTISAIQLDNADVITFHSYAAPAEFEGRIAELAPLQRPILCTEYLARS
QGSTVEGILPIAKRHNVGAFNWGLVAGKTQTYLPWDSWDHPYRAPPKVWF
HDLLHPNGRPYRDGEVQTIRKLNGMPSQD
>Rv2049c HYPOTHETICAL PROTEIN
MLTRGEVRALPADAVVLSADDAADLSDRVYQVRCAAEDVVTALDEGAAAT
ELRDLCDELIRAARAADGWRRAGA
>Rv2360c HYPOTHETICAL PROTEIN
MPSLPDRLASILRDVLPAEEEPDGALTVRHDGTFASLRVVSIAEDLELVS
LTQILAWDLPLTKRLAEQVAKQARDINFGSVSLREKVSEKAARRSSGRPA
SNTADVMLRYNFPGTGLTDDALRTLILLVLETGATIRSALVG
>Rv2104c CONSERVED HYPOTHETICAL PROTEIN
MRTTVTLDDDVEQLVRRRMAERQVSFKKALNDAIRDGASGRPAPSHFSTR
TADLGVPAVNLDRALQLAADLEDEELVRRQRRGS
>Rv3162c POSSIBLE INTEGRAL MEMBRANE PROTEIN
MTSFAHPGTRGLSTVFGLMMVGSAAVGSHGLAVVVGLAAVIAVGVAAVFR
LAATLAVVLSVVMIVVSGPTHVLAALSGFCAAVYLVCRYGAGVVAGSWPT
TVAAVGFTFAGLAATSFPLQVPWLPLAAPLAVLATYVLATRPFSR
>Rv1728c CONSERVED HYPOTHETICAL PROTEIN
MSVNGLPGAHNAGLQPIDSKGCHTRRTRHTKVLFVSKGVLANGRGRWLAI
AASLVVSAAILYAQGAEHTCCRETPAAIPTGPDSAPANAPRIASPTEADL
LAASAPVAAQQFQFALPAGVASEEGLQVKTIWVARAVSVLFPQITNIFGY
RQDPLKWHPNGLAIDVMIPNHHSDEGIQLGNQVAGLALANAKRWGVLHVI
WRQGYYPGIGAPSWTADYGSETLNHYDHVHIATDGGGYPTGRETYYVGSM
SPTPPE
>Rv3348 PROBABLE TRANSPOSASE
MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPTLAGLRT
LTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGA
IVGKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVID
ANRSWRRLMSLAR
>Rv3642c HYPOTHETICAL PROTEIN
MFVQATELQKVKRRFRNVRATRRNTELEGTRSTAATRADQNDYARGKITA
AELGERVRRRYNIQ
>Rv1494 HYPOTHETICAL PROTEIN
MPFLVALSGIISGVRDHSMTVRLDQQTRQRLQDIVKGGYRSANAAIVDAI
NKRWEALHDEQLDAAYAAAIHDNPAYPYESEAERSAARARRNARQQRSAQ
>Rv3792 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MPSRRKSPQFGHEMGAFTSARAREVLVALGQLAAAVVVAVGVAVVSLLAI
ARVEWPAFPSSNQLHALTTVGQVGCLAGLVGIGWLWRHGRFRRLARLGGL
VLVSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDTAALRDMTY
IGLPPFYPPGWFWIGGRAAALTGTPAWEMFKPWAITSMAIAVAVALVLWW
RMIRFEYALLVTVATAAVMLAYSSPEPYAAMITVLLPPMLVLTWSGLGAR
DRQGWAAVVGAGVFLGFAATWYTLLVAYGAFTVVLMALLLAGSRLQSGIK
AAVDPLCRLAVVGAIAAAIGSTTWLPYLLRAARDPVSDTGSAQHYLPADG
AALTFPMLQFSLLGAICLLGTLWLVMRARSSAPAGALAIGVLAVYLWSLL
SMLATLARTTLLSFRLQPTLSVLLVAAGAFGFVEAVQALGKRGRGVIPMA
AAIGLAGAIAFSQDIPDVLRPDLTIAYTDTDGYGQRGDRRPPGSEKYYPA
IDAAIRRVTGKRRDRTVVLTADYSFLSYYPYWGFQGLTPHYANPLAQFDK
RATQIDSWSGLSTADEFIAALDKLPWQPPTVFLMRHGAHNSYTLRLAQDV
YPNQPNVRRYTVDLRTALFADPRFVVEDIGPFVLAIRKPQESA
>Rv3129 CONSERVED HYPOTHETICAL PROTEIN
MVQGRTVLFRTAEGAKLFSAVAKCAVAFEADDHNVAEGWSVIVKVRAQVL
TTDAGVREAERAQLLPWTATLKRHCVRVIPWEITGRHFRFGPEPDRSQTF
ACEASSHNQR
>Rv3867 CONSERVED HYPOTHETICAL PROTEIN
MVDPPGNDDDHGDLDALDFSAAHTNEASPLDALDDYAPVQTDDAEGDLDA
LHALTERDEEPELELFTVTNPQGSVSVSTLMDGRIQHVELTDKATSMSEA
QLADEIFVIADLARQKARASQYTFMVENIGELTDEDAEGSALLREFVGMT
LNLPTPEEAAAAEAEVFATRYDVDYTSRYKADD
>Rv2804c HYPOTHETICAL PROTEIN
MHDHQVLAARHAHQGPHVLQQRPGFVAEAPRPKATPVDLLGRARQPRAGQ
HLPRRRAAHPRGGHHRIQNLAVAPPHHRRQQQRGHSRRSIGSTSPSDDSA
SYSQRPRDVADPPVEASTLEGQEAVVTVELGGAVVDGVDDQGAGAVVPGT
GHGSDEGIEEKIATETGALLLPVERQASEDEHWDRVGSGWPRPGRDGTRI
RSMLPMASA
>Rv0188 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MSTVHSSIDQHPDLLALRASFDRAAESTIAHFTFGLALLAGLYVAASPWI
VGFSATRGLPTCDLIVGIAVAYLAYGFASALDRTHGMTWTLPVLGVWVIF
SPWVLPGVAVTAGMMWSHIIAGAVVAVLGFYFGMRTRAAANQG
>Rv3916c CONSERVED HYPOTHETICAL PROTEIN
MSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWL
SMVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAP
VSADAVLLTSMGIERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPA
ATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYF
PRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTAGNTS
>Rv0471c HYPOTHETICAL PROTEIN
MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSG
LVAGLLAIGEPGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYA
RARYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVGRSCW
RRSPTGCCAPRC
>Rv3196 CONSERVED HYPOTHETICAL PROTEIN
MSARSVAPSQVMRRAASALYSLNPAMPVLLRPDGAVQVGWDPRRAVLVRP
PRGLTATGLAALLRSMRSPIPITELQRQAAERGLVDGDAMANLVAQLVGA
GVATPLANPGNLDSRRRAASIRVHGRGPLSDLLVQALRCSGARIRHSSQP
HAAVTPAGVDLVVLSDYLVADPHMVRDLHTERVPHLPVRVRDGTGMVGPL
VVPGVTSCLGCADLHRSDRDAAWPAIAAQLRDTVGVADRATLLATAALAL
SQVNRVIAAVRGQEATPEPPSALNTTLEFDLNAGSIVARQWTRHPRCFC
>Rv2844 CONSERVED HYPOTHETICAL ALANINE RICH PROTEIN
MTSSEPAHGATPKRSPSEGSADNAALCDALAVEHATIYGYGIVSALSPPG
VNFLVADALKQHRHRRDDVIVMLSARGVTAPIAAAGYQLPMQVSSAADAA
RLAVRMENDGATAWRAVVEHAETADDRVFASTALTESAVMATRWNRVLGA
WPITAAFPGGDE
>Rv2542 CONSERVED HYPOTHETICAL PROTEIN
MLDAVSDARRDGFAVGEDYTVTDRSTGGSRQQRAARLGQAQGHADFIRHR
VGALLATDRDIATRVSAATQGLDELAFEDVPGVDTPAEDGVQAVDFRQAP
PPGAPGGMSSGDIDAIDAANRALLQDMLAEYSRLPDGQVKTDRLADIAAI
QEALRVPDSHLIYVARPDDPADMIPAVTAVGDPFTADHVSVTVPGVSGTT
RQTIATMTQETRGLREEARVIAHSVGESENVATIAWVGYQPPPVLASWNT
VDDDLAQAGAPKLEAFLRDLQAGSHNPGHTTALFGHSYGSLLSGIALKDG
ASSLVDNAVLYGSPGFDATSPAKLGMNDHNFFVMTTPDDPIRYPARLAPL
HGWGSDGADTIGTVGRQGTPARVGIRPQRDHRRIPGPLPLHPSADRRGIH
SAG
>Rv3630 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MAVGAAAVTEVGDTASPVGSSGASGGAIASGSVARVGTAAAVTALCGYAV
IYLAARNLAPNGFSVFGVFWGAFGLVTGAANGLLQETTREVRSLGYLDVS
ADGRRTHPLRVSGMVGLGSLVVIAGSSPLWSGRVFAEARWLSVALLSIGL
AGFCLHATLLGMLAGTNRWTQYGALMVADAVIRVVVAAATFVIGWQLVGF
IWATVAGSVAWLIMLMTSPPTRAAARLMTPGATATFLRGAAHSIIAAGAS
AILVMGFPVLLKLTSNELGAQGGVVILAVTLTRAPLLVPLTAMQGNLIAH
FVDERTERIRALIAPAALIGGVGAVGMLAAGVVGPWIMRVAFGSEYQSSS
ALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYSLGWVGATVGSGLLLLLP
LSLETRTVVALLCGPLVGIGVHLVALARTDE
>Rv3439c CONSERVED HYPOTHETICAL ALANINE AND PROLINE RICH PROTEIN
MADRLNVAERLAEGRPAAEHTQSYVRACHLVGYQHPDLTAYPAQIHDWYG
SEDGLDLHALDADCAQLRAAASVLMEALRMERSQVAVLAAAWTGSGADAA
VHFVQRHCETGNSVVTEVRAAAQRCESLRDNLWQLVDSKVATAIAIDERA
LAQRPAWLAAAEALTTEGADRPTAVEVVRQQIQPYVDDDVRNDWLTTMRS
TTAGVAASYDAVTDQLASAPRAHFEIPDDLGPGRQPSPASVPAQPSATAA
ITPAAALPPPDPVPAVTSRPVTPSDFGSAPGDGSATPAGVGSAGGFGDAG
GTGGLGGFAGLAGLANRIVDAVDSLLGSVAEQLGDPLAADNPPGAVDPFA
EDAADNADDGDDAHPEEADEAAEPKEATEPDEADEVDDADESVPAERAQD
VAEEATLPPVAEPPPPAAPPVAEPPPPVAAPAPPGAPEPANGPSPEALSE
GATPCEIAADELPQAGP
>Rv2664 HYPOTHETICAL PROTEIN
MKHKTDIDEWLDTIEPNPADAHDASHLRRIIAAKEAVQTAESELRAAVNA
ARAAGDTWAAIGVALGITRQAAFQRFGPHSTASP
>Rv0598c CONSERVED HYPOTHETICAL PROTEIN
MKPPLAVDTSVAIPLLVRTHTAHAAVVAWWAHREAALCGHALAETYSVLT
RLPRDLRLAPMDAARLLTERFAAPLLLSSRTTEHLPRVLAQFEITGGAVY
DALVALAAAEHRAELATRDARAKDTYEKIGVHVVVAA
>Rv0203 POSSIBLE EXPORTED PROTEIN
MKTGTATTRRRLLAVLIALALPGAAVALLAEPSATGASDPCAASEVARTV
GSVAKSMGDYLDSHPETNQVMTAVLQQQVGPGSVASLKAHFEANPKVASD
LHALSQPLTDLSTRCSLPISGLQAIGLMQAVQGARR
>Rv1303 CONSERVED HYPOTHETICAL TRANSMEMBRANE PROTEIN
MTTPAQDAPLVFPSVAFRPVRLFFINVGLAAVAMLVAGVFGHLTVGMFLG
LGLLLGLLNALLVRRSAESITAKEHPLKRSMALNSASRLAIITILGLIIA
YIFRPAGLGVVFGLAFFQVLLVATTALPVLKKLRTATEEPVATYSSNGQT
GGSEGRSASDD
>Rv0979c HYPOTHETICAL PROTEIN
MGFRTQVGAATIASTMTWRIPVEDGPAQFRAGVGPGRDRQFTVVAPMVVG
LWDRNRRPGWQWPS
>Rv3466 CONSERVED HYPOTHETICAL PROTEIN
MGSGSRERIVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLECLVRRL
PAVGHALINQLDAQASEEELGGTLCCALANRLRITKPDAARRIADAADLG
PRRALTGEPLAPQLTATATAQRQGLIGEAHVKVIRALFRPPARRGGCVHP
PGRRSRPGRQSRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPE
QPAIRRHVTAKWLPDPPSAGHL
>Rv3413c HYPOTHETICAL ALANINE AND PROLINE RICH PROTEIN
MREFGNPLGDRPPLDELARTDLLLDALAEREEVDFADPRDDALAALLGQW
RDDLRWPPASALVSQDEAVAALRAGVAQRRRARRSLAAVGSVAAALLVLS
GFGAVVADARPGDLLYGLHAMMFNRSRVSDDQIVLSAKANLAKVEQMIAQ
GQWAEAQDELAEVSSTVQAVTDGSRRQDLINEVNLLNTKVETRDPNATLR
PGSPSNPAAPGSVGNSWTPLAPVVEPPTPPTPASAAEPSMSAGVSESPMP
NSTSTVAASPSTPSSKPEPGSIDPSLEPADEATNPAGQPAPETPVSPTH
>Rv1434 HYPOTHETICAL PROTEIN
MRASPAERVDGAYAGAGPHTQSVLEEDQRQRAPAGAEAEGPGRTG
>Rv0395 HYPOTHETICAL PROTEIN
MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLA
VRRRGVPAAIGCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDK
GFPNVALLRLRDMAPSEHGSRCSSARGRLCLSMS
>Rv0463 PROBABLE CONSERVED MEMBRANE PROTEIN
MTRRASTDTPQIIMGAIGGVVTGYILWLAAISVGDGLTTVSQWSRVVLLL
SVLVAVCGAAGGLRLRSRGKLAWSAFAFSLPIPPVVLTVAVLADIYL
>Rv2179c CONSERVED HYPOTHETICAL PROTEIN
MRYFYDTEFIEDGHTIELISIGVVAEDGREYYAVSTEFDPERAGSWVRTH
VLPKLPPPASQLWRSRQQIRLDLEEFLRIDGTDSIELWAWVGAYDHVALC
QLWGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQL
RRFRLITSTDDAGRGAAR
>Rv2401 HYPOTHETICAL PROTEIN
MRDFGQRSRSGGKAIAEHCRTHELHIRPRTGGESATTVQVGRSAANERAD
IAPRKTRCCVHVAKPNRIRLADQLARSSMGEKPGHDHQRNQRDQNQRDVR
PRHPGYLGA
>Rv0912 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MTRRLRPGWLVALSAAVIAASTWMPWLTTTVGGGGWVNAIGGTHGSLELP
HGFGPGQLIVLLSSTLLVVGAMAGRGLSVKLSSIAALVVSLLIVALTVWY
YKLNVNPPVSAEYGLYFGAAGGVCAVGCSLWAAVSAASPGRRRHREVVR
>Rv1439c HYPOTHETICAL PROTEIN
MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQ
REFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRF
LLRGDKAVERVSYFDSLPLLIKVVKHPSAWRGWLTTMRSRA
>Rv0387c CONSERVED HYPOTHETICAL PROTEIN
MSLLPTLQSFLPPPFDAIPNPIEDLDVLVAAAVAVAAGSLGVSAAQLGEI
YRHDVVDEAQKAPHCPAESDQTPAGAAGDGDLPEVGGRVTSPPQPPVAAL
TGYSANIGGLSVPHSWNLPPAVRQVAAMFPGATPMYMTGSSDGSYAGLAA
AGLAGTGLAGLAARGGSAPTPAAAAPAGAGGAGPAATRPAAQQTPAVPAA
AAGSAIPGLPPGLPPGVVANLAATLAAIPGATIIVVPPSPNANQ
>Rv1157c CONSERVED HYPOTHETICAL ALA-, PRO-RICH PROTEIN
MRRLTNTEHRENTTVASTWSVCKGLAAVVITSAAAFALCPNAAADPATPQ
PNPTQQLPGLPALAQLSPIIQQAAMNPAQATQLLMAAASAFAGNPAVPTE
SKNVASSVNQFVAEPTNPDSAALGVPAPHGVALPEAIPVPHVPPLGAEPG
VQAHLPTGIDPSHAAGPAPAVAPTVTPPVAAPPASAPAPAPDAAQPVAVP
GPPPAPPAPRAAAPAPASAAPAPAAAPAPASGFGADAPPTQDFMYPSIGP
NCVADGSNSIATALSVAGPAKIPLPGPGPGQTAYVFTAVGTPGPADVQRL
PLNVTWVNLTTGKSGSATLRPRSDINPDGPTTLTVIADTGSGSIMSTIFG
QVTTKDRQCQFMPTIGSTVVP
>Rv3864 CONSERVED HYPOTHETICAL PROTEIN
MASGSGLCKTTSNFIWGQLLLLGEGIPDPGDIFNTGSSLFKQISDKMGLA
IPGTNWIGQAAEAYLNQNIAQQLRAQVMGDLDKLTGNMISNQAKYVSDTR
DVLRAMKKMIDGVYKVCKGLEKIPLLGHLWSWELAIPMSGIAMAVVGGAL
LYLTIMTLMNATNLRGILGRLIEMLTTLPKFPGLPGLPSLPDIIDGLWPP
KLPDIPIPGLPDIPGLPDFKWPPTPGSPLFPDLPSFPGFPGFPEFPAIPG
FPALPGLPSIPNLFPGLPGLGDLLPGVGDLGKLPTWTELAALPDFLGGFA
GLPSLGFGNLLSFASLPTVGQVTATMGQLQQLVAAGGGPSQLASMGSQQA
QLISSQAQQGGQQHATLVSDKKEDEEGVAEAERAPIDAGTAASQRGQEGT
VL
>Rv0493c CONSERVED HYPOTHETICAL PROTEIN
MGESTTQPAGGAAVDDETRSAALPRWRGAAGRLEVWYATLSDPLTRTGLW
VHCETVAPTTGGPYAHGWVTWFPPDAPPGTERFGPQPAQPAAGPAWFDIA
GVRMAPAELTGRTRSLAWELSWKDTAAPLWTFPRVAWERELLPGAQVVIA
PTAVFAGSLAVGETTHRVDSWRGSVAHIYGHGNAKRWGWIHADLGDGDVL
EVVTAVSHKPGLRRLAPLAFVRFRIDGKDWPASPLPSLRMRTTLGVRHWQ
LEGRIGGREALIRVDQPPERCVSLGYTDPDGAKAVCTNTEQADIHIELGG
RHWSVLGTGHAEVGLRGTAAPAIKEGTPA
>Rv0401 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MRPRRALAGLAADVVAVLVFCAVGRRSHAEGLSVTGLAATAWPFLTGTGI
GWVLARGWRRPTALAPTGVIVWLCTIVVGMVLRKVSSAGVAASFVVVASA
VTAVLLLGWRAAVALMAPHRADG
>Rv0078A HYPOTHETICAL PROTEIN
MNAVESTLRRVAKDLTGLRQRWALVGGFAVSARSEPRFTRDVDIVVAVAN
DDAAESLVRQLLTQQYHLLASVEQDAARRLAAVRLGATADTAANVVVDLL
FASCGIEPEIAEAAEEIEILPDLVAPVATTAHLIAMKLLARDDDRRPQDR
SDLRALVDAASPQDIQDARKAIELITLRGFHRDRDLAAEWTRLAAKW
>Rv0961 PROBABLE INTEGRAL MEMBRANE PROTEIN
MRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCH
DSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGL
LALGLVYVAADAVLH
>Rv3566A HYPOTHETICAL PROTEIN
MSGADPPTRRAFGQMARAATGWVSVSGQFAVAADTCRCEGTLFAVDPETH
VANHNRCDIVGRLRDERPNTLRSVRRGDEVRMATWHWI
>Rv3172c HYPOTHETICAL PROTEIN
MSVALLREMFDRMVVAKNAELIEHYYDPDFLMYSDGLSQSFAKFRDSHRK
LYATAISYAVEYDEHAWVEAQTRLPGGCGSPRRDLARSRPASRWYSLPPT
ATAEFTGSGRRRGRVGATWPPSTITETTTDRLAMRNQLRAGAATLLFCDP
MLQRFPATRK
>Rv0300 CONSERVED HYPOTHETICAL PROTEIN
MSDVLIRDIPDDVLASLDAIAARLGLSRTEYIRRRLAQDAQTARVTVTAA
DLRRLRGAVAGLGDPELMRQAWR
>Rv1054 PROBABLE INTEGRASE (FRAGMENT)
MTGKGIVESTTKTKRDRHVPVPEPVWRRLHAELPTDPNALVFPGRKGGFL
PLGEYRWAFDNAGDQVGIEGWYRTVWGTPRPRWRSAQALTSRSCNGSLDT
QQRR
>Rv0095c CONSERVED HYPOTHETICAL PROTEIN
MRYLPVSTRRIWVNPLCHFSFTVISGALFVSARRYDSNMLANSREELVEV
FDALDADLDRLDEVSFEVLSTPERLRSLERLECLARRLPAAQHTLINQLD
TQASEEELGGTLCCALANRLRITKPEAGRRSAEAKP
>Rv1584c Possible phiRv1 phage protein
MSTIYHHRGRVAALSRSRASDDPEFIAAKTDLVAANIADYLIRTLAAAPP
LTDEQRTRLAELLRPVRRSGGAR
>Rv1585c Possible phage phiRv1 protein
MSRHHNIVIVCDHGRKGDGRIEHERCDLVAPIIWVDETQGWLPQAPAVAT
LLDDDNQPRAVIGLPPNESRLRPEMRRDGWVRLHWEFACLRYGAAGVRTC
EQRPVRVRNGDLQTLCENVPRLLTGLAGNPDYAPGFAVQSDAVVVAMWLW
RTLCESDTPNKLRATPTRGSC
>Rv1794 CONSERVED HYPOTHETICAL PROTEIN
MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSNDWLNEH
PGMAVMREQGIVVNDAVNEQVAARMKVLAAPDLEVVALLSRGKLLYGVID
DENQPPGSRDIPDNEFRVVLARRGQHWVSAVRVGNDITVDDVTVSDSASI
AALVMDGLESIHHADPAAINAVNVPMEEMLEATKSWQESGFNVFSGGDLR
RMGISAATVAALGQALSDPAAEVAVYARQYRDDAKGPSASVLSLKDGSGG
RIALYQQARTAGSGEAWLAICPATPQLVQVGVKTVLDTLPYGEWKTHSRV
>Rv0877 CONSERVED HYPOTHETICAL PROTEIN
MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGV
SYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTA
LLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAA
EIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPL
AGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDD
GVLDIIEKPAES
>Rv2778c CONSERVED HYPOTHETICAL PROTEIN
MPDPDGPSVTVTVEIDANPDLVYGLITDLPTLASLAEEVVAMQLRKGDDV
RKGAVFVGRNENGGRRWTTTCTVTDADPGRVFAFDVRSGIIPISRWQYGI
VATEHGCRVTESTWDRRPSWFRAVARMATGVKDRASVNTEHIRRTLQRLK
DRAEAG
>Rv3188 CONSERVED HYPOTHETICAL PROTEIN
MAVTLDRAVEASEIVDALKPFGVTQVDVAAVIQVSDRAVRGWRTGDIRPE
RYDRLAQLRDLVLLLSDSLTPRGVGQWLHAKNRLLDGQRPVDLLAKDRYE
DVRSAAESFIDGAYV
>Rv2401A POSSIBLE CONSERVED MEMBRANE PROTEIN
MGPMNGFLSWWDGVELWLSGLPFALQALAVMPVVLALAYFTAALLDALLG
RVIQLIRRARRPDQAPR
>Rv1919c CONSERVED HYPOTHETICAL PROTEIN
MSGRKFSFEVTKTSSAPAATLFRLVTDGGNWATWAKPIVAQSSWARRGDP
APGGIGAIRKLGMWPVFVQEETVEYEQDRRHVYKLVGARTPVQDYFGEVV
LTPNASGGTDLRWSGSFTEKVRGTGPVMRAALGGAVRFFAGQLVKAAERE
AVRR
>Rv2253 Possible secreted unknown protein
MSGHRKKAMLALAAASLAATLAPNAVAAAEPSWNGQYLVTLSANAKTGTS
MAANRPEYPHKANYTFSSRCASDVCIATVVDAPPPKNEFIPRPIEYTWNG
TQWVREISWQWDCLLPDGTIEYAPAKSITAYTPGQYGILTGVFHTDIASG
TCKGNVDMPVSAKPIVG
>Rv0336 CONSERVED 13E12 REPEAT FAMILY PROTEIN
MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAAAQLVAL
GELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAM
RERLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVAR
WPSMTKARLAGQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIG
GSLLAVDAHALDARLSALAGTVCEHDPRSREQRRADALGALAGGADRLGC
GCGRADCAAGKRPAAPPVVIHLIAEAATINGTGSAPASQMNADGLITAEL
VAELAKTATLVPLVHPGDAPPEPGYAPSKALADFVRCRDLTCRWPGCDEP
ATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQQLPDGTLI
LTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK
RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDPNDDP
PPF
>Rv3734c CONSERVED HYPOTHETICAL PROTEIN
MDLMMPNDSMFLFIESREHPMHVGGLSLFEPPQGAGPEFVREFTERLVAN
DEFQPMFRKHPATIGGGIARVAWAYDDDIDIDYHVRRSALPSPGRVRDLL
ELTSRLHTSLLDRHRPLWELHVVEGLNDGRFAMYTKMHHALIDGVSAMKL
AQRTLSADPDDAEVRAIWNLPPRPRTRPPSDGSSLLDALFKMAGSVVGLA
PSTLKLARAALLEQQLTLPFAAPHSMFNVKVGGARRCAAQSWSLDRIKSV
KQAAGVTVNDAVLAMCAGALRYYLIERNALPDRPLIAMVPVSLRSKEDAD
AGGNLVGSVLCNLATHVDDPAQRIQTISASMDGNKKVLSELPQLQVLALS
ALNMAPLTLAGVPGFLSAVPPPFNIVISNVPGPVDPLYYGTARLDGSYPL
SNIPDGQALNITLVNNAGNLDFGLVGCRRSVPHLQRLLAHLESSLKDLEQ
AVGI
>Rv3088 CONSERVED HYPOTHETICAL PROTEIN
MTRINPIDLSFLLLERANRPNHMAAYTIFEKPKGQKSSFGPRLFDAYRHS
QAAKPFNHKLKWLGTDVAAWETVEPDMGYHIRHLALPAPGSMQQFHETVS
FLNTGLLDRGHPMWECYIIDGIERGRIAILLKVHHALIDGEGGLRAMRNF
LSDSPDDTTLAGPWMSAQGADRPRRTPATVSRRAQLQGQLQGMIKGLTKL
PSGLFGVSADAADLGAQALSLKARKASLPFTARRTLFNNTAKSAARAYGN
VELPLADVKALAKATGTSVNDVVMTVIDDALHHYLAEHQASTDRPLVAFM
PMSLREKSGEGGGNRVSAELVPMGAPKASPVERLKEINAATTRAKDKGRG
MQTTSRQAYALLLLGSLTVADALPLLGKLPSANVVISNMKGPTEQLYLAG
APLVAFSGLPIVPPGAGLNVTFASINTALCIAIGAAPEAVHEPSRLAELM
QRAFTELQTEAGTTSPTTSKSRTP
>Rv0513 POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MTPTGDTKPKLLFYEPGASWYWVLTGPLAAVSVLLLEISSGAGVGLITPA
IFLVMVSAFVALQVKAARIHTSVELTHDALRQGTETIRLAEIVKIYPEAD
GRETSGEEPAKWQSARTLGELVGVPRGRVGIGLKLTGGRTAQAWARRHQQ
LRAALTPLVQERLGPVDSDVADVNGDDAGPAR
>Rv1748 HYPOTHETICAL PROTEIN
MPGGVCSGRPWGRPWWHPGLVGLLIRLAELLVVMLPLIGVLYVGIKALSS
FTRRLGEASGDLASDSPAMPRPTTVENDAARWRAITRAVEAHERTDARWL
EYELDAAKLLDFPVMTDMRDPLTTAFHKAKLQADFHKPLRAEDLLDDPDA
AGHYLDAVRDYVTAFDTAEAEAMRRRRTGFSREEQQRLARAQSLLRVASD
AGATAQERERAYRLARTELDGLIVLPDRTRAGIERGIAGELDD
>Rv2452c HYPOTHETICAL PROTEIN
MAFRDILVLFSMKTLLTLAMAAASSTALTTVGVSGARLITYCVGVEDI
>Rv3217c PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MPVRAPAAVRGAGLIVAVQGGAALVVAAALLVRGLAGADQHIVNGLGTAG
WFVLVGGAVLAAGCRLAVGKLWGRGLAVFAQLLLLPVAWYLIVGSHQPAI
GIPVGIIALGVLVLLFSPPSIRWAAGRDQRGAASAANRGPDSR
>Rv3189 CONSERVED HYPOTHETICAL PROTEIN
MKLADAIATAPRRTLKGTYWHQGPTRHPVTSCADPARGPGRYHRTGEPGV
WYASNKEQGAWAELFRHFVDDGVDPFEVRRRVGRVAVTLQVLDLTDERTR
SHLGVDETDLLSDDYTTTQAIAAARDANFDAVLAPAAALPGCQTLAVFVH
ALPNIEPERSEVRQPPPRLANLLPLIRPHEHMPDSVRRLLATLTRAGAEA
IRRRRR
>Rv1950c CONSERVED HYPOTHETICAL PROTEIN
MLPTLSHIHAWDTEHLIEAAYYWTKVADQWEDVFLEMRNRSHFIAWEGAG
GDGCDSEPALTYR
>Rv3770B PROBABLE REMNANT OF A TRANSPOSASE
MRAERARAIGLFRYQLIREAADAAHSTKERGKMVRELASREHTDPFGRKV
RISRHTIDRWIRN
>Rv0614 CONSERVED HYPOTHETICAL PROTEIN
MPAIPFQGEARAGRRPGRPRRCPAGVVRCRPRSMGHVRPGFSPRLGSHRT
LRPRWPPYAAASRGLTSGTSRWGWPRLGFGVVTAPTRWTLADGRELLFFS
LPGPRTSGTAAERVARHAQAQTFAGDIRQRAIQLVVSEQEVASKITAATA
GIATTTFPETPSIDDTIIGNDNRDTGVRLVDVKQDGGTSPPPPFAPWDTP
DGTPPPGTGLSPTLQQMILGGDPANLTGQGLADNVQRFVQSLPANDPNTA
WLRGQVADLQAHVADIEYARTHCSTNDWIDRTAQFASGAIVFSIGVLTAE
TGAGVVAAAAGGVGAATAGVSLLQCLVGSK
>Rv0750 CONSERVED HYPOTHETICAL PROTEIN
MRAIVGDCVIHIMPMGTGVELSKLADLALDIGRSVGCSAYENDFTLPDIP
TQWRNQPLGWYTQGLAPYLPGLSDPKDAAEG
>Rv2433c HYPOTHETICAL PROTEIN
MGLRDADERWDTVGQAIGLFLRGHTLRTAAPTALIVGTVLCAVNQGATLA
EGAATIGTWVRMVINYLVPFLVASVGYLGARRGVRRASGRSDPSAQ
>Rv1754c CONSERVED HYPOTHETICAL PROTEIN
MYRYQVRVQQRRSEMNRWVATRSRRHTYQWITDHKSPRDHYRHISELRTS
IATSSPGRCDMSPIPRIVSVSLAWAAAIGLMVPIGLAPPAMAAPCSGDAA
NAPPPPSAIVTDPGATALGPVRPGHGPIPTGRKPRGANDRAPLPKLGPLI
SALLNPGARNAAPLQQQALVPRANPGPNPAPNPPATGPQPPNATQLTPNP
APAPDPAPAAAPDPGATLAGATTSLAEWVTGPDSPNKTLERFGISGTDLG
IPWDNGDPANRQVLMIFGDTFGYCAVDGHQWRYNTLFRSQDRDLGNGVHV
TSGDASNRYSGSPVRQPGFSKQLINSIKWARDETGIIPTAGIAVGKTQYV
NFMSIRNWGRDGEWTTNYSGIAVSKDNGQTWGVFPGTIRASGPDSGGKAR
FVPGNENFQMGAYLKSNDGYLYSFGTPPGRGGSAYLARVPQRFVPDLTKY
QYWNGDSNSWVPNKPDAATPVIPGPVGEMSVQYNTYLKQYLALYTNGMND
VVARTAPAPQGPWSAEQMLVSSWQMPGGIYAPMMHPWSTGKDVYFNLSLW
SAYNVMLMHTVLP
>Rv0862c CONSERVED HYPOTHETICAL PROTEIN
MTEHTPDIPLGSWLAALPDERLTQLLELRPDLAQPPPGSIAALAARAQAR
QSVKAATDELDFLRLAVFDALLVLQADTAPVPIVRLLAVIGDRAAQADVL
GALADLKQRALAWGETAVRVATDAGTALPWHPGQVTLEGSSRSGDQLADL
IAGLDPAQRDVLDKLLQGSPVGRTRDAAPGAPSDRPVPRLLAMGLLRRID
AETVILPRHVGQVLRGEQPGPMELTAPDPVVSTTTPDDADAAAAGAVIDL
LREVDVLLENLGATPVAELRSGGLGVREFKRLAKATGIDEPRLGLILEIA
AAAGLIASGMPDPEPPHSDGPFWAPTVAADRFATMSPAERWHLLASAWLD
LPGRPALIGTRGPDAKPYGALSDSLFSTAAPLDRRLLLGMLAELPAGAGV
DASRASATLIWRRPRWARRLQPAPIADLLTEGHALGLVGRGAISTPARAL
LDEALEPATAPAAAVGVMARALPKPIDHFLVQADLTVVVPGPLQRELADD
LTTVATVESAGTAMVYRVSEQSIRHALDVGKSRDWLQEFFANRSKTPVPQ
GLTYLIDDVARRHGQLRIGMAASFVRCEDPTLLAQVVAAPEADGLALRAL
APTVAVSPAPISEVLVTLRGAGFAPAAEDSTGAVVDVRTRGARVPTPQRR
RPYRPPPRPNSEALKAVVAVLREVTAAPFANVRVDPAVTMSLLQRAAKDQ
ATLVISYLDAAGVATQRVVAPITLRGGQLVAFDSSSGRLRDFAIHRITLV
VSAHDR
>Rv0298 HYPOTHETICAL PROTEIN
MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYT
AKTVPALDIDAYAQRVYQANRAAGS
>Rv0340 CONSERVED HYPOTHETICAL PROTEIN
MANSLLDFVISLVRDPEAAARYAANPERSIAEAHLTDVTRADVNSLIPVV
SDSLSMSEPIGAAGGAHAGDRGNVWASGAATAALDAFAPHADAGVVQQHG
AVGSVLNQPTPPGPGVTPTDPRPFRAGPHETSALLTSAEIPDTTSEDGGL
PTDHPAVWNHPVVDPHTVEPDHHGYDIHG
>Rv2297 HYPOTHETICAL PROTEIN
MAMEMAMMGLLGTVVGASAMGIGGIAKSIAEAYVPGVAAAKDRRQQMNVD
LQARRYEAVRVWRSGLCSASNAYRQWEAGSRDTHAPNVVGDEWFEGLRPH
LPTTGEAAKFRTAYEVRCDNPTLMVLSLEIGRIEKEWMVEASGRTPKHRG
>Rv1831 HYPOTHETICAL PROTEIN
MRLCVCSAVDWTTHRSSAGEFCGCQLRTPKEQYLSVNLSGTRTARDYDAS
GKRWRPLAVLTRRWGKAIHLTVDRVAESLRRLACR
>Rv1810 CONSERVED HYPOTHETICAL PROTEIN
MQLQRTMGQCRPMRMLVALLLSAATMIGLAAPGKADPTGDDAAFLAALDQ
AGITYADPGHAITAAKAMCGLCANGVTGLQLVADLRDYNPGLTMDSAAKF
AAIASGAYCPEHLEHHPS
>Rv0738 CONSERVED HYPOTHETICAL PROTEIN
MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQ
VGRWAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEV
PGQVFIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQF
RGPGKPFADEKPCPRERPPADQLAAFLGRTVR
>Rv0749A HYPOTHETICAL PROTEIN (FRAGMENT)
MVRKHAFHWRYDSTEELELLNQLWQLVSLRLNFFTPTKKALGFRP
>Rv3903c HYPOTHETICAL ALANINE AND PROLINE RICH PROTEIN
MAPLAVDPAALDSAGGAVVAAGAGLGAVISSLTAALAGCAGMAGDDPAGA
VFGRSYDGSAAALVQAMSVARNGLCNLGDGVRMSAHNYSLAEAMSDVAGR
AAPLPAPPPSGCVGVGAPPSAVGGGGGAPKGWGWVAPYIGMIWPNGDSTK
LRAAAVAWRSAGTQFALTEIQSTAGPMGVIRAQQLPEAGLIESAFADAYA
STTAVVGQCHQLAAQLDAYAARIDAVHAAVLDLLARICDPLTGIKEVWEF
LTDQDEDEIQRIAHDIAVVVDQFSGEVDALAAEITAVVSHAEAVITAMAD
HAGKQWDRFLHSNPVGVVIDGTGQQLKGFGEEAFGMAKDSWDLGPLRASI
DPFGWYRSWEEMLTGMAPLAGLGGENAPGVVESWKQFGKSLIHWDEWTTN
PNEALGKTVFDAATLALPGGPLSKLGSKGRDILAGVRGLKERLEPTTPHL
EPPATPPRPGPQPPRIEPPESGHPAPAPAAKPAPVPANGPLPHSPTESKP
PPVDRPAEPVAPSSASAGQPRVSAATTPGTHVPHGLPQPGEHVPAQAPPA
TTLLGGPPVESAPATAHQPQWATTPAAPAAAPHSTPGGVHSTESGPHGRS
LSAHGSEPTHDGASHGSGHGSGSEPPGLHAPHREQQLAMHSNEPAGEGWH
RLSDEAVDPQYGEPLSRHWDFTDNPADRSRINPVVAQLMEDPNAPFGRDP
QGQPYTQERYQERFNSVGPWGQQYSNFPPNNGAVPGTRIAYTNLEKFLSD
YGPQLDRIGGDQGKYLAIMEHGRPASWEQRALHVTSLRDPYHAYTIDWLP
EGWFIEVSEVAPGCGQPGGSIQVRIFDHQNEMRKVEELIRRGVLRQ
>Rv2302 CONSERVED HYPOTHETICAL PROTEIN
MHAKVGDYLVVKGTTTERHDQHAEIIEVRSADGSPPYVVRWLVNGHETTV
YPGSDAVVVTATEHAEAEKRAAARAGHAAT
>Rv1046c HYPOTHETICAL PROTEIN
MKVQARVGWNRRQLSAVGGRGQQLFANAPGHIPSTSHRRGTGDINRKIDE
SLAGAARPQANANYGATSDPPLTHQPKPGSPTQVGPRSPSPPGLRGLVKQ
LPEVHQSSLHLDTVASLPSSRPSPHHTPLALRSRSGHFSPDEIRNRRSRK
RSQSHMPPRTPPRGRCLRAPEALA
>Rv0531 POSSIBLE CONSERVED MEMBRANE PROTEIN
MSEAPNDKTTRGVVDILVYATARLLLVVAVSAAIFGVARLIGLTEFPVVV
ATLFGLIIAMPLGIWVFSPLRRRATAALAVAGERRRAERERLRARLRGES
LPEEQ
>Rv0898c CONSERVED HYPOTHETICAL PROTEIN
MGKGRKPTDSETLAHIRDLVAEEKALRAQLRHGGISESEEQQQLRRIEIE
LDQCWDLLRQRRALRQTGGDPREAVVRPADQVEGYTG
>Rv3776 CONSERVED HYPOTHETICAL PROTEIN
MFEISLSDPVELRDADDAALLAAIEDCARAEVAAGARRLSAIAELTSRRT
GNDQRADWACDGWDCAAAEVAAALTVSHRKASGQMHLSLTLNRLPQVAAL
FLAGQLSARLVSIIAWRTYLVRDPEALSLLDAALAKHATAWGPLSAPKLE
KAIDSWIDRYDPAALRRTRISARSRDLCIGDPDEDAGTAALWGRLFATDA
AMLDKRLTQLAHGVCDDDPRTIAQRRADALGALAAGADRLTCGCGNSDCP
SSAGNHRQATGVVIHVVADAAALGAAPDPRLSGPEPALAPEAPATPAVKP
PAALISGGGVVPAPLLAELIRGGAALSRMRHPGDLRSEPHYRPSAKLAEF
VRIRDMTCRFPGCDQPTEFCDIDHTLPYPLGPTHPSNLKCLCRKHHLLKT
FWTGWRDVQLPDGTIIWTAPNGHTYTTHPDSRIFLPSWHTTTAALPPAPS
PPAIGPTHTLLMPRRRRTRAAELAHRIKRERAHVTQRNKPPPSGGDTAVA
EGFEPPDGVSRLSLSRRVH
>Rv1839c CONSERVED HYPOTHETICAL PROTEIN
MSKRLQVLLDPDEWEELREIARRHRTTVSEWVRRTLREAREREPRGDLDM
KLRSVRAAARHEFPTADVEQMLEEIERGRGAEREGSR
>Rv0441c HYPOTHETICAL PROTEIN
MGAKKVDLKRLAAALPDYPFAYLITVDDGHRVHTVAVEPVLRELPDGPDG
PRAVVDVGLIGGRTRQNLAHRSEVTLLWPPSDPSGYSLIVDGRAQASDAG
PDDDTARCGVVPIRALLHRDAAPDSPTAAKGCLHDCVVFSVP
>Rv1727 CONSERVED HYPOTHETICAL PROTEIN
MDLYSNLVEAEQRLVALVSSIEADSYSSPTPCDRWDVRALLSHALASIDA
FAAAVDGAPGPDMAQVFSGADIVGDDPLGATQRITRRSQAAWSTVRDLNA
ELSTFIGVMPAGQALAIITFSTVVHGWDLAVATGQAGELPEHLAEAAQQV
AAELVPVLRPRGLFAHDVDLAGEATPTQRLVALTGRKPR
>Rv2348c HYPOTHETICAL PROTEIN
MLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPE
DGADVDVQAAEGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSV
IPHSPAAG
>Rv0424c HYPOTHETICAL PROTEIN
MAEKNTRRATSQREAVAKIREAETIVMNLPICGQVKIPRPEHLAYYGGLA
ALAALELIDWPVALVIATGHILANNHHNRVLEELGEAMEEA
>Rv0497 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MTGPHPETESSGNRQISVAELLARQGVTGAPARRRRRRRGDSDAITVAEL
TGEIPIIRDDHHHAGPDAHASQSPAANGRVQVGEAAPQSPAEPVAEQVAE
EPTRTVYWSQPEPRWPKSPPQDRRESGPELSEYPRPLRHTHSDRAPAGPP
SGAEHMSPDPVEHYPDLWVDVLDTEVGEAEAETEVREAQPGRGERHAAAA
AAGTDVEGDGAAEARVARRALDVVPTLWRGALVVLQSILAVAFGAGLFIA
FDQLWRWNSIVALVLSVMVILGLVVSVRAVRKTEDIASTLIAVAVGALIT
LGPLALLQSG
>Rv0610c HYPOTHETICAL PROTEIN
MDDELRGLLARYARGELSADDARRAILRYPKWRVAEIDGELETVALDDGT
PMLIAESSASDGREYSGLELVRDIAPLVGGLSFDPDEPWGSAFRPGALPE
LQNWARTVELEDAVAKPGPGQRDLLYEGPWWVAVSPGTGRPAVHRADGLD
VITIMTAPDAAATFRRTERHRGLDVVRLGPALWGDLAKRSDFDGVRLNPL
RPLAQLWPPHVPAMLVAGCDPRPNAEPLPARTVAEIHLWLDQHGARQEKR
ELSNRATPVGEVTVARAWWNYDRREIAFTRVAPASDTEGLGSVPSRILCA
GKLRQSIQSKLAGLPRLTWRADAWHRQRAALAVGWALELEKLVCGERVPF
AALRTPEGAHLWHLEPQAFTARAIRKLRDRAASFR
>Rv1951c CONSERVED HYPOTHETICAL PROTEIN
MKAGELRVNIQQVAATASQWSGRSTELSVLAPPPLGQPFQPTTAAVGGAH
AAVGLAVAAFTARTHATASAVEAAAAEYANNEAAAAAEMAAVPQTRLV
>Rv2175c conserved hypothetical regulatory protein
MPGRAPGSTLARVGSIPAGDDVLDPDEPTYDLPRVAELLGVPVSKVAQQL
REGHLVAVRRAGGVVIPQVFFTNSGQVVKSLPGLLTILHDGGYRDTEIMR
WLFTPDPSLTITRDGSRDAVSNARPVDALHAHQAREVVRRAQAMAY
>Rv2083 CONSERVED HYPOTHETICAL PROTEIN
MTSIESHPEQYWAAAGRPGPVPLALGPVHPGGPTLIDLLMALFGLSTNAD
LGGANADIEGDDTDRRAHAADAARKFSANEANAAEQMQGVGAQGMAQMAS
GIGGALSGALGGVMGPLTQLPQQAMQAGQGAMQPLMSAMQQAQGADGLAA
VDGARLLDSIGGEPGLGSGAGGGDVGGGGAGGTTPTGYLGPPPVPTSSPP
TTPAGAPTKSATMPPPGGASPASAHMGAAGMPMVPPGAMGARGEGSGQEK
PVEKRLTAPAVPNGQPVKGRLTVPPSAPTTKPTDGKPVVRRRILLPEHKD
FGRIAPDEKTDAGE
>Rv2655c POSSIBLE phiRv2 PROPHAGE PROTEIN
MADIPYGRDYPDPIWCDEDGQPMPPVGAELLDDIRAFLRRFVVYPSDHEL
IAHTLWIAHCWFMEAWDSTPRIAFLSPEPGSGKSRALEVTEPLVPRPVHA
INCTPAYLFRRVADPVGRPTVLYDECDTLFGPKAKEHEEIRGVINAGHRK
GAVAGRCVIRGKIVETEELPAYCAVALAGLDDLPDTIMSRSIVVRMRRRA
PTEPVEPWRPRVNGPEAEKLHDRLANWAAAINPLESGWPAMPDGVTDRRA
DVWESLVAVADTAGGHWPKTARATAETDATANRGAKPSIGVLLLRDIRRV
FSDRDRMRTSDILTGLNRMEEGPWGSIRRGDPLDARGLATRLGRYGIGPK
FQHSGGEPPYKGYSRTQFEDAWSRYLSADDETPEERDLSVSAVSAVSPPV
GDPGDATGATDATDLPEAGDLPYEPPAPNGHPNGDAPLCSGPGCPNKLLS
TEAKAAGKCRPCRGRAAASARDGAR
>Rv2432c HYPOTHETICAL PROTEIN
MTVRAEHCRGAGGCDECPSVMPEHPTALFHDVAAIALAQPGAEPGAMMGF
PCRPALLPHLSRAVMRCVRTRSASTSLGVSVIAGQLPAAGSRHRLGAPCR
HVRWWLASDGHWGMVSYIPTALNVSMGGIVGWRCVP
>Rv1134 HYPOTHETICAL PROTEIN
MAAYQKFGQEHAAAIRGGAVLHPTATATTVRVTGARGGDVVTGDGPYEAA
DLDEQGPFPMETVYLWEDGPNGTTRMTL
>Rv1155 CONSERVED HYPOTHETICAL PROTEIN
MARQVFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRKLLIQVS
IAEPRAKTRNLRRDPRASILVDADDGWSYAVAEGTAQLTPPAAAPDDDTV
EALIALYRNIAGEHSDWDDYRQAMVTDRRVLLTLPISHVYGLPPGMR
>Rv0098 CONSERVED HYPOTHETICAL PROTEIN
MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATE
DSVLAYGNFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIR
VLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCRDLQVIER
TWRYLKVPCVIEFWDENGGAASGEIELAALNIP
>Rv0680c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MKWNTVAASLAAGVITIAVALAAPPPAAHAKNGDTHVTGQGIERTLDCNE
STLLVNGTQNIVTALGTCWAVTVMGSSNTVVADTIINDITVYGWDETVFF
RNGDPFIWDRGRELGMVNRLQRVG
>Rv0660c CONSERVED HYPOTHETICAL PROTEIN
MLSFRADDHDVDLADAWARRLHIGRSELLRDALRRHLAALAADQDVQAYT
ERPLTDDENALAEIADWGPAEDWADWADAAR
>Rv3099c CONSERVED HYPOTHETICAL PROTEIN
MTTPGRPLTTLDKSDVLAGLFAVWHSLDALLDGLLETDWQATSPLPGWDV
KAVVSHIIGTESFLLGIAAPEPDTDVSALAHVRNPIGVMNECWVRHLGTE
SGVGLLERFRAVTSQRRKVLASLSDDEWNAPTTTPSGPDSYGRFMRIRIF
DCWMHEQDIRAAVQRPSSDDELGGPASPLVLDEIAATMGFVVGKLAKAPD
GSRVLLELTGPLSRSIRVSVDGRARVVDDFGGPAPTATIRLDGLQFTRLA
GGRPMSPARSQDVELGGDKELAGHILERLNFVI
>Rv2960c HYPOTHETICAL PROTEIN
MGRNATAVVSLPVVALSPRAGQAGYLWQSITRGLRVTPICCYHPPCGGGV
QKMLSRKLGRVCPAPSPKDAARGAHNVGANAV
>Rv2513 HYPOTHETICAL PROTEIN
MDDIAAFKLDSLPDITFTVTRAISSGGENPAGFLNFAARREQPEILGGGG
RPGPVGPEAVDTPRIRGGKVPFVFRTLPGYTFYASQIEPRVGDPEGPTLL
AGFGNIPETSQRSPGWIRITCTGPDDDEELEFFGFAGPES
>Rv1870c CONSERVED HYPOTHETICAL PROTEIN
MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLV
LCMLASKPIGAATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYV
RYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKRMLKTFNGIG
DTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN
ALLAAALVRVA
>Rv1954c HYPOTHETICAL PROTEIN
MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTH
PDGTSSAAAALVLRRIDVHPLLTGLGRGRQTVSLRNGHLVATANRAILSR
RRSRLTRGRSFTSHLITSCPRLDDHQHRHPTRCRAEHAGCTVATCIPNAH
DPAPGHQTPRWGPFRLKPAYTRI
>Rv3472 CONSERVED HYPOTHETICAL PROTEIN
MRPVDEQWIEILRIQALCARYCLTIDTQDGEGWAGCFTEDGAFEFDGWVI
RGRPALREYADAHARVVRGRHLTTDLLYEVDGDVATGRSASVVTLATAAG
YKILGSGEYQDRLIKQDGQWRIAYRRLRNDRLVSDPSVAVNVADADVAAV
VGHLLAAARRLGTQMSDT
>Rv0495c CONSERVED HYPOTHETICAL PROTEIN
MWRPAQGARWHVPAVLGYGGIPRRASWSNVESVANSRRRPVHPGQEVELD
FAREWVEFYDPDNPEHLIAADLTWLLSRWACVFGTPACQGTVAGRPNDGC
CSHGAFLSDDDDRTRLADAVHKLTDDDWQFRAKGLRRKGYLELDEHDGQP
QHRTRKHKGACIFLNRPGFAGGAGCALHSKALKLGVPPLTMKPDVCWQLP
IRRSQEWVTRPDGTEILKTTLTEYDRRGWGSGGADLHWYCTGDPAAHVGT
KQVWQSLADELTELLGEKAYGELAAMCKRRSQLGLIAVHPATRAAQ
>Rv0431 PUTATIVE TUBERCULIN RELATED PEPTIDE
MLVTVGSMNERVPDSSGLPLRAMVMVLLFLGVVFLLLVWQALGSSPNSED
DSSAISTMTTTTAAPTSTSVKPAAPRAEVRVYNISGTEGAAARTADRLKA
AGFTVTDVGNLSLPDVAATTVYYTEVEGERATADAVGRTLGAAVELRLPE
LSDQPPGVIVVVTG
>Rv0664 HYPOTHETICAL PROTEIN
MEKSRCHAVAHGGGCAGSAKSHKSGGRCGQGRGAGDSHGTRGAGRRYRAA
SAPHPLAVGAHLRDELAKRSADPRLTDELNDLAGHTLDDL
>Rv1137c HYPOTHETICAL PROTEIN
MLSARCHIRHIGSPGKDARCAHLSATLRPGIGISPTNVGNATVLADGTPA
KPIQGAETMQRARHTGSCFSANARGPAISSGNPSRAGCGVPSSTTTPSST
PQAIRLLACTDSDALTVTRTAR
>Rv2517c HYPOTHETICAL PROTEIN
MNSAIIKIAKWAQSQQWTVEDDASGYTRFYNPQGVYIARFPATPSNEYRR
MRDLLGALKKAGLTWPPPSKKERRAQHRKEGAQ
>Rv0997 HYPOTHETICAL PROTEIN
MAGIAGVDRDPPGWPQHSHLLAGDPERFRHQLQRAETTNSIECFVAEWHH
AGVAADMTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPT
TDIEHSVGAAEVQRHRGAVPLGSGGDAAGKVEGGRTPQPFVQP
>Rv1312 CONSERVED HYPOTHETICAL SECRETED PROTEIN
MSAPMIGMVVLVVVLGLAVLALSYRLWKLRQGGTAGIMRDIPAVGGHGWR
HGVIRYRGGEAAFYRLSSLRLWPDRRLSRRGVEIISRRAPRGDEFDIMTD
EIVVVELCDSTQDRRVGYEIALDRGALTAFLSWLESRPSPRARRRSM
>Rv0760c CONSERVED HYPOTHETICAL PROTEIN
MTQTTQSPALIASQSSWRCVQAHDREGWLALMADDVVIEDPIGKSVTNPD
GSGIKGKEAVGAFFDTHIAANRLTVTCEETFPSSSPDEIAHILVLHSEFD
GGFTSEVRGVFTYRVNKAGLITNMRGYWNLDMMTFGNQE
>Rv3076 CONSERVED HYPOTHETICAL PROTEIN
MVLDGVVSDTRRSRTIAARQQTIWDVLADFGSLSSWVEGVDHSCVLNHGP
DGGALGSTRRVQVGRNTLVERVIEFDPPTTLAYRIEGLPARLRKVTNRWT
LRPADPVGAVTVVTLTSTIEIGGNPLARLAELVVGRAMAKRSNTMLAGLA
QRLEDKHG
>Rv2078 HYPOTHETICAL PROTEIN
MFVDVELLHSGANESHYAGEHAHGGADQLSRGPLLSGMFGTFPVAQTFHD
AVGAAHAQQMRNLHAHRQALITVGEKARHAATGFTDMDDGNAAELKAVVC
SCAT
>Rv0026 CONSERVED HYPOTHETICAL PROTEIN
MAFDAAMSTHEDLLATIRYVRDRTGDPNAWQTGLTPTEVTAVVTSTTRSE
QLDAILRKIRQRHSNLYYPAPPDREQGDAARAIADAEAALAHQNSATAQL
DLQVVSAILNAHLKTVEGGESLHELQQEIEAAVRIRSDLDTPAGARDFQR
FLIGKLKDIREVVATASLDAASKSALMAAWTSLYDASKGDRGDADDRGPA
SVGSGGAPARGAGQQPELPTRAEPDCLLDSLLLEDPGLLADDLQVPGGTS
AAIPSASSTPSLPNLGGATMPGGGATPALVPGVSAPGGLPLSGLLRGVGD
EPELTDFDERGQEVRDPADYEHSNEPDERRADDREGADEDAGLGKSESPP
QAPTTVTLPNGETVTAASPQLAAAIKAAASGTPIADAFQQQGIAIPLPGT
AVANPVDPARISAGDVGVFTATPLPLALAKLFWTARFNTSQPCEGQTF
>Rv0616c HYPOTHETICAL PROTEIN
MRIPGNRQCLLVQVLRQVDGSAHRLILTSLHRDARADAHRYSNGTDHAGR
AADEPAETAHEPCWVAARGLASQASRAMSATYRPSSFI
>Rv3222c CONSERVED HYPOTHETICAL PROTEIN
MSSPVSSRRLANLVKESLQGSVLGGVVSDAVLPAVSDDVKPGAGEDAYRV
PVVVAAGSGAVVQVGGLEVGSAAVAGEVADTVAELFVCRPTEPDVGDFVG
LAGGAGDAGQAGQQFGLGVGVRGESFGARRRLALSTVGASGATAGLRKTH
DGHHGCQARGALTQRRLYIGNPSEITDTRMVHQ
>Rv1598c CONSERVED HYPOTHETICAL PROTEIN
MSAKDHPNNAPGVPMVFPLWLERLQVKYINRALKPIARYLPGTATIEHRG
RKSGKPYQTIVTAYRKDGVLAIALAHGKTDWVKNVLAAGEADVHFARGVV
HVINPRIVPAGSDGQGLPRMARLQLRRIGVFVGDIA
>Rv1045 HYPOTHETICAL PROTEIN
MTKPYSSPPTNLRSLRDRLTQVAERQGVVFGRLQRHVAMIVVAQFAATLT
DDTGAPLLLVKGGSSLELRRGIPDSRTSKDFDTVARRDIELIHEQLADAG
ETGWEGFTAIFTAPEEIDVPGMPVKPRRFTAKLSYRGRAFATVPIEVSSV
EAGNADQFDTLTSDALGLVGVPAAVAVPCMTIPWQIAQKLHAVTAVLEEP
KVNDRAHDLVDLQLLEGLLLDADLMPTRSACIAIFEARAQHPWPPRVATL
PHWPLIYAGALEGLDHLELARTVDAAAQAVQRFVARIDRATKR
>Rv2128 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MLRRGESIIRNRYASKPPLYGMAMVFLAMAVVAVTAYFRMGWWSIIGYAA
AAIIGVIGFALAFRDLS
>Rv2023c HYPOTHETICAL PROTEIN
MAARHARAGRWAAQPRPMLGSGAVRYEVGANIDATGFGGIAAVHRLVTRL
GLVTRLGLVERVDAHSRFSSSNLPKSSRRISGRVSLSGMSNSAAKVVAST
SSSPWGQPLSVGLRRRWRS
>Rv0885 CONSERVED HYPOTHETICAL PROTEIN
MDRTRIVRRWRRNMDVADDAEYVEMLATLSEGSVRRNFNPYTDIDWESPE
FAVTDNDPRWILPATDPLGRHPWYQAQSRERQIEIGMWRQANVAKVGLHF
ESILIRGLMNYTFWMPNGSPEYRYCLHESVEECNHTMMFQEMVNRVGADV
PGLPRRLRWVSPLVPLVAGPLPVAFFIGVLAGEEPIDHTQKNVLREGKSL
HPIMERVMSIHVAEEARHISFAHEYLRKRLPRLTRMQRFWISLYFPLTMR
SLCNAIVVPPKAFWEEFDIPREVKKELFFGSPESRKWLCDMFADARMLAH
DTGLMNPIARLVWRLCKIDGKPSRYRSEPQRQHLAAAPAA
>Rv3881c CONSERVED HYPOTHETICAL ALANINE AND GLYCINE RICH PROTEIN
MTQSQTVTVDQQEILNRANEVEAPMADPPTDVPITPCELTAAKNAAQQLV
LSADNMREYLAAGAKERQRLATSLRNAAKAYGEVDEEAATALDNDGEGTV
QAESAGAVGGDSSAELTDTPRVATAGEPNFMDLKEAARKLETGDQGASLA
HFADGWNTFNLTLQGDVKRFRGFDNWEGDAATACEASLDQQRQWILHMAK
LSAAMAKQAQYVAQLHVWARREHPTYEDIVGLERLYAENPSARDQILPVY
AEYQQRSEKVLTEYNNKAALEPVNPPKPPPAIKIDPPPPPQEQGLIPGFL
MPPSDGSGVTPGTGMPAAPMVPPTGSPGGGLPADTAAQLTSAGREAAALS
GDVAVKAASLGGGGGGGVPSAPLGSAIGGAESVRPAGAGDIAGLGQGRAG
GGAALGGGGMGMPMGAAHQGQGGAKSKGSQQEDEALYTEDRAWTEAVIGN
RRRQDSKESK
>Rv2628 HYPOTHETICAL PROTEIN
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSA
TIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIG
DWPAAYAIGEHLSVEIAVAV
>Rv3289c POSSIBLE TRANSMEMBRANE PROTEIN
MHEVGGPSRGDRLGRDDSEVHSAIRFAVVAAVVGVGFLIMGALLVSTCSG
VDTAACGPPQRILLALGGPLILCAAGLWAFLRTYRVWRAEGTWWGWHGAG
WFLLTLMVLTLCIGVPPIAGPVMAP
>Rv2660c HYPOTHETICAL PROTEIN
MIAGVDQALAATGQASQRAAGASGGVTVGVGVGTEQRNLSVVAPSQFTFS
SRSPDFVDETAGQSWCAILGLNQFH
>Rv1782 PROBABLE CONSERVED MEMBRANE PROTEIN
MAEESRGQRGSGYGLGLSTRTQVTGYQFLARRTAMALTRWRVRMEIEPGR
RQTLAVVASVSAALVICLGALLWSFISPSGQLNESPIIADRDSGALYVRV
GDRLYPALNLASARLITGRPDNPHLVRSSQIATMPRGPLVGIPGAPSSFS
PKSPPASSWLVCDTVATSSSIGSLQGVTVTVIDGTPDLTGHRQILSGSDA
VVLRYGGDAWVIREGRRSRIEPTNRAVLLPLGLTPEQVSQARPMSRALFD
ALPVGPELLVPEVPNAGGPATFPGAPGPIGTVIVTPQISGPQQYSLVLGD
GVQTLPPLVAQILQNAGSAGNTKPLTVEPSTLAKMPVVNRLDLSAYPDNP
LEVVDIREHPSTCWWWERTAGENRARVRVVSGPTIPVAATEMNKVVSLVK
ADTSGRQADQVYFGPDHANFVAVTGNNPGAQTSESLWWVTDAGARFGVED
SKEARDALGLTLTPSLAPWVALRLLPQGPTLSRADALVEHDTLPMDMTPA
ELVVPK
>Rv2369c HYPOTHETICAL PROTEIN
MIVGLADRHGHGRDVAAHRQAQLAGPRVAAVRRHRTGGHRQASSRIKVSA
HGLGVVRCAPTPSLTGVRMKLQHSSVRQVPVDRPESRHQKPGDVPRDPRC
>Rv3605c PROBABLE CONSERVED SECRETED PROTEIN
MGPTRKRDLTAAVVGAAAVGYLLVAVLYRWFPPITVWTGLSLLAVAVAEA
LWARYVRVKISDGEIGDGPGWLHPLVVARSLMVAKASAWVGALVTGWWIG
VLAYFLPRRSWLRAAAEDTTGTVVAAGSALALVVAALWLQHCCKSPQDPT
EHADGAES
>Rv3278c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MSYPENVLAAGEQVVLHRHPHWNRLIWPVVVLVLLTGLAAFGSGFVNSTP
WQQIAKNVIHAVIWGIWLVIVGWLTLWPFLSWLTTHFVVTNRRVMFRHGV
LTRSGIDIPLARINSVEFRDRIFERIFRTGTLIIESASQDPLEFYNIPRL
REVHALLYHEVFDTLGSDESPS
>Rv0049 CONSERVED HYPOTHETICAL PROTEIN
MDYTLRRRSLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSRVICPICR
KEQLTLVSWVFGEHLGAVSGSARTAEELILLATRFSEFAVHVVEVCRTCS
WNHLVKSYVLGAARPARPPRGSGGTRTARNGARTASE
>Rv2473 POSSIBLE ALANINE AND PROLINE RICH MEMBRANE PROTEIN
MAPTSSSVASELLMPWPSAAASGVVGWRTTATASQRYHRPMSDTPFAEPY
PEQRPPWGVPPPGWDGSSRPAPSTTPRSPGRWSLVAALALAVVSLGVGIV
GWFHRQPHDKPSPAPSAPTFTSQQISDAKENVCAAHRIVRQAAVLNTNQA
NPVPGDPTGDLAVAANARLALYSGGDYLLRRLTAEPATPAELRDAVRSLA
NALQELAVNYLAGAPDSVVTPLRLALERDTRAVDPLCV
>Rv0290 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MSGTVMQIVRVAILADSRLTEMALPAELPLREILPAVQRLVVPSAQNGDG
GQADSGAAVQLSLAPVGGQPFSLDASLDTVGVVDGDLLVLQPVPAGPAAP
GIVEDIADAAMIFSTSRLKPWGIAHIQRGALAAVIAVALLATGLTVTYRV
ATGVLAGLLAVAGIAVASALAGLLITIRSPRSGIALSIAALVPIGAALAL
AVPGKFGPAQVLLGAAGVAAWSLIALMIPSAERERVVAFFTAAAVVGASV
ALAAGAQLLWQLPLLSIGCGLIVAALLVTIQAAQLSALWARFPLPVIPAP
GDPTPSAPPLRLLEDLPRRVRVSDAHQSGFIAAAVLLSVLGSVAIAVRPE
ALSVVGWYLVAATAAAATLRARVWDSAACKAWLLAQPYLVAGVLLVFYTA
TGRYVAAFGAVLVLAVLMLAWVVVALNPGIASPESYSLPLRRLLGLVAAG
LDVSLIPVMAYLVGLFAWVLNR
>Rv2700 POSSIBLE CONSERVED SECRETED ALANINE RICH PROTEIN
MVAQITEGTAFDKHGRPFRRRNPRPAIVVVAFLVVVTCVMWTLALTRPPD
VREAAVCNPPPQPAGSAPTNLGEQVSRTDMTDVAPAKLSDTKVHVLNASG
RGGQAADIAGALQDLGFAQPTAANDPIYAGTRLDCQGQIRFGTAGQATAA
ALWLVAPCTELYHDSRADDSVDLALGTDFTTLAHNDDIDAVLANLRPGAT
EPSDPALLAKIHANSC
>Rv0964c HYPOTHETICAL PROTEIN
MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIK
MRLQVIRDAAREHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRAS
NVCWPTPRPPMRIWPRRFDAPPGPCRASRSMPNSAMRHPQCRRCRRRTAT
LRRSSGGGIR
>Rv2081c POSSIBLE TRANSMEMBRANE PROTEIN
MFANAGLSPFVAIWTARAASLYTSHNFWCAAAVSAAVYVGSAVVPAAVAG
PLFVGRVSATIKAAAPSTTAAIATLATAANGQLRERGGAGGWVGVHCPVV
GGGGVGHPRKAIAAAVSVHSTCMPAAFGGHLGLGDRSRSVSLSGTP
>Rv0461 PROBABLE TRANSMEMBRANE PROTEIN
MPDFDTGAHSQRFLSLAGQQDRAGKSWPGSTPKPQEDPVGVAPSASVEVL
GSEPAATLAHSVTVPGRYTYLKWWKFVLVVLGVWIGAGEVGLSLFYWWYH
TLDKTAAVFVVLVYVVACTVGGLILALVPGRPLITALSLGVMSGPFASVA
AAAPLYGYYYCERMSHCLVGVIPY
>Rv0292 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MNPIPSWPGRGRVTLVLLAVVPVALAYPWQSTRDYVLLGVAAAVVIGLFG
FWRGLYFTTIARRGLAILRRRRRIAEPATCTRTTVLVWVGPPASDTNVLP
LTLIARYLDRYGIRADTIRITSRVTASGDCRTWVGLTVVADDNLAALQAR
SARIPLQETAQVAARRLADHLREIGWEAGTAAPDEIPALVAADSRETWRG
MRHTDSDYVAAYRVSANAELPDTLPAIRSRPAQETWIALEIAYAAGSSTR
YTVAAACALRTDWRPGGTAPVAGLLPQHGNHVPALTALDPRSTRRLDGHT
DAPADLLTRLHWPTPTAGAHRAPLTNAVSRT
>Rv0250c CONSERVED HYPOTHETICAL PROTEIN
MSTTAELAELHDLVGGLRRCVTALKARFGDNPATRRIVIDADRILTDIEL
LDTDVSELDLERAAVPQPSEKIAIPDTEYDREFWRDVDDEGVGGHRY
>Rv3678A CONSERVED HYPOTHETICAL PROTEIN
MTQPTAWEYATVPLLTHATKQILDQWGADGWELVAVLPGPTGEQHVAYLK
RPK
>Rv0289 CONSERVED HYPOTHETICAL PROTEIN
MDATPNAVELTVDNAWFIAETIGAGTFPWVLAITMPYSDAAQRGAFVDRQ
RDELTRMGLLSPQGVINPAVADWIKVVCFPDRWLDLRYVGPASADGACEL
LRGIVALRTGTGKTSNKTGNGVVALRNAQLVTFTAMDIDDPRALVPILGV
GLAHRPPARFDEFSLPTRVGARADERLRSGVPLGEVVDYLGIPASARPVV
ESVFSGPRSYVEIVAGCNRDGRHTTTEVGLSIVDTSAGRVLVSPSRAFDG
EWVSTFSPGTPFAIAVAIQTLTACLPDGQWFPGQRVSRDFSTQSS
>Rv0910 CONSERVED HYPOTHETICAL PROTEIN
MAKLSGSIDVPLPPEEAWMHASDLTRYREWLTIHKVWRSKLPEVLEKGTV
VESYVEVKGMPNRIKWTIVRYKPPEGMTLNGDGVGGVKVKLIAKVAPKEH
GSVVSFDVHLGGPALLGPIGMIVAAALRADIRESLQNFVTVFAG
>Rv1271c CONSERVED HYPOTHETICAL SECRETED PROTEIN
MLSPLSPRIIAAFTTAVGAAAIGLAVATAGTAGANTKDEAFIAQMESIGV
TFSSPQVATQQAQLVCKKLASGETGTEIAEEVLSQTNLTTKQAAYFVVDA
TKAYCPQYASQLT
>Rv2972c POSSIBLE CONSERVED MEMBRANE OR EXPORTED PROTEIN
MNRRTLLWLSAIAALALVVAYQTLGSSAGRHADEFAARAGVPTVQPGADV
LAGIAVLPKRIHRYDYRRSAFGHPWDDRNDAPGGHNGCDTRDDILDRDLV
DKTYVSIKRCPNAVATGTLRDPYTNTTVAFQRGASVGQSVQIDHIVPLSY
AWDMGAYRWPNSERMRFANDPANLLAVQGQANQDKGDSPPAQWMPPNKAF
ACQYAMQFIAVLRGYSLPVDQPSSDVLRQAAATCPTG
>Rv1883c CONSERVED HYPOTHETICAL PROTEIN
MCLDQVMEGSATVHMAAPPDKIWTLIADVRNTGRFSPETFEAEWLDGATG
PALGARFRGHVRRNGIGPVYWTVCEPGREFGFAVLLGDRPVNNWHYRLTP
TADGTEVTESFRLPPSVLTTVYYRVFGGWLRQRRNIRDMTKTLQRIKDLV
EAG
>Rv0177 PROBABLE CONSERVED MCE ASSOCIATED PROTEIN
MSPRRKFEPGEGALLAPQSIEPSRRWGLPLALTASAVVMAAAISACALMR
ISHESHQRAAHKDIVMLSDVRSFMTMFTSPDPFHANEYAERVLSHATGDF
AKQYHERANDILIRISGVEPTTGTVLDAGVQRWNEDGSANVLVVTQITSK
SADGKRVVSNANRWLVTAKQEGNEWKISSLLPVI
>Rv2797c CONSERVED HYPOTHETICAL PROTEIN
MPLTVADIDRWNAQAVREVFHAASARAEVTFEASRQLAALSIFANSGGKT
AEAAAHHNAGIRRDLDAHGNEALAVARAADRAADGIVKVQSELAALRHAA
AAAELTIDALINRVVPIPGLRSTEAQWARTLAKQTELQAELDAIMAEANA
VDEELASAVNMADGDAPIPADSGPPVGPEGLTPTQLASDANEERLREERA
RLQAHLERLQAEYDQLSVRAARDYHNGILDGDAVGRLAALTDELSAARGR
LGELDAVDEALSRAPETYLTQLQIPEDPNQQVLAAVAVGNPDTAANVSVT
VPGVGSTTRGALPGMVTEARDLRSEVIRQLNAAGKPASVATIAWMGYHPP
PNPLDTGSAGDLWQTMTDGQAHAGAADLSRYLQQVRANNPSGHLTVLGHS
YGSLTASLALQDLDAQSAHPVNDVVFYGSPGLELYSPAQLGLDHGHAYVM
QAPHDLITNLVAPLAPLHGWGLDPYLTPGFTELSSQAGFDPGGIWRDGVY
AHGDYPRSFLDAAGQPQLRMSGYNLAAIAAGLPDNTVGPPLLPPILGGGM
PAAPGPALRGGR
>Rv2451 HYPOTHETICAL PROLINE AND SERINE RICH PROTEIN
MGRAVSVRHGSGALDLPGAAASRRLRVGQPIQPSPAPLARGSVDSIVEIS
CCPSAGPRGPYDNDLDSSSPANRDISSITSRSRRGGTIVVAGQKCGFGSA
VSLRPRRYREPNHANIVTPDTDLSPSWPWSGI
>Rv3885c POSSIBLE CONSERVED MEMBRANE PROTEIN
MTSKLTGFSPRSARRVAGVWTVFVLASAGWALGGQLGAVMAVVVGVALVF
VQWWGQPAWSWAVLGLRGRRPVKWNDPITLANNRSGGGVRVQDGVAVVAV
QLLGRAHRATTVTGSVTVESDNVIDVVELAPLLRHPLDLELDSISVVTFG
SRTGTVGDYPRVYDAEIGTPPYAGRRETWLIMRLPVIGNTQALRWRTSVG
AAAISVAQRVASSLRCQGLRAKLATATDLAELDRRLGSDAVAGSAQRWKA
IRGEAGWMTTYAYPAEAISSRVLSQAWTLRADEVIQNVTVYPDATCTATI
TVRTPTPAPTPPSVILRRLNGEQAAAAAANMCGPRPHLRGQRRCPLPAQL
VTEIGPSGVLIGKLSNGDRLMIPVTDAGELSRVFVAADDTIAKRIVIRVV
GAGERVCVHTRDQERWASVRMPQLSIVGTPRPAPRTTVGVVEYVRRRKNG
DDGKSEGSGVDVAISPTPRPASVITIARPGTSLSESDRHGFEVTIEQIDR
ATVKVGAAGQNWLVEMEMFRAENRYVSLEPVTMSIGR
>Rv1945 CONSERVED HYPOTHETICAL PROTEIN
MRSDTREEISAALDAYHASLSRVLDLKCDALTTPELLACLQRLEVERRRQ
GAAEHALINQLAGQACEEELGGTLRTALANRLHITPGEASRRIAEAEDLG
ERRALTGEPLPAQLTATAAAQREGKIGREHIKEIQAFFKELSAAVDLGIR
EAAEAQLAELATSRRPDHLHGLATQLMDWLHPDGNFSDQERARKRGITMG
KQEFDGMSRISGLLTPELRATIEAVLAKLAAPGACNPDDQTPVVDDTPDA
DAVRRDTRSQAQRHHDGLLAGLRGLLASGELGQHRGLPVTVVVSTTLKEL
EAATGKGVTGGGSRVPMSDLIRMASNAHHYLALFDGAKPLALYHTKRLAS
PAQRIMLYAKDRGCSRPGCDAPAYHSEVHHVTPWTTTHRTDINDLTLACG
PDNRLVEKGWKTRKNAKGDTEWLPPAHLDHGQPRINRYHHPEKILCEPDD
DEPH
>Rv0541c PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MRIGRREGLAVAIGFVLVGAAFVLPRLNLGIKPRSDIGLERFATRAGAAP
IFGYWDAHVGWGTAPAVLTAVAVVAWGPVVAHRLPWRVLTLSTWATAAAW
AFSLAMIDGWQRGFAGRLTTRDEYLWQVPGIADIPATLRTFTSRILDFQP
NSWVTHVSGHPPGALLTFVWLDRIGLRGGGWAGLVCLLVGSSAAAAVLIA
VRVLASEQMARRTAPFVAVAPTAIWIAVSADGYFAGVAAWGIALLAVAVH
GATRFPALVAAGAGLLLGWGVFLNYGLVLIVLPGMAVLAAADWRPVLRAL
GPAVLAALVVAVSFAVAGFSWFDGYTLVQQRYWQGIAKDRPFGYWSWANL
ACVVCAIGLGSVAGLSRVFDRAAISRRSGCHLLLLAVLAAIALADLSMLS
KAETERIWLPFTIWLTAAPALLPPRSHRLWLAVNAAGALLLNSIIFTNW
>Rv3346c CONSERVED HYPOTHETICAL PROTEIN
MTVRAVLRRTVGAQWPILAGVNFWRRGALLIGIGVGVAAVLRLVLSEERA
GLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG
>Rv3655c CONSERVED HYPOTHETICAL PROTEIN
MEAALAIATLVLVLVLCLAGVTAVSMQVRCIDAAREAARLAARGDVRSAT
DVARSIAPRAALVQVHRDGEFVVATVTAHSNLLPTLDIAARAISVAEPGS
TAARPPCLPSRWSRCCCASPVRVHI
>Rv2481c HYPOTHETICAL PROTEIN
MALRRRHEPDGWPFSQRSEKPNAVRHAVRCSAVSAAASTANGTPVNWVSG
RVTRAMGVHRQTRGGVASVHADSLRGAVLVHGQLRNSIPISANVPASGAN
TKSSIAH
>Rv3281 CONSERVED HYPOTHETICAL PROTEIN
MGTCPCESSERNEPVSRVSGTNEVSDGNETNNPAEVSDGNETNNPAEVSD
GNETNNPAPVSRVSGTNEVSDGNETNNPAPVSRVSGTNEVSDGNETNNPA
PVTEKPLHPHEPHIEILRGQPTDQELAALIAVLGSISGSTPPAQPEPTRW
GLPVDQLRYPVFSWQRITLQEMTHMRR
>Rv3668c POSSIBLE PROTEASE
MQTAHRRFAAAFAAVLLAVVCLPANTAAADDKLPLGGGAGIVVNGDTMCT
LTTIGHDKNGDLIGFTSAHCGGPGAQIAAEGAENAGPVGIMVAGNDGLDY
AVIKFDPAKVTPVAVFNGFAINGIGPDPSFGQIACKQGRTTGNSCGVTWG
PGESPGTLVMQVCGGPGDSGAPVTVDNLLVGMIHGAFSDNLPSCITKYIP
LHTPAVVMSINADLADINAKNRPGAGFVPVPA
>Rv2562 CONSERVED HYPOTHETICAL PROTEIN
MAEQKVKRNVELAGVDVILVHRMLKNEVPVSEYLFMTDVVAQCLDESVRK
LATPLTHDFEGIGETSTHYIDLATSDMPPAVPDHSFFGLLWADVKFEWHA
LPYLLGFKKACAGFRSLGRGATEEPAEMG
>Rv0689c HYPOTHETICAL PROTEIN
MLGWTVKPGRVADGWQAPGVHLMARCSGPQPASERRADMDGGDIDAAVAR
VRAAGALAEPSRQPDDMSAECADDQGARCHLGQL
>Rv3768 HYPOTHETICAL PROTEIN
MGSTPPRTPQEVFAHHGQALAAGDLDEIVADYADDSFVITPAGIARGKEG
IRQLFVKLLDDIPNALWDLKTQIFEGDILFLEWTANSAVSRVDDGVDTFV
FRDGTIWAHTVRYTPHPKT
>Rv0028 HYPOTHETICAL PROTEIN
MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILL
TADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRN
R
>Rv1949c CONSERVED HYPOTHETICAL PROTEIN
WLRQRTGADLQIVSGIAEHLRQASGLAREGAGTIGAAQRRVIYAVQDAHN
AGFNVEEDLSVTDTRTSRTFAEQAARQAQAQALAGDIRQRATQLIGVEHE
VAAKIATATAPLNTVGFHEPPIAPSLPTPVPHNEKPQIHAVDRSWKQDPP
SPMPGDPKDMTAVQARAAWDAVNADIARYNARCGRTFVLPNEQAAYDACI
ADKGSLFERQAAIRARLGELGVPVEGEPPPAPDPAGPQPNEGLPPPGVSP
PAESNLTVGPPSRPIQQARGGESLWDENGGEWRYFPGDNYRYPHWDYNPH
DSPTARWQNIPIGDLPTHK
>Rv0333 HYPOTHETICAL PROTEIN
MTTSEIATVLAWHDALNAADIETLVALSTDDIDIGDAHGAVQGHDALRGW
ASSLTTTAELGRMYVHHGVVVVEQKITSGEDPGIARTGAAAFRVVQDHVA
SVFRHEDLASALAATELTEDDLVD
>Rv2283 HYPOTHETICAL PROTEIN
MLEKCPHASVDCGASKIGITDNDPATATNRRLASTIRKPPIEHAAGPLGS
TSRAGHRSYGGVAS
>Rv0968 CONSERVED HYPOTHETICAL PROTEIN
MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRL
AREAERKAGESAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH
>Rv2536 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MTNWMLRGLAFAAAMVVLRLFQGALINAWQMLSGLISLVLLLLFAIGGVV
WGVMDGRADAKASPDPDRRQDLAMTWLLAGLVAGALSGAVAWLISLFYKA
IYTGGPINELTTFAAFTALIVFLVGIVGVAVGRWLVDRQLAKAPVRHHGL
AAEHERAADTDVFSAVRADDSPTGEMQVAQPEAQTAAVATVEREAPTEVI
RTTESDTPTEVIRTDTEADQTKPGDEPKKD
>Rv3766 HYPOTHETICAL PROTEIN
MRSAFDSGRLTFGIVYTYARPNWWANANTVRSMIDAAGGLHPRVALMLDV
ESGGNPPGDGSSWINRLYWNLADYAGSPVRIIGYANAYDFFNMWRVRPAG
LRVIGAGYGSNPNLPGQVAHQYTDGSGYSPNLPQGAPPFGRCDMNSANGL
TPQQFAAACGVTTTGGPLMALTDEEQTELLTKVREIWDQLRGPNGAGWPQ
LGQNEQGQDLTPVDAIAVIKNDVAAMLAE
>Rv3547 CONSERVED HYPOTHETICAL PROTEIN
MPKSPPRFLNSPLSDFFIKWMSRINTWMYRRNDGEGLGGTFQKIPVALLT
TTGRKTGQPRVNPLYFLRDGGRVIVAASKGGAEKNPMWYLNLKANPKVQV
QIKKEVLDLTARDATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCE
P
>Rv2199c Possible conserved integral membrane protein
MHIEARLFEFVAAFFVVTAVLYGVLTSMFATGGVEWAGTTALALTGGMAL
IVATFFRFVARRLDSRPEDYEGAEISDGAGELGFFSPHSWWPIMVALSGS
VAAVGIALWLPWLIAAGVAFILASAAGLVFEYYVGPEKH
>Rv1036c PROBABLE IS1560 TRANSPOSASE (FRAGMENT)
MIPGRMVLNWEDGLNALVAEGIEAIVFRTLGDQCWLWESLLPDEVRRLPE
ELARVDALLDDPAFFAPFVPFFDPRRGRPSTPMEVYLQLMFVKFRYRLGY
ESLCREVADSIT
>Rv3087 CONSERVED HYPOTHETICAL PROTEIN
MRRLNGVDALMLYLDGGSAYNHTLKISVLDPSTDPDGWSWPKARQMFEER
AHLLPVFRLRYLPTPLGLHHPIWVEDPEFDLDAHVRRVVCPAPGGMAEFC
ALVEQIYAHPLDRDRPLWQTWVVEGLDGGRVALVTLLHHAYSDGVGVLDM
LAAFYNDTPDEAPVVAPPWEPPPLPSTRQRLGWALRDLPSRLGKIAPTVR
AVRDRVRIEREFAKDGDRRVPPTFDRSAPPGPFQRGLSRSRRFSCESFPL
AEVREVSKTLGVTINDVFLACVAGAVRRYLERCGSPPTDAMVATMPLAVT
PAAERAHPGNYSSVDYVWLRADIADPLERLHATHLAAEATKQHFAQTKDA
DVGAVVELLPERLISGLARANARTKGRFDTFKNVVVSNVPGPREPRYLGR
WRVDQWFSTGQISHGATLNMTVWSYCDQFNLCVMADAVAVRNTWELLGGF
RASHEELLAAARAQATPKEMAT
>Rv0954 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MTYSPGNPGYPQAQPAGSYGGVTPSFAHADEGASKLPMYLNIAVAVLGLA
AYFASFGPMFTLSTELGGGDGAVSGDTGLPVGVALLAALLAGVALVPKAK
SHVTVVAVLGVLGVFLMVSATFNKPSAYSTGWALWVVLAFIVFQAVAAVL
ALLVETGAITAPAPRPKFDPYGQYGRYGQYGQYGVQPGGYYGQQGAQQAA
GLQSPGPQQSPQPPGYGSQYGGYSSSPSQSGSGYTAQPPAQPPAQSGSQQ
SHQGPSTPPTGFPSFSPPPPVSAGTGSQAGSAPVNYSNPSGGEQSSSPGG
APV
>Rv0492A HYPOTHETICAL PROTEIN
MSFLLDPPLLFVCGVLIERRLPVDRRDAAEAAALGVFFGASFGLYHNVPG
LGMLWRPFRAQNGRDFMWNSGVFSVDVARAEWPLHAMAAAIFATYPFFIK
LGRRLGRRI
>Rv0310c CONSERVED HYPOTHETICAL PROTEIN
MCCNGVVTPGDPADIAAIKQLKYRYLRALDTKHWDDFTDTLAEDVTGDYG
SSVGTELHFTNRADLVDYLRQALGPGVITEHRVTHPEITVTGDTATGIWY
LQDRVIVAEFNFMLIGAAFYHDQYRRTTDGWRISATGYDRTYEATMSLAG
LNFNIRPGRALAD
>Rv2256c CONSERVED HYPOTHETICAL PROTEIN
MEPKEQQMRASNQFADVTSGVVYIHASPAAVCPHVEWALSSTLQAKANLV
WTPQPALPPQLRAVTNWVGPVGTGARLANALRSWSVLRFEVTEDPSPGVD
GQRFSHTPQLGLWSGAMSANGDIMVGEMRLRAMMAQGADTLAAELDSVLG
TAWDQALEVYRDGGDAGEVTWLSRGVG
>Rv3333c HYPOTHETICAL PROLINE RICH PROTEIN
MFTGIASHAGALGAALVVLIGAAILHDGPAAADPNQDDRFLALLEKKEIP
AVANVPRVIDAAHKVCRKLDGGMPVNDIVDGLRNDAYNIDPVMRLYPVRL
TTTMTRFISAAVEIYCPNHHSKMAFAMANFEPGSNEPTHRVAASTRSAVN
SGSDLRASVSDMTIMSPGWREPTGAMLASVLGAVRAGDPLIPNPPPIPVP
PPAAQTLIPPPPIVAPPPPRPAPPQQPPPPPPEVEPPAGVPQSGGAAGSG
GAGSGGGGGGDGPVEPSPARPMPPGFIRLAP
>Rv3770c HYPOTHETICAL LEUCINE RICH PROTEIN
MLSGIQQNTLMDNDPLAHGYYVADLLVALAVVVLMLRARRTRPELARMLL
LGTLIGLVWELPVFGLSAWTNTPIIEWATPLPLPTVVFLLAHSVWDGPLL
TMGWLLARALTGEPAGALGLTVQVLWGQLTALAVELSAILAGTWSYVDDL
WFNPVMFWFRGHPVTAAMQLTWLLAPLCFAALVRRLALTAR
>Rv1587c Partial REP13E12 repeat protein
MLAKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGLLAGLRA
LIASGKLGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMT
SHAHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTK
PGCDAPAYHSQAHHVTAWTSTGRTDITELTLACGPDNRLAEKGWTTHNNT
HGHTEWLPPPHLDHGQPWTCEIHYTCACCCLPPNLRRPLRRTARRGPPTR
GLPKAVRAAKMGARRVPRQRRQRINRQAPPRLRADVGRHHRRQDRRRGGL
GPGPAPSPSHRAGSLHVISRREAAGPGHRRRRR
>Rv1804c CONSERVED HYPOTHETICAL PROTEIN
MRVVSTLLSIPLMIGLAVPAHAGPSGDDAVFLASLERAGITYSHPDQAIA
SGKAVCALVESGESGLQVVNELRTRNPGFSMDGCCKFAAISAHVYCPHQI
TKTSVSAK
>Rv2248 CONSERVED HYPOTHETICAL PROTEIN
MTRQQLDVQVKNGGLVRVWYGVYAAQEPDLLGRLAALDVFMGGHAVACLG
TAAALYGFDTENTVAIHMLDPGVRMRPTVGLMVHQRVGARLQRVSGRLAT
APAWTAVEVARQLRRPRALATLDAALRSMRCARSEIENAVAEQRGRRGIV
AARELLPFADGRAESAMESEARLVMIDHGLPLPELQYPIHGHGGEMWRVD
FAWPDMRLAAEYESIEWHAGPAEMLRDKTRWAKLQELGWTIVPIVVDDVR
REPGRLAARIARHLDRARMAG
>Rv1887 HYPOTHETICAL PROTEIN
MDTVLGLSITPTTLGWVLAEGHGADGAILDRNELELHSGRNAQAIHTAEQ
LAAEVLLAHEVAAAGDHRLRVIGVTWNAEASAQAALLVESLTGAGFDNVV
PVRRLRAIETLAQAIAPVIGYEQIAVCVLEHESATVVMVDTHDGKTQIAV
KHVCRGLSGLTSWLTGMFGRDAWRPAGVVVVGSDSEVSEFSWQLERVLPV
PVFAQTMAQVTVARGAALAAAQSTEFTDAQLVADSVSQPTVAPRRSRHYA
GAAAALAAAAVTFVASLSLAVGIQLAPHNDTGTAKHGAHKPTPRIAKAVA
PAVPPPPTVTPPVPARAPRPAAQHEPPARVTSGEALTEPNPPEEQPNASA
PQQDRNDSQPITRVLEHIPGAYGDSAPPAE
>Rv2016 HYPOTHETICAL PROTEIN
MTELGDKFLAALVGTIRDTRFDIADMRNWRPGWFPTMHSRCLSNLIHDRI
WAHLVTLIASNPGTSIKDKGATREIVVGAHLRLRIKRHHAGDEISTYPTR
TAIEFWQQGSQPAFPGLEEVRIAVGYRWDPDTREIGAPLLSLRDGKDHVI
WVVELDEPAAGVKITWTPIEPTLPSIDFGDLGEDSGASGER
>Rv2137c CONSERVED HYPOTHETICAL PROTEIN
MRNMKSTSHESESGKLLSISSCRPREMVLQRYSLGMTVTADRHLADKREE
FAVEDISTGIFASGYGQVGDGRSFSFHIEHRSLVVEIYRPRVAGPVPQAE
DVVAMAVRGLVDIDLTDERSLAAAVRDSVASAAPVSR
>Rv0909 CONSERVED HYPOTHETICAL PROTEIN
MGILDKVKNLLSQNADKVETVINKAGEFVDEQTQGNYSDAIHKLHDAASN
VVGMSDQQS
>Rv2647 HYPOTHETICAL PROTEIN
MHVCHTIADVVDRAKAERSENTLRKDFTPSELLAAGRRIAELERPKAKQR
QREGGDHGRQARYSGLGSMEPKPESERDAHKADTAISEALGISRGHYQRL
KRIDNATRSEAGYRDGLNGWSG
>Rv2181 Probable conserved integral membrane protein
MSAWRAPEVGSRLGRRVLWCLLWLLAGVALGYVAWRLFGHTPYRIDIDIY
QMGARAWLDGRPLYGGGVLFHTPIGLNLPFTYPPLAAVLFSPFAWLQMPA
ASVAITVLTLVLLIASTAIVLTGLDAWPTSRLVPAPARLRRLWLAVLIVA
PATIWLEPISSNFAFGQINVVLMTLVIVDCFPRRTPWPRGLMLGLGIALK
LTPAVFLLYFLLRRDGRAALTALASFAVATLLGFVLAWRDSWEYWTHTLH
HTDRIGAAALNTDQNIAGALARLTIGDDERFALWVAGSLLVLAATIWAMR
RVLRAGEPTLAVICVALFGLVVSPVSWSHHWVWMLPAVLVIGLLGWRRRN
VALAMLSLAGVVLMRWTPIDLLPQHRETTAVWWRQLAGMSYVWWALAVIV
VAGLTVTARMTPQRSLTRGLTPAPTAS
>Rv3839 CONSERVED HYPOTHETICAL PROTEIN
MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAV
AVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVE
TLDLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGA
EPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQ
IRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPF
RNGLRARR
>Rv1616 CONSERVED MEMBRANE PROTEIN
MEASGRQRRYAAAGSVVLLAGALGYIGLVDPHNSNSLYPPCLFKLLTGWN
CPACGGLRMIHDLLHGELAASINDNVFLLVGVPVLASWVLLRRRHGDLAL
PIPVMIAVAVAVIAWTVLRNLPGFPLVPTISG
>Rv0857 CONSERVED HYPOTHETICAL PROTEIN
MIANLVAVAIRASREVVIEAPPEVIVEALADMDAVPSWSSVHKRVEVVDT
YSDGRPHHVKVTIKVAGIVDTELLEYHWGPDWVVWDAAKTAQQHGQHGEY
NLRREDNDKTRVRFTLTVEPSAPLPAFWVNIARKKILHAATEGLRKQVVG
RRRFTSG
>Rv3165c HYPOTHETICAL PROTEIN
MKRLIALGIFLIVGIELLALILHDRRLVLAGSGLALALVLLNVRRMLGNR
DELTAAPDSDDLGEGLRRWLSNTETTIRWSESTRADWDRHLRPMLARRFE
IATGHRQAKDPVAFAATGRMLFGDELWEWVNPNNVTHTGDRQPGPGRAAL
EEILQKLEQV
>Rv3269 CONSERVED HYPOTHETICAL PROTEIN
MAIQVFLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAAALGLRG
TRKAEEAAESARLKVADVMAEARERIGEESPTPAISDLHDHDH
>Rv3467 CONSERVED HYPOTHETICAL PROTEIN
MSTRQAAEADLAGKAAQYRPDELARYAQRVMDWLHPDGDLTDTERARKRG
ITLSNQQYDGMSRLSGYLTPQARATFEAVLAKLAAPGATNPDDHTPVIDT
TPDAAAIDRDTRSQAQRNHDGLLAGLRALIASGKLGQHNGLPVSIVVTTT
LTDLQTGAGKGFTGGGTLLPMADVIRMTSHAHHYSPASGRYPQAIFDHGT
PLALYHTKRLASPAQRIMLFANDRGCTKPGCDAPAYHSQAHHVTAWTSTG
RTDITELTLACGPDNRLAEKGWTTHNNTHGHTEWLPPPHLDHGQPRTNTF
HHPERFLHNQDDDDKPD
>Rv1419 HYPOTHETICAL PROTEIN
MGELRLVGGVLRVLVVVGAVFDVAVLNAGAASADGPVQLKSRLGDVCLDA
PSGSWFSPLVINPCNGTDFQRWNLTDDRQVESVAFPGECVNIGNALWARL
QPCVNWISQHWTVQPDGLVKSDLDACLTVLGGPDPGTWVSTRWCDPNAPD
QQWDSVP
>Rv3899c CONSERVED HYPOTHETICAL PROTEIN
MVTGQPAAAGAHSLSEGAMTAMQSGSVPPPQATPPITTPPVVSAPTMAAG
IEATHGPVDTPANTSGAPPASTGTTGPVAPTVVTAGPVAAPAAPVVGGSA
VPAGPLPAYGSDLRPPVVAAPAVPSVPTAPVSGAPVAPSASSAPSAGGAL
VSPVERAASKAVAGQAGASSSTMAGASALSATAGATAGAVSARAAEQQRL
QRIVDAVARQEPRISWAAGLRDDGTTTLLVTDLAGGWIPPHVRLPANVTL
LEPTARRRDADVIDLLGAVVAVAAHESNTYVAEPGPDAPALTGDRSARSA
IPKVDEFGPTLVEAVRRRDSLPRIAQAIALPAVRKTGVLENEAELLHGCI
TAVKESVLKAYPSHELTAVGDWMLLAAIEALIDEQDYLANYHLAWYAVTT
RRGGSRGFAA
>Rv3831 HYPOTHETICAL PROTEIN
MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASI
ALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTI
ANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHR
HERSRATVGA
>Rv2693c PROBABLE CONSERVED INTEGRAL MEMBRANE ALANINE AND LEUCINE RICH PROTEIN
MNANRTSAQRLLAQAGGVSGLVYSSLPVVTFVVASSAAGLLPAIGFALSM
AGLILLWRLLRRESARPVVAGFCGVAVCALIAYLVGQSKGYFLLGIWMSL
LWAVVFTLSILIRRPIVGYLWSWLSGRDRAWRDVSRAVFAFDVATLGWTL
VFAARFIVQRHLYDADKTGWLGVARIGMGWPLTALAALATYAAIKAAQRA
ILASHDAAAVGGAAEFDADAGRE
>Rv0226c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MRWFRPGYALVLVLLLAAPLLRPGYLLLRDAVSTPRSYVSANALGLTSAP
RATPQDFAVALASHLVDGGVVVKALLLLGLWLAGWGAARLVATALPAAGA
AGQFVAITLAIWNPYVAERLLQGHWSLLVGYGCLPWVATAMLTMRTTVGA
GWFGLFGLAFWVALAGLTPSGLLLAATVAVVCVAMPGAGRPRWQCGVAAL
GSALVGALPWLTASALGSSLTSHTAANQLGVTAFAPRAEPGLGTLGSLAS
LGGIWNGEAVPSSRTTLFAVASAVVLLAMVAIGLPTVARRPVAVPLLTLA
AVSVMVPAVLATGPGLHALRVVVDAAPGLGVLRDGQKWVALAVPGYTLSG
AGTVLTLRRWLRPATAAVVCCLALVLTLPDLAWGVWGKVAPVHYPSGWAA
VAAAINADPRTVAVLPAGTMRRFSWSGSAPVLDPLPRWVRADVLTTGDLV
ISGVTVPGEDAHARAVQELLLTGPHPSTLAAAGVGWLVVESDSAGDMGAA
ARTLGRLAAAHRDDELALYRVGGQTSGASSARLKATMLAHWAWLSMLLVG
GAGAAGYWVRRHLHHCEDTPASRAQD
>Rv3906c CONSERVED HYPOTHETICAL PROTEIN
MEYCIAGDDGSAGIWNRPFDVDLDGDGRLDAIGLDLDGDGLRDDALADFD
GDDVADHAVFDVDNDGTPESYFIDDGSGTWAVAVDRGGQLRWYGLDGVEH
TGGPLVDFDGFGGLDDRLLDTDGDGLADRVLCAGEQRVTGYVDTDGDGRW
DVRLTDTDGDGTADGASSL
>Rv2189c CONSERVED HYPOTHETICAL PROTEIN
MRDGPAAPAQVVAPADGFVALRVADDRTVRLLSLGGAATDRLLSRIAAGI
DAAVDEVVAFWGTDWSHDIFVVAAGSDEQFHAAAGGGLASQWADIAAITV
VDRVDPARRTVVGQRIVFAPGAAHMSPAALRIVLGHELFHYAARADTALD
APRWLAEGVADFVARPKTPPPADAVSVALSLPSDTDLDTPGPQRSLAYDR
AWWFARFVAAAYGTAKLRELYLATCGVGHFDLATAAHDVLGIDAAGLLAR
WQRWLMG
>Rv2742c CONSERVED HYPOTHETICAL ARGININE RICH PROTEIN
MLVDELGVKIVHAQHVPAPYLVQRMREIHERDENRQRHAQVDVQRRRDQP
ERGQHQHRRNRDADHHPDGRTLAGQIVAHPVSHRVRQPRPVAIADVLPRV
GPRADCVVAHSLQGSPRRRERRRGQTAHQRLGRRSGNAIACPLYLENAAG
PEPDTKRAEGRRFGAFGGGDLRWMADRVPRQGSGRRGLGSRSGAGVPQGA
DARGWRHTADGVPRVGQPAIRRGVPGFWCWLDHVLTGFGGRNAICAIEDG
VEPRVAWWALCTDFDVPRSMGRRTPGG
>Rv3371 CONSERVED HYPOTHETICAL PROTEIN
MAQLTALDAGFLKSRDPERHPGLAIGAVAVVNGAAPSYDQLKTVLTERIK
SIPRCTQVLATEWIDYPGFDLTQHVRRVALPRPGDEAELFRAIALALERP
LDPDRPLWECWIIEGLNGNRWAILIKIHHCMAGAMSAAHLLARLCDDADG
SAFANNVDIKQIPPYGDARSWAETLWRMSVSIAGAVCTAAARAVSWPAVT
SPAGPVTTRRRYQAVRVPRDAVDAVCHKFGVTANDVALAAITEGFRTVLL
HRGQQPRADSLRTLEKTDGSSAMLPYLPVEYDDPVRRLRTVHNRSQQSGR
RQPDSLSDYTPLMLCAKMIHALARLPQQGIVTLATSAPRPRHQLRLMGQK
MDQVLPIPPTALQLSTGIAVLSYGDELVFGITADYDAASEMQQLVNGIEL
GVARLVALSDDSVLLFTKDRRKRSSRALPSAARRGRPSVPTARARH
>Rv3091 CONSERVED HYPOTHETICAL PROTEIN
MPIPFADGMLSRLGRRGAALDLIEEFEDESGEPPASLSPADLLAAEPALL
LQKMENRLVRHHLANPDVLSGEQLRKLRYILNFARLADFEPGAAGPGGSR
GRGDISVGGQVAPWRSRVVDALYAPLREEPDPVTALEGAKDVLATLVDDQ
DDQRRVLIERHGSDFSATELDAEVGYKKLVTVLGGGGGAGFVYIGGMQRL
LAAGQVPDYMIGSSFGSIIGSLVARELPVPIDEYAEWAKTVSYRAILGPE
RRRSRHGLAGMFTLRFDQFAHTLLSRADGERMRMSDLAIPFDVVVAGVRR
QPYAALPSRFRHRERSTLTLRSLPFLPIGIGPWVAARMWQVAAFIDLRVV
KPIVISADGATRDVNVVDAASFSSAIPGVLHHETSDPRMLPILDELCADQ
DVAAMVDGGAASNVPVELAWERVRDGRLGTRNACYLAFDCFHPHWDPRHL
WLVPITQAVQLQMVRNLPYADHLVRFEPTLSPVNLAPSAAAIDRACRWGR
DSVEPAIAVTSALLEPTWWEGDRPPAAEPKERTKSAASSMSAVMAAIQAP
TGRFRRWRSRHLT
>Rv3033 HYPOTHETICAL PROTEIN
MAHSIVRTLLASGAATALIAIPTACSFSIGTSHSHSVSKAEVARQITAKM
TDAAGNKPESVTCPSDLPAEVGAELNCEMKIKDRTFNVNVTVTSVDGSDV
KFDMVETVDKNQVANIISDKLFQRVGARPDSVTCPDNLKGVEGAKLRCRL
TDGSKTYGISVIVTSVDAGDVNFDFKVDDHPE
>Rv1976c CONSERVED HYPOTHETICAL PROTEIN
MRWIVDGMNVIGSRPDGWWRDRHRAMVMLVERLEGWAITKARGDDVTVVF
ERPPSTAIPSSVVEVAHAPKAAANSADDEIVRLVRSGAQPQEIRVVTSDK
ALTDRVRDLGAAVYPAERFRDLIDPRGSNAARRTQ
>Rv0393 CONSERVED 13E12 REPEAT FAMILY PROTEIN
MAVGRCAIPRFDQAASGSAINGGQVHLSDGSTSPARQLPAPWPGDAGAAA
EGRAGVCCRGNRLPHVSDVGVSHRFDHRPAGVGAGGCRAGAAGAGLAVDD
PGQLAAAIDRIVAVADPDAVRQVRERARDREVSIWNSADGMGEVYAQLYA
TDAQALDARLNALVATVCAGDPRSTDQRRADALGALAAGADRLACRCDNP
DCAAEGRPVSAVVIHVVAEQASVKGHGQAPAALLGGDGLIPAELVAELAK
TAGLQPIPVPAGTEPGYRPSVKLAAFVRARDLTCRAPGCDRPATQCDLDH
TIAFADGGATHAANLKCLCRLHHLLATFCGWRAQQLPDGTVIWTLPGNQT
YVTTPGSALLFPALCTPTGDPPAPEPARADRRGQRTAMMPRRASTRTQNR
AHCIAAERHRNHQARRIAQAAVIATETHGPPPDPDDDPPPF
>Rv3479 POSSIBLE TRANSMEMBRANE PROTEIN
MAGVTREINLLAQASQWRRLGGTFPTNSQLTNESAASLRLYAQLIDLLDM
VVDVDILSGTSAGGINAALLASSRVTGSDLGGIRDLWLDLGALTELLRDP
RDKKTPSLLYGDERIFAALAKRLPKLATGPFPPTTFPEAARTPSTTLYIT
TTLLAGETSRFTDSFGTLVQDVDLRGLFTFTETDLARPDTAPALALAARS
SASFPLAFEPSFLPFTKGTAKKGEVPARPAMAPFTSLTRPHWVSDGGLLD
NRPIGVLFKRIFDRPARRPVRRVLLFVVPSSGPAPDPMHEPPPDNVDEPL
GLIDGLLKGLAAVTTQSIAADLRAIRAHQDCMEARTDAKLRLAELAATLR
NGTRLLTPSLLTDYRTREATKQAQTLTSALLRRLSTCPPESGPATESLPK
SWSAELTVGGDADKVCRQQITATILLSWSQPTAQPLPQSPAELARFGQPA
YDLAKGCALTVIRAAFQLARSDADIAALAEVTEAIHRAWRPTASSDLSVL
VRTMCSRPAIRQGSLENAADQLAADYLQQSTVPGDAWERLGAALVNAYPT
LTQLAASASADSGAPTDSLLARDHVAAGQLETYLSYLGTYPGRADDSRDA
PTMAWKLFDLATTQRAMLPADAEIEQGLELVQVSADTRSLLAPDWQTAQQ
KLTGMRLHHFGAFYKRSWRANDWMWGRLDGAGWLVHVLLDPRRVRWIVGE
RADTNGPQSGAQWFLGKLKELGAPDFPSPGYPLPAVGGGPAQHLTEDMLL
DELGFLDDPAKPLPASIPWTALWLSQAWQQRVLEEELDGLANTVLDPQPG
KLPDWSPTSSRTWATKVLAAHPGDAKYALLNENPIAGETFASDKGSPLMA
HTVAKAAATAAGAAGSVRQLPSVLKPPLITLRTLTLSGYRVVSLTKGIAR
STIIAGALLLVLGVAAAIQSVTVFGVTGLIAAGTGGLLVVLGTWQVSGRL
LFALLSFSVVGAVLALATPVVREWLFGTQQQPGWVGTHAYWLGAQWWHPL
VVVGLIALVAIMIAAATPGRR
>Rv3142c HYPOTHETICAL PROTEIN
MTEQEMTEQWLEGCAVQRIMFRDGLVLNFDDYNELVISVPLQLTLPAIET
SPAEVVAIDPNDPADHERPLFDFAGATCTAFVWYDTGDLHLEFSDGHQID
VHPDDRVTAWELYGKYHGYAACLAPGKLRVVRHDVADANGDQ
>Rv3760 POSSIBLE CONSERVED MEMBRANE PROTEIN
MPGSVPGKAPEEPPVKFTRAAAVWSALIVGFLILILLLIFIAQNTASAQF
AFFGWRWSLPLGVAILLAAVGGGLITVFAGTARILQLRRAAKKTHAAALR
>Rv2929 HYPOTHETICAL PROTEIN
MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGLERMASD
THGGGGGRPVTPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTS
VLT
>Rv3861 HYPOTHETICAL PROTEIN
MTWLADPVGNSRIARAQACKTSISAPIVESWRAQRGAQCGQREKSCRCSR
AVHIQGISPPLFRRPLEPAVQAAVASCRLGRHPVVAHRVTVALGQGSQLA
QRECPRPA
>Rv0744c POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN
METLLKTSEAAQILGVSRQHVVNMCDRGEMVCVHVGSHRRVPSSEVERVT
SRRLTREEERSLWLHRALLSPLLTEPDTVVSAARENLRRWSGMHRRDGMA
GWYFTKWQRVLNDGLDAVMHVLTSPSEDAREMRQNSPFAGILPEATRVAV
LRSFKDHWDREHERAMTE
>Rv0955 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MNRVSASADDRAAGARPARDLVRVAFGPGVVALGIIAAVTLLQLLIANSD
MTGAWGAIASMWLGVHLVPISIGGRALGVMPLLPVLLMVWATARSTARAT
SPQSSGLVVRWVVASALGGPLLMAAIALAVIHDASSVVTELQTPSALRAF
TSVLVVHSVGAATGVWSRVGRRALAATALPDWLHDSMRAAAAGVLALLGL
SGVVTAGSLVVHWATMQELYGITDSIFGQFSLTVLSVLYAPNVIVGTSAI
AVGSSAHIGFATFSSFAVLGGDIPALPILAAAPTPPLGPAWVALLIVGAS
SGVAVGQQCARRALPFVAAMAKLLVAAVAGALVMAVLGYGGGGRLGNFGD
VGVDEGALVLGVLFWFTFVGWVTVVIAGGISRRPKRLRPAPPVELDADES
SPPVDMFDGAASEQPPASVAEDVPPSHDDIANGLKAPTADDEALPLSDEP
PPRAD
>Rv1128c CONSERVED HYPOTHETICAL PROTEIN
MCSTREEITEAFASLATALSRVLGLTFDALTTPERLALLEHCETARRQLP
SVEHTLINQIGEQSTEEELGGKLGLTLADRLRITRSEAKRRVAEAADLGQ
RRALTGEPLPPLLTATAKAQRHGLIGDGHVEVIRAFVHRLPSWVDLKTLE
KAERDLAKQATQYRPDQLAKLAARIMDCLNPDGDYTDEDRARRRGLTLGK
QDVDGMSRLSGYVTPELRATIEAVWAKLAAPGMCNPEQKAPCVNGAPSKE
QARRDTRSCPQRNHDALNAELRSLLTSGNLGQHNGLPASIIVTTTLKDLE
AAAGAGLTGGGTILPISDVIRLARHANHYLAIFDRGKALALYHTKRLASP
AQRIMLYAKDSGCSAPGCDVPGYYCEVHHVTPYAQCRNTDVNDLTLGCGG
HHPLAERGWTTRKNAHGDTEWLPPPHLDHGQPRVNTFHHPEKLLADDEGD
P
>Rv0121c CONSERVED HYPOTHETICAL PROTEIN
MGEFDPKLRFAQSPVARLATSTPDGTPHLVPVVFALGARRPAEATGADVI
YTAVDAKRKTTQRLRRLANLEHNPRASVLVDSYADDWTQLWWVRADGVAA
IHRDGEVMRAAYRLLRAKYAQYQSVPLNGPVIAIAVQRWASWHA
>Rv0184 CONSERVED HYPOTHETICAL PROTEIN
MTNDKMLARIAALLRQAEGTDNPHEADAFMSTAQRLATAASIDLAVARSH
AGNRSPAQAPTQRTITIGAAGTRGLRTYVQLFVLIAAANDVRCDVASNST
FVYAYGFAEDIDTSHALYASLVVQMVRASDAYLASGAHRPTPTITARLNF
QLAFGARVGQRLADAREQTRQEATKDRDRPPGTAIALRDKDIELHEYYRR
SSKARGAWRASRATAGYSSAARRAGDRAGRQARLGNNPELPGARAALGR
>Rv0807 CONSERVED HYPOTHETICAL PROTEIN
MSARDRVDPAKTRQVVLALADWLRDETLPAPDTDVLAAAVRLTARTLAAL
APGASVEVRIPPFAAVQCISGPRHTRGTPPNVVQTDPRTWLLVATGLSGV
AQARGSGALQLSGSRAGEIEAWLPLVDLG
>Rv0123 HYPOTHETICAL PROTEIN
MTKKPRNPADYVIGDDVEVSDVDLKQEEVYVDGERLTDERVEQMASESLR
LAREREANLIPGGKSLSGGSAHSPAVQVVVSKATHAKLKELARSRKMSVS
KLLRPVLDEFVQRETGRILPRR
>Rv3196A HYPOTHETICAL PROTEIN
MQEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLAR
AYASISTNVPEQGRLG
>Rv2722 CONSERVED HYPOTHETICAL PROTEIN
MPCLARQPVDLPPWAGPRCGPYCPRARITLLQRTTIAKSNRKYYENGYPA
DVKLMPGHAAVVSNRAAARAGFALPCRKRQPD
>Rv2144c Probable transmembrane protein
MLIIALVLALIGLLALVFAVVTSNQLVAWVCIGASVLGVALLIVDALRER
QQGGADEADGAGETGVAEEADVDYPEEAPEESQAVDAGVIGSEEPSEEAS
EATEESAVSADRSDDSAK
>Rv1836c CONSERVED HYPOTHETICAL PROTEIN
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSE
GHYSAVGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPG
DWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTA
AARCVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAG
SDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLV
ISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTA
AMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAA
VADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKP
PSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPND
EGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPV
NGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVA
QLSGGSYQNLETSASPDLATAVNIFLS
>Rv1111c CONSERVED HYPOTHETICAL PROTEIN
MSAQRARSAVQASHRSIHPHIPGVPWWAAILIAVTATAIGYAIDAGSGHK
ALTLVFTGCYIAGCVGAVLAVRQSDLFTALVQPPLILFCAVPGAYWLFHG
GTIGKFKDLLINCGYSLIERFPLMLGTAAGVLLIGLVRWYLGTALFDSIA
RKLSSLMTGDSDDDGGRRSAQRPARTRSRHARPPSEDNREPIAERRSRRR
PRPQNDPHPRRNAHERPAPRSSRFDSYRSYQPSEPSGPAEPVNRYERRGA
RYQPYARYEPTYEPQRRRARPSEPTNPTHHPISQVRYRGSATRDARRDNY
REEQRFDRRDRSRAPRRPPAESWEYDV
>Rv2558 CONSERVED HYPOTHETICAL PROTEIN
MPGSAGWRKVFGGTGGATGALPRHGRGSIVYARSTTIEAQPLSVDIGIAH
VRDVVMPALQEIDGCVGVSLLVDRQSGRCIATSAWETLEAMRASVERVAP
IRDRAALMFAGSARVEEWDIALLHRDHPSHEGACVRATWLKVVPDQLGRS
LEFYRTSVLPELESLDGFCSASLMVDHPACRRAVSCSTFDSMDAMARNRD
RASELRSRRVRELGAEVLDVAEFELAIAHLRVPELV
>Rv2673 POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MYGALVTAADSIRTGLGASLLAGFRPRTGAPSTATILRSALWPAAVLSVL
HRSIVLTTNGNITDDFKPVYRAVLNFRRGWDIYNEHFDYVDPHYLYPPGG
TLLMAPFGYLPFAPSRYLFISINTAAILVAAYLLLRMFNFTLTSVAAPAL
ILAMFATETVTNTLVFTNINGCILLLEVLFLRWLLDGRASRQWCGGLAIG
LTLVLKPLLGPLLLLPLLNRQWRALVAAVVVPVVVNVAALPLVSDPMSFF
TRTLPYILGTRDYFNSSILGNGVYFGLPTWLILFLRILFTAITFGALWLL
YRYYRTGDPLFWFTTSSGVLLLWSWLVMSLAQGYYSMMLFPFLMTVVLPN
SVIRNWPAWLGVYGFMTLDRWLLFNWMRWGRALEYLKITYGWSLLLIVTF
TVLYFRYLDAKADNRLDGGIDPAWLTPEREGQR
>Rv3446c HYPOTHETICAL ALANINE AND VALINE RICH PROTEIN
MSPHRAVIEAGPGAIRRLCCGADVVADTAVSAAALAAIDDQVALLDERPV
AVDSLWFDALRSVAVDHRDGPVVVHPSWWSAARVEVVTAAARTLTRDVVV
HPRSWLLRQASSGVSAATVVVEIAERLVLVAGAEVAAVARRTDAESVAGQ
VGSVIARMTRGITAVVLIDVPSTVAGAAALAAAIAGAVRGTGSSVVEIDG
VRLARLARAALPPSDEPADPAARPATRSRVPTLARVAAAGVALALLAPAA
VVRHGATTLQRPPTTLLVEGRVALTIPADWSTQRVVSGPGSARVQVTSPA
DPEVALHVTQSPVPGETLPGTAQRLKRAIDASPAGVFVDFNPSDIRAGRP
AVTYREVRAGHQVRWTILLDGAVRISVGCQSGPGHEDLLREVCAQAVRSV
HAVG
>Rv3869 POSSIBLE CONSERVED MEMBRANE PROTEIN
MGLRLTTKVQVSGWRFLLRRLEHAIVRRDTRMFDDPLQFYSRSIALGIVV
AVLILAGAALLAYFKPQGKLGGTSLFTDRATNQLYVLLSGQLHPVYNLTS
ARLVLGNPANPATVKSSELSKLPMGQTVGIPGAPYATPVSAGSTSIWTLC
DTVARADSTSPVVQTAVIAMPLEIDASIDPLQSHEAVLVSYQGETWIVTT
KGRHAIDLTDRALTSSMGIPVTARPTPISEGMFNALPDMGPWQLPPIPAA
GAPNSLGLPDDLVIGSVFQIHTDKGPQYYVVLPDGIAQVNATTAAALRAT
QAHGLVAPPAMVPSLVVRIAERVYPSPLPDEPLKIVSRPQDPALCWSWQR
SAGDQSPQSTVLSGRHLPISPSAMNMGIKQIHGTATVYLDGGKFVALQSP
DPRYTESMYYIDPQGVRYGVPNAETAKSLGLSSPQNAPWEIVRLLVDGPV
LSKDAALLEHDTLPADPSPRKVPAGASGAP
>Rv2576c POSSIBLE CONSERVED MEMBRANE PROTEIN
MPAGVGNASGSVLDMTSVRTVPSAVALVTFAGAALSGVIPAIARADPVGH
QVTYTVTTTSDLMANIRYMSADPPSMAAFNADSSKYMITLHTPIAGGQPL
VYTATLANPSQWAIVTASGGLRVNPEFHCEIVVDGQVVVSQDGGSGVQCS
TRPW
>Rv3865 CONSERVED HYPOTHETICAL PROTEIN
MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSK
FNDTLQEFETTRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDK
IFG
>Rv3395c CONSERVED HYPOTHETICAL PROTEIN
MTAAFASDQRLENGAEQLESLRRQMALLSEKVSGGPSRSGDLVPAGPVSL
PPGTVGVLSGARSLLLSMVASVTAAGGNAAIVGQPDIGLLAAVEMGADLS
RLAVIPDPGTDPVEVAAVLIDGMDLVVLGLGGRRVTRARARAVVARARQK
GCTLLVTDGDWQGVSTRLAARVCGYEITPALRGVPTPGLGRISGVRLQIN
GRGR
>Rv3482c PROBABLE CONSERVED MEMBRANE PROTEIN
MEHDVATSPPAGWYTDPDGSAGQRYWDGDRWTRHRRPNPSAPRSPLALRV
DGLRSRWLGMPAGLRLTVPVAAVLTMVGVAVYAWIRPLPDDWSQLPKRLS
CQLRPGPTPPATITVASVDVSHPRGAVLRLVVRFAEPLPPSPSGSFASGF
AGYLLTYTIANNGKEFAELGPQQDTDELAIRKPGESRGTEPNMRPDRNTN
ARRTAPDTVEINLETKRLGLDQAPVDPQLTFAAQFRTPSTVTVDFGSQFC
QGERLAGQRR
>Rv3353c CONSERVED HYPOTHETICAL PROTEIN
MSRQTFLRGAVGAPATSAVFPTILARATPGDGWASLASSIGGQVLLPANG
RAFTSGKQIFNSNYSGLNPAAVVTVASQADVRKAVS
>Rv1760 CONSERVED HYPOTHETICAL PROTEIN
MPRGCAGARFACNACLNFLAGLGISEPISPGWAAMERLSGLDAFFLYMET
PSQPLNVCCVLELDTSTMPGGYTYGRFHAALEKYVKAAPEFRMKLADTEL
NLDHPVWVDDDNFQIRHHLRRVAMPAPGGRRELAEICGYIAGLPLDRDRP
LWEMWVIEGGARSDTVAVMLKVHHAVVDGVAGANLLSHLCSLQPDAPAPQ
PVRGTGGGNVLQIAASGLEGFASRPVRLATVVPATVLTLVRTLLRAREGR
TMAAPFSAPPTPFNGPLGRLRNIAYTQLDMRDVKRVKDRFGVTINDVVVA
LCAGALRRFLLEHGVLPEAPLVATVPVSVHDKSDRPGRNQATWMFCRVPS
QISDPAQRIRTIAAGNTVAKDHAAAIGPTLLHDWIQFGGSTMFGAAMRIL
PHISITHSPAYNLILSNVPGPQAQLYFLGCRMDSMFPLGPLLGNAGLNIT
VMSLNGELGVGIVSCPDLLPDLWGVADGFPEALKELLECSDDQPEGSNHQ
DS
>Rv2484c CONSERVED HYPOTHETICAL PROTEIN
MAESGESPRLSDELGPVDYLMHRGEANPRTRSGIMALELLDGTPDWDRFR
TRFENASRRVLRLRQKVVVPTLPTAAPRWVVDPDFNLDFHVRRVRVSGPA
TLREVLDLAEVILQSPLDISRPLWTATLVEGMADGRAAMLLHVSHAVTDG
VGGVEMFAQIYDLERDPPPRSTPPQPIPEDLSPNDLMRRGINHLPIAVVG
GVLDALSGAVSMAGRAVLEPVSTVSGILGYARSGIRVLNRAAEPSPLLRR
RSLTTRTEAIDIRLADLHKAAKAGGGSINDAYLAGLCGALRRYHEALGVP
ISTLPMAVPVNLRAEGDAAGGNQFTGVNLAAPVGTIDPVARMKKIRAQMT
QRRDEPAMNIIGSIAPVLSVLPTAVLEGITGSVIGSDVQASNVPVYPGDT
YLAGAKILRQYGIGPLPGVAMMVVLISRGGWCTVTVRYDRASVRNDELFA
QCLQAGFDEILALAGGPAPRVLPASFDTQGAGSVPRSVSGS
>Rv2557 CONSERVED HYPOTHETICAL PROTEIN
MTGGATGALPRTMKEGWIVYARSTTIQAQSECIDTGIAHVRDVVMPALQG
MDGCIGVSLLVDRQSGRCIATSAWETAEAMHASREQVTPIRDRCAEMFGG
TPAVEEWEIAAMHRDHRSAEGACVRATWVKVPADQVDQGIEYYKSSVLPQ
IEGLDGFCSASLLVDRTSGRAVSSATFDSFDAMERNRDQSNALKATSLRE
AGGEELDECEFELALAHLRVPELV
>Rv3415c CONSERVED HYPOTHETICAL PROTEIN
MNETPHAPVVEQVLVAAAFGNQPGSWPLPTAITPHHLWLRAVAAGGQGRY
AHAYGDLSVLRRLVPAGPLASLAHSTQGSLLRQLGWHTLARGWDGRALAL
AGADREAGADALIGLAADALGVGRFAAAGALLDRADPLVVSPLVADRLAV
RRRWVAAELAMATGDGATAVRHAEEAVELTQAMAVASARHRVKSDVVLAA
ALCSAGAVARARAVGEEALDATARFGLLPLRWALACLLIDIGTVTFSAQQ
LRELTKIRNICAGQVRRAGGCWRTA
>Rv2617c PROBABLE TRANSMEMBRANE PROTEIN
MSIRPTTSPALADQLKDPAYSAYVLLRTLFTVAPILFGLDKFFNLLTHPQ
HWNMYLAGWINDLVPGTADQCMYLVGAIEIVAGVLVAVAPRIGAWVVAAW
LAGIILNLVTGPGFYDIALRDFGLLVGAIALARLAQGVHSGGIGRP
>Rv1356c HYPOTHETICAL PROTEIN
MLIAGYLTDWRIMTTAQLRPIAPQKLHFSENLSVWVSDAQCRLVVSQPAL
DPTLWNTYLQGALRAYSKHGVECTLDLDAISDGSDTQLFFAAIDIGGDVV
GGARVIGPLRSADDSHAVVEWAGNPGLSAVRKMINDRAPFGVVEVKSGWV
NSDAQRSDAIAAALARALPLSMSLLGVQFVMGTAAAHALDRWRSSGGVIA
ARIPAAAYPDERYRTKMIWWDRRTLANHAEPKQLSRMLVESRKLLRDVEA
LSATTAATAGAEQ
>Rv0523c CONSERVED HYPOTHETICAL PROTEIN
MQLPQWLARFNRYVTNPIQRLWAGWLPAFAILEHVGRRSGKPYRTPLNVF
SADVDGRAGVAILLTYGPNRDWLKNITAAGGGRMRRYGKTFGVANPRRLT
KAEAAPYVSSRWRPVFARLPFDEAVLLTKAD
>Rv1148c CONSERVED HYPOTHETICAL PROTEIN
MSETFCLTDHSEPMTARFLSVVLRRIRGMRSDTREEISAALDAYHASLSR
VLDLKCDALTTPELLACLQRLEVERRRQGAAEHALINQLAGQACEEELGG
TLRTALANRLHITPGEASRRIAEAEDLGERRALTGEPLPAQLTATAAAQR
EGKIGREHIKEIQAFFKELSAAVDLGIREAAEAQLAELATSRRPDHLHGL
ATQLMDWLHPDGNFSDQERARKRGITMGKQEFDGMSRISGLLTPELRATI
EAVLAKLAAPGACNPDDQTPLVDDTPDADAVRRDTRSQAQRNHDAFLAAL
RGLLASGELGQHKGLPVTIVVSTTLKELEAATGKGVTGGGSRVPMSDLIR
MASHANHYLALFDGAKPLALYHTKRLASPAQRIMLYAKDRGCSRPGCDAP
AYHSEVHHVTPWTTTHRTDINDLTLACGPDNRLVEKGWKTRKNAHGDTEW
LPPPHLDHGQPRINRYHHPAKILCEQDDDEPH
>Rv0662c CONSERVED HYPOTHETICAL PROTEIN
MFLPNTRAYRRYNRSVWAVRGSTRPQWQPPPKFQHAKCMSMRLAHRLQIL
LDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRLLDAA
DMSVPEPRELKQELEALRARRG
>Rv2112c CONSERVED HYPOTHETICAL PROTEIN
MFWVGGPCLMPASSAARCAARIVGGRCLMPASSAARCAARIVGGPRLYGM
QRIIGTEVEYGISSPSDPTANPILTSTQAVLAYAAAAGIQRAKRTRWDYE
VESPLRDARGFDLSRSAGPPPVVDADEVGAANMILTNGARLYVDHAHPEY
SAPECTDPLDAVIWDKAGERVMEAAARHVASVPGAAKLQLYKNNVDGKGA
SYGSHENYLMSRQTPFSAIITGLTPFLVSRQVVTGSGRVGIGPSGDEPGF
QLSQRSDYIEVEVGLETTLKRGIINTRDEPHADADRYRRLHVIIGDANLA
ETSTYLKLGTTALVLDLIEEGPAHAIDLTDLALARPVHAVHAISRDPSLR
ATVALADGRELTGLALQRIYLDRVAKLVDSRDPDPRAADIVETWAHVLDQ
LERDPMDCAELLDWPAKLRLLDGFRQRENLSWSAPRLHLVDLQYSDVRLD
KGLYNRLVARGSMKRLVTEHQVLSAVENPPTDTRAYFRGECLRRFGADIA
AASWDSVIFDLGGDSLVRIPTLEPLRGSKAHVGALLDSVDSAVELVEQLT
AEPR
>Rv1083 CONSERVED HYPOTHETICAL PROTEIN
MNQILLSVIAEGGPGNTGPDFGKASPVGLLVIVLLVIATLFLVRSMNQQL
KKVPKSFDRDHPELDQAADEGTDRDGPARPPGPPHESG
>Rv2305 HYPOTHETICAL PROTEIN
MTQTLRLTALDEMFITDDIDIVPSVQIEARVSGRFDLDRLAAALRAAVAK
HALARARLGRASLTARTLYWEVPDRADHLAVEITDEPVGEVRSRFYARAP
ELHRSPVFAVAVVRETVGDRLLLNFHHAAFDGMGGLRLLLSLARAYAGEP
DEVGGPPIEEARNLKGVAGSRDLFDVLIRARGLAKPAIDRKRTTRVAPDG
GSPDGPRFVFAPLTIESDEMATAVARRPEGATVNDLAMAALALTILQWNR
THDVPAADSVSVNMPVNFRPTAWSTEVISNFASYLAIVLRVDEVTDLEKA
TAIVAGITGPLKQSGAAGWVVDLLEGGKVLPAMLKRQLQLLLPLVEDRFV
ESVCLSNLGRVDVPAFGGEAGDTTEVWFSPTAAMSVMPIGVGLVGFGGTL
RAMFRGDGRTIGGEALGRFAALYRDTLLT
>Rv2645 HYPOTHETICAL PROTEIN
MTTTPRQPLFCAHADTNGDPGRCACGQQLADVGPATPPPPWCEPGTEPIW
EQLTERYGGVTICQWTRYFPAGDPVAADVWIAADDRVVDGRVLRTQPAIH
YTEPPVLGIGPAAARRLAAELLNAADTLDDGRRQLDDLGEHRR
>Rv0976c CONSERVED HYPOTHETICAL PROTEIN
MRIGNCSGFYGDRLSAMREMLTGGELDYLTGDYLAELTMLILGRDRMKNP
DRGYAKTFLAQLEDCLGLAHDRGVRIVTNAGGLNPAGLANAVRALAARLG
IPAQVAHVEGDDLQPRAAELGLGTPLTANAYLGAWGIVDCFERGADVVVT
GRVTDASVVVGAAAAHFGWGRTDYHRLAGAVVAGHVIECGVQATGGNYAF
FTEIGDLTHAGFPLAEIAADGSSVITKHHGTGGLVSVDTITAQLLYEITG
ARYANPDVTARMDSVELSPDGPDRVRISGVIGEPPPPTYKVSLNSIGGFR
NAMTFVLTGLDIDAKADLVRRQLEAALTVKPAELQWTLARTDHPDADTEE
TASALLTCVARDPDPANVGRQFSSAAVELALASYPGFTATAPPGDGQVYG
VFTPGYVDAGKVAHIAVHADGTRTEIPCATETLELAPAHPPALPDPLPAG
PTRRVPLGLIAGARSGDKGGSANVGVWVRTDEQWRWLAHTLTVELLKELL
PETAGLVVTRHVLPNLRALNFVIEAILGQGVAYQARFDPQAKGLGEWLRS
RHVEIPETLL
>Rv2075c Possible hypothetical exported or envelope protein
MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVA
IPCVALGKFADAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHR
TARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQL
DIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVL
PQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSL
IYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW
SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTR
PPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRA
GAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAGAALAC
TAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP
>Rv3611 HYPOTHETICAL ARGININE AND PROLINE RICH PROTEIN
MAIANPAEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAAG
RHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAAGRHHQPRGDRKPRA
WRQCGPQNGPRRSQAITPEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRS
QAITPEPGAAGRHHQPRGDRKPRAWRQCGPQNGPRRSQAITPEPGAAGRH
WLDQRPVVPDGVGKSDS
>Rv1455 CONSERVED HYPOTHETICAL PROTEIN
MKLARPDVFHPRVVLAGWPQQPAGDGDDAGLVAALRHRGLHAGWLSWDDP
EIVHADLVILRATRDYPARLDEFLAWTTRVANLLNSRPVVAWNVERRYLR
DLMDRGVPTVPGEVYVPGEPVRLPRKGQVFVGPTIGTGTRRCSARFAAEF
VAQLHAAGQAVLVQPGGSGDETVLVFLGGEPSHAFTKQADTWRQTEPDFE
IWDVGAAAVAGAAAQVGVDPGELLYARAHITGGSRDPRLLELQLVDPSLG
WQWLDPDIRNLAQRDFALCVQSALERLGLGPFSHRRP
>Rv1813c CONSERVED HYPOTHETICAL PROTEIN
MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGL
PIPPIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRF
TRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN
>Rv2712c HYPOTHETICAL PROTEIN
MTKYRGQFELNRPATLIAALPAILGFVPEKSLVLVSLAAGELGSVMRADL
CDELADRVGHLAELVAAANPAAAIAVIVDANGAQCPRCNEEYRQLCAALA
AALSQRDIVLWAAHVVDRVAAGGRWHCVDGCGCSGVIDDPSASPLAMAAV
LDGRQLYPRRSDLQAVIAVDDPVRSAELAVALGHQAADREIAHRADSVGC
SRQDVENALAAAARVADGQSLSDTELARLGCALGDARVRDMLYALAVGEN
AGAAESLWALLARVLPEPWRVEALVLLAFSAYARGDGPLAGVSLQAALCC
EPGHRMAGMLDTALQSGLRPEHIRDIAVTGYQRAEQLGIRLPPRRAFGQR
AG
>Rv2806 POSSIBLE MEMBRANE PROTEIN
MKTNPRYGPAFYSVMTVLFLALFVLNVCTHGSTLGLISTGGLAVLMGYIG
YRGWSGKRHINRQ
>Rv2658c POSSIBLE PROPHAGE PROTEIN
MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGR
HIEWKLECRACRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIA
EARHVIPFSALCLRLSQLGG
>Rv0999 HYPOTHETICAL PROTEIN
MRPPLAPQFAADLLVKTVSTLRSSGAALGRLTTMRKAVLAVGSVCWLVGC
SSGASSTTASTGDIAKVAEVKSGFGPEYTVTDVTPRAIDPGFFSARKLPD
GLSFDPANCAQVAAGPQLPTGLQGNMAAVSAEGNGNRFVVIAVETSQPLP
APSPGKDCSKVTFSGTQLRGGIEVVDVPHIDGTQTLGVHRVLQAVVGGSA
RTGELYDYSARFGDYQVIVIANPLVIPGRPVARVDTQRARDLLVQAVAAV
RG
>Rv3660c CONSERVED HYPOTHETICAL PROTEIN
MLTDPGLRDELDRVAAAVGVRVVHLGGRHPVSRKTWSAAAAVVLDHAAAD
RCGRLALPRRTHVSVLTGTEAATATWAAAITVGAQHVLRMPEQEGELVRE
LAEAAESARDDGICGAVVAVIGGRGGAGASLFAVALAQAAADALLVDLDP
WAGGIDLLVGGETAPGLRWPDLALQGGRLNWSAVRAALPRPRGISVLSGT
RRGYELDAGPVDAVIDAGRRGGVTVVCDLPRRLTDATQAALDAADLVVLV
SPCDVRACAAAATMAPVLTAINPNLGLVVRGPSPGGLRAAEVADVAGVPL
LASMRAQPRLAEQLEHGGLRLRRRSVLASAARRVLGVLPRAGSGRHGRAA
>Rv0669c POSSIBLE HYDROLASE
MLSVGRGIADITGEAADCGMLGYGKSDQRTAGIHQRLRSRAFVFRDDSQD
GDARLLLIVAELPLPMQNVNEEVLRRLADLYGDTYSEQNTLITATHTHAG
PGGYCGYLLYNLTTSGFRPATFAAIVDGIVESVEHAHADVAPAEVSLSHG
ELYGASINRSPSAFDRNPPADKAFFPKRVDPHTTLVRIDRGEATVGVIHF
FATHGTSMTNRNHLISGDNKGFAAYHWERTVGGADYLAGQPDFIAAFAQT
NPGDMSPNVDGPLSPEAPPDREFDNTRRTGLCQFEDAFTQLSGATPIGAG
IDARFTYVDLGSVLVRGEYTPDGEERRTGRPMFGAGAMAGTDEGPGFHGF
RQGRNPFWDRLSRAMYRLARPTAAAQAPKGIVMPARLPNRIHPFVQEIVP
VQLVRIGRLYLIGIPGEPTIVAGLRLRRMVASIVGADLADVLCVGYTNAY
IHYVTTPEEYLEQRYEGGSTLFGRWELCALMQTVAELAEAMRDGRPVTLG
RRPRPTRELSWVRGAPADAGSFGAVIAEPSATYRPGQAVEAVFVSALPNN
DLRRGGTYLEVVRREGASWVRIADDGDWATSFRWQRQGRAGSHVSIRWDV
PGDTTPGQYRIVHHGTARDRNGMLTAFSATTREFTVV
>Rv0476 POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MLVLLVAVLVTAVYAFVHAALQRPDAYTAADKLTKPVWLVILGAAVALAS
ILYPVLGVLGMAMSACASGVYLVDVRPKLLEIQGKSR
>Rv1109c CONSERVED HYPOTHETICAL PROTEIN
MATAPYGVRLLVGAATVAVEETMKLPRTILMYPMTLASQAAHVVMRFQQG
LAELVIKGDNTLETLFPPKDEKPEWATFDEDLPDALEGTSIPLLGLSDAS
EAKNDDRRSDGRFALYSVSDTPETTTASRSADRSTNPKTAKHPKSAAKPT
VPTPAVAAELDYPALTLAQLRARLHTLDVPELEALLAYEQATKARAPFQT
LLANRITRATAK
>Rv1222 CONSERVED HYPOTHETICAL PROTEIN
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAF
VDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLL
GLLSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFGDRFADGDGGNRGRQS
RVRR
>Rv2596 CONSERVED HYPOTHETICAL PROTEIN
MIAPDTSVLVAGFATWHEGHEAAVRALNRGVHLIAHAAVETYSVLTRLPP
PHRIAPVAVHAYLADITSSNYLALDACSYRGLTDHLAEHDVTGGATYDAL
VGFTAKAAGAKLLTRDLRAVETYERLRVEVELVT
>Rv0996 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MPSIPQSLLWISLVVLWLFVLVPMLISKRDAVRRTSDVALATRVLNGGAG
ARLLKRGGPAAGHRWGYLPPEGQGDDPDWKPEEDWRDDPVEDGFADVEHD
IDEDQEADDARRRGAVVMKVAAPQTAGADEPDYLDVDVVEEDSEALPVGA
GAAVGESADEADAEAADGVAGHADPEADPVEYEYEYEYVEDTCGLELEED
DQEAPPTVASGTSRRRRFDTKTAAAVSARKYTFRKRALIVMAVILVGSAA
AAFELTPVAWWICGSATGVTVLYLAYLRRQTRIEEKVRRRRMQRIARARL
GVENTRDREYDVVPSRLRRPGAVVLEIDDEDPIFTHLESAAPIRNYGWPR
DLPRAVGQ
>Rv1517 CONSERVED HYPOTHETICAL TRANSMEMBRANE PROTEIN
MWTMVLLLGLGMAIDPARLGLAVVMLSRRRPMLNLFAFWVGGMVAGVGIA
LAVLVFMRDVALAAIQGVVSAANEFREAVGILAGGRLHIVIGVIMLLLAA
RMVARARAQVGVPVGPVGVADGGMSALALAQRPPGLVARLEVRTQQMLQG
DVVWPAFVVGVASSAPPFESVVALTVIMASGAEIGTQLGAFVVFTLLVLA
VIEIPLVAYLAIPQQTQQVMLRFQDWVRSNRRQISLTILIGVGFLFLYQG
VTSL
>Rv2206 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MKLLGHRKSHGHQRADASPDAGSKDGCRPDSGRTSGSDTSRGSQTTGPKG
RPTPKRNQSRRHTKKGPVAPAPMTAAQARARRKSLAGPKLSREERRAEKA
ANRARMTERRERMMAGEEAYLLPRDRGPVRRYVRDVVDSRRNLLGLFMPS
ALTLLFVMFAVPQVQFYLSPAMLILLALMTIDAIILGRKVGRLVDTKFPS
NTESRWRLGLYAAGRASQIRRLRAPRPQVERGGDVG
>Rv2530A CONSERVED HYPOTHETICAL PROTEIN
MRTTLQIDDDVLEDARSIARSEGKSVGAVISELARRSLRPVGIVEVDGFP
VFDVPPDAPTVTSEDVVRALEDDV
>Rv1290A HYPOTHETICAL PROTEIN
MLALHGLSEGVSGSGGSGGRWGAGEVLEGARIGVIADGVSCFPTKADCRR
IRGVPVFDGYTRMVARLMGSLAVLRSVSIPKGYRDFGFGSLRAVAPKNCP
DVSG
>Rv2074 CONSERVED HYPOTHETICAL PROTEIN
MAMVNTTTRLSDDALAFLSERHLAMLTTLRADNSPHVVAVGFTFDPKTHI
ARVITTGGSQKAVNADRSGLAVLSQVDGARWLSLEGRAAVNSDIDAVRDA
ELRYAQRYRTPRPNPRRVVIEVQIERVLGSADLLDRA
>Rv0235c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MGWFSAPEYWLGRLALERGTAIIYLIAFVAAAQQFRPLIGEHGMLPVPRY
LAGQSFWRTPSIFHFRYSDRVFAGVCWLGAVLSAAVVAGAASFVPLWATM
LIWLTLWVLYLSIVNVGQAWYSFGWESLLLETGFLMIFLGNERTAPPILT
LLLARWLLFRVEFGAGLIKMRGDSCWRSLTCLYYHHETQPMPGPLSWFFH
HLPKPLHRIEVAGNHFAQLVVPFGLFTPQPAASIAAAIIVVTQLWLVASG
NFSWLNWLTILLACSAIDTSSAAALLPMPAQPALSAPPQWFAGLVVVFTA
AVLLLSYWPARNLLSSHQRMNMSFNPFHLVNTYGAFGSICRTRREVVIEG
TDESPITEQTVWKAYEFKGKPGDPRRLPRQWAPYHLRLDWLMWFAAISPG
YALPWMTPFLNRLLRNDPATLKLLRHNPFPQSPPRYVRAQLYQYRFTTVA
ELRRDRAWWHRTLIGRYVPPMSLRKVASPPAD
>Rv1671 PROBABLE MEMBRANE PROTEIN
MPTVGPADHAAGLDRRATPDQLPIWRIGIISGLVGMLCCVGPTILALVGI
ISAATAFAWANDLYDNYAWWFRVSGLAVLAILVWWALRHRNRCSVNAIRR
LRWRLMAVLAIAVGTYGVLSAVTTWFGTFV
>Rv2709 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MWDSRVMKHGLRLGFNGQFDDFDDFDDKGRPVLITAAAPSYEVEHRTRVR
KYLTLMAFRVPALILAAIAYGAWHNGLISLLIVAASVPLPWMAVLIANDR
PPRRADEPRRFDVARRRIPLFPTAERPALEPRRQPAERSAPRGFADHG
>Rv2662 HYPOTHETICAL PROTEIN
MDDLTRLRRELLDRFDVRDFTDWPPASLRALIATYDPWIDMTASPPQPVS
PGGPRLRLVRLTTNPSARAAPIGNGGDSSVCAGEKQCRPP
>Rv1546 CONSERVED HYPOTHETICAL PROTEIN
MASVELSADVPISPQDTWDHVSELSELGEWLVIHEGWRSELPDQLGEGVQ
IVGVARAMGMRNRVTWRVTKWDPPHEVAMTGSGKGGTKYGVTLTVRPTKG
GSALGLRLELGGRALFGPLGSAAARAVKGDVEKSLKQFAELYG
>Rv0257 CONSERVED HYPOTHETICAL PROTEIN
MTRVSWLPDRCLPRLPACGRGLRGSLPGDSGGTAPDSHRLPASSSPDGKN
IGMQSVDLHVERHLPSRGRSHRTVATVTCVTALGDIRSAQLSATGAWPAV
LFPSWSWLCGIGGGVDLQKPSCRA
>Rv3686c CONSERVED HYPOTHETICAL PROTEIN
MVYTGSDAGDHASAPQPSGSGSVPASVNVPGLVVAAVWAVGLVAGLVALT
IGHLAVAAAALVVAVMAPWCRVAYIAHGQHRVCGETLRGTPAGETASFPT
GWRGLRFSTR
>Rv1006 HYPOTHETICAL PROTEIN
MVLRSRKSTLGVVVCLALVLGGPLNGCSSSASHRGPLNAMGSPAIPSTAQ
EIPNPLRGQYEDLMEPLFPQGNPAQQRYPPWPASYDASLRVSWRQLQPTD
PRTLPPDAPDDRKYDFSVIDNALTRLADRGMRLTLRVYAYSSCCKASYPD
GTNIAIPDWERAIASTNTSYPGPATDPSTGVVQVVPNFNDSTYLNDFAQL
LAALGRRYDGDERLSVFEFSGYGDFSENHVAYLRDTLGAPGPGPDESVAT
LGYYSQFRDQNITTASIKQLIAANVSAFPHTQLVTSPANPEIVRELFADE
VTNKLAAPVGVRSDCLGVDAPLPAWAESSTSHYVQTKDPVVAALRQRLAT
APVITEWCELPTGSSPRAYYEKGLRDVIRYHVSMTSSVNFPDQTATSPMD
PALYLVWAQANAAAGYRYSVEAQPGSQALAGKVATISVTWTNYGAAAATE
KWVPGYRLVDSTGQVVRTLPAAVDLKTLVSDQRGDRSSDQPTPASVAETV
RVDLSGLPAGHYTLRAAIDWQQHKPNGSHVVNYPPMLLSRDGRDDSGFYP
VATLDIPRDAQTAVNAS
>Rv3166c CONSERVED HYPOTHETICAL PROTEIN
MPGTKPGSDKPTGRVVVVIVLLMLAGAALRGHLPADDGAPLAAAGGSRAA
LMFIVAALAATLALIALAIITRLRHPLPVAPSAGELSAMLGGAAGRPNWR
VLLLGLGTILAWLLIAILLARLFVPDDVGPAAPIPDSTATPDASSTTPSR
PQPPQDNNDDVLGILFASTIGLFLMVVAGSLITSRRQRKSAPARISGDRI
ESPAPSARSESLARAAEIGLAEMADLRREPREAIIACYVAMERELSHVPG
VAPQDFDTPTEVLARAVEHRALHGASAAALVSLFAEARFSPHVMNEEHRE
VAMRLLRLVLDELSTRTAI
>Rv2760c CONSERVED HYPOTHETICAL PROTEIN
MSLNIKSQRTVALVRELAARTGTNQTAAVEDAVARRLSELDREDRARAEA
RRAAAEQTLRDLDKLLSDDDKRLIRRHEVDLYDDSGLPR
>Rv0801 CONSERVED HYPOTHETICAL PROTEIN
MALKVEMVTFDCSDPAKLAGWWAEQFDGTTRELLPGEFVVVARTDGPRLG
FQKVPDPAPGKNRVHLDFTTKDLDAEVLRLVAAGASEVGRHQVGESFRWV
VLADPEGNAFCVAGQ
>Rv0882 PROBABLE TRANSMEMBRANE PROTEIN
MNDQRDQAVPWATGLAVAGFVAAVIAVAVVVLSLGLIRVHPLLAVGLNIV
AVSGLAPTLWGWRRTPVLRWFVLGAAVGVAGAWLALLALTLGDG
>Rv2288 HYPOTHETICAL PROTEIN
MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCD
GDVDGRKLLPPARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQRE
PGWAPFGWLHEPSGARCPKADGQSV
>Rv2085 CONSERVED HYPOTHETICAL PROTEIN
MSDMCDVVSFVGAAERVLRARFRPSPESGPPVHARRCGWSLGISAETLRR
WAGQAEVDSGVVAGVSASRSGSVKTSELEQTIEILKVATSFFARKCDPRH
R
>Rv1669 HYPOTHETICAL PROTEIN
MSRRPGYSNGRAGASRQAARGGSAGASSVAFSSQPNCGLTESVLGHQVTG
ICLGTIHLDAMQWPWSSAYRLEPAVATTLIGISAWWANGSVKQYAGDLTD
RVATMTVCRRTPAPRVHYRQ
>Rv1289 HYPOTHETICAL PROTEIN
MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFR
TALRDSLDIYGVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHR
WLHGHADESSVEFEVSPYVNASAALRIANDGKIQLPKSAILGLLAVAVFA
PENKGEVIPPDYQLSWYDHVFFISVWWGWQDHFREIVNVDRASLVALDFG
DLWNGWTPVG
>Rv0029 CONSERVED HYPOTHETICAL PROTEIN
MAIFGRWSARQRLRRATRESLTIPTFSSSLDCTTRVIGGLWPAELSSNTA
ETATLAEHLKADLHRIVGSANDELMVIWRAGMADSTRRAEEDRVIDRARA
SAMRRVESAMRELRQITGRVPVEIPRMRGAGGSDLDTTRLMPAVTVVQPA
DQACTDWPVAAAEDDEARLQRLLAFVARQEPRLNWAVGVHADGTTVLVTD
VAHGWIPPGIALPEGVRLLAPARRAGRAPELVGITTCCKTYTPGDSLRRA
VDSTAPTSSVQPRALPAIAGLSVELGIATQRHDGLPKIVHAMATAAGNGA
AAEEVDLLRVHVDTALHHVLAQYPRVDPALLLNCMLLAATERSVTGDPIA
ANYHFAWFRELDSRR
>Rv0192A CONSERVED SECRETED PROTEIN
MSRWKQGWTRGSLFAALNIAAVVAVLMLGAGVAVADPDAAPGDPGGPGAP
GAQRDPSTRRQLTCWRRHPTRWRCRRHLTRWRRRHLTRSRRPRLTRWQCR
>Rv3863 HYPOTHETICAL ALANINE RICH PROTEIN
MAGERKVCPPSRLVPANKGSTQMSKAGSTVGPAPLVACSGGTSDVIEPRR
GVAIIGHSCRVGTQIDDSRISQTHLRAVSDDGRWRIVGNIPRGMFVGGRR
GSSVTVSDKTLIRFGDPPGGKALTFEVVRPSDSAAQHGRVQPSADLSDDP
AHNAAPVAPDPGVVRAGAAAAARRRELDISQRSLAADGIINAGALIAFEK
GRSWPRERTRAKLEEVLQWPAGTIARIRRGEPTEPATNPDASPGLRPADG
PASLIAQAVTAAVDGCSLAIAALPATEDPEFTERAAPILADLRQLEAIAV
QATRISRITPELIKALGAVRRHHDELMRLGATAPGATLAQRLYAARRRAN
LSTLETAQAAGVAEEMIVGAEAEEELPAEATEAIEALIRQIN
>Rv3321c CONSERVED HYPOTHETICAL PROTEIN
MRTTLSIDDDVLLAVKERARREKRTAGEILSDLARQALTNQNPQPAASQE
DAFHGFEPLPHRGGAVSNALIDRLRDEEAV
>Rv0817c PROBABLE CONSERVED EXPORTED PROTEIN
MPMRKVLVGVTGAAIVVAVLIVGAVGADFGASIYAEYRLSTTVRKAANLR
SDPFVAILRFPFIPQAMREHYAELEIKAFAVEHAGSGTATLEATMHSIDL
SYASWLIRPDAKLPVGELESRIIIDSMHLGRYLGISDLMVAAPRQESNDA
TGGTTESGISGSRGLVFSGTPISANFAHRVSVLVDLSVASDDRATLVITP
TAVVTGPDTADQPVPDDKRDAVLHAFASKLPNQKLPFGVVPNTVGARGSD
VIIEGITRGVTISLDEFKQS
>Rv1097c PROBABLE MEMBRANE GLYCINE AND PROLINE RICH PROTEIN
MTVPPAGPYGNYPYGPNTYGQDPYWGGQPQGGSYPPAYPPQQYPPGWPAG
PYPPGPPPPGPGSKTPWLILAGLAVLGVILLVVILVIGLRGDNKSTTATS
PATSAPTSQPFSQQTATGCTPNVSGGVQPIGDSISAGKLSFPTSAAPGWS
AFSDDQNPNLIDAVGVGHEVAGADQWMMQAEVAITNFVTTMDVAAQASKL
MQCVADGPGYAGSSPTLGPTKTSSITVDGVRAARVDADITIADSSRNVKG
DSVTIIAVDTKPVTVFLGATPIGDATSRATVERVIEALKVNKS
>Rv0420c POSSIBLE TRANSMEMBRANE PROTEIN
MRLHDASAAAPESRMHIARHGEAVNRRQMFIGITGLLLAVIGLMALWFPV
YLDQYDAYGIKVTCGSGWRSNLTQALYADGNDNTQALVTRCDTALLVRRA
WAIPSVALGWLLVTGFLVMWVHNDQHQGQSYPGYRA
>Rv0580c CONSERVED HYPOTHETICAL PROTEIN
MTDQSYAVDIAHPPAALLRLVNPILRSLLHTPLAGPLRTQLMVVSFTGRK
TGRHFSIPLSAHVIDNDLYALTEAGWKHNFSDGAAAQVVYDGKTTAMRGE
LIRDRAVVSELFLRAAQAYGVKRGQRMLGLSFRDRRIPTLEEFAEAVDRL
KLVAIRLTPADNS
>Rv2365c CONSERVED HYPOTHETICAL PROTEIN
MMRRPITLAEQLDAEDAKLVVLARAAMARAEAGAGAAVRDVDGRTYAAAP
VALSALELTGLQAAVAAAVSSGATGLQAAVLVAGSVDDPGIAAVRELAPT
AAIIVTDRAGNPL
>Rv2862c CONSERVED HYPOTHETICAL PROTEIN
MTETGGDMVALRVSDADRNGTMRRLHNAVALGLINIDEFEQRSSRVSFAC
TRSELDGLVGDLPRPGAIVTSAADRVELRGWAGSLKRHGEWIVPTRLALV
RRLGSIELDLVKARFAGPVVVIELDMMFGSLEVRLPNGASASIDDVEVYV
GSASDRRKDAPAEGTPHVVLTGRMVCGSVVIKGPRRALLRRHRG
>Rv1573 Probable phiRV1 phage protein
MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALV
RLQAALNRHEHTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAI
NRQLGLAGDDEPDGDDTPPWSRMIGLGGGSPAEDER
>Rv1721c CONSERVED HYPOTHETICAL PROTEIN
MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEPALDDVL
DRLAALPRRDLGASAAELVDEARSE
>Rv3613c HYPOTHETICAL PROTEIN
MCTMPKLWRAFMAGRPLGSTFTPRQPTGAAPNHVRALDDSIDPSSAPAAR
AAL
>Rv3889c HYPOTHETICAL PROTEIN
MLTTTVDGLWVLQAVTGVEQTCPELGLRPLLPRLDTAERALRHPVAAELM
AVGALDQAGNADPMVREWLTVLLRRDLGLLVTIGVPGGEPTRAAICRFAT
WWVVLERHGNLVRLYPAGTASDEAGAGELVVGQVERLCGVAEAAPLRPVT
VDADELLHAVRDAGTLRSYLLSQRLDVDQLQMVTMAADPTRSAHATLVAL
QAGVGPEKSARILVGDSTVAIVDTAAGRICVESVTSGQRRYQVLSPGSRS
DIGGAVQRLIRRLPAGDEWYSYRRVV
>Rv1435c Probable conserved Proline, Glycine, Valine-rich secreted protein
MTLMAIVNRFNIKVIAGAGLFAAAIALSPDAAADPLMTGGYACIQGMAGD
APVAAGDPVAAGGPAAAGACSAALTDMAGVPFVAPGPVPAAAPVPIGAPV
PIPGAPVPIPGAPVPIPGGPVPIPGAPVPVPAVPAPVIPVGTPLIALGPV
LAGAPGDGVVSAPIIGMSGVKDALTDPAPAGGPVPGQPVLPGPSASAPAG
AR
>Rv1738 CONSERVED HYPOTHETICAL PROTEIN
MCGDQSDHVLQHWTVDISIDEHEGLTRAKARLRWREKELVGVGLARLNPA
DRNVPEIGDELSVARALSDLGKRMLKVSTHDIEAVTHQPARLLY
>Rv0397 CONSERVED 13E12 REPEAT FAMILY PROTEIN
MLATFWGWRAQQLPDGTVIWTLPGDQTYVTTPGSALLFPALCTPTGDPPR
PDPARADRRGQRTAMMPRRASTRAQNRAHYIAAERHRNHQARRIAHVVTQ
TATTAPETNGPPPDPDDDPPPF
>Rv0193c HYPOTHETICAL PROTEIN
MIQISRDMSSLGQTATTQALPDNSDGIQLTKFAADDILPLEYAPPIGPEL
VSQDQLPAAWAYKRFRDLDDKESYRRKLLQELTDALAAQGSEAAEIATAA
LRDLIDQMAEQGAVVLADIVESDDFLELVKRYDELMAREGSRSFIHRFLD
LRRSPGMLTDPAVNGALVHPLMIALISYAVGGPIRMIDARGKDAEPLSVL
AQDNMLHIDNTPFNDEYKILITWRRGTAQGPAGQNFTFLPGTHKLARTCF
VNEDGVPWSSENASIFTTPDSIRKVFDAQRQLGGQDHPTVIEVTDSERPL
SGVFAAGSLVHHRFRTASGSARSCIILVFHRVADNPGRMVSDVEDSSDVS
LSELLTRGVPDESYQQRFIATLCAAADEIAELLLKWKKTPQRPVSLPLQT
KQIDGARFEEWISAATKAPEVREIRNRELTIPYGEVLSAEEFFDLIWRLM
RFDKHGPLDLILYHDNREEPRKWARNLIREMSADRLYERLLGWLADIQQP
RPADCLRPLQIHALISEVLKTLPLDEDQDPPADWHFDLLGMSHAEAARSV
KHLLEDVAEALLRCEDMAAYLSTSLFAFWAVDAAYSLDGRRNLVVKDCAR
RLLRHYTMLSLTCFQ
>Rv2627c CONSERVED HYPOTHETICAL PROTEIN
MASSASDGTHERSAFRLSPPVLSGAMGPFMHTGLYVAQSWRDYLGQQPDK
LPIARPTIALAAQAFRDEIVLLGLKARRPVSNHRVFERISQEVAAGLEFY
GNRRWLEKPSGFFAQPPPLTEVAVRKVKDRRRSFYRIFFDSGFTPHPGEP
GSQRWLSYTANNREYALLLRHPEPRPWLVCVHGTEMGRAPLDLAVFRAWK
LHDELGLNIVMPVLPMHGPRGQGLPKGAVFPGEDVLDDVHGTAQAVWDIR
RLLSWIRSQEEESLIGLNGLSLGGYIASLVASLEEGLACAILGVPVADLI
ELLGRHCGLRHKDPRRHTVKMAEPIGRMISPLSLTPLVPMPGRFIYAGIA
DRLVHPREQVTRLWEHWGKPEIVWYPGGHTGFFQSRPVRRFVQAALEQSG
LLDAPRTQRDRSA
>Rv0249c PROBABLE SUCCINATE DEHYDROGENASE [MEMBRANE ANCHOR SUBUNIT] (SUCCINIC DEHYDROGENASE)
MSAPTANRPAIGVFTPTRAQIPERTLRTDLWWLPPLLTNLGLLAFICYAT
TRAFWGSQYWVEKYHYLTPFYSPCVSASCQPGASHLGVWFGHFPGWIPLG
AMVLPFLLGFRLTCYYYRKAYYRSVWQSPTSCAVPEPRAHYTGETRLPLI
VQNTHRYFFYIAVVVSLINTYDAIAAFHSPSGFGFGLGNVILTINVVLLW
AYTISCHSCRHATGGRLKHFSKHPVRYWIWTQVSKLNTRHMQFAWITLGT
LALTDFYIMLVASGSITDLRFIG
>Rv1398c CONSERVED HYPOTHETICAL PROTEIN
MKRTNIYLDEEQTASLDKLAAQEGVSRAELIRLLLNRALTTAGDDLASDL
QAINDSFGTLRHLDPPVRRSGGREQHLAQVWRATS
>Rv2574 CONSERVED HYPOTHETICAL PROTEIN
MYPCERVGLSFTETAPYLFRNTVDLAITPEQLFEVLADPQAWPRWATVIT
KVTWTSPEPFGAGTTRIVEMRGGIVGDEEFISWEPFTRMAFRFNECSTRA
VGAFAEDYRVQAIPGGCRLTWTMAQKLAGPARPALFVFRPLLNLALRRFL
RNLRRYTDARFAAAQQS
>Rv0942 HYPOTHETICAL PROTEIN
MGRSATIAMVPKRRDAMNRHSGPILSSGFIASSSNSCPANSLRMPSALAA
ETLSFDDRAVRRSTHHPGGGYPQKHAINLQSGLCPAYANASR
>Rv1579c Probable phiRv1 phage protein
MTPINRPLTNDERQLMHELAVQVVCSQTGCSPDAAVEALESFAKDGTLIL
RGDTENAYLEAGGNVLVHADRDWLAFHASYPGNDPLRDARPIEQDDDQGA
GSPS
>Rv0745 CONSERVED HYPOTHETICAL PROTEIN
MGPPHRSRPPLPSPGPTCQVLPTTAVIHTVTAEALGRIGIDAPRIPGSLD
VAAHAAIGLLPLVAGCDRRHRRPVRGARAGRAAQVSLCMTAIRVEPVSSN
AVCTGPAAQVGDQSRSPQRDYAHQALQPDVPRRRARRHRPRRCSAKTGSS
SSTMRCTCHQNQCLWSSGVSWALAR
>Rv3528c HYPOTHETICAL PROTEIN
MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRA
LDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVD
ALFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLG
EKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDAD
LESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ
>Rv1891 CONSERVED HYPOTHETICAL PROTEIN
MIRELVTTAAITGAAIGGAPVAGADPQRYDGDVPGMNYDASLGAPCSSWE
RFIFGRGPSGQAEACHFPPPNQFPPAETGYWVISYPLYGVQQVGAPCPKP
QAAAQSPDGLPMLCLGARGWQPGWFTGAGFFPPEP
>Rv3639c CONSERVED HYPOTHETICAL PROTEIN
MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPP
AAKDSAQDGFRHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFG
GPTPAPRGLATRQCPPRTVHVDRVRPNGAERALRARFRPILRPQFTLGDG
ANGLPLAACTKTGAYVPHLPYSPIAVDPQPSAGQQGPS
>Rv1425 CONSERVED HYPOTHETICAL PROTEIN
MKRLSSVDAAFWSAETAGWHMHVGALAICDPSDAPEYSFQRLRELIIERL
PEIPQLRWRVTGAPLGLDRPWFVEDEELDIDFHIRRIGVPAPGGRRELEE
LVGRLMSYKLDRSRPLWELWVIEGVEGGRIATLTKMHHAIVDGVSGAGLG
EILLDITPEPRPPQQETVGFVGFQIPGLERRAIGALINVGIMTPFRIVRL
LEQTVRQQIAALGVAGKPARYFEAPKTRFNAPVSPHRRVTGTRVELARAK
AVKDAFGVKLNDVVLALVAGAARQYLQKRDELPAKPLIAQIPVSTRSEET
KADVGNQVSSMTASLATHIEDPAKRLAAIHESTLSAKEMAKAPSAHQIMG
LTETTPPGLLQLAARAYTASGLSHNLAPINLVVSNVPGPPFPLYMAGARL
DSLVPLGPPVMDVALNITCFSYQDYLDFGLVTTPEVANDIDEMADAIEPA
LAELERAAE
>Rv1583c Probable phiRv1 phage protein
MTAGAGGSPPTRRCPATEDRAPATVATPSSADPTASRAVSWWSVHEHVAP
VLDAAGSWPMAGTPAWRQLDDADPRKWAAICDAARHWALRVETCQEAMAQ
ASRDVSAAADWPGIAREIVRRRGVYIPRAGVA
>Rv1929c CONSERVED HYPOTHETICAL PROTEIN
MADVPLDAQERLELCDLLEELGPAVATLIEGWTAHDLAAHIVLRERDLVA
GLCIVLPGPFQRFAERRRARLAQSKDFTWLVARIRSGPPMGFFRIGWVRT
LANLNEFFVHHEDVRRASGRGPRSLTPEMDAALWRNVRRGSHFLSRRLHG
CGLEIEWVGTGKRVRVRSGEPTARLTGPPGELLLYVFGRRAVARVEVSGP
LEAIAAVHRTHFGM
>Rv0227c PROBABLE CONSERVED MEMBRANE PROTEIN
MLRFAACGAIGLGAALLIAALLLSTYTTSRIAEIPLDIDATLISDGTGTA
LDSASLATEHIVVNQDVPLVSQQQVTVESPANADVVTLQVGSSLRRTDKQ
KDSGLLLAIVDTVTLNRKTAMAVSDDTHTGGAVQKPRGLNDENPPTAIPL
RHDGLSYRFPFHTEKKTYPYFDPIAQKAFDANYEGEEDVNGLTTYRFTQN
VGYTPEGKLVAPLKYPSLYAGDEDGKVTTSAAMWGLPGDPNEQITMTRYY
AAQRTFWVDPVSGTIVKETERANHYFARDPLKPEVTFADYQVTSTEETVE
SQVNAARDERDRLALWSRVLPITFTAAGLVALVGGGLFASFSLRTEGALM
AASGDRDDHDYRRGGFEEPVPGAEAETEKLPTQRPDFPREPSGSDPPRLG
SAQPPPPPDAGHPDPGPPERR
>Rv2656c POSSIBLE phiRv2 PROPHAGE PROTEIN
MTAVGGSPPTRRCPATEDRAPATVATPSSTDPTASRAVSWWSVHEYVAPT
LAAAVEWPMAGTPAWCDLDDTDPVKWAAICDAARHWALRVETCQAASAEA
SRDVSAAADWPAVSREIQRRRDAYIRRVVV
>Rv3819 HYPOTHETICAL PROTEIN
MMQFYDDGVVQLDRAALTLRRYHFPSGTAKVIPLDQIRGYQAESLGFLMA
RFNIWGRPDLRRWLPLDVYRPLKSTLVTLDVPGMRPKPACTPTRPKEFIA
LLDELLALHRT
>Rv1055 POSSIBLE INTEGRASE (FRAGMENT)
MTLDRHGHLLNDDLAVWPMRCAKSSRTLRYHCGMRRRNRVGLRA
>Rv2514c CONSERVED HYPOTHETICAL PROTEIN
MLYSFDTSAILNGRRDLFRPAVFRSLWGRVEDAISAGQIRSVDEVQRELA
RRDDDAKRWADGQTGLFCPLDEQIQQAARHILRLHPNMVRQGGRRSAADP
FVIALAMVNNATVVTQETASGNIEKPRIPDVCDALGVPWLTLMGYIEAQG
WTF
>Rv3046c CONSERVED HYPOTHETICAL PROTEIN
MTKTFSHPHFFRSVLRWLQVGYPEGVPGPDRVALLSLLRSTPLTEEQIGE
VVRHFTENGSPAVADRVIDRDEIAEFISEVTHHDAGPENIQRVAGILAAA
GWPLAGVDVGESESGSDRAPASQG
>Rv2654c POSSIBLE phiRv2 PROPHAGE PROTEIN
MSGHALAARTLLAAADELVGGPPVEASAAALAGDAAGAWRTAAVELARAL
VRAVAESHGVAAVLFAATAAAAAAVDRGDPP
>Rv2164c PROBABLE CONSERVED PROLINE RICH MEMBRANE PROTEIN
MRAKREAPKSRSSDRRRRADSPAAATRRTTTNSAPSRRIRSRAGKTSAPG
RQARVSRPGPQTSPMLSPFDRPAPAKNTSQAKARAKARKAKAPKLVRPTP
MERLAARLTSIDLRPRTLANKVPFVVLVIGSLGVGLGLTLWLSTDAAERS
YQLSNARERTRMLQQHKEALERDVREAASAPALAEAARRQGMIPTRDTAH
LVQDPDGNWVVVGTPKPADGVPPPPLNTKLPEDPPPPPKPAAVPLEVPVR
VTPGPDDPAPPARSGPEVLVRTPDGTATLGGATHLPTQAGPQLPGPVPIP
GAPGPMPAPPLGAVPSPAPAENPVPLQVGAAPPAGLPGPAPVAATPGLSG
GSQPMVAPPAPVPANGEQFGPVTAPVPTAPGAPR
>Rv0209 HYPOTHETICAL PROTEIN
MRGQGHQIFVDELARFATSSADQRVVAIAQRAAEPLRVAVRGRPGVGCRT
VARALQGAGSSSGMTVTPQARAADSDVDLVVYVTVEVVKPEDREAIAATR
RPVVAVLNKADLAGPLSGAGPIVMAQARCAQFSTLLGVPMESMIGLLAVA
ALDDLDDTLRAVLRALAAHPDGFDALDRAVAGFLAAALPVPTEVRLRLLD
TLDLFGIALGMAAFRPGRPSRTPAQLRTLLRRVSGVDAVIDKVTAAGSEV
RYRRLLDAVAELEALAAQAKEIGGPIGEFLRDDDTVLARMAAAVDVALAV
GLDVGPLDDPAAHLPRAVRWHRYSLDNGDMHRTCGADIARGSLRLWSLAG
GMPLHRYRKSS
>Rv3656c CONSERVED HYPOTHETICAL PROTEIN
MLVITMFRVLVARMTALAVDESGMSTVEYAIGTIAAAAFGAILYTVVTGD
SIVSALNRIIGRALSTKV
>Rv1893 CONSERVED HYPOTHETICAL PROTEIN
MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSI
DIATHAVDRTKEVFTGKTDDEG
>Rv0031 POSSIBLE REMNANT OF A TRANSPOSASE
MLARHFGAGRKAHSRAVATLKADIQAWHPAGIQTPKPRCESDVFARIGHT
SHPSTRKSRVGPGASEAPLA
>Rv3643 HYPOTHETICAL PROTEIN
MERSIGLEAAAQQAGHSGSEITRRHYVERSVTVPDYTAALDEYSRPIRAF
RPLKSNRPGDIPT
>Rv1291c CONSERVED HYPOTHETICAL SECRETED PROTEIN
MFTRRFAASMVGTTLTAATLGLAALGFAGTASASSTDEAFLAQLQADGIT
PPSAARAIKDAHAVCDALDEGHSAKAVIKAVAKATGLSAKGAKTFAVDAA
SAYCPQYVTSS
>Rv3492c CONSERVED HYPOTHETICAL MCE ASSOCIATED PROTEIN
MRRLISVAYALMVATIVGLSAAGGWFYWDRVQTGGEASARALLPKLAMQE
IPQVFGYDYQTVERSLTAVYPLLTPDYRQEFQKSANAQIIPEAKKREVVV
QANVVGVGVMDAKRDCASVMVYLNRTVTDKTRQPLYDGSRLRVDFQRIDG
KWLIAYITPI
>Rv0611c HYPOTHETICAL PROTEIN
MPDRPQHPTASRQSSMVSWNHGAAGWLHCVQCGSATNPTACLDWLPPIHA
RSGPMYAEHDVVVLTRDVPDKSLIAGDVGAVVGRYAAGGYEVDFTAANGC
TVAVVTLAGDDIRPRRRREIPHVREVA
>Rv0970 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MIHDLMLRWVVTGLFVLTAAECGLAIIAKRRPWTLIVNHGLHFAMAVAMA
VMAWPWGARVPTTGPAVFFLLAAVWFGATAVVAVRGTATRGLYGYHGLMM
LATAWMYAAMNPRLLPVRSCTEYATEPDGSMPAMDMTAMNMPPNSGSPIW
FSAVNWIGTVGFAVAAVFWACRFVMERRQEATQSRLPGSIGQAMMAAGMA
MLFFAMLFPV
>Rv3103c HYPOTHETICAL PROLINE-RICH PROTEIN
MKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSP
PTQVVPPGFVPDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDS
AVPPPFELPPPFGPGTTTPTPPAPLPQPGPGPTAGTYPKSEPPTR
>Rv2172c CONSERVED HYPOTHETICAL PROTEIN
MTLNTIALELVPPNLEGGKERAIEDARKVVQYSAASGLDGRIRHVMMPGM
IAEDDDRPIPMQPKLDVLDFWSIIKPELAGVHGLCTQVTAFMDEPSLHRR
LVDLSDAGMEGIVFVGVPRTMQDGEGSGVAPTDALSLYRQLVANRGVIVI
PTRDGEQGRLNFKCSRGATYGMTQLLYSDAIVGFLREFARTTEHRPEILL
SFGFVPKVETRIGLINWLIQDPGNAAVADEQAFVQKLAGSEPARRRRLMV
DLYKRVLDGVADLGFPLSIHLEATYGVSAAAFETFAEMLAYWSPAEPGKP
D
>Rv1535 HYPOTHETICAL PROTEIN
MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLV
DGAARLLSIPLRHLYAALWRVGLLEVQA
>Rv1482c CONSERVED HYPOTHETICAL PROTEIN
MTDPFLGSEALAAGVLTPYELRSRYVALHKDVYVPQGVELTAQLRAKALW
LRSRRRGVLAGYSASAFHGAKWIDADLPAAIIDTNRRRAPGLQVWEERIE
PDEICVIEGMRVTTPERTALDLTSRFPLDPAVAAVDALIQATDLKVADVE
PLIERYRGRRGMKAARAALDLVDGGAQSPKETWLRLLLIRAGFPRPQTQI
AVRNEWGWAEAHLDMGWQDIKVAAEYDGDHHLTSRYHYRKDILRHEKVQH
RYGWIVVRVVAEDHPADIIRRVGEARAFRA
>Rv2998 HYPOTHETICAL PROTEIN
MDVIWSATIATTVATGMRKPRMHGMPPITSGSMVTRVTRMSIRLAGDSTL
GRFSTSRLGLSSAKSKPEGDFGTACGAVSGGDAGVVALAEGVDDGQSKPG
AAGGARGVGGFRESRADCGEQFGVASWTPQGEFEFGGQEAKGVRSSWPAS
LTN
>Rv3587c PROBABLE CONSERVED MEMBRANE PROTEIN
MLDLEPRGPLPTEIYWRRRGLALGIAVVVVGIAVAIVIAFVDSSAGAKPV
SADKPASAQSHPGSPAPQAPQPAGQTEGNAAAAPPQGQNPETPTPTAAVQ
PPPVLKEGDDCPDSTLAVKGLTNAPQYYVGDQPKFTMVVTNIGLVSCKRD
VGAAVLAAYVYSLDNKRLWSNLDCAPSNETLVKTFSPGEQVTTAVTWTGM
GSAPRCPLPRPAIGPGTYNLVVQLGNLRSLPVPFILNQPPPPPGPVPAPG
PAQAPPPESPAQGG
>Rv0239 CONSERVED HYPOTHETICAL PROTEIN
MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASD
TWQPPTPRRLGPFRASEETWRELANEA
>Rv2474c CONSERVED HYPOTHETICAL PROTEIN
MVERGLWLPDPAHRADLATFVDHALRLDDAAVIRIRARSTGLLSAWVATG
FDVLASRVVAGKVRPDDLSVAARSLAHGLATTDASGYVDPGYSMDSAWRG
GLPPESGFTYLDDVPARVMLDLAHRGARLAKEHGSSAGPPVSLLDQEVIQ
VSSADVVVGLPMRCVFALTAMGFLPQSAETISADELIRVRISPAWLRLDA
RFGSVYRHRGHAALVLR
>Rv3821 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MWSTVLVLALSVICEPVRIGLVVLMLNRRRPLLHLLTFLCGGYTMAGGVA
MVTLVVLGATPLAGHFSVAEVQIGTGLIALLIAFALTTNVIGKHVRRATH
ARVGDDGGRVLRESVPPSGAHKLAVRARCFLQGDSLYVAGVSGLGAALPS
ANYMGAMAAILASGATPATQALAVVTFNVVAFTVAEVPLVSYLAAPRKTR
AFMAALQSWLRSRSRRDAALLVAAGGCLMLTLGLSNL
>Rv1778c HYPOTHETICAL PROTEIN
MRVSLFLSDAAQADAQSGKVHALGLGWRQCQTPTPPFALVLFLDIDWDET
NKQHQLKCQLLTADGDPVVVPGPHGPQRILFEAAAEAGRAPGAIHGTSVR
MPLTLNIPAGIPLEPGIYEWRVEVEGYERATAVEAFIVAGGGHPPASCG
>Rv0236c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MAPLSRKWLPVVGAVALALTFAQSPGQVSPDTKLDLTANPLRFLARATNL
WNSDLPFGQAQNQAYGYLFPHGTFFVIGHLLGVPGWVTQRLWWAVLLTVG
FWGLLRVAEALGVGGPSSRVVGAVAFALSPRVLTTLGSISSETLPMMLAP
WVLLPTILALRGTSGRSVRALAAQAGLAVALMGAVNAIATLAGCLPAVIW
WACHRPNRLWWRYTAWWLLAMALATLWWVMALTQLHGVSPPFLDFIESSG
VTTQWSSLVEVLRGTDSWTPFVAPNATAGAPLVTGSAAILGTCLVAAAGL
AGLTSPAMPARGRLVTMLLVGVVLLAVGHRGGLASPVAHPVQAFLDAAGT
PLRNVHKVGPVIRLPLVLGLAQLLSRVPLPGSAPRPAWLRAFAHPERDKR
VAVAVVALTALMVSTSLAWTGRVAPPGTFGALPQYWQEAADWLRTHHAAT
PTPGRVLVVPGAPFATQVWGTSHDEPLQVLGDGPWGVRDSIPLTPPQTIR
ALDSVQRLFAAGRPSAGLADTLARQGISYVLVRNDLDPETSRSARPILLH
RSIAGSPGLAKLAEFGAPVGPDPLAGFVNDSGLRPRYPAIEIYRVSAPAN
PGAPYFAATDQLARVDGGPEVLLRLDERRRLQGQPPLGPVLMTADARAAG
LPVPQVAVTDTPVARETDYGRVDHHSSAIRAPGDARHTYNRVPDYPVPGA
EPVVGGWTGGRITVSSSSADATAMPDVAPASAPAAAVDGDPATAWVSNAL
QAAVGQWLQVDFDRPVTNAVVTLTPSATAVGAQVRRILIETVNGSTTLRF
DEAGKPLTAALPYGETPWVRFTAAATDDGSAGVQFGITDLAITQYDASGF
AHPVQLRHTVLVPGPPPGSAIAGWDLGSELLGRPGCAPGPDGVRCAASMA
LAPEEPANLSRTLTVPRPVSVTPMVWVRPRQGPKLADLIAAPSTTRASGD
SDLVDILGSAYAAADGDPATAWTAPQRVVQHKTPPTLTLTLPRPTVVTGL
RLAASRSMLPAHPTVVAINLGDGPQVRQLQVGELTTLWLHPRVTDTVSVS
LLDWDDVIDRNALGFDQLKPPGLAEVVVLSAGGAPIAPADAARNRARALT
VDCDHGPVVAVAGRFVHTSIRTTVGALLDGEPVAALPCEREPIALPAGQQ
ELLISPGAAFVVDGAQLSTPGAGLSSATVTSAETGAWGPTHREVRVPESA
TSRVLVVPESINSGWVARTSTGARLTPIAVNGWQQAWVVPAGNPGTITLT
FAPNSLYRASLAIGLALLPLLALLAFWRTGRRQLADRPTPPWRPGAWAAA
GVLAAGAVIASIAGVMVMGTALGVRYALRRRERLRDRVTVGLAAGGLILA
GAALSRHPWRSVDGYAGNWASVQLLALISVSVVAASVVATSESRGQDRMQ
>Rv2309A HYPOTHETICAL PROTEIN
MATSSDDITINRHPPLNCAVNRHDESRRSPLRRGLLANGLRERQAGALFE
RYESQFDSFGYIEKVRYRGSGYRVEDVYARADSGPSAGAELPVGP
>Rv2516c HYPOTHETICAL PROTEIN
MTADWVVTFTFDADPSMETMDAWETQLEGFDALVSRVPGHGIDVTVYAPG
DWSVFDALAKMAGEVMPVVQAKSPIAVQIISEPEHRLRAEAFTTPELMSA
AEIADELGVSRQRVHQLRSTAGFPAPLADLRGGAVWDAAAVRRFAETWER
KPGRPHTGTAKFAYSWAVGPAVGRSGKAPNVRWRVENPDKIRFVLRNIGD
DIAEDVEIDLSRIDAITRNVPKKTVIRPGEGLNMVLIAAWGHPLPNQLYV
RWAGQDEWAAVPLHPAH
>Rv3897c CONSERVED HYPOTHETICAL PROTEIN
MMQQAVSGITGALGGAVGGVMGPLTQLPQQAMQAGQGAMQPLMSALQQTY
GAEGLDVADGARLVDSIEGEPGLGGEPGAGDVGAGGGGGGTTPTGYLGPP
PVPTSSPPTTPAGAPAKSVTPDPVSGTPRASGPAGMTGMPMVPPGALGAG
AEGANKDKPVEKRVTGCAEWSTGQGPLNSTAECSGEICRRQAGGHQVDAT
DPCCAERRQG
>Rv2772c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MTRRTLYVQLIIAFMCVAMVAYLVMLGRVAVAMIGSGRAAAAGLGLALLI
LPVIGLWAMIATLRAGFAYQRLARLIAEDGLDIDASALPRRASGRIQRDA
ADALFAAVRTELEDDADDWRRWYRLARAYDYAGDRRRAREAMKTALQLEG
RARPGAR
>Rv2111c CONSERVED HYPOTHETICAL PROTEIN
MAQEQTKRGGGGGDDDDIAGSTAAGQERREKLTEETDDLLDEIDDVLEEN
AEDFVRAYVQKGGQ
>Rv3493c CONSERVED HYPOTHETICAL MCE ASSOCIATED ALANINE AND VALINE RICH PROTEIN
MAADTGVAGGQQSTTRRARRKASRPAGPAEGESSRPAQGAATVRAAARTE
SKPAKAAKPALRPVKPPPRRPAHRVLVGWLSLAAGLLAIAALAWGVTALV
MQNRDADARQARNQRFVDAATQTVVNMFSYTPDTIDESVNRFVNGTSGPL
RGMLNANNNVDNLKGLFRATNATSEAVVNGAALEGIDEISDNASVLVSVR
VTVADIDGVNKPSMPYRLRVIVHEDENGRMTGYDLKYPDGGN
>Rv0332 CONSERVED HYPOTHETICAL PROTEIN
MRKPASSLAKVDYSSAYLEQTHAFGELIRNVDQSTPVPTCPGWSLGQLFR
HVGRGDRWAAQIVRDRLDHFLDPRSVEGGKPPPDPDDAISWLYGGARLLV
DAVEQTGVETPVWTFLGPRPAGWWVRRRLHEVAVHRADVAITVGGEFTLE
PNVAADGISEFLERIAVQAGSGGTPLPLEDDDTLHLHATDPGLLEAGEWT
VRRDERGVTWSHRHGKGAVALRGGATELLLAMVRRLSVADTGIELLGDAG
VWQKWLDRTPL
>Rv1805c HYPOTHETICAL PROTEIN
MTASVVATSRERHSHKAAKQRACEITDFEPEGRFRVRKRRRGRIGTKRSS
ISDTDYRRDSFRSHLLTAGAHGDADAQHKGMTAQQTTELGTPLVRALAPH
GVSGRSSRKPLGLNP
>Rv2698 PROBABLE CONSERVED ALANINE RICH TRANSMEMBRANE PROTEIN
MSGTRLAPHSVRYRERLWVPWWWWPLAFALAALIAFEVNLGVAALPDWVP
FATLFTVAAGTLLWLGRVEIRVTAGSADGAGVKLWAGPAHLPVAVIARSA
EIPATAKSAALGRQLDPAAYVLHRAWVGPMVLVVLDDPNDPTPYWLVSCR
HPERVLSALRS
>Rv3614c CONSERVED HYPOTHETICAL PROTEIN
MDLPGNDFDSNDFDAVDLWGADGAEGWTADPIIGVGSAATPDTGPDLDNA
HGQAETDTEQEIALFTVTNPPRTVSVSTLMDGRIDHVELSARVAWMSESQ
LASEILVIADLARQKAQSAQYAFILDRMSQQVDADEHRVALLRKTVGETW
GLPSPEEAAAAEAEVFATRYSDDCPAPDDESDPW
>Rv3612c CONSERVED HYPOTHETICAL PROTEIN
MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGA
VTHATGAMCPTLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLAS
GDPLHPALG
>Rv3845 HYPOTHETICAL PROTEIN
MDRVRRVVTDRDSGAGALARHPLAGRRTDPQLAAFYHRLMTTQRHCHTQA
TIAVARKLAERTRVTITTGRPYQLRDTNGDPVTARGAKELIDAHYHVDTR
THPHNRAHTDTMQNSKPAR
>Rv2050 CONSERVED HYPOTHETICAL PROTEIN
MADRVLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPFADDAEI
PGTWLCRNGMEGTLIEGDLPEPKKVKPPRTHWDMLLERRSIEELEELLKE
RLELIRSRRRG
>Rv3707c CONSERVED HYPOTHETICAL PROTEIN
MLRIGPTAGTGTPTGDYGIGATDLCEFVEFPSQLLQVCGDSFAGQGVGFG
GWYAPVALHVDTESIDDPAGVRYTGVTGVGTPLLADPTPPGDSQLPAGVV
QINRRNYLMVTTTKDLQPQNSRLVRAEAARGGWQTVSGSRRNAAYQDGRQ
TQISGYYDPVPTPDSPTGWVYIVADSFTRGEPAVLYRATPESFTDRSRWQ
GWAGGPDGGWNKPPTPLWPDQLGEMSIRQIDGQTVLSYFNASTGNMEVRV
AHHPTSLGAAPVTTVVRHDEWPEPAESLPPPYDNRLAQPYGGYISPGSTI
DELRIFVSQWDTRARQNGPYRVIQFAVNPFKPWSDP
>Rv2446c PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MTDRSREPADPWKGFSAVMAATLILEAIVVLLAIPVVDAVGGGLRPASLG
YLVGLAVLLILLTGLQRRPWAIWVNLGAQPVLVAGFAVYPGVGFIGVLFA
ALWVLIAYLRAEVRRRRDYRVSQ
>Rv2019 HYPOTHETICAL PROTEIN
MQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQS
VSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADIL
AEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL
>Rv2644c HYPOTHETICAL PROTEIN
MSPRRTSGGVVPVDRYRIDEGLIVVLVFAGRDERRRTVCFADKFGCVHIG
NPDLYRPQTSLPQPLPISSHAISGSRFVETTNRADQQEPIGPNRAELFDQ
ALHAG
>Rv1203c HYPOTHETICAL PROTEIN
MLLAYVLITKGEFGAAASMLEPAAATLERTGYSWGPLSLMLLATAIAQQG
HIAESAKTLQRAEARHGTKSALFAPELGLARAWTRAAAQDMTGAIAAARE
AARTAERAGQAAVALCAWHNAVRLGDIRAVDPVTRLAAEIDCTVGNILVK
HARGLADGDAAELTAVAEELAGIGMAAAAADATKAAARLGPQQR
>Rv0879c POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRP
VTVAGLATGLLGTTIFVWQLAAARRGARGAQAGLETYLDPK
>Rv2035 CONSERVED HYPOTHETICAL PROTEIN
MTRPRTDAIHHHVVVNAPIERAFAVFTTRFGDFKPREHNLLAIPITETVF
ECHAGGHIYDRGVDGSVCKWARVLVYEPPSRVLFTWDIGPTWRPETDLAK
TSEVEVRFTAQSAETTRVDLEHRHLDRHGPGWESVADGVDSEAGWPLYLR
RYTDLLCIQVQP
>Rv1249c POSSIBLE MEMBRANE PROTEIN
MSARRIRSWKRFDNRSANAAEPDPQLAGTGGRPKVSTRALAQVIERSSRI
QGPAAQAYVARLRRAHPGASPAKIVAKLEKRFLSVVTASGAAVGAAATLP
GIGTLAAWFAAAGEVVVFLEATALFVLALASVHAIPLDHRERRRALVLAV
LVGDNTTAVADLLGPGRTSGGWVSETMASLPLPAISSLNSRMLKYVVKRF
ALKRGALMFGKLVPMGIGAIIGAIGNRLVGKKLVRNARSAFGTPPARWPV
TLHVLPTVRDAS
>Rv0666 POSSIBLE MEMBRANE PROTEIN
MTPRTDEGAAAPCLMPDVTMPVKRGDARGALGVGPALFVVSVSSSLVRAR
SCRCTAD
>Rv3785 HYPOTHETICAL PROTEIN
MVTVARRPVCPVTLTPGDPALASVRDLVDAWSAHDALAELVTMFGGAFPQ
TDHLEARLASLDKFSTAWDYRARARAARALHGEPVRCQDSGGGARWLIPR
LDLPAKKRDAIVGLAQQLGLTLESTPQGTTFDHVLVIGTGRHSNLIRARW
ARELAKGRQVGHIVLAAASRRLLPSEDDAVAVCAPGARTEFELLAAAARD
AFGLDVHPAVRYVRQRDDNPHRDSMVWRFAADTNDLGVPITLLEAPSPEP
DSSRATSADTFTFTAHTLGMQDSTCLLVTGQPFVPYQNFDALRTLALPFG
IQVETVGFGIDRYDGLGELDQQHPAKLLQEVRSTIRAARALLERIEAGER
MATDPRR
>Rv0349 HYPOTHETICAL PROTEIN
MPELETPDDPESIYLARLEDVGEHRPTFTGDIYRLGDGRMVMILQHPCAL
RHGVDLHPRLLVAPVRPDSLRSNWARAPFGTMPLPKLIDGQDHSADFINL
ELIDSPTLPTCERIAVLSQSGVNLVMQRWVYHSTRLAVPTHTYSDSTVGP
FDEADLIEEWVTDRVDDGADPQAAEHECASWLDERISGRTRRALLSDRQH
ASSIRREARSHRKSVKLAD
>Rv0500B CONSERVED HYPOTHETICAL PROTEIN
MGSVIKKRRKRMSKKKHRKLLRRTRVQRRKLGK
>Rv3450c PROBABLE CONSERVED MEMBRANE PROTEIN
MPSPATTWLHVSGYRFLLRRIECALLFGDVCAATGALRARTTSLALGCVL
AIVAAMGCAFVALLRPQSALGQAPIVMGRESGALYVRVDDVWHPVLNLAS
ARLIAATNANPQPVSESELGHTKRGPLLGIPGAPQLLDQPLAGAESAWAI
CDSDNGGSTTVVVGPAEDSSAQVLTAEQMILVATESGSPTYLLYGGRRAV
VDLADPAVVWALRLQGRVPHVVAQSLLNAVPEAPRITAPRIRGGGRASVG
LPGFLVGGVVRITRASGDEYYVVLEDGVQRIGQVAADLLRFGDSQGSVNV
PTVAPDVIRVAPIVNTLPVSAFPDRPPTPVDGSPGRAVTTLCVTWTPAQP
GAARVAFLAGSGPPVPLGGVPVTLAQADGRGPALDAVYLPPGRSAYVAAR
SLSGGGTGTRYLVTDTGVRFAIHDDDVAHDLGLPTAAIPAPWPVLATLPS
GPELSRANASVARDTVAPGP
>Rv2077A CONSERVED HYPOTHETICAL PROTEIN
MGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGI
NAAICCAAAEFATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV
>Rv3531c HYPOTHETICAL PROTEIN
MYSDPLREAIAEAEQLVAAAPHIETEADLLEGLQYLAGCIAGCMHLAFDY
ERDHPFLQSGTGPFTKMGLDNPDTLYFGTRLQANRDYVVSGRRGTTTDLS
FQLLGGEYTDYNVPASQAAFDDRELDIAADGSFEWRLRPSAPGQLVIREV
YGDWSQQRGTLAIARLDTVGTAPPPLTRELMEKRYATAGSQLVNRVKTWL
QFPQWFYLNIPVNTMVAPRLTPGGLATQYSSAGHFELRPGQALVITVPVS
DAPYLGFQLGSMWYISLDYINHQTSLNASQAQADPDGKVRIVVAEQNPGV
TNWVETLGHRRGFLQFRWQRVSRELTEADGPTVELVDFDAIPAALPHYQH
NKISEDDWRARIALRQRQIATRMLG
>Rv1351 HYPOTHETICAL PROTEIN
MTPRSLPRYGNSSRRKSFPMHRPSNVATATRKKSSIGWVLLACSVAGCKG
IDTTEFILGRAGAFELAVRAAQHRHRYLTMVNVGRAPPRRCRTVCMAATD
TPRNIRLNG
>Rv0381c HYPOTHETICAL PROTEIN
MRILVAWATCGAVVLSGLTGCSGSSHSGRTYGAQSARTGESLAVLGWNMS
VSNLRWSGDYVLIDVDASPTDPHAPHAKPEDIRFGLYGALAHPMESAALG
SCGDAMAHVRDVVSPLSAPAGRLTGTVCLGPLKERSAVRGVYTYSPRDRI
PGTAAAYPAAFPVGMLPTNQNDAGLVVKTTSVSAWRADGMQLGKPQLGDP
VAFTGNGYMLLGLEVDAVPDRYRDDSAARGGPMMLLAAPTLPGRGLSPAC
ATYGSSVLILPDALLDAVHISASLCTQGEINEALLYATVATVGTHAALWT
SR
>Rv3127 CONSERVED HYPOTHETICAL PROTEIN
MLKNAVLLACRAPSVHNSQPWRWVAESGSEHTTVHLFVNRHRTVPATDHS
GRQAIISCGAVLDHLRIAMTAAHWQANITRFPQPNQPDQLATVEFSPIDH
VTAGQRNRAQAILQRRTDRLPFDSPMYWHLFEPALRDAVDKDVAMLDVVS
DDQRTRLVVASQLSEVLRRDDPYYHAELEWWTSPFVLAHGVPPDTLASDA
ERLRVDLGRDFPVRSYQNRRAELADDRSKVLVLSTPSDTRADALRCGEVL
STILLECTMAGMATCTLTHLIESSDSRDIVRGLTRQRGEPQALIRVGIAP
PLAAVPAPTPRRPLDSVLQIRQTPEKGRNASDRNARETGWFSPP
>Rv1610 POSSIBLE CONSERVED MEMBRANE PROTEIN
MAANAGSVRPNRRARPMIGIAQLLLVVAAGALWMAARLPWVVIGSFDELG
PPKEVTLTGASWSTALLPLALLMLAAAVAALAVRGWPLRALAVLLAAASF
AVGYLGISLWVVPDVAARGADLAHVPVVTLVGSARHYWGAVAAVLAAVCA
LLAAVFLMSSAAIRGSAGEDMARYAAPRARRSIARRQHSNAAGRAAPQDD
GPDMGPRMSERMIWEALDEGRDPTDREQESDTEGR
>Rv3130c CONSERVED HYPOTHETICAL PROTEIN
MNHLTTLDAGFLKAEDVDRHVSLAIGALAVIEGPAPDQEAFLSSLAQRLR
PCTRFGQRLRLRPFDLGAPKWVDDPDFDLGRHVWRIALPRPGNEDQLFEL
IADLMARRLDRGRPLWEVWVIEGLADSKWAILTKLHHCMADGIAATHLLA
GLSDESMSDSFASNIHTTMQSQSASVRRGGFRVNPSEALTASTAVMAGIV
RAAKGASEIAAGVLSPAASSLNGPISDLRRYSAAKVPLADVEQVCRKFDV
TINDVALAAITESYRNVLIQRGERPRFDSLRTLVPVSTRSNSALSKTDNR
VSLMLPNLPVDQENPLQRLRIVHSRLTRAKAGGQRQFGNTLMAIANRLPF
PMTAWAVGLLMRLPQRGVVTVATNVPGPRRPLQIMGRRVLDLYPVSPIAM
QLRTSVAMLSYADDLYFGILADYDVVADAGQLARGIEDAVARLVAISKRR
KVTRRRGALSLVV
>Rv0538 POSSIBLE CONSERVED MEMBRANE PROTEIN
MDVALGVAVTDRVARLALVDSAAPGTVIDQFVLDVAEHPVEVLTETVVGT
DRSLAGENHRLVATRLCWPDQAKADELQHALQDSGVHDVAVISEAQAATA
LVGAAHAGSAVLLVGDETATLSVVGDPDAPPTMVAVAPVAGADATSTVDT
LMARLGDQALAPGDVFLVGRSAEHTTVLADQLRAASTMRVQTPDDPTFAL
ARGAAMAAGAATMAHPALVADATTSLPRAEAGQSGSEGEQLAYSQASDYE
LLPVDEYEEHDEYGAAADRSAPLSRRSLLIGNAVVAFAVIGFASLAVAVA
VTIRPTAASKPVEGHQNAQPGKFMPLLPTQQQAPVPPPPPDDPTAGFQGG
TIPAVQNVVPRPGTSPGVGGTPASPAPEAPAVPGVVPAPVPIPVPIIIPP
FPGWQPGMPTIPTAPPTTPVTTSATTPPTTPPTTPVTTPPTTPPTTPVTT
PPTTPPTTPVTTPPTTVAPTTVAPTTVAPTTVAPTTVAPATATPTTVAPQ
PTQQPTQQPTQQMPTQQQTVAPQTVAPAPQPPSGGRNGSGGGDLFGGF
>Rv2597 PROBABLE MEMBRANE PROTEIN
MGNLLVVIAVALFIAAIVVLVVAIRRPKTPATPGGRRDPLAFDAMPQFGP
RQLGPGAIVSHGGIDYVVRGSVTFREGPFVWWEHLLEGGDTPTWLSVQED
DGRLELAMWVKRTDLGLQPGGQHVIDGVTFQETERGHAGYTTEGTTGLPA
GGEMDYVDCASAGQGADESMLLSFERWAPDMGWEIATGKSVLAGELTVYP
APPVSA
>Rv1815 CONSERVED HYPOTHETICAL PROTEIN
MVRLVPRAFAATVALLAAGFSPATASADPVLVFPGMEIRQDNHVCTLGYV
DPALKIAFTAGHCRGGGAVTSRDYKVIGHLRAIRDNTPSGSTVATHELIA
DYEAIVLADDVTASNILPSGRALESRPGVVLHPGQAVCHFGVSTGETCGT
VESVNNGWFTMSHGVLSEKGDSGGPVYLAPDGGPAQIVGIFNSVWGGFPA
AVSWRSTSEQVHADLGVTPLA
>Rv3879c HYPOTHETICAL ALANINE AND PROLINE RICH PROTEIN
MSITRPTGSYARQMLDPGGWVEADEDTFYDRAQEYSQVLQRVTDVLDTCR
QQKGHVFEGGLWSGGAANAANGALGANINQLMTLQDYLATVITWHRHIAG
LIEQAKSDIGNNVDGAQREIDILENDPSLDADERHTAINSLVTATHGANV
SLVAETAERVLESKNWKPPKNALEDLLQQKSPPPPDVPTLVVPSPGTPGT
PGTPITPGTPITPGTPITPIPGAPVTPITPTPGTPVTPVTPGKPVTPVTP
VKPGTPGEPTPITPVTPPVAPATPATPATPVTPAPAPHPQPAPAPAPSPG
PQPVTPATPGPSGPATPGTPGGEPAPHVKPAALAEQPGVPGQHAGGGTQS
GPAHADESAASVTPAAASGVPGARAAAAAPSGTAVGAGARSSVGTAAASG
AGSHAATGRAPVATSDKAAAPSTRAASARTAPPARPPSTDHIDKPDRSES
ADDGTPVSMIPVSAARAARDAATAAASARQRGRGDALRLARRIAAALNAS
DNNAGDYGFFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHA
IPVDEIARCATYPVLAVQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVL
EPDDIPESGKMTGRSRLEVVDPSAAAQLADTTDQRLLDLLPPAPVDVNPP
GDERHMLWFELMKPMTSTATGREAAHLRAFRAYAAHSQEIALHQAHTATD
AAVQRVAVADWLYWQYVTGLLDRALAAAC
>Rv2100 CONSERVED HYPOTHETICAL PROTEIN
MAGALFEPSFAAAHPAGLLRRPVTRTVVLSVAATSIAHMFEISLPDPTEL
CRSDDGALVAAIEDCARVEAAASARRLSAIAELTGRRTGADQRADWACDF
WDCAAAEVAAALTISHGKASGQMHLSLALNRLPQVAALFLAGHLGARLFS
IIAWRTYLVRDPHALSLLDAALAEHAGAWGPLSAPKLEKAIDSWIDRYDP
GALRRSRISARTRDLCIGDPDEDAGTAALWGRLYATDAAMLDRRLTEMAH
GVCEDDPRTLAQRRADALGALAAGADHLACGCGKPDCPSGAGNDERAAGV
VIHVVADASALDAQPDPHLSGDEPPSRPLTPETTLFEALTPDPEPDPPAT
HAPAELITTGGGVVPAPLLAELIRGGATISQVRHPGDLAAEPHYRPSAKL
AEFVRMRDLTCRFPGCDVPAEFCDIDHSAPWPLGPTHPSNLKCACRKHHL
LKTFWTGWRDVQLPDGTVIWTAPNGHTYTTHPGSRIFFPTWHTTTAELPQ
TSTAAVNVDARGLMMPRRRRTRAAELAHRINAERALNDAYMAERNKPPSF
>Rv0748 CONSERVED HYPOTHETICAL PROTEIN
MRTTVSISDEILAAAKRRARERGQSLGAVIEDALRREFAAAHVGGARPTV
PVFDGGTGPRRGIDLTSNRALSEVLDEGLELNSRK
>Rv1116A CONSERVED HYPOTHETICAL PROTEIN (FRAGMENT)
MGALGTVRGLQDSNTAFVGALHSGNLLGATGAVLQAPGNAVNGFLFGQTS
ISQSIDVSPEYGYELVAVSDPVGGTAGSARAGHGYVHADLR
>Rv0692 CONSERVED HYPOTHETICAL PROTEIN
MWGLLTVPAPAQARRADSSEFDPDRGWRLHPQVAVRPEPFGALLYHFGTR
KLSFLKNRTILAVVQTLADYPDIRSACRGAGVDDCDQDPYLHALSVLAGS
NMLVPRQTT
>Rv1958c HYPOTHETICAL PROTEIN
MIPTPSIGAVINAKISHRACRTFPRPTDIHPRRYLPRKHGGTNPRRLSMN
PGGMRIRCRRGDKSRKLLSRSQVQPLVGRPAKIPSPAANAPPSRARTASP
VFENLELRAAAGLAFGFRLRPFGGTAADSPPVAAQDLDPCRWADSPALHL
AVGVETMVVGQLDSPSFGQGVPLVAGHWAPGETGIGRDNISRVNGGSARR
PVRS
>Rv2269c HYPOTHETICAL PROTEIN
MANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGI
GRSETVTDHGAVGRRYHQPCAGQIRLSELRVTILLRCETLCETAQLLRCP
PLPCDCSTPL
>Rv3616c CONSERVED HYPOTHETICAL ALANINE AND GLYCINE RICH PROTEIN
MSRAFIIDPTISAIDGLYDLLGIGIPNQGGILYSSLEYFEKALEELAAAF
PGDGWLGSAADKYAGKNRNHVNFFQELADLDRQLISLIHDQANAVQTTRD
ILEGAKKGLEFVRPVAVDLTYIPVVGHALSAAFQAPFCAGAMAVVGGALA
YLVVKTLINATQLLKLLAKLAELVAAAIADIISDVADIIKGTLGEVWEFI
TNALNGLKELWDKLTGWVTGLFSRGWSNLESFFAGVPGLTGATSGLSQVT
GLFGAAGLSASSGLAHADSLASSASLPALAGIGGGSGFGGLPSLAQVHAA
STRQALRPRADGPVGAAAEQVGGQSQLVSAQGSQGMGGPVGMGGMHPSSG
ASKGTTTKKYSEGAAAGTEDAERAPVEADAGGGQKVLVRNVV
>Rv1957 HYPOTHETICAL PROTEIN
MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYD
LEFEPAVDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVAT
ADFEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTG
RLALPPLTLEILSRPMPVSPGAQWPATRGTP
>Rv3435c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MGRILRVVVGLVLVIAAYVTVIALYHSTGLGRPHEVAHGRPTADGTTVTL
HVEQLQTIKGVLVANLAVSPGTELLDSQTQGLKDDLTVTVTSVVTPTKRT
WSSGSLPGVFPVPLTISGDPANWPFDHYRSGPITVQLYRGAAHAPERVSV
TFVDRLPGWNVDISGVGDANVPAPYRVGLHRSPSSVAFGTVIVGVLIALA
GVGLFVAVQTARGRRQFQPPMTTWYAAMLFAVIPLRNALPDAPPIGFWID
VTVVLWVVVALVTSMVLYILCWWWHLKPDVDETM
>Rv3898c CONSERVED HYPOTHETICAL PROTEIN
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPA
DIANGALFAAGNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQ
FQGVGAQAEA
>Rv1590 CONSERVED HYPOTHETICAL PROTEIN
MVEIVAGKQRAPVAAGVYNVYTGELADTATPTAARMGLEPPRFCAQCGRR
MVVQVRPDGWWARCSRHGQVDSADLATQR
>Rv2063 CONSERVED HYPOTHETICAL PROTEIN
MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAERE
ASHAETTTQAVRDEDREWEGTVGDGLG
>Rv0236A SMALL SECRETED PROTEIN
MNRIVAPAAASVVVGLLLGAAAIFGVTLMVQQDKKPPLPGGDPSSSVLNR
VEYGNRS
>Rv0479c PROBABLE CONSERVED MEMBRANE PROTEIN
MTNPQGPPNDPSPWARPGDQGPLARPPASSEASTGRLRPGEPAGHIQEPV
SPPTQPEQQPQTEHLAASHAHTRRSGRQAAHQAWDPTGLLAAQEEEPAAV
KTKRRARRDPLTVFLVLIIVFSLVLAGLIGGELYARHVANSKVAQAVACV
VKDQATASFGVAPLLLWQVATRHFTNISVETAGNQIRDAKGMQIKLTIQN
VRLKNTPNSRGTIGALDATITWSSEGIKESVQNAIPILGAFVTSSVVTHP
ADGTVELKGLLNNITAKPIVAGKGLELQIINFNTLGFSLPKETVQSTLNE
FTSSLTKNYPLGIHADSVQVTSTGVVSRFSTRDAAIPTGIQNPCFSHI
>Rv1363c POSSIBLE MEMBRANE PROTEIN
MAETTEPPSDAGTSQADAMALAAEAEAAEAEALAAAARARARAARLKREA
LAMAPAEDENVPEEYADWEDAEDYDDYDDYEAADQEAARSASWRRRLRVR
LPRLSTIAMAAAVVIICGFTGLSGYIVWQHHEATERQQRAAAFAAGAKQG
VINMTSLDFNKAKEDVARVIDSSTGEFRDDFQQRAADFTKVVEQSKVVTE
GTVNATAVESMNEHSAVVLVAATSRVTNSAGAKDEPRAWRLKVTVTEEGG
QYKMSKVEFVP
>Rv0150c CONSERVED HYPOTHETICAL PROTEIN
MLTLPDDRAPTGLPDPGIEALAHTKIASTISTVVADGYAVVLSTADIANS
LLANAIGYPIAASVALVTPAAGANSSCWPADPSQHHRIAESRACA
>Rv1545 HYPOTHETICAL PROTEIN
MPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVG
ITTILAGRDECGVCDKTAGLDGAAP
>Rv0378 CONSERVED HYPOTHETICAL GLYCINE RICH PROTEIN
MSGRWEAGNADGNGGSAGLIGSGGAGGDGGSGGATGAGGEGGDAGASGSI
NGNAGDPGNSGERGAVGKPGAPG
>Rv0556 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MISPKPLLHILIHGLSDELPDTRGRIVLRWLRIAVLIVTGLVTLQSVLLV
AGAWRNDIAIQRNMGVAQAEVLSAGPRRSTIEFVTPDRITYRPQLGVLYP
SELSTGMRIYVEYNKRDPNLVRVQHRNAGLAIIPAGSIAVVAWLIAAAAL
VVLAVLDKRLERRENSASATG
>Rv3878 CONSERVED HYPOTHETICAL ALANINE RICH PROTEIN
MAEPLAVDPTGLSAAAAKLAGLVFPQPPAPIAVSGTDSVVAAINETMPSI
ESLVSDGLPGVKAALTRTASNMNAAADVYAKTDQSLGTSLSQYAFGSSGE
GLAGVASVGGQPSQATQLLSTPVSQVTTQLGETAAELAPRVVATVPQLVQ
LAPHAVQMSQNASPIAQTISQTAQQAAQSAQGGSGPMPAQLASAEKPATE
QAEPVHEVTNDDQGDQGDVQPAEVVAAARDEGAGASPGQQPGGGVPAQAM
DTGAGARPAASPLAAPVDPSTPAPSTTTTL
>Rv1574 Probable phiRV1 phage related protein
MGYKPESERHSTKTDTAIGAALGISAGTYRRLKRIDNATHSDDKEIRRFA
EKQMAPLVAGSPSWNARKPRSANARVVASVHRSPMPALVPWNQSRLSATL
TRR
>Rv0487 CONSERVED HYPOTHETICAL PROTEIN
MTSSLPTVQRVIQNALEVSQLKYSQHPRPGGAPPALIVELPGERKLKINT
ILSVGEHSVRVEAFVCRKPDENREDVYRFLLRRNRRLYGVAYTLDNVGDI
YLVGQMALSAVDADEVDRVLGQVLEVVDSDFNALLELGFRSSIQREWQWR
LSRGESLQNLQAFAHLRPTTMQSAQRDEKELGG
>Rv1486c CONSERVED HYPOTHETICAL PROTEIN
MWCPSVSLSIWANAWLAGKAAPDDVLDALSLWAPTQSVAAYDAVAAGHTG
LPWPDVHDAGTVSLLQTLRAAVGRRRLRGTINVVLPVPGDVRGLAAGTQF
EHDALAAGEAVIVANPEDPGSAVGLVPEFSYGDVDEAAQSEPLTPELCAL
SWMVYSLPGAPVLEHYELGDAEYALRSAVRSAAEALSTIGLGSSDVAKPR
GLVEQLLESSRQHRVPDHAPSRALRVLENAAHVDAIIAVSAGLSRLPIGT
QSLSDAQRATDALRPLTAVVRSARMSAVTAILHSAWPD
>Rv3178 CONSERVED HYPOTHETICAL PROTEIN
MRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQA
ENPQWYRNLPPNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADF
DSCQSWTERGIPVIILRPR
>Rv2809 HYPOTHETICAL PROTEIN
MTYAARDDTTLPKLLAQMRWVVLVDKRQLAVLLLENEGPVASATDTLDTR
GDSDYENQPVDAVERLCRRLADQAVRQWGFMQGLKQKLGPGVDVRMKLVE
WNR
>Rv1571 CONSERVED HYPOTHETICAL PROTEIN
MVHSIELVFDSDTEAAIRRIWAGLAAAGIPSQAPASRPHVSLAVAERIAP
EVDEPLGAVARRLPLDCVIGAPVLFGRANVVFTRLVVPTSELLALHAEVH
RLCGPHLAPAPMANSLPGQWTAHVTLARRVGGHQLGRALRIAGRPSRIDG
RFAGLRRWDGNTRAEYLLG
>Rv1234 PROBABLE TRANSMEMBRANE PROTEIN
MTSPFQPRQVPGSTPAAAGAGRRGVPALPTPPKGWPVGSYPTYAEAQRAV
DYLSEQQFPVQQVTIVGVDLMQVERVTGRLTWPKVLGGGVLSGAWLGLFI
GLVLGFFSPNPWSALVTGLVAGVFFGLITSAVPYAMARGTRDFSSTMQLV
AGRYDVLCDPQNAEKARDLLARLAI
>Rv3243c HYPOTHETICAL PROTEIN
MSPRVPRLRWDDPFRALDMLASLWSSTGMSLVSAGAAQAVAAPYRTLFTT
LQQLLIGKEVTVRIGDHDVVLTVTELDSALEPQGLAVGQLGEVRVAARGI
SWDQHHLHSAVAVLRNVHIRPGVPPLVIAAPVELSSALPTEIFDDVLRQA
TPQLRGELSESGAARLRWARRPDWGGLEVDVDVAGTTSQTTLWLRPRTVI
TGQRRWTLPARTPAYRVPLPELPHGLRITDVSLAADCLQLSALLPEWRTE
LPLRYLESVITQLSQGALSFVWPPLRSGAD
>Rv1693 CONSERVED HYPOTHETICAL PROTEIN
MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLA
ALESAEKG
>Rv2219 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MAKPRNAAESKAAKAQANAARKAAARQRRAQLWQAFTLQRKEDKRLLPYM
IGAFLLIVGASVGVGVWAGGFTMFTMIPLGVLLGALVAFVIFGRRAQRTV
YRKAEGQTGAAAWALDNLRGKWRVTPGVAATGNLDAVHRVIGRPGVIFVG
EGSAARVKPLLAQEKKRTARLVGDVPIYDIIVGNGDGEVPLAKLERHLTR
LPANITVKQMDTVESRLAALGSRAGAGVMPKGPLPTTAKMRSVQRTVRRK
>Rv0108c HYPOTHETICAL PROTEIN
MVPVETLHSGDPITDVNGGGQRYIVLESKTVGDSCVVLELESRVNHQLQV
IEKSFPAGYHVGRAHHRIL
>Rv1943c CONSERVED HYPOTHETICAL PROTEIN
MKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWHSSGMNR
IRLSTTVDAALLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAA
YDKHPVDEPDEWGDLASWRRAAGDS
>Rv2665 HYPOTHETICAL ARGININE RICH PROTEIN
MIVVRTAEAAEQALTEGQLVCPRRGCGDTLRRWRYGRRRHVRSLGSQVID
VRPQRVRCRRCESTHVLLPAALQPRLGRGGGGQLRPGVWCTGR
>Rv1100 CONSERVED HYPOTHETICAL PROTEIN
MVGDCPRSRTVRWSWDTGHVTAEPQPTPRPAKPRLLQDGRDMFWSLAPLV
VGCILLAGLVGMCSFQLGGTKRGPIPSYDAAQALRADAKTLGFPIRLPQL
PGGWTPNSGGRGGIENGRADPATGQRRNAATSIVGFISPTGRYLSLTQSN
ADEDKLVGSIHPSMYPTGTVDVGGTRWVVYEGSDENGAVEPVWTTRLTGP
GGATQLAITGAGSIDQFRTLASATQSQPPLPAR
>Rv1362c POSSIBLE MEMBRANE PROTEIN
MTDDVRDVNTETTDATEVAEIDSAAGEAGDSATEAFDTDSATESTAQKGQ
RHRDLWRMQVTLKPVPVILILLMLISGGATGWLYLEQYRPDQQTDSGAAR
AAVAAASDGTIALLSYSPDTLDQDFATARSHLAGDFLSYYDQFTQQIVAP
AAKQKSLKTTAKVVRAAVSELHPDSAVVLVFVDQSTTSKDSPNPSMAASS
VMVTLAKVDGNWLITKFTPV
>Rv3169 CONSERVED HYPOTHETICAL PROTEIN
MPQMLGPLDEYPLHQLPQPIAWPGSSDRNFYDRSYFNAHDRTGNIFLITG
IGYYPNLGVKDAFVLIRRADIQTAVHLSDAIDSDRLHQHVNGYRVEVVEP
LRKLRIVLDETEGVAADLTWEGLFDVVQEQPHVLRSGNRVTLDAQRFAQL
GTWSGRIVVDGERIAVDPATWLGSRDRSWGIRPVGEPEPAGRPADPPFEG
MWWLYVPLAFDDFAVVLIIQEEPDGFRSLNDCTRIWRDGHVEQLGWPRVR
IHYRSGTRIPTGATIEASTPDGAPVHFDVESKLAVPTHVGGGYGGDSDWS
HGMWKGEKFVERRTYDMTDPTIIARAGFGVIDHVGRALCRDGDGNPVQGW
GLFEHGALGRHDPSGFADWSTLAP
>Rv3081 CONSERVED HYPOTHETICAL PROTEIN
MTPHYRQAAASRLDTHRTQKLRSQTNGGKDRHQLTYEQFARMLTLMGPSD
LWTVERAARHWGVSASRARAILSSRHIHRVSGYPAQAIKAVTLRQGARTD
LKTANHLVPAAQAFTMAETGAAIGETEDERARLRIFFEFLRGADETGTSA
LDLIVDEPALIGEHRFDALLAAAAEYISARWGRPGPLWSVSIERFLDTAW
WVSDLPSARAFAAVWTPAPFRRRGIYLDRHDLTSDGVCVMPEPVFNRTEL
QRAFTALAAKLERRGVVGQVHVVGGAAMLLAYNSRVTTRDIDALFSTDGP
MLEAIREVADEMGWPRTWLNNQASGYVSRTPGEGAPVFDHPFLHVVATPA
QHLLAMKVVAARGVRDGEDIRLLLDRLRITSAAGVWEIVARYFPAETITD
RSRLLVEDLLNQ
>Rv1592c CONSERVED HYPOTHETICAL PROTEIN
MVEPGNLAGATGAEWIGRPPHEELQRKVRPLLPSDDPFYFPPAGYQHAVP
GTVLRSRDVELAFMGLIPQPVTATQLLYRTTNMYGNPEATVTTVIVPAEL
APGQTCPLLSYQCAIDAMSSRCFPSYALRRRAKALGSLTQMELLMISAAL
AEGWAVSVPDHEGPKGLWGSPYEPGYRVLDGIRAALNSERVGLSPATPIG
LWGYSGGGLASAWAAEACGEYAPDLDIVGAVLGSPVGDLGHTFRRLNGTL
LAGLPALVVAALQHSYPGLARVIKEHANDEGRQLLEQLTEMTTVDAVIRM
AGRDMGDFLDEPLEDILSTPEISHVFGDTKLGSAVPTPPVLIVQAVHDYL
IDVSDIDALADSYTAGGANVTYHRDLFSEHVSLHPLSAPMTLRWLTDRFA
GKPLTDHRVRTTWPTIFNPMTYAGMARLAVIAAKVITGRKLSRRPL
>Rv2520c POSSIBLE CONSERVED MEMBRANE PROTEIN
MVDRDPNTIKQEIDQTRDQLAATIDSLAERANPRRLADDAKTRVIAFLRK
PIVTVSLVGIGSVVVVVVIHKIRNR
>Rv1973 POSSIBLE CONSERVED MCE ASSOCIATED MEMBRANE PROTEIN
MSWSRVIAYGLLPGLALALTCGAGLLKWQDGAVRDAAVARAESVRAATDG
TTALLSYRPDTVQHDLESARSRLTGTFLDAYTQLTHDVVIPGAQQKQISA
VATVAAAASVSTSADRAVVLLFVNQTITVGKDAPTTAASSVRVTLDNING
RWLISQFEPI
>Rv3437 POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MVGRAVPSPNRRYRRVWPPRTKGQHLSNPYAQHQLKLIRHTGALILWQQR
TYVVSGTREQCEAAYKSAQTYNLLVGWWSLVSLLAMNWIALISNFNAIRR
VRAAADGASVPHGPHAIAHPAVPRGPIPAGWYPDPSGAGLRYWDGATWTH
WTHPPRHR
>Rv3705c CONSERVED HYPOTHETICAL PROTEIN
MRIAAAVVSIGLAVIAGFAVPVADAHPSEPGVVSYAVLGKGSVGNIVGAP
MGWEAVFTRPFQAFWVELPACNNWVDIGLPEVYDDPDLASFNGATTQTSA
TDQTHLVKQAVGVFASNDAADRAFHRVVDRTVGCSGQTTAIHLDDGTTQV
WSFAGGPSTGTDEAWTKQEAGTDRRCFVQTRLRENVLLQAKVCQSGNAGP
AVNVLAGAMQNTLG
>Rv3843c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MIQVCSQCGTGWNVRERQRVWCPRCRGMLLAPLADMPAEARWRTPARPQV
PTASDTRRTPPRLPPGFRWIAVRPGAAPPPRHGPRLRGPTPRYAGIPRWG
LTDHVDQAPVPASAKAGPSPAAVRTTLLVSLLVFSIAVVVFVVRYVLLVI
NRNTLLNSVVASASVWLGVLVSLAAIAAAGTTIVLLVRWLVARRAAAFMH
QGLPERRSARELWAGCLLPMVNLLWAPLYVIELALVEDRYTRLRRPIVVW
WIVWIVSNAISMFAFATSWVTDAQGIANNTTMMVLAYLCAAAAVAAAARV
FEGFEQKPVERPAHRWVVVNTDGRSAPASSVAVELDGQEPAA
>Rv0059 HYPOTHETICAL PROTEIN
MITRYKPESGFVARSGGPDRKRPHDWIVWHFTHADNLPGIITAGRLLADS
AVTPTTEVAYNPVKELRRHKVVAPDSRYPASMASDHVPFYIAARSPMLYV
VCKGHSGYSGGAGPLVHLGVALGDIIDADLTWCASDGNAAASYTKFSRQV
DTLGTFVDFDLLCQRQWHNTDDDPNRQSRRAAEILVYGHVPFELVSYVCC
YNTETMTRVRTLLDPVGGVRKYVIKPGMYY
>Rv1269c CONSERVED PROBABLE SECRETED PROTEIN
MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSW
GRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGP
TLAAAMKDALTKLGGGYIDTWACN
>Rv2114 HYPOTHETICAL PROTEIN
MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALD
PQALATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATV
LLSSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVD
PNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQDDLTN
SYSLVTA
>Rv2183c CONSERVED HYPOTHETICAL PROTEIN
MSGAHTDVRPELRKLAQAILDGIDPAVRVAAAMASGGGPGTGKCQQVWCP
LCALAALVTGEQHPLLTVIADHSLALLEVIRAIVDDIDRSAKPPPEGPPG
GGQTGASGGENTNGEGSMKSHYQAIPVTIEE
>Rv3231c CONSERVED HYPOTHETICAL PROTEIN
MTQVYIPATLAMLQRLVADGALWPVNGTAFAVTPTLRESYAEGDDEELAE
VALREAALASLRLLAADIGATADALPPRRAVLAAEVDDATYRPDLDDAVV
RLAGPITIDQVVAAYVDNAGAEPAVMAAIAVIDAADLGDEDAELVVGDAQ
DHDLAWYANQELPFLLDLL
>Rv2285 CONSERVED HYPOTHETICAL PROTEIN
MKLLSPLDQMFARMEAPRTPMHIGAFAVFDLPKGAPRRFIRDLYEAISQL
AFLPFPFDSVIAGGASMAYWRQVQPDPSYHVRLSALPYPGTGRDLGALVE
RLHSTPLDMAKPLWELHLIEGLTGRQFAMYFKAHHCAVDGLGGVNLIKSW
LTTDPEAPPGSGKPEPFGDDYDLASVLAAATTKRAVEGVSAVSELAGRLS
SMVLGANSSVRAALTTPRTPFNTRVNRHRRLAVQVLKLPRLKAVAHATDC
TVNDVILASVGGACRRYLQELGDLPTNTLTASVPVGFERDADTVNAASGF
VAPLGTSIEDPVARLTTISASTTRGKAELLAMSPNALQHYSVFGLLPIAV
GQKTGALGVIPPLFNFTVSNVVLSKDPLYLSGAKLDVIVPMSFLCDGYGL
NVTLVGYTDKVVLGFLGCRDTLPHLQRLAQYTGAAFEELETAALP
>Rv0178 PROBABLE CONSERVED MCE ASSOCIATED MEMBRANE PROTEIN
MEDQQSASGDLTQKSVANGESTDTASAATEGHRGEIDAAGEPDERGAAVA
DSQADEDDSAATAARGGKTRARRSRGRRLAITVGVAAALFVGSAAFAGAT
VEPYLSERAVVATKLMVARTAANAITTLWTYTPENMDTLADRAANYLSGD
FAAQYRRFVDQIAAANKQAKITNDTEVTGAAVESLSGRDAVAIVYTNTTT
TSPVTKNIPALKYLSYRLFMKRYDARWLVTRMTTITSLDLTPQV
>Rv1322 CONSERVED HYPOTHETICAL PROTEIN
MARRRKPLHRQRPEPPSWALRRVEAGPDGHEYEVRPVAAARAVKTYRCPG
CDHEIRSGTAHVVVWPTDLPQAGVDDRRHWHTPCWANRATRGPTRKWT
>Rv2843 PROBABLE CONSERVED TRANSMEMBRANE ALANINE RICH PROTEIN
MLRAAPVINRLTNRPISRRGVLAGGAALAALGVVSACGESAPKAPAVEEL
RSPLDQARHDGALAAAAATAIGIPPQVAAALTVVATQRTSHARALATEIA
RAAGKLVSATSETSSSSPSPTDPAAPPPAVSDVIDSLRTSAGEASRLVAT
TSGYRAGLLASIAASCTASYTVALVPSGPSI
>Rv3126c HYPOTHETICAL PROTEIN
MVIRFDQIGSLVLSMKSLASLSFQRCLRENSSLVAALDRLDAAVDELSAL
SFDALTTPERDRARRDRDHHPWSRSRSQLSPRMAHGAVHQCQWPKAVWAV
IDNP
>Rv0283 POSSIBLE CONSERVED MEMBRANE PROTEIN
MTNQQHDHDFDHDRRSFASRTPVNNNPDKVVYRRGFVTRHQVTGWRFVMR
RIAAGIALHDTRMLVDPLRTQSRAVLMGVLIVITGLIGSFVFSLIRPNGQ
AGSNAVLADRSTAALYVRVGEQLHPVLNLTSARLIVGRPVSPTTVKSTEL
DQFPRGNLIGIPGAPERMVQNTSTDANWTVCDGLNAPSRGGADGVGVTVI
AGPLEDTGARAAALGPGQAVLVDSGAGTWLLWDGKRSPIDLADHAVTSGL
GLGADVPAPRIIASGLFNAIPEAPPLTAPIIPDAGNPASFGVPAPIGAVV
SSYALKDSGKTISDTVQYYAVLPDGLQQISPVLAAILRNNNSYGLQQPPR
LGADEVAKLPVSRVLDTRRYPSEPVSLVDVTRDPVTCAYWSKPVGAATSS
LTLLAGSALPVPDAVHTVELVGAGNGGVATRVALAAGTGYFTQTVGGGPD
APGAGSLFWVSDTGVRYGIDNEPQGVAGGGKAVEALGLNPPPVPIPWSVL
SLFVPGPTLSRADALLAHDTLVPDSRPARPVSAEGGYR
>Rv1987 POSSIBLE CHITINASE
MAGLNIYVRRWRTALHATVSALIVAILGLAITPVASAATARATLSVTSTW
QTGFIARFTITNSSTAPLTDWKLEFDLPAGESVLHTWNSTVARSGTHYVL
SPANWNRIIAPGGSATGGLRGGLTGSYSPPSSCLLNGQYPCT
>Rv0590A MCE-FAMILY RELATED PROTEIN
MLHSSFGHLEGIQQPLIDELAELDHVLGKLPDAYRIIGRAGGIYGDFFNF
YLCDISLKVNGLQPGGPVRTVKLFGQPTGRCTPQ
>Rv2132 CONSERVED HYPOTHETICAL PROTEIN
MRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVANRFQQQ
TYDMGEGIDYSNIGDAIETLDGPASG
>Rv3481c PROBABLE INTEGRAL MEMBRANE PROTEIN
MRGLLPVAGHWVSVLTGLVPLALVIALSPLSVIPAVLVVHSPQPRPSSLA
FLGGWLLGLAVVTAVFVAASGALGGLSTTSPAWASWLRVVLGSALIVFGV
LRWLTRHRHTEMPGWMRAFASFTPARAGLVGAVLVVVRPEVLIICAAAGL
AIGSGGHGAAGSWIYTAFFAMLAASTVAIPILAYVAAGDRLDDSLERLKD
WMEKNHAGMVAAILVVIGLLLLYNGVHAM
>Rv1211 CONSERVED HYPOTHETICAL PROTEIN
MLGADQARAGGPARIWREHSMAAMKPRTGDGPLEATKEGRGIVMRVPLEG
GGRLVVELTPDEAAALGDELKGVTS
>Rv3654c CONSERVED HYPOTHETICAL PROTEIN
MVARHRAQAAADLASLAAAARLPSGLAAACARATLVARAMRVEHAQCRVV
DLDVVVTVEVAVAFAGVATATARAGPAKVPTTPG
>Rv0810c CONSERVED HYPOTHETICAL PROTEIN
MGRGRAKAKQTKVARELKYSSPQTDFQRLQRELSGTGTDRLDGDGPSDDD
SWNDEDDWRR
>Rv1417 POSSIBLE CONSERVED MEMBRANE PROTEIN
MTAAPNDWDVVLRPHWTPLFAYAAAFLIAVAHVAGGLLLKVGSSGVVFQT
ADQVAMGALGLVLAGAVLLFARPRLRVGSAGLSVRNLLGDRIVGWSEVIG
VSFPGGSRWARIDLADDEYIPVMAIQAVDKDRAVAAMDTVRSLLARYRPD
LCAR
>Rv1733c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MIATTRDREGATMITFRLRLPCRTILRVFSRNPLVRGTDRLEAVVMLLAV
TVSLLTIPFAAAAGTAVQDSRSHVYAHQAQTRHPATATVIDHEGVIDSNT
TATSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRVGIWVDSAGQL
VDEPAPPARAIADAALAALGLWLSVAAVAGALLALTRAILIRVRNASWQH
DIDSLFCTQR
>Rv0057 HYPOTHETICAL PROTEIN
MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMC
LKANTPGAVTWLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLN
TDVDGYAHAMHSSINSGPLEYLPATFSVFPALGDVGDLGGGVGAATYALD
RLSNMRSGACVGGGESPWRSLMT
>Rv2822c HYPOTHETICAL PROTEIN
MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSA
NPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRD
GLLRFCRYMEALAAYKKYLDPKDK
>Rv1065 CONSERVED HYPOTHETICAL PROTEIN
MVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGV
PQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNE
YRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTL
SVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEGSG
>Rv0615 PROBABLE INTEGRAL MEMBRANE PROTEIN
MMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGLLVVTGQ
TLMAISVAFLVALGGPLVVVNHRRAERSRG
>Rv2203 POSSIBLE CONSERVED MEMBRANE PROTEIN
MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADD
AALPPAAYPGVPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTN
GANTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSD
QALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPP
RGQVQGIAQLLFQRGQVLVCSYVLRTAGSY
>Rv1489 CONSERVED HYPOTHETICAL PROTEIN
MSGLTSPKTYAVLAALQAGDAVACAIPLPPIARLLDDLDVPVSVRPVLPV
VKAASAVGLLSVTRFPALARLTTAMLTLYFILAVGAHVRVRDRVVNAIPA
ASFLTLFALMTAKGPERT
>Rv2545 CONSERVED HYPOTHETICAL PROTEIN
MSTTIVAGVIQGHLPVILPTRRRARDLGHTTALFRAQTLQCIYLSIEYLY
VCSMSRRTTIDIDDILLARAQAALGTTGLKDRVDAALRAAVR
>Rv2274c HYPOTHETICAL PROTEIN
MSIARSAQPIGWISCPPKGGSSCCRCGGGYTHIFCVSAWTGLVVDLQAEQ
VRSVVTERLRRRIGRGAPILAGTLAPGVGLAAQNREFRQFTGRSAPPSAT
IAFGE
>Rv2546 CONSERVED HYPOTHETICAL PROTEIN
MVFCVDTSAWHHAARPEVARRWLAALSADQIGICDHVRLEILYSANSATD
YDALADELDGLARIPVGAETFTRACQVQRELAHVAGLHHRSVKIADLVIA
AAAELSGTIVWHYDENYDRVAAITGQPTEWIVPRGTL
>Rv2910c CONSERVED HYPOTHETICAL PROTEIN
MCAVLDRSMLSVAEISDRLEIQQLLVDYSSAIDQRRFDDLDRVFTPDAYI
DYRALGGIDGRYPKIKQWLSQVLGNFPVYAHMLGNFSVRVDGDTASSRVI
CFNPMVFAGDRQQVLFCGLWYDDDFVRTPDGWRIIRRVETKCFQKMM
>Rv2240c HYPOTHETICAL PROTEIN
MGQIVAGEIGGQRTTPVGGGLPLACCLDGRPPIVPHRRRRRIAALRSVLR
MRDTPRPARSRCDQVTSHAVLIGWRAVPRRHGGELPRRGALALGCIALLL
MGIVGCTTVTDGTAMPDTNVAPAYRSSVSASVSASAATSSIRESQRQQSL
TTKAIRTSCDALAATSKDAIDKVNAYVAAFNQGRNTGPTEGPAIDALNNS
ASTVSGSLSAALSAQLGDALNAYVDAARAVANAIGAHASTAEFNRRVDRL
NDTKTKALTMCVAAF
>Rv3378c HYPOTHETICAL PROTEIN
MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDD
YQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMAL
LANDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSS
NTEHRLCFGVFGNDAAESVAQFSISWNETHGKPPTRREIIEGYYGEYVDK
ADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTLRRILYDHIYL
RHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG
>Rv2807 CONSERVED HYPOTHETICAL PROTEIN
MVSTTGMGRSTARRMLTGPGLPEPAEQVDGRRLRARGFSDDARALLEHVW
ALMGMPCGKYLVVMLELWLPLEAAAGDLDKPFATEAAVAELKAMSAATVD
RYLKPARERMRIKGISTTKPSPLLRNSITIHTCSDEAPKVPGVIEADTVA
HCGPSLIGEFARTLTMTDLVTGWTENASIRNNAAKWILEGIKECQQRFPF
PMTVFDSDCGGEFINHDVAGWLQARDIAQTRSRPYQKNDQAHVESKNNHV
VRKHAFYWRYDTGEELELLNRLWPLVSLRCNFFTPTKKPVGYTSTVNGRR
KRIYDKPATPWQRLQASGVLDAQQLSTVAARIEGFNPADLTRQINAIQMQ
LLDLAKTKTEALATARHIDLQSLQPSINRLAKAK
>Rv3902c HYPOTHETICAL PROTEIN
MTIGVDLSTDLQDWIRLSGMNMIQGSETNDGRTILWNKGGEVRYFIDRLA
GWYVITSSDRMSREGYEFAAASMSVIEKYLYGYFGGSVRSERELPAIRAP
FQPEELMPEYSIGTMTFAGRQRDTLIDSSGTVVAITAADRLVELSHYLDV
SVNVIKDSFLDSEGKPLFTLWKDYKG
>Rv2493 CONSERVED HYPOTHETICAL PROTEIN
MRTTLDLDDDVIAAARELASSQRRSLGSVISELARRGLMPGRVEADDGLP
VIRVPAGTPPITPEMVRRALDED
>Rv1972 PROBABLE CONSERVED MCE ASSOCIATED MEMBRANE PROTEIN
MSVAVDSDAEDDAVSEIAEAAGVSPAPAKPSMSAPRRMLLFGLVVVVALA
VLLCCWGFRVQRARHAQDQRGHFLQAARQCALNLTTIDWRNAEADVRRIL
DGATGEFYNDFAQRSQPFVEVLRHAKASTVGTITEAGLQTQTADTAQALV
AVSVQTSNAGEADPVPRAWRMRITVQRVGDRVKVSDVGFVP
>Rv1588c Partial REP13E12 repeat protein
MLANSREELVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLECLVRRL
PAVGHALINQLDAQASEEELGGTLCCALANRLRITKPDAARRIADAADLG
PRRALTGEPLAPQLTATATAQRQGLIGEAHVKVIRALFRPPARRGGCVHP
PGRRSRPGRQSRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPE
QPAIRRHVTAKWLPDPPSAGHL
>Rv0412c POSSIBLE CONSERVED MEMBRANE PROTEIN
MTVELAHPSTEPLGSRSPAEPAHPRRWFISTTPGRIMTIGIVLAALGVAS
AFATSTTIEHRQQVLTAVLDHTEPLSFAAGRLYTTLSVADAAAATAFIAQ
AEPGGVRLRYEQAITDASVAVTRASSGLTDESLVQLLGRINAELAVYTGL
VEIARANNRAGNPVGSSYLSEASGLMQSTILPDAQRLYQATSARVDRETT
ASTQIPAPVILVVATTVVFGAFAHRWLARRTRRRINPGLVVGALGILVMV
VWVGTALTISTTASRSAKDTAAESLKTITNLAITAQQARADETLSLIRRG
DEEVRKQAFYQRIDAMQRQLNDYMARRHAVDKPDLQGADQLLVRWRQAND
RINSDISVGNYRAATQVALGKGEDDATPAFDKLDEALTKAMGQSRTQLRH
DILNAHRGLAGAQVGGVVLSLGAAIAVALGLWPRLKEYR
>Rv0219 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MFDIATRFKNSYGSGPLHLLAMVSGFALLGYIVATARPSALWNQATWWQS
IAVWFVAAVVAHDLLLYPLYALADRILARLVGRRDVSAPRRRPELPVRNY
IRIPALAAGLTLLVFLPGIIRQGAPTYLDATGQTQEPFLGRWLLLTAVAF
GISAAAYAIRLVVAHVRRRRAGCSRVDAIDEE
>Rv3312A SECRETED PROTEIN ANTIGEN
MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPA
WGPNWDPYTCHDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAG
GGA
>Rv0607 HYPOTHETICAL PROTEIN
MGAWQTADTMGIFQALPDVWGGWRTECWEDRFEEQLIRCNGALRLPELDL
AAGMDSAREWLRDRIFQRFSDSPAGQILKLSELLADVGPGLVVSDDAVTN
GGARPNNEEWARFVAACDLVRGAHAESA
>Rv0347 PROBABLE CONSERVED MEMBRANE PROTEIN
MPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVS
FLVIVLVIMNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEEL
ARWTPILEQEEVRQVNLETGEHTAHSQKKLVARDRRTAITFRPDAMTLEV
TDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLRYINEIRASLAEPSGW
AYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLTLRYAGARGA
VIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAER
LHTPIGPLFESLITSELRTKVLQQPGQE
>Rv0360c CONSERVED HYPOTHETICAL PROTEIN
MTKRTITPMTSMGDLLGPEPILLPGDSDAEAELLANESPSIVAAAHPSAS
VAWAVLAEGALADDKTVTAYAYARTGYHRGLDQLRRHGWKGFGPVPYSHQ
PNRGFLRCVAALARAAAAIGETDEYGRCLDLLDDCDPAARPALGL
>Rv1775 CONSERVED HYPOTHETICAL PROTEIN
MASDLYLGYRNDDADTPFGKFFKPEMAPLPQHVVVALQHGPQAGMALLAF
DDAASIVDEGYQQTENGYGILGDGSMQVSVRTDMPGVTPAMWAWWFGWHG
SDTRRYKLWHPRAHLSARWKDGDQDSGAGRRGAQRYVGRWSMISEYIGST
KLGAAIQFVEPAAMGLPDDSDDTVSICARLGSADAPVDAGWFVHQVRSTP
GGSEMRSRFWMGGPHIAVRKAPEVASKAVRPIASKLIGVSESTARNLLVY
CAQEMNHLAGFLADLWESFGDE
>Rv2239c CONSERVED HYPOTHETICAL PROTEIN
MPIATVCTWPAETEGGSTVVAADHASNYARKLGIQRDQLIQEWGWDEDTD
DDIRAAIEEACGGELLDEDTDEVIDVVLLWWRDGDGDLVDTLMDAIGPLA
EDGVIWVVTPKTGQPGHVLPAEIAEAAPTAGLMPTSSVNLGNWSASRLVQ
PKSRAGKR
>Rv1051c CONSERVED HYPOTHETICAL PROTEIN
MRADVTAEHLTQVVRDIAVIDIDDGVAFNLDTSSVQEIRERADYPGLRVR
VAMSVGPWQGIAAWDVSTGEPIAPWPTRVTIDRILGEPITLLGYAPETII
AEKGVTILERGITSTRWRDYVDIVQLDRRGIDDDELLRSARAVAQYRGAT
LEPVAPHLAGYGAVAQAKWATEHGRCQHCWRHWKPAHVGRRNMDLLDAKQ
VSEMIGVPVGTLRHWRHSDIGPASFTLGRRVVYRRDEVSRWISKRESATR
R
>Rv2084 HYPOTHETICAL PROTEIN
MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPL
VIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRAL
LIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCNS
VCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAA
GELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVT
DGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN
GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGREST
PSDDGGSLHPSGRPRRVHRRRWCGLGLC
>Rv2798c CONSERVED HYPOTHETICAL PROTEIN
MFQISPEQWMHSAAQVTTQGEGLAVGHLSSDYRMQAAQFGWQGASAMALN
AKMDDWLDASRALLTRIGDHAFGLQEAAIQHAAAEAERAQALAQVGVSAD
VVAGPRGV
>Rv0569 CONSERVED HYPOTHETICAL PROTEIN
MKAKVGDWLVIKGATIDQPDHRGLIIEVRSSDGSPPYVVRWLETDHVATV
IPGPDAVVVTAEEQNAADERAQHRFGAVQSAILHARGT
>Rv1581c Probable phiRv1 phage protein
MTAVAITPASGGRHSVRFAYDSAIVSLIKSTIPAYARSWSAHTRCWFIDA
DWTPLLAAELRYHGHTVTGPADPAQQQCTDWAKALFRAVGPQRTPAVYRA
LSKVLHPDAPTGCPILQQQLNAARTALTNPA
>Rv0883c CONSERVED HYPOTHETICAL PROTEIN
MRELKVVGLDADGKNIICQGAIPSEQFKLPVDDRLRAALRDDSVQPEQAQ
LDIEVTNVLSPKEIQARIRAGASVEQVAAASGSDIARIRRFAHPVLLERS
RAAELATAAHPVLADGPAVLTMQETVAAALVARGLNPDSLTWDAWRNEDS
RWTVQLAWKAGRSDNLAHFRFTPGAHGGTATAIDDTAHELINPTFNRPLR
PLAPVAHLDFDEPEPAQPTLTVPSAQPVSNRRGKPAIPAWEDVLLGVRSG
GRR
>Rv2160c CONSERVED HYPOTHETICAL PROTEIN
MGRIPGTRRAGGCFFAAAAADVDSQPGPVRDRIAATGRAGIAAITADVET
AQRRGEIRADIEVRQLAFELHAYAMEANWALLLLDDDGAGERARTAIDAA
LARVGTTQEGVES
>Rv3912 HYPOTHETICAL ALANINE RICH PROTEIN
MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQA
QQILRALNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGA
AHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHIT
VSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPAS
TPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTV
VPRA
>Rv2732c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MMSHEHDAGDLDALRAEIEAAERRVAREIEPGARALVVAILVFVLLGSFI
LPHTGSVRGWDVLFSSHGAGRAAVALPSRVFAWLALVFGVGFSMLALLTR
RWALAWVALAGSAMASGTGLLAVWSRQTVAAGHPGPGIGLIVAWITAIVL
TFHWAQVVWSRTIVQLAAEERRRRVVAQQQCKTLLDHVQTDSEAGTTPDR
GTDR
>Rv3822 
MKCPGVSDCVATVRHDNVFAIAAGLRWSAAVPPLHKGDAVTKLLVGAIAG
GMLACAAILGDGIASADTALIVPGTAPSPYGPLRSLYHFNPAMQPQIGAN
YYNPTATRHVVSYPGSFWPVTGLNSPTVGSSVSAGTNNLDAAIRSTDGPI
FVAGLSQGTLVLDREQARLANDPTAPPPGQLTFIKAGDPNNLLWRAFRPG
THVPIIDYTVPAPAESQYDTINIVGQYDIFSDPPNRPGNLLADLNAIAAG
GYYGHSATAFSDPARVAPRDITTTTNSLGATTTTYFIRTDQLPLVRALVD
MAGLPPQAAGTVDAALRPIIDRAYQPGPAPAVNPRDLVQGIRGIPAIAPA
IAIPIGSTTGASAATSTAAATAAATNALRGANVGPGANKALSMVRGLLPK
GKKH
>Rv2405 CONSERVED HYPOTHETICAL PROTEIN
MQRFAENLVFTEAPKLVRHLQNTQETLRTIRQAVKITANIMTTAVPSPPA
EIAAGRPVTSTSCPTAARARRLVYAPDLDGRADPGEIVWTWVAYEQDPTR
GKDRPVLVVGRDRSVLLGLLVSSQERHAADRDWVGIGSGAWDYEGRESWV
RLDRVLDVPEESIRREGAILEREVFDVVAARLRADYAWR
>Rv3779 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN ALANINE AND LEUCINE RICH
MGLWFGTLIALILLIAPGAMVARIAQLRWPVAIAVGPALTYGVVALAIIP
YGALGIPWNGWTALAALAVTCAVATGLQLLLARFRDLDAEALAVSRWPAV
TVAAGVLLGALLIGWAAYRGIPHWQSIPSTWDAVWHANTVRFILDTGQAS
STHMGELRNVETHAPLYYPSVFHGLVAVFCQLTGAAPTTGYTLSSLAASV
WLFPVSAAVLTWRAVRSHPGALWSASCASAEWRAAGAAGTAAALSASFTA
VPYVEFDTAAMPNLAAYGIAVPTMVLITSTLRHRDRIPVAVLALVGVFSL
HITGGIVVALLVSAWWLFEALRHPVRSRLADLLTLAGVAAMAGLVMLPQF
LSVRQQEDIIAGHAFPTYLSKKRGLFDAVFQHSRHLNDFPVQYALIVLAA
IGGLILLVKKIWWPLAVWLLLIVMNVDAGTPLGGPIGGVAGALGEFFYHD
PRRIAAATTLLLMLMAGVALFATVMLLVAAAKRLTDRFRPQPVSVWASAT
ATLLIGATLVSAWHYFPRHRFLFGDKYDSVMIDQKDLDAMAYLASLPGAR
DTLIGNANTDGTAWMYAVAGLHPLWTHYDYPLQQGPGYHRFIFWAYGRNG
ESDPRVLEAIQVLRIRYILTSTPTVRGFAVPDGLVSLETSRSWAKIYDNG
EARIYEWRGTAAATHS
>Rv0612 CONSERVED HYPOTHETICAL PROTEIN
MLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKAL
WNLYWRGTANMRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSL
ARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAEDAEPAASALEQ
LIDLAREADGYDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAIP
P
>Rv2134c CONSERVED HYPOTHETICAL PROTEIN
MARAIHVFRTPDRFVAGTVGQPGNRTFYLQAVHDSRVVSVVLEKQQVAVL
AERIGALLFEVNRRFGTPVPPEPTEIDDLSPLIMPVDAEFRVGTMGLGWD
SEAQSVVVELLAVTDAEFDASVVLDDTEEGPDAVRVFLTPESARQFATRS
YRVISAGRPPCPLCDEPLDPEGHICARTNGYRRDVLLGSGDDPAG
>Rv1961 HYPOTHETICAL PROTEIN
MFLPTNAQYQLLVVGVSPWDTPSPSGRISWGSAWPHQARRAQTCQRVRRH
WMIDTTEAAYRLTYQPDGTSITVRENLVDILARELLGPIRGPQEVLPFSP
RSQYLVGHLAPVKLTGAALIDDNAVQARANAEALAEGGGVPAYAADETTP
TPTTTPKTAHPSRA
>Rv3491 HYPOTHETICAL PROTEIN
MNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGV
TWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQN
FSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTV
HGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYPLYCNNIAMCY
>Rv2304c HYPOTHETICAL PROTEIN
MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFEQAFAVE
RDAGFDDFLHGPVGPRSTP
>Rv2418c HYPOTHETICAL PROTEIN
MSSRRGRRPALLVFADSLAYYGPTGGLPADDPRIWPNIVASQLDWDLELI
GRIGWTCRDVWWAATQDPRAWAALPRAGAVIFATGGMDSLPSVLPTALRE
LIRYVRPSWLRRWVRDGYAWVQPRLSPVARAALPPHLTAEYLEKTRGAID
FNRPGIPIIASLPSVHIAETYGKAHHGRAGTVAAITEWAQHHDIPLVDLK
AAVAEQILSGYGNRDGIHWNFEAHQAVAELMLKALAEAGVPNEKSRG
>Rv1797 CONSERVED HYPOTHETICAL PROTEIN
MKAQRSFGLALSWPRVTAVFLVDVLILAVASHCPDSWQADHHVAWWVGVG
VAAVVTLLSVVSYHGITVISGLATWVRDWSADPGTTLGAGCTPAIDHQRR
FGRDTVGVREYNGRLVSVIEVTCGESGPSGRHWHRKSPVPMLPVVAVADG
LRQFDIHLDGIDIVSVLVRGGVDAAKASASLQEWEPQGWKSEERAGDRTV
ADRRRTWLVLRMNPQRNVAAVACRDSLASTLVAATERLVQDLDGQSCAAR
PVTADELTEVDSAVLADLEPTWSRPGWRHLKHFNGYATSFWVTPSDITSE
TLDELCLPDSPEVGTTVVTVRLTTRVGSPALSAWVRYHSDTRLPKEVAAG
LNRLTGRQLAAVRASLPAPTHRPLLVIPSRNLRDHDELVLPVGQELEHAT
SSFVGQ
>Rv1580c Probable phiRv1 phage protein
MAETPDHAELRRRIADMAFNADVGMATCKRCGDAVPYIILPNLQTGEPVM
GVADNKWKRANCPVDVGKPCPFLIAEGVADSTDDTIEVDQ
>Rv3311 CONSERVED HYPOTHETICAL PROTEIN
MVADLVPIRLSLSAGDRYTLWAPRWRDAGDEWEAFLGKDDDLYGFESVSD
LVAFVRTDTENDLVDHPAWQDLTGAHAHNLNPAEDNQFDLVVVEELLAEK
PTAESVAALAASLAIVSAIGSVCELAAVSKFFNGNPILGTVSGGLEHFTG
KAGNKRWNSIAEVIGRSWDDVLAAIDEIISTPEVDAELSEKVAEELAEEP
EGAEEVAAEVEATQDTQEAAESDDEEADAPGDSVVLGGDRDFWLQVGIDP
IQIMTGTATFYTLRCYLDDRPIFLGRNGRISVFGSERALARYLADEHDHD
LSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFADGPDAVDREQL
DLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDPHSVGKPTAPYA
AAVREWEKLERFVESRLRRE
>Rv1424c POSSIBLE MEMBRANE PROTEIN
MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPPAEKLVF
PVLLGILTLLLSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTY
DSKLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYC
LKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRS
ANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCV
APK
>Rv0863 CONSERVED HYPOTHETICAL PROTEIN
MCSVIADQRRPDQPCGVGGCKTCQNGFVADIAEGKARKTRYVDHGWPTTD
PDDHAVSELVTDRTGALSPFGELTFPVPSDDLPYIHPVTVINR
>Rv0543c CONSERVED HYPOTHETICAL PROTEIN
MNRFLTSIVAWLRAGYPEGIPPTDSFAVLALLCRRLSHDEVKAVANELMR
LGDFDQIDIGVVITHFTDELPSPEDVERVRARLAAQGWPLDDVRDREEHA
>Rv0544c POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MSAWFNYTATLKILIFSLLAGALLPGLFAVGVRLQAAGDGADATARRRPL
LVAVSWAIFALVLAVVIIGVLYIARDFIAHHTGWAFLGATPK
>Rv0313 CONSERVED HYPOTHETICAL PROTEIN
MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFE
DLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKD
NTDPKRKVRFLPYGIAVSVLDDPVDEAQ
>Rv1742 HYPOTHETICAL PROTEIN
MSALLDGVLDAHGGLQRWRAAETVHGRVRTGGLLLRTRVPGNRFADYRIT
VHVQQARTVLDPFPRDGYRGVFESGQVRIESHDGAVISSRAHPRAAFFGR
SGLRRNIRWDPLDSVYFAGYAMWNYLTTPYLLTREGVAVEEGAPWQQEGE
TWRRLIVSFPPDIDTHSPRQTFYVDASGLLRRHDYVPEVVGHWARAAHYC
ADPVDVDGFVFPTCRWVHPIGPGNRSLPFPTLVSILLTDIRVETD
>Rv3448 PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MPTSDPGLRRVTVHAGAQAVDLTLPAAVPVATLIPSIVDILGDRGASPAT
AARYQLSALGAPALPNATTLAQCGIRDGAVLVLHKSSAQPPTPRCDDVAE
AVAAALDTTARPQCQRTTRLSGALAASCITAGGGLMLVRNALGTNVTRYS
DATAGVVAAAGLAALLFAVIACRTYRDPIAGLTLSVIATIFGAVAGLLAV
PGVPGVHSVLVAAMAAAATSVLAMRITGCGGITLTAVACCAVVVAAATLV
GAITAAPVPAIGSLATLASFGLLEVSARMAVLLAGLSPRLPPALNPDDAD
ALPTTDRLTTRANRADAWLTSLLAAFAASATIGAIGTAVATHGIHRSSMG
GIALAAVTGALLLLRARSADTRRSLVFAICGITTVATAFTVAADRALEHG
PWIAALTAMLAAVAMFLGFVAPALSLSPVTYRTIELLECLALIAMVPLTA
WLCGAYSAVRHLDLTWT
>Rv2717c CONSERVED HYPOTHETICAL PROTEIN
MTRDLAPALQALSPLLGSWAGRGAGKYPTIRPFEYLEEVVFAHVGKPFLT
YTQQTRAVADGKPLHSETGYLRVCRPGCVELVLAHPSGITEIEVGTYSVT
GDVIELELSTRADGSIGLAPTAKEVTALDRSYRIDGDELSYSLQMRAVGQ
PLQDHLAAVLHRQR
>Rv0637 CONSERVED HYPOTHETICAL PROTEIN
MALKTDIRGMIWRYPDYFIVGREQCREFARAVKCDHPAFFSEEAAADLGY
DALVAPLTFVTILAKYVQLDFFRHVDVGMETMQIVQVDQRFVFHKPVLAG
DKLWARMDIHSVDERFGADIVVTRNLCTNDDGELVMEAYTTLMGQQGDGS
ARLKWDKESGQVIRTA
>Rv2255c HYPOTHETICAL PROTEIN
MDGIVDRGVRARPCQKVVAVLRRSKSHIDKRLDAATGNAFLGKQVLSAAG
VVEYRPPRRSPLST
>Rv0966c CONSERVED HYPOTHETICAL PROTEIN
MSNSAQRDARNSRDESARASDTDRIQIAQLLAYAAEQGRLQLTDYEDRLA
RAYAATTYQELDRLRADLPGAAIGPRRGGECNPAPSTLLLALLGGFERRG
RWNVPKKLTTFTLWGSGVLDLRYADFTSTEVDIRAYSIMGAQTILLPPEV
NVEIHGHRVMGGFDRKVVGEGTRGVPTVRIRGFSLWGDVGIKRKPRKPRK
>Rv0039c POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MFLAGVLCMCAAAASALFGSWSLCHTPTADPTALALRAMAPTQLAAAVML
AAGGVVAVAAPGHTALMVVIVCIAGAVGTLAAGSWQSAQYALRRETASPT
ANCVGSCAVCTQACH
>Rv3212 CONSERVED HYPOTHETICAL ALANINE VALINE RICH PROTEIN
MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTP
APAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESL
WSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYA
DPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQS
GCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGV
RPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS
TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMA
GQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRG
DTLVALG
>Rv0138 CONSERVED HYPOTHETICAL PROTEIN
MSASEFSRAELAAAFEKFEKTVARAAATRDWDCWVQHYTPDVEYIEHAAG
IMRGRQRVRAWIQETMTTFPGSHMVAFPSLWSVIDESTGRIICELDNPML
DPGDGSVISATNISIITYAGNGQWCRQEDIYNPLRFLRAAMKWCRKAQEL
GTLDEDAARWMRRHGGP
>Rv0061 HYPOTHETICAL PROTEIN
MCADAQPSGSVGLLGRNCPTATTRWRRAGEGLTAADTIEVKLWAGKPRLH
PLVPKRAVGVLLAVAHGQVAKTPSATRAIAFRHVRLMRVRWICAGNRGRK
HKRRCTTQYRSTQASKLQLHFKLRQTLNRLGGLQAMVSACG
>Rv2275 CONSERVED HYPOTHETICAL PROTEIN
MSYVAAEPGVLISPTDDLQSPRSAPAAHDENADGITGGTRDDSAPNSRFQ
LGRRIPEATAQEGFLVRPFTQQCQIIHTEGDHAVIGVSPGNSYFSRQRLR
DLGLWGLTNFDRVDFVYTDVHVAESYEALGDSAIEARRKAVKNIRGVRAK
ITTTVNELDPAGARLCVRPMSEFQSNEAYRELHADLLTRLKDDEDLRAVC
QDLVRRFLSTKVGPRQGATATQEQVCMDYICAEAPLFLDTPAILGVPSSL
NCYHQSLPLAEMLYARGSGLRASRNQGHAIVTPDGSPAE
>Rv1558 CONSERVED HYPOTHETICAL PROTEIN
MPLSGEYAPSPLDWSREQADTYMKSGGTEGTQLQGKPVILLTTVGAKTGK
LRKTPLMRVEHDGQYAIVASLGGAPKNPVWYHNVVKNPRVELQDGTVTGD
YDAREVFGDEKAIWWQRAVAVWPDYASYQTKTDRQIPVFVLTPVRAGG
>Rv1119c HYPOTHETICAL PROTEIN
MTARVAGQAVGGQILVGEPVHDAVSDCADIRFGSYRLFSLDAAPGPDLD
>Rv2811 CONSERVED HYPOTHETICAL PROTEIN
MVTVEADVDQVERRLAAGELSCPSCGGVLAGWGRARSRQLRGPAGPVELC
PRRSRCTGCGVTHVLLPVSALLRRADTAAVIVSALAAKATSRVGFRRIAT
DVARPAETVRGWLRRFAERVEAVRSVFTVWLCAVDADPVMPDAGGGGFVD
AVVAIGALAAAIGRRFSLPTVSLAETAVAVSGGRLLAPGWPGEWVQHEST
LP
>Rv0875c POSSIBLE CONSERVED EXPORTED PROTEIN
MKRGVATLPVILVILLSVAAGAGAWLLVRGHGPQQPEISAYSHGHLTRVG
PYLYCNVVDLDDCQTPQAQGELPVSERYPVQLSVPEVISRAPWRLLQVYQ
DPANTTSTLFRPDTRLAVTIPTVDPQRGRLTGIVVQLLTLVVDHSGELRD
VPHAEWSVRLIF
>Rv3047c HYPOTHETICAL PROTEIN
MGGPFDADAEAHFDEVAEAFAKLTNVDRDVGVDLEKELCMTVEADDRSDA
LVTRRLLPRVPRCIPLAARLAPGTIGCPSFWNPIATGGASRQAL
>Rv1507c CONSERVED HYPOTHETICAL PROTEIN
MKKVAIVQSNYIPWRGYFDLIAFVDEFIIYDDMQYTKRDWRNRNRIKTSQ
GLQWITVPVQVKGRFHQKIRETLIDGTDWAKAHWRALEFNYSAAAHFAEI
ADWLAPIYLEEQHTNLSLLNRRLLNAICSYLGISTRLANSWDYELADGKT
ERLANLCQQAAATEYVSGPSARSYVDERVFDELSIRVTWFDYDGYRDYKQ
LWGGFEPAVSILDLLFNVGAEAPDYLRYCRQ
>Rv3844 POSSIBLE TRANSPOSASE
MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPTLAGLRT
LTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGA
IVGKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVID
ANRSWRRLMSLAR
>Rv1572c CONSERVED HYPOTHETICAL PROTEIN
MECSSAVHGQPRTNTFHHHEKLLRHNDEDNHDDP
>Rv2180c Probable conserved integral membrane protein
MEVFHWLQHDIVDRGRLPLLCCLVAFVLTFLVTRSFVRFIHRRAADGRPA
RWWQPRNVHIGSVHIHHVAFGVVLVMISGLTLVTLSVDGREPEFTIAASI
FGVGAALVLDEYALILHLSDVYWEEDGRTSVDAVFAAVAVAGLLIMGLHP
LIFFLPVRQGANWVVLQTTLIAGLVLTLPLAVVVLLKGKVWTGLLGMFVV
VLLVVGAVRLSRPHAPWARWRYTRHPEKMRRALQRERTWRRPVVRIKLWL
QYVIAGTPRMPDERAVDAQLDQDVRPAPPPERTAPILISGSVWSD
>Rv0455c CONSERVED HYPOTHETICAL PROTEIN
MSRLSSILRAGAAFLVLGIAAATFPQSAAADSTEDFPIPRRMIATTCDAE
QYLAAVRDTSPVYYQRYMIDFNNHANLQQATINKAHWFFSLSPAERRDYS
EHFYNGDPLTFAWVNHMKIFFNNKGVVAKGTEVCNGYPAGDMSVWNWA
>Rv3209 CONSERVED HYPOTHETICAL THREONIN AND PROLINE RICH PROTEIN
MALGAVATAVIINSGDSTSTKAIVGAPAPRTVISTSPRPTAPTSTSPHPS
PSTLRPQLPPETVTTVAPPGTGPTTVPTRTPTAAPPQTAVPPPAPLNPRT
VVYRVTGTKQLFDLVNVVYTDARGFPVTDFNVSLPWTKMVVLNPGVQTES
VVATSLYSRLNCSIVNTGAQTVVASTNNAIIATCTR
>Rv2657c PROBABLE phiRv2 PROPHAGE PROTEIN
MCAFPSPSLGWTVSHETERPGMADAPPLSRRYITISEAAEYLAVTDRTVR
QMIADGRLRGYRSGTRLVRLRRDEVDGAMHPFGGAA
>Rv2186c CONSERVED HYPOTHETICAL PROTEIN
MNSIQIADETYVAADAARVSAAVADRCSWRRWWPDLRLQVTEDRADKGIR
WTVTGALTGTMEIWLEPSMDGVLLHYFLHAEPTGVAAWQLARMNLARMTH
HRRVAGKKMAFEVKTVLERSRPIGVSPVT
>Rv3434c POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MADASVVARLRSWALAVWHFVSNAPLTYAWLVVLVITTIIQNNLTGSQLH
FVLLHRSTNIAELGRDPLEVLFSSLLWIDGRNLEPYLLLFTLFLAPAEHW
LGHLRWLTVGLTAHIGATYLSEGLLYLAIQHRDASERMVHARDIGVSYFL
VGVMAVLTYHIAKPWRWGYLGVLLVIFGFPLIAMDKAELDFTAVGHFASI
LIGLLFYPMARERDGRLWNPARIKSLLHRRGTRGRRA
>Rv0634A HYPOTHETICAL PROTEIN
MGSDCGCGGYLWSMLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTL
LGEADTAEHGHDDEYDEFSDPNAWVPRRSRDTG
>Rv1952 CONSERVED HYPOTHETICAL PROTEIN
MIRNLPEGTKAALRVRAARHHHSVEAEARAILTAGLLGEEVPMPVLLAAD
SGHDIDFEPERLGLIARTPQL
>Rv0076c PROBABLE MEMBRANE PROTEIN
MPAVTTPSNHWGDERRKLSHQPPVRGQILGRRQARRLSQHFARVGVEAPP
KRLQEMLLGAPAADEEWTDVKFALIVTQLNHEKRVAKFHRLQRRATHSLI
CLGLVLVALNFLICLAYIFFSLTQHAAAL
>Rv3635 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MPAPRMPRVALVAVLLITVQLVVRVVLAFGGYFYWDDLILVGRAGTGGLL
SPSYLFDDHDGHVMPGAFLVAGAIIRVAPLVWTGPAISLVVLQLLESLAL
LRALYVISSWRPVLLIPLTFALFTPLAVPGFAWWAAALNSLPMLAALAWV
CADAILLVRTGNHRYAVTGVLVYLGGLLFFEKAAVIPFVSFAVAALQCHV
RGDRSALATVWRAGVRLWTPSLALTVGWVALYLAVVDQRRWSSDLSMTWD
LLCRSVTHGIVPALAGGPWDWARWAPASPWATPPAVVMVLGWLVLIAVLA
LSLVRKRRIGPVWLTAAGYAVACQVPIFLMRSSPFTALELAQTLRYFPDL
VVVLALLAAVALQAPNRAGTRWLDASPARAVATVASAVLFLTSSLYSTAT
FLASWRDNPTEGYLKNAQASLAAAASGAPLLDQEVDPLVLQRVAWPENLA
SHMFALLRVRPEFATTTTQLRMFTSTGRLVDAKVTWVRTIIAGPVPQCGY
FVQPDRPERLILDGPLLPGDWTVELNYLANSDGSMALALSDGPERKVPVH
PGLNRVYARLPGAGDAITVRANTTALSLCIGAAPVGFLAPA
>Rv2307A HYPOTHETICAL GLYCINE RICH PROTEIN
MAFVDLRYPWCRGDGWISPPVVAVALGWAMRRKPFSRFNEYVGSASNTCW
FARALELRTLLIR
>Rv2598 CONSERVED HYPOTHETICAL PROTEIN
MPLHQLAIAPVDVSGALLGLVLNAPAPRPLATHRLAHTDGSALQLGVLGA
SHVVTVEGRFCEEVSCVARSRGGDLPESTHAPGYHLQSHTETHDEAAFRR
LARHLRERCTRATGWLGGVFPGDDAALTALAAEPDGTGWRWRTWHLYPSA
SGGTVVHTTSRWRP
>Rv3632 POSSIBLE CONSERVED MEMBRANE PROTEIN
MNWIQVLLIASIIGLLFYLLRSRRSARSRAWVKVGYVLFVLAGIYAVLRP
DDTTVVANWFGVRRGTDLMLYALVMAFSFTTLSTYMRFKDLELRYARIAR
ALALEGAQAPEQCR
>Rv3669 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MSKIDRKNGVPSTLTTIPLADPHAGPAEPSIGDLIKDATTQMSTLVRAEV
ELARAEITRDVKKGLTGSVFFISSLVVGFYSTFFFFFFVAELLDTWIWRW
VAFLLVFAIMVVVTAVLALLGFLKVRRIRGPRQTIASVKETRTALTPGHD
KTPVTPKPVTSDRATPVDPSGW
>Rv3541c CONSERVED HYPOTHETICAL PROTEIN
MTVVGAVLPELKLYGDPTFIVSTALATRDFQDVHHDRDKAVAQGSKDIFV
NILTDTGLVQRYVTDWAGPSALIKSIGLRLGVPWYAYDTVTFSGEVTAVN
DGLITVKVVGRNTLGDHVTATVELSMRDS
>Rv1874 HYPOTHETICAL PROTEIN
MLMRPEPDDDWCARQRAQVADALLGLGVAGLSINVRDSTVRDSLMTLTTL
YPPVAAVVSLWTQQCYGEQVAAALRLLAQECDELGAYLVTESVPLTFPSL
VESGSRTPGLANIALLRRPDGLDQATWLTRWQRDHTQVAIEAQATFGYTQ
NWVVRALTPEAPGIAGIVEELFPVAATTDLKAFFGAADDNDLRNRISRMV
ASTSAFGANQNIDTVPTSRYVFRTPFKD
>Rv0398c POSSIBLE SECRETED PROTEIN
MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGD
AALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQ
RVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACV
KSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAP
NANYRTTASSWPG
>Rv1044 CONSERVED HYPOTHETICAL PROTEIN
MCAKPYLIDTIAHMAIWDRLVEVAAEQHGYVTTRDARDIGVDPVQLRLLA
GRGRLERVGRGVYRVPVLPRGEHDDLAAAVSWTLGRGVISHESALALHAL
ADVNPSRIHLTVPRNNHPRAAGGELYRVHRRDLQAAHVTSVDGIPVTTVA
RTIKDCVKTGTDPYQLRAAIERAEAEGTLRRGSAAELRAALDETTAGLRA
RPKRASA
>Rv2632c CONSERVED HYPOTHETICAL PROTEIN
MTDSEHVGKTCQIDVLIEEHDERTRAKARLSWAGRQMVGVGLARLDPADE
PVAQIGDELAIARALSDLANQLFALTSSDIEASTHQPVTGLHH
>Rv2737A CONSERVED HYPOTHETICAL CYSTEINE RICH PROTEIN (FRAGMENT)
MRPDLRARLVRITDDLLNTASLAGSGVLTGPDLTFRRRSCCLFYRVPAGG
KCGDCPL
>Rv3424c HYPOTHETICAL PROTEIN
MPNPVTMLYGRKADLVILPHVLAEERPHPYSTPGRKRGAQIALTTGIDAL
ASFAPQIVNPRHGLSRVVQCLGGCENKRHAYFRSISKTPHIRARGVPSVC
AVRTVGVDGAKRPPKPIPVQ
>Rv0210 HYPOTHETICAL PROTEIN
MIRAASDDPAGVDELVAAIAPGLAGLGLPVINRREVVLVTGPWLAGVSGV
RAALAERLPQRRFVETAELGPGDAPVAVVFVVSAATALTESDCVLLDTAA
EHTDAVVAVVSKIDVHRGWRDVLTSNRDRLAARASRYARVPWVGAAAAPE
LGEPYLDDLVAAIQKQLADPAVARRNMLRAWESRLLMVARRFDGDAQSAG
RRARVDALRQQRRTVLRQGRQSKSEHTIALRAQIQHARVKLSYFARNRCS
LLRVELQEHVAGLSRKDIARFAAYTRGRVQEVVAEVGEGAVAHLADVAQL
LGVPVQPPVLENLPAVLPTVVAPPLTSRRLEIRLTTLLGAGFGLGIALTL
SRLVAGLTPGLAASGMVAGVAIGLAVTAWVVNARALLHDRVVVDRWTGEV
TASLRSVVEQLVATRVVAVETLLSTAISERDDAENARVADQVSIIDGELR
EHAVAAARAAALRDREMPAVRAALEAVRAELGEPGAPTTGLF
>Rv3015c CONSERVED HYPOTHETICAL PROTEIN
MSVFATATGIGSWPGTAAREAAQVVVGELAGALAYLTELPARGVGADMLG
RAGGLLVDVAIDTVPRGYRIAARPGAVTRRAASLLDEDMDALEEAWETAG
LRGCGRAVKVQAPGPVTLVAGLELANGHRAITDPGAVRDLAASLAEGVAA
HRAALARRLDTPVVVQFDEPSLPAALGGRLTGVTALSPVAPLDETVAEAL
LDTCIAAVDADVALHSCSPDLPWDLLQRSRISAVSVDASTLQAADLDAVA
AFVESGRTVVLGLVPVTAPERAPSMEEVAAAAVAVTDRLGVPRSALRDRL
GVSPACGLANATGQWARTAVGLARDVAEAFARDPEAI
>Rv0396 HYPOTHETICAL PROTEIN
MRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWW
GSANWTVRAGSHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPE
PNLVQVEFDDAAMAEAMEEMAEPDDDGEDW
>Rv2390c CONSERVED HYPOTHETICAL PROTEIN
MAIFGRGHGASEPGGTGEPAETPGRGRLTRSVIGWVGAVAVVVSLAGSGW
CGWVLFEKHQTDVAAGQALQAARSYVVKLATMDCERIDHNMRDILEGSTG
EFKDKYGKSSAHLRQLLADNRVATHGTVVAASVKSATTNKVVVLMFIDQS
VSNRNSPTPQIDRSRIKVIMDKVNGRWLASKVELL
>Rv1749c POSSIBLE INTEGRAL MEMBRANE PROTEIN
MLRAVNEIRQHDGTLKLGKGVGMFTIVGVIVALIGAFVQSRRHRHRPAAD
IHMLWWMVLIVGVVSIIGAGYHVFDGERTAELIGYTRGDGGFQWENAMGD
LAIGVVGLMAYRFRGHFWLATIVVLTIQYVGDAAGHIYYWVVENNTNPYN
IGVPLWTDILLPIVMWALYAWSWHSNGDAVPKGQP
>Rv0426c POSSIBLE TRANSMEMBRANE PROTEIN
MSVVGGTVRTVGRTVSGAATATTAAAGAVGGAAVSGIVGGVTGAAKGIQK
GLSSGSKSTAAAALAIGAIGVAGLVDWPILLAVGGGALLLRKLNRTPEVA
APPVKAKLAPVPDKPAAAKEAPAKASKTTARKTSGRRAGTAELRSTN
>Rv1761c HYPOTHETICAL EXPORTED PROTEIN
MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNP
ERIQIGDWRYEVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQI
VSRYGATVIPNINAAIEVLGTGTDYRF
>Rv2799 PROBABLE MEMBRANE PROTEIN
MYTPGKGPPRAGGVVFTRVRLIGGLGALTAAVVVVGTVGWQGIPPAPTGG
DAVQLRSTAAPMSTTMKSPIVATTDPSPFDPCRDIPFDVIQRLGLAYTPP
EAEEGLRCHFDAGNYQMAVEPIIWRTYAQTLPPDAIETTIAGHRAAQYWV
RKPTYHNSFWYSSCMVTFKTSYGVIQQSLFYSTVYSEPDVDCPSTNLQRA
NDLVPYYRF
>Rv3256c CONSERVED HYPOTHETICAL PROTEIN
MNVARAIDLEDTEGLIAADRGALLRAASMAGAQVRAIAAAADEGELDLLR
GSDRPRSVIWVTGRGTAETAGTILASTLGAGAAEPIVLASAAPPWVGPLD
VLIVAGDDPGDPALVGAAAIGVRRGARVVVVAPYEGPLRDSTAGRVAVLE
PRLRVPDEFGLSRYLAAGLAALQTVDPKLRIDLASLADELDAEALRNSAG
REVFTNPAKALAARVSGCQLALAGDNAATLALARHGSSVMLRIANQVVAA
TRLSDAVVALRAGTPPDALFHDEEIDGPAPQRLRVLALALAGERTVVAAR
VAGLDDAYLVAAEDVPELLDAPVGSGGAVLAVRLEMAAVYLRLVRG
>Rv3749c CONSERVED HYPOTHETICAL PROTEIN
MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDL
IAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD
FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKAR
RLFPSGSPFGLGVLLPFDQ
>Rv0698 CONSERVED HYPOTHETICAL PROTEIN
MGRRGNRRVHVDRVRLTGTERELRAENQSPPIFRPQNTLGDGANGLPLAV
CTTTAHTCHTSHTHPSRWTPNPVPATKGVPAGLVQATFIIENLDPGNNDT
PTPPTPKLRLARKPGHHRRSEYDADSVLRRKDTSRRCVQADDVRCVQLVQ
DPRRGRVELGGYRAELTVGRRAAVNCQRPQYGADGWPVRLGCGVGGAARG
DQR
>Rv1591 PROBABLE TRANSMEMBRANE PROTEIN
MTEPPGFGGPSEPSGAPRTSRTRAVLFVMLGLSATGVLVGGLWAWIAPPI
HAVVAITRAGERVHEYLGSESQNFFIAPFMLLGLLSVLAVVASALMWQWR
EHRGPQMVAGLSIGLTTAAAIAAGVGALVVRLRYGALDFDTVPLSRGDHA
LTYVTQAPPVFFARRPLQIALTLMWPAGIASLVYALLAAGTARDDLGGYP
AVDPSSNARTEALETPQAPVS
>Rv3604c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN RICH IN ALANINE AND ARGININE AND PROLINE
MTVLSRGARVRRGGRRPGWVLLTALLVLAIGASSALVFTDRVELLKLAVL
LALWAAVAGAFVSVLYRRQSDVDQARVRDLKLVYDLQLDREISARREYEL
TLESQLRRELASELRAPAADEVAALRAELAALRTSLEILFDADLEHRPAL
GTVEKEARAARALDGESPPADWVSSDRVMAVRGGDGASRTDEASIIDVPE
VGVPPVSGGPRHYEAPPPPQPEPLFEPRHRPPPLPPQQERPVWQPVTSHG
QWLPAETPGSQWASVEPETTPAAPPPGRRRRARHASPADQAYNPPAYVEL
AAQYGESGRRSRHSAEHRDHDIGGSGAGTGERPPSPPMAPPPPAEPTRRH
RTADTPPDDSGGLHARDPLTGGQSVADLMARLQVESTGGGRRRRRGE
>Rv3529c CONSERVED HYPOTHETICAL PROTEIN
MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGE
AGLTVLGSKMNRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLV
RTGTTALHRLLGADPAHQGLHMWLAEYPQPRPPRETWESNPLYRQLDAQF
TQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEALAHVPSYADWL
SRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDAL
VVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF
NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTV
HAESQSGARAPKHSYSLADYGLTVEMVKERFAGL
>Rv3453 POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MPGVITNSESPTAADHDRITATRETLEDYTLRLAPRSYRRWPPAVVGISA
LGGIAYLADFAIGANVGITWGTANALCGIAIFALVVFVTGLPLAYYAARY
NIDLDLIYPR
>Rv0122 HYPOTHETICAL PROTEIN
MAGSVSAAAGIGWVGLNVTETNRDQCYRVERTTVDALTHPEYRVHTRGVQ
RVRVTRNARKHRVSKHRIVAAMRHCGVPVIQEDGSLYYQGRDTSGRLTEV
VAVEADDGDLIITHAMPKEWKR
>Rv2818c HYPOTHETICAL PROTEIN
MLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPV
FRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVST
PARALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLE
RANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKF
FKDTAFTYDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAVA
KHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLA
LLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQL
LKILARETGADLTLYDRLNDEIIRQIDMAPLG
>Rv0515 CONSERVED 13E12 REPEAT FAMILY PROTEIN
MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAAAQLVAL
GELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAM
RERLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVAR
WPSMTKARLAGQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIG
GSLLAVDAHALDARLSALAGTVCEHDPRSREQRRADALGALAGGADRLGC
GCGRADCAAGKRPAAPPVVIHLIAEAATINGTGSAPASQMNADGLITAEL
VAELAKTATLVPLVHPGDAPPEPGYAPSKALADFVRCRDLTCRWPGCDEP
ATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQQLPDGTLI
LTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK
RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDHNDDP
PPF
>Rv1012 HYPOTHETICAL PROTEIN
MPRAARGIRACRGRWVDRLAHQHASGRAAGIRPREVGGAHQSQAQKPYHD
ATEPLGESLRYRPAHGDSCINGHRDNPSARESSQFTAGSTAKAVTKL
>Rv2871 CONSERVED HYPOTHETICAL PROTEIN
MRTTIRIDDELYREVKAKAARSGRTVAAVLEDAVRRGLNPPKPQAAGRYR
VQPSGKGGLRPGVDLSSNAALAEAMNDGVSVDAVR
>Rv2706c HYPOTHETICAL PROTEIN
MLVGVMLAEKKLGSGGQLGAHPSCSATAVAAVCSSQLRTGQSCVHGSPFS
GIFTFSDVRGSRRVPRPLSGVSFLTTFAPANRAGW
>Rv0272c HYPOTHETICAL PROTEIN
MTGRAATPGVIREFVGLPSRTAGRAAAGGHPCQGLYHHSVGRKPKVALIA
AHYQIDFSEHYLAEYMAIRGIGFLGWNTRFRGFESSFLLDHALVDIGVGV
RWLREVQGVETVVLLGNSGGGSLMAAYQSQAVDPNVTPLDGMRPAAGVTE
LPAADAYVAAAAHPGRPDVLTAWMDAAVIDENDPVATDPELDLFDERNGP
PYSPEFISRYRSAQVKRNHTITDWAESELKRVRAAGFSDRPFSVMRTWAD
PRMVDPSIEPTKRRPNQCYAGTPVKANRSAHGIAAACTLRGWLGMWSLRV
AQTRAAPHLARITCPALVLNAEADTGIFPSDAQQIYDGLASSDKTQVSID
TDHYFTTPGARSEQADTIAKWIAKRWR
>Rv3122 HYPOTHETICAL PROTEIN
MYSGCWINNQNGETRVGEDSLEDLEQRRARLYDQLAATGDFRRGSISENY
RRCGKPNCVCAQEGHPGHGPRYLWTRTVAGRGTKGRQLSVEEVDKVRAEL
ANYHRFAQVSEQIVAVNEAICEARPPNPAATAPPAGTTGHKKGGSATRSR
RSSPPR
>Rv0311 HYPOTHETICAL PROTEIN
MSQSRYAGLSRSELAVLLPELLLIGQLIDRSGMAWCIQAFGRQEMLQIAI
EEWAGASPIYTKRMQKALNFEGDDVPTIFKGLQLDIGAPPQFMDFRFTLH
DRWHGEFHLDHCGALLDVEPMGDDYVVGMCHTIEDPTFDATAIATNPRAQ
VRPIHRPPRKPADRHPHCAWTVIIDESYPEAEGIPALDAVRETKAATWEL
DNVDASDDGLVDYSGPLVSDLDFGAFSHSALVRMADEVCLQMHLLNLSFA
IAVRKRAKADAQLAISVNTRQLIGVAGLGAERIHRAMALPGGIEGALGVL
ELHPLLNPAGYVLAETSPDRLVVHNSPAHADGAWISLCTPASVQPLQAIA
TAVDPHLKVRISGTDTDWTAELIEADAPASELPEVLVAKVSRGSVFQFEP
RRSLPLTVK
>Rv3691 CONSERVED HYPOTHETICAL PROTEIN
MAPASTSSTGGHALATLLGNHGVEVVVADSIADVEAAARPDSLLLVAQTQ
YLVDNALLDRLAKAPGDLLLVAPTSRTRTALTPQLRIAAASPFNSQPNCT
LREANRAGSVQWGPSDTYQATGDLVLTSCYGGALVRFRAEGRTITVVGSS
NFMTNGGLLPAGNAALAMNLAGNRPRLVWYAPDHIEGEMSSPSSLSDLIP
ENVHWTIWQLWLVVLLVALWKGRRIGPLVAEELPVVIRASETVEGRGRLY
RSRRARDRAADALRTATLQRLRPRLGVGAGAPAPAVVTTIAQRSKADPPF
VAYHLFGPAPATDNDLLQLARALDDIERQVTHS
>Rv0080 CONSERVED HYPOTHETICAL PROTEIN
MSPGSRRASPQSAREVVELDRDEAMRLLASVDHGRVVFTRAALPAIRPVN
HLVVDGRVIGRTRLTAKVSVAVRSSADAGVVVAYEADDLDPRRRTGWSVV
VTGLATEVSDPEQVARYQRLLHPWVNMAMDTVVAIEPEIVTGIRIVADSR
TP
>Rv2876 POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MFGQWEFDVSPTGGIAVASTEVEHFAGSQHEVDTAEVPSAAWGWSRIDHR
TWHIVGLCIFGFLLAMLRGNHVGHVEDWFLITFAAVVLFVLARDLWGRRR
GWIR
>Rv0036c CONSERVED HYPOTHETICAL PROTEIN
MADPGPFVADLRAESDDLDALVAHLPADRWADPTPAPGWTIAHQIGHLLW
TDRVALTAVTDEAGFAELMTAAAANPAGFVDDAATELAAVSPAELLTDWR
VTRGRLHEELLAVPDGRKLAWFGPPMSAASMATARLMETWAHGLDVADAL
GVIRPATQRLRSIAHLGVRTRDYAFIVNNLTPPAEPFLVELRGPSGDTWS
WGPSDAAQRVTGSAEDFCFLVTQRRALSTLDVNAVGEDAQRWLTIAQAFA
GPPGRGR
>Rv0789c HYPOTHETICAL PROTEIN
MSRRAIHSGRAAPRRSGNSHLVLRNRVPSSKDSPRRRPHHEFMTESIGEP
LSTNLIERYLRARGRRYFRGHHDAEFFFVANAHLRLHVHLEISPAYRDVF
TIRVSPAYFFPATDHTRLAEIVNAWNLQNHEVTAIVHGSSDPHRIGVAAE
RSLIRDRIRFDDFATFVDNAVSAATELFGQLTAAGLPPTATPPLLRDAG
>Rv2375 CONSERVED HYPOTHETICAL PROTEIN
MIFKGVREGKPYPEHGLSYRDWSQIPPQQIRLDELVTTTTVLALDRLLSE
DSTFYGDLFPHAVKWRGTTYLEDGLHRAVRAALRNRTVLHARVFDMDASP
GGRRS
>Rv3698 CONSERVED HYPOTHETICAL PROTEIN
MRTISPFLRCRHETCCISNVGEEVTRTTYSREHQREYRRKVRLCLDVFET
MLAQTRFEADRPLTGMEIECNLVDADYQPAMSNRYVLDAIADPAYQTELG
AYNIEFNVPPRPLPGRTCLELEDEVRASLNDAETKASCSGAHIVMIGILP
TLMPEHLTDGWMSASARYAALNESIFKARGEDIPINIAGPEPLSCHAGSI
APESACTSVQLHLQLAPADFPANWNAAQVLAGPQLALGANSPYFFGHQLW
SETRIELFTQSTDARPEELKSRGVRPRVWFGERWITSVLDLFQENIRYFP
TLLPEVSDEDPLAELSAGRIPHLSELRLHNGTVYRWNRPVYDVVDGRPHL
RLENRVLPAGPTVVDMLANHAFYYGALRGLSEADPPLWTQMNFAAAQANF
LAAARYGMDAQLDWPGLGEVTTRELVLGTLLPMAHEGLRRWGVDAEVRDR
FLGVIGGRAQTGRNGARWQVATVAALQDGGLTRPAALAEMLRRYCEHMHS
NEPVHTWDT
>Rv3718c CONSERVED HYPOTHETICAL PROTEIN
MGQVSAASTILINAEPTATLDALADYETVRPKILSPHYSEYQVLEGGKGR
GTVAKWRLQATQSRVRDVQVNVDVAGHTVIEKDMNSSMVTNWTVAPAGPG
SSVTVKTTWTGAGGVKGFFEKTFAPLGLKKIQAEVLSNLKTELEGDA
>Rv0175 PROBABLE CONSERVED MCE ASSOCIATED MEMBRANE PROTEIN
MKAADSAESDAGADQTGPQVKAADSAESDAGELGEDACPEQALVERRPSR
LRRGWLVGIAATLLALAGGLGAAGYFALRSHQESQSIAREDLAAIEAAKD
CVAATQAPDAGAMSASMQKIIECGTGDFGAQASLYTSMLVEAYQAASVHV
QVTDMRAAVERNNNDGSVDVLVALRVKVSNTDSDAHEVGYRLRVRMALDE
GRYKIAKLDQVTK
>Rv3271c PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN
METTTEHRDESTLDSPVSVAREAEWQRNVRWARWLAWVSLAVLLTEGAVG
LWQGIAVGSVALTGWALGGGSEGLASAMVLWRFTGDRTWSATAEHRAQRG
VAVSFWLTAPYLVAESIRHLAGEHRAETSVIGIGLTAIALLLMPVLGWAN
HRVGERLGSGATAGEGTQNYLCAAQAAAVLLGLAITAVWSNGWWIDPAIG
LAIAGIAVWQGIRTWRGHGCGC
>Rv0307c HYPOTHETICAL PROTEIN
MAVIVRKWFGLGRLPADLRCQVEAEGLIYLAEYVAVTRRFTGVIPGLRAS
HSIASYVGALAFTEQRVLGTLSMVPKLAGRVVDARWDGPQAGAATAEISP
TGLQLDLDVADVDPKFSGQLALHFKATIGEDVLSRLPRRSLAFDVPAEYV
NLAVGVTYSP
>Rv0622 POSSIBLE MEMBRANE PROTEIN
MSFCVYCGAELADPTRCGACGAYKIGSTWHRTTTPTVGAATTATGWRPDP
TGRHEGRYFVAGQPTDLVREGDAEAVDPLGQQQLDQSGAVGVSPSAVSGW
VRSGHRRLWWALAGVVAFLGLVGAGVVGTLFLNRDRESIDDKYLAALRRS
GLTGEFNSDANAIARGKQVCRQLQDGGEQQGMPVDQVAVQYYCPQFSDGF
HILETITVTGSFTLKDESPNVYAPAITVSGSGCSGSAGYADIDRGTQVTV
KNGQGDILATAFLQAGQGGRFLCTFPFSFEITEGEDRYVVSVSRRGEMSY
SFADLKANGLSLVLG
>Rv3780 CONSERVED HYPOTHETICAL PROTEIN
MRKRMVIGLSTGSDDDDVEVIGGVDPRLIAVQENDSDESSLTDLVEQPAK
VMRIGTMIKQLLEEVRAAPLDEASRNRLRDIHATSIRELEDGLAPELREE
LDRLTLPFNEDAVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARA
QLQQMRQGALPPGVGKSGQHGHGTGQYL
>Rv2635 HYPOTHETICAL PROTEIN
MVAADHRALGSNKSYPASQTAEAIWPPARTLRYDRQSPWLATGFDRRMSQ
TVTGVGVQNCAVSKRRCSAVDHSSRTPYRR
>Rv3835 PROBABLE CONSERVED MEMBRANE PROTEIN
MLDAPEQDPVDPGDPASPPHGEAEQPLPGPRWPRALRASATRRALLLTAL
GGLLIAGLVTAIPAVGRAPERLAGYIASNPVPSTGAKINASFNRVASGDC
LMWPDGTPESAAIVSCADEHRFEVAESIDMRTFPGMEYGQNAAPPSPARI
QQISEEQCEAAVRRYLGTKFDPNSKFTISMLWPGDRAWRQAGERRMLCGL
QSPGPNNQQLAFKGKVADIDQSKVWPAGTCLGIDATTNQPIDVPVDCAAP
HAMEVSGTVNLAERFPDALPSEPEQDGFIKDACTRMTDAYLAPLKLRTTT
LTLIYPTLTLPSWSAGSRVVACSIGATLGNGGWATLVNSAKGALLINGQP
PVPPPDIPEERLNLPPIPLQLPTPRPAPPAQQLPSTPPGTQHLPAQQPVV
TPTRPPESHAPASAAPAETQPPPPDAGAPPATQSPEATPPGPAEPAPAG
>Rv3555c CONSERVED HYPOTHETICAL PROTEIN
MDELPWPVLGSEVLAAKAIPERAMRQLYEPVYPGVYAPAGVELTARQRAH
AAWLWSRRRAVVAGNSAAALLGAKWVNPALDAELVHANRKPPPRIVVHTD
RLAPHETVAVDGVAVTTPARTAFDIGRRTPSRLQAVQRLDALANSTDVKV
ADVQAVIAEHTGARGLVRLRAVLPLIDGGAESPQETWTRLVLIDAGLPKP
QTQIRVFDDYGDFVARIDLGYEQLRVGVEYDGPQHWTDPAQRARDIERST
ALLDLGWTIIRVTSELLWYRRGTFVGRVDAAMRAAGWRP
>Rv0740 CONSERVED HYPOTHETICAL PROTEIN
MLPKNTRPTSETAEEFWDNSLWCSWGDRETGYTRTVTVSICQVADGEREA
EGVRDMMRLECPAGLDLRTPNPEAYEITGQRPGEFVFVLGYLGHVRAIVG
NCYIEIMPMGTRVELSKLADVALDIGRSVGCSAYENDFTLPDIPTQWRNQ
PLGWYTQGLAPYLPGLSDPKDAAEG
>Rv1373 GLYCOLIPID SULFOTRANSFERASE
MNSEHPMTDRVVYRSLMADNLRWDALQLRDGDIIISAPSKSGLTWTQRLV
SLLVFDGPDLPGPLSTVSPWLDQTIRPIEEVVATLDAQQHRRFIKTHTPL
DGLVLDDRVSYICVGRDPRDAAVSMLYQSANMNEDRMRILHEAVVPFHER
IAPPFAELGHARSPTEEFRDWMEGPNQPPPGIGFTHLKGIGTLANILHQL
GTVWVRRHLPNVALFHYADYQADLAGELLRPARVLGIAATRDRARDLAQY
ATLDAMRSRASEIAPNTTDGIWHSDERFFRRGGSGDWQQFFTEAEHLRYY
HRINQLAPPDLLAWAHEGRRGYDPAN
>Rv0430 CONSERVED HYPOTHETICAL PROTEIN
MDSAMARAIRSGDDAEVADGLTRREHDILAFERQWWKFAGVKEEAIKELF
SMSATRYYQVLNALVDRPEALAADPMLVKRLRRLRASRQKARAARRLGFE
VT
>Rv3747 CONSERVED HYPOTHETICAL PROTEIN
MILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPD
SSDRDITVEMRPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGR
WVLVVTGGTGAISLPVLVSDMPATIGF
>Rv3847 HYPOTHETICAL PROTEIN
MGTGSGGPIGVSPFHSRGALKGFVISGRWPDSTKEWAQLLMVAVRVASLP
GLLSTTTVFGAREELPDEPEPGTVGLVLAEGTVFGESAIQPGYFADHQPP
ALLMLHPPSETTPSLPECTGAASGCVLLPGLPYLGLEHRAAWVEAEADGT
ITSMVSRVGVDPISHPDTAILAMLLAA
>Rv1892 PROBABLE MEMBRANE PROTEIN
MIMCEGRPTESPIPRWLRFVLTSDRAGSAWYIGAGFFFAPVLAVLSPWPT
ITAVLWWIIGLAGLWLGLLGIAMAVGLARVLRSGAEIPEAYWRTLVDYRS
ANE
>Rv3224A CONSERVED HYPOTHETICAL PROTEIN
MRRSASTCGWKTPTRRGTSRPSDSKTLILELPDERAVAIVPVPSKLSLKA
AGGPRGAQSGHG
>Rv1795 CONSERVED HYPOTHETICAL MEMBRANE PROTEIN
MTAVADAPQADIEGVASPQAVVVGVMAGEGVQIGVLLDANAPVSVMTDPL
LKVVNSRLRELGEAPLEATGRGRWALCLVDGAPLRATQSLTEQDVYDGDR
LWIRFIADTERRSQVIEHISTAVASDLSKRFARIDPIVAVQVGASMVATG
VVLATGVLGWWRWHHNTWLTTIYTAVIGVLVLAVAMLLLMRAKTDADRRV
ADIMLMSAIMPVTVAAAAAPPGPVGSPQAVLGFGVLTVAAALALRFTGRR
LGIYTTIVIIGALTMLAALARMVAATSAVTLLSSLLLICVVAYHAAPALS
RRLAGIRLPVFPSATSRWVFEARPDLPTTVVVSGGSAPVLEGPSSVRDVL
LQAERARSFLSGLLTGLGVMVVVCMTSLCDPHTGQRWLPLILAGFTSGFL
LLRGRSYVDRWQSITLAGTAVIIAAAVCVRYALELSSPLAVSIVAAILVL
LPAAGMAAAAHVPHTIYSPLFRKFVEWIEYLCLMPIFPLALWLMNVYAAI
RYR
>Rv3355c CONSERVED HYPOTHETICAL PROTEIN
MTVRAVFRRTVGAQWPILLVGSIFAVGFVLAGANFWRRGALLIGIGVGVA
AVLRLVLSEERAGLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG
>Rv3805c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MVRVSLWLSVTAVAVLFGWGSWQRRWIADDGLIVLRTVRNLLAGNGPVFN
QGERVEANTSTAWTYLLYVGGWVGGPMRLEYVALALAMVLSLLGMVLLML
GTGRLYAPSLRGRRAIMLPAGALVYIAVPPARDFATSGLESGLVLAYLGL
LWWMMVCWSQPLRARPDSQMFLGALAFVAGCSVLVRPEFALIGGLALIMM
LIAARTWRRRVLIVLAGGFLPVAYQIFRMGYYGLLVPSTALAKDAAGDKW
SQGMIYVSNFNRPYALWVPLVLSVPLGLLLMTARRRPSFLRPVLAPDYGR
VARAVQSPPAVVAFIVGSGVLQALYWIRQGGDFMHGRVLLAPLFCLLAPV
GVIPILLPDGKDFSRETGRWLVGALSGLWLGIAGWSLWAANSPGMGDDAT
RVTYSGIVDERRFYAQATGHAHPLTAADYLDYPRMAAVLTALNNTPEGAL
LLPSGNYNQWDLVPMIRPSSGTAPGGKPAPKPQHAVFFTNMGMLGMNVGL
DVRVIDQIGLVNPLAAHTERLKHARIGHDKNLFPDWVIADGPWVKWYPGI
PGYIDQQWVTQAEAALQCPATRAVLNSVRAPITLHRFLSNVLHSYEFTRY
RIDRVPRYELVRCGLDVPDGPGPPPRE
>Rv2472 CONSERVED HYPOTHETICAL PROTEIN
MMMRIAVRLPGEVITFVDSEVSQIRIPSRRAAVVLRASNASDAAILTATE
PNHHLDALAGQAAKLAPTSIDAAHPARPARRDPCLYPRTGQALPRTG
>Rv0572c HYPOTHETICAL PROTEIN
MGEHAIKRHMRQRKPTKHPLAQKRGARILVFTDDPRRSVLIVPGCHLDSM
RREKNAYYFQDGNALVGMVVSGGTVEYDADDRTYVVQLTDGRHTTESSFE
HSSPSRSPQSDDL
>Rv3901c POSSIBLE MEMBRANE PROTEIN
MQAANRRSADTICGVTAPAPLPIPRTRSWPAIVVAAIAAVVAVAALIVAL
TNARPAATPATTSVPTYTAAQTAAAQRQLCDTYKLVAHAVPVDTNGSDKA
LARITLTNAAAILDNAAADPALDAKHRDAARASDRLPHNDRNGEWWHSS
>Rv1678 PROBABLE INTEGRAL MEMBRANE PROTEIN
MARVRRGTELLLSPQSPPATGGLIVLTGLRLLAGLIWLYNVVWKVPPDFG
ERGRRDLYHFTHLAVEHPVFTPFSWVIEHAVLPYFTAFGWGVLFAESALA
VLLLTGTAVRLAALIGIGQSVAIGLSVAESPGEWPWAYAMLLGIHVVLLF
TCSTRYAAVDAVRAAATGSAARTAAQRLLAGWGIVLGLIGLVAVWRGLGD
DRPAYVGIRALEFSLGEYNLRGALALIAIALAMLAAAKRGWRTVALVAAV
VAVAAAAAIYLQVGRTAVWLGGTNTTAAVFVCAAVVSLATEFRIGRVEGA
>Rv0603 POSSIBLE EXPORTED PROTEIN
MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQA
VPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADG
DGG
>Rv2620c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MSAGPAIEVAVAFVWLGMVVAISFLEAPLKFRAAGVTLQIGLGIGRLVFR
ALNTVEVGFALVILAIVVVGSTPARIAAAFSVALAALAVQLIAVRPRLTR
RSNQVLAGLQAPRSRGHHIYVGLEIVKVVALLVAGILLLNG
>Rv3706c CONSERVED HYPOTHETICAL PROLINE RICH PROTEIN
MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGK
HAGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSP
PATPAP
>Rv2550c HYPOTHETICAL PROTEIN
MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQP
GPDPVDAFVGSFVGEADLSASVDDVVYGKHE
>Rv1132 CONSERVED MEMBRANE PROTEIN
MGFLQPRLPDIDLAEWSQGSRSQKIRPMAQHWAEVGFGTPVLLHLFYVAK
ILLYVLVGWLIVLTTKGIDGFTDAAAWYAEPIVFEKVVLYTMLFEVIGLG
CGFGPLNNRFFPPMGSILYWMRFGTIRLPPWPDRVPWTRGTKRKPVDVAL
YALLVMMLLSALFTDGAGPIPELGTTVGLLPAWQIVLILLLLGVLGLRDK
VIFLAARGEVYATLTVTFLFGRLNGIDMIVAAKLVFLVIWIGAATSKLNR
HFPFVISTMMSNNPLFRPRFIKRMFFKKFPGDLRPGLLSRIVAHVSTVIE
MCVPVVLFVAHGGWPTVVAATIMVCFHLGILTAIPMGVPLEWNVFMIFGV
LSLFVGHACLGLADVKNPVPLAILIAVVAGIVIAGNVFPRKISFLAAMRY
YAGNWDTTLWCIKPSAEDKINRGIVAIASMPAAQLERFYGKDRAQIPMYL
GYAFRAMNSHGRALFTLAHRAMAGHDEDDYVITDGERVCSTAVGWNFGDG
HLHNEQLIAAMQQRCGFQPGEVRVVLLDAQPIHRQTQEYRLVDAATGEFE
RGYVRVADMVNRQPWDDDVPVHVLPG
>Rv1374c HYPOTHETICAL PROTEIN
MVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIG
ARSFAVGRKICRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCR
EVGNYAQRRVGRFAFFEQTFVRHALTPRCSRTDSKTSYTQLNRICKFPPH
WV
>Rv3887c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MTAPHKVAFPARCAVNICYDKHLCSQVFPAGIPVEGFFEGMVELFDADLK
RKGFDGVALPAGSYELHKINGVRLDINKSLDELGVQDGDTLVLVPRVAGE
SFEPQYESLSTGLAAMGKWLGRDGGDRMFAPVTSLTAAHTAMAIIAMAVG
VVLALTLRTRTITDSPVPAAMAGGIGVLLVIGALVVWWGWRERRDLFSGF
GWLAVVLLAVAAACAPPGALGAAHALIGLVVVVLGAITIGVATRKRWQTA
VVTAVVTVCGILAAVAAVRMFRPVSMQVLAICVLVGLLVLIRMTPTVALW
VARVRPPHFGSITGRDLFARRAGMPVDTVAPVSEADADDEDNELTDITAR
GTAIAASARLVNAVQVGMCVGVSLVLPAAVWGVLTPRQPWAWLALLVAGL
TVGLFITQGRGFAAKYQAVALVCGASAAVCAGVLKYALDTPKGVQTGLLW
PAIFVAAFAALGLAVALVVPATRFRPIIRLTVEWLEVLAMIALLPAAAAL
GGLFAWLRH
>Rv1035c PROBABLE TRANSPOSASE (FRAGMENT)
MPHPTTLMKLTTRCGSAAIDGLNEALLAKAAEAKLLGTNRIRADTTVARA
NVSYPTDLGLLAKAMRRIAATGKRIQAAGGAVRTRVGDRSRAAGRRAHAV
AAKLRSRAELGRDEARAAVLRFTGELAELAQAAAQEAQQLLDNAKQAVLR
AKAKAAALAARGERDAVAGRRCGGLVRAVNDLTELLNATRQIVAQTRQRV
AGITSDGASRRVSLHDGDARPDHQGSAR
>Rv3802c PROBABLE CONSERVED MEMBRANE PROTEIN
MAKNSRRKRHRILAWIAAGAMASVVALVIVAVVIMLRGAESPPSAVPPGV
LPPGPTPAHPHKPRPAFQDASCPDVQMISVPGTWESSPQQNPLNPVQFPK
ALLLKVTGPIAQQFAPARVQTYTVAYTAQFHNPLTTDNQMSYNDSRAEGT
RAMVAAMTDMNNRCPLTSYVLIGFSQGAVIAGDVASDIGNGRGPVDEDLV
LGVTLIADGRRQQGVGNQVPPSPRGEGAEITLHEVPVLSGLGLTMTGPRP
GGFGALDGRTNEICAQGDLICAAPAQAFSPANLPTTLNTLAGGAGQPVHA
MYATPEFWNSDGEPATEWTLNWAHQLIENAPHPKHR
>Rv1473A POSSIBLE TRANSCRIPTIONAL REGULATORY PROTEIN
MRKSKKTRDQLLRELRNAYEGGASIRNLAATTGRSYGSIHSMLRESGTTM
RGRGGPNRRSRPR
>Rv3909 CONSERVED HYPOTHETICAL PROTEIN
MTALQLGWAALARVTSAIGVVAGLGMALTVPSAAPHALAGEPSPTPFVQV
RIDQVTPDVVTTSSEPHVTVSGTVTNTGDRPVRDVMVRLEHAAAVTSSTA
LRTSLDGGTDQYQPAADFLTVAPELDRGQEAGFTLSAPLRSLTRPSLAVN
QPGIYPVLVNVNGTPDYGAPARLDNARFLLPVVGVPPDQATDFGSAVAPE
TTAPVWITMLWPLADRPRLAPGAPGGTVPVRLVDDDLANSLANGGRLDIL
LSAAEFATNREVDPDGAVGRALCLAIDPDLLITVNAMTGGYVVSDSPDGA
AQLPGTPTHPGTGQAAASSWLDRLRTLVHRTCVTPLPFAQADLDALQRVN
DPRLSAIATISPADIVDRILDVSSTRGATVLPDGPLTGRAINLLSTHGNT
VAVAAADFSPEEQQGSSQIGSALLPATAPRRLSPRVVAAPFDPAVGAALA
AAGTNPTVPTYLDPSLFVRIAHESITARRQDALGAMLWRSLEPNAAPRTQ
ILVPPASWSLASDDAQVILTALATAIRSGLAVPRPLPAVIADAAARTEPP
EPPGAYSAARGRFNDDITTQIGGQVARLWKLTSALTIDDRTGLTGVQYTA
PLREDMLRALSQSLPPDTRNGLAQQRLAVVGKTIDDLFGAVTIVNPGGSY
TLATEHSPLPLALHNGLAVPIRVRLQVDAPPGMTVADVGQIELPPGYLPL
RVPIEVNFTQRVAVDVSLRTPDGVALGEPVRLSVHSNAYGKVLFAITLSA
AAVLVTLAGRRLWHRFRGQPDRADLDRPDLPTGKHAPQRRAVASRDDEKH
RV
>Rv0383c POSSIBLE CONSERVED SECRETED PROTEIN
MVPLWFTLSALCFVGAVVLLYVDIDRRRGRSRRRKSWARSHGFDYEREST
EILKRWTRGVMSTVGDVAAHNVVLGQIRGEAVYIFDLEEVATVIALHRKV
GTNVVVDLRLKGLKEPRESDIWLLGAIGPRMVYSTNLDAARRACDRRMVT
FAHTAPDCAEIMWNEQNWTLVSMPIASTRAQWDEGLRTVRQFNDLLRVLP
PLPQEMPQQTGVGPRGAAPGRPVAPGGPAELPPRRAQPDPATTVLPDPAR
RAPEPIRRDEGRSEGVRRPPPAGRNGQQATNYQH
>Rv3857c POSSIBLE MEMBRANE PROTEIN
MNCALGFDTKPILLASYVTHGARRATANQFERPAKGAGVLMALLILGEMA
GFAVVVTGVVFGQLV
>Rv1507A HYPOTHETICAL PROTEIN
MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDI
FNMVLGKARMGRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGR
SVSGFVLMIKSASVHEIDSWSSPSVAMSIGVALCSYPHYAAARTSPPNRD
WGEDTTRSRPVTGLLAG
>Rv0609A CONSERVED HYPOTHETICAL PROTEIN
MEGQRLWAHRRPKGTGSAVIDVSLARRCEAHGYDYFRSDDPVAAAGFVVS
AVWSCGRGPGNATGSGRLPKPLRHS
>Rv1914c HYPOTHETICAL PROTEIN
MVLSRTSTGRVILVPTQLRFDRWFLPLAVPLGLGPKNSELWVGAGSLHVK
MGWAFAADIPLTSITKAEATNARVYAAGVHFGFGRWLVNGSRKGLVALTI
DPPEQAKMWKKSMTVRELWVSVTDPDALVTACTAK
>Rv3395A PROBABLE MEMBRANE PROTEIN
MQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGF
ELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPT
LHNAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFA
SDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLVEEAGGDWNATTIA
HGRREFVN
>Rv2774c HYPOTHETICAL PROTEIN
MGTAVEVGWRDPCGLAVGELRCAPAVSDQPVVGCAGCPLVDMVDFAPVTG
CVAVGSTMGAVPALLRVRFPWPPFEPDVRLSPYLALHGICRWGGSDSCDR
TTVQVFHLHSINKRLTAHAGFGAAAVVGLEDGPV
>Rv2307D HYPOTHETICAL PROTEIN
MWRHLWLMQPQRRYPRGSGTTRTARRDAGVAPLYGVSRVTVLASTTATTA
PPVKSFPDLL
>Rv3123 HYPOTHETICAL PROTEIN
MRSRSVRWDPRCRPGRSGVGDPHCDDPAGLLAAGAAAGRRHRAPGPAHRL
RARALRVVRRLPRQEPRYRAGPGPVAPRLLPLPHLRAWDGAPWIWNLATA
ILPEATPIVDLYHARQHVHDLAGQLAPALGEHHSDWLTARLVDLDSGDIE
TLVQQPIGQHTGHT
>Rv1103c CONSERVED HYPOTHETICAL PROTEIN
MYLPWGVVLAGGANGFGAGAYQTGTICEVSTQIAVRLPDEIVAFIDDEVR
GQHARSRAAVVLRALERERRRRLAERDAEILATNTSATGDLDTLAGHCAR
TALDID
>Rv2438A CONSERVED HYPOTHETICAL PROTEIN
MARTGHVQYRRGVGRRVTDGGVVSAGGNAHEPVLVGGVKVHRPFIVAQRR
QNARITRRVSTLDTVESPALLADGGIDRRGDATDWAAADPGP
>Rv0011c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIGLIWLMV
FQLAAIGSQAPTALNWMAQLGPWNYAIAFAFMITGLLLTMRWH
>Rv0724A CONSERVED HYPOTHETICAL PROTEIN
SQDRLFDNSTELSVAGSTIATELVPGIVDFDAGRVREMADSFRKHGVDID
MASLVYSGERSHVVDYLRAKGWDVEGTVRTDLFRRNGLPVPAPHDDDPLG
EIIFISGRLNG
>Rv2990c HYPOTHETICAL PROTEIN
MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERP
WGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQD
RLLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLA
DHGRLYLVGLEPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPL
DWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIERFSSNGLGM
AMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM
>Rv0361 PROBABLE CONSERVED MEMBRANE PROTEIN
MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITT
SDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTT
PPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAI
LGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDG
YVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDP
QVRSTRSLDLQFRDDQWKICQSSSN
>Rv3651 CONSERVED HYPOTHETICAL PROTEIN
MTHDWLLVETLGDEPAVVARGRELKKLVPITTFLRRSPYLAAVRTAIAET
LQTGQSLTSITPKHDRVIRTEPVIMTDGRMHGVQVWSGPTDAEPPDRPIP
GPLKWDLTRGVATDTPESLTNSGKNPEVEITYGRAFAEDLPARELNPNET
QVLAMAVKAKPGKTLCSIWDLTDWQGTPIRIGFVARSALEPGPNGRDHLV
ARAMNWRAETKAPAVPVDDLAQRILIGLAQAGVHRALVDLKTWTLLKWLD
QPCSFYDWRRSAADGPRLHPDDQHVIDAMTRDLANGSASHVLRLPGHDVD
WVPVHVTVNRIELEPDTFAGLVALRLPTDEELADAGLPKATDVTT
>Rv3615c CONSERVED HYPOTHETICAL PROTEIN
MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQ
FNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDG
LFT
>Rv1993c CONSERVED HYPOTHETICAL PROTEIN
MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVMEWGLRG
TRRAEAAAESARLTVADVVAEARGRIGEEAPLPAGARVDE
>Rv3233c CONSERVED HYPOTHETICAL PROTEIN
MIAGALGNWLMSRGEAVAPTATVRAMAPLSVYADDQLDSTGPGQAISQVT
PFLVDLPVGEGNAVVRLSQIAHATESNPTAASLVDARTIVTLSGLAPATL
HAMGVRVATSFSARLFNLLITNAPGTQSQMYIAGTKLLETYSVPPLLHNQ
ALAISVTSYNGMLYFGINADRDAMSDVDLLPGLLSQALDELLEASR
>Rv1907c HYPOTHETICAL PROTEIN
MIGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARRDGDDET
GMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAY
TVGLTRRGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAG
PLVETVQVTHPDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEGR
GTQPVLGMRATRRSA
>Rv0367c HYPOTHETICAL PROTEIN
MPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTA
SRRRVEAALAGHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQ
GITTVALNDAGDIVEHRPDGTSVVLAATP
>Rv1459c POSSIBLE CONSERVED INTEGRAL MEMBRANE PROTEIN
MAARHHTLSWSIASLHGDEQAVGAPLTTTELTALARTRLFGATGTVLMAI
GALGAGARPVVQDPTFGVRLLNLPSRIQTVSLTMTTTGAVMMALAWLMLG
RFTLGRRRMSRGKLDRTLLLWMLPLLIAPPMYSKDVYSYLAQSEIGRDGL
DPYRVGPASGLGLGHVFTLSVPSLWRETPAPYGPLFLWIGRGISSLTGEN
IVAAVLCHRLVVLIGVTLIVWATPRLAQRCGVAEVSALWLGAANPLLIMH
LVAGIHNEALMLGLMLTGVEFALRGLDMANTPRPSPETWRLGPATIRASR
RPELGASPRAGASRAVKPRPEWGPLAMLLAGSILITLSSQVKLPSLLAMG
FVTTVLAYRWGGNLRALLLAAAVMASLTLAIMAILGWASGLGFGWINTLG
TANVVRSWMSPPTLLALGTGHVGILLGLGDHTTAVLSLTRAIGVLIITVM
VCWLLLAVLRGRLHPIGGLGVALAVTVLLFPVVQPWYLLWAIIPLAAWAT
RPGFRVAAILATLIVGIFGPTANGDRFALFQIVDATAASAIIVILLIALT
YTRLPWRPLAAEQVVTAAESASKTPATRRPTAAPDAYADST
>Rv3723 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MGRKVAVLWHASFSIGAGVLYFYFVLPRWPELMGDTGHSLGTGLRIATGA
LVGLAALPVVFTLLRTRKPELGTPQLALSMRIWSIMAHVLAGALIVGTAI
SEVWLSLDAAGQWLFGIYGAAAAIAVLGFFGFYLSFVAELPPPPPKPLKP
KKPKQRRLRRKKTAKGDEAEPEAAEEAENTELAAQEDEEAVEAPPESIES
PGGEPESATREAPAAETATAEEPRGGLRNRRPTGKTSHRRRRTRSGVQVA
KVDE
>Rv2808 HYPOTHETICAL PROTEIN
MSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVIDVLS
DALMKTFGPDWVPNDYGLKIERAIDAYLETWPIYR
>Rv3412 CONSERVED HYPOTHETICAL PROTEIN
MRDHLPPGLPPDPFADDPCDPSAALEAVEPGQPLDQQERMAVEADLADLA
VYEALLAHKGIRGLVVCCDECQQDHYHDWDMLRSNLLQLLIDGTVRPHEP
AYDPEPDSYVTWDYCRGYADASLNEAAPDADRFRRR
>Rv2079 CONSERVED HYPOTHETICAL PROTEIN
MQLRHINIRALIAEAGGDPWAIEHSLHAGRPAQIAELAEAFHAAGRYTAE
ANAAFEEARRRFEASWNRENGEHPINDSAEVQRVTAALGVQSLQLPKIGV
DLENIAADLAEAQRAAAGRIATLESQLQRIDDQLDQALELEHDPRLAAAE
RSELDALITCLEQDAIDDTASALGQLQSIRAGYSDHLQQSLAMLRADGYD
GAGLQGLDAPQSPVKPEEPIQIPPPGTGAPEVHRWWTSLTSEERQRLIAE
HPEQIGNLNGVPVSARSDANIAVMTRDLNRVRDIATRYRTSVDDVLGDPA
KYGLSAGDITRYRNADETKKGLDHNARNDPRNPSPVYLFAYDPMAFGGKG
RAAIAIGNPDTAKHTAVIVPGTSSSVKGGWLHDNHDDALNLFNQAKAADP
NNPTAVIAWMGYDAPNDFTDPRIATPMLARIGGAALAEDVNGLWVTHLGV
GQNVTVLGHSYGSTTVADAFALGGMHANDAVLLGCPGTDLAHSAASFHLD
GGRVYVGAASTDPISMLGQLDSLSQYVNRGNLAGQLQGLAVGLGTDPAGD
GFGSVRFRAEVPNSDGINPHDHSYYYHRGSEALRSMADIASGHGDALASD
GMLAQPRHQPGVEIDIPGLGSVEIDIPGTPASIDPEWSRPPGSITDDHVF
DAPLHR
>Rv3483c CONSERVED HYPOTHETICAL PROTEIN
MSDEIDPDWPAPAYQPSDDVDTTPPAPGGSWPTAWLVALVVLACVAAAVV
AYAGMHRVRPGANQAAPATTSAPARPTSPASQVGPCGPDEATAVRAALAQ
LAPDSKTGRPWNSTPEDSNYDPCADLSAVLVTVQDATNSSPDQALMFHRG
TFVGTATPRAYPFTNLIGPASTNDIVVLSYRTRQSCDGCQDGILTIVGFA
WRGDHVQILDSLPELFDAPP
>Rv3882c POSSIBLE CONSERVED MEMBRANE PROTEIN
MRNPLGLRFSTGHALLASALAPPCIIAFLETRYWWAGIALASLGVIVATV
TFYGRRITGWVAAVYAWLRRRRRPPDSSSEPVVGATVKPGDHVAVRWQGE
FLVAVIELIPRPFTPTVIVDGQAHTDDMLDTGLVEELLSVHCPDLEADIV
SAGYRVGNTAAPDVVSLYQQVIGTDPAPANRRTWIVLRADPERTRKSAQR
RDEGVAGLARYLVASATRIADRLASHGVDAVCGRSFDDYDHATDIGFVRE
KWSMIKGRDAYTAAYAAPGGPDVWWSARADHTITRVRVAPGMAPQSTVLL
TTADKPKTPRGFARLFGGQRPALQGQHLVANRHCQLPIGSAGVLVGETVN
RCPVYMPFDDVDIALNLGDAQTFTQFVVRAAAAGAMVTVGPQFEEFARLI
GAHIGQEVKVAWPNATTYLGPHPGIDRVILRHNVIGTPRHRQLPIRRVSP
PEESRYQMALPK
>Rv2728c CONSERVED HYPOTHETICAL ALANINE RICH PROTEIN
MLSAIGIVPSAPVLVPELAGAAAAELADLGAAVIAAASLLPKSWIAVGTG
RADDVVRPTDVGTFAGFGADVRVGLAPQDGDGVAVPVELPLCALLTAWVR
GQARPEARAQVHVYASDHGSDAAVARGRQLRADIDREPDPIGVLVVADGL
NTLTPRAPGGYDPDGAGMQRALDDALASGDLAVLTRLPAQVLGRVAFQVL
AGLAEPGPRSAKEFYRGAPHGVGYFAGVWQP
>Rv2526 HYPOTHETICAL PROTEIN
MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRR
RIVDHLAHAGTHVDADVLLSEQAWR
>Rv0514 POSSIBLE TRANSMEMBRANE PROTEIN
MIARYRAGAELFLACAALAGSAASWSRTRSTVAVAPVIDGQPVTLSVVYH
PQPLVLTLLLATIAGVLSVVGTARLRRARAGLNAHPDGLNQRPPGGWCH
>Rv1382 PROBABLE EXPORT OR MEMBRANE PROTEIN
MNSGTLAGSLIFAAVLVMLIAVLARLMMRGWRRRSERQAELLGDLPDVPE
HVSSATVTTRGLYVGATLSPAWNERVTVGDLGYRSKAVLTRYPSGIMVER
ARAQPIWIPTESIAAIRMERGVAGKVVAGIGILAIRWRLPSGTEIDVGFR
ADNRDEYQEWLEEPV
>Rv1875 CONSERVED HYPOTHETICAL PROTEIN
MTTLNEAAALAAAERGLAVVSTVRADGTVQASLVNVGLLPHPVSGEPSLG
FTTYGKVKLGNLRARPQLAVTFRNGWQWATVEGRAQLVGPDDPRPWLVDG
ERLRLLLREVFTAAGGTHDDWDEYDRVMAQEQRAVVLITPTRIYSNG
>Rv1780 CONSERVED HYPOTHETICAL PROTEIN
MQNHDYVTYEEFGRRFFEVAVTPDRVAAAFADIAGSEFAMEPISQGPGGI
AKVSANVKIREPRVTRKLGDLITFVIHIPLSIDLLLDLRLDKQRFMVAGD
IALRATARAAEPLLLIVDVAKPRPSDITVNVSSKSIRGEVLRILAGVDGE
IRRFIAQYVSAEIDSPKSQAAQVINVAEQLDSTWSGP
>Rv0185 CONSERVED HYPOTHETICAL PROTEIN
MIGADVPRDSQRARVYAAEAFVRTLFDRVTAHGSPTVEFFGTQLTLPPEG
RFGSVASVQRYVDDVLALPAVGQNWPTVSPVRVRARRAATAAHYENHGGT
GTIAVPDRHTAGWAMRELVVLHEVAHHLCQVPPPHGPEFVATVCTLTELV
MGPEVGHVFRVVYAQEGVR
>Rv2312 HYPOTHETICAL PROTEIN
MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGR
TKQFMEELSQLASAPGPDIDGGIDLTDDEFQAFLQAARS
>Rv1081c PROBABLE CONSERVED MEMBRANE PROTEIN
MTHTPIPRPDARYGRPRLSRRARRRVAIALGVLVAAAGIVIAVIGYQRIS
TSAVTGSLVGYRLVDDETASVTISVTRSDPSRPVACIVRVRATNGSETGR
RELLVPPSEATTVQVTTTVKSSQPPVMADVYGCGTEVPSYLRLP
>Rv2091c Probable membrane protein
MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYT
PQYQQPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQY
GQPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLF
IGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKC
NNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ
>Rv1499 HYPOTHETICAL PROTEIN
MPSGEPSTAGHFEHLPRGSFGRILSVLNAAADHHPRELLVVGIATFDQKR
PAVGVDEHDPGGAATPAVVINYESRSSAGGTIGHSTTSQVACCLYQQPKR
PALRPTKAAATTAATTWIERVQNRRGRHSALV
>Rv2708c CONSERVED HYPOTHETICAL PROTEIN
MSGMQTQTIERTDADERVDDGTGSDTPKYFHYVKKDKIAESAVMGSHVVA
LCGEVFPVTRAPKPGSPVCPDCKRIYDTLKKG
>Rv2507 POSSIBLE CONSERVED PROLINE RICH MEMBRANE PROTEIN
MNDPRRPQRFGPPLSGYGPTGPQVPPNPPTADPAYADQSPYASTYGGYVS
PPWSPGGPPPRPPQWPPGPHEASPTQQLPQYWQYDQPPPGGFPPDGLTPP
PPQGPRTPRWLWFAAGSAVLLVVALVIALVIANGSVKKQTAIEPLPPMPG
PSPTRPTTTTPTPPSPSAAPAPTTTTGTPSETVAGAMQTVVYDVTGEGRA
ISITYMDSGNVIQTEFNVALPWRKEVSLSKSSLHPASVTIVNIGHNVTCS
VTVAGVQVRQRTGAGLTICDAPS
>Rv3773c CONSERVED HYPOTHETICAL PROTEIN
MPPESRPGPDSPPTDELACAEAALQVLQQVLHTIGRQDKAKQTPCPGYDV
KKLTEHLLNSIMVLGGMVGAEFSLRADIDSVERLVSGAARSALDAWHRHG
LEGDVSLGPGSMSAKVAVSVFSVEFLVHAWDYAVAVGSELKAADSLAEYV
LELARKLIKPEERSVAGFNEPVDVPEDGGALERLIAFTGRNPAR
>Rv0200 POSSIBLE CONSERVED TRANSMEMBRANE PROTEIN
MRNAWRLVVFDVLAPLATIAALAAIGVLLGWPLWWVSTCSVLVLLVVEGV
AINFWLLRRDSVTVGTDDDAPGLRLAVVFLCAAAISAAVVTGYLRWTTPD
RDFNRDSREVVHLATGMAETVASFSPSAPAAAVDRAAAMMVPEHAGGFKE
QYAKSSADLARRGVTAQAATLAAGVEAIGPSAASVAVILRVSQSIPGQPT
SQAARALRVTLTKRGSGWLVLDVTPINAR
>Rv1265 HYPOTHETICAL PROTEIN
MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGR
PGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGL
ESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGE
RILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHT
EEWLAKLRTAMSAVDDLRAQGPDLPA
>Rv0477 POSSIBLE CONSERVED SECRETED PROTEIN
MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTE
WAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMR
AQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF
>Rv2113 Probable integral membrane protein
MSLSVRRPPAARAAAIVEAESWFLKRGLPSVLTMRGRCRRLWPRSAPMLA
AWAVVEGCLMAVFFVTDGGEVFISATPTTAQWVILALLAVALPLASLVGW
LVSQISSGRGQAAVATMAVAFAAASDVIESGPIQLLRTAVVVGLVLLQTG
CGVGSVLGWAVRMTLEHLATVGTLAVRALPIVLLTALVFFNTYVWLMAAN
INGERLTLAMVFLLAIAGAFVVSKTVERVRPLLRSTTVMPQGSQSLAGTP
FATMGDPSPGFPLTRAERLNVVFLLAASQLVEILVVASVGAAIYLVLGMI
ILTPPLLREWTHYDSMTTTVLGMTFPAPDSLIRMCLFLGALTFMYISARA
VDDAEYRAMFLDPLIDDLHTALLARNRYRNNVVTAPCAGVDAGHVDD
>Rv0831c CONSERVED HYPOTHETICAL PROTEIN
MLPETNQDEVQPNAPVALVTVEIRHPTTDSLTESANRELKHLLINDLPIE
RQAQDVSWGMTAPGGAPTPVADRFVRYVNRDNTTAASLKNQAIVVETTAY
RSFEAFTDVVMRVVDARAQVSSIVGLERIGLRFVLEIRVPAGVDGRITWS
NWIDEQLLGPQRFTPGGLVLTEWQGAAVYRELQPGKSLIVRYGPGMGQAL
DPNYHLRRITPAQTGPFFLLDIDSFWTPSGGSIPEYNRDALVSTFQDLYG
PAQVVFQEMITSRLKDELLRQ
>Rv2767c POSSIBLE MEMBRANE PROTEIN
MVGYEGARGRAGREMSESATAGARSSRIPFGIIRNHEAVRPRRSRHLNHA
RDTPQMVAVAQVWREVVQATAIAIAPPLPVVSWGLISLAFLSHTVRGRYR
RSPPAESGHHSNRRQAK
>Rv2714 CONSERVED HYPOTHETICAL ALANINE AND LEUCINE RICH PROTEIN
MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDA
GHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDD
PELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGL
GTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMA
QHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAA
AEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
DELGAEFERFLAQQAEKKSDDDPT
>Rv2271 CONSERVED HYPOTHETICAL PROTEIN
MTTPPDKARRRFLRDAYKNAERVARTALLTIDQDQLEQLLDYVDERLGEQ
PCDHTARHAQRWAQSHRIEWETLAEGLQEFGGYCDCEIVMNVEPEAIFG
>Rv1724c HYPOTHETICAL PROTEIN
MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKY
WIQALAKHFQRQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAV
YNAVRNAGRAIENEQAALDHKLAEVRKRRMDTWDESYFR
>Rv0678 CONSERVED HYPOTHETICAL PROTEIN
MSVNDGVDQMGAEPDIMEFVEQMGGYFESRSLTRLAGRLLGWLLVCDPER
QSSEELATALAASSGGISTNARMLIQFGFIERLAVAGDRRTYFRLRPNAF
AAGERERIRAMAELQDLADVGLRALGDAPPQRSRRLREMRDLLAYMENVV
SDALGRYSQRTGEDD
>Rv0094c CONSERVED HYPOTHETICAL PROTEIN
MSTRQAAEADLAGKAAQYRPDELARYAQRVMDWLHPDGDLTDTERARKRG
ITLSNQQYDGMSRLSGYLTPQARATFEAVLAKLAAPGATNPDDHTPVIDT
TPDAAAIDRDTRSQAQRNHDGLLAGLRALIASGKLGQHNGLPVSIVVTTT
LTDLQTGAGKGFTGGGTLLPMADVIRMTSHAHHYSPASGRYPQAIFDHGT
PLALYHTKRLASPAQRIMLFANDRGCTKPGCDAPAYHSQAHHVTAWTSTG
RTDITELTLACGPDNRLAEKGWTTHNNTHGHTEWLPPPHLDHGQPRTNTF
HHPERFLHNQDDDDKPD
>Rv0008c POSSIBLE MEMBRANE PROTEIN
MSEQVETRLTPRERLTRGLAYSAVGPVDVTRGLLELGVGLGLQSARSTAA
GLRRRYREGRLAREVAAAQETLAQELTAAQDVVANLPQALQDARTQRRSK
HHLWIFAGIAAAILAGGAVAFSIVRRSSRPEPSPRPPSVEVQPRS
>Rv0743c HYPOTHETICAL PROTEIN
MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEAD
IAFVNDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDR
LVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVG
VIQARVLLLPEETDPRIGQRIAAWLNYYGAGNHSS
>Rv2694c CONSERVED HYPOTHETICAL PROTEIN
MGAQGYLRRLTRRLTEDLEQRDVEELSDEVLNAGAQRAIDCQRGQEVTVV
GTLRSVETNGKGCSGGVRAELFDGSDTVTLVWLGQRRIPGIDTGRTLRVR
GRLGKLENGTKAIYNPHYEIQR
>Rv2731 CONSERVED HYPOTHETICAL ALANINE AND ARGININE RICH PROTEIN
MTADEPRSDDSSGSAPQPAATPVPRPGPRPGPRPVPRPTSYPVGAHPPSD
PHRFGRIDDDGTVWLVSASGERIVGSWQAGDPEAAFAHFGRRFDDLSTEI
MLMDERLASGTGDARKIKAHAIALAETLPTACVLGDVDALADRLTSIRDR
AEVIAAADRSRREEHRAAQTARKEALAAEAEELAANATQWKVAGDRLRAI
LDEWKTISGVDRKVDDALWKRYSTARDTFNRRRGSHFAELDRERSGVRQS
KERLCERAEELSESTDWTATSAEFRKLLADWKAAGRASKDVDDALWRRFK
AAQDSFFTARNAATAEKEAELRANADAKEALLAEAERLDTTNHEAARAAL
RSIAEKWDAIGKVSRERAAELERRLRAVEKKVREAGEADWSDPQARARAE
QFRARAEQFEHQAEKAAAAGRTKEADEAKANAEQWRQWAEAAADALTRRP
>Rv1431 CONSERVED MEMBRANE PROTEIN
MGFLKPDLPDVDHDTWLTQPRRTRLQVVTRDWVEHGFGTPYAVYLLYLTK
IAVYVAAGAAIISLNPGLGGLSRIGDWWTQPIVYQKVIVFTLLFEVLGFG
CGSGPLTGRFWPPIGGFLYWLRPNTIRLPAWPDKVPFTQGDTRTVVDVAL
YAIVLIGGVWALLSPGSPGPGGTPVTAAGDVGLINPVLVVPTIVALGVLG
LRDKTIFLAARGEHYWLKLFVFFFPFTDQIAAFKIIMLCLWWGAATSKLN
HHFPYVVAVMTSNNALLRSRVFNPIKHLLYRDHANDLRPSWLPKLMAHGG
GTTAEFLVPGILVLVADGHPWRWFLIGFMVLFHLNILSNLPMGVPLEWNV
FFIFSLCYLFGHYGAITATDLRSPLLLAIVIAVVAVVIMGNLLPEKISFL
PAMRYYAGNWATSIWCFRGDAEATMETSVVKSSALVVNQLAKLYDGATAE
IMTDKVAAFRAMHTHGRALNGLLPRALDDEAHYRIREGEIVAGPLVGWNF
GEGHLHNEQLVAAVQRRCNFADGDLRVIILEGQPIHVQKQWYRIVDAKTG
LFEAGYVTVEDMLSRQPWPEPGDEFPVHVTTQRGTPSKP
>Rv2633c HYPOTHETICAL PROTEIN
MNAYDVLKRHHTVLKGLGRKVGEAPVNSEERHVLFDEMLIELDIHFRIED
DLYYPALSAAGKPITGTHAEHRQVVDQLATLLRTPQRAPGYEEEWNVFRT
VLEAHADVEERDMIPAPTPVHITDAELEELGDKMAARIEQLRGSPLYTLR
TKGKADLLKAI
>Rv2159c CONSERVED HYPOTHETICAL PROTEIN
MKFVNHIEPVAPRRAGGAVAEVYAEARREFGRLPEPLAMLSPDEGLLTAG
WATLRETLLVGQVPRGRKEAVAAAVAASLRCPWCVDAHTTMLYAAGQTDT
AAAILAGTAPAAGDPNAPYVAWAAGTGTPAGPPAPFGPDVAAEYLGTAVQ
FHFIARLVLVLLDETFLPGGPRAQQLMRRAGGLVFARKVRAEHRPGRSTR
RLEPRTLPDDLAWATPSEPIATAFAALSHHLDTAPHLPPPTRQVVRRVVG
SWHGEPMPMSSRWTNEHTAELPADLHAPTRLALLTGLAPHQVTDDDVAAA
RSLLDTDAALVGALAWAAFTAARRIGTWIGAAAEGQVSRQNPTG
>Rv3190c HYPOTHETICAL PROTEIN
MEYVQLFSKGRLNDLAGSLAGFLGKASQATAQRLQSWDADDLLNTPVDDV
VEQLVELGSVECPDLRVDDAFMLPATEVDQQYRDWGEQRTRRVTRLVLVV
PFEGHKDIFNLRPDQFTTMPPQVLRLQGHEIHLAIDNLSNDAAAINAAFH
KQIANIEKYLGWSRRQIDLHNQGLRNELPGMVARRREQLLATRNLQAEIG
FPVRRRKDADTYAAPISRKSVRPRPHRPAGARAAFKPEPAMQDEDYQSAL
RVLRNQRNALERTPSVAAKLDGEEIRDMLLVGLNAQFEGDAGGELFNGAG
KTDILIRVDDRNIFIGECKVWSGPRTMDDVLKQLFGYLVWRDTKAAILLF
IRNKDVTAVIDNAIAKIKEHPNHKRCPAHRAGADQYEFTMHADGDPEREI
HLTLIPFALRPTAEVPTTTIP
>Rv3740c CONSERVED HYPOTHETICAL PROTEIN
MSPIDALFLSAESREHPLHVGALQLFEPPAGAGRGFVRETYQAMLQCREI
APLFRKRPTSLHGALINLGWSTDADVDLGYHARRSALPAPGRVRELLELT
SRLHSNLLDRHRPLWETHVIEGLRDGRFAIYSKMHHALVDGVSGLTLMRQ
PMTTDPIEGKLRTAWSPATQHTAIKRRRGRLQQLGGMLGSVAGLAPSTLR
LARSALIEQQLTLPFGAPHTMLNVAVGGARRCAAQSWPLDRVKAVKDAAG
VSLNDVVLAMCAGALREYLDDNDALPDTPLVAMVPVSLRTDRDSVGGNMV
GAVLCNLATHLDDPADRLNAIHASMRGNKNVLSQLPRAQALAVSLLLLSP
AALNTLPGLAKATPPPFNVCISNVPGAREPLYFNGARMVGNYPMSLVLDG
QALNITLTSTADSLDFGVVGCRRSVPHVQRVLSHLETSLKELERAVGL
>Rv3354 CONSERVED HYPOTHETICAL PROTEIN
MNLRRHQTLTLRLLAASAGILSAAAFAAPAQANPVDDAFIAALNNAGVNY
GDPVDAKALGQSVCPILAEPGGSFNTAVASVVARAQGMSQDMAQTFTSIA
ISMYCPSVMADVASGNLPALPDMPGLPGS
>Rv0656c CONSERVED HYPOTHETICAL PROTEIN
MAAATTTGTHRGLELRAAQRAVGSCEPQRAEFCRSARNADEFDQMSRMFG
DVYPDVPVPKSVWRWIDSAQHRLARAGAVGALSVVDLLICDTAAARGLVV
LHDDADYELAERHLPDIRVRRVVSADD
>Rv2738c CONSERVED HYPOTHETICAL PROTEIN
MLAGVRLTEFHERVALHFGAAYGSSVLLDHVLTGFDGRSAAQAIEDGVEP
RDVWRALCADFDVPHDRW
>Rv2142c HYPOTHETICAL PROTEIN
MTRRLRVHNGVEDDLFEAFSYYADAAPDQIDRLYNLFVDAVTKRIPQAPN
AFAPLFKHYRHIYLRPFRYYVAYRTTDEAIDILAVRHGMENPNAVEAEIS
GRTFE
>Rv1352 CONSERVED HYPOTHETICAL PROTEIN
MARTLALRASAGLVAGMAMAAITLAPGARAETGEQFPGDGVFLVGTDIAP
GTYRTEGPSNPLILVFGRVSELSTCSWSTHSAPEVSNENIVDTNTSMGPM
SVVIPPTVAAFQTHNCKLWMRIS
>Rv0199 PROBABLE CONSERVED MEMBRANE PROTEIN
MPDGEQSQPPAQEDAEDDSRPDAAEAAAAEPKSSAGPMFSTYGIASTLLG
VLSVAAVVLGAMIWSAHRDDSGERTYLTRVMLTAAEWTAVLINMNADNID
ASLQRLHDGTVGQLNTDFDAVVQPYRQVVEKLRTHSSGRIEAVAIDTVHR
ELDTQSGAARPVVTTKLPPFATRTDSVLLVATSVSENAGAKPQTVHWNLR
LDVSDVDGKLMISRLESIR
>Rv3770A PROBABLE REMNANT OF A TRANSPOSASE
MGSTPWCPNPCQCTLRTPVEVLELAVALRPENPDRTAGAIQRILRAQLAG
DRIALRGRGS
>Rv0633c POSSIBLE EXPORTED PROTEIN
MVDSMGWVLSSWHEVTGVDSGTWLAWAAWAALGLGVVALVVTKRQIQRNR
RLAAEQTRPYVAMFMEPHVADWHVIELVVRNFGRTAAYDVRFSFPNPPTV
AQYENAANGYADVVELRLPQELPMLAPGQEWRMVWDSALDRAEIGRGIES
RFPGTVTYYDRPEQPRRWRFWRRGRRPLETKVVLDWDALPPVARIELMTT
HDLAKREKQKLELLRSLLTYFHYASKETRPDVFRSEIDRINRAAAETQDR
WRARQVEVPTEVSQRSEGQGPQPTRIPAG
>Rv3304 CONSERVED HYPOTHETICAL PROTEIN
MPLYAAYGSNMHPEQMLERAPHSPMAGTGWLPGWRLTFGGEDIGWEGALA
TVVEDPDSKVFVVLYDMTPADEKNLDRWEGSEFGIHQKIRCRVERISSDT
TTDPVLAWLYVLDAWEGGLPSARYLGVMADAAEIAGAPSDYVHDLRTRPA
RNIGPGTIA
>Rv3207c CONSERVED HYPOTHETICAL PROTEIN
MSTYGWRAYALPVLMVLTTVVVYQTVTGTSTPRPAAAQTVRDSPAIGVVG
TAILDAPPRGLAVFDANLPAGTLPDGGPFTEAGDKTWRVVPGTTPQVGQG
TVKVFRYTVEIENGLDPTMYGGDNAFAQMVDQTLTNPKGWTHNPQFAFVR
IDSGKPDFRISLVSPTTVRGGCGYEFRLETSCYNPSFGGMDRQSRVFINE
ARWVRGAVPFEGDVGSYRQYVINHEVGHAIGYLRHEPCDQQGGLAPVMMQ
QTFSTSNDDAAKFDPDFVKADGKTCRFNPWPYPIP
>Rv3714c CONSERVED HYPOTHETICAL PROTEIN
MLISRMSVRSASMSVMGDVFIGSEAITAGRLTRHELQRWYQPMFRGVYVS
RRSVPTLWDRTVGAWLATRRHGVIAGNAASALHGAQWVDVDVAIELISPT
TRPQHGLVIRRETLCDDEITRVVGLPVTTLARTAYDLGRHLSRGEAVARL
DALMRATPFSRDDVLLLAKRHAGARGVRRLRDVLPLVDGGAASPKETWLR
LLLIDAGLPVPTTQIPVVHRWRNVGVLDMGWEKYMVAAEYDGDQHRSDRG
RYVKDQRRLRKLAELGWIVIRVIAEDNPDDVVNRVRAALLARGWRP
>Rv1004c PROBABLE MEMBRANE PROTEIN
MSISCRVREGFVMRLAIVGTAAAAAIGGTLAVAPLTLSTPERVAGGTCSA
GQQCDRLAAVLMPDTATPSGPAAAEHAVPAPFEPVADTIAPGLVPRPGVP
AAAAVPRVGPPAVPGLPNIPGAAGPALPPPPALPNLAAPSVPGVGIPGIG
IPGIGIPGIGIPGVPDPITGVNTAAAVVNGVLGVGGTAAGVVTASAVAVT
YLVLAVNALESSGILPTARGTASTVASLLLPGAQSAAAALPAVGLPALPG
VTPASLLAMAAAAGLPGVGFPSLPGVSPTDLMAMAAAAGLPTSLPGLAGM
SPAELTALVAGGLPMLAAAGLPAGLAGVDPATLAAALPALAAGGLPPGLP
ALPGVDPAALAAALPALAAGLPALPAGLPPLPAVPALPAPPPLPGPPPLP
ALPSRLCTPGFGPIGVCIP
>Rv0900 POSSIBLE MEMBRANE PROTEIN
MDFVIQWSCYLLAFLGGSAVAWVVVTLSIKRASRDEGAAEAPSAAETGAQ
>Rv1371 PROBABLE CONSERVED MEMBRANE PROTEIN
MTNDLPDVRERDGGPRPAPPAGGPRLSDVWVYNGRAYDLSEWISKHPGGA
FFIGRTKNRDITAIVKSYHRDPAIVERILQRRYALGRDATPRDIHPKHNA
PAFLFKDDFNSWRDTPKYRFDDPNDLLHRVKARLAEPALAARIKRMDTLF
NAIVAVLAVGYFAVQGVRLVEPSWMPLWAFVIAMVLLRSSLAGFGHYALH
RAQRGLNRVFNNAFDLNYVALSLVTADGHTLLHHPYTQSEVDIKKNVFTM
MMRLPWLYRVPVHTIHKFGHMLSGMAIRIVDVFRITRKVGVEESYGSWRA
ALPHFLGSAGVRLLLVSELVVFAIAGDFWPWALQFVATLWVSTFLVVASH
EFEDDTQGGAVNGEDWGIDQLEHANDLTVIGNRYVDCFLSAGLSSHRVHH
VLPFQRSGFANIVTEDVLREEAAKFGVEWLPAKGFITDRLPRLCRKYLLT
PSRQAKERHWGFVREHCSPAALKASASYVVAGFVGIGSV
>Rv3108 HYPOTHETICAL PROTEIN
MTPNAASTGDSAKNTITGCCLITARALVARTRSISLPGMPFRMPADYHNA
SSDEPTNRHPWPAPARCCRHEWRTMRRTNACDRRRFGLSLTIHEDACRII
SVVPVVLEVRRAEPAHPATPYPEPLARCSRSPGLNESSHMSGRIPP
>Rv3572 HYPOTHETICAL PROTEIN
MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC
HPNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATS
QFERVMKAGGSDVITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVI
VCTHMQVVYPGVNLTSPSTCAQANFS
>Rv0358 CONSERVED HYPOTHETICAL PROTEIN
MYTAENAPGVAVLLSGDADVPGPLTGLPTHQDNLDTVIGRYSRLIVVGAD
ADLGAVLTRLLRTDRLDVEVGYVPRRRSPATRAYRLPAGRRAARRARCGV
ARRVPLIRDETGSVIVGRAQWLPAEEQALIHGEAVVDDTVLFDGDVAGVC
IEPTLTLPGLRAAVDGAGKWRRWIGGRAAQLGTTGAAVLRDGVAAPRPVR
RSTFYRNVEGWLLVR
>Rv3489 HYPOTHETICAL PROTEIN
MSTKSDHGEIGDVEPLADSTASQARRVVAAYANDADECRIFLSMLGIGPA
KLES
>Rv3258c CONSERVED HYPOTHETICAL PROTEIN
MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGP
LATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADA
VREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGR
RRGHLRVLPDPAD
>Rv1053c HYPOTHETICAL PROTEIN
MDSHKVCMNNNTQLPTGPIIGVHPAVRDGVERVAYLDGDLLRCNTDVEFT
SSPPPGPVLYRTKHTRVEIADEMVTEKLIKRQRAFNSRRHQ
>Rv0713 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MAGSDPPTGGPASQAGSDAGASPEHKHMSRRKHLVLDVCIILGVLIAYVF
SLLGYDWLAHTPGPLPQPDVGTTDDTVVLIRFEELHTVANRLDVKVLVLP
DDSMIDHRLQVLTTDTSVRLYPENELGDLQYPVGKLPAQVATTIEAHGNP
GAWPFDTYTTDTVQADVLVGAGDNRQYVPARVEVTGSLEGWDISAVRVGE
SSQTSDRPDNVIITLKRAKGPLVFDLGICLVLITLPTLALFVAIQMITGR
RKFQPPFGTWYAAMLFAVVPLRTILPGSPPAGAWIDRAVVIWVLIALAAA
MVVYIVAWYRESD
>Rv3705A CONSERVED HYPOTHETICAL PROLINE RICH PROTEIN
MTETPQPAAPPPSAATTSPPPSPQQEKPPRLYRAAAWVVIVAGIVFTVAV
IFFSGALVLGQGKCPYHRYYHHGMFRPVGPVAPGPGMGWVFGFPGGPPPP
GMGPGFPGGPGGPAVGPTGPGPTTAPARP
>Rv0888 PROBABLE EXPORTED PROTEIN
MDYAKRIGQVGALAVVLGVGAAVTTHAIGSAAPTDPSSSSTDSPVDACSP
LGGSASSLAAIPGASVPQVGVRQVDPGSIPDDLLNALIDFLAAVRNGLVP
IIENRTPVANPQQVSVPEGGTVGPVRFDACDPDGNRMTFAVRERGAPGGP
QHGIVTVDQRTASFIYTADPGFVGTDTFSVNVSDDTSLHVHGLAGYLGPF
HGHDDVATVTVFVGNTPTDTISGDFSMLTYNIAGLPFPLSSAILPRFFYT
KEIGKRLNAYYVANVQEDFAYHQFLIKKSKMPSQTPPEPPTLLWPIGVPF
SDGLNTLSEFKVQRLDRQTWYECTSDNCLTLKGFTYSQMRLPGGDTVDVY
NLHTNTGGGPTTNANLAQVANYIQQNSAGRAVIVTGDFNARYSDDQSALL
QFAQVNGLTDAWVQVEHGPTTPPFAPTCMVGNECELLDKIFYRSGQGVTL
QAVSYGNEAPKFFNSKGEPLSDHSPAVVGFHYVADNVAVR
>Rv1519 CONSERVED HYPOTHETICAL PROTEIN
MRCGCLACDGVLCANGPGRPRRPALTCTAVATRTLHSLATNAELVESADL
TVTEDICSRIVSLPVHDHMAIADVARVVAPFGEGLARGG
>Rv1158c CONSERVED HYPOTHETICAL ALA-, PRO-RICH PROTEIN
MPTIWTFVRAAAVLVGSSAALLTGGIAHADPAPAPAPAPNIPQQLISSAA
NAPQILQNLATALGATPPLSAPKVAEPAPAAPGITATFPGLTPAAPAAAA
APALTPSIPGVNAPIPGITPAAPALPVTAPAAAPTIPGVNAPIPGITAPA
PAAAAVPASVPGVPSAKVDLPQLPYLPLQVPQQLSLPADLPALASGVIPA
APIAPTPPAPGAPALPPGPPSLLAALP
>Rv1567c Probable hypothetical membrane protein
MVTMTSWPSRLFAFTDNVCPPDACPLVPFGVNYYIYPVMWGGIGAAIATA
VIGPFVSMLKGWYMSFWPIISIAVITVTSIAGYAIAGFSERYWH
>Rv1052 HYPOTHETICAL PROTEIN
MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMPLT
PGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMP
TAVEGSLSLDELLSDHFVRDLHARMFGPV
>Rv2169c PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MPLSDHEQRMLDQIESALYAEDPKFASSVRGGGFRAPTARRRLQGAALFI
IGLGMLVSGVAFKETMIGSFPILSVFGFVVMFGGVVYAITGPRLSGRMDR
GGSAAGASRQRRTKGAGGSFTSRMEDRFRRRFDE
>Rv2758c CONSERVED HYPOTHETICAL PROTEIN
MHRGYALVVCSPGVTRTMIDIDDDLLARAAKELGTTTKKDTVHAALRAAL
RASAARSLMNRMAENATGTQDEALVNAMWRDGHPENTA
>Rv2422 HYPOTHETICAL PROTEIN
MPASVSTVLVDTSVAVAPVVADHDHHEDTFQALRGRTLGLAGHAAFERRT
LATVAKLLAHTFPATRFLGAGAAMSLLPELAPAEIAGGAV
>Rv1510 conserved probable membrane protein
MYERRHERGMCDRAVEMTDVGATAAPTGPIARGSVARVGAATALAVACVY
TVIYLAARDLPPACFSIFAVFWGALGIATGATHGLLQETTREVRWVRSTQ
IVAGHRTHPLRVAGMIGTVAAVVIAGSSPLWSRQLFVEGRWLSVGLLSVG
VAGFCAQATLLGALAGVDRWTQYGSLMVTDAVIRLAVAAAAVVIGWGLAG
YLWAATAGAVAWLLMLMASPTARSAASLLTPGGIATFVRGAAHSITAAGA
SAILVMGFPVLLKVTSDQLGAKGGAVILAVTLTRAPLLVPLSAMQGNLIA
HFVDRRTQRLRALIAPALVVGGIGAVGMLAAGLTGPWLLRVGFGPDYQTG
GALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYLLGWVSATVASTLLLLL
PMPLETRTVIALLFGPTVGIAIHVAALARRPD
>Rv3895c PROBABLE CONSERVED MEMBRANE PROTEIN
MPLSLSNRDQNSGHLFYNRRLRAATTRFSVRMKHDDRKQTAALALSMVLV
AIAAGWMMLLNVLKPTGIVGDSAIIGDRDSGALYARIDGRLYPALNLTSA
RLATGTAGQPTWVKPAEIAKYPTGPLVGIPGAPAAMPVNRGAVSAWAVCD
TAGRPRSADKPVVTSIAGPITGGGRATHLRDDAGLLVTFDGSTYVIWGGK
RSQIDPTNRAVTLSLGLDPGVTSPIQISRALFDGLPATEPLRVPAVPEAG
TPSTWVPGARVGSVLQAQTAGGGSQFYVLLPDGVQKISSFVADLLRSANS
YGAAAPRVVTPDVLVHTPQVTSLPVEYYPAGRLNFVDTAADPTTCVSWEK
ASTDPQARVAVYNGRGLPVPPSMDSRIVRLVRDDRAPASVVATQVLVLPG
AANFVTSTSGVITAESRESLFWVSGNGVRFGIANDEATLRALGLDPGAAV
QAPWPLLRTFAAGPALSRDAALLARDTVPTLGQVAIVTTTAKAGA
>Rv1116 HYPOTHETICAL PROTEIN
MCSRMADEPRLEAGAHPFEEGRDKAPELRATQMDHVRFTEGRRERNRDRL
ERSQQFRQPGR
>Rv2036 CONSERVED HYPOTHETICAL PROTEIN
MIAADDDTEKSMMDMARAERAELAAFLTTLTLQQWETPSLCAGWSVKEVV
AHMISYEDLGVFGLLKRFAKGRIVRANEVGVDEFAGLSPQELADYVGRHL
QPRGLTAGFGGMIALVDGMIHHQDIRRPLGQPRTIPAQRLDRVLRLMPKN
PRLRARPRIKGLRLRATDLDWTIGTGPEVTGPGEALLMAMAGRPAAVSDL
SGPGKPTLAGRLG
>Rv3527 HYPOTHETICAL PROTEIN
MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADD
PQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN
TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPNR
>Rv2661c HYPOTHETICAL PROTEIN
MRARSDAGGQSVKSRTSNRSRSSRRSRVRSSISALVDNPQARPRELPVLC
GWPVVRVEPVCEFVPEPVCGQAEVLGEPAAAHRVTSARRSPSTTVCSRSQ
KASAVVISSVSSVARVRRASVSSVDATTA
>Rv1575 Probable phiRV1 phage protein
MEPKPSQRHTDKEVGAALGISAGTYKRLKRIDNATRSDDKEIRLFAEKQM
APLAAGSPSWNGRKPSSGNRKAATMAARLDILAWGPWAPSQNRSVVRRKQ
TLLSAQPSASPPAPTGGSNESTTQPAASWRVGGPAPLSRGRPRLALSYLR
GSLHLQNSKRVAHQHI
>Rv2097c CONSERVED HYPOTHETICAL PROTEIN
MQRRIMGIETEFGVTCTFHGHRRLSPDEVARYLFRRVVSWGRSSNVFLRN
GARLYLDVGSHPEYATAECDSLVQLVTHDRAGEWVLEDLLVDAEQRLADE
GIGGDIYLFKNNTDSAGNSYGCHENYLIVRAGEFSRISDVLLPFLVTRQL
ICGAGKVLQTPKAATYCLSQRAEHIWEGVSSATTRSRPIINTRDEPHADA
EKYRRLHVIVGDSNMSETTTMLKVGTAALVLEMIESGVAFRDFSLDNPIR
AIREVSHDVTGRRPVRLAGGRQASALDIQREYYTRAVEHLQTREPNAQIE
QVVDLWGRQLDAVESQDFAKVDTEIDWVIKRKLFQRYQDRYDMELSHPKI
AQLDLAYHDIKRGRGIFDLLQRKGLAARVTTDEEIAEAVDQPPQTTRARL
RGEFISAAQEAGRDFTVDWVHLKLNDQAQRTVLCKDPFRAVDERVKRLIA
SM
>Rv0739 CONSERVED HYPOTHETICAL PROTEIN
MVLTRRAREVALTQHIGVSAETDRAVVPKLRQAYDSLVCGRRRLGAIGAE
IENAVAHQRALGLDTPAGARNFSRFLATKAHDITRVLAATAAESQAGAAR
LRSLASSYQAVGFGPKPQEPPPDPVPFPPYQPKVWAACRARGQDPDKVVR
TFHHAPMSARFRSLPAGDSVLYCGNDKYGLLHIQAKHGRQWHDIADARWP
SAGNWRYLADYAIGATLAYPERVEYNQDNDTFAVYRRMSLPDGRYVFTTR
VIISARDGKIITAFPQTT
>Rv1113 CONSERVED HYPOTHETICAL PROTEIN
MRTTVTVDDALLAKAAELTGVKEKSTLLREGLQTLVRVESARRLAALGGT
DPQATAAPRRRTSPR
>Rv2273 PROBABLE CONSERVED TRANSMEMBRANE PROTEIN
MNRHSTAASDRGLQAERTTLAWTRTAFALLVNGVLLTLKDTQGADGPAGL
IPAGLAGAAASCCYVIALQRQRALSHRPLPARITPRGQVHILATAVLVLM
VVTAFAQLL
>Rv1209 CONSERVED HYPOTHETICAL PROTEIN
MALVLVYLVVLVLVAIVLFAAASLLFGRGEQLPPLPRATTATTLPAFGVT
RADVDAVKFTQVLRGYKTSEVDWVLERLGRELEALRSQLGAIHASSEDAE
AESDASNPSRGETVVHYRSDPA
>Rv2342 CONSERVED HYPOTHETICAL PROTEIN
MIGYVAVLGLGYVLGAKAGRRRYEQIASTYRALTGSPVARSMIEGGRRKI
ANRISPDAGFVTLAEIDNQTAVVQRGVERQPKTAR
>Rv2980 POSSIBLE CONSERVED SECRETED PROTEIN
MTGESDGPPRAVLIAAAALAAAVIGVILVVAANRQPPERPVVIPAVPAPQ
ATGPGCKALLAALPQRLGEYRRAPVAEPTTAGATAWRTGPNSTPVILRCG
LDRPAEFVVGSAIQVVDRVQWFQVAAQNPDEPGRSTWYTVDRPVYVALTL
PSGSGPTAIQELSDVIDHTIPAVPIDPAPAR
>Rv2174 Possible conserved integral membrane protein
MTTPSHAPAVDLATAKDAVVQHLSRLFEFTTGPQGGPARLGFAGAVLITA
GGLGAGSVRQHDPLLESIHMSWLRFGHGLVLSSILLWTGVGVMLLAWLGL
GRRVLAGEATEFTMRATTVIWLAPLLLSVPVFSRDTYSYLAQGALLRDGL
DPYAVGPVGNPNALLDDVSPIWTITTAPYGPAFILVAKFVTVIVGNNVVA
GTMLLRLCMLPGLALLVWATPRLASHLGTHGPTALWICVLNPLVLIHLMG
GVHNEMLMVGLMTAGIALTVQGRNVAGIILITVAIAVKATAGIALPFLVW
VWLRHLRERRGYRPVQAFLAAAAISLLIFVAVFAVLSAVAGVGLGWLTAL
AGSVKIINWLTVPTGAANVIHALGRGLFTVDFYTLLRITRLIGIVIIAVS
LPLLWWRFRRDDRAALTGVAWSMLIVVLFVPAALPWYYSWPLAVAAPLAQ
ARRAIAAIAGLSTWVMVIFKPDGSHGMYSWLHFWIATACALTAWYVLYRS
PDRRGVQAATPVVNTP
>Rv1947 HYPOTHETICAL PROTEIN
MDRYNDQASGRALIEIRLCNERATPMPIPIGLWMFQTKLHVNAGGADVFL
PVCDVLEQDLAERDEEVRQLNLQYRNRLEYAIGRTCSAAWSVNGSRRPSA
VWTTWLPVAETPHTRARSVENALLSMDSRGGVT
>Rv1906c CONSERVED HYPOTHETICAL PROTEIN
MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTY
AVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP
TDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTG
GRVPQP
>Rv3851 POSSIBLE MEMBRANE PROTEIN
MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTA
QLSTATPAPAVAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR
>Rv1871c CONSERVED HYPOTHETICAL PROTEIN
MNAAMNLKREFVHRVQRFVVNPIGRQLPMTMLETIGRKTGQPRRTAVGGR
VVDNQFWMVSEHGEHSDYVYNIKANPAVRVRIGGRWRSGTAYLLPDDDPR
QRLRGLPRLNSAGVRAMGTDLLTIRVDLD
>Rv0813c CONSERVED HYPOTHETICAL PROTEIN
MSSGAGSDATGAGGVHAAGSGDRAVAAAVERAKATAARNIPAFDDLPVPA
DTANLREGADLNNALLALLPLVGVWRGEGEGRGPDGDYRFGQQIVVSHDG
GDYLNWESRSWRLTATGDYQEPGLREAGFWRFVADPYDPSESQAIELLLA
HSAGYVELFYGRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDL
AYVEERVDADGGLVPHLSARLSRFVG
>Rv0030 HYPOTHETICAL PROTEIN
MVSGSDSRSEPSQLSDRDLVESVLRDLSEAADKWEALVTQAETVTYSVDL
GDVRAVANSDGRLLELTLHPGVMTGYAHGELADRVNLAITALRDEVEAEN
RARYGGRLQ
>Rv2033c CONSERVED HYPOTHETICAL PROTEIN
MLDRYGTDVLAAGGRRRPRSVEHPVELGMVVEDAETGYVGAVVRVEYGRI
DLEDRYGKTRGFPLGPGYLLDGLPVILTAPRCAAAAGPRRTASGSVAVPG
ARARVARASRIYVEGRHDAELIAAVWGADLRIEGVVVEHLGGVDDLVEIV
AKFRPGPRRRLGVLVDHLVAGSKEARIAEVVRRGPGGSDTLVVGHPYVDI
WQAVKPQRVGLAAWPRVPRHIEWKHGVCDALGWPHADQADIAAAWRRIRS
QVRDWTDLEPALIGRVEELIDFVTQPAGDE
>Rv0151c PE1, PE FAMILY PROTEIN
MAPFGFTPKARHNRGVALRSTYRLDGWVMGPVDKEGWGLSYVFAQPSVLA
AAATDLAGIGSAINQATAAVAAPTTGLAAAAADEVSTALATLFGAYGQQF
QAISAQVAAFHNEFTQRLAAAANAFVNAEATNTSALVQEATAGLFKPTSP
PVLPPMFNQNTAIIMGGTGSPIPTPSYVNAITTLFIDPVVSNPVVKALVT
PEELYPITGVKSLPFQTSVQLGLQILDGAIWEQINAGNHVTVFGYSQSAV
IASLEMQHLISLGPNAPSPSQLNFILIGNEMNPNGGILARIPGLNVTTLG
LPFYGATPDNPYPTTTYTLEYDGFADFPRYPLNVLSDINAVFGILTVHTT
YADLTPAQIASATQLPTQGTTSNTYYIIETEHLPLLAPLRAIPVIGPPLA
ALVEPNLEVIVNLGYGDPRFGYSTSPANVPTPFGLFPDVPASVVADALVA
GTQQGVNDFMVELPAALNTLPQTPMPAFPPYVPTLLPPPPPPQPATLINI
ADTFASVVSTGYSILLPTADLGLAFVTILPAYDLTLFVNQLAAGNLRAAI
ELPLAATIGLAALGGMIEFIAIVVTLADITQQLQSFSI
>Rv1089 PE10, PE FAMILY PROTEIN
SFAGAEAANASQLQSIARQVRGAVNAVAGQVTGNGGSGNSGTSAAAANPN
SDNTASIADRGTSAIMTTASATASSTGVDGGIAATYAVASQWDGGYVANY
TITQFGRDFDDRLAVAIHFA
>Rv1169c PE11, PE FAMILY PROTEIN
MSFVTTRPDSIGETAANLHEIGVTMSAHDDGVTPLITNVESPAHDLVSIV
TSMLFSMHGELYKAIARQAHVIHESFVQTLQTSKTSYWLTELANRAGTST
>Rv1172c PE12, PE FAMILY PROTEIN
MSFVFAAPEALAAAAADMAGIGSTLNAANVVAAVPTTGVLAAAADEVSTQ
VAALLSAHAQGYQQLSRQMMTAFHDQFVQALRASADAYATAEASAAQTMV
NAVNAPARALLGHPLISADASTGGGSNALSRVQSMFLGTGGSSALGGSAA
ANAAASGALQLQPTGGASGLSAVGALLPRAGAAAAAALPALAAESIGNAI
KNLYNAVEPWVQYGFNLTAWAVGWLPYIGILAPQINFFYYLGEPIVQAVL
FNAIDFVDGTVTFSQALTNIETATAASINQFINTEINWIRGFLPPLPPIS
PPGFPSLP
>Rv1195 PE13, PE FAMILY PROTEIN
MSFVMAYPEMLAAAADTLQSIGATTVASNAAAAAPTTGVVPPAADEVSAL
TAAHFAAHAAMYQSVSARAAAIHDQFVATLASSASSYAATEVANAAAAS
>Rv1214c PE14, PE FAMILY PROTEIN
MLASAATDLAGIGSALSAANAAAAAPTTAMLAACADEVSAVVASLFARHA
QAYQALSLQATAFHQQFVQALTGAGGAYAAAEAVNAAVAQSVQQDVLNVI
NAPTQALFDR
>Rv1386 PE15, PE FAMILY PROTEIN
MTLRVVPESLAGASAAIEAVTARLAAAHAAAAPFIAAVIPPGSDSVSVCN
AVEFSVHGSQHVAMAAQGVEELGRSGVGVAESGASYAARDALAAASYLSG
GL
>Rv1430 PE16, PE FAMILY PROTEIN
MSFVFAVPEMVAATASDLASLGAALSEATAAAAIPTTQVLAAAADEVSAA
IAELFGAHGQEFQALSAQASAFHDRFVRALSAAAGWYVDAEAANAALVDT
AATGASELGSGGRTALILGSTGTPRPPFDYMQQVYDRYIAPHYLGYAFSG
LYTPAQFQPWTGIPSLTYDQSVAEGAGYLHTAIMQQVAAGNDVVVLGFSQ
GASVATLEMRHLASLPAGVAPSPDQLSFVLLGNPNNPNGGILARFPGLYL
QSLGLTFNGATPDTDYATTIYTTQYDGFADFPKYPLNILADVNALLGIYY
SHSLYYGLTPEQVASGIVLPVSSPDTNTTYILLPNEDLPLLQPLRGIVPE
PLLDLIEPDLRAIIELGYDRTGYADVPTPAALFPVHIDPIAVPPQIGAAI
GGPLTALDGLLDTVINDQLNPVVTSGIYQAGAELSVAAAGYGAPAGVTNA
IFIGQQVLPILVEGPGALVTADTHYLVDAIQDLAAGDLSGFNQNLQLIPA
TNIALLVFAAGIPAVAAVAILTGQDFPV
>Rv1646 PE17, PE FAMILY PROTEIN
MSFLTVAPDMVTAAAGNLESVGSALNEAAAAAAPATVGLAAPAADRVSAV
VAAMLGAYARDFQGISAQIAGFHNQFVGALRGGAAAYASAEAANVQQTVV
NAVNAPAQALLGHPLIGPETVGSSAAAVSFGFGPLLLAGSDPLLAVPFSY
PASLPTPFGPVTMTLNGSFDPLTQQVVFDSGSLTAPAPFVYGLGAVGPAL
TTMTALQNSGTAFSGAVQSGNLLGAAGALLQAPGNAVTGFLFGQTAISQS
IPGPSNLGYESVGISVPVGGLLAPLQPVTVTLTPTSGMPTAIQLSGTQFG
GLLPALLNGF
>Rv1788 PE18, PE FAMILY PROTEIN
MSFVTTQPEALAAAAGSLQGIGSALNAQNAAAATPTTGVVPAAADEVSAL
TAAQFAAHAQIYQAVSAQAAAIHEMFVNTLQMSSGSYAATEAANAAAAG
>Rv1791 PE19, PE FAMILY PROTEIN
MSFVTTQPEALAAAAANLQGIGTTMNAQNAAAAAPTTGVVPAAADEVSAL
TAAQFAAHAQMYQTVSAQAAAIHEMFVNTLVASSGSYAATEAANAAAAG
>Rv0152c PE2, PE FAMILY PROTEIN
MRCRPPSRNRSAHTARNTRPCSLKSRRFTVRFHQTLAAAANSYADAEAAI
ASTRQNQLAVPAAAPTPAAAAMIPPFPANLTTLFFGPTGIPLPPPSMLTP
PIRCRSVRRALQAVFTPEELYPLTGVRSLVLNTSVEEGLTILHDAIMVEL
ATTGNAVTVFGWSQSAIIASLEMQRFTAMGGAAPSASDLNFVLVGNEMNP
NGGMLARFPDLTLPTLDLTFYGATPSDTIYPTAIYTLEYDGFADFSRYPL
NFISDLNAVAGITFVHTKYLDLTPAQVEGATKLPTSPGYTGVTDYYIIRT
ENRPLLQPLRAVPVIGDPLADLIQPNLKVIVNLGYGDPNYGYSTSYADVR
TPFGLWPNVPPQVIADALAAGTQEGILDFTADLQALSAQPLTLPQIQLPQ
PADLVAAVAAAPTPAEVVNTLARIISTNYAVLLPTVDIALALVTTLPLYT
TQLFVRQLAAGNLINAIGYPLAATVGLGTIDSGRRGIAHPPRGGLGHRSK
HRGPRHLTDSRRHRRPPTTVYRPRQ
>Rv1806 PE20, PE FAMILY PROTEIN
MAFVLVCPDALAIAAGQLRHVGSVIAARNAVAAPATAELAPAAADEVSAL
TATQFNFHAAMYQAVGAQAIAMNEAFVAMLGASADSYAATEAANIIAVS
>Rv2099c PE21, PE FAMILY PROTEIN
MSFVIASPEALLAAATDLAAIRSTIRAANAAAAVPTTGALAPAADEVSAG
IAALFGAQ
>Rv2107 PE22, PE FAMILY PROTEIN
MSFVNVDPFGMLAAAATLESLGSHMAVSNAAVASVTTKVPPPAADYVSKK
LSLFFSSHGQQYQVQAARGTAFHRKLVRTLANGALAYEEVEIANNEGF
>Rv2408 PE24, POSSIBLE PE FAMILY-RELATED PROTEIN
MLIARPDILCSRGPEAMRAKAADLDLAAAAKTVGVQPAADQVAAAIAAIL
LSHAQIYQDISTQMAAFHDQLVENRTADSTSYASAEANAQQSLLNAMDAP
SWQQRRETVGEVGLPADPAGSGTATAAVAAATTARAGSRSAAQATVAPIG
GLKLRRESALSQPGDLHHHVEVGDALPRVDPFQRGNVGVVAAYTHTDVLL
GDLIVIGGVVVPPSTGPGLNPGMAAPVYRLSHHGITLRV
>Rv2519 PE26, PE FAMILY PROTEIN
MSRLIVAPDWLASAAAEVQSIGSALSAANAAAAAPTTLLVAAAEDEVSAA
AAALFANYGREYQTLSVRFASLDQQFAQALNSAAASYQTAEATGASLVQT
ATQGVLGVINAPTEFMFGRSLIGDGADGTAASPIGEPGGILYGDGGNGYS
QTTPGAVGGAGGSAGFIGNGGAGGAGGPGAGGGTGGLGGWLWGNNGAAGT
GDPVNVAVPLRVENNFPLVNLLVNRGPTVPILLDTGSSSLVIPFWKIGWQ
NLGLPTGFDVVHYGNGVSIVYADVPTTVDFGGGAATTPTSVHVGILPYPR
NLDSLVLIASGGAFGPNGNGILGIGPNVGSYAVSGPGNVVTTDLPGQLNE
GTLIDIPGGYMQFGPNTGTPITSVTGAPITVLNVQIGGYDPNGGYWSLPS
IFDSGGNHGTLPAVILGTGQTTGYAPPGTVISISIHDNQTLLYQYTTTAS
NSPVVTADPRLNTGLTPFLLGPVYISNNPSGVGTVVFNYPPP
>Rv2769c PE27, PE FAMILY PROTEIN
MSFLTTQPEELAAAAGKLETIGSAMVAQNAAAAAPTTTGVIPAAADEISV
LQAPLFTAYGTLYQQVSAEAAAVYDLFVKTLGVSAGTYAATEAANSSAAA
SPLSGIASILGSTPGKVPSWISDIANIFNIGAGNWASAASDLLGLASGGL
LPAAEEAALEEGLEGAGLSELGAAEAAVGEAPIAAGLGAAPLAAGLSRAS
SIGALSVPPSWAGQANLVSSTSTLQGAGWTTAAPHGAAGTVIPGMPGLAS
ATRSSAGFGAPRYGAKPIVVPKPAV
>Rv3018A PE27A, PE FAMILY PROTEIN
MTLSVVPEGLAAASAAVEALTARLAAAH
>Rv3022A PE29, PE FAMILY PROTEIN
MTLRVVPEGLAAASAAVEALTARLAAAHAGAAPAITAVVAPAADPVSLQS
AVGFSALGSEHAAIAGEGVEELGRSGVAVGESGIGYAAGDAVAAATYLVS
GGSL
>Rv0159c PE3, PE FAMILY PROTEIN
MSYVIAAPEMLATTAADVDGIGSAIRAASASAAGPTTGLLAAAADEVSSA
AAALFSEYARECQEVLKQAAAFHGEFTRALAAAGAAYAQAEASNTAAMSG
TAGSSGALGSVGMLSGNPLTALMMGGTGEPILSDRVLAIIDSAYIRPIFG
PNNPVAQYTPEQWWPFIGNLSLDQSIAQGVTLLNNGINAELQNGHDVVVF
GYSQSAAVATNEIRALMALPPGQAPDPSRLAFTLIGNINNPNGGVLERYV
GLYLPFLDMSFNGATPPDSPYQTYMYTGQYDGYAHNPQYPLNILSDLNAF
MGIRWVHNAYPFTAAEVANAVPLPTSPGYTGNTHYYMFLTQDLPLLQPIR
AIPFVGTPIAELIQPDLRVLVDLGYGYGYADVPTPASLFAPINPIAVASA
LATGTVQGPQAALVSIGLLPQSALPNTYPYLPSANPGLMFNFGQSSVTEL
SVLSGALGSVARLIPPIA
>Rv3477 PE31, PE FAMILY PROTEIN
MSFTAQPEMLAAAAGELRSLGATLKASNAAAAVPTTGVVPPAADEVSLLL
ATQFRTHAATYQTASAKAAVIHEQFVTTLATSASSYADTEAANAVVTG
>Rv3622c PE32, PE FAMILY PROTEIN
MSIMHAEPEMLAATAGELQSINAVARAGNAAVAGPTTGVVPAAADLVSLL
TASQFAAHAQLYQAISAEAMAVQEQLATTLGISAGSYAATEAANAATIA
>Rv3650 PE33, PE FAMILY PROTEIN
MSFVIAAPEALDSAATDLVVLGSTLGAATAAAAAQTTGIVAAAHDEVSAA
IAALFSAHGQAYQAASAQAAAFHTRFIRARSRHPQQETTCRRVR
>Rv3746c PE34, PROBABLE PE FAMILY PROTEIN (PE FAMILY-RELATED PROTEIN)
MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWA
VTAFTTAATGLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEA
IPRPGQTLARE
>Rv3872 PE35, PE FAMILY-RELATED PROTEIN
MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQA
ATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE
>Rv3893c PE36, PE FAMILY PROTEIN
MVWSVQPEAVLASAAAESAISAETEAAAAGAAPALLSTTPMGGDPDSAMF
SAALNACGASYLGVVAEHASQRGLFAG
>Rv0160c PE4, PE FAMILY PROTEIN
MSHLVTAPDMLATAAAHVDEIASTLRAANAAAAGPTCNLLAAAGDEVSAA
TAALFSAYGREYQAVVKQAAAFHSEFTRTLEAAGNAYAHAEAANAARVSH
ALDTINAPIRTLLGRAPLSPNGSSGAGGLPAIAQLAAESPITALIMGGTN
NPLPDPEYVTDINKAFIQTLFPGAVSQGLFTPEQFWPVTPDLGNLTFNQS
VTEGVALLNTAVNNQLALDNKVVAFGYSQSATIINNYINSLMAMGSPNPD
DISFVMIGSGNNPVGGLLARFPGFYIPFLDVPFNGATPANSPYPTHIYTA
QYDGIAHAPQFPLRILSDINAFMGYFYVHNTYPELMATQVDNAVPLPTSP
GYTGNTQYYMFLTQDLPLLQPIRDIPYAGPPIADLFQPQLRVLVDLGYAD
YGPGGNYADIPTPAGLFSIPNPFAVTYYLIKGSLQAPYGAIVEIGVEAGL
IGPEWFPDSYPWVPSINPGLNFYFGQPQVTLLSLMSGGLGNILHLIPPPV
FT
>Rv0285 PE5, PE FAMILY PROTEIN
MTLRVVPEGLAAASAAVEALTARLAAAHASAAPVITAVVPPAADPVSLQT
AAGFSAQGVEHAVVTAEGVEELGRAGVGVGESGASYLAGDAAAAATYGVV
GG
>Rv0916c PE7, PE FAMILY PROTEIN
MSFVTIQPVVLAAATGDLPTIGTAVSARNTAVCAPTTGVLPPAANDVSVL
TAARFTAHTKHYRVVSKPAALVHGMFVALPAATADAYATTEAVNVVATG
>Rv1040c PE8, PE FAMILY PROTEIN
MSFLKTVPEELTAAAAQLGTIGAAMAAQNAAAAAPTTAIAPAALDEVSAL
QAALFTAYGTFYQQVSAEAQAMHDMFVNTLGISAGTYGVTESLNSSAAAS
PLSGITGEASAIIQATTGLFPPELSGGIGNILNIGAGNWASATSTLIGLA
GGGLLPAEEAAEAASALGGEAALGELGALGAAEAALGEAGIAAGLGSASA
IGMLSVPPAWAGQATLVSTTSTLPGAGWTAAAPQAAAGTFIPGMPGVASA
ARNSAGFGAPRYGVKPIVMPKPATV
>Rv1088 PE9, PE FAMILY PROTEIN
MSYMIATPAALTAAATDIDGIGSAVSVANAAAVAATTGVLAAGGDEVLAA
IARLFNANAEEYHALSAQVAAFQTLFVRTLTGGCGVFRRRRGRQCVTAAE
HRAAGAGRRQRRRRSGDGQWRLRQQRHFGCGGQPEFRQHSEHRR
>Rv0109 PE_PGRS1, PE-PGRS FAMILY PROTEIN
MSLLITSPATVAAAATHLAGIGSALSTANAAAAAPTTALSVAGADEVSVL
IAALFEAYAQEYQALSAQALAFHDQFVQALNMGAVCYAAAETANATPLQA
LQTVQQNVLTVVNAPTQALLGRPIIGNGANGLPNTGQDGGPGGLLFGNGG
NGGSGGVDQAGGNGGAAGLIGNGGSGGVGGPGIAGSAGGAGGAGGLLFGN
GGPGGAGGIGTTGDGGPGGAGGNAIGLFGSGGTGGMGGVGGMGGVGNGGN
AGNGGTAGLFGHGGAGGAGGIGSADGGLGGGGGNGRFMGNGGVGGAGGYG
ASGDGGNAGNGGLGGVFGDGGAGGTGGLGDVNGGLAGIGGNAGFVRNGGA
GGNGQLGSGAVSSAGGMGGNGGLVFGNGGPGGLGGPGTSAGNGGMGGNAV
GLFGQGGAGGAGGSGFGAGIPGGRGGDGGSGGLIGDGGTGGGAGAGDAAA
SAGGNGGNARLIGNGGDGGPGMFGGPGGAGGSGGTIFGFAGTPGPS
>Rv0747 PE_PGRS10, PE-PGRS FAMILY PROTEIN
MSWVMVSPELVVAAAADLAGIGSAISSANAAAAVNTTGLLTAGADEVSTA
IAALFGAQGQAYQAASAQAAAFYAQFVQALSAGGGAYAAAEAAAVSPLLA
PINAQFVAATGRPLIGNGANGAPGTGANGGPGGWLIGNGGAGGSGAPGAG
AGGNGGAGGLFGSGGAGGASTDVAGGAGGAGGAGGNAGMLFGAAGVGGVG
GFSNGGATGGAGGAGGAGGLFGAGRERGSGGSGNLTGGAGGAGGNAGTLA
TGDGGAGGTGGASRSGGFGGAGGAGGDAGMFFGSGGSGGAGGISKSVGDS
AAGGAGGAPGLIGNGGNGGNGGASTGGGDGGPGGAGGTGVLIGNGGNGGS
GGTGATLGKAGIGGTGGVLLGLDGFTAPASTSPLHTLQQDVINMVNDPFQ
TLTGRPLIGNGANGTPGTGADGGAGGWLFGNGGNGGQGTIGGVNGGAGGA
GGAGGILFGTGGTGGSGGPGATGLGGIGGAGGAALLFGSGGAGGSGGAGA
VGGNGGAGGNAGALLGAAGAGGAGGAGAVGGNGGAGGNGGLFANGGAGGP
GGFGSPAGAGGIGGAGGNGGLFGAGGTGGAGGGSTLAGGAGGAGGNGGLF
GAGGTGGAGSHSTAAGVSGGAGGAGGDAGLLSLGASGGAGGSGGSSLTAA
GVVGGIGGAGGLLFGSGGAGGSGGFSNSGNGGAGGAGGDAGLLVGSGGAG
GAGASATGAATGGDGGAGGKSGAFGLGGDGGAGGATGLSGAFHIGGKGGV
GGSAVLIGNGGNGGNGGNSGNAGKSGGAPGPSGAGGAGGLLLGENGLNGL
M
>Rv0832 PE_PGRS12, PE-PGRS FAMILY PROTEIN
MSYVSVLPATLATAATEVARIGSALSLASAVAAAQTSAVQAAAADEVSAA
IAALFSAHGRDFQALSARAAAFHHEFVQALAAGAGSYAVAEIAAASPLQS
LIDVFNAPIQAATGRPLIGNGANGQPGTGAPGGPAGG
>Rv0833 PE_PGRS13, PE-PGRS FAMILY PROTEIN
MIGNGGAGGSGAPGAIGGAGGPAGLIGVGGAGGAGGDSAVAGVIGGAGGA
GGAALLFGAGGAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFAS
TGTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGGTGGAGGLFASGG
AGGAGGSGGTGGAGGTGGAGGLFGAGGAGGLGGQGNHTGGHGGAGGSAGL
LALGDGGAGGAGGAATTGTGGAGGAGGKAGLLFGSGGAGGSGGAAGTFGD
TGNSGGAGGAGGKAGLLFGSGGAGGSGGAGGFANGSTGGAGGAGGGAGLI
GNGGNGGSGGTSVATGGAGNGGAGGAGGGAGLIGNGGNGGSGGMGDAPGG
TGVGGIGGLLLGLDGANAPASTNPLHTAQQQALAAVNAPIQAVTGRPLIG
NGANGAPGSGAPGGHGGWLFGGGGTGGSGVSGGAGGDGGAGGILFGAGGA
GGAGGAVTGTGATGGSGGAGGGALLFGAGGAGGAGGSSGIGGFAAGGAGG
PGGAGGLFNGGGAGGAGGSGVSGGAGGEGGAGGAGGLFAGGGAGGAGGSG
NNVGGAGGAGGVGGLFGAGGAGGSGGGGSVAGDSGAGGNAGLLAPGLAGG
AGGGGGQGFDTGGAGGPGGDAGLLVGSGGVGGAGGFGLTTGGPGAAGGDA
GLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGGNGGG
DGGPGGAAFGLGNGGNGGNGGTGTSAGSPGAGGAGGSLIGAEGLPGLLP
>Rv0834c PE_PGRS14, PE-PGRS FAMILY PROTEIN
MSFVIAAPDLVAMATEDLAGIGASLTAANAAAAVPTSGLLAAAGDEVSAA
IAALFSSHGQQYQAMSAQAAAFHARFVQALAGAMGAYAAAEAANASPLQT
LEQGLLGAINAPAAALSGRPFIGNGTNGAPGTGEAGGPGGWLLGNGGNGG
SGAPGQTGGAGGAAGLLGHGGTGGAGGTGASGGKGGTGGWLWGSGGAGGA
GGSGGGSGGAGGNALMFGIGGNGGAGGAASGVGNGGVGGAGGAGGALVAI
GGAGGAGGAATTGTGGAGGAGSNALGLFLGLGGSGGQGGDSAMGSGGAGG
AGGSGGAASPFGIDIGIGGAGGHGGAGTNGGAGGAGGAGGSSGTVFALDL
SWGGAGGNGGAATTGTGGAGGTGGFAVAPDFIGFGAAYGGAGGLGGAATG
AGGTGGTGGVGAGGFAALGVGVGGAGGAGGAATETGGIGGAGGLGVGLLG
GAGGAGGPGGAASAGSGGHGGTGGDALGLIGAGIGGVGGVGGAATDTGGN
GGAGGSGTGLLGGVGGAGGHGGGASVGTGGSGGAGGDGFGFVGAGGNGGN
AGTGVGVNGANGGNGGSATGALAAVGGAGAAGGDATSGTGGFGGAGGSAR
GLIFALGGAGAAGGDASTGVGGPGGPGGTGTASSPFGIAIAIGGAGAQGG
AGTSGATGGAGGDGVFEGIAVLGLGFGGAAGAGGAATGDGATGGAGGFGG
AGAGIANFLGFSVLHGGAGGAGGTATGTGGNGGAGGGGGLSSPVILGIGI
GGAGGDGGGALGVLGGMGGDGGDGGEAVAVGIAVGGAGGAGGAAPTGNGG
AGGNGGDALGLVGVGGNGGNAGTGFGANTGGNGGDTTIVVNGMLAPSTLG
YGGNGGNGVNGGAGGTGGKAGVFGAPGQNGLP
>Rv0872c PE_PGRS15, PE-PGRS FAMILY PROTEIN
MSYVLATPEMVAAAANNLAQIGSTLSAANAAALAPTTGVLAAGADEVSAA
VASLFSGHAQAYQTLGTQAAAFHERFIQALSTAAGAYGSAEAANASPLQQ
ALNVINAPTQTLLGRPLIGNGTNGAPGTGQAGGPGGLLYGNGGNGGSGGV
GQAGGAGGSAGLIGIGGTGGAGGAGAVGGVGGNGGWLYGNGGAGGLGGTG
VAGVNGGMGAAGGAGGNAYLFGSGGAGGQGGMGAAGADGVNPTPTGTADA
GSTGTDQTLGGNAIGGNGGPGDAGDAMTSGGAGGSGGNAVSTVNGDAVGG
EGGKGGEGAYGGAGGAGGSAASIGNAAIGGNGGAGGNAQAPGGVGGAGGE
GGDAQVGTNSPSNAEAGNGGSGGNGFDSFASGGTGGAGGTGGAGGRGGLL
IGDGGAGGAGGVGGTGGSGAPGGGGGAGGDGGAANTDSAGSSRKAFGGDG
GVGGDGASALGTGGEGGIGGQGGNGGAGGLLIGNGGAGGVGGTAGAGGTG
GSGGAGGAGGAGGGGTNSGPGAAFGGNGNTGGNGGNGGAPGALGGKGGSG
GLIGRAGSDGGVGAGGAGGAGGAGGTGGEGGTGGDGKTTDGNPGMGGSPG
SAGQPG
>Rv1067c PE_PGRS19, PE-PGRS FAMILY PROTEIN
MSFVLVSPSQLMAAAADVAGIGSAISAANAAALAPTSVLAAAGADEVSAA
VAALFSAHAGQYQQLGARAALFHEQFVQALTGAASAYASAEATNVEQQVL
GLINAPTQALLGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGL
TGGTGGSAGLIGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPA
GAIGAPGVAGGAGGAGGTAGLFGNGGVGGVGGDGGQGGNGAGAGASGTKG
GDAGAGGAGGAGGWIHGHGGAGGDGGAGGAGGQASPGAPGPPSQPGGAGG
AGGAGGRGGDGGSAGWLSGNGGDAGNGGGGGTAGGAGNGGQFGGDGGTGG
TGGTAGAGGNGGRGAVLFGHGGNAGHGGAGGNGAAAGAGGEHVVATAGKG
GTGGVGGDGGGGGAGGGGGLLYGNGGAGGAGNSGGDGGTGLNAALGGNGG
GGGVGGNAGAGGTGGSAGWLSGNGGAGGSGGSAGAGGAGGKGGDTPNGLA
INPGIGGNGGDTGNAGNGGNGGSAARLFGGGGAGGAGGTGSTAGSGGSGG
TNPPTGLQAAGGNGGSGHAGGHGGNGGGAGLLGGGGTGGNGGGGGQGGLG
AAAGGVDGNGGNGGNGGKGGDAQLVGDGGNGGNGGKGGAGLIAGLDGAGG
AGGTRGLIFGNAGTPGQ
>Rv0124 PE_PGRS2, PE-PGRS FAMILY PROTEIN
MSFVSVAPEIVVAAATDLAGIGSAISAANAAAAAPTTAVLAAGADEVSAA
IAALFSGHAQAYQALSAQAAAFHQQFVQTLAGGAGAYAAAEAQVEQQLLA
AINAPTQALLGRPLIGNGADGAPGTGQAGGAGGILYGNGGNGGSGAAGQA
GGAGGPAGLIGHGGSGGAGGSGAAGGAGGHGGWLWGNGGVGGSGGAGVGA
GVAGGHGGAGGAAGLWGAGGGGGNGGNGADANIVSGGDGGLGGAGGGGGW
LYGDGGAGGHGGQGAIGLGGGAGGDGGQGGAGRGLWGTGGAGGHGGQGGG
TGGPPLPGQAGMGAAGGAGGLIGNGGAGGDGGVGASGGVAGVGGAGGNAM
LIGHGGAGGAGGDSSFANGAAGGAGGAGGHLFGNGGSGGHGGAVTAGNTG
IGGAGGVGGDARLIGHGGAGGAGGDRAGALVGRDGGPGGNGGAGGQLYGN
GGDGAPGTGGTLQAAVSGLVTALFGAPGQPGDTGQPG
>Rv1068c PE_PGRS20, PE-PGRS FAMILY PROTEIN
MSYMIAVPDMLSSAAGDLASIGSSINASTRAAAAATTRLLPAAADEVSAH
IAALFSGHGEGYQAIARQMAAFHDQFTLALTSSAGAYASAEATNVEQQVL
GLINAPTQALLGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGL
TGGTGGSAGLIGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPA
GAIGAPGVAGGAGGAGGTAGLFGNGGAGGAGGAGGAGGRGGDGGSAGWLS
GNGGDAGTGGGGGNAGNGGNGGSAGWLSGNGGTGGGGGTAGAGGQGGNGN
SGIDPGNGGQGADTGNAGNGGHGGSAAKLFGDGGAGGAGGMGSTGGTGGG
GGFGGGTGGNGGNGHAGGAGGSGGTAGLLGSGGSGGTGGDGGNGGLGAGS
GAKGNGGNGGDGGKGGDAQLIGNGGNGGNGGKGGTGLMPGINGTGGAGGS
RGQISGNPGTPGQ
>Rv1087 PE_PGRS21, PE-PGRS FAMILY PROTEIN
MSFVVVAPEVLAAAASDLAGIGSTLAQANAAALAPTTAVLAAGADEVSAA
IASLFGAHGQAYQAVSAQMSAFHAQFMQALTGAGGAYAAAEAVNVSAAQS
VEQDLLAAINARFERIFGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTS
TTVGMAGGNGGAAGLIGNGGFGGGGGPGAAGGNGGAGGWLFGNGGAGGAG
GLGVAPGVPGGAGGAGGAGGVGGPAGLWGHGGAGGAGGAGVAGAGGFEGT
IGAGGAGGVGGAGGVGGAGGAGGWLYGDAGAGGDGGVGGAGGTGGLGNRG
GAGGAGGAGGVGGAGGAAGLWGGGGAGGVGGTGGGAGLGAQSVTFSSSLS
GLSGGDGGAGGAGGAGGAGGTGGWLYGGGGAAGSGGDGGTGGQGGAGGAG
VFSLFGSGGGPGGNGGVGGVGGVGGAGGRAGLFGVGGLGGAGGDAGDSGE
GGFGGPGLAGGLFGNPGNGGVGGIGGDAAAGGAGGAGGNGGAGGNGGWLF
GNGGAGGSGGDGGAAGRGGAGNLGSAGGINAPAGNPGSGSVGIGGAGGAG
GTAGLFGDGGAGGAGGAGAAGGFGGISAATPSAGSEGAMGGAGGVGGNAR
LLGTGGAGGVGGGGGAGGDGGRGGVATPGGQGGDAGDGGAGGAGGNGGGA
SGAGGWLLGTGGAGGAGGNGGNGGKAGFSPGPTNFGLNGAGGGGGVGGNG
ATGPWLFGDGGPTPGSTGAGAAGGHGGDAQLIGNGGHGGAGGTGVPNGSG
GAGGLSGLLFGEPGANG
>Rv1091 PE_PGRS22, PE-PGRS FAMILY PROTEIN
MSFVIAAPEALVAVASDLAGIGSALAEANAAALAPTTALLAAGADEVSAA
IAALFGAHGQAYQTVSAQASAFHAQFVQALTGGGGAYAAAEAANVSAAQS
TDQRLLDLINGPTQALLGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTS
TTAGVAGGNGGAAGLIGNGGAGGGGGAGAAGGNGGAGGWLYGNGGAGGAG
GTSVIPGVAGGNGGAGGSAGLWGTGGAGGDGGNGRSGPVNVAGSAGGNGG
AGGAAGLFGDAGAGGNGGKGGAGGAAFSINFTAGDGGAGGAGGSGGHALL
WGAGGAGGNGGSGGTGGAGGSTAGAGGNGGAGGGGGTGGLLFGNGGAGGH
GAAAGNGLAAGNGVSSSGGGGAGGTGGAGGDGGAGGAGGNARLWGVGGAG
GAGGDGGAGGAGGKGGSGLSGNANGGAGGDSGRGGTGGAGGEGGAAGLLV
GTGGHGGDGGAGGAAVKGGDGGAAAGTGIAGAGGRGGAGGSGGSGGDGGG
GAAGPAGWLFGDGGAGGNGGAAAAGGAGGQAGGGGGNGGNGGNGGNGGNG
GNGATGGWLYGNGGAGGQGATAGAGGAGANGVSSTNGGGTGGNGGIGGTG
GSGGAGGNAGLLGVGGAGGHGASGGAGDRGGAGGTGFISSDGGAGGDGGD
GGNGGAGGTGGLLFGAGGNGGPGGSGGAADIGGNGGAGNGGGTDGNGGNG
GSGGGAGSGGDGGGAGGNGAWLFGNGGAGGGGGKGGNGAGGGLGGGSFGL
PGLNGSGGDGGDGGNGAPGGVLYGNGGAGGQGSSGGIGGPGATGGAGGKG
GDGGDAQLIGDGGNGGNGGAGGTGGTPGPGGPGGSGGLGGLLFGQTGTAG
VSP
>Rv1243c PE_PGRS23, PE-PGRS FAMILY PROTEIN
MEYLIAAQDVLVAAAADLEGIGSALAAANRAAEAPTTGLLAAGADEVSAA
IASLFSGNAQAYQALSAQAAAFHQQFVRALSSAAGSYAAAEAANASPMQA
VLDVVNGPTQLLLGRPLIGDGANGGPGQNGGDGGLLYGNGGNGGSSSTPG
QPGGRGGAAGLIGNGGAGGAGGPGANGGAGGNGGWLYGNGGLGGNGGAAT
QIGGNGGNGGHGGNAGLWGNGGAGGAGAAGAAGANGQNPVSHQVTHATDG
ADGTTGPDGNGTDAGSGSNAVNPGVGGGAGGIGGDGTNLGQTDVSGGAGG
DGGDGANFASGGAGGNGGAAQSGFGDAVGGNGGAGGNGGAGGGGGLGGAG
GSANVANAGNSIGGNGGAGGNGGIGAPGGAGGAGGNANQDNPPGGNSTGG
NGGAGGDGGVGASADVGGAGGFGGSGGRGGLLLGTGGAGGDGGVGGDGGI
GAQGGSGGNGGNGGIGADGMANQDGDGGDGGNGGDGGAGGAGGVGGNGGT
GGAGGLFGQSGSPGSGAAGGLGGAGGNGGAGGGGGTGFNPGAPGDPGTQG
ATGANGQHGLNG
>Rv1325c PE_PGRS24, PE-PGRS FAMILY PROTEIN
MSFVIAAPETLVRAASDLANIGSTLGAANAAALGPTTELLAAGADEVSAA
IASLFAAHGQAYQAVSAQMSAFHAQFVQTFTAGAGAYASAEAAAAAPLEG
LLNIVNTPTQLLLGRPLIGNGANGAPGTGQAGGAGGLLYGNGGAGGSGAP
GQAGGPGGAAGLFGNGGAGGAGGDGPGNGAAGGAGGAGGLLFGSGGAGGP
GGVGNTGTGGLGGDGGAAGLFGAGGIGGAGGPGFNGGAGGAGGRSGLFEV
LAAGGAGGTGGLSVNGGTGGTGGTGGGGGLFSNGGAGGAGGFGVSGSAGG
NGGTGGDGGIFTGNGGTGGTGGTGTGNQLVGGEGGAGGAGGNAGILFGAG
GIGGTGGTGLGAPDPGGTGGKGGVGGIGGAGALFGPGGAGGTGGFGASSA
DQMAGGIGGSGGSGGAAKLIGDGGAGGTGGDSVRGAAGSGGTGGTGGLIG
DGGAGGAGGTGIEFGSVGGAGGAGGNAAGLSGAGGAGGAGGFGETAGDGG
AGGNAGLLNGDGGAGGAGGLGIAGDGGNGGKGGKAGMVGNGGDGGAGGAS
VVANGGVGGSGGNATLIGNGGNGGNGGVGSAPGKGGAGGTAGLLGLNGSP
GLS
>Rv1396c PE_PGRS25, PE-PGRS FAMILY PROTEIN
MSFLFAQPEMLGAAATDLASIGSAISTANAAAAAATTRVLAAGADEVSAA
VAALFSGHAQTYQALRTQAAAFHQQIVQTLTSTAGAYASAEAANVEQQLL
GAINAPTMALLGRPLIGHGADGAPGTGQAGGAGGILYGNGGNGGSGATGQ
AGGAGGAAGLIGHGGAGGLGGTGASGGAGGAGGWLWGNGGAGGNGGVGVA
GDPGGVGGAGGAGGAAGLWGSGGSGGTGGQGGVGGGKSGDGGTGGIGGAG
GGGGWLHGDGGAGGHGGQGGTGVSSGGNGGAGGTGGDGRGLSGSGGAGGR
GGQTGVGGKVGENNFGGAGGAGGTGGLIGNGGAGGNGGQGAISGAGGAGG
NAWLIGDGGAGGNGGDIRGQGGGAGGAGGAGGQLIGNGGTGGAGGTVTSP
NGLGGAGGAGGSAGLIGHGGTGGAGGHSAQGPDGNGGIGGAGGAGGNGGQ
LYGTGGTGGTGGKGGDGFGVFGKGGAGGTGGRGGAAGLIGDAGTGGTGGK
GGTAGEDGTGGNGGTGGNGGAAVLIGNGGGGGAGGNGGAGNDGTPGNGGG
GGVGGTGGTLFGQPGQPGPPGQPGPA
>Rv1441c PE_PGRS26, PE-PGRS FAMILY PROTEIN
MSNVMVVPGMLSAAAADVASIGAALSAANGAAAPTTAGVLAAGADEVSAA
IASLFSGYARDYQALSAQMARFHQQFVQALTASVGSYAAAEAANASPLQA
LEQQVLAAINAPTQTLLGRPLIGNGADGLPGQNGGAGGLLWGNGGNGGAG
DAAHPNGGNGGDAGMFGNGGAGGAGYSPAAGTGAAGGAGGAGGAGGWLSG
NGGAGGNGGTGASGADGGGGLPPVPASPGGNGGGGDAGGAAGMFGTGGAG
GTGGDGGAGGAGDSPNSGANGARGGDGGNGAAGGAGGRLFGNGGAGGNGG
TAGQGGDGGTALGAGGIGGDGGTGGAGGTGGTAGIGGSSAGAGGAGGDGG
AGGTGGGSSMIGGKGGTGGNGGVGGTGGASALTIGNGSSAGAGGAGGAGG
TGGTGGYIESLDGKGQAGNGGNGGNGAAGGAGGGGTGAGGNGGAGGNGGD
GGPSQGGGNPGFGGDGGTGGPGGVGVPDGIGGANGAQGKHG
>Rv1450c PE_PGRS27, PE-PGRS FAMILY PROTEIN
MSLVIVAPETVAAAALDVARIGSSIGAANAAAAGSTTSVLAAGADEVSAA
IATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLAT
LEHNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAGG
SGAPGQVGGAGGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGGA
GGQSLLGGATGGAGGNAGLFGVGGTGGPGGPGGPGGVGGTGGAGGLGGTL
YGAGGHGGAGGPGPIGGVGGHGGVGGAAGLLGVGGHGGAGGHGAEGVAGA
AGEDLSPHGTSGGVGGDAGDGGTGGRGGWLAGAGGAGGAGGVGGTGGAGG
AGFSRALIVAGDNGGDPGAGGAGGTGGAGSTIGAHGAAGASPTSGGNGGA
GGNGAHFSSGGKAGGNGGAGGAGGLVGNGGAGGAGGNGAPGAPPSGGDPN
GGGGGAGGAGGKGGDGGAQAGDGGAGGAGGKGGNGGNGATGATGLNGLGA
GADGTDGGKGGNGGAGGGGGAGGQGGKALAATHQDGSMGAGGAGGNGGAG
GMGGDGGNGAKGTFDNGGDGVGGNGGNGGSRGIGGAGGIGGAGSTAGADG
ARGATPTSGGNGGTGGNGANATVAGGAGGAGGKGGNGGLVGNGGAGGKGG
DGMAGVAGSSPTTAGESGTSGQNGGAGGAGGAGGRGGDFGGDGGTGGAGG
NGANGANATTPGAKGGDGGHGGPGAQGGNGGQGGPGGLAGNLFGQNGIQG
VGGSGGKGGAGGLAGDGGNGANGNFAFGDGNGGHGGNGGNPGAGGQGGSG
GAGSTPGAKGAHGFTPTSGGDGGDGGNGGNSQVVGGNGGDGGNGGNGGSA
GTGGNGGRGGDGAFGGMSANATNPGENGPNGNPGGNGGAGGAGGAGLNGG
NGGAGGNGGLGGFGGNGAAGANGVAVGAPGQPGGAGGHGGAGGNGGAGGN
GGQGVVSDGAGGAGGAGGDGGAPGDGANGGNGQGAGAFAGGGGGRGGDGG
NAGNAGAGGPGGTGSTAGKAGPAGSILHDGGNGGHGGHGAASGGNGGPGG
HGGNGGNGGTGANGGNGGIGGTGGAGSTGAKGVLGTNEGDGGDGGRGGNG
GRGGNGGQGLTGAGGNGGTGGTPGNGGNGGNGASGDLVTSPGDGGGGGRG
GDAGRGGDAGLGGSSGPGGTPGDWGTGGTGGTGGTGGQGANGGLTGGRGG
TGGNGGNGNTGGTGGAGGTGGTGHNGSQPGMGGNGGAGGFGGNGFAGVGG
RGGMGGSGGTGGTGDAGPFGTGTGGTGGHGGQGGGGGFSILLGLGGLGGL
GSPGSIATGTAGGAGGGGGFGGLGGGEFV
>Rv1452c PE_PGRS28, PE-PGRS FAMILY PROTEIN
MSLVIVTPETVAAAASDVARIGSSIGVANSAAAGSTTSVLAAGADEVSAA
IATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLAT
LEHNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAGG
SGAPGQVGGAGGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGGA
GGQSLLGGATGGAGGNAGLFGVGGTGGPGGPGGPGGVGGTGGAGGLGGTL
YGAGGHGGAGGPGPIGGVGGHGGVGGAAGLLGVGGHGGAGGHGAEGVAGA
AGEDLSPHGTSGGVGGDAGDGGTGGRGGWLAGAGGAGGAGGVGGTGGAGG
AGFSRALIVAGDNGGDGGNGGMGGAGGAGGPGGAGGLISLLGGQGAGGAG
GTGGAGGVGGDRGAGGPGNQAFNAGAGGAGGHGGDPGAGGAGGTGGAGSI
TGAQGAIGATPTSGGNGGAGGNGANATTAGTNGANGGPGGHGGLVGNGGA
GGNGANGAAGTNASDSGAVGGKGNSGGNGGQGGAGGDGGTLAGNGGAGGT
GGRGADGGLGGSGAEGANATTAGERGQDGGKGGNGGVGGTGGNAVAPGAN
GGHGGNGGNPGFSGAGGLGGLSGDGVTRAAQGATPDFADTGGKGGNGGNG
ANAVAPGGTGASGGAGGNAGAGGKGGENIIGDGGGGNGGAGGKGGAGTLL
GLTVFGDNGGAGVLGDSTDPDGSGGAGGAGGAGGAGGDPTI
>Rv1468c PE_PGRS29, PE-PGRS FAMILY PROTEIN
MSFVVANTEFVSGAAGNLARLGSMISAANSAAAAQTTAVAAAGADEVSAA
VAALFGAHGQTYQVLSAQAAAFHSQFVQALSGGAQAYAAAEATNFGPLQP
LFDVINAPTLALLNRPLIGNGADGTAANPNGQAGGLLIGNGGNGFSPAAG
PGGNGGAAGLLGHGGNGGVGALGANGGAGGTGGWLFGNGGAGGNSGGGGG
AGGIGGSAVLFGAGGAGGISPNGMGAGGSGGNGGLFFGNGGAGASSFLGG
GGAGGRAFLFGDGGAGGAALSAGSAGRGGDAGFFYGNGGAGGSGAGGASS
AHGGAGGQAGLFGNGGEGGDGGALGGNGGNGGNAQLIGNGGDGGDGGGAG
APGLGGRGGLLLGLPGANGT
>Rv0278c PE_PGRS3, PE-PGRS FAMILY PROTEIN
MSFVIAAPEVIAAAATDLASLGSSISAANAAAAANTTALMAAGADEVSTA
IAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAAVSPLLD
PINEFFLANTGRPLIGNGANGAPGTGANGGDGGWLIGNGGAGGSGAAGVN
GGAGGNGGAGGNGGAGGLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGA
GGAGGAGGGVVALTGGAGGAGGAGGNAGLLFGAAGVGGAGGFTNGSALGG
AGGAGGAGGLFATGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTGGHGGFA
DSSFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAA
GGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGGQ
GGPGGNAGTVFGSGGAGGNGGVGQGFAGGIGGAGGTPGLIGNGGNGGNGG
ASAVTGGNGGIGGTGVLIGNGGNGGSGGIGAGKAGVGGVSGLLLGLDGFN
APASTSPLHTLQQNVLNVVNEPFQTLTGRPLIGNGANGTPGTGADGGAGG
WLFGNGANGTPGTGAAGGAGGWLFGNGGNGGHGATNTAATATGGAGGAGG
ILFGTGGNGGTGGIATGAGGIGGAGGAGGVSLLIGSGGTGGNGGNSIGVA
GIGGAGGRGGDAGLLFGAAGTGGHGAAGGVPAGVGGAGGNGGLFANGGAG
GAGGFNAAGGNGGNGGLFGTGGTGGAGTNFGAGGNGGNGGLFGAGGTGGA
AGSGGSGITTGGGGHGGNAGLLSLGASGGAGGSGGASSLAGGAGGTGGNG
ALLFGFRGAGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGS
GANALGAGTGGTGGDGGHAGVFGNGGDGGCRRVWRRYRRQRWCRRQRRAD
RQRRQRRQRRQSRGHARCRRHRRAAARRERTQRLAIAGRPATTRGVEGIS
CSPQMMP
>Rv1651c PE_PGRS30, PE-PGRS FAMILY PROTEIN
MSFLLVEPDLVTAAAANLAGIRSALSEAAAAASTPTTALASAGADEVSAA
VSRLFGAYGQQFQALNARAATFHAEFVSLLNGGAAAYTGAEAASVSSMQA
LLDAVNAPTQTLLGRPLIGNGADGVAGTGSNAGGNGGPGGILYGNGGNGG
AGGNGGAAGLIGNGGAGGAGGAGGAGGAGGAGGTGGLLYGNGGAGGNGGS
AAAAGGAGGNALLFGNGGNGGSGASGGAAGHAGTIFGNGGNAGAGSGLAG
ADGGLFGNGGDGGSSTSKAGGAGGNALFGNGGDGGSSTVAAGGAGGNTLV
GNGGAGGAGGTSGLTGSGVAGGAGGSVGLWGSGGAGGDGGAATSLLGVGM
NAGAGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLFDSGVGGAGGAGGNAS
LFGNGGTGGVGGKGGTSSDLASATSGAGGAGGAGGVGGLLYGNGGNGGAG
GIGGAAINILANAGAGGAGGAAGSSFIGNGGNGGAGGAGGAAALFSSGVG
GAGGSGGTALLLGSGGAGGNGGTGGANSGSLFASPGGTGGAGGHGGAGGL
IWGNGGAGGNGGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQGGAGGT
GGDHSGGNGIGGKGGASGNGGNAGQVFGDGGTGGTGGAGGAGSGTKAGGT
GSDGGHGGNATLIGNGGDGGAGGAGGAGSPAGAPGNGGTGGTGGVLFGQS
GSSGPPGAAALAFPSLSSSVPILGPYEDLIANTVANLASIGNTWLADPAP
FLQQYLANQFGYGQLTLTALTDATRDFAIGLAGIPPSLQSALQALAAGDV
SGAVTDVLGAVVKVFVSGVDASDLSNILLLGPVGDLFPILSIPGAMSQNF
TNVVMTVTDTTIAFSIDTTNLTGVMTFGLPLAMTLNAVGSPITTAIAFAE
STTAFVSAVQAGNLQAAAAALVGAPANVANGFLNGEARLPLALPTSATGG
IPVTVEVPVGGILAPLQPFQATAVIPVIGPVTVTLEGTPAGGIVPALVNY
APTQLAQAIAP
>Rv1768 PE_PGRS31, PE-PGRS FAMILY PROTEIN
MSYLVVVPELVAAAATDLANIGSSISAANAAAAAPTTALVAAGGDEVSAA
IAALFGAHARAYQALSAQAAMFHEQFVRALAAGGNSYAVAEAATAQSVQQ
DLLNLINAPTQALLGRPLIGNGANGLPGTGQNGGDGGILYGNGGNGGSGG
VNQAGGNGGNAGLWGNGGSGGAGGNATTAGRNGFNGGAGGSGGLLWGNGG
AGGAGGNGGPAPLVGGVGTTGGAGGNGGGAGLFYGFGGAGGNGGMGGVAP
STGPSMGILPAGGVGGPGGSGGASALAFGSGGVGGAGGLGGPTDGTVQGV
GGFGGQGGNGGQSGLLFGNAGAGGAGAAGGAGTGDTESFGGHGGAGGDGG
AVGLIGNGGAGGTGSPGAVVGGNGGVGGLGGAGSPGGLLYGTGGAGGNGG
PGGDGGTGATVGFAGSGGFGGAGGIAQLFGTGGMGGSGGGIGAGTTTVVP
PDVAPVGGTGGNGGRAGLLLGVGGMGGNGGATSVGGTLYAAGGNGGDGGL
VWGNGGTGGSGGAGGAGSVGNGGAGGNAALLFGNGGAGGAGGAGGIGAGG
AGGFGAVLFGNGGAGGSGAPGGIGAGGNGGNALLVGNGGNGGAGTGGAAG
GAGGSGGLLFGQNGMPGP
>Rv1803c PE_PGRS32, PE-PGRS FAMILY PROTEIN
MWTSQMIVAPAFVDAAAKDLATIGSAISRANAEALVPITALLPAGADDVS
AAIAALFATHGQAYQELSAHAVAFHEQFVQLMSAGAAQYASAEAANSSPL
QIVGQTALDAINSPVQTLTGRPLIGNGANGVAGTGQNGGDGGWLYGNGGN
GGSGGTGQNGGNGGSAGLWGSGGNGGQGGAGANGAAGQPGKAGGSGGNGG
AGGWIYGHGGHGGAGGNGGNATAPGGASAGFDGGAGGNGGSGGRGGLLFG
NGGNGSVGGMGGQGTNDTAGDSAGSGGLGGNGGNGAQGGWLIGNGGQGGD
SGAGGGTDSTQTGVMNGASGGSAGIAGNGGDAGLVGNGGAGGNGGNGAAG
SALGTTIFGGSGGVGGSGGDGGNGGWLFGSGASGGNGGQGGDAGTNGFAG
FGGSAGGGGWVGAVNFGPISVQGFGLFGHGGDGGNGGDVGAGSLSIQFGA
SGGDGGQGGVLYGNGGNGGNAGSGGGTGFEGSAGQGGAAILIGNGGAGGN
GATGGTGVGNIIQEAGGDGSDGGAGGSGGLLFGSGGAGGIGGAGGVGGSG
NDGGNGGDGGQGGASGLGIGNGGPGGSGGTGGAGGTGGSAGTGGAGGDGG
NAALLIGTGGDGGDGVPPAPGGQGGKGGLIGLPGQNGQP
>Rv1818c PE_PGRS33, PE-PGRS FAMILY PROTEIN
MSFVVTIPEALAAVATDLAGIGSTIGTANAAAAVPTTTVLAAAADEVSAA
MAALFSGHAQAYQALSAQAALFHEQFVRALTAGAGSYAAAEAASAAPLEG
VLDVINAPALALLGRPLIGNGANGAPGTGANGGDGGILIGNGGAGGSGAA
GMPGGNGGAAGLFGNGGAGGAGGNVASGTAGFGGAGGAGGLLYGAGGAGG
AGGRAGGGVGGIGGAGGAGGNGGLLFGAGGAGGVGGLAADAGDGGAGGDG
GLFFGVGGAGGAGGTGTNVTGGAGGAGGNGGLLFGAGGVGGVGGDGVAFL
GTAPGGPGGAGGAGGLFGVGGAGGAGGIGLVGNGGAGGSGGSALLWGDGG
AGGAGGVGSTTGGAGGAGGNAGLLVGAGGAGGAGALGGGATGVGGAGGNG
GTAGLLFGAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDGGAGGNGTGAKG
GDGGAGGGAILVGNGGNGGNAGSGTPNGSAGTGGAGGLLGKNGMNGLP
>Rv1840c PE_PGRS34, PE-PGRS FAMILY PROTEIN
MSFVVAAPEVVVAAASDLAGIGSAIGAANAAAAVPTMGVLAAGADEVSAA
VADLFGAHAQAYQALSAQAALFHEQFVHAMTAGAGAYAGAEAADAAALDV
LNGPFQALFGRPLIGDGANGAPGQPGGPGGLLYGNGGNGGNGGIGQPGGA
GGDAGLIGNGGNGGIGGPGATGLAGGAGGVGGLLFGDGGNGGAGGLGTGP
VGATGGIGGPGGAAVGLFGHGGAGGAGGLGKAGFAGGAGGTGGTGGLLYG
NGGNGGNVPSGAADGGAGGDARLIGNGGDGGSVGAAPTGIGNGGNGGNGG
WLYGDGGSGGSTLQGFSDGGTGGNAGMFGDGGNGGFSFFDGNGGDGGTGG
TLIGNGGDGGNSVQTDGFLRGHGGDGGNAVGLIGNGGAGGAGSAGTGVFA
PGGGSGGNGGNGALLVGNGGAGGSGGPTQIPSVAVPVTGAGGTGGNGGTA
GLIGNGGNGGAAGVSGDGTPGTGGNGGYAQLIGDGGDGGPGDSGGPGGSG
GTGGTLAGQNGSPGG
>Rv2098c PE_PGRS36, PE-PGRS FAMILY PROTEIN
GQSYQAVSAQAAAFHDRFVQLLNAGGGSYASAEIANAQQNLLNAVNAPTQ
TLLGRPLVGDGADGASGPVGQPGGDGGILWGNGGNGGDSTSPGVAGGAGG
SAGLIGNGGRGGNGAPGGAGGNGGLGGLLLGNGGAGGVGGTGDNGVGDLG
AGGGGGDGGLGGRAGLIGHGGAGGNGGDGGHGGSGKAGGSGGSGGFGQFG
GAGGLLYGNGGAAGSGGNGGDAGTGVSSDGFAGLGGSGGRGGDAGLIGVG
GGGGGNGGDPGLGARLFQVGSRGGDGGVGGWLYGDGGGGGDGGNGGLPFI
GSTNAGNGGSARLIGNGGAGGSGGSGAPGSVSSGGVGGAGNPGGSGGNGG
VWYGNGGAGGAAGQGGPGMNTTSPGGPGGVGGHGGTAILFGDGGAGGAGA
AGGPGTPDGAAGPGGSGGTGGLLFGVPGPSGPDG
>Rv2126c PE_PGRS37, PE-PGRS FAMILY PROTEIN
MIGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGA
GGAGGNGGIGGAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTG
ASGGMGGAGGAGGAGGAGGLLIGDGGAGGAGGIGGAGGVGGGGGAGGTGG
GGVASAFGGGNAFGGRGGDGGDGGDGGTGGAGGARGAGGAGGAGGWLSGH
SGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSG
DPGPPG
>Rv2162c PE_PGRS38, PE-PGRS FAMILY PROTEIN
MSFVIAAPEVMAAAATDLANIGSSISAASAAAAGPTMGILAAGADEVSVA
ISALFGSHAQGYQTLSAQLAAYHNQFVRALNAGAGSYASAEAANVQQTLL
NAINAPTQTLLGRPLIGNGADGGPGQNGGPGGLLYGNGGNGGAGDTANPN
GGNGGSAGLIGNGGAGGAGAATGAGGAGGNGGWLYGNGGPGGAAGLGTAG
GVSPAGGAGGAAGLWGHGGAGGAGGSASGAPGAGGAGGDGGRGGLLYGDG
GAGGAGGNGSNGVTGVHGGNGGAGGAAGLIGNGGAGGDGGNGGLSNTGAS
GGAGGAGGAALIGNGGDGGHGGNGGHGNSGGAGGAGGAGGAGGAGGHVGL
IGNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFYGNGGVGGRGG
NGGFSSAGTSGGDGGIGGAGGIGGLIGSGGGGGDGGNGGQAPTPGNAGDG
GAGGNARLIGDGGRGGNGGEGGDGPPGVKGDGGNGGNGGNAVVIGNGGNG
GAGGFGIPVGSGGAGGSRGVLFGTPGANGADG
>Rv2340c PE_PGRS39, PE-PGRS FAMILY PROTEIN
MSHVTAAPNVLAASAGELAAIGSTMRAANAAAAAPTAGVLAAGGDDVSAG
IAALFGARAQAYQAISAQAALFHDRFVQILQEGAAAYAMAEAANALPLQK
AQGVVSELAQDRTGGTGTGQSRGAGGFGGVGQAGGKGWDGGPIGNGQVGE
QHGAGQLGSTDGNPGVAGAAHGSGVSASHGSGATGAAGVADPGGSGAGVG
SAAGNGTGAGSADAVGGAGTGRDIVGSVRGDGGVGMASGDGGLSTGAAGA
SAEGGLMPGFGGAPWVGGHWGLGGEGHSGAIGGVGEQVAPAVATAPAVSP
ATTSAVAAESGSTPATKAQAMHATTNPGNAAHQGNPADPGNSARRADGGR
DEQLLLLPLTSLRGLRHTLKKLSGLRARNGLLTASGDNASGSGRPWDRDQ
LLRALGLRPPGHE
>Rv0279c PE_PGRS4, PE-PGRS FAMILY PROTEIN
MSFVIAAPEVIAAAATDLASLESSIAAANAAAAANTTALLAAGADEVSTA
VAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAATSPLLA
PINEFFLANTGRPLIGNGTNGAPGTGANGGDGGWLIGNGGAGGSGAAGVN
GGAGGNGGAGGLIGNGGAGGAGGRASTGTGGAGGAGGAAGMLFGAAGVGG
PGGFAAAFGATGGAGGAGGNGGLFADGGVGGAGGATDAGTGGAGGSGGNG
GLFGAGGTGGPGGFGIFGGGAGGDGGSGGLFGAGGTGGSGGTSIINVGGN
GGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGDGGTLFGSGGAGG
VCGLGFDAGGAGGAGGKAGLLIGAGGAGGAGGGSFAGAGGTGGAGGAPGL
VGNAGNGGNGGASANGAGAAGGAGGSGVLIGNGGNGGSGGTGAPAGTAGA
GGLGGQLLGRDGFNAPASTPLHTLQQQILNAINEPTQALTGRPLIGNGAN
GTPGTGADGGAGGWLFGNGGNGGHGATGADGGDGGSGGAGGILSGIGGTG
GSGGIGTTGQGGTGGTGGAALLIGSGGTGGSGGFGLDTGGAGGRGGDAGL
FLGAAGTGGQAALSQNFIGAGGTAGAGGTGGLFANGGAGGAGGFGANGGT
GGNGLLFGAGGTGGAGTLGADGGAGGHGGLFGAGGTGGAGGSSGGTFGGN
GGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSGGSLFGFGGA
GGTGGSSGIGSSGGTGGDGGTAGVFGNGGDGGAGGFGADTGGNSSSVPNA
VLIGNGGNGGNGGKAGGTPGAGGTSGLIIGENGLNGL
>Rv2371 PE_PGRS40, PE-PGRS FAMILY PROTEIN
MSLVSVAPELVVTAVPDVARIGSSIGAPDTAAAARPTTSVLAAGADEVSA
DVVALFGWVAR
>Rv2396 PE_PGRS41, PE-PGRS FAMILY PROTEIN
MSFLIASPEALAATATYLTGIGSAISAANAVAAAPTTEILAAGTDEVSTA
ISALFGAHAQAYQALSAHVAAFHDQFVHTLTAGAGSYMAAEAAAASPLQA
LQLELLNAINAPTLALLGRPLIGDGTDAAPGSGGAGGAGGILIGNGGTGG
ASDLAGTGRGGVGGAGGAGGLFGIGGAGGGCGSAVAIGGDGGAGGAGGVF
SGGGAGGAGDAIGGSGGAGGTGGLLGGGGGAGGAGGAGGNGGGASNSASI
GGDGGSGGAGGMLYGAGGVGGNGGAAVAIGGDGGAGGRAGAIGNGGDGGN
GGTSNTPGGSGGDGGNGGNAGLIGNGGNGGNAEIVISGGSVAGTGGNGGL
LLGFNGTNGLP
>Rv2487c PE_PGRS42, PE-PGRS FAMILY PROTEIN
MSLVIATPQLLATAALDLASIGSQVSAANAAAAMPTTEVVAAAADEVSAA
IAGLFGAHARQYQALSVQVAAFHEQFVQALTAAAGRYASTEAAVERSLLG
AVNAPTEALLGRPLIGNGADGTAPGQPGAAGGLLFGNGGNGAAGGFGQTG
GSGGAAGLIGNGGNGGAGGTGAAGGAGGNGGWLWGNGGNGGVGGTSVAAG
IGGAGGNGGNAGLFGHGGAGGTGGAGLAGANGVNPTPGPAASTGDSPADV
SGIGDQTGGDGGTGGHGTAGTPTGGTGGDGATATAGSGKATGGAGGDGGT
AAAGGGGGNGGDGGVAQGDIASAFGGDGGNGSDGVAAGSGGGSGGAGGGA
FVHIATATSTGGSGGFGGNGAASAASGADGGAGGAGGNGGAGGLLFGDGG
NGGAGGAGGIGGDGATGGPGGSGGNAGIARFDSPDPEAEPDVVGGKGGDG
GKGGSGLGVGGAGGTGGAGGNGGAGGLLFGNGGNGGNAGAGGDGGAGVAG
GVGGNGGGGGTATFHEDPVAGVWAVGGVGGDGGSGGSSLGVGGVGGAGGV
GGKGGASGMLIGNGGNGGSGGVGGAGGVGGAGGDGGNGGSGGNASTFGDE
NSIGGAGGTGGNGGNGANGGNGGAGGIAGGAGGSGGFLSGAAGVSGADGI
GGAGGAGGAGGAGGSGGEAGAGGLTNGPGSPGVSGTEGMAGAPG
>Rv2490c PE_PGRS43, PE-PGRS FAMILY PROTEIN
MSYVIATPEMMATAAFDLARIGSQVSAASAVAAMPTTEVVAAGADEVSAG
IAALFSAHAQEYQALSAQAAAFHDQFVHTLTAAARWYTATEIANAAAMRV
VLGAVNAPTQTLLGRPLIGDGAHGTAPGQPGGAGGLLFGNGGNGAAGAVG
QVGGAGGAAGLFGIGGAGGAGGAGAPGGTGGTGGWLAGGGGVGGMGGAGG
GAGGAGGNAGLFGNGGAGGAGGAGGGAGGAGGNAGWFGHGGAGGVGGVGA
AGANGATPGQDGAAGVAGSDDGAGGDGLAGSDGGDGGAGGVGGNGGRGGW
LLGNGGAGGVGGVGGAGGAGAAGGAGGAGATGINGPAGISAAGGDGGAGG
NGGAGGNGGVGGAGGAGGSAGLLGYVGRAGDGGAGGGGGLGGAPGDGGAG
GNGGSWLAAGDGGAGGHGGDPGLGGAGGAGGASGGAGARAGANGLAAGND
GPVSGGNGGKGGNGAHAPVAGGHGGNGGAGGNGGLVGDGGAGGHGGDGAA
GAGYADMTAIFLGSSGTPGEDGGNGGAGGAGGAGGAHAGDGGAGGAGGNG
GAGGAGGNGAHGFNAVLVSDGGNGGDGGAGGRGGDGGAGGAGGDAPAGRA
GSQGVGGDGGAGGAGGAPGNGGSGGRGDMAFKDGDGGAGGDGGDPGAGGK
GGAGGAGATEGVTGATGATVHSGGNGGKGGNGADATVAGANGGKGGAGGN
GGLVGDGGAGGDGGSGAAGANGANVGEDGADGTLSGQPGEGSEANGGQGG
VGGGGAGGAGGDGGAGSSALGSGGNGGRGDAGQAGGAGGAGGAGGAGGSV
SGDGGPGGKGGAGGAGGAGASGGGGGKGASGADSAEAVGGAGGKGGDGGV
GGVGGDGGPGGDGGAGGAAPAGQVGSHGVGGVGGDGGLGGAGGNGGDGGH
GSDGGDGGDGGDPGAGGLGGLGGDSGNGTRAASGVDASDHGPGSGGNGGN
GGNGAQASVAGGAGGNGGDGGNAGRVGDGGAGGNGGDGAAGANGANSGAP
GSDALALGQPGGNGGQGDAGQAGGAGGAGGAGGAGGSVSGDGGAGGNGGA
GGNGGVGASGGAGARGANGIDSIGGTGGAGGGGGDGGAGGVGGHGGDGGV
GGAAPSGTVGSHGTGGVGGDGGLGGAGGVGGAGGNGGIGITVGGAGGAGG
NGGDPGAGGRGGLGGDSGNGTSAANGVDASKHGPLTGGDGGVGGNGAKAA
AAGGDGGQGGDGGNAGLFGDGGAGGDGADGTAAEALGGDGGAGGAGGKGG
DAGDIGDGGDGGKGGDGAHGALGGLTVAGGNGGAGGAGGAGGAGGAFLGD
GGNGGAGGQGGAGRGGSPGGGGGVGGHGGAGGDAGMNGGGGTGGQGGNGA
AGGAGWSPDSDLKGFDGFDGGSGGAGGDGGAGGAGGTQTGDGGDGGAGGL
GGAGGVGGNGVDGFDINETTGRDGGDGGDGGYGGWGGAGGNGGAGGSAPA
GEVGNRGVGGDGGDGGSGGDAGNGGLGGDGFTYLADFDGEPGGDGGDGGD
GGWGRPGGQGGFGSTSGAHGKAGFGAPGGDGGDGGNGGHGGDGNGSFADA
GDGGPGGNGGNGGLGGAGRDGGAPGGDGGDGGTGGSGGFGAPPPRSIGGG
DGGDGGRGGDGGRGAGGLTSGGVGSSGESGGSGNGRGDPGSGGSGGEGGE
GGPSISVNVT
>Rv2591 PE_PGRS44, PE-PGRS FAMILY PROTEIN
MSFVTAAPEMLATAAQNVANIGTSLSAANATAAASTTSVLAAGADEVSQA
IARLFSDYATHYQSLNAQAAAFHHSFVQTLNAAGGAYSSAEAANASAQAL
EQNLLAVINAPAQALFGRPLIGNGANGTAASPNGGDGGILYGNGGNGFSQ
TTAGVAGGAGGSAGLIGNGGNGGAGGAGAAGGAGGAGGWLLGNGGAGGPG
GPTDVPAGTGGAGGAGGDAPLIGWGGNGGPGGFAAFGNGGAGGNGGASGS
LFGVGGAGGVGGSSEDVGGTGGAGGAGRGLFLGLGGDGGAGGTSNNNGGD
GGAGGTAGGRLFSLGGDGGNGGAGTAIGSNAGDGGAGGDSSALIGYAQGG
SGGLGGFGESTGGDGGLGGAGAVLIGTGVGGFGGLGGGSNGTGGAGGAGG
TGATLIGLGAGGGGGIGGFAVNVGNGVGGLGGQGGQGAALIGLGAGGAGG
AGGATVVGLGGNGGDGGDGGGLFSIGVGGDGGNAGNGAMPANGGNGGNAG
VIANGSFAPSFVGFGGNGGNGVNGGTGGSGGILFGANGANGPS
>Rv2615c PE_PGRS45, PE-PGRS FAMILY PROTEIN
MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAQDEVSTAI
AALFGSHGQHYQAISAQVAAYQQRFVLALSQAGSTYAVAEAASATPLQNV
LDAINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPG
QAGGAGGAAGLIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGAGG
NGGIGGAGTNLAIGGHGGNGGNAGLIGAGGTGGAGGTGGGEPSAGASGGN
GGNGGNGGLLIGNSGDGGAAGNGAGISQNGPASGFGGNGGHAGTTGLIGN
GGNGGAGGAGGDVSADFGGVGFGGQGGNGGAGGLLYGNGGAGGNGGAAGS
PGSVTAFGGNGGSGGSGGNGGNALIGNAGAGGSAGAGGNGASAGTAGGSG
GDGGKGGNGGSVGLIGNGGNGGNGGAGSLFNGAPGFGGPGGSGGASLLGP
PGLAGTNGADG
>Rv2634c PE_PGRS46, PE-PGRS FAMILY PROTEIN
MSFVIAVPEALTMAASDLANIGSTINAANAAAALPTTGVVAAAADEVSAA
VAALFGSYAQSYQAFGAQLSAFHAQFVQSLTNGARSYVVAEATSAAPLQD
LLGVVNAPAQALLGRPLIGNGANGADGTGAPGGPGGLLLGNGGNGGSGAP
GQPGGAGGDAGLIGNGGTGGKGGDGLVGSGAAGGVGGRGGWLLGNGGTGG
AGGAAGATLVGGTGGVGGATGLIGSGGFGGAGGAAAGVGTTGGVGGSGGV
GGVFGNGGFGGAGGLGAAGGVGGAASYFGTGGGGGVGGDGAPGGDGGAGP
LLIGNGGVGGLGGAGAAGGNGGAGGMLLGDGGAGGQGGPAVAGVLGGMPG
AGGNGGNANWFGSGGAGGQGGTGLAGTNGVNPGSIANPNTGANGTDNSGN
GNQTGGNGGPGPAGGVGEAGGVGGQGGLGESLDGNDGTGGKGGAGGTAGT
DGGAGGAGGAGGIGETDGSAGGVATGGEGGDGATGGVDGGVGGAGGKGGQ
GHNTGVGDAFGGDGGIGGDGNGALGAAGGNGGTGGAGGNGGRGGMLIGNG
GAGGAGGTGGTGGGGAAGFAGGVGGAGGEGLTDGAGTAEGGTGGLGGLGG
VGGTGGMGGSGGVGGNGGAAGSLIGLGGGGGAGGVGGTGGIGGIGGAGGN
GGAGGAGTTTGGGATIGGGGGTGGVGGAGGTGGTGGAGGTTGGSGGAGGL
IGWAGAAGGTGAGGTGGQGGLGGQGGNGGNGGTGATGGQGGDFALGGNGG
AGGAGGSPGGSSGIQGNMGPPGTQGADG
>Rv2741 PE_PGRS47, PE-PGRS FAMILY PROTEIN
MSFVIAAPEFLTAAAMDLASIGSTVSAASAAASAPTVAILAAGADEVSIA
VAALFGMHGQAYQALSVQASAFHQQFVQALTAGAYSYASAEAAAVTPLQQ
LVDVINAPFRSALGRPLIGNGANGKPGTGQDGGAGGLLYGSGGNGGSGLA
GSGQKGGNGGAAGLFGNGGAGGAGASNQAGNGGAGGNGGAGGLIWGTAGT
GGNGGFTTFLDAAGGAGGAGGAGGLFGAGGAGGVGGAALGGGAQAAGGNG
GAGGVGGLFGAGGAGGAGGFSDTGGTGGAGGAGGLFGPGGGSGGVGGFGD
TGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGGAGGAGGTVFGSGGAGG
AGGVATVAGHGGHGGNAGLLYGTGGAGGAGGFGGFGGDGGDGGIGGLVGS
GGAGGSGGTGTLSGGRGGAGGNAGTFYGSGGAGGAGGESDNGDGGNGGVG
GKAGLVGEGGNGGDGGATIAGKGGSGGNGGNAWLTGQGGNGGNAAFGKAG
TGSVGVGGAGGLLEGQNGENGLLPS
>Rv2853 PE_PGRS48, PE-PGRS FAMILY PROTEIN
MLYVVASPDLMTAAATNLAEIGSAISTANGAAALPTVEVVAAAADEVSTQ
IAALFGAHARSYQTLSTQAAAFHSRFVQALTTAAASYASVEAANASPLQV
ALDVINAPAQTLLGRPLIGNGADGSTPGQAGGPGGLLYGNGGNGAAGGPN
QAGGAGGNAGLIGNGGAGGAGGVGAVGGKRGTGGLLFGNGGAGGQGGLGL
AGINGGSGGQGGHGGNAILFGQGGAGGPGGTGAMGVAGTNPTPIGTAAPG
SDGVNQIGNGGNTDLTGGAGGDGNAGSTTVNGGNGGTGGAARNSSGGTGN
SFGGAGGAGGDGANGGDGGAGGEALTEGGATAVSGAGGKGGNAEASGGAG
GNGGKGGFAQATTSVTGGNGGNGGNGHDSNAPGGAGGSGGVGGDGGRGGL
LAGNGGTGGAGGNGGTGGAGAPGGAGGAGGKADIANSLGDNATVTGGNGG
TGGDGGSALGTGGAGGAGGLGGHGGAGGLLIGNGGAGGAGGLGGAGGAGG
AGGEGGAGGAGGEAIPGGASTNSAGGDGGAGGTGGNGGDGGAGGAPGLGG
AGGAGGWLIGQSGSTGGGGAGGAGGAGGAGGAGGSGGAGGHGDTTSGKNG
SSGTAGFDGNPGQPG
>Rv3344c PE_PGRS49, PE-PGRS FAMILY PROTEIN
AQASPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAG
SGGNGGKGGDGVGPGSTGGAGGKGGAGANGGSSNGNARGGNAGNGGHGGA
GGSGDTGGAGGAGGQGGFGGTGGSGSGIGGGAGGNGGNGGAGGTGVVLGG
KGGDGGNGDHGGPATNPGSGSRGGAGGSGGNGGAGGNATGSGGKGGAGGN
GGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGGDGGKGGNGGA
GGNGGSIGANSGIVGGSGGAGGAGGAGGNGSLSSGEGGKGGDGGHGGDGV
GGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVGTV
AGGGGNGGVGGRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGG
GGNAPDGGFGGNGGKGGQGGIGGGTQSATGLGGDGGDGGDGGNGGNSGAK
AGGAGGKGQAGQPNSGTEPGFGGDGGLGGAGATP
>Rv0297 PE_PGRS5, PE-PGRS FAMILY PROTEIN
MSFVIAQPEMIAAAAGELASIRSAINAANAAAAAQTTGVMSAAADEVSTA
VAALFSSHAQAYQAASAQAAAFHAQVVRTLTVDAGAYASAEAANAGPNML
AAVNAPAQALLGRPLIGNGANGAPGTGQAGGDGGLLFGNGGNGGSGAPGQ
AGGAGGAAGFFGNGGNGGDGGAGANGGAGGTAGWFFGFGGNGGAGGIGVA
GINGGLGGAGGDGGNAGFFGNGGNGGMGGAGAAGVNAVNPGLATPVTPAA
NGGNGLNLVGVPGTAGGGADGANGSAIGQAGGAGGDGGNASTSGGIGIAQ
TGGAGGAGGAGGDGAPGGNGGNGGSVEHTGATGSSASGGNGATGGNGGVG
APGGAGGNGGHVSGGSVNTAGAGGKGGNGGTGGAGGPGGHGGSVLSGPVG
DSGNGGAGGDGGAGVSATDIAGTGGRGGNGGHGGLWIGNGGDGGAGGVGG
VGGAGAAGAIGGHGGDGGSVNTPIGGSEAGDGGKGGLGGDGGGRGIFGQF
GAGGAGGAGGVGGAGGAGGTGGGGGNGGAIFNAGTPGAAGTGGDGGVGGT
GAAGGKGGAGGSGGVNGATGADGAKGLDGATGGKGNNGNPG
>Rv3345c PE_PGRS50, PE-PGRS FAMILY PROTEIN
MVMSLMVAPELVAAAAADLTGIGQAISAANAAAAGPTTQVLAAAGDEVSA
AIAALFGTHAQEYQALSARVATFHEQFVRSLTAAGSAYATAEAANASPLQ
ALEQQVLGAINAPTQLWLGRPLIGDGVHGAPGTGQPGGAGGLLWGNGGNG
GSGAAGQVGGPGGAAGLFGNGGSGGSGGAGAAGGVGGSGGWLNGNGGAGG
AGGTGANGGAGGNAWLFGAGGSGGAGTNGGVGGSGGFVYGNGGAGGIGGI
GGIGGNGGDAGLFGNGGAGGAGAAGLPGAAGLNGGDGSDGGNGGTGGNGG
RGGLLVGNGGAGGAGGVGGDGGKGGAGDPSFAVNNGAGGNGGHGGNPGVG
GAGGAGGLLAGAHGAAGATPTSGGNGGDGGIGATANSPLQAGGAGGNGGH
GGLVGNGGTGGAGGAGHAGSTGATGTALQPTGGNGTNGGAGGHGGNGGNG
GAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADGGMGGNGGKGGDG
GKAGDGGAGAAGDVTLAVNQGAGGDGGNGGEVGVGGKGGAGGVSANPALN
GSAGANGTAPTSGGNGGNGGAGATPTVAGENGGAGGNGGHGGSVGNGGAG
GAGGNGVAGTGLALNGGNGGNGGIGGNGGSAAGTGGDGGKGGNGGAGANG
QDFSASANGANGGQGGNGGNGGIGGKGGDAFATFAKAGNGGAGGNGGNVG
VAGQGGAGGKGAIPAMKGATGADGTAPTSGGDGGNGGNGASPTVAGGNGG
DGGKGGSGGNVGNGGNGGAGGNGAAGQAGTPGPTSGDSGTSGTDGGAGGN
GGAGGAGGTLAGHGGNGGKGGNGGQGGIGGAGERGADGAGPNANGANGEN
GGSGGNGGDGGAGGNGGAGGKAQAAGYTDGATGTGGDGGNGGDGGKAGDG
GAGENGLNSGAMLPGGGTVGNPGTGGNGGNGGNAGVGGTGGKAGTGSLTG
LDGTDGITPNGGNGGNGGNGGKGGTAGNGSGAAGGNGGNGGSGLNGGDAG
NGGNGGGALNQAGFFGTGGKGGNGGNGGAGMINGGLGGFGGAGGGGAVDV
AATTGGAGGNGGAGGFASTGLGGPGGAGGPGGAGDFASGVGGVGGAGGDG
GAGGVGGFGGQGGIGGEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSE
TGFGGAGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVSTTGGDGGAGGKGG
NGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGGAGGNGGSSKSTTT
GNAGSGGAGGNGGTGLNGAGGAGGAGGNAGVAGVSFGNAVGGDGGNGGNG
GHGGDGTTGGAGGKGGNGSSGAASGSGVVNVTAGHGGNGGNGGNGGNGSA
GAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAGGAAG
AGGNGGGTSSAAGHGGSGGSGGSGTTGGAGAAGGNGGAGAGGGSLSTGQS
GGPRRQRWCRWQRRRWLGRQRRRRWCRWQRRCRRQRWRWRCRQRRLRRQW
RQGRRRCRPWLHRRRGRQGRRWRQRRFQQRQRSRWQRR
>Rv3367 PE_PGRS51, PE-PGRS FAMILY PROTEIN
MSFVVAVPEALAAAASDVANIGSALSAANAAAAAGTTGLLAAGADEVSAA
LASLFSGHAVSYQQVAAQATALHDQFVQALTGAGGSYALTEAANVQQNLL
NAINAPTQALLGRPLIGDGAVGTASSPDGQDGGLLFGNGGAGYNSAATPG
MAGGNGGNAGLIGNGGTGGSGGAGAAGGAGGSGGWLYGNGGNGGIGGNAI
VAGGAGGNGGAGGAAGLWGSGGSGGQGGNGLTGNDGVNPAPVTNPALNGA
AGDSNIEPQTSVLIGTQGGDGTPGGAGVNGGNGGAGGDANGNPANTSIAN
AGAGGNGAAGGDGGANGGAGGAGGQAASAGSSVGGDGGNGGAGGTGTNGH
AGGAGGAGGAGGRGGWLVGNGGNGGNGAAGGNGAIGGTGGAGGVPANQGG
NSALGTQPVGGDGGDGGNGGTGGTGGRGGDGGSGGAGGASGWLMGNGGNG
GNGGTGGSGGVGGNGGIGGDGAGGGNATSTSSIPFDAHGGNGGAGGDAGH
GGTGGDGGDGGHAGTGGRGGLLAGQHANSGNGGGGGTGGAGGTHGTPGSG
NAGGTGTGNADSTNGGPGSDGLGGDAFNGSRGTDGNPG
>Rv3388 PE_PGRS52, PE-PGRS FAMILY PROTEIN
MSFVIANPEMLAAAATDLAGIRSAISAATAAAAAPTIQVAAAGADEVSLA
ISALFGQHAQAYQALSAQATIFHDQFVQALTSGGNLYAAAESHTVEQMVL
NAINAPTQTLFGRPLIGDGANGTAENPDGQNGGLLFGNGGNGFTQTTAGV
AGGNGGSAGLIGNGGAGGGGGAGAAGGLGGNGGWLYGNGGAGGIGGAGTG
TGGHGGAGGAGGRAWLWGTGGAGGAGGDGGWLFGDGGAGGTGGNGGSGFN
SLTSSVGGAGGAGGHAGLFGAGGTGGTGGIGGQNTETGPAASNGGAGGAG
GGGGYLVGDGGAGGTGGAGGKNSSGGATLTGGTGGTGGAGGAAGWLYGSG
GAGGAGGAGGLNNAGGATGGTGGTGGAGGSGAWLYGNGGAAGAGGNGGNN
TSAGTGGVGASGGTGGNAGLIGAGGHGGAGGAGGNQTGGVGNGGAGGNGG
AGGAGGQLYGNGGDGGNGGAGGANIAGGNGSDGGAAGHGGAGGSARLIGA
GGHGGDGGAGGNTAGRRADAIAGTGGDGGNGGNGGLLSGNAGAGGHGGAG
GSSTATTTTGTPPTGATGGNGGNGGAGGTAGFTGSGGIGGNGGAGGTGGN
AGVALSVGSTGGLGGNGGSGGLGGGGGSLFGNGGAGGVGATGGNGGSGIG
PASVGGNGGKGGVGAAGGLAGQIGNGGSGGSGGAGGNGGTGDTAGNGGNG
GAGAVGGNAQLIGNGGNGGGGGNGGTGADGT
>Rv3507 PE_PGRS53, PE-PGRS FAMILY PROTEIN
MSFVLVSPETVAAVATDLKRIGASLAHENASAAASTTAVVSAAADEVSTA
VAALFSQHAQGYQAAAAQVAAFHSRFVQALTAGAGAYAFAEAANASPLQS
AMGAVSASAQTLLSRPLIGNGANATTPGGNGGDGGWLFGSGGNGAPGAAG
QSGGNGGSAGLWGNGGAGGAGGSGGAAGGNGGNGGWLFGAGGTGGIGGTG
APGAMGGTGGNGGNGALLIGGGGLGGAGGMGGTGGGTGGTGGNGGNGALL
IGAGGVGGAGGIGGQGTGAGGAAGAGGTGGNGGAGGLFMNGGDGGAGGQG
GDGAAGDAAASAGGTGGKGGQGGDGGTGGAGGAGPVLFGHGGAGGMGGQG
GTGGMGGAGGDGTTVIAAGTGGEGGTGGAAGAGGAAGARGALTSGGLAGG
VGAGGTGGTGGTGGNGADAAAVVGFGANGDPGFAGGKGGNGGIGGAAVTG
GVAGDGGTGGKGGTGGAGGAGNDAGSTGNPGGKGGDGGIGGAGGAGGAAG
TGNGGHAGNTGDGGDGGTGGNGGNGTGGVNGADNTLNPDTPGGAGEPGGA
GGAGGAGGAAGGPGGTGGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTN
APVGGEGGAGGDGGAGGAGGAANGGTAGSQGTGGVGGDGGAGGNGGGGKA
GTGNSGNFGVDGEAGFSGGAGGNGGVGGAAGANGGTGGSGGNGGDGGAGG
IGGAGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAGGNAGGAGGQGGNAG
QGGAGGAGGNAVIPGDGVGKAPHGDAGGSGGDGGKGGQGGSGGTGGSGAP
IGGGAGGTGGSGGHAGKGGAGGIGAQGTTITVPGNGGNAGDGGNGGNAGA
GGNGGSGDFGGNTTSGASGSGGNGGNAGTAGSGGAGGTGGTGLSGGNGGN
GGNGGNGGDGGNGAHGTVGAQFVPATSLPTPNGGAGGNGGTGSNGGAPGP
AGAPGPTTGGNAGSQGIGGDGGNGGDGGKGGDGADAVNVVFMPTEPQAAT
GTAGSAGDPTGGNGGPGTPGSPMVAPPPPTPITQVQQGGDGGAGGTGSTN
ANDGTATGGKGGEGGVGSILGGPGGNGGTGGNASATGTNGVANAGNGGKG
GDGGQFGAGGNGGAGGSVTDGSAGSTAGNGGNGGNATNGTIAGQPAGGNG
SAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGG
GGANGGDGGAGGAGGAGGRGGKGIDGGFGGDGGNGGSNNGTGAGGNGGNG
GTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGNGGSGGTGGAGGDGGTGGD
GGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGASEDGD
NGNAGSGATGGTGGNGGTGGDGGAAGLGGVA
>Rv3508 PE_PGRS54, PE-PGRS FAMILY PROTEIN
MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGADEVSAR
IAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVL
GVINAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQA
GGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGG
TGGAGGPGGLIWGGGGAGGVGGAGGGTGGAGGRAELLFGAGGAGGAGTDG
GPGATGGTGGHGGVGGDGGWLAPGGAGGAGGQGGAGGAGSDGGALGGTGG
TGGTGGAGGAGGRGALLLGAGGQGGLGGAGGQGGTGGAGGDGVLGGVGGT
GGKGGVGGVAGLGGAGGAAGQLFSAGGAAGAVGVGGTGGQGGAGGAGAAG
ADAPASTGLTGGTGFAGGAGGVGGQGGNAIAGGINGSGGAGGTGGQGGAG
GMGGSGADNASGIGADGGAGGTGGNAGAGGAGGAAGTGGTGGVVGAAGKA
GIGGTGGQGGAGGAGSAGTDATATGATGGTGFSGGAGGAGGAGGNTGVGG
TNGSGGQGGTGGAGGAGGAGGVGADNPTGIGGTGGTGGKGGAGGAGGQGG
SSGAGGTNGSGGAGGTGGQGGAGGAGGAGADNPTGIGGAGGTGGTGGAAG
AGGAGGAIGTGGTGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSA
TGGAGFAGGAGGEGGAGGNSGVGGTNGSGGAGGAGGKGGTGGAGGSGADN
PTGAGFAGGAGGTGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGGRG
GDGGDGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGNGGDGG
DGATGAAGLGDNGGVGGDGGAGGAAGNGGNAGVGLTAKAGDGGAAGNGGN
GGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGT
GGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGGR
GGDGGDGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGNGGDG
GDGATGAAGLGDNGGVGGDGGAGGAAGNGGNAGVGLTAKAGDGGAAGNGG
NGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGG
TGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGG
VGGDGGEGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGTGGA
GGDGAPATLIGGPDGGDGGQGGIGGDGGNAGFGAGVPGDGGDGGNAGFGA
GVPGDGGIGGTGGAGGAGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNST
GLASAASGDGGNGGAGGAGGNGGDGDGFIGGSGGTGGTGGDAGVGGLANT
GGTAGNAGIGGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGA
GGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGIGGAGGN
AGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDGGQGGAGGHGGQGGKG
GLNSTGLASAASGDGGNGGAGGAGGNGGDGDGFIGGSGGTGGTGGDAGVG
GLANTGGTAGNAGIGGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVG
GNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGIG
GAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDGGQGGAGGHGG
QGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGAGGLGGGGGTGGTNGNG
GLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGGSATGVGNGGN
GGDGGNGGDGGNGAPGGFGGGAGAGGLGGSGAGGGTDGDDGNGGSPGTDG
S
>Rv3511 PE_PGRS55, PE-PGRS FAMILY PROTEIN
MSFVLISPEVVSAAAGDLANVGSTISAANKAAAAATTQVLAAGADEVSAR
IAALFGMYGLEYQAISAQVAAYHQQFVQTLRTGAASYMLAEATNVEQNLL
NLINAPTQTLLGRPLIGDGANATTPGGAGGDGGLLFGSGGNGAPGAPGQA
GGAGGSAGLLGNGGSGGAGGTGAPGGNGGNAGWLYGRGGVGGAGGIGGGT
GGAGGHAWLFGHGGTGGIGGGPGGNGGWLLGNGGHGGAGGIGGGSGGAGG
NGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGGIGGGNGGH
GGNGGWLLGNGGNGGLGGDGDGGTGGGHGGNGGNPGWLLGTAGGGGNGGA
GSTGTAGGGSGGTGGDGGTGGRGGLLMGAGAGGHGGTGGAGGAGVNGGGA
GGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMAGLG
GTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTGGNGGA
GSAGADAPAGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGNGGTGG
KGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGAGGTGFTQGADGNAGN
GGDGGVGGNGGNGADNTTTAAAGTTGGAGGAGGAGGTGGAAGTGTGGQQG
NGGNGGNGGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGAGGT
GFTPRRRRQRRQRR
>Rv3512 PE_PGRS56, PE-PGRS FAMILY PROTEIN
PQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGGAGGAGGAGGTGGT
GGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGDGALAGSSGGAGGKGGNGG
DAGKAGTGSAPGTAGTGGDGGKGGNGGIGAAGTTGPVGTGASGGTGGSGG
AGGTGGDGGAANGGTAGAGGAGGNGGKGGDGGAGVTSSTAGNSGGAGGSG
GKGGDAGAGGAGATPGANGIAGNGGDGGDGAAGAVGISGATGAGDGGHGG
TGAAGGNGGTGGAGGSGIDGVGGGTGGTGGNGGNGAIGGAGGDAGGSGNS
GGNGGIGGKGGNAGAGGAAGSNGGTVGANGTGGDGGNGGAAGAATAGSNG
GAGTGSAGGNGGTGGRGGSGGAGGDGIGGVGGGKGGNGADGEVGGAGGAG
GSGPNTSPGGNGGQGGQGGSGGAGGAAGAGGAGGGANGTAGNGGQGGAGG
TGGAGAASSATNGGSGGAGGTGGDGGSGGAGGTGGAGGTGGAAGDGGQGG
QGGAGGGAGGQGGAGGAGGTGGNGGNITGGTAGTAGAAGNGGAAGKGGAG
GQGGTGGGTGGQGGAGGDGGAGGTGGDRTVGGGTVPAGSGGQGGNAGGGG
AGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGANGG
AGGAGGSGGGTGGNGGAGGDAGDAGNGGNGNGTGNGGNGGNGGIAGMGGN
GGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGNGGAAGTGGTG
GDGGLTGTGGTGGSGGTGGDGGNGGNGADNTANMTAQAGGDGGNGGDGGF
GGGAGAGGGGLTAGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTG
GNGGTGGTGGAGIGSLGGGTGGDGGNGGNGGTGGEGGEVGGAGGTGGAAG
NGGDGGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGSGGDGGTGGSGGAA
GNGGNGGNAGAGGNGNGGTGGAGGIGGTGGNGGDAEPGVPPGAGGAGGAG
TTGGKGGTGGNGSGTGSGGTGGDGGTGGGGGNGGTGWNGGKGDTGSGGGA
GDGGKAPAGGTGGAGGDGGAGGKGGSGGV
>Rv3514 PE_PGRS57, PE-PGRS FAMILY PROTEIN
MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGADEVSAR
IAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVL
GVINAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQA
GGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGG
TGGAGGPGGLIWGGGGAGGVGGAGGGTGGAGGRAELLFGAGGAGGAGTDG
GPGATGGTGGHGGVGGDGGWLAPGGAGGAGGQGGAGGAGSDGGALGGTGG
TGGTGGAGGAGGRGALLLGAGGQGGLGGAGGQGGTGGAGGDGVLGGVGGT
GGKGGVGGVAGLGGAGGAAGQLFSASGAAGNAGVGGAGGQGGDGGAGGAG
ADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGQGGAGGAGG
AGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGG
AGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGS
GGAGGTGGQGGAGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGT
GGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAG
GAGKAGGSSSAGGTNSSGSAGGTGRQSGTGGAGGAGADNPTGIGGTGGDG
GTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGSSGAGGTNGSGGA
GGTDGQGGAGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGT
GGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAG
GSGGSSCAGGTNGSGGAGGTCGQVVAGGAGISFSNGSNGGTGGTGGVGGT
GGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGGANFNGGTGGTGGTG
GKGGLNTDGLSSATSGTGGTGGTGGKGGTGGAGDDSAGGTGGTGGAGGNA
GAGGLANTGGTAGNAGIGGDGGQGGNGGQGDSGSGLGGQPGFAGGAGGKG
GAGGSSGAGGTNGSGGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGT
GGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGGANFNGGTGGTGGTG
GTGGKGGMGGIAGDGGPGGDGGNAGVGGKGGTNGNGGSGGTGGTGGAGGN
AGAGGLANTGGTAGNAGIGGDGGQGGNGGQGDSGSGLGGQPGFAGGPGGK
GGAGGNAGTGGTNGSGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGT
GGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGGANFNGGTGGTGGTG
GTGGKGGMGGIAGDGGPGGDGGNAGVGGKGGTNGNGGSGGTGGTGGPGGS
GGAPTGSGTGGKGGAGGDGGDGADGGAATGVGDGGDGGNGGNGGNGGTGV
GSPGGLGGAGGTGGLGGAGAGGGADGDDGDDGQPGNNGS
>Rv3590c PE_PGRS58, PE-PGRS FAMILY PROTEIN
MSFVIVAPEALMSVASEVAGIGSALNAANAAAAAPTTGVLAAAADEVSAA
MAALFGAHAQEYQRLSAQAAGFHAQFVQALNAGVNSYASAEAANASPLQA
VEQQVLGLINGPAQTLLGRPLIGNGADGAPGTGQPGGPGGLLWGNGGNGG
SGVAGVGGPGGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGN
GGAGGFGGVGTTVSGNGGAGGAAGAFGNGGVGGAGGAAVIGGLPGNGGAG
GNAGLIGAGGDGGVGGVGAPGTNGMNPPPNQTSQAANGSPGANNGAGSGG
AGLPGNPGAVPGRAGGAGGLGGSGSDTSEGPVTGGNGGNGGDGGPGAPGG
NGAPGGIGVNTGTGWAYGGNGGNGGDGGAGARGGDGGNGGNGLALNGGNG
IGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAG
TGGVGGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGAGGKGGSGLV
GGDGGNGGAGGAGGNGGKGGAGGAGGGAGMFSQPGVHGAGGTGGQGGAGG
AGGAGGAAGAGTVVAGNPGDPGGFGAAGADGLPG
>Rv3595c PE_PGRS59, PE-PGRS FAMILY PROTEIN
MSFVIAVPEFLSAAATDLANLGSTISAANAAASIPTTGVLAAGADDVSAA
IAALFGAHAQAYQTISAQAATFHAQFVQTLSAGAGAYANAEAANVQQSLL
NAINAPTQALLGRPLIGDGADGTAPGQNGGAGGLLYGNGGNGAAGVNAGI
AGGSGGAAGLIGNGGSGGAGGAGAAGGSGGQGGLLYGNGGAGGNGGAATI
PGGNGGAGGAGGNAWLFGNGGAGGLGAAGAAGAAGVNPLTVPAGQGSMGN
NGEPGGPGQPGTEFGQTGGTGGTGGTGLSVGGTGGTGGTGGTGGAGGSGG
RGGLLVGDGGAGGIGGTGGEGGIGARGGTGGQGGMGGAGQPGVGGDAGDG
GNGGIGGDGGAGGDGGAGGAGGAGGLFGVSGSSGLGGAAGSGGNGGGGGE
PGVAGSPGVGPAGRGGDGNLGQFGPEGAPGQPGQPGQPG
>Rv0532 PE_PGRS6, PE-PGRS FAMILY PROTEIN
MSNLLVTPELVAAAAADLAGIGSAIGAANAAAGAPTMALLAAGADEVSAA
VAAVFSSYAQQYQALSAAAAAFHDQFVRALAAGAGAYAGAEAANVEQQLL
NAINAPTLALLGRPLIGNGADGAAGTGQAGGAGGLLYGNGGNGGSGAAGQ
AGGAGGAAGLIGHGGTGGAVTGVSTTGGPGGHGGDAGLYGFGGAGGAGGF
GQSGAAGGAGGAGGWLYGDGGDGGAGDNGGNESGTGVSAVGGVGGAGGAG
GLLFGNGGDGGVGGDGGDGSSTQDSGGDGGAGGAGGAGGWLLGNGGAGGA
GGAASIKVATGGLGGDGGDAGLFGFGGDGGWGGRGVDARFGAAGGAAGAG
GAGGWLYGDGGAGGVGGVGGAVFSLSSGDGGAGGAGGGGGWLFGNGGDGG
AGGGGGGRFGSGSGAGGDGAVGGAGGAGAWFGNGGAGGVGGGGGRGTTAI
GGDGGAGGAGGAGGWLYGDGGAGGAGGGGGRGGTGNDGGDGGDGGRGGDA
QLLGNGGDGGAGGAGGPAGLALPPGPARPAGAAVPAVRCSAAPARPARTA
DPWLAPIFARSTLRHSHHLGGIAQTGAVADQQGQIAGLGRAGRQ
>Rv3652 PE_PGRS60, PE-PGRS FAMILY-RELATED PROTEIN
MSYVIAAPEALVAAATDLATLGSTIGAANAAAAGSTTALLTAGADEVSAA
IAAYSECTARPIRHSVRGRRRSMSGSCRPWPQVGAPMRPPRPPASRRCRA
RSIC
>Rv3653 PE_PGRS61, PE-PGRS FAMILY-RELATED PROTEIN
MLNAPTQALLGRPLVGNGANGAPGTGANGGDGGILFGSGGAGGSGAAGMA
GGNGGAAGLFGNGGAGGAGGSATAGAAGAGGNGGAGGLLFGTAGAGGNGG
LSLGLGVAGGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGEDGTTPGGNGG
AGGVAGLFGDGGNGGNAGVGTPAGNVGAGGTGGLLLGQDGMTGLT
>Rv3812 PE_PGRS62, PE-PGRS FAMILY PROTEIN
MSFVVTVPEAVAAAAGDLAAIGSTLREATAAAAGPTTGLAAAAADDVSIA
VSQLFGRYGQEFQTVSNQLAAFHTEFVRTLNRGAAAYLNTESANGGQLFG
QIEAGQRAVSAAAAAAPGGAYGQLVANTATNLESLYGAWSANPFPFLRQI
IANQQVYWQQIAAALANAVQNFPALVANLPAAIDAAVQQFLAFNAAYYIQ
QIISSQIGFAQLFATTVGQGVTSVIAGWPNLAAELQLAFQQLLVGDYNAA
VANLGKAMTNLLVTGFDTSDVTIGTMGTTISVTAKPKLLGPLGDLFTIMT
IPAQEAQYFTNLMPPSILRDMSQNFTNVLTTLSNPNIQAVASFDIATTAG
TLSTFFGVPLVLTYATLGAPFASLNAIATSAETIEQALLAGNYLGAVGAL
IDAPAHALDGFLNSATVLDTPILVPTGLPSPLPPTVGITLHLPFDGILVP
PHPVTATISFPGAPVPIPGFPTTVTVFGTPFMGMAPLLINYIPQQLALAI
KPAA
>Rv0578c PE_PGRS7, PE-PGRS FAMILY PROTEIN
MSFVIATPEMLTTAATDLAKIGSTITAANTAAAAVAKVLPASADEVSVAV
AALFGTHAQEYQTVSAQVATFHDRFVQTLSAAASSYVAAEAVNVEQSLLA
AVNAPTQALFGRPLIGNGADGSPGTGQAGGPGGILYGNGGNGGSGAPGQR
GGAGGAAGLIGNGGNGGAGGVGTTGGAGGHGGAGGWLYGNGGAGGFGGAG
AVGGNGGAGGTAGLFGVGGAGGAGGNGIAGVTGTSASTPGGSGTAGGAGG
IGGNGGAGGAGGVLMGNGGNGGAGGEGGPGGAGGAGASGAHATNLGADGQ
AGGNGGNGGAGGTGGVGGPGGGHGLLGLGGSHGAGGAGGSGGDGGAPGDG
GNGATGTWGHNLGAGGTGGNGGNPGAGGAGGAGGASVGGSAHGANGAPGT
TSTSGGNGGDGGKGADAISSGQTGANGGRGGDGGQVGNGGAGGAGGRGGA
GGLGFGSEAPGRPGGAGGTGGAGGNGGTQAGDGGTGGAGGAGGDGGSGGA
GSIGFNASAPGAAGSPGGNGGNGGPGGAGGEGGAGGLALAASGQNGSQGA
GGDGGAGGNGGTPGNGGHGAAGALGVNGGVGGAGGHGGDPGVGGAGGQGG
SGSTPGANGAPGNTPTSGGNGGNGGRGADATGFGQTGASGGRGGDGGLVG
NGGAGGAGGNGSKGLPGLGRLGNPGLDGGTGGNGGAGGSGGAWAGNGGTG
GAGGTGGVGGTGGSGSDGVNGSSAGADGHPGGTGGVGGTGGKGGDGGDGG
AAPNGVAGSQGPGGAGGDGGTGGVGGNGGRGIDGADGATAGARGQDGGAG
GAGGKGGRGGTGGPGGAGPAGTTGSQGAGGNGGSGGTGGDPGDGGNGANG
SVFTNNGIGGNGGNGGNAGPSGAGGSGGAGSTFGATGSSSSIHVNGGNGG
NGGNGDHALSGNGAAGGNGGNGGNGSLRGSGGAGGHGGNGGNASRGMGGD
GGTGGAGGNAGQIGNGGAGGNGGDGGTGSDGNPGAITGSGGRGGDGGVGG
QGGSVAGDGADGGRGGAGGTGGTGLRGTTGATGATGTFDAGADGHGGNGG
TGGVGGTGGAGGGGGNGGAGGKALSPTGNNGSQGAGGDGGAGGAGGTGGT
GGDGGRGAHGTLFSSLAGTGGTGGNGGTGGTGGTGGAGGAGGTGSTLGAT
GATGAAGRAGNGGVGGSGGLGSAFGPGGTGGMGGAGGTSTVSAGGDGGRG
GFGGDGLDASSGGNGGDGGHGGDGFRTAGAGGRGGDGGKGADPGGLFPIP
GAGGKGGTGGTGGTAHLGPLAIIGQSGQPGQFGSPGADGRGGAGGAGGGG
GAGGSF
>Rv0742 PE_PGRS8, PE-PGRS FAMILY PROTEIN
MSFVIAAPEAIAAAATDLASIGSTIGAANAAAAANTTAVLAAGADQVSVA
IAAAFGAHGQAYQALSAQAATFHIQFVQALTAGAGSYAAAEAASAASITS
PLLDAINAPFLAALGRPLIGNGADGAPGTGAAGGAGGLLFGNGGAGGSGA
PGGAGGLLFGNGGAGGPGASGGALG
>Rv0746 PE_PGRS9, PE-PGRS FAMILY PROTEIN
MSFVLAMPEVLGSAATDLAALGSVLGAADAAAAATTTGIVAAAQDEVSAA
IAALFSAHGRAYQVASAQAAAVHAQFVEALSAGAGAYASAEAAGAAVLAN
PAQSVQQDLLAAVNAQSVALTGRPLIGNGANGAPGTGANGAPGGWLLGNG
GAGGSAAAGSGLPGGAGGAAGLFGTGGAGGAGGSSTVGDGEAGGAGGSGG
WLLGTGGVGGVGGLGAGAGGAGGVGGAGGLLGAGGHGGAGGLGAVTGGVG
GTGGAGGLLAGLLAGPGGAGGTGGRGFLNNGGVGGAGGNAGLLFGAGGTG
GSGGAGLGGDGGAGGAGGNTGVLFGNAGSGGTGGFGDTDGGAGGAGGDAG
WLGSGGVGGAGGFGETGDGGVGGAGGKAGLLIGNGGAGGAGGQGAVTGGT
GGAGGDGVLIGNGGNAGIGGTGPTAGDTGAGGISGLLLGADGFNTPASAS
PLHTLKQQALAAINAPTQTLTGRPLIGNGTPGAVGSGATGAPGGWLLGDG
GAGGSGAAGSGAPGGAGGAAGLWGTGGAGGAGGSSAGGGGAGGAGGAGGW
LLGDGGAGGIGGASTVLGGTGGGGGVGGLWGAGGAGGAGGTGLVGGDGGA
GGAGGTGGLLAGLIGAGGGHGGTGGLSTNGDGGVGGAGGNAGMLAGPGGA
GGAGGDGENLDTGGDGGAGGSAGLLFGSGGAGGAGGFGFLGGDGGAGGNA
GLLLSSGGAGGFGGFGTAGGVGGAGGNAGWLGFGGAGGVGGSAGLIGTGG
NGGNGGTGANAGSPGTGGAGGLLLGQNGLNGLP
>Rv0096 PPE1, PPE FAMILY PROTEIN
MAIPPEVHSGLLSAGCGPGSLLVAAQQWQELSDQYALACAELGQLLGEVQ
ASSWQGTAATQYVAAHGPYLAWLEQTAINSAVTAAQHVAAAAAYCSALAA
MPTPAELAANHAIHGVLIATNFFGINTVPIALNEADYVRMWLQAADTMAA
YQAVADAATVAVPSTQPAPPIRAPGGDAADTRLDVLSSIGQLIRDILDFI
ANPYKYFLEFFEQFGFSPAVTVVLALVALQLYDFLWYPYYASYGLLLLPF
FTPTLSALTALSALIHLLNLPPAGLLPIAAALGPGDQWGANLAVAVTPAT
AAVPGGSPPTSNPAPAAPSSNSVGSASAAPGISYAVPGLAPPGVSSGPKA
GTKSPDTAADTLATAGAARPGLARAHRRKRSESGVGIRGYRDEFLDATAT
VDAATDVPAPANAAGSQGAGTLGFAGTAPTTSGAAAGMVQLSSHSTSTTV
PLLPTTWTTDAEQ
>Rv0442c PPE10, PPE FAMILY PROTEIN
MTSPHFAWLPPEINSALMFAGPGSGPLIAAATAWGELAEKLLASIASLGS
VTSELTSGAWLGPSAAAMMAVATQYLAWLSTAAAQAEQAAAQAMAIATAF
EAALAATVQPAVVAANRGLMQLLAATNWFGQNAPALMDVEAAYEQMWALD
VAAMAGYHFDASAAVAQLAPWQQVLRNLGIDIGKNGQINLGFGNTGSGNI
GNNNIGNNNIGSGNTGTGNIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNI
GFGITGDHQMGFGGFNSGSGNIGFGNSGTGNVGLFNSGSGNIGIGNSGSL
NSGIGTSGTINAGLGSAGSLNTSFWNAGMQNAALGSAAGSEAALVSSAGY
ATGGMSTAALSSGILASALGSTGGLQHGLANVLNSGLTNTPVAAPASAPV
GGLDSGNPNPGSGSAAAGSGANPGLRSPGTSYPSFVNSGSNDSGLRNTAV
REPSTPGSGIPKSNFYPSPDRESAYASPRIGQPVGSE
>Rv0453 PPE11, PPE FAMILY PROTEIN
MTSALIWMASPPEVHSALLSSGPGPGPVLAAATGWSSLGREYAAVAEELG
ALLAAVQAGVWQGPSAESFAAACLPYLSWLTQASADCAAAAARLEAVTAA
YAAALVAMPTLAELAANHATHGAMVATNFFGINTIPIAVNEADYVRMWLQ
AATTMATYQAVADSAVRSIPDSVPPPRILKSNAQSQHSSSNNSGGADPVD
DFIAEILKIITGGRVIWDPEAGTVNGLPYDAYTNPGTLMWWIARSLELLQ
DFQEFAKLLFTNPVKAFQFLVDLILFDWPTHMLQLATWLAENPQLLVAAL
TPAISGLGAVSGLAGLTGLVPQPPVVPAPAPDAVVPTVLPLAGTATPTTA
PASAPAAGAAPGPPAGTATATSASVPTSAGGFPPYLVGSGPGIDFDAGTP
AGSRRAQPAADNVTAVAAAQVSARHQARRRRRAAAKERGNADEFVDMDSG
PAIPPSGERDAWASNSGVGGLGFAGTASNETVAAPAGLTTLADDEFQCGP
RMPMLPGAWDLGTWDRGD
>Rv0755c PPE12, PPE FAMILY PROTEIN
MVGFAWLPPETNSLRMYLGAGSRPLLAAAGAWDGLAEELHAAASSFGSVT
SELAGGAWQGPASAAMANAAGPYASWLTAAGAQAELAARQARAAAGAFEE
ALAGVVHPAVVQANRVRTWLLAVSNVFGQNAPAIAAMESTYEQMWAQDVA
VMAGYHAASSAAAAQLASWQPALPNINLGVGNIGNLNVGNGNTGDYNLGN
GNLGNANFGGGNGSAFHGQISSFNVGSGNIGNFNLGSGNGNVGIGPSSFN
VGSGNIGNANVGGGNSGDNNFGFGNFGNANIGIGNAGPNMSSPAVPTPGN
GNVGIGNGGNGNFGGGNTGNANIGLGNVGDGNVGFGNSGSYNFGFGNTGN
NNIGIGLTGSNQIGFGGLNSGSGNIGFGNSGTGNIGFFNSGSGNFGVGNS
GVTNTGVANSGNINTGFGNSGFINTGFGNALSVNTGFGNSGQANTGIGNA
GDFNTGNFNGGIINTGSFNSGAFNSGSFNGGDANSGFLNSGLTNTGFANS
GNINTGGFNAGNLNTGFGNTTDGLGENSGFGNAGSGNSGFNNSGRGNSGA
QNVGNLQISGFANSGQSVTGYNNSVSVTSGFGNKGTGLFSGFMSGFGNTG
FLQSGFGNLEANPDNNSATSGFGNSGKQDSGGFNSIDFVSGFFHR
>Rv0878c PPE13, PPE FAMILY PROTEIN
MNFMVLPPEVNSARIYAGAGPAPMLAAAVAWDGLAAELGMAAASFSLLIS
GLTAGPGSAWQGPAAAAMAAAAAPYLSWLNAATARAEGAAAGAKAAAAVY
EAARAATAHPALVAANRNQLLSLVLSNLFGQNLPAIAATEASYEQLWAQD
VAAMVGYHGGASTVASQLTPWQQLLSVLPPVVTAAPAGAVGVPAALAIPA
LGVENIGVGNFLGIGNIGNNNVGSGNTGDYNFGIGNIGNANLGNGNIGNA
NLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNIGSGNEGSGNLGFGNAGDD
NTGWGNSGDTNTGGFNSGDLNTGIGSPVTQGVANSGFGNTGTGHSGFFNS
GNSGSGFQNLGNGSSGFGNASDTSSGFQNAGTALTRASSTWADSPRAWPI
RAPSRLQVWRTRATTARECSIRVIISRVSSTGAPPQKKVGNSG
>Rv0915c PPE14, PPE FAMILY PROTEIN
MDFGLLPPEVNSSRMYSGPGPESMLAAAAAWDGVAAELTSAAVSYGSVVS
TLIVEPWMGPAAAAMAAAATPYVGWLAATAALAKETATQARAAAEAFGTA
FAMTVPPSLVAANRSRLMSLVAANILGQNSAAIAATQAEYAEMWAQDAAV
MYSYEGASAAASALPPFTPPVQGTGPAGPAAAAAATQAAGAGAVADAQAT
LAQLPPGILSDILSALAANADPLTSGLLGIASTLNPQVGSAQPIVIPTPI
GELDVIALYIASIATGSIALAITNTARPWHIGLYGNAGGLGPTQGHPLSS
ATDEPEPHWGPFGGAAPVSAGVGHAALVGALSVPHSWTTAAPEIQLAVQA
TPTFSSSAGADPTALNGMPAGLLSGMALASLAARGTTGGGGTRSGTSTDG
QEDGRKPPVVVIREQPPPGNPPR
>Rv1039c PPE15, PPE FAMILY PROTEIN
MDFGALPPEINSARMYAGAGAGPMMAAGAAWNGLAAELGTTAASYESVIT
RLTTESWMGPASMAMVAAAQPYLAWLTYTAEAAAHAGSQAMASAAAYEAA
YAMTVPPEVVAANRALLAALVATNVLGINTPAIMATEALYAEMWAQDALA
MYGYAAASGAAGMLQPLSPPSQTTNPGGLAAQSAAVGSAAATAAVNQVSV
ADLISSLPNAVSGLASPVTSVLDSTGLSGIIADIDALLATPFVANIINSA
VNTAAWYVNAAIPTAIFLANALNSGAPVAIAEGAIEAAEGAASAAAAGLA
DSVTPAGLGASLGEATLVGRLSVPAAWSTAAPATTAGATALEGSGWTVAA
EEAGPVTGMMPGMASAAKGTGAYAGPRYGFKPTVMPKQVVV
>Rv1168c PPE17, PPE FAMILY PROTEIN
MDFTIFPPEFNSLNIQGSARPFLVAANAWKNLSNELSYAASRFESEINGL
ITSWRGPSSTIMAAAVAPFRAWIVTTASLAELVADHISVVAGAYEAAHAA
HVPLPVIETNRLTRLALATTNIFGIHTPAIFALDALYAQYWSQDGEAMNL
YATMAAAAARLTPFSPPAPIANPGALARLYELIGSVSETVGSFAAPATKN
LPSKLWTLLTKGTYPLTAARISSIPVEYVLAFVEGSNMGQMMGNLAMRSL
TPTLKGPLELLPNAVRPAVSATLGNADTIGGLSVPPSWVADKSITPLAKA
VPTSAPGGPSGTSWAQLGLASLAGGAVGAVAARTRSGVILRSPAAG
>Rv1196 PPE18, PPE FAMILY PROTEIN
MVDFGALPPEINSARMYAGPGSASLVAAAQMWDSVASDLFSAASAFQSVV
WGLTVGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYET
AYGLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAA
AMFGYAAATATATATLLPFEEAPEMTSAGGLLEQAAAVEEASDTAAANQL
MNNVPQALQQLAQPTQGTTPSSKLGGLWKTVSPHRSPISNMVSMANNHMS
MTNSGVSMTNTLSSMLKGFAPAAAAQAVQTAAQNGVRAMSSLGSSLGSSG
LGGGVAANLGRAASVGSLSVPQAWAAANQAVTPAARALPLTSLTSAAERG
PGQMLGGLPVGQMGARAGGGLSGVLRVPPRPYVMPHSPAAG
>Rv1361c PPE19, PPE FAMILY PROTEIN
MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAASAFQSVV
WGLTTGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYET
AYGLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAA
AMFGYAATAATATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQL
MNNVPQALQQLAQPTKSIWPFDQLSELWKAISPHLSPLSNIVSMLNNHVS
MTNSGVSMASTLHSMLKGFAPAAAQAVETAAQNGVQAMSSLGSQLGSSLG
SSGLGAGVAANLGRAASVGSLSVPQAWAAANQAVTPAARALPLTSLTSAA
QTAPGHMLGGLPLGQLTNSGGGFGGVSNALRMPPRAYVMPRVPAAG
>Rv0256c PPE2, PPE FAMILY PROTEIN
MTAPIWMASPPEVHSALLSSGPGPGPLLVSAEGWHSLSIAYAETADELAA
LLAAVQAGTWDGPTAAVYVAAHTPYLAWLVQASANSAAMATRQETAATAY
GTALAAMPTLAELGANHALHGVLMATNFFGINTIPIALNESDYARMWIQA
ATTMASYQAVSTAAVAAAPQTTPAPQIVKANAPTAASDEPNQVQEWLQWL
QKIGYTDFYNNVIQPFINWLTNLPFLQAMFSGFDPWLPSLGNPLTFLSPA
NIAFALGYPMDIGSYVAFLSQTFAFIGADLAAAFASGNPATIAFTLMFTT
VEAIGTIITDTIALVKTLLEQTLALLPAALPLLAAPLAPLTLAPASAAGG
FAGLSGLAGLVGIPPSAPPVIPPVAAIAPSIPTPTPTPAPAPAPTAVTAP
TPPPGPPPPPVTAPPPVTGAGIQSFGYLVGDLNSAAQARKAVGTGVRKKT
PEPDSAEAPAAAAAPEEQVQPQRRRRPKIKQLGRGYEYLDLDPETGHDPT
GSPQGAGTLGFAGTTHKASPGQVAGLITLPNDAFGGSPRTPMMPGTWDTD
SATRVE
>Rv1387 PPE20, PPE FAMILY PROTEIN
MTEPWIAFPPEVHSAMLNYGAGVGPMLISATQNGELSAQYAEAASEVEEL
LGVVASEGWQGQAAEAFVAAYMPFLAWLIQASADCVEMAAQQHVVIEAYT
AAVELMPTQVELAANQIKLAVLVATNFFGINTIPIAINEAEYVEMWVRAA
TTMATYSTVSRSALSAMPHTSPPPLILKSDELLPDTGEDSDEDGHNHGGH
SHGGHARMIDNFFAEILRGVSAGRIVWDPVNGTLNGLDYDDYVYPGHAIW
WLARGLEFFQDGEQFGELLFTNPTGAFQFLLYVVVVDLPTHIAQIATWLG
QYPQLLSAALTGVIAHLGAITGLAGLSGLSAIPSAAIPAVVPELTPVAAA
PPMLAVAGVGPAVAAPGMLPASAPAPAAAAGATAAGPTPPATGFGGFPPY
LVGGGGPGIGFGSGQSAHAKAAASDSAAAESAAQASARAQARAARRGRSA
AKARGHRDEFVTMDMGFDAAAPAPEHQPGARASDCGAGPIGFAGTVRKEA
VVKAAGLTTLAGDDFGGGPTMPMMPGTWTHDQGVFDEHR
>Rv1705c PPE22, PPE FAMILY PROTEIN
MDFGALPPEVNSGRMYCGPGSAPMVAAASAWNGLAAELSVAAVGYERVIT
TLQTEEWLGPASTLMVEAVAPYVAWMRATAIQAEQAASQARAAAAAYETA
FAAIVPPPLIAANRARLTSLVTHNVFGQNTASIAATEAQYAEMWAQDAMA
MYGYAGSSATATKVTPFAPPPNTTSPSAAATQLSAVAKAAGTSAGAAQSA
IAELIAHLPNTLLGLTSPLSSALTAAATPGWLEWFINWYLPISQLFYNTV
GLPYFAIGIGNSLITSWRALGWIGPEAAEAAAAAPAAVGAAVGGTGPVSA
GLGNAATIGKLSLPPNWAGASPSLAPTVGSASAPLVSDIVEQPEAGAAGN
LLGGMPLAGSGTGTGGAGPRYGFRVTVMSRPPFAG
>Rv1706c PPE23, PPE FAMILY PROTEIN
MTLDVPVNQGHVPPGSVACCLVGVTAVADGIAGHSLSNFGALPPEINSGR
MYSGPGSGPLMAAAAAWDGLAAELSSAATGYGAAISELTNMRWWSGPASD
SMVAAVLPFVGWLSTTATLAEQAAMQARAAAAAFEAAFAMTVPPPAIAAN
RTLLMTLVDTNWFGQNTPAIATTESQYAEMWAQDAAAMYGYASAAAPATV
LTPFAPPPQTTNATGLVGHATAVAALRGQHSWAAAIPWSDIQKYWMMFLG
ALATAEGFIYDSGGLTLNALQFVGGMLWSTALAEAGAAEAAAGAGGAAGW
SAWSQLGAGPVAASATLAAKIGPMSVPPGWSAPPATPQAQTVARSIPGIR
SAAEAAETSVLLRGAPTPGRSRAAHMGRRYGRRLTVMADRPNVG
>Rv1787 PPE25, PPE FAMILY PROTEIN
MDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATGYASVIA
ELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAA
FVMTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVA
MYGYAAASASASRLIPFAAPPKTTNSAGVVAQVAAVAAMPGLLQRLSSAA
SVSWSNPNDWWLVRLLGSITPTERTTIVRLLGQSYFATGMAQFFASIAQQ
LTFGPGGTTAGSGGAWYPTPQFAGLGASRAVSASLARANKIGALSVPPSW
VKTTALTESPVAHAVSANPTVGSSHGPHGLLRGLPLGSRITRRSGAFAHR
YGFRHSVVARPPSAG
>Rv1789 PPE26, PPE FAMILY PROTEIN
MDFGALPPEVNSVRMYAGPGSAPMVAAASAWNGLAAELSSAATGYETVIT
QLSSEGWLGPASAAMAEAVAPYVAWMSAAAAQAEQAATQARAAAAAFEAA
FAATVPPPLIAANRASLMQLISTNVFGQNTSAIAAAEAQYGEMWAQDSAA
MYAYAGSSASASAVTPFSTPPQIANPTAQGTQAAAVATAAGTAQSTLTEM
ITGLPNALQSLTSPLLQSSNGPLSWLWQILFGTPNFPTSISALLTDLQPY
ASFFYNTEGLPYFSIGMGNNFIQSAKTLGLIGSAAPAAVAAAGDAAKGLP
GLGGMLGGGPVAAGLGNAASVGKLSVPPVWSGPLPGSVTPGAAPLPVSTV
SAAPEAAPGSLLGGLPLAGAGGAGAGPRYGFRPTVMARPPFAG
>Rv1790 PPE27, PPE FAMILY PROTEIN
MDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATGYASVIA
ELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAA
FVMTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVA
MYGYAAASASASRLIPFAAPPKTTNSAGVVAQAVASVSWPNPNDWWLVRL
LGSITPTERTTIVRLLGQSYLATGMARFLTSIAQQLTFGPGGTTAGSGGA
WYPTPQFAGLGAGPAVSASLARAEPVGRLSVPPSWAVAAPAFAEKPEAGT
PMSVIGEASSCGQGGLLRGIPLARAGRRTGAFAHRYGFRHSVITRSPSAG
>Rv1801 PPE29, PPE FAMILY PROTEIN
MDFGLLPPEINSGRMYTGPGPGPMLAAATAWDGLAVELHATAAGYASELS
ALTGAWSGPSSTSMASAAAPYVAWMSATAVHAELAGAQARLAIAAYEAAF
AATVPPPVIAANRAQLMVLIATNIFGQNTPAIMMTEAQYMEMWAQDAAAM
YGYAGSSATASRMTAFTEPPQTTNHGQLGAQSSAVAQTAATAAGGNLQSA
FPQLLSAVPRALQGLALPTASQSASATPQWVTDLGNLSTFLGGAVTGPYT
FPGVLPPSGVPYLLGIQSVLVTQNGQGVSALLGKIGGKPITGALAPLAEF
ALHTPILGSEGLGGGSVSAGIGRAGLVGKLSVPQGWTVAAPEIPSPAAAL
QATRLAAAPIAATDGAGALLGGMALSGLAGRAAAGSTGHPIGSAAAPAVG
AAAAAVEDLATEANIFVIPAMDD
>Rv0280 PPE3, PPE FAMILY PROTEIN
MTLWMASPPEVHSALLSSGPGPGSVLSAAGVWSSLSAEYAAVADELIGLL
GAVQTGAWQGPSAAAYVAAHAPYLAWLMRASETSAEAAARHETVAAAYTT
AVAAMPTLVELAANHTLHGVLVATNFFGINTIPIALNEADYARMWTQAAS
TMATYQAVAEAAVASAPQTTPAPPILAAEAADDDHDHDHDHGGEPTPLDY
LVAEILRIISGGRLIWDPAEGTMNGIPFEDYTDAAQPIWWVVRAIEFSKD
FETFVQELFVNPVEAFQFYFELLLFDYPTHIVQIVEALSQSPQLLAVALG
SVISNLGAVTGFAGLSGLAGMQPAAIPALAPVAAAPSTLPAVAMAPTMAA
PGAAVASAAAPASAPAASTVASATPAPPPAPGAAGFGYPYAIAPPGIGFG
SGMSASASAQRKAPQPDSAAAAAAAAAVRDQARARRRRRVTRRGYGDEFM
DMNIDVDPDWGPPPGEDPVTSTVASDRGAGHLGFAGTARREAVADAAGMT
TLAGDDFGDGPTTPMVPGSWDPDRDAPGSAEPGDRG
>Rv1802 PPE30, PPE FAMILY PROTEIN
MDFGVLPPEINSGRMYAGPGSGPMLAAAAAWDGLATELQSTAADYGSVIS
VLTGVWSGQSSGTMAAAAAPYVAWMSATAALAREAAAQASAAAAAYEAAF
AATVPPPVVAANRAELAVLAATNIFGQNTGAIAAAEARYAEMWAQDAAAM
YGYAGSSSVATQVTPFAAPPPTTNAAGLATQGVAVAQAVGASAGNARSLV
SEVLEFLATAGTNYNKTVASLMNAVTGVPYASSVYNSMLGLGFAESKMVL
PANDTVISTIFGMVQFQKFFNPVTPFNPDLIPKSALGAGLGLRSAISSGL
GSTAPAISAGASQAGSVGGMSVPPSWAAATPAIRTVAAVFSSTGLQAVPA
AAISEGSLLSQMALASVAGGALGGAAARATGGFLGGGRVTAVKKSLKDSD
SPDKLRRVVAHMMEKPESVQHWHTDEDGLDDLLAELKKKPGIHAVHMAGG
NKAEIAPTISESG
>Rv1807 PPE31, PPE FAMILY PROTEIN
LDFATLPPEINSARMYSGAGSAPMLAAASAWHGLSAELRASALSYSSVLS
TLTGEEWHGPASASMTAAAAPYVAWMSVTAVRAEQAGAQAEAAAAAYEAA
FAATVPPPVIEANRAQLMALIATNVLGQNAPAIAATEAQYAEMWSQDAMA
MYGYAGASAAATQLTPFTEPVQTTNASGLAAQSAAIAHATGASAGAQQTT
LSQLIAAIPSVLQGLSSSTAATFASGPSGLLGIVGSGSSWLDKLWALLDP
NSNFWNTIASSGLFLPSNTIAPFLGLLGGVAAADAAGDVLGEATSGGLGG
ALVAPLGSAGGLGGTVAAGLGNAATVGTLSVPPSWTAAAPLASPLGSALG
GTPMVAPPPAVAAGMPGMPFGTMGGQGFGRAVPQYGFRPNFVARPPAAG
>Rv1808 PPE32, PPE FAMILY PROTEIN
MDFGALPPEINSGRMYAGPGSGPLLAAAAAWDALAAELYSAAASYGSTIE
GLTVAPWMGPSSITMAAAVAPYVAWISVTAGQAEQAGAQAKIAAGVYETA
FAATVPPPVIEANRALLMSLVATNIFGQNTPAIAATEAHYAEMWAQDAAA
MYGYAGSSATASQLAPFSEPPQTTNPSATAAQSAVVAQAAGAAASSDITA
QLSQLISLLPSTLQSLATTATATSASAGWDTVLQSITTILANLTGPYSII
GLGAIPGGWWLTFGQILGLAQNAPGVAALLGPKAAAGALSPLAPLRGGYI
GDITPLGGGATGGIARAIYVGSLSVPQGWAEAAPVMRAVASVLPGTGAAP
ALAAEAPGALFGEMALSSLAGRALAGTAVRSGAGAARVAGGSVTEDVAST
TTIIVIPAD
>Rv1809 PPE33, PPE FAMILY PROTEIN
MDFGLQPPEITSGEMYLGPGAGPMLAAAVAWDGLAAELQSMAASYASIVE
GMASESWLGPSSAGMAAAAAPYVTWMSGTSAQAKAAADQARAAVVAYETA
FAAVVPPPQIAANRSQLISLVATNIFGQNTAAIAATEAEYGEMWAQDTMA
MFGYASSSATASRLTPFTAPPQTTNPSGLAGQAAATGQATALASGTNAVT
TALSSAAAQFPFDIIPTLLQGLATLSTQYTQLMGQLINAIFGPTGATTYQ
NVFVTAANVTKFSTWANDAMSAPNLGMTEFKVFWQPPPAPEIPKSSLGAG
LGLRSGLSAGLAHAASAGLGQANLVGDLSVPPSWASATPAVRLVANTLPA
TSLAAAPATQIPANLLGQMALGSMTGGALGAAAPAIYTGSGARARANGGT
PSAEPVKLEAVIAQLQKQPDAVRHWNVDKADLDGLLDRLSKQPGIHAVHV
SNGDKPKVALPDTQLGSH
>Rv2108 PPE36, PPE FAMILY PROTEIN
MPNFWALPPEINSTRIYLGPGSGPILAAAQGWNALASELEKTKVGLQSAL
DTLLESYRGQSSQALIQQTLPYVQWLTTTAEHAHKTAIQLTAAANAYEQA
RAAMVPPAMVRANRVQTTVLKAINWFGQFSTRIADKEADYEQMWFQDALV
MENYWEAVQEAIQSTSHFEDPPEMADDYDEAWMLNTVFDYHNENAKEEVI
HLVPDVNKERGPIELVTKVDKEGTIRLVYDGEPTFSYKEHPKF
>Rv2123 PPE37, PPE FAMILY PROTEIN
MTFPMWFAVPPEVPSAWLSTGMGPGPLLAAARAWHALAAQYTEIATELAS
VLAAVQASSWQGPSADRFVVAHQPFRYWLTHAATVATAAAAAHETAAAGY
TSALGGMPTLAELAANHAMHGALVTTNFFGVNTIPIALNEADYLRMWIQA
ATVMSHYQAVAHESVAATPSTPPAPQIVTSAASSAASSSFPDPTKLILQL
LKDFLELLRYLAVELLPGPLGDLIAQVLDWFISFVSGPVFTFLAYLVLDP
LIYFGPFAPLTSPVLLPAGLTGLAGLGAVSGPAGPMVERVHSDGPSRQSW
PAATGVTLVGTNPAALVTTPAPAPTTSAAPTAPSTPGSSAAQGLYAVGGP
DGEGFNPIAKTTALAGVTTDAAAPAAKLPGDQAQSSASKATRLRRRLRQH
RFEFLADDGRLTMPNTPEMADVAAGNRGLDALGFAGTIPKSAPGSATGLT
HLGGGFADVLSQPMLPHTWDGSD
>Rv2352c PPE38, PPE FAMILY PROTEIN
MILDFSWLPPEINSARIYAGAGSGPLFMAAAAWEGLAADLRASASSFDAV
IAGLAAGPWSGPASVAMAGAAAPYVGWLSAAAGQAELSAGQATAAATAFE
AALAATVHPAAVTANRVLLGALVATNILGQNTPAIAATEFDYVEMWAQDV
GAMVGYHAGAAAVAETLTPFSVPPLDLAGLASQAGAQLTGMATSVSAALS
PIAEGAVEGVPAVVAAAQSVAAGLPVDAALQVGQAAAYPASMLIGPMMQL
AQMGTTANTAGLAGAEAAGLAAADVPTFAGDIASGTGLGGAGGLGAGMSA
ELGKARLVGAMSVPPTWEGSVPARMASSAMAGLGAMPAEVPAAGGPMGMM
PMPMGMGGAGAGMPAGMMGRGGANPHVVQARPSVVPRVGIG
>Rv0286 PPE4, PPE FAMILY PROTEIN
MAAPIWMASPPEVHSALLSNGPGPGSLVAAATAWSQLSAEYASTAAELSG
LLGAVPGWAWQGPSAEWYVAAHLPYVAWLTQASADAAGAAAQHEAAAAAY
TTALAAMPTLAELAANHVIHTVLVATNFFGINTIPITLNEADYVRMWLQA
AAVMGLYQAASGAALASAPRTVPAPTVMNPGGGAASTVGAVNPWQWLLAL
LQQLWNAYTGFYGWMLQLIWQFLQDPIGNSIKIIIAFLTNPIQALITYGP
LLFALGYQIFFNLVGWPTWGMILSSPFLLPAGLGLGLAAIAFLPIVLAPA
VIPPASTPLAAAAVAAGSVWPAVSMAVTGAGTAGAATPAAGAAPSAGAAP
APAAPATASFAYAVGGSGDWGPSLGPTVGGRGGIKAPAATVPAAAAAAAT
RGQSRARRRRRSELRDYGDEFLDMDSDSGFGPSTGDHGAQASERGAGTLG
FAGTATKERRVRAVGLTALAGDEFGNGPRMPMVPGTWEQGSNEPEAPDGS
GRGGGDGLPHDSK
>Rv2356c PPE40, PPE FAMILY PROTEIN
MVNFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAESFGLVT
SGLAGGSGQAWQGAAAAAMVVAAAPYAGWLAAAAARAGGAAVQAKAVAGA
FEAARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAA
DVAAMVGYHGGASAAAAALAPWQQAVPGLSGLLGGAANAPAAAAQGAAQG
LAELTLNLGVGNIGSLNLGSGNIGGTNVGSGNVGGTNLGSGNYGSLNWGS
GNTGTGNAGSGNTGDYNPGSGNFGSGNFGSGNIGSLNVGSGNFGTLNLAN
GNNGDVNFGGGNTGDFNFGGGNNGTLNFGFGNTGSGNFGFGNTGNNNIGI
GLTGDGQIGIGGLNSGTGNIGFGNSGNNNIGFFNSGDGNIGFFNSGDGNT
GFGNAGNINTGFWNAGNLNTGFGSAGNGNVGIFDGGNSNSGSFNVGFQNT
GFGNSGAGNTGFFNAGDSNTGFANAGNVNTGFFNGGDINTGGFNGGNVNT
GFGSALTQAGANSGFGNLGTGNSGWGNSDPSGTGNSGFFNTGNGNSGFSN
AGPAMLPGFNSGFANIGSFNAGIANSGNNLAGISNSGDDSSGAVNSGSQN
SGAFNAGVGLSGFFR
>Rv2430c PPE41, PPE FAMILY PROTEIN
MHFEAYPPEVNSANIYAGPGPDSMLAAARAWRSLDVEMTAVQRSFNRTLL
SLMDAWAGPVVMQLMEAAKPFVRWLTDLCVQLSEVERQIHEIVRAYEWAH
HDMVPLAQIYNNRAERQILIDNNALGQFTAQIADLDQEYDDFWDEDGEVM
RDYRLRVSDALSKLTPWKAPPPIAHSTVLVAPVSPSTASSRTDT
>Rv2608 PPE42, PPE FAMILY PROTEIN
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTT
GLAGDAWHGPASLAMTRAASPYVGWLNTAAGQAAQAAGQARLAASAFEAT
LAATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAA
MFGYHSAASAVATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGN
DNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGIGITGNGQIGF
GKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHP
GYTATFLETPSQFFPFTGLNSLTYDVSVAQGVTNLHTAIMAQLAAGNEVV
VFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTR
FGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAI
AGILFLHSGLIALPPDLASGVVQPVSSPDVLTTYILLPSQDLPLLVPLRA
IPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAAD
LQQGAVQGVNDALSGLGLPPPWQPALPRLF
>Rv2768c PPE43, PPE FAMILY PROTEIN
MDFGALPPEINSTRMYAGAGAAPLMAAGATWNGLAVELSTTASSVESVIM
QLTTEQWLGPASMSMVVAAQPYLAWLTYTAESAAHAAAQAMASAAAFEAA
FAMTVPPAEVAANRALLAALVATNVLGQNTPAIMATEAHYGEMWAQDALA
MYGYAASSAAAGRLNPLITPSQTANMAGLAGQAAAVSHAAAASTVQQVGL
GSLISNLPNAVMGFASPLTSAADAAGLGGIIQDIEELLGITFVQNAINGA
VNTTAWFVMATIPNAVFLGHAFAALNPATVTAAADAVPAAAAAAGLAHTV
TPVGVGGASLTASLGEASSVGGLSVPAGWSTAAPAMTSGTTALEGSGWAV
PEEAGPVAAMPGMAGISGAAKGAGAYAGPRYGFKPIVMPKQVVV
>Rv2770c PPE44, PPE FAMILY PROTEIN
MDFGALPPEVNSARMYGGAGAADLLAAAAAWNGIAVEVSTAASSVGSVIT
RLSTEHWMGPASLSMAAAVQPYLVWLTCTAESSALAAAQAMASAAAFETA
FALTVPPAEVVANRALLAELTATNILGQNVSAIAATEARYGEMWAQDASA
MYGYAAASAVAARLNPLTRPSHITNPAGLAHQAAAVGQAGASAFARQVGL
SHLISDVADAVLSFASPVMSAADTGLEAVRQFLNLDVPLFVESAFHGLGG
VADFATAAIGNMTLLADAMGTVGGAAPGGGAAAAVAHAVAPAGVGGTALT
ADLGNASVVGRLSVPASWSTAAPATAAGAALDGTGWAVPEEDGPIAVMPP
APGMVVAANSVGADSGPRYGVKPIVMPKHGLF
>Rv2892c PPE45, PPE FAMILY PROTEIN
MDFGVLPPEINSGRMYAGPGSGPMMAAAAAWDSLAAELGLAAGGYRLAIS
ELTGAYWAGPAAASMVAAVTPYVAWLSATAGQAEQAGMQARAAAAAYELA
FAMTVPPPVVVANRALLVALVATNFFGQNTPAIAATEAQYAEMWAQDAAA
MYAYAGSAAIATELTPFTAAPVTTSPAALAGQAAATVSSTVPPLATTAAV
PQLLQQLSSTSLIPWYSALQQWLAENLLGLTPDNRMTIVRLLGISYFDEG
LLQFEASLAQQAIPGTPGGAGDSGSSVLDSWGPTIFAGPRASPSVAGGGA
VGGVQTPQPYWYWALDRESIGGSVSAALGKGSSAGSLSVPPDWAARARWA
NPAAWRLPGDDVTALRGTAENALLRGFPMASAGQSTGGGFVHKYGFRLAV
MQRPPFAG
>Rv3018c PPE46, PPE FAMILY PROTEIN
MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAVAQELSV
VVAAVGAGVWQGPSAELFVAAYVPYVAWLVQASADSAAAAGEHEAAAAGY
VCALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQA
ATVMSAYEAVVGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPWH
EIVQFLEETFAAYDQYLSALLSELPAVAWVWFQLFVDILGFNIIGFIITL
ASNAQLLTEFAINASYVAVGLLYAIAGVIDIVVEWVIGNLFGVVPLLGGP
LLGALAAAVVPGVAGLAGVAGLAALPAVGAAAGAPAALVGSVAPVSGGVV
SPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESVGQPAGLTV
LADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV
>Rv3021c PPE47, PPE FAMILY PROTEIN
MVGAASADSAAAAGEHEAAAAGYVCALAEMPTLPELAANHLTHAVLVATN
FFGINTIPIALNEADYVRMWVQAATVMSAYEAVVGAALVATPHTGPAPVI
VKPGANEASNAVAAATITPFPFGELAKFLEMAAQAFTEVGELIMKSAEAW
AVGFVELITGLVNFEPWLVLTGMIDMFFATVGFALGVFVLVPLLEFAVVL
ELAILSIGWIISNIFGAIPVLGGPLLGALAAAVVPGVAGLAGVAGLAALP
AVGAAAGAPAALVGSVAPVSGGVVSPQARLVSAVEPAPASTSVSVLASDR
GAGALGFVGTAGKESVGQPAGLTVLADEFGDGAPVPMLPGSWGPDLVGVA
GDGGLVSV
>Rv3022c PPE48, PPE FAMILY PROTEIN
VTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAVAQELSV
VVAAVGAGVWQGPSAELFVAAYVPYVAWLVQ
>Rv3125c PPE49, PPE FAMILY PROTEIN
MVLGFSWLPPEINSARMFAGAGSGPLFAAASAWEGLAADLWASASSFESV
LAALTTGPWTGPASMSMAAAASPYVGWLSTVASQAQLAAIQARAAATAFE
AALAATVHPTAVTANRVSLASLIAANVLGQNTPAIAATEFDYLEMWAQDV
AAMVGYHAGAKSVAATLAPFSLPPVSLAGLAAQVGTQVAGMATTASAAVT
PVVEGAMASVPTVMSGMQSLVSQLPLQHASMLFLPVRILTSPITTLASMA
RESATRLGPPAGGLAAANTPNPSGAAIPAFKPLGGRELGAGMSAGLGQAQ
LVGSMSVPPTWQGSIPISMASSAMSGLGVPPNPVALTQAAGAAGGGMPMM
LMPMSISGAGAGMPGGLMDRDGAGWHVTQARLTVIPRTGVG
>Rv3135 PPE50, PPE FAMILY PROTEIN
MDYAFLPPEINSARMYSGPGPNSMLVAAASWDALAAELASAAENYGSVIA
RLTGMHWWGPASTSMLAMSAPYVEWLERTAAQTKQTATQARAAAAAFEQA
HAMTVPPALVTGIRGAIVVETASASNTAGTPP
>Rv3136 PPE51, PPE FAMILY PROTEIN
MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLS
GLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQA
YAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAA
MYGYATASAAAALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLIETV
TQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFVAGTIGAESNL
GLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGAGGAASVGNTVLA
SVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLTTLPGTDVAEHGMPGVP
GVPVAAGRASGVLPRYGVRLTVMAHPPAAG
>Rv3144c PPE52, PPE-FAMILY PROTEIN
MSFVVLPPEINSLRMFIGAGTAPMLAAAAAWDGLAEELGTAAQSFASVTA
GLAGQAWQGPAALAMAAAAAPYAGWLTAAAAQSAGAAGQARAVASIFEAA
QAATVLPAAVAANRDAFVQLVMTNLFGQNAPLIAAAEGVYEEMWAADVAA
MSGYYSGASAIAAQVVPWASLLQRFPGLGAGATGATGGESVGTGATGGES
VGTGGGESVGTGGATASGGGVGYVGSGVASAGLAAGDPAHGSVGQGNFGG
GDVGAGDVVASSATSAHAGVVSPGFIGAPLALAALGQMARGGTNSAPGTA
TESARAPEPAASAPPEAVVEVPELEVPAMGVLPTVDPKVAAKAAPLSTTR
VGQSAGSGIPESTLRTAQGQQASETSAAEETAPSLRPEAAAGQLRPRVRK
DPKIQMRGG
>Rv3159c PPE53, PPE FAMILY PROTEIN
MNYSVLPPEINSLRMFTGAGSAPMLAASVAWDRLAAELAVAASSFGSVTS
GLAGQSWQGAAAAAMAAAAAPYAGWLAAAAARAAGASAQAKAVASAFEAA
RAATVHPMLVAANRNAFVQLVLSNLFGQNAPAIAAAEAMYEQMWAADVAA
MVGYHGGASAAAAQLSSWSIGLQQALPAAPSALAAAIGLGNIGVGNLGGG
NTGDYNLGSGNSGNANVGSGNSGNANVGSGNDGATNLGSGNIGNTNLGSG
NVGNVNLGSGNRGFGNLGNGNFGSGNLGSGNTGSTNFGGGNLGSFNLGSG
NIGSSNIGFGNNGDNNLGLGNNGNNNIGFGLTGDNLVGIGALNSGIGNLG
FGNSGNNNIGFFNSGNNNVGFFNSGNNNFGFGNAGDINTGFGNAGDTNTG
FGNAGFFNMGIGNAGNEDMGVGNGGSFNVGVGNAGNQSVGFGNAGTLNVG
FANAGSINTGFANSGSINTGGFDSGDRNTGFGSSVDQSVSSSGFGNTGMN
SSGFFNTGNVSAGYGNNGDVQSGINNTNSGGFNVGFYNSGAGTVGIANSG
LQTTGIANSGTLNTGVANTGDHSSGGFNQGSDQSGFFGQP
>Rv3425 PPE57, PPE FAMILY PROTEIN
MHPMIPAEYISNIIYEGPGADSLFFASGQLRELAYSVETTAESLEDELDE
LDENWKGSSSDLLADAVERYLQWLSKHSSQLKHAAWVINGLANAYNDTRR
KVVPPEEIAANREERRRLIASNVAGVNTPAIADLDAQYDQYRARNVAVMN
AYVSWTRSALSDLPRWREPPQIYRGG
>Rv3426 PPE58, PPE FAMILY PROTEIN
MHLMIPAEYISNVIYEGPRADSLYAADQRLRQLADSVRTTAESLNTTLDE
LHENWKGSSSEWMADAALRYLDWLSKHSRQILRTARVIESLVMAYEETLL
RVVPPATIANNREEVRRLIASNVAGGKHSSNRRPRGTIRAVPGRKYPSNG
PLSKLDPICAIEAAPMAGAAADPQERVGPRGRRGLAGQQQCRGRPGPSLR
CSHDTPRFQMNQAFHTMVNMLLTCFACQEKPR
>Rv3478 PPE60, PE FAMILY PROTEIN
MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAASAFQSVV
WGLTVGSWIGSSAGLMAAAASPYVAWMSVTAGQAQLTAAQVRVAAAAYET
AYRLTVPPPVIAENRTELMTLTATNLLGQNTPAIEANQAAYSQMWGQDAE
AMYGYAATAATATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQL
MNNVPQALQQLAQPAQGVVPSSKLGGLWTAVSPHLSPLSNVSSIANNHMS
MMGTGVSMTNTLHSMLKGLAPAAAQAVETAAENGVWAMSSLGSQLGSSLG
SSGLGAGVAANLGRAASVGSLSVPPAWAAANQAVTPAARALPLTSLTSAA
QTAPGHMLGGLPLGHSVNAGSGINNALRVPARAYAIPRTPAAG
>Rv3532 PPE61, PPE FAMILY PROTEIN
MFMDFAMLPPEVNSTRMYSGPGAGSLWAAAAAWDQVSAELQSAAETYRSV
IASLTGWQWLGPSSVRMGAAVTPYVEWLTTTAAQARQTATQITAAATGFE
QAFAMTVPPPAIMANRAQVLSLIATNFFGQNTAAIAALETQYAEMWEQDA
TAMYDYAATSAAARTLTPFTSPQQDTNSAGLPAQSAEVSRATANAGAADG
NWLGNLLEEIGILLLPIAPELTPFFLEAGEIVNAIPFPSIVGDEFCLLDG
LLAWYATIGSINNINSMGTGIIGAEKNLGILPELGSAAAAAAPPPADIAP
AFLAPLTSMAKSLSDGALRGPGEVSAAMRGAGTIGQMSVPPAWKAPAVTT
VRAFDATPMTTLPGGDAPAAGVPGLPGMPASGAGRAGVVPRYGVRLTVMT
RPLSGG
>Rv3539 PPE63, PPE FAMILY PROTEIN
MADFLTLSPEVNSARMYAGGGPGSLSAAAAAWDELAAELWLAAASFESVC
SGLADRWWQGPSSRMMAAQAARHTGWLAAAATQAEGAASQAQTMALAYEA
AFAATVHPALVAANRALVAWLAGSNVFGQNTPAIAAAEAIYEQMWAQDVV
AMLNYHAVASAVGARLRPWQQLLHELPRRLGGEHSDSTNTELANPSSTTT
RITVPGASPVHAATLLPFIGRLLAARYAELNTAIGTNWFPGTTPEVVSYP
ATIGVLSGSLGAVDANQSIAIGQQMLHNEILAATASGQPVTVAGLSMGSM
VIDRELAYLAIDPNAPPSSALTFVELAGPERGLAQTYLPVGTTIPIAGYT
VGNAPESQYNTSVVYSQYDIWADPPDRPWNLLAGANALMGAAYFHDLTAY
AAPQQGIEIAAVTSSLGGTTTTYMIPSPTLPLLLPLKQIGVPDWIVGGLN
NVLKPLVDAGYSQYAPTAGPYFSHGNLVW
>Rv3621c PPE65, PPE FAMILY PROTEIN
MLDFAQLPPEVNSALMYAGPGSGPMLAAAAAWEALAAELQTTASTYDALI
TGLADGPWQGSSAASMVAAATPQVAWLRSTAGQAEQAGSQAVAAASAYEA
AFFATVPPPEIAANRALLMALLATNFLGQNTAAIAATEAQYAEMWAQDAA
AMYGYAGASAAATQLSPFNPAAQTINPAGLASQAASVGQAVSGAANAQAL
TDIPKALFGLSGIFTNEPPWLTDLGKALGLTGHTWSSDGSGLIVGGVLGD
FVQGVTGSAELDASVAMDTFGKWVSPARLMVTQFKDYFGLAHDLPKWASE
GAKAAGEAAKALPAAVPAIPSAGLSGVAGAVGQAASVGGLKVPAVWTATT
PAASPAVLAASNGLGAAAAAEGSTHAFGGMPLMGSGAGRAFNNFAAPRYG
FKPTVIAQPPAGG
>Rv3738c PPE66, PPE FAMILY PROTEIN
MTTAYASALAAMPTLTELAANHTSHAVLLGTNFFGINTIPIALNEADYAR
MWIQAATTMSIYEGTSDAALASAPQTTPAPVLFNGGAGVASALPAISAAT
LDPASIIGIIIEILIQLFLISLEILFAIVAYTIIIVLILPLVIFAYAIVF
AVLAIIFGPPLLVIASPFVLTGSVIAVPTSLSTSLSTAVPIGVGQYLADL
ASADAQAIEVGLKTADVAPVAVRPAAAPPLRESAAVRPEARLVSAVAPAP
AGTSASVLASDRGAGVLGFAGTAGKESVGRPAGLTTLAGGEFGGSPSVPM
VPASWEQLVGAGEAG
>Rv3739c PPE67, PPE FAMILY PROTEIN
MTAPIWFASPPEVHSALLSAGPGPASLQAAAAEWTSLSAEYASAAQELTA
VLAAVQGGAWEGPSAEAYVAAHLPYLA
>Rv3873 PPE68, PPE FAMILY PROTEIN
MLWHAMPPELNTARLMAGAGPAPMLAAAAGWQTLSAALDAQAVELTARLN
SLGEAWTGGGSDKALAAATPMVVWLQTASTQAKTRAMQATAQAAAYTQAM
ATTPSLPEIAANHITQAVLTATNFFGINTIPIALTEMDYFIRMWNQAALA
MEVYQAETAVNTLFEKLEPMASILDPGASQSTTNPIFGMPSPGSSTPVGQ
LPPAATQTLGQLGEMSGPMQQLTQPLQQVTSLFSQVGGTGGGNPADEEAA
QMGLLGTSPLSNHPLAGGSGPSAGAGLLRAESLPGAGGSLTRTPLMSQLI
EKPVAPSVMPAAAAGSSATGGAAPVGAGAMGQGAQSGGSTRPGLVAPAPL
AQEREEDDEDDWDEEDDW
>Rv3892c PPE69, PPE FAMILY PROTEIN
MPDPGWAARTPEANDLLLTAGTGVGTHLANQTAWTTLGASHHASGVASAI
NTAATAASWLGVGSAASALNVTMLNATLHGLAGWVDVKPAVVSTAIAAFE
TANAAMRPAPECMENRDEWGVDNAINPSVLWTLTPRIVSLDVEYFGVMWP
NNAAVGATYGGVLAALAESLAIPPPVATMGASPAAPAQAAAAVGQAAAEA
AAGDGMRSAYQGVQAGSTGAGQSTSAGENFGNQLSTFMQPMQAVMQAAPQ
ALQAPSGLMQAPMSAMQPLQSMVGMFANPGALGMGGAAPGASAASAAGGI
SAAATEVGAGGGGAALGGGGMPATSFTRPVSAFESGTSGRPVGLRPSGAL
GADVVRAPTTTVGGTPIGGMPVGHAAGGHRGSHGKSEQAATVRVVDDRR
>Rv0354c PPE7, PPE FAMILY PROTEIN
MSVCVIYIPFKGCVKHVSVTIPITTEHLGPYEIDASTINPDQPIDTAFTQ
TLDFAGSGTVGAFPFGFGWQQSPGFFNSTTTPSSGFFNSGAGGASGFLND
AAAAVSGLGNVFTETSGFFNAGGVGIRASKTSATCCRAGRT
>Rv0388c PPE9, PPE FAMILY PROTEIN
MDFGALPPEINSARIYSGPGSRPLMQAAAAWQRLANELTATAASYSSVIS
GLTGDDWLGPSALSMAAAAVPYVAWMRATAASAEQAAAQAVAAANAYESA
YAATVPPTVIAANRRTMLSLVQTNVFGQNTPAIATSETHYGEMWAHDILA
MDGYAGASGAASQLRRSPATGDHQRGRVAE
>Rv3036c TB22.2, PROBABLE CONSERVED SECRETED PROTEIN TB22.2
MRYLIATAVLVAVVLVGWPAAGAPPSCAGLGGTVQAGQICHVHASGPKYM
LDMTFPVDYPDQQALTDYITQNRDGFVNVAQGSPLRDQPYQMDATSEQHS
SGQPPQATRSVVLKFFQDLGGAHPSTWYKAFNYNLATSQPITFDTLFVPG
TTPLDSIYPIVQRELARQTGFGAAILPSTGLDPAHYQNFAITDDSLIFYF
AQGELLPSFVGACQAQVPRSAIPPLAI
>Rv1174c TB8.4, LOW MOLECULAR WEIGHT T-CELL ANTIGEN TB8.4
MRLSLTALSAGVGAVAMSLTVGAGVASADPVDAVINTTCNYGQVVAALNA
TDPGAAAQFNASPVAQSYLRNFLAAPPPQRAAMAAQLQAVPGAAQYIGLV
ESVAGSCNNY
>Rv3208A TB9.4, CONSERVED HYPOTHETICAL PROTEIN TB9.4
MEVKIGITDSPRELVFSSAQTPSEVEELVSNALRDDSGLLTLTDERGRRF
LIHTARIAYVEIGVADARRVGFGVGVDAAAGSAGKVATSG
>Rv1860 apa, ALANINE AND PROLINE RICH SECRETED PROTEIN APA (FIBRONECTIN ATTACHMENT PROTEIN) (Immunogenic protein MPT32) (Antigen MPT-32) (45-kDa glycoprotein) (45/
MHQVDPNLTRRKGRLAALAIAAMASASLVTVAVPATANADPEPAPPVPTT
AASPPSTAAAPPAPATPVAPPPPAAANTPNAQPGDPNAAPPPADPNAPPP
PVIAPNAPQPVRIDNPVGGFSFALPAGWVESDAAHFDYGSALLSKTTGDP
PFPGQPPPVANDTRIVLGRLDQKLYASAEATDSKAAARLGSDMGEFYMPY
PGTRINQETVSLDANGVSGSASYYEVKFSDPSKPNGQIWTGVIGSPAANA
PDAGPPQRWFVVWLGTANNPVDKGAAKALAESIRPLVAPPPAPAPAPAEP
APAPAPAGEVAPTPTTPTPQRTLPA
>Rv1089A celA2a, PROBABLE CELLULASE CELA2A (ENDO-1,4-BETA-GLUCANASE) (ENDOGLUCANASE) (CARBOXYMETHYL CELLULASE)
MNGAAPTNGAPLSYPSICEGVHWGHLVGGHQPAY
>Rv1090 celA2b, PROBABLE CELLULASE CELA2B (ENDO-1,4-BETA-GLUCANASE) (ENDOGLUCANASE) (CARBOXYMETHYL CELLULASE)
MGTNLPTEVGQILSAPTSIDYNYPTTGVWDASYDICLDSTPKTTGVNQQE
IMIWFNHQGSIQPVGSPVGNTTIEGKNFVVWDGSNGMNNAMAYVATEPIE
VWSFDVMSFVDHTATMEPITDSWYLTSIRAGLEPWSDGVGLGVDSFSAKV
N
>Rv2376c cfp2, LOW MOLECULAR WEIGHT ANTIGEN CFP2 (LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 2) (CFP-2)
MKMVKSIAAGLTAAAAIGAAAAGVTSIMAGGPVVYQMQPVVFGAPLPLDP
ASAPDVPTAAQLTSLLNSLADPNVSFANKGSLVEGGIGGTEARIADHKLK
KAAEHGDLPLSFSVTNIQPAAAGSATADVSVSGPKLSSPVTQNVTFVNQG
GWMLSRASAMELLQAAGN
>Rv1984c cfp21, PROBABLE CUTINASE PRECURSOR CFP21
MTPRSLVRIVGVVVATTLALVSAPAGGRAAHADPCSDIAVVFARGTHQAS
GLGDVGEAFVDSLTSQVGGRSIGVYAVNYPASDDYRASASNGSDDASAHI
QRTVASCPNTRIVLGGYSQGATVIDLSTSAMPPAVADHVAAVALFGEPSS
GFSSMLWGGGSLPTIGPLYSSKTINLCAPDDPICTGGGNIMAHVSYVQSG
MTSQAATFAANRLDHAG
>Rv3004 cfp6, LOW MOLECULAR WEIGHT PROTEIN ANTIGEN 6 (CFP-6)
MAHFAVGFLTLGLLVPVLTWPVSAPLLVIPVALSASIIRLRTLADERGVT
VRTLVGSRAVRWDDIDGLRFHRGSWARATLKDGTELRLPAVTFATLPHLT
EASSGRVPNPYR
>Rv0806c cpsY, POSSIBLE UDP-GLUCOSE-4-EPIMERASE CPSY (GALACTOWALDENASE) (UDP-GALACTOSE-4-EPIMERASE) (URIDINE DIPHOSPHATE GALACTOSE-4-EPIMERASE) (URIDINE DIPHOSPHO-GA
MPKISSRDGGRPAQRTVNPIIVTRRGKIARLESGLTPQEAQIEDLVFLRK
VLNRADIPYLLIRNHKNRPVLAINIELRAGLERALAAACATEPMYAKTID
EPGLSPVLVATDGLSQLVDPRVVRLYRRRIAPGGFRYGPAFGVELQFWVY
EETVIRCPVENSLSRKVLPRNEITPTNVKLYGYKWPTLDGMFAPHASDVV
FDIDMVFSWVDGSDPEFRARRMAQMSQYVVGEGDDAEARIRQIDELKYAL
RSVNMFAPWIRRIFIATDSTPPPWLAEHPKITIVRAEDHFSDRSALPTYN
SHAVESQLHHIPGLSEHFLYSNDDMFFGRPLKASMFFSPGGVTRFIEAKT
RIGLGANNPARSGFENAARVNRQLLFDRFGQVITRHLEHTAVPLRKSVLI
EMEREFPEEFARTAASPFRSDTDISVTNSFYHYYALMTGRAVPQEKAKVL
YVDTTSYAGLRLLPKLRKHRGYDFFCLNDGSFPEVPAAQRAERVVSFLER
YFPIPAPWEKIAADVSRRDFAVPRTSAPSEGA
>Rv1758 cut1, PROBABLE CUTINASE CUT1
MPGRFREDFIDALRSKIGEKSMGVYGVDYPATTDFPTAMAGIYDAGTHVE
QTAANCPQSKLVLGGFSQGAAVMGFVTAAAIPDGAPLDAPRPMPPEVADH
VAAVTLFGMPSVAFMHSIGAPPIVIGPLYAEKTIQLCAPGDPVCSSGGNW
AAHNGYADDGMVEQAAVFAAGRLG
>Rv2301 cut2, PROBABLE CUTINASE CUT2
MNDLLTRRLLTMGAAAAMLAAVLLLTPITVPAGYPGAVAPATAACPDAEV
VFARGRFEPPGIGTVGNAFVSALRSKVNKNVGVYAVKYPADNQIDVGAND
MSAHIQSMANSCPNTRLVPGGYSLGAAVTDVVLAVPTQMWGFTNPLPPGS
DEHIAAVALFGNGSQWVGPITNFSPAYNDRTIELCHGDDPVCHPADPNTW
EANWPQHLAGAYVSSGMVNQAADFVAGKLQ
>Rv3451 cut3, PROBABLE CUTINASE PRECURSOR CUT3
MNNRPIRLLTSGRAGLGAGALITAVVLLIALGAVWTPVAFADGCPDAEVT
FARGTGEPPGIGRVGQAFVDSLRQQTGMEIGVYPVNYAASRLQLHGGDGA
NDAISHIKSMASSCPNTKLVLGGYSQGATVIDIVAGVPLGSISFGSPLPA
AYADNVAAVAVFGNPSNRAGGSLSSLSPLFGSKAIDLCNPTDPICHVGPG
NEFSGHIDGYIPTYTTQAASFVVQRLRAGSVPHLPGSVPQLPGSVLQMPG
TAAPAPESLHGR
>Rv3452 cut4, PROBABLE CUTINASE PRECURSOR CUT4
MIPRPQPHSGRWRAGAARRLTSLVAAAFAAATLLLTPALAPPASAGCPDA
EVVFARGTGEPPGLGRVGQAFVSSLRQQTNKSIGTYGVNYPANGDFLAAA
DGANDASDHIQQMASACRATRLVLGGYSQGAAVIDIVTAAPLPGLGFTQP
LPPAADDHIAAIALFGNPSGRAGGLMSALTPQFGSKTINLCNNGDPICSD
GNRWRAHLGYVPGMTNQAARFVASRI
>Rv3724A cut5a, PROBABLE CUTINASE PRECURSOR [FIRST PART] CUT5A
MDVIRWARRLAVVAGTAAAVTTPGLLSAHVPMVSAEPCPDVEVVFARGTG
EPPGIGSVGGLFVDALRFPGWRQVTRGLRR
>Rv0824c desA1, PROBABLE ACYL-[ACYL-CARRIER PROTEIN] DESATURASE DESA1 (ACYL-[ACP] DESATURASE) (STEAROYL-ACP DESATURASE) (PROTEIN DES)
MSAKLTDLQLLHELEPVVEKYLNRHLSMHKPWNPHDYIPWSDGKNYYALG
GQDWDPDQSKLSDVAQVAMVQNLVTEDNLPSYHREIAMNMGMDGAWGQWV
NRWTAEENRHGIALRDYLVVTRSVDPVELEKLRLEVVNRGFSPGQNHQGH
YFAESLTDSVLYVSFQELATRISHRNTGKACNDPVADQLMAKISADENLH
MIFYRDVSEAAFDLVPNQAMKSLHLILSHFQMPGFQVPEFRRKAVVIAVG
GVYDPRIHLDEVVMPVLKKWRIFEREDFTGEGAKLRDELALVIKDLELAC
DKFEVSKQRQLDREARTGKKVSAHELHKTAGKLAMSRR
>Rv1094 desA2, POSSIBLE ACYL-[ACYL-CARRIER PROTEIN] DESATURASE DESA2 (ACYL-[ACP] DESATURASE) (STEAROYL-ACP DESATURASE)
MAQKPVADALTLELEPVVEANMTRHLDTEDIWFAHDYVPFDQGENFAFLG
GRDWDPSQSTLPRTITDACEILLILKDNLAGHHRELVEHFILEDWWGRWL
GRWTAEEHLHAIALREYLVVTREVDPVANEDVRVQHVMKGYRAEKYTQVE
TLVYMAFYERCGAVFCRNLAAQIEEPILAGLIDRIARDEVRHEEFFANLV
THCLDYTRDETIAAIAARAADLDVLGADIEAYRDKLQNVADAGIFGKPQL
RQLISDRITAWGLAGEPSLKQFVTG
>Rv3794 embA, INTEGRAL MEMBRANE INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE EMBA (ARABINOSYLINDOLYLACETYLINOSITOL SYNTHASE)
MPHDGNERSHRIARLAAVVSGIAGLLLCGIVPLLPVNQTTATIFWPQGST
ADGNITQITAPLVSGAPRALDISIPCSAIATLPANGGLVLSTLPAGGVDT
GKAGLFVRANQDTVVVAFRDSVAAVAARSTIAAGGCSALHIWADTGGAGA
DFMGIPGGAGTLPPEKKPQVGGIFTDLKVGAQPGLSARVDIDTRFITTPG
ALKKAVMLLGVLAVLVAMVGLAALDRLSRGRTLRDWLTRYRPRVRVGFAS
RLADAAVIATLLLWHVIGATSSDDGYLLTVARVAPKAGYVANYYRYFGTT
EAPFDWYTSVLAQLAAVSTAGVWMRLPATLAGIACWLIVSRFVLRRLGPG
PGGLASNRVAVFTAGAVFLSAWLPFNNGLRPEPLIALGVLVTWVLVERSI
ALGRLAPAAVAIIVATLTATLAPQGLIALAPLLTGARAIAQRIRRRRATD
GLLAPLAVLAAALSLITVVVFRDQTLATVAESARIKYKVGPTIAWYQDFL
RYYFLTVESNVEGSMSRRFAVLVLLFCLFGVLFVLLRRGRVAGLASGPAW
RLIGTTAVGLLLLTFTPTKWAVQFGAFAGLAGVLGAVTAFTFARIGLHSR
RNLTLYVTALLFVLAWATSGINGWFYVGNYGVPWYDIQPVIASHPVTSMF
LTLSILTGLLAAWYHFRMDYAGHTEVKDNRRNRILASTPLLVVAVIMVAG
EVGSMAKAAVFRYPLYTTAKANLTALSTGLSSCAMADDVLAEPDPNAGML
QPVPGQAFGPDGPLGGISPVGFKPEGVGEDLKSDPVVSKPGLVNSDASPN
KPNAAITDSAGTAGGKGPVGINGSHAALPFGLDPARTPVMGSYGENNLAA
TATSAWYQLPPRSPDRPLVVVSAAGAIWSYKEDGDFIYGQSLKLQWGVTG
PDGRIQPLGQVFPIDIGPQPAWRNLRFPLAWAPPEADVARIVAYDPNLSP
EQWFAFTPPRVPVLESLQRLIGSATPVLMDIATAANFPCQRPFSEHLGIA
ELPQYRILPDHKQTAASSNLWQSSSTGGPFLFTQALLRTSTIATYLRGDW
YRDWGSVEQYHRLVPADQAPDAVVEEGVITVPGWGRPGPIRALP
>Rv3795 embB, INTEGRAL MEMBRANE INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE EMBB (ARABINOSYLINDOLYLACETYLINOSITOL SYNTHASE)
MTQCASRRKSTPNRAILGAFASARGTRWVATIAGLIGFVLSVATPLLPVV
QTTAMLDWPQRGQLGSVTAPLISLTPVDFTATVPCDVVRAMPPAGGVVLG
TAPKQGKDANLQALFVVVSAQRVDVTDRNVVILSVPREQVTSPQCQRIEV
TSTHAGTFANFVGLKDPSGAPLRSGFPDPNLRPQIVGVFTDLTGPAPPGL
AVSATIDTRFSTRPTTLKLLAIIGAIVATVVALIALWRLDQLDGRGSIAQ
LLLRPFRPASSPGGMRRLIPASWRTFTLTDAVVIFGFLLWHVIGANSSDD
GYILGMARVADHAGYMSNYFRWFGSPEDPFGWYYNLLALMTHVSDASLWM
RLPDLAAGLVCWLLLSREVLPRLGPAVEASKPAYWAAAMVLLTAWMPFNN
GLRPEGIIALGSLVTYVLIERSMRYSRLTPAALAVVTAAFTLGVQPTGLI
AVAALVAGGRPMLRILVRRHRLVGTLPLVSPMLAAGTVILTVVFADQTLS
TVLEATRVRAKIGPSQAWYTENLRYYYLILPTVDGSLSRRFGFLITALCL
FTAVFIMLRRKRIPSVARGPAWRLMGVIFGTMFFLMFTPTKWVHHFGLFA
AVGAAMAALTTVLVSPSVLRWSRNRMAFLAALFFLLALCWATTNGWWYVS
SYGVPFNSAMPKIDGITVSTIFFALFAIAAGYAAWLHFAPRGAGEGRLIR
ALTTAPVPIVAGFMAAVFVASMVAGIVRQYPTYSNGWSNVRAFVGGCGLA
DDVLVEPDTNAGFMKPLDGDSGSWGPLGPLGGVNPVGFTPNGVPEHTVAE
AIVMKPNQPGTDYDWDAPTKLTSPGINGSTVPLPYGLDPARVPLAGTYTT
GAQQQSTLVSAWYLLPKPDDGHPLVVVTAAGKIAGNSVLHGYTPGQTVVL
EYAMPGPGALVPAGRMVPDDLYGEQPKAWRNLRFARAKMPADAVAVRVVA
EDLSLTPEDWIAVTPPRVPDLRSLQEYVGSTQPVLLDWAVGLAFPCQQPM
LHANGIAEIPKFRITPDYSAKKLDTDTWEDGTNGGLLGITDLLLRAHVMA
TYLSRDWARDWGSLRKFDTLVDAPPAQLELGTATRSGLWSPGKIRIGP
>Rv3874 esxB, 10 KDA CULTURE FILTRATE ANTIGEN ESXB (LHP) (CFP10)
MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTA
AQAAVVRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF
>Rv3891c esxD, POSSIBLE ESAT-6 LIKE PROTEIN ESXD
MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTG
VVASHMTATEITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQAL
FGASHGS
>Rv3904c esxE, PUTATIVE ESAT-6 LIKE PROTEIN ESXE (HYPOTHETICAL ALANINE RICH PROTEIN) (ESAT-6 LIKE PROTEIN 12)
MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQ
RHWAAGEAMMRQALAQLTAAGQSAHANYTGAMATNLGMWS
>Rv0287 esxG, ESAT-6 LIKE PROTEIN ESXG (CONSERVED HYPOTHETICAL PROTEIN TB9.8)
MSLLDAHIPQLVASQSAFAAKAGLMRHTIGQAEQAAMSAQAFHQGESSAA
FQAAHARFVAAAAKVNTLLDVAQANLGEAAGTYVAADAAAASTYTGF
>Rv1037c esxI, PUTATIVE ESAT-6 LIKE PROTEIN ESXI (ESAT-6 LIKE PROTEIN 1)
MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGGAGSAAC
QGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>Rv1038c esxJ, ESAT-6 LIKE PROTEIN ESXJ (ESAT-6 LIKE PROTEIN 2)
MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>Rv1197 esxK, ESAT-6 LIKE PROTEIN ESXK (ESAT-6 LIKE PROTEIN 3)
MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMAQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>Rv1198 esxL, PUTATIVE ESAT-6 LIKE PROTEIN ESXL (ESAT-6 LIKE PROTEIN 4)
MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIIRDVLTASDFWGGAGSAAC
QGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>Rv1793 esxN, PUTATIVE ESAT-6 LIKE PROTEIN ESXN (ESAT-6 LIKE PROTEIN 5)
MTINYQFGDVDAHGAMIRAQAASLEAEHQAIVRDVLAAGDFWGGAGSVAC
QEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>Rv2346c esxO, PUTATIVE ESAT-6 LIKE PROTEIN ESXO (ESAT-6 LIKE PROTEIN 6)
MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIVRDVLAAGDFWGGAGSVAC
QEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>Rv2347c esxP, PUTATIVE ESAT-6 LIKE PROTEIN ESXP (ESAT-6 LIKE PROTEIN 7)
MATRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMAQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>Rv3017c esxQ, ESAT-6 LIKE PROTEIN ESXQ (TB12.9) (ESAT-6 LIKE PROTEIN 8)
MSQSMYSYPAMTANVGDMAGYTGTTQSLGADIASERTAPSRACQGDLGMS
HQDWQAQWNQAMEALARAYRRCRRALRQIGVLERPVGDSSDCGTIRVGSF
RGRWLDPRHAGPATAADAGD
>Rv3020c esxS, ESAT-6 LIKE PROTEIN ESXS
MSLLDAHIPQLIASHTAFAAKAGLMRHTIGQAEQQAMSAQAFHQGESAAA
FQGAHARFVAAAAKVNTLLDIAQANLGEAAGTYVAADAAAASSYTGF
>Rv3619c esxV, PUTATIVE ESAT-6 LIKE PROTEIN ESXV (ESAT-6 LIKE PROTEIN 1)
MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGGAGSAAC
QGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>Rv3620c esxW, PUTATIVE ESAT-6 LIKE PROTEIN ESXW (ESAT-6 LIKE PROTEIN 10)
MTSRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>Rv0649 fabD2, POSSIBLE MALONYL COA-ACYL CARRIER PROTEIN TRANSACYLASE FABD2 (MCT)
MSGRSRLPGSSSRRDAARIVAERVVATVAGVAVAVDEVDAAEARLRDGPR
AAALPASGTSEGRQLRRWLTQLIVTERVVAAEAAARGLTAAGAPAEADLL
PDATARLEIGSVAAAVLADPLARALFAAVTARVAVTDDAVADYHARNPLR
FAAPCPGQHGWRAPAAAAPPLDQVRRAITEHLLGAARRRAFRVWLDARRN
ALVVLAPGYEHPGDPRQPDNTRRH
>Rv3852 hns, POSSIBLE HISTONE-LIKE PROTEIN HNS
MPDPQDRPDSEPSDASTPPAKKLPAKKAAKKAPARKTPAKKAPAKKTPAK
GAKSAPPKPAEAPVSLQQRIETNGQLAAAAKDAAAQAKSTVEGANDALAR
NASVPAPSHSPVPLIVAVTLSLLALLLIRQLRRR
>Rv0341 iniB, ISONIAZID INDUCTIBLE GENE PROTEIN INIB
MTSLIDYILSLFRSEDAARSFVAAPGRAMTSAGLIDIAPHQISSVAANVV
PGLNLGAGDPMSGLRQAVAARHGFAQDVANVGFAGDAGAGVASVITTDVG
AGLASGLGAGFLGQGGLALAASSGGFGGQVGLAAQVGLGFTAVIEAEVGA
QVGAGLGIGTGLGAQAGMGFGGGVGLGLGGQAGGVIGGSAAGAIGAGVGG
RLGGNGQIGVAGQGAVGAGVGAGVGGQAGIASQIGVSAGGGLGGVGNVSG
LTGVSSNAVLASNASGQAGLIASEGAALNGAAMPHLSGPLAGVGVGGQAG
AAGGAGLGFGAVGHPTPQPAALGAAGVVAKTEAAAGVVGGVGGATAAGVG
GAHGDILGHEGAALGSVDTVNAGVTPVEHGLVLPSGPLIHGGTGGYGGMN
PPVTDAPAPQVPARAQPMTTAAEHTPAVTQPQHTPVEPPVHDKPPSHSVF
DVGHEPPVTHTPPAPIELPSYGLFGLPGF
>Rv1028A kdpF, Probable membrane protein kdpF
MTTVDNIVGLVIAVALMAFLFAALLFPEKF
>Rv2543 lppA, PROBABLE CONSERVED LIPOPROTEIN LPPA
MIAPQPISRTLPRWQRIVALTMIGISTALIGGCTMDHNPDTSRRLTGEQK
IQLIDSMRNKGSYEAARERLTATARIIADRVSAAIPGQTWKFDDDPNIQQ
SDRNGALCDKLTADIARRPIANSVMFGATFSAEDFKIAANIVREEAAKYG
ATTESSLFNESAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDL
PAGQLPPEPPIWPTTSTPH
>Rv2544 lppB, PROBABLE CONSERVED LIPOPROTEIN LPPB
MIAPQPIPRTLPRWQRIVALTMIGISTALIGGCTMGQNPDKSPHLTGEQK
IQLIDSMRHKGSYEAARERLTATAQIIADRVSAAIPGQTWKFNDDSYGQD
FYRNGSLCKELSADIARRPMAKPVDFGSTFSAEDFKIAANIVREEAAKYG
VTTESSLFNESAKRDYDVQGNGYEFNLGQIKFATLNITGDCFLLQKVLDL
PAGQLPPEPPIWPTTSTPTP
>Rv1881c lppE, POSSIBLE CONSERVED LIPOPROTEIN LPPE
MCNRLVTVTGVAMVVAAGLSACGQAQTVPRKAARLTIDGVTHTTRPATCS
QEHSYRTIDIRNHDSTVQAVVLLSGDRVIPQWVKIRNVDGFNGSFWHGGV
GNARADRARNTYTVAGSAYGISSKKPNTVVSTDFNILAEC
>Rv1946c lppG, POSSIBLE LIPOPROTEIN
MIRGSAVSGLLMPSVNGGTAGSVACVQCLFLPKVAVDLINLSGIQCFARI
EHVAHAQAHPFVVLVGKPAQHGARIGAVAGAILTGDVIVSHDGELYRAVT
ALRQNGPRPHASRRLHAPALCSARSRRGHLRPSCWLPPPRFAGRQSLVAR
>Rv2080 lppJ, Possible lipoprotein lppJ
MPHSTADRRLRLTRQALLAAAVVPLLAGCALVMHKPHSAGSSNPWDDSAH
PLTDDQAMAQVVEPAKQIVAAADLQAVRAGFSFTSCNDQGDPPYQGTVRM
AFLLQGDHDAYFQHVRAAMLSHGWIDGPPPGQYFHGITLHKNGVTANMSL
ALDHSYGEMILDGECRNTTDHHHDDETTNITNQLVQP
>Rv2116 lppK, Probable conserved lipoprotein lppK
MRRNIRVTLGAATIVAALGLSGCSHPEFKRSSPPAPSLPPVTSSPLEAAP
ITPLPAPEALIDVLSRLADPAVPGTNKVQLIEGATPENAAALDRFTTALR
DGSYLPMTFAANDIAWSDNKPSDVMATVVVTTAHPDNREFTFPMEFVSFK
GGWQLSRQTAEMLLAMGNSPDSTPSATSPAPAPSPTPPG
>Rv2138 lppL, Probable conserved lipoprotein LppL
MLTGNKPAVQRRFIGLLMLSVLVAGCSSNPLANFAPGYPPTIEPAQPAVS
PPTSQDPAGAVRPLSGHPRAALFDNGTRQLVALRPGADSAAPASIMVFDD
VHVAPRVIFLPGPAAALTSDDHGTAFLAARGGYFVADLSSGHTARVNVAD
AAHTDFTAIARRSDGKLVLGSADGAVYTLAKNPAVDPASGAATVASRTKI
FARVDALVTQGNTTVVLDRGQTSVTTIGADGHAQQALRAGQGATTMAADP
LGRVLIADTRGGQLLVYGVDPLILRQAYPVRQAPYGLAGSRELAWVSQTA
SNTVIGYDLTTGIPVEKVRYPTVQQPNSLAFDETSDTLYVVSGSGAGVQV
IEHAAGTR
>Rv2270 lppN, PROBABLE LIPOPROTEIN LPPN
MRLPGRHVLYALSAVTMLAACSSNGARGGIASTNMNPTNPPATAETATVS
PTPAPQSARTETWINLQVGDCLADLPPADLSRITVTIVDCATAHSAEVYL
RAPVAVDAAVVSMANRDCAAGFAPYTGQSVDTSPYSVAYLIDSHQDRTGA
DPTPSTVICLLQPANGQLLTGSARR
>Rv2330c lppP, PROBABLE LIPOPROTEIN LPPP
MRRQRSAVPILALLALLALLALIVGLGASGCAWKPPTTRPSPPNTCKDSD
GPTADTVRQAIAAVPIVVPGSKWVEITRGHTRNCRLHWVQIIPTIASQST
PQQLLFFDRNIPLGSPTRNPKPYITVLPAGDDTVTVQYQWQIGSDQECCP
TGIGTVRFHIGSDGKLEALGSIPHQ
>Rv2341 lppQ, PROBABLE CONSERVED LIPOPROTEIN LPPQ
MPVGGRQHVFEKLASILGLVAAPLMLLGLSACGRSAGKTSEPTCPTEPID
AADSSTTPDPSCVVRATEINGNGSRIQTWTGSYDAAATQSGGVCGGTCNF
HATVRFTVDEGQISGSVDQVYQAAMVAIATRPTSPSLAP
>Rv2403c lppR, PROBABLE CONSERVED LIPOPROTEIN LPPR
MTNRWRWVVPLFAVFLAAGCTTTTTGKAGLAPNAVPRPLMGSLIQRVPLD
GAALSTLLNQPFQALPPFPPVFGGSDSLGDSDVSARPADCVGVGYLTQRN
VYRSVEVKSVARVSWRHDGSSVKVDDVDEGVVALPSAAAADDLFARFSAQ
WKECDGTTLTVPASAFGQRSITDVRVADSVVAATVSLRRGTHSILASVPQ
ARAVGVRGNCVVEVAVTFFGITHPSDQGSADISTSAVDIAHAMMDRISEL
S
>Rv1799 lppT, PROBABLE LIPOPROTEIN LPPT
MSVKSKNGRLAARVLVALAALFAMIALTGSACLAEGPPLGRNPQGAPAPV
GGTVIVAPMHSGV
>Rv2784c lppU, PROBABLE LIPOPROTEIN LPPU
MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAEC
GSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIG
GCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVG
YVYTQRRFAVCVEDVTGGPRS
>Rv2796c lppV, PROBABLE CONSERVED LIPOPROTEIN LPPV
MRWPTAWLLALVCVMATGCGPSGHGTRAGEEGPLSPEKVAELENPLRAKP
PLEDAKDQYRAAVTQLANAITALVPGLTWRTDMDTWTGCGGEYEWTRAKA
AYFMIVFSGPIPDDKWLQAVQIVKDGVEQFGATGFGVMKNKPADHDVYFA
GHGGVEFKCSTQKAAVLTAQSDCRISRTDTPKPSPTP
>Rv2905 lppW, PROBABLE CONSERVED ALANINE RICH LIPOPROTEIN LPPW
MRARPLTLLTALAAVTLVVVAGCEARVEAEAYSAADRISSRPQARPQPQP
VELLLRAITPPRAPAASPNVGFGELPTRVRQATDEAAAMGATLSVAVLDR
ATGQLVSNGNTQIIATASVAKLFIADDLLLAEAEGKVTLSPEDHHALDVM
LQSSDDGAAERFWSQDGGNAVVTQVARRYGLRSTAPPSDGRWWNTISSAP
DLIRYYDMLLDGSGGLPLDRAAVIIADLAQSTPTGIDGYPQRFGIPDGLY
AEPVAVKQGWMCCIGSSWMHLSTGVIGPERRYIMVIESLQPADDATARAT
ITQAVRTMFPNGRI
>Rv2999 lppY, PROBABLE CONSERVED LIPOPROTEIN LPPY
MAGAKHAGRIVAITTAAAVILAACSSGSKGGAGSGHAGKARSAVTTTDAD
WKPVADALGRSGKLGDNNTAYRINLPRNDLHITSYGVDIKPGLSLGGYAA
FARYDNNETLLMGDLVITEEELPKVTDALQAHGIAQTALHKHLLQQDPPV
WWTHIHGMGDAARLAQGLKAALDATTIGPPTPPPARQPPVDIDVAGVDQA
LGRKGTQDGGLMKYSIPRKDTIIEDGHVLPAVSLNLTTVINFQPVGRGRA
AINGDFILIAPEVQEVIRAMRAGNITIVELHNHGLTEEPRLFYMHYWAVD
DAVTLARALRPAMDATNLQSS
>Rv3016 lpqA, PROBABLE LIPOPROTEIN LPQA
MVGLTRPLLLCGATLLIAACTRVVGGTASATFGGDRQGMLDVATILLDQS
RMQAITGSGDDLTIIPTMDTTYPVDVDDFAQPIPRECRFIYAETAVFGSE
IEAFHKTTFQDRPDGSLISEAAAAYRDAGTARRAFDTLAVTVHDCAASPA
GWLFVSRWTAGGNSLHIRAGDCGRDYRVLSAALLEVTFCGFPESVSDIVM
TNIAANVPG
>Rv3244c lpqB, PROBABLE CONSERVED LIPOPROTEIN LPQB
MRLTILLFLGAVLAGCASVPSTSAPQAIGTVERPVPSNLPKPSPGMDPDV
LLREFLKATADPANRHLAARQFLTESASNAWDDAGSALLIDHVVFVETRS
AEKVSVTMRADILGSLSDVGVFETAEGQLPDPGPIELVKTSDGWRIDRLP
NGVFLDWQQFQETYKRNTLYFADPTGKTVVPDPRYVAVSDRDQLATELVS
KLLAGPRPEMARTVRNLLAPPLRLRGPVTRADGGKSGIGRGYGGARVDME
KLSTTDPHSRQLLAAQIIWTLARADIRGPYVINADGAPLEDRFAEGWTTS
DVAATDPGVADGAAAGLHALVNGSLVAMDAQRVTPVPGAFGRMPEQTAAA
VSRSGRQVASVVTLGRGAPDEAASLWVGDLGGEAVQSADGHSLSRPSWSL
DDAVWVVVDTNVVLRAIQDPASGQPARIPVDSTAVASRFPGAINDLQLSR
DGTRAAMVIGGQVILAGVEQTQAGQFALTYPRRLGFGLGSSVVSLSWRTG
DDIVVTRTDAAHPVSYVNLDGVNSDAPSRGLQTPLTAIAANPSTVYVAGP
QGVLMYSASVESRPGWADVPGLMVPGAAPVLPG
>Rv3584 lpqE, POSSIBLE CONSERVED LIPOPROTEIN LPQE
MNRCNIRLRLAGMTTWVASIALLAAALSGCGAGQISQTANQKPAVNGNRL
TINNVLLRDIRIQAVQTSDFIQPGKAVDLVLVAVNQSPDVSDRLVGITSD
IGSVTVAGDARLPASGMLFVGTPDGQIVAPGPLPSNQAAKATVNLTKPIA
NGLTYNFTFKFEKAGQGSVMVPISAGLATPHE
>Rv3763 lpqH, 19 KDA LIPOPROTEIN ANTIGEN PRECURSOR LPQH
MKRGLTVAVAGAAILVAGLSGCSSNKSTTGSGETTTAAGTTASPGAASGP
KVVIDGKDQNVTGSVVCTTAAGNVNIAIGGAATGIAAVLTDGNPPEVKSV
GLGNVNGVTLGYTSGTGQGNASATKDGSHYKITGTATGVDMANPMSPVNK
SFEIEVTCS
>Rv0344c lpqJ, PROBABLE LIPOPROTEIN LPQJ
MRLSLIARGMAALLAATALVAGCNTTIDGRPVASPGSGPTEPTFPTPRPT
TAPPGTTAPTLPTTPVSPTAPAGAIPLPPDSNGYVFIETKSGMTRCQINR
DSVGCEAPFTNSPLRDGEHANGIHITAGGSVQWVLGNLGAIPTVSIDYRT
YEAQGWTIDATTDGTRFTNNRTGHGMFVSIEKVDTF
>Rv0583c lpqN, PROBABLE CONSERVED LIPOPROTEIN LPQN
MKHFTAAVATVALSLALAGCSFNIKTDSAPTTSPTTTSPTTSTTTTSATT
SAQAAGPNYTIADYIRDNHIQETPVHHGDPGSPTIDLPVPDDWRLLPESS
RAPYGGIVYTQPADPNDPPTIVAILSKLTGDIDPAKVLQFAPGELKNLPG
FQGSGDGSAATLGGFSAWQLGGSYSKNGKLRTVAQKTVVIPSQGAVFVLQ
LNADALDDETMTLMDAANVIDEQTTITP
>Rv0604 lpqO, PROBABLE CONSERVED LIPOPROTEIN LPQO
MIRRRGARMAALLAAAALALTACAGSDDKGEPDDGGDRGASLATTSDADW
KPVADILGRTGKLNDGSVYKIGFARSDLSVQTKGVTVAPALSLGSWVAFA
RTPDGQTMLMGDLVVTEDELASVTDAVQAGGLQQTALHKHLLEQSPPIWW
THIAGHGDAADLARAVRSALDATDTPPPASATSGQTSLDLDTAAIDEALG
RSGTIAGGVYKFFIARRDPVTMSGMLIPPSMGLATALNFQPTGNGRAAIN
GDFVMTAAEVQDVVQALRGGGIDIVAIHNHGFDEQPRLFYMHFWAENDAV
ALARTLRAAVDATAAR
>Rv0835 lpqQ, POSSIBLE LIPOPROTEIN LPQQ
MCCSTAAKSAVIVCCAAIATTACSFQATSTQPSTAPPTSRVDSLIVSIED
VRRIANYEELAAHFQTDLREPPEADTNVPGPCRVVGSSDRTFGTDWSEFR
SAGYHGVTDDLRPGGPVMVETVSQAIALYPDPSTARGVFHRLESSLAECA
GLHDPYFDFILDRPDASTVRIGAAGWSHVYRLKSSVFISVGVLGIEPAEP
IANVILQTISDRIQ
>Rv0847 lpqS, PROBABLE LIPOPROTEIN LPQS
MVWMRSAIVAVALGVTVAAVAAACWLPQLHRHVAHPNHPLTTSVGSEFVI
NTDHGHLVDNSMPPCPERLATAVLPRSATPVLLPDVVAAAPGMTAALTDP
VAPAARGPPAAQGSVRTGQDLLTRFCLARR
>Rv1016c lpqT, PROBABLE CONSERVED LIPOPROTEIN LPQT
MAGRRCPQDSVRPLAVAVAVATLAMSAVACGPKSPDFQSILSTSPTTSAV
STTTEVPVPLWKYLESVGVTGEPVAPSSLTDLTVSIPTPPGWAPMKNPNI
TPNTEMIAKGESYPTAMLMVFKLHRDFDIAEALKHGTADARLSTNFTELD
SSTADFNGFPSSMIQGSYDLHGRRLHTWNRIVFPTGAPPAKQRYLVQLTI
TSLANEAVKHASDIEAIIAGFVVAAK
>Rv1064c lpqV, POSSIBLE LIPOPROTEIN LPQV
MRPSRYAPLLCAMVLALAWLSAVAGCSRGGSSKAGRSSSVAGTLPAGVVG
VSPAGVTTRVDAPAESTEEEYYQACHAARLWMDAQPGSGESLIEPYLAVV
QASPSGVAGSWHIRWAALTPARQAAVIVAARAAANAECG
>Rv1228 lpqX, PROBABLE LIPOPROTEIN LPQX
MSRQWHWLAATLLLITTAACSRPGTEEPDCPTKITLPPGATPTTTLDPRC
IVRATTTGTADGDAASRWTGTVRIAGFYASICNAVWDGNVSLAGKDELTG
KATLILVETSCPGKVVAGELVLKGNVGSDSLAITWAHPELPQRAFDLGAG
QGTIRRSGDRAEGTFNSDMGGGTEFFLTWSLTMRN
>Rv1274 lprB, POSSIBLE LIPOPROTEIN LPRB
MRRKVRRLTLAVSALVALFPAVAGCSDSGDNKPGATIPSTPANAEGRHGP
FFPQCGGVSDQTVTELTRVTGLVNTAKNSVGCQWLAGGGILGPHFSFSWY
RGSPIGRERKTEELSRASVEDINIDGHSGFIAIGNEPSLGDSLCEVGIQF
SDDFIEWSVSFSQKPFPLPCDIAKELTRQSIANSK
>Rv1275 lprC, POSSIBLE LIPOPROTEIN LPRC
MRRVLVGAAALITALLVLTGCTKSISGTAVKAGGAGVPRNNNSQERYPNL
LKECEVLTTDILAKTVGADPLDIQSTFVGAICRWQAANPAGLIDITRFWF
EQGSLSNERKVAEGLKYQVETRAIQGVDSIVMRTGDPNGACGVASDAAGV
VGWWVNPQAPGIDACGQAIKLMELTLATNA
>Rv1252c lprE, PROBABLE LIPOPROTEIN LPRE
MPGVWSPPCPTTPRVGVVAALVAATLTGCGSGDSTVAKTPEATPSLSTAH
PAPPSSEPSPPSATAAPPSNHSAAPVDPCAVNLASPTIAKVVSELPRDPR
SEQPWNPEPLAGNYNECAQLSAVVIKANTNAGNPTTRAVMFHLGKYIPQG
VPDTYGFTGIDTSQCTGDTVALTYASGIGLNNVVKFRWNGGGVELIGNTT
GG
>Rv1368 lprF, PROBABLE CONSERVED LIPOPROTEIN LPRF
MNGLISQACGSHRPRRPSSLGAVAILIAATLFATVVAGCGKKPTTASSPS
PGSPSPEAQQILQDSSKATKGLHSVHVVVTVNNLSTLPFESVDADVTNQP
QGNGQAVGNAKVRMKPNTPVVATEFLVTNKTMYTKRGGDYVSVGPAEKIY
DPGIILDKDRGLGAVVGQVQNPTIQGRDAIDGLATVKVSGTIDAAVIDPI
VPQLGKGGGRLPITLWIVDTNASTPAPAANLVRMVIDKDQGNVDITLSNW
GAPVTIPNPAG
>Rv1411c lprG, PROBABLE CONSERVED LIPOPROTEIN LPRG
MRTPRRHCRRIAVLAAVSIAATVVAGCSSGSKPSGGPLPDAKPLVEEATA
QTKALKSAHMVLTVNGKIPGLSLKTLSGDLTTNPTAATGNVKLTLGGSDI
DADFVVFDGILYATLTPNQWSDFGPAADIYDPAQVLNPDTGLANVLANFA
DAKAEGRDTINGQNTIRISGKVSAQAVNQIAPPFNATQPVPATVWIQETG
DHQLAQAQLDRGSGNSVQMTLSKWGEKVQVTKPPVS
>Rv1418 lprH, PROBABLE LIPOPROTEIN LPRH
MACLGRPGCRGWAGASLVLVVVLALAACTESVAGRAMRATDRSSGLPTSA
KPARARDLLLQDGDRAPFGQVTQSRVGDSYFTSAVPPECSAALLFKGSPL
RPDGSSDHAEAAYNVTGPLPYAESVDVYTNVLNVHDVVWNGFRDVSHCRG
DAVGVSRAGRSTPMRLRYFATLSDGVLVWTMSNPRWTCDYGLAVVPHAVL
VLSACGFKPGFPMAEWASKRRAQLDSQV
>Rv1690 lprJ, PROBABLE LIPOPROTEIN LPRJ
MTAHTHDGTRTWRTGRQATTLLALLAGVFGGAASCAAPIQADMMGNAFLT
ALTNAGIAYDQPATTVALGRSVCPMVVAPGGTFESITSRMAEINGMSRDM
ASTFTIVAIGTYCPAVIAPLMPNRLQA
>Rv0179c lprO, POSSIBLE LIPOPROTEIN LPRO
MWIRAERVAVLTPTASLRRLTACYAALAVCAALACTTGQPAARAADGREM
LAQAIATTRGSYLVYNFGGGHPMPLLNAGGHWYEMNNGGHLMIIKNASQR
LSPHLLVDTHTGDQARCEHNPGARTGEGLWQASEIYPPLKAWQRMGRPTI
AVNANFFDVRGQKGGSWRSTGCSSPLGAYVDNTRGQGRANQAVTGTVAYA
GKQGLSGGNELWSSLTTMILPVGGAPYVLRPKSRQDYDLATPVIEDLLNK
NARFVAVAGIGLLSPGNTGQLHDGGPSAARTALAYAKQKDEMYIFQGGNY
TPDNIQDLFRGLGSDTAILLDGGGSSAIVLRRDTGGMWAGAGSPKGSCDT
RQVLCDSHERALPSWLAFN
>Rv3597c lsr2, PROBABLE IRON-REGULATED LSR2 PROTEIN PRECURSOR
MAKKVTVTLVDDFDGSGAADETVEFGLDGVTYEIDLSTKNATKLRGDLKQ
WVAAGRRVGGRRRGRSGSGRGRGAIDREQSAAIREWARRNGHNVSTRGRI
PADVIDAYHAAT
>Rv1388 mihF, PUTATIVE INTEGRATION HOST FACTOR MIHF
MLGNTIHVPCQPCRHGHGAPSRGLRGRPADRWPVARATPTLHVCPQNQGV
GLDFVRKPEYGRLRWPAYPAGTNNDRLISMRDGGIVALPQLTDEQRAAAL
EKAAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALP
KVGKVKAQEIMTELEIAPTRRLRGLGDRQRKALLEKFGSA
>Rv0403c mmpS1, PROBABLE CONSERVED MEMBRANE PROTEIN MMPS1
MFGVAKRFWIPMVIVIVVAVAAVTVSRLHSVFGSHQHAPDTGNLDPIIAF
YPKHVLYEVFGPPGTVASINYLDADAQPHEVVNAAVPWSFTIVTTLTAVV
ANVVARGDGASLGCRITVNEVIREERIVNAYHAHTSCLVKSA
>Rv0506 mmpS2, PROBABLE CONSERVED MEMBRANE PROTEIN MMPS2
MRMISVSGAVKRMWLLLAIVVVAVVGGLGIYRLHSIFGVHEQPTVMVKPD
FDVPLFNPKRVTYEVFGPAKTAKIAYLDPDARVHRLDSVSLPWSVTVETT
LPAVSVNLMAQSNADVISCRIIVNGAVKDERSETSPRALTSCQVSSG
>Rv2198c mmpS3, PROBABLE CONSERVED MEMBRANE PROTEIN MMPS3
MSGPNPPGREPDEPESEPVSDTGDERASGNHLPPVAGGGDKLPSDQTGET
DAYSRAYSAPESEHVTGGPYVPADLRLYDYDDYEESSDLDDELAAPRWPW
VVGVAAIIAAVALVVSVSLLVTRPHTSKLATGDTTSSAPPVQDEITTTKP
APPPPPPAPPPTTEIPTATETQTVTVTPPPPPPPATTTAPPPATTTTAAA
PPPTTTTPTGPRQVTYSVTGTKAPGDIISVTYVDAAGRRRTQHNVYIPWS
MTVTPISQSDVGSVEASSLFRVSKLNCSITTSDGTVLSSNSNDGPQTSC
>Rv0451c mmpS4, PROBABLE CONSERVED MEMBRANE PROTEIN MMPS4
MLMRTWIPLVILVVVIVGGFTVHRIRGFFGSENRPSYSDTNLENSKPFNP
KHLTYEIFGPPGTVADISYFDVNSEPQRVDGAVLPWSLHITTNDAAVMGN
IVAQGNSDSIGCRITVDGKVRAERVSNEVNAYTYCLVKSA
>Rv1926c mpt63, IMMUNOGENIC PROTEIN MPT63 (ANTIGEN MPT63/MPB63) (16 kDa IMMUNOPROTECTIVE EXTRACELLULAR PROTEIN)
MKLTTMIKTAVAVVAMAAIATFAAPVALAAYPITGKLGSELTMTDTVGQV
VLGWKVSDLKSSTAVIPGYPVAGQVWEATATVNAIRGSVTPAVSQFNART
ADGINYRVLWQAAGPDTISGATIPQGEQSTGKIYFDVTGPSPTIVAMNNG
MEDLLIWEP
>Rv1980c mpt64, IMMUNOGENIC PROTEIN MPT64 (ANTIGEN MPT64/MPB64)
MRIKIFMLVTAVVLLCCSGVATAAPKTYCEELKGTDTGQACQIQMSDPAY
NINISLPSYYPDQKSLENYIAQTRDKFLSAATSSTPREAPYELNITSATY
QSAIPPRGTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITYDTLWQA
DTDPLPVVFPIVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFF
FNPGELLPEAAGPTQVLVPRSAIDSMLA
>Rv0040c mtc28, SECRETED PROLINE RICH PROTEIN MTC28 (PROLINE RICH 28 KDA ANTIGEN)
MIQIARTWRVFAGGMATGFIGVVLVTAGKASADPLLPPPPIPAPVSAPAT
VPPVQNLTALPGGSSNRFSPAPAPAPIASPIPVGAPGSTAVPPLPPPVTP
AISGTLRDHLREKGVKLEAQRPHGFKALDITLPMPPRWTQVPDPNVPDAF
VVIADRLGNSVYTSNAQLVVYRLIGDFDPAEAITHGYIDSQKLLAWQTTN
ASMANFDGFPSSIIEGTYRENDMTLNTSRRHVIATSGADKYLVSLSVTTA
LSQAVTDGPATDAIVNGFQVVAHAAPAQAPAPAPGSAPVGLPGQAPGYPP
AGTLTPVPPR
>Rv1528c papA4, PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA4
MTQLPQPTWRWWQQRETEQVQSSHIDGEIVGALIPDLAVLHSEDASRAAV
GREKHRCSLDPLGGGFRSRRASMPAGALLLSAVIAIQLDRMNARVFGDGW
IGAQACMWVNKFHEESTVTALSPSSPIAQGSIARHPETMQSAYVRIAEGG
SRDVAPAAQLQRRRP
>Rv3810 pirG, EXPORTED REPETITIVE PROTEIN PRECURSOR PIRG (CELL SURFACE PROTEIN) (EXP53)
MPNRRRRKLSTAMSAVAALAVASPCAYFLVYESTETTERPEHHEFKQAAV
LTDLPGELMSALSQGLSQFGINIPPVPSLTGSGDASTGLTGPGLTSPGLT
SPGLTSPGLTDPALTSPGLTPTLPGSLAAPGTTLAPTPGVGANPALTNPA
LTSPTGATPGLTSPTGLDPALGGANEIPITTPVGLDPGADGTYPILGDPT
LGTIPSSPATTSTGGGGLVNDVMQVANELGASQAIDLLKGVLMPSIMQAV
QNGGAAAPAASPPVPPIPAAAAVPPTDPITVPVA
>Rv0867c rpfA, POSSIBLE RESUSCITATION-PROMOTING FACTOR RPFA
MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEWDQVARC
ESGGNWSINTGNGYLGGLQFTQSTWAAHGGGEFAPSAQLASREQQIAVGE
RVLATQGRGAWPVCGRGLSNATPREVLPASAAMDAPLDAAAVNGEPAPLA
PPPADPAPPVELAANDLPAPLGEPLPAAPADPAPPADLAPPAPADVAPPV
ELAVNDLPAPLGEPLPAAPADPAPPADLAPPAPADLAPPAPADLAPPAPA
DLAPPVELAVNDLPAPLGEPLPAAPAELAPPADLAPASADLAPPAPADLA
PPAPAELAPPAPADLAPPAAVNEQTAPGDQPATAPGGPVGLATDLELPEP
DPQPADAPPPGDVTEAPAETPQVSNIAYTKKLWQAIRAQDVCGNDALDSL
AQPYVIG
>Rv1884c rpfC, PROBABLE RESUSCITATION-PROMOTING FACTOR RPFC
MHPLPADHGRSRCNRHPISPLSLIGNASATSGDMSSMTRIAKPLIKSAMA
AGLVTASMSLSTAVAHAGPSPNWDAVAQCESGGNWAANTGNGKYGGLQFK
PATWAAFGGVGNPAAASREQQIAVANRVLAEQGLDAWPTCGAASGLPIAL
WSKPAQGIKQIINEIIWAGIQASIPR
>Rv2389c rpfD, PROBABLE RESUSCITATION-PROMOTING FACTOR RPFD
MTPGLLTTAGAGRPRDRCARIVCTVFIETAVVATMFVALLGLSTISSKAD
DIDWDAIAQCESGGNWAANTGNGLYGGLQISQATWDSNGGVGSPAAASPQ
QQIEVADNIMKTQGPGAWPKCSSCSQGDAPLGSLTHILTFLAAETGGCSG
SRDD
>Rv2450c rpfE, PROBABLE RESUSCITATION-PROMOTING FACTOR RPFE
MKNARTTLIAAAIAGTLVTTSPAGIANADDAGLDPNAAAGPDAVGFDPNL
PPAPDAAPVDTPPAPEDAGFDPNLPPPLAPDFLSPPAEEAPPVPVAYSVN
WDAIAQCESGGNWSINTGNGYYGGLRFTAGTWRANGGSGSAANASREEQI
RVAENVLRSQGIRAWPVCGRRG
>Rv0979A rpmF, PROBABLE 50S RIBOSOMAL PROTEIN L32 RPMF
MAVPKRRKSRSNTRSRRSQWKAAKTELVGVTVAGHAHKVPRRLLKAARLG
LIDFDKR
>Rv3118 sseC1, CONSERVED HYPOTHETICAL PROTEIN SSEC1
MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLDSSDEFT
AEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT
>Rv0814c sseC2, CONSERVED HYPOTHETICAL PROTEIN SSEC2
MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLDSSDEFT
AEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT
>Rv3288c usfY, PUTATIVE PROTEIN USFY
MGQIPPQPVRRVLPLMVVPGNGQKWRNRTETEEAMGDTYRDPVDHLRTTR
PLAGESLIDVVHWPGYLLIVAGVVGGVGALAAFGTGHHAEGMTFGVVAIV
VTVVGLAWLAFEHRRIRKIADRWYTEHPEVRRQRLAG
>Rv1759c wag22, PE-PGRS FAMILY PROTEIN
MSFVIAVPETIAAAATDLADLGSTIAGANAAAAANTTSLLAAGADEISAA
IAALFGAHGRAYQAASAEAAAFHGRFVQALTTGGGAYAAAEAAAVTPLLN
SINAPVLAATGRPLIGNGANGAPGTGANGGDAGWLIGNGGAGGSGAKGAN
GGAGGPGGAAGLFGNGGAGGAGGTATANNGIGGAGGAGGSAMLFGAGGAG
GAGGAATSLVGGIGGTGGTGGNAGMLAGAAGAGGAGGFSFSTAGGAGGAG
GAGGLFTTGGVGGAGGQGHTGGAGGAGGAGGLFGAGGMGGAGGFGDHGTL
GTGGAGGDGGGGGLFGAGGDGGAGGSGLTTGGAAGNGGNAGTLSLGAAGG
AGGTGGAGGTVFGGGKGGAGGAGGNAGMLFGSGGGGGTGGFGFAAGGQGG
VGGSAGMLSGSGGSGGAGGSGGPAGTAAGGAGGAGGAPGLIGNGGNGGNG
GESGGTGGVGGAGGNAVLIGNGGEGGIGALAGKSGFGGFGGLLLGADGYN
APESTSPWHNLQQDILSFINEPTEALTGRPLIGNGDSGTPGTGDDGGAGG
WLFGNGGNGGAGAAGTNGSAGGAGGAGGILFGTGGAGGAGGVGTAGAGGA
GGAGGSAFLIGSGGTGGVGGAATTTGGVGGAGGNAGLLIGAAGLGGCGGG
AFTAGVTTGGAGGTGGAAGLFANGGAGGAGGTGSTAGGAGGAGGAGGLYA
HGGTGGPGGNGGSTGAGGTGGAGGPGGLYGAGGSGGAGGHGGMAGGGGGV
GGNAGSLTLNASGGAGGSGGSSLSGKAGAGGAGGSAGLFYGSGGAGGNGG
YSLNGTGGDGGTGGAGQITGLRSGFGGAGGAGGASDTGAGGNGGAGGKAG
LYGNGGDGGAGGDGATSGKGGAGGNAVVIGNGGNGGNAGKAGGTAGAGGA
GGLVLGRDGQHGLT
>Rv3219 whiB1, PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB1
MDWRHKAVCRDEDPELFFPVGNSGPALAQIADAKLVCNRCPVTTECLSWA
LNTGQDSGVWGGMSEDERRALKRRNARTKARTGV
>Rv3260c whiB2, PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB2
MVPEAPAPFEEPLPPEATDQWQDRALCAQTDPEAFFPEKGGSTREAKKIC
MGCEVRHECLEYALAHDERFGIWGGLSERERRRLKRGII
>Rv3416 whiB3, TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB3
MPQPEQLPGPNADIWNWQLQGLCRGMDSSMFFHPDGERGRARTQREQRAK
EMCRRCPVIEACRSHALEVGEPYGVWGGLSESERDLLLKGTMGRTRGIRR
TA
>Rv3681c whiB4, PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB4
MSGTRPAARRTNLTAAQNVVRSVDAEERIAWVSKALCRTTDPDELFVRGA
AQRKAAVICRHCPVMQECAADALDNKVEFGVWGGMTERQRRALLKQHPEV
VSWSDYLEKRKRRTGTAG
>Rv0022c whiB5, PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB5
MAHPCATDPELWFGYPDDDGSDGAAKARAYERSATQARIQCLRRCPLLQQ
RRCAQHAVEHRVEYGVWAGIKLPGGQYRKREQLAAAHDVLRRIAGGEINS
RQLPDNAALLARNEGLEVTPVPGVVVHLPIAQVGPQPAA
>Rv3197A whiB7, PROBABLE TRANSCRIPTIONAL REGULATORY PROTEIN WHIB-LIKE WHIB7
MSVLTVPRQTPRQRLPVLPCHVGDPDLWFADTPAGLEVAKTLCVSCPIRR
QCLAAALQRAEPWGVWGGEIFDQGSIVSHKRPRGRPRKDAVA