GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: chromosome

Number of genes found: 218

# Mycobacterium tuberculosis H37Rv, H37Rv

>Rv3394c CONSERVED HYPOTHETICAL PROTEIN
MMASARVLAIWCMDWPAVAAAAAAGLSATAPVAVTLANRVIACSATARAA
GVRRGLRRREAAARCPQLFIATADADRDARLFEGVIAAVDDLVPRAELLR
PGLLVLPVRGPARFFGSEQMAAERLIDAVAAAGAECQVGIADRLSTAVFA
ARAGRIVEPGGDARFLSLLSIRQLATEPSLSGPGRDDLTDLLWRMGIRTI
GQFAALSRTDVASRFGADAVAAHRFARGEPERAPCGREPPPDLAAELACD
PPIDRVDAAAFAGRSLAAELHRALMAAGVGCTRLAIHAVTANGEERSRVW
RCAEPLTEDATADRVRWQLDGWLNNRNARDRPTAAVTLLRLQAVETVSAS
EGLQLPLWGGLGEQDRLRARRALVRVQGLLGPEAVRVPVLSGGHGPAERI
TLTVLGLVAPEPVPQADPGQPWPGRLPDPSPAVLFDDPVDLLDAQGNPIR
VTSRGMFSADPARLRVRGRDDRLRWWAGPWPDDERWWDPDRASGRTARAQ
VLLDGDPGTALLLCYRQRRWYLEGSYE
>Rv0071 POSSIBLE MATURASE
MSSITVSVDPVDPVDPVDPVDPVDAVVAAGSDGLTVARIESEIGALEFLN
ELRTELKSGQFRPQPVRERKIPKPGGLGKVRRLGIPTVADRVVQAALKLV
LEPIFETDFEPVSYGFRPARRAHDTIAEIHLFGTQEYRWVLDADIKACFD
RIDHADLMDRVRHRIKDKRVLRLVNWQRIRHRWNWTDVRRWLTDPTGRWH
PISADGITLFNPAAVPIRRYRYRGNTIPTPWTQAV
>Rv2666 PROBABLE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1081 (FRAGMENT)
MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANHGRHNA
>Rv3908 CONSERVED HYPOTHETICAL PROTEIN
MSDGEQAKSRRRRGRRRGRRAAATAENHMDAQPAGDATPTPATAKRSRSR
SPRRGSTRMRTVHETSAGGLVIDGIDGPRDAQVAALIGRVDRRGRLLWSL
PKGHIELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKT
VHHYLMRFLGGELSDEDLEVAEVAWVPIRELPSRLAYADERRLAEVADEL
IDKLQSDGPAALPPLPPSSPRRRPQTHSRARHADDSAPGQHNGPGPGP
>Rv2105 PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv0605 POSSIBLE RESOLVASE
MACCRNRGMNLAAWAERNGVARVTAYRWFHAGLLPVPARKVGRLILVDEL
ASEAGAQPKTAVYARVSSADQKSDLDRQVARVTSWATAEQIPVDKVVTEV
GSVLNGHRRKFPAVLRDLSVTRIVVEHRDRFCRFGSEYVHAALAAQGREL
VVVDSAEVDDDLVWDMTEILTSMCARLYGKRAAQNRAKRAVAAAAVDDHE
AA
>Rv1764 PUTATIVE TRANSPOSASE
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv1047 PROBABLE TRANSPOSASE
MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA
LTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIWSNNPQERLNREVRRRT
DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE
EPAKQQTTNTPALTT
>Rv2086 CONSERVED HYPOTHETICAL PROTEIN
MRPATPLICAFGDKHKHTYGVTPICRALAVHGVQIASRTYFADRAAAPSK
RALWDTTITEILAGYYEPDAEGKRPPECLYGSLKMWAHLQRQGFRWPSAT
VKTIMRANGWRGVPLAAHITHHRTRPGRGPGPRPGGSAMAGFSNEPAGSG
RLHLRADDVEFRLHRVRGRRLRRCDRGLGMLADQRRSVRRTRITPRPSRL
T
>Rv1321 CONSERVED HYPOTHETICAL PROTEIN
MSRVRLVIAQCTVDYIGRLTAHLPSARRLLLFKADGSVSVHADDRAYKPL
NWMSPPCWLTEESGGQAPVWVVENKAGEQLRITIEGIEHDSSHELGVDPG
LVKDGVEAHLQALLAEHIQLLGEGYTLVRREYMTAIGPVDLLCSDERGGS
VAVEIKRRGEIDGVEQLTRYLELLNRDSVLAPVKGVFAAQQIKPQARILA
TDRGIRCLTLDYDTMRGMDSGEYRLF
>Rv2478c CONSERVED HYPOTHETICAL PROTEIN
MVGHIVNDLQRRKVGDQEVVKFRVASNSRRRTSDGGWEPGNSLFITVNCW
GRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGIRRSSLEMRATSVGPDL
SRVIVRIEKPAYTGPSAGDLPAATGTGAAGAADAPASAADSVSDVVVDDA
ITGHNPLPISA
>Rv3475 POSSIBLE TRANSPOSASE FOR INSERTION ELEMENT IS6110 [SECOND PART]
AEALAAGQRRIAKGERDFKDRVGFLRGRARPASTLITRFIADHQGHREGP
DGLRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISR
VHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRT
TIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAY
ARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQY
TSIRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRS
IEDVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv1369c PROBABLE TRANSPOSASE
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv2898c CONSERVED HYPOTHETICAL PROTEIN
MTTLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELDVIACDA
ATRTVVFVEVKTRTGDGYGGLAHAVTERKVRRLRRLAGLWLADQEERWAA
VRIDVIGVRVGPKNSGRTPELTHLQGIG
>Rv2087 CONSERVED HYPOTHETICAL PROTEIN
MLAGLRPSIGIVGDALDNALCETTTGPHRTECSHGSPFRSGPIRTLADLE
DIASAWVEHTCHTQQGVRIPGRLQPA
>Rv3428c POSSIBLE TRANSPOSASE
MATIAQRLRDDHGVAASESSVRRWIATHFAEEVARERVTVPRGPVDAGSE
AQIDYGRLGMWFDPATARRVAVWAFVMVLAFSRHLFVRPVIRMDQTAWCA
CHVAAFEFFDGVPARLVCDNLRTGVDKPDLYDPQINRSYAELASHYATLV
DPARARKPKDKPRVERPMTYVRDSFWKGREFDSLAQMQQAAVTWSTEVAG
LRYLRALEGAQPLRMFEAVEQQALIALPPRAFELTSWSIGTVGVDTHLKV
GKALYSVPWRLIGQRLHARTAGDVVQIFAGNDVVATHVRRPSGRSTDFSH
YPPEKIAFHMRTPTWCRHTAELVGPASQQVIAEFMRDNAIHHLRSAQGVL
GLRDKHGCDRLEAACARAIEVGDPSYRTIKGILVAGTEHAANEPTTSSPA
STAGGVPARP
>Rv2944 POSSIBLE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1533
MSQCPGWPIAPAPRTGATKNTWPPACSGKCQPGSPMVVRAASAPPASRLG
SRWKSSTLSMLVASNATPSHIWAPWISSPPAITSCFWAPPGTGKTHLAVG
LAIRACQAGHRVLFATAAEWVARLAEAHHAGRIYAELTRLCRYPLLVVDE
VGYIPFEPEAANLFFQLVSSRYERASLIVTSNKAFGRWGEVFGGDDVVAA
AMIDRLVHHAEVVALKGDSYRLKDRDLGRVPPAGTTEE
>Rv3187 PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv3751 PROBABLE INTEGRASE (FRAGMENT)
MKRAKVQQITPHDLRHTAASLAVSAGVNVLALQRILGHKSAKVTLDTYAD
LFDADLDAVAVTLGKDADQQT
>Rv0921 POSSIBLE RESOLVASE
MNLADWAESVGVNRHTAYRWFREGTLPVPAERVGRLILVKTAASASAAAA
GVVLYARVSSHDRRSDLDRQVARLTAWATERDLGVGQVVCEVGSGLNGKR
PKLRRILSDPDARVIVVEHRDRLARFGVEHLEAALSAQGRRIVVADPGET
TDDLVCDMIEVLTGMCARLYGRRGARNRAMRAVTEAKREPGAG
>Rv2311 CONSERVED HYPOTHETICAL PROTEIN
MAPTGQAVDVAVREGAGDVGYSVERENLPADDPVRNGNRWRVIAVDTEHH
RIAARRLGDGARAAFSGDYLHEHITHGYAITVHASQGTTAHSTHAVLGDN
TSRATLYVAMTPARESNTAYLCERTAGEGARVDLAGWDLWVSGKAEAMSD
EKSASPVWCRVGARCDHRGKRSCW
>Rv2812 PROBABLE TRANSPOSASE
MAVGDDEEKVRAERARAIGLFRYQLIWEAADAAHSTKQRGKMVRELASRE
HTDPFGRRVRISRQTIDRWIRGWRAGGFDALVPNPRQCTPRTPAEVLELA
VALRRENPQRTAAAIRRILRTQLGWAPDERTLQRNFHRLGLTGATTGSAP
AVFGRFEAEHPNALWTGDVLHGIRIDLRKTYLFAFLDDHSRLVPGYRWGH
AEDTVRLAAALRPALASRGVPNAVYVDNGSPYVDAWLLRACAKLGVRLVH
STPGRPQGRGKIERFFRTVREQFLVEITGEPDVVGRHYVADLAELNRLFT
AWVETVYHRSVHSETGQTPLARWSAGGPIPLPAPETLTEAFLWEEHRRVT
KTATVSLHGNRYEIDPALVGRKVELVFDPFDLTRIEVRLAGAPMRRAIPY
HIGRHSHPKAKPETPTAPPKPSGIDYAQLIETAHAAELARGVNYTALTGA
ADQIPGQLDLLTGQEAQPK
>Rv2090 Probable 5'-3' exonuclease
MPAPDPMRGDPPHPAPPRLRSPLDPTSGDPLHPAPPRLRSPLDPTSGDPL
HPAPPRLRSPLDPTSGDPLHPAPPRLRSPLVLLDGASMWFRSFFGVPSSI
TAPDGRPVNAVRGFIDSMAVVITQQRPNRLAVCLDLDWRPQFRVDLIPSY
KAHRVAEPEPNGQPDVEEVPDELTPQVDMIMELLDAFGIAMAGAPGFEAD
DVLGTLATRERRDPVIVVSGDRDLLQVVADDPVPVRVLYLGRGLAKATLF
GPAEVAERYGLPAHRAGAAYAELALLRGDPSDGLPGVPGVGEKTAATLLA
RHGSLDQIMAAADDRKTTMAKGLRTKLLAASAYIKAADRVVRVATDAPVT
LSTPTDRFPLVAADPERTAELATRFGVESSIARLQKALDTLPG
>Rv3386 POSSIBLE TRANSPOSASE
MFRTVGDQASLWESVLPEELRRLPEELARVDALLDDSAFFCPFVPFFDPR
MGRPSIPMETYLRLMFLKFRYRLGYESLCREVTDSITWRRFCRIPLEGSV
PHPTTLMKLTTRCGEDAVAGLNEALLAKAASEKLLRTNKVRADTTVVEGD
VGYPTDTGLLAKAVGSMARTVARIKAADAGSAPLGGSSGPRDRLQAAVTR
RAATRSGAGLRAPDHRGASRDRRAGADRGCRGGT
>Rv2609c PROBABLE CONSERVED MEMBRANE PROTEIN
MTWLVLAGAVLLVVLVAFGAWGYQTANRLNRLNVRYDLSWQSLDSALARR
AVVARAVAIDAYGGAPQGSRLAALADAAEGAPRHARENAENELSAALAMV
NPASLPAALIAELADAEARVLLARRFHNDAVRDTLALGERRLVRLLRLGG
TAVLPTYFEIVERPHALVHGDQGASGRRTSARVVLLDDSGAVLLLCGSDP
ANPAFRDGAAPKWWFTVGGQVRPGERLAQAAARELAEETGLRVAPADMIG
PIWRRDEVFEFNGSLIDSEEFYLVHRTRRFEPAVQGRTELERRYIRDARW
CDANDIAQLVAAGERVYPLQLGELLPAANRLVDVALDNGAARDAGVPQPI
R
>Rv2279 PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv2885c PROBABLE TRANSPOSASE
MMARLKVPEGWCVQAFRFTLNPTQTQAASLARHFGARRKAFNWTVTALKA
DIKAWRADGTESAKPSLRVLRKRWNTVKDQVCVNAQTGQVWWPECSKEAY
ADGIAGAVDAYWNWQSCRAGKRAGKTVGVPRFKKKGRDADRVCFTTGAMR
VEPDRRHLTLPVIGTIRTYENTRRVERLIAKGRARVLAITVRRNGTRLDA
SVRVLVQRPQQRRVALPDSRVGVDVGVRRLATVADAEGTVLEQVPNPRPL
DAALRGLRRVSRARSRCTKGSRRYCERTTELSRLHRRVNDVRTHHLHVLT
TRLAKTHGRIVVEGLDAAGMLRQKGLPGARARRRALSDAALATPRRHLSY
KTGWYGSSLVVADRWFPSSKTCHACRHVQDIGWDEKWQCDGCSITHQRDD
NAAINLARYEEPPSVVGPVGAAVKRGADRKTGPGPAGGREARKATGHPAG
EQPRDGVQVK
>Rv2815c PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2013 POSSIBLE TRANSPOSASE
MDTLLEAGITVVVISPNQLKNLRGRYGSAGNKDDRFDAFVLADTLRTDRS
RLRPLLPDTPATATLRRTCRPRKDLVAHRVALANQLRAHLRVVFPGVVGL
FADLDSPISLAFLTFLPRFDCQDRADWLSVKRLAGWLAAAGYCGRAPRPA
HRCPARRHR
>Rv2424c PROBABLE TRANSPOSASE
MQCRAREERPGRKTDLLDAEWLVHLLECGLLRGWLIPPADIKAARDVIRY
RRKLVEHRTSKLQRLGNVLQDAGIKADSVASSVTPKSVRAMVEALIDGER
RPAVLADLARGSMRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMI
GALDEQIEQLMHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSA
EHLASWVRLCPGNHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYL
REYYRRQVRKFGGFRSPAANKKAITTVAHKLIVIIWHVLATGRPHQDLGA
DYFTTRMDPDKERRRLVAKLEAQGLGVTLEPAA
>Rv3325 PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv1149 POSSIBLE TRANSPOSASE
MTRVGVISDEFWAVVEPLMPSHEGKPGRRFSDHRLILEGIAWRFRTGSPW
RDLPAEFGPWQTVWKRHHRWSLDGTCDEVFAHVAAVFGVDAEVAEDIEKL
LSVDSTNVRAHQHSAGACSDTLATGGTVGLQEIRR
>Rv3798 PROBABLE TRANSPOSASE
MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSAVLRRCG
RCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVP
WARHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADT
EKRIDRFANLRRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATL
GLFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFHVVAWA
TEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGAR
YALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGK
QALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNT
KIRLLTRIAFGFRSPQALIALAMLTLAGHRPTLPGRHNHPQISQ
>Rv2415c CONSERVED HYPOTHETICAL PROTEIN
MRTELPAERLQRRLGAVPDIDSHAASAHLDPEPHDPTDDGPDHDEPRDDP
NSLLPRWLPDTSRGQGWADRIRADPGRAGAVALAVIAALAVLVTVFTLIR
DRTEPVMSAKLPPVEPVSPTNPRSSASPGSPDRSGLPVVVSVVGLVHTPG
LVTLAPGARIADALQAAGGAVDGADTVGLNMARQLGDGEQIVVGLAPPSG
QPRVLGSSVGAGTPGPAGTSGTATTGPKTAPKTAEVLDLNTATVEQLDAL
PGIGPVTAAAIVAWRQRNGRFTSVDQLADVDGIGPARLDKRRNLVRV
>Rv2961 PROBABLE TRANSPOSASE
MEHGNPHDAPQLAPAVERITTRAGRPPGTVTADRGYGEKRVEDDLHDLGV
RTVAIPRKGRPSQARRAEEQRPSFRRTVKWRTGSEGRISTLKRNYGWNRS
CIDGTEGTRIWTRHGILTHNLIKISSLAA
>Rv3828c POSSIBLE RESOLVASE
MSVVCCRNRWMNLAVWAERNGVAWVIAYRWFRAGLLPVPAQRVGRLILVN
DPAVEESGRGRTLVYARVSSADQRSDLDRRVARVTAWATSQHLSVDKVVA
EGGWALNGHRRKFFALLGDPVVTRIVVEHRDRFCWFGSEYVEAALVAQGR
ELVVVDLAEVDDDLVGDMTEILTSMCARLYGERAAQNGAKRALAAAVGDA
EAA
>Rv3474 POSSIBLE TRANSPOSASE FOR INSERTION ELEMENT IS6110 [FIRST PART]
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2820c HYPOTHETICAL PROTEIN
MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLG
ELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPA
AQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFR
FELDAGLWLLATGSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAP
AALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYAD
MPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES
AA
>Rv2649 PROBABLE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS6110
KDRVGFLRGRARPASTLITRFIADHQGHREGPDGLRWGVESICTQLTELG
VPIAPSTYYDHINREPSRRELRDGELKEHISRVHAANYGVYGARKVWLTL
NREGIEVARCTVERLMTKLGLSGTTRGKARRTTIADPATARPADLVQRRF
GPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYARRILGWRVASTMATSMV
LDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSV
GAVGSSYDNALAETINGLYKTELIKPGKPWRSIEDVELATARWVDWFNHR
RLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv3637 POSSIBLE TRANSPOSASE
MPGRVFASPADFNTQLQAWLVRANHRQHRVLGCRPADRIEADTAAMLTLP
PVGPSIGWRTSTRLPRDHYVRLDGNDYSVHPVAIGRRIEITADLSRVRVW
CGGTLVADHDRIWAKHQTISDPEHVVAAKLLRRKRFDIVGPPHHVEVEQR
LLTTYDTVLGLDGPVA
>Rv3733c CONSERVED HYPOTHETICAL PROTEIN
MPKLSAGVLLYRARAGVVDVLLAHPGGPFWAGKDDGAWSIPKGEYTGGED
PWLAARREFSEEIGLCVPDGPRIDFGSLKQSGGKVVTVFGVRADLDITDA
RSSTFELDWPKGSGKMRKFPEVDRVSWFPVARARTKLLKGQRGFLDRLMA
HPAVAGLSEGPESLPR
>Rv2464c POSSIBLE DNA GLYCOSYLASE
MPEGHTLHRLARLHQRRFAGAPVSVSSPQGRFADSASALNGRVLRRASAW
GKHLFHHYVGGPVVHVHLGLYGTFTEWARPTDGWLPEPAGQVRMRMVGAE
FGTDLRGPTVCESIDDGEVADVVARLGPDPLRSDANPSSAWSRITKSRRP
IGALLMDQTVIAGVGNVYRNELLFRHRIDPQRPGRGIGEPEFDAAWNDLV
SLMKVGLRRGKIIVVRPEHDHGLPSYLPDRPRTYVYRRAGEPCRVCGGVI
RTALLEGRNVFWCPVCQT
>Rv1701 PROBABLE INTEGRASE/RECOMBINASE
MKTLALQLQGYLDHLTIERGVAANTLSSYRRDLRRYSKHLEERGITDLAK
VGEHDVSEFLVALRRGDPDSGTAALSAVSAARALIAVRGLHRFAAAEGLA
ELDVARAVRPPTPSRRLPKSLTIDEVLSLLEGAGGDKPSDGPLTLRNRAV
LELLYSTGARISEAVGLDLDDIDTHARSVLLRGKGGKQRLVPVGRPAVHA
LDAYLVRGRPDLARRGRGTAAIFLNARGGRLSRQSAWQVLQDAAERAGIT
AGVSPHMLRHSFATHLLEGGADVRVVQELLGHASVTTTQIYTLVTVHALR
EVWAGAHPRAR
>Rv2821c CONSERVED HYPOTHETICAL PROTEIN
MTTSYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLSRLPMIP
GTSLKGKVRTLLSRQYGADTETFYRKPNEDHAHIRRLFGDTEEYMTGRLV
FRDTKLTNKDDLEARGAKTLTEVKFENAINRVTAKANLRQMERVIPGSEF
AFSLVYEVSFGTPGEEQKASLPSSDEIIEDFNAIARGLKLLELDYLGGSG
TRGYGQVKFSNLKARAAVGALDGSLLEKLNHELAAV
>Rv2896c CONSERVED HYPOTHETICAL PROTEIN
MIDPTARAWAYLSRVAEPPCAQLAALVRCVGPVEAADRVRRGQVGNELAQ
HTGARREIDRAADDLELLMRRGGRLITPDDDEWPVLAFAAFSGAGARARP
CGHSPLVLWALGPARLDEVAPRAAAVVGTRAATAYGEHVAADLAAGLAER
DVSVVSGGAYGIDGAAHRAALDSEGITVAVLAGGFDIPYPAGHSALLHRI
AQHGVLFTEYPPGVRPARHRFLTRNRLVAAVARAAVVVEAGLRSGAANTA
AWARALGRVVAAVPGPVTSSASAGCHTLLRHGAELVTRADDIVEFVGHIG
ELAGDEPRPGAALDVLSEAERQVYEALPGRGAATIDEIAVGSGLLPAQVL
GPLAILEVAGLAECRDGRWRILRAGAGQAAAKGAAARLV
>Rv0944 POSSIBLE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE (FAPY-DNA GLYCOSYLASE)
MAGTPQPRALGPDALDVSTDDLAGLLAGNTGRIKTVITDQKVIAGIGNAY
SDEILHVAKISPFATAGKLSGAQLTCLHEAMASVLSDAVRRSVGQGAAML
KGEKRSGLRVHARTGLPCPVCGDTVREVSFADKSFQYCPTCQTGGKALAD
RRMSRLLK
>Rv0795 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS6110 (FRAGMENT)
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2354 PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv0850 PUTATIVE TRANSPOSASE (FRAGMENT)
MTRDPHSPDCGREGSYRDTITRPLTDLPVAGYPLVPRVASPRYRCTTPQC
GRAVFNQDLANVDQYLVVNQLAHQLIDGSSLIPDADKRWDARRHADMTHH
LTSSLKENQS
>Rv0920c PROBABLE TRANSPOSASE
MDAAQVIEPAHAGQDVDEAAVAARELSGAERALVGDLVRQARAEGVALTG
PDGLLKALTKTVLEAALQEEMTEHLGYDRHAAAGRGSGNSRNGSRNKKVI
TDACGQVEIAVPRDRNGTFEPVIVGKRKRRVTDVDRVVLSLYAKGLTTGE
IAAHFADVYGVSVSKDTISRITDRVIEEMQAWWSRPLEKVYAAVFIDAIM
VKIRDGQVRNRPVYAAIGVDLDGHKDILGMWAGEGDGESAKFWLAVLTDL
RNRGVKDIFFLVCDGLKGLPDSVSAAFPLATVQTCIIHLIRNTFRYASRK
YWDKISVDLKPIYTAASAAEARLRYEEFAEKWGKPYPAITRLWDSAWEEF
IPFLDYDVEIRRVPCSTNAIESLNARYRRAVRARGHFPNEQSALKTLYLV
TRSLDPKGTGQTKWAVRWKPALNALAITFADRMPAAEER
>Rv3638 POSSIBLE TRANSPOSASE
MAAKTATNSRDVAAELAYLTRALKAPTLRGAIEQLADRARTKTWSYEEFL
AACLQREVSARESHGGEGRIRAARFPSRKSLEEFDFDHARGLKRDTIAHL
GTLDFVTLAIGIAIRACQAGHRVLFATASQWVDRLAAAHHSGTLQSELIR
LARYPLLVVDEVGYIPFEPEAANLFFQLVSSRYERASLIVTSNKPFGRWG
EVFGDDVVAAAMIDRLVHHAEVIALKGDSYRIKDRDLGRVPTVTADDQ
>Rv1461 CONSERVED HYPOTHETICAL PROTEIN
MTLTPEASKSVAQPPTQAPLTQEEAIASLGRYGYGWADSDVAGANAQRGL
SEAVVRDISAKKNEPDWMLQSRLKALRIFDRKPIPKWGSNLDGIDFDNIK
YFVRSTEKQAASWDDLPEDIRNTYDRLGIPEAEKQRLVAGVAAQYESEVV
YHQIREDLEAQGVIFLDTDTGLREHPDIFKEYFGTVIPAGDNKFSALNTA
VWSGGSFIYVPPGVHVDIPLQAYFRINTENMGQFERTLIIADEGSYVHYV
EGCLPAGELITTADGDLRPIESIRVGDFVTGHDGRPHRVTAVQVRDLDGE
LFTFTPMSPANAFSVTAEHPLLAIPRDEVRVMRKERNGWKAEVNSTKLRS
AEPRWIAAKDVAEGDFLIYPKPKPIPHRTVLPLEFARLAGYYLAEGHACL
TNGCESLIFSFHSDEFEYVEDVRQACKSLYEKSGSVLIEEHKHSARVTVY
TKAGYAAMRDNVGIGSSNKKLSDLLMRQDETFLRELVDAYVNGDGNVTRR
NGAVWKRVHTTSRLWAFQLQSILARLGHYATVELRRPGGPGVIMGRNVVR
KDIYQVQWTEGGRGPKQARDCGDYFAVPIKKRAVREAHEPVYNLDVENPD
SYLAYGFAVHNCTAPIYKSDSLHSAVVEIIVKPHARVRYTTIQNWSNNVY
NLVTKRARAEAGATMEWIDGNIGSKVTMKYPAVWMTGEHAKGEVLSVAFA
GEDQHQDTGAKMLHLAPNTSSNIVSKSVARGGGRTSYRGLVQVNKGAHGS
RSSVKCDALLVDTVSRSDTYPYVDIREDDVTMGHEATVSKVSENQLFYLM
SRGLTEDEAMAMVVRGFVEPIAKELPMEYALELNRLIELQMEGAVG
>Rv2943 PROBABLE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1533
MLTVEDWAEIRRLHRAEGLPIKMIARVLGISKNTVKSALESNQQPKYERA
PQGSIVDAVEPRIRELLQAYPTMPATVIAERIGWERSIRVLSARVAELRP
VYLPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVC
AYSRWLLAMLLPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRG
GRSELTTECQAFRGTLAAKVLICRPADPEAKGLIERAHDYLERSFLPGRV
FASPADFNAQLGAWLALVNTRTRRALGCAPTDRIGADRAAMLSLPPVAPA
TGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVLVRADLERVHVFCDGEL
VADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQVQVRSLSDY
DDALGVDIDGGVA
>Rv1277 CONSERVED HYPOTHETICAL PROTEIN
MSPRPGPAGRGPAPCRCADLHSLCVDSHALRRDGMRFLHTADWQLGMTRH
FLAGDAQPRYSAARRDAVAGLKALAADVGAEFVVVAGDVFEHNQLAPQIV
GQSLEAMRVIGLPVYLLPGNHDPLDASSVYTSTLFRAERPDNVVVLDRAG
VHEVRPGVQIVAAPWRSKAPTTDPVAEVLAGLPTDAAIRLLVAHGGVDAL
DPDHDKPSLIRLAALDDALTRQAIHYVALGDKHSLTQVGSSGRVWYSGAP
EVTNFDDVEPDPGHVLVVDIDESDPRHPVTVDARRIGRWRFVTLHHQVDT
SRDIADLDLNLDLMTDKDRTVVRLALTGSLTVTDRAALDTCLDKYARLFA
WLGLWERHTDLAVIPVDAEFTDLGIGGFAAAAVDELVATARGGDDESAVD
AQAALALLLRLADRGAA
>Rv1586c Probable phiRv1 integrase
MRYTTPVRAAVYLRISEDRSGEQLGVARQREDCLKLCGQRKWVPVEYLDN
DVSASTGKRRPAYEQMLADITAGKIAAVVAWDLDRLHRRPIELEAFMSLA
DEKRLALATVAGDVDLATPQGRLVARLKGSVAAHETEHKKARQRRAARQK
AERGHPNWSKAFGYLPGPNGPEPDPRTAPLVKQAYADILAGASLGDVCRQ
WNDAGAFTITGRPWTTTTLSKFLRKPRNAGLRAYKGARYGPVDRDAIVGK
AQWSPLVDEATFWAAQAVLDAPGRAPGRKSVRRHLLTGLAGCGKCGNHLA
GSYRTDGQVVYVCKACHGVAILADNIEPILYHIVAERLAMPDAVDLLRRE
IHDAAEAETIRLELETLYGELDRLAVERAEGLLTARQVKISTDIVNAKIT
KLQARQQDQERLRVFDGIPLGTPQVAGMIAELSPDRFRAVLDVLAEVVVQ
PVGKSGRIFNPERVQVNWR
>Rv3380c PROBABLE TRANSPOSASE
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv2886c PROBABLE RESOLVASE
MSRILTHVPGRTVNRSYALPALVGSAAGRLSGNHSHGREAYIALPQWACS
RQPSTPPLQTPGRINALWSLRPVLPMPGRGCQLLRLGGRWLSVVCCRNGS
MNLVVWAEGNGVARVIAYRWLRVGRLPVPARRVGRVILVDEPAGQPGRWG
RTAVCARLSSADQKVDLDRQVVGVTAWATAEQIPVGKVVTEVGSALYGRR
RTFLTLLGDPTVRRIVMKRRDRLGRFGFECVQAVLAADGRELVVVDSADV
DDDVVGDITEILTSICARLYGKRAAGNRAARAVAAAARAGGHEAR
>Rv3201c PROBABLE ATP-DEPENDENT DNA HELICASE
MTQTAAPARYSPAELACALGLFPPTAEQAAVIAAPPGPLVVIAGAGAGKT
ETMAARVVWLVANGYAEPGQVLGLTFTRKAAGQLLRRVRSRLARLAGIGL
GCGDPAACAPVVSTYHAFAGSLLRDYGLLLPLEPDTRLLSETELWQLAFD
VVSGYDGVLCTDKSPAAVTSIVVRLWGQLGEHLVDTRALRDTHVELERLV
HALPAGRYQRDRGPSQWLLRMLATQTQRAELVPLLDALGERMHAGKVMDF
AMQMASAARLAATSPQVGQDLRRRYRVVLLDEYQDTGHAQRVVLSSLFGG
GVDDGLALTAVGDPIQSIYGWRGASATNLPRFTTDFPLSDGTPAPVLELL
TSWRNPPQALRVANGISAEARRRSVAVRALRPRPDAPPGAVRCALLPDVQ
AEREWIADHLRMRYQRAEADGVKPPTAAVLVRRNADAAAIADTLRARGIP
AEVVGLAGLLSIPEVAEVVAMLRLVADPTAGAAAMRVLTGPRWRLGARDL
AALWRRALTLSGESPSTASPESIAMAASADADNPCLADAISDPGSAEGYS
VAGYGRIGALAGELSALRGRLGHSLPDLVAEVRRVLGVDCEVRASAPVSG
GWAGPEHLDAFADVVAGYAERASARSSEASVAGLLAYLDVAEVVENGLPP
AELTVACDRVQVLTVHAAKGLEWQVVAVAHLSRGVFPSTVSRSSWLTDPA
ELPPLLRGDRASAGAHGIPVLDTSAVADRKQLSDKISEHRRLLDRRRVDE
ERRLLYVAVTRAEDTLLVSGHHWGPTGTKPRGPSEFLCELKDIIDRSAAA
GDPCGVVEQWASAPAGDERNPLCDNAIEAVWPADPLAARRGDVERGAALV
AAAMSADLPGSTTDIDHPPRPGDAPWSTDVDALLAERAHAARGAPARGLP
NHLSVSSLVELVGDPVGARQRLMCRLPKRPDPHAWLGDAFHAWVQQFYGA
ELLFDLGDLPGAADREVGDPEELAALQRAFTASSWAARTPAAVEVPFEMP
IGDTVVRGRIDAVFVDPDGGATVVDWKTGKPPHGPAAMRQAAVQLAVYRL
AWAALRGCPTSSVRTAFYYVRSGITVVPDELPAPGELAMLLTDCAGRRSD
T
>Rv2529 HYPOTHETICAL PROTEIN
MHLAHRVASSRDTPSSSATPNAVSGSASNAADRPCLVRPPTAPPWAHGPR
LRRDPTGGGSTPSIVLSRSTDRSKDGHRIVPAGARKSGVRASTGRLPSTR
KTTRSPDCRPSASRTAFGTVTCPFDVTMGSSECLLHRCRTPPVPSHSVEL
LVAANPAEDSRLPYLIRLPVGAGLVFATSDVWPRTKALYCHRLDIADWPA
DPVVVDRVELRSCSRRGAAIDVVAARARENRSQLVHTMARGRQVVFWQSP
KTRKQSRPGVRTPTARAAGIPELHIVVDAHERYPYTFADKPAKTTREALP
CGDYGLKVAGQLVAAVERKALADLTSGVLNGNLKYQLTELAALPRAAVVV
EDRYSEIFAHSFARPTAIADGLAELQIGFPNVPIVFCQTRKLAQEYTYRY
LAAALTWFVDDADATTVFEPAAAEPEPSSAELRAWAKSVGLPVSDRGRLR
PQILQAWRAAHPR
>Rv1313c POSSIBLE TRANSPOSASE
MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSAVLRRCG
RCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVP
WARHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADT
EKRIDRFANLRRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATL
GLFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFHVVAWA
TEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGAR
YALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGK
QALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNT
KIRLLTRIAFGFRSPQALIALAMLTLAGHRPTLPGRHNHPQISQ
>Rv2480c POSSIBLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv3675 POSSIBLE MEMBRANE PROTEIN
MFTLLVSWLLVACVPGLLMLATLGLGRLERFLARDTVTATDVAEFLEQAE
AVDVHTLARNGMPEALDYLHRRQARRITDSPPLGSGAGPRYAGPLFVTDL
DSPVEPPRHGQPNPQFRTARHANHV
>Rv1370c PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2653c POSSIBLE phiRv2 PROPHAGE PROTEIN
MTHKRTKRQPAIAAGLNAPRRNRVGRQHGWPADVPSAEQRRAQRQRDLEA
IRRAYAEMVATSHEIDDDTAELALLSMHLDDEQRRLEAGMKLGWHPYHFP
DEPDSKQ
>Rv2791c PROBABLE TRANSPOSASE
MAKFEIPEGWMVQAFRFTLDPTAEQARALARHFGARRKAYNWTVATLKAD
IDAWQATGIQTAKPSLRVLRKRWNTVKNDVCVNIETGVVWWPECSKEAYA
DGIDGAVDAYWNWQNSRSGKRDGKRMGFPRFKKKGRDPDRVTFTTGAMRV
EPDRRHLTLPVIGTVRTHENTRRVERLIAKGRSRVLAITVRRNGTRIDAS
VRVLVQRPQQPKVTDPGSRVGVDVGVRRLATVATADGAVLERVPNPRPLD
AALNELRHVCRARSRCTKGSRRYRERTTEISRLHRRVNDVRTHHLHCLTT
HLAKTHGRIVVEGLDAAGMLRQQGLSGARARRRGLSDAALGTPRRHLSYK
TGWYGSQLVVADRWFPSSKTCHVCGHVQEIGWAEHWQCDSCSASHQRDDC
AAINLARYEDTSSVVGPVGAAVKRGADRKTRPGRAGGREARKGSSRKAAE
QPRDGVQVA
>Rv2646 PROBABLE INTEGRASE
MNTATRVRLARKRADRLNLKLIKNGHHFRLRDADEITLAVGHLGVVEAFL
AAAKSQNKPPGPPPSLHAPPSWRRDIDDYLLNLNAAGQRPATIRLRKTVL
CAAAHGLGRPPADVTAEHLLDWLGKQQHLSPEGRKTYRSTLRGFFVWAYE
MDRVRDYVADSLPKVRCPKQPPRPAGDDVWQAALAKADRRIELMIRLAGE
AGLRRAEAAQAHTGDLMDGGLLLVHGKGGKRRIVPISDYLAALIRDTPHG
YLFPNGTGGHLTAEHVGKLVSRALPGDATMHTLRHRYATRAYRGSHNLRA
VQQLLGHASIVTTERYTALCDDEVRAAAAAAW
>Rv0007 POSSIBLE CONSERVED MEMBRANE PROTEIN
MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAA
TRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAV
RTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGG
AGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWS
TLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGS
SAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL
ADRD
>Rv2819c HYPOTHETICAL PROTEIN
MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADI
PAHKRKSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSI
EPRRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL
QSLVHKRTAQPVRVPGHQTREHRQYGERFERKELRKSGRPNTRPQDAVND
LFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRECLAPGTSISH
RVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV
GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPL
VLKRTKIDNICYEMGQCELSIRRAE
>Rv3202c POSSIBLE ATP-DEPENDENT DNA HELICASE
MSHIWGVEAGAALAPGLRGPVLVLGGPGTGKSTLLVEAAVAHIGAGTDPE
SVLLLTGSGRMGMRARSALTTALLRSRTNGPCRAAIREPVVRTVHSYAYA
VLRKAAQRAGDALPRLLTSAEQDAIIRELLAGDAEDGPAATTTWPAHLRP
ALTTAGFATELRNLLARCAERGLDPLELQQLGRRRGRPEWIAAGQFAQRY
EQVMLLRGAVGLAAPQATAPALSAAELVGAALEAFAVDPELLAAERARVR
TLLVDDAQQLDPQAARLVRMLAAGTELALIAGDPNQAVFGFRGGEPTGLL
ADDPPPAGGAPIPSVTLTVSHRCAPAVARAVTGIARRLPGRSVGRRIEGT
GTEVGSVTVRLAGSAHAEAAMIADALRRAHLIDGVPWSQMAVIVRSVPRA
VRLPRALAAAGVPVAPPAVGGPLSAEPAVRALLTVLEATADGLDGDQALL
LLTGPIGGVDPVSLRQLRRTLQRARPGQTSRKFGDLLVEVLGGDAPPSGP
GSRALRRVRAVLTAAARCHRSGSLGGQDPRHTLWAAWQRSGLQRRWLAAS
EHGGAAAVQATRDLETVTALFDITDHYVSRTSGASLRGLVEHVTALQLPV
VRPEPAAPTEQVMVLSAHAALGHEWDLVVIAGLQDGLWPNTVPRGGVLGT
QRLLDELDGVTKDASMRAPLLAEERRLLVTAMGRARRRLLVTAVDSDAGG
GGHEAVLPSAFFFEIAQWADGDGEPVAMQPVSAPRVLSAAAVVGRLRVVV
CAPACAVDDADRDCAATQLARLAKAGVPGADPSEWHGLAPVSTSDPLCDS
DDLVTLTPSTLQALNDCPLRWLAERHGGTNTRELPSAVGSVLHALFAEPG
RSESQLLAELDRVWGHLPFGAQWYSANELARHRAMIQAFVQWRAQSRSEL
TEVGVEVDIDGALEDGSGQARKIRLRGRADRLERDPAGRLVIVDIKTGKT
PVSKDDAQQHAQLAMYQLAVAEGLVRAGDEPGGARLVYVGKSGAAGVAER
KQDPLTPAARDEWRNLVRQLAAATAGPQFIARRNDGCTHCPLRPGCPAHV
RGSAP
>Rv2554c CONSERVED HYPOTHETICAL PROTEIN
MVPAQHRPPDRPGDPAHDPGRGRRLGIDVGAARIGVACSDPDAILATPVE
TVRRDRSGKHLRRLAALAAELEAVEVIVGLPRTLADRIGRSAQDAIELAE
ALARRVSPTPVRLADERLTTVSAQRSLRQAGVRASEQRAVIDQAAAVAIL
QSWLDERLAAMAGTQEGSDA
>Rv3259 CONSERVED HYPOTHETICAL PROTEIN
MRGPLLPPTVPGWRSRAERFDMAVLEAYEPIERRWQERVSQLDIAVDEIP
RIAAKDPESVQWPPEVIADGPIALARLIPAGVDVRGNATRARIVLFRKPI
ERRAKDTEELGELLHEILVAQVAIYLDVDPSVIDPTIDD
>Rv0025 CONSERVED HYPOTHETICAL PROTEIN
MSEQAGSSVAVIQERQALLARQHDAVAEADRELADVLASAHAAMRESVRR
LDAIAAELDRAVPDQDQLAVDTPMGAREFQTFLVAKQREIVAVVAAAHEL
DRAKSAVLKRLRAQYTEPAR
>Rv1700 CONSERVED HYPOTHETICAL PROTEIN
MAEHDFETISSETLHTGAIFALRRDQVRMPGGGIVTREVVEHFGAVAIVA
MDDNGNIPMVYQYRHTYGRRLWELPAGLLDVAGEPPHLTAARELREEVGL
QASTWQVLVDLDTAPGFSDESVRVYLATGLREVGRPEAHHEEADMTMGWY
PIAEAARRVLRGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAF
AARRAER
>Rv0938 POSSIBLE ATP DEPENDENT DNA LIGASE (ATP DEPENDENT POLYDEOXYRIBONUCLEOTIDE SYNTHASE) (THERMOSTABLE DNA LIGASE) (ATP DEPENDENT POLYNUCLEOTIDE LIGASE) (SE
MGSASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHIAGRPAT
RKRWPNGVDQPAFFEKQLALSAPPWLSRATVAHRSGTTTYPIIDSATGLA
WIAQQAALEVHVPQWRFVAEPGSGELNPGPATRLVFDLDPGEGVMMAQLA
EVARAVRDLLADIGLVTFPVTSGSKGLHLYTPLDEPVSSRGATVLAKRVA
QRLEQAMPALVTSTMTKSLRAGKVFVDWSQNSGSKTTIAPYSLRGRTHPT
VAAPRTWAELDDPALRQLSYDEVLTRIARDGDLLERLDADAPVADRLTRY
RRMRDASKTPEPIPTAKPVTGDGNTFVIQEHHARRPHYDFRLECDGVLVS
WAVPKNLPDNTSVNHLAIHTEDHPLEYATFEGAIPSGEYGAGKVIIWDSG
TYDTEKFHDDPHTGEVIVNLHGGRISGRYALIRTNGDRWLAHRLKNQKDQ
KVFEFDNLAPMLATHGTVAGLKASQWAFEGKWDGYRLLVEADHGAVRLRS
RSGRDVTAEYPQLRALAEDLADHHVVLDGEAVVLDSSGVPSFSQMQNRGR
DTRVEFWAFDLLYLDGRALLGTRYQDRRKLLETLANATSLTVPELLPGDG
AQAFACSRKHGWEGVIAKRRDSRYQPGRRCASWVKDKHWNTQEVVIGGWR
AGEGGRSSGVGSLLMGIPGPGGLQFAGRVGTGLSERELANLKEMLAPLHT
DESPFDVPLPARDAKGITYVKPALVAEVRYSEWTPEGRLRQSSWRGLRPD
KKPSEVVRE
>Rv2355 PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv2659c PROBABLE phiRv2 PROPHAGE INTEGRASE
MTQTGKRQRRKFGRIRQFNSGRWQASYTGPDGRVYIAPKTFNAKIDAEAW
LTDRRREIDRQLWSPASGQEDRPGAPFGEYAEGWLKQRGIKDRTRAHYRK
LLDNHILATFADTDLRDITPAAVRRWYATTAVGTPTMRAHSYSLLRAIMQ
TALADDLIDSNPCRISGASTARRVHKIRPATLDELETITKAMPDPYQAFV
LMAAWLAMRYGELTELRRKDIDLHGEVARVRRAVVRVGEGFKVTTPKSDA
GVRDISIPPHLIPAIEDHLHKHVNPGRESLLFPSVNDPNRHLAPSALYRM
FYKARKAAGRPDLRVHDLRHSGAVLAASTGATLAELMQRLGHSTAGAALR
YQHAAKGRDREIAALLSKLAENQEM
>Rv0741 PROBABLE TRANSPOSASE (FRAGMENT)
MFSVKGEEGKQALDRWISWARRCRIPVFVELAGGIVRHRQAIDAALDHGL
WQGLIESTNTKIRLLTRIAFGFRSPEALIALAMLALGGRRPALPGRTKHP
RISQ
>Rv3023c PROBABLE TRANSPOSASE
MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA
LTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIWSNNPQERLNREVRRRT
DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE
EPAKQQTTNTPALTT
>Rv0269c CONSERVED HYPOTHETICAL PROTEIN
MSRMAAPVSLDVHGRQVIVTHPGRVVFPAHNDRKGYTKFDLVRYYLAVAE
GAMRGVAGRPMILKRFVKGISAEAVFQKRAPANRPDWVDVAELHYASGRS
AAEAVIHDAAGLAWVINLGCVDLNPHPVLAGDLDHPDELRVDLDPMPGVA
WQRVVEVALVVREVLEDYGLTAWPKTSGSRGFHVYARIAPCWSFPQVRLA
AQTVAREVERRLPDAATSRWWKEEREGVFVDFNQNAKDRTVASAYSVRAT
PDARVSTPLHWEEVPGCDPAVFTMATVPSRLADIGDPWAGMDDAVGRLDR
LLMLAEELGPPQKAQSAKPLIEIARAKTRAEAMAALDIWRDRYPGAAALL
RPADVLVDGMRGPSSIWYRIRINLQHVPADQRPPQEELIADYSPWPR
>Rv1765A PUTATIVE TRANSPOSASE (FRAGMENT)
MWVADITFVRTWQGFCYTAFVTDVCTRKIVVWAVSATMRTEDLPVQVFNH
AVWQSNSDLSELVHHSDPGSQ
>Rv2309c POSSIBLE INTEGRASE (FRAGMENT)
MTGAGIVETTTNRVRHVPVPEPVSERLRDELPTEPNALVFPSYRGGHLPI
EEYRRAFDKGCKAVGIADLVPHGLRHTTASLAISAGANVKVVQRLLGHAT
AAMTLDRHGHLLSDDLAGVAGLLVQAIKSAAASLRYSDPDSVAVENISAA
S
>Rv1034c PROBABLE TRANSPOSASE (FRAGMENT)
MQQGNPPDAPQLAPAVAWVKKRAGRTPRTVTADRGYGEAAVDQQLTEVGV
KNVLIPRKGKPSQDRRAEEHRKAFRRTIKWRTGCEGRISHLKRGYGWDRG
RIGGLEGTRTWVGHGVFAHNLVTISALPA
>Rv3191c PROBABLE TRANSPOSASE
MRQISSRYLSEEERINIADLRRSGLSIRKIADQLGRAPSTVSRELRRNSR
RDGQYRPFEAHRWAVQRRVRRHRRRIDKNPDLCELIAELLAQRWSPQQIA
RHLRRKYPDDRSMWLCHESIYQAVYQPQSRLIRPPQVKSPHRGPLRTGRT
HRRAHLRPGRRRPRFAQPMLSIHQRPFDPADRSEPGHWEGDLIVGKNQGS
AIGTLVERQTRLIRLLHLPTHDAYCLRIAITETMSDLPVTLVRSITWDQG
IEMARHIDITADLGAPVYFCDSRSPWQRASNENSNGLLRQYFPKGTSLST
YTPDHLRAVEYEINNRPRQVLGHRSPAELFTALLTSPDHQLLRR
>Rv2792c POSSIBLE RESOLVASE
MNLAVWAERNGVARVTAYRWFHAGLLPVPARKAGRLILVDDQPADRSRRA
RTAVYARVSSADQKPDLDRQVARVTAWATTEQIAVDKVVTEVGSALNGHR
RKFLALLRDPSVKRIVVEHRDRFCRFGSEYVEAALAAQGRELVVVDSAEV
DDDLVRDMTEILTSMCARLYGKRAAQNRAKRALAAAAEESEAA
>Rv0606 POSSIBLE TRANSPOSASE (FRAGMENT)
MPRLEIPNGWCVQAFRFTLDPTAEQAHALARHFGARRKAYNWTVAQLKAD
IQAWRATGAQTAKPSLRVLRKRWNTVKDEVCVNAETGTVWWPECSKEAYA
DGIAGAVDAYWNWQQRRAGKRDGKRMGFPRFKKKGRDADRVSFTTGAMRV
EPDRRHLTLPVIGCVRTHENTRRIERLIAKDRARVLAITVRRNGTRLDAS
VRVLVQRPQQPNVELPESRIGVDVGVRRLATVATADGACCPVLVPDG
>Rv3186 PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv3640c PROBABLE TRANSPOSASE
MALPQSALSELLDAFRTGDGVDLIRDAVRLVLQELSELEATERIGAARYE
RSDTRVTDRNGARSRVLSTQAGDVELRIPKLRKGSFFPAILEPRRRIDQA
LYAVVMEAYVHGISTRAVDDLVEAMGVETGISKSEVSRICAGLDEIVGAF
RTRTLGHIEFPYVYLDATYLNVRNGTGQVVSMAVIVASGIAADGSREILG
LDVGDSEDETFWRGFLTSLKGRGLGGVRLVISDQHAGLVKALKRCFQGAG
HQRCRVHFARNLLAHVPKDKADMVASMFRMIFSAPDAEAVHATWEGVRDR
LAASFPKIGPLMDDARAEVLAFTAFPKAHWQKIWSTNPLERINKEIKRRS
RVVGIFPNPAAVIRLVGAVLADMHDEWQASERRYLSEASMALLYPDSDNA
VVAAISGGQ
>Rv3040c CONSERVED HYPOTHETICAL PROTEIN
MNSPREPLVPPPTPRPAATVMLVRDPDAGSASGLAVFLMRRHAAMDFAAG
VMVFPGGGVDDRDRDADLGRLGAWAGPPPQWWAQRFGIEPDLAEALVCAA
ARETFEESGVLFAGPVDQDHSAPNSIVSDASVYGDARRALADRTLSFADF
LQREKLVLRSDLLRPWANWVTPEAELTRRYDTYFFVGALPEGQRADGENT
ESDRAGWVLPADAIADFAAGRNFLLPPTWTQLDSLAGHTVADVLAVERQI
VPVQPQLARNGDNWEIEFFDSDRYNQARRSGGSTGWPL
>Rv3636 POSSIBLE TRANSPOSASE
MLSVEDWAEIRRLRRSERLPISEIARVLKISRNTVKSALASDGPPKYQRA
AKGSVADEAEPRIRELLAAYPRMPATVIAERIGWWYSIRTLSGRVRELRP
LYLPPDPASRDICGR
>Rv3263 PROBABLE DNA METHYLASE (MODIFICATION METHYLASE) (METHYLTRANSFERASE)
MQPSHPTRPGAVIRYVGSSLDTCPMTTFAGKTAASADKVRGGYYTPPAVA
RFLAHWVHQAGPKILEPSCGDGRILRELSAITDHAHGVELVAREAKKSRD
FASVDTENLFTWLHKTQLGSWDGVAGNPPYIRFGNWASEQRDPALELMRR
VGLRPTKLTNAWVPFVVASTTLARDGGRVGLVVPAELLQVTYAAQLREFL
LSRYREITLVTFERLVFDGILQEVVLFCGVVGPGPAHIRTVRLGDANDLN
ALGDKDFTNESAPALLHEKEKWTKYFLDPAQIRLLRGLKQSATMIRLGEL
ADVDVGIVTGRNSFFTFTDAKAQALGLRAHCVPLVSRSAQLSGLIYDEDC
RACDVAGNHRTWLLDAADYPTDPALVAHITAGEAAGVHLGYKCSIRKPWW
STPSLWMPDLFMLRQIHFAPRLTVNAAAATSTDTVHRVRLDPNVDPATLA
AVFHNSATFAFAEIMGRSYGGGILELEPREAEQLPMPPPAYGSAELAQDV
DLLLKANEIDKALDVVDRHVLIDGLGLSPRLVAGCRAAWLTLRDRRTKRG
SRR
>Rv3184 PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2559c CONSERVED HYPOTHETICAL ALANINE LEUCINE VALINE RICH PROTEIN
MPEAVSDGLFDVPGVPMTSGHDLGASAGAPLAVRMRPASLDEVVGQDHLL
APGSPLRRLVEGSGVASVILYGPPGSGKTTLAALISQATGRRFEALSALS
AGVKEVRAVIENSRKALLHGEQTVLFIDEVHRFSKTQQDALLSAVEHRVV
LLVAATTENPSFSVVAPLLSRSLILQLRPLTAEDTRAVVQRAIDDPRGLG
RAVAVAPEAVDLLVQLAAGDARRALTALEVAAEAAQAAGELVSVQTIERS
VDKAAVRYDRDGDQHYDVVSAFIKSVRGSDVDAALHYLARMLVAGEDPRF
IARRLMILASEDIGMAGPSALQVAVAAAQTVALIGMPEAQLTLAHATIHL
ATAPKSNAVTTALAAAMNDIKAGKAGLVPAHLRDGHYSGAAALGNAQGYK
YSHDDPDGVVAQQYPPDELVDVDYYRPTGRGGEREIAGRLDRLRAIIRKK
RG
>Rv2978c PROBABLE TRANSPOSASE
MPKFEVPDGWTVQAFRFTLDPTEDQAKALARHFGARRKAYNWTVATLKAD
IQAWHASGTVTAKPSLRVLRKRWNTVKDDVCVNTETGVAWWPECSKEAYA
DGIAGAVEAYWNWQTSRAGKRAGKRVGFPRFKRKGRDQDRVSFTTGAMRV
EPDRRHLTLPVIGTVRTHENTRRIERLIKAGRARVLAISVRRNGTRLDAS
VRVLVQRPQQPKVVHPGSRVGVDVGVRRLATVATADGTAIEQVENPRPLG
AALRELRHVCRARSRCTKGSRRYRERTTQISRLHRRVNDVRTHHLHVLTT
RLAQTHGRIVVEGLDATEMLRQKGLPGARARRRGLSDAALGTPRRHLSYK
TVWYGSALVVADRWFPSSKTCHACRHVQDIGWDEQWQCDRCSVVHQRDDC
AAINLARYEETSSIVGPVGAAVKRGADRKTGPRPAGGCEARKGSSPKAAE
QPRDGVQVA
>Rv3349c PROBABLE TRANSPOSASE
MAIDPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRRRVTWAF
HDRRGRKIDPQWANRRRLLTARERLSDKSFAKMRNRINAVDPRAQILSAW
IAKEELRTLLSTVRTGGDPHLARHHLHRFLPGASTRRSPNCSPWPPPLTS
HPRSTPSWSPASPTRASVVGEVAEMLGDIDGQCVQVEVPVPERGPAGCGG
LDGLGRAGVSATPRVCAAMTAVNVAGRCAGQQADVGPTPQHRCRGR
>Rv1763 PUTATIVE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv1042c PROBABLE IS LIKE-2 TRANSPOSASE
MTRVGVISDEFWAVVEPLMPSHEGKPGRRFSDHRLILEGIAWRFRTGSPW
RDLPAEFGPWQTVWKRHHRWSLDGTCDEVFAHVAAVFGVDAEVAEDIEKL
LSVDSTNVRAHQHSAGACSDTLATGGTVGLQEIRR
>Rv0922 POSSIBLE TRANSPOSASE
MIVRMRSCAQAAKVAEATGGVQLAGKPKPDGTPTFSRYVEIGVDFEAHRP
VVESVSVLFELYDGDANSYAATGGPGAQLPSGWMVTAAKFEVEWPADPQR
AGLVRSHFGARRKAFNWGLAQVKADLDAKAADPAHESVDWDLKSLRWAWN
RAKDDVAPWWAENSKECYSSGLADLAQGLANWKAGKNGTRKGRRVGFPRF
KSGRRDPGRVRFTTGTMRIEDDRRTITVPVIGPLRAKENTRRVQRHLVSG
RAQILNMTLSQRWGRLFVAVCYALRTPTTRSPLTQPTVRAGMDLGVRTLA
TVATLDTATGEQTIIEYPNPAPLKATLVARRRAGRELSRRIPGSHGHRAV
KAKLARLDRRCVHLRREAAHQLTTELAGTYGQVVIEDLDVAAMKRSMRRR
AFRRSVSDAAMGLVAPQLAYKTAKCSGVLTVADRWFASSQIHHGCTSPDG
TPCRLQGKGRIDKHLLCPVTGEVVDRDRNAALNLRDWPDNASRGPVGTTA
PSAPGPTTTVGTGHGADTGSSGAGGASVRPRPRRAGRGEAKTQTPQGDAA
>Rv3431c POSSIBLE TRANSPOSASE (FRAGMENT)
MFAELIRAGLQALIEAEATEAIGAGRYERSDGRIVHRNGHRPKTVSTTAG
DIEVQIPKLRAGSFFPSLLERRRRIDKALHAVIMEAYVHGVSTRSVDDLV
AAMGVQAGVSKSEVSRICAGLDTEIEAFRTRSLTHTEFPYVFCDATFCKV
RVGAHVVSQALVVATGVSIDGTREVLGTAVGDSESYEFWREFLASLKARG
LTGVHLVISDAHAGLKAAVAQQFSGASWQRCRVHFMRNLYTAVAAKHAPA
VTVAVKTIFAHTDPEEVGAQWDRVADPLCQP
>Rv2177c POSSIBLE TRANSPOSASE
MRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMIGALDEQIEQLMH
PFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLASWVRLCPG
NHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFG
GFRSPAANKKAIIAVAHKLIVIIWHVLATGRPYQDLGADYFTTRMDPDKE
RRRLVAKLEAQGLGVTLEPAA
>Rv2014 POSSIBLE TRANSPOSASE
MLHDRLTGAPRGATGDEGAANAHITRAMVAALTSVATQIKTLDAQIAEQL
SLHADAHIFTSLPRSGTVRAARLLAEIGDCRARFPTPESLACLAGVAPST
RQSGKVKHVGFRWAADKQLRDAVCDFAGDSRRANLWAADRYNRAIARGHD
HPHAVRILARAWLYAIWHCWQDGAAYHPANHRALQALLNQDQDRAA
>Rv0755A PUTATIVE TRANSPOSASE (FRAGMENT)
MKELSVAEQRYQAVLAVISDGLSISQVAEKVGVSRQTLHTWLARYEAEGL
DGLRIGTGTAL
>Rv3327 PROBABLE TRANSPOSASE FUSION PROTEIN
MVVVGTDAHKYSHTFVATDEVGRQLGEKTVKATTAGHATAIMWAREQFGL
ELIWGIEDCRNMSARLERDLLAAGQQVVRVPTKLMAQTRKSARSRGKSDP
IDALAVARAVLRETDLPLATHDETSRELKLLTDRRDVLVAQRTSAINRLR
WLVHELDPERAPAARSLDAAKHQQALRTWLDTQPGLVAELARAELTDIIR
LTGEINTLAQRISARVHQVAPALLEIPGCAELTAAKIVGEAAGVTRFKSE
AAFACHAAVAPIPVWSGNTAGQMRLSRSGNRQLNAALHRIALTQIRMTDS
RGQAYYQRLQDAGKTKRAALRCLKRRLARTVFQALRTVHQPSSEHTQPAA
ACHRSYCSSHLGEPPRLTDMTQKTRIQPLPPKRAGLLIRALYRIAKRRFG
EVPEPFTVTAHHRRLLIANVVHEALLQRASRKLPPSVRELAVFWTARSIG
CSWCVDFGAMLQRLDGLDVDRLTDIDNYATSSKFSDDERAAIAYAEAMTA
DPHSVTDEQVADLRARFGEAGVIELTYQIGVENMRARMNSALGITEQGFN
SGDACRVPWAAPDVPSAESR
>Rv2413c CONSERVED HYPOTHETICAL PROTEIN
MHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGAYELAEL
LSPSLFAEERIVVLGAAAEAGKDAAAVIESAAADLPAGTVLVVVHSGGGR
AKSLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRVKVDDETVTAL
LDAVGSDVRELASACSQLVADTGGAVDAAAVRRYHSGKAEVRGFDIADKA
VAGDVAGAAEALRWAMMRGEPLVVLADALAEAVHTIGRVGPQSGDPYRLA
AQLGMPPWRVQKAQKQARRWSRDTVATAMRLVAELNANVKGAVADADYAL
ESAVRQVAELVADRGR
>Rv2106 PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv3387 POSSIBLE TRANSPOSASE
MVRNAQRAVRRASGRRKAWLRQAINHLEKLIGRTERVVDQARSRLAGVMP
DSSSRLVSLHDADARPIRKGRLGKPVEFGYKAQVVDNADGVILDHSVELG
NPADAPQLAPAIERISRRTGRPPRAVTADRGCGDASVEDDLHQLGVRNVA
IPRKSKPSATRRAFEHRRAFRDKIKWRTGSEGRINHLKRSYGWNRTELTG
ITGARTWCGHGVFAHNLVKISTLAA
>Rv3866 CONSERVED HYPOTHETICAL PROTEIN
MTGPSAAGRAGTADNVVGVEVTIDGMLVIADRLHLVDFPVTLGIRPNIPQ
EDLRDIVWEQVQRDLTAQGVLDLHGEPQPTVAEMVETLGRPDRTLEGRWW
RRDIGGVMVRFVVCRRGDRHVIAARDGDMLVLQLVAPQVGLAGMVTAVLG
PAEPANVEPLTGVATELAECTTASQLTQYGIAPASARVYAEIVGNPTGWV
EIVASQRHPGGTTTQTDAAAGVLDSKLGRLVSLPRRVGGDLYGSFLPGTQ
QNLERALDGLLELLPAGAWLDHTSDHAQASSRG
>Rv3827c POSSIBLE TRANSPOSASE
MMARFEVPEGWCVQAFRFTLDPTEDQARALARHFGARRKAYNWAVATLKA
DIEAWRVTGIGTVKPSLRVLRKRWNTVKDEVCVNAETGAVWWPECSKEAY
ADGIGGAVDAYWNWQNSRSGKREGKTMGFPRFKKKGRDQDRVTFTTGAMR
VEPDRRHLTLPVVGTVRTHENTRRIERLIATGRARVLAISVRRNGTRLDA
SVRVLVQRPQQPNVAQPGSRVGVDVGVRRLATVANEAGAVLEEVPNPRPL
DTALKELRYASRARSRCTKGSRRYRERTTEISRLHRRVNDVRTHHLHVLT
TRLAQTHGHIVVEGLDAAGMLRQKGLPGARARRRGLSDSALGTPRRHLSY
KTGWYGSALVVADRWFPSLSVEPTVRPGLARLVAVKRGREAAAWLPNNPE
TGCKSRDH
>Rv2167c PROBABLE TRANSPOSASE
AEALAAGQRRIAKGERDFKDRVGFLRGRARPASTLITRFIADHQGHREGP
DGLRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISR
VHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRT
TIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAY
ARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQY
TSIRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRS
IEDVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv2816c CONSERVED HYPOTHETICAL PROTEIN
MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGF
GYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYG
RGRLVSAEEFVFF
>Rv1156 CONSERVED HYPOTHETICAL PROTEIN
MPNLQLVQEPAADALLNANPFALLVGMLLDQQVPMETAFAGPKKIADRMG
SFDAGDIADYDPDKFVALCSERPAIHRFPGSMAKRIQALAQIIVDRYDGD
AAALWTAGEPDGNELLRRLKGLPGFGEQKARIFLALLGKQYGVTPKGWQV
AAGEFGQPGTYLSVADIVDAGSLGQVRSHKRQRKAAAKAEGKAPT
>Rv0829 POSSIBLE TRANSPOSASE (FRAGMENT)
MGPSSKTCHACRHVQDIGWDEKWQCDGCSITHQRDDNAAINLARYEEPPS
VVGPVGAAVKRGADRKTGPGPAGGREARKATGHPAGEQPRDGVLVA
>Rv3326 PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv2917 CONSERVED HYPOTHETICAL ALANINE AND ARGININE RICH PROTEIN
MRVTRLVDAESTRCDVGPAPKSVAMLHFTAATSRFRLGRERANSVRSDGG
WGVLQPVSATFNPPLRGWQRRALVQYLGTQPRDFLAVATPGSGKTSFALR
IAAELLRYHTVEQVTVVVPTEHLKVQWAHAAAAHGLSLDPKFANSNPQTS
PEYHGVMVTYAQVASHPTLHRVRTEARKTLVVFDEIHHGGDAKTWGDAIR
EAFGDATRRLALTGTPFRSDDSPIPFVSYQPDADGVLRSQADHTYGYAEA
LADGVVRPVVFLAYSGQARWRDSAGEEYEARLGEPLSAEQTARAWRTALD
PEGEWMPAVITAADRRLRQLRAHVPDAGGMIIASDRTTARAYARLLTTMT
AEEPTVVLSDDPGSSARITEFAQGTSRWLVAVRMVSEGVDVPRLSVGVYA
TNASTPLFFAQAIGRFVRSRRPGETASIFVPSVPNLLQLASALEVQRNHV
LGRPHRESAHDPLDGDPATRTQTERGGAERGFTALGADAELDQVIFDGSS
FGTATPTGSDEEADYLGIPGLLDAEQMRALLHRRQDEQLRKRAQLQKGAT
QPATSGASASVHGQLRDLRRELHTLVSIAHHRTGKPHGWIHDERRRRCGG
PPIAAATRAQIKARIDALRQLNSERS
>Rv1444c HYPOTHETICAL PROTEIN
MTVMADRSGRPAPVRRRMKTLTQAALNADKTVEQVEDVLDGLGKTMAELN
SSLSQLNSTVERLEDGLDHLEGTLHSLDDLAKRLIVLVEPVEAIVDRIDY
IVSLGETVMSPLSVTEHAVRGVLDRLRNRTVHEPTN
>Rv3430c POSSIBLE TRANSPOSASE
MIDTAIEEMIPLIGVRAACAATGRAPASYYRAHSKRLSAQSDTFTSTAVT
DPSGPRESAQPRALSAAEREHVLAVLNSQRFADMAPAVVYATLLDEGIYL
CSESTMYRLLRERGQTGDRRRQATHPAAVKPELVAHQPNSVWSWDITKLR
GPAKWSYYYLYVILDIFSRYVVGWMVASRESKVLAERLIAQTLAAQHISA
DQLTLHADRGSSMSSKPVALLLADLGVTKSHSRPHTSNDNPLSEAQFKTL
KYRPDFPKRFESIEAARVHCDRFFGWYNHEHKHSGIGLHTPADVHYGRAD
QIRRHRATVLDTAYRDHLERIRSQTTRATRATGLQRDQPTTEGGPADSIN
PRKSCLRNVDRFRPGLLDLPAPAPVDLRRLLPSGQIR
>Rv2979c PROBABLE RESOLVASE
MNLATWAERNGVAPGTAYRWFRAGLLSVMARRVGRLILVDEPAGDAGMRS
PTAVYARVSSADQKADLDRQVARVTAWATAQQMPVDKVVTEVGSAFNEHR
RKFLSLLRDPSVHRIVVEHRDRFCRLGSKYVQAAFAAQGRELVVVDSAEV
DDDLVRDMTEILTSMCARLYGKRAAENRTKRALAAAAGEDHEAA
>Rv2512c TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1081
MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA
LTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIWSNNPQERLNREVRRRT
DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE
EPAKQQTTNTPALTT
>Rv1259 CONSERVED HYPOTHETICAL PROTEIN
MNIAAESSAKPVWGPPNFCAAAARMQDVRVLMHPKTGRAFRSPVEPGSGW
PGDPATPQTPVAADAAQVSALAGGAGSICELNALISVCRACPRLVSWREE
VAVVKRRAFADQPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTG
DRSGDQLYAALHRAGLVNSPVSVDAADGLRANRIRITAPVRCAPPGNSPT
PAERLTCSPWLNAEWRLVSDHIRAIVALGGFAWQVALRLAGASGTPKPRF
GHGVVTELGAGVRLLGCYHPSQQNMFTGRLTPTMLDDIFREAKKLAGIE
>Rv2810c PROBABLE TRANSPOSASE
PLRLQAHTGGPPVALRQETTGGPSPTNDLITEPPRHYKQQTRVRQAPALL
TVSAGTGVPVVLEELAKLGRTLWRCRHDVLAYFDHHASNGPTEAINGRLE
ALCRNALGFRNLTHYRIRSLLHCGNLAQLIHAL
>Rv3517 CONSERVED HYPOTHETICAL PROTEIN
MIEPFLGSEAIASGALTRHRLRSAYATIHPDVYVSPGADLTAWSRAQAAW
LWSRRRGVIAGQSAAAMHGAKWVDARQAAELLYDHRRPPAGIHTWSDRVA
DDEIQPISGMNTTTPARTALDLARRYPVGKAVAAIDALARATDLKLADVE
MLAERYRGSRGIRNARIALDLVDPGAESPRETWLRLLLIRAGFPRPQTQI
PVYDEYGQLVAVIDMGWAGIKVGVDYEGDHHRTDRRTFNKDIKRAEALTE
LGWTDVRVTVEDTEGGIIWRVSAAWQRRT
>Rv3672c CONSERVED HYPOTHETICAL PROTEIN
MSAGGTPLQAGATPTGSRGTVALRPDAGPSWLRPLVDNVGQIPDAYRRRL
PADVLAMVTAAGAVSAMTSSRRDHREAAVLVLFSGPEAGPGDGGVPDDAD
LLLTVRASTLRHHAGQAAFPGGVVDPADDGPVATALREANEETGIDPSRL
HPLATMERTFIAPSRFHVVPVLAYSPDPGPVAVVNEAETAIVARVPVRAF
INPANRLMVYRRPHTRRWAGPAFLLNQMLVWGFTGQVISAVLDVAGWAQP
WDTGDIRELDAAMVLIDDESDPR
>Rv1756c PUTATIVE TRANSPOSASE
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv3644c POSSIBLE DNA POLYMERASE
MSGVFTRLVGQQAVEAELLATAKAARRDSAHSAGGGGTMTHAWLLTGPPG
SGRSVAALCFAAALQCTSGGEPGCGRCRACTTTLAGTHADVRRVIPEGLS
IGVDEMRAIVQIAARRPTTGHWQIVVIEDADRLTEGAANALLKVVEEPPP
STVFLLCAPSVDPEDIAVTLRSRCRHVALVTPSTHAIAQVLSDGDGLDPD
TANWAASVSGGHVGRARRLATDPQARQRRERALGLARDAATPSRAYAAAE
ELVAGAEAEALALTAQRIEAETEELRTALGAGGTGKGTGAALRGATGAMK
DLERRQKSRQTRASRDALDRALIDLATYFRDALLVAAHAGGVRANHPDMA
DRVAALAAHAPPERLLRCIEAVLACREALAVNVKPKFAVDAMVATIGQEL
R
>Rv3730c CONSERVED HYPOTHETICAL PROTEIN
MAAAAEELDVDGIAVRLTSPDRMYFPKLGSHGTKRRLVEYYFAVAGGPML
TALRDRPTHLQRFPDGVDGEQIYQKRIPRHRPDYLQTCRVTFPSGRMADA
LKVTHPAAIVWAAQMGTITLHPWQVRCPDTEHPDELRIDLDPQPGTGFVE
ARTVAVDVLRSVLDDLGLVGYPKTSGGRGIHVFLRIATDWDFVEVRRAGI
ALAREVERRAPDAVTTSWWKEERGARIFIDFNQNARDRTMASAYSVRPTP
IATVSMPLTWEELAGADPDDYTMTTVPELVKIRDDPWAGMDDVAQSIAPL
LDLAAADEERGLGDMPYPPNYPKMPGEPKRVQPSRDTDLKGGNTSK
>Rv3115 PROBABLE TRANSPOSASE
MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA
LTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIWSNNPQERLNREVRRRT
DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE
EPAKQQTTNTPALTT
>Rv2119 CONSERVED HYPOTHETICAL PROTEIN
MADQPDPPTPRPALSPSRATDFKQCPLLYRFRAIDRLPEATSAAQLRGSV
VHAALEQLYGLPAGLRSPDTARSLVQRAWDQMVAAEPELAGELDPGQPTQ
LLEDARALVSGYYRLEDPTRFDPQCCEQRVEVELADGTLLRGYIDRIDVA
ATGELRVVDYKTGKAPPAARALAEFKAMFQMKFYAVALFRSRGVPPTRLR
LIYLADGQLLDYSPDRDELLRFEKTLMAIWRAIQSAGETGDFRPNPSRLC
DWCPHQQRCPAFGGTPPPYPGWPTEPAA
>Rv1041c PROBABLE IS LIKE-2 TRANSPOSASE
MRASPADGLAITGLSWKGSRGGSVREVRGGTCPLSSGRGKRCGSAITVGR
WMVPATRCSPTLPRCSGWTLRWPRISRSCCRWIPRTCGHTSIRRAPARTR
SPQGALSDYKKSADEPDDHAIGRSRGGLTTKIHALTDQREAPVRIRLTAG
QAGDNPQLLPLLDDYRHASTEYALGSTDFRLLADKAYSHPSTRAALRSKK
IKHTIPERQDQIDRRKAKGSAGGRPPAFDAALYGLRNTVERGFHRLKQWR
GIATRYDKYALTYLGGVLLACAVIHARVGTPKLGDTP
>Rv2479c PROBABLE TRANSPOSASE
AEALAAGQRRIAKGERDFKDRVGFLRGRARPASTLITRFIADHQGHREGP
DGLRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISR
VHAANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRT
TIADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAY
ARRILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQY
TSIRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRS
IEDVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>Rv2578c CONSERVED HYPOTHETICAL PROTEIN
MRWARQAVAVNGMPVDDGALPGLQRIGLVRSVRAPQFDGITFHEVLCKSA
LNKVPNAAALPFRYTVNGYRGCSHACRYCFARPTHEYLDFNPGTDFDTQV
VVKTNVAAVLRHELRRPSWRRETVALGTNTDPYQRAEGRYALMPGIIGAL
AASGTPLSILTKGTLLRRDLPLIAEAAQQVPVSVAVSLAVGDPELHRDVE
SGTPTPQARLALITAIRAAGLDCHVMVAPVLPQLTDSGEHLDQLLGQIAA
AGATGVTVFGLHLRGSTRGWFMCWLARAHPELVSRYRELYRRGPYLPPSY
REMLRERVAPLIAKYRLAGDHRPAPPETEAALVPVQATLF
>Rv3849 CONSERVED HYPOTHETICAL PROTEIN
MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPYLSQLRS
GNRTNPSGATMAALANFFRIKAAYFTDDEYYEKLDKELQWLCTMRDDGVR
RIAQRAHGLPSAAQQKVLDRIDELRRAEGIDA
>Rv3185 PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv1199c POSSIBLE TRANSPOSASE
MTSSHLIDTEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA
LTDKLPAVAEHLDTARTDLLAFTAFPKQIWRQIWSNNPQERLNREVRRRT
DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE
EPAKQQTTNTPALTT
>Rv2814c PROBABLE TRANSPOSASE
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv2943A POSSIBLE TRANSPOSASE
MPTTKATQRRDVSTEIAYLTRALKAPTLRESVSRLADRARAENWSHEEYL
AACLQREVSARESHGGEGRIRAARFPARKSLEEFDFEHARGLKRDTIAHL
GTLDFITARDNVVFLGPAWHREDSSCGRPGDTRVSGRSSGAVRHRRRMGS
TARRGSPRRAHLRRTHPALPLSAPGG
>Rv2278 PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv0142 CONSERVED HYPOTHETICAL PROTEIN
MRSIDVVVEAVVTFAGAAGFAHTLAPLRRGQQDPCFRVPGDGTIWRTSLL
PTGPVTARISRAGRDAARCVAWGSGAEEFVDMAPAMLGAADDASDFVPLH
PAVAAAHRRLPNLRLGRTGQVLEALIPAVIEQRVPGADAFRSWRLLVSKY
GTQAPGPAPPGMRVPPSAEVWRHIPSWEFHRANVDPGRARAVVGCAQRAA
SLERLVSLPAARAAEALTSLPGVGVWTAAETTQRVFGDADAVSVGDYHIP
KMIGWTLVGRPVDDAGMLELLEPMRPHRHRVVRLLEASGLAREPRRGPRL
PVQNIRAL
>Rv3204 POSSIBLE DNA-METHYLTRANSFERASE (MODIFICATION METHYLASE)
MAPVTDEQVELVRSLVAAIPLGRVSTYGDIAALTGLSSPRIVGWIMRTDS
SDLPWHRVIRASGRPAQHLATRQLELLRAEGVLSVDGRVALSEIRYEFPP
G
>Rv0797 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS1547
MVVVGTDAHKYSHTFVATDEVGRQLGEKTVKATTAGHATAIMWAREQFGL
ELIWGIEDCRNMSARLERDLLAAGQQVVRVPTKLMAQTRKSARSRGKSDP
IDALAVARAVMRETDLPLATHDETSRELKLLTDRRDVLVAQRTSAINRLR
WLVHELDPERAPAARSLDAAKHQQALRTWLDTQPGLVAELARAELTDIIR
LTGEINTLAQRISARVHQVAPALLEIPGCAELTAAKIVGEAAGVTRFKSE
AAFACHAAVAPIPVWSGNTAGQMRLSRSGNRQLNAALHRIALTQIRMTDS
RGQAYYQRLQDAGKTKRAALRCLKRRLARTVFQALRTVHQPSSEHTQPAA
ACHRSYCSRSCLSG
>Rv1179c HYPOTHETICAL PROTEIN
MDPHRDLESRAFAGNWRVYQQQALDAFDADVAAGDNRAYLVLPPGAGKTM
IGLEAARRLGRRSLVLVPNTAVQAQWAAAWDNSFPSSDRSASKCGTERGL
ASAMNVLTYQSLAVIDAETDSTVRREVLRNRDQQALLDLLHPNGRAVIER
AATLGPWTLVLDECHHLLATWGALVSALASVLGAQTALIGLTATPATELT
AWQHTLHDELFGTADFVIPTPALVREGDLAPYQELVYLTQPTPEEQAWIG
THRARFADLMLALIDQKVGSMSLAAWLHTRIVDRATREGNQIAWSTFERA
EPDLACSGLRFAYDGLIPLPDGVRLREQHRIAPDAQDWVNVLTDFSVGHL
QQSADPRDAHALTAIKRVLPGLGYRLTSRGVRVATSPVDRLCALSESKIA
ATAHILDTEDAVLGARLRALVLCDFESMTGALPTSLKGAPVSEQSGSAQL
VAAMLAASDHRRRTPLHALLVTGQTFACPAAIEDDLIAFCAERGALVTAE
PLDAHPSLRVMRGTGGFTPRTWVALATEYFLAGRARVLVGTRSLLGEGWD
CAAVNVNIDLTSATTQAAITQMRGRAIRNDPSDGHKVADNWSVCCIATEH
PRGDADYLRLVRKHDGYYAATPQGLIESGVTHCDPSLSPYGPPVTDTHAI
TARALQRVAERAQARSWWRIGEPYEGVDVATIRVRSRQPLGVAAPRIPAS
ALTPPVPGQFSPVRLARGAVAAVSVVGASTATAVASANLGMLAGAGTAGA
IVAAGVGLVATAAAAESRRLDHAPNALEQLAAVVADALYAAGGAQRGSAA
LRLASDPEGWIRCQLDGVPTEQSLRFTAALDELLAPLAEPRYLIGRKILT
PPARPVARRLFAVRAVVGLSLPGTVAWHAVPRWFARNKDRRQHLAQAWRK
HIGPPRQLPADSPQGQAILDLFRGDNPLSVTTQLRTTWR
>Rv2191 CONSERVED HYPOTHETICAL PROTEIN
MQGPNVAAMGATGGTQLSFADLAHAQGAAWTPADEMSLRETTFVVVDLET
TGGRTTGNDATPPDAITEIGAVKVCGGAVLGEFATLVNPQHSIPPQIVRL
TGITTAMVGNAPTIDAVLPMFFEFAGDSVLVAHNAGFDIGFLRAAARRCD
ITWPQPQVLCTMRLARRVLSRDEAPSVRLAALARLFAVASNPTHRALDDA
RATVDVLHALIERVGNQGVHTYAELRSYLPNVTQAQRCKRVLAETLPHRP
GVYLFRGPSGEVLYVGTAADLRRRVSQYFNGTDRRKRMTEMVMLASSIDH
VECAHPLEAGVRELRMLSTHAPPYNRRSKFPYRWWWVALTDEAFPRLSVI
RAPRHDRVVGPFRSRSKAAETAALLARCTGLRTCTTRLTRSARHGPACPE
LEVSACPAARDVTAAQYAEAVLRAAALIGGLDNAALAAAVQQVTELAERR
RYESAARLRDHLATAIEALWHGQRLRALAALPELIAAKPDGPREGGYQLA
VIRHGQLAAAGRAPRGVPPMPVVDAIRRGAQAILPTPAPLGGALVEEIAL
IARWLAEPGVRIVGVSNDAAGLASPVRSAGPWAAWAATARSAQLAGEQLS
RGWQSDLPTEPHPSREQLFGRTGVDCRTGPPQPLLPGRQPFSTAG
>Rv2966c POSSIBLE METHYLTRANSFERASE (METHYLASE)
MTRIIGGVAGGRRIAVPPRGTRPTTDRVRESLFNIVTARRDLTGLAVLDL
YAGSGALGLEALSRGAASVLFVESDQRSAAVIARNIEALGLSGATLRRGA
VAAVVAAGTTSPVDLVLADPPYNVDSADVDAILAALGTNGWTREGTVAVV
ERATTCAPLTWPEGWRRWPQRVYGDTRLELAERLFANV
>Rv1000c CONSERVED HYPOTHETICAL PROTEIN. THOUGHT TO BE REGULATED BY Rv2720|LEXA.
MCDKLGGVAIAVQGALFEHNERRQLGDGAFIDIRSGWLTGGEELLDALLS
TVPWRAERRQMYDRVVDVPRLVSFHDLTIEDPPHPQLARMRRRLNDIYGG
ELGEPFTTAGLCYYRDGSDSVAWHGDTIGRGSTEDTMVAIVSLGATRVFA
LRPRGRGPSLRLPLAHGDLLVMGGSCQRTFEHAVPKTSAPTGPRVSIQFR
PRDVR
>Rv3427c POSSIBLE TRANSPOSASE
MSICDPALRNALRTLKLSGMLDTLDARLAQTRNGDLGHLEFLQALREDEI
ARRESAALTRRLRRAKFEAQATFEDFDFTANPKLPGAMLRDLAALRWLDA
GESVILHGPVGVGKTHVAQALVHAVARRGGDVRFAKTSRMLSDLAGGHAD
RSWGQRIREYTKPLVLILDDFAMREHTAMHADDLYELISDRAITGKPLIL
TSNRAPNNWYGLFPNPVVAESLLDRLINTSHQILMDGPSYRPRKRPGRTT
S
>Rv3381c PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2648 PROBABLE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS6110
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2168c PROBABLE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv0796 PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT IS6110
LITRFIADHQGHREGPDGLRWGVESICTQLTELGVPIAPSTYYDHINREP
SRRELRDGELKEHISRVHAANYGVYGARKVWLTLNREGIEVARCTVERLM
TKLGLSGTTRGKARRTTIADPATARPADLVQRRFGPPAPNRLWVADLTYV
STWAGFAYVAFVTDAYARRILGWRVASTMATSMVLDAIEQAIWTRQQEGV
LDLKDVIHHTDRGSQYTSIRFSERLAEAGIQPSVGAVGSSYDNALAETIN
GLYKTELIKPGKPWRSIEDVELATARWVDWFNHRRLYQYCGDVPPVELEA
AYYAQRQRPAAG
>Rv1757c PUTATIVE TRANSPOSASE
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>Rv2817c CONSERVED HYPOTHETICAL PROTEIN
MVQLYVSDSVSRISFADGRVIVWSEELGESQYPIETLDGITLFGRPTMTT
PFIVEMLKRERDIQLFTTDGHYQGRISTPDVSYAPRLRQQVHRTDDPAFC
LSLSKRIVSRKILNQQALIRAHTSGQDVAESIRTMKHSLAWVDRSGSLAE
LNGFEGNAAKAYFTALGHLVPQEFAFQGRSTRPPLDAFNSMVSLGYSLLY
KNIIGAIERHSLNAYIGFLHQDSRGHATLASDLMEVWRAPIIDDTVLRLI
ADGVVDTRAFSKNSDTGAVFATREATRSIARAFGNRIARTATYIKGDPHR
YTFQYALDLQLQSLVRVIEAGHPSRLVDIDITSEPSGA
>Rv1578c Probable phiRv1 phage protein
MPRPPKPARLKLVEGRSPGRDSGGRKVPESPKFIRQAPDAPDWLDAEALA
EWRRVAPTLERLDLLKPEDRALLSAYCETWSVYVAAVQRVRAEGLTITSP
KSGVVHRNPAVTVAETARMHLLRLASEFGLTPAAEQRLAVAPGDDGDGLN
PFAPDR
>Rv2362c CONSERVED HYPOTHETICAL PROTEIN
MRLYRDRAVVLRQHKLGEADRIVTLLTRDHGLVRAVAKGVRRTRSKFGAR
LEPFAHIEVQLHPGRNLDIVTQVVSVDAFATDIVADYGRYTCGCAILETA
ERLAGEERAPAPALHRLTVGALRAVADGQRPRDLLLDAYLLRAMGIAGWA
PALTECARCATPGPHRAFHIATGGSVCAHCRPAGSTTPPLGVVDLMSALY
DGDWEAAEAAPQSARSHVSGLVAAHLQWHLERQLKTLPLVERFYQADRSV
AERRAALIGQDIAGG
>Rv1253 deaD, PROBABLE COLD-SHOCK DEAD-BOX PROTEIN A HOMOLOG DEAD (ATP-dependent RNA helicase deaD homolog)
MAFPEYSPAASAATFADLQIHPRVLRAIGDVGYESPTAIQAATIPALMAG
SDVVGLAQTGTGKTAAFAIPMLSKIDITSKVPQALVLVPTRELALQVAEA
FGRYGAYLSQLNVLPIYGGSSYAVQLAGLRRGAQVVVGTPGRMIDHLERA
TLDLSRVDFLVLDEADEMLTMGFADDVERILSETPEYKQVALFSATMPPA
IRKLSAKYLHDPFEVTCKAKTAVAENISQSYIQVARKMDALTRVLEVEPF
EAMIVFVRTKQATEEIAEKLRARGFSAAAISGDVPQAQRERTITALRDGD
IDILVATDVAARGLDVERISHVLNYDIPHDTESYVHRIGRTGRAGRSGAA
LIFVSPRELHLLKAIEKATRQTLTEAQLPTVEDVNTQRVAKFADSITNAL
GGPGIELFRRLVEEYEREHDVPMADIAAALAVQCRGGEAFLMAPDPPLSR
RNRDQRRDRPQRPKRRPDLTTYRVAVGKRHKIGPGAIVGAIANEGGLHRS
DFGQIRIGPDFSLVELPAKLPRATLKKLAQTRISGVLIDLRPYRPPDAAR
RHNGGKPRRKHVG
>Rv1329c dinG, PROBABLE ATP-DEPENDENT HELICASE DING
MSESVSMSVPELLAIAVAALGGTRRRGQQEMAAAVAHAFETGEHLVVQAG
TGTGKSLAYLVPAIIRALCDDAPVVVSTATIALQRQLVDRDLPQLVDSLT
NALPRRPKFALLKGRRNYLCLNKIHNSVTASDHDDERPQEELFDPVAVTA
LGRDVQRLTAWASTTVSGDRDDLKPGVGDRSWSQVSVSARECLGVARCPF
GSECFSERARGAAGLADVVVTNHALLAIDAVAESAVLPEHRLLVVDEAHE
LADRVTSVAAAELTSATLGMAARRITRLVDPKVTQRLQAASATFSSAIHD
ARPGRIDCLDDEMATYLSALRDAASAARSAIDTGSDTTTASVRAEAGAVL
TEISDTASRILASFAPAIPDRSDVVWLEHEDNHESARAVLRVAPLSVAEL
LATQVFARATTVLTSATLTIGGSFDAMATAWGLTADTPWRGLDVGSPFQH
AKSGILYVAAHLPPPGRDGSGSAEQLTEIAELITAAGGRTLGLFSSMRAA
RAATEAMRERLSTPVLCQGDDSTSTLVEKFTADAATSLFGTLSLWQGVDV
PGPSLSLVLIDRIPFPRPDDPLLSARQRAVAARGGNGFMTVAASHAALLL
AQGSGRLLRRVTDRGVVAVLDSRMATARYGEFLRASLPPFWQTTNATQVR
AALRRLARADAKAH
>Rv3056 dinP, POSSIBLE DNA-DAMAGE-INDUCIBLE PROTEIN P DINP (DNA POLYMERASE V) (POL IV 2) (DNA NUCLEOTIDYLTRANSFERASE (DNA-DIRECTED))
MPTAAPRWILHVDLDQFLASVELLRHPELAGLPVIVGGNGDPTEPRKVVT
CASYEARAYGVRAGMPLRTAARRCPEATFLPSNPAAYNAASEEVVALLRD
LGYPVEVWGWDEAYLAVAPGTPDDPIEVAEEIRKVILSQTGLSCSIGISD
NKQRAKIATGLAKPAGIYQLTDANWMAIMGDRTVEALWGVGPKTTKRLAK
LGINTVYQLAHTDSGLLMSTFGPRTALWLLLAKGGGDTEVSAQAWVPRSR
SHAVTFPRDLTCRSEMESAVTELAQRTLNEVVASSRTVTRVAVTVRTATF
YTRTKIRKLQAPSTDPDVITAAARHVLDLFELDRPVRLLGVRLELA
>Rv1537 dinX, PROBABLE DNA POLYMERASE IV DINX (POL IV 1) (DNA NUCLEOTIDYLTRANSFERASE (DNA-DIRECTED))
MLHLDMDAFFASVEQLTRPTLRGRPVLVGGLGGRGVVAGASYEARAYGAR
SAMPMHQARRLIGVTAVVLPPRGVVYGIASRRVFDTVRGLVPVVEQLSFD
EAFAEPPQLAGAVAEDVETFCERLRRRVRDETGLIASVGAGSGKQIAKIA
SGLAKPDGIRVVRHAEEQALLSGLPVRRLWGIGPVAEEKLHRLGIETIGQ
LAALSDAEAANILGATIGPALHRLARGIDDRPVVERAEAKQISAESTFAV
DLTTMEQLHEAIDSIAEHAHQRLLRDGRGARTITVKLKKSDMSTLTRSAT
MPYPTTDAGALFTVARRLLPDPLQIGPIRLLGVGFSGLSDIRQESLFADS
DLTQETAAAHYVETPGAVVPAAHDATMWRVGDDVAHPELGHGWVQGAGHG
VVTVRFETRGSGPGSARTFPVDTGDISNASPLDSLDWPDYIGQLSVEGSA
GASAPTVDDVGDR
>Rv0001 dnaA, CHROMOSOMAL REPLICATION INITIATOR PROTEIN DNAA
MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQRAWLNL
VQPLTIVEGFALLSVPSSFVQNEIERHLRAPITDALSRRLGHQIQLGVRI
APPATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSY
FTERPHNTDSATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARA
YNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTEEFTNDFINSL
RDDRKVAFKRSYRDVDVLLVDDIQFIEGKEGIQEEFFHTFNTLHNANKQI
VISSDRPPKQLATLEDRLRTRFEWGLITDVQPPELETRIAILRKKAQMER
LAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDKALAEIVLR
DLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYL
CRELTDLSLPKIGQAFGRDHTTVMYAQRKILSEMAERREVFDHVKELTTR
IRQRSKR
>Rv0058 dnaB, PROBABLE REPLICATIVE DNA HELICASE DNAB
MAVVDDLAPGMDSSPPSEDYGRQPPQDLAAEQSVLGGMLLSKDAIADVLE
RLRPGDFYRPAHQNVYDAILDLYGRGEPADAVTVAAELDRRGLLRRIGGA
PYLHTLISTVPTAANAGYYASIVAEKALLRRLVEAGTRVVQYGYAGAEGA
DVAEVVDRAQAEIYDVADRRLSEDFVALEDLLQPTMDEIDAIASSGGLAR
GVATGFTELDEVTNGLHPGQMVIVAARPGVGKSTLGLDFMRSCSIRHRMA
SVIFSLEMSKSEIVMRLLSAEAKIKLSDMRSGRMSDDDWTRLARRMSEIS
EAPLFIDDSPNLTMMEIRAKARRLRQKANLKLIVVDYLQLMTSGKKYESR
QVEVSEFSRHLKLLAKELEVPVVAISQLNRGPEQRTDKKPMLADLRESGC
LTASTRILRADTGAEVAFGELMRSGERPMVWSLDERLRMVARPMINVFPS
GRKEVFRLRLASGREVEATGSHPFMKFEGWTPLAQLKVGDRIAAPRRVPE
PIDTQRMPESELISLARMIGDGSCLKNQPIRYEPVDEANLAAVTVSAAHS
DRAAIRDDYLAARVPSLRPARQRLPRGRCTPIAAWLAGLGLFTKRSHEKC
VPEAVFRAPNDQVALFLRHLWSAGGSVRWDPTNGQGRVYYGSTSRRLIDD
VAQLLLRVGIFSWITHAPKLGGHDSWRLHIHGAKDQVRFLRHVGVHGAEA
VAAQEMLRQLKGPVRNPNLDSAPKKVWAQVRNRLSAKQMMDIQLHEPTMW
KHSPSRSRPHRAEARIEDRAIHELARGDAYWDTVVEITSIGDQHVFDGTV
SGTHNFVANGISLHNSLEQDADVVILLHRPDAFDRDDPRGGEADFILAKH
RNGPTKTVTVAHQLHLSRFANMAR
>Rv1547 dnaE1, PROBABLE DNA POLYMERASE III (ALPHA CHAIN) DNAE1 (DNA NUCLEOTIDYLTRANSFERASE)
MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGN
MFGASEFYNSATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVS
GSGSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAE
GIIITTGCPSGEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGL
TIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEALLCVQTGKTL
SDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVW
TPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFPAGPPDGYRERAAYEID
VICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDID
PIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQV
ITFGTIKTKAALKDSARIHYGQPGFAIADRITKALPPAIMAKDIPLSGIT
DPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMS
SEPLTEAIPLWKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDA
IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRDLLRR
MQPTGFEDVVAVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPL
REILAETYGLIVYQEQIMRIAQKVASYSLARADILRKAMGKKKREVLEKE
FEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWTAY
LKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNF
ASVGQDIRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISAC
NKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLF
GSNDDGTGTADPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAH
LLAAQVDTAIPAILDGDVPNDAQVRVGGILASVNRRVNKNGMPWASAQLE
DLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRDDRIALIANDLTV
PDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTSQVHLRLISGD
RITTLALDQSLRVTPSPALMGDLKELLGPGCLGS
>Rv3370c dnaE2, PROBABLE DNA POLYMERASE III (ALPHA CHAIN) DNAE2 (DNA NUCLEOTIDYLTRANSFERASE)
MERVLNGKPRHAGVPAFDADGDVPRSRKRGAYQPPGRERVGSSVAYAELH
AHSAYSFLDGASTPEELVEEAARLGLCALALTDHDGLYGAVRFAEAAAEL
DVRTVFGAELSLGATARTERPDPPGPHLLVLARGPEGYRRLSRQLAAAHL
AGGEKGKPRYDFDALTEAAGGHWHILTGCRKGHVRQALSQGGPAAAQRAL
ADLVDRFTPSRVSIELTHHGHPLDDERNAALAGLAPRFGVGIVATTGAHF
ADPSRGRLAMAMAAIRARRSLDSAAGWLAPLGGAHLRSGEEMARLFAWCP
EAVTAAAELGERCAFGLQLIAPRLPPFDVPDGHTEDSWLRSLVMAGARER
YGPPKSAPRAYSQIEHELKVIAQLRFPGYFLVVHDITRFCRDNDILCQGR
GSAANSAVCYALGVTAVDPVANELLFERFLSPARDGPPDIDIDIESDQRE
KVIQYVYHKYGRDYAAQVANVITYRGRSAVRDMARALGFSPGQQDAWSKQ
VSHWTGQADDVDGIPEQVIDLATQIRNLPRHLGIHSGGMVICDRPIADVC
PVEWARMANRSVLQWDKDDCAAIGLVKFDLLGLGMLSALHYAKDLVAEHK
GIEVDLARLDLSEPAVYEMLARADSVGVFQVESRAQMATLPRLKPRVFYD
LVVEVALIRPGPIQGGSVHPYIRRRNGVDPVIYEHPSMAPALRKTLGVPL
FQEQLMQLAVDCAGFSAAEADQLRRAMGSKRSTERMRRLRGRFYDGMRAL
HGAPDEVIDRIYEKLEAFANFGFPESHALSFASLVFYSAWFKLHHPAAFC
AALLRAQPMGFYSPQSLVADARRHGVAVHGPCVNASLAHATCENAGTEVR
LGLGAVRYLGAELAEKLVAERTANGPFTSLPDLTSRVQLSVPQVEALATA
GALGCFGMSRREALWAAGAAATGRPDRLPGVGSSSHIPALPGMSELELAA
ADVWATGVSPDSYPTQFLRADLDAMGVLPAERLGSVSDGDRVLIAGAVTH
RQRPATAQGVTFINLEDETGMVNVLCTPGVWARHRKLAHTAPALLIRGQV
QNASGAITVVAERMGRLTLAVGARSRDFR
>Rv2343c dnaG, PROBABLE DNA PRIMASE DNAG
MSGRISDRDIAAIREGARIEDVVGDYVQLRRAGADSLKGLCPFHNEKSPS
FHVRPNHGHFHCFGCGEGGDVYAFIQKIEHVSFVEAVELLADRIGHTISY
TGAATSVQRDRGSRSRLLAANAAAAAFYAQALQSDEAAPARQYLTERSFD
AAAARKFGCGFAPSGWDSLTKHLQRKGFEFEELEAAGLSRQGRHGPMDRF
HRRLLWPIRTSAGEVVGFGARRLFDDDAMEAKYVNTPETLLYKKSSVMFG
IDLAKRDIAKGHQAVVVEGYTDVMAMHLAGVTTAVASCGTAFGGEHLAML
RRLMMDDSFFRGELIYVFDGDEAGRAAALKAFDGEQKLAGQSFVAVAPDG
MDPCDLRLKCGDAALRDLVARRTPLFEFAIRAAIAEMDLDSAEGRVAALR
RCVPMVGQIKDPTLRDEYARQLAGWVGWADVAQVIGRVRGEAKRTKHPRL
GRLGSTTIARAAQRPTAGPPTELAVRPDPRDPTLWPQREALKSALQYPAL
AGPVFDALTVEGFTHPEYAAVRAAIDTAGGTSAGLSGAQWLDMVRQQTTS
TVTSALISELGVEAIQVDDDKLPRYIAGVLARLQEVWLGRQIAEVKSKLQ
RMSPIEQGDEYHALFGDLVAMEAYRRSLLEQASGDDLTA
>Rv0002 dnaN, DNA POLYMERASE III (BETA CHAIN) DNAN (DNA NUCLEOTIDYLTRANSFERASE)
MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSGVLLTGS
DNGLTISGFDYEVSAEAQVGAEIVSPGSVLVSGRLLSDITRALPNKPVDV
HVEGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQ
VAIAAGRDDTLPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDI
EAAVLVPAKTLAEAAKAGIGGSDVRLSLGTGPGVGKDGLLGISGNGKRST
TRLLDAEFPKFRQLLPTEHTAVATMDVAELIEAIKLVALVADRGAQVRME
FADGSVRLSAGADDVGRAEEDLVVDYAGEPLTIAFNPTYLTDGLSSLRSE
RVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYVYLLMPVRL
PG
>Rv3711c dnaQ, PROBABLE DNA POLYMERASE III (EPSILON SUBUNIT) DNAQ
MSHTWGRPASHQDRGWAVIDVETSGFRPGQARIISLAVLGLDAAGRLEQS
VVSLLNPKVDPGPTHVHGLTAAMLDGQPQFADIAGEVVDVLRGRTLVAHN
VAFDYAFLAAEAEIAEAELPVDFVMCTVELARRLQLGVDNLRLETLAAHW
GVPQQRPHDAFDDVRVLTGILAAALESARELDVWLPVHPVTRRRWPNGRV
THDELRPLKAVAARMACPYLNPGRYVQGRPLVQGMRVGLAAEVKRTHEEL
VERILHAGLAYSDVVDRDTSLVVCNATAPEHGKGYHALQLGVPVMPEARF
MECIGAVVGGASVEDFTDVAPVEKQLALF
>Rv3721c dnaZX, DNA POLYMERASE III (SUBUNIT GAMMA/TAU) DNAZ/X
MALYRKYRPASFAEVVGQEHVTAPLSVALDAGRINHAYLFSGPRGCGKTS
SARILARSLNCAQGPTANPCGVCESCVSLAPNAPGSIDVVELDAASHGGV
DDTRELRDRAFYAPVQSRYRVFIVDEAHMVTTAGFNALLKIVEEPPEHLI
FIFATTEPEKVLPTIRSRTHHYPFRLLPPRTMRALLARICEQEGVVVDDA
VYPLVIRAGGGSPRDTLSVLDQLLAGAADTHVTYTRALGLLGVTDVALID
DAVDALAACDAAALFGAIESVIDGGHDPRRFATDLLERFRDLIVLQSVPD
AASRGVVDAPEDALDRMREQAARIGRATLTRYAEVVQAGLGEMRGATAPR
LLLEVVCARLLLPSASDAESALLQRVERIETRLDMSIPAPQAVPRPSAAA
AEPKHQPAREPRPVLAPTPASSEPTVAAVRSMWPTVRDKVRLRSRTTEVM
LAGATVRALEDNTLVLTHESAPLARRLSEQRNADVLAEALKDALGVNWRV
RCETGEPAAAASPVGGGANVATAKAVNPAPTANSTQRDEEEHMLAEAGRG
DPSPRRDPEEVALELLQNELGARRIDNA
>Rv0670 end, PROBABLE ENDONUCLEASE IV END (ENDODEOXYRIBONUCLEASE IV) (APURINASE)
MLIGSHVSPTDPLAAAEAEGADVVQIFLGNPQSWKAPKPRDDAAALKAAT
LPIYVHAPYLINLASANNRVRIPSRKILQETCAAAADIGAAAVIVHGGHV
ADDNDIDKGFQRWRKALDRLETEVPVYLENTAGGDHAMARRFDTIARLWD
VIGDTGIGFCLDTCHTWAAGEALTDAVDRIKAITGRIDLVHCNDSRDEAG
SGRDRHANLGSGQIDPDLLVAAVKAAGAPVICETADQGRKDDIAFLRERT
GS
>Rv0861c ercc3, PROBABLE DNA HELICASE ERCC3
MQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITPLALWNAR
AAGHDAEQVVDALVSYSRYAVPQPLLVDIVDTMARYGRLQLVKNPAHGLT
LVSLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQLLLKIGW
PAEDLAGYVDGEAHPISLHQEGWQLRDYQRLAADSFWAGGSGVVVLPCGA
GKTLVGAAAMAKAGATTLILVTNIVAARQWKRELVARTSLTENEIGEFSG
ERKEIRPVTISTYQMITRRTKGEYRHLELFDSRDWGLIIYDEVHLLPAPV
FRMTADLQSKRRLGLTATLIREDGREGDVFSLIGPKRYDAPWKDIEAQGW
IAPAECVEVRVTMTDSERMMYATAEPEERYRICSTVHTKIAVVKSILAKH
PDEQTLVIGAYLDQLDELGAELGAPVIQGSTRTSEREALFDAFRRGEVAT
LVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRILRPKADGGGAI
FYSVVARDSLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI
>Rv2924c fpg, PROBABLE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE FPG (FAPY-DNA GLYCOSYLASE)
MPELPEVEVVRRGLQAHVTGRTITEVRVHHPRAVRRHDAGPADLTARLRG
ARINGTDRRGKYLWLTLNTAGVHRPTDTALVVHLGMSGQMLLGAVPCAAH
VRISALLDDGTVLSFADQRTFGGWLLADLVTVDGSVVPVPVAHLARDPLD
PRFDCDAVVKVLRRKHSELKRQLLDQRVVSGIGNIYADEALWRAKVNGAH
VAATLRCRRLGAVLHAAADVMREALAKGGTSFDSLYVNVNGESGYFERSL
DAYGREGENCRRCGAVIRRERFMNRSSFYCPRCQPRPRK
>Rv0006 gyrA, DNA GYRASE (SUBUNIT A) GYRA (DNA TOPOISOMERASE (ATP-HYDROLYSING)) (DNA TOPOISOMERASE II) (TYPE II DNA TOPOISOMERASE)
MTDTTLPPDDSLDRIEPVDIEQEMQRSYIDYAMSVIVGRALPEVRDGLKP
VHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDSLVRMA
QPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETV
DFIPNYDGRVQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADA
VFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGTADAYKTGRGS
IRMRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRDGKLAGIS
NIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKHTQLQTSFGANMLAIVD
GVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDA
LDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQR
IIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIA
ADGDVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLK
QDDIVAHFFVCSTHDLILFFTTQGRVYRAKAYDLPEASRTARGQHVANLL
AFQPEERIAQVIQIRGYTDAPYLVLATRNGLVKKSKLTDFDSNRSGGIVA
VNLRDNDELVGAVLCSAGDDLLLVSANGQSIRFSATDEALRPMGRATSGV
QGMRFNIDDRLLSLNVVREGTYLLVATSGGYAKRTAIEEYPVQGRGGKGV
LTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQTKG
VRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN
>Rv0005 gyrB, DNA GYRASE (SUBUNIT B) GYRB (DNA TOPOISOMERASE (ATP-HYDROLYSING)) (DNA TOPOISOMERASE II) (TYPE II DNA TOPOISOMERASE)
MGKNEARRSALAPDHGTVVCDPLRRLNRMHATPEESIRIVAAQKKKAQDE
YGAASITILEGLEAVRKRPGMYIGSTGERGLHHLIWEVVDNAVDEAMAGY
ATTVNVVLLEDGGVEVADDGRGIPVATHASGIPTVDVVMTQLHAGGKFDS
DAYAISGGLHGVGVSVVNALSTRLEVEIKRDGYEWSQVYEKSEPLGLKQG
APTKKTGSTVRFWADPAVFETTEYDFETVARRLQEMAFLNKGLTINLTDE
RVTQDEVVDEVVSDVAEAPKSASERAAESTAPHKVKSRTFHYPGGLVDFV
KHINRTKNAIHSSIVDFSGKGTGHEVEIAMQWNAGYSESVHTFANTINTH
EGGTHEEGFRSALTSVVNKYAKDRKLLKDKDPNLTGDDIREGLAAVISVK
VSEPQFEGQTKTKLGNTEVKSFVQKVCNEQLTHWFEANPTDAKVVVNKAV
SSAQARIAARKARELVRRKSATDIGGLPGKLADCRSTDPRKSELYVVEGD
SAGGSAKSGRDSMFQAILPLRGKIINVEKARIDRVLKNTEVQAIITALGT
GIHDEFDIGKLRYHKIVLMADADVDGQHISTLLLTLLFRFMRPLIENGHV
FLAQPPLYKLKWQRSDPEFAYSDRERDGLLEAGLKAGKKINKEDGIQRYK
GLGEMDAKELWETTMDPSVRVLRQVTLDDAAAADELFSILMGEDVDARRS
FITRNAKDVRFLDV
>Rv2092c helY, PROBABLE ATP-DEPENDENT DNA HELICASE HELY
MTELAELDRFTAELPFSLDDFQQRACSALERGHGVLVCAPTGAGKTVVGE
FAVHLALAAGSKCFYTTPLKALSNQKHTDLTARYGRDQIGLLTGDLSVNG
NAPVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADRMRGPVWEEVI
LQLPDDVRVVSLSATVSNAEEFGGWIQTVRGDTTVVVDEHRPVPLWQHVL
VGKRMFDLFDYRIGEAEGQPQVNRELLRHIAHRREADRMADWQPRRRGSG
RPGFYRPPGRPEVIAKLDAEGLLPAITFVFSRAGCDAAVTQCLRSPLRLT
SEEERARIAEVIDHRCGDLADSDLAVLGYYEWREGLLRGLAAHHAGMLPA
FRHTVEELFTAGLVKAVFATETLALGINMPARTVVLERLVKFNGEQHMPL
TPGEYTQLTGRAGRRGIDVEGHAVVIWHPEIEPSEVAGLASTRTFPLRSS
FAPSYNMTINLVHRMGPQQAHRLLEQSFAQYQADRSVVGLVRGIERGNRI
LGEIAAELGGSDAPILEYARLRARVSELERAQARASRLQRRQAATDALAA
LRRGDIITITHGRRGGLAVVLESARDRDDPRPLVLTEHRWAGRISSADYS
GTTPVGSMTLPKRVEHRQPRVRRDLASALRSAAAGLVIPAARRVSEAGGF
HDPELESSREQLRRHPVHTSPGLEDQIRQAERYLRIERDNAQLERKVAAA
TNSLARTFDRFVGLLTEREFIDGPATDPVVTDDGRLLARIYSESDLLVAE
CLRTGAWEGLKPAELAGVVSAVVYETRGGDGQGAPFGADVPTPRLRQALT
QTSRLSTTLRADEQAHRITPSREPDDGFVRVIYRWSRTGDLAAALAAADV
NGSGSPLLAGDFVRWCRQVLDLLDQVRNAAPNPELRATAKRAIGDIRRGV
VAVDAG
>Rv2101 helZ, PROBABLE HELICASE HELZ
MLVLHGFWSNSGGMRLWAEDSDLLVKSPSQALRSARPHPFAAPADLIAGI
HPGKPATAVLLLPSLRSAPLDSPELIRLAPRPAARTDPMLLAWTVPVVDL
DPTAALAAFDQPAPDVRYGASVDYLAELAVFARELVERGRVLPQLRRDTH
GAAACWRPVLQGRDVVAMTSLVSAMPPVCRAEVGGHDPHELATSALDAMV
DAAVRAALSPMDLLPPRRGRSKRHRAVEAWLTALTCPDGRFDAEPDELDA
LAEALRPWDDVGIGTVGPARATFRLSEVETENEETPAGSLWRLEFLLQST
QDPSLLVPAEQAWNDDGSLRRWLDRPQELLLTELGRASRIFPELVPALRT
ACPSGLELDADGAYRFLSGTAAVLDEAGFGVLLPSWWDRRRKLGLVLSAY
TPVDGVVGKASKFGREQLVEFRWELAVGDDPLSEEEIAALTETKSPLIRL
RGQWVALDTEQMRRGLEFLERKPTGRKTTAEILALAASHPDDVDTPLEVT
AVRADGWLGDLLAGAAAASLQPLDPPDGFTATLRPYQQRGLAWLAFLSSL
GLGSCLADDMGLGKTVQLLALETLESVQRHQDRGVGPTLLLCPMSLVGNW
PQEAARFAPNLRVYAHHGGARLHGEALRDHLERTDLVVSTYTTATRDIDE
LAEYEWNRVVLDEAQAVKNSLSRAAKAVRRLRAAHRVALTGTPMENRLAE
LWSIMDFLNPGLLGSSERFRTRYAIPIERHGHTEPAERLRASTRPYILRR
LKTDPAIIDDLPEKIEIKQYCQLTTEQASLYQAVVADMMEKIENTEGIER
RGNVLAAMAKLKQVCNHPAQLLHDRSPVGRRSGKVIRLEEILEEILAEGD
RVLCFTQFTEFAELLVPHLAARFGRAARDIAYLHGGTPRKRRDEMVARFQ
SGDGPPIFLLSLKAGGTGLNLTAANHVVHLDRWWNPAVENQATDRAFRIG
QRRTVQVRKFICTGTLEEKIDEMIEEKKALADLVVTDGEGWLTELSTRDL
REVFALSEGAVGE
>Rv2986c hupB, PROBABLE DNA-BINDING PROTEIN HU HOMOLOG HUPB (HISTONE-LIKE PROTEIN) (HLP) (21-KDA LAMININ-2-BINDING PROTEIN)
MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVF
EQRRRAARVARNPRTGETVKVKPTSVPAFRPGAQFKAVVSGAQRLPAEGP
AVKRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAAT
KAPAKKAVKATKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPAT
KAPAKKATARRGRK
>Rv3014c ligA, PROBABLE DNA LIGASE [NAD DEPENDENT] LIGA (POLYDEOXYRIBONUCLEOTIDE SYNTHASE [NAD+])
MSSPDADQTAPEVLRQWQALAEEVREHQFRYYVRDAPIISDAEFDELLRR
LEALEEQHPELRTPDSPTQLVGGAGFATDFEPVDHLERMLSLDNAFTADE
LAAWAGRIHAEVGDAAHYLCELKIDGVALSLVYREGRLTRASTRGDGRTG
EDVTLNARTIADVPERLTPGDDYPVPEVLEVRGEVFFRLDDFQALNASLV
EEGKAPFANPRNSAAGSLRQKDPAVTARRRLRMICHGLGHVEGFRPATLH
QAYLALRAWGLPVSEHTTLATDLAGVRERIDYWGEHRHEVDHEIDGVVVK
VDEVALQRRLGSTSRAPRWAIAYKYPPEEAQTKLLDIRVNVGRTGRITPF
AFMTPVKVAGSTVGQATLHNASEIKRKGVLIGDTVVIRKAGDVIPEVLGP
VVELRDGSEREFIMPTTCPECGSPLAPEKEGDADIRCPNARGCPGQLRER
VFHVASRNGLDIEVLGYEAGVALLQAKVIADEGELFALTERDLLRTDLFR
TKAGELSANGKRLLVNLDKAKAAPLWRVLVALSIRHVGPTAARALATEFG
SLDAIAAASTDQLAAVEGVGPTIAAAVTEWFAVDWHREIVDKWRAAGVRM
VDERDESVPRTLAGLTIVVTGSLTGFSRDDAKEAIVARGGKAAGSVSKKT
NYVVAGDSPGSKYDKAVELGVPILDEDGFRRLLADGPASRT
>Rv3062 ligB, PROBABLE ATP-DEPENDENT DNA LIGASE LIGB (POLYDEOXYRIBONUCLEOTIDE SYNTHASE [ATP]) (POLYNUCLEOTIDE LIGASE [ATP]) (SEALASE) (DNA REPAIR PROTEIN) (DNA JOIN
MLLHDVAITSMDVAATSSRLTKVARIAALLHRAAPDTQLVTIIVSWLSGE
LPQRHIGVGWAALRSLPPPAPQPALTVTGVDATLSKIGTLPGKGSQAQRA
ALVAELFSAATEAEQTFLLRLLGGELRQGAKGGIMADAVAQAAGLPAATV
QRAAMLGGDLAAAAAAGLSGAALDTFTLRVGRPIGPMLAQTATSVHDALE
RHGGTTIFEAKLDGARVQIHRANDQVRIYTRSLDDVTARLPEVVEATLAL
PVRDLVADGEAIALCPDNRPQRFQVTASRFGRSVDVAAARATQPLSVFFF
DILHRDGTDLLEAPTTERLAALDALVPARHRVDRLITSDPTDAANFLDAT
LAAGHEGVMAKAPAARYLAGRRGAGWLKVKPVHTLDLVVLAVEWGSGRRR
GKLSNIHLGARDPATGGFVMVGKTFKGMTDAMLDWQTTRFHEIAVGPTDG
YVVQLRPEQVVEVALDGVQRSSRYPGGLALRFARVVRYRADKDPAEADTI
DAVRALY
>Rv3731 ligC, POSSIBLE ATP-DEPENDENT DNA LIGASE LIGC (POLYDEOXYRIBONUCLEOTIDE SYNTHASE [ATP]) (POLYNUCLEOTIDE LIGASE [ATP]) (SEALASE) (DNA REPAIR PROTEIN) (DNA JOIN
MQLPVMPPVSPMLAKSVTAIPPDASYEPKWDGFRSICFRDGDQVELGSRN
ERPMTRYFPELVAAIRAELPHRCVIDGEIIIATDHGLDFEALQQRIHPAE
SRVRMLADRTPASFIAFDLLALGDDDYTGRPFSERRAALVDAVTGSGADA
DLSIHVTPATTDMATAQRWFSEFEGAGLDGVIAKPPHITYQPDKRVMFKI
KHLRTADCVVAGYRVHKSGSDAIGSLLLGLYQEDGQLASVGVIGAFPMAE
RRRLLTELQPLVTSFDDHPWNWAAHVAGQRTPRKNEFSRWNVGKDLSFVP
LRPERVVEVRYDRMEGARFRHTAQFNRWRPDRDPRSCSYAQLERPLTVSL
SDIVPGLR
>Rv1020 mfd, PROBABLE TRANSCRIPTION-REPAIR COUPLING FACTOR MFD (TRCF)
MTAPGPACSDTPIAGLVELALSAPTFQQLMQRAGGRPDELTLIAPASARL
LVASALARQGPLLVVTATGREADDLAAELRGVFGDAVALLPSWETLPHER
LSPGVDTVGTRLMALRRLAHPDDAQLGPPLGVVVTSVRSLLQPMTPQLGM
MEPLTLTVGDESPFDGVVARLVELAYTRVDMVGRRGEFAVRGGILDIFAP
TAEHPVRVEFWGDEITEMRMFSVADQRSIPEIDIHTLVAFACRELLLSED
VRARAAQLAARHPAAESTVTGSASDMLAKLAEGIAVDGMEAVLPVLWSDG
HALLTDQLPDGTPVLVCDPEKVRTRAADLIRTGREFLEASWSVAALGTAE
NQAPVDVEQLGGSGFVELDQVRAAAARTGHPWWTLSQLSDESAIELDVRA
APSARGHQRDIDEIFAMLRAHIATGGYAALVAPGTGTAHRVVERLSESDT
PAGMLDPGQAPKPGVVGVLQGPLRDGVIIPGANLVVITETDLTGSRVSAA
EGKRLAAKRRNIVDPLALTAGDLVVHDQHGIGRFVEMVERTVGGARREYL
VLEYASAKRGGGAKNTDKLYVPMDSLDQLSRYVGGQAPALSRLGGSDWAN
TKTKARRAVREIAGELVSLYAKRQASPGHAFSPDTPWQAELEDAFGFTET
VDQLTAIEEVKADMEKPIPMDRVICGDVGYGKTEIAVRAAFKAVQDGKQV
AVLVPTTLLADQHLQTFGERMSGFPVTIKGLSRFTDAAESRAVIDGLADG
SVDIVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVDVL
TMSATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQIAAAL
RRELLRDGQAFYVHNRVSSIDAAAARVRELVPEARVVVAHGQMPEDLLET
TVQRFWNREHDILVCTTIVETGLDISNANTLIVERADTFGLSQLHQLRGR
VGRSRERGYAYFLYPPQVPLTETAYDRLATIAQNNELGAGMAVALKDLEI
RGAGNVLGIEQSGHVAGVGFDLYVRLVGEALETYRDAYRAAADGQTVRTA
EEPKDVRIDLPVDAHLPPDYIASDRLRLEGYRRLAAASSDREVAAVVDEL
TDRYGALPEPARRLAAVARLRLLCRGSGITDVTAASAATVRLSPLTLPDS
AQVRLKRMYPGAHYRATTATVQVPIPRAGGLGAPRIRDVELVQMVADLIT
ALAGKPRQHIGITNPSPPGEDGRGRNTTIKERQP
>Rv1688 mpg, POSSIBLE 3-METHYLADENINE DNA GLYCOSYLASE MPG
MNAEELAIDPVAAAHRLLGATIAGRGVRAMVVEVEAYGGVPDGPWPDAAA
HSYRGRNGRNDVMFGPPGRLYTYRSHGIHVCANVACGPDGTAAAVLLRAA
AIEDGAELATSRRGQTVRAVALARGPGNLCAALGITMADNGIDLFDPSSP
VRLRLNDTHRARSGPRVGVSQAADRPWRLWLTGRPEVSAYRRSSRAPARG
ASD
>Rv2985 mutT1, POSSIBLE HYDROLASE MUTT1
MSIQNSSARRRSAGRIVYAAGAVLWRPGSADSEGPVEIAVIHRPRYDDWS
LPKGKVDPGETAPVGAVREILEETGHRANLGRRLLTVTYPTDSPFRGVKK
VHYWAARSTGGEFTPGSEVDELIWLPVPDAMNKLDYAQDRKVLCRFAKHP
ADTQTVLVVRHGTAGSKAHFSGDDSKRPLDKRGRAQAEALVPQLLAFGAT
DVYAADRVRCHQTMEPLAAELNVTIHNEPTLTEESYANNPKRGRHRVLQI
VEQVGTPVICTQGKVIPDLITWWCERDGVHPDKSRNRKGSTWVLSLSAGR
LVTADHIGGALAANVRA
>Rv1160 mutT2, PROBABLE MUTATOR PROTEIN MUTT (7,8-dihydro-8-oxoguanine-triphosphatase) (8-OXO-DGTPASE)
MLNQIVVAGAIVRGCTVLVAQRVRPPELAGRWELPGGKVAAGETERAALA
RELAEELGLEVADLAVGDRVGDDIALNGTTTLRAYRVHLLGGEPRARDHR
ALCWVTAAELHDVDWVPADRGWIADLARTLNGSAADVHRRC
>Rv0413 mutT3, POSSIBLE MUTATOR PROTEIN MUTT3 (7,8-DIHYDRO-8-OXOGUANINE-TRIPHOSPHATASE) (8-OXO-DGTPASE) (DGTP PYROPHOSPHOHYDROLASE)
MPSCPPAYSEQVRGDGDGWVVSDSGVAYWGRYGAAGLLLRAPRPDGTPAV
LLQHRALWSHQGGTWGLPGGARDSHETPEQTAVRESSEEAGLSAERLEVR
ATVVTAEVCGVDDTHWTYTTVVADAGELLDTVPNRESAELRWVAENEVAD
LPLHPGFAASWQRLRTAPATVPLARCDERRQRLPRTIQIEAGVFLWCTPG
DADQAPSPLGRRISSLL
>Rv3589 mutY, PROBABLE ADENINE GLYCOSYLASE MUTY
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFM
LQQTPAARVLAIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRL
HECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQRVPVVDT
NVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGA
TVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLL
DVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP
GEGF
>Rv3297 nei, PROBABLE ENDONUCLEASE VIII NEI
MPEGDTVWHTAATLRRHLAGRTLTRCDIRVPRFAAVDLTGEVVDEVISRG
KHLFIRTGTASIHSHLQMDGSWRVGNRPVRVDHRARIILEANQQEQAIRV
VGVDLGLLEVIDRHNDGAVVAHLGPDLLADDWDPQRAAANLIVAPDRPIA
EALLDQRVLAGIGNVYCNELCFVSGVLPTAPVSAVADPRRLVTRARDMLW
VNRFRWNRCTTGDTRAGRRLWVYGRAGQGCRRCGTLIAYDTTDERVRYWC
PACQR
>Rv3674c nth, PROBABLE ENDONUCLEASE III NTH (DNA-(APURINIC OR APYRIMIDINIC SITE)LYASE) (AP LYASE) (AP ENDONUCLEASE CLASS I) (ENDODEOXYRIBONUCLEASE (APURINIC OR APYR
MPGRWSAETRLALVRRARRMNRALAQAFPHVYCELDFTTPLELAVATILS
AQSTDKRVNLTTPALFARYRTARDYAQADRTELESLIRPTGFYRNKAASL
IGLGQALVERFGGEVPATMDKLVTLPGVGRKTANVILGNAFGIPGITVDT
HFGRLVRRWRWTTAEDPVKVEQAVGELIERKEWTLLSHRVIFHGRRVCHA
RRPACGVCVLAKDCPSFGLGPTEPLLAAPLVQGPETDHLLALAGL
>Rv3199c nudC, PROBABLE NADH PYROPHOSPHATASE NUDC (NAD+ DIPHOSPHATASE) (NAD+ PYROPHOSPHATASE) (NADP PYROPHOSPHATASE)
MTNVSGVDFQLRSVPLLSRVGADRADRLRTDMEAAAAGWPGAALLRVDSR
NRVLVANGRVLLGAAIELADKPPPEAVFLGRVEGGRHVWAVRAALQPIAD
PDIPAEAVDLRGLGRIMDDTSSQLVSSASALLNWHDNARFSALDGAPTKP
ARAGWSRVNPITGHEEFPRIDPAVICLVHDGADRAVLARQAAWPERMFSL
LAGFVEAGESFEVCVAREIREEIGLTVRDVRYLGSQQWPFPRSLMVGFHA
LGDPDEEFSFSDGEIAEAAWFTRDEVRAALAAGDWSSASESKLLLPGSIS
IARVIIESWAACE
>Rv1316c ogt, PROBABLE METHYLATED-DNA--PROTEIN-CYSTEINE METHYLTRANSFERASE OGT (6-O-methylguanine-DNA methyltransferase) (O-6-methylguanine-DNA-alkyltransferase)
MIHYRTIDSPIGPLTLAGHGSVLTNLRMLEQTYEPSRTHWTPDPGAFSGA
VDQLNAYFAGELTEFDVELDLRGTDFQQRVWKALLTIPYGETRSYGEIAD
QIGAPGAARAVGLANGHNPIAIIVPCHRVIGASGKLTGYGGGINRKRALL
ELEKSRAPADLTLFD
>Rv0015c pknA, TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE A PKNA (PROTEIN KINASE A) (STPK A)
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKSEFSS
DPEFIERFRAEARTTAMLNHPGIASVHDYGESQMNGEGRTAYLVMELVNG
EPLNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILIT
PTGQVKITDFGIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDV
YSLGVVGYEAVSGKRPFAGDGALTVAMKHIKEPPPPLPPDLPPNVRELIE
ITLVKNPAMRYRSGGPFADAVAAVRAGRRPPRPSQTPPPGRAAPAAIPSG
TTARVAANSAGRTAASRRSRPATGGHRPPRRTFSSGQRALLWAAGVLGAL
AIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDASPRLNWTE
RGETRHSGLQSWVVPPTPHSRASLARYEIAQ
>Rv1746 pknF, ANCHORED-MEMBRANE SERINE/THREONINE-PROTEIN KINASE PKNF (PROTEIN KINASE F) (STPK F)
MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLRADVSAD
GEYRARFNREADAAASLWHPHIVAVHDRGEFDGQLWIDMDFVDGTDTVSL
LRDRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDS
PDRRIMLADFGIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRAD
QYALAATAFHLLTGSPPFQHANPAVVISQHLSASPPAIGDRVPELTPLDP
VFAKALAKQPKDRYQRCVDFARALGHRLGGAGDPDDTRVSQPVAVAAPAK
RSLLRTAVIVPAVLAMLLVMAVAVAVREFQRADDERAAQPARTRTTTSAG
TTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALCFPLGSTGT
TKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESP
IRVCMQQTGQTRRECREEIRRSNGWP
>Rv0410c pknG, SERINE/THREONINE-PROTEIN KINASE PKNG (PROTEIN KINASE G) (STPK G)
MAKASETERSGPGTQPADAQTATSATVRPLSTQAVFRPDFGDEDNFPHPT
LGPDTEPQDRMATTSRVRPPVRRLGGGLVEIPRAPDIDPLEALMTNPVVP
ESKRFCWNCGRPVGRSDSETKGASEGWCPYCGSPYSFLPQLNPGDIVAGQ
YEVKGCIAHGGLGWIYLALDRNVNGRPVVLKGLVHSGDAEAQAMAMAERQ
FLAEVVHPSIVQIFNFVEHTDRHGDPVGYIVMEYVGGQSLKRSKGQKLPV
AEAIAYLLEILPALSYLHSIGLVYNDLKPENIMLTEEQLKLIDLGAVSRI
NSFGYLYGTPGFQAPEIVRTGPTVATDIYTVGRTLAALTLDLPTRNGRYV
DGLPEDDPVLKTYDSYGRLLRRAIDPDPRQRFTTAEEMSAQLTGVLREVV
AQDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDGQVHAEKLTANEIV
TALSVPLVDPTDVAASVLQATVLSQPVQTLDSLRAARHGALDADGVDFSE
SVELPLMEVRALLDLGDVAKATRKLDDLAERVGWRWRLVWYRAVAELLTG
DYDSATKHFTEVLDTFPGELAPKLALAATAELAGNTDEHKFYQTVWSTND
GVISAAFGLARARSAEGDRVGAVRTLDEVPPTSRHFTTARLTSAVTLLSG
RSTSEVTEEQIRDAARRVEALPPTEPRVLQIRALVLGGALDWLKDNKAST
NHILGFPFTSHGLRLGVEASLRSLARVAPTQRHRYTLVDMANKVRPTSTF
>Rv1266c pknH, PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE H PKNH (PROTEIN KINASE H) (STPK H)
MSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLMTAE
FSKDPVFRERMKREARIAGRLQEPHVVPIHDYGEVDGQMFLEMRLVEGTD
LDSVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILITRD
DFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADIYA
LACVLHECLTGAPPYRADSAGTLVSSHLMGPIPQPSAIRPGIPKAFDAVV
ARGMAKKPEDRYASAGDLALAAHEALSDPDQDHAADILRRSQESTLPAPP
KPVPPPTMPATAMAPRQPPAPPVTPPGVQPAPKPSYTPPAQPGPAGQRPG
PTGQPSWAPNSGPMPASGPTPTPQYYQGGGWGAPPSGGPSPWAQTPRKTN
PWPLVAGAAAVVLVLVLGAIGIWIAIRPKPVQPPQPVAEERLSALLLNSS
EVNAVMGSSSMQPGKPITSMDSSPVTVSLPDCQGALYTSQDPVYAGTGYT
AINGLISSEPGDNYEHWVNQAVVAFPTADKARAFVQTSADKWKNCAGKTV
TVTNKAKTYRWTFADVKGSPPTITVIDTQEGAEGWECQRAMSVANNVVVD
VNACGYRITNQAGQIAAKIVDKVNKE
>Rv2914c pknI, PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE I PKNI (PROTEIN KINASE I) (STPK I) (PHOSPHORYLASE B KINASE KINASE) (HYDROXYALKYL-PROTEIN KINASE
MALASGVTFAGYTVVRMLGCSAMGEVYLVQHPGFPGWQALKVLSPAMAAD
DEFRRRFQRETEVAARLFHPHILEVHDRGEFDGQLWIAMDYVDGIDATQH
MADRFPAVLPVGEVLAIVTAVAGALDYAHQRGLLHRDVNPANVVLTSQSA
GDQRILLADFGIASQPSYPAPELSAGADVDGRADQYALALTAIHLFAGAP
PVDRSHTGPLQPPKLSAFRPDLARLDGVLSRALATAPADRFGSCREFADA
MNEQAGVAIADQSSGGVDASEVTAAAGEEAYVVDYPAYGWPEAVDCKEPS
ARAPAPAAPTPQRRGSMLQSAAGVLARRLDNFSTATKAPASPTRRRPRRI
LVGAVAVLLLAGLFAVGIVIGRKTNTTATEVARPPTSGSAVPSAPTTTVA
VTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLA
AATMLDDNDHTQAKTPPVRPFLMQFGEGQWKSRPETVQFPCVGPNGSPST
QATTQLLALRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLP
PAVTVPDPATIPDTPDTTSTATLTPPTTTAPGPGR
>Rv2088 pknJ, PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE J PKNJ (PROTEIN KINASE J) (STPK J)
MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKVLAAELS
RDLDFRARFVREADVAAGLDHPNIVAVHQRGQFEGRLWIAMQFVDGGNAE
DALRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGG
DERVLLSDFGIARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLY
SLGCALFRLLTGEAPFAAGAGAAVAVVAGHLHQPPPTVSDRVPGLSAAMD
AVIATAMAKDPMRRFTSAGEFAHAAAAALYGGATDGWVPPSPAPHVISQG
AVPGSPWWQHPVGSVTALATPPGHGWPPGLPPLPRRPRRYRRGVAAVAAV
MVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPPIVTRSRLP
GLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSA
YDLGTVIGFYGLTIDEPPTGTAPNQVGQLIVAFRDAATAQRHLADLASIW
RRCGGRTVTLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDH
AIAAKNNVLVDVDIMTPDTSRGQQAVIGITNYILAKIPG
>Rv2176 pknL, PROBABLE TRANSMEMBRANE SERINE/THREONINE-PROTEIN KINASE L PKNL (PROTEIN KINASE L) (STPK L)
MVEAGTRDPLESALLDSRYLVQAKIASGGTSTVYRGLDVRLDRPVALKVM
DSRYAGDEQFLTRFRLEARAVARLNNRALVAVYDQGKDGRHPFLVMELIE
GGTLRELLIERGPMPPHAVVAVLRPVLGGLAAAHRAGLVHRDVKPENILI
SDDGDVKLADFGLVRAVAAASITSTGVILGTAAYLSPEQVRDGNADPRSD
VYSVGVLVYELLTGHTPFTGDSALSIAYQRLDADVPRASAVIDGVPPQFD
ELVACATARNPADRYADAIAMGADLEAIAEELALPEFRVPAPRNSAQHRS
AALYRSRITQQGQLGAKPVHHPTRQLTRQPGDCSEPASGSEPEHEPITGQ
FAGIAIEEFIWARQHARRMVLVWVSVVLAITGLVASAAWTIGSNLSGLL
>Rv1629 polA, PROBABLE DNA POLYMERASE I POLA
MVTTASAPSEDRAKPTLMLLDGNSLAFRAFYALPAENFKTRGGLTTNAVY
GFTAMLINLLRDEAPTHIAAAFDVSRQTFRLQRYPEYKANRSSTPDEFAG
QIDITKEVLGALGITVLSEPGFEADDLIATLATQAENEGYRVLVVTGDRD
ALQLVSDDVTVLYPRKGVSELTRFTPEAVVEKYGLTPRQYPDFAALRGDP
SDNLPGIPGVGEKTAAKWIAEYGSLRSLVDNVDAVRGKVGDALRANLASV
VRNRELTDLVRDVPLAQTPDTLRLQPWDRDHIHRLFDDLEFRVLRDRLFD
TLAAAGGPEVDEGFDVRGGALAPGTVRQWLAEHAGDGRRAGLTVVGTHLP
HGGDATAMAVAAADGEGAYLDTATLTPDDDAALAAWLADPAKPKALHEAK
AAVHDLAGRGWTLEGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA
ETPQQQQLSLLDDDDTDAETIQTTILRARAVIDLADALDAELARIDSTAL
LGEMELPVQRVLAKMESAGIAVDLPMLTELQSQFGDQIRDAAEAAYGVIG
KQINLGSPKQLQVVLFDELGMPKTKRTKTGYTTDADALQSLFDKTGHPFL
QHLLAHRDVTRLKVTVDGLLQAVAADGRIHTTFNQTIAATGRLSSTEPNL
QNIPIRTDAGRRIRDAFVVGDGYAELMTADYSQIEMRIMAHLSGDEGLIE
AFNTGEDLHSFVASRAFGVPIDEVTGELRRRVKAMSYGLAYGLSAYGLSQ
QLKISTEEANEQMDAYFARFGGVRDYLRAVVERARKDGYTSTVLGRRRYL
PELDSSNRQVREAAERAALNAPIQGSAADIIKVAMIQVDKALNEAQLASR
MLLQVHDELLFEIAPGERERVEALVRDKMGGAYPLDVPLEVSVGYGRSWD
AAAH
>Rv1402 priA, PUTATIVE PRIMOSOMAL PROTEIN N' PRIA (Replication factor Y)
MLSVPHLDRDFDYLVPAEHSDDAQPGVRVRVRFHGRLVDGFVLERRSDSD
HHGKLGWLDRVVSPEPVLTTEIRRLVDAVAARYAGTRQDVLRLAVPARHA
RVEREITTAPGRPVVAPVDPSGWAAYGRGRQFLAALADSRAARAVWQALP
GELWADRFAEAAAQTVRAGRTVLAIVPDQRDLDTLWQAATALVDEHSVVA
LSAGLGPEARYRRWLAALRGSARLVIGTRSAVFAPLSELGLVMVWADADD
SLAEPRAPYPHAREVAMLRAHQARCAALIGGYARTAEAHALVRSGWAHDV
VAPRPEVRARSPRVVALDDSGYDDARDPAARTARLPSIALRAARSALQSG
APVLVQVPRRGYIPSLACGRCRAIARCRSCTGPLSLQGAGSPGAVCRWCG
RVDPTLRCVRCGSDVVRAVVVGARRTAEELGRAFPGTAVITSAGDTLVPQ
LDAGPALVVATPGAEPRAPGGYGAALLLDSWALLGRQDLRAAEDALWRWM
TAAALVRPRGAGGVVTVVAESSIPTVQSLIRWDPVGHAEAELAARTEVGL
PPSVHIAALDGPAGTVTALLEAARLPDPDRLQADLLGPVDLPPGVRRPAG
IPADAPVIRMLLRVCREQGLELAASLRRGIGVLSARQTRQTRSLVRVQID
PLHIG
>Rv2737c recA, RECA PROTEIN (RECOMBINASE A) [CONTAINS: ENDONUCLEASE PI-MTUI (MTU RECA INTEIN)].
MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTGSIALDV
ALGIGGLPRGRVIEIYGPESSGKTTVALHAVANAQAAGGVAAFIDAEHAL
DPDYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDIVVIDSVAAL
VPRAELEGEMGDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLRDKI
GVMFGSPETTTGGKALKFYASVRMDVRRVETLKDGTNAVGNRTRVKVVKN
KCLAEGTRIFDPVTGTTHRIEDVVDGRKPIHVVAAAKDGTLHARPVVSWF
DQGTRDVIGLRIAGGAIVWATPDHKVLTEYGWRAAGELRKGDRVAQPRRF
DGFGDSAPIPADHARLLGYLIGDGRDGWVGGKTPINFINVQRALIDDVTR
IAATLGCAAHPQGRISLAIAHRPGERNGVADLCQQAGIYGKLAWEKTIPN
WFFEPDIAADIVGNLLFGLFESDGWVSREQTGALRVGYTTTSEQLAHQIH
WLLLRFGVGSTVRDYDPTQKRPSIVNGRRIQSKRQVFEVRISGMDNVTAF
AESVPMWGPRGAALIQAIPEATQGRRRGSQATYLAAEMTDAVLNYLDERG
VTAQEAAAMIGVASGDPRGGMKQVLGASRLRRDRVQALADALDDKFLHDM
LAEELRYSVIREVLPTRRARTFDLEVEELHTLVAEGVVVHNCSPPFKQAE
FDILYGKGISREGSLIDMGVDQGLIRKSGAWFTYEGEQLGQGKENARNFL
VENADVADEIEKKIKEKLGIGAVVTDDPSNDGVLPAPVDF
>Rv0630c recB, PROBABLE EXONUCLEASE V (BETA CHAIN) RECB (EXODEOXYRIBONUCLEASE V BETA CHAIN) (EXODEOXYRIBONUCLEASE V POLYPEPTIDE) (CHI-SPECIFIC ENDONUCLEASE)
MDRFELLGPLPREGTTTVLEASAGTGKTFALAGLVTRYLAETAATLDEML
LITFNRAASRELRERVRGQIVEAVGALQGDAPPSGELVEHLLRGSDAERA
QKRSRLRDALANFDAATIATTHEFCGSVLKSLGVAGDNAADVELKESLTD
LVTEIVDDRYLANFGRQETDPELTYAEALALALAVVDDPCAQLRPPDPEP
GSKAAVRLRFAAEVLEELERRKGRLRAQGFNDLLIRLATALEAADSPARD
RMRERWRIVLVDEFQDTDPMQWRVLERAFSRHSALILIGDPKQAIYGFRG
GDIHTYLKAAGTADARYTLGVNWRSDRALVESLQTVLRDATLGHADIVVR
GTDAHHAGHRLASAPRPAPFRLRVVKRHTLGYDGTAHVPIEALRRHIPDD
LAADVAALLASGATFAGRPVVAADIAVIVEHHKDARACRNALAEAGIPAI
YTGDTDVFASQAAKDWLCLLEAFDAPQRSGLVRAAACTMFFGETAESLAA
EGDALTDRVAGTLREWADHARHRGVAAVFQAAQLAGMGRRVLSQRGGERD
LTDLAHIAQLLHEAAHRERLGLPGLRDWLRRQAKAGAGPPEHNRRLDSDA
AAVQIMTVFVAKGLQFPIVYLPFAFNRNVRSDDILLYHDDGTRCLYIGGK
DGGAQRRTVEGLNRVEAAHDNLRLTYVALTRAQSQVVAWWAPTFDEVNGG
LSRLLRGRRPGQSQVPDRCTPRVTDEQAWAVFAQWEAAGGPSVEESVIGA
RSSLEKPVPVPGFEVRHFHRRIDTTWRRTSYSDLVRGSEAVTVTSEPAAG
GRADEVEIAVVAAPGSGADLTSPLAALPSGASFGSLVHAVLETADPAAPD
LAAELEAQVRRHAPWWTVDVDHAQLAPELARALLPMHDTPLGPAAAALTL
RQIGVRDRLRELDFEMPLAGGDLRGRSPDVSLADVGELLASHLPGDDPLS
PYADRLGSAGLGDQPLRGYLAGSIDVVLRLPGQRYLVVDYKTNHLGDTAA
DYGFERLTEAMLHSDYPLQALLYVVVLHRFLRWRQRDYAPARHLGGVLYL
FVRGMCGAATPVTAGHPAGVFTWNPPTALVVALSDLLDRGRLQS
>Rv0631c recC, PROBABLE EXONUCLEASE V (GAMMA CHAIN) RECC (EXODEOXYRIBONUCLEASE V GAMMA CHAIN) (EXODEOXYRIBONUCLEASE V POLYPEPTIDE)
MALHLHRAERTDLLADGLGALLADPQPDPFAQELVLVAARGVERWLSQRL
SLVLGCGPGRADGVCAGIAFRNPQSLIAEITGTLDDDPWSPEALAWPLLA
VIDASLDEPWCRTLASHLGHFATTDAEAELRRGRRYSVARRLAGLFASYA
RQRPGLLAAWLDGDLGELPGDLAWQPPLWRALVTTVGADPPHVRHDKTIA
RLRDGPADLPARLSLFGHTRLACTDVQLLDALAVHHDLHLWLPHPSDELW
RALAGFQGADGLLPRRQDTSRRAAQHPLLETLGRDVRELQRALPAARATD
EFLGATTKPDTLLGWLQADIAGNAPRPAGRSLSDADRSVQVHACHGPARQ
IDVLREVLLGLLEDDPTLQPRDIVVMCPDIDTYAPLIVAGFGLGEVAGDC
HPAHRLRVRLADRALTQTNPLLSVAAELLTIAETRATASQLLNLAQAAPV
RAKFGFADDDLDTITTWVRESNIRWGFDPTHRRRYGLDTVVHNTWRFGLD
RILTGVAMSEDSQAWLDTALPLDDVGSNRVELAGRLAEFVERLHHVVGGL
SGARPLVAWLDALATGIDLLTACNDGWQRAQVQREFADVLARAGSRAAPL
LRLPDVRALLDAQLAGRPTRANFRTGTLTVCTMVPMRSVPHRVVCLVGLD
DGVFPRLSHPDGDDVLAREPMTGERDIRSEDRQLLLDAIGAATQTLVITY
TGADERTGQPRPPAVPLAELLDALDQTTSAPVRERILVTHPLQPFDRKNV
TPGALLGAKPFTFDPAALAAAQAAAGKRCPPTAFISGRLPAPPAADVTLA
DLLDFFKDPVKGFFRALDYTLPWDVDTVEDSIPVQVDALAEWTVGERMLR
DMLRGLHPDDAAHSEWRRGTLPPGRLGVRRAKEIRNRARDLAAAALAHRD
GHGQAHDVDVDLGDGRRLSGTVTPVFGGRTVSVTYSKLAPKHVLPAWIGL
VTLAAQEPGREWSALCIGRSKTRNHIARRLFVPPPDPVAVLRELVLLYDA
GRREPLPLPLKTSCAWAQARRDGQDPYPPARECWQTNRFRPGDDDAPAHV
RAWGPRAPFEVLLGKPRAGEEVAGEETRLGALAARLWLPLLAAEGSV
>Rv0629c recD, PROBABLE EXONUCLEASE V (ALPHA CHAIN) RECD (EXODEOXYRIBONUCLEASE V ALPHA CHAIN) (EXODEOXYRIBONUCLEASE V POLYPEPTIDE)
MKLTDVDFAVEASGMVRAFNQAGVLDVSDVHVAQRLCALAGESDERVALA
VAVAVRALRAGSVCVDLLSIARVAGHDDLPWPDPADWLAAVRASPLLADP
PVLHLYDDRLLYLDRYWREEEQVCADLLALLTSRRPAGVPDLRRLFPTGF
DEQRRAAEIALSQGVTVLTGGPGTGKTTTVARLLALVAEQAELAGEPRPR
IALAAPTGKAAARLAEAVRREMAKLDATDRARLGDLHAVTLHRLLGAKPG
ARFRQDRQNRLPHNVIVVDETSMVSLTLMARLAEAVRPGARLILVGDADQ
LASVEAGAVLADLVDGFSVRDDALVAQLRTSHRFGKVIGTLAEAIRAGDG
DAVLGLLRSGEERIEFVDDEDPAPRLRAVLVPHALRLREAALLGASDVAL
ATLDEHRLLCAHRDGPTGVLHWNRRVQAWLAEETGQPPWTPWYAGRPLLV
TANDYGLRVYNGDTGVVLAGPTGLRAVISGASGPLDVATGRLGDVETMHA
MTIHKSQGSQVDEVTVLMPQEDSRLLTRELLYTAVTRAKRKVRVVGSEAS
VRAAIARRAVRASGLRMRLQSTGCG
>Rv0003 recF, DNA REPLICATION AND REPAIR PROTEIN RECF (SINGLE-STRAND DNA BINDING PROTEIN)
MYVRHLGLRDFRSWACVDLELHPGRTVFVGPNGYGKTNLIEALWYSTTLG
SHRVSADLPLIRVGTDRAVISTIVVNDGRECAVDLEIATGRVNKARLNRS
SVRSTRDVVGVLRAVLFAPEDLGLVRGDPADRRRYLDDLAIVRRPAIAAV
RAEYERVLRQRTALLKSVPGARYRGDRGVFDTLEVWDSRLAEHGAELVAA
RIDLVNQLAPEVKKAYQLLAPESRSASIGYRASMDVTGPSEQSDIDRQLL
AARLLAALAARRDAELERGVCLVGPHRDDLILRLGDQPAKGFASHGEAWS
LAVALRLAAYQLLRVDGGEPVLLLDDVFAELDVMRRRALATAAESAEQVL
VTAAVLEDIPAGWDARRVHIDVRADDTGSMSVVLP
>Rv2973c recG, PROBABLE ATP-DEPENDENT DNA HELICASE RECG
MASLSDRLDRVLGATAADALDEQFGMRTVDDLLRHYPRSYVEGAARVGIG
DARPEAGEHITIVDVITDTYSFPMKKKPNRKCLRITVGGGRNKVTATFFN
ADYIMRDLTKHTKVMLSGEVGYYKGAMQLTHPAFLILDSPDGKNHGTRSL
KSIADASKAISGELVVEEFERRFFPIYPASTKVQSWDIFKCVRQVLDVLD
RVDDPLPAELRAKHGLIPEDEALRAIHLAESQSLRERARERLTFDEAVGL
QWALVARRHGELSESGPSAAWKSNGLAAELLRRLPFELTAGQREVLDVLS
DGLAANRPLNRLLQGEVGSGKTIVAVLAMLQMVDAGYQCALLAPTEVLAA
QHLRSIRDVLGPLAMGGQLGGAENATRVALLTGSMTAGQKKQVRAEIASG
QVGIVIGTHALLQEAVDFHNLGMVVVDEQHRFGVEQRDQLRAKAPAGITP
HLLVMTATPIPRTVALTVYGDLETSTLRELPLGRQPIATNVIFVKDKPAW
LDRAWRRIIEEAAAGRQAYVVAPRIDESDDTDVQGGVRPSATAEGLFSRL
RSAELAELRLALMHGRLSADDKDAAMAAFRAGEVDVLVCTTVIEVGVDVP
NATVMLVMDADRFGISQLHQLRGRIGRGEHPSVCLLASWVPPDTPAGQRL
RAVAGTMDGFALADLDLKERKEGDVLGRNQSGKAITLRLLSLAEHEEYIV
AARDFCIEAYKNPTDPALALMAARFTSTDRIEYLDKS
>Rv1696 recN, PROBABLE DNA REPAIR PROTEIN RECN (RECOMBINATION PROTEIN N)
MLTELRIESLGAISVATAEFDRGFTVLTGETGTGKTMVVTGLHLLGGARA
DATRVRSGADRAVVEGRFTTTDLDDATVAGLQAVLDSSGAERDEDGSVIA
LRSISRDGPSRAYLGGRGVPAKSLSGFTNELLTLHGQNDQLRLMRPDEQR
GALDRFAAAGEAVQRYRKLRDAWLTARRDLVDRRNRARELAQEADRLKFA
LNEIDTVDPQPGEDVALVADIARLSELDTLREAATTARATLCGTPDADAF
DRGAVDSLGRARAALQSSDDAALRGLAEQVGEALTVVVDAVAELGAYLDE
LPADASALDAKLARQAQLRTLTRKYAADIDGVLRWADEARARLAQLDVSE
EGLAALERRTGELAHELGQAAVDLSTIRRKAAKRLAKEVSAELSALAMAD
AEFTIGVTTELADHGDPVALALASGELARAGADGVDAVEFGFVAHRGMTV
LPLAKSASGGELSRVMLSLEVVLATSRKQAAGTTMVFDEIDAGVGGWAAV
QIGRRLARLARTHQVIVVTHLPQVAAYADVHLMVQRTGRDGASGVRRLTS
EDRVAELARMLAGLGDSDSGRAHARELLETAQNDELT
>Rv3715c recR, PROBABLE RECOMBINATION PROTEIN RECR
MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPSDIDRLTGVLAKVR
DGVRFCAVCGNVSDNERCRICSDIRRDASVVCIVEEPKDIQAVERTREFR
GRYHVLGGALDPLSGIGPDQLRIRELLSRIGERVDDVDVTEVIIATDPNT
EGEATATYLVRMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRR
VLA
>Rv3211 rhlE, PROBABLE ATP-DEPENDENT RNA HELICASE RHLE
MTAVKHTTESTFAKLGVRDEIVRALGEEGIKRPFAIQELTLPLALDGEDV
IGQARTGMGKTFAFGVPLLQRITSGDGTRPLTGAPRALVVVPTRELCLQV
TDDLATAGKYLTAGPDTDDAAAVRRRLSVVSIYGGRPYEPQIEALRAGAD
VVVGTPGRLLDLCQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQI
PADRQSMLFSATMPDPIITLARTFMVRPTHIRAEAPHSSAVHDATEQFVY
RAHALDKVELVSRVLQARDRGATMIFTRTKRTAQKVADELTERGFAVGAV
HGDLGQLAREKALKAFRTGGIDVLVATDVAARGIDIDDVTHVINYQCPED
EKMYVHRIGRTGRAGRTGVAVTLVDWDELPRWSMIDQALGLGSPDPAETY
SNSPHLYAELAIPATAGGTVGPARKSQGRRRDTDCDGQKTAQHARNTPRR
RRTRGGKPVTGHPGTNPISSPIVGGDATSEPGSGTASDSGSDVVSGSRSG
NGEAARRRRRRRRRPTHAQDGFAARAN
>Rv2593c ruvA, PROBABLE HOLLIDAY JUNCTION DNA HELICASE RUVA
MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEARLITAM
IVREDSMTLYGFPDGETRDLFLTLLSVSGVGPRLAMAALAVHDAPALRQV
LADGNVAALTRVPGIGKRGAERMVLELRDKVGVAATGGALSTNGHAVRSP
VVEALVGLGFAAKQAEEATDTVLAANHDATTSSALRSALSLLGKAR
>Rv2592c ruvB, PROBABLE HOLLIDAY JUNCTION DNA HELICASE RUVB
MTERSDRDVSPALTVGEGDIDVSLRPRSLREFIGQPRVREQLQLVIEGAK
NRGGTPDHILLSGPPGLGKTSLAMIIAAELGSSLRVTSGPALERAGDLAA
MLSNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIP
LEVAPFTLVGATTRSGALTGPLRDRFGFTAHMDFYEPAELERVLARSAGI
LGIELGADAGAEIARRSRGTPRIANRLLRRVRDFAEVRADGVITRDVAKA
ALEVYDVDELGLDRLDRAVLSALTRSFGGGPVGVSTLAVAVGEEAATVEE
VCEPFLVRAGMVARTPRGRVATALAWTHLGMTPPVGASQPGLFE
>Rv2594c ruvC, PROBABLE CROSSOVER JUNCTION ENDODEOXYRIBONUCLEASE RUVC (HOLLIDAY JUNCTION NUCLEASE) (HOLLIDAY JUNCTION RESOLVASE)
MRVMGVDPGLTRCGLSLIESGRGRQLTALDVDVVRTPSDAALAQRLLAIS
DAVEHWLDTHHPEVVAIERVFSQLNVTTVMGTAQAGGVIALAAAKRGVDV
HFHTPSEVKAAVTGNGSADKAQVTAMVTKILALQAKPTPADAADALALAI
CHCWRAPTIARMAEATSRAEARAAQQRHAYLAKLKAAR
>Rv0054 ssb, PROBABLE SINGLE-STRAND BINDING PROTEIN SSB (HELIX-DESTABILIZING PROTEIN)
MAGDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQTGEWKDG
EALFLRCNIWREAAENVAESLTRGARVIVSGRLKQRSFETREGEKRTVIE
VEVDEIGPSLRYATAKVNKASRSGGFGSGSRPAPAQTSSASGDDPWGSAP
ASGSFGGGDDEPPF
>Rv1210 tagA, PROBABLE DNA-3-METHYLADENINE GLYCOSYLASE I TAGA (TAG I) (3-methyladenine-DNA glycosylase I, constitutive) (DNA-3-methyladenine glycosidase I)
MSGDGLVRCPWAEVRPGPDAQLYRDYHDNEWGRPLYGRVALFERMSLEAF
QSGLSWLIILRKRENFRRAFSGFDIDKIARYTDTDVRRLLADDGIVRNRA
KIEATIANARAAADLGSSEDLSELLWSFAPPPRPRPVDGSEIPSVSTESK
AMSRELKRRGFRFVGPTTAYALMQATGMVDDHIQACWVPTERPFDQPGCP
MAAR
>Rv1008 tatD, PROBABLE DEOXYRIBONUCLEASE TATD (YJJV PROTEIN)
MVDAHTHLDACGARDADTVRSLVERAAAAGVTAVVTVADDLESARWVTRA
AEWDRRVYAAVALHPTRADALTDAARAELERLVAHPRVVAVGETGIDMYW
PGRLDGCAEPHVQREAFAWHIDLAKRTGKPLMIHNRQADRDVLDVLRAEG
APDTVILHCFSSDAAMARTCVDAGWLLSLSGTVSFRTARELREAVPLMPV
EQLLVETDAPYLTPHPHRGLANEPYCLPYTVRALAELVNRRPEEVALITT
SNARRAYGLGWMRQ
>Rv2976c ung, PROBABLE URACIL-DNA GLYCOSYLASE UNG (UDG)
MTARPLSELVERGWAAALEPVADQVAHMGQFLRAEIAAGRRYLPAGSNVL
RAFTFPFDNVRVLIVGQDPYPTPGHAVGLSFSVAPDVRPWPRSLANIFDE
YTADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEAVTE
CAIRALAARAAPLVAILWGRDASTLKPMLAAGNCVAIESPHPSPLSASRG
FFGSRPFSRANELLVGMGAEPIDWRLP
>Rv1638 uvrA, PROBABLE EXCINUCLEASE ABC (SUBUNIT A-DNA-BINDING ATPase) UVRA
MADRLIVKGAREHNLRSVDLDLPRDALIVFTGLSGSGKSSLAFDTIFAEG
QRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTNRNPRSTVG
TITEVYDYLRLLYARAGTPHCPTCGERVARQTPQQIVDQVLAMPEGTRFL
VLAPVVRTRKGEFADLFDKLNAQGYSRVRVDGVVHPLTDPPKLKKQEKHD
IEVVVDRLTVKAAAKRRLTDSVETALNLADGIVVLEFVDHELGAPHREQR
FSEKLACPNGHALAVDDLEPRSFSFNSPYGACPECSGLGIRKEVDPELVV
PDPDRTLAQGAVAPWSNGHTAEYFTRMMAGLGEALGFDVDTPWRKLPAKA
RKAILEGADEQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMSQTESEQMK
ERYEGFMRDVPCPVCAGTRLKPEILAVTLAGESKGEHGAKSIAEVCELSI
ADCADFLNALTLGPREQAIAGQVLKEIRSRLGFLLDVGLEYLSLSRAAAT
LSGGEAQRIRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLIETLTRLRD
LGNTLIVVEHDEDTIEHADWIVDIGPGAGEHGGRIVHSGPYDELLRNKDS
ITGAYLSGRESIEIPAIRRSVDPRRQLTVVGAREHNLRGIDVSFPLGVLT
SVTGVSGSGKSTLVNDILAAVLANRLNGARQVPGRHTRVTGLDYLDKLVR
VDQSPIGRTPRSNPATYTGVFDKIRTLFAATTEAKVRGYQPGRFSFNVKG
GRCEACTGDGTIKIEMNFLPDVYVPCEVCQGARYNRETLEVHYKGKTVSE
VLDMSIEEAAEFFEPIAGVHRYLRTLVDVGLGYVRLGQPAPTLSGGEAQR
VKLASELQKRSTGRTVYILDEPTTGLHFDDIRKLLNVINGLVDKGNTVIV
IEHNLDVIKTSDWIIDLGPEGGAGGGTVVAQGTPEDVAAVPASYTGKFLA
EVVGGGASAATSRSNRRRNVSA
>Rv1633 uvrB, PROBABLE EXCINUCLEASE ABC (SUBUNIT B-HELICASE) UVRB
MRAGGHFEVVSPHAPAGDQPAAIDELERRINAGERDVVLLGATGTGKSAT
TAWLIERLQRPTLVMAPNKTLAAQLANELREMLPHNAVEYFVSYYDYYQP
EAYIAQTDTYIEKDSSINDDVERLRHSATSALLSRRDVVVVASVSCIYGL
GTPQSYLDRSVELKVGEEVPRDGLLRLLVDVQYTRNDMSFTRGSFRVRGD
TVEIIPSYEELAVRIEFFGDEIEALYYLHPLTGEVIRQVDSLRIFPATHY
VAGPERMAHAVSAIEEELAERLAELESQGKLLEAQRLRMRTNYDIEMMRQ
VGFCSGIENYSRHIDGRGPGTPPATLLDYFPEDFLLVIDESHVTVPQIGG
MYEGDISRKRNLVEYGFRLPSACDNRPLTWEEFADRIGQTVYLSATPGPY
ELSQTGGEFVEQVIRPTGLVDPKVVVKPTKGQIDDLIGEIRTRADADQRV
LVTTLTKKMAEDLTDYLLEMGIRVRYLHSEVDTLRRVELLRQLRLGDYDV
LVGINLLREGLDLPEVSLVAILDADKEGFLRSSRSLIQTIGRAARNVSGE
VHMYADKITDSMREAIDETERRRAKQIAYNEANGIDPQPLRKKIADILDQ
VYREADDTAVVEVGGSGRNASRGRRAQGEPGRAVSAGVFEGRDTSAMPRA
ELADLIKDLTAQMMAAARDLQFELAARFRDEIADLKRELRGMDAAGLK
>Rv1420 uvrC, PROBABLE EXCINUCLEASE ABC (SUBUNIT C-NUCLEASE) UVRC
MPDPATYRPAPGSIPVEPGVYRFRDQHGRVIYVGKAKSLRSRLTSYFADV
ASLAPRTRQLVTTAAKVEWTVVGTEVEALQLEYTWIKEFDPRFNVRYRDD
KSYPVLAVTLGEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRV
FPARTCSAGVFKRHRQIDRPCLLGYIDKCSAPCIGRVDAAQHRQIVADFC
DFLSGKTDRFARALEQQMNAAAEQLDFERAARLRDDLSALKRAMEKQAVV
LGDGTDADVVAFADDELEAAVQVFHVRGGRVRGQRGWIVEKPGEPGDSGI
QLVEQFLTQFYGDQAALDDAADESANPVPREVLVPCLPSNAEELASWLSG
LRGSRVVLRVPRRGDKRALAETVHRNAEDALQQHKLKRASDFNARSAALQ
SIQDSLGLADAPLRIECVDVSHVQGTDVVGSLVVFEDGLPRKSDYRHFGI
REAAGQGRSDDVACIAEVTRRRFLRHLRDQSDPDLLSPERKSRRFAYPPN
LYVVDGGAPQVNAASAVIDELGVTDVAVIGLAKRLEEVWVPSEPDPIIMP
RNSEGLYLLQRVRDEAHRFAITYHRSKRSTRMTASALDSVPGLGEHRRKA
LVTHFGSIARLKEATVDEITAVPGIGVATATAVHDALRPDSSGAAR
>Rv0949 uvrD1, PROBABLE ATP-DEPENDENT DNA HELICASE II UVRD1
MSVHATDAKPPGPSPADQLLDGLNPQQRQAVVHEGSPLLIVAGAGSGKTA
VLTRRIAYLMAARGVGVGQILAITFTNKAAAEMRERVVGLVGEKARYMWV
STFHSTCVRILRNQAALIEGLNSNFSIYDADDSRRLLQMVGRDLGLDIKR
YSPRLLANAISNLKNELIDPHQALAGLTEDSDDLARAVASVYDEYQRRLR
AANALDFDDLIGETVAVLQAFPQIAQYYRRRFRHVLVDEYQDTNHAQYVL
VRELVGRDSNDGIPPGELCVVGDADQSIYAFRGATIRNIEDFERDYPDTR
TILLEQNYRSTQNILSAANSVIARNAGRREKRLWTDAGAGELIVGYVADN
EHDEARFVAEEIDALAEGSEITYNDVAVFYRTNNSSRSLEEVLIRAGIPY
KVVGGVRFYERKEIRDIVAYLRVLDNPGDAVSLRRILNTPRRGIGDRAEA
CVAVYAENTGVGFGDALVAAAQGKVPMLNTRAEKAIAGFVEMFDELRGRL
DDDLGELVEAVLERTGYRRELEASTDPQELARLDNLNELVSVAHEFSTDR
ENAAALGPDDEDVPDTGVLADFLERVSLVADADEIPEHGAGVVTLMTLHT
AKGLEFPVVFVTGWEDGMFPHMRALDNPTELSEERRLAYVGITRARQRLY
VSRAIVRSSWGQPMLNPESRFLREIPQELIDWRRTAPKPSFSAPVSGAGR
FGSARPSPTRSGASRRPLLVLQVGDRVTHDKYGLGRVEEVSGVGESAMSL
IDFGSSGRVKLMHNHAPVTKL
>Rv3198c uvrD2, PROBABLE ATP-DEPENDENT DNA HELICASE II UVRD2
MSIASDPLIAGLDDQQREAVLAPRGPVCVLAGAGTGKTRTITHRIASLVA
SGHVAAGQVLAVTFTQRAAGEMRSRLRALDAAARTGSGVGAVQALTFHAA
AYRQLRYFWSRVIADTGWQLLDSKFAVVARAASRTRLHASTDDVRDLAGE
IEWAKASLIGPEEYVTAVAAARRDPPLDAAQIAAVYSEYEALKARGDGVT
LLDFDDLLLHTAAAIENDAAVAEEFQDRYRCFVVDEYQDVTPLQQRVLSA
WLGDRDDLTVVGDANQTIYSFTGASPRFLLDFSRRFPDAAVVRLERDYRS
TPQVVSLANRVIAAARGRVAGSKLRLSGQREPGPVPSFHEHSDEPAEAAT
VAASIARLIASGTPPSEVAILYRVNAQSEVYEEALTQAGIAYQVRGGEGF
FNRQEIKQALLALQRVSERDTDAALSDVVRAVLAPLGLTAQPPVGTRARE
RWEALTALAELVDDELAQRPALQLPGLLAELRRRAEARHPPVVQGVTLAS
LHAAKGLEWDAVFLVGLADGTLPISHALAHGPNSEPVEEERRLLYVGITR
ARVHLALSWALSRSPGGRQSRKPSRFLNGIAPQTRADPVPGTSRRNRGAA
ARCRICNNELNTSAAVMLRRCETCAADVDEELLLQLKSWRLSTAKEQNVP
AYVVFTDNTLIAIAELLPTDDAALIAIPGIGARKLEQYGSDVLQLVRGRT
>Rv2894c xerC, PROBABLE INTEGRASE/RECOMBINASE XERC
MQAILDEFDEYLALQCGRSVHTRRAYLGDLRSLFAFLADRGSSLDALTLS
VLRSWLAATAGAGAARTTLARRTSAVKAFTAWAVRRGLLAGDPAARLQVP
KARRTLPAVLRQDQALRAMAAAESGAEQGDPLALRDRLIVELLYATGIRV
SELCGLDVDDIDTGHRLVRVLGKGNKQRTVPFGQPAADALHAWLVDGRRA
LVTAESGHALLLGARGRRLDVRQARTAVHQTVAAVDGAPDMGPHGLRHSA
ATHLLEGGADLRVVQELLGHSSLATTQLYTHVAVARLRAVHERAHPRA
>Rv1108c xseA, PROBABLE EXODEOXYRIBONUCLEASE VII (LARGE SUBUNIT) XSEA (EXONUCLEASE VII LARGE SUBUNIT)
MTQNSAENPFPVRAVAIRVAGWIDKLGAVWVEGQLAQITMRPDAKTVFMV
LRDPAADMSLTVTCSRDLVLSAPVKLAEGVQVVVCGKPSFYTGRGTFSLR
LSEIRAVGIGELLARIDRLRRLLDAEGLFDPRLKRPIPYLPNMIGLITGR
ASAAERDVTTVASARWPAARFAVRNVAVQGPNAVGQIVEALRELDRDPDV
DVIVLARGGGSVEDLLPFSDETLCRAIAACRTPVVSAVGHEPDNPLCDLV
VDLRAATPTDAAKKVVPDTAAEQRLIDDLRRRSAQALRNWVSREQRAVAQ
LRSRPVLADPMTMVSVRAEEVHRARSTLRRNLTLMVAAETERIGHLAARL
ATLGPAATLARGYAIVQTVAQTGPEGGSEPQVLRSVHDAPEGTKLRVRVA
DGALAAVSEGQTNGL
>Rv1107c xseB, PROBABLE EXODEOXYRIBONUCLEASE VII (SMALL SUBUNIT) XSEB (EXONUCLEASE VII SMALL SUBUNIT)
MVCDPNGDDTGRTHATVPVSQLGYEACRDELMEVVRLLEQGGLDLDASLR
LWERGEQLAKRCEEHLAGARQRVSDVLAGDEAQNG
>Rv0427c xthA, PROBABLE EXODEOXYRIBONUCLEASE III PROTEIN XTHA (EXONUCLEASE III) (EXO III) (AP ENDONUCLEASE VI)
MPDGTIDGGHPQRPASPRLRSPLLRLATWNVNSIRTRLDRVLDWLGRADV
DVLAMQETKCPDGQFPALPLFELGYDVAHVGFDQWNGVAIASRVGLDDVR
VGFDGQPSWSGKPEVAATTEARALGATCGGIRVWSLYVPNGRALDDPHYT
YKLDWLAALRDTAEGWLRDDPAAPIALMGDWNIAPTDDDVWSTEFFAGCT
HVSEPERKAFNAIVDAQFTDVVRPFTPGPGVYTYWDYTQLRFPKKQGMRI
DFILGSPALAARVMDAQIVREERKGKAPSDHAPVLVDLHAG