TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Mycobacterium tuberculosis CDC1551, CDC1551
Gene type: CDS

Number of genes found: 180

Free access
Sort by:

 



# Mycobacterium tuberculosis CDC1551, CDC1551

>MT3695 A/G-specific adenine glycosylase, putative
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFM
LQQTPAARVLAIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRL
HECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQRVPVVDT
NVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGA
TVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLL
DVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP
GEGF
>MT1304 serine/threonine protein kinase
MSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLMTAE
FSKDPVFRERMKREARIAGRLQEPHVVPIHDYGEVDGQMFLEMRLVEGTD
LDSVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILITRD
DFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADIYA
LACVLHECLTGAPPYRADSAGTLVSSHLMGPIPQPSAIRPGIPKAFDAVV
ARGMAKKPEDRYASAGDLALAAHEALSDPDQDHAADILRRSQESTLPAPP
KPVPPPTMPATAMAPRQPPAPPVTPPGVQPAPKPSYTPPAQPGPAGQRPG
PTGQPSWAPNSGPMPASGPTPTPQYYQGGGWGAPPSGGPSPWAQTPRKTN
PWPLVAGAAAVVLVLVLGAIGIWIAIRPKPVQPPQPVAEERLSALLLNSS
EVNAVMGSSSMQPGKPITSMDSSPVTVSLPDCQGALYTSQDPVYAGTGYT
AINGLISSEPGDNYEHWVNQAVVAFPTADKARAFVQTSADKWKNCAGKTV
TVTNKAKTYRWTFADVKGSPPTITVIDTQEGAEGWECQRAMSVANNVVVD
VNACGYQITNQAGQIAAKIVDKVNKE
>MT2888 CRISPR-associated protein, TM1792 family
MTTSYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLSRLPMIP
GTSLKGKVRTLLSRQYGADTETFYRKPNEDHAHIRRLFGDTEEYMTGRLV
FRDTKLTNKDDLEARGAKTLTEVKFENAINRVTAKANLRQMERVIPGSEF
AFSLVYEVSFGTPGEEQKASLPSSDEIIEDFNAIARGLKLLELDYLGGSG
TRGYGQVKFSNLKARAAVGALDGSLLEKLNHELAAV
>MT3935 IS1537, transposase
MMARFEVPEGWCVQAFRFTLDPTEDQARALARHFGARRKAYNWAVATLKA
DIEAWRVTGIGTVKPSLRVLRKRWNTVKDEVCVNAETGAVWWPECSKEAY
ADGIGGAVDAYWNWQNSRSGKREGKTMGFPRFKKKGRDQDRVTFTTGAMR
VEPDRRHLTLPVVGTVRTHENTRRIERLIATGRARVLAISVRRNGTRLDA
SVRVLVQRPQQPNVAQPGSRVGVDVGVRRLATVANEAGAVLEEVPNPRPL
DAALKELRYASRARSRCTKGSRRYRERTTEISRLHRRVNDVRTHHLHVLT
TRLAQTHGHIVVEGLDAAGMLRQKGLPGARARRRGLSDSALGTPRRHLSY
KTGWYGSALVVADRWFPSLSVEPTVRPGLARLVAVKRGREAAAWLPNNPE
TGCKSRDH
>MT3057 IS1538, resolvase
MNLATWAERNGVAPGTAYRWFRAGLLSVMARRVGRLILVDEPAGDAGMRS
PTAVYARVSSADQKADLDRQVARVTAWATAQQMPVDKVVTEVGSAFNEHR
RKFLSLLRDPSVHRIVVEHRDRFCRLGSKYVQAAFAAQGRELVVVDSAEV
DDDLVRDMTEILTSMCARLYGKRAAENRTKRALAAAAGEDHEAA
>MT0635 IS1536, transposase, truncated
MANVLLRTGPSGPSRLPLSMIMRRPEMPRLEIPNGWCVQAFRFTLDPTAE
QAHALARHFGARRKAYNWTVAQLKADIQAWRATGAQTAKPSLRVLRKRWN
TVKDEVCVNAETGTVWWPECSKEAYADGIAGAVDAYWNWQQRRAGKRDGK
RMGFPRFKKKGRDADRVSFTTGAMRVEPDRRHLTLPVIGCVRTHENTRRI
ERLIAKDRARVLAITVRRNGTRLDASVRVLVQRPQQPNVELPESRIGVDV
GVRRLATVATADGACCPVLVPDG
>MT1589 DNA-damage-inducible protein P, putative
MIAIAKTSSVAPNSITGVESRWVLHLDMDAFFASVEQLTRPTLRGRPVLV
GGLGGRGVVAGASYEARAYGARSAMPMHQARRLIGVTAVVLPPRGVVYGI
ASRRVFDTVRGLVPVVEQLSFDEAFAEPPQLAGAVAEDVETFCERLRRRV
RDETGLIASVGAGSGKQIAKIASGLAKPDGIRVVRHAEEQALLSGLPVRR
LWGIGPVAEEKLHRLGIETIGQLAALSDAEAANILGATIGPALHRLARGI
DDRPVVERAEAKQISAESTFAVDLTTMEQLHEAIDSIAEHAHQRLLRDGR
GARTITVKLKKSDMSTLTRSATMPYPTTDAGALFTVARRLLPDPLQIGPI
RLLGVGFSGLSDIRQESLFADSDLTQETAAAHYVETPGAVVPAAHDATMW
RVGDDVAHPELGHGWVQGAGHGVVTVRFETRGSGPGSARTFPVDTGDISN
ASPLDSLDWPDYIGQLSVEGSAGASAPTVDDVGDR
>MT1803 IS6110, transposase
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQESVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>MT0077 conserved hypothetical protein
MSSITVSVDPVDPVDPVDPVDPVDPVDAVVAAGSDGLTVARIESEIGALE
FLNELRTELKSGQFRPQPVRERKIPKPGGLGKVRRLGIPTVADRVVQAAL
KLVLEPIFETDFEPVSYGFRPARRAHDTIAEIHLFGTQEYRWVLDADIKA
CFDRIDHADLMDRVRHRIKDKRVLRLVNWQRIRHRWNWTDVRRWLTDPTG
RWHPISADGITLFNPAAVPIRRYRYRGNTIPTPWTQAV
>MT3501 hypothetical protein
MANRVIACSATARAAGVRRGLRRREAAARCPQLFIATADADRDARLFEGV
IAAVDDLVPRAELLRPGLLVLPVRGPARFFGSEQMAAERLIDAVAAAGAE
CQVGIADRLSTAVFAARAGRIVEPGGDARFLSLLSIRQLATEPSLSGPGR
DDLTDLLWRMGIRTIGQFAALSRTDVASRFGADAVAAHRFARGEPERAPC
GREPPPDLAAELACDPPIDRVDAAAFAGRSLAAELHRALMAAGVGCTRLA
IHAVTANGEERSRVWRCAEPLTEDATADRVRWQLDGWLNNRNARDRPTAA
VTLLRLQAVETVSASEGLQLPLWGGLGEQDRLRARRALVRVQGLLGPEAV
RVPVLSGGHGPAERITLTVLGLVAPEPVPQADPGQPWPGRLPDPSPAVLF
DDPVDLLDAQGNPIRVTSRGMFSADPARLRVRGRDDRLRWWAGPWPDDER
WWDPDRASGRTARAQVLLDGDPGTALLLCYRQRRWYLEGSYE
>MT3056 IS1538, transposase
MPKFEVPDGWTVQAFRFTLDPTEDQAKALARHFGARRKAYNWTVATLKAD
IQAWHASGTVTAKPSLRVLRKRWNTVKDDVCVNTETGVAWWPECSKEAYA
DGIAGAVEAYWNWQTSRAGKRAGKRVGFPRFKRKGRDQDRVSFTTGAMRV
EPDRRHLTLPVIGTVRTHENTRRIERLIKAGRARVLAISVRRNGTRLDAS
VRVLVQRPQQPKVVHPGSRVGVDVGVRRLATVATADGTAIEQVENPRPLG
AALRELRHVCRARSRCTKGSRRYRERTTQISRLHRRVNDVRTHHLHVLTT
RLAQTHGRIVVEGLDATEMLRQKGLPGARARRRGLSDAALGTPRRHLSYK
TVWYGSALVVADRWFPSSKTCHACRHVQDIGWDEQWQCDRCSVVHQRDDC
AAINLARYEETSSIVGPVGAAVKRGADRKTGPRPAGGCEARKGSSPKAAE
QPRDGVQVA
>MT3148 DNA ligase
MLLHDVAITSMDVAATSSRLTKVARIAALLHRAAPDTQLVTIIVSWLSGE
LPQRHIGVGWAALRSLPPPAPQPALTVTGVDATLSKIGTLSGKGSQAQRA
ALVAELFSAATEAEQTFLLRLLGGELRQGAKGGIMADAVAQAAGLPAATV
QRAAMLGGDLAAAAAAGLSGAALDTFTLRVGRPIGPMLAQTATSVHDALE
RHGGTTIFEAKLDGARVQIHRANDQVRIYTRSLDDVTARLPEVVEATLAL
PVRDLVADGEAIALCPDNRPQRFQVTASRFGRSVDVAAARATQPLSVFFF
DILHRDGTDLLEAPTTERLAALDALVPARHRVDRLITSDPTDAANFLDAT
LAAGHEGVMAKAPAARYLAGRRGAGWLKVKPVHTLDLVVLAVEWGSGRRR
GKLSNIHLGARDPATGGFVMVGKTFKGMTDAMLDWQTTRFHEIAVGPTDG
YVVQLRPEQVVEVALDGVQRSSRYPGGLALRFARVVRYRADKDPAEADTI
DAVRALY
>MT2553 single-strand DNA binding protein
MFETPLTVVGHIVNDLQRRKVGDQEVVKFRVASNSRRRTSDGGWEPGNSL
FITVNCWGRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGIRRSSLEMRA
TSVGPDLSRVIVRIEKPAYTGPSAGDLPAATGTGAAGAADAPASAADSVS
DVVVDDAITGHNPLPISA
>MT1363 conserved hypothetical protein
MSRVRLVIAQCTVDYIGRLTAHLPSARRLLLFKADGSVSVHADDRAYKPL
NWMSPPCWLTEESGGQAPVWVVENKAGEQLRITIEGIEHDSSHELGVDPG
LVKDGVEAHLQALLAEHIQLLGEGYTLVRREYMTAIGPVDLLCRDERGGS
VAVEIKRRGEIDGVEQLTRYLELLNRDSVLAPVKGVFAAQQIKPQARILA
TDRGIRCLTLDYDTMRGMDSGEYRLF
>MT0018 serine/threonine protein kinase
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKSEFSS
DPEFIERFRAEARTTAMLNHPGIASVHDYGESQMNGEGRTAYLVMELVNG
EPLNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILIT
PTGQVKITDFGIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDV
YSLGVVGYEAVSGKRPFAGDGALTVAMKHIKEPPPPLPPDLPPNVRELIE
ITLVKNPAMRYRSGGPFADAVAAVRAGRRPPRPSQTPPPGRAAPAAIPSG
TTARVAANSAGRTAASRRSRPATGGHRPPRRTFSSGQRALLWAAGVLGAL
AIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDASPRLNWTE
RGETRHSGLQSWVVPPTPHSRASLARYEIAQ
>MT0873 IS1606', transposase
MTRDPHSPDCGREGSYRDTITRPLTDLPVAGYPLVPRVASPRYRCTTPQC
GRAVFNQDLANVDQYLVVNQLAHQLIDGSSLIPDADKRWDARRHADMTHH
LTSSLKENQS
>MT2985 RNA helicase, putative
MLHFTAATSRFRLGRERANSVRSDGGWGVLQPVSATFNPPLRGWQRRALV
QYLGTQPRDFLAVATPGSGKTSFALRIAAELLRYHTVEQVTVVVPTEHLK
VQWAHAAAAHGLSLDPKFANSNPQTSPEYHGVMVTYAQVASHPTLHRVRT
EARKTLVVFDEIHHGGDAKTWGDAIREAFGDATRRLALTGTPFRSDDSPI
PFVSYQPDADGVLRSQADHTYGYAEALADGVVRPVVFLAYSGQARWRDSA
GEEYEARLGEPLSAEQTARAWRTALDPEGEWMPAVITAADRRLRQLRAHV
PDAGGMIIASDRTTARAYARLLTTMTAEEPTVVLSDDPGSSARITEFAQG
TGRWLVAVRMVSEGVDVPRLSVGVYATNASTPLFFAQAIGRFVRSRRPGE
TASIFVPSVPNLLQLASALEVQRNHVLGRPHRESAHDPLDGDPATRTQTE
RGGAERGFTALGADAELDQVIFDGSSFGTATPTGSDEEADYLGIPGLLDA
EQMRALLHRRQDEQLRKRAQLQKGATQPATSGASASVHGQLRDLRRELHT
LVSIAHHRTGKPHGWIHDELRRRCGGPPIAAATRAQIKARIDALRQLNSE
RS
>MT3107 IS1081, transposase
MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR
ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE
RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA
FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG
IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA
WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA
LTDKLPAVAEHLDTARTDLLAFTAFPKKIWRQIWSNNPQERLNREVRRRT
DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE
EPAKQQTTNTPALTT
>MT0414 IS6110, hypothetical protein
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>MT3291 ATP-dependent DNA helicase
MSIASDPLIAGLDDQQREAVLAPRGPVCVLAGAGTGKTRTITHRIASLVA
SGHVAAGQVLAVTFTQRAAGEMRSRLRALDAAARTGSGVGAVQALTFHAA
AYRQLRYFWSRVIADTGWQLLDSKFAVVARAASRTRLHASTDDVRDLAGE
IEWAKASLIGPEEYVTAVAAARRDPPLDAAQIAAVYSEYEALKARGDGVT
LLDFDDLLLHTAAAIENDAAVAEEFQDRYRCFVVDEYQDVTPLQQRVLSA
WLGDRDDLTVVGDANQTIYSFTGASPRFLLDFSRRFPDAAVVRLERDYRS
TPQVVSLANRVIAAARGRVAGSKLRLSGQREPGPVPSFHEHSDEPAEAAT
VAASIARLIASGTPPSEVAILYRVNAQSEVYEEALTQAGIAYQVRGGEGF
FNRQEIKQALLALQRVSERDTDAALSDVVRAVLAPLGLTAQPPVGTRARE
RWEALTALAELVDDELAQRPALQLPGLLAELRRRAEARHPPVVQGVTLAS
LHAAKGLEWDAVFLVGLADGTLPISHALAHGPNSEPVEEERRLLYVGITR
ARVHLALSWALSRSPGGRQSRKPSRFLNGIAPQTRADPVPGTSRRNRGAA
ARCRICNNELNTSAAVMLRRCETCAADVDEELLLQLKSWRLSTAKEQNVP
AYVVFTDNTLIAIAELLPTDDAALIAIPGIGARKLEQYGSDVLQLVRGRT
>MT2488 comE operon protein 1, putative
MIAALAVLVTVFTLIRDRTEPVMSAKLPPVEPVSPTNPRSSASPGSPDRS
GLPVVVSVVGLVHTPGLVTLAPGARIADALQAAGGAVDGADTVGLNMARQ
LGDGEQIVVGLAPPSGQPRVLGSSVGAGTPGPAGTSGTATTGPKTAPKTA
EVLDLNTATVEQLDALPGIGPVTAAAIVAWRQRNGRFTSVDQLADVDGIG
PARLDKRRNLVRV
>MT2877 IS1555', transposase, truncation
MLAYFDHHASNGPTEAINGRLEALCRNALGFRNLTHYRIRSLLHCGNLAQ
LIHAL
>MT0413 IS6110, transposase
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQESVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>MT3063 MutT/Nudix family protein
MSIQNSSARRRSAGRIVYAAGAVLWRPGSADSEGPVEIAVIHRPRYDDWS
LPKGKVDPGETAPVGAVREILEETGHRANLGRRLLTVTYPTDSPFRGVKK
VHYWAARSTGGEFTPGSEVDELIWLPVPDAMNKLDYAQDRKVLCRFAKHP
ADTQTVLVVRHGTAGSKAHFSGDDSKRPLDKRGRAQAEALVPQLLAFGAT
DVYAADRVRCHQTMEPLAAELNVTIHNEPTLTEESYANNPKRGRHRVLQI
VEQVGTPVICTQGKVIPDLITWWCERDGVHPDKSRNRKGSTWVLSLSAGR
LVTADHIGGALAANVRA
>MT0850 IS1605', transposase, truncation
MGQHVESGTDQVLSAQLGARGTDGKALGVGGRIIGLGPSSKTCHACRHVQ
DIGWDEKWQCDGCSITHQRDDNAAINLARYEEPPSVVGPVGAAVKRGADR
KTGPGPAGGREARKGTGHPAGEQPRDGVLVA
>MT2735 integrase
MRVYCAGQGTTVTQTGKRQRRKFGRIRQFNSGRWQASYTGPDGRVYIAPK
TFNAKIDAEAWLTDRRREIDRQLWSPASGQEDRPGAPFGEYAEGWLKQRG
IKDRTRAHYRKLLDNHILATFADTDLRDITPAAVRRWYATTAVGTPTMRA
HSYSLLRAIMQTALADDLIDSNPCRISGASTARRVHKIRPATLDELETIT
KAMPDPYQAFVLMAAWLAMRYGELTELRRKDIDLHGEVARVRRAVVRVGE
GFKVTTPKSDAGVRDISIPPHLIPAIEDHLHKHVNPGRESLLFPSVNDPN
RHLAPSALYRMFYKARKAAGRPDLRVHDLRHSGAVLAASTGATLAELMQR
LGHSTAGAALRYQHAAKGRDREIAALLSKLAENQEM
>MT3015 IS1533, OrfA
MLTVEDWAEIRRLHRAEGLPIKMIARVLGISKNTVKSALESNQQPKYERA
PQGSIVDAVEPRIRELLQAYPTMPATVIAERIGWERSIRVLSARVAELRP
VYLPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVC
AYSRWLLAMLLPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRG
GRSELTTECQAFRGTLAAKVLICRPADPEAKGLIERAHDYLERSFLPGRV
FASPADFNAQLGAWLALVNTRTRRALGCAPTDRIGADRAAMLSLPPVAPA
TGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVLVRADLERVHVFCDGEL
VADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQVQVRSLSDY
DDALGVDIDGGVA
>MT0007 conserved hypothetical protein
MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAA
TRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAV
RTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGG
AGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWS
TLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGS
SAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL
ADRD
>MT2486 conserved hypothetical protein
MSEAKPLHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGA
YELAELLSPSLFAEERIVVLGAAAEAGKDAAAVIESAAADLPAGTVLVVV
HSGGGRAKSLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRVKVDD
ETVTALLDAVGSDVRELASACSQLVADTGGAVDAAAVRRYHSGKAEVRGF
DIADKAVAGDVAGAAEALRWAMMRGEPLVVLADALAEAVHTIGRVGPQSG
DPYRLAAQLGMPPWRVQKAQKQARRWSRDTVATAMRLVAELNANVKGAVA
DADYALESAVRQVAELVADRGR
>MT2149 serine/threonine protein kinase
MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKVLAAELS
RDLDFRARFVREADVAAGLDHPNIVAVHQRGQFEGRLWIAMQFVDGGNAE
DALRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGG
DERVLLSDFGIARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLY
SLGCALFRLLTGEAPFAAGAGAAVAVVAGHLHQPPPTVSDRVPGLSAAMD
AVIATAMAKDPMRRFTSAGEFAHAAAAALYGGATDGWVPPSPAPHVISQG
AVPGSPWWQHPVGSVTALATPPGHGWPPGLPPLPRRPRRYRRGVAAVAAV
MVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPPIVTRSRLP
GLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSA
YDLGTVIGFYGLTIDEPPTGTAPNQVGQLIVAFRDAATAQRHLADLASIW
RRCGGRTVTLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDH
AIAAKNNVLVDVDIMTPDTSRGQQAVIGITNYILAKIPG
>MT3142 DNA-damage-inducible protein P, putative
MPTAAPRWILHVDLDQFLASVELLRHPELAGLPVIVGGNGDPTEPRKVVT
CASYEARAYGVRAGMPLRTAARRCPEATFLPSNPAAYNAASEEVVALLRD
LGYPVEVWGWDEAYLAVAPGTPDDPIEVAEEIRKVILSQTGLSCSIGISD
NKQRAKIATGLAKPAGIYQLTDANWMAIMGDRTVEALWGVGPKTTKRLAK
LGINTVYQLAHTDSGLLMSTFGPRTALWLLLAKGGGDTEVSAQAWVPRSR
SHAVTFPRDLTCRSEMESAVTELAQRTLNEVVASSRTVTRVAVTVRTATF
YTRTKIRKLQAPSTDPDVITAAARHVLDLFELDRPVRLLGVRLELA
>MT1191 hypothetical protein
MPNLQLVQEPAADALLNANPFALLVGMLLDQQVPMETAFAGPKKIADRMG
SFDAGDIADYDPDKFVALCSERPAIHRFPGSMAKRIQALAQIIVDRYDGD
AAALWTAGEPDGNELLRRLKGLPGFGEQKARIFLALLGKQYGVTPKGWQV
AAGEFGQPGTYLSVADIVDAGSLGQVRSHKRQRKAAAKAEGKAPT
>MT0426 MutT/nudix family protein
MPSCPPAYSEQVRGDGDGWVVSDSGVAYWGRYGAAGLLLRAPRPDGTPAV
LLQHRALWSHQGGTWGLPGGARDSHETPEQTAVRESSEEAGLSAERLEVR
ATVVTAEVCGVDDTHWTYTTVVADAGELLDTVPNRESAELRWVAENEVAD
LPLHPGFAASWQRLRTAPATVPLARCDERRQRLPRTIQIEAGVFLWCTPG
DADQAPSPLGRRISSLL
>MT2887 CRISPR-associated protein, TM1808 family
MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLG
ELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPA
AQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFR
FELDAGLWLLATGSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAP
AALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYAD
MPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES
AA
>MT3295 helicase, UvrD/Rep family
MTQTAAPARYSPAELACALGLFPPTAEQAAVIAAPPGPLVVIAGAGAGKT
ETMAARVVWLVANGYAEPGQVLGLTFTRKAAGQLLRRVRSRLARLAGIGL
GCGDPAACAPVVSTYHAFAGSLLRDYGLLLPLEPDTRLLSETELWQLAFD
VVSGYDGVLCTDKSPAAVTSIVVRLWGQLGEHLVDTRALRDTHVELERLV
HALPAGRYQRDRGPSQWLLRMLATQTQRAELVPLLDALGERMHAGKVMDF
AMQMASAARLAATSPQVGQDLRRRYRVVLLDEYQDTGHAQRVVLSSLFGG
GVDDGLALTAVGDPIQSIYGWRGASATNLPRFTTDFPLSDGTPAPVLELL
TSWRNPPQALRVANGISAEARRRSVAVRALRPRPDAPPGAVRCALLPDVQ
AEREWIADHLRMRYQRAEADGVKPPTAAVLVRRNADAAAIADTLRARGIP
AEVVGLAGLLSIPEVAEVVAMLRLVADPTAGAAAMRVLTGPRWRLGARDL
AALWRRALTLSGESPSTASPESIAMAASADADNPCLADAISDPGSAEGYS
VAGYGRIGALAGELSALRGRLGHSLPDLVAEVRRVLGVDCEVRASAPVSG
GWAGPEHLDAFADVVAGYAERASARSSEASVAGLLAYLDVAEVVENGLPP
AELTVACDRVQVLTVHAAKGLEWQVVAVAHLSRGVFPSTVSRSSWLTDPA
ELPPLLRGDRASAGAHGIPVLDTSAVADRKQLSDKISEHRRLLDRRRVDE
ERRLLYVAVTRAEDTLLVSGHHWGPTGTKPRGPSEFLCELKDIIDRSAAA
GDPCGVVEQWASAPAGDERNPLCDNAIEAVWPADPLAARRGDVERGAALV
AAAMSADLPGSTTDIDHPPRPGDAPWSTDVDALLAERAHAARGAPARGLP
NHLSVSSLVELVGDPVGARQRLMCRLPKRPDPHAWLGDAFHAWVQQFYGA
ELLFDLGDLPGAADREVGDPEELAALQRAFTASSWAARTPAAVEVPFEMP
IGDTVVRGRIDAVFVDPDGGATVVDWKTGKPPHGPAAMRQAAVQLAVYRL
AWAALRGCPTSSVRTAFYYVRSGITVVPDELPAPGELAMLLTDCAGRRSD
T
>MT2953 IS1539, transposase
MMARLKVPEGWCVQAFRFTLNPTQTQAASLARHFGARRKAFNWTVTALKA
DIKAWRADGTESAKPSLRVLRKRWNTVKDQVCVNAQTGQVWWPECSKEAY
ADGIAGAVDAYWNWQSCRAGKRAGKTVGVPRFKKKGRDADRVCFTTGAMR
VEPDRRHLTLPVIGTIRTYENTRRVERLIAKGRARVLAITVRRNGTRLDA
SVRVLVQRPQQRRVALPDSRVGVDVGVRRLATVADAEGTVLEQVPNPRPL
DAALRGLRRVSRARSRCTKGSRRYCERTTELSRLHRRVNDVRTHHLHVLT
TRLAKTHGRIVVEGLDAAGMLRQKGLPGARARRRALSDAALATPRRHLSY
KTGWYGSSLVVADRWFPSSKTCHACRHVQDIGWDEKWQCDGCSITHQRDD
NAAINLARYEEPPSVVGPVGAAVKRGADRKTGPGPAGGREARKATGHPAG
EQPRDGVQVK
>MT1446 primosomal protein N'
MLSVPHLDRDFDYLVPAEHSDDAQPGVRVRVRFHGRLVDGFVLERRSDSD
HHGKLGWLDRVVSPEPVLTTEIRRLVDAVAARYAGTRQDVLRLAVPARHA
RVEREITTAPGRPVVAPVDPSGWAAYGRGRQFLAALADSRAARAVWQALP
GELWADRFAEAAAQTVRAGRTVLAIVPDQRDLDTLWQAATALVDEHSVVA
LSAGLGPEARYRRWLAALRGSARLVIGTRSAVFAPLSELGLVMVWADADD
SLAEPRAPYPHAREVAMLRAHQARCAALIGGYARTAEAHALVRSGWAHDV
VAPRPEVRARSPRVVALDDSGYDDARDPAARTARLPSYRWRRGSALQSGA
PVLVQVPRRGYIPSLACGRCRAIARCRSCTGPLSLQGAGSPGAVCRWCGR
VDPTLRCVRCGSDVVRAVVVGARRTAEELGRAFPGTAVITSAGDTLVPQL
DAGPALVVATPGAEPRAPGGYGAALLLDSWALLGRQDLRAAEDALWRWMT
AAALVRPRGAGGVVTVVAESSIPTVQSLIRWDPVGHAEAELAARTEVGLP
PSVHIAALDGPAGTVTALLEAARLPDPDRLQADLLGPVDLPPGVRRPAGI
PADAPVIRMLLRVCREQGLELAASLRRGIGVLSARQTRQTRSLVRVQIDP
LHIG
>MT3430 IS1547, transposase
MAFAPTEMCPPTGPTSTPPQVKEATTMVVVGTDAHKYSHTFVATDEVGRQ
LGEKTVKATTAGHATAIMWAREQFGLELIWGIEDCRNMSARLERDLLAAG
QQVVRVPTKLMAQTRKSARSRGKSDPIDALAVARAVMRETDLPLATHDET
SRELKLLTDRRDVLVAQRTSAINRLRWLVHELDPERAPAARSLDAAKHQQ
ALRTWLDTQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQVAPALL
EIPGCAELTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMR
LSRSGNRQLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLK
RRLARTVFQALRTVHQPSSEHTQPAAACHRSYCSSHLGEPPRLTDMTQKT
RIQPLPPKRAGLLIRALYRIAKRRFGEVPEPFTVTAHHRRLLIANVVHEA
LLQRASRKLPPSVRELAVFWTARSIGCSWCVDFGAMLQRLDGLDVDRLTD
IDNYATSSKFSDDERAAIAYAEAMTADPHSVTDEQVADLRARFGEAGVIE
LTYQIGVENMRARMNSALGITEQGFNSGDACRVPWAAPDVPSAESR
>MT0965 conserved hypothetical protein/DNA ligase
MGSASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHIAGRPAT
RKRWPNGVDQPAFFEKQLALSAPPWLSRATVAHRSGTTTYPIIDSATGLA
WIAQQAALEVHVPQWRFVAEPGSGELNPGPATRLVFDLDPGEGVMMAQLA
EVARAVRDLLADIGLVTFPVTSGSKGLHLYTPLDEPVSSRGATVLAKRVA
QRLEQAMPALVTSTMTKSLRAGKVFVDWSQNSGSKTTIAPYSLRGRTHPT
VAAPRTWAELDDPALRQLSYDEVLTRIARDGDLLERLDADAPVADRLTRY
RRMRDASKTPEPIPTAKPVTGDGNTFVIQEHHARRPHYDFRLERDGVLVS
WAVPKNLPDNTSVNHLAIHTEDHPLEYATFEGAIPSGEYGAGKVIIWDSG
TYDTEKFHDDPHTGEVIVNLHGGRISGRYALIRTNGDRWLAHRLKNQKDQ
KVFEFDNLAPMLATHGTVAGLKASQWAFEGKWDGYRLLVEADHGAVRLRS
RSGRDVTAEYPQLRALAEDLADHHVVLDGEAVVLDSSGVPSFSQMQNRGR
DTRVEFWAFDLLYLDGRALLGTRYQDRRKLLETLANATSLTVPELLPGDG
AQAFACSRKHGWEGVIAKRRDSRYQPGRRCASWVKDKHWNTQEVVIGGWR
AGEGGRSSGVGSLLMGIPGPGGLQFAGRVGTGLSERELANLKEMLAPLHT
DESPFDVPLPARDAKGITYVKPALVAEVRYSEWTPEGRLRQSSWRGLRPD
KKPSEVVRE
>MT2862 IS1602, resolvase
MNLAVWAERNGVARVTAYRWFHAGLLPVPARKAGRLILVDDQPADRSRRA
RTAVYARVSSADQKPDLDRQVARVTAWATTEQIAVDKVVTEVGSALNGHR
RKFLALLRDPSVKRIVVEHRDRFCRFGSEYVEAALAAQGRELVVVDSAEV
DDDLVRDMTEILTSMCARLYGKRAAQNRAKRALAAAAEESEAA
>MT1353 IS1557, transposase
MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSAVLRRCG
RCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVP
WARHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADT
EKRIDRFANLRRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATL
GLFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFHVVAWA
TEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGAR
YALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGK
QALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNT
KIRLLTRIAFGFRSPQALIALAMLTLAGHRPTRPGRHNHPQISQ
>MT0970 formamidopyrimidine-DNA-glycosylase
MSSSAGVLVAGTPQPRALGPDALDVSTDDLAGLLAGNTGRIKTVITDQKV
IAGIGNAYSDEILHVAKISPFATAGKLSGAQLTCLHEAMASVLSDAVRRS
VGQGAAMLKGEKRSGLRVHARTGLPCPVCGDTVREVSFADKSFQYCPTCQ
TGGKALADRRMSRLLK
>MT2954 IS1539, resolvase
MSRILTHVPGRTVNRSYALPALVGSAAGRLSGNHSHGREAYIALPQWACS
RQPSTPPLQTPGRINALWSLRPVLPMPGRGCQLLRLGGRWLSVVCCRNGS
MNLVVWAEGNGVARVIAYRWLRVGRLPVPARRVGRVILVDEPAGQPGRWG
RTAVCARLSSADQKVDLDRQVVGVTAWATAEQIPVGKVVTEVGSALYGRR
RTFLTLLGDPTVRRIVMKRRDRLGRFGFECVQAVLAADGRELVVVDSADV
DDDVVGDITEILTSICARLYGKRAAGNRAARAVAAAARAGGHEAR
>MT0947 IS1554, transposase
MDAAQVIEPAHAGQDVDEAAVAARELSGAERALVGDLVRQARAEGVALTG
PDGLLKALTKTVLEAALQEEMTEHLGYDRHAAAGRGSGNSRNGSRNKKVI
TDACGQVEIAVPRDRNGTFEPVIVGKRKRRVTDVDRVVLSLYAKGLTTGE
IAAHFADVYGVSVSKDTISRITDRVIEEMQAWWSRPLEKVYAAVFIDAIM
VKIRDGQVRNRPVYAAIGVDLDGHKDILGMWAGEGDGESAKFWLAVLTDL
RNRGVKDIFFLVCDGLKGLPDSVSAAFPLATVQTCIIHLIRNTFRYASRK
YWDKISVDLKPIYTAASAAEARLRYEEFAEKWGKPYPAITRLWDSAWEEF
IPFLDYDVEIRRVPCSTNAIESLNARYRRAVRARGHFPNEQSALKTLYLV
TRSLDPKGTGQTKWAVRWKPALNALAITFADRMPAAEER
>MT1788 serine/threonine protein kinase
MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLRADVSAD
GEYRARFNREADAAASLWHPHIVAVHDRGEFDGQLWIDMDFVDGTDTVSL
LRDRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDS
PDRRIMLADFGIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRAD
QYALAATAFHLLTGSPPFQHANPAVVISQHLSASPPAIGDRVPELTPLDP
VFAKALAKQPKDRYQRCVDFARALGHRLGGAGDPDDTRVSQPVAVAAPAK
RSLLRTAVIVPAVLAMLLVMAVAVAVREFQRADDERAAQPARTRTTTSAG
TTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALCFPLGSTGT
TKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESP
IRVCMQQTGQTRRECREEIRRSNGWP
>MT2431 conserved hypothetical protein
MRLYRDRAVVLRQHKLGEADRIVTLLTRDHGLVRAVAKGVRRTRSKFGAR
LEPFAHIEVQLHPGRNLDIVTQVVSVDAFATDIVADYGRYTCGCAILETA
ERLAGEERAPAPALHRLTVGALRAVADGQRPRDLLLDAYLLRAMGIAGWA
PALTECARCATPGPHRAFHIATGGSVCAHCRPAGSTTPPLGVVDLMSALY
DGDWEAAEAAPQSARSHVSGLVAAHLQWHLERQLKTLPLVERFYQADRSV
AERRAALIGQDIAGG
>MT1076 IS1081, transposase
MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL
STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI
PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV
TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV
VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL
VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH
SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKKI
WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE
GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT
>MT0884 ATP-dependent DNA helicase, putative
MQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITPLALWNAR
AAGHDAEQVVDALVSYSRYAVPQPLLVDIVDTMARYGRLQLVKNPAHGLT
LVSLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQLLLKIGW
PAEDLAGYVDGEAHPISLHQEGWQLRDYQRLAADSFWAGGSGVVVLPCGA
GKTLVGAAAMAKAGATTLILVTNIVAARQWKRELVARTSLTENEIGEFSG
ERKEIRPVTISTYQMITRRTKGEYRHLELFDSRDWGLIIYDEVHLLPAPV
FRMTADLQSKRRLGLTATLIREDGREGDVFSLIGPKRYDAPWKDIEAQGW
IAPAECVEVRVTMTDSERMMYATAEPEERYRICSTVHTKIAVVKSILAKH
PDEQTLVIGAYLDQLDELGAELGAPVIQGSTRTSEREALFDAFRRGEXAT
LVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRILRPKADGGGAI
FYSVVARDSLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI
>MT0423 serine/threonine protein kinase
MADGARPSPRPAHAEVCGLMAKASETERSGPGTQPADAQTATSATVRPLS
TQAVFRPDFGDEDNFPHPTLGPDTEPQDRMATTSRVRPPVRRLGGGLVEI
PRAPDIDPLEALMTNPVVPESKRFCWNCGRPVGRSDSETKGASEGWCPYC
GSPYSFLPQLNPGDIVAGQYEVKGCIAHGGLGWIYLALDRNVNGRPVVLK
GLVHSGDAEAQAMAMAERQFLAEVVHPSIVQIFNFVEHTDRHGDPVGYIV
MEYVGGQSLKRSKGQKLPVAEAIAYLLEILPALSYLHSIGLVYNDLKPEN
IMLTEEQLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTV
GRTLAALTLDLPTRNGRYVDGLPEDDPVLKTYDSYGRLLRRAIDPDPRQR
FTTAEEMSAQLTGVLREVVAQDTGVPRPGLSTIFSPSRSTFGVDLLVAHT
DVYLDGQVHAEKLTANEIVTALSVPLVDPTDVAASVLQATVLSQPVQTLD
SLRAARHGALDADGVDFSESVELPLMEVRALLDLGDVAKATRKLDDLAER
VGWRWRLVWYRAVAELLTGDYDSATKHFTEVLDTFPGELAPKLALAATAE
LAGNTDEHKFYQTVWSTNDGVISAAFGLARARSAEGDRVGAVRTLDEVPP
TSRHFTTARLTSAVTLLSGRSTSEVTEEQIRDAARRVEALPPTEPRVLQI
RALVLGGALDWLKDNKASTNHILGFPFTSHGLRLGVEASLRSLARVAPTQ
RHRYTLVDMANKVRPTSTF
>MT3618 conserved hypothetical protein
MHGAKWVDARQAAELLYDHRRPPAGIHTWSDRVADDEIQPISGMNTTTPA
RTALDLARRYPVGKAVAAIDALARATDLKLADVEMLAERYRGSRGIRNAR
IALDLVDPGAESPRETWLRLLLIRAGFPRPQTQIPVYDEYGQLVAVIDMG
WAGIKVGVDYEGDHHRTDRRTFNKDIKRAEALTELGWTDVRVTVEDTEGG
IIWRVSAAWQRRT
>MT3905 IS1557, transposase
MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSAVLRRCG
RCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVP
WARHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADT
EKRIDRFANLRRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATL
GLFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFHVVAWA
TEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGAR
YALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGK
QALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNT
KIRLLTRIAFGFRSPQALIALAMLTLAGHRPTLPGRHNHPQISQ
>MT3573 integrase, putative
MRAAVYLRISEDRSGEQLGVARQREDCLKLCGQRKWVPVEYLDNDVSAST
GKRRPAYEQMLADITAGKIAAVVAWDLDRLHRRPIELEAFMSLADEKRLA
LATVAGDVDLATPQGRLVARLKGSVAAHETEHKKARQRRAARQKAERGHP
NWSKAFGYLPGPNGPEPDPRTAPLVKQAYADILAGASLGDVCRQWNDAGA
FTITGRPWTTTTLSKFLRKPRNAGLRAYKGARYGPVDRDAIVGKAQWSPL
VDEATFWAAQAVLDAPGRAPGRKSVRRHLLTGLAGCGKCGNHLAGSYRTD
GQVVYVCKACHGVAILADNIEPILYHIVAERLAMPDAVDLLRREIHDAAE
AETIRLELETLYGELDRLAVERAEGLLTARQVKISTDIVNAKITKLQARQ
QDQERLRVFDGIPLGTPQVAGMIAELSPDRFRAVLDVLAEVVVQPVGKSG
RIFNPERVQVNWR
>MT2373 hypothetical protein
MAPTGQAVDVAVREGAGDVGYSVERENLPADDPVRNGNRWRVIAVDTEHH
RIAARRLGDGARAAFSGDYLHEHITHGYAITVHASQGTTAHSTHAVLGDN
TSRATLYVAMTPARESNTAYLCERTAGEGARVDLAGWDLWVSGKAEAMSD
EKSASPVWCRVGARCDHRGKRSCW
>MT2730 hypothetical protein
MTHKRTKRQPAIAAGLNAPRRNRVGRQHGWPADVPSAEQRRAQRQRDLEA
IRRAYAEMVATSHEIDDDTAELALLSMHLDDEQRRLEAGMKLGWHPYHFP
DEPDSKQ
>MT0818 IS1547, transposase
MCPPTGPTSTPPQVKEATTMVVVGTDAHKYSHTFVATDEVGRQLGEKTVK
ATTAGHATAIMWAREQFGLELIWGIEDCRNMSARLERDLLAAGQQVVRVP
TKLMAQTRKSARSRGKSDPIDALAVARAVMRETDLPLATHDETSRELKLL
TDRRDVLVAQRTSAINRLRWLVHELDPERAPAARSLDAAKHQQALRTWLD
TQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQVAPALLEIPGCAE
LTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMRLSRSGNR
QLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLKRRLARTV
FQALRTVHQPSSEHTQPAAACHRSYCSRSCLSG
>MT3197 IS1081, transposase
MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL
STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI
PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV
TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV
VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL
VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH
SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKKI
WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE
GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT
>MT2179 conserved hypothetical protein
MADQPDPPTPRPALSPSRATDFKQCPLLYRFRAIDRLPEATSAAQLRGSV
VHAALEQLYGLPAGLRSPDTARSLVQRAWDQMVAAEPELAGELDPGQPTQ
LLEDARALVSGYYRLEDPTRFDPQCCEQRVEVELADGTLLRGYIDRIDVA
ATGELRVVDYKTGKAPPAARALAEFKAMFQMKFYAVALFRSRGVPPTRLR
LIYLADGQLLDYSPDRDELLRFEKTLMAIWRAIQSAGETGDFRPNPSRLC
DWCPHQQRCPAFGGTPPPYPGWPTEPAA
>MT2247 DNA polymerase III, epsilon subunit, putative
MGATGGTQLSFADLAHAQGAAWTPADEMSLRETTFVVVDLETTGGRTTGN
DATPPDAITEIGAVKVCGGAVLGEFATLVNPQHSIPPQIVRLTGITTAMV
GNAPTIDAVLPMFFEFAGDSVLVAHNAGFDIGFLRAAARRCDITWPQPQV
LCTMRLARRVLSRDEAPSVRLAALARLFAVASNPTHRALDDARATVDVLH
ALIERVGNQGVHTYAELRSYLPNVTQAQRCKRVLAETLPHRPGVYLFRGP
SGEVLYVGTAADLRRRVSQYFNGTDRRKRMTEMVMLASSIDHVECAHPLE
AGVRELRMLSTHAPPYNRRSKFPYRWWWVALTDEAFPRLSVIRAPRHDRV
VGPFRSRSKAAETAALLARCTGLRTCTTRLTRSARHGPACPELEVSACPA
ARDVTAAQYAEAVLRAAALIGGLDNAALAAAVQQVTELAERRRYESAARL
RDHLATAIEALWHGQRLRALAALPELIAAKPDGPREGGYQLAVIRHGQLA
AAGRAPRGVPPMPVVDAIRRGAQAILPTPAPLGGALVEEIALIARWLAEP
GVRIVGVSNDAAGLASPVRSAGPWAAWAATARSAQLAGEQLSRGWQSDLP
TEPHPSREQLFGRTGVDCRTGPPQPLLPGRQPFSTAG
>MT3456 IS1561', transposase
MAIDPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRRRVTWAF
HDRRGRKIDPQWANRRRLLTARERLSDKSFAKMRNRINAVDPRAQILSAW
IAKEELRTLLSTVRTGGDPHLARHHLHRFLPGASTRRSPNCSPWPPPLTS
HPRSTPSWSPASPTRASVVGEVAEMLGDIDGQCVQVEVPVPERGPAGCGG
LDGLGRAGVSATPRVCAAMTAVNVAGRCAGQQADVGPTPQHRCRGR
>MT3281 IS1603, transposase
MRQISSRYLSEEERINIADLRRSGLSIRKIADQLGRAPSTVSRELRRNSR
RDGQYRPFEAHRWAVQRRVRRHRRRIDKNPDLCELIAELLAQRWSPQQIA
RHLRRKYPDDRSMWLCHESIYQAVYQPQSRLIRPPQVKSPHRGPLRTGRT
HRRAHLRPGRRRPRFAQPMLSIHQRPFDPADRSEPGHWEGDLIVGKNQGS
AIGTLVERQTRLIRLLHLPTHDAYCLRIAITETMSDLPVTLVRSITWDQG
IEMARHIDITADLGAPVYFCDSRSPWQRASNENSNGLLRQYFPKGTSLST
YTPDHLRAVEYEINNRPRQVLGHRSPAELFTALLTSPDHQLLRR
>MT2604 hypothetical protein
MHLAHRVASSRDTPSSSATPNAVSGSASNAADRPCLVRPPTAPPWAHGPR
LRRDPTGGGSTPSIVLSRSTDRSKDGHRIVPAGARKSGVRASTGRLPSTR
KTTRSPDCRPSASRTAFGXVTCPFDVTMGSSECLLHRCRTPPVPSHSVEL
LVAANPAEDSRLPYLIRLPVGAGLVFATSDVWPRTKALYCHRLDIADWPA
DPVVVDRVELRSCSRRGAAIDVVAARARENRSQLVHTMARGRQVVFWQSP
KTRKQSRPGVRTPTARAAGIPELHIVVDAHERYPYTFADKPAKTTREALP
CGDYGLKVAGQLVAAVERKALADLTSGVLNGNLKYQLTELAALPRAAVVV
EDRYSEIFAHSFARPTAIADGLAELQIGFPNVPIVFCQTRKLAQEYTYRY
LAAALTWFVDDADATTVFEPAAAEPEPSSAELRAWAKSVGLPVSDRGRLR
PQILQAWRAAHPR
>MT3480 DNA polymerase III, alpha subunit, putative
MFDILWNVGWSNGPPSWAEMERVLNGKPRHAGVPAFDADGDVPRSRKRGA
YQPPGRERVGSSVAYAELHAHSAYSFLDGASTPEELVEEAARLGLCALAL
TDHDGLYGAVRFAEAAAELDVRTVFGAELSLGATARTERPDPPGPHLLVL
ARGPEGYRRLSRQLAAAHLAGGEKGKPRYDFDALTEAAGGHWHILTGCRK
GHVRQALSQGGPAAAQRALADLVDRFTPSRVSIELTHHGHPLDDERNAAL
AGLAPRFGVGIVATTGAHFADPSRGRLAMAMAAIRARRSLDSAAGWLAPL
GGAHLRSGEEMARLFAWCPEAVTAAAELGERCAFGLQLIAPRLPPFDVPD
GHTEDSWLRSLVMAGARERYGPPKSAPRAYSQIEHELKVIAQLRFPGYFL
VVHDITRFCRDNDILCQGRGSAANSAVCYALGVTAVDPVANELLFERFLS
PARDGPPDIDIDIESDQREKVIQYVYHKYGRDYAAQVANVITYRGRSAVR
DMARALGFSPGQQDAWSKQVSHWTGQADDVDGIPEQVIDLATQIRNLPRH
LGIHSGGMVICDRPIADVCPVEWARMANRSVLQWDKDDCAAIGLVKFDLL
GLGMLSALHYAKDLVAEHKGIEVDLARLDLSEPAVYEMLARADSVGVFQV
ESRAQMATLPRLKPRVFYDLVVEVALIRPGPIQGGSVHPYIRRRNGVDPV
IYEHPSMAPALRKTLGVPLFQEQLMQLAVDCAGFSAAEADQLRRAMGSKR
STERMRRLRGRFYDGMRALHGAPDEVIDRIYEKLEAFANFGFPESHALSF
ASLVFYSAWFKLHHPAAFCAALLRAQPMGFYSPQSLVADARRHGVAVHGP
CVNASLAHATCENAGTEVRLGLGAVRYLGAELAEKLVAERTANGPFTSLP
DLTSRVQLSVPQVEALATAGALGCFGMSRREALWAAGAAATGRPDRLPGV
GSSSHIPALPGMSELELAAADVWATGVSPDSYPTQFLRADLDAMGVLPAE
RLGSVSDGDRVLIAGAVTHRQRPATAQGVTFINLEDETGMVNVLCTPGVW
ARHRKLAHTAPALLIRGQVQNASGAITVVAERMGRLTLAVGARSRDFR
>MT1063 IS1560' protein
MQQGNPPDAPQLAPAVAWVKKRAGRTPRTVTADRGYGEAAVDQQLTEVGV
KNVLIPRKGKPSQDRRAEEHRKAFRRTIKWRTGCEGRISHLKRGYGWDRG
RIGGLEGTRTWVGHGVFAHNLVTISALPA
>MT2069 IS1607, transposase
MSIVDARGREVRRATIEHNAAGLRELLELLSRAGAREVAIERPDGPVVDT
LLEAGITVVVISPNQLKNLRGRYGSAGNKDDRFDAFVLADTLRTDRSRLR
PLLPDTPATATLRRTCRPRKDLVAHRVALANQLRAHLRVVFPGVVGLFAD
LDSPISLAFLTFLPRFDCQDRADWLSVKRLAGWLAAAGYCGRAPRPAHRC
PARRHR
>MT3742 IS1553, transposase
MALPQSALSELLDAFRTGDGVDLIRDAVRLVLQELSELEATERIGAARYE
RSDTRVTDRNGARSRVLSTQAGDVELRIPKLRKGSFFPAILEPRRRIDQA
LYAVVMEAYVHGISTRAVDDLVEAMGVETGISKSEVSRICAGLDEIVGAF
RTRTLGHIEFPYVYLDATYLNVLNGTGQVVSMAVIVASGIAADGSREILG
LDVGDSEDETFWRGFLTSLKGRGLGGVRLVISDQHAGLVKALKRCFQGAG
HQRCRVHFARNLLAHVPKDKADMVASMFRMIFSAPDAEAVHATWEGVRDR
LAASFPKIGPLMDDARAEVLAFTAFPKAHWQKIWSTNPLERINKEIKRRS
RVVGIFPNPAAVIRLVGAVLADMHDEWQASERRYLSEASMALLYPDSDNA
VVAAISGGQ
>MT2070 IS1607, transposase
MLHDRLTGAPRGATGDEGAANAHITRAMVAALTSVATQIKTLDAQIAEQL
SLHADAHIFTSLPRSGTVRAARLLAEIGDCRARFPTPESLACLAGVAPST
RQSGKVKHVGFRWAADKQLRDAVCDFAGDSRRANLWAADRYNRAIARGHD
HPHAVRILARAWLYAIWHCWQDGAAYHPANHRALQALLNQDQDRAA
>MT2655 conserved hypothetical protein
MRWARQAVAVNGMPVDDGALPGLQRIGLVRSVRAPQFDGITFHEVLCKSA
LNKVPNAAALPFRYTVNGYRGCSHACRYCFARPTHEYLDFNPGTDFDTQV
VVKTNVAAVLRHELRRPSWRRETVALGTNTDPYQRAEGRYALMPGIIGAL
AASGTPLSILTKGTLLRRDLPLIAEAAQQVPVSVAVSLAVGDPELHRDVE
SGTPTPQARLALITAIRAAGLDCHVMVAPVLPQLTDSGEHLDQLLGQIAA
AGATGVTVFGLHLRGSTRGWFMCWLARAHPELVSRYRELYRRGPYLPPSY
REMLRERVAPLIAKYRLAGDHRPAPPETEAALVPVQATLF
>MT3099 IS6110, hypothetical protein
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>MT2861 IS1602, transposase
MAKFEIPEGWMVQAFRFTLDPTAEQARALARHFGARRKAYNWTVATLKAD
IDAWQATGIQTAKPSLRVLRKRWNTVKNDVCVNIETGVVWWPECSKEAYA
DGIDGAVDAYWNWQNSRSGKRDGKRMGFPRFKKKGRDPDRVTFTTGAMRV
EPDRRHLTLPVIGTVRTHENTRRVERLIAKGRSRVLAITVRRNGTRIDAS
VRVLVQRPQQPKVTDPGSRVGVDVGVRRXATVATADGAVLERVPNPRPLD
AALNELRHVCRARSRCTKGSRRYRERTTEISRLHRRVNDVRTHHLHCLTT
HLAKTHGRIVVEGLDAAGMLRQQGLSGARARRRGLSDAALGTPRRHLSYK
TGWYGSQLVVADRWFPSSKTCHVCGHVQEIGWAEHWQCDSCSASHQRDDC
AAINLARYEDTSSVVGPVGAAVKRGADRKTRPGRAGGREARKGSSRKAAE
QPRDGVQVA
>MT2884 CRISPR-associated protein Cas1
MVQLYVSDSVSRISFADGRVIVWSEELGESQYPIETLDGITLFGRPTMTT
PFIVEMLKRERDIQLFTTDGHYQGRISTPDVSYAPRLRQQVHRTDDPAFC
LSLSKRIVSRKILNQQALIRAHTSGQDVAESIRTMKHSLAWVDRSGSLAE
LNGFEGNAAKAYFTALGHLVPQEFAFQGRSTRPPLDAFNSMVSLGYSLLY
KNIIGAIERHSLNAYIGFLHQDSRGHATLASDLMEVWRAPIIDDTVLRLI
ADGVVDTRAFSKNSDTGAVFATREATRSIARAFGNRIARTATYIKGDPHR
YTFQYALDLQLQSLVRVIEAGHPSRLVDIDITSEPSGA
>MT3494.1 IS1560, transposase
MVRNAQRAVRRASGRRKAWLRQAINHLEKLIGRTERVVDQARSRLAGVMP
DSSSRLVSLHDADARPIRKGRLGKPVEFGYKAQVVDNADGVILDHSVELG
NPADAPQLAPAIERISRRTGRPPRAVTADRGCGDASVEDDLHQLGVRNVA
IPRKSKPSATRRAFEHRRAFRDKIKWRTGSEGRINHLKRSYGWNRTELTG
ITGARTWCGHGVFAHNLVKISTLAA
>MT0767 IS1557', transposase
MFSVKGEEGKQALDRWISWARRCRIPVFVELAGGIVRHRQAIDAALDHGL
WQGLIESTNTKIRLLTRIAFGFRSPEALIALAMLALGGRRPALPGRTKHP
RISQ
>MT2883 CRISPR-associated protein Cas2
MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGF
GYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYG
RGRLVSAEEFVFF
>MT0949 IS1535, transposase
MIVRMRSCAQAAKVAEATGGVQLAGKPKPDGTPTFSRYVEIGVDFEAHRP
VVESVSVLFELYDGDANSYAATGGPGAQLPSGWMVTAAKFEVEWPADPQR
AGLVRSHFGARRKAFNWGLAQVKADLDAKAADPAHESVDWDLKSLRWAWN
RAKDDVAPWWAENSKECYSSGLADLAQGLANWKAGKNGTRKGRRVGFPRF
KSGRRDPGRVRFTTGTMRIEDDRRTITVPVIGPLRAKENTRRVQRHLVSG
RAQILNMTLSQRWGRLFVAVCYALRTPTTRSPLTQPTVRAGMDLGVRTLA
TVATLDTATGEQTIIEYPNPAPLKATLVARRRAGRELSRRIPGSHGHRAV
KAKLARLDRRCVHLRREAAHQLTTELAGTYGQVVIEDLDVAAMKRSMRRR
AFRRSVSDAAMGLVAPQLAYKTAKCSGVLTVADRWFASSQIHHGCTSPDG
TPCRLQGKGRIDKHLLCPVTGEVVDRDRNAALNLRDWPDNASRGPVGTTA
PSAPGPTTTVGTGHGADTGSSGAGGASVRPRPRRAGRGEAKTQTPQGDAA
>MT0699 AP endonuclease, family 2
MLIGSHVSPTDPLAAAEAEGADVVQIFLGNPQSWKAPKPRDDAAALKAAT
LPIYVHAPYLINLASANNRVRIPSRKILQETCAAAADIGAAAVIVHGGHV
ADDNDIDKGFQRWRKALDRLETEVPVYLENTAGGDHAMARRFDTIARLWD
VIGDTGIGFCLDTCHTWAAGEALTDAVDRIKAITGRIDLVHCNDSRDEAG
SGRDRHANLGSGQIDPDLLVAAVKAAGAPVICETADQGRKDDIAFLRERT
GS
>MT0282 conserved hypothetical protein
MAAPVSLDVHGRQVIVTHPGRVVFPAHNDRKGYTKFDLVRYYLAVAEGAM
RGVAGRPMILKRFVKGISAEAVFQKRAPANRPDWVDVAELHYASGRSAAE
AVIHDAAGLAWVINLGCVDLNPHPVLAGDLDHPDELRVDLDPMPGVAWQR
VVEVALVVREVLEDYGLTAWPKTSGSRGFHVYARIAPCWSFPQVRLAAQT
VAREVERRLPDAATSRWWKEEREGVFVDFNQNAKDRTVASAYSVRATPDA
RVSTPLHWEEVPGCDPAVFTMATVPSRLADIGDPWAGMDDAVGRLDRLLM
LAEELGPPQKAQSAKPLIEIARAKTRAEAMAALDIWRDRYPGAAALLRPA
DVLVDGMRGPSSIWYRIRINLQHVPADQRPPQEELIADYSPWPR
>MT3016 IS1533, OrfB
MSQCPGWPIAPAPRTGATKNTWPPACSGKCQPGSPMVVRAASAPHASRLG
SRWKSSTLSMLVASNATPSHIWAPWISSPPAITSCFWAPPGTGKTHLAVG
LAIRACQAGHRVLFATAAEWVARLAEAHHAGRIYAELTRLCRYPLLVVDE
VGYIPFEPEAANLFFQLVSSRYERASLIVTSNKAFGRWGEVFGGDDVVAA
AMIDRLVHHAEVVALKGDSYRLKDRDLGRVPPAGTTEE
>MT1314 exonuclease SbcD-related protein
MSPRPGPAGRGPAPCRCADLHSLCVDSHALRRDGMRFLHTADWQLGMTRH
FLAGDAQPRYSAARRDAVAGLKALAADVGAEFVVVAGDVFEHNQLAPQIV
GQSLEAMRVIGLPVYLLPGNHDPLDASSVYTSTLFRAERPDNVVVLDRAG
VHEVRPGVQIVAAPWRSKAPTTDPVAEVLAGLPTDAAIRLLVAHGGVDAL
DPDHDKPSLIRLAALDDALTRQAIHYVALGDKHSLTQVGSSGRVWYSGAP
EVTNFDDVEPDPGHVLVVDIDESDPRHPVTVDARRIGRWRFVTLHHQVDT
SRDIADLDLNLDLMTDKDRTVVRLALTGSLTVTDRAALDTCLDKYARLFA
WLGLWERHTDLAVIPVDAEFTDLGIGGFAAAAVDELVATARGGDDESAVD
AQAALALLLRLADRGAA
>MT3534 IS1540, transposase
MIDTAIEEMIPLIGVRAACAATGRAPASYYRAHSKRLSAQSDTFTSTAVT
DPSGPRESAQPRALSAAEREHVLAVLNSQRFADMAPAVVYATLLDEGIYL
CSESTMYRLLRERGQTGDRRRQATHPAAVKPELVAHQPNSVWSWDITKLR
GPAKWSYYYLYVILDIFSRYVVGWMVASRESKVLAERLIAQTLAAQHISA
DQLTLHADRGSSMSSKPVALLLADLGVTKSHSRPHTSNDNPLSEAQFKTL
KYRPDFPKRFESIEAARVHCDRFFGWYNHEHKHSGIGLHTPADVHYGRAD
QIRRHRATVLDTAYRDHLERIRSQTTRATRATGLQRDQPTTEGGPADSIN
PRKSCLRNVDRFRPGLLDLPAPAPVDLRRLLPSGQIR
>MT2881 IS6110, transposase
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>MT1197 MutT/nudix family protein
MLNQIVVAGAIVRGCTVLVAQRVRPPELAGRWELPGGKVAAGETERAALA
RELAEELGLEVADLAVGDRVGDDIALNGTTTLRAYRVHLLGGEPRARDHR
ALCWVTAAELHDVDWVPADRGWIADLARTLNGSAADVHRRC
>MT2886 CRISPR-associated protein, TM1807 family
MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADI
PAHKRKSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSI
EPRRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL
QSLVHKRTAQPVRVPGHQTREHRQYGERFERKELRKSGRPNTRPQDAVND
LFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRECLAPGTSISH
RVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV
GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPL
VLKRTKIDNICYEMGQCELSIRRAE
>MT2587 IS1081, transposase
MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL
STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI
PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV
TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV
VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL
VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH
SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQI
WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE
GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT
>MT4027 MutT/nudix family protein
MSDGEQAKSRRRRGRXRGRRAAATAENHMDAQPAGDATPTPATAKRSRSR
SPRRGSTRMRTVHETSAGGLVIDGIDGPRDAQVAALIGRVDRRGRLLWSL
PKGHIELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKT
VHHYLMRFLGGELSDEDLEVAEVAWVPIRELPSRLAYADERRLAEVADEL
IDKLQSDGPAALPPLPPSSPRRRPQTHSRARHADDSAPGQHNGPGPGP
>MT1217 hypothetical protein
MDPHRDLESRAFAGNWRVYQQQALDAFDADVAAGDNRAYLVLPPGAGKTM
IGLEAARRLGRRSLVLVPNTAVQAQWAAAWDNSFPSSDRSASKCGTERGL
ASAMNVLTYQSLAVIDAETDSTVRREVLRNRDQQALLDLLHPNGRAVIER
AATLGPWTLVLDECHHLLATWGALVSALASVLGAQTALIGLTATPATELT
AWQHTLHDELFGTADFVIPTPALVREGDLAPYQELVYLTQPTPEEQAWIG
THRARFADLMLALIDQKVGSMSLAAWLHTRIVDRATREGNQIAWSTFERA
EPDLACSGLRFAYDGLIPLPDGVRLREQHRIAPDAQDWVNVLTDFSVGHL
QQSADPRDAHALTAIKRVLPGLGYRLTSRGVRVATSPVDRLCALSESKIA
ATAHILDTEDAVLGARLRALVLCDFESMTGALPTSLKGAPVSEQSGSAQL
VAAMLAASDHRRRTPLHALLVTGQTFACPAAIEDDLIAFCAERGALVTAE
PLDAHPSLRVMRGTGGFTPRTWVALATEYFLAGRARVLVGTRSLLGEGWD
CAAVNVNIDLTSATTQAAITQMRGRAIRNDPSDGHKVADNWSVCCIATEH
PRGDADYLRLVRKHDGYYAATPQGLIESGVTHCDPSLSPYGPPVTDTHAI
TARALXASPNAPRRDPGGESASXTKESTSQPSACAPASRSGSPHPASPPR
H
>MT2368 integrase, putative
MTGAGIVETTTNRVRHVPVPEPVSERLRDELPTEPNALVFPSYRGGHLPI
EEYRRAFDKGCKAVGIADLVPHGLRHTTASLAISAGANVKVVQRLLGHAT
AAMTLDRHGHLLSDDLAGVAGLLVQAIKSAAASLRYSDPDSVAVENISAA
S
>MT2879 IS1604, transposase
MAVGDDEEKVRAERARAIGLFRYQLIWEAADAAHSTKQRGKMVRELASRE
HTDPFGRRVRISRQTIDRWIRGWRAGGFDALVPNPRQCTPRTPAEVLELA
VALRRENPQRTAAAIRRILRTQLGWAPDERTLQRNFHRLGLTGATTGSAP
AVFGRFEAEHPNALWTGDVLHGIRIDLRKTYLFAFLDDHSRLVPGYRWGH
AEDTVRLAAALRPALASRGVPNAVYVDNGSPYVDAWLLRACAKLGVRLVH
STPGRPQGRGKIERFFRTVREQFLVEITGEPDVVGRHYVADLAELNRLFT
AWVETVYHRSVHSETGQTPLARWSAGGPIPLPAPETLTEAFLWEEHRRVT
KTATVSLHGNRYEIDPALVGRKVELVFDPFDLTRIEVRLAGAPMGRAIPY
HIGRHSHPKAKPETPTAPPKPSGIDYAQLIETAHAAELARGVNYTALTGA
ADQIPGQLDLLTGQEAQPK
>MT3836 DNA ligase, putative
MQLPVMPPVSPMLAKSVTAIPPDASYEPKWDGFRSICFRDGDQVELGSRN
ERPMTRYFPELVAAIRAELPHRCVIDGEIIIATDHGLDFEALQQRIHPAE
SRVRMLADRTPASFIAFDLLALGDDDYTGRPFSERRAALVDAVTGSGADA
DLSIHVTPATTDMATAQRWFSEFEGAGLDGVIAKPPHITYQPDKRVMFKI
KHLRTADCVVAGYRVHKSGSDAIGSLLLGLYQEDGQLASVGVIGAFPMAE
RRRLLTELQPLVTSFDDHPWNWAAHVAGQRTPRKNEFSRWNVGKDLSFVP
LRPERVVEVRYDHMEGARFRHTAQFNRWRPDRDPRSCSYAQLERPLTVSL
SDIVPGLR
>MT2740 IS1081, transposase, truncation
MGGHRVILRNDQQKSIEGNDAMTSSHLIDTEQLLADQLAQASPDLLRGLL
STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI
PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV
TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV
VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL
VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANHGRHNA
>MT2964 smf family protein
MIDPTARAWAYLSRVAEPPCAQLAALVRCVGPVEAADRVRRGQVGNELAQ
HTGARREIDRAADDLELLMRRGGRLITPDDDEWPVLAFAAFSGAGARARP
CGHSPLVLWALGPARLDEVAPRAAAVVGTRAATAYGEHVAADLAAGLAER
DVAVVSGGAYGIDGAAHRAALDSEGITVAVLAGGFDIPYPAGHSALLHRI
AQHGVLFTEYPPGVRPARHRFLTRNRLVAAVARAAVVVEAGLRSGAANTA
AWARALGRVVAAVPGPVTSSASAGCHTLLRHGAELVTRADDIVEFVGHIG
ELAGDEPRPGAALDVLSEAERQVYEALPGRGAATIDEIAVGSGLLPAQVL
GPLAILEVAGLAECRDGRWRILRAGAGQAAAKGAAARLV
>MT2684 MutT/nudix family protein
MTWLVLAGAVLLVVLVAFGAWGYQTANRLNRLNVRYDLSWQSLDSALARR
AVVARAVAIDAYGGAPQGSRLAALADAAEGAPRHARENAENELSAALAMV
NPASLPAALIAELADAEARVLLARRFHNDAVRDTLALGERRLVRLLRLGG
TAVLPTYFEIVERPHALVHGDQGASGRRTSARVVLLDDSGAVLLLCGSDP
ANPAFRDGAAPKWWFTVGGQVRPGERLAQAAARELAEETGLRVAPADMIG
PIWRRDEVFEFNGSLIDSEEFYLVHRTRRFEPAVQGRTELERRYIRDARW
CDANDIAQLVAAGERVYPLQLGELLPAANRLVDVALDNGAARDAGVPQPI
R
>MT2233 IS1558', transposase
MRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMIGALDEQIEQLMH
PFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLASWVRLCPG
NHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFG
GFRSPAANKKAIIAVAHKLIVIIWHVLATGRPYQDLGADYFTTRMDPDKE
RRRLVAKLEAQGLGVTLEPAA
>MT1037 conserved hypothetical protein
MYWPGRLDGCAEPHVQREAFAWHIDLAKRTGKPLMIHNRQADRDVLDVLR
AEGAPDTVILHCFSSDAAMARTCVDAGWLLSLSGTVSFRNARELREAVPL
MPVEQLLVETDAPYLTPHPHRGLANEPYCLPYTVRALAELVNRRPEEVAL
ITTSNARRAYGLGWMRQ
>MT3773 conserved hypothetical protein
MSAGGTPLQAGATPTGSRGTVALRPDAGPSWLRPLVDNVGQIPDAYRRRL
PADVLAMVTAAGAVSAMTSSRRDHREAAVLVLFSGPEAGPGDGGVPDDAD
LLLTVRASTLRHHAGQAAFPGGVVDPADDGPVATALREANEETGIDPSRL
HPLATMERTFIAPSRFHVVPVLAYSPDPGPVAVVNEAETAIVARVPVRAF
INPANRLMVYRRPHTRRWAGPAFLLNQMLVWGFTGQVISAVLDVAGWAQP
WDTGDIRELDAAMVLIDDESDPR
>MT0150 hypothetical protein
MRSIDVVVEAVVTFAGAAGFAHTLAPLRRGQQDPCFRVPGDGTIWRTSLL
PTGPVTARISRAGRDAARCVAWGSGAEEFVDMAPAMLGAADDASDFVPLH
PAVAAAHRRLPNLRLGRTGQVLEALIPAVIEQRVPGADAFRSWRLLVSKY
GTQAPGPAPPGMRVPPSAEVWRHIPSWEFHRANVDPGRARAVVGCAQRAA
SLERLVSLPAARAAEALTSLPGVGVWTAAETTQRVFGDADAVSVGDYHIP
KMIGWTLVGRPVDDAGMLELLEPMRPHRHRVVRLLEASGLAREPRRGPRL
PVQNIRAL
>MT3494 IS1560, transposase
MFRTVGDQASLWESVLPEELRRLPEELARVDALLDDSAFFCPFVPFFDPR
MGRPSIPMETYLRLMFLKFRYRLGYESLCREVTDSITWRRFCRIPLEGSV
PHPTTLMKLTTRCGEDAVAGLNEALLAKAASEKLLRTNKVRADTTVVEGD
VGYPTDTGLLAKAVGSMARTVARIKAADAGSAPLGGSSGPRDRLQAAVTR
RAATRSGAGLRAPDHRGASRDRRAGADRGCRGGT
>MT3125 conserved hypothetical protein
MNSPREPLVPPPTPRPAATVMLVRDPDAGSASGLAVFLMRRHAAMDFAAG
VMVFPGGGVDDRXRDADLGRLGAWAGPPPQWWAQRFGIEPDLAEALVCAA
ARETFEESGVLFAGPVDQDHSAPNSIVSDASVYGDARRALADRTLSFADF
LQREKLVLRSDLLRPWANWVTPEAELTRRYDTYFFVGALPEGQRADGENT
ESDRAGWVLPADAIADFAAGRNFLLPPTWTQLDSLAGHTVADVLAVERQI
VPVQPQLARNGDNWEIEFFDSDRYNQARRSGGSTGWPL
>MT3044 conserved hypothetical protein
MTRIIGGVAGGRRIAVPPRGTRPTTDRVRESLFNIVTARRDLTGLAVLDL
YAGSGALGLEALSRGAASVLFVESDQRSAAVIARNIEALGLSGATLRRGA
VAAVVAAGTTSPVDLVLADPPYNVDSADVDAILAALGTNGWTREGTVAVV
ERATTCAPLTWPEGWRRWPQRVYGDTRLELAERLFANV
>MT3936 IS1537, resolvase
MSVVCCRNRWMNLAVWAERNGVAWVIAYRWFRAGLLPVPAQRVGRLILVN
DPAVEESGRGRTLVYARVSSADQRSDLDRRVARVTAWATSQHLSVDKVVA
EGGWALNGHRRKFFALLGDPVVTRIVVEHRDRFCWFGSEYVEAALVAQGR
ELVVVDLAEVDDDLVGDMTEILTSMCARLYGERAAQNGAKRALAAAVGDA
EAA
>MT1297.1 conserved hypothetical protein
MHPKTGRAFRSPVEPGSGWPGDPATPQTPVAADAAQVSALAGGAGSICEL
NALISVCRACPRLVSWREEVAVVKRRAFADQPYWGRPVPGWGSKRPRLLI
LGLAPAAHGANRTGRMFTGDRSGDQLYAALHRAGLVNSPVSVDAADGLRA
NRIRITAPVRCAPPGNSPTPAERLTCSPWLNAEWRLVSDHIRAIVALGGF
AWQVALRLAGASGTPKPRFGHGVVTELGAGVRLLGCYHPSQQNMFTGRLT
PTMLDDIFREAKKLAGIE
>MT2724 phage integrase family protein
MNTATRVRLARKRADRLNLKLIKNGHHFRLRDADEITLAVGHLGVVEAFL
AAAKSQNKPPGPPPSLHAPPSWRRDIDDYLLNLNAAGQRPATIRLRKTVL
CAAAHGLGRPPADVTAEHLLDWLGKQQHLSPEGRKTYRSTLRGFFVWAYE
MDRVRDYVADSLPKVRCPKQPPRPAGDDVWQAALAKADRRIELMIRLAGE
AGLRRAEAAQAHTGDLMDGGLLLVHGKGGKRRIVPISDYLAALIRDTPHG
YLFPNGTGGHLTAEHVGKLVSRALPGDATMHTLRHRYATRAYRGSHNLRA
VQQLLGHASIVTTERYTALCDDEVRAAAAAAW
>MT2882 IS6110, hypothetical protein
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>MT3357 hypothetical protein
MFNRYHRRVSDSRSSSWSRRSRGGSVARRAIRRGREMRGPLLPPTVPGWR
SRAERFDMAVLEAYEPIERRWQERVSQLDIAVDEIPRIAAKDPESVQWPP
EVIADGPIALARLIPAGVDVRGNATRARIVLFRKPIERRAKDTEELGELL
HEILVAQVAIYLDVDPSVIDPTIDD
>MT3964 hypothetical protein
MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPYLSQXRS
GNRTNPSGATMAALANFFRIKAAYFTDDEYYEKLDKELQWLCTMRDDGVR
RIAQRAHGLPSAAQQKVLDRIDELRRAEGIDA
>MT3296 helicase, UvrD/Rep family
MSHIWGVEAGAALAPGLRGPVLVLGGPGTGKSTLLVEAAVAHIGAGTDPE
SVLLLTGSGRMGMRARSALTTALLRSRTNGPCRAAIREPVVRTVHSYAYA
VLRKAAQRAGDALPRLLTSAEQDAIIRELLAGDAEDGPAATTTWPAHLRP
ALTTAGFATELRNLLARCAERGLDPLELQQLGRRRGRPEWIAAGQFAQRY
EQVMLLRGAVGLAAPQATAPALSAAELVGAALEAFAVDPELLAAERARVR
TLLVDDAQQLDPQAARLVRMLAAGTELALIAGDPNQAVFGFRGGEPTGLL
ADDPPPAGGAPIPSVTLTVSHRCAPAVARAVTGIARRLPGRSVGRRIEGT
GTEVGSVTVRLAGSAHAEAAMIADALRRAHLIDGVPWSQMAVIVRSVPRA
VRLPRALAAAGVPVAPPAVGGPLSAEPAVRALLTVLEATADGLDGDQALL
LLTGPIGGVDPVSLRQLRRTLQRARPGQTSRKFGDLLVEVLGGDAPPSGP
GSRALRRVRAVLTAAARCHRSGSLGGQDPRHTLWAAWQRSGLQRRWLAAS
EHGGAAAVQATRDLETVTALFDITDHYVSRTSGASLRGLVEHVTALQLPV
VRPEPAAPTEQVMVLSAHAALGHEWDLVVIAGLQDGLWPNTVPRGGVLGT
QRLLDELDGVTKDASMRAPLLAEERRLLVTAMGRARRRLLVTAVDSDAGG
GGHEAVLPSAFFFEIAQWADGDGEPVAMQPVSAPRVLSAAAVVGRLRVVV
CAPACAVDDADRDCAATQLARLAKAGVPGADPSEWHGLAPVSTSDPLCDS
DDLVTLTPSTLQALNDCPLRWLAERHGGTNTRELPSAVGSVLHALFAEPG
RSESQLLAELDRVWGHLPFGAQWYSANELARHRAMIQAFVQWRAQSRSEL
TEVGVEVDIDGALEDGSGQARKIRLRGRADRLERDPAGRLVIVDIKTGKT
PVSKDDAQQHAQLAMYQLAVAEGLVRAGDEPGGARLVYVGKSGAAGVAER
KQDPLTPAARDEWRNLVRQLAAATAGPQFIARRNDGCTHCPLRPGCPAHV
RGSAP
>MT2982 serine/threonine protein kinase
MGPSFGRAGRAERGYYRPMALASGVTFAGYTVVRMLGCSAMGEVYLVQHP
GFPGWQALKVLSPAMAADDEFRRRFQRETEVAARLFHPHILEVHDRGEFD
GQLWIAMDYVDGIDATQHMADRFPAVLPVGEVLAIVTAVAGALDYAHQRG
LLHRDVNPANVVLTSQSAGDQRILLADFGIASQPSYPAPELSAGADVDGR
ADQYALALTAIHLFAGAPPVDRSHTGPLQPPKLSAFRPDLARLDGVLSRA
LATAPADRFGSCREFADAMNEQAGVAIADQSSGGVDASEVTAAAGEEAYV
VDYPAYGWPEAVDCKEPSARAPAPAAPTPQRRGSMLQSAAGVLARRLDNF
STATKAPASPTRRRPRRILVGAVAVLLLAGLFAVGIVIGRKTNTTATEVA
RPPTSGSAVPSAPTTTVAVTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPD
VNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQWKS
RPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQ
QGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPDTTSTATLTPPTTTAPG
PGR
>MT2497 IS1558, transposase
MQCRAREERPGRKTDLLDAEWLVHLLECGLLRGWLIPPADIKAARDVIRY
RRKLVEHRTSKLQRLGNVLQDAGIKADSVASSVTPKSVRAMVEALIDGER
RPAVLADLARGSMRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMI
GALDEQIEQLMHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSA
EHLASWVRLCPGNHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYL
REYYRRQVRKFGGFRSPAANKKAIXXVAHKLIVIIWHVLATGRPXQDLGA
DYFTTRMDPDKERRRLVAKLEAQGLGVTLEPAA
>MT3776 hypothetical protein
MFTLLVSWLLVACVPGLLMLATLGLGRLERFLARDTVTATDVAEFLEQAE
AVDVHTLARNGMPEALDYLHRRQARRITDSPPLGSGAGPRYAGPLFVTDL
DSPVEPPRHGQPNPQFRTARHANHV
>MT0633 IS1536, resolvase
MACCRNRGMNLAAWAERNGVARVTAYRWFHAGLLPVPARKVGRLILVDEL
ASEAGAQPKTAVYARVSSADQKSDLDRQVARVTSWATAEQIPVDKVVTEV
GSVLNGHRRKFPAVLRDLSVTRIVVEHRDRFCRFGSEYVHAALAAQGREL
VVVDSAEVDDDLVWDMTEILTSMCARLYGKRAAQNRAKRAVAAAAVDDHE
AA
>MT3204 hypothetical protein
MPSSLAKRCSTPAPPRRGGHTKILHRLMSRQPIMGPTPIRGRRLRVVREE
FAWLRSRLPTLWTNSYFVATVGGFGLS
>MT0780 hypothetical protein
MVRLILDTPSMKELSVAEQRYQAVLAVISDGLSISQVAEKVGVSRQTLHT
WLARYEAEGLDGLRIGTGTAL
>MT2161 helicase, SNF2/RAD54 family
MLLPSLRSAPLDSPELIRLAPRPAARTDPMLLAWTVPVVDLDPTAALAAF
DQPAPDVRYGASVDYLAELAVFARELVERGRVLPQLRRDTHGAAACWRPV
LQGRDVVAMTSLVSAMPPVCRAEVGGHDPHELATSALDAMVDAAVRAALS
PMDLLPPRRGRSKRHRAVEAWLTALTCPDGRFDAEPDELDALAEALRPWD
DVGIGTVGPARATFRLSEVETENEETPAGSLWRLEFLLQSTQDPSLLVPA
EQAWNDDGSLRRWLDRPQELLLTELGRASRIFPELVPALRTACPSGLELD
ADGAYRFLSGTAAVLDEAGFGVLLPSWWDRRRKLGLVLSAYTPVDGVVGK
ASKFGREQLVEFRWELAVGDDPLSEEEIAALTETKSPLIRLRGQWVALDT
EQLRRGLEFLERKPTGRKTTAEILALAASHPDDVDTPLEVTAVRADGWLG
DLLAGAAAASLQPLDPPDGFTATLRPYQQRGLAWLAFLSSLGLGSCLADD
MGLGKTVQLLALETLESVQRHQDRGVGPTLLLCPMSLVGNWQQEAARFAP
NLRVYAHHGGARLHGEALRDHLERTDLVVSTYTTATRDIDELAEYEWNRV
VLDEAQAVKNSLSRAAKAVRRLRAAHRVALTGTPMENRLAELWSIMDFLN
PGLLGSSERFRTRYAIPIERHGHTEPAERLRASTRPYILRRLKTDPAIID
DLPEKIEIKQYCQLTTEQASLYQAVVADMMEKIENTEGIERRGNVLAAMA
KLKQVCNHPAQLLHDRSPVGRRSGKVIRLEEILEEILAEGDRVLCFTQFT
EFAELLVPHLAARFGRAARDIAYLHGGTPRKRRDEMVARFQSGDGPPIFL
LSLKAGGTGLNLTAANHVVHLDRWWNPAVENQATDRAFRIGQRRTVQVRK
FICTGTLEEKIDEMIEEKKALADLVVTDGEGWLTELSTRDLREVFALSEG
AVGE
>MT3064 DNA binding protein HU, putative
MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVF
EQRRRAARVARNPRTGETVKVKPTSVPAFRPGAQFKAVVSGAQRLPAEGP
AVKRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAAT
KAPAKKAVKATKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPAT
KAPAKKATARRGRK
>MT0948 IS1535, resolvase
MNLADWAESVGVNRHTAYRWFREGTLPVPAERVGRLILVKTAASASAAAA
GVVLYARVSSHDRRSDLDRQVARLTAWATERDLGVGQVVCEVGSGLNGKR
PKLRRILSDPDARVIVVEHRDRLARFGVEHLEAALSAQGRRIVVADPGET
TDDLVCDMIEVLTGMCARLYGRRGARNRAMRAVTEAKREPGAG
>MT3835 conserved hypothetical protein
MAAAAEELDVDGIAVRLTSPDRMYFPKLGSHGTKRRLVEYYFAVAGGPML
TALRDRPTHLQRFPDGVDGEQIYQKRIPRHRPDYLQTCRVTFPSGRMADA
LKVTHPAAIVWAAQMGTITLHPWQVRCPDTEHPDELRIDLDPQPGTGFVE
ARTVAVDVLRSVLDDLGLVGYPKTSGGRGIHVFLRIATDWDFVEVRRAGI
ALAREVERRAPDAVTTSWWKEERGARIFIDFNQNARDRTMASAYSVRPTP
IATVSMPLTWEELAGADPDDYTMTTVPELVKIRDDPWAGMDDVAQSIAPL
LDLAAADEERGLGDMPYPPNYPKMPGEPKRVQPSRDTDLKGGNTSK
>MT1804 IS6110, hypothetical protein
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV
RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA
AELDRPAR
>MT2966 conserved hypothetical protein
MAGDVGASVQVGRMTTLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWR
CRYGELDVIACDAATRTVVFVEVKTRTGDGYGGLAHAVTERKVRRLRRLA
GLWLADQEERWAAVRIDVIGVRVGPKNSGRTPELTHLQGIG
>MT3100 IS6110, transposase
MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH
AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI
ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR
RILGWRVASTMATSMVLDAIEQAIWTRQQESVLDLKDVIHHTDRGSQYTS
IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE
DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG
>MT3838 hypothetical protein
MPKLSAGVLLYRARAGVVDVLLAHPGGPFWAGKDDGAWSIPKGEYTGGED
PWLAARREFSEEIGLCVPDGPRIDFGSLKQSGGKVVTVFGVRADLDITDA
RSSTFELDWPKGSGKMRKFPEVDRVSWFPVARARTKLLKGQRGFLDRLMA
HPAVAGLSEGPESLPR
>MT1508 conserved hypothetical protein, intein-containing
MTLTPEASKSVAQPPTQAPLTQEEAIASLGRYGYGWADSDVAGANAQRGL
SEAVVRDISAKKNEPDWMLQSRLKALRIFDRKPIPKWGSNLDGIDFDNIK
YFVRSTEKQAASWDDLPEDIRNTYDRLGIPEAEKQRLVAGVAAQYESEVV
YHQIREDLEAQGVIFLDTDTGLREHPDIFKEYFGTVIPAGDNKFSALNTA
VWSGGSFIYVPPGVHVDIPLQAYFRINTENMGQFERTLIIADEGSYVHYV
EGCLPAGELITTADGDLRPIESIRVGDFVTGHDGRPHRVTAVQVRDLDGE
LFTFTPMSPANAFSVTAEHPLLAIPRDEVRVMRKERNGWKAEVNSTKLRS
AEPRWIAAKDVAEGDFLIYPKPKPIPHRTVLPLEFARLAGYYLAEGHACL
TNGCESLIFSFHSDEFEYVEDVRQACKSLYEKSGSVLIEEHKHSARVTVY
TKAGYAAMRDNVGIGSSNKKLSDLLMRQDETFLRELVDAYVNGDGNVTRR
NGAVWKRVHTTSRLWAFQLQSILARLGHYATVELRRPGGPGVIMGRNVVR
KDIYQVQWTEGGRGPKQARDCGDYFAVPIKKRAVREAHEPVYNLDVENPD
SYLAYGFAVHNCTAPIYKSDSLHSAVVEIIVKPHARVRYTTIQNWSNNVY
NLVTKRARAEAGATMEWIDGNIGSKVTMKYPAVWMTGEHAKGEVLSVAFA
GEDQHQDTGAKMLHLAPNTSSNIVSKSVARGGGRTSYRGLVQVNKGAHGS
RSSVKCDALLVDTVSRSDTYPYVDIREDDVTMGHEATVSKVSENQLFYLM
SRGLTEDEAMAMVVRGFVEPIAKELPMEYALELNRLIELQMEGAVG
>MT3299 6-O-methylguanine DNA methyltransferase, putative
MAPVTDEQVELVRSLVAAIPLGRVSTYGDIAALAGLSSPRIVGWIMRTDS
SDLPWHRVIRASGRPAQHLATRQLELLRAEGVLSVDGRVALSEIRYEFPP
G
>MT2232 serine/threonine protein kinase
MVEAGTRDPLESALLDSRYLVQAKIASGGTSTVYRGLDVRLDRPVALKVM
DSRYAGDEQFLTRFRLEARAVARLNNRALVAVYDQGKDGRHPFLVMELIE
GGTLRELLIERGPMPPHAVVAVLRPVLGGLAAAHRAGLVHRDVKPENILI
SDDGDVKLADFGLVRAVAAASITSTGVILGTAAYLSPEQVRDGNADPRSD
VYSVGVLVYELLTGHTPFTGDSALSIAYQRLDADVPRASAVIDGVPPQFD
ELVACATARNPADRYADAIAMGADLEAIAEELALPEFRVPAPRNSAQHRS
AALYRSRITQQGQLGAKPVHHPTRQLTRQPGDCSEPASGSEPEHEPITGQ
FAGIAIEEFIWARQHARRMVLVWVSVVLAITGLVASAAWTIGSNLSGLL
>MT3293 MutT/nudix family protein
MRWSRPRWAGLRGTCATSCSAWYVTASCCALGRPRWTPSRPATGCSIFAR
WGASGDRKRGGAGRGGSPPGGAPVTNVSGVDFQLRSVPLLSRVGADRADR
LRTDMEAAAAGWPGAALLRVDSRNRVLVANGRVLLGAAIELADKPPPEAV
FLGRVEGGRHVWAVRAALQPIADPDIPAEAVDLRGLGRIMDDTSSQLVSS
ASALLNWHDNARFSALDGAPTKPARAGWSRVNPITGHEEFPRIDPAVICL
VHDGADRAVLARQAAWPERMFSLLAGFVEAGESFEVCVAREIREEIGLTV
RDVRYLGSQQWPFPRSLMVGFHALGDPDEEFSFSDGEIAEAAWFTRDEVR
AALAAGDWSSASESKLLLPGSISIARVIIESWAACE
>MT1029 hypothetical protein
MCDKLGGVAIAVQGALFEHNERRQLGDGAFIDIRSGWLTGGEELLDALLS
TVPWRAERRQMYDRVVDVPRLVSFHDLTIEDPPHPQLARMRRRLNDIYGG
ELGEPFTTAGLCYYRDGSDSVAWHGDTIGRGSTEDTMVAIVSLGATRVFA
LRPRGRGPSLRLPLAHGDLLVMGGSCQRTFEHAVPKTSAPTGPRVSIQFR
PRDVR
>MT3537 IS1535, transposase
MFAELIRAGLQALIEAEATEAIGAGRYERSDGRIVHRNGHRPKTVSTTAG
DIEVQIPKLRAGSFFPSLLERRRRIDKALHAVIMEAYVHGVSTRSVDDLV
AAMGVQAGVSKSEVSRICAGLDTEIEAFRTRSLTHTEFPYVFCDATFCKV
RVGAHVVSQALVVATGVSIDGTREVLGTAVGDSESYEFWREFLASLKARG
LTGVHLVISDAHAGLKAAVAQQFSGASWQRCRVHFMRNLYTAVAAKHAPA
VTVAVKTIFAHTDPEEVGAQWDRVADPLCQP
>MT1739 MutT/nudix family protein
MAEHDFETISSETLHTGAIFALRRDQVRMPGGGIVTREVVEHFGAVAIVA
MDDNGNIPMVYQYRHTYGRRLWELPAGLLDVAGEPPHLTAARELREEVGL
QASTWQVLVDLDTAPGFSDESVRVYLATGLREVGRPEAHHEEADMTMGWY
PIAEAARRVLRGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAF
AARRAER
>MT2631 conserved hypothetical protein
MVPAQHRPPDRPGDPAHDPGRGRRLGIDVGAARIGVACSDPDAILATPVE
TVRRDRSGKHLRRLAALAAELEAVEVIVGLPRTLADRIGRSAQDAIELAE
ALARRVSPTPVRLADERLTTVSAQRSLRQAGVRASEQRAVIDQAAAVAIL
QSWLDERLAAMAGTQEGSDA
>MT1491 hypothetical protein
MTVMADRSGRPAPVRRRMKTLTQAALNADKTVEQVEDVLDGLGKTMAELN
SSLSQLNSTVERLEDGLDHLEGTLHSLDDLAKRLIVLVEPVEAIVDRIDY
IVSLGETVMSPLSVTEHAVRGVLDRLRNRTVHEPTN
>MT2636 ATPase, AAA family
MPEAVSDGLFDVPGVPMTSGHDLGASAGAPLAVRMRPASLDEVVGQDHLL
APGSPLRRLVEGSGVASVILYGPPGSGKTTLAALISQATGRRFEALSALS
AGVKEVRAVIENSRKALLHGEQTVLFIDEVHRFSKTQQDALLSAVEHRVV
LLVAATTENPSFSVVAPLLSRSLILQLRPLTAEDTRAVVQRAIDDPRGLG
RAVAVAPEAVDLLVQLAAGDARRALTALEVAAEAAQAAGELVSVQTIERS
VDKAAVRYDRDGDQHYDVVSAFIKSVRGSDVDAALHYLARMLVAGEDPRF
IARRLMILASEDIGMAGPSALQVAVAAAQTVALIGMPEAQLTLAHATIHL
ATAPKSNAVTTALAAAMNDIKAGKAGLVPAHLRDGHYSGAAALGNAQGYK
YSHDDPDGVVAQQYPPDELVDVDYYRPTGRGGEREIAGRLDRLRAIIRKK
RG
>MT1727.1 DNA-3-methyladenine glycosidase
MNAEELAIDPVAAAHRLLGATIAGRGVRAMVVEVEAYGGVPDGPWPDAAA
HSYRGRNGRNDVMFGPPGRLYTYRSHGIHVCANVACGPDGTAAAVLLRAA
AIEDGAELATSRRGQTVRAVALARGPGNLCAALGITMADNGIDLFDPSSP
VRLRLNDTHRARSGPRVGVSQAADRPWRLWLTGRPEVSAYRRSSRAPARG
ASD
>MT1237 IS1081, transposase
MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL
STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI
PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV
TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV
VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL
VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH
SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKKI
WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE
GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT
>MT3573.7 conserved hypothetical protein
MPRPPKPARLKLVEGRSPGRDSGGRKVPESPKFIRQAPDAPDWLDAEALA
EWRRVAPTLERLDLLKPEDRALLSAYCETWSVYVAAVQRVRAEGLTITSP
KSGVVHRNPAVTVAETARMHLLRLASEFGLTPAAEQRLAVAPGDDGDGLN
PFAPDR
>MT3740 IS1534, istB protein
MAAKTATNSRDVAAELAYLTRALKAPTLRGAIEQLADRARTKTWSYEEFL
AACLQREVSARESHGGEGRIRAARFPSRKSLEEFDFDHARGLKRDTIAHL
GTLDFVTLAIGIAIRACQAGHRVLFATASQWVDRLAAAHHSGTLQSELIR
LARYPLLVVDEVGYIPFEPEAANLFFQLVSSRYERASLIVTSNKPFGRWG
EVFGDDVVAAAMIDRLVHHAEVIALKGDSYRIKDRDLGRVPTVTADDQ
>MT2153 ATP-dependent RNA helicase, DEAD/DEAH box family
MTELAELDRFTAELPFSLDDFQQRACSALERGHGVLVCAPTGAGKTVVGE
FAVHLALAAGSKCFYTTPLKALSNQKHTDLTARYGRDQIGLLTGDLSVNG
NAPVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADRMRGPVWEEVI
LQLPDDVRVVSLSATVSNAEEFGGWIQTVRGDTTVVVDEHRPVPLWQHVL
VGKRMFDLFDYRIGEAEGQPQVNRELLRHIAHRREADRMADWQPRRRGSG
RPGFYRPPGRPEVIAKLDAEGLLPAITFVFSRAGCDAAVTQCLRSPLRLT
SEEERARIAEVIDHRCGDLADSDLAVLGYYEWREGLLRGLAAHHAGMLPA
FRHTVEELFTAGLVKAVFATETLALGINMPARTVVLERLVKFNGEQHMPL
TPGEYTQLTGRAGRRGIDVEGHAVVIWHPEIEPSEVAGLASTRTFPLRSS
FAPSYNMTINLVHRMGPQQAHRLLEQSFAQYQADRSVVGLVRGIERGNRI
LGEIAAELGGSDAPILEYARLRARVSELERAQARASRLQRRQAATDALAA
LRRGDIITITHGRRGGLAVVLESARDRDDPRPLVLTEHRWAGRISSADYS
GTTPVGSMTLPKRVEHRQPRVRRDLASALRSAAAGLVIPAARRVSEAGGF
HDPELESSREQLRRHPVHTSPGLEDQIRQAERYLRIERDNAQLERKVAAA
TNSLARTFDRFVGLLTEREFIDGPATDPVVTDDGRLLARIYSESDLLVAE
CLRTGAWEGLKPAELAGVVSAVVYETRGGDGQGAPFGADVPTPRLRQALT
QTSRLSTTLRADEQAHRITPSREPDDGFVRVIYRWSRTGDLAAALAAADV
NGSGSPLLAGDFVRWCRQVLDLLDQVRNAAPNPELRATAKRAIGDIRRGV
VAVDAG
>MT3363 modification methylase
MQPSHPTRPGAVIRYVGSSLDTCPMTTFAGKTAASADKVRGGYYTPPAVA
RFLAHWVHQAGPKILEPSCGDGRILRELSAITDHAHGVELVAREAKKSRD
FASVDTENLFTWLHKTQLGSWDGVAGNPPYIRFGNWASEQRDPALELMRR
VGLRPTKLTNAWVPFVVASTTLARDGGRVGLVVPAELLQVTYAAQLREFL
LSRYREITLVTFERLVFDGILQEVVLFCGVVGPGPAHIRTVRLGDANDLN
ALGDKDFTNESAPALLHEKEKWTKYFLDPAQIRLLRGLKQSATMIRLGEL
ADVDVGIVTGRNSFFTFTDAKAQALGLRAHCVPLVSRSAQLSGLIYDEDC
RACDVAGNHRTWLLDAADYPTDPALVAHITAGEAAGVHLGYKCSIRKPWW
STPSLWMPDLFMLRQIHFAPRLTVNAAAATSTDTVHRVRLDPNVDPATLA
AVFHNSATFAFAEIMGRSYGGGILELEPREAEQLPMPPPAYGSAELAQDV
DLLLKANEIDKALDVVDRHVLIDGLGLSPRLVAGCRAAWLTLRDRRTKRG
SRR
>MT2151 5'-3'-exonuclease, putative
MRSPLVLLDGASMWFRSFFGVPSSITAPDGRPVNAVRGFIDSMAVVITQQ
RPNRLAVCLDLDWRPQFRVDLIPSYKAHRVAEPEPNGQPDVEEVPDELTP
QVDMIMELLDAFGIAMAGAPGFEADDVLGTLATRERRDPVIVVSGDRDLL
QVVADDPVPVRVLYLGRGLAKATLFGPAEVAERYGLPAHRAGAAYAELAL
LRGDPSDGLPGVPGVGEKTAATLLARHGSLDQIMAAADDRKTTMAKGLRT
KLLAASAYIKAADRVVRVATDAPVTLSTPTDRLPLVAADPERTAELATRF
GVESSIARLQKALDTLPG
>MT1292 deaD-1, ATP-dependent RNA helicase DeaD
MAFPEYSPAASAATFADLQIHPRVLRAIGDVGYESPTAIQAATIPALMAG
SDVVGLAQTGTGKTAAFAIPMLSKIDITSKVPQALVLVPTRELALQVAEA
FGRYGAYLSQLNVLPIYGGSSYAVQLAGLRRGAQVVVGTPGRMIDHLERA
TLDLSRVDFLVLDEADEMLTMGFADDVERILSETPEYKQVALFSATMPPA
IRKLSAKYLHDPFEVTCKAKTAVAENISQSYIQVARKMDALTRVLEVEPF
EAMIVFVRTKQATEEIAEKLRARGFSAAAISGDVPQAQRERTITALRDGD
IDILVATDVAARGLDVERISHVLNYDIPHDTESYVHRIGRTGRAGRSGAA
LIFVSPRELHLLKAIEKATRQTLTEAQLPTVEDVNTQRVAKFADSITNAL
GGPGIELFRRLVEEYEREHDVPMADIAAALAVQCRGGEAFLMAPDPPLSR
RNRDQRRDRPQRPKRRPDLTTYRVAVGKRHKIGPGAIVGAIANEGGLHRS
DFGQIRIGPDFSLVELPAKLPRATLKKLAQTRISGVLIDLRPYRPPDAAR
RHNGGKPRRKHVG
>MT3307 deaD-2, ATP-dependent RNA helicase DeaD
MTAVKHTTESTFAKLGVRDEIVRALGEEGIKRPFAIQELTLPLALDGEDV
IGQARTGMGKTFAFGVPLLQRITSGDGTRPLTGAPRALVVVPTRELCLQV
TDDLATAGKYLTAGPDTDDAAAVRRRLSVVSIYGGRPYEPQIEALRAGAD
VVVGTPGRLLDLCQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQI
PADRQSMLFSATMPDPIITLARTFMVRPTHIRAEAPHSSAVHDATEQFVY
RAHALDKVELVSRVLQARDRGATMIFTRTKRTAQKVADELTERGFAVGAV
HGDLGQLAREKALKAFRTGGIDVLVATDVAARGIDIDDVTHVINYQCPED
EKMYVHRIGRTGRAGRTGVAVTLVDWDELPRWSMIDQALGLGSPDPAETY
SNSPHLYAELAIPATAGGTVGPARKSQGRRRDTDCDGQKTAQHARNTPRR
RRTRGGKPVTGHPGTNPISSPIVGGDATSEPGSGTASDSGSDVVSGSRSG
NGEAARRRRRRRRRPTHAQDGFAARAN
>MT1371 dinG, ATP-dependent helicase DinG
MSESVSMSVPELLAIAVAALGGTRRRGQQEMAAAVAHAFETGEHLVVQAG
TGTGKSLAYLVPAIIRALCDDAPVVVSTATIALQRQLVDRDLPQLVDSLT
NALPRRPKFALLKGRRNYLCLNKIHNSVTASDHDDERPQEELFDPVAVTA
LGRDVQRLTAWASTTVSGDRDDLKPGVGDRSWSQVSVSARECLGVARCPF
GSECFSERARGAAGLADVVVTNHALLAIDAVAESAVLPEHRLLVVDEAHE
LADRVTSVAAAELTSATLGMAARRITRLVDPKVTQRLQAASATFSSAIHD
ARPGRIDCLDDEMATYLSALRDAASAARSAIDTGSDTTTASVRAEAGAVL
TEISDTASRILASFAPAIPDRSDVVWLEHEDNHESARAVLRVAPLSVAEL
LATQVFARATTVLTSATLTIGGSFDAMATAWGLTADTPWRGLDVGSPFQH
AKSGILYVAAHLPPPGRDGSGSAEQLTEIAELITAAGGRTLGLFSSMRAA
RAATEAMRERLSTPVLCQGDDSTSTLVEKFTADAATSLFGTLSLWQGVDV
PGPSLSLVLIDRIPFPRPDDPLLSARQRAVAARGGNGFMTVAASHAALLL
AQGSGRLLRRVTDRGVVAVLDSRMATARYGEFLRASLPPFWQTTNATQVR
AALRRLARADAKAH
>MT0001 dnaA, chromosomal replication initiator protein DnaA
MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQRAWLNL
VQPLTIVEGFALLSVPSSFVQNEIERHLRAPITDALSRRLGHQIQLGVRI
APPATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSY
FTERPHNTDSATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARA
YNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTEEFTNDFINSL
RDDRKVAFKRSYRDVDVLLVDDIQFIEGKEGIQEEFFHTFNTLHNANKQI
VISSDRPPKQLATLEDRLRTRFEWGLITDVQPPELETRIAILRKKAQMER
LAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDKALAEIVLR
DLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYL
CRELTDLSLPKIGQAFGRDHTTVMYAQRKILSEMAERREVFDHVKELTTR
IRQRSKR
>MT0064 dnaB, replicative DNA helicase, intein-containing
MAVVDDLAPGMDSSPPSEDYGRQPPQDLAAEQSVLGGMLLSKDAIADVLE
RLRPGDFYRPAHQNVYDAILDLYGRGEPADAVTVAAELDRRGLLRRIGGA
PYLHTLISTVPTAANAGYYASIVAEKALLRRLVEAGTRVVQYGYAGAEGA
DVAEVVDRAQAEIYDVADRRLSEDFVALEDLLQPTMDEIDAIASSGGLAR
GVATGFTELDEVTNGLHPGQMVIVAARPGVGKSTLGLDFMRSCSIRHRMA
SVIFSLEMSKSEIVMRLLSAEAKIKLSDMRSGRMSDDDWTRLARRMSEIS
EAPLFIDDSPNLTMMEIRAKARRLRQKANLKLIVVDYLQLMTSGKKYESR
QVEVSEFSRHLKLLAKELEVPVVAISQLNRGPEQRTDKKPMLADLRESGC
LTASTRILRADTGAEVAFGELMRSGERPMVWSLDERLRMVARPMINVFPS
GRKEVFRLRLASGREVEATGSHPFMKFEGWTPLAQLKVGDRIAAPRRVPE
PIDTQRMPESELISLARMIGDGSCLKNQPIRYEPVDEANLAAVTVSAAHS
DGAAIRDDYLAARVPSLRPARQRLPRGRCTPIAAWLAGLGLFTKRSHEKC
VPEAVFRAPNDQVALFLRHLWSAGGSVRWDPTNGQGRVYYGSTSRRLIDD
VAQLLLRVGIFSWITHAPKLGGHDSWRLHIHGAKDQVRFLRHVGVHGAEA
VAAQEMLRQLKGPVRNPNLDSAPKKVWAQVRNRLSAKQMMDIQLHEPTMW
KHSPSRSRPHRAEARIEDRAIHELARGDAYWDTVVEITSIGDQHVFDGTV
SGTHNFVANGISLHNSLEQDADVVILLHRPDAFDRDDPRGGEADFILAKH
RNGPTKTVTVAHQLHLSRFANMAR
>MT1598 dnaE, DNA polymerase III, alpha subunit
MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGN
MFGASEFYNSATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVS
GSGSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAE
GIIITTGCPSGEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGL
TIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEALLCVQTGKTL
SDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVW
TPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFPAGPPDGYRERAAYEID
VICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDID
PIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQV
ITFGTIKTKAALKDSARIHYGQPGFAIADRITKALPPAIMAKDIPLSGIT
DPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMS
SEPLTEAIPLWKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDA
IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRDLLRR
MQPTGFEDVVAVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPL
REILAETYGLIVYQEQIMRIAQKVASYSLARADILRKAMGKKKREVLEKE
FEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWTAY
LKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNF
ASVGQDIRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISAC
NKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLF
GSNDDGTGTADPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAH
LLAAQVDTAIPAILDGDVPNDAQVRVGGILASVNRRVNKNGMPWASAQLE
DLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRDDRIALIANDLTV
PDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTSQVHLRLISGD
RITTLALDQSLRVTPSPALMGDLKELLGPGCLGS
>MT2408 dnaG, DNA primase
MSGRISDRDIAAIREGARIEDVVGDYVQLRRAGADSLKGLCPFHNEKSPS
FHVRPNHGHFHCFGCGEGGDVYAFIQKIEHVSFVEAVELLADRIGHTISY
TGAATSVQRDRGSRSRLLAANAAAAAFYAQALQSDEAAPARQYLTERSFD
AAAARKFGCGFAPSGWDSLTKHLQRKGFEFEELEAAGLSRQGRHGPMDRF
HRRLLWPIRTSAGEVVGFGARRLFDDDAMEAKYVNTPETLLYKKSSVMFG
IDLAKRDIAKGHQAVVVEGYTDVMAMHLAGVTTAVASCGTAFGGEHLAML
RRLMMDDSFFRGELIYVFDGDEAGRAAALKAFDGEQKLAGQSFVAVAPDG
MDPCDLRLKCGDAALRDLVARRTPLFEFAIRAAIAEMDLDSAEGRVAALR
RCVPMVGQIKDPTLRDEYARQLAGWVGWADVAQVIGRVRGEAKRTKHPRL
GRLGSTTIARAAQRPTAGPPTELAVRPDPRDPTLWPQREALKSALQYPAL
AGPVFDALTVEGFTHPEYAAVRAAIDTAGGTSAGLSGAQWLDMVRQQTTS
TVTSALISELGVEAIQVDDDKLPRYIAGVLARLQEVWLGRQIAEVKSKLQ
RMSPIEQGDEYHALFGDLVAMEAYRRSLLEQASGDDLTA
>MT0002 dnaN, DNA polymerase III, beta subunit
MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSGVLLTGS
DNGLTISGFDYEVSAEAQVGAEIVSPGSVLVSGRLLSDITRALPNKPVDV
HVEGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQ
VAIAAGRDDTLPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDI
EAAVLVPAKTLAEAAKAGIGGSDVRLSLGTGPGVGKDGLLGISGNGKRST
TRLLDAEFPKFRQLLPTEHTAVATMDVAELIEAIKLVALVADRGAQVRME
FADGSVRLSAGADDVGRAEEDLVVDYAGEPLTIAFNPTYLTDGLSSLRSE
RVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYVYLLMPVRL
PG
>MT3814 dnaQ, DNA polymerase III, epsilon subunit
MSHTWGRPASHQDRGWAVIDVETSGFRPGQARIISLAVLGLDAAGRLEQS
VVSLLNPKVDPGPTHVHGLTAAMLDGQPQFADIAGEVVDVLRGRTLVAHN
VAFDYAFLAAEAEIAEAELPVDFVMCTVELARRLQLGVDNLRLETLAAHW
GVPQQRPHDAFDDVRVLTGILAAALESARELDVWLPVHPVTRRRWPNGRV
THDELRPLKALAARMACPYLNPGRYVQGRPLVQGMRVGLAAEVKRTHEEL
VERILHAGLAYSDVVDRDTSLVVCNATAPEHGKGYHALQLGVPVMPEARF
MECIGAVVGGASVEDFTDVAPVEKQLALF
>MT3824 dnaZX, DNA polymerase III, gamma and tau subunits
MALYRKYRPASFAEVVGQEHVTAPLSVALDAGRINHAYLFSGPRGCGKTS
SARILARSLNCAQGPTANPCGVCESCVSLAPNAPGSIDVVELDAASHGGV
DDTRELRDRAFYAPVQSRYRVFIVDEAHMVTTAGFNALLKIVEEPPEHLI
FIFATTEPEKVLPTIRSRTHHYPFRLLPPRTMRALLARICEQEGVVVDDA
VYPLVIRAGGGSPRDTLSVLDQLLAGAADTHVTYTRALGLLGVTDVALID
DAVDALAACDAAALFGAIESVIDGGHDPRRFATDLLERFRDLIVLQSVPD
AASRGVVDAPEDALDRMREQAARIGRATLTRYAEVVQAGLGEMRGATAPR
LLLEVVCARLLLPSASDAESALLQRVERIETRLDMSIPAPQAVPRPSAAA
AEPKHQPAREPRPVLAPTPASSEPTVAAVRSMWPTVRDKVRLRSRTTEVM
LAGATVRALEDNTLVLTHESAPLARRLSEQRNADVLAEALKDALGVNWRV
RCETGEPAAAASPVGGGANVATAKAVNPAPTANSTQRDEEEHMLAEAGRG
DPSPRRDPEEVALELLQNELGARRIDNA
>MT2539 fpg-1, formamidopyrimidine-DNA glycosylase
MPEGHTLHRLARLHQRRFAGAPVSVSSPQGRFADSASALNGRVLRRASAW
GKHLFHHYVGGPVVHVHLGLYGTFTEWARPTDGWLPEPAGQVRMRMVGAE
FGTDLRGPTVCESIDDGEVADVVARLGPDPLRSDANPSSAWSRITKSRRP
IGALLMDQTVIAGVGNVYRNELLFRHRIDPQRPGRGIGEPEFDAAWNDLV
SLMKVGLRRGKIIVVRPEHDHGLPSYLPDRPRTYVYRRAGEPCRVCGGVI
RTALLEGRNVFWCPVCQT
>MT2994 fpg-2, formamidopyrimidine-DNA glycosylase
MPELPEVEVVRRGLQAHVTGRTITEVRVHHPRAVRRHDAGPADLTARLRG
ARINGTDRRGKYLWLTLNTAGVHRPTDTALVVHLGMSGQMLLGAVPCAAH
VRISALLDDGTVLSFADQRTFGGWLLADLVTVDGSVVPVPVAHLARDPLD
PRFDCDAVVKVLRRKHSELKRQLLDQRVVSGIGNIYADEALWRAKVNGAH
VAATLRCRRLGAVLHAAADVMREALAKGGTSFDSLYVNVNGESGYFERSL
DAYGREGENCRRCGAVIRRERFMNRSSFYCPRCQPRPRK
>MT0006 gyrA, DNA gyrase subunit A
MTDTTLPPDDSLDRIEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLKP
VHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDTLVRMA
QPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETV
DFIPNYDGRVQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADA
VFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGTADAYKTGRGS
IRMRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRDGKLAGIS
NIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKHTQLQTSFGANMLAIVD
GVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDA
LDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQR
IIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIA
ADGDVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLK
QDDIVAHFFVCSTHDLILFFTTQGRVYRAKAYDLPEASRTARGQHVANLL
AFQPEERIAQVIQIRGYTDAPYLVLATRNGLVKKSKLTDFDSNRSGGIVA
VNLRDNDELVGAVLCSADDDLLLVSANGQSIRFSATDEALRPMGRATSGV
QGMRFNIDDRLLSLNVVREGTYLLVATSGGYAKRTAIEEYPVQGRGGKGV
LTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQTKG
VRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN
>MT0005 gyrB, DNA gyrase subunit B
MHATPEESIRIVAAQKKKAQDEYGAASITILEGLEAVRKRPGMYIGSTGE
RGLHHLIWEVVDNAVDEAMAGYATTVNVVLLEDGGVEVADDGRGIPVATH
ASGIPTVDVVMTQLHAGGKFDSDAYAISGGLHGVGVSVVNALSTRLEVEI
KRDGYEWSQVYEKSEPLGLKQGAPTKKTGSTVRFWADPAVFETTEYDFET
VARRLQEMAFLNKGLTINLTDERVTQDEVVDEVVSDVAEAPKSASERAAE
STAPHKVKSRTFHYPGGLVDFVKHINRTKNAIHSSIVDFSGKGTGHEVEI
AMQWNAGYSESVHTFANTINTHEGGTHEEGFRSALTSVVNKYAKDRKLLK
DKDPNLTGDDIREGLAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQKVCN
EQLTHWFEANPTDAKVVVNKAVSSAQARIAARKARELVRRKSATDIGGLP
GKLADCRSTDPRKSELYVVEGDSAGGSAKSGRDSMFQAILPLRGKIINVE
KARIDRVLKNTEVQAIITALGTGIHDEFDIGKLRYHKIVLMADADVDGQH
ISTLLLTLLFRFMRPLIENGHVFLAQPPLYKLKWQRSDPEFAYSDRERDG
LLEAGLKAGKKINKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLD
DAAAADELFSILMGEDVDARRSFITRNAKDVRFLDV
>MT3747 holB, DNA polymerase III, delta' subunit
MPMMSGVFTRLVGQQAVEAELLATAKAARRDSAHSAGGGGTMTHAWLLTG
PPGSGRSVAALCFAAALQCTSGGEPGCGRCRACTTTLAGTHADVRRVIPE
GLSIGVDEMRAIVQIAARRPTTGHWQIVVIEDADRLTEGAANALLKVVEE
PPPSTVFLLCAPSVDPEDIAVTLRSRCRHVALVTPSTHAIAQVLSDGDGL
DPDTANWAASVSGGHVGRARRLATDPQARQRRERALGLARDAATPSRAYA
AAEELVAGAEAEALALTAQRIEAETEELRTALGAGGTGKGTGAALRGATG
AMKDLERRQKSRQTRASRDALDRALIDLATYFRDALLVAAHAGGVRANHP
DMADRVAALAAHAPPERLLRCIEAVLACREALAVNVKPKFAVDAMVATIG
QELR
>MT3094 lig, DNA ligase
MLRQWQALAEEVREHQFRYYVRDAPIISDAEFDELLRRLEALEEQHPELR
TPDSPTQLVGGAGFATDFEPVDHLERMLSLDNAFTADELAAWAGRIHAEV
GDAAHYLCELKIDGVALSLVYREGRLTRASTRGDGRTGEDVTLNARTIAD
VPERLTPGDDYPVPEVLEVRGEVFFRLDDFQALNASLVEEGKAPFANPRN
SAAGSLRQKDPAVTARRRLRMICHGLGHVEGFRPATLHQAYLALRAWGLP
VSEHTTLATDLAGVRERIDYWGEHRHEVDHEIDGVVVKVDEVALQRRLGS
TSRAPRWAIAYKYPPEEAQTKLLDIRVNVGRTGRITPFAFMTPVKVAGST
VGQATLHNASEIKRKGVLIGDTVVIRKAGDVIPEVLGPVVELRDGSEREF
IMPTTCPECGSPLAPEKEGDADIRCPNARGCPGQLRERVFHVASRNGLDI
EVLGYEAGVALLQAKVIADEGELFALTERDLLRTDLFRTKAGELSANGKR
LLVNLDKAKAAPLWRVLVALSIRHVGPTAARALATEFGSLDAIAAASTDQ
LAAVEGVGPTIAAAVTEWFAVDWHREIVDKWRAAGVRMVDERDESVPRTL
AGLTIVVTGSLTGFSRDDAKEAIVARGGKAAGSVSKKTNYVVAGDSPGSK
YDKAVELGVPILDEDGFRRLLADGPASRT
>MT1048 mfd, transcription-repair coupling factor
MTAPGPACSDTPIAGLVELALSAPTFQQLMQRAGGRPDELTLIAPASARL
LVASALARQGPLLVVTATGREADDLAAELRGVFGDAVALLPSWETLPHER
LSPGVDTVGTRLMALRRLAHPDDAQLGPPLGVVVTSVRSLLQPMTPQLGM
MEPLTLTVGDESPFDGVVARLVELAYTRVDMVGRRGEFAVRGGILDIFAP
TAEHPVRVEFWGDEITEMRMFSVADQRSIPEIDIHTLVAFACRELLLSED
VRARAAQLAARHPAAESTVTGSASDMLAKLAEGIAVDGMEAVLPVLWSDG
HALLTDQLPDGTPVLVCDPEKVRTRAADLIRTGREFLEASWSVAALGTAE
NQAPVDVEQLGGSGFVELDQVRAAAARTGHPWWTLSQLSDESAIELDVRA
APSARGHQRDIDEIFAMLRAHIATGGYAALVAPGTGTAHRVVERLSESDT
PAGMLDPGQAPKPGVVGVLQGPLRDGVIIPGANLVVITETDLTGSRVSAA
EGKRLAAKRRNIVDPLALTAGDLVVHDQHGIGRFVEMVERTVGGARREYL
VLEYASAKRGGGAKNTDKLYVPMDSLDQLSRYVGGQAPALSRLGGSDWAN
TKTKARRAVREIAGELVSLYAKRQASPGHAFSPDTPWQAELEDAFGFTET
VDQLTAIEEVKADMEKPIPMDRVICGDVGYGKTEIAVRAAFKAVQDGKQV
AVLVPTTLLADQHLQTFGERMSGFPVTIKGLSRFTDAAESRAVIDGLADG
SVDIVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVDVL
TMSATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQIAAAL
RRELLRDGQAFYVHNRVSSIDAAAARVRELVPEARVVVAHGQMPEDLLET
TVQRFWNREHDILVCTTIVETGLDISNANTLIVERADTFGLSQLHQLRGR
VGRSRERGYAYFLYPPQVPLTETAYDRLATIAQNNELGAGMAVALKDLEI
RGAGNVLGIEQSGHVAGVGFDLYVRLVGEALETYRDAYRAAADGQTVRTA
EEPKDVRIDLPVDAHLPPDYIASDRLRLEGYRRLAAASSDREVAAVVDEL
TDRYGALPEPARRLAAVARLRLLCRGSGITDVTAASAATVRLSPLTLPDS
AQVRLKRMYPGAHYRATTATVQVPIPRAGGLGAPRIRDVELVQMVADLIT
ALAGKPRQHIGITNPSPPGEDGRGRNTTIKERQP
>MT3396 nei, endonuclease VIII
MPEGDTVWHTAATLRRHLAGRTLTRCDIRVPRFAAVDLTGEVVDEVISRG
KHLFIRTGTASIHSHLQMDGSWRVGNRPVRVDHRARIILEANQQEQAIRV
VGVDLGLLEVIDRHNDGAVVAHLGPDLLADDWDPQRAAANLIVAPDRPIA
EALLDQRVLAGIGNVYCNELCFVSGVLPTAPVSAVADPRRLVTRARDMLW
VNRFRWNRCTTGDTRAGRRLWVYGRAGQGCRRCGTLIAYDTTDERVRYWC
PACQR
>MT1357 ogt, methylated-DNA--protein-cysteinemethyltransferase
MATAGEDRMIHYRTIDSPIGPLTLAGHGSVLTNLRMLEQTYEPSRTHWTP
DPGAFSGAVDQLNAYFAGELTEFDVELDLRGTDFQQRVWKALLTIPYGET
RSYGEIADQIGAPGAARAVGLANGHNPIAIIVPCHRVIGASGKLTGYGGG
INRKRALLELEKSRAPADLTLFD
>MT0976 pcrA, ATP-dependent helicase PcrA
MSVHATDAKPPGPSPADQLLDGLNPQQRQAVVHEGSPLLIVAGAGSGKTA
VLTRRIAYLMAARGVGVGQILAITFTNKAAAEMRERVVGLVGEKARYMWV
STFHSTCVRILRNQAALIEGLNSNFSIYDADDSRRLLQMVGRDLGLDIKR
YSPRLLANAISNLKNELIDPHQALAGLTEDSDDLARAVASVYDEYQRRLR
AANALDFDDLIGETVAVLQAFPQIAQYYRRRFRHVLVDEYQDTNHAQYVL
VRELVGRDSNDGIPPGELCVVGDADQSIYAFRGATIRNIEDFERDYPDTR
TILLEQNYRSTQNILSAANSVIARNAGRREKRLWTDAGAGELIVGYVADN
EHDEARFVAEEIDALAEGSEITYNDVAVFYRTNNSSRSLEEVLIRAGIPY
KVVGGVRFYERKEIRDIVAYLRVLDNPGDAVSLRRILNTPRRGIGDRAEA
CVAVYAENTGVGFGDALVAAAQGKVPMLNTRAEKAIAGFVEMFDELRGRL
DDDLGELVEAVLERTGYRRELEASTDPQELARLDNLNELVSVAHEFSTDR
ENAAALGPDDEDVPDTGVLADFLERVSLVADADEIPEHGAGVVTLMTLHT
AKGLEFPVVFVTGWEDGMFPHMRALDNPTELSEERRLAYVGITRARQRLY
VSRAIVRSSWGQPMLNPESRFLREIPQELIDWRRTAPKPSFSAPVSGAGR
FGSARPSPTRSGASRRPLLVLQVGDRVTHDKYGLGRVEEVSGVGESAMSL
IDFGSSGRVKLMHNHAPVTKL
>MT3775 pdg, ultraviolet N-glycosylase/AP lyase
MTAAKSSRSKPAARAADVPGRWSAETRLALVRRARRMNRALAQAFPHVYC
ELDFTTPLELAVATILSAQSTDKRVNLTTPALFARYRTARDYAQADRTEL
ESLIRPTGFYRNKAASLIGLGQALVERFGGEVPATMDKLVTLPGVGRKTA
NVILGNAFGIPGITVDTHFGRLVRRWRWTTAEDPVKVEQAVGELIERKEW
TLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAPLVQG
PETDHLLALAGL
>MT1665 polA, DNA polymerase I
MVTTASAPSEDRAKPTLMLLDGNSLAFRAFYALPAENFKTRGGLTTNAVY
GFTAMLINLLRDEAPTHIAAAFDVSRQTFRLQRYPEYKANRSSTPDEFAG
QIDITKEVLGALGITVLSEPGFEADDLIATLATQAENEGYRVLVVTGDRD
ALQLVSDDVTVLYPRKGVSELTRFTPEAVVEKYGLTPRQYPDFAALRGDP
SDNLPGIPGVGEKTAAKWIAEYGSLRSLVDNVDAVRGKVGDALRANLASV
VRNRELTDLVRDVPLAQTPDTLRLQPWDRDHIHRLFDDLEFRVLRDRLFD
TLAAAGGPEVDEGFDVRGGALAPGTVRQWLAEHAGDGRRAGLTVVGTHLP
HGGDATAMAVAAADGEGAYLDTATLTPDDDAALAAWLADPAKPKALHEAK
AAVHDLAGRGWTLEGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA
ETPQQQQLSLLDDDDTDAETIQTTILRARAVIDLADALDAELARIDSTAL
LGEMELPVQRVLAKMESAGIAVDLPMLTELQSQFGDQIRDAAEAAYGVIG
KQINLGSPKQLQVVLFDELGMPKTKRTKTGYTTDADALQSLFDKTGHPFL
QHLLAHRDVTRLKVTVDGLLQAVAADGRIHTTFNQTIAATGRLSSTEPNL
QNIPIRTDAGRRIRDAFVVGDGYAELMTADYSQIEMRIMAHLSGDEGLIE
AFNTGEDLHSFVASRAFGVPIDEVTGELRRRVKAMSYGLAYGLSAYGLSQ
QLKISTEEANEQMDAYFARFGGVRDYLRAVVERARKDGYTSTVLGRRRYL
PELDSSNRQVREAAERAALNAPIQGSAADIIKVAMIQVDKALNEAQLASR
MLLQVHDELLFEIAPGERERVEALVRDKMGGAYPLDVPLEVSVGYGRSWD
AAAH
>MT2806 recA, recA protein, intein-containing
MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTGSIALDV
ALGIGGLPRGRVIEIYGPESSGKTTVALHAVANAQAAGGVAAFIDAEHAL
DPDYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDIVVIDSVAAL
VPRAELEGEMGDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLRDKI
GVMFGSPETTTGGKALKFYASVRMDVRRVETLKDGTNAVGNRTRVKVVKN
KCLAEGTRIFDPVTGTTHRIEDVVDGRKPIHVVAAAKDGTLHARPVVSWF
DQGTRDVIGLRIAGGAIVWATPDHKVLTEYGWRAAGELRKGDRVAQPRRF
DGFGDSAPIPADHARLLGYLIGDGRDGWVGGKTPINFINVQRALIDDVTR
IAATLGCAAHPQGRISLAIAHRPGERNGVADLCQQAGIYGKLAWEKTIPN
WFFEPDIAADIVGNLLFGLFESDGWVSREQTGALRVGYTTTSEQLAHQIH
WLLLRFGVGSTVRDYDPTQKRPSIVNGRRIQSKRQVFEVRISGMDNVTAF
AESVPMWGPRGAALIQAIPEATQGRRRGSQATYLAAEMTDAVLNYLDERG
VTAQEAAAMIGVASGDPRGGMKQVLGASRLRRDRVQALADALDDKFLHDM
LAEELRYSVIREVLPTRRARTFDLEVEELHTLVAEGVVVHNCSPPFKQAE
FDILYGKGISREGSLIDMGVDQGLIRKSGAWFTYEGEQLGQGKENARNFL
VENADVADEIEKKIKEKLGIGAVVTDDPSNDGVLPAPVDF
>MT0658 recB, exodeoxyribonuclease V, beta subunit
MDRFELLGPLPREGTTTVLEASAGTGKTFALAGLVTRYLAETAATLDEML
LITFNRAASRELRERVRGQIVEAVGALQGDAPPSGELVEHLLRGSDAERA
QKRSRLRDALANFDAATIATTHEFCGSVLKSLGVAGDNAADVELKESLTD
LVTEIVDDRYLANFGRQETDPELTYAEALALALAVVDDPCAQLRPPDPEP
GSKAAVRLRFAAEVLEELERRKGRLRAQGFNDLLIRLATALEAADSPARD
RMRERWRIVLVDEFQDTDPMQWRVLERAFSRHSALILIGDPKQAIYGFRG
GDIHTYLKAAGTADARYTLGVNWRSDRALVESLQTVLRDATLGHADIVVR
GTDAHHAGHRLASAPRPAPFRLRVVKRHTLGYDGTAHVPIEALRRHIPDD
LAADVAALLASGATFAGRPVVAADIAVIVEHHKDARACRNALAEAGIPAI
YTGDTDVFASQAAKDWLCLLEAFDAPQRSGLVRAAACTMFFGETAESLAA
EGDALTDRVAGTLREWADHARHRGVAAVFQAAQLAGMGRRVLSQRGGERD
LTDLAHIAQLLHEAAHRERLGLPGLRDWLRRQAKAGAGPPEHNRRLDSDA
AAVQIMTVFVAKGLQFPIVYLPFAFNRNVRSDDILLYHDDGTRCLYIGGK
DGGAQRRTVEGLNRVEAAHDNLRLTYVALTRAQSQVVAWWAPTFDEVNGG
LSRLLRGRRPGQSQVPDRCTPRVTDEQAWAVFAQWEAAGGPSVEESVIGA
RSSLEKPVPVPGFEVRHFHRRIDTTWRRTSYSDLVRGSEAVTVTSEPAAG
GRADEVEIAVVAAPGSGADLTSPLAALPSGASFGSLVHAVLETADPAAPD
LAAELEAQVRRHAPWWTVDVDHAQLAPELARALLPMHDTPLGPAAAALTL
RQIGVRDRLRELDFEMPLAGGDLRGRSPDVSLADVGELLASHLPGDDPLS
PYADRLGSAGLGDQPLRGYLAGSIDVVLRLPGQRYLVVDYKTNHLGDTAA
DYGFERLTEAMLHSDYPLQALLYVVVLHRFLRWRQRDYAPARHLGGVLYL
FVRGMCGAATPVTAGHPAGVFTWNPPTALVVALSDLLDRGRLQS
>MT0659 recC, exodeoxyribonuclease V, gamma subunit
MALHLHRAERTDLLADGLGALLADPQPDPFAQELVLVAARGVERWLSQRL
SLVLGCGPGRADGVCAGIAFRNPQSLIAEITGTLDDDPWSPEALAWPLLA
VIDASLDEPWCRTLASHLGHFATTDAEAELRRGRRYSVARRLAGLFASYA
RQRPGLLAAWLDGDLGELPGDLAWQPPLWRALVTTVGADPPHVRHDKTIA
RLRDGPADLPARLSLFGHTRLACTDVQLLDALAVHHDLHLWLPHPSDELW
RALAGFQGADGLLPRRQDTSRRAAQHPLLETLGRDVRELQRALPAARATD
EFLGATTKPDTLLGWLQADIAGNAPRPAGRSLSDADRSVQVHACHGPARQ
IDVLREVLLGLLEDDPTLQPRDIVVMCPDIDTYAPLIVAGFGLGEVAGDC
HPAHRLRVRLADRALTQTNPLLSVAAELLTIAETRATASQLLNLAQAAPV
RAKFGFADDDLDTITTWVRESNIRWGFDPTHRRRYGLDTVVHNTWRLGLD
RILTGVAMSEDSQAWLDTALPLDDVGSNRVELAGRLAEFVERLHHVVGGL
SGARPLVAWLDALATGIDLLTACNDGWQRAQVQREFADVLARAGSRAAPL
LRLPDVRALLDAQLAGRPTRANFRTGTLTVCTMVPMRSVPHRVVCLVGLD
DGVFPRLSHPDGDDVLAREPMTGERDIRSEDRQLLLDAIGAATQTLVITY
TGADERTGQPRPPAVPLAELLDALDQTTSAPVRERILVTHPLQPFDRKNV
TPGALLGAKPFTFDPAALAAAQAAAGKRCPPTAFISGRLPAPPAADVTLA
DLLDFFKDPVKGFFRALDYTLPWDVDTVEDSIPVQVDALAEWTVGERMLR
DMLRGLHPDDAAHSEWRRGTLPPGRLGVRRAKEIRNRARDLAAAALAHRD
GHGQAHDVDVDLGDGRRLSGTVTPVFGGRTVSVTYSKLAPKHVLPAWIGL
VTLAAQEPGREWSALCIGRSKTRNHIARRLFVPPPDPVAVLRELVLLYDA
GRREPLPLPLKTSCAWAQARRDGQDPYPPARECWQTNRFRPGDDDAPAHV
RAWGPRAPFEVLLGKPRAGEEVAGEETRLGALAARLWLPLLAAEGSV
>MT0657 recD, exodeoxyribonuclease V, alpha subunit
MVRAFNQAGVLDVSDVHVAQRLCALAGESDERVALAVAVAVRALRAGSVC
VDLLSIARVAGHDDLPWPDPADWLAAVRASPLLADPPVLHLYDDRLLYLD
RYWREEEQVCADLLALLTSRRPAGVPDLRRLFPTGFDEQRRAAEIALSQG
VTVLTGGPGTGKTTTVARLLALVAEQAELAGEPRPRIALAAPTGKAAARL
AEAVRREMAKLDATDRARLGDLHAVTLHRLLGAKPGARFRQDRQNRLPHN
VIVVDETSMVSLTLMARLAEAVRPGARLILVGDADQLASVEAGAVLADLV
DGFSVRDDALVAQLRTSHRFGKVIGTLAEAIRAGDGDAVLGLLRSGEERI
EFVDDEDPAPRLRAVLVPHALRLREAALLGASDVALATLDEHRLLCAHRD
GPTGVLHWNRRVQAWLAEETGQPPWTPWYAGRPLLVTANDYGLRVYNGDT
GVVLAGPTGLRAVISGASGPLDVATGRLGDVETMHAMTIHKSQGSQVDEV
TVLMPQEDSRLLTRELLYTAVTRAKRKVRVVGSEASVRAAIARRAVRASG
LRMRLQSTGCG
>MT0003 recF, recF protein
MYVRHLGLRDFRSWACVDLELHPGRTVFVGPNGYGKTNLIEALWYSTTLG
SHRVSADLPLIRVGTDRAVISTIVVNDGRECAVDLEIATGRVNKARLNRS
SVRSTRDVVGVLRAVLFAPEDLGLVRGDPADRRRYLDDLAIVRRPAIAAV
RAEYERVLRQRTALLKSVPGARYRGDRGVFDTLEVWDSRLAEHGAELVAA
RIDLVNQLAPEVKKAYQLLAPESRSASIGYRASMDVTGPSEQSDTDRQLL
AARLLAALAARRDAELERGVCLVGPHRDDLILRLGDQPAKGFASHGEAWS
LAVALRLAAYQLLRVDGGEPVLLLDDVFAELDVMRRRALATAAESAEQVL
VTAAVLEDIPAGWDARRVHIDVRADDTGSMSVVLP
>MT3051 recG, ATP-dependent DNA helicase RecG
MASLSDRLDRVLGATAADALDEQFGMRTVDDLLRHYPRSYVEGAARVGIG
DARPEAGEHITIVDVITDTYSFPMKKKPNRKCLRITVGGGRNKVTATFFN
ADYIMRDLTKHTKVMLSGEVGYYKGAMQLTHPAFLILDSPDGKNHGTRSL
KSIADASKAISGELVVEEFERRFFPIYPASTKVQSWDIFKCVRQVLDVLD
RVDDPLPAELRAKHGLIPEDEALRAIHLAESQSLRERARERLTFDEAVGL
QWALVARRHGELSESGPSAAWKSNGLAAELLRRLPFELTAGQREVLDVLS
DGLAANRPLNRLLQGEVGSGKTIVAVLAMLQMVDAGYQCALLAPTEVLAA
QHLRSIRDVLGPLAMGGQLGGAENATRVALLTGSMTAGQKKQVRAEIASG
QVGIVIGTHALLQEAVDFHNLGMVVVDEQHRFGVEQRDQLRAKAPAGITP
HLLVMTATPIPRTVALTVYGDLETSTLRELPLGRQPIATNVIFVKDKPAW
LDRAWRRIIEEAAAGRQAYVVAPRIDESDDTDVQGGVRPSATAEGLFSRL
RSAELAELRLALMHGRLSADDKDAAMAAFRAGEVDVLVCTTVIEVGVDVP
NATVMLVMDADRFGISQLHQLRGRIGRGEHPSVCLLASWVPPDTPAGQRL
RAVAGTMDGFALADLDLKERKEGDVLGRNQSGKAITLRLLSLAEHEEYIV
AARDFCIEAYKNPTDPALALMAARFTSTDRIEYLDKS
>MT1735 recN, DNA repair protein RecN
MLTELRIESLGAISVATAEFDRGFTVLTGETGTGKTMVVTGLHLLGGARA
DATRVRSGADRAVVEGRFTTTDLDDATVAGLQAVLDSSGAERDEDGSVIA
LRSISRDGPSRAYLGGRGVPAKSLSGFTNELLTLHGQNDQLRLMRPDEQR
GALDRFAAAGEAVQRYRKLRDAWLTARRDLVDRRNRARELAQEADRLKFA
LNEIDTVDPQPGEDVALVADIARLSELDTLREAATTARATLCGTPDADAF
DRGAVDSLGRARAALQSSDDAALRGLAEQVGEALTVVVDAVAELGAYLDE
LPADASALDAKLARQAQLRTLTRKYAADIDGVLRWADEARARLAQLDVSE
EGLAALERRTGELAHELGQAAVDLSTIRRKAAKRLAKEVSAELSALAMAD
AEFTIGVTTGLADHGDPVALALASGELARAGADGVDAVEFGFVAHRGMTV
LPLAKSASGGELSRVMLSLEVVLATSRKQAAGTTMVFDEIDAGVGGWAAV
QIGRRLARLARTHQVIVVTHLPQVAAYADVHLMVQRTGRDGASGVRRLTS
EDRVAELARMLAGLGDSDSGRAHARELLETAQNDELT
>MT3818 recR, recR protein
MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPSDIDRLTGVLAKVR
DGVRFCAVCGNVSDNERCRICSDIRRDASVVCIVEEPKDIQAVERTREFR
GRYHVLGGALDPLSGIGPDQLRIRELLSRIGERVDDVDVTEVIIATDPNT
EGEATATYLVRMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRR
VLA
>MT2670 ruvA, Holliday junction DNA helicase RuvA
MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEARLITAM
IVREDSMTLYGFPDGETRDLFLTLLSVSGVGPRLAMAALAVHDAPALRQV
LADGNVAALTRVPGIGKRGAERMVLELRDKVGVAATGGALSTNGHAVRSP
VVEALVGLGFAAKQAEEATDTVLAANHDATTSSALRSALSLLGKAR
>MT2669 ruvB, Holliday junction DNA helicase RuvB
MTERSDRDVSPALTVGEGDIDVSLRPRSLREFIGQPRVREQLQLVIEGAK
NRGGTPDHILLSGPPGLGKTSLAMIIAAELGSSLRVTSGPALERAGDLAA
MLSNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIP
LEVAPFTLVGATTRSGALTGPLRDRFGFTAHMDFYEPAELERVLARSAGI
LGIELGADAGAEIARRSRGTPRIANRLLRRVRDFAEVRADGVITRDVAKA
ALEVYDVDELGLDRLDRAVLSALTRSFGGGPVGVSTLAVAVGEEAATVEE
VCEPFLVRAGMVARTPRGRVATALAWTHLGMTPPVGASQPGLFE
>MT2671 ruvC, Holliday junction nuclease
MRVMGVDPGLTRCGLSLIESGRGRQLTALDVDVVRTPSDAALAQRLLAIS
DAVEHWLDTHHPEVVAIERVFSQLNVTTVMGTAQAGGVIALAAAKRGVDV
HFHTPSEVKAAVTGNGSADKAQVTAMVTKILALQAKPTPADAADALALAI
CHCWRAPTIARMAEATSRAEARAAQQRHAYLAKLKAAR
>MT0060 ssb, single-strand binding protein
MAGDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQTGEWKDG
EALFLRCNIWREAAENVAESLTRGARVIVSGRLKQRSFETREGEKRTVIE
VEVDEIGPSLRYATAKVNKASRSGGFGSGSRPAPAQTSSASGDDPWGSAP
ASGSFGGGDDEPPF
>MT1248 tag, DNA-3-methyladenine glycosidase I
MSGDGLVRCPWAEVRPGPDAQLYRDYHDNEWGRPLYGRVALFERMSLEAF
QSGLSWLIILRKRENFRRAFSGFDIDKIARYTDTDVRRLLADDGIVRNRA
KIEATIANARAAADLGSSEDLSELLWSFAPPPRPRPVDGSEIPSVSTESK
AMSRELKRRGFRFVGPTTAYALMQATGMVDDHIQACWVPTERPFDQPGCP
MAAR
>MT3053 ung, uracil-DNA glycosylase
MTARPLSELVERGWAAALEPVADQVAHMGQFLRAEIAAGRRYLPAGSNVL
RAFTFPFDNVRVLIVGQDPYPTPGHAVGLSFSVAPDVRPWPRSLANIFDE
YTADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEAVTE
CAIRALAARAAPLVAILWGRDASTLKPMLAAGNCVAIESPHPSPLSASRG
FFGSRPFSRANELLVGMGAEPIDWRLP
>MT1675 uvrA, excinuclease ABC, subunit A
MSFSERDSVADRLIVKGAREHNLRSVDLDLPRDALIVFTGLSGSGKSSLA
FDTIFAEGQRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTN
RNPRSTVGTITEVYDYLRLLYARAGTPHCPTCGERVARQTPQQIVDQVLA
MPEGTRFLVLAPVVRTRKGEFADLFDKLNAQGYSRVRVDGVVHPLTDPPK
LKKQEKHDIEVVVDRLTVKAAAKRRLTDSVETALNLADGIVVLEFVDHEL
GAPHREQRFSEKLACPNGHALAVDDLEPRSFSFNSPYGACPECSGLGIRK
EVDPELVVPDPDRTLAQGAVAPWSNGHTAEYFTRMMAGLGEALGFDVDTP
WRKLPAKARKAILEGADEQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMS
QTESEQMKERYEGFMRDVPCPVCAGTRLKPEILAVTLAGESKGEHGAKSI
AEVCELSIADCADFLNALTLGPREQAIAGQVLKEIRSRLGFLLDVGLEYL
SLSRAAATLSGGEAQRIRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLI
ETLTRLRDLGNTLIVVEHDEDTIEHADWIVDIGPGAGEHGGRIVHSGPYD
ELLRNKDSITGAYLSGRESIEIPAIRRSVDPRRQLTVVGAREHNLRGIDV
SFPLGVLTSVTGVSGSGKSTLVNDILAAVLANRLNGARQVPGRHTRVTGL
DYLDKLVRVDQSPIGRTPRSNPATYTGVFDKIRTLFAATTEAKVRGYQPG
RFSFNVKGGRCEACTGDGTIKIEMNFLPDVYVPCEVCQGARYNRETLEVH
YKGKTVSEVLDMSIEEAAEFFEPIAGVHRYLRTLVDVGLGYVRLGQPAPT
LSGGEAQRVKLASELQKRSTGRTVYILDEPTTGLHFDDIRKLLNVINGLV
DKGNTVIVIEHNLDVIKTSDWIIDLGPEGGAGGGTVVAQGTPEDVAAVPA
SYTGKFLAEVVGGGASAATSRSNRRRNVSA
>MT1669 uvrB, excinuclease ABC, subunit B
MAFCARSPHGVSAGGSRLVGVAFATEHPVVAHSEYRAVEEIVRAGGHFEV
VSPHAPAGDQPAAIDELERRINAGERDVVLLGATGTGKSATTAWLIERLQ
RPTLVMAPNKTLAAQLANELREMLPHNAVEYFVSYYDYYQPEAYIAQTDT
YIEKDSSINDDVERLRHSATSALLSRRDVVVVASVSCIYGLGTPQSYLDR
SVELKVGEEVPRDGLLRLLVDVQYTRNDMSFTRGSFRVRGDTVEIIPSYE
ELAVRIEFFGDEIEALYYLHPLTGEVIRQVDSLRIFPATHYVAGPERMAH
AVSAIEEELAERLAELESQGKLLEAQRLRMRTNYDIEMMRQVGFCSGIEN
YSRHIDGRGPGTPPATLLDYFPEDFLLVIDESHVTVPQIGGMYEGDISRK
RNLVEYGFRLPSACDNRPLTWEEFADRIGQTVYLSATPGPYELSQTGGEF
VEQVIRPTGLVDPKVVVKPTKGQIDDLIGEIRTRADADQRVLVTTLTKKM
AEDLTDYLLEMGIRVRYLHSEVDTLRRVELLRQLRLGDYDVLVGINLLRE
GLDLPEVSLVAILDADKEGFLRSSRSLIQTIGRAARNVSGEVHMYADKIT
DSMREAIDETERRRAKQIAYNEANGIDPQPLRKKIADILDQVYREADDTA
VVEVGGSGRNASRGRRAQGEPGRAVSAGVFEGRDTSAMPRAELADLIKDL
TAQMMAAARDLQFELAARFRDEIADLKRELRGMDAAGLK
>MT1463 uvrC, excinuclease ABC, subunit C
MPDPATYRPAPGSIPVEPGVYRFRDQHGRVIYVGKAKSLRSRLTSYFADV
ASLAPRTRQLVTTAAKVEWTVVGTEVEALQLEYTWIKEFDPRFNVRYRDD
KSYPVLAVTLGEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRV
FPARTCSAGVFKRHRQIDRPCLLGYIDKCSAPCIGRVDAAQHRQIVADFC
DFLSGKTDRFARALEQQMNAAAEQLDFERAARLRDDLSALKRAMEKQAVV
LGDGTDADVVAFADDELEAAVQVFHVRGGRVRGQRGWIVEKPGEPGDSGI
QLVEQFLTQFYGDQAALDDAADESANPVPREVLVPCLPSNAEELASWLSG
LRGSRVVLRVPRRGDKRALAETVHRNAEDALQQHKLKRASDFNARSAALQ
SIQDSLGLADAPLRIECVDVSHVQGTDVVGSLVVFEDGLPRKSDYRHFGI
REAAGQGRSDDVACIAEVTRRRFLRHLRDQSDPDLLSPERKSRRFAYPPN
LYVVDGGAPQVNAASAVIDELGVTDVAVIGLAKRLEEVWVPSEPDPIIMP
RNSEGLYLLQRVRDEAHRFAITYHRSKRSTRMTASALDSVPGLGEHRRKA
LVTHFGSIARLKEATVDEITAVPGIGVATATAVHDALRPDSSGAAR
>MT2962 xerC, tyrosine recombinase XerC
MRRVDSGSRRHACDCGGVQAILDEFDEYLALQCGRSVHTRRAYLGDLRSL
FAFLADRGSSLDALTLSVLRSWLAATAGAGAARTTLARRTSAVKAFTAWA
VRRGLLAGDPAARLQVPKARRTLPAVLRQDQALRAMAAAESGAEQGDPLA
LRDRLIVELLYATGIRVSELCGLDVDDIDTGHRLVRVLGKGNKQRTVPFG
QPAADALHAWLVDGRRALVTAESGHALLLGARGRRLDVRQARTAVHQTVA
AVDGAPDMGPHGLRHSAATHLLEGGADLRVVQELLGHSSLATTQLYTHVA
VARLRAVHERAHPRA
>MT1740 xerD, integrase/recombinase XerD
MKTLALQLQGYLDHLTIERGVAANTLSSYRRDLRRYSKHLEERGITDLAK
VGEHDVSEFLVALRRGDPDSGTAALSAVSAARALIAVRGLHRFAAAEGLA
ELDVARAVRPPTPSRRLPKSLTIDEVLSLLEGAGGDKPSDGPLTLRNRAV
LELLYSTGARISEAVGLDLDDIDTHARSVLLRGKGGKQRLVPVGRPAVHA
LDAYLVRGRPDLARRGRGTAAIFLNARGGRLSRQSAWQVLQDAAERAGIT
AGVSPHMLRHSFATHLLEGGADVRVVQELLGHASVTTTQIYTLVTVHALR
EVWAGAHPRAR
>MT1139 xseA, exodeoxyribonuclease, large subunit
MTQNSAENPFPVRAVAIRVAGWIDKLGAVWVEGQLAQITMRPDAKTVFMV
LRDPAADMSLTVTCSRDLVLSAPVKLAEGVQVVVCGKPSFYTGRGTFSLR
LSEIRAVGIGELLARIDRLRRLLDAEGLFDPRLKRPIPYLPNMIGLITGR
ASAAERDVTTVASARWPAARFAVRNVAVQGPNAVGQIVEALRELDRDPDV
DVIVLARGGGSVEDLLPFSDETLCRAIAACRTPVVSAVGHEPDNPLCDLV
VDLRAATPTDAAKKVVPDTAAEQRLIDDLRRRSAQALRNWVSREQRAVAQ
LRSRPVLADPMTMVSVRAEEVHRARSTLRRNLTLMVAAETERIGHLAARL
ATLGPAATLARGYAIVQTVAQTGPEGGSEPQVLRSVHDAPEGTKLRVRVA
DGALAAVSEGQTNGL
>MT1138 xseB, exodeoxyribonuclease, small subunit
MKDKPMVCDPNGDDTGRTHATVPVSQLGYEACRDELMEVVRLLEQGGLDL
DASLRLWERGEQLAKRCEEHLAGARQRVSDVLAGDEAQNG
>MT0442 xth, exodeoxyribonuclease III
MPDGTIDGGHPQRPASPRLRSPLLRLATWNVNSIRTRLDRVLDWLGRADV
DVLAMQETKCPDGQFPALPLFELGYDVAHVGFDQWNGVAIASRVGLDDVR
VGFDGQPSWSGKPEVAATTEARALGATCGGIRVWSLYVPNGRALDDPHYT
YKLDWLAALRDTAEGWLRDDPAAPIALMGDWNIAPTDDDVWSTEFFAGCT
HVSEPERKAFNAIVDAQFTDVVRPFTPGPGVYTYWDYTQLRFPKKQGMRI
DFILGSPALAARVMDAQIVREERKGKAPSDHAPVLVDLHAG