Gene list
Applied filters:
COG category: Replication, recombination and repair
Organism: Mycobacterium tuberculosis CDC1551, CDC1551
Gene type: CDS
Number of genes found: 180
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Mycobacterium tuberculosis CDC1551, CDC1551 >MT3695 A/G-specific adenine glycosylase, putative MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFM LQQTPAARVLAIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRL HECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQRVPVVDT NVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGA TVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLL DVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP GEGF >MT1304 serine/threonine protein kinase MSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLMTAE FSKDPVFRERMKREARIAGRLQEPHVVPIHDYGEVDGQMFLEMRLVEGTD LDSVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILITRD DFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADIYA LACVLHECLTGAPPYRADSAGTLVSSHLMGPIPQPSAIRPGIPKAFDAVV ARGMAKKPEDRYASAGDLALAAHEALSDPDQDHAADILRRSQESTLPAPP KPVPPPTMPATAMAPRQPPAPPVTPPGVQPAPKPSYTPPAQPGPAGQRPG PTGQPSWAPNSGPMPASGPTPTPQYYQGGGWGAPPSGGPSPWAQTPRKTN PWPLVAGAAAVVLVLVLGAIGIWIAIRPKPVQPPQPVAEERLSALLLNSS EVNAVMGSSSMQPGKPITSMDSSPVTVSLPDCQGALYTSQDPVYAGTGYT AINGLISSEPGDNYEHWVNQAVVAFPTADKARAFVQTSADKWKNCAGKTV TVTNKAKTYRWTFADVKGSPPTITVIDTQEGAEGWECQRAMSVANNVVVD VNACGYQITNQAGQIAAKIVDKVNKE >MT2888 CRISPR-associated protein, TM1792 family MTTSYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLSRLPMIP GTSLKGKVRTLLSRQYGADTETFYRKPNEDHAHIRRLFGDTEEYMTGRLV FRDTKLTNKDDLEARGAKTLTEVKFENAINRVTAKANLRQMERVIPGSEF AFSLVYEVSFGTPGEEQKASLPSSDEIIEDFNAIARGLKLLELDYLGGSG TRGYGQVKFSNLKARAAVGALDGSLLEKLNHELAAV >MT3935 IS1537, transposase MMARFEVPEGWCVQAFRFTLDPTEDQARALARHFGARRKAYNWAVATLKA DIEAWRVTGIGTVKPSLRVLRKRWNTVKDEVCVNAETGAVWWPECSKEAY ADGIGGAVDAYWNWQNSRSGKREGKTMGFPRFKKKGRDQDRVTFTTGAMR VEPDRRHLTLPVVGTVRTHENTRRIERLIATGRARVLAISVRRNGTRLDA SVRVLVQRPQQPNVAQPGSRVGVDVGVRRLATVANEAGAVLEEVPNPRPL DAALKELRYASRARSRCTKGSRRYRERTTEISRLHRRVNDVRTHHLHVLT TRLAQTHGHIVVEGLDAAGMLRQKGLPGARARRRGLSDSALGTPRRHLSY KTGWYGSALVVADRWFPSLSVEPTVRPGLARLVAVKRGREAAAWLPNNPE TGCKSRDH >MT3057 IS1538, resolvase MNLATWAERNGVAPGTAYRWFRAGLLSVMARRVGRLILVDEPAGDAGMRS PTAVYARVSSADQKADLDRQVARVTAWATAQQMPVDKVVTEVGSAFNEHR RKFLSLLRDPSVHRIVVEHRDRFCRLGSKYVQAAFAAQGRELVVVDSAEV DDDLVRDMTEILTSMCARLYGKRAAENRTKRALAAAAGEDHEAA >MT0635 IS1536, transposase, truncated MANVLLRTGPSGPSRLPLSMIMRRPEMPRLEIPNGWCVQAFRFTLDPTAE QAHALARHFGARRKAYNWTVAQLKADIQAWRATGAQTAKPSLRVLRKRWN TVKDEVCVNAETGTVWWPECSKEAYADGIAGAVDAYWNWQQRRAGKRDGK RMGFPRFKKKGRDADRVSFTTGAMRVEPDRRHLTLPVIGCVRTHENTRRI ERLIAKDRARVLAITVRRNGTRLDASVRVLVQRPQQPNVELPESRIGVDV GVRRLATVATADGACCPVLVPDG >MT1589 DNA-damage-inducible protein P, putative MIAIAKTSSVAPNSITGVESRWVLHLDMDAFFASVEQLTRPTLRGRPVLV GGLGGRGVVAGASYEARAYGARSAMPMHQARRLIGVTAVVLPPRGVVYGI ASRRVFDTVRGLVPVVEQLSFDEAFAEPPQLAGAVAEDVETFCERLRRRV RDETGLIASVGAGSGKQIAKIASGLAKPDGIRVVRHAEEQALLSGLPVRR LWGIGPVAEEKLHRLGIETIGQLAALSDAEAANILGATIGPALHRLARGI DDRPVVERAEAKQISAESTFAVDLTTMEQLHEAIDSIAEHAHQRLLRDGR GARTITVKLKKSDMSTLTRSATMPYPTTDAGALFTVARRLLPDPLQIGPI RLLGVGFSGLSDIRQESLFADSDLTQETAAAHYVETPGAVVPAAHDATMW RVGDDVAHPELGHGWVQGAGHGVVTVRFETRGSGPGSARTFPVDTGDISN ASPLDSLDWPDYIGQLSVEGSAGASAPTVDDVGDR >MT1803 IS6110, transposase MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR RILGWRVASTMATSMVLDAIEQAIWTRQQESVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG >MT0077 conserved hypothetical protein MSSITVSVDPVDPVDPVDPVDPVDPVDAVVAAGSDGLTVARIESEIGALE FLNELRTELKSGQFRPQPVRERKIPKPGGLGKVRRLGIPTVADRVVQAAL KLVLEPIFETDFEPVSYGFRPARRAHDTIAEIHLFGTQEYRWVLDADIKA CFDRIDHADLMDRVRHRIKDKRVLRLVNWQRIRHRWNWTDVRRWLTDPTG RWHPISADGITLFNPAAVPIRRYRYRGNTIPTPWTQAV >MT3501 hypothetical protein MANRVIACSATARAAGVRRGLRRREAAARCPQLFIATADADRDARLFEGV IAAVDDLVPRAELLRPGLLVLPVRGPARFFGSEQMAAERLIDAVAAAGAE CQVGIADRLSTAVFAARAGRIVEPGGDARFLSLLSIRQLATEPSLSGPGR DDLTDLLWRMGIRTIGQFAALSRTDVASRFGADAVAAHRFARGEPERAPC GREPPPDLAAELACDPPIDRVDAAAFAGRSLAAELHRALMAAGVGCTRLA IHAVTANGEERSRVWRCAEPLTEDATADRVRWQLDGWLNNRNARDRPTAA VTLLRLQAVETVSASEGLQLPLWGGLGEQDRLRARRALVRVQGLLGPEAV RVPVLSGGHGPAERITLTVLGLVAPEPVPQADPGQPWPGRLPDPSPAVLF DDPVDLLDAQGNPIRVTSRGMFSADPARLRVRGRDDRLRWWAGPWPDDER WWDPDRASGRTARAQVLLDGDPGTALLLCYRQRRWYLEGSYE >MT3056 IS1538, transposase MPKFEVPDGWTVQAFRFTLDPTEDQAKALARHFGARRKAYNWTVATLKAD IQAWHASGTVTAKPSLRVLRKRWNTVKDDVCVNTETGVAWWPECSKEAYA DGIAGAVEAYWNWQTSRAGKRAGKRVGFPRFKRKGRDQDRVSFTTGAMRV EPDRRHLTLPVIGTVRTHENTRRIERLIKAGRARVLAISVRRNGTRLDAS VRVLVQRPQQPKVVHPGSRVGVDVGVRRLATVATADGTAIEQVENPRPLG AALRELRHVCRARSRCTKGSRRYRERTTQISRLHRRVNDVRTHHLHVLTT RLAQTHGRIVVEGLDATEMLRQKGLPGARARRRGLSDAALGTPRRHLSYK TVWYGSALVVADRWFPSSKTCHACRHVQDIGWDEQWQCDRCSVVHQRDDC AAINLARYEETSSIVGPVGAAVKRGADRKTGPRPAGGCEARKGSSPKAAE QPRDGVQVA >MT3148 DNA ligase MLLHDVAITSMDVAATSSRLTKVARIAALLHRAAPDTQLVTIIVSWLSGE LPQRHIGVGWAALRSLPPPAPQPALTVTGVDATLSKIGTLSGKGSQAQRA ALVAELFSAATEAEQTFLLRLLGGELRQGAKGGIMADAVAQAAGLPAATV QRAAMLGGDLAAAAAAGLSGAALDTFTLRVGRPIGPMLAQTATSVHDALE RHGGTTIFEAKLDGARVQIHRANDQVRIYTRSLDDVTARLPEVVEATLAL PVRDLVADGEAIALCPDNRPQRFQVTASRFGRSVDVAAARATQPLSVFFF DILHRDGTDLLEAPTTERLAALDALVPARHRVDRLITSDPTDAANFLDAT LAAGHEGVMAKAPAARYLAGRRGAGWLKVKPVHTLDLVVLAVEWGSGRRR GKLSNIHLGARDPATGGFVMVGKTFKGMTDAMLDWQTTRFHEIAVGPTDG YVVQLRPEQVVEVALDGVQRSSRYPGGLALRFARVVRYRADKDPAEADTI DAVRALY >MT2553 single-strand DNA binding protein MFETPLTVVGHIVNDLQRRKVGDQEVVKFRVASNSRRRTSDGGWEPGNSL FITVNCWGRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGIRRSSLEMRA TSVGPDLSRVIVRIEKPAYTGPSAGDLPAATGTGAAGAADAPASAADSVS DVVVDDAITGHNPLPISA >MT1363 conserved hypothetical protein MSRVRLVIAQCTVDYIGRLTAHLPSARRLLLFKADGSVSVHADDRAYKPL NWMSPPCWLTEESGGQAPVWVVENKAGEQLRITIEGIEHDSSHELGVDPG LVKDGVEAHLQALLAEHIQLLGEGYTLVRREYMTAIGPVDLLCRDERGGS VAVEIKRRGEIDGVEQLTRYLELLNRDSVLAPVKGVFAAQQIKPQARILA TDRGIRCLTLDYDTMRGMDSGEYRLF >MT0018 serine/threonine protein kinase MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKSEFSS DPEFIERFRAEARTTAMLNHPGIASVHDYGESQMNGEGRTAYLVMELVNG EPLNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILIT PTGQVKITDFGIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDV YSLGVVGYEAVSGKRPFAGDGALTVAMKHIKEPPPPLPPDLPPNVRELIE ITLVKNPAMRYRSGGPFADAVAAVRAGRRPPRPSQTPPPGRAAPAAIPSG TTARVAANSAGRTAASRRSRPATGGHRPPRRTFSSGQRALLWAAGVLGAL AIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDASPRLNWTE RGETRHSGLQSWVVPPTPHSRASLARYEIAQ >MT0873 IS1606', transposase MTRDPHSPDCGREGSYRDTITRPLTDLPVAGYPLVPRVASPRYRCTTPQC GRAVFNQDLANVDQYLVVNQLAHQLIDGSSLIPDADKRWDARRHADMTHH LTSSLKENQS >MT2985 RNA helicase, putative MLHFTAATSRFRLGRERANSVRSDGGWGVLQPVSATFNPPLRGWQRRALV QYLGTQPRDFLAVATPGSGKTSFALRIAAELLRYHTVEQVTVVVPTEHLK VQWAHAAAAHGLSLDPKFANSNPQTSPEYHGVMVTYAQVASHPTLHRVRT EARKTLVVFDEIHHGGDAKTWGDAIREAFGDATRRLALTGTPFRSDDSPI PFVSYQPDADGVLRSQADHTYGYAEALADGVVRPVVFLAYSGQARWRDSA GEEYEARLGEPLSAEQTARAWRTALDPEGEWMPAVITAADRRLRQLRAHV PDAGGMIIASDRTTARAYARLLTTMTAEEPTVVLSDDPGSSARITEFAQG TGRWLVAVRMVSEGVDVPRLSVGVYATNASTPLFFAQAIGRFVRSRRPGE TASIFVPSVPNLLQLASALEVQRNHVLGRPHRESAHDPLDGDPATRTQTE RGGAERGFTALGADAELDQVIFDGSSFGTATPTGSDEEADYLGIPGLLDA EQMRALLHRRQDEQLRKRAQLQKGATQPATSGASASVHGQLRDLRRELHT LVSIAHHRTGKPHGWIHDELRRRCGGPPIAAATRAQIKARIDALRQLNSE RS >MT3107 IS1081, transposase MTSSHLIDAEQLLADQLAQASPDLLRGLLSTFIAALMGAEADALCGAGYR ERSDERSNQRNGYRHRDFDTRAATIDVAIPKLRQGSYFPDWLLQRRKRAE RALTSVVATCYLLGVSTRRMERLVETLGVTKLSKSQVSIMAKELDEAVEA FRTRPLDAGPYTFLAADALVLKVREAGRVVGVHTLIATGVNAEGYREILG IQVTSAEDGAGWLAFFRDLVARGLSGVALVTSDAHAGLVAAIGATLPAAA WQRCRTHYAANLMAATPKPSWPWVRTLLHSIYDQPDAESVVAQYDRVLDA LTDKLPAVAEHLDTARTDLLAFTAFPKKIWRQIWSNNPQERLNREVRRRT DVVGIFPDRASIIRLVGAVLAEQHDEWIEGRRYLGLEVLTRARAALTSTE EPAKQQTTNTPALTT >MT0414 IS6110, hypothetical protein MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA AELDRPAR >MT3291 ATP-dependent DNA helicase MSIASDPLIAGLDDQQREAVLAPRGPVCVLAGAGTGKTRTITHRIASLVA SGHVAAGQVLAVTFTQRAAGEMRSRLRALDAAARTGSGVGAVQALTFHAA AYRQLRYFWSRVIADTGWQLLDSKFAVVARAASRTRLHASTDDVRDLAGE IEWAKASLIGPEEYVTAVAAARRDPPLDAAQIAAVYSEYEALKARGDGVT LLDFDDLLLHTAAAIENDAAVAEEFQDRYRCFVVDEYQDVTPLQQRVLSA WLGDRDDLTVVGDANQTIYSFTGASPRFLLDFSRRFPDAAVVRLERDYRS TPQVVSLANRVIAAARGRVAGSKLRLSGQREPGPVPSFHEHSDEPAEAAT VAASIARLIASGTPPSEVAILYRVNAQSEVYEEALTQAGIAYQVRGGEGF FNRQEIKQALLALQRVSERDTDAALSDVVRAVLAPLGLTAQPPVGTRARE RWEALTALAELVDDELAQRPALQLPGLLAELRRRAEARHPPVVQGVTLAS LHAAKGLEWDAVFLVGLADGTLPISHALAHGPNSEPVEEERRLLYVGITR ARVHLALSWALSRSPGGRQSRKPSRFLNGIAPQTRADPVPGTSRRNRGAA ARCRICNNELNTSAAVMLRRCETCAADVDEELLLQLKSWRLSTAKEQNVP AYVVFTDNTLIAIAELLPTDDAALIAIPGIGARKLEQYGSDVLQLVRGRT >MT2488 comE operon protein 1, putative MIAALAVLVTVFTLIRDRTEPVMSAKLPPVEPVSPTNPRSSASPGSPDRS GLPVVVSVVGLVHTPGLVTLAPGARIADALQAAGGAVDGADTVGLNMARQ LGDGEQIVVGLAPPSGQPRVLGSSVGAGTPGPAGTSGTATTGPKTAPKTA EVLDLNTATVEQLDALPGIGPVTAAAIVAWRQRNGRFTSVDQLADVDGIG PARLDKRRNLVRV >MT2877 IS1555', transposase, truncation MLAYFDHHASNGPTEAINGRLEALCRNALGFRNLTHYRIRSLLHCGNLAQ LIHAL >MT0413 IS6110, transposase MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR RILGWRVASTMATSMVLDAIEQAIWTRQQESVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG >MT3063 MutT/Nudix family protein MSIQNSSARRRSAGRIVYAAGAVLWRPGSADSEGPVEIAVIHRPRYDDWS LPKGKVDPGETAPVGAVREILEETGHRANLGRRLLTVTYPTDSPFRGVKK VHYWAARSTGGEFTPGSEVDELIWLPVPDAMNKLDYAQDRKVLCRFAKHP ADTQTVLVVRHGTAGSKAHFSGDDSKRPLDKRGRAQAEALVPQLLAFGAT DVYAADRVRCHQTMEPLAAELNVTIHNEPTLTEESYANNPKRGRHRVLQI VEQVGTPVICTQGKVIPDLITWWCERDGVHPDKSRNRKGSTWVLSLSAGR LVTADHIGGALAANVRA >MT0850 IS1605', transposase, truncation MGQHVESGTDQVLSAQLGARGTDGKALGVGGRIIGLGPSSKTCHACRHVQ DIGWDEKWQCDGCSITHQRDDNAAINLARYEEPPSVVGPVGAAVKRGADR KTGPGPAGGREARKGTGHPAGEQPRDGVLVA >MT2735 integrase MRVYCAGQGTTVTQTGKRQRRKFGRIRQFNSGRWQASYTGPDGRVYIAPK TFNAKIDAEAWLTDRRREIDRQLWSPASGQEDRPGAPFGEYAEGWLKQRG IKDRTRAHYRKLLDNHILATFADTDLRDITPAAVRRWYATTAVGTPTMRA HSYSLLRAIMQTALADDLIDSNPCRISGASTARRVHKIRPATLDELETIT KAMPDPYQAFVLMAAWLAMRYGELTELRRKDIDLHGEVARVRRAVVRVGE GFKVTTPKSDAGVRDISIPPHLIPAIEDHLHKHVNPGRESLLFPSVNDPN RHLAPSALYRMFYKARKAAGRPDLRVHDLRHSGAVLAASTGATLAELMQR LGHSTAGAALRYQHAAKGRDREIAALLSKLAENQEM >MT3015 IS1533, OrfA MLTVEDWAEIRRLHRAEGLPIKMIARVLGISKNTVKSALESNQQPKYERA PQGSIVDAVEPRIRELLQAYPTMPATVIAERIGWERSIRVLSARVAELRP VYLPPDPASRTTYVAGEIAQCDFWFPPIELPVGFGQTRTAKQLPVLTMVC AYSRWLLAMLLPSRCAEDLFAGWWRLIEALGAVPRVLVWDGEGAIGRWRG GRSELTTECQAFRGTLAAKVLICRPADPEAKGLIERAHDYLERSFLPGRV FASPADFNAQLGAWLALVNTRTRRALGCAPTDRIGADRAAMLSLPPVAPA TGWCTSLRLPRDHYVRCDSNDYSVHPGVIGHRVLVRADLERVHVFCDGEL VADHERIWAVHQTVSDPAHVEAAKVLRRRHFSAASPVVEPQVQVRSLSDY DDALGVDIDGGVA >MT0007 conserved hypothetical protein MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAA TRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAV RTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGG AGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWS TLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGS SAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL ADRD >MT2486 conserved hypothetical protein MSEAKPLHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGA YELAELLSPSLFAEERIVVLGAAAEAGKDAAAVIESAAADLPAGTVLVVV HSGGGRAKSLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRVKVDD ETVTALLDAVGSDVRELASACSQLVADTGGAVDAAAVRRYHSGKAEVRGF DIADKAVAGDVAGAAEALRWAMMRGEPLVVLADALAEAVHTIGRVGPQSG DPYRLAAQLGMPPWRVQKAQKQARRWSRDTVATAMRLVAELNANVKGAVA DADYALESAVRQVAELVADRGR >MT2149 serine/threonine protein kinase MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKVLAAELS RDLDFRARFVREADVAAGLDHPNIVAVHQRGQFEGRLWIAMQFVDGGNAE DALRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGG DERVLLSDFGIARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLY SLGCALFRLLTGEAPFAAGAGAAVAVVAGHLHQPPPTVSDRVPGLSAAMD AVIATAMAKDPMRRFTSAGEFAHAAAAALYGGATDGWVPPSPAPHVISQG AVPGSPWWQHPVGSVTALATPPGHGWPPGLPPLPRRPRRYRRGVAAVAAV MVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPPIVTRSRLP GLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSA YDLGTVIGFYGLTIDEPPTGTAPNQVGQLIVAFRDAATAQRHLADLASIW RRCGGRTVTLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDH AIAAKNNVLVDVDIMTPDTSRGQQAVIGITNYILAKIPG >MT3142 DNA-damage-inducible protein P, putative MPTAAPRWILHVDLDQFLASVELLRHPELAGLPVIVGGNGDPTEPRKVVT CASYEARAYGVRAGMPLRTAARRCPEATFLPSNPAAYNAASEEVVALLRD LGYPVEVWGWDEAYLAVAPGTPDDPIEVAEEIRKVILSQTGLSCSIGISD NKQRAKIATGLAKPAGIYQLTDANWMAIMGDRTVEALWGVGPKTTKRLAK LGINTVYQLAHTDSGLLMSTFGPRTALWLLLAKGGGDTEVSAQAWVPRSR SHAVTFPRDLTCRSEMESAVTELAQRTLNEVVASSRTVTRVAVTVRTATF YTRTKIRKLQAPSTDPDVITAAARHVLDLFELDRPVRLLGVRLELA >MT1191 hypothetical protein MPNLQLVQEPAADALLNANPFALLVGMLLDQQVPMETAFAGPKKIADRMG SFDAGDIADYDPDKFVALCSERPAIHRFPGSMAKRIQALAQIIVDRYDGD AAALWTAGEPDGNELLRRLKGLPGFGEQKARIFLALLGKQYGVTPKGWQV AAGEFGQPGTYLSVADIVDAGSLGQVRSHKRQRKAAAKAEGKAPT >MT0426 MutT/nudix family protein MPSCPPAYSEQVRGDGDGWVVSDSGVAYWGRYGAAGLLLRAPRPDGTPAV LLQHRALWSHQGGTWGLPGGARDSHETPEQTAVRESSEEAGLSAERLEVR ATVVTAEVCGVDDTHWTYTTVVADAGELLDTVPNRESAELRWVAENEVAD LPLHPGFAASWQRLRTAPATVPLARCDERRQRLPRTIQIEAGVFLWCTPG DADQAPSPLGRRISSLL >MT2887 CRISPR-associated protein, TM1808 family MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLG ELVACSTLRLTDLLPYVGPDYLVPKPLHSVRSDGSSMQKKLAKKIGFLPA AQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFR FELDAGLWLLATGSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAP AALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRSGFVASSTYAD MPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPES AA >MT3295 helicase, UvrD/Rep family MTQTAAPARYSPAELACALGLFPPTAEQAAVIAAPPGPLVVIAGAGAGKT ETMAARVVWLVANGYAEPGQVLGLTFTRKAAGQLLRRVRSRLARLAGIGL GCGDPAACAPVVSTYHAFAGSLLRDYGLLLPLEPDTRLLSETELWQLAFD VVSGYDGVLCTDKSPAAVTSIVVRLWGQLGEHLVDTRALRDTHVELERLV HALPAGRYQRDRGPSQWLLRMLATQTQRAELVPLLDALGERMHAGKVMDF AMQMASAARLAATSPQVGQDLRRRYRVVLLDEYQDTGHAQRVVLSSLFGG GVDDGLALTAVGDPIQSIYGWRGASATNLPRFTTDFPLSDGTPAPVLELL TSWRNPPQALRVANGISAEARRRSVAVRALRPRPDAPPGAVRCALLPDVQ AEREWIADHLRMRYQRAEADGVKPPTAAVLVRRNADAAAIADTLRARGIP AEVVGLAGLLSIPEVAEVVAMLRLVADPTAGAAAMRVLTGPRWRLGARDL AALWRRALTLSGESPSTASPESIAMAASADADNPCLADAISDPGSAEGYS VAGYGRIGALAGELSALRGRLGHSLPDLVAEVRRVLGVDCEVRASAPVSG GWAGPEHLDAFADVVAGYAERASARSSEASVAGLLAYLDVAEVVENGLPP AELTVACDRVQVLTVHAAKGLEWQVVAVAHLSRGVFPSTVSRSSWLTDPA ELPPLLRGDRASAGAHGIPVLDTSAVADRKQLSDKISEHRRLLDRRRVDE ERRLLYVAVTRAEDTLLVSGHHWGPTGTKPRGPSEFLCELKDIIDRSAAA GDPCGVVEQWASAPAGDERNPLCDNAIEAVWPADPLAARRGDVERGAALV AAAMSADLPGSTTDIDHPPRPGDAPWSTDVDALLAERAHAARGAPARGLP NHLSVSSLVELVGDPVGARQRLMCRLPKRPDPHAWLGDAFHAWVQQFYGA ELLFDLGDLPGAADREVGDPEELAALQRAFTASSWAARTPAAVEVPFEMP IGDTVVRGRIDAVFVDPDGGATVVDWKTGKPPHGPAAMRQAAVQLAVYRL AWAALRGCPTSSVRTAFYYVRSGITVVPDELPAPGELAMLLTDCAGRRSD T >MT2953 IS1539, transposase MMARLKVPEGWCVQAFRFTLNPTQTQAASLARHFGARRKAFNWTVTALKA DIKAWRADGTESAKPSLRVLRKRWNTVKDQVCVNAQTGQVWWPECSKEAY ADGIAGAVDAYWNWQSCRAGKRAGKTVGVPRFKKKGRDADRVCFTTGAMR VEPDRRHLTLPVIGTIRTYENTRRVERLIAKGRARVLAITVRRNGTRLDA SVRVLVQRPQQRRVALPDSRVGVDVGVRRLATVADAEGTVLEQVPNPRPL DAALRGLRRVSRARSRCTKGSRRYCERTTELSRLHRRVNDVRTHHLHVLT TRLAKTHGRIVVEGLDAAGMLRQKGLPGARARRRALSDAALATPRRHLSY KTGWYGSSLVVADRWFPSSKTCHACRHVQDIGWDEKWQCDGCSITHQRDD NAAINLARYEEPPSVVGPVGAAVKRGADRKTGPGPAGGREARKATGHPAG EQPRDGVQVK >MT1446 primosomal protein N' MLSVPHLDRDFDYLVPAEHSDDAQPGVRVRVRFHGRLVDGFVLERRSDSD HHGKLGWLDRVVSPEPVLTTEIRRLVDAVAARYAGTRQDVLRLAVPARHA RVEREITTAPGRPVVAPVDPSGWAAYGRGRQFLAALADSRAARAVWQALP GELWADRFAEAAAQTVRAGRTVLAIVPDQRDLDTLWQAATALVDEHSVVA LSAGLGPEARYRRWLAALRGSARLVIGTRSAVFAPLSELGLVMVWADADD SLAEPRAPYPHAREVAMLRAHQARCAALIGGYARTAEAHALVRSGWAHDV VAPRPEVRARSPRVVALDDSGYDDARDPAARTARLPSYRWRRGSALQSGA PVLVQVPRRGYIPSLACGRCRAIARCRSCTGPLSLQGAGSPGAVCRWCGR VDPTLRCVRCGSDVVRAVVVGARRTAEELGRAFPGTAVITSAGDTLVPQL DAGPALVVATPGAEPRAPGGYGAALLLDSWALLGRQDLRAAEDALWRWMT AAALVRPRGAGGVVTVVAESSIPTVQSLIRWDPVGHAEAELAARTEVGLP PSVHIAALDGPAGTVTALLEAARLPDPDRLQADLLGPVDLPPGVRRPAGI PADAPVIRMLLRVCREQGLELAASLRRGIGVLSARQTRQTRSLVRVQIDP LHIG >MT3430 IS1547, transposase MAFAPTEMCPPTGPTSTPPQVKEATTMVVVGTDAHKYSHTFVATDEVGRQ LGEKTVKATTAGHATAIMWAREQFGLELIWGIEDCRNMSARLERDLLAAG QQVVRVPTKLMAQTRKSARSRGKSDPIDALAVARAVMRETDLPLATHDET SRELKLLTDRRDVLVAQRTSAINRLRWLVHELDPERAPAARSLDAAKHQQ ALRTWLDTQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQVAPALL EIPGCAELTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMR LSRSGNRQLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLK RRLARTVFQALRTVHQPSSEHTQPAAACHRSYCSSHLGEPPRLTDMTQKT RIQPLPPKRAGLLIRALYRIAKRRFGEVPEPFTVTAHHRRLLIANVVHEA LLQRASRKLPPSVRELAVFWTARSIGCSWCVDFGAMLQRLDGLDVDRLTD IDNYATSSKFSDDERAAIAYAEAMTADPHSVTDEQVADLRARFGEAGVIE LTYQIGVENMRARMNSALGITEQGFNSGDACRVPWAAPDVPSAESR >MT0965 conserved hypothetical protein/DNA ligase MGSASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHIAGRPAT RKRWPNGVDQPAFFEKQLALSAPPWLSRATVAHRSGTTTYPIIDSATGLA WIAQQAALEVHVPQWRFVAEPGSGELNPGPATRLVFDLDPGEGVMMAQLA EVARAVRDLLADIGLVTFPVTSGSKGLHLYTPLDEPVSSRGATVLAKRVA QRLEQAMPALVTSTMTKSLRAGKVFVDWSQNSGSKTTIAPYSLRGRTHPT VAAPRTWAELDDPALRQLSYDEVLTRIARDGDLLERLDADAPVADRLTRY RRMRDASKTPEPIPTAKPVTGDGNTFVIQEHHARRPHYDFRLERDGVLVS WAVPKNLPDNTSVNHLAIHTEDHPLEYATFEGAIPSGEYGAGKVIIWDSG TYDTEKFHDDPHTGEVIVNLHGGRISGRYALIRTNGDRWLAHRLKNQKDQ KVFEFDNLAPMLATHGTVAGLKASQWAFEGKWDGYRLLVEADHGAVRLRS RSGRDVTAEYPQLRALAEDLADHHVVLDGEAVVLDSSGVPSFSQMQNRGR DTRVEFWAFDLLYLDGRALLGTRYQDRRKLLETLANATSLTVPELLPGDG AQAFACSRKHGWEGVIAKRRDSRYQPGRRCASWVKDKHWNTQEVVIGGWR AGEGGRSSGVGSLLMGIPGPGGLQFAGRVGTGLSERELANLKEMLAPLHT DESPFDVPLPARDAKGITYVKPALVAEVRYSEWTPEGRLRQSSWRGLRPD KKPSEVVRE >MT2862 IS1602, resolvase MNLAVWAERNGVARVTAYRWFHAGLLPVPARKAGRLILVDDQPADRSRRA RTAVYARVSSADQKPDLDRQVARVTAWATTEQIAVDKVVTEVGSALNGHR RKFLALLRDPSVKRIVVEHRDRFCRFGSEYVEAALAAQGRELVVVDSAEV DDDLVRDMTEILTSMCARLYGKRAAQNRAKRALAAAAEESEAA >MT1353 IS1557, transposase MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSAVLRRCG RCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVP WARHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADT EKRIDRFANLRRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATL GLFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFHVVAWA TEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGAR YALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGK QALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNT KIRLLTRIAFGFRSPQALIALAMLTLAGHRPTRPGRHNHPQISQ >MT0970 formamidopyrimidine-DNA-glycosylase MSSSAGVLVAGTPQPRALGPDALDVSTDDLAGLLAGNTGRIKTVITDQKV IAGIGNAYSDEILHVAKISPFATAGKLSGAQLTCLHEAMASVLSDAVRRS VGQGAAMLKGEKRSGLRVHARTGLPCPVCGDTVREVSFADKSFQYCPTCQ TGGKALADRRMSRLLK >MT2954 IS1539, resolvase MSRILTHVPGRTVNRSYALPALVGSAAGRLSGNHSHGREAYIALPQWACS RQPSTPPLQTPGRINALWSLRPVLPMPGRGCQLLRLGGRWLSVVCCRNGS MNLVVWAEGNGVARVIAYRWLRVGRLPVPARRVGRVILVDEPAGQPGRWG RTAVCARLSSADQKVDLDRQVVGVTAWATAEQIPVGKVVTEVGSALYGRR RTFLTLLGDPTVRRIVMKRRDRLGRFGFECVQAVLAADGRELVVVDSADV DDDVVGDITEILTSICARLYGKRAAGNRAARAVAAAARAGGHEAR >MT0947 IS1554, transposase MDAAQVIEPAHAGQDVDEAAVAARELSGAERALVGDLVRQARAEGVALTG PDGLLKALTKTVLEAALQEEMTEHLGYDRHAAAGRGSGNSRNGSRNKKVI TDACGQVEIAVPRDRNGTFEPVIVGKRKRRVTDVDRVVLSLYAKGLTTGE IAAHFADVYGVSVSKDTISRITDRVIEEMQAWWSRPLEKVYAAVFIDAIM VKIRDGQVRNRPVYAAIGVDLDGHKDILGMWAGEGDGESAKFWLAVLTDL RNRGVKDIFFLVCDGLKGLPDSVSAAFPLATVQTCIIHLIRNTFRYASRK YWDKISVDLKPIYTAASAAEARLRYEEFAEKWGKPYPAITRLWDSAWEEF IPFLDYDVEIRRVPCSTNAIESLNARYRRAVRARGHFPNEQSALKTLYLV TRSLDPKGTGQTKWAVRWKPALNALAITFADRMPAAEER >MT1788 serine/threonine protein kinase MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLRADVSAD GEYRARFNREADAAASLWHPHIVAVHDRGEFDGQLWIDMDFVDGTDTVSL LRDRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDS PDRRIMLADFGIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRAD QYALAATAFHLLTGSPPFQHANPAVVISQHLSASPPAIGDRVPELTPLDP VFAKALAKQPKDRYQRCVDFARALGHRLGGAGDPDDTRVSQPVAVAAPAK RSLLRTAVIVPAVLAMLLVMAVAVAVREFQRADDERAAQPARTRTTTSAG TTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALCFPLGSTGT TKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESP IRVCMQQTGQTRRECREEIRRSNGWP >MT2431 conserved hypothetical protein MRLYRDRAVVLRQHKLGEADRIVTLLTRDHGLVRAVAKGVRRTRSKFGAR LEPFAHIEVQLHPGRNLDIVTQVVSVDAFATDIVADYGRYTCGCAILETA ERLAGEERAPAPALHRLTVGALRAVADGQRPRDLLLDAYLLRAMGIAGWA PALTECARCATPGPHRAFHIATGGSVCAHCRPAGSTTPPLGVVDLMSALY DGDWEAAEAAPQSARSHVSGLVAAHLQWHLERQLKTLPLVERFYQADRSV AERRAALIGQDIAGG >MT1076 IS1081, transposase MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKKI WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT >MT0884 ATP-dependent DNA helicase, putative MQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITPLALWNAR AAGHDAEQVVDALVSYSRYAVPQPLLVDIVDTMARYGRLQLVKNPAHGLT LVSLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQLLLKIGW PAEDLAGYVDGEAHPISLHQEGWQLRDYQRLAADSFWAGGSGVVVLPCGA GKTLVGAAAMAKAGATTLILVTNIVAARQWKRELVARTSLTENEIGEFSG ERKEIRPVTISTYQMITRRTKGEYRHLELFDSRDWGLIIYDEVHLLPAPV FRMTADLQSKRRLGLTATLIREDGREGDVFSLIGPKRYDAPWKDIEAQGW IAPAECVEVRVTMTDSERMMYATAEPEERYRICSTVHTKIAVVKSILAKH PDEQTLVIGAYLDQLDELGAELGAPVIQGSTRTSEREALFDAFRRGEXAT LVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRILRPKADGGGAI FYSVVARDSLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI >MT0423 serine/threonine protein kinase MADGARPSPRPAHAEVCGLMAKASETERSGPGTQPADAQTATSATVRPLS TQAVFRPDFGDEDNFPHPTLGPDTEPQDRMATTSRVRPPVRRLGGGLVEI PRAPDIDPLEALMTNPVVPESKRFCWNCGRPVGRSDSETKGASEGWCPYC GSPYSFLPQLNPGDIVAGQYEVKGCIAHGGLGWIYLALDRNVNGRPVVLK GLVHSGDAEAQAMAMAERQFLAEVVHPSIVQIFNFVEHTDRHGDPVGYIV MEYVGGQSLKRSKGQKLPVAEAIAYLLEILPALSYLHSIGLVYNDLKPEN IMLTEEQLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTV GRTLAALTLDLPTRNGRYVDGLPEDDPVLKTYDSYGRLLRRAIDPDPRQR FTTAEEMSAQLTGVLREVVAQDTGVPRPGLSTIFSPSRSTFGVDLLVAHT DVYLDGQVHAEKLTANEIVTALSVPLVDPTDVAASVLQATVLSQPVQTLD SLRAARHGALDADGVDFSESVELPLMEVRALLDLGDVAKATRKLDDLAER VGWRWRLVWYRAVAELLTGDYDSATKHFTEVLDTFPGELAPKLALAATAE LAGNTDEHKFYQTVWSTNDGVISAAFGLARARSAEGDRVGAVRTLDEVPP TSRHFTTARLTSAVTLLSGRSTSEVTEEQIRDAARRVEALPPTEPRVLQI RALVLGGALDWLKDNKASTNHILGFPFTSHGLRLGVEASLRSLARVAPTQ RHRYTLVDMANKVRPTSTF >MT3618 conserved hypothetical protein MHGAKWVDARQAAELLYDHRRPPAGIHTWSDRVADDEIQPISGMNTTTPA RTALDLARRYPVGKAVAAIDALARATDLKLADVEMLAERYRGSRGIRNAR IALDLVDPGAESPRETWLRLLLIRAGFPRPQTQIPVYDEYGQLVAVIDMG WAGIKVGVDYEGDHHRTDRRTFNKDIKRAEALTELGWTDVRVTVEDTEGG IIWRVSAAWQRRT >MT3905 IS1557, transposase MRNVRLFRALLGVDKRTVIEDIEFEEDDAGDGARVIARVRPRSAVLRRCG RCGRKASWYDRGAGLRQWRSLDWGTVEVFLEAEAPRVNCPTHGPTVVAVP WARHHAGHTYAFDDTVAWLAVACSKTAVCELMRIAWRTVGAIVARVWADT EKRIDRFANLRRIGIDEISYKRHHRYLTVVVDHDSGRLVWAAPGHDKATL GLFFDALGAERAAQITHVSADAADWIADVVTERCPDAIQCADPFHVVAWA TEALDVERRRAWNDARAIARTEPKWGRGRPGKNAAPRPGRERARRLKGAR YALWKNPEDLTERQSAKLAWIAKTDPRLYRAYLLKESLRHVFSVKGEEGK QALDRWISWAQRCRIPVFVELAARIKRHRVAIDAALDHGLSQGLIESTNT KIRLLTRIAFGFRSPQALIALAMLTLAGHRPTLPGRHNHPQISQ >MT3573 integrase, putative MRAAVYLRISEDRSGEQLGVARQREDCLKLCGQRKWVPVEYLDNDVSAST GKRRPAYEQMLADITAGKIAAVVAWDLDRLHRRPIELEAFMSLADEKRLA LATVAGDVDLATPQGRLVARLKGSVAAHETEHKKARQRRAARQKAERGHP NWSKAFGYLPGPNGPEPDPRTAPLVKQAYADILAGASLGDVCRQWNDAGA FTITGRPWTTTTLSKFLRKPRNAGLRAYKGARYGPVDRDAIVGKAQWSPL VDEATFWAAQAVLDAPGRAPGRKSVRRHLLTGLAGCGKCGNHLAGSYRTD GQVVYVCKACHGVAILADNIEPILYHIVAERLAMPDAVDLLRREIHDAAE AETIRLELETLYGELDRLAVERAEGLLTARQVKISTDIVNAKITKLQARQ QDQERLRVFDGIPLGTPQVAGMIAELSPDRFRAVLDVLAEVVVQPVGKSG RIFNPERVQVNWR >MT2373 hypothetical protein MAPTGQAVDVAVREGAGDVGYSVERENLPADDPVRNGNRWRVIAVDTEHH RIAARRLGDGARAAFSGDYLHEHITHGYAITVHASQGTTAHSTHAVLGDN TSRATLYVAMTPARESNTAYLCERTAGEGARVDLAGWDLWVSGKAEAMSD EKSASPVWCRVGARCDHRGKRSCW >MT2730 hypothetical protein MTHKRTKRQPAIAAGLNAPRRNRVGRQHGWPADVPSAEQRRAQRQRDLEA IRRAYAEMVATSHEIDDDTAELALLSMHLDDEQRRLEAGMKLGWHPYHFP DEPDSKQ >MT0818 IS1547, transposase MCPPTGPTSTPPQVKEATTMVVVGTDAHKYSHTFVATDEVGRQLGEKTVK ATTAGHATAIMWAREQFGLELIWGIEDCRNMSARLERDLLAAGQQVVRVP TKLMAQTRKSARSRGKSDPIDALAVARAVMRETDLPLATHDETSRELKLL TDRRDVLVAQRTSAINRLRWLVHELDPERAPAARSLDAAKHQQALRTWLD TQPGLVAELARAELTDIIRLTGEINTLAQRISARVHQVAPALLEIPGCAE LTAAKIVGEAAGVTRFKSEAAFACHAAVAPIPVWSGNTAGQMRLSRSGNR QLNAALHRIALTQIRMTDSRGQAYYQRLQDAGKTKRAALRCLKRRLARTV FQALRTVHQPSSEHTQPAAACHRSYCSRSCLSG >MT3197 IS1081, transposase MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKKI WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT >MT2179 conserved hypothetical protein MADQPDPPTPRPALSPSRATDFKQCPLLYRFRAIDRLPEATSAAQLRGSV VHAALEQLYGLPAGLRSPDTARSLVQRAWDQMVAAEPELAGELDPGQPTQ LLEDARALVSGYYRLEDPTRFDPQCCEQRVEVELADGTLLRGYIDRIDVA ATGELRVVDYKTGKAPPAARALAEFKAMFQMKFYAVALFRSRGVPPTRLR LIYLADGQLLDYSPDRDELLRFEKTLMAIWRAIQSAGETGDFRPNPSRLC DWCPHQQRCPAFGGTPPPYPGWPTEPAA >MT2247 DNA polymerase III, epsilon subunit, putative MGATGGTQLSFADLAHAQGAAWTPADEMSLRETTFVVVDLETTGGRTTGN DATPPDAITEIGAVKVCGGAVLGEFATLVNPQHSIPPQIVRLTGITTAMV GNAPTIDAVLPMFFEFAGDSVLVAHNAGFDIGFLRAAARRCDITWPQPQV LCTMRLARRVLSRDEAPSVRLAALARLFAVASNPTHRALDDARATVDVLH ALIERVGNQGVHTYAELRSYLPNVTQAQRCKRVLAETLPHRPGVYLFRGP SGEVLYVGTAADLRRRVSQYFNGTDRRKRMTEMVMLASSIDHVECAHPLE AGVRELRMLSTHAPPYNRRSKFPYRWWWVALTDEAFPRLSVIRAPRHDRV VGPFRSRSKAAETAALLARCTGLRTCTTRLTRSARHGPACPELEVSACPA ARDVTAAQYAEAVLRAAALIGGLDNAALAAAVQQVTELAERRRYESAARL RDHLATAIEALWHGQRLRALAALPELIAAKPDGPREGGYQLAVIRHGQLA AAGRAPRGVPPMPVVDAIRRGAQAILPTPAPLGGALVEEIALIARWLAEP GVRIVGVSNDAAGLASPVRSAGPWAAWAATARSAQLAGEQLSRGWQSDLP TEPHPSREQLFGRTGVDCRTGPPQPLLPGRQPFSTAG >MT3456 IS1561', transposase MAIDPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRRRVTWAF HDRRGRKIDPQWANRRRLLTARERLSDKSFAKMRNRINAVDPRAQILSAW IAKEELRTLLSTVRTGGDPHLARHHLHRFLPGASTRRSPNCSPWPPPLTS HPRSTPSWSPASPTRASVVGEVAEMLGDIDGQCVQVEVPVPERGPAGCGG LDGLGRAGVSATPRVCAAMTAVNVAGRCAGQQADVGPTPQHRCRGR >MT3281 IS1603, transposase MRQISSRYLSEEERINIADLRRSGLSIRKIADQLGRAPSTVSRELRRNSR RDGQYRPFEAHRWAVQRRVRRHRRRIDKNPDLCELIAELLAQRWSPQQIA RHLRRKYPDDRSMWLCHESIYQAVYQPQSRLIRPPQVKSPHRGPLRTGRT HRRAHLRPGRRRPRFAQPMLSIHQRPFDPADRSEPGHWEGDLIVGKNQGS AIGTLVERQTRLIRLLHLPTHDAYCLRIAITETMSDLPVTLVRSITWDQG IEMARHIDITADLGAPVYFCDSRSPWQRASNENSNGLLRQYFPKGTSLST YTPDHLRAVEYEINNRPRQVLGHRSPAELFTALLTSPDHQLLRR >MT2604 hypothetical protein MHLAHRVASSRDTPSSSATPNAVSGSASNAADRPCLVRPPTAPPWAHGPR LRRDPTGGGSTPSIVLSRSTDRSKDGHRIVPAGARKSGVRASTGRLPSTR KTTRSPDCRPSASRTAFGXVTCPFDVTMGSSECLLHRCRTPPVPSHSVEL LVAANPAEDSRLPYLIRLPVGAGLVFATSDVWPRTKALYCHRLDIADWPA DPVVVDRVELRSCSRRGAAIDVVAARARENRSQLVHTMARGRQVVFWQSP KTRKQSRPGVRTPTARAAGIPELHIVVDAHERYPYTFADKPAKTTREALP CGDYGLKVAGQLVAAVERKALADLTSGVLNGNLKYQLTELAALPRAAVVV EDRYSEIFAHSFARPTAIADGLAELQIGFPNVPIVFCQTRKLAQEYTYRY LAAALTWFVDDADATTVFEPAAAEPEPSSAELRAWAKSVGLPVSDRGRLR PQILQAWRAAHPR >MT3480 DNA polymerase III, alpha subunit, putative MFDILWNVGWSNGPPSWAEMERVLNGKPRHAGVPAFDADGDVPRSRKRGA YQPPGRERVGSSVAYAELHAHSAYSFLDGASTPEELVEEAARLGLCALAL TDHDGLYGAVRFAEAAAELDVRTVFGAELSLGATARTERPDPPGPHLLVL ARGPEGYRRLSRQLAAAHLAGGEKGKPRYDFDALTEAAGGHWHILTGCRK GHVRQALSQGGPAAAQRALADLVDRFTPSRVSIELTHHGHPLDDERNAAL AGLAPRFGVGIVATTGAHFADPSRGRLAMAMAAIRARRSLDSAAGWLAPL GGAHLRSGEEMARLFAWCPEAVTAAAELGERCAFGLQLIAPRLPPFDVPD GHTEDSWLRSLVMAGARERYGPPKSAPRAYSQIEHELKVIAQLRFPGYFL VVHDITRFCRDNDILCQGRGSAANSAVCYALGVTAVDPVANELLFERFLS PARDGPPDIDIDIESDQREKVIQYVYHKYGRDYAAQVANVITYRGRSAVR DMARALGFSPGQQDAWSKQVSHWTGQADDVDGIPEQVIDLATQIRNLPRH LGIHSGGMVICDRPIADVCPVEWARMANRSVLQWDKDDCAAIGLVKFDLL GLGMLSALHYAKDLVAEHKGIEVDLARLDLSEPAVYEMLARADSVGVFQV ESRAQMATLPRLKPRVFYDLVVEVALIRPGPIQGGSVHPYIRRRNGVDPV IYEHPSMAPALRKTLGVPLFQEQLMQLAVDCAGFSAAEADQLRRAMGSKR STERMRRLRGRFYDGMRALHGAPDEVIDRIYEKLEAFANFGFPESHALSF ASLVFYSAWFKLHHPAAFCAALLRAQPMGFYSPQSLVADARRHGVAVHGP CVNASLAHATCENAGTEVRLGLGAVRYLGAELAEKLVAERTANGPFTSLP DLTSRVQLSVPQVEALATAGALGCFGMSRREALWAAGAAATGRPDRLPGV GSSSHIPALPGMSELELAAADVWATGVSPDSYPTQFLRADLDAMGVLPAE RLGSVSDGDRVLIAGAVTHRQRPATAQGVTFINLEDETGMVNVLCTPGVW ARHRKLAHTAPALLIRGQVQNASGAITVVAERMGRLTLAVGARSRDFR >MT1063 IS1560' protein MQQGNPPDAPQLAPAVAWVKKRAGRTPRTVTADRGYGEAAVDQQLTEVGV KNVLIPRKGKPSQDRRAEEHRKAFRRTIKWRTGCEGRISHLKRGYGWDRG RIGGLEGTRTWVGHGVFAHNLVTISALPA >MT2069 IS1607, transposase MSIVDARGREVRRATIEHNAAGLRELLELLSRAGAREVAIERPDGPVVDT LLEAGITVVVISPNQLKNLRGRYGSAGNKDDRFDAFVLADTLRTDRSRLR PLLPDTPATATLRRTCRPRKDLVAHRVALANQLRAHLRVVFPGVVGLFAD LDSPISLAFLTFLPRFDCQDRADWLSVKRLAGWLAAAGYCGRAPRPAHRC PARRHR >MT3742 IS1553, transposase MALPQSALSELLDAFRTGDGVDLIRDAVRLVLQELSELEATERIGAARYE RSDTRVTDRNGARSRVLSTQAGDVELRIPKLRKGSFFPAILEPRRRIDQA LYAVVMEAYVHGISTRAVDDLVEAMGVETGISKSEVSRICAGLDEIVGAF RTRTLGHIEFPYVYLDATYLNVLNGTGQVVSMAVIVASGIAADGSREILG LDVGDSEDETFWRGFLTSLKGRGLGGVRLVISDQHAGLVKALKRCFQGAG HQRCRVHFARNLLAHVPKDKADMVASMFRMIFSAPDAEAVHATWEGVRDR LAASFPKIGPLMDDARAEVLAFTAFPKAHWQKIWSTNPLERINKEIKRRS RVVGIFPNPAAVIRLVGAVLADMHDEWQASERRYLSEASMALLYPDSDNA VVAAISGGQ >MT2070 IS1607, transposase MLHDRLTGAPRGATGDEGAANAHITRAMVAALTSVATQIKTLDAQIAEQL SLHADAHIFTSLPRSGTVRAARLLAEIGDCRARFPTPESLACLAGVAPST RQSGKVKHVGFRWAADKQLRDAVCDFAGDSRRANLWAADRYNRAIARGHD HPHAVRILARAWLYAIWHCWQDGAAYHPANHRALQALLNQDQDRAA >MT2655 conserved hypothetical protein MRWARQAVAVNGMPVDDGALPGLQRIGLVRSVRAPQFDGITFHEVLCKSA LNKVPNAAALPFRYTVNGYRGCSHACRYCFARPTHEYLDFNPGTDFDTQV VVKTNVAAVLRHELRRPSWRRETVALGTNTDPYQRAEGRYALMPGIIGAL AASGTPLSILTKGTLLRRDLPLIAEAAQQVPVSVAVSLAVGDPELHRDVE SGTPTPQARLALITAIRAAGLDCHVMVAPVLPQLTDSGEHLDQLLGQIAA AGATGVTVFGLHLRGSTRGWFMCWLARAHPELVSRYRELYRRGPYLPPSY REMLRERVAPLIAKYRLAGDHRPAPPETEAALVPVQATLF >MT3099 IS6110, hypothetical protein MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA AELDRPAR >MT2861 IS1602, transposase MAKFEIPEGWMVQAFRFTLDPTAEQARALARHFGARRKAYNWTVATLKAD IDAWQATGIQTAKPSLRVLRKRWNTVKNDVCVNIETGVVWWPECSKEAYA DGIDGAVDAYWNWQNSRSGKRDGKRMGFPRFKKKGRDPDRVTFTTGAMRV EPDRRHLTLPVIGTVRTHENTRRVERLIAKGRSRVLAITVRRNGTRIDAS VRVLVQRPQQPKVTDPGSRVGVDVGVRRXATVATADGAVLERVPNPRPLD AALNELRHVCRARSRCTKGSRRYRERTTEISRLHRRVNDVRTHHLHCLTT HLAKTHGRIVVEGLDAAGMLRQQGLSGARARRRGLSDAALGTPRRHLSYK TGWYGSQLVVADRWFPSSKTCHVCGHVQEIGWAEHWQCDSCSASHQRDDC AAINLARYEDTSSVVGPVGAAVKRGADRKTRPGRAGGREARKGSSRKAAE QPRDGVQVA >MT2884 CRISPR-associated protein Cas1 MVQLYVSDSVSRISFADGRVIVWSEELGESQYPIETLDGITLFGRPTMTT PFIVEMLKRERDIQLFTTDGHYQGRISTPDVSYAPRLRQQVHRTDDPAFC LSLSKRIVSRKILNQQALIRAHTSGQDVAESIRTMKHSLAWVDRSGSLAE LNGFEGNAAKAYFTALGHLVPQEFAFQGRSTRPPLDAFNSMVSLGYSLLY KNIIGAIERHSLNAYIGFLHQDSRGHATLASDLMEVWRAPIIDDTVLRLI ADGVVDTRAFSKNSDTGAVFATREATRSIARAFGNRIARTATYIKGDPHR YTFQYALDLQLQSLVRVIEAGHPSRLVDIDITSEPSGA >MT3494.1 IS1560, transposase MVRNAQRAVRRASGRRKAWLRQAINHLEKLIGRTERVVDQARSRLAGVMP DSSSRLVSLHDADARPIRKGRLGKPVEFGYKAQVVDNADGVILDHSVELG NPADAPQLAPAIERISRRTGRPPRAVTADRGCGDASVEDDLHQLGVRNVA IPRKSKPSATRRAFEHRRAFRDKIKWRTGSEGRINHLKRSYGWNRTELTG ITGARTWCGHGVFAHNLVKISTLAA >MT0767 IS1557', transposase MFSVKGEEGKQALDRWISWARRCRIPVFVELAGGIVRHRQAIDAALDHGL WQGLIESTNTKIRLLTRIAFGFRSPEALIALAMLALGGRRPALPGRTKHP RISQ >MT2883 CRISPR-associated protein Cas2 MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGF GYRVQESAFEAMLTKGQLAKLVARIDRFAIDCDNIRIYKIRGVAAVTFYG RGRLVSAEEFVFF >MT0949 IS1535, transposase MIVRMRSCAQAAKVAEATGGVQLAGKPKPDGTPTFSRYVEIGVDFEAHRP VVESVSVLFELYDGDANSYAATGGPGAQLPSGWMVTAAKFEVEWPADPQR AGLVRSHFGARRKAFNWGLAQVKADLDAKAADPAHESVDWDLKSLRWAWN RAKDDVAPWWAENSKECYSSGLADLAQGLANWKAGKNGTRKGRRVGFPRF KSGRRDPGRVRFTTGTMRIEDDRRTITVPVIGPLRAKENTRRVQRHLVSG RAQILNMTLSQRWGRLFVAVCYALRTPTTRSPLTQPTVRAGMDLGVRTLA TVATLDTATGEQTIIEYPNPAPLKATLVARRRAGRELSRRIPGSHGHRAV KAKLARLDRRCVHLRREAAHQLTTELAGTYGQVVIEDLDVAAMKRSMRRR AFRRSVSDAAMGLVAPQLAYKTAKCSGVLTVADRWFASSQIHHGCTSPDG TPCRLQGKGRIDKHLLCPVTGEVVDRDRNAALNLRDWPDNASRGPVGTTA PSAPGPTTTVGTGHGADTGSSGAGGASVRPRPRRAGRGEAKTQTPQGDAA >MT0699 AP endonuclease, family 2 MLIGSHVSPTDPLAAAEAEGADVVQIFLGNPQSWKAPKPRDDAAALKAAT LPIYVHAPYLINLASANNRVRIPSRKILQETCAAAADIGAAAVIVHGGHV ADDNDIDKGFQRWRKALDRLETEVPVYLENTAGGDHAMARRFDTIARLWD VIGDTGIGFCLDTCHTWAAGEALTDAVDRIKAITGRIDLVHCNDSRDEAG SGRDRHANLGSGQIDPDLLVAAVKAAGAPVICETADQGRKDDIAFLRERT GS >MT0282 conserved hypothetical protein MAAPVSLDVHGRQVIVTHPGRVVFPAHNDRKGYTKFDLVRYYLAVAEGAM RGVAGRPMILKRFVKGISAEAVFQKRAPANRPDWVDVAELHYASGRSAAE AVIHDAAGLAWVINLGCVDLNPHPVLAGDLDHPDELRVDLDPMPGVAWQR VVEVALVVREVLEDYGLTAWPKTSGSRGFHVYARIAPCWSFPQVRLAAQT VAREVERRLPDAATSRWWKEEREGVFVDFNQNAKDRTVASAYSVRATPDA RVSTPLHWEEVPGCDPAVFTMATVPSRLADIGDPWAGMDDAVGRLDRLLM LAEELGPPQKAQSAKPLIEIARAKTRAEAMAALDIWRDRYPGAAALLRPA DVLVDGMRGPSSIWYRIRINLQHVPADQRPPQEELIADYSPWPR >MT3016 IS1533, OrfB MSQCPGWPIAPAPRTGATKNTWPPACSGKCQPGSPMVVRAASAPHASRLG SRWKSSTLSMLVASNATPSHIWAPWISSPPAITSCFWAPPGTGKTHLAVG LAIRACQAGHRVLFATAAEWVARLAEAHHAGRIYAELTRLCRYPLLVVDE VGYIPFEPEAANLFFQLVSSRYERASLIVTSNKAFGRWGEVFGGDDVVAA AMIDRLVHHAEVVALKGDSYRLKDRDLGRVPPAGTTEE >MT1314 exonuclease SbcD-related protein MSPRPGPAGRGPAPCRCADLHSLCVDSHALRRDGMRFLHTADWQLGMTRH FLAGDAQPRYSAARRDAVAGLKALAADVGAEFVVVAGDVFEHNQLAPQIV GQSLEAMRVIGLPVYLLPGNHDPLDASSVYTSTLFRAERPDNVVVLDRAG VHEVRPGVQIVAAPWRSKAPTTDPVAEVLAGLPTDAAIRLLVAHGGVDAL DPDHDKPSLIRLAALDDALTRQAIHYVALGDKHSLTQVGSSGRVWYSGAP EVTNFDDVEPDPGHVLVVDIDESDPRHPVTVDARRIGRWRFVTLHHQVDT SRDIADLDLNLDLMTDKDRTVVRLALTGSLTVTDRAALDTCLDKYARLFA WLGLWERHTDLAVIPVDAEFTDLGIGGFAAAAVDELVATARGGDDESAVD AQAALALLLRLADRGAA >MT3534 IS1540, transposase MIDTAIEEMIPLIGVRAACAATGRAPASYYRAHSKRLSAQSDTFTSTAVT DPSGPRESAQPRALSAAEREHVLAVLNSQRFADMAPAVVYATLLDEGIYL CSESTMYRLLRERGQTGDRRRQATHPAAVKPELVAHQPNSVWSWDITKLR GPAKWSYYYLYVILDIFSRYVVGWMVASRESKVLAERLIAQTLAAQHISA DQLTLHADRGSSMSSKPVALLLADLGVTKSHSRPHTSNDNPLSEAQFKTL KYRPDFPKRFESIEAARVHCDRFFGWYNHEHKHSGIGLHTPADVHYGRAD QIRRHRATVLDTAYRDHLERIRSQTTRATRATGLQRDQPTTEGGPADSIN PRKSCLRNVDRFRPGLLDLPAPAPVDLRRLLPSGQIR >MT2881 IS6110, transposase MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR RILGWRVASTMATSMVLDAIEQAIWTRQQEGVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG >MT1197 MutT/nudix family protein MLNQIVVAGAIVRGCTVLVAQRVRPPELAGRWELPGGKVAAGETERAALA RELAEELGLEVADLAVGDRVGDDIALNGTTTLRAYRVHLLGGEPRARDHR ALCWVTAAELHDVDWVPADRGWIADLARTLNGSAADVHRRC >MT2886 CRISPR-associated protein, TM1807 family MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADI PAHKRKSFEAFVMNTDGAQATAPLKEWVEPNAVKLDPAKHRGYEVKIGSI EPRRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYL QSLVHKRTAQPVRVPGHQTREHRQYGERFERKELRKSGRPNTRPQDAVND LFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRECLAPGTSISH RVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIV GPIVYLGGGAGYRSKTFVTDQDDMAKVLDAQFGKVVKHVDKTRELRVSPL VLKRTKIDNICYEMGQCELSIRRAE >MT2587 IS1081, transposase MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKQI WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT >MT4027 MutT/nudix family protein MSDGEQAKSRRRRGRXRGRRAAATAENHMDAQPAGDATPTPATAKRSRSR SPRRGSTRMRTVHETSAGGLVIDGIDGPRDAQVAALIGRVDRRGRLLWSL PKGHIELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKT VHHYLMRFLGGELSDEDLEVAEVAWVPIRELPSRLAYADERRLAEVADEL IDKLQSDGPAALPPLPPSSPRRRPQTHSRARHADDSAPGQHNGPGPGP >MT1217 hypothetical protein MDPHRDLESRAFAGNWRVYQQQALDAFDADVAAGDNRAYLVLPPGAGKTM IGLEAARRLGRRSLVLVPNTAVQAQWAAAWDNSFPSSDRSASKCGTERGL ASAMNVLTYQSLAVIDAETDSTVRREVLRNRDQQALLDLLHPNGRAVIER AATLGPWTLVLDECHHLLATWGALVSALASVLGAQTALIGLTATPATELT AWQHTLHDELFGTADFVIPTPALVREGDLAPYQELVYLTQPTPEEQAWIG THRARFADLMLALIDQKVGSMSLAAWLHTRIVDRATREGNQIAWSTFERA EPDLACSGLRFAYDGLIPLPDGVRLREQHRIAPDAQDWVNVLTDFSVGHL QQSADPRDAHALTAIKRVLPGLGYRLTSRGVRVATSPVDRLCALSESKIA ATAHILDTEDAVLGARLRALVLCDFESMTGALPTSLKGAPVSEQSGSAQL VAAMLAASDHRRRTPLHALLVTGQTFACPAAIEDDLIAFCAERGALVTAE PLDAHPSLRVMRGTGGFTPRTWVALATEYFLAGRARVLVGTRSLLGEGWD CAAVNVNIDLTSATTQAAITQMRGRAIRNDPSDGHKVADNWSVCCIATEH PRGDADYLRLVRKHDGYYAATPQGLIESGVTHCDPSLSPYGPPVTDTHAI TARALXASPNAPRRDPGGESASXTKESTSQPSACAPASRSGSPHPASPPR H >MT2368 integrase, putative MTGAGIVETTTNRVRHVPVPEPVSERLRDELPTEPNALVFPSYRGGHLPI EEYRRAFDKGCKAVGIADLVPHGLRHTTASLAISAGANVKVVQRLLGHAT AAMTLDRHGHLLSDDLAGVAGLLVQAIKSAAASLRYSDPDSVAVENISAA S >MT2879 IS1604, transposase MAVGDDEEKVRAERARAIGLFRYQLIWEAADAAHSTKQRGKMVRELASRE HTDPFGRRVRISRQTIDRWIRGWRAGGFDALVPNPRQCTPRTPAEVLELA VALRRENPQRTAAAIRRILRTQLGWAPDERTLQRNFHRLGLTGATTGSAP AVFGRFEAEHPNALWTGDVLHGIRIDLRKTYLFAFLDDHSRLVPGYRWGH AEDTVRLAAALRPALASRGVPNAVYVDNGSPYVDAWLLRACAKLGVRLVH STPGRPQGRGKIERFFRTVREQFLVEITGEPDVVGRHYVADLAELNRLFT AWVETVYHRSVHSETGQTPLARWSAGGPIPLPAPETLTEAFLWEEHRRVT KTATVSLHGNRYEIDPALVGRKVELVFDPFDLTRIEVRLAGAPMGRAIPY HIGRHSHPKAKPETPTAPPKPSGIDYAQLIETAHAAELARGVNYTALTGA ADQIPGQLDLLTGQEAQPK >MT3836 DNA ligase, putative MQLPVMPPVSPMLAKSVTAIPPDASYEPKWDGFRSICFRDGDQVELGSRN ERPMTRYFPELVAAIRAELPHRCVIDGEIIIATDHGLDFEALQQRIHPAE SRVRMLADRTPASFIAFDLLALGDDDYTGRPFSERRAALVDAVTGSGADA DLSIHVTPATTDMATAQRWFSEFEGAGLDGVIAKPPHITYQPDKRVMFKI KHLRTADCVVAGYRVHKSGSDAIGSLLLGLYQEDGQLASVGVIGAFPMAE RRRLLTELQPLVTSFDDHPWNWAAHVAGQRTPRKNEFSRWNVGKDLSFVP LRPERVVEVRYDHMEGARFRHTAQFNRWRPDRDPRSCSYAQLERPLTVSL SDIVPGLR >MT2740 IS1081, transposase, truncation MGGHRVILRNDQQKSIEGNDAMTSSHLIDTEQLLADQLAQASPDLLRGLL STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANHGRHNA >MT2964 smf family protein MIDPTARAWAYLSRVAEPPCAQLAALVRCVGPVEAADRVRRGQVGNELAQ HTGARREIDRAADDLELLMRRGGRLITPDDDEWPVLAFAAFSGAGARARP CGHSPLVLWALGPARLDEVAPRAAAVVGTRAATAYGEHVAADLAAGLAER DVAVVSGGAYGIDGAAHRAALDSEGITVAVLAGGFDIPYPAGHSALLHRI AQHGVLFTEYPPGVRPARHRFLTRNRLVAAVARAAVVVEAGLRSGAANTA AWARALGRVVAAVPGPVTSSASAGCHTLLRHGAELVTRADDIVEFVGHIG ELAGDEPRPGAALDVLSEAERQVYEALPGRGAATIDEIAVGSGLLPAQVL GPLAILEVAGLAECRDGRWRILRAGAGQAAAKGAAARLV >MT2684 MutT/nudix family protein MTWLVLAGAVLLVVLVAFGAWGYQTANRLNRLNVRYDLSWQSLDSALARR AVVARAVAIDAYGGAPQGSRLAALADAAEGAPRHARENAENELSAALAMV NPASLPAALIAELADAEARVLLARRFHNDAVRDTLALGERRLVRLLRLGG TAVLPTYFEIVERPHALVHGDQGASGRRTSARVVLLDDSGAVLLLCGSDP ANPAFRDGAAPKWWFTVGGQVRPGERLAQAAARELAEETGLRVAPADMIG PIWRRDEVFEFNGSLIDSEEFYLVHRTRRFEPAVQGRTELERRYIRDARW CDANDIAQLVAAGERVYPLQLGELLPAANRLVDVALDNGAARDAGVPQPI R >MT2233 IS1558', transposase MRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMIGALDEQIEQLMH PFCARRELIASIPGIGVGASATVISEIGADPAAWFPSAEHLASWVRLCPG NHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYLREYYRRQVRKFG GFRSPAANKKAIIAVAHKLIVIIWHVLATGRPYQDLGADYFTTRMDPDKE RRRLVAKLEAQGLGVTLEPAA >MT1037 conserved hypothetical protein MYWPGRLDGCAEPHVQREAFAWHIDLAKRTGKPLMIHNRQADRDVLDVLR AEGAPDTVILHCFSSDAAMARTCVDAGWLLSLSGTVSFRNARELREAVPL MPVEQLLVETDAPYLTPHPHRGLANEPYCLPYTVRALAELVNRRPEEVAL ITTSNARRAYGLGWMRQ >MT3773 conserved hypothetical protein MSAGGTPLQAGATPTGSRGTVALRPDAGPSWLRPLVDNVGQIPDAYRRRL PADVLAMVTAAGAVSAMTSSRRDHREAAVLVLFSGPEAGPGDGGVPDDAD LLLTVRASTLRHHAGQAAFPGGVVDPADDGPVATALREANEETGIDPSRL HPLATMERTFIAPSRFHVVPVLAYSPDPGPVAVVNEAETAIVARVPVRAF INPANRLMVYRRPHTRRWAGPAFLLNQMLVWGFTGQVISAVLDVAGWAQP WDTGDIRELDAAMVLIDDESDPR >MT0150 hypothetical protein MRSIDVVVEAVVTFAGAAGFAHTLAPLRRGQQDPCFRVPGDGTIWRTSLL PTGPVTARISRAGRDAARCVAWGSGAEEFVDMAPAMLGAADDASDFVPLH PAVAAAHRRLPNLRLGRTGQVLEALIPAVIEQRVPGADAFRSWRLLVSKY GTQAPGPAPPGMRVPPSAEVWRHIPSWEFHRANVDPGRARAVVGCAQRAA SLERLVSLPAARAAEALTSLPGVGVWTAAETTQRVFGDADAVSVGDYHIP KMIGWTLVGRPVDDAGMLELLEPMRPHRHRVVRLLEASGLAREPRRGPRL PVQNIRAL >MT3494 IS1560, transposase MFRTVGDQASLWESVLPEELRRLPEELARVDALLDDSAFFCPFVPFFDPR MGRPSIPMETYLRLMFLKFRYRLGYESLCREVTDSITWRRFCRIPLEGSV PHPTTLMKLTTRCGEDAVAGLNEALLAKAASEKLLRTNKVRADTTVVEGD VGYPTDTGLLAKAVGSMARTVARIKAADAGSAPLGGSSGPRDRLQAAVTR RAATRSGAGLRAPDHRGASRDRRAGADRGCRGGT >MT3125 conserved hypothetical protein MNSPREPLVPPPTPRPAATVMLVRDPDAGSASGLAVFLMRRHAAMDFAAG VMVFPGGGVDDRXRDADLGRLGAWAGPPPQWWAQRFGIEPDLAEALVCAA ARETFEESGVLFAGPVDQDHSAPNSIVSDASVYGDARRALADRTLSFADF LQREKLVLRSDLLRPWANWVTPEAELTRRYDTYFFVGALPEGQRADGENT ESDRAGWVLPADAIADFAAGRNFLLPPTWTQLDSLAGHTVADVLAVERQI VPVQPQLARNGDNWEIEFFDSDRYNQARRSGGSTGWPL >MT3044 conserved hypothetical protein MTRIIGGVAGGRRIAVPPRGTRPTTDRVRESLFNIVTARRDLTGLAVLDL YAGSGALGLEALSRGAASVLFVESDQRSAAVIARNIEALGLSGATLRRGA VAAVVAAGTTSPVDLVLADPPYNVDSADVDAILAALGTNGWTREGTVAVV ERATTCAPLTWPEGWRRWPQRVYGDTRLELAERLFANV >MT3936 IS1537, resolvase MSVVCCRNRWMNLAVWAERNGVAWVIAYRWFRAGLLPVPAQRVGRLILVN DPAVEESGRGRTLVYARVSSADQRSDLDRRVARVTAWATSQHLSVDKVVA EGGWALNGHRRKFFALLGDPVVTRIVVEHRDRFCWFGSEYVEAALVAQGR ELVVVDLAEVDDDLVGDMTEILTSMCARLYGERAAQNGAKRALAAAVGDA EAA >MT1297.1 conserved hypothetical protein MHPKTGRAFRSPVEPGSGWPGDPATPQTPVAADAAQVSALAGGAGSICEL NALISVCRACPRLVSWREEVAVVKRRAFADQPYWGRPVPGWGSKRPRLLI LGLAPAAHGANRTGRMFTGDRSGDQLYAALHRAGLVNSPVSVDAADGLRA NRIRITAPVRCAPPGNSPTPAERLTCSPWLNAEWRLVSDHIRAIVALGGF AWQVALRLAGASGTPKPRFGHGVVTELGAGVRLLGCYHPSQQNMFTGRLT PTMLDDIFREAKKLAGIE >MT2724 phage integrase family protein MNTATRVRLARKRADRLNLKLIKNGHHFRLRDADEITLAVGHLGVVEAFL AAAKSQNKPPGPPPSLHAPPSWRRDIDDYLLNLNAAGQRPATIRLRKTVL CAAAHGLGRPPADVTAEHLLDWLGKQQHLSPEGRKTYRSTLRGFFVWAYE MDRVRDYVADSLPKVRCPKQPPRPAGDDVWQAALAKADRRIELMIRLAGE AGLRRAEAAQAHTGDLMDGGLLLVHGKGGKRRIVPISDYLAALIRDTPHG YLFPNGTGGHLTAEHVGKLVSRALPGDATMHTLRHRYATRAYRGSHNLRA VQQLLGHASIVTTERYTALCDDEVRAAAAAAW >MT2882 IS6110, hypothetical protein MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA AELDRPAR >MT3357 hypothetical protein MFNRYHRRVSDSRSSSWSRRSRGGSVARRAIRRGREMRGPLLPPTVPGWR SRAERFDMAVLEAYEPIERRWQERVSQLDIAVDEIPRIAAKDPESVQWPP EVIADGPIALARLIPAGVDVRGNATRARIVLFRKPIERRAKDTEELGELL HEILVAQVAIYLDVDPSVIDPTIDD >MT3964 hypothetical protein MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPYLSQXRS GNRTNPSGATMAALANFFRIKAAYFTDDEYYEKLDKELQWLCTMRDDGVR RIAQRAHGLPSAAQQKVLDRIDELRRAEGIDA >MT3296 helicase, UvrD/Rep family MSHIWGVEAGAALAPGLRGPVLVLGGPGTGKSTLLVEAAVAHIGAGTDPE SVLLLTGSGRMGMRARSALTTALLRSRTNGPCRAAIREPVVRTVHSYAYA VLRKAAQRAGDALPRLLTSAEQDAIIRELLAGDAEDGPAATTTWPAHLRP ALTTAGFATELRNLLARCAERGLDPLELQQLGRRRGRPEWIAAGQFAQRY EQVMLLRGAVGLAAPQATAPALSAAELVGAALEAFAVDPELLAAERARVR TLLVDDAQQLDPQAARLVRMLAAGTELALIAGDPNQAVFGFRGGEPTGLL ADDPPPAGGAPIPSVTLTVSHRCAPAVARAVTGIARRLPGRSVGRRIEGT GTEVGSVTVRLAGSAHAEAAMIADALRRAHLIDGVPWSQMAVIVRSVPRA VRLPRALAAAGVPVAPPAVGGPLSAEPAVRALLTVLEATADGLDGDQALL LLTGPIGGVDPVSLRQLRRTLQRARPGQTSRKFGDLLVEVLGGDAPPSGP GSRALRRVRAVLTAAARCHRSGSLGGQDPRHTLWAAWQRSGLQRRWLAAS EHGGAAAVQATRDLETVTALFDITDHYVSRTSGASLRGLVEHVTALQLPV VRPEPAAPTEQVMVLSAHAALGHEWDLVVIAGLQDGLWPNTVPRGGVLGT QRLLDELDGVTKDASMRAPLLAEERRLLVTAMGRARRRLLVTAVDSDAGG GGHEAVLPSAFFFEIAQWADGDGEPVAMQPVSAPRVLSAAAVVGRLRVVV CAPACAVDDADRDCAATQLARLAKAGVPGADPSEWHGLAPVSTSDPLCDS DDLVTLTPSTLQALNDCPLRWLAERHGGTNTRELPSAVGSVLHALFAEPG RSESQLLAELDRVWGHLPFGAQWYSANELARHRAMIQAFVQWRAQSRSEL TEVGVEVDIDGALEDGSGQARKIRLRGRADRLERDPAGRLVIVDIKTGKT PVSKDDAQQHAQLAMYQLAVAEGLVRAGDEPGGARLVYVGKSGAAGVAER KQDPLTPAARDEWRNLVRQLAAATAGPQFIARRNDGCTHCPLRPGCPAHV RGSAP >MT2982 serine/threonine protein kinase MGPSFGRAGRAERGYYRPMALASGVTFAGYTVVRMLGCSAMGEVYLVQHP GFPGWQALKVLSPAMAADDEFRRRFQRETEVAARLFHPHILEVHDRGEFD GQLWIAMDYVDGIDATQHMADRFPAVLPVGEVLAIVTAVAGALDYAHQRG LLHRDVNPANVVLTSQSAGDQRILLADFGIASQPSYPAPELSAGADVDGR ADQYALALTAIHLFAGAPPVDRSHTGPLQPPKLSAFRPDLARLDGVLSRA LATAPADRFGSCREFADAMNEQAGVAIADQSSGGVDASEVTAAAGEEAYV VDYPAYGWPEAVDCKEPSARAPAPAAPTPQRRGSMLQSAAGVLARRLDNF STATKAPASPTRRRPRRILVGAVAVLLLAGLFAVGIVIGRKTNTTATEVA RPPTSGSAVPSAPTTTVAVTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPD VNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQWKS RPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQ QGAVIRIPAVASRSGDLPPAVTVPDPATIPDTPDTTSTATLTPPTTTAPG PGR >MT2497 IS1558, transposase MQCRAREERPGRKTDLLDAEWLVHLLECGLLRGWLIPPADIKAARDVIRY RRKLVEHRTSKLQRLGNVLQDAGIKADSVASSVTPKSVRAMVEALIDGER RPAVLADLARGSMRSKIPDLQRALEGRFDDHHALMCRLHLAHLDQLDAMI GALDEQIEQLMHPFCARRELIASIPGIGVGASATVISEIGADPAAWFPSA EHLASWVRLCPGNHESAGKRHHGARRTGNQHLQPVLVECAWAAVRTDGYL REYYRRQVRKFGGFRSPAANKKAIXXVAHKLIVIIWHVLATGRPXQDLGA DYFTTRMDPDKERRRLVAKLEAQGLGVTLEPAA >MT3776 hypothetical protein MFTLLVSWLLVACVPGLLMLATLGLGRLERFLARDTVTATDVAEFLEQAE AVDVHTLARNGMPEALDYLHRRQARRITDSPPLGSGAGPRYAGPLFVTDL DSPVEPPRHGQPNPQFRTARHANHV >MT0633 IS1536, resolvase MACCRNRGMNLAAWAERNGVARVTAYRWFHAGLLPVPARKVGRLILVDEL ASEAGAQPKTAVYARVSSADQKSDLDRQVARVTSWATAEQIPVDKVVTEV GSVLNGHRRKFPAVLRDLSVTRIVVEHRDRFCRFGSEYVHAALAAQGREL VVVDSAEVDDDLVWDMTEILTSMCARLYGKRAAQNRAKRAVAAAAVDDHE AA >MT3204 hypothetical protein MPSSLAKRCSTPAPPRRGGHTKILHRLMSRQPIMGPTPIRGRRLRVVREE FAWLRSRLPTLWTNSYFVATVGGFGLS >MT0780 hypothetical protein MVRLILDTPSMKELSVAEQRYQAVLAVISDGLSISQVAEKVGVSRQTLHT WLARYEAEGLDGLRIGTGTAL >MT2161 helicase, SNF2/RAD54 family MLLPSLRSAPLDSPELIRLAPRPAARTDPMLLAWTVPVVDLDPTAALAAF DQPAPDVRYGASVDYLAELAVFARELVERGRVLPQLRRDTHGAAACWRPV LQGRDVVAMTSLVSAMPPVCRAEVGGHDPHELATSALDAMVDAAVRAALS PMDLLPPRRGRSKRHRAVEAWLTALTCPDGRFDAEPDELDALAEALRPWD DVGIGTVGPARATFRLSEVETENEETPAGSLWRLEFLLQSTQDPSLLVPA EQAWNDDGSLRRWLDRPQELLLTELGRASRIFPELVPALRTACPSGLELD ADGAYRFLSGTAAVLDEAGFGVLLPSWWDRRRKLGLVLSAYTPVDGVVGK ASKFGREQLVEFRWELAVGDDPLSEEEIAALTETKSPLIRLRGQWVALDT EQLRRGLEFLERKPTGRKTTAEILALAASHPDDVDTPLEVTAVRADGWLG DLLAGAAAASLQPLDPPDGFTATLRPYQQRGLAWLAFLSSLGLGSCLADD MGLGKTVQLLALETLESVQRHQDRGVGPTLLLCPMSLVGNWQQEAARFAP NLRVYAHHGGARLHGEALRDHLERTDLVVSTYTTATRDIDELAEYEWNRV VLDEAQAVKNSLSRAAKAVRRLRAAHRVALTGTPMENRLAELWSIMDFLN PGLLGSSERFRTRYAIPIERHGHTEPAERLRASTRPYILRRLKTDPAIID DLPEKIEIKQYCQLTTEQASLYQAVVADMMEKIENTEGIERRGNVLAAMA KLKQVCNHPAQLLHDRSPVGRRSGKVIRLEEILEEILAEGDRVLCFTQFT EFAELLVPHLAARFGRAARDIAYLHGGTPRKRRDEMVARFQSGDGPPIFL LSLKAGGTGLNLTAANHVVHLDRWWNPAVENQATDRAFRIGQRRTVQVRK FICTGTLEEKIDEMIEEKKALADLVVTDGEGWLTELSTRDLREVFALSEG AVGE >MT3064 DNA binding protein HU, putative MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVF EQRRRAARVARNPRTGETVKVKPTSVPAFRPGAQFKAVVSGAQRLPAEGP AVKRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAAT KAPAKKAVKATKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPAT KAPAKKATARRGRK >MT0948 IS1535, resolvase MNLADWAESVGVNRHTAYRWFREGTLPVPAERVGRLILVKTAASASAAAA GVVLYARVSSHDRRSDLDRQVARLTAWATERDLGVGQVVCEVGSGLNGKR PKLRRILSDPDARVIVVEHRDRLARFGVEHLEAALSAQGRRIVVADPGET TDDLVCDMIEVLTGMCARLYGRRGARNRAMRAVTEAKREPGAG >MT3835 conserved hypothetical protein MAAAAEELDVDGIAVRLTSPDRMYFPKLGSHGTKRRLVEYYFAVAGGPML TALRDRPTHLQRFPDGVDGEQIYQKRIPRHRPDYLQTCRVTFPSGRMADA LKVTHPAAIVWAAQMGTITLHPWQVRCPDTEHPDELRIDLDPQPGTGFVE ARTVAVDVLRSVLDDLGLVGYPKTSGGRGIHVFLRIATDWDFVEVRRAGI ALAREVERRAPDAVTTSWWKEERGARIFIDFNQNARDRTMASAYSVRPTP IATVSMPLTWEELAGADPDDYTMTTVPELVKIRDDPWAGMDDVAQSIAPL LDLAAADEERGLGDMPYPPNYPKMPGEPKRVQPSRDTDLKGGNTSK >MT1804 IS6110, hypothetical protein MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETV RKWVRQAQVDAGARPGTTTEESAELKRLRRDNAELRRANAILKTASAFFA AELDRPAR >MT2966 conserved hypothetical protein MAGDVGASVQVGRMTTLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWR CRYGELDVIACDAATRTVVFVEVKTRTGDGYGGLAHAVTERKVRRLRRLA GLWLADQEERWAAVRIDVIGVRVGPKNSGRTPELTHLQGIG >MT3100 IS6110, transposase MRWGVESICTQLTELGVPIAPSTYYDHINREPSRRELRDGELKEHISRVH AANYGVYGARKVWLTLNREGIEVARCTVERLMTKLGLSGTTRGKARRTTI ADPATARPADLVQRRFGPPAPNRLWVADLTYVSTWAGFAYVAFVTDAYAR RILGWRVASTMATSMVLDAIEQAIWTRQQESVLDLKDVIHHTDRGSQYTS IRFSERLAEAGIQPSVGAVGSSYDNALAETINGLYKTELIKPGKPWRSIE DVELATARWVDWFNHRRLYQYCGDVPPVELEAAYYAQRQRPAAG >MT3838 hypothetical protein MPKLSAGVLLYRARAGVVDVLLAHPGGPFWAGKDDGAWSIPKGEYTGGED PWLAARREFSEEIGLCVPDGPRIDFGSLKQSGGKVVTVFGVRADLDITDA RSSTFELDWPKGSGKMRKFPEVDRVSWFPVARARTKLLKGQRGFLDRLMA HPAVAGLSEGPESLPR >MT1508 conserved hypothetical protein, intein-containing MTLTPEASKSVAQPPTQAPLTQEEAIASLGRYGYGWADSDVAGANAQRGL SEAVVRDISAKKNEPDWMLQSRLKALRIFDRKPIPKWGSNLDGIDFDNIK YFVRSTEKQAASWDDLPEDIRNTYDRLGIPEAEKQRLVAGVAAQYESEVV YHQIREDLEAQGVIFLDTDTGLREHPDIFKEYFGTVIPAGDNKFSALNTA VWSGGSFIYVPPGVHVDIPLQAYFRINTENMGQFERTLIIADEGSYVHYV EGCLPAGELITTADGDLRPIESIRVGDFVTGHDGRPHRVTAVQVRDLDGE LFTFTPMSPANAFSVTAEHPLLAIPRDEVRVMRKERNGWKAEVNSTKLRS AEPRWIAAKDVAEGDFLIYPKPKPIPHRTVLPLEFARLAGYYLAEGHACL TNGCESLIFSFHSDEFEYVEDVRQACKSLYEKSGSVLIEEHKHSARVTVY TKAGYAAMRDNVGIGSSNKKLSDLLMRQDETFLRELVDAYVNGDGNVTRR NGAVWKRVHTTSRLWAFQLQSILARLGHYATVELRRPGGPGVIMGRNVVR KDIYQVQWTEGGRGPKQARDCGDYFAVPIKKRAVREAHEPVYNLDVENPD SYLAYGFAVHNCTAPIYKSDSLHSAVVEIIVKPHARVRYTTIQNWSNNVY NLVTKRARAEAGATMEWIDGNIGSKVTMKYPAVWMTGEHAKGEVLSVAFA GEDQHQDTGAKMLHLAPNTSSNIVSKSVARGGGRTSYRGLVQVNKGAHGS RSSVKCDALLVDTVSRSDTYPYVDIREDDVTMGHEATVSKVSENQLFYLM SRGLTEDEAMAMVVRGFVEPIAKELPMEYALELNRLIELQMEGAVG >MT3299 6-O-methylguanine DNA methyltransferase, putative MAPVTDEQVELVRSLVAAIPLGRVSTYGDIAALAGLSSPRIVGWIMRTDS SDLPWHRVIRASGRPAQHLATRQLELLRAEGVLSVDGRVALSEIRYEFPP G >MT2232 serine/threonine protein kinase MVEAGTRDPLESALLDSRYLVQAKIASGGTSTVYRGLDVRLDRPVALKVM DSRYAGDEQFLTRFRLEARAVARLNNRALVAVYDQGKDGRHPFLVMELIE GGTLRELLIERGPMPPHAVVAVLRPVLGGLAAAHRAGLVHRDVKPENILI SDDGDVKLADFGLVRAVAAASITSTGVILGTAAYLSPEQVRDGNADPRSD VYSVGVLVYELLTGHTPFTGDSALSIAYQRLDADVPRASAVIDGVPPQFD ELVACATARNPADRYADAIAMGADLEAIAEELALPEFRVPAPRNSAQHRS AALYRSRITQQGQLGAKPVHHPTRQLTRQPGDCSEPASGSEPEHEPITGQ FAGIAIEEFIWARQHARRMVLVWVSVVLAITGLVASAAWTIGSNLSGLL >MT3293 MutT/nudix family protein MRWSRPRWAGLRGTCATSCSAWYVTASCCALGRPRWTPSRPATGCSIFAR WGASGDRKRGGAGRGGSPPGGAPVTNVSGVDFQLRSVPLLSRVGADRADR LRTDMEAAAAGWPGAALLRVDSRNRVLVANGRVLLGAAIELADKPPPEAV FLGRVEGGRHVWAVRAALQPIADPDIPAEAVDLRGLGRIMDDTSSQLVSS ASALLNWHDNARFSALDGAPTKPARAGWSRVNPITGHEEFPRIDPAVICL VHDGADRAVLARQAAWPERMFSLLAGFVEAGESFEVCVAREIREEIGLTV RDVRYLGSQQWPFPRSLMVGFHALGDPDEEFSFSDGEIAEAAWFTRDEVR AALAAGDWSSASESKLLLPGSISIARVIIESWAACE >MT1029 hypothetical protein MCDKLGGVAIAVQGALFEHNERRQLGDGAFIDIRSGWLTGGEELLDALLS TVPWRAERRQMYDRVVDVPRLVSFHDLTIEDPPHPQLARMRRRLNDIYGG ELGEPFTTAGLCYYRDGSDSVAWHGDTIGRGSTEDTMVAIVSLGATRVFA LRPRGRGPSLRLPLAHGDLLVMGGSCQRTFEHAVPKTSAPTGPRVSIQFR PRDVR >MT3537 IS1535, transposase MFAELIRAGLQALIEAEATEAIGAGRYERSDGRIVHRNGHRPKTVSTTAG DIEVQIPKLRAGSFFPSLLERRRRIDKALHAVIMEAYVHGVSTRSVDDLV AAMGVQAGVSKSEVSRICAGLDTEIEAFRTRSLTHTEFPYVFCDATFCKV RVGAHVVSQALVVATGVSIDGTREVLGTAVGDSESYEFWREFLASLKARG LTGVHLVISDAHAGLKAAVAQQFSGASWQRCRVHFMRNLYTAVAAKHAPA VTVAVKTIFAHTDPEEVGAQWDRVADPLCQP >MT1739 MutT/nudix family protein MAEHDFETISSETLHTGAIFALRRDQVRMPGGGIVTREVVEHFGAVAIVA MDDNGNIPMVYQYRHTYGRRLWELPAGLLDVAGEPPHLTAARELREEVGL QASTWQVLVDLDTAPGFSDESVRVYLATGLREVGRPEAHHEEADMTMGWY PIAEAARRVLRGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAF AARRAER >MT2631 conserved hypothetical protein MVPAQHRPPDRPGDPAHDPGRGRRLGIDVGAARIGVACSDPDAILATPVE TVRRDRSGKHLRRLAALAAELEAVEVIVGLPRTLADRIGRSAQDAIELAE ALARRVSPTPVRLADERLTTVSAQRSLRQAGVRASEQRAVIDQAAAVAIL QSWLDERLAAMAGTQEGSDA >MT1491 hypothetical protein MTVMADRSGRPAPVRRRMKTLTQAALNADKTVEQVEDVLDGLGKTMAELN SSLSQLNSTVERLEDGLDHLEGTLHSLDDLAKRLIVLVEPVEAIVDRIDY IVSLGETVMSPLSVTEHAVRGVLDRLRNRTVHEPTN >MT2636 ATPase, AAA family MPEAVSDGLFDVPGVPMTSGHDLGASAGAPLAVRMRPASLDEVVGQDHLL APGSPLRRLVEGSGVASVILYGPPGSGKTTLAALISQATGRRFEALSALS AGVKEVRAVIENSRKALLHGEQTVLFIDEVHRFSKTQQDALLSAVEHRVV LLVAATTENPSFSVVAPLLSRSLILQLRPLTAEDTRAVVQRAIDDPRGLG RAVAVAPEAVDLLVQLAAGDARRALTALEVAAEAAQAAGELVSVQTIERS VDKAAVRYDRDGDQHYDVVSAFIKSVRGSDVDAALHYLARMLVAGEDPRF IARRLMILASEDIGMAGPSALQVAVAAAQTVALIGMPEAQLTLAHATIHL ATAPKSNAVTTALAAAMNDIKAGKAGLVPAHLRDGHYSGAAALGNAQGYK YSHDDPDGVVAQQYPPDELVDVDYYRPTGRGGEREIAGRLDRLRAIIRKK RG >MT1727.1 DNA-3-methyladenine glycosidase MNAEELAIDPVAAAHRLLGATIAGRGVRAMVVEVEAYGGVPDGPWPDAAA HSYRGRNGRNDVMFGPPGRLYTYRSHGIHVCANVACGPDGTAAAVLLRAA AIEDGAELATSRRGQTVRAVALARGPGNLCAALGITMADNGIDLFDPSSP VRLRLNDTHRARSGPRVGVSQAADRPWRLWLTGRPEVSAYRRSSRAPARG ASD >MT1237 IS1081, transposase MGGHRVILRNDQQKSIEGNDAMTSSHLIDAEQLLADQLAQASPDLLRGLL STFIAALMGAEADALCGAGYRERSDERSNQRNGYRHRDFDTRAATIDVAI PKLRQGSYFPDWLLQRRKRAERALTSVVATCYLLGVSTRRMERLVETLGV TKLSKSQVSIMAKELDEAVEAFRTRPLDAGPYTFLAADALVLKVREAGRV VGVHTLIATGVNAEGYREILGIQVTSAEDGAGWLAFFRDLVARGLSGVAL VTSDAHAGLVAAIGATLPAAAWQRCRTHYAANLMAATPKPSWPWVRTLLH SIYDQPDAESVVAQYDRVLDALTDKLPAVAEHLDTARTDLLAFTAFPKKI WRQIWSNNPQERLNREVRRRTDVVGIFPDRASIIRLVGAVLAEQHDEWIE GRRYLGLEVLTRARAALTSTEEPAKQQTTNTPALTT >MT3573.7 conserved hypothetical protein MPRPPKPARLKLVEGRSPGRDSGGRKVPESPKFIRQAPDAPDWLDAEALA EWRRVAPTLERLDLLKPEDRALLSAYCETWSVYVAAVQRVRAEGLTITSP KSGVVHRNPAVTVAETARMHLLRLASEFGLTPAAEQRLAVAPGDDGDGLN PFAPDR >MT3740 IS1534, istB protein MAAKTATNSRDVAAELAYLTRALKAPTLRGAIEQLADRARTKTWSYEEFL AACLQREVSARESHGGEGRIRAARFPSRKSLEEFDFDHARGLKRDTIAHL GTLDFVTLAIGIAIRACQAGHRVLFATASQWVDRLAAAHHSGTLQSELIR LARYPLLVVDEVGYIPFEPEAANLFFQLVSSRYERASLIVTSNKPFGRWG EVFGDDVVAAAMIDRLVHHAEVIALKGDSYRIKDRDLGRVPTVTADDQ >MT2153 ATP-dependent RNA helicase, DEAD/DEAH box family MTELAELDRFTAELPFSLDDFQQRACSALERGHGVLVCAPTGAGKTVVGE FAVHLALAAGSKCFYTTPLKALSNQKHTDLTARYGRDQIGLLTGDLSVNG NAPVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADRMRGPVWEEVI LQLPDDVRVVSLSATVSNAEEFGGWIQTVRGDTTVVVDEHRPVPLWQHVL VGKRMFDLFDYRIGEAEGQPQVNRELLRHIAHRREADRMADWQPRRRGSG RPGFYRPPGRPEVIAKLDAEGLLPAITFVFSRAGCDAAVTQCLRSPLRLT SEEERARIAEVIDHRCGDLADSDLAVLGYYEWREGLLRGLAAHHAGMLPA FRHTVEELFTAGLVKAVFATETLALGINMPARTVVLERLVKFNGEQHMPL TPGEYTQLTGRAGRRGIDVEGHAVVIWHPEIEPSEVAGLASTRTFPLRSS FAPSYNMTINLVHRMGPQQAHRLLEQSFAQYQADRSVVGLVRGIERGNRI LGEIAAELGGSDAPILEYARLRARVSELERAQARASRLQRRQAATDALAA LRRGDIITITHGRRGGLAVVLESARDRDDPRPLVLTEHRWAGRISSADYS GTTPVGSMTLPKRVEHRQPRVRRDLASALRSAAAGLVIPAARRVSEAGGF HDPELESSREQLRRHPVHTSPGLEDQIRQAERYLRIERDNAQLERKVAAA TNSLARTFDRFVGLLTEREFIDGPATDPVVTDDGRLLARIYSESDLLVAE CLRTGAWEGLKPAELAGVVSAVVYETRGGDGQGAPFGADVPTPRLRQALT QTSRLSTTLRADEQAHRITPSREPDDGFVRVIYRWSRTGDLAAALAAADV NGSGSPLLAGDFVRWCRQVLDLLDQVRNAAPNPELRATAKRAIGDIRRGV VAVDAG >MT3363 modification methylase MQPSHPTRPGAVIRYVGSSLDTCPMTTFAGKTAASADKVRGGYYTPPAVA RFLAHWVHQAGPKILEPSCGDGRILRELSAITDHAHGVELVAREAKKSRD FASVDTENLFTWLHKTQLGSWDGVAGNPPYIRFGNWASEQRDPALELMRR VGLRPTKLTNAWVPFVVASTTLARDGGRVGLVVPAELLQVTYAAQLREFL LSRYREITLVTFERLVFDGILQEVVLFCGVVGPGPAHIRTVRLGDANDLN ALGDKDFTNESAPALLHEKEKWTKYFLDPAQIRLLRGLKQSATMIRLGEL ADVDVGIVTGRNSFFTFTDAKAQALGLRAHCVPLVSRSAQLSGLIYDEDC RACDVAGNHRTWLLDAADYPTDPALVAHITAGEAAGVHLGYKCSIRKPWW STPSLWMPDLFMLRQIHFAPRLTVNAAAATSTDTVHRVRLDPNVDPATLA AVFHNSATFAFAEIMGRSYGGGILELEPREAEQLPMPPPAYGSAELAQDV DLLLKANEIDKALDVVDRHVLIDGLGLSPRLVAGCRAAWLTLRDRRTKRG SRR >MT2151 5'-3'-exonuclease, putative MRSPLVLLDGASMWFRSFFGVPSSITAPDGRPVNAVRGFIDSMAVVITQQ RPNRLAVCLDLDWRPQFRVDLIPSYKAHRVAEPEPNGQPDVEEVPDELTP QVDMIMELLDAFGIAMAGAPGFEADDVLGTLATRERRDPVIVVSGDRDLL QVVADDPVPVRVLYLGRGLAKATLFGPAEVAERYGLPAHRAGAAYAELAL LRGDPSDGLPGVPGVGEKTAATLLARHGSLDQIMAAADDRKTTMAKGLRT KLLAASAYIKAADRVVRVATDAPVTLSTPTDRLPLVAADPERTAELATRF GVESSIARLQKALDTLPG >MT1292 deaD-1, ATP-dependent RNA helicase DeaD MAFPEYSPAASAATFADLQIHPRVLRAIGDVGYESPTAIQAATIPALMAG SDVVGLAQTGTGKTAAFAIPMLSKIDITSKVPQALVLVPTRELALQVAEA FGRYGAYLSQLNVLPIYGGSSYAVQLAGLRRGAQVVVGTPGRMIDHLERA TLDLSRVDFLVLDEADEMLTMGFADDVERILSETPEYKQVALFSATMPPA IRKLSAKYLHDPFEVTCKAKTAVAENISQSYIQVARKMDALTRVLEVEPF EAMIVFVRTKQATEEIAEKLRARGFSAAAISGDVPQAQRERTITALRDGD IDILVATDVAARGLDVERISHVLNYDIPHDTESYVHRIGRTGRAGRSGAA LIFVSPRELHLLKAIEKATRQTLTEAQLPTVEDVNTQRVAKFADSITNAL GGPGIELFRRLVEEYEREHDVPMADIAAALAVQCRGGEAFLMAPDPPLSR RNRDQRRDRPQRPKRRPDLTTYRVAVGKRHKIGPGAIVGAIANEGGLHRS DFGQIRIGPDFSLVELPAKLPRATLKKLAQTRISGVLIDLRPYRPPDAAR RHNGGKPRRKHVG >MT3307 deaD-2, ATP-dependent RNA helicase DeaD MTAVKHTTESTFAKLGVRDEIVRALGEEGIKRPFAIQELTLPLALDGEDV IGQARTGMGKTFAFGVPLLQRITSGDGTRPLTGAPRALVVVPTRELCLQV TDDLATAGKYLTAGPDTDDAAAVRRRLSVVSIYGGRPYEPQIEALRAGAD VVVGTPGRLLDLCQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQI PADRQSMLFSATMPDPIITLARTFMVRPTHIRAEAPHSSAVHDATEQFVY RAHALDKVELVSRVLQARDRGATMIFTRTKRTAQKVADELTERGFAVGAV HGDLGQLAREKALKAFRTGGIDVLVATDVAARGIDIDDVTHVINYQCPED EKMYVHRIGRTGRAGRTGVAVTLVDWDELPRWSMIDQALGLGSPDPAETY SNSPHLYAELAIPATAGGTVGPARKSQGRRRDTDCDGQKTAQHARNTPRR RRTRGGKPVTGHPGTNPISSPIVGGDATSEPGSGTASDSGSDVVSGSRSG NGEAARRRRRRRRRPTHAQDGFAARAN >MT1371 dinG, ATP-dependent helicase DinG MSESVSMSVPELLAIAVAALGGTRRRGQQEMAAAVAHAFETGEHLVVQAG TGTGKSLAYLVPAIIRALCDDAPVVVSTATIALQRQLVDRDLPQLVDSLT NALPRRPKFALLKGRRNYLCLNKIHNSVTASDHDDERPQEELFDPVAVTA LGRDVQRLTAWASTTVSGDRDDLKPGVGDRSWSQVSVSARECLGVARCPF GSECFSERARGAAGLADVVVTNHALLAIDAVAESAVLPEHRLLVVDEAHE LADRVTSVAAAELTSATLGMAARRITRLVDPKVTQRLQAASATFSSAIHD ARPGRIDCLDDEMATYLSALRDAASAARSAIDTGSDTTTASVRAEAGAVL TEISDTASRILASFAPAIPDRSDVVWLEHEDNHESARAVLRVAPLSVAEL LATQVFARATTVLTSATLTIGGSFDAMATAWGLTADTPWRGLDVGSPFQH AKSGILYVAAHLPPPGRDGSGSAEQLTEIAELITAAGGRTLGLFSSMRAA RAATEAMRERLSTPVLCQGDDSTSTLVEKFTADAATSLFGTLSLWQGVDV PGPSLSLVLIDRIPFPRPDDPLLSARQRAVAARGGNGFMTVAASHAALLL AQGSGRLLRRVTDRGVVAVLDSRMATARYGEFLRASLPPFWQTTNATQVR AALRRLARADAKAH >MT0001 dnaA, chromosomal replication initiator protein DnaA MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQRAWLNL VQPLTIVEGFALLSVPSSFVQNEIERHLRAPITDALSRRLGHQIQLGVRI APPATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSY FTERPHNTDSATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARA YNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTEEFTNDFINSL RDDRKVAFKRSYRDVDVLLVDDIQFIEGKEGIQEEFFHTFNTLHNANKQI VISSDRPPKQLATLEDRLRTRFEWGLITDVQPPELETRIAILRKKAQMER LAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDKALAEIVLR DLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYL CRELTDLSLPKIGQAFGRDHTTVMYAQRKILSEMAERREVFDHVKELTTR IRQRSKR >MT0064 dnaB, replicative DNA helicase, intein-containing MAVVDDLAPGMDSSPPSEDYGRQPPQDLAAEQSVLGGMLLSKDAIADVLE RLRPGDFYRPAHQNVYDAILDLYGRGEPADAVTVAAELDRRGLLRRIGGA PYLHTLISTVPTAANAGYYASIVAEKALLRRLVEAGTRVVQYGYAGAEGA DVAEVVDRAQAEIYDVADRRLSEDFVALEDLLQPTMDEIDAIASSGGLAR GVATGFTELDEVTNGLHPGQMVIVAARPGVGKSTLGLDFMRSCSIRHRMA SVIFSLEMSKSEIVMRLLSAEAKIKLSDMRSGRMSDDDWTRLARRMSEIS EAPLFIDDSPNLTMMEIRAKARRLRQKANLKLIVVDYLQLMTSGKKYESR QVEVSEFSRHLKLLAKELEVPVVAISQLNRGPEQRTDKKPMLADLRESGC LTASTRILRADTGAEVAFGELMRSGERPMVWSLDERLRMVARPMINVFPS GRKEVFRLRLASGREVEATGSHPFMKFEGWTPLAQLKVGDRIAAPRRVPE PIDTQRMPESELISLARMIGDGSCLKNQPIRYEPVDEANLAAVTVSAAHS DGAAIRDDYLAARVPSLRPARQRLPRGRCTPIAAWLAGLGLFTKRSHEKC VPEAVFRAPNDQVALFLRHLWSAGGSVRWDPTNGQGRVYYGSTSRRLIDD VAQLLLRVGIFSWITHAPKLGGHDSWRLHIHGAKDQVRFLRHVGVHGAEA VAAQEMLRQLKGPVRNPNLDSAPKKVWAQVRNRLSAKQMMDIQLHEPTMW KHSPSRSRPHRAEARIEDRAIHELARGDAYWDTVVEITSIGDQHVFDGTV SGTHNFVANGISLHNSLEQDADVVILLHRPDAFDRDDPRGGEADFILAKH RNGPTKTVTVAHQLHLSRFANMAR >MT1598 dnaE, DNA polymerase III, alpha subunit MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGN MFGASEFYNSATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVS GSGSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAE GIIITTGCPSGEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGL TIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEALLCVQTGKTL SDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVW TPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFPAGPPDGYRERAAYEID VICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDID PIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQV ITFGTIKTKAALKDSARIHYGQPGFAIADRITKALPPAIMAKDIPLSGIT DPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMS SEPLTEAIPLWKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDA IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRDLLRR MQPTGFEDVVAVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPL REILAETYGLIVYQEQIMRIAQKVASYSLARADILRKAMGKKKREVLEKE FEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWTAY LKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNF ASVGQDIRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISAC NKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLF GSNDDGTGTADPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAH LLAAQVDTAIPAILDGDVPNDAQVRVGGILASVNRRVNKNGMPWASAQLE DLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRDDRIALIANDLTV PDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTSQVHLRLISGD RITTLALDQSLRVTPSPALMGDLKELLGPGCLGS >MT2408 dnaG, DNA primase MSGRISDRDIAAIREGARIEDVVGDYVQLRRAGADSLKGLCPFHNEKSPS FHVRPNHGHFHCFGCGEGGDVYAFIQKIEHVSFVEAVELLADRIGHTISY TGAATSVQRDRGSRSRLLAANAAAAAFYAQALQSDEAAPARQYLTERSFD AAAARKFGCGFAPSGWDSLTKHLQRKGFEFEELEAAGLSRQGRHGPMDRF HRRLLWPIRTSAGEVVGFGARRLFDDDAMEAKYVNTPETLLYKKSSVMFG IDLAKRDIAKGHQAVVVEGYTDVMAMHLAGVTTAVASCGTAFGGEHLAML RRLMMDDSFFRGELIYVFDGDEAGRAAALKAFDGEQKLAGQSFVAVAPDG MDPCDLRLKCGDAALRDLVARRTPLFEFAIRAAIAEMDLDSAEGRVAALR RCVPMVGQIKDPTLRDEYARQLAGWVGWADVAQVIGRVRGEAKRTKHPRL GRLGSTTIARAAQRPTAGPPTELAVRPDPRDPTLWPQREALKSALQYPAL AGPVFDALTVEGFTHPEYAAVRAAIDTAGGTSAGLSGAQWLDMVRQQTTS TVTSALISELGVEAIQVDDDKLPRYIAGVLARLQEVWLGRQIAEVKSKLQ RMSPIEQGDEYHALFGDLVAMEAYRRSLLEQASGDDLTA >MT0002 dnaN, DNA polymerase III, beta subunit MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSGVLLTGS DNGLTISGFDYEVSAEAQVGAEIVSPGSVLVSGRLLSDITRALPNKPVDV HVEGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQ VAIAAGRDDTLPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDI EAAVLVPAKTLAEAAKAGIGGSDVRLSLGTGPGVGKDGLLGISGNGKRST TRLLDAEFPKFRQLLPTEHTAVATMDVAELIEAIKLVALVADRGAQVRME FADGSVRLSAGADDVGRAEEDLVVDYAGEPLTIAFNPTYLTDGLSSLRSE RVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYVYLLMPVRL PG >MT3814 dnaQ, DNA polymerase III, epsilon subunit MSHTWGRPASHQDRGWAVIDVETSGFRPGQARIISLAVLGLDAAGRLEQS VVSLLNPKVDPGPTHVHGLTAAMLDGQPQFADIAGEVVDVLRGRTLVAHN VAFDYAFLAAEAEIAEAELPVDFVMCTVELARRLQLGVDNLRLETLAAHW GVPQQRPHDAFDDVRVLTGILAAALESARELDVWLPVHPVTRRRWPNGRV THDELRPLKALAARMACPYLNPGRYVQGRPLVQGMRVGLAAEVKRTHEEL VERILHAGLAYSDVVDRDTSLVVCNATAPEHGKGYHALQLGVPVMPEARF MECIGAVVGGASVEDFTDVAPVEKQLALF >MT3824 dnaZX, DNA polymerase III, gamma and tau subunits MALYRKYRPASFAEVVGQEHVTAPLSVALDAGRINHAYLFSGPRGCGKTS SARILARSLNCAQGPTANPCGVCESCVSLAPNAPGSIDVVELDAASHGGV DDTRELRDRAFYAPVQSRYRVFIVDEAHMVTTAGFNALLKIVEEPPEHLI FIFATTEPEKVLPTIRSRTHHYPFRLLPPRTMRALLARICEQEGVVVDDA VYPLVIRAGGGSPRDTLSVLDQLLAGAADTHVTYTRALGLLGVTDVALID DAVDALAACDAAALFGAIESVIDGGHDPRRFATDLLERFRDLIVLQSVPD AASRGVVDAPEDALDRMREQAARIGRATLTRYAEVVQAGLGEMRGATAPR LLLEVVCARLLLPSASDAESALLQRVERIETRLDMSIPAPQAVPRPSAAA AEPKHQPAREPRPVLAPTPASSEPTVAAVRSMWPTVRDKVRLRSRTTEVM LAGATVRALEDNTLVLTHESAPLARRLSEQRNADVLAEALKDALGVNWRV RCETGEPAAAASPVGGGANVATAKAVNPAPTANSTQRDEEEHMLAEAGRG DPSPRRDPEEVALELLQNELGARRIDNA >MT2539 fpg-1, formamidopyrimidine-DNA glycosylase MPEGHTLHRLARLHQRRFAGAPVSVSSPQGRFADSASALNGRVLRRASAW GKHLFHHYVGGPVVHVHLGLYGTFTEWARPTDGWLPEPAGQVRMRMVGAE FGTDLRGPTVCESIDDGEVADVVARLGPDPLRSDANPSSAWSRITKSRRP IGALLMDQTVIAGVGNVYRNELLFRHRIDPQRPGRGIGEPEFDAAWNDLV SLMKVGLRRGKIIVVRPEHDHGLPSYLPDRPRTYVYRRAGEPCRVCGGVI RTALLEGRNVFWCPVCQT >MT2994 fpg-2, formamidopyrimidine-DNA glycosylase MPELPEVEVVRRGLQAHVTGRTITEVRVHHPRAVRRHDAGPADLTARLRG ARINGTDRRGKYLWLTLNTAGVHRPTDTALVVHLGMSGQMLLGAVPCAAH VRISALLDDGTVLSFADQRTFGGWLLADLVTVDGSVVPVPVAHLARDPLD PRFDCDAVVKVLRRKHSELKRQLLDQRVVSGIGNIYADEALWRAKVNGAH VAATLRCRRLGAVLHAAADVMREALAKGGTSFDSLYVNVNGESGYFERSL DAYGREGENCRRCGAVIRRERFMNRSSFYCPRCQPRPRK >MT0006 gyrA, DNA gyrase subunit A MTDTTLPPDDSLDRIEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLKP VHRRVLYAMFDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDTLVRMA QPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETV DFIPNYDGRVQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADA VFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGTADAYKTGRGS IRMRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRDGKLAGIS NIEDQSSDRVGLRIVIEIKRDAVAKVVINNLYKHTQLQTSFGANMLAIVD GVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDA LDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQR IIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVDRHGDDRRTRIIA ADGDVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLK QDDIVAHFFVCSTHDLILFFTTQGRVYRAKAYDLPEASRTARGQHVANLL AFQPEERIAQVIQIRGYTDAPYLVLATRNGLVKKSKLTDFDSNRSGGIVA VNLRDNDELVGAVLCSADDDLLLVSANGQSIRFSATDEALRPMGRATSGV QGMRFNIDDRLLSLNVVREGTYLLVATSGGYAKRTAIEEYPVQGRGGKGV LTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQTKG VRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN >MT0005 gyrB, DNA gyrase subunit B MHATPEESIRIVAAQKKKAQDEYGAASITILEGLEAVRKRPGMYIGSTGE RGLHHLIWEVVDNAVDEAMAGYATTVNVVLLEDGGVEVADDGRGIPVATH ASGIPTVDVVMTQLHAGGKFDSDAYAISGGLHGVGVSVVNALSTRLEVEI KRDGYEWSQVYEKSEPLGLKQGAPTKKTGSTVRFWADPAVFETTEYDFET VARRLQEMAFLNKGLTINLTDERVTQDEVVDEVVSDVAEAPKSASERAAE STAPHKVKSRTFHYPGGLVDFVKHINRTKNAIHSSIVDFSGKGTGHEVEI AMQWNAGYSESVHTFANTINTHEGGTHEEGFRSALTSVVNKYAKDRKLLK DKDPNLTGDDIREGLAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQKVCN EQLTHWFEANPTDAKVVVNKAVSSAQARIAARKARELVRRKSATDIGGLP GKLADCRSTDPRKSELYVVEGDSAGGSAKSGRDSMFQAILPLRGKIINVE KARIDRVLKNTEVQAIITALGTGIHDEFDIGKLRYHKIVLMADADVDGQH ISTLLLTLLFRFMRPLIENGHVFLAQPPLYKLKWQRSDPEFAYSDRERDG LLEAGLKAGKKINKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLD DAAAADELFSILMGEDVDARRSFITRNAKDVRFLDV >MT3747 holB, DNA polymerase III, delta' subunit MPMMSGVFTRLVGQQAVEAELLATAKAARRDSAHSAGGGGTMTHAWLLTG PPGSGRSVAALCFAAALQCTSGGEPGCGRCRACTTTLAGTHADVRRVIPE GLSIGVDEMRAIVQIAARRPTTGHWQIVVIEDADRLTEGAANALLKVVEE PPPSTVFLLCAPSVDPEDIAVTLRSRCRHVALVTPSTHAIAQVLSDGDGL DPDTANWAASVSGGHVGRARRLATDPQARQRRERALGLARDAATPSRAYA AAEELVAGAEAEALALTAQRIEAETEELRTALGAGGTGKGTGAALRGATG AMKDLERRQKSRQTRASRDALDRALIDLATYFRDALLVAAHAGGVRANHP DMADRVAALAAHAPPERLLRCIEAVLACREALAVNVKPKFAVDAMVATIG QELR >MT3094 lig, DNA ligase MLRQWQALAEEVREHQFRYYVRDAPIISDAEFDELLRRLEALEEQHPELR TPDSPTQLVGGAGFATDFEPVDHLERMLSLDNAFTADELAAWAGRIHAEV GDAAHYLCELKIDGVALSLVYREGRLTRASTRGDGRTGEDVTLNARTIAD VPERLTPGDDYPVPEVLEVRGEVFFRLDDFQALNASLVEEGKAPFANPRN SAAGSLRQKDPAVTARRRLRMICHGLGHVEGFRPATLHQAYLALRAWGLP VSEHTTLATDLAGVRERIDYWGEHRHEVDHEIDGVVVKVDEVALQRRLGS TSRAPRWAIAYKYPPEEAQTKLLDIRVNVGRTGRITPFAFMTPVKVAGST VGQATLHNASEIKRKGVLIGDTVVIRKAGDVIPEVLGPVVELRDGSEREF IMPTTCPECGSPLAPEKEGDADIRCPNARGCPGQLRERVFHVASRNGLDI EVLGYEAGVALLQAKVIADEGELFALTERDLLRTDLFRTKAGELSANGKR LLVNLDKAKAAPLWRVLVALSIRHVGPTAARALATEFGSLDAIAAASTDQ LAAVEGVGPTIAAAVTEWFAVDWHREIVDKWRAAGVRMVDERDESVPRTL AGLTIVVTGSLTGFSRDDAKEAIVARGGKAAGSVSKKTNYVVAGDSPGSK YDKAVELGVPILDEDGFRRLLADGPASRT >MT1048 mfd, transcription-repair coupling factor MTAPGPACSDTPIAGLVELALSAPTFQQLMQRAGGRPDELTLIAPASARL LVASALARQGPLLVVTATGREADDLAAELRGVFGDAVALLPSWETLPHER LSPGVDTVGTRLMALRRLAHPDDAQLGPPLGVVVTSVRSLLQPMTPQLGM MEPLTLTVGDESPFDGVVARLVELAYTRVDMVGRRGEFAVRGGILDIFAP TAEHPVRVEFWGDEITEMRMFSVADQRSIPEIDIHTLVAFACRELLLSED VRARAAQLAARHPAAESTVTGSASDMLAKLAEGIAVDGMEAVLPVLWSDG HALLTDQLPDGTPVLVCDPEKVRTRAADLIRTGREFLEASWSVAALGTAE NQAPVDVEQLGGSGFVELDQVRAAAARTGHPWWTLSQLSDESAIELDVRA APSARGHQRDIDEIFAMLRAHIATGGYAALVAPGTGTAHRVVERLSESDT PAGMLDPGQAPKPGVVGVLQGPLRDGVIIPGANLVVITETDLTGSRVSAA EGKRLAAKRRNIVDPLALTAGDLVVHDQHGIGRFVEMVERTVGGARREYL VLEYASAKRGGGAKNTDKLYVPMDSLDQLSRYVGGQAPALSRLGGSDWAN TKTKARRAVREIAGELVSLYAKRQASPGHAFSPDTPWQAELEDAFGFTET VDQLTAIEEVKADMEKPIPMDRVICGDVGYGKTEIAVRAAFKAVQDGKQV AVLVPTTLLADQHLQTFGERMSGFPVTIKGLSRFTDAAESRAVIDGLADG SVDIVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVDVL TMSATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQIAAAL RRELLRDGQAFYVHNRVSSIDAAAARVRELVPEARVVVAHGQMPEDLLET TVQRFWNREHDILVCTTIVETGLDISNANTLIVERADTFGLSQLHQLRGR VGRSRERGYAYFLYPPQVPLTETAYDRLATIAQNNELGAGMAVALKDLEI RGAGNVLGIEQSGHVAGVGFDLYVRLVGEALETYRDAYRAAADGQTVRTA EEPKDVRIDLPVDAHLPPDYIASDRLRLEGYRRLAAASSDREVAAVVDEL TDRYGALPEPARRLAAVARLRLLCRGSGITDVTAASAATVRLSPLTLPDS AQVRLKRMYPGAHYRATTATVQVPIPRAGGLGAPRIRDVELVQMVADLIT ALAGKPRQHIGITNPSPPGEDGRGRNTTIKERQP >MT3396 nei, endonuclease VIII MPEGDTVWHTAATLRRHLAGRTLTRCDIRVPRFAAVDLTGEVVDEVISRG KHLFIRTGTASIHSHLQMDGSWRVGNRPVRVDHRARIILEANQQEQAIRV VGVDLGLLEVIDRHNDGAVVAHLGPDLLADDWDPQRAAANLIVAPDRPIA EALLDQRVLAGIGNVYCNELCFVSGVLPTAPVSAVADPRRLVTRARDMLW VNRFRWNRCTTGDTRAGRRLWVYGRAGQGCRRCGTLIAYDTTDERVRYWC PACQR >MT1357 ogt, methylated-DNA--protein-cysteinemethyltransferase MATAGEDRMIHYRTIDSPIGPLTLAGHGSVLTNLRMLEQTYEPSRTHWTP DPGAFSGAVDQLNAYFAGELTEFDVELDLRGTDFQQRVWKALLTIPYGET RSYGEIADQIGAPGAARAVGLANGHNPIAIIVPCHRVIGASGKLTGYGGG INRKRALLELEKSRAPADLTLFD >MT0976 pcrA, ATP-dependent helicase PcrA MSVHATDAKPPGPSPADQLLDGLNPQQRQAVVHEGSPLLIVAGAGSGKTA VLTRRIAYLMAARGVGVGQILAITFTNKAAAEMRERVVGLVGEKARYMWV STFHSTCVRILRNQAALIEGLNSNFSIYDADDSRRLLQMVGRDLGLDIKR YSPRLLANAISNLKNELIDPHQALAGLTEDSDDLARAVASVYDEYQRRLR AANALDFDDLIGETVAVLQAFPQIAQYYRRRFRHVLVDEYQDTNHAQYVL VRELVGRDSNDGIPPGELCVVGDADQSIYAFRGATIRNIEDFERDYPDTR TILLEQNYRSTQNILSAANSVIARNAGRREKRLWTDAGAGELIVGYVADN EHDEARFVAEEIDALAEGSEITYNDVAVFYRTNNSSRSLEEVLIRAGIPY KVVGGVRFYERKEIRDIVAYLRVLDNPGDAVSLRRILNTPRRGIGDRAEA CVAVYAENTGVGFGDALVAAAQGKVPMLNTRAEKAIAGFVEMFDELRGRL DDDLGELVEAVLERTGYRRELEASTDPQELARLDNLNELVSVAHEFSTDR ENAAALGPDDEDVPDTGVLADFLERVSLVADADEIPEHGAGVVTLMTLHT AKGLEFPVVFVTGWEDGMFPHMRALDNPTELSEERRLAYVGITRARQRLY VSRAIVRSSWGQPMLNPESRFLREIPQELIDWRRTAPKPSFSAPVSGAGR FGSARPSPTRSGASRRPLLVLQVGDRVTHDKYGLGRVEEVSGVGESAMSL IDFGSSGRVKLMHNHAPVTKL >MT3775 pdg, ultraviolet N-glycosylase/AP lyase MTAAKSSRSKPAARAADVPGRWSAETRLALVRRARRMNRALAQAFPHVYC ELDFTTPLELAVATILSAQSTDKRVNLTTPALFARYRTARDYAQADRTEL ESLIRPTGFYRNKAASLIGLGQALVERFGGEVPATMDKLVTLPGVGRKTA NVILGNAFGIPGITVDTHFGRLVRRWRWTTAEDPVKVEQAVGELIERKEW TLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAPLVQG PETDHLLALAGL >MT1665 polA, DNA polymerase I MVTTASAPSEDRAKPTLMLLDGNSLAFRAFYALPAENFKTRGGLTTNAVY GFTAMLINLLRDEAPTHIAAAFDVSRQTFRLQRYPEYKANRSSTPDEFAG QIDITKEVLGALGITVLSEPGFEADDLIATLATQAENEGYRVLVVTGDRD ALQLVSDDVTVLYPRKGVSELTRFTPEAVVEKYGLTPRQYPDFAALRGDP SDNLPGIPGVGEKTAAKWIAEYGSLRSLVDNVDAVRGKVGDALRANLASV VRNRELTDLVRDVPLAQTPDTLRLQPWDRDHIHRLFDDLEFRVLRDRLFD TLAAAGGPEVDEGFDVRGGALAPGTVRQWLAEHAGDGRRAGLTVVGTHLP HGGDATAMAVAAADGEGAYLDTATLTPDDDAALAAWLADPAKPKALHEAK AAVHDLAGRGWTLEGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA ETPQQQQLSLLDDDDTDAETIQTTILRARAVIDLADALDAELARIDSTAL LGEMELPVQRVLAKMESAGIAVDLPMLTELQSQFGDQIRDAAEAAYGVIG KQINLGSPKQLQVVLFDELGMPKTKRTKTGYTTDADALQSLFDKTGHPFL QHLLAHRDVTRLKVTVDGLLQAVAADGRIHTTFNQTIAATGRLSSTEPNL QNIPIRTDAGRRIRDAFVVGDGYAELMTADYSQIEMRIMAHLSGDEGLIE AFNTGEDLHSFVASRAFGVPIDEVTGELRRRVKAMSYGLAYGLSAYGLSQ QLKISTEEANEQMDAYFARFGGVRDYLRAVVERARKDGYTSTVLGRRRYL PELDSSNRQVREAAERAALNAPIQGSAADIIKVAMIQVDKALNEAQLASR MLLQVHDELLFEIAPGERERVEALVRDKMGGAYPLDVPLEVSVGYGRSWD AAAH >MT2806 recA, recA protein, intein-containing MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTGSIALDV ALGIGGLPRGRVIEIYGPESSGKTTVALHAVANAQAAGGVAAFIDAEHAL DPDYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDIVVIDSVAAL VPRAELEGEMGDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLRDKI GVMFGSPETTTGGKALKFYASVRMDVRRVETLKDGTNAVGNRTRVKVVKN KCLAEGTRIFDPVTGTTHRIEDVVDGRKPIHVVAAAKDGTLHARPVVSWF DQGTRDVIGLRIAGGAIVWATPDHKVLTEYGWRAAGELRKGDRVAQPRRF DGFGDSAPIPADHARLLGYLIGDGRDGWVGGKTPINFINVQRALIDDVTR IAATLGCAAHPQGRISLAIAHRPGERNGVADLCQQAGIYGKLAWEKTIPN WFFEPDIAADIVGNLLFGLFESDGWVSREQTGALRVGYTTTSEQLAHQIH WLLLRFGVGSTVRDYDPTQKRPSIVNGRRIQSKRQVFEVRISGMDNVTAF AESVPMWGPRGAALIQAIPEATQGRRRGSQATYLAAEMTDAVLNYLDERG VTAQEAAAMIGVASGDPRGGMKQVLGASRLRRDRVQALADALDDKFLHDM LAEELRYSVIREVLPTRRARTFDLEVEELHTLVAEGVVVHNCSPPFKQAE FDILYGKGISREGSLIDMGVDQGLIRKSGAWFTYEGEQLGQGKENARNFL VENADVADEIEKKIKEKLGIGAVVTDDPSNDGVLPAPVDF >MT0658 recB, exodeoxyribonuclease V, beta subunit MDRFELLGPLPREGTTTVLEASAGTGKTFALAGLVTRYLAETAATLDEML LITFNRAASRELRERVRGQIVEAVGALQGDAPPSGELVEHLLRGSDAERA QKRSRLRDALANFDAATIATTHEFCGSVLKSLGVAGDNAADVELKESLTD LVTEIVDDRYLANFGRQETDPELTYAEALALALAVVDDPCAQLRPPDPEP GSKAAVRLRFAAEVLEELERRKGRLRAQGFNDLLIRLATALEAADSPARD RMRERWRIVLVDEFQDTDPMQWRVLERAFSRHSALILIGDPKQAIYGFRG GDIHTYLKAAGTADARYTLGVNWRSDRALVESLQTVLRDATLGHADIVVR GTDAHHAGHRLASAPRPAPFRLRVVKRHTLGYDGTAHVPIEALRRHIPDD LAADVAALLASGATFAGRPVVAADIAVIVEHHKDARACRNALAEAGIPAI YTGDTDVFASQAAKDWLCLLEAFDAPQRSGLVRAAACTMFFGETAESLAA EGDALTDRVAGTLREWADHARHRGVAAVFQAAQLAGMGRRVLSQRGGERD LTDLAHIAQLLHEAAHRERLGLPGLRDWLRRQAKAGAGPPEHNRRLDSDA AAVQIMTVFVAKGLQFPIVYLPFAFNRNVRSDDILLYHDDGTRCLYIGGK DGGAQRRTVEGLNRVEAAHDNLRLTYVALTRAQSQVVAWWAPTFDEVNGG LSRLLRGRRPGQSQVPDRCTPRVTDEQAWAVFAQWEAAGGPSVEESVIGA RSSLEKPVPVPGFEVRHFHRRIDTTWRRTSYSDLVRGSEAVTVTSEPAAG GRADEVEIAVVAAPGSGADLTSPLAALPSGASFGSLVHAVLETADPAAPD LAAELEAQVRRHAPWWTVDVDHAQLAPELARALLPMHDTPLGPAAAALTL RQIGVRDRLRELDFEMPLAGGDLRGRSPDVSLADVGELLASHLPGDDPLS PYADRLGSAGLGDQPLRGYLAGSIDVVLRLPGQRYLVVDYKTNHLGDTAA DYGFERLTEAMLHSDYPLQALLYVVVLHRFLRWRQRDYAPARHLGGVLYL FVRGMCGAATPVTAGHPAGVFTWNPPTALVVALSDLLDRGRLQS >MT0659 recC, exodeoxyribonuclease V, gamma subunit MALHLHRAERTDLLADGLGALLADPQPDPFAQELVLVAARGVERWLSQRL SLVLGCGPGRADGVCAGIAFRNPQSLIAEITGTLDDDPWSPEALAWPLLA VIDASLDEPWCRTLASHLGHFATTDAEAELRRGRRYSVARRLAGLFASYA RQRPGLLAAWLDGDLGELPGDLAWQPPLWRALVTTVGADPPHVRHDKTIA RLRDGPADLPARLSLFGHTRLACTDVQLLDALAVHHDLHLWLPHPSDELW RALAGFQGADGLLPRRQDTSRRAAQHPLLETLGRDVRELQRALPAARATD EFLGATTKPDTLLGWLQADIAGNAPRPAGRSLSDADRSVQVHACHGPARQ IDVLREVLLGLLEDDPTLQPRDIVVMCPDIDTYAPLIVAGFGLGEVAGDC HPAHRLRVRLADRALTQTNPLLSVAAELLTIAETRATASQLLNLAQAAPV RAKFGFADDDLDTITTWVRESNIRWGFDPTHRRRYGLDTVVHNTWRLGLD RILTGVAMSEDSQAWLDTALPLDDVGSNRVELAGRLAEFVERLHHVVGGL SGARPLVAWLDALATGIDLLTACNDGWQRAQVQREFADVLARAGSRAAPL LRLPDVRALLDAQLAGRPTRANFRTGTLTVCTMVPMRSVPHRVVCLVGLD DGVFPRLSHPDGDDVLAREPMTGERDIRSEDRQLLLDAIGAATQTLVITY TGADERTGQPRPPAVPLAELLDALDQTTSAPVRERILVTHPLQPFDRKNV TPGALLGAKPFTFDPAALAAAQAAAGKRCPPTAFISGRLPAPPAADVTLA DLLDFFKDPVKGFFRALDYTLPWDVDTVEDSIPVQVDALAEWTVGERMLR DMLRGLHPDDAAHSEWRRGTLPPGRLGVRRAKEIRNRARDLAAAALAHRD GHGQAHDVDVDLGDGRRLSGTVTPVFGGRTVSVTYSKLAPKHVLPAWIGL VTLAAQEPGREWSALCIGRSKTRNHIARRLFVPPPDPVAVLRELVLLYDA GRREPLPLPLKTSCAWAQARRDGQDPYPPARECWQTNRFRPGDDDAPAHV RAWGPRAPFEVLLGKPRAGEEVAGEETRLGALAARLWLPLLAAEGSV >MT0657 recD, exodeoxyribonuclease V, alpha subunit MVRAFNQAGVLDVSDVHVAQRLCALAGESDERVALAVAVAVRALRAGSVC VDLLSIARVAGHDDLPWPDPADWLAAVRASPLLADPPVLHLYDDRLLYLD RYWREEEQVCADLLALLTSRRPAGVPDLRRLFPTGFDEQRRAAEIALSQG VTVLTGGPGTGKTTTVARLLALVAEQAELAGEPRPRIALAAPTGKAAARL AEAVRREMAKLDATDRARLGDLHAVTLHRLLGAKPGARFRQDRQNRLPHN VIVVDETSMVSLTLMARLAEAVRPGARLILVGDADQLASVEAGAVLADLV DGFSVRDDALVAQLRTSHRFGKVIGTLAEAIRAGDGDAVLGLLRSGEERI EFVDDEDPAPRLRAVLVPHALRLREAALLGASDVALATLDEHRLLCAHRD GPTGVLHWNRRVQAWLAEETGQPPWTPWYAGRPLLVTANDYGLRVYNGDT GVVLAGPTGLRAVISGASGPLDVATGRLGDVETMHAMTIHKSQGSQVDEV TVLMPQEDSRLLTRELLYTAVTRAKRKVRVVGSEASVRAAIARRAVRASG LRMRLQSTGCG >MT0003 recF, recF protein MYVRHLGLRDFRSWACVDLELHPGRTVFVGPNGYGKTNLIEALWYSTTLG SHRVSADLPLIRVGTDRAVISTIVVNDGRECAVDLEIATGRVNKARLNRS SVRSTRDVVGVLRAVLFAPEDLGLVRGDPADRRRYLDDLAIVRRPAIAAV RAEYERVLRQRTALLKSVPGARYRGDRGVFDTLEVWDSRLAEHGAELVAA RIDLVNQLAPEVKKAYQLLAPESRSASIGYRASMDVTGPSEQSDTDRQLL AARLLAALAARRDAELERGVCLVGPHRDDLILRLGDQPAKGFASHGEAWS LAVALRLAAYQLLRVDGGEPVLLLDDVFAELDVMRRRALATAAESAEQVL VTAAVLEDIPAGWDARRVHIDVRADDTGSMSVVLP >MT3051 recG, ATP-dependent DNA helicase RecG MASLSDRLDRVLGATAADALDEQFGMRTVDDLLRHYPRSYVEGAARVGIG DARPEAGEHITIVDVITDTYSFPMKKKPNRKCLRITVGGGRNKVTATFFN ADYIMRDLTKHTKVMLSGEVGYYKGAMQLTHPAFLILDSPDGKNHGTRSL KSIADASKAISGELVVEEFERRFFPIYPASTKVQSWDIFKCVRQVLDVLD RVDDPLPAELRAKHGLIPEDEALRAIHLAESQSLRERARERLTFDEAVGL QWALVARRHGELSESGPSAAWKSNGLAAELLRRLPFELTAGQREVLDVLS DGLAANRPLNRLLQGEVGSGKTIVAVLAMLQMVDAGYQCALLAPTEVLAA QHLRSIRDVLGPLAMGGQLGGAENATRVALLTGSMTAGQKKQVRAEIASG QVGIVIGTHALLQEAVDFHNLGMVVVDEQHRFGVEQRDQLRAKAPAGITP HLLVMTATPIPRTVALTVYGDLETSTLRELPLGRQPIATNVIFVKDKPAW LDRAWRRIIEEAAAGRQAYVVAPRIDESDDTDVQGGVRPSATAEGLFSRL RSAELAELRLALMHGRLSADDKDAAMAAFRAGEVDVLVCTTVIEVGVDVP NATVMLVMDADRFGISQLHQLRGRIGRGEHPSVCLLASWVPPDTPAGQRL RAVAGTMDGFALADLDLKERKEGDVLGRNQSGKAITLRLLSLAEHEEYIV AARDFCIEAYKNPTDPALALMAARFTSTDRIEYLDKS >MT1735 recN, DNA repair protein RecN MLTELRIESLGAISVATAEFDRGFTVLTGETGTGKTMVVTGLHLLGGARA DATRVRSGADRAVVEGRFTTTDLDDATVAGLQAVLDSSGAERDEDGSVIA LRSISRDGPSRAYLGGRGVPAKSLSGFTNELLTLHGQNDQLRLMRPDEQR GALDRFAAAGEAVQRYRKLRDAWLTARRDLVDRRNRARELAQEADRLKFA LNEIDTVDPQPGEDVALVADIARLSELDTLREAATTARATLCGTPDADAF DRGAVDSLGRARAALQSSDDAALRGLAEQVGEALTVVVDAVAELGAYLDE LPADASALDAKLARQAQLRTLTRKYAADIDGVLRWADEARARLAQLDVSE EGLAALERRTGELAHELGQAAVDLSTIRRKAAKRLAKEVSAELSALAMAD AEFTIGVTTGLADHGDPVALALASGELARAGADGVDAVEFGFVAHRGMTV LPLAKSASGGELSRVMLSLEVVLATSRKQAAGTTMVFDEIDAGVGGWAAV QIGRRLARLARTHQVIVVTHLPQVAAYADVHLMVQRTGRDGASGVRRLTS EDRVAELARMLAGLGDSDSGRAHARELLETAQNDELT >MT3818 recR, recR protein MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPSDIDRLTGVLAKVR DGVRFCAVCGNVSDNERCRICSDIRRDASVVCIVEEPKDIQAVERTREFR GRYHVLGGALDPLSGIGPDQLRIRELLSRIGERVDDVDVTEVIIATDPNT EGEATATYLVRMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRR VLA >MT2670 ruvA, Holliday junction DNA helicase RuvA MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEARLITAM IVREDSMTLYGFPDGETRDLFLTLLSVSGVGPRLAMAALAVHDAPALRQV LADGNVAALTRVPGIGKRGAERMVLELRDKVGVAATGGALSTNGHAVRSP VVEALVGLGFAAKQAEEATDTVLAANHDATTSSALRSALSLLGKAR >MT2669 ruvB, Holliday junction DNA helicase RuvB MTERSDRDVSPALTVGEGDIDVSLRPRSLREFIGQPRVREQLQLVIEGAK NRGGTPDHILLSGPPGLGKTSLAMIIAAELGSSLRVTSGPALERAGDLAA MLSNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIP LEVAPFTLVGATTRSGALTGPLRDRFGFTAHMDFYEPAELERVLARSAGI LGIELGADAGAEIARRSRGTPRIANRLLRRVRDFAEVRADGVITRDVAKA ALEVYDVDELGLDRLDRAVLSALTRSFGGGPVGVSTLAVAVGEEAATVEE VCEPFLVRAGMVARTPRGRVATALAWTHLGMTPPVGASQPGLFE >MT2671 ruvC, Holliday junction nuclease MRVMGVDPGLTRCGLSLIESGRGRQLTALDVDVVRTPSDAALAQRLLAIS DAVEHWLDTHHPEVVAIERVFSQLNVTTVMGTAQAGGVIALAAAKRGVDV HFHTPSEVKAAVTGNGSADKAQVTAMVTKILALQAKPTPADAADALALAI CHCWRAPTIARMAEATSRAEARAAQQRHAYLAKLKAAR >MT0060 ssb, single-strand binding protein MAGDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQTGEWKDG EALFLRCNIWREAAENVAESLTRGARVIVSGRLKQRSFETREGEKRTVIE VEVDEIGPSLRYATAKVNKASRSGGFGSGSRPAPAQTSSASGDDPWGSAP ASGSFGGGDDEPPF >MT1248 tag, DNA-3-methyladenine glycosidase I MSGDGLVRCPWAEVRPGPDAQLYRDYHDNEWGRPLYGRVALFERMSLEAF QSGLSWLIILRKRENFRRAFSGFDIDKIARYTDTDVRRLLADDGIVRNRA KIEATIANARAAADLGSSEDLSELLWSFAPPPRPRPVDGSEIPSVSTESK AMSRELKRRGFRFVGPTTAYALMQATGMVDDHIQACWVPTERPFDQPGCP MAAR >MT3053 ung, uracil-DNA glycosylase MTARPLSELVERGWAAALEPVADQVAHMGQFLRAEIAAGRRYLPAGSNVL RAFTFPFDNVRVLIVGQDPYPTPGHAVGLSFSVAPDVRPWPRSLANIFDE YTADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEAVTE CAIRALAARAAPLVAILWGRDASTLKPMLAAGNCVAIESPHPSPLSASRG FFGSRPFSRANELLVGMGAEPIDWRLP >MT1675 uvrA, excinuclease ABC, subunit A MSFSERDSVADRLIVKGAREHNLRSVDLDLPRDALIVFTGLSGSGKSSLA FDTIFAEGQRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTN RNPRSTVGTITEVYDYLRLLYARAGTPHCPTCGERVARQTPQQIVDQVLA MPEGTRFLVLAPVVRTRKGEFADLFDKLNAQGYSRVRVDGVVHPLTDPPK LKKQEKHDIEVVVDRLTVKAAAKRRLTDSVETALNLADGIVVLEFVDHEL GAPHREQRFSEKLACPNGHALAVDDLEPRSFSFNSPYGACPECSGLGIRK EVDPELVVPDPDRTLAQGAVAPWSNGHTAEYFTRMMAGLGEALGFDVDTP WRKLPAKARKAILEGADEQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMS QTESEQMKERYEGFMRDVPCPVCAGTRLKPEILAVTLAGESKGEHGAKSI AEVCELSIADCADFLNALTLGPREQAIAGQVLKEIRSRLGFLLDVGLEYL SLSRAAATLSGGEAQRIRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLI ETLTRLRDLGNTLIVVEHDEDTIEHADWIVDIGPGAGEHGGRIVHSGPYD ELLRNKDSITGAYLSGRESIEIPAIRRSVDPRRQLTVVGAREHNLRGIDV SFPLGVLTSVTGVSGSGKSTLVNDILAAVLANRLNGARQVPGRHTRVTGL DYLDKLVRVDQSPIGRTPRSNPATYTGVFDKIRTLFAATTEAKVRGYQPG RFSFNVKGGRCEACTGDGTIKIEMNFLPDVYVPCEVCQGARYNRETLEVH YKGKTVSEVLDMSIEEAAEFFEPIAGVHRYLRTLVDVGLGYVRLGQPAPT LSGGEAQRVKLASELQKRSTGRTVYILDEPTTGLHFDDIRKLLNVINGLV DKGNTVIVIEHNLDVIKTSDWIIDLGPEGGAGGGTVVAQGTPEDVAAVPA SYTGKFLAEVVGGGASAATSRSNRRRNVSA >MT1669 uvrB, excinuclease ABC, subunit B MAFCARSPHGVSAGGSRLVGVAFATEHPVVAHSEYRAVEEIVRAGGHFEV VSPHAPAGDQPAAIDELERRINAGERDVVLLGATGTGKSATTAWLIERLQ RPTLVMAPNKTLAAQLANELREMLPHNAVEYFVSYYDYYQPEAYIAQTDT YIEKDSSINDDVERLRHSATSALLSRRDVVVVASVSCIYGLGTPQSYLDR SVELKVGEEVPRDGLLRLLVDVQYTRNDMSFTRGSFRVRGDTVEIIPSYE ELAVRIEFFGDEIEALYYLHPLTGEVIRQVDSLRIFPATHYVAGPERMAH AVSAIEEELAERLAELESQGKLLEAQRLRMRTNYDIEMMRQVGFCSGIEN YSRHIDGRGPGTPPATLLDYFPEDFLLVIDESHVTVPQIGGMYEGDISRK RNLVEYGFRLPSACDNRPLTWEEFADRIGQTVYLSATPGPYELSQTGGEF VEQVIRPTGLVDPKVVVKPTKGQIDDLIGEIRTRADADQRVLVTTLTKKM AEDLTDYLLEMGIRVRYLHSEVDTLRRVELLRQLRLGDYDVLVGINLLRE GLDLPEVSLVAILDADKEGFLRSSRSLIQTIGRAARNVSGEVHMYADKIT DSMREAIDETERRRAKQIAYNEANGIDPQPLRKKIADILDQVYREADDTA VVEVGGSGRNASRGRRAQGEPGRAVSAGVFEGRDTSAMPRAELADLIKDL TAQMMAAARDLQFELAARFRDEIADLKRELRGMDAAGLK >MT1463 uvrC, excinuclease ABC, subunit C MPDPATYRPAPGSIPVEPGVYRFRDQHGRVIYVGKAKSLRSRLTSYFADV ASLAPRTRQLVTTAAKVEWTVVGTEVEALQLEYTWIKEFDPRFNVRYRDD KSYPVLAVTLGEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRV FPARTCSAGVFKRHRQIDRPCLLGYIDKCSAPCIGRVDAAQHRQIVADFC DFLSGKTDRFARALEQQMNAAAEQLDFERAARLRDDLSALKRAMEKQAVV LGDGTDADVVAFADDELEAAVQVFHVRGGRVRGQRGWIVEKPGEPGDSGI QLVEQFLTQFYGDQAALDDAADESANPVPREVLVPCLPSNAEELASWLSG LRGSRVVLRVPRRGDKRALAETVHRNAEDALQQHKLKRASDFNARSAALQ SIQDSLGLADAPLRIECVDVSHVQGTDVVGSLVVFEDGLPRKSDYRHFGI REAAGQGRSDDVACIAEVTRRRFLRHLRDQSDPDLLSPERKSRRFAYPPN LYVVDGGAPQVNAASAVIDELGVTDVAVIGLAKRLEEVWVPSEPDPIIMP RNSEGLYLLQRVRDEAHRFAITYHRSKRSTRMTASALDSVPGLGEHRRKA LVTHFGSIARLKEATVDEITAVPGIGVATATAVHDALRPDSSGAAR >MT2962 xerC, tyrosine recombinase XerC MRRVDSGSRRHACDCGGVQAILDEFDEYLALQCGRSVHTRRAYLGDLRSL FAFLADRGSSLDALTLSVLRSWLAATAGAGAARTTLARRTSAVKAFTAWA VRRGLLAGDPAARLQVPKARRTLPAVLRQDQALRAMAAAESGAEQGDPLA LRDRLIVELLYATGIRVSELCGLDVDDIDTGHRLVRVLGKGNKQRTVPFG QPAADALHAWLVDGRRALVTAESGHALLLGARGRRLDVRQARTAVHQTVA AVDGAPDMGPHGLRHSAATHLLEGGADLRVVQELLGHSSLATTQLYTHVA VARLRAVHERAHPRA >MT1740 xerD, integrase/recombinase XerD MKTLALQLQGYLDHLTIERGVAANTLSSYRRDLRRYSKHLEERGITDLAK VGEHDVSEFLVALRRGDPDSGTAALSAVSAARALIAVRGLHRFAAAEGLA ELDVARAVRPPTPSRRLPKSLTIDEVLSLLEGAGGDKPSDGPLTLRNRAV LELLYSTGARISEAVGLDLDDIDTHARSVLLRGKGGKQRLVPVGRPAVHA LDAYLVRGRPDLARRGRGTAAIFLNARGGRLSRQSAWQVLQDAAERAGIT AGVSPHMLRHSFATHLLEGGADVRVVQELLGHASVTTTQIYTLVTVHALR EVWAGAHPRAR >MT1139 xseA, exodeoxyribonuclease, large subunit MTQNSAENPFPVRAVAIRVAGWIDKLGAVWVEGQLAQITMRPDAKTVFMV LRDPAADMSLTVTCSRDLVLSAPVKLAEGVQVVVCGKPSFYTGRGTFSLR LSEIRAVGIGELLARIDRLRRLLDAEGLFDPRLKRPIPYLPNMIGLITGR ASAAERDVTTVASARWPAARFAVRNVAVQGPNAVGQIVEALRELDRDPDV DVIVLARGGGSVEDLLPFSDETLCRAIAACRTPVVSAVGHEPDNPLCDLV VDLRAATPTDAAKKVVPDTAAEQRLIDDLRRRSAQALRNWVSREQRAVAQ LRSRPVLADPMTMVSVRAEEVHRARSTLRRNLTLMVAAETERIGHLAARL ATLGPAATLARGYAIVQTVAQTGPEGGSEPQVLRSVHDAPEGTKLRVRVA DGALAAVSEGQTNGL >MT1138 xseB, exodeoxyribonuclease, small subunit MKDKPMVCDPNGDDTGRTHATVPVSQLGYEACRDELMEVVRLLEQGGLDL DASLRLWERGEQLAKRCEEHLAGARQRVSDVLAGDEAQNG >MT0442 xth, exodeoxyribonuclease III MPDGTIDGGHPQRPASPRLRSPLLRLATWNVNSIRTRLDRVLDWLGRADV DVLAMQETKCPDGQFPALPLFELGYDVAHVGFDQWNGVAIASRVGLDDVR VGFDGQPSWSGKPEVAATTEARALGATCGGIRVWSLYVPNGRALDDPHYT YKLDWLAALRDTAEGWLRDDPAAPIALMGDWNIAPTDDDVWSTEFFAGCT HVSEPERKAFNAIVDAQFTDVVRPFTPGPGVYTYWDYTQLRFPKKQGMRI DFILGSPALAARVMDAQIVREERKGKAPSDHAPVLVDLHAG