Gene list
Applied filters:
COG category: Replication, recombination and repair
Organism: Nitrosomonas europaea ATCC 19718, ATCC 19718
Gene type: CDS
Number of genes found: 243
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Nitrosomonas europaea ATCC 19718, ATCC 19718 >NE0271 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDHLISVNRP >NE1807 putative ATP-dependent RNA helicase protein MSFENLNLHPAIVKAVLAAGYTAPTPIQQQAIPDLIAGHDVMASAQTGTG KTAAFMLPALHRLATPAQIRGRGPRILVLTPTRELALQVSDAASKYGKFL PRINVVSILGGMPYPLQNKLLSQTVDVLVATPGRLIDHIERGRIDFSRLE MLVLDEADRMLDMGFIQDVERIALSTPATRQTLLFSATLDVAIEKIATRL LKAPKRIQVAAQHTKLDHIEQRMHYVDDLTHKNRLLDHLLRDTTIKQAIV FTATKRDADSLADNLSSQGHKAAAMHGDMTQRERTRTLTGLRQGRLKILV ATDVAARGIDIADITHVINFDLPKFAEDYVHRIGRTGRAGASGIAVSFAS GKDVAHLKRIERFTGNRFEFHVIPGIEPRTKPRFGRSDDKPGRRPSSSAA AHKTRRSWSDNPNTRTASPGHRGDKDAGFGQPFGRETRKRPFRDSKFNSA DRFARTE >NE1174 Uncharacterized protein family UPF0020 MTERFFAPCPRGLETVLAAELERLDATSIQASPGGVGFHGNWQTCYRANL ESRIASRILWQIAKDQYRSEADIYDLTHSLPWQDWFEPRLSIKVNLAAIK CPLRSLDFVTLRIKDAVCDKFRAIHGKRPDIDTVAPDMRIHGFLNAQEFT LYLDTSGEALFKRGLRQTQGEAPLRENLAAGILALTGWQPGTPLLDPMCG SGTLLLEAAQIACRIAPGSGRQFAFEQLKLFDARSWKKLKQTATERQHER TFQSIYGSDLYGSALAHTRNNLAAAGLAECVTLKQANVLEISAPAETGIL VSNPPYGVRIGDHQMLAEFYPRLGDVLKQRFSGWRAFLLTADPLLAKSIR LTPSRRTPLFNGALECRLLEYRLVAGSMRREKQPSSESSTNQPIT >NE2536 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNI >NE0447 conserved hypothetical protein MLKQNETKTSMILNYRWLYDTVRKRFESDEAMEAFLPKALTPATLKQKGD DRYLSAMSQRVFQAGMQHSVVNAKWPAFEEAFWGFVPETMVMLSPEQIEG YMKNSSIIRHYTKLQTIPRNAQFILDIRQEQGCSFGEFIADWPSADIIGL WRLLAKRGARLGGRSSAGFLRLAGKDTFLLTSDVTARLIAAGIIDHEPTG QRDRQIIQDAFNELQQDSGRPLCQLSAMLSLSINPRF >NE1843 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNNYRARAEYATATATLPQAENH >NE1552 Transposase IS4 family MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA KNTSSFLAVVQIRCIALWADIL >NE2190 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG >NE1990 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE2309 Adenine specific DNA methylase Mod MASNQKLELTWIGKEKRAKLEPRILLEDPEKSYHAKQRVSESDVFDNRLI FGDNLLALKALEQEFAGEVKCVFIDPPYNTGSAFTHYDDGLEHSIWLGLM RDRLEIIKRLLSNDGSLWITIDDNECHYLKVLCDEIFGRANYKTTITWQR KYSVSNNFQGIASICDFVLVYSKSEAFKNNLLPRSEESAARYNNPDNDPR GPWKAVDYLNQATPEKRPNLCYDIVNPNTGVVIKNTKKAWKYDPTTHQRH VDEKRIWWGRDGGNSVPALKLFLSEVRDGMTPHNWWSHEEVGHTDESKKE MIGLYGPRDVFDTPKPERLLKRILEIATNPGDLVLDSFAGSGTTGAVAHK MGRRWIMVELGEHCHTHIIPRLKKVIDGEDPGGITNAVDWQGGGGFRYYS LAPSLIVEDRWGNPVINPEYNATQLSEALCKLEGFTYAPSETRWWQQGHS SERDFLYVTTQNLSASQLQALSDEVGTEQSLLVCCSAFHGISAAAAAARW PNLTLKKIPKMVLARCEWGHDDYSLNVANLPLAKPEPETPASQPAPKKKG KKTLPMPDLFGDVEDGA >NE2101 hypothetical protein MSSSPSHPFPSLQSRIVASFVSTSSTIIVARLSTLRPLRDLTMVGWSMST RMKAKLVCDALQMAVWQRQP >NE1270 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE1316 NUDIX hydrolase MIDRNGYRANVGIILLNSQNQVFWGKRARQDSWQFPQGGIKSGETPTEAM YRELAEETGLQPVHVEILGRTREWLRYDVPACWTRRDWRKNYRGQKQIWF LLRLLGRDSDVSLETCAHPEFDAWRWNQYWVELESVVEFKRQVYRQALTE LSRLLDHEAGLGNDRAYREPLEPVEKNRKKSSDTRQS >NE1558 putative transposase MPRTHGYAPIGKRCHGKCNWHARGRINVIGALIGKCLLTVGLFKNNIDAD TFLGWTIHDLLPKLPPASIVVMDNATFRKRQDIQNVITRGGHTLEYLPAY SPDLNPIEHKWAQTKAVRKQQNQTVEQLFKIESFYVT >NE0342 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPNSSGRGRCQNRGSVFVIPRPGR >NE2515 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE2156 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2533 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0112 hypothetical protein MLGTANLPSRNKYKAKATILHMNRNLYLVAYDICNPRRLRQVCRYLTGYK VSGQKSVFEIWVTPTELHTIRTELDKLMDTQADRLHILSLDPRMKPRCYG NASTFTVQHFCIV >NE0934 Integrase, catalytic core MNRTLKEATVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYI IKCWQNEPERFIINPYHHKVGLNSYSVFRCLDTVRLSAAHVGFFTEMREI SC >NE2446 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2318 hypothetical protein MKLLFFCLLLVSMSVPAVAGNEKQIFELEAAIMQQQQEQQILFQRFQMLQ ELRRHEITQIEQALPTGSDVIINGEAPKYEDVARQRKERAERVHRYTDEL DELYMRYQETENERRALIEQLNGLKPGQDVSAEPKK >NE0451 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1845 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0709 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE1178 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2414 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1516 Uncharacterized protein family UPF0006 MFVDSHCHLDFPDLASSLDELLVNMQISQVTHALCVGVNLENFPRVLALA ESHSNLFASVGVHPDYEDTAEPAVEQLLKLADHAKVVALGETGLDYFRLK GDLEWQRERFRRHIRAARRCGKPLIIHTRAAAEDTLRIMEEEGAASVGGV MHCFTESWEIARRALDLNFYISFSGIVTFKNAAIIKEVAKKVPADRMLIE TDSPYLAPVPHRGETNQPAFVRHVAEEIARLRETTLAEIAAVTTNNFFNL FKVV >NE0111 Protein of unknown function DUF48 MTSLFVDRRGVELGLESGAIVFRENGERIGTVPIAPLTRVFLRGDVNLPA ALLGKLGERGVGVVILSGRTSRPSLLLARPHNDAARRVAQVRLSLDEPAS LIIARELIERKLTRQIEWFTELRENDIQARYELSRALRGLEEHRARLGNI NNAASLRGIEGSAAARYFTGLQAVIPGSLHFHGRNRRPPRDPFNALLSLT YTLLHSEITIALHGAGFDPYIGFYHRLDFGRESLASDLLEPLRPLADRFA FALVHRRVLDKDHFTTTESGCLLGKAGRVRYYAAYEEHSEILRKGINQEI EQLAEQVGSALTPESGNTPDHDSGDWE >NE0662 Transposase IS4 family MCGTPARCVAPLAALFEMLKGQCDGISIADATAIAVCDNRRIARHRVSAD SARRGKTSMGWFYGFKLHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGL FGQLFADKGYLAQWLTETLDRQNLQLITPFKKNMKPAPRTGFEKAILRRR SLIETVFDELKNLCQIKHTRHRSFFNFVVNLMAGIVAYCLSDNKPTLNLT RVNTLVKA >NE2010 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE2109 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2498 hypothetical protein MPKMDVKGIAVTVYSENAMDFISLTDMLRAKDGDFFISDWLRNRNTVEFL GIWEQVHNPNFNYGEFATIRSQAGLNSYKISVKEWVARTHAIGLVAKAGR YGGTYAHKDIAFEFGMWISAEFKIYLIKEFQRLKEAEQQQLGWDIRRNLT KINYRIHTDAIQTNLIPPALTQSQISLIYASEADLLNMALFGKTAKQWRE ENPNNKGNIRDEANVSQLVCLANLETLNAHFIHQGLPQVERLKILNQTAI HQMKLLLADRSLKQLDGN >NE1351 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2098 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG >NE1367 possible ISA0963-4, putative transposase MLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQLYLALNDIEHSK TKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEELQHDLDDWMAYY NSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0814 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1631 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDHLISVNRP >NE1788 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE1925 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1667 conserved hypothetical protein MPDMTSASSGTVLAFDFGKRRIGVAIGEHELRMAHPLTTIDQSMTRPRFE KIAELIEAWQPVLLVVGLSVHADGTEHEITRLCRRFARRLEGRFRIPVAL ADERYTTVIARSVLEEVGVTGKKQRPMLDQIAAQHILQTYFDLSHAAS >NE2411 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNNYPQQNGMVKWVTGH >NE1995 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1539 hypothetical protein MLFWIKTIQTAAWFYLLMFALLAGSAHAAELKPADQTGFLIVAADRGFVG NEEIRDAFASFSANHPAALVFVTDERTRQTLQSGLASLHQQNIGRIVVLP LFISAAEPRYQLIRTLVTEENQTIPVTFTKPYGESYFAVEALATRLRGMQ HTAQQHLLVVGYGAQNDTHRRAMYDDWMRIVKQASQGVSFRSINSLILLE AQKDEEPESYGNKTKQQLATALSSLGTATKNNKNQVIAFALGPKYDSMMS LESRLERLLPENAALNHFEIEPQHLAMWMEREASRNLPLAEEDTGVILFA HGSDFHWNENLRVAVEPLMKRYKIEFAFSMADPYTIERALHKLEQRGAKA AIVVSAFASRSSYRNEIGYLAGLDIENQDDHIHDNNSGHGSHGGHGGHAK SSTPVPRILTSLPVIWTGGYEDNPLFASALFDRVLALSKDPARETVILTA HGTQDDRKNDEWLEKLNSIASQMHDQGGQKFKAFKVATWREDWPDKRAPW VKKVRAMVTEASKQGDRVIVIPTRTTSVGPEKRFLAGLEFELGEGFAPHP LFTQWVDEQIRQGINLHKEALGR >NE2108 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0806 hypothetical protein MKYSIRFFAVIFAIFLTACSTMYYSGLEKIGIPKRDVLVYRVEKARDTQE ETREQFKSALEQFSAATNFKGGDLEGIYKKLNGEYEASVNKAKEVRSRIE DIENVSAALFREWEQEITQYSNPALKRSSQDRLTETRSYYKQLIAAMKNA ESRIQPVLTVFNDQVMYLKHNLNARAIASLKGELKTLQSNVSTLVAAMEK SINEANTFISNMEKN >NE2274 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2011 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE2012 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNN >NE1816 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0749 Transposase IS911 HTH and LZ region MNKQNKQNKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTL LEWVKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFA QAELDRVLKK >NE1107 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE0708 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0134 transposase MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL SSPIVPLQAALGRENQFQLLTNT >NE1853 hypothetical protein MSDQYTQNESDQSKDKVEWTKPASLLNILGKKFAPIADLQHKQLPSWSLL VFLGILLLVFIWKQIAVNQAESRLEKGQAQIAQQLEEKSKELVKKAREYA DSQYKKEEERFGQVLAWAVRGELIRNNLDQIDQYLTELVKTKDTERVVLI SDEGKLLVSTDKRLESEEASSLYPKDVLGLQTITIKSDVDNRKLLVVPVM GLNKRLATIVISYNPPSLLN >NE0120 hypothetical protein MTPLRAILRLRSPLGTPLAGDTLFGQLCHAVREMLGEEKLEALLDGYTAG SPWLVVSDGFPSGYLPRPTVAAALQANSEEDPKKRKEAKGKRWIPHSQIA QPLRQLLSSAVSDEEVYGKQSRPIQAAAFHNTLNRLTGTTGTGEFAPYTQ SQIFYQRDQRMDLWCVLDEDRLPRETLHQLLEYIGSVGYGRDASIGLGKF AVEQIEEAALFKQTHPNANAYWTLAPCSPQGQGFKTSRSYWQVLTRFGRH GGTLALGANPFKQPLLLAATGAIFAPTNNMAQIHFIGSGLAKVSLMQTAA VHQGYAPVLGICMEAI >NE2521 conserved hypothetical protein MTSVLSPNTQAILLLTAPLIAGRGTASSDLLSPGEYKRLARHLREIQRQP ADLLSPDAAEILRACQPVIDEGRLQKLLGRGFLLSQVIERWQARAIWVVS RADAEYPRRLKARLREDAPAVLYGCGDMALLETGGLAVVGSRHVDDALID YTMTVGRLAARAGRTLVSGGAKGIDQAAMRGALEAGGKVCGVLSDSLEKT TMNREHRNLLLDGQLVLISPYDPSAGFNVGHAMQRNKLIYALADTSLVVS SDLNKGGTWAGAVEQLDKLKFVPVFIRSTGESSAGLDGLRKKGALAWPNP QDVDSFKDVFNVAMPTPTASPQVGFALFSNEEPTSVDAKPTVPVPPDTAP APQAESEPSAPVDVVSDAQPPAPALEEQPSVTPEAIPPIDDAMESAQPES SPAEVLFAAVRAAIQQLLSAPMKDADVAAALDVSNAQAKAWLQRLVDEGV LEKQKKPAGYIVKQKRLFE >NE1553 possible transposase MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG APWRDLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS HCKVHPHASGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL IARRLAA >NE2200 transposase MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE0711 Uncharacterised protein family UPF0102 MSSAGNKGSDAEQCATIFLQQQKLTLLERNYRCRFGEIDLIMREGDTVIF VEVRMRSSDRFGGAAASITAAKQLKLTRAARHYLAGCEGDFPYRFDAILI SGERENEIEWIRNAFDES >NE1132 transposase MPDLCQGLFGQLFADKGYLAQWLTEALDQQNLQLITPLRKNMRPVPRTRF EKVILRRRSLIETVFDELKNLCQIEHTRHRSLFNFIVNLMAGIVAYCLSD NKPTLNLTRVNSLAKA >NE0197 conserved hypothetical protein MIPNPDPESSINRNQVIISGTITDLASPRYTPAGLMIAEFKLSHCSNQQE AGIQRRIEFEFEAIAIAETAEKIIRIGSGSNVEITGFIAKKNRLSNQLVL HVRDTRII >NE1585 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0719 Uncharacterised protein family UPF0102 MSSAGNKGSDAEQCAAAFLQQQKLTLLEKNYRCRFGEIDLIMREDDTVVF VEVRMRSSDRFGGAAASITAAKQSRLIRTARHYLAGHEGDFPCRFDAVLI SGNRENEIEWIRNAFDES >NE0272 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR >NE1940 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNN >NE1789 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2097 Integrase, catalytic core MPTGQNNRTTVVRKNATCPRDLVNRMFHANRPNQLWVSDFTYVSTWQGWL YVAFVIDVFARRIVGWRVSSTMSTDFVLDALEQALYDRRPADTLIHHSDR GSQYVSIRYTERLAQAGIEPSVGSRGDSYDNALAETINGLYKAELIHRRA PWKTRAAVELATLEWVAWYNHQRLLGSIGYIPPAQAEENYRQTQDNKTLM DILL >NE2504 Transposase IS4 family MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA >NE1740 Transposase IS4 family MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA >NE2273 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE0981 HhH-GPD MALVFWEQAVNDLSARDPVMHRIIQCYSDSMPEERGNAFATLARAIVGQQ ISVKAAASVWQKVTTLIPEITPEALIATEIDLLRTCGLSARKVDYLRDLS RHFLEGTLVTVNWHDLDDETLIRKLVEVKGIGRWTAEMFLIFHLHRPDVL PLDDIGLQRAVSLHYNASQPVAKQAIRTIAESWQPWRSVATWYLWRSLDP IPVIY >NE0452 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2311 possible helicase (Snf2/Rad54 family) MVQQLLTPHQSQYIAWQLTRRAAKDSVESLASTLVDSQVDLNPHQVDAAL FACRNPLSRGVILADEVGLGKTIEAGLVISQHWAERRRKMLIIVPANLRK QWHQELQDKFNLQGLVLEAKNYNAMRKEGVTQPFLHAGGPIICSYQFAKA KADDLRRIHWDLVVMDEAHRLRNVYKNGNVIARTIRDALEHVDAKVLLTA TPLQNTLLELYGLVSMIDERVFGDLDSFRTQFSGVRTEQSNRALRERLTP LCKRTLRRQVQQYVPYTARIAIVEEFTPSQEEQQLSALVADYLRRPNLKA LPEGQRQLISLVLWKLLASSSHAIAGALETMANRLQGQLDELPDVPDLTE SLDDDYEGLDETADEWNGATANDADASANERAAIADEAAELRRFKELATS IRQNAKGQALLTALDKAFAELERLGASKKAIIFTESKRTQNYLLSLLAET PYGIVLFNGTNTDARAQAIYKDWLQRHEGSDRITGSKTADTRAALVEHFK ERGTIMIATEAGAEGINLQFCSLVINYDLPWNPQRIEQRIGRCHRYGQKH DVVVVNFVDRSNEADARVYQLLSQKFKLFEGVFGASDEVLGAIGSGVDFE RRIAAIYQNCREPEEIRSRFEDLQRELSSEIDEAMLRTRQLLLENFDEEV QEKLRIHSQDSQAVLNKYERLLMDLSRTELRDHARFDTAEEVNGFVLHSL PDGLGLATGSREQAVMAGRYELPRRSGDAHLYRMGHPLAEWAIERAKARD LQAPARLAFDYAAYGKRLVSLEKWRGQCGWLSVTLLSVETLNDQEQHLVV SACTQAGEALPEDDPEKLLRLPAQVEGDAHLQVCAELVANVESRKSVLLR GINQRNLGYFEQEVQKLDTWADDLKLGLEQEIKAIDGEIKEVRRTAAASP TLEEKLAHQKRQRELETRRSKLRRDLFARQDEVEEQRNKLIGELEEQLKQ QVAERMLFTVEWELT >NE0931 conserved hypothetical protein MTVSSPFQPDCRDCPRLAQHLDQVKTDYPDYHARPVAPFGDSSAKLLIVG LAPGLHGANRTGRPFTGDYAGILLYRTLHKFGFASHDESVSADDPLHLTD CRITNAVKCLPPANKPQPAEIRQCNAFLAVELDNFARNGGQALLALGTIA HQAVLMALGCRNADFPFSHGAIHRVTEELKLYDSYHCSRYNTQTRRLTET MFEQIFDRICQDMAATQ >NE0830 DNA mismatch repair protein MutS family, C-terminal domain MAARCVVLLCIVENCDFRRASCDHHDFACHAAHGGLDQRFQKARSWFLLS SIQHKVAKSAKVGNTESTHVITTRTMNDTTQDTAASVWRESFILSSGKNP SGIRDTRPTADNYGVLDAKTFAAVEVDALFDEINQAQTLTGQSILYRSLA RPVTDAALLQSKQEALRELESNPDLLKVLEQYIKRIAIDEASLHHLLYGE FAGGLTTDDPRDKTGKDKLEFGGYGYRQFIDGTGFVVDLVEEAEALPMPE SDYLRTLVQTLRDFARSRTYALMHGPIYVSQGKFMTREEKPRYLLIQRFR PSMFKWPFISFFLAFVAGLLLFFQNTLNELVASYVGYGLLILVVPIIPII LQAISASDRDSVIYPLQRLFRQSPELARTIEAMGMIDELLALHRHARSIP GESVLPEIDMDGRHTLVVSGARNPLLVRTRPDYVSNDIVLDNDKHLLIVT GPNSGGKTAYCKTVVQIQLLAQAGAYVPAVQARAVPAEHIFYQIPDPGQL EEGMGRFAHELKQTREIFFNSTPRSLVVLDELAEGTTFEEKMTLSEYVLK GFHQLGATTILVTHNHELCERLQQENIGNYLQVEFVSEKPSHRLIPGISR ISHADRIASAIGFSKEDVASHLASLQE >NE0121 conserved hypothetical protein MQLDTIHKITGTLILKSGLHIGAGDSEMRIGGTDSPVVKDPLTDQPYIPG SSLKGKIRSLLEWRHGLVVATGGAPYSFKHLAQDENNSAGRDVIKLFGGA PDKAEDQLVKNIGPTRLAFWDCPLNGDWKKEAADSRHLLTTEVKSENSIN RIAGTAEHPRFIERVIAGARFDFTLTLKVLEGDDLLNTVLLGLRLLELDS LGGSGSRGYGKIKFAELKLDGTDLMEQFHAITPFNQTA >NE1348 Integrase, catalytic core MGRIYQQTFVDTYSKWAAAKLYTNKTPITSADMLNDRVLPFFAEQSMGII RILTDRGTEYCGKPENHDYQLYLALNDIEHSKTKANHPQTNGICERFHKT ILQEFYQVTFRRKIYQSIEELQHDLDDWMAYYNSVRTHQGKMCCGRTPMQ TLIDAKEIWDDKITELNN >NE1996 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0252 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2447 transposase MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE1698 conserved hypothetical protein MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA ALIEGLTVRASARQCRIDKNTSFRWRHRFLTFPAAAKANHLEGIVEADET FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL SSPIVPLQAALGRENQFQLLTNT >NE1271 Integrase, catalytic core MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1264 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2442 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1523 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0969 possible N6-adenine-specific methylase MVKADRIRIIGGQWRSRLIQFADDELLRPTPDRVRETLFNWLGQDLTGKI CLDLFAGSGALGFEAASRGAKQVTMIEQNMKAVRNLHCSIEKLGASQVKL EHVDARMFLTANSERYDVIFVDPPFKSGLLAEVLPLLPAKLEEEGVVYVE SSDKLLPDDTWSIWKQGRASHVHYCLLSLNPDG >NE2441 hypothetical protein MKKILSILSSTFILSLFMLSSVNAQVEDVHLQEAIRQTEAVVLAVDVKTM TQLVQEAERYAVEVKSTHPENEHLQEGLKHLNDVIKESQAGEPAAARKAA IVALSKFNQIERK >NE0454 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIQTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNN >NE0716 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE0561 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2023 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE0268 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2238 Transposase IS200 like MTFTTKEYQSLSHTRWDCKYHVVFIPKRRKKRIFGMLRWHLGELFHELAS HKESKIVEGHLMDDHVHMCISIPPKYAVSNVVGYLKGKSAIQIARKFGGR QKNFTGEHFWARGYFVSTVGLDDNIVRTYIRNQEDEDERYDQMKLEI >NE1061 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNS >NE2516 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2483 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNS >NE0245 hypothetical protein MALPQRQIVRAENVKIGISWQCALCDLDIYARPLPGAEVIYFGRMVTTHG RYWKDYRNSPQPTNGYETISFDVPLDLRPVVIAINFYEGEAPQGVSGEIR IAVDENTYAAPFHISATRGNRGQGVAKIIETGKASGNHSVIVDPLHIIRA R >NE2155 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1222 Transposase IS4 family MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQARYLGKDIRCFDRRPGQSIYYDRQHDRAR SSAGRLRKRGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDC TQALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRT YDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0562 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNT >NE0184 NUDIX hydrolase MTWKPNVTVAAVIEQDDKYLLVEEIPRGTAIKLNQPAGHLEPGESIIQAC SREVLEETGHSFLPEVLTGIYHWTCASNGTTYLRFTFSGQVVSFDPDRKL DTGIVRAAWFSIDEIRAKQAMHRTPLVMQCIEDYHAGKRYPLDILQYYD >NE2532 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1522 Transposase IS4 family MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA KNTSSFLAVVQIRCIALWADIL >NE1630 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR >NE0940 putative DNA transport competence protein, ComEA MYYIWPKRLKLRNPEALFLNFCGDSEMKKIFLILVIFFGFNLSVLAGVDI NTASQADLESVKGLGPVKAKAIIEYRNKYGMFKSVEELANVKGIGAGILK QLGDQVSVQEGAVLTETKVD >NE0880 probable ATP-dependent DNA helicase-related protein MSDLNTVFSADGLLARNIPDYRPRTQQLEMAQAIAQAIESQEVLVTEAGT GTGKTYAYLVPALLSGGKVILSTGTKTLQDQLFQRDIPTVRAALKIPVTI ALLKGRANYICHYHLERTLNSDHIHFASRTEVKYLNLIERYAGTSSHGDK SGLDKVPEQAAIWQHVTSTRENCLGSDCPHYRQCFVMEARKRALSADIVV VNHHLFFADVMLRDEGLSELLPACNTVIFDEAHQLPEVASLFFGESVSTG QIQVLVRDTDTEALLEAKDFAPLFDATAAVGKAVLDLHLTITEKHTRMSS ASAARYPGFSEARQVLQEKLVLLAGLLETQAVRSQGLQNCWLRAQTLLNR IRQWHEQSESREFICWVETYSQSLQFNTTPLSVAETFSKQLDASARAWIF TSATLSVKKDFSHYNRMMGLFEAKTANWDSPFDFPNQALLYVPSQLPDPN TPHYTESIVQAVLPVIKASQGRAFILCTSLRNMQQIHELLQVAFQREQLE FPLLLQGQEARSALLNQFRQLGNAVLVGSQSFWEGIDVKGNALSLVIIDR LPFASPDDPVLSARIEKFTREGRNAFMEYQLPHAIISLKQGAGRLIRDEK DRGVLMICDPRLVSKPYGKQIWQSLPPMKRTRDPDEVLRFLENVDQ >NE2412 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE1675 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1182 Helix-turn-helix protein, CopG family MRNTMTHRVTITLDAETFAFLNDVASSNRSAYVNQLLKQDRKNFLQAALR KANQEEAEDTNYQEKLQAWESTLSDGLAND >NE1760 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2228 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2166 UvrD/REP helicase MNTVLQQEIPDQRERQQALDPWHSFIVQAPAGSGKTGLLTQRFLVLLATV EEPEEIVAITFTRKAASEMKHRILQALRDTAGDINSDAESETALLNDAYQ RQLRELANRVLAHDQARGWQLLQNPSRLRIQTIDSLCAWLVDRMPVCSRQ GALSSVAEDADRLYLEAARLTVEALEEEGEWTAAIEHLIGHLDNRLDRLQ QLIADMLARRDLWLRGVVDAANSDDMRDRLESVLSGRIAEAIERLADAVP AGCQSEIIELMQFAAVNLSEAGSADSNTVRWPGNALEDRLVWESMADFLL TQTGDWRKQVTKANGFPAPSSVRDADVKEYLNGMKQRMSELLVALQSEET FRQQLQLLRQLPPERYTDEEWETLQALFSLLKVAAGYLLLVFRQHGQVDF TEIAMAAVRALGEPEMPTDLALALDYRIHHLLVDEFQDTSSSQAELLQRL TAGWQTGDGRTLFLVGDPMQSIYRFRQAEVGLFLDIRDSGYFGQIQMRFL RLSVNFRSQSGIVEWVNRYFPRILPDTDSVSTGAVSYASSVAFHAASSGE AVRIYPYLQKDDRAEAEQVGAIVAQARAAQPDGRIAVLVRNRSHLASIVV HLRRKGLRFQAVEIEQLAQRVVIRDLMALTRALVHPADRIAWLALLRAPF CGLSLQDLHTVANTLPQHVLIDSLRACAGSGVLSEEGGQRVNRVLPILER ALMLYDRMSLRRCVEGIWVSLGGPASVQNETDLADAEVYFQLLENFDVTG YRPDIQELDERLVRLFALPDVAADDSLQLMTLHKAKGLEFDTVILPGLGK SPRRDQEKLLNWLEFHDQSQHPGLLCAPISAAGSDKNPISAYILSEEKKR TALEEARLLYVAVTRAKHNLHLLGHLRIDPDMQENDALKPPEDTLLARLW PAVAADFLARSREAAIGDLPASNVHTGLQLVGMVRLVSGWQPPPLPKAVA VAMHANEAGTTEEPVDFDWAGEPARLVGVVVHCLLHRIGLIGVENVDHQD LEALKLAGRSLLIQSGITPRHLEKAVQQVARALRTMCVEDETGRWILSNR HQEARCEWALSVPTAIAAGHSISVSIIDRTFVDAAGVRWIIDYKTGSHTG GSLEEFLDREQLRYRPQLDRYAQVLQRMEDRPMHLALYFPLLGKWRKWIP SRESA >NE2178 conserved hypothetical protein MENPAMTTRPFDARQTNITDLIQRLDGCATIVTGNRRLARALHQAFNQAR SAEGHGAWPAPDILPWDAWLQQLWQEVVISSRIESAPGVLLTSHQEYFVW QEILAEQSGDVPLQATNETVARIMEAWQTLHAWCIPCREADFGHNADTRL FWQLASMFEAKCRKNSWLSVAVLPGILQKYVQIDSLSVPNELVLTGFDEW TPQQSSFLRAFEQTGCSLQWLQLSGQPDRIGKLACADGRDEIRQAARWIR QRLEENPAARIALVVPELAAQREMICQTLDEVLIPQALQPEHHDRVRSYN LSLGKPLDRYPPVSLALDVLGLSETVIELPHVSRVLRSSFIAGGDREMNA RALLDARLRESGEWNLTLQKLLTNAARSGQPYSCPLLAECLSNLMKQVKV SLAPTSPGEWAQRFGQWLKAIGWPGERGLSSEEYQVIQAWQGVLREFSTL DWVIRSVSLTEALQQLRHMVAGTIFQPESAEAPVQVLGLFETSGLQFDYL WIMGLHDGVFPASSRPNPFLPLTLQREVDAPHSSARRELRVAAALLQRIT TNATEVVISYPQRKGDEILDSSPLIDAFPALSEEMLAMGTQSAWRDSVYH SRQQEVLSEDVAPTFVGTGIPGGSKLFKLQAACPFRAFAELRLVARPLGR IQIGLNALVRGTLLHRVMEMVWAELDSLAALANLSPGELNALVAGKVNEA IYEIAPRYPHTFGERLQALESKRLHALVLAWLEMEKQRPPFRVSGREMET ELELNGLRINLRIDRIDTLEEGGELLIDYKTGEVKASAWFGDRPDEPQLP LYSLAFTDDGLAGIAFARIRAGDIAFEGVASEEVSILGIKSFENLRHTRE AASWDEVLAGWRQTIEQLVQDFMAGEARVSPKQYPQTCTYCELKPLCRIG ESLEAVDDC >NE2024 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE1586 transposase MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE0341 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE1880 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2413 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYL >NE0939 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0093 recQ; ATP-dependent DNA helicase MRHQFPEVGRLISELRDAPCEREDCQYCQTTHDPRHELKRYFGFPDFRYE QPGESLQHDIVLAGMRGQHVLAILATGGGKSLCYQLPALNRFHRNGSLTV IVSPLQSLMKDQVDGLLERNVQCAAALNGLLTMPERAEVLEKIQMGDVGI LLVSPEQFRNKAFRRAIRQRQVGAWIFDEAHCLSKWGSDFRPDYLYVSRF IREYTGDQPLAQIGCFTATAKPDVLADIQSHFRESLGIEFKVFPGGHERT NLHFDVLPCTKAEKWSRTDRLLHEHLDSQEGGAVVFVSSRKSAEELSDFL IGQGWPCKHFHAGLEPNTKTDIQDDFKAGQLRIIVATNAFGMGVDKADIR LVIHADIPGSLENYLQEAGRAGRDQGDARCVLLYDPQDIETQFGISERSK LSIRDIQQILRKLRNESNRRKGGKLVITAGEILLDDDVDTSFSADERDAE TKVVTAVAWLERGDYLKREENHTQIFPARLDMSEKEAEKRLLKAKLPQRR LEEFRAILRFLYGADADERVNTDQLMQLTSLESEEVASALKQLEEMGLLV NDSQITLYVRHGVTGASSQRLQSSLELERALLQRLPELASDAGQGEWQDL NLPALAAELKADTRQGDLLPLQVLRLLRSLADDHDANSQQRSSFELRQLN RDYLKLRIKGGHSWRQIERFGEKRRALAGVLMEFLIGKLPPGSRNKDLLV ETSFGELVKALESDLELPHLIAPDQRRKAVEHVLLYLHRQDILTLNHGMT VMRRAMTIEVKKEDKRKTFLKEDYLRLDEHYREKRIQVHVIHEYAEVALK EMAEALRLVLDYFTDSKQAFIKRHFAGREDVLKLATSEASWKSIIESLST TQKLIVADDDDINRLVLAGPGSGKTRVIVHRIAYLLRVRRVPATSIVALT FNRHAANEIRKRLLALVGADAYGVSVLTYHSMAMRLTGTRFERGDTVDER ALKRVLSDAVELLEGKRNVEGEDNLREQLLRGYRYILVDEYQDIDDLQYR LVSALAGRHAEEDGRLCIMAVGDDDQNIYAWRDTNNRYIERFREDYEASI SFLVDNYRSSLRIIEAANQLIGQNSARLKEANPIRIDRARQELPAGGLWE EQDKQRKGRVLRLLIDPSDRERGNLQAQAAMLELERLLVLEQGSWNGCAV LARTHRYLWPIQAWCEQHDIPYFLAADKETALPITRQRSFVAAIDSLREI ESALCAADAWLRLSGSNQLVEAEWKSFFQTAFEQMRGELGDCQLGSGALI DWLYDYARELRQQPKEGLYLGTVHSAKGLEFRHVVLLDGGWSTQVDTLAD ERRLYYVGMTRAEQTLALCEFADGNPFSRSLMKGVQQHAFQGQPLPELEL RFQQLTLKEIDISFAGRQLPHARIHKAIEALREGDPLTLKEEAGRYQLLD RQGNVVGRTAKSFQPQIGFAHCEVAVVIVRFAEDSEEQYRDLNKCERWEV VVPRGRG >NE0188 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1122 conserved hypothetical protein MNDLESLISQVRRCTLCAEHLPLGPRPVFQLHETARILIASQAPGRRVHE TGLPFNDPSGDRLREWLNMTRTIFYDPRRIAILPMGLCFPGTGKSGDLPP RPECAPAWRSALLSHLKNIRLTLLVGQYAQAYYFTRQGRKPVATLTENVR SWQKFWPDIVPLPHPSPRNNLWLRRNPWFEEEIIPALQERVAMILNQTTD S >NE1470 conserved hypothetical protein MRSTTLLFFCSNLVWISPLTWAQTSQQPDNLIPLPEIPESPSAGEENGLP PELGLDPSLEPEITIHEGKDKTMIEEYRVNGELYVIKITPRIGKPYYLLN RRSAVGMPHRGDMESGVSVPMWQIYRF >NE2096 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE1093 Transposase IS4 family MARFKPVQKGLMLLPVDFSRQIIPGSFEHALCYLVDHELDFSGLRERYRN NTQGAPAYDPAVLLKIILLAYSRGLIGSRRIEAMCRQNILFIAVAGDNQP HFTTLAAFIAELGDEVAKLFAQVLVVCDRQGLIGRELFAIDGVKLPSNAS KAKSGTRADYQRQAEKMEKAAKQMLVRHREIDMTPVDERQAQREACMLER LQKEAKQLKDWLAANPEDRKGPKGGVRQSNLTDNESAKMATGKGVIQGYT GVAVVDEKHQIILDAQAHGTGSEQELLVPVVQAIKPQMSNQTVITADAGY HSENNLKMLAAEGIDTYIPDNGYRKRDERYHGQEAHKTKPDPLWDKRGQP SISKRFGPGDFQLAEDGSHCLCPAGKRLYSNGSNCTFNGYAAMKFRGAER DCLPCTLRTQCLRTPEKTKTRQVAFFRGKRDGYETHTDRMKRKIDSDQGR QMITRRFATVEPVFGNLRNNKRLDRFTLRGRSKVDGQWKLYCLVHNIEKL AHYGVGQ >NE0518 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE0715 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE2028 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNTRICR >NE0560 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE2541 Site-specific recombinase MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWI PVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLAD FSKMVEVFERYAVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGER IRDKISASKRKGMWMGGVPPLGYDVENRRLVPNEREAKLIRHIFQRFVEL GSSTALVKELKLDGVTSKAWTTQDGKTRDGRLIDKGHIYKLLSNRTYLGE LRHKDQWYQAEHPPIINRELWDSVHAILETNGRVRGNTTRAKVPYLLKGI VFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELES AVLDQLRAILRAPNLLGEMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLF PAEQTRIVKLLVEKVIVSPNDLEVRLRANGIERLVLELRPEPVEQQEVAR A >NE1261 Integrase, catalytic core MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1560 conserved hypothetical protein MHCRSFPPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILG FDATSAYPLRVSSLKDHGVALWDVLQSCTRESSLDADIVAHTIVPNDFGR FFTACPDIRRVCFNGAKAAALYARHVKPFLQDAPTVEYVQLPSTSPANAA IPRADKLRAWSVIKHNA >NE0254 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1398 putative DNA polymerase-related protein, bacteriophage-type MDPNRLKELGLLPVWRVRPGAIAGQSPDSGKNMSDQIETAAESRPFEESE DRSTSIAHADWGRLRQMVSGCTACPLSQTRKQTVFGVGDEQADWLFIGEG PGAREDELGEPFVGQAGKLLDNMLQAVSLRRGQDVYIANIVKCRPPGNRN PQDAEAEQCRPYLLRQIALIQPRLIVALGKVAAQNLLATDASIASLRGRL HEFSGIPLIVTYHPAYLLRSLGDKAKAWEDLCFARDTMRNLQAAHSS >NE0845 DUF196 MLIIVTYDVSTETRAGRKRLRRVAKLCESIGQRVQKSVFECRINLMQYEE LERRLLSEIDEQEDNLRLYRLTEPAELHVKEYGNFKAIDFEGPLTI >NE0228 CHC2 zinc finger MIEQSFIQELLDRIDIVDVVARHLQLKKAGANFTACCPFHNEKTPSFTVN SSKQFYHCFGCGRHGNAISFLMEHSGASFVEAVESLATHAGMQIPDQVSI YPKIPDPGRVPSDKIKIDKEVEATSPLAGLYERMEQAAKFYRGQLKQSDQ AIAYLKERGISGRTALCFGIGYAPPGWQNLSGIFTDYPADDSSHPLVQAG LVVAHDGKKNYDRFRHRIMFPILDRKKKIVGFGGRALDGGEPKYLNSPET SLFVKGRELYNLASASPAIRKSARVIVVEGYMDVVMLVQSGVENVVATLG TATTAMHIQNLLRHTDEVVFCFDGDAAGTKAAWRALETSLPQLKDGKDIK FLFLPDKEDPDSYIRKYGRVAFEGLLEKAQPLSVFFCNELSGRVNLGTSE GRARLVQRAGPLLAQINAPVFGFMLTKRISELTGVGQNQLAAFLKTGKKN RSSTLRPEASRPLSVTPYRRLIQILLHAPDYANKLDTNLLAVNDEQNEEK VLLVALVDFLKTSACSMEEELNSVTILLHFDQTPHRVLLEKIVRDAHVKD ENWNIDAEFTGGMERLREMQRRSRMAELHSRPLVSLTPEEKNELRQLMLS >NE0231 hypothetical protein MTMIKKIETELLAAKATLSEISGRFKEFSDTQARLSADGDLLGLARLNKE HTGLEDSLLAADDTVRALESRLSVLRQAEYRPQFDKAHKTHLGAVQAETK AAEKLLAAIDAVFSAATDMQNLSDEVAATYHAARDLHNRAGLDHELRWPA PDGQIPIKISDRMNSLRDELVRTIRLYEDRLPQSQSLEGLRIIEQQQEEL VRNSGRGFR >NE0511 hypothetical protein MAHAKGGKLFDSPVRLNYIIGIKSHILDSNLQLKWSVLLDIDREPAETAE EQFDSSFFLMTEELRQLLPELMEALGGTASE >NE1991 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE0259 hypothetical protein MVKPTDAELRTSGGLTSVFLNCDTCLSDEDFNRLRRMEFTQNEHAILYGQ LGGSIAGMIELGPVASRTQSRQDEERKTEERRTAQFVQLVEQMRASIEQM EADVKRLVASFEKRDGDAWREKLALNILEADEIPQQEADESITAYRKRLE QHLINEMLNPDGTIKDKYKNDPKYGDYAEWAQTQFHLNSAKAAVAELDNS DTSPQRKEHILDEMKQRGYIEEMVFTDRISGNLDAQKSVRDIRDSQHDEA LSQVRPPEATLKFLS >NE2470 NUDIX hydrolase:Conserved hypothetical protein 52 MDFDVLEKTVCFQGFFRLERYRLRHRKFNGEWGRPITRELFERGHAAAVL PYDPQTDEVLLIEQFRAGAISAPGGPWLLEIVAGVIEANETPEQVVARES MEEANCQIGSLIPLYDYLVSPGGTTERIVLFCGRVDMQTIEAGAVYGNHG EDEDIKVHVMPLNEAIRLLSTGRINSASAIIALQWLALNRDSVRRRWLPE >NE1840 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1557 putative transposase MTYPISFRRKVLSVREKEGLTIAQTAARFCVGIASVTRWIKNPVPKESRN KPATKIDMAALAHDVREFPDAYQAERARRLGVSEKGIGHALRRMHISYKK NTAAPQSGRRQTAHLPGDD >NE0288 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNR >NE1347 conserved hypothetical protein MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKVWDEFTSKPSSTLIPSGQQRSC IPTKHQLHQLICSMTGYCRSLLNRVWALFAF >NE2385 Staphylococcus nuclease (SNase) homologues MHFTRALRIQLIPSFFFRAIYPLVVLLILLHAQSGLAETIYRSTDSHGRT LYSDIPTPAAKPLQPATPPARSKYRVTRVIDGDTIVLENNKRVRLLGINA PETGNRYHPGEPGGADAKKWLRGKLQGRSVYLEHDRQTHDHYKRMLAHLY LPDGEHINLSLVEKGLAIANLIPPNLLHANTLIRAQQRAETRKLGIWSMQ HYQPRPLIKLTEKPFGWQRYRVKAKVLKRNHRFSRLIISDNLDLSFANRD LALFPPLETYLNRPLEVRGWVSRRKNHFSIRIQHPSALILY >NE0744 conserved hypothetical protein MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL SSPIVPLQAALGRENQFQLLTNT >NE1884 possible homolog of eukaryotic DNA ligase III MTDFFRFPNTPHLLWLGQGQPRDDKILSDAEIAALLQDEVLIEEKLDGAN LGISLDEHGELRAQNRGQYLPQPFSGQFSRLNSWLGQHGEILKHTLTPEM ILFGEWCAARHSLDYNKLPDWFLLFDVYDREAGKFWSVERRNQLAQKLNI TTVPLLKRTKITCNQLVQLLDDAQSRYRSGKVEGIVIRCDSPLWCESRAK LVNREFVQAIEDHWRSRSIEWNLVHAGSVKRS >NE2469 Transposase IS4 family MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA >NE2445 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1106 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE0110 hypothetical protein MPADHFLKRQAMSDFIICYDITDPRRLGRLYRYLIKRAVPLQYSVFLFRG DDRQLERCIQDAIELIDEKQDDLRVYPLPGRGLKARIGRPTLPEGIQWSG LPAKW >NE1378 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1219 UMUC family (DNA-repair) MHGMNTKNRRIAHLDMDAFYASVELLRYPELRGLPVVIGGRSVHQPVIQP DGKRSYVRLRDYTGRGVVTTSTYEARAYGVFSAMGIMRAAQLAPDAILLP ADFDTYRHYSRLFKDAIARITPHIEDRGIDEIYIDLSEHPDETASLASSI KQAVRDATGLSCSIGIAPNKLLAKISSDLEKPDGLTILTHTDIPNRIWPL SVRKINGIGPKAEEKLVRLGIQKIGELAKAELSLLQAHFGRSNAIWLHDS AHGRDSRPVVISSESKSISREATFERDLHVQEDREILSDIFTELCTRVAE DLQRKGYVGRTIGIKLRYENFQTITRDLTVRNPTADASTIRKAARDCLRR VPFEQKLRLLGVRISGLSKISALLKENYYFQEELF >NE0162 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2520 ATP-dependent DNA helicase RecQ MAYDPKRALELLRIGSGRANATFRDGQEDAIRHIVEGKGRLLVVQKTGWG KSFVYFIATKLLREAGAGPALLISPLLALMRNQIAAAERMGVRAATINSD NMDDWTVVEGKLAKGEIDILLISPERLANERFRTQVLAGIAAQISMLVID EAHCISDWGHDFRPHYRLLERIVKTLPPNLRLLATTATANNRVMEDLAAV LGPKLDVSRGDLNRTSLSLQTIRLPSQAERLAWLAEQLATLQGHGIIYTL TVRDANQVAQWLKTQGFNVEAYTGETGDRREQLEQALLNNQVKALVATTA LGMGYDKPDLAFVIHYQMPGSVVAYYQQVGRAGRALDSAYGVLLSGQEES DITDWFIRSAFPTRQEVADVLGALEDEPNGLSVPELLSRVNLSKGRVDKT IALLSLEAPAPIAKQGSKWQLTAATLSEAFWDRAERLTALRRDEHQQMQD YVSLPFGEHMGFLIGALDGDPSVVAEPALPPLPATVDAELVKAAVEFLRR TSLPIEPRKKWPDGGMPQYGVKGFIAPAHQAESGKALCVWGDAGWGGLVR QGKYHDGHFSDDLVAACVKMIQEWNPQPSPTWVTCVPSLRHPELVPNFAQ RLAAALGLPFHMVIAKTDARPEQKTMANSTQQARNIDGSLALNGQPIPPG PVLLVDDMVDSRWTLTVSAWLLRKNGSGEVWPMALSQTGHDE >NE1053 Uncharacterized ATPase related to the helicase subunit of the Holliday junction resolvase MTDSPHTIRNPAAPLAERLRPRTLDDVVGQSHLLGPGKPLRLAFESGKPH SMILWGPPGSGKTTLARLMAHAFDAEFIAISAVLSGVKDIREAIERAQIT LQRTGRATLLFVDEVHRFNKAQQDAFLPHVEQGLITFIGATTENPSFEVN GALLSRAQVYALKALTDQELHQLFERARSIAMLDLEFENTAIELLIGFAD GDARRLLNLLEQVQNAAETEEIIKIDADYLSRVLARNVRRFDKGGDAFYD QISALHKSIRGSSPDAALYWLCRMLDGGADPRYIGRRLVRTATEDIGLAD PRALTLALNACEVFERLGSPEGELALAQATLYLACAPKSNAAYVAYKQAR AFIKEDISRPVPIHLRNAPTRLMREMGHGAAYRYAHDESESYAAGENYFP DNILAVQFYRPTTHGLEAKIGEKLAYLRSLDEKTGKKRN >NE0239 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2013 hypothetical protein MCRCRLHWCSHTLESIEEHGCTAHVKGRGQQAKEKRRHPGAHARRWIIEV SHGWFNCLRKLLMRYEKLARSFLGLNHLAAAIIAFRKVPLAVNIIYE >NE0257 Site-specific recombinase MESQHNHVMKKSCRIIGYARVSTEDQHLDLQIDALKLAGCSSIFEDHGLS ATAKRRPGFEQALASLQAGDIFVVWKMDRAFRSLKNALDILEEFENRAIE FRCLTEDIDTTTPMGKCMYQIRHAFSELERNLIRERTKAGMEAARQRGAH LGRPKKLSRGQIIRMQNLLQRQPDMTPVQIADQFGVSSRTIYRALSKYST IKEELAIHAG >NE1521 possible transposase MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG APWRNLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS HCKVHPHANGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL IARRLAA >NE0340 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNT >NE2466 putative lipoprotein MYQKFNKPVNAALHLILVLSITACASQNKFFDTLDYNRDWEAIQSNLPAY PQPENLLEFDSGPATSLRYFIDAKSISVDEKRVIRYSIVIQSQQGANNVS YEGLRCETRERKRYATGNNDIRSWVRANTSEWQPLEAVAQLRAQRELAKY YFCPRGLVVGSPAEAVRALKAGVHPMVIR >NE0519 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0244 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNIYRAQQDHFQLVGTDRLEIMRGHGIQRHASKQRWH ISDKTTQLAAQRFHVKRPETLHEIGMPVTLHDTVTAVTDMSNDIFEQPCL TGCAERRFALGSEQMPIGRKAATRHRKGRLLRIVVEW >NE0098 conserved hypothetical protein MRQQLLDITEIGPPSLDNFISSGNEEVLYTLRNLVAGNQQDRFYYLWGKT GSGKSHLLQAVADAFSEQQCNSRYIDCNQDEPNFNPGTDCIVIDNVERLD DAAQIRLFNLYNHLRDNKHGIFLASGTKPPAQLDLRQDLTTRLGWGLVYQ VHELTDEKKIEVMQDYAIRCGFELPLEICHYLLKYEQRNLSSLIRLVHAL DQLSLTRQRPITLPLLRELL >NE2232 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0119 hypothetical protein MKFLDTQELVVSTLSPVHIGCGEDYEPTEYVVDTSGVLHRFNAGILPDLS DAGISSDILTILSNDEAHTEQLRAVHKVLSKYRDKIIPLASVHVSMCTGV HAHYKSTQDKKNDFNRNGVERTSYQPFNQLPYLPGSSIKGAIRTAILNEH IAGNNPCSTVLMRQIQDFNTMIEEYDPGNGKLLLRLKLQHTKWDYDRARK NIEKAIADVSSALGTDLLGGKFETDPLRALKVSDAAPLDIEIEREIRFCL NRSRSGRRSQAQVKNLYTRLEYILEHQPAAFSLSLTLQNLHEIAGRRNHR NELISPSADKLLLWTGIVKACNSYYLNRLDDDLAMLGKLYPTSEWRKQTQ SILDAGLRDQIKTGNCLLLRIGKHGGANSNTVSGRQIKIMLNEDKREANG KEEKIRLYTFDDESRTIWYCGDDLDKPSDLLPHGWIVLSNPDQIWHADLP GFERRCARQQAIAESARRQAEAAAAEQAKAAAQAAREAALAAMTENQRRI EAFVSMCARRAEQLRGGKENPNAAIHTAARELVKAALEGADWTIDEKCAV ADAIEEWLPKLVKVELKDERKKLKLSALRT >NE0155 Integrase, catalytic core MCGVFREGVAVRYARIEQLRQHHAVAAMCRILDVSESGYHAWRQRPPSAR QQENLRLETEVKAAHQRTRETYGPRRLRSDLADHGIQTSLYRIKRIRRKL GLRCKQKRKFKATTDSRHALPLAPNLLDRQFTVAAPDRAWVSDITYVATD EGWLYLAGIKDLFNGELVGYAMSERMTTSLVSQALFRAVAAKRPARGLIH HSDRGSQYCAHAYRKQLQQFGMQASMSRKGNCWDNAPMESFWGSLKNELV HHRRFTTRTQARQEITEYIEIFYNRIRKQARLGYLSPAQFTQKYHAKQIA A >NE2201 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0714 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWPNRT >NE1260 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0517 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNT >NE1366 conserved hypothetical protein MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVDEFTSKRSSPLIPSGHQRSC IQQKHHLHQLICSMTGYCRSLLNRVWALFAF >NE1815 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0935 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPGPMVKSNA >NE0844 Protein of unknown function DUF48 MTQARLCLSVLYFPFSETTPKDLDQVRGIEGDAAKTYFSALPYLVRKDIR EFFTMDGRTRRPPRDRFNAMLSFIYSLVMNDCRSALESVGLDPQIGFLHA VRPGRAALALDLMEEFRSFMADRLALTLINRGQITDQDLLVREGGAVHLE DKARKTVVVAYQERKQEEITHPLLETKVPIGLLPQLQARFMARVIRGEMD GYLPFLVR >NE0253 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0236 ccrB, Site-specific recombinase MTQQAVIYCRVSSLKQVTEGHGLASQETRCREYAKHKGYEVVEVFHDEGI TGKLLDRPNMKAMLIYLKQHRATRPVVIIDDISRLARDIETHLHLRASIS AAGGKLESPSIEFGDDSDSRLVEHLLASVAAHQREKNAEQVFNRMKARMM NGYSVFNAPIGYRYDKVGKHGKLLVPDQPCASVIAEGLEGFASGRFETQI ELMRFFEASPHYPKDRFGTVHMQRIKEILSRVLYAGYLDKPDWGIHLVKG HHEALVSYETWKKVQARLNGQAKAPVRKDINEDFPLRGFVTCACCGSPLT ACWTRGGGGLYAYYLCYGKTSGVKCSQNGKSIPKDKLEGEFGALLSEMKP SKEMFLLAAEIFTDLWNIKRDTAKQEAETIRRNLLQIERKTEQFLDRIAD TDNSILITAYEKKIRQLEEEKIALDEKIAQCGRPLQSFDETFRTAFSFLS NPYQLWVSSRLEHKRAVMKLAFSERLRYCRNEGFRTPEKSLPFLLLEGSD EGKHEMVGLVGLEPTTKGL >NE0001 dnaA, dnaA; chromosomal replication initiator protein MQKIETFWHFCLKHFRQELNGQQFNTWIKPLKLEVCPDEKNTLILIAPNR FVLQWIKDNFVTRIDEMAQDHFNERISFRLELREPAESEAQTVRTSAQKN REDKKPAAEKTQGVTSRKTNPSQLNASFTFDAFVTGKANQLARAGAIQVA ERPGIAYNPLFIYGGVGLGKTHLMQAIGNYVLELDAGAKIRYVHAEKYVS DVVSAYQHKSFDKFKLYYHSLDLLLVDDVQFFSGKNRTQEEFFYAFNALI EAHKQVIITSDCYPKEISGLEERLVSRFGWGLTVAIEPPELEMRVAILLK KALAEKIELDENTAFFIAKYIRSNVRELEGALKRVLAFSRFTGHSISLDL AKEALKDLLAIQNRQISIENIQKTVADYYKIKVADMYSKKRVRTIVRPRQ VAMAIAKELTQLSLPDIGEAFGGRDHTTVLHAHRKIIELRTSDPGINRDF NALMHILRG >NE0194 dnaB, dnaB; replicative DNA helicase protein MNQLTSSFIQNLAEDNIYKLPPHSIEAEQSVLGGLMLDNQAWDKVADIII ESDFYRQDHQLIYQHISRLIEQNKPADVITVAESLENAAQLQHAGGLAYI GAIAQNTPSAANIRRYAEIVRERSIMRKLAQVSTQITDSAYNPAGRSAGD LLDEAESRIFEIAEQSAHGKQGFVDIQPLLKQVVERIEVLYNRSNPSDIT GIPSGFNDLDQKTSGFQPGDLIIVAGRPSMGKTAFALNIGEHVALETSKP VAVFSMEMGGVQLAMRMLGSIGRLDQHKMRTGQLNDDDWPRLTHALGKLN DAPIFIDESAGLNSLELRARARRLYRQHEGLGLIIIDYLQLMSATSPGSE NRAAEISEISRSLKALAKELQVPVIALSQLNRGLEQRPNKRPIMSDLRES GAIEQDADVILFIYRDEVYNPDTPDKGIAEIIIGKQRNGPIGKVDLTFLG EFTRFENCARTADYY >NE1978 dnaE1, dnaE1; DNA polymerase III (alpha chain) protein MPIDPVFIHLRLHSEYSVVDGIVRVEEAVAKARDVGMPALALTDLSNLFG LVKFYQCAFKAGIKPIAGCDVWVTNENDADRPFRLLLLCQSFSGYLLLSR LLSRAYRENMCRGRAELKKSWFREEDAGTEGLIALSGGGQGEVEQLLLAD PPAAVTAAQQWADLFPGRFYLEIQRCGRPNEETSGYALLDLASSLKLPVV ATHPVQFMRPEDFRAHEARVCIAQGYVLGDRRRPKEFTGQQYFKTPAEMG ELFRDVPEALANSVEIARRCSLMLELGVNRLPDFPTPAGISVEQHLRELA QTGLEARLLQSFPQVLQRDERRPIYQMRLDFEVETIIQMGFAGYFLIVAD FIGWAKQHDVPVGPGRGSGAGSLVAYSLGITDLDPLLYDLLFERFLNPER VSMPDFDIDFCQDRREQVIEYVRDRYGAESVAQIATFGTMAAKAVVRDVG RVLDLPYNFVDQLAKLVPFELGMTLRKAREIEPLLNQRAEEEEDVRNLLE LAERLEGLTRNVGMHAGGVLIAPGKITDFCPVYCADSGDAVVSQYDKDDV EKVGLVKFDFLGLRTLTILDRAVADIRQYRAASPGSAVAEPDVQSAEESH FSLESISLEDAATFSLMAKGNTVGIFQFESRGMKDLLQRARPDRFEDLIA LVALYRPGPMDLIPDFIERKHGKRVDYLDPRLQPILGPTYGIMIYQEQVM QIAQVIGGYSLGGADLLRRAMGKKKVEEMAQQRAVFVEGAIRNEMAEADA VTLFGLMEKFAGYGFNKSHAAAYALIAYQTAYLKTHYPAEFMAACMSSDM DDTDKVNVFYEDCKLNGIVILPPDINESGYYFVPVDHKTIRYGLGAVKGS GEAAISAIVQVREQGSTFTGLFDFCRRVDRRIVNRRTIEALIRAGAFDSV ETNRAALLESVGNAMEYAEQCSLAASQVSLFDENTDLIQPPAITGVAQWP EREKLQNEKMALGFYLSGHPYDSYARELSCFIPVRLSRIVPGREPQLIAG VIYAIRTQMSRRGKMAIVTLDDGLARVEVVVYSDLLSTGSHFMKADQLLV VRALVSHGNGENADRRIVAKEIYDYVTARSMHARKLRIMIDDSGLLTPAQ LKELLAANLPENGVNNVIPSSGCAVSIDFRNQVGSCEIDLSSRWRVHLHE GLIESLMDILGRDKVEVVY >NE0002 dnaN, DNA polymerase III, beta chain MKLTITDRDLLFKPLQTVSGIVERRHTLPILSNTLIEIRNGQLTLVTTDL EIEAEATSNIPELENQGALQTTVSVRKLQDILRALPSGAAIELTRSENRL QIVSGKSRFSLQLLPAEDFPRMIRDSEPCSATYTLAQRVLKKHLQRVAHA MAQQDLRYYLNGMLLLIEDNKLTLVATDTHRLGITSIDLDGNFEKSETIV PRKTVLELIRQLEDSDKPVIVEIYPKKVCFRFSDAVLVSKVISGKFLDFR RAIPQTSVFQFDVNRLDFLHALQRTAIISSSNDLFRNVHLNITNGKLNIS AKNKEQEEAQEEIDIVYSNETIDTSFNIVYLMEVLNNLDSEQIRCSFESM QSAILITLPDDEQFKHVLMPMRE >NE0141 dnaQ, probable DNA polymerase III (epsilon chain) protein MRYVFLDTETTGLDPALGHRIVEIAAVEVCNRRLTDRHFHRYLNPGRESD EGALRVHGLTREFLRDKPVFQDVCSEFLEFIADAEIFIHNAPFDVGFINR ELDLIRFESMQNHCLQIIDTLVLAKELHPGKRNNLDALCERYQIDNSHRT LHGALLDAELLAEVYLAMTRGQESLLMEMDAPASRQADNPAVGKVENLAL IVQPATQAELELHSRLVERINAESKGNCLWNG >NE0433 dnaX, dnaX; DNA polymerase III (subunits tau and gamma) protein MTDSQVLARKWRPKDFSELVGQEHVVRALINSMEQNRLHHAYLFTGTRGV GKTTVARILAKALNCEQGVTAAPCGKCAACMAIDQGNFIDLIELDAASNT QVDAMRELLDNAQYAPVAARYKVYLIDEVHMLSRSAFNAMLKTLEEPPEH VKFILATTDPQKIPVTVLSRCLQFNLKQIPPSLIVERLTEILSMEGIPAD AAGLRLLAQAAKGSLRDALSLLDQAIAFGNSVVNESDTRAMLGVLDQDHI FALLEALAEQNGAAIFAIADQLEAASVSFDQALQDLAALLHRLATAQVIP QMLDETQPDGDRLLALTKRFSPEDIQLFYQIVLHGRTDLAHAPDEYSGFT MTLMRMLAFMPDSRQPGRAYADTGTDHAREVKVEAPSCPREAKPVSDQSP NEAWLALVNQLKLSGMTRMLAQYSEAKSFSESRIELYVAEMHKHLLEKSY QDRLRSQLEIHFGKPVEVIFSQGSITGVTSAALQDRDKLARQSKAVEAIE SDPYVQELIEQFDARLNVSSIKPID >NE0875 fis, probable factor-for-inversion-stimulation transcription regulator protein MTVINENEIALCIRRAVEAYFQDLDGEKPCPIYEMVIRSVEKPLIEIAMH YAQGNQSKAAELLGINRNTLRNRLTKHQIR >NE0332 gyrA, DNA gyrase/topoisomerase IV, subunit A MEQFAKETLPVSLEDEMRRSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY AMHELSNDWNRPYKKSARIVGDVIGKYHPHGDTAVYDTIVRMAQPFSLRY MLVDGQGNFGSIDGDNAAAMRYTEIRMSRIAHELLADLDKNTVDFGPNYD GSEQEPLILPAKIPNLLINGSSGIAVGMATNIPPHNLGEVIDACLLLLRD PDVDIAELMACIPAPDFPTAGIIYGISGIKDGYQTGRGRVIMRARTHFEE LDKGNRHSIIIDELPYQVNKANLLVRIGELVRDKRIEGISDLRDESDKSG MRVVIELKRGEVPEVVLNNLYKETQMQDTFGINMVALVDGQPRLLNLKQM LDHFLRHRREVVTRRTLFELRKARERGHLLEGLAVALSNVDEIIALIKAA PTPAEAKKGLMARTWRSSLVEEMLLRAMIDAAVFRPETLAAGFGMSDQGY RLSDAQAQAILDLRLQRLTGLEQEKIVSEYREILDKIRDLLDILANPERI TTIIVEELTAIKGQFGDPRRSEVVIDAQNLNTEDLITPADMVVTLSHAGY IKSQLLDDYRAQKRGGRGKQAITTREDDFIDNLFIANTHDFILCFSSLGR VYWIKVYNVPQGSRTSRGRPVNNLVPLEQNEKINAVLPVKSFDDTRYVFM STAGGTVKKTPLSEFSRPRTNGIIAIDLDEGDYLIGVALTEGKHDVMLFS DAGKAMRFDENDVRPTGRNARGVRGMKLGAGQQVISLLVADNENMAVLTA TENGYGKRTPITEYTRHNRGTQGMIAINTNVRNGKVVAAQLVESSDEIML ITTGGVMIRTRVSEIREMGRATQGVTLINLDAGEKLAGLERIVETDED >NE0003 gyrB, DNA gyrase, subunit B:DNA topoisomerase II gyrB MNTNQPESAKKTDNSHRDYNSDSIKILKGLDAVRKRPGMYIGDTSDGTGL HHMVFEVVDNAIDEALAGYCDDISVIIHADNSVSIHDNGRGIPTDIKQDD ELKRSAAEIVMTELHAGGKFDDNSYKVSGGLHGVGVSVVNALSEWLRLTI RRNGNVYQMEFREGVAVAPLKVTGQTEKHGTEVHFLASQSVFGDITYHYD IFAKRLRELSFLNHGIKIRLADQRDDREEVFAFTGGIRNFVEYINRSKTV LHPSIFYAKGLKDNITVEIAMQWNDSYAEQVLCFTNNIPQKDGGTHLTGL RAAMTRTLNNYIEKNELAKKAKVDTTGDDMREGITCVLSVKLFEPKFSSQ TKEKLVSSEVRPAVEEIVVQKLSDFLLENPNEAKTICNKIIEAARAREAA RKARELTRRKGVLDSMGLPGKLADCQEKDPKLCELYLVEGDSAGGSAKQG RDRKFQAIMPLKGKILNVEKSRFDKLISSQEIVSLITALGTGIGKDEYNP DKLRYHRIIIMTDADVDGSHIRTLLLTFFYRQMPELIERGHIYIAQPPLY KIKHGKQERYLKDDYELKHYILGLALVGAELHTGANNPPITGEALARIAD EYLLAETVIERMSRLIDRTVMYALLKQPDIDLSSETSARDSAARLAILLD DVEILAEYDENFERYRLKIIRKQHGNLRTSYLDDDFLQSGDFARIRQAAQ ILHGLIGEGAKVKRGEQEISVREFKEALEWLLEETKKGITIQRYKGLGEM NPEQLWETTMDPGNRRLLRAQIEDSILTDEIFTTLMGDVVEPRRAFIESN ALRARNIDI >NE1137 holA, putative DNA polymerase III (delta subunit) protein MRLDPEHLARQLDGSIAPLYVVLGDELLLVMEAVDGIRAYVRGQGYTERT ILTADQRFDWMNLFQWGRQSSLFSERRMLDLRIPSGKPGREGGVAIETFC RELPRDTVTVVTLPEIDKQGRASKWFKALEQAGQVIEVKPVGRDRLAHWI KQRLDRQNQMIDQDTLQFFAGKVEGNLLAAHQEIHKLGLLYPPGRLTFEQ VKNAILDVTRFDVLQLPETMLTADMVRYRHILEGLQGEGVAPPLILAILS EQIRLLIKIHLLKNSSRGMTIEQAMTALRIWPARQKLMMGAIQRIRYPLL VQALLQAAVIDRIIKGVEQGDIWEELLNLGICFAADSSFKIIGRKDLSFI INLSLK >NE2180 holB, putative DNA polymerase III (delta' subunit) protein MATAEIFPWQRVIWQQARQSGSAQRHHALLLKGRRGIGKLGFALALAKSI LCGQGDAAGVACGKCQDCYWFEQGLHPNFRLLEPEALSAQEGATDKDDEE NRREAGSTKSGRKPSQQISIAQIRALDDFIYLSAHQARDKVVLIHPAEAM NTAAANALLKKLEEPPPEVLFILVTHNVSLIPPTVLSRCRQTAMPGPDHE MAKDWLIHQGITDPDFHLAMSGFSPLLALQYDERLAASHTDFIQCLCAPE RFDPIELAEKLHKLDLSSVTGWLQKWCYDLMSCRTSGRVRYHLKQVAVIR QQAAVIDPVAFGFLWRNLIASQQLARHPLNPRLFLEAMLLTYMDSIRPAG SAG >NE0442 holC, putative DNA polymerase III (chi subunit) protein MLIACRLCAKAVQQGLKTVVYVPDERLAGQFDKLLWTFTPTGFVPHCRVD NKLADVTPVIMNSRPVLMEAGCFGVLLNLDADVPPGFEQFPRVVEIVDEA EDGKLQARKRYRHYQEQGHDVRHHRLDGN >NE0833 hrpA, HrpA-like helicases MTYLPEQPACITYPEDLPVVARREEIAHAIQQHQAIIICGETGSGKTTQL PKICLELGQGAGRQGTGHLIGHTQPRRIAARTVAARIAAELNSPLGKLVG YKVRFSDQTHPNTRIKLMTDGILLAETQQDPLLRAYQTIIIDEAHERSLN IDFLLGYLKQLLPRRPDLKLIITSATIDAQRFASHFNDAPIIEVSGRLFP VEIHYRPNDPIDGEDRDLPRAILSTIDEAMRMGEGDTLVFLPGEREIRET AETVRKYAFSGPGGKAGLEILPLFARLSHTEQARIFAPGQQRRIVLATNV AETSLTVPGIRYVIDTGLARINRYSYRNKVEQLLVEKISQASANQRAGRC GRVMNGVCFRLYSEEDFNARPEYTDPEILRSSLAAVILRMKSLKIGDVEQ FPFIQPPAPRMIADGYQLLSELGALDERKGLTQIGHQLARFPTDPRIARM IMAAKQENCLSEVLIIAAALSLQDPRDRPFEHQQAADQAHQPFRDDRSDF MGYLKLWDFYDELLKHKKSNKKLIEQCQKNFISHRRMREWREIHGQLHIL ISEMGLRPNQVSAGYDEIHRALLSGLLGNIGFKSDEKGVYEGARAIKFSI FPGSSLRKKQPKWVVAAELAETTKLYARCAAAIDPAWLERIAGKLCKRHY FDPHWEKQRAQAMAFERITLYGLTIVPKRRIAYGPIDPAHAREIFIRQAL VAGEYESTAPFLQHNQQLIDEIRELESKVRRQDILVDEQQIFEFYAARIP AGIYSGTAFEKWRKQAEQTEPELLYLTREVLIRQAVDGTAAEQFPETLTA AGHVLPLSYRFDPGHPLDGVTVTVPLPLLNQIMPFHFDRLVPGLIREKIG WYVKMLPKQVRRHAIPVPQFVTRFLEWLDSCPDQAMLLAESLTAFIRSET GIKVPLDTWDSRLLPVHLQMNVKVIDDAGMTLGMGHDLIELKAQFGQTAQ QLFARGAGAEPDSIERDDITRWDFGELPVETRFSRAGKLLTGYPALVDQE QSVAVRIFDTQEGAQRSMRGGVLRLLCLALKDRIKQLEKNLPVDRQAILL MSSLIEMDRLKEDIRSAIIDLALIGDDPLPRNEDEFNSQTSRARTRLGSV SQEIAGLIHTIAQPCQELKKRLSVLDKSAVFLKKDMEEQLHHLIYPGFLS TTRWQYLQHLPRYLKGMILRLDKYNKNPARDQEQTEIISTLWNQYIQRLN KHRQAGVIDPNMEIFRWQIEELRISLFSQELKTPAPVSVKRLQKLWESVR E >NE2207 hupB, Bacterial histone-like DNA-binding protein MNKSDLIDVIAQSADLTKAQAGNALDGALSAIKDALGKNDSVTLVGFGTF KVGKRAARTGRNPRTGAEIKIKAAKVPKFTAGKALKDAVN >NE0952 ihfA, Bacterial histone-like DNA-binding protein MALTKAELTDLLFENIGLNKREAKEIVECFYEEMRAALQNGDGVKLSGFG NFQLRTKPQRPGRNPKTGEEIPISARRVVTFHASQKLKSMVEANYRGESG TN >NE1961 ihfB, Bacterial histone-like DNA-binding protein MTKSELISKLAERFPQLLAKDAELVVKIILDAMAKSLSRGERIEIRGFGS FDLNYRPSRVGRNPKSGEKVHVPEKYVPHFKAGKKMRELIDSGPKQHKVL DRVTG >NE0450 int, Phage integrase MQWIKRFILFHGKRHPQEMGSAEIEAFLTHLAVAGKVSASTQNQALSALL FLYKEILSIDLPWLNEIVRAKQPQRLPTVLTRTEVQAILVRMSGTYGLMA NLLYGTGMRLMECVRLRVKDVDFERGEILIRDGKGSKDRVTMLPESLAGP LQAHLLHRRTLFDDDSRLGKASVYLPDALERKYPNAATDWVWQYIFSSGS FSIDPRSGTERRHHIDEKLLQRAMKKAVQASGITKLATPHTLRHSFATHL LDSGYDIRTIQELLGHKDVHTTMIYTHVLNKGGRGVRSPLDM >NE0235 intF, Phage integrase MLTKVRLTPSRIAAHTCPADASQAFLWDTATPGLAVRATAGKRAFIFQGR FAGKSIRITIGDTEVWTIEQARQRARELQGLVDQGRDPRLVKQEKIAADV QARITDEPALPAWRDYIAARSGKWSEAHAADHLKMARDGGEPVTRGRRIG APAYTEKGILRPLLDLPLKGITREKVAQWLDNEATRRPAQARLALSLLGT FLSWCGNQPAYRNQVNSDACAKLKRELPKPTARTDCLQREQLASWFAAVR SIDNPVMSAYLQSLLLTGARREELAGLGWEDVDFQWQTIHLADKVEHSGR TIPLTPYVSQLLQSLPKINEFVFASKRAKSGRLQEPRKAHNQAIEAAGLP PLSIHGLRRSFATLSEWVEAPSGITAQIMGHKPSAIAERHYKRRPVDLLR VWHTKIEEWILSNANI >NE2189 intINeu, Integron integrase; Phage integrase; Phage integrase N-terminal SAM-like domain MGNTNTPPKLLDQVRDRIRIKHYSLRTETQYVQWIKRFILFHGKRHPQEM GAAEVEAFLTHLAVVGKVSASTQNQALSALLFLYKEVLSIDLPWLDKVVR AKQPQRLPVVLTRTEVQAILVRMSGTYGLMANLLYGTGMRLMECVRLRVK DVDFERGEILIRDGKGAKDRVTILPESLVSPLQTYLLQRRVLFDDDIRLG KASVYLPDALERKYPNAATDWIWQYIFPSGSFSIDPRSSVERRHHIDEKL LQRAMKKAVQTSGITKLATPHTLRHSFATHLLDSGYDIRTIQELLGHKDV HTTMIYTHVLNKGGRGVRSPLDM >NE1753 lig, NAD-dependent DNA ligase MISENTIEERLQALRAAIALHDFHYYVQDAPVIPDAEYDALFRTLQQLEQ QYPHLVTPDSPTQRVGAPPLKVFAQLTHQTPMLSLANAFSEEEVTAFDRR IREALNIDRVDYAVEPKFDGLAISLIYANGILTKGATRGDGYTGEDITLN LRTIPSIPLRLQVPFPTGQFEVRGEVVMLKTDFERLNEQQRKNGEKTFVN PRNAAAGSLRQLDSRITAMRRLTFFAYGIGAYHEDQPIFSTHSEILAYLA TQQFLVARQSSTVMGANGLLAYYREMNAVRLSLPYEIDGVVYKVNDLAQQ EKLGYVSRAPRFAIAHKFPAQEVSTELLAIEIQVGRTGALTPVARLAPVF VGGVTVTNATLHNEDEVQRKQIMIGDTVIVRRAGDVIPEVVAVIVERRPT HAQAFVMPDHCPVCGSKAVRLPDEAVTRCTGGLYCPAQRKQAILHFASRR AIDIDGLGEKLVDQLIDRELVHTPADLYRLDIDTLAGLERMAGKSARNLV TAIEDSKKTTLPRFIYALGIRHVGEATAKALASHTGDLDRLMDMNAEQLQ QIPDIGPIVAQSIADFFSEAHNREVIEQLLSCGLQWEKPSHIAQPSSRTN LAVPGKTFVLTGTLPTMTRDQAKNRIEQQGGKVTGSVSSATSYVVAGSDP GSKYARAIELGIPVLDEDQLLSLLRDTSSSE >NE0008 mfd, mfd: transcription-repair coupling factor MSSKLNPLSSESLPRYTGLEGSSDACALARLANRNPAGQLLAVITASALD AQRLLEEIPFFAPDLRVSLLPDWETLPYDIFSPHQDLISERLATFYQIAH NACDVLIIPVTTALYRMPPREFLAAHSFFVNQGSTLDLQSFRSQMSLAGY SHVSQVLSPGEYSIRGGLIDLFPMGSPLPYRIDLFDDEIESIRTFDVDTQ RSIYPVKEIRLLPAREFPLDDNGRSRFRTGFREKFEGDPTRCRLYQEISK GNIPAGIEYYLPLFFEQTATLFDYLAQHSTVCLHGEITPAIENFWQDTRS RYQLMRNDPDRPLLPPMDLFLPEDQFYGYLKSYKRIEMHTGQQVKTDKPF ARSLPPVRVDRRASNPIEQLTAFVHTFTQKGGRVLLLAESMGRRELMAEY LREYGLKLKLCEDFAAFQSDTASCMLSVASLHSGFILAAENLALVTENEL YATHVRGQRTRDARKTVSADSILRDLSEIKPGSPVVHEQHGIGRYLGLVN MNMGEDDSGQSSEFLALEYQGGDKLYVPVTQLHLISRYSGAAPEAAPLHK LGSGQWEKAKRKAMQQVRDTAAELLNLYAQRAARKGHIFRFNQHDYNAFA DGFGFEETPDQATAINAVIQDMVSGKSMDRLICGDVGFGKTEVALRAAFV AVTDGKQVAVLVPTTLLAEQHYQNFSDRFGLIADQWPVKIAELSRFRSAR EQAEALQSLAQGTTDIIIGTHKLIQDKVKFKNLGLVIIDEEHRFGVRQKE QLKKLRAEVDVLTLTATPIPRTLAMSLEGLRDFSVIATAPQRRLAIRTFV HPYSEGIIREACLRELKRGGQIYFLYNEVSTIQNMYTRLTTLLPEARINI AHGQMRESELEHVMRDFYQQRFNLLLCTTIIETGIDIPTANTIIIHRADK FGLAQLHQLRGRVGRSHHQAYAYLLTPPEKAALTTQATRRLEAIQAMEEL GSGFYLAMHDLEIRGAGAVLGDSQSGEMQEVGFSLYSSLLDAAIKSLKAG HEPDMQQPLGVSTEIRLHVPALLPESYCGDIHERLILYKRMAGCSDETEL DEIHQELIDRFGLLPDPARALLDSHRLRIEARQLGITRIDAGPDNIQLQF VPEPPIEAIKIIQLIQSSKEYSLSGPDRLSVRLQIPDVGERVKKIKKLMT LLKN >NE1742 mutL, mutL; DNA mismatch repair protein MRPIKLLPDGLISQIAAGEVIERPASVLKELLENAIDAGTTDISVNIAQG GLKLIRVTDNGGGISGEELPLALTRHATSKIASQEDLYRITSLGFRGEGL ASIASVSNLLLISHQPGGKHAWQIRSEGIRVMQPEPSSHAAGTTVEVRDL FFNLPARRKFLKTEATEFAHCEEIIRRMALSHAGIAFTLRHNGNLRGHWQ SAEAAVRIKTVLGEEFTRSAAWIDERSAGIGLQGMLALPAYSRAARDMQY FFVNGRFVRDKLITHALREAYRDVLHLDRHAAFVLYLDIDPEQVDVNVHP TKTEIRFREARAIHQFIYHGVSKALSLPRSGTELSQSSSQLMADDIVPPA EKRVPAAPMLNYPRQTGLPSEMIAQPFNFYQVLSGSESDSTATQNPFRQT GAGESNEHPALPPLGFALGQLHGVYILAQNWKGLVIVDMHAAHERIVYEQ LKLQMDEQTLSAQRLLIPVTFHADSLDIATAEENQSLLQQLGFEVTVLTA TTLAVRAVPAILQDADTEKLVCNVLDEIRNGDPGQLLAARRNELLATMAC HGAVRANRPLTLIEMNELLRKMEVTERSDQCNHGRPTWFEISLAELDKMF MRGK >NE2552 mutM,fpg, Formamidopyrimidine-DNA glycolase MPELPEVEITRRGIDTHLAGRVITQISIRNPVLRWPISAGLIALLPGQRI NAIARRAKYLLFACSRGTLIMHLGMSGNLRVLPESTPPQLHDHFDLQVDN GMMLRFRDPRRFGAILWWDGDIRQHPLLQKLGPEPLSDDFDGQFLYTKTR GRNASIKEVLMNQHIVVGIGNIYANEALFQAGISPLAAAGSLNTMQCERL VDAVKATLLRAIKAGGSSLRDFTDCEGSPGYFQQQYWVYGRAGQSCRQCG ELVSKTRQGQRSTFFCARCQH >NE1705 mutS, mutS; DNA mismatch repair protein MNKAEQSSHTPMMQQYLRIKAQHTDKLLFYRMGDFYELFYEDAEKAAKLL DITLTQRGSSAGEPIKMAGVPFHAADQYLARLVRLGESIAICEQTGDPAT SKGPVERQVIRILTPGTLTDAGLLEERSNSIVLALALHRGSIGLAWLNLA AGDMRVLETSSDNLTSELERLHPAEILLPESLDLPATLNNFAGPKRLPDW QFDYEHAMQQLTRQFGTRDLNAFGCEDLHAAIMAAGALFEYVRLTQQTAT DGSSGQLPGHLHTLQVERQDAYLRMDAATRRNLEITLTLRGEDAPTLSSL LDTCSTGMGSRLLRHWLHHPLRNRITLQQRLDTVSDLIGAQPETLYAGIR QQFKHIADIERITSRIALRTARPRDLSGLRDSLMRLPGIIELIATSAAAA VHRFIPPMQPDPLLTQLLVRALQPVPGAVIREGGVIADGFDAELDELRGL QGNCDEFLLQLEARERERTGIPNLKVEYNRVHGFYIEVTRAQGEKIPPDY RRRQTLKNAERYIIPELQAFEHKTLSAREQALAREKMLYERLLEQLADFI IPLQEIARSVAELDVLCAFAERAALSGYTKPVFTDDPVLIIEAGRHPVVE NQVEHYIANDVQLGAITRENRQMLVITGPNMGGKSTYMRQTALTVLLAHC GSFVPAQIARIGPIDQIFTRIGAADDLAGGRSTFMVEMTEAAGILRNATA QSLVLVDEIGRGTSTFDGLALAFAIARHLLTQNQSYTLFATHYFELTRLA EEFPQAVNIHVTAVEHKRRIVFLHRIEEGPASRSYGLHVAALAGVPDRVI RNAAKILARLEQETLSRSPQQTLFETVEENAKAVPASVHPVLDYLERIHP DELTPRGALEQLYLIKSMLNQTD >NE0056 mutY, HhH-GPD MTPRTAGTIHFPADAPDSFAGRLIRWQLECGRHSLPWQGTRDPYAIWVSE VMLQQTQVSSVIPYYQRFMASFPDVASLAGVPVGDVLTLWSGLGYYSRAR NLHRAACVIMEQYSGVFPQDAATLQRLPGIGRSTAAAIAAFAFGERGTIL DGNVKRILARYFGISGYPGEKSVEERLWQLAESLLPAEESNHQIVVSYTQ ALMDLGALVCARSRPRCQYCPLQADCIACQNDLTADLPVPKPRKTLPVRE TVHLILLDQERILLKKRPASGIWGGLWCFPEMSVDQDSIDYCEKNLHVRV TKLARLPHLQHTFTHFKLIIQPHLLQSIMHQPVCEEKCEENSYLWLTIEQ AMQQAIPVPVRKLLSMAYPYFQYHIHE >NE2223 nth, HhH-GPD:Iron-sulfur cluster loop (FCL) MNTTKRREIFTRFRAANPRPTTELEYQTPFQLLIAVILSAQATDKSVNLA TRKLFLVADTPEKILQLGETGLSPFIQRIGLFRTKTRNILATCQLLIEQY NGEVPRTRTELEKLPGVGRKTASVILNTAFGEPTIAVDTHIFRVANRIGI APGKNVLEVERKLLKVVPDEFRHDAHHWLILHGRYICKARKPLCHQCLIV DLCEFKEKNLEGTASSLDMKQLT >NE2253 ntpA, NUDIX hydrolase MQRYKLPVSVLVVIYTADLQVLLLERADHPGYWQSVTGSQDPGETLLQTA VREVREETGLNTDDYVLSDWQIQNRYEIFEEWNWRYPPGTTHNTEHVFGL ELPKTIPAVVSSREHLGYVWLPWREAAEKVFSSSNACAIRMLASKRKSEN SR >NE0885 ogt, Methylated-DNA--protein-cysteinemethyltransferase MNYYTFLESPVDRLLLTSDGEFLTGVYMEIEIQKLLPRMTDDWRQDAAPF AEAIAQLNAYFAGELIQFDLPMKATGTPFQEAVWQSLSTIPYGETVSYKN IAERLHLPKAARAVGMANGQNPISIIIPCHRVIGANGKLTGYGGGIHRKQ WLLAHEDKQTSFA >NE1468 polA, polA; DNA polymerase I protein MKTLLLVDGSSYLYRAFHALPDLRNRLNEPTGAIYGVLNMLRRLHKEYRP DYSACVFDAKGKTFRDDIYPQYKAHRPPMPEDLVCQIGPLYACIRAMGWP LLIEEGVEADDVIGTLVERAIARQAQCVIATGDKDIAQLVRPGIWLVNTM NNESLDESGILQKFGVTPAQIIDFLALVGDSVDNIPGVEKVGPKTAVKWL DQYGTLDDLIAHADEIKGVVGENLRKALDWLKVSRKLLTIKCDVPLAMDW QDLVAVPPDTARLTELYEHLEFRSWLRELKQPGPEKNEKAESSVMAAIVD DPSVPEGENDDGRDYQIILTDAQLGDWLAQCESAELVSIDTETTSLNPME AKLVGLSFCMELGQAAYIPLAHHYPGVPSQLNREQVLQRLKPWLESDEKL KIGQNLKYDRHVFANHGVMLNGIVHDTLLQSYVLESHLSHDLDSLASRHL GIQTISYDEVTGKGAKRIGFEQVEIHRAGIYAAEDADIPLRLHRVLYPVI SQDAHLEYIYQQIEIPLLEVLFRIERNGVLLDTDLLRVQSGELTQQLVAL EQQAHSLAGHAFNLNSTKQIQEILFGQHKLPVIKKTPKGVPSTDEEVLQR LASDYPLPKVLLDYRGLAKLKSTYIDKLPQMVNKQTGRVHTHYAQAVAVT GRLASNDPNLQNIPVRTPEGRRIREAFIAPDGWLIMSADYSQIELRIMAH ISGDAGLIHAFSEGQDIHRATAAEVFGVPVEQVNPEQRRYAKVINFGLIY GMSEFGLATQLGIERTAARTFIDRYFARYPGVADYMQRTRELAKQHGYVE TVLGRRLQLSDIRSNQRNRQMGAERAAINAPMQGTAADIIKLAMISVHRW LAEAQLQSKLIMQVHDELVLEVLVDELPVIKENLPRLMENVLKLDVPLKV QTGIGKNWDQAH >NE1505 priA, probable priA; primosomal protein N' (replication factor Y) MVIIRVALDVPIDRLFDYLAPDADTADIGRCVRVPFSSRQISGIIISVCE TSSVPEGKLKYAGQIDRQTPPLPQPLLGLFEFCSRYYHHPIGQVVMNGLP VLLRKFKHTGKEQPPSWRLTDTGKSITLADLPIRAKAKRQLISLLSEHGI ITAEICKAMSSHSRKLLHEFKDLGWVEQFTALPEKAVFSTASSPAPTAEQ AQAISEILDRTGTFTPWLLNGITGSGKTEVYLQVTASLLAQQKQVLILVP EINLTPQLEAVFRKRFPGTTLVSLHSGLNNSERLQGWLQAQRGKAGIVLG TRLAIFTPMPELSLIIVDEEQDHSFKQQDGLRYSARDLAIYRARQANIPV ILGSATPSLESYHQARTGRYRLLQLHSRAISQAALPTIRCIDLRVIPAQE GLSEPVLDALRHCLARKQQSLVFINRRGYSPVLLCKSCRWIATCKRCSSR LVVHLRDRQLRCHYCGDQQPVSPACPQCGDPDVLPFGHGTQRVEAALIRH FPEARILRVDRDSIRHKGAWQQMLDRIHRGEADILVGTQLLAKGHDFPNL ALVCALNADASLYSTDFRAEEHLFAQLIQVAGRAGRANVPGSVLIQTEFP QHPLYQALIRQDYAAYAQAHLKERRSAGFPPFVYLAVLRAEAPVLTDALE FLRQAAALAAVTENYPHIQLFDPVPAHMTRLKGLERAQLLIQARSRRHLQ TFLGDWHQRITALPVHSRIRWHLDVDPLTL >NE1464 radC, DNA repair protein radC family MAISDWPEAERPREKLIEKGAAALSDAELLAIFLRTGITGVSAVELARKL LTHFGSLTKLCAASLHEFSELPGMGPAKFAQLQAVMEMAKRALAEELKNG DIMDSPQSVRNYLCLSLKGKPYEVFVGIFLDARHRTIVTEELFNGTLTQA SVYPREVVKRALYHNAAAMIFAHNHPSGIAEPSTADEILTQSLKQALALV DVKVLDHFVIGSSEVVSFAERGLI >NE0507 rdgC, putative recombination associated protein rdgC MWFRNLLIYRLAGEVITSDELEAYLAKQTLQGCLGLEPQSRGWVPPGIAE ADLVYSYGQQMLIALGTEKKLLPASVVNQLAKVRAQEMESHQGYAPGRKQ MKEIKEAAYRELLSRAFAIRQRSHAWIDPVGGWFIVEGASASKADALIEA FIKSTGIGLKRIRTTMAPTSAMTAWLSGDDPPAIFSVDSDSIFRSREDKK VSVSYIRQSPDPQEITRHVRTGKEVIRLAMTWRDKISFILDENLQLKRLT LLDIDREPAETAEEQFDSNFFLMTEELRQLLPDLVEILGGMTAD >NE1932 recA, RecA bacterial DNA recombination protein:AAA ATPase superfamily MDENKNKALSAALAQIEKQYGKGSIMRLGDSDVAKDIQVVSTGSLGLDIA LGVGGLPRGRIIEIYGPESSGKTTLTLQAIAEMQKLGGTAAFIDAEHALD PQYAQKIGVNVQELLISQPDNGEQALEITDMLVRSGSVDVVVVDSVAALT PRAEIEGEMGEPQMGLQARLMSQALRKLTANIKRTNTMVIFINQIRMKIG VIFGNPETTTGGNALKFYASVRLDIRRTGSIKRGEEMVGNETRVKIVKNK VAPPFKQADFDILYGEGISRESEIIELGVLHKLIEKAGAWYSYNGEKIGQ GKDNVRDYLKEHKSIAHEIEQKIRAAVGLAETDSRVVPPSSGE >NE1850 recG, RecG-like helicases MAAHFFDSLDEALRKKLEKLGLFSDFDLVLHLPLRYEDETRLSPISQAVP GSTVQVEGVVAEQEVLVRPRRQLVCRVDDDSGTLYLRFFNFYASQVTAWS PGTRLRVLGEVRAGFHGVEMVHPKCRVVRGSMVLANTLTPVYPGMAGLPQ RTLARLIMQAFERLRAKRLLQETLPATILSACQFPAFEDSLSILHCPPAG VSITSLQQRSHPAWFRIKFDELLAQQLSMRCHYHQRRSQQAPVLQQQTGL QQALLEVLPFGLTDAQCKVVTEISKDLAQPYPMQRLLQGDVGSGKTIVAA LAALQSIGNGYQVAVMAPTEILAEQHFRKLSDWLTPLGVGVGWLSGSQKK SLRNQELERTATGEAMLVIGTHALFREAVQFKCLGLVIIDEQHRFGVGQR LALRMKGGDEEVIPHQLMMSATPIPRTLSMSYFADLDVSVIDQLPPGRSP VVTRLIDSSRREEIVARIREACLAGRQAYWVCPLIEESEALQLKTAVETY ETLSQTFPDLRIALIHGRLDSDEKSVIMAEFSQGEVQLLVATTVIEVGVD VPNASLMVIEHAERMGLSQLHQLRGRIGRGSATGVCVLMYQQPLSEVARK RLQIIFEHRDGFEIARQDLLLRGPGEFLGTRQSGVPLLRFANLEEDIDLL EMARNAAENMLRDHPLAAQCHMQRWLGRKEDYLRA >NE0010 recJ, recJ: single-stranded-DNA-specific exonuclease MANITIREFPAHAYEILSAHGFPSVLARIFAARGINHPEQLETTFARMAS FEQLKNIQRIAVLLADAIAAKKRLLVIADYDTDGATACAVALRALRQFGA MVEYLVPNRFEYGYGLTPEIVRLAADQVPPPDILITVDNGIASVEGVEEA NRLGMQVFITDHHLPGDRLPDAAVIVNPNQPGCSFPDKHIAGVGVIFYVM LALRAELRERSAFTATGKEPNLASLLDLVALGTVADVVRLEGTNRILVQQ GLQRIRNGYCCAGIHALFKAAGRDFSRVTTYELGFILAPRLNAAGRLDDM SLGIECLLTEDESHALRLASELDELNRQRREIESGMRDEAMDKLDDVIDL LNQSDTPADNGKQSVYSLCLYDPAWHQGVIGLIASRVKDRLHRPVIIFAQ GNEGEIKGSGRSIPGLHLRDALDLVAKRYPGLIVKFGGHAMAAGLTVYEQ HFEQFRTAFEQVAQSLLTPADLIQVIETDGELAETDLTLELAQYLTNQVW GQGFPEPSFNGCFRVENQRIVGEKHLKLKLRKTGAAQVYDGILFFHTERL PTEIDAVYRVQINEYNGSTRMQLLLEHWFESGQAHYG >NE1479 recN, ABC transporter:DNA repair protein RecN MLQNLSIRNFIIVDHIDLHFKSGFTVLTGETGAGKSILIDALELVLGRRA DTSQIRYGCKRAEITAQFSVNTIPALQEWLVENALEDETGICLLRKIMES GGRSRNFINGHPATLQQLRTVGEWLVDIHGQHAHQLLMHGHKQCELLDAW AGESNLAREVASAYRHWQDLCQQRLAWEQHSEQNLQEHETLTWQLQELAA LNFSLEEWENLQIEHNRLTHTASLLETAQFSLESLSENETAVLAQLSTVL TRLNSLIDIDNTLEPLCNQLQSAQIQLQEIVYELKRYQQHLDIDPRRLQE TETRIAAIHGTARKYRIMPEILPDLLETTRQRLESLENAASSEALMKAEK SARNNFENLAARLSQARQHAADQLSGLVTETMQTLAMAGGRFNVALIPIP SGNLHGMEQIEFQVSAHRDLPLRPLNKVASGGELSRISLAIQVITSKAGT VPTLIFDEVDTGIGGRIAAIVGKLLQQLGKTRQVMSITHLPQVAARGDHH WRVSKTSETEDEQLPASHISELDAAERTEEIARMLGGENLTAATRQHAAE MLGYDKQNQST >NE2564 recQ, ATP-dependent DNA helicase RecQ MISHAQTLLREIFGYSEFRGQQAEIITHVVNGDSCLVLMPTGGGKSLCYQ IPALLRKGTAIVISPLIALMENQVAVLCRQGVRAVYLNSALTPEAAAAVE RRMLAGEYDLVYVAPERLLTVRFRALLQRIPIALFAIDEAHCVSQWGHDF RPEYGKLSILPEKFPQIPRIALTASADARTRADILRCLDLHQARSFISSF DRPNLCYRITARSNSRIQLLNFIRSQHAGEAGIVYCQSRRKVEETAAWLN SNHIPALAYHAGMETSIRTRHQKKFLQGHGIVMVATSAFGLGIDKSDIRF VAHLDLPKSIESYYQETGRAGRDGLPASAWMVYGPGDIIRLRSQTESGTE RLPAPIRQAAAARLDALLVLCETTVCRRKPLLDYFGEPTGSLPCGNCDAC LETIPVQDVTIAAQKALSCVYRTGQCFGMEYLIDILSGKRTDRVRQWGHD CISTFGIGHELSTEGWRIVFRHLLALDYLVAGEDRAGGERIALQLTSAAR SVLRGETRIKLRLSHHHHSAPYQQISTGLSVPSSRCQAFSCEPQTKCGG >NE2040 rhlE, rhlE; ATP-dependent RNA helicase RhlE MSNDVTFAQLGLSSEILHAVNDEGYVNPTPIQAQVIPSILAGKDVMASAQ TGTGKTAGFTLPLLYRLQAYANTSVSPARHPVRALIMAPTRELAMQIDES VRKYGKYLALRTAVVFGGINIEPQIAALQAGVEILVATPGRLLDLVEQKA VNFSKTEILVLDEADRMLDMGFLPDIKRVMALLSPQRQSLMFSATFSGEI RKLADSLLKQPVRIEAAVQNTVNESISHVIHWVKPDSKFALLLHLIRQQN LKQALIFVKTKHGASHLAQMLSRHEISAVAIHGDRNQQQRTQALAEFKHG DVQILVATDVAARGIDIEKLSHVINYELPGNPEDYVHRIGRTGRAGSKGK AISLVSEHEKELLANIEKLLNAKLETEQIAGFDAEQFARSLPDRKNRMSA GNSRYGNKPMENGSEKSRSEKHRKLPSSQKYSGSRRGGTQKYSDPIFTQP YVPQANSTQSTTPKQPEIQSLFLTYRQEKKTIPALFTALSKSKAGQEN >NE0140 rnhA, probable ribonuclease hi protein MQLEEGVKLVEIFTDGACKGNPGIGGWGVCLKFDGEVREFFGGEPVTTNN RMELLAAIRALQALESLPDTGQSLRVQLHTDSQYVQKGISEWVHSWKKRG WLTADKKPVKNEALWKELDQLSRRYQVEWFWVRGHNGHDGNERADMLANR GVVSVLSEKAD >NE1707 rnhB, Ribonuclease HII and HIII MAERRIPLKHEYAQDGKVIYGVDEAGRGPLAGPVYAACVVLDPADVIEGL ADSKQLSEKKRISLADQIKQRARAWAIASASVEEIDRLNILQASLLAMQR AVVSLRPISNALVLVDGNHAPRLDCEVQTVIRGDSLVAEISAASILAKTA RDIEMLRLHEAYPVYGFDRHKGYPTKAHLEAIRLHGITDIHRRSFAPCVG QSVSGARTTSFINQKEA >NE0212 ruvA, probable Holliday junction DNA helicase subunit MIGRIAGLLLEKHPPLVLVDVNGIGYEIDVPMSTFCRLPGIGEQVTLHTH FWVREDAHLLFGFMTEPERVLFRQLTKISGIGARTGLAILSGLSVNDLHQ IVVSQDSTRLTRIPGIGKKTAERLLLELRDKISPAITLPETGTAMASSTD KDILNALSALGYNDREANWAVGQLSEGVTVSDGIMQSLRLLSKAK >NE0213 ruvB, ruvB; holliday junction DNA helicase protein MIESDRIITASPFSSQEEVIERALRPVQLDDYVGQEKIREQLKIFIEAAR LRQEALDHVLLFGPPGLGKTTLAHIIAREMGVNLRQTSGPVLERAGDLAA LLTNLETNDVLFIDEIHRLSPVVEEILYPAMEDYQLDIMIGEGAAARSVK IDLPSFTLVGATTRAGMLTNPLRDRFGIVSRLEFYTADELGKIVTRSAGL LNVDVTADGAREIACRSRGTPRIANRLLRRVRDFAEVRANGRIDRPVADA ALQMLDVDATGLDVLDRKLLLAVLEKFGGGPVGVDNLAAAINEERDTIEE VLEPYLIQQGFLQRTPRGRMATTMAYQHFDIIPSHQTTVPSLFDPD >NE0211 ruvC, Crossover junction endodeoxyribonuclease RuvC MTSLVYAAKGIRILGIDPGLRITGFGIVEKIGNRLVYIGSGCVVTGESGL PDRLKTILDGLNEIILQHKPEQVAVEQVFVNINPKSTLLLGQARGAAISA AVLHELSVYEYTALQVKQAVVGNGHARKEQVQEMVMRLLGLGERPRPDAA DALACAICHAHGGTGLLTLSARNRSKRSKRL >NE0671 sbcB, exodeoxyribonuclease I MQTGNSTLYWHDYETSGATPRWDRPFQFAGLRTDEALNEIGDPLVIYCQP ARDRLPHPEACLLTGITPQMAEARGLPEPEFIALIHAQLAQPGTCGVGYN TLRFDDEVTRFTLYRNFYDPYAREWQSGNSRWDVIDLARMTFALRPEGIN WPINGEGKPSFRLEDITTANGLVHDSAHDALSDVRATIALARLLRAQQPR LYDWLFRLRDKRAAGNLLDMKTHAPVLHTSRMYSSEYGCTTLVMPLLPET GNANSVLVYDLRHDPAEFVLLDIDALAERLFTPKEELAEGLQRLPVKAVR LNKCPALAPQKVLNDEVANRIGLNVEQCQQHWQLLLQHPDFMQRIKQAYS GNKVFAENDADLALYDGFASDHDRNLFPLVRDAEPGKLADLAGKFQDERY IELLFRYRARNFPDTLSVQEEHHWQMHCRRQLGENAINGSLTLNEYHQKL LQLRTDCPQQAQLDILNELEAWGRVLAQENDLPWPPDHSGSEEQTD >NE1392 sbcC, ATP/GTP-binding site motif A (P-loop):ABC transporter MQILQVRLKNLNSLVGEWEIDFTDPAFVSDGIFSITGPTGAGKTTVLDAI CLALYGRTPRLGKVSKSENEIMSRQTGECFAEVTFAAQSGCFRCHWSQHR AHKKPDGELQNPRHELAEADSGKILETRLTEVGRQIEKITGMDFARFTRS MLLAQGEFAAFLQAAPDERAPILEQITGTEIYSRISIRVHETRVSARREL DRLSAGLNGIQPLTRADEQQFHTDLAQKIQQDAGLNEQIAHERQILVWLE NIARLESELHLIAGQQQAWLSRKEALTPDISKLDSASRALELLGEYSRLT SIRNEQETDRNNLAICAASLAGLEQAVKQAEQSLKSLNEQCDRQRAKQRE TIPLLRKVRELDIQIREKESPISTASKGITAQKKTLAALRNQYQQNEIQL AGLQTTLAGLLQQLHVIQPDGAQMDFAHNQNLLNRKQAEYRQLLENRSLA DWRQEIAVLSGQKTLATRAIEAMQSLAASKQISAELEKHTCSLLAGKTQL AKQLGAEEEMLGALEREISLLETQALLLKKISHLEEARQQLKDDEPCPLC GALQHPYAAGNTPRPDDNITALNQARTMLKTRIDTISTLKIRQAETNRDI EQTACRQQEIHRQIQADETLLQQCAVSLFPGLPSAAMFPELPRLLQETDD KLARMTRILQTAEILENEISVQRESLDKTRELEQKIGILRVQHQHQSTQI RQHEAELQLRQEQLDQLQQELGNLRTTRLQLFADKQPDQEEQSLTTAIEA AQKSADNARQQLETEIQQYNRLKNRAEDLVKTITTRAVQLEKLQETFAAR LTQSGFADEAGFTAACLPEEERRRLAQRAQQLADEKTMLDTRQKDKTIAL QAEQLKSMTDQPRDFHDQVLAQLITRQQVLQQEIGGLRQKLADNENSKQK QQEQLQVIEAQKRECARWDLLHGLIGSADGKKYRNFAQGLTFEVMIRHAN RQLQKLSDRYLLIRDPVRPLELNVIDNYQAGEIRSTKNLSGGESFLISLS LALGLSRMVSRNIRVDSLFLDEGFGTLDEEALDTALETLAHLQQEGKLIG IISHVTVLQERISTRIQVIPRSGGRSVLAGPGCRHCQ >NE1390 sbcD, Serine/threonine specific protein phosphatase:Exonuclease SbcD MKILHTSDWHIGKTLYGHKRYDEFEAFFSWLVETIEQEQVDVLLIAGDIF DTSTPGNRSQQLYYRFLHRVAASACRHVVIIAGNHDSPSFLSAPRELLRA LDVHVTGSLSGNPADEILVLHDPKGDAELIVCAVPHLRDRDIRTAEAGES MEDKSRKLVEGIRDHYAEVINLARLQRTALSSSIPIIAMGHLFVAGGQTV EGDGVRELYVGSLAHVPAGIFPPDIDYLALGHLHVPQRVNGSSVMRYSGS PLPIGFGEADQEKSVCLIEFNRQISATRPAVSLINIPVFQPLERIRGNWQ VISDRISMLSAANSCAWLEINYEGDEMITDLQERLQSAIEGSRLEILRIR NNRIMNQILDQIDDGGTLEELSVNEVFEHCLSAAAIPVEQRTELWRTYQE TLVSLDEEDIRAE >NE1968 smf, SMF family MQIDQDIESWLRLGLTEGVGGGALRRLLIAFGDPARVLAASRPALEGVVK KPVATSIFLRKVDEERLARTIKWLEDPLNSLITLADSDYPKLLLNISDPP PILYFKGQRQFLAQPAMAMVGSRNATPQGLANADAFAEAASNAGFCIISG LAQGIDTAAHQGGLRGASSSMAIVGTGLDLVYPSRNHELAHKLANEGGLI SEFPLGTPAISRNFPRRNRIISGMCHACLVVEATLYSGSLITARLALEQG REVMAIPGSIHSPLSKGCHALIKQGAKLVENIQDILDELHYQPQPVPRFE SVADEGGGTGVLTGEGDDTGLLMYFSYDSTDIDTLCARSGLTVETVSAML LGLELEGRIGSLPGGKYQRIR >NE2453 ssb, Single-strand binding protein family MASLNKVMLIGNLGRDPEIRYMPSGDAMANLNIATTDTWKDKGGEKQERT EWHRVVMFGKQAEIAGEYLKKGSQIYIEGRLQTRKWTDKSNVERYTTEIV ADRMQMLGGRSGGGSYDPPADRDHDYQSQSTPPAKSNTGFDDMEDDIPF >NE0338 sss, Phage integrase:Phage integrase N-terminal SAM-like domain MNQPHDERDTPLPPLLSEYLAYLASTRSLSLLTQHSYRRDLVALVCCIAA QHQSEHENGHEVTDASLTRLHSHDIRHFIAHLHHGGLSGRSLARMLSAWR GFYRYLMRHHHHTENPCQDIRVPKSPRKLPHALSPDEAAQLLAFDPADAL ATRDLAMFELFYSSGLRLAELTRLQPTDIDFSEGIVRVTGKGSKTRIVPV GEPALRALQAWLPLRSAWLTSGETALFLSRHGQRIHPRTIAVRLHQRARL QNLDDRVHPHALRHSFASHLLQSSGDLRAVQEMLGHSSIRSTQVYTHLDF QHLAKIYDQAHPRAKKRPKTG >NE0154 tISRso8a, Transposase IS911 HTH and LZ region MERLPKGIYTPEFRAEAVKLVEAEGLSVDAAAKRLLVPKSSLGNWVRASR TGSLAKVGQGQRVPTETEIELARLRKELAEVKLERDLLKKCAAYFAKESR >NE0835 tnpA, Transposase Tn3 family MPRRSILSAAERESLLALPDTKDDLIRHYTFSDTDLAIIRQRRGPANRLG FAVQLCYLRFPGIILGVDQPPFPPLLKLVANQLKVGIESWDDYGQREQTR REHLVELQNAFGFQPFTMSHYRQAVHTLTERAMQTDKGIVLADALIEHLR RQSIILPALNAIERASSEAITRANRRIYEALSEPLSNGHRHGLDDLLKRR DNSKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPPGIERLVHQNR LLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRI LGKLFNAAKNKHQQQFQASGKAINAKVRLYGRIGQVLIDAKQSGGDPFAA IEAVMSWDAFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLDVL KLRAAPAAKDVLDAIEVLRGMNTDNARKVPADAPTDFIKPRWQKLVMTDA GIDRRYYELCALSELKNSLRSGDIWVQGSRQFKDFEDYLVPPAKFASLKQ SSELPLAVATDCDQYLDDRLTLLEAQLATVNRMAAANDLPDAIITESGLK IMPLDAAVPETAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKS GDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDET YSTALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKY GSSPGRTFYTYISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEH YTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLFVPKGEASYDALKPMI SSDKLNIKAIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVA LRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGE IRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAHALRGNGHAVDDALL QYLSPLGWEHINLTGDYLWRSSAKIGEGKFRPLRPLQPA >NE0751 tnpR, Site-specific recombinase MPGKRIGYVRVSSFDQNPERQLEGIQVDRVFTDKASGKDIQRPQLDMLLD FVREDDTVVVHSMDRLARNLDDLRRLVQDLTGRGIRVEFVKEGLIFTGED SPMANLMLSVMGSFAEFERALIRERQREGITLAKQRGAYRGRKKSLNSEQ VAELKRRVVAGEQKALIARSFGISRETLYQYLKTVD >NE0836 tnpR, Site-specific recombinase MQGQRIGYVRVSSFDQNPERQLEHVEVGRVFTDKASGKDTQRPELDSLLA FVREGDTVVVHSMDRLARNLDDLRRLVQKLTKRGVRIEFVKESLTFTGED SPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGVYRGRKKALSPEQ VAELRQRAAAGEQKAKLAREFGVSRETLYQYLRLDQ >NE1966 topB, topB; DNA topoisomerase III protein MSKKLIIAEKPSVASDIARALGGFVKQKDYFESDEFVVSSAIGHLLELIV PEEYEVKRGKWSFDHLPVIPPRFDLAPIEKTTDRLKLLSKLIKRKDVDML INACDAGREGELIFRYIVRHVGSKKPIKRLWLQSMTPSAIREAFANLLND AEVQSLADAAVSRSEADWLVGINGTRVMTAFNSQEGGFHKTTVGRVQTPT LAILVEREEAIKKFVVRDYWEVHATFQAESGVYKGKWFDEGFSKRKDESE SRADRIWDHAKAEVIRDKCAGRTGVVTEESKPSRENCPLLYDLTSLQRDA NSRFGFSAKVTLGLAQALYEKHKVLTYPRTDSRALPEDYPAIVKDTLQVL KGSRYDRFASQILESDWVKPNKRIFNNAKVSDHFAIIPTALVPKKLNEAE EKLYDLVTKRFLAIFYPAAEFLITTRITRVENEPFKTEGKVLVHAGWQTV YGKVESAQGQEEESVLVAVTPGETVLAQEVAVVAGKTRPPARYNESTLLS AMEGAGKLVEDEELRAAMSAKGLGTPATRAAIIEGLIHENYVERSGRELQ PTAKAFSLVTLLRGLKIPELISPELTGDWEFKLRQIEQGQLKRDVFMEKI AAMTRHIVEQAKNHRDKTISGDFATLQVPCPGCGGVIKETYKKFQCQQCD FALWKILAGRQFEAAEMETLISTREIGPLSGFRSKMGRAFNAIVRLTDDY EMKFDFGNEADQAQEKVDFSAQQPLGKCPQCGHSVYEHKLLYVCEKSVGA GAPCSFRTGKIILNRAIEAEQVVKLLQTGRTDLLAGFVSRKGRPFSAYLV VGPAGKIGFEFEQKKTKSKPADTVPETGKAAS >NE2455 uvrA1, ABC transporter:Excinuclease ABC A subunit MELIRIRGARTHNLKNIDLDLPRNQLIVITGLSGSGKSSLAFDTLYAEGQ RRYVESLSAYARQFLQLMEKPDVDLIEGLSPAIAIEQKATSHNPRSTVGT VTEIHDYLRLLFARVGEPHCPEHGIGLAAQSVSQMVDQVLQLPTDTRLMI LAPVVTGRKGEQAELFDELRAQGFVRVRLDGEVYDIDALPKLQKTKKHTI EVVVDRLKISPEVKQRLAESFETALRHAEGRALAVEMDSGKEYLFSARFS CPVCSYALSELEPRLFSFNNPAGACPKCDGLGQITFFDPARVVAFPYLSL AAGAIRSWDKRNQFYFQMLQAVANHYHFDLEIPFEQLSKEVQQAVLYGSG KEKITFTYLNEQGRAHQQVHPFEGIIPGLERRYRETESQTVREELAKFIN ARECPECGGTRLCREARHVTVNGETIFAISAWPLRQAKQFFDDMELTGHK QSIAERIIREISSRLQFLNNVGLDYLSLDRSADTLSGGEAQRIRLASQIG SGLTGVMYVLDEPSIGLHQRDNERLLDTLRHLRDLGNSVIVVEHDQDAIL LADHVVDMGIGAGEHGGCVVAEGTPTAIQANSASLTGQYLSGKRSIAIPS TRTPPNPERMLTIRGAAGNNLKQVQLNLPVGLLICVTGVSGSGKSTLIND TLYRVVARHLYGSHTDPAAYQEIDGLGFFDKVIDINQSPIGRTPRSNPAT YTGLFTPVRELFAGVPQARERGYSPGRFSFNVKGGRCEACQGDGVIKVEM HFLPDIYVACDVCHGQRYNRETLEIQYKGKNIHEILQMTVENAHAFFEAV PTIARKLQTLLDVGLGYITLGQSATTLSGGEAQRVKLSLELSKRDTGRTL YILDEPTTGLHFQDIDLLLKVLHRLRDNGNTVVIIEHNLDVIKTADWIID LGPEGGAGGGRIIAEGTPETVASIPGSFTGYFLQPLLSTTLTG >NE0785 uvrB, Helicase subunit of the DNA excision repair complex MIITFPGSPYKLNQAFQPAGDQPEAIRILVEGIESGLSFQTLLGVTGSGK TFTIANMIARLGRPAIIMAPNKTLAAQLYAEMREFFPENAVEYFVSYYDY YQPEAYVPSRDLFIEKDSSINEHIEQMRLSATKSLLEREDAIIVATVSCI YGIGDPVDYHGMILHVREHEKISQRDIIQRLTGMQYQRNEFEFARGTFRV RGDVLDVFPAENSETALRISLFDDEVESMTLFDPLTGQTRQKVSRYTVYP SSHYVTPRSTTLRAIETIKTELTGRLNYFHENHKLVEAQRLEQRTRFDLE MLNELGFCKGIENYSRHLSGRQPGDPPPTLIDYLPDNALMIIDESHVTVP QIGGMYKGDRSRKENLVAYGFRLPSALDNRPLRFEEFEKLMPQTIFVSAT PADYEIQRSGQIAEQVVRPTGLVDPVIIIRPVTTQVDDLMSEVSLRAAQN ERVLVTTLTKRMAEDLTDYFSDHGIRVRYLHSDIDTVERVEIIRDLRLGK FDVLVGINLLREGLDIPEVSLVGILDADKEGFLRSERSLIQTMGRAARHV NGTVILYADKITNSMRRAIDETERRRNKQKLFNQQNNITPRGVNKRIKDL IDGVYDSENAAEHRKVAQIQARYAAMDEAQLAKEIQRLEKSMLEAARNME FEQAAQYRDEIKNLRSKLFIGIIDPDEIREVPQTAGKKSRRKAGR >NE0933 uvrC, uvrC Nuclease subunit of the excinuclease complex MPDAHFDGKAFVLTLPAQPGVYRMLNAAGDVIYVGKAIDLRKRVSSYFQK SGLSPRIQLMVSQIAGIETTVTRSEAEALLLENNLIKSLAPRYNILFRDD KSYPYLLLTRHIFPRLAFYRGALDDRHQYFGPFPNAGVVKSSIQLLQKVF RLRTCENSVFDHRTRPCLLYQIKRCSGPCVGLITPEAYQQDVKSAAMFLQ GKQDEVLKTIEQKMFTASDQQDYEQAAQLRDQMQALRKIQEKQFVDSGKA LDADVIACAIEPDSHAVAVNLVMIRSGRHLGDKTFFPQNVYEADISTVLE AFVTQHYLNRSVPPLIILGQKIRVTLLQKLLSDQAGHKITLTTNPIGERR KWLDMAAENAQLALQQMLIQQASQEDRLQALQEALNLPGLARIECFDISH TMGEATIASCVVYDRFAMRNGEYRRYNITGIVPGDDYAAMRDVLQRRYAK LAMEEGKLPDLILIDGGKGQIRVASEVMIELGLNDIPLVGVAKGETRKPG LEQLILPWQEEALHLPDDHPALHLIQQIRDEAHRFAIQGHRAKRAKTRKI STLEQISGIGTKRRQSLLTRFGGLKGVKNASIEELQQTEGISRSLAEKIY RELR >NE1473 uvrD, UvrD/REP helicase MTALLTDLNPEQLEAVTWSHQSALVLAGAGSGKTRVLTTRIAYLLQSGRT RPQNILAVTFTNKAAREMVARIGAMLPVNTRAMWVGTFHGLCHRVLRAHH EDAGLPQAFQILDMADQLAVIKRVLKERSLDEKMLPPRQLQWFINNAKEE GLRASQVDVHGGFNQTLAECYQAYEIVSMREGTVDFAELLLRCYELLSRN EILRDHYRSRFEHILVDEFQDTNRLQYKWIKLLAGPGSQQHAAIFAVGDD DQSIYAFRGAHVGNMRDLEKDFSVPKIIRLEQNYRSHGNILDAANALIEH NKGRLGKNLWTAAGKGEPVRVYHAATDMDETSFIIDEIKALHADGLALSD IALLYRSNAQSRVLEHGLFNASVSYRVYGGMRFFDRQEVKHALAYLRLIA LPDDDNALLRIINFPPRGIGARTLEQLQDQAAMLGTSLWQAAFKVYEGGK AVATRNSQPGRGIAGFVSLVLSMQQDGEGLPLPEIIRRVIDQSGLAAHYQ AEREGGERLENLKELINAATSFVHESEDDSLTAFLAHASLEGGEHQAEGY QDAVQLMTVHAAKGLEFHSVFISGLEEGLFPHENSRNEPDGLEEERRLMY VAMTRARQRLYLSYAESRMLHGQVRVNIPSRFIDEIPQDLLKRLRSDFSG RSFRQGVSGTGQTVASTINSSQKGRSTMAAAVGMTSSGLNSAGFHVGQQV SHAKFGTGIILNYEGSGTDMRIQVNFHQAGTKWLSLAYAKLEPL >NE1458 xerD, Phage integrase:Phage integrase N-terminal SAM-like domain MRQSPDFFRDMLRMNITDTNIRMLDEFTDALWLEDGLSRNTLASYRADLM QLVEWLGRQPRTNGSLSDVTQADLLAFLSDRIGQGVKASTTCRALTCIKR FYRYLLRQGKILADPATNIDSPKISRHLPVSLTETEVEALLAAPDTRQPL GLRDRAMLEILYAAGLRVSELVGLSISQIRQDMGVVRILGKGSKERLIPL GEEALHWLSLYLQEARPVLLAGKHSNMSFVTTRGDAMTRQAFWYLIKRHA RQAGIVKLLSPHTLRHAFATHLLNHGADLRVVQLLLGHSDISTTQIYTHV ARERLKQLHARHHPRGTL >NE1172 xseA, xseA; exodeoxyribonuclease vII large subunit protein MTDHNLLPEPKKILWRVSELNRNARVILEQTFPLLWVSGEISNLKRYPSG HWYFSLKDDSAQVRCVMFRHKNLYLDWIPQEGMQVEAQALVTLYEARGEF QLTVEQLRRAGLGALFEAFERLKARLQQEGLFSPEYKQPLPRFPRQIGII TSPNTAALRDVLTTLQLRLPSIPVVIYPAPVQGEGSAAAITTALHTAAVR GECDVLILCRGGGSIEDLWAFNEEIVARAIAACPIPIVTGIGHETDFTIA DFVADARAPTPTGAAQLASPDRQAILHRLQYWLHRLQQTMERHIERRMQA TDLLAHRLIHPGERIRHQQMHLLQLRGRLQNAWNRQVEIRTWRIEETGRR IHSAKPDIQAGIRHQQELAARLQRAMAHRLENLQFKLRQQQQHLIHLDPK AVLARGYSIAYTARGDILHDSRQTRAGDNVRLVFASGWAKADITETGE >NE1159 xseB, Exonuclease VII, small subunit MRKKSSSNKEETALHPPPENFETATAELEQIVAGMETGQMSLEDALSAYK RGVELLQYCQNILKNSQQQIKILEADMLKHFSPAEHDAS >NE0023 xthA1, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth MKIATWNVNSLKVRLQQVIDWLNLNQPDILCLQETKLQDEFFPMDAIAQA GYRSIYIGQKTYNGVALLSKETGEDICTALPGFDDMQKRLIAATYGDLRV ICAYVPNGEHVDSEKYIYKLEWLSQLNRFLQQQRACYGKVALLGDFNIAP EDRDVYDPEAWRGQVLCSEPERQAFRGLLDTGFVDSFHLFEQPEKTYTWW DYRMMAFRRNRGLRIDHILLSHEMADRCTIWQVDKLPRKLERPSDHAPVL VELA >NE2192 xthA2, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth MRIITLNVNGLRSAAGKGLFDWLPRQEADVICVQELKAQQGDINGVMRAP DGYSGYFHCAEKKGYSGVGLYTRYSPDQIIEGTGIPEIDMEGRFLRVDFG NLTVISIYLPSGSSGEHRQAAKFFFMEHFLPLLQSLAECGREVLLCGDWN IAHKAIDLKNWRSNQKNSGFLPEERAWLSTVFDELKLVDVFRKINPEPDQ YTWWSNRGQAWAKNVGWRIDYQIATPGLAAMATGVSIYKAERFSDHAPLT IDYDFNL