Gene list
Applied filters:
COG category: Replication, recombination and repair
Gene type: CDS
Genomic element: chromosome
Number of genes found: 243
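The protein records in this listing appear flattened: each one starts with `>`, followed by a locus tag, a free-text description, and the sequence in space-separated chunks, with no line break between header and sequence. The sketch below is one possible way to split such text back into records. It is not part of the source database or its tooling. The function name `parse_flattened_fasta`, the `min_first_chunk` parameter, and the rule for telling description words from sequence chunks are all assumptions made for illustration, and the heuristic can misfire on unusual descriptions.

```python
import re

# Assumption: a sequence chunk consists of uppercase letters only.
_SEQ_CHUNK = re.compile(r"[A-Z]+")


def parse_flattened_fasta(text, min_first_chunk=10):
    """Split flattened '>TAG description CHUNK CHUNK ...' text into records.

    Hypothetical helper: the split between description and sequence is a
    heuristic, not a rule defined by the source listing.
    """
    records = []
    # Text before the first '>' is page header material, so it is skipped.
    for raw in text.split(">")[1:]:
        tokens = raw.split()
        if not tokens:
            continue
        locus_tag, rest = tokens[0], tokens[1:]
        # Walk backwards over the trailing run of uppercase-only tokens.
        start = len(rest)
        while start > 0 and _SEQ_CHUNK.fullmatch(rest[start - 1]):
            start -= 1
        # Short uppercase words such as "III" are treated as description,
        # so move forward until a chunk looks long enough to be sequence.
        while start < len(rest) - 1 and len(rest[start]) < min_first_chunk:
            start += 1
        records.append(
            {
                "locus_tag": locus_tag,
                "description": " ".join(rest[:start]),
                "sequence": "".join(rest[start:]),
            }
        )
    return records
```

Calling `parse_flattened_fasta(page_text)` on the text of this listing would return one dictionary per `>` record. A record cut off at the end of the text still comes back, but with an incomplete sequence, so record counts should be checked against the "Number of genes found" figure above.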
# Nitrosomonas europaea ATCC 19718, ATCC 19718 >NE2010 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE0119 hypothetical protein MKFLDTQELVVSTLSPVHIGCGEDYEPTEYVVDTSGVLHRFNAGILPDLS DAGISSDILTILSNDEAHTEQLRAVHKVLSKYRDKIIPLASVHVSMCTGV HAHYKSTQDKKNDFNRNGVERTSYQPFNQLPYLPGSSIKGAIRTAILNEH IAGNNPCSTVLMRQIQDFNTMIEEYDPGNGKLLLRLKLQHTKWDYDRARK NIEKAIADVSSALGTDLLGGKFETDPLRALKVSDAAPLDIEIEREIRFCL NRSRSGRRSQAQVKNLYTRLEYILEHQPAAFSLSLTLQNLHEIAGRRNHR NELISPSADKLLLWTGIVKACNSYYLNRLDDDLAMLGKLYPTSEWRKQTQ SILDAGLRDQIKTGNCLLLRIGKHGGANSNTVSGRQIKIMLNEDKREANG KEEKIRLYTFDDESRTIWYCGDDLDKPSDLLPHGWIVLSNPDQIWHADLP GFERRCARQQAIAESARRQAEAAAAEQAKAAAQAAREAALAAMTENQRRI EAFVSMCARRAEQLRGGKENPNAAIHTAARELVKAALEGADWTIDEKCAV ADAIEEWLPKLVKVELKDERKKLKLSALRT >NE1940 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNN >NE1990 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE1884 possible homolog of eukaryotic DNA ligase III MTDFFRFPNTPHLLWLGQGQPRDDKILSDAEIAALLQDEVLIEEKLDGAN LGISLDEHGELRAQNRGQYLPQPFSGQFSRLNSWLGQHGEILKHTLTPEM ILFGEWCAARHSLDYNKLPDWFLLFDVYDREAGKFWSVERRNQLAQKLNI TTVPLLKRTKITCNQLVQLLDDAQSRYRSGKVEGIVIRCDSPLWCESRAK LVNREFVQAIEDHWRSRSIEWNLVHAGSVKRS >NE0454 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIQTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP 
ERFIINPYHHKVGLNN >NE0245 hypothetical protein MALPQRQIVRAENVKIGISWQCALCDLDIYARPLPGAEVIYFGRMVTTHG RYWKDYRNSPQPTNGYETISFDVPLDLRPVVIAINFYEGEAPQGVSGEIR IAVDENTYAAPFHISATRGNRGQGVAKIIETGKASGNHSVIVDPLHIIRA R >NE1552 Transposase IS4 family MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA KNTSSFLAVVQIRCIALWADIL >NE2156 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2445 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0112 hypothetical protein MLGTANLPSRNKYKAKATILHMNRNLYLVAYDICNPRRLRQVCRYLTGYK VSGQKSVFEIWVTPTELHTIRTELDKLMDTQADRLHILSLDPRMKPRCYG NASTFTVQHFCIV >NE1222 Transposase IS4 family MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQARYLGKDIRCFDRRPGQSIYYDRQHDRAR SSAGRLRKRGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDC TQALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRT YDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE1132 transposase MPDLCQGLFGQLFADKGYLAQWLTEALDQQNLQLITPLRKNMRPVPRTRF EKVILRRRSLIETVFDELKNLCQIEHTRHRSLFNFIVNLMAGIVAYCLSD NKPTLNLTRVNSLAKA >NE1470 conserved hypothetical protein MRSTTLLFFCSNLVWISPLTWAQTSQQPDNLIPLPEIPESPSAGEENGLP PELGLDPSLEPEITIHEGKDKTMIEEYRVNGELYVIKITPRIGKPYYLLN RRSAVGMPHRGDMESGVSVPMWQIYRF >NE0447 conserved hypothetical protein MLKQNETKTSMILNYRWLYDTVRKRFESDEAMEAFLPKALTPATLKQKGD DRYLSAMSQRVFQAGMQHSVVNAKWPAFEEAFWGFVPETMVMLSPEQIEG YMKNSSIIRHYTKLQTIPRNAQFILDIRQEQGCSFGEFIADWPSADIIGL WRLLAKRGARLGGRSSAGFLRLAGKDTFLLTSDVTARLIAAGIIDHEPTG QRDRQIIQDAFNELQQDSGRPLCQLSAMLSLSINPRF >NE0969 possible N6-adenine-specific methylase 
MVKADRIRIIGGQWRSRLIQFADDELLRPTPDRVRETLFNWLGQDLTGKI CLDLFAGSGALGFEAASRGAKQVTMIEQNMKAVRNLHCSIEKLGASQVKL EHVDARMFLTANSERYDVIFVDPPFKSGLLAEVLPLLPAKLEEEGVVYVE SSDKLLPDDTWSIWKQGRASHVHYCLLSLNPDG >NE0110 hypothetical protein MPADHFLKRQAMSDFIICYDITDPRRLGRLYRYLIKRAVPLQYSVFLFRG DDRQLERCIQDAIELIDEKQDDLRVYPLPGRGLKARIGRPTLPEGIQWSG LPAKW >NE2238 Transposase IS200 like MTFTTKEYQSLSHTRWDCKYHVVFIPKRRKKRIFGMLRWHLGELFHELAS HKESKIVEGHLMDDHVHMCISIPPKYAVSNVVGYLKGKSAIQIARKFGGR QKNFTGEHFWARGYFVSTVGLDDNIVRTYIRNQEDEDERYDQMKLEI >NE0814 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1557 putative transposase MTYPISFRRKVLSVREKEGLTIAQTAARFCVGIASVTRWIKNPVPKESRN KPATKIDMAALAHDVREFPDAYQAERARRLGVSEKGIGHALRRMHISYKK NTAAPQSGRRQTAHLPGDD >NE2232 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0252 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE1523 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2318 hypothetical 
protein MKLLFFCLLLVSMSVPAVAGNEKQIFELEAAIMQQQQEQQILFQRFQMLQ ELRRHEITQIEQALPTGSDVIINGEAPKYEDVARQRKERAERVHRYTDEL DELYMRYQETENERRALIEQLNGLKPGQDVSAEPKK >NE0744 conserved hypothetical protein MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL SSPIVPLQAALGRENQFQLLTNT >NE0259 hypothetical protein MVKPTDAELRTSGGLTSVFLNCDTCLSDEDFNRLRRMEFTQNEHAILYGQ LGGSIAGMIELGPVASRTQSRQDEERKTEERRTAQFVQLVEQMRASIEQM EADVKRLVASFEKRDGDAWREKLALNILEADEIPQQEADESITAYRKRLE QHLINEMLNPDGTIKDKYKNDPKYGDYAEWAQTQFHLNSAKAAVAELDNS DTSPQRKEHILDEMKQRGYIEEMVFTDRISGNLDAQKSVRDIRDSQHDEA LSQVRPPEATLKFLS >NE2098 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG >NE0184 NUDIX hydrolase MTWKPNVTVAAVIEQDDKYLLVEEIPRGTAIKLNQPAGHLEPGESIIQAC SREVLEETGHSFLPEVLTGIYHWTCASNGTTYLRFTFSGQVVSFDPDRKL DTGIVRAAWFSIDEIRAKQAMHRTPLVMQCIEDYHAGKRYPLDILQYYD >NE1366 conserved hypothetical protein MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVDEFTSKRSSPLIPSGHQRSC IQQKHHLHQLICSMTGYCRSLLNRVWALFAF >NE2536 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS 
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNI >NE1398 putative DNA polymerase-related protein, bacteriophage-type MDPNRLKELGLLPVWRVRPGAIAGQSPDSGKNMSDQIETAAESRPFEESE DRSTSIAHADWGRLRQMVSGCTACPLSQTRKQTVFGVGDEQADWLFIGEG PGAREDELGEPFVGQAGKLLDNMLQAVSLRRGQDVYIANIVKCRPPGNRN PQDAEAEQCRPYLLRQIALIQPRLIVALGKVAAQNLLATDASIASLRGRL HEFSGIPLIVTYHPAYLLRSLGDKAKAWEDLCFARDTMRNLQAAHSS >NE0830 DNA mismatch repair protein MutS family, C-terminal domain MAARCVVLLCIVENCDFRRASCDHHDFACHAAHGGLDQRFQKARSWFLLS SIQHKVAKSAKVGNTESTHVITTRTMNDTTQDTAASVWRESFILSSGKNP SGIRDTRPTADNYGVLDAKTFAAVEVDALFDEINQAQTLTGQSILYRSLA RPVTDAALLQSKQEALRELESNPDLLKVLEQYIKRIAIDEASLHHLLYGE FAGGLTTDDPRDKTGKDKLEFGGYGYRQFIDGTGFVVDLVEEAEALPMPE SDYLRTLVQTLRDFARSRTYALMHGPIYVSQGKFMTREEKPRYLLIQRFR PSMFKWPFISFFLAFVAGLLLFFQNTLNELVASYVGYGLLILVVPIIPII LQAISASDRDSVIYPLQRLFRQSPELARTIEAMGMIDELLALHRHARSIP GESVLPEIDMDGRHTLVVSGARNPLLVRTRPDYVSNDIVLDNDKHLLIVT GPNSGGKTAYCKTVVQIQLLAQAGAYVPAVQARAVPAEHIFYQIPDPGQL EEGMGRFAHELKQTREIFFNSTPRSLVVLDELAEGTTFEEKMTLSEYVLK GFHQLGATTILVTHNHELCERLQQENIGNYLQVEFVSEKPSHRLIPGISR ISHADRIASAIGFSKEDVASHLASLQE >NE1182 Helix-turn-helix protein, CopG family MRNTMTHRVTITLDAETFAFLNDVASSNRSAYVNQLLKQDRKNFLQAALR KANQEEAEDTNYQEKLQAWESTLSDGLAND >NE0517 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNT >NE1807 putative ATP-dependent RNA helicase protein MSFENLNLHPAIVKAVLAAGYTAPTPIQQQAIPDLIAGHDVMASAQTGTG KTAAFMLPALHRLATPAQIRGRGPRILVLTPTRELALQVSDAASKYGKFL PRINVVSILGGMPYPLQNKLLSQTVDVLVATPGRLIDHIERGRIDFSRLE MLVLDEADRMLDMGFIQDVERIALSTPATRQTLLFSATLDVAIEKIATRL 
LKAPKRIQVAAQHTKLDHIEQRMHYVDDLTHKNRLLDHLLRDTTIKQAIV FTATKRDADSLADNLSSQGHKAAAMHGDMTQRERTRTLTGLRQGRLKILV ATDVAARGIDIADITHVINFDLPKFAEDYVHRIGRTGRAGASGIAVSFAS GKDVAHLKRIERFTGNRFEFHVIPGIEPRTKPRFGRSDDKPGRRPSSSAA AHKTRRSWSDNPNTRTASPGHRGDKDAGFGQPFGRETRKRPFRDSKFNSA DRFARTE >NE1061 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNS >NE2469 Transposase IS4 family MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA >NE1351 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0254 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0844 Protein of unknown function DUF48 MTQARLCLSVLYFPFSETTPKDLDQVRGIEGDAAKTYFSALPYLVRKDIR EFFTMDGRTRRPPRDRFNAMLSFIYSLVMNDCRSALESVGLDPQIGFLHA VRPGRAALALDLMEEFRSFMADRLALTLINRGQITDQDLLVREGGAVHLE DKARKTVVVAYQERKQEEITHPLLETKVPIGLLPQLQARFMARVIRGEMD GYLPFLVR >NE0239 Integrase, 
catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1845 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2200 transposase MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE1853 hypothetical protein MSDQYTQNESDQSKDKVEWTKPASLLNILGKKFAPIADLQHKQLPSWSLL VFLGILLLVFIWKQIAVNQAESRLEKGQAQIAQQLEEKSKELVKKAREYA DSQYKKEEERFGQVLAWAVRGELIRNNLDQIDQYLTELVKTKDTERVVLI SDEGKLLVSTDKRLESEEASSLYPKDVLGLQTITIKSDVDNRKLLVVPVM GLNKRLATIVISYNPPSLLN >NE2483 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNS >NE2228 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2447 transposase MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH 
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE1740 Transposase IS4 family MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA >NE1631 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDHLISVNRP >NE2274 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0935 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPGPMVKSNA >NE2413 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYL >NE1378 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1925 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL 
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0719 Uncharacterised protein family UPF0102 MSSAGNKGSDAEQCAAAFLQQQKLTLLEKNYRCRFGEIDLIMREDDTVVF VEVRMRSSDRFGGAAASITAAKQSRLIRTARHYLAGHEGDFPCRFDAVLI SGNRENEIEWIRNAFDES >NE2011 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE1698 conserved hypothetical protein MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA ALIEGLTVRASARQCRIDKNTSFRWRHRFLTFPAAAKANHLEGIVEADET FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL SSPIVPLQAALGRENQFQLLTNT >NE2311 possible helicase (Snf2/Rad54 family) MVQQLLTPHQSQYIAWQLTRRAAKDSVESLASTLVDSQVDLNPHQVDAAL FACRNPLSRGVILADEVGLGKTIEAGLVISQHWAERRRKMLIIVPANLRK QWHQELQDKFNLQGLVLEAKNYNAMRKEGVTQPFLHAGGPIICSYQFAKA KADDLRRIHWDLVVMDEAHRLRNVYKNGNVIARTIRDALEHVDAKVLLTA TPLQNTLLELYGLVSMIDERVFGDLDSFRTQFSGVRTEQSNRALRERLTP LCKRTLRRQVQQYVPYTARIAIVEEFTPSQEEQQLSALVADYLRRPNLKA LPEGQRQLISLVLWKLLASSSHAIAGALETMANRLQGQLDELPDVPDLTE SLDDDYEGLDETADEWNGATANDADASANERAAIADEAAELRRFKELATS IRQNAKGQALLTALDKAFAELERLGASKKAIIFTESKRTQNYLLSLLAET PYGIVLFNGTNTDARAQAIYKDWLQRHEGSDRITGSKTADTRAALVEHFK ERGTIMIATEAGAEGINLQFCSLVINYDLPWNPQRIEQRIGRCHRYGQKH DVVVVNFVDRSNEADARVYQLLSQKFKLFEGVFGASDEVLGAIGSGVDFE RRIAAIYQNCREPEEIRSRFEDLQRELSSEIDEAMLRTRQLLLENFDEEV QEKLRIHSQDSQAVLNKYERLLMDLSRTELRDHARFDTAEEVNGFVLHSL PDGLGLATGSREQAVMAGRYELPRRSGDAHLYRMGHPLAEWAIERAKARD LQAPARLAFDYAAYGKRLVSLEKWRGQCGWLSVTLLSVETLNDQEQHLVV SACTQAGEALPEDDPEKLLRLPAQVEGDAHLQVCAELVANVESRKSVLLR GINQRNLGYFEQEVQKLDTWADDLKLGLEQEIKAIDGEIKEVRRTAAASP TLEEKLAHQKRQRELETRRSKLRRDLFARQDEVEEQRNKLIGELEEQLKQ QVAERMLFTVEWELT >NE0155 Integrase, catalytic core MCGVFREGVAVRYARIEQLRQHHAVAAMCRILDVSESGYHAWRQRPPSAR QQENLRLETEVKAAHQRTRETYGPRRLRSDLADHGIQTSLYRIKRIRRKL 
GLRCKQKRKFKATTDSRHALPLAPNLLDRQFTVAAPDRAWVSDITYVATD EGWLYLAGIKDLFNGELVGYAMSERMTTSLVSQALFRAVAAKRPARGLIH HSDRGSQYCAHAYRKQLQQFGMQASMSRKGNCWDNAPMESFWGSLKNELV HHRRFTTRTQARQEITEYIEIFYNRIRKQARLGYLSPAQFTQKYHAKQIA A >NE1122 conserved hypothetical protein MNDLESLISQVRRCTLCAEHLPLGPRPVFQLHETARILIASQAPGRRVHE TGLPFNDPSGDRLREWLNMTRTIFYDPRRIAILPMGLCFPGTGKSGDLPP RPECAPAWRSALLSHLKNIRLTLLVGQYAQAYYFTRQGRKPVATLTENVR SWQKFWPDIVPLPHPSPRNNLWLRRNPWFEEEIIPALQERVAMILNQTTD S >NE0716 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE2309 Adenine specific DNA methylase Mod MASNQKLELTWIGKEKRAKLEPRILLEDPEKSYHAKQRVSESDVFDNRLI FGDNLLALKALEQEFAGEVKCVFIDPPYNTGSAFTHYDDGLEHSIWLGLM RDRLEIIKRLLSNDGSLWITIDDNECHYLKVLCDEIFGRANYKTTITWQR KYSVSNNFQGIASICDFVLVYSKSEAFKNNLLPRSEESAARYNNPDNDPR GPWKAVDYLNQATPEKRPNLCYDIVNPNTGVVIKNTKKAWKYDPTTHQRH VDEKRIWWGRDGGNSVPALKLFLSEVRDGMTPHNWWSHEEVGHTDESKKE MIGLYGPRDVFDTPKPERLLKRILEIATNPGDLVLDSFAGSGTTGAVAHK MGRRWIMVELGEHCHTHIIPRLKKVIDGEDPGGITNAVDWQGGGGFRYYS LAPSLIVEDRWGNPVINPEYNATQLSEALCKLEGFTYAPSETRWWQQGHS SERDFLYVTTQNLSASQLQALSDEVGTEQSLLVCCSAFHGISAAAAAARW PNLTLKKIPKMVLARCEWGHDDYSLNVANLPLAKPEPETPASQPAPKKKG KKTLPMPDLFGDVEDGA >NE0981 HhH-GPD MALVFWEQAVNDLSARDPVMHRIIQCYSDSMPEERGNAFATLARAIVGQQ ISVKAAASVWQKVTTLIPEITPEALIATEIDLLRTCGLSARKVDYLRDLS RHFLEGTLVTVNWHDLDDETLIRKLVEVKGIGRWTAEMFLIFHLHRPDVL PLDDIGLQRAVSLHYNASQPVAKQAIRTIAESWQPWRSVATWYLWRSLDP IPVIY >NE2109 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2532 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0253 Integrase, catalytic core 
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0562 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNT >NE1174 Uncharacterized protein family UPF0020 MTERFFAPCPRGLETVLAAELERLDATSIQASPGGVGFHGNWQTCYRANL ESRIASRILWQIAKDQYRSEADIYDLTHSLPWQDWFEPRLSIKVNLAAIK CPLRSLDFVTLRIKDAVCDKFRAIHGKRPDIDTVAPDMRIHGFLNAQEFT LYLDTSGEALFKRGLRQTQGEAPLRENLAAGILALTGWQPGTPLLDPMCG SGTLLLEAAQIACRIAPGSGRQFAFEQLKLFDARSWKKLKQTATERQHER TFQSIYGSDLYGSALAHTRNNLAAAGLAECVTLKQANVLEISAPAETGIL VSNPPYGVRIGDHQMLAEFYPRLGDVLKQRFSGWRAFLLTADPLLAKSIR LTPSRRTPLFNGALECRLLEYRLVAGSMRREKQPSSESSTNQPIT >NE2521 conserved hypothetical protein MTSVLSPNTQAILLLTAPLIAGRGTASSDLLSPGEYKRLARHLREIQRQP ADLLSPDAAEILRACQPVIDEGRLQKLLGRGFLLSQVIERWQARAIWVVS RADAEYPRRLKARLREDAPAVLYGCGDMALLETGGLAVVGSRHVDDALID YTMTVGRLAARAGRTLVSGGAKGIDQAAMRGALEAGGKVCGVLSDSLEKT TMNREHRNLLLDGQLVLISPYDPSAGFNVGHAMQRNKLIYALADTSLVVS SDLNKGGTWAGAVEQLDKLKFVPVFIRSTGESSAGLDGLRKKGALAWPNP QDVDSFKDVFNVAMPTPTASPQVGFALFSNEEPTSVDAKPTVPVPPDTAP APQAESEPSAPVDVVSDAQPPAPALEEQPSVTPEAIPPIDDAMESAQPES SPAEVLFAAVRAAIQQLLSAPMKDADVAAALDVSNAQAKAWLQRLVDEGV LEKQKKPAGYIVKQKRLFE >NE0511 hypothetical protein MAHAKGGKLFDSPVRLNYIIGIKSHILDSNLQLKWSVLLDIDREPAETAE EQFDSSFFLMTEELRQLLPELMEALGGTASE >NE2412 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE2023 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF 
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE1271 Integrase, catalytic core MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0162 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0561 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2190 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG >NE1788 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE0715 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE0940 putative DNA transport competence protein, ComEA 
MYYIWPKRLKLRNPEALFLNFCGDSEMKKIFLILVIFFGFNLSVLAGVDI NTASQADLESVKGLGPVKAKAIIEYRNKYGMFKSVEELANVKGIGAGILK QLGDQVSVQEGAVLTETKVD >NE2515 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0197 conserved hypothetical protein MIPNPDPESSINRNQVIISGTITDLASPRYTPAGLMIAEFKLSHCSNQQE AGIQRRIEFEFEAIAIAETAEKIIRIGSGSNVEITGFIAKKNRLSNQLVL HVRDTRII >NE2101 hypothetical protein MSSSPSHPFPSLQSRIVASFVSTSSTIIVARLSTLRPLRDLTMVGWSMST RMKAKLVCDALQMAVWQRQP >NE1996 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0134 transposase MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL SSPIVPLQAALGRENQFQLLTNT >NE0257 Site-specific recombinase MESQHNHVMKKSCRIIGYARVSTEDQHLDLQIDALKLAGCSSIFEDHGLS ATAKRRPGFEQALASLQAGDIFVVWKMDRAFRSLKNALDILEEFENRAIE FRCLTEDIDTTTPMGKCMYQIRHAFSELERNLIRERTKAGMEAARQRGAH LGRPKKLSRGQIIRMQNLLQRQPDMTPVQIADQFGVSSRTIYRALSKYST IKEELAIHAG >NE0709 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0560 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE1843 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ 
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNNYRARAEYATATATLPQAENH >NE2013 hypothetical protein MCRCRLHWCSHTLESIEEHGCTAHVKGRGQQAKEKRRHPGAHARRWIIEV SHGWFNCLRKLLMRYEKLARSFLGLNHLAAAIIAFRKVPLAVNIIYE >NE2166 UvrD/REP helicase MNTVLQQEIPDQRERQQALDPWHSFIVQAPAGSGKTGLLTQRFLVLLATV EEPEEIVAITFTRKAASEMKHRILQALRDTAGDINSDAESETALLNDAYQ RQLRELANRVLAHDQARGWQLLQNPSRLRIQTIDSLCAWLVDRMPVCSRQ GALSSVAEDADRLYLEAARLTVEALEEEGEWTAAIEHLIGHLDNRLDRLQ QLIADMLARRDLWLRGVVDAANSDDMRDRLESVLSGRIAEAIERLADAVP AGCQSEIIELMQFAAVNLSEAGSADSNTVRWPGNALEDRLVWESMADFLL TQTGDWRKQVTKANGFPAPSSVRDADVKEYLNGMKQRMSELLVALQSEET FRQQLQLLRQLPPERYTDEEWETLQALFSLLKVAAGYLLLVFRQHGQVDF TEIAMAAVRALGEPEMPTDLALALDYRIHHLLVDEFQDTSSSQAELLQRL TAGWQTGDGRTLFLVGDPMQSIYRFRQAEVGLFLDIRDSGYFGQIQMRFL RLSVNFRSQSGIVEWVNRYFPRILPDTDSVSTGAVSYASSVAFHAASSGE AVRIYPYLQKDDRAEAEQVGAIVAQARAAQPDGRIAVLVRNRSHLASIVV HLRRKGLRFQAVEIEQLAQRVVIRDLMALTRALVHPADRIAWLALLRAPF CGLSLQDLHTVANTLPQHVLIDSLRACAGSGVLSEEGGQRVNRVLPILER ALMLYDRMSLRRCVEGIWVSLGGPASVQNETDLADAEVYFQLLENFDVTG YRPDIQELDERLVRLFALPDVAADDSLQLMTLHKAKGLEFDTVILPGLGK SPRRDQEKLLNWLEFHDQSQHPGLLCAPISAAGSDKNPISAYILSEEKKR TALEEARLLYVAVTRAKHNLHLLGHLRIDPDMQENDALKPPEDTLLARLW PAVAADFLARSREAAIGDLPASNVHTGLQLVGMVRLVSGWQPPPLPKAVA VAMHANEAGTTEEPVDFDWAGEPARLVGVVVHCLLHRIGLIGVENVDHQD LEALKLAGRSLLIQSGITPRHLEKAVQQVARALRTMCVEDETGRWILSNR HQEARCEWALSVPTAIAAGHSISVSIIDRTFVDAAGVRWIIDYKTGSHTG GSLEEFLDREQLRYRPQLDRYAQVLQRMEDRPMHLALYFPLLGKWRKWIP SRESA >NE2024 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE0188 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL 
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2028 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNTRICR >NE1815 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0098 conserved hypothetical protein MRQQLLDITEIGPPSLDNFISSGNEEVLYTLRNLVAGNQQDRFYYLWGKT GSGKSHLLQAVADAFSEQQCNSRYIDCNQDEPNFNPGTDCIVIDNVERLD DAAQIRLFNLYNHLRDNKHGIFLASGTKPPAQLDLRQDLTTRLGWGLVYQ VHELTDEKKIEVMQDYAIRCGFELPLEICHYLLKYEQRNLSSLIRLVHAL DQLSLTRQRPITLPLLRELL >NE0231 hypothetical protein MTMIKKIETELLAAKATLSEISGRFKEFSDTQARLSADGDLLGLARLNKE HTGLEDSLLAADDTVRALESRLSVLRQAEYRPQFDKAHKTHLGAVQAETK AAEKLLAAIDAVFSAATDMQNLSDEVAATYHAARDLHNRAGLDHELRWPA PDGQIPIKISDRMNSLRDELVRTIRLYEDRLPQSQSLEGLRIIEQQQEEL VRNSGRGFR >NE2096 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE1348 Integrase, catalytic core MGRIYQQTFVDTYSKWAAAKLYTNKTPITSADMLNDRVLPFFAEQSMGII RILTDRGTEYCGKPENHDYQLYLALNDIEHSKTKANHPQTNGICERFHKT ILQEFYQVTFRRKIYQSIEELQHDLDDWMAYYNSVRTHQGKMCCGRTPMQ TLIDAKEIWDDKITELNN >NE2504 Transposase IS4 family MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA >NE1516 Uncharacterized protein family UPF0006 MFVDSHCHLDFPDLASSLDELLVNMQISQVTHALCVGVNLENFPRVLALA ESHSNLFASVGVHPDYEDTAEPAVEQLLKLADHAKVVALGETGLDYFRLK GDLEWQRERFRRHIRAARRCGKPLIIHTRAAAEDTLRIMEEEGAASVGGV 
MHCFTESWEIARRALDLNFYISFSGIVTFKNAAIIKEVAKKVPADRMLIE TDSPYLAPVPHRGETNQPAFVRHVAEEIARLRETTLAEIAAVTTNNFFNL FKVV >NE1630 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR >NE1264 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1261 Integrase, catalytic core MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1880 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2411 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNNYPQQNGMVKWVTGH >NE0845 DUF196 MLIIVTYDVSTETRAGRKRLRRVAKLCESIGQRVQKSVFECRINLMQYEE LERRLLSEIDEQEDNLRLYRLTEPAELHVKEYGNFKAIDFEGPLTI >NE2533 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW 
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0244 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNIYRAQQDHFQLVGTDRLEIMRGHGIQRHASKQRWH ISDKTTQLAAQRFHVKRPETLHEIGMPVTLHDTVTAVTDMSNDIFEQPCL TGCAERRFALGSEQMPIGRKAATRHRKGRLLRIVVEW >NE0340 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNT >NE0271 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDHLISVNRP >NE1178 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0749 Transposase IS911 HTH and LZ region MNKQNKQNKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTL LEWVKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFA QAELDRVLKK >NE0662 Transposase IS4 family MCGTPARCVAPLAALFEMLKGQCDGISIADATAIAVCDNRRIARHRVSAD SARRGKTSMGWFYGFKLHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGL FGQLFADKGYLAQWLTETLDRQNLQLITPFKKNMKPAPRTGFEKAILRRR SLIETVFDELKNLCQIKHTRHRSFFNFVVNLMAGIVAYCLSDNKPTLNLT RVNTLVKA >NE0272 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP 
DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR >NE1991 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE1522 Transposase IS4 family MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA KNTSSFLAVVQIRCIALWADIL >NE1560 conserved hypothetical protein MHCRSFPPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILG FDATSAYPLRVSSLKDHGVALWDVLQSCTRESSLDADIVAHTIVPNDFGR FFTACPDIRRVCFNGAKAAALYARHVKPFLQDAPTVEYVQLPSTSPANAA IPRADKLRAWSVIKHNA >NE0452 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0288 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNR >NE1260 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE2273 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE0880 probable ATP-dependent DNA helicase-related protein MSDLNTVFSADGLLARNIPDYRPRTQQLEMAQAIAQAIESQEVLVTEAGT GTGKTYAYLVPALLSGGKVILSTGTKTLQDQLFQRDIPTVRAALKIPVTI ALLKGRANYICHYHLERTLNSDHIHFASRTEVKYLNLIERYAGTSSHGDK SGLDKVPEQAAIWQHVTSTRENCLGSDCPHYRQCFVMEARKRALSADIVV VNHHLFFADVMLRDEGLSELLPACNTVIFDEAHQLPEVASLFFGESVSTG QIQVLVRDTDTEALLEAKDFAPLFDATAAVGKAVLDLHLTITEKHTRMSS ASAARYPGFSEARQVLQEKLVLLAGLLETQAVRSQGLQNCWLRAQTLLNR IRQWHEQSESREFICWVETYSQSLQFNTTPLSVAETFSKQLDASARAWIF TSATLSVKKDFSHYNRMMGLFEAKTANWDSPFDFPNQALLYVPSQLPDPN TPHYTESIVQAVLPVIKASQGRAFILCTSLRNMQQIHELLQVAFQREQLE FPLLLQGQEARSALLNQFRQLGNAVLVGSQSFWEGIDVKGNALSLVIIDR 
LPFASPDDPVLSARIEKFTREGRNAFMEYQLPHAIISLKQGAGRLIRDEK DRGVLMICDPRLVSKPYGKQIWQSLPPMKRTRDPDEVLRFLENVDQ >NE0711 Uncharacterised protein family UPF0102 MSSAGNKGSDAEQCATIFLQQQKLTLLERNYRCRFGEIDLIMREGDTVIF VEVRMRSSDRFGGAAASITAAKQLKLTRAARHYLAGCEGDFPYRFDAILI SGERENEIEWIRNAFDES >NE2201 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2155 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE2108 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1107 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE2012 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP ERFIINPYHHKVGLNN >NE2470 NUDIX hydrolase:Conserved hypothetical protein 52 MDFDVLEKTVCFQGFFRLERYRLRHRKFNGEWGRPITRELFERGHAAAVL PYDPQTDEVLLIEQFRAGAISAPGGPWLLEIVAGVIEANETPEQVVARES MEEANCQIGSLIPLYDYLVSPGGTTERIVLFCGRVDMQTIEAGAVYGNHG EDEDIKVHVMPLNEAIRLLSTGRINSASAIIALQWLALNRDSVRRRWLPE >NE0939 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG 
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1219 UMUC family (DNA-repair) MHGMNTKNRRIAHLDMDAFYASVELLRYPELRGLPVVIGGRSVHQPVIQP DGKRSYVRLRDYTGRGVVTTSTYEARAYGVFSAMGIMRAAQLAPDAILLP ADFDTYRHYSRLFKDAIARITPHIEDRGIDEIYIDLSEHPDETASLASSI KQAVRDATGLSCSIGIAPNKLLAKISSDLEKPDGLTILTHTDIPNRIWPL SVRKINGIGPKAEEKLVRLGIQKIGELAKAELSLLQAHFGRSNAIWLHDS AHGRDSRPVVISSESKSISREATFERDLHVQEDREILSDIFTELCTRVAE DLQRKGYVGRTIGIKLRYENFQTITRDLTVRNPTADASTIRKAARDCLRR VPFEQKLRLLGVRISGLSKISALLKENYYFQEELF >NE1270 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE1106 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR >NE1789 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE2466 putative lipoprotein MYQKFNKPVNAALHLILVLSITACASQNKFFDTLDYNRDWEAIQSNLPAY PQPENLLEFDSGPATSLRYFIDAKSISVDEKRVIRYSIVIQSQQGANNVS YEGLRCETRERKRYATGNNDIRSWVRANTSEWQPLEAVAQLRAQRELAKY YFCPRGLVVGSPAEAVRALKAGVHPMVIR >NE1539 hypothetical protein MLFWIKTIQTAAWFYLLMFALLAGSAHAAELKPADQTGFLIVAADRGFVG NEEIRDAFASFSANHPAALVFVTDERTRQTLQSGLASLHQQNIGRIVVLP LFISAAEPRYQLIRTLVTEENQTIPVTFTKPYGESYFAVEALATRLRGMQ HTAQQHLLVVGYGAQNDTHRRAMYDDWMRIVKQASQGVSFRSINSLILLE AQKDEEPESYGNKTKQQLATALSSLGTATKNNKNQVIAFALGPKYDSMMS LESRLERLLPENAALNHFEIEPQHLAMWMEREASRNLPLAEEDTGVILFA HGSDFHWNENLRVAVEPLMKRYKIEFAFSMADPYTIERALHKLEQRGAKA AIVVSAFASRSSYRNEIGYLAGLDIENQDDHIHDNNSGHGSHGGHGGHAK SSTPVPRILTSLPVIWTGGYEDNPLFASALFDRVLALSKDPARETVILTA 
HGTQDDRKNDEWLEKLNSIASQMHDQGGQKFKAFKVATWREDWPDKRAPW VKKVRAMVTEASKQGDRVIVIPTRTTSVGPEKRFLAGLEFELGEGFAPHP LFTQWVDEQIRQGINLHKEALGR >NE0121 conserved hypothetical protein MQLDTIHKITGTLILKSGLHIGAGDSEMRIGGTDSPVVKDPLTDQPYIPG SSLKGKIRSLLEWRHGLVVATGGAPYSFKHLAQDENNSAGRDVIKLFGGA PDKAEDQLVKNIGPTRLAFWDCPLNGDWKKEAADSRHLLTTEVKSENSIN RIAGTAEHPRFIERVIAGARFDFTLTLKVLEGDDLLNTVLLGLRLLELDS LGGSGSRGYGKIKFAELKLDGTDLMEQFHAITPFNQTA >NE2498 hypothetical protein MPKMDVKGIAVTVYSENAMDFISLTDMLRAKDGDFFISDWLRNRNTVEFL GIWEQVHNPNFNYGEFATIRSQAGLNSYKISVKEWVARTHAIGLVAKAGR YGGTYAHKDIAFEFGMWISAEFKIYLIKEFQRLKEAEQQQLGWDIRRNLT KINYRIHTDAIQTNLIPPALTQSQISLIYASEADLLNMALFGKTAKQWRE ENPNNKGNIRDEANVSQLVCLANLETLNAHFIHQGLPQVERLKILNQTAI HQMKLLLADRSLKQLDGN >NE1093 Transposase IS4 family MARFKPVQKGLMLLPVDFSRQIIPGSFEHALCYLVDHELDFSGLRERYRN NTQGAPAYDPAVLLKIILLAYSRGLIGSRRIEAMCRQNILFIAVAGDNQP HFTTLAAFIAELGDEVAKLFAQVLVVCDRQGLIGRELFAIDGVKLPSNAS KAKSGTRADYQRQAEKMEKAAKQMLVRHREIDMTPVDERQAQREACMLER LQKEAKQLKDWLAANPEDRKGPKGGVRQSNLTDNESAKMATGKGVIQGYT GVAVVDEKHQIILDAQAHGTGSEQELLVPVVQAIKPQMSNQTVITADAGY HSENNLKMLAAEGIDTYIPDNGYRKRDERYHGQEAHKTKPDPLWDKRGQP SISKRFGPGDFQLAEDGSHCLCPAGKRLYSNGSNCTFNGYAAMKFRGAER DCLPCTLRTQCLRTPEKTKTRQVAFFRGKRDGYETHTDRMKRKIDSDQGR QMITRRFATVEPVFGNLRNNKRLDRFTLRGRSKVDGQWKLYCLVHNIEKL AHYGVGQ >NE2414 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0341 Transposase IS4 family MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD SVFRFFIHFALIVDYLISVNRP >NE0120 hypothetical protein MTPLRAILRLRSPLGTPLAGDTLFGQLCHAVREMLGEEKLEALLDGYTAG SPWLVVSDGFPSGYLPRPTVAAALQANSEEDPKKRKEAKGKRWIPHSQIA QPLRQLLSSAVSDEEVYGKQSRPIQAAAFHNTLNRLTGTTGTGEFAPYTQ 
SQIFYQRDQRMDLWCVLDEDRLPRETLHQLLEYIGSVGYGRDASIGLGKF AVEQIEEAALFKQTHPNANAYWTLAPCSPQGQGFKTSRSYWQVLTRFGRH GGTLALGANPFKQPLLLAATGAIFAPTNNMAQIHFIGSGLAKVSLMQTAA VHQGYAPVLGICMEAI >NE2441 hypothetical protein MKKILSILSSTFILSLFMLSSVNAQVEDVHLQEAIRQTEAVVLAVDVKTM TQLVQEAERYAVEVKSTHPENEHLQEGLKHLNDVIKESQAGEPAAARKAA IVALSKFNQIERK >NE2446 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0268 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1840 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1558 putative transposase MPRTHGYAPIGKRCHGKCNWHARGRINVIGALIGKCLLTVGLFKNNIDAD TFLGWTIHDLLPKLPPASIVVMDNATFRKRQDIQNVITRGGHTLEYLPAY SPDLNPIEHKWAQTKAVRKQQNQTVEQLFKIESFYVT >NE1053 Uncharacterized ATPase related to the helicase subunit of the Holliday junction resolvase MTDSPHTIRNPAAPLAERLRPRTLDDVVGQSHLLGPGKPLRLAFESGKPH SMILWGPPGSGKTTLARLMAHAFDAEFIAISAVLSGVKDIREAIERAQIT LQRTGRATLLFVDEVHRFNKAQQDAFLPHVEQGLITFIGATTENPSFEVN GALLSRAQVYALKALTDQELHQLFERARSIAMLDLEFENTAIELLIGFAD GDARRLLNLLEQVQNAAETEEIIKIDADYLSRVLARNVRRFDKGGDAFYD QISALHKSIRGSSPDAALYWLCRMLDGGADPRYIGRRLVRTATEDIGLAD PRALTLALNACEVFERLGSPEGELALAQATLYLACAPKSNAAYVAYKQAR AFIKEDISRPVPIHLRNAPTRLMREMGHGAAYRYAHDESESYAAGENYFP 
DNILAVQFYRPTTHGLEAKIGEKLAYLRSLDEKTGKKRN >NE2520 ATP-dependent DNA helicase RecQ MAYDPKRALELLRIGSGRANATFRDGQEDAIRHIVEGKGRLLVVQKTGWG KSFVYFIATKLLREAGAGPALLISPLLALMRNQIAAAERMGVRAATINSD NMDDWTVVEGKLAKGEIDILLISPERLANERFRTQVLAGIAAQISMLVID EAHCISDWGHDFRPHYRLLERIVKTLPPNLRLLATTATANNRVMEDLAAV LGPKLDVSRGDLNRTSLSLQTIRLPSQAERLAWLAEQLATLQGHGIIYTL TVRDANQVAQWLKTQGFNVEAYTGETGDRREQLEQALLNNQVKALVATTA LGMGYDKPDLAFVIHYQMPGSVVAYYQQVGRAGRALDSAYGVLLSGQEES DITDWFIRSAFPTRQEVADVLGALEDEPNGLSVPELLSRVNLSKGRVDKT IALLSLEAPAPIAKQGSKWQLTAATLSEAFWDRAERLTALRRDEHQQMQD YVSLPFGEHMGFLIGALDGDPSVVAEPALPPLPATVDAELVKAAVEFLRR TSLPIEPRKKWPDGGMPQYGVKGFIAPAHQAESGKALCVWGDAGWGGLVR QGKYHDGHFSDDLVAACVKMIQEWNPQPSPTWVTCVPSLRHPELVPNFAQ RLAAALGLPFHMVIAKTDARPEQKTMANSTQQARNIDGSLALNGQPIPPG PVLLVDDMVDSRWTLTVSAWLLRKNGSGEVWPMALSQTGHDE >NE1316 NUDIX hydrolase MIDRNGYRANVGIILLNSQNQVFWGKRARQDSWQFPQGGIKSGETPTEAM YRELAEETGLQPVHVEILGRTREWLRYDVPACWTRRDWRKNYRGQKQIWF LLRLLGRDSDVSLETCAHPEFDAWRWNQYWVELESVVEFKRQVYRQALTE LSRLLDHEAGLGNDRAYREPLEPVEKNRKKSSDTRQS >NE1675 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2516 Transposase IS911 HTH and LZ region MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE LDRVLKK >NE0093 recQ; ATP-dependent DNA helicase MRHQFPEVGRLISELRDAPCEREDCQYCQTTHDPRHELKRYFGFPDFRYE QPGESLQHDIVLAGMRGQHVLAILATGGGKSLCYQLPALNRFHRNGSLTV IVSPLQSLMKDQVDGLLERNVQCAAALNGLLTMPERAEVLEKIQMGDVGI LLVSPEQFRNKAFRRAIRQRQVGAWIFDEAHCLSKWGSDFRPDYLYVSRF IREYTGDQPLAQIGCFTATAKPDVLADIQSHFRESLGIEFKVFPGGHERT NLHFDVLPCTKAEKWSRTDRLLHEHLDSQEGGAVVFVSSRKSAEELSDFL IGQGWPCKHFHAGLEPNTKTDIQDDFKAGQLRIIVATNAFGMGVDKADIR LVIHADIPGSLENYLQEAGRAGRDQGDARCVLLYDPQDIETQFGISERSK 
LSIRDIQQILRKLRNESNRRKGGKLVITAGEILLDDDVDTSFSADERDAE TKVVTAVAWLERGDYLKREENHTQIFPARLDMSEKEAEKRLLKAKLPQRR LEEFRAILRFLYGADADERVNTDQLMQLTSLESEEVASALKQLEEMGLLV NDSQITLYVRHGVTGASSQRLQSSLELERALLQRLPELASDAGQGEWQDL NLPALAAELKADTRQGDLLPLQVLRLLRSLADDHDANSQQRSSFELRQLN RDYLKLRIKGGHSWRQIERFGEKRRALAGVLMEFLIGKLPPGSRNKDLLV ETSFGELVKALESDLELPHLIAPDQRRKAVEHVLLYLHRQDILTLNHGMT VMRRAMTIEVKKEDKRKTFLKEDYLRLDEHYREKRIQVHVIHEYAEVALK EMAEALRLVLDYFTDSKQAFIKRHFAGREDVLKLATSEASWKSIIESLST TQKLIVADDDDINRLVLAGPGSGKTRVIVHRIAYLLRVRRVPATSIVALT FNRHAANEIRKRLLALVGADAYGVSVLTYHSMAMRLTGTRFERGDTVDER ALKRVLSDAVELLEGKRNVEGEDNLREQLLRGYRYILVDEYQDIDDLQYR LVSALAGRHAEEDGRLCIMAVGDDDQNIYAWRDTNNRYIERFREDYEASI SFLVDNYRSSLRIIEAANQLIGQNSARLKEANPIRIDRARQELPAGGLWE EQDKQRKGRVLRLLIDPSDRERGNLQAQAAMLELERLLVLEQGSWNGCAV LARTHRYLWPIQAWCEQHDIPYFLAADKETALPITRQRSFVAAIDSLREI ESALCAADAWLRLSGSNQLVEAEWKSFFQTAFEQMRGELGDCQLGSGALI DWLYDYARELRQQPKEGLYLGTVHSAKGLEFRHVVLLDGGWSTQVDTLAD ERRLYYVGMTRAEQTLALCEFADGNPFSRSLMKGVQQHAFQGQPLPELEL RFQQLTLKEIDISFAGRQLPHARIHKAIEALREGDPLTLKEEAGRYQLLD RQGNVVGRTAKSFQPQIGFAHCEVAVVIVRFAEDSEEQYRDLNKCERWEV VVPRGRG >NE2097 Integrase, catalytic core MPTGQNNRTTVVRKNATCPRDLVNRMFHANRPNQLWVSDFTYVSTWQGWL YVAFVIDVFARRIVGWRVSSTMSTDFVLDALEQALYDRRPADTLIHHSDR GSQYVSIRYTERLAQAGIEPSVGSRGDSYDNALAETINGLYKAELIHRRA PWKTRAAVELATLEWVAWYNHQRLLGSIGYIPPAQAEENYRQTQDNKTLM DILL >NE2178 conserved hypothetical protein MENPAMTTRPFDARQTNITDLIQRLDGCATIVTGNRRLARALHQAFNQAR SAEGHGAWPAPDILPWDAWLQQLWQEVVISSRIESAPGVLLTSHQEYFVW QEILAEQSGDVPLQATNETVARIMEAWQTLHAWCIPCREADFGHNADTRL FWQLASMFEAKCRKNSWLSVAVLPGILQKYVQIDSLSVPNELVLTGFDEW TPQQSSFLRAFEQTGCSLQWLQLSGQPDRIGKLACADGRDEIRQAARWIR QRLEENPAARIALVVPELAAQREMICQTLDEVLIPQALQPEHHDRVRSYN LSLGKPLDRYPPVSLALDVLGLSETVIELPHVSRVLRSSFIAGGDREMNA RALLDARLRESGEWNLTLQKLLTNAARSGQPYSCPLLAECLSNLMKQVKV SLAPTSPGEWAQRFGQWLKAIGWPGERGLSSEEYQVIQAWQGVLREFSTL DWVIRSVSLTEALQQLRHMVAGTIFQPESAEAPVQVLGLFETSGLQFDYL WIMGLHDGVFPASSRPNPFLPLTLQREVDAPHSSARRELRVAAALLQRIT TNATEVVISYPQRKGDEILDSSPLIDAFPALSEEMLAMGTQSAWRDSVYH 
SRQQEVLSEDVAPTFVGTGIPGGSKLFKLQAACPFRAFAELRLVARPLGR IQIGLNALVRGTLLHRVMEMVWAELDSLAALANLSPGELNALVAGKVNEA IYEIAPRYPHTFGERLQALESKRLHALVLAWLEMEKQRPPFRVSGREMET ELELNGLRINLRIDRIDTLEEGGELLIDYKTGEVKASAWFGDRPDEPQLP LYSLAFTDDGLAGIAFARIRAGDIAFEGVASEEVSILGIKSFENLRHTRE AASWDEVLAGWRQTIEQLVQDFMAGEARVSPKQYPQTCTYCELKPLCRIG ESLEAVDDC >NE0934 Integrase, catalytic core MNRTLKEATVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYI IKCWQNEPERFIINPYHHKVGLNSYSVFRCLDTVRLSAAHVGFFTEMREI SC >NE1995 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1367 possible ISA0963-4, putative transposase MLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQLYLALNDIEHSK TKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEELQHDLDDWMAYY NSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE1760 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE2442 Integrase, catalytic core MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN >NE0111 Protein of unknown function DUF48 MTSLFVDRRGVELGLESGAIVFRENGERIGTVPIAPLTRVFLRGDVNLPA ALLGKLGERGVGVVILSGRTSRPSLLLARPHNDAARRVAQVRLSLDEPAS LIIARELIERKLTRQIEWFTELRENDIQARYELSRALRGLEEHRARLGNI NNAASLRGIEGSAAARYFTGLQAVIPGSLHFHGRNRRPPRDPFNALLSLT 
YTLLHSEITIALHGAGFDPYIGFYHRLDFGRESLASDLLEPLRPLADRFA FALVHRRVLDKDHFTTTESGCLLGKAGRVRYYAAYEEHSEILRKGINQEI EQLAEQVGSALTPESGNTPDHDSGDWE >NE0451 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1521 possible transposase MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG APWRNLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS HCKVHPHANGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL IARRLAA >NE2385 Staphylococcus nuclease (SNase) homologues MHFTRALRIQLIPSFFFRAIYPLVVLLILLHAQSGLAETIYRSTDSHGRT LYSDIPTPAAKPLQPATPPARSKYRVTRVIDGDTIVLENNKRVRLLGINA PETGNRYHPGEPGGADAKKWLRGKLQGRSVYLEHDRQTHDHYKRMLAHLY LPDGEHINLSLVEKGLAIANLIPPNLLHANTLIRAQQRAETRKLGIWSMQ HYQPRPLIKLTEKPFGWQRYRVKAKVLKRNHRFSRLIISDNLDLSFANRD LALFPPLETYLNRPLEVRGWVSRRKNHFSIRIQHPSALILY >NE0708 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE0714 Integrase, catalytic core MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWPNRT >NE2541 Site-specific recombinase MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWI PVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLAD FSKMVEVFERYAVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGER IRDKISASKRKGMWMGGVPPLGYDVENRRLVPNEREAKLIRHIFQRFVEL GSSTALVKELKLDGVTSKAWTTQDGKTRDGRLIDKGHIYKLLSNRTYLGE LRHKDQWYQAEHPPIINRELWDSVHAILETNGRVRGNTTRAKVPYLLKGI 
VFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELES AVLDQLRAILRAPNLLGEMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLF PAEQTRIVKLLVEKVIVSPNDLEVRLRANGIERLVLELRPEPVEQQEVAR A >NE1585 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE1667 conserved hypothetical protein MPDMTSASSGTVLAFDFGKRRIGVAIGEHELRMAHPLTTIDQSMTRPRFE KIAELIEAWQPVLLVVGLSVHADGTEHEITRLCRRFARRLEGRFRIPVAL ADERYTTVIARSVLEEVGVTGKKQRPMLDQIAAQHILQTYFDLSHAAS >NE1816 Integrase, catalytic core MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL >NE1586 transposase MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEAV >NE0342 possible transposase MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP DGTGALKKTVFKLSVNHEGAGLPNSSGRGRCQNRGSVFVIPRPGR >NE1553 possible transposase MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG APWRDLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS HCKVHPHASGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL IARRLAA >NE0518 transposase MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA HQQAACGKGGRGVRLWGVPEVV >NE0931 conserved hypothetical protein MTVSSPFQPDCRDCPRLAQHLDQVKTDYPDYHARPVAPFGDSSAKLLIVG LAPGLHGANRTGRPFTGDYAGILLYRTLHKFGFASHDESVSADDPLHLTD CRITNAVKCLPPANKPQPAEIRQCNAFLAVELDNFARNGGQALLALGTIA HQAVLMALGCRNADFPFSHGAIHRVTEELKLYDSYHCSRYNTQTRRLTET MFEQIFDRICQDMAATQ >NE0519 Transposase IS4 family MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC 
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL >NE0806 hypothetical protein MKYSIRFFAVIFAIFLTACSTMYYSGLEKIGIPKRDVLVYRVEKARDTQE ETREQFKSALEQFSAATNFKGGDLEGIYKKLNGEYEASVNKAKEVRSRIE DIENVSAALFREWEQEITQYSNPALKRSSQDRLTETRSYYKQLIAAMKNA ESRIQPVLTVFNDQVMYLKHNLNARAIASLKGELKTLQSNVSTLVAAMEK SINEANTFISNMEKN >NE0228 CHC2 zinc finger MIEQSFIQELLDRIDIVDVVARHLQLKKAGANFTACCPFHNEKTPSFTVN SSKQFYHCFGCGRHGNAISFLMEHSGASFVEAVESLATHAGMQIPDQVSI YPKIPDPGRVPSDKIKIDKEVEATSPLAGLYERMEQAAKFYRGQLKQSDQ AIAYLKERGISGRTALCFGIGYAPPGWQNLSGIFTDYPADDSSHPLVQAG LVVAHDGKKNYDRFRHRIMFPILDRKKKIVGFGGRALDGGEPKYLNSPET SLFVKGRELYNLASASPAIRKSARVIVVEGYMDVVMLVQSGVENVVATLG TATTAMHIQNLLRHTDEVVFCFDGDAAGTKAAWRALETSLPQLKDGKDIK FLFLPDKEDPDSYIRKYGRVAFEGLLEKAQPLSVFFCNELSGRVNLGTSE GRARLVQRAGPLLAQINAPVFGFMLTKRISELTGVGQNQLAAFLKTGKKN RSSTLRPEASRPLSVTPYRRLIQILLHAPDYANKLDTNLLAVNDEQNEEK VLLVALVDFLKTSACSMEEELNSVTILLHFDQTPHRVLLEKIVRDAHVKD ENWNIDAEFTGGMERLREMQRRSRMAELHSRPLVSLTPEEKNELRQLMLS >NE1347 conserved hypothetical protein MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE DDVAQGEIETAHPGYLGSQDTFYVGTLKVWDEFTSKPSSTLIPSGQQRSC IPTKHQLHQLICSMTGYCRSLLNRVWALFAF >NE0236 ccrB, Site-specific recombinase MTQQAVIYCRVSSLKQVTEGHGLASQETRCREYAKHKGYEVVEVFHDEGI TGKLLDRPNMKAMLIYLKQHRATRPVVIIDDISRLARDIETHLHLRASIS AAGGKLESPSIEFGDDSDSRLVEHLLASVAAHQREKNAEQVFNRMKARMM NGYSVFNAPIGYRYDKVGKHGKLLVPDQPCASVIAEGLEGFASGRFETQI ELMRFFEASPHYPKDRFGTVHMQRIKEILSRVLYAGYLDKPDWGIHLVKG HHEALVSYETWKKVQARLNGQAKAPVRKDINEDFPLRGFVTCACCGSPLT ACWTRGGGGLYAYYLCYGKTSGVKCSQNGKSIPKDKLEGEFGALLSEMKP SKEMFLLAAEIFTDLWNIKRDTAKQEAETIRRNLLQIERKTEQFLDRIAD TDNSILITAYEKKIRQLEEEKIALDEKIAQCGRPLQSFDETFRTAFSFLS NPYQLWVSSRLEHKRAVMKLAFSERLRYCRNEGFRTPEKSLPFLLLEGSD EGKHEMVGLVGLEPTTKGL >NE0001 dnaA, dnaA; chromosomal replication initiator protein MQKIETFWHFCLKHFRQELNGQQFNTWIKPLKLEVCPDEKNTLILIAPNR FVLQWIKDNFVTRIDEMAQDHFNERISFRLELREPAESEAQTVRTSAQKN REDKKPAAEKTQGVTSRKTNPSQLNASFTFDAFVTGKANQLARAGAIQVA 
ERPGIAYNPLFIYGGVGLGKTHLMQAIGNYVLELDAGAKIRYVHAEKYVS DVVSAYQHKSFDKFKLYYHSLDLLLVDDVQFFSGKNRTQEEFFYAFNALI EAHKQVIITSDCYPKEISGLEERLVSRFGWGLTVAIEPPELEMRVAILLK KALAEKIELDENTAFFIAKYIRSNVRELEGALKRVLAFSRFTGHSISLDL AKEALKDLLAIQNRQISIENIQKTVADYYKIKVADMYSKKRVRTIVRPRQ VAMAIAKELTQLSLPDIGEAFGGRDHTTVLHAHRKIIELRTSDPGINRDF NALMHILRG >NE0194 dnaB, dnaB; replicative DNA helicase protein MNQLTSSFIQNLAEDNIYKLPPHSIEAEQSVLGGLMLDNQAWDKVADIII ESDFYRQDHQLIYQHISRLIEQNKPADVITVAESLENAAQLQHAGGLAYI GAIAQNTPSAANIRRYAEIVRERSIMRKLAQVSTQITDSAYNPAGRSAGD LLDEAESRIFEIAEQSAHGKQGFVDIQPLLKQVVERIEVLYNRSNPSDIT GIPSGFNDLDQKTSGFQPGDLIIVAGRPSMGKTAFALNIGEHVALETSKP VAVFSMEMGGVQLAMRMLGSIGRLDQHKMRTGQLNDDDWPRLTHALGKLN DAPIFIDESAGLNSLELRARARRLYRQHEGLGLIIIDYLQLMSATSPGSE NRAAEISEISRSLKALAKELQVPVIALSQLNRGLEQRPNKRPIMSDLRES GAIEQDADVILFIYRDEVYNPDTPDKGIAEIIIGKQRNGPIGKVDLTFLG EFTRFENCARTADYY >NE1978 dnaE1, dnaE1; DNA polymerase III (alpha chain) protein MPIDPVFIHLRLHSEYSVVDGIVRVEEAVAKARDVGMPALALTDLSNLFG LVKFYQCAFKAGIKPIAGCDVWVTNENDADRPFRLLLLCQSFSGYLLLSR LLSRAYRENMCRGRAELKKSWFREEDAGTEGLIALSGGGQGEVEQLLLAD PPAAVTAAQQWADLFPGRFYLEIQRCGRPNEETSGYALLDLASSLKLPVV ATHPVQFMRPEDFRAHEARVCIAQGYVLGDRRRPKEFTGQQYFKTPAEMG ELFRDVPEALANSVEIARRCSLMLELGVNRLPDFPTPAGISVEQHLRELA QTGLEARLLQSFPQVLQRDERRPIYQMRLDFEVETIIQMGFAGYFLIVAD FIGWAKQHDVPVGPGRGSGAGSLVAYSLGITDLDPLLYDLLFERFLNPER VSMPDFDIDFCQDRREQVIEYVRDRYGAESVAQIATFGTMAAKAVVRDVG RVLDLPYNFVDQLAKLVPFELGMTLRKAREIEPLLNQRAEEEEDVRNLLE LAERLEGLTRNVGMHAGGVLIAPGKITDFCPVYCADSGDAVVSQYDKDDV EKVGLVKFDFLGLRTLTILDRAVADIRQYRAASPGSAVAEPDVQSAEESH FSLESISLEDAATFSLMAKGNTVGIFQFESRGMKDLLQRARPDRFEDLIA LVALYRPGPMDLIPDFIERKHGKRVDYLDPRLQPILGPTYGIMIYQEQVM QIAQVIGGYSLGGADLLRRAMGKKKVEEMAQQRAVFVEGAIRNEMAEADA VTLFGLMEKFAGYGFNKSHAAAYALIAYQTAYLKTHYPAEFMAACMSSDM DDTDKVNVFYEDCKLNGIVILPPDINESGYYFVPVDHKTIRYGLGAVKGS GEAAISAIVQVREQGSTFTGLFDFCRRVDRRIVNRRTIEALIRAGAFDSV ETNRAALLESVGNAMEYAEQCSLAASQVSLFDENTDLIQPPAITGVAQWP EREKLQNEKMALGFYLSGHPYDSYARELSCFIPVRLSRIVPGREPQLIAG VIYAIRTQMSRRGKMAIVTLDDGLARVEVVVYSDLLSTGSHFMKADQLLV 
VRALVSHGNGENADRRIVAKEIYDYVTARSMHARKLRIMIDDSGLLTPAQ LKELLAANLPENGVNNVIPSSGCAVSIDFRNQVGSCEIDLSSRWRVHLHE GLIESLMDILGRDKVEVVY >NE0002 dnaN, DNA polymerase III, beta chain MKLTITDRDLLFKPLQTVSGIVERRHTLPILSNTLIEIRNGQLTLVTTDL EIEAEATSNIPELENQGALQTTVSVRKLQDILRALPSGAAIELTRSENRL QIVSGKSRFSLQLLPAEDFPRMIRDSEPCSATYTLAQRVLKKHLQRVAHA MAQQDLRYYLNGMLLLIEDNKLTLVATDTHRLGITSIDLDGNFEKSETIV PRKTVLELIRQLEDSDKPVIVEIYPKKVCFRFSDAVLVSKVISGKFLDFR RAIPQTSVFQFDVNRLDFLHALQRTAIISSSNDLFRNVHLNITNGKLNIS AKNKEQEEAQEEIDIVYSNETIDTSFNIVYLMEVLNNLDSEQIRCSFESM QSAILITLPDDEQFKHVLMPMRE >NE0141 dnaQ, probable DNA polymerase III (epsilon chain) protein MRYVFLDTETTGLDPALGHRIVEIAAVEVCNRRLTDRHFHRYLNPGRESD EGALRVHGLTREFLRDKPVFQDVCSEFLEFIADAEIFIHNAPFDVGFINR ELDLIRFESMQNHCLQIIDTLVLAKELHPGKRNNLDALCERYQIDNSHRT LHGALLDAELLAEVYLAMTRGQESLLMEMDAPASRQADNPAVGKVENLAL IVQPATQAELELHSRLVERINAESKGNCLWNG >NE0433 dnaX, dnaX; DNA polymerase III (subunits tau and gamma) protein MTDSQVLARKWRPKDFSELVGQEHVVRALINSMEQNRLHHAYLFTGTRGV GKTTVARILAKALNCEQGVTAAPCGKCAACMAIDQGNFIDLIELDAASNT QVDAMRELLDNAQYAPVAARYKVYLIDEVHMLSRSAFNAMLKTLEEPPEH VKFILATTDPQKIPVTVLSRCLQFNLKQIPPSLIVERLTEILSMEGIPAD AAGLRLLAQAAKGSLRDALSLLDQAIAFGNSVVNESDTRAMLGVLDQDHI FALLEALAEQNGAAIFAIADQLEAASVSFDQALQDLAALLHRLATAQVIP QMLDETQPDGDRLLALTKRFSPEDIQLFYQIVLHGRTDLAHAPDEYSGFT MTLMRMLAFMPDSRQPGRAYADTGTDHAREVKVEAPSCPREAKPVSDQSP NEAWLALVNQLKLSGMTRMLAQYSEAKSFSESRIELYVAEMHKHLLEKSY QDRLRSQLEIHFGKPVEVIFSQGSITGVTSAALQDRDKLARQSKAVEAIE SDPYVQELIEQFDARLNVSSIKPID >NE0875 fis, probable factor-for-inversion-stimulation transcription regulator protein MTVINENEIALCIRRAVEAYFQDLDGEKPCPIYEMVIRSVEKPLIEIAMH YAQGNQSKAAELLGINRNTLRNRLTKHQIR >NE0332 gyrA, DNA gyrase/topoisomerase IV, subunit A MEQFAKETLPVSLEDEMRRSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY AMHELSNDWNRPYKKSARIVGDVIGKYHPHGDTAVYDTIVRMAQPFSLRY MLVDGQGNFGSIDGDNAAAMRYTEIRMSRIAHELLADLDKNTVDFGPNYD GSEQEPLILPAKIPNLLINGSSGIAVGMATNIPPHNLGEVIDACLLLLRD PDVDIAELMACIPAPDFPTAGIIYGISGIKDGYQTGRGRVIMRARTHFEE LDKGNRHSIIIDELPYQVNKANLLVRIGELVRDKRIEGISDLRDESDKSG 
MRVVIELKRGEVPEVVLNNLYKETQMQDTFGINMVALVDGQPRLLNLKQM LDHFLRHRREVVTRRTLFELRKARERGHLLEGLAVALSNVDEIIALIKAA PTPAEAKKGLMARTWRSSLVEEMLLRAMIDAAVFRPETLAAGFGMSDQGY RLSDAQAQAILDLRLQRLTGLEQEKIVSEYREILDKIRDLLDILANPERI TTIIVEELTAIKGQFGDPRRSEVVIDAQNLNTEDLITPADMVVTLSHAGY IKSQLLDDYRAQKRGGRGKQAITTREDDFIDNLFIANTHDFILCFSSLGR VYWIKVYNVPQGSRTSRGRPVNNLVPLEQNEKINAVLPVKSFDDTRYVFM STAGGTVKKTPLSEFSRPRTNGIIAIDLDEGDYLIGVALTEGKHDVMLFS DAGKAMRFDENDVRPTGRNARGVRGMKLGAGQQVISLLVADNENMAVLTA TENGYGKRTPITEYTRHNRGTQGMIAINTNVRNGKVVAAQLVESSDEIML ITTGGVMIRTRVSEIREMGRATQGVTLINLDAGEKLAGLERIVETDED >NE0003 gyrB, DNA gyrase, subunit B:DNA topoisomerase II gyrB MNTNQPESAKKTDNSHRDYNSDSIKILKGLDAVRKRPGMYIGDTSDGTGL HHMVFEVVDNAIDEALAGYCDDISVIIHADNSVSIHDNGRGIPTDIKQDD ELKRSAAEIVMTELHAGGKFDDNSYKVSGGLHGVGVSVVNALSEWLRLTI RRNGNVYQMEFREGVAVAPLKVTGQTEKHGTEVHFLASQSVFGDITYHYD IFAKRLRELSFLNHGIKIRLADQRDDREEVFAFTGGIRNFVEYINRSKTV LHPSIFYAKGLKDNITVEIAMQWNDSYAEQVLCFTNNIPQKDGGTHLTGL RAAMTRTLNNYIEKNELAKKAKVDTTGDDMREGITCVLSVKLFEPKFSSQ TKEKLVSSEVRPAVEEIVVQKLSDFLLENPNEAKTICNKIIEAARAREAA RKARELTRRKGVLDSMGLPGKLADCQEKDPKLCELYLVEGDSAGGSAKQG RDRKFQAIMPLKGKILNVEKSRFDKLISSQEIVSLITALGTGIGKDEYNP DKLRYHRIIIMTDADVDGSHIRTLLLTFFYRQMPELIERGHIYIAQPPLY KIKHGKQERYLKDDYELKHYILGLALVGAELHTGANNPPITGEALARIAD EYLLAETVIERMSRLIDRTVMYALLKQPDIDLSSETSARDSAARLAILLD DVEILAEYDENFERYRLKIIRKQHGNLRTSYLDDDFLQSGDFARIRQAAQ ILHGLIGEGAKVKRGEQEISVREFKEALEWLLEETKKGITIQRYKGLGEM NPEQLWETTMDPGNRRLLRAQIEDSILTDEIFTTLMGDVVEPRRAFIESN ALRARNIDI >NE1137 holA, putative DNA polymerase III (delta subunit) protein MRLDPEHLARQLDGSIAPLYVVLGDELLLVMEAVDGIRAYVRGQGYTERT ILTADQRFDWMNLFQWGRQSSLFSERRMLDLRIPSGKPGREGGVAIETFC RELPRDTVTVVTLPEIDKQGRASKWFKALEQAGQVIEVKPVGRDRLAHWI KQRLDRQNQMIDQDTLQFFAGKVEGNLLAAHQEIHKLGLLYPPGRLTFEQ VKNAILDVTRFDVLQLPETMLTADMVRYRHILEGLQGEGVAPPLILAILS EQIRLLIKIHLLKNSSRGMTIEQAMTALRIWPARQKLMMGAIQRIRYPLL VQALLQAAVIDRIIKGVEQGDIWEELLNLGICFAADSSFKIIGRKDLSFI INLSLK >NE2180 holB, putative DNA polymerase III (delta' subunit) protein MATAEIFPWQRVIWQQARQSGSAQRHHALLLKGRRGIGKLGFALALAKSI 
LCGQGDAAGVACGKCQDCYWFEQGLHPNFRLLEPEALSAQEGATDKDDEE NRREAGSTKSGRKPSQQISIAQIRALDDFIYLSAHQARDKVVLIHPAEAM NTAAANALLKKLEEPPPEVLFILVTHNVSLIPPTVLSRCRQTAMPGPDHE MAKDWLIHQGITDPDFHLAMSGFSPLLALQYDERLAASHTDFIQCLCAPE RFDPIELAEKLHKLDLSSVTGWLQKWCYDLMSCRTSGRVRYHLKQVAVIR QQAAVIDPVAFGFLWRNLIASQQLARHPLNPRLFLEAMLLTYMDSIRPAG SAG >NE0442 holC, putative DNA polymerase III (chi subunit) protein MLIACRLCAKAVQQGLKTVVYVPDERLAGQFDKLLWTFTPTGFVPHCRVD NKLADVTPVIMNSRPVLMEAGCFGVLLNLDADVPPGFEQFPRVVEIVDEA EDGKLQARKRYRHYQEQGHDVRHHRLDGN >NE0833 hrpA, HrpA-like helicases MTYLPEQPACITYPEDLPVVARREEIAHAIQQHQAIIICGETGSGKTTQL PKICLELGQGAGRQGTGHLIGHTQPRRIAARTVAARIAAELNSPLGKLVG YKVRFSDQTHPNTRIKLMTDGILLAETQQDPLLRAYQTIIIDEAHERSLN IDFLLGYLKQLLPRRPDLKLIITSATIDAQRFASHFNDAPIIEVSGRLFP VEIHYRPNDPIDGEDRDLPRAILSTIDEAMRMGEGDTLVFLPGEREIRET AETVRKYAFSGPGGKAGLEILPLFARLSHTEQARIFAPGQQRRIVLATNV AETSLTVPGIRYVIDTGLARINRYSYRNKVEQLLVEKISQASANQRAGRC GRVMNGVCFRLYSEEDFNARPEYTDPEILRSSLAAVILRMKSLKIGDVEQ FPFIQPPAPRMIADGYQLLSELGALDERKGLTQIGHQLARFPTDPRIARM IMAAKQENCLSEVLIIAAALSLQDPRDRPFEHQQAADQAHQPFRDDRSDF MGYLKLWDFYDELLKHKKSNKKLIEQCQKNFISHRRMREWREIHGQLHIL ISEMGLRPNQVSAGYDEIHRALLSGLLGNIGFKSDEKGVYEGARAIKFSI FPGSSLRKKQPKWVVAAELAETTKLYARCAAAIDPAWLERIAGKLCKRHY FDPHWEKQRAQAMAFERITLYGLTIVPKRRIAYGPIDPAHAREIFIRQAL VAGEYESTAPFLQHNQQLIDEIRELESKVRRQDILVDEQQIFEFYAARIP AGIYSGTAFEKWRKQAEQTEPELLYLTREVLIRQAVDGTAAEQFPETLTA AGHVLPLSYRFDPGHPLDGVTVTVPLPLLNQIMPFHFDRLVPGLIREKIG WYVKMLPKQVRRHAIPVPQFVTRFLEWLDSCPDQAMLLAESLTAFIRSET GIKVPLDTWDSRLLPVHLQMNVKVIDDAGMTLGMGHDLIELKAQFGQTAQ QLFARGAGAEPDSIERDDITRWDFGELPVETRFSRAGKLLTGYPALVDQE QSVAVRIFDTQEGAQRSMRGGVLRLLCLALKDRIKQLEKNLPVDRQAILL MSSLIEMDRLKEDIRSAIIDLALIGDDPLPRNEDEFNSQTSRARTRLGSV SQEIAGLIHTIAQPCQELKKRLSVLDKSAVFLKKDMEEQLHHLIYPGFLS TTRWQYLQHLPRYLKGMILRLDKYNKNPARDQEQTEIISTLWNQYIQRLN KHRQAGVIDPNMEIFRWQIEELRISLFSQELKTPAPVSVKRLQKLWESVR E >NE2207 hupB, Bacterial histone-like DNA-binding protein MNKSDLIDVIAQSADLTKAQAGNALDGALSAIKDALGKNDSVTLVGFGTF KVGKRAARTGRNPRTGAEIKIKAAKVPKFTAGKALKDAVN >NE0952 ihfA, Bacterial 
histone-like DNA-binding protein MALTKAELTDLLFENIGLNKREAKEIVECFYEEMRAALQNGDGVKLSGFG NFQLRTKPQRPGRNPKTGEEIPISARRVVTFHASQKLKSMVEANYRGESG TN >NE1961 ihfB, Bacterial histone-like DNA-binding protein MTKSELISKLAERFPQLLAKDAELVVKIILDAMAKSLSRGERIEIRGFGS FDLNYRPSRVGRNPKSGEKVHVPEKYVPHFKAGKKMRELIDSGPKQHKVL DRVTG >NE0450 int, Phage integrase MQWIKRFILFHGKRHPQEMGSAEIEAFLTHLAVAGKVSASTQNQALSALL FLYKEILSIDLPWLNEIVRAKQPQRLPTVLTRTEVQAILVRMSGTYGLMA NLLYGTGMRLMECVRLRVKDVDFERGEILIRDGKGSKDRVTMLPESLAGP LQAHLLHRRTLFDDDSRLGKASVYLPDALERKYPNAATDWVWQYIFSSGS FSIDPRSGTERRHHIDEKLLQRAMKKAVQASGITKLATPHTLRHSFATHL LDSGYDIRTIQELLGHKDVHTTMIYTHVLNKGGRGVRSPLDM >NE0235 intF, Phage integrase MLTKVRLTPSRIAAHTCPADASQAFLWDTATPGLAVRATAGKRAFIFQGR FAGKSIRITIGDTEVWTIEQARQRARELQGLVDQGRDPRLVKQEKIAADV QARITDEPALPAWRDYIAARSGKWSEAHAADHLKMARDGGEPVTRGRRIG APAYTEKGILRPLLDLPLKGITREKVAQWLDNEATRRPAQARLALSLLGT FLSWCGNQPAYRNQVNSDACAKLKRELPKPTARTDCLQREQLASWFAAVR SIDNPVMSAYLQSLLLTGARREELAGLGWEDVDFQWQTIHLADKVEHSGR TIPLTPYVSQLLQSLPKINEFVFASKRAKSGRLQEPRKAHNQAIEAAGLP PLSIHGLRRSFATLSEWVEAPSGITAQIMGHKPSAIAERHYKRRPVDLLR VWHTKIEEWILSNANI >NE2189 intINeu, Integron integrase; Phage integrase; Phage integrase N-terminal SAM-like domain MGNTNTPPKLLDQVRDRIRIKHYSLRTETQYVQWIKRFILFHGKRHPQEM GAAEVEAFLTHLAVVGKVSASTQNQALSALLFLYKEVLSIDLPWLDKVVR AKQPQRLPVVLTRTEVQAILVRMSGTYGLMANLLYGTGMRLMECVRLRVK DVDFERGEILIRDGKGAKDRVTILPESLVSPLQTYLLQRRVLFDDDIRLG KASVYLPDALERKYPNAATDWIWQYIFPSGSFSIDPRSSVERRHHIDEKL LQRAMKKAVQTSGITKLATPHTLRHSFATHLLDSGYDIRTIQELLGHKDV HTTMIYTHVLNKGGRGVRSPLDM >NE1753 lig, NAD-dependent DNA ligase MISENTIEERLQALRAAIALHDFHYYVQDAPVIPDAEYDALFRTLQQLEQ QYPHLVTPDSPTQRVGAPPLKVFAQLTHQTPMLSLANAFSEEEVTAFDRR IREALNIDRVDYAVEPKFDGLAISLIYANGILTKGATRGDGYTGEDITLN LRTIPSIPLRLQVPFPTGQFEVRGEVVMLKTDFERLNEQQRKNGEKTFVN PRNAAAGSLRQLDSRITAMRRLTFFAYGIGAYHEDQPIFSTHSEILAYLA TQQFLVARQSSTVMGANGLLAYYREMNAVRLSLPYEIDGVVYKVNDLAQQ EKLGYVSRAPRFAIAHKFPAQEVSTELLAIEIQVGRTGALTPVARLAPVF VGGVTVTNATLHNEDEVQRKQIMIGDTVIVRRAGDVIPEVVAVIVERRPT 
HAQAFVMPDHCPVCGSKAVRLPDEAVTRCTGGLYCPAQRKQAILHFASRR AIDIDGLGEKLVDQLIDRELVHTPADLYRLDIDTLAGLERMAGKSARNLV TAIEDSKKTTLPRFIYALGIRHVGEATAKALASHTGDLDRLMDMNAEQLQ QIPDIGPIVAQSIADFFSEAHNREVIEQLLSCGLQWEKPSHIAQPSSRTN LAVPGKTFVLTGTLPTMTRDQAKNRIEQQGGKVTGSVSSATSYVVAGSDP GSKYARAIELGIPVLDEDQLLSLLRDTSSSE >NE0008 mfd, mfd: transcription-repair coupling factor MSSKLNPLSSESLPRYTGLEGSSDACALARLANRNPAGQLLAVITASALD AQRLLEEIPFFAPDLRVSLLPDWETLPYDIFSPHQDLISERLATFYQIAH NACDVLIIPVTTALYRMPPREFLAAHSFFVNQGSTLDLQSFRSQMSLAGY SHVSQVLSPGEYSIRGGLIDLFPMGSPLPYRIDLFDDEIESIRTFDVDTQ RSIYPVKEIRLLPAREFPLDDNGRSRFRTGFREKFEGDPTRCRLYQEISK GNIPAGIEYYLPLFFEQTATLFDYLAQHSTVCLHGEITPAIENFWQDTRS RYQLMRNDPDRPLLPPMDLFLPEDQFYGYLKSYKRIEMHTGQQVKTDKPF ARSLPPVRVDRRASNPIEQLTAFVHTFTQKGGRVLLLAESMGRRELMAEY LREYGLKLKLCEDFAAFQSDTASCMLSVASLHSGFILAAENLALVTENEL YATHVRGQRTRDARKTVSADSILRDLSEIKPGSPVVHEQHGIGRYLGLVN MNMGEDDSGQSSEFLALEYQGGDKLYVPVTQLHLISRYSGAAPEAAPLHK LGSGQWEKAKRKAMQQVRDTAAELLNLYAQRAARKGHIFRFNQHDYNAFA DGFGFEETPDQATAINAVIQDMVSGKSMDRLICGDVGFGKTEVALRAAFV AVTDGKQVAVLVPTTLLAEQHYQNFSDRFGLIADQWPVKIAELSRFRSAR EQAEALQSLAQGTTDIIIGTHKLIQDKVKFKNLGLVIIDEEHRFGVRQKE QLKKLRAEVDVLTLTATPIPRTLAMSLEGLRDFSVIATAPQRRLAIRTFV HPYSEGIIREACLRELKRGGQIYFLYNEVSTIQNMYTRLTTLLPEARINI AHGQMRESELEHVMRDFYQQRFNLLLCTTIIETGIDIPTANTIIIHRADK FGLAQLHQLRGRVGRSHHQAYAYLLTPPEKAALTTQATRRLEAIQAMEEL GSGFYLAMHDLEIRGAGAVLGDSQSGEMQEVGFSLYSSLLDAAIKSLKAG HEPDMQQPLGVSTEIRLHVPALLPESYCGDIHERLILYKRMAGCSDETEL DEIHQELIDRFGLLPDPARALLDSHRLRIEARQLGITRIDAGPDNIQLQF VPEPPIEAIKIIQLIQSSKEYSLSGPDRLSVRLQIPDVGERVKKIKKLMT LLKN >NE1742 mutL, mutL; DNA mismatch repair protein MRPIKLLPDGLISQIAAGEVIERPASVLKELLENAIDAGTTDISVNIAQG GLKLIRVTDNGGGISGEELPLALTRHATSKIASQEDLYRITSLGFRGEGL ASIASVSNLLLISHQPGGKHAWQIRSEGIRVMQPEPSSHAAGTTVEVRDL FFNLPARRKFLKTEATEFAHCEEIIRRMALSHAGIAFTLRHNGNLRGHWQ SAEAAVRIKTVLGEEFTRSAAWIDERSAGIGLQGMLALPAYSRAARDMQY FFVNGRFVRDKLITHALREAYRDVLHLDRHAAFVLYLDIDPEQVDVNVHP TKTEIRFREARAIHQFIYHGVSKALSLPRSGTELSQSSSQLMADDIVPPA EKRVPAAPMLNYPRQTGLPSEMIAQPFNFYQVLSGSESDSTATQNPFRQT 
GAGESNEHPALPPLGFALGQLHGVYILAQNWKGLVIVDMHAAHERIVYEQ LKLQMDEQTLSAQRLLIPVTFHADSLDIATAEENQSLLQQLGFEVTVLTA TTLAVRAVPAILQDADTEKLVCNVLDEIRNGDPGQLLAARRNELLATMAC HGAVRANRPLTLIEMNELLRKMEVTERSDQCNHGRPTWFEISLAELDKMF MRGK >NE2552 mutM,fpg, Formamidopyrimidine-DNA glycolase MPELPEVEITRRGIDTHLAGRVITQISIRNPVLRWPISAGLIALLPGQRI NAIARRAKYLLFACSRGTLIMHLGMSGNLRVLPESTPPQLHDHFDLQVDN GMMLRFRDPRRFGAILWWDGDIRQHPLLQKLGPEPLSDDFDGQFLYTKTR GRNASIKEVLMNQHIVVGIGNIYANEALFQAGISPLAAAGSLNTMQCERL VDAVKATLLRAIKAGGSSLRDFTDCEGSPGYFQQQYWVYGRAGQSCRQCG ELVSKTRQGQRSTFFCARCQH >NE1705 mutS, mutS; DNA mismatch repair protein MNKAEQSSHTPMMQQYLRIKAQHTDKLLFYRMGDFYELFYEDAEKAAKLL DITLTQRGSSAGEPIKMAGVPFHAADQYLARLVRLGESIAICEQTGDPAT SKGPVERQVIRILTPGTLTDAGLLEERSNSIVLALALHRGSIGLAWLNLA AGDMRVLETSSDNLTSELERLHPAEILLPESLDLPATLNNFAGPKRLPDW QFDYEHAMQQLTRQFGTRDLNAFGCEDLHAAIMAAGALFEYVRLTQQTAT DGSSGQLPGHLHTLQVERQDAYLRMDAATRRNLEITLTLRGEDAPTLSSL LDTCSTGMGSRLLRHWLHHPLRNRITLQQRLDTVSDLIGAQPETLYAGIR QQFKHIADIERITSRIALRTARPRDLSGLRDSLMRLPGIIELIATSAAAA VHRFIPPMQPDPLLTQLLVRALQPVPGAVIREGGVIADGFDAELDELRGL QGNCDEFLLQLEARERERTGIPNLKVEYNRVHGFYIEVTRAQGEKIPPDY RRRQTLKNAERYIIPELQAFEHKTLSAREQALAREKMLYERLLEQLADFI IPLQEIARSVAELDVLCAFAERAALSGYTKPVFTDDPVLIIEAGRHPVVE NQVEHYIANDVQLGAITRENRQMLVITGPNMGGKSTYMRQTALTVLLAHC GSFVPAQIARIGPIDQIFTRIGAADDLAGGRSTFMVEMTEAAGILRNATA QSLVLVDEIGRGTSTFDGLALAFAIARHLLTQNQSYTLFATHYFELTRLA EEFPQAVNIHVTAVEHKRRIVFLHRIEEGPASRSYGLHVAALAGVPDRVI RNAAKILARLEQETLSRSPQQTLFETVEENAKAVPASVHPVLDYLERIHP DELTPRGALEQLYLIKSMLNQTD >NE0056 mutY, HhH-GPD MTPRTAGTIHFPADAPDSFAGRLIRWQLECGRHSLPWQGTRDPYAIWVSE VMLQQTQVSSVIPYYQRFMASFPDVASLAGVPVGDVLTLWSGLGYYSRAR NLHRAACVIMEQYSGVFPQDAATLQRLPGIGRSTAAAIAAFAFGERGTIL DGNVKRILARYFGISGYPGEKSVEERLWQLAESLLPAEESNHQIVVSYTQ ALMDLGALVCARSRPRCQYCPLQADCIACQNDLTADLPVPKPRKTLPVRE TVHLILLDQERILLKKRPASGIWGGLWCFPEMSVDQDSIDYCEKNLHVRV TKLARLPHLQHTFTHFKLIIQPHLLQSIMHQPVCEEKCEENSYLWLTIEQ AMQQAIPVPVRKLLSMAYPYFQYHIHE >NE2223 nth, HhH-GPD:Iron-sulfur cluster loop (FCL) MNTTKRREIFTRFRAANPRPTTELEYQTPFQLLIAVILSAQATDKSVNLA 
TRKLFLVADTPEKILQLGETGLSPFIQRIGLFRTKTRNILATCQLLIEQY NGEVPRTRTELEKLPGVGRKTASVILNTAFGEPTIAVDTHIFRVANRIGI APGKNVLEVERKLLKVVPDEFRHDAHHWLILHGRYICKARKPLCHQCLIV DLCEFKEKNLEGTASSLDMKQLT >NE2253 ntpA, NUDIX hydrolase MQRYKLPVSVLVVIYTADLQVLLLERADHPGYWQSVTGSQDPGETLLQTA VREVREETGLNTDDYVLSDWQIQNRYEIFEEWNWRYPPGTTHNTEHVFGL ELPKTIPAVVSSREHLGYVWLPWREAAEKVFSSSNACAIRMLASKRKSEN SR >NE0885 ogt, Methylated-DNA--protein-cysteinemethyltransferase MNYYTFLESPVDRLLLTSDGEFLTGVYMEIEIQKLLPRMTDDWRQDAAPF AEAIAQLNAYFAGELIQFDLPMKATGTPFQEAVWQSLSTIPYGETVSYKN IAERLHLPKAARAVGMANGQNPISIIIPCHRVIGANGKLTGYGGGIHRKQ WLLAHEDKQTSFA >NE1468 polA, polA; DNA polymerase I protein MKTLLLVDGSSYLYRAFHALPDLRNRLNEPTGAIYGVLNMLRRLHKEYRP DYSACVFDAKGKTFRDDIYPQYKAHRPPMPEDLVCQIGPLYACIRAMGWP LLIEEGVEADDVIGTLVERAIARQAQCVIATGDKDIAQLVRPGIWLVNTM NNESLDESGILQKFGVTPAQIIDFLALVGDSVDNIPGVEKVGPKTAVKWL DQYGTLDDLIAHADEIKGVVGENLRKALDWLKVSRKLLTIKCDVPLAMDW QDLVAVPPDTARLTELYEHLEFRSWLRELKQPGPEKNEKAESSVMAAIVD DPSVPEGENDDGRDYQIILTDAQLGDWLAQCESAELVSIDTETTSLNPME AKLVGLSFCMELGQAAYIPLAHHYPGVPSQLNREQVLQRLKPWLESDEKL KIGQNLKYDRHVFANHGVMLNGIVHDTLLQSYVLESHLSHDLDSLASRHL GIQTISYDEVTGKGAKRIGFEQVEIHRAGIYAAEDADIPLRLHRVLYPVI SQDAHLEYIYQQIEIPLLEVLFRIERNGVLLDTDLLRVQSGELTQQLVAL EQQAHSLAGHAFNLNSTKQIQEILFGQHKLPVIKKTPKGVPSTDEEVLQR LASDYPLPKVLLDYRGLAKLKSTYIDKLPQMVNKQTGRVHTHYAQAVAVT GRLASNDPNLQNIPVRTPEGRRIREAFIAPDGWLIMSADYSQIELRIMAH ISGDAGLIHAFSEGQDIHRATAAEVFGVPVEQVNPEQRRYAKVINFGLIY GMSEFGLATQLGIERTAARTFIDRYFARYPGVADYMQRTRELAKQHGYVE TVLGRRLQLSDIRSNQRNRQMGAERAAINAPMQGTAADIIKLAMISVHRW LAEAQLQSKLIMQVHDELVLEVLVDELPVIKENLPRLMENVLKLDVPLKV QTGIGKNWDQAH >NE1505 priA, probable priA; primosomal protein N' (replication factor Y) MVIIRVALDVPIDRLFDYLAPDADTADIGRCVRVPFSSRQISGIIISVCE TSSVPEGKLKYAGQIDRQTPPLPQPLLGLFEFCSRYYHHPIGQVVMNGLP VLLRKFKHTGKEQPPSWRLTDTGKSITLADLPIRAKAKRQLISLLSEHGI ITAEICKAMSSHSRKLLHEFKDLGWVEQFTALPEKAVFSTASSPAPTAEQ AQAISEILDRTGTFTPWLLNGITGSGKTEVYLQVTASLLAQQKQVLILVP EINLTPQLEAVFRKRFPGTTLVSLHSGLNNSERLQGWLQAQRGKAGIVLG TRLAIFTPMPELSLIIVDEEQDHSFKQQDGLRYSARDLAIYRARQANIPV 
ILGSATPSLESYHQARTGRYRLLQLHSRAISQAALPTIRCIDLRVIPAQE GLSEPVLDALRHCLARKQQSLVFINRRGYSPVLLCKSCRWIATCKRCSSR LVVHLRDRQLRCHYCGDQQPVSPACPQCGDPDVLPFGHGTQRVEAALIRH FPEARILRVDRDSIRHKGAWQQMLDRIHRGEADILVGTQLLAKGHDFPNL ALVCALNADASLYSTDFRAEEHLFAQLIQVAGRAGRANVPGSVLIQTEFP QHPLYQALIRQDYAAYAQAHLKERRSAGFPPFVYLAVLRAEAPVLTDALE FLRQAAALAAVTENYPHIQLFDPVPAHMTRLKGLERAQLLIQARSRRHLQ TFLGDWHQRITALPVHSRIRWHLDVDPLTL >NE1464 radC, DNA repair protein radC family MAISDWPEAERPREKLIEKGAAALSDAELLAIFLRTGITGVSAVELARKL LTHFGSLTKLCAASLHEFSELPGMGPAKFAQLQAVMEMAKRALAEELKNG DIMDSPQSVRNYLCLSLKGKPYEVFVGIFLDARHRTIVTEELFNGTLTQA SVYPREVVKRALYHNAAAMIFAHNHPSGIAEPSTADEILTQSLKQALALV DVKVLDHFVIGSSEVVSFAERGLI >NE0507 rdgC, putative recombination associated protein rdgC MWFRNLLIYRLAGEVITSDELEAYLAKQTLQGCLGLEPQSRGWVPPGIAE ADLVYSYGQQMLIALGTEKKLLPASVVNQLAKVRAQEMESHQGYAPGRKQ MKEIKEAAYRELLSRAFAIRQRSHAWIDPVGGWFIVEGASASKADALIEA FIKSTGIGLKRIRTTMAPTSAMTAWLSGDDPPAIFSVDSDSIFRSREDKK VSVSYIRQSPDPQEITRHVRTGKEVIRLAMTWRDKISFILDENLQLKRLT LLDIDREPAETAEEQFDSNFFLMTEELRQLLPDLVEILGGMTAD >NE1932 recA, RecA bacterial DNA recombination protein:AAA ATPase superfamily MDENKNKALSAALAQIEKQYGKGSIMRLGDSDVAKDIQVVSTGSLGLDIA LGVGGLPRGRIIEIYGPESSGKTTLTLQAIAEMQKLGGTAAFIDAEHALD PQYAQKIGVNVQELLISQPDNGEQALEITDMLVRSGSVDVVVVDSVAALT PRAEIEGEMGEPQMGLQARLMSQALRKLTANIKRTNTMVIFINQIRMKIG VIFGNPETTTGGNALKFYASVRLDIRRTGSIKRGEEMVGNETRVKIVKNK VAPPFKQADFDILYGEGISRESEIIELGVLHKLIEKAGAWYSYNGEKIGQ GKDNVRDYLKEHKSIAHEIEQKIRAAVGLAETDSRVVPPSSGE >NE1850 recG, RecG-like helicases MAAHFFDSLDEALRKKLEKLGLFSDFDLVLHLPLRYEDETRLSPISQAVP GSTVQVEGVVAEQEVLVRPRRQLVCRVDDDSGTLYLRFFNFYASQVTAWS PGTRLRVLGEVRAGFHGVEMVHPKCRVVRGSMVLANTLTPVYPGMAGLPQ RTLARLIMQAFERLRAKRLLQETLPATILSACQFPAFEDSLSILHCPPAG VSITSLQQRSHPAWFRIKFDELLAQQLSMRCHYHQRRSQQAPVLQQQTGL QQALLEVLPFGLTDAQCKVVTEISKDLAQPYPMQRLLQGDVGSGKTIVAA LAALQSIGNGYQVAVMAPTEILAEQHFRKLSDWLTPLGVGVGWLSGSQKK SLRNQELERTATGEAMLVIGTHALFREAVQFKCLGLVIIDEQHRFGVGQR LALRMKGGDEEVIPHQLMMSATPIPRTLSMSYFADLDVSVIDQLPPGRSP VVTRLIDSSRREEIVARIREACLAGRQAYWVCPLIEESEALQLKTAVETY 
ETLSQTFPDLRIALIHGRLDSDEKSVIMAEFSQGEVQLLVATTVIEVGVD VPNASLMVIEHAERMGLSQLHQLRGRIGRGSATGVCVLMYQQPLSEVARK RLQIIFEHRDGFEIARQDLLLRGPGEFLGTRQSGVPLLRFANLEEDIDLL EMARNAAENMLRDHPLAAQCHMQRWLGRKEDYLRA >NE0010 recJ, recJ: single-stranded-DNA-specific exonuclease MANITIREFPAHAYEILSAHGFPSVLARIFAARGINHPEQLETTFARMAS FEQLKNIQRIAVLLADAIAAKKRLLVIADYDTDGATACAVALRALRQFGA MVEYLVPNRFEYGYGLTPEIVRLAADQVPPPDILITVDNGIASVEGVEEA NRLGMQVFITDHHLPGDRLPDAAVIVNPNQPGCSFPDKHIAGVGVIFYVM LALRAELRERSAFTATGKEPNLASLLDLVALGTVADVVRLEGTNRILVQQ GLQRIRNGYCCAGIHALFKAAGRDFSRVTTYELGFILAPRLNAAGRLDDM SLGIECLLTEDESHALRLASELDELNRQRREIESGMRDEAMDKLDDVIDL LNQSDTPADNGKQSVYSLCLYDPAWHQGVIGLIASRVKDRLHRPVIIFAQ GNEGEIKGSGRSIPGLHLRDALDLVAKRYPGLIVKFGGHAMAAGLTVYEQ HFEQFRTAFEQVAQSLLTPADLIQVIETDGELAETDLTLELAQYLTNQVW GQGFPEPSFNGCFRVENQRIVGEKHLKLKLRKTGAAQVYDGILFFHTERL PTEIDAVYRVQINEYNGSTRMQLLLEHWFESGQAHYG >NE1479 recN, ABC transporter:DNA repair protein RecN MLQNLSIRNFIIVDHIDLHFKSGFTVLTGETGAGKSILIDALELVLGRRA DTSQIRYGCKRAEITAQFSVNTIPALQEWLVENALEDETGICLLRKIMES GGRSRNFINGHPATLQQLRTVGEWLVDIHGQHAHQLLMHGHKQCELLDAW AGESNLAREVASAYRHWQDLCQQRLAWEQHSEQNLQEHETLTWQLQELAA LNFSLEEWENLQIEHNRLTHTASLLETAQFSLESLSENETAVLAQLSTVL TRLNSLIDIDNTLEPLCNQLQSAQIQLQEIVYELKRYQQHLDIDPRRLQE TETRIAAIHGTARKYRIMPEILPDLLETTRQRLESLENAASSEALMKAEK SARNNFENLAARLSQARQHAADQLSGLVTETMQTLAMAGGRFNVALIPIP SGNLHGMEQIEFQVSAHRDLPLRPLNKVASGGELSRISLAIQVITSKAGT VPTLIFDEVDTGIGGRIAAIVGKLLQQLGKTRQVMSITHLPQVAARGDHH WRVSKTSETEDEQLPASHISELDAAERTEEIARMLGGENLTAATRQHAAE MLGYDKQNQST >NE2564 recQ, ATP-dependent DNA helicase RecQ MISHAQTLLREIFGYSEFRGQQAEIITHVVNGDSCLVLMPTGGGKSLCYQ IPALLRKGTAIVISPLIALMENQVAVLCRQGVRAVYLNSALTPEAAAAVE RRMLAGEYDLVYVAPERLLTVRFRALLQRIPIALFAIDEAHCVSQWGHDF RPEYGKLSILPEKFPQIPRIALTASADARTRADILRCLDLHQARSFISSF DRPNLCYRITARSNSRIQLLNFIRSQHAGEAGIVYCQSRRKVEETAAWLN SNHIPALAYHAGMETSIRTRHQKKFLQGHGIVMVATSAFGLGIDKSDIRF VAHLDLPKSIESYYQETGRAGRDGLPASAWMVYGPGDIIRLRSQTESGTE RLPAPIRQAAAARLDALLVLCETTVCRRKPLLDYFGEPTGSLPCGNCDAC LETIPVQDVTIAAQKALSCVYRTGQCFGMEYLIDILSGKRTDRVRQWGHD 
CISTFGIGHELSTEGWRIVFRHLLALDYLVAGEDRAGGERIALQLTSAAR SVLRGETRIKLRLSHHHHSAPYQQISTGLSVPSSRCQAFSCEPQTKCGG >NE2040 rhlE, rhlE; ATP-dependent RNA helicase RhlE MSNDVTFAQLGLSSEILHAVNDEGYVNPTPIQAQVIPSILAGKDVMASAQ TGTGKTAGFTLPLLYRLQAYANTSVSPARHPVRALIMAPTRELAMQIDES VRKYGKYLALRTAVVFGGINIEPQIAALQAGVEILVATPGRLLDLVEQKA VNFSKTEILVLDEADRMLDMGFLPDIKRVMALLSPQRQSLMFSATFSGEI RKLADSLLKQPVRIEAAVQNTVNESISHVIHWVKPDSKFALLLHLIRQQN LKQALIFVKTKHGASHLAQMLSRHEISAVAIHGDRNQQQRTQALAEFKHG DVQILVATDVAARGIDIEKLSHVINYELPGNPEDYVHRIGRTGRAGSKGK AISLVSEHEKELLANIEKLLNAKLETEQIAGFDAEQFARSLPDRKNRMSA GNSRYGNKPMENGSEKSRSEKHRKLPSSQKYSGSRRGGTQKYSDPIFTQP YVPQANSTQSTTPKQPEIQSLFLTYRQEKKTIPALFTALSKSKAGQEN >NE0140 rnhA, probable ribonuclease hi protein MQLEEGVKLVEIFTDGACKGNPGIGGWGVCLKFDGEVREFFGGEPVTTNN RMELLAAIRALQALESLPDTGQSLRVQLHTDSQYVQKGISEWVHSWKKRG WLTADKKPVKNEALWKELDQLSRRYQVEWFWVRGHNGHDGNERADMLANR GVVSVLSEKAD >NE1707 rnhB, Ribonuclease HII and HIII MAERRIPLKHEYAQDGKVIYGVDEAGRGPLAGPVYAACVVLDPADVIEGL ADSKQLSEKKRISLADQIKQRARAWAIASASVEEIDRLNILQASLLAMQR AVVSLRPISNALVLVDGNHAPRLDCEVQTVIRGDSLVAEISAASILAKTA RDIEMLRLHEAYPVYGFDRHKGYPTKAHLEAIRLHGITDIHRRSFAPCVG QSVSGARTTSFINQKEA >NE0212 ruvA, probable Holliday junction DNA helicase subunit MIGRIAGLLLEKHPPLVLVDVNGIGYEIDVPMSTFCRLPGIGEQVTLHTH FWVREDAHLLFGFMTEPERVLFRQLTKISGIGARTGLAILSGLSVNDLHQ IVVSQDSTRLTRIPGIGKKTAERLLLELRDKISPAITLPETGTAMASSTD KDILNALSALGYNDREANWAVGQLSEGVTVSDGIMQSLRLLSKAK >NE0213 ruvB, ruvB; holliday junction DNA helicase protein MIESDRIITASPFSSQEEVIERALRPVQLDDYVGQEKIREQLKIFIEAAR LRQEALDHVLLFGPPGLGKTTLAHIIAREMGVNLRQTSGPVLERAGDLAA LLTNLETNDVLFIDEIHRLSPVVEEILYPAMEDYQLDIMIGEGAAARSVK IDLPSFTLVGATTRAGMLTNPLRDRFGIVSRLEFYTADELGKIVTRSAGL LNVDVTADGAREIACRSRGTPRIANRLLRRVRDFAEVRANGRIDRPVADA ALQMLDVDATGLDVLDRKLLLAVLEKFGGGPVGVDNLAAAINEERDTIEE VLEPYLIQQGFLQRTPRGRMATTMAYQHFDIIPSHQTTVPSLFDPD >NE0211 ruvC, Crossover junction endodeoxyribonuclease RuvC MTSLVYAAKGIRILGIDPGLRITGFGIVEKIGNRLVYIGSGCVVTGESGL PDRLKTILDGLNEIILQHKPEQVAVEQVFVNINPKSTLLLGQARGAAISA 
AVLHELSVYEYTALQVKQAVVGNGHARKEQVQEMVMRLLGLGERPRPDAA DALACAICHAHGGTGLLTLSARNRSKRSKRL >NE0671 sbcB, exodeoxyribonuclease I MQTGNSTLYWHDYETSGATPRWDRPFQFAGLRTDEALNEIGDPLVIYCQP ARDRLPHPEACLLTGITPQMAEARGLPEPEFIALIHAQLAQPGTCGVGYN TLRFDDEVTRFTLYRNFYDPYAREWQSGNSRWDVIDLARMTFALRPEGIN WPINGEGKPSFRLEDITTANGLVHDSAHDALSDVRATIALARLLRAQQPR LYDWLFRLRDKRAAGNLLDMKTHAPVLHTSRMYSSEYGCTTLVMPLLPET GNANSVLVYDLRHDPAEFVLLDIDALAERLFTPKEELAEGLQRLPVKAVR LNKCPALAPQKVLNDEVANRIGLNVEQCQQHWQLLLQHPDFMQRIKQAYS GNKVFAENDADLALYDGFASDHDRNLFPLVRDAEPGKLADLAGKFQDERY IELLFRYRARNFPDTLSVQEEHHWQMHCRRQLGENAINGSLTLNEYHQKL LQLRTDCPQQAQLDILNELEAWGRVLAQENDLPWPPDHSGSEEQTD >NE1392 sbcC, ATP/GTP-binding site motif A (P-loop):ABC transporter MQILQVRLKNLNSLVGEWEIDFTDPAFVSDGIFSITGPTGAGKTTVLDAI CLALYGRTPRLGKVSKSENEIMSRQTGECFAEVTFAAQSGCFRCHWSQHR AHKKPDGELQNPRHELAEADSGKILETRLTEVGRQIEKITGMDFARFTRS MLLAQGEFAAFLQAAPDERAPILEQITGTEIYSRISIRVHETRVSARREL DRLSAGLNGIQPLTRADEQQFHTDLAQKIQQDAGLNEQIAHERQILVWLE NIARLESELHLIAGQQQAWLSRKEALTPDISKLDSASRALELLGEYSRLT SIRNEQETDRNNLAICAASLAGLEQAVKQAEQSLKSLNEQCDRQRAKQRE TIPLLRKVRELDIQIREKESPISTASKGITAQKKTLAALRNQYQQNEIQL AGLQTTLAGLLQQLHVIQPDGAQMDFAHNQNLLNRKQAEYRQLLENRSLA DWRQEIAVLSGQKTLATRAIEAMQSLAASKQISAELEKHTCSLLAGKTQL AKQLGAEEEMLGALEREISLLETQALLLKKISHLEEARQQLKDDEPCPLC GALQHPYAAGNTPRPDDNITALNQARTMLKTRIDTISTLKIRQAETNRDI EQTACRQQEIHRQIQADETLLQQCAVSLFPGLPSAAMFPELPRLLQETDD KLARMTRILQTAEILENEISVQRESLDKTRELEQKIGILRVQHQHQSTQI RQHEAELQLRQEQLDQLQQELGNLRTTRLQLFADKQPDQEEQSLTTAIEA AQKSADNARQQLETEIQQYNRLKNRAEDLVKTITTRAVQLEKLQETFAAR LTQSGFADEAGFTAACLPEEERRRLAQRAQQLADEKTMLDTRQKDKTIAL QAEQLKSMTDQPRDFHDQVLAQLITRQQVLQQEIGGLRQKLADNENSKQK QQEQLQVIEAQKRECARWDLLHGLIGSADGKKYRNFAQGLTFEVMIRHAN RQLQKLSDRYLLIRDPVRPLELNVIDNYQAGEIRSTKNLSGGESFLISLS LALGLSRMVSRNIRVDSLFLDEGFGTLDEEALDTALETLAHLQQEGKLIG IISHVTVLQERISTRIQVIPRSGGRSVLAGPGCRHCQ >NE1390 sbcD, Serine/threonine specific protein phosphatase:Exonuclease SbcD MKILHTSDWHIGKTLYGHKRYDEFEAFFSWLVETIEQEQVDVLLIAGDIF DTSTPGNRSQQLYYRFLHRVAASACRHVVIIAGNHDSPSFLSAPRELLRA 
LDVHVTGSLSGNPADEILVLHDPKGDAELIVCAVPHLRDRDIRTAEAGES MEDKSRKLVEGIRDHYAEVINLARLQRTALSSSIPIIAMGHLFVAGGQTV EGDGVRELYVGSLAHVPAGIFPPDIDYLALGHLHVPQRVNGSSVMRYSGS PLPIGFGEADQEKSVCLIEFNRQISATRPAVSLINIPVFQPLERIRGNWQ VISDRISMLSAANSCAWLEINYEGDEMITDLQERLQSAIEGSRLEILRIR NNRIMNQILDQIDDGGTLEELSVNEVFEHCLSAAAIPVEQRTELWRTYQE TLVSLDEEDIRAE >NE1968 smf, SMF family MQIDQDIESWLRLGLTEGVGGGALRRLLIAFGDPARVLAASRPALEGVVK KPVATSIFLRKVDEERLARTIKWLEDPLNSLITLADSDYPKLLLNISDPP PILYFKGQRQFLAQPAMAMVGSRNATPQGLANADAFAEAASNAGFCIISG LAQGIDTAAHQGGLRGASSSMAIVGTGLDLVYPSRNHELAHKLANEGGLI SEFPLGTPAISRNFPRRNRIISGMCHACLVVEATLYSGSLITARLALEQG REVMAIPGSIHSPLSKGCHALIKQGAKLVENIQDILDELHYQPQPVPRFE SVADEGGGTGVLTGEGDDTGLLMYFSYDSTDIDTLCARSGLTVETVSAML LGLELEGRIGSLPGGKYQRIR >NE2453 ssb, Single-strand binding protein family MASLNKVMLIGNLGRDPEIRYMPSGDAMANLNIATTDTWKDKGGEKQERT EWHRVVMFGKQAEIAGEYLKKGSQIYIEGRLQTRKWTDKSNVERYTTEIV ADRMQMLGGRSGGGSYDPPADRDHDYQSQSTPPAKSNTGFDDMEDDIPF >NE0338 sss, Phage integrase:Phage integrase N-terminal SAM-like domain MNQPHDERDTPLPPLLSEYLAYLASTRSLSLLTQHSYRRDLVALVCCIAA QHQSEHENGHEVTDASLTRLHSHDIRHFIAHLHHGGLSGRSLARMLSAWR GFYRYLMRHHHHTENPCQDIRVPKSPRKLPHALSPDEAAQLLAFDPADAL ATRDLAMFELFYSSGLRLAELTRLQPTDIDFSEGIVRVTGKGSKTRIVPV GEPALRALQAWLPLRSAWLTSGETALFLSRHGQRIHPRTIAVRLHQRARL QNLDDRVHPHALRHSFASHLLQSSGDLRAVQEMLGHSSIRSTQVYTHLDF QHLAKIYDQAHPRAKKRPKTG >NE0154 tISRso8a, Transposase IS911 HTH and LZ region MERLPKGIYTPEFRAEAVKLVEAEGLSVDAAAKRLLVPKSSLGNWVRASR TGSLAKVGQGQRVPTETEIELARLRKELAEVKLERDLLKKCAAYFAKESR >NE0835 tnpA, Transposase Tn3 family MPRRSILSAAERESLLALPDTKDDLIRHYTFSDTDLAIIRQRRGPANRLG FAVQLCYLRFPGIILGVDQPPFPPLLKLVANQLKVGIESWDDYGQREQTR REHLVELQNAFGFQPFTMSHYRQAVHTLTERAMQTDKGIVLADALIEHLR RQSIILPALNAIERASSEAITRANRRIYEALSEPLSNGHRHGLDDLLKRR DNSKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPPGIERLVHQNR LLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRI LGKLFNAAKNKHQQQFQASGKAINAKVRLYGRIGQVLIDAKQSGGDPFAA IEAVMSWDAFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLDVL KLRAAPAAKDVLDAIEVLRGMNTDNARKVPADAPTDFIKPRWQKLVMTDA 
GIDRRYYELCALSELKNSLRSGDIWVQGSRQFKDFEDYLVPPAKFASLKQ SSELPLAVATDCDQYLDDRLTLLEAQLATVNRMAAANDLPDAIITESGLK IMPLDAAVPETAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKS GDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDET YSTALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKY GSSPGRTFYTYISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEH YTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLFVPKGEASYDALKPMI SSDKLNIKAIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVA LRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGE IRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAHALRGNGHAVDDALL QYLSPLGWEHINLTGDYLWRSSAKIGEGKFRPLRPLQPA >NE0836 tnpR, Site-specific recombinase MQGQRIGYVRVSSFDQNPERQLEHVEVGRVFTDKASGKDTQRPELDSLLA FVREGDTVVVHSMDRLARNLDDLRRLVQKLTKRGVRIEFVKESLTFTGED SPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGVYRGRKKALSPEQ VAELRQRAAAGEQKAKLAREFGVSRETLYQYLRLDQ >NE0751 tnpR, Site-specific recombinase MPGKRIGYVRVSSFDQNPERQLEGIQVDRVFTDKASGKDIQRPQLDMLLD FVREDDTVVVHSMDRLARNLDDLRRLVQDLTGRGIRVEFVKEGLIFTGED SPMANLMLSVMGSFAEFERALIRERQREGITLAKQRGAYRGRKKSLNSEQ VAELKRRVVAGEQKALIARSFGISRETLYQYLKTVD >NE1966 topB, topB; DNA topoisomerase III protein MSKKLIIAEKPSVASDIARALGGFVKQKDYFESDEFVVSSAIGHLLELIV PEEYEVKRGKWSFDHLPVIPPRFDLAPIEKTTDRLKLLSKLIKRKDVDML INACDAGREGELIFRYIVRHVGSKKPIKRLWLQSMTPSAIREAFANLLND AEVQSLADAAVSRSEADWLVGINGTRVMTAFNSQEGGFHKTTVGRVQTPT LAILVEREEAIKKFVVRDYWEVHATFQAESGVYKGKWFDEGFSKRKDESE SRADRIWDHAKAEVIRDKCAGRTGVVTEESKPSRENCPLLYDLTSLQRDA NSRFGFSAKVTLGLAQALYEKHKVLTYPRTDSRALPEDYPAIVKDTLQVL KGSRYDRFASQILESDWVKPNKRIFNNAKVSDHFAIIPTALVPKKLNEAE EKLYDLVTKRFLAIFYPAAEFLITTRITRVENEPFKTEGKVLVHAGWQTV YGKVESAQGQEEESVLVAVTPGETVLAQEVAVVAGKTRPPARYNESTLLS AMEGAGKLVEDEELRAAMSAKGLGTPATRAAIIEGLIHENYVERSGRELQ PTAKAFSLVTLLRGLKIPELISPELTGDWEFKLRQIEQGQLKRDVFMEKI AAMTRHIVEQAKNHRDKTISGDFATLQVPCPGCGGVIKETYKKFQCQQCD FALWKILAGRQFEAAEMETLISTREIGPLSGFRSKMGRAFNAIVRLTDDY EMKFDFGNEADQAQEKVDFSAQQPLGKCPQCGHSVYEHKLLYVCEKSVGA GAPCSFRTGKIILNRAIEAEQVVKLLQTGRTDLLAGFVSRKGRPFSAYLV VGPAGKIGFEFEQKKTKSKPADTVPETGKAAS >NE2455 uvrA1, ABC transporter:Excinuclease ABC A subunit 
MELIRIRGARTHNLKNIDLDLPRNQLIVITGLSGSGKSSLAFDTLYAEGQ RRYVESLSAYARQFLQLMEKPDVDLIEGLSPAIAIEQKATSHNPRSTVGT VTEIHDYLRLLFARVGEPHCPEHGIGLAAQSVSQMVDQVLQLPTDTRLMI LAPVVTGRKGEQAELFDELRAQGFVRVRLDGEVYDIDALPKLQKTKKHTI EVVVDRLKISPEVKQRLAESFETALRHAEGRALAVEMDSGKEYLFSARFS CPVCSYALSELEPRLFSFNNPAGACPKCDGLGQITFFDPARVVAFPYLSL AAGAIRSWDKRNQFYFQMLQAVANHYHFDLEIPFEQLSKEVQQAVLYGSG KEKITFTYLNEQGRAHQQVHPFEGIIPGLERRYRETESQTVREELAKFIN ARECPECGGTRLCREARHVTVNGETIFAISAWPLRQAKQFFDDMELTGHK QSIAERIIREISSRLQFLNNVGLDYLSLDRSADTLSGGEAQRIRLASQIG SGLTGVMYVLDEPSIGLHQRDNERLLDTLRHLRDLGNSVIVVEHDQDAIL LADHVVDMGIGAGEHGGCVVAEGTPTAIQANSASLTGQYLSGKRSIAIPS TRTPPNPERMLTIRGAAGNNLKQVQLNLPVGLLICVTGVSGSGKSTLIND TLYRVVARHLYGSHTDPAAYQEIDGLGFFDKVIDINQSPIGRTPRSNPAT YTGLFTPVRELFAGVPQARERGYSPGRFSFNVKGGRCEACQGDGVIKVEM HFLPDIYVACDVCHGQRYNRETLEIQYKGKNIHEILQMTVENAHAFFEAV PTIARKLQTLLDVGLGYITLGQSATTLSGGEAQRVKLSLELSKRDTGRTL YILDEPTTGLHFQDIDLLLKVLHRLRDNGNTVVIIEHNLDVIKTADWIID LGPEGGAGGGRIIAEGTPETVASIPGSFTGYFLQPLLSTTLTG >NE0785 uvrB, Helicase subunit of the DNA excision repair complex MIITFPGSPYKLNQAFQPAGDQPEAIRILVEGIESGLSFQTLLGVTGSGK TFTIANMIARLGRPAIIMAPNKTLAAQLYAEMREFFPENAVEYFVSYYDY YQPEAYVPSRDLFIEKDSSINEHIEQMRLSATKSLLEREDAIIVATVSCI YGIGDPVDYHGMILHVREHEKISQRDIIQRLTGMQYQRNEFEFARGTFRV RGDVLDVFPAENSETALRISLFDDEVESMTLFDPLTGQTRQKVSRYTVYP SSHYVTPRSTTLRAIETIKTELTGRLNYFHENHKLVEAQRLEQRTRFDLE MLNELGFCKGIENYSRHLSGRQPGDPPPTLIDYLPDNALMIIDESHVTVP QIGGMYKGDRSRKENLVAYGFRLPSALDNRPLRFEEFEKLMPQTIFVSAT PADYEIQRSGQIAEQVVRPTGLVDPVIIIRPVTTQVDDLMSEVSLRAAQN ERVLVTTLTKRMAEDLTDYFSDHGIRVRYLHSDIDTVERVEIIRDLRLGK FDVLVGINLLREGLDIPEVSLVGILDADKEGFLRSERSLIQTMGRAARHV NGTVILYADKITNSMRRAIDETERRRNKQKLFNQQNNITPRGVNKRIKDL IDGVYDSENAAEHRKVAQIQARYAAMDEAQLAKEIQRLEKSMLEAARNME FEQAAQYRDEIKNLRSKLFIGIIDPDEIREVPQTAGKKSRRKAGR >NE0933 uvrC, uvrC Nuclease subunit of the excinuclease complex MPDAHFDGKAFVLTLPAQPGVYRMLNAAGDVIYVGKAIDLRKRVSSYFQK SGLSPRIQLMVSQIAGIETTVTRSEAEALLLENNLIKSLAPRYNILFRDD KSYPYLLLTRHIFPRLAFYRGALDDRHQYFGPFPNAGVVKSSIQLLQKVF 
RLRTCENSVFDHRTRPCLLYQIKRCSGPCVGLITPEAYQQDVKSAAMFLQ GKQDEVLKTIEQKMFTASDQQDYEQAAQLRDQMQALRKIQEKQFVDSGKA LDADVIACAIEPDSHAVAVNLVMIRSGRHLGDKTFFPQNVYEADISTVLE AFVTQHYLNRSVPPLIILGQKIRVTLLQKLLSDQAGHKITLTTNPIGERR KWLDMAAENAQLALQQMLIQQASQEDRLQALQEALNLPGLARIECFDISH TMGEATIASCVVYDRFAMRNGEYRRYNITGIVPGDDYAAMRDVLQRRYAK LAMEEGKLPDLILIDGGKGQIRVASEVMIELGLNDIPLVGVAKGETRKPG LEQLILPWQEEALHLPDDHPALHLIQQIRDEAHRFAIQGHRAKRAKTRKI STLEQISGIGTKRRQSLLTRFGGLKGVKNASIEELQQTEGISRSLAEKIY RELR >NE1473 uvrD, UvrD/REP helicase MTALLTDLNPEQLEAVTWSHQSALVLAGAGSGKTRVLTTRIAYLLQSGRT RPQNILAVTFTNKAAREMVARIGAMLPVNTRAMWVGTFHGLCHRVLRAHH EDAGLPQAFQILDMADQLAVIKRVLKERSLDEKMLPPRQLQWFINNAKEE GLRASQVDVHGGFNQTLAECYQAYEIVSMREGTVDFAELLLRCYELLSRN EILRDHYRSRFEHILVDEFQDTNRLQYKWIKLLAGPGSQQHAAIFAVGDD DQSIYAFRGAHVGNMRDLEKDFSVPKIIRLEQNYRSHGNILDAANALIEH NKGRLGKNLWTAAGKGEPVRVYHAATDMDETSFIIDEIKALHADGLALSD IALLYRSNAQSRVLEHGLFNASVSYRVYGGMRFFDRQEVKHALAYLRLIA LPDDDNALLRIINFPPRGIGARTLEQLQDQAAMLGTSLWQAAFKVYEGGK AVATRNSQPGRGIAGFVSLVLSMQQDGEGLPLPEIIRRVIDQSGLAAHYQ AEREGGERLENLKELINAATSFVHESEDDSLTAFLAHASLEGGEHQAEGY QDAVQLMTVHAAKGLEFHSVFISGLEEGLFPHENSRNEPDGLEEERRLMY VAMTRARQRLYLSYAESRMLHGQVRVNIPSRFIDEIPQDLLKRLRSDFSG RSFRQGVSGTGQTVASTINSSQKGRSTMAAAVGMTSSGLNSAGFHVGQQV SHAKFGTGIILNYEGSGTDMRIQVNFHQAGTKWLSLAYAKLEPL >NE1458 xerD, Phage integrase:Phage integrase N-terminal SAM-like domain MRQSPDFFRDMLRMNITDTNIRMLDEFTDALWLEDGLSRNTLASYRADLM QLVEWLGRQPRTNGSLSDVTQADLLAFLSDRIGQGVKASTTCRALTCIKR FYRYLLRQGKILADPATNIDSPKISRHLPVSLTETEVEALLAAPDTRQPL GLRDRAMLEILYAAGLRVSELVGLSISQIRQDMGVVRILGKGSKERLIPL GEEALHWLSLYLQEARPVLLAGKHSNMSFVTTRGDAMTRQAFWYLIKRHA RQAGIVKLLSPHTLRHAFATHLLNHGADLRVVQLLLGHSDISTTQIYTHV ARERLKQLHARHHPRGTL >NE1172 xseA, xseA; exodeoxyribonuclease vII large subunit protein MTDHNLLPEPKKILWRVSELNRNARVILEQTFPLLWVSGEISNLKRYPSG HWYFSLKDDSAQVRCVMFRHKNLYLDWIPQEGMQVEAQALVTLYEARGEF QLTVEQLRRAGLGALFEAFERLKARLQQEGLFSPEYKQPLPRFPRQIGII TSPNTAALRDVLTTLQLRLPSIPVVIYPAPVQGEGSAAAITTALHTAAVR GECDVLILCRGGGSIEDLWAFNEEIVARAIAACPIPIVTGIGHETDFTIA 
DFVADARAPTPTGAAQLASPDRQAILHRLQYWLHRLQQTMERHIERRMQA TDLLAHRLIHPGERIRHQQMHLLQLRGRLQNAWNRQVEIRTWRIEETGRR IHSAKPDIQAGIRHQQELAARLQRAMAHRLENLQFKLRQQQQHLIHLDPK AVLARGYSIAYTARGDILHDSRQTRAGDNVRLVFASGWAKADITETGE >NE1159 xseB, Exonuclease VII, small subunit MRKKSSSNKEETALHPPPENFETATAELEQIVAGMETGQMSLEDALSAYK RGVELLQYCQNILKNSQQQIKILEADMLKHFSPAEHDAS >NE0023 xthA1, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth MKIATWNVNSLKVRLQQVIDWLNLNQPDILCLQETKLQDEFFPMDAIAQA GYRSIYIGQKTYNGVALLSKETGEDICTALPGFDDMQKRLIAATYGDLRV ICAYVPNGEHVDSEKYIYKLEWLSQLNRFLQQQRACYGKVALLGDFNIAP EDRDVYDPEAWRGQVLCSEPERQAFRGLLDTGFVDSFHLFEQPEKTYTWW DYRMMAFRRNRGLRIDHILLSHEMADRCTIWQVDKLPRKLERPSDHAPVL VELA >NE2192 xthA2, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth MRIITLNVNGLRSAAGKGLFDWLPRQEADVICVQELKAQQGDINGVMRAP DGYSGYFHCAEKKGYSGVGLYTRYSPDQIIEGTGIPEIDMEGRFLRVDFG NLTVISIYLPSGSSGEHRQAAKFFFMEHFLPLLQSLAECGREVLLCGDWN IAHKAIDLKNWRSNQKNSGFLPEERAWLSTVFDELKLVDVFRKINPEPDQ YTWWSNRGQAWAKNVGWRIDYQIATPGLAAMATGVSIYKAERFSDHAPLT IDYDFNL