TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Nitrosomonas europaea ATCC 19718, ATCC 19718
Gene type: CDS

Number of genes found: 243

Free access
Sort by:

 



# Nitrosomonas europaea ATCC 19718, ATCC 19718

>NE0271 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDHLISVNRP
>NE1807 putative ATP-dependent RNA helicase protein
MSFENLNLHPAIVKAVLAAGYTAPTPIQQQAIPDLIAGHDVMASAQTGTG
KTAAFMLPALHRLATPAQIRGRGPRILVLTPTRELALQVSDAASKYGKFL
PRINVVSILGGMPYPLQNKLLSQTVDVLVATPGRLIDHIERGRIDFSRLE
MLVLDEADRMLDMGFIQDVERIALSTPATRQTLLFSATLDVAIEKIATRL
LKAPKRIQVAAQHTKLDHIEQRMHYVDDLTHKNRLLDHLLRDTTIKQAIV
FTATKRDADSLADNLSSQGHKAAAMHGDMTQRERTRTLTGLRQGRLKILV
ATDVAARGIDIADITHVINFDLPKFAEDYVHRIGRTGRAGASGIAVSFAS
GKDVAHLKRIERFTGNRFEFHVIPGIEPRTKPRFGRSDDKPGRRPSSSAA
AHKTRRSWSDNPNTRTASPGHRGDKDAGFGQPFGRETRKRPFRDSKFNSA
DRFARTE
>NE1174 Uncharacterized protein family UPF0020
MTERFFAPCPRGLETVLAAELERLDATSIQASPGGVGFHGNWQTCYRANL
ESRIASRILWQIAKDQYRSEADIYDLTHSLPWQDWFEPRLSIKVNLAAIK
CPLRSLDFVTLRIKDAVCDKFRAIHGKRPDIDTVAPDMRIHGFLNAQEFT
LYLDTSGEALFKRGLRQTQGEAPLRENLAAGILALTGWQPGTPLLDPMCG
SGTLLLEAAQIACRIAPGSGRQFAFEQLKLFDARSWKKLKQTATERQHER
TFQSIYGSDLYGSALAHTRNNLAAAGLAECVTLKQANVLEISAPAETGIL
VSNPPYGVRIGDHQMLAEFYPRLGDVLKQRFSGWRAFLLTADPLLAKSIR
LTPSRRTPLFNGALECRLLEYRLVAGSMRREKQPSSESSTNQPIT
>NE2536 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNI
>NE0447 conserved hypothetical protein
MLKQNETKTSMILNYRWLYDTVRKRFESDEAMEAFLPKALTPATLKQKGD
DRYLSAMSQRVFQAGMQHSVVNAKWPAFEEAFWGFVPETMVMLSPEQIEG
YMKNSSIIRHYTKLQTIPRNAQFILDIRQEQGCSFGEFIADWPSADIIGL
WRLLAKRGARLGGRSSAGFLRLAGKDTFLLTSDVTARLIAAGIIDHEPTG
QRDRQIIQDAFNELQQDSGRPLCQLSAMLSLSINPRF
>NE1843 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNNYRARAEYATATATLPQAENH
>NE1552 Transposase IS4 family
MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA
EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA
KNTSSFLAVVQIRCIALWADIL
>NE2190 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron
MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA
EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR
VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV
DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS
EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA
ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS
LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL
DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA
RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG
>NE1990 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE2309 Adenine specific DNA methylase Mod
MASNQKLELTWIGKEKRAKLEPRILLEDPEKSYHAKQRVSESDVFDNRLI
FGDNLLALKALEQEFAGEVKCVFIDPPYNTGSAFTHYDDGLEHSIWLGLM
RDRLEIIKRLLSNDGSLWITIDDNECHYLKVLCDEIFGRANYKTTITWQR
KYSVSNNFQGIASICDFVLVYSKSEAFKNNLLPRSEESAARYNNPDNDPR
GPWKAVDYLNQATPEKRPNLCYDIVNPNTGVVIKNTKKAWKYDPTTHQRH
VDEKRIWWGRDGGNSVPALKLFLSEVRDGMTPHNWWSHEEVGHTDESKKE
MIGLYGPRDVFDTPKPERLLKRILEIATNPGDLVLDSFAGSGTTGAVAHK
MGRRWIMVELGEHCHTHIIPRLKKVIDGEDPGGITNAVDWQGGGGFRYYS
LAPSLIVEDRWGNPVINPEYNATQLSEALCKLEGFTYAPSETRWWQQGHS
SERDFLYVTTQNLSASQLQALSDEVGTEQSLLVCCSAFHGISAAAAAARW
PNLTLKKIPKMVLARCEWGHDDYSLNVANLPLAKPEPETPASQPAPKKKG
KKTLPMPDLFGDVEDGA
>NE2101 hypothetical protein
MSSSPSHPFPSLQSRIVASFVSTSSTIIVARLSTLRPLRDLTMVGWSMST
RMKAKLVCDALQMAVWQRQP
>NE1270 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1316 NUDIX hydrolase
MIDRNGYRANVGIILLNSQNQVFWGKRARQDSWQFPQGGIKSGETPTEAM
YRELAEETGLQPVHVEILGRTREWLRYDVPACWTRRDWRKNYRGQKQIWF
LLRLLGRDSDVSLETCAHPEFDAWRWNQYWVELESVVEFKRQVYRQALTE
LSRLLDHEAGLGNDRAYREPLEPVEKNRKKSSDTRQS
>NE1558 putative transposase
MPRTHGYAPIGKRCHGKCNWHARGRINVIGALIGKCLLTVGLFKNNIDAD
TFLGWTIHDLLPKLPPASIVVMDNATFRKRQDIQNVITRGGHTLEYLPAY
SPDLNPIEHKWAQTKAVRKQQNQTVEQLFKIESFYVT
>NE0342 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPNSSGRGRCQNRGSVFVIPRPGR
>NE2515 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE2156 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2533 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0112 hypothetical protein
MLGTANLPSRNKYKAKATILHMNRNLYLVAYDICNPRRLRQVCRYLTGYK
VSGQKSVFEIWVTPTELHTIRTELDKLMDTQADRLHILSLDPRMKPRCYG
NASTFTVQHFCIV
>NE0934 Integrase, catalytic core
MNRTLKEATVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYI
IKCWQNEPERFIINPYHHKVGLNSYSVFRCLDTVRLSAAHVGFFTEMREI
SC
>NE2446 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE2318 hypothetical protein
MKLLFFCLLLVSMSVPAVAGNEKQIFELEAAIMQQQQEQQILFQRFQMLQ
ELRRHEITQIEQALPTGSDVIINGEAPKYEDVARQRKERAERVHRYTDEL
DELYMRYQETENERRALIEQLNGLKPGQDVSAEPKK
>NE0451 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1845 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0709 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1178 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2414 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1516 Uncharacterized protein family UPF0006
MFVDSHCHLDFPDLASSLDELLVNMQISQVTHALCVGVNLENFPRVLALA
ESHSNLFASVGVHPDYEDTAEPAVEQLLKLADHAKVVALGETGLDYFRLK
GDLEWQRERFRRHIRAARRCGKPLIIHTRAAAEDTLRIMEEEGAASVGGV
MHCFTESWEIARRALDLNFYISFSGIVTFKNAAIIKEVAKKVPADRMLIE
TDSPYLAPVPHRGETNQPAFVRHVAEEIARLRETTLAEIAAVTTNNFFNL
FKVV
>NE0111 Protein of unknown function DUF48
MTSLFVDRRGVELGLESGAIVFRENGERIGTVPIAPLTRVFLRGDVNLPA
ALLGKLGERGVGVVILSGRTSRPSLLLARPHNDAARRVAQVRLSLDEPAS
LIIARELIERKLTRQIEWFTELRENDIQARYELSRALRGLEEHRARLGNI
NNAASLRGIEGSAAARYFTGLQAVIPGSLHFHGRNRRPPRDPFNALLSLT
YTLLHSEITIALHGAGFDPYIGFYHRLDFGRESLASDLLEPLRPLADRFA
FALVHRRVLDKDHFTTTESGCLLGKAGRVRYYAAYEEHSEILRKGINQEI
EQLAEQVGSALTPESGNTPDHDSGDWE
>NE0662 Transposase IS4 family
MCGTPARCVAPLAALFEMLKGQCDGISIADATAIAVCDNRRIARHRVSAD
SARRGKTSMGWFYGFKLHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGL
FGQLFADKGYLAQWLTETLDRQNLQLITPFKKNMKPAPRTGFEKAILRRR
SLIETVFDELKNLCQIKHTRHRSFFNFVVNLMAGIVAYCLSDNKPTLNLT
RVNTLVKA
>NE2010 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE2109 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2498 hypothetical protein
MPKMDVKGIAVTVYSENAMDFISLTDMLRAKDGDFFISDWLRNRNTVEFL
GIWEQVHNPNFNYGEFATIRSQAGLNSYKISVKEWVARTHAIGLVAKAGR
YGGTYAHKDIAFEFGMWISAEFKIYLIKEFQRLKEAEQQQLGWDIRRNLT
KINYRIHTDAIQTNLIPPALTQSQISLIYASEADLLNMALFGKTAKQWRE
ENPNNKGNIRDEANVSQLVCLANLETLNAHFIHQGLPQVERLKILNQTAI
HQMKLLLADRSLKQLDGN
>NE1351 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2098 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron
MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA
EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR
VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV
DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS
EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA
ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS
LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL
DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA
RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG
>NE1367 possible ISA0963-4, putative transposase
MLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQLYLALNDIEHSK
TKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEELQHDLDDWMAYY
NSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0814 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1631 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDHLISVNRP
>NE1788 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE1925 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1667 conserved hypothetical protein
MPDMTSASSGTVLAFDFGKRRIGVAIGEHELRMAHPLTTIDQSMTRPRFE
KIAELIEAWQPVLLVVGLSVHADGTEHEITRLCRRFARRLEGRFRIPVAL
ADERYTTVIARSVLEEVGVTGKKQRPMLDQIAAQHILQTYFDLSHAAS
>NE2411 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNNYPQQNGMVKWVTGH
>NE1995 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1539 hypothetical protein
MLFWIKTIQTAAWFYLLMFALLAGSAHAAELKPADQTGFLIVAADRGFVG
NEEIRDAFASFSANHPAALVFVTDERTRQTLQSGLASLHQQNIGRIVVLP
LFISAAEPRYQLIRTLVTEENQTIPVTFTKPYGESYFAVEALATRLRGMQ
HTAQQHLLVVGYGAQNDTHRRAMYDDWMRIVKQASQGVSFRSINSLILLE
AQKDEEPESYGNKTKQQLATALSSLGTATKNNKNQVIAFALGPKYDSMMS
LESRLERLLPENAALNHFEIEPQHLAMWMEREASRNLPLAEEDTGVILFA
HGSDFHWNENLRVAVEPLMKRYKIEFAFSMADPYTIERALHKLEQRGAKA
AIVVSAFASRSSYRNEIGYLAGLDIENQDDHIHDNNSGHGSHGGHGGHAK
SSTPVPRILTSLPVIWTGGYEDNPLFASALFDRVLALSKDPARETVILTA
HGTQDDRKNDEWLEKLNSIASQMHDQGGQKFKAFKVATWREDWPDKRAPW
VKKVRAMVTEASKQGDRVIVIPTRTTSVGPEKRFLAGLEFELGEGFAPHP
LFTQWVDEQIRQGINLHKEALGR
>NE2108 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE0806 hypothetical protein
MKYSIRFFAVIFAIFLTACSTMYYSGLEKIGIPKRDVLVYRVEKARDTQE
ETREQFKSALEQFSAATNFKGGDLEGIYKKLNGEYEASVNKAKEVRSRIE
DIENVSAALFREWEQEITQYSNPALKRSSQDRLTETRSYYKQLIAAMKNA
ESRIQPVLTVFNDQVMYLKHNLNARAIASLKGELKTLQSNVSTLVAAMEK
SINEANTFISNMEKN
>NE2274 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE2011 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE2012 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNN
>NE1816 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE0749 Transposase IS911 HTH and LZ region
MNKQNKQNKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTL
LEWVKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFA
QAELDRVLKK
>NE1107 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE0708 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE0134 transposase
MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP
VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA
ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET
FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML
DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV
RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL
SSPIVPLQAALGRENQFQLLTNT
>NE1853 hypothetical protein
MSDQYTQNESDQSKDKVEWTKPASLLNILGKKFAPIADLQHKQLPSWSLL
VFLGILLLVFIWKQIAVNQAESRLEKGQAQIAQQLEEKSKELVKKAREYA
DSQYKKEEERFGQVLAWAVRGELIRNNLDQIDQYLTELVKTKDTERVVLI
SDEGKLLVSTDKRLESEEASSLYPKDVLGLQTITIKSDVDNRKLLVVPVM
GLNKRLATIVISYNPPSLLN
>NE0120 hypothetical protein
MTPLRAILRLRSPLGTPLAGDTLFGQLCHAVREMLGEEKLEALLDGYTAG
SPWLVVSDGFPSGYLPRPTVAAALQANSEEDPKKRKEAKGKRWIPHSQIA
QPLRQLLSSAVSDEEVYGKQSRPIQAAAFHNTLNRLTGTTGTGEFAPYTQ
SQIFYQRDQRMDLWCVLDEDRLPRETLHQLLEYIGSVGYGRDASIGLGKF
AVEQIEEAALFKQTHPNANAYWTLAPCSPQGQGFKTSRSYWQVLTRFGRH
GGTLALGANPFKQPLLLAATGAIFAPTNNMAQIHFIGSGLAKVSLMQTAA
VHQGYAPVLGICMEAI
>NE2521 conserved hypothetical protein
MTSVLSPNTQAILLLTAPLIAGRGTASSDLLSPGEYKRLARHLREIQRQP
ADLLSPDAAEILRACQPVIDEGRLQKLLGRGFLLSQVIERWQARAIWVVS
RADAEYPRRLKARLREDAPAVLYGCGDMALLETGGLAVVGSRHVDDALID
YTMTVGRLAARAGRTLVSGGAKGIDQAAMRGALEAGGKVCGVLSDSLEKT
TMNREHRNLLLDGQLVLISPYDPSAGFNVGHAMQRNKLIYALADTSLVVS
SDLNKGGTWAGAVEQLDKLKFVPVFIRSTGESSAGLDGLRKKGALAWPNP
QDVDSFKDVFNVAMPTPTASPQVGFALFSNEEPTSVDAKPTVPVPPDTAP
APQAESEPSAPVDVVSDAQPPAPALEEQPSVTPEAIPPIDDAMESAQPES
SPAEVLFAAVRAAIQQLLSAPMKDADVAAALDVSNAQAKAWLQRLVDEGV
LEKQKKPAGYIVKQKRLFE
>NE1553 possible transposase
MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG
APWRDLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS
HCKVHPHASGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL
IARRLAA
>NE2200 transposase
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE0711 Uncharacterised protein family UPF0102
MSSAGNKGSDAEQCATIFLQQQKLTLLERNYRCRFGEIDLIMREGDTVIF
VEVRMRSSDRFGGAAASITAAKQLKLTRAARHYLAGCEGDFPYRFDAILI
SGERENEIEWIRNAFDES
>NE1132 transposase
MPDLCQGLFGQLFADKGYLAQWLTEALDQQNLQLITPLRKNMRPVPRTRF
EKVILRRRSLIETVFDELKNLCQIEHTRHRSLFNFIVNLMAGIVAYCLSD
NKPTLNLTRVNSLAKA
>NE0197 conserved hypothetical protein
MIPNPDPESSINRNQVIISGTITDLASPRYTPAGLMIAEFKLSHCSNQQE
AGIQRRIEFEFEAIAIAETAEKIIRIGSGSNVEITGFIAKKNRLSNQLVL
HVRDTRII
>NE1585 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE0719 Uncharacterised protein family UPF0102
MSSAGNKGSDAEQCAAAFLQQQKLTLLEKNYRCRFGEIDLIMREDDTVVF
VEVRMRSSDRFGGAAASITAAKQSRLIRTARHYLAGHEGDFPCRFDAVLI
SGNRENEIEWIRNAFDES
>NE0272 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR
>NE1940 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNN
>NE1789 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE2097 Integrase, catalytic core
MPTGQNNRTTVVRKNATCPRDLVNRMFHANRPNQLWVSDFTYVSTWQGWL
YVAFVIDVFARRIVGWRVSSTMSTDFVLDALEQALYDRRPADTLIHHSDR
GSQYVSIRYTERLAQAGIEPSVGSRGDSYDNALAETINGLYKAELIHRRA
PWKTRAAVELATLEWVAWYNHQRLLGSIGYIPPAQAEENYRQTQDNKTLM
DILL
>NE2504 Transposase IS4 family
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK
LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT
EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI
EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA
>NE1740 Transposase IS4 family
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK
LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT
EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI
EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA
>NE2273 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE0981 HhH-GPD
MALVFWEQAVNDLSARDPVMHRIIQCYSDSMPEERGNAFATLARAIVGQQ
ISVKAAASVWQKVTTLIPEITPEALIATEIDLLRTCGLSARKVDYLRDLS
RHFLEGTLVTVNWHDLDDETLIRKLVEVKGIGRWTAEMFLIFHLHRPDVL
PLDDIGLQRAVSLHYNASQPVAKQAIRTIAESWQPWRSVATWYLWRSLDP
IPVIY
>NE0452 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2311 possible helicase (Snf2/Rad54 family)
MVQQLLTPHQSQYIAWQLTRRAAKDSVESLASTLVDSQVDLNPHQVDAAL
FACRNPLSRGVILADEVGLGKTIEAGLVISQHWAERRRKMLIIVPANLRK
QWHQELQDKFNLQGLVLEAKNYNAMRKEGVTQPFLHAGGPIICSYQFAKA
KADDLRRIHWDLVVMDEAHRLRNVYKNGNVIARTIRDALEHVDAKVLLTA
TPLQNTLLELYGLVSMIDERVFGDLDSFRTQFSGVRTEQSNRALRERLTP
LCKRTLRRQVQQYVPYTARIAIVEEFTPSQEEQQLSALVADYLRRPNLKA
LPEGQRQLISLVLWKLLASSSHAIAGALETMANRLQGQLDELPDVPDLTE
SLDDDYEGLDETADEWNGATANDADASANERAAIADEAAELRRFKELATS
IRQNAKGQALLTALDKAFAELERLGASKKAIIFTESKRTQNYLLSLLAET
PYGIVLFNGTNTDARAQAIYKDWLQRHEGSDRITGSKTADTRAALVEHFK
ERGTIMIATEAGAEGINLQFCSLVINYDLPWNPQRIEQRIGRCHRYGQKH
DVVVVNFVDRSNEADARVYQLLSQKFKLFEGVFGASDEVLGAIGSGVDFE
RRIAAIYQNCREPEEIRSRFEDLQRELSSEIDEAMLRTRQLLLENFDEEV
QEKLRIHSQDSQAVLNKYERLLMDLSRTELRDHARFDTAEEVNGFVLHSL
PDGLGLATGSREQAVMAGRYELPRRSGDAHLYRMGHPLAEWAIERAKARD
LQAPARLAFDYAAYGKRLVSLEKWRGQCGWLSVTLLSVETLNDQEQHLVV
SACTQAGEALPEDDPEKLLRLPAQVEGDAHLQVCAELVANVESRKSVLLR
GINQRNLGYFEQEVQKLDTWADDLKLGLEQEIKAIDGEIKEVRRTAAASP
TLEEKLAHQKRQRELETRRSKLRRDLFARQDEVEEQRNKLIGELEEQLKQ
QVAERMLFTVEWELT
>NE0931 conserved hypothetical protein
MTVSSPFQPDCRDCPRLAQHLDQVKTDYPDYHARPVAPFGDSSAKLLIVG
LAPGLHGANRTGRPFTGDYAGILLYRTLHKFGFASHDESVSADDPLHLTD
CRITNAVKCLPPANKPQPAEIRQCNAFLAVELDNFARNGGQALLALGTIA
HQAVLMALGCRNADFPFSHGAIHRVTEELKLYDSYHCSRYNTQTRRLTET
MFEQIFDRICQDMAATQ
>NE0830 DNA mismatch repair protein MutS family, C-terminal domain
MAARCVVLLCIVENCDFRRASCDHHDFACHAAHGGLDQRFQKARSWFLLS
SIQHKVAKSAKVGNTESTHVITTRTMNDTTQDTAASVWRESFILSSGKNP
SGIRDTRPTADNYGVLDAKTFAAVEVDALFDEINQAQTLTGQSILYRSLA
RPVTDAALLQSKQEALRELESNPDLLKVLEQYIKRIAIDEASLHHLLYGE
FAGGLTTDDPRDKTGKDKLEFGGYGYRQFIDGTGFVVDLVEEAEALPMPE
SDYLRTLVQTLRDFARSRTYALMHGPIYVSQGKFMTREEKPRYLLIQRFR
PSMFKWPFISFFLAFVAGLLLFFQNTLNELVASYVGYGLLILVVPIIPII
LQAISASDRDSVIYPLQRLFRQSPELARTIEAMGMIDELLALHRHARSIP
GESVLPEIDMDGRHTLVVSGARNPLLVRTRPDYVSNDIVLDNDKHLLIVT
GPNSGGKTAYCKTVVQIQLLAQAGAYVPAVQARAVPAEHIFYQIPDPGQL
EEGMGRFAHELKQTREIFFNSTPRSLVVLDELAEGTTFEEKMTLSEYVLK
GFHQLGATTILVTHNHELCERLQQENIGNYLQVEFVSEKPSHRLIPGISR
ISHADRIASAIGFSKEDVASHLASLQE
>NE0121 conserved hypothetical protein
MQLDTIHKITGTLILKSGLHIGAGDSEMRIGGTDSPVVKDPLTDQPYIPG
SSLKGKIRSLLEWRHGLVVATGGAPYSFKHLAQDENNSAGRDVIKLFGGA
PDKAEDQLVKNIGPTRLAFWDCPLNGDWKKEAADSRHLLTTEVKSENSIN
RIAGTAEHPRFIERVIAGARFDFTLTLKVLEGDDLLNTVLLGLRLLELDS
LGGSGSRGYGKIKFAELKLDGTDLMEQFHAITPFNQTA
>NE1348 Integrase, catalytic core
MGRIYQQTFVDTYSKWAAAKLYTNKTPITSADMLNDRVLPFFAEQSMGII
RILTDRGTEYCGKPENHDYQLYLALNDIEHSKTKANHPQTNGICERFHKT
ILQEFYQVTFRRKIYQSIEELQHDLDDWMAYYNSVRTHQGKMCCGRTPMQ
TLIDAKEIWDDKITELNN
>NE1996 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0252 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2447 transposase
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE1698 conserved hypothetical protein
MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP
VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA
ALIEGLTVRASARQCRIDKNTSFRWRHRFLTFPAAAKANHLEGIVEADET
FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML
DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV
RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL
SSPIVPLQAALGRENQFQLLTNT
>NE1271 Integrase, catalytic core
MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1264 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2442 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1523 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0969 possible N6-adenine-specific methylase
MVKADRIRIIGGQWRSRLIQFADDELLRPTPDRVRETLFNWLGQDLTGKI
CLDLFAGSGALGFEAASRGAKQVTMIEQNMKAVRNLHCSIEKLGASQVKL
EHVDARMFLTANSERYDVIFVDPPFKSGLLAEVLPLLPAKLEEEGVVYVE
SSDKLLPDDTWSIWKQGRASHVHYCLLSLNPDG
>NE2441 hypothetical protein
MKKILSILSSTFILSLFMLSSVNAQVEDVHLQEAIRQTEAVVLAVDVKTM
TQLVQEAERYAVEVKSTHPENEHLQEGLKHLNDVIKESQAGEPAAARKAA
IVALSKFNQIERK
>NE0454 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIQTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNN
>NE0716 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE0561 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE2023 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE0268 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2238 Transposase IS200 like
MTFTTKEYQSLSHTRWDCKYHVVFIPKRRKKRIFGMLRWHLGELFHELAS
HKESKIVEGHLMDDHVHMCISIPPKYAVSNVVGYLKGKSAIQIARKFGGR
QKNFTGEHFWARGYFVSTVGLDDNIVRTYIRNQEDEDERYDQMKLEI
>NE1061 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNS
>NE2516 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2483 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNS
>NE0245 hypothetical protein
MALPQRQIVRAENVKIGISWQCALCDLDIYARPLPGAEVIYFGRMVTTHG
RYWKDYRNSPQPTNGYETISFDVPLDLRPVVIAINFYEGEAPQGVSGEIR
IAVDENTYAAPFHISATRGNRGQGVAKIIETGKASGNHSVIVDPLHIIRA
R
>NE2155 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1222 Transposase IS4 family
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQARYLGKDIRCFDRRPGQSIYYDRQHDRAR
SSAGRLRKRGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDC
TQALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRT
YDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE0562 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNT
>NE0184 NUDIX hydrolase
MTWKPNVTVAAVIEQDDKYLLVEEIPRGTAIKLNQPAGHLEPGESIIQAC
SREVLEETGHSFLPEVLTGIYHWTCASNGTTYLRFTFSGQVVSFDPDRKL
DTGIVRAAWFSIDEIRAKQAMHRTPLVMQCIEDYHAGKRYPLDILQYYD
>NE2532 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1522 Transposase IS4 family
MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA
EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA
KNTSSFLAVVQIRCIALWADIL
>NE1630 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR
>NE0940 putative DNA transport competence protein, ComEA
MYYIWPKRLKLRNPEALFLNFCGDSEMKKIFLILVIFFGFNLSVLAGVDI
NTASQADLESVKGLGPVKAKAIIEYRNKYGMFKSVEELANVKGIGAGILK
QLGDQVSVQEGAVLTETKVD
>NE0880 probable ATP-dependent DNA helicase-related protein
MSDLNTVFSADGLLARNIPDYRPRTQQLEMAQAIAQAIESQEVLVTEAGT
GTGKTYAYLVPALLSGGKVILSTGTKTLQDQLFQRDIPTVRAALKIPVTI
ALLKGRANYICHYHLERTLNSDHIHFASRTEVKYLNLIERYAGTSSHGDK
SGLDKVPEQAAIWQHVTSTRENCLGSDCPHYRQCFVMEARKRALSADIVV
VNHHLFFADVMLRDEGLSELLPACNTVIFDEAHQLPEVASLFFGESVSTG
QIQVLVRDTDTEALLEAKDFAPLFDATAAVGKAVLDLHLTITEKHTRMSS
ASAARYPGFSEARQVLQEKLVLLAGLLETQAVRSQGLQNCWLRAQTLLNR
IRQWHEQSESREFICWVETYSQSLQFNTTPLSVAETFSKQLDASARAWIF
TSATLSVKKDFSHYNRMMGLFEAKTANWDSPFDFPNQALLYVPSQLPDPN
TPHYTESIVQAVLPVIKASQGRAFILCTSLRNMQQIHELLQVAFQREQLE
FPLLLQGQEARSALLNQFRQLGNAVLVGSQSFWEGIDVKGNALSLVIIDR
LPFASPDDPVLSARIEKFTREGRNAFMEYQLPHAIISLKQGAGRLIRDEK
DRGVLMICDPRLVSKPYGKQIWQSLPPMKRTRDPDEVLRFLENVDQ
>NE2412 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE1675 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1182 Helix-turn-helix protein, CopG family
MRNTMTHRVTITLDAETFAFLNDVASSNRSAYVNQLLKQDRKNFLQAALR
KANQEEAEDTNYQEKLQAWESTLSDGLAND
>NE1760 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2228 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2166 UvrD/REP helicase
MNTVLQQEIPDQRERQQALDPWHSFIVQAPAGSGKTGLLTQRFLVLLATV
EEPEEIVAITFTRKAASEMKHRILQALRDTAGDINSDAESETALLNDAYQ
RQLRELANRVLAHDQARGWQLLQNPSRLRIQTIDSLCAWLVDRMPVCSRQ
GALSSVAEDADRLYLEAARLTVEALEEEGEWTAAIEHLIGHLDNRLDRLQ
QLIADMLARRDLWLRGVVDAANSDDMRDRLESVLSGRIAEAIERLADAVP
AGCQSEIIELMQFAAVNLSEAGSADSNTVRWPGNALEDRLVWESMADFLL
TQTGDWRKQVTKANGFPAPSSVRDADVKEYLNGMKQRMSELLVALQSEET
FRQQLQLLRQLPPERYTDEEWETLQALFSLLKVAAGYLLLVFRQHGQVDF
TEIAMAAVRALGEPEMPTDLALALDYRIHHLLVDEFQDTSSSQAELLQRL
TAGWQTGDGRTLFLVGDPMQSIYRFRQAEVGLFLDIRDSGYFGQIQMRFL
RLSVNFRSQSGIVEWVNRYFPRILPDTDSVSTGAVSYASSVAFHAASSGE
AVRIYPYLQKDDRAEAEQVGAIVAQARAAQPDGRIAVLVRNRSHLASIVV
HLRRKGLRFQAVEIEQLAQRVVIRDLMALTRALVHPADRIAWLALLRAPF
CGLSLQDLHTVANTLPQHVLIDSLRACAGSGVLSEEGGQRVNRVLPILER
ALMLYDRMSLRRCVEGIWVSLGGPASVQNETDLADAEVYFQLLENFDVTG
YRPDIQELDERLVRLFALPDVAADDSLQLMTLHKAKGLEFDTVILPGLGK
SPRRDQEKLLNWLEFHDQSQHPGLLCAPISAAGSDKNPISAYILSEEKKR
TALEEARLLYVAVTRAKHNLHLLGHLRIDPDMQENDALKPPEDTLLARLW
PAVAADFLARSREAAIGDLPASNVHTGLQLVGMVRLVSGWQPPPLPKAVA
VAMHANEAGTTEEPVDFDWAGEPARLVGVVVHCLLHRIGLIGVENVDHQD
LEALKLAGRSLLIQSGITPRHLEKAVQQVARALRTMCVEDETGRWILSNR
HQEARCEWALSVPTAIAAGHSISVSIIDRTFVDAAGVRWIIDYKTGSHTG
GSLEEFLDREQLRYRPQLDRYAQVLQRMEDRPMHLALYFPLLGKWRKWIP
SRESA
>NE2178 conserved hypothetical protein
MENPAMTTRPFDARQTNITDLIQRLDGCATIVTGNRRLARALHQAFNQAR
SAEGHGAWPAPDILPWDAWLQQLWQEVVISSRIESAPGVLLTSHQEYFVW
QEILAEQSGDVPLQATNETVARIMEAWQTLHAWCIPCREADFGHNADTRL
FWQLASMFEAKCRKNSWLSVAVLPGILQKYVQIDSLSVPNELVLTGFDEW
TPQQSSFLRAFEQTGCSLQWLQLSGQPDRIGKLACADGRDEIRQAARWIR
QRLEENPAARIALVVPELAAQREMICQTLDEVLIPQALQPEHHDRVRSYN
LSLGKPLDRYPPVSLALDVLGLSETVIELPHVSRVLRSSFIAGGDREMNA
RALLDARLRESGEWNLTLQKLLTNAARSGQPYSCPLLAECLSNLMKQVKV
SLAPTSPGEWAQRFGQWLKAIGWPGERGLSSEEYQVIQAWQGVLREFSTL
DWVIRSVSLTEALQQLRHMVAGTIFQPESAEAPVQVLGLFETSGLQFDYL
WIMGLHDGVFPASSRPNPFLPLTLQREVDAPHSSARRELRVAAALLQRIT
TNATEVVISYPQRKGDEILDSSPLIDAFPALSEEMLAMGTQSAWRDSVYH
SRQQEVLSEDVAPTFVGTGIPGGSKLFKLQAACPFRAFAELRLVARPLGR
IQIGLNALVRGTLLHRVMEMVWAELDSLAALANLSPGELNALVAGKVNEA
IYEIAPRYPHTFGERLQALESKRLHALVLAWLEMEKQRPPFRVSGREMET
ELELNGLRINLRIDRIDTLEEGGELLIDYKTGEVKASAWFGDRPDEPQLP
LYSLAFTDDGLAGIAFARIRAGDIAFEGVASEEVSILGIKSFENLRHTRE
AASWDEVLAGWRQTIEQLVQDFMAGEARVSPKQYPQTCTYCELKPLCRIG
ESLEAVDDC
>NE2024 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE1586 transposase
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE0341 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE1880 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2413 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYL
>NE0939 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0093 recQ; ATP-dependent DNA helicase
MRHQFPEVGRLISELRDAPCEREDCQYCQTTHDPRHELKRYFGFPDFRYE
QPGESLQHDIVLAGMRGQHVLAILATGGGKSLCYQLPALNRFHRNGSLTV
IVSPLQSLMKDQVDGLLERNVQCAAALNGLLTMPERAEVLEKIQMGDVGI
LLVSPEQFRNKAFRRAIRQRQVGAWIFDEAHCLSKWGSDFRPDYLYVSRF
IREYTGDQPLAQIGCFTATAKPDVLADIQSHFRESLGIEFKVFPGGHERT
NLHFDVLPCTKAEKWSRTDRLLHEHLDSQEGGAVVFVSSRKSAEELSDFL
IGQGWPCKHFHAGLEPNTKTDIQDDFKAGQLRIIVATNAFGMGVDKADIR
LVIHADIPGSLENYLQEAGRAGRDQGDARCVLLYDPQDIETQFGISERSK
LSIRDIQQILRKLRNESNRRKGGKLVITAGEILLDDDVDTSFSADERDAE
TKVVTAVAWLERGDYLKREENHTQIFPARLDMSEKEAEKRLLKAKLPQRR
LEEFRAILRFLYGADADERVNTDQLMQLTSLESEEVASALKQLEEMGLLV
NDSQITLYVRHGVTGASSQRLQSSLELERALLQRLPELASDAGQGEWQDL
NLPALAAELKADTRQGDLLPLQVLRLLRSLADDHDANSQQRSSFELRQLN
RDYLKLRIKGGHSWRQIERFGEKRRALAGVLMEFLIGKLPPGSRNKDLLV
ETSFGELVKALESDLELPHLIAPDQRRKAVEHVLLYLHRQDILTLNHGMT
VMRRAMTIEVKKEDKRKTFLKEDYLRLDEHYREKRIQVHVIHEYAEVALK
EMAEALRLVLDYFTDSKQAFIKRHFAGREDVLKLATSEASWKSIIESLST
TQKLIVADDDDINRLVLAGPGSGKTRVIVHRIAYLLRVRRVPATSIVALT
FNRHAANEIRKRLLALVGADAYGVSVLTYHSMAMRLTGTRFERGDTVDER
ALKRVLSDAVELLEGKRNVEGEDNLREQLLRGYRYILVDEYQDIDDLQYR
LVSALAGRHAEEDGRLCIMAVGDDDQNIYAWRDTNNRYIERFREDYEASI
SFLVDNYRSSLRIIEAANQLIGQNSARLKEANPIRIDRARQELPAGGLWE
EQDKQRKGRVLRLLIDPSDRERGNLQAQAAMLELERLLVLEQGSWNGCAV
LARTHRYLWPIQAWCEQHDIPYFLAADKETALPITRQRSFVAAIDSLREI
ESALCAADAWLRLSGSNQLVEAEWKSFFQTAFEQMRGELGDCQLGSGALI
DWLYDYARELRQQPKEGLYLGTVHSAKGLEFRHVVLLDGGWSTQVDTLAD
ERRLYYVGMTRAEQTLALCEFADGNPFSRSLMKGVQQHAFQGQPLPELEL
RFQQLTLKEIDISFAGRQLPHARIHKAIEALREGDPLTLKEEAGRYQLLD
RQGNVVGRTAKSFQPQIGFAHCEVAVVIVRFAEDSEEQYRDLNKCERWEV
VVPRGRG
>NE0188 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1122 conserved hypothetical protein
MNDLESLISQVRRCTLCAEHLPLGPRPVFQLHETARILIASQAPGRRVHE
TGLPFNDPSGDRLREWLNMTRTIFYDPRRIAILPMGLCFPGTGKSGDLPP
RPECAPAWRSALLSHLKNIRLTLLVGQYAQAYYFTRQGRKPVATLTENVR
SWQKFWPDIVPLPHPSPRNNLWLRRNPWFEEEIIPALQERVAMILNQTTD
S
>NE1470 conserved hypothetical protein
MRSTTLLFFCSNLVWISPLTWAQTSQQPDNLIPLPEIPESPSAGEENGLP
PELGLDPSLEPEITIHEGKDKTMIEEYRVNGELYVIKITPRIGKPYYLLN
RRSAVGMPHRGDMESGVSVPMWQIYRF
>NE2096 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1093 Transposase IS4 family
MARFKPVQKGLMLLPVDFSRQIIPGSFEHALCYLVDHELDFSGLRERYRN
NTQGAPAYDPAVLLKIILLAYSRGLIGSRRIEAMCRQNILFIAVAGDNQP
HFTTLAAFIAELGDEVAKLFAQVLVVCDRQGLIGRELFAIDGVKLPSNAS
KAKSGTRADYQRQAEKMEKAAKQMLVRHREIDMTPVDERQAQREACMLER
LQKEAKQLKDWLAANPEDRKGPKGGVRQSNLTDNESAKMATGKGVIQGYT
GVAVVDEKHQIILDAQAHGTGSEQELLVPVVQAIKPQMSNQTVITADAGY
HSENNLKMLAAEGIDTYIPDNGYRKRDERYHGQEAHKTKPDPLWDKRGQP
SISKRFGPGDFQLAEDGSHCLCPAGKRLYSNGSNCTFNGYAAMKFRGAER
DCLPCTLRTQCLRTPEKTKTRQVAFFRGKRDGYETHTDRMKRKIDSDQGR
QMITRRFATVEPVFGNLRNNKRLDRFTLRGRSKVDGQWKLYCLVHNIEKL
AHYGVGQ
>NE0518 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE0715 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE2028 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNTRICR
>NE0560 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE2541 Site-specific recombinase
MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWI
PVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLAD
FSKMVEVFERYAVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGER
IRDKISASKRKGMWMGGVPPLGYDVENRRLVPNEREAKLIRHIFQRFVEL
GSSTALVKELKLDGVTSKAWTTQDGKTRDGRLIDKGHIYKLLSNRTYLGE
LRHKDQWYQAEHPPIINRELWDSVHAILETNGRVRGNTTRAKVPYLLKGI
VFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELES
AVLDQLRAILRAPNLLGEMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLF
PAEQTRIVKLLVEKVIVSPNDLEVRLRANGIERLVLELRPEPVEQQEVAR
A
>NE1261 Integrase, catalytic core
MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1560 conserved hypothetical protein
MHCRSFPPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILG
FDATSAYPLRVSSLKDHGVALWDVLQSCTRESSLDADIVAHTIVPNDFGR
FFTACPDIRRVCFNGAKAAALYARHVKPFLQDAPTVEYVQLPSTSPANAA
IPRADKLRAWSVIKHNA
>NE0254 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1398 putative DNA polymerase-related protein, bacteriophage-type
MDPNRLKELGLLPVWRVRPGAIAGQSPDSGKNMSDQIETAAESRPFEESE
DRSTSIAHADWGRLRQMVSGCTACPLSQTRKQTVFGVGDEQADWLFIGEG
PGAREDELGEPFVGQAGKLLDNMLQAVSLRRGQDVYIANIVKCRPPGNRN
PQDAEAEQCRPYLLRQIALIQPRLIVALGKVAAQNLLATDASIASLRGRL
HEFSGIPLIVTYHPAYLLRSLGDKAKAWEDLCFARDTMRNLQAAHSS
>NE0845 DUF196
MLIIVTYDVSTETRAGRKRLRRVAKLCESIGQRVQKSVFECRINLMQYEE
LERRLLSEIDEQEDNLRLYRLTEPAELHVKEYGNFKAIDFEGPLTI
>NE0228 CHC2 zinc finger
MIEQSFIQELLDRIDIVDVVARHLQLKKAGANFTACCPFHNEKTPSFTVN
SSKQFYHCFGCGRHGNAISFLMEHSGASFVEAVESLATHAGMQIPDQVSI
YPKIPDPGRVPSDKIKIDKEVEATSPLAGLYERMEQAAKFYRGQLKQSDQ
AIAYLKERGISGRTALCFGIGYAPPGWQNLSGIFTDYPADDSSHPLVQAG
LVVAHDGKKNYDRFRHRIMFPILDRKKKIVGFGGRALDGGEPKYLNSPET
SLFVKGRELYNLASASPAIRKSARVIVVEGYMDVVMLVQSGVENVVATLG
TATTAMHIQNLLRHTDEVVFCFDGDAAGTKAAWRALETSLPQLKDGKDIK
FLFLPDKEDPDSYIRKYGRVAFEGLLEKAQPLSVFFCNELSGRVNLGTSE
GRARLVQRAGPLLAQINAPVFGFMLTKRISELTGVGQNQLAAFLKTGKKN
RSSTLRPEASRPLSVTPYRRLIQILLHAPDYANKLDTNLLAVNDEQNEEK
VLLVALVDFLKTSACSMEEELNSVTILLHFDQTPHRVLLEKIVRDAHVKD
ENWNIDAEFTGGMERLREMQRRSRMAELHSRPLVSLTPEEKNELRQLMLS
>NE0231 hypothetical protein
MTMIKKIETELLAAKATLSEISGRFKEFSDTQARLSADGDLLGLARLNKE
HTGLEDSLLAADDTVRALESRLSVLRQAEYRPQFDKAHKTHLGAVQAETK
AAEKLLAAIDAVFSAATDMQNLSDEVAATYHAARDLHNRAGLDHELRWPA
PDGQIPIKISDRMNSLRDELVRTIRLYEDRLPQSQSLEGLRIIEQQQEEL
VRNSGRGFR
>NE0511 hypothetical protein
MAHAKGGKLFDSPVRLNYIIGIKSHILDSNLQLKWSVLLDIDREPAETAE
EQFDSSFFLMTEELRQLLPELMEALGGTASE
>NE1991 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE0259 hypothetical protein
MVKPTDAELRTSGGLTSVFLNCDTCLSDEDFNRLRRMEFTQNEHAILYGQ
LGGSIAGMIELGPVASRTQSRQDEERKTEERRTAQFVQLVEQMRASIEQM
EADVKRLVASFEKRDGDAWREKLALNILEADEIPQQEADESITAYRKRLE
QHLINEMLNPDGTIKDKYKNDPKYGDYAEWAQTQFHLNSAKAAVAELDNS
DTSPQRKEHILDEMKQRGYIEEMVFTDRISGNLDAQKSVRDIRDSQHDEA
LSQVRPPEATLKFLS
>NE2470 NUDIX hydrolase:Conserved hypothetical protein 52
MDFDVLEKTVCFQGFFRLERYRLRHRKFNGEWGRPITRELFERGHAAAVL
PYDPQTDEVLLIEQFRAGAISAPGGPWLLEIVAGVIEANETPEQVVARES
MEEANCQIGSLIPLYDYLVSPGGTTERIVLFCGRVDMQTIEAGAVYGNHG
EDEDIKVHVMPLNEAIRLLSTGRINSASAIIALQWLALNRDSVRRRWLPE
>NE1840 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1557 putative transposase
MTYPISFRRKVLSVREKEGLTIAQTAARFCVGIASVTRWIKNPVPKESRN
KPATKIDMAALAHDVREFPDAYQAERARRLGVSEKGIGHALRRMHISYKK
NTAAPQSGRRQTAHLPGDD
>NE0288 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNR
>NE1347 conserved hypothetical protein
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKVWDEFTSKPSSTLIPSGQQRSC
IPTKHQLHQLICSMTGYCRSLLNRVWALFAF
>NE2385 Staphylococcus nuclease (SNase) homologues
MHFTRALRIQLIPSFFFRAIYPLVVLLILLHAQSGLAETIYRSTDSHGRT
LYSDIPTPAAKPLQPATPPARSKYRVTRVIDGDTIVLENNKRVRLLGINA
PETGNRYHPGEPGGADAKKWLRGKLQGRSVYLEHDRQTHDHYKRMLAHLY
LPDGEHINLSLVEKGLAIANLIPPNLLHANTLIRAQQRAETRKLGIWSMQ
HYQPRPLIKLTEKPFGWQRYRVKAKVLKRNHRFSRLIISDNLDLSFANRD
LALFPPLETYLNRPLEVRGWVSRRKNHFSIRIQHPSALILY
>NE0744 conserved hypothetical protein
MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP
VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA
ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET
FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML
DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV
RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL
SSPIVPLQAALGRENQFQLLTNT
>NE1884 possible homolog of eukaryotic DNA ligase III
MTDFFRFPNTPHLLWLGQGQPRDDKILSDAEIAALLQDEVLIEEKLDGAN
LGISLDEHGELRAQNRGQYLPQPFSGQFSRLNSWLGQHGEILKHTLTPEM
ILFGEWCAARHSLDYNKLPDWFLLFDVYDREAGKFWSVERRNQLAQKLNI
TTVPLLKRTKITCNQLVQLLDDAQSRYRSGKVEGIVIRCDSPLWCESRAK
LVNREFVQAIEDHWRSRSIEWNLVHAGSVKRS
>NE2469 Transposase IS4 family
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK
LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT
EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI
EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA
>NE2445 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1106 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE0110 hypothetical protein
MPADHFLKRQAMSDFIICYDITDPRRLGRLYRYLIKRAVPLQYSVFLFRG
DDRQLERCIQDAIELIDEKQDDLRVYPLPGRGLKARIGRPTLPEGIQWSG
LPAKW
>NE1378 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1219 UMUC family (DNA-repair)
MHGMNTKNRRIAHLDMDAFYASVELLRYPELRGLPVVIGGRSVHQPVIQP
DGKRSYVRLRDYTGRGVVTTSTYEARAYGVFSAMGIMRAAQLAPDAILLP
ADFDTYRHYSRLFKDAIARITPHIEDRGIDEIYIDLSEHPDETASLASSI
KQAVRDATGLSCSIGIAPNKLLAKISSDLEKPDGLTILTHTDIPNRIWPL
SVRKINGIGPKAEEKLVRLGIQKIGELAKAELSLLQAHFGRSNAIWLHDS
AHGRDSRPVVISSESKSISREATFERDLHVQEDREILSDIFTELCTRVAE
DLQRKGYVGRTIGIKLRYENFQTITRDLTVRNPTADASTIRKAARDCLRR
VPFEQKLRLLGVRISGLSKISALLKENYYFQEELF
>NE0162 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2520 ATP-dependent DNA helicase RecQ
MAYDPKRALELLRIGSGRANATFRDGQEDAIRHIVEGKGRLLVVQKTGWG
KSFVYFIATKLLREAGAGPALLISPLLALMRNQIAAAERMGVRAATINSD
NMDDWTVVEGKLAKGEIDILLISPERLANERFRTQVLAGIAAQISMLVID
EAHCISDWGHDFRPHYRLLERIVKTLPPNLRLLATTATANNRVMEDLAAV
LGPKLDVSRGDLNRTSLSLQTIRLPSQAERLAWLAEQLATLQGHGIIYTL
TVRDANQVAQWLKTQGFNVEAYTGETGDRREQLEQALLNNQVKALVATTA
LGMGYDKPDLAFVIHYQMPGSVVAYYQQVGRAGRALDSAYGVLLSGQEES
DITDWFIRSAFPTRQEVADVLGALEDEPNGLSVPELLSRVNLSKGRVDKT
IALLSLEAPAPIAKQGSKWQLTAATLSEAFWDRAERLTALRRDEHQQMQD
YVSLPFGEHMGFLIGALDGDPSVVAEPALPPLPATVDAELVKAAVEFLRR
TSLPIEPRKKWPDGGMPQYGVKGFIAPAHQAESGKALCVWGDAGWGGLVR
QGKYHDGHFSDDLVAACVKMIQEWNPQPSPTWVTCVPSLRHPELVPNFAQ
RLAAALGLPFHMVIAKTDARPEQKTMANSTQQARNIDGSLALNGQPIPPG
PVLLVDDMVDSRWTLTVSAWLLRKNGSGEVWPMALSQTGHDE
>NE1053 Uncharacterized ATPase related to the helicase subunit of the Holliday junction resolvase
MTDSPHTIRNPAAPLAERLRPRTLDDVVGQSHLLGPGKPLRLAFESGKPH
SMILWGPPGSGKTTLARLMAHAFDAEFIAISAVLSGVKDIREAIERAQIT
LQRTGRATLLFVDEVHRFNKAQQDAFLPHVEQGLITFIGATTENPSFEVN
GALLSRAQVYALKALTDQELHQLFERARSIAMLDLEFENTAIELLIGFAD
GDARRLLNLLEQVQNAAETEEIIKIDADYLSRVLARNVRRFDKGGDAFYD
QISALHKSIRGSSPDAALYWLCRMLDGGADPRYIGRRLVRTATEDIGLAD
PRALTLALNACEVFERLGSPEGELALAQATLYLACAPKSNAAYVAYKQAR
AFIKEDISRPVPIHLRNAPTRLMREMGHGAAYRYAHDESESYAAGENYFP
DNILAVQFYRPTTHGLEAKIGEKLAYLRSLDEKTGKKRN
>NE0239 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2013 hypothetical protein
MCRCRLHWCSHTLESIEEHGCTAHVKGRGQQAKEKRRHPGAHARRWIIEV
SHGWFNCLRKLLMRYEKLARSFLGLNHLAAAIIAFRKVPLAVNIIYE
>NE0257 Site-specific recombinase
MESQHNHVMKKSCRIIGYARVSTEDQHLDLQIDALKLAGCSSIFEDHGLS
ATAKRRPGFEQALASLQAGDIFVVWKMDRAFRSLKNALDILEEFENRAIE
FRCLTEDIDTTTPMGKCMYQIRHAFSELERNLIRERTKAGMEAARQRGAH
LGRPKKLSRGQIIRMQNLLQRQPDMTPVQIADQFGVSSRTIYRALSKYST
IKEELAIHAG
>NE1521 possible transposase
MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG
APWRNLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS
HCKVHPHANGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL
IARRLAA
>NE0340 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNT
>NE2466 putative lipoprotein
MYQKFNKPVNAALHLILVLSITACASQNKFFDTLDYNRDWEAIQSNLPAY
PQPENLLEFDSGPATSLRYFIDAKSISVDEKRVIRYSIVIQSQQGANNVS
YEGLRCETRERKRYATGNNDIRSWVRANTSEWQPLEAVAQLRAQRELAKY
YFCPRGLVVGSPAEAVRALKAGVHPMVIR
>NE0519 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE0244 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNIYRAQQDHFQLVGTDRLEIMRGHGIQRHASKQRWH
ISDKTTQLAAQRFHVKRPETLHEIGMPVTLHDTVTAVTDMSNDIFEQPCL
TGCAERRFALGSEQMPIGRKAATRHRKGRLLRIVVEW
>NE0098 conserved hypothetical protein
MRQQLLDITEIGPPSLDNFISSGNEEVLYTLRNLVAGNQQDRFYYLWGKT
GSGKSHLLQAVADAFSEQQCNSRYIDCNQDEPNFNPGTDCIVIDNVERLD
DAAQIRLFNLYNHLRDNKHGIFLASGTKPPAQLDLRQDLTTRLGWGLVYQ
VHELTDEKKIEVMQDYAIRCGFELPLEICHYLLKYEQRNLSSLIRLVHAL
DQLSLTRQRPITLPLLRELL
>NE2232 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0119 hypothetical protein
MKFLDTQELVVSTLSPVHIGCGEDYEPTEYVVDTSGVLHRFNAGILPDLS
DAGISSDILTILSNDEAHTEQLRAVHKVLSKYRDKIIPLASVHVSMCTGV
HAHYKSTQDKKNDFNRNGVERTSYQPFNQLPYLPGSSIKGAIRTAILNEH
IAGNNPCSTVLMRQIQDFNTMIEEYDPGNGKLLLRLKLQHTKWDYDRARK
NIEKAIADVSSALGTDLLGGKFETDPLRALKVSDAAPLDIEIEREIRFCL
NRSRSGRRSQAQVKNLYTRLEYILEHQPAAFSLSLTLQNLHEIAGRRNHR
NELISPSADKLLLWTGIVKACNSYYLNRLDDDLAMLGKLYPTSEWRKQTQ
SILDAGLRDQIKTGNCLLLRIGKHGGANSNTVSGRQIKIMLNEDKREANG
KEEKIRLYTFDDESRTIWYCGDDLDKPSDLLPHGWIVLSNPDQIWHADLP
GFERRCARQQAIAESARRQAEAAAAEQAKAAAQAAREAALAAMTENQRRI
EAFVSMCARRAEQLRGGKENPNAAIHTAARELVKAALEGADWTIDEKCAV
ADAIEEWLPKLVKVELKDERKKLKLSALRT
>NE0155 Integrase, catalytic core
MCGVFREGVAVRYARIEQLRQHHAVAAMCRILDVSESGYHAWRQRPPSAR
QQENLRLETEVKAAHQRTRETYGPRRLRSDLADHGIQTSLYRIKRIRRKL
GLRCKQKRKFKATTDSRHALPLAPNLLDRQFTVAAPDRAWVSDITYVATD
EGWLYLAGIKDLFNGELVGYAMSERMTTSLVSQALFRAVAAKRPARGLIH
HSDRGSQYCAHAYRKQLQQFGMQASMSRKGNCWDNAPMESFWGSLKNELV
HHRRFTTRTQARQEITEYIEIFYNRIRKQARLGYLSPAQFTQKYHAKQIA
A
>NE2201 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE0714 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWPNRT
>NE1260 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0517 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNT
>NE1366 conserved hypothetical protein
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVDEFTSKRSSPLIPSGHQRSC
IQQKHHLHQLICSMTGYCRSLLNRVWALFAF
>NE1815 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0935 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPGPMVKSNA
>NE0844 Protein of unknown function DUF48
MTQARLCLSVLYFPFSETTPKDLDQVRGIEGDAAKTYFSALPYLVRKDIR
EFFTMDGRTRRPPRDRFNAMLSFIYSLVMNDCRSALESVGLDPQIGFLHA
VRPGRAALALDLMEEFRSFMADRLALTLINRGQITDQDLLVREGGAVHLE
DKARKTVVVAYQERKQEEITHPLLETKVPIGLLPQLQARFMARVIRGEMD
GYLPFLVR
>NE0253 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE0236 ccrB, Site-specific recombinase
MTQQAVIYCRVSSLKQVTEGHGLASQETRCREYAKHKGYEVVEVFHDEGI
TGKLLDRPNMKAMLIYLKQHRATRPVVIIDDISRLARDIETHLHLRASIS
AAGGKLESPSIEFGDDSDSRLVEHLLASVAAHQREKNAEQVFNRMKARMM
NGYSVFNAPIGYRYDKVGKHGKLLVPDQPCASVIAEGLEGFASGRFETQI
ELMRFFEASPHYPKDRFGTVHMQRIKEILSRVLYAGYLDKPDWGIHLVKG
HHEALVSYETWKKVQARLNGQAKAPVRKDINEDFPLRGFVTCACCGSPLT
ACWTRGGGGLYAYYLCYGKTSGVKCSQNGKSIPKDKLEGEFGALLSEMKP
SKEMFLLAAEIFTDLWNIKRDTAKQEAETIRRNLLQIERKTEQFLDRIAD
TDNSILITAYEKKIRQLEEEKIALDEKIAQCGRPLQSFDETFRTAFSFLS
NPYQLWVSSRLEHKRAVMKLAFSERLRYCRNEGFRTPEKSLPFLLLEGSD
EGKHEMVGLVGLEPTTKGL
>NE0001 dnaA, dnaA; chromosomal replication initiator protein
MQKIETFWHFCLKHFRQELNGQQFNTWIKPLKLEVCPDEKNTLILIAPNR
FVLQWIKDNFVTRIDEMAQDHFNERISFRLELREPAESEAQTVRTSAQKN
REDKKPAAEKTQGVTSRKTNPSQLNASFTFDAFVTGKANQLARAGAIQVA
ERPGIAYNPLFIYGGVGLGKTHLMQAIGNYVLELDAGAKIRYVHAEKYVS
DVVSAYQHKSFDKFKLYYHSLDLLLVDDVQFFSGKNRTQEEFFYAFNALI
EAHKQVIITSDCYPKEISGLEERLVSRFGWGLTVAIEPPELEMRVAILLK
KALAEKIELDENTAFFIAKYIRSNVRELEGALKRVLAFSRFTGHSISLDL
AKEALKDLLAIQNRQISIENIQKTVADYYKIKVADMYSKKRVRTIVRPRQ
VAMAIAKELTQLSLPDIGEAFGGRDHTTVLHAHRKIIELRTSDPGINRDF
NALMHILRG
>NE0194 dnaB, dnaB; replicative DNA helicase protein
MNQLTSSFIQNLAEDNIYKLPPHSIEAEQSVLGGLMLDNQAWDKVADIII
ESDFYRQDHQLIYQHISRLIEQNKPADVITVAESLENAAQLQHAGGLAYI
GAIAQNTPSAANIRRYAEIVRERSIMRKLAQVSTQITDSAYNPAGRSAGD
LLDEAESRIFEIAEQSAHGKQGFVDIQPLLKQVVERIEVLYNRSNPSDIT
GIPSGFNDLDQKTSGFQPGDLIIVAGRPSMGKTAFALNIGEHVALETSKP
VAVFSMEMGGVQLAMRMLGSIGRLDQHKMRTGQLNDDDWPRLTHALGKLN
DAPIFIDESAGLNSLELRARARRLYRQHEGLGLIIIDYLQLMSATSPGSE
NRAAEISEISRSLKALAKELQVPVIALSQLNRGLEQRPNKRPIMSDLRES
GAIEQDADVILFIYRDEVYNPDTPDKGIAEIIIGKQRNGPIGKVDLTFLG
EFTRFENCARTADYY
>NE1978 dnaE1, dnaE1; DNA polymerase III (alpha chain) protein
MPIDPVFIHLRLHSEYSVVDGIVRVEEAVAKARDVGMPALALTDLSNLFG
LVKFYQCAFKAGIKPIAGCDVWVTNENDADRPFRLLLLCQSFSGYLLLSR
LLSRAYRENMCRGRAELKKSWFREEDAGTEGLIALSGGGQGEVEQLLLAD
PPAAVTAAQQWADLFPGRFYLEIQRCGRPNEETSGYALLDLASSLKLPVV
ATHPVQFMRPEDFRAHEARVCIAQGYVLGDRRRPKEFTGQQYFKTPAEMG
ELFRDVPEALANSVEIARRCSLMLELGVNRLPDFPTPAGISVEQHLRELA
QTGLEARLLQSFPQVLQRDERRPIYQMRLDFEVETIIQMGFAGYFLIVAD
FIGWAKQHDVPVGPGRGSGAGSLVAYSLGITDLDPLLYDLLFERFLNPER
VSMPDFDIDFCQDRREQVIEYVRDRYGAESVAQIATFGTMAAKAVVRDVG
RVLDLPYNFVDQLAKLVPFELGMTLRKAREIEPLLNQRAEEEEDVRNLLE
LAERLEGLTRNVGMHAGGVLIAPGKITDFCPVYCADSGDAVVSQYDKDDV
EKVGLVKFDFLGLRTLTILDRAVADIRQYRAASPGSAVAEPDVQSAEESH
FSLESISLEDAATFSLMAKGNTVGIFQFESRGMKDLLQRARPDRFEDLIA
LVALYRPGPMDLIPDFIERKHGKRVDYLDPRLQPILGPTYGIMIYQEQVM
QIAQVIGGYSLGGADLLRRAMGKKKVEEMAQQRAVFVEGAIRNEMAEADA
VTLFGLMEKFAGYGFNKSHAAAYALIAYQTAYLKTHYPAEFMAACMSSDM
DDTDKVNVFYEDCKLNGIVILPPDINESGYYFVPVDHKTIRYGLGAVKGS
GEAAISAIVQVREQGSTFTGLFDFCRRVDRRIVNRRTIEALIRAGAFDSV
ETNRAALLESVGNAMEYAEQCSLAASQVSLFDENTDLIQPPAITGVAQWP
EREKLQNEKMALGFYLSGHPYDSYARELSCFIPVRLSRIVPGREPQLIAG
VIYAIRTQMSRRGKMAIVTLDDGLARVEVVVYSDLLSTGSHFMKADQLLV
VRALVSHGNGENADRRIVAKEIYDYVTARSMHARKLRIMIDDSGLLTPAQ
LKELLAANLPENGVNNVIPSSGCAVSIDFRNQVGSCEIDLSSRWRVHLHE
GLIESLMDILGRDKVEVVY
>NE0002 dnaN, DNA polymerase III, beta chain
MKLTITDRDLLFKPLQTVSGIVERRHTLPILSNTLIEIRNGQLTLVTTDL
EIEAEATSNIPELENQGALQTTVSVRKLQDILRALPSGAAIELTRSENRL
QIVSGKSRFSLQLLPAEDFPRMIRDSEPCSATYTLAQRVLKKHLQRVAHA
MAQQDLRYYLNGMLLLIEDNKLTLVATDTHRLGITSIDLDGNFEKSETIV
PRKTVLELIRQLEDSDKPVIVEIYPKKVCFRFSDAVLVSKVISGKFLDFR
RAIPQTSVFQFDVNRLDFLHALQRTAIISSSNDLFRNVHLNITNGKLNIS
AKNKEQEEAQEEIDIVYSNETIDTSFNIVYLMEVLNNLDSEQIRCSFESM
QSAILITLPDDEQFKHVLMPMRE
>NE0141 dnaQ, probable DNA polymerase III (epsilon chain) protein
MRYVFLDTETTGLDPALGHRIVEIAAVEVCNRRLTDRHFHRYLNPGRESD
EGALRVHGLTREFLRDKPVFQDVCSEFLEFIADAEIFIHNAPFDVGFINR
ELDLIRFESMQNHCLQIIDTLVLAKELHPGKRNNLDALCERYQIDNSHRT
LHGALLDAELLAEVYLAMTRGQESLLMEMDAPASRQADNPAVGKVENLAL
IVQPATQAELELHSRLVERINAESKGNCLWNG
>NE0433 dnaX, dnaX; DNA polymerase III (subunits tau and gamma) protein
MTDSQVLARKWRPKDFSELVGQEHVVRALINSMEQNRLHHAYLFTGTRGV
GKTTVARILAKALNCEQGVTAAPCGKCAACMAIDQGNFIDLIELDAASNT
QVDAMRELLDNAQYAPVAARYKVYLIDEVHMLSRSAFNAMLKTLEEPPEH
VKFILATTDPQKIPVTVLSRCLQFNLKQIPPSLIVERLTEILSMEGIPAD
AAGLRLLAQAAKGSLRDALSLLDQAIAFGNSVVNESDTRAMLGVLDQDHI
FALLEALAEQNGAAIFAIADQLEAASVSFDQALQDLAALLHRLATAQVIP
QMLDETQPDGDRLLALTKRFSPEDIQLFYQIVLHGRTDLAHAPDEYSGFT
MTLMRMLAFMPDSRQPGRAYADTGTDHAREVKVEAPSCPREAKPVSDQSP
NEAWLALVNQLKLSGMTRMLAQYSEAKSFSESRIELYVAEMHKHLLEKSY
QDRLRSQLEIHFGKPVEVIFSQGSITGVTSAALQDRDKLARQSKAVEAIE
SDPYVQELIEQFDARLNVSSIKPID
>NE0875 fis, probable factor-for-inversion-stimulation transcription regulator protein
MTVINENEIALCIRRAVEAYFQDLDGEKPCPIYEMVIRSVEKPLIEIAMH
YAQGNQSKAAELLGINRNTLRNRLTKHQIR
>NE0332 gyrA, DNA gyrase/topoisomerase IV, subunit A
MEQFAKETLPVSLEDEMRRSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMHELSNDWNRPYKKSARIVGDVIGKYHPHGDTAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDNAAAMRYTEIRMSRIAHELLADLDKNTVDFGPNYD
GSEQEPLILPAKIPNLLINGSSGIAVGMATNIPPHNLGEVIDACLLLLRD
PDVDIAELMACIPAPDFPTAGIIYGISGIKDGYQTGRGRVIMRARTHFEE
LDKGNRHSIIIDELPYQVNKANLLVRIGELVRDKRIEGISDLRDESDKSG
MRVVIELKRGEVPEVVLNNLYKETQMQDTFGINMVALVDGQPRLLNLKQM
LDHFLRHRREVVTRRTLFELRKARERGHLLEGLAVALSNVDEIIALIKAA
PTPAEAKKGLMARTWRSSLVEEMLLRAMIDAAVFRPETLAAGFGMSDQGY
RLSDAQAQAILDLRLQRLTGLEQEKIVSEYREILDKIRDLLDILANPERI
TTIIVEELTAIKGQFGDPRRSEVVIDAQNLNTEDLITPADMVVTLSHAGY
IKSQLLDDYRAQKRGGRGKQAITTREDDFIDNLFIANTHDFILCFSSLGR
VYWIKVYNVPQGSRTSRGRPVNNLVPLEQNEKINAVLPVKSFDDTRYVFM
STAGGTVKKTPLSEFSRPRTNGIIAIDLDEGDYLIGVALTEGKHDVMLFS
DAGKAMRFDENDVRPTGRNARGVRGMKLGAGQQVISLLVADNENMAVLTA
TENGYGKRTPITEYTRHNRGTQGMIAINTNVRNGKVVAAQLVESSDEIML
ITTGGVMIRTRVSEIREMGRATQGVTLINLDAGEKLAGLERIVETDED
>NE0003 gyrB, DNA gyrase, subunit B:DNA topoisomerase II gyrB
MNTNQPESAKKTDNSHRDYNSDSIKILKGLDAVRKRPGMYIGDTSDGTGL
HHMVFEVVDNAIDEALAGYCDDISVIIHADNSVSIHDNGRGIPTDIKQDD
ELKRSAAEIVMTELHAGGKFDDNSYKVSGGLHGVGVSVVNALSEWLRLTI
RRNGNVYQMEFREGVAVAPLKVTGQTEKHGTEVHFLASQSVFGDITYHYD
IFAKRLRELSFLNHGIKIRLADQRDDREEVFAFTGGIRNFVEYINRSKTV
LHPSIFYAKGLKDNITVEIAMQWNDSYAEQVLCFTNNIPQKDGGTHLTGL
RAAMTRTLNNYIEKNELAKKAKVDTTGDDMREGITCVLSVKLFEPKFSSQ
TKEKLVSSEVRPAVEEIVVQKLSDFLLENPNEAKTICNKIIEAARAREAA
RKARELTRRKGVLDSMGLPGKLADCQEKDPKLCELYLVEGDSAGGSAKQG
RDRKFQAIMPLKGKILNVEKSRFDKLISSQEIVSLITALGTGIGKDEYNP
DKLRYHRIIIMTDADVDGSHIRTLLLTFFYRQMPELIERGHIYIAQPPLY
KIKHGKQERYLKDDYELKHYILGLALVGAELHTGANNPPITGEALARIAD
EYLLAETVIERMSRLIDRTVMYALLKQPDIDLSSETSARDSAARLAILLD
DVEILAEYDENFERYRLKIIRKQHGNLRTSYLDDDFLQSGDFARIRQAAQ
ILHGLIGEGAKVKRGEQEISVREFKEALEWLLEETKKGITIQRYKGLGEM
NPEQLWETTMDPGNRRLLRAQIEDSILTDEIFTTLMGDVVEPRRAFIESN
ALRARNIDI
>NE1137 holA, putative DNA polymerase III (delta subunit) protein
MRLDPEHLARQLDGSIAPLYVVLGDELLLVMEAVDGIRAYVRGQGYTERT
ILTADQRFDWMNLFQWGRQSSLFSERRMLDLRIPSGKPGREGGVAIETFC
RELPRDTVTVVTLPEIDKQGRASKWFKALEQAGQVIEVKPVGRDRLAHWI
KQRLDRQNQMIDQDTLQFFAGKVEGNLLAAHQEIHKLGLLYPPGRLTFEQ
VKNAILDVTRFDVLQLPETMLTADMVRYRHILEGLQGEGVAPPLILAILS
EQIRLLIKIHLLKNSSRGMTIEQAMTALRIWPARQKLMMGAIQRIRYPLL
VQALLQAAVIDRIIKGVEQGDIWEELLNLGICFAADSSFKIIGRKDLSFI
INLSLK
>NE2180 holB, putative DNA polymerase III (delta' subunit) protein
MATAEIFPWQRVIWQQARQSGSAQRHHALLLKGRRGIGKLGFALALAKSI
LCGQGDAAGVACGKCQDCYWFEQGLHPNFRLLEPEALSAQEGATDKDDEE
NRREAGSTKSGRKPSQQISIAQIRALDDFIYLSAHQARDKVVLIHPAEAM
NTAAANALLKKLEEPPPEVLFILVTHNVSLIPPTVLSRCRQTAMPGPDHE
MAKDWLIHQGITDPDFHLAMSGFSPLLALQYDERLAASHTDFIQCLCAPE
RFDPIELAEKLHKLDLSSVTGWLQKWCYDLMSCRTSGRVRYHLKQVAVIR
QQAAVIDPVAFGFLWRNLIASQQLARHPLNPRLFLEAMLLTYMDSIRPAG
SAG
>NE0442 holC, putative DNA polymerase III (chi subunit) protein
MLIACRLCAKAVQQGLKTVVYVPDERLAGQFDKLLWTFTPTGFVPHCRVD
NKLADVTPVIMNSRPVLMEAGCFGVLLNLDADVPPGFEQFPRVVEIVDEA
EDGKLQARKRYRHYQEQGHDVRHHRLDGN
>NE0833 hrpA, HrpA-like helicases
MTYLPEQPACITYPEDLPVVARREEIAHAIQQHQAIIICGETGSGKTTQL
PKICLELGQGAGRQGTGHLIGHTQPRRIAARTVAARIAAELNSPLGKLVG
YKVRFSDQTHPNTRIKLMTDGILLAETQQDPLLRAYQTIIIDEAHERSLN
IDFLLGYLKQLLPRRPDLKLIITSATIDAQRFASHFNDAPIIEVSGRLFP
VEIHYRPNDPIDGEDRDLPRAILSTIDEAMRMGEGDTLVFLPGEREIRET
AETVRKYAFSGPGGKAGLEILPLFARLSHTEQARIFAPGQQRRIVLATNV
AETSLTVPGIRYVIDTGLARINRYSYRNKVEQLLVEKISQASANQRAGRC
GRVMNGVCFRLYSEEDFNARPEYTDPEILRSSLAAVILRMKSLKIGDVEQ
FPFIQPPAPRMIADGYQLLSELGALDERKGLTQIGHQLARFPTDPRIARM
IMAAKQENCLSEVLIIAAALSLQDPRDRPFEHQQAADQAHQPFRDDRSDF
MGYLKLWDFYDELLKHKKSNKKLIEQCQKNFISHRRMREWREIHGQLHIL
ISEMGLRPNQVSAGYDEIHRALLSGLLGNIGFKSDEKGVYEGARAIKFSI
FPGSSLRKKQPKWVVAAELAETTKLYARCAAAIDPAWLERIAGKLCKRHY
FDPHWEKQRAQAMAFERITLYGLTIVPKRRIAYGPIDPAHAREIFIRQAL
VAGEYESTAPFLQHNQQLIDEIRELESKVRRQDILVDEQQIFEFYAARIP
AGIYSGTAFEKWRKQAEQTEPELLYLTREVLIRQAVDGTAAEQFPETLTA
AGHVLPLSYRFDPGHPLDGVTVTVPLPLLNQIMPFHFDRLVPGLIREKIG
WYVKMLPKQVRRHAIPVPQFVTRFLEWLDSCPDQAMLLAESLTAFIRSET
GIKVPLDTWDSRLLPVHLQMNVKVIDDAGMTLGMGHDLIELKAQFGQTAQ
QLFARGAGAEPDSIERDDITRWDFGELPVETRFSRAGKLLTGYPALVDQE
QSVAVRIFDTQEGAQRSMRGGVLRLLCLALKDRIKQLEKNLPVDRQAILL
MSSLIEMDRLKEDIRSAIIDLALIGDDPLPRNEDEFNSQTSRARTRLGSV
SQEIAGLIHTIAQPCQELKKRLSVLDKSAVFLKKDMEEQLHHLIYPGFLS
TTRWQYLQHLPRYLKGMILRLDKYNKNPARDQEQTEIISTLWNQYIQRLN
KHRQAGVIDPNMEIFRWQIEELRISLFSQELKTPAPVSVKRLQKLWESVR
E
>NE2207 hupB, Bacterial histone-like DNA-binding protein
MNKSDLIDVIAQSADLTKAQAGNALDGALSAIKDALGKNDSVTLVGFGTF
KVGKRAARTGRNPRTGAEIKIKAAKVPKFTAGKALKDAVN
>NE0952 ihfA, Bacterial histone-like DNA-binding protein
MALTKAELTDLLFENIGLNKREAKEIVECFYEEMRAALQNGDGVKLSGFG
NFQLRTKPQRPGRNPKTGEEIPISARRVVTFHASQKLKSMVEANYRGESG
TN
>NE1961 ihfB, Bacterial histone-like DNA-binding protein
MTKSELISKLAERFPQLLAKDAELVVKIILDAMAKSLSRGERIEIRGFGS
FDLNYRPSRVGRNPKSGEKVHVPEKYVPHFKAGKKMRELIDSGPKQHKVL
DRVTG
>NE0450 int, Phage integrase
MQWIKRFILFHGKRHPQEMGSAEIEAFLTHLAVAGKVSASTQNQALSALL
FLYKEILSIDLPWLNEIVRAKQPQRLPTVLTRTEVQAILVRMSGTYGLMA
NLLYGTGMRLMECVRLRVKDVDFERGEILIRDGKGSKDRVTMLPESLAGP
LQAHLLHRRTLFDDDSRLGKASVYLPDALERKYPNAATDWVWQYIFSSGS
FSIDPRSGTERRHHIDEKLLQRAMKKAVQASGITKLATPHTLRHSFATHL
LDSGYDIRTIQELLGHKDVHTTMIYTHVLNKGGRGVRSPLDM
>NE0235 intF, Phage integrase
MLTKVRLTPSRIAAHTCPADASQAFLWDTATPGLAVRATAGKRAFIFQGR
FAGKSIRITIGDTEVWTIEQARQRARELQGLVDQGRDPRLVKQEKIAADV
QARITDEPALPAWRDYIAARSGKWSEAHAADHLKMARDGGEPVTRGRRIG
APAYTEKGILRPLLDLPLKGITREKVAQWLDNEATRRPAQARLALSLLGT
FLSWCGNQPAYRNQVNSDACAKLKRELPKPTARTDCLQREQLASWFAAVR
SIDNPVMSAYLQSLLLTGARREELAGLGWEDVDFQWQTIHLADKVEHSGR
TIPLTPYVSQLLQSLPKINEFVFASKRAKSGRLQEPRKAHNQAIEAAGLP
PLSIHGLRRSFATLSEWVEAPSGITAQIMGHKPSAIAERHYKRRPVDLLR
VWHTKIEEWILSNANI
>NE2189 intINeu, Integron integrase; Phage integrase; Phage integrase N-terminal SAM-like domain
MGNTNTPPKLLDQVRDRIRIKHYSLRTETQYVQWIKRFILFHGKRHPQEM
GAAEVEAFLTHLAVVGKVSASTQNQALSALLFLYKEVLSIDLPWLDKVVR
AKQPQRLPVVLTRTEVQAILVRMSGTYGLMANLLYGTGMRLMECVRLRVK
DVDFERGEILIRDGKGAKDRVTILPESLVSPLQTYLLQRRVLFDDDIRLG
KASVYLPDALERKYPNAATDWIWQYIFPSGSFSIDPRSSVERRHHIDEKL
LQRAMKKAVQTSGITKLATPHTLRHSFATHLLDSGYDIRTIQELLGHKDV
HTTMIYTHVLNKGGRGVRSPLDM
>NE1753 lig, NAD-dependent DNA ligase
MISENTIEERLQALRAAIALHDFHYYVQDAPVIPDAEYDALFRTLQQLEQ
QYPHLVTPDSPTQRVGAPPLKVFAQLTHQTPMLSLANAFSEEEVTAFDRR
IREALNIDRVDYAVEPKFDGLAISLIYANGILTKGATRGDGYTGEDITLN
LRTIPSIPLRLQVPFPTGQFEVRGEVVMLKTDFERLNEQQRKNGEKTFVN
PRNAAAGSLRQLDSRITAMRRLTFFAYGIGAYHEDQPIFSTHSEILAYLA
TQQFLVARQSSTVMGANGLLAYYREMNAVRLSLPYEIDGVVYKVNDLAQQ
EKLGYVSRAPRFAIAHKFPAQEVSTELLAIEIQVGRTGALTPVARLAPVF
VGGVTVTNATLHNEDEVQRKQIMIGDTVIVRRAGDVIPEVVAVIVERRPT
HAQAFVMPDHCPVCGSKAVRLPDEAVTRCTGGLYCPAQRKQAILHFASRR
AIDIDGLGEKLVDQLIDRELVHTPADLYRLDIDTLAGLERMAGKSARNLV
TAIEDSKKTTLPRFIYALGIRHVGEATAKALASHTGDLDRLMDMNAEQLQ
QIPDIGPIVAQSIADFFSEAHNREVIEQLLSCGLQWEKPSHIAQPSSRTN
LAVPGKTFVLTGTLPTMTRDQAKNRIEQQGGKVTGSVSSATSYVVAGSDP
GSKYARAIELGIPVLDEDQLLSLLRDTSSSE
>NE0008 mfd, mfd: transcription-repair coupling factor
MSSKLNPLSSESLPRYTGLEGSSDACALARLANRNPAGQLLAVITASALD
AQRLLEEIPFFAPDLRVSLLPDWETLPYDIFSPHQDLISERLATFYQIAH
NACDVLIIPVTTALYRMPPREFLAAHSFFVNQGSTLDLQSFRSQMSLAGY
SHVSQVLSPGEYSIRGGLIDLFPMGSPLPYRIDLFDDEIESIRTFDVDTQ
RSIYPVKEIRLLPAREFPLDDNGRSRFRTGFREKFEGDPTRCRLYQEISK
GNIPAGIEYYLPLFFEQTATLFDYLAQHSTVCLHGEITPAIENFWQDTRS
RYQLMRNDPDRPLLPPMDLFLPEDQFYGYLKSYKRIEMHTGQQVKTDKPF
ARSLPPVRVDRRASNPIEQLTAFVHTFTQKGGRVLLLAESMGRRELMAEY
LREYGLKLKLCEDFAAFQSDTASCMLSVASLHSGFILAAENLALVTENEL
YATHVRGQRTRDARKTVSADSILRDLSEIKPGSPVVHEQHGIGRYLGLVN
MNMGEDDSGQSSEFLALEYQGGDKLYVPVTQLHLISRYSGAAPEAAPLHK
LGSGQWEKAKRKAMQQVRDTAAELLNLYAQRAARKGHIFRFNQHDYNAFA
DGFGFEETPDQATAINAVIQDMVSGKSMDRLICGDVGFGKTEVALRAAFV
AVTDGKQVAVLVPTTLLAEQHYQNFSDRFGLIADQWPVKIAELSRFRSAR
EQAEALQSLAQGTTDIIIGTHKLIQDKVKFKNLGLVIIDEEHRFGVRQKE
QLKKLRAEVDVLTLTATPIPRTLAMSLEGLRDFSVIATAPQRRLAIRTFV
HPYSEGIIREACLRELKRGGQIYFLYNEVSTIQNMYTRLTTLLPEARINI
AHGQMRESELEHVMRDFYQQRFNLLLCTTIIETGIDIPTANTIIIHRADK
FGLAQLHQLRGRVGRSHHQAYAYLLTPPEKAALTTQATRRLEAIQAMEEL
GSGFYLAMHDLEIRGAGAVLGDSQSGEMQEVGFSLYSSLLDAAIKSLKAG
HEPDMQQPLGVSTEIRLHVPALLPESYCGDIHERLILYKRMAGCSDETEL
DEIHQELIDRFGLLPDPARALLDSHRLRIEARQLGITRIDAGPDNIQLQF
VPEPPIEAIKIIQLIQSSKEYSLSGPDRLSVRLQIPDVGERVKKIKKLMT
LLKN
>NE1742 mutL, mutL; DNA mismatch repair protein
MRPIKLLPDGLISQIAAGEVIERPASVLKELLENAIDAGTTDISVNIAQG
GLKLIRVTDNGGGISGEELPLALTRHATSKIASQEDLYRITSLGFRGEGL
ASIASVSNLLLISHQPGGKHAWQIRSEGIRVMQPEPSSHAAGTTVEVRDL
FFNLPARRKFLKTEATEFAHCEEIIRRMALSHAGIAFTLRHNGNLRGHWQ
SAEAAVRIKTVLGEEFTRSAAWIDERSAGIGLQGMLALPAYSRAARDMQY
FFVNGRFVRDKLITHALREAYRDVLHLDRHAAFVLYLDIDPEQVDVNVHP
TKTEIRFREARAIHQFIYHGVSKALSLPRSGTELSQSSSQLMADDIVPPA
EKRVPAAPMLNYPRQTGLPSEMIAQPFNFYQVLSGSESDSTATQNPFRQT
GAGESNEHPALPPLGFALGQLHGVYILAQNWKGLVIVDMHAAHERIVYEQ
LKLQMDEQTLSAQRLLIPVTFHADSLDIATAEENQSLLQQLGFEVTVLTA
TTLAVRAVPAILQDADTEKLVCNVLDEIRNGDPGQLLAARRNELLATMAC
HGAVRANRPLTLIEMNELLRKMEVTERSDQCNHGRPTWFEISLAELDKMF
MRGK
>NE2552 mutM,fpg, Formamidopyrimidine-DNA glycolase
MPELPEVEITRRGIDTHLAGRVITQISIRNPVLRWPISAGLIALLPGQRI
NAIARRAKYLLFACSRGTLIMHLGMSGNLRVLPESTPPQLHDHFDLQVDN
GMMLRFRDPRRFGAILWWDGDIRQHPLLQKLGPEPLSDDFDGQFLYTKTR
GRNASIKEVLMNQHIVVGIGNIYANEALFQAGISPLAAAGSLNTMQCERL
VDAVKATLLRAIKAGGSSLRDFTDCEGSPGYFQQQYWVYGRAGQSCRQCG
ELVSKTRQGQRSTFFCARCQH
>NE1705 mutS, mutS; DNA mismatch repair protein
MNKAEQSSHTPMMQQYLRIKAQHTDKLLFYRMGDFYELFYEDAEKAAKLL
DITLTQRGSSAGEPIKMAGVPFHAADQYLARLVRLGESIAICEQTGDPAT
SKGPVERQVIRILTPGTLTDAGLLEERSNSIVLALALHRGSIGLAWLNLA
AGDMRVLETSSDNLTSELERLHPAEILLPESLDLPATLNNFAGPKRLPDW
QFDYEHAMQQLTRQFGTRDLNAFGCEDLHAAIMAAGALFEYVRLTQQTAT
DGSSGQLPGHLHTLQVERQDAYLRMDAATRRNLEITLTLRGEDAPTLSSL
LDTCSTGMGSRLLRHWLHHPLRNRITLQQRLDTVSDLIGAQPETLYAGIR
QQFKHIADIERITSRIALRTARPRDLSGLRDSLMRLPGIIELIATSAAAA
VHRFIPPMQPDPLLTQLLVRALQPVPGAVIREGGVIADGFDAELDELRGL
QGNCDEFLLQLEARERERTGIPNLKVEYNRVHGFYIEVTRAQGEKIPPDY
RRRQTLKNAERYIIPELQAFEHKTLSAREQALAREKMLYERLLEQLADFI
IPLQEIARSVAELDVLCAFAERAALSGYTKPVFTDDPVLIIEAGRHPVVE
NQVEHYIANDVQLGAITRENRQMLVITGPNMGGKSTYMRQTALTVLLAHC
GSFVPAQIARIGPIDQIFTRIGAADDLAGGRSTFMVEMTEAAGILRNATA
QSLVLVDEIGRGTSTFDGLALAFAIARHLLTQNQSYTLFATHYFELTRLA
EEFPQAVNIHVTAVEHKRRIVFLHRIEEGPASRSYGLHVAALAGVPDRVI
RNAAKILARLEQETLSRSPQQTLFETVEENAKAVPASVHPVLDYLERIHP
DELTPRGALEQLYLIKSMLNQTD
>NE0056 mutY, HhH-GPD
MTPRTAGTIHFPADAPDSFAGRLIRWQLECGRHSLPWQGTRDPYAIWVSE
VMLQQTQVSSVIPYYQRFMASFPDVASLAGVPVGDVLTLWSGLGYYSRAR
NLHRAACVIMEQYSGVFPQDAATLQRLPGIGRSTAAAIAAFAFGERGTIL
DGNVKRILARYFGISGYPGEKSVEERLWQLAESLLPAEESNHQIVVSYTQ
ALMDLGALVCARSRPRCQYCPLQADCIACQNDLTADLPVPKPRKTLPVRE
TVHLILLDQERILLKKRPASGIWGGLWCFPEMSVDQDSIDYCEKNLHVRV
TKLARLPHLQHTFTHFKLIIQPHLLQSIMHQPVCEEKCEENSYLWLTIEQ
AMQQAIPVPVRKLLSMAYPYFQYHIHE
>NE2223 nth, HhH-GPD:Iron-sulfur cluster loop (FCL)
MNTTKRREIFTRFRAANPRPTTELEYQTPFQLLIAVILSAQATDKSVNLA
TRKLFLVADTPEKILQLGETGLSPFIQRIGLFRTKTRNILATCQLLIEQY
NGEVPRTRTELEKLPGVGRKTASVILNTAFGEPTIAVDTHIFRVANRIGI
APGKNVLEVERKLLKVVPDEFRHDAHHWLILHGRYICKARKPLCHQCLIV
DLCEFKEKNLEGTASSLDMKQLT
>NE2253 ntpA, NUDIX hydrolase
MQRYKLPVSVLVVIYTADLQVLLLERADHPGYWQSVTGSQDPGETLLQTA
VREVREETGLNTDDYVLSDWQIQNRYEIFEEWNWRYPPGTTHNTEHVFGL
ELPKTIPAVVSSREHLGYVWLPWREAAEKVFSSSNACAIRMLASKRKSEN
SR
>NE0885 ogt, Methylated-DNA--protein-cysteinemethyltransferase
MNYYTFLESPVDRLLLTSDGEFLTGVYMEIEIQKLLPRMTDDWRQDAAPF
AEAIAQLNAYFAGELIQFDLPMKATGTPFQEAVWQSLSTIPYGETVSYKN
IAERLHLPKAARAVGMANGQNPISIIIPCHRVIGANGKLTGYGGGIHRKQ
WLLAHEDKQTSFA
>NE1468 polA, polA; DNA polymerase I protein
MKTLLLVDGSSYLYRAFHALPDLRNRLNEPTGAIYGVLNMLRRLHKEYRP
DYSACVFDAKGKTFRDDIYPQYKAHRPPMPEDLVCQIGPLYACIRAMGWP
LLIEEGVEADDVIGTLVERAIARQAQCVIATGDKDIAQLVRPGIWLVNTM
NNESLDESGILQKFGVTPAQIIDFLALVGDSVDNIPGVEKVGPKTAVKWL
DQYGTLDDLIAHADEIKGVVGENLRKALDWLKVSRKLLTIKCDVPLAMDW
QDLVAVPPDTARLTELYEHLEFRSWLRELKQPGPEKNEKAESSVMAAIVD
DPSVPEGENDDGRDYQIILTDAQLGDWLAQCESAELVSIDTETTSLNPME
AKLVGLSFCMELGQAAYIPLAHHYPGVPSQLNREQVLQRLKPWLESDEKL
KIGQNLKYDRHVFANHGVMLNGIVHDTLLQSYVLESHLSHDLDSLASRHL
GIQTISYDEVTGKGAKRIGFEQVEIHRAGIYAAEDADIPLRLHRVLYPVI
SQDAHLEYIYQQIEIPLLEVLFRIERNGVLLDTDLLRVQSGELTQQLVAL
EQQAHSLAGHAFNLNSTKQIQEILFGQHKLPVIKKTPKGVPSTDEEVLQR
LASDYPLPKVLLDYRGLAKLKSTYIDKLPQMVNKQTGRVHTHYAQAVAVT
GRLASNDPNLQNIPVRTPEGRRIREAFIAPDGWLIMSADYSQIELRIMAH
ISGDAGLIHAFSEGQDIHRATAAEVFGVPVEQVNPEQRRYAKVINFGLIY
GMSEFGLATQLGIERTAARTFIDRYFARYPGVADYMQRTRELAKQHGYVE
TVLGRRLQLSDIRSNQRNRQMGAERAAINAPMQGTAADIIKLAMISVHRW
LAEAQLQSKLIMQVHDELVLEVLVDELPVIKENLPRLMENVLKLDVPLKV
QTGIGKNWDQAH
>NE1505 priA, probable priA; primosomal protein N' (replication factor Y)
MVIIRVALDVPIDRLFDYLAPDADTADIGRCVRVPFSSRQISGIIISVCE
TSSVPEGKLKYAGQIDRQTPPLPQPLLGLFEFCSRYYHHPIGQVVMNGLP
VLLRKFKHTGKEQPPSWRLTDTGKSITLADLPIRAKAKRQLISLLSEHGI
ITAEICKAMSSHSRKLLHEFKDLGWVEQFTALPEKAVFSTASSPAPTAEQ
AQAISEILDRTGTFTPWLLNGITGSGKTEVYLQVTASLLAQQKQVLILVP
EINLTPQLEAVFRKRFPGTTLVSLHSGLNNSERLQGWLQAQRGKAGIVLG
TRLAIFTPMPELSLIIVDEEQDHSFKQQDGLRYSARDLAIYRARQANIPV
ILGSATPSLESYHQARTGRYRLLQLHSRAISQAALPTIRCIDLRVIPAQE
GLSEPVLDALRHCLARKQQSLVFINRRGYSPVLLCKSCRWIATCKRCSSR
LVVHLRDRQLRCHYCGDQQPVSPACPQCGDPDVLPFGHGTQRVEAALIRH
FPEARILRVDRDSIRHKGAWQQMLDRIHRGEADILVGTQLLAKGHDFPNL
ALVCALNADASLYSTDFRAEEHLFAQLIQVAGRAGRANVPGSVLIQTEFP
QHPLYQALIRQDYAAYAQAHLKERRSAGFPPFVYLAVLRAEAPVLTDALE
FLRQAAALAAVTENYPHIQLFDPVPAHMTRLKGLERAQLLIQARSRRHLQ
TFLGDWHQRITALPVHSRIRWHLDVDPLTL
>NE1464 radC, DNA repair protein radC family
MAISDWPEAERPREKLIEKGAAALSDAELLAIFLRTGITGVSAVELARKL
LTHFGSLTKLCAASLHEFSELPGMGPAKFAQLQAVMEMAKRALAEELKNG
DIMDSPQSVRNYLCLSLKGKPYEVFVGIFLDARHRTIVTEELFNGTLTQA
SVYPREVVKRALYHNAAAMIFAHNHPSGIAEPSTADEILTQSLKQALALV
DVKVLDHFVIGSSEVVSFAERGLI
>NE0507 rdgC, putative recombination associated protein rdgC
MWFRNLLIYRLAGEVITSDELEAYLAKQTLQGCLGLEPQSRGWVPPGIAE
ADLVYSYGQQMLIALGTEKKLLPASVVNQLAKVRAQEMESHQGYAPGRKQ
MKEIKEAAYRELLSRAFAIRQRSHAWIDPVGGWFIVEGASASKADALIEA
FIKSTGIGLKRIRTTMAPTSAMTAWLSGDDPPAIFSVDSDSIFRSREDKK
VSVSYIRQSPDPQEITRHVRTGKEVIRLAMTWRDKISFILDENLQLKRLT
LLDIDREPAETAEEQFDSNFFLMTEELRQLLPDLVEILGGMTAD
>NE1932 recA, RecA bacterial DNA recombination protein:AAA ATPase superfamily
MDENKNKALSAALAQIEKQYGKGSIMRLGDSDVAKDIQVVSTGSLGLDIA
LGVGGLPRGRIIEIYGPESSGKTTLTLQAIAEMQKLGGTAAFIDAEHALD
PQYAQKIGVNVQELLISQPDNGEQALEITDMLVRSGSVDVVVVDSVAALT
PRAEIEGEMGEPQMGLQARLMSQALRKLTANIKRTNTMVIFINQIRMKIG
VIFGNPETTTGGNALKFYASVRLDIRRTGSIKRGEEMVGNETRVKIVKNK
VAPPFKQADFDILYGEGISRESEIIELGVLHKLIEKAGAWYSYNGEKIGQ
GKDNVRDYLKEHKSIAHEIEQKIRAAVGLAETDSRVVPPSSGE
>NE1850 recG, RecG-like helicases
MAAHFFDSLDEALRKKLEKLGLFSDFDLVLHLPLRYEDETRLSPISQAVP
GSTVQVEGVVAEQEVLVRPRRQLVCRVDDDSGTLYLRFFNFYASQVTAWS
PGTRLRVLGEVRAGFHGVEMVHPKCRVVRGSMVLANTLTPVYPGMAGLPQ
RTLARLIMQAFERLRAKRLLQETLPATILSACQFPAFEDSLSILHCPPAG
VSITSLQQRSHPAWFRIKFDELLAQQLSMRCHYHQRRSQQAPVLQQQTGL
QQALLEVLPFGLTDAQCKVVTEISKDLAQPYPMQRLLQGDVGSGKTIVAA
LAALQSIGNGYQVAVMAPTEILAEQHFRKLSDWLTPLGVGVGWLSGSQKK
SLRNQELERTATGEAMLVIGTHALFREAVQFKCLGLVIIDEQHRFGVGQR
LALRMKGGDEEVIPHQLMMSATPIPRTLSMSYFADLDVSVIDQLPPGRSP
VVTRLIDSSRREEIVARIREACLAGRQAYWVCPLIEESEALQLKTAVETY
ETLSQTFPDLRIALIHGRLDSDEKSVIMAEFSQGEVQLLVATTVIEVGVD
VPNASLMVIEHAERMGLSQLHQLRGRIGRGSATGVCVLMYQQPLSEVARK
RLQIIFEHRDGFEIARQDLLLRGPGEFLGTRQSGVPLLRFANLEEDIDLL
EMARNAAENMLRDHPLAAQCHMQRWLGRKEDYLRA
>NE0010 recJ, recJ: single-stranded-DNA-specific exonuclease
MANITIREFPAHAYEILSAHGFPSVLARIFAARGINHPEQLETTFARMAS
FEQLKNIQRIAVLLADAIAAKKRLLVIADYDTDGATACAVALRALRQFGA
MVEYLVPNRFEYGYGLTPEIVRLAADQVPPPDILITVDNGIASVEGVEEA
NRLGMQVFITDHHLPGDRLPDAAVIVNPNQPGCSFPDKHIAGVGVIFYVM
LALRAELRERSAFTATGKEPNLASLLDLVALGTVADVVRLEGTNRILVQQ
GLQRIRNGYCCAGIHALFKAAGRDFSRVTTYELGFILAPRLNAAGRLDDM
SLGIECLLTEDESHALRLASELDELNRQRREIESGMRDEAMDKLDDVIDL
LNQSDTPADNGKQSVYSLCLYDPAWHQGVIGLIASRVKDRLHRPVIIFAQ
GNEGEIKGSGRSIPGLHLRDALDLVAKRYPGLIVKFGGHAMAAGLTVYEQ
HFEQFRTAFEQVAQSLLTPADLIQVIETDGELAETDLTLELAQYLTNQVW
GQGFPEPSFNGCFRVENQRIVGEKHLKLKLRKTGAAQVYDGILFFHTERL
PTEIDAVYRVQINEYNGSTRMQLLLEHWFESGQAHYG
>NE1479 recN, ABC transporter:DNA repair protein RecN
MLQNLSIRNFIIVDHIDLHFKSGFTVLTGETGAGKSILIDALELVLGRRA
DTSQIRYGCKRAEITAQFSVNTIPALQEWLVENALEDETGICLLRKIMES
GGRSRNFINGHPATLQQLRTVGEWLVDIHGQHAHQLLMHGHKQCELLDAW
AGESNLAREVASAYRHWQDLCQQRLAWEQHSEQNLQEHETLTWQLQELAA
LNFSLEEWENLQIEHNRLTHTASLLETAQFSLESLSENETAVLAQLSTVL
TRLNSLIDIDNTLEPLCNQLQSAQIQLQEIVYELKRYQQHLDIDPRRLQE
TETRIAAIHGTARKYRIMPEILPDLLETTRQRLESLENAASSEALMKAEK
SARNNFENLAARLSQARQHAADQLSGLVTETMQTLAMAGGRFNVALIPIP
SGNLHGMEQIEFQVSAHRDLPLRPLNKVASGGELSRISLAIQVITSKAGT
VPTLIFDEVDTGIGGRIAAIVGKLLQQLGKTRQVMSITHLPQVAARGDHH
WRVSKTSETEDEQLPASHISELDAAERTEEIARMLGGENLTAATRQHAAE
MLGYDKQNQST
>NE2564 recQ, ATP-dependent DNA helicase RecQ
MISHAQTLLREIFGYSEFRGQQAEIITHVVNGDSCLVLMPTGGGKSLCYQ
IPALLRKGTAIVISPLIALMENQVAVLCRQGVRAVYLNSALTPEAAAAVE
RRMLAGEYDLVYVAPERLLTVRFRALLQRIPIALFAIDEAHCVSQWGHDF
RPEYGKLSILPEKFPQIPRIALTASADARTRADILRCLDLHQARSFISSF
DRPNLCYRITARSNSRIQLLNFIRSQHAGEAGIVYCQSRRKVEETAAWLN
SNHIPALAYHAGMETSIRTRHQKKFLQGHGIVMVATSAFGLGIDKSDIRF
VAHLDLPKSIESYYQETGRAGRDGLPASAWMVYGPGDIIRLRSQTESGTE
RLPAPIRQAAAARLDALLVLCETTVCRRKPLLDYFGEPTGSLPCGNCDAC
LETIPVQDVTIAAQKALSCVYRTGQCFGMEYLIDILSGKRTDRVRQWGHD
CISTFGIGHELSTEGWRIVFRHLLALDYLVAGEDRAGGERIALQLTSAAR
SVLRGETRIKLRLSHHHHSAPYQQISTGLSVPSSRCQAFSCEPQTKCGG
>NE2040 rhlE, rhlE; ATP-dependent RNA helicase RhlE
MSNDVTFAQLGLSSEILHAVNDEGYVNPTPIQAQVIPSILAGKDVMASAQ
TGTGKTAGFTLPLLYRLQAYANTSVSPARHPVRALIMAPTRELAMQIDES
VRKYGKYLALRTAVVFGGINIEPQIAALQAGVEILVATPGRLLDLVEQKA
VNFSKTEILVLDEADRMLDMGFLPDIKRVMALLSPQRQSLMFSATFSGEI
RKLADSLLKQPVRIEAAVQNTVNESISHVIHWVKPDSKFALLLHLIRQQN
LKQALIFVKTKHGASHLAQMLSRHEISAVAIHGDRNQQQRTQALAEFKHG
DVQILVATDVAARGIDIEKLSHVINYELPGNPEDYVHRIGRTGRAGSKGK
AISLVSEHEKELLANIEKLLNAKLETEQIAGFDAEQFARSLPDRKNRMSA
GNSRYGNKPMENGSEKSRSEKHRKLPSSQKYSGSRRGGTQKYSDPIFTQP
YVPQANSTQSTTPKQPEIQSLFLTYRQEKKTIPALFTALSKSKAGQEN
>NE0140 rnhA, probable ribonuclease hi protein
MQLEEGVKLVEIFTDGACKGNPGIGGWGVCLKFDGEVREFFGGEPVTTNN
RMELLAAIRALQALESLPDTGQSLRVQLHTDSQYVQKGISEWVHSWKKRG
WLTADKKPVKNEALWKELDQLSRRYQVEWFWVRGHNGHDGNERADMLANR
GVVSVLSEKAD
>NE1707 rnhB, Ribonuclease HII and HIII
MAERRIPLKHEYAQDGKVIYGVDEAGRGPLAGPVYAACVVLDPADVIEGL
ADSKQLSEKKRISLADQIKQRARAWAIASASVEEIDRLNILQASLLAMQR
AVVSLRPISNALVLVDGNHAPRLDCEVQTVIRGDSLVAEISAASILAKTA
RDIEMLRLHEAYPVYGFDRHKGYPTKAHLEAIRLHGITDIHRRSFAPCVG
QSVSGARTTSFINQKEA
>NE0212 ruvA, probable Holliday junction DNA helicase subunit
MIGRIAGLLLEKHPPLVLVDVNGIGYEIDVPMSTFCRLPGIGEQVTLHTH
FWVREDAHLLFGFMTEPERVLFRQLTKISGIGARTGLAILSGLSVNDLHQ
IVVSQDSTRLTRIPGIGKKTAERLLLELRDKISPAITLPETGTAMASSTD
KDILNALSALGYNDREANWAVGQLSEGVTVSDGIMQSLRLLSKAK
>NE0213 ruvB, ruvB; holliday junction DNA helicase protein
MIESDRIITASPFSSQEEVIERALRPVQLDDYVGQEKIREQLKIFIEAAR
LRQEALDHVLLFGPPGLGKTTLAHIIAREMGVNLRQTSGPVLERAGDLAA
LLTNLETNDVLFIDEIHRLSPVVEEILYPAMEDYQLDIMIGEGAAARSVK
IDLPSFTLVGATTRAGMLTNPLRDRFGIVSRLEFYTADELGKIVTRSAGL
LNVDVTADGAREIACRSRGTPRIANRLLRRVRDFAEVRANGRIDRPVADA
ALQMLDVDATGLDVLDRKLLLAVLEKFGGGPVGVDNLAAAINEERDTIEE
VLEPYLIQQGFLQRTPRGRMATTMAYQHFDIIPSHQTTVPSLFDPD
>NE0211 ruvC, Crossover junction endodeoxyribonuclease RuvC
MTSLVYAAKGIRILGIDPGLRITGFGIVEKIGNRLVYIGSGCVVTGESGL
PDRLKTILDGLNEIILQHKPEQVAVEQVFVNINPKSTLLLGQARGAAISA
AVLHELSVYEYTALQVKQAVVGNGHARKEQVQEMVMRLLGLGERPRPDAA
DALACAICHAHGGTGLLTLSARNRSKRSKRL
>NE0671 sbcB, exodeoxyribonuclease I
MQTGNSTLYWHDYETSGATPRWDRPFQFAGLRTDEALNEIGDPLVIYCQP
ARDRLPHPEACLLTGITPQMAEARGLPEPEFIALIHAQLAQPGTCGVGYN
TLRFDDEVTRFTLYRNFYDPYAREWQSGNSRWDVIDLARMTFALRPEGIN
WPINGEGKPSFRLEDITTANGLVHDSAHDALSDVRATIALARLLRAQQPR
LYDWLFRLRDKRAAGNLLDMKTHAPVLHTSRMYSSEYGCTTLVMPLLPET
GNANSVLVYDLRHDPAEFVLLDIDALAERLFTPKEELAEGLQRLPVKAVR
LNKCPALAPQKVLNDEVANRIGLNVEQCQQHWQLLLQHPDFMQRIKQAYS
GNKVFAENDADLALYDGFASDHDRNLFPLVRDAEPGKLADLAGKFQDERY
IELLFRYRARNFPDTLSVQEEHHWQMHCRRQLGENAINGSLTLNEYHQKL
LQLRTDCPQQAQLDILNELEAWGRVLAQENDLPWPPDHSGSEEQTD
>NE1392 sbcC, ATP/GTP-binding site motif A (P-loop):ABC transporter
MQILQVRLKNLNSLVGEWEIDFTDPAFVSDGIFSITGPTGAGKTTVLDAI
CLALYGRTPRLGKVSKSENEIMSRQTGECFAEVTFAAQSGCFRCHWSQHR
AHKKPDGELQNPRHELAEADSGKILETRLTEVGRQIEKITGMDFARFTRS
MLLAQGEFAAFLQAAPDERAPILEQITGTEIYSRISIRVHETRVSARREL
DRLSAGLNGIQPLTRADEQQFHTDLAQKIQQDAGLNEQIAHERQILVWLE
NIARLESELHLIAGQQQAWLSRKEALTPDISKLDSASRALELLGEYSRLT
SIRNEQETDRNNLAICAASLAGLEQAVKQAEQSLKSLNEQCDRQRAKQRE
TIPLLRKVRELDIQIREKESPISTASKGITAQKKTLAALRNQYQQNEIQL
AGLQTTLAGLLQQLHVIQPDGAQMDFAHNQNLLNRKQAEYRQLLENRSLA
DWRQEIAVLSGQKTLATRAIEAMQSLAASKQISAELEKHTCSLLAGKTQL
AKQLGAEEEMLGALEREISLLETQALLLKKISHLEEARQQLKDDEPCPLC
GALQHPYAAGNTPRPDDNITALNQARTMLKTRIDTISTLKIRQAETNRDI
EQTACRQQEIHRQIQADETLLQQCAVSLFPGLPSAAMFPELPRLLQETDD
KLARMTRILQTAEILENEISVQRESLDKTRELEQKIGILRVQHQHQSTQI
RQHEAELQLRQEQLDQLQQELGNLRTTRLQLFADKQPDQEEQSLTTAIEA
AQKSADNARQQLETEIQQYNRLKNRAEDLVKTITTRAVQLEKLQETFAAR
LTQSGFADEAGFTAACLPEEERRRLAQRAQQLADEKTMLDTRQKDKTIAL
QAEQLKSMTDQPRDFHDQVLAQLITRQQVLQQEIGGLRQKLADNENSKQK
QQEQLQVIEAQKRECARWDLLHGLIGSADGKKYRNFAQGLTFEVMIRHAN
RQLQKLSDRYLLIRDPVRPLELNVIDNYQAGEIRSTKNLSGGESFLISLS
LALGLSRMVSRNIRVDSLFLDEGFGTLDEEALDTALETLAHLQQEGKLIG
IISHVTVLQERISTRIQVIPRSGGRSVLAGPGCRHCQ
>NE1390 sbcD, Serine/threonine specific protein phosphatase:Exonuclease SbcD
MKILHTSDWHIGKTLYGHKRYDEFEAFFSWLVETIEQEQVDVLLIAGDIF
DTSTPGNRSQQLYYRFLHRVAASACRHVVIIAGNHDSPSFLSAPRELLRA
LDVHVTGSLSGNPADEILVLHDPKGDAELIVCAVPHLRDRDIRTAEAGES
MEDKSRKLVEGIRDHYAEVINLARLQRTALSSSIPIIAMGHLFVAGGQTV
EGDGVRELYVGSLAHVPAGIFPPDIDYLALGHLHVPQRVNGSSVMRYSGS
PLPIGFGEADQEKSVCLIEFNRQISATRPAVSLINIPVFQPLERIRGNWQ
VISDRISMLSAANSCAWLEINYEGDEMITDLQERLQSAIEGSRLEILRIR
NNRIMNQILDQIDDGGTLEELSVNEVFEHCLSAAAIPVEQRTELWRTYQE
TLVSLDEEDIRAE
>NE1968 smf, SMF family
MQIDQDIESWLRLGLTEGVGGGALRRLLIAFGDPARVLAASRPALEGVVK
KPVATSIFLRKVDEERLARTIKWLEDPLNSLITLADSDYPKLLLNISDPP
PILYFKGQRQFLAQPAMAMVGSRNATPQGLANADAFAEAASNAGFCIISG
LAQGIDTAAHQGGLRGASSSMAIVGTGLDLVYPSRNHELAHKLANEGGLI
SEFPLGTPAISRNFPRRNRIISGMCHACLVVEATLYSGSLITARLALEQG
REVMAIPGSIHSPLSKGCHALIKQGAKLVENIQDILDELHYQPQPVPRFE
SVADEGGGTGVLTGEGDDTGLLMYFSYDSTDIDTLCARSGLTVETVSAML
LGLELEGRIGSLPGGKYQRIR
>NE2453 ssb, Single-strand binding protein family
MASLNKVMLIGNLGRDPEIRYMPSGDAMANLNIATTDTWKDKGGEKQERT
EWHRVVMFGKQAEIAGEYLKKGSQIYIEGRLQTRKWTDKSNVERYTTEIV
ADRMQMLGGRSGGGSYDPPADRDHDYQSQSTPPAKSNTGFDDMEDDIPF
>NE0338 sss, Phage integrase:Phage integrase N-terminal SAM-like domain
MNQPHDERDTPLPPLLSEYLAYLASTRSLSLLTQHSYRRDLVALVCCIAA
QHQSEHENGHEVTDASLTRLHSHDIRHFIAHLHHGGLSGRSLARMLSAWR
GFYRYLMRHHHHTENPCQDIRVPKSPRKLPHALSPDEAAQLLAFDPADAL
ATRDLAMFELFYSSGLRLAELTRLQPTDIDFSEGIVRVTGKGSKTRIVPV
GEPALRALQAWLPLRSAWLTSGETALFLSRHGQRIHPRTIAVRLHQRARL
QNLDDRVHPHALRHSFASHLLQSSGDLRAVQEMLGHSSIRSTQVYTHLDF
QHLAKIYDQAHPRAKKRPKTG
>NE0154 tISRso8a, Transposase IS911 HTH and LZ region
MERLPKGIYTPEFRAEAVKLVEAEGLSVDAAAKRLLVPKSSLGNWVRASR
TGSLAKVGQGQRVPTETEIELARLRKELAEVKLERDLLKKCAAYFAKESR
>NE0835 tnpA, Transposase Tn3 family
MPRRSILSAAERESLLALPDTKDDLIRHYTFSDTDLAIIRQRRGPANRLG
FAVQLCYLRFPGIILGVDQPPFPPLLKLVANQLKVGIESWDDYGQREQTR
REHLVELQNAFGFQPFTMSHYRQAVHTLTERAMQTDKGIVLADALIEHLR
RQSIILPALNAIERASSEAITRANRRIYEALSEPLSNGHRHGLDDLLKRR
DNSKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPPGIERLVHQNR
LLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRI
LGKLFNAAKNKHQQQFQASGKAINAKVRLYGRIGQVLIDAKQSGGDPFAA
IEAVMSWDAFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLDVL
KLRAAPAAKDVLDAIEVLRGMNTDNARKVPADAPTDFIKPRWQKLVMTDA
GIDRRYYELCALSELKNSLRSGDIWVQGSRQFKDFEDYLVPPAKFASLKQ
SSELPLAVATDCDQYLDDRLTLLEAQLATVNRMAAANDLPDAIITESGLK
IMPLDAAVPETAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKS
GDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDET
YSTALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKY
GSSPGRTFYTYISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEH
YTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLFVPKGEASYDALKPMI
SSDKLNIKAIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVA
LRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGE
IRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAHALRGNGHAVDDALL
QYLSPLGWEHINLTGDYLWRSSAKIGEGKFRPLRPLQPA
>NE0751 tnpR, Site-specific recombinase
MPGKRIGYVRVSSFDQNPERQLEGIQVDRVFTDKASGKDIQRPQLDMLLD
FVREDDTVVVHSMDRLARNLDDLRRLVQDLTGRGIRVEFVKEGLIFTGED
SPMANLMLSVMGSFAEFERALIRERQREGITLAKQRGAYRGRKKSLNSEQ
VAELKRRVVAGEQKALIARSFGISRETLYQYLKTVD
>NE0836 tnpR, Site-specific recombinase
MQGQRIGYVRVSSFDQNPERQLEHVEVGRVFTDKASGKDTQRPELDSLLA
FVREGDTVVVHSMDRLARNLDDLRRLVQKLTKRGVRIEFVKESLTFTGED
SPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGVYRGRKKALSPEQ
VAELRQRAAAGEQKAKLAREFGVSRETLYQYLRLDQ
>NE1966 topB, topB; DNA topoisomerase III protein
MSKKLIIAEKPSVASDIARALGGFVKQKDYFESDEFVVSSAIGHLLELIV
PEEYEVKRGKWSFDHLPVIPPRFDLAPIEKTTDRLKLLSKLIKRKDVDML
INACDAGREGELIFRYIVRHVGSKKPIKRLWLQSMTPSAIREAFANLLND
AEVQSLADAAVSRSEADWLVGINGTRVMTAFNSQEGGFHKTTVGRVQTPT
LAILVEREEAIKKFVVRDYWEVHATFQAESGVYKGKWFDEGFSKRKDESE
SRADRIWDHAKAEVIRDKCAGRTGVVTEESKPSRENCPLLYDLTSLQRDA
NSRFGFSAKVTLGLAQALYEKHKVLTYPRTDSRALPEDYPAIVKDTLQVL
KGSRYDRFASQILESDWVKPNKRIFNNAKVSDHFAIIPTALVPKKLNEAE
EKLYDLVTKRFLAIFYPAAEFLITTRITRVENEPFKTEGKVLVHAGWQTV
YGKVESAQGQEEESVLVAVTPGETVLAQEVAVVAGKTRPPARYNESTLLS
AMEGAGKLVEDEELRAAMSAKGLGTPATRAAIIEGLIHENYVERSGRELQ
PTAKAFSLVTLLRGLKIPELISPELTGDWEFKLRQIEQGQLKRDVFMEKI
AAMTRHIVEQAKNHRDKTISGDFATLQVPCPGCGGVIKETYKKFQCQQCD
FALWKILAGRQFEAAEMETLISTREIGPLSGFRSKMGRAFNAIVRLTDDY
EMKFDFGNEADQAQEKVDFSAQQPLGKCPQCGHSVYEHKLLYVCEKSVGA
GAPCSFRTGKIILNRAIEAEQVVKLLQTGRTDLLAGFVSRKGRPFSAYLV
VGPAGKIGFEFEQKKTKSKPADTVPETGKAAS
>NE2455 uvrA1, ABC transporter:Excinuclease ABC A subunit
MELIRIRGARTHNLKNIDLDLPRNQLIVITGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLQLMEKPDVDLIEGLSPAIAIEQKATSHNPRSTVGT
VTEIHDYLRLLFARVGEPHCPEHGIGLAAQSVSQMVDQVLQLPTDTRLMI
LAPVVTGRKGEQAELFDELRAQGFVRVRLDGEVYDIDALPKLQKTKKHTI
EVVVDRLKISPEVKQRLAESFETALRHAEGRALAVEMDSGKEYLFSARFS
CPVCSYALSELEPRLFSFNNPAGACPKCDGLGQITFFDPARVVAFPYLSL
AAGAIRSWDKRNQFYFQMLQAVANHYHFDLEIPFEQLSKEVQQAVLYGSG
KEKITFTYLNEQGRAHQQVHPFEGIIPGLERRYRETESQTVREELAKFIN
ARECPECGGTRLCREARHVTVNGETIFAISAWPLRQAKQFFDDMELTGHK
QSIAERIIREISSRLQFLNNVGLDYLSLDRSADTLSGGEAQRIRLASQIG
SGLTGVMYVLDEPSIGLHQRDNERLLDTLRHLRDLGNSVIVVEHDQDAIL
LADHVVDMGIGAGEHGGCVVAEGTPTAIQANSASLTGQYLSGKRSIAIPS
TRTPPNPERMLTIRGAAGNNLKQVQLNLPVGLLICVTGVSGSGKSTLIND
TLYRVVARHLYGSHTDPAAYQEIDGLGFFDKVIDINQSPIGRTPRSNPAT
YTGLFTPVRELFAGVPQARERGYSPGRFSFNVKGGRCEACQGDGVIKVEM
HFLPDIYVACDVCHGQRYNRETLEIQYKGKNIHEILQMTVENAHAFFEAV
PTIARKLQTLLDVGLGYITLGQSATTLSGGEAQRVKLSLELSKRDTGRTL
YILDEPTTGLHFQDIDLLLKVLHRLRDNGNTVVIIEHNLDVIKTADWIID
LGPEGGAGGGRIIAEGTPETVASIPGSFTGYFLQPLLSTTLTG
>NE0785 uvrB, Helicase subunit of the DNA excision repair complex
MIITFPGSPYKLNQAFQPAGDQPEAIRILVEGIESGLSFQTLLGVTGSGK
TFTIANMIARLGRPAIIMAPNKTLAAQLYAEMREFFPENAVEYFVSYYDY
YQPEAYVPSRDLFIEKDSSINEHIEQMRLSATKSLLEREDAIIVATVSCI
YGIGDPVDYHGMILHVREHEKISQRDIIQRLTGMQYQRNEFEFARGTFRV
RGDVLDVFPAENSETALRISLFDDEVESMTLFDPLTGQTRQKVSRYTVYP
SSHYVTPRSTTLRAIETIKTELTGRLNYFHENHKLVEAQRLEQRTRFDLE
MLNELGFCKGIENYSRHLSGRQPGDPPPTLIDYLPDNALMIIDESHVTVP
QIGGMYKGDRSRKENLVAYGFRLPSALDNRPLRFEEFEKLMPQTIFVSAT
PADYEIQRSGQIAEQVVRPTGLVDPVIIIRPVTTQVDDLMSEVSLRAAQN
ERVLVTTLTKRMAEDLTDYFSDHGIRVRYLHSDIDTVERVEIIRDLRLGK
FDVLVGINLLREGLDIPEVSLVGILDADKEGFLRSERSLIQTMGRAARHV
NGTVILYADKITNSMRRAIDETERRRNKQKLFNQQNNITPRGVNKRIKDL
IDGVYDSENAAEHRKVAQIQARYAAMDEAQLAKEIQRLEKSMLEAARNME
FEQAAQYRDEIKNLRSKLFIGIIDPDEIREVPQTAGKKSRRKAGR
>NE0933 uvrC, uvrC Nuclease subunit of the excinuclease complex
MPDAHFDGKAFVLTLPAQPGVYRMLNAAGDVIYVGKAIDLRKRVSSYFQK
SGLSPRIQLMVSQIAGIETTVTRSEAEALLLENNLIKSLAPRYNILFRDD
KSYPYLLLTRHIFPRLAFYRGALDDRHQYFGPFPNAGVVKSSIQLLQKVF
RLRTCENSVFDHRTRPCLLYQIKRCSGPCVGLITPEAYQQDVKSAAMFLQ
GKQDEVLKTIEQKMFTASDQQDYEQAAQLRDQMQALRKIQEKQFVDSGKA
LDADVIACAIEPDSHAVAVNLVMIRSGRHLGDKTFFPQNVYEADISTVLE
AFVTQHYLNRSVPPLIILGQKIRVTLLQKLLSDQAGHKITLTTNPIGERR
KWLDMAAENAQLALQQMLIQQASQEDRLQALQEALNLPGLARIECFDISH
TMGEATIASCVVYDRFAMRNGEYRRYNITGIVPGDDYAAMRDVLQRRYAK
LAMEEGKLPDLILIDGGKGQIRVASEVMIELGLNDIPLVGVAKGETRKPG
LEQLILPWQEEALHLPDDHPALHLIQQIRDEAHRFAIQGHRAKRAKTRKI
STLEQISGIGTKRRQSLLTRFGGLKGVKNASIEELQQTEGISRSLAEKIY
RELR
>NE1473 uvrD, UvrD/REP helicase
MTALLTDLNPEQLEAVTWSHQSALVLAGAGSGKTRVLTTRIAYLLQSGRT
RPQNILAVTFTNKAAREMVARIGAMLPVNTRAMWVGTFHGLCHRVLRAHH
EDAGLPQAFQILDMADQLAVIKRVLKERSLDEKMLPPRQLQWFINNAKEE
GLRASQVDVHGGFNQTLAECYQAYEIVSMREGTVDFAELLLRCYELLSRN
EILRDHYRSRFEHILVDEFQDTNRLQYKWIKLLAGPGSQQHAAIFAVGDD
DQSIYAFRGAHVGNMRDLEKDFSVPKIIRLEQNYRSHGNILDAANALIEH
NKGRLGKNLWTAAGKGEPVRVYHAATDMDETSFIIDEIKALHADGLALSD
IALLYRSNAQSRVLEHGLFNASVSYRVYGGMRFFDRQEVKHALAYLRLIA
LPDDDNALLRIINFPPRGIGARTLEQLQDQAAMLGTSLWQAAFKVYEGGK
AVATRNSQPGRGIAGFVSLVLSMQQDGEGLPLPEIIRRVIDQSGLAAHYQ
AEREGGERLENLKELINAATSFVHESEDDSLTAFLAHASLEGGEHQAEGY
QDAVQLMTVHAAKGLEFHSVFISGLEEGLFPHENSRNEPDGLEEERRLMY
VAMTRARQRLYLSYAESRMLHGQVRVNIPSRFIDEIPQDLLKRLRSDFSG
RSFRQGVSGTGQTVASTINSSQKGRSTMAAAVGMTSSGLNSAGFHVGQQV
SHAKFGTGIILNYEGSGTDMRIQVNFHQAGTKWLSLAYAKLEPL
>NE1458 xerD, Phage integrase:Phage integrase N-terminal SAM-like domain
MRQSPDFFRDMLRMNITDTNIRMLDEFTDALWLEDGLSRNTLASYRADLM
QLVEWLGRQPRTNGSLSDVTQADLLAFLSDRIGQGVKASTTCRALTCIKR
FYRYLLRQGKILADPATNIDSPKISRHLPVSLTETEVEALLAAPDTRQPL
GLRDRAMLEILYAAGLRVSELVGLSISQIRQDMGVVRILGKGSKERLIPL
GEEALHWLSLYLQEARPVLLAGKHSNMSFVTTRGDAMTRQAFWYLIKRHA
RQAGIVKLLSPHTLRHAFATHLLNHGADLRVVQLLLGHSDISTTQIYTHV
ARERLKQLHARHHPRGTL
>NE1172 xseA, xseA; exodeoxyribonuclease vII large subunit protein
MTDHNLLPEPKKILWRVSELNRNARVILEQTFPLLWVSGEISNLKRYPSG
HWYFSLKDDSAQVRCVMFRHKNLYLDWIPQEGMQVEAQALVTLYEARGEF
QLTVEQLRRAGLGALFEAFERLKARLQQEGLFSPEYKQPLPRFPRQIGII
TSPNTAALRDVLTTLQLRLPSIPVVIYPAPVQGEGSAAAITTALHTAAVR
GECDVLILCRGGGSIEDLWAFNEEIVARAIAACPIPIVTGIGHETDFTIA
DFVADARAPTPTGAAQLASPDRQAILHRLQYWLHRLQQTMERHIERRMQA
TDLLAHRLIHPGERIRHQQMHLLQLRGRLQNAWNRQVEIRTWRIEETGRR
IHSAKPDIQAGIRHQQELAARLQRAMAHRLENLQFKLRQQQQHLIHLDPK
AVLARGYSIAYTARGDILHDSRQTRAGDNVRLVFASGWAKADITETGE
>NE1159 xseB, Exonuclease VII, small subunit
MRKKSSSNKEETALHPPPENFETATAELEQIVAGMETGQMSLEDALSAYK
RGVELLQYCQNILKNSQQQIKILEADMLKHFSPAEHDAS
>NE0023 xthA1, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth
MKIATWNVNSLKVRLQQVIDWLNLNQPDILCLQETKLQDEFFPMDAIAQA
GYRSIYIGQKTYNGVALLSKETGEDICTALPGFDDMQKRLIAATYGDLRV
ICAYVPNGEHVDSEKYIYKLEWLSQLNRFLQQQRACYGKVALLGDFNIAP
EDRDVYDPEAWRGQVLCSEPERQAFRGLLDTGFVDSFHLFEQPEKTYTWW
DYRMMAFRRNRGLRIDHILLSHEMADRCTIWQVDKLPRKLERPSDHAPVL
VELA
>NE2192 xthA2, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth
MRIITLNVNGLRSAAGKGLFDWLPRQEADVICVQELKAQQGDINGVMRAP
DGYSGYFHCAEKKGYSGVGLYTRYSPDQIIEGTGIPEIDMEGRFLRVDFG
NLTVISIYLPSGSSGEHRQAAKFFFMEHFLPLLQSLAECGREVLLCGDWN
IAHKAIDLKNWRSNQKNSGFLPEERAWLSTVFDELKLVDVFRKINPEPDQ
YTWWSNRGQAWAKNVGWRIDYQIATPGLAAMATGVSIYKAERFSDHAPLT
IDYDFNL