GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Xanthomonas campestris pv. campestris str. 8004, 8004
Gene type: CDS

Number of genes found: 254

# Xanthomonas campestris pv. campestris str. 8004, 8004

>XC_0224 methyltransferase
MTNKYDHLDRQTLIGLLQRRDAERQLGLVWERDEIEADQALNDDFVALSL
DAGLSHGEAPWDNLIIEGDNYDALRALRMTHKGAIRCIYIDPPYNTGNKD
FVYNDRFVDKTHRFRHSLWLEFMYRRLQLAKELLADDGVIFVSIDDNEVF
RLGMLMDRVFGENNFIANVIWQKVFSPKGTAQHFSDDHEYVIIYGRDKNK
WRPNLLARTAAQDRAYKNPDDDPRGLWTSGDLSARNYYSKGVYSIVGPTG
RVIAGPPAGTYWRFSEERFKELDADNRIWWGKSGDNMPRLKRFLADVQQG
TVPQTLWTYGEVGHTQDAKKQLLEVLNFNSTNDVFSTPKPIQLMERILSI
ASKPGDTVLDFFAGSGTFAQAVAKLNAEDGGNRKFILVSSTEATEDTPDK
NLCRDVCAERVRRVLGGYTNAKGQPVEGLGGGFAYLRTRRIPKHRLALKL
DHAEVWHALQLLHQRPLSFWPGGGFASDGELAYLADFQAAHVEQLREWLR
TRTSAVAAVYTWSTERLNGLLGEPAADLSLLPLPHHLRERFGR
>XC_2029 putative DNA repair protein RadC
MPFAVNNSCVESLPFVAAQHEDRIIQQAIALLEQRVFKAGPRLDWPADVR
DYLRLKLVDEPNEVFVVVFMDNLHQVLACEPMFRGTINSATVHARVIVQR
ALALNAAAVILSHQHPSGATEPSNADRTLTHQLEAALALIDVRVLDHIII
GKGTPFSFAERGLL
>XC_1133 Holliday junction binding protein, DNA helicase
MIGRLRGILAYKQPPWLVIDVGGVGYELEAPMSTFYDLPDVGRDVILFTH
YAQKEDSVSLYGFLREGERRLFRDVQKVTGIGAKIALAVLSGVTVDEFAR
LITSGDITALTRIPGIGKKTAERMVVELRDRAADFSSGAPITGQLGPDAV
SEATVALQQLGYKPAEAARMAREAGAEGDEVATVIRKALQAALR
>XC_1524 IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>XC_2011 IS1477 transposase
MLEHTPLSERRACRLAGLSRDAFRHAPVPTPAAQALSARLVELAQTHRRF
GYRRLHDLLRPEFPSVNHKKIYRLYEEAELKVRKRRKAKRPVGERQKLLA
SSMPNDTWSMDFVFDALANARRIKCLTVVDDFTRESVDIAVDHGISGAYV
VRLLDQAACFRGYPRAVRTDNGPEFTSRAFIAWTQQHGIEHILIEPGAPT
QNAYIESFNGKFRDECLNEHWFTSLAQARDVIADWRRHYNQIRPHSSCGR
IPPAQFAANYRTQQANNAVPFNPGLYQ
>XC_2292 IS1478 transposase
MRADRRRELYGNTPSWSLSRCTAWSLACSIVAPEKPPITQRPQGLDVCSG
VPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_2625 IS1404 transposase
MDVKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMS
VPDAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_0546 IS1477 transposase ORFB
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPESPQVNHKKIYRLYRE
AKLSVRRRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTS
RAFITWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>XC_0310 type V secretory pathway protein
MQLIDIGANLTHDSFDRDRDAVLQRARDAGVAQLVITGASREHSPLALQL
AQQHPGFLYATAGVHPHHAVEFTAECEREMRALQAQPQVVAVGECGLDYY
RDFAPRPAQHKAFERQLQLAADNGKPLFLHQRDAHDDFLSIMRAFDGRLG
AAVVHCFTGTREELFDYLDRDYYIGITGWLCDERRGAHLRELVRNIPANR
LMIETDAPYLLPRTLKPLPKERRNEPMFLSHIVEELARDRGEDVAVTAEN
STAAARAFFRLPVPATAA
>XC_2986 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_2638 phage-related integrase
MRIPHHLSRSPTGRWSFVQRVPVDLQTVMGCRLIKRTLQTKDLAQAHVRA
VVLGAGYARLFAQLTDQRVDKLSKTDADLLIARLTSAENLQDLTLNRTRQ
PDGTVTEQWQIDSPKDLKLYRQLMELEAMAGAALQARAHPVAVPTMFGST
HAGPSRQSSAPAIETMTLGKARDAFLATLKGSTLPKTYTIKKTAIEALVS
FLGPTMKVHAITRSDLARWYQDMREKGASTPTLTNKQSYIGGRGGFFEWA
MASGHYPKGDNPASGHVSYSQREKRARKKLGFKAYDRAQIQALFAPEALA
KLSESARWASFLGLYTGARASEVGQLLVKDVFEEDGIPCIRISDEGEHQK
VKTEVSLRTVPLHPELLKMGFLEWVGGKRKVGETRLFPAAKATAVNGQGN
WITKAFSRHLAEVGKNWEPAKRGFHSLRKTLIQELQGAGVVSELRAQIVG
HELDDEHHSTYGRDFTVVEKLRGLGPHSPGISRLSFFQ
>XC_1540 methylated-DNA-protein-cysteine S-methyltransferase related protein
MPSPRPSKTRVAGSHAGDASATRAAEQVRLRILDVIRAIPAGEVAGYGEV
AMRAGLPGRARLVAKLLSSNQDAALPWHRVLRSDGRIALPEGSAGYQAQC
QRLRAEGVPVERGRVRRATAAQRLDAAVWGPS
>XC_3672 IS1404 transposase
MVSAPARRTLVREWIGRGASERRALAVIGMSASALRYCPREDRNGELRER
ICALAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQLQVRRRKRKK
VPIGERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIVDDATHEVVAI
EVERAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKAMVAWAHARNV
QLRLIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHARTEIERWRREY
NEDRPKKAIGGMTPAAYAQHLATDPRKQSAA
>XC_1628 excinuclease ABC subunit B
MTDRFELVSPYSPAGDQPAAIDKLVANFEAGLAKQTLLGVTGSGKTYTIA
NVVQQVQKPTLVMAPNKTLAAQLYGEFKSFFPNNAVEYFVSYYDYYQPEA
YVPSSDTFIEKDSSINEHIEQMRLSATKTLLSRRDSLVVATVSAIYGLGA
PEDYLSLRLILSIGEHIDQRQLIRHLTDLQYTRNEFELTRGAFRVRGEVL
DVFPAESDTEALRIELFDGDIEQLTLFDPLTGETLRKLQRYTVYPKTHYA
TTRERTLSAVDTIKEELKERLEQLYSQNKLVEAQRLAQRTQFDLEMMAEV
GFCNGIENYSRHLTGKAPGEPPPTLFDYLPPDALLVIDESHVTIPQIGAM
YKGDRSRKETLVEFGFRLPSALDNRPLRFEEWEARSPRSIYVSATPGPYE
LRESAGEITELVVRPTGLIDPVVEIRPVGTQVDDLMSEVHERIKLGDRVL
VTTLTKRMAENLTEYLGEHGIRVRYLHSDIDTVERVEIIRDLRLGKFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSTGSLIQTIGRAARNLRGKA
ILYADKMTRSMQAAIDETDRRREKQVEYNLEHGITPKSVARPISDIMEGA
REDAAEKKAGKGRSKSRQVAEEPADYRAMGPAEIAGKLKALEQKMYQHAK
DLEFEAAAQIRDQILKLKAASLA
>XC_2628 ISxcC1 transposase
MVGVARSTARYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHL
WNHKRVWRVYCLMKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSD
ALWDGRRFRTFNVIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPN
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTS
SFIWH
>XC_3921 ISxac3 transposase
MSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGK
SGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_4289 exodeoxyribonuclease V gamma chain
MHATSAPDFRLYPSNALDTLAALLAEELRRPVPEQPVLQPEVVLIPQVAM
RRWLQSTLAAEHGVAANLEFLTPGEFVARALERNLGPADDDLDMATTQWR
LYQTLQGELGSDAALAPLAGYLADGDALKPWALAGELGSVFEKYQAWRRD
WLLRWESGADADDPQARLWRSIAGGRQYRARRIGQYLDRYARPDGPLPQG
LPKRLFAFAILNVSPDVLRVLATQARVGTLHFYLPTPTQGYWGDLQTLWQ
RRREGGAVALFAEQVQENPLLQAWGAAGRDFMALVGDYEVVHPLAEIAAY
ADPLDAGRRTLAEGGLGDSLLRRMQSDLFHRHAPAVPPVLPAVNLHDPSL
QVHACHTRLRELQVLHDQLRALLDDARFDPPLQPREIAVLSPDIDPYVPY
LDAVFGGHGSDDGLPYALADASPLASEPLADVFLTLLGLPISRFGLHEIL
DLLASAPIAEAAGLDEAGLERLRGWLHGAGARWGLDAVHRRQHQAPGDDA
YTWRFALDRLLLGHASGAEDDIDGVAPWPQLEGSALAALDTLLRLLRVLD
RHQAALAEAMTPVQWRECLLGLLEALIPAAPSAPRAQRALERLRTLIDQF
ARDAVRAEYAGNVPAEVVRAHFAAVLGESDTRAPLLTGGISFGRMVPMRL
LPFRAICLLGMNDGDFPRRDPAAGLNRLTAELGTERRRHGDRSTREDDRF
LFLQLFASAQEVFYLSYLGADARDGTVREPSVLVSELLGSAAQYHADPKA
IDALVVRHPLQPFAAAAFGAVGEDGADPRRFSYRRQWRPAVDSLAGQRQP
LAPWVAGALPADASVLPASVSIDDLRRLFADPAGQFLRHRLGMRLPDPAG
EDSDLEPLLAPTRGLEQYGLQQQVFEAALAGDADGLYERLRARALLPSGP
LGRRQLDERLRQLRPYADVFRQWRGEAPAQSQRLQVEIDGTNVHGRVPGW
YANGVGRVQVGALSGRSAIRDGLEWLLLRAAGERVPFVRFFEHDDSLGPH
PIDPEPLSQTQARAALGELLQLYRQGLQTPLAFAPYSSWKYHQAARNDEL
DKAIKDAHGQWQSSFGWSESHSPELRLVTRGRDPFGDAQQFVDFARTSHQ
LFALLEDGSAPAPLDPARVIESWRQWRGAQDDAE
>XC_2435 conserved hypothetical protein
MNDKQEVFISVDVETAGPIPGIYSLLSIGACVIENPVQTYACELKPTTTQ
ADPAALEVTGLSLERLAREGLAPEVAMREFRTWVLGVCESKGEPVFVGFN
AAFDWSFINYYFHRYLGDNPFGFSALDIKSLYMGAVQCAWHDTRSSQMAK
SLQPHLAGNHDAMQDALYQAELFRCVRALTPARPV
>XC_3698 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_3892 DNA primase
MARIPDAFIDELLARTDIVEVVGGRVPLKRQGKEYSARCPFHDERSASFT
VSPTKQFYHCFGCGAHGTAISFLMNYDRLEFLDAVDELAKRAGMEIPRET
QQRTPQQQDDSRELYSALEAATKFFQRQLEGSDRARDYLDGRGVDAENRA
RFQIGYAPDGYSALKDTLGTDARRMSVLERAGLFSKNDRGHVYDKFRDRV
MFPIFDRRGRVIAFGGRIMGAPADGRDPGPKYLNSPETALFHKGRELYGL
WQVRQANQKIERLIVVEGYMDVVSLFQFGVTQAVATLGTATTPEHAELLF
RNAPDVYFCFDGDNAGRKAGWRALESVLPRMKDGRQAFFLFLPDGEDPDT
IVRKEGAQAFDARLKQATPLSQFFFDEMARDINLHTLDGKARLAERAKPM
LAQIPEGAFGDLMKQELARMTGVGASMSAQQSPPKARPPARMGAPTQKRS
LVRASIAILLQQPSLAMSLEGDHDFSGLRLPGIELLMELLALVRQRPEIS
TGALLEHFAEREELVALQKLAAQELPGDEHSWAIELHDVVAQLDKQLLRQ
RVEELQAKQRAQGLDNTDKYEMRELLKALAAL
>XC_1684 excinuclease ABC subunit C homolog
MVGRAERAKRSWGAPDYTYPEHLRAELDTLPATPGVYLFHGQSSTLPLYI
GKSIHLRNRVMDHFRNAAEASLLRQTRSIQVIEMAGDIGAQLLESQLIKT
LRPLYNQKLRRIPRQFSIRLYRGEVSIEHSGEIDPAAAPWLYGLYSSPRA
AKETLRRLADQHHLCYGLLGLERLQAGRPCFRAMLKRCSGACHGAEPLDA
HEERLRSVLQHLEQAAWPFPGAIALKEQGAQRTQFHVLRDWHYLGSATSL
AGARRLQATPGAFDRDCYRILRKYLETQLHCVSLL
>XC_1808 ATP-dependent DNA ligase
MSLSEYRRKRSFDKTREPEPGKLLPQGQRAIFVVQLHHASRRHYDFRLQV
GDALKSWAVPKGPSYDPAVKRMAVEVEDHPVDYASFEGEIPKGEYGGGHV
AQFDHGVWATAGDPEAQLAKGHLRFELFGSKLKGGWHLVRSSKPARQPQW
LLFKEDDAYAGTLEADDLLADVAAAPAEDVRRAGAGKAQRKALTTVPVPR
ARARNAWTNAALKLTHARRGDIDDAAFAPQLAKLGQAPPEGAQWVHEIKW
DGYRILATVTDGQVRLWSRNALEWTDKIPDIRDAIQALNLRSARLDGELI
AGRGTKEDFNLLQATLSGERQVPLALAVFDLLHIDGVEISEAPLRERKQL
LQQILANAPAGHLAYSSHVEGDGLEAFRVAGEQHFEGIISKRADRPYRGG
RSDDWRKTKQLASQEYAVVGYTAPKGSRTGFGSLLLATPDPQHGWLYVGR
VGSGFSDTLMQEVTQHLHGGGKRPTAHIPTEDTDLRGATWFAPRFVVEVF
YRGIGGQQLLRQASLKAVRLDKDIADLADSDMGDVSPAQADVADTPARGR
KRANKQAPAQGEPTLSSPTKLIYPDIRATKGDVWDYYHAVMDHLLPEIVG
RPLSIIRCPNGAEKPCFFQKHHTAGLERVSSVRLKEETGSNAYYLVVEDA
PGLLELVQFNALEFHPWGSHAARPDMADRVVFDLDPGPDVPFAEVKRAAT
DIRKLLAQLELESFLRVSGGKGLHVVVPLNPGCDWELTKRFAKGFADALA
QSEPDRFVATATKRFRNKRIFVDYLRNGRGATAVASYSLRGRPGAPVALP
LPWSDLAKLHRANAFTLRDVPDKLRRRRKDPWADIAQIQQNLARWADQG
>XC_0501 NADH pyrophosphatase
MSEPLFSLSAFAFTHAPLDRGDVLRDDPDAIARLWPTGRVLLIDAKGTAA
ADAQGQPLLSDGAALADTPGAAIFLGLRDGVGWFALAAEQVATELPHRVD
LRQAAADWPAELSTAFSYGRAMLHWQSRTRFCGVCGGAIAFRRAGFIAHC
TQCQTEHYPRVDPAIIVAVSDGQRLLLGRQASWAPRRYSVIAGFVEPGES
LEQTVEREVFEETRVQVQGCQYLGAQPWPFPGALMLGFAATAAPTELPQV
TGELEDARWVSHAEIGTALAGESGDTGIGLPPAISIARALIEHWYRTHG
>XC_2440 conserved hypothetical protein
MIDFHCHLDLYADPRQVVSQCVERGLYVLSVTTTPSAWEGTSALAGAAPR
IRTSLGLHPQIAHERKFELPLFEKLIGRVRYVGEIGLDGSPELKQHWHTQ
LAVFEQILDLCEGVDGRVMSLHSRRATKAVLDMLEKHPKSGVPILHWFSG
TQRELDRAVEMGCWFSVGPAMLRGEKGRKLAAAIPQDRILTESDGPFAKM
DSRSIWPWEAAQAVVSLAEIWQVNEQAAATRLKANLKQIGES
>XC_4109 DNA polymerase I
MSRLVLIDGSSYLYRAFHALPPLTNAQGEPTGALFGVVNMLRATLKERPA
YVAFVVDAPGKTFRDDLYADYKANRPSMPDDLRAQVQPMCDIVHALGIDI
LRIDGVEADDVIGTLALQGASDGLAVTISTGDKDFAQLVRPGVELVNTMS
GSRMDSDEAVIAKFGVRPNQIVDLLALMGDTVDNVPGVEKCGPKTAAKWL
AEYDSLDGVIANADKIKGKIGENLRAALPRLPLNRELVTIKTDVVLASGP
RALDLREPNAEALAVLYARYGFTQALRELGGAAAEAGGLTAPMAVARTEP
GRARGTGFVSAPAAAPVELDPALSAPGQYETILTQAQLDSWIARLRAAGQ
FAFDTETDSLDALQANLIGLSVAAEPGQAAYLPFGHDFPGAPAQLDRTQA
LAQLAPLLTDPAVRKLGQHGKYDLHVMRRHGIALAGYADDTLLESFVLNS
GSARHDMDSLAKRYLGYDTVKYEDICGKGAKQIKFAQVSLEDATRYAAED
ADITLRLHQVLGKRLAAEPALESVYRDIEMPLVGVLERIEANGVCVDAAE
LRRQSADLSKRMLAAQQKATELAGRTFNLDSPKQLQALLFDELKLPAVVK
TPKGQPSTNEEALEAIADQHELPRVILDYRSLAKLRSTYTDKLPEMIHPQ
SGRVHTSYHQAGAATGRLSSSDPNLQNIPIRTEDGRRIRRAFVAPAGRKL
IACDYSQIELRIMAHLSGDPGLVGAFESGADVHRATAAEVFGRTIDTVSG
DERRAAKAINFGLMYGMSAFGLARQLGIGRGEAQDYIALYFSRYPGVRDF
METTRQQARDKGYVETVFGRRLYLDFINAGSQGQRAGAERAAINAPMQGT
AADIIKRAMVSVDGWIADHAQRALMILQVHDELVFEADADFVDTLLAEVT
ARMSAAASLRVPLVVDSGVGDNWDEAH
>XC_1363 DNA-3-methyladenine glycosylase I
MSGYCSIAPGHPVHGHYHDHEYGFPQRDERELFERLVLEINQAGLSWETI
LRKRGNFQRAYDGFDVDTVAAYGEAEIARLMQDAGIIRNRLKVLAAIHNA
QVIQRLRATHGSFANWLDAQHPLDKPAWVKVFKKTFRFTGGEITGEFLMS
LGYLRGAHHADCPVFADIQALSPPWMHSA
>XC_2624 IS1477 transposase
MKKSRFSTEQIIGFIKQADAGMAVAELCRRHGFSPASFYQWRAKYGGMEA
DEAKRLKELEVQNTRLKKLLAEAHLDIEALKVGFGVKR
>XC_4115 DNA helicase II
MDVSHLLDHLNPAQREAVSAPPGHYLVLAGAGSGKTRVLIHRIAWLNEVQ
GVPNHGIFAVTFTNKAAGEMRHRTDLQLRNGSRGMWIGTFHGLAHRLLRL
HWQDARLPEGFQVMDSDDQLRLVKRVVQSLELDETKYPPKQMGWWINEQK
DEGRRPQHIQPEPNDDWTEVRRQVYAAYQERCDRSGLLDFAELLLRAHEL
LRDTPALLAHYRARFREILVDEFQDTNAIQYAFVRVLAGESGHVFVVGDD
DQAIYGWRGAKVENVQRFLKDFPGAQTVRLEQNYRSSANILGAANAVIAH
NPDRIGKQLWTDSGDGDPIDLYAAYNEVDEARYVVERARQWVRDGGSYGE
VAVLYRSNAQSRALEEALIAEQLPYRVYGGMRFFERAEIKDALAYLRMLT
NRSDDAAFERAVNTPTRGIGDRTLDEVRRLARANALSLWEAAMLCTQENT
LAARARNALATFLSLVGQLQAETGEMDLAERIDHVLMRSGLREHWAKESR
GGLDSESRTENLDELVSVASRFTRPDDEDSQGMTELVAFLAYASLEAGEG
QAQAGEEGVQLMTLHSAKGLEFPIVFLVGLEDGLFPSARSLEESGRLEEE
RRLAYVGITRARQKLVLCYAESRRIHGQDNYNVPSRFLREIPRDLLHEVR
PKVQVSRTASLGAARGGPVHAVVDAAPIKLGANVEHPKFGGGVVVDYEGA
GAHARVQVQFDEVGAKWLVMAYANLTVV
>XC_4233 ISxac3 transposase
MQAHCEEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNTGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRRTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>XC_0589 RNA-directed DNA polymerase
MPPLPDYASTELLLGLKTRKDLASWLGVSDRALRYMLYRLGDGDKYSTFS
IRKRNGGLREIHAPKKALKYLQNKVAHALAGVVPVRQIAKGYVPGRSIYD
HAKMHRSKKWVVLVDLKSFFPSINFGRVLGLLRAPPFSLENEVAVAVAQL
CTRAGELPQGAPSSPVISNLICRKLDRQLLELAKQAGCGVSRYADDICFS
TNRKRVSVEICDFVNEHGWVPGAGLKQLICSNGFEINFSKFRVHEGRDRK
LVTGLVVNKGVSTPSRWRDQLRSSLHVIDKYGEAAGVEIISGWTSGFFRK
EPGDVLRTIRGKLGYLKWIDKVANHLATDALRRNFPSLASLMPISNDGVS
FRIMAEGPTDLLHLEAALDYLRKSGGFLDVRPRFQNFLGDVGDSELWETL
LRIAKADVNELTIGVFDCDSPAFMKKTSLVPGGNIQLGPRVYAFCLAPPG
TSISNNFCIESLYSRSDATLVDGSGRRLFFGDEFDSASGFSHDGLYKCLH
PKKKAIVVSDQVARVHDGASVLLSKAGFASQVKDKAPPFDVVSFDGFRPT
WLGIRALAIAAVRK
>XC_2629 ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYADLAMENHALKDVIAKKL
>XC_0109 ATP-dependent DNA ligase
MSLRDYTRKRRFDQTPEPAEDAAATAHRQPIFVVQLHHASSRHYDFRLEA
DGVLKSWAVPKGPSLRAGEKRLAVQVEDHPLAYAGFEGDIPQGQYGAGHV
QVFDHGTWHCDGDALAALDAGKLDFDLQGDKLRGGFALVRTRLRGRQPQW
LLIKRDDAHAADLDADALVADSDATAQAIETPSAAAAPARRAKASRRRRT
PAEALVTEKSASASRPRASAATQAHWRTRALALPGARDAACPTGLRAQLT
LLRAEAPDGAQWPHEIKWDGYRLLTDLVDGRAQLRSRNDQAWTDSFPEVA
TAVQALPVRDARLDGELVVLDAQGRSDFSALQRAIDGTARQPLRYLVFDL
LGVAGVDLRATPLLERKQLLRALLGETPGTLAYSAHVIGRGPEVFAASAD
KGWEGIVSKRADAPYRGGRSADWVKTKHEDSDEFVVVGYTDPKGARSGFG
ALLLAQLDGTQLRYVGRVGTGFDSALLGEITAQLQALHSPQPTLELPAHI
PSRPRDVHWVRPVLIAEVAFRGWAKQGLLRQAAFKRLREDKPMSDLGGDR
ATPGKSRGARTRTAAAAAGKASRAAATRTAAVSAGGSAAKPGKAGKSSTA
ADVSTPSRVAKQRVTPAASSAAKPGKPGKSSAAATGTASPRAAKRGAVST
ASASSTPKSGKRSVSSGSAASGKPAAPSKAASSKTARTPATSSARKTSTA
TAASSIRKARASASNAPDGVAITHPERVVFPAAGISKGDVAAYYRAVAPL
VLPEIARRPLSLLRCPDGAAGACFFQKHEGRHLGAHIKAIPLKQKSGTED
YLYIEDVAGLLELVQMNTLELHPWGARVDDPEHPDRLVFDLDPGEGVAWT
QVVAAAREIRSKLRAAGLESAVRLSGGKGLHVVVPIVPQASWDQARDFCE
AFAQALATQAPERYVATMSKAKRHGVIFVDWLRNGRGNTSVCSWSLRARE
HATVAVPLRWEELGKLSGPDAFPLDKAVQRAKRQRNDPWADVLALKQVLP
G
>XC_1559 conserved hypothetical protein
MLPTATGNPQVRVRSIDVLSDNWYVLRKVTFDFQRKDGRWQTLSREAYDR
GNGATILLYSRARQTVMLTRQFRLPTLLNGNPDGMLIEACAGLLDQDDPE
ACIRKETEEETGYRIENVRKVFEAFMSPGSVTERLYFFVGEYVDGDKVSA
GGGVEEDGEEIEVLELSLDAALAMIATGGIADAKTIMLLQYAKLHGVLD
>XC_3166 conserved hypothetical protein
MSAMDLFDTPLAPLQVLDDAEGGVRYWPQLLAPAVAQAAFAALRDGADWQ
RHQRTMYDRVVDVPRLLASYRLDAPLPPGLPLQALLAAVQAQLPAPYNAL
GLNLYRDGRDSVAMHHDKLHTLLAPHPIALLSLGTPRRMQLRAKQGATRA
ITLELAPGSLLAMSHASQLTHEHGIPKTTRALGERISVVFRVRPPARMAA
GQHGPHWEAPTQTD
>XC_2598 ISxac3 transposase
MQAHCGEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVVYGRNPRFH
GGMQCKSTANLLGSQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>XC_3624 ISxcC1 transposase
MVCVARSTVRYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHV
WNHKMVWRVYCLMKLNQRRRSRRRVPARHPQPLACGAHPNAGWSIDFMSD
ALWDGRRFRTFNVIDDFSREALAIEVDLNLPADRVIRTLERIAAWRGYPG
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEHWLADYNQQIPHDSLGGLTPAEFRDQHQPQTS
SFGWH
>XC_1355 histone-like protein
MAKTAAKKAAPKKAVKKVAASKTAKPAKAASKTAAPKPIKEALSKTGLVA
HIAETTQLAPKDVRAVLASLEATAHASLSKKGVGSFVLPGMLKITSVNVP
AKPKRKGINPFTKEEQVFAARPATCKLKVRAMKRLKDAAL
>XC_2951 DNA ligase
MKRFAALYRTLDRSTGTLDKRAALVAYFRAAPPLDAAWALYLLAGGKVAS
ARMRIAASGELREWIAEAAGIADWLVADSYDHVGDLAETLALLLDDPASE
AVDVSLAEWIEQRLLPIANQDVAVRKHCIVQAWRTLAFDERLVFNKLLTG
ALRVGVSQRLVQQALAELSGVDIARIAQRMLGSWRPHATYVADLLTHEAL
PGDRQQPYPFFLASPLEADVESLGAIDDWLLEWKWDGIRLQLLRRAGEAA
LWSRGEERLDGRFPEIEQAAMGLPDGTVIDGELLAWQPEQLLPMPFTALQ
TRIQRLKPGPKTLAAAPARVVAYDLLELDGEDLRERPLHARRALLERVLA
TLADPRIIASPLVHSTDWQAAAQVRLDARARGVEGLMLKRARSPYQSGRR
RGDWWKWKIDPLTIDAVLLYAQAGHGRRSTLYTDYTFGLWHDGALVPIAK
AYSGLDDTEILQLDRWIRANTTERFGPVRAVTPHHVFELGFEGVNRSTRH
KSGIAVRFPRILRWRHDKPFAEADHLSSLQALAR
>XC_3256 ribonuclease H
MKSIEVHTDGSCLGNPGPGGWAALLRYNGREKELAGGEANSTNNRMELMA
AIMALETLTEPCQILLHTDSQYVRQGITEWMPGWVRRGWKTSGGDPVKNR
ELWERLHAATQRHSIEWRWVKGHNGDPDNERVDVLARNQAIAQRGGLATS
>XC_3112 DNA-3-methyladenine glycosylase
MAFTHLARSDRALGAWMRRIGPIAPQPGWRKPFDPVDALARAILFQQLSG
KAAATIVGRVEVAIGASRLHADTLGRVDDAALRACGVSGNKALALRDLAR
REALGEIPSLRKLAFMEDDAIVEALVPVRGIGRWTVEMMLMFRLGRPDLL
PIDDLGVRKGAQRVDKQAQMPTPKELAERGERWGPYRTYAAFYLWKIADF
SIAAKVPTPRSQE
>XC_2186 exodeoxyribonuclease III
MPTTQRTIATYNVNGIASRLPHLLQWLQREQPDIVGLQELKSTQEAFPEQ
AIRDAGYGVIWQGQRSWNGVALLARGAEPVEIRRGLPWDPADTQSRYLEA
AIHGVIVGCLYLPNGNPQPGPKFDYKLKWFQRLLRHAATLVALPHPVALI
GDFNVVPTDAHIYDPKGWRKDALLQPESRAAYAQLLAQGWTDSLQAIHGD
APVYTFWDYFRQHFARDRGLRIDHLLLNRTLAAGLRDAGVGKWVRALEKA
SDHAPTWITVDVPDTDAVPAAAAGPARKRTKVKEAAANAGEKKPSATKKA
VKNAAKTTAVAKTAARKSATKKPVAKKASANTASAAAAAKKATPATTRKA
SKRPKA
>XC_2421 phage-related integrase
MEIADYLKDMARRDLQDTTINSAARSLKILRLTCGDVPVSQIDHQHIHQM
WDVLRWAPPGLTSNPRFESMSAEDIIREGTALNVPCPASATTELHRRMVT
SFFNALLKTKAVAHSPMAAFKPKKPSLLTNTTTPSRLLSSSDIQKIFDPA
TFNAWASKFPHRWWGPILGLYTGARINEVAQLKVADIILEHGQWCMAIRM
TADDDLAQSNGAQTRQRLKGKSAIRKIPLHPEVINAGFLDFVADIKACGH
PRLFPHLSAGKNKKSGASNCRYSQGLLNQFSDYLKDLGFAKGIGFHGFRH
TLATELHAAGITPQDIALLTGHSLVKSVPVLQDHYIHKSSGNVMQRQLAA
LSLYQPSVQLPVYQQGQFKEKLRKGAKMYP
>XC_0403 phage-related integrase
MPLSDAAVRNATPADKPVRLFDGGGLYVEISPKGAKLWRWKYRFGGKEKR
LALGVYPEVSLAEVRAQHLEARKVLRSGIDPGEKRRVDRLVRVDRSQLSF
AAVAAELLALHGKKNSVLTMKRNGRIVEKDLNPEVSPHFLLANQSLAT
>XC_0533 ribonuclease
MALSYPTTGTRFTLSTNTALVGGERCTLAITASAIRDASGLSPAGNQSIA
FTVATASGGGTGYYSRVNTTSPSQLRCSLNATIRGHTVYPYSGTGTSTWT
ILEMADEDPNNSGRILDAYRNRSYAKVSDRAGTGSGLTYNREHTWPNSLG
FGSATGDRGLPYAPYTDTHMLYLTDTTFNADRGNKPYAACTSSCGERVTE
VNDGSGGGSGRYPGNSNWVRTPDGNGGTFEVWGRRKADMARAVMYMAIRY
EGGTDAATGQSEPDLELTDDRSRIVQTSASPAYMGLLSTLLAWHQADPPD
DAERARNQVVFSFQGNRNPFVDHPEWATSSLFSSAKPASCQLAN
>XC_0410 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_3930 exodeoxyribonuclease III
MRIISFNANGLRSAASKGFFEWFATQDADVLCIQETKAQEHQLAGPEFLP
AGYKAWFRDASTKKGYSGVAIYAKREPDEVRTALGWPEFDEEGRYIEARF
GNLSVVSFYIPSGSSGELRQGYKFQVMEWLRPILSEWLASGRQYVLCGDW
NIVRSALDIKNWKSNQKNSGCLPPERDWLNGLCADLLDEADASNGRGWVD
SYRVLHPQGEDYTWWSNRGAARANNVGWRIDYQLVTPGLRDKVQACSIYR
EQRFSDHAPYIVDYAE
>XC_3622 conserved hypothetical protein
MKTSRFTDRQIIAILKQAEAGTPVPQLCREHGISSATFYK
>XC_0968 IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>XC_1407 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALARMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHTLLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_0721 conserved hypothetical protein
MDEFDQRSWWTPAPDDPAWALPDDLRAADPLHGRDCGWVNQMRPFVRHFS
APGQQVFDPFCGFGSTLLAAALEGRQAHGMEVDPARAVVARERLRRHAVQ
APVVVGGLAQVPPAAPVDLCLTNVPYFGCHWSGPVAPGQLYASTDYASYL
AGLRAVFHALRRQLRPGGFGVAMVENVVLDGQLVPQAWDLGRILSSLFTL
REERVLCYPRAGGVLAQRGTASNRSHEYALIFQHCRTRLDLQAAAQLLQA
LREQGLPVTPHGSYARWLQAPEQVPDGPADLDLIVPGEQAVWDRLSHWLH
AQGFALSLWGAPCLPAVPLAMVAAHHYLRAERLDADGRRLQVDLQLPMDE
PAQP
>XC_2683 exodeoxyribonuclease I
MPDSFLFYDLETFGQDPRRTRIAQFAAVRTDAQLRVIEEPISFFVQPADD
LLPSPYATMVTGITPQHALREGVNEAEAFARIAEQMGRPQTCTLGYNSIR
FDDEFVRCGLFRNFYDPYEREWRGGNSRWDLLDVLRLVHALRPDGIVWPQ
REDGATSFKLEHLADANAVREGDAHEALSDVYATIGMARKFQQSQPKLWD
YALRLRDKRFAASLLDVIAMQPVLHISQRYPATRLCAAAVLPLSRHPRID
SRVIVFDLDGDPDALLRLSPDEIADRLYIRAADLPEGEQRIPLKEVHLNK
APALVAWQHLRSDDFQRLGVDRAAVEAKAARLRELGPELAEKVRQVYGAE
RAGAAAVNDADASLYDGFLAEGDKRLLTQVRSSAPGELGAMEARFRDPRL
IELLFRYRARNWPQTLSPHEHQRWNDYRRQRLLEDRGLGEVTLEQFYAQI
ADLRLAHPDDATKQSLLDQLAAWGSDLQRTL
>XC_3251 conserved hypothetical protein
MVSFDATEALTPYREGRGYGAILFDRERLRQADAGLFSPQRWGDRARPVD
EGGRGGAWFVDAPFGHSVLRQYRRGGMAARVSRDQYLWKGAGRTRSFAEF
RLMRELLKRKLPVPRPLAACYLREGLGYRAALLMERLENVRSLADHAQVA
GRGAPWEDTGRLIARFHRAGLDHADLNAHNILFDAGGHGWLIDFDRGVLR
IPATRWRERNLARLHRSLLKLRGNRTREDVDKDYERLHRAYELAWGRGY
>XC_2002 phage-related integrase
MMMALSDLTVRQAKAAEKTYSIPDTDGLGLVVAPTGGKSWHLRYYWLGKQ
KRISLGNYPEVGLREARTLRDEARALVAKGINPHADRKQKRRAIKLASDY
TFKAVFDAWVEHRAKELKEGRNSTLSQIQRIFGKDVLPSLERMSIYDIRR
PQLLGVLARIERRKAFTTTEKVRTWLSQLFRYALVIVEGMEANPATDLDV
VAEPKPPVSHNPYLRLPELPDFLRKLRLYNPRGWQTQLGIRLLFLTGVRT
GELRLATPEQFDLDRGLWIIPPQIVKQLQDEMRKAGKRPHDIPPYIVPLS
VQAIEIVRYLLGVMRPAQKHLLAHRSELKKRISENTLNAALKRVGYDAQL
TGHGIRGTISTALNEIGYPKIWVDAQLSHSDPNKVSSAYNHAKYVEPRRR
MMQDWADRLDLLEQGKVEAASTHLTIHIEGVPAMAEEPAAIAAVSAKAAV
SSTPIVVVPSTEGTTFQRLSQVPPPPTRTPEPEASAIQREREEMLALYES
PCCLPVPLFGKLAGKSKDQINRELKAGKLLSISLGNRGQRVPDWQLVPLK
HKLTQVLMNQCQGADSWDLFRMLTRPHTDLGDRAAIDVVTPTNVLAIVRT
IMGDQSFEKVRTLQSSESVGERAQQQPLHQISDQEARERPSSPSL
>XC_2007 IS1478 transposase
MGWPASVGWRVAPEKPPITQRPQGLDVCSGVPELPVFATLKAGFWQFPLM
HTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSRL
PATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGE
VVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELSR
VIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPAL
SRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERIA
VWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIAV
TACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLGY
RGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRCR
LKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSPM
TLGA
>XC_0520 DNA-binding protein
MNAAPSRPDSSRGAPKSPLREHVAQSVRRYLRDLDGSDADDVYEIVLREM
EIPLFVEVLNHCEGNQSRAAAMLGIHRATLRKKLKEYGLT
>XC_1054 IS1478 transposase
MLAMSIAGKYVVAPEKPPITQRPQGLDVCSGVPELPVFATLKAGFWQFPL
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>XC_3623 ISxcc1 transposase
MRKSKFTESQIVATLKQVDGCRQVKDVCRELGISDATYYVWKSKYGGTEA
ADVQRLRDLETEHSKLKRMYAELAMENHALKDVIAKKL
>XC_2400 DNA helicase related protein
MDQTAVAQAGEETSSLRGAGGLKDKVEKARLELLDLSTRNRLLHTPRGGR
AKTVEVVYELAKAMYQTLVIEGKRFTFVPGKEDPKQANTTGSPGDDDDSD
EVLLDEAPEDPELIAQPDFELDENGRVKAHWDAHLTTRLTPTGLQKRLLD
LYMDARTLQEEQGVNVLYLAVGYLRWRATSTPQTDRFAPLVLIPVQLERS
TAGEKFHLKWTGDDIQANLSLQLFLQREFGLRLPSIDEFETLDVDAYLAS
VEAMLEGKETWGVLRDDAILGLFSFAKFMMYRDLDPQSWEALGGLAAIPM
LRGVVQDGFPGASLTDENSELDLIIPPERMRHVVDCDSSQALVVHDVLQG
NNILVQGPPGTGKSQTIANIISAAVAEGKRVLFVAEKMAALEVVKRRLDH
VGVGVACLELHSNKANKRSLLEELRQTLHLGEPKRTSTSPIIEQLTEKRD
TLNAHVARLHEAHQPSGLTPYQVFGHMVRLRRLGYTTQSLPLEAPTSWAS
HQKEERESLLRELIERIEEIGVPDEHAWSGVQNDGLLPSERDRLLILVAG
LTERLNDWEVSTTELHSALDLSPPLRFDDAGLAVQRVKALLEAPKLGSDA
LQSKVWDAPGRASTVIDTLENAQKARAAVEAKIQMEALAQDWDATKATLR
VLPNSFVLGKELATLSGAHSALVRLGPDLTRLTQLLAEKAPLTLDLALRL
VAIAERATTIPELDRDALVAHIWERGVDAIEEVVVAVETVQQAKRNLASV
FRDAAWSKDSAELEAARGQLAMRGTSLFRFLSGDWRRANQTIRTLLSNPK
LPALDMLGSLDVLLDAKSAQKKIVENDAQGQEAFGSNWQRERSNPSFLRG
VVAWMRTLRPLGSGVRERLADISDRVLATEIAKRVKPVLEELVRDLAPVH
EALLSAERNPWGEETVLKRVVLSELEKKSALWQAASQQSALLANAQELTI
EQALAVIEQIQLAQNAITTYDALAADGTSAFGPFWGALDSKAHELRQVAS
WIEHNPELRLLAVRIVDPESLLDRAEGCSSSAGLLAGEVMDLLVGLQFEG
NAEITRDLCVAPLATLKLQLASWQADPEGLRSWVSYLAIAKEAERKGLGA
IVDGLASGKLARADALGVFDLSYYEALLQALVQRDRALASFDGQRQSQVV
SSFASLDQERMQLARYEVVKAHHSKIPRQGGATGPTAILIGEMARKKGHL
PIRQLMQRCAPAIQALKPVFMMSPLSVAQFLPAGALDFDILVIDEASQVQ
PIDALGAIARVKQLVIVGDERQLPPTRFFSKALGDGGDRDDDEGGAQASD
VESILGLCRARGLPERMLRWHYRSRHQSLIAVSNQQFYEGKLFIVPSPYT
SEAGVGLRLHHLPEAIYDRGNTRTNPKEAKAVALAVLAHAQNTPQFTLGV
ATFSTAQRRAILDELELLRRQHPETEGFFADHPAEPFFVKSLENIQGDER
DVIYISVGYGRDAQRHMTMNFGPVSNDGGERRLNVLISRAKSRCEVFTSI
TDEDIDTDRAKGKGTFALKLFLNYARTGRLTLATQARDAKHSVFEQEVAQ
ALRARGYDLHTDVGLAGFFVDIAVSNPEEPGRYILGIECDGQSYRDARSA
RDRDRLRESVLRDKGWQVYRIWSSDWFHRPEAELEKLVVAIERAKSESGT
LEPSPNSSHRAVPVEILTVDRGEVTEMGLIEAQSPPDTEPYIEASFAVPS
NQFELHLVPTGQLATIVRQIVEVESPIHRSEVVMRARTLWGLQRAGSRIQ
QAVEEAIGSVVSMGGVQVVDKDFLAIPGKEVHVRDRSLVESSTLRRPELL
PPAEIKVAVTKVVADNLGAKRDELVLIVSRQLGYRSTSTQLRQLILDQIE
ALRLAGRLADKGELVVLA
>XC_0664 site-specific recombinase
MPEAAPPVADARGSSPTATTGPGADATLSAVEPFLAHLQIERQVSAHTLD
AYRRDLAALIGWASAQGSEDVAQLDSAQLRKFVTAEHRRGLSPKSLQRRL
SACRSYYAWLLKHGRIATSPAAALRAPKAPRKLPQVLDADEAVRLVEVPT
DAPLGLRDRALLELFYSSGLRLSELCALRWRDLDLDSGLVTVLGKGGKQR
LVPVGSHAVAALRAWQRDSGGSAQTHVFPGRAGGAISQRAVQIRIKQLAV
RQGMFKHVHPHMLRHSFASHILESSGDLRGVQELLGHSDIATTQIYTHLD
FQHLAKVYDAAHPRAKRKKATE
>XC_3914 IS1404 transposase
MPAAWLQRGLVLPVAQQVRWDERARCQAAQGPRVRERAAEEVAGRAVVRE
RPDQGCTAKKVVSAPARRTLVREWIGRGASERRALAVIGMSASALRYCPR
EDRNGELRERICALAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQ
LQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIV
DDATHEAVAIEVERAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKA
MVAWAHARNVQLRLIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHAR
TEIERWRREYNEDRPKKAIGGMTPAAYAQHLANTDIINPGL
>XC_0858 IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRRRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTS
RAFIAWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>XC_2880 DNA polymerase III alpha chain
MSTSRFVHLHVHTEFSLADSTIRVPEKPDQADPKKAKQANLLSRAVELDL
PALAVTDLNNLFALVKFYKAAEGVGIKPIAGADVMIATPDVTPWRMTLLC
RDREGYLSLSRLLTRAWMEGHRPEGGVAIHPEWLQAGHANLFALAGRDSL
AGRLFAEGRADLAEQQLADWQRVFGDGLHLELTRTGREGEERFNQFALHA
AGVRGLPVVASNDVRFLYASDFAAHEARVCISSGRVLDDPKRPRDYSDQQ
YLKSSEEMAALFADVPDAIDNTLALAQRCNIEMRLGTYFLPAYPVPEDET
LDSWIRSQSRDGLAARLEKNPIAPGKTRQDYVDRLEFELDTIIKMGFPGY
FLIVADFIQWGKNQGIPIGPGRGSGAGSLVAWALQITDLDPLPYNLLFER
FLNPERVSMPDFDIDFCMDRRDEVIDYVARKYGRERVSQIITYGTMAAKA
VVRDAGRVLGFTYGLVDSVAKLIPNILGITLKDAMGEGKDTEMASPELIQ
RYQVEDDVRDLMDLARQLEDLTRNAGKHAGGVVIAPEPLSEFCPLFAEHD
EGGRGKNPVTQFDKNDVEEVGLVKFDFLGLRTLTIIDWAVKAINVRHARA
GIDPVDITAIPLDDAPTYKGVFASGNTGAVFQFESSGMRRLLKDARPDRF
EDLIALVSLYRPGPMDLIPDFNARKHGQQDIIYPDPRTEAILKDTYGIMV
YQEQVMQMAQIVGDYSLGGADLLRRAMGKKVPAEMAKHREIFREGAAKGG
VSAQKADEIFDLMEKFAGYGFNKSHAAAYALVSYQTAWLKRHYPAEFMAA
TLSSDMDNTDKVVGFLDEVRNLGLTVLPPRVNESAYMFEAASPDTIQYGL
GAIKGVGQGACEAIVEERLRNGPYTTLLDFCTRVGTAKLNRRTLEAMINA
GAMDGLGKNRASLMLQLPEVMKATEQMARERASGQNSLFGGPDPSAPAMR
LDLPESKEWPLGQLLTGERETLGFYLSGHPFDPHRDEVRELVGCDLSALD
KILASQQRGGGGGGDGEKRAWRPEVSAILAGQVVGVRRKGDSQVFVQLED
GRGRVECSAFSDAMAEFGHLLTRDRILIIKGGLREDEFNGGYSLRIRQCW
DYEQICADHTQRLSLRLDLREKQAWSRIDTLLAKHRPGKTPLRLDLLLRS
PAGGVAGMLDLNGSHSVRIDQQLMDSLRADPAVRTLKVKYSPPWAQ
>XC_1925 integration host factor beta subunit
MTKSELIEILARRQAHLKSDDVDLAVKSLLEMMGQALSDGDRIEIRGFGS
FSLHYRPPRLGRNPKTGESVALPGKHVPHFKPGKELRERVSSVVPVDMVD
AAD
>XC_1332 DNA transport competence protein
MKSFTVVLKSLLLALLLSSNAYALDKVDINTASAEELDKVLMNVGRSKAE
AIVEHRQANGPFKSAEELALVKGIGLKTVERNRDLIEVGATMAPAKKAAK
GAAVKPVGRR
>XC_3148 DNA polymerase III alpha chain
MPRGWTVAARLRAANDDITHAAVADTLPAYAELHCLSDFSFLRGASSAEQ
LFARAHHCGYSALAITDECSLAGIVRGLEASRATGVQLIVGSEFTLVDGT
RFVLLVENAHGYPQLCSVITTGRRAAGKGAYRLGRAEVEAHFRDVVPGVF
ALWLPGDQPQAEQGAWLQRVFAERAFLAVELHREQDDAARLQALQALAQQ
LGMSALASGDVQMAQRRDRIVQDTLTAIRHTLPLADCGAHLFRNGERHLR
PRRALGNIYPHALLQASVELAQRCTFDLSKVQYTYPRELVPQGHTPASYL
RQLTEAGMRERWPEGAPAQVVAQIDSELELIAYKGYEAFFLTVQDVVRFA
RAQGILCQGRGSSANSAVCYALGITAVNPSETRLLMARFLSKERDEPPDI
DVDFEHERREEVLQYVYTKYGRERAALAATVICYRGKSAVRDVAKAFGLP
PDQIALLANCYGWGNGDTPMEQRIAEAGFDLANPLINKILAVTEHLRDHP
RHLSQHVGGFVISDEPLSMLVPVENAAMADRTIIQWDKDDLETMKLLKVD
CLALGMLTCIRKTLDLVRGHRGRDYTIATLPGEDAATYKMIQRADTVGVF
QIESRAQMAMLPRLKPREFYDLVIEVAIVRPGPIQGDMVHPYLRRRQGYE
PVSFPSPGVEEILGRTLGIPLFQEQVMELVIHAGYTDSEADQLRRSMAAW
RRGGDMEPHRVRIRELMAGRGYAPEFIDQIFEQIKGFGSYGFPQSHAASF
AKLVYASCWLKRHEPAAFACGLLNAQPMGFYSASQIVQDARRGSPERQRV
EVLPVDVLHSDWDNILVGGRPWHSDADPGEQPAIRLGLRQVSGLSEKVVE
RIVAARAQRPFADIGDLCLRAALDEKARLALAEAGALQSMVGNRNAARWA
MAGVEARRPLLPGSPAERAVELPAPRAGEEILADYRAVGLSLRQHPMALL
RPQMLQRRILGLRELQARRHGSGVHVAGLVTQRQRPATAKGTIFVTLEDE
HGMINVIVWSHLAMRRRRALLESRLLAVRGRWERVDGVEHLIAGDLYDLS
DLLGEMQLPSRDFH
>XC_2064 putative DNA topoisomerase III
MRLFLCEKPSQGKDIGRILGATQRGEGCLNGSGVTVTWCIGHLVEAAAPE
AYDEQLKRWSIEQLPIIPQRWQVEVKPKTATQFKVVKALLAKATQLVIAT
DADREGELIAREIIDLCGYRGPIERLWLSALNDASIRAALGKLRPSAETL
PMYYSALARSRADWLVGMNLSRLFTVLGRQAGYDGVLSVGRVQTPTLKLV
VDRDREIAAFVSVPYWAIDVYLTTSGQAFTAQWVSPDTCTDDSGRCLQQP
VAQQAAQQIRATGSAQVVSVETERVREGPPLPFDLGTLQELCSKQLGLDV
QETLDIAQALYETHKATTYPRSDSGYLPESMFAEVPIVLDSLPKTDFSLR
PIIDQLDRTQRSRAWNDAKVSAHHGIIPTLEPANLSAMSEKELAVYKLIR
AHYLAQFLPHHEFDRTVANLSCGQQTLTATGKQVVVKGWRLVLAEPQPEE
DSDTAARSQVLPALREGVSCQVADVDLKALKTQPSRPYTQGELVKSMKGV
AKLVSDPRLKQKLKDTVGIGTEATRANIISGLIARGYLMKKGRAIRASDA
AFTLIDAVPAAIADPGTTAVWEQALDMIEAGQLTLDVFIGKQAAWISQLI
TQYASASLSIKVPHGPACPQCGAPTRQRSGKSGPFWSCSRYPDCKGTLPV
ESGTSKRVASRSRRGGRKDS
>XC_2618 phage-related integrase
MTNFRAGLSLLPLKQWKQAMSSIVQQQLRDYITSPFGLLQIKGAHVGNIR
ATEINITGREAAIHALELFDRADERSRRVRLSKNRTLYPEPLPLGSPARL
LSVEISDYLGHRDRCGLAKETVEDTARSLKLLRIACGDVPVSRIDHAHIY
RLWDLMRWAPPLLLSDPKYQAYTFEQAVALGKELGVAPPAPATLEKHRRF
LVTFFSKLVKAKAIPMSPMDAFAEIKKDLVVDTSKPERLFDEEELQRIFS
PKTFPAWAKKYPHRWWLPMISLYTGARINELAQLKVADIVEEAKVWCIRI
QKTVDADLRHKDRDRSRQSLKGKAAVRTLPIPKPLLDAGFLDFIEDIKAT
GHPRLFPHLSAGVNRETGETNARYSQGAVNQFSSYMKTLGFGKGIGAHAF
RHTLATELHHKNVSDQDIALITGHSLRKNVPVLHDAYFHKKPKLARATQI
RILAKYKPPVELPKYERGQFSECLADPSKFYP
>XC_2423 conserved hypothetical protein
MSRPHLSKSIDQLEELYAASLSDEATLAELAEELGFRRTPRAKELATEVV
NALNRYRAQDRADSAHNQAPPAPRHAPNKSMRSRPEPGDTAEHPAGPTIS
PPDLEPVNYTNRAPDILDAWSALEVLSPTTFKRRAELASNIAQNIVSIGP
GLPWDSGSARSKPNFKVYFHIILGTLAAEPAFSKLLTRFQDARPNPPNIK
GEVVLASILVDKDGLLAGEMPVSLSAFGWGLPVALGGSLQSLAAWTQEEG
RLIEGLTKQLRPDPSGTDTPITYATIQKAFSWLVQTLGLPAELVQGPQFA
VAAYIGFRSSLVPDPLLLNSFFLRDLARAKDRAMRQALPPALARFLGQTK
PSERHDLLHDPQALDQALSPATTPLGRWITPANQRPALLQQAAVNLASQL
PKTSPLLGVNGPPGTGKSTLLRDVVADQLTRRAEAMCAFNDPNKAFKTAW
SKRIRGENQELFALDPSLRGFEMVVASSNNKAVENISEEIPALKSIASGS
TLRYFEPLATQLLGKNAWGLSAAVLGNGANRSRFRKAFWTESHLSFKTYL
ERVSGITPRQEADPEGLADLCQAPANPAEAMARWQQTKQNFQTRSAQVSK
RLAELEALRQALSALPSLRQREEHASRQRADAEQSASTTMETWQTAQSIS
TNARSQLEGSREALSQHERDRPGLFALKRLFGSISVKQWEAAHTPLAREH
DQAQVAYRKLKSEEDAALQASEQARKAAHDTAASHVTAKDALDKALARCA
SIQVGDGACVLDASFHDLEHGAKNLALPWLDRDLQTQRSELFEAAMDVHR
AFIDAAALPLLRNLNGLVGSNFKLGADRKQLAADLWASLFLVVPVISTAF
ASVERMLNQLPDESLGWLLVDEAGQATPQSTVGALLRTKHALIVGDPLQV
EPVVPLPPVLTQAVMREFKVDPDRFAAPNASAQTLADDGSLHCARFETTS
GSRAVGAPLLVHRRCASPMFDISNRVAYNNLMVQAKVPDASLIRDVLGPS
RWIDIRGSGRDKYSPEEGEQVIALLRLLQTASIAPDLYVVTPFVVVQDEL
RELVRRSGLLTGWVDSPYEWTKERIGTVHTVQGREAQAVIFVLGAPQAAQ
HGARGWAGSTPNLLNVAVTRAKEAVYVIGNRSLWSGAGHFAMLDQML
>XC_4038 ATP-dependent helicase
MSDIATPAAPSAPVPTQRTLTEPVKAGIREAYAKLQANTPGFATRRAQSQ
MIGLVSRALATSGGIGVAEAPTGVGKSLGYLTAGVPIALATKKKLVISTG
TVALQSQLVERDIPAFLKATGLEATVALAKGRTRYLCTRNAAELEGETSQ
NGMFEDEQVLYDRPLSPADVDLAKRLAKAYAARTWNGDLDDAPEPVSVPL
RMRVTTPASGCAGRRCSYAAQCPVLKARTDVREAQIVVTNHALLLSSLSL
GDAENGQPLIAPPSDMLLVLDEGHHIAGVAIDQGAANLPLDDMAKRTGRM
QILIAAAYRAVDKDKIGNLLPSEAIEVAARVSKLLKAFHTEVERVWKPEP
GERDPLWRAPNGKLPPQWGPAIEELGEETRALFNWVHAAHGTVAKGKQDD
AARERLQRSLGMALEMAEQQHNLWSGWRREDKDGQPPMARWITLSRDGDL
ICHCSPVSAAQVLRTLLWNEVDSVVMTSATLTGGGDFQSFAIDNGLPDHA
EMASLASPFDLPNQAELIVPNFPVTPDDREGHPKEVAKYLVRELDWAAKG
SIVLFTSRWKMEKVADLLPLAQRNRVLVQGEGNKSQLITEHLRRIAAGEG
SVLFGLNSFGEGLDLPGEACTTVVITQVPFAVPTDPQTSTLSEWLESRGH
NAFNLIAIPHALRTLTQFAGRLIRSSNDHGRVIILDSRLLTRRYGKRILD
ALPPFKRVIGR
>XC_2778 phage-related integrase
MGVFQGRLNSVRYRPSLYFKPLILKKYYSLPSTGVRHGVSVPYPYGPGER
LKLMARYDIDKATVRARLDPRREPYWGAPVERGLYVGFRRLAQGGSWVAR
YHKEDRTHAYKSLGPVTAENDHENAKREARIWRKTIDAGVQADRLLTVAD
VCRDYTAAIEAEGRTRAAIDARKRFDRIVYVDPIGKLRADKLTQRHLEAW
MTRMEAGEMTGRKKALPSRATFNRNLTALKAALNRSVARREIPQERVIEW
QSIKPHKGASGRRDTYLDKAQRRALLDAMGTDLRALAECVALTGCRPGDP
VAMRRKDWDSKNGLATFATKTGARTVPVSPAARALFDRLATDKQGDAWMF
TNEGEHWTPQAWAPKVKAAAAAAGLPTGVVLYVLRHAWITDAIIGGLDAV
TVARLTGTSLEMISQHYGHLAQHAAREMLGKIDFL
>XC_4232 ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGKSG
VVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_2993 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHTLLHGKEDSVFGDSGYTGADNREELQDCKAAFFIAARRSTLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_2766 recombination protein N
MLRHLSIKDFAVVRATELEFGPGMTVVSGETGAGKSLMVDALGFLSGLRA
DSGVVRHGADRAELSAEFQLPAEHPGLTWLADNELDDDAQCQLRRIIRAD
GGSRAWINGRPVTSSQLSDLAARLVEIHGQHEHQALMARNSQLALLDAYA
RNSAQREQVRQASQRWQALLDERDALSAQGDVSDRIGFLEHQLAELERED
LDPAAIAALDTNHRRQAHATALIGACESVVQQLNGDEGPSALGLLQDSRH
DLARVAEHEPRLGEVDALLDSAAIQIEEALALLDRVRDDLDADPTQFEAM
ERRLGRLHDLARKHRVSPDELAAHRDHLTAEVESLRGADERLQQLDKHIA
AAIGVWQGAASVLSASRQSAAQALSAATTTLIGELGMGGGQFLIQLQPQE
TLRPDPNGAERVEFLVAANAGQPPRALRKVASGGELSRISLAIEVAALGL
DSVPTMVFDEVDSGIGGAVADIVGQKLRALGEERQVLCVTHLPQVAAKGH
AHYRVSKAPVDGMTQSAVELLGPQARQEELARMLGGVEVSKEARAAARKL
LQSA
>XC_1362 conserved hypothetical protein
MSDTLRAACEAPVLSAATTPHRTFRMRRLLGIAGLLIAFAAAVPQADAHK
VPRLSLSLSLAQVLDQLRRDSAAVPASDPMPIDTVMKRYADTHGQSFDIA
SPDPEEDPSEAVPPTQPADVTDAEWHALQAYGAHTTSEADDISENRSHHY
TLIDLDEDGQRDLLDEAYVGGTGLFTQITVLQGHTDGFRAPTATPTGTPA
DREADAGFSINGRGGDQALYWLRIDGRSYAAYRDGDYFQDTLTLSRPLSP
LPAERHPTKALQIRYRYQHTLAPPRKDAAERLLEEQQADDWLAQHPAMRA
AVDTQLQHLRLDAQGRQRSPDPEARCPSPAESSDPELEAQWPWHDAGHYT
FDFVANLRVRHGSECYSASVVAFRSSFQSANTACCVLWLYEAPGNQVANL
PLLSKRTRSGIALITAAPVDASQD
>XC_2209 ATPase
MRPRTLDEMVGQKRLLAADSALRRAVESGRVHSMILWGPPGCGKTTLALL
LAHYADAEFKAISAVLSGLPDVRQVLAEAAQRFASGRRTVLFVDEVHRFN
KAQQDAFLPHIERGTILFVGATTENPSFELNSALLSRCRVHVLEGVSPQD
IVEALQRALHDAERGLGQETIQVSEASLLEIASAADGDVRRALTLLEIAA
ELATGEGGEITPRTLLQVLADRTRRFDKNGEQFYDQISALHKSVRSSNPD
AALYWLTRMLDGGCDPAYLARRLTRMAIEDIGLADPRAQSMALEAWDIYE
RLGSPEGELAFAQLVLYLASTAKSNAGYAAFNQAKAEVRASGTQEVPLHL
RNAPTKLMKTLGYGQDYQYDHDAEGGIALDQTGFPDAMGERVYYNPVPRG
MEIKLKEKLDRLREARAQARADKGKAGN
>XC_2149 succinoglycan biosynthesis protein
MLLLMWFRAWLCVIGVLTVSPALAADLIGRATVTDGDTLTVAQQRIRLWG
IDAPESAQQCTARNGQAWPCGRRAAAALDAYVQDKTVRCQPKDTDRYGRI
VAECFVQGQSINAWMVRSGWAVAYRQYATAFVADEAIARQQASQLWSGSF
QTPSEYRRAKRSASAKPAAGTSAPSNARCTIKGNVSAKGAKIFHLPGQRD
YAKTRIAPAHGERMFCSVREALDAGWRPAQR
>XC_1327 replication related protein
MSVPQLPLALRAPSDQRLDSYIAAPDGLIAQLQAFAAGQLSDWLYLAGPS
GTGKTHLALSVCAAAEQAGRSSAYLPLQAAAGRLRDALEALEGRSLVALD
GVDSIAGQCEDEVALFDFHNRARAAGITLLYTARQMPDGLALVLPDLRSR
LSQCVRISLPVLDDVARAAVLRDRAQRRGLALDEAAIDWLLTHSERELAG
LVALLDRLDRESLAAQRRVTVPFLRRVLGDRTS
>XC_0285 ATP-dependent RNA helicase
MPRMSDPAFPISPLLPQIRDSLAAHPRLVLEAPPGAGKTTQVPLALLDAP
WLAGRSIVMLEPRRVAARSAALFMARQLGEPVGETVGYRIRFENKTSART
RIEVVTEGILTRMLQDDPMLERVGALLFDEFHERHLAGDLGLALALDVQS
QVREDLRIVAMSATLDGERLASFLDAPRLSSAGRSYPVEVAHFPARRDEA
LEPQTRRAVEHALATHPGDVLVFLPGQREIARVQAALQDALDPAMQLLPL
HGELPVEAQSQVLQPDPQGRRRVVLATNVAESSVTLPGVRVVIDSGLARE
PHYDPNSGFSRLDVTSIAQASADQRAGRAGRVASGWAYRLWPQSQRLEPQ
RRAEITQVELTGLALELAAWGSDALRFVDAPPGGALAAARELLQRLGGLN
AEGGITALGRRMLALATHPRLAALLAQAGTPARLALACDLAALLEARHPL
RQGGDGLAARWRALAAFRQGRTGADANRGALAAIDAAAKQWRRRLRCDAT
PPTSVEAHALGDLLSHAFPDRIAARHPTDPLRYLLANGRSARLFDHSDLR
GEPWLVASELRYEAKDALLLRAAPVDEGYLRQSVPERFVQQDVVQWDADK
RALVARRQSSFDRIVLDSRPAGRVDPAQAAAALTEAVRQLGLDALPWTEG
LRQWRARVVSLRAWMPELGLPDLSDTALLASLDHWLRPAFAGKTRLDALD
EASLGDALKAALPWERRQAIDRHAPTRISVPSGMERAITYALDHDQQPLP
PVLAVKLQELFGLAETPRVADGRIPLTLHLLSPGGRPLQVTQDLKSCWAT
TYPDVKKEMKGRYPRHPWPDDPWTANATHRAKPRGT
>XC_0590 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_0926 IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_0906 IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>XC_1416 transcription-repair coupling factor
MPSPTFPSPPLPKSGQLRAYWRAPSSPTALAWSIARAAEAHAGPVLVIAR
DNQSAHQIEADLHALLGDASALPVVPFPDWETLPYDQFSPHPEIISQRLA
ALHRLPGLTRGVVIVPVQTLLQQLAPLSYIVGGSFDLTVGQRLDLDAEKR
RLESAGYRNVPQVMDPGDFAVRGGLLDVFPMGADTPLRIELLDEDIDSIR
AFDPESQRSLDKVDAVKMLPGREVPMDDASVERVLACLRERFDVDTRRSA
LYQDLKSGIAPSGVEYYLPMFFSKTATLFDYLDTRVLPLIATGVSNAADA
FWLQAQNRYEQRRHDVERPLLPPDELYQSPDALRERLNKLARIEVWPADH
PRSDEAAPLGDQPLPPLPVAAKDAPAGQALASFLGHYPGRVLVAADSAGR
REALMEVLAAAQLKPDVVADLPAFLAATKLRFGITVAPLEDGFALDTPQI
AVLTERQLFPERANQPRRTRRVGREPEAIIRDLGELSEGAPIVHEDHGVG
RYRGLIVLDAGGMPGEFLEIEYAKGDRLYVPVAQLHLISRYSGASADTAP
LHSLGGEQWTKAKRKAAEKVRDVAAELLEIQARRRARAGLALQVDRAMYE
PFAAGFPFEETTDQLAAIDATLRDLGSSQPMDRVVCGDVGFGKTEVAVRA
AFAAASAGKQVAVLVPTTLLAEQHYRNFRDRFADYPMKVEVLSRFKSTKE
IKAELEKVASGDIDVIIGTHRLLQPDVKFKDLGLVVVDEEQRFGVRQKEA
LKAMRANVHLLTLTATPIPRTLNMAMAGLRDLSIIATPPPNRLAVQTFIT
AWDNTLLREAFQRELSRGGQLYFLHNDVESIVRMQRDLSELVPEARIGIA
HGQMPERELERVMLDFQKQRFNVLLSTTIIESGIDIPNANTIIINRADRF
GLAQLHQLRGRVGRSHHRAYAYLVVPDRRSMTSDAEKRLEAIASMDELGA
GFTLATHDLEIRGAGELLGEDQSGQMAEVGFSLYTELLERAVRSIRQGKL
PDLDAGEEVRGAEVELHVASLIPEDYLPDVHTRLTLYKRISSARDSDALR
ELQVEMIDRFGLLPDPVKHLFAIAELKLQANALGVRKLDLGENGGRLVFE
AKPSIDPMTVIQMIQKQPKIYTMDGPDKLRIKLPLPEAADRFKAARGLLT
ALAPR
>XC_1212 ISxac3 transposase
MCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGH
RKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRSFLASHGLVC
SMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEVFDYIEMFYN
PNRRHGSTGDLSPVEFERRYAQRGS
>XC_3150 conserved hypothetical protein
MPLDALLAARTVWRAGHGTATANGGESTGHAALDAVLPDGGWPRRALTEL
LLPAHGIGEIALLLPTLARMTGAGSRVVLVAPPYVPYAPAWQAGGVALQQ
LEIVQAEPRDALWAFEQCLRSGACAAVLGWPQTGDARALRRLQVAADSGN
CCAFALRDRRHAVNASPAALRLEFLPERDAWQVRKCRGGQVPSQPLRLAH
>XC_0355 site-specific recombinase
MTRLTPKLLDQVRGRLRLRHYSLRTEQAYVGWIRRFILANGKRHPAQMGQ
AEVEAFLTDLATRGQVSAGTQNQALAALLFLYREILGLELPWMENLVRAK
RPRRIPVVLSVEEVTRLLTMLEGACRLMAGLLYGSGMRLLECLRLRIKDV
DMVRCEIVVRDGKGGKDRRVPLPRSLRGELMQQRERALLLHAADLAEGAG
QVFLPHALARKYPSADVEPGWQYLFPGARRSVDPRSGRVGLHHVSEEIRQ
RAVHAARRRAGIDKPATCHTLRHSFATHPLEAGHDIRTVQELLGHKDVAT
TQIYTHVLGRGASAVRSPLDGLHLSGG
>XC_0947 ATP-dependent DNA helicase
MPRARSVTPSLAVAGQAPLSSLPGVGPKVAEKFAARGILSLQDLWLHLPL
RYEDRTRLTTIAQLQGGVPAQIEGRVEAMERGFRFRPVLRVAMSDDSCGT
LVLRFFHFRAAQVAQFSPGTRLRVFGTPKPGQNGWEIVHPSYRVLAPDED
AGLGDCLDPVYPVLEGVGPATLRKLIGQALERLPPEAALELLPPHWLQDE
QLPSLRSALLTMHRPPVDTDPQQLLAGGHPAQQRLAIEELLAHQVSLRRQ
RIALQRFRAPQLRGGRLVQQLRKALPFQLTGAQQRVFEQIAHDLAQPAPM
LRLVQGDVGSGKTVVAALAAMLAVEHGKQVALAAPTELLAEQHLANLRGW
LEPLGVRIVWLAGKVTGKARVAAMAEVASGQAQVVVGTHALMQDAVVFHD
LALAIIDEQHRFGVHQRLALRDKGAAAGSVPHQLVMTATPIPRTLAMAAY
ADLHVSAIDELPPGRTPVQTIVLSAERRPELVERIRAACAEGRQAYWVCT
LIEESEDTDKGAQNGPPRIEAQAAQVTFETLSAQLPGVRVALVHGRMKPA
EKQQAMLDFKQGRTDLLVATTVIEVGVDVPNASLMIIENAERLGLAQLHQ
LRGRVGRGAAASSCVLLYQGPLSLMARQRLETMRQTNDGFVIAERDLELR
GPGELLGTRQTGLASFRIADLARDAGLLPRVQVLAERLLDEAPEIADRVV
ARWIGGAVRYAAA
>XC_3967 DNA polymerase-related protein
MPAHRIPSAPTTPLAKPVDTVPSGTLTALRAQAQDCRRCDLWKPATQVVF
GAGPARAPLMIIGEQPGDQEDQQGRPFVGPAGQLLGTLMADAGLDPAMAY
VTNTVKHFKFVPRGKRRLHQRATAGEQAACRPWLAAELLRVRPRIVLALG
AMAAQTLFGNAFRLTTERGQWRALDGRTTALASWHPSAILRMREPDRTAT
RALLREDLAQVAAALDNLR
>XC_3956 conserved hypothetical protein
MVKDYSRYRRTLLAPIARLMVRHEANMLRRLQGWKHAPALLGTLGGLALG
MEFIPGDTLSASAVVGQEVFQQLQHALRRLHAVGITHNDLHGTNVVVSAG
VPVLIDFTSAWRFPRWLRRSTLSRQLQRSDVANFQKMRRRLVGIAPSDAE
AALTAEPGWVRGVRNGWKRLYRWLKGGAA
>XC_2804 replicative DNA helicase
MSARPGFRSNRNRDRDRDDYDRPEPRLDQLRVPPHSVEAEQAVLGGLMLA
PDAFDKVNDQLTENDFYRRDHRLIYRAIRELSEKDRPFDAVTLGEWFESQ
GKLEQVGDGAYLIELASTTPSAANIAAYAEIVRDKAVLRQLIEVGTNIVN
DGFQPEGRESVELLASAEKAVFKIAEAGARGRTDFVAMPGALKDAFEELR
NRFENGGNITGLPTGYTDFDAMTAGLQPTDLIILAARPAMGKTTLALNIA
EYAAIKSKKGVAVFSMEMSASQLAMRLISSNGRINAQRLRTGALEDEDWA
RVTGAIKMLKETKIFIDDTPGVSPEVLRSKCRRLKREHDLGLIVIDYLQL
MSVPGNSENRATEISEISRGLKGLAKELNVPVIALSQLNRSLETRTDKRP
VMADLRESGAIEQDADMIVFIYRDDYYNKENSPDKGLAEIIIGKHRGGPT
GSCKLKFFGEYTRFDNLAHDSVGSFE
>XC_1164 ATP-dependent RNA helicase
MSAIDTDLATTLRERRGAVDAAMSRDRGRLLGLWSRWQGKPGNPQLRQAF
EQALAASQAQRQARAAQQPAITLDTQLPIAREADRIIALIRDHPVVVIAG
ETGSGKTTQLPKLCLAAGRGAAGMIGCTQPRRIAARAVAARVAEELNTPL
GTTVGFQVRFTDRVGEDSRIKFMTDGILLAEIASDRWLSAYDTIIVDEAH
ERSLNIDFLLGYLKQLLRKRPDLKLIVTSATIDTERFSRHFDDAPVINVE
GRTFPVDVRYRPLEGESGDGDTGDVGRDGERTVNDAIVAAIDEITRIDPR
GDVLMFLPGEREIRDAHQSLERRKYRETEVVPLYARLSAADQDRVFNPGP
RRRLVLATNVAETSLTVPRIRYVVDPGYARVKRYSPRQKLDRLHIEPISQ
ASANQRMGRCGRIAEGICYRLYAEADFAARPAFTDPEIRRASLSGVILRM
LQLGLGRIEEFPFLEAPDERAVADGWQQLLELGAIDAERRLTAIGRQMAR
LPVDVKLARMLVAAQQHGCLREMIIIAAFLGIQDPRERPPEAREAADNAH
ALFADARSEFVGILRLWDAYRQVHEDLTQSKLRDWCGRHFLGFLRMREWR
ELHRQLRLLCEELGWSEEPAGAMLAPLLAGASAPVREDGQAHRATRGQLH
RAARLAREGKPDPAAPPAQAKAAAAKSSPADATDAAVRTSERERAAAYQA
LHRALLAGLPTQIGHRTEKGDFLAARQRRFVPFPGSALARKPPPWILAAT
LLDTQKVWGMTNAAIEPDWAIAELPHLLARKHFDPHWSRAQGQVVASEQI
SLFGLVLAPKKPVHFGKIDPATSHDLFVRQGLVPGEINTRAAFVADNLKV
LEQAREEEAKLRRAGIVADEDWQARWYLDRIPAELHSASGLDAWWKTLPA
DKRRSLHWSLNDLLPGEGSEADRFPKYFALGDARLPLQYRFEPGAIDDGV
TLEVPLHLLNALDPSRLSWLAPGFVADKASALIRSLPKAQRRNYVPAPDY
GRAFYEAFSTPSADDMRGELARFLSKATGAPVAALDFDEEALDTHLLMNL
RLRDEDGRVLAESRDLVGLRARFGERAGQAFAARAGRALAAEGLRDFPAT
PIPEQVAGEAGVPAYPALVDQGEDAALRIFADRNEALRAHPRGVRRLLEI
ALADKIKQARKQLPVSPKTGLLYAAIESQERLRGDLVDAALNAVLAEGLG
AIRDPAAFAQRREDAVKRLFGEAMARLTLAESILGAVAELKPLLEAPLMG
WARGNLDDMEQQLRALVHAGFLRDTPADALANYPRYLKAMILRTERAKRD
PARDQARMLELKPFVDALNDAAARGLQQHPDWQALRWDLEELRVSVFAQE
LGAKSGVSAKKLSQRVAALRG
>XC_2065 single-stranded DNA binding protein
MSTHFWGEGNVGSPPEYREFPNGNDEPRRLLRLNVYFDNPIPRKDGEFED
RGGFWAPVELWHRDAEHWKDLYQKGMRVLVIGRMEREPWTDNEDQPRETW
QINARSVGILPYRIESVALSPKPQEAEPKPQAAQESTTPKDAKRRK
>XC_2660 DNA gyrase subunit A
MAETAKEIIQVNLEDEMRKSYLDYAMSVIVGRALPDARDGLKPVHRRVLF
AMNELGAHSNKAYFKSARIVGDVIGKYHPHGDQSVYDTLVRMAQPFSLRY
LMVDGQGNFGSVDGDSAAAMRYTESRMSRLAHELMADIEKETVDFQPNYD
EKELEPTVMPTRFPSLLVNGSAGIAVGMATNIPPHNLTEAINACIALIDT
PELDIEGLMEYIPGPDFPTAGIINGTAGIAAGYRTGRGRVRIRAKADVEV
ADNGREAIVVTEIPYQVNKARLIEKIAELVKEKKLEGISELRDESDKDGM
RIYIEIKRGESAEVVLNNLYQQTQMESVFGINMVALVDGRPQLMNLKQML
EAFIRHRREVVTRRTIFELRKARARAHVLEGLTVALANIDEMIELIKTSA
NPQEARERMLAKTWEPGLVGALLGAAGAEASKPEDLAPGVGLSNGFYQLS
EVQASQILEMRLHRLTGLEQEKLTDEYKQLLEVIQGLIRILENPDVLLQV
IRDELINIREEYGDARRTEIRHSEEDLDILDLIAPEDVVVTLSHAGYAKR
QPVSAYRAQRRGGRGRSAASTKEEDFIDQLWLVNTHDTLLTFTSSGKVFW
LPVHQLPEAGSNARGRPIINWIPLESGERVQAVLPVREYADNRYVFFATR
NGTVKKTPLSEFAFRLARGKIAINLDEGDALVGVALTDGDRDVLLFASNG
KTVRFGESTVRSMGRTATGVRGIRLAKGEEVVSLIVSERAGGVEDEVEDE
SAEEVVETTDGAEPAVIDVADNGDVAYILTATENGYGKRTPLAEYPRKGR
GTQGVIGIQTTERNGKLVRAVLLGSTDEVLLISDGGTLVRTRGSEISRVG
RNTQGVTLIRLSKGEKLQAVERLDASLEEPEDVVDEAVAITSDAPPAEG
>XC_1178 ATP-dependent helicase
MSQLATASIEALSEGGALARQLDAFAPRAAQLRLTGAIAEAFEQRDVLLA
EAGTGTGKTYAYLVPALLSGLKTIVSTGTRALQDQLFHRDLPRVRAALGI
GLRSALLKGRANYLCKYRTQQARGEPRFASPEQVTQFQRIVAWSGRTQFG
DMAELDALPDDSPLLPMVTSTVDNCLGTECPFYSECFVVQARQRAQAADL
VVVNHHLLLADLALKQEGFGEILPGAQAFVIDEAHQLPELAANFFGESFG
MRPWQELARDCMVEARLVAGAQASLQAPILALDDALRGLRAGMEGLPPRG
TQWRALAKPQVREGFDAVLSALARLGEALLPLREASPGFDGCTARAQEAL
NRLSRWLGEDVPVPDFEQDLPETVDNDVLWYELSPRGFRCQRTPLDVSGP
LREHREKSQAAWVFTSATLAVGGEFDHIALRLGLNDPITLLQPSPFDWAS
QALCYLPPNLPDPAARGFGTALIAALHPVLEASNGRAFLLFASHRALREA
AEALRDGPWPLFVQGEAPRATLLQRFRTSGNGVLLGSASFREGVDVVGDA
LSVVVIDKLPFAAPDDPVFEARLDAIRRDGGNPFRDEQLPQAVIALKQGV
GRLIRSETDRGVLVLCDPRLLNKGYGRTFLNSLPPFSRTREIDDVRAFFG
SGPETGQAGSEIATLLPD
>XC_3838 conserved hypothetical protein
MHTRQRAMASRCGGGAAQTPAPSPAITPVECYGPMPAPMPRRAGFQRTQL
LLSAAQRSALHRVLDAQMPAIHTLPQARRVRWSLDVDPIDLY
>XC_3255 DNA polymerase III epsilon chain
MRQIILDTETTGLEWRKGNRVVEIGAVELLERRPSGNNFHRYLRPDCDFE
PGAQEVTGLTLEFLADKPVFAEVVEEFLAYIDGAELIIHNAAFDLGFLDN
ELSLLGDQFGRIIDRATVVDTLMMARERYPGQRNSLDALCKRLGVDNSHR
QLHGALLDAQILADVYIALTSGQEEIGFGAMDAGQHAEGGEGMIAFDPSL
LLPRPRVVVTPSELQAHEARLERLRKKAGRALWDAPELDEVAVAS
>XC_3643 ATP-dependent RNA helicase
MSFESLGLAPFLLRALAEQGYETPTAIQQQAIPLVLAGHDLLAGAQTGTG
KTAAFGLPLLQHLGTTPQPVNGPRKPRALILTPTRELATQVHDSLRGYSK
YLRIPSAVIYGGVGMGNQLDALRRGVDLLIACPGRLIDHIERRSVDLSGI
EVLILDEADRMLDMGFLPSIKRILTKLPRQDRQTLLFSATFEENIKQLAL
EFMRNPMQIQVTPSNTVAESITHRVHPVDGARKRDLLLHLLAQDSREQTL
VFARTKHGSDKLALFLEKSGIKTAAIHGNKSQGQRMRALSDFKAGRVTVL
VATDIAARGIDIDQLPKVINYDLPMVAEDYVHRIGRTGRNGSTGEAISLV
AQDEAKLLRQIVRMLGRDVEIRDVPGYEPQTPIRWGNSAPGRAEQPGGDR
APRKSHARRPHGDAPRQAHAHAGPKKPGGQRSSGPRQATAGAGAGRRDGG
RGGSGRPASRGA
>XC_3161 excinuclease ABC subunit A
MSSSASSPVPGLVRVRGAREHNLKNVDVDIPRDALVVFTGVSGSGKSSLA
FGTLFAEAQRRYLDSISPYARRLIDQVGVPEVDAIDGLPPAVALQQARGA
PSARSSVGSVTTISNSLRMLYSRAGQYPPGQEIIYADGFSPNTPAGACPT
CHGLGRIYDATEASMVPDRSLSIRERAVAAWPGAWHGQNQRDILTTLGID
VDVPWTKLPKKTRDWILYTDEQPVAPVYAGYDLDEVRRALKRKEEPSYMG
TFTSARRYVLHTFAITQSAQMKKRVAQYLISTQCPQCDGKRLRREALSVT
FAGLDIGALSQRPLDEVAELLRPAAEATPATQAAQGKRKRSATAEHPEQV
IAAQRIAEDLRARIAVVQALGLGYLTLERSTPTLSPGELQRLRLATQIRS
QLFGVVYVMDEPSAGLHPADAQALLGALDQLKAAGNSVFVVEHEVDVIRH
ADWIVDVGPAAGVHGGQVLYSGPPAGLEQVDASSTRRYLFGTPPQVHSHA
RDATGWLQLRGITRNNVRALDVDLPLGVFTTVTGVSGSGKSSLVSQALVE
LLAAHLGQTQAEEDEALDPLERGTQVPLGGAIVGGLDQVRRLVRVDQKPI
GRTPRSNLATYTGLFDPVRKLFAATPAARRRRYDPGQFSFNVAKGRCATC
EGEGSVHVELLFMPSVYAPCPTCHGARYNAKTLEIELRGHSIAQVLEMTV
DQAATFFAEDASVLRPLQVLREVGLGYLRLGQPATELSGGEAQRIKLATE
LQRAQRRDTVYVLDEPTTGLHPADVDTLMRQLQGLVAAGNTVIVVEHDMR
VAASSDWVLDMGPGAGGAGGHVVVAGTPDVVARHRGSLTAPFLGALICLM
IEVAVGRQRAV
>XC_3908 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_3391 exodeoxyribonuclease IX
MTTPAPPLATAPLAAALRTPRPVPLYLVDASLYVFRAWHSIPDEFQDAQG
WPTNAVHGFARFLLDLLERERPQHITIAFDEALDSCFRHAIYPAYKGNRE
PAPDALRRQFAHCKALCAALGLSVLAHREYEADDLIGSALHSARARGLRG
IIVSADKDLSQLLFEHDEQWDYARNVRWGMDGVKARHGVHAHQMADYLAL
CGDAIDNIPGITGIGAKSAAVLLAHFGSLDALLERLDELPFLRLRGAAQM
ALRLREQREHALLWRQLTTIALDAPLELTESGFTRAPADTDMLTGLCDSL
RFGPLTRRRLLAASGGAVLPPPPASLSQGPFP
>XC_0665 ISxac3 transposase
MQAHCGEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>XC_1475 ATP-dependent RNA helicase
MTQESSAPLLFADLGLSDAVMKAVAAVGYETPSPIQAATIPALLAGRDVL
GQAQTGTGKTAAFALPVLSNADLNQVKPQALVLAPTRELAIQVAEAFQKY
AEAIPGFRVLPVYGGQPYAQQLSALKRGVHVVVGTPGRVIDHLDRGTLDL
SQLKTLVLDEADEMLRMGFIDDVEAVLKKLPEKRQVALFSATMPPAIRRI
AQTYLKDPAEVTIAAKTTTSANIRQRYWWVSGLHKLDALTRILEVEPFDG
MIIFARTKAATEELAQKLQARGLAAAAINGDMQQAAREKTIAQLKDGKLD
ILVATDVAARGLDVERVSHVLNYDIPYDTESYVHRIGRTGRAGRNGDAIL
FVTPREKGMLRAIERATRQPIEEMQLPSVDAVNDTRVARFMTRITETLAG
GQIEMYRDLLQRYESENNVPAIDIAAAMAKLLQGDAPFLLTPPVRGARED
FAPRERNDRADRGERPRFEPKFERGPRAPDGERGARPPRPDRPAYGEDAG
AERPRREPSAPRGEPEFGMESYRIEVGHTHGVKPANIVGAIANEAGLESR
YIGRIDIQDDYSILDLPADMPRELLTHLKKVWVSGQQLNMRKLEEGEAAA
AAASKPKFPRGPRPAGRPNRPMDRAGAPHRKGPPKPRGPRSE
>XC_0536 helicase
MSTLPAPSSGGVYPSGQPGSHRYSHHQGKFFAHWLTLRSRDESAVARALS
AARVDMNPHQVDAALFALKSPVEAGVLLADEVGLGKTIEAGLVLAQHWAQ
RRRRLLLIVPATLRKQWTQELEEKFQLPSVILESKSFNAFRQLDVSNPFQ
IEDAIIVTSYEFAASKQKELAAVSWDLVVLDEAHKLRNLYKGKSASKRAV
ALNEALRGRRKVLLTATPFQNSLMELYGLVSFISDEFFGSQKAFQMQYAS
GRSSEARLNDLRHRLKPICHRTLRRQVQAEGGINFTKRFSITQDFTPSDE
ELELYDKVSAYLQDPSILAIKPTARHLVTLVVRKILASSTFAIADTLETI
IGRLESNQALTAKQIEDFETVDELSDELDSDGMERQGDAPDSDALQAELF
CRAQEIDRLRGYRDLAKQIQSNRKGDALLLVLGRALAMAEKLGGQRKAVI
FTESRRTQEYLRVLLEEHGYAGRTVLLNGSNDDVDSRQLYADWLERHAGS
TLVSGSRTADMKAAVVEAFRDHRDVLITTEAGGEGINLQFCSLLVNYDLP
WNPQRVEQRIGRIHRYGQKSDVVVVNFVNRKNRADQLVFELLEKKFKLFD
GVFGASDEILGAIESGVDIEQRILRIYQNCRDAAQIESEFATLQAELDES
IGRREESARRMLLEHFDEDVVRNLRSRRAGMLHKINQYESQFLHLIAAER
PKARISDKLVKLEEGDYAVSWPPAEEANAGLLRPDKGLGERLCRQAKERP
TLPGTLHFDYAAVDAQRADVRQWLGSRGQLRVALVTIQTAEEVLEELVCS
ALCSDGRVVPDETAARLMEIPARFVPRHRVAEDRALPAFEAAVERVLAEA
NKRNESWFLQESERLDRWGEDQRLLLQQSIDEFDLQVRDAKRTLRQLETL
EQKAQLKREIKRIEQQRDDAMLDFFEGRKRIERSQDVMLERVENALRTQH
TVQVLFDVEWTLEHPEP
>XC_2086 DNA helicase related protein
MTSDPPFLKPRRWRIDALAAGSRYPIRDLDVNLDGGLLARLTEAETATVS
LALRSVAADASNTALAQRDTHLELLPRNQWGGISHLPDLVAAFVQPNDPA
VDRLLKRTAEVLRQNNRNPALDGYTGGAKRAWELASALWGATAGMQLDYA
LPPASFEQSGQKVRSPSQIESSGLGTCFDLTLLFCAALEQAGLNPLLVFT
EGHAFAGVWLQSEEFSNTVVDDVTALRKRLRLKELVLFETTLITQRPTVP
FSYATDRGAQQVDESQDAGFRLAVDIRRARLQRIKPLASAEAAVAAVSDS
EATLSSPAVSVIEDAPDLPDSPPFKTEDASQLDPKDRLLRWQRKLLDLSL
RNNLLNFKAGKKALKLEAPDPSTLEDLLASGQSLKLRPRPDLMDGADPRD
QAIYEARERENVRRAHALEALQKREVFVGVPETELDSRLVDLFRSARTTL
QEGGANTLYLALGFLSWTREDRDGQRYRAPLVLVPVSLQRKSARSGFTLS
LHDDEPRFNPTLIEMLRQDFELNLGAVEGELPRDDAGLDVTAVWKAVGHA
IKDIKGWEVTEDVVLSMFSFAKYLMWKDLAEHSEQLRENPVVRHLLDTPR
DAYPPGAPFPQVRELDQHFDPKQVFCPLPADSSQLSAVLAASQGKDFVLI
GPPGTGKSQTIANLIAQSLAQGRRVLFVSEKIAALDVVYRRLREIGLGEF
CLELHSSKARKLDVLAQLQSAWSSSGQTDAEQWRAEAEKLKRLRDALNIY
VERLHQRRRNGLSLFDAIGTVSAGHDIPTLPLAWLSADQHDHAGIDQLRS
AVDRLEVNAQAIGHAALAQHPLALVGHRDWSPTWQQQLIAAARDVLPAAQ
ATIESAHAFVQAIGLPSPLLTPETCEALLLLAQRLTLAAGHDWRFVLRPD
ARSLSQRLQEGAARVRRHAELNTLLSTPWPASVITACADGLALLTEHRQT
HAELGEPWPVRITVQLNQALGLLAQLSEHHAALSVPYGKTIEQLDVAQLQ
QMWEQAEQTFWPKSWLGKRKVTTQLSSATTGGSQPDVANDLQHWNAIRAL
RQRIQAIDPGQQCADVWAGLDTQQDKVSTALRWQIALAAVLEGQAWEDDG
FDAIAGGQCGATLQADLQRARRLRQLDQDIAAHASLETATDGLWAGHATQ
FNCLRAALDFLSDWRSHAQQGALDAHTLVEEGACGPTLARDHQTLRQRAD
MEQALAALDDLRESTAGLWKGLATNLDDLEQACQLREDLAAVLARLATTP
EHISACKAPLHTLLGDANALLEPGGRIALAGARYVEKWEQLLPRREALAT
TGHFAEAAQTQWQSMSLDGLIEQSQSIVRAEHGLRSWCAWRQARDEALAL
GLATLVQGIKQGQVGPDQARRTFEANYARWWLNAVVDHEPVIRGFVSAEH
EQRIRDFRELDERFTALTRDWLRARLCADLPSQDNVSRNSEWGLLRHEMG
KKRAHLPLRELMAQIPEALTKLTPCLLMSPLSIAQYLQAGANAFDLVIFD
EASQIPVWDAIGAIARGHQVVMVGDPKQLPPTSFFDRAESGLDDEDVEAD
LESILDECIGANLPTRNLNWHYRSRHESLIAFSNHAYYDGGLVTFPSPVT
NDRAVSLQPVSGTYQKGGTRTNPAEAKALVADVVARLTAPGFRESGLTIG
VVTFNAEQQKLIEDLLDEARRQDPRLEPYFAESELEPLFVKNLESVQGDE
RDLIYFSITYGPDPAGQLAMNFGPLNRQGGERRLNVAITRARHELRVFAS
FHAEQMDLARTQAIGVRDLKHFLEFAERGARALAEANCGSLGGFDSPFEQ
AVAAALARRGWHVQPQIGASSFRIDLGIVDPDAPGRYLAGVECDGATYHR
SATARDRDKLREQVLRGLGWDIVRVWSTDWWIDPAGTLDRLDARLQAVLI
AQREQRAEQAERDAEAESLAQAAIAQAIASVTKPDGEMAPPAQDADPIAP
EVSATAPSQQVEEVFARQVSAEAAHANAEETTPPEASLYRITDPAEAVTG
ANPDRFFDGEYNDILLTMIAHVVDHEGPVLDALLARRIARAHGWLRTGGR
IRERVFQIARPRYRTTDEEVGTFYWPEHLDPATEPPFREPADEDSVRAAD
EISIAELASLARAVIAQGTQGEGIYQAMARRLRLQQLRAASRARLENVVR
SLRAEP
>XC_4274 integrase
MNRTIKEATVKGFHYDDHAQLQQHLANFIDAYNYGRRLKALKGLTPYEFI
CKQWTSEPDLFKVDPIHLMPGLNT
>XC_1800 3-methyladenine DNA glycosylase
MSLHSPLPRAFYAADARTVAPLLLNKVLVSADGRRGRITEVEAYCGSEDA
AAHSFRGMTPRTQVMFGAPGHLYVYFIYGMHWAINAVCGGAPGHAVLIRA
LEPLAGCDAMHAARGAAPFKSLTTGPGRLAQAFGVSAVDNGLDLTTGVAR
LWIEDDGTPSPAAPLAGPRIGIRKAVELPWRWVVPGSAYLSRPLPRVSGA
RASVTGD
>XC_2966 DNA repair protein
MLIEHERGFVLHARAWRETSLLVEVLTEQHGRVGLLARGVHGPRKQALRA
ALQPLQLIQFTAVQRGELAQLRQAEAIDTAPRLLGEAMLAGFYISELLLR
LAPRHAPVPELFDCYAQARAHLASGAALAWGLRQFERDVLDGLGFGFDLQ
HDSDGQPIDPAARYRLDPQDGARRVLSERLAQDRRETVTGAALLALGEDR
VPATEDMPGLRRSMRGVLLHHLSGRGLKSWEMLEELARRGA
>XC_2612 RadC family protein
MKRTQDRAVQYQLEMDEEGILLAAATILEQRLQRQGRIHSPDQAGDYLVA
RCAHLPHEVFGVVFLDTKHHILATEHLFSGTIDGCDVHPRVVAKRALDLN
AVAVILFHNHPSGNPEPSEADRKVTERLKQALALLDIRVLDHLVIGGRQH
TSLAARGWV
>XC_2702 endonuclease III
MSSALSAPPPRRGSTLRKPEIQELFARLRELNPHPTTELEYTTPFELLIA
VLLSAQATDVGVNKATRKLYPVANTPRDILDLGEEGLKRYISTIGLFNAK
AKNVIATCRILLERYGGEVPHDRAALEALPGVGRKTANVVLNTAFGEPTM
AVDTHIFRVANRTGLAPGKDVRVVEDKLVKVIPAEFLHDAHHWLILHGRY
VCKARKPDCPNCVIHDLCRYRDKTVAA
>XC_2134 IS1480 transposase
MARARQERNACSRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQTLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>XC_2536 topoisomerase IV subunit B
MPTRCMNTRYNAADIEVLSGLDPVKRRPGMYTDTARPNHLAQEVIDNSVD
EALAGHAKQVEVTLYKDGSCEVSDDGRGMPVDMHPEEKIPGVELILTRLH
AGGKFSNRNYTFSGGLHGVGVSVVNALSTKVELFIKREGSEHRMEFRDGN
AASKLEVVGTVGKKNTGTRLRFWADPKYFDTPKFNVRALRHLLRAKAVLC
PGLTVKLHDEATGEQDSWYFEDGLRDYLKGEMADRELLPADLFAGSLKKD
TEIVDWAAAWVPEGELTQESYVNLIPTAQHGTHVNGLRSGLTDALREFCD
FRNLLPRGVKLAPEDVWDRVTFVLSLKMTDPQFSGQTKERLSSRQAAGFI
EGAAHDAFSLYLNQNVEIGEKIAQIAIDRASARLKTEKQIVRKKVTQGPA
LPGKLADCISQDLSRTELFLVEGDSAGGSAKQARDKDFQAILPLRGKILN
TWEVASGSVLASEEVHNLAIAIGCDPGKDDITGLRYGKVVILADADSDGL
HIATLLTALFLQHFPALVAAGHVFVAMPPLFRVDVGKQVFYALDEEEKTT
LLDKIAREKMKGQISVTRFKGLGEMNPQQLRESTIHPDTRRLVQLTIDDG
EQTRSLMDMLLAKKRAGDRKQWLETKGDLASLEV
>XC_3671 IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_2812 conserved hypothetical protein
MRLIDSHCHLDASEFDADRAAVIARAKAAGVMQQVVPAITAASWPGLREA
CALAPGLHPAYGLHPIFLDLHRPEHLELLAEWIARERPCAIGECGLDFFL
DGLDAQTQRHYFDGQLQLAKRFDLPLIVHARRAVEEVIARIKAVGGIRGV
VHSFAGSPEQAQQLWKLDFMIGLGGPVTYPRANRLRGLAAQMPLEHLLLE
TDAPDQPDAEIRGQRNEPARLRTVLDCIAQLRGEPAEAIAAQTSANARRL
FGLPA
>XC_2399 integrase/recombinase
MSKAKTAAKPGRTKTFSGTLPRGIKASQAVSSVAGVTLRTDGQLRWEARI
RRSLNGQALKFPLVRYPIDPKASPNTEHHIDAARLMAEAYVRREHASLEL
RQTPYAHTAEAWTFGDLLRRFVQEIDDGLIKQASVRTDQSNAYLFLGGGK
GLGLSQTGLPHLTRKLAKDLTQDDFLGRHAGSFVNAYIKVKRDGTTLPMA
QGSKKRALTTIRNLFRIAHENWQIDLRSPIKSLKSLNSDDARDRTLTEEE
WNAIVAQLDAGRTDPATADVIRFARMTAARRSECVKLDWADINFKKKTAR
LRETKAKNGKYNERVIPLTSEPMALIAARFEASETKKGPVFVTSRGKRIR
ADTVTQAWDRVRGQIAEKLGDPEILTARMHDLRHTRITEMGHHLNPAEAA
RISGHKDLKTFMRYFNPDPVALGKKLEQLERQGGAGANVDAVVTQLLELS
PKDMASAVALAFQARAKAQ
>XC_1086 conserved hypothetical protein
MDAPKPWHLYLLLCRNGSYYAGITNDLERRFQAHLRGTGARYTRANPPLQ
VLASHPYPDRATASRAEWLLKQQPRARKLAWLQAQGLLPAESRPDDTPLT
PA
>XC_3242 recombination protein RecR
MSSLLEQLIEAFRVLPGVGQKSAQRMAYHVLEREREGGRRLAAALANAVE
KVGHCVQCRDFTESEVCAICANSGRDRQQLCVVESPADRLAIEHATGYRG
VYFILQGRLSPLDGIGPRELGLDRLAERLAAGEVTEMIIATNATVEGEAT
AHYLAQLARQHSVRPSRLAQGMPLGGELEYVDRGTLSHAFGTRSEVL
>XC_2779 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_3262 histone-like protein
MNKTELIDGVAAAADISKAEAGRAVDAVVSEITKALKKGDAVTLVGFGTF
QVRERAERTGRNPKTGDSIKIAASKNPAFKAGKALKDAVN
>XC_2589 phage-related integrase
MEDGGIPCIQISDEGEHQKVKTDVSLRTVPVHPDLLALGFLDWVEQARGQ
GQERLFPAAKADAKNGQGNWISKAFSRHLAEVGKNWPTAKRGFHSLRKTL
IQELQGAGVVSELRAQLVDHELDDEHHVTYSRAFTAKEKLDGLRAVSPGL
SVLDYGLSLDSLSALMNQTTSVVSLKSNALRA
>XC_2630 conserved hypothetical protein
MFRLGASTASNLHRGNPMSVDQRRRDVERHQKEIARLQTEKSREETKAVG
EKKKAFDASAAATRTKSVSTQQSKLREAQRYEGNAVAIQKKIADLETKIA
REHERLGNANRQLSAAQVQEQKKQVQEDKKRDAERKRAMAASAQELSRVQ
GRLTHHDHLHRSTEATLQRLQELPELLTVLFLASNPIDQQQLRLDEEARS
IHEMVRKSEHRDVVKLESRWAVQPLDVLQAINECRPGVVHFSGHGSEEDD
ILFQDSVGRSKLVSKEAIVQTMMAGSGDIQLVFFNTCFSHGQAAAIVEHV
PCAIGMNTSIGDQAARVFAAQFYSAVGFGLSVAGAFEQARAALMLEWIPE
AHTPELFAGPGVDPSQVFLVRPPG
>XC_1132 Holliday junction resolvase, endodeoxyribonuclease
MTRILGIDPGSQRTGIGIIDVDESGRSRHVFHAPLVLLGEGDFAQRLKRL
LHGLGELIETYQPQEVAIEKVFMGKSADSALKLGHARGAAICAVVLRDLP
VHEYAATEIKLALVGKGGADKVQVQHMVGIMLNLKGKLQADAADALAVAI
THAHVRATAQRLGVNTQQAWSRKR
>XC_3035 DNA mismatch repair protein
MRPIDRQIYRPPDFRTTFLQTADTKDKTKLSTGAAEHTPLMKQFFAAKSD
YPDLLLFFRMGDFYELFYDDARKAARLLDITLTQRGSSGGAPIPMAGVPV
HAYEGYLARLVALGESVAICEQIGDPALAKGLVERKVVRIVTPGTVTDEA
LLDERRDTLLMAISRSKQGYGLAWADLAGGRFLVNEVDSVDALEAEIARL
EPAELLVPDEDNWPEFLRGRVGVRRRPPWLFDADSGRRQLLAFFKLHDLS
GFGIDDKPCATAAAGALLGYVEETQKQRLPHLTSIAMEVASEAISMNAAT
RRHLELDTRVDGDTRNTLLGVLDSTVTPMGGRLLRRWLHRPLRLREVLVQ
RHHAVGSLIDTGADTDVREAFRALGDLERILTRVALRSARPRDFSTLRDG
LALLPKVRTILAPLDSPRLQTLYAELGEHDATAHLLISAVAEQPPLKFSD
GGVIATGYDADLDELRRLSTNADQFLIDLEQRERASSGIATLKVGYNRVH
GYYIEISKGQAEKAPLHYSRRQTLTNAERYITEELKSFEDKVLSARERSL
SREKLLYEGLLDALGGELEGLKRCASALSELDVLAGFAERAQALDWSQPE
LESAPCLHIERGRHPVVEAVRDQPFEPNDLDLHPDRRMLVITGPNMGGKS
TYMRQNALIVLLAHIGSYVPASRAVIGPIDRILTRIGAGDDLARGQSTFM
VEMAETSYILHHATPQSLVLMDEIGRGTSTYDGLALADAVARHLAHTNRC
YTLFATHYFELTALADASHAGGGSGIANVHLDAVEHGERLVFMHAVKDGP
ANRSFGLQVAALAGLPKAAVQQARRRLAELEQRGGDSHAAEMAPAALDAP
QQFGLFTAPSSAAQEALQALDPDELTPKQALEALYRLKALL
>XC_1722 conserved hypothetical protein
MSRAARPASGAAEGQVRIVGGRWRNTRLSVPQLPGLRPSSDRVRETLFNW
LLPRLAGARVLDLFAGSGALGLEAVSRGAAHALLIERDPGLAQRLREHVA
RLRATEQVQVLQDDALRWLERAPTSQVDLVFVDPPFAAGLWAPVLERLSP
HLAAEAWLYLETPAELPPQVPPGWHLHREGATREVRYALYRRAAATLNGD
PTPVASV
>XC_1136 Holliday junction binding protein, DNA helicase
MTEQRTIASSATREDEAADASIRPKRLADYLGQQPVRDQMEIYIQAAKAR
GEAMDHVLIFGPPGLGKTTLSHVIANELGVNLRVTSGPVIEKAGDLAALL
TNLQPHDVLFIDEIHRLSPVVEEVLYPAMEDFQIDIMIGDGPAARSIKID
LPPFTLIGATTRAGLLTAPLRDRFGIVQRLEFYSPQELTRIVIRSAAILG
IDCTPDGAAEIARRARGTPRIANRLLRRVRDFAQVKAAGHIDLAVAQAAM
QMLKVDPEGFDELDRRMLRTIVDHFDGGPVGVESLAASLSEERGTLEDVI
EPYLIQQGFLIRTARGRMVTPKAYLHLGLKPPRDRAPAIGEPGDLF
>XC_0133 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_3735 DNA polymerase related protein
MLDLPAVVAGPRVSAEFLQLASAVLCHRDVQRHAVLYRLLWRIASGERAL
LERATDVDVHRVMQWQKAVQRDSHKMKAFVRFRRLPGEEEEFVAWFEPEH
WILDRVAPFFARRFAGMRWAILTPYRSVRWDGEALTFGEGAARNQVPADD
AQETLWRTYYAHIFNPARLNPTMMRQEMPQKYWKNLPEATLLPELIREAG
VRVREMAERAPEPVRRRVPAAPAALPAVAAQSLAQLRVAARDCRRCDLWQ
PATQTVFGEGPDDAAVMVIGEQPGDEEDLSGRPFVGPAGRLFNQALGELG
IDRQRFYVTNAVKHFRFEQRGKRRLHRNPERSHVQACNGWLQAERAQLRP
AQIVCLGATAAQAVLGPGFRLMQERGQWQRLDDGTPVLATVHPSWVLRQG
TPSARDAGYRGFVADLGQLLQAPPA
>XC_0003 DNA replication and repair RecF protein
MHVARLSIHRLRRFEAVEFHPASTLNLLTGDNGAGKTSVLEALHVMAYGR
SFRGRVRDGLIRQGGQDLEIFVEWRERAGDSTERTRRAGLRHSGQEWTGR
LDGEDVAQLGSLCAALAVVTFEPGSHVLISGGGEPRRRFLDWGLFHVEPD
FLALWRRYARALKQRNALLKQGAQPQMLDAWDHELAESGETLTSRRLQYL
ERLQERLVPVATAIAPSLGLSALTFAPGWRRHEVSLADALLLARERDRQN
GYTSQGPHRADWAPLFDALPGKDALSRGQAKLTALACLLAQAEDFAHERG
EWPIMALDDLGSELDRHHQARVIQRLASAPAQVLITATELPPGLADAGKT
LRRFHVEHGQLVPTTAAD
>XC_0681 ISxac3 transposase
MCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGH
RKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRSFLASHGLVC
SMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEVFDYIEMFYN
PNRRHGSTGDLSPVEFERRYAQRGS
>XC_0698 IS1404 transposase
MAQQVRWDERARCQAAQGPRVRERAAEEVAGRAAVRERPDQGCTAKKVVS
APARRTLVREWIGRGASERRALAVIGMSASALRYCPREDRNGELRERICA
LAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQLQVRRRKRKKVPI
GERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIVDDATHEVVAIEVE
RAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKAMVAWAHARNVQLR
LIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHARTEIERWRREYNED
RPKKAIGGMTPAAYAQHLANTDIINPGL
>XC_2879 ribonuclease HII
MTRSSSDRAIVVPAAQNALFTDSPFPTPESRLIAGVDEAGRGPLAGPVAV
AAVVFDPAKPRINGLDDSKQLSAERREQLYARIVDRALAWSVVLIDSEEI
DRINIYQATMLGMRRAVEGVAHVAGFARIDGNRVPKGLPCPAEALIGGDA
LDRAIMAASIVAKVTRDRLMRELHAQHPQYRFDLHKGYSTPAHLAALQTH
GPCPQHRRSFAPVRRALGLETAQTAWDVPCAPADGLLLAE
>XC_3836 primosomal protein N
MSAPVTTLRVALPVPLPQLFDYLPLQDTDVDGPDRVGCRVRVPFGPRELI
GVVVERGQQPSAEGLRAALDWCDDTPLLIDELARSLQWLARYTHAPLGEA
QASALPGPLRRGEPLADTHAWAWQLTEAGHTGAGSLRAGSRPALLAALLL
AGPLAEEPLEQQLPQWREAARNLAKRGYAERVAVAADTLPARPGTGPQLN
DEQQAATDAIRAGSGFATYLLDGVTGSGKTEVYLQAIADCLAAGKQALVL
VPEIGLTPQTLGRFRERLGVPVHALHSGLSDGERARVWAAAWRGEAKLIV
GTRSAVFTPLPNAGLIVIDEEHDGSYKQQDGIRYHARDFALVRGKALDVP
VILGSATPSLESLHNAYSGRYRHLRLSRRAGDARPPRVRVLDVRKRPLKD
GLSPEVLAGIGATLARGEQVLVFKNRRGYAPVLLCHDCGWTAACQRCSTP
LHQTPMTVHAGGRRLQCHHCGARQPAPLACPACASLALQPQGIGTERLEE
RLVEAFPEAPVVRIDRSTTQRRDALETQLARLGTDAGILVGTQILAKGHD
LPRLTMVVVVGIDEGLFSADFRAAEKLAQQLIQVAGRAGRADRPGEVWLQ
THHPEHPLLQTLVNGGYHAFADAELQQREAAGFPPFAHLALFRAEAKDVA
AANQFLIAVRALVGTPATSARCCGALKRRVWRIPRSRWAARSAETTHLRT
AH
>XC_2124 phage-related protein
MSGLPSSNRGVSEFRNSDGTLTVIIDWFSASVDLFAVLRQVGYLDRDDAE
EVRQWSDACAENAMVIAINLFAFFFAGLGMELDKQAGPGSFYTWRVRVLD
REGKHVGIIEFGGEECRRKDGTYTARIELTGAGCAMVSAARCGHAKRWLE
LRAKLESCAGRLTRVDTAADDLLGKYPLKLAQTWYTDGEFDNRGQRPKAQ
LIDDYDSGDGKTLYVGTKKSEKQLRVYEKGREQGDKESPWVRYEAQFKAS
NRKDLSLDILRDPAGYLLGAYPVLNFLQCVALRMDITKAAVDATWKSARR
HIKRQYGATLNFIVRHCSTPDALHAVISTCTSHRLPAWATADVANQWPEI
AGVNQTLQGVTP
>XC_3580 integrase/recombinase
MSASSPAERRQRAQQLPPLRAEDDQAIQRFLDRLWAEQGVARQTLDSYRR
DLEGLARWRDGAGGGLQGADRSALFDYLRWRTEARYAPRSNARLLSTLRG
FYALCLRDGVRSDDPTALLDPPRLPRSLPKALTESQIDALLAAPEIGTPL
GLRDRAMLELMYAAGLRVSELVTLPAVAINLRQGVLRVTGKGSKERLVPL
GEESQHWLERYLETARPTLSERKAVPAVDGQVPLFIDAARRPLSRQQFWG
LVKRYAAVAGIDPDTVSPHGLRHSFATHLLNHGADLRALQMLLGHSSLST
TQIYTLVARQHLQTLHARHHPRG
>XC_1266 conserved hypothetical protein
MRILLQHDPGGNEPLRYVQLTLQPDLFGGWELLRESGQIGGRTQLRRDQY
LLQDEADRAFEKARDTQLKRGFHVITGGADAPR
>XC_2626 IS1404 transposase
MSASALRYCPREDRNGELRERICALAHRHRRYGVGMIYLKLRQEGRIVNY
KRVERLYREQQLQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFVFDRTA
EGRVIKCLVIVDDATHEAVAIEVERAISGHGVTRVLDRLAHSRGLPKVIR
TDNGKEFCGKAMVAWAHARNVQLRLIQPGKPNQNAYVESFNGRLRDECLN
EHWFPTLLHARTEIERWRREYNEDRPKKAIGGMTPAAYAQHLANTDIINP
GL
>XC_2395 IS1477 transposase
MKKSRFSTEQIIGFIKQADAGMAVAELCRRHGFSPASFYQWRAKYGGMEA
DEAKRLKELEVQNTRLKKLLAEAHLDIEALKVGFGVKR
>XC_3222 DNA polymerase III delta' subunit
MTPTFSPWQQRAYDQTLAALDAGRLGHGLLICGPDGLGKRAVALALAEHV
LASAPDPAVAQRTRQLIAAGTHPDLQLVSFIANRTGDKLRTEIVIEQVRE
ISQKLSLTPQYGIAQVVIVDPADAINRAACNALLKTLEEPSPGRYLWLIS
AQPARLPATIRSRCQRLEFKLPPAHEALAWLLTQGVSERAAQEALEAARG
HPGLAAQWLREDGLAVRRAVAQDLEQIASGRVGAVDVAQRWTNDGQADQR
LRHAADLALAQASAGLTDPSRLHKLATWFDAANRTRDLLRTTVRADLAVA
ELLLAWREGERQARSRGTR
>XC_2623 IS1477 transposase
MLEHTPLSERRACRLAGLSRDAFRHAPVPTPATQALSARLVELAQTHRRF
GYRRLHDLLRPEFPSVNHKKIYRLYEEAELKVRKRRKAKRPVGERQKLLA
SSMPNDTWSMDFVFDALANARRIKCLTVVDDFTRESVDIAVDHGISGAYV
VRLLDQAACFRGYPRAVRTDNGPEFTSRAFIAWTQQHGIEHILIEPGAPT
QNAYIESFNGKFRDECLNEHWFTSLAQARDVIADWRRHYNQIRPHSSCGR
IPPAQFAANYRTQQANNAVPFNPGLYQ
>XC_3920 ISxac3 transposase
MCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGHRKI
TTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRTYPTKDAARAEVFDYIEMFYNPNR
RHGSTGDLSPVEFERRYAQRGS
>XC_2785 helicase
MKLRFWKGESPETIPMDAPEFSAESSGWKGTAQEKSTHPGAIYLLQLADE
RFALKDGAGYLIPWEKLYSLLSDSDHVSSLHLLKLPQTSRLRPAIRSHGT
PTDTNFRVTLEAWIAESGEELLAERLGAVLTMPGQDLLLPQSAFTLLEAM
AELAQCGEDWDADRRMLQMGKVQEAARWAGATMDRYLEHSPVVVANKLEI
NLKQHEAAEAQIVEVEPRPIGAPEGWLSQFDRYQSVRGRYDVTDAEGGMS
HVVLSSQVREVGQQIKSMPGRRLAGKQADAFMHNPYAVLGEAAAQVISPE
RFAEAKRTAGIDEWELELQPADVDGSWDAVLVDSVGKSESVFVGSFHASE
FDSLLEEASGSEGLGVARWKKHRVLLSGLTLESLARLRQAHFKEAVGSVI
GVETLFDLAHYSDRVVGFDGKPIIVPKVQGSSPAEDWIKGAGEFVAVDPA
TGTATEGRLSTNDVQEFGERVDLAEREGHINVAVPGIEKEIPTPEARAWM
KELTKPAGEKRIESLKNPAPAEKLSLRILHNIEELEYPSEETEQARVKDV
YEAPAALRPEVALKDHQVEGVAWMQGRMRQRGDGVRGVLLADDMGLGKTL
QALTLMAWYRQTAPAPKPCLIVAPVSLLENWKAEIAKFLDGRQGATLTLY
GEHLALHRLSAREIGPELRELGVKKALNPGFAHGAAFVLTTYETLRDYQL
SMAREKWGVLVCDEAQKIKTPSAMVTRSAKAMQADFKIACTGTPVENSLA
DLWCLFDFFQPGLLDSLTKFTKVFRQQIELRSEGHEQKVEVLREQISPWV
LRRMKAEVADLKPKTELPCELAMSAKQRGLYAAAARRFRETVDSGEGGDT
AALSLLHQLRQICANPLAAADDRSEFLSLDEHLRHSPKLAWLIEKLQEIQ
ATGEKVIVFTEFRDIQRLIQRAIASRLSYQASIVNGSTSVEAGADDSRQI
IIDAFQAKPDFGVIVLSTTAVGFGVNIQAANHVIHFTRPWNPAKEDQATD
RAYRIGQEKEVFVYYPTVLGDGFESFEERVAKRLATKRKLSNDMLAPEQG
LTLEEFSDLSLGFSE
>XC_0412 similar to IS1404 transposase ORFA
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_2595 conserved hypothetical protein
MQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCERKLNTSLQFIH
LAFLALLLRRS
>XC_1656 integration host factor alpha chain
MALTKAEMAERLFDEVGLNKREAKEFVDAFFDVLRDALEQGRQVKLSGFG
NFDLRRKNQRPGRNPKTGEEIPISARTVVTFRPGQKLKERVEAYAGSGQ
>XC_0909 IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQTLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>XC_4280 IS1481 transposase
MIYAKARQGRKQCNRRSPAKVSRASSSLKAAAREREPWLIVASPQLQAPS
AKQLVNVYARRMQIELAFRDLKSHRYGQALEDSLTRRGERLQILLLINTL
AAFASWLAGLGCEATGIAQWLSPRNSTRKLYSTLRIGREALVRQWPMEPV
SRWIGRLRALPAAVREQMTLTV
>XC_2603 IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRWRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVMRVLEQIACFRGYPRAVRTDNGPEFTS
RAFIAWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>XC_3586 DNA polymerase III holoenzyme chi subunit
MPRADFYLIAKPRFLDEPLRLVCELARKANDANLSTLILARDAAQAEALD
DLLWAFDDEAYVPHQIAGTDEEDELAPVLIATPEFAAPSRPLVINLRDDP
YLGACDRVLEVVPADPAAREPLRERWKQYKALGLELTKYDM
>XC_3944 DNA repair protein
MHINDWPTDERPREKLLARGAAVLSDAELLAIFVGSGLRGQDAVRTARDL
LHRHGPLRCLLDRPAKALARLPGLGPASACKLSAALELANRHLLSDLERG
EALSDPSSVGRYFSQRLRARNYEVFAALFLDSRHRAIAFEELFTGTIDAA
EIHPREVVRRALLHNAAAVVVGHNHPSGNPEPSEADRAVTQRLLQALGLV
DIRLLDHFVIGDGRPVSLAERGWVP
>XC_0438 ATP-dependent RNA helicase
MSLKADLLSLQLQPVFESALARAGVRALTPIQVAMIPPMLAARDLIATAQ
TGSGKTLAYALPLLQQRLQAPEQAPRVLGGLILVPTRELVAQVAHTLLSL
AAALPRRLKIVAATGGEAINPQLMALRGGADIVIATPGRLLDLVTHNALR
LSQVSTLVLDEADRLLDLGFGAELDRILALLPAQRQSVLVSATFPAAIAS
LAKRRLRDPLRITLGGTPEQAPAIAQRAIAVDAGQRTQLLRHLLLEHGWP
QLLVFVTSRHGADKVAEKLSKTGIAALPLHGELSQGRRERTLRAFKQADV
QVLVATDLAGRGIDIDALPAVLNYDLPRSTVDYTHRIGRTARAGASGVAI
SFVTADSAQQWRLIEKRQGLRVPTSVIEGFEPTPVQAPAPDHASGAAARA
ADDNGGIKGKRPSKKDKLRAAAQAQAGKPG
>XC_0015 conserved hypothetical protein
MASAIKGRGATGHLPGRFEVTTPQAVDDGWHVDDSDEFAAPALRTQVTDE
TARSIISRNQSPDIGFSQSVNPYRGCEHGCSYCFARPSHAYLNLSPGLDF
ETRLFAKANAPELLRRELARPSYVPSPIALGINTDAYQPIERKRGLTRQL
IEVLWEARHPFTLITKSALVTRDLDLLAPLARARLVNVHFSVTTLDPHLS
ARLEPRASAPHARLRAMRSLHEAGVPVGVMAAPVIPWINDHELEAILQAA
ADAGASSAGYVLLRLPHEVAPLFREWLQTHHPQRAEHVMSTIAQLRGGKD
YDSTFGTRMRGQGVYADLLARRFALAHRRAGFDTRRTPPLDTEQFRRPAP
PPKPVKDSPQGQLF
>XC_1366 conserved hypothetical protein
MPEAGAIRPDATVLGFDVGSRRIGVAVGTALGAGARAVAVINVHANGPDW
VALDRVHKEWRPAGLVVGDPLTLDDKDQPARKRAHAFARELRERYALPVV
LIDERSSSVEAAQRFARERADGRKRRRDADTLDAMAAAVIVERWLSAPEQ
ATLLP
>XC_3844 uracil-DNA glycosylase
MTEGEGRIQLEPSWKARVGEWLLQPQMQELSAFLRQRKAANARVFPPGPQ
IFAAFDATPFEQVKVVVLGQDPYHGEGQAHGLCFSVLPGVPVPPSLLNIY
KEIQDDLGIPRPDHGYLMPWARQGVLLLNAVLTVEQGRAGAHQNKGWEGF
TDHVVETLNREREGLVFLLWGSYAQSKGKVIDQARHRVFKAPHPSPLSAH
RGFLGCKHFSKTNEHLQRRGLSPIDWSLPSRAALDLSLAGG
>XC_4288 exodeoxyribonuclease V beta chain
MSNSPVTDPYLHLPLHGVRLIEASAGTGKTFTLATLFTRLVVERQLRIGQ
ILAVTFTEAATQELRRRIRERLALAATLVPDARAGAAATEAQTTLLPDAS
SAGVGAALAATEPTQTQASDAPSSAINMVLPATAPSDHLSHPPAQTPPQP
HAPAAPDAVLTRAILTAHLATGTETPSALRRRLQQAVEEIDLAAIFTIHG
FCARVLREHALESGQAFAAPQLLANDRELLGEVAADLWRQRAADAAMADD
LVALWPAGPTALASDLRALVQQPELLPAVAAPTPDPQPARQAAAQAVVAA
LRAHGDTAYDAVAAAFEHKIFDGRRARRPSFDKAFEQLWQGSAEAHWVLD
DGGHLDKLLPQRLREFCKDGAHDRVPCSPLFDALAVWQQADAVVRQWEGQ
RRIRLLHALRDDAVLQLAQRKRQRRVQTYDDLVDGVARALQGPQAEALVQ
RLRAQYAIALVDEFQDTDDRQWQIFSRVFGPEHAASGEAFAPDDDADFDN
AAGTPPPRLLALIGDPKQAIYGFRGGDVQTYLAAATTAQRAPPLEHNFRS
RPGVLAAIDALYAQAGYAEAFLTEGIAFHPVQPGTKRSDADLQRDDAAAP
ALTLWRAPAPPPPAKGKPKPWSAGRARELCTAACVAAIRGWLAGGRDGSA
SINGRPVQAGDIAVLVRSHGEATRIQQALGAVGIPAVAAGKQSLFATDEA
LELLALLQALLDPGDDSRLRAALATVLIGEDAAAIAALEHDGERHRRWQQ
QALDWRERWQRGGPLALVGDLGATHGQRLLALVDGERRLTNYLQLAELLQ
EADTRALGPHGLVDWLARRIANADDNDETQQLRLESDARRVQIVTLHKSK
GLEYPLVFLPYIGIGRADKSPGRHCVVHAPPHGRQLHWNTSKWSADDTAS
WSTAETAWKHEQRAEDARLLYVGLTRAEHALWIATGAFHQHERTALAPML
RDPAALQASAGAGVIALDDTAPPATLPRLPADDAVQVPAARLAQRHVVPD
WWVYSFTQLANADAGSDPMASATLASSGGSDEPPASEPVSAPAEVEAFDP
RFAGNRFGVAMHDVFERCDFAAWRNWCPGQPAPEGQAAAILEALQRGGYA
QDELDDGLAMLTRLVGHTLTVTLPEGTCLAAVPEPQRRNEMEFHFAMRPT
RVDALLALLHRFGVVTERQAFGARQRLEGLMTGLIDLTYCADGRWYVLDY
KSNRLPAYDPDALARAMAHSEYELQALIYTIALHRWLRFRLGASYDYARD
FGGVRYLFCRGLDAARNPAADSSSILGSGSGSGSDSDSDSVNGASSVSGS
DPASDAMSDTPAPNTPVPGIYAWRFDPALVQALDALFAGNPTEPLSSDAL
KPLSPRERGWGEGTSTTGTPTP
>XC_1007 aminopeptidase
MLKPLGIAYEPSKGGPGPDVGPISAKGGAWAWLAQDGTDYFDLHHTADDT
LDKIDPKALAQNVAAYTVFAYLAAEADGDFGSRAKSVQPPNE
>XC_2021 conserved hypothetical protein
MSNSTELIADRLPRVTVEDVRRFAAIVDIRDAGAFAAELQAFVHERVEAV
ELPARLEMETMEQTLARKAAALRAETRWAPNETEVQRGRTALLKTFNQPH
NLPIPEFAKLADKSRQQIYKDILARRLLALNVGPRGQKLPDWQLDPVKQQ
LTQAVLQEVEGIDNWTIYGALSEPLEGLGGRSPVDAVTPDSIDKVSEAVF
NVLGVQVH
>XC_0413 similar to IS1404 transposase ORFB
MREWIGRGASERRALAVIGMSASALRYCPREDRNGELRERICALAHRHRR
YGVGMIYLKLRQEGRIVNYKRVERLYREQQLQVRRRKRKKVPIGERQPLL
RPSQANQVWSMDFVFDRTAEGRVIKCLVIVDDATHEVVAIEVERAISGHG
VTRVLDRLAHSRGLPKVIRTDNGKEFCGKAMVAWAHARNVQLLLIQPGKP
NQNAYVESFNGRLRDECLNEHWFPTLLHARTEIERWRREYHEDRPKKAIG
GMTPAAYAQHLANTDIINPGL
>XC_3524 conserved hypothetical protein
MPAARQQRGAAVEAAARAQLEQAGLRLVAGNANYRGGELDLVMRDGPMLV
FVEVRYRRDARFGGGAASVDFRKRRKLVLAAQLFLAAHPALAALPCRFDV
VEASGEPPLLHWIRDAFRLDDC
>XC_1641 IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>XC_1640 conserved hypothetical protein
MQGRPMQNGFIERFNGSYPRGVLNMHIFSTLSEIRKQTEHLLADYNQ
>XC_3392 MutT-like protein
MRQPALWPAHAPAPAGRQWRRRVAATSRFPVPRSLSMNRHDTPPTVVYEG
KYQRMVVRGTWEYSERVHAGGLAAIIVAVTPDDAMLFVEQFRVPLQARTI
EMPAGLVGDIHADESIELSAIRELEEETGWTADHAEVLMIGPTSAGASSE
KIAFVRATGLRKVGAGGGDASEDITVHEIPRAQVGAWLVQKMAEGYQMDP
KLWAGLYLVDHALDGTPRG
>XC_1034 ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYAELAMENHALKDVIAKKL
>XC_2010 IS1477 transposase
MKKSRFSTEQIIGFIKQADAGMAVAELCRRHGFSPASFYQWRAKYKYGGM
EADEAKRLKELEVQNTRLKKLLAEAHLDIEALKVGFGVKR
>XC_4135 exodeoxyribonuclease III
MKIASWNVNSLNVRLPHLQQWLADFAPDVVGIQETKLEDHKFPDAALAAL
GYRSVFCGQKTYNGVAILSRSPALEVQMGIPGFDDVQQRVIAATVDGVRI
INLYVVNGQDVGTDKYAYKLRWLEAVHDWIAQELQRHPQLVVLGDFNIAP
DARDVYEPEVWSDNHILTSTAERGALHKLLALGLHDAFRLHHDDAGHFSW
WDYRQAAYRRNLGLRIDLTLVSDALRARAVEAGIDREPRTWERPSDHAPA
WVRLAEAGA
>XC_2842 topoisomerase IV subunit A
MTDLTRPTFHGFEQLPLREYAERAYLDYSMYVVLDRALPFLGDGLKPVQR
RIIFAMSELGLNAAAKPKKSARTVGDVIGKYHPHGDSACYEALVLMAQPF
SYRYPLIEGQGNFGSTDDPKSFAAMRYTESKLTPIAEVLLGEISQGTTDW
AANFDGTLEEPTWLPARLPHLLLNGTTGIAVGMATDVPPHNLNEIVSALL
HLLDDPDATVAQLCEHVLGPDYPTNAEIITPVADLRAIYETGHGSVRARA
TYKKEHANIVIDALPYQVSPSKVIEQIAQQMRAKKLPWLEDIRDESDHTS
PVRVVLVPRSNRVDAEQLMGHLFVTTDLERSYRVNLNVIGLDGRPQVKNL
KHLLSEWLTFRSDTVTRRLNHRLQKVERRLHLLEGLLIAFLNLDEVIRIV
RSEDEPKPVLIARFALSEEQAEYILETKLRQLARLEEMKIRGEQEALAKE
REQILSILGSKTKLKKLIKDELTADAKKFGDARRSPLVQRGAAQAIDETE
MVASEPMTVVLSEKGWVRAAKGHEVDPAGMSYRDGDGLLAAVRSRSTYHV
AFLDSDGRAYSTLVHTLPSARGNGEPLTGRFSPASGASFQVLASGENNAR
FVLASSHGYGFVTRFENLTGRNKAGKAMLNLTTGAHVLTPAQVLNPQTDR
IVAVTSAGNLLAITASDLPELDKGKGNKLIEIPKAKLGTERVVAVAAVAP
GNTLLVRSGARVMSLSFKDLDTYVGARASRGALLPRGWQKVDGLEVQ
>XC_2783 IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPQAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRRRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTS
RAFITWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>XC_2622 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_2078 YeeB-like protein
MEGNPTNKYTVPSVSIATAQTGASSKINALGMRPMQEKAYAKRGEQHLLI
KSPPASGKSRALMFIALDKIKNQGLKQAIIVVPERSIGGSFADEKLSEQG
FWADWMVRPQWNLCNAPGEDNGKVAPSKVKAVGAFLASEDPVLVCTHATF
RFAVDEFGIEAFDGRLIAIDEFHHVSSNPDNKLGTQLGQLIGRGQVHVVA
MTGSYFRGDAVAVLSPEDEGKFESVSYTYYEQLNGYTYLKSLDIGYFFYT
GRYLDAIMKVLDPSLKTIIHIPNVNARESLKDKHKEVDEILASLGDWKGR
DEATGFHLIEIEGGRIIRVADLVDDSDAARRSKILTALKDPAQKDNRDNV
DIIIALGMAKEGFDWIWCEHALTIGYRSSLTEIVQIIGRATRDAPGKGRS
RFTNLIAEPAADSEIVVDAVNDTLKAIAASLLMEQVLAPRFEFTPKNAGE
KEGFDYGDEGYQEGNANVGVNAATGEVHVEINGLAQPKSPEAARICKEDI
NEVITGFIQDKPTLERGLFDKENTLPEEITQVQMAKIVRDRYPDLSEEDH
EAVRQHAIAVLNITQQAKQVIAQVDAKGESPNMSLIEGVRKFVNVKDLDI
DLIDRINPFDAAYAVLAKAMDERVLRQVQSAIAAKRLAISYEEAKDLAKR
AVAFKNERGRLPEIGSADPWERRMAEGVASLQRHIAQQKAAAAQGGGNG
>XC_3821 DNA processing chain A
MDLTEPDRRALLTLLLAGGRSPPRRALLDAFDAPSQILAAGPAAWRAAGC
DALQIAKLQTPDTPILDAALRWCAQPGHHLIGWRDADYPALLRHIANPPL
MLFVDGDPAALWHPCVAVVGSRAASAGGRDHTRHFAASLANAGLGIVSGM
AAGVDAIAHEAALAHADGITVAVVGTGPDVAYPVQHHSLRDRIAARSAVV
SEYLPGTCAVAAHFPARNRIIAGLALGTLVVEAAMRSGALITARLAAEAG
REVFAVPGSLHNPLARGCHHLIRQGATLVQEPAQLVEGLRLLSGELADAL
RQRLTAPTEQARTVPQPTPRRSDPDYQRLWHALGHDPTPMDSLLERTGLT
AAALSSMLLIMELEGDVVTEHGRYTRNP
>XC_0065 conserved hypothetical protein
MTPQNVAAQEAYYFNKQPRGTPGLPTAQTTGIGFHGDSDYPNYYGAGAVS
RAIYIERTHAHPVGGIAPQMHLDMQRLRFKEPLLEHNGIDLSQTGAANPQ
PYWDTSTNPPTRGLFQHTQGTHQHVSPALDLAAPNDERSGRSQHPSIDSA
LLEKVRQGVSELDRQARKPWDDNSERLSASLMLMAAEKGFTAKDDLKFAF
NTPTPNLGSGEVLHMWRASNASPDPAANRAHMPTQEALSVPAEQRLTQVE
ALQQAKAEEIQRSQQQDVVQQQLGQARSL
>XC_2394 IS1477 transposase
MPSAWFQPGQLLSVAGQVRGDGGRRGQAAERTGGPEHSIEEVAGRGAPGH
RSAEGWLRGKTLAPQRKREAIRRMLEHTPLSERRACRLAGLSRDAFRHAP
VPTPATQRLSARLVELAQTHRRFGYRRLHDLLRPEFPSVNHKKIYRLYEE
AELKVRKRRKAKRPVGERQKLLASSMPNDTWSMDFVFDALANARRIKCLT
VVDDFTRESVDIAVDHGISGAYVVRLLDQAACFRGYPRAVRTDNGPEFTS
RAFIAWTQQHGIEHILIEPGAPTQNAYIESFNGKFRDECLNEHWFTSLAQ
ARDVIADWRRHYNQIRPHSSCGRIPPAQFAANYRTQQANNAVPFNPGLYQ
>XC_1378 single-stranded DNA binding protein
MARGINKVILVGNLGNDPDTKYTQAGMAITRVSLATTSMRKDREGNNQER
TEWHRVVFFGKLGEIAGEYLRKGSQVYVEGELRYDKYTGQDGVEKYSTDI
VANEMQMLGGRGEGGGGGGMGGDRPQRTQAPRQQQGGGGGGGGQDYAPRR
QQPAQQQSAPPMDDFADDDIPF
>XC_2601 invertase/recombinase protein
MVLIGYARVSTAEQDTALQTDALRKAGCERVFEDTASGAKADRPGLADAL
AYLRAGDVLAVWRLDRLGRSMQHLIETIAALEARGVGFRSLTESIDTTTP
GGRLIFHVFGALGQFERDLIRERTKAGLTAAAARGRKGGRKPVVTADKLQ
RAREHIANGLNVREAAKRLKVSKTALYAALQSTSAANF
>XC_4239 formamidopyrimidine DNA glycosylase
MPELPEVETTLRGLAPHLVGQRIHGVILRRPDLRWPIAAQIEQLLPGATI
TDVRRRAKYLLIDTDAGGSAVLHLGMSGSLRVLPGDTPPRAHDHVDISLQ
NGRVLRFNDPRRFGCLLWQRDCETHELLASLGPEPLSAAFTGDYLHALAC
GRRAAVKTFLMDQAVVVGVGNIYAAESLHRAGISPLREAGKVSRERYRRL
ADAVKEILAYAIQRGGTTLRDFISPDGAPGYFEQELMVYGREGEACRHCG
GELKHATIGQRATVWCAACQR
>XC_2997 conserved hypothetical protein
MSALAHRAPVHAWIGGGASERRTLAAIGIGTSALSYCLRDDNNFELRWRL
GALAHRRRRYGVGMIDPKLRRQKRIVKYKRGAWLYPIGLPPQWKRENRRA
EKIVLECSESPFRCCKHRPWRPAGRASGTGGMVRTRSCAKAA
>XC_0492 (di)nucleoside polyphosphate hydrolase
MIDPDGFRPNVGIVLMREDGQVFWARRVRRDGWQFPQGGMNTDETPVEAM
YRELREETGLLPEHVELLGATPGWLRYRLPSRAVRRNERQVCIGQKQVWF
LLQFTGQESHLKLDHTDSPEFDHWRWVDFWYPVEHVVMFKRGVYARALRH
LAPLAQTVAGPAAVGVMPQRALEAWLPGSSAAGHDRPRKRPRKRGGVLPV
RINND
>XC_4287 exodeoxyribonuclease V alpha chain
MNHPNLLTALNQAGALRTLDLAFAQSLQRLAPDTDPQVLAGAALASLAVT
SGHAGLDPTRAAMLLDAREGPSPALPDPTDWQRTLAASRWVDQPNPQEPA
AADCPLVLEHGLLYLRRYREYERRLALGLQRIAAHSPPPFAAATLAPLFE
QLFPQASPLPQGEGARRAGEGTGLPEPSIYQDGTNPPEPSHHQDHQAQAA
ALALRRTLLLVTGGPGTGKTTTIARLLLLRIAQAHASNTPAPRIALAAPT
GRAAERMAESLRAAVARAIANGIDPALADALPTGASTLHRLLGVIPDSPQ
FRHTADNPLPFDLIVVDEASMVDLPLMCKLVEAVADGTQLILLGDADQLP
SVEAGDVLAAILQAAGPGDTLQPQDADALQPLLGSAPPGSTPASIQTGGH
TISHTHTGGLAGHRVHLLRGYRQADNFALTPLADAIRTSDADTALALLRS
GELAGVHFHEDGEDPLALGRDALLAHWRALADAHDPAAALRDAARLRLLT
AVRAGPQGARGLNARIEQLLAESGSGARRLGSASPWFQGRLLLITENSYR
HGLFNGDVGICLRSEASPFSERSDTNAPSERSDASTVTARSDAAASSGPS
HSIGTANRADPVAERRAQGPLVAWFEGDGDSQVRGFHPAALPAHESAFAM
TVHKAQGSEFDTVWLQLPTRDARVLSRELLYTGITRARRALHLAGSEAAL
RAALARHAARISGLAWRLGGEQMQPAPVEQTAEPPTVTPVQGSLF
>XC_0001 chromosomal replication initiator
MDAWPRCLERLEAEFPPEDVHTWLKPLQAEDRGDSIVLYAPNAFIVEQVR
ERYLPRIRELLAYFAGNGEVALAVGSRPRAPEPLPAPQAVASAPAAAPIV
PFAGNLDSHYTFANFVEGRSNQLGLAAAIQAAQKPGDRAHNPLLLYGSTG
LGKTHLMFAAGNALRQANPAAKVMYLRSEQFFSAMIRALHDKAMDQFKRQ
FHQIDALLIDDIQFFAGMDRTQEEFFHTFNALFDGRQHIILTCDRYPREV
EGLEPRLKSRLAWGLSVAIDPPDFETRAAIVLAKARERGAEIPDDVAFLI
AKKMRSNVRDLEGALNTLVARANFTGRSITVEFAQETLRDLLRAQQQAIG
IPNIQKTVADYYGLQMKDLLSKRRTRSLARPRQVAMALAKELTEHSLPEI
GDAFAGRDHTTVLHACRQIRTLMEADGKLREDWEKLIRKLSE
>XC_3915 IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_2509 RecA protein
MDENKKRALSAALSQIEKQFGKGSVMRMGDRVIEAVEVIPTGSLMLDIAL
GIGGLPKGRVVEIYGPESSGKTTLTLQAIAECQKLGGTAAFIDAEHALDP
IYAAKLGVNVDDLLLSQPDTGEQALEIADMLVRSSSVDIVVIDSVAALTP
KAEIEGEMGDQLPGLQARLMSQALRKLTGNIKRSNTLVVFINQLRMKIGV
MMPGQSPEVTTGGNALKFYASVRLDIRRIGAIKKGDEIIGNQTKIKVVKN
KLAPPFKQVITEILYGEGISREGELIDMGVEAKLVDKAGAWYSYGDERIG
QGKDNARGYLRDNPQVAIKLEAELREKFQPAEAPREAGETESE
>XC_1226 ADP compounds hydrolase
MLRMSTRLPTIHKITDLGEGPFRRQQLDLEFSNGERRLYERQLSQGHGAV
VVVPMLDAQTVLLVREYAAGVHRYELGLVKGRIDAGETPEQAADRELKEE
AGYGARQVQVLRAMTLAPTYMSHQSWLVLARDLYPERLPGDEPEELDVIP
WPLARLDELMLREDFSEGRSLAALFIAREWLERNP
>XC_1162 DNA helicase
MSSPAHELLSRVFGYDDFRGPQQAIVEHVAAGNDALVLMPTGGGKSLCYQ
VPALLRDGIGIVVSPLIALMQDQVEALRQLGVRAEFLNSTLDAENTQRVE
RALLSGDLDLLYVAPERLLTPRFLSLLERSRIALFAIDEAHCVSQWGHDF
RPEYRQLTVLHERWPHIPRMALTATADPPTQREIAERLDLVEARHFVSSF
DRPNIRYTVVQKDNARKQLQEFLGRHRGSAGIVYAMSRRKVEETAQQLCA
QGFNALPYHAGLPAEVRAENQRRFLREDGIIMAATIAFGMGIDKPDVRFV
AHVDLPKSMEGYYQETGRAGRDGEPAEAWLCYGLGDVVLLKQMIEQGEAA
EERKRLERAKLDHLLGYCESMQCRRQVLLAGFGETYPKPCGNCDNCLTPA
AAWDATVASQKALSCVYRSGQRFGVGHLIDILRGSENERIKQLGHDQLST
YGIGRDMDERTWRGVFRQLVAASLLEVDSEGHGGLRLTDASRQVLKGERQ
VMMRRENPAAGRERDRGAQRTGLPVQPQDLGLFNALRGLRAELAKEQNVP
AFVIFHDSTLRNIAEQRPTSIDALSRVGGIGGGKLARYGAQLIEIVREQG
>XC_2366 conserved hypothetical protein
MVDAPLRQQLTVYRARWPNESEVADQFEQLLDDATDPFVRERVEGHFTGS
AWVVGADGTRTLLTHHRKLQRWLQLGGHADGDRDLAQVALREAQEESGLT
GLTLADGLLFDLDRHWIPARGEVAGHWHYDARYVVVAGADETFQVSEESL
ALAWRPIAELLAEPELDPSLRRMAEKWMAHGGS
>XC_0688 IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDTGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSKLTHVQTLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>XC_3917 ISxcC1 transposase
MKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSDALWDGRRFRTFN
VIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPNKLRLDNGPEFVA
LALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRGVLDMHIFRTLSE
VREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTSSFIWH
>XC_0004 DNA gyrase subunit B
MTDEQTTPPTPNGTYDSSKITVLRGLEAVRKRPGMYIGDVHDGTGLHHMV
FEVVDNSVDEALAGHADDIVVKIHVDGSVAVSDNGRGVPVDIHKEEGVSA
AEVILTVLHAGGKFDDNSYKVSGGLHGVGVSVVNALSEHLWLDIWRDGFH
YQQEYALGEPQYPLKQLEASTKRGTTLRFKPAVEIFSDVEFHYDILARRL
RELSFLNSGVKIALIDERGEGRRDDFHYEGGIRSFVEHLAQLKTPLHPNV
ISVTGEHNGIVVDVALQWTDAYQETMYCFTNNIPQKDGGTHLAGFRGALT
RVLSNYIEQNGIAKQAKITLTGDDMREGMIAVLSVKVPDPSFSSQTKEKL
VSSDVRPAVENAFGARLQEFLQENPNEAKAITGKIVDAARAREAARKARD
LTRRKGALDIAGLPGKLADCQEKDPALSELFIVEGDSAGGSAKQGRNRKN
QAVLPLRGKILNVERARFDRMLASDQVGTLITALGTGIGRDEYNPDKLRY
HRIILMTDADVDGSHIRTLLLTFFYRQMPELIERGYIYIGLPPLYKLKQG
KSELYLKDDAALNAYLASSAVEGAALIPASDEPPITGEALEKLLLLFAGA
KEAIARNAHRYDPALLTALIDLPPLDVVQLQAEGDVHPTLDALQAVLNRG
TLGTARYHLRFDPATDSAAASLVSVRKHMGEEFTQVLPMGAFESGELRPL
REVALALHGLVREGAQILRGNKSHPITSFAQAQAWLLEEAKRGRQVQRFK
GLGEMNAEQLWETTVNPDTRRLLQVRIEDAVAADQIFSTLMGDVVEPRRD
FIEDNALKVSNLDI
>XC_0137 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_3603 DNA repair system specific for alkylated DNA
MRMNIRVALPQAQVHWCRGWLQAAHADALMQALLDQVQWEVHRIRMFGRV
VDSPRLSSWIGDADASYRYSGTQFAPQPWLEALQPVRTRLQDETGSPFNS
VLVNRYRSGADAMGWHSDDEPELGAQPVIASLSLGAARRFAFKHRHDAAL
KQTLELGHGDLLLMGGDTQRHYKHALPRTVKPVGERINLTFRQIAVRVSQ
R
>XC_3923 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_1032 ISxcc1 transposase
MVGVARSTARYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHL
WNHKRVWRVYCLMKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSD
ALWDGRRFRTFNIIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPN
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTS
SFIWH
>XC_2619 phage-related integrase
MQPCCGAVATAMSPASTPVVGQLKTEAKRLTQPAQTSTPFSTLPSPPFRN
SIMSITSTPTAEHLSSLLNQRLPGKVIFRTLRSTASTAFVMNAESHHQRP
QRLAESDLAKIFASPAYDQWAANEPLLFWAPLLGLYMGVRASETVALSID
DIIERAGLMCIVLKNRAAPESESATASKYRTRQRHSTGSIPVPKVLLDAG
FPDYVAAIKSKDRRELFRTSQRERTADTAAWLRSKFARYLRSHGIKKSGF
SALRQTFGERLMDADISWADKRELARASERHFPAFTCFFYPSCRMSSLKK
SLNKISCDGLTLPRFMGDQVLVRTHGAARYR
>XC_4231 ATP-dependent DNA helicase
MHGLNPPQSAAVLHCEGPLLVLAGAGSGKTRVIVEKIAHLIAIGRYPAKR
IAAITFTNKSAKEMRERVAKRIRGDGADGLTICTFHALGLKFLQIEHAAA
GLKRGFSIFDSDDAAAQIKDLMHGAKPDAIEDAKNLISRAKNAGMSPEQA
MAAARSNREKEAASLYERYQARLTTFNAVDFDDLIRLPVQILEANEEIVM
GWRERIGYLLVDECQDTNDAQYRLLKMLAGPRGNFTCVGDDDQSIYAWRG
ANPENLQQMGRDYPALEIIKLEQNYRCSNRVLRAANALIAHNPHEHLKTL
WSDQADGERIRVWECRDSEHEAEKVAAEISFLGTAKQVPWSDFCILFRGN
FQSRPLEKALQLLRVPYHLTGGTAFLERQEVKDVLSWLRLIVNPEDDAAF
LRAVQSPKREVGATSLARLAELASAKSVPMSRAAESMGALQHLPPRAANG
LSAFTDILRDMREHSATLPAGELVRTLADKSGLLNDLRNQSKDEAGFQRR
KRNLDELAEWFEGGPRGASASDLAAQLALLSRNDKDDGGNQVRMMTMHAS
KGLEFRYVFIVGCEDGVLPHEVSLEEGNLQEERRLLYVGITRAKQQLWMS
YSKLTRKFGEHVRLKPSRFFDEIPAAELQRDGADPVADAERKKERANAGL
AAIQALFD
>XC_0697 IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_2803 photolyase-like protein
MLRGLTCRCRVCMSYAIVWFRRDLRLEDNPALRAALDAGHDPIPLYIDAP
HEEGQWAPGAASRAWRHRSLAALDASLRARGSALLIRQGDSAQVLDAVIA
QTEAVAVYWNRKYEPATQPRDAQIKRSLRERGLEVQSCNAALLFEPWTLA
TQQGRPYKVFTPFWRNALTQLRLPDAMPAPRSLPPLPASLDGVHVDALNL
LPTPAWDQGFWEHWQPGEAGAHEMLEIFVDGALSGYRENRDRPDRVGTSQ
LSPHLHFGEIAPWRIASTLEAQRSARNGADIDGYIRQLGWRDFAYHLLHH
FPDTTTQNLNPRFAGFDWATVDPVTLDAWQRGRTGIPIVDAGLRQLWHTG
WMHNRVRMIVASLLCKHLRVHWLEGARWFWDTLVDADLANNTMGWQWVAG
TGADAAPYFRVFNPVTQAEKFDPQATYITRWIPELAALPVKERFAPWLHP
LSLARLAPTYPRAPIIGLAEGRDAALAAYAGTRG
>XC_2166 conserved hypothetical protein
MFLIQVLLPLADNNGVRFEQAMFDEVHHHLAMRFGGITAYTRAPVHGAWQ
EQGAQLVHDDLVIYEVMADDLDRGWWRSYRAELEQRFRQEQLIVRAQEIT
LL
>XC_3803 ISxac3 transposase
MVDSEVPMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_2714 ribonuclease T
MPMNEPVDLQPSPSLLPMSRRFRGYLPVVVDVETGGFDWNKHALLEIACV
PIEMDAQGHFFPGETASAHLVPAPGLEIDPKSLEITGIVLDHPFRFAKQE
KDALDHVFAPVRAAVKKYGCQRAILVGHNAHFDLNFLNAAVARVGHKRNP
FHPFSVFDTVTLAGVAYGQTVLARAAQAAGLDWNSADAHSAVYDTEQTAR
LFCKIANAWPGPASAG
>XC_2784 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_1025 IS1404 transposase
MAVIGMSASALRYCPREDRNGELRERICALAHRHRRYGVGMIYLKLRQEG
RIVNYKRVERLYREQQLQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFV
FDRTAEGRVIKCLVIVDDATHEAVAIEVERAISGHGVTRVLDRLAHSRGL
PKVIRTDNGKEFCGKAMVAWAHARNVQLRLIQPGKPNQNAYVESFNGRLR
DECLNEHWFPTLLHARTEIERWRREYNEDRPKKAIGGMTPAAYAQHLANT
DIINPGL
>XC_1211 ISxac3 transposase
MVDSEVSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_1026 IS1404 transposase
MDVKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMS
VPDAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_2604 IS1477 transposase
MKKSRFSTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>XC_3650 DNA polymerase IV
MRKIVHVDMDAFYASVEQRDDPSLRGKPVVVAWRGARSVVCAASYEARTF
GIRSAMPAVRAERLCPDAVFVPPDFARYKAVSRQVREIFHRHTDLVEPLS
LDEAYLDVTEAKTGMQLATEIAQLIRTQIREETQLTASAGIAPNKFLAKI
ASDWRKPDGQFVIAPSRVDAFLLPLPVNRIPGVGKVMDGKLAALGIVTVS
DLRLRPLEELQAHFGSFGQSLYRRARGIDERPVEPDQEVQSVSSEDTFSE
DLALDALDPHIQRLAEKTWHATRRTERIGRTVVLKLKTSNFRILTRSYTP
EQPPASLQGLVDIALGLTRRVELPPETRYRLVGVGLSGFSDPELQAAVQG
ELFGEVPQQ
>XC_0225 conserved hypothetical protein
MPMPLKEFQTVICDGIVAQFGEVRALYRQIAAAAPERIDEARRKDAAIVL
QAPTGAGKTRMAIEVMRRVSIEERVLWFWFAPFTGLVEQSRKVLSNQAPE
LALLDLDADRQLDAVRGGGVFVVTWASLAARKAESRRARQRGDAGMAIDD
VIAMAREQGLRIGCVVDEAHHGFQRETLARAFFCDVLKPDYALLMTATPR
DADMKAFERTTGYSVGEPAEWASVSRADAVDAGLLKRGVRMVRFIARDGD
TAQLVDFEHLALRECTQMHRTIRKNLADADIALTPLMLVQVPDGKVAQEA
ARTYLVEQLGFDATAVRVHTAAEPDPDLLSLAQDPTVEVLIFKMAVALGF
DAPRAFTLAALRGARDPSFGVQVIGRIMRRHALLQAQAVVPPVLDHGYVF
LANSESQEGLLLAGAQINTLTTQAPELGTQTVVTMIGDGASLQVVRSGEP
LSLLVSRAGVHVLDAEAERDAVSSAGTTSDVADALVGTPFAGMANATQAA
LEMFGGEGAWPARATSVAGAFVLAQESMYRYPRRPDAPDRLRGEQLPPVS
ADFEAGLAAHVDFSPEVLADRLRGKVQVQRLDTDLFAGHRVTEDGSDLWA
NLSPEAVAEKAEQIRLRLVEANDRELYRRLLERFVRAIEASGAEVPEDEE
LQMRQLDLLLVRRPRLLREAFKSLRQGQVLDVDVLLPAELFSDQPLRSAN
RGLYGVFPAGLNQDELAIAERLDASAQVRWWHRNQPKSGIGLYRWDEGDG
FYPDFVVSVAERSAPGIALLELKGEHLWGKPSEVDKSAAIHPDYGAVFTV
GRKRGERDFFYLRELGGRLQRAGSFDLDRMRFT
>XC_0680 ISxac3 transposase
MSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGK
SGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_1464 6-O-methylguanine-DNA methyltransferase
MSTLHYDTFPSPIGALTVAADTTGVRHILFAQNRHDAPGRALWQHGPDAP
LVQAAREQLLDYLYGGRRSFDLPLAPAGTPFQLQVWQTLARIPFGETWSY
AQLAQAVGRPAASRAVGAANGRNPLPIVLPCHRVIGASGALTGFGGGLPT
KQALLQLEGWSPQASARRVAAVVGEDLFAR
>XC_1833 ATP-dependent RNA helicase
MNEFSALPLSPALAPGIDALGYTTLTPIQAQSLPPILQGLDVIAQAPTGS
GKTAAFGLGLLQKLDPALTRAQALVLCPTRELADQVGKQLRKLATGIPNM
KLLVLTGGMPLGPQLASLEAHDPQVVVGTPGRIQELARKRALHLGGVRTL
VLDEADRMLDMGFEEPIREIASRCDKHRQSLLFSATFPDIIRTLARELLK
DPVEITVEGADNAPEIDQQFFEVDPTYRQKAVAGLLLRFNPESSVVFCNT
RKEVDEVAGSLQEFGFSALALHGDMEQRDRDEVLVRFVNRSCNVLVASDV
AARGLDVEDLAAVVNYELPTDTETYRHRIGRTARAGKHGLALSLVAPRES
ARAQALEAEHGQPLKWSRAPLATARPAQLPLAAMTTLRIDGGKTDKLRAG
DILGALTGEAGLSGAAIGKIAIYPTRSYVAIARAQVARALTHLQAGKIKG
RRFRVTKL
>XC_1812 exodeoxyribonuclease VII large subunit
MADRTEQILTPSQLNTLARDLLEGSFPLVWVEAELGNVTRPASGHLYFTL
KDARAQIRCAMFKPKSTWLKFQPREGLRVLARGRLTLYEARGDYQLVLDH
MEEAGEGALRRAFEELRARLAAEGVFDAERKQPLPAHVRRLAVITSPSGA
AVRDVLSVLARRFPLLEVDILPSLVQGDSAAAQITSLLQRADASGRYDVI
LITRGGGSLEDLWAFNDERLARAIAAAHTPVVSAVGHETDVSLSDFAADV
RAPTPSVAAELLVPDQRELVARVRRAQARLSQLQQHTLGQAMQHADRLAL
RLRARSPQARLQLLQRRQEDAARHLRARMQHILERLQARVQRAQAGVQSH
SPQRHLAPLQQRLRAAHPQAAMQRRLQQDHLHLRGLVRSLEAVSPLATVA
RGYAIVTRQADGSVVRSAAELTQGDRLRAQLADGSVTVVVDTSETG
>XC_2084 Tn5041 transposase
MPRHSMTCSNDRITRADGYYHVSDKYVALFSHFIPCGVHEGIYILDGLLA
NTSDIQPEIVHGDTQAQSYPVFGLAHMLGIQLMPRIRNIKDLTFFRPEPG
RAYKNIQALFGDNIDWQLIATHLHDMLRVVISIRLGKITASSILRRLGTY
SRKNKLYFAFRELGKAVRTLFLLRYIDDNKIRKTIHAATNKSEEYNGFVK
WVFFGSQGIIAENVQHEQRKIIKYSQLVANMIILHNVEGMSRTLAEMRKE
GVELTPEILAGLSPYRTSHINRFGDYHLDLEREVAPLSYTAKVLEQAP
>XC_3206 conserved hypothetical protein
MTDPTSPGHRALRRGRHSLAGHCYLLTTTTHQRQRLFDDPRLAASACGAF
TKAAPADATLLAWVLMPDHVHWLLQLGHHTPLARAVACLKAASRRAVNTQ
RAMQAPVWARAYHDHAVRHDADLRAVARYVIANPLRAGLVQRIGAYPFWD
AIWLG
>XC_1499 DNA polymerase III delta subunit
MELRPEQLAGQSSQPLQPVYLIAGPETLRVLEAADAVRARARAEGISERE
VFDADGREFDWNQLDASFNAPSLFSPRRLVEVRMPSGKPGKDGAEVITRF
CANPPPDVVLLITANDWSKAHQGKWADAVGRIGTIAVAWPIKPHELSDWI
ERRLRAQGLRADAAAVQRLSERVEGNLLAAAQEIDKLALLADGKVLDLEA
MESLVADAARYDVFRLAETTFSGQPAAVVRMLAGLRAEGEAVAALMPILI
KELLRTASLAKVQAGGGNLGAEMKAQGIWESRQAPFKRALQRHPEPRRWE
RFVAEAGLVDRMAKGRAEGDPWVALERLLVAVAEARAVRLLA
>XC_3034 IS1595 transposase
MPEFFASYGTEAKCYRALYKWRWPQGFRCPVCAGRVRSRFKRGAAIYYQC
SACRHQTSLMAGTMFEGTKLPLRTWMLALHLLTSTKTNMAALELMRHLGV
NYKSAWRMKHKIMQVMAERESTRKLAGFVQIDDAYLGGERNGGKAGRGSE
NKQSFLIAVQTDDTFTAPRFVVIEPVRSFDNPSLQDWIARRLAPGCEVYT
DGLACFRRLEDAGHAHTTLDTSGGRAATEATGARWVNVVLGNLKRAISGV
YHAIAQGKYAKRYLAEAAYRFNRRFRLREMLPRLATAMMQSTPCPEPVLR
AASNFHG
>XC_3918 ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYADLAMENHALKDVIAKKL
>XC_0002 DNA polymerase III beta chain
MRFTLQREAFLKPLAQVVNVVERRQTLPVLANLLVQVNNGQLSLTGTDLE
VEMISRTMVEDAQDGETTIPARKLFDILRALPDGSRVTVSQTGDKVTVQA
GRSRFTLATLPANDFPSVDEVEATERVAVPEAGLKELMERTAFAMAQQDV
RYYLNGLLFDLRDGLLRCVATDGHRLALCETELEKSGSAKRQIIVPRKGV
TELLRLLEAADRDVELELGRSHIRVKRGDVTFTSKLIDGRFPDYEAVIPI
GADREVKVDREALRASLQRAAILSNEKYRGVRVEVSPGQLKISAHNPEQE
EAQEEIEADTKVDDLAIGFNVNYLLDALSALRDEHVVIQLRDANSSALVR
EASSEKSRHVVMPLRL
>XC_0633 endonuclease
MPEGPSLVILREEAAAFVGRKILRVQGNSKQDIARLQQQKVLALRSWGKH
LLIECAQFSVRIHFLLFGSYRINEDKPNAVPRLRLEFSKGETLNFYACSV
QFIERPLDEVYDWSADVMNPLWDAAQARLKLRAAPQLLAADALLDQSIFS
GVGNIIKNEVLHRIRVHPESQVGALPARKLGELVTQARDYSFDFYTWKKA
FVLKKRYQVHTKTICPRDGAPLQYRKHLGKAGRRAFFCEVCQRRYRLEEA
>XC_1817 DNA mismatch repair protein
MAIRQLPEILINQIAAGEVVERPASVVKELVENALDAGATRVDIDLEEGG
VRLIRIRDNGGGIAPEELPLAVSRHATSKIASLDDLETVATLGFRGEALP
SIASVSRFTLASRRPDAEHGSALQIEGGRLGEVMPRAHAPGTTVEVRELF
FNVPARRKFLRAERTELGHIEEWLRSLALARPDVELRVSHNGKPSRRYKP
GDLYSDARLGETLGEDFARQALRVDHSGAGLRLHGWVAQPHYSRASTDQQ
YLYVNGRSVRDRSVAHAVKMAYGDVLFHGRQPAYVLFLELDPARVDVNVH
PAKHEVRFREARLIHDFVYRTLQDALAQTRAGALPADVGVGGAAALGIGA
VAAQGGGSYVADAGAGHPGAGSGSGYASWAPSQAPLGLRVDEARAAYAAL
YAPAAGSALRDDGQPVLSGTGLPATAHDSGVPPLGYAVAQLHGIYILAEN
AEGLIVVDMHAAHERIGYERLKQAHDSIGLHAQPLLVPMTLAVGEREADT
AEREADTLASLGFEITRSGPQSLHVRSIPALLANADPEALLRDVLGDLRE
HGQSRRIATARDELLSTMACHGAVRANRRLTVPEMNALLRDMEATERSGQ
CNHGRPTWARFTLGEIDRWFLRGR
>XC_0702 IS1478 transposase
MWPLSLAKRWDSIRKPRSIVAPEKPPITQRPQGLDVCSGVPELPVFATLK
AGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTA
LEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLEN
PYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQ
AVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQ
SYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQV
EPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYE
FGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEP
TVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLK
DDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPS
FSAVPLSPMTLGA
>XC_3149 conserved hypothetical protein
MLWACILLPQLALDDVLRRREDTQAPLALVEGPAQLRSLHAVNAAAAAAG
LKPGMRLSAAHALMAEVQTCDYDPQAEARCQRFLASWAYRHSSLVSQQWG
RAIVLEAGASFRLFGPWPRFERRLREELQALGFQHRLALAPTPRAARVLA
GLRDGMAVTQLPALQALLDKVPVRRAALPGDAGERLQHMGVRTLAALRAL
PSEGVRRRFGGALLDHLDRLYGQADDPLECYAPPDHFDQRVELGYEVETH
PALLFPLRRLIGDLCTYLSIRDGGVQRFLLRLEHEEGATDVDVGLLTPER
APALLFELARNRLERVEIPRPVVAMRLLAKQLPPFVPAMRDLFDQRAQQS
VDWPQLRERLRARLGDEAVYRVLPADDPRPERAWQKAIGDDIREAAAPPR
PPRPTWLMPLPVPLHDPHLRIVSGPERLESGWWDDAEARRDYYVVETSRG
RRAWVFASPGRTDGWMLHGWFA
>XC_2046 plasmid-related protein
MPHFLWSITMSLDLESTAAQATPVQSELLDAESSPLTLSLQDFVGEFGDE
LLDSLNRANPPVYTGQPQTRRQLIVASLKRKLFEAQAEVVHAAAELLIDR
GERAAIVNGEMGCGKTTVGIATAAVLNAEGYRRTLVLSPPHLVYKWRREI
QETVAGARVWVLNGPDTLVKLIKLREQLGVQPTGQEFFVLGRVRMRMGFH
WRPVFTRRRTRHGDVAACPDCGHVITDLDGEPVNPIELDAEEYRRKCSHC
AAPLWTLIRPRSLSGSDQSSAVLKALKRIPTIGEVTAQKLMKRFGDGFLA
SMLGDNVHEFINLMDGNGELVFSDRQATRMERAMANMEFGFGEGGYQPSE
FIKRYLPQGTFDLLIADEAHEYKNGGSAQGQAMGVLAAKARKTLLLTGTL
MGGYGDDLFYLLFRALPGRMIEDGYRPSTSGSMASAAMAFMRDHGVLKDI
YSESTGTAHKTAKGSKVSVRTVKAPGFGPKGVLRCILPFTVFLKLKDIGG
NVLPPYDEEFREVAMDTAQVAAYRDLAGRLTAELKQALARRDTTLLGVVL
NVLLAWPDCCFRSETVLHPRTRDTLAFVPAQFHEFEISPKERELIDICKQ
EKAQGRNVLTYTVYTGKRDTTSRLKVLLEQEGFKVAVLRASVDASRREDW
IAEQLDRGIDVLITNPELVKTGLDLLDFPTIVFMQSGYNVYSLQQAARRS
WRIGQKEPVRVIYLGYAGSSQMTCLELMAKKIMVSQSTSGDVPESGLDVL
NQDGDSVEVALARQLVAA
>XC_2664 DNA ligase
MTASPDPAQRIDALRQRIEDANYRYHVLDEPQIADVEYDRLLRELEALEA
AHPELATADSPTQRVGYLAASRFAEVRHVLPMLSLGNAFSDEEVAEFVRR
ISERLERKQPVFCAEPKLDGLAISLRYEQGEFVQGATRGDGATGEDVSAN
LRTVKAIPLRLRGTGWPEVLEVRGEVYMPRAAFEAYNAQMRLQGGKVLAN
PRNGAAGSLRQLDARITAQRPLSFFAYGVGEVADGALPPTHSTMLAQLRE
WGFPVSQLVEVVQGSEGLLTYYRRIGEARDGLPFDIDGVVYKLDDLAGQR
EMGFVSRAPRWALAHKFPAQEQSTTVEAIEIQIGRTGAATPVARLKPVHV
AGVVVTNATLHNADQIARLDVRVGDTVIVRRAGDVIPEVAGVVAEQRPAG
THAWQMPTQCPVCGSEIVREEGQAVWRCSGELTCPAQRKEAFRHFVSRRA
MDVDGLGEKFIEVLVDSGVVQGVADLYLLNVDQLLQLRLISTADSPHAFL
REAREHLAAGAYAQVEQTMVGIGVDLAGVQPAPQTWQADLLRAGLPAFDW
NRKKIATKWAENLIEAIETSRDTTLERFLFALGIEHVGESTAKALSAWFG
ELDVIRHLPWPLFKRVPDIGGEVARSLGHFFDQAGNQQAIDDLLQRGVRI
GDAHPPSPKLRGALSFAVLLEDLDIPKVTPVRAQQLAAATASFDALIASE
ADPLLQAGVPAPVIASLQQWLARPENAALATAAQRAMDALLAQLPQADAV
QAGPLDGQTVVITGTLAALTRDAAKQRLESLGAKVAGSVSKKTAFLVAGE
EAGSKLDKAQSLGVEIWDEARLLAFLSEHGQAV
>XC_0927 IS1404 transposase
MREWIGRGASERRALAVIGMSASALRYCPREDRNGELRERICALAHRHRR
YGVGMIYLKLRQEGRIVNYKRVERLYREQQLQVRRRKRKKVPIGERQPLL
RPSQANQVWSMDFVFDRTAEGRVIKCLVIVDDATHEVVAIEVERAISGHG
VTRVLDRLAHSRGLPKVIRTDNGKEFCGKAMVAWAHARNVQLRLIQPGKP
NQNAYVESFNGRLRDECLNEHWFPTLLHARTEIERWRREYNEDRPKKAIG
GMTPAAYAQHLANTDIINPGL
>XC_3244 DNA polymerase III tau and gamma subunits
MSYLVLARKWRPKRFAELVGQEHVVRALSNALDSGRVHHAFLFTGTRGVG
KTTIARIFAKSLNCETGTSADPCGTCPACLDIDAGRYIDLLEIDAASNTG
VDDVREVIENAQYMPSRGKFKVYLIDEVHMLSKAAFNALLKTLEEPPEHV
KFLLATTDPQKLPVTVLSRCLQFNLKRLDEDQIQGQMTRILAAEEIESDP
SAIVQLSKAADGSLRDGLSLLDQAIAYAGGALREDVVRTMLGTVDRTQVG
AMLQALSDGDGAQLLKVVAALAEFSPDWSGVLEALAEALHRIQVQQLVPS
VAFVGDGIDPTGFAAQLRPEVVQLWYQMALNGRRDLYLAPSPRAGFEMAV
LRMLAFRPAAAVPAGSGDDGRGASAGGHTRGTATGVQAAPAAAAPARAAT
SAKAADVSPAPVVSAPPVAAAPSPVVVLPTAAAEPAPSAPPARTDDTPPW
AVDDAPVRAQAAPQRATAEVPAAVPLMAPEAAMALPATVADDAAPAAMDA
VVPVAPPSAPAPVTPPAATFDDGHIADAEQWLELVTRSGLNGPSRQLAAN
AAFIGHRDGVLRLALAPGFEYLNSERSIANLAQALAPELGNTPRIVIETG
SADVETLHERANRQKGERQSAAETAFMNDPNVQQLIQQQGARVVPDSIRP
YDE
>XC_2063 DNA methyltransferase
MHPQPCAPLLYGSVCSGIEAVSLAWQPLGLEAAWFAEIEPFPSAVLAHHY
PHVPNLGDMTMIARQVHAGTVPAPDILVGGTPCQSFSVAGARRGLDDPRG
ALTLAYVELANAIDQARHQNDRSPATLVWENVPGVLSDRSNAFGNFLGAL
AGESRALQPPGEKWAHAGYVSGPRRRIAWRVLDAQYFGVAQRRKRVFLVA
SGGDGFDPAEVLFERTGLRGDSSTGSAPWQEAAHAAGPSAEAAGGYAGLK
SSYGEVKTTFGFSGGIGPVDVAACLMAAGPKHDIRTETFMVQSVAGSITH
TLDTANGGKGSSEDGTGKGVPIIAFTAQGSGANATMDLTPTLRAGGHRSS
HANAGVVPAIAFAQNHRCEVRFESGHGQVACTVLSNGKPGYGVPMIACVA
LHDRQNGLAAELGSGANGHVLAPDHEAHFRYDWNDPIPRDWSQWRVRRLM
PVECERLQGMPDDYTLVPYRGKPAADAPRYKAIGNSMAMPCVAWLGQRLV
QCLHKMGSIASD
>XC_1018 phage-related integrase
MVRMLTDMVVRQAKASDKPYTLADFDGLFLYVSPVGGKAWHFRYTWVGQR
ARISLGSYPELSLRDAREFRDQARALVAKGINPRTDRKQKRQAIRLAGEN
TFMAVYEKWMEHRQLTLEEGRQSSLEQIRRVFKKDVFPYLKRYTIYEITR
PVLLEVIGRIEKRESLSVAEKVRTWLKQLADYAMVVIPGMVEHPAIDLHV
VAVPLPPVEHNPFLRMPELPLFLQTLRKYRGMQMTQLAIRLLLLTGVRTG
ELRLATPDQFDLEQGLWIIPVMSLKQRKMLTKKKRKRVTDIPPYIVPLPV
QAIEIVRHMLDLFKPAQTYLFPGVKRITARMSENTVNRAIKRLGYDGRLT
GHGIRATISTALNELGYPKVWVDAQLSHADPNRISATYNHAEYVEQRRLM
MQDWADRLDLFEQNQVQIASTHLTIHLQGVPTIAGQKVTPLPALGQHAPI
MLVAPNEQTMPAVGTGTQRLSAVQMPEYALPKISEVQRERLEVLDIFEGP
DNLVVADYAKLAGKSRRWITYEIQARNLLSIQLGNKGQRVPVWQLNMFKR
RLVQAVLKRLHRGVDTWDIYYALTRPREELDGKSPIEALTSDNQQAMVEA
VCRAVSEATTPVVEKRVPINRIAECMSEF
>XC_1679 conserved hypothetical protein
MIVPGGFPAPVTGSAVAASPPLPTVRLLSLDAHGRVLDWINWQDAACLYA
RDAVSWTLGEPCMQIHGGVSRLTGERSVLELHPIIAARGHARSRALDPTP
TLTNTALFARDSQLCMYCGQHFSRPHLTRDHVMPVSKGGRDSWENVVTAC
FQCNSRKANRTPQQAHMPLLAVPYRPSWIEHLILSNRNILSDQMAFLRAQ
LPKRSKLSL
>XC_2689 conserved hypothetical protein
MPAPMADHLPPPAAATAALHRHRPAVLLPRSPPAARSSSPSIPAYVKFFA
SCAKGLEYLLADELLALGASKATATISGVNVEGALRDAQRAVLWSRLASR
VLWPLTEFDCPDEDALYAGVAELPWHEHLSTGHTLSVDAHVSGTAITHAR
YAAQRIKDAVVDTIRRQGLERPSVDVESPDLRLNLSLRKGRATISVDLGG
GPLHRRGWRMAQNEAPLKENLAAAVLLRAGWPRAYADGGGLLDPMCGSGT
LLIEGALMAADVAPGLQRYGSDIPSRWRGFDRDSWQQLVTEARERDSVGR
AALKQVIHGSDMDPHAIRAAKENAQVAGVAEAIWFGVREVGDLQTRPQAT
GVVVCNPPYDERLAADAALYRKLGDTLQRVVPQWRASLLCGNAELAYATG
LRAGKKYQLFNGAIECALIVCDPIAVPRRTPLAAPTALSEGAQMVANRLR
KNLQKFKKWRAREGIECFRVYDADLPEYSAAIDVYQQADGDRRIFLHVQE
YAAPATIPEADVRRRLGELLAAAREVFEVPAERVALKSRERGKGGSKYGR
FEQRNEIVNVREHGALLRVNLFDYLDTGLFLDHRPLRGTMAQQSKGRRFL
NLFCYTGVASVQAAVAGASATTSVDLSGTYLQWCADNLALNGQAGSKHKL
VQADALAWLEAERAHFDVIFCDPPTFSNSARAEDFDIQREHVRLLRAAVA
RLAPGGVLYFSNNFRRFKLDEEAVSEFAQCEEISPRTIDPDFERHARIHR
AWRLTA
>XC_3105 IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>XC_3094 excinuclease ABC subunit A
MAMDFIRIRGARTHNLKNIDLDLPRDKLIVITGLSGSGKSSLAFDTIYAE
GQRRYVESLSAYARQFLSVMEKPDLDHIEGLSPAISIEQKSTSHNPRSTV
GTITEIYDYLRLLYARVGQPRCPDHGFPLEAQTVSQMVDHMLTLDPEQRY
MLLAPVIRDRKGEHAQVFEQLRAQGFVRVRVDGELYEIDAVPPLALRQKH
TIEAVIDRFRPREDIKQRLAESFETALKLGEGMVAVQSLDDATAAPHLFS
SKYSCPVCDYSLPELEPRLFSFNAPVGACPSCDGLGVAEFFDPDRVVVHP
ELSLSAGAVRGWDRRNAYYFQLIASLAKHYKFDVDAVWNTLPAKVRQAVL
FGSGDEVISFTYFTDAGGRTTRKHRFEGILPNLERRYRETESPAVREELT
KYVSQQPCPACNGTRLNRAARNVFVADRPLPELVVLPVNEALNFFRGLSL
PGWRGEIASKIVKEIGERLGFLVDVGLDYLTLERKADTLSGGEAQRIRLA
SQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLTRLRDLGNTVIVVEHDE
DAIRLADHVLDIGPGAGVHGGEICAQGTLQDILESPRSLTGQYLSGKRRI
EIPKQRHKPNPKMMLHLRGATGNNLKNVDLEIPAGLLTCITGVSGSGKST
LINDTLFTLAANEINGASHTVAPHREVENLDLFDKVVDIDQSPIGRTPRS
NPATYTGMFTPLRELFAQVPESRARGYSPGRFSFNVRGGRCEACQGDGMI
KVEMHFLPDVYVPCDVCHGKRYNRETLEIRYKGFNISDVLQMTVEDALRL
FEPVPSIARKLETLVDVGLSYIKLGQSATTLSGGEAQRVKLSKELSRRDT
GRTLYILDEPTTGLHFHDIEALLGVLHKLRDEGNTVVVIEHNLDVIKTAD
WIVDLGPEGGHRGGTILVSGTPEDVAAHKASYTGQFLAKMLPSVKARETR
PAAMANKPDARPPRKVKPEKVAKATKTATKKTAKKKAS
>XC_4189 conserved hypothetical protein
MFFRNLTLFRFPTTLDFSEIETLLPQVQLKPVGPLEMSSRGFISPFGRDE
QDVLSHRLEDFLWLTVGGEDKILPGAVVNDLLERKVAEIEEKEGRRPGGK
ARKRLKDDLIHELLPRAFVKSSRTDAILDLQHGYIAVNTSSRKSGENVMS
EIRGALGSFPALPLNAEVAPRAILTGWIAGEPLPEGLSLGEECEMKDPIE
GGAVVKCQHQELRGDEIDKHLEAGKQVTKLALVMDDNLSFVLGDDLVIRK
LKFLDGALDQLEHSEGDGARAELDARFALMSAEVRRLFLLLEDALKLSKA
EA
>XC_1955 7,8-dihydro-8-oxoguanine-triphosphatase
MPHTPIVATLGYLLSPDGTQVLMIHRNARPGDHHLGKYNGLGGKLEADED
VLACMRREIREEAGVECGQMQLRGTISWPGFGKQGEDWLGFVFLIHSFDG
TPQTSNPEGTLEWVPIAQMDQVPMWEGDRNFLPLVFDGDPRPFHGVMPYR
DGRMQSWSYSRV
>XC_1465 DNA methylation and regulatory protein Ada
MHTAMPDRTHCDCARLARDARFDGLFFTAVRSTGIYCRPVCPAPPPKPSN
ISYYPTAAAATAAGYRPCLRCRPELSPQAQQHLGEESVQRALAMIAEGAL
QEQPVQTLADAVGMSARQLQRQFVQQLGATPIQVHGTRRLLLAKQLLTET
ALPVTEVALAAGFNSLRRFNAAFLQGCGMPPSALRKQRSDVPGGDLCLRL
GYRPPLDLPAMLTFLQRRAIPGIEQVDADGYRRVIGAPGQATLIHVSAAP
TRDELLLRIGATDPRQIPQIVRRVRRIFDLDADLHAVHATLAQDPLLEQA
ITRRPGLRVPGGWDGFEVAVRAVLGQQISVAGAATLAARLVDRHGGHLPD
MPPGLDRSFPTPAQMADAPLEQLGLPRARAATLRALASACAQGRLHFGAG
QRLPDFVAACTALPGIGPWTAHYIAMRALSHPDAFPAGDLILQQVLGAPE
RLSERATEARSQAWRPWRAYAVLHLWHLAVDRKDTRS
>XC_3804 ISxac3 transposase
MQAHCEEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>XC_0869 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_1403 endonuclease V
MQTSIDPVFAGWDGSVAQARQLQQQLAQRVALRDEVSAAPALLAGFDVGF
EDDGQTTRAAAVLLDAQTLLPLETHVARVPTSMPYVPGLLSFRELPALLR
ALALLARTPDLVFIDGQGIAHPRRFGIAAHFGVVTGLPSIGVAKQRLAGT
FIEPGGERGDHSPILLAGAQIGWALRSKPRCNPLIVSPGHRVSMQGALDW
TLRTLRAYRLPEPTRLADRLASRRGEIELQTQPTLL
>XC_0666 ISxac3 transposase
MVDSEVSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_0859 IS1477 transposase ORFA
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>XC_2220 7,8-dihydro-8-oxoguanine-triphosphatase
MTLQETRWHPDVTVATVVVRDGRFLQVEESIGGRLLLNQPAGHLEPDESL
LQAAVRETLEETGWDVRLTQFIGTYQWVAPTGQCFLRFAFVADALAHHPE
RSLDTGVVRALWMTPEELRAASDRLRSPLVWEVVADYLAGQRHPLALVRH
VA
>XC_3693 conserved hypothetical protein
MRRIARVLVLGSMPGSASLHAHAYYAHPRNRFWPVMQQLLGIDADAPYDA
RLQQLAERGVGLWDVIGECARRGSLDAAIVPGSIVVNPLPERLATLPQLR
LVVCNGSAAAQAWRRHVQPALSPPLTRLPVQAVPSTSPANAAWSLPRLCA
AWQPVRDALR
>XC_3825 DNA topoisomerase I
MPKHLLIVESPAKAKTINKYLGKDFTVLASYGHVRDLVPKEGAVDPDNGF
AMRYDLIEKNEKHVEAIARAAKSADDIYLATDPDREGEAISWHIAEILKE
RGLLKDKTMQRVVFTEITPRAIKEAMLKPRAIAADLVDAQQARRALDYLV
GFNLSPVLWRKVQRGLSAGRVQSPALRMIVEREEEIEAFIAREYWSIDAH
CRHPSQPFNARLIKLDGQKFEQFTVTDGDTAEAARLRIQQAAQGVLHVTD
VASKERKRRPAPPFTTSTLQQEASRKLGFTTRKTMQVAQKLYEGVALGDE
GSVGLISYMRTDSVNLSQDALAEIRDVIARDFGTASLPDQPNAYTTKSKN
AQEAHEAVRPTSALRTPAQVARFLSDDERRLYELIWRRAVACQMIPATLN
TVSVDLSAGSEHVFRASGTTVVVAGFLAVYEEGKDTKSSEDEDEGRKLPL
MKAGDNIPLDRIVTDQHFTQPPPRFTEAALVKALEEYGIGRPSTYASIIQ
TLQFRKYVEMEGRSFRPTDVGRAVSKFLSGHFTRYVDYDFTANLEDDLDA
VSRGEAEWIPLMEKFWGPFKELVEDKKDSLDKTDAGSVRVLGADPVSGKE
VSARIGRFGPMVQIGTVEDEDKPTFASLRPGQSIYSISIEDALELFKMPR
ALGQDKDQDVSVGIGRFGPFARRGSVYASLKKEDDPYTIDLARAVFLIEE
KEEIARNRVIKEFDGSDIQVLNGRFGPYISDGKLNGKIPKDREPASLTFE
EVQQLLADTGKPVRKGFGAKKATLKKNAVKDSAKEAKDAAKKTAAVKKVA
TKTAAKKAPAKKAAKKATKRVVKKAVSKAAG
>XC_3631 kinase
MIESLNIGALVAALPEKYQPIFAHPELSDGSSRGCEDRLVLIRQCAQRLQ
HALGRPLRVLDLGCAQGFFSLNLAADGHTVHGVDFLDLNVNVCKALAAEN
PACAATFEHGTVEDVIDRLEHDECDLVLGLSVFHHLIHDKGILKVSALCR
KLSETTSAGIYELALREEPLYWAPSLSQDPAELLSSYAFLRLLSQQQTHL
SAVSRPLYFASSRFWYVDGAIGNFTSWSSESHAHGRGTHLQSRRYYFSEQ
SFVKKMTLGVGDRAEINLQEFVNEVEFLGNPPESYPAPRLIASLNDSRDL
FIARSMMNGRLLSQAIDDGAAYDADEIIAQILAQLVLLERAGLYHNDVRC
WNILIAPEGRAVLIDYGAISANPFDCSWLDDLLLSFLITVKEILERKVVP
SSPSREPALDFMTLPARYRNAFIGFFGQNRSPLTFALLQQCLQQADATPH
SAPEWVTIYQRLQKALLGYNARLSAVHIETEHHRVELAARGAAIEHLRDS
TLQDQERTQAFEQGVAAAEERYKRLEEESEKLAAWAKGLEAQTIESNRDK
EALAALNAELESDKAALATRIASLGQELEERQRARELAELLAADVGRLTE
ERDAARSDLLDTQSVVEQHQATITALEARVAVQQQQISGLESSRDQERNR
LRELQVDLSRSMDGTASAREYIRELEMAVDALEGQINSLHGSRSWRVTAP
LRLFTTRVLKRGNADAATIRKVSDARLESPVHTDGVTPTPAEAAMDERLA
AVDQLGSRIRKSLK
>XC_2000 excinuclease ABC subunit C
MSARPQADFDGKAFAARLSTAPGVYRMYAADDSLLYVGKAGALRKRVGSY
FNGTPKNARLTSMLSQVARMDVTVTRSEAEALLLENQLIKSLSPRYNVSL
RDDKSYPYVLLTREDWPRIALHRGPRAVNGRYFGPYAGVTAVRETLNLMH
KLFKLRSCEDSVFRNRSRPCLQYQIGRCSAPCVDLVAAQDYQEAVRRATM
FLEGKSDQLGEEIMHSMQQASEALEFERAARLRDLLSSLRSMQNRQYVDG
RAADLDVLACATQSSQACVLLLSFRDGRNLGTRSFFPKTNGEDSAEEILA
AFVSQYYAEHAPPREILLDREIPDAELIEAALSAAAEHKVALKWNVRGER
AGYLLLASRNAQLTLVTELTSQSAQHARSEALREMLGLAEQVKRVECFDI
SHTMGEATVASCVVFDASGPVRGQYRRFNISGITPGDDYAAMRQAIERRF
RRAVEENGVLPDVLLIDGGAGQLAQAQAALADLGIENVLLVGVAKGEERR
AGHEALILADGRELRPGAASPALQFIQQVRDEAHRFAITGHRGRRQKARM
TSKLEDIPGIGPRRRASLLKHFGGLVGLKAAGEAEIARVEGVNAALAARI
YANLHGLALPDAAGESSP
>XC_1694 A/G-specific adenine glycosylase
MPVPATLTTDAFVDRLLHWFDGHGRHDLPWQHPRAPYRVWLSEIMLQQTQ
VAVVIPYFQKFVASFPTLADLAAADNDTVMAHWAGLGYYARARNLHAAAK
QCVALHAGELPRDFDALLALPGIGRSTAGAILSQAWNDRFPIMDGNVKRV
LTRIHGIAGYPGLPVVEKQLWQLAANHVAHVPAGRLADYTQAQMDFGATL
CTRARPACMVCPLQENCVARREGLVEALPTPKPGKQLPEREATALLLENA
HNEILLQRRPPTGIWASLWTLPQAETDSDLREWFAAHIDGDYDRADEMPM
IVHTFSHYRLHLQPLRLRKVALRQVLRDNDDLRWVARADLATLGLPAPIR
KLLDAL
>XC_0053 conserved hypothetical protein
MTRRALQAPVAAAAARDSSAFAWVDHDVRHKPAGAAADAAARVPAAPISN
PPPQRRTDIAGLRKMIGLRERAVSTHAPVRAASTDRHLPGNEIAPGLHLI
EAFLPQAIPRQALSLAFAKREDAVDPMDLLFFDTETTGLAGGTGTRAFMI
GVADWYTDVTQGSGLRVRQLMMSTMAAESAMLDLFRSWLSPQTVLSSYNG
RCYDAPLLKTRYRLARRGDPISALDHVDLLFPTRRRYRGTWENCKLATIE
RQLLRVVREDDLPGSEAPAAWLSYLRGGSARNLRRVAEHNHQDVVTLSLL
MQRLVAVDAQDREVIPMLETP
>XC_2593 IS1404 transposase
MDVKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMS
VPDAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>XC_2012 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLIAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHTLLHGKEDSVFGDSGYTGADNREELQDCKAAFFIAARRSTLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_3848 ATP-dependent RNA helicase
MSDKPLTDLTFSSFDLHPALVAGLESAGFTRCTPIQALTLPVALPGGDVA
GQAQTGTGKTLAFLVAVMNRLLIRPALADRKPEDPRALILAPTRELAIQI
HKDAVKFGADLGLRFALVYGGVDYDKQRELLQQGVDVIIATPGRLIDYVK
QHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPERGTRQTLLFSA
TLSHRVLELAYEHMNEPEKLVVETETITAARVRQRIYFPSDEEKQTLLLG
LLSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLL
NRFQKGQLEILVATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTAR
LGEEGDAISFACERYAMSLPDIEAYIEQKIPVEPVTTELLTPLPRTPRAT
VEGEEVDDDAGDSVGTIFREAREQRAADEARRGGGRSGPGGASRSGSGGG
RRDGAGADGKPRPPRRKPRVEGEADPAAAPSETPVVVAAAAETPAVTAAE
GERAPRKRRRRRNGRPVEGAEPVVASTPVPAPAAPRKPTQVVAKPVRAAA
KPSGSPSLLSRIGRRLRSLVSGS
>XC_0144 IS1480 transposase
MVRARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYAGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>XC_1643 IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNTDHARDPEMRQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>XC_2343 putative single stranded DNA exonuclease
MTSSPTIVRRPPGQGGTWPDAMLPLLRRIYAARGVVDVHGAHPRLGQLLS
PELLHNSRVAAELLADAIAAQRRILVVGDFDCDGATACAVGVRGLRMLGA
LDVHHAVPNRMVHGYGLSPALVDELAALQPDLLVTVDHGIACHAGVAAAK
ARGWTVLVTDHHLPGEVLPPADAIVDPNLVQDSFPSKTLAGVGVIFYVLL
ALRGVLRARGAFAERAEPDLSVLLDLVAVGTVADLVPLDTNNRALVSAGL
RRLRDGKGCIGLRALIDASGRDAARLSASDIGFALAPRLNAAGRLEDMAL
GIELLLCEDWSRAREIAGLLEEINAERRAVQQLMTDDAEQAVTKVMLAAD
GALPMAACLFDPEWHPGVIGLVASKLKDRLHRPVIALAPAEPGSDQLRGS
ARSIPGLHIRDVLAAVDARHPGLIQKFGGHAMAAGLSLEHRALAAFEQAF
QTQVQAMVDASLLQAELHSDGELAAHELDHLHAEALRAAGPWGQGFPEPL
FDGQFEVLQWRLLKERHLKLTLRCAGRAGRAEPLNAIHFNGWRGSEPART
VRIAYRLVGDDYRGGTAVQLIVEHCEPAASAG
>XC_2426 RNA-directed DNA polymerase
MSDWRPQHYRKNAPAGSPASTIDAALEVASITSTANSRLPPIFSLRHLAH
YTSINYNILRAAISRSGPEPYRSFRIRKRTEKSSERYRHISVPSHALMCV
QRWINQRILQQCQVDDASTAFAKDSKLIEAATIHCKSRWLVKIDVRNFFD
SINEISIYRVFNSLGYQPLVAFELARLCTRKSARHSKNWLNFRPETRDVW
AAPDDEGKAWSSDMVIKSYDALHQGYLPQGAPTSPMLSNLAMRSFDEKVR
DLALKRGFFYTRYADDISLSTERTSSRVTCVELIRDVHQLLHEEGLSPNF
AKTKIVPPGAKKIVLGLLVNGEKPRLTREFRLRLRQHLHFLEKPNGPVEH
AAKRKFASVSGLRHHLLGLAAYAAQIEPEYGQDIQQRLRNIPWPS
>XC_1517 exodeoxyribonuclease VII small subunit
MAKKSLNESSPVARFEQSLEELEQLVQKMEVGEMSLEQSLTAYERGIGLY
RDCQQALEQAELRVRLVTDPARPEQAEAFEPPSLDGG
>XC_0545 IS1477 transposase ORFA
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>XC_0034 conserved hypothetical protein
MAATAAVTTAAAVAQAKATARAAGLIYVNDQQPGISRRKAGKNFSYRDAD
GQRVTDADTLQRIRALAIPPAYTEVWICAKPNGHLQATGRDARRRKQYRY
HADWAQVRGEGKFERVIAFGEALPKLRRRLRRDLLLPGFPREKVLAIVVA
LLADTLVRVGNAEYSRSNRSYGLTTLRNRHMEFLKGGRARLKFRGKSGQE
HEIEVDDKHLVKLIRECQQLPGQSLFQYKDDDGQLQPVDSGEVNDYLREA
MGEDFTAKDFRTWGGTLAALQRLARLPVPERSSERALKQVQNDVIREVAD
ALGNTPSVCRKAYIDPCVFEGWRAGELQTLATGVRGERQWEAATLRFLSA
SRAKVRKVVKAAKSSATSVKPAKRASKCAIKRPTTTKAGARKAA
>XC_2597 ISxac3 transposase
MVDSEVSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>XC_1478 IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>XC_3924 IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA