Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Xanthomonas campestris pv. campestris str. ATCC 33913, ATCC 33913
Gene type: CDS

Number of genes found: 258

# Xanthomonas campestris pv. campestris str. ATCC 33913, ATCC 33913

>gid:104406  Ada  DNA methylation and regulatory protein Ada or
MHTAMPDRTHCDCARLARDARFDGLFFTAVRSTGIYCRPVCPAPPPKPSN
ISYYPTAAAATAAGYRPCLRCRPELSPQAQQHLGEESVQRALAMIAEGAL
QEQPVQTLADAVGMSARQLQRQFVQQLGATPIQVHGTRRLLLAKQLLTET
ALPVTEVALAAGFNSLRRFNAAFLQGCGMPPSALRKQRSDVPGGDLCLRL
GYRPPLDLPAMLTFLQRRAIPGIEQVDADGYRRVIGAPGQATLIHVSAAP
TRDELLLRIGATDPRQIPQIVRRVRRIFDLDADLHAVHATLAQDPLLEQA
ITRRPGLRVPGGWDGFEVAVRAVLGQQISVAGAATLAARLVDRHGGHLPD
MPPGLDRSFPTPAQMADAPLEQLGLPRARAATLRALASACAQGRLHFGAG
QRLPDFVAACTALPGIGPWTAHYIAMRALSHPDAFPAGDLILQQVLGAPE
RLSERATEARSQAWRPWRAYAVLHLWHLAVDRKDTRS
>gid:101769  XCC0015  conserved hypothetical protein
MASAIKGRGATGHLPGRFEVTTPQAVDDGWHVDDSDEFAAPALRTQVTDE
TARSIISRNQSPDIGFSQSVNPYRGCEHGCSYCFARPSHAYLNLSPGLDF
ETRLFAKTNAPELLRRELARPSYVPSPIALGINTDAYQPIERKRGLTRQL
IEVLWEARHPFTLITKSALVTRDLDLLAPLARARLVNVHFSVTTLDPHLS
ARLEPRASAPHARLRAMRSLHEAGVPVGVMAAPVIPWINDHELEAILQAA
ADAGASSAGYVLLRLPHEVAPLFREWLQTHHPQRAEHVMSTIAQLRGGKD
YDSTFGTRMRGQGVYADLLARRFALAHRRAGFDTRRTPPLDTEQFRRPAP
PPKPVKDSPQGQLF
>gid:101788  XCC0034  conserved hypothetical protein
MAATAAVTTAAAVAQAKATARAAGLIYVNDQQPGISRRKAGKNFSYRDAD
GQRVTDADTLQRIRALAIPPAYTEVWICAKPNGHLQATGRDARRRKQYRY
HADWAQVRGEGKFERVIAFGEALPKLRRRLRRDLLLPGFPREKVLAIVVA
LLADTLVRVGNAEYSRSNRSYGLTTLRNRHMEFLKGGRARLKFRGKSGQE
HEIEVDDKHLVKLIRECQQLPGQSLFQYKDDDGQLQPVDSGEVNDYLREA
MGEDFTAKDFRTWGGTLAALQRLARLPVPERSSERALKQVQNDVIREVAD
ALGNTPSVCRKAYIDPCVFEGWRAGELQTLATGVRGERQWEAATLRFLSA
SRAKVRKVVKAAKSSATSVKPAKRASKCAIKRPTTTKAGARKAA
>gid:101807  XCC0053  conserved hypothetical protein
MTRRALQAPVAAAAARDSSAFAWVDHDVRHKPAGAAADAAARVPAAPISN
PPPQRRTDIAGLRKMIGLRERAVSTHAPVRAASTDRHLPGNEIAPGLHLI
EAFLPQAIPRQALSLAFAKREDAVDPMDLLFFDTETTGLAGGTGTRAFMI
GVADWYTDVTQGSGLRVRQLMMSTMAAESAMLDLFRSWLSPQTVLSSYNG
RCYDAPLLKTRYRLARRGDPISALDHVDLLFPTRRRYRGTWENCKLATIE
RQLLRVVREDDLPGSEAPAAWLSYLRGGSARNLRRVAEHNHQDVVTLSLL
MQRLVAVDAQDREVIPMLETP
>gid:101819  XCC0065  conserved hypothetical protein
MTPQNVAAQEAYYFNKQPRGTPGLPTAQTTGIGFHGDSDYPNYYGAGAVS
RAIYIERTHAHPVGGIAPQMHLDMQRLRFKEPLLEHNGIDLSQTGAANPQ
PYWDTSTNPPTRGLFQHTQGTHQHVSPALDLAAPNDERSGRSQHPSIDSA
LLEKVRQGVSELDRQARKPWDDNSERLSASLMLMAAEKGFTAKDDLKFAF
NTPTPNLGSGEVLHMWRASNASPDPAANRAHMPTQEALSVPAEQRLTQVE
ALQQAKAEEIQRSQQQDVVQQQLGQARSL
>gid:101859  XCC0105  ATP-dependent DNA ligase
MSLRDYTRKRRFDQTPEPAEDAAATAHRQPIFVVQLHHASSRHYDFRLEA
DGVLKSWAVPKGPSLRAGEKRLAVQVEDHPLAYAGFEGDIPQGQYGAGHV
QVFDHGTWHCDGDALAALDAGKLDFDLQGDKLRGGFALVRTRLRGRQPQW
LLIKRDDAHAADLDADALVADSDATAQAIETPSAAAAPARRAKASRRRRT
PAEALVTEKSASASRPRASAATQAHWRTRALALPGARDAACPTGLRAQLT
LLRAEAPDGAQWPHEIKWDGYRLLTDLVDGRAQLRSRNDQAWTDSFPEVA
TAVQALPVRDARLDGELVVLDAQGRSDFSALQRAIDGTARQPLRYLVFDL
LGVAGVDLRATPLLERKQLLRALLGETPGTLAYSAHVIGRGPEVFAASAD
KGWEGIVSKRADAPYRGGRSADWVKTKHEDSDEFVVVGYTDPKGARSGFG
ALLLAQLDGTQLRYVGRVGTGFDSALLGEITAQLQALHSPQPTLELPAHI
PSRPRDVHWVRPVLIAEVAFRGWAKQGLLRQAAFKRLREDKPMSDLGGDR
ATPGKSRGARTRTAAAAAGKASRAAATRTAAVSAGGSAAKPGKAGKSSTA
ADVSTPSRVAKQRVTPAASSAAKPGKPGKSSAAATGTASPRAAKRGAVST
ASASSTPKSGKRSVSSGSAASGKPAAPSKAASSKTARTPATSSARKTSTA
TAASSIRKARASASNAPDGVAITHPERVVFPAAGISKGDVAAYYRAVAPL
VLPEIARRPLSLLRCPDGAAGACFFQKHEGRHLGAHIKAIPLKQKSGTED
YLYIEDVAGLLELVQMNTLELHPWGARVDDPEHPDRLVFDLDPGEGVAWT
QVVAAAREIRSKLRAAGLESAVRLSGGKGLHVVVPIVPQASWDQARDFCE
AFAQALATQAPERYVATMSKAKRHGVIFVDWLRNGRGNTSVCSWSLRARE
HATVAVPLRWEELGKLSGPDAFPLDKAVQRAKRQRNDPWADVLALKQVLP
G
>gid:101881  XCC0127  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:101890  XCC0136  IS1480 transposase
MVRARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYAGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:101968  XCC0214  methyltransferase
MTNKYDHLDRQTLIGLLQRRDAERQLGLVWERDEIEADQALNDDFVALSL
DAGLSHGEAPWDNLIIEGDNYDALRALRMTHKGAIRCIYIDPPYNTGNKD
FVYNDRFVDKTHRFRHSLWLEFMYRRLQLAKELLADDGVIFVSIDDNEVF
RLGMLMDRVFGENNFIANVIWQKVFSPKGTAQHFSDDHEYVIIYGRDKNK
WRPNLLARTAAQDRAYKNPDDDPRGLWTSGDLSARNYYSKGVYSIVGPTG
RVIAGPPAGTYWRFSEERFKELDADNRIWWGKSGDNMPRLKRFLADVQQG
TVPQTLWTYGEVGHTQDAKKQLLEVLNFNSTNDVFSTPKPIQLMERILSI
ASKPGDTVLDFFAGSGTFAQAVAKLNAEDGGNRKFILVSSTEATEDTPDK
NLCRDVCAERVRRVLGGYTNAKGQPVEGLGGGFAYLRTRRIPKHRLALKL
DHAEVWHALQLLHQRPLSFWPGGGFASDGELAYLADFQAAHVEQLREWLR
TRTSAVAAVYTWSTERLNGLLGEPAADLSLLPLPHHLRERFGR
>gid:101969  XCC0215  conserved hypothetical protein
MPMPLKEFQTVICDGIVAQFGEVRALYRQIAAAAPERIDEARRKDAAIVL
QAPTGAGKTRMAIEVMRRVSIEERVLWFWFAPFTGLVEQSRKVLSNQAPE
LALLDLDADRQLDAVRGGGVFVVTWASLAARKAESRRARQRGDAGMAIDD
VIAMAREQGLRIGCVVDEAHHGFQRETLARAFFCDVLKPDYALLMTATPR
DADMKAFERTTGYSVGEPAEWASVSRADAVDAGLLKRGVRMVRFIARDGD
TAQLVDFEHLALRECTQMHRTIRKNLADADIALTPLMLVQVPDGKVAQEA
ARTYLVEQLGFDATAVRVHTAAEPDPDLLSLAQDPTVEVLIFKMAVALGF
DAPRAFTLAALRGARDPSFGVQVIGRIMRRHALLQAQAVVPPVLDHGYVF
LANSESQEGLLLAGAQINTLTTQAPELGTQTVVTMIGDGASLQVVRSGEP
LSLLVSRAGVHVLDAEAERDAVSSAGTTSDVADALVGTPFAGMANATQAA
LEMFGGEGAWPARATSVAGAFVLAQESMYRYPRRPDAPDRLRGEQLPPVS
ADFEAGLAAHVDFSPEVLADRLRGKVQVQRLDTDLFAGHRVTEDGSDLWA
NLSPEAVAEKAEQIRLRLVEANDRELYRRLLERFVRAIEASGAEVPEDEE
LQMRQLDLLLVRRPRLLREAFKSLRQGQVLDVDVLLPAELFSDQPLRSAN
RGLYGVFPAGLNQDELAIAERLDASAQVRWWHRNQPKSGIGLYRWDEGDG
FYPDFVVSVAERSAPGIALLELKGEHLWGKPSEVDKSAAIHPDYGAVFTV
GRKRGERDFFYLRELGGRLQRAGSFDLDRMRFT
>gid:102098  XCC0344  site-specific recombinase
MTRLTPKLLDQVRGRLRLRHYSLRTEQAYVGWIRRFILANGKRHPAQMGQ
AEVEAFLTDLATRGQVSAGTQNQALAALLFLYREILGLELPWMENLVRAK
RPRRIPVVLSVEEVTRLLTMLEGACRLMAGLLYGSGMRLLECLRLRIKDV
DMVRCEIVVRDGKGGKDRRVPLPRSLRGELMQQRERALLLHAADLAEGAG
QVFLPHALARKYPSADVEPGWQYLFPGARRSVDPRSGRVGLHHVSEEIRQ
RAVHAARRRAGIDKPATCHTLRHSFATHPLEAGHDIRTVQELLGHKDVAT
TQIYTHVLGRGASAVRSPLDGLHLSGG
>gid:102153  XCC0399  IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>gid:102154  XCC0400  IS1404 transposase
MPAAWLQRGLVLPVAQQVRWDERARCQAAQGPRVRERAAEEVAGRAVVRE
RPDQGCTAKKVVSAPARRTLVREWIGRGASERRALAVIGMSASALRYCPR
EDRNGELRERICALAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQ
LQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIV
DDATHEVVAIEVERAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKA
MVAWAHARNVQLLLIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHAR
TEIERWRREYHEDRPKKAIGGMTPAAYAQHLANTDIINPGL
>gid:102180  XCC0426  IS1479 transposase
MQLMFSDTESLGKRKQTRHEIFLAEMEQVVAWQKLLGLIAPHYPASGRPG
RQPYALATILRIICCSRGMR
>gid:102275  XCC0521  ribonuclease
MALSYPTTGTRFTLSTNTALVGGERCTLAITASAIRDASGLSPAGNQSIA
FTVATASGGGTGYYSRVNTTSPSQLRCSLNATIRGHTVYPYSGTGTSTWT
ILEMADEDPNNSGRILDAYRNRSYAKVSDRAGTGSGLTYNREHTWPNSLG
FGSATGDRGLPYAPYTDTHMLYLTDTTFNADRGNKPYAACTSSCGERVTE
VNDGSGGGSGRYPGNSNWVRTPDGNGGTFEVWGRRKADMARAVMYMAIRY
EGGTDAATGQSEPDLELTDDRSRIVQTSASPAYMGLLSTLLAWHQADPPD
DAERARNQVVFSFQGNRNPFVDHPEWATSSLFSSAKPASCQLAN
>gid:102278  XCC0524  helicase
MSTLPAPSSGGVYPSGQPGSHRYSHHQGKFFAHWLTLRSRDESAVARALS
AARVDMNPHQVDAALFALKSPVEAGVLLADEVGLGKTIEAGLVLAQHWAQ
RRRRLLLIVPATLRKQWTQELEEKFQLPSVILESKSFNAFRQLDVSNPFQ
IEDAIIVTSYEFAASKQKELAAVSWDLVVLDEAHKLRNLYKGKSASKRAV
ALNEALRGRRKVLLTATPFQNSLMELYGLVSFISDEFFGSQKAFQMQYAS
GRSSEARLNDLRHRLKPICHRTLRRQVQAEGGINFTKRFSITQDFTPSDE
ELELYDKVSAYLQDPSILAIKPTARHLVTLVVRKILASSTFAIADTLETI
IGRLESNQALTAKQIEDFETVDELSDELDSDGMERQGDAPDSDALQAELF
CRAQEIDRLRGYRDLAKQIQSNRKGDALLLVLGRALAMAEKLGGQRKAVI
FTESRRTQEYLRVLLEEHGYAGRTVLLNGSNDDVDSRQLYADWLERHAGS
TLVSGSRTADMKAAVVEAFRDHRDVLITTEAGGEGINLQFCSLLVNYDLP
WNPQRVEQRIGRIHRYGQKSDVVVVNFVNRKNRADQLVFELLEKKFKLFD
GVFGASDEILGAIESGVDIEQRILRIYQNCRDAAQIESEFATLQAELDES
IGRREESARRMLLEHFDEDVVRNLRSRRAGMLHKINQYESQFLHLIAAER
PKARISDKLVKLEEGDYAVSWPPAEEANAGLLRPDKGLGERLCRQAKERP
TLPGTLHFDYAAVDAQRADVRQWLGSRGQLRVALVTIQTAEEVLEELVCS
ALCSDGRVVPDETAARLMEIPARFVPRHRVAEDRALPAFEAAVERVLAEA
NKRNESWFLQESERLDRWGEDQRLLLQQSIDEFDLQVRDAKRTLRQLETL
EQKAQLKREIKRIEQQRDDAMLDFFEGRKRIERSQDVMLERVENALRTQH
TVQVLFDVEWTLEHPEP
>gid:102287  XCC0533  ISxcd1 transposase
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>gid:102288  XCC0534  ISxcd1 transposase
MCELTSISERRACRLAGISRDAFRHAPTPTPATQTLSARLVELAQARRRF
GYRRLHDLLRPEFPQVNHKKIYRLYREAKLSVRRRKKAKFPAAQRQPLRP
ARHPNEVISRP
>gid:102289  XCC0535  IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>gid:102294  XCC0540  conserved hypothetical protein
MTMQTPLLRGLPAEIAADCRVLVLGSMPGSASLHAHAYYAHPRNRFWPVM
QQLLGIDADAPYDARLQQLAERGVGLWDVIGECARRGSLDAAIVPGSIVV
NPLPERLATLPQLRLVVCNGSAAAQAWRRHVQPALSPPLTRLPVQAVPST
SPANAAWSLPRLCAAWQPVRDALR
>gid:102314  XCC0560  IS1404 transposase
MAQQVRWDERARCQAAQGPRVRERAAEEVAGRAAVRERPDQGCTAKKVVS
APARRTLVREWIGRGASERRALAVIGMSASALRYCPREDRNGELRERICA
LAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQLQVRRRKRKKVPI
GERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIVDDATHEAVAIEVE
RAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKAMVAWAHARNVQLR
LIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHARTEIERWRREYNED
RPKKAIGGMTPAAYAQHLANTDIINPGL
>gid:102315  XCC0561  IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>gid:102324  XCC0570  IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>gid:102363  XCC0609  IS1479 transposase
MRAKVEHPFRVIKRQFGYTKVRHRSLAKNTAQVLTLFALSNLWLKRKQLM
PVVGTACL
>gid:102364  XCC0610  ISxcC1 transposase
MVCVARSTVRYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHV
WNHKMVWRVYCLMKLNQRRRSRRRVPARHPQPLACGAHPNAGWSIDFMSD
ALWDGRRFRTFNVIDDFSREALAIEVDLNLPADRVIRTLERIAAWRGYPG
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEHWLADYNQQIPHDSLGGLTPAEFRDQHQPQTS
SFGWH
>gid:102365  XCC0611  ISxcc1 transposase
MRKSKFTESQIVATLKQVDGCRQVKDVCRELGISDATYYVWKSKYGGTEA
ADVQRLRDLETEHSKLKRMYAELAMENHALKDVIAKKL
>gid:102366  XCC0612  ISxcd1 transposase
MKTSRFTDRQIIAILKQAEAGTPVPQLCREHGISSATFYK
>gid:102392  XCC0638  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:102464  XCC0710  conserved hypothetical protein
MPAARQQRGAAVEAAARAQLEQAGLRLVAGNANYRGGELDLVMRDGPMLV
FVEVRYRRDARFGGGAASVDFRKRRKLVLAAQLFLAAHPALAALPCRFDV
VEASGEPPLLHWIRDAFRLDDC
>gid:102592  XCC0838  MutT-like protein
MRQPALWPAHAPAPAGRQWRRRVAATSRFPVPRSLSMNRHDTPPTVVYEG
KYQRMVVRGTWEYSERVHAGGLAAIIVAVTPDDAMLFVEQFRVPLQARTI
EMPAGLVGDIHADESIELSAIRELEEETGWTADHAEVLMIGPTSAGASSE
KIAFVRATGLRKVGAGGGDASEDITVHEIPRAQVGAWLVQKMAEGYQMDP
KLWAGLYLVDHALDGTPRG
>gid:102747  XCC0993  conserved hypothetical protein
MVSFDATEALTPYREGRGYGAILFDRERLRQADAGLFSPQRWGDRARPVD
EGGRGGAWFVDAPFGHSVLRQYRRGGMAARVSRDQYLWKGAGRTRSFAEF
RLMRELLKRKLPVPRPLAACYLREGLGYRAALLMERLENVRSLADHAQVA
GRGAPWEDTGRLIARFHRAGLDHADLNAHNILFDAGGHGWLIDFDRGVLR
IPATRWRERNLARLHRSLLKLRGNRTREDVDKDYERLHRAYELAWGRGY
>gid:102794  XCC1040  conserved hypothetical protein
MTDPTSPGHRALRRGRHSLAGHCYLLTTTTHQRQRLFDDPRLAASACGAF
TKAAPADATLLAWVLMPDHVHWLLQLGHHTPLARAVACLKAASRRAVNTQ
RAMQAPVWARAYHDHAVRHDADLRAVARYVIANPLRAGLVQRIGAYPFWD
AIWLG
>gid:102821  XCC1067  type III restriction-modification enzyme, helicase subunit
MPGQGQNQGLKIVSDQFFKQPILNSPYGYPSLHWELDEKGQPTQQVVKSR
RASSFVSPIPKPRRHQGEQATLALDEVESLADDGQRYRHSELINSVRREV
DAWRLLPPAQWRVTPETARLLEHWRNHKFAGVRPFFCQVEAAETAIWLAE
VAPQLGKNGERFLDHLKKASTDANPGLMRLALKLATGAGKTTVMAMLIAW
QTVNAVRHPQSKKFTRGFLLVAPGLTIKDRLRVLLPNDADSYYANREIVP
RDMLADLDKAKIVITNYHAFKRRERMELSKGGRRLLQGRTGSELETLETE
GQMLQRVMPELMGLKNILAINDEAHHCYREKPHAAVDDEDLDKDQKAEAE
DNNEAARLWISGLEAVNRKLGLQQVMDLSATPFFLAGSGYVEGTLFPWTM
SDFSLMDAIECGIVKLPRVPVADNIPGAEMPIYRELWKHIGKKMPKKGRG
KNAQLDPLAIPVELQTALEALYGHYLKTYEAWKQAGINVPPCFIVVCNNT
ATSKLVFDYISGFERTNEDGSSTRVPGRLELFRNFDEHGEPLARPNTLLI
DSEQLESGEALDDNFRGMAADEIERFKREIIERTGDRGQAENLSDSELLR
EVMNTVGKQGRLGEQIRCVVSVSMLTEGWDANTVTHILGVRAFGTQLLCE
QVIGRALRRQSYELNEQGLFDVEYADVFGIPFDFTAKPVVVTPPKPRETI
TVKALRPERDHLEIWFPRVQGYRVELPEEQLEAEFNDDHHVSLTPDRVGA
TKTHNAGIIGEAVELDIKHLGDVRQSTLLMELTKHLLFQHWRDKGQDAPI
ALFGQLKRIVRQWLDECLECKGGTYPAQLMYRELADMACQRITKGITAKE
LEKGRQVKAILDPFNPTGSTAHVRFNTSRPGSERWETLGVENQPKNQVNW
VILDSGWEGEFCRIAESHPKVLAYTKNHNLGLEVPYRFGSANRIYIPDFI
VQVDDGRGKNDSLNLIVEIKGYRREDAKEKKSTMDTYWIPGVNHLGTHGR
WAFVEFGDVYEMQDDFAKEVEAKFNQMIETAVPAPGNKEY
>gid:102822  XCC1068  possible DNA methylase
MATKKTEKHGKSIEQITHTEAKRKNIPTVEHQSVMQHHEQAPVQVAYPRA
NRQWLEELCALHDAGKVSQEFQQRLNRDLDPQLIWRGKDQQDWSDLVVNA
PPLYIQEKVKPKALIDDLRRQTDARREAAAPQQQDMLDLFGDFNGLPEGA
DRTEFYQHEGHWQNRMILGDSLQVMASLAEREGLRGKVQCIYFDPPYGIK
FNSNFQWSTTSRDVKDGNTGHITREPEQVKAFRDTWRDGIHSYLTYLRDR
LMVARDLLTESGSIFVQIGDENVHRVRAVLDEVFGEDNFVSMIQVQKTGS
QASNLLANTVDFVLWYARTKKKVKYRQLYSDRTAGHVSSDRYDQIELDDG
EERRLKRDELDRKVEIPPGRIFRQTSLISSGQASSEQIFVFQNKQYRPGA
SNHWKTTVSGLGVLAKAGRIAAGASTVHYKRYLNDFSVLPIADRWESLQI
GTGLIYVVQTATSVVERCLLMASDPGDLVLDPTCGSGTTAYVAEQWGRRW
VTIDTSRVALALARARIMGARYPYYLLADSREGQLKEGEITRCAPSSQPA
WGNIVHGFVYERVPHITLKSIANNAEIEVIWDKFRTTLESLREKLSHALG
KQWKEWEIPREADAKWSHTAKELHADWWQQRVACQKEIDASIAAKAEFEY
LYDKPYGDKKKVRVAGPFTVESLSPHRVLGVDEEDNLIDHVAETQAEYGQ
DFASLILANLRTAGVQQARKADKIEFTSLEPWPGELVCAEGRYLENGQVK
RAGILVGPEFGTVTRADLVDAAREAGDANFDVLIACAFNYDAPASEFSKL
GRINVLKARMNAELHMAEDLKNTGKGNLFVIFGEPDVDVLDTQGHSIRRY
DGKRDVIEVPADGQLVVRINGVDVFHPSTGEVRSDGADGIACWFLDTDYN
EESFFVRHAYFLGANDPYKALKTTLKAEIDPDAWATLNCDTSRPFPKPSN
GRFAVKVINHLGDEVMKVFKVN
>gid:102823  XCC1069  putative DNA helicase
MEFRIADTFTTSLARLTGDEQKAAKTTAFDLQVNPSGKGMSFHKLDRAKD
LNFWSVRVSRDIRLIVHKTAGSLLLCYVDHHDRAYQWAERRKLEVHPTTG
AAQLVEIRERVEEILVPKYVEDRASATRPKPKFFEKYSDAQLLAYGVPQE
WLGDVKASDEDSLLDLADHLPGEAAEALLELATGGTPALPEVASKGADPF
QHPDAQRRFRVMSDMDELVRALEYPWDKWTVFLHPAQRQLVERNYNGAAR
VSGSAGTGKTVVALHRAVHLAHKDEDARVLLSTFSDTLANALRGNLYRLI
WNTPKLGERIDVAAMEAIGIRLYSAEFGKPVFASRDEISTLLKAAAMQVD
GLKANTAFMLSEWEDVVDTWQVDDWESYRDAKRLGRKTRLPEVQRALYWQ
AFAQVKSQLEQAGKITAAEMFARLAEVMPKRKHPVFDYIVVDEAQDISVQ
QLRFLATISGNRANALFFSGDLGQRIFQTPFSWKHLGVDVRGRSRTLNIN
YRTSHQIRLQADRLLGPDVSDVDGNVESRKGTISVFNGPEPTICSYADAD
AEIQAVGVWLKQCNSSGVLPQEIGLFVRSESELSRAQAAVKAADLQGRAL
GKDMATEESFVTITTMHLAKGMEFRVVAVMACDDEIIPSQMRIDTAADEA
DLTEIYNTERQLLYVACTRARDQLLVSAVKPESEFLRDLVQE
>gid:102837  XCC1083  conserved hypothetical protein
MDLFDTPLAPLQVLDDAEGGVRYWPQLLAPAVAQAAFAALRDGADWQRHQ
RTMYDRVVDVPRLLASYRLDAPLPPGLPLQLLLAAVQAQLPAPYNAVGLN
LYRDGRDSVAMHHDKLHTLLAPHPIALLSLGTPRRMQLRAKQGATRAITL
ELAPGSLLAMSHASQLTHEHGIPKTTRALGERISVVFRVRPPARMAAGQH
GPHWEALTQTD
>gid:102853  XCC1099  conserved hypothetical protein
MPLDALLAARTVWRAGHGTATANGGESTGHAALDAVLPDGGWPRRALTEL
LLPAHGIGEIALLLPTLARMTGAGSRVVLVAPPYVPYAPAWQAGGVALQQ
LEIVQAEPRDALWAFEQCLRSGACAAVLGWPQTGDARALRRLQVAADSGN
CCAFALRDRRHAVNASPAALRLEFLPERDAWQVRKCRGGQVPSQPLRLAH
>gid:102854  XCC1100  conserved hypothetical protein
MLWACILLPQLALDDVLRRREDTQAPLALVEGPAQLRSLHAVNAAAAAAG
LKPGMRLSAAHALMAEVQTCDYDPQAEARCQRFLASWAYRHSSLVSQQWG
RAIVLEAGASFRLFGPWPRFERRLREELQALGFQHRLALAPTPRAARVLA
GLRDGMAVTQLPALQALLDKVPVRRAALPGDAGERLQHMGVRTLAALRAL
PSEGVRRRFGGALLDHLDRLYGQADDPLECYAPPDHFDQRVELGYEVETH
PALLFPLRRLIGDLCTYLSIRDGGVQRFLLRLEHEEGATDVDVGLLTPER
APALLFELARNRLERVEIPRPVVAMRLLAKQLPPFVPAMRDLFDQRAQQS
VDWPQLRERLRARLGDEAVYRVLPADDPRPERAWQKAIGDDIREAAAPPR
PPRPTWLMPLPVPLHDPHLRIVSGPERLESGWWDDAEARRDYYVVETSRG
RRAWVFASPGRTDGWMLHGWFA
>gid:102885  XCC1131  DNA-3-methyladenine glycosylase
MAFTHLARSDRALGAWMRRIGPIAPQPGWRKPFDPVDALARAILFQQLSG
KAAATIVGRVEVAIGASRLHADTLGRVDDAALRACGVSGNKALALRDLAR
REALGEIPSLRKLAFMEDDAIVEALVPVRGIGRWTVEMMLMFRLGRPDLL
PIDDLGVRKGAQRVDKQAQMPTPKELAERGERWGPYRTYAAFYLWKIADF
SIAAKVPTPRSQE
>gid:102931  XCC1177  IS1480 transposase
MRQRLHRRTVCRRRARDLGEQLTVQIAKRSELHTFKVMPKRWIVERSFAW
LEKNRRLWKNCERKLNTSLQFIHLAFLALLLRRS
>gid:102963  XCC1209  IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>gid:102999  XCC1245  conserved hypothetical protein
MSALAHRAPVHAWIGGGASERRTLAAIGIGTSALSYCLRDDNNFELRWRL
GALAHRRRRYGVGMIDPKLRRQKRIVKYKRGAWLYPIGLPPQWKRENRRA
EKIVLECSESPFRCCKHRPWRPAGRASGTGGMVRTRSCAKAA
>gid:103002  XCC1248  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHTLLHGKEDSVFGDSGYTGADNREELQDCKAAFFIAARRSTLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:103180  XCC1426  conserved hypothetical protein
MRLIDSHCHLDASEFDADRAAVIARAKAAGVMQQVVPAITAASWPGLREA
CALAPGLHPAYGLHPIFLDLHRPEHLELLAEWIARERPCAIGECGLDFFL
DGLDAQTQRHYFDGQLQLAKRFDLPLIVHARRAVEEVIARIKAVGGIRGV
VHSFAGSPEQAQQLWKLDFMIGLGGPVTYPRANRLRGLAAQMPLEHLLLE
TDAPDQPDAEIRGQRNEPARLRTVLDCIAQLRGEPAEAIAAQTSANARRL
FGLPA
>gid:103207  XCC1453  helicase
MKLRFWKGESPETIPMDAPEFSAESSGWKGTAQEKSTHPGAIYLLQLADE
RFALKDGAGYLIPWEKLYSLLSDSDHVSSLHLLKLPQTSRLRPAIRSHGT
PTDTNFRVTLEAWIAESGEELLAERLGAVLTMPGQDLLLPQSAFTLLEAM
AELAQCGEDWDADRRMLQMGKVQEAARWAGATMDRYLEHSPVVVANKLEI
NLKQHEAAEAQIVEVEPRPIGAPEGWLSQFDRYQSVRGRYDVTDAEGGMS
HVVLSSQVREVGQQIKSMPGRRLAGKQADAFMHNPYAVLGEAAAQVISPE
RFAEAKRTAGIDEWELELQPADVDGSWDAVLVDSVGKSESVFVGSFHASE
FDSLLEEASGSEGLGVARWKKHRVLLSGLTLESLARLRQAHFKEAVGSVI
GVETLFDLAHYSDRVVGFDGKPIIVPKVQGSSPAEDWIKGAGEFVAVDPA
TGTATEGRLSTNDVQEFGERVDLAEREGHINVAVPGIEKEIPTPEARAWM
KELTKPAGEKRIESLKNPAPAEKLSLRILHNIEELEYPSEETEQARVKDV
YEAPAALRPEVALKDHQVEGVAWMQGRMRQRGDGVRGVLLADDMGLGKTL
QALTLMAWYRQTAPAPKPCLIVAPVSLLENWKAEIAKFLDGRQGATLTLY
GEHLALHRLSAREIGPELRELGVKKALNPGFAHGAAFVLTTYETLRDYQL
SMAREKWGVLVCDEAQKIKTPSAMVTRSAKAMQADFKIACTGTPVENSLA
DLWCLFDFFQPGLLDSLTKFTKVFRQQIELRSEGHEQKVEVLREQISPWV
LRRMKAEVADLKPKTELPCELAMSAKQRGLYAAAARRFRETVDSGEGGDT
AALSLLHQLRQICANPLAAADDRSEFLSLDEHLRHSPKLAWLIEKLQEIQ
ATGEKVIVFTEFRDIQRLIQRAIASRLSYQASIVNGSTSVEAGADDSRQI
IIDAFQAKPDFGVIVLSTTAVGFGVNIQAANHVIHFTRPWNPAKEDQATD
RAYRIGQEKEVFVYYPTVLGDGFESFEERVAKRLATKRKLSNDMLAPEQG
LTLEEFSDLSLGFSE
>gid:103208  XCC1454  IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>gid:103209  XCC1455  IS1477 transposase
MAGQVRRHGGRRREAPEGAGAGEQPPQAVAGRGAPGHRGAEGRVRGKTLA
PQRKREAIRRMCELTSISERRACRLAGISRDAFRHAPTPTPATQTLSARL
VELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYREAKLSVRRRKKAKF
PAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLTVADDFTHECVDIA
VDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTSRAFITWAQQRGIE
HILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQAREVIADWRRDFN
EVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>gid:103210  XCC1456  IS1477 transposase
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>gid:103299  XCC1545  conserved hypothetical protein
MPAPMADHLPPPAAATAALHRHRPAVLLPRSPPAARSSSPSIPAYVKFFA
SCAKGLEYLLADELLALGASKATATISGVNVEGALRDAQRAVLWSRLASR
VLWPLTEFDCPDEDALYAGVAELPWHEHLSTGHTLSVDAHVSGTAITHAR
YAAQRIKDAVVDTIRRQGLERPSVDVESPDLRLNLSLRKGRATISVDLGG
GPLHRRGWRMAQNEAPLKENLAAAVLLRAGWPRAYADGGGLLDPMCGSGT
LLIEGALMAADVAPGLQRYGSDIPSRWRGFDRDSWQQLVTEARERDSVGR
AALKQVIHGSDMDPHAIRAAKENAQVAGVAEAIWFGVREVGDLQTRPQAT
GVVVCNPPYDERLAADAALYRKLGDTLQRVVPQWRASLLCGNAELAYATG
LRAGKKYQLFNGAIECALIVCDPIAVPRRTPLAAPTALSEGAQMVANRLR
KNLQKFKKWRAREGIECFRVYDADLPEYSAAIDVYQQADGDRRIFLHVQE
YAAPATIPEADVRRRLGELLAAAREVFEVPAERVALKSRERGKGGSKYGR
FEQRNEIVNVREHGALLRVNLFDYLDTGLFLDHRPLRGTMAQQSKGRRFL
NLFCYTGVASVQAAVAGASATTSVDLSGTYLQWCADNLALNGQAGSKHKL
VQADALAWLEAERAHFDVIFCDPPTFSNSARAEDFDIQREHVRLLRAAVA
RLAPGGVLYFSNNFRRFKLDEEAVSEFAQCEEISPRTIDPDFERHARIHR
AWRLTA
>gid:103351  XCC1597  phage-related integrase
MRIPHHLSRSPTGRWSFVQRVPVDLQTVMGCRLIKRTLQTKDLAQAHVRA
VVLGAGYARLFAQLTDQRVDKLSKTDADLLIARLTSAENLQDLTLNRTRQ
PDGTVTEQWQIDSPKDLKLYRQLMELEAMAGAALQARAHPVAVPTMFGST
HAGPSRQSSAPAIETMTLGKARDAFLATLKGSTLPKTYTIKKTAIEALVS
FLGPTMKVHAITRSDLARWYQDMREKGASTPTLTNKQSYIGGRGGFFEWA
MASGHYPKGDNPASGHVSYSQREKRARKKLGFKAYDRAQIQALFAPEALA
KLSESARWASFLGLYTGARASEVGQLLVKDVFEEDGIPCIRISDEGEHQK
VKTEVSLRTVPLHPELLKMGFLEWVGGKRKVGETRLFPAAKATAVNGQGN
WITKAFSRHLAEVGKNWEPAKRGFHSLRKTLIQELQGAGVVSELRAQIVG
HELDDEHHSTYGRDFTVVEKLRGLGPHSPGISRLSFFQ
>gid:103359  XCC1605  hypothetical protein
MFRLGASTASNLHRGNPMSVDQRRRDVERHQKEIARLQTEKSREETKAVG
EKKKAFDASAAATRTKSVSTQQSKLREAQRYEGNAVAIQKKIADLETKIA
REHERLGNANRQLSAAQVQEQKKQVQEDKKRDAERKRAMAASAQELSRVQ
GRLTHHDHLHRSTEATLQRLQELPELLTVLFLASNPIDQQQLRLDEEARS
IHEMVRKSEHRDVVKLESRWAVQPLDVLQAINECRPGVVHFSGHGSEEDD
ILFQDSVGRSKLVSKEAIVQTMMAGSGDIQLVFFNTCFSHGQAAAIVEHV
PCAIGMNTSIGDQAARVFAAQFYSAVGFGLSVAGAFEQARAALMLEWIPE
AHTPELFAGPGVDPSQVFLVRPPG
>gid:103360  XCC1606  ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYADLAMENHALKDVIAKKL
>gid:103361  XCC1607  ISxcC1 transposase
MVGVARSTARYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHL
WNHKRVWRVYCLMKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSD
ALWDGRRFRTFNVIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPN
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTS
SFIWH
>gid:103363  XCC1609  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:103367  XCC1613  phage-related integrase
MTNFRAGLSLLPLKQWKQAMSSIVQQQLRDYITSPFGLLQIKGAHVGNIR
ATEINITGREAAIHALELFDRADERSRRVRLSKNRTLYPEPLPLGSPARL
LSVEISDYLGHRDRCGLAKETVEDTARSLKLLRIACGDVPVSRIDHAHIY
RLWDLMRWAPPLLLSDPKYQAYTFEQAVALGKELGVAPPAPATLEKHRRF
LVTFFSKLVKAKAIPMSPMDAFAEIKKDLVVDTSKPERLFDEEELQRIFS
PKTFPAWAKKYPHRWWLPMISLYTGARINELAQLKVADIVEEAKVWCIRI
QKTVDADLRHKDRDRSRQSLKGKAAVRTLPIPKPLLDAGFLDFIEDIKAT
GHPRLFPHLSAGVNRETGETNARYSQGAVNQFSSYMKTLGFGKGIGAHAF
RHTLATELHHKNVSDQDIALITGHSLRKNVPVLHDAYFHKKPKLARATQI
RILAKYKPPVELPKYERGQFSECLADPSKFYP
>gid:103373  XCC1619  RadC family protein
MKRTQDRAVQYQLEMDEEGILLAAATILEQRLQRQGRIHSPDQAGDYLVA
RCAHLPHEVFGVVFLDTKHHILATEHLFSGTIDGCDVHPRVVAKRALDLN
AVAVILFHNHPSGNPEPSEADRKVTERLKQALALLDIRVLDHLVIGGRQH
TSLAARGWV
>gid:103381  XCC1627  IS1477 transposase
MKKSRFSTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>gid:103382  XCC1628  IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRWRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVMRVLEQIACFRGYPRAVRTDNGPEFTS
RAFIAWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>gid:103384  XCC1630  invertase/recombinase protein
MVLIGYARVSTAEQDTALQTDALRKAGCERVFEDTASGAKADRPGLADAL
AYLRAGDVLAVWRLDRLGRSMQHLIETIAALEARGVGFRSLTESIDTTTP
GGRLIFHVFGALGQFERDLIRERTKAGLTAAAARGRKGGRKPVVTADKLQ
RAREHIANGLNVREAAKRLKVSKTALYAALQSTSAANF
>gid:103387  XCC1633  ISxac3 transposase
MCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGH
RKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQAVLSAVWRRKPNTGCLVHSDQGSVYTSDDWRSFLASHGLVC
SMSRRGNCHDNAPVESFFGLLKRERIRRRTYPTKDAARAEVFDYIEMFYN
PNRRHGSTGDLSPVEFERRYAQRGS
>gid:103388  XCC1634  ISxac3 transposase
MSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGK
SGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:103390  XCC1636  IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:103392  XCC1638  IS1478 transposase
MPELPVFATLKAGFWQFPLMHTRRPAAEHMPAEELFRSRLENQIDLRHPL
AQLSQRMPWTALEQALSSRLPATQAGGGRPALPVRLIAGLLYLKHAYDLS
DEAVCERWLENPYWQFFTGEVVFQTRLPCDASSLTRWRQRLDEAGMEELL
AHTINAAHAMQAVDARELSRVIVDTTVQEKAIAYPTDSRLLEVARKKLVL
LAKRYGIGLRQSYARQGPALSRKAGRYAHARQFKRMQRVLRRQRTVLGRV
LRDIARKLDQVEPGVRERIAVWLERAQRLYTQRPKDKQKLYALHAPEVEC
IGKGKARQAYEFGVKVGIAVTACKGLVVGARSFPGNPYDGDTLAEQLEQT
RGLLQDLSVEPTVAIVDLGYRGREVDGVQVLHRGKAKTLTRRQWRWIKRR
QAVEPVIGHLKDDCRLRRCRLKGAQGDALHVLGCAAGYNLRWLLRWIAFL
RAWMRAMGWPSFSAVPLSPMTLGA
>gid:103549  XCC1795  IS1477 transposase
MPSAWFQPGQLLSVAGQVRGDGGRRGQAAERTGGPEHSIEEVAGRGAPGH
RSAEGWLRGKTLAPQRKREAIRRMLEHTPLSERRACRLAGLSRDAFRHAP
VPTPATQALSARLVELAQTHRRFGYRRLHDLLRPEFPSVNHKKIYRLYEE
AELKVRKRRKAKRPVGERQKLLASSMPNDTWSMDFVFDALANARRIKCLT
VVDDFTRESVDIAVDHGISGAYVVRLLDQAACFRGYPRAVRTDNGPEFTS
RAFIAWTQQHGIEHILIEPGAPTQNAYIESFNGKFRDECLNEHWFTSLAQ
ARDVIADWRRHYNQIRPHSSCGRIPPAQFAANYRTQQANNAVPFNPGLYQ
>gid:103550  XCC1796  IS1477 transposase
MKKSRFSTEQIIGFIKQADAGMAVAELCRRHGFSPASFYQWRAKYGGMEA
DEAKRLKELEVQNTRLKKLLAEAHLDIEALKVGFGVKR
>gid:103577  XCC1823  conserved hypothetical protein
MVDAPLRQQLTVYRARWPNESEVADQFEQLLDDATDPFVRERVEGHFTGS
AWVVGADGTRTLLTHHRKLQRWLQLGGHADGDRDLAQVALREAQEESGLT
GLTLADGLLFDLDRHWIPARGEVAGHWHYDARYVVVAGADETFQVSEESL
ALAWRPIAELLADPELDPSMRRMAEKWMVHGGS
>gid:103730  XCC1976  ATPase
MRPRTLDEMVGQKRLLAADSALRRAVESGRVHSMILWGPPGCGKTTLALL
LAHYADAEFKAISAVLSGLPDVRQVLAEAAQRFASGRRTVLFVDEVHRFN
KAQQDAFLPHIERGTILFVGATTENPSFELNSALLSRCRVHVLEGVSPQD
IVEALQRALHDAERGLGQETIQVSEASLLEIASAADGDVRRALTLLEIAA
ELATGEGGEITPRTLLQVLADRTRRFDKNGEQFYDQISALHKSVRSSNPD
AALYWLTRMLDGGCDPAYLARRLTRMAIEDIGLADPRAQSMALEAWDIYE
RLGSPEGELAFAQLVLYLASTAKSNAGYAAFNQAKAEVRASGTQEVPLHL
RNAPTKLMKTLGYGQDYQYDHDAEGGIALDQTGFPDAMGERVYYNPVPRG
MEIKLKEKLDRLREARAQARADKGKAGN
>gid:103772  XCC2018  hypothetical protein
MFLIQVLLPLADNNGVRFEQAMFDEVHHHLAMRFGGITAYTRAPVHGAWQ
EQGAQLVHDDLVIYEVMADDLDRGWWRSYRAELEQRFRQEQLIVRAQEIT
LL
>gid:103803  XCC2049  conserved hypothetical protein
MARARQERNACSRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQTLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:103849  XCC2095  DNA helicase related protein
MHVTGGGTTGDSMDETQPQASAVAAELKIDVTLIAKLNLADFQNAVPLVR
DLWLINETDQVHEQVELVLTSDPPFLKPRRWRIDALAAGSRYPIRDLDVN
LDGGLLARLTEAETATVSLALRSVAADASNTALAQRDTHLELLPRNQWGG
ISHLPDLVAAFVQPNDPAVDRLLKRTAEVLRQNNRNPALDGYTGGAKRAW
ELASALWGATAGMQLDYALPPASFEQSGQKVRSPSQIESSGLGTCFDLTL
LFCAALEQAGLNPLLVFTEGHAFAGVWLQSEEFSNTVVDDVTALRKRLRL
KELVLFETTLITQRPTVPFSYATDRGAQQVDESQDAGFRLAVDIRRARLQ
RIKPLASAEAAVAAVSDSEATLSSPAVSVIEDAPDLPDSPPFKTEDASQL
DPKDRLLRWQRKLLDLSLRNNLLNFKAGKKALKLEAPDPSTLEDLLASGQ
SLKLRPRPDLMDGADPRDQAIYEARERENVRRAHALEALQKREVFVGVPE
TELDSRLVDLFRSARTTLQEGGANTLYLALGFLSWTREDRDGQRYRAPLV
LVPVSLQRKSARSGFTLSLHDDEPRFNPTLIEMLRQDFELNLGAVEGELP
RDDAGLDVTAVWKAVGHAIKDIKGWEVTEDVVLSMFSFAKYLMWKDLAEH
SEQLRENPVVRHLLDTPRDAYPPGAPFPQVRELDQHFDPKQVFCPLPADS
SQLSAVLAASQGKDFVLIGPPGTGKSQTIANLIAQSLAQGRRVLFVSEKI
AALDVVYRRLREIGLGEFCLELHSSKARKLDVLAQLQSAWSSSGQTDAEQ
WRAEAEKLKRLRDALNIYVERLHQRRRNGLSLFDAIGTVSAGHDIPTLPL
AWLSADQHDHAGIDQLRSAVDRLEVNAQAIGHAALAQHPLALVGHRDWSP
TWQQQLIAAARDVLPAAQATIESAHAFVQAIGLPSPLLTPETCEALLLLA
QRLTLAAGHDWRFVLRPDARSLSQRLQEGAARVRRHAELNTLLSTPWPAS
VITACADGLALLTEHRQTHAELGEPWPVRITVQLNQALGLLAQLSEHHAA
LSVPYGKTIEQLDVAQLQQMWEQAEQTFWPKSWLGKRKVTTQLSSATTGG
SQPDVANDLQHWNAIRALRQRIQAIDPGQQCADVWAGLDTQQDKVSTALR
WQIALAAVLEGQAWEDDGFDAIAGGQCGATLQADLQRARRLRQLDQDIAA
HASLETATDGLWAGHATQFNCLRAALDFLSDWRSHAQQGALDAHTLVEEG
ACGPTLARDHQTLRQRADMEQALAALDDLRESTAGLWKGLATNLDDLEQA
CQLREDLAAVLARLATTPEHISACKAPLHTLLGDANALLEPGGRIALAGA
RYVEKWEQLLPRREALATTGHFAEAAQTQWQSMSLDGLIEQSQSIVRAEH
GLRSWCAWRQARDEALALGLATLVQGIKQGQVGPDQARRTFEANYARWWL
NAVVDHEPVIRGFVSAEHEQRIRDFRELDERFTALTRDWLRARLCADLPS
QDNVSRNSEWGLLRHEMGKKRAHLPLRELMAQIPEALTKLTPCLLMSPLS
IAQYLQAGANAFDLVIFDEASQIPVWDAIGAIARGHQVVMVGDPKQLPPT
SFFDRAESGLDDEDVEADLESILDECIGANLPTRNLNWHYRSRHESLIAF
SNHAYYDGGLVTFPSPVTNDRAVSLQPVSGTYQKGGTRTNPAEAKALVAD
VVARLTAPGFRESGLTIGVVTFNAEQQKLIEDLLDEARRQDPRLEPYFAE
SELEPLFVKNLESVQGDERDLIYFSITYGPDPAGQLAMNFGPLNRQGGER
RLNVAITRARHELRVFASFHAEQMDLARTQAIGVRDLKHFLEFAERGARA
LAEANCGSLGGFDSPFEQAVAAALARRGWHVQPQIGASSFRIDLGIVDPD
APGRYLAGVECDGATYHRSATARDRDKLREQVLRGLGWDIVRVWSTDWWI
DPAGTLDRLDARLQAVLIAQREQRAEQAERDAEAESLAQAAIAQAIASVT
KPDGEMAPPAQDADPIAPEVSATAPSQQVEEVFARQVSAEAAHANAEETT
PPEASLYRITDPAEAVTGANPDRFFDGEYNDILLTMIAHVVDHEGPVLDA
LLARRIARAHGWLRTGGRIRERVFQIARPRYRTTDEEVGTFYWPEHLDPA
TEPPFREPADEDSVRAADEISIAELASLARAVIAQGTQGEGIYQAMARRL
RLQQLRAASRARLENVVRSLRAEP
>gid:103852  XCC2097  Tn5041 transposase
MPRHSMTCSNDRITRADGYYHVSDKYVALFSHFIPCGVHEGIYILDGLLA
NTSDIQPEIVHGDTQAQSYPVFGLAHMLGIQLMPRIRNIKDLTFFRPEPG
RAYKNIQALFGDNIDWQLIATHLHDMLRVVISIRLGKITASSILRRLGTY
SRKNKLYFAFRELGKAVRTLFLLRYIDDNKIRKTIHAATNKSEEYNGFVK
WVFFGSQGIIAENVQHEQRKIIKYSQLVANMIILHNVEGMSRTLAEMRKE
GVELTPEILAGLSPYRTSHINRFGDYHLDLEREVAPLSYTAKVLEQAP
>gid:103851  XCC2098  ISxcd1 transposase
MIRSLEQVIEWRGKPRVIRCDNGPEYISAALLAWAERNGIRIEHIQPGKP
QQNAYVERYNRTVRYAWLARTLFDTIEQVQDKATRWLWTYNHEPEHGARR
HHASDETGNGCITPLLESAKSGGITN
>gid:103857  XCC2103  YeeB-like protein
MEGNPTNKYTVPSVSIATAQTGASSKINALGMRPMQEKAYAKRGEQHLLI
KSPPASGKSRALMFIALDKIKNQGLKQAIIVVPERSIGGSFADEKLSEQG
FWADWMVRPQWNLCNAPGEDNGKVAPSKVKAVGAFLASEDPVLVCTHATF
RFAVDEFGIEAFDGRLIAIDEFHHVSSNPDNKLGTQLGQLIGRGQVHVVA
MTGSYFRGDAVAVLSPEDEGKFESVSYTYYEQLNGYTYLKSLDIGYFFYT
GRYLDAIMKVLDPSLKTIIHIPNVNARESLKDKHKEVDEILASLGDWKGR
DEATGFHLIEIEGGRIIRVADLVDDSDAARRSKILTALKDPAQKDNRDNV
DIIIALGMAKEGFDWIWCEHALTIGYRSSLTEIVQIIGRATRDAPGKGRS
RFTNLIAEPAADSEIVVDAVNDTLKAIAASLLMEQVLAPRFEFTPKNAGE
KEGFDYGDEGYQEGNANVGVNAATGEVHVEINGLAQPKSPEAARICKEDI
NEVITGFIQDKPTLERGLFDKENTLPEEITQVQMAKIVRDRYPDLSEEDH
EAVRQHAIAVLNITQQAKQVIAQVDAKGESPNMSLIEGVRKFVNVKDLDI
DLIDRINPFDAAYAVLAKAMDERVLRQVQSAIAAKRLAISYEEAKDLAKR
AVAFKNERGRLPEIGSADPWERRMAEGVASLQRHIAQQKAAAAQGGGNG
>gid:103859  XCC2105  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:103861  XCC2107  IS1595 transposase
MSINAVQFQAGLSMPEFFASYGTEAKCYRALYKWRWPQGFRCPVCAGRVR
SRFKRGAAIYYQCSACRHQTSLMAGTMFEGTKLPLRTWMLALHLLTSTKT
NMAALELMRHLGVNYKSAWRMKHKIMQVMAERESTRKLAGFVQIDDAYLG
GERNGGKAGRGSENKQSFLIAVQTDDTFTAPRFVVIEPVRSFDNPSLQDW
IARRLAPGCEVYTDGLACFRRLEDAGHAHTTLDTSGGRAATEATGARWVN
VVLGNLKRAISGVYHAIAQGKYAKRYLAEAAYRFNRRFRLREMLPRLATA
MMQSTPCPEPVLRAASNFHG
>gid:104070  XCC2316  3-methyladenine DNA glycosylase
MSLHSPLPRAFYAADARTVAPLLLNKVLVSADGRRGRITEVEAYCGSEDA
AAHSFRGMTPRTQVMFGAPGHLYVYFIYGMHWAINAVCGGAPGHAVLIRA
LEPLAGCDAMHAARGAAPFKSLTTGPGRLAQAFGVSAVDNGLDLTTGVAR
LWIEDDGTPPPAAPLAGPRIGIRKAVELPWRWVVPGSAYLSRPLPRVSGA
RASVTGD
>gid:104146  XCC2392  conserved hypothetical protein
MSRAARPASGAAEGQVRIVGGRWRNTRLSVPQLPGLRPSSDRVRETLFNW
LLPRLAGARVLDLFAGSGALGLEAVSRGAAHALLIERDPGLAQRLREHVA
RLGAAEQVQVLQDDALRWLERAPTGQVDLVFVDPPFAAGLWAPVLERLSP
HLAADAWLYLETPAELPPQVPPGWHLHREGATREVRYALYRRAAATLNGD
PTPVASV
>gid:104181  XCC2427  excinuclease ABC subunit C homolog
MVGRAERAKRSWGAPDYTYPEHLRAELDTLPATPGVYLFHGQSSTLPLYI
GKSIHLRNRVMDHFRNAAEASLLRQTRSIQVIEMAGDIGAQLLESQLIKT
LRPLYNQKLRRIPRQFSIRLYRGEVSIEHSGEIDPAAAPWLYGLYSSPRA
AKETLRRLADQHHLCYGLLGLERLQAGRPCFRAMLKRCSGACHGAEPLDA
HEERLRSVLQHLEQAAWPFPGAIALKEQGAQRTQFHVLRDWHYLGSATSL
AGARRLQATPGAFDRDCYRILRKYLETQLHCVSLL
>gid:104187  XCC2433  conserved hypothetical protein
MIVPGGFPAPVTGSAVAASPPLPTVRLLSLDAHGRVLDWINWQDAACLYA
RDAVSWTLGEPCMQIHGGVSRLTGERSVLELHPIIAARGHARSRALDPTP
TLTNTALFARDSQLCMYCGQHFSRPHLTRDHVMPVSKGGRDSWENVVTAC
FQCNSRKANRTPQQAHMPLLAVPYRPSWIEHLILSNRNILSDQMAFLRAQ
LPKRSKLSL
>gid:104224  XCC2470  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNTDHARDPEMRQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:104226  XCC2472  ISD1 transposase
MHYGAWTSSVTHCFDNRRFRSMTVVDHFTHEWPDIVVDQSLKGDDVADAM
TRLVAQRGKPTAIKVDNGSEFSGKVHGSVGIKNRVEPDFSGAANQPITPW
WKASTDTCGRSASTRVGSCHWPTHKARSNNGDASITRSGPTVHWHGKLLK
N
>gid:104227  XCC2473  ISD1 transposase
MQGRPMQNGFIERFNGSYPRGVLNMHIFSTLSEIRKQTEHLLADYNQ
>gid:104244  XCC2490  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:104313  XCC2559  conserved hypothetical protein
MLPTATGNPQVRVRSIDVLSDNWYVLRKVTFDFQRKDGRWQTLSREAYDR
GNGATILLYSRARQTVMLTRQFRLPTLLNGNPDGMLIEACAGLLDQDDPE
ACIRKETEEETGYRIENVRKVFEAFMSPGSVTERLYFFVGEYVDGDKVSA
GGGVEEDGEEIEVLELSLDAALAMIATGGIADAKTIMLLQYAKLHGVLD
>gid:104332  XCC2578  methylated-DNA-protein-cysteine S-methyltransferase related protein
MPSPRPSKTRVAGSHAGDASATRAAEQVRLRILDVIRAIPAGEVAGYGEV
AMRAGLPGRARLVAKLLSSNQDAALPWHRVLRSDGRIALPEGSAGYQAQC
QRLRAEGVPVERGRVRRATAAQRLDAAVWGPS
>gid:104461  XCC2707  ISxac3 transposase
MVDSEVSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:104462  XCC2708  ISxac3 transposase
MQAHCGEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>gid:104463  XCC2709  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALARMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHTLLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:104501  XCC2747  conserved hypothetical protein
MPEAGAIRPDATVLGFDVGSRRIGVAVGTALGAGARAVAVINVHANGPDW
VALDRVHKEWRPAGLVVGDPLTLDDKDQPARKRAHAFARELRERYALPVV
LIDERSSSVEAAQRFARERADGRKRRRDADTLDAMAAAVIVERWLSAPEQ
ATLLP
>gid:104505  XCC2751  hypothetical protein
MSDTLRAACEAPVLSAATTPHRTFRMRRLLGIAGLLIAFAAAVPQADAHK
VPRLSLSLSLAQVLDQLRRDSAAVPASDPMPIDTVMKRYADTHGQSFDIA
SPDPEEDPSEAVPPTQPADVTDAEWHALQAYGAHTTSEADDISENRSHHY
TLIDLDEDGQRDLLDEAYVGGTGLFTQITVLQGHTDGFRAPTATPTGTPA
DREADAGFSINGRGGDQALYWLRIDGRSYAAYRDGDYFQDTLTLSRPLSP
LPAERHPTKALQIRYRYQHTLAPPRKDAAERLLEEQQADDWLAQHPAMRA
AVDTQLQHLRLDAQGRQRSPDPEARCPSPAESSDPELEAQWPWHDAGHYT
FDFVANLRVRHGSECYSASVVAFRSSFQSANTACCVLWLYEAPGNQVANL
PLLSKRTRSGIALITAAPVDASQD
>gid:104540  XCC2786  replication related protein
MSVPQLPLALRAPSDQRLDSYIAAPDGLIAQLQAFAAGQLSDWLYLAGPS
GTGKTHLALSVCAAAEQAGRSSAYLPLQAAAGRLRDALEALEGRSLVALD
GVDSIAGQCEDEVALFDFHNRARAAGITLLYTARQMPDGLALVLPDLRSR
LSQCVRISLPVLDDVARAAVLRDRAQRRGLALDEAAIDWLLTHSERELAG
LVALLDRLDRESLAAQRRVTVPFLRRVLGDRTS
>gid:104570  XCC2816  IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:104597  XCC2843  conserved hypothetical protein
MRILLQHDPGGNEPLRYVQLTLQPDLFGGWELLRESGQIGGRTQLRRDQY
LLQDEADRAFEKARDTQLKRGFHVITGGADAPR
>gid:104651  XCC2897  ISxac3 transposase
MCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGH
RKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRSFLASHGLVC
SMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEVFDYIEMFYN
PNRRHGSTGDLSPVEFERRYAQRGS
>gid:104652  XCC2898  ISxac3 transposase
MVDSEVSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:104721  XCC2967  site-specific DNA-methyltransferase
MKNQLLQGDALTILPTLEANSFDALITDPPYASGGLHAAARAKPPSQKYV
QGGGAQLHADFVGDERDQRSHLKWMHLWLSECARVLKDGAPVLLFTDWRQ
LPLTTDALQIAGFTWRGITVWDKTEGVRPQLGRFRNQAEYIVWGSKGNMP
LDRRAPVLPGVIRESVRKADKHHLTGKPTELMRQLVRICEAGGRILDPFA
GSGTTLVAAELEGYGWTGVELTSHYSDVARTRLA
>gid:104761  XCC3007  conserved hypothetical protein
MAESIVVYGPMASGKSLNAEAICQAYGLKRVVELDERLQRKGEDWQLSQN
DVVMLTNDQALAERTAQRMRVKTVAITEARLRVGAAWRALR
>gid:104826  XCC3072  conserved hypothetical protein
MDAPKPWHLYLLLCRNGSYYAGITNDLERRFQAHLRGTGARYTRANPPLQ
VLASHPYPDRATASRAEWLLKQQPRARKLAWLQAQGLLPAESRPDDTPLT
PA
>gid:104867  XCC3113  ISxac3 transposase
MSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGK
SGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:104868  XCC3114  degenerated ISxac3 transposase
MCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGHRKI
TTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRTYPTKDAARAEVFDYIEMFYNPNR
RHGSTGDLSPVEFERRYAQRGS
>gid:104876  XCC3122  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:104877  XCC3123  ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYAELAMENHALKDVIAKKL
>gid:104878  XCC3124  ISxcc1 transposase
MVGVARSTARYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHL
WNHKRVWRVYCLMKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSD
ALWDGRRFRTFNIIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPN
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTS
SFIWH
>gid:104879  XCC3125  IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRRRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTS
RAVDSPQFNAR
>gid:104887  XCC3133  DNA helicase related protein
MDQESIIRYWHAVELLQPQSAPKLKKRSNRYEAFIHDTPIQRPLLPWTPE
SIVSKQKLPKKRIWSHTLYAHLYDSRLVAEKLDAMYGADQGYQEPKFRES
AVFAAKFTAGGRLVDDSFVLSSEAWFLGRVLTGKDWTRGFETDQKTLRER
ANSQFEGEVSSEGLRELTHWTLQFLGLGDFFGEMDHHLFRFRSQPIKPDK
PESEDDPLNSFLLDDLADVVDAISRGVKSEPLDQYLRHHDPKPRLHMDDQ
RASLPLMGRLMPDAYASSCWPTEHHLGLVHSQQLAVNTIQSTLADGHGLL
GVNGPPGTGKTTLLRDLIAAIITSRADTLAKLCRASDAFASDGREAANDG
GKQQYSYKLNPALYGFEIVVASSNNGAVENVTLELPQRDRIDESWLPEAE
YFAELGELVSGKPAWGLISGALGSKARRNKFVDRYFYGQLPFGSEDKAAA
EVEDETEVEPDEEVDNIVEALFSTPSAETEHADDSENQNEDAPPEEDKGP
KGFLGWLNVSVEANADRSPEQRQALWQQAVSDYEAVKAEERKACNDASRI
RELILAILKIRKTIAEDSEKLRAFEQSLIEVANKLSRLDNEEGGPANMAL
KKCIDALEKLPKKPGFWSNLFSLGNARRDWRTARKLLESQHDIAKAEFAR
ITRLTQQLDSDKAQVEGKIAEARRVLQGSQQQDKTLTSDALELATTHQAD
HLLAWLKDNAIGRGDVIELAEPWRIEGWRKARARVFIQALKLHRTFFELE
ASRLRSNLFMINGMLGGSRFQGMSRAAIRSAWASLFMVVPVLSSTFASFA
RSFGSLGASEIGWLLVDEAGQAAPQAAVGALWRSRRALLVGDPLQLKPIV
TVSDAVLEHMRTRYGVDAHWIPNQKSAQVLADEATPWGRMAGPAGGKSWV
GLPLVVHRRCDRPMYALANRIAYDGAMVYGTIAPRADKETRASLLTGWIH
ISGTSEGNWVPAEGQVLRDLLKRLHGDGVEAKDISVITPFKAVQQNLKRM
LPGKMVSGTIHTMQGKEASVVIVILGGNTAGSGARNWAVSEPNLLNVAAT
RAKRRLYVIGDRNDWKHRPLFCDVMDLLPIQDIRPHE
>gid:104888  XCC3134  truncated IS1477 transposase
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>gid:104889  XCC3135  ISxcc1 transposase
MVGVARSTARYRRRPDRDEEVIALLSELAERFPERGFGKLFQIIRRRGHL
WNHKRVWRVYCLMKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSD
ALWDGRRFRTFNIIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPN
KLRLDNGPEFVALALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRG
VLDMHIFRTLSEVREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTS
SFIWH
>gid:104890  XCC3136  ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYAELAMENHALKDVIAKKL
>gid:104891  XCC3137  truncated IS1477 transposase
MCRRPLVMAGSIYGRLTLLRPEFPQVNHKKIYRLYREAKLSVRRRKKAKF
PAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLTVADDFTHECVDIA
VDHGISGAYVVRVLEQIACFRGYPRAVRTRRPPNFE
>gid:104892  XCC3138  IS1404 transposase
MDVKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMS
VPDAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>gid:104893  XCC3139  IS1404 transposase
MPAAWLQRGLVLPVAQQVRWDERARCQAAQGPRVRERAAEEVAGRAVVRE
RPDQGCTAKKVVSAPARRTLVREWIGRGASERRALAVIGMSASALRYCPR
EDRNGELRERICALAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQ
LQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIV
DDATHEAVAIEVERAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKA
MVAWAHARNVQLRLIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHAR
TEIERWRREYNEDRPKKAIGGMTPAAYAQHLANTDIINPGL
>gid:104894  XCC3140  truncated IS1477 transposase
MQNGYIESFNGKFRDECLNEHWFTSLIQAREVIADWRRDFNEVRPHSSCG
RIPPAQFASNHRAQTGNNAVPFNPGLYQ
>gid:104912  XCC3158  aminopeptidase
MLKPLGIAYEPSKGGPGPDVGPISAKGGAWAWLAQDGTDYFDLHHTADDT
LDKIDPKALAQNVAAYTVFAYLAAEADGDFGSRAKSVQPPNE
>gid:104950  XCC3196  IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQSLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:104953  XCC3199  IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDAGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSNLTHVQTLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:104968  XCC3214  IS1478 transposase
MPVFFCALMICNLIVLLRLRRRFPRSGEQGLFALRFCEKQRLATLLPLKV
VAPEKPPITQRPQGLDVCSGVPELPVFATLKAGFWQFPLMHTRRPAAEHM
PAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSRLPATQAGGGRP
ALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTRLPCD
ASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELSRVIVDTTVQEK
AIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPALSRKAGRYAHA
RQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERIAVWLERAQRLY
TQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIAVTACKGLVVGA
RSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLGYRGREVDGVQV
LHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRCRLKGAQGDALH
VLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSPMTLGA
>gid:104971  XCC3217  IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>gid:104972  XCC3218  IS1404 transposase
MPAAWLQRGLVLPVAQQVRWDERARCQAAQGPRVRERAAEEVAGRAVVRE
RPDQGCTAKKVVSAPARRTLVREWIGRGASERRALAVIGMSASALRYCPR
EDRNGELRERICALAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQ
LQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIV
DDATHEAVAIEVERAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKA
MVAWAHARNVQLRLIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHAR
TEIERWRREYNEDRPKKAIGGMTPAAYAQHLANTDIINPGL
>gid:105057  XCC3303  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:105060  XCC3306  IS1477 transposase
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>gid:105061  XCC3307  IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRRRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTS
RAFIAWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>gid:105097  XCC3343  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:105197  XCC3443  conserved hypothetical protein
MDEFDQRSWWTPAPDDPAWALPDDLRAADPLHGRDCGWVNQMRPFVRHFS
APGQQVFDPFCGFGSTLLAAALEGRQAHGMEVDPARAVVARERLRRHAVQ
APVVVGGLAQVPPAAPVDLCLTNVPYFGCHWSGPVAPGQLYASTDYASYL
AGLRAVFHALRRQLRPGGFGVAMVENVVLDGQLVPQAWDLGRILSSLFTL
REERVLCYPRAGGVLAQRGTASNRSHEYALIFQHCRTRLDLQAAAQLLQA
LREQGLPVTPHGSYARWLQAPEQVPDGPADLDLIVPGEQAVWDRLSHWLH
AQGFALSLWGAPCLPAVPLAMVAAHHYLRAERLDADGRRLQVDLQLPMDE
PAQP
>gid:105217  XCC3463  IS1404 transposase
MAQQVRWDERARCQAAQGPRVRERAAEEVAGRAAVRERPDQGCTAKKVVS
APARRTLVREWIGRGASERRALAVIGMSASALRYCPREDRNGELRERICA
LAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQLQVRRRKRKKVPI
GERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIVDDATHEVVAIEVE
RAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKAMVAWAHARNVQLR
LIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHARTEIERWRREYNED
RPKKAIGGMTPAAYAQHLANTDIINPGL
>gid:105218  XCC3464  IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>gid:105227  XCC3473  IS1480 transposase
MARARQERNACTRFLIVDAQSVKNTDTAGQKGYDTGKKVSGIKRHIAVDT
QGLPHAIAVTTAEVTDRKGALQALERCQSKLTHVQTLLCDSGYTGVRFAD
GVREILGEQLTVQIAKRSELHTFKVMPKRWIVERSFAWLEKNRRLWKNCE
RKLNTSLQFIHLAFLALLLRRS
>gid:105234  XCC3480  ISxac3 transposase
MCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGH
RKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRSFLASHGLVC
SMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEVFDYIEMFYN
PNRRHGSTGDLSPVEFERRYAQRGS
>gid:105235  XCC3481  ISxac3 transposase
MSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGK
SGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:105249  XCC3495  ISxac3 transposase
MVDSEVSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:105250  XCC3496  ISxac3 transposase
MQAHCGEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRLTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>gid:105281  XCC3527  endonuclease
MPEGPSLVILREEAAAFVGRKILRVQGNSKQDIARLQQQKVLALRSWGKH
LLIECAQFSVRIHFLLFGSYRINEDKPNAVPRLRLEFSKGETLNFYACSV
QFIERPLDEVYDWSADVMNPLWDAAQARLKLRAAPQLLAADALLDQSIFS
GVGNIIKNEVLHRIRVHPESQVGALPARKLGELVTQARDYSFDFYTWKKA
FVLKKRYQVHTKTICPRDGAPLQYRKHLGKAGRRAFFCEVCQRRYRLEEA
>gid:105326  XCC3572  ISxcc1 transposase
MRKSKFTESQIVATLKQVEGGRQVKDVCRELGISDATYYVWKSKYGGMEA
ADVQRLRDLETEHNKLKRMYADLAMENHALKDVIAKKL
>gid:105327  XCC3573  ISxcc1 transposase
MKLNQRRRSKRRVPTRHPQPLACGDRPNAGWSIDFMSDALWDGRRFRTFN
VIDDFSREALAIDVDLNLPAARVIRTLERIAAWRGYPNKLRLDNGPEFVA
LALAEWAERKGIALDFIEPGRPMQNGFIERFNGSYRRGVLDMHIFRTLSE
VREQTEQWLADYNQQIPHDSLGGLTPAEFREQHQPQTSSFIWH
>gid:105329  XCC3575  IS1404 transposase
MKKRFTDEQVIGFLREAESGVAIKDLCRRHGFSEASYYLWRSKFGGMSVP
DAKRLKDLESENARLKKLLAEQLFENDLIKDALRKKW
>gid:105330  XCC3576  IS1404 transposase
MPAAWLQRGLVLPVAQQVRWDERARCQAAQGPRVRERAAEEVAGRAVVRE
RPDQGCTAKKVVSAPARRTLVREWIGRGASERRALAVIGMSASALRYCPR
EDRNGELRERICALAHRHRRYGVGMIYLKLRQEGRIVNYKRVERLYREQQ
LQVRRRKRKKVPIGERQPLLRPSQANQVWSMDFVFDRTAEGRVIKCLVIV
DDATHEAVAIEVERAISGHGVTRVLDRLAHSRGLPKVIRTDNGKEFCGKA
MVAWAHARNVQLRLIQPGKPNQNAYVESFNGRLRDECLNEHWFPTLLHAR
TEIERWRREYNEDRPKKAIGGMTPAAYAQHLANTDIINPGL
>gid:105333  XCC3579  truncated ISxac3 transposase
MAGGGRSDWRSFLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRRT
YPTKDAARAEVFDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>gid:105334  XCC3580  IS1477 transposase
MPPARFQPGQLLPVAGQVRRHGGRRREAPEGAGAGEQPPEAVAGRGAPGH
RGAEGRVRGKTLAPQRKREAIRRMCELTSISERRACRLAGISRDAFRHAP
TPTPATQTLSARLVELAQARRRFGYRRLHDLLRPEFPQVNHKKIYRLYRE
AKLSVRRRKKAKFPAAQRQPLRPARHPNEVISMDFVFDQLASGRRIKCLT
VADDFTHECVDIAVDHGISGAYVVRVLEQIACFRGYPRAVRTDNGPEFTS
RAFITWAQQRGIEHILIEPGKPMQNGYIESFNGKFRDECLNEHWFTSLIQ
AREVIADWRRDFNEVRPHSSCGRIPPAQFASNHRAQTGNNAVPFNPGLYQ
>gid:105335  XCC3581  IS1477 transposase
MKKSRFTTEQIIGFIKQADAGMAVAELCRQHGFSPASFYQWRAKYGGMEA
EDAKRLKELEQENNRLKRLLAEAHLDIEALKVGFGVKR
>gid:105336  XCC3582  ISxac3 transposase
MCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHWLASGSVYGH
RKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGMQCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDLTCPR
>gid:105337  XCC3583  ISxac3 transposase
MSMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGK
SGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:105339  XCC3585  IS1479 transposase
MQLTFGDAEGLGKRKQTRREIFLAEMEQVVPWQQLLGLVAPHYPVSGRPG
RQPYALATMLRIHLLQQWYALSDPAMEEALHEIPTLRRFAQLGGLDNVPD
ETTILNFRRLLETHGLAARMLEAVNAHLARKGQSLRSGTIVDATLIAAPS
STKNADHARDPEMHQTKKGNQWYFGMKAHIGVDEFSGLVHHVHCTAANVA
DVTVTHALLHGKEDSVFGDSGYTGADKREELQDCEAAFFIAAKRSVLQAI
GNKRERAREQRWEHFKASVRAKVEHPFRVIKRQFGYTKVRYRGLAKNTAQ
VLTLFALSNLWMKRKQLLPAMGSVRL
>gid:105340  XCC3586  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:105356  XCC3602  conserved hypothetical protein
MTSSFRSVSLVALLALSAGQPMQAQAGLTTITRGDASAMSAILVISAPYF
VSLALTHSAVAGSEALSHASNKASAGPLPPMRVTSVGPTPDGGSEVQLQD
PAQAANTALLRWPVRNDAPAAGFRVGETVTFQPSPAGAGWTVQSAQGEAL
AFVPTAESAAQNSSQAW
>gid:105380  XCC3626  RNA-directed DNA polymerase
MPPLPDYASTELLLGLKTRKDLASWLGVSDRALRYMLYRLGDGDKYSTFS
IRKRNGGLREIHAPKKALKYLQNKVAHALAGVVPVRQIAKGYVPGRSIYD
HAKMHRSKKWVVLVDLKSFFPSINFGRVLGLLRAPPFSLENEVAVAVAQL
CTRAGELPQGAPSSPVISNLICRKLDRQLLELAKQAGCGVSRYADDICFS
TNRKRVSVEICDFVNEHGWVPGAGLKQLICSNGFEINFSKFRVHEGRDRK
LVTGLVVNKGVSTPSRWRDQLRSSLHVIDKYGEAAGVEIISGWTSGFFRK
EPGDVLRTIRGKLGYLKWIDKVANHLATDALRRNFPSLASLMPISNDGVS
FRIMAEGPTDLLHLEAALDYLRKSGGFLDVRPRFQNFLGDVGDSELWETL
LRIAKADVNELTIGVFDCDSPAFMKKTSLVPGGNIQLGPRVYAFCLAPPG
TSISNNFCIESLYSRSDATLVDGSGRRLFFGDEFDSASGFSHDGLYKCLH
PKKKAIVVSDQVARVHDGASVLLSKAGFASQVKDKAPPFDVVSFDGFRPT
WLGIRALAIAAVRK
>gid:105381  XCC3627  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:105418  XCC3664  DNA polymerase related protein
MLDLPAVVAGPRVSAEFLQLASAVLCHRDVQRHAVLYRLLWRIASGERAL
LERATDVDVHRVMQWQKAVQRDSHKMKAFVRFRRLPGEEEEFVAWFEPEH
WILDRVAPFFARRFAGMRWAILTPYRSVRWDGEALTFGEGAARNQVPADD
AQETLWRTYYAHIFNPARLNPTMMRQEMPQKYWKNLPEATLLPELIREAG
VRVREMAERAPEPVRRRVPAAPAALPAVAAQSLAQLRVAARDCRRCDLWQ
PATQTVFGEGPDDAAVMVIGEQPGDEEDLSGRPFVGPAGRLFNQALGELG
IDRQRFYVTNAVKHFRFEQRGKRRLHRNPERSHVQACNGWLQAERAQLRP
AQIVCLGATAAQAVLGPGFRLMQERGQWQRLDDGTPVLATVHPSWVLRQG
TPSARDAGYRGFVADLGQLLQAPPA
>gid:105486  XCC3732  ISxac3 transposase
MVDSEVPMSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWL
RTFGKSGVVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:105487  XCC3733  ISxac3 transposase
MQAHCEEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNTGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRRTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>gid:105530  XCC3776  ATP-dependent RNA helicase
MSDKPLTDLTFSSFDLHPALVAGLESAGFTRCTPIQALTLPVALPGGDVA
GQAQTGTGKTLAFLVAVMNRLLIRPALADRKPEDPRALILAPTRELAIQI
HKDAVKFGADLGLRFALVYGGVDYDKQRELLQQGVDVIIATPGRLIDYVK
QHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPERGTRQTLLFSA
TLSHRVLELAYEHMNEPEKLVVETETITAARVRQRIYFPSDEEKQTLLLG
LLSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLL
NRFQKGQLEILVATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTAR
LGEEGDAISFACERYAMSLPDIEAYIEQKIPVEPVTTELLTPLPRTPRAT
VEGEEVDDDAGDSVGTIFREAREQRAADEARRGGGRSGPGGASRSGSGGG
RRDGAGADGKPRPPRRKPRVEGEADPAAAPSETPVVVAAAAETPAVTAAE
GERAPRKRRRRRNGRPVEGAEPVVASTPVPAPAAPRKPTQVVAKPVRAAA
KPSGSPSLLSRIGRRLRSLVSGS
>gid:105625  XCC3871  conserved hypothetical protein
MVKDYSRYRRTLLAPIARLMVRHEANMLRRLQGWKHAPALLGTLGGLALG
MEFIPGDTLSASAVVGQEVFQQLQHALRRLHAVGITHNDLHGTNVVVSAG
VPVLIDFTSAWRFPRWLRRSTLSRQLQRSDVANFQKMRRRLVGIAPSDAE
AALTAEPGWVRGVRNGWKRLYRWLKGGAA
>gid:105635  XCC3881  DNA polymerase-related protein
MPAHRIPSAPTTPLAKPVDTVPSGTLTALRAQAQDCRRCDLWKPATQVVF
GAGPARAPLMIIGEQPGDQEDQQGRPFVGPAGQLLGTLMADAGLDPAMAY
VTNTVKHFKFVPRGKRRLHQRATAGEQAACRPWLAAELLRVRPRIVLALG
AMAAQTLFGNAFRLTTERGQWRALDGRTTALASWHPSAILRMREPDRTAT
RALLREDLAQVAAALDNLR
>gid:105852  XCC4098  conserved hypothetical protein
MFFRNLTLFRFPTTLDFSEIETLLPQVQLKPVGPLEMSSRGFISPFGRDE
QDVLSHRLEDFLWLTVGGEDKILPGAVVNDLLERKVAEIEEKEGRRPGGK
ARKRLKDDLIHELLPRAFVKSSRTDAILDLQHGYIAVNTSSRKSGENVMS
EIRGALGSFPALPLNAEVAPRAILTGWIAGEPLPEGLSLGEECEMKDPIE
GGAVVKCQHQELRGDEIDKHLEAGKQVTKLALVMDDNLSFVLGDDLVIRK
LKFLDGALDQLEHSEGDGARAELDARFALMSAEVRRLFLLLEDALKLSKA
EA
>gid:105894  XCC4140  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRTFGKSG
VVHRAEVDQSAEVRRLKAELRRVTEERDILKKAAAYFAKG
>gid:105895  XCC4141  ISxac3 transposase
MQAHCGEFRVCAMCRVLRVNRSGYYAWLCSPNSERAKEDDRLLGLIKHHW
LASGSVYGHRKITTDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFH
GGMQCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSR
QVVGWAMRDRADTELVVQAVLSAVWRRKPNAGCLVHSDQGSVYTSDDWRS
FLASHGLVCSMSRRGNCHDNAPVESFFGLLKRERIRRRTYPTKDAARAEV
FDYIEMFYNPNRRHGSTGDLSPVEFERRYAQRGS
>gid:105938  XCC4184  IS1479 transposase
MQLTFGDAEYNGKRKQTRRKVFLAEMDQVVPWKDLLALIEPQYPKSGQPG
RQPYRLETMLRIHFLQTSPISRRRTSCCMARKTRSAEIAVTRVWRSVRRW
RASASCAT
>gid:105939  XCC4185  IS1479 transposase
MGTLQSQRTRQGGPSVPVIKRQFGYTKVRYRGLAKNMAQVLTLFALSTLC
MARRQLLPARGEYCLAAAKAARTLQQSHRTQHFARQ
>gid:105946  XCC4192  IS1478 transposase
MHTRRPAAEHMPAEELFRSRLENQIDLRHPLAQLSQRMPWTALEQALSSR
LPATQAGGGRPALPVRLIAGLLYLKHAYDLSDEAVCERWLENPYWQFFTG
EVVFQTRLPCDASSLTRWRQRLDEAGMEELLAHTINAAHAMQAVDARELS
RVIVDTTVQEKAIAYPTDSRLLEVARKKLVLLAKRYGIGLRQSYARQGPA
LSRKAGRYAHARQFKRMQRVLRRQRTVLGRVLRDIARKLDQVEPGVRERI
AVWLERAQRLYTQRPKDKQKLYALHAPEVECIGKGKARQAYEFGVKVGIA
VTACKGLVVGARSFPGNPYDGDTLAEQLEQTRGLLQDLSVEPTVAIVDLG
YRGREVDGVQVLHRGKAKTLTRRQWRWIKRRQAVEPVIGHLKDDCRLRRC
RLKGAQGDALHVLGCAAGYNLRWLLRWIAFLRAWMRAMGWPSFSAVPLSP
MTLGA
>gid:102385  alkB  DNA repair system specific for alkylated DNA
MRMNIRVALPQAQVHWCRGWLQAAHADALMQALLDQVQWEVHRIRMFGRV
VDSPRLSSWIGDADASYRYSGTQFAPQPWLEALQPVRTRLQDETGSPFNS
VLVNRYRSGADAMGWHSDDEPELGAQPVIASLSLGAARRFAFKHRHDAAL
KQTLELGHGDLLLMGGDTQRHYKHALPRTVKPVGERINLTFRQIAVRVSQ
R
>gid:104535  comEA  DNA transport competence protein
MKSFTVVLKSLLLALLLSSNAYALDKVDINTASAEELDKVLMNVGRSKAE
AIVEHRQANGPFKSAEELALVKGIGLKTVERNRDLIEVGATMAPAKKAAK
GAAVKPVGRR
>gid:104036  dbpA  ATP-dependent RNA helicase
MNEFSALPLSPALAPGIDALGYTTLTPIQAQSLPPILQGLDVIAQAPTGS
GKTAAFGLGLLQKLDPALTRAQALVLCPTRELADQVGKQLRKLATGIPNM
KLLVLTGGMPLGPQLASLEAHDPQVVVGTPGRIQELARKRALHLGGVRTL
VLDEADRMLDMGFEEPIREIASRCDKHRQSLLFSATFPDIIRTLARELLK
DPVEITVEGADNAPEIDQQFFEVDPTYRQKAVAGLLLRFNPESSVVFCNT
RKEVDEVAGSLQEFGFSALALHGDMEQRDRDEVLVRFVNRSCNVLVASDV
AARGLDVEDLAAVVNYELPTDTETYRHRIGRTARAGKHGLALSLVAPRES
ARAQALEAEHGQPLKWSRAPLATARPAQLPLAAMTTLRIDGGKTDKLRAG
DILGALTGEAGLSGAAIGKIAIYPTRSYVAIARAQVARALTHLQAGKIKG
RRFRVTKL
>gid:104396  deaD  ATP-dependent RNA helicase
MTQESSAPLLFADLGLSDAVMKAVAAVGYETPSPIQAATIPALLAGRDVL
GQAQTGTGKTAAFALPVLSNADLNQVKPQALVLAPTRELAIQVAEAFQKY
AEAIPGFRVLPVYGGQPYAQQLSALKRGVHVVVGTPGRVIDHLDRGTLDL
SQLKTLVLDEADEMLRMGFIDDVEAVLKKLPEKRQVALFSATMPPAIRRI
AQTYLKDPAEVTIAAKTTTSANIRQRYWWVSGLHKLDALTRILEVEPFDG
MIIFARTKAATEELAQKLQARGLAAAAINGDMQQAAREKTIAQLKDGKLD
ILVATDVAARGLDVERVSHVLNYDIPYDTESYVHRIGRTGRAGRNGDAIL
FVTPREKGMLRAIERATRQPIEEMQLPSVDAVNDTRVARFMTRITETLAG
GQIEMYRDLLQRYESENNVPAIDIAAAMAKLLQGNAPFLLTPPVRGARED
FAPRERNDRADRGERPRFEPKFERGPRAPDGERGARPPRPDRPAYGEDAG
AERPRREPSAPRGEPEFGMESYRIEVGHTHGVKPANIVGAIANEAGLESR
YIGRIDIQDDYSILDLPADMPRELLTHLKKVWVSGQQLNMRKLEEGEAAA
AAASKPKFPRGPRPAGRPNRPMDRAGAPHRKGPPKPRGPRSE
>gid:105704  dinG  ATP-dependent helicase
MSDIATPAAPSAPVPTQRTLTEPVKAGIREAYAKLQANTPGFATRRAQSQ
MIGLVSRALATSGGIGVAEAPTGVGKSLGYLTAGVPIALATKKKLVISTG
TVALQSQLVERDIPAFLKATGLEATVALAKGRTRYLCTRNAAELEGETSQ
NGMFEDEQVLYDRPLSPADVDLAKRLAKAYAARTWNGDLDDAPEPVSVPL
RMRVTTPASGCAGRRCSYAAQCPVLKARTDVREAQIVVTNHALLLSSLSL
GDAENGQPLIAPPSDMLLVLDEGHHIAGVAIDQGAANLPLDDMAKRTGRM
QILIAAAYRAVDKDKIGNLLPSEAIEVAARVSKLLKAFHTEVERVWKPEP
GERDPLWRAPNGKLPPQWGPAIEELGEETRALFNWVHAAHGTVAKGKQDD
AARERLQRSLGMALEMAEQQHNLWSGWRREDKDGQPPMARWITLSRDGDL
ICHCSPVSAAQVLRTLLWNEVDSVVMTSATLTGGGDFQSFAIDNGLPDHA
EMASLASPFDLPNQAELIVPNFPVTPDDREGHPKEVAKYLVRELDWAAKG
SIVLFTSRWKMEKVADLLPLAQRNRVLVQGEGNKSQLITEHLRRIAAGEG
SVLFGLNSFGEGLDLPGEACTTVVITQVPFAVPTDPQTSTLSEWLESRGH
NAFNLIAIPHALRTLTQFAGRLIRSSNDHGRVIILDSRLLTRRYGKRILD
ALPPFKRVIGR
>gid:102337  dinP  DNA polymerase IV
MRKIVHVDMDAFYASVEQRDDPSLRGKPVVVAWRGARSVVCAASYEARTF
GIRSAMPAVRAERLCPDAVFVPPDFARYKAVSRQVREIFHRHTDLVEPLS
LDEAYLDVTEAKTGMQLATEIAQLIRTQIREETQLTASAGIAPNKFLAKI
ASDWRKPDGQFVIAPSRVDAFLLPLPVNRIPGVGKVMDGKLAALGIVTVS
DLRLRPLEELQAHFGSFGQSLYRRARGIDERPVEPDQEVQSVSSEDTFSE
DLALDALDPHIQRLAEKTWHATRRTERIGRTVVLKLKTSNFRILTRSYTP
EQPPASLQGLVDIALGLTRRVELPPETRYRLVGVGLSGFSDPELQAAVQG
ELFGEVPQQ
>gid:101755  dnaA  chromosomal replication initiator
MDAWPRCLERLEAEFPPEDVHTWLKPLQAEDRGDSIVLYAPNAFIVEQVR
ERYLPRIRELLAYFAGNGEVALAVGSRPRAPEPLPAPQAVASAPAAAPIV
PFAGNLDSHYTFANFVEGRSNQLGLAAAIQAAQKPGDRAHNPLLLYGSTG
LGKTHLMFAAGNALRQANPAAKVMYLRSEQFFSAMIRALQDKAMDQFKRQ
FQQIDALLIDDIQFFAGKDRTQEEFFHTFNALFDGRQQIILTCDRYPREV
EGLEPRLKSRLAWGLSVAIDPPDFETRAAIVLAKARERGAEIPDDVAFLI
AKKMRSNVRDLEGALNTLVARANFTGRSITVEFAQETLRDLLRAQQQAIG
IPNIQKTVADYYGLQMKDLLSKRRTRSLARPRQVAMALAKELTEHSLPEI
GDAFAGRDHTTVLHACRQIRTLMEADGKLREDWEKLIRKLSE
>gid:103188  dnaB  replicative DNA helicase
MSARPGFRSNRNRDRDRDDYDRPEPRLDQLRVPPHSVEAEQAVLGGLMLA
PDAFDKVNDQLTENDFYRRDHRLIYRAIRELSEKDRPFDAVTLGEWFESQ
GKLEQVGDGAYLIELASTTPSAANIAAYAEIVRDKAVLRQLIEVGTNIVN
DGFQPEGRESVELLASAEKAVFKIAEAGARGRTDFVAMPGALKDAFEELR
NRFENGGNITGLPTGYTDFDAMTAGLQPTDLIILAARPAMGKTTLALNIA
EYAAIKSKKGVAVFSMEMSASQLAMRLISSNGRINAQRLRTGALEDEDWA
RVTGAIKMLKETKIFIDDTPGVSPEVLRSKCRRLKREHDLGLIVIDYLQL
MSVPGNSENRATEISEISRGLKGLAKELNVPVIALSQLNRSLETRTDKRP
VMADLRESGAIEQDADMIVFIYRDDYYNKENSPDKGLAEIIIGKHRGGPT
GSCKLKFFGEYTRFDNLAHDSVGSFE
>gid:103112  dnaE1  DNA polymerase III alpha chain
MSTSRFVHLHVHTEFSLADSTIRVPEKPDQADPKKAKQANLLSRAVELDL
PALAVTDLNNLFALVKFYKAAEGVGIKPIAGADVMIATPDVTPWRMTLLC
RDREGYLSLSRLLTRAWMEGHRPEGGVAIHPEWLQAGHANLFALAGRDSL
AGRLFAEGRADLAEQQLADWQRVFGDGLHLELTRTGREGEERFNQFALHA
AGVRGLPVVASNDVRFLYASDFAAHEARVCISSGRVLDDPKRPRDYSDQQ
YLKSSEEMAALFADVPDAIDNTLALAQRCNIEMRLGTYFLPAYPVPEDET
LDSWIRSQSRDGLAARLEKNPIAPGKTRQDYVDRLEFELDTIIKMGFPGY
FLIVADFIQWGKNQGIPIGPGRGSGAGSLVAWALQITDLDPLPYNLLFER
FLNPERVSMPDFDIDFCMDRRDEVIDYVARKYGRERVSQIITYGTMAAKA
VVRDAGRVLGFTYGLVDSVAKLIPNILGITLKDAMGEGKDTEMASPELIQ
RYQVEDDVRDLMDLARQLEDLTRNAGKHAGGVVIAPEPLSEFCPLFAEHD
EGGRGKNPVTQFDKNDVEEVGLVKFDFLGLRTLTIIDWAVKAINVRHARA
GIDPVDITAIPLDDAPTYKGVFASGNTGAVFQFESSGMRRLLKDARPDRF
EDLIALVSLYRPGPMDLIPDFNARKHGQQDIIYPDPRTEAILKDTYGIMV
YQEQVMQMAQIVGDYSLGGADLLRRAMGKKVPAEMAKHREIFREGAAKGG
VSAQKADEIFDLMEKFAGYGFNKSHAAAYALVSYQTAWLKRHYPAEFMAA
TLSSDMDNTDKVVGFLDEVRNLGLTVLPPRVNESAYMFEAASPDTIQYGL
GAIKGVGQGACEAIVEERLRNGPYTTLLDFCTRVGTAKLNRRTLEAMINA
GAMDGLGKNRASLMLQLPEVMKATEQMARERASGQNSLFGGPDPSAPAMR
LDLPESKEWPLGQLLTGERETLGFYLSGHPFDPHRDEVRELVGCDLSALD
KILASQQRGGGGGGDGEKRAWRPEVSAILAGQVVGVRRKGDSQVFVQLED
GRGRVECSAFSDAMAEFGHLLTRDRILIIKGGLREDEFNGGYSLRIRQCW
DYEQICADHTQRLSLRLDLREKQAWSRIDTLLAKHRPGKTPLRLDLLLRS
PAGGVAGMLDLNGSHSVRIDQQLMDSLRADPAVRTLKVKYSPPWAQ
>gid:102855  dnaE2  DNA polymerase III alpha chain
MPRGWTVAARLRAANDDITHAAVADTLPAYAELHCLSDFSFLRGASSAEQ
LFARAHHCGYSALAITDECSLAGIVRGLEASRATGVQLIVGSEFTLVDGT
RFVLLVENAHGYPQLCSVITTGRRAAGKGAYRLGRAEVEAHFRDVVPGVF
ALWLPGDQPQAEQGAWLQRVFAERAFLAVELHREQDDAARLQALQALAQQ
LGMSALASGDVQMAQRRDRIVQDTLTAIRHTLPLADCGAHLFRNGERHLR
PRRALGNIYPHALLQASVELAQRCTFDLSKVQYTYPRELVPQGHTPASYL
RQLTEAGMRERWPEGAPAQVVAQIDSELELIAYKGYEAFFLTVQDVVRFA
RAQGILCQGRGSSANSAVCYALGITAVNPSETRLLMARFLSKERDEPPDI
DVDFEHERREEVLQYVYTKYGRERAALAATVICYRGKSAVRDVAKAFGLP
PDQIALLANCYGWGNGDTPMEQRIAEAGFDLANPLINKILAVTEHLRDHP
RHLSQHVGGFVISDEPLSMLVPVENAAMADRTIIQWDKDDLETMKLLKVD
CLALGMLTCIRKTLDLVRGHRGRDYTIATLPGEDAATYKMIQRADTVGVF
QIESRAQMAMLPRLKPREFYDLVIEVAIVRPGPIQGDMVHPYLRRRQGYE
PVSFPSPGVEEILGRTLGIPLFQEQVMELVIHAGYTDSEADQLRRSMAAW
RRGGDMEPHRVRIRELMAGRGYAPEFIDQIFEQIKGFGSYGFPQSHAASF
AKLVYASCWLKRHEPAAFACGLLNAQPMGFYSASQIVQDARRGSPERQRV
EVLPVDVLHSDWDNILVGGRPWHSDADPGEQPAIRLGLRQVSGLSEKVVE
RIVAARAQRPFADIGDLCLRAALDEKARLALAEAGALQSMVGNRNAARWA
MAGVEARRPLLPGSPAERAVELPAPRAGEEILADYRAVGLSLRQHPMALL
RPQMLQRRILGLRELQARRHGSGVHVAGLVTQRQRPATAKGTIFVTLEDE
HGMINVIVWSHLAMRRRRALLESRLLAVRGRWERVDGVEHLIAGDLYDLS
DLLGEMQLPSRDFH
>gid:105574  dnaG  DNA primase
MARIPDAFIDELLARTDIVEVVGGRVPLKRQGKEYSARCPFHDERSASFT
VSPTKQFYHCFGCGAHGTAISFLMNYDRLEFLDAVDELAKRAGMEIPRET
QQRTPQQQDDSRELYSALEAATKFFQRQLEGSDRARDYLDGRGVDAENRA
RFQIGYAPDGYSALKDTLGTDARRMSVLERAGLFSKNDRGHVYDKFRDRV
MFPIFDRRGRVIAFGGRIMGAPADGRDPGPKYLNSPETALFHKGRELYGL
WQVRQANQKIERLIVVEGYMDVVSLFQFGVTQAVATLGTATTPEHAELLF
RNAPDVYFCFDGDNAGRKAGWRALESVLPRMKDGRQAFFLFLPDGEDPDT
IVRKEGAQAFDARLKQATPLSQFFFDEMARDINLHTLDGKARLAERAKPM
LAQIPEGAFGDLMKQELARMTGVGASMSAQQSPPKARPPARMGAPTQKRS
LVRASIAILLQQPSLAMSLEGDHDFSGLRLPGIELLMELLALVRQRPEIS
TGALLEHFAEREELVALQKLAAQELPGDEHSWAIELHDVVAQLDKQLLRQ
RVEELQAKQRAQGLDNTDKYEMRELLKALAAL
>gid:101756  dnaN  DNA polymerase III beta chain
MRFTLQREAFLKPLAQVVNVVERRQTLPVLANLLVQVNNGQLSLTGTDLE
VEMISRTMVEDAQDGETTIPARKLFDILRALPDGSRVTVSQTGDKVTVQA
GRSRFTLATLPANDFPSVDEVEATERVAVPEAGLKELMERTAFAMAQQDV
RYYLNGLLFDLRDGLLRCVATDGHRLALCETELEKSGSAKRQIIVPRKGV
TELLRLLEAADRDVELELGRSHIRVKRGDVTFTSKLIDGRFPDYEAVIPI
GADREVKVDREALRASLQRAAILSNEKYRGVRVEVSPGQLKISAHNPEQE
EAQEEIEADTKVDDLAIGFNVNYLLDALSALRDEHVVIQLRDANSSALVR
EASSEKSRHVVMPLRL
>gid:102742  dnaQ  DNA polymerase III epsilon chain
MRQIILDTETTGLEWRKGNRVVEIGAVELLERRPSGNNFHRYLRPDCDFE
PGAQEVTGLTLEFLADKPVFAEVVEEFLAYIDGAELIIHNAAFDLGFLDN
ELSLLGDQFGRIIDRATVVDTLMMARERYPGQRNSLDALCKRLGVDNSHR
QLHGALLDAQILADVYIALTSGQEEIGFGAMDAGQHAEGGEGMIAFDPSL
LLPRPRVVVTPSELQAHEARLERLRKKAGRALWDAPELDEVAVAS
>gid:102755  dnaX  DNA polymerase III tau and gamma subunits
MSYLVLARKWRPKRFAELVGQEHVVRALSNALDSGRVHHAFLFTGTRGVG
KTTIARIFAKSLNCETGTSADPCGTCPACLDIDAGRYIDLLEIDAASNTG
VDDVREVIENAQYMPSRGKFKVYLIDEVHMLSKAAFNALLKTLEEPPEHV
KFLLATTDPQKLPVTVLSRCLQFNLKRLDEDQIQGQMTRILAAEEIESDP
SAIVQLSKAADGSLRDGLSLLDQAIAYAGGALREDVVRTMLGTVDRTQVG
AMLQALSDGDGAQLLKVVAALAEFSPDWSGVLEALAEALHRIQVQQLVPS
VAFVGDGIDPTGFAAQLRPEVVQLWYQMALNGRRDLYLAPSPRAGFEMAV
LRMLAFRPAAAVPAGSGDDGRGASAGGHTRGTATGVQAAPAAAAPARAAT
SAKAADVSPAPVVSAPPVAAAPSPVVVLPTAAAEPAPSAPPARTDDTPPW
AVDDAPVRAQAAPQRATAEVPAAVPLMAPEAAMALPATVADDAAPAAMDA
VVPVAPPSAPAPVTPPAATFDDGHIADAEQWLELVTRSGLNGPSRQLAAN
AAFIGHRDGVLRLALAPGFEYLNSERSIANLAQALAPELGNTPRIVIETG
SADVETLHERANRQKGERQSAAETAFMNDPNVQQLIQQQGARVVPDSIRP
YDE
>gid:102593  exo  exodeoxyribonuclease IX
MTTPAPPLATAPLAAALRTPRPVPLYLVDASLYVFRAWHSIPDEFQDAQG
WPTNAVHGFARFLLDLLERERPQHITIAFDEALDSCFRHAIYPAYKGNRE
PAPDALRRQFAHCKALCAALGLSVLAHREYEADDLIGSALHSARARGLRG
IIVSADKDLSQLLFEHDEQWDYARNVRWGMDGVKARHGVHAHQMADYLAL
CGDAIDNIPGITGIGAKSAAVLLAHFGSLDALLERLDELPFLRLRGAAQM
ALRLREQREHALLWRQLTTIALDAPLELTESGFTRAPADTDMLTGLCDSL
RFGPLTRRRLLAASGGAVLPPPPASLSQGPFP
>gid:105600  exoA  exodeoxyribonuclease III
MRIISFNANGLRSAASKGFFEWFATQDADVLCIQETKAQEHQLAGPEFLP
AGYKAWFRDASTKKGYSGVAIYAKREPDEVRTALGWPEFDEEGRYIEARF
GNLSVVSFYIPSGSSGELRQGYKFQVMEWLRPILSEWLASGRQYVLCGDW
NIVRSALDIKNWKSNQKNSGCLPPERDWLNGLCADLLDEADASNGRGWVD
SYRVLHPQGEDYTWWSNRGAARANNVGWRIDYQLVTPGLRDKVQACSIYR
EQRFSDHAPYIVDYAE
>gid:103788  exoI  succinoglycan biosynthesis protein
MLLLMWFRAWLCVIGVLTVSPALAADLIGRATVTDGDTLTVAQQRIRLWG
IDAPESAQQCTARNGQAWPCGRRAAAALDAYVQDKTVRCQPKDTDRYGRI
VAECFVQGQSINAWMVRSGWAVAYRQYATAFVADEAIARQQASQLWSGSF
QTPSEYRRAKRSASAKPAAGTSAPSNARCTIKGNVSAKGAKIFHLPGQRD
YAKTRIAPAHGERMFCSVREALDAGWRPAQR
>gid:102262  fis  DNA-binding protein
MNAAPSRPDSSRGAPKSPLREHVAQSVRRYLRDLDGSDADDVYEIVLREM
EIPLFVEVLNHCEGNQSRAAAMLGIHRATLRKKLKEYGLT
>gid:103328  gyrA  DNA gyrase subunit A
MAETAKEIIQVNLEDEMRKSYLDYAMSVIVGRALPDARDGLKPVHRRVLF
AMNELGAHSNKAYFKSARIVGDVIGKYHPHGDQSVYDTLVRMAQPFSLRY
LMVDGQGNFGSVDGDSAAAMRYTESRMSRLAHELMADIEKETVDFQPNYD
EKELEPTVMPTRFPSLLVNGSAGIAVGMATNIPPHNLTEAINACIALIDT
PELDIEGLMEYIPGPDFPTAGIINGTAGIAAGYRTGRGRVRIRAKADVEV
ADNGREAIVVTEIPYQVNKARLIEKIAELVKEKKLEGISELRDESDKDGM
RIYIEIKRGESAEVVLNNLYQQTQMESVFGINMVALVDGRPQLMNLKQML
EAFIRHRREVVTRRTIFELRKARARAHVLEGLTVALANIDEMIELIKTSA
NPQEARERMLAKTWEPGLVGALLGAAGAEASKPEDLAPGVGLSNGFYQLS
EVQASQILEMRLHRLTGLEQEKLTDEYKQLLEVIQGLIRILENPDVLLQV
IRDELINIREEYGDARRTEIRHSEEDLDILDLIAPEDVVVTLSHAGYAKR
QPVSAYRAQRRGGRGRSAASTKEEDFIDQLWLVNTHDTLLTFTSSGKVFW
LPVHQLPEAGSNARGRPIINWIPLESGERVQAVLPVREYADNRYVFFATR
NGTVKKTPLSEFAFRLARGKIAINLDEGDALVGVALTDGDRDVLLFASNG
KTVRFGESTVRSMGRTATGVRGIRLAKGEEVVSLIVSERAGGVEDEVEDE
SAEEVVETTDGAEPAVIDVADNGDVAYILTATENGYGKRTPLAEYPRKGR
GTQGVIGIQTTERNGKLVRAVLLGSTDEVLLISDGGTLVRTRGSEISRVG
RNTQGVTLIRLSKGEKLQAVERLDASLEEPEDVVDEAVAITSDAPPAEG
>gid:101758  gyrB  DNA gyrase subunit B
MTDEQTTPPTPNGTYDSSKITVLRGLEAVRKRPGMYIGDVHDGTGLHHMV
FEVVDNSVDEALAGHADDIVVKIHVDGSVAVSDNGRGVPVDIHKEEGVSA
AEVILTVLHAGGKFDDNSYKVSGGLHGVGVSVVNALSEHLWLDIWRDGFH
YQQEYALGEPQYPLKQLEASTKRGTTLRFKPAVEIFSDVEFHYDILARRL
RELSFLNSGVKIALIDERGEGRRDDFHYEGGIRSFVEHLAQLKTPLHPNV
ISVTGEHNGIVVDVALQWTDAYQETMYCFTNNIPQKDGGTHLAGFRGALT
RVLSNYIEQNGIAKQAKITLTGDDMREGMIAVLSVKVPDPSFSSQTKEKL
VSSDVRPAVENAFGARLQEFLQENPNEAKAITGKIVDAARAREAARKARD
LTRRKGALDIAGLPGKLADCQEKDPALSELFIVEGDSAGGSAKQGRNRKN
QAVLPLRGKILNVERARFDRMLASDQVGTLITALGTGIGRDEYNPDKLRY
HRIILMTDADVDGSHIRTLLLTFFYRQMPELIERGYIYIGLPPLYKLKQG
KSELYLKDDAALNAYLASSAVEGAALIPASDEPPITGEALEKLLLLFAGA
KEAIARNAHRYDPALLTALIDLPPLDVVQLQAEGDVHPTLDALQAVLNRG
TLGTARYHLRFDPATDSAAASLVSVRKHMGEEFTQVLPMGAFESGELRPL
REVALALHGLVREGAQILRGNKSHPITSFAQAQAWLLEEAKRGRQVQRFK
GLGEMNAEQLWETTVNPDTRRLLQVRIEDAVAADQIFSTLMGDVVEPRRD
FIEDNALKVSNLDI
>gid:104211  himA  integration host factor alpha chain
MALTKAEMAERLFDEVGLNKREAKEFVDAFFDVLRDALEQGRQVKLSGFG
NFDLRRKNQRPGRNPKTGEEIPISARTVVTFRPGQKLKERVEAYAGSGQ
>gid:104371  holA  DNA polymerase III delta subunit
MELRPEQLAGQSSQPLQPVYLIAGPETLRVLEAADAVRARARAEGISERE
VFDADGREFDWNQLDASFNAPSLFSPRRLVEVRMPSGKPGKDGAEVITRF
CANPPPDVVLLITANDWSKAHQGKWADAVGRIGTIAVAWPIKPHELSDWI
ERRLRAQGLRADAAAVQRLSERVEGNLLAAAQEIDKLALLADGKVLDLEA
MESLVADAARYDVFRLAETTFSGQPAAVVRMLAGLRAEGEAVAALMPILI
KELLRTASLAKVQAGGGNLGAEMKAQGIWESRQAPFKRALQRHPEPRRWE
RFVAEAGLVDRMAKGRAEGDPWVALERLLVAVAEARAVRLLA
>gid:102777  holB  DNA polymerase III delta' subunit
MTPTFSPWQQRAYDQTLAALDAGRLGHGLLICGPDGLGKRAVALALAEHV
LASAPDPAVAQRTRQLIAAGTHPDLQLVSFIANRTGDKLRTEIVIEQVRE
ISQKLSLTPQYGIAQVVIVDPADAINRAACNALLKTLEEPSPGRYLWLIS
AQPARLPATIRSRCQRLEFKLPPAHEALAWLLTQGVSERAAQEALEAARG
HPGLAAQWLREDGLAVRRAVAQDLEQIASGRVGAVDVAQRWTNDGQADQR
LRHAADLALAQASAGLTDPSRLHKLATWFDAANRTRDLLRTTVRADLAVA
ELLLAWREGERQARSRGTR
>gid:102402  holC  DNA polymerase III holoenzyme chi subunit
MPRADFYLIAKPRFLDEPLRLVCELARKANDANLSTLILARDAAQAEALD
DLLWAFDDEAYVPHQIAGTDEEDELAPVLIATPEFAAPSRPLVINLRDDP
YLGACDRVLEVVPADPAAREPLRERWKQYKALGLELTKYDM
>gid:104699  hrpA  ATP-dependent RNA helicase
MSAIDTDLATTLRERRGAVDAAMSRDRGRLLGLWSRWQGKPGNPQLRQAF
EQALAASQAQRQARAAQQPAITLDTQLPIAREADRIIALIRDHPVVVIAG
ETGSGKTTQLPKLCLAAGRGAAGMIGCTQPRRIAARAVAARVAEELNTPL
GTTVGFQVRFTDRVGEDSRIKFMTDGILLAEIASDRWLSAYDTIIVDEAH
ERSLNIDFLLGYLKQLLRKRPDLKLIVTSATIDTERFSRHFDDAPVINVE
GRTFPVDVRYRPLEGESGDGDTGDVGRDGERTVNDAIVAAIDEITRIDPR
GDVLMFLPGEREIRDAHQSLERRKYRETEVVPLYARLSAADQDRVFNPGP
RRRLVLATNVAETSLTVPRIRYVVDPGYARVKRYSPRQKLDRLHIEPISQ
ASANQRMGRCGRIAEGICYRLYAEADFAARPAFTDPEIRRASLSGVILRM
LQLGLGRIEEFPFLEAPDERAVADGWQQLLELGAIDAERRLTAIGRQMAR
LPVDVKLARMLVAAQQHGCLREMIIIAAFLGIQDPRERPPEAREAADNAH
ALFADARSEFVGILRLWDAYRQVHEDLTQSKLRDWCGRHFLGFLRMREWR
ELHRQLRLLCEELGWSEEPAGAMLAPLLAGASAPVREDGQAHRATRGQLH
RAARLAREGKPDPAAPPAQAKAAAAKSSPADATDAAVRTSERERAAAYQA
LHRALLAGLPTQIGHRTEKGDFLAARQRRFVPFPGSALARKPPPWILAAT
LLDTQKVWGMTNAAIEPDWAIAELPHLLARKHFDPHWSRAQGQVVASEQI
SLFGLVLAPKKPVHFGKIDPATSHDLFVRQGLVPGEINTRAAFVADNLKV
LEQAREEEAKLRRAGIVADEDWQARWYLDRIPAELHSASGLDAWWKTLPA
DKRRSLHWSLNDLLPGEGSEADRFPKYFALGDARLPLQYRFEPGAIDDGV
TLEVPLHLLNALDPSRLSWLAPGFVADKASALIRSLPKAQRRNYVPAPDY
GRAFYEAFSTPSADDMRGELARFLSKATGAPVAALDFDEEALDTHLLMNL
RLRDEDGRVLAESRDLVGLRARFGERAGQAFAARAGRALAAEGLRDFPAT
PIPEQVAGEAGVPAYPALVDQGEDAALRIFADRNEALRAHPRGVRRLLEI
ALADKIKQARKQLPVSPKTGLLYAAIESQERLRGDLVDAALNAVLAEGLG
AIRDPAAFAQRREDAVKRLFGEAMARLTLAESILGAVAELKPLLEAPLMG
WARGNLDDMEQQLRALVHAGFLRDTPADALANYPRYLKAMILRTERAKRD
PARDQARMLELKPFVDALNDAAARGLQQHPDWQALRWDLEELRVSVFAQE
LGAKSGVSAKKLSQRVAALRG
>gid:102029  hrpB  ATP-dependent RNA helicase
MPRMSDPAFPISPLLPQIRDSLAAHPRLVLEAPPGAGKTTQVPLALLDAP
WLAGRSIVMLEPRRVAARSAALFMARQLGEPVGETVGYRIRFENKTSART
RIEVVTEGILTRMLQDDPMLERVGALLFDEFHERHLAGDLGLALALDVQS
QVREDLRIVAMSATLDGERLASFLDAPRLSSAGRSYPVEVAHFPARRDEA
LEPQTRRAVEHALATHPGDVLVFLPGQREIARVQAALQDALDPAMQLLPL
HGELPVEAQSQVLQPDPQGRRRVVLATNVAESSVTLPGVRVVIDSGLARE
PHYDPNSGFSRLDVTSIAQASADQRAGRAGRVASGWAYRLWPQSQRLEPQ
RRAEITQVELTGLALELAAWGSDALRFVDAPPGGALAAARELLQRLGGLN
AEGGITALGRRMLALATHPRLAALLAQAGTPARLALACDLAALLEARHPL
RQGGDGLAARWRALAAFRQGRTGADANRGALAAIDAAAKQWRRRLRCDAT
PPTSVEAHALGDLLSHAFPDRIAARHPTDPLRYLLANGRSARLFDHSDLR
GEPWLVASELRYEAKDALLLRAAPVDEGYLRQSVPERFVQQDVVQWDADK
RALVARRQSSFDRIVLDSRPAGRVDPAQAAAALTEAVRQLGLDALPWTEG
LRQWRARVVSLRAWMPELGLPDLSDTALLASLDHWLRPAFAGKTRLDALD
EASLGDALKAALPWERRQAIDRHAPTRISVPSGMERAITYALDHDQQPLP
PVLAVKLQELFGLAETPRVADGRIPLTLHLLSPGGRPLQVTQDLKSCWAT
TYPDVKKEMKGRYPRHPWPDDPWTANATHRAKPRGT
>gid:104512  hup  histone-like protein
MAKTAAKKAAPKKAVKKVAASKTAKPAKAASKTAAPKPIKEALSKTGLVA
HIAETTQLAPKDVRAVLASLEATAHASLSKKGVGSFVLPGMLKITSVNVP
AKPKRKGINPFTKEEQVFAARPATCKLKVRAMKRLKDAAL
>gid:102732  hupB  histone-like protein
MNKTELIDGVAAAADISKAEAGRAVDAVVSEITKALKKGDAVTLVGFGTF
QVRERAERTGRNPKTGDSIKIAASKNPAFKAGKALKDAVN
>gid:103947  ihfB  integration host factor beta subunit
MTKSELIEILARRQAHLKSDDVDLAVKSLLEMMGQALSDGDRIEIRGFGS
FSLHYRPPRLGRNPKTGESVALPGKHVPHFKPGKELRERVSSVVPVDMVD
AAD
>gid:103396  int  phage-related integrase
MEDGGIPCIQISDEGEHQKVKTDVSLRTVPVHPDLLALGFLDWVEQARGQ
GQERLFPAAKADAKNGQGNWISKAFSRHLAEVGKNWPTAKRGFHSLRKTL
IQELQGAGVVSELRAQLVDHELDDEHHVTYSRAFTAKEKLDGLRAVSPGL
SVLDYGLSLDSLSALMNQTTSVVSLKSNALRA
>gid:102146  int  phage-related integrase
MPLSDAAVRNATPADKPVRLFDGGGLYVEISPKGAKLWRWKYRFGGKEKR
LALGVYPEVSLAEVRAQHLEARKVLRSGIDPGEKRRVDRLVRVDRSQLSF
AAVAAELLALHGKKNSVLTMKRNGRIVEKDLNPEVSPHFLLANQSLAT
>gid:104766  int  phage-related integrase
MGRGRKRKFNPAIPSHVDQQALPQGLYWESNRWFMYEPHKEGGRLTKRTV
ALASARLSELHAIVEAARGGSAQGTLAYLCDHFLGTPPIAASSEFKELAA
GTQKDYRQCAAAACSYVLKDGSTLGKMQVARMNIPMIQRLVETLANGRSA
SASQPLIEPRPSKANHVLRFLRRLFSWGMRFGLCAHNPAKGVRQARELAE
HKMPESDAFATILDFARQRGRLKAHSLGSVSPYLHAVMQLAYNLRLRGVE
VTELTDAHADEVGIRSNRRKGSRDNITTWNDELREAWQWLVGYRKDAMSA
RGRPVELKPEKRRLLVNQSGTPLSKGALDSAWQRMMQMAIREKVIREDQR
FSLHGLKHRGITDTVGNRADKQDAAGHKSPHMTQRYDHQVPVVSPPKKQ
>gid:105941  int  integrase
MNRTIKEATVKGFHYDDHAQLQQHLANFIDAYNYGRRLKALKGLTPYEFI
CKQWTSEPDLFKVDPIHLMPGLNT
>gid:103366  int  phage-related integrase
MQPCCGAVATAMSPASTPVVGQLKTEAKRLTQPAQTSTPFSTLPSPPFRN
SIMSITSTPTAEHLSSLLNQRLPGKVIFRTLRSTASTAFVMNAESHHQRP
QRLAESDLAKIFASPAYDQWAANEPLLFWAPLLGLYMGVRASETVALSID
DIIERAGLMCIVLKNRAAPESESATASKYRTRQRHSTGSIPVPKVLLDAG
FPDYVAAIKSKDRRELFRTSQRERTADTAAWLRSKFARYLRSHGIKKSGF
SALRQTFGERLMDADISWADKRELARASERHFPAFTCFFYPSCRMSSLKK
SLNKISCDGLTLPRFMGDQVLVRTHGAARYR
>gid:104900  intS  phage-related integrase
MVRMLTDMVVRQAKASDKPYTLADFDGLFLYVSPVGGKAWHFRYTWVGQR
ARISLGSYPELSLRDAREFRDQARALVAKGINPRTDRKQKRQAIRLAGEN
TFMAVYEKWMEHRQLTLEEGRQSSLEQIRRVFKKDVFPYLKRYTIYEITR
PVLLEVIGRIEKRESLSVAEKVRTWLKQLADYAMVVIPGMVEHPAIDLHV
VAVPLPPVEHNPFLRMPELPLFLQTLRKYRGMQMTQLAIRLLLLTGVRTG
ELRLATPDQFDLEQGLWIIPVMSLKQRKMLTKKKRKRVTDIPPYIVPLPV
QAIEIVRHMLDLFKPAQTYLFPGVKRITARMSENTVNRAIKRLGYDGRLT
GHGIRATISTALNELGYPKVWVDAQLSHADPNRISATYNHAEYVEQRRLM
MQDWADRLDLFEQNQVQIASTHLTIHLQGVPTIAGQKVTPLPALGQHAPI
MLVAPNEQTMPAVGTGTQRLSAVQMPEYALPKISEVQRERLEVLDIFEGP
DNLVVADYAKLAGKSRRWITYEIQARNLLSIQLGNKGQRVPVWQLNMFKR
RLVQAVLKRLHRGVDTWDIYYALTRPREELDGKSPIEALTSDNQQAMVEA
VCRAVSEATTPVVEKRVPINRIAECMSEF
>gid:103864  intS  phage-related integrase
MMMALSDLTVRQAKAAEKTYSIPDTDGLGLVVAPTGGKSWHLRYYWLGKQ
KRISLGNYPEVGLREARTLRDEARALVAKGINPHADRKQKRRAIKLASDY
TFKAVFDAWVEHRAKELKEGRNSTLSQIQRIFGKDVLPSLERMSIYDIRR
PQLLGVLARIERRKAFTTTEKVRTWLSQLFRYALVIVEGMEANPATDLDV
VAEPKPPVSHNPYLRLPELPDFLRKLRLYNPRGWQTQLGIRLLFLTGVRT
GELRLATPEQFDLDRGLWIIPPQIVKQLQDEMRKAGKRPHDIPPYIVPLS
VQAIEIVRYLLGVMRPAQKHLLAHRSELKKRISENTLNAALKRVGYDAQL
TGHGIRGTISTALNEIGYPKIWVDAQLSHSDPNKVSSAYNHAKYVEPRRR
MMQDWADRLDLLEQGKVEAASTHLTIHIEGVPAMAEEPAAIAAVSAKAAV
SSTPIVVVPSTEGTTFQRLSQVPPPPTRTPEPEASAIQREREEMLAMYES
PCCLPVPLFGKLAGKSKDQINRELKAGKLLSISLGNRGQRVPDWQLVPLK
HKLTQVLMNQCQGADSWDLFRMLTRPHTDLGDRAAIDVVTPTNVLAIVRT
IMGDQSFEKVRTLQSSESVGERAQQQPLHQISDQEARERPSSPSL
>gid:103324  lig1  DNA ligase
MTASPDPAQRIDALRQRIEDANYRYHVLDEPQIADVEYDRLLRELEALEA
AHPELATADSPTQRVGYLAASRFAEVRHVLPMLSLGNAFSDEEVAEFVRR
ISERLERKQPVFCAEPKLDGLAISLRYEQGEFVQGATRGDGATGEDVSAN
LRTVKAIPLRLRGTGWPEVLEVRGEVYMPRAAFEAYNAQMRLQGGKVLAN
PRNGAAGSLRQLDARITAQRPLSFFAYGVGEVADGALPPTHSTMLAQLRE
WGFPVSQLVEVVQGSEGLLTYYRRIGEARDGLPFDIDGVVYKLDDLAGQR
EMGFVSRAPRWALAHKFPAQEQSTTVEAIEIQIGRTGAATPVARLKPVHV
AGVVVTNATLHNADQIARLDVRVGDTVIVRRAGDVIPEVAGVVAEQRPAG
THAWQMPTQCPVCGSEIVREEGQAVWRCSGELTCPAQRKEAFRHFVSRRA
MDVDGLGEKFIEVLVDSGVVQGVADLYLLNVDQLLQLRLISTADSPHAFL
REAREHLAAGAYAQVEQTMVGIGVDLAGVQPAPQTWQADLLRAGLPAFDW
NRKKIATKWAENLIEAIETSRDTTLERFLFALGIEHVGESTAKALSAWFG
ELDVIRHLPWPLFKRVPDIGGEVARSLGHFFDQAGNQQAIDDLLQRGVRI
GDAHPPSPKLRGALSFAVLLEDLDIPKVTPVRAQQLAAATASFDALIASE
ADPLLQAGVPAPVIASLQQWLARPENAALATAAQRAMDALLAQLPQADAV
QAGPLDGQTVVITGTLAALTRDAAKQRLESLGAKVAGSVSKKTAFLVAGE
EAGSKLDKAQSLGVEIWDEARLLAFLSEHGQAV
>gid:103044  lig2  DNA ligase
MKRFAALYRTLDRSTGTLDKRAALVAYFRAAPPLDAAWALYLLAGGKVAS
ARMRIAASGELREWIAEAAGIADWLVADSYDHVGDLAETLALLLDDPASE
AVDVSLAEWIEQRLLPIANQDVAVRKHCIVQAWRTLAFDERLVFNKLLTG
ALRVGVSQRLVQQALAELSGVDIARIAQRMLGSWRPHATYVADLLTHEAL
PGDRQQPYPFFLASPLEADVESLGAIDDWLLEWKWDGIRLQLLRRAGEAA
LWSRGEERLDGRFPEIEQAAMGLPDGTVIDGELLAWQPEQLLPMPFTALQ
TRIQRLKPGPKTLAAAPARVVAYDLLELDGEDLRERPLHARRALLERVLA
TLADPRIIASPLVHSTDWQAAAQVRLDARARGVEGLMLKRARSPYQSGRR
RGDWWKWKIDPLTIDAVLLYAQAGHGRRSTLYTDYTFGLWHDGALVPIAK
AYSGLDDTEILQLDRWIRANTTERFGPVRAVTPHHVFELGFEGVNRSTRH
KSGIAVRFPRILRWRHDKPFAEADHLSSLQALAR
>gid:104061  lig3  ATP-dependent DNA ligase
MSLSEYRRKRSFDKTREPEPGKLLPQGQRAIFVVQLHHASRRHYDFRLQV
GDALKSWAVPKGPSYDPAVKRMAVEVEDHPVDYASFEGEIPKGEYGGGHV
AQFDHGVWATAGDPEAQLAKGHLRFELFGSKLKGGWHLVRSSKPARQPQW
LLFKEDDAYAGTLEADDLLADVAAAPAEDVRRAGAGKAQRKALTTVPVPR
ARARNAWTNAALKLTHARRGDIDDAAFAPQLAKLGQAPPEGAQWVHEIKW
DGYRILATVTDGQVRLWSRNALEWTDKIPDIRDAIQALNLRSARLDGELI
AGRGTKEDFNLLQATLSGERQVPLALAVFDLLHIDGVEISEAPLRERKQL
LQQILANAPAGHLAYSSHVEGDGLEAFRVAGEQHFEGIISKRADRPYRGG
RSDDWRKTKQLASQEYAVVGYTAPKGSRTGFGSLLLATPDPQHGWLYVGR
VGSGFSDTLMQEVTQHLHGGGKRPTAHIPTEDTDLRGATWFAPRFVVEVF
YRGIGGQQLLRQASLKAVRLDKDIADLADSDMGDVSPAQADVADTPARGR
KRANKQAPAQGEPTLSSPTKLIYPDIRATKGDVWDYYHAVMDHLLPEIVG
RPLSIIRCPNGAEKPCFFQKHHTAGLERVSSVRLKEETGSNAYYLVVEDA
PGLLELVQFNALEFHPWGSHAARPDMADRVVFDLDPGPDVPFAEVKRAAT
DIRKLLAQLELESFLRVSGGKGLHVVVPLNPGCDWELTKRFAKGFADALA
QSEPDRFVATATKRFRNKRIFVDYLRNGRGATAVASYSLRGRPGAPVALP
LPWSDLAKLHRANAFTLRDVPDKLRRRRKDPWADIAQIQQNLARWADQG
>gid:104452  mfd  transcription-repair coupling factor
MPSPTFPSPPLPKSGQLRAYWRAPSSPTALAWSIARAAEAHAGPVLVIAR
DNQSAHQIEADLHALLGDASALPVVPFPDWETLPYDQFSPHPEIISQRLA
ALHRLPGLTRGVVTVPVQTLLQQLAPLSYIVGGSFDLTVGQRLDLDAEKR
CLESAGYRNVPQVMDPGDFAVRGGLLDVFPMGADTPLRIELLDEDIDSIR
AFDPESQRSLDKVDAVKMLPGREVPMDDASVERVLACLRERFDVDTRRSA
LYQDLKSGIAPSGVEYYLPMFFSKTATLFDYLDTRVLPLIATGVSNAADA
FWLQAQNRYEQRRHDVERPLLPPDELYQSPDALRERLNKLARIEVWPADH
PRIDEAAPLGDQPLPPLPVAAKDAPAGQALASFLGHYPGRVLVAADSAGR
REALMEVLAAAQLKPDVVADLPAFLAATKLRFGITVAPLEDGFALDTPQI
AVLTERQLFPERANQPRRTRRVGREPEAIIRDLGELSEGAPIVHEDHGVG
RYRGLIVLDAGGMPGEFLEIEYAKGDRLYVPVAQLHLISRYSGASADTAP
LHSLGGEQWTKAKRKAAEKVRDVAAELLEIQARRRARAGLALQVDRAMYE
PFAAGFPFEETTDQLAAIDATLRDLGSSQPMDRVVCGDVGFGKTEVAVRA
AFAAASAGKQVAVLVPTTLLAEQHYRNFRDRFADYPMKVEVLSRFKSTKE
IKAELEKVASGDIDVIIGTHRLLQPDVKFKDLGLVVVDEEQRFGVRQKEA
LKAMRANVHLLTLTATPIPRTLNMAMAGLRDLSIIATPPPNRLAVQTFIT
AWDNTLLREAFQRELSRGGQLYFLHNDVESIVRMQRDLSELVPEARIGIA
HGQMPERELERVMLDFQKQRFNVLLSTTIIESGIDIPNANTIIINRADRF
GLAQLHQLRGRVGRSHHRAYAYLVVPDRRSMTSDAEKRLEAIASMDELGA
GFTLATHDLEIRGAGELLGEDQSGQMAEVGFSLYTELLERAVRSIRQGKL
PDLDAGEEVRGAEVELHVASLIPEDYLPDVHTRLTLYKRISSARDSDALR
ELQVEMIDRFGLLPDPVKHLFAIAELKLQANALGVRKLDLGENGGRLVFE
AKPSIDPMTVIQMIQKQPKIYTMDGPDKLRIKLPLPEAADRFKAARGLLT
ALAPR
>gid:102053  mttC  type V secretory pathway protein
MQLIDIGANLTHDSFDRDRDAVLQRARDAGVAQLVITGASREHSPLALQL
AQQHPGFLYATAGVHPHHAVEFTAECEREMRALQAQPQVVAVGECGLDYY
RDFAPRPAQHKAFERQLQLAADNGKPLFLHQRDAHDDFLSIMRAFDGRLG
AAVVHCFTGTREELFDYLDRDYYIGITGWLCDERRGAHLRELVRNIPANR
LMIETDAPYLLPRTLKPLPKERRNEPMFLSHIVEELARDRGEDVAVTAEN
STAAARAFFRLPVPATAA
>gid:104052  mutL  DNA mismatch repair protein
MAIRQLPEILINQIAAGEVVERPASVVKELVENALDAGATRVDIDLEEGG
VRLIRIRDNGGGIAPEELPLAVSRHATSKIASLDDLETVATLGFRGEALP
SIASVSRFTLASRRPDAEHGSALQIEGGRLGEVMPRAHAPGTTVEVRELF
FNVPARRKFLRAERTELGHIEEWLRSLALARPDVELRVSHNGKPSRRYKP
GDLYSDARLGETLGEDFARQALRVDHSGAGLRLHGWVAQPHYSRASTDQQ
YLYVNGRSVRDRSVAHAVKMAYGDVLFHGRQPAYVLFLELDPARVDVNVH
PAKHEVRFREARLIHDFVYRTLQDALAQTRAGALPADVGVGGAAALGIGA
VAAQGGGSYVADAGAGHPGAGSGSGYASWAPSQAPLGLRVDEARAAYAAL
YAPAAGSALRDDGQPVLSGTGLPATAHDSGVPPLGYAVAQLHGIYILAEN
AEGLIVVDMHAAHERIGYERLKQAHDSIGLHAQPLLVPMTLAVGEREADT
AEREADTLASLGFEITRSGPQSLHVRSIPALLANADPEALLRDVLGDLRE
HGQSRRIATARDELLSTMACHGAVRANRRLTVPEMNALLRDMEATERSGQ
CNHGRPTWARFTLGEIDRWFLRGR
>gid:105901  mutM  formamidopyrimidine DNA glycosylase
MPELPEVETTLRGLAPHLVGQRIHGVILRRPDLRWPIAAQIEQLLPGATI
TDVRRRAKYLLIDTDAGGSAVLHLGMSGSLRVLPGDTPPRAHDHVDISLQ
NGRVLRFNDPRRFGCLLWQRDCETHELLASLGPEPLSAAFTGDYLHALAC
GRRAAVKTFLMDQAVVVGVGNIYAAESLHRAGISPLREAGKVSRERYRRL
ADAVKEILAYAIQRGGTTLRDFISPDGAPGYFEQELMVYGREGEACRHCG
GELKHATIGQRATVWCAACQR
>gid:102961  mutS  DNA mismatch repair protein
MRPIDRQIYRPPDFRTTFLQTADTKDKTKLSTGAAEHTPLMKQFFAAKSD
YPDLLLFFRMGDFYELFYDDARKAARLLDITLTQRGSSGGAPIPMAGVPV
HAYEGYLARLVALGESVAICEQIGDPALAKGLVERKVVRIVTPGTVTDEA
LLDERRDTLLMAISRSKQGYGLAWADLAGGRFLVNEVDSVDALEAEIARL
EPAELLVPDEDNWPEFLRGRVGVRRRPPWLFDADSGRRQLLAFFKLHDLS
GFGIDDKPCATAAAGALLGYVEETQKQRLPHLTSIAMEVASEAISMNAAT
RRHLELDTRVDGDTRNTLLGVLDSTVTPMGGRLLRRWLHRPLRLREVLVQ
RHHAVGSLIDTGADTDVREAFRALGDLERILTRVALRSARPRDFSTLRDG
LALLPKVRTILAPLDSPRLQTLYAELGEHDATAHLLISAVAEQPPLKFSD
GGVIATGYDADLDELRRLSTNADQFLIDLEQRERASSGIATLKVGYNRVH
GYYIEISKGQAEKAPLHYSRRQTLTNAERYITEELKSFEDKVLSARERSL
SREKLLYEGLLDALGGELEGLKRCASALSELDVLAGFAERAQALDWSQPE
LESAPCLHIERGRHPVVEAVRDQPFEPNDLDLHPDRRMLVITGPNMGGKS
TYMRQNALIVLLAHIGSYVPASRAVIGPIDRILTRIGAGDDLARGQSTFM
VEMAETSYILHHATPQSLVLMDEIGRGTSTYDGLALADAVARHLAHTNRC
YTLFATHYFELTALADASHAGGGSGIANVHLDAVEHGERLVFMHAVKDGP
ANRSFGLQVAALAGLPKAAVQQARRRLAELEQRGGDSHAAEMAPAALDAP
QQFGLFTAPSSAAQEALQALDPDELTPKQALEALYRLKALL
>gid:103917  mutT  7,8-dihydro-8-oxoguanine-triphosphatase
MPHTPIVATLGYLLSPDGTQVLMIHRNARPGDHHLGKYNGLGGKLEADED
VLACMRREIREEAGVECGQMQLRGTISWPGFGKQGEDWLGFVFLIHSFDG
TPQTSNPEGTLEWVPIAQMDQVPMWEGDRNFLPLVFDGDPRPFHGVMPYR
DGRMQSWSYSRV
>gid:103719  mutT  7,8-dihydro-8-oxoguanine-triphosphatase
MTLQETRWHPDVTVATVVVRDGRFLQVEESIGGRLLLNQPAGHLEPDESL
LQAAVRETLEETGWDVRLTQFIGTYQWVAPTGQCFLRFAFVADALAHHPE
RSLDTGVVRALWMTPEELRAASDRLRSPLVWEVVADYLAGQRHPLALVRH
VA
>gid:104172  mutY  A/G-specific adenine glycosylase
MPVPATLTTDAFVDRLLHWFDGHGRHDLPWQHPRAPYRVWLSEIMLQQTQ
VAVVIPYFQKFVASFPTLADLAAADNDTVMAHWAGLGYYARARNLHAAAK
QCVALHAGELPRDFDALLALPGIGRSTAGAILSQAWNDRFPIMDGNVKRV
LTRIHGIAGYPGLPVVEKQLWQLAANHVAHVPAGRLADYTQAQMDFGATL
CTRARPACMVCPLQENCVARREGLVEALPTPKPGKQLPEREATALLLENA
HNEILLQRRPPTGIWASLWTLPQAETDSDLREWFAAHIDGDYDRADEMPM
IVHTFSHYRLHLQPLRLRKVALRQVLRDNDDLRWVARADLATLGLPAPIR
KLLDAL
>gid:104467  nfi  endonuclease V
MQTSIDPVFAGWDGSVAQARQLQQQLAQRVALRDEVSAAPALLAGFDVGF
EDDGQTTRAAAVLLDAQTLLPLETHVARVPTSMPYVPGLLSFRELPALLR
ALALLARTPDLVFIDGQGIAHPRRFGIAAHFGVVTGLPSIGVAKQRLAGT
FIEPGGERGDHSPILLAGAQIGWALRSKPRCNPLIVSPGHRVSMQGALDW
TLRTLRAYRLPEPTRLADRLASRRGEIELQTQPTLL
>gid:103286  nth  endonuclease III
MSSALSAPPPRRGSTLRKPEIQELFARLRELNPHPTTELEYTTPFELLIA
VLLSAQATDVGVNKATRKLYPVANTPRDILDLGEEGLKRYISTIGLFNAK
AKNVIATCRILLERYGGEVPHDRAALEALPGVGRKTANVVLNTAFGEPTM
AVDTHIFRVANRTGLAPGKDVRVVEDKLVKVIPAEFLHDAHHWLILHGRY
VCKARKPDCPNCVIHDLCRYRDKTVAA
>gid:102243  nudC  NADH pyrophosphatase
MSEPLFSLSAFAFTHAPLDRGDVLRDDPDAIARLWPTGRVLLIDAKGTAA
ADAQGQPLLSDGAALADTPGAAIFLGLRDGVGWFALAAEQVATELPHRVD
LRQAAADWPAELSTAFSYGRAMLHWQSRTRFCGVCGGAIAFRRAGFIAHC
TQCQTEHYPRVDPAIIVAVSDGQRLLLGRQASWAPRRYSVIAGFVEPGES
LEQTVEREVFEETRVQVQGCQYLGAQPWPFPGALMLGFAATAAPTELPQV
TGELEDARWVSHAEIGTALAGESGDTGIGLPPAISIARALIEHWYRTHG
>gid:104637  nudE  ADP compounds hydrolase
MLRMSTRLPTIHKITDLGEGPFRRQQLDLEFSNGERRLYERQLSQGHGAV
VVVPMLDAQTVLLVREYAAGVHRYELGLVKGRIDAGETPEQAADRELKEE
AGYGARQVQVLRAMTLAPTYMSHQSWLVLARDLYPERLPGDEPEELDVIP
WPLARLDELMLREDFSEGRSLAALFIAREWLERNP
>gid:102234  nudH  (di)nucleoside polyphosphate hydrolase
MIDPDGFRPNVGIVLMREDGQVFWARRVRRDGWQFPQGGMNTDETPVEAM
YRELREETGLLPEHVELLGATPGWLRYRLPSRAVRRNERQVCIGQKQVWF
LLQFTGQESHLKLDHTDSPEFDHWRWVDFWYPVEHVVMFKRGVYARALRH
LAPLAQTVAGPAAVGVMPQRALEAWLPGSSAAGHDRPRKRPRKRGGVLPV
RINND
>gid:104407  ogt  6-O-methylguanine-DNA methyltransferase
MSTLHYDTFPSPIGALTVAADTTGVRHILFAQNRHDAPGRALWQHGPDAP
LVQAAREQLLDYLYGGRRSFDLPLAPAGTPFQLQVWQTLARIPFGETWSY
AQLAQAVGRPAASRAVGAANGRNPLPIVLPCHRVIGASGALTGFGGGLPT
KQALLQLEGWSPQASARRVAAVVGEDLFAR
>gid:103214  orf35  phage-related integrase
MARYHKEDRTHAYKSLGPVTAENDHENAKREARIWRKTIDAGVQADRLLT
VADVCRDYTAAIEAEGRTRAAIDARKRFDRIVYVDPIGKLRADKLTQRHL
EAWMTRMEAGEMTGRKKALPSRATFNRNLTALKAALNRSVARREIPQERV
IEWQSIKPHKGASGRRDTYLDKAQRRALLDAMGTDLRALAECVALTGCRP
GDPVAMRRKDWDSKNGLATFATKTGARTVPVSPAARALFDRLATDKQGDA
WMFTNEGEHWTPQAWAPKVKAAAAAAGLPTGVVLYVLRHAWITDAIIGGL
DAVTVARLTGTSLEMISQHYGHLAQHAAREMLGKIDFL
>gid:104756  orf37  phage-related protein
MLQRLERDYGLKHRSGTSYMRGGVCPACSKKELYTFEPKPWVIKCGREAK
CGHELHVKDLYDDLFDDWSKRFPMTQASPTASADAYLESSRGFALAPLRG
LYTQESYYDIKVKEGTATVRFALDKGGWWERLIDRPHRFGKQKARFAPGK
SYAGAWWCAPAAAELMRTATEVWIVEGIFDAIALLQHGVCAVSAMSCNAF
PDESLRQLAKLRAGNLPTLVWGLDNEPGARDYTHKHARRADALGFNSRAA
LIAQPVTGKKIDWNDLHLRAQAGGDSQKQWDAALTEARYQGDLLMARSAI
EKGLLMYDHNQASDFWLEYRSRLYWFEFDTVRFEKLLRDVEPEEDSEIDP
DKLAKIRRAACSVNKIANCYPEALYFQRQEVTDESWYYFRIDFPHDANSV
KGTFTGGHISSASEFKKRLISLAAGAMFTGSGHQLDRLIEEQTEAIKTVE
AIDFVGYSKEHRAYLLGDIAVRDGEVVTANEEDYFSFKKLRLKSTQKSIR
LEIQRDPEAFRMDWLPWLWQCFGTHGMVAMTFWFGSLFAEQIRAGHKSFP
FLEATGEAGAGKTTLLTFLWKLLGRSDYEGFDPAKSSKAGRARAMGQISG
MPVVLLEADRSEPDKAHAKTFEWDELKDFFGGGTLATRGVRNGGNDTYEP
PFRGTIVISQNAAVDASEAILTRIVKLHFKRPQVTTESRIAADNLNALQV
EELSHFLIKAVRCEGAILEKFAERVKFYEARLREKPDLRLERVIKNHAQM
LALLDCLRMVITIPEEMIKATRDALLEMAFERQKAISADHAQVNEFWEVY
EYLEATGNGKPVVNHSRDASRIAINLNQFAAKAAQFSQVVPDLKVLRGLL
ADSRRHKLVSANTAVNSAVLTNGFGAGTTVKCWVFSK
>gid:103149  parC  topoisomerase IV subunit A
MTDLTRPTFHGFEQLPLREYAERAYLDYSMYVVLDRALPFLGDGLKPVQR
RIIFAMSELGLNAAAKPKKSARTVGDVIGKYHPHGDSACYEALVLMAQPF
SYRYPLIEGQGNFGSTDDPKSFAAMRYTESKLTPIAEVLLGEISQGTTDW
AANFDGTLEEPTWLPARLPHLLLNGTTGIAVGMATDVPPHNLNEIVSALL
HLLDDPDATVAQLCEHVLGPDYPTNAEIITPVADLRAIYETGHGSVRARA
TYKKEHANIVIDALPYQVSPSKVIEQIAQQMRAKKLPWLEDIRDESDHTS
PVRVVLVPRSNRVDAEQLMGHLFVTTDLERSYRVNLNVIGLDGRPQVKNL
KHLLSEWLTFRSDTVTRRLNHRLQKVERRLHLLEGLLIAFLNLDEVIRIV
RSEDEPKPVLIARFALSEEQAEYILETKLRQLARLEEMKIRGEQEALAKE
REQILSILGSKTKLKKLIKDELTADAKKFGDARRSPLVQRGAAQAIDETE
MVASEPMTVVLSEKGWVRAAKGHEVDPAGMSYRDGDGLLAAVRSRSTYHV
AFLDSDGRAYSTLVHTLPSARGNGEPLTGRFSPASGASFQVLASGENNAR
FVLASSHGYGFVTRFENLTGRNKAGKAMLNLTTGAHVLTPAQVLNPQTDR
IVAVTSAGNLLAITASDLPELDKGKGNKLIEIPKAKLGTERVVAVAAVAP
GNTLLVRSGARVMSLSFKDLDTYVGARASRGALLPRGWQKVDGLEVQ
>gid:103449  parE  topoisomerase IV subunit B
MPTRCMNTRYNAADIEVLSGLDPVKRRPGMYTDTARPNHLAQEVIDNSVD
EALAGHAKQVEVTLYKDGSCEVSDDGRGMPVDMHPEEKIPGVELILTRLH
AGGKFSNRNYTFSGGLHGVGVSVVNALSTKVELFIKREGSEHRMEFRDGN
AASKLEVVGTVGKKNTGTRLRFWADPKYFDTPKFNVRALRHLLRAKAVLC
PGLTVKLHDEATGEQDSWYFEDGLRDYLKGEMADRELLPADLFAGSLKKD
TEIVDWAAAWVPEGELTQESYVNLIPTAQHGTHVNGLRSGLTDALREFCD
FRNLLPRGVKLAPEDVWDRVTFVLSLKMTDPQFSGQTKERLSSRQAAGFI
EGAAHDAFSLYLNQNVEIGEKIAQIAIDRASARLKTEKQIVRKKVTQGPA
LPGKLADCISQDLSRTELFLVEGDSAGGSAKQARDKDFQAILPLRGKILN
TWEVASGSVLASEEVHNLAIAIGCDPGKDDITGLRYGKVVILADADSDGL
HIATLLTALFLQHFPALVAAGHVFVAMPPLFRVDVGKQVFYALDEEEKTT
LLDKIAREKMKGQISVTRFKGLGEMNPQQLRESTIHPDTRRLVQLTIDDG
EQTRSLMDMLLAKKRAGDRKQWLETKGDLASLEV
>gid:103189  phr  photolyase-like protein
MLRGLTCRCRVCMSYAIVWFRRDLRLEDNPALRAALDAGHDPIPLYIDAP
HEEGQWAPGAASRAWRHRSLAALDASLRARGSALLIRQGDSAQVLDAVIA
QTEAVAVYWNRKYEPATQPRDAQIKRSLRERGLEVQSCNAALLFEPWTLA
TQQGRPYKVFTPFWRNALTQLRLPDAMPAPRSLPPLPASLDGVHVDALNL
LPTPAWDQGFWEHWQPGEAGAHEMLEIFVDGALSGYRENRDRPDRVGTSQ
LSPHLHFGEIAPWRIASTLEAQRSARNGADIDGYIRQLGWRDFAYHLLHH
FPDTTTQNLNPRFAGFDWATVDPVTLDAWQRGRTGIPIVDAGLRQLWHTG
WMHNRVRMIVASLLCKHLRVHWLEGARWFWDTLVDADLANNTMGWQWVAG
TGADAAPYFRVFNPVTQAEKFDPQATYITRWIPELAALPVKERFAPWLHP
LSLARLAPTYPRAPIIGLAEGRDAALAAYAGTRG
>gid:105774  polA  DNA polymerase I
MSRLVLIDGSSYLYRAFHALPPLTNAQGEPTGALFGVVNMLRATLKERPA
YVAFVVDAPGKTFRDDLYADYKANRPSMPDDLRAQVQPMCDIVHALGIDI
LRIDGVEADDVIGTLALQGASDGLAVTISTGDKDFAQLVRPGVELVNTMS
GSRMDSDEAVIAKFGVRPNQIVDLLALMGDTVDNVPGVEKCGPKTAAKWL
AEYDSLDGVIANADKIKGKIGENLRAALPRLPLNRELVTIKTDVVLASGP
RALDLREPNAEALAVLYARYGFTQALRELGGAAAEAGGLTAPMAVARTEP
GRARGTGFVSAPAAAPVELDPALSAPGQYETILTQAQLDSWIARLRAAGQ
FAFDTETDSLDALQANLIGLSVAAEPGQAAYLPFGHDFPGAPAQLDRTQA
LAQLAPLLTDPAVRKLGQHGKYDLHVMRRHGIALAGYADDTLLESFVLNS
GSARHDMDSLAKRYLGYDTVKYEDICGKGAKQIKFAQVSLEDATRYAAED
ADITLRLHQVLGKRLAAEPALESVYRDIEMPLVGVLERIEANGVCVDAAE
LRRQSADLSKRMLAAQQKATELAGRTFNLDSPKQLQALLFDELKLPAVVK
TPKGQPSTNEEALEAIADQHELPRVILDYRSLAKLRSTYTDKLPEMIHPQ
SGRVHTSYHQAGAATGRLSSSDPNLQNIPIRTEDGRRIRRAFVAPAGRKL
IACDYSQIELRIMAHLSGDPGLVGAFESGADVHRATAAEVFGRTIDTVSG
DERRAAKAINFGLMYGMSAFGLARQLGIGRGEAQDYIALYFSRYPGVRDF
METTRQQARDKGYVETVFGRRLYLDFINAGSQGQRAGAERAAINAPMQGT
AADIIKRAMVSVDGWIADHAQRALMILQVHDELVFEADADFVDTLLAEVT
ARMSAAASLRVPLVVDSGVGDNWDEAH
>gid:105519  priA  primosomal protein N'
MSAPVTTLRVALPVPLPQLFDYLPLQDTDVDGPDRVGCRVRVPFGPRELI
GVVVERGQQPSAEGLRAALDWCDDTPLLIDELARSLQWLARYTHAPLGEA
QASALPGPLRRGEPLADTHAWAWQLTEAGHTGAGSLRAGSRPALLAALLL
AGPLAEEPLEQQLPQWREAARNLAKRGYAERVAVAADTLPARPGTGPQLN
DEQQAATDAIRAGSGFATYLLDGVTGSGKTEVYLQAIADCLAAGKQALVL
VPEIGLTPQTLGRFRERLGVPVHALHSGLSDGERARVWAAAWRGEAKLIV
GTRSAVFTPLPNAGLIVIDEEHDGSYKQQDGIRYHARDFALVRGKALDVP
VILGSATPSLESLHNAYSGRYRHLRLSRRAGDARPPRVRVLDVRKRPLKD
GLSPEVLAGIGATLARGEQVLVFKNRRGYAPVLLCHDCGWTAACQRCSTP
LHQTPMTVHAGGRRLQCHHCGARQPAPLACPACASLALQPQGIGTERLEE
RLVEAFPEAPVVRIDRSTTQRRDALETQLARLGTDAGILVGTQILAKGHD
LPRLTMVVVVGIDEGLFSADFRAAEKLAQQLIQVAGRAGRADRPGEVWLQ
THHPEHPLLQTLVNGGYHAFADAELQQREAAGFPPFAHLALFRAEAKDVA
AANQFLIAVRALVGAQTPAPSPAITPVECYGPMPAPMPRRAGFQRTQLLL
SAAQRSALHRVLDAQMPAIHTLPQARRVRWSLDVDPIDLY
>gid:105614  radC  DNA repair protein
MHINDWPTDERPREKLLARGAAVLSDAELLAIFVGSGLRGQDAVRTARDL
LHRHGPLRCLLDRPAKALARLPGLGPASACKLSAALELANRHLLSDLERG
EALSDPSSVGRYFSQRLRARNYEVFAALFLDSRHRAIAFEELFTGTIDAA
EIHPREVVRRALLHNAAAVVVGHNHPSGNPEPSEADRAVTQRLLQALGLV
DIRLLDHFVIGDGRPVSLAERGWVP
>gid:103476  recA  RecA protein
MDENKKRALSAALSQIEKQFGKGSVMRMGDRVIEAVEVIPTGSLMLDIAL
GIGGLPKGRVVEIYGPESSGKTTLTLQAIAECQKLGGTAAFIDAEHALDP
IYAAKLGVNVDDLLLSQPDTGEQALEIADMLVRSSSVDIVVIDSVAALTP
KAEIEGEMGDQLPGLQARLMSQALRKLTGNIKRSNTLVVFINQLRMKIGV
MMPGQSPEVTTGGNALKFYASVRLDIRRIGAIKKGDEIIGNQTKIKVVKN
KLAPPFKQVITEILYGEGISREGELIDMGVEAKLVDKAGAWYSYGDERIG
QGKDNARGYLRDNPQVAIKLEAELREKFQPAEAPREAGETESE
>gid:105953  recB  exodeoxyribonuclease V beta chain
MSNSPVTDPYLHLPLHGVRLIEASAGTGKTFTLATLFTRLVVERQLRIGQ
ILAVTFTEAATQELRRRIRERLALAATLVPDARAGAAATEAQTTLLPDAS
SAGVGAALAATEPTQTQASDAPSSAINMVLPATAPSDHLSHPPAQTPPQP
HAPAAPDAVLTRAILTAHLATGTETPSALRRRLQQAVEEIDLAAIFTIHG
FCARVLREHALESGQAFAAPQLLANDRELLGEVAADLWRQRAADAAMAAD
LVALWPAGPTALASDLRALVQQPELLPAVAAPTPDPQPARQAAAQAVVAA
LRAHGDTAYDAVAAAFEHKIFDGRRARRPSFDKAFEQLWQGSAEAHWVLD
DGGHLDKLLPQRLREFCKDGAHDRVPCSPLFDALAVWQQADAVVRQWEGQ
RRIRLLHALRDDAVLQLAQRKRQRRVQTYDDLVDGVARALQGPQAEALVQ
RLRAQYAIALVDEFQDTDDRQWQIFSRVFGPEHGASGAAFAPDDDADFDN
AAGTPPPRLLALIGDPKQAIYGFRGGDVQTYLAAATTAQRAPPLEHNFRS
RPGVLAAIDALYAQAGYAEAFLTEGIAFHPVQPGTKRSDADLQRDDAAAP
ALTLWRAPAPPPPAKGKPKPWSAGRARELCTAACVAAIRGWLAGGRDGSA
SINGRPVQAGDIAVLVRSHGEATRIQQALGAVGIPAVAAGKQSLFATDEA
LELLALLQALLDPGDDSRLRAALATVLIGEDAAAIAALEHDGERHRRWQQ
QALDWRERWQRGGPLALVGDLGATHGQRLLALVDGERRLTNYLQLAELLQ
EADTRALGPHGLVDWLARRIANADDNDETQQLRLESDARRVQIVTLHKSK
GLEYPLVFLPYIGIGRADKSPGRHCVVHAPPHGRQLHWNTSKWSADDTAS
WSTAETAWKHEQRAEDARLLYVGLTRAEHALWIATGAFHQHERTALAPML
RDPAALQASAGAGVIALDDTAPPATLPRLPADDAVQVPAARLAQRHVVPD
WWVYSFTQLANADAGSDPMASATLASSGGSDEPPASEPVSAPAEVEAFDP
RFAGNRFGVAMHDVFERCDFAAWRNWCPGQPAPEGQAAAILEALQRGGYA
QDELDDGLAMLTRLVGHTLTVTLPEGTCLAAVPEPQRRNEMEFHFAMRPT
RVDALLALLHRFGVVTERQAFGARQRLEGLMTGLIDLTYCADGRWYVLDY
KSNRLPAYDPDALARAMAHSEYELQALIYTIALHRWLRFRLGASYDYARD
FGGVRYLFCRGLDAARNPAADSSSILGSGSGSGSDSDSDSVNGASSVSGS
DPASDAMSDTPAPNTPVPGIYAWRFDPALVQALDALFAGNPTEPLSSDAL
KPLSPRERGWGEGTSTTGTPTP
>gid:105954  recC  exodeoxyribonuclease V gamma chain
MHATSAPDFRLYPSNALDTLAALLAEELRRPVPEQPVLQPEVVLIPQVAM
RRWLQSTLAAEHGVAANLEFLTPGEFVARALERNLGPADDDLDMATTQWR
LYQTLQGELGSDAALAPLAGYLADGDALKPWALAGELGSVFEKYQAWRRD
WLLRWESGADADDPQARLWRSIAGGRQYRARRIGQYLDRYARPDGPLPQG
LPKRLFAFAILNVSPDVLRVLATQARVGTLHFYLPTPTQGYWGDLQTLWQ
RRREGGAVALFAEQVQENPLLQAWGAAGRDFMALVGDYEVVHPLAEIAAY
ADPLDAGRRTLAEGGLGDSLLRRMQSDLFHRHAPAVPPVLPAVNLHDPSL
QVHACHTRLRELQVLHDQLRALLDDARFDPPLQPREIAVLSPDIDPYVPY
LDAVFGGHGSDDGLPYALADASPLASEPLADVFLTLLGLPISRFGLHEIL
DLLASAPIAEAAGLDEAGLERLRGWLHGAGARWGLDAVHRRQHQAPGDDA
YTWRFALDRLLLGHASGAEDDIDGVAPWPQLEGSALAALDTLLRLLRVLD
RHQAALAEAMTPVQWRECLLGLLEALIPAAPSAPRAQRALERLRTLIDQF
ARDAVRAEYAGNVPAEVVRAHFAAVLGESDTRAPLLTGGISFGRMVPMRL
LPFRAICLLGMNDGDFPRRDPAAGLNRLTAELGTERRRHGDRSTREDDRF
LFLQLFASAQEVFYLSYLGADARDGTVREPSVLVSELLGSAAQYHADPKA
IDALVVRHPLQPFAAAAFGAVGEDGADPRRFSYRRQWRPAVDSLAGQRQP
LAPWVAGALPADASVLPASVSIDDLRRLFADPAGQFLRHRLGMRLPDPAG
EDSDLEPLLAPTRGLEQYGLQQQVFEAALAGDADGLYERLRARALLPSGP
LGRRQLDERLRQLRPYADVFRQWRGEAPAQSQRLQVEIDGTNVHGRVPGW
YANGVGRVQVGALSGRSAIRDGLEWLLLRAAGERVPFVRFFEHDDSLGPH
PIDPEPLSQTQARAALGELLQLYRQGLQTPLAFAPYSSWKYHQAARNDEL
DKAIKDAHGQWQSSFGWSESHSPELRLVTRGRDPFGDAQQFVDFARTSHQ
LFALLEDGSAPAPLDPARVIESWRQWRGAQDDAE
>gid:105952  recD  exodeoxyribonuclease V alpha chain
MNHPNLLTALNQAGALRTLDLAFAQSLQRLAPDTDPQVLAGAALASLAVT
SGHAGLDPTRAAMLLDAREGPSPALPDPTDWQRTLAASRWVDQPNPQEPA
AADCPLVLEHGLLYLRRYREYERRLALGLQRIAAHSPPPFAAATLAPLFE
QLFPQASPLPQGEGARRAGEGTGLPEPSIYQDGTNPPEPSHHQDHQAQAA
ALALRRTLLLVTGGPGTGKTTTIARLLLLRIAQAHASNTPAPRIALAAPT
GRAAERMAESLRAAVARAIANGIDPALADALPTGASTLHRLLGVIPDSPQ
FRHTADNPLPFDLIVVDEASMVDLPLMCKLVEAVADGTQLILLGDADQLP
SVEAGDVLAAILQAAGPGDTLQPQDADALQPLLGSAPPGSTPASIQTGGH
TISHTHTGGLAGHRVHLLRGYRQADNFALTPLADAIRTSDADTALALLRS
GELAGVHFHEDGEDPLALGRDALLAHWRALADAHDPAAALRDAARLRLLT
AVRAGPQGARGLNARIEQLLAESGSGARRLGSASPWFQGRLLLITENSYR
HGLFNGDVGICLRSEASPFSERSDTNAPSERSDASTVTARSDAAASSGPS
HSIGTANRADPVAERRAQGPLVAWFEGDGDSQVRGFHPAALPAHESAFAM
TVHKAQGSEFDTVWLQLPTRDARVLSRELLYTGITRARRALHLAGSEAAL
RAALARHAARISGLAWRLGGEQMQPAPVEQTAEPPTVTPVQGSLF
>gid:101757  recF  DNA replication and repair RecF protein
MHVARLSIHRLRRFEAVEFHPASTLNLLTGDNGAGKTSVLEALHVMAYGR
SFRGRVRDGLIRQGGQDLEIFVEWRERAGDSTERTRRAGLRHSGQEWTGR
LDGEDVAQLGSLCAALAVVTFEPGSHVLISGGGEPRRRFLDWGLFHVEPD
FLALWRRYARALKQRNALLKQGAQPQMLDAWDHELAESGETLTSRRLQYL
ERLQERLVPVATAIAPSLGLSALTFAPGWRRHEVSLADALLLARERDRQN
GYTSQGPHRADWAPLFDALPGKDALSRGQAKLTALACLLAQAEDFAHERG
EWPIMALDDLGSELDRHHQARVIQRLASAPAQVLITATELPPGLADAGKT
LRRFHVEHGQLVPTTAAD
>gid:104992  recG  ATP-dependent DNA helicase
MPRARSVTPSLAVAGQAPLSSLPGVGPKVAEKFAARGILSLQDLWLHLPL
RYEDRTRLTTIAQLQGGVPAQIEGRVEAMERGFRFRPVLRVAMSDDSCGT
LVLRFFHFRAAQVAQFSPGTRLRVFGTPKPGQNGWEIVHPSYRVLAPDED
AGLGDCLDPVYPVLEGVGPATLRKLIGQALERLPPEAALELLPPHWLQDE
QLPSLRSALLTMHRPPVDTDPQQLLAGGHPAQQRLAIEELLAHQVSLRRQ
RIALQRFRAPQLRGGRLVQQLRKALPFQLTGAQQRVFEQIAHDLAQPAPM
LRLVQGDVGSGKTVVAALAAMLAVEHGKQVALAAPTELLAEQHLANLRGW
LEPLGVRIVWLAGKVTGKARVAAMAEVASGQAQVVVGTHALMQDAVVFHD
LALAIIDEQHRFGVHQRLALRDKGAAAGSVPHQLVMTATPIPRTLAMAAY
ADLHVSAIDELPPGRTPVQTIVLSAERRPELVERIRAACAEGRQAYWVCT
LIEESEDTDKGAQNGPPRIEAQAAQVTFETLSAQLPGVRVALVHGRMKPA
EKQQAMLDFKQGRTDLLVATTVIEVGVDVPNASLMIIENAERLGLAQLHQ
LRGRVGRGAAASSCVLLYQGPLSLMARQRLETMRQTNDGFVIAERDLELR
GPGELLGTRQTGLASFRIADLARDAGLLPRVQVLAERLLDEAPEIADRVV
ARWIGGAVRYAAA
>gid:103600  recJ  putative single stranded DNA exonuclease
MTSSPTIVRRPPGQGGTWPDAMLPLLRRIYAARGVVDVHGAHPRLGQLLS
PELLHNSRVAAELLADAIAAQRRILVVGDFDCDGATACAVGVRGLRMLGA
LDVHHAVPNRMVHGYGLSPALVDELAALQPDLLVTVDHGIACHAGVAAAK
ARGWTVLVTDHHLPGEVLPPADAIVDPNLVQDSFPSKTLAGVGVIFYVLL
ALRGVLRARGAFAERAEPDLSVLLDLVAVGTVADLVPLDTNNRALVSAGL
RRLRDGKGCIGLRALIDASGRDAARLSASDIGFALAPRLNAAGRLEDMAL
GIELLLCEDWSRAREIAGLLEEINAERRAVQQLMTDDAEQAVTKVMLAAD
GALPMAACLFDPEWHPGVIGLVASKLKDRLHRPVIALAPAEPGSDQLRGS
ARSIPGLHIRDVLAAVDARHPGLIQKFGGHAMAAGLSLEHRALAAFEQAF
QTQVQAMVDASLLQAELHSDGELAAHELDHLHAEALRAAGPWGQGFPEPL
FDGQFEVLQWRLLKERHLKLTLRCAGRAEPLNAIHFNGWRGSEPARTVRI
AYRLVGDDYRGGTAVQLIVEHCEPAASAG
>gid:103225  recN  recombination protein N
MLRHLSIKDFAVVRATELEFGPGMTVVSGETGAGKSLMVDALGFLSGLRA
DSGVVRHGADRAELSAEFQLPAEHPGLTWLADNELDDDAQCQLRRIIRAD
GGSRAWINGRPVTSSQLSDLAARLVEIHGQHEHQALMARNSQLALLDAYA
RNSAQREQVRQASQRWQALLDERDALSAQGDVSDRIGFLEHQLAELERED
LDPAAIAALDTNHRRQAHATALIGACESVVQQLNGDEGPSALGLLQDSRH
DLARVAEHEPRLGEVDALLDSAAIQIEEALALLDRVRDDLDADPTQFEAM
ERRLGRLHDLARKHRVSPDELAAHRDHLTAEVESLRGADERLQQLDKHIE
AAIGVWQGAASVLSASRQSAAQALSAATTTLIGELGMGGGQFLIQLQPQE
TLRPDPNGAERVEFLVAANAGQPPRALRKVASGGELSRISLAIEVAALGL
DSVPTMVFDEVDSGIGGAVADIVGQKLRALGEERQVLCVTHLPQVAAKGH
AHYRVSKAPVDGMTQSAVELLGPQARQEELARMLGGVEVSKEARAAARKL
LQSA
>gid:103029  recO  DNA repair protein
MLIEHERGFVLHARAWRETSLLVEVLTEQHGRVGLLARGVHGPRKQALRA
ALQPLQLIQFTAVQRGELAQLRQAEAIDTAPRLLGEAMLAGFYISELLLR
LAPRHAPVPELFDCYAQARAHLASGAALAWGLRQFERDVLDGLGFGFDLQ
HDSDGQPIDPAARYRLDPQDGARRVLSERLAQDRRETVTGAALLALGEDR
VPATEDMPGLRRSMRGVLLHHLSGRGLKSWEMLEELARRGA
>gid:104701  recQ  DNA helicase
MSSPAHELLSRVFGYDDFRGPQQAIVEHVAAGNDALVLMPTGGGKSLCYQ
VPALLRDGIGIVVSPLIALMQDQVEALRQLGVRAEFLNSTLDAENTQRVE
RALLSGDLDLLYVAPERLLTPRFLSLLERSRIALFAIDEAHCVSQWGHDF
RPEYRQLTVLHERWPHIPRMALTATADPPTQREIAERLDLVEARHFVSSF
DRPNIRYTVVQKDNARKQLQEFLGRHRGSAGIVYAMSRRKVEETAQQLCA
QGFNALPYHAGLPAEVRAENQRRFLREDGIIMAATIAFGMGIDKPDVRFV
AHVDLPKSMEGYYQETGRAGRDGEPAEAWLCYGLGDVVLLKQMIEQGEAA
EERKRLERAKLDHLLGYCESMQCRRQVLLAGFGETYPKPCGNCDNCLTPA
AAWDATVASQKALSCVYRSGQRFGVGHLIDILRGSENERIKQLGHDQLST
YGIGRDMDERTWRGVFRQLVAASLLEVDSEGHGGLRLTDASRQVLKGERQ
VMMRRENPAAGRERDRGAQRTGLPVQPQDLGLFNALRGLRAELAKEQNVP
AFVIFHDSTLRNIAEQRPTSIDALSRVGGIGGGKLARYGAQLIEIVREQG
>gid:102757  recR  recombination protein RecR
MSSLLEQLIEAFRVLPGVGQKSAQRMAYHVLEREREGGRRLAAALANAVE
KVGHCVQCRDFTESEVCAICANSGRDRQQLCVVESPADRLAIEHATGYRG
VYFILQGRLSPLDGIGPRELGLDRLAERLAAGEVTEMIIATNATVEGEAT
AHYLAQLARQHSVRPSRLAQGMPLGGELEYVDRGTLSHAFGTRSEVL
>gid:105893  rep  ATP-dependent DNA helicase
MHGLNPPQSAAVLHCEGPLLVLAGAGSGKTRVIVEKIAHLIAIGRYPAKR
IAAITFTNKSAKEMRERVAKRIRGDGADGLTICTFHALGLKFLQIEHAAA
GLKRGFSIFDSDDAAAQIKDLMHGAKPDAIEDAKNLISRAKNAGMSPEQA
MAAARSNREKEAASLYERYQARLTTFNAVDFDDLIRLPVQILEANEEIVM
GWRERIGYLLVDECQDTNDAQYRLLKMLAGPRGNFTCVGDDDQSIYAWRG
ANPENLQQMGRDYPALEIIKLEQNYRCSNRVLRAANALIAHNPHEHLKTL
WSDQADGERIRVWECRDSEHEAEKVAAEISFLGTAKQVPWSDFCILFRGN
FQSRPLEKALQLLRVPYHLTGGTAFLERQEVKDVLSWLRLIVNPEDDAAF
LRAVQSPKREVGATSLARLAELASAKSVPMSRAAESMGALQHLPPRAANG
LSAFTDILRDMREHSATLPAGELVRTLADKSGLLNDLRNQSKDEAGFQRR
KRNLDELAEWFEGGPRGASASDLAAQLALLSRNDKDDGGNQVRMMTMHAS
KGLEFRYVFIVGCEDGVLPHEVSLEEGNLQEERRLLYVGITRAKQQLWMS
YSKLTRKFGEHVRLKPSRFFDEIPAAELQRDGADPVADAERKKERANAGL
AAIQALFD
>gid:102344  rhlE  ATP-dependent RNA helicase
MSFESLGLAPFLLRALAEQGYETPTAIQQQAIPLVLAGHDLLAGAQTGTG
KTAAFGLPLLQHLGTTPQPVNGPRKPRALILTPTRELATQVHDSLRGYSK
YLRIPSAVIYGGVGMGNQLDALRRGVDLLIACPGRLIDHIERRSVDLSGI
EVLILDEADRMLDMGFLPSIKRILTKLPRQDRQTLLFSATFEENIKQLAL
EFMRNPMQIQVTPSNTVAESITHRVHPVDGARKRDLLLHLLAQDSREQTL
VFARTKHGSDKLALFLEKSGIKTAAIHGNKSQGQRMRALSDFKAGRVTVL
VATDIAARGIDIDQLPKVINYDLPMVAEDYVHRIGRTGRNGSTGEAISLV
AQDEAKLLRQIVRMLGRDVEIRDVPGYEPQTPIRWGNSAPGRAEQPGGDR
APRKSHARRPHGDAPRQAHAHAGPKKPGGQRSSGPRQATAGAGAGRRDGG
RGGSGRPASRGA
>gid:102179  rhlE  ATP-dependent RNA helicase
MSLKADLLSLQLQPVFESALARAGVRALTPIQVAMIPPMLAARDLIATAQ
TGSGKTLAYALPLLQQRLQAPEQAPRVLGGLILVPTRELVAQVAHTLLSL
AAALPRRLKIVAATGGEAINPQLMALRGGADIVIATPGRLLDLVTHNALR
LSQVSTLVLDEADRLLDLGFGAELDRILALLPAQRQSVLVSATFPAAIAS
LAKRRLRDPLRITLGGTPEQAPAIAQRAIAVDAGQRTQLLRHLLLEHGWP
QLLVFVTSRHGADKVAEKLSKTGIAALPLHGELSQGRRERTLRAFKQADV
QVLVATDLAGRGIDIDALPAVLNYDLPRSTVDYTHRIGRTARAGASGVAI
SFVTADSAQQWRLIEKRQGLRVPTSVIEGFEPTPVQAPAPDHASGAAARA
ADDNGGIKGKRPSKKDKLRAAAQAQAGKPG
>gid:102741  rnhA  ribonuclease H
MKSIEVHTDGSCLGNPGPGGWAALLRYNGREKELAGGEANSTNNRMELMA
AIMALETLTEPCQILLHTDSQYVRQGITEWMPGWVRRGWKTSGGDPVKNR
ELWERLHAATQRHSIEWRWVKGHNGDPDNERVDVLARNQAIAQRGGLATS
>gid:103113  rnhB  ribonuclease HII
MTRSSSDRAIVVPAAQNALFTDSPFPTPESRLIAGVDEAGRGPLAGPVAV
AAVVFDPAKPRINGLDDSKQLSAERREQLYARIVDRALAWSVVLIDSEEI
DRINIYQATMLGMRRAVEGVAHVAGFARIDGNRVPKGLPCPAEALIGGDA
LDRAIMAASIVAKVTRDRLMRELHAQHPQYRFDLHKGYSTPAHLAALQTH
GPCPQHRRSFAPVRRALGLETAQTAWDVPCAPADGLLLAE
>gid:103275  rnt  ribonuclease T
MPMNEPVDLQPSPSLLPMSRRFRGYLPVVVDVETGGFDWNKHALLEIACV
PIEMDAQGHFFPGETASAHLVPAPGLEIDPKSLEITGIVLDHPFRFAKQE
KDALDHVFAPVRAAVKKYGCQRAILVGHNAHFDLNFLNAAVARVGHKRNP
FHPFSVFDTVTLAGVAYGQTVLARAAQAAGLDWNSADAHSAVYDTEQTAR
LFCKIANAWPGPASAG
>gid:104779  ruvA  Holliday junction binding protein, DNA helicase
MIGRLRGILAYKQPPWLVIDVGGVGYELEAPMSTFYDLPDVGRDVILFTH
YAQKEDSVSLYGFLREGERRLFRDVQKVTGIGAKIALAVLSGVTVDEFAR
LITSGDITALTRIPGIGKKTAERMVVELRDRAADFSSGAPITGQLGPDAV
SEATVALQQLGYKPAEAARMAREAGAEGDEVATVIRKALQAALR
>gid:104777  ruvB  Holliday junction binding protein, DNA helicase
MTEQRTIASSATREDEAADASIRPKRLADYLGQQPVRDQMEIYIQAAKAR
GEAMDHVLIFGPPGLGKTTLSHVIANELGVSLRVTSGPVIEKAGDLAALL
TNLQPHDVLFIDEIHRLSPVVEEVLYPAMEDFQIDIMIGDGPAARSIKID
LPPFTLIGATTRAGLLTAPLRDRFGIVQRLEFYSPQELTRIVIRSAAILG
IDCTPDGAAEIARRARGTPRIANRLLRRVRDFAQVKAAGHIDLTVAQAAM
QMLKVDPEGFDELDRRMLRTIVDHFDGGPVGVESLAASLSEERGTLEDVI
EPYLIQQGFLIRTARGRMVTPKAYLHLGLKPPRDSAPAIGEPGDLF
>gid:104780  ruvC  Holliday junction resolvase, endodeoxyribonuclease
MTRILGIDPGSQRTGIGIIDVDESGRSRHVFHAPLVLLGEGDFAQRLKRL
LHGLGELIETYQPQEVAIEKVFMGKSADSALKLGHARGAAICAVVLRDLP
VHEYAATEIKLALVGKGGADKVQVQHMVGIMLNLKGKLQADAADALAVAI
THAHVRATAQRLGVNTQQAWSRKR
>gid:103305  sbcB  exodeoxyribonuclease I
MPDSFLFYDLETFGQDPRRTRIAQFAAVRTDAQLRVIEEPISFFVQPADD
LLPSPYATMVTGITPQHALREGVNEAEAFARIAEQMGRPQTCTLGYNSIR
FDDEFVRCGLFRNFYDPYEREWRGGNSRWDLLDVLRLVHALRPDGIVWPQ
REDGATSFKLEHLADANAVREGDAHEALSDVYATIGMARKFQQSQPKLWD
YALRLRDKRFAASLLDVIAMQPVLHISQRYPATRLCAAAVLPLSRHPRID
SRVIVFDLDGDPDALLRLSPDEIADRLYIRAADLPEGEQRIPLKEVHLNK
APALVAWQHLRSDDFQRLGVDRAAVEAKAARLRELGPELAEKVRQVYGAE
RAGAAAVNDADASLYDGFLAEGDKRLLTQVRSSAPGELGAMEARFRDPRL
IELLFRYRARNWPQTLSPHEHQRWNDYRRQRLLEDRGLGEVTLEQFYAQI
ADLRLAHPDDATKQSLLDQLAAWGSDLQRTL
>gid:105505  smf  DNA processing chain A
MDLTEPDRRALLTLLLAGGRSPPRRALLDAFDAPSQILAAGPAAWRAAGC
DALQIAKLQTPDTPILDAALRWCAQPGHHLIGWRDADYPALLRHIANPPL
MLFVDGDPAALWHPCVAVVGSRAASAGGRDHTRHFAASLANAGLGIVSGM
AAGVDAIAHEAALAHADGITVAVVGTGPDVAYPVQHHSLRDRIAARSAVV
SEYLPGTCAVAAHFPARNRIIAGLALGTLVVEAAMRSGALITARLAAEAG
REVFAVPGSLHNPLARGCHHLIRQGATLVQEPAQLVEGLRLLSGELADAL
RQRLTAPTEQARTVPQPTPRRSDPDYQRLWHALGHDPTPMDSLLERTGLT
AAALSSMLLIMELEGDVVTEHGRYTRNP
>gid:104489  ssb  single-stranded DNA binding protein
MARGINKVILVGNLGNDPDTKYTQAGMAITRVSLATTSMRKDREGNNQER
TEWHRVVFFGKLGEIAGEYLRKGSQVYVEGELRYDKYTGQDGVEKYSTDI
VANEMQMLGGRGEGGGGGGMGGDRPQRTQAPRQQQGGGGGGGGQDYAPRR
QQPAQQQSAPPMDDFADDDIPF
>gid:104504  tag  DNA-3-methyladenine glycosylase I
MSGYCSIAPGHPVHGHYHDHEYGFPQRDERELFERLVLEINQAGLSWETI
LRKRGNFQRAYDGFDVDTVAAYGEAEIARLMQDAGIIRNRLKVLAAIHNA
QVIQRLRATHGSFANWLDAQHPLDKPAWVKVFKKTFRFTGGEITGEFLMS
LGYLRGAHHADCPVFADIQALSPPWMHSA
>gid:105509  topA  DNA topoisomerase I
MPKHLLIVESPAKAKTINKYLGKDFTVLASYGHVRDLVPKEGAVDPDNGF
AMRYDLIEKNEKHVEAIARAAKSADDIYLATDPDREGEAISWHIAEILKE
RGLLKDKTMQRVVFTEITPRAIKEAMLKPRAIAADLVDAQQARRALDYLV
GFNLSPVLWRKVQRGLSAGRVQSPALRMIVEREEEIEAFIAREYWSIDAH
CRHPSQPFNARLIKLDGQKFEQFTVTDGDTAEAARLRIQQAAQGVLHVTD
VASKERKRRPAPPFTTSTLQQEASRKLGFTTRKTMQVAQKLYEGVALGDE
GSVGLISYMRTDSVNLSQDALAEIRDVIARDFGTASLPDQPNAYTTKSKN
AQEAHEAVRPTSALRTPAQVARFLSDDERRLYELIWRRAVACQMIPATLN
TVSVDLSAGSEHVFRASGTTVVVAGFLAVYEEGKDTKSSEDEDEGRKLPL
MKAGDNIPLDRIVTDQHFTQPPPRFTEAALVKALEEYGIGRPSTYASIIQ
TLQFRKYVEMEGRSFRPTDVGRAVSKFLSGHFTRYVDYDFTANLEDDLDA
VSRGEAEWIPLMEKFWGPFKELVEDKKDSLDKTDAGSVRVLGADPVSGKE
VSARIGRFGPMVQIGTVEDEDKPTFASLRPGQSIYSISIEDALELFKMPR
ALGQDKDQDVSVGIGRFGPFARRGSVYASLKKEDDPYTIDLARAVFLIEE
KEEIARNRVIKEFDGSDIQVLNGRFGPYISDGKLNGKIPKDREPASLTFE
EVQQLLADTGKPVRKGFGAKKATLKKNAVKDSAKEAKDAAKKTAAVKKVA
TKTAAKKAPAKKAAKKATKRVVKKAVSKAAG
>gid:104718  umuC  polymerase V subunit
MFALIDGNNFYASCERVFQPELRGRPLVVLSNNDGCAIARSDEAKALGVT
MGQPIHKVPPQIRRRLALRSANFGLYGDIASRIGVILRQAAPRVEVYSID
ESFLDLAGIRDCRQLAVDLRERVHQWTGIPNCIGIAPTKTLAKLANRVAK
DAARKPGSYPASLAGVCDLAALSASELDAVLRATAVGDLWGVGRRWGARL
QARGVFTAADLRDAAADDLLAEFGVVMARTQRELQGHACLQLEEVEPDRQ
QIMVSRSFGTWVTDPQDMAEALATFAMRATEKLRARGLTTCAIGIFAETD
SFKPGVPQHNPSRTAPLASATSDSRVVLTVVRRLLQGFMRDGFTYKKAGV
CLMDLAAPEDLQGDLFTPARIGDDKLMSTLDAINRRFGRGTAGLGASGWQ
KSPEWASRQDLLSGRFTTSLADLPRASC
>gid:105526  ung  uracil-DNA glycosylase
MTEGEGRIQLEPSWKARVGEWLLQPQMQELSAFLRQRKAANARVFPPGPQ
IFAAFDATPFEQVKVVVLGQDPYHGEGQAHGLCFSVLPGVPVPPSLLNIY
KEIQDDLGIPRPDHGYLMPWARQGVLLLNAVLTVEQGRAGAHQNKGWEGF
TDHVVETLNREREGLVFLLWGSYAQSKGKVIDQARHRVFKAPHPSPLSAH
RGFLGCKHFSKTNEHLQRRGLSPIDWSLPSRAALDLSLAGG
>gid:102902  uvrA1  excinuclease ABC subunit A
MAMDFIRIRGARTHNLKNIDLDLPRDKLIVITGLSGSGKSSLAFDTIYAE
GQRRYVESLSAYARQFLSVMEKPDLDHIEGLSPAISIEQKSTSHNPRSTV
GTITEIYDYLRLLYARVGQPRCPDHGFPLEAQTVSQMVDHMLTLDPEQRY
MLLAPVIRDRKGEHAQVFEQLRAQGFVRVRVDGELYEIDAVPPLALRQKH
TIEAVIDRFRPREDIKQRLAESFETALKLGEGMVAVQSLDDATAAPHLFS
SKYSCPVCDYSLPELEPRLFSFNAPVGACPSCDGLGVAEFFDPDRVVVHP
ELSLSAGAVRGWDRRNAYYFQLIASLAKHYKFDVDAVWNTLPAKVRQAVL
FGSGDEVISFTYFTDAGGRTTRKHRFEGILPNLERRYRETESPAVREELT
KYVSQQPCPACNGTRLNRAARNVFVADRPLPELVVLPVNEALNFFRGLSL
PGWRGEIASKIVKEIGERLGFLVDVGLDYLTLERKADTLSGGEAQRIRLA
SQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLTRLRDLGNTVIVVEHDE
DAIRLADHVLDIGPGAGVHGGEICAQGTLQDILESPRSLTGQYLSGKRRI
EIPKQRHKPNPKMMLHLRGATGNNLKNVDLEIPAGLLTCITGVSGSGKST
LINDTLFTLAANEINGASHTVAPHREVENLDLFDKVVDIDQSPIGRTPRS
NPATYTGMFTPLRELFAQVPESRARGYSPGRFSFNVRGGRCEACQGDGMI
KVEMHFLPDVYVPCDVCHGKRYNRETLEIRYKGFNISDVLQMTVEDALRL
FEPVPSIARKLETLVDVGLSYIKLGQSATTLSGGEAQRVKLSKELSRRDT
GRTLYILDEPTTGLHFHDIEALLGVLHKLRDEGNTVVVIEHNLDVIKTAD
WIVDLGPEGGHRGGTILVSGTPEDVAAHKASYTGQFLAKMLPSVKARETR
PAAMANKPDARPPRKVKPEKVAKATKTATKKTAKKKAS
>gid:102842  uvrA2  excinuclease ABC subunit A
MSSSASSPVPGLVRVRGAREHNLKNVDVDIPRDALVVFTGVSGSGKSSLA
FGTLFAEAQRRYLDSISPYARRLIDQVGVPEVDAIDGLPPAVALQQARGA
PSARSSVGSVTTISNSLRMLYSRAGQYPPGQEIIYADGFSPNTPAGACPT
CHGLGRIYDATEASMVPDRSLSIRERAVAAWPGAWHGQNQRDILTTLGID
VDVPWTKLPKKTRDWILYTDEQPVAPVYAGYDLDEVRRALKRKEEPSYMG
TFTSARRYVLHTFAITQSAQMKKRVAQYLISTQCPQCDGKRLRREALSVT
FAGLDIGALSQRPLDEVAELLRPAAEATPATQAAQGKRKRSATAEHPEQV
IAAQRIAEDLRARIAVVQALGLGYLTLERSTPTLSPGELQRLRLATQIRS
QLFGVVYVMDEPSAGLHPADAQALLGALDQLKAAGNSVFVVEHEVDVIRH
ADWIVDVGPAAGVHGGQVLYSGPPAGLEQVDASSTRRYLFGTPPQVHSHA
RDATGWLQLRGITRNNVRALDVDLPLGVFTTVTGVSGSGKSSLVSQALVE
LLAAHLGQTQAEEDEALDPLERGTQVPLGGAIVGGLDQVRRLVRVDQKPI
GRTPRSNLATYTGLFDPVRKLFAATPAARRRRYDPGQFSFNVAKGRCATC
EGEGSVHVELLFMPSVYAPCPTCHGARYNAKTLEIELRGHSIAQVLEMTV
DQAATFFAEDASVLRPLQVLREVGLGYLRLGQPATELSGGEAQRIKLATE
LQRAQRRDTVYVLDEPTTGLHPADVDTLMRQLQGLVAAGNTVIVVEHDMR
VAASSDWVLDMGPGAGGAGGHVVVAGTPDVVARHRGSLTAPFLGALICLM
IEVAVGRQRAV
>gid:104239  uvrB  excinuclease ABC subunit B
MTDRFELVSPYSPAGDQPAAIDKLVANFEAGLAKQTLLGVTGSGKTYTIA
NVVQQVQKPTLVMAPNKTLAAQLYGEFKSFFPNNAVEYFVSYYDYYQPEA
YVPSSDTFIEKDSSINEHIEQMRLSATKTLLSRRDSLVVATVSAIYGLGA
PEDYLSLRLILSIGEHIDQRQLIRHLTDLQYTRNEFELTRGAFRVRGEVL
DVFPAESDTEALRIELFDGDIEQLTLFDPLTGETLRKLQRYTVYPKTHYA
TTRERTLSAVDTIKEELKERLEQLYSQNKLVEAQRLAQRTQFDLEMMAEV
GFCNGIENYSRHLTGKAPGEPPPTLFDYLPPDALLVIDESHVTIPQIGAM
YKGDRSRKETLVEFGFRLPSALDNRPLRFEEWEARSPRSIYVSATPGPYE
LRESAGEITELVVRPTGLIDPVVEIRPVGTQVDDLMSEVHERIKLGDRVL
VTTLTKRMAENLTEYLGEHGIRVRYLHSDIDTVERVEIIRDLRLGKFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSTGSLIQTIGRAARNLRGKA
ILYADKMTRSMQAAIDETDRRREKQVEYNLEHGITPKSVARPISDIMEGA
REDAAEKKAGKGRSKSRQVAEEPADYRAMGPAEIAGKLKALEQKMYQHAK
DLEFEAAAQIRDQILKLKAASLA
>gid:103869  uvrC  excinuclease ABC subunit C
MSARPQADFDGKAFAARLSTAPGVYRMYAADDSLLYVGKAGALRKRVGSY
FNGTPKNARLTSMLSQVARMDVTVTRSEAEALLLENQLIKSLSPRYNVSL
RDDKSYPYVLLTREDWPRIALHRGPRAVNGRYFGPYAGVTAVRETLNLMH
KLFKLRSCEDSVFRNRSRPCLQYQIGRCSAPCVDLVAAQDYQEAVRRATM
FLEGKSDQLGEEIMHSMQQASEALEFERAARLRDLLSSLRSMQNRQYVDG
RAADLDVLACATQSSQACVLLLSFRDGRNLGTRSFFPKTNGEDSAEEILA
AFVSQYYAEHAPPREILLDREIPDAELIEAALSAAAEHKVALKWNVRGER
AGYLLLASRNAQLTLVTELTSQSAQHARSEALREMLGLAEQVKRVECFDI
SHTMGEATVASCVVFDASGPVRGQYRRFNISGITPGDDYAAMRQAIERRF
RRAVEENGVLPDVLLIDGGAGQLAQAQAALADLGIENVLLVGVAKGEERR
AGHEALILADGRELRPGAASPALQFIQQVRDEAHRFAITGHRGRRQKARM
TSKLEDIPGIGPRRRASLLKHFGGLVGLKAAGEAEIARVEGVNAALAARI
YANLHGLALPDAAGESSP
>gid:105780  uvrD  DNA helicase II
MDVSHLLDHLNPAQREAVSAPPGHYLVLAGAGSGKTRVLIHRIAWLNEVQ
GVPNHGIFAVTFTNKAAGEMRHRTDLQLRNGSRGMWIGTFHGLAHRLLRL
HWQDARLPEGFQVMDSDDQLRLVKRVVQSLELDETKYPPKQMGWWINEQK
DEGRRPQHIQPEPNDDWTEVRRQVYAAYQERCDRSGLLDFAELLLRAHEL
LRDTPALLAHYRARFREILVDEFQDTNAIQYAFVRVLAGESGHVFVVGDD
DQAIYGWRGAKVENVQRFLKDFPGAQTVRLEQNYRSSANILGAANAVIAH
NPDRIGKQLWTDSGDGDPIDLYAAYNEVDEARYVVERARQWVRDGGSYGE
VAVLYRSNAQSRALEEALIAEQLPYRVYGGMRFFERAEIKDALAYLRMLT
NRSDDAAFERAVNTPTRGIGDRTLDEVRRLARANALSLWEAAMLCTQENT
LAARARNALATFLSLVGQLQAETGEMDLAERIDHVLMRSGLREHWAKESR
GGLDSESRTENLDELVSVASRFTRPDDEDSQGMTELVAFLAYASLEAGEG
QAQAGEEGVQLMTLHSAKGLEFPIVFLVGLEDGLFPSARSLEESGRLEEE
RRLAYVGITRARQKLVLCYAESRRIHGQDNYNVPSRFLREIPRDLLHEVR
PKVQVSRTASLGAARGGPVHAVVDAAPIKLGANVEHPKFGGGVVVDYEGA
GAHARVQVQFDEVGAKWLVMAYANLTVV
>gid:102356  wxcB  kinase
MIESLNIGALVAALPEKYQPIFAHPELSDGSSRGCEDRLVLIRQCAQRLQ
HALGRPLRVLDLGCAQGFFSLNLAADGHTVHGVDFLDLNVNVCKALAAEN
PACAATFEHGTVEDVIDRLEHDECDLVLGLSVFHHLIHDKGILKVSALCR
KLSETTSAGIYELALREEPLYWAPSLSQDPAELLSSYAFLRLLSQQQTHL
SAVSRPLYFASSRFWYVDGAIGNFTSWSSESHAHGRGTHLQSRRYYFSEQ
SFVKKMTLGVGDRAEINLQEFVNEVEFLGNPPESYPAPRLIASLNDSRDL
FIARSMMNGRLLSQAIDDGAAYDADEIIAQILAQLVLLERAGLYHNDVRC
WNILIAPEGRAVLIDYGAISANPFDCSWLDDLLLSFLITVKEILERKVVP
SSPSREPALDFMTLPARYRNAFIGFFGQNRSPLTFALLQQCLQQADATPH
SAPEWVTIYQRLQKALLGYNARLSAVHIETEHHRVELAARGAAIEHLRDS
TLQDQERTQAFEQGVAAAEERYKRLEEESEKLAAWAKGLEAQTIESNRDK
EALAALNAELESDKAALATRIASLGQELEERQRARELAELLAADVGRLTE
ERDAARSDLLDTQSVVEQHQATITALEARVAVQQQQISGLESSRDQERNR
LRELQVDLSRSMDGTASAREYIRELEMAVDALEGQINSLHGSRSWRVTAP
LRLFTTRVLKRGNADAATIRKVSDARLESPVHTDGVTPTPAEAAMDERLA
AVDQLGSRIRKSLK
>gid:105251  xerC  site-specific recombinase
MPEAAPPVADARGSSPTATTGPGADATLSAVEPFLAHLQIERQVSAHTLD
AYRRDLAALIGWASAQGSEDVAQLDSAQLRKFVTAEHRRGLSPKSLQRRL
SACRSYYAWLLKHGRIATSPAAALRAPKAPRKLPQVLDADEAVRLVEVPT
DAPLGLRDRALLELFYSSGLRLSELCALRWRDLDLDSGLVTVLGKGGKQR
LVPVGSHAVAALRAWQRDSGGSAQTHVFPGRAGGAISQRAVQIRIKQLAV
RQGMFKHVHPHMLRHSFASHILESSGDLRGVQELLGHSDIATTQIYTHLD
FQHLAKVYDAAHPRAKRKKATE
>gid:102408  xerD  integrase/recombinase
MSASSPAERRQRAQQLPPLRAEDDQAIQRFLDRLWAEQGVARQTLDSYRR
DLEGLARWRDGAGGGLQGADRSALFDYLRWRTEARYAPRSNARLLSTLRG
FYALCLRDGVRSDDPTALLDPPRLPRSLPKALTESQIDALLAAPEIGTPL
GLRDRAMLELMYAAGLRVSELVTLPAVAINLRQGVLRVTGKGSKERLVPL
GEESQHWLERYLETARPTLSERKAVPAVDGQVPLFIDAARRPLSRQQFWG
LVKRYAAVAGIDPDTVSPHGLRHSFATHLLNHGADLRALQMLLGHSSLST
TQIYTLVARQHLQTLHARHHPRG
>gid:104057  xseA  exodeoxyribonuclease VII large subunit
MADRTEQILTPSQLNTLARDLLEGSFPLVWVEAELGNVTRPASGHLYFTL
KDARAQIRCAMFKPKSTWLKFQPREGLRVLARGRLTLYEARGDYQLVLDH
MEEAGEGALRRAFEELRARLAAEGVFDAERKQPLPAHVRRLAVITSPSGA
AVRDVLSVLARRFPLLEVDILPSLVQGDSAAAQITSLLQRADASGRYDVI
LITRGGGSLEDLWAFNDERLARAIAAAHTPVVSAVGHETDVSLSDFAADV
RAPTPSVAAELLVPDQRELVARVRRAQARLSQLQQHTLGQAMQHADRLAL
RLRARSPQARLQLLQRRQEDAARHLRARMQHILERLQARVQRAQAGVQSH
SPQRHLAPLQQRLRAAHPQAAMQRRLQQDHLHLRGLVRSLEAVSPLATVA
RGYAIVTRQADGSVVRSAAELTQGDRLRAQLADGSVTVVVDTSETG
>gid:104353  xseB  exodeoxyribonuclease VII small subunit
MAKKSLNESSPVARFEQSLEELEQLVQKMEVGEMSLEQSLTAYERGIGLY
RDCQQALEQAELRVRLVTDPARPEQAEAFEPPSLDGG
>gid:105800  xthA1  exodeoxyribonuclease III
MKIASWNVNSLNVRLPHLQQWLADFAPDVVGIQETKLEDHKFPDAALAAL
GYRSVFCGQKTYNGVAILSRSPALEVQMGIPGFDDVQQRVIAATVDGVRI
INLYVVNGQDVGTDKYAYKLRWLEAVHDWIAQELQRHPQLVVLGDFNIAP
DARDVYEPEVWSDNHILTSTAERGALHKLLALGLHDAFRLHHDDAGHFSW
WDYRQAAYRRNLGLRIDLTLVSDALRARAVEAGIDREPRTWERPSDHAPA
WVRLAEAGA
>gid:103752  xthA2  exodeoxyribonuclease III
MPTTQRTIATYNVNGIASRLPHLLQWLQREQPDIVGLQELKSTQEAFPEQ
AIRDAGYGVIWQGQRSWNGVALLARGAEPVEIRRGLPWDPADTQSRYLEA
AIHGVIVGCLYLPNGNPQPGPKFDYKLKWFQRLLRHAATLVALPHPVALI
GDFNVVPTDAHIYDPKGWRKDALLQPESRAAYAQLLAQGWTDSLQAIHGD
APVYTFWDYFRQHFARDRGLRIDHLLLNRTLAAGLRDAGVGKWVRALEKA
SDHAPTWITVDVPDTDAVPAAAAGPARKRTKVKEAAANAGEKKPSATKKA
VKNAAKTTAVAKTAARKSATKKPVAKKASANTASAAAAAKKATPATTRKA
SKRPKA
>gid:104684  yoaA  ATP-dependent helicase
MSQLATASIEALSEGGALARQLDAFAPRAAQLRLTGAIAEAFEQRDVLLA
EAGTGTGKTYAYLVPALLSGLKTIVSTGTRALQDQLFHRDLPRVRAALGI
GLRSALLKGRANYLCKYRTQQARGEPRFASPEQVTQFQRIVAWSGRTQFG
DMAELDALPDDSPLLPMVTSTVDNCLGTECPFYSECFVVQARQRAQAADL
VVVNHHLLLADLALKQEGFGEILPGAQAFVIDEAHQLPELAANFFGESFG
MRPWQELARDCMVEARLVAGAQASLQAPILALDDALRGLRAGMEGLPPRG
TQWRALAKPQVREGFDAVLSALARLGEALLPLREASPGFDGCTARAQEAL
NRLSRWLGEDVPVPDFEQDLPETVDNDVLWYELSPRGFRCQRTPLDVSGP
LREHREKSQAAWVFTSATLAVGGEFDHIALRLGLNDPITLLQPSPFDWAR
QALCYLPPNLPDPAARGFGTALIAALHPVLEASNGRAFLLFASHRALREA
AEALRDGPWPLFVQGEAPRATLLQRFRTSGNGVLLGSASFREGVDVVGDA
LSVVVIDKLPFAAPDDPVFEARLDAIRRDGGNPFRDEQLPQAVIALKQGV
GRLIRSETDRGVLVLCDPRLLNKGYGRTFLNSLPPFSRTREIDDVRAFFG
SGPETGQAGSEIATLLPD