GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Streptococcus agalactiae 2603V/R, 2603V/R
Gene type: CDS

Number of genes found: 150

# Streptococcus agalactiae 2603V/R, 2603V/R

>gid:113670  SAG0020  recombination protein O
MRVSQTYGLVLYNRNYREDDKLVKIFTETEGKRMFFVKHASKSKFNAVLQ
PLTIAHFILKINDNGLSYIDDYKEVLAFQETNSDLFKLSYASYITSLADV
AISDNVADAQLFIFLKKTLELIEDGLDYEILTNIFEVQLLERFGVALNFH
DCVFCHRVGLPFDFSHKYSGLLCPNHYYKDERRNHLDPNMLYLINRFQSI
QFDDLQTISVKPEMKLKIRQFLDMIYDEYVGIHLKSKKFIDDLSSWGSIM
KSD
>gid:113840  SAG0167  conserved hypothetical protein
MNFEKIETAYELILENIQTIENQLKTHIYDALIEQNSYYLGSSCDLDMVV
VNNQKLRQLDLSQEEWRRTFQFIFIKSAQTEQLQANHQFTPDSIGFILLF
LLEELTSQETVDVLEIGSGTGNLAQTLLNNSSKELNYMGIEVDDLLIDLS
ASIAEIIGSSAQFIQEDAVRPQILKESDVIISDLPVGYYPNDGIAKRYAV
SSSKEHTYAHHLLMEQSLKYLKKDGIAIFLAPENLLTSPQSDLLKEWLKG
YADVIAVLTLPETIFGSRQNAKSIFVLKKQAEQKPETFVYPLTDLQNREN
MANFIENFQKWSRENSHYSKNMIK
>gid:113868  SAG0195  IS1548, transposase
MIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETWKEMED
FIEMNEPLFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTS
LDAVHQLISVDGKTIRGNRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSN
EIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCLAVKGNQE
TLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL
CQNHPKWHKLRGIGMTRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRG
HWQIESMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK
DLSYRRKQRYISVHLEDYLVQLFGERG
>gid:113890  SAG0217  site-specific recombinase, phage integrase family
MSIHKYPSKKAKNGYLYFVKIYMVKDSQRADHIKRGFRTRKEAKDYEARL
IYLKASGKLEEFIKPTHKTYNEIFEKWYQAYQDMVEPTTASRTLDMFRLH
ILPVMGDLPISKISPLDCQNFITDKAKTFKNIKQIKSYTGKVFDFAIKMK
LLKHNPMAEIIMPKRKKTRIENYWTVQELQEFLAIVLQEEPYKHYALFRL
LAYSGLRKGELYALKWADIDFQTETLSVDKSLGRLDGQAIEKGTKNDFSV
RKIKLDSETISILQEWKSISQKEKAQLAVAPLSIEQDFLFTYCTRSGSIE
PLHADYINNVLSRIIRKHGLKKISPHGFRHTHATLMIEIGVDPVNTAKRL
GHASSQMTLDTYSHSTTTGEDRSVKQFADYLKAK
>gid:113949  SAG0261  IS1381, transposase OrfB
MKAQAIVTSQGRIVSLDIAVNYCHDMKLFKMSRRNIGQAAKILADSGYQG
IMKMYSQAQTPRKSSKLKPLTLEDKTYNHTLSKERIKVENIFAKVKTFKI
FSTTYRNRRKRFGLRMNLIAGMINRELGF
>gid:113977  SAG0289  ATP-dependent RNA helicase, DEAD/DEAH box family
MSFKDFNFKPYIQRALDELKFVDPTDVQAKLIPVVRSGRDLVGESKTGSG
KTHTFLLPIFEKLDESSDDVQVVITAPSRELGTQIYQATKQIAEHSEQEI
RVVNYVGGTDKLRQIEKLKVSQPHIVIGTPGRIYDLVKSGDLAIHKAHTF
VVDEADMTLDMGFLDTVDKIAGSLPKDVQILVFSATIPQKLQPFLKKYLT
NPVMEKIKTATVIADTIDNWLLSTKGRDKNAQILELSKLMQPYLAMIFVN
TKERADELHSYLSSNGLKVAKIHGGIAPRERKRIMNQVKNLEFEYIVATD
LAARGIDIEGVSHVINDAIPQDLSFFVHRVGRTGRNGLSGTAITLYQPSD
DSDIRELEKLGINFIPKVIKNGEFQDTYDRDRRNNREKSYQKLDTEMIGL
VKKKKKKIKPGYKKKIQWKVDEKRRKERRASNRAKGRAERKAKKQSF
>gid:113991  SAG0303  conserved hypothetical protein
MKESFKLIATAAAGLEAIVGREIRNLGIDCQVENGRVRFHGDIKTIIETN
LWLRAADRIKIIVGEFPAPTFEELFQGVYGLDWENYLPLGAKFPIAKAKC
VKSKLHNEPSVQAISKKAVAKKLQKVFHRPEGVPLQENGAEFKIEVSILK
DKATVMIDTTGSSLFKRGYRAEKGGAPIKENMAAAIIQLSNWFPDKPLID
PTCGSGTFCIEAAMIGMNIAPGFNRDFAFEAWPWVDQSQVQKVRDEAESK
ANYDIDLDISGFDLDGRMVEIARKNAEEAGLGDVIKLKQMRLQDLKTDKI
NGVIISNPPYGERLLDDKAVDILYNEMGQTFAPLKTWSKFILTSDEGFEK
KYGSQADKKRKLYNGTLKVDLYQYYGERVRRQVK
>gid:114023  SAG0336  helicase, putative
MENYLGRLWTKAQLSEQLRKIAISLPSFIKKGSDYICTRCSSSVAKNCQL
PTGNYYCRECIVFGRVTSNENLYYFPQKTFSKTNSLKWKGELTPYQNEVS
EELLKGISSKENLLVHAVTGAGKTEMIYHSVAKVIDTGGSVCIASPRIDV
CLELYKRLSNDFRCAITLMHGESPSYQRSPLTIATTHQLLKFYHAFDLLI
VDEVDAFPYVDNPILYQGVKQALKENGTSIFLTATSTTELERKVARKELK
KLHLARRFHANPLVIPEMVWVSGIQKSLQTQKLPPKLYQLINKQRQTRYP
LLLFFPHISEGQVFTEILRQAFPMEKIGFVSSKSTSRLKLVQDFRDNKLS
ILVSTTILERGVTFPSVDVFVIQANHHLFTKSSLVQISGRVGRALERPEG
LLYFLHDGKSKSMHQAIKEIKNMNHIGGF
>gid:114121  SAG0421  cell wall surface anchor family protein
MTKKHLKTLALALTTVSVVTYSQEVYGLEREESVKQEQTQSASEDDWFEE
DNERKTNVSKENSTVDETVSDLFSDGNSNNSSSKTESVVSDPKQVPKAKP
EVTQEASNSSNDASKVEVPKQDTASKKETLETSTWEAKDFVTRGDTLVGF
SKSGINKLSQTSHLVLPSHAADGTQLTQVASFAFTPDKKTAIAEYTSRLG
ENGKPSRLDIDQKEIIDEGEIFNAYQLTKLTIPNGYKSIGQDAFVDNKNI
AEVNLPESLETISDYAFAHMSLKQVKLPDNLKVIGELAFFDNQIGGKLYL
PRHLIKLAERAFKSNRIQTVEFLGSKLKVIGEASFQDNNLRNVMLPDGLE
KIESEAFTGNPGDEHYNNQVVLRTRTGQNPHQLATENTYVNPDKSLWRAT
PDMDYTKWLEEDFTYQKNSVTGFSNKGLQKVRRNKNLEIPKQHNGITITE
IGDNAFRNVDFQSKTLRKYDLEEIKLPSTIRKIGAFAFQSNNLKSFEASE
DLEEIKEGAFMNNRIGTLDLKDKLIKIGDAAFHINHIYAIVLPESVQEIG
RSAFRQNGALHLMFIGNKVKTIGEMAFLSNKLESVNLSEQKQLKTIEVQA
FSDNALSEVVLPPNLQTIREEAFKRNHLKEVKGSSTLSQITFNAFDQNDG
DKRFGKKVVVRTHNNSHMLADGERFIIDPDKLSSTMVDLEKVLKIIEGLD
YSTLRQTTQTQFREMTTAGKALLSKSNLRQGEKQKFLQEAQFFLGRVDLD
KAIAKAEKALVTKKATKNGHLLERSINKAVLAYNNSAIKKANVKRLEKEL
DLLTDLVEGKGPLAQATMVQGVYLLKTPLPLPEYYIGLNVYFDKSGKLIY
ALDMSDTIGEGQKDAYGNPILNVDEDNEGYHTLAVATLADYEGLYIKDIL
NSSLDKIKAIRQIPLAKYHRLGIFQAIRNAAAEADRLLPKTPKGYLNEVP
NYRKKQVEKNLKPVDYKTPIFNKALPNEKVDGDRAAKGHNINAETNNSVA
VTPIRSEQQLHKSQSDVNLPQTSSKNNFIYEILGYVSLCLLFLVTAGKKG
KRARK
>gid:114134  SAG0435  DNA-damage-inducible protein J, putative
MSTVAVRVDDQLKDDATELFQSLGLDMSTAVKMFLIQSVKTQSIPFEIKN
KSSVSDEEFQNLVETKLKGIRVKASDPESVNAFFGDEDFSEYEEYFK
>gid:114139  SAG0441  conserved domain protein
MAKRIKPNKKRELLKAKGINWKDVRLSQTLQEVALPKKTAPEAPKAVDLL
EGLTANKQDVLKEAGLVSLEAFAKVSEADVLALKGIGPAAIKQLVDNGVV
FAK
>gid:114146  SAG0448  transposase, IS256 family
MTQFTTELLNFLAQKQDIDEFFRSSLETAMNDLLQVELSAFLGYEPYDKA
GYNTGNSRNGAYTRRFETKYGVVNLLIPRDRNGEFSPALIPSYGRRDNHL
EEMVIKLYRTGVTTREISDIIERMYGHHYSPATVSNISKATQENVASFHE
RSLEANYTVLYLDGTYLPLRRGTVSKECIHIALGVTSYGHKAILGYDIAP
NENNASWSDLLERFKGQGVQQVSLVVSDGFNGLDQLIQQAFPMAKQQRCL
VHIGRNIASKVKRADRALILEQFKTIYRAINVEEAKQALDSFINEWKPHY
KKVIETLESIENLLIFYEFPHQIWGSIYSTNLIESLNKEIKRQTKKKVVF
PNEESLERYLVTLFSDYNFKQGQRIHKGFGQCTDTLESLFD
>gid:114150  SAG0452  type II DNA modification methyltransferase, putative
MRVVAGTFGGRPLKTLDGKTTRPTTDKVKGAIFNMIGPFFEGGRVLDLFS
GSGSLAIEAISRGMDQAVLVEKDRRAQVVIQENIAMTKSPEQFQLLKMEA
NRALEQLTGQFDLVLLDPPYAKEEIVKQIQIMDSKGLLGDDIMIACETDK
SVDLPEEIASFGIWKQKIYGISKVTVYVR
>gid:114184  SAG0487  MutT/nudix family protein
MTNPTFGEKIDNVNYRSRFGVYAIIPNPTHDKIILVQAPNGAWFLPGGEI
EENENHLEALTRELIEELGYSATIGHYYGQADEYFYSRHRDTYYYNPAYI
YEVTAYHKDQAPLEDFNHLAWFPIQEAKEKLKRGSHRWGVQAWEKNHHSR
K
>gid:114220  SAG0524  DNA polymerase III, epsilon subunit/ATP-dependent helicase DinG
MFCFIDIACYNRLTMTQKKLRKYAVVDLEATGAGPNASIIQVGIVIIQGN
KIIDSYETDVNPHESLDEHIVHLTGITDKQLAKAPDFGQVAHHIYQLIED
CIFVAHNVKFDANLLAEQLFLEGCELRTPRIDTVELSQVFYPCLEKYSLG
ALAESLNIELTDAHTAIADARATAQLFIKLKAKISSLPKEVLETILTFAD
NLLFESYLLIEEAYQEADFVNPKEYYFWQGLVLKKEKAVGKPKKLSSDFQ
VNMALLGMDARPKQVVFADLVKAHFNDQTTTFLEAQPGLGKTYGYLLPLL
DQSQKQQIIVSVPTKILQDQIMAKEIKHIQELFHIPCHSIKGPRNYLKLD
AFYKSLQVQDRNRLINRFKMQLLVWLTETTTGDLDEIKQKQRLESYFDQL
KHDGEVTQSSLFYDLDFWKRSYDKVAQSQLVIINHAYFLERVQDDKDFAK
GKVLVFDEAQKLVLGLENFSRGQLDISHQLQVIQKIIDSSIPLLQKRLLE
SISYELSHAVELFYRHNSFEFSETWLKRLKNSINALEVVGLDELQTFFTA
TYTNYWFETDKVNEKRLTILRGAREDFLKFSKFLPPTKKTYMISATLQIS
PKVYLSDLLGGFSSISTEKIAHEKNANQKVWIDTSMPNILDLSPEQYAYE
IAKRLQDIMTLKQPTLVLLTSKQTMFMVSDYLDKWEIKHLTQDKNGLAYN
VKKRFDRGESNLLLGTGSFWEGVDFVHRDRLIEVITRLPFDTPKDYFIQK
LSQSLTKEGKNFFYDYSLPMTVLKLKQALGRTTRREEQKSAVIILDSRLV
IKSYGQTIMHSLGRDFEISKEKINKVLTEMAKFLI
>gid:114239  SAG0543  IS1381, transposase OrfB
MKAQAIVTSQGRIVSLDIAVNYCHDMKLFKMSRRNIGQAAKILADSGYQG
IMKMYSQAQTPRKSSKLKPLTLEDKTYNHTLSKERIKVENIFAKVKTFKI
FSTTYRNRRKRFGLRMNLIAGMINRELGF
>gid:114243  SAG0545  prophage LambdaSa1, site-specific recombinase, phage integrase family
MWIEELANGKFKYIERYTDPLTNKYKKVSVTLDKNSSQAQKKAGLILQEK
IEDRLAIRNHSEMTYGELKKEYLKQWIPTVKDSTKRGYLVSDSHIATVLP
DDTIINKLTKRDIRLIIDKLLKHNSYHVTHKCRKRLHAIFSYAIQMDYMT
SNPTENVLVPKPKDDYKPEKVLYLTSNEVYDLCNRMIDNDEQTLADIVLF
MFLTGVRYGELSCLTYDKIDFENKEILINATYDFNTREITTTKTKKSTRK
ISVSDNILDIVNRQKKTSSFVFPNSNGVPILNAYINKRLKIYGDYHTHLF
RHSHISFLAEKGIPLNAIMDRVGHSDPKTTLSIYSHTTVNMKEIINKQTA
PFVPLLKSE
>gid:114257  SAG0559  conserved hypothetical protein
MADNKKYYYLKLKENFFESDEAIILESMPDGYIYSNILLKLYLRSLKNDG
LLMFNNLIPYNAQMLATITRHQVGTVEKAIQIFRDLQLIEILDNGAIYMT
NIQNFVGKSSTEADRIRKLRAKNNSGVQMLYKCTPEIEIEKDKKIDINID
KELELEQDKEDRFVDVVEANLGRGLVKFEFDMINDYLIGQNVSKDLFLEA
VKVAVANNVRKFNYIARILDNWINDGIKTPEQAYQAQRDFKAKKANKTMQ
SQSNVPSWSNPDYKGPDLKEFALGSIDDIEDGSGDF
>gid:114265  SAG0567  prophage LambdaSa1, reverse transcriptase/maturase family protein
MIAQKAINIFEKVQVFQRKIYLSTKADNKRKFGVLYDKVYRKDILKVAWF
YVKRNKGSAGIDDFTIEEIEAYGVQKFLDEIEDQLRNKKYQPKAVKRVYI
PKANGKKRPLGIPTVRDRVVQTAVKIVIEPIFEADFQEFSYGFRPKRSAN
QAIREIYKYLNYGCEWVIDADLKGYFDTIPHDKLLLLVKERVTDKSIIKL
LSLWLEAGIMEDNQVRSNILGTPQGGVISPLLANIYLNALDRYWKNNRLE
GRGHDAHLIRYADDFVILCSNNPKKYYQYAKQRIDKLGLTLNEEKTRIVH
ATEGFDFLGYTLRKSKSHKSGKYKTYYYPSRKSMKSIKGKVKDVIQTGQH
LNLPDVMERLNPMLRGWANYFKAGNSKQHFKSIDNYVIYNLTIMLRKKHK
KSGKGWREHPPSWYYNYFGLVCLRKLSTNINDDSQRYGR
>gid:114267  SAG0569  conserved hypothetical protein
MTFKTEFEIPIEPKPQTRPKFSKFGTYEDPKMKRWRKEVSGWIEKNYDGP
FFDDCIKVEVTFYMKAPKTLSKEPTQRSKGKTIQIYQNFVRELIWHAKKP
DIDNLIKAVFDSISDAGYDRIQKSGIVWSDDNIVCDLRAKKKYSQNPRIK
VRIEEIDR
>gid:114299  SAG0602  conserved hypothetical protein
MTVEQAERIAQSQFVWAILFILLFMIVVGYLVRTSDKREKKLMDFHDQSK
SESNKREEWLKGHLDKNTEQLQDISQTIGVVQKEMSYMSDRIGRLEKEEK
>gid:114306  SAG0610  conserved hypothetical protein
MIYSTLAKEQGVQGYLDGKGSLRDICKWYDISSRSVLQKWIKRYTSGEDL
KATSRGYSRMKQGRQATFEERVEIVNYTIAHGKDYQAAIEKFGVSYQQIY
SWVRKLEKNGSQGLVDRRVKGLESRPDLTEIEQL
>gid:114320  SAG0626  MutT/nudix family protein
MEIWDAYDQYGFITGLTLNRDQNIPQGLFHLVVDVILFHEDGDVLMMKRH
PKKKAFPAYFEATAGGSALKGENAKQAILRELKEETGIVPQCLTFLNREW
FSERSYFVDHFIAKYNGAKDIITLQEGETVDYIWLKPEYIDLFLSKNKLI
PSQIKLLKSLI
>gid:114331  SAG0639  transposase OrfB, IS3 family
MCRVLRVNRSTYYKFLKHKPSKRELDNQIYKKQILEIYTKANKRLGVKSI
KVILQRDYDTKISEGRIYHLMKNMALPKMATVKTKTALKKTQKTYPQNLL
NQKFYPDKPNQVWSTDFTYISIGYKKYVYLCAILDLYSRKCIAWKLSHRM
DAKLACNTLELALNKRKIEGTLLFHSDQGSQFKAREFRKIIDDNNIMHSF
SKPGYPYDNAVTEAFFKYLKHRQINRKHYQNIKQVQLDCFEYIENFYNNY
IPHTANLGLTPNQKEENYFNAIK
>gid:114332  SAG0640  transposase OrfA, IS3 family
MSTFKRYDEEFKQSLVNLYQTGKTQSELCKDYRVSTSALAKWIKQYSQFK
LEDNSVLTAKQIQELQKRNAQLEEENLILKKASAIFMQNLK
>gid:114365  SAG0677  hypothetical protein
MDNNKPYLSPKDKTTVEKLEDRWKKITFKVQDTGIGLKDVYLQSVKYVGG
GNNNLDLITPPGFKKEDKKVEKPKLDRPPGIDLPAPTSMRSFDYSTPPGT
KPSKPKDSLSTPPGFPDLNTPPDEAPKDSKKDAIEDKSGAIKYAKSLQLS
FVDGPILASKVNGKILQVESDGKLVIPRNALSANQFDDTSLKIYRNNNRN
KEITITTDYFADTKYVNITAVDYLSNTTFEQLATGETVDYHAIVFSSFAA
IKDKGGKIYVNDKLQETSRIALKDKSVKIGIELPNDVRHIDSLSVRRLNE
VKTVDNILKNDEQDINLSKTYQLKYNPTNRRLEFTINNINSSSEIMTTFK
DGKMPELVEQKDVSLDINDMDMSKFKTIRLGRKDSEFKGQLIAKTGTVEL
DMFFKQSQDPASIIKKIYLIQNGVPNELKKFDSSFGLTESQIDGYYIYKD
AINLKFKLTSGASLKVVYKGQEDPYSHQKEDMTKKGEQLSHSTQANENTA
KVTFANIDWSHYSKVTVNGKEVVKGSELPLTKGWTTFVLHKTENSLNVKS
LIMETGSVSKKVQQLPLSPRLSKNKHMRDMLLTMQKDSAYYETSDSLVLR
INLTADTKLNFNAVKGASALTENMMMRQFAVAGPQDDPVSEHKYPSVFLL
TPALLETASEATLNGKEITASGIIGHIKDGDKSKHVEVKMVNENGDMLGT
PVIIQGKDLTNRTKPLMSGRRVLYAGKQYEFRAKLPLSRFNTWIRVEVVT
EAGEKASIVRRMFFDQSVPELNTAVAKRDLTSDTALIHIVAKDDSLKLKL
YQDDSLLESVDKTGLYSFRNGVEITKDMTVPLEFGDNIIKLSAVDLSNYR
RNETLHIYRNRFDVKASQMTADKGAKVTVDMLMKHLVVPEMAGAYTLTID
EAPNTNESGMLTNAKVSIHYVNGGVDKVDVPIKVVDLEAIRKAEEARKAE
EARKAEEARKAEEGHKTQEAPIVEEGYKVNNVHQTDTTVKASDLPKTKTV
SAVHMARTDNKQITSHQTHVEKQIKNTLPSTGDSKRGYYITGMAIVMLSV
LFSLAKKFKSKY
>gid:114379  SAG0693  IS1548, transposase
MIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETWKEMED
FIEMNEPLFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTS
LDAVHQLISVDGKTIRGNRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSN
EIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCLAVKGNQE
TLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL
CQNHPKWHKLRGIGMTRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRG
HWQIESMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK
DLSYRRKQRYISVHLEDYLVQLFGERG
>gid:114445  SAG0760  IS1548, transposase
MIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETWKEMED
FIEMNEPLFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTS
LDAVHQLISVDGKTIRGNRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSN
EIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCLAVKGNQE
TLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL
CQNHPKWHKLRGIGMTRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRG
HWQIESMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK
DLSYRRKQRYISVHLEDYLVQLFGERG
>gid:114462  SAG0777  ATP-dependent RNA helicase, DEAD/DEAH box family
MKFTELNLSQDILSAVEKAGFVEPSPIQEMTIPLALEGKDVIGQAQTGTG
KTAAFGLPTLNKIHTEDNTIQALIIAPTRELAVQSQEELFRFGRDKGVKV
RSVYGGSSIEKQIKALRSGAHVVVGTPGRLLDLIKRKALKLNHIETLILD
EADEMLNMGFLEDIEAIISRVPETRQTLLFSATMPDPIKRIGVKFMKDPE
HVKIKATELTNVNVDQYYVRVKENEKFDTMTRLMDVDQPELSIVFGRTKR
RVDELTRGLKLRGFRAEGIHGDLDQNKRLRVIRDFKNDHIDILVATDVAA
RGLDISGVTHVYNYDIPQDPESYVHRIGRTGRAGKSGQSITFVSPNEMGY
LTIIENLTKKRMTGMKPATASEAFQAKKKVALKRIARDFEDQELVSKFDK
FKADALELATQYTPEELALYVLSLTVQDPESLPEVEITREKPLPFKPSGG
GFKGKGGRGNGRGGDRRRNDRGDRRGNRDRDDRGSRCDFKRRDDKFKKDN
RRQENKKPHKNTSSEKQTGFVIRNKGDK
>gid:114463  SAG0778  conserved hypothetical protein
MIKVPAYMYVLECSDGTLYTGYTTDVKRRLNTHNTGKGAKYTRARLPVKL
LYSEAFNSKQEAMRAEALFKQKTRQAKLTYIKQHKNEQ
>gid:114472  SAG0787  DNA polymerase III, delta subunit, putative
MIAIEEIGRITPDNLGLVTVLAGEDLGQYAQMKEKLFQVIGFNKDDLAYS
YFDLSEEDYQNAELDLESLPFLSDYKVVIFDQFQDITTDKKTYLDEQAMK
RFEAYLQNPVDTTRLVICAPGKLDGKRRLVKLLKRDARVLEANTLKESDL
KTYFQKYAHQEGLVFEAGVFDELLIKSNYDFSDTLTNIAFLKSYKTDGHI
SSNDVREAIPKSLQDNIFDLTQDVLLGRIDLARDLVRDLRLQGEDEIKLI
AIMLGQFRMFLQVKILASKGKSESQIVSELSHYIGRKINPYQVKFAVRDS
RNLPLAFLKEAIRILIETDYAIKRGTYDKDYLFDLALLKIAHKKP
>gid:114510  SAG0825  ATP-dependent RNA helicase, DEAD/DEAH box family
MITKFPDQWQDKLTQRQFDDLTDIQNKLFQPITDGDNILGISPTGTGKTL
AYLFPTLLKLQPKKSQQLLILAPNSELAGQIFDVTKEWAEPLGLTAQLFL
SGSSQKRQIERLKKGPEILIGTAGRVFELVKLKKIKMMNINTIVLDEFDE
LLGDSQYHFVDNIINRVPRDQQMIYISATNKLDNSKLADNTITIDLSNQK
LDTIKHYYITVDKRERTDLLRKFSNIPDFRGLVFFNSLSDLGACEERLQF
NRASAVSLASDINIKFRKVILEKFKNHDISLLLGTDLVARGIDIDNLEYV
INFDIARDKETYTHRSGRTGRMGKEGCVITFVTHKEELKQLKKYATVTEL
VLHNQKLHLK
>gid:114517  SAG0832  protein of unknown function
MKKQFLKSAAILSLAVTAVSTSQPVGAIVGKDETKLRQQLGYIDSKKSGK
KIDERWGEKIYNYLSYELIEANEWINRSEFQEPEYRTILSEFKDKIDSIE
YYLINLSNIAKEDAHQRNILQSLDKYEKSGIYNLDQGVYNYIYQEISSAK
HKFSDGVDKIYRLDSTLFPFSVWYDKHLDNNDNYKDNKDFKEYIALLNEI
TRKARLGYQIVNNHKDGEHKDEAEILDILIRDITFVSKDAPGYKYIPNKR
IAAKIIEDLDGIINDFFKNTGKDKPSLEKLKDTEFHKKYLNSTEPYSIET
NLPSNYKELKEKQIKKLEYGYKKSSKIYTSAHYALYSEEIDAAKELLQKV
KIAKDNYNEIKSMNLSPSIFNQYLQLLQIVISSEINLKKALDNTVDLPIE
NNFNTLDIQYNKLDTAIKSLRKFVTKYKQEVRKATKSYSKKELVNAELTK
VISNDNILLDMQAISSNYGSTKKFVYSVKRLPYVPQVIMTTTSNVLMPQK
QVEKVKLLTPFTISNKEVLNHDSLVENDAQKQKVEQEKTKSLAPQKGAVK
EQTEQKVSGNTQEIEKKSETVATPQQSSVAQTSVQQPAPVQSVVQESKAS
QEEINAAHDAISAYKSTVNIANTAGVTTAEMTTLINTQTSNLSDVEKALG
NNKVNNGAVNVLREDTARLENMIWNRAYQAIEEFNVARNTYNNQIKTETV
PVDNDIEAILAGSQAKISHLDNRIGARHMDQAFVASLLEVTEMSKSISSR
IKE
>gid:114579  SAG0895  lipoyl-binding domain protein
MAGWRTVVVNTHSKLSYKNNHLIFKDSYQTEMIHLSEIDILIMETTDIVL
STMLIKRLVDENILVIFCDDKRLPTAMLMPYYARHDSSLQLSRQMSWIED
VKADVWTSIIAQKILNQSFYLGECSFFEKSQSIMNLYHDLEPFDPSNREG
HAARIYFNTLFGNDFSREQDNPINAGLDYGYSLLLSMFAREVVKCGCMTQ
FGLKHANQFNQFNLASDIMEPFRPIVDRIIYENRQSDFVKMKRELFSMFS
ETYSYNGKEMYLSNIVSDYTKKVIKSLNSDGNGIPEFRI
>gid:114599  SAG0915  Tn916, transposase
MSEKRRDNKGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRV
PAGKRDCISLREKIAELQKDIHDGIDVVGKKMTLCQLYAKQNAQRPKVRK
NTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSENGYAYQTINN
YKRSLKASFYIAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLL
AFAKADKTYSKNYDEILILLKTGLRISEFGGLTLPDLDFENRLVNIDHQL
LRDTEIGYYIETPKTKSGERQVPMVEEAYQAFKRVLANRKNDKRVEIDGY
SDFLFLNRKNYPKVASDYNGMMKGLVKKYNKYNEDKLPHITPHSLRHTFC
TNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQ
ERLVA
>gid:114615  SAG0932  Tn916, transcriptional regulator, putative
MEGFLLNEQTWLQHLKEKRLAYGLSQNRLAVATGITRQYLSDIETGKVKP
SEDLQQSLWEALERFNPDAPLEMLFDYVRIRFPTTDVQQVVENILQLKLS
YFLHEDYGFYSYSEHYALGDIFVLCSHELDKGVLVELKGRGCRQFESYLL
AQQRSWYEFFMDVLVAGGVMKRLDLAINDKTGILNIPVLTEKCQQEECIS
VFRSFKSYRSGELVRKEEKECMGNTLYIGSLQSEVYFCIYEKDYEQYKKN
DIPIEDAEVKNRFEIRLKNERAYYAVRDLLVYDNPEHTAFKIINRYIRFV
DKDDSKPRSDWKLNEEWAWFIGNNRERLKLTTKPEPYSFQRTLNWLSHQV
APTLKVAIKLDEINQTQVVKDILDHAKLTDRHKQILKQQSVKEQDVITTK
K
>gid:114627  SAG0945  IS1548, transposase
MIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETWKEMED
FIEMNEPLFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTS
LDAVHQLISVDGKTIRGNRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSN
EIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCLAVKGNQE
TLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL
CQNHPKWHKLRGIGMTRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRG
HWQIESMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK
DLSYRRKQRYISVHLEDYLVQLFGERG
>gid:114648  SAG0966  IS1381, transposase OrfB
MKAQAIVTSQGRIVSLDIAVNYCHDMKLFKMSRRNIGQAAKILADSGYQG
IMKMYSQAQTPRKSSKLKPLTLEDKTYNHTLSKERIKVENIFAKVKTFKI
FSTTYRNRRKRFGLRMNLIAGMINRELGF
>gid:114659  SAG0978  site-specific recombinase, phage integrase family
MKRELLLEKIDELKEIMPWYVLEYYQSKLSVPYSFTTLYEYLKEYRRFLE
WLLDSGVANCHHIAEIELSVLENLTKKDMEAFILYLRERPLLNANTRQNG
VSQTTINRTLSALSSLFKYLTEEVENADGEPYFYRNVMKKVSTKKKKETL
ASRAENIKQKLFLGNETIEFLEYIDCEYQNKLSKRALAFFNKNKERDLAI
IALLLASGVRLSEAVNLDLKDINLNVMVIDVTRKGGKRDSVNVASFAKPY
LANYLDIRKNRYKAENQDIALFLSEYRGVPNRIDASSVEKMVAKYSQDFK
VRVTPHKLRHTLATRLYDATKSQVLVSHQLGHASTQVTDLYTHIVNDEQK
NALDKL
>gid:114711  SAG1031  conserved domain protein
MTERTFEDIELDLKLFQIKLDNAENSKRLLQKLKNDVMELQIELLESLKL
GDAYLTESEELEENNDFILTVNSETLSLEESYDNRINLVSKEIMDYENAL
DKLYYEKQSLMQKSNERKGG
>gid:114746  SAG1067  IS861, transposase OrfA
MKLSYEDKLEIYELRKIGMSWSQISQRYDVRISNLKYMIKLMDRYGVEIV
EKGRNEYYPPELKQEMIDKVLIHGCSQLSVSLDYALSNCSILTNWLSQFK
KNGYTIVEKTRGRPSKMGRKRKKTWEEMTELERLQEENERLRTENAFLKK
LRDLRLRDEALQSERQKQLEKWSQEDSD
>gid:114747  SAG1068  IS861, transposase OrfB
MVTGGFRLDLLLEITKIARATYYYQLKKLNKPDKDKAIKSDIQSIYDEHR
GNYGYRRIYLELRNRGFVINHKRVQGLMKSMGLTARIRRKRKYASYKGEV
GKKADNLIQRQFEGSKPYEKCYTDVTEFALPEGKLYLSPVLDGYNSEIID
FTLSRSPDLKQVQTMLERAFPAASYSETILHSDQGWQYQHKSYHQFLEDK
GIRPSMSRKGNSPDNGMMESFFGILKSEMFYGLEKSYKSLDDLEQAITDY
IFYYNNKRIKAKLKGLSPVQYRTKSFT
>gid:114811  SAG1132  conserved hypothetical protein
MDLFLSLERKFKAASDKEVSKQQEAYLRHHFKCYGIKSPERRMLYKELIK
AAKRQAKIDWQLLDKCWQSDYREYHHFVLDYLLAMSQFLTYNDCSRLEFY
ARHQQWWDSIDVLTKIFGNLSLKDDKVMNLLSEWSLDQDFWMRRLAIEHQ
LGFKEKTNTDILSLFILRNTGSQEFFINKAIGWALRDYSKYNKVWVKDFI
SNHCDELSTLSIREGSKYL
>gid:114877  SAG1195  MutT/nudix family protein
MKNRDFRVREQLETFGVRVSALIIENQKLLLIYAPHLDKYYLPGGALQVG
EDSNKAVAREVLEEIGLHSQVGDLAYIIENQFNIKRHHYHSVEFLYFVNL
LGQAPESIKEGTHKRHFVWLPIKELTKIDCNPNFLAQDLIEWPGHVVHKI
VQN
>gid:114886  SAG1204  DNA replication protein DnaD, putative
MTYLEQYQSGQLTLPSALFFHFKSIFKTADDFLVWQFFYLQNTTNLSDLT
PSRIATSLDKTVADINRSISNLTSQGLLDVKTIELNHEIEIIFDTSPVFA
KLDKLFEEDNQVIIDNKTSDSNRLKDLVGDFERELGRLLSPFELEDLQKT
LQEDQTDPDIVRAALREAVFNGKTSWNYINAILRNWRREGLTTLRQIEER
KQAREDNQMKDLAISDDFKNAMNLWE
>gid:114899  SAG1218  conserved hypothetical protein
MTDLEKIIKAIKSDSQNQNYTENGIDPLFAAPKTARINIVGQAPGLKTQE
ARLYWKDKSGDRLRQWLGVDEETFYHSGKFAVLPLDFYYPGKGKSGDLSP
RKGFAEKWHPLILKEMPNVQLTLLVGQYTQKYYLGSSAHKNLTETVKAYK
DYLPDYLPLVHPSPRNQIWLKKNPWFEKDLIVDLQKIVADILKD
>gid:114908  SAG1228  ISSdy1, transposase OrfA
MSRKVRRHFTDDFKQQIVDLYNVGRKRSSLIKVYELTPSTFDKWVRQAKT
TGSFKSIDNLTDEQRELIELRKHNKELEMQLDILKQAAVIMAQKGK
>gid:114909  SAG1229  ISSdy1, transposase OrfB
MCRWLNMPHSSYYYQAVESVSETEFEETIKRIFLDSESRYGSRKIKICLN
NEGITLSRRRIRRIMKRLNLVSVYQKATFKPHSRGKNEAPIPNHLDRQFK
QERPLQALVTDLTYVRVGNRWAYVCLIIDLYNREIIGLSLGWHKTAELVK
QAIQSIPYALTKVKMFHSDRGKEFDNQLIDEILEAFGITRSLSQAGCPYD
NAVAESTYRAFKIEFVYQETFQLLEELALKTKDYVHWWNYHRIHGSLNYQ
TPMTKRLIA
>gid:114913  SAG1235  GBSi1, group II intron, maturase
MSELLDKILSRNNMLEAYKQVKSNKGSAGINGVTIEQMDDYLHQNWRETK
QLIKERSYKPQPVLRVEIPKPNGGVRNLGIPTAMDRMIQQAIVQVLSPLC
EKHFSEYSYGFRPNRSCETAIVQLLEYLNDGYEWIVDIDLEKFFDTVPQD
RLMSLVHNIIQDGDTESLIRKYLHSGVVINGQRHKTLVGTPQGGNLSPLL
SNIMLNELDKGLEKRGLRFVRYADDCVITVGSEAAAKRVMHSVSSYIEKR
LGLKVNMTKTKIVRPNKLKYLGFGFWKSPKGWKCRPHQDSVQSFKRKLKQ
LTMRKWSIDLITRIERLNWVIRGWINYFSLGNMKSIMTQIDERLRTRIRV
IIWKQWKKKAKRLWGLLKLGVARWIADKVSGWGDHYQLVAQKSVLKRAIS
KPALAKRGLVSCLDYYLERHALKVS
>gid:114917  SAG1241  transposase OrfA, IS3 family
MSTFKRYDEEFKQSLVNLYQTGKTQSELCKDYGVSTSTLAKWIKQYSQVK
LEDNSVLTAKQIQELQKRNAQLEEENLILKKASAIFMQNLK
>gid:114918  SAG1243  ISSdy1, transposase OrfA
MSRKVRRHFTDDFKQQIVDLYNVGRKRSSLIKVYELTPSTFDKWVRQAKT
TGSFKSIDNLTDEQRELIELRKHNKELEMQLDILKQAVVIMAQKGK
>gid:114919  SAG1244  ISSdy1, transposase OrfB
MCRWLNMPHSSYYYQAVESVSETEFEETIKRIFLDSESRYGSRKIKICLN
NEGITLSRRRIRRIMKRLNLVSVYQKATFKPHSRGKNEAPIPNHLDRQFK
QERPLQALVTDLTYVRVGNRWAYVCLIIDLYNREIIGLSLGWHKTAELVK
QAIQSIPYALTKVKMFHSDRSKEFDNQLIDEILEAFGITRSLSQAGCPYD
NAVAESTYRAFKIEFVYQETFQLLEELALKTKDYVHWWNYHRIHGSLNYQ
TPMTKRLIA
>gid:114922  SAG1247  site-specific recombinase, phage integrase family
MATKSSKKYKGVYCDNKGKIFYQIDLGIDPVTGKRVQKKARKNQYGKTFE
TMKEAYDELVRIKYEFANKVSLENYNMTFENYMNKIYLRAYKQKVQSVTY
KTALPHHKLFIQYFGLKPLKAITPRDCEAFRLHIIENYSENYAKNLWSRF
KACMGYAERLGYISNMPCKALDNPRGKHPETPFWTYAEFQTFIKSFDLHD
YEELQRFTAIWLYYMTGVRVSEGLSLCWEDIDFDKKFLKVHTTLEKDENG
NWYRKDQTKTPAGERLIELDDITIEVLQVWRKNQFANQDTDFIISRFGDP
FCKSTICRIIKRKAQQVGVPVITGKGLRHSHASYLINVLKKDILYVARRM
GHADKSTTLNTYSHWFNALDKTVSEEITQNIKSAGLDSILCQNSCQTSN
>gid:114928  SAG1253  transposase, ISL3 family
MSDLLSLPDIKTIEPPQENETDMMFKVEAVGPPERCPECGFDKLYKHSSR
NQLIMDLPIRLKRVGLHLNRRRYKCRECGSTISVDEKRSMTKRLLKSIQE
QSMSKTFVEVAESVGVDEKTIRNVFKDYVALKEREYQFETPKWLGIDEIH
IIRRPRLVLTNIERRTIYDIKPNRNKETVIQRLSEISDRTYIEYVTMDMW
KPYKDAVNTILPQAKVVVDKFHVVRMANQALDNVRKSLKAHMSQKERRTL
MRERFILLKRKHDLNERESFLLDTWLGNLPALKEAYELKEEFYWIWDTPD
PDEGHLRYSQWRHRCMSSNSKDAYKDLVRAVDNWHVEIFNYFDKRLTNAY
TESINSIIRQVERMGRGYSFDALRAKILFNEKLHKKRKPRFNSSAFNKAM
LYDTFNWYEVNDHDITDNLGVDFSTLIKNLEKGDL
>gid:114943  SAG1270  ImpB/MucB/SamB family protein
MGYVDYSKEPRSDIAFVDMKSFYASIECVERGLHPLHTSLCVMSRADNSA
GLILASSPMFKKVFGKGNVGRAYDLPFDVHTRKFNYYRAKISGLPTDAKF
VSFIENWAKRTFIVPPRMDLYIQKNLEIQKVFQNYADPTDILPYSIDEGF
IDLTSSLNYFVEDKSLSRKDKLDVVSAKIQHDIWEKTGVYSTVGMSNANP
LLAKLALDNEAKTTATMRANWSYEDVETKVWNIPKMTDFWGIGSRTEKRL
NKLGIYSIKELANCDPTILKKEFGVIGVQHWFHANGIDESNVHEPYRPKA
VGIGNSQVLHKDYTRQSDIELVLREMAEQVAIRLRRRHKKATVVAINVGY
SNFENKKSINVQRKINPNNRTLVFQDEVVSLFRSKYDGGAVRSIAVRYDG
LVDENFAVISLFDDFEESEKEEKLETTIDSIRDRFGFLAVQKASSLLENS
RAISRSRLVGGHSAGGLEGLK
>gid:114949  SAG1276  conserved hypothetical protein
MTRIKIVKQKAILDVAESLGYSFRRLSGQIYEHPDHDSFRIFADTNTFKW
FSRDIQGDVIDFVQLVAGVSFKKALSYLETGGFEEAKVIEETYQPFQYYL
REEPFQQARTYLKDIRGLSNQTINSFGRQGLLAQATYQAESVLVFKSFDH
NGTLQAASLQGLVKNEEKYDRGYLKKIMKGSHGHVGISFDIGNPKRLIFC
ESVIDMMSYYQLHQKQLSDVRLISMEGLKLSVIAYQTLRLAAEEQGKLAF
LDTVKPIRLSHYLQAIQETTTFFQTHSNVITMAVDNDEAGREFYQKLSDK
GFPIFQDLPPLQRLETKSDWNDIVKRQKELSLRDLIQSAQAQVNKNHPPP
KREHALEL
>gid:114953  SAG1280  SNF2 family protein
MNQEVLLQMMRATIPRDRALLEAFLYYQAEHFDEEWDSLIHQFMTNRQEI
NKSVQVLHFETDVSAFVQASPYDTAHDLLTYTQVFGQSGLQKLDKLSPSE
KNLVIEVALFNLATRFQLLDSNGHYQTISPDSLLQKSRGANLVNVYRVAN
NLADRISRDIEQFLLTYEPELETRADETVLENEETVDEHKTSVHQAISFR
EEGSLVIASLDVDLSQLDVQIGKTSHLPAYEELSLRRKFEILTYFDQIRN
ERSKVPSFRRGDFDTEMEMTPVFDGEELLTYLEADGSPYELKRTLTTVEE
KELEKIGQAIRIENQEKLTQLGIDLSQFDPDRVGILLDAAGRFRLKNADL
ALLGGYPKASVTQLALATELLQMGLSHEKVEFFFGSQLSIEELRQVAYAF
LYQELSREDAEQFEKDKGNQPDLTLRDWKSKLEKAEGKEVVDEEFAENPL
VQRVLDTYPLGSLVSYKGQDFEVMSVSDARLNGLIRIELVNDFSDIIEQN
PVLYVRTWEEVSQALHQPKAEPQTELEEADQELNLFSFLEEEPVQSIGLL
EPDDSENGHNDTDLEETDNQIPEEEVVETIPEIPVTDFYFPEDLTDFYPK
TARDKVETNIVAIRLVKNLEVEHRNASPSEQELLAKYVGWGGLANEFFDD
YNPKFSKEREELKSLVTDKEYSDMKQSSLTAYYTDPSLIRQMWDKLERDG
FTGGKILDPSMGTGNFFAAMPKHLREKSELYGVELDTITGAIAKHLHPNS
HIEIKGFETVAFNDNSFDLVISNVPFANIRIADNRYDRPYMIHDYFVKKS
LDLLHDGGQVAIISSTGTMDKRTENILQDIRETTEFLGGVRLPDSAFKAI
AGTSVTTDMLFFQKHLDKGYVADDLAFSGSIRYDKDSRIWLNPYFDGEYN
SQVLGTYEVRNFNGGTLSVKGTSDDLIASVETALNHVKAPREIDRNEVII
NPDVLTKQVNDTSIPAEMRENLGQYSFGYQGSTVYYRDNKGIRVGTKTEE
ISYYVDEEGNFKAWDTKHSQKQIDRFNALEVTDNTALDVYVTDDAAKRGQ
FKGYYKKTVFYEAPLSYKEVARIKGMVDIRNAYQEVIAIQRYYDYDKETF
NHLLGKLNRTYDSFVKHYGYLNSAVNRNLFDSDDKYSLLASLEDESLDPS
GKSVIYTKSLAFEKALVRPEKEVKKVHTALDALNSSLADGRGVDFAYMMS
IYQVESQMTLIEELGDLIMPDPEKYLNGELTYVSRQDFLSGDVVTKLEVV
DLFVKQDNQDFNWSHYAGLLEAIKPARITLADIDYRIGSRWIPLAVYGKF
AQETFMGKAYELSDQEVATVLEVSPIDGVITYQSKFAYTYSNATDRSLGV
PASRYDSGRKIFENLLNSNQPTITKQVVEGDKKKNVTDVEKTTVLRAKET
HLQELFQGFVAKYPEVQQMIEDTYNRLYNRTVSKSYDGSHLTIDGLAQNI
SLRPHQKNAIQRIVEEKRALLAHEVGSGKTLTMLGAGFKLKELGMVHKPL
YVVPSSLTAQFGQEIMKFFPTKKVYVTTKKDFAKAKRKQFVSRIITGDYD
AIVIGDSQFEKIPMSREKQVTYINDKLEQLREIKLGSDSDYTVKEAERSI
KGLEHQLEELQKLERDTFIEFENLGIDFLFVDEAHHFKNIRPITGLGNVA
GITNTTSKKNVDMEMKVRQVQAEHGDRNVVFATGTPVSNSISELFTMMDY
IQPDVLERYLVSNFDSWVGAFGNIENSMELAPTGDKYQPKKRFKKFVNLP
ELMRIYKETADIQTSDMLDLPVPEAKIIAVESELTQAQKYYLEELVKRSD
AIKSGSVDPSRDNMLKITGEARKLAIDMRLIDPTYSLSDNQKILQVVDNV
ERIYRDGAGDKATQMIFSDIGTPKSKEEGFDVYNELKDLFVDRGIPKEEI
AFVHDANTDEKKNSLSRKVNSGEVRILMASTEKGGTGLNVQSRMKAVHYL
DVPWRPSDIVQRNGRLIRQGNMHQEVDIYHYITKGSFDNYLWQTQENKLK
YITQIMTSKDPVRSAEDIDEQTMTASDFKALATGNPYLKLKMELENELTV
LENQKRAFNRSKDEYRHTISYSEKHLPIMEKRLSQYDKDIAQSLATKSQD
FVMRFDNQAMDNRAEAGDYLRKLITYNRSETKEVRTLASFRGFDLKMTTR
GASEPLPETISLMIVGDNQYTVALDLKSDVGTIQRISNAIDHIIDDQEKT
QELVKDLKDKLRVAKVEVDKVFPKEEDYQLVKAKYDVLAPLVEKEAEIEE
IDAALAKFSEDTTPQKKQQIALEI
>gid:114968  SAG1297  C-5 cytosine-specific DNA methylase
MKFLDLFAGIGGFRLGMESQGHKCLGFCEIDKFARTSYKAMFNTEGEIEY
HDIKEVTDHDFRQFRGQVDIICGGFPCQAFSLAGRRLGFEDTRGTLFFEI
ARAAKQIQPRFLFLENVKGLLNHDEGRTFATILSTLDELGYDVEWQVLNS
KDFQVPQNRERVFIIGHSRRYRSRFIFPLRREDSPAHLERLGNINPSKHG
LNGEVYLTSGLAPTLTRGKGEGAKIAIPVLTPDRLEKRQHGRRFKDNQDP
MFTLTSQDKHGVVVAGNLPTSFDQTGRVFDISGLSPTLTTMQGGDKVPKI
LLREELPFLKIKEATKTGYAKATLGDSVNLAYPDSTKRRGRVGKGISNTL
TTSDNMGVVVAALEYRQDKWYEVTGIVLEGKLYRLRIRRLTPRECFRLQG
FPDWAYERAESVSSKSQLYKQAGNSVTVTVIEAIAREFRRTEEEEKHELT
T
>gid:115127  SAG1457  IS1381, transposase OrfB
MKAQAIVTSQGRIVSLDIAVNYCHDMKLFKMSRRNIGQAAKILADSGYQG
IMKMYSQAQTPRKSSKLKPLTLEDKTYNHTLSKERIKVENIFAKVKTFKI
FSTTYRNRRKRFGLRMNLIAGMINRELGF
>gid:115174  SAG1505  MutT/nudix family protein
MKQDYISYIRSKVGHETIFLTYSGGILTDGKGRVLLQLRADKNSWGIIGG
CMELGESSVDTLKREFFEETGLRVEPIRLLNVYTNFQDSYPNGDKAQTVG
FIYEVSCPKPVNIEGFHNEETLQLDYFSKEDVKNITIVNEQHQLILDEYF
SQTFQMGR
>gid:115190  SAG1521  transposase, IS30 family, putative
MTKHKHLTLLDRNDIQSGLDRGETFKAIGLNLLKHPTTIAKEVKRNKQLR
ESTKDCLDCPLLRKAPYVCNGCPKRRINCGYKKTFYLAKQAQRNYEKLLV
ESREGIPLNKETFWKIDRVLSNGVKKGQRIYHILKTNDLEVSSSTVYRHI
KKGYLSITPIDLPRAVKFKKRRKSTLPPIPKAIKEGRRYEDFIEHMNQSE
LNSWLEMDTVIGRIGGKVLLTFNVAFCNFIFAKLMDSKTAIETAKHIQVI
KRTLYDNKRDFFELFPVILTDNGGEFARVDDIEIDVCGQSQLFFCDPNRS
DQKARIEKNHTLVRDILPKGTSFDNLTQEDINLALSHINSVKRQALNGKT
AYELFSFTYGKDIASILGIEEITAEDVCQSPKLLKDKI
>gid:115195  SAG1526  IS861, transposase OrfA
MKLSYEDKLEIYELRKIGMSWSQISQRYDVRISNLKYMIKLMDRYGVEIV
EKGRNEYYPPELKQEMIDKVLIHGCSQLSVSLDYALSNCSILTNWLSQFK
KNGYTIVEKTRGRPSKMGRKRKKTWEEMTELERLQEENERLRTENAFLKK
LRDLRLRDEALQSERQKQLEKWSQEDSD
>gid:115196  SAG1527  IS861, transposase OrfB
MVTGGFRLDLLLEITKIARATYYYQLKKLNKPNKDKAIKSDIQSIYDEHR
GNYGYRRIYLELRNRGFVINHKRVQGLMKSMGLTARIRRKRKYASYKGEV
GKKADNLIQRQFEGSKPYEKCYTDVTEFALPEGKLYLSPVLDGYNSEIID
FTLSRSPDLKQVQTMLERAFPAASYSETILHSDQGWQYQHKSYHQFLEDK
GIRPSMSRKGNSPDNGMMESFFGILKSEMFYGLEKSYKSLDDLEQAITDY
IFYYNNKRIKAKLKGLSPVQYRTKSFT
>gid:115206  SAG1537  MutT/nudix family protein
MDFEEKTINRQTVFDGQIIKVAVDDVELPNGLGQSKRELVFHGGAVATLA
VTPEHKIVLVKQYRKAIEGISYEIPAGKLETGESGSKEEAALRELEEETG
YTGNLEILYSFYTAIGFCNEKIVLYLATDLQKVENPRPQDDDEVLELLEL
SYEDCMQMVEKGMIQDAKTIIALQYYGLKMGGHS
>gid:115218  SAG1550  IS1381, transposase OrfB
MKAQAIVTSQGRIVSLDIAVNYCHDMKLFKMSRRNIGQAAKILADSGYQG
IMKMYSQAQTPRKSSKLKPLTLEDKTYNHTLSKERIKVENIFAKVKTFKI
FSTTYRNRRKRFGLRMNLIAGMINRELGF
>gid:115241  SAG1574  DNA polymerase III, delta prime subunit, putative
MDLKRTQPKLLEKFNTILQSDRMSHAYLFSGNFASLDMALYLAQSQFCEK
RQSGLPCQECRACRLIANGEFSDVKIIEPQGQLIKTETIKELTKDFSRSG
FEGKSQVFIIKDCEKMHVNAANSLLKFIEEPQSSSYVILLTNDENNVLPT
IKSRTQIFRFPKQLDMLVHQAEQAGLLKSQASLLAQVADDPKHLEILLTN
KKLLDYLNLSQQFVTTLAKDRQTAYLEVSRLTSQVVDKNDQAFVFQWLTI
MLAKEGQLYDLENTYRAQQMWKSNVSFQNSLEYMVLS
>gid:115250  SAG1584  IS1548, transposase
MIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETWKEMED
FIEMNEPLFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTS
LDAVHQLISVDGKTIRGNRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSN
EIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCLAVKGNQE
TLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL
CQNHPKWHKLRGIGMTRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRG
HWQIESMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK
DLSYRRKQRYISVHLEDYLVQLFGERG
>gid:115262  SAG1596  integrase domain protein
MINDINNFIESKKLSLNSRKSYHYDLKQFYKIIGGHVNSEKLALYQQSLS
EFKLTARKRKLSAVNQFLFFLYNRGTLKEFYRLQETEKITLAQTKSQIMD
LSNFYQDTDYPSGRLIALLILSLGLTPAEIANLKKADFDTTFNILSIEKS
QMKRILKLPEDLLPFLLESLEEDGDLVFEHNGKPYSRQWFFNQLTDFLNE
KNEQQLTAQLLREQFILKQKENGKTMTELSRLLGLKTPITLERYYR
>gid:115284  SAG1618  Snf2 family protein
MSRMIPGRIRNQGIELYEQGLVSLISQEGNLLKAKVGDCQIEYSLVTEET
KCSCDFFARKGYCQHLAALEHFLKNDPEGKAILSKVQVQQESQQETKKKT
SFGSVFLDSLIINEDDTIKYQLSAQGEQNPYANDIWWTLKIRRLPDDRSY
VIRDIKAFLNTVRKEAYYQIGKQYFETLSLIQFDETSQELIEFLWRLIPS
HSSKIDLEFILPNQGRHLSLTRGFFEEGVTLMNALENFSFESDFHQFNHL
YFKELEGEDHLYQFKVIVHRQSIELEIKEKDLKPLFANSYLFYRDTFYHL
NLKQEKMVTAIRSLPIEGDLAKHIHFDLDDQDKLAAHLLDFKEIGLVDAP
RSFSIHDFKVNFEFDINSQNEILLQMVFDYGNDLTVHNRQELEQLTFASH
FKHEEKVFKLLEKYGFAPHFSTSHPAYSAQELYDFYTYMLPQFKKMGTVS
LSAKLESYRLIERPQIDIEAKGSLLDISFDFSDLLENDVDQALVALFDNN
PYFVNKSGQLVIFDEETKKVSATLQGLRARRAKNGHIELDNIAAFQLSEL
FANQDNVSFSQHFYQLIEDLRHPEKFKIPGLSVSASLRDYQLTGVRWLSM
LDHYGFAGILADDMGLGKTLQTISFLSTKLTRDSRVLILSPSSLIYNWQD
EFHKFAPDVDVAVAYGSKIRRDEIIAERHQVIITSYSSFRQDFETYSEGN
YDYLILDEAQVMKNAQTKIAHSLRSFEVKNCFALSGTPIENKLLEIWSIF
QIILPGLLPGKKEFLKLNPKQVARYIKPFVMRRRKEEVLPELPDLIEMNY
PNEMTDSQKVIYLAQLRQIQESIQHSSDADLNRRKIEILSGITRLRQICD
TPRLFMDYDGESGKLESLRQLLTQIKENGHRALIFSQFRGMLDIAEREMV
AMGLTTYKITGSTPANERHEMTRAFNAGSKDAFLISLKAGGVGLNLTGAD
TVVLIDLWWNPAVEMQAISRAHRLGQKENVEVYRLITRGTIEEKILEMQE
TKKHLVTTVLDGNETHASMSVDDIREILGVSK
>gid:115285  SAG1619  IS1548, transposase
MIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETWKEMED
FIEMNEPLFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTS
LDAVHQLISVDGKTIRGNRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSN
EIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCLAVKGNQE
TLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWL
CQNHPKWHKLRGIGMTRNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRG
HWQIESMHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK
DLSYRRKQRYISVHLEDYLVQLFGERG
>gid:115288  SAG1622  conserved hypothetical protein
MSPIDEFTYIKQNKIVYDSNSLIQLYFPIMGSDAMALYDYFVHFFDDGIR
RHKFSEVLNHLQYGMPRFQDALVMLTALDLLTVYQATGTYLVKLNQAMSN
ELFLSNPIYRRLLEKRIGEVAVAELDMKIPKNARDISKKFTDVFSDLGQP
KQEVNRSKNVFDLESFKRLMMRDGLRFNNEKDDVLGIYSVSELYHLNWYD
TYQLAKQTAINGMIAPQRMKVQQNEGQHIKDNQSFTNNEKVILRESKNDS
ALVFLEKIKRSRKAVTTSGEKTLLEDLAKMNFLDEVINVMVLYTLNKTKS
ANLNKAYIMKVANDFAFQNVMTAEDAVLKIRDFSDQKVRTKTETKKKQSN
VPEWSNPDYKDEVSPEKEIELEQFKTDALKRLERLGKDGES
>gid:115384  SAG1719  MutS2 family protein
MNNKILEQLEFNKVKELILPYLKTEQSQEELSELEPMTEAPKIEKSFNEI
SDMEQIFVEHHSFGIVSLSSISESLKRLELSADLNIQELLAIKKVLQSSS
DMIHFYSDLDNVSFQSLDRLFENLEQFPNLQGSFQAINDGGFLEHFASPE
LERIRRQLTNSERRVRQILQDMLKEKAELLSENLIASRSGRSVLPVKNTY
RNRISGVVHDISSSGSTVYIEPRAVVTLNEEITQLRADERHEESRILHAF
SDLLRPHVATIRNNAWILGHLDFVRAKYLFMSDNKATIPEISNDSTLALI
NVRHPLLSNPVANDLHFDQDLTAIVITGPNTGGKTIMLKTLGLAQLMGQS
GLPVLADKGSKIAVFNNIFADIGDEQSIEQSLSTFSSHMTHIVSILNEAD
HNSLVLFDELGAGTDPQEGASLAMAILEHLRLSNIKTMATTHYPELKAYG
IETNFVENASMEFDAETLSPTYRFMQGVPGRSNAFEIASRLGLAPFIVKQ
AKQMTDSDSDVNRIIEQLEAQTLETRRRLDHIKEVEQENLKFNRAVKKLY
NEFSHERDKELEKIYQEAQEIVDMALNESDTILKKLNDKSQLKPHEIIDA
KAQIKKLAPQVDLSKNKVLNKAKKIKAARAPRIGDDIIVTSYGQRGTLTS
QLKDGRWEAQVGIIKMTLTQDEFTLVRVQEEQKVKSKQINVVKKADSSGP
RARLDLRGKRYEEAMQELDNFIDQALLNNMGQVDIIHGIGTGVIREGVTK
YLRRNKHVKHFAYAPQNAGGSGATIVTLG
>gid:115389  SAG1724  helicase, putative
MEVFFTGTIERIIFENASNFFKILLLEIEDTDSDFDDVEVIITGTMADVI
EGEEYTFWGTLTQHPKYGEQLQSVRYERAKPTSGGLVKYFSSEQFKGIGK
KTAQRIVELYGDNTIDKILESPEQLSTISGLSKINREAFIAKLKLNYGTE
QVLAKLAEYGLSNRAAIQIFDHYKEESLEVINENPYQLVEDIQGIGFKIA
DQLAEQVGIESDSPKRFRAAIIHTLVESSMEQGDTYIEARTLLEKTITLL
EEARQIELDPSIVAKELTNLIAEDKVQHIGTKIFSNTLFFAEEGIKKNLQ
RILNQPLDKQLNHKDIDREIRDIQKSLNIHYDNIQEKAIREALLSKVFIL
TGGPGTGKTTVINGIIEAYSELHHIDLNKNDIPIVLAAPTGRAARRMNEL
TGLPSATIHRHLGLNGDSDYQSLDDYLDCSLIIIDEFSMVDTWLANQLFD
ALDSHTQVIIVGDSDQLPSVGPGQVLADLLNINALPHVKLEKIFRQSEES
TIVTLANQMRQGFLPEDFTAKKADRSYFEASANIIPNMISKIVQSALKSG
IEAHEIQILAPMYRGQAGINNLNLIMQNLLNPLKDNNQFTFNDINFRIGD
KVLHLVNDTELNVFNGDIGYITDLIPAKYTESKQDEIYMTFDGQEVIYQR
KEWLKITLAYAMSIHKSQGSEFQVVILPITRQSGRMLQRNLIYTAITRSK
SKLILLGEIGAFDFAVKNEGAKRNTYLIERFENKQEIANSQKIEDSSIDQ
KIDNTIINTSIPKTATPIEQTNLSKITYRLTEENYLTIDPMIGINQQDIS
AIFDSK
>gid:115415  SAG1750  exonuclease
MKQLQEYIAFDLEFNTVGEHSHIIQVSAVKYSNHQEIALFDTYVHTKVPL
QSFINGLTGITARDIIGAPKIEIVLTDFQSFVGDTPLIGYNGYKSDLPLL
VENGLDLTSQYQVDLYDEAFVRRSTDLNGIVNLKLTTVADFLGIKGKAHN
SLEDARMTARVYEKFLDLDENKIYLKQQKEVAVDSPFATLGNLFD
>gid:115446  SAG1781  primase-related protein
MKKIDIQEVIVVEGKDDTANLRRFYNVDTYETRGSAIDEDDLERIERLHN
LRGVIVFTDPDYNGERIRKIIMNAIPTVRHAFLNRDEAKPGSKTKGRSLG
VEHASFEDLQKALSKVTQHFDDEDHFDITQADLIRWGFITASDSRKRREY
LGNQLRIGYSNGKQLLKRLRLFGVTKAEVEECMEGY
>gid:115447  SAG1782  deoxyribonuclease, TatD family
MIKIFDTHTHLNVENFEGKIDEEINLASELGVTKMNVVGFDQDTISKSLE
LSSQYAQVYSTIGWHPTEAGSYDDNIESMIISHLENPKVIALGEIGLDYY
WMEDPKDIQIEVFKRQIELSKEYNLPFVVHTRDALEDTYEVIKESGVGPF
GGIMHSFSGSLEMAQKFIDLGMMISFSGVVTFKKALDVQEAARELPLDKI
LVETDAPYLAPVPKRGRENKTAYTRYVVEKIAELRGITVEEVAEATYQNA
VRIFRLDEKN
>gid:115460  SAG1795  transposase, IS30 family, putative
MTKHKHLTLLDRNDIQSGLDRGETFKAIGLNLLKHPTTIAKEVKRNKQLR
ESTKDCLDCPLLRKAPYVCNGCPKRRINCGYKKTFYLAKQAQRNYEKLLV
ESREGIPLNKETFWKIDRVLSNGVKKGQRIYHILKTNDLEVSSSTVYRHI
KKGYLSITPIDLPRAVKFKKRRKSTLPPIPKAIKEGRRYEDFIEHMNQSE
LNSWLEMDTVIGRIGGKVLLTFNVAFCNFIFAKLMDSKTAIETAKHIQVI
KRTLYDNKRDFFELFPVILTDNGGEFARVDDIEIDVCGQSQLFFCDPNRS
DQKARIEKNHTLVRDILPKGTSFDNLTQEDINLALSHINSVKRQALNGKT
AYELFSFTYGKDIASILGIEEITAEDVCQSPKLLKDKI
>gid:115487  SAG1822  protein of unknown function
MVKKRYSKNSHNLITLLGIVVLASLISDFWSEVIATLLIIGGGYCAYYVY
DKKRLKRFTSNQRIEALKSDIKETDQDIRHLEILKKDNRSKEYIKLAHQI
LPQLDLIRNEANQLQKAIEPNIYKRITKKANTFSNEINEQLIKLHASPEL
EPISDQEDEMIRIAPELKPFYHNIQDDHFAILKKIEEADNKAELAAIHQA
NMKRFTDVLAGYIRIKQSPKNFNNAKERLEQALQAIKKFNLDLDETLRQL
NESDMKDFDVSLRMMQGERNSK
>gid:115527  SAG1859  prophage LambdaSa2, site-specific recombinase, phage integrase family
MNIVEPLRDKDDIQAMKDYLSSWNEKYYMLFLLGINTGFRVGDILKLKVK
DVQGWHIKVREQKTGKYKSIKMTRPLKNELREFVKDKELHEYLFQSRVGK
NKALSYKTVYWFLKRAAEDLGIDNVGTHTMRKTFGYHYYKKYKNVADLMS
LFNHSSPAVTLIYICVRQDELDTKMSNFSL
>gid:115537  SAG1869  prophage LambdaSa2, type II DNA modification methyltransferase, putative
MKFLDLFAGIGGFRLGMEQAGHECIGFCEINKFARASYKVIHDTEGEIEL
HDITRVSDEFIRGIGSVDVICGGFPCQAFSIAGNRRGFEDTRGTLFFEIA
RFASILRPKYLFLENVKGLLNHEGGATFETIIRTLDELGYNVEWQIFNSK
NFGVPQNRERVFIIGHLRGEGTRPIFPFESSITENYPIHTRKIGNVNPSG
NGMNGEVYDSEGLSPTLTTNKGEGVKIAVNVVGRLPGKFEMPNRVYDPDG
LAPTIRTMQGGGLEPKIIQRGRGYNQGGEYEISPTVTCNSWQENNLLKIK
EATKKGYSEAEAGDSVNLSHPNSETRRGRVGKGIANTLLTGEEQGVVVYD
LYNRRKKDIVGTLTASGHNGNTTTGTFGISNGFRIRKLTPRECWRLQGFP
DWAFDKASQVNSNSQLYKQAGNSVTVNVIAAIARRLL
>gid:115538  SAG1870  prophage LambdaSa2, DNA replication protein DnaC, putative
MNPFKNFETRQVLEETCEVHGCQLWLTKVPIKGRLEELKQCPECTKAAIN
IFENKLNSQSKINSKLADTYAVFERDSLVSDKLRAKSLENYEIKDEIDQH
AINYAKRMEQFYRQDRTGNAIITGPSGVGKSHLTYGLAKFMNEQFKAYES
PKSVLFISLVSLFTKIKESFKVDNGYRQADMIELLTRVDYLFLDDLGKES
RKGDSQNNEWTHQILYEILDNRSNTIINTNLSSKEIKALYADNYGNGALS
SRILEGVTGNSFAYPKDMEDRRY
>gid:115552  SAG1885  prophage LambdaSa2, site-specific recombinase, phage integrase family
MASYRKRENGLWEYRISYKTIDGKYKRKEKGGFKTKKLAQAAAIEIEKKL
TQNILTNDEVTLYDFVKTWSEVYKRPYVKDKTWETYSKNFKHIKNYFQEL
KVKDITPLYYQKKLNEFGEKYAQETLEKFHYQIKGAMKVAVREQVVTFNF
AEGAKVKSQVEPKNEEEDFLEEREYKALLALTRENIQYVSYFTLYLLAVT
GLRFSEAMGLTWSDIDFKNGILDINKSFDYSNTQDFADLKNESSKRKVPI
DSNTIDILREYKKNHWQANIKNRVCFGVSNSACNKLIKKIVGRKVRNHSL
RHTYASFLILNGVDIVTISKLLGHESPDITLKVYTHQMEALAERNFEKIK
NIFLVA
>gid:115578  SAG1911  DNA polymerase III, alpha subunit, Gram-positive type
MSELFKKLMDQIEMPLEIKNSSVFSSADIIEVKVHSLSRLWEFHFSFPEL
LPIEVYRELQTRLVNSFEKADIKATFDIRAETIDFSDDLLQDYYQQAFCE
PLCNSASFKSSFSQLKVHYNGSQMIISAPQFVNNNHFRQNHLPRLEQQFS
LFGFGKLAIDMVSDEQMTQDLKSSFETNREQLLEKANQEAMQALEAQKSL
EDSAPPSEEVTPTQNYDFKERIKQRQAGFEKAEITPMIEVTTEENRIVFE
GMVFSVERKTTRTGRHIINFKMTDYTSSFAMQKWAKDDEELKKYDMISKG
SWLRVRGNIENNNFTKSLTMNVQDIKEIVHHERKDLMPADQKRVEFHAHT
NMSTMDALPTVESLIDTAAKWGHPAIAITDHANVQSFPHGYHRAKKAGIK
AIFGLEANIVEDKVPISYNEVDMNLHEATYVVFDVETTGLSAANNDLIQI
AASKMFKGNIIEQFDEFIDPGHPLSAFTTELTGITDNHVRGSKPILQVLQ
EFQNFCQGTVLVAHNATFDVGFMNANYERHNLPLITQPVIDTLEFARNLY
PEYKRHGLGPLTKRFQVALEHHHMANYDAEATGRLLFIFLKEARENRDVT
NLMELNTKLVAEDSYKKARIKHATIYVQNQVGLKNIFKLVSLSNVKYFEG
VARIPRSVLDAHREGLLLGTACSDGEVFDALLSNGIDAAVTLAKYYDFIE
VMPPAIYRPLVVRDLIKDEVGIQQIIRDLIEVGRRLDKPVLATGNVHYIE
PEDEIYREIIVRSLGQGAMINRTIGRGEDAQPAPLPKAHFRTTNEMLDEF
AFLGKDLAYEIVVTNTNTFADRFEDVEVVKGDLYTPFVDRAEERVAELTY
AKAFEIYGNPLPDIIDLRIEKELASILGNGFAVIYLASQMLVQRSNERGY
LVGSRGSVGSSFVATMIGITEVNPMPPHYVCPNCQHSEFITDGSCGSGYD
LPNKNCPKCGTLYKKDGQDIPFETFLGFDGDKVPDIDLNFSGDDQPSAHL
DVRDIFGEEYAFRAGTVGTVAEKTAFGFVKGYERDYNKFYNDAEVERLAT
GAAGVKRSTGQHPGGIVVIPNYMDVYDFTPVQYPADDMTAAWQTTHFNFH
DIDENVLKLDILGHDDPTMIRKLQDLSGIDPSNILPDDPDVMKLFSGTEV
LGVTEEQIGTPTGMLGIPEFGTNFVRGMVNETHPTTFAELLQLSGLSHGT
DVWLGNAQDLIKEGIATLSTVIGCRDDIMVYLMHAGLQPKMAFTIMERVR
KGLWLKISEDERNGYIQAMRDNNVPDWYIESCGKIKYMFPKAHAAAYVLM
ALRVAYFKVHYPIFYYCAYFSIRAKAFELRTMSAGLDAVKARMKDITEKR
QRNEATNVENDLFTTLELVNEMLERGFKFGKLDLYRSHATDFIIEEDTLI
PPFVAMEGLGENVAKQIVRAREDGEFLSKTELRKRGGVSSTLVEKFDEMG
ILGNLPEDNQLSLFDDFF
>gid:115639  SAG1974  MutT/nudix family protein
MDIFDELFDFSGAKIALFCEGKILTSLRDDFPDLPYAGFWDLPGGGREDN
ETPLECLFREVDEELSLTLTRNHIDWVKTYRGMLKPDKLSVFMVGHISQK
EYDSIVLGDEGQDYKLMSIDEFLSHKKVIPQLQERLRDYLEVEDDYFTKG
GS
>gid:115643  SAG1978  ATPase, AAA family
MADNLALRMRPRNINEVIGQQHLVGNGKIIDRMVAANMLSSMILYGPPGI
GKTSIASAIAGTTKYAFRTFNATVDSKKRLQEIAEEAKFSGGLVLLLDEI
HRLDKTKQDFLLPLLENGNIIMIGATTENPFFSVTPAIRSRVQIFELEPL
SNEDIKKAIQLAISDKERGFPFLVTIDDEALDFIVTATNGDLRSAYNSLD
LAVMSTSPNEDGSRHISLETMENSLQCSYITMDKNGDGHYDILSALQKSI
RGSDVNASLHYAARLVEAGDLPSLARRLTIIAYEDIGLANPEAQIHTVTA
LEAAQRIGFPEARILIANIVVDLALSPKSNSAYLAMDAALADLRRSGNLP
IPRHLRDGHYSGSKTLGNARDYKYPHAYPEKWVKQQYLPDKLVGHNYFEA
NETGKYERALGSNKERIDKLSD
>gid:115652  SAG1986  site-specific recombinase, phage integrase family
MANARYRRRGNQNLWAYEIREEGKTVAYNSGFKTKKLAEAEAEPILQKLR
TGSIITKNISLPELYQEWLDLKIMPSNRSDVTKKKYLSRKVTLEKLFGDK
PISQIRPSEYQRIMNNYGQRVSRNFLGRLNTGVKQSLQMAIADKVMIEDF
TQNVELFSTVKSQDADSKYLHSEKAYLDLINAVKDKFNYKKSVVPYIIYF
LLKTGMRYGELIALTWEDIDFDKGIFKTYRRFNSETSQFVPPKNKTSIRI
VPVDNECLEILKNLKIEQNQSNKELGLQNTNNMVFQHFGYPNSVPSTNGT
NKVLRGIVQELNIEPIITTKGARHTYGSFLWHRGYDLGIIAKILGHKDIS
MLIEVYGHTLEEKIQEEYNEIKQLW
>gid:115654  SAG1988  conserved hypothetical protein
MFNSVKIDYFAVTVKDVTPEYVIEKLLLIPLTNFTLNEWGVNKYQRHYAC
SEIKVYFNLDPNSSMGVHVELKGQGCRQYEEFIEGNDNNWTSLVKRLIDN
NSNFTRLDIANDIFDESLNVQRLYEYSKKGLCITTARHAEYHEKFVIDSG
ELVGETVVFGARGNQQWCVYNKLMEQNGKLQTDIDINSWVRAELRCWQEK
ANLIAHQLNDMRPLASIYFEAINGHYRFVSPKARDKNKRRRESVRWWQNY
INTEEKTRLSIVREKPTLRQSEAWTDKQVSKTIAKVYMAKYEAYGIDQAE
VFLQDLLRRGVEKFTDNDEKEIEQYVREQQSSEYWGIKKADL
>gid:115659  SAG1993  site-specific recombinase, phage integrase family
MATYRQRGKKKLWDYRIFNEKSELVASGSGFKTKREAMNEAMRIEQQKLL
VNSISSDITLYDLWFEWYSLIIKPSNLAETTKNKYFTRGSVIRKLFGNQK
VNKIKHSAYQRKLNTYAEKYTKNHVRRLNSDIKKAIQFAKRDGVLLSDFT
DGVVIAGRKFVKDADDKYLHSIFDYKKVISYLENNLDYSNSIVYYLLLVL
FKTGLRVGEALALTWDDVNFEDLEIKTYRRFSGDKGTFSPPKTKTSIRTI
PISQSLALILRDLKDDQQVMLKNLKIVNMNNQIFYDYRYGVSTNSAINKS
LKNVLKILNINSKMTATGARHTYGSYLLAKGVDIWVVARLMGHKDITQLL
ETYGHVLTEVINKEYETVRSLVS
>gid:115668  SAG2002  IS1381, transposase OrfB
MKAQAIVTSQGRIVSLDIAVNYCHDMKLFKMSRRNIGQAAKILADSGYQG
IMKMYSQAQTPRKSSKLKPLTLEDKTYNHTLSKERIKVENIFAKVKTFKI
FSTTYRNRRKRFGLRMNLIAGMINRELGF
>gid:115682  SAG2017  transcriptional regulator, Cro/CI family
MGHFSEKSGSLILRRSLFIVDAIYLKKFRKKTGLKQKEFALSSGLTLKSL
RNYEQGKRKLTLEKYQEIKSHFGYLVENDSSRLQVMIDYVRITLKDVRDL
EFFCRNFLHCAFKEFQPFESKLMNYNHLWKRGDIWIFDFADKHETGNFQI
TVQLSGRGCRQLELLMETEKFTWHDWLSYLRNSYRDDMNVTRFDIAIDEL
YLGKDRENEQFHLSDMISKYYRHELDFESLRTWNYIGGGSLNFSDMEEIE
QNRQGISLYFGSRQSEMYFNFYEKRYEIAKQEGITVEEALEIFELWNRYE
IRLSQSKANAAVDEFISGVPIGEISRGLIVSKIDVYDGKNEYGSFQADRK
WQLMFGGVEPLKFVTKPEAYSIERTLRWLSDSVSPSLAMIREYDMIVDGD
YLQTILNSGEVNERGEKILDSIKASLGIL
>gid:115687  SAG2022  transposase, ISL3 family
MSDLLSLPDIKTIEPPQENETDMMFKVEAVGPPERCPECGFDKLYKHSSR
NQLIMDLPIRLKRVGLHLNRRRYKCRECGSTISVDEKRSMTKRLLKSIQE
QSMSKTFVEVAESVGVDEKTIRNVFKDYVALKEREYQFETPKWLGIDEIH
IIRRPRLVLTNIERRTIYDIKPNRNKETVIQRLSEISDRTYIEYVTMDMW
KPYKDAVNTILPQAKVVVDKFHVVRMANQALDNVRKSLKAHMSQKERRTL
MRERFILLKRKHDLNERESFLLDTWLGNLPALKEAYELKEEFYWIWDTPD
PDEGHLRYSQWRHRCMSSNSKDAYKDLVRAVDNWHVEIFNYFDKRLTNAY
TESINSIIRQVERMGRGYSFDALRAKILFNEKLHKKRKPRFNSSAFNKAM
LYDTFNWYEVNDHDITD
>gid:115755  SAG2090  conserved hypothetical protein TIGR00250
MGLDVGSKTVGVAISDPLGFTAQGLEIIKIDEESGNFGFDRLAELVKEYK
VDKFVVGLPKNMNNTSGPRVEASQAYGDKITELFNLPVEYQDERLTTVQA
ERMLVEQADISRGKRKKVIDKLAAQLILQNYLDRMF
>gid:115776  SAG2112  site-specific recombinase, phage integrase family
MRDNGSRVLIISSWRPDPYKAGNVLVKFAMRFTHPITKKSHKKYLSTGAS
KGWFTTKATPSKKLPSGKERLLVSDIKNTQLITQVTQELNKLVDDYIAEL
MGIKPKKAKKLLTLEEIAKPFDKDGNFYGKAFKAWHERVKPANNTLKTRV
TIYNRYIEPNFDTRMSITKFAFMTDEIQNLINASSMHMARNLHIYLKMIF
DWSVENGQITLTQDPIASNKVKRRVLTKSEEQDKKREDIAEKYLEASEVN
HVLRLIESWTNRPDNQLIADVLRMIFLTGMRPSEVLGLNEDMLDFEKKWI
KVHWQRASKNKSDDMMEALNLDEKERYRADLKTKESVRTIPMSPEVEKIL
RHYIDRNKFQAQFSPTYQDLGYLFTRTYIRAGNRQGSPLYHNELSQFLRG
GSSQSAKYNKKAGKPYKDIDSFLDFGRPIHVIPHMFRHSFISIMASEGID
LPTIREFVGHSEDSKEIERVYLHVIKKQKDTMRGAVEKLEKLIE
>gid:115778  SAG2114  conserved hypothetical protein
MDNSVMLDYLAVTIKGLAPDDVIEKILILPKDKFVLNEWGINKYQRHYSF
SEIKVYFNKDWQSKMGVFIELRGQGCRQYEEYMENNVNNWVTLMKRISEC
HSNVTRLDIANDIFDDSLSVPLIYSYCKKQLCISTAKTFDYHEKSLLENG
EKVGEMVTIGVRGTQQWCVYNKLLEQKLDQELPNTPLSWTRAELRCWQEK
ANLLAKQIKEGRPLKEIYFEVINGHYRFVSPRDKDSNRWRRKTVKWWNDY
LETQEKTVLSVKRTKPTLKRSEKWTEKQVSRTLGKLYVAKAESHGQEKAD
SYIQHLLELGISKLTETDETDIQQYKQEQLSSAYWGIRKDDL
>gid:115800  SAG2136  conserved hypothetical protein
MNLNDRLKIEEMEEKYDSFKPRINALVEAIDDFQKHYEDYVKLREFYGSE
DWFRLSEQTENNLKCGVLSEDQLFDFIGEHNELVGQFLDMSSQMYRHL
>gid:114466  celA  competence protein CelA
MFEIVLEKIKSHKWETTGIIVGLLLFGILGLNHFGTHHKEDNLNINLEKK
VSTITEKKVPMISHVKDKVSNQVTVDVKGAVNHPGVYSLPSQSRVTDAIK
RAGGLSNLADSKSVNLAQKLQDETVIYVAQKGEKITVVEEEKANNIATQG
NSKGKINLNKADLSSLQTISGVGAKRAQDILDYRDSQGGFKTIDDLKNVS
GIGEKTLEKLRQDVTID
>gid:114845  cpsK  polysaccharide biosynthesis protein CpsK(V)
MNNFVCHTLYHLLITIIKLKDKENTRIFICDTITNYETWVKTLNEQGIRT
EAVKEFAYREQLQSKNIEEVMELVDSDLNHYFERVDTQVYLFNDDTLIGR
YMVYLGKNYHLIEDGYNCFQAKLFLGGSVVKRVIKTYLFKKYVPYGFSKY
CLSIEVNSLVGLPHDIRSKKYKELPRKKLFDSLNKEQKSLIFKIFKTKPL
TITPKSVLLLTQPLAQDKCYKTPTERFQSIQEQYDYFDDIVQEYRTLGYN
VYLKVHPRDVVDYSKLPVELLPSNIPMEIIELMLTGRFEFGITHSSTALD
FLTCVDKKITLVDLKDIK
>gid:115391  dinP  DNA-damage-inducible protein P
MLIFPLINDTSRKIIHIDMDAFFASVEERDNPSLKGKPVIIGSDPRKTGG
RGVVSTCNYEARKFGVHSAMSSKEAYERCPQAIFISGNYQKYRQVGMEVR
DIFKKYTDLVEPMSIDEAYLDVTENKMGIKSAVKLAKMIQYDIWNDVHLT
CSAGISYNKFLAKLASDFEKPKGLTLILPDQAQDFLKPLPIEKFHGVGKR
SVEKLHALGVYTGEDLLSLSEISLIDMFGRFGYDLYRKARGINASPVKPD
RVRKSIGSEKTYGKLLYNEADIKAEISKNVQRVVASLEKNKKVGKTIVLK
VRYADFETLTKRMTLEEYTQDFQIIDQVAKAIFDTLEESVFGIRLLGVTV
TTLENEHEAIYLDF
>gid:113619  dnaA  chromosomal replication initiator protein DnaA
MTENEQLFWNRVLELSRSQIAPAAYEFFVLEARLLKIEHQTAVITLDNIE
MKKLFWEQNLGPVILTAGFEIFNAEITANYVSNDLHLQETSFSNYQQSSN
EVNTLPIRKIDSNLKEKYTFANFVQGDENRWAVSASIAVADSPGTTYNPL
FIWGGPGLGKTHLLNAIGNQVLRDNPNARVLYITAENFINEFVSHIRLDS
MEELKEKFRNLDLLLIDDIQSLAKKTLGGTQEEFFNTFNALHTNDKQIVL
TSDRNPNQLNDLEERLVTRFSWGLPVNITPPDFETRVAILTNKIQEYPYD
FPQDTIEYLAGEFDSNVRELEGALKNISLVADFKHAKTITVDIAAEAIRA
RKNDGPIVTVIPIEEIQIQVGKFYGVTVKEIKATKRTQDIVLARQVAMYL
AREMTDNSLPKIGKEFGGRDHSTVLHAYNKIKNMVAQDDNLRIEIETIKN
KIR
>gid:115540  dnaC-1  prophage LambdaSa2, replicative DNA helicase
MDELKVLPHDIQAEQSVLGSIFIKPEKMIEVAEYLKPNDFYRPAHKILFK
AMVSLADRGEAIDIVTIKSTLESTDELGMVGGISYIAEIVNAVPTSSHAE
HYAKIVAKKAQLRSIIDNLSDSIGNAYDEDMDIDEIIAKAERSLIEVSQA
SNKSSFRPIHDVLLENHSKIEERSNNTSQITGIETGFYDFDKLITGLHED
QLIVLAARPAMGKTALALNIAQNVATKSNKAVAVFSLEMGAESLVERMLS
AEGTIINHHIRTGNLTVNEWQRLIYAQGQLAEAPIFIDDTAGVKITDIRA
RARRLSQETDGLGLIVIDYLQLIQGSRSDNRQQEVSEISRQLKIIAKELK
VPVIALSQLSRGVEQRNDKRPIMSDLRESGSIEQDADIVAFLYRDAYYQD
KKEGQPENDITELIIRKNRHGNLGTVKLYFHKEYTKFSSVEEE
>gid:115803  dnaC-2  replicative DNA helicase
MVEVSELRVQPQDLLAEQAVLGSIFISPEKLIMVREFISPDDFYKYSHKV
IFRAMITLADRNDAIDAATVRNILDDQGDLQNIGGLGYIVELVNSVPTSA
NAEFYAKIVSEKAMLRDIISKLTDTVNMAYEGNDSDEIIATAEKALVDIN
EHSNRSGFRKISDVLKVNYENLELRSQQTSDVTGLPTGFRDLDRITTGLH
PDQLIILAARPAVGKTAFVLNIAQNVGTKQNRPVAIFSLEMGAESLVDRM
LAAEGMVDSHSLRTGQLTDQDWNNVTIAQGALADAPIYIDDTPGIKITEI
RARSRKLSQEVDDGLGLIVIDYLQLISGTRPENRQQEVSEISRQLKILAK
ELKVPVIALSQLSRGVEQRQDKRPVLSDIRESGSIEQDADIVAFLYRDDY
YRREGEEAEEIVEDNTVEVILEKNRAGARGTVKLMFQKEYNKFSSIAQFE
E
>gid:114621  dnaE  DNA polymerase III, alpha subunit
MFAQLDTRTVYSFMDSLVDLKTYVSKSKSLGYQTIGILDHSNLYAAYHFI
QEAQKANLRPIVGFSFDIVVENRSIEVYCIAINTVGYKNLLKLSTAQMSE
KMSLNLLTEHLEGVQLILPYQDVLDQLNLPFDYVIGVNLTSPQIPYTKPI
IAIDTVRYFQKNDIETLQMLHAIRDNVLLKDAQYASKNQELKPCQEMTLA
FKERFPEALANLESLVENVAYHFDSDFKLPIFNREIPAGEELRTLTQNNL
KSKGLWSDAYQERLEKELTIIHKMGFDDYFLIVWDLLRFGRSKGYYMGMG
RGSAAGSLVSYALNITGIDPVKHNLIFERFLNEERYSMPDIDIDLPDIYR
GEFLRYVRNRYGSMHSAQIVTFSTFGAKQAIRDVFKRFGASEYELTNITK
KIHFRDNLTSVYNRNLAFRQIIDSKIEYQKAYDIAKRIEGNPRQTSIHAA
GVVMSDDLLTDHIPLKNGEDMMITQYDASSVEDNGLLKMDFLGLRNLTFV
QKMKEKVDKDYGISIQLETIDLEDKETLKLFAAGQTKGIFQFEQSGAINL
LRRIRPECFEDVVATTSLNRPGASDYTENFINRRFGKEKIDLVDPVIAPI
LQPTYGIMLYQEQVMQIAQTYAGFTLGKSDLLRRAMSKKNSKEMQKMSQS
FLEGAVSKGHRQEDARLIFERMAKFAGYGFNRSHAFAYSALAFQLAYFKA
HYSDVFYDIMINYSNSDYLIDAIDFGFVIEKPSINTISYRDRIYKKKIYL
GLKNIKGVPNDLAYWISKNQPFQSIEDFLMRLPQQFQKSGFISPLIAIGA
FDEFDNNRRKITSNLDSLFTFVNELGSLFADTSYHWLEVEDFSNSEKYEM
EQDILGVGISPHPLVGISQKASRPFIPISQVQENSEATILVQLKQVKVIR
TKSSGQQMAFLTVMDINSKMDITVFPETFNIVRDDLQEGKYYYLHGKIQK
RDERLQMVLNGVQEATEERFWILLKNHDNDKKISEILSKYKGHIPVYLHY
ETTKETIQSKVHLVRKDSGLALDLSEFVVKTVYQ
>gid:115099  dnaG  DNA primase
MGYFCGGHDLAIDKEKISEIKNSVNIVDVIGEVVGLTKTGRNHLGLCPFH
KEKTPSFNVIEDRQFFHCFGCGRSGDVFKFVEDYQHISFLDSVQVLAERS
GIPLDTNFKGQVPKKPKANQSLLDIHRVASGFYHAYLMTTNDGERARQYL
AERGVTEDLIKHFQIGLSPGGQDFLYRRLAKEFDEKTLMSSGLFNYSENS
NQFYDSFNNRIMFPLTNDIGEVIAFSGRVWTQEDIDRKQAKYKNSRATPI
FNKSYELYHLDKARAVINKAHEVYLMEGFMDVIAAYRAGIENVVASMGTA
LTNEHVRHLKRFTKKVVLTYDGDRAGQNAIDKSLELLSDMTVDIVRIPNK
MDPDEFLQANSAEDFKQLLENGRISNTEFYIHYLKPENTDNLQSEIAYVE
KIAKLIAKSPSITAQNSYITKVAELLPDFDYFQVEQSVNNERLHHRSQQQ
ASSSVQTSATVQLPQTGKLSAITKTEMQLFHRLLNHPYLLNEFRNRDNFY
FDTTEIQVLYELLKESGEITSYDLSQESDKVNRTYYIILEEQLPVEVSIG
EIEAVEKARDRLLKERDLRKQSQLIRQSSNQGDEEGALAALENLIAQKRN
ME
>gid:115287  dnaI  primosomal protein DnaI
MKSVGQALENQGRVPRNTNDELIQMILADAQVAEFIKTHQLSQREINISM
SKFNQFLIERQKFKNKDSQYIAKGYEPILVMNEGYADVSYLETRELIEAQ
KKQAISDRINLVNLPKSYRNIRMTDFDINNESRMKAMSQLLDFVETYPSY
NHKGLYLYGDMGVGKSYLMAAMARELSERKGVSTTLLHFPSFAIDVKNAI
SSGTVKDEIDAVKSVPILILDDIGAEQATSWVRDEILQVILQHRMLEELP
TFFTSNYSFNDLERKWANIKGSDETWQAKRVMERVRYLAIEFHLEGPNRR
>gid:113620  dnaN  DNA polymerase III, beta subunit
MIHFSINKNFFLHALTVTKRAISHKNAIPILSTVKIEVTRDAIILTGSNG
QISIENTIPASNENAGLLVTNPGSILLEAGFFINIISSLPDVTLEFTEIE
QHQIVLTSGKSEITLKGKDVDQYPRLQEMTTDTPLTLETKLLKSIINETA
FAASQQESRPILTGVHLVISQNKYFKAVATDSHRMSQRTFQLEKSANNFD
LVVPSKSLREFSAVFTDDIETVEVFFSDSQMLFRSENISFYTRLLEGNYP
DTDRLLTNQFETEIIFNTNALRHAMERAYLISNATQNGTVRLEIQNETVS
AHVNSPEVGKVNEELDTVSLKGDSLNISFNPTYLIESLKAVKSETVTIRF
ISPVRPFTLTPGEDTEDFIQLITPVRTN
>gid:114513  dnaX  DNA polymerase III, gamma and tau subunits
MYQALYRKYRSQTFDEMVGQSVISTTLKQAVSSKKISHAYLFSGPRGTGK
TSAAKIFAKAMNCPNQINGEPCNHCDICRDITNGSLEDVIEIDAASNNGV
DEIRDIRDKSTYAPSRATYKVYIIDEVHMLSTGAFNALLKTLEEPTENVV
FILATTELHKIPATILSRVQRFEFKAIKLLAIRDHLAQILDKEAISYDLD
ALTLVARRAEGGMRDALSILDQALSLAKDNHISLDVAEEITGSISLSAID
DYVSNILAHDTTEALAKLEVIFDSGKSMSRFATDLLMYLRDLLVVQAGGE
DSHSSDTFIANLNVKQDILFEMIDKVTSVLPEIKNGSHPKVYAEMMTIQL
SEMVEKNSSNIPADVTAELDSLRRELKSLKNEMSQLSRADQSSSTQKVKV
NNKTFTFKVDRTKILTIMEETVVDSQRSREYLEALKSAWNEILDNITAQD
RALLMGSEPVLANSENAILAFDAAFNAEQAMKRTDLNDIFGNIMSKAAGF
SPNILAVPRNDFNQIRSDFAKKMKAQKTETEPEVNHQIPEDFSYLAERIA
IVED
>gid:114687  dprA  DprA/SMF protein, putative DNA processing factor
MNHFELFKLKKAGLTNLNIHNIINYLKKNSLTSLSVRNMAVVSKCKNPTF
FIENYKQLDLKKLRQEFKKFPVLSILDSNYPLELKEIYNPPVLLFYQGNI
ELLSKPKLAVVGARQASQIGCQSVKKIIKETNNQFVIVSGLARGIDTAAH
VSALKNGGSSIAVIGSGLDVYYPTENKKLQEYMSYNHLVLSEYFTGEQPL
KFHFPERNRIIAGLCQGIVVAEAKMRSGSLITCERALEEGREVFAIPGNI
IDGKSDGCHHLIQEGAKCIISGKDILSEYQ
>gid:115231  exoA  exodeoxyribonuclease
MKLISWNIDSLNAALTSESTRALMSRQVIDTLVAEDADIIAIQETKLSAK
GPTKKHLEVLETYFPEYDLVWRSSVEPARKGYAGTMFLYRKGLNPIVSFP
EIDAPTTMDNEGRIITLELENCYITQVYTPNAGDGLKRLADRQIWDIKYA
EYLATLDSQKPVLATGDYNVAHKEIDLANPSSNRRSAGFTAEERQGFTNL
LAKGFTDTFRYLHGDVPNVYSWWAQRSRTSKINNTGWRIDYWLTSNRVAD
KITKSEMIHSGDRQDHTPIILEIEL
>gid:114642  gyrA  DNA gyrase, A subunit
MQDKNLVDVNLTSEMKTSFIDYAMSVIVARALPDVRDGLKPVHRRILYGM
NELGVTPDKPHKKSARITGDVMGKYHPHGDSSIYEAMVRMAQWWSYRHML
VDGHGNFGSMDGDGAAAQRYTEARMSKIALEMLRDINKNTVDFQDNYDGS
EREPLVLPARFPNLLVNGATGIAVGMATNIPPHNLGESIDAVKLVMDNPD
VTTRELMEVIPGPDFPTGALVMGRSGIHRAYETGKGSIVLRSRTEIETTS
NGKERIVVTEFPYGVNKTKVHEHIVRLAQEKRIEGITAVRDESSREGVRF
VIEVRRAASANVILNNLFKLTSLQTNFSFNMLAIEKGVPKILSLRQIIDN
YIEHQKEVIVRRTQFDKAKAGARAHILEGLLVALDHLDEVITIIRNSETD
TIAQAELMSRFELSERQSQAILDMRLRRLTGLERDKIQSEYNDLLALIAD
LADILAKPERVVTIIKEEMDEVKRKYADARRTELMIGEVLSLEDEDLIEE
EDVLITLSNKGYIKRLAQDEFRAQKRGGRGIQGTGVNNDDFVRELVSTST
HDTVLFFTNLGRVYRLKAYEIPEYGRTAKGLPIVNLLKLDEGETIQTIIN
ARKEDVANKYFFFTTQQGIVKRTSVSEFSNIRQNGLRAINLKENDELINV
LLIDENEDVIIGTRTGYSVRFKVNAVRNMGRTATGVRGVNLREGDKVVGA
SRIVNGQEVLIITEKGYGKRTEASEYPTKGRGGKGIKTANITAKNGPLAR
LVTINGNEDIMVITDTGVIIRTNVANISQTGRSTMGVKVMRLDQEAKIVT
VALVEQEIEDKSNIEDTKE
>gid:114317  gyrB  DNA gyrase, B subunit
MTEETKNMEQRAQEYDASQIQVLEGLEAVRMRPGMYIGSTSKEGLHHLVW
EIVDNSIDEALAGFAGHIKVYIEPDNSITVVDDGRGIPVDIQEKTGRPAV
ETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTQLDVKVYKNGKVH
YQEYQRGVVVNDLEIIGDTDLSGTTVHFTPDPEIFTETTVFDFDKLAKRI
QELAFLNRGLRISISDKREGQEVEKEYHYEGGIGSYVEFINENKEVIFEN
PIYTDGELDGISVEVAMQYTTGYQETVMSFANNIHTHEGGTHEQGFRTAL
TRVINDYAKKNKILKENEDNLTGEDVREGLTAVISVKHPNPQFEGQTKTK
LGNSEVVKITNRLFSEAFNRFLLENPQVAKKIVEKGILASKARIAAKRAR
EVTRKKSGLEISNLPGKLADCSSNNAEMNELFIVEGDSAGGSAKSGRNRE
FQAILPIRGKILNVEKATMDKILANEEIRSLFTAMGTGFGADFDVSKVRY
QKLVIMTDADVDGAHIRTLLLTLIYRFMRPVLEAGYVYIAQPPIYGVKVG
SEIKAYIQPGVNQEEELRQALDTYSSGRSKPTVQRYKGLGEMDDHQLWET
TMDPENRLMARVSVDDAAEADKIFDMLMGDRVEPRREFIEANAVYSNLDI
>gid:115765  hexA  DNA mismatch repair protein HexA
MAKPTISPGMQQYLDIKENYPDAFLLFRMGDFYELFYDDAVKAAQILEIS
LTSRNKNAEKPIPMAGVPYHSAQQYIDVLVELGYKVAIAEQMEDPKKAVG
VVKREVVQVVTPGTVVESTKPDSANNFLVAIDSQDQQTFGLAYMDVSTGE
FQATLLTDFESVRSEILNLKAREIVVGYQLTDEKNHLLTKQMNLLLSYED
ERLNDIHLIDEQLTDLEISAAEKLLQYVHRTQKRELSHLQKVVHYEIKDY
LQMSYATKNSLDLLENARTSKKHGSLYWLLDETKTAMGTRMLRTWIDRPL
VSMNRIKERQDIIQVFLDYFFERNDLTESLKGVYDIERLASRVSFGKANP
KDLLQLGQTLSQIPRIKMILQSFNQPELDIIVNKIDTMPELESLINTAIA
PEAQATITEGNIIKSGFDKQLDNYRTVMREGTGWIADIEAKERAASGIGT
LKIDYNKKDGYYFHVTNSNLSLVPEHFFRKATLKNSERYGTAELAKIEGE
MLEAREQSSNLEYDIFMRVRAQVESYIKRLQELAKTIATVDVLQSLAVVA
ENYHYVRPKFNDQHQIKIKNGRHATVEKVMGVQEYIPNSIYFDSQTDIQL
ITGPNMSGKSTYMRQLALTVIMAQMGGFVSADEVDLPVFDAIFTRIGAAD
DLISGQSTFMVEMMEANQAVKRASDKSLILFDELGRGTATYDGMALAQSI
IEYIHDRVRAKTMFATHYHELTDLSEQLTRLVNVHVATLERDGEVTFLHK
IESGPADKSYGIHVAKIAGLPIDLLDRATDILSQLEADAVQLIVSPSQEA
VTADLNEELDSEKQQGQLSLFEEPSNAGRVIEELEAIDIMNLTPMQAMNA
IFDLKKLL
>gid:115762  hexB  DNA mismatch repair protein HexB
MNLSKIIELPDILANQIAAGEVVERPSSVVKELVENAIDAGSSQITIEVE
ESGLKKIQITDNGEGMTSEDAVLSLRRHATSKIKSQSDLFRIRTLGFRGE
ALPSIASISLMTIKTATEQGKQGTLLVAKGGNIEKQEVVSSPRGTKILVE
NLFFNTPARLKYMKSLQSELAHIIDIVNRLSLAHPEVAFTLINDGKEMTK
TSGTGDLRQAIAGIYGLNTAKKMIEISNADLDFEISGYVSLPELTRANRN
YITLLINGRYIKNFLLNRSILDGYGSKLMVGRFPIAVIDIQIDPYLADVN
VHPTKQEVRISKERELMSLISTAISESLKQYDLIPDALENLAKTSTRSVD
KPIQTSFSLKQPGLYYDRAKNDFFIGADTVSEPIANFTNLDKSDGSVDND
VKNSVNQGATQSPNIKYASRDQADSENFIHSQDYLSSKQSLNKLVEKLDS
EESSTFPELEFFGQMHGTYLFAQGNGGLYIIDQHAAQERVKYEYYREKIG
EVDNSLQQLLVPFLFEFSSSDFLQLQEKMSLLQDVGIFLEPYGNNTFILR
EHPIWMKEEEVESGIYEMCDMLLLTNEVSVKKYRAELAIMMSCKRSIKAN
HTLDDYSARHLLDQLAQCKNPYNCPHGRPVLVNFTKADMEKMFKRIQENH
TSLRDLGKY
>gid:114202  hup  DNA-binding protein HU
MANKQDLIAKVAEATELTKKDSAAAVDAVFAAVADYLAEGEKVQLIGFGN
FEVRERAARKGRNPQTGAEIEIAASKVPAFKAGKALKDAVK
>gid:114535  ligA  DNA ligase, NAD-dependent
MENRMNELVSLLNQYAKEYYTQDNPTVSDSQYDQLYRELVELEKQHPENI
LPNSPTHRVGGLVLEGFEKYQHEYPLYSLQDAFSKEELIAFDKRVKAEFP
TAAYMAELKIDGLSVSLTYVNGVLQVGATRGDGNIGENITENLKRVHDIP
LHLDQSLDITVRGECYLPKESFEAINIEKRANGEQEFANPRNAAAGTLRQ
LNTGIVAKRKLATFLYQEASPTQKETQDDVLKELESYGFSVNHHRLISSS
MEKIWDFIQTIEKDRVSLPYDIDGIVIKVNSIAMQEELGFTVKAPRWAIA
YKFPAEEKEAEILSVDWTVGRTGVVTPTANLTPVQLAGTTVSRATLHNVD
YIAEKDIRIGDTVVVYKAGDIIPAVLNVVMSKRNQQEVMLIPKLCPSCGS
ELVHFEGEVALRCINPLCPNQIKERLAHFASRDAMNITGFGPSLVEKLFD
AHLIADVADIYRLSIENLLTLDGIKEKSATKIYHAIQSSKENSAEKLLFG
LGIRHVGSKASRLLLEEFGNLRQLSQASQESIASIDGLGGVIAKSLHTFF
EKEEVDKLLEELTSYNVNFNYLGKRVSTDAQLSGLTVVLTGKLEKMTRNE
AKEKLQNLGAKVTGSVSKKTDLIVAGSDAGSKLTKAQDLGITIQDEDWLL
NL
>gid:113626  mfd  transcription-repair coupling factor
MNIIELFSQNKVVRTWHSGLVTNSRQLVMGFSGASKAIAIASAYEKLSKK
IMVVTATQTDSDKLSSDISSLIGEDNVYQFFADDVPAAEFIFSSLDKSIS
RLSALRFLKDPEKNGVLITSISGLRLLLPNPEVFSKSQYKFEIGQECYLD
KLCKNLVNLGYQKVSQVFSPGEFSQRGDILDIFEMTQEYPYRLEFFGDEI
DGIRQFDIDTQKSLKQLESVQISPADDIILQDADFERAKKKLEGYLVTAS
EVQRTYLSEVLSTTENHFKHSDIRRFLSIFYEKEWGILDYIPEGTPLFVD
DFQKIVDRNAKLDLEIASLLTEDLQQGKSHSSLNYFSDPYKQLRQYQPAT
FFSNFHKGLGNLKFDKLHHFTQYGMQEFFNQFPLLVDEINRYKKSGATVL
LQVDSQKGLNLLQENLKEYGLDLIISDKNDIVQKESQLIVGHLSNGFYFA
DEKIVLITEREIYHRRVKRKIRRSNISNAERLKDYNELSVGDYVVHNVHG
VGKFLGIETIEIQGIHRDYLTIQYQNADRISIPVEQIELLTKYVSADGKE
PKINTLNDGRFKKAKQRVAKQVEDIADDLLKLYAERSQLQGFAFSPDDNM
QNDFDNDFAYVETEDQLRSIKEIKQDMEGNRPMDRLLVGDVGFGKTEVAM
RAAFKAVNDHKQVVVLVPTTVLAQQHFENFKERFSNYPVTVDVLSRFRSK
KEQTDTLKRLSKGQVDIIIGTHRLLSQDVVFSDLGLIVIDEEQRFGVKHK
EKLKELKTKVDVLTLTATPIPRTLHMSMLGIRDLSVIETPPTNRYPVQTY
VLETNPGLVREAIIREIDRGGQVFYVYNKVDTIDQKVSELQELVPEASIG
FVHGQMSEIQLENTLIDFINGDYDVLVATTIIETGVDISNVNTLFVENAD
HMGLSTLYQLRGRVGRSNRIAYAYLMYRPDKVLTEISEKRLDAIKGFTEL
GSGFKIAMRDLSIRGAGNILGASQSGFIDSVGFEMYSQLLEQAIATKQGK
SLIRQKGNAELALQIDAYLPAEYISDERQKIEIYKRIRELETRADYEALQ
DELIDRFGEYPDQVAYLLEIGLLKAYLDLAFTELVERKGNEISILFEKAS
LKYFLTQDYFEALSKTQLKARISETNGKMEVVFNIKHKKNYEIIEELLKF
AECFIEIKSRKPVEE
>gid:115158  mutM  formamidopyrimidine-DNA glycosylase
MPELPEVETVRKGLERLVVNQEIASITIKVPKMVKTDLNDFMISLPGKTI
QQVLRRGKYLLFDFGEMVMVSHLRMEGKYLLFPNKVPDNKHFHLYFKLTN
GSTLVYQDVRKFGTFELVRKSSLKDYFTQKKLGPEPTADTFQFEPFSKGL
ANSKKPIKPLLLDQRLVAGLGNIYVDEVLWAAKIHPQRLANQLTESETSL
LHKEIIRILTLGIEKGGSTIRTYKNALGEDGTMQKYLQVYGKTGQPCPRC
GCLIKKIKVGGRGTHYCPRCQCL
>gid:114878  mutX  mutator MutT protein
MTKLATICYIDNGKELLLLHRNKKENDVHEGKWISVGGKLEAGETPDECA
KREILEETHLTVKKMDFKGVITFPEFTPGHDWYTYVFKVTDYEGELISDD
ESREGTLEWVPYDQVLSKPTWQGDYEIFKWILEDVPFFSAKFVYDEHQNL
IEKTVNFYEK
>gid:115380  mutY  A/G-specific adenine glycosylase
MWPEDRIASFRRTLLGWYDQEKRDLPWRRTTNPYYIWVSEIMLQQTQVNT
VIPYYKRFLEWFPQIKDLADAPEEQLLKAWEGLGYYSRVRNMQKAAQQVM
VDFGGIFPHTYDDIASLKGIGPYTAGAIASISFNLPEPAVDGNVMRVMAR
LFEVNYDIGDPKNRKIFQAIMEILIDPDRPGDFNQALMDLGTDIESAKTP
RPDESPIRFFNAAYLNGTYSKYPIKNTKKKPKPMRIQAFVIRNQNGQYLL
EKNTKGRLLGGFWSFPIIETSPLSQQLDLFDDNQSNPIIWQTQNETFQRE
YQLKPQWTDNHFPNIKHTFSHQKWTIELIEGVVKATDLPNAPHLKWVAIE
DFSLYPFATPQKKMLETYLKQKNA
>gid:114165  nth  endonuclease III
MLSKAKSRYIIREIIKLFPDAKPSLDFTNVFELLVAVMLSAQTTDAAVNK
VTPALFERFPNPLVLAQADPKEIEPYISKIGLYRNKARFLNQCAKQLIEH
FDGKVPRTRQELESLAGVGRKTANVVMSVGFGIPAFAVDTHVTRICKHHQ
ICKQSASPLEIEKRVMEVLPPEEWLAAHQSMIYFGRAICHPKNPKCDQYP
QLYHFPDNLK
>gid:115233  ogt  methylated-DNA--protein-cysteine S-methyltransferase
MLYQEFYQSPLGEIRLLADNLGLSGLYFVGQKYDMLAVNQEEIVNMSNSY
TLLGKKWLDAYFSQQNLPSIPLSLRGTAFQTRVWQELQKIPFGDTKTYGE
LAKELNCQSAQAVGGAIGKNSISLIIPCHRVLGRYGQLTGYAGGLERKSW
LLEYEKEK
>gid:114835  parC  DNA topoisomerase IV, A subunit
MSNIQNMSLEDIMGERFGRYSKYIIQERALPDIRDGLKPVQRRILYSMNK
DGNTFEKGFRKSAKSVGNVMGNFHPHGDSSIYDAMVRMSQDWKNRETLIE
MHGNNGSMDGDPAAAMRYTEARLSEIAGYLLQDIDKNTVPFAWNFDDTEK
EPTVLPAAFPNLLVNGATGISAGYATDIPPHNLAEVIDAVVYMIDHPKAK
LDKLMEFLPGPDFPTGAIIQGKDEIRKAYETGKGRVAVRSRTAIETLKGG
KKQIIVTEIPYEVNKSVLVKRIDDVRVNNKVPGIAEVRDESDRDGLRIAI
ELKKEADETIVLNYLFKYTDLQVNYNFNMVAIDDYTPKQVGLSRILTSYI
AHRREIIIARSKFDKEKAEKRLHIVEGLIRVLSILDEVIALIRASENKAD
AKENLKVSYEFSEAQAEAIVTLQLYRLTNTDIVTLREEEEELRQQITMLK
AIISDERTMYNVMKRELREVKKKFANTRRSELQELAETIEIDTASLIIEE
DTYVSVTRGGYVKRTSPRSFNASTVDELGKREDDELIFVSNAKTTQHLLM
FTNLGNLAYRPVHELADIRWKDVGEHLSQNLVNFASNEEIIYAELVDDFT
KETYFAVTSLGQIKRFERQEISPWRTYKSKTAKYAKLKSVEDYVVTVAPI
QLEDVILVTYNGYALRFSINDVPVVGSKAAGVKAMNLKDRDHIVSAFIAN
TTSLYLLTHRGSLKRMAIDVIPTTSRANRGLQVLRELKSKPHRVFKAGPV
YLEDSSFEFDLFSSVSNHEGDTFVLEIMSKTGKVYDVDLSQWSFSERTSN
GSFVSDKISDEEVFSVKIK
>gid:114836  parE  DNA topoisomerase IV, B subunit
MEVTLAKQDITVTNYGDDAIQVLEGLDAVRKRPGMYIGSTDGTGLHHLVW
EIVDNAVDEALSGFGNRIDVIINKDGSITVTDHGRGMPTGMHAMGKPTVE
VIFTVLHAGGKFGQGGYKTSGGLHGVGSSVVNALSSWLEVEIIRDGAIYR
QRFENGGKPVTTLKKIGTAPKSKSGTSVSFMPDQSVFSTIDFKFNTIAER
LKESAFLLKNVTLTLTDNRSEEAEHLEFHYENGVQDFVEYLNEDKETLTP
IMFFEGEEQEFHIEVALQYNDGFSDNILSFVNNVRTKDGGTHETGLKSAI
TKSMNDYARKTGLLKEKDKNLEGSDYREGLSAILSILVPEEHLQFEGQTK
DKLGSPLARPIVDGIVSEKLTYFLMENGDLASNLIRKAIKARDAREAARK
ARDESRNGKKSKKDKGLLSGKLTPAQSKNAKKNELYLVEGDSAGGSAKQG
RDRKFQAILPLRGKVLNTAKAKMADIIKNEEINTMIHTIGAGVGPDFNLD
DINYDKIIIMTDADTDGAHIQTLLLTFFYRYMRPLVEEGHVYIALPPLYK
MSKGKGKKEIVEYAWTDIELEELRQKFGKGSLLQRYKGLGEMNADQLWET
TMNPETRTLIRVTIEDLARAERRVNVLMGDKVPPRRQWIEDNVKFTLEEN
TVF
>gid:114821  pcrA  ATP-dependent DNA helicase PcrA
MNPLIIGMNDKQAEAVQTTDGPLLIMAGAGSGKTRVLTHRIAYLIDEKYV
NPWNILAITFTNKAAREMRERAIALNPATQDTLIATFHSMCVRILRREAD
YIGYNRNFTIVDPGEQRTLMKRIIKQLNLDTKKWNERSILGTISNAKNDL
LDEIAYEKQAGDMYTQVIAKCYKAYQEELRRSEAMDFDDLIMMTLRLFDQ
NKDVLAYYQQRYQYIHVDEYQDTNHAQYQLVKLLASRFKNICVVGDADQS
IYGWRGADMQNILDFEKDYPQAKVVLLEENYRSTKKILQAANNVINHNKN
RRPKKLWTQNDEGEQIVYHRANNEQEEAVFVASTIDNIVREQGKNFKDFA
VLYRTNAQSRTIEEALLKSNIPYTMVGGTKFYSRKEIRDVIAYLNILANT
SDNISFERIVNEPKRGVGPGTLEKIRSFAYEQSMSLLDASSNVMMSPLKG
KAAQAVWDLANLILTLRSNLDSLTVTEITENLLDKTGYLEALQVQNTLES
QARIENIEEFLSVTKNFDDNPEITVEGETGLDRLSRFLNDLALIADTDDS
ATETAEVTLMTLHAAKGLEFPVVFLIGMEEGVFPLSRAIEDADELEEERR
LAYVGITRAEQILFLTNANTRTLFGKTSYNRPTRFIREIDDELIQYQGLA
RPVNSSFGVKYSKEQPTQFGQGMSLQQALQARKSNSQSQVTAQLQALNAN
NSHETSWEIGDVATHKKWGDGTVLEVSGSGKTQELKINFPGIGLKKLLAS
VAPISKKEN
>gid:114082  polA  DNA polymerase I
MTNKNKLLLIDGSSVAFRAFFALYNQIDRFKNNSGLHTNAIYGFHLMLNH
ILGRVQPSHILVAFDAGKTTFRTEMYADYKGGRAKTPDEFREQFPYIRQQ
LDVLGIKHYELEHYEADDIIGTLAKQAEASNEHFDITVVSGDKDLIQLTD
TNTVVEISKKGVAEFEEFTPAYLMEKMGITPSQFIDLKALMGDKSDNIPG
VTKIGEKTGLKLLSEYGSLEGIYENIEAMKQSKMKENLINDKEQAFLSKT
LATINIASPITIGLEDILYSGPQDIKALSQFYDEMDFKQFKAALGEETSQ
EDFEVDFTEVEQLKTEMFSDNDFYYFEMLGDNYHVEDLIGIAWGNSDTIY
ATSNVSLLQEALFKKALSKPIKTYDFKRSKVLLNRFNIDLPEPAFDTRLA
KYLLSTTEDNLVSTIARLYTNLPLDTDDAVYGKGAKRAIPEKTRFLEHLA
KKVKVLVDSEANIMQQLKANEQEELLFEMEQPLANVLAKMEIRGIKVKKN
TLNEMAIENQKVIETLTQEIYELAGQEFNINSPKQLGKLLFETLGLPVEM
TKKTKTGYSTAVDVLERLAPISPLVTKILEYRQITKLQSTYIIGLQDYIL
EDGKIHTRYVQDLTQTGRLSSSDPNLQNIPVRLEQGRLIRKAFVPSEDNA
VLLSSDYSQIELRVLAHISKDEHLIAAFKEGADIHTSTAMRVFGIEKPEN
VTPNDRRNAKAVNFGIVYGISDFGLSHNLGIPRKLAKQYIDTYFERYPGI
KNYMETVVREAKDKGYVETLFHRRRSLPDINSRNFNIRQFAERTAINSPI
QGSAADILKIAMINLDRVLDKGGYKSKMLLQVHDEIVLEVPNEEIGAIRE
LVTKTMESAISLSVPLIADENAGETWYEAK
>gid:114002  priA  primosomal protein N'
MARKLAQVIVDIPLMQTDKPFSYAIPKDLEDLVQVGVRVHVPFGRGNRLL
QGFVVGFRDDDELETKDIAEVLDFEPVLNQEQLDLADQMRHTVFSYKISI
LKSMLPSLLNSQYDKLLLATDTLPSEDREDLFGHKTEIVFSSLSSQDAKK
AGRLIQKGFIEVQYLAKDKKTIKTEKIYKINRTLLEKSQIAARAKKRLEL
KEFLLENPQPGRLTALNKQFSSPVVNFFREEGIIEVIEKEASRSDNYFKG
ILKTDFLDLNQEQAKVVKIVVDQIGKEQNKPFLLEGITGSGKTEVYLHII
DNVLKLGKTAIVLVPEISLTPQMTNRFISRFGKQVAIMHSGLSEGEKFDE
WRKIKSGQAKVVVGARSAIFAPLENIGAIIIDEEHESTYKQESNPRYHAR
DVALLRAEYYKAVLLMGSATPSIESRARASRDVYKFLELKHRANPKARIP
QVEIIDFRNFIGQQEVSNFTSYLLDKIRDRLDKKEQVVLMLNRRGYSSFI
MCRDCGYVDQCPNCDISLTLHMATKTMNCHYCGFEKPIPRTCPNCNSKSI
SYYGTGTQKAYEELLKVIPDAKILRMDVDTTRQKGGHESILKRFGNHEAD
ILLGTQMIAKGLDFPNVTLVGVLNADTSLNLPDFRSSERTFQLLTQVAGR
AGRAEKEGEVVIQTYNPNHYAIQLAQKQDFEAFYQYEMNIRRQLGYPPYY
FTVGLTLSHKDEEWLIRKSYEVLSLLKQGFSDKVKLLGPTPKPIARTHNL
YHYQIIIKYRFEDNLELVLNRLLDMTQDKENRDLRLAIDHEPQNMM
>gid:114780  radC  DNA repair protein RadC
MYHIELKKEALLPRERLVDLGADRLSNQELLAILLRTGIKEKPVLEISTQ
ILENISSLADFGQLSLQELQSIKGIGQVKSVEIKAMLELAKRIHKAEYDR
KEQILSSEQLARKMMLELGDKKQEHLVAIYMDTQNRIIEQRTIFIGTVRR
SVAEPREILHYACKNMATSLIIIHNHPSGSPKPSESDLSFTKKIKRSCDH
LGIVCLDHIIVGKNKYYSFREEADIL
>gid:115758  recA  recA protein
MAKKTKKAEEITKKFGDERRKALDDALKNIEKDFGKGAVMRLGERAEQKV
QVMSSGSLALDIALGAGGYPKGRIVEIYGPESSGKTTVALHAVAQAQKEG
GIAAFIDAEHALDPAYAAALGVNIDELLLSQPDSGEQGLEIAGKLIDSGA
VDLVVVDSVAALVPRAEIDGDIGDSHVGLQARMMSQAMRKLSASINKTKT
IAIFINQLREKVGVMFGNPETTPGGRALKFYSSVRLDVRGNTQIKGTGEH
KDHNVGKETKIKVVKNKVAPPFREAFVEIMYGEGISRTGELIKIASDLDI
IQKAGAWYSYNGEKIGQGSENAKKYLADNPAIFDEIDHKVRVHFGMTEDD
SPVQSELVEEKNEADDLVLDLDNAIEIEE
>gid:115820  recF  recF protein
MWIKNISLKHYRNYEEAQVDFSPNLNIFIGRNAQGKTNFLEAIYFLALTR
SHRTRSDKELVHFKHHDVQITGEVIRKSGHLNLDIQLSEKGRITKVNHLK
QAKLSDYIGAMTVVLFAPEDLQLVKGAPSLRRKFLDIDIGQIKPTYLAEL
SNYNHVLKQRNTYLKTTNNVDKTFLTVLDEQLADYGSRVIEHRFDFIQAL
NDEADKHHYIISTELEHLSIHYKSSIEFTDKSSIREHFLNQLSKSHSRDI
FKKNTSIGPHRDDITFFINDINATFASQGQQRSLILSLKLAEIELIKTVT
NDYPILLLDDVMSELDNHRQLKLLEGIKENVQTFITTTSLEHLSALPDQL
KIFNVSDGTISINEKKATD
>gid:115348  recG  ATP-dependent DNA helicase RecG
MLLQSPISNLKGFGPKSAEKFQKLDIYTVEDLLLYYPFRYEDFKSKSVFD
LVDGEKAVITGLVVTPANVQYYGFKRNRLSFKLRQGEAVLNVSFFNQPYL
ADKIELGQEVAVFGKWDATKSAITGMKVLAQVEDDMQPVYRVAQGISQST
LIKAIKSAFEISAHLELKENLPATLLEKYRLMGRSQACLAMHFPKDITEY
KQALRRIKFEELFYFQMNLQVLKSENKSETNGLPILYSKHAMETKISSLP
FILTNAQKRSLDEILSDMSSGAHMNRLLQGDVGSGKTVIAGLSMYAAYTA
GFQSALMVPTEILAEQHYISLQELFPDLSIAILTSGMKAAVKRTVLAAIA
NGSVDMIVGTHALIQDSVQYHKLGLVITDEQHRFGVKQRRIFREKGENPD
VLMMTATPIPRTLAITAFGEMDVSIIDELPAGRKPIITRWVKHEQLGTVL
EWVKGELQKDAQVYVISPLIEESEALDLKNAVALHAELSTYFEGIAKVAL
VHGRMKNDEKDAIMQDFKDKKSHILVSTTVIEVGVNVPNATIMIIMDADR
FGLSQLHQLRGRVGRGYKQSYAVLVANPKTDSGKKRMTIMTETTDGFVLA
ESDLKMRGSGEIFGTRQSGIPEFQVADIVEDYPILEEARRVASDIVKDNN
WKENTEWALILDNLRQHSDFD
>gid:114890  recJ  single-stranded-DNA-specific exonuclease RecJ
MISAKYSWVLNNQKPDAGFFEASKKEKISEAVASLIYSRGIKTSAELHHF
LQTNLENLHDPYLLNDMDKAVNRIRRAIENNETILVYGDYDADGMTSASI
MKEALDMMGAEVQVYLPNRFTDGYGPNQSVYKYFIEQQDVSLIITVDNGV
AGHEAITYAQNQGVDVVVTDHHSMPADLPCAYAIIHPEHPDANYPFPYLA
GCGVAFKVACALLETIPTEMLDLVAIGTIADMVSLTDENRIMVKAGLEVM
KDSERIGLQELISLSNIDLKTLNEETIGFKIAPQLNALGRLDDPNPAIEL
LTGFDDEESQAIAQMIDQKNEERKEIVQTIFDQAMQMLDQTKPVQVLAKE
NWHPGVLGIVAGRILERTGQPVIVLNIEDGIAKGSARSVEALDIFQAFDQ
HRELFIAFGGHSGAAGMTLEESKVGDLSQVLCDYISKKQLDMSQKKTLTI
DSELRFDELSLDTVRDFEKLAPFGMDNKKPVFLLKDFKVSQARVMGQNGA
HLKLKLEQDGQALDLVAFNMGSQLQEFQQAQHLELAVTLSVNQWNGATTL
QLMLEDARVDGIQLFDIRSKASSLPHGVPILSQEEQSKEVILLTVPDHPQ
ELKQMTQGKQFDAIYFKNEIPKNYFISGYGTRDQFASLYKTIYQFPEFDV
RYKLKELSSYLHIPDILLIKMIQIFEELHFVTITEGIMTVNKEAEKRDIS
ESQIYQELKETVKFQELMALGTPKEIYDFMMS
>gid:114198  recN  DNA repair protein RecN
MLLEISIKNFAIIEEISLNFETGMTVLTGETGAGKSIIIDAMNMMLGSRA
SVEVIRHGANKAEIEGFFSVEKNQSLVQLLEENGIELADELIIRREIFQN
GRSVSRINGQMVNLSTLKAVGHYLVDIYGQHDQEELMKPNMHILMLDEFG
NTEFNVIKERYQSLFDAYRQLRKRVLDKQKNEQENKSRIEMLEFQIAEIE
SVALKSDEDQTLLKQRDKLMNHKNIADTLTNAYLMLDNEEFSSLSNVRSA
MNDLMALEEFDREYKDLSTNLSEAYYVIEEVTKRLGDVIDDLDFDAGLLQ
EIENRLDVINTITRKYGGDVNDVLDYFDNITKEYSLLTGSEESSDALEKE
LKILEHDLIESANQLSLERHKLAKQLENEIKQELTELYMEKADFQVQFTK
GKFNKEGNEIVEFYISTNPGEGFKPLVKVASGGELSRLMLAIKSAFSRKE
DKTSIVFDEVDTGVSGRVAQAIAQKIHKIGSHGQVLAISHLAQVIAIADY
QYFIEKISSDSSTVSTVRLLSYEERVEEIAKMLAGNNVTDTARTQAKELL
GS
>gid:114451  recR  recombination protein RecR
MLYPTPIAKLIDSFSKLPGIGTKTATRLAFYTIGMSDEDVNEFAKNLLAA
KRELTYCSVCGNLTDDDPCLICTDKTRDQSVILVVEDSKDVSAMEKIQEY
NGLYHVLHGLISPMNGISPDDINLKSLITRLMDGQVTEVIVATNATADGE
ATSMYISRVLKPAGIKVTRLARGLAVGSDIEYADEVTLLRAIENRTEL
>gid:114558  rexA  exonuclease RexA
MTFKPFLNPEDIAVIQTEEKNSDKKQKRTPEQIEAIYTFGNNVLVSASAG
SGKTFVMVERILDKLLRGVPIDSLFISTFTVKAAGELKERLEKKINESLK
SAESDDLKQFLTQQLVGIQTADIGTMDAFTQKIVNQYGYTLGISPIFRIL
QDKNEQDVIKNEVYADLFSDYMTGKNAASFIKLVKNFSGNRKDSKAFREM
VYKVYAFSQSTDNPKRWMQTVFLKGAQTYTDFEAIPDQEVSSLLNVMQTT
ANQLRDLTDQEDYKQLTAKGVPTANYKKHLKIIENLVHWSQDFNLLYGKK
GLTNLARDITNVIPSGNDVTVAGVKYPIFKQLHNRIVGLKHLEVIFKYQG
ESLFLLELLQSFVLDFSEQYLQEKIQENAFEFSDIAHFAIQILEENHDIR
QLYQDKYHEVMVDEYQDNNHTQERMLELLSNGHNRFMVGDIKQSIYRFRQ
ADPQIFNDKYKAYQDNPSQGKLIILKENFRSQSEVLDSTNSVFTHLMDEE
VGDILYDESHQLKAGSPRQQERHPNNKTQVLLLDTDEDDIDDSDSQQYDI
SPAEAKLVAKEIIRLHKEENVPFQDITLLVSSRTRNDGILQTFDRYGIPL
VTDGGEQNYLKSVEVMVMLDTLRSIDNPLNDYALVALLRSPMFGFNEDDL
TRIAIQDVKMAFYHKVKLSYHKEGHHSDLITPELSSKIDHFMKTFQTWRD
FAKWHSLYDLIWKIYNDRFYYDYVGALPKAEQRQANLYALALRANQFEKT
GFKGLSRFIRMIDKVLENENDLADVEVALPQNAVNLMTIHKSKGLEFKYV
FILNIDKKFSMVDITSPLILSRNQGIGIKYVADMRHELEEEILPAVKVSM
ETLPYQLNKRELRLATLSEQMRLLYVAMTRAEKKLYLVGKASQTKWADHY
DLVSENNHLPLASRETFVTFQDWLLAVHETYKKQELFYDINFVSLEELTD
HHIGMVNPSLPFNPDNKVENRQSEDIVRAISVLESVEQINQTYKAAIELP
TVRTPSQVKKIYEPILDIEGVDVMETITKTSVDFKLPDFSTSKKQDPAAL
GSAVHELMQRIEMSSHVKMEDIQKALTEVNAETSVKAAIQIEKINYFFQE
TSLGKYIQEEVEHLHREAPFAMLKEDPESGEKFVVRGIIDGYLLLENRII
LFDYKTDKFVNPLELKERYQGQMALYAEALKKSYEIEKIDKYLILLGGKQ
LEVVKMD
>gid:114557  rexB  exonuclease RexB
MKLLYTDINHDMTEILVNQAAHAAEAGWRIFYIAPNSLSFEKERAVLENL
PQEASFAITITRFAQLARYFTLNQPNQKESLNDIGLAMIFYRALASFEDG
QLKVFGRLKQDASFISQLVDLYKELQTANLSILELKYLHSPEKFEDLLAI
FLVVSDLLREGEYDNQSKIAFFTEQVRSGQLDVDLKNTILIVDGFTRFSA
EEEALIKSLSSRCQEIIIGAYASQKAYKANFTNGNIYSAGVDFLRYLATT
FQTKPEFILSKWESKSGFEMISKNIEGKHDFTNSSHILDDTAKDCITIWE
CINQKDEVEHVARAIRQKLYQGYRYKDILVLLGDVDSYKLQLSKIFEQYD
IPYYFGKAETMAAHPLVHFMDSLSRIKRYRFRAEDVLNLFKTGIYGEISQ
DDLDYFEAYISYADIKGPKKFFTDFVVGAKKFDLGRLNTIRQSLLTPLES
FVKTKKQDGIKTLNQFMFFLTQVGLSDNLSRLVGQMSENEQEKHQEVWKT
FTDILEQFQTIFGQEKLNLDEFLSLLNSGMMQAEYRMVPATVDVVTVKSY
DLVEPHSNQFVYALGMTQSHFPKIAQNKSLISDIERQLINDANDTDGHFD
IMTQENLKKNHFAALSLFNAAKQELVLTIPQLLNESEDQMSPYLVELRDI
GVPFNHKGRQSLKEEADNIGNYKALLSRVVDLYRSAIDKEMTKEEQTFWS
VAVRYLRRQLTSKGIEIPIITDSLDTVTVSSDVMTRRFPEDDPLKLSSSA
LTTFYNNQYKYFLQYVLGLEEQDSIHPDMRHHGTYLHRVFEILMKNQGIE
SFEEKLNSAINKTNQEDVFKSLYSEDAESRYSLEILEDIARATATILRQD
SQMTVESEEERFELMIDNTIKINGIIDRIDRLSDGSLGVVDYKSSAQKFD
IQKFYNGLSPQLVTYIDAISRDKEVEQKPPIFGAMYLHMQEPRQDLSKIK
NLDDLVTKNHQALTYKGLFSEAEKEFLANGKYHLKDSLYSETEIAILQAH
NQSLYKKASETIKSGKFLINPYTEDAKTVDGDQFKSITGFEADRHMARAR
ALYKLPAKEKRQGFLTLMQQEEENDDL
>gid:114693  rnhB  ribonuclease HII
MATIKEIKAILETIVDLKDKRWQEYQTDSRAGVQKAILQRKKNIQSDLDE
EARLEQMLVYEKKLYIEHINLIAGIDEVGRGPLAGPVVAAAVILPPNCKI
KHLNDSKKIPKKKHQEIYQNILDQALAVGIGIQDSQCIDDINIYEATKHA
MIDAVSHLSVAPEHLLIDAMVLDLSIPQTKIIKGDANSLSIAAASIVAKV
TRDKIMSDYDSTYPGYAFSKNAGYGTKEHLEGLQKYGITPIHRKSFEPIK
SML
>gid:115387  rnhC  ribonuclease HIII
MNTIVMQADKKLQEKIRTDLAQHHISNNNPYVVFSAKISGATVLLYTSGK
LVFQGSNASHIAQKYGFIEQKESCSSESQDIPIIGTDEVGNGSYFGGLAV
VASFVTPKDHAYLKKLGVGDSKTLTDQKIKQIAPLLEKAIPHKALLLSPQ
KYNQVVSPNNKHNAVSVKVALHNQAIFLLLQDGFEPEKIVIDAFTSSKNY
QNYLKNEKNQFKQTITLEEKAENKYLAVAVSSIIARNLFLENLNKLSDDV
GYKLPSGAGHQSDKVASQLLKAYGISSLEHCAKLHFANTKKAQALLK
>gid:115760  ruvA  Holliday junction DNA helicase RuvA
MYDYIKGKLSKITAKFIVVETAGLGYMIYVANPYSFSGYVNQEVTIYLHQ
VIRDDAHLLFGFHTENEKEIFLNLISVSGIGPTTALAIIAVDDNEGLVSA
IDNSDIKYLTKFPKIGKKTAQQMILDLSGKFVEASGESATSRKVSSEQNS
NLEEAMEALLALGYKATELKKVKAFFEGTNETVEQYIKSSLKMLMK
>gid:113699  ruvB  Holliday junction DNA helicase RuvB
MTRFLDSDAMGDEELVERTLRPQYLREYIGQDKVKDQLKIFIEAAKLRDE
SLDHVLLFGPPGLGKTTMAFVIANELGVNLKQTSGPAIEKSGDLVAILND
LEPGDVLFIDEIHRMPMAVEEVLYSAMEDFYIDIMIGAGETSRSVHLDLP
PFTLIGATTRAGMLSNPLRARFGITGHMEYYEENDLTEIIERTADIFEMK
ITYEAASELARRSRGTPRIANRLLKRVRDYAQIMGDGLIDDNITDKALTM
LDVDHEGLDYVDQKILRTMIEMYNGGPVGLGTLSVNIAEERDTVEDMYEP
YLIQKGFIMRTRTGRVATVKAYEHLGYQRFDK
>gid:113853  ssb-1  single-strand binding protein
MYNKVIMIGRLTAKPEMVKTPTDKSVTRATVAVNRRFKGSNGEREADFIN
VVMWGRLAETLASYGTKGSLISIDGELRTRKYEKDGQTHYITEVLASSFQ
LLESRAQRAMRENNVSGDLSDLVLEEEELPF
>gid:114264  ssb-2  prophage LambdaSa1, single-strand binding protein
MINNIVLVGRMTKDAELRYTPSNQAVATFSLAVNRNFKNQSGEREADFIN
CVIWRQQAENLANWAKKGALVGITGRIQTRNYENQQGQRIYVTEVVAENF
QLLESRNSQQQTNQSGNSSNSYFGNANKMDISDDDLPF
>gid:115378  ssb-3  single-strand binding protein
MINNVVLVGRMTRDAELRYTPSNQAVATFSLAVNRNFKNQSGEREADFIN
CVIWRQQAENLANWAKKGALVGITGRIQTRNYENQQGQRVYVTEVVAESF
QLLESRATREGGSPNSYNNGGYNNAPSNNSYSASSQQTPNFSRDESPFGN
SNPMDISDDDLPF
>gid:115531  ssb-4  prophage LambdaSa2, single-strand binding protein
MINNVVLIGRLTRDVELRYTPSNIANATFNLAVNRNFKNAAGDREADFIN
CVMWRQQAENLANWTKKGMLIGITGRIQTRSYENQQGQRIYVTEVVADSF
QILEKRDNSTNQASMDDQLPPSFGNSQPMDISDDDLPF
>gid:115759  tag  DNA-3-methyladenine glycosylase I
MKRCSWVNLDNPLYVAYHDKEWGRAVHDDHVLFELLCLETYQSGLSWETV
LNKRQEFRQVFHHYNIEKVAAMSDADLEIILQNPRVIRHRLKLFSTRQNA
RSIILIQKEFGSFDRYIWSFVDNKVQVNSVNNYNDVPASTTLSERLSKDL
KKRGFKFVGPTCLYSFIQAAGMVNDHENICDFK
>gid:114686  topA  DNA topoisomerase I
MATTTKTSTKKTSKKKSATAKKNLVIVESPAKAKTIEKYLGRNYKVVASV
GHIRDLKKSSMSIDFENNYEPQYINIRGKGPLINDLKKEAKKAKKVYLAS
DPDREGEAISWHLAHILDLDKEDRNRVVFNEITKDAVKNAFVEPRQINMD
LVDAQQARRVLDRIVGYSISPILWKKVKKGLSAGRVQSVALKLIIDRENE
IKAFQPEEYWTIDGSFKKGTRKFNATFYGLDGKKFKLSNNEDVKTVLKRI
KTDEFLVEKVEKKERRRNAPLPYTTSSLQQDAANKINFRTRKTMMIAQQL
YEGLSLGTAGHQGLITYMRTDSTRISPLAQNEATEFITNRFGANYSKHGN
KVKNASGAQDAHEAIRPSSVNHTPESIAKYLDKDQLKLYTLIWNRFIASQ
MTAAVFDTMKVNLTQNGVTFIANGSQVKFDGYMAVYNDTDKNKMLPDMEE
GESVKKVNTNPEQHFTQPPARFSEASLIKTLEENGVGRPSTYAPTLETIQ
KRYYVKLAAKRFEPTELGEIVNSLIVEFFPDIVDVTFTAEMEGKLDEVEI
GKEQWQKIIDEFYKPFEKELAKAETEMEKIQIKDEPAGFDCELCGSPMVI
KLGRYGKFYACSNFPECHNTKAITKEIGVICPICQKGQVIERKTKRNRIF
YGCDRYPECEFTSWDKPIGRTCPKSNDFLVEKKVRGGGKQVVCSNEKCDY
QEEKIK
>gid:114838  ung  uracil-DNA glycosylase
MKHSSWHDLIKRELPNHYYNKINTFMDAVYESGIVYPPRDKVFNAIQITP
LENVKVVIIGQDPYHGPQQAQGLSFSVPDNLPAPPSLQNILKELAEDIGS
RSHHDLTSWAQQGVLLLNACLTVPEHQANGHAGLIWEPFTDAVIKVVNQK
ETPVVFILWGGYARKKKSLIDNPIHHIIESPHPSPLSAYRGFFGSRPFSR
TNHFLEEEGINEIDWLN
>gid:115374  uvrA  excinuclease ABC, A subunit
MQDKLMIRGARAHNLKNISVDIPRDKLVVVTGLSGSGKSSLAFDTIYAEG
QRRYVESLSAYARQFLGNMEKPDVDSIDGLSPAISIDQKTTSKNPRSTVG
TVTEINDYLRLLYARVGTPYCINGHGAITASSVEQIVDKVLALPERTKMQ
ILAPIIRRKKGQHKSTFEKIQKDGYVRVRIDGDIHDVTEVPELSKSKMHN
IDIVVDRLINKEGIRSRLFDSVEAALRLSDGYVVIDTMDGNELLFSEHYS
CPECGFTVPELEPRLFSFNAPFGSCPTCDGLGIKLEVDIDLVIPDRSKTL
REGALVPWNPISSNYYPTMLEQAMTQFGVDMDTPFEKLSKAEQDLALYGS
GEREFHFHYINDFGGERNIDLPFEGVVNNINRRYHETNSDYTRNVMREYM
NELKCNTCHGYRLNDQALCVRVGGEEGLNIGQVSDLSIADHLELLETLRL
SSNEQLIARPIIKEIHDRLSFLNNVGLNYLNLSRSAGTLSGGESQRIRLA
TQIGSNLSGVLYVLDEPSIGLHQRDNDRLIDSLKKMRDLGNTLIVVEHDE
DTMMAADWLIDVGPGAGAFGGEIVASGTPKQVAKNTKSITGQYLSGKKVI
PVPSERRVGNGRFLEIKGAAENNLQNLDVKFPLGKFIAVTGVSGSGKSTL
INSILKKAVAQKLNRNSDKPGKYVSLEGIEYVDRLIDIDQSPIGRTPRSN
PATYTGVFDDIRDLFAQTNEAKIRGYKKGRFSFNVKGGRCESCSGDGIIK
IEMHFLPDVYVPCEVCHGTRYNSETLEVHYKEKNIAQILDMTVNDAVTFF
AAIPKIARKLQTIKDVGLGYVTLGQPATTLSGGEAQRMKLASELHKRSTG
KSLYILDEPTTGLHADDIARLLKVLDRFVDDGNTVLVIEHNLDVIKTADH
IIDLGPEGGIGGGQIVAIGTPEEVAENPKSYTGYYLKEKLAR
>gid:115133  uvrB  excinuclease ABC, B subunit
MIDRKDTNRFKLVSKYSPSGDQPQAIETLVDNIEGGEKAQILKGATGTGK
TYTMSQVIAQVNKPTLVIAHNKTLAGQLYGEFKEFFPDNAVEYFVSYYDY
YQPEAYVPSSDTYIEKDSSVNDEIDKLRHSATSSLLERNDVIVVASVSCI
YGLGSPKEYADSVVSLRPGQEISRDQLLNNLVDIQFERNDIDFQRGKFRV
RGDVVEVFPASRDEHAFRIEFFGDEIDRIREIESLTGRVLGEVEHLAIFP
ATHFMTNDEHMEEAISKIQAEMENQVELFEKEGKLIEAQRIRQRTEYDIE
MLREMGYTNGVENYSRHMDGRSEGEPPFTLLDFFPEDFLIMIDESHMTMG
QIKGMYNGDRSRKEMLVNYGFRLPSALDNRPLRREEFESHVHQIVYVSAT
PGDYEMEQTDTVVEQIIRPTGLLDPEVEVRPSMGQMDDLLGEINLRTEKG
ERTFITTLTKRMAEDLTDYLKEMGVKVKYMHSDIKTLERTEIIRDLRLGV
FDVLIGINLLREGIDVPEVSLVAILDADKEGFLRNERGLIQTIGRAARNS
NGHVIMYADKITDSMQRAMDETARRRRLQMDYNEKHGIVPQTIKKEIRDL
IAITKSNDSDKPEKVVDYSSLSKKERQAEIKALQQQMQEAAELLDFELAA
QIRDVILELKAID
>gid:114902  uvrC  excinuclease ABC, C subunit
MNELIKHKLELLPDSPGCYLHKDKNGTIIYVGKAKNLKNRVKSYFHGSHN
TKTELLVSEIEDFEYIVTTSNTEALLLEINLIQENMPKYNIRLKDDKSYP
YIKITNERYPRLMITRQVKKSDGTYFGPYPDSGAATEIKRLLDRLFPFKK
CTNPANKVCFYYHLGQCNAHTVCQTNKAYWDSLREDVKQFLNGKDNKIVN
GLTEKMKSAAMTMEFERAAEYRDLIEAISLLRTKQRVIHQDMKDRDVFGY
FVDKGWMCVQVFFVRNGKLIQRDVNMFPYYNEPEEDFLTYIGQFYQDTKH
FLPKEVFIPQDIDAKSVETIVGCKIVKPQRGEKKQLVNLAIKNARVSLQQ
KFDLLEKDIRKTHGAIENLGNLLNIPKPVRIEAFDNSNIQGTSPVAAMVV
FVNGKPSKKDYRKFKIKTVIGPDDYASMREVIHRRYSRVLKDGLTPPDLI
VIDGGQGQVNIARDVIENQFGLAIPIAGLQKNDKHQTHELLFGDPLEVVE
LPRNSEEFFLLHRIQDEVHRFAITFHRQLRSKNSFSSKLDGITGLGPKRK
QLLMKHFKSLPNIQKAEIEDIIMCGIPRTVAESLRDSLNDPPK
>gid:114193  xseA  exodeoxyribonuclease VII, large subunit
MSDYLSVSTLTKYLKLKFDKDPYLERVYLTGQVSNFRRRPNHQYFSLKDD
KSVIQATMWSGHFKKLGFELEEGMKVNVVGRVQLYEPSGSYSIIVEKAEP
DGIGALAIQFEQLKKKLSQAGYFDDRHKQLIPQFVRKIGVVTSPSGAVIR
DIITTVSRRFPGVEILLFPTKVQGEGAAQEIAQTIALANEKKDLDLLIVG
RGGGSIEDLWAFNEECVVEAIFESRLPVISSVGHETDTTLADFVADRRAA
TPTAAAELATPVTKIDILSWITERENRMYQSSLRLIRTKEERLQKSKQSV
IFRQPERLYDGFLQKLDNLNQQLTYSMRDKLQTVRQKQGLLHQKLQGIDL
KQRIHIYQERVVQSRRLLSSTMTSQYDSKLARFEKAQDALISLDSSRIVA
RGYAIIEKNHTLVSTTNGINEGDHLQVKMQDGLLEVEVKDVRQENI
>gid:114194  xseB  exodeoxyribonuclease VII, small subunit
MSDKKTFEENLQELETIVSRLETGDVALEDAIAEFQKGMLISKELQRTLK
EAEETLVKVMQADGTEVEMDT