TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Escherichia coli CFT073, CFT073
Gene type: CDS

Number of genes found: 315

Free access
Sort by:

 



# Escherichia coli CFT073, CFT073

>c5197 Transposase insD for insertion element IS2A/D/F/H/I/K
MSRAQLTLRMKASDNKPDKRRQRRDEAADAEVLSRILDIIGDMPAYGYRR
VWAILRRQSRNEGLPFVNAKRVYRIMSENSLLLLHDKPSRLQREHKGRIS
VKESDQRWCSDGFEFGCDDGEKVRVTFALDCCDREAIDWAASTGGYDKAT
VQDVMSGAIEKGFGDKVPEEPIQWLTDNGSAYRAHETRQFARELNLEPCT
TAVSSPQSNGMADGS
>c5168 Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>c5375 Hypothetical protein
MRVFDIQHIKGMEFEAVFFVSIDQLATLHPALFDKYLYVGITRAATYLDV
TC
>c5177 Hypothetical protein in IS
MTKNTRFSPEVRQRAVRMVLESQDEYDSQRAAICSIAPKTGCTPETLRVW
IRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>c5144 Hypothetical protein
MTLNTSQVSYYMTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRH
WRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPGQYPNSLRRT
MQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLA
HKLYHFRLEWSHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDS
LRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHG
HLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHL
KPLPLRRSADYDELTVRVSRSSTINVKHVVYSVPSRLVGQLLRVRLWDDR
LSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGME
QMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQLLRGKGGSQ
>c5383 Hypothetical protein
MNKWLHRNGFTYKKPSGVPHKFSEEKQRQFIEYYKELKTTVGDEPILFID
GVHPTQATKISYGWIRKGQKKAVKTTGSRTRLNIMGALNLKALTSPLICE
YKTINEYNVSRFFNEIRKVYPDYNQKIHVILDGAGYHRSQLVKDWAEVVN
IRLHYLPPYSPNLNPIERMWKLMNEHARNNRYFSSTREFREAISVFFNQT
LPDIADSLTSRINDHFQVLTPAS
>c5166 Partial Transposase
MTTAMAESINGLNKAEVIHRKSWENRAEVELATLTWVDWYNNRRLLERLG
HTPPAEAEKAYYASIGNDDLAA
>c5152 Putative radC-like protein yeeS
MQQLSFLPGEMTPGERSLIQRALKTLDRHLHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNKLIAGETLFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGSQVF
SFAEHGLL
>c5172 Hypothetical protein
MNQPIHNAYWLSRFDILLDSALAQHRAVSLIRVDLRFPEYMPVTIMDPDP
DSAVISRFFASLKAKIQAYQRKKRCANQRVHATSLRYFWCREFGKMYGRK
HYHVILLLNKDTWCSIGDFSEPSSLATMIQEAWCSALHLEPWQGDGLVHF
SRWTPSRKPASSDARPSSDDTPLSGGCSDTWKASDKKPGEAAVLWIKRGD
VGALQKARNRASYLVKYETKLHNGSGQRNYGCSRGPGRLLDGRRSL
>c5196 Transposase insC for insertion element IS2A/D/F/H/I/K
MEPGMTVSHVARLHGIQPSLLFKWKKQYLEGSLTAVAAGEDVVPASELAA
AIKQINQLQRLLGKKTMEVEILKEAVEYAQSRKWIAHVPLLPSDKE
>c5167 Putative Transposase for IS629
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSNRRLV
>c5175 Hypothetical protein
MKLLYLGLMNAQEKWTMPIQSWNLTLSQLAIYFEGRLNNVMTL
>c5384 Hypothetical protein
MIFQQQKIMLSNMKIFITEQQKAELERLHDSSRDGRVRDRIKAILLASEG
WSSAMIAQALRLHQTTIYHHISEFLNKGKLKPENGGSDSKLSAEQTAFSS
ASYPIIYFTIPAMLLRLSPGHGTSFSAFRG
>c5215 Conserved hypothetical protein
MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLD
EAIVSKFFEGEKDDVETVDVVLRPKVYFRLLQHGKDRSAGAPDIVTPLVT
PALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFS
INFDDSIDKTAETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNP
EQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPLFNRFASREVH
AAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGP
PGTGKTTLVLSIIATQWARAALEKSEPPVIIATSTNNQAVTNIIEAFGKD
FSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY
VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQ
TLSQVRAARELIANDIEQYLDNLNKLLSGQEQKVTLLKSAKTEWKKYRAG
ESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNID
GLLNSAEREQTTYRQQIDSAHEIVLKEQQAVQEWQRL
>c5216 Prophage P4 integrase
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKT
LALGVYPEVSLSEARTKRDEARKLISEGVDPCEQKRAKKVVPDLQLSFEH
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVR
AAEAKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH
RPALDLSRLPELLSRISSYKGQPVTQLAVMLNLLVFIRSSELRYARWSEI
DIDNAMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQAVAILTELQTW
AGENGLIFTGAHDPRKPISENTVNKALRVMGYDTTQEVCGHGFRAMACSA
LIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANREKGISPFEYAKINNPLK
>c5198 Hypothetical protein
MKTMKEDYIAFMPKPNVRTAVHNLAVAIEHYNENHPHSALGYRSPREYRR
QRVTLT
>c5176 Putative Transposase within prophage
MKKHQTTLSDELERKIIRLFALGMSYQDISREIEDLYAFSVSTTTISAVT
DKVIPELKQWQQRPLEKVYPFVWLDAIHYKIREDGCYQSKAVYTVLALNL
EGKKEVLGLYLSESEGANFWLSVLSDLQNRGVEDILIACVDGLTGFPEAI
NSIYPQTEVQLCVIHQIRNSIKYVASKHHKAFMADLKPVYRAVSKEAAET
ALDELEAKRGQQYPVVLQSWRRKRENLSAYFRYPANIRKVIYTTNAIESV
HRQFRKLTKT
>c5371 Prophage P4 integrase
MSILISIFADSILLVYSRDTNGRKTIMALTDTKVRSAKPEEKEYSLVDGD
GMSLLVKPGGSKYWRFRFRFGGKQHLMAFGVYPDVSLADARKKREEARKL
VTAGIDPREHKRAVKEEQAKEIITFEKVAREWLVTNQKWSEDHANRVKKS
LEDNIFPTIGARNIAELGTRDLLIPIKAVEKSGRLEVASRLQQRTTAIMR
YAVQSGLIDYNPAQEMAGAVASSNRQHRPALELKRIPELLQKIDDYTGRP
LTRWATELTLLIFIRSSELRFARWSEIDFETSMWTIPPEREPIPGVKHSQ
RGSKMRTPHLVPFSKQALAILKQIKQFCGEHELIFIGDHDPRKPMSENTV
NSALRVMGYDTKVEVCGHGFRTMACSSLIESGL
>c5214 Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>c5382 Hypothetical protein
MALYNIANKELHALEKTTFTLEGLQERYDLQEAIKKNIDIIAPDCLVISE
EFSYWEDSRRRIDLLAIDKQANLVVIELKRDETGAHMELQALRYAAMIST
MSFAKACEYFQTYLKKQNCDADAKEKILEFVELDETELVDFGKDIRIVLA
SSDFSKELTTTAIWLRDKGVDIRCVRLTPYRFNDDVLINAEQIIPVPELE
EYQVKFREKRDEQLISSQKKEKDYTWYIYSINLKMRV
>c5213 Putative Transposase for IS629
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTVSRKTVATGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLGRLGHTPPAEAEKAYYASIGNNDLAA
>c5178 Putative Transposase for IS629
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVAF
IIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>c5378 Hypothetical protein
MELIDNINRLLGDDLKQTLKPSARLKIAASCFSIYAFEALKAELEKIDEL
QFIFTSPTFTANEVTDKIRKERKEFHILKADRERSLYGSEFEIQLRNKLT
QRAIAKECADWMRRKATFKSNRSKAPMQQFACVQAAATETAYMPLHGFTA
VDLGYQQGNAVSNLVNKMDESTFTATYLSLFNQIWQDPEKLEDVTAQICD
HITSVYQENSPESIYFLMLYNIFNEFLDDINEDALPNDRTGYQDTLIWNK
LFNYQKDAATGIINKLESYNGCILADSVGLGKTFTALAVIKYYELRNKSV
LVLCPKKLADNWLNYSRNLKTNIFARDRFNYDVLCHTDLSRTSGESFGTP
LNRINWGNYDLVIIDESHNFRNNDAYKDKETRYQKLMNKVIKEGVKTKVL
MLSATPVNNRFNDLRNQLALAYEGDSENLSKKLRTGKTVEDIFRGAQASF
NAWAKLPSEDRTARAILDSLDFDFFELLDSVTIARSRKHIQTFYDTKEIG
QFPERRKPLSFHCSLTQRTDVMSFNEIFERLSLLKLAVYAPISYILPSRL
KKYEEMYDTQVAGKGKLKQVDREKSLQALMTTNLLKRLESSIESFRLTLK
SLRANHMNTLAKISTFNETSDLSGIDSRINDLTDQLENLDADDDLPCIGD
SEIGGKVKISLADMDLPSWEHDLKIDLEIIDALLTSMNKITAADDAKLQH
LKALVQEKVAAPLNPGNKKVLIFTAFADTADYLYANLAPELLATQTLHSA
KVTGKGVPKSTLKKSYDFQELLTLFSPHSKEKAIVLPNEAAEIDLLIGTD
CISEGQNLQDCDYLINYDIHWNPVRIIQRFGRVDRIGSPNSSIQLVNYWP
DISLDEYINLKERVESRMMIADVTATGDDNVLSAQANDVSYRKEQLRRLQ
EEVIELEDLKTGVSITDLGLNDFRMDLLNYVKANGELSNVPNGMHAVVSA
KPEMGLRPGVIFTLRNRNPSVNVSQHNRLHPYYLVYINREGEVIHDHTEV
KRLLDLVRSCCKGQTQPITDACLLFNKETADGSKMQVYSDLLGKAIRSMI
EVKEEKDLDSLFFAGKTTALVNTIVGLDDFELITFLVIQEAG
>c5145 Hypothetical protein
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELL
WRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQ
LKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSA
T
>c5373 Hypothetical protein
MEKLKMHSPNLTQDNIARIRDLFPGCVTEAKGEDGSVKLAVDFDQLRQEL
SDSIVEGPQERYQLNWPGKREALLTANAPIAKTLRPVRTTKNSKGEHIEE
SVNFDTTKNIFIEGDNLDALKLLQENYLGKIKMVYIDPPYNTGNDFVYAD
DFVDEVSEFFLRSNQVDREGNRLTANPETSGRFHSDWLSMMYSRLKLSRN
LLRDDGLIVIHIDENEYPNLEKLLAEIYGEKNNLGTIVWDKRNPKGDATG
VAQQHELICIYCKDREFFKTTCEFQRPKENAGKMLAKAKQILSKEGGVTE
KARKEYKDWVNQQDLTGGEKAYNQIDDNGDVFRPVSMAWPNKKKAPEDYF
IPLIHPVTGKECPVPERGWRNPPATMQELLKSGLIIFGPDEKTQPTRKYR
LNDNLFENIPSLLYYGGSDDALLADLKIPFDTPKPVQVAKRLIQSICKND
DILIDFFAGSCTAAHALMLLNAEDGANRRFIMVQLPEECDEKSEAKKLGY
SVVSEIGKNRIRRAAKKIREEFSEILATRNTELDLGFRLLKVDTSNMADV
YYSPDVLEKANLDLFVDNIKPDRTPEDLLFQVMLDWGVDLALPIAKQSIQ
GKDVFFVDGNVLTACFDASGSIDETFVKELAKLQPLRVVFRDAGFKNSAV
KINVEQIFKLMSPVTEVKCI
>gid:143776  ada  ADA Regulatory protein
MKNATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE
NVSFYADASEALAAGFRPCKRCQPDKANPQQHRLDKITHACRLLEQETPV
TLESLAEQLAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV
TQAILSAGFPDGSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR
CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLTFQQHVREVI
ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP
KAVRAVASACAANKLAIVIPCHRVVRGDGSLSGYRWGVSRKAQLLRREVE
NEER
>gid:143616  alkA  DNA-3-methyladenine glycosylase II
MYTLNWQPPYDWSWMLGFLAARAVSGVETVADSYYARSLAVGEYRGVVTA
IPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGKLGA
ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTSRVAQLYGERLDDFP
DYVCFPTPQRLAVADLQALKALGMPLKRAEALIHLANAALEGTLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>gid:143775  alkB  Alkylated DNA repair protein alkB
MLDLFADAEPWQEPLAAGAVILHRFAFNAAEQLIRDINDVASQSPFRQMV
TPGGYTMSVAMTNCGRLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
TDCRYNLTFRQAGKKE
>gid:141008  c0002  Hypothetical protein
MFYREKRRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGF
MGYLKGKSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIK
HQLEEDKMGEQLSIPYPGSPFTGRK
>gid:141075  c0072  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:141090  c0086  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:141125  c0118  Hypothetical protein
MKHSFEVKLAAVNHYLAGHAGIISTAKLFQLSHTSLSHWINLFLLHGPRA
LDCRHKRSYSPEDKLCVVLYALGHSESLPRVAARFNIPSHNTVKNWIKGY
RKSGNEAFIRRRKEKSMTRSDDTHENEANMTPEEMKNELRYLRAENAYLK
AMQEHLLEKKRQELEKKRKSSRA
>gid:141126  c0119  Putative Transposase insK for insertion sequence element IS150
MKQLIASIFHEHRGCYGYRRIHCELQKRGLKFSGKTVRKLMQQLGLKSPV
RLKKYRSYRGNMGLAAENILQRQFKAEAPCEKWVTDITEFRAGGQKLYLS
PILDLFYGEIVAWETACRPTEELVKRMLNKGLESLAEGEKPLLHSDQGWH
YRIKSYQSALADRGLVQSMSRKGNSLDNAVMENFFGHLKEEMYYRRDYRN
VEELENAVNEYITYWNQKRIKLSLGGLSPVEYRTEYQKAG
>gid:141144  c0138  Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:141145  c0139  Putative Transposase for IS629
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>gid:141203  c0198  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:141236  c0232  Putative Transposase insK for insertion sequence element IS150
MKQLIASIFHEHRGCYGYRRIHCELQKRGLKFSGKTVRKLMQQLGLKSPV
RLKKYRSYRGNMGLAAENILQRQFKAEAPCEKWVTDITEFRAGGQKLYLS
PILDLFNGEIVAWETACRPTEELVKRMLNKGLESLAEGEKPLLHSDQGWH
YRIKSYQSALADRGLVQSMSRKGNCLDNAVMENFFGHLKEEMYYRRDYRN
VEELENAVNEYITYWNQKRIKLSLGGLSPVEYRTEYQKAG
>gid:141237  c0233  Hypothetical protein
MQFMKHSFEVKLAAVNHYLAGHAGIISTAKLFQLSHTSLSHWINLFLLHG
PRALDCRHKRSYSPEDKLCVVLYALGHSESLPRVAARFNIPSHNTVKNWI
KGYRKSGNEAFIRRRKEKSMTRSDDTHENEANMTPEEMKNELRYLRAENA
YLKAMQEHLLEKKRQELEKKRKSSRA
>gid:141265  c0256  Hypothetical protein
MMDKPTDWRSGTRRIFSNEFKLHMVELASKPNANVAQLAREHGVDNNLIF
KWLRLWQREGRISRRMPPTIPLR
>gid:141266  c0257  Unknown in ISEc8
MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLDETPFSGHLFIFRGRR
GDTVKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTFRLNSLTML
>gid:141267  c0258  Unknown in ISEc8
MSQKYLIRIAELESQLRQKDQQLSLVEETKTFLRSALARAEEKIEEDERE
IEHLRAQIEKLRRMLFGTCSEKLRREVEQAEALLKQREQDSDRYSGREDD
PQVPRQLRQSRHRRPFPAHLPREIHRLESEESCCPECGGELDYLGEVSAE
QLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRRDRAWYRGPRITCPR
VYGKILRTPATVSSE
>gid:141268  c0259  Hypothetical protein
MFTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALY
RYVMNTRKLHTDDTPVKVLAPGLKKTKTGRIWTYVRDDRNAGSSSPPAVW
FAYSPNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGGALTEVACC
AHARRKIHDVYISSKSAMAEEALKRISELYAIEDEIRGLPESERLAVRQQ
RSKALLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYRDDGLAE
ADNNTAERALRAVCLGKKNSYDLCQILSPKRPYAAPGASLYRGLKNLKQS
DCILDFTNSYSGILAP
>gid:141270  c0261  Hypothetical protein
MHSENIAAYVGLDVHKETLAVAIAAPERLGEVRYYGTINNEAQAVRRLFQ
KLQGLYGNILSCYEAGPCGFGLYHQLTAMNIKCQVIAPSRIPKSPTDRIK
NDHRDAISLARLLRAGELTPVWIPDLTHEAMRDLIRARAAAKRDSRVARQ
RILSMLLRTDKHYAGKHWTGKHRTWLANQSFSQPSQQIAFQHYCQSLEQI
EDRILQLDQEISRLLPEWSLCNLVCQLQALKGVGQLIAITLVAELGDFSR
FSNPKQLMAFLGLVPGEYSSGNSIRPRGITKVGNSELRRLLYEAAWSYRT
PAKVGAWLIYYRPDSVTQYSKDIAWKAQQRLCSRYRSLTAKGKKSQVAIT
AVARELTGFMWDIALAAQSSFSQQKQN
>gid:141271  c0262  Hypothetical protein
MFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDE
LLPWNVVLTNK
>gid:141273  c0263  Putative Transposase
MPIIAPISRDERRLMQKAIHKTHDKNYARRLTAMLMLHRGDRVSDVARTL
CCARSSVGRWINWFTQSGVEGLKSLPAGRARRWPFEHICTLLRELVKHSP
GDFGYQRSRWSTELLAIKINEITGCQLNAGTVRRWLPSAGIVWRRAAPTL
RIRDPHKDEKMAAIHKALDECCAEHPVFYEDEVDIHLNPKIGADWQLRGQ
QKRVVTPGQVKPAMWAATAKVRRCSSAC
>gid:141274  c0264  Hypothetical protein
MDNYIIHKSRETQSWLKENPKFRVIYQPVYSPWVNHVERLWQALHDTITR
NHQCRSMWQLLKKVRHFMETVSPFPGGKHGLAKV
>gid:141277  c0268  Hypothetical protein
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELL
WRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQ
LKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSA
T
>gid:141282  c0273  Putative radC-like protein yeeS
MEAGTMQQLSFLPGEMTTRERSLILRALKTLDRHLHEPGVAFTSTRAARE
WLILNMAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRA
LYHNAAAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVG
GSQVFSFAEHGLL
>gid:141300  c0291  Hypothetical protein
MSMNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDS
KLRGCDLVKLKVSDVAYGSSVSSRATVLQQKTGSPVQFEITKGTREAVSA
LIKLGNLRSKDYLFRSRVGTNQHISTRQYNRIFHGWVAKLGLEDSLYSTH
SLRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESI
EV
>gid:141355  c0349  Putative Transposase within prophage
MLTKLTVETALNAELTDHLGHEKNAPKSGSNTRNGYSSKTVLCDDGEIEL
NTPRDRENTFEPQLIKKHQTRITQMDSQILSLYAKGMTTREIVATFKEMY
DADVSPTLISKVTDAVKEQVSEWQNRPLDALYPIVYLDCIVVKVRHGGSV
INKAVFLALGINTDGQKELLGMWLAENEGAKFWLGVLTELKNRGLQDILI
ACVDGLKGFPEAINSVYPQTHIQLCIIHMVRNSLKFVSWKDYKAVTSGLK
AVYQAPTEEAALMALDKFAGVWDEKYPQISKSWRTHWENLNTFFGYPPDI
RKAIYTTNAIESLNSVIRQAIKKRKVFPTDDSVRKVIYLAIQSASKKWSM
PIQNWRLAMSRFIIEFGDRLSEHL
>gid:141359  c0352  Partial Transposase
MAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLERLGHTPP
AEAEKAYYASIGNDDLAA
>gid:141360  c0354  Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:141361  c0355  Hypothetical protein
MAWHRGHCCGMLIVNLDTHRPLVLLHGRDQHTLATWFRKYPEIQVVSRDR
CGINATAAREGAPQARQVAERWLLLNNIGDAPERMMYRHMTLIRLVTS
>gid:141362  c0357  Hypothetical protein
MSRVSHWLIIRGEENYASRFISLMCEKEPELKIAQQLVLEFYRILKTQNK
SQLSSLFTRVHESGSAELRCVAAGMEADAAAICEAISSRWSNGVVEGHVN
RLKMLKRQMYGRAGFELLKQRVMSPLA
>gid:141369  c0364  Hypothetical protein
MNIMIVTHNKYLELGLKKLLSRHSITIGADFFIPDNREHIINNNIFVILC
DKKNSMLMNYIFNGYRFYLLPVESISSLSSIYECMFSGRLLFGNSPHKLT
MNEMIILFYYVFHGWNVASIAYQFGMSSKTVYTHIYKAKKKNGISGKNLK
YKCAYERLVC
>gid:141371  c0366  Hypothetical protein
MGALNLKSPERPLISEYPTINALSISRFFNEIRKVYPDYNQTVNIILDGA
RYHHAQLVTDWAEVVNIKRVQLALIYIIIDLCKITVYHY
>gid:141372  c0367  Hypothetical protein
MLRNMKIFISDLQKAELERLHDTSRNKRVCDRIKAILLASEGWSSVMIAQ
ALRLHQTTVDHHIHEFMNKGKLKPENGGSDSKLSAEQTSLLIKHLSDNLF
HHTHEIIAFVLHTWGILFSVAGMNKWLHRNGFSYKKPAGTPHKFSEERQA
QFIEFMKT
>gid:141397  c0391  CP4-like integrase
MKLNARQIDTAKPKEKAYKLADGGGLYLLVKPGGGEYWRLKYRVAGKEKL
LALGVYPEVTLADAPAKLEEAKRGISGGIDLMEVKREEKIARETQLNNTF
KDIALEWHSNKL
>gid:141402  c0396  Insertion element IS1 1/2/3/5/6 protein insA
MATVTVHCPRCNSDKVYRHGRSCSQHERFRCRSCKRVFQLTYSYEARKPG
FKELIVEMAHNGTGAVISPEH
>gid:141404  c0397  InsB protein
MPKEKYLTGKIFTQRIERNNLTLRTRIKRLVSKTICFSRSVAIHEKVIGS
FIEKHMFY
>gid:141437  c0430  Type 1 fimbriae Regulatory protein fimB
MCLSFFYNCTRWRMTRKYLTQDEVYRLMDAAQSMSFPERNRCLIMMAFIH
GFRASELLDLRLSDIDASGKQLNIRRIKNGFSTTHPLLPDEYNLIKLWLK
QRKLIENGVEGDWLFLSRKRRPISRQHFFSIIREAGKRAGLAVKAHPHML
RHACGFALADNGVDTRLLQDYLGHRNIQHTVRYTASNAARFKGVWKKKPR
>gid:141569  c0563  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:141777  c0767  Hypothetical protein
MGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREKRRAIGCILRKLCEWKSV
RILEAECCADHIHMLVEIPPKMSVSGFMGYLKGKSSLMPYEQFGDLKFKY
RNREFWCRGYYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPYPGSPFTG
RK
>gid:141958  c0941  DNA adenine methylase
MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMIETDYPSYLVA
DINPDLINLYKKVAADCEAFISRARALFEEANREVAYYNIRQEFNYSTEI
TDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKEIRKFA
EKAQRATFICASFDETLAMLKAGDVVYCDPPYYGTFSGYHTDGFTEDDQY
HLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKAKRSIGVSAGE
SKSATEIIAVSGARCWVGFDPSRGVDSSAVYEVRV
>gid:142180  c1165  Putative P4-family integrase
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQ
KRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVVIPT
FADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADV
AETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQ
TRGRDEHQPAMPWRQLPLFVATSVYSDEPYNVTRALLLMVILTATRSGEA
RGMRWAEIDFHKRVWTIPAERMKARLQHRVPLSRQAIYILENIRGLHDEL
VFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQ
GYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK
>gid:142184  c1168  Unknown in IS1N
MTLICELDEQWSFVENKARQQWHWYAYTTKSGGVLAYTFGPRTDETCREL
PELLKPFSTGMITRDNRSSYTREMPQDKHLVGKIFTRRIERNNLTLRTHI
KRPARKTICFSRSLEIHEKPLVHLSKTLYY
>gid:142222  c1213  Hypothetical protein
MCASAEHCCFPTLLFRSDDQNGLRYQPSWPRYAVGQVVFETRLKPTVVIG
TQAAAVHGISEQALCGAPSWTDVVRQLRHAIGDRPVIIFNARFDIRILKK
TAAAHSDPADWLEELTVYCVMELAAGYYGASNRYGTISLACAASQTGLNW
EGQAHSAIADARMTAGVVNAIAAYHLELLQEQARLKT
>gid:142227  c1218  Hypothetical protein
MVVRGFTGSETIVRDAIAKWRKGWNPPVTTAVRLPSMSRVRRWLMPWRII
RDEENYASRFISLMCEKEPELKIAQQLALEFYRILKTQNKSQLSSWFTRV
HESGSAEFRRVAAGMEADAAAICEAISSRWSNGVVEGHVNRLKMLKRQMY
GRAGFELLRQRVMSPLA
>gid:142228  c1219  Putative Transposase
MGLRCSADTLLRRVINTPETKQSGAPHVGIDEWAWHRGHRYSTLIVNLDT
HRPLVLLPGRDQRTLATWFRKYPEIQVVSRDRSGVYATAAREGAPQARQV
ADRWHLLKNIGDAPERMMYRHMPLIRLVASELSPKKSPDPEPSVPAASLR
RPEHLKQQPRKKRHQRWTEVMALHNKGCSFREISRITGLSRVTVSRWVRS
GTFPEMSTRPPKRGLLDPWREWLKEQRESGNYNASRIMAGNGGPGVYRQ
>gid:142233  c1224  Transposase insF for insertion sequence IS3A/B/C/D/E/fA
MSALNDNWRSRLRNWPSSKRPRQLREAPEMKYVFIENHRAEFSIKAMCRV
LRVARSGWYVWLWRRHQMSLRQQFRLTCDAAVHKAFFEAKQRYGAPRLAD
EMPEFNIKTIAASLRRQGLRAKASRKFSPVSYRAHGLPVLENLLEQDFSA
SGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSLRMTAQLACD
ALQMALWRRRRPESVIVHTDRGGQYCSGDYQALLKLTQPAWQYECERLLL
>gid:142235  c1225  Transposase insE for insertion sequence IS3A/B/C/D/E/fA/fB
MTKPVSISKKPRKQHTPEFRNEALKLAERIGVAAAARELSLYESQLYAWR
SKQQQQMSSSERESELAAENVRLK
>gid:142256  c1248  Hypothetical protein
MLSNMKIFITEQQKAELERLHDSSRDGRVRDRIKAILLASEGWSSAMIAQ
ALRLHQTTIDHHISEFLNKGKLKPENGGSDSKLSAEQTAFLISQLSDNLF
HHTRDVIAFVTRTWNIIFSIPGMNKWLHRNGVTYKKPSGVPHKFSEEKQR
QFIEYYKELKTTVGDEPILFIDGVHPTQATKISYGWIRKGQKKAVKTTGS
RTRLNIMGALNLKALTSPLICEYKTINEYNVSRFFNEIRKVYPDYNQKIH
VILDGAGYHRSQLVKDWAEVVNIRLHYLPPYSPNLNPIERMWKLMNEHAR
NNRYFSSTREFREAISVFFNQTLPDIADSLTSRINDHFQVLTPAS
>gid:142265  c1256  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:142274  c1262  Putative Transposase for IS629
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>gid:142275  c1263  Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:142280  c1268  Hypothetical protein
MFFGNDHGGERSALLYGLIGACRLNGIDPESYLRHILNVLPEWPSNRVDE
LLPWNVVLTQ
>gid:142294  c1282  Putative radC-like protein yeeS
MQQISFLPGEMTPGERSLILRALKTLDRHLHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGSQVF
SFAEHGLL
>gid:142304  c1291  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:142396  c1384  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:142411  c1400  Prophage lambda integrase
MSPRPRKNSTDVAGLYEKFDRRTGRVYYQYKNPVTGKFHGLGTDKGKAEK
IASTANQRIAAAEAEYFMRKIDESPSATKRRGIRLKAWVDRYLKIQDTRL
KNGDIAATTHKEKTRMAAYLVSRLGNHPLKELEVRDFALILDEWLDKDMV
STARVNRGLWVDIYKEAQHAGEVPPGWNPPEATRKPIPKVTRARLTMEDW
QKIYNATPEKHFIRNAMLLAIVTGQRRDDICHMRFSDVWNEHLHITQGKT
GMRLALPLTLRCDAIGITLKEVIDGCRDRILSPYLIHSRHQKQPKPMSKD
NLSDYFAKARDLAGIIPPAGKTSPTFHEQRSLSERLYRAQGIDTKTLLGH
KVQATTDRYNDTRGQEWVKLVI
>gid:142436  c1425  Hypothetical protein
MIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVRGQPVSGG
RLGVKIYPIMH
>gid:142439  c1427  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:142459  c1444  Prophage Qin DNA packaging protein NU1 homolog
MATQTEVARHLSLTDRQLRRLQKLPGAPISNKRGQLDLDAWRDFYISYLR
RSKNDVPDGDSEDDYEEKLLIARWELTAEQAVTQQLKNEVSKGKLIDTGF
CIFALSKLAMALSSTLDSIPLSMQRQFPDLTPRHLDHLKTLIAKGANQCA
RAGDKLPDLLDEYIRATTE
>gid:142467  c1452  Hypothetical protein
MSGGEMPPCLFSQEAEMATKEENLNRLRQLAGLLGREADMSGSAADIAQR
VSEWEEELAVSPEGIMHSDESGADQNHTDDGEQLHNTDATDDVKAVRVRK
CLHVMGYCPETGRPVELTYRGMRVMVPSPLATAMIQHGTAEHA
>gid:142498  c1483  Putative integrase of prophage
MNYTVQKALKPVTVGDALTYWLESYVKENRVDYAALKKRLNNHVIQHIGA
MPLDKCELRHWLACFDQVAKRTPVTAGFLLQTCKQALKFCRRRRYAISNV
LDDMSVADVGKKPDISERVLSTKELGELLQALDKKIFSPYYVALIRLLIV
FGCRTVELRLSEISEWDFTEMLWTVPKEHSKTKVAIFRPIPEAILPFVTQ
LVEQNRHTGLLLGEVKQETSVSQYGRLAHRRLKHPHWSLHDIRRTFTTML
NDLGVDPHVVEQLTGHQMPGMQRVYNHSRYLDAKRDALDMWTERLGILAG
THENVTTLPIAREI
>gid:142512  c1497  Putative single stranded DNA-binding protein of prophage
MTAQIAAYGRLVDDPQVKQTSKGTPMTLARMAVSLPCSQAQDGQATLWLS
VMAFGKQADFLAKHQKGDVASVSGTMQVSQWTGQNGETRQGYQVIADSVI
SARAARPGGNRRKTTGTQGNQPPAGGDDPYGDDIPF
>gid:142528  c1513  Putative Nudix hydrolase ymfB
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>gid:142534  c1519  Prophage lambda integrase
MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQ
VATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDR
LQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGH
NRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPE
WQAIFDSVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQE
KTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRG
DQVSANTLTTAFKKAREKCGIKWEPGTAPTFHEQRSLSERLYREQGLDTQ
KLLGHKSRKMTDRYNDDRGKDWIIVDIKTA
>gid:142536  c1520  Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:142537  c1521  Putative Transposase for IS629
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>gid:142546  c1529  Hypothetical protein
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELL
WRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQ
LKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSA
T
>gid:142547  c1530  Hypothetical protein
MTLNTSQVSYYMTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRH
WRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPGQYPNSLRRT
MQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLA
HKLYHFRLEWSHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDS
LRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHG
HLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHL
KPLPLRRSADYDELTVRVSRSSTINVKHVVYSVPSRLVGQLLRVRLWDDR
LSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGME
QMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQLLRGKGGSQ
>gid:142570  c1552  Unknown protein of IS629 encoded within prophage
MTKNTRFSPEVRQRAVRMVLESQSEYDSQWATICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:142571  c1553  Putative Transposase for IS629
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDW
LKKEIQRVYDENHKVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>gid:142610  c1593  Hypothetical protein
MSLSNPSGLYVNSAATDVFAEKTTFGCIAKADGHFHVVTNGKAVNEVYCE
YNGVTADKNIRFGGQTNTGERHLFGHIRNFRIWHKELNDRQLKEVV
>gid:142755  c1738  Hypothetical protein
MGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREKRRAIGSILRKLCEWKSV
RILEAECCADHIHMLVEIPPKMSVSGFMGYLKGKSSLMLYEQFGDLKFKY
RNREFWCRGYYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPYPGSPFTG
RK
>gid:142834  c1819  Putative conserved protein
MLPHPVRVAQHQHAKQGFPIRFHYDYLGARWNAAVKRAGIRRRNPYHTRH
TFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQVAMLNAR
LS
>gid:142905  c1893  Hypothetical protein
MLDIKGEIITTDAMGCQKDIAEKIQKQGGNYLFAVKGHKERINKAFE
>gid:143171  c2160  CTP pyrophosphohydrolase
MKMIEVVAAIIERDGKILLAQRPAHSDQAGLWEFAGGKVEPDESQRQALV
RELNEELGIEATVGDYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALRYPLAPADIPLLEAFMASRAARPAD
>gid:143223  c2212  Probable ATP-dependent helicase yoaA
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPXTNQPGSAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>gid:143265  c2254  Exodeoxyribonuclease X
MARRYHSITLGFFGLEHAMLRIIDTETCGLQGGIVEIASVDVIDGKIVNP
MSHLVRPDRPISPQAMAIHRITEAMVADKPWIEDVIPHYYGSEWYVAHNA
SFDRRVLPEMPGEWICTMKLARRLWPGIKYSNMALYKTRKLNVQTPPGLH
HHRALYDCYITAALLIDIMNTSGWTAEQMADITGRPSLMTTFTFGKYRGK
AVSDVAERDPGYLRWLFNNLDSMSPELRLTLKHYLENT
>gid:143268  c2257  Hypothetical protein
MMTGNGADIDCYADRENQPVQMAVRFWITRNSQLQAIQRAELLRQSTFLL
FEIAFNAFTNVLGDFQRITQCIQVTAFHKMININSRAAQEIDFQRLFFIA
DPSSQALWIQRLFDCLLNEYTPLFFHAGFTQLQVDCRTFFGINMTITFSG
KDQRKDQTFRTLIQRRTLRAKLRFVLMRRANFIFVFIPQHRTRYRSAANQ
>gid:143347  c2334  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:143405  c2392  Putative P4-family integrase
MPKKAKELSGLVVSRLKSEGMYAVGGVDGLYLRIRNQSRAWVLCVAMGTR
INNLGRTVPRRLNMGLGPYPEVSLAEARDKARELRKQIRNGINPLQEKHE
QKARQEILARKKKTFAECCEEVLEVKDSEMKNKKHLAQWRSTLETYAYPF
IGKKAVSEITKVDLLAILEPIWLTKNETASRLRGRIETVIDYAKAKEYFE
GDNPAAWKGMLKPLLPQPSKVQITKHHAALPYNQIGSFMKELRERSGVSP
RALEFAILTAARSGEIRGAEWSEIDLEGKTWTIPASRMKATKEHRVPLSD
AAVALLKALPRFKGINFVFPATRKGQLSDTALLAVLKRMGYTDLTQHGFR
STFRDWAGETTNYPREVIEHALAHQLANKAEAAYQRGTLWPKRVALMDDW
AGYCIS
>gid:143418  c2405  Hypothetical protein
MIHMPHKKVALQLIEETLKELESPKGSLLSAIQKLQRTSDIINDDDKKIW
CAIQLGDTKYTKPITELLKFVIEAENTKNKSFQENLDKRIQELAKLGVKA
NIHYSDEELTLKNIESGGGYNNIGFIEEKYADLVRKKQGNDGTYYKNSLN
QHINYVRKKAHELASQIYNQLKFSGTVSNCFDVLKNAVDDKLLDLNPVIA
EQLMLAFKAISSDKEEEWSQALTTCRRLLEGLADELYPASKEKFNGRAVG
QGQYVNRLWAFMDGAIQSDSNKDLAKAHIDFLGSWLDKVNKLTNKGVHAE
LDRIEAVKSVFHTYLVVADLLEYMSNTKTSVSKPDINKATLDELEALLNI
NRTIAKEIVKARVREGKLDLDILKNIKGIGAKTLSNIQEVFVM
>gid:143433  c2418  Prophage P4 integrase
MLALNINPVQQRAAERGSRTPEKVFKNVALAWHKSNRKWSQNTADRLRAS
LNNHIFPVIGNLPVSELKPRHFIDLLKGIEEKGLLEVASRTRQHLSNIMR
HAVHQELIDTNPAANLGGVTTPPVRRHYPAPPLERLPELLERIGAYHQGR
ELTRHAVLLMLHVFIRSSELRFARWSEIDFTNRVWTIPATREPIIGVRYS
GRGAKMRMPHIVPLSEQSIAILKQIKDITGNNELIFPGDHNPYKPMCENT
VNKALRVMGYDTKKDICGHGFRAMACSALMESGLWAKDAVERQMSHQERN
TVRMAYIHKAEHLEARKAMMQWWSDYLEACRESYAPPYTIGKNKFIP
>gid:143441  c2425  Hypothetical protein
MFYREKRRAIGCILRKLCEWKSVRILEAECXADHIHMLVEIPPKMSVSGF
MGYLKGKSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIK
HQLEEDKMGEQLSIPYPGSPFTGRK
>gid:143468  c2449  Prophage P4 integrase
MSLNDAKIRSLKPTDKPFKVSDSHGLYLLVKPGGSRLWYLKYRINGKESR
IALGAYPAVSLSDARQQREGIRKMLALNINPAQQRAAERGSRMQEKMFKS
VALEWHSSKKKWSQNTADRVLARLNRHVFPTIGHLPVTELKSRHFIELLK
GIEEKGLLEVASRSRQHLSNIMRYAVHQGLIEINPAANLDGVTASPARRH
YPTLPLERLPELLERIDSYHQGRELTRLAVLLTLHVFIRSSELRYARWTE
INFRNRIWTIPATREAIAGVRYSSRGAKMRTPHIVPLSEQVISILKRIKE
ISGGYELVFPGYHDPYKPMSENTINKALRQMGYNTKQDICGHGFRAMACS
ALMESGLWSQDAVERQMSHQERNTVRLAYIHKAEHMEARMDMMQWWSDYL
DMCSEIWVPPYIWSQQNINLAVT
>gid:143492  c2472  Transposase
MRKARFTEHQIIAVIKSVEAGRTVKDVCREAGISEATYYNWKSKYGGMEA
SDIKKIKDLEDENRRLKQMFADLSLENRALKDVIEKKL
>gid:143494  c2473  Transposase
MHDALVCGRRFRMFNVVDDFNREALSIEIDLNLPAQRVVRVLDRIAANRG
YPAMLRLDNGPEFISLALAEWAEKHAIKLEFIQPGKPTQNAFIERFNRTY
RTEILDFYLFRTLNEVREITEKWLSKYNCERPHESLNNMTPEEYRQRHYL
AGISKSVWN
>gid:143495  c2474  Transposase
MELKWVYLHCCYDNACVESFFHSLKVECLHGEHFISREIMRATVFNYIEC
DYSRWRRHSWCGGLTPEQFENQNLA
>gid:143514  c2496  Hypothetical protein
MEADAAAICEAISSRWSNGVVEGHVNRLKVLIRQMYGRAGFELLRRRVMS
PLA
>gid:143515  c2497  Transposase
MLIVNLDTHRPLVLLPGRDQRTLATWFRKYPEIQVVSRDRSGVYATAARE
GAPQARQVADRWHLLKSIGDEPERMMYRHMPLIRLVVRELSLNKSPEPEI
SVPVASLRRPERLKQQTRKKRHQHWTEVMALHNKGCSFREISRITGLSRV
TVSRWVRSGTFPEMSTRPPKRGLLDPWREWLKEQRESGNYNASRIWREMV
AQGGTGSETIVRDTVAKWRKGWNPPVTTAARLPSVSRVSRWLMPWRIIRG
EENYASRFISLMCEKEPELKIAQQLVLEFYRILKT
>gid:143522  c2503  Transposase
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSL
ISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAA
QTTQASNRIRGLLTQIHPALERVLGPRLDHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALQ
LITLRKQRDEVALAVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASVAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALR
DPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQAS
>gid:143528  c2509  Insertion sequence ATP-binding protein
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELL
WRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQ
LKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSA
T
>gid:143532  c2511  Insertion sequence ATP-binding protein
MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHE
EKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFI
ERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTEQ
RQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAM
ILTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLRQKRK
AGVIAEANPE
>gid:143533  c2512  Transposase
MVTFETVMEIKILHKQGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPR
PAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRGF
IRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVLGYSRM
LYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTG
QHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMT
RLRPMGITVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSML
ALPPEKKEYDVHPSENLVNFDKHPLHHPLSIYDSFCRGVA
>gid:143549  c2529  Putative radC-like protein yeeS
MTPGERSLILRALKTLDRHLHEPGVAFTSTRAAREWLILNMAGLEREEFR
VLYLNNQNQLIAGETLFTGTINRTEVHPREVIKCALYHNAAAVVLAHNHP
SGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGSQVFSFAEHGLL
>gid:143644  c2622  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:143662  c2639  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:143916  c2897  Type 1 fimbriae Regulatory protein fimB
MQNRKFLTHHEINLLLQSVKQKSCSSRDVCMILLAYFHGLRVSELLSLQL
SDLELTTEKIYIQRIKNGFSTVHPLQKEEVIAITNWLNERNSLNVKHFND
NPWLFVSRTGKPLSRQRFYNIVSAAGKNAGLNIKVHPHMLRHACGYSLAD
NGVDTRLIQDYLGHRNIRHTVIYTASNSMRFEKMWGIGDAKKQHFDPKCK
PNLCLEILV
>gid:143917  c2898  Type 1 fimbriae Regulatory protein fimB
MRKFITHSEWLLFFEAINGSKNEIRDKAMLQMAYVHGLRVSELIALKISD
IDFSESAIYIKRLKNGLSTVHPLQKETVLLLKKWLALRDNIVKKPFEDSL
FLSCQGNKISRQYVYKMCKKYSHNMNINIHPHMLRHGCGYALANQGLDTR
LIQDYLGHRNIHHTVLYTASNAARFKRVWEGDVLDIKKI
>gid:144175  c3146  Putative DNA-invertase from lambdoid prophage Rac
MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSER
PGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLAL
GGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPP
VLNEEQKQVVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
>gid:144182  c3152  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMLYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:144219  c3189  Hypothetical protein
MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRR
LPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHVGVWEDDSQVKR
MLVEWGPVIPEGKVEITISKYEKPAGAAA
>gid:144234  c3204  Hypothetical protein
MGQTKLLKLPRGVTIRKHHQGETINITFTYKGVRCREPLSNLEVTPKNIK
YAERTLGEIHNKIERGTFIYAEYFPRSARLKIFGNAAASKTVKMYLDEYL
EICETRKLSPSTIGGYKKCRSALASLHICPASELTPATLKAWIQSQKTTL
KTIRNQLSFLRSALDEAVTDGVLQINPVSLVTASRYQSDKSEAESSYVVD
PLSPAEVDALLAAAGNKQWENLFRFAIHTGLRSSELCALRWHDIDFVGKT
AHVQSASVVGVIKGTKTKAGTRKVELTEEAMLALINQKPFTFMKDATVFE
DPKTNKPWTSADAIRKKAWVPTLRKAGIRYRNPYQTRHTFATRHISRGAN
LFWLAAQMGHKGPEMLFRHYGSYLKEYDGNTTSNITKKAT
>gid:144236  c3206  Hypothetical protein
MLQPAKKASVISAGLTKYLKDGVRIDDFNFRRMLREIEALRDPVSEDYLL
ALVYGAHGQVNEAIGFFERSLQVCHNKVVAKNFLVFLSDYGTLKKSFETS
IKLAESFVSPFIYLQAYENSLFMGKMALAEKYFQSYSKLFGDKEPEKMDN
NFDEVVSQVESFKQRADLSGLEYELIFNNVANVMDSQKVHLSGMRFYNIS
EEKVNAIVFMTKSSDAEQIADMNIELAFSMAEHDCLVGKDFTVWFECKDT
HIEQNAISELGLITRRVAHAG
>gid:144466  c3432  Transposase
MRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKGKSSLMLY
EQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEEDKMGEQLS
IPYPGSPFTGRK
>gid:144504  c3468  Hypothetical protein
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:144593  c3556  Prophage P4 integrase
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKT
LALGVYPEVSLSEARTKRDEARKLISEGVDPCEQKRAKKVVPDLQLSFEH
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLSTPDLLIPVR
AAEVKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH
RPALDLSRLPELLSRINSYKGQPVTRLAVMLNLLVFIRSSELRYARWSEI
DIDNAMWTIPAERKPLPGVKFSHRGSKMRTPHLVPLSQQAVAILAELQTW
AGENGLIFTGAHDPRKPISENTVNKALRVMGYDTTKEVCGHGFRAMACSA
LIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANREECISPFEYAKVNNPLKR
>gid:144597  c3560  Unknown protein encoded by ISEc8 within prophage
MKSLTAVRKKSPNYPVEFKIKMVELSHRPEISVAQLAREHGINDNLLFKW
RQYWREGKLRPPSTTENNVPELLPITLDAEDVVPTTSPRSQPVAAATPES
LNISCEVTFRHGSLRLNGAISENILNLLIRELKR
>gid:144598  c3561  Unknown protein encoded by ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTTLKDDPMSGHVFIFRGRN
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTPAQLAMLLEGI
DWRQPKRLLTSLTML
>gid:144599  c3562  Hypothetical protein
MSSSLPDDINALKRLLAEQEALNRALLEKLNEREREIDHLQAQLDKLRRM
NVGSCSEKVSRRIAQMEADLKALQKESDTLTGRVDDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAASCCPECGGSLSYLGEDAA
>gid:144601  c3563  Unknown protein encoded by ISEc8 within prophage
MRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSK
YAEHTPLYRQSEIYGRQGVELSRSLLSGWVDACCRQLSPLEEALHGYVLT
DGKLHADDTPVPVLLPGNKKTKTGRLWTYVRDDRNAGSTLAPAVWFAYSP
DRKGIHPQTHLAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHD
VHVRTPSALTEEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSL
ESWLREKMKTLSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENA
LRMVSLGRKNYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDV
IADWPINRVGELLPWRVALPTE
>gid:144613  c3575  Transposase insF for insertion sequence IS3A/B/C/D/E/fA
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQR
RHSRLGNISPATFREKYHQMAA
>gid:144615  c3576  Unknown in IS
MSRKNQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR
>gid:144616  c3577  Unknown protein encoded by ISEc8 within prophage
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>gid:144617  c3578  Unknown protein encoded by ISEc8 within prophage
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>gid:144619  c3580  Hypothetical protein
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRVQPDRDVQTERSGARKLPPLCP
>gid:144634  c3594  Putative Transposase
MAERPDQLWVADFNYVSTWQGFVYVAFIIDVFAGYIVGWWVSSSMETTFM
LDALEQALWARRPSGTIHHSDKGSQYVSLAYMERLKEAKLLASTGSTGDS
YDNAMAESINGLYKAEVIHRKIWKNRAEVELATLTWVDWYNNRRLLGRLG
HTPPAEAEKAYYASIGNNDLAA
>gid:144635  c3595  Transposase
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KKEIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTSAGKPLLQVTA
>gid:144636  c3596  Hypothetical protein in IS
MTKNTRFSPEVRQRAVRMVLESQDEYDSQRAAICSIAPKTGCTPETLRVW
IRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>gid:144637  c3597  Transposase
MISDRDGRALALTDDCGTESLAGYLRTLTDEQLLAIKTLSMDMNAGYIRA
ARIHLPCAVEKIAFDRFHVAKQLGEVVDKTRQNEHPHLPVESWRQAKGAR
FLWQYSDKWMTKSRQEKLIWLRAQMKLTSQCWALKELAKDIWNRPWSEER
RNDWERWLALAANSDVPMMKNAAKTIGKRLYGILNAMCLKRKRGGA
>gid:144648  c3607  Hypothetical protein
MTLNTSQVSYYMTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRH
WRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPGQYPNSLRRT
MQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLA
HKLYHFRLEWSHWSWMRVVLGGESFSALAEGLQEALGQLGGVPVEHKTDS
LRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHG
HLKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHL
KPLPLRRSADYDELTVRVSRSSTINVKHVVYSVPSRLVGQLLRVRLWDDR
LSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLR
NDILPDDEWRRLWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGME
QMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQLLRGKGGSQ
>gid:144649  c3608  Hypothetical protein
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELL
WRESEKLRRYKKEARLPVAKTLSEYDFSQVPELNGAQFRQLCETTDWVDA
GENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQ
LKLNELLLKLDRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVIT
SNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYRKKTAKAVTSA
T
>gid:144653  c3611  Transposase insD for insertion element IS2A/D/F/H/I/K
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIYHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKTAVPPSKRAHTGKVAVKESNQRWCSDGFEFRCDNGEKLRVTFAL
DCCDREALHWAVTTGGFDSETVQDVMLGAVERRFGNELPASPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIIPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>gid:144655  c3612  Transposase insC for insertion element IS2A/D/F/H/I/K
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLSAVAAGEQVVPASELAAAMKQIKELQR
LLGKKRWKMNSLKKPLNMGVQKSG
>gid:144657  c3613  Hypothetical protein
MNVGFSTLEAWVCQLRRERQEITPSAAPFTSEQQCIRELEKQVRVSVNGA
PY
>gid:144658  c3614  Hypothetical protein
MFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDE
LLPWNVVLTNK
>gid:144659  c3615  Unknown in ISEc8
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEK
IEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQESDR
YSGREDDPLVPRQLRQSRHRRPLPAYLPREIYRLEPEESCCPECGGELDY
LGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAG
PGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTP
LNDALYSYVMNTRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSP
EPPAVWFAYSPDHQGKHPEQHLRPFRGILQADAFAGYDRLFSAEREGGAL
TEAGCWAHARRKIHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSER
LAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLRDAFRYIRKHWVALCNFC
DDGLAEADNNTAERALRAVCLGKKNSCSSVAITAASVVHCCTG
>gid:144660  c3616  Unknown in ISEc8
MISLPSGTRIWLVAGVTDMRKSFNGLGEQIQHVLDDNPFSGHLFIFRGRR
GDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKISITRSQLAMLLDKL
DWRQPKTSRLNALTML
>gid:144661  c3617  Unknown in putative ISEc8
MMDKPTDWRSGTRRIFSNEFKLHMVELASKPNANVAQLAREHGVDNNLIF
KWLRLWQREGRISRRMPPTIVGPTVSQSFPASPTLVPVELIDTPRCATDA
PAPEALSVACAASCHVEFHYGKMMLENPSPELLTVLIRELTGRGR
>gid:144684  c3640  Unknown in ISEc8
MVKLASQLGASVARIAREHDINDNLLFKWLRLWQNEGRISRRLPVTTSSD
AGVELLPVEITPDEQKEPMAALTPLLSTPSQSTVSASSCKVEFRHGNMTL
ENPSPELLTVLIRELTGRGR
>gid:144685  c3641  Unknown in ISEc8
MISLPSDTRISLVAGVTDMRKSFNGLGEQVQHVLDENPFSGHLFIFRGRR
SDMIKILWADADGLCLFTKRLEEGLFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>gid:144687  c3643  Unknown in ISEc8
MSRKYLIRITELERLLSEQAEALRQRDLQLSLVEETEAFLRSALARAEEK
IEEEEREIEYLRAQIEKLRRMLFGTRSEKLQREVEQAEAQLKQREQESDR
YSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGSELDY
LGEVSAEQLELVSSALKVIRTVRVKKACTKCDCIVEAPAPSRPIARGIAG
SGLLARVLTGKYCEHLPLYRQSEIFARQGAELSRALISNWVDACCQLMTP
LNDALYRYVMNTRKVHTDDTPVKVLTPGRKKAKTGRIWTYVRDDRNAGSS
EPPAVWFAYSPDRQGKHPVQHLRPFRGILQADAFSGYDRLFSAKREGGAQ
TEVACWVHARRKIHDV
>gid:144690  c3645  Unknown protein encoded by ISEc8 within prophage
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDIAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELSRSLLSGWVDACCRQLSPLEEALHGYVLTDGKLHADDTP
VPVLLPGNKKTKTGRLWTCVRDDRNAGSTLAPAVWFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGRITEAACWAHARRKIHDVHVRTPSALT
EEALKRIGELYAIEAEIRGMTAEQRLAERQLKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKN
YLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVG
ELLPWRVALPTE
>gid:144702  c3657  Unknown in ISEc8
MLSPDNVFIAIKPVDMRRGIDSLTQYIQDELRSTWHEGAAFVFVNKARSR
IKVLRWDKHGVWLCTRRLHKGSFRWPRANDAAWHLTPDEFNWLIAGVDWQ
QVKGHDLTKWVWQNEPELRPENTQNTLLTQ
>gid:144703  c3658  Hypothetical protein
MNIRIWSGILPCMDISALNTTNDIEKLRAMALAMVQEVMSENAEKERELL
EKSRRIQLLEEMLKLVRQQRFGKKCGTLAGMQRSLFEEDVDADIAALTAH
LDKLLPQSPEEDEKASRSRPIRKPLPVHLPRVEKIIQPDTDHCPECDEPL
HYIRDAVSEKLEYIPAHFVVNRYVRPQYSCPCCQKVFSGEMPAHILPKSA
VEPSVIAQVIINKYGDHLPLYRQQQVFARSDVGLPVSSMADMVGAAGAAL
SPLAALLHRELINRPVVHADETTLKILNTKKGGKSCSGYLWAYVSGERTG
PSVVCFDCRTGRSHEYPENWLQGWGGTLVVDGHKAYRTLANKVPEITLAG
CWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKIRQWR
QRYARPILEELWSWLEEQEPQCSPGRALHKAIAYALSHRVELSRFLEDGA
VPLDNNVCERAIKNVVLGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLE
PHAWLTDVLMRLPEWPEERLAELLPLEGFTFSG
>gid:144704  c3659  Unknown protein encoded by ISEc8
MKHRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCSMFVRFRRAGLSWP
LPAGMSEQELDACLYGQFSTVPVVRPESTVISEAPVVKKRPRRPNFPYEF
KIALVEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPA
LLPVTLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAGTLRLKGKLTP
ALLQTLIREIKGSSH
>gid:144705  c3660  Unknown protein encoded by ISEc8
MISLPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRR
GDQIKVLWADSDGLCLFTKRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
NWKHPKRTERAGIRI
>gid:144707  c3662  Unknown protein encoded by ISEc8
MDTSLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVAAYASEINRLKAL
VAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVL
PSALRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLE
LISSAFKVIETQRPKQACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVTG
KYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDVLRQYVL
MPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSQMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGGYLIRIRQNNGSRVYGSCPEKNPRC
ACKSAHLHHHGSPAAYR
>gid:144708  c3663  Hypothetical protein
MQSLYDWIQQQMKTLSRHSDTAKAFAYLLKQWDALSVYCSNGWVEIDNNI
TENALRGVAVGRKNWMFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRY
VIEHIQDWPANRVRDLLPWKVDLSSQ
>gid:144716  c3671  Putative radC-like protein yeeS
MEAGTMQQLSFLPGEMTPGERSLILRALKTLDRHLHEPGVAFTSTRAARE
WLILNMAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRA
LYHNAAAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVG
GNQVFSFAEHGLL
>gid:144729  c3684  Hypothetical protein
MAALVVTWFNPVKEAFYMHLPAPGKTKKVALVDCMRKLLTILNAMLRKNE
E
>gid:144747  c3703  Transposase insG for insertion sequence element IS4
MFPDFFMHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTL
RKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQ
ARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPE
NDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAE
QLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIR
KLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTD
AMRFPGGEMAELYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELW
GVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPG
RIPELMRDLASMGQLVKLPTRRERAFPRVVKERPGNTPQPRKRASQLLN
>gid:144845  c3803  Putative Transposase
MWLFRMYNISTFFWEGQYVCEINRSRYSAYLLHVHVVFMTKYRRPVFGEL
HHANMRQYVAEVCADFGAELKECDGEADHVHMLIEYPPLVQLTKLVNSLK
SVTSRRLRNDFIDLRAAYSKPVLWSRSYFIGSCGGAPLEVVKKYIQNQRG
>gid:145050  c4010  Hypothetical protein
MKSLFNRLTGKAVSRTAFVEHLGHEVVQHHPNWKVMISTDHKLMRIDTPP
NSHY
>gid:145264  c4222  Putative DNA processing protein
MDADSSDSTVTPGRLPDGLSSCPCFYLGKKLMNLSANAQATLLLTSDFSR
AAASEYKPLSNSEWGKFALWLKHQRISPAELLVPQPQEKLTGWSDPRISQ
ERILGLLARGHSLALAVDKWQRAGLWILTRGDADYPVRLKNRLRTDAPPV
LFGCGNKALLQAEGMAIVGSRDAPTDDLRYTQQLAAKLAQQGICVISGGA
RGIDECAMASALEVGGTAVGVLADSLLKTSTLVKWREGLIAGNLVLISPF
YPEVRFTVGNAMARNKYIYCLAESAMVVRAGMTGGTITGAMEALKHQWLP
MQVKPNQDMQSANSRLVENGASWSAEQAENVTIRLPDVPGLMYDRALRNA
QPELFSLHEDDANYAVMPAHTPVDFYQLFVAELAILAKESISIERLASCT
GLTIEQISVWLNRAEEEGRVIRLGEGHYQFR
>gid:145265  c4223  Putative conserved protein
MMEKHGAELLLQRMLSNTSATFREGQWEAIDAVVNQRRKLLVVQRTGWGK
SAVYFIASKIFRDHGAGPTIIISPLLALMRNQVAAAERLGITAETLNSTN
REEWQRISDKLLQGGVDCLLISPERLANQDFLETVLYPVADRIGLLVVDE
AHCISDWGHDFRPDYRRILDILRQLPANTPILGTTATANNRVVEDIRQQL
GDIVIQRGTLARESLALDALVLGEQSSRLAWLATVIPQFSKSGIVYTLTT
RDAELVAEWLRKNGISAFAYYSGVTCEGAEDSNTAREYLEQALLANKIKV
LVATTALGMGFDKPDLGFVIHYQMPGSIVGYYQQVGRAGRAIDSAVGILL
CGGEDRAIHQFFRESAFPAEAQIHEILNVLSENDGLTLRGIEQRTNLRYG
QIEKALKLLVAENPSPVVYTEKLWRRTIVSFSPDHERINHLMNQRKSELA
DVESYITTKECKMQFLRRALDEPSAERCGKCSSCLQHPLLSPDIDSGLLH
AANLFIKHADLPLNLNKQVASGAFTQYGFKGNLPAGLQGSTGRILSRWGD
SGWGKQVAQEKKTGRFSDELVEACAEMVRQRWNPHPEPTWVCCVPSLRHL
DLVPDFARRLAAKLGLPFIDAIEKVVDNPPQKMQQNRFHQCQNLDGAFVI
TPPLMPGPALLVDDIVDSAWTLTVLTALLRQAGCPTVYPLALASTSVKN
>gid:145546  c4503  Insertion element IS1 1/2/3/5/6 protein insA
MASVNIHCPRCQSAQVYRHGQNPKGRDRFRCRDCHRVFQLTYTYQARKPG
MKEMITEMAFNGAGVRDTARTLKIGINTIIRTLKNSHQRK
>gid:145552  c4507  Transposase insF for insertion sequence IS3A/B/C/D/E/fA
MLDVHPSGFYAWLQQPHSQRHQADLRLTGQIKQFWLESGCVYGYRKIHLD
LRDSGQQCGVNRVWRLMKRVGIKAQVGYRSPRARKGEASIMSPNRLQRQF
NPDAPDERWVTDITYIRTHEGWLYLAVVVDLFSRKIIGWSMQSRMTKDIV
LNALLMAVWRRNPEKQVLVHSDQGSQYTSHEWQSFLKSHGLEGSMSRRGN
CHDNAVAESFFQLLKRERIKKKIYGTREEARSDIFDYIEMFYNSKRRHGS
SEQMSPTEYENQYYQRLGSV
>gid:145553  c4508  Conserved hypothetical protein
MFFGSDHGGERGALLYGLIGTCRLNDIDPEAYLRHILSVLPEWPSNRVDE
LLPWNVVLTNK
>gid:145557  c4511  Hypothetical protein
MSIDMHCHLDLYPRPDLVAEESKRRGTYILSVTTTPKAWHGTSLLAKESQ
RIRTALGLHPQIAHQRSHELDLFDSLLSETKYVGEIGLDGGQGFKEHWDI
QLKVFRHILNSVNRAGGKIMTIHSRGSASAVLDEIENIDGVAILHWFTGT
PKQLERAIDLGCWFSVGPAMLDTIKGKALVLKIPKSRILTETDGPFAKFR
NDPLMPWDSGIAEKQLAALWGISQMEVNAQLVDNFKVLCTS
>gid:145559  c4513  Hypothetical protein
MGTSQSSKGPGGGSPLVPPWADDQPQQPLPSPQERRFAPFRESLGNAVSN
GNRADFRKAIGHYARKASGGSSNAARRLGSVTQAGAELFGALVGMPSAPG
EPSIDLGSLAGLPCEIAISTIAQALTSQDGDSEKICAAMNHALVEALDGV
EIFDPQKITDGVIVDTMIGYLAESIFLQMVMDSNRAWNKADTPSKAIHAE
IELRELIKVVVDKHMAPKLAGNIRSFTRNQMVKIERQAIIEAWQEWEAYQ
>gid:145584  c4540  Putative maturase-related protein
MMINEAQAQATATSGRGDGQYPSGLHDGAEISTAAGGQTKAEVPLTMEAV
ITRENLMLAYQRVVENKGTAGVDNLSVAELKPWLKKNWRSVRQALIDGNY
QPRAIRRMDIPKPDGGVRTSGIPTVVDRLIQQAVQQAQRYIRGGKRWVVD
MDLEKFFDRVDHRLLMTRLARTIKDRRVL
>gid:145585  c4541  Putative maturase-related protein
MVKDGQREKRQAGMLQGGPLSPLLSNILLDELDKELERRGHSFCRYADDC
NIYVSSRKAGDHLLKNIRAFVENKLKVNEKKSAVARPWDRKFLGYSVTWH
KQAKLKIALTSVNRLKEKVHSLTTGNRSKSVKATINALTPVLCGWISYFR
LTEVRGVLEELDGWIKRKLRCLL
>gid:145597  c4552  Transposase insC for insertion element IS2A/D/F/H/I/K
MIVLILVFRPVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLNKTPDVSRL
>gid:145618  c4573  Putative radC-like protein yeeS
MTPGERSLIQRALKTLDRHLHEPGVAFTSTHAAREWLILNMAGLEREEFR
VLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNAAAVVLAHNHP
SGEVTPSKADRLITERLVQALALVDIRVPDHLIVGGNQVFSFAEHGLL
>gid:146541  c5469  Transposase
MRSGNCKCQTRNQKGVPMGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREK
RRAIGCILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKG
KSSLMPYEQFGDLKFKYRNREFWCRGYYVDTVGKNTAKIQDYIKHQLEED
KMGEQLSIPYPGSPFTGRK
>gid:145199  dam  DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH
FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVDRHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>gid:142832  dbpA  ATP-independent RNA helicase dbpA
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLSN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQSVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV
AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA
QRANIISDMLQIKLNWQTLPANSSIVPLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKPWKQLQGGKIKGKT
CRVRLLK
>gid:143393  dcm  DNA-cytosine methyltransferase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPPEHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHQE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDKGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>gid:144955  deaD  Cold-shock DEAD-box protein A
MLINFMMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEK
PSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQIL
VLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQI
VVGTPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIP
EGHQTALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVW
GMRKNEALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGD
MNQALREQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSES
YVHRIGRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAEL
LGKRRLEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAAL
LKMAQGERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRR
ERRDVGDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASH
STIELPKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGF
GGERREGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGD
A
>gid:141900  dinG  Probable ATP-dependent helicase dinG
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ
EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDEVLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRVEPSIDNEEQHIAEMAAFFREQVESKKHLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>gid:141384  dinP  DNA polymerase IV
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK
FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL
SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAK
IASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTC
GDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA
EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ
EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG
L
>c5026 dnaB, Replicative DNA helicase
MSPIQQQVTPFVMAGNRPFNKQQTDNRERDPQVAGLKVPPHSIEAEQSVL
GGLMLDNERWDDVAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLA
ESLERQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRERAVVREMISV
ANEIAEAGFDPQGRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDA
TVARIEQLFQQPHDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKT
TFAMNLVENAAMLQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQ
LDDEDWARISGTMGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIG
LIMIDYLQLMRVPALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRS
LEQRADKRPVNSDLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIII
GKQRNGPIGTVRLTFNGQWSRFDNYAGPQYDDE
>gid:146508  dnaC  DNA replication protein dnaC
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>gid:141224  dnaE  DNA polymerase III alpha subunit
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNIGYQNLTLL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERLKRRPEYDERLDTELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVTTAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR
LRQSLEPHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI
GSEQVELEFD
>gid:144859  dnaG  DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLVDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>gid:145667  dnaN  DNA polymerase III, beta chain
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>gid:141260  dnaQ  DNA polymerase III, epsilon chain
MTAMSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNN
FHVYLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVI
HNAAFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDA
LCARYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQ
GETTIQRIVRQASKLRVVFATDEELAAHEARLDLVQKKGGSCLWRA
>gid:141595  dnaX  DNA polymerase III subunit tau
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERL
ASVTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKA
LEHEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDN
AVCLHLRSSQRHLNNRGAQQKLAEALSTLKGSTVELTIVEDDNPAVRTPL
EWRQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI
>gid:144567  endA  Endonuclease I precursor
MMYRYLSIAAVVLSTAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCG
CKINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQ
DGGRKNCAKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYG
QCAMKVDFKEKVAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWD
KMYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>gid:144398  exo  Exodeoxyribonuclease IX
MRGLFPISHPAIACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR
PPMPEELHNEMPALRAAFEQRGVPCWSASGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>c5391 fimB, Type 1 fimbriae Regulatory protein fimB
MKNKADNKKRNFLTHSEIESLLKAANTGPHAARNYCLTLLCFIHGFRASE
ICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEIQALKNWLSIRTSYP
HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRHSCGFA
LANMGIDTRLIQDYLGHRNIRHTVWYTASNAGRFYGIWDRARGRQRHAVL
>c5392 fimE, Type 1 fimbriae Regulatory protein fimE
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY
QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTD
AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG
ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>gid:145067  fis  DNA-binding protein fis
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>gid:141868  gipA  Peyer's patch-specific virulence factor GipA
MKRAYKYRFYPTTEQAELLAQTFGCVRFVYNSILRWRTDAYYERKEKIGY
LQANARLTALKKEPEYIWLNDVSCVPLQQSLRHQQAAFANFFAGRAAYPA
FKSKRHKQVAEFTASAFKHRDGELYIAKSKSPLDVRWSRELPSAPSTVTI
SRDSAGRYFVSCLCEFEPVSMPVTAKTVGIDVGLKDLFVTDTGFKTDNPR
HTAKYAKRLTLLQRRLSRKQKGSRNRIKARLKVARLHAKIADCRMDNLHK
LSRKLINENQVVCVESLKVKNMIRNPKLSKAIADAGWSELVRQLQYKGKW
AGRSVVAIDQYLPSSKCCSCCGFTMQKMPLNVRKWHCPECGADHDRDINA
ARNIKAAGLAVLAHGEPVNPESQHAA
>gid:143795  gyrA  DNA gyrase subunit A
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTSEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>gid:145665  gyrB  DNA gyrase subunit B
MMSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAID
EALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLH
AGGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGV
PQAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYDILAKRLRELSFLNS
GVSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDG
IGVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDK
EGYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAV
EQQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALD
LAGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGK
ILNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDA
DVDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDD
EAMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMER
RYPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKF
DVHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRG
LLEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQ
LWETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKA
ANIDI
>gid:142113  helD  Helicase IV
MGGNCNVFLGWSTFSLTVTLIRNNFVWIYRVMELKATTLGKRLAQHPYDR
AVILNAGIKVSGDRHEYLIPFNQLLAIHCKRGLVWGELEFVLPDEKVVRL
HGTEWGETQRFYHHLDAHWRRWSGEMSEIASGVLRQQLDLIATRTGENKW
LTREQTSGMRQQIRQALSALPLPVNRLEEFDNCREAWRKCQAWLKDIEGA
RLQHNQAYTEAMLIEYADFFRQVESSPLNPAQARAVVNGEHSLLVLAGAG
SGKTSVLVARAGWLLARGEASPEQILLLAFGRKAAEEMDERIRERLHTED
ITARTFHALALHIIQQGSKKVPIVSKLENDTAARHELFIAEWRKQCSEKK
AQAKGWRQWLTEEMQWSVPEGNFWDDEKLQRRLASRLDRWVSLMRMHGGA
QAEMIASAPEEIRDLFSKRIKLMAPLLKAWKGALKAENAVDFSGLIHQAI
VILEKGRFISPWKHILVDEFQDISPQRAALLAALRKQNSQTTLFAVGDDW
QAIYRFSGAQMSLTTAFHENFGEGDRCDLDTTYRFNSRIGEVANRFIQQN
PGQLKKPLNSLTNGDKKAVTLLDESHLDALLDKLSGYAKPEERILILARY
HHMRPASLEKAATRWPKLQIDFMTIHASKGQQADYVIIVGLQEGSDGFPA
AARESIMEEALLPPVEDFPDAEERRLMYVALTRARHRVWALFNKENPSPF
VEILKNLDVPVARKP
>gid:141072  hepA  RNA polymerase associated protein
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPDETEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>gid:143121  himA  Integration host factor alpha-subunit
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>gid:142065  himD  Integration host factor beta-subunit
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>gid:141733  holA  DNA polymerase III, delta subunit
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VAARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMSEALNRLSQSQLRQ
AVQLLTRTELTLKQDYGQSVWAELEGLSLLLCHKPLADVFIDG
>gid:142382  holB  DNA polymerase III, delta' subunit
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNALGIDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA
ALALFQGDNWQARETLCQALAYSVQSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHYGAAHVTNVDVPGLVVELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>c5359 holC, DNA polymerase III, chi subunit
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>gid:146523  holD  DNA polymerase III, psi subunit
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPPHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND
>gid:142854  hrpA  ATP-dependent helicase hrpA
MTEQQKLTFTALQQRLDSLMLRDRLRFSRRLHGVKKVKNPDAQQAIFQEM
AKEIDHAAGKVLLREAARPEITYPDNLPVSQKKQDILEAIRDHQVVIVAG
ETGSGKTTQLPKICMELGRGIKGLIGHTQPRRLAARTVANRIAEELKTEP
GGCIGYKVRFSDHVSDNTMVKLMTDGILLAEIQQDRLLMQYDTIIIDEAH
ERSLNIDFLLGYLKELLPRRPDLKIIITSATIDPERFSRHFNNAPIIEVS
GRTYPVEVRYRPIVEEADDTERDQLQAIFDAVDELSQESPGDILIFMSGE
REIRDTADALNKLNLRHTEILPLYARLSNSEQNRVFQSHSGRRIVLATNV
AETSLTVPGIKYVIDPGTARISRYSYRTKVQRLPIEPISQASANQRKGRC
GRVSEGICIRLYSEDDFLSRPEFTDPEILRTNLASVILQMTALGLGDIAA
FPFVEAPDKRNIQDGVRLLEELGAITTDEQASAYKLTPLGRQLSQLPVDP
RLARMVLEAQKHGCVREAMIITSALSIQDPRERPMDKQQASDEKHRRFHD
KESDFLAFVNLWNYLGEQQKALSSNAFRRLCRTDYLNYLRVREWQDIYTQ
LRQVVKELGIPVNIEPADYREIHIALLTGLLSHIGMKDADKQEYTGARNA
RFSIFPGSGLFKKPPKWVMVAELVETSRLWGRIAARIDPEWVEPVAQHLI
KRTYSEPHWERAQGAVMATEKVTVYGLPIVAARKVNYSQIDPALCRELFI
RHALVEGDWQTRHAFFRENLKLRAEVEELEHKTRRRDILVDDETLFEFYD
QRISHDVISARHFDSWWKKVSRETPDLLNFEKSMLIKEGAEKISKLDYPN
FWHQGNLKLRLSYQFEPGADADGVTVHIPLPLLNQVEESGFEWQIPGLRR
ELIIALIKSLPKPVRRNFVPAPNYAEAFLGRVTPLELPLLDSLERELRRM
TGVTVDREDWHWDQVPDHLKITFRVVDDKNKKLKEGRSLQDLKDALKGKV
QETLSAVADDGIEQSGLHIWSFGQLPESYEQKRGNYKVKAWPALVDERDS
VAIKLFDNPLEQKQAMWNGLRRLLLLNIPSPIKYLHEKLPNKAKLGLYFN
PYGKVLELIDDCISCGVDQLIDANGGPVWTEEGFAALHEKVRAELNDTVV
DIAKQVEQILTAVFNINKRLKGRVDMTMALGLSDIKAQMGGLVYRGFVTG
NGFKRLGDTLRYLQAIEKRLEKLAIDPHRDRAQMLKVENVQQAWQQWINK
LPPARREDEDVKEVRWMIEELRVSYFAQQLGTPYPISDKRILQAMEQISG
>gid:141185  hrpB  ATP-dependent helicase hrpB
MLQCGAKNVNPLERFVSSLPVAAVLPELLAALDGASQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR
AQNCVGPNTRLEVITEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
RYLPLPAHQRFDEAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLTSR
IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFDPRTGLTRLITQRVSQASMTQRAGRAGRLEPGICLHL
IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCGDPAQMSWLDQPPTVN
LLAAKRLLQMLGALDGERLSAQGQKMAALGNDPRLAAMLVSAKNDDEAAT
AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLARAYADRIARRRGQDERYQLANGMGAMLDADDALSRHEWLIAP
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVAWDDAQGTLKAW
RRLQIGQLMVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAAKWLPEYDWPAVDDESLLATLETWLLPHMTGVHSLRGLKSLDIYQ
ALRGLLDWVMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGRVPLVLELLSPAQRPLQITRDLSAFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>c4957 hupA, DNA-binding protein HU-alpha
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>gid:141561  hupB  DNA-binding protein HU-beta
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>gid:145547  insB  Insertion element IS1 1/5/6 protein insB
MALICELDEQWSYVGSKARLHWLWYAYNTKTGGVLAYTFGPRTDQTCREL
LALLTPFNIGMLTSDDWDSYGRVVPKNKHLTGKIFTQRIERNNLTLRTHI
KRLARKQSASHAQLRSTKK
>gid:145550  insI  Transposase insI for insertion sequence element IS30B/C/D
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRG
RRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSSEQISG
WLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRYGRR
HTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATL
VDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELAR
HLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHE
LDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD
>gid:145533  intC  Putative prophage integrase
MALTDIKVKTAKPKDKPYKLADGGGMYLLINTNGSKYWRMKYRFAGKEKM
LSIGVYPDVTLADAREKRSEARKLLAAGGDPGEAKKEEKIAQQMSLQNTF
EAIAREWHQSKADRWSLRYRDEIIDTFEKDIFPYIGKRPIAEIKPMELLE
ALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPDLASALATPK
KVHFPFLTANELPHFLNDLAGYTGSIITKTATQIIMLTGVRTQELRFARW
EDIDFETKLWEIPAEVMKMKRPHIVPLSEQVIMLFKQLEPISKHHPLVFI
GRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILHEQGFNSAWI
EMQLAHVDKNSIRGTYNHALYLDGRREMMQWYADYIDSLSSRES
>gid:141949  intT  Integrase for prophage
MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERHTMEETEAKP
WLGESVDRRTLKDVVELWFKLHGKSLTAGQHVYDKLLLMVDALGNPLATD
LTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRL
GEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKI
CLSTGARWREAINLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMAL
DGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILA
LQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPTSIGIF
>gid:143970  lig  DNA ligase
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKSNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
LPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLAQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKSAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPEPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDETEMLRLLGS
>gid:142400  mfd  Transcription-repair coupling factor
MLIRDMPSYVEAYPNENLTTVMPEQYRYTLPVKAGEQRLLGELTGAACAT
LVAEIAERHAGPVVLIAPDMQNALRLHDEISQFTDQMVMNLADWETLPYD
SFSPHQDIISSRLSTLYQLPTMQRGVLIVPVNTLMQRVCPHSFLHGHALV
MKKGQRLSRDALRTQLDSAGYRHVDQVMEHGEYATRGALLDLFPMGSELP
YRLDFFDDEIDSLRVFDVDSQRTLEEVEEINLLPAHEFPTDKAAIELFRS
QWRDTFEVKRDPEHIYQQVSKGTLPAGIEYWQPLFFSEPLPPLFSYFPAN
TLLVNTGDLENSAERFQADTLARFENRGVDPMRPLLPPQSLWLRVDELFS
ELKNWPRVQLKTEHLPTKAANANLGFQKLPDLAIQAQQKAPLDALRKFLE
SFDGPVVFSVESEGRREALGELLARIKIAPQRIMRLDEASDRGRYLMIGA
AEHGFVDTMRNLALICESDLLGERVARRRQDSRRAINPDTLIRNLAELHI
GQPVVHLEHGVGRYAGMTTLEAGGITGEYLMLTYANDAKLYVPVSSLHLI
SRYAGGAEENAPLHKLGGDAWSRARQKAAEKVRDVAAELLDIYAQRAAKE
GFAFKHDREQYQLFCDSFPFETTPDQAQAINAVLSDMCQPLAMDRLVCGD
VGFGKTEVAMRAAFLAVDNHKQVAVLVPTTLLAQQHYDNFRDRFANWPVR
IEMISRFRSAKEQTQILAEVAEGKIDILIGTHKLLQSDVKFKDLGLLIVD
EEHRFGVRHKERIKAMRANVDILTLTATPIPRTLNMAMSGMRDLSIIATP
PARRLAVKTFVREYDSLVVREAILREILRGGQVYYLYNDVENIQKAAERL
AELVPEARIAIGHGQMRERELERVMNDFHHQRFNVLVCTTIIETGIDIPT
ANTIIIERADHFGLAQLHQLRGRVGRSXPPGICMATDAASKSDDYRCAKT
S
>gid:144459  mutH  DNA mismatch repair protein mutH
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALAGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAKRRVGSPLLWSPNEEE
DRQLREDWEELMDMIVLGQIERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>c5254 mutL, DNA mismatch repair protein mutL
MMPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERG
GAKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEAL
ASISSVSRLTLTSRTAEQQEAWQAYAEGRDMDVTVKPAAHPVGTTLEVLD
LFYNTPARRKFLRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQY
RAVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPA
LAEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQV
DVNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPR
AIPENRVAAGRNHFAEPAVREPVAPRYSPAPASGSRPAASWPNAQPGYQK
QQGEVYRQLLQTPAPMQKPKAPEPQEPALAANSQSFGRVLTIVHSDCALL
ERDGNISLLSLPVAERWLRQAQLTPGEVPVCAQPLLIPLRLKVSGEEKSA
LEKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQ
SVFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGL
LQSVDLHPAIKALKDE
>gid:145500  mutM  Formamidopyrimidine-DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCEKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>gid:144328  mutS  DNA mismatch repair protein mutS
MSTIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMERQQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLENVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>gid:141123  mutT  Mutator mutT protein
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKVEMGETPEQAV
VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGEPWGKEGQ
PGEWMSLVGLNADDFPPANEPVIAKLKRVYAG
>gid:144584  mutY  A/G-specific adenine glycosylase
MPPTTVNSVTMQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVML
QQTQVATVIPYFERFMVRFPTVTDLANAPLDEVLHLWTGLGYYARARNLH
KAAQQVAALHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGN
VKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAM
ICTRSKPKCSLCPLQNGCIAAANNSWSLYPGKKPKQTLPERTGYFLLLQH
EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAF
RHTFSHFHLDIVPMWLPVSSFTGCMDEDNALWYNLAQPPSVGLAAPVERL
LQQLRTGAPV
>gid:141803  nei  Endonuclease VIII
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKSYQSRLIGQHVTHVET
RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT
ILLYSASDIEMLTPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLDIPRLSYATRGQVDENKYHGALFRFKVFHRDGEPCERCGGIIEKTTL
SSRPFYWCPGCQH
>c4955 nfi, Endonuclease V
MDLASLRAQQIELASSVIREDRLDKDPPDLIAGADVGFEQGGEVTRAAMV
LLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPDLV
FVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSEPGALA
PLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRLPE
PTRWADAVASERPAFVRYTANQP
>gid:143717  nfo  Endonuclease IV
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVAEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARIVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQD
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>gid:142586  nohA  Prophage Qin DNA packaging protein NU1 homolog
MKVNKKRLAEIFNVDPRTIERWQSQGLPCASKGSKGIESVFDTAMAIQWY
AQRETDIENEKLRKELDDLRAAAESDLQPGTIDYERYRLTKAQADAQELK
NAREDGVVLETELFTFILQRVAQEISGILVRVPLTLQRKYPDISPSHLDV
VKTEIAKASNVAAKAGENVGGWIDDFRRAEGS
>gid:144204  nohB  Prophage QSR' DNA packaging protein NU1 homolog
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWY
AERDAEIENEKLRREVEELRLASEADLHPGTLEFERHRLTRAQATAQELK
NAKESAEVVETAFCTFVLSRIAREISSILDGIPLSVQRRFPELDNRHIDF
LKRDIIKAMNKAAALDELIPGLLSEYIEQSG
>gid:143034  nth  Endonuclease III
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH
NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>gid:143288  ntpA  dATP pyrophosphohydrolase
MAYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAPQAA
MREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVTRNTESWF
CLALPHERQIVFTEHLAYKWLDASAAAALTKSWSNRQAIEQFVINAA
>gid:142823  ogt  Methylated-DNA--protein-cysteinemethyltransferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKES
YERISATNPGGLSDKLREYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVLRKEWLLRHEGYLLL
>gid:144802  parC  Topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LASERKMNNLLKKELQADAQAYGDERRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS
EE
>gid:144818  parE  Topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQIYNIAFENGEKVQDL
QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>gid:141797  phrB  Deoxyribodipyrimidine photolyase
MTTHLVWFRQDLRQHDNLALAAACRNSSARVLALYIATPRQWAAHNVSPR
QAELINTQLNALQIALAEKGIPLLFREVDDFAASVEIVKQVCAEYRVTHL
FYNYQYEVNERARDVQAERTLRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIEPAPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCLNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQVLEGGPGSVWLNELIWREFYRHLMTYYPSLCKHRPFIA
WTDRVQWQSNPAHLKAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDREGEFIRQWLPELRDVPGKAVHEPWKWAEKTGVTLDYPQP
IVDHKEARLRTLAAYEEARKGA
>gid:141073  polB  DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV
PRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREAG
VTVYEADVRPPERYLMERFITSPVWVEGDMRNGAIVNARLKQHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVAS
RPLLLEKLNAWFATHDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQSNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLASSITMRGHQIMRQTKTLIEAQGYDVIYGDTDSTF
VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYVRETIDKLMAGELDTQLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENHKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>c4888 priA, Primosomal protein N'
MQTDPHSSTAMCILTHRISSQDDAMPVAHVALPVPLPRTFDYLLPEGMTV
KAGSRVRVPFGKQQERIGVVVSVSDVSELPLNELKAVVEVLDVEPVFTHS
VWRLLLWAADYYHHPIGDVLFHALPILLRQGRPAANAPMWYWFATEQGQA
VDLNSLKRSPKQQQALAALRQGKIWRDQVAELEFNDAALQALRKKGLCDL
ASETPEFSDWRTNYAVSGERLRLNTEQATAVGAIHSAADTFSAWLLAGVT
GSGKTEVYLSVLENVLAQGKQALVMVPEIGLTPQTIARFRERFNAPVEVL
HSGLNDSERLSAWLKAKNGEAAIVIGTRSALFTPFKNLGVIVIDEEHDSS
YKQQEGWRYHARDLAVYRAHSEQIPIILGSATPALETLCNVQQKKYRLLR
LTRRAGNARPAIQHVLDLKGQKVQAGLAPALITRMRQHLQANNQVILFLN
RRGFAPALLCHDCGWIAECPRCDHYYTLHQAQQHLRCHHCDSQRPVPRQC
PSCGSTHLVPVGLGTEQLEQTLAPLFPGVPISRIDRDTTSRKGALEQQLA
EVHRGGARILIGTQMLAKGHHFPDVTLVALLDVDGALFSADFRSAERFAQ
LYTQVAGRAGRAGKQGEVVLQTHHPEHPLLQTLLYKGYDAFAEQALAERR
MMQLPPWTSHVIVRAEDHNNQHAPLFLQQLRNLILASPLADEKLWVLGPV
PALAPKRGGRWRWQILLQHPSRVRLQHIINGTLALINTIPDSRKVKWVLD
VDPIEG
>gid:141592  priC  Primosomal replication protein N
MKTTLLLEKLEGQLATLRQRCAPVAQFATLSARFDRHLFQTRATTLQACL
DEAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREATAWSLREWDS
APPKIARWQRKRIQHQDFERRLREMVAERRARLARATALVEQQTLHREVE
AYEARLARCRHALEKIENRLARLTR
>gid:145503  radC  DNA repair protein radC
MKVKNNAQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEM
LENFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREE
SPLLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHSRLFSGTLNHV
EVHPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFM
DLRVLDHIVIGRGEYVSFAERGWI
>gid:141507  rdgC  Recombination associated protein rdgC
MAMQGRRQFVIMPAKFNDKAVEIIMLWFKNLMVYRLSREISLRAEEMEKQ
LASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILP
SPVIKQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQT
MMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLSMENPIELTL
TEWVRSGSTAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAG
KVVTKLALDWQQRIQFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFD
ADFILMTGELAALIQNLIEGLGGEAQR
>gid:144289  recA  RecA protein
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL
TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>gid:144447  recB  Exodeoxyribonuclease V beta chain
MKRMSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGS
AAFPRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLY
ARLLEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGML
FEQQLIEDESLLRYQACADFWRRHCYPLTREIAQVVFETWKGPQALLRDI
NRYLQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALI
ESSGIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKFSQRFLEDR
TKAGGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRG
ELGFDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIF
RRIWHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNW
RSAPGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQP
AMKMWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDAR
PVRASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEML
WLLQAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDG
YRQIWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEA
GTQLESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGL
EYPLVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEAERLAE
DLRLLYVALTRSVWHCSLGVAPLVRRRGDKKSDTDVHQSALGRLLQKGEP
QDAAGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDVLTAELNARTLQRLP
GDNWRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPR
GASPGTFLHSLFEDLDFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAV
LQAPLNETGVSLNQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLS
AGCPPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQA
MAAAMQAHRYDLQYQLYTLALHRYLRHRIADYDYERHFGGVIYLFLRGVD
KEHPQQGIYTTRPNAGLIALMDEMFAGMTLEEA
>gid:144449  recC  Exodeoxyribonuclease V gamma chain
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTDELGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELALRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNVLVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPE
TMEAIVEQSQRFLLPLFRFNQS
>gid:144446  recD  Exodeoxyribonuclease V alpha chain
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEASHPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPVSDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERAGQLSRLTGSHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRNPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFSSRE
>gid:145666  recF  DNA replication and repair protein recF
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA
ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>gid:145518  recG  ATP-dependent DNA helicase recG
MVHHAGRRVSAMKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLH
LPLRYEDRTHLYPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGS
GILTMRFFNFNAAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDL
STPELQETLTPVYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQ
GMMTLPEALRTLHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLAL
RAGAQRFHAQPLSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDVP
MMRLVQGDVGSGKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRN
WFAPLGIEVGWLAGKQKGKARLAQQEAIASGQVQMIVGTHAIFQEQVQFN
GLALVIIDEQHRFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTA
YADLDTSVIDELPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWV
CTLIEESELLEAQAAEATWEELKLALPELNVGLVHGRMKPADKQAVMASF
KQGELHLLVATTVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGA
VASHCVLLYKTPLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTR
QTGNAEFKVADLLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETER
YSNA
>gid:144508  recJ  Single-stranded-DNA-specific exonuclease recJ
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDPQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQVEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFELFQQRFG
ELVTEWLDPSLLQGEVVSDGPLSAAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>gid:144167  recN  DNA repair protein recN
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSQLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALETARALHQQRQHYANELAQLITDSMHALSMPHGQFTIDVKFD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLDKKARLQELARLLGGSEVTRNTLANAKEL
LAA
>gid:144113  recO  DNA repair protein recO
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGVTGTPEPALRRFELALLGHLGYGVNFTH
CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>gid:145798  recQ  ATP-dependent DNA helicase recQ
MNVAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMP
TGGGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNST
QTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEA
HCISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGL
NDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAK
VEDTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGM
GINKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWL
RRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCG
NCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIR
DYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAA
RPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIAD
ESNVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMAL
IRAHVDGDDEE
>gid:141598  recR  Recombination protein recR
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADXYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>gid:145747  rep  ATP-dependent DNA helicase rep
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVGQTLGRKEARGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPA
QAAAEAKGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGAELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR
GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>gid:145495  rfaP  Lipopolysaccharide core biosynthesis protein rfaP
MVWMVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSY
FLKWHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAF
GEKGINPLTRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVA
TMVRDMHAAGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRV
PRRWRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFSAPLKDILKQEQGLLS
QAEAKATKIRERTIRKSL
>gid:145749  rhlB  Putative ATP-dependent RNA helicase rhlB
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>gid:141897  rhlE  Putative ATP-dependent RNA helicase rhlE
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGRGQ
GGGRGQQQPRRAESGAKSGNAKPAEKPSRRLGDAKPAGEQQRRRRPRKPA
AAQ
>gid:141259  rnhA  Ribonuclease HI
MVSVSRTIWRVIAVLIAVIYVRLVVLQFDSITGSLPEMLKQVEIFTDGSC
LGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELMAAIVALEALKEHC
EVILSTDSQYVRQGITKWIHNWKKRGWKTADKKPVKNVDLWQRLDAALGQ
HQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTGYQVEV
>gid:141223  rnhB  Ribonuclease HII
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALCEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPSMAVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEYGATEHHRRSFGPVKRALGLAS
>gid:143054  rnt  Ribonuclease T
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLPAAEEV
>gid:142575  rus  Crossover junction endodeoxyribonuclease rusA
MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIG
LAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVV
KMPVTKGGRLELTITEMGNE
>gid:143284  ruvA  Holliday junction DNA helicase ruvA
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDASSETLIREALR
AAL
>gid:143283  ruvB  Holliday junction DNA helicase ruvB
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATVRAWNHFGITPPEMP
>gid:143286  ruvC  Crossover junction endodeoxyribonuclease ruvC
MMAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIY
AGVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPV
FEYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAIT
HCHVSQNAMQMSESRLNLARGRLR
>gid:143559  sbcB  Exodeoxyribonuclease I
MIFDTLADSRNNGFNLMMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAI
RTDSEFNVIGEPEVFYCKPADDYLPQPGAVLITGITPQEARAKGENEAAF
AARIHSLFTVPKTCILGYNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSR
WDLLDVMRACYALRPEGINWPENDDGLPSFRLEHLTKANGIEHSNAHDAM
ADVYATIAMAKLVKTRQPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSG
MFGAWRGNTSWVAPLAWHPENRNAVIMVDLAGDISPLLELDSDTLRERLY
TAKADLGDNAAVPVKLVHINKCPVLAQANTLRPEDADRLGINRQHCLDNL
KILRENPQVREKVVAIFAEAEPFTPSENVDAQLYNGFFSDADRAAMKIVL
ETEPRNLPALDITFVDKRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQV
FTPEFLQGYADELQMLAQQYADDKEKVALLKALWQYAEEIV
>gid:141510  sbcC  Exonuclease sbcC
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVGFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGKISAMVFEQHKSARTEL
EKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQEQQSLNWLT
RLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNA
WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLTLTADEVASALAQHAEQRTLRQRLVALHGQIVPQQKRLAQLQVTIQ
NVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGATLR
GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQP
WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL
AGYALTLPQEDEEESWLATRQQEAQSWQHRQNELTALQNRIQQLTPILET
LPQSDELPHSEETVVLENWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLTQHQQHRPGGLSLTVTVEQIQQELAQTHQKLRENTTSQGEIRQ
QLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG
LTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTL
SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL
DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESAFAMK
>gid:141511  sbcD  Nuclease sbcCD subunit D
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQAHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHTPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
LEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFATTLHSLAGEHEA
>gid:143557  sbmC  DNA gyrase inhibitory protein
MNYEIKQEDKRTVAGFHLVGPWEQTVKKGFEQLMMWIDSKNIVPKEWVAV
YYDNPDETPAEKLRCDTVVTVPNNFTLPENSEGVILTEISGGQYAVAVAR
VVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYV
AVQPKHH
>gid:141784  seqA  SeqA protein
MCHRAATNKVYSEFFCKPGLTLSAAGIYSYTPGDLYSAKTLHWIKMKTIE
VDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVRVASPA
IVEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYSLDAQA
FAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITNTNTGR
KCSMIEHIMQSMQFPAELIEKVCGTI
>gid:145088  smf  Unknown protein fragment 1
MVDTEIWLRLISISSLYGDDMVRIAHWLAKQSHIDAVVLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLATTDYPGALFVEG
ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDG
VAHKAALQVNGVSIAVLGNGLNTIHPRRHAPLAASLLEQGGALVSEFPLD
VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS
PDQEDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>gid:144124  srmB  ATP-dependent RNA helicase srmB
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>c5049 ssb, Single-strand binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>gid:145408  tag  DNA-3-methyladenine glycosylase I
MIINTTPGKSRIRQYYCHCMKDIGHSSPVLNIDFFTDASRKAAEIFTIGY
IARESMERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLS
WITVLKKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAI
IGNARAYLQMEQNGEPFADFVWSFVNHQPQVTQATTLSEIPTSTPASDAL
SKALKKRGFKFVGTTICYSFMQACGLVNDHVVGCCCHPGNKP
>gid:142758  topA  DNA topoisomerase I
MRRIPGRIKLGKVNMGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRD
LPTSGSTAKKSADSTSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVL
PGKEKVVSELKQLAEKADHIYLATDLDREGEAIAWHLREVIGGDDARYSR
VVFNEITKNAIRQAFNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKK
IARGLSAGRVQSVAVRLVVEREREIKAFVPEEFWEVDASTTTPSGEALAL
QVTHQNDKPFRPVNKEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFIT
STLQQAASTRLGFGVKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVR
GYISDNFGKKYLPESPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEAD
AQKLYQLIWRQFVACQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTK
VMPALRKGDEDRILPAVDKGDALTLVELTPAQHFTKPPARFSEASLVKEL
EKRGIGRPSTYASIISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRE
LMNYDFTAQMENSLDQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGG
MRPNQMVLTSIDCPTCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINL
VPENEVLNVLEGEDAETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNN
PTCDGYEIEEGEFRIKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEEC
KNTRKILRNGEVAPPKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTF
PKSRETRAPLVEELYRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTK
QQYVSSEKDGKATGWSAFYVDGKWVEGKK
>gid:143177  topB  DNA topoisomerase III
MANWCQRAAVVPWLRSVNSMRLFIAEKPSLARAIADVLPKPHRKGDGFIE
CGNGQVVTWCIGHLLEQAQPDAYDSRYARWNLADLPIVPEKWQLQPRPSV
TKQLNVIKHFLHEASEIVHAGDPDREGQLLVDEVLDYLQLAPEKRQQVQR
CLINDLNPQAVERAIDRLRSNSEFVPLCVSALARARADWLYGINMTRAYT
ILGRNAGYQGVLSVGRVQTPVLGLVVRRDEEIENFVAKDFFEVKAHIVTP
ADERFTAIWQPSEACEPYQDEEGRLLHRPLAEHVVNRISGQPAIVTSYND
KRESESAPLPFSLSALQIEAAKRFGLSAQNVLDICQKLYETHKLITYPRS
DCRYLPEEHFAGRHAVMNAISVHAPDLLPQPVVDPDIRNRCWDDKKVDAH
HAIIPTARSSAINLTENEAKVYNLIARQYLMQFCPDAVFRKCVIELDIAK
GKFVAKARFLAEAGWRTLLGSKERDEENDGTPLPVVAKGDELLCEKGEVV
ERQTQPPRHFTDATLLSAMTGIARFVQDKDLKKILRATDGLGTEATRAGI
IELLFKRGFLTKKGRYIHSTDAGKALFHSLPEMATRPDMTAHWESVLTQI
SEKQCRYQDFMQPLVGTLYQLIDQARSTPVRRFKGIVVPRNGDEKNKDVK
KRVKKTAQKKDATVKRD
>gid:142646  umuC  UmuC protein
MFALCDVNAFYASCETVFRPDLWGKPVVVLSNNDGCVIARNAEAKALGVK
MGDPWFKQKDLFRRCGVVCFSSNYELYADMSNRVMSTLEELSPRVEIYSI
DEAFCDLTGVHNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLAN
YAAKKWQRQTGGVVDLSNLERQRKLMSALPVDEVWGIGRRISKKLDAMGI
KTVLDLADSDIRFIRKHFNVVLERTVRGLRGEPCLQLEEFAPTKQEIICS
RSFGERITDYPSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFAFN
EPYYGNSASVKLLTPTQDSRDIINAATRSLDAIWQAGHRYQKAGVMLGDF
FSQGVAQLNLFDDNAPRPGSEQLMAVMDTLNAKEGRGTLYFAGQGIQQQW
QMKRAMLSPCYTTRSSDLLRVK
>gid:144129  ung  Uracil-DNA glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIATPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDXQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQRGETPIDWMPVLPAESX
>c5048 uvrA, Excinuclease ABC subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLYARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECKASHTARFLKPML
>gid:141877  uvrB  Excinuclease ABC subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVXLFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLMEEQRLTQRTQFDLEMMNEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>gid:143338  uvrC  Excinuclease ABC subunit C
MSDQFDAKAFLKTVTSQPGVYRMYDAGGTVIYVGKAKDLKKRLSSYFRSN
LASRKTEALVAQIQQIDVTVTHTETEALLLEHNYIKLYQPRYNVLLRDDK
SYPFIFLSGDTHPRLAMHRGAKHAKGEYFGPFPNGYAVRETLALLQKIFP
IRQCENSVYRNRSRPCLQYQIGRCLGPCVEGLVSEEEYAQQVEYVRLFLS
GKDDQVLTQLISRMETASQNLEFEEAARIRDQIQAVRRVTEKQFVSNTGD
DLDVIGVAFDAGMACVHVLFIRQGKVLGSRSYFPKVPGGTELSEVVETFV
GQFYLQGSQMRTLPGEILLDFNLSDKTLLADSLSELAGRKINVQTKPRGD
RARYLKLARTNAATALTSKLSQQSTVHQRLTALASVLKLPEVKRMECFDI
SHTMGEQTVASCVVFDANGPLRAEYRRYNITGITPGDDYAAMNQVLRRRY
GKAIDDSKIPDVILIDGGKGQLAQAKNVFAELDVSWDKNHPLLLGVAKGA
DRKAGLETLFFEPEGEGFSLPPDSPALHVIQHIRDESHDHAIGGHRKKRA
KVKNTSSLETIEGVGPKRRQMLLKYMGGLQGLRNASVEEIAKVPGISQGL
AEKIFWSLKH
>gid:145788  uvrD  DNA helicase II
MAEGVALLRPTYFYAAVPMDVSYLLDSLNDKQREAVAAPRSNLLVLAGAG
SGKTRVLVHRIAWLMSVENCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQ
GGMWVGTFHGLAHRLLRAHHMDANLPQDFQILDSEDQLRLLKRLIKAMNL
DEKQWPPRQAMWYINSQKDEGLRPHHIQSYGNPVEQTWQKVYQAYQEACD
RAGLVDFAELLLRAHELWLNKPHILQHYRERFTNILVDEFQDTNNIQYAW
IRLLAGDTGKVMIVGDDDQSIYGWRGAQVENIQRFLNDFPGAETIRLEQN
YRSTSNILSAANALIENNNGRLGKKLWTDGADGEPISLYCAFNELDEARF
VVNRIKTWQDNGGALAECAILYRSNAQSRVLEEALLQASMPYRIYGGMRF
FERQEIKDALSYLRLIANRNDDAAFERVVNTPTRGIGDRTLDVVRQTSRD
RQLTLWQACRELLQEKALAGRAASALQRFMELIDALAQETADMPLHVQTD
RVIKDSGLRTMYEQEKGEKGQTRIENLEELVTATRQFSYNEEDEDLMPLQ
AFLSHAALEAGEGQADTWQDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPS
QMSLDEGGRLEEERRLAYVGVTRAMQKLTLTYAETRRLYGKEVYHRPSRF
IGELPEECVEEVRLRATVSRPVSHQRMGTPMVENDSGYKLGQRVRHAKFG
EGTIVNMEGSGEHSRLQVAFQGQGIKWLVAAYARLETV
>gid:143392  vsr  Very short patch repair protein
MVDVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
INRLQALGWRVLIVWECALRGREKLTDEALTERLEEWICGEGASAQIDTQ
GIHLLA
>gid:143598  wcaH  GDP-mannose mannosyl hydrolase
MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG
GRVQKDETLEAAFERLTMAELGLRLPITVGQFYGVWQHFYDDNFSGTEFT
THYVVLGFRFRVAEDELLLPDEQHDDYRWLTPDALLASDNVHANSRAYFL
SEKRAGVPGL
>gid:145786  xerC  Integrase/recombinase xerC
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ
CDVTMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>gid:144510  xerD  Integrase/recombinase xerD
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>gid:144053  xseA  Exodeoxyribonuclease VII large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE
SMQPAGEGLLQQKYEQLKAKLQAEGLFELQYKKSLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYSTAVQGDDAPGQIVRAIELANKRNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASRIPIVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRAGQQQQRLTRQLVQ
QNPQSRIHRAQTRIQQLEYRLAETLRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTSAADGAVLKQVKQVKVGETLTTRLGDGVVISEVSAVTKTRK
SRKKTSNP
>gid:141538  xseB  Exodeoxyribonuclease VII small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDASLTPFTPDNE
>gid:143161  xthA  Exodeoxyribonuclease III
MAATMKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEE
VAKLGYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEI
PSPLGNVTVINGYFPQGESRDHPIKFSAKAQFYQNLQNYLETELKRENPV
LIMGDMNISPGDLDIGIGEENRKRWLRTGKCSFLPEEREWMERLMSWGLV
DTFRHANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGI
DYEIRSMEKPSDHAPVWATFRR
>gid:141563  ybaV  Hypothetical protein ybaV precursor
MKHGIKALLITLSLACAGMSHSALAAASVAKPTTVETKAEAPAAQSKAAV
PAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV
EDLKQVPGMGNSLVERNLAVLTL
>gid:141579  ybaZ  Hypothetical protein ybaZ
MQPIACQMLVSCAMRLHSGVFPDYAEKLPQEEKMEKEDSFPQRVWQIVAA
IPEGYVTTYGDVAKLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTI
SLTGPDLQRQRQALLAEGVMVSGSGQIDLQLYRWNY
>gid:142028  ybjD  Hypothetical protein ybjD
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
ELYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKVISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>gid:142046  ycaJ  Hypothetical protein ycaJ
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI
AELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>gid:142383  ycfH  Putative deoxyribonuclease ycfH
MSITCNRALCYRFLIFKRDIMFLVDSHCHLDGLDYESLHKDVDDVLAKAA
ARDVKFCLAVATTLPGYLHMRDLVGERDNVVFSCGVHPLNQNDPYDVEDL
RRLAAEEGVVALGETGLDYYYTPETKVRQQESFIHHIQIGRELNKPVIVH
TRDARADTLAILREEKVTDCGGVLHCFTEDRETAGKLLDLGFYISFSGIV
TFRNAEQLRDAARYVPLDRLLVETDSPYLAPVPHRGKENQPAMVRDVAEY
MAVLKGVAVEELAQVTTDNFARLFHIDASRLQSIR
>gid:144846  ydcM  Hypothetical protein ydcM
MLIQKAYKYRLYPTDQQAQRLRQLCGCARFVWNYALNETLSIHDAGGKIP
SAFDLNKRLTGWKKLPELAFLSEGYTDNLQQKLKDLRSAWDRCFDKSLTA
EKPVFKKKTKGCDSIRFVNFSKYCGLDYGRVKLPSGLGWVKFRQSRKIEG
VIKNCTISQHAGHWYVSFQVELAVTDPIHASTSAIGLDAGITKLATLSDG
TVFEPVNSLKKNQDKLARLQRRLARMVKFSANWKKQKAKISRFHSHIANI
RRDYLHKTTTTISKNHAMIVIEDLKVSNMSKSAAGTVDQPGRNVAAKSGL
NRAILDQGWAEMRRQLEYKQAWRGGDVLAINPAYTSQKCACCGHTSKNNR
RTQASFICTACGYTANADVNGARNILTAGFEVMAAGQILPSVRKGRARKA
A
>gid:144209  ydfP  Hypothetical protein ydfP precursor
MNQIFTVILLVLVGFVVGNVWSDRGWQKKWAERDAAELSQEVNVQFAARI
IEQGRSISRDEAVKDAQQKAAEISARAADLSDSVNQLRAEATKYAIRLDA
AQHTANLAAAVRGKTTKAAEGMLTNMLGDIAAEAQLYAEIADKRYIAGVT
CQRIYESLRDKKYQM
>gid:143151  ydjQ  Hypothetical protein ydjQ
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY
IGKSINIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQTIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHD
DHALRLRQSLERLRVVCWPWKGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LKEATTLIRAPAGFDHDGYKILCKPLLSGNYEITELDPVNDQQAS
>gid:143229  yeaB  Hypothetical protein yeaB
MEYRSLTLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDTSVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>gid:143742  yejH  Hypothetical protein yejH
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHEKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
TRNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAA
TRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGSERDVLIDDFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>gid:143815  yfaO  Putative Nudix hydrolase yfaO
MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWALSGGGVEPGERIEEAL
RREIREELGEQLLLTEITPWTFSDDIRTKTYADGRKEEIYMIYLIFDCVS
ANRDVKINEEFQDYAWVKPEDLVHYDLNVATRKTLRLKGLL
>gid:143864  yfcD  Hypothetical protein yfcD
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>gid:144019  yffH  Hypothetical protein yffH
MFISRCGVGMTQQITLVKDKILSDNYFTLHNITYDLTRKDGEVIRHKREV
YDRGNGATILLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDND
EPEVCIRKEAIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDSQR
ANAGGGVEDEDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQMSHLMD
>gid:144038  yfgE  Hypothetical protein yfgE
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>gid:144458  ygdP  (Di)nucleoside polyphosphate hydrolase
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNASAYRRKRG
>gid:144804  ygiV  Hypothetical protein ygiV
MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDDPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEEMTERR
LETDIYVPLA
>gid:144864  ygjF  G/U mismatch-specific DNA glycosylase
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ
LKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ
PQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLE
KLVEAYRELDQALVVRGR
>gid:144947  yhbQ  Hypothetical protein yhbQ
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAVFAELLSSLQTPEIKSD
>gid:145068  yhdJ  Hypothetical adenine-specific methylase yhdJ
MTMRTGCEPTRFGNEAKTIIQGDALTELKKLPAESVDLIFADPPYNIGKN
FDGLIEAWKEDLFIDWLFEVIAECHRVLKKQGSMYIMNSTENMPFIDLQC
RKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAIL
VEAKTGSQRALINYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ
KPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAVASGRKFIGIEINSEY
IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLIAK
>gid:145298  yhhF  Putative methylase yhhF
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>gid:145512  yicF  Hypothetical DNA ligase-like protein yicF
MMMKVWMAILISILCWQSSAWAVCPAWSPARAQEEISRLQQQIKQWDDDY
WKEGKSEVEDGVYDQLSARLTQWQRCFGNETRDVMMPPLNGAVMHPVAHT
GVRKMADKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNG
LKGEDWTQKVRLISAVPQTVSGPLANSTLQGEIFLKRKGHIQQQMGGINA
RAKVAGLMMRQGNSDTLNSLAVFVWAWPDGPHLMTDRLKDLATAGFTLTQ
TYTRAVKNADEVAHVRNEWWKAKLPFVTDGVVVRAAKEPESRHWLPGQAE
WLVAWKYQPVAQVAEVKAIQFAVGKSGKISVVASLAPVMLDDKKIQRVNI
GSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFN
SLTCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHI
FSWLLLTPEQLQNTPGIAKSKSAQLWHQFNLARQQPFTRWVMAMGIPLTR
AALNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGS
WLSAQQITGFEP
>c4788 yigW, Conserved hypothetical protein
MEYRMFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQ
KLARQYSSCWSTAGVHPHDSSQWQAVTEEAIIELAAQPEVVAIGECGLDF
NRNFSTPEEQELAFVAQLRIAAELNMPVFMHCRDAHERFMTLLEPWLDKL
PGAVLHCFTGTREEMQACVARGIYIGITGWVCDERRGLELRKLLPLIPAE
KLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAA
TTDANVKTLFGIAF
>c4953 yjaD, NADH pyrophosphatase
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLVQLQRRHDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL
MTAFMAEYDSGEIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>gid:146532  yjjV  Putative deoxyribonuclease yjjV
MICRFIDTHCHFDFPPFSGDEEASLQRAAQAGVGKIIVPATEAENFARVQ
ALAEKYQPLYAALGLHPGMLEKHSDVSLDQLQQALERHPAKVVAVGEIGL
DLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRHDL
PCTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLA
SLLLETDAPDMPLNGFQGQPNRPEQAARVFAVLCELRPEPADEIAEVLLN
NTYTLFNVP
>gid:144571  yqgF  Hypothetical protein yqgF
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNIIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY
>gid:144822  yqiE  ADP-ribose pyrophosphatase
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>gid:144940  yraN  Hypothetical protein yraN
MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI
DLIMREGRTTVFIEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS
>gid:145086  yrdD  Hypothetical protein yrdD
MAKSALFTVRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSS
ADGHIVKVLEGQVCPACGANLVLRQGRFGMFIGCSNYPECEHTELIDKPD
ETAITCPQCRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPEC
HYPLLIEKKTAQGVKHFCASKQCGKPISAE
>gid:145209  yrfE  ADP compounds hydrolase nudE
MSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVFEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEAPDFNEARNVSALFLVREWLKGQGRV