TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Escherichia coli O157:H7 EDL933, EDL933
Gene type: CDS

Number of genes found: 351

Free access
Sort by:

 



# Escherichia coli O157:H7 EDL933, EDL933

>Z2563 putative transposase (partial)
MGANSKSSALFIRLLKRLKATYCRTKTITLLEDNYIIHKGRETQRWLKDK
PKCRVIYQPV
>Z3945 
MGYKVKKFIMSSGERGCLILDKKSNLPTYYQNLFLTTDIRNRGATASTME
IVATNLLIFSNFLDGRGIDIIERVELKKYLSVAEIDDLVRYAKQRFDRQK
IINIKSTNYRFIAKRTFSYRIHVFSRYLKWLCGLVHSSKGIHAKYEVDVF
IESIRAHIPRNSSLNMNEISEKSLNEEEIKVLFRLLEIGGIENPFHKEVQ
VRNRLIFTLLLNLGLRAGELLNLKIDDFDLRDNTLSVVRRHDSKEDGRSY
QPLVKTGERVIPLSDELAREIFDYISNSREKMTKRKKHNFLFVAHCTGKN
AGEPLSISAYEKVISTLKRASPELYNLSGHRLRHSWNYMYSKRNRRC
>Z3561 putative regulator
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>Z0328 unknown protein encoded in prophage CP-933I
MNDNFFTFRKIKVTGFNKLDAIIEFGSKLTILYGGSDSGKTYIYYLIRYL
LGSEKLKNKDIDHAQGYDLAYLEFNFQGRVMTIERSLQDSAHYRLYDSSI
ENVSEANLLMVFSKSASSKKSFSSYFYGRLNFKEAKVRTNLSNTLHKFNL
NNVFEFFCIDELRVLTEKSLILSDIPSEETKRKSEFKFLLTQRDDTNSLA
EKPNKKARYF
>Z1573 unknown in IS600
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>Z3333 unknown protein encoded within prophage CP-933V
MLTTQKRKFALALMSGKNKTASAXAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCETVEVDEVPVYRQKKSEPEDKPRRREAAAIPQPDETNPEMPPPVV
ISPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z3024 orf, hypothetical protein
MVVTTSDVVMCQMRHSDVQGVYRVYGSWMAENFQDQVSISNQIMSKFAPS
MPHAVRSDVINNRLHNLYLHAHYFLICRHQLITHLNPHLHRN
>Z1208 unknown in putative ISEc8
MELQDWRKEPRKNYSNEFKLRMVELASQPGASVARIAREHDINDNLLFKW
LRLWQNEGRISRRLQVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTS
TQTSVSAGSCKVEFRHGNMTLENPSPELLTLLIRELTGRGR
>Z4338 unknown protein encoded by ISEc8
MISLPSGTRIWLVAGVTDMCKSFNGLGEQVQHVLNDNPFSGHLFIFRGRR
GDTVKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>Z1657 putative DNA repair protein, RADC family
MQQLSFLPGEMTPGERSLILRALQTLDRHXHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETXFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGNQVF
SFAEHGLL
>Z3925 partial putative transposase
MGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQYVSLAYTERLK
EAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTW
VDWYNNRRLLGRLGHTPPAEAEKAYYASIGNDDLAA
>Z1866 putative integrase of prophage CP-933X
MAASPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQ
VATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYNSIQEDR
LQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGH
NRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPE
WQAIFDSVSRRQPYLKCGMLLALVTGQRLSDICNLKFSDIWDDMLHITQE
KTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRG
DQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQ
KLLGHKSRKMTDRYNDDRGKDWIIVDIKTA
>Z6046 putative terminase subunit encoded by cryptic prophage CP-933P
MLTTQKRKFALALMSGKNKTASALAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCETVEVDEVPVYRQKKSEPEDKPRRREAAAIPQPDETNPEMPPPVV
ISPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z2981 IS629 transposase encoded within prophage CP-933T
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFSGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTERLKEAKLLASTGSTGDSYDNVFY
>Z1559 putative P4-family integrase
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQ
KRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPT
FADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADV
AETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQ
TRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEA
RGMRWAEIDFHKRVWTIPAERMKARIQHRVPLSRQAIYILENIRGLHDEL
VFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQ
GYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK
>Z4799 putative DNA processing protein
MDADSSNSTVTPGRLPDGLSSCPCFYLGKKLMNLSANAQATLLLTSDFSR
AAASKYKPLSNSEWGKFALWLKHQRISPAELLVPQPQEKLTGWSDPRISQ
ERILGLLARGHSLALAVDKWQRAGLWILTRGDADYPVRLKNRLRTDAPPV
LFGCGNKALLQAEGMAIVGSRDAPTDDLRYTQQLAAKLAQQGICVISGGA
RGIDECAMASALEAGGTAVGVLADSLLKTSTLVKWREGLIAGNLVLISPF
YPEVRFTVGNAMARNKYIYCLAESAMVVRAGMTGGTITGAMEALKHQWLP
VQVKPNQDMQSANSRLVENGASWSAEQAENVTIRLPDVPGLMYDRALRNA
QPELFSLHEDDANYAVMPAYTPVDFYQLFVAELAILAKESISIERLASCT
GLTIEQISVWLNRAEEEGRVIRLGEGHYQFR
>Z4193 type III secretion apparatus protein
MQLKNLQSLLDMKELLGEVVFRQDIFYSLRKVTVIQQQIAEINLEKQKIA
ERRKILNKEIVQQQAQRKHWWLKGEKYDRLKKRIKKQLLNQMLYQDELEQ
EEKYNGRSQEN
>Z4330 putative transposase
MQKAIYKTHDKNYARRLTAMLMLHRSARVSDVARTLCCARSSVRRWINWF
TLSSVEGLKSLPAGRSRRWPFEHICTLFRELIKHSPGYFGYQRSRWSTEL
LAIKINEITECQLHAGTVRRWLPLVGLVWRRAAPTLRTRNPHKAAIHKAL
DECSAEHPVFYEDEVYIYLNPKTGADWQLRRQQKHVVTPGQNEKYYLAGA
LHSGTGKVSYVGGNSKSSALFISLLKHLKLTYRRDKPITLIVDNYIIHKS
RETQRWLKENPKFRVIYQPVYSPWVNHVERL
>Z2396 putative DNA replication factor encoded by prophage CP-933R
MKNIATGGVLERIRRLAPPHVTAPFRTVAEWREWQLAEGQKRCEEINRQN
RQLRVEKILNRSGIQPLHRKCSFANYQVQNDGQRYALSQAKSIADELVTG
CTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDD
GQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVG
MLTNLNYEAMKTLLGERIMDRMTMNGGRWVNFNWESWRPNVGQPGIEK
>Z1827 putative IS encoded protein
MVRRLRFSGPKTSIICSPMTSLKTSIKTITYLSDTGCLEIQGASLVIYTT
NAIESLNSVIRHAIKKRKVFPTDDSVKKVVWLAIQAASQKWTMPLRDWRM
AMSRFIIEFGDRLDGHF
>Z4314 unknown protein encoded by ISEc8
MARGKAAITFFREPPATSCDSRCSCRTARIARRGPGNPQYQLLSTTDDYC
TVRDSVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTE
EALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKTL
SRHLELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKNF
LFFGSDHGGERGALLYSLIGMCKLNDVDPESYXRHVLGVIADWPVNRVSE
LLPWRIALPAE
>Z5885 putative resolvase
MTMAIFGYGRVSTSQQDTENQRMELEQAGWTFDFWFSDVVSGKVPAVQRK
AFFEMLSKIRDGETLVVAKLDRLGRDAIDVLQTVRTLADRNIKVIVHQLG
TTDLTSAAGKLLLSMLAAVAEMERDLLIERTNAGLLRAKAEGRKLGRPAK
IAPEARGAILDKRAAGVSVSALAREYGVSRATLAALLNKGY
>Z2982 unknown protein of IS629 encoded within prophage CP-933T
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1933 unknown in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1160 unknown in ISEc8
MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLDENPFSGHLFIFRGRR
GDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>Z3924 partial putative transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVXXRPDQ
LWVADFTYVSTWQGFVYVAFIN
>Z3759 putative DNA replication factor
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>Z3095 putative transposase encoded within prophage CP-933U
MSSTGSDRYAXNCILPRQRITIVSNSDIIPDKRSARAQHDDWLKREIQRV
YDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLRGKKVRTT
ISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAFIIDVFAG
YIVGWRVSSSMETTFVLXALEQALWARRPSGTIHHSDKGSQYVSLAYTER
LKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRAEVELATL
TWVDWYNNRRLLGRLGHTPPAEAEKAYYASIGNDDLAA
>Z1882 putative DNA packaging protein of prophage CP-933X
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAAIKWY
AERDAEIENEKLRREVEELRQDSETDLQPGTIEYERHRLTRAQADAQELK
NARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDF
LKRDIIKAMNKAAALDELIPGLLSEYIEQSG
>Z5028 
MKQQEEHNNKIDLLEKQQAQLKSQLETIQKQQTGIISSTKTLTHVIKSVK
DQQNTFIFTEFNPAKTKYFILNNGSVALAGRVLSIDATENGSVIHISLVN
LLSTPISNIGFNATWGGEKPVDAKEFARWQQLLFNTSMKSTLKLLPGQWQ
DINLTLKGVSPNNLGYLKLAINMENIQFDNLPSAENRQKRSKK
>Z2110 putative transposase encoded within prophage CP-933O
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2060 putative DNA adenine methyltransferase encoded by prophage CP-933O
MLNTVKISSCELINADCLEFMRSLPENSVDLIVTDPPYFKVKPEGWDNQW
AGDEDYLKWLDQCLAQFWRVLKPAGSLYLFCGHRLASDTEIMMRERFNVL
NHIIWAKPSGRWNGCNKESLRAYFPATERILFAEHYQGPYRPKDAGYEAK
GRTLKQHVMAPLIAYFRDARAVLGITAKQIADATGKKNMVSHWFSAGQWQ
LPNESDYLKLQALFARVAEEKHQRGELEKPHHQLVDTYASLNRQYAELQS
EYKHLRRYFGVTVQVPYTDVWTYKPVQYYPGKHPCEKPAEMLQQIISASS
RPGDLVADFFMGSGSTVKAAMALGRRATGVELETERFEQTVREVQDLIIR
NG
>Z1161 unknown in ISEc8
MSRKYLIRITELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEK
IEEEEREIEHLRAQIEKLRRMLFGTRSEKLQREVEQAEAQLKQREQESDR
YSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGSELDY
LGEVSAEQLELVSSALKVIRTVRVKKACTKCDCIVEAPAPSRPIERGIAG
SGLLARVLTGKYCEHLPLYRQSEIFARQGIELSRALLSNWVDACCQLMTL
LNDTLYRYVMNTRKVHTDDTPVKVLAPGRKKAKTGRIWTYVRDDRNAGSS
EPPAVWFAYSPDRQGKHPVQHLRPFRGILQADAFSGYDRLFSAEREGGAL
TEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEACSYVLNQWDALCYYS
DDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
>Z3299 unknown protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2080 putative IS encoded protein within CP-933O
MLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVS
RRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDE
KRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSSLPGYPDGTGKTCLYSV
RCHRAGTCTFAAHRAGYRRTGAAGPRADLEVCRAHPAVSPVRNIRPARCG
AEAFTAVGLGGCMLPAAVSAGRGASWLCHD
>Z5097 unknown protein encoded by ISEc8 within prophage CP-933L
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1843 unknown protein encoded by prophage CP-933C
MILGVATSLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPDA
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSG
KLQGAKDGGNREIKHWRTVAISTGEMDVETFLKSEGIKVKAGQLVRLLNV
PMEKATKFHEYSNGKEHADALKDAWTANHGAAGREWVKWLAGHQQEAKDT
VRECRERWRNLIPESYGEQVHRVGERFAILEAALVLSGHVTGWVVQECRD
AIQHNFNAWVKEFGTGNREFKQMVEQAEAFLSSFGFSRYLPHPNTDERDL
PIKELAGYRKGSIRNEDDEMRYYTFPHVFESEIAKGFNPAHFARALDAAG
MLEKGSDRRYKKKALGKIGGKQHVFYVLMFQPDDED
>Z2079 putative IS encoded protein within CP-933O
MHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSL
ESWLREKMKTLSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENA
LRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLK
>Z6016 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z4336 unknown protein encoded by ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1638 putative transposase
MPLLDKLHEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKXRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z4503 putative transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z2375 orf; hypothetical protein in IS629 within prophage CP-933R
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2127 putative IS encoded protein encoded within prophage CP-933O
MARGKAAITFFREPPATSCDSRCSCRTARIARRGPGNPQYQLLGNVPARA
APLQWQCQRKAPDSADTGTEAMIPLPSGTKIWLVAGITDMRNGFNGLAAK
VQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWP
SARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML
>Z3154 unknown protein encoded by ISEc8
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z5200 
MVDNVTVSRVCIQSPSFVPDLDGEKNKSQLFVDDIVAYLKSPSVYSLEKE
GPLNHFVNHCSEVELGFYSDGAYSILVSRSKQQPEGMILTVSDADAINIV
HISVSPVLIKFLDDIFTCLHTYPDDESFTKEQIKANSKYDIVDYNCLLHF
TGKPKSLIECRHFALQYCIDSMNEHTGKVPLKAYYSSPEDIQKHIPFELE
QQFNNLQKNPPPGTCVVASDKFGEALSVFFHRMEKEKLTHMTAIVQSQTH
AMAVRLRIKKTPAGETEYVVSFYDPNATNTAVRYKANNCDSFGSLQSFIN
IQQAKQKWVITDICSECVGITPYLPREQAHLLSGIENELQPPLSPPALFL
LMRMGIYKNIVLFFDKLKNSQEMTASKALDILAAKSPEGIYGLCVLLYHN
TIDKFNDYITNLKELTRKYNFSQEDLETLLLAKDNLGVSWIPRALKNNQN
KIVKAWLLAIDDFEKEFGVNKNEILLRIGKEIDSIDDLNSAIRTNDYNVV
NILLANIKAKMFKNELNKEDILKLMAAREKVAGASDKWTKASGLYSAIVK
GHTKIVAAWMETAEVIASHYENDKDVVRELLSLSRNNAVCSLYVASYKTM
SKQVIDVYLNAAIRLALQHGFTFDEILEQFTRDFDGKSFSLAVEKADDIY
GSLAENIQNCGW
>Z0324 integrase protein for prophage CP-933I
MCIGLCICSCSVWIPIHMPLNDMQIRRAKPEAKAYTFGDGLGLSLLIEPN
GSKSWRFRYRYAGKPKMISLGVYPTITLADARSRRDEARKLVAEGKNPSE
VRKEQKLAMQTESENAFEKIAREWHQLKSAKWSAGYASDIMEAFKNDIFP
YVGTRPVGEIKPLELLNVLRKIEKRGALEKMRKVRQRCSEVFRYAIATGR
AEYNPAADLSSALEVHQSNHFPFLKADEIPDFLRALEGYSGSKLVQIATK
LLMITGVRTIELRAALWQEFDLDNAIWEIPAERMKMRRPHLVPLSSQAVD
LLNELKIMTGNYRYVFPGRNDPNRPMSEASINQAIKRIGYGGKVTGHGFR
HTLSTILHEQGFESAWIEIQLAHVDKNSIRGTYNHAQYFSGRKSMMDWYS
NLIFERLKRS
>Z1600 unknown in ISEc8
MSRKYLIRITELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEK
IEEEEREIEHLRAQIEKLRRMLFGTRSEKLQREVEQAEAQLKQREQESDR
YSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGSELDY
LGEVSAEQLELVSSALKVIRTVRVKKACTKCDCIVEAPAPSRPIERGIAG
SGLLARVLTGKYCEHLPLYRQSEIFARQGIELSRALLSNWVDACCQLMTL
LNDTLYRYVMNTRKVHTDDTPVKVLAPGRKKAKTGRIWTYVRDDRNAGSS
EPPAVWFAYSPDRQGKHPVQHLRPFRGILQADAFSGYDRLFSAEREGGAL
TEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEACSYVLNQWDALCYYS
DDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
>Z5491 orf; hypothetical protein in IS (partial)
MALICKLSQQWSFVGSKARQHWLWYVYNTKTGGVLAYTFGPRTDETCREL
LALLTLLPSAC
>Z1192 IS1 protein InsB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>Z1572 partial putative transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRK
RTA
>Z5088 unknown protein encoded by IS911 within prophage CP-933L
MISSPQHKTGDLMNKKTKRTFTPEFRLECAQLIVDKGYSYRQASEAMNVG
STTLESWVRQLRRERQGIAPSATPITPDQQRIRELEKQVRRLEEHNTILK
KATTALLMSDSLNGSR
>Z1772 unknown protein encoded by prophage CP-933N
MKIKHEHIRMAMNAWAYPDGEKVPAAEIARTYFELGMTFPELYDDSHPEA
LARNTQKIFRWLDKDTPDAVEKMQALLPAIEKAMPPLLVARMRSHSSEYY
REIVERRDRLVKDVDDFVAAAIAWGTLTNSGGQPGNAVVVH
>Z3922 putative transposase
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1221 putative transposase
MPLLDKLHEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z1589 unknown in IS
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKAAVDSICQCNTPFN
YLFHCPRRYRLSV
>Z1660 transposase for IS629
MPLLDKLXEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z5096 unknown protein encoded by ISEc8 within prophage CP-933L
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z3156 unknown protein encoded by ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1131 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPQYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z0395 
MTRKYLTQDEVYRLMDAAQSMSFPERNRCLIMMAFIHGFRASELLDLRLS
DIDASGKQLNIRRIKNGFSTTHPLLPDEYNLIKLWLKQRKLIENGVEGDW
LFLSRKRRPISRQHFFLSFVRLEDVQD
>Z1642 
MLAVERAFSSQISVIEGPPGTGKTQTILNIVANILIQNKTVAILSNNNSA
VSNVYEKMDKQQLGYVMARLGSTENRQQFFSTSISRSEEVLPDSPSANAI
DDVLQQVKKHLNAINQVASLKAEINELNIEYKYLQQWQSQNLRPEELFSH
KYRFSSQKTTDLMAYIHYLSDRRIGFRNRIDLLLNFMILKVKPLMIPERR
LALFTSLQLSYYEKNIREKQISLNEYEEAFKKSDFKILLGRLTSWSMLYL
KQHLRRNVSTRSSFSAETYRDEFDRFIKRFPIIGSSTHSIINSIGKGALL
DYVIIDEASQQDIVPGILGLGCARNVIVVGDRKQLPHVPVLLPNSPSPPA
EYYNCEKYSLLDSVCMLFRNMVPVTLLKEHYRCHPKIIQFCNKQFYDNAL
IPLTVDSGEASLSLVITAKGNHTRNFSNLRELESLEGHYWDEESSRGYIA
PYNAQVNLAEKVLPADFVKSTVHKFQGRECDEIVFSTVLDKKRSSQHSRN
IAFVDNPELVNVAVSRARNKFTLVTGNDVFERHAGHIAALIRYIKYYADD
GEIFESPVISAFDLLYSEYDKSLERLNSRLNSNDSHFKSEQIVACLLRDI
LSQDSYRSMMFHSQIALNQLVLLERGDFTHREQLFMRNRASCDFVVYYKV
GKTPLGVIEVDGGYHLTSVQAERDELKNSILKKCGLPLLRLRTIDSDIEG
KLGAFLSGLTG
>Z0857 putative receptor
MEHKLSDILLLIICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVH
DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR
RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAPPELLNILDIKGKIITT
DAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHD
SYAISEKSHGREEIRLHIVCEVPDELIDFTFEWKGLKKLCVAVSFRSIIA
EQKKEPKMTVRYYISSADLTAGKFATAIRNHWHVENKLHWRLDVVMNEDD
CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV
LAGSGLS
>Z4802 putative ATP-dependent DNA helicase (together with adjacent 3 orfs)
MIQRGTLARESLALDALVLGEHSSRLAWLATVIPQFSKSGIVYTLTTRDA
ELVAEWLRKNGISAFAYYSGVTCEGAEDSNTAREYLEQALLANKIKVLVA
TTALGMGFDKPDLGFVIHYQMPGSIVGYY
>Z0067 
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>Z1928 unknown protein encoded by ISEc8 in prophage CP-933X
MFLTQXQLTMLLEGIDWRQPKRLLTSLTML
>Z4337 unknown protein encoded by ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z4318 
MVLLEHEYVMAKLINISASTEERMTPGERRVASRLESFLNDDCFVWYDIP
VGRKNRHPDFVIIDPDNGLVFLEVKDWTVSTLRKANQEQVTLETDGLLKS
EINPLVQVRRYACDTVNALPADPCLRQNDGQYKGRLNLAWAYGVVFTRIT
RQQLKALTGNNENAVEKIFPSAQTICQDEMTQSVLPEVFRQKIAGMFTTG
FRTRVTPRMRDILRAHLFPEVTVKQNSQIKIMDIQQEILARNIGDGHRVI
HGVAGSGKTMILLFRCLYLAETTSGSYAAPVQLPGGRFHTGYCRSPPSHC
GSIRSERPIASADR
>Z5890 partial putative integrase
MSILMSIFADSILLVYSRDTNGRKTIMALTDTKVRSAKPEEKEYSLVDGD
GMSLLVKPGGSKYWRFRFRFGGKQHLMAFGVYPDVSLADARKKREEARKL
VAAGIDPREHKRAVKEEQAKEIITFEKVAREWLVTNQKWSEDHANRVKKS
LEDNIFPTIGTRNIAELGTRDLLIPIKAVEKSGRLEVASRLQQRTTAIMR
YAVQSGLIDYDPAQEMSGAVASSNRQHRPALELKRIPELLDKIDSYTGRP
LTHCTTELTLLIFIRSSKLHFARWSEIDFETSMWTIPLDWYCSKQRVGLV
Q
>Z1344 putative endonuclease of cryptic prophage CP-933M
MRIEFVLLYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLMDDEQFDEINIVR
AQPVSGGRLGVKIYPIMLEGQVKK
>Z1570 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPQYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1632 IS1 protein InsB
MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVVWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>Z1020 putative ATP-dependent helicase
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ
EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNXYMPAGQEAEHRFAMGELPDEVLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRVEPSIDNEEQHIAEMAAFFREQVESKKHLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDMLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>Z2254 partial H repeat-associated protein of Rhs element
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHPDVLK
>Z4340 unknown protein encoded by ISEc8
MLFGTRSEKLRREVEQAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHR
RPLPAHLPREIQRLESEESCCPECGGELDYLGEVSAEQLELVSSALKVIR
TERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYR
QSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNTRKLHTDDT
PVKVLAPGLKKTKTGRIWTYVRDDRNAGSSSPPAVWFAYSPNRQGKHPEQ
HLRPFRGILQADAFTGYDRLFSAEREGGALTEVACRAHARRKIHDVYISS
KSATAEEALKRISELYAIEDEIRGLPESERLAVRQQRSKALLTSLHEWMM
EKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAETDNNTAERALRAVC
LGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWP
SNRVDELLPWNVVLTNK
>Z1150 unknown in IS
MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQWVTAARK
GLGTPGSRTVAELESEILQLRKALNEARLERDILKKAAVDSICQCNTPFN
YLFHCPRRYRLSV
>Z6014 unknown protein encoded in ISEc8
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z3297 putative transposase for IS629
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDDW
LKREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGYIVGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTERLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
AEVELATLTWVDWYNNRRLLGRLGHAPPAEAEKAYYASIGNDDLAA
>Z1158 unknown in ISEc8
MALRKIAGLYRIEKFIRERPVEKIRQWRQRYSRPIVNDLFAWPEEQEPCC
PPDGPLNKAINYILNRRDELSCFLSDGAVPLDNNICERAIRPVVMGRKAW
LFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTDVLTRLPEWPEDRLEE
LLPLEGFTFSG
>Z1444 putative serine/threonine kinase encoded by bacteriophage BP-933W
MLTPYKRADVEFEWISDLEEQGCFSKVYLAHDRHLAHDLVIKEIEKKENT
NHDDYFNEARLLYKHAHPNIVQVQYAAQCESNIYIAMPFYHNGSLNQLMK
KNNLTSREIIRYSIQFLSGLYHIHSKGLMHFDIKPNNIMISNRNEAMLSD
FGLSQLVNEESRAAPEFGYHFHVPPEYFSLSTNDYNFTYDIYQAGLTIYR
MCVGHDNFERERSAFSTIEQLRESIINGCYPLKEYPPHIHKKLITIVNKC
IHVDPNERYQSVLDVLNDLSAISDGVLDWRLQMTKPTNGTCEWQKKSGDA
ILSIVFDAENSSTTGFRLYDDGRKRRATNLTISSGCTPTKLYRLLKDN
>Z1202 
MLAVERAFSSQISVIEGPPGTGKTQTILNIVANILIQNKTVAILSNNNSA
VSNVYEKMDKQQLGYVMARLGSTENRQQFFSTSISRSEEVLPDSPSANAI
DDVLQQVKKHLNAINQVASLKAEINELNIEYKYLQQWQSQNLRPEELFSH
KYRFSSQKTTDLMAYIHYLSDRRIGFRNRIDLLLNFMILKVKPLMIPERR
LALFTSLQLSYYEKNIREKQISLNEYEEAFKKSDFKILLGRLTSWSMLYL
KQHLRRNVSTRSSFSAETYRDEFDRFIKRFPIIGSSTHSIINSIGKGALL
DYVIIDEASQQDIVPGILGLGCARNVIVVGDRKQLPHVPVLLPNSPSPPA
EYYNCEKYSLLDSVCMLFRNMVPVTLLKEHYRCHPKIIQFCNKQFYDNAL
IPLTVDSGEASLSLVITAKGNHTRNFSNLRELESLEGHYWDEESSRGYIA
PYNAQVNLAEKVLPADFVKSTVHKFQGRECDEIVFSTVLDKKRSSQHSRN
IAFVDNPELVNVAVSRARNKFTLVTGNDVFERHAGHIAALIRYIKYYADD
GEIFESPVISAFDLLYSEYDKSLERLNSRLNSNDSHFKSEQIVACLLRDI
LSQDSYRSMMFHSQIALNQLVLLERGDFTHREQLFMRNRASCDFVVYYKV
GKTPLGVIEVDGGYHLTSVQAERDELKNSILKKCGLPLLRLRTIDSDIEG
KLGAFLSGLTG
>Z2130 putative IS encoded protein encoded within prophage CP-933O
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z3155 unknown protein encoded by ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z4375 orf, hypothetical protein
MTNLTLDVNIIDFPSIPVAMLPHRCRPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDDPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTIWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR
LETNVYVPLA
>Z6019 putative transposase fragment
MATFFAYPADIRKVIYTTNAIESLNSVIRHAIRKRKVFPTDESVKKVVWL
AIQAASQKWTMPLRDWRMAMSRFIIEFGDRLDGHF
>Z5879 orf; hypothetical protein in IS
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1825 putative IS encoded protein
MDSARALIARGWGVSLVSRCLRVSRAQLYVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPTVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAENFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>Z0322 unknown protein encoded by IS2
MQDVMLGAIEKRFGDKVPEQSIQWLTDNGSAYRAHETRQFARELNLEPCT
TAISSPQSNGIAERFVKTMKEDYIAFMPKPNVRTALHNLAVAIEHYNENH
PHSALGYRSPREYRRQRVTLT
>Z1133 partial putative transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRK
RTA
>Z6021 orf, hypothetical protein
MLSPSSINLGCSWNSLTRNLTSPDNRVLSSVRDAAVHSDSGTQVTVGNRT
YRVVVTDNKFCVTRESHSGCFTNLLHRLGWPKGEISRKIEAMLNTSPVST
TIERGSVHSNRPDLPPVDYAQPELPPADYTQSELPRVSNNKSPVPGNVIG
KGGNAVVYEDMEDTTKVLKMFTISQSHEEVTSEVRCFNQYYGSGSAEKIY
NDNGNVIGIRMNKINGESLLDIPSLPAQAEQAIYDMFDRLEKKGILFVDT
TETNVLYDRMRNEFNPIDISSYNVSDISWSEHQVMQSYHGGKLDLISVVL
SKI
>Z0367 unknown protein encoded in ISEc8
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z2073 putative transposase within CP-933O
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z5880 putative transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z1599 unknown in ISEc8
MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLDENPFSGHLFIFRGRR
GDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKVSITRSQLAMLLDKL
DWRQPKTSRLNALTML
>Z0275 
MSIQSLLDYISVIPDIRQQGKVKHKLSDILFLTVCAVIAGADEWQEIEDF
GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKIFIEWMQECHEI
TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSN
EITAIPELLNLLDLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQG
KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTPEFCDF
EFEWKGLKKLCVALSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRA
HWLIEHSLHWVLDVKMNEDASRIRRGNAAEIISGIKKMALNLLRDCKDIK
GGVKRKRKKVALNTCYIEEVLASCSELGFRTDKMKNLTQI
>Z3162 IS629 transposase
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSAHAQHDDW
LKKEIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVA
FIIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
TEVELATLTWVDRYNNRRLLERLGHIPPAEAEKAYYASIGNNDLAA
>Z1162 unknown in ISEc8
MDISLLSTTSDPEQLRALAIAMVQKVMAENAELQNRIRILEEQMKLARQQ
RFGKKCESLAGMQRSLFEEDVDADIAEISAHLDKLLPQTGDEEKTTTRPV
RKPLPSPLPRAEKVIPPAEERCPDCDAPLHFIRDEVSEKLEYIPAQVVVN
RYIRPQYSCPCCEKVFSGKMPAHILPKSAVEPSVIAQVVISKYTDHLPLY
RQQHIFSRMGVELPVSTMADMVGVAGAALAPLAKLLRHELLTRDVIHADE
TSLRLLDTRKGGKSCSGWLWAYVSGERVSVNGAPY
>Z3923 
MPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDBWL
KREIQRVYDENHQVXXCA
>Z1835 putative integrase of prophage CP-933C
MMTGIKIMSRALNKLSDTQLRKINGTPAQKTAFLNDGGNLSVRHSTSGLL
TWYFTYRAGTGRGAPPERIKLGNYPDLSLKSAREKAAQCRAWLAEGKNPR
HELNYTVQEALKPVTVGDALTYWLESYAKENRVDYAALKKRLNNHVIQHI
GAMPLDKCELRHWLACFDQVAKRTPVTAGFLLQTCKQALKFCRRRRYAIS
NVLDDMSVADVGKKPDISERVLSTKELGELLQALDKKIFSPYYIALIRLL
IVFGCRTVELRLSEISEWDFTEMLWTVPKEHSKTKVAIFRPIPEAILPFV
TQLVEQNRHTGLLLGEVKQETSVSQYGRLAHRRLNHPHWSLHDIRRTFTT
MLNDLGVDPHVVEQLTGHQMPGMQRVYNHSRYLDAKRNALDMWTERLGIL
AGTHENVTTLPVARRK
>Z2101 putative endonuclease encoded within prophage CP-933O
MRIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKLNL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLIDDEQFDEINIVR
G
>Z5428 
MGWHHCQYHSFTALDVINNYVACLLLNRGKEAVMVANINLIKQESYSVVN
LEKQLMSNESPKQSLLNDFSHRLKMEGEEQLFHCVNELKYGLSSNNEYDL
FKNRETSLWKRKVPESAALAMTGTVYKLFGSRLLNARNQLDKNLLEMAMQ
MVYGKDMVTKSGDILIDIQLNKDGSLQSKHYTFDVGEVINSYNIDLFVNA
SSINQTYLHDKNPGDVISLYGTLDNIPLVVKECTGKNHQINVKAKLPQER
SEKVEDMCERMRGGISMFNHTTKTAGNIEHNLQLSFLVDSRPSITNKYSA
KDVAPDILVLPVNVYHADFKIKINNAVEKT
>Z2917 dATP pyrophosphohydrolase
MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAP
QAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVTRNTE
SWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA
>Z2429 unknown protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1571 unknown protein encoded in ISEc8
MIPLPFGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1134 unknown in IS600
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRL
QTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLL
NQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMT
KELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMS
RKGNCYDNAPMESFWGTLKNESLSHYRFKSRDEAISVIREYIEIFYNRQR
RHSRLGNISPAAFREKYHQMAA
>Z2561 putative transposase (partial)
MSIIPPISRDERRLIQKAIHKRLTAMLMLHRGDRVSDVARTLCCARSSVG
RWIN
>Z1601 unknown in ISEc8
MDISLLSTTSDPEQLRALAIAMVQKVMAENAELQNRIRILEEQMKLARQQ
RFGKKCESLAGMQRSLFEEDVDADIAEISAHLDKLLPQTGDEEKTTTRPV
RKPLPSPLPRAEKVIPPAEERCPDCDAPLHFIRDEVSEKLEYIPAQVVVN
RYIRPQYSCPCCEKVFSGKMPAHILPKSAVEPSVIAQVVISKYTDHLPLY
RQQHIFSRMGVELPVSTMADMVGVAGAALAPLAKLLRHELLTRDVIHADE
TSLRLLDTRKGGKSCSGWLWAYVSGERVSVNGAPY
>Z1863 putative phosphohydrolase
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>Z1853 unknown protein encoded by prophage CP-933C
MARPPKAPAYLDDIAVKQWREKSRQLAERGDLTPADWSNLELYCVNYSIY
RKAVADLAARGFSIVNSQGGESRNPALSAKSDAERVMIKMASLLGFDPIS
RRKNPPETEEEDELDRLE
>Z1475 putative terminase small subunit of bacteriophage BP-933W
MAKLDWKKLEQAFRREHAETGITLLDWCRKKKINYNTARTRIKMGKIDHE
IDHKTDHEIDHDISDEEPCNDAGSGDEKCAKNSEKNCANSAETKRIRGSR
LLPPSNAFSQRNTHAVRHRGYAKYLEADNLMDDASDMVLFDELVFTRARA
LSVTKALKGMFADLEEATDVETRVALYDKILKAEQALDRNIARIESIERS
LLTLDVLAETAPKLRADRERINAARDKLRAETDILTNQRRGVVTPVSDIV
SSLHEMSNSGRLDDIPEE
>Z1845 putative single stranded DNA-binding protein of prophage CP-933C
MTAQIAAYGRLVDDPQVKQTSKGTPMTLARMAVSLPCSQAQDGQATLWLS
VIAFGKQADFLAKHQKGDVASVSGTMQVSQWTGQNGETRQGYQVIADSVI
SARAARPGGNRRKTTGTQGNQPPAGGDDPYGDGIPF
>Z5902 putative helicase
MTVQTPKVALSDGFLGAFARIPKAQQKKVQEFISKFRQDPTSNGLNYEKI
HDARSKNVHSVRIDQTYRGIVLKPEQGALYMLMWVDKHDEAYDWARRHDC
SIHPVTGAIQVIDISYIKPAAETVVDKPKLFAAYSAEQILALGVPPVFID
QVMALTDEAGLNQLESIMPAEAWEPLHWLAEGLDYQEVLEEFNDHRDEPV
DTNDFMEAIERSKRRFHVVENEQELLQMLNAPLEKWRVFLHPSQRKLVES
PANGPVRVLGGAGTGKTVVAMHRARWLSQRLADKPGKKVLFTTFTRNLAA
DIRANLQRLCTREEMARIDVVNIDAWMSDQLKRHGYDFRVVYDSDEGRRK
CWNYALQQAPAELGLPDNFYAEEWQRVVQPNAVYTREEYLKVSRVGRGTA
LSRIQRAKIWPVFEEFRAQMARAKLREMGDAMHEAIVLFKEKQVQLPYSS
IVVDEAQDIGAPAFTLIRSLVPESPXDLFIVGDGHQRIYRNKVVLGQCGI
NVRGRRSKKLKINYRTTEETRQFAVGLLTGVKVDNXDGEADTSNDYLSLL
HGEKPMITHAADFKEEAATXVKQIQALLANQVRSQDICITARTKHXCDRY
ASALNDAGIETFNLGNDSGDSDARPGVRVATMHRIKGLEFQYVFLVGINE
GVVPEIKALASDDPVEQRDALFNERALLHVAATRAVKGLFVSSSGVPSPL
LVAD
>Z2057 putative endonuclease of prophage CP-933O
MLIDLVLPYPPTVNTYWRRRGSTYFVSKAGERYRRAVVLIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVR
GQPVSGGRLGVKIYKIESE
>Z1217 putative DNA repair protein, RADC family
MQQLSFLPGEMTPGERSLILRALQTLDRHLHEPGVAFTSTRAAREWLILN
MAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNA
AAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGNQVF
SFAEHGLL
>Z5899 putative ATP-dependent helicase
MNKQNYAPGMRVVIRDAEWRIRRADDSGDGGYLLTCDGISELVRGKEGLF
LTKLEQKVEILDPAKTHLVEDESANYQAAQLYIESQLRQRVPTDSKVHFG
HLAAMDSMPFQLDPTRMALAQPRQRILIADAVGLGKTLEAGILVSELIRR
GRGKRILVLAVKSMLTQFQKEFWSRFAIPLTRLDSAGLQKVRNRIPTNHN
PFHYFDKTIISIDTLKQDIEYRHHLENAWWDIIVIDEAHNVAERGTSSLR
SKLAKLLAGRSDTLIMLSATPHDGKAESFASLMNMLDPTAIANPKEYEYA
DFADKNLVVRRFKKDVKDQMSGEFPERNIVKLTRLASGAEEEAYRRLVES
QFRDDDDEQAQSNKGRLFKITLEKALFSSPMACASVVANRLKRLESRKDH
NSQSQINELESLLLALNNIDASQFSKYQLLLDTIRKDLAWKANNTEDRLV
IFTESIKTLEFLEQQLRADLKLKDDQIATLRGDQGDTVLMETVEAFGKTQ
SPLRLLVCSDVASEGINLHHLSHKMIHFDIPWSLMVFQQRNGRIDRYGQK
HQPQIRYLLTEASEPQINGDMRVLEVLINKDEQAQKNIGDSSEFTGKFTQ
EEEEEQVAEFMMQDDGASLFDQLLNSNVSESAEHDLFGEICSAVSSDASM
VTETDTSLFASEQAYCERALGYLKASGQTIQYETLPDNTLSLVAPEELRR
RFNQLPPEIAPENWQLYLSQDKTVITDAIARARGEQHAWPDVQYLWQINP
VVQWLDDKISSAFGRHQAPVIRLPYLLEPDEDHFILSGLFPNRKSHPMVN
PWIVVSFNRESLIGSQPFAEFLQRHPQLSNKLTNSGGKDRNHQRQQDLLE
AAIAHAREVFIHDRNAFETHINQQLNEHLQKLDVLRGRQLSQLELDFADN
KQQLSVKQSRKEQRQREIEHNFDSYIEWIEDTMTTEKEPYIQVIAVITGA
EG
>Z1648 unknown in putative ISEc8
MELQDWRKEXRKNYSNEFKLRMVELASQPGASVARIAREHDINDNLLFKW
LRLWQNEGRISRRLQVTTSSDTGVELLPVEITPDEQKEPVAAIAPSLSTS
TQTSVSAGSCKVEFRHGNMTLENPSPELLTLLIRELTGRGR
>Z0271 
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHLDFLRQYGDFENAIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSN
EITAPPELLNILDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCEVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKEPKMTVRYYISSADLTAGKFATAIR
NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVLAGSGLS
>Z1222 unknown in IS
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z3115 putative endonuclease encoded within prophage CP-933U
MLIDLVLPYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAGLLMDDEQFDEINIVR
GQLVPGERLGIKITELECA
>Z2804 unknown protein encoded within IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z5816 putative virulence protein
MKRLQAFKFQLRPGGQQEREMRRFAGACRFVFNRALALQNENHEAGNKYI
PYGKMASWLVEWKNATETQWLKDAPSQPLQQSLKDLERAYKNFFRKRAAF
PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYRNSRQVTGVVKNV
TASQSCGKWYISIQTENEVSTPVHPSALMVGLDAGVAKLATLSDGTVFGP
VNSFQKNQKTLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANICRDYL
HKVTTTVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL
DQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKENRLSQSK
FRCQACGYTANADVNGARNILAAGHAVLACGEMVQSGRPLKQEPTEMIQA
TA
>Z0946 putative integrase encoded by prophage CP-933K; partial
MIILLRYIHGLIATKKSSPAEESSRRHFINYMSKIKAIRRGLPDAPLEDI
TTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATR
AAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEM
KWSDIVDGYLYVEQSKTGVKIAIPTTLHVDALGISMKETLDKCKEILGGE
TIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYE
KQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK
>Z1132 unknown protein encoded in ISEc8
MIPLPFGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1448 regulatory protein Cro of bacteriophage BP-933W
MQNLDEPIKGVGIPEVAKACGVSERAVYKWLKNGFLPKTEFFGKTKYASK
IEEISGGKYQASEMLEISKKNLLAA
>Z2894 orf, hypothetical protein
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVHPDRPISPQAMAI
HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM
KLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDI
MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWLFN
NLDSMSPELRLTLKHYLENT
>Z6069 putative DNA replication factor encoded within cryptic prophage CP-933P
MKNIAAVGVLERIRRLAPQGAVPPYRTVEEWREWQLAEGRKRSEEINRLN
HQVRVEKILNRAGIQPLHRKCSFGNYRVQNDGQRHALSQAKSIADELMTG
CTNFVFSGKPGTGKNHLAAAIGNRLMAKGRSVIIVTVSDVMSVLHDGYDN
GQSGEKFLQELCGVDLLVLDEIGMQRDTRNEQVTLNQIVDRRTASMRSVG
MLTNLNHAAMSTLLGDRVMDRMTMNGGRWVNFNWESWRSNVGRQGM
>Z0366 unknown protein encoded in ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z6017 putative transposase fragment
MFLALGINIAGQKELLGMRLAENEGANFWFNVLTELKNRGLNDILIACVY
GLKEFPEARIQLCIVHMVRNSMRFVSWKEYKAVTRDLKAISLPQKRQASR
HWKRLLRPGTAAIRR
>Z5098 unknown protein encoded by ISEc8 within prophage CP-933L
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1357 unknown protein encoded by cryptic prophage CP-933M
MLTTQKRKFALALMSGKNKTASALAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCZTVEVDEVPVYRQKKSEPEDKPRRREAAAIPQPDETNPEMPPPVV
ISPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z1120 putative P4-family integrase
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQ
KRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPT
FADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADV
AETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQ
TRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEA
RGMRWAEIDFHKRVWTIPAERMKARIQHRVPLSRQAIYILENIRGLHDEL
VFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQ
GYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK
>Z4313 putative pathogenicity island integrase
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKT
LALVVYPEVSLSEARTKRDEARKLISEGVDPCEQKRAKKVVPDLQLSFEH
IARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVR
AAEAKQIYEIASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQH
RPALDLSRLPELLSRIGSYKGQPVTQLAVMLNLLVFIRSSELRYARWSEI
DIDNAMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQVVAILAELQTW
AGENGLIFTGAHDPRKPISENTVNKALRVMGYDTTQDVCGHGFRAMACSA
LIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANRERFISPFEYAKINNPLKQ
>Z3622 putative resolvase
MNVRIYCRASTEGQHADRALTSLREFSKSKGWQIAGEYIENASGAKLERV
ELMRLLSEAQSGDLLLVEAIDRLSRLEHSAWVELKDTLNRKGLIIVSMDL
PTSWQMVEMAGNDLTSGILRAVNAMLIDILATMARQDYETRRKRQQQGIE
RAKSEGIYIGRAKNQEAREIVREMLEQGVKPELIMKAAGISRATYYRIKN
ELLIVKSE
>Z2791 orf, hypothetical protein
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVEPDESQQQALV
RELNEELGIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALQYPLAPADIPLLEAFMALRAARPAD
>Z2389 putative DNA modification methyltransferase encoded within prophage CP-933R
MNVIDLFSGVGGLSLGAARAGFDVKMAVEIDQHAINTHAINFPRSLHVQE
DVSLLNAEIIKGFFKNDMPIDGIIGGPPCQGFSSIGKGNPDDSRNQLYMH
FYRLVSELQPLFFLAENVPGIMQEKYSGIRNKAFNLVSGDYDILDPIKVK
ASDYGAPTIRTRYFFIGVKKSLKLDISDEVFMPKMIDPVTVKDALYGLPD
IIDANWQSDSESWRTIKKDRKGGFYEKLWGQIPRNVGDTESIAKLKNNII
SGCTGTLHSKIVQERYASLSFGETDKISRSTRLDPNGFCPTLRAGTARDK
GSFQAVRPIHPYHPRVITPREAARLQGFPDWFRFHVTKWHSFRQIGNSVS
PIVAEYILKGLYNLLNKRVQPEYLNHNSLEVRV
>Z1826 putative IS encoded protein
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQY
>Z5815 putative transposase
MKKETDIRRGRHCVFLMHVHLVFVTRYRRQIFDHDATEKLRTYFSKVCAD
FEAELVEMDGEPDHVHLLINYPPKLAISSLVNSLKGVSGRLLRRDRPDIA
VRYYYKGVLWSPGYFASSCGGAPISAIRQYIEQQQTPG
>Z1198 putative transposase
MPLLDKLHEQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVLR
GKKXRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFAGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYI
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z2074 putative IS encoded protein within CP-933O
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQAXAYFAKA
EFDRLWKK
>Z1338 putative DNA replication factor encoded within cryptic prophage CP-933M
MKNIAAAGVLERIRRLAPQASVPPYRTVEEWREWQLAEGRKRSEEINRQN
HQLRVEKILNRSGIQPLHSKCSFANYQVQNDGQKYALSQAKSIADELMTG
CTNFVFSGKTGTGKNHLAAAMGNRLMAKGRSVIIVTVSDVMSVLHDSYDN
GKSGEKFLQELCSVDLLVLDEIGVQRETKNEQVVLHQIIDRRTASLCSVG
MLTNLNHAAMSTLLGERIMDRMTMNGGRWVTFNWDSWRPNVSNMRVVK
>Z2092 unknown protein encoded within prophage CP-933O
MKIKHEHIRMAMNAWARPDGEKVPAAGITQAYFELGMTFPELYDDSHPEA
LARNTQKIFRWIEKDTPDAVEKMQALLPAIEKAMPPLLVARMRSHSSEYY
REIVERRDRLVKDVDDFVAAAIAWGTLTNSGGQPGNAVVVH
>Z4315 unknown protein encoded by ISEc8
MNSQTKKDIPCFRSYLPDALRLRFEDKLTIRAIAQRLGLSHSTIHTLFQR
FIASGIAWPLPDSVSLAQLDAILYANRKKELTEPQISEGTWRKERRTSYS
REFKIRLVKQALQPGAVVARIAREHGINDNLLLKWKSQYEDGLLSDDDIQ
ECMPVPVALTDTPEPTRPVTNPFWRNKPDECPESDPGNVPRCELHLKSGV
VKLFDPLTPEMLRALIREMKGGTR
>Z4316 unknown protein encoded by ISEc8
MITLPTGTRIWIIAGITDMRCGFNGLASKVQNTLKDAPFSGHIFVFRGRS
GKMVKILWADRDGLCLFAKRLERGRFVWPVTREGKVHLTPAQLSMLLEGI
AWQHPKRTERPGIRI
>Z2851 putative enzyme
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVPRNLPQTNQPGSAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>Z1562 unknown in IS1N
MALICELDEQWSFVENKARQQWHWYAYKTKADGVLAYTFGPRTDETCREL
PEFLKPFSAGMITRDNRSSYTREMPQDKHLVGKIFTRRIERNNLTLRTHI
KRPARKTICFLRSLEIHEKPLVHLSKTHVLLTGVITRASFAVFLP
>Z0854 
MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI
KTDEKSNEITAPPELLNILDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF
AVKGNQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCEV
PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPKMTVRYYISSADLTAG
KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI
LTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS
>Z2562 putative transposase (partial)
MWRRAAPTLRIRAPPKDKKMATIHNALDECSTEHPVFYEDEVFIHLNPKI
GADWKLLGKQKRGVTPEQNEKYSLDVALHSGTG
>Z2078 putative transposase within CP-933O
MDEKQLQALANELAKNXKTPEDLSQFDRLLKKISVEAALNAEMSHHLGYD
KNQPKPGANSRNGYSTKTVITGDGHLELRTPRDRDGSFEPZLVKKNQTRI
TGMDNQILSLYAKGLTTREIAAAFKELYDADVVSVQRGPY
>Z1597 unknown in ISEc8
MALRKIAGLYRIEKFIRERPVEKIRQWRQRYSRPIVNDLFAWPEEQEPCC
PPDGPLNKAINYILNRRDELSCFLSDGAVPLDNNICERAIRPVVMGRKAW
LFAGSLMAGNRAAQIMSLLETAKRNGLEPHAWLTDVLTRLPEWPEDRLEE
LLPLEGFTFSG
>Z1927 unknown protein encoded by ISEc8 in prophage CP-933X
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVKRKAPDSADTGTEAMIPLPXGTKNLAGLPVS
PI
>Z2081 putative IS encoded protein within CP-933O
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z4502 orf; hypothetical protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z5878 putative integrase
MSQRYKLYRRTSGIYVVRISVPQRFRRYAGQCEIHTSTGTHDLHEAKQKS
ALLLAVWYQTLQEYEQLDYRTLSDCAPLLAGEGMISLSNFAQSIELPISQ
LIREVVNRNLPVFWLATGQFGFYVDEFNAVEREPGAKREKQSDDEKDQPK
EVIILNSAFELGIESFANGYLRPFNPRHTLDCLLSAGVSEGEAAFRTSGD
NQSGGWFFDLPGVDITADSLLISKVHAEGLRLTWLVKTTPPAVSIHPAVP
LVAPVIANEYVHRKHYNENLSWLREEYLKHRRKGKVSEAALRDIRYYFDL
MIEVMGDIQLEDFDRDFLRAYESKLRTIPANRNLMKGKHGVKTLDELIAK
AAECGDKLMTEESVKKYINGLYGAMEWAVDDGKFLKSPCDNFFPPDDKGE
REQDHTDIFEPHEIKAIFSQPWFVAGTVERNAQGRFHQYCPFHYWAPLLG
LMTGARVNEIAQLMLDDVLADDGVYYLNLESDSENGKKLKNANSRRKIPV
HSTLIELGFIEYVDALKAAGYDRLFPELKPHKTKGYGRPVSAWFNESLLA
GRLKLERDRSKSFHSFRHSVSTLLKEKGVSSELRGQLLGHVRGKTETEVR
YSKDLKPVHMVEVVEKIDFSLPEIARFNIPDGLDAVRDALXXKRGKQTG
>Z2084 putative integrase within CP-933O; partial
MIHLIKPAIDALRSQMALTRLSKEHIIDVHLREFGRTEKQKCTFVFQPEV
SAKVKNYGDHFTVDSIRQMWDAAVKRAGIRHRKSYQSRHTYACWSLTAGA
NPAFIANQMGHADAQMVFQVYGKWMSENNNAQVTLLNTQLSEFAPTMPHN
EAMKS
>Z1929 unknown protein encoded by ISEc8 in prophage CP-933X
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRM
NFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRT
VREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQ
SEIYGRQGVELRRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTP
VQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTH
LACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALT
EEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKN
FLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVS
ELLPWRIALPAE
>Z1957 transposase for IS629
MMPLLDKLREQYGVGPVCSELHIAPSTYYHCQQQRHHPDKRSARAQHDDW
LKKEIQRVYDENHQVYGVRKVWRQLLREGIRVARCTVARLMAVMGLAGVL
RGKKVRTTVSRKAVAACDRVNRQFVAERPDQLWVADFTWVSTWQGFVYVA
FIIDVFAGYIVGWQVSSSMETTFVLDALEQALWARRPSGTIHHSDKGSQY
VSLAYTQRLKEAGLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNR
TEVELATLTWVDWYNNRRLLERLGHIPPAEAEKAYYASIGNNDLAA
>Z3943 
MEFSMENKCIESEQIFFAKMNRYSFKLSDKKWQLDKENCVYPHKVVDRMP
TKMKLSYLKTLAYYASEYSSFYIQSINNLFYEWFGAMTIDTIDDKAIYQL
NVYLGSERNYKLNLIKAFIIKWKNLNYPGVEATAIRMLEKIKIIPNQTGD
AVKRRDPNKGPLTEAEFNNIINAVGKFYHEKKIQCFLYCYILLLAITGRR
PLQLISLKAKDLIKNERGCFLNVPKVKQRKCFRKEFNMVMIEPFLYDSLS
MLINQNQAFVEDKFSVGISNYRGELPIFMNLDKITETKRIEDFLYDLTTD
FFHMKNSVMSKLLKHFPSKFDVRSERTNSYIELNARRFRYTLGSRLANEG
ASIEVIAKALDHKSVNSSIIYIKNNPDNVYDIDKRLSAFFNPLSNILMGI
EIEENKNFFIKFVSDAFFLLEDTKEDLKCLTCKKFNPWRAL
>Z1871 unknown protein encoded by prophage CP-933X
MKKAIAYMRFSSPGQMSGDSLNRQRRLIAEWLKVNSDYYLDTITYEDLGL
SAFKGKHAQSGAFSEFLDAIEHGYILPGTTLLVESLDRLSREKVGEAIER
LKLILNHGIDVITLCDNTVYNIDSLNEPYSLIKAILIAQRANEESEIKSS
RVKLSWKKKRQDALESGTIMTASCPRWLSLDDKRTAFVPDPDRVKTIELI
FKLRMERRSLNAIAKYLNDHAVKNFSGKESAWGPSVIEKLLANKALIGIC
VPSYRARGKGISEIAGYYPRVISDDLFYAVQEIRLAPFGISNSSKNPMLI
NLLRTVMKCEACGNTMIVHAVSGSLHGYYVCPMRRLHRCDRPSIKRDLVD
YNIINELLFNCSKIQPVENKKDANETLELKIIELQMKINNLIVALSVAPE
VTAIAEKIRLLDKELRRALVSLKTLKSKGVNSFSDFYAIDLTSKNGRELC
RTLAYKIFEKIIINTDNKTCDIYFMNGIVFKHYPLMKVISAQQAISALKY
MVDGEVYF
>Z3161 unknown protein encoded by IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2072 putative IS encoded proteinen coded by prophage CP-933O
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z1598 unknown in ISEc8
MEQKILSSEPRRSFSNEFKLQMVKLASQPGAXVARIAREHDINDNLLFKW
LRLWQNERRISRRLPVTTSSGAGVELLPVEITPDEQKEPMAALTPLLSTP
SQSTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR
>Z1785 putative endodeoxyribonuclease of prophage CP-933N
MRIEFVLXYPPTVNTYWRRRGSTYFVSKAGERYRRAVALIVRQQRLKLSL
SGRLAIKIIAEPPDKRRRDLDNILKAPLDALTHAEVLIDDEQFDEINIVR
GQPVPGGRLGVKIYEIRGGNDGA
>Z2082 putative transposase within CP-933O; partial
MDTTLSNSSDSDQTVQAVRLPDVSPALVSKVTDAVMEQVVEWQNRPLDAV
YPIVYLDCIVLKVRQDSRVINKSVFLALGINIEGQKELLGMWLAENEGAK
FWLNVLTELKNRGLNDILIACVDGLKGFPDAINTVYPEARIQLCIVHMVR
NSLRFVSWKDYKAVTRDLKAIYQAPTEEAGQQALEAFASAWDSRYPQISR
SWQANWTNLAMFFAYPADIRKVIYTTNAIESLNSVIRHAIKKRKVFPTDD
SVKKVVWLAIQAASQKWTMPLRDWRMAMSRFIIEFGDRLDGHF
>Z1602 unknown in ISEc8
MFSGLFAMLTPDNVFLVVKPVDMRRGIDTLTQYVQNELNAAWHDGAAFVF
TNKVRSRIKVLRWDKHGVWLCTRRLHRGSFRWPRKGDATWHLTQDEFHWL
VFGVDWQQVKGHDLAKWVYQ
>Z1122 
MVHNGAGVRDSSRTLKVDINTVILTLKNAHHVK
>Z1639 unknown in IS
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1500 unknown protein encoded by bacteriophage BP-933W
MENEGDNIITLVQPKRDEEKLLNITVTGRKNYTQQSCKHRAIEVHEQDHV
ILCLQCGCVVDPFQYVLRCANDGEAVVREIRQLHNRHDQLRESVASLERE
EKNTKARLRAARTAILYAENDLKNIEQKVNQ
>Z4801 putative ATP-dependent DNA helicase (together with adjacent 3 orfs)
MGRAGRGIDSAVGILLCGSEDRAIHKFFRESAFPAEAQIHEILNVLSVND
GLTLRGIEQRTNLRYGQIEKALKLLVAENPSPVVYTEKLWRRTIVSFSPD
HERINHLMNQRKSELADVESYITTKECKMQFLRRALDEPGAEHCGKCSSC
LQHPLLSPDIDSGLLHAANLFIKHADLSLNLNKQVAAGAFTQYGFKGNLP
ASLQGSTGRILSRWGYSGWGKQVAQEKKTGRFSDELVEACAEMVRQRWNP
HPEPTWVCCVPSLKHLDLVPDFESTSTWFLILPGDWRRNLAYLLLMPLKK
SWTIHRRKCSKTVSTSVKISTGRL
>Z2771 putative excinuclease subunit
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQSIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE
EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>Z3348 unknown protein encoded within prophage CP-933V
MAKTSCEEMTMLLIQPGFGLSIKKGHMFGEKESQRKMVSIRLPFISIYWL
NREATNYWYTCARAAFNDPDWFVKNHHAVRQAKRKANMTYMKAYQKAWKE
HRDRYQQDMEKLESENMELRRKLGEAKRDIDAYKRLFNGESHA
>Z2376 putative IS629 transposase within prophage CP-933R
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDXLAA
>Z4200 
MMADVQEEGKPQLWNHKQNDALGLYLDLLIQAINTGTINAEDWQKGDRLK
SVALLIAYLDKANFYVMEDSGAWEEDARLNTSSVALVTSGLERLSNLLSK
KDSVFVSDLLREAKVNELDETLSTTRLNHLIDKGYERITLQLDLGGESPG
YLEKDKHYREADAALLNVIYPANLSKINTRRKEQVLKIVKKLAGPYGIKR
YEKDNYQSANFWFNDIKTDTDQNSHAKREKSFIPSTEAEWFFDSWYAKSA
AIVYKESRKEEYLNDSVQFMNRSLAQITGENMIGANGRSVPEMALPESYN
YIHKSGTLHEAPSPIIPLNWSKASMTLMLKEMSNLINDEGIK
>Z1199 unknown in IS
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1123 unknown in IS1N
MALICELDEQWSFVENKARQQWHWYAYKTKADGVLAYTFGPRTDETCREL
PEFLKPFSAGMITRDNRSSYTREMPQDKHLVGKIFTRRIERNNLTLRTHI
KRPARKTICFLRSLEIHEKPLVHLSKTHVLLTGVITRASFAVFLP
>Z3664 putative virulence protein
MKRLQAFKFQLRPGGQQEREMRRFAGACRFVFNRALALQNENHEAGNKYI
PYGKMASWLVEWKNATETQWLKDAPSQPLQQSLKELERAYKNFFRKRAAF
PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYLNSRQVTGVVKNV
TVSQSCGKWYISIQTESEVSTPVHPSASMVGLDAGVAKLATLSDGTVFEP
VNSFQKNQKKLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCTANIRRDYL
HKVTTTVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL
DQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKENRLSQSK
FRCQVCGYTANADVNGARNILAAGHAVLACGEMVQSGRSLKQEPTEMIQA
TA
>Z6015 unknown protein encoded in ISEc8
MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRS
GSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGI
DWRQPKRLLTSLTML
>Z5117 
MASLWKRLFYSSGRRRRYFEEGEHSFSILCGRLRGIVLTIKCSNGIIYLS
IKVSPNNRNHVFLYHKKDYVFDKLKEIFPDEAIEFTIEYEN
>Z6061 putative endonuclease encoded by cryptic prophage CP-933P
MRIEFVLPYPPTVNTYWRRRGSTYFVSKAGERYRRDVALIVRQQRLKLNL
SGRLVIKIIAEPPDKRRRDLDNILKAPLDVLTHAGLLIDDEQFDEINIVR
GQLVPGERLGIKITELECA
>Z1159 unknown in ISEc8
MEQKILSSEPRRSFSNEFKLQMVKLASQPGAXVARIAREHDINDNLLFKW
LRLWQNERRISRRLPVTTSSGAGVELLPVEITPDEQKEPMAALTPLLSTP
SQSTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR
>Z3093 unknown in IS629 encoded within prophage CP-933U
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z2806 putative transposase
MPLLDKLREQYGVGPVCSELHIAPSTDYHCQQQRHHPDKRSARAQRDDWL
KREIQRVYDENHQVYGVRKVWRQLLREGIRVARXTXARLMAVMGLAGVLR
GKKVRTTVSRKAVSAGDRVNRQFVAERPDQLWVADFTYVSTWQGFVYVAF
IIDVFSGCIVGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYV
SLAYTERLKEAKLLASTGSTGDSYDNAMAESINGLYKAEVIHRKSWKNRA
EVELATLTWVDWYNNRRLLGRLGHIPPAEAEKAYYASIRNDDLAA
>Z5114 
MNEKFRTDLAHTFGIALEEQTDVLSFHDNDGHEWILECASQSEILFFYCY
LLNSESIQINSILEMNSNRELLGMFFLSLKDDNILLNIAFPADKIDITEF
ANLMENGYLLKNEIIRSLSSRPTDFLP
>Z2365 putative DNA packaging protein of prophage CP-933R; terminase small subunit
MATQTEVARHLSLTDRQLRRLQKLPGAPISNKRGQLDLDAWRDFYISYLR
RSKNDVPDGDSEDDYEEKLLIARWELTAEQAVTQQLKNEVSKGKLIDTGF
CIFALSKLAMALSSTLDSIPLSMQRQFPDLTPRHLDHLKTLIAKGANQCA
RAGDKLPDLLDEYIRATTE
>Z2111 putative transposase encoded within prophage CP-933O
MARCTVARLMAVMGLAGVLRGKKVRTTXSRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLNALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z1323 putative integrase for cryptic prophage CP-933M
MGRPRKNKKDNVLPPRVRSNGYSYVWKPEGSTRSIGLGRVRKTSVAKVWQ
NYELEKAKLHNIMTVAKLWHMFMDSPAFTELAPRTQKDYRQHQKALLMVF
GKVLADNVKTEQVRIFMDKRGLESKTQANHELASLSRVYGWGYERGYVKN
NPCKGVRKFSLKARTVYITDEQYAAIYAEAIPQLRIAMEISYLCAARLGD
VLELKWQDIMDKGIYIEQNKTGTKQIKEWSPRLRTAIQLARNVSSCTCEY
VINTTKGGKVIAKTLNNWWNQAKRAAEQKVGVPFGCNFHDIKAKGISDYE
GSSRDKQIFSGHKTENQVLIYDRKTKITPTLDLPLVVSK
>Z4064 orf; hypothetical protein
MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVA
CIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLL
YQAKLALTEDLRLKVVRKMYELRFREPPPARRSVDQLRGIEGSRVRQTYA
LLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGY
APAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLA
CRDIFRSTKLTGKLIPLIEKVLAAGEIEPPQPAPDMLPPAIPEPETLGDS
GHRGRGG
>Z6022 putative integrase fragment
MAAMLNTYVAEGKAVSARVIRSTLVDVFRGAIAEGHVATNPVTTTRAAKS
EVRRSRLTANEYVAIYHAAEHLPIWLRLSMDLAVVTGQRVGDLCRMKWSD
INDGHLHIGQSKTGAKIAIPLALTIDALDISLVDTLQKCREASSSETIIA
STYHEPLSPATVSRYLTKARNASGISFDGDPPTFHELRSLSARLYRNQIG
YKFAQRLLGHKSDSMAAHYRDSRGREWDKIEIG
>Z5089 putative transposase
MLTQNGVPMSRYRAGRLMKYLNLSSCQPGKHQYKNARQEHTCLPNLLERQ
FAVPEPDRVWCGDITYIWAGNRWCYLAVVMDLFARRVIGWSLSANADTAL
ISSALRMAYEVRGQPRDVMFHSDQGSQYTGLKYQQLLWRYRIKQSVSRRG
NCWDNSPMERFFRSLKTEWVPTDGYTGKDVARQQISSYILNYYNSVRPHH
YNGGLTPEESENRYHFYCKTVASIT
>Z4317 unknown protein encoded by ISEc8
MNNTLPDNIEQLKALLIAQQAVIVRLSGEITGYAREISSLRALVAKLQRM
LFGRSSEKSREKIEKKIARAETRITELQNRLGEAQLQLTSMAGETAPKTS
DSPVRKALPATLPRDRQVISPAETECPVCSGKLKPLGESISEQLDIINTA
FRVIETVRPKRACSRCDCIVQAPQPPKPIERSYASPALLARIIMAKFAEH
LPLYRQSEIYARQGVELHRNTMGRWVDIMGEQLRPLYDELKHYVLMPGKV
HADDTPVNVLEPGQGKTRTGRLWVYVRDDRNAGSTMPAAVWFSYSPDRKG
IHPQQHLADYRGILQADAYAGYNALYESGQATEAACMAHARRKIHDVHVR
HPTTVTGEALRRIGELYAIEAEIRGSPAEERLAVRKARTVPLMQSLYEWL
QGQMNTLSRHSDTAKAFTYLLKQWDALNEYCRNGWVEIDNNLCENALRVV
ALGRRNYRTWFLPGKGNGTKESWFFRNRPHHE
>Z2253 H repeat-associated protein of Rhs element
MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI
KTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF
AVKGNQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDV
PDELIDFTFEWKGLKKLCVAVSFRSIIAEQQNSRTAEQKKEPKNDGQILY
QFC
>Z4803 putative ATP-dependent DNA helicase (together with adjacent 3 orfs)
MEKHAAELLLQRMLGNTTATFREGQWEAIDAVVNQRRKLLVVQRTGWGKS
AVYFIASKIFRDRGAGPTIIISPLLALMRNQVAAAERLGITAETLNSTNR
EEWQRISDKLLQGEVDCLLVSPERLANQDFIETVLYPIADHIGLLVVDEA
HCISDWGHDFRPDYRRILDILRNYLRIPLFWVQPRQRITVSLRISVSNWV
TL
>Z4334 IS629 transposase
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z1661 unknown protein encoded by IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z1934 putative transposase for IS629
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAXRPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z1647 partial transposase
MATYGGQFTLTDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWY
NNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>Z2430 putative transposase for IS629
MARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQ
LWVADFTYVSTWQGFVYVAFIIDVFAGYIVGWRVSSSMETTFVLDALEQA
LWARRPSGTIHHSDKGSQYVSLAYTERLKEAGLLASTGSTGDSYDNAMAE
SINGLYKAEVIHRKSWKNRAEVELATLTWVDWYNNRRLLGRLGHTPPAEA
EKAYYASIGNDDLAA
>Z1802 unknown protein encoded by prophage CP-933N
MLTTQKRKFALALMSGKNKTASAIAAGYSAKTARVKGSQLAKDPEVLAFI
ARKQCETVEVDEVPVYRQKKSEQEDKPRRREAAAIPQPDENNPEMPPSAV
MSPGIEYMEDGLPDPVKAMGRLLVENINTDPRLALDAAYKLAQFTHHKKG
DAGKKSAKGDAAKKAANRFAVPPPPRLVVNNDNEGNG
>Z1163 unknown in ISEc8
MFSGLFAMLTPDNVFLVVKPVDMRRGIDTLTQYVQNELNAAWHDGAAFVF
TNKVRSRIKVLRWDKHGVWLCTRRLHRGSFRWPRKGDATWHLTQDEFHWL
VFGVDWQQVKGHDLAKWVYQ
>Z0700 putative receptor
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDL
GETHLNFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRRRAIHVISAFSTMHSLVIGQIKTDEKSN
EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKESEMTVRYYISSADLTAEKFATAIR
NHWHVENKLHWRLDVVVNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVFAGSGLS
>Z1561 
MVHNGAGVRDSSRTLKVDINTVILTLKNAHHVK
>Z1958 unknown in IS629
MTKNTRFSPEVRQRAVRMVLESQGEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGSGDGGLTTAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z0394 
MLRHACGFALADNGVDTRLLQDYLGHRNIQHTVRYTASNAARFKGVWKKK
PR
>Z1207 partial transposase
MATYGGQFTLTDNAMAESINGLYKAEVIHRKSWKNRAEVELATLTWVDWY
NNRRLLERLGHTPPAEAEKAYYASIGNDDLAA
>Z0365 unknown protein encoded in ISEc8
MGTKVSDMQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENG
INANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETL
SISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR
>Z4335 unknown protein in IS629
MTKNTRFSPEVRQRAIRMVLESQDEYDSQWAAICSIAPKIGCTPETLRVW
VRQHERDTGGGDGGLTSAERQRLKELERENRELRRSNDILRQASAYFAKA
EFDRLWKK
>Z3237 alkA, 3-methyl-adenine DNA glycosylase II, inducible
MYTLNWQPPYDWSWMLGFLAARAVSGVETVADDYYARSLAVGEYRGVVTA
IPDIARHTLHINLSADLEPVAAECLAKMSRLLDLQCNPQIVNGALGKLGA
ARPGLRLPGSVDAFEQGVRAILGQLVSVAMAAKLTAKVVQLYGERLDDFP
EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>Z3470 alkB, DNA repair system specific for alkylated DNA
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV
TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
TDCRYNLTFRQAGKKE
>Z4740 dam, DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH
FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>Z2417 dbpA, ATP-dependent RNA helicase
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV
AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA
QRANIISDMLQIKLNWQTPPANSSIVTLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKT
CRVRLLK
>Z3054 dcm, DNA cytosine methylase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPPAHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>Z4523 deaD, inducible ATP-independent RNA helicase
MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ
AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT
RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP
GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT
ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN
EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL
REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI
GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR
LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ
GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV
GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL
PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFSGERR
EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>Z0285 dinJ, damage-inducible protein J
MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFD
LREPNQLTIQSIKNSEAGVDVHKAKDADDLFDKLGV
>Z0292 dinP, damage-inducible protein P; putative tRNA synthetase
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK
FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL
SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFSELQLTASAGVTPVKFLAK
IASDMNKPNGQFVITPAEVSAFLQTLPLAKIPGVGKVSAAKLEAMGLRTC
GDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA
EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ
EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG
L
>Z5193 dnaA, DNA biosynthesis; initiation of chromosome replication; can be transcription regulator
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW
VRDKYLNNINGLLTSFCGADAPQLRFEVGTKSVTQTPQAAVTSNVAAPAQ
VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA
RAAARQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY
MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF
FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE
TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT
GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE
SHDIKEDFSNLIRTLSS
>Z5650 dnaB, replicative DNA helicase; part of primosome
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD
VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV
GGFAYLAELSKNTPSAANISSYADIVRERAVVREMISVANEIAEAGFDPQ
GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM
LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT
MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV
PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE
>Z5961 dnaC, chromosome replication; initiation and chain elongation
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>Z0196 dnaE, DNA polymerase III, alpha subunit
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERVKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQHLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR
LRQSLEPHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI
GSEQVELEFD
>Z4419 dnaG, DNA biosynthesis; DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNXCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSXNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>Z5192 dnaN, DNA polymerase III, beta-subunit
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>Z0241 dnaQ, DNA polymerase III, epsilon subunit
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV
YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA
AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA
RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA
TIQRLVRQASKLRVVFATDEELAAHEARLDLVQKKGGSCLWRA
>Z0587 dnaX, DNA polymerase III, tau and gamma subunits; DNA elongation factor III
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERL
ASVTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKA
LEHEKTPELAAKLAAEAIERDAWAAQVSQLSLPKLVEQVALNAWKEESDN
AVCLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPL
EWRQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEDSIRPI
>Z4290 endA, DNA-specific endonuclease I
MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGC
KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQD
GGRKNCXKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ
CAMKVDFKEKAAEPPARARGAIARTYFYMRDHYNLTLSRQQTQLFNAWDK
MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>Z4115 exo, 5'-3' exonuclease
MRGLFPISHPAIACSSIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR
PPMPEELHDEMPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>Z5910 fimB, recombinase involved in phase variation; regulator for fimA
MKNKADNKKRNFLTHSEIESLLKAANTGPHAARNYCLTLLCFIHGFRASE
ICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEVQALKNWLSIRTSYP
HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRHSCGFA
LANMGIDTRLIQDYLGHRNIRHTVWYTASNAGRFYGIWDRARGRQRHAVL
>Z5911 fimE, recombinase involved in phase variation; regulator for fimA
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY
QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTLERANWKGADRTD
AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG
ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>Z4621 fis, site-specific DNA inversion stimulation factor; DNA-binding protein; a trans activator for transcription
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>Z3484 gyrA, DNA gyrase, subunit A, type II topoisomerase
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGXMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETXIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>Z5190 gyrB, DNA gyrase subunit B, type II topoisomerase, ATPase activity
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE
ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP
QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG
VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI
GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE
GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE
QQMNELLAEYLLENPTDAKIVVGKIXDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGXSAGGSAKQGRNRKNQAILPLKGKI
LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD
VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE
AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR
YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD
VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL
LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL
WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>Z1313 helD, DNA helicase IV
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR
GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS
GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD
NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA
QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG
RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT
AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR
RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL
AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGERCDLDT
TYRFNSRIGEVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL
DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ
QADYVLIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL
TRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>Z2741 himA, integration host factor (IHF), alpha subunit; site specific recombination
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>Z1258 himD, integration host factor (IHF), beta subunit; site-specific recombination
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>Z0787 holA, DNA polymerase III, delta subunit
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VAARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQPQLRQ
AVQLLTRTELTLKQDYSQSVWAELEGLSLLLCHKPLADVFIDG
>Z1738 holB, DNA polymerase III, delta prime subunit
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGIDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA
ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>Z5871 holC, DNA polymerase III, chi subunit
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>Z5973 holD, DNA polymerase III, psi subunit
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSRCNSWRLGTDEPLSLEG
AQVASPALTELRANPTARAALWQQICTYEHDFFPRND
>Z2313 hrpA, helicase, ATP-dependent
MLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP
EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR
GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM
VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR
RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD
TERDQLQAIFDAVDELSQESPGDILIFMSGEREIRDTADALNKLNLRHTE
ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA
RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS
RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL
EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM
IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ
KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY
REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM
VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT
EKVTVYGLPIVAARKVNYSQIDPALCRELFIRHALVEGDWQTRHAFFREN
LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRIGHDVISARHFDSWWKK
VSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGA
DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV
PAPNYAEAFLGRVTPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL
KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI
WSFGQLPESYEQKRGNYKVKAWPALVDERDSVAIKLFDNPLEQKQAMWNG
LRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDQ
LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR
LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR
LEKLAVDPHRDRAQMLKVENVQQAWQQWFNKLPPARREDEDVKEIRWMIE
ELRVSYFAQQLGTPYPISDKRILQAMEQISG
>Z0159 hrpB, helicase, ATP-dependent
MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDYAPQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR
AQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
RYLPLPTHQRFDDAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASR
IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFDPRTGLTRLITQRISQASMTQRAGRAGRLEPGICLHL
IAKEQAERATAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPTVN
LLAAKRLLQMLGALDGERLSAQGQKMAALGNDPRLAAMLVSAKSDDEAAT
AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLAGAFADRIARRRGQDGRYQLANGMGAMLDADDALSRHEWLIAP
LLLQGSASPDARILLALPVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW
RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAGKWLPEYDWPAVDDESLLATLETWLLPHMAGVHSLRGLKSLDIYQ
ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGRVPLVLELLSPAQRPLQITRDLGAFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>Z5576 hupA, DNA-binding protein HU-alpha (HU-2)
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>Z0547 hupB, DNA-binding protein HU-beta, NS1 (HU-1)
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>Z3613 intC, putative prophage integrase
MDKIILPTGFLPMLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQ
LRYKIDGKEKILTVGKYPLMTLQEARDKAWTARKDISVGIDPVKAKKASS
NNNSFSAIYKEWYEHKRQVWSAAYATELAKMFDDDILPIIGGLEIQDIEP
MQLLEVIRRFEDRGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLAEA
MKGYRKKNFPFLPADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKEL
RSMQWKNVDFENRIITIEASVMKGRKIHVVPMSDQVVELLTTLSSITKPV
SEFVFAGRNDKKKSICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHE
WPADAIEVQLAHANGGSVRGIYNHAQYLDKRREMMQWWADWLDGKVE
>Z0307 intH, putative integrase for prophage CP-933H
MHKHAAANVAQRNRLNGKQIGFWLQHFAGMQLRDITESKIYSAMQKMTNR
RHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHLSFIKALLRAAERE
WKMLDKAPIIKVPQPKNKRLRWLEPHEAQRLIDECPEPLKSVVEFALATG
LRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQI
GNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRF
HDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQ
IDSILNPSVPNLSQSKNKEGTNDV
>Z5087 intL, putative integrase for prophage 933L and the LEE pathogenicity island
MALTDVKVKTAKPKERPYKLADGGGMYLLINANGSKYWRMKYRFAGKEKM
LSIGVYPDVTLADAREKRSEARKILAAGGDPGEAKKEEKIALQMSLKNTF
EAVAREWHQTKADRWSLRYRDEIIDTFEKDIFPYIGKRPIAEIKPMELLE
ALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPDLASALATPK
KVHFPFLTANELPHFLTDLAGYTGSIITKTATQIIMLTGVRTQELRFAHW
EDIDFEAKLWEIPAEVMKMKRPHIVPPSEQVIALFKQLEPISKHHPLVFI
GRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILHEQGFNSAWI
EMQLAHVDKNSIRGTYNHAQYLDGRREMMQWYADYIDSLSELA
>Z1764 intN, partial integrase for prophage CP-933N
MSPRPRKNSTDVAGLYEKFDRRTGRVYYQYKNPVTGKFHGLGTDKGKAEK
IASTANQRIAAAEAEYFMRKIDESPSATKRRGIRLKAWVDRYLKIQDTRL
KNGDIAATTHKEKTRMAAYLVSRLGNHPLKELEVRDFALILDEWLDKDMV
STARVNRGLWVDIYKEAQHAGEVPPGWNPPEATRKPIPKVTRARLTMEDW
QKIYNATPEKHFIRNAMLLAIVTGQRRDDICHMRFSDVWNEHLHITQGKT
GMRLALPLTLRCDAIGITLKEVIDGCRDRILSPYLIHSRHQKQPKPMSKD
NLSDYFAKARDLAGVIPPAGKTPPTFHEQRSLSERLYRAQGIDTKTLLGH
KVQATTDRYNDTRGQEWVKLVI
>Z2036 intO, putative integrase for prophage CP-933O
MARPRKYKTDVPGLSPYFDKRNNKVYWRYRHPITGKNHGLGSIDQKLAET
IAAEANSRLARQQMEQMLSLQEKIISDTGGSSTVTIFLNNYRKIQQERYE
NGEIKLNTLKQKAAPLRVFDERFGTRPLDAITVKDVVSVLEEYKARGHNR
MGQIFRKVLIDVFREAQQTGDVPPGFNPAESAKKPQVRISRQRLTFDEWM
MIYNAAEKDGYFLQRGMLLALMTGQRLSDICKMQFSDIRDGYLHVEQQKT
GTRIAIPLALRCDKLNLTLDDVVSSCRDCVLSPWLLHHHHAKGTAKRGGM
VKPATLTVAFKKARDSVDYNWRANGTPPSFHEQRSLSERLFREQGVDTKI
LLGHSNQKMTDIYNDARGKEWKKLVI
>Z2566 intP_1, integrase fragment, cryptic prophage CP-933P
MREVEMKYPTGVENHGGKLRIWFVYKGVRVRENLGFLTQQKTGALQVSYA
PLFVTQ
>Z2568 intP_2, integrase fragment, cryptic prophage CP-933P
MEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEK
NLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNA
VFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAK
NLWCNTVLSGDF
>Z2415 intR, putative integrase for prophage CP-933R
MSKLPTGVEIRGKYIRIWFMFRGKRCRETLKGWEVTNSNIKKAGNLRALI
VHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTT
NTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPR
SNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPD
PLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDFEKGVV
NVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEIT
FYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIR
RRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMND
EQVAMLNARLS
>Z2966 intT, integrase for prophage CP-933T
MSVRKIPSGKWLCECYPYGASGKRIRKQFATKSEALSYERRLMNSRVGDE
FQDGSGPRLSELIARWFEMYGKTLSSGAERKVKLEAICSRLGDPFASQFD
KNMFATYRERRLSGEWNPKGKKKLSEATVNREQSYLHAVFAELKRLGEWS
GENPLTGIRKFREEEKELAFLYVDEIERLLIACDESRNKDLGVVVRIGLA
TGARWSEAEGLKQSQVLPGRITFVKTKGKKNRTVPISPQLQAMLPKKRGA
LFSPCYEAFDAAIKRAKIELPDGQLTHVLRHTFASHFMMRGGNILVLQKI
LGHSDIKMTMRYAHFAPGHLEAAVELNPFDNRG
>Z3130 intU, putative integrase for prophage CP-933U
MGRRRKNPEHEKLPPKVYPNKYSVWKPTSRESVTLTAIEDGLAALWKKYE
ETVNHRDRAMTFGRLWEKFLASAYYSELSPRTQKDYLQHQKKLLAVFGKV
LADSVKPEHIRRYMDKRGEQSKTQANHEKSSMSRVYSWGYERGYVKANPC
AGVSKFKAKNRERYVTDKEYQAVLSVAPLPVFIAMEIAYLCAARVSDVLS
LKWEQIGNDGIFIQQGKTGKKQIKAWSPRLQAAIEKAKQLPTSAYVISNQ
YGNRYMYKGFNEMWVEARNHAGKISGILTDFTFHDLKAKGISDYEGSSRD
KQLFSGHKTEGQVLIYDRKVKVSPTLDVPLPENIPRKYSK
>Z3375 intV, putative integrase for prophage CP-933V
MSNASYPTGVENHGGSLRIWFHYNGKRVRENLGVPDTAKNRKIAGELRTS
VCFAIRMGSFDYAAQFPNSPNLKHFGLGKREITVKALSEKWLDLKKIEIC
ANALNRYQSVIKNMLPMLGEKKLVSSITKEDLLFVRRDLLTGYQKLSNGK
TSSIKGRSVVTVNYYMTTIAGMFQFATDNGYTSGNPFNGLAPLKKSKVKP
DPLTRDEFIRFIEACRHQQTKNLWILAVYTGIRHGELVSLAWEDIDLKAR
TITIRRNYTKLGEFTPPKTDAGTGRTIHLVQPAIDALKSQAEMTMLGKQH
SVEVKQREYGRTAVHKCTFVFSPQVTKQQQLSGPHYKVDSIRESWTSILK
RAGLRHRKSYQSRHTYACWSLAAGANPSFIASQMGHTNAQMVFNVYGAWM
KDNNHEQIELLNKRLSESVPCMPHKKAG
>Z1424 intW, integrase for bacteriophage BP-933W
MLLDAGGTMANSAYPAGVENHGGKLRITFKYRGKRVRENLRVPDTPKNRK
IAGELRASVCFAIRTGTFDYADRFPDSPNLKLFGLVKKDITVGELAQKWL
TLKAMEIGSNALNRYQSVMKNMLPRLGPGRLASSITKEDLLFIRKDLLTG
EKGSRKTSTSRKGRTVPTVNYYMTTTAGMFSFAAENGYLEKNPFNSITPL
RKSKPVPDPLTRDEFSRLIDACHHQQTKNLWTVAVFTGMRHGEIAALAWE
DIDLKAGTITVRRNFTKIGDFTLPKTDAGTNRVIHLLAPAIEALKNQAML
TRLSRQHQITVQLREYGRTILHECTFVFCPQIVRKNHKAGINYAVSSIGA
TWDSAIKRAGIRSRKAYQSRHTYACWALSSGANPTFIASQMGHSSASMVY
NVYGAWMPECSVTQVAMLNNVLNARAPDVPQSDQEDEIKLYFSK
>Z3677 lig, DNA ligase
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKSNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
LPVSDRVTLCESAEEVLAFYXXXEEDRPTLGFDIDGVVIKVNSLEQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTYCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKSAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWXAPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDEAEMLRLLGS
>Z1754 mfd, transcription-repair coupling factor; mutation frequency decline
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ
NALRLHDEISQFTDQMVMNLADWETLPXDSFSPHQDIISSRLSTLYQLPT
MQRGVLIVPVNTLMQRVCPHSFLHGHALVMEKGQRLSRDALRTQLDSAGY
RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ
RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK
GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLENSAERFQADTL
ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN
ANLGFQKLPDLAVQAQQKAPLDALRKFLESFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDTVRNLALICESDLL
GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE
AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW
SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE
TTPDQAQAINAVLSDMXQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK
QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMLSRFRSAKEQTQILAEVA
EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD
ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSLVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL
ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR
GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL
EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ
QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF
GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL
IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA
>Z4149 mutH, methyl-directed mismatch repair
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALAGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE
DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>Z5777 mutL, enzyme in methyl-directed mismatch repair
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG
AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA
SISSVSRLTLTSRTAEQQEAWQAYAEGRDMDVTVKPAAHPVGTTLEVLDL
FYNTPARRKFLRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQYR
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL
AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD
VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRP
IPENRVAAGRNHFAEPAVREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKPKAPEPQEPALAANSQSFGRVLTIVHSDCALLE
RDGNISLLALPVAERWLRQVQLTPGEAPVCAQPLLIPLRLKVSGEEKSAL
EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS
VFEPGNIAQWIARNLMSENAQWSMAQAITLLADVERLCPQLVKTPPGGLL
QSVDLHPAIKALKDE
>Z5059 mutM, formamidopyrimidine DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLAHLGPEPLSDDFNGEYLHQKCAKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>Z4043 mutS, methyl-directed mismatch repair
MSAIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVXIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>Z0109 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase, prefers dGTP, causes AT-GC transversions
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV
VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVESWEGEPWGKEGQ
PGKWMSLVGLNADDFPPANEPVIAKLKRVYVG
>Z4306 mutY, adenine glycosylase; G.C--> T.A transversions
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP
YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA
VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS
LCPLQNGCIAATNNSWSLYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP
PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD
IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>Z0865 nei, endonuclease VIII and DNA N-glycosylase with an AP lyase activity
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVET
RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEESQTTRVLRVKLQTADKT
ILLYSASDIEMLTPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLDIPRFSYATRGQVDENKHHGALFRFKVFHRDGELCERCGGIIEKTTL
SSRPFYWCPGCQH
>Z5574 nfi, endonuclease V (deoxyinosine 3'endoduclease)
MDLASLRAQQIELASSVIREDRLDKDPPDLIAGADVGFEQGGEVTRAAMV
LLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPDLV
FVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSEPGALA
PLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRLPE
PTRWADAVASERPAFVRYTANQP
>Z3416 nfo, endonuclease IV
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARTVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGBIGHDAFRWIMQD
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>Z2644 nth, endonuclease III; specific for apurinic and/or apyrimidinic sites
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKXCRILLEQH
NSEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>Z2432 ogt, O-6-alkylguanine-DNA/cysteine-proteinmethyltrans ferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG
YERISATNPGGLSDKLREYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGELAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVQRKEWLLRHEGYLLL
>Z4373 parC, DNA topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS
EE
>Z4387 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEXNVRRDGQVYNIAFENGEKVQDL
QVVGNCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>Z0859 phrB, deoxyribodipyrimidine photolyase (photoreactivation)
MTTHLVWFRQDLRLHDNLALAAACRNSSARLLALYIATPRQWAAHNMSPR
QAELINAQLNGLQIALAEKGIPLLFREVDDFAASVEIVKQVCAENSVTHL
FYNYQYEVNERARDVQVERTLRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIKPAPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQALDGGAGSVWLNELIWREFYRHLMTYYPSLCKHCPFIA
WTDRVQWQXNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDREGEFIRQWLPELRNVPGKSVHEPWKWAEKAGVKLDYPQP
IVEHKEARVQTLAAYEAARKGK
>Z0318 pinH, DNA invertase from prophage CP-933H
MASFLLLSGRSTMLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGT
KSERPGLKKLLRTLSAGDTLVVWKLDRLGRSMRHLVILVEELRERGVNFR
SLTDAIDTSTPMGRFFFHVMGALAEMERELIVERTKAGLEAARAQGRIGG
RRPKLTPEQWAQAGRLIAAGIPRQKVAIIYDVGVSTLYKKFPAGDK
>Z5398 polA, DNA polymerase I, 3'--> 5' polymerase, 5'--> 3' and 3'--> 5' exonuclease
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI
MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK
AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT
LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT
AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA
TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK
GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP
VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN
SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE
DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK
VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK
KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP
KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV
IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT
SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG
TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHRDDVDAVAKQI
HQLMENCTRLDVPLLVEVGSGENWDQAH
>Z0068 polB, DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV
PRAQHILRGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG
VTVYEADVRPPERYLMERFITSPVWVEGDMHNGAIVNARLKPHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVAS
RPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLVSSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTF
VWLKGAHSEEEATKIGRALVQHVNVWWAETLQKQQLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYFRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>Z5482 priA, primosomal protein N'(= factor Y)(putative helicase)
MPVAHVALPVPLPRTFDYLLPEGMAVKAGCRVRVPFGKQQERIGVVVSVS
DVSELPLNELKAVVEVLDSEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL
PILLRQGRPAANAPMWYWFATEQGHAVDLNSLKRSPKQQQALAALRQGKI
WRDQVATLEFNDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLN
TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV
MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV
IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI
PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQADNQVILFLNRRGFAPALLCHDCGWIAECPRCDH
YYTLHQAQQHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP
LFPGVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD
VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH
PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP
LFLQQLRNLILSSPLADDKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR
LQHIINGTLALINTIPDSRKVKWVLDVDPIEG
>Z5810 priB, primosomal replication protein N
MTNRLVLSGTVCRTPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM
PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI
DSGD
>Z0584 priC, primosomal replication protein N''
MKTALLLEKLEXQLATLRQRCAPVSQFATLSARFNRHLFQTRATTLQACL
DKAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREATAWSLREWDS
APPQIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVE
AYEARLARCRHALEKIENRLARLTR
>Z5062 radC, DNA repair protein
MKVKNNAQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEM
LENFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREE
SPLLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHSRLFSGTLNHV
EVHPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFM
DLRVLDHIVIGRGEYVSFAERGWI
>Z4002 recA, DNA strand exchange and renaturation, DNA-dependent ATPase, DNA-and ATP-dependent coprotease
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVNVIVVDSVAAL
TPKAEIEXEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>Z4137 recB, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MSDVAETLDPLRLPFQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF
PRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERL
LEEIDDKAQAAKWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ
QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY
LQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS
GIDRRKFNRSNQAKWIDKISAWAEEERNSYQLPESLEKFSQRFLEDRTKA
GGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRGELG
FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA
PGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK
MWLMEGESCGVGDYQSTMAQVSAAQIRDWLQAGQRGEALLMNGDDARPVR
ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL
QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDAVVEEFDGYRQ
IWRKRGVMPMMRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ
LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP
LVWLPFITNFRVQDQAFYHDRHSFEAVLDLNAAPESVDLAEVERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA
AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDALTAELNARTLQRLPGDN
WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVVSVVEEPTLTPHQFPRGAS
PGTFLHSLFEDLNFTQPVDPNWVQEKLELGGFESQWEPVLTEWITAVLQA
PLNETGVSLSQLSDRDKQVEMEFYLPISEPLIASQLDALIRQFDPLSAGC
PPLEFMQARGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA
AMQAHRYDLQYQLYTLALHRYLRHRIADYDYERHFGGVIYLFLRGVDKEH
PQQGIYTTRPNAGLIDLMDEMFAGMTLEEA
>Z4139 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTHELGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDHELPLFRDSENAGQLFNSDGEQDVGNPLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNALFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVSGEGDDIWYQRLWRQLTPE
TMEAIVEQSQRFLLPLFRFNQS
>Z4136 recD, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEESHPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPTGDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERAGQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRNPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFXSRE
>Z5191 recF, ssDNA and dsDNA binding, ATP binding
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA
ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>Z5078 recG, DNA helicase, resolution of Holliday junctions, branch migration
MGYYAGCRVSAMTGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLH
LPLRYEDRTHLYPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGS
GILTMRFFNFSAAMKNSLATGRRVLAYGEAKRGKYGAEMIHPEYRVQGDL
STPELQETLTPVYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQ
GMMTLPEALRTLHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLAL
RAGAQRFHAQPLSANDALKNKLLAALPFKPTGAQARVVAEIERDMALDVP
MMRLVQGDVGSGKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRN
WFEPLGIEVGWLAGKQKGKARLSQQEAIASGQVQMIVGTHAIFQEQVQFN
GLALVIIDEQHRFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTA
YADLDTSVIDELPPGRTPVTTVAIPDTRRTDIIDRVRHACITEGRQAYWV
CTLIEESELLEAQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASF
KQGELHLLVATTVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGA
VASHCVLLYKTPLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTR
QTGNAEFKVADLLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETER
YSNA
>Z4230 recJ, ssDNA exonuclease, 5'--> 3' specific
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERGIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQIEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFELFQQRFG
ELVTEWLDPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>Z3909 recN, protein used in recombination and DNA repair
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIVEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPETLPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALETARALHQQRQHYAEELAQLITDSMHALSMPHGQFTIDVKFD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLNKKTRLQELARLLGGSEVTRNTLANAKEL
LAA
>Z3846 recO, protein interacts with RecR and possibly RecF proteins
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGVTGTPEPALRRFELALLGHLGYGVNFTH
CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>Z5343 recQ, ATP-dependent DNA helicase
MNVAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMP
TGGGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNST
QTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEA
HCISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGL
NDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAK
VEDTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGM
GINKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWL
RRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCG
NCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIR
DYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAA
RPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIAD
ESNVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMAL
IRAHVDGDDEE
>Z0589 recR, recombination and repair
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>Z2410 recT, recombinase, DNA renaturation protein encoded by prophage CP-933R
MTKQPPIAKADLQKTQGNRAPAAIKNNDVISFINQPSMKEQLAAALPRHM
TAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYL
LPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFNF
EFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSQ
SKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTID
PADSSVLTGEYSVIDNSEE
>Z5288 rep, rep helicase, a single-stranded DNA dependent ATPase
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVXQTLGRKEARGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPA
QAAAEAKGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RERWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGTELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR
GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>Z5290 rhlB, putative ATP-dependent RNA helicase
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHCLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GANGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>Z1017 rhlE, putative ATP-dependent RNA helicase
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG
QGGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRK
PAAAQ
>Z0239 rnhA, RNase HI, degrades RNA of DNA-RNA hybrids, participates in DNA replication
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM
AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN
VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG
YQVEV
>Z0195 rnhB, RNAse HII, degrades RNA of DNA-RNA hybrids
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALCEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS
>Z2671 rnt, RNase T, degrades tRNA
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLPAAEEV
>Z1873 rus, endodeoxyribonuclease RUS (Holliday junction resolvase) of prophage CP-933X
MNTYSITLPWPPSNNRYYRHNRGRTHISAEGQAYRDNVARIIKGSMLDIG
LAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVV
KMPVTKGGKLELTITELGNE
>Z2913 ruvA, Holliday junction helicase subunit B; branch migration; repair
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDASSETLIREALR
AAL
>Z2912 ruvB, Holliday junction helicase subunit A; branch migration; repair
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP
>Z2915 ruvC, Holliday junction nuclease; resolution of structures; repair
MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA
GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF
EYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH
CHVSQNAMQMSESRLNLARGRLR
>Z3173 sbcB, exonuclease I, 3'--> 5' specific; deoxyribophosphodiesterase
MMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAIRTDDEFNVIGEPEVFY
CKPADDYLPQPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCIL
GYNNVRFDDEVTRNVFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPE
GINWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTR
QPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWRGNTSWVAPLA
WHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKTDLGDNAAVPVKL
VHINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAI
FAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVD
KRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYAEEIQML
AQQYAXDKEKVALLKALWQYAEXXV
>Z0495 sbcC, ATP-dependent dsDNA exonuclease
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL
EKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQEQQSLNWLT
RLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHTRQQIEEVNTRLQNTMALRASIRHHAAKQSAELQQQQQSLNT
WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLMLTADEVATALAQHAEQRPLRQRLVALHGQIVPQQKRLAQLMVTIQ
NVTLEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALR
GQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQP
WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTAL
AGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET
LPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQ
QLKQDADNRQQQQTLLQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQG
LTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTL
SGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDAL
DALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK
>Z0496 sbcD, ATP-dependent dsDNA exonuclease
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA
>Z3170 sbmC, SbmC protein
MNYEIKQEDKRTVAGFHLVGPWEQTVKKGFEQLMMWVDSKNIVPKEWVAV
YYDNPDETPAEKLRCDTVVTVPNNFTLPENSEGVILTEISGGQYAVAVAR
VVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYV
AVQPKHH
>Z0836 seqA, negative modulator of initiation of replication
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR
VASPAIVEAKPIKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS
LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN
TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI
>Z4656 smf, 
MVDTEIWLRLMSISSLYGDDMVRIAHWLARQSHIDAVVLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLATTDYPGALFVEG
ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLAKHGVTITSGLARGIDG
VAHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLLEQGGALVSEFPLD
VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPBAPENSFYS
PDQEDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>Z3859 srmB, ATP-dependent RNA helicase
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVADHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLIXDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>Z5658 ssb, ssDNA-binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMXMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>Z4974 tag, 3-methyladenine DNA glycosylase I
MERCGWVSQDPLYIAYHDNEWGVPETDRKKLFEMICLEGQQAGLSWITVL
KKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR
AYLQMEQNGEPFADFVWSFVNHQPQVTQATTLSEIPTSTPASDALSKALK
KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYPGNKP
>Z5361 tatD, 
MEYRMFDIGVNLTSSQFAKDRDDVVARAFDAGVNGLLITGTNLRESQQAQ
KLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDF
NRNFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKL
PGAVLHCFTGTREEMQACVAHGIYIGITGWVCDERRGLELRELLPLIPAE
KLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAA
TTDANVKTLFGIAF
>Z2536 topA, DNA topoisomerase type I, omega protein
MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADS
TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA
EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA
FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA
VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN
KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG
VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE
SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL
PAVDKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI
ISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSL
DQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP
TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED
AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR
IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP
PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG
WSAFYVDGKWVEGKK
>Z2796 topB, DNA topoisomerase III
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ
PDAYDSRYARWNLADLPIVPEKWQLQPRPSVTKQLNVIKRYLHEASEIVH
AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLR
SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT
PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ
DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE
AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA
ISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLISRQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL
GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM
TGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLIKKGRYIHS
TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY
QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEAGSG
AIA
>Z2993 tra8_2, IS30 transposase encoded within prophage CP-933T
MSDFINNVSVDSIGQRNSYVKTWGCGGLELWKNGTGFSEIANILGSKPGT
IFTMLRDTGGIKPHERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNR
SPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVL
EKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNI
QHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEG
DLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSEL
RKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQY
FPKKTCLAQYTQHEDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD
>Z3936 tra8_3, IS30 transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRG
RRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISG
WLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRR
HTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIDSF
F
>Z1947 umuC, SOS mutagenesis and repair
MFALCDVNAFYASCETVFRPELWGKPVVVLSNNDGCVIARNAEAKALGVK
MGDPWFKQKDLFRRCGVVCFSSNYELYADMSNRVMSTLEELSPRVEIYSI
DEAFCDLTGVRNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLAN
HAAKKWQRQTGGVVDLSNLXXQRKLMSALPVDDVWGIGRRISKKLDAMGI
KTVLDLADTDIRFIRKHFNVVLERTVRELRGEPCLQLEEFAPTKQEIICS
RSFGERITDYTSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFALN
EPYYGNSASVKLLTPTQDSRDIINAATRSLDAIWQAGHRYQKAGVMLGDF
FSQGVAQLNLFDDNAPRPGSEQLMAVMDTLNAKEGRGTLYFAGQGIQQQW
QMKRAMLSPRYTTRSSDLLRVK
>Z3864 ung, uracil-DNA-glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIATPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQHGETPIDWMPVLPAESE
>Z5657 uvrA, excision nuclease subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLYARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEXVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>Z0998 uvrB, DNA repair; excision nuclease subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>Z3001 uvrC, excinuclease ABC, subunit C; repair of UV damage to DNA
MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH
TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK
HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG
RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE
FEEAARIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR
QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN
LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ
QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR
AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL
AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP
DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML
LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>Z5330 uvrD, DNA-dependent ATPase I and helicase II
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE
NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA
HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK
DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW
LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD
QSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN
NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC
AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL
AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE
KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW
QDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY
VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV
SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV
AFQGQGIKWLVAAYARLETV
>Z3053 vsr, DNA mismatch endonuclease, patch repair protein
MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
ISRLQELGWRVLIVWECALRGREKLKDEALTERLEEWICGEGASAQIDTQ
GIHLLA
>Z5054 waaP, putative LPS biosynthesis enzyme
MVWMVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSY
FLKWHRGTTLKEIIKNLLSLRMPVLGADREWNAIHRLRDVGVDTMYGVAF
GEKGMNPLTRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVA
TMVHDMHAAGINHRDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRV
PRRWRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFAAPLKDILKQEQGLLS
QAEAKATKIRERTIRKSL
>Z3196 wbdQ, GDP-mannose mannosylhydrolase
MFLHSQDFATIVRSTPLISIDLIVENEFGEILLGKRINRPAQGYWFVPGG
RVLKDEKLQTAFERLTEIELGIRLPLSVGKFYGIWQHFYEDNSMGGDFST
HYIVIAFLLKLQPNILKLPKSQHNAYCWLSRAKLINDDDVHYNCRAYFNN
KTNDAIGLDNKDIICLMRQ
>Z3215 wcaH, GDP-mannose mannosyl hydrolase
MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG
GRVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFT
THYVVLGFRFRVAEEELLLPDEQHDDYRWLTPDALLASNDVHANSRAYFL
AEKRAGVPGL
>Z5328 xerC, site-specific recombinase, acts on cer sequence of ColE1, effects chromosome segregation at cell division
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQNWQQ
CDAAMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKTPRHLPKNIDVDDINRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNALSWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>Z4232 xerD, site-specific recombinase
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTLWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>Z3773 xseA, exonuclease VII, large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYXPRGDYXIIVE
SMQPAGEGLLQXKYEQLKAKLQAEGLFDLQYKKPLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPTAVQGDDAPGQIVRAIELANQCNECDV
LIVGRGGGSLEDLWSFNDERVARAIFASRIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRAGQQQQRLTQRLNQ
QNPQPKIHRAQTRIQQLEYRLAETLRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGKVLKKVKQVKAGEMLTTRLEDGWVESEVKNIQPVKK
SRKKVH
>Z0525 xseB, exonuclease VII, small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDASLTPFTPDNE
>Z2781 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDEEEAQRRIIMAEIPSPL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRENPVLIMG
DMNISPGDLDIGIGEENRKRWLRTGKCSFLPEEREWMERLMSWGLVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>Z0127 yacH, putative membrane protein
MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDA
AAVTLPATVSAPPVTPAVVKSAFSTAQIDQWVAPVALYPDALLSQVLMAS
TYPTNVAQAVQWSHDNPLKQGDAAIQAVSDQPWDASVKSLVAFPQLMALM
GENPQWVQNLGDAFLAQPQDVMDSVQRLRQLAQQTGSLKSSTEQKVITTT
KKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPGNPDVVYIPNY
NPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSS
IDWDDDDHDHHHHDDDNYHHHDGGHRDGNGWQHNGDNINIDVNNFNRITG
EHLTDKNMAWRHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTR
DSQRQAAASQFQQRTHAAPVITRDTQRQAAAQRFNEAEHYGSYDDFRDFS
RRQPLTQQQKDAARQRYQSASPEQRQAVHEKMQTNPQNQQRREAARERIQ
PASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQVFKEKVQQRP
LNQQQRDNARQRVQSASPEQRQVFREKVQESRPQRLNDSNHTARLNNEQR
SAVRERLSERGARRLER
>Z0288 yafM, orf, hypothetical protein
MSEYRRYYIKGGTWFFTVNLRNRRSHLLTAQFQMLRNAIINVKRDRPFEI
NAWVVLPEHMHCIWTLPESDDDFSSRWREIKKQFTHACGLKNIWQPRF
>Z0492 yaiD, orf, hypothetical protein
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG
SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK
KTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDT
LALLRKSLGSLPVVPLSMANPIELTLTEWVRSGSAAQGFQLLDEAELKSL
LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL
KRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGE
AQR
>Z0549 ybaV, orf, hypothetical protein
MKHGIKALLITLSLACAGMSHSALAAASVAKPTAVETKAEAPAAQNKAAV
PAKASDEEGSRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV
EDLKQVPGMGNSLVERNLAVLTL
>Z0566 ybaZ, orf, hypothetical protein
MRLHSGVFPDYAEKLSQEEKMEKEDSFPQRVWQIVAAIPEGYVTTYGDVA
KLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTISLTGPDLQRQRQA
LLAEGVMVSGSGQIDLQRYRWNY
>Z1110 ybjD, orf, hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQAHHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRVAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>Z1238 ycaJ, putative polynucleotide enzyme
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI
AELVNGDARRALNTLEMMADMAEVDDSGKRVLXPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>Z1739 ycfH, orf, hypothetical protein
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM
RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY
YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC
GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRL
LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF
ARLFHIDASRLQSIR
>Z2856 yeaB, orf, hypothetical protein
MEYRSLTLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASVIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>Z3443 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAA
TRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIEDFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRRAHELRG
>Z3509 yfaO, orf, hypothetical protein
MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWALSGGGVEPGERIEEAL
RREIREELGEQLLLTEITPWTFSDDIRTKTYADGRKEEIYMIYLIFDCVS
ANREVKINEEFQDYAWVKPEDLAHYDLNVATRKTLRLKGLL
>Z3723 yffH, orf, hypothetical protein
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNAKKKTVVLIRQFRVATWVNGNESGQLIETCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQMSHLID
>Z3895 yfiL, orf, hypothetical protein
MKTIRYALKKEKEMMKKFIAPLLALLVSGCQIDPYTHAPTLTSTDWYDVG
MEDAISGSAIKDDDAFSDSQADRGLYLKGYAEGQKKTCQTDFTYARGLSG
KSFPASCNNVENASQLHEVWQKGADENASAIRLN
>Z4147 ygdP, putative invasion protein
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNASAYRRKRG
>Z4421 ygjF, orf, hypothetical protein
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDHQ
LKPQEAQHLLDYRCGVTKLVDRPTVQANEISKQELHAGGRKLIEKIEDYQ
PQALAILGKQAYEQGFSQRGAQWGKQTLSIGSTQIWVLPNPSGLSRVSLE
KLVEAYRELDQALVVRGR
>Z4516 yhbQ, orf, hypothetical protein
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAKLLSSLQTPEIKSD
>Z4622 yhdJ, putative methyltransferase
MTMRTGCEPTRFGNEAKTIIHGDALAELKKLPTESVDLIFADPPYNIGKN
FDGLIEAWKEDLFIDWLLEVIAECHRVLKKQGSMYIMNSTENMPFIDLQC
RKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAIL
VEAKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ
KPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEY
IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLITK
>Z4839 yhhF, orf, hypothetical protein
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
ETLIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>Z5073 yicF, putative enzyme
MKMKVWMAILISILCWQSSVWAVCPAWSPARAQEEIFRLQQQIKQWDDDY
WKEGKSEVEDGVYDQLSARLTQWQRCFVSEPRDVMMPPLNGAVMHPVAHT
GVRKMADKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNG
LKGEDWTQKVSLISAVPQTVSGPLANSTLQGEIFLQREGHIQQQMGGINA
RAKVAGLMMRQDDSDTLNSLGVFVWAWPDGPQLMTDRLKELATAGFTLTQ
RYTRAVKNADEVARVRNEWWKAKLPFVTDGVVVRXAKEPESRHWLPGQAE
WLVAWKYQPVAQVAKVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNI
GSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFN
SLTCYFASDVCQEQFISRLVWLGSKQVLGLDGIGEAGWRALHQTHRFEHI
FSWLLLTPEQLQNTPGIAKSKSAQLWHRFNLARKQPFTRWVMAMGIPLTR
AALNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGS
WLAAQQITGFEP
>Z5571 yjaD, orf, hypothetical protein
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLIQQQRRYDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKHLRYVTSQPWPFPQSL
MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>Z5980 yjjV, 
MQALAENYQPLYAALGLHPGMLEKHSDVSLDQLQQALERRPAKVVAVGES
GLDLFGDDPQFERQQWFLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRH
DLSRTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRVVIAKLP
LASLLLETDAPDMPLNGFQGQPNRPEQAARVFDVLCELRPEPEDEIAEVL
LNNTYRCLTFVGSLPXVGSIRQSAHNA
>Z4294 yqgF, orf, hypothetical protein
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNLIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKIDSASAVIILESYFEQGY
>Z4391 yqiE, orf, hypothetical protein
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>Z4507 yraN, orf, hypothetical protein
MATVPTRSGSPRQLTTKQTGDAWEVQARRWLEGKGLRFVAANVNERGGEI
DLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDTFNDHS
>Z4654 yrdD, putative DNA topoisomerase
MRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSSADGHIVKV
LEGQVCPACGANLVLRQGRFGMFIGCINYPECEHTELIDKPDETAITCPQ
CRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPECHYPLLIEK
KTAQGVKHFCASKQCGKPVSAE
>Z4751 yrfE, orf, hypothetical protein
MSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVFEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV