TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Escherichia coli K12, K-12
Gene type: CDS

Number of genes found: 235

Free access
Sort by:

 



# Escherichia coli K12, K-12

>b3023 orf, hypothetical protein
MTNLTLDVNIIDFPSIPVAMLPHRCSPELLNYSVAKFIMWRKETGLSPVN
QSQTFGVAWDDPATTAPEAFRFDICGSVSEPIPDNRYGVSNGELTGGRYA
VARHVGELDDISHTVWGIIRHWLPASGEKMRKAPILFHYTNLAEGVTEQR
LETDVYVPLA
>b2299 putative regulator
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGK
ILVQRRTETKDFLPGMLDATAGGVVQADEQLLESARREAEEELGIAGVPF
AEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARC
DEFTPDSLKALALWMKRNAKNEAVETETAE
>b0540 transposase insE for insertion sequence IS3
MTHMTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLY
NWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKR
LK
>b0257 putative transposase
MATLCHVFGVHRSSYKYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGAR
SIATMATQRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNH
LERQFAVTEPNQVWCGDVTYSVPGVQGGHGCLNEPRVCLEY
>b0373 transposase insE for insertion sequence IS3
MTHMTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLY
NWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKR
LK
>b1458 orf, hypothetical protein
MSIQSLLDYISVTPDIRQQGKVKHKLSAILFLTVCAVIAGADEWQEIEDF
GHERLEWLKKYGDFDNGIPVDDTIARVVSNIDSLAFEKMFIEWMQECHEI
TDGEIIAIDGKTIRGSFDKGKRKGAIHMVSAFSNENGVVLGQVKTEAKSN
EITAIPELLNLLYLKKNLITIDAMGCQKDIASKIKDKKADYLLAVKGNQG
KLHHAFEEKFPVNVFSNYKGDSFSTQEISHGRKETRLHIVSNVTPELL
>b1578 orf, hypothetical protein
MPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDG
FEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERR
FGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIA
ESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPRE
YLRQRACNGLSDNRCLEI
>b0298 transposase insE for insertion sequence IS3
MTHMTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLY
NWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKR
LK
>b1936 orf, hypothetical protein
MVVTTSDVVMCQMRRSDVQGGYRVYGSWMAENVQDQVSILNQKLSEFAPS
MPHAVRSDVINNRLQNLHLHAHHFLIRRHQLITHLNPHLHRN
>b2088 transposase insE for insertion sequence IS3
MTHMTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLY
NWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKR
LK
>b1027 transposase insE for insertion sequence IS3
MTHMTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLY
NWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKR
LK
>b1459 orf, hypothetical protein
MSFRQKKEDKSAEGVSIRYYISSKDMDAKEFAHAIRAHWLIEHSLHWVLD
VKMNEDASRIRRGNAA
>b4285 putative transposase
MPFYFRKECPLNSGYLRKNRPEKPDGRRAVLRSQVLELHGISHGSAGARS
IATMATRRGYQMGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNYL
ERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSPD
SRLTMKALEMAWETRGKPVGVMFQAIKAVIIRAGSSGSYCGDTGSGRV
>b2213 ada, O6-methylguanine-DNA methyltransferase; transcription activator/repressor
MKKATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRE
NVSFYANASEALAAGFRPCKRCQPEKANAQQHRLDKITHACRLLEQETPV
TLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESV
TTSILNAGFPDSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGR
CLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLMFQQHVREVI
ASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKP
KAVRAVASACAANKLAIIIPCHRVVRGDGTLSGYRWGVSRKAQLLRREAE
NEER
>b2068 alkA, 3-methyl-adenine DNA glycosylase II, inducible
MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTA
IPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA
ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP
EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG
DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP
AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>b2212 alkB, DNA repair system specific for alkylated DNA
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMV
TPGGYTMSVAMTNCGHLGWTTHRQGYLYSPIDPQTNKPWPAMPQSFHNLC
QRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGL
PAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLT
IDCRYNLTFRQAGKKE
>b3387 dam, DNA adenine methylase
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRY
ILADINSDLISLYNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNK
SQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYH
FAEKAQNAFFYCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNS
FTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKVRRS
ISSNGGTRKKVDELLALYKPGVVSPAKK
>b1343 dbpA, ATP-dependent RNA helicase
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTG
SGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPN
TKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTL
VMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQR
DPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNT
KKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDV
AARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEA
QRANIISDMLQIKLNWQTPPANSSIATLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKT
CRVRLLK
>b1961 dcm, DNA cytosine methylase
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAIL
KRALANDSAWHRLSEKEFAHLQTLLPKPPAHHPHYAFRFIDLFAGIGGIR
RGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKE
GVSDEAAAEHIRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACD
TQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRIIMQTLDELGY
DVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDIS
ECFPAQRVTLAQLLDPMVEAKYILTPVLWKYLYRYAKKHQARGNGFGYGM
VYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLL
EPKIKQAVALRQQEAQHGRRSR
>b3162 deaD, inducible ATP-independent RNA helicase
MMSYVDWPPLILRHTYYMAEFETTFADLGLKAPILEALNDLGYEKPSPIQ
AECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVLAPT
RELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTP
GRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQT
ALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKN
EALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQAL
REQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRI
GRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRR
LEKFAAKVQQQLESSDLDQYRALLSKIQPTAEGEELDLETLAAALLKMAQ
GERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRERRDV
GDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIEL
PKGMPGEVLQHFTRTRILNKPMNMQLLGDAQPHTGGERRGGGRGFGGERR
EGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>b0231 dinB, DNA polymerase IV
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARK
FGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPL
SLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAK
IASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTC
GDVQKCDLVMLLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMA
EDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ
EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLG
L
>b0799 dinG, probably ATP-dependent helicase
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHL
AIEAPTGVGKTLSYLIPGIAIAREEQKTLVVSTANVALQDQIYSKDLPLL
KKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQ
EEQKRCAKLKGDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCY
YYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDPKNLLLVLDEG
HHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAI
PERLNAHCEELYELIASLNNILNLYMPAGQEAEHRFAMGELPDEVLEICQ
RLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWR
SIPHIIVTSATLRSLNSFSRLQEMSGLKEKAGDRFVALDSPFNHCEQGKI
VIPRMRVEPSIDNEEQHIAEMAAFFRKQVESKKHLGMLVLFASGRAMQRF
LDYVTDLRLMLLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDL
KGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEVQSLPSASFNL
IQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEG
IVKKKEKTKSPRRRRR
>b0226 dinJ, damage-inducible protein J
MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFD
LREPNQLTIQSIKNSEAGIDVHKAKDADDLFDKLGI
>b3702 dnaA, DNA biosynthesis; initiation of chromosome replication; can be transcription regulator
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDW
VRDKYLNNINGLLTSFCGADAPQLRFEVGTKPVTQTPQAAVTSNVAAPAQ
VAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLA
RAAARQVADNPGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVY
MHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFFANKERSQEEF
FHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELE
TRVAILMKKADENDIRLPGEVAFFIAKRLRSNVRELEGALNRVIANANFT
GRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREE
SHDIKEDFSNLIRTLSS
>b4052 dnaB, replicative DNA helicase; part of primosome
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDD
VAERVVADDFYTRPHRHIFTEMARLQESGSPIDLITLAESLERQGQLDSV
GGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQ
GRTSEDLLDLAESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLVENAAM
LQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGT
MGILLEKRNIYIDDSSGLTPTEVRSRARRIAREHGGIGLIMIDYLQLMRV
PALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVR
LTFNGQWSRFDNYAGPQYDDE
>b4361 dnaC, chromosome replication; initiation and chain elongation
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRA
MKMQRTFNRSGIRPLHQNCSFENYRVECEGQMNALSKARQYVEEFDGNIA
SFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSG
TSEEQLLNDLSNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGM
LTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRVTGKEY
>b0184 dnaE, DNA polymerase III, alpha subunit
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGL
VKFYGAGHGAGIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLL
ISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSAL
VDECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATN
DVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYMRSEEEMCELF
ADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL
EERLAFLFPDEEERLKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQW
SKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLG
HPYGFVDRISKLIPPDPGMTLAKAFEAEPQLPEIYEADEEVKALIDMARK
LEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAG
LVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML
QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDN
FIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGY
TLGGADMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEK
FAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAII
EARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN
SLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVL
DGERETLGLYLTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLV
VAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQLLEKDRILIV
SGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNR
LRQSLEPHRSGTIPVHLYYQRADARARLRFGATWRVSPSDRLLNDLRGLI
GSEQVELEFD
>b3066 dnaG, DNA biosynthesis; DNA primase
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSF
TVNGEKQFYHCFGCGAHGNAIDFLMNYDKLEFVETVEELAAMHNLEVPFE
AGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSH
EVIARFAIGFAPPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRF
RERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFHKGRQLYGLYE
AQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRA
TNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLV
RKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTT
MRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTT
GQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELR
QEELIARERTHGLSNEERLELWTLNQELAKK
>b3701 dnaN, DNA polymerase III, beta-subunit
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLE
MEMVARVALVQPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVR
SGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQD
VRYYLNGMLFETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGV
IELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGRFPDYRRVLPK
NPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE
EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIE
DAASQSAAYVVMPMRL
>b0215 dnaQ, DNA polymerase III, epsilon subunit
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHV
YLKPDRLVDPEAFGVHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNA
AFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCA
RYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEA
TIQRIVRQASKLRVVFATDEEIAAHEARLDLVQKKGGSCLWRA
>b0470 dnaX, DNA polymerase III, tau and gamma subunits; DNA elongation factor III
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVG
KTSIARLLAKGLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTK
VEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHV
KFLLATTDPQKLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEP
RALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGTLDDDQAL
SLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPA
ALGNDMAAIELRMRELARTIPPTDIQLYYQTLLIGRKELPYAPDRRMGVE
MTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERL
ASVTDRVQARPVPSALEKAPAKKEAYRWKATTPVMQQKEVVATPKALKKA
LEHEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDN
AVCLHLRSSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPL
EWRQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESIRPI
>b2945 endA, DNA-specific endonuclease I
MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGC
KINWQGKKGVVDLQSCGYQVRKNENRASRVEWEHVVPAWQFGHQRQCWQD
GGRKNCAKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQ
CAMKVDFKEKAAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNK
MYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>b2798 exo, 5'-3' exonuclease
MRGLFPISHPAVACSGIECYPYRLIFKGVIVAVHLLIVDALNLIRRIHAV
QGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGR
PPMPEELHDEMPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQ
ATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGVQPQQLPDYWG
LAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLET
HKEMAFLCRDIARLQTDLHIDGNLQQLRLVR
>b1844 exoX, exodeoxyribonuclease X
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVRPDRPISPQAMAI
HRITEAMVADKPWIEDVIPHYYGSEWYVAHNASFDRRVLPEMPGEWICTM
KLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDI
MNTSGWTAEQMADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWLFN
NLDSMSPELRLTLKHYLENT
>b4312 fimB, recombinase involved in phase variation; regulator for fimA
MKNKADNKKRNFLTHSEIESLLKAANTGPHAARNYCLTLLCFIHGFRASE
ICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEVQALKNWLSIRTSYP
HAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRHSCGFA
LANMGIDTRLIQDYLGHRNIRHTVWYTASNAGRFYGIWDRARGRQRHAVL
>b4313 fimE, recombinase involved in phase variation; regulator for fimA
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHY
QDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTD
AIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERG
ADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>b3261 fis, site-specific DNA inversion stimulation factor; DNA-binding protein; a trans activator for transcription
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDL
YELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
>b2231 gyrA, DNA gyrase, subunit A, type II topoisomerase
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYD
GTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDD
EDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEV
DAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDG
MRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDI
IAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYL
TEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLME
VIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVK
YQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVY
SMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMAT
ANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAE
GKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQN
GYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDT
IDGSAAEGDDEIAPEVDVDDEPEEE
>b3699 gyrB, DNA gyrase subunit B, type II topoisomerase, ATPase activity
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDE
ALAGHCKEIIVTIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHA
GGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVP
QAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSG
VSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIFYFSTEKDGI
GVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKE
GYSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVE
QQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKI
LNVEKARFDKMLSSQEVATLITALGCGIGRDEYNPDKLRYHSIIIMTDAD
VDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDE
AMDQYQISIALDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERR
YPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDKEQHGSQWKFD
VHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGL
LEEDAFIERGERRQPVASFEQALDWLVKESRRGLSIQRYKGLGEMNPEQL
WETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>b2496 hda, putative DNA replication factor
MVNFSRFCEILVEVSLNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQ
NVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLDKRTWF
VPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT
GDRPPRQLNLGLPDLASRLDWGQIYKLQPLSDEDKLQALQLRARLRGFEL
PEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>b0962 helD, DNA helicase IV
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKR
GLVWGELEFVLPDEKVVRLHGTEWGETQRFYHHLDAHWRRWSGEMSEIAS
GVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFD
NCREAWRKCQAWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPA
QARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEASPEQILLLAFG
RKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDT
AARHELFIAEWRKQCSEKKAQAKGWRQWLTEEMQWSVPEGNFWDDEKLQR
RLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALL
AALRKQNSQTTLFAVGDDWQAIYRFSGAQMSLTTAFHENFGEGERCDLDT
TYRFNSRIGEVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALL
DKLSGYAKPEERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQ
QADYVIIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDAEERRLMYVAL
TRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>b0059 hepA, probable ATP-dependent RNA helicase
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPV
TRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREV
FLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQ
RTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAA
ERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQ
LVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIE
QLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYR
PVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKV
SGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTS
HRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWF
AEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRI
GQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDL
INYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQ
ALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPD
FPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGN
NLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSAR
ALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESL
DQAGWRLDALRLIVVTHQ
>b1712 himA, integration host factor (IHF), alpha subunit; site specific recombination
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFG
NFDLRDKNQRPGRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPKDE
>b0912 himD, integration host factor (IHF), beta subunit; site-specific recombination
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGS
FSLHYRAPRTGRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG
>b0640 holA, DNA polymerase III, delta subunit
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEH
HTFSIDPNTDWNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTL
TGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRW
VAARAKQLNLELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLP
RVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEGSEPVILLRTL
QRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQTQLRQ
AVQLLTRTELTLKQDYGQSVWAELEGLSLLLCHKPLADVFIDG
>b1099 holB, DNA polymerase III, delta prime subunit
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLC
QQPQGHKSCGHCRGCQLMQAGTHPDYYTLAPEKGKNTLGVDAVREVTEKL
NEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPER
LLATLRSRCRLHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGA
ALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQAPARLHWLAT
LLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQL
MSVTGINRELLITDLLLRIEHYLQPGVVLPVPHL
>b4259 holC, DNA polymerase III, chi subunit
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAY
RLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSRRDILISLR
TSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>b4372 holD, DNA polymerase III, psi subunit
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALT
DPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSHCNSWRLGTDEPLSLEG
AQVASPALTDLRANPTARAALWQQICTYEHDFFPRND
>b1413 hrpA, helicase, ATP-dependent
MLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARP
EITYPDNLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGR
GIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRFSDHVSDNTM
VKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPR
RPDLKIIITSATIDPERFSRHFNNAPIIEVSGRTYPVEVRYRPIVEEADD
TERDQLQAIFDAVDELSQESHGDILIFMSGEREIRDTADALNKLNLRHTE
ILPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTA
RISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYSEDDFLS
RPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLL
EELGAITTDEQASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAM
IITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVNLWNYLGEQQ
KALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEY
REIHIALLTGLLSHIGMKDADKQEYTGARNARFSIFPGSGLFKKPPKWVM
VAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMAT
EKVTVYGLPIVAARKVNYSQIDPALCRELFIRHALVEGDWQTRHAFFREN
LKLRAEVEELEHKSRRRDILVDDETLFEFYDQRISHDVISARHFDSWWKK
VSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGA
DADGVTVHIPLPLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFV
PAPNYAEAFLGRVKPLELPLLDSLERELRRMTGVTVDREDWHWDQVPDHL
KITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHI
WSFGQLPESYEQKRGNYKVKAWPALVDERDSVAIKLFDNPLEQKQAMWNG
LRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDK
LIDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKR
LKGRVDMTMALGLSDIKAQMGGLVYRGFVTGNGFKRLGDTLRYLQAIEKR
LEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIE
ELRVSYFAQQLGTPYPISDKRILQAMEQISG
>b0148 hrpB, helicase, ATP-dependent
MLQCGAKNVNPLERFVSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKST
WLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTVGYRMR
AQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLA
LALLLDVQQGLRDDLKLLIMSATLDNDRLQQMLPEAPVVISEGRSFPVER
RYLPLPAHQRFDDAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASR
IGSDVLLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIR
LVVDCAQERVARFDPRTGLTRLITQRVSQASMTQRAGRAGRLEPGISLHL
IAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPVVN
LLAAKRLLQMLGALEGERLSAQGQKMAALGNDPRLAAMLVSAKNDDEAAT
AAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRGGEADS
SLIAPLLAGAFADRIARRRGQDGRYQLANGMGAMLDANDALSRHEWLIAP
LLLQGSASPDARILLALLVDIDELVQRCPQLVQQSDTVEWDDAQGTLKAW
RRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRL
RLLCAAKWLPEYDWPAVDDESLLAALETWLLPHMTGVHSLRGLKSLDIYQ
ALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYHEDNPPALAVRMQEMF
GEATNPTIAQGRVPLVLELLSPAQRPLQITRDLSDFWKGAYREVQKEMKG
RYPKHVWPDDPANTAPTRRTKKYS
>b4000 hupA, DNA-binding protein HU-alpha (HU-2)
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTF
KVNHRAERTGRNPQTGKEIKIAAANVPAFVSGKALKDAVK
>b0440 hupB, DNA-binding protein HU-beta, NS1 (HU-1)
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTF
AVKERAARTGRNPQTGKEITIAAAKVPSFRAGKALKDAVN
>b0022 insA_1, IS1 protein InsA
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR
>b0265 insA_2, IS1 protein InsA
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR
>b0275 insA_3, IS1 protein InsA
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRATARIMGVGLNTIFRHLKNSGRSR
>b1894 insA_5, IS1 protein InsA
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR
>b3444 insA_6, IS1 protein InsA
MASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFTYTASQP
GTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR
>b4294 insA_7, IS1 protein InsA
MASISIRCPSCSATEGVVRNGKSTAGHQRYLCSPCRKTWQLQFTYTASQP
GKHQKIIDMAMNGVGCRASARIMGVGLNTVLRHLKNSGRSR
>b0021 insB_1, IS1 protein InsB
MPGNSPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>b0264 insB_2, IS1 protein InsB
MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
EQHDKVIGHYLNIKHYQ
>b0274 insB_3, IS1 protein InsB
MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
EQHDKVIGHYLNIKHYQ
>b0988 insB_4, IS1 protein InsB
MPGNSPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERYNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>b1893 insB_5, IS1 protein InsB
MPGNSPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>b3445 insB_6, IS1 protein InsB
MPGNSPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYV
GAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMT
DGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSV
ELHDKVIGHYLNIKHYQ
>b2622 intA, prophage CP4-57 integrase
MARKTKPLTDTEIKAAKPKDADYQLYDGDGLTLLIKSSGSKLWQFRYYRP
LTKQRTKQSFGAYPAVSLSDARKLRAESKVLLAKDIDPQEHQKEQVRNSQ
EAKTNTFLLVAERWWNVKKTSVTEDYADDIWRSLERDIFPAIGDISITEI
KAHTLVKAVQPVQARGALETVRRLCQRINEVMIYAQNTGLIDAVPSVNIG
KAFEKPQKKNMPSIRPDQLPQLMHTMRTASISMSTRCLFMWQLLTITRPA
EAAEARWDEIDFNASEWKIPAARMKMNRDHTVPLSDGALAILEMMKPLSG
GREFIFPSRIKPNQPMNSQTVNAALKRAGLGGVLVSHGLRSIASTALNEE
GFPPDVIEAALAHVDKNEVRRAYNRSDYLEQRRPMMQWWADLVKAADSGS
IVLTHLSKIRLVG
>b4271 intB, prophage P4 integrase
MHLLVHPNGSKYWRLQYRYEGKQKMLALGVYPEITLADARVRRDEARKLL
ANGVDPGDKKKNDKVEQSKARTFKEVAIEWHGTNKKWSEDHAHRVLKSLE
DNLFAALGERNIAELKTRDLLAPIKAVEMSGRLEVAARLQQRTTAIMRYA
VQSGLIDYNPAQEMAGAVASCNRQHRPALELKRIPELLTKIDSYTGRPLT
RWAIELTLLIFIRSSELRFARWSEIDFEASIWTIPPEREPIPGVKHSHRG
SKMRTTHLVPLSTQALAILKQIKQFYGAHDLIFIGDHDSHKPMSENTVNS
ALRVMGYDTKVEVCGHGFRTMACSSLVESGLWSRDAVERQMSHMARNSVR
AAYIHKAEHLEERRLMLQWWADFLDVNRERFISPFEYAKINNPLKQ
>b0537 intD, prophage DLP12 integrase
MSLFRRNEIWYASYSLPGGKRIKESLGTKDKRQAQELHDKRKAELWRVEK
LGDLPDVTFEEACLRWLEEKADKKSLDSDKSRIEFWLEHFEGIRLKDISE
AKIYSAVSRMHNRKTKEIWKQKVQAAIRKGKELPVYEPKPVSTQTKAKHL
AMIKAILRAAERDWKWLEKAPVIKIPAVRNKRVRWLEKEEAKRLIDECPE
PLKSVVKFALATGLRKSNIINLEWQQIDMQRRVAWVNPEESKSNRAIGVA
LNDTACKVLRDQIGKHHKWVFVHTKAAKRADGTSTPAVRKMRIDSKTSWL
SACRRAGIEDFRFHDLRHTWASWLIQSGVPLSVLQEMGGWESIEMVRRYA
HLAPNHLTEHARKIDDIFGDNVPNMSHSEIMEDIKKA
>b1140 intE, prophage e14 integrase
MAARPRKNNVSVPNLYPLYSRKVNKVYWRYKHPVTGKFHALGTNEAEAIA
IATEANTRLAEQRTRQILAISDRIATSKGKAITTSTWLDRYQAIQDDRLK
SGDIRLNTYKQKAKPVSLLRERAGMKLISAVDVRDIAQLLDEYIAAGRPR
MAQVVRSVLIDVFKEAQHYGEVPPGYNPALATKQPRRKITRQRLSLEEWK
KIFDIADATHRYMGNAMLLALVTGQRLGDISRMKFSDIWDDHLHVIQEKT
GSKIAIPLSLRLNAINWSLRDVVARCRDYAVSAYLVHFFRSTSQAERGAQ
VKANTLTMNFSKARDLARIDWGEGSPATFHEQRSLSERLYKEQGLDTQKL
LGHKTQQQTDRYHDDRGKGWSKVAL
>b0281 intF, putative phage integrase
MFIPSIYLHQQLHYCKTAILNWSRKMALSRQKFTFERLRRFTLPEGKKQT
FLWDADVTTLACRATSGAKAFVFQSVYAGKTLRMTIGNINDWKIDDARAE
ARRLQTLIDTGIDPRIAKAVKIAEAESLQAESRKTKVTFSVAWEDYLQEL
RTGISAKTKRPYSTRYIADHINLSSRGGESKKRGQGPTSAGPLASLLNLP
LSELTPDYIAAWLSTERQNRPTVTAHAYRLLRAFIKWSNYQKKYQGIIPG
DLAQDYNVRKMVPVSASKADDCLQKEQLKSWFSAVRSLNNPIASAYLQVL
LLTGARREEIASLRWSDVDFKWSSMRIKDKIEGERIIPLTPYVSELLNVL
AQSPNSDVNKEGWVFRSNSKSGKIIEPRSAHNRALVLAELPHISLHGLRR
SFGTLAEWVEVPTGIVAQIMGHKPSALAEKHYRRRPLDLLRKWHEKIETW
ILNEAGITIKNNVDMR
>b1579 intQ, putative lambdoid prophage Qin defective integrase
MITDVWKYRGKSTGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQD
LTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEK
LLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGY
LADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTG
VRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKP
AIDALRSQMTLTRLSKEHIIDVHFREYGRTEKQKCTFVFQPEVSARVKNY
GDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIAN
QMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN
>b1345 intR, lambdoid prophage Rac integrase
MSKLPTGVEIRGRYIRIWFMFRGKRCRETLKGWEITNSNIKKAGNLRALI
VHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTT
NTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPR
SNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPD
PLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDLEKGIV
NVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEIT
FYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIR
RRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMND
EQIAMLNARLS
>b2349 intS, putative prophage CPS-53 integrase
MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKIL
TVGKYPLMTLQEARDKAWTARKDISVGIDPVKAKKASSNNNSFSAIYKEW
YEHKKQVWSVGYATELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFED
RGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFL
PADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKELRSMLWKNVDFEN
RIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKK
KPICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHEWPADAIEVQLAH
ANGGSVRGIYNHAQYLDKRREMMQWWADWLDEKVE
>b2442 intZ, putative prophage integrase
MPKHARYSSRRTRRIHDRGYPVVKSRYPVLSKTPLTAKAIDAAQPQDKPY
KLTDSLTPGLFLLVHPNGSKYWRFRYWLNKREFLQAIGVYPLITLKEARR
RATESRSLIANGINPVEQARKEKAIDALNMAAGFKKVAEDWFATRVGGWS
ESYAKQVRSALEKDVYPVLGKRSIVDITARDVLALLQKKERTAPEQARKL
RRRIGEIFKFAVITELVTRNPVADLDTALKARRPGHNAWIPISEIPAFYK
ALERAGSVQIQTAIRLLILTALRTAELRLCRWEWINLEDATITLPAEVMK
ARRPHVVPLSRQAVELLQDQFTRSGYSAFVFPGRFMDKPLSASAILKALE
RIGYKSIATGHGWRTTFSTALNESGRYSPDWIEIQLAHVPKGIRGVYNQA
AYLKQRRAMMQDYADAIDSILAGNGNPLEPE
>b2411 ligA, DNA ligase
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHP
ELITPDSPTQRVGAAPLAAFSQIRHEVPMLSLDNVFDEESFLAFNKRVQD
RLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVR
TIRAIPLKLHGENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPR
NAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHLGRLLQFKKWG
LPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLAQQEQL
GFVARAPRWAVAFKFPAQEQMTFVRDVEFQVGRTGAITPVARLEPVHVAG
VLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMD
VDGMGDKIIDQLVEKEYVHTPADLFKLTAGKLTGLERMGPKSAQNVVNAL
EKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVP
DVGIVVASHVHNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAG
KTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVIAGEAAGSKLA
KAQELGIEVIDEAEMLRLLGS
>b4346 mcrB, component of McrBC 5-methylcytosine restriction system
MRKAYLMESIQPWIEKFIKQAQQQRSQSTKDYPTSYRNLRVKLSFGYGNF
TSIPWFAFLGEGQEASNGIYPVILYYKDFDELVLAYGISDTNEPHAQWQF
SSDIPKTIAEYFQATSGVYPKKYGQSYYACSQKVSQGIDYTRFASMLDNI
INDYKLIFNSGKSVIPPMSKTESYCLEDALNDLFIPETTIETILKRLTIK
KNIILQGPPGVGKTFVARRLAYLLTGEKAPQRVNMVQFHQSYSYEDFIQG
YRPNGVGFRRKDGIFYNFCQQAKEQPEKKYIFIIDEINRANLSKVFGEVM
MLMEHDKRGENWSVPLTYSENDEERFYVPENVYIIGLMNTADRSLAVVDY
ALRRRFSFIDIEPGFDTPQFRNFLLNKKAEPSFVESLCQKMNELNQEISK
EATILGKGFRIGHSYFCCGLEDGTSPDTQWLNEIVMTDIAPLLEEYFFDD
PYKQQKWTNKLLGDS
>b1114 mfd, transcription-repair coupling factor; mutation frequency decline
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQ
NALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQDIISSRLSTLYQLPT
MQRGVLIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGY
RHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQ
RTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSK
GTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTL
ARFENRGVDPMRPLLPPQSLWLRVDELFSELKNWPRVQLKTEHLPTKAAN
ANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDTVRNLALICESDLL
GERVARRRQDSRRTINPDTLIRNLAELHIGQPVVHLEHGVGRYAGMTTLE
AGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAW
SRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFE
TTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVDNHK
QVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVA
EGKIDILIGTHKLLQSDVKFKDLGLLIVDEEHRFGVRHKERIKAMRANVD
ILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSMVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMREREL
ERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLR
GRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDL
EIRGAGELLGEEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQ
QTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELEEIKVELIDRF
GLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWL
IGLLQKQPQHYRLDGPTRLKFIQDLSERKTRIEWVRQFMRELEENAIA
>b2831 mutH, methyl-directed mismatch repair
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGW
IGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPL
TGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEE
DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGAR
GERILTLPRGFYLKKNFTSALLARHFLIQ
>b4170 mutL, enzyme in methyl-directed mismatch repair
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGG
AKLIRIRDNGCGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALA
SISSVSRLTLTSRTAEQQEAWQAYAEGRDMNVTVKPAAHPVGTTLEVLDL
FYNTPARRKFLRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQYR
AVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWVADPNHTTPAL
AEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD
VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQQQLETPLPLDDEPQPAPRS
IPENRVAAGRNHFAEPAAREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKLKAPEPQEPALAANSQSFGRVLTIVHSDCALLE
RDGNISLLSLPVAERWLRQAQLTPGEAPVCAQPLLIPLRLKVSAEEKSAL
EKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQS
VFEPGNIAQWIARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLL
QSVDLHPAIKALKDE
>b3635 mutM, formamidopyrimidine DNA glycosylase
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVL
SVQRRAKYLLLELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNG
KVLRYTDPRRFGAWLWTKELEGHNVLTHLGPEPLSDDFNGEYLHQKCAKK
KTAIKPWLMDNKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLAR
VIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRKGEPCRVCGTP
IVATKHAQRATFYCRQCQK
>b2733 mutS, methyl-directed mismatch repair
MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQL
LDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPA
TSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDI
SSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRP
LWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRT
TLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVT
PMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLER
ILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAER
YIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALA
ELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANP
LNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPI
DRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTS
TYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDAL
EHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLK
SLV
>b0099 mutT, 7,8-dihydro-8-oxoguanine-triphosphatase, prefers dGTP, causes AT-GC transversions
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAV
VRELQEEVGITPQHFSLFEKLEYEFPDRHITLWFWLVERWEGEPWGKEGQ
PGEWMSLVGLNADDFPPANEPVIAKLKRL
>b2961 mutY, adenine glycosylase; G.C--> T.A transversions
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP
YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLH
GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA
VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCS
LCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP
PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD
IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>b0714 nei, endonuclease VIII and DNA N-glycosylase with an AP lyase activity
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKPYQSQLIGQHVTHVET
RGKALLTHFSNDLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKT
ILLYSASDIEMLTPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFR
NRQFAGLLLDQAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAH
ALLEIPRFSYATRGQVDENKHHGALFRFKVFHRDGEPCERCGSIIEKTTL
SSRPFYWCPGCQH
>b3998 nfi, endonuclease V (deoxyinosine 3'endoduclease)
MIMDLASLRAQQIELASSVIREDRLDKDPPDLIAGADVGFEQGGEVTRAA
MVLLKYPSLELVEYKVARIATTMPYIPGFLSFREYPALLAAWEMLSQKPD
LVFVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSEPGA
LAPLMDKGEQLAWVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRL
PEPTRWADAVASERPAFVRYTANQP
>b2159 nfo, endonuclease IV
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTID
EFKAACEKYHYTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQ
LGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQ
GSNLGFKFEHLAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTF
ADFARTVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIGHDAFRWIMQD
DRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>b1548 nohA, DNA packaging protein NU1 homolog from lambdoid prophage Qin
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIRWY
AERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELK
NARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDF
LKRDIIKAMNKAAALDELIPGLLSEYNRADRQYAGGSRS
>b0560 nohB, bacteriophage DNA packaging protein
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWY
AERDAEIENEKLRREVEELLQASETDLQPGTIEYERHRLTRAQADAQELK
NARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDF
LKRDIIKAMNKAAALDELIPGLLSEYIEQSG
>b1633 nth, endonuclease III; specific for apurinic and/or apyrimidinic sites
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKA
TAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQH
NGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQF
APGKNVEQVEEKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIE
DLCEYKEKVDI
>b1865 ntpA, dATP pyrophosphohydrolase
MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAP
QAAMREVKEEVTIDVVAEQLTLIDCQRTVEFEIFSHLRHRYAPGVTRNTE
SWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA
>b3996 nudC, NADH pyrophosphatase
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQ
GEPVWLVQQQRRHDMGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGY
CGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHR
NGVHTVLAGFVEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSL
MTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIEDTVA
MCRAEYE
>b1759 nudG, CTP pyrophosphohydrolase
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVEPDESQRQALV
RELREELGIEATVGEYVASHQREVSGRIIHLHAWHVPDFHGTLQAHEHQA
LVWCSPEEALQYPLAPADIPLLEAFMALRAARPAD
>b1335 ogt, O-6-alkylguanine-DNA/cysteine-proteinmethyltrans ferase
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEG
YERISATNPGGLSDKLREYFAGNLSIIDTLPTATGGTPFQREVWKTLRTI
PCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMT
GYAGGVQRKEWLLRHEGYLLL
>b3019 parC, DNA topoisomerase IV subunit A
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMS
ELGLNASAKFKKSARTVGDVLGKYHPHGDSACYEAMVLMAQPFSYRYPLV
DGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGT
LQEPKMLPARLPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPK
TTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVRMRAVWKKEDG
AVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIV
PRSNRVDMDQVMNHLFATTDLEKSYRINLNMIGLDGRPAVKNLLEILSEW
LVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGI
LASERKMNNLLKKELQADAQAYGDDRRSPLQEREEAKAMSEHDMLPSEPV
TIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTG
RSYAIDPITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDA
GYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDASDMLLAITQAG
RMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTI
HVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRVEIDSPRRASSGDS
EE
>b3030 parE, DNA topoisomerase IV subunit B
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAG
HAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKF
SNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDL
QVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEI
TFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFAGDTEAVD
WALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNI
LPRGVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVV
KDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEV
SSDEVLASQEVHDISVAIGIDPDSDDLSQLRYGKICILADADSDGLHIAT
LLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQ
LKRKKGKPNVQRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTD
AMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>b0708 phrB, deoxyribodipyrimidine photolyase (photoreactivation)
MTTHLVWFRQDLRLHDNLALAAACRNSSARVLALYIATPRQWATHNMSPR
QAELINAQLNGLQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHL
FYNYQYEVNERARDVEVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKV
FTPFKNAWLKRLREGMPECVAAPKVRSSGSIEPSPSITLNYPRQSFDTAH
FPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSASLATGGLSPR
QCLHRLLAEQPQALDGGAGSVWLNELIWREFYRHLITYHPSLCKHRPFIA
WTDRVQWQSNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITAS
FLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDHEGEFIRQWLPELRDVPGKVVHEPWKWAQKAGVTLDYPQP
IVEHKEARVQTLAAYEAARKGK
>b1158 pin, inversion of adjacent DNA; at locus of e14 element
MLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGTKSERPGLKKLLR
TLSAGDTLVVWKLDRLGRSMRHLVVLVEELRERGINFRSLTDSIDTSTPM
GRFFFHVMGALAEMERELIVERTKAGLETARAQGRIGGRRPKLTPEQWAQ
AGRLIAAGTPRQKVAIIYDVGVSTLYKRFPAGDK
>b1545 pinQ, putative DNA-invertase from lambdoid prophage Qin
MSQIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSER
PGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLAL
GGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPP
VLNEEQKQAVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
>b1374 pinR, putative DNA-invertase from lambdoid prophage Rac
MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSER
PGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLAL
GGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPP
VLNEEQKQVVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
>b3863 polA, DNA polymerase I, 3'--> 5' polymerase, 5'--> 3' and 3'--> 5' exonuclease
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLI
MQYKPTHAAVVFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVK
AMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNIT
LINTMTNTILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKT
AQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNKEVAYLSYQLA
TIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK
GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAP
VFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILN
SVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAE
DADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPK
VLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK
KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINP
KTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYV
IVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLETVT
SEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQG
TAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI
HQLMENCTRLDVPLLVEVGSGENWDQAH
>b0060 polB, DNA polymerase II
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQV
PRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGG
VTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPL
KWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVAS
RPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRLPLRLGRDN
SELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQEL
LGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMP
FLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHS
TEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFY
GVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTF
VWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCR
FLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQ
ELYLRIFRNEPYQEYVRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVP
PHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYE
HYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>b3935 priA, primosomal protein N'(= factor Y)(putative helicase)
MPVAHVALPVPLPRTFDYLLPEGMTVKAGCRVRVPFGKQQERIGIVVSVS
DASELPLNELKAVVEVLDSEPVFTHSVWRLLLWAADYYHHPIGDVLFHAL
PILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKI
WRDQVATLEFNDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLN
TEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLENVLAQGKQALV
MVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIV
IGTRSALFTPFKNLGVIVIDEEHDSSYKQQEGWRYHARDLAVYRAHSEQI
PIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQADNQVILFLNRRGFAPALLCHDCGWIAECPRCDH
YYTLHQAQHHLRCHHCDSQRPVPRQCPSCGSTHLVPVGLGTEQLEQTLAP
LFPGVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPD
VTLVALLDVDGALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHH
PEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVRAEDHNNQHAP
LFLQQLRNLILSSPLADEKLWVLGPVPALAPKRGGRWRWQILLQHPSRVR
LQHIINGTLALINTIPDSRKVKWVLDVDPIEG
>b4201 priB, primosomal replication protein N
MTNRLVLSGTVCRAPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQM
PVIVSGHENQAITHSITVGSRITVQGFISCHKAKNGLSKMVLHAEQIELI
DSGD
>b0467 priC, primosomal replication protein N''
MKTALLLEKLEGQLATLRQRCAPVSQFATLSARFDRHLFQTRATTLQACL
DEAGDNLAALRHAVEQQQLPQVAWLAEHLAAQLEAIAREASAWSLREWDS
APPKIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVE
AYEARLARCRHALEKIENRLARLTR
>b3638 radC, DNA repair protein
MKVKNNSQLLMPREKMLKFGISALTDVELLALFLRTGTRGKDVLTLAKEM
LENFGSLYGLLTSEYEQFSGVHGIGVAKFAQLKGIAELARRYYNVRMREE
SPLLSPEMTREFLQSQLTGEEREIFMVIFLDSQHRVITHRRLFSGTLNHV
EVHPREIIREAIKINASALILAHNHPSGCAEPSKADKLITERIIKSCQFM
DLRVLDHIVIGRGEYVSFAERGWI
>b2699 recA, DNA strand exchange and renaturation, DNA-dependent ATPase, DNA-and ATP-dependent coprotease
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDI
ALGAGGLPMGRIVEIYGPESSGKTTLTLQVIAAAQREGKTCAFIDAEHAL
DPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAAL
TPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKI
GVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKN
KIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIG
QGKANATAWLKDNPETAKEIEKKVRELLLSNPNSTPDFSVDDSEGVAETN
EDF
>b2820 recB, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAF
PRPLTVEELLVVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERL
LEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQ
QLIEDESLLRYQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRY
LQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAVGELDALIESS
GIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKFSQRFLEDRTKA
GGETPRHPLFEAIDQLLAEPLSIRDLVITRALAEIRETVAREKRRRGELG
FDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSA
PGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQPAMK
MWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVR
ASDISVLVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLL
QAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDVVVEEFDGYRQ
IWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQ
LESEHALVRWLSQHILEPDSNASSQQMRLESDKHLVQIVTIHKSKGLEYP
LVWLPFITNFRVQEQAFYHDRHSFEAVLDLNAAPESVDLAEAERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDA
AGLRTCIEALCDDDIAWQTAQTGDNQPWQVNDVSTAELNAKTLQRLPGDN
WRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVASVVEEPTLTPHQFPRGAS
PGTFLHSLFEDLDFTQPVDPNWVREKLELGGFESQWEPVLTEWITAVLQA
PLNETGVSLSQLSARNKQVEMEFYLPISEPLIASQLDTLIRQFDPLSAGC
PPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAA
AMQAHRYDLQYQLYTLALHRYLRHRIADYDYEHHFGGVIYLFLRGVDKEH
PQQGIYTTRPNAGLIALMDEMFAGMTLEEA
>b2822 recC, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTL
SQKFGIAANIDFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLP
QLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWE
TGHLVEGLGEAQAWQAPLWKALVEYTHQLGQPRWHRANLYQRFIETLESA
TTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLFTNPCRYYWGD
IKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLL
ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAV
AGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPV
LEAFISLLSLPDSRFVSEDVLALLDVPVLAARFDITEEGLRYLRQWVNES
GIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPY
DESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFL
PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERI
SQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQK
PKRGDRSRRDDDRYLFLEALISAQQKLYISYIGRSIQDNSERFPSVLVQE
LIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQ
MRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR
FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGV
QITGWLPQVQPDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFL
RKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPESGGAWLKTCY
DAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPE
TMEAIVEQSQRFLLPLFRFNQS
>b2819 recD, DNA helicase, ATP-dependent dsDNA/ssDNA exonuclease V subunit, ssDNA endonuclease
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGH
VCLPLSRLENNEASHPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPM
ILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPVSDE
INWQKVAAAVALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIR
LAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLHRLLGAQPGS
QRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQ
LASVEAGAVLGDICAYANAGFTAERARQLSRLTGTHVPAGTGTEAASLRD
SLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFG
VAGLNERIEQFMQQKRKIHRHPHSRWYEGRPVMIARNDSALGLFNGDIGI
ALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAA
LILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGL
AALFSSRE
>b3700 recF, ssDNA and dsDNA binding, ATP binding
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGR
AFRSLQIGRVIRHEQEAFVLHGRLQGEERETAIGLTKDKQGDSKVRIDGT
DGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTA
WSNLKRLLKQRNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIA
ADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFERDRQLTYTAHG
PHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYL
IDDFASELDDERRGLLASRLKATQSQVFVSAISAEHVIDMSDENSKMFTV
EKGKITD
>b3652 recG, DNA helicase, resolution of Holliday junctions, branch migration
MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHL
YPIGELLPGVYATVEGEVLNCNISFGGRRMMTCQISDGSGILTMRFFNFS
AAMKNSLAAGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTP
VYPTTEGVKQATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRT
LHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALRAGAQRFHAQP
LSANDTLKNKLLAALPFKPTGAQARVVAEIERDMALDVPMMRLVQGDVGS
GKTLVAALAALRAIAHGKQVALMAPTELLAEQHANNFRNWFAPLGIEVGW
LAGKQKGKARLAQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH
RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDE
LPPGRTPVTTVAIPDTRRTDIIDRVHHACITEGRQAYWVCTLIEESELLE
AQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVAT
TVIEVGVDVPNASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKT
PLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQTGNAEFKVAD
LLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA
>b2892 recJ, ssDNA exonuclease, 5'--> 3' specific
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLP
WQQLSGVEKAVEILYNAFREGTRIIVVGDFDADGATSTALSVLAMRSLGC
SNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHAR
SLGIPVIVTDHHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLML
ALRTFLRDQGWFDERNIAIPNLAELLDLVALGTVADVVPLDANNRILTWQ
GMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDM
SVGVALLLCDNIGEARVLANELDALNQTRKEIEQGMQIEALTLCEKLERS
RDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFKLFQQRFG
ELVTEWLDPSLLQGEVVSDGPLSPAEMTMEVAQLLRDAGPWGQMFPEPLF
DGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREV
QLAYKLDINEFRGNRSLQIIIDNIWPI
>b2616 recN, protein used in recombination and DNA repair
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRA
EADMVRTGAARADLCARFSLKDTPAALRWLEENQLEDGHECLLRRVISSD
GRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYA
NETSLLQEMTARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEF
NPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDANLQSQLYTAK
QLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFE
LEQRISKQISLARKHHVSPEALPQYYQSLLEEQQQLDDQADSQETLALAV
TKHHQQALEIARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKFD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARK
METPALIFDEVDVGISGPTAAVVGKLLRQLGESTQVMCVTHLPQVAGCGH
QHYFVSKETDGAMTETHMQSLNKKARLQELARLLGGSEVTRNTLANAKEL
LAA
>b2565 recO, protein interacts with RecR and possibly RecF proteins
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGAL
QPFTPLLLRFGGRGEVKTLRSAEAVSLALPLSGITLYSGLYINELLSRVL
EYETRFSELFFDYLHCIQSLAGVTGTPEPALRRFELALLGHLGYGVNFTH
CAGSGEPVDDTMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDAD
TLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTHYE
>b3822 recQ, ATP-dependent DNA helicase
MAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMPTG
GGKSLCYQIPALLLNGLTVVVSPLISLMKDQVDQLQANGVAAACLNSTQT
REQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEAHC
ISQWGHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGLND
PLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGIIYCNSRAKVE
DTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGMGI
NKPNVRFVVHFDIPRNIESYYQETGRAGRDGLPAEAMLFYDPADMAWLRR
CLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCGNC
DICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIRDY
GHDKLKVYGMGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLTEAARP
VLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIADES
NVPPYVVFNDATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMALIR
AHVDGDDEE
>b0472 recR, recombination and repair
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRA
MSEIGHCADCRTFTEQEVCNICSNPRRQENGQICVVESPADIYAIEQTGQ
FSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEG
EATANYIAELCAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIR
F
>b1349 recT, recombinase, DNA renaturation
MTKQPPIAKADLQKTQGNRAPAAVKNSDVISFINQPSMKEQLAAALPRHM
TAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYL
LPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFSF
EFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSL
SKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTID
PADSSVLTGEYSVIDNSEE
>b1564 relB, negative regulator of translation
MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTL
LSDEDAELVEIVKERLRNPKPVRVTLDEL
>b3778 rep, rep helicase, a single-stranded DNA dependent ATPase
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHI
AAVTFTNKAAREMKERVGQTLGRKEARGLMISTFHTLGLDIIKREYAALG
MKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPS
QAAASAIGERDRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEV
RKRWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVGDDDQSIYSW
RGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEK
RLFSELGYGAELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYR
GNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGY
EALTRFTHWLAEIQRLAEREPIAAVRDLIHGMDYESWLYETSPSPKAAEM
RMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELD
QVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGI
TRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIWEQERKVVSAE
ERMQKGQSHLANLKAMMAAKRGK
>b3630 rfaP, lipopolysaccharide core biosynthesis; phosphorylation of core heptose; attaches phosphate-containing substrate to LPS co
MVELKEPLATLWRGKDAFAEVKKLNGEVFRELETRRTLRFELSGKSYFLK
WHKGTTLKEIIKNLLSLRMPVLGADREWHAIHRLSDVGVDTMKGIGFGEK
GLNPLTRASFIITEDLTPTISLEDYCADWAVNPPDIRVKRMLIARVATMV
RKMHTAGINHRDCYICHFLLHLPFTGREDELKISVIDLHRAQIRAKVPRR
WRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFGMPLRKILSLEQNLLNMAS
VKAERIKERTQRKGL
>b3624 rfaZ, lipopolysaccharide core biosynthesis
MKNIRYIDKKDVENLIENKISDDVIIFLSGPTSQKTPLSVLRTKDIIAVN
GSAQYLLSNNIVPFIYVLTDVRFLHQRRDDFYKFSQRSRYTIVNVDVYEH
ASKEDKLYILQNCLVLRSFYRREKGGFIKKIKFNILRQIHKELLISVPLS
KKGRLVGFCKDISLGYCSCHTIAFAAIQIAYSLKYARIICSGLDLTGSCS
RFYDENKNPMPSELSRDLFKILPFFRFMHDNVKDINIYNLSDDTAISYDV
IPFIKLQDISAEESKDMTRKKMQYRTSTDSYAN
>b3780 rhlB, putative ATP-dependent RNA helicase
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVA
GQAQTGTGKTMAFLTSTFHYLLSHPAIADRKVNQPRALIMAPTRELAVQI
HADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAK
QNHINLGAIQVVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSAT
LSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSNEEKMRLLQTL
IEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILD
EFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRA
GASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>b0797 rhlE, putative ATP-dependent RNA helicase
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTG
KTAGFTLPLLQHLITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSK
YLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQV
EILVLDEADRMLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEK
LLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMIGKGNWQQVLV
FTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV
ATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC
VDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG
QGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRKP
AAAQ
>b0214 rnhA, RNase HI, degrades RNA of DNA-RNA hybrids, participates in DNA replication
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELM
AAIVALEALKEHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKN
VDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTG
YQVEV
>b0183 rnhB, RNAse HII, degrades RNA of DNA-RNA hybrids
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLS
EKRRLALYEEIKEKALSWSLGRAEPHEIDELNILHATMLAMQRAVAGLHI
APEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAAL
DIVFPQYGFAQHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS
>b1652 rnt, RNase T, degrades tRNA
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQG
WLMPDTTLHFHVEPFVGANLQPEALAFNGIDPNDPDRGAVSEYEALHEIF
KVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFD
TAALAGLALGQTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNR
WKRLGGWPLSAAEEV
>b0550 rus, endodeoxyribonuclease RUS (Holliday junction resolvase)
MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIG
LAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVV
KMPVTKGGRLELTITEMGNE
>b1861 ruvA, Holliday junction helicase subunit B; branch migration; repair
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTH
FVVREDAQLLYGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVN
AVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTS
PASPATDDAEQEAVAALVALGYKPQEASRMVSKIARPDASSETLIREALR
AAL
>b1860 ruvB, Holliday junction helicase subunit A; branch migration; repair
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKL
RGDALDHLLIFGPPGLGKTTLANIVANEMGVNLRTTSGPVLEKAGDLAAM
LTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKI
DLPPFTLIGATTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFM
GLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDGTISADIAAQA
LDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDV
LEPYLIQQGFLQRTPRGRMATTRAWNHFGITPPEMP
>b1863 ruvC, Holliday junction nuclease; resolution of structures; repair
MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYA
GVTEIITQFQPDYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVF
EYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITH
CHVSQNAMQMSESRLNLARGRLR
>b2011 sbcB, exonuclease I, 3'--> 5' specific; deoxyribophosphodiesterase
MMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAIRTDSEFNVIGEPEVFY
CKPADDYLPQPGAVLITGITPQEARAKGENEAAFAARIHSLFTVPKTCIL
GYNNVRFDDEVTRNIFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPE
GINWPENDDGLPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTR
QPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWRGNTSWVAPLA
WHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKTDLGDNAAVPVKL
VHINKCPVLAQANTLRPEDADRLGINRQHCLDNLKILRENPQVREKVVAI
FAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVD
KRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYADELQML
VQQYADDKEKVALLKALWQYAEEIV
>b0397 sbcC, ATP-dependent dsDNA exonuclease
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAI
CLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNR
ARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRS
MLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTEL
EKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLT
RQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIA
EHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT
WLQEHDRFRQWNNEPAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLTLTADEVATALAQHAEQRPLRQHLVALHGQIVPQQKRLAQLQVAIQ
NVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQL
QAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGATLR
GQLDAITKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPLDDIQP
WLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQLLLTTL
TGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILET
LPQSDELPHCEETVVLENWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKA
QAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIR
QQLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQ
GLTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRT
LSGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDA
LDALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK
>b0398 sbcD, ATP-dependent dsDNA exonuclease
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVF
DTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFL
NTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQ
QHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIY
IGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECG
KSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVS
QEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLA
SQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA
>b2009 sbmC, SbmC protein
MNYEIKQEEKRTVAGFHLVGPWEQTVKKGFEQLMMWVDSKNIVPKEWVAV
YYDNPDETPAEKLRCDTVVTVPGYFTLPENSEGVILTEITGGQYAVAVAR
VVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYV
AVQPKHH
>b0687 seqA, negative modulator of initiation of replication
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVR
VASPAIVEAKPVKTIKDKVRAMRELLLSDEYAEQKRAVNRFMLLLSTLYS
LDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITN
TNTGRKCSMIEHIMQSMQFPAELIEKVCGTI
>b4473 smf, orf, hypothetical protein
MVDTEIWLRLMSISSLYGDDMVRIAHWVAKQSHIDAVVLQQTGLTLRQAQ
RFLSFPRKSIESSLCWLEQPNHHLIPADSEFYPPQLLATTDYPGALFVEG
ELHALHSFQLAVVGSRAHSWYGERWGRLFCETLATRGVTITSGLARGIDG
VAHKAALQVNGVSIAVLGNGLNTIHPRRHARLAASLLEQGGALVSEFPLD
VPPLAYNFPRRNRIISGLSKGVLVVEAALRSGSLVTARCALEQGREVFAL
PGPIGNPGSEGPHWLIKQGAILVTEPEEILENLQFGLHWLPDAPENSFYS
PDQQDVALPFPELLANVGDEVTPVDVVAERAGQPVPEVVTQLLELELAGW
IAAVPGGYVRLRRACHVRRTNVFV
>b2576 srmB, ATP-dependent RNA helicase
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPT
GTGKTAAYLLPALQHLLDFPRKKSGPPRILILTPTRELAMQVSDHARELA
KHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRA
VETLILDEADRMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFA
ERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLVHLLKQPEATR
SIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVN
VLVATDVAARGIDIPDVSHVFNFDMPRSGDTYLHRIGRTARAGRKGTAIS
LVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>b4059 ssb, ssDNA-binding protein
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMK
EQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTT
EVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG
GAQSRPQQSAPAAPSNEPPMDFDDDIPF
>b3558 t150, IS150 putative transposase
MKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRISEI
YHENRGRYGYRRVTLSLHREGKQINHKAVQRLMGTLSLKAAIKVKRYRSY
RGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNN
EVISYSLSERPVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQN
ILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEFSNISELKDAV
TEYIEYYNSRRISLKLKGLTPIEYRNQTYMPRV
>b3549 tag, 3-methyl-adenine DNA glycosylase I, constitutive
MERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVL
KKRENYRACFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNAR
AYLQMEQNGEPFVDFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALK
KRGFKFVGTTICYSFMQACGLVNDHVVGCCCYPGNKP
>b4483 tatD, cytoplasmic Dnase
MFDIGVNLTSSQFAKDRDDVVACAFDAGVNGLLITGTNLRESQQAQKLAR
QYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDFNRNF
STPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKLPGAV
LHCFTGTREEMQACVAHGIYIGITGWVCDERRGLELRELLPLIPAEKLLI
ETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAATTDA
NVKTLFGIAF
>b1274 topA, DNA topoisomerase type I, omega protein
MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADS
TSTKTAKKPKKDERGALVNRMGVDPWHNWEAHYEVLPGKEKVVSELKQLA
EKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQA
FNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVA
VRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVN
KEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFG
VKKTMMMAQRLYEAGYITYMRTDSTNLSQDAVNMVRGYISDNFGKKYLPE
SPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRIL
PAVNKGDALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASI
ISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSL
DQVANHEAEWKAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCP
TCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPENEVLNVLEGED
AETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFR
IKGYDGPIVECEKCGSEMHLKMGRFGKYMACTNEECKNTRKILRNGEVAP
PKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATG
WSAFYVDGKWVEGKK
>b1763 topB, DNA topoisomerase III
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQ
PDAYDSRYARWNLADLPIVPEKWQLQPRPSVTKQLNVIKRFLHEASEIVH
AGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLR
SNSEFVPLCVSALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQT
PVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIWQPSEACEPYQ
DEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE
AAKRFGLSAQNVLDICQKLYETHKLITYPRSDCRYLPEEHFAGRHAVMNA
ISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLIARQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLL
GSKERDEENDGTPLPVVAKGDELLCEKGEVVERQTQPPRHFTDATLLSAM
TGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHS
TDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY
QLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKKSPPADEVGSG
AIA
>b0372 tra5_1, transposase insF for insertion sequence IS3
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA
>b0541 tra5_2, transposase insF for insertion sequence IS3
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA
>b1026 tra5_3, transposase insF for insertion sequence IS3
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA
>b2089 tra5_4, transposase insF for insertion sequence IS3
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA
>b0299 tra5_5, transposase insF for insertion sequence IS3
MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCD
SVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASR
KFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAV
VIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQY
CSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISR
EIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA
>b0256 tra8_1, IS30 transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRG
RRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISG
WLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRR
HTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATL
VDRKSRYTIIVRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELAR
HLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHE
LDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD
>b1404 tra8_2, IS30 transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRG
RRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISG
WLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRR
HTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATL
VDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELAR
HLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHE
LDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD
>b4284 tra8_3, IS30 transposase
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPH
ERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRG
RRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISG
WLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRR
HTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATL
VDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELAR
HLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHE
LDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD
>b0259 trs5_1, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b3218 trs5_10, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b3505 trs5_11, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b0552 trs5_2, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b0656 trs5_3, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b1331 trs5_4, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b1370 trs5_5, IS5Y transposase
MSHQLTFADSEFSTKRRQTRKEIFLSRMEQILPWQNMTAVIEPFYPKAGN
GRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALP
DRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAP
SSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANE
HDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVKT
LKQNPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKND
NQLAMLFTLANLFRVDQMIRQWERSQ
>b1994 trs5_6, IS5 transposase
MFVIWSHGTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRAKVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b2030 trs5_7, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b2192 trs5_8, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b2982 trs5_9, IS5 transposase
MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMV
EVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMR
LFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQ
GTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSG
LTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD
WLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGF
VKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH
>b1184 umuC, SOS mutagenesis and repair
MFALCDVNAFYASCETVFRPDLWGKPVVVLSNNDGCVIARNAEAKALGVK
MGDPWFKQKDLFRRCGVVCFSSNYELYADMSNRVMSTLEELSPRVEIYSI
DEAFCDLTGVRNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLAN
HAAKKWQRQTGGVVDLSNLERQRKLMSALPVDDVWGIGRRISKKLDAMGI
KTVLDLADTDIRFIRKHFNVVLERTVRELRGEPCLQLEEFAPTKQEIICS
RSFGERITDYPSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFALN
EPYYGNSASVKLLTPTQDSRDIINAATRSLDAIWQAGHRYQKAGVMLGDF
FSQGVAQLNLFDDNAPRPGSEQLMTVMDTLNAKEGRGTLYFAGQGIQQQW
QMKRAMLSPRYTTRSSDLLRVK
>b2580 ung, uracil-DNA-glycosylase
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRF
TELGDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTI
PGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVIS
LINQHREGVVFLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGC
NHFVLANQWLEQRGETPIDWMPVLPAESE
>b4058 uvrA, excision nuclease subunit A
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHIEGLSPAISIEQKSTSHNPRSTVGT
ITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLML
LAPIIKERKGEHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTI
EVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDPKAEELLFSAN
FACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPEL
SLAGGAIRGWDRRNFYYFQMLKSLADHYKFDVEAPWGSLSANVHKVVLYG
SGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAG
QRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQ
IGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDA
IRAADHVIDIGPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEV
PKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLI
NDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSN
PATYTGVFTPVRELFAGVPESRARGYTPGRFSFNVRGGRCEACQGDGVIK
VEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTG
QTLYILDEPTTGLHFADIQQLLDVLHKLRDQGNTIVVIEHNLDVIKTADW
IVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>b0779 uvrB, DNA repair; excision nuclease subunit B
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIA
NVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGD
PDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVI
DIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYV
TPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNEL
GYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVL
VTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKA
ILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALG
QNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNL
EFEEAAQIRDQLHQLRELFIAAS
>b1913 uvrC, excinuclease ABC, subunit C; repair of UV damage to DNA
MYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTH
TETEALLLEHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAK
HAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYRNRSRPCLQYQIG
RCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLE
FEEAARIRDQIQAVRRVTEKQFVSNTGDDLDVIGVAFDAGMACVHVLFIR
QGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLDFN
LSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQ
QSTVHQRLTALASVLKLPEVKRMECFDISHTMGEQTVASCVVFDANGPLR
AEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQL
AQAKNVFAELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPP
DSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLETIEGVGPKRRQML
LKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>b3813 uvrD, DNA-dependent ATPase I and helicase II
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVE
NCSPYSIMAVTFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRA
HHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQK
DEGLRPHHIQSYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELW
LNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDTGKVMIVGDDD
QSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN
NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAEC
AILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKAL
AGRAASALQRFMELIDALAQETADMPLHVQTDRVIKDSGLRTMYEQEKGE
KGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTW
QDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAY
VGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATV
SRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQV
AFQGQGIKWLVAAYARLESV
>b1960 vsr, DNA mismatch endonuclease, patch repair protein
MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRP
DFVVDEYRCVIFTHGCFWHHHHCYLFKVPATRTEFWLEKIGKNVERDRRD
ISRLQELGWRVLIVWECALRGREKLTDEALTERLEEWICGEGASAQIDTQ
GIHLLA
>b2051 wcaH, GDP-mannose mannosyl hydrolase
MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPG
GRVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFT
THYVVLGFRFRVSEEELLLPDEQHDDYRWLTSDALLASDNVHANSRAYFL
AEKRTGVPGL
>b3811 xerC, site-specific recombinase, acts on cer sequence of ColE1, effects chromosome segregation at cell division
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQ
CDVTMVRNFAVRSRRKGLGAASLALRLSALRSFFDWLVSQNELKANPAKG
VSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLS
ELVGLDIKHLDLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFG
SEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPHKLRHSFATHM
LESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>b2894 xerD, site-specific recombinase
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATA
QSDDLQALLAERLEGGYKATSSARLLSAVRRLFQYLYREKFREDDPSAHL
ASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVS
ELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWL
LNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAF
ATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>b2509 xseA, exonuclease VII, large subunit
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFT
LKDDTAQVRCAMFRNSNRRVTFRPQHGQQVLVRANITLYEPRGDYQIIVE
SMQPAGEGLLQQKYEQLKAKLQAEGLFDQQYKKPLPSPAHCVGVITSKTG
AALHDILHVLKRRDPSLPVIIYPAAVQGDDAPGQIVRAIELANQRNECDV
LIVGRGGGSLEDLWSFNDERVARAIFTSRIPVVSAVGHETDVTIADFVAD
LRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIH
HRLQQQHPQLRLARQQTMLERLQKRMSFALENQLKRTGQQQQRLTQRLNQ
QNPQPKIHRAQTRIQQLEYRLAETLRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGNVLKKVKQVKAGEMLTTRLEDGWIESEVKNIQPVKK
SRKKVH
>b0422 xseB, exonuclease VII, small subunit
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQ
AKLQQAEQRVQILLSDNEDASLTPFTPDNE
>b1749 xthA, exonuclease III
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKL
GYNVFYHGQKGHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLL
GNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMG
DMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFR
HANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAECCVETGIDYEI
RSMEKPSDHAPVWATFRR
>b0228 yafM, orf, hypothetical protein
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEI
NAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWE
HAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDW
AGDVTDFSAGERIIS
>b0267 yagA, orf, hypothetical protein
MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQR
WAQEGAAGLQDRPRIPHHSPNRSSDDITALLRMAHDRHERWGARKIKRWL
EDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFK
GHFPFGGGRCHPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGL
PDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYHPQTQGKLERF
HRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQ
PSARQYSGNTTPPEYDEGVMVRKVDISGKLSVKGVSLSAGKAFRGERVGL
KEMQEDGSYEVWWYSTKVGVIDLKKKSITMGKGC
>b0278 yagL, DNA-binding protein
MRGKILLYQLKYRWQSLSIFGCFLCKMTLFRYQKIIYDTGVHQMRSFFYT
ICSSEQQESITDHHSLAEICQKFNILPEHVVIEQVDIKEVVSEQRLLRQL
IHHEMNRQDTLVIPDLSCLGRTVEDLQNILFFCLQKEMFIYSYHPASRIE
PSAESCLSFLIARQDTIDIHNLKSTKSRYRHVKKKLGRKEGSKYRRDITI
LKKGGFTQAEIAKKLSISLSTVKRHWNNGIIG
>b0393 yaiD, orf, hypothetical protein
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMG
SHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLK
KTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDT
LALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSL
LEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSL
KRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGE
AQR
>b0442 ybaV, orf, hypothetical protein
MKHGIKALLITLSLACAGMSHSALAAASVAKPTAVETKAEAPAAQSKAAV
PAKASDEEGTRVSINNASAEELARAMNGVGLKKAQAIVSYREEYGPFKTV
EDLKQVPGMGNSLVERNLAVLTL
>b0454 ybaZ, orf, hypothetical protein
MLVSCAMRLHSGVFPDYAEKLPQEEKMEKEDSFPQRVWQIVAAIPEGYVT
TYGDVAKLAGSPRAARQVGGVLKRLPEGSTLPWHRVVNRHGTISLTGPDL
QRQRQALLAEGVMVSGSGQIDLQRYRWNY
>b0544 ybcK, orf, hypothetical protein
MKKAIAYMRFSSPGQMSGDSLNRQRRLIAEWLKVNSDYYLDTITYEDLGL
SAFKGKHAQSGAFSEFLDAIEHGYILPGTTLLVESLDRLSREKVGEAIER
LKLILNHGIDVITLCDNTVYNIDSLNEPYSLIKAILIAQRANEESEIKSS
RVKLSWKKKRQDALESGTIMTASCPRWLSLDDKRTAFVPDPDRVKTIELI
FKLRMERRSLNAIAKYLNDHAVKNFSGKESAWGPSVIEKLLANKALIGIC
VPSYRARGKGISEIAGYYPRVISDDLFYAVQEIRLAPFGISNSSKNPMLI
NLLRTVMKCEACGNTMIVHAVSGSLHGYYVCPMRRLHRCDRPSIKRDLVD
YNIINELLFNCSKIQPVENKKDANETLELKIIELQMKINNLIVALSVAPE
VTAIAEKIRLLDKELRRASVSLKTLKSKGVNSFSDFYAIDLTSKNGRELC
RTLAYKTFEKIIINTDNKTCDIYFMNGIVFKHYPLMKVISAQQAISALKY
MVDGEIYF
>b0706 ybfD, putative DNA ligase
MELKKLMEHISIIPDYRQAWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHPDFLKQYGDFENGIPVHDTIARVVSCICPAKFHESFINWMLDYHSS
DDKDVIAIDGKIHRHSYDKSRRKGAIHVISAFSTMHSLVIGQIKTDKKSN
EITAIPELLNMLDIKGKIIKTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPKHDSYAISEKSHGREETRLHIVCDVPDELIDF
TFE
>b0876 ybjD, orf, hypothetical protein
MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPES
DLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACW
TPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHL
VRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNL
SDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDII
NRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLH
PIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWR
LGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQC
GHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATV
RSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRMAQIPENVPMNL
RKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGR
AD
>b0892 ycaJ, putative polynucleotide enzyme
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHL
HSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQ
NRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELN
SALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAI
AELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEIAGERSARFDN
KGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIAS
EDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVY
TAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>b1100 ycfH, orf, hypothetical protein
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHM
RDLVGERDNVVFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYY
YTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDC
GGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRL
LVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVEELAQVTTDNF
ARLFHIDASRLQSIR
>b1360 ydaV, putative DNA replication factor
MKNIATGDVLERIRRLAPSHVTAPFKTVAEWREWQLSEGQKRCEEINRQN
RQLRVEKILNRSGIQPLHRKCSFSNYQVQNEGQRYALSQAKSIADELMTG
CTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDD
GQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVG
MLTNLNYEAMKTLLGERIMDRMTMNGGRWVNFNWESWRPNVVQPGIAK
>b1460 ydcC, H repeat-associated protein (ORF-H)
MELKKLMGHISIIPDYRQAWKMEHKLSDILLLTICAVISGAEGWEDIEDF
GETHPDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSN
EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQG
RLNKAFEEKFPLKELNNPAHDSYAMSEKSHGREEIRLHIVCDVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKELEMTVRYYISSADLTAEKFATAIR
NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVLTGSGLS
>b1432 ydcM, orf, hypothetical protein
MKRLQAFKFQLRPGGQQECEMRRFAGACRFVFNRALARQNENHEAGNKYI
PYGKMASWLVEWKNATETQWLKDSPSQPLQQSLKDLERAYKNFFRKRAAF
PRFKKRGQNDAFRYPQGVKLDQENSRIFLPKLGWMRYRNSRQVTGVVKNV
TVSQSCGKWYISIQTESEVSTPVHPSASMVGLDAGVAKLATLSDGTVFEP
VNSFQKNQKKLARLQRQLSRKVKFSNNWQKQKRKIQRLHSCIANIRRDYL
HKVTTAVSKNHAMIVIEDLKVSNMSKSAAGTVSQPGRNVRAKSGLNRSIL
DQGWYEMRRQLAYKQLWRGGQVLAVPPAYTSQRCAYCGHTAKENRLSQSK
FRCQVCGYTANADVNGARNILAAGHAVLACGEMVQSGRPLKQEPTEMIQA
TA
>b1741 ydjQ, putative excinuclease subunit
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLY
IGKSVNIRSRVLSHLRTPDEAAMLRQSRRISWICTAGEIGALLLEARLIK
EQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRR
AALQALQTIADEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHE
EHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQNWLWLGAVNS
LEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>b1813 yeaB, orf, hypothetical protein
MEYRSLTLDDFLSRFQLLRPQINRETLNHRQAAVLIPIVRRPQPGLLLTQ
RSIHLRKHAGQVAFPGGAVDDTDASAIAAALREAEEEVAIPPSAVEVIGV
LPPVDSVTGYQVTPVVGIIPPDLPYRASEDEVSAVFEMPLAQALHLGRYH
PLDIYRRGDSHRVWLSWYEQYFVWGMTAGIIRELALQIGVKP
>b2002 yeeS, putative DNA repair protein, RADC family
MTPGERSLIQRALKTLDRHLHEPGVAFTSTRAAREWLILNMAGLEREEFR
VLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALYHNAAAVVLAHNHP
SGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGNQVFSFAEHGLL
>b2184 yejH, putative ATP-dependent helicase
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARG
RVLVLAHVKELVAQNHAKYQALGLEADIFAAGLKRKESHGKVVFGSVQSV
ARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLT
ATPFRLGKGWIYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPE
RLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPHIISQIMEFAA
TRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIENFKAQR
FRYLVNVAVLTTGFDAPHVDLIAILRPTESVSLYQQIVGRGLRLAPGKTD
CLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRE
CDTVLVDPDDMLKAALRLKDALVLRCSGMSLQHGHDEKGEWLKITYYDED
GADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALL
RHPDFVVARMKGQYWQVREKVFDYEGRFRLAHELRG
>b2251 yfaO, orf, hypothetical protein
MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWAISGGGVEPGERIEEAL
RREIREELGEQLLLTEITPWTFSDDIRTKTYADGRKEEIYMIYLIFDCVS
ANREVKINEEFQDYAWVKPEDLVHYDLNVATRKTLRLKGLL
>b2467 yffH, orf, hypothetical protein
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATI
LLYNTKKKTVVLIRQFRVATWVNGNESGQLIESCAGLLDNDEPEVCIRKE
AIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVED
EDIEVLELPFSQALEMIKTGEIRDGKTVLLLNYLQTSHLMD
>b2510 yfgJ, orf, hypothetical protein
MNVEGMATGGIHMELHCPQCQHVLDQDNGHARCRSCGEFIEMKALCPDCH
QPLQVLKACGAVDYFCQHGHGLISKKRVEFVLA
>b2644 yfjY, putative DNA repair protein
MMEQSLIPQTPVLPLTAQRTVKRALTLLDRHLRETGVAFTSTQAARDWLK
LKMAGLEREEFMMLYLNQQNQLIAHETLFAGSISSTEVHPREVVKRALYF
NAAAVILAHNHPSGDTTPSQADKTITQRLVQALQLVDIRVPDHLIVGGRQ
IYSFAEHGLL
>b2755 ygbT, orf, hypothetical protein
MTWLPLNPIPLKDRVSMIFLQYGQIDVIDGAFVLIDKTGIRTHIPVGSVA
CIMLEPGTRVSHAAVRLAAQVGTLLVWVGEAGVRVYASGQPGGARSDKLL
YQAKLALDEDLRLKVVRKMFELRFGEPAPARRSVEQLRGIEGSRVRATYA
LLAKQYGVTWNGRRYDPKDWEKGDTINQCISAATSCLYGVTEAAILAAGY
APAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLA
CRDIFRSSKTLAKLIPLIEDVLAAGEIQPPAPPEDAQPVAIPLPVSLGDA
GHRSS
>b2830 ygdP, putative invasion protein
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAM
YRELFEEVGLSRKDVRILASTRNWLRYKLPKRLVRWDTKPVCIGQKQKWF
LLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKE
FASVVMSLQENTPKPQNASAYRRKRG
>b3068 ygjF, orf, hypothetical protein
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ
LKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ
PQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLE
KLVEAYRELDQALVVRGR
>b3155 yhbQ, orf, hypothetical protein
MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLVF
SAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD
>b3262 yhdJ, putative methyltransferase
MTMRTGCEPTRFGNEAKTIIHGDALAELKKIPAESVDLIFADPPYNIGKN
FDGLIEAWKEDLFIDWLFEVIAECHRVLKKQGSMYIMNSTENMPFIDLQC
RKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAIL
VEAKTGSQRALIDYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQ
KPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFIGIEINSEY
IKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLITK
>b3465 yhhF, orf, hypothetical protein
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPV
IVDAQCLDCFAGSGALGLEALSRYAAGATLIEMDRAVSQQLIKNLATLKA
GNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLAD
EALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>b3484 yhhI, putative receptor
MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDF
GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS
DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSN
EITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQG
RLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDF
TFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR
NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF
KAGLRRKMRKAAMDRNYLASVLAGSGLS
>b0360 yi21_1, IS2 hypothetical protein
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>b1403 yi21_2, IS2 hypothetical protein
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>b1997 yi21_3, IS2 hypothetical protein
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>b2861 yi21_4, IS2 hypothetical protein
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>b3044 yi21_5, IS2 hypothetical protein
MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWR
KQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEA
VEYGRAKKWIAHAPLLPGDGE
>b4272 yi21_6, IS2 hypothetical protein
MIVLILVFRLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQR
LLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE
>b0361 yi22_1, IS2 hypothetical protein
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>b1402 yi22_2, IS2 hypothetical protein
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>b1996 yi22_3, IS2 hypothetical protein
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>b2860 yi22_4, IS2 hypothetical protein
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>b3045 yi22_5, IS2 hypothetical protein
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>b4273 yi22_6, IS2 hypothetical protein
MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDT
DVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNA
LLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFAL
DCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNG
SCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPK
PDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLE
I
>b4278 yi41, IS4 hypothetical protein
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQT
GDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGD
HLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPG
GEMGDLYSHRWEIELGYREIKQTMQRSRLTLRSKKPELVEQELWGVLLAY
NLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELM
RDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA
>b3557 yi5A, IS150 hypothetical protein
MSKPKYPFEKRLEVVNHYFTTDDGYRIISARFGVPRTQVRTWVALYEKHG
EKGLIPKPKGVSADPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARW
LKVYEERGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQ
VRFLETRLMYLKKLKALAHPTKK
>b0016 yi81_1, IS186 hypothetical protein
MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY
GPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRA
AVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDS
RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLR
WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS
LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ
VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID
DIIQPSLDFPPRSAGSEKKN
>b0582 yi81_2, IS186 hypothetical protein
MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAY
GPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRA
AVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDS
RDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLR
WLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVS
LPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQ
VADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLID
DIIQPSLDFPPRSAGSEKKN
>b2394 yi81_3, putative transposase insL for insertion sequence IS186
MPMNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGL
AYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAV
RAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELT
DSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRG
LRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIA
VSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSA
EQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFL
IDDIIQPSLDFPPRSAGSEKKN
>b0255 yi91a, transposase insN for insertion sequence element IS911A
MICSPQNNTGVIMKKRNFSAEFKRESAQLVVDQNYTVADAASAMDVGLST
MTRWVKQLRDERQGKTPKASPITPEQIEIRELRKKLQRIEMENEILKKAT
VDSIGQRNSYVKTWGCGGFLNETNIYSRGKSLCF
>b4283 yi91b, transposase insN for insertion sequence element IS911B
MICSPQNNTGAPMKKRNFSAEFKRESAQLVVDQKYTVADAAKAMDVGLST
MTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKRLL
>b3647 yicF, putative enzyme
MMMKVWMAILIGILCWQSSVWAVCPAWSPARAQEEISRLQQQIKQWDDDY
WKEGKSEVEDGVYDQLSARLTQWQRCFGSEPRDVMMPPLNGAVMHPVAHT
GVRKMVDKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNG
LKGEDWTQKVSLISAVPQTVSGPLANSTLQGEIFLQREGHIQQQMGGINA
RAKVAGLMMRQDDSDTLNSLGVFVWAWPDGPQLMSDRLKELATAGFTLTQ
TYTRAVKNADEVARVRNEWWKAELPFVTDGVVVRAAKEPESRHWLPGQAE
WLVAWKYQPVAQVAEVKAIQFAVGKSGKISVVASLAPVMLDDKKVQRVNI
GSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFN
SLTCYFASDVCQEQFISRLVWLGAKQVLGLDGIGEAGWRALHQTHRFEHI
FSWLLLTPEQLQNTPGIAKSKSAQLWHQFNLARKQPFTRWVMAMGIPLTR
AALNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGS
WLAAQQITGFEP
>b4038 yjbI, orf, hypothetical protein
MKKIECACNFLMDKDAQGYIDLSDLDLTSCHFKGDVISKVSFLSSNLQHV
TFECKEIGDCNFTTAIVDNVIFRCRRLHNVIFIKASGECVDFSKNILDTV
DFSQSQLGHSNFRECQIRNSNFDNCYLYASHFTRAEFLSAKEISFIKSNL
TAVMFDYVRMSTGNFKDCITEQLELTIDYSDIFWNEDLDGYINNIIKMID
TLPDNAMILKSVLAVKLVMQLKILNIVNKNFIENMKKIFSHCPYIKDPII
RSYIHSDEDNKFDDFMRQHRFSEVNFDTQQMIDFINRFNTNKWLIDKNNN
FFIQLIDQALRSTDDMIKANVWHLYKEWIRSDDVSPIFIETEDNLRTFNT
NELTRNDNIFILFSSVDDGPVMVVSSQRLHDMLNPTKDTNWNSTYIYKSR
HEMLPVNLTQETLFSSKSHGKYALFPIFTASWRAHRIMNKGV
>b4308 yjhR, putative frameshift suppressor
MGYLHIDGRGMKPNGGSRHNPLEAETIAAWLVAHKDDIERHYGEPLYKVV
GVVTPFSAQVNAIKMSLRKLEINGKDEQGLLTVGTVHSLQGAERAIVLFS
PVYSKHEDGRFLDSNSTILNVAVSRAKDSFLVFGDMDLIEMQPAFSPRGL
LAKYLFSSDNNALQFEFQKRQDLISAHTQISTLHGVEQHDEFLNKTLAGA
QKKITIISPWLSWQKVEQTGFLASMALARSRGIDITVVTDKNCNIAHVDD
DKRQEKQHLLNDAVEKLNKMGIATKLVNRVHSKIVIEDEELLCVGSFNWF
SATREDKYQRYDTSLVYRGEGVKNEIKAIYGSLDQRQL
>b4378 yjjV, orf, hypothetical protein
MLALAENYQPLYAALGLHPGMLEKHSDVSLEQLQQALERRPAKVVAVGEI
GLDLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRH
DLPRTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLP
LASLLLETDAPDMPLNGFQGQPNRPEQAARVFAVLCELRREPADEIAQAL
LNNTYTLFNVP
>b0258 ykfC, orf, hypothetical protein
MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDG
VNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPA
LRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCG
ETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHI
DVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYW
NNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQVEAIREECR
GVLEGSLKLRLNMDKTKIPHVNDGFIFLGHRLIRKRSRYGEMRVVSTIPQ
EKARNFAASLTALLWKVRISGEILLG
>b0247 ykfG, putative DNA repair protein
MKQLSFLPGEMTPQDRRLIQRALRALDRHLHEPGVAFTSTHAVREWLRLH
MAALEREEFRVLYLDNQNQLIAHETLFTGTINRTEVHPREVVKRALHFNA
AAVILAHNHPSGETTPSQADKTLTQRLVQVLQLVDIRVPDHLIVGGRQIY
SFAEHGLL
>b1134 ymfB, putative Nudix hydrolase
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAAR
ELWEETGISAQPQHFIRMHQWIAPDKTPFLRFLFAIELEQICPTQPHDSD
IDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTK
GVI
>b1808 yoaA, putative enzyme
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTG
KTYAYLAPALRAKKKVIISTGSKALQDQLYSRDLPTVSKALKYTGNVALL
KGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVS
VAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHL
FLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQSLSSRQLLDL
AKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLA
NPQIQRAFLLLDDTLELCYDVAKLSLGRSALLDAAFERATLYRTRLKRLK
EINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVLRNLPQTNQPGSAR
QLAAMLRPIIEANNGRCFMLCTSHAMMRDLAEQFRATMTLPVLLQGETSK
GQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLL
KARMEDCRLRGGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNR
LVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>b2949 yqgF, orf, hypothetical protein
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNIIERLLK
EWQPDEIIVGLPLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLST
VEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY
>b3034 yqiE, orf, hypothetical protein
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVR
REIFERGHAAVLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIE
EGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDA
TTASGIHGLADENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLH
HQALKNEWA
>b3148 yraN, orf, hypothetical protein
MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI
DLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHN
GSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS
>b3283 yrdD, putative DNA topoisomerase
MAKSALFTVRNNESCPKCGAELVIRSGKHGPFLGCSQYPACDYVRPLKSS
ADGHIVKVLEGQVCPACGANLVLRQGRFGMFIGCINYPECEHTELIDKPD
ETAITCPQCRTGHLVQRRSRYGKTFHSCDRYPECQFAINFKPIAGECPEC
HYPLLIEKKTAQGVKHFCASKQCGKPVSAE
>b3397 yrfE, orf, hypothetical protein
MSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAV
MIVPIVDDHLILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEV
GFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRW
PLAHMMDLLEDPDFNEARNVSALFLVREWLKGQGRV