TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Bacillus anthracis str. Ames, Ames
Gene type: CDS

Number of genes found: 223

Free access
Sort by:

 



# Bacillus anthracis str. Ames, Ames

>BA3818 conserved domain protein
MALVKVTDIAKTLSKKMIFLSDTCEVCKKERKRTVRFMKINGEVVCPVCK
LAEDNHKLEAEMNVFRDEKEQRKRKSMFYDKSLIKDETIKLARFSTFKSD
CEEDEKNYTLAKRALEDYLDDVRFNLILVGKVGAGKSHLAYSIAHEMNEN
SAGTVLYVSVSELFDYISSTFNGQSEESEHSIVNLLISADLLVIDDLGAE
LGDMDAADPKATAFVNRVLFKVFDGRQGKKTIITTNLTGEAVMKAYDERI
TSRMFNTYRHIEFKYTRDKRKRKLPF
>BA1565 DnaQ family exonuclease/DinG family helicase, putative
MSKRYVVVDLETTGNSWKDGKDKITQIAAVVVEDGEILEIFSSFINPKRE
IPPFITELTGIDESMVKQAPLFEDVAPMVVELLQGAAFVAHNVHFDWNFL
TEELRQAGYTEIHCPKVDTVELAQILLPTADSYKLRDLAKQHELEHDQPH
RADSDALATAELFLQFLNEIEKLPLVTLQSLYELSDVFQSDIADVLSENI
LKKMMHGKKVAEEYEVYRNIALRKRNYSLNLSETCSSKFDAFLHKTIDKL
ALNMPKFEKRESQQIMMKEIYTALRDSRFSLIEAGTGTGKTLAYLLPSIY
FAKKKEEPVIISTQTVQLQQQILEKEIPLLQKIMPFSFEIALLKGRKHYL
CLHKFEYALQEEEKNYDMALTKAKILVWLLQTETGDRDELNIPEGGKLLW
NRICSDVHSPGGMQSNWFSRCFYQRAKNKALFADIVITNHALLFQDFSSE
EPLLASCEHIIFDEAHHIEEAASRTLGEQFSCMYFQLVLSRLGTLETDDV
LSKVYKMMKKSEQASRSTFRMVSHSLKELKFDADELFQMLRAFIFKQTKQ
EQGISNMPLIYRYNTEVEKGKLWDSITELTNRFVYDIRKLLTILQKQVDI
LQSKIEWEMHVVTGEFMHLIELLRKMAQALQLLILEKNSYVTWMETETKG
TIHSTVLYGQPVHIAERFADEFLTEKKSVIFTSATLTVNDSFAYIKEELG
LHDFAPNTLQVPSPFRFEEQMKLMVSTDVPFIKKASNEEYIESVSAHIAK
IAKATKGRMLVLFTSYEMLKEAYTNLKNDEELEGYLLLTQSVNNKSRSRL
IRKFQEFDKSILLGTSSFWEGIDIPGDALSCLVIVRLPFTPPYQPMIEAK
GEWLKNQGEDVFAKLALPQAILRFKQGFGRLIRTSTDTGTVFVLDRRLTS
SSYGKYFLQSIPTVPLYEGPLEELLEQVKERPTE
>BA4075 prophage LambdaBa02, site-specific recombinase, phage integrase family
MISVKRIVDQAIYEKYVSQENKSLVKDFLIEKKAQGKAASTLKQYGWDLR
IILFLIHQHFENKSLIELTRKDIRNLSIIFQELGMSNARVNGLMSALRSA
LEFCADDDDYEYEFNVGSRVRGLPKNPVREITFITEEQIEWLIDELIVQE
KYMLATYLALSYYSAARKNEVYQVQKEELTERYYTNIVRGKRGKKFRLYY
NPRVQKCIRLYINQRGKDTIPDLFVRVYKNGGRKLLNKSVFNYWCKIFAK
MLYEKEGKEFKINPHCFRHSRLDNLKVQGVPLEKLKSLANHSDISTTQSY
LKDRSEEDIADIFGMDPSCFAAKKE
>BA1008 DNA repair exonuclease family protein
MKQVKFIHAADLHLDSPFKGMEMNVPQSVWERMKQSTFESFERIIDKAIQ
ERVDFVLLAGDLYDAETRSLRAQVFVREQMKRLSQYDIPVFIIHGNHDHL
GGSWAAIEFPENVHVFTEPYVEEKSFYKNGELLASIYGFSYLQQAVTDNM
TAQYTKMSDAPFHIGMLHGSVEGDAEHNRYAPFQIRELKEKQFDYWALGH
IHKREILLEEPYIIYPGNIQGRHRKETGEKGAYLIELTKQGSHCSFFHTA
DVVWDEIEVNIDGLETVDELMTSVSTAMNECRREEEGTQLTVVFTGQGPL
SPYLRDEKRVEEIFHILAAGEERKDFVYTMKWKNETVSFAEIERLKEENH
FVGSVLKELEAFTNMDGVLRSIWTSPIARNSIESFTEEEKKEIQKEAENI
ILEQLFQQERDKK
>BA2923 hypothetical protein
MKIMIDGNNISNEQLTKWKRERIKKVFNTLGHKEPDLIDTDVMIKKLTEL
KLNYSYNEMYDKLKSKLKLSEMGMRICTVLSGDRRKFSKTNIYLDGITAE
EAINGIDSFMLEQSRENDQVNLSACPDHYVLRPFGENELEVIETTGNSPF
PVQFFIVFGDETGLKTPRDKTYQYQSTGVARMKDGTTIGGVRHQFKDTDT
GIEVSLVVEFPALCPNSLINSHQMHLACEFSYWLQWIKSDKK
>BA3809 conserved domain protein
MITFVGVLLTIKFTREESRREKLPEKINNIEECLDYIEDKLNEIEILTRV
DMEEALCKRSFERTTQLFTIDDNYVNERK
>BA1239 ATP-dependent DNA helicase, UvrD/REP family
MEEDADAAYFRALEQQNVFLNEKQLEAVRTTEGPVLTLAGAGSGKTSVLT
TRVGYLVNVKQVHPRNILLLTFTQKAAEEIRNRVANLPGMNHAASSYVVA
GTFHSVFLKLLRSQGYNQQILANEKHKQIMIKKILKELRLKDAYDAETML
AMISLEKNKLNRPKDVKAKTPVEQEFKEVYERFEEVKQRYNYIDFDDILL
ETYYMLENNAPLLTQLQQRFHYIEVDEFQDTSYAQYEIVKLLASPRNNLF
IAGDDDQAIYGWRGASHQIILSFPKEFDNTTIIALNTNYRSNPFIVGLGN
EVIKLNQERFDKELYSVREEGVQPFYARPATTLDEANQILQLIQEKVDSG
ERNYKDFCLLYRTHSVSRSLLDQLTIHKIPFIKHGASQSFYEHSLIKPVL
DHLRLVIEPFRLESLSNILPTMYIGRDDCISFIEREQWKYGEGRFPSLLH
FLLLNPSLKPFQVKKVNERIDFIKFIKELEPKKALKEIIHGKGKYLEYLQ
SNDRSSFTMHKDIQEEMLEELMESANRFTDIPAYLQFIDEAIQGQKEMEA
LKTMPQKDAVSLMSIHNAKGLEFPCVFLLGASDGILPHTSSLKDANDRVT
ETSEALEEERRLLYVAITRAKEELYISSPQFFRGKKLDISRFLYTVRKDL
PEKTSTK
>BA0899 conserved domain protein
MESHINILKSDVKIDPSRSAFIKECVEVMYEGDDLESILKQVEQIDLAGA
SFKVIFVKINDLVGENKIEYGERRLIERDLGMHIEGEADVRNPERVFGIV
PLGGRWYFGHYVESEPVWYHHIKKPHSYSTSLSTRVARSVANIAVPNPEG
VRAIDPCCGIGTVVVEALSMGINIVGRDINPLVVLGTRKNIAHFGLEGTV
TKGPIEEITENYDVAIIDMPYDLFTHATPEDQLSILSSARRIAKKVVVVT
METMDDMIHEAGFEITDRCTTKKGSFSRQILVCE
>BA2712 conserved hypothetical protein
MSINCFVGLIKNELLKEVSIRSEDESEPVVVKDIPRLWKCLGKGNYAAVF
MHKEYKDWVVKVYTQEGEGIEKESEVYRRIGNHPSYSKLIYKGENFIVLK
RLKEITLYDAVHKGIKIPKQVILDINKALEYAREQGLTPCDVHGKNVMME
KGRGYVVDVSDFLKTKEDSKWKDLEKAYFTFYLPFIYNFPFPVKIPYFML
NIVRRSYRKYKKLKKKFKL
>BA3351 conserved hypothetical protein
MSNKVEILMHPVRMKICQVLMRNKEDGLTPLEMVKIIKNVPQATLYRQLQ
TMVDSGIVHVIKEKKVKSVSEKYYAINEEDAKIEGSEWKGLSDDEKLNYI
SYYQLSLMTQYQSYLKKLEEQNSQEDKATFSVVELKLDEEHFGKFQNELN
ELMIKYYNSQSKDDEIEAPVRTVAITIIPET
>BA5487 helicase, putative
MINQTEVTIRLQHVSHGWFLWGEDDSGTPLSVTSWKRNAFTWHSTSFYGT
FLKEATFEGKQGVMLTNAQAFEYIANQPMNSFARIQMNGPITALTKDANE
LWDAFTSGSFVPDMEHWPKQPSWKVQHTPIEDDTLASLFSSAVNESILQD
NRSNDGWEDAKRLYEHYDFTKRQLDAALHEEDWLRKIGYIEDDLPFTIGL
RLQEPQEEFEMWKLETIITPKRGAHRIYVYESIDYLPKRWHDYEERILET
QEGFSKLVPWLKDGDTFRSELFETEAWNFLTEASNELLAAGITILLPSWW
QNLKATKPKLRVQLKQNATQTQSFFGMNTLVNFDWRISTNGIDLSESEFF
ELVEQNKRLFNINGQWMRLDPAFIEEVRKLMNRADKYGLEMKDVLQQHLS
NTAETEIVEEDSPFNDIEIELDGYYEELFQKLLHIGDIPKVDVPSSLHAT
LRPYQQHGIEWLLYLRKLGFGALLADDMGLGKSIQTITYLLYIKENNLQT
GPALIVAPTSVLGNWQKEFERFAPNLRVQLHYGSNRAKGESFKDFLQSAD
VVLTSYALAQLDEEELSTLCWDAVILDEAQNIKNPHTKQSKAVRNLQANH
KIALTGTPMENRLAELWSIFDFINHGYLGSLGQFQRRFVSPIEKDRDEGK
IQQVQRFISPFLLRRTKKDQTVALNLPDKQEQKAYCPLTGEQASLYEQLV
QDTLQNVEGLSGIERRGFILLMLNKLKQICNHPALYLKETEPKDIIERSM
KTSTLMELIENIKDQNESCLIFTQYIGMGNMLKNVLEEHFGQRVLFLNGS
VPKKERDKMIEQFQNGTYDIFILSLKAGGTGLNLTAANHVIHYDRWWNPA
VENQATDRAYRIGQKRFVHVHKLITTGTLEEKIDEMLERKQSLNNAVITS
DSWMTELSTDELKELLGV
>BA2645 conserved hypothetical protein
MKDIPVEIQLNQVTFQLKEHHNFDWLLKLGNVFAVFDQQDSGNLSFGVER
DGHKKFIKYAGARTIAYEGTMKDAIERLKSSVSLYEELMHDSLIKLIDHF
PVQSGYVLIFDWFDGECLHSHWSFPPPDKYKNPNSPFYKFKHLPVIKRIQ
SLNSIFSFHTYVEKKNYVAIDFYDGSILYDFHTNETKICDIDLYSKKPFV
NKMGRLWGSSRFMSPEEFELNAIIDSRTNVFNMGAMAFAILGGGKERSFT
EWEASRDLYEIAYRAVNENRTERYASVKEFYEEWVSVSNTKKI
>BA0247 ATP-dependent RNA helicase, DEAD/DEAH box family
MTTFRELGLSDSLLQSVESMGFEEATPIQAETIPHALQGKDIIGQAQTGT
GKTAAFGLPLLDKVDTHKESVQGIVIAPTRELAIQVGEELYKIGKHKRVR
ILPIYGGQDINRQIRALKKHPHIIVGTPGRILDHINRKTLRLQNVETVVL
DEADEMLNMGFIEDIEAILTDVPETHQTLLFSATMPDPIRRIAERFMTEP
QHIKVKAKEVTMPNIQQFYLEVQEKKKFDVLTRLLDIQSPELAIVFGRTK
RRVDELSEALNLRGYAAEGIHGDLTQAKRMSVLRKFKEGSIEVLVATDVA
ARGLDISGVTHVYNFDIPQDPESYVHRIGRTGRAGKKGIAMLFVTPRESG
QLKNIERTTKRKMDRMDAPTLDEALEGQQRLIAEKLQSTIENENLAYYKR
IAEEMLEENDSVTVVAAALKMMTKEPDTTPIALTSEPPVVSRGGGSKKRG
GNGGGYRDGNRNRSRDGRGGDGRNRDRNRDGRNRDGNRDRNREGSRDGNR
GRRGEGQGRPGSSNGRGERKHHSRKPQA
>BA3288 impB/mucB/samB family protein
MYDYSILPNRIILCVDLRSFYASVSCIKMGLDPLHTKLAVVGDMNRNGSI
VLAATPPLKAMGVKKLARLYEIPRQKDILIVNPIMGTYIKCSNYITKLAL
QYVPIEDFHQYSIDEFFMDITDSIHLFARNPNEFALQFKREIYEHTRIEC
TIGIAPNLLMSKVALDIEAKKNKDGVAYWTYEDIPTKLWSIRPLSKFWGI
SHKTETKLNQKGIHSIGDLANYPLKYLKQSFGVIGEELHLHSNGIDFSRI
SEKYVPVTTSIGKSQILMRDYTIEEFPIILLEHTEEVCYRLRQQNKLAQT
VQFSIGYSKSYSGGFSKTHTLSRPTNLTMDIYQVCLYFLNQQYTGEPIRS
ITISLTKLIGEGEEQISLFDNIIQREKEIKLTKVMDEIRTKFGKNSILRG
ISYTNNATARYRNTLLGGHKA
>BA3638 hypothetical protein
MFVVLKYKSYILYPLINLNIGICLQRWRSLIMKIRICATFMLFLLIWLHP
FWTFAKGEEAKERVVSLVYDDSGSMRNNDRWKYANYALQSLVALLDEKDK
FSYVPMSKPNDPVNISLTKDKRQTEIEGIGAWKTYLNTPFSAVETAMQSI
KKEADIDGKREFWLIVLTDGAFNDLEKDKVGGKEQILQKLAQFKKEMDEK
KISLHPVLITMEEDLGQQEKAQLNTFKEIWKKEINGVTMPSSGEDGIVKS
VNQVAALVANRDPFSSVESIVKTKVVGKKVEITTPFPLKRMTLVRQSPSL
PDYQVKQISKPLQLQSSFSIHAPGEAKLFGNIVHISTENEEVIKPGTYTI
EVDQEIEKEGLQVLVEPALNYNVSTYDKEDKDRKNVEEMYEGVTAVIEAK
PTELPVQSSYFQAEVEIDGKQYAMKWDDKRHVFYYETKLTKGLIRGKVHM
NIKGFYRQTKEFKIETTKKPELSLQTVTKDYKEKVTNLENSKPFIIQPQL
DGKPMTEEAVKKLLKSTGVTSKQSINYEIKQHDNQIYIYPRPHYSDTFNF
TDTGTVEATIVVQDSKLQKVKKDITLHIQNAPFYEKYALIFKFVIPITLL
LLLVGIIVLGWIVRPRFHRKALLYYEWDQEVAKDWLYQSEPELLKNKWWK
HYFGIPYRAERKTVQSVTFIAKKGSKSIFVAKESQVVGMIIDGMFITEDE
VGMEHKTLYPNELLVIDRGYGKEIYKYECE
>BA3639 hypothetical protein
MKSTDQFQTYEVPWEEFIAALREEMKKQVGADIIDTVVCNLEDGFEVYTY
VQGDLDFEYKEALERKYGSDAKTPNVLTIMLLASIYSTSEENIRLHADHK
DNPIVEVYVKQ
>BA3745 transposase, IS605 family
MTKQNKAYKFRLYPTEEQAHLIRKTFGCVRFVYNKMLAERKEVYEKHKAN
KEELKEQKFSTPAKYKVEYEWLKEVDSLALANAQLNLQTAYKNFFSGQSD
FPTFKSKKSRKSYTTNRVNGNIMLFHGYIKLPKLKMVKLKQHREIPPKHI
IKSCTISMTPTGKYYVSILTEYEKEITPKEVERVVGLDFAMVELYVSSED
EKANYPRFYRQMLEKLAKAQRVLARRVKGSERWNKQRIRVAKLHEKVANQ
RKNFLHHKSKELATSFDVVAIEDLHMKGMSRALRFGKSVADNGWRMFTAF
LAYKLQEQGKQLVKIDKWFPSTKMCSRCGNKKEMPLCERTYACSCGLTIG
RDYNAAINIKKEAIRLLALA
>BA3840 site-specific recombinase, phage integrase family
METTEFHDTIQAFSIFLLNKGRKPSTIKRYVYDIEDFGHWLEKNKKLPSS
NIWATLCTKDYEDYFSDLKKNRHYSEKTMHRVFIVLNRMHHFLNIPNPLK
NMEISIQPDRTLRNEDFISSDEEKRLKHIVTSLEGLSEKQRPVRPLLMDR
NIAILNLLIDYGLSLQELTALTMHHVHFETNTLSIPATAGVERTISLTSE
DKKQLYTYYKSIPEPVRPKYHSNDPLFVAFDFNRGTYRWVYENDAPKGLT
EIAIQKMIRLEVSRANLRKGISGQHFRNTYILNLIKKETPESEIIKLAGF
KSKISLKRYYQYAENGKNALS
>BA3139 hypothetical protein
MSISVSKEFGSITDFKQALAKYIKYYNHERIKKKSKGMNPVLY
>BA4794 MutS2 family protein
MLERTLRVLEYNKVKEQLLEHTASSLGRDKVKHLVPSTDFEEIVEMQDTT
DEAAKVIRLKGSAPLGGITDIRSNVKRAKIGSMLSPNELLDIANTMYGSR
NMKRFIEDMVDNGVELPILETHVAQIVSLYDLEKKITNCIGDGGEVVDSA
SDKLRGIRTQIRTAESRIREKLENMTRSSNAQKMLSDSIVTIRNERYVIP
VKQEYRGVYGGIVHDQSASGQTLFIEPQVIVELNNALQEARVKEKQEIER
ILLMLTEEVAVEADIVLSNVEVVANLDFIFAKAFYAKRIKATKPIVNNER
YMDLRQARHPLIDPEVIVPNNIMLGKDFTTIVITGPNTGGKTVTLKTVGI
CVLMAQSGLHIPVMDESEICVFKNIFADIGDEQSIEQSLSTFSSHMVNIV
DILEKADFESLVLFDELGAGTDPQEGAALAISILDEVCNRGARVVATTHY
PELKAYGYNREQVINASVEFDVNTLSPTHKLLIGVPGRSNAFEISKRLGL
SNRVIDQARNHISTDTNKIENMIAKLEESQKNAERDWNEAEALRKQSEKL
HRELQRQIIEFNEERDERLLKAQKEGEEKVEAAKKEAEGIIQELRQLRKA
QLANVKDHELIEAKSRLEGAAPELVKKQKVNVKNTAPKQQLRAGDEVKVL
TFGQKGQLLEKVSDTEWSVQIGILKMKVKESNMEYINTPKQTEKKAVATV
KGRDYHVSLELDLRGERFENAMARVEKYLDDAQLASYPRVSIIHGKGTGA
LRQGVQDYLKKHRGVKTFRYGDMGEGGLGVTVVELK
>BA2475 ATP-dependent RNA helicase, DEAD/DEAH box family
MVYLKNFLELGISETFNHTLRENGITEATPIQEKAIPVILSGKDIIGQAK
TGTGKTLAFVLPILEKIDPESSDVQALIVAPTRELALQITTEIKKMLVQR
EDINVLAIYGGQDVAQQLRKLKGNTHIVVATPGRLLDHIRRETIDLSNLS
TIVLDEADQMLYFGFLYDIEDILDETPGSKQTMLFSATIPKDIKKLAKRY
MDEPQMIQVQSEEVTVDTIEQRVIETTDRAKPDALRFVMDRDQPFLAVIF
CRTKVRASKLYDNLKGLGYNCAELHGDIPQAKRERVMKSFREAKIQYLIA
TDVAARGLDVDGVTHVFNYDIPEDVESYIHRIGRTGRAGGSGLAITFVAA
KDEKHLEEIEKTLGAPIQREIIEQPKIKRVDENGKPLPKPAPKKSGEYRQ
RDSREGSRSGSKGRTRNDSRNSSRNENNRSFNKPSNKKGSTKQGQQRRGR
>BA3060 mutT/nudix family protein
MISKLTFGYKKPTEQYVLRPSCYAIIFNSTSSKIAIIQKGESYFLPGGGM
EGTETKDECLHRELLEELGWKIEIDQYIGNAMRYFFAEKEDTYYLNDGFF
YIANMVQKQTENCEEDHVLRWMSPLHAVELLIHDHQKWAIEQALLLRNEK
GSPSI
>BA4628 ATPase, AAA family
MKQPLAHRMRPTNIQEVIGQQHLVGEGKILWRMVQANHFQSMILYGPPGT
GKTSIASAIAGSTGTPFRLLNAVTHNKKDMEVVVQEAKMHRHLVLILDEV
HRLDKAKQDFLLPHLESGLLTLIGATTSNPFHAINSAIRSRCQIFELHAL
TEEDLLIGLKRALEDKEKGLGEYDVTVTPEALHHFANASGGDMRSAYNAL
ELAVLSSFTTDDKAAEITLEIAEECLQKKSFVHDKGGDAHYDVLSAFQKS
VRGSDVNAALHYLARLIEAGDLQSIGRRLLIMAYEDIGLASPQAGPRTLA
AIESAERVGFPEARIPLANAVIELCLSPKSNSAYKALDAALHDLRNGQTG
DIPSHLKDSHYKGAEALGRGIGYLYPHDQPNGWVQQQYLPDKIKNKQYYK
PKTTGKFEQALSTVYERLQSSNKTKNKG
>BA1964 mutT/nudix family protein
MTEWLTIFDSERNTLGKKLRDEVHRDGNWHETFHCWFVEKDAEDMFLYFQ
LRSKNKKEAPLIWDITSAGHIMHDEDVQIGGLREIEEELGLSFQTTDLAY
KGIFTIDYEISNLTDREFCHMYFHNVIDSLPFAPGEEVDDVMKVHATSFL
QLLKKEISSFTAISVLNNKPITITFEDIYPYDLAYYEFVIEKGKELIKNN
SL
>BA3637 hypothetical protein
MVRRFTNTSVNSRWGKEESIIFPSFFQTSKNRKVVHMTLQVPTILIGLGG
IGSTVTHQIYERLPEERRKKVAMHVFDTDVNTLSKFDHIRKFKTQTSSSK
TPREYIAGDPTIPEWFPMDPTILDKPLTEGAGQLRVISRLALRAAMKEDK
LTSFWQEIEKIFPVTSDQTEYGVRVIIVTSLAGGTGSGMFLQIALYLREM
LRKKLQHHNILIRGAFLMPDVLVKTRTVSAKEFETVQANGYASLKELHAI
TLGSTGELSKRGGVTIELEYRPDQVDEDGRTNHTIKQHHLPYNYCFLYDY
ENLHGHHLHNLSDYMEQMANTIYLQLFSPMSANHFAQEDNQIQQLAESSG
KGRYCGAGTAKIIYPYEHLLKYCALKWAVQGLDESWLHLDQLFQEKKHRY
DQDVKRGMQREKPERGKSYLEDLEHLATRPEQAHIFYRQMYNETREGAEG
GKVGVAKSKLFLEAVESYVQRTVQKDEELNRLQHECKISAAKLKMTEQMK
GEVARVDHAVRLYAYAIPSRVHEHVTTLLYDMIESDRFSPSGSEGQSYQL
NTWFLKKTDSVHPVAARFMLYEIRKQLVEKMNRLHENNEQKRNLIQNYDK
KFNVSNIDGTVTAVRRVEIAQQQGWFGKMINNQQRLFKKEFEDIVTQYVH
KLNEYRKEMLLELVYQSLYQAVNKMIQYWERFFDNLHETRENLLFEIQKR
SKEFEGKTNPTNVYVLAEEKLQEKIWQDMQQHLNLGVLPKDICAEIYMSL
YGEYCRDAKTEEIQSKKVEDFYREHILNYCYDELQIRYRDKLELNIVEAL
RKEADYKNRDRDEYVREKIEDLFHLASPFVPKVSHHRELQYWGIHPSLKK
ELQEELMQEMFKEKDTVNEAFSPFEVICYRAHYGLSLQDFPKLSSGHIAN
GFMNDKGDYFQSYYRRVNKLNSKKSSLTPHLDKYWHLPAFMPDLNATQTK
LDYDKCNRALLYAYIYRWISLVAVDGQFVYQYNGVGRSFSIQSMGKNISS
ESYKLHRALLHNPFIYENILSRFEEEQEKAMIQGGHLYTHPFVLGAQDIR
WLRKEHVHNILDMILMYDREAKYDPTLEETSDELLRLFLDEIELYFQNYY
GTGADMVAKKEKEMFIKQLWDRSYAKGYVDPNSAPYKKWQNLLSVHDEEE
TPKTNV
>BA3207 conserved domain protein
MNTPTQTPSLSETMKEWHYALAYEIKHWKTIGGSKISIMNGRFLYTDYES
TVYVFQLISEVSLPEGSPIRIEFDGEEATGEVLSVHGLEIELKLNDYIQG
EIREAVLYSEPWQLLEQLQERLKEARKDKLKRNRIKRLVDGTSSPKHIEK
MKNPKNELAYRAFYNPTTYVWGPPGTGKSYNLSRIISAHYQKGKSVLVLA
HSNAAVDVLMSEVTKQIEKKKKWTPGEIVRYGYSQHEHIRNHETLLASKL
VETTNGSWGEERLYLEETRQELREKILSYKATSADKKRMQEIESDLRKQR
AKIKEVEKEYIENAKVIGATLSKCAIDSLIYERTFDLVVVDEVSMAFVPQ
IALAASLGKRIVVCGDFLQLPPIAMANHELVRKWLGEDMFYHAGIVESVN
KSEAHPNLFMLQEQRRMHADISKFTNSFIYKNRVYDHPSVSERKELAQLQ
PFANEASVLFDTSQMGAFSLKDAASGSRFNIMSGLVAMQMMLIGLLDGVQ
SIGIVTPYRAQSRFLSTCIREMLQRTKYQNIPVLAATVHKFQGSERDMMI
FDTVDSYPQERPGVLFFDHKNHRLVNVAVTRARGKFIQLSDCHYMCKNLS
RKQALSQLTAHIERHGDVYDRTTSRQLWERKISKRLRWFMEMNLEEPKGL
LKDILAAKRKIVISLPSTKQVDKRVWQALMRTNAQITVYSDGPVPLKNVK
LQRQNKAFPFIVIDDEIFWAGAPLTSQMMFEGSTEFPYVCARLQAPETIG
VLKGFLDIR
>BA4380 mutT/nudix family protein
MGYVEELRKIVGHRPLILVGAVVLVINEHGYVLLQQRTEPYGKWGLPGGL
MELGESPEETACREVYEETGIEVKNLQLINVFSGANYFTKLANGDEFQSV
TTAYYTDEYDGDFVMNKEEAVQLTFFPLTELPDYIVGSHKK
>BA0248 UV-endonuclease, putative
MLVRLGHVAMSVHLKNASPSQTMTYAQFQKIDDREAAIRKLERIANSNLE
NCLRLLKHNKGHDISFFRLSSKLIPLANHEELLEWNYIRPLKENLKVLGD
YAIRMNMRIDFHPDHFVVLNSPEENIFKQSVKTLQMHRKLLKGMGIEHKQ
RCVMHVGGGYKDKELALERFIENWSNVPRGIQEMIMLENDDTTFTLEDTL
YLGEKLDIPVVFDLHHHMMNHDREDWHEDWARVVHTWESSLLPVKMHISS
PREGKDPRAHADFIDVDTFLSFLKKIKGSVPQIDCMIEAKMKDESLFQLM
RDLSEQTDVEIIDGASFYIK
>BA3140 conserved hypothetical protein
MSELHFMSLEELDNELEKDDSGIYFIKDYNDNIIYIGKAFSIKSRVLAHF
NSYSNIKEYVHLFNKVAYLIEDSLLKRSLLQVTYMIKYKPVLNKEVQKEF
PELYTQYIKQTNKKSMLLEIEEAKEKRDELKNRLVKLVGGKTMFYDIISL
LNNGYNYHVLAKVLSIELQTLIIMKEHRNKFPMPHNYKRTIKHQDIMYAL
SGKKNLSTSRLNT
>BA5592 UV-endonuclease, putative
MHKGCVFIMIMRFGYVSHAMALWDCSPAKTITFTSFQKLSKQEREDKLYD
VTKQNLEHTIRILHYNIAHEIPLYRLSSSIVPLATHPEVEFDYIGAFTPL
WRKIGALIKEHNLRISFHPNQFTLFTSDKPHITTNAITDMTYHYKVLDAI
GIADSSYINIHVGGAYGNKEKAIERFHENIKKLPAHIKKQMTLENDDKTC
TTAETLSICQKEKIPFVFDYHHHMANLCEEPLEELLPAIFETWSHTNIVP
KVHISSPKSKKEFRAHAEYIDLEFIKPFLHVAKKINHNFDIMIESKQKDL
AMLQFIQELSSIRGIKRISSSTLQW
>BA0933 DnaD domain protein
MAVYRNVQVNFWQDDFVLDLTPEERYFYVYLLTCSKTTQCGIFPFPKRLA
EMETGYNRETVDKLVQRFVDYGKILYDAETRELFVLNWLRYNPVTNTNVE
KCVLRELKGEKNKEFVHMFLQKCVDEELSVPMLLAHFGMPSDLAVDDVEL
VCEGTEEEEGIEEETGSRVFSFYEQHFGSLSPHTVEELSAWMEDLSEELV
LKALQIAFENNKRTVAYVKGILRGWHGKGFTKVSEVEADTASFRKKGSSA
STGETEEFLARCEEWEKNAPSEEELQRFLAERGWRP
>BA3770 conserved domain protein
MSGGAQINKAGQAFSEDYVIVECTDHTDSFYQFHLDNTKMGNFEKGKDMT
FSVDLQNDVYVTFLVFQFINGSWTENLQTEFPAGDWSRRSFTFQIDGRAT
GWGMRLRFARLEMSKGKRFRFKRPKLEKGSVPTDFSKSTYELEQSVNGIR
ETITKVENNQSGFDKRVTAVEKNAEGITQNVSKIQEMQTEQSKKISEAQS
IIKQHSDQIDLTIKKKDMEDYVGGLGSINEVRNAGLNTTKFWNLSAGTVL
QPNSIYKGYPTFWSDYSGKTSDHWSGAISDFISVTTGESLVSTGWFATDN
IASLDQKAWMEIEFYNGTKSTRMRTQRVEIKWIKQGDWVKMTMLSTVAAN
EEWVRWRYYVQRNGRLRAGLPMLQRGKVATEFWLHPKDQVDADKMLEDIA
NRVATEQYNQKMTQIDNRFSVNEKGIDLVAKKTEVYTQTQSNDKFATNAY
VRNMEGRIQVTEKNILSTVKKGEIISSINQTAEKIKISANLIDLVGRVEA
SWLKAGLLQGMTIKTSATKEYLHMENQVIRFVNQGSAKIVMGFEDERKST
TRNPYIILGEGDGTGRNIGSIYKDGNGVYYRYVDYNGAESNIRLTNAGNI
GITAQDGIWFTAKRVNFTSPISTSGILFNSFGKIPLSQQGVLWMGNGYRG
FGAYYHDGKAWNFISSSQ
>BA4222 conserved hypothetical protein
MEATYKTKDVTNKTGIPKHIVRKYSQLLEEHGYMISKTADARIYKLDDLK
LLKSIHERAATLQEDISETIPIILKEKEAPPVPVIQDKQEIQPKEKEDGR
NFEEFMLKLEMLAQLNEAIIHQNSTLITQNRLKDEKLDELMQQVYVKEGS
QEKMLQELVDHAAQTDALQKEKMDLLMNHMYKRESKQEEKMNKLVNQIYN
KDSNRDAQLMQVIREIQETKRLVAASKEQSFFQSFRSLFSRTKQEKTTE
>BA4134 prophage LambdaBa02, site-specific recombinase, phage integrase family
MAYFRKRGEKWSFTMDVGKDPITGKRKQITKGGFKTKKAAQEEVARVTND
LANGDYENSDIRFSQLVEIWTQEKESSCRPSTLYQYKRILRSRVMPEFGE
KRLSDIKPLSVHNFHQKLLKEGLTTKYISSVDVMLKQILDKGVELEMINS
NPAKKAKRPKVKKKAQASWTVEEAMKFMEYAKIQGSYYIAFVLALHTGMR
IGEVLALQWKDINFESKVIHVQRTLTLVDGKYELGETKTEASNRMIPMTQ
ELMRELLEYQSHKKDNSFDLLICTRNKKIVHPYTIRYQMKALCEAIDVPY
IRFHDIRRTFTTILIDSGANAKVVSKLLGHTNVSTTLNIYTDVYEERQIE
VTEMLGNVLKSGRSGQKVVSEEKQED
>BA3335 conserved hypothetical protein
MDFKTVMQELEALGKERTKKIYISNGAHEPVFGVATGAMKPIAKKIKVNQ
ELAEELYATGNYDAMYFAGIIADPKAMSESDFDRWIDGAYFYMLSDYVVA
VTLSESNIAQEVADKWIASGEELKMSAGWSCYCWLLGNRKDNEFSESKIS
DMLEMLKNTIHDSPERTKSAMNNFLNTVAISYVPLHEKAVEIAKEVGIVE
VKRDNKKSSLLNASESIQKELDRGRLGFKRKYVRC
>BA3452 hypothetical protein
MMSNNNQVLKVGDWVRGISNEGEFIVGYVVSLDEVEDVVTVSIVKRYDKY
TKNEAILLFSKHVNKLPESNVINKEQILYLIDLALLTGDEEWFIELSSKL
NSIKELVNERF
>BA2695 hypothetical protein
MENVSNVDKVESIKSLQSTIRKLENALSQMTQKGSNTTLVKKRLKAASIG
LAMLESIWNQETHHYTQEDLADARNVIIGLLPSIEKIYVKSKLGSPQRTL
LERRIKSLELSIQAIDHFSNQ
>BA4311 integrase/recombinase XerD
MEDQLKDFIHYMVVEKGLAKNTVVSYERDLKSYVKYLQKVEQAKSFHEVT
RLHIVNFLQHLKENGKSSKTLARHIASIRSFHQFLLRERAVEHDPSVHIE
TPQGERKLPKVLSVDEVEALLQTPKMTSAFGVRDKAMLELLYATGLRVSE
LIALNLEDVHLTMGFVRCVGKGNKERIIPLGSLATEAIQKYIEKGRRELM
GKKVVDALFLNHHGNRLSRQGFWKILKRLAKEANIEKELTPHTLRHSFAT
HLLENGADLRAVQEMLGHADISTTQIYTHVSKTRLKDVYKQFHPRA
>BA4509 ATP-dependent RNA helicase, DEAD/DEAH box family
MTQQTFTQYDFKPFLIDAVRELRFTEPTGIQQKIFPVVKKGVSVIGQSQT
GSGKTHAYLLPTLNRINPGREEVQLVITAPTRELAQQIYEEIVKLTKFCA
EDQMITARCLIGGTDKQRSIEKLKKQPHIVVGTPGRIKDLVEEQALFVHK
ANTIIVDEADLMLDMGFIHDVDKIAARMPKNLQMLVFSATIPQKLKPFLK
KYMENPEHIHINPKQVAAGNIEHYLVPSKHRNKIDLVNKMLLQFKPYLAV
VFTNTKKMADQVADGLMERGLKVGRIHGDLSPRDRKKMMKQIRDLEFQYI
VATDLAARGIDIEGISHVINYELPSDLDFFVHRVGRTARAGHSGIAVTIY
DPANEEALDSLEKQRHIEFKHVDLRGDEWADLGERRRRKSRKKPNDELDV
MATKVIKKPKKVKPNYKRKLATERDKVKRKYSNKKR
>BA2676 mutT/nudix family protein
MIKDKKLIVAEYIGHHYFLPGGHVEVGESAESALIRELQEELGVNCSIKQ
FLGVIENQWQDKEMLHHEINHIFEIDSEELHIDFIPKSKEPHLAFHWIDY
NRDALHTYKIMPAPSVKELLERKLSDELLNCWISNF
>BA0362 conserved hypothetical protein
MTGNVLNYFAGGNTARGFHNLYEENLKGLNRLFILKGGPGTGKSSLIKAI
GRDWVEKGYDIEFLHCSSDNKSVDGVIIPKLKVGIVDGTSPHVIEPKMPG
VVEEYINLGGAWDSDKLREQKVEIERFVSEASKAFQAAYARFNEALIIHD
EWEKIYIDNIDFNKANELTDQLIQKLFADKGGKQSLVKHRFLGAATPKGA
VDFVPNLTEGLLHRYFIKGRPGSGKSTMLKKLAKAAEEKGFEVEVYHCGF
DPNSLDMVIVRELGFAIFDSTAPHEYFPSREGDEIIDMYALIVTPGTDEK
YAEEIRHVSIQYKTKMNEAMSFLAKAKSVRDKLERIYIAAMDFSKVDAYK
EEIQKEFERIAISVTEKNK
>BA1804 helicase, putative
MGFTLNKSIIKEVCGETSYKRGEAYYKSNKVIVNHYDETKEICEATVKGN
EDFHVTVEKAKKGDVVARCSCPSLASFQTYCQHVAAVLIQINYNQQTGGM
GSVSSRNDQLTNGMFQLFADKPLRPKSKQHRFDTREILDVAFICSPVATK
SGGALLGIQLKLAKTYLINHIREFLSKVEKRETFHYSNEFTYTPDIHSFK
QETDVIIQHLIKIYHNEKMYEDALEVHAKQDESMIFIPPASWNDMLSALS
RAEYVQLKQNEQLFHGLQISKGLLPLHFEFTKGNNGGFTLHIAGLNRVRV
MEMYNNALYDGKLYHLPMEDCMRLIELQKMMIRSNSNQFYIPENKMEHFV
AKVVPGLMKLGTVRIDEGISDRVETPSLKAKLYLDRVKNRLLAGLEFHYG
NVVINPLEEDGQPSVFNRDEKKEKEILDIMSESAFAKTEGGYFMHNEDAE
YNFLYHIVPTLKGLVDIYATTAIKLRIHKGDTAPLIRVRRKERIDWLSFR
FDIKGIPEAEIKGVLVALEEKRKYYRLANGSLLSLESKEFNEINQFVKES
GIRKEFLHGEEVNVPLIRSVKWMNGLHEGNVLSLDESVQDLVESIQNPKK
LKFTVPQTLRAVMREYQVYGFEWMKTLAYYRFGGILADDMGLGKTLQSIA
YIDSVLPEIREKKLPILVVSPSSLVYNWLSELKKFAPHIRAVIADGNQAE
RRKILKDVAEFDVVITSYPLLRRDIRSYARPFHTLFLDEAQAFKNPTTQT
ARAVKTIQAEYRFGLTGTPVENSLEELWSIFHVVFPELLPGRKEFGDLRR
EDIAKRVKPFVLRRLKEDVLKELPDKIEHLQSSELLPDQKRLYAAYLAKL
REETLKHLDKDTLRKNKIRILAGLTRLRQICCHPALFVDDYKGSSAKFEQ
LLDILEECRSTGKRILIFSQFTKMLSIIGRELNRQAIPYFYLDGNTPSQE
RVELCNRFNEGEGDLFLISLKAGGTGLNLTGADTVILYDLWWNPAVEQQA
ADRAYRMGQKNTVQVIKLVAHGTIEEKMHELQESKKNLIAEVIEPGEEKL
SSITEEEIRDILMI
>BA3331 DnaD domain protein
MAVYRNVQVNFWQDEFILDLTPEERYFYIYLLTGTKTKQCGIYVLPKRVA
ELETGYNMETVEKLLNRFVEYGKILYDVETKELYIMNWLNYNPILNTNVE
KCVLRELKTVKNKEFIHMFLRKCLEEEWKIPLLLQHFGMPKEEDTSSLQE
VVEEIENVEEVADTSHSEVYKFYEQNISSLSPYIVKELKNWIQRLSGEKV
LEALKIAFEQNKRTFAYVKGILRNWCKKGWGDFSGRKEGEVCLHDP
>BA1783 transposase, IS605 family, OrfB
MCLIGLRKRQQKLEQKIEMYETHIKNGTLPPIIFGGRKNFYERMKDKISN
QEWKDLRTRQLYSRGDKSKKGNLNMRITVDDCGQGWLEIANSLGRTNGKT
KSPRIKVPIIIPYRFYHEITNVVMGEQIGVNPKGKPIIDHQKYSVEIIRK
QNDFYVNITFDETEIGRVLDFKETPQSDLIAGIDVNPDRIAVSLCTKQGN
FKGSKIFYLHNLDTFSTNKRTTVIGQIVQQIKKWLIENNVGGIVLEDLKF
QQSHDTDKYSNRKFHQFTYKKMLDSLIRMALRNGFSVKTVNPAHTSVIGK
LKYCKKFGISVHETAAFTIVRRGLGFQERLPKEVVLLLKNKITTKLRIFV
ASMEESEKDTNTKKVYKKWLQTIKTWKDHHNWKLWSILHKTVYMSNQQLL
FKI
>BA4120 prophage LambdaBa02, DNA replication protein DnaC, putative
MTRIVNTSACSEETEGYTCEHCNKYIAAITVEVPQLRIKNKILPTCECVV
EREEAKIREAQNFAKKREIEKLFSISNLGERFSKSTFESFLDRNGSETAY
KVAVKYVKTFKEWNGESLMLWGEPGNGKTHLAAAIVNELSKKGYIVVFQS
VPELLQRIRSTFNSENKENETQIMRALLECDLLILDDIGAEKTTEWVEEK
LFNIIDGRYRKELPTLYTSNLEPKELKNQVGKRSYDRMVETSLTVKNEAA
SYRREIAKQRLQRFIEV
>BA2735 hypothetical protein
MKINWTEVKGKIRPQISKPSYETWFTNTTVYLEDDILTIYCPNEFARDWL
ESHYKELVFNTLREMFNTSFEIQFDLCNGEPSNLKDVQKEKLSSWDEVKK
ALRPKIAEKTFMTWIRNTNATIKDNKVIIFCEDAFHRDLLEGEYKNIISS
TVQKITDEEYQIWFEIGSSATSKAQVHHMKNNQRTSGQEESKTIWNKIKD
KMQLKISRPSYETWVKETTARINEDSLIIYFENEFQQEWVKKSYKDLISQ
IAKELTGNTYEIQFELKSNTTSNNEKSTIKDITSELGERFKVSNNSLKYN
DVDESNKRIRALEEKIMNLEKVIGTLVEKLDAVELKTQLEK
>BA2781 mutT/nudix family protein
MSKLRAEVMILNEDHSKVLVQCDENELFYRFPGGSIEFGETAKGAIIREL
MEEYDLKIDVQELAVVNEHIFEWNNEKGHHCTLIHWGTVQEMVTNEIRHK
EHENIILIWKSIEELKEKPTYPEGIVSYLEENNHNIVHFVSKNI
>BA4926 minD family ATPase domain protein
MEDDIAGYFKQVERYDYIIIDLDKKFFYMEVVQLKTNITSLKISLNLDKD
ETIIAGNVKQYDPSRLEQFIQNFKHTAQTCLEHNLRSPEELFAFWKGN
>BA5397 transposase, IS605 family
MRRAYFIEIKPTHEQITKINQTIGVCRYVYNLYLSKNKEMYESEGRFLSG
YDFSKWLNNVYTKECSQWIKKVSSKAVKQAIMNGDKAFKRFFQGLSGFPR
YKKKKNQDVKCYFPKNNKTDWTVERHRVKVPTIGWMRLKEYGYIPKNTIV
KSGTIGKRAGRYFVSVLCEWKEAEMKPILNQTGLGIDLGVKDFAIRSDGI
VYENMNKTIQVKKIEKRLKREQRSLSRKFENIRKRGEQSATNKRANIDKN
ILRVQKLHAKLANIRLAYVKSVVNDVVKTKPTFITVEDLNVKGMMKNKHL
SKAVAKQCFYTFKTWLLTKCEEYGIELRQVDRFYPSSKLCSYCGQKKVDL
KLSDRVYTCNCGNVMDRDLNASINLLQAKKYTILT
>BA3078 conserved hypothetical protein
MGKYVPLKFLFNEELAEKMADSICKHDPTFSKRNFVSSVTCKVENLELKQ
RIEVMADELHNALQKDFNEAIHILLKTLGPGNTTEVGMFTNGYMHMPMAK
YVEKYGLNDFESSFNAMYEITKRNTAEYAIRPFLETYHEDTLNILQKWIH
DKNSHIRRLVSEGTRPRLPWAKKIGALKGDFKYNLQLLDPLMDDPSKYVQ
KSVANHINDITKEDKELVFQWLQQLRDKQHPVNPWIIKHGLRTVIKNDTL
PKDFYF
>BA2324 hypothetical protein
MAEIKVGLTQFLDFTLKSSAAKTNFVKNLKSQPEYHPVFDYWKQLRETVI
KFHKNKLSFDCFEALVQAVDQKKKQNYIDVLKQYKKFITNKDVSWFDPGK
SHWLSDNLIVRSSPELGLLINDEPHLIKLFFKGKKERIDKYNINSTLTLL
NESTFSNEHKDVNYTVLNIQKNRMYTNNSINNNHVIALESEAHQLCYLWN
KM
>BA2201 site-specific recombinase, resolvase family, putative
MEEKVVGYVRVSTEGQVREGYSLTYQVEEIERHCIENKLQLLHIYEDKGI
SGATVDEDGLTVEREGLQELLSDMTCHQVSKVIVLNTSRLWRSDMAKVLI
QRELKKHAVDVKAIEQPNYSIYTHDPNDFLVNGMLELLDQYQRLEIVLKL
SRGRKKKAEQGGYAGGGVMFGYRVKKGQKVLEVDTNKAIVVRRLFELRHF
CKHWSLTQLAERLNREGYCTEKGKRFTKVQVKRMLDRENFYRGVYTYGQI
QTNGKHPAII
>BA1584 conserved hypothetical protein
MGKVTLIATAAMGIEALVAREVRDLGYECQVENSKVTFEADEKAICRTNL
WLRTADRVKIKVGEFKATTFDELFEKTKALNWGDYIPENGEFPVIGKSLK
SELFSVSDCQRIVKKAVVEKLKTTYKRTTWFEEDGPLFRIEIAMLKDIAT
LTIDASGVGLHKRGYRVDQGEAPLKETLAASLIKLTNWKPDRPFVDPFCG
SGTIPIEAALIGQNIAPGFNRGFASDEWGWVGKQNWREARQEAEDLANYD
QPLQIIGSDIDHRMIRVAQDNAEEVGLGDLITFKQMQVKDFTTKEDYGYV
VTNPPYGERLSEKALVEQLYKEMGQVFRPLDTWSAYVLTSYEAFEKCYGK
DASKKRKLFNGFIRTDYYQYFGKRPPRNS
>BA0222 deoxyribonuclease, TatD family, putative
MKWIDSHIHVDQYKDEEKSRLLKDVENSKEIQGLIAVSMNYQSSKEILSL
AKRYSFVHPAVGFHPEQPIHKEECEKIYKLIEDHVEDIVAIGEVGLPYYL
RKEDEHIAINSYISVLQRFVELASKYDLPIVLHAVYEDADVVCDLLEKHK
VSRAHFHWFKGSETTMERMMRNGYYISITPDVLHKEKIRKIVSYYPLEYM
MVETDGPWGFQEGVMTHPGMIREVLKEISVIKNVAVDKVAETIYENTIQF
YLKG
>BA3969 site-specific recombinase, phage integrase family
MNVKKLLQLFVGYLQIERNYSKYTIASYQNDLEHFVQFMEREGISSFLDI
TYADVRLYLTTLHDEKLARKSVARKVSSLRSLYRFLMREGYRKDNPFALA
SLPKKELSIPKFLYAEELEELFEVSDTETPLGQRNQALLELMYATGIRVS
ECVNLQLTDIDFAVGTILVMGKGKKQRYIPFGSYAQDALITYIENGRKQL
AEKTEEQSHMVFLNAKGTPLTSRGVRYVLNELIKKASLTMRISPHILRHT
FATHMLDEGADLRTVQELLGHENLSTTQIYTHVSKERLRSVYMKHHPRA
>BA4366 DNA-damage-inducible protein P, putative
MREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERKGII
ITCSYEAREYGIRTTMPLWEAKRLCPQLIVRRPNFTLYREASFQMFQILS
RFTEKIQPVSIDEGYLDITDCYALGSPLEIAKMIQQALLTELQLPCSIGI
APNLFLAKTASDMKKPLGITVLRKRDIPEMIWPLPVGAMHGIGEKTAEKL
NDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDREVDPSQMGQH
KSVGNSMTFSKDMDEEKELLDMLQRLSKSVSKRLQKRTLVSYNIQIMIKY
HDRRTVTRSKQLKNAIWEERDIFQAASRLWKQHWDGDSVRLLGVTATEIE
WKTESVKQLDLFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLRKQE
KSFQQKLESKFM
>BA3732 mutT/nudix family protein
MERWIGCAAVCVNERNEVLMVLQGQKGEEKRWSVPSGGLEKGETLEECCI
REVWEETGYNVEVVSKIYEKEGITYGVPVNVHYYVVKKMGGSMKIQDPDE
LIHEIAWKGIEEIKQITLSFPEDYEILNEYINKKASV
>BA4622 helicase, putative
MFEEEKKFIKAQVLHTIFHNEENLYSVVSMKVIETNETYDEKKVMINGHF
PRMHEDEVFTLTGHFKDHPKYGKQYLVETFKKELPQTKAGMAQYLASDLF
KGIGKRTAEKIVAHLGEHAISKIMDDPEALNGVVNKQKAQEIYETIVEHQ
GLEKVMSFLNGYGFGTKLSIKIYQQYKEMTLEVIRNNPYQLIEEVDGIGF
GRADDIGRALGISGNHDDRVRAGCFYTLENVSLQLGHVYMRKDQLVRETM
SLLNNQEGRVTEEDIISCIEMMQSEGKVIIEEERVYLASLFYSEKGVVKS
IRRLMNQEETPSFPEAEVLKTLGEIEEQLNVQYAPLQQEAIQTALHKPMM
LLTGGPGTGKTTVIKGIVEMYASLHGLSLNPNEYSDDNPFPILLTAPTGR
AAKRMSESTGLPACTIHRLLGWTPEGSFQRNETDPVQGKLLIIDEFSMVD
IWLANQLFKSLPTNIQVIVVGDEDQLPSVGPGQVLKDLLNAGAVPTVKLT
EIYRQAEGSSVIQLAHAIKNGTLPPDLAQNQKDRSFIGCTGAQIVEVVKK
VCENAKTKGFSARDVQVLAPMYRGPAGINVLNEALQEVFNPKREKSKEIA
YGDVVYRRGDKVLQLVNQPESQVFNGDIGEIVSVFYAKENVEQQDMIIVS
FDGIEVTYTKPDLNQITHAYCCSIHKSQGSEFPIVIMPIVKSYNRMLRRN
LIYTGITRSKKFLIICGEEAAFQSGVNRLDDAMRQTTLASRLQESQGEVQ
MVTVNGEEMDVENISPYDFM
>BA3804 hypothetical protein
MLTIIGFNLSLIVIFLIIYFVLWSFASRNLFFNLTSFFMLAIVVAPIILS
ILNIVWIIKNDLSIWSYLIPVFYISIPIVKIFLEEKEQKRDDELYENYTP
QLRLVINSFFSELSIENDEEDIFIVKLDNKRNNKYVFKVIINLEDVQYNK
DFLVKKLNNILRDEIPDIGTEIYINVNQDMQKKVDLL
>BA2192 hypothetical protein
MGFRPEVGEKISLNKDVYRFEKHPAVIGIEMPYGQEGRQGTVYQLQHENG
MERIALKVFKERYREEKHQLAFLKPLSSIAGLKVCSRYIVTKEEHISAIE
KSEDLANSIVMPWVEGPTWADILQEQRMLSKEQCFFIAEAFLTTLKMMEE
NEVAHNDLSSSNVLIPFLSENPIEGQHYIELVDVEQMYGPKTKRPSLLPA
GSAGYAPMYLKSGVWQKEADRFAGAILLGEILSWCSEEVRNKKWTDASYF
KTEEMQKECERYTLLQQVLHNQWNGEIAKLFKQAWSSNSFAECPSFAQWY
DVFHSVRERIKIDAERQLAEEHSLFVSKCLEIARLLEERGFKQAALYEYK
IIFNSLNPSTALQKELAYIIQTMESQEPEINKKMVLQHYLELATELEREN
NAAFACFVYSRIVQFPNIDQALKQEIASIIEEIKEGQGTETQQEVAATIT
VPNSILQSRKKNEKTSGI
>BA1481 site-specific recombinase, phage integrase family
MKQAGQPIQNEKQLETIKNILLQSSKRDGLLFVLAVNSGLKVSEILQLKV
SDVIDENENVRHSILFYNEKVKKHKWFAVNEDLQHAIEDYMKERKTWKRN
EPLLKSQKGTKSITRQHAWYILNKAAKEVGLEGISSHTLRKTWGYYAYKS
GVDIAFLQHFFDHSTPSKTLKYVGIA
>BA2795 conserved hypothetical protein
METSNSEKYVFYREKYYCLYEYVAGSVLEIKDTEILKELGSTIGVEIANL
HQALNSVNNNNELVKRDLYKVVYGWALPILEKNEYVHQDVIRKMNQIHID
FKETVHSLRKQIIHRDMHLSNVIFKDNEFQGFIDFELLESNVRVFDLCYC
CTSILSELYSDEVLRGKWQHIISTIFEGYNKQNILTREELQSIWYVMLSI
QVIFITYFVQLKDLLKLNEEMFFWIFANKEAIEESIERIARK
>BA2705 endo/excinuclease amino terminal domain protein
MNLIKISIPEVDVTITERKQVIRGDEPRITPINGFIDFHLFPRDKGGIFM
FYNINDELLFVGKARKIRQRIKKHFEDNVSPIKNHRDEVYRIDACIVEDP
TEREIYETYIINEYKAKYNVDKVFYK
>BA2287 conserved hypothetical protein
MEYKTPFIAKKLGVSPKAVVRIAQQLNLTIEKNKYGHFIFTQDDLDQMLE
YHRSQIEQSQNTHPTQKTSSNDVEELKTQVNTIVQNISSHDFEQLAAQLN
TITRRLDRMEEQMQDKANDVVTYQLLQHRREMEEMLERIQKLEAGLKKEE
PIYITPDTKPTYEREKKPKRRKMIFSIFGL
>BA2196 hypothetical protein
MSATETKEKASPVKENTSSNEEKAITEKNEATSTLAEKKQKDAKTEGAKN
NEKTSNIDSSTQKAKVKSEPFSQEIDQIETYLNPANPQYEKALLLANQYV
DGATGEQKEKLEQQFSAAVEGLRGQIKDSKFTVEDAINKANLLANTARVE
KSVQDESHKLLMPLMFERKANDAANNGEYYTAVLNIANAIRTKHPLERSR
EELVDYANKLWTDTENEWNKMNGTTGWQSEVGKTLLPSYTLLAQLKDIDK
TIGQISGMNMIDNASKKMEGIQLVQLAGDEANKEGQTSLYNALHYYGQAA
SRGVVDKTGFSNVAGLVIKEAKQLEKTQYKSALNNYQILYKTPGIEELGI
KDGVKAAIEYLSPFEAAKVSGNKDTIEDLASAIELTYHSMELGYPEADAK
EWMNTVALKMFNKGAGYMSATDTNNAYKCFEFLSRDKYANAINGEITKQA
KENVEKIKSMNVKK
>BA5145 conserved hypothetical protein
MHPFVKALQEHFTANQNPEKAEPMARYMKNHFPFLGIQTPERRQLLKDVI
QIHTLPDPKDFQIIVRELWDLPEREFQAAALDMMQKYKKHINETHIPFLE
ELIVTKSWWDTVDSIVPTFLGNIFLKHPELISAYIPKWIASDNIWLQRAA
ILFQLKYKQKMDEELLFWVIGQLHSSKEFFIQKAIGWVLREYAKTKPDVV
WEYVQNNELAPLSKREAIKHIKENYGITNEKIGETLS
>BA0037 deoxyribonuclease, TatD family
MLFDTHSHLNAEQFEEDLQEVIARMKEAGVTYTVVVGFDEATIKKAIELA
EAYDFIYAAVGWHPVDAIDMTEEHLAWLEELASHPKVVALGEMGLDYHWD
KSPKEIQKEVFRKQITLAKKVKLPIIIHNRDATQDIVDILEEENAAEVGG
IMHCFSGSVEVAERCVDMNFLISLGGPVTFKNAKKPKEVATEIPLEKLLI
ETDCPYLTPHPFRGKRNEPSYVKLVAEEIANLKGISYEEVAEITTKNAKA
LFGVE
>BA4614 conserved hypothetical protein TIGR00250
MRILGLDVGTKTVGVAISDEMGWTAQGLETIKINEERGQFGFDRISELVK
QYDVDKIVVGLPKNMNGTIGPRGEACQQFAENLRELLQLDVVMWDERLST
MAAERLLISADVSRKKRKQVIDKMAAVVILQGFLDSK
>BA0032 conserved hypothetical protein
MEKNKHCFYVVECSDGSYYAGYTNHIEKRIGTHNSGRGAKYTRARLPVVL
KYVEFHEDKRTAMQAEYYFKQLNRKQKEEYMQKGERYVATKKLSTK
>BA0038 primase-related protein
MKIKEIIVVEGKDDTVAIKRAVDADTIETNGSAIGDHVIEQVKLALQKRG
VIIFTDPDYPGERIRKIISDKVPGCKHAFLPKEEALAKRKKGVGIEHASN
ESIRRALENIHEEMEAYTSEISWSDLVDAGLVGGEMAKSRRERMGKLLKI
GYTNAKQLHKRLQMFQVSKESFAEAYKQVIQEERK
>BA0934 replicative DNA helicase, putative
MSIQNVEAEKTVLGSLLLDGELIKECRLTEQYFSMPVHKSIFQLMRKMEG
EGQPIDLVTFTSRVDPNFLKGIGGMEYFIGLMDGVPTTSNFSYYEGLVRG
AWKMYQAGVLGHKMGERLIAEKSEKIIGETITALCELEEKDCVCEFDLKD
ALVDLYEELHQDAKEITGIETGYTSLNKMTCGLQEGDFVVLGARPSMGKT
AFALNVGLHAAKSGAAVGLFSLEMSSKQLLKRMASCVGEVSGGRLKNPKH
RFAIEDWERVSKAFAEIGELPLEIYDHAGVTVQDIWMQTRKLKRKHGDKK
ILIIVDYLQLITGDPKHKGNRFQEISEISRKLKLLARELNVCVVALSQLS
RSVESRQDKRPLLSDLRETGQIEQDADVIMLMYREDYYDKETMQKEMTEI
HVAKHRNGPVGSFKLRFMKEFGRFVEGG
>BA4121 prophage LambdaBa02, DNA replication protein
MATFRVNKDKNYTTINNTGLKDKRLSWKAKGILAYILTLPDDWFFYREEL
SRHAKDGLDSLRAGMKELKEYGYLKRFPVRDDNNKIIKWETIIYEVPQND
PVAEKPPVEKPPVENPELLSTKELNTKELSTDIQSSSSIFSFYENNFGIL
NSFIAENISQWVNDTSEELVQAAMERALKQQKKWNYAEGILKQWVNNNVK
TLKDVDALETEYQRNKGVKKRVGINRKSDDSDSEYIGL
>BA1484 transposase, IS605 family
MAKKKAVKVLRKQKKRETLRRFTQKQNIGRACLTAQEFRLLQRMSHSSKA
LRNVGLYTMKQSYLNYNKMATVKEVDTAMQADMNYWGIQSNSVQAIRRAL
FTEVKSFFKALEQWKKNPEKFTGRPKFPNYSHSTDKRIIEIYQVPKVDEN
GFWMIPMSVAFRKKFGSIKIRMPKNLRNKNISYIEIVPKQKGRFFEVHYT
YEMHVSQMKKQPMTTSNALGCDLGVDRLVSCVTNTGDAFLIDGKKLKSIN
QYFNKTIRNLQQKNVENGLSKRIVTNKIAALWHKRERQIHGYIAQTVGLL
FKKVKEFDIDTIVVGCNAGWKQNSHMGKKNNQKFVQIPFHKLIAAIENKC
IKEGIRFFKQEESYTSKASFLDKDPVPVWSKDDRTQYRFSGKRITRGLYQ
SKAGTCIHADINGALNTLQKSQVVELDGNLKVKTPILLEVQKRKAVASRI
A
>BA4806 conserved hypothetical protein
MSFNLRGAVLANVSGNTQDQLQETIVDAIQSGEEKMLPGLGVLFEVIWKN
ADENEKHEMLETLEQGLKK
>BA4082 prophage LambdaBa02, tape measure protein, putative
MAGDMEIGARVTLDTQRFENGVAGINRGLRLLDSEFNLTSERARLLGNSV
EQLQNKLAHLNEKFTLQGQKVEHYRQKIEQARQKQEQLQASNLTLAASME
RLETQYNQAVQNFGRNSQEAKQLKQELKQLQAEYTANGQALQRLNTQIDN
NTIAMNRAETAQARIQNEIRETNRELAEQQNRLHRTGERMRDTGNKMQDV
GGQVGTTFAAMTGVIGAGLAMAVKESMNFEQKMADIQAVSGATGEEMKQI
GDLAVTMGEKTKYSSVEAGQGIEELIKAGVSLTEIINGGLEGALNLATAG
ELELGEAAEIASTALNAFKADHLSVADAANILSGAANASATDVRELKYGL
SASSAVAAGAGMTFKDTATALAVFAQNGLKGSDAGTSLKTMLMRLNPSTK
EAYNKMRDLGLITYNAQAGFDFLVKNGIQPASRNVGDIEVALEKYVMKTE
GVTKWNDKCDATFRELATSSAFLSSKFYDQQGHIQGLDKISGLLNESMKD
LTDQQRSMALETLFGSDAVRGATILYKEGADGVNKMYGEMSKVTALEVAE
TKMNTTKGKIEQLSGAVDTLKKSFGDALLPILVDVVEGVQGVVDWFNNLD
ASTQQMIAKSSLLAFGIAGVTTAVGFLAMGIGALLANPVALAITGAVLAV
GALGIAIVDLNEKSKQAQNDMDKFGQRVSEATSKAAGAYMDLKDKAINNM
MDLKLKTGEEANKAADETIKAFQRMTNEVIKELEGKKSEFNKMFSQLMGA
VPESAKQTLEQVKNNVIESINKEIEVATQAEKILEEGIKRYQGDTLKMPK
DFAQKFEQALQVADKNVQQFYTKAKEITSISKEIEAGGMLSLDAGKKRFE
SIIKVYEDGVKSLDKQTKGWRENVEKAFKLGEIKPEERKATLDAIALYES
KHVNDLQSIRNDGFKVLQQHMKEEDAEVLASQAKRIEAEDKGWGHALKPH
MDFEKNQLI
>BA0869 methylpurine-DNA glycosylase family protein
MQAPPSFYEGDTLEVAKKLLGQKLVHIVNGIKRSGIIVEVEAYKGPDDKA
AHSYGGRRTDRTEVMFGAPGHAYVYLIYGMYHCFNVITAPVGTPQGVLIR
ALEPVDGIEEIKLARYNKTDITKAQYKNLTNGPGKLCRALGITLEERGVS
LQSDTLHIELVPEEKHISSQYKITAGPRINIDYAEEAVHYPWRFYYEGHP
FVSKK
>BA3370 ribonuclease
MKFRNTKLAMVTLSSFFILGSASLSFTNIIQGEVASASPQEITVKSYDDT
YYNNAIGKTGLELKKELHNIIDDHTKLSYSAVWEALRDTDEDPNNKNNVI
LLYTGRSQGKLTNGSGVDNWNREHVWAKSHGDFGTTAGPGTDLHHLRATD
VSVNSSRGNLDFDNGGVNHSEATECKYDSDSWEPRDSVKGDIARMLFYMA
VRYEGDNGEIDLELNEKVNNNKDPYMGKLSVLLKWNEQDPVDDLERKRNE
VIFTKYQHNRNPFIDHPEWVNKIWN
>BA5043 mutT/nudix family protein
MYKFKDYYHNTVQLSFERYPFSPEPKHVWVVCRYGDQWLLTHHLRRGLEF
PGGKVELGETPEEAAVREVHEETGGIVSDLTYLGQYKVSGKDKIIIKNIY
FATISAVEQHTHYEETKGSVLLTDIPDNIKTDRKFSFIMRDDVLARTMKH
IEEIGCFTK
>BA0880 hypothetical protein
MYKQLPHGVKIGITRSIVVSFEKYMKEIEWNEEKFDMQQFVEQWKQYLYT
KSTWINKVDDKLKGHPDFHQALAMKVNEKINELINKKPSEEQVEQLKRNK
VKHADEMCKLEAEYHIERLLVTK
>BA5363 prophage LambdaBa03, site-specific recombinase, phage integrase family
MLKDLLKEFVFDLKIKNYSDRTIDTYNYNVGQLITYLNEHHEVNDVEDVA
TFHIKKFVQHQISLGKKANYINTLIKSLRSFYKYLVVEEYVSVNIMNKIK
LLKEDVEVIKTFTDDEVAKMIEAYDFKTYLNARNKVIIAMFVDTGMRMSE
LINLQSSWIYDTSLRIKGKGSKWRHVPMSLMLKKYMIRYERIKAKYFEKK
ALEHDNYFLSRAGKPICTVQIENIVKIAGLRAGVREDIRCSPHTVRHYAI
QSNLRNGLDLYSCSKIAGHENIQVTKRYLQGLEMENILEMAQKTSPLMNL
R
>BA2359 exonuclease SbcD, putative
MKLFHTADWHLGKLVHGVYMTEDQKIVLDQFVQAVEEEKPDAVIIAGDLY
DRAIPPTEAVDLLNDVLQKIVIDLQTPVIAVAGNHDSPDRIHFGSSLMKK
QGLFIVGQFQFPYEPIILNDEHGEVHFHLVPYADPSIVRHVLKNEDVRSH
DDAMRIFMNELSETMDKEARHVFVGHAFVTSAGEAEENTSDAERPLSIGG
AEYVNSHYFDKFHYTALGHLHQAHFVRNETIRYSGSPLAYSISEEKHKKG
YYIVELNEQGEATIEKRLLTPRRKMRTVEAKIDELLQHPVSEDYVFVKLL
DENPVLQPMEKIRSVYPNAMHVERSIQRREFTESNEATVSRHKTDDLSLL
KAFYKEMKGLDLSEEKERLFVEVLQTVQEREGERG
>BA5486 conserved hypothetical protein
MLQHSITKDEIMMIANEFVQGLDPQQTADQEHVATARHLYRSGVVYNVDF
DGYTLSGTVDAEGSVYSVHIPIRNVAESYCDCFAPTQCEHMLAVLLSAAS
SFGQVGDVLTLFKNNTKPSLPPIRTARQVLQSSAFEETDYKSWQSYFDNE
YESFKKEQARLTYKQMYFLMSIFTDFYTKLERKAPRIVVIHELFRLHAAL
YCFQKLLEEIQEFETNKTYSYHQPVNVVRIFVDKVESIVRDLQSEATSSE
SEAILQETARLVHEVFFSTDAYTQERFFIYRHIWSELLHNKEQIREEEKR
IDTKMNPLSKALASSHLLFLNDEDLLAMDLLKKHPASVVSLYFYWLEELL
NAMKWDRAKSWLSFTYKQVKTTIQEQENTIFIKDIVRLFVIMYETYATHT
NEQAGLEMILQELLPYSFANYEQYVLAKKQYRTWTELQLLHGFEAIELLK
EPLKDIEKEAPEAALPLYHLAATEAIEERNRKAYRRAVRYLKKLRTLYKR
LKRTDEWDAFIIHIANLHSRLRALQEELRKGKLIDDQSN
>BA3547 MutS family protein, putative
MNTMTFEKLQYNELKDIVKFYCVSGLGKELINKLEPSTSIKVVRNRLNET
TEARAILDAEGHVPFFGISNIASTIQKLEKGMILDPEELVSVSDFLRGCR
KIKKFMLDKEFFAPVLASYANSMTEYKSIEEEINFSIKGNSIDSAASKEL
KRIRNNIDSVDGKIKERLTKFLNSSANKKYIQEFFISKKDDRYTIPIKSS
YKNQVAGSIVEASAKGSTVFIEPHTVTKLNAELASLKAEEAMEEYQILAT
LSGMVVENIYHIKINMELISQYDMVFAKAKFSKSIDGIEPKLNDHGHIHL
VNCKHPLLSGKVVPLNFEIGQNYRSLIITGPNAGGKTIVLKTIGLLTLAT
MSGLHIAGDKETEIAIFENVFVDIGDNQSIENALSTFSSHMKNLSEIMRM
SNNNTLLLFDEIGSGTEPNEGAALAISILEEFYLAGCITVASTHYGEIKR
FSEMHDDFMNAAMQFNSETLEPLYKLVIGKSGESNALWIANKMNVRERVL
KRAKAYMGNKEYTLEKVNESKIRKPKFLQEKRENHYEYKIGDRVNLLDHD
DFGIIYKEKDNFYNVVVYYNGEFIEVNVKRITLEVAAKELYPEGYDLNTL
FVDYKERKMQHDIERGSKKALRKIQKEMRKNRG
>BA2312 conserved domain protein
MLALQEHFQKISEEKERYGDGYNNFDLVCPTYNGNPCNFRSLTQLWKKLI
KKCGVPDIRFHDLRHTHATLMLKQGIHPKIVSERLGHKKVGITLDTYSHV
VPGLQEKAVEDFANNLFQKH
>BA3180 deoxyribodipyrimidine photolyase family protein
MFQKDFRLYDNPALFEAAQSGEVVPVYVHDETFSMGSASKWWLHHAIIDV
KKQLEALGSTLIIRKGSTQEEILSLVEQLGITAVYWNICYDPDRLQSNQK
MKMMLEHKGMICKEFNSHLLLEPWVIKKKDNTEYKVFTPFYNAFQKQVIH
KPISKVQSIKGGNSLPVSLSVSELHLLPTIPWTSHMESIWEPTEEGAYKT
WKEFFSSKLASYSEGRDFPNQNAHSMLAPYLSFGQISVKLIYHYLINKST
ESQCSLFEKQVNSFIRQLIWREFSYYLLYHYPFTAYKPLNKSFEHFPWNN
EEELLRVWQKGDTGYPFIDAGMRELWQTGFMHNRTRMAVASFLVKHLLIP
WQEGAKWFMDTLLDADIANNTMGWQWVAGSGADASPYFRIFNPITQGEKF
DKNGEYIRKWVPELKDMPNKYIHKPWEAPEHILQKANIQLGHTYPLPVVD
HKAARERALCAYKSMKEFV
>BA4316 mutT/nudix family protein
MSNLAERTVKTEPIFDGRVIKVRVDDVVLPNGAMSKREIVNHPGAVAIIA
ITDEGKIVLVEQYRKALEKAIIEIPAGKLEPGEKPEVTAVRELEEETGYV
CENMELITSFYTSPGFADEILYVYKATGLTKKENKAALDEDEFVELMEVS
LEEATTLMKDLRIHDAKTMFAVQYLQLQK
>BA2926 mutT/nudix family protein
MNNATNFHRAFGVYGICIENNHILVIDKMKGPYRNRYDLPGGSLEDGEAL
LAGLHREIKEETGLNVTVVKQIGTIDFQFPSKFKEYTHVHHIAVFYGVER
CGGEFEVPEQFEEQDSSGARWIPIESITERNSSPLVCSAVEWLKSNELPL
EVKKYETWTVKNSF
>BA2713 mutT/nudix family protein
MRAPYQVLIFPYIKTDDSIQYAIFNRSDYGYWQGIAGGGEDGEIPIESAK
REAFEEAGITRECPYIQLDSVSSLPVEDVVGGFLWGDEVYVIKEFSFGVK
VPNKHISLSKEHLHYKWLCFEEAVKCLKWDSNKTALWELNKRLLK
>BA2360 exonuclease, putative
MRPIQLIMTAFGPYKQKEVIDFDDLGEHRIFAISGNTGAGKTTIFDAICY
VLYGEASGEERSDTSMLRSQFADDNVYTSVELTFQLKGKRYEIKRQLGHK
KQGNKTITGHAVELYEVIDEEKVPAVDRFHVTDVNKKVEDLIGLSKHQFS
QIVMLPQGEFRKLLTSETENKEEILRRIFKTDRYKLMRELLDQKRKQWKD
VLQEKQKERELYFRNVFKLPIRDGAILETLVEQEHVNTHQVVEALEQETA
VYKAEVEQLQVEQDVQTKQLKDAETRFHAAKSVNEKFIDLQQKNEKYNTL
QENRTVIEMKETSFKRAEQAKRLLPFEQWHEEAMQNEQKAESLLKQIIAK
KENIMNNFELAQEKYEVVKNKESERENVKKLVQRLEELQPIIASLAEKQL
NLQNAEIQIGKLKESMQNLDRQLEEHTNQKQLMTGELQQLEQALEQYVDK
VEELTNMREDAKVLKQAYDVWQEKQKFEKEKEAAYSKMQLAVNAYENMER
RWLSEQAGILALHLHDGESCPVCGSTTHPKKATEQSGAIDENELNGLRDK
KNIAEKLHVQLEEKWNFYHHQYEQVIEEVKKRGYQSEELVETYSALVQKG
KQLATEVNTLKASEETRKQIAVKIKSVEEKVDALQKQKREVETEQHRIEM
DCMQLRTSYEHDKKNIPENLQTVQAWKVQFDQAMHELKLMEDEWKKVQEA
YQHWQNENIRIQAEQEGATNQFESAKLKKEETFTRFMKELEQSGFTDQST
YKEAKLSDAEMELIQKEIQSYYSFLEVLAKQIEELHVELKDKEYMDITAL
GEHIKELEINLDIIKEKRQRAQNAVTYISDLHENIRRIDEQIHEEEKAFQ
ELVDLYEVMKGDNESRISFERYILIEYLEQIVQIANERLRKLSNGQFYLK
RSERVEKRNRQSGLGLDVYDAYTGQTRDVKTLSGGEKFNASLCLALGMAD
VIQAYEGGISIETMFIDEGFGSLDEESLTKAVDTLIDLQKSGRFIGVISH
VQELKNAMPAVLEVTKQKDGCSQTRFVVK
>BA4065 prophage LambdaBa02, lipoprotein, putative
MFKKILILIFTALCTLTLSACSSDSSSASKNTTKNEAKEIPTEVRSFVTN
FNGILDENGSSTNTEKLSKDLKVVKEKGYQDKTLHVVSLIENQNIKDDGT
VELGNMKNFRVILDDKQKVIQKITYVGDDPNVFLFSLESLKIDATKEVKD
MLEDIRIKINRKDPKVESVAYTSSYKISFTYDPNSPTALLNFSFEKNN
>BA3044 mutT/nudix family protein
MDLTFKVEKTCFNYRGGVICKHDNKILILQGDSEDFWYVPGGRVKMLENS
EDALKRELAEELAVPIEGKRLIWSVENFFTLSEQKFHGISFYYEVELHEL
PANGADQYILEEEGRTYLFKWVPVEELHAYNLQPAFIKEKVKDVSVHTEH
TVLQK
>BA5702 conserved hypothetical protein
MENINKLLETLHLEKNITLEDIPNVDLYVDQVVQLFENTYADTTRTDDEK
VLTKTMINNYAKGKLFIPIKNKKYSKEHMILISLIYQLKGALSINDIKSS
LEHINESLLSDDSFELNMLYKDYLTITENNVESFKQDINNRVSEVSEISS
LEDSKLEKFLLLNSLVNMSNMYRRLAEKLVDDLKGS
>BA0183 lipoprotein, putative
MKKALGIAVMGSMILLAGCNGSKDTKEPKEKVEQSTEQTEEMKEYRAVHE
KYDLKMNKEINNALQLFEVAKEKGGKEITTATYKEDVQKVTTSMLEDIDH
IRKEIRVPKSKEQEHEVYVGFLNESEQAMKKLQKLAKEDSSLIRDIEINF
ATASTYYKRFQAETKK
>BA5385 mutT/nudix family protein
MQRVTNCVLIRDNEVLLLQKPRRNWWVAPGGKMERGETVRDSVVREYREE
TGIYLKNPALKGVFTFVIQEGDKVVSEWMMFSFLATDFAGENKLESEEGI
IGWHTFDKIDDLAMAPGDYHIIDYLIKGNGIIYGTFVYTPDFELLSYRLD
PS
>BA0927 conserved hypothetical protein
MLEQLIIELAKSMKQAEDINGDKNYLIINKDDEGLDVEVKISCGQYEDEV
SYFKVSFELLKNTLEKFMAMRTVKSENFGQTSECNAFLLAFFSQLPFVNV
MESGAITFKEFQTDNLPSEKYDKVMLFLEEIMNGTYNPSMLRQQTDENLY
RMKSNARQDLRLLGFLNESHEMNQLLLNEYVQSEDKNVYIAQLILKQEYF
RHVLFILGLLEKYSKDEKKEALVGLGMTIVQNSLGDNLMVESVAKRRTGN
LLDWLEQVGLINDEWIPVEQYVKDDGEKGGSMNSNLREKFLTVMNEYLQA
RTERFAGHKMGSVVRNEMTTEITRLPFIDHSQYVVTGSVGQGNWAAVPWL
AIMNKDITTSTQRGYYIVYLFSEDMERLYLTLAQGVTETTKEEMQKIKEE
IREQIHMSQKVKKDDDIFLGTSPKAKGYANSTAAYIAYDVNKMPSEKELV
EDLEEMLRYYEGFIAYKEEGTKYEMIYERKEVYLDQQSIIDHVSSYIQSK
GFFYEKKDLVNFFLSLKTKPFVILSGISGTGKTKIVQWFAESLGATEENG
QFTLIPVRPDWSDSSDLLGYVNLQGEFQERPLIKVLENADANPNRPYFVV
LDEMNLARVEYYFSDFLSVIESRKWKDGKIVTLPVLPESIANKHITIPSN
VYIIGTVNMDETTHPLSKKVLDRANTIEFNTVNLDYFNFLMDVEEKEAEI
ASNRSLETEYLHLKECFKDNEDLVRNISTILIEINKILESVGAQVGYRIR
DEICFYMAYNEQGKLFSFDEALDYQIYQKILPRLAGSDGRTEEVLKQLYV
LCANEEYDSGNNDASYAKYPRSANKLSHMLRRFEYDGFTSFWI
>BA2572 excinuclease ABC, A subunit-related protein
MKVNQLIANNINKLVTDIPFNKSFGVAGLSGSGKTTFCQTIGEESKKRLI
SLLPKAEYQYLFPNIMETNFSAIKMEDMPLVLFLGKSSISSNPRSTIGTH
TGVFTEIREKLADVFYLSPEVFSFNNQLGWCSGCKGRGTTKNIECKKCKG
KRYSEEIEQHTIDLFAKPHTISNINDLSVESILSLAEELNISEAKQHILQ
NIINMNIGYLTLNRIMGTLSGGELTRLYLAEFMAVSENAVIIIDEISVGL
DHETLLQILAEIKQLGCKNQIWLIDHSDTVLDTTDKQLFFGPGSGKYGGQ
IVKESPRPKPILWDLNKEFPTEYYTFRELYCRNIQMAEFQIPKNRLVTVT
GESGCGKSTLVNECLATDFLKRYPKDRLVMVGQDRNQSITSRSTVATFLD
IKKKLTKYSEEIDDIFERSIEDIIDELPNEDIAYKRLSLLIKLGLGYLTL
ERKTQTLSTGEFQCVHLVSELFANTRNPHTLFIFDEPSKGLSQNILNQFI
DSVRGILQDESVSIIMIEHNSYMLESSDYIVDFGKRQIEAINHLEVVSHD
DYYRQRTSVNNVEQIHITSALKPKEGVHYLEGNHINYFKNAENVYKGGIL
KSLSSMARLIYGEYESNTIAPVIAIDLEKHLYSQYSFLYEVGGIINHIVA
AHPTNKDTRSFDFYSQDNHCPSCSGRLQIEVFDKDITIQNKNVPFWDGLF
NPEIMKVLKFYQYEKIEFLFEEIKNELGHDLSKSYNDMSEEKHTFWYGYF
EKSFYDKKGKTRRTWVGFNTIIGGYIVISKAPIKEEIKASKEMVTCPICE
GTVLNHHKPLIFGNSDIREIINQQVDEVLKLVGDLPELHKLKSIVGGDMR
LTEDVSLLPRKAQVALKMFELEQASFSNYEMVLQNVLPFWDEIKGNIESI
SVNNQVTVCDFPNVYETRENIIDQYFTNGKYKKLTYVYEAFGYKKLVTQI
NKIKKSNPCPFCKGKKVITEDNLHDGVFKLTIPCVICNASGINDEGLKEV
VEGVDVKTWLTGKVSDVVDENLLTEAVAQIPIFNRIRELDKRDMMAVYEC
LERNQ
>BA5703 ATP-dependent RNA helicase, DEAD/DEAH box family
MSKKSFSNYALSKEVRRALTGLGYEHPTEVQGEVIPVALQKKDLVVKSQT
GSGKTASFGIPLCEMVEWEENKPQALVLTPTRELAVQVKEDITNIGRFKR
IKAAAIYGKSPFARQKLELKQKTHIVVGTPGRVLDHIEKGTLSLERLKYL
VIDEADEMLNMGFIDQVEAIIDELPTKRMTMLFSATLPEDVERLSRTYMN
APTHIEIKAAGITTDKIEHTLFEVREEEKLSLLKDVTTIENPDSCIIFCR
TQENVDHVYRQLDRVNYPCDKIHGGMVQEDRFGVMDDFRKGKFRYLVATD
VAARGIDIDNITHVINYDIPLEKESYVHRTGRTGRAGNSGKAITFITPYE
DRFLEEIEAYIGFAIPKANAPSKEEVMKGKAAFEEKIQAKPTIKKDKSAD
INKGIMKLYFNGGKKKKIRAVDFVGTIAKIQGVSAEDIGIITIQDNVSYV
EILNGKGPLVLKVMRNTTIKGKQLKVHEANK
>BA3672 DNA polymerase III, epsilon subunit, putative
MGNISLPLDYVVIDFETTGFNPYNDKIIQVAAVKYRNHELVDQFVSYVNP
KRPIPDRIMSLTGITNYRVSDAPTIEEVLPLFLAFLHTNVIVAHNASFDM
RFLKSNVNMLGLPEPKNKVIDTVFLAKKYMKHAPNHKLETLKRMLGIRLS
SHNAFDDCITCAAVYQKCASIEEEAKRKSNTEVLDETAVYEAVKEILVRN
KRDIEWIRCMNVGSYLDIKAFYPVMRLKVKGRKKYVLTDILEDDVKEICT
SLACEPALKSEVGNTRIMLNCLEDVLKLESYILEQYDFVLQALHDYKQSE
MNADEKLKEYLNIMV
>BA1816 mutT/nudix family protein
MSNNWKDIEHRIYTMCMIQRKNEVLLIQRPDHLGFPGYIAPGGKVDFPES
IVQAAKREVKEETGLLVSNLTFKGLDEYVNPKENVRYMVFNYWTDSFEGE
LLMNPPEGELLWVPIDTALHLPMQNWFKERFPLFFEKGTFGTET
>BA4095 conserved hypothetical protein
MDKGLNERKPPTHLKKVGKDTWIRIWSVLEGEGKADINDPIVVEAIAFSY
QMFREMAANVKKEGLTMEYTNKAGATNLTKHTLIPEIPKYLQQIRQYLGE
LGLTGASRKKLQEELTGDSDDDYDNF
>BA2071 mutT/nudix family protein
MTVLYNKKVHAYVTREKEGVMQLLVFKHRDIPEAGIQIPGGTVEEGETLE
AAILREVQEETGLRHLCIERFLADYIIHVKEKKEYQKRHFFHVTLLTDVK
DSWEHIVSAGKEDEGLVFCYDWVDIAKCPELAGKQGEFLHLLDEVYVK
>BA2322 hypothetical protein
MSVHNMGGIRNANRILGDLKPYVNKTMQGKEYVYYLNKEGHAMFGDDGKV
VSRGKLAHALLRNEAWLHLFCPDDWQIETDIRYIKNKEKMKIVPDVKFRD
EENILHAVEVDRSQKMKVNEEKIKKYEEFTQIYKQKHNGKMPVIHFFTVT
KYREKKLEELAAKYDVLASVYVIEEI
>BA1997 mutT/nudix family protein
MANYIKELREKVGHDYVFLNFAGGCVFNKEGEVLLQKRGDFNAWGFPGGA
MEIGESAAETAIREIKEETGYDVEINELIGVYTKYFQSYPNGDKAQSIMM
CFSCSIVGGDKKVDGDETLDLKFFPLDDMPPLFCKQHEDCLQDLLEKRVG
VYR
>BA2307 protein kinase domain protein
MKWRRILALFDRPLRKNTIVAERYKIESVIGMGSYGVTYVVNDLQINRYK
VLKQLRQSKQRYVSGRKSFEQEKMILQTLNHHAIPSLYDHFVWEKKSFFV
MEYMPGKNFEDYIFLDGHVYTEREVLKILYEILEIVSVFHSKDIIHRDLR
IPNILMKENQISIIDFGLAKWKGEGDERATTYEGEQALMREVHFRSDFYA
LGHFSLFLLYAGYESNEKYEKPWYDELTLENYNREMLMRMLQMKTPYYEN
VRDLKKDVAFALERMEVPCFKSF
>BA3418 lipoprotein, putative
MELRDVIKQKLAVLLFTGIILSGCSTIAENSLETGNGSMELLTPTITTKD
NELEIKTEGIDENKVTFIYVANKKVLEQKLKNGESYKLNIKDIEHAHRTD
YKPKVQLLQTKDDNDDGEIVTFKQVRYAVKN
>BA3091 transposase, IS605 family, OrfB
MKNKISNQEWKDLRTRQLYSRGDKSKKGNLNMRITVDDCGQGWLEIANPL
GRTNGKTKSPRIKVPIMIPYRFYHQITNVVMGKQIGVNPKGKPIIEHQKY
SVEIIRKQNEFYINITFDETEIGRVLDFKETPQSDVIAGIDVNPDRIAVS
LCTKQGNFKGSKIFYLHNLNTFSTNKRATIIGQIVQQIKTRLLENNVGGI
VLEDLKFQQSHDTDKYSNRNFHQFTYKKMLNSLIRMALRNGFSVKTVNPA
YTSVIGKLKYSKNFGISVHEAAAFTIARRGLELQEQLPQEIILLLKNQIT
TKLRILVASMEESKKNTQKVYKKWLQTIQTWKEYHNWKLWSILHKTVYMN
NQQVVFKI
>BA0764 conserved domain protein
MQKSTPDGLKQKMKEYNIGLRQLTVLTIISQKSNTPLDDVLKMKKDEMDM
KQIAEKLNVKKEDIRAEMIKLVKSIKEKKTN
>BA4548 conserved hypothetical protein
MSDIHKKIKKKQFAPLYLLYGTEAFFINETIKLITTEALEEEDREFNVVT
YDLEEAYLEDVVEDARTLPFFGERKVLLIKSPLFLTSQKEKLEQNIKILE
EYIGEPSPFSILVFVAPYEKLDERKKITKLLKKTADIVEANAMQVQDVQK
WIVARAEEGHVHIDNAAVSLLLELVGSNVTMLAKEMDKLTLYVGMGGEIT
PKLVAELVPKSVEQNVFALTEKVVKKDIAGAMQILDGLFTQQEEPIKLLA
LLVSQFRLLHQVKELQQRGYGQNQIASHIGVHPYRVKLAMNQTKFFSFEE
LKKVIIELAEADYSMKTGKMDKKLVLEFFLMRLNHM
>BA1586 conserved hypothetical protein
MFTEKRLPFEVGKQDNFYDKLNEWIGDVFYDILPEKGFEERDEQIFMAFQ
LERAFQEKKVMFAEAGVGNGKTIVYLLYAICYARYTGKPAIIACADETLI
EQLVKEEGDIAKLSEALGLSVDVRLAKSMDNYLCLRKLEDVMSGRAPEVI
EDVYYELPQFVFDHGTMQNFTHYGDRKEFPLLNDEEWSKVNWDYFQDCFT
CDSRHRCGQTLSREHYRKAADLIICSQDFYMDHIWTYDARKREGQIPLLP
ESSCVVFDEGHLVEYAAQKALTYRLKQTMMEQLLTRLLQNDIREEFAHLV
EETIWQTERFFDVLQENKKEIAGSDRLEITVTEKVTAEAKRLYAKIGEVG
DALVFESEMHTVNTYDLNIVDEHLDVLEHSLRLFMHEKNVITWGEEGDGA
FTLVIMPRAVEEVLQEKVFSKKIPYIFSSATLSNNDSFSFTANSLGVKDY
LSFSVASPFDYEEQMAVNLLSHTKENEWERKCQYTLENIQKTNGRTLVLF
RTTQELAAFKEYVSKEQMSVPFLYEGDQEISQLVSRFQNEEETVLCAVHL
WEGLDIPGSSLSHVIIWSLPFPPNDPVFEAKRKHVNDPFWDVDVPYMILR
LRQGIGRLIRTSDDKGAISIFLSDTEDEKVVEAVKNVLPVEGKEL
>BA5457 transposase, IS605 family, OrfB
MYETHIKNGTLPPIIFGGRKNFYERMKDKISNQEWKDLRTRQLYSRGDKS
KKGNLNMRITVDDCGQGWLEIANPLGRTNGKTKSPRIKVPIMIPYRFYHQ
ITNVVMGKQIGVNPKGKPIIEHQKYSVEIIRKQNEFYVNITFDETEIGRV
LDFKETPQSDVIAGIDVNPDRIAVSLCTKQGNFKGSKIFYLHNLNAFLTN
KRATIIGQIVQQIKTWLLENNVGGIVLEDLKFQQSHDTDKYSNRNFHQFT
YKKMLNSLIRMSLRNGFSVKTVNPAYTSVIGKLKYSQNFGISVHEAAAFT
IARRGLELQE
>BA2203 transposase, IS110 family, OrfB
MKYIERELKQLVNLLDYQLETMPGIELVTASALIAEIGDVRRFPNANKLA
RFAGIAPVYFGSGGKGKTHKSKQGNRALHALFYNLAVQQVQVAKGSKMPR
NPVFHAYYQKKLKEGKTKGQALVCIMRRLVNIVYGMMKYKTAYELPIVEE
KEVV
>BA2316 hypothetical protein
MKQKISNVTGEIILSYKENGYQVVLDEFKHAKCINIVTYNINTYESNSVL
IKELRKLNKTTKINIILNIPNGSYVKNFKNRIDQKEFDKVKWKIKNALSV
LEQEKFGNLEVYINLENHAKLIMTDHIAYIGSQNFSDASEGNFELGFLVK
DSKVIRDIERNIFAEIKNKSIYCIISEYRATMEEISVKLANKLQNIREDI
LTWVGDPPFTFRQEVFFIDDAYFHKERWEEFKEFHSEFEVITEKLIDEYP
SEFNKESARETVKHLRKLVKLLVSELDELAKFKTNQEESMMWDKFHQLDV
GENMEEALEDARYYVENYKEKNYREIEYKGKELIKTFDYIKESIQDIETI
VDEIKDSMIRKALNQNIERILQDIKKQ
>BA1692 conserved hypothetical protein
MYHHTAINVLSLLQNMSNNKMNDMQLEAEFKKIEKQFQVKYEELVDLYNR
MVLFQIDIEKHGGMRAYEKSAITWLKSELELLYEVYQFSQRHGLNIINIS
KYVSKKELNLFPKTESQLQNTYYKLKKCEIPFENIEKQKPGRKRKYTPVK
EPIVEIKKENKQEFRNEVPNTENEKNLVTVISGIVDNFETISQCSERKEH
ELHQFMEGIYKLSSMAAERSKDEKNARGLEGELHVLRAENEKLKREKEEL
VHDIKEMTHHLIHFITSSDIDQIRTLPFFVKDCKQDLHKLGLYNAQDGKM
KIMVDRSGQVMTVTQ
>BA2689 intein homing endonuclease-related protein
MQIERKKKSKCKLSKPEIIHLYAEGKGTSEIAMLANVSARYIRMVLSDSN
VPRRAIGSWKRKYDITEDYFKTWSNNMAYILGFIAADGVIQKENQCVSVS
QKESYILEDIKNELKTTQPLYQNKKTGVYMLNINSKTIKDDLMNIHGIKP
CKSFNIEFPFVPEEYLHHFVRGYFDGDGYVNYETYTVSFVGGSYNFMNSL
HQILQNRNLRADSLNQNKHYRVILSGRKSIQLFSNWIYKDKDIYLHRKYE
VFQKESLSLDQLQDRKLKQTQTAVKQRKQNFLKEYMKNKCSATACSNLKI
SESTFKRWLKNDNQFKKDYERIHSL
>BA1274 conserved hypothetical protein
MNKKTIFFLLTCLLLVASTTYIICNKREQVPPMLVWEGQEYYVTNEPAKA
EEVGQRLGEVTKKIETSEKPIKNSESNIVQEKTEVFTMIEEEKGPHSPLI
IKEPDGEEYRIVRAMLKVL
>BA2189 conserved domain protein
MEEIIVTYIKQSNEEIDIAIPNDVKAVQIIEAILHHEKIDEALSSTYEIR
VAKDKEEWHSIRNDETLRDNDVWDGQYVMLYKKGSIIPSFEMLPAEEVYN
EPVKQDISTSEEDYVWKIIE
>BA0542 mutT/nudix family protein
MEYSLLLGGLHVEHKTPKHIVAVAGYLTNEKDEVLLAKVHWRADTWELPG
GQVEEGEALDQAVCREIKEETGLTVKPIGITGVYYNASMNILAVVFKVAY
VSGEIKIQHEEIQEAKFVALNEENIDEYITRPHMKSRTLDAMRSSHFIPY
ETWEVQPYNLIGRL
>BA0427 prophage LambdaBa04, site-specific recombinase, phage integrase family
MKGYFRKRGEKWSFTIDIGKDPITGKRKQKTASGFKTKKEAERACNELIH
QFNTGSLVDDKNFTLSEYLQEWLENTAKQRVRETTFTNYKRAINSRIIPV
LGSHKLKDLKPLHGQRFVKSLIDEGLSPAYIEYIFIVLKGSLEDAVRWEL
LFKNPFQHVEIPRPRKVVNSTWSIEETKKFLNRTKFENVIYYHLFLLALN
TGMRRGEILGLKWKNFDLNEGKISVTETLIYDENGFRFTEPKTHGSKRLI
SIDQNLCKEFKSYKAKQNEFKLLFGQSYEDNDLVFAKETGQPILPRTMTT
TFNQFIKKADVPQIRFHDLRHTHATILLKLGINPKIVSERLGHSSIKTTL
DTYSHVTIDMQESAVLKLSEALKS
>BA1552 uracil-DNA glycosylase family protein
MKYPDHLVKQVKERSAPYQLEGFLSGQGPENPKFMLLGEAPGETEIHNGI
PFSGRAGKQLMGFLERIHVTREEVYITSAVRSRPYKWREKKERSGSIIQK
KYNRTPNQGEIVAHAPLLDYELEKLNPKLIVTLGNIGLQRLTGKDKKITD
VHGQLLKQPIQKLKDMQSAEFIWSEKEYHIFPTFHPASIFYNRSLLELIY
EDLEKLKQYVIKN
>BA1010 transposase, IS605 family
MILAKKVRLIPTPEQEQVLRNHAGAARFAYNYCKRMSDRYYKLFGKSVSQ
LALQKRFTKIKKRKRYEWLNDINAQVPKQASKDFDKARKHSFKKYKNGYH
TSYKSKKDLIQGFYANYERLIIGKKVVHIQSIGEVKTSQQLPRNKKPFNP
RVTFDGRHWWISVGFQEFFESQELTNESIGVDVGLKELFVASNGTKERNI
NKDAKVKKLLKRKKSAQRDMSRRFKKGVKTQSAGYEKAKAEHLRLSRKIK
NIRNNHIHQATAKLVKTKPMRIVVEDLSISNLLKNKKLSKAFLFQKLNFF
FQCLSYKCEKYGVAYVKADKWFASSKICSCCGVKYEHSVQPEGQWSLKIR
EWRCVPCNSHHDRDLNASINLSRWVK
>BA2733 conserved hypothetical protein
MKVIIKNLPSLEVAFIRRTGSYFEPQDHWGKLLNWSIENKLYPLEQSFIG
ISLDNPELVASHMCRHDACVTIPKNFEKEQHEDVQFKSVDGGLYALYQFY
DEPHKLSEVYRYMYAVWLPNSEYSADYDRDNLEFCMNNVAEDLEGKLKVD
LFVPIKKNK
>BA5226 Toprim domain protein
MIYVEKVIIVEGTSDRRKIESIIREPVEIVCTNGTIGLSTMDELVDQFFD
KEVYVLVDADDAGEKLRKQFRKEFPQAEHIYIDRSYREVATAPSSHLANV
LWGADIDVYTEYLR
>BA4450 helicase, putative
MNVDISIDRTWQNNFLNRIDEDGPWTNWDLYHLAYETEKSLLVPTFDGLQ
APKHLSHFTPLPHQLEVAQNVIEQMNGKAILADEVGLGKTIEAGLILKEY
MVRGLVKKVLILVPASLVSQWAYELNTKFFIPAVAQKKSYSWEQADVIVS
SIDTAKRSPHRDIVLNLEYDLIIIDEAHKLKNNKTKNYEFAQRLKKKFCL
LLTATPVQNKIDEIFNLVSLLKPGHLGNQSNFEEYYASKNRSAESDEDLK
ALINKVMVRNRRHNTGIDWPKRHVRTIFVEFNEEEQDLYNNIENWRGQDA
FTSAFSSLTLKREACSSREAVYYSLKKHVEKRQKENEHYVKDPHIDILMD
KINHIPFNSKANKALELIKEIDDKVVIFTEYRASQMYLQWFLQQHGISSV
PFRGGFKRGKKDWMKELFQNHAQVLIATEAGGEGINLQFCSHMINYDLPW
NPMRLEQRIGRIHRLGQKNDVHIYNLATKHTVEEHILKLLYEKINLFERV
IGELDEILTRINMKNIDAHIQEIFAQSKSEGEIRIKMENLTSIIDFAKRN
EAEVQGYAAT
>BA1978 lipoprotein, putative
MKYGKVAVVGALSVGLLTGCFGEKPEENLYTAFETAATQEKSLVDEAKKL
EKLEKEGQELYSQILQEGKDHNDAVMKKIEQATANVDDREKVLKNEKEML
EKAQKETKSVQGNIEKLEDKKLQKQAKAVEESYKNRYDAFQKMNENYTKA
LATEKELYEKLKVKETKLKEIGEKVKAVNELTVEAQKSKEQFNNFTKEYN
DSKLAFYKDAEIKIKDKK
>BA2755 mutT/nudix family protein
MSMSLYYKKIREQLGHELIFIPSVAAVIKNEQGEILFQYPGGEYWSLPAG
AIEPGETPEEAVVREVWEETGLKVQVKKQKGVFGGEEYRHTYPNGDQVEY
IVVVFECEITSGKLKSIDGESLKLQYFSLSEKPPLALPYPDKIFL
>BA4887 mutT/nudix family protein
MYPRAKAFGLAIHHDHLLVQEYHTSDETYSRPLGGSIELGEKSAHTVIRE
FKEELHTEVEITNYLGCLENIFHLDGEIAHEIIQLYSLRLLDTSLYKMEK
VKIQDEQTVSYAKWIPLTAFIQEKKVLYPDGILTYIKKKKDEIL
>BA1235 conserved hypothetical protein
MRRFGHNLKTFFLNRRKRIREGGIMLFTSWLLFFIFALAAFRLTRLIVYD
KITGFLRRPFIDELEITEPDGSVSTFTKVKGKGLRKWIGELLSCYWCTGV
WVSAFLLVLYNWIPIVAEPLLALLAIAGAAAIIETITGYFMGE
>BA4889 conserved hypothetical protein
MYVSQTVETLFSIFDSSAVVLRKELDVTYLEALVETGDNLFEGAILQEEL
SEAAIERLNREYSTFNEETYKGEEIRKAFQLAILKGMKEGVQANHEMTPD
AVGMFMSYLFHKFMQGQNEITVLDPAIGTGNLMTTVFNSAKEGLTMSGFG
VEVDEVLIKLALVNANLQKHAIEFFHQDGLAPLYIDPVDAVISDLPIGYY
PNEIGASEYKLKADEGMSYAHHLFIEQSVKHTKEGGYLFFLVPNFIFESD
QAPKLHAFIKETCFIQGLLQLPVSMFKNEKNAKSIFVLQKKGPSVTMQKQ
ALLVELPKFSNMKAMEDIMDQLNTWFATHK
>BA2202 transposase, IS110 family, OrfA
MHNRQNYLYVGVDLHKEHHTAVIINCWQEKLGEIQFENKPSAFSEFLLEV
ETYVSNGLSVVFGLEDVGGYGRALAKYLVDHEQVVKEVNPALSFLERKSQ
VMIQKNDSWDAECVARILVNKFNQLPDAKPNDLFWSIQQLVSRRNALVKA
QSALKNQLHIQLNHHYPSYKKFFSELDGKTALAFWQQYPSPSCLEGANIK
QLTAFLLDVSNNTCSVKKASDILKLVKEDGQTMKEYQEMRNFLVRSIVRE
IEFKRRK
>BA0439 prophage LambdaBa04, DNA replication protein DnaC, putative
MTRIVNTSACSEETEGYTCEHCNKYIAAITVEVPQLRIKNKILPTCECVV
EREEAKIREAQNFAKKREIEKLFSISNLGERFSKSTFESFLDRNGSETAY
KVAVKYVKTFKEWNGESLMLWGEPGNGKTHLAAAIVNELSKKGYIVVFQS
VPELLQRIRSTFNSENKENETQIMRALLECDLLILDDIGAEKTTEWVEEK
LFNIIDGRYRKELPTLYTSNLEPKELKNQVGKRSYDRMVETSLTVKNEAA
SYRREIAKQRLQRFIEA
>BA0273 conserved domain protein
MKIIAFYLPQFHQIKENDRWWGKGFTEWTNTKSARPLFSKHYQPREPYQD
FYYDLTDPSVRKWQAEIAKAHGIYGFCYYHYWFKGKRLLETPFNEVLKTG
EPDFPFCLSWANEPWTKTWDGLDSHILMPQNYGELSDWKEHFDYLLQAFQ
DERYIRIDDKPLFIIYRPGHIPDCEQMLNYWNILAQENGLKGIYFVETLN
SFPLPNIHGFDASIQFEPFYTIAHDSSSDINKTIYESGKQINAWDYDKVW
MYILKRSPSEKKTFPGAFVDWDNTARRKDLNSSIFLGSTPEKFTIYLSKQ
IYRTYSLYNSEFLFMNA
>BA1395 hypothetical protein
MTDYHLYDFMRCPHKFYFRHIKRREPSSFEWQQIAQMIVNQIINEYYTLP
AGQQTKIVLLLLIEKYWKKVRINMFASKTEYYIVLAKLTDHLLQFVERDD
SQTLPLFLYEKFQTYMKELGVHMSLTLEVGEWSTESFVIKKYVVDADEEM
LALCQKLMTVFSYKAFGILPGKIEVINLIEGTKYEYIPKQEDITTGMADL
SRMKEMLQQPEHYTERHFRSECISCAFRSECQGEEVKKEAKQKKNIVH
>BA2685 mutT/nudix family protein
MYKHTLCFIKRNEEILMLNRKYDPVKGLWNGVGGKIEKGETPLENAIREI
KEETNIKVTHDQIQFKGIIKWEDSSYSGGMYVYLVELLHEFTYHTPKKVS
EGILDWKEISWILSDYNYGVGEMIPKFLVEVLHNELILEHNFVLSNHKLI
DYRNKELAKQDNSIDNIIPL
>BA0408 conserved hypothetical protein
MAKKNNIARNIAIGVAAGVAVSMLKKENREKVKNTAEKAKTKMIEIGENA
KIKEKVQTVTDKGRELADLNVVKAKVAEIKKLTPSVVETLKETKEIFSKK
KVEPAEKPETIEIQAVSPKVDELKVEEEPVVAEDGGMKESRELFMKDSNV
EEKKLKRTLS
>BA3753 site-specific recombinase, resolvase family
MNAIIYARVSTTKEAQETSLLRQKDELLHLAERYQMNVIKVIEEQASGYT
IERDGILEVLDTIRDEQIDVLLIQDETRLGRGNAKIALMHCLHKEGIKVY
TYTHNGELQLSDSDSMVLDIIGIVEEYQRKIHNLKIKRGMQRAVEKGYRP
ENNLKNRHLSVGREKKEVPIEEIVRLRKSELTFEEIAATLRGFGHNVSKA
TVHRRYVEYTKQLLDKEE
>BA4995 site-specific recombinase, phage integrase family
MEVVEALKDISQIEAMKKYLKEHSQRDYLLFVIGINTGLKITELLSMKFE
DVLHEDGTVKEFYSLPVKDEKFKQDIYLNTKVKEALLEYVQSIDVKRENY
VFQSNKTTNSITRQQAYRVIHNAAEAVGIVGKIGTNSMRKTFGFHAYKRG
IAIALLQKHFHHATPSETLKYLGISKDEEFKTEIDVDL
>BA4140 methyltransferase, putative
MRVVSGKCKGHPLKAVPGNTTRPTTDKVKESIFNMIGPYFDGGIALDLFG
GSGGLGIEAISRGIDKAIFVDRDNKAIKTIHQNLESCRIQEQAEVYRNDA
ERAIKALIKREMSFDLILIDPPYKDQKIVSLISVMDQHGLLHNAGLIMAE
HGNDVVLPESIGRLVKVRAEKYGITAISIYKYEGEGTE
>BA4099 prophage LambdaBa02, site-specific recombinase, phage integrase family
MIKELKEYFKEQNERNYILFLLGINTGLRISDILRLRVRDVEGWNIFIRE
KKTNKIKDVKMPSDLKKALRDYTKGKPKNEFLIKSRNGKNKPITRSMAYV
ILNQAAQEFGLERIGTHSLRKTYGYHHYKQFKDVVALQQMLNHTDQKETL
RYIGIQQDTLNDYQRKFKI
>BA1626 conserved domain protein
MQTELMTDYMNIEEALQFAEDFEKTGRVKELLFYDEMDTEWLLKEMKKLS
KQVEEEPQEILVYFDGGYDVQTKEAGVGICVYYKKGNAKYRIRRNAYIEG
IYDNNEAEYASLLYSMNILEELGIKYEAVTLRGDSQVVLQQLAGEWPCYD
EHLNHYLDQIEQKAKQMKLKLVCEPISRKQNKEAQQLATQALEGTVIDSH
KEITE
>BA2109 ATP-dependent RNA helicase, DEAD/DEAH box family
MIKDMQPFLQQAWEKAGFKELTEIQKQAIPTILEGQDVIAESPTGTGKTL
AYLLPLLHKINPEVKQPQVVVLAPTRELVMQIHEEVQKFTAGTEISGASL
IGGADIKRQVEKLKKHPRVIVGSPGRILELIRMKKLKMHEVKTIVFDEFD
QIVKQKMMGAVQDVIKSTMRDRQLVFFSATMTKAAEDAARDLAVEPQLVR
VTRAESKSLVEHTYIICERREKNDYVRRIMHMGDVKAVAFLNDPFRLDEI
TEKLKFRKMKAAALHAEASKQEREATMRAFRGGKLEILLATDIAARGIDI
DDLTHVIHLELPDTVDQYIHRSGRTGRMGKEGTVVSLVTPQEERKLLQFA
KKLGIVFTKQEMFKGSFVETKPKAPKKKKPAFTGKKKPR
>BA2584 lipoprotein, putative
MKSGKKLIVFLFSIAVLCACEPEMEESKSEDSIVMDIATAAVKEESFFSA
AIWDDKERKVDLEIADSENANEIKKEINKRLQIQGIMSYKVNISQRNKEI
VNAEHRWELVFGQIFDDVFRKNGYEGFGIQQINYKKNQPVTIDIKTKIRD
DEVGAREFGQKIEKEVEDVLKTEAVKKWIENDSYAIGIYDIENRKIN
>BA2578 mutT/nudix family protein
MIHIDKNVRTSGAYVMYKDLFVFQVGPTSKGDTLGVVRLGGHKEADETAV
ETAKREVEEEASMDITILNSPTTYYKEYWNAQSKKLKVENEVNPILIIDS
PDESLSIMYIAYSKMLPKLSSETNGLLLLSLNDIELICTGKITLNNYINQ
GGVAILKEKMDKELILQPFPQLMFLSELLKEDPVLLQQFLN
>BA3593 exonuclease family protein
MKVIGEFSELVKPGARLTRHTTKLTGITKKDLIGVEKFPQIIEKFIRFIG
EDSIFVTWGKEDYRFLSHDCTLHSVECPCMEKERRIDLQKFVFQAYEELF
EHTPSLQSAVEQLGLIWEGKQHRALADAENTANILLKAYSERDITKRYKR
HGELELVENGKLTEKAKKKMRKWVFKEMRKNTERPFVWSTFESSDTWESI
TERYYISESTIELLKKHFRTAVRKAERQIRYLAEMEKVVEEN
>BA3824 conserved hypothetical protein
MNLVGIENLVLPEDAELAKSLRNKKENYIKNQFLLTRIASKKNVEGKTKE
FYEACKEYEACGEKAKECDKQLKELIFKKKENDRVQHVVERMREVGIKED
VIQKVLYK
>BA5717 replicative DNA helicase
MSDVMADRTPPHNIEAEQAVLGAILIDQDALTSASELLVPDSFYRTKHQK
IFEVMLGLSDKGEPIDLVMMTSAMADQGLLEEVGGVSYLAELAEVVPTAA
NVEYYARIIAEKALLRRLIRTATHIVSDGYEREDDVDGLLNEAEKKILEV
SHQTNAKAFQNIKDVLVDAYDKIELLHNQKGEVTGIPTGFTELDKMTAGF
QRNDLIIVAARPSVGKTAFSLNIAQNVATKTDENVAIFSLEMGADQLVMR
MLCAEGNIDAQRLRTGSLTSDDWAKLTMAMGSLSNAGIYIDDTPGIKVNE
IRAKCRRLKQEQGLGMILIDYLQLIQGSGKSGENRQQEVSEISRTLKGIA
RELQVPVIALSQLSRGVESRQDKRPMMSDIRESGSIEQDADIVAFLYRED
YYDRETENKNTIEIIIAKQRNGPVGSVELAFVKEFNKFVNLERRFEDGHA
PPA
>BA2941 conserved hypothetical protein
MLLEEVMQQLEEYGTEQNRKTYKNHGAKEPLFGVSFANLKLLKKKIKKDH
DLAISLWETKNMDAMTLATMILDPKKVTTELLNKWVQEVDYYCLMDVLMT
AICTSPIAIERMEEWTKSDDEWIGRAGWSLLANIAIKNKTLQDDSFSPYL
EEIKENIHNEKNRKREAMNSALIAIGIRNEDLEQTAIEIAREIGKVQVDH
GATSCKTPDAESYIKKARERAEKKKMK
>BA3228 lipoprotein, putative
MRKWMLIGAISCLLLTACSTQADNNTEVQQLKVENDKLQKEVAQLQQEPQ
KTLPAANDSKQIQDFKNEVSSIVEKANNTKPVGVKEDNLNTYLAVKKEID
QLDDKIDLSDNQLEADYRAGTITIEQYQTQEREHDILEDQLEQAENALEA
RFGIED
>BA2261 mutT/nudix family protein
MGYIEELRKVVGSRPLNLAGVAVAVFNEQGQILLQQRQNGIWGVPGGFVE
LGESTEEAGRREVFEETGVEIGTLQLISVFSGKEFFVKLPNGDEFYPITI
AYLCKDIKGGLLKADGIESLSVQFFDFDKLPENISPFIKN
>BA4241 mutT/nudix family protein
MIREGEERRVYILGYIEELRKVVGTRPLILVGSAIIILNDNQEVLLQYRS
DTYDWGVPGGAMELGETTEETARRELFEETGLNAKIMQFIGVLSGKEVYF
QYPNGDEIFNVIHLYQGHHVSGELRLDHEGLQLQYFPVDKLPNLNKTTEK
ILQKFLHALTE
>BA1668 conserved domain protein
MYIQLYEKLVIIQKAYENIQALGEQIYENLKHKNVNAVQKLQVEQLQYID
GLKKLSSSFEEMVVQFCKEKGIEPFRVSALFSHFSNEEIEKMEELQKNVA
ELEENVKMILLKNQYYLNVLLKTTESIVDSVSEYNLERNNNSQIFMNELL
>BA3814 prophage LambdaBa01, C-5 cytosine-specific DNA methylase family protein
MYRSKESGLKMLDLCSGIAGISMAADWAGIDTAAFCEIEEFNQKVLRKNY
PNIPIFPDLYKLMKQSLIDGGVDVDSIGVISAGYPCQGESLVGKRRGAED
ERWLWPEVFRLIKELRPTWFVGENVAGHVTMGLDTVLSDLEEENYSTRTF
VFPAVSVGAPHQRYRTFIVGHSNDKSKLQTDPRVVPFRSKWETWENTTGI
NRGTLSGTYWEENKPAICGMDDGTATRLDEDRLRFLGNAVVPQQIYPIFE
AIAKIEGLL
>BA1974 lipoprotein, putative
MQKWIGIMGIIFILTGCKMSEAPTNLMEAPANEKWINELKEQIDKDLPVN
YRLLTPMSNKDKQMIWSMDFKQDNKKEAVIFYKLPNADHSVYLAVYEKNG
NGWKTKSTHKFNGGDVDIVEVGDFTGNGKRELLIGISVDRESLKHVMYVF
SEENEDMREIYNRNYTKLFIDDLNKNGLKDLSLVTYEKDEKLIVEFIEQF
KTLSEASFDPFINSIQRIQMGRISKSLKAIVIDAGVGAHSGITYVAKFDE
NHYEVLPIDGKEDLFNEYVVESKDVNEDGIIEFVRTVRPKGWEDKPHGDS
PLFERYIQWSESGIKPIEERYIDIEKGYYVKIPKELIGKITISDEQKESN
SQKFLETSTNETWLEVHIFKRKEWFGIKGYSAAVKTASHVYAVPKQPHFE
KVKAYVKPLADYQQE
>BA1606 5'-3' exonuclease family protein
MKKVLLVDGMALLFRAFYATSVYGQFMKRQDGTPTNGIHGYMKHLLTAMQ
AIEPTHIVTCWDMGSTTFRTESFSNYKANRAAPPEELIPQFDLVQEMTAK
LSVPVIGMKGYEADDCIGTLAKQYCNEAEVYILTGDTDLLQLVDKNVTVM
LLRKGMGNYEYYTPEKIMEEKGVEPWQIVHAKAFMGDTSDNYPGVKGIGE
KTAYKLIQEHGTVATVLENVASLTKAQRTKIESDLENLNISLQLAQIHCE
VPISCSLEEGLHTIDEEKLRFVCEEMNWGRPEMLINML
>BA3598 mutT/nudix family protein
MIRNRGVAIIVQEGKIALIKRIRGGETYYVFPGGGIEEGETPEEATKREA
YEELGVHIKVGNLIAKLEFKGTEYYFNAHIIGGVFGSGKAEEFELKGRGS
YITLWLPIYELEKVNIKPYEVVESILEHYKI
>BA4211 lipoprotein, putative
MLKYSKLAIVTALSMTLLAGCFGPKPEEELYVAFENAAKQEKTMFEDAKK
LETLEKEGQELYNQIVQEGKDNNQTVKEKLNQAVKNTDEREKVLKKEKES
LNKAQEEVKSADKYVKKIEDKKLKDQADKVKSTYEKRHDSFNKMYDSYNK
SLKQEKELYTMLQDKGTKLKDISEKVKVVNQSYKDIDSEKDKFNEFTKSY
NTEKIAFYKQANIKIKEEKK
>BA3133 conserved hypothetical protein
MELRKRLGCLHAHYSNIEYIENALSSFNIELIHFVDPALMYRVTSNENFL
ESNAQLKVKEQIEWIAQCNVDAILITCTNYIAILQEDQLSTSVPIIKVDE
PFFDYISNIQQPQTILFTNPATVEGTVKRLKHHANNYQKSLDLEVIVIDS
TFELIMQGLKEEYNQEITKSLNQIMKDEKKVISVAQLSMVDASQQVEYKT
SKTIINPLNTLVSYIVNQLELEKKNQL
>BA1142 addA, ATP-dependent nuclease, subunit A
MMENWPKKPEGSQWTDDQWKAVVATGRDILVAAAAGSGKTAVLVERIIKK
IINEENPVDVDRLLVVTFTNAAAQEMKNRIGEALEKVLIDEPGSQHVRKQ
LSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLD
DILEEEYGIEDNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEK
WLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHIRKATELAMLP
DGPAPRIETLQADLALLGTLSAAARESWTSVYEAMQNVSWQTLKRIKKSD
YNEDIVKQVDSLRNKAKDEVKKLQEELFSRRPESFLRDFQDMHPVLEKLV
QLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRF
RLAEPGLFLGKYKRFTQEGLGGGMKIDLAKNFRSRHEVLAGTNFIFKQIM
GEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEV
EKAQLEARLMAQRIKAMVDSGYEVYDRKTDSMRPVKYRDFVILLRSMPWA
PQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDNPMQDIPLAAV
LRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEW
FYNLLQGWREFARQQSLSDLIWKVYGETGYYDFVGGLPAGKQRQANLRVL
YDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKY
TTLSQLAIKRKMKMELIAEEMRVLYVALTRAKEKLILIGTVKDATKEMEK
WLDAREHSEWLLPDHVRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPD
EIYGYDTSWKVEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKE
EVYDRLMWKYGYGEATSHRAKQSVTEIKRNYQSEEGSDNAFIKKLRAPIQ
TRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEILQEQIAGMVNKEL
LTFEQAEEIAIEKVISFFDSDLGKRVLAAKSVEREVPFTMMLAAEEAYQD
WQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
ETRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE
>BA1141 addB, ATP-dependent nuclease, subunit B
MSLRFVIGRAGSGKSTLCLHEVQEELKQRPRGETILYLVPEQMTFQTQQA
LIGSEDVRGSIRAQVFSFSRLAWKVLQEVGGASRLHIDEAGVHMLLRKIV
ESRKDGLSVFQKAAEQNGFFEHLGSMIAEFKRYNVTPSNVYEMWQQLDAH
SSSAEQKLLANKVYDLQLLYDDFERALIGKYLDSEDYLQLLVEKLPQSEY
VKGAEVYIDGFHSFSPQELEIVRQLMICGARVTITLTIDEKTLAQPVNEL
DLFYETTLTYEKIKQVAREEKIEIEKTIPLMEQPRFHSPALAHLEMHYEA
RPNEKFHGEASVTIHTAANLRAEVEGVAREIRRLVAEENYRYRDIAVLLR
NGESYYDVMRTLFTDYNIPHFIDEKRPMSHHPLVECIRSALEIISGNWRY
DAVFRCVKTELLYPLDVRKETMREEMDEFENYCLAYGVQGKRWTSEDPWM
YRRYRSLDDTNGMITDSEREMEEKINRLRDVVRTPVIRMQKRLKRAGTVM
QMCEAVYLFLEELDVPKKLEALRIRAEENGDFLFATDHEQVWEEVMSLLD
TFVEMLGEEKMSLSMFTDVMSTGLEALQFANIPPSLDQVLIANIDRSRLS
NVKATFVIGVNEGVIPAAPMDEGMLSDEERDVLSAAGIELAPTTRQTLLE
EQFVMYQMVTRATEKLYISCPLADEEGKTLLASSFIKKIKRMFPDVKDTF
ITNDVNDLSRLEQISYVATPEVTLSYVMQQLQTWKRYGFEGNLDFWWDVY
NFYVTSDEWKQKSSRVLSSLFYRNRAQKLSTAVSRDLYGDKIKGSVSRME
LFNRCAYAHFAQHGLSLRERDIFKLDAPDIGELFHAALKRIADRLLRENR
TWADLSIKECEHLSAVVIEEIAPLLQRQILLSSNRHFYLKQKLQQIIFRT
SIILREHAKSSGFVPVDLEVPFGMGGTGSLPPMEFSLPNGVKMEVVGRID
RVDKAEDENGTFLRIIDYKSSSKALDLTEVYYGLALQMLTYLDVVTSNAQ
TWMKKGGTASPAGVLYFHIHNPIVEVKGDASEAEIEKEILKKFKMKGLVL
GDADVVRLMDNKLSTGSSDIISAGLKKDGSFSARSSIASEQEFNVLQKYV
HHTFENIGKDITEGVIDIAPYKKGNKAACTFCNFKSVCQFDESLEDNQFR
TLKDMKDSEAMEKIREEVGGE
>BA3871 alkA, DNA-3-methyladenine glycosidase
MTAEYSNALILTVPTEFSFQENLRYLSRSNNECMFHIEDNKIYKVIPVHD
VKPLVEISMNADDTIQIRFLGEAYISAKPIRDAVANYVTEWFDLTIDLAP
FYTLAKHDVLLQRPIEQYYGLRTLGIPDLFEALSWGIIGQQINLTYAYTL
KRRLVETFGSYVEWNDRKHWIFPSPETIANLHVEDLKNLKMTTRKCEYLI
GIAKLITEGNLSKESLLQIQDVKQAEKRLTAIHGIGPWTANYVLMRCLRF
PSAFPIDDVGLHNAIKYLTGSESKPTKHEIKDFAVNWKNWESYATFYLWR
VLY
>BA4553 comEA, comE operon protein 1
MWDFPKKWLGLVAIIGIVLFLLFWKTNEHTERSVITTDVQAKEIEKKSKP
KISDTKLQKKIIVIDMKGAVVKEGVYEMKEGDRVKDAIEKAGGFLPEADR
KKVNLAQVVQDQMVLYVPDKNEQVQEGAAVSKGEEKVRINAASKEQLEKI
TGIGSRKAESILKYREEHGPFQKIEDLLEIDGIGVKSLEKIKDQIIIP
>BA5426 comFA, comF operon protein 1
MLAGKQLLLEELPSDLRRELSDLKKEGEVICVQGVIKKASKYICQRCGNI
EQRLFASFLCKRCSKVCTYCRKCITMGRVSECAVLVRGIHERKGERELHS
LQWKGSLSLGQELAAQGVIEAIKQKESFFIWAVCGAGKTEMLFYGIEEAL
QKGERVCIATPRTDVVLELVPRLQEVFPSINVAALYGGSVDREKDAALVV
ATTHQLLRYYRAFHVMIVDEIDAFPYHVDQMLQYAVQQAMKEKAARIYLT
ATPDEKWKRNFRTGKQKGIIVSGRYHRHPLPVPLFSWCGNWKKSLHHKKI
PRVLLQWLKMYVNKKYPIFLFVPHVRYIEEIGLLLKGLDHRIDGVHAEDP
MRKEKVEAFRKGDIPLLVTTTILERGVIVKNLQVAVLGAEEEIFSESALV
QIAGRAGRSFEEPYGEVVYFHYGKTESMVRAKRHIQSMNKSAKEQGLID
>BA0001 dnaA, chromosomal replication initiator protein DnaA
MENISDLWNSALKELEKKVSKPSYETWLKSTTAHNLKKDVLTITAPNEFA
RDWLESHYSELISETLYDLTGAKLAIRFIIPQSQAEEEIDLPPAKPNAAQ
DDSNHLPQSMLNPKYTFDTFVIGSGNRFAHAASLAVAEAPAKAYNPLFIY
GGVGLGKTHLMHAIGHYVIEHNPNAKVVYLSSEKFTNEFINSIRDNKAVD
FRNKYRNVDVLLIDDIQFLAGKEQTQEEFFHTFNALHEESKQIVISSDRP
PKEIPTLEDRLRSRFEWGLITDITPPDLETRIAILRKKAKAEGLDIPNEV
MLYIANQIDSNIRELEGALIRVVAYSSLINKDINADLAAEALKDIIPNSK
PKIISIYDIQKAVGDVYQVKLEDFKAKKRTKSVAFPRQIAMYLSRELTDS
SLPKIGEEFGGRDHTTVIHAHEKISKLLKTDTQLQKQVEEINDILK
>BA4823 dnaB, DNA replication protein DnaB
MEKQSWMELLPIDRYKVSAKGLLHNYDRKVLTMLYQPLIGSRAFSLYMTL
WGELEQDRVFGKENTHHSLMVTMQMQLPEVYEERVKLEAIGLLKVYIKKE
KDIRMFIYELQPPLSPKQFFDDIVLSIFLYNRLSRTKYNQVKHYFLEEEF
DFASYENVTRSFNDVFGSFNPGQLEHAQEDLRIPKTTAMPSNEKGDAPKV
WNDFFDFSLFVDGLSALVPKKAITDQVRECVITLAYVYDVDVLSMQNIVL
GAVTEMQTIDMERLRKGARDWYQFENGQALPVLSERVQPHAARMMKEKEP
STQEEMLIKQLEEISPKQLLKEISGGAEATKADLQIVEDVMINQKLTPGV
VNVLIYYVMLRSDMKLAKTYVEKIAGHWARKKVGTVAEAMALAKEENRQY
QEWAETKKKGRTSKKTVRKEMVPDWLKEEPKEQEKETVKKDASAEKGAST
LEDERKRLEEVLKKYKRD
>BA1569 dnaD, DNA replication protein DnaD
MKKKMMLQWFEQGSIAIPKLLMMHYKKLGLNETEFMVVLHVHTFLESGNS
FPTPSEISERMTITEMKCMEVIQTLIQKGFLSLEGGQKSEAMMCESYSLQ
PLWEKILHFLMNESIEEEQKEIKQLQVNLYTVFEKEFGRPLSPFECETLG
MWEDQDQHHPNLIQAALREAVMSGKLNFRYIDRILFEWKKNGIKTVDQAQ
NQGRKFRANQQRTQQTTKQETKFTGKVPLYNWLEQ
>BA4849 dnaE, DNA polymerase III, alpha subunit
MKFVHLQCQTVFSLLKSACKIDELVVRAKELGYSSLAITDENVMYGVIPF
YKACKKHGIHPIIGLTASIFSEEEEKSYPLVLLAENEIGYQNLLKISSSI
MTKSKEGIPKKWLAHYAKGLIAISPGKDGEIEQLLLEDKESQAEEVARAY
QNMFGNFYMSLQHHAIQDELLLQEKLPAFMNRVNIPVVATNDVRYINQSD
ALVHECLLSVESGTKMTDPDRPRLKTDQYYLKSSDEMEALFSHVEEAIYN
TVEIAERCRVEIPFHVNQLPKFPVPSNETNDMYLRRVCEEGLQKRYGTPK
EVHINRLNHELNVISSMGFSDYFLIVWDFMKYAHENHILTGPGRGSAAGS
LVSYVLEITDIDPIEYDLLFERFLNPERVTLPDIDIDFPDIRRDEMIRYV
KDKYGQLRVAQIVTFGTLAAKAAIRDIARVMGLPPRDIDIFSKLIPSKLG
ITLKDAYEESQSLREFIQGNLLHERVFEIAKRVEGLPRHTSIHAAGVIMS
QEPLTGSVAIQEGHNDVYVTQYPADALEELGLLKMDFLGLRNLTLLENII
KFIVQKTGKEIDIRNLPLQDAKTFQLLGRGDTTGVFQLESSGMRNVLRGL
KPNEFEDIVAVNSLYRPGPMEQIPTFIESKHGKRKIEYLHPDLKPILERT
YGVIVYQEQIMQIASKLAGFSLGEADLLRRAVSKKNRDILDQERKHFVQG
CLQNGYDETSAEKIYDLIVRFANYGFNRSHAVAYSMIGYQLAYLKANYTL
EFMTALLSSAIGNEDKIVQYIRETKRKGFHVLPPSLQRSGYNFQIEGNAI
RYSLLSIRNIGMATVTALLEEREKKMFEDLFEFCLRMPSKFVTERNLEAF
VWSGCFDDFGVSRTNLWKSLKGALEYANLARDLGDAVPKSKYVQGEELSF
IEQLNKEKEVLGFYLSSYPTAQYVKLAKELEIPSLAQAMRHKKKVQRAIV
YITSVRVIRTKKLQKMAFITFCDQNDEMEAVLFPETYIHFSDKLQEGAIV
LVDGTIELRNHKLQWIVNGLYPLEEMDAYEEKKDASVYVKLPSQYEKKLL
NQVTKILFDYSGFAKVLIYYEKEHKMVQLSRSLSIHPSEECLGALREIVG
EENVVVKI
>BA4516 dnaG, DNA primase
MGNRIPEEVVEQIRTSSDIVEVIGEYVQLRKQGRNYFGLCPFHGENSPSF
SVSSDKQIFHCFGCGEGGNVFSFLMKMEGLAFTEAVQKLGERNGIAVAEY
TSGQGQQEDISDDTVIMQQAHELLKKYYHHLLVNTEEGNEALSYLLKRGI
TKEMIEKFEIGYASPAWDAATKILQKRGLSLSSMEQAGLLIRSEKDGSHY
DRFRGRVMFPIYTLQGKVIAFSGRALGDDTPKYLNSPETPIFHKSKLLYN
FHQARPFIRKRGQVVLFEGYADVLAAVKSGVEEAVATMGTALTEEQAKLL
RRNVETVVLCYDGDKAGREATMKAGQLLLQVGCQVKVTSLPDKLDPDEYV
QQYGTTAFENLVKSSISFVGFKINYLRLGKNLQDESGKEEYVKSVLKELS
LLQDAMQAESYLKSLSQEFSYSMETLLNQLHQYRKEQKVQQKQVKQVSKP
SQIVQTKPKLTGFERAEREIIYHMLQSPEVAVRMESHIEDFHTEEHKGIL
YELYAYYEKGNEPSVGTFLSWLSDEKLKNIITDISTDEFINPEYTEEVLQ
SHLETLRRHQEKLEKMEIIFKIKQMEKTDPVEAAKYYVAYLQNQKARK
>BA4822 dnaI, primosomal protein DnaI
MEHIQNSFAKLMENENFKNRYEVLKAEVMAHPRVKEFIDEHRGEVTTSMI
ERSLVKLYEYIGQSVGCADCPDLGSCKNMLQGYEPKLVIQGKMIDIQYDR
CVRKVAYDERKKYEKLVQSVYMPTDILQATMENLDPSDLDARIDAIGAAN
EFLSAYEPGKKVQGLYLYGKFGVGKTYLLGAIANELARKKISSMLVYFPE
FLREIKSSIQDNSIGEKIDAVKRVQVLMLDDIGAEAMSSFVRDDVLGAIL
QFRMLENLPTFFTSNFDFKQLEHHLTYTQRGEAEEMKAARIMERIKYLAK
PIPIGGKNRRHK
>BA0002 dnaN-1, DNA polymerase III, beta subunit
MRFSIQKDYLVRSVQDVMKAVSFRTTIPILTGIKVVATEEGVTLTGSDAD
ISIESFIPVEEDGKEIVEVKQSGSIVLQAKYFSEIVKKLPKETVEISVEN
HLMTKITSGKSEFNLNGLDSAEYPLLPQIEEHHVFKIPTDLLKHMIRQTV
FAVSTSETRPILTGVNWKVYNSELTCIATDSHRLALRKAKIEGIVDEFQA
NVVIPGKSLNELSKILDESEEMVDIVITEYQVLFRTKHLLFFSRLLEGNY
PDTTRLIPAESKTDIFVNTKEFLQAIDRASLLARDGRNNVVKLSTLEQAM
LEISSNSPEIGKVVEEVQCEKVDGEELKISFSAKYMMDALKALDSTEIKI
SFTGAMRPFLIRTVNDESIIQLILPVRTY
>BA2684 dnaN-2, DNA polymerase III, beta subunit
MEFIVNHKHFTQALSDVSKAISTKAIIPILSGIKITADQSGITLIASNSN
IFIEKFIPSAIDDEQITTILQAGTIVVPAKYFIEIIKKMPSDIVIKSKNE
QTITIQSGEITLNLNGFPANEFPNVPQIDDHTEIQIETKQLIDAFKQTVF
AVAKNESRHVLTGVHIELDHNKLICAATDSHRLAIRETLISTNMKANCIV
PSATINELLKLMNSNLEFVSIYLSESHIIFTFGTTTLYSRLIEGKYPNIS
TLIPNEFQTVINIDRQRMLQGVDRSSLLASEWANNNVNLEIVNESTIQIS
SNASQIGKISEKQQIDVIQGKKQLNISFDGRFMLDALRAIKEETVTLSFS
GSMRPILIEAGTQSAAIHLISPVRAY
>BA0019 dnaX, DNA polymerase III, gamma and tau subunits
MSYQALYRTWRPQKFEDVVGQKHVTKTLQNALLQEKVSHAYLFSGPRGTG
KTTIAKVFAKAINCEHAPVAEPCNECPSCLGITQGSISDVLEIDAASNNG
VDEIRDIRDKVKYAPSAVEYKVYIIDEVHMLSMGAFNALLKTLEEPPGHV
IFILATTEPHKIPPTIISRCQRFEFRKISVNDIVERLSTVVTNEGTQVED
EALQIVARAAEGGMRDALSLIDQAISYSDERVTTEDVLAVTGSVSQQYLG
NLVECIRENDVSRALRIIDEMMGKGKDPVRFMEDFIYYYRDMLLYQTSPQ
LEHMLERVMVDEQFRMLSEEMQPEVIYEIIHTLSKGQQEMKWTNHPRIFL
EVVMVQLCQQFMMQANGADRLQAIMNRMQQLEKELEQVKKNGVPVGVQQE
VKETRAAPKPVRTGSMKIPVGRVNEVLKQAKRQDLEQLKAVWGELLGRLK
AYNKVAFAVLLENSEPVAASDDTYVLAFQYEIHCKMASENREAMDTVEQA
LFELLSKKLNMIAIPKSEWGKIREDFLKREGGSSEESPEKKEDPLIEEAI
KLVGQELIEIKE
>BA3868 exoA, exodeoxyribonuclease III
MKFISWNVNGLRAVIAKGGFLEYLEESNADIFCLQEIKLQEGQIDLNVEG
YYTYWNYAVKKGYSGTAIFSKKEPLSVTYGLGIEEHDQEGRVITLEFEDF
YIITLYTPNSKRGLERLEYRMKWEDDFRAYIKRLDEKKSVVFCGDLNVAH
KEIDLKNPKSNRKNPGFSDEEREKFTCILEEGFIDTYRYLYPDQEGAYSW
WSYRMGARAKNIGWRLDYFVVSERMKDQITAAKINSEVMGSDHCPVELHI
NF
>BA1147 gerPC, spore germination protein GerPC
MNQDIYTYLHQLQQALQVQQATILNLEDQVRQLQEELNELKNRPSSSIGK
VEYKFDQLKVENLNGTLNIGLNPFSTKEQQIEDFQVDTETLKVNPETDTN
PDFYQGILQEMHRYLDEEAYNRILHFEKEERTPLDEMYRQMMVDDIKKQM
EHRLPYYLSQAQSYEGTSTDPDYLRDIIIQAMKHDIDKAFLSFIQHIPGN
FRKE
>BA3635 gerSC, spore germination protein
MRRFIHFTILCFLITLLTGCGDRLDLEKQSISLIYGFDAKAKGKLIVYHV
NPIFNEDVEKKYETHEAKVRTPREAKATFNSSSGGLVSTEKLQLILFSTK
FLKQEGAMPYLDVWYRDPKNTGNMRMVAVDGPISSVIYNNFKDKPALPEY
LTDLINTNKLYNRTVFTPFHEFHRQTFNKGITPAISEIKKGKKDVIVTGS
ALLTSRGIYKMSLNRYESALLLMLQKKANIPVSLTMKIPSHSVESNSHLK
DTEGEDFVSINVLSMNRDIRTDYSDNHFKFNVKMDFKIAVSELTFNMDID
KDRKKLTSLITKQLNKDLNNLIHKIQKQQLDPFGFGDYARAFQYEEWKKV
EDDWPSAFSKANVKVAPTIKILENGIIK
>BA0006 gyrA, DNA gyrase, A subunit
MSDNQQQARIREINISHEMRTSFLDYAMSVIVSRALPDVRDGLKPVHRRV
LYAMNDLGITADKAYKKSARIVGEVIGKYHPHGDSAVYETMVRMAQDFSQ
RYMLVDGHGNFGSVDGDSAAAMRYTEARMSKISMELIRDISKNTIDYQDN
YDGSEREPIVLPARFPNLLVNGTTGIAVGMATNIPPHQLGEVIDGVLALS
HNPDITIAELMECIPGPDFPTAGLILGRSGIRRAYETGRGSIILRAKVEI
EEKSNGKQSIIVTELPYQVNKARLIEKIAELVRDKKIEGITDLRDESDRN
GMRIVMEVRRDANANVLLNNLYKHTALQTSFGINMLSLVNGEPQVLNLKQ
NLYHYLEHQKVVIRRRTAYELEKAEARAHILEGLRIALDHLDEVITLIRS
SKTAEIAKQGLMERFGLSEKQAQAILDMRLQRLTGLEREKIEQEYQDLMK
LIAELKAILADEEKVLEIIREELTEVKERFNDKRRTEITIGGMESIEDED
LIPEQNIAITLTHNGYIKRLPASTYKTQNRGGRGVQGMGTNDDDFVEHLL
TTSTHDHILFFTNKGKVYRTKGYEIPEYSRTAKGIPIINLLGVDKGEWIN
AIIPIREFGDDEFLFFTTKQGISKRTPLSSFANIRTNGLIAISLREEDEV
ISVRLTSGDKDIIVGTSNGMLIRFNEQDVRSMGRNAAGVKAITLGEEDQV
VGMEIVEEDVNVLIVTKNGYGKRTPIDEYRLQSRGGKGLKTCNITDKNGK
LVAVKSVTGEEDIMLITAAGVIIRMPVDQISQMGRNTQGVRLIRLEDEQE
VATVAKAQKDDEEETSEEVSSEE
>BA0005 gyrB, DNA gyrase, B subunit
MEQKQMQENSYDESQIQVLEGLEAVRKRPGMYIGSTSGKGLHHLVWEIVD
NSIDEALAGYCDEINVSIEEDNSIRVTDNGRGIPVGIQEKMGRPAVEVIM
TVLHAGGKFGGGGYKVSGGLHGVGASVVNALSTELEVFVHREGKIHYQKY
ERGIPVADLKVIGDTDQTGTITRFKPDPEIFQETTVYEFDTLATRMRELA
FLNRNIKLTIEDKREHKQKKEFHYEGGIKSYVEHLNRSKQPIHEEPVYVE
GSKDGIQVEVSLQYNEGYTNNIYSFTNNIHTYEGGTHEVGFKTALTRVIN
DYGRKNSILKDADSNLTGEDVREGLTAIVSIKHPNPQFEGQTKTKLGNSE
ARTITESVFSEAFEKFLLENPNVARKIVEKGTMAARARVAAKKARELTRR
KSALEVSSLPGKLADCSSKDPAISEIYIVEGDSAGGSAKQGRDRHFQAIL
PLKGKIINVEKARLDKILSNDEVRTIITAIGTNIGGDFDIEKARYHKVII
MTDADVDGAHIRTLLLTFFYRYMRQIIEHGYIYIAQPPLFKVQQGKKIQY
AYNEKELEKILAELPAQPKPGIQRYKGLGEMNPTQLWETTMDPEVRSLLQ
VSLQDAIEADETFEILMGDKVEPRRNFIQENAKYVKNLDI
>BA0028 holB, DNA polymerase III, delta prime subunit
MTKTWEQLSAIQPIGVKMLMNSIAKERISHAYLLEGEKGTGKFATAIQMA
KSFLCSQRNRVEPCHVCTNCKRIDSGNHPNLHIVKPDGLSIKKQQIHDLQ
EEFSKTGLEANKKVYIIEHADRMTANAANALLKFLEEPSSDTTAILLTEQ
SHQILNTILSRCQVVTFRPLPTESLIRRLQDEGITVSLSTLAAQLTNSFD
EALTLCNDEWFAQARALVIKLCEALEKDKASIFFVQEKWGKHFGEKEQLQ
QGLDMLLLIYKDLLYVQLGEEDRLVFREQKEMFESFSYAQKRIVSALFNI
LEAKNRINANVNAQLVFEQLVLRLQEG
>BA1531 hup-1, DNA-binding protein HU
MNKTDLINAVAEASSLSKKDATKAVDAVFDSILEALKQGDKVQLIGFGNF
EVRERAARKGRNPQTGEEIEIAASKVPAFKPGKALKDAVK
>BA2377 hup-2, DNA-binding protein HU
MNKTELIKNVAQSADISQKDASAAVQSVFDTIATALQSGDKVQLIGFGTF
EVRERSARTGRNPQTGEEIQIAAGKVPAFKAGKELKEAVK
>BA3858 hup-3, DNA-binding protein HU
MNKTELIKNVAQNAEISQKEATVVVQTVVESITNTLAAGEKVQLIGFGTF
EVRERAARTGRNPQTGEEMQIAASKVPAFKAGKELKEAVK
>BA5142 kapD, sporulation inhibitor KapD
MDEQRFLFLDFEFTMPQHRKKPKGFFPEIIEVGLVSVVGCKVEDTYSAHV
RPKTFPSLTDRCKKFLGIKQEVVDKGISFSELVEKLAEYEKRCKPTIVTW
GNMDMKVLKHNCEKAGVDFPFLGQCRDLSLEYKKFFGERNQTGLWKAIEA
YGKVGTGKHHCALDDAMTTYNIFKLVEKDKEYLVKPAPPTLGELVDFSKV
LKKVSTQ
>BA0306 ligA, DNA ligase, NAD-dependent
MSKEIAKKRIEELRDLLNTFNYQYHVLDNPSVSDAEYDRNMQELIKLEAE
NPEFMSEDSPSIRVGGTVLDIFEKVTHKSPMLSLGNAFNEGDLRDFDRRV
RQGIDDANVRYICELKIDGLAVSLHYEKGRFIQGATRGDGVTGEDITQNL
KTIKAIPLRLNEEVTLEARGEAYMPKRSFVKLNEEKEQNGEDVFANPRNA
AAGSIRQLDPKIAAKRNLSMFVYGLANVEEKTIPSHSESLDYLGELGFKT
NPNRRTCETIEEVIAYVEEWQEKRPHLDYEIDGIVIKVDDVALQESLGTT
AKSPRWAIAYKFPAEEVVTRLTGIELSVGRTGVVTPTAELEPVRVAGTIV
RRASLHNEDLIREKDIRIGDYVVVKKAGDIIPEVVNVIFDKRTGGEEEYH
MPTHCPACESELVRLEEEVALRCINPTCPAQIREGLIHFVSRNAMNIDGL
GERVITQLFDADYIRTFADLYSLTKEQLLQLERFGEKSATNLVQAIENSK
ENSLERLLFGLGIRHVGAKAARTFAEHFETMDALVKATEEELKAINEIGE
KMAQSVVAYFDNEDVLELLQQFKEYGVNMTYKGIKIADLQNVESYFAGKT
VVLTGKLEVMGRSEAKKKIEALGGKVTGSVSKSTDLVVAGEAAGSKLAQA
EKHNVEVWNEERFLQELNK
>BA0052 mfd, transcription-repair coupling factor
MIGLLEQFYKNEEIQSVINGLEDGLKEQLVSGMATSSRSFLMAALYKKTK
KSQLIVTHNLYQAQKVHEDLVALLGEKDVWLYPVNELIASELGVASPELK
AQRIEVLNRLAAGEHGIIVAPVAGLRRFLPMKELWKQRQIEISLGQEIDL
DTFLHTLHHIGYERKSMVEAPGEFSLRGGILDIYPLTEELPFRIEFFDTE
VDSIRLFDVDEQRSQDKKESVRFGPATEFLFSQEELKSGIKHLEEGLTKT
MQKLSDDKLKTTVLETVSHEIEMLKNGQSIEQMFKYLSIFYNEPASLIDY
LPEDGVVILDEISRIQETASHLESEEAEWYISLLGEGTIIQDLSFSHSFE
EFLHHKKRSFVYLTLFLRHIAHTHPQNIVNVTCKTMQDFHGQMQLLKTEI
DRWNEGHFTTVVLGTDDERVKKLQHILSDYDIDADIVEGTDILLPGRLQI
AVGDLHAGFEMPMQKLVVITEKELFHKKVKKSQRKQKLSNAERIKSYSEL
KVGDYVVHVNHGIGKFLGIETLEINGVHKDYLNIKYQGNDKLYVPIEQID
QVQKYVGSEGKDPKVYKLGGNDWKKVKTKVEKSVQDIADDLIKLYAEREA
SKGYAYTPDTAEQQEFESSFPYQETEDQLRSIEEIKKDMERGRPMDRLLC
GDVGYGKTEVAIRAAFKAIMDEKQVAILVPTTILAQQHYETIRERFQDYP
INIGLLSRFRTRKQQNETIKGLKDGTVDIVIGTHRILSKDVTYKDLGLLI
IDEEQRFGVTHKEKIKQLKANVDVLTLTATPIPRTLHMSMLGVRDLSVIE
TPPENRFPVQTYVVEYNPALMREAIERELARGGQVYFLYNRVEDIERKAD
EISMLVPDARVTYAHGKMNESELESVMLSFLEGQHDVLVSTTIIETGVDI
PNVNTLIVFDADRMGLSQLYQLRGRVGRSNRVAYAYFAYKRDKVLSEVAE
KRLQAIKEFTELGSGFKIAMRDLSIRGAGNLLGAEQHGFIDSVGFDLYSQ
MLKDAIEQRRGTDGVENTVNVEIDLEVDAYLPDAYISDSKQKIMMYKQFR
GVSAIEDIEELQEEMIDRFGDYPQEVGYLLQIANIKVLAMKEQIELIKQN
KFEVTILFSEQASQNIDGGKLFMLGNSFGRMIGLGMEGSQLKIVMKTNGL
ETSKWLTIAENLLKGLPDVKKEVINA
>BA3904 mutL, DNA mismatch repair protein MutL
MGKIRKLDDQLSNLIAAGEVVERPASVVKELVENSIDANSTSIEIHLEEA
GLSKIRIIDNGDGIAEEDCIVAFERHATSKIKDENDLFRIRTLGFRGEAL
PSIASVSELELITSTGDAPGTHLIIKGGDIIKQEKTASRKGTDITVQNLF
FNTPARLKYMKTIHTELGNITDIVYRIAMSHPEVSLKLFHNEKKLLHTSG
NGDVRQVLASIYSIQVAKKLVPIEAESLDFTIKGYVTLPEVTRASRNYMS
TIVNGRYVRNFVLMKAIQQGYHTLLPVGRYPIGFLSIEMDPMLVDVNVHP
AKLEVRFSKEQELLKLIEETLQAAFKKIQLIPDAGVTTKKKEKDESVQEQ
FQFEHAKPKEPSMPEIVLPTGMDEKQEEPQAVKQPTQLWQPSTKPIIEEP
IQEEKSWDSNEEGFELEELEEVREIKEIEMNGNDLPPLYPIGQMHGTYIF
AQNDKGLYMIDQHAAQERINYEYFRDKVGRVAQEVQELLVPYRIDLSLTE
FLRVEEQLEELKKVGLFLEQFGHQSFIVRSHPTWFPKGQETEIIDEMMEQ
VVKLKKVDIKKLREEAAIMMSCKASIKANQYLTNDQIFALLEELRTTTNP
YTCPHGRPILVHHSTYELEKMFKRVM
>BA4830 mutM, formamidopyrimidine-DNA glycosylase
MPELPEVENVRRTLENLVTGKTIEDVIVTYPKIVKRPDDAEIFKEMLKGE
TIENIKRRGKFLLLYVTNYVIVSHLRMEGKFLLHQEDEPIDKHTHVRFLF
TDGTELHYKDVRKFGTMHLFKKGEEMNQMPLADLGPEPFDAELTPQYLHE
RLQKTNRKIKVVLLDQRLLVGLGNIYVDEVLFRSQIHPEREASSLTAEEI
ERIYEATVTTLGEAVKRGGSTIRTYINSQGQIGSFQELLNVYGKKGEPCV
TCGTILEKTVVGGRGTHYCPICQPRI
>BA3905 mutS, DNA mismatch repair protein MutS
MTQYTPMIQQYLKVKADYQDAFLFFRLGDFYEMFFEDAVKAAHELEITLT
SRDGGSSERIPMCGVPYHAAKNYIEQLVEKGYKVAVCEQVEDPKTAKGVV
RREVVQLITPGTMMEGRTIDEKENNFLAALTHFEDGSYALACNDLTTGQN
TVTLLTGSVEDILLEVYATGSKEIVVDSSFSKDELNKLTETLKMTISYED
ATAIPEGLEHLVKNVSQAKLIKAVGRLFNYVIRTQKRSLDHLQPVEIYYT
NQFMKIDVHSKRNLELTETLRTKEKTGSLLWLLDKTKTAMGGRMLKQWME
RPLIQKERIEERLEMVETFVNDYFLREDLKEKLKEVYDLERLAGKVAFGN
VNARDLLQLRRSLLQVPAILEAISLLDNAYAARLIQGADPCESLTELLGR
SIQENPPLSIKDGDIIKDGYNDKLDQYRYVSKNGKTWIAELEKRERDITG
IKSLKIGYNRIFGYYIEVTKANLGALPEGRYERKQTLANAERFITDELKE
KETLILEAEEKIVQLEYDLFTALREEVKVFIPKLQHLAKVISELDVLQSF
ATVSEEEQFVKPVLTTKREIFIKDGRHPVVEKVLNGKLYVPNDCIMPENM
DVFLITGPNMSGKSTYMRQLALVTVMSQIGCFVPATEAVLPVFDQIFTRI
GAADDLISGQSTFMVEMLEAKNAIANASERSLILFDEIGRGTSTYDGMAL
AQAIIEHIHDQIGAKTLFSTHYHELTVLEDSLDQLKNVHVSAIEENGKVV
FLHKIQDGAADKSYGIHVAQLAELPDSLIARAKEVLAQLEGQEEIVIPKR
VEVKAQEQEVIPEPIVVKEEPIEIEETKVDNEEESQLSFFGAEQSSKKQD
KPALDAKETAVLTQIKKIDLLDMTPLEAMNELYRLQKKLKKG
>BA0522 mutY, A/G-specific adenine glycosylase
MTLEILNNFNIEQFQNDLIGWFEKEQRDLPWRKNKDPYRVWVSEIMLQQT
RVEAVKPYYANFMGKFPTLEALANADDEEVLKAWEGLGYYSRARNLHAAV
KEVKEVYGGIVPSDVKKIEKLKGVGPYTKGAILSIAYGIPEPAVDGNVVR
VLSRILSVWDDIAKPKTRKVFEEIVREIISAENPSYFNQGLMELGALICI
PKNPACLLCPVREHCRGYAEGVQKELPVKSKAKAPTMVPIVAGVLQTEDG
RYVINKRPSTGLLANMWEFPNVELGEGIRNQKEQLIDYMKEKFELSISIE
EYAMNVQHTFTHRTWDIFVFYGKVTGDIVETDTLKFVSKEAFEQLPFSKS
HRTIYENCVEKITMQ
>BA4508 nfo, endonuclease IV
MLKIGSHVSMSGKKMLLAASEEAVSYGATTFMIYTGAPQNTRRKPIEELN
IEAGRKHMEQNGIEEIIVHAPYIINVGNTTKPETFQLGVDFLRMEIERTS
ALGVAKQIVLHPGAHVGAGADAGIQQIIKGLNEVLTPDQTVNIALETMAG
KGTECGRSFEEIAKIIDGVKYNEKLSVCFDTCHTHDAGYDIVNNFDGVLN
EFDKIVGIDRLQVLHINDSKNVRGAGKDRHENIGFGHIGYKALHHIVHHP
QLTHVPKILETPYVGEDKKDKKPPYKLEIEMLKNGTFDEGLLEKIKAQ
>BA1570 nth, endonuclease III
MLNKTQIRYCLDTMADMYPEAHCELIHDNPFELVIAVALSAQCTDALVNK
VTKNLFQKYKTPEDYLSVSLEELQQDIRSIGLYRNKAKNIQKLCRMLLDD
YNGEVPKDRDELTKLPGVGRKTANVVVSVAFGIPAIAVDTHVERVSKRLA
ICRWKDSVLEVEKTLMKKIPMDEWSVTHHRMIFFGRYHCKAQRPQCEECP
LLEVCREGKKRMKGK
>BA2039 ogt-1, methylated-DNA--protein-cysteineS-methyltransferase
MYQAYYESELGLLEITANDKGITSVIFVDERQEEHTNEMIDQCINELDEY
FKGNRKEFTVPLSPEGTAFQKNVWDALYTIPYGVSASYLDIAEKVGNTKA
VRAIGGANSRNPISIIVPCHRVIGKSGKLVGYAGGLWRKEWLLKHEGILK
>BA3870 ogt-2, methylated-DNA--protein-cysteineS-methyltransferase
MNSYKNKFIYWTLLTHANWKFHIAATESGLCFIGSQDEQFEELNIWARKK
LPQYILIHSPDYLQVYTKEIIEYLKNKRETFTFPIDAYGTAFQLSVWNTV
REIPYGKTYSYTEIADRIQKPTAVRAVASAIAANPILITIPCHRVIGKNG
KLTGFRGGLEMKKELLVLEKLQVEFI
>BA3656 parC, DNA topoisomerase IV, A subunit
MQAEKFHDLPLEDVLGDRFARYSKYIIQDRALPDARDGLKPVQRRILYSM
YVEGNVHDKAFRKSAKTVGNVIGNYHPHGDSSVYEAMVRLSQTWKVRNVL
VEMHGNNGSVDGDPAAAMRYTEARLSPIASELLRDLDKETVEFVSNFDDT
SEEPVVLPAAFPNLLVNGSTGISAGYATEIPPHHLGEVIDATMMRIDKPN
STVDDLLTVMKGPDFPTGGIIQGIDGIKKAYETGKGKIIIRGKAEVETVR
GGKQQIVITEIPYEVNKANLVKKMDELRLDKKLDGIAEVRDETDRTGLRI
VVELKKEANSEGILNYLYKNTDLQIPYNFNMVAINNRRPTLMTLPKILDA
YIGHQKEVVTRRSQYELRKAENRQHIVEGLKKALSILDQVIETIRASKDK
RNAKDNLSAKFGFTEAQAEAIVSLQLYRLTNTDITALQQEADELNKKIIE
LQAILQSEKRLLQVIKTDLKRVKKTYSDDRRAIIEDQIEEIKIDVEVMIP
QEDVIVTVTKEGYVKRTGWRSHNASNGKDFGMKEGDILLERFDTNTTETV
LLFTNKGNYIYLPVYEMPDIRWKDLGQHVANIVSLDRDETIIWATVVPNF
EEEKRFIVFVTKNGMIKKTELNQYKVQRYSRAFVAVNLKKDDEVVDIFAT
DGTSDIVLATHGAYALIFHEDEVSPVGVRAAGVKAINLKEDDYVASGKPL
NADKDQLILVTQRGAVKRLKASEIEKSTRAKRGLVIFKELKRNPYRIVGI
EIVRDDELVYMKTEKHIVEEIDPKAYRNKDRYSNGSLVLDVNDTGEVIET
WTKKRPE
>BA3657 parE, DNA topoisomerase IV, B subunit
MAKHQFQYNEDAIQVLEGLEAVRKRPGMYIGSTDSRGLHHLVYEIVDNSV
DEALAGFGDEISVVIHKDNSISVIDKGRGMPTGMHKLGKPTPEVILTVLH
AGGKFGQGGYKTSGGLHGVGASVVNALSEWLVVTIKRDGNIYEQRFENGG
VPVTTLEKIGKTKESGTTMHFKPDTTIFSTTNYNYETLCERLRESAFLLK
GMKISIKDERNDLEDVFHYETGIEAFVSYLNEEKDSIHPVVYFTGEQNGI
EAELAFQFNDGYSENILSFVNNVRTKDGGTHEAGFKTAMTRVFNEYARKV
ALLKEKDKNLEGTDIREGVAAIVSVRVPEEVLQFEGQTKGKLGTSEARSS
IDAIVSEHLAYFLEENPDVATLLVRKAIKAAQAREAARKAREEARTGKKK
KKSEGTLSGKLTPAQSRNPQKNELYLVEGDSAGGSAKQGRDRRFQAVLPL
RGKVINTEKAKLADIFKNEEINTIIYAIGGGVGNEFDVEDINYDKVVIMT
DADTDGAHIQVLLLTFFYRYMKPLIEAGKVFIALPPLYKVSKGKGKSEVI
EYAWSDEELDSVTKKVGKGYMLQRYKGLGEMNADQLWETTMNPETRTLIR
VKIDDAARAERRVTTLMGDKVEPRRKWIERNVQFGMQEEGNILENEMIME
TEVE
>BA0305 pcrA, ATP-dependent DNA helicase PcrA
MSMTDRLLNGLNPQQQKAVQTTNGPLLLMAGAGSGKTRVLTHRIAYLLGE
KGVAPWNVLAITFTNKAAREMRERIDTLVGPEAEDIWISTFHSMCVRILR
RDIDHIGINRNFTILDSGDQLTVVKKIMKERNIDPKKFEPRSILAGISNA
KNELLSADKYAKKITIADPYEKLTSDVYTEYQKRLLKNNSLDFDDLIMTT
IQLFERVPEVLEFYQRKFQYIHVDEYQDTNKAQYLLVKHLAARFKNLCVV
GDSDQSIYRWRGADISNILSFEKDYENAQVILLEQNYRSSQNILNAANAV
IERNTNRKPKKLWTDNEVGSKISYYRAATEKDEAYFVAKKIRDDIQMGKR
KYTDFAVLYRTNAQSRMVEEIFLKSNIPYKIVGGTKFYDRKEIKDILAYL
RLIGNPDDEISFARIINVPKRGIGATSIDKIINYGVQNGISLTAVFDEIE
HVGVSAKVTKAVKEFAGLLHNWVNMQEYLSVTELVEEVIEKTGYRDMLKN
ERTLEAEGRLENLDEFLSVTQTFESQSEDKSLVAFLTDLALVADIDRVDE
DPTAGEEVILMTMHSAKGLEFPVVFIVGLEEGIFPHTRSLMEEDEMQEER
RLAYVGITRAEEELYLSNAQMRTLFGRTSMNAASRFITEIPTELVESLNE
TAPKRETSFGAKGRVASSSKTTTTTRSRSAFARPAAKTTGGEQIGWAVGD
KASHQKWGVGTVISVKGEGDAKELDIAFPSPIGVKRLLAKFAPVTKQ
>BA4831 polA, DNA polymerase I
MEKKVVLVDGNNIAYRAFFALPLLNNDKGIHTNAIYGFTMMLMRILEEEK
PTHMLVAFDAGKTTFRHKTYSEYKGGRQKTPPELSEQFPFIREMLDAFNV
PRYELENYEADDIMGTLAKEASEQGASVKVISGDKDLLQLVSDNTLVCIP
RKGITEVDEYTKEALFEKYSLSPKQIIDMKGLMGDQSDNIPGVPGVGEKT
AIKLLTQFGTVEEVYENIDQVSGKKLKEKLEANKDQALMSKDLATIITDA
PITVNVDDMEYKGYEASDVIPMFENLGFTSLLNKLGVTPEETAPAELDDI
TFDIVEEVTEEMLQQDSALIVEVQEDNYHKADIQGFGIQNENGCYFIQTD
IALKSDAFKEWLADGEMRKYTFDAKRAIVALKWNGIDMQGIDFDLLIAAY
LLDPADTDKDFRTVAKMKETHAVKSDEEVYGKGAKRAVPELEIVAEHVAR
KVHVLYDVKQTFVEELEKNEQYELFTELELPLARVLADMEVKGVKVDTER
LRNMGEELAGRLKEMEQEIYKLAGTEFNINSPKQLGVILFENLNLPVIKK
TKTGYSTSADVLDKLMDHHEIIPNILHYRQLGKLNSTYIEGLLKVVHEDS
SKIHTRFNQVLTQTGRLSSTDPNLQNIPIRLEEGRKIRQAFVPSEEGWIM
YAADYSQIELRVLAHIANDKGLVEAFQHDMDIHTKTAMDVFGVEKDEVTS
NMRRQAKAVNFGIVYGISDYGLSQNLGITRKAAAEFIEKYLESFPGVQEY
MDDIVKDAKQKGYVATLLNRRRYIPEITSRNFNLRSFAERTAMNTPIQGT
AADIIKKAMIIMADRLEEEGLQARLLLQVHDELIFEAPKEEVEKLEKLVP
EVMEHAIELAVPLKVDYSYGPTWYDAK
>BA3955 polC, DNA polymerase III, alpha subunit, Gram-positive type
MSLTNEQKERFQILLQQLQIPDDLINQYLQGGGIERLVIDKANKSWHFNL
QVPRILPTELYELLETKLKQSFSHIARTTFALETENKQFTEEEVRAYWPL
CTERITFSPMFAYLKKQLPQVNGVKLLINVNNELESTALKKNVAKPVGDQ
YEAFGFPRFQLDTHIQQNTEEMQKFREQTQQEDRERVIQAMEEMAKKQAE
ESSVVHEGPITLGYLIKPDEEITPMREIQDEERRKTVQGYVFHVETKELR
SGRTLLTLKITDYTDSIMIKMFSRDKEDIPMLQSLKKGMWVKARGSVQND
TFVRDLVMIANDINEITGPSRKDKAPEGEKRVELHLHTPMSQMDAVTPVS
KLVAQAGKWGHEAIAVTDHAVAQSFPEAYSAGKKAGVKVIYGVEANLVND
GVPIAYNEEHRLLADETYVVFDVETTGLSAVYDTVIELAAVKVKGGEIID
RFESFANPHQPLSATIIELTGITDDMLTDAPEVDEVFKKFEEWMGDHTLV
AHNASFDMGFINVGFKKAGLEKTKNPVIDTLELARFLFPEMKNHRLNTLC
KKMDIELTQHHRAIYDTEATGYLLVKMLKDVIEKGFEYHDQLNDSMGQGD
AYKRGRPSHMTLLATSDVGLKNLYKLVSYSHLNYFYRVPRVPRSLLKKYR
EGILVGTACDKGEVFEAMMQKAPEEVEEIAQFYDYIEVMPPEVLRHLVER
ELVRDEGQLKTIISNLVKLGETLDKPVVATGNVHYLDPEDAMYRKILVSS
QGGANPLNRHSLPPVHFRTTDEMLECFSFLGEDVAKEIVVTNTQKIASLI
GDVHPVKDDLYTPKIEGADDETRDMSYKMARSIYGEELPEIVEARLEKEL
KSIIGHGFAVIYLISHKLVKKSLVDGYLVGSRGSVGSSFVATMMEITEVN
PLPPHYVCPKCKQSEFFNDGSVGSGFDLPDKECPTCNIPYVKDGHDIPFE
TFLGFKGDKVPDIDLNFSGEYQPRAHNYTKVLFGEDYVYRAGTIGTVAEK
TAYGYVKGYANDHNLTIRNAEIDRLVAGCTGVKRTTGQHPGGIIVVPDYM
DIFDFSPIQYPADSIGAEWRTTHFDFHSIHDNLLKLDILGHDDPTVIRML
QDLSGIDPKTIPTDDPEVMKIFSGPESLGVTEEQINCKTGTLGIPEFGTK
FVRQMLEETKPTTFSELVQISGLSHGTDVWLGNANELIYNGTCTLSEVIG
CRDDIMVYLIYQGLDPSLAFKIMESVRKGKGVPEEWEEDMKSNNVPGWYI
DSCKKIKYMFPKAHAAAYVLMAVRIAYFKVHFALLFYAAYFTVRADDFDV
EAMAKGSASIRARIDEIAQKGLDAAPKEKSLLTVLEMTLEMCERGYSFQK
VDLYRSHATDFIIDGDSLIPPFNAVPGLGTNAALSIVEARKNGEFLSKED
LQQRSKVSKTIIEYLDSQGCLGDLPDQNQLSLF
>BA4006 priA, primosomal protein N`
MKFASVIVDVPARQTDRPFDYIIPKKWEDIVQTGMRVVVPFGPRKLQGFI
IGIKNSVEVESKKLKTIHEILDVTPVLNEELLKLGYWLTSETLCYMISAF
QVMLPTAIKATYKKRLQLRKQEEVAPELLFLFQEKEAIDWEAIETQPHLY
RTIQQEIKHGTIEVVYQVKDKVQKKKQRVVQPELPEDKLELAAFELKSKK
QQDVLYYFVENYKSVPLKVITEELQITDAPIKALVKKGLISEKYVEVYRN
PYDDEDFEQTKPFPLTEEQKQVITPILSSITNETYNPFLLYGVTGSGKTE
VYLQSIAAVLEKGKEAIVLVPEIALTPQMVDRFKGRFGSQVAVLHSALSV
GEKYDEWRKILRKEVKVVVGARSAVFAPFENLGIIIIDEEHESSYKQEDN
PRYHARDVAVWRGQYHKCPIVLGSATPTLESFARAKKGVYELLTMEKRMN
EQALPTVEIVDMREELRDGNRSMFSKALHEKIADRLEKKEQMVLFLNRRG
HSTFVMCRDCGYVVQCPHCDISLTYHKMNHRLKCHYCSYEENMPTACPAC
QSTYIRFFGTGTQKVEEEITKLFPEARVIRMDVDTTSRKGMHEKLLKAFG
EEKADILLGTQMIAKGLDFPKVTLVGVLTADTMLHLPDFRASEKTYQLLT
QVSGRAGRHELPGEVIIQTYTPEHYSIELAKNQQYDVFFDQEMQMRRTRQ
YPPYYYVVLVTVSHPELLKAVQVTEKIVGHLRSHCSQQTMVLGPVASAIP
RIKDRYRYQCMIKYKREPNLKNVLKMVNEHYQAEMQKELQISIDFNPTML
M
>BA4685 radC, DNA repair protein RadC
MNGIRDVVKEEQPRERLLLEGAGSLSNRELLAVLLRTGSKEESVLKLSDK
ILHHFDGLRMLKDATLEELVSIHGVGVAKATQLIAAFELGRRMVRLEYQN
RYSIRSPEDCATYMMEEMRFLQQEHFVCLYLNTKNQVIHRQTIFIGSLNS
SIVHPREVFKEAFRRAAASIICLHNHPSGDPAPSREDIEVTKRLVECGRI
IGIEVLDHIIIGDHKFVSLKEKGHI
>BA3915 recA, recA protein, group I intron-containing
MSDRQAALDMALKQIEKQFGKGSIMKLGEQAERKVSTVSSGSLALDVALG
VGGYPRGRIIEIYGPESSGKTTVSLHAIAEVQRQGGQAAFIDAEHAMDPV
YAQKLGVNIDELLLSQPDTGEQGLEIAEALVRSGAVDIIVIDSVAALVPK
AEIEGDMGDSHVGLQARLMSQALRKLSGAINKSKTIAIFINQIREKVGVM
FGNPETTPGGRALKFYSTVRLEVRRAEQLKQGNDIVGNKTKVKVVKNKVA
PPFRVAEVDIMYGEGISREGEILDMASELDIVQKSGAWYSYNEERLGQGR
ENSKQFLKENTDLREEIAFFIREHHGISEDSGAEGMEDPNLLD
>BA0004 recF, DNA replication and repair protein RecF
MFISEIQLKNYRNYEKLELSFEDKVNVIIGENAQGKTNLMEAIYVLAMAK
SHRTSNDRELIRWDEDFGQIKGKLQKRNSSLSLELNISKKGKKAKLNQLE
QQKLSQYIGVMNVVMFAPEDLNLVKGSPQVRRRFLDMELGQIAPVYLYEL
SQYQKVLTQRNHLLKKMQGNSKNEETMLDVFTLQLIEHGTKILRKRFEFL
HLLQEWAAPIHRGISRGLEELEIVYKPSVDVSESMDLSKIKEVYYESFQS
VKQREIFRGTTLIGPHRDDLQFFVNSKNVQVFGSQGQQRTTALSLKLAEI
ELIYSEVKEYPILLLDDVLSELDDYRQSHLLNTIQGKVQTFVTTTSVDGI
EHETLKEAKTIHVTNGTVDCEIDRA
>BA3993 recG, ATP-dependent DNA helicase RecG
MNEVVQVPVTDVKGIGGETSELLHEMGIYTVSHLLEHFPYRYEDYAMKDL
AEVKHDERVTVEGKVHSAPLLQYYGKKKSRLTVRVLVGRYLITAVCFNRP
YYKQKLNLDETVTITGKWDQHRQTIAVSELHFGPVVRQQEVEPVYSVKGK
LTVKQMRRFIAQALKEYGDSIVEVLPDGLLSRYKLLPRYEALRALHFPTG
QEDLKQARRRFVYEEFFLFQLKMQTLRKMERENSKGTKKEIPSEELQEFI
DALPFPLTGAQRRVVDEIMKDMTSPYRMNRLLQGDVGSGKTVVAAIGLYA
AKLAHYQGALMVPTEILAEQHYQSLAETFSHFGMKVELLTSSVKGVRRRE
ILAKLEQGEIDILVGTHALIQDEVIFHRLGLVITDEQHRFGVAQRRVLRE
KGESPDVLFMTATPIPRTLAITAFGEMDVSIIDEMPAGRKVIETYWAKHD
MLDRVLGFVEKEINKGRQAYVICPLIEESEKLDVQNAIDLHSMLTHHYQG
KCQVGLMHGRLSSQEKEEIMGQFSENKVQILVSTTVVEVGVNVPNATVMV
IYDAERFGLSQLHQLRGRVGRGSEQSYCLLIADPKSETGKERMRIMTETN
DGFVLSEKDLELRGPGDFFGSKQSGLPEFKVADMVHDYRALETARQDAAL
LVDSEAFWHNDQYASLRTYLDGTGVFQGEKLD
>BA4639 recJ, single-stranded-DNA-specific exonuclease RecJ
MLQPKTRWKEKEYNGERVSELASKLQLSPLVVSLFLGRGLDTEDKILDFL
NTENQEFHDPFLLEGMDRTVERVNKAIQNGEQILIFGDYDADGVSSTTVL
YLALQELGADVEFYIPNRFTEGYGPNEEAFRWAHSAGFSLIITVDTGIAA
VHEAKVAKELGIDLIITDHHEPPPELPEALAIIHPKLDGGVYPFHYLAGV
GVAFKVAHALLGRVPEHLLEIAVIGTVADLVSLHGENRLLVKRGLKHMRM
TKNIGLKALFKVANVSQSEITEESIGFSIAPRINAVGRLEDATLAVHLLL
SEDPEEAKELAEEIDELNKLRKDIVKQITEEAIAEVENNFPPEENKVLVL
AKEGWNPGVIGIVASKLVERFYRPTIVLCIDPVKETAKGSARSIAGFDLF
ANLSDCRELLPHFGGHPMAAGMTLHMNDVDELRRRLNEQAEEILTEEDFI
PITAVDAFCKVEDVTLAAIEDMQKLAPFGVGNPKPRIAVKDAELESIRAI
GSDGSHLKMALRDGQATLDTIGFGFGAYAKEISPVAKVSVIGEASINEWN
NFKKPQLMVKDIAVEAWQLFDWRSMRNVEANVAELPKEKITMVYFSKEVL
HKFSLEDYKEHMMHASEVTELDEQYIVLLDLPKGTDELRDLFKVGFPSRI
YTLFYQENNHLFSTVPTRDHFKWYYSFLSQKSPFSLRQYGEQLCQHKGWS
KDTVNFMTQVFFELEFVTIKDGVIFMADKKQKRDLIESNTYREKMNHLQL
EKELVYSTYQQLYTWFETIRNHKEVEQLG
>BA4397 recN, DNA repair protein RecN
MLSELSIRNFAIIEALNISFQKGLTVLSGETGAGKSIIIDAISLLVGGRG
SAEFVRYGTEKAEIEGLFYVEDDKHPCIEKAEELDIEIEDGMIILKRDIA
ANGKSVCRVNGKLVTLSVLKEIGKTLVDIHGQHETQDLMNEERHLFMLDH
FDGERIVKQLDIYQNVYADYEKLKKQLKSLSENEQQMAHRLDLIQFQHEE
IRKADLKMDEENNLTEERLQISNFEKIYKALGDAYRSLSADGQGLDNVRS
AMGQMESITHLDEVYQENHDSIANSYYLLEEVAYQLREKLDMMEYDPNRL
DEIETRLNEIRMLKRKYGNTVEEILAYADKIEQEIFTIENKDVHIETTKK
QLKELESVILKEATLLSNMRHELAEHLTNAIHQELKELYMEKTKFEVRII
KREGNAEEPLVEGAPVRLTADGYDHVEFYISTNPGEPLKPLSKVASGGEL
FRIILALKSIFSKHQGVASVIFDEVDTGVSGRVAQAIAEKIYRVSVNSQV
LCITHLPQVASMADSHLFIRKQVANDRTITSVTVLTMEDKVTEIARMISG
VEITDLTTEHAKELLTQAHHFKQTAEAIQ
>BA4522 recO, recombination protein O
MFQKVEGIVIRTTDYGETNKIVTIFSRELGKVSAMARGAKKPKSRLASVS
QLMTHGHFLIQMGSGLGTLQQGEIISTMKEIREDIFLTAYASFIVELTDK
ATEDKKHNPYLFEMLYQTLHYMCEGVDPEVLSLIYQTKMLPVLGMRPYFD
TCAICHQETDFVAFSVREGGFLCSRHAEQDQYRIPVGEAVHKLLRLFYHF
DLHRLGNVSVKDSTKKQMRLVLNTYYDEYCGIYLKSRRFLEQLDKFQI
>BA1505 recQ-1, ATP-dependent DNA helicase RecQ
MKLEEYLYKWFGYSEFRPGQKGVITDLLEGKDVIAMLPTGRGKSMCYQFP
GLMREGTVLVVSPLLSLMEDQVTQLKYVVKDRVIAFNSFRTLNEKREAMK
KLSSYKFIFVSPEMLQSELLIRELKKIHISLFVVDEAHCISQWGYDFRPD
YKKLNVVIENIGSPTVLALTATATKGVLQDIADSLNLKGAAEHVYSIDRP
NIAMDVQFVETIEEKKEALLEQVMYLQGPGIVYCSSRAWTERLTEYLRGK
GVTGVAFYHGGMEHEERMLIQQQFMNNQLQLVICTSAFGMGVNKANTRYI
IHFHYPTNIASYLQEIGRAGRDGEPSIAILLCSPLDHDLPISIIEDELPS
KSQIQFLFSLLQERMFQTKELPIEDVEEICYNAARFNEQYWRFVRYHLEQ
VGIIQQRRLLLEGLSDEIMNRLIAEVEIRLRNKYSELENMKSWIQVKGCR
REYVLQQFGYRKEQELMNCCDYCGITKEDYKKRRAQQSDFDYNWETELQK
LFGLEKMEE
>BA2818 recQ-2, ATP-dependent DNA helicase RecQ
MFTKAQELLASYFGYSSFRRGQDETIKNVLDGKDTVCIMPTGGGKSICYQ
IPALVFEGTTLVISPLISLMKDQVDTLVQNGISATYINSSISITEANQRI
QLAKQGHYKLFYVAPERLDSMEFVDQLIDMKIPMIAIDEAHCISQWGHDF
RPSYLHIHRILDYLPEEPLVLALTATATPQVRDDICNTLGINQENTIMTT
FERENLSFSVIKGQDRNAYLADYIRQNQKESGIIYAATRKVVDQLYEDLM
KAGVSVSKYHAGMSDHDRNEQQELFLRDEVSVMVATSAFGMGIDKSNIRY
VIHYQLPKNMESYYQEAGRAGRDGLDSACILLYASQDVQVQRFLIDQSIG
ESRFSNELEKLQNMTDYCHTEQCLQSFILQYFGEEPKEDCGRCGNCTDNR
ESIDVTRESQMVLSCMIRTNQRFGKQMIAQVLTGSKNKKVIEFNFHTLPT
YGLLSNRSVKEVSEFIEFLISDELIAVEHGTYPTLKVTEKGKEVLLGKEN
VLRKERVETRQIVQDHPLFEVLREVRKEIAQGEGVPPFVIFSDQTLKDMC
AKMPQSDSELLTVKGIGEHKLVKYGSHFLQAVQHFIEENPNYAETIKTEV
VSERKKSGKASANSHLETYEMYKQGIDLDEIAKERGLSRQTIENHLIRCF
EDGMEVDWNSFVPAEYEQLIETAVQNAEGGLKSIKEQLPNEVSYFMIRAY
LQIRK
>BA0021 recR, recombination protein RecR
MHYPEPISKLIDSFMKLPGIGPKTAVRLAFFVLDMKEDDVLGFAKALVNA
KRDLAYCSVCGHITDRDPCYICNDSHRDQSVVCVVQEPKDVIAMEKMKEY
QGVYHVLRGAISPMGGIGPEDINIPQLLKRLHDETVQEVILATNPNIEGE
ATAMYISRLLKPTGIKVTRIAHGLPVGGDLEYADEVTLSKALEGRREV
>BA4798 rnh, ribonuclease HIII
MSNSIVIQTNSTVIEDMKQQYKHSLSPKTPQGGIFMAKVPSCTITAYKSG
KVMFQGGRAEAEAARWQTVPQTPKIAVKKSVDSHRYAPPASIGTMSIVGS
DEVGTGDFFGPMTVVAVYVDAKQIPLLKELGVKDSKNLNDEQITAIAKQL
LHVVPYSSLVLHNEKYNELFDKGNNQGKLKALLHNKAITNLLAKIAPTKP
EGVLIDQFTQPDTYYKYLAKQKQVQRENVYFATKGESVHLAVAAASILAR
YSFVKQFNELSKKAGMPLPKGAGKQVDIAAAKLIQKLGKERLPEFVKLHF
ANTEKAFRLLK
>BA1623 rnhA, RNase H
MIEVYIDGASKGNPGPSGAGVFIKGVQPAVQLSLPLGTMSNHEAEYHALL
AALKYCTEHNYNIVSFRTDSQLVERAVEKEYAKNKMFAPLLEEALQYIKS
FDLFFIKWIPSSQNKVADELARKAILQN
>BA3975 rnhB, ribonuclease HII
MQKVTIQEAEHLLQEIISEEDDRFQILIKDERKGVQKLISKWYKQKELAQ
KEKEKFLEMSKYENALREKGLTYIAGIDEVGRGPLAGPVVTAAVILPEDF
YIPGLNDSKKLSEAKRERFYGEIKAKAIAIGVGIVSPQVIDEINIYQATK
QAMLDAIANLSCTPEYLLIDAMKLPAPIPQTSIIKGDAKSISISAASIIA
KVTRDRMMKELGEKYPAYGFEQHMGYGTKQHLEAIEAHGVLEEHRKSFAP
IKDMIKK
>BA4651 ruvA, Holliday junction DNA helicase RuvA
MFEYVTGYVEYVGPEYVVIDHNGIGYQIFTPNPYVFQRSKQEIRVYTYHY
VREDIMALYGFKTREERLLFTKLLGVSGIGPKGALAILASGQTGQVVQAI
EHEDEKFLVKFPGVGKKTARQMILDLKGKLADVVPDAFVDLFSDEERFDE
KKGSSAELDEALEALRALGYAEREVSRVVPELLKESLTTDQYIKKALSLL
LNGKR
>BA4650 ruvB, Holliday junction DNA helicase RuvB
MDERLLSGESAYEDADLEYSLRPQTLRQYIGQDKAKHNLEVFIEAAKMRE
ETLDHVLLYGPPGLGKTTLANIIANEMGVNVRTTSGPAIERPGDLAAVLT
SLQPGDVLFIDEIHRLHRSIEEVLYPAMEDFCLDIVIGKGPSARSVRLDL
PPFTLVGATTRAGALSAPLRDRFGVLSRLEYYTVDQLSAIVERTAEVFEV
EIDSLAALEIARRARGTPRIANRLLRRVRDFAQVRGNGTVTMEITQMALE
LLQVDKLGLDHIDHKLLLGIIEKFHGGPVGLETVSATIGEESHTIEDVYE
PYLLQIGFLQRTPRGRIVTPLAYEHFGMEMPKV
>BA4429 splB, spore photoproduct lyase
MKPFMPKLVYFEPKALEYPLGKELYEKFTKMGLEIRETTFHNQIRNLPGE
NDLQKYRNAKATLVVGVRKTLKFDTSKPSAEYAIPLATGCMGHCHYCYLQ
TTLGSKPYVRVYVNLDEIFEKAQQYMDERAPEITRFEAACTSDIVGIDHL
THALKRAIEFIGESEHGRLRFVTKYSHVDHLLDAKHNGKTRFRFSINSRY
VIKNFEPGTSPFEERIEAARKVAGAGYPLGFIVAPLYMHEGWEEGYRELF
ERLYNALNDLSIPNLTFELIQHRFTKPAKKVIQERYPNTKLEMDEEKRKY
KWGRYGIGKYVYKKDDAEVLEETIRGYIYEFFPDAEIQYFT
>BA5722 ssb-1, single-stranded DNA-binding protein
MNRVILVGRLTKDPDLRYTPNGVAVATFTLAVNRAFANQQGEREADFINC
VIWRKQAENVANYLKKGSLAGVDGRLQTRNYEGQDGKRVYVTEVLAESVQ
FLEPRNGGGEQRGSFNQQPSGAGFGNQSSNPFGQSSNSGNQGNQGNSGFT
KNDDPFSNVGQPIDISDDDLPF
>BA2168 ssb-2, single-stranded DNA-binding protein
MMNRVVLIGRLTKEPELYYTKQGVAYARVCVAVNRGFRNSLGEQQVDFIN
CVVWRKSAENVTEYCTKGSLVGITGRIHTRNYEDDQGKRIYITEVVIESI
TFLERRREGASQ
>BA3971 topA, DNA topoisomerase I
MSDYLVIVESPSKAKTIEKYLGKKYKVVASMGHVRDLPKSQMGIEVKNNF
TPKYITIRGKGPVLKDLKSAAKKAKKVYLAADPDREGEAIAWHLANTLNV
DVESDCRVVFNEITKDAIKESFKHPRAINMDLVDAQQARRILDRLVGYNI
SPLLWKKVKKGLSAGRVQSVAVRLIIEREREIQSFEPEEFWTIKTEFVKG
KDTFEASFYGVDGEKVQLTNETQVNEIIEQLKDNAFSVENVTRKERKRNP
ALPFTTSSLQQEAARKLNMRAKKTMMLAQQLYEGIDLGKQGTVGLITYMR
TDSTRISETAQTEARTYITEAYGTEYIGAEKKKETKKSNAQDAHEAIRPT
SVMRKPEELKSFLSRDQLRLYKLIWERFVASQMASAIMDTVTARLINNNV
QFRASGSVVKFPGFMKVYVESKDDGAEEKDKMLPPLEVGETVFSKDLEPK
QHFTQPPPRYTEARLVRTLEELGIGRPSTYVPTLETIQKRGYVGLDNKRF
VPTELGEIVIELILEFFPEIINIEFTANMEQSLDEVEEGNANWVKIVDDF
YVGFEPRLEKAEKEMREVEIKDEPAGEDCELCNHPMVFKMGKYGKFMACS
NFPDCRNTKPIVKEIGVTCPKCDKGQIIERRSNKKKRLFYGCGTYPECDF
VSWDKPIGRKCPKCEGMLVEKKLKKGVQVQCISCDYEEEQQM
>BA0375 topB-1, DNA topoisomerase III
MAKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLAD
PESYDVKYKKWNLEDLPMLPERLKLTVIKQTGKQFNAVKSQLLRKDVNEI
IVATDAGREGELVARWIIDKVRINKPIKRLWISSVTDKAIKDGFANLKPG
KAYDNLYASAVARSEADWYIGLNATRALTTRFNAQLNCGRVQTPTVAMIA
NREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFNKEKIDGIVKG
LDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQK
LYEQHKVLTYPRTDSRYISSDIVGTLPERLKACGVGEYRPLAHKVLQKPI
KANKSFVDDSKVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLF
PAFEYEQLTLRTKVGNETFIARGKTILHAGWKEVYENRFEDDDVTDDVKE
QLLPRIEKGDTLTVKLIMQTSGQTKAPARFNEATLLSAMENPTKYMDTQN
KQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIHITSKGRQLLD
LVPEELKSPTLTGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSS
DKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECGHRKNVSRTTN
ARCPQCKKKLELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRD
VQKYMKQQKKEEEPLNNPFAEALKKLKFD
>BA1905 topB-2, DNA topoisomerase III
MKLIIAEKPDQGLALVSQFKYRRKDGYLEVEANELFPNGAYCTWAIGHLT
QLCNPEHYHAEWKKWSLDTLPMIPERFQFEVTKSKYKQFNVVKQLLHNPQ
VTEIIHAGDAGREGELIVRNIINLCNVQKPMKRLWISSLTKQAIYQGFKN
LLDESDTINTYYEAYTRSCADWVVGMNASRVFSILLKKKGMNDVFSAGRV
QTPTLALIVKREKEIENFKSEPFWEVFATFNIEGKKYDGKWEKDNESRLQ
DPDMANKIAAFCQGKPAVVKEMKTERKEFQPPLLFNLSSLQATANKAFKF
SPKKTLDITQALYQKGIVSYPRSDSNYVTQGEAATFPDILQKLSQFDEYK
GLLPAPVESIMNNKRYVNEKKVTDHYAIIPTEQVTNPSRLSGDEKKIYDM
IVRRLIAAHYEVAIFDYTTIVTLVDERAEFISKGKQQIQEGWRKVIFQDD
KDDETILPIVAEGEEGKVVKVKVKEGKTQPPKRYTEGQLITLMKTAGKYL
ENEELEKVLKKTEGLGTEATRAGIITMLKDRKYIDVKKNQVYATDKGKVL
ITAIGDKILASPEMTAKWEQRLAEIGEGTASPATFMEQTKKLSAKIIEDA
VEMSEKWDFTGLHVESIERKGSKFTTGKKVGSCKKCDGDVIDKSTFYGCS
NYNTTQCDFTISKKILSKTISQKNMTKLLKGEKTDLIKGFKKGEKTFDAK
LEWKDNKINFVFEN
>BA5648 ung, uracil-DNA glycosylase
MKHVLKNDWGPLLAPEFEKEYYRELDVFLKEEYSTHVVYPKIEDIFNALE
YTSYENTKVVILGQDPYHGPNQAHGLSFSVQPGVKTPPSLLNMYKELRDE
YGYDIPNNGYLVKWAEQGVLLLNTVLTVRQGEANSHKGKGWEHFTDRVIE
LLNEREKPVIFILWGRHAQAKKKLITNPNHQIIESVHPSPLSARRGFFGS
KPYSKVNTILANMGEGEIDWEIPNL
>BA5395 uvrA, excinuclease ABC, A subunit
MSKSKDFIVVKGARAHNLKNIDVTIPRNQLVVVTGLSGSGKSSLAFDTIY
AEGQRRYVESLSAYARQFLGQMDKPDVDTIEGLSPAISIDQKTTSRNPRS
TVGTVTEIYDYLRLLFARIGTPICPNHGIEITSQTVEQMVDRVLEYPERT
KLQVLAPIVSGRKGAHVKVLEDIKKQGYVRVRVDGEMLDVSEDIALDKNK
KHSIEVVIDRIVVKEGIASRLADSLESALKLGGGRVLIDVMGEEELLFSE
HHACPHCGFSIGELEPRMFSFNSPFGACPSCDGLGSKLEVDLELVIPNWD
LSLNEHAIAPWEPTSSQYYPQLLQSVCNHYGVDMDVPVKDIPKDLFDKVL
YGSGEEKVYFRYVNEFGQVKENEILFEGVIPNIERRYRETSSDYIREQME
KYMAEQACPKCKGGRLKPESLAVFVGGKTIADVTKYSVQEVQEFFSNVEL
TEKQQKIAHLILREIQERVGFLVNVGLDYLTLSRAAGTLSGGEAQRIRLA
TQIGSRLTGVLYILDEPSIGLHQRDNDRLIRTLQEMRDLGNTLIVVEHDE
DTMMAADYLLDIGPGAGIHGGQVVSAGTPAEVMQDENSLTGKYLSGKEFI
PVPLERRKGDGRKVEIVGAKENNLKNAKMSFPLGTFVAVTGVSGSGKSTM
INEVLYKSLAQKLYKAKAKPGTHKEIKGLEHLDKVIDIDQSPIGRTPRSN
PATYTGVFDDIRDVFAQTNEAKVRGYQKGRFSFNVKGGRCEACRGDGIIK
IEMHFLPDVYVPCEVCHGKRYNRETLEVKYKDKNISEVLGMTIEDGVEFF
ANIPKIKRKLQTLVDVGLGYMKLGQPATTLSGGEAQRVKLASELHRRSTG
RTLYILDEPTTGLHAHDIARLLEVLQRLVESGETVLVIEHNLDVIKTADY
IVDLGPEGGDKGGQIVASGTPEQVVKEERSYTGKYLKEILNRDKARMKEK
IKEVELSQ
>BA5396 uvrB, excinuclease ABC, B subunit
MERQFEIVSAYSPQGDQPVAIEKLVEGINSGKKKQVLLGATGTGKTFTIS
NVIKEVQKPTLVMAHNKTLAGQLYSELKDFFPNNAVEYFVSYYDYYQPEA
YVPQTDTFIEKDAQINDEIDKLRHSATSALFERDDVIIVASVSCIYGLGS
PEEYRELVVSLRVGMEKDRNQLLRELVDVQYGRNDIDFKRGTFRVRGDVV
EIFPASLDEHCIRIEFFGDEIDRIREVNALTGEVLAERDHVAIFPASHFV
TREEKMKVAIENIEKELEERLKELNDNGKLLEAQRIEQRTRYDLEMMREM
GFCSGIENYSRHLTLRPAGATPYTLLDYFPKDFLIVMDESHVSVPQVRAM
YNGDQARKQVLVDHGFRLPSALDNRPLTFDEFEEKTNQVIYVSATPGPYE
LEQSPEVIEQIIRPTGLLDPPIDIRPIEGQIDDLLGEIQDRIAKNERVLI
TTLTKKMSEDLTDYLKDVGIKVNYLHSEVKTLERIEIIRDLRLGKFDVLV
GINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARNENGRVI
MYADRITRSMGIAIEETKRRRSIQEAYNEEHGITPKTIQKGVRDVIRATT
AAEEPETYEATPAKKMTKKEREKTIAKMEAEMKEAAKALDFERAAELRDL
LLELKAEG
>BA4757 uvrC, excinuclease ABC, C subunit
MHEHLKEKLAILPDQPGCYLMKDRQGTVIYVGKAKVLKNRVRSYFTGSHD
GKTLRLVGEIVDFEYIVTSSNLEALILELNLIKKHDPKYNIQLKDDKTYP
FIKITAEKQPRLLITRNVKKDKGKYFGPYPNAQSAHETKKLLDRMYPLRK
CSNMPDKVCLYYHMGQCLAPCVKEVTEEQNKEIVDEIIKFLNGGHKEVRS
ELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMSDLVDRDVFGY
AVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSH
FKPKEIVVPGSIDSELVERFLEVEATQPKRGKKKDLVELANKNAKIALEE
KFYLIERDEERTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIA
FIDGKPAKKEYRKYKIKTVQGPDDYESMREVVRRRYTRALKEGLPLPDLI
IIDGGKGHLAAASDVLENELGLYIPMAGLVKDDKHKTSHLIIGDPPEPVM
LERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALDDIPGIGDKRK
KVLLKHFGSLKKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL
>BA4404 xseA, exodeoxyribonuclease VII, large subunit
MEKQYLTVTALTRYIKTKIEYDPHLQSVWLKGEISNFKNHSRGHMYFTLK
DENARIAAVMFAGHNRNIKFRPENGMKVLVKGKISVYEASGSYQIYIQDM
QPDGIGNLHLAYEQLKVRLEEEGLFSQVYKKTIPPYAKTIGVITSPTGAA
IRDIITTIKRRYPIGNVIVFPVLVQGESAAPSIVQAIRTANEMEEIDVLI
VGRGGGSIEELWAFNEEMVARAIFKSEIPIISAVGHETDFTIADFVADLR
APTPTAAAELAAPNIIELQEKVLQRTLRLQRAMRELVHKKEEKLQVLQKS
YAFRYPRQVYEQKEEQLDRALEQLVLAKERYIDKKVNQLKQLSFYLEKHH
PSQKIMQTKVAVETLQKQLQREMQTLLQAKEFAFVRAAQKLEALSPLKVM
MRGYGLVYDEEKQVLKSVKDVSLGDAVSVQLQDGILDCSVSGIEERELNN
GK
>BA4403 xseB, exodeoxyribonuclease VII, small subunit
MENKLSFEEAISQLEHLVSKLEQGDVPLEEAISYFKEGMELSKLCDEKLK
NVQEQMAVILGEDGELEPFTALGDEA