GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Bacillus cereus E33L, E33L
Gene type: CDS

Number of genes found: 283

# Bacillus cereus E33L, E33L

>BCE33L0720 probable transposase, IS605 family
MNKMAKENPSNYKTLQIWIKKGHRMYSYFQECCHNAKNMYNTTNFYIRQV
YTGLTQEKELQPLQKEVLANIHKNIGKMNDTQLLAYQKKLEKEKLKPKEE
QKEITCNLFSEPNFEKPYVDYNFLDALFKAMIQNDYRSLPTQCSQSIMKG
VFQNWKSFFASLKDYKKNPNKYAGPPRIPKYIRSSEKEILYTNQDCIIKN
DRFLKFPKTKLKLNIGKLGFTEGKLKQVRVIPKYNEYVVELVIDVPYEQQ
MIEENARYMSIDLGIDNLATIVTNTGMKPVLVKGKHVKSINQYYNKMKSH
FTSILRNGKQTNEGPFTSKRIEKLHQKRYLKIKDVFHKVSHHIVKLAQEE
VCKIVIGQNKGWKQETNMGKRNNQSFCHIPHNLLIQMITYKANAVGIQVV
VTEESYTSKASFLDNDFIPTYGENDQNTTFSGKRIKRGIYRSANKTLIHA
DVNAAANILRKVIPNAWTNGIEGLGVKQLANVLTPLTLIVR
>pE33L54_0042 hypothetical protein
MEVHTMKNKIGKPKKYSLQELQQHLLNYVQKHPNKTISYIGLERETGISR
NTWSRNLKDEIAKLNEPIPFTTKIDFDTGIPLPNIFDIVEKNYDNKAKLI
SSLTHLNACINSLYAIAKKERTFESENVGFKVKIKELEQTLREKDREIKE
LKQQVIHYSQAYRNICVSSTYGEKGLKNVLEFKTGSEKNEEKISADLSKQ
FKMFDFE
>BCE33L1441 methyltransferase
MGKVTLIATAAMGIEALVAREVRDLGYECQVENSKVTFEADEKAICRTNL
WLRTADRVKIKVGEFKATTFDELFEKTKALNWGDYIPENGEFPVIGKSLK
SELFSVSDCQRIVKKAVVEKLKTTYKRTTWFEEDGPLFRIEIAMLKDIAT
LTIDASGVGLHKRGYRVDQGEAPLKETLAASLIKLTNWKPDRPFVDPFCG
SGTIPIEAALIGQNIAPGFNRGFASDEWGWVGKQNWREARQEAEDLANYD
QPLQIIGSDIDHRMIRVAQDNAEEVGLGDLITFKQMQVKDFTTKEDYGYV
VTNPPYGERLSEKALVEQLYKEMGQVFRPLDTWSAYVLTSYEAFEKCYGK
DASKKRKLFNGFIRTDYYQYFGKRPPRNS
>BCE33L1116 probable integral membrane protein
MLFTSWLLFFIFALAAFRLTRLIVYDKITGFLRRPFIDELEITEPDGSVS
TFTKVKGKGLRKWIGELLSCYWCTGVWVSAFLLVLYNWIPIVAEPLLALL
AIAGAAAIIETITGYFIGE
>BCE33L0832 possible ATPase involved in DNA repair
MLIEQLELENIGAYTERNTFDLSISSPKKKVILIGGENGAGKTTFLNSIK
LGLFGCFGYGYKTENNDYYKRVHGYLNASARKDETNPFSITITFSEVENY
KRHVYTFKRSWNILNSAIKEKFTVKKDGHYLNDAEKDIFESRLRENFPPK
LFDLCLFDGEEISKIINENKLSSYLKELSTVIFNLDLFRNLEGDLTNYLQ
QEIDQEHLSSIESEILTLQRQEKEQSLKIEDLEDTIKTAQQQIEESKESY
SLLKKDFETHGGLIKEERETLNRQVLEIEAHRKANSEKVREFIQTLLPFY
LNKNLLLSTKNQLQNEEKLSLANQLTSELTEERALELAKSLPGVSAPNDL
AAELRKQIFNIIKPNDTDVEYIHRVSPTQRTQFEVAAQQVERESHDTYMQ
LLQENRENLLQAQELRKKISTNDSTNEFAQMLETMTQTQEKIFKLEKEVE
ENLSILETRQETLEALKNTIDSKQNIVQQSNKTRNTFLIAQSIMKLSTEF
QMLQHQKKLQQVQIEATKMLNKLMRKHQYISSLRIDSSTFEVTLYDNNRD
HVAKETLSAGEKEILLLSLIWAMFKCSGRRVPFIFDTLLGRLDQTHKHNI
LVDFIPACGEQVLILSTNSEVDEKHYNLLKNFVSHGYLLEFDTELRKVNV
TDQYFNFNKEQAK
>BCE33L3754 conserved hypothetical protein
MLKYSKLAIVTALSMTLLAGCFGPKPEEELYVAFENAAKQEKTMFEDAKK
LETLEKEGQELYNQIVQEGKDNNQTVKEKLNQAVKNTDEREKVLKKEKES
LSKAQEEVKSADKYVKKIEDKKLKDQADKVKSTYEKRHDSFNKMYDSYNK
SLKQEKELYTMLQDKGTKLKDISEKVKVVNQSYKDIDSEKDKFNEFTKSY
NTEKIAFYKQANIKIKEEKK
>BCE33L0472 transposase, IS605 family
MILAKKVRLIPTPEQEKVLRNHAGAARFAYNYCKRMSDRYYKLFGKSVSQ
LALQKRFTKIKKRKRYEWLKYINAQVPKQASKDFDTARKHSFKKYKNGYH
TSYKSKKDVIQGFYANYERLVIGKKVVHIQSIGEVKTSQQLPRNKKPSNP
RVTFDGRHWWISVGFQEDFESQELTNESIGVDVGLKELFVASNGMKERNI
NKDAKVKKLLKRKKSAQRDMSRRFKKGVTIQSAGYEKARAEHLRLSRKIT
NIRNNHIHQATAKLVKTKPMRIVVEDLPISNLLKNKKLSKAFLFQKLNFF
FQCLSYKCEKYGIAYVKADKWFASSKICSCCGVKYDHSVQPEGQWSLKIR
EWCCASCNSHHDRDVNAAMNLSRWVK
>pE33L466_0056 integrase-recombinase
MDNNNTNENLSIINNQSDFIDIYVEYIRDIVPFLHNNGYVDKVEEKMEVA
EKEGKSKYTYLSDLEVIYHFVHLQKDMDEKKNRKEDTKKSYISEILSFCQ
CMVQHAEAFELDGEEVQKKDSLLKTLQPWHIRKYNSWLKQVENGRNGETY
AVATLAKKTVLIRSFLKHLHVFSYIEKPLHEELQRANVNEQDRPNRDLSY
DEVMKILGFYKERGHLVNYTILLALASTGARIQELCTARVKNLHYDGKYW
LKVTGKGDKVRQLFISEHLFQCICEMRRRRGFQTVLDRGDESPLFVNQRG
NSYNSKTLSNQVTDMIKKTNLEFLLYRENPVTAHTFRHAFAIMAVEQGNA
DLYHLMQTLGHEDIQTTKIYLEKHMKRKNNVGTTFADMLN
>BCE33L1994 resolvase
MGEKVVGYVRVSTEGQVREGYSLTYQVEEIERYCIENKLQLLHIYEDKGI
SGATVDEDGLTVEREGLQELLSDLAYQQVSHVIVLNTSRLWRSDMAKVLI
QRELKKHEVDVKAIEQPNYSIYTHDPNDFLVNGMLELLDQYQRLEIALKL
SRGRKKKAEQGGYAGGGVMFGYRVKKGQKVLEVDAEKAIVVRRLFELRHF
FKHWSLTQLAERLNREGYCTEKRKLFTKVQVKRMLDRENFYRGVYTYGQI
QTIGKHPVIIL
>BCE33L2092 group-specific protein
MAEIKVGLTQFLDFTLKSSAAKTNFVKNLKSQPEYHPVFDYWKQLRETVI
KFHKNKLSFDCFEALVQAVDQKKKQNYIDVLKQYKKFITNKDVSWFDPGK
SHWLSDNLIVRSSPELGLLINDEPHLIKLFFKGKKERIDKYNINSTLTLL
NESTFSNEHKDVNYTVLNIQKNRMYTNNSINNNHVIALESEAHQLCYLWN
KM
>BCE33L0828 conserved hypothetical protein; possible ATPase
MLFKKLIFDNYKTYYGHQEVDFYIPKEVREEGEKNIILLGGLNGAGKTTI
LKAILYVLFGKRGFSPAEHKRVFSNVINNTFFDEGGRDCSVTLAIETDKN
EEWTLKVKWGFDHNKRLISENRDLTVKKPGAMIGKTVRIDNIDTFNRFMD
KMIPYHAAPFFIFDGEEIKDIILRQNSEEMKEAIHKITGMETYKLLLSDL
SSIKTGIEKNLAKAVDQNKLKSLDANLKEYEEQIQHLEKRKELISSERKK
FDDLINEVKNERNEKITTNSKSREVIVKKQSGLATELRLAKEQFENYFNN
NAINIILKEKTKVLQNRLKLEYDIRQKKMIQDASLMPYEKFMDELLNQPF
TPPLSNEQLNQLKEIGREIWVKENNIEKDISEDHVDIHDISNKDYNYLVN
LPARNNSYVIDLINKIEKLNLELDALEIEIRNAPETIDISAENERIDILT
KKLGELNLKYKSIIAKLNKIKEQRTTVVNQLTRLSDQGADYDALNKRLIY
VKKLIHTMNAYVHEMTKLKASFIREEFSSMLSRLFRKQDEFGKIEFDIST
YTVRLYNDRNQEISIQDRSAGEMQMISSSLIWALTKASDLALPMVIDTPL
GRLDSYHRNHLINHYYKELSEQVIILSTDTEITQDYINFMQEHSYKQYML
DYDQSKKYTVIRDGYFDFIKV
>BCE33L2086 serine/threonine protein kinase
MKWRRILALFDRPLRKNTIVAERYKIESVIGMGSYGVTYVVNDLQINRYK
VLKQLRQSKQRYVSGRKSFEQEKMILQTLNHHAIPSLYDHFVWEKKSFFV
MEYMPGKNFEDYIFLDGHVYTEREVLKILYEILEIVSVFHSKGIIHRDLR
IPNILMKENQISIIDFGLAKLKGEGDERATTYEGEQALMREVHFRSDFYA
LGHFSLFLLYAGYESNEKYEKPWYDELTLENYNREMLMRMLQMKTPYYEN
VRDLKKDVAFALERMEVPCFKSF
>BCE33L2909 possible DNA helicase
MNTPTQTPSLSETMKEWHYALAYEIKHWKTIGGSKISIMNGRFLYTDYEN
TVYVFQLISEVSLPEGSPIRIEFDGEEATGEVLSVHGLEIELKLNDYIQG
EIREAVLYSEPWQLLEQLQERLKEARKDKLKRNRIKRLVDGTSSPKHIEK
MKNPKNELAYRAFYNPTTYVWGPPGTGKSYNLSRIISAHYQKGKSVLVLA
HSNAAVDVLMSEVTKQIEKKKKWTPGEIVRYGYSQHEHIRNHETLLASKL
VETTNGSWGEERLYLEETRQELREKILSYKATSADKKRMQEIESDLRKQR
AKIKEVEKEYIENAKVIGATLSKCAIDSLIYERTFDLVVVDEVSMAFVPQ
IALAASLGKRIVVCGDFLQLPPIAMANHELVRKWLGEDMFYHAGIVESVN
KSEAHPNLFMLQEQRRMHADISKFTNSFIYKNRVYDHPSVSERKELAQLQ
PFAKEASVLFDTSQMGAFSLKDAASGSRFNIMSGLVAMQMMLIGLLDGVQ
SIGVVTPYRAQSRFLSTCIREMLQRTKYQNIPVLAATVHKFQGSERDMMI
FDTVDSYPQERPGVLFFDHKNHRLVNVAVTRARGKFIQLSDCHYMRKNLS
RKQALSQLTAHIERHGDVYDRTTSRQLWERKISKRLRWFMEMNLEEPKGL
LKDILAAKRKIVISLPSTKQVDKRVWQALMRTNAQITVYSDGPVPLKNVK
LQRQNKAFPFIVIDDEIFWAGAPLTSQMMFEGSTEFPYVCARLQAPETIG
VLKGFLDIR
>BCE33L2424 MutT/Nudix family protein
MSNKFHHIVRAVMIKDKKLLVAEYIGHHYFLPGGHVEVGESAESALIREL
QEELGVNCSIKQFLGVIENQWQDKEMLHHEINHIFEIDSEELHIDFIPKS
KEPHLAFHWIDYNRDALHTYKIMPAPSVKELLERKLSDELLNCWISNF
>BCE33L1809 MutT/Nudix family protein
MANYIKELREKVGHDYVFLNFAGGCVFNKEGEVLLQKRGDFNAWGFPGGA
MEIGESAAETAIREIKEETGYDVEINELIGVYTKYFQSYPNGDKAQSIVM
CFSCSIVGGDKKVDGDETLDLKFFPLDDMPPLFCKQHEDCLQDLLEKRVG
VYR
>BCE33L4940 conserved hypothetical protein
MLQHSITKDEIMMIANEFVQGLDPQQTADQEHVATARHLYRSGVVYNVDF
DGYTLSGTVDAEGSVYSVHIPIRNVAESYCDCFAPTQCEHMLAVLLSAAS
SFGQVGDVLTLFKNNTKPSLPPIRTARQVLQSSAFEETDYKSWQSYFDNE
YESFKKEQARLTYKQMYFLMSIFTDFYTKLERKAPRIVVIHELFRLHAAL
YCFQKLLEEIQEFEANKTYSYHQPVNVVRLFVDKVESIVRDLQSEATSSE
SEAILQETARLVHEVFFSTDAYTQERFFIYRHIWSELLHNKEQIREEEKR
IDTKMNPLSKALASSHLLFLNDKDLLAMDLLKKQPASVVSLYFYWLEELL
NAMKWDRAKNWLSFTYKQVKTTIQEQENTIFIKDIVRLFVIMYETYATHT
NEQAGLEMILQELLPYSFANYEQYVLAKKQYRTWTELQLLHGFEAIELLK
EPLKDIEKEAPEAALPLYHLAATEAIEERNRKAYRRAVRYLKKLRTLYKR
LKRTDEWDAFIIHIANLHSRLRALQEELRKGKLIDDQSN
>BCE33L4921 possible transposase
MEIVARDGSITYHNAITEAHPEAVQVSDRFHILKNLTDYASDVLKKALKN
SVKIPVDESKCEETCTQQDNIPQSNINRKLTIKEKYERALQYKADGNSKS
WICNQLNMDVRTYDKLIRLSEEEKQKKFKKQATIIHEEKVQKKMNDINEV
RELKKLGLSNYELLEK
>BCE33L2470 conserved hypothetical protein; possible chromosome replication initiation protein
MKINWTEVKEKIRPQISKPSYETWFTNTTVYLEDDILTIYCPNEFARDWL
ESHYKELVFNTLREMFNTTFEIQFDLCNGEPSNLKDVQKEKLSSWDEVKK
ALRPKIAEKTFMTWIRNTNATIKDNKVIIFCEDAFHRDLLEGEYKNIISS
TIQKITDEEYQIWFEIGSSATSKAQIHHVQNNQRTSGQEESKTIWNKIKD
KMQLKISRPSYETWVKETTARINEDSLIIYFENEFQQEWVKKSYKDLISQ
IAKELTGNTYEIQFELKSNTTSNNEKSTIEDITSELGEQFKVSNNSLKYN
DVDESNKRIRALEEKIMNLEKVIGTLVEKLDAVELKTQLEK
>pE33L54_0045 uncharacterized protein
MSSILVNKYKTLSNQEYQFCVDTALVEIKGTDKESFYRKHALIMLKHKTL
KISIVHPLSAFIFQNWKYKSYNTQRLHAIHLCQFLNYILIDKGNDFGLTS
LAELKFEHGSTFLNQLLWKKKTNSSVKNIERSLVYFYEFLAQKKCSRHYT
IEDFTEAVIPNNPKRKCKLSPFQVKYSNEAANAKYTGLKAIEHTLPPKHI
LSFIRMAIHVAPPVALAIYLSIFGGLRIGEIANITLKDITPYSDPYGKGG
MKIALTSQTLRTDIKDTNGTSYVKKPRLQFVYNIKNILPHLYKDHLRLLK
GIYDSSDLPLTTPLFVNRDGKAMTGKSIRYHFEKVKKCFIEELSKSLNVD
DVIMSINLKNTKWSFHIGRGTFTNLIAKVANNPYEVALSRGDTSIFSALT
YMADTVELKNEIEKLLDELFKEY
>BCE33L2981 conserved hypothetical protein
MAVYRNVQVNFWQDEFILDLTPEERYFYIYLLTGTKTKQCGIYILPKRVA
ELETGYNMETVEKLLNRFVEYGKILYDAETKELYIMNWLNYNPILNTNVE
KCVLRELKTVKNKEFIHMFLRKCLEKEWKIPLLLQHFGMPEEEGNSSLQE
VVEEIEDEEGEEDTQHSEVYKFYEQNISSLSPYIVKELKGWIRRLSGEKV
LEALKIAFEQNKRTFAYVKGILRNWCEKGRGDFRGRKEGEVCLHDP
>BCE33L2400 resolvase
MKKKFLEQQNLNSLKDIWLNFSEENLYSQFLITVMAGVNQLERDLIRMWQ
REGVELAKKEGKFKGRLKKYHKNQAGMNYAVKLYKEGNGTVNHICEITNV
SRS
>BCE33L1345 transposase, IS605 family
MVKTMAKKKAVKVLRKQKKRETLRRFTQKQNIGRAYLTAQEFRLLQRMSH
SSKALRNVGLYTMKQSYLNHNKMATVKEVDTAMQADMNYWGIQSNSVQAI
RRALFTEVKSFFKALEQWKKNPEKFTGRPKFPNYSHSTDKRIIEIYQVPK
VDENGFWMIPMSVAFRKKLGSIKIRMPKNLRNKKISYIEIVPKQKGRFFE
VHYTYEMHVSQMKQQSTTTSNALSCDLGVDRLVSCVTNTGDAFLIDGKKL
KSINQYFNKMICNLQQKNMDNGISKRIVTNKMAALWHKRERQINGYIAQT
VGLLFKKVKESDIDTIVVGDNAGWKQNSHMGKKNNQKFVQIPFHKLIAAI
ENKCIKEGIRFFKQEESYTSKASFLDKDPVPVWSKDNRAQYYFSGKRITR
GLYQSKAGTCIHADINGALNTLQKSQVVELDGNLKVKTPILLEVQKRKAV
ASRIA
>BCE33L2792 conserved hypothetical protein
MLRFDHLVHAVHGTPEEAAKQMQELGFHTALGGEHTIWGTWNSLSYFDLS
YIEFLAVQHEEKAKEAENPLVQETVVKLQDEEGMLQIAIRTDAIEELADK
FSKYGLQTIGPFEGKRMRKDGRLLEWKMLFVKQEENGPKLPFFIQWNETD
EERRKDLRNIGTITEHKNKVQQIETVHYAVKNVRETVQKWEKVMELTASS
VVKNEEWNAECQSVSFGDIQVQFCEPIGKGLVLQYLKNHGEYPFAVEFKG
ENKREYEVLGSLYIY
>BCE33L2461 transposase A
MSVSVSDELQLFAQEIQSFLSPNTLRNLARDVGFVQRTSKYQAKDLVALC
VWMNQNVATTSLTQLSSCLEASTEVLISPEGLNQRFNQAAVQFLQHILAE
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHT
AGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQ
IDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEI
LFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKS
HRYEKKTVFDILGVVYNCTMSDNQAA
>BCE33L2490 MutT/Nudix family protein
MSMPLYYKKIREQLGHELIFIPSVAAIIKNEQGKILFQYPGGEYWSLPAG
AIEPGETPEEAVVREVWEETGLKVRVKKQKGIFGGKEFRHTYSNGDQVEY
IVVVFECEVISGKLKAIDGESLKLKYFSLSEKPSLALPYPNNIFL
>BCE33L1876 MutT/Nudix family protein
MTVLYNKKVHAYVTREKEGVMQLLVFKHRDIPEAGIQIPGGTVEEGETLE
AAILREVQEETGLRHLCIERFLADYIIHVKEKKEYQKRHFFHVTLLTDVK
DSWEHIVSAGKEDEGLVFCYDWVDIAKCPELAGKQGEFLHLLDEVYVK
>pE33L466_0446 hypothetical protein
MAYMRLIINTHQYSVEKAKDYIFQNMFGLKEDSLGKHTYQNLRQVNSITL
YFQHINEYEKEIDYLQNFVKNFQLPVMINQRVGSVEREGEIYLSFIGAME
LEGKILQVSRVSTEIRGGYYVGLEGQRSFSSTTLRGVGSDAKFVVRKLIS
YLKNECFR
>BCE33L4541 pyrophosphohydrolase, MutT/Nudix family
MYKFKDYYHNTVQLSFERYPFSPEPKHVWVVCRYGDQWLLTHHLRRGLEF
PGGKVELGETPEEAAVREVHEETGGIVSDLTYLGQYKVSGKDKIIIKNIY
FATISAVEQHTHYEETKGSVLLTDIPDNIKTDRKFSFIMRDDVLARTMKH
IEEIGCFTK
>BCE33L1618 helicase, SWI/SNF family
MSFTLNKSTIKEVCGETSYKRGEAYYKSNKVIVNHYDETKEICEATVKGN
EDFHVTVEKAKKGDVVARCSCPSLASFQTYCQHVAAVLIQINYNQQAGVM
GSISSRNDQLTNGMFQLFADKPLRPKSKQHRFDTREILDVAFICSPVATK
SGGALLGIQLKLAKTYLINHIREFLSKVEKRETFHCSNEFTYTPDIHSFK
QETDVIIQHLIKIYHNEKMYEDALEVHAKQDESMIFIPPASWNDMLAALS
RAEYVQLKQNEQLFHGLQISKGLLPLHFEFTKGNNGGFTLHIDGLNRVRV
MEMYNNALYDGKLYHLPMEDCMRLIELQKMMSRSNSNQFYIPENKMEHFV
AKVVPGLMKLGTVRIDEGISDHVETPSLKAKLYLDRVKNRLLAGLEFHYG
NVMINPLEEDGQPSVFNRDEKKEKEILDIMSESAFAKTEGGYFMHNEEAE
YNFLYHIVPTLKGLVDIYATTAIKLRIHKGDTAPLIRVRRKERIDWLSFR
FDIKGIPEAEIKGVLVALEEKRKYYRLANGSLLSLESKEFNEINQFVKES
GIRKEFLHGEEVNVPLIRSVKWMNGLHEGNVLSLDESVQDLVESIQNPKK
LKFTVPQTLHAVMREYQVYGFEWMKTLAYYRFGGILADDMGLGKTLQSIA
YIDSVLPEIREKKLPILVVSPSSLVYNWLSELKKFAPHIRAVIADGNQAE
RRKILKDVAEFDVVITSYPLLRRDIRSYARPFHTLFLDEAQAFKNPTTQT
ARAVKTIQAEYRFGLTGTPVENSLEELWSIFHVVFPELLPGRKEFGDLGR
EDIAKRVKPFVLRRLKEDVLKELPDKIEHLQSSELLPDQKRLYAAYLAKL
REETLKHLDKDTLRKNKIRILAGLTRLRQICCHPALFVDDYKGSSAKFEQ
LLDILEECRSTGKRILIFSQFTKMLSIIGRELNRQAIPYFYLDGNTPSQE
RVELCNRFNEGEGDLFLISLKAGGTGLNLTGADTVILYDLWWNPAVEQQA
ADRAYRMGQKNTVQVIKLVAHGTIEEKMHELQESKKNLIAEVIEPGEEKL
SSITEEEIRDILMI
>BCE33L4130 conserved hypothetical protein; possible Holliday junction resolvase
MRILGLDVGTKTVGVAISDEMGWTAQGLETIKINEERGQFGFDRISELVK
QYDVDKIVVGLPKNMNGTIGPRGEACQQFAENLRELLQLDVVMWDERLST
MAAERLLISADVSRKKRKQVIDKMAAVVILQGFLDSK
>BCE33L2646 MutT/Nudix family protein
MGNAINFHRVFGVYGICIENNNILVIDKSKGPYKNRYDLPGGSLYEEESL
LSALHRECKEETGLEVKVIRQVGIVDFQYPSKFKDYTHVHHIAVLYEVEK
SAGKIIVPKQFEGQDSLGARWVSIESITEDNSSPLVCSAVEWLKTNELPL
EVKKYETWKVKNSF
>BCE33L4228 site-specific recombinase, resolvase family (DNA invertase-like protein)
MTVFGYARVSSKGQDLSVQIEELTNKGATEIYSEKFTGATSDRPELQRVL
NVLKEGDTLIVTKLDRLARNTKEGIQIVEELFSKGVSVHVLNVGLLENTT
MGRFFLQTMLAVAEMERSMIYERTQEGKEHAKANNPNYKEGRPKALTTSR
YAECMKLLETNSVKQVANIMNISERTIYRYKREGQAE
>BCE33L2388 conserved hypothetical protein
MKDIPVEIQLNQVTFQLKEHHNFDWLLKLGNVFAVFDQQDSGNISFGVER
DGHKKFIKYAGARTIAYEGTMKDAIERLKSSVSLYEELMHDSLIKLIDHF
PVQSGYVLIFDWFDGECLHSHWSFPPPDKYKNPNSPFYKFKHLPVIKRIQ
SLNSIFSFHTYVEKKNYVAIDFYDGSILYDFHTNETKICDIDLYSKKPFV
NKMGRLWGSSRFMSPEEFELNAIIDERTNVFNMGAMTFALLGGEKDRSFI
KWEASRDLYEIAYRAVNENRTERYASVKEFYEEWVNVSNTNKI
>BCE33L2779 possible 7,8-dihydro-8-oxoguanine-triphosphatase
MISTLTFGYKKPTEQYVLRPSCYAIIFNSTSSKIAIIQKGERFFLPGGGI
EGTETKDECLHRELLEELGWIIEIEQYIGNAMRYFYAEKEDTYYLNDGFF
YIANMVQKQTENCEEDHVLRWMSPLHAVELLIHDHQKWAVEQALLLRNQK
GSPSI
>BCE33L2669 group-specific protein
MNTKLIIVEGLPGFGKSTTAQLINDILSQNKIEVELFLEGNLNHPADYDG
VSCFNKFEFDRLLSNSGDFKEVLLKRVLKKGSNYLLPYRKIKNEFGDQFS
DELFNDISRNDIYELPFDKNVELIADKWNDFAEIALEDNKVYIFECCFIQ
NPLTIGMIKYGEQKEKIINYVMKVAKIIENLNPMLLYVEQDNLELSFRKA
LKERTPEWSAGFVDYYTNQGYGKEHNHSGVEGTIKVLEARRNLELEIFDM
LKMNKEKINNTKYEIDSYRSMLKDKLTIQMVK
>BCE33L3454 group-specific protein
MLFLKRLMNLVGIENLVLPEDAELAKSLRNKKENYIKNQFLLTRIASKKN
VEGKTKEFYETCKEYEACGEKAKECDKQLKELIFKKKENDRVQHVVERMR
EIGIKDDVIQKVLYK
>BCE33L4715 DNA primase, Toprim domain
MIYVEKVIIVEGTSDRRKIESIIREPVEIVCTNGTIGLSKMDELVDQFFD
KEVYVLVDADDAGEKLRKQFRKEFPQAEHIYIDRSYREVATAPSSHLANV
LWGADIDVYTEYLR
>pE33L466_0251 conserved hypothetical protein
MKIVAWNAGMAFRKKIDKILPLKADILVISECEKPEKWGQIDKEKGINDF
LWEGDNPNKGIGIITFDERYQIEIYPDYDRSFRYIVPIKISAGNQEFIMF
AVWSQKGEKRYSSYIGQIYLALEKYASLLKEPCIIVGDWNSCKLYDWIKR
VKTHSEVVEFLEGFGIKNAYHHFSEEEQGEESKTTHYLRKEKARPFHIDC
LFTSEIFLNELESIEIGSYEEWIEFSDHMPISAEFNK
>pE33L466_0196 RNA-directed DNA polymerase
MNTSERASAHVSYTNWNTVDWKAVQMYVTKLRQRIYRAEQLQQQRKVRKL
QRLLMRSEANLLLSIRRVTQQNKGKRTAGVDGHTALSRRERNLLYEQLKK
LNTLQHRPKPAKRIYIVKKNGKLRPLGIPTIKDRVYQNIVRNALEPQWEA
RFEAISYGFRPKRSTHDAIRSIFNRINGGTKKKWIFEGDFQGCFDHLNHE
WILKQTSYFPGRKLLKRWLKMGYMEQSFFAETQEGTPQGGIISPLLANIA
LHGMEETLGITYKKNYKANDSYIMNPACKFTLIRYADDFVVLTETKEQAL
SVYMRLRPYLKDRGLELSPEKTKVTHIEEGFEFLGFLIRQYQTEQGNKLF
IKPSKGSRQKAKKKIGDTLRVMRGQPIGEIIRVLNPIIRGYGQYWKHVVS
KKIFGTMDSYIYWRIGKHLRQLHPKKSWKWIYARYYRHPHHGGNAWTPTC
PKTNIQLLHMSWIKIERHNMVKFKNSPDDPTLKEYWKKRDRKVFNTENTM
DRMKLARKQGYRCAICKTPLQNGEKVVVKDMPVPQHLISSNLNLKLVHLP
CLY
>BCE33L0241 conserved hypothetical protein, homology to pX01-98
MFNVFREFDKIAEELMHAAEKFRNADELYDGNLVDGSLSNEKIEAGAMGD
TPPEKSMLRKIWDGIYVGSGQVIGEIVEGFDSLDDTVTKENIKYAIGHPI
ETVSTAWNTVSDSFMNDFWHGDAESRTKWGTSIFMGLGLGWIGDKGISRV
GKVTTLGIKAEQSISHFSNKLQMREPFAYAGDFRDVPKTSFNSLEEARNT
FMFANPGNLNTPRFQEYISQVEEITNRKIPEKQRELLQERIENNSYERLS
KEEVAKRRSEFGRLKDDLREEWELETNQKWPTYEEEILSRRGIPLRHVDD
PYDAHHFIENAHGGEHEWWNIHPAKFPTEHQGGIHGKEGISKEIFKKK
>BCE33L2431 MutT/Nudix family protein
MYKHTLCFIKRNEEILMINREYDPVKGLWNGVGGKIEKGETPLENAIREI
KEETNIKVTHDQIQFKGIIKWEDSSYSGGMYVYLVELLHEFTYHTPKKVS
EGILDWKEITWILSGYNYGVGEMIPKFLVEVLYNELILEHNFVLSNHKLI
DYRNKELVQDRLLTSLYNTIRL
>BCE33L4086 transposase, C-terminal region
MCGEKLYVSPVLDLYNGEIITYTIGSRPTYSLVSKMLENALEHLPETHQL
LMHSDQGWHYQMRQYVRTLESRAIVQSMSRKGNCYDNAVIENFFGIMKSE
FLYIKEFENVEHFKIELEKYIDYYNTKRIKAT
>BCE33L2275 hypothetical protein
MLVLTFYKDGYHSIVDVTAKFSVNSGAIRAWKISYEVNGEDGLEEVLSWK
KGIKIDLSVERNCH
>BCE33L2988 conserved hypothetical protein
MDFKTVMQELEALGKERTKKIYISNGAHEPVFGVATGAMKPIAKKIKVNQ
ELAEELYATGNYDAMYFAGIIADPKAMSESDFDRWIDGAYFYMLSDYVVA
VTLSESNIAQEVADKWIASGEELKMSAGWSCYCWLLGNRKDNEFSEGKIS
AMLEMLKNTIHDSPERTKSAMNNFLNTVAISYVPLHEKAVEIAKEVGIVE
VKRDNKKSSLLNATESIQKELDRGRLGFKRKYVRC
>BCE33L4380 MutT/Nudix family protein
MYPQVKAFGIAIHHDRLLVQEYHTSDETYYRPLGGSIELGEKSAHTVIRE
FQEELHTEVEITDYLGCLENIFHLNGEIAHEIIQLYSLRLLDTSLYEMEI
MKIHDDQTISYAKWIPITAFIQEKKVLYPDGILTYIQNKKDEIL
>BCE33L1461 5'-3' exonuclease
MKKVLLVDGMALLFRAFYATSVYGQFMKRQDGTPTNGIHGYMKHLLTAMQ
AIEPTHIVTCWDMGSTTFRTESFSNYKANRAAPPEELIPQFDLVQEMTAK
LSVPVIGMKGYEADDCIGTLAKQYCNEAEVYILTGDTDLLQLVDKNVTVM
LLRKGIGNYEYYTPEKIMEEKGVEPWQIVHAKAFMGDTSDNYPGVKGIGE
KTAYKLIQEHGTVATVLENVASLTKAQRTKIESDLENLNISLQLAQIHCE
VPISCSLEEGLHTIDEEKLRFVCEEMNWGRPEILINML
>pE33L466_0133 hypothetical protein
MQSLCINHFLALWNYTYKETEKGLTYPSCSAKLTQLKKETRYNLVKGNRL
YCSSIHIKKPRRCFFTFFQETK
>BCE33L0839 phage replication protein
MAVYRNVQVNFWQDDFVLDLTPEERYFYVYLLTCSKTTQCGIFPFPKRLA
EMETGYNRETVDKLVQRFVDYGKILYDVDTRELFVLNWLRYNPVTNTNVE
KCVLRELKGVKNKEFVHMFLQKCVEEEMNVPMLLAHFGMPGDLAVDDVEP
VCEETEEEEAIEEETGSRVFSFYEQHFGSLSPHTVEELSAWMEDLSEELV
LKALQIAFENNKRTVAYVKGILRGWHGKGFTKVCEVEADTAKFRKKDSSV
STGETEEFLARCEEWERNAPSEEELQRFLAERGWRP
>BCE33L3285 conserved hypothetical protein
MTLQVPTILIGLGGIGSTVTHQIYERLPEERRKKVAMHVFDTDVNTLSKF
DHIRKFKTQTSSSKTPREYIAGDPTIPEWFPMDPTILDKPLTEGAGQLRV
ISRLALRAAMKEDKLTSFWQEIEKIFPVTSDQTEYGVRVIIVTSLAGGTG
SGMFLQIALYLREMLRKKLQHHNILIRGAFLMPDVLVKTRTVSAKEFETV
QANGYASLKELHAITLGSTGELSKRGGVTIELEYRPDQVDEDGRTNHTIK
QHHLPYNYCFLYDYENLHGHHLHNLSDYMEQMANTIYLQLFSPMSANHFA
QEDNQIQQLAESSGKERYCGAGTAKIIYPYEHVLKYCALKWAVQGLDESW
LHLDQLFQEKKHRYDQDVKRGMQREKPERGKSYLEDLEHLATRPEQAHIF
YRQMYNETREGAEGGKVGVAKSKLFLEAVESYVQRTVQKDEELNRLQHEC
KISAAKLKMTEQMKGEVARVDHAVRLYAYAIPSRVHEHVTTLLYDMIESD
RFSPSGSEGQSYQLNTWFLKKTDSVHPVAARFMLYEIRKQLVEKMNRLHE
NNEQKRNLIQNYDKKFNVSNIDGTVTAVRRVEIAQQQGWFGKMINNQQRL
FKKEFEDIVTQYVHKLNEYRKEMLLELVYQSLYQAVNKMIQYWERFFDNL
HETRENLLFEIQKRSKEFEGKTNPTNVYVLAEEKLQEKIWQDMQQHLNLG
ILPKDICAEIYMSLYGEYCRDAKTEEIQSKKVEDFYREHILNYCYDELQI
RYRDKLELNIVEALRKEADYKNRDRDEYVREKIEDLFHLASPFVPKVSHH
RELQYWGIHPSLKKELQEELMQEMFKEKDTVNEAFSPFEVICYRAHYGLS
LQDFPKLSSGHIANGFMNDKGDYFQSYYRRVNKLNSKKSSLTPHLDKYWH
LPAFMPDLNATQTKLDYDKCNRALLYAYIYRWISLVAVDGQFVYQYNGVG
RSFLIQSMGKNISSESYKLHRALLHNPFIYENILSRFEEEQEKAMIQGGH
LYTHPFVLGAQDIRWLRKEHVHNILDMILMYDREAKYDPTLEETSDELLR
LFLDEIELYFQNYYGTGADMVAKKEKEMFIKQLWDRSYAKGYVDPNSAPY
KKWQNLLSVHDEEETPKTNV
>pE33L54_0049 recombinase
MFFVIILLETFSEGCLFILNTRKFGYIRVSSKDQNEDRQLEAMKQLITDE
RDIFIDKQSGKDFNRDQYQLVKRMLRKDDILYIHSLDRFGRNKEAILQEW
KDITQNIQAHIVVLDMPLLDTTQYKDSLGNLITDLVLQILSWLAEEERTK
IKTRQREGIEAAKKKGKHLGRPKTEITQEFIDSYNQWKEGKVTALYAMKR
CNMTSPTFYRVVKRYENREG
>BCE33L2410 MutT/Nudix family protein
MKKQLRIIYRRYFKMFIVNVEGAIRKNDKWLVIERSKKEEHAGGWLSLVG
GKLDIEGNFSDILERTVMREILEEVGVSVKDRLNYIHSTSFVTDIGENVV
NIVFLCEYESGEAFFKSPDEVEAVLWLTTKEILNHPNSPIYLKESIKHAE
ALVRIHSS
>BCE33L2467 conserved hypothetical protein
MKVVVKELRPLEVAFIRRTGSYFEPQDHWGKLLNWSIENKLYPPEQSFIG
ISLDNPELVASHMCRHDACVTIPKNFEKEQHEDVQFKSVDGGLYALYQFY
DEPHKLSEVYRYMYAVWLPNSEYSADYDRDNLEFCMNNIAEDLEGKLKVD
LFVPIKKNK
>BCE33L5022 hypothetical protein
MSNISLAEAVKLKSVLSKRIHELEEEMNQIGFVEIEKGDTLPKQTRTLVQ
VEQELDDVRADFRLLDKLMYEANIRNEVNFNGENIKIVEAIELATQLRAK
ARKCKEFGVSKKEEHVYSYGETSSLLKVAMFDPEVYRVKGLELERQANRL
SNLINAKNYAIELDFDGEKYF
>pE33L466_0466 possible group II intron reverse transcriptase/maturase
MSTTLRHAEYYDMQKVFDDLYERSENNAIKGVNLYKYIISKDNILLAYRN
IKANTGSKTEGTDGITIDHYKMKDVDSFVNNIRKTLVDYKPNTVRRVEIS
KPNGKKRLLGIPTMRDRLIQQMFKQVLEPICEAKFYKHSYGFRPNRSTHH
AMARCQFLMNRGGYSHVVDIDIQGFFDNVNHSKLLKQLYNIGITDRRVLT
IISKMLKAPIKGEGIPTKGTPQGGILSPLLSNVVLNDLDWWISSQWDTIK
IRKPHTVQTKNYRLAKAKLKRMYIVRYADDFKIFTNNHQSATKIFHAVEG
YLKNQLHLNISNEKSTITNLKRKSSDFLGFSIKSVKKSKRYVANTHVSEK
KKKDILEKAKAKIKAIQKNPSGKTVQDYNSYVLGIKNYYKIATHVTIDFA
EIAFRLSKTLFNRLKSIGKYKVPTNANALYKRTHRNNYRTFEIAGNHVYP
LADIQTRFPQSFSQDICNYTIKGRQKLIKSLKGNIANEMQKMLLSSNRGE
SMEYTDNRISRYSMQNGKCAVIGIFLIAEDVHCHHKIPKGMGGTDEFNNL
IIVHEFVHRLIHATNEETIRTYMRILQLNDKQLNKINKLRKACNLVTLV
>BCE33L0801 methyltransferase
MNNHSKTPACIYTYAFREEERALCYLEMRSFFGMESHVNILKSDVKIDPS
RSAFIKERVEVMYEGDDLESILKQVEQIDLAGATFKVIFVKINDLVGENK
IEYGERRLIERDLGMHIEGEADVRNPERVFGIVPLGGRWYFGHYVESEPV
WYHHIKKPHSYSTSLSTRVARSVANIAVPNPEGVRAIDPCCGIGTVVVEA
LSMGINIVGRDINPLVVLGTRKNIAHFGLEGTVTKGPIEEITENYDVAII
DMPYDLFTHATPEDQLSILSSARRIAKKVVVVTMETMDDMIHEAGFEITD
RCTTKKGSFSRQILVCE
>BCE33L4087 transposase fragment
MKTVCPYLRRQLFLTFLPLQRFLFGKITRPDKAQVVYELRHKYSVKALVE
LATIPRSTYYNLVKKMNRPDVDADLKAEMKAIYEENEGRYGYRRIRDELT
NRGQKVNHKKVQRIMKELGLKCVVHMKKYKSYKGKVGRIAPNILERNFYT
DAPNQK
>BCE33L3468 possible phage integrase family protein
METTEFHDTIQAFSIFLLNKGRKPSTIKRYVYDIEDFGHWLEKNKKLPSS
NIWATLCTKDYEDYFSDLKKNRHYSEKTMHRVFIVLNRMHHFLNIPNPLK
NMEISIQPDRTLRNEDFISSDEEKRLKHIVTSLEGLSEKQRPVRPLLMDR
NIAILNLLIDYGLSLQELTALTMHHVHFETNTLSIPATAGVERTISLTNE
DKKQLYTYYKSIPEPVRPKYHSNDPLFVAFDFNRGTYRWVYENDAPKGLT
EIAIQKMIRLEVSRANLRKGISGQHFRNTYILNLIKKETPESEIIKLAGF
KSKISLKRYYQYAENGKNALS
>BCE33L2264 transposase
MMINKAYKFRIYPNQAQAILINKTIGCSRFVFNHFLSLWDHTYKETGKGL
TYGICSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPC
FKSKKNNVQSYTTKQTNENIAVVGNKIKLPKLGLVRFAKSREVEGRIVNA
TVRRNPSGRYFVSLLVETEVQELSKTHSYIGIDVGLKDFAILSDGTLYKN
PKFFRLLEDKLAKAQRVLSRRMKESSRWNKQRVKVARIHEYISNARKDYL
DKISTEIIKSHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRTMLEYKA
KWYGKQVIVVSKTFASSQLCSCCGYQNKDVKNLNLREWECPSCRTHHDRD
INASINIKNEAVRLLTARTAGLA
>BCE33L2511 MutT/Nudix family protein
MNRSVLRVEVIIYNGDNSKVLVQCDENETFYRFPGGSIEFGEPAKEAIIR
ELMEEYDLKIDVQELAVVNEHIFEWNNKKGHHCTLIHWGTVKERVTNEIR
HKEYEDIILIWKSIEELKEKPTYPEGIVSYLEENNHNIVHFVSKNI
>BCE33L3909 MutT/Nudix family protein
MGYVEELRKIVGHRPLILVGAVVLVINEHGYVLLQQRTEPYGKWGLPGGL
MELSESPEETAYREVYEETGIEVKNLRLINVFSGANYFTKLANGDEFQSV
TTAYYTDEYDGDFVMNKEEAVQLKFFPLTELPDYIVGSHKKMISEYMKIM
EKKL
>pE33L466_0013 hypothetical protein
MLNNNVLDELFNEVAHEDESYFYYFYTMKMAATSEYIKFGPGAFLENQHQ
VICFTNKRIMMLELNPLTGKFNGNKIIIDIEDVEGIKVKRGLIKSKVIVT
LKGNKGDIVMNPNSFTIGLSNHKKNLVKLQEMYS
>BCE33L3509 conserved hypothetical protein
MMQVTTWFIVTLFVFGAIKVLVSSMPTSVVESIISKFELHQKLEEENTSI
SIDGKNIEGETKLQVIHEFNEALFLDKHYFPPRGEGIPIVIDTKKGNKEI
RFSLYSYEEHVDVIKQYKKKVVAYRLRSKSLQIRAPLAITKDYA
>BCE33L4494 site-specific recombinase, phage integrase family
MEVVEALKDISQIEAMKRYLKEHSQRDYLLFVIGINTGLKITELLSMKFE
DVLHEDGTVKEFYSLPVKDEKFKQDIYLNTKVKEALLEYIQSIDVKRENY
VFQSNKTTNSISRQQAYRVIHSAAEAVGIVGKIGTNSMRKTFGYHAYKRG
IAIALLQKHFHHATPSETLKYLGISKDEEFKTEIDVDL
>BCE33L1532 group-specific protein
MAQNKYRVTFISPSEVEQRTVMAASSLPDLIRKVESIIADPNGYFVNDKK
NNCYFKVIKDNVAFIQYELLFSDKEIHIEKLKHIAPAILKQLFKKINDPE
LYALALLDVDIATKEYVLEEMDSELRIRVETELSKKWEAMPTEIVGAQEV
LLEALASFIQD
>BCE33L1788 group-specific protein
MQKWIGIMGIIFILTGCKMSEAPTNLMEAPANEKWINELKEQIDKDLPVN
YRLLTPMSNKDKQMIWSMDFKQDNKKEAIIFYKLPNADHSVYLAVYEKNG
NGWKTKSTHKFNGGDVDIVEVGDFTGNGKRELLIGISVDRESLKHVMYVF
SEENEDMREIYNRNYTKLFIEDLNKNGLKDLSLVTYEKDEKLIVEFIEQF
KTLSEASFDPFINSIQRIQMGRISKSLKAIVIDAGVGAHSGITYVAKFDE
NHYEVLPIDGKEDLFNEYVVESKDVNEDGIIEFVRTVRPKGWEDKPHGDS
PLFERYIQWSESGIKPIEERYIDIEKGYYVKIPKELIGKITISDEQKESN
SQKFLETSTNETWLEVHMFKRKEWFGIKGYSAAVKTASHVYAVPKQPHFE
KVKAYVKPLADYQQE
>BCE33L0333 conserved hypothetical protein
MTGNVLNYFAGGNTARGFHNLYEENLKGLNRLFILKGGPGTGKSSLIKAI
GREWVEKGYDIEILHCSSDNKSVDGVIIPKLKVGIVDGTSPHVIEPKMPG
VVEEYINLGVAWDSDKLREQKVEIERFVSEASKAFQAAYARFNEALIIHD
EWEKIYIDNIDFNKANELTDQLIQKLFADKGGKQSLVKHRFLGAATPKGA
VDFVPNLTEGLPHRYFIKGRPGSGKSTMLKKLAKAAEEKGFEVEVYHCGF
DPNSLDMVIVRELGFAIFDSTAPHEYFPSREGDEIIDMYALIVTPGTDEK
YAEEIRDVSIQYKTKMNEAMSFLAKAKSVRDKLERIYIAAMDFSKVDAYK
EEIQKEFERIAISVTEKNK
>BCE33L2364 phage terminase, small subunit
MLDDEELTEQECYFCLYYVKCLNGTQVALKAGYTKSSAHVTSCRLLRRER
VASYIREIKGEMVENIFIEAMDVLNEYIKIAFADITNYVTFDQKDIEVMG
PFGPVKGEDGKPVMRTISYADFNESDIFL
>BCE33L2042 MutT/Nudix family protein
MGYIEELRKVFGSRPLNLAGVAVAVFNEQGQILLQQRRNGIWGVPGGFVE
LGESTEEAGRREVFEETGIEIGTLQLISVFSGKEFFVKLPNGDEFYPITI
AYLCKDIKGGLLKADGIESLSVQFFDFDKLPENISPFIKKLIEQNLVSI
>BCE33L1342 site-specific recombinase, phage integrase family
MREMKQAGQPIQNEKQLETIKNILLQSSKRDGLLFVLAVNSGLKVSEILQ
LKVSDVIDENENVRHSILFYNEKVKKHKWFAVNEDLQHAIEDYMKERKTW
KRNEPLLKSQKGTKSITRQHAWYILNKAAKEVGLEGISSHTLRKTWGYCA
YKSGVDIAFLQHFFDHSTPSKTLKYIGIA
>BCE33L2765 MutT/Nudix family protein
MGVDLTFKLEETCFNYRVGAICKHDNKILILQGDGEDFWYVPGGRVKMLE
NSEDALKRELAEEIGVPIKRKKLIWSVENFFTLSERKFHEISFYYEVELH
ELPANGADQYILEEEGRTYLFRWVPVAELHTYNLQPAFIKEKVKDVSVHT
EHIVLQK
>BCE33L2799 MutT/Nudix family protein
MYKYTICFIRKGNKILLLNRNKKPNMGMWNGVGGKIEDNETPYEGIIKET
LEETGIDLPSVTYKGNVVFKSKDESRGSEGMYVFLADLPDGVHMDTPLST
AEGLLEWKEIDWILNKDNRGVVSNLPKYLPIVLTEENKLEHIFTYDNGNI
IHYTTSFLTDDNANKRYEKQLVSQ
>pE33L466_0256 hypothetical protein
MIGETFNNLLISLAGLIVGLSAIYLIIKYIFKLTVEKAIPNLIAYASLYL
PGVITFGLVGYVSNSWIFGIVMGVIVEAIRFYGFCRILNQISSSLRAMKV
KFAIANAKSKPLEVIETAEVNIEKENIVQEQPVIELKKEQKELIKTEESE
IIDDADDVENLDEMEGLERLAVLERIEMVAETKGELLIKNASTVKRERKQ
AETTPDTQEIYEQLQLVLVGGDSEKNNEEISFD
>pE33L54_0007 hypothetical protein
MKKGYQGEIEKITICIGFAILALSLLNVHGALSVTPAFVTGISLAGCALS
FADYTKVLIGEYDGFITSKITSFTVSLMYLIAALSLICLPYTNTILSMKS
SVLDKLSTSSSLLALGCVFITIGHNNRKRFKKKAIEQITNQNKRFEKDKY
YEEILEQKNNKIKELEKFIQEFEKTKDSA
>BCE33L1627 MutT/Nudix family protein
MSNNWTDIEHRIYTMCMIQRKNEVLLIQRPDHLGFPGYIAPGGKVDFPES
IVQAAKREVKEETGLLVLNLTFKGLDEYVNPKENVRYMVFNYWTDSFKGE
LLVDPPEGELLWVPIATALNLPMQDWFKERFPLFFEKGTFEIQRVWDRDL
DKQVAITITHT
>BCE33L0776 collagen-binding surface protein
MNFLRKSFNQKIKKISSSFIVVLLVCMNFLIHLPNKAEAATTELKGLGDV
SYYNAVIFGDHSATSADIEGAMAIQKNMNASSYTVLAAATGAHNLAGATW
VEEGYPSLLLGGQFTKAGTGQVIIQDGTVAMTKDGDLEGAMKSSYDRISY
KEQAEIDAKFKEFRKDVNSVIEDAGQLHTDKPKTGMTFGIGEDVNNPNIY
VSSGQNGKKEFEVKDVFLPNVDNKDFIVIYSDAEEVSFGGGAILYDTTDK
GEFTLVNTSQVYDPNSFFTELASKVIWVFPNATKLTTEGYGVVGSVFAPN
AVVDTKGGSINGQVFLGGLHQRDGFEVHNFKFNWPKWNKPSVEKVTGQFE
IEKVDASDETKLLSGAEFEVYKDGKQIDTLRTDKTGKVISKKLEPGKYTL
KETKAPEGYKLLKEEIEVNVEENKVVSVIVKNVKELGSLQVIKKDAESKN
VLEGAEFRLKNENGQIVGGIKTTNKDGVVTFENLVPGKYTLEETKAPEGY
KVKEVKVEVNVIANEVVKREVLNEKEPSKGPENPGEETENPGEETEKPSE
ETEKPGEETEKPGEETEKPGEETEEPGEETEKPGEETEKPGEETEKPGEE
TEKPGEETEKPGEETEKPGEETEKPGEETEKPGEETEKPGEETEKPGEET
EKPGEETEKPGEETEKPGEETEKPGEETEKPGEETEKPGEETGKPGEETE
KPGEETEKPGEETEKPGEETGKPGEETGKPGEEIEKPGEETGTSGEETEK
PGGETGKPGEETGKPGEEIEKPGEETGTSGEETEKPGGETGKPGEETGKP
GEEIEKPGEETGTSGEETEKPGGETGKPGEEIEKPGEETGTSGEETEKPG
GETGTPSEGMENVDKENPTLPEKGQGASHAQLPATGHDMNYLPFIGLALV
LLGIRLRFMIKNS
>BCE33L5047 UV-endonuclease
MHKGCVFIMIMRFGYVSHAMALWDCSPAKTITFTSFQKLSKQEREDKLYD
VTKQNLEHTIRILHYNIAHEIPLYRLSSSIVPLATHPEVEFDYIGAFTPL
WRKIGALIKEHNLRVSFHPNQFTLFTSDKPHITTNAITDMNYHYKVLDAI
GIADSSYINIHVGGAYGNKEKAIERFHENIKKLPAHIKKQMTLENDDKTY
TTAETLSICQKEKIPFVFDYHHHMANLCEEPLEELLPAIFETWSHTNIVP
KVHISSPKSKKEFRAHAEYIDLEFIKPFLHVAKKINHNFDIMIESKQKDL
AMLQFIHELSSIRGIKRISSSTLQW
>BCE33L4745 conserved hypothetical protein
MKKKTVGYLAIAGALSFGIIGGVGIPAFAATNTPAAEQATKTAKKDLDDA
TKQKVKTIMDDTKKQLEELGVKLHEKEKRKEMFAGLDEQAKEKAKSILEQ
EKSGKLTCEQAKEELTKLGVKLPEKGKRKEMFAGLDEQAKEKARSILEQE
KSGKLTREQAKEELTKLGVNMPEKGKHKDMLADLDDQAKEKAKSIFEQEK
SGELTHEQAKEELTKLGVKLPEKKKHEDIFAGLDEATKTKAKAILDNEKK
QLEALNVDLPHHKFFMNKEDK
>BCE33L1527 conserved hypothetical protein
MDKQQYDTIKLQINQEKEQILKEVYELTAEKRKIEQKKEYDLYVVKSRSK
VVQTGQRIMAGMLSSHTFSPERIEEWNRKIKKTEDFIQKNESLLEQVKEK
ERVIDEMYQENCKKLAKVREKQEEKMLLDLKMCMNG
>BCE33L4970 hypothetical and glycosyltransferase fusion protein
MDNQFSLRLQEVKAKRQWLNKRPDKWDQQLGVEEISISKWFKQANAPITF
KEDTNIFTSNLVDKEYTYLSYFETNTNFQLTPKNKQVQLKAGKEFKVEIT
GEKDEQVEVSLHVILYGNNVKKVNKRISFNEDMLISIPQDVDAIRFALRI
SGKGEFQIHSIHIDDIVLWDSPEREGVNSFGLIGGTSWYVPNQSDITFRK
KSADFYVDLEEGKHIYLPYREGNTNFAGEPQNPIQLHNKNLAVLFEGIKD
SDVNVKLFLIFYEEDKRVKIEQIGLNDKRLINIEDNISAMRLAIRVDGKG
IFKIKNIAISGDGYWLNNNITFNQKMQSSYDYHFELSKETLFNWEKDNKI
LYHDAQNVFESRLIGNQFVYVSCFEDIGIHEVSEKSLLHPKDKYYYEFYV
GAEIAGDVEGTLFVLEYKYGRKQKLHQVPFNKKTILKFNKNTTDIKCFIR
INNEGYFRNLHIGINENAIKITNSLEVDLQCKNWFQTGNLLELSNEGNDF
VGESHIASDKKNYISYKEKNNKFTELPTVSLMPIQQNHVYEFHIRADVEE
GLEVLPMFIGYSGNKKVQVLQLKLNMSTMVRPHPDVKEFRIAFRISGLGK
FKIQHYTVKEMEVVNVNSEVHWINRQETSILEMVPEKPLKDLKMAVIFDE
FTTASYKEECELITFTPENWLEVLNHNMPDLLMVESAWQGNGGTWNKRVG
YYGEENMQPLFALLKWCNENNIPTVFWNKEDPVHFNRFIETAKRFDHIFT
TDENMIPSYQEMAGHNRVYALPFAAQPIIHNPIKIVEERENKACFAGSYY
RHHEERSIDMDRVLDKAAKYGLEIFDRNYEKNKKGLMPNHRFPERFDPYI
KGSLKYYEIDKAYKGYKVMINVNTVKQSPTMFSRRVFEGLACGTPVVSTY
AQGVENIFGDLVYISENENEIDKAFDSLLNNERTYRQKSLLGIREVLSKH
TYTHRLKYITEKIGMRVIQELPRVTVLAFARSKEEFSHILEQFERQEYKN
KELNVLVDTFTGYLEIFGKYNSANVKTFVRSYMHNYQNILEWIDTPYIAY
LSKNDYYGRNYLSDLMLSTTFTDSDFIGKNAYFVVEDGKEVGECNKQSEY
EFVGSLSPARTVAKTNVFTKEALTDVLDNLEAEVDFNIYFRYGKTLYSND
KYNYLSGAYTQGNRKRLKNLIKQIEL
>BCE33L1265 group-specific protein
MHEKIAIKTMTDYHLYDFMRCPHKFYFRHIKRREPSSFEWQQIAQMIVNR
IINEYYTLPAGQQTKIVLLLLIEKYWKKVRINMFASKTEYYIVLAKLTDH
LLQFVERDDSQTPPLFLYEKFQTYMEELGVHMSLTLEVGEWSTESFVIKK
YVVDADEEMLALCQKLMTVFSYKAFGILPGKIEVINLIEGTKYEYIPKQE
DITTGMADLSRMKEMLQQPEHYTERHFRSECISCAFRSECQGEEVKKEAK
QKKNIVH
>BCE33L0174 group-specific protein
MKKALGIAVMGSMILLAGCNGNKDTKEPKEKVEQSTEQTEEMKEYRAVHE
KYDLKMNKEINNALQLFEVAKEKGGKEITTDTYKEDVQKVTTSMLEDIDH
IRKEIRVPKSKEQEHEVYVGFLNESEQAMKKLQKLAKEEDSSLIRDIEIN
FATASTYYKRFQAETKK
>pE33L466_0047 transposase, C-terminal region
MPVYDITYLFYRKGKKAYLSCVKDSTTREILAYHVSSSLQMDIVYQTLDN
LKERLGEVIHPEALLHSDQGIHYTHPEFQKRVREMGIRQSMSRRGNCLDN
APMESFFGHMKDELDYKDCQTFESLELNIKDYMEEYNYNRYQWTLKKMAP
IEYRNHLLSA
>BCE33L3689 possible methyltransferase
MRVVSGKCKGHPLKAVPGNTTRPTTDKVKESIFNMIGPYFDGGIALDLFG
GSGGLGIEAISRGIDKAIFVDRDNKAIKTIHQNLESCRIQEQAEVYRNDA
ERAIKALIKREMSFDLILIDPPYKDQKIVSLISVMDQHGLLHNAGLIMAE
HGNDVVLPESIGRLVKVRAEKYGITAISIYKYEGEGTE
>pE33L54_0051 hypothetical protein
MGFIHKTYLQITLDEKDKEMIKNKAVELGYKSTSAFIIDSAKTHFKLNVD
MKVYRDLTKEINYIGKNINSLIRRINTDGIYTDSDIDFLRVNQKKIVNLI
NTEYDRLIDLKTKFNSDSLSKKQKEKLIQSLAENKIQIPKKLVLEEVYEK
IKEDFVYIIECIENSPEQNKEVTEYVWQYLYGDTLYKLDDNQLIKLADSI
FIFAQKLKFKLSKLDNIFSDDDWFELKDILDEYEIY
>BCE33L3396 site-specific recombinase
MNAIIYARVSTTKEAQETSLLRQKDELLHLAERYQMNVIKVIEEQASGYT
IERDGILEVLDTIRDEQIDVLLIQDETRLGRGNAKIALMHCLHKEGIKVY
THTHNGELQLSDSDSMVLDIIGIVEEYQRKIHNLKIKRGMQRAVEKGYRP
ENNLKNRHLSVGREKKEVPIEEIVRLRKSELTFEEIAATLRGFGHNVSKA
TVHRRYVEYTKQLLDKEE
>pE33L54_0017 prophage integration/recombination/inversion protein
MDKKQHLIDVQPIRSKEQLEDMKWSLKRHCSDRDYILFLIGINTGLRVSD
LLKMETSEILKLKRKKRKEFKVKEGKTKKERIINITSIFEEVLPYAEDLK
STWLFPSRKGDKPISKIQAYRQLQKAGDFAGVESLGTHTMRKTFGYWFYK
QTKDIAMLQEILNHSTPQITLRYIGINKEEKDNVLDTFRI
>BCE33L1534 conserved hypothetical protein
MTQIMYHHTAINVLSLLQNMSNNKMDDMQLEAEFKKIEKQFQVKYEELVD
LYNRMVLFQIDIGKHGGMRAYEKSTITWLKSELELLYEVYQFSQRHGLNI
INISKYVSKKELNLFPKTESQLQNTYYKLKKCEIPFENIEKQKPGRKRKY
TPVKEPIVKMKIENKQELREEVPNTENEKNLVTVISGIVDNFETISQCSE
RKEHELHQFMEGIYKLSSMAAERSKDEKNARGLEGELHVLRAENERIKRE
KEELVHDIKEMTHHLIHFITSSDIDQIRTLPFFVKECKQDLHKLGLYNAQ
DGKMKIMVDRSGQVMTVTQ
>BCE33L1984 group-specific protein
MEEIIVTYIKQSNEEIDIAIPNDVKAVQIIEAILHHEKIDETLSSTYEIR
VAKDKEEWHSIRNDETLRDNDVWDGQYVILYKKGSIIPSFEMLPAEEVYN
EPVKQDISTSEEDYVWKIIE
>BCE33L3461 phage integrase
MASFRKFGDVWEFRVRFKDPYTQKYKEKSKRGFKTKKEAQLAAAEEEKKL
LNGLEVEITPTSLKHYLRDWLKLFKQDNVRKNTFILHERNIEKHIIPYFQ
NMNIKELKPMMYQKFINSLTDQGYSKRTVQIIHGTMNNAMKKAVSLKKIE
NNPCEEVVISNKNNKEREGLKYMRSEDIPLFLKTSYQYNYIYYIFFKALL
NTGMRKGEAAALQWKDINLKEHTITISKTLDFTAKTKEELFGDTKTFTSK
RTIMIPKSLVDELLAHKKWQNANKLVLQDAYEHELDLVFSRVDGKFLPKS
TLFNAFSRILKKANLPRLEIHSLRHTHAVLLLESGASMKYIQDRLGHKSI
EITSNVYSHISDKINKDSISGFEAYMNNVLG
>pE33L466_0325 conserved hypothetical protein
MVKFTFLLKPIMNKIIGLISADFLEKLKMRCKLNKIKSFQKQYEDTFVDS
KTFQDFLNREENALLIFDYVFGSKFKSSSRDDFVEQLSRSAMNEINKFRN
SVDLKEIEDHPVVKQYLRELIIYLQEYRDKSFKQNEMSMLANIQNSIVAG
NENLSEYFEKNLVEIQQRAYIKKYTDEHLEELLNQNILDLGKRYNSEANV
GTDFNVVFDSLIGDKKIFENLDVLIHKFRDSINKFSIALDVYKEEVELKD
ISFVVKILDFLKDINCNDSEFYLESNLNTFIEEINNFMGEVEGFRYWLYE
KGQQNIRNESITLIHQISQHERVITDYINLIHPNLISNPYLLIYGEAGIG
KSHLLADNAKRLQEAGHSVFLFLGQYLNKRDQPFKQLFDLIDYKGDKEYF
CKEFNERAQSKNKKTVIIIDALNEGEGKYFWKNYLLSFLNFIKKFKNIAV
VLSVRSNYVRSVLPENIEYEFPLHELEHKGFNDLSLEALEPFFNFYKINP
VMFPSLENECYNPLFLQIYCEVFQDNYKGFRGWSIVEVLEKYIEKINAKM
SLDERFRYLSSLNLVSEILKGIAVKFIENEKRDITLNELYEVLEKVASKY
TTEYRLLILGLEEENILTINKGHNEGLVYFTYERFADIYMSIILLERHQQ
DKNLFKNILLSNSQFYYGIYESLSIVIPEKQEVEWLDLIESKFITFDIAE
EFVRGISWRNVQNINERTFYWINKCLSQNNMQLQSLVYGRLLKQSYVIES
PLNADFLHDNLNNMSMSIRDGSWTIFINDNSEVPMRLAEIILKQNLSFEH
FNKQNFELLSTTYIWLLTSSNIKLRDAATTALVKLYMNQPSIIIKDIIRF
ISVNDPYVLERLFASVYGAILRTNDLSQLGEIVDCIYNNIFEKNEVYPNI
LVRDYARSIILFAINKEIISFKGYEKINPPYSSVWYEKIYSLQEIDNKVK
EMEQKSEEVYGGFRSIVRSMTTEYGRGTGAYGDFGRYVFGSAVSDWRNQF
EDQDLSNIATMRIVEYGYNEEVHGYYDRNLNHHSRHQNSVERIGKKLQWI
SLYELLAKLTDNYPVYKEVKRYTPEYEQYQKLQNKKMHRYMRNLFDFESE
EDIVVIEQESEIPLKEEEHIVRIEKEYYKKYDGPWDPFLRNIDPSLLNYP
TEKSTHNLIKSYLPHRPNKYWAQSKKEFENLEDYLFIEYEGEKYISLAQL
LIQKRDNGKNYLDVDEFCVKTKAIFLPLKDKENYIGLKSVKKGDLSVSWG
NIYTVFAFEYCWHPAFLENCYEQEFENIKYEESVFEYLWETDVDSISGEL
NSCSYLLPSANLVKFFELNQTSEGVWKDEHNNLVAFDTQYLGYERNLLFR
ADYLEEYLESNHLTIVWDFYMEKKSERSRKEEWFICWINNNKDIEHQILE
EHQETDRKELF
>BCE33L0666 group-specific protein
MCKVFHIIHLGGVYMYKKIAVSMSMATLLFGATVFPASAATPKEVTLHHH
KPISEEEMQSLEKLGYNKHEIWKAAHIAKMSKKEIKDVLAYYKQNNSWEK
TAEYFGVDPSKSKKHHMNKETKKALLQQLANMQKSTPDGLKQKMKEYNIG
LRQLTVLTIISQKSNTPLDDVLKMKKDGMDMKQIAEKLNVKKEDIRAEMI
KLVKSIKEKKTN
>BCE33L1472 possible ribonuclease H
MKYKIHWLYKTKRGLQTELMTDYMNIEEALQFAEDFEKTGRVKELLFYDE
MDTEWLLKEMKKLSKQVEEEPQEILVYFDGGYDVQTKEAGIGICVYYKKG
NAKYRIRRNAYIEGIYDNNEAEYASLLYSMNILEELGIKYEAVTLRGDSQ
VVLQQLAGEWPCYDEHLNHYLDQIEQKAKQMKLKLVCEPISRKQNKEAHQ
LATQALEGTVIDSHKEITE
>BCE33L3982 possible helicase, SNF2 family
MNVDISIDRTWQNNFLNRIDEDGPWTNWDLYHLAYETEKSLLVPTFDGLQ
APKHLSHFTPLPHQLEVAQNVIEQMNGKAILADEVGLGKTIEAGLILKEY
MVRGLVKKVLILVPASLVSQWAYELNTKFFIPAVAQKKSYSWEQADVIVS
SIDTAKRSPHRDIVLNLEYDLIIIDEAHKLKNNKTKNYEFAQRLKKKFCL
LLTATPVQNKIDEIFNLVSLLKPGHLGNQSNFEEYYASKNRSAESDEDLK
ALINKVMVRNRRHNTGIDWPKRHVRTIFVEFNEEEQDLYNNIENWRGQDA
FTSAFSSLTLKREACSSREAVYYSLKKHVEKRQKENEHYVKDPHIDILMD
KINHIPFNSKANKALELIKEIDDKVVIFTEYRASQMYLQWFLQQHGISSV
PFRGGFKRGKKDWMKELFQNHAQVLIATEAGGEGINLQFCSHMINYDLPW
NPMRLEQRIGRIHRLGQKNDVHIYNLATKHTVEEHILKLLYEKINLFERV
IGELDEILTRINMKNIDAHIQEIFAQSKSEGEIRIKMENLTSIIDFAKRN
EAEVQGYAAT
>BCE33L1916 ATP-dependent RNA helicase (D-E-A-D box family)
MIKDMQPFLQQAWEKAGFKELTEIQKQAIPTILEGQDVIAESPTGTGKTL
AYLLPLLHKINPEVKQPQVVVLAPTRELVMQIHEEVQKFTAGTEISGASL
IGGADIKRQVEKLKKHPRVIVGSPGRILELIRMKKLKMHEVKTIVFDEFD
QIVKQKMMGAVQDVIKSTMRDRQLVFFSATMTKAAEDAARDLAVEPHLVR
VTRAESKSLVEHTYIICERREKNDYVRRIMHMGDVKAVAFLNDPFRLDEI
TEKLKFRKMKAAALHAEASKQEREATMRAFRGGKLEILLATDIAARGIDI
DDLTHVIHLELPDTVDQYIHRSGRTGRMGKEGTVVSLVTPQEERKLLQFA
KKLGIVFTKQEMFKGSFVETKPKAPKKKKPAFTGKKKPR
>BCE33L4070 possible DNA polymerase III, delta subunit
MSDIHKKIKKKQFAPLYLLYGTEAFFINETIKLITTEALEEEDREFNVVT
YDLEEAYLEDVVEDARTLPFFGERKVLLIKSPLFLTSQKEKLEQNIKILE
EYIGEPSPFSILVFVAPYEKLDERKKITKLLKKTADIVEANAMQVQDVQK
WIVARAEEGHVHIDNAAVSLLLELVGSNVTMLAKEMDKLTLYVGMGGEIT
PKLVAELVPKSVEQNVFALTEKVVKKDIAGAMQILDGLFTQQEEPIKLLA
LLVSQFRLLHQVKELQQRGYGQNQIASHIGVHPYRVKLAMNQTKFFSFEE
LKKVIIELAEADYSMKTGKMDKKLVLEFFLMRLNRM
>BCE33L4142 ATPase, AAA family
MKQPLAHRMRPTNIQEIIGQQHLVGEGKILWRMVQANHFQSMILYGPPGT
GKTSIASAIAGSTGTPFRLLNAVTHNKKDMEVVVQEAKMHRHLVLILDEV
HRLDKAKQDFLLPHLESGLLTLIGATTSNPFHAINSAIRSRCQIFELHAL
TEDDLLIGLKRALEDKEKGLGEYDVTVTPEALHHFANASGGDMRSAYNAL
ELAVLSSFTTDDKAAEITLEIAEECLQKKSFVHDKGGDAHYDVLSAFQKS
VRGSDVNAALHYLARLIEAGDLQSISRRLLIMAYEDIGLASPQAGPRTLA
AIESAERVGFPEARIPLANAVIELCLSPKSNSAYKALDAALHDLRNGQTG
DIPSHLKDSHYKGAEALGRGIGYLYPHDQPNGWVQQQYLPDKIKNKQYYK
PKTTGKFEQALSTVYEKLQSSNKTKNKG
>BCE33L1494 hypothetical protein
MCNMDKRLLSMLNELQYSEEEITRLEKEANESINIDKYYVKNGELLKKLT
MQMLEESVRKELQEGEKIECEILVEEGSLVNSKVIPNIFTAGATFYYYIV
LTNTRLHIIALDCYFKKINEYNVPINSIQAVSRHKKMKNVYGITIDNNYI
QLGSTEHEQELKNVIEKLKQCGVKEERYKDFEKGWLIFFNVFVIAIGGLL
VLKYLI
>pE33L54_0043 possible phage integrase family protein
MLKNLEIYNKEFISDVGIKSYIESLSKVFFSHDKQTKFTQMFDEFKKTGV
IINDRFEDTIWLLADKDKRITKISFNIHSLPIYKESLRYYVLLELRQENS
LKTIQGKIQFIKQAILVTKGFEDSQCYSQLESFLNNKSASVIFKYATSLA
KYLLFIGYEEQDDLIILCKQFAYGYTNTVRNLPVFKDVLTFDWIIRDFQK
KWTDTEKEIYFPIILWWNITLIIPMRIKEFILLSRECAQENNGKYTITVP
RAKKQARKIHSIDVTDTLTISKRIYDLIQEYICITNENSNNEYLLSYYHY
CKSPRYQKTHGNAFKNKKDKTKFEEGQLYTLLKTFYEEIVTKKYGLSLQY
IKPLDTRHFAFCNMMFQGFNMLTIARIGGHSSLDAQMHYFNHLEYFSQSS
IQYLSDQYRKIPHISLNTEGISNDSNIKNLFSKAFLNQLSQAELEELPRM
EYGYCLYSPTNCPVGDCRYCEYFFIPYSEFNLELYKWLNDESEFLWRRIK
EQLLLFKTITKNMNYNFTTLEYDELSQAELNYISQDLKKMQEQKARVDTQ
LDTVANFLFGGAHDEE
>BCE33L4088 transposase, N-terminal region
MGRSLGISDTIILNWVNQYKQNGVEAFLKRCTNYTRQFKLDVLNFMIENG
MSLFETAAIFNIPAPSTISVWKNHETRQSASSL
>BCE33L3252 MutT/Nudix family protein
MIRNRGVAVIVQEGKIALIKRIRGGETYYVFPGGGIEEGETPEEATKREA
YEELGVHIKVGNLIAKLEYKGTQYYFNAHIIGGVFGSGKAKEFELKDRGS
YIPLWLPIHGLEKVNIKPYEVVESILEHI
>BCE33L2660 conserved hypothetical protein
MLLEEVIQQLEEYGTEQNRKTYKNHGAKEPLFGVSFANLKLLKKKIKKDH
ALAISLWETKNMDAMTLATMILDPKKVTTELLNKWVQEVDYYCLMDVLMT
AICTSPIAIERMEEWTNSDDEWIGRAGWSLLANIAIKNKMLQDDFFSPYL
EEIKVNIHNEKNRKKEAMNRALIAIGIRNEDLERLAIEIARKIGKVQVDH
GATSCKTPDAEPYIKKARERAEKKKEK
>BCE33L2500 group-specific protein
MKYFNKDWYKEMQVSGFLNFSETVEEWEEMLRESEKIGMDYKQSLREDAE
EKKEDLLKFLPKSLHPYIHDNTINSEYPSEKLKKLMLEWTEDYEKRMDDL
EQAYKDNYNSIKERLAQNVVQLHEYSLHDSQVTSVERRLKDTIIITLDCS
GTFNEFDKLKVTFIGVSKCSIPENFEGAWWLCHEIDLAEDGFELGVLFDC
PFAEVMICAKNVLLEIDN
>BCE33L0222 UV-endonuclease UvdE family
MLVRLGYVAMSVHLKNASPSQTMTYAQFQKIDDREAAIRKLERIANSNLE
NCLRLLKHNKGHDISFFRLSSKLIPLANHEELLDWNYIRPLKENLKVLGD
YAIRMNMRIDFHPDHFVVLNSAEENIFKQSVKTLQMHRKLLKGMGIEHKQ
RCVMHVGGGYKDKELALERFIENWSNVPRGIQEMIMLENDDTTFTLEDTL
YLGEKLDIPVVFDLHHHMMNHDREDWHEDWARVVHTWESSLLPVKMHISS
PREGKDPRAHADFIDVDTFLSFLKKIKGSVPQIDCMIEAKMKDESLFQLM
RDLSEQTDVEIIDGASFYIK
>BCE33L4645 conserved hypothetical protein
MHPFVKSLQEHFIAHKNPEKAEPMARYMKNHFPFLGIQTPERRQLLKDVI
QIHTLPDPKDFQIIVRELWDLPEREFQAAALDMMQKYKKHINETHIPFLE
ELIVTKSWWDTVDSIVPTFLGNIFLQHPELISAYIPKWIASDNIWLQRAA
ILFQLKYKQKMDEELLFWVIGQLHSSKEFFIQKAIGWVLREYAKTKPDVV
WEYVQNNELAPLSKREAIKHIKENYGINNEKIGETLS
>BCE33L4382 possible adenine-specific DNA methyltransferase
MYVSQTVETLFSIFDSSAVVLRKELDVTYLEALVETGDNLFEGAILQEEL
SEAAIERLNREYSTFNEETYKGEEIRKAFQLAILKGMKEGVQANHEMTPD
AVGMFMSYLFHKFMQGQNEITVLDPAIGTGNLMTTVFNSAKEGLTMSGFG
VEVDEVLIKLALVNANLQKHAIEFFHQDGLAPLYIDPVDAVVSDLPVGYY
PNEIGASEYKLKADEGMSYAHHLFIEQSVKHTKEGGYLFFLVPNFIFESD
QAPKLHAFIKETCFIQGLLQLPVSMFKNEKNAKSIFVLQKKGPSVTMPKQ
ALLVELPKFSNMKAMEDIMDQLNTWFATHK
>BCE33L0918 DNA repair exonuclease
MDTRFTICSKVRKKKGSLFVKQVKFIHAADLHLDSPFKGMEMNVPQSVWE
RMKQSTFESFERIVDKAIQERVDFVLLAGDLYDAETRSLRAQVFVREQMK
RLSQYDIPVFIIHGNHDHLGGSWAAIEFPENVHVFTEPYVEEKSFYKNGE
LLASIYGFSYLQQAVTDNMTAQYTKMSDAPFHIGMLHGSVEGDAEHNRYS
PFQIRELKEKQFDYWALGHIHKREISSEAPYMIYPGNIQGRHRKETGEKG
AYLIELTKQGTQCSFFHTADVVWDEIEVSIDGLETVDDLMTSVSSAMNEC
RREEEGTLLTVVFTGQGPLSPYLRDEKRVEEIFHILAAGEERKDFVYAMK
WKNETVSFAEIERLKEENHFVGSVLKELEAFTNMDGVLRTIWTSPVARNS
IESFTEEEKKEIQKEAENIILEQLFQQERDKK
>pE33L466_0164 DNA integration/recombination/inversion protein
MNKKPHLIDVQPIRSKEQIEDMKWALKRHCSERDYILFLIGIHTGLRVSD
LLQLETQTILKLKRKKRKEFKIKEGKTKKERMINLTSIFDEIYSYAQTLE
STWLFPSRKGDKPISKIQAYRQLQKAGDFADVESIGTHTMRKTFGYWFYK
QTKDVAMLQDILNHSTPQITLKYIGINKEEKDNILDNFHI
>pE33L9_0009 integrase/recombinase
MDKKPHLIDVQPIRSKEQIEDIKWALKRHCSERDYILFLIGIHTGLRVSD
LLQLETQTIIKLKRKKRKEFKIKEGKTKKERMINLTSIFDEVYSYAQTLD
NTWLFPSRKGDKPISKIQAYRQLQKAGDFAGIESIGTHTLRKTFGYWFYK
QTKDVAMLQDILNHSTPQITLKYIGINKEEKDNILDIFYI
>BCE33L1781 MutT/Nudix family protein
MTEWLTIFDSERNTLGKKLRDEVHRDGDWHETFHCWFVEKDAEDMFLYFQ
LRSKNKKEAPLIWDITSAGHIMHNEDVQIGGLREIEEELGLSFQTTDLAY
KGIFTIDYEISNLTDREFCHMYFHNVIDSLPFAPGEEVDDVMKVHATSFL
QLLKREVSSFIAISVLNNKPITITFEDIYPYDIAYYEFVIEKGKELIKNN
SL
>BCE33L0970 hypothetical protein
MPRIYLNEEVLSQALQQFDQMIQDLNHNKRVVSNVHNLLLSSWSQLGVGK
KAISDLESFKKDIERRMEELESDKRELKGAIDLLKALDQSYDYMGPKY
>BCE33L3133 conserved hypothetical protein
MNLRKRLGCLHAHYSNIEYIEKALTSFNIELIHFVDPALMYQVTSNEKFQ
ESDAQFKVKEQIEWIAQCNVDAILITCTNYIAILHEDQLSISVPIIKIDV
PFFDYICNIQQPQTIVFTNPATVEGTVERLKHHANSRQKSLDLEVIVINS
VFELIMKGLKEEYNQEITKFLNQIIKDEKKVISVAQLSMVDATKQVEYKT
AKTIINPLNTLVSYIVNKLKLEKKNQ
>BCE33L2224 group-specific protein
MQLTYGQYLIEKNREQAIRTSPSDDDTNYEKINWYNDMKTSFANKELADL
VKEICNFVSFVRWELEMHNCHPYWEIAYYDDEDNAV
>BCE33L1514 group-specific protein
MYIQLYEKLVIIQKAYENIQTLGEQIYENLKHKNVNAVQKLQVEQLQYID
GLKKLSSSFEEMVIQFCKEKGIEPFRVSALFSHFSNEEIEKMEELQKNVA
ELEENVKMILLKNQYYLNVLLKTTESIVDSVSEYNLERNNNSQIFMNELL
>BCE33L2315 conserved hypothetical protein
MRNRLIGKEQQFLTDVLKELKKYDISLEERDNIKRQILEHIQECREHGED
SIDDLGTPQLFVQDFLEINEIDLQVKMKQLQNEKDKFNKFMLRGIFISVI
TYLSSQSIFSIFLTESFNPTNSKNAFQYNLLYRISENQWWNALLIMTSLT
LSVVVYISLVSYKKRKHLKYNEECDL
>BCE33L3781 MutT/Nudix family protein
MIREGEERRVYILGYIEELRKVVGTRPLILVGSAIIILNDNQEVLLQYRS
DTYDWGVPGGAMELGETTEETARRELFEETGLNAKIMQFIGVLSGKEVYF
QYPNGDEIFNVIHLYQGHHVSGELRLDHEGLQLQYFPVDKLPNLNKTTEK
ILQKFLYALTE
>BCE33L3247 exonuclease family protein
MENATHFIVFDIERNFRPYKSEDPSEIVDIGAVKIEASTMKVIGEFSELV
KPGARLTRHTTKLTGITKKDLIGVEKFPQIIEKFIRFIGEDSIFVTWGKE
DYRFLSHDCTLHSVECPCMEKERRIDLQKFVFQAYEELFEHTPSLQSAVE
QLGLIWEGKQHRALADAENTANILLKAYSERDITKRYKRHGELELVKDGK
LTEKAKKKMRKWVFKEMRKNTERPFVWSTFESSDTWESITERYYISESTI
ELLKKHFRTAVRKAERQIRYLAEMEKVVEEN
>BCE33L4941 SNF2 family helicase
MINQTEVTIRLQHVSHGWFLWGEDDSGTPLSVTSWKRNAFTWHSTSFYGT
FLKEATFEGKQGVMLTNAQAFEYIANKPMNSFARIQMNGPITALTKDANE
LWDAFTSGSFVPDMERWPKQPSWKVQHTPIEDDTLASLFSAAVNESILQD
NRSNDGWEDAKRLYEHYDFTKRQLDAALHEEDWLRKIGYIEDDLPFTIGL
RLQEPQEEFEMWKLETIITPKRGAHRIYVYESIDSLPKRWHDYEERILET
QEGFSKLVPWLKDGDTFRSELFETEAWNFLTEASNELLAAGITILLPSWW
QNLKATKPKLRVQLKQNATQTQSFFGMNTLVNFDWRISTNGIDLSESEFF
ELVEQNKRLFNINGQWMRLDPAFIEEVRKLMNRADKYGLEMKDVLQQHLS
NTAETEIVEEDSPFTDIEIELDGYYEDLFQKLLHIGDIPKVDVPSSLNAT
LRPYQQHGIEWLLYLRKLGFGALLADDMGLGKSIQTITYLLYIKENNLQT
GPALIVAPTSVLGNWQKEFERFAPNLRVQLHYGSNRAKGESFKDFLQSAD
VVLTSYALAQLDEDELSTLCWDAVILDEAQNIKNPHTKQSKAVRNLQANH
KIALTGTPMENRLAELWSIFDFINHGYLGSLGQFQRRFVSPIEKDRDEGK
IQQVQRFISPFLLRRTKKDQTVALNLPDKQEQKAYCPLTGEQASLYEQLV
QDTLQNVEGLSGIERRGFILLMLNKLKQICNHPALYLKETEPKDIIERSM
KTSTLMELIENIKDQNESCLIFTQYIGMGNMLKNVLEEHFGQRVLFLNGS
VPKKERDKMIEQFQNGTYDIFILSLKAGGTGLNLTAANHVIHYDRWWNPA
VENQATDRAYRIGQKRFVHVHKLITTGTLEEKIDEMLERKQSLNNAVITS
DSWMTELSTDELKELLGV
>BCE33L0202 TatD-related deoxyribonuclease
MKWIDSHIHVDQYKDEEKSRLLKDVENSKEIQGLIAVSMNYQSSKETLSL
AKRYSFVHPAIGFHPEQLIHKEECEKIYKLIEDHVEDIVAIGEVGLPYYL
RKEDEHIAINSYISVLQQFIELASKYDLPIVLHAVYEDADIVCDLLEEYK
VSRAHFHWFKGSETTMERMMRNGYYISITSDVLHKEKIRKIVSYYPLEYM
MVETDGPWEFQEGVMTHPGMIREVLKEISVIKNVAVDKVAETIYENTIQF
YPAIFRKLRP
>BCE33L3045 phosphohydrolase, MutT/Nudix family
MGYIEDMRNLVGNQPLILIGSHAIILNEKNQILMQLRTDFNRWGIIGGAL
EYNETLEDALKREVYEETGLNIKNPELFRTYSGPDFFQIYPNGDQVHGVL
VVYICREFNGELVCDQTESKELRFFPLDELPSNLPPVIERIIQEFQQSNL
YVK
>BCE33L3449 possible ATPase, AAA-superfamily
MALVKVTDIAKTLSKKMIFLSDTCEVCKKERKRTVRFMKINGEVVCPVCK
LAEDNQKLEAEMNVFRDEKEQRKRKSMFYDKSLIKDETIKLARFSTFKSD
CEEDEKNYTLAKRALEDYLNDVRFNLILVGKVGAGKSHLAYSIAHEMNEN
SAGTVLYVSVSELFDYIRSTFNGQSEESEHSIVNLLINADLLVIDDLGAE
LGDMDAADPKATAFVNRVLFKVFDGRQGKKTIITTNLTGEAVMKAYDERI
TSRMFNTYRHIEFKYTRDKRKRKLPF
>pE33L466_0235 hypothetical protein
MRVMVRWQDRRGEMKKTTILLIVLLIGSIGLNFKINADNAQLKDEKAKLS
KKIQKVESQYKDTEKELQALKSNNQQQVKEAAERFLKAFRTYDTGKGESY
LTNVEAYITPNAKKELTPPGAPNPPASSTAEEKDKKKVAFKSEYIGGELY
YAFIDTTKANVLAKVKSKMIINGVSSENMSLMQINLIYDGNKKLWLVDKV
IPLADLRDQMP
>BCE33L1792 conserved hypothetical protein; possible lipoprotein
MKYGKVAVVGALSVGLLTGCFGEKPEENLYTAFETAATQEKSLVDEAKKL
EKLEKEGQELYSQILQEGKDHNDAVMKKIEQATANVDDREKVLKNEKEML
EKAQKETKSVQGNIEKLEDKKLQKQAKAVEESYKNRYDAFQKMNENYTKA
LATEKELYEKLKVKETKLKEIGEKVKAVNELTVEAQKSKEQFNNFTKEYN
DSKLAFYKDAEIKIKDKK
>BCE33L1987 conserved hypothetical protein; TPR-repeat domains
MGFRPEVGEEISLNKDVYRVEKHPAVVGIEMPYGQEGRQGTVYQLQHENG
MERIALKVFKERYREEKHQLAFLKPLSSIAGLKVCSRYIVTKEEHISAIE
KSEELANSIVMPWIEGPTWADILQEQRRLSKEQCFFIAEAFLITLKMMEE
NEVAHNDLSSSNILIPFLSENPIEGQHYIELVDVEQMYGPKTKRPSLLPA
GSAGYAPKYLTSGIWQKEADRFAGAILLGEILSWCSEEVRNKKWTDASYF
KTEEMQTECERYTLLQQVLHNQWNGEIAKLFKQAWSSNSFAECPSFAQWY
DVFNSVRERIKIDAERQSAEEHSLFVSKCLEIARLLEERGFKQAALYEYK
IIFNSLNPSTALQKELAYIIQTMESQEPEINKKMVLQHYLELATELEREN
NAAFACFVYSRIVQFPNIDQALKQEIESIIEEIKEEQGTESQQEIAATIT
VPTSILQSRKQTKKQVEYDIDDEILNAKASNQPLTIAHEQSEPSAFSTWW
KKNKKRILIIGSTVVVVIGGSTLFYFYTTNAKYQKFMEQARQAYDDKKYT
KAEEAVGHAIAVKGKDEAYLQLATIYVAEGKNKIAIDYITKLIKDREIDK
ENNEAAYLLASANFRIGKYQEAVQNFEQALANNAKGIEPYKKDAMRDLAV
SHMKMKEFEKAEDVIVKMSTKTNEDKAIVSYLKGQLSTATVQLEKAESFF
KEAIMQDSKNPIYTIELSNLYVLWNKTNLIDSAKKEMNYQQASHILQVAI
QKDMKNIELLNQLGIVYYEAGQFYETRDGAKSTAAYQQALEAYNRVVSSG
TRDINTLVNIGILYDKVGQGNEAEKLFTEAYSQNDENPHVNFAFGMFKIK
QKKYEEASRFLRKTVQANENESEVRAAQEKLTEMKTNGWIQ
>BCE33L1039 addA, ATP-dependent nuclease, subunit A
MMENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKK
IINEENPVDVDRLLVVTFTNAAAQEMKNRIGEALEKVLIDEPGSQHVRKQ
LSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLD
DILEEEYGIEDNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEK
WLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHIRKATELAMLP
DGPAPRVETLQADLALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSD
YNEDIVKQVDSLRNKAKDEVKKLQEELFSRRPESFLRDFQDMHPVLEKLV
QLVKVFTGRFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRF
RLAEPGLFLGKYKRFTQEGLGGGMKIDLAKNFRSRHEVLAGTNFIFKQIM
GEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEV
EKAQLEARLMAQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWA
PQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDNPMQDIPLAAV
LRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEW
FYNLLQGWREFARQQSLSDLIWKVYGETGYYDFVGGLPAGKQRQANLRVL
YDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKY
TTLSQLAIKRKMKMELIAEEMRVLYVALTRAKEKLILIGTVKDANKEMEK
WLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPD
EIYGYDTSWKVEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKE
EVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNAFIKKLRAPIR
TRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPVTVEVLQEQIAGMVNKEL
LTFEQAEEIAIEKVISFFDSDLGKRVLAAKSVEREVPFTMMLAAEEAYQD
WQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKPIL
EVRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE
>BCE33L1038 addB, ATP-dependent nuclease, subunit B
MSLRFVIGRAGSGKSTLCLHEVQEELKQRPRGETILYLVPEQMTFQTQQA
LIGSEDVRGSIRAQVFSFSRLAWKVLQEVGGASRLHIDEAGVHMLLRKIV
ESRKDGLSVFQKAAEQNGFFEHLGSMIAEFKRYNVTPSNVYEMWQQLDAH
SSSAEQKLLANKVYDLQLLYDDFERALIGKYLDSEDYLQLLVEKLPQSEY
VKGAEIYIDGFHSFSPQELEIVRQLMICGARVTITLTIDEKTLAQPVNEL
DLFYETTLTYEKIKQVAREEKIEIEKTIPLMEQPRFHSPALAHLEKHYEA
RPNEKFHGEASVTIRTAANLRAEVEGVAREIRRLVADEDYRYRDIAVLLR
NGESYYDVMRTLFTDYNIPHFIDEKRPMSHHPLVECIRSALEIISGNWRY
DAVFRCVKTELLYPLDVRKETMREEMDEFENYCLAYGVQGKRWTSEDPWM
YRRYRSLDDTNGMITDSEREMEEKINRLRDVVRTPVIRMQKRLKRAGTVM
QMCEAVYLFLEELDVPKKLEALRIRAEESGDFLFATDHEQVWEEVMSLLD
TFVEMLGEEKMSLSMFTDVMSTGLEALQFANIPPSLDQVLIANIDRSRLS
NVKATFVIGVNEGVIPAAPMDEGMLSDEERDVLSAAGIELAPTTRQTLLE
EQFVMYQMVTRATEKLYISCPLADEEGKTLLASSFIKKIKRMFPDVKDTF
ITNDVNDLSRSEQILYVATPEVTLSYVMQQLQTWKRYGFEGNLDFWWDVY
NFYVTSDEWKQKSSRVLSSLFYRNRAQKLSTAVSRDLYGDKIKGSVSRME
LFNRCAYAHFAQHGLSLRERDIFKLDAPDIGELFHAALKRIADRLLRENR
TWADLSIKECEHLSTVVIEEIAPLLQRQILLSSNRHFYLKQKLQQIIFRT
SIILREHAKSSGFVPVDLEVPFGMGGTGSLPPMEFSLPNGVKMEVVGRID
RVDKAEDENGTFLRIIDYKSSSKALDLTEVYYGLALQMLTYLDVVTSNAH
TWMKKGHAASPAGVLYFHIHNPIVEVKGDASEAEIEKEILKKFKMKGLVL
GDADVVRLMDNKLSTGSSDIISAGLKKDGSFSARSSIASEQEFNVLQKYV
HHTFENIGKDITEGVIDIAPYKKGNKAACTFCNFKSVCQFDESLEDNQFR
TLKDMKDSEAMEKIREEVGGE
>BCE33L0399 alkA, DNA-3-methyladenine glycosylase II (3-methyladenine-DNA glycosidase)
MWSEHVTLEYPYHFEEVLKRLSFDPLNVIQLDEKVIYVPLCIDEEQIVVR
LQGIGTVQNPQFWISSQTGDPEKVMKRMRAIFHWNEPFQDIQNHFLNTSL
RPLFETYAYTPIILEFDYFSCLLRCIIHQQINLKFATVLTEQFVKRYGTE
KNGVFFFPTPEIVANISIEELREQKFSQRKAEYIVGLGRSIVSGTLNLTN
IETRAEEEVSAQLLPIRGIGTWTVQNFLMFGLGRKNMFPKADIGIQRAVQ
GVFQLDDRPDEAFLEKVKQECEPYCSYAALYLWKSIE
>BCE33L3499 alkA, DNA-3-methyladenine glycosidase
MTAEYSNALTLTVPTEFSFQENLRYLSRSSNECMFHIEDNKIYKVISVHD
VKPLVEISMNADDTIQIRFLGEAYISAKPIRDAVANYVTEWFDLTTDLAP
FYTLAKHDVLLQRPIEQYYGLRTLGIPDLFEALSWGIIGQQINLTYAYTL
KRRLVETFGSYVEWNDRKHWIFPSPETIANLHVEDLKNLKMTTRKCEYLI
GIAKLITEGKLSKESLLQIQDVKQAEKRLTAIHGIGPWTANYVLMRCLRF
PSAFPIDDVGLHNAIKYLTGSESKPTKHEIKDFAVNWKNWESYATFYLWR
VLY
>BCE33L0774 alkA, DNA-3-methyladenine glycosylase II
MQAPPSFYEGDTLEVAKKLLGQKLVHIVDGIKRSGIIVEVEAYKGPDDKA
AHSYGGRRTDRTEVMFGAPGHAYVYLIYGMYHCFNVITAPVGTPQGVLIR
ALEPVDGIEEIKLARYNKTDITKAQYKNLTNGPGKLCRALGITLEERGVS
LQSDTLHIELVPEEEHISSQYKITAGPRINIDYAEEAVHYPWRFYYEGHP
FVSKK
>pE33L466_0412 cmk23, integrase
MNKKPHLIDVQPIRSKEQIEDMKWALKRHCSERDYILFLIGIHTGLRVSD
LLQLETQTILKLKRKKRKEFKIKEGKTKKERMINLTSIFDEVYSYAQTLD
NTWLFPSRKGDKPISKIQAYRQLQKAGDFADIESIGTHTMRKTFGYWFYK
QTKDVAMLQEILNHSTPQITLKYIGINKEEKDNILDTFQI
>BCE33L4074 comEA, comE operon protein 1
MMWDFPKKWLGLVAIIGIVLFLLFWKTNEHTERSVITTDVQAKEIEKKSK
PKILDTKLQKKIIVIDMKGAVVKEGVYEMKEGDRVKDAIEKAGGFLPEAD
RKKVNLAQVVQDQMVLYVPDKNEQVQEGAAVSKGEEKVQINAASKEQLEK
ITGIGSRKAESILKYREEHGPFQKIEDLLEIDGIGVKSLEKIKDQIIIP
>BCE33L4887 comFA, comF operon protein 1
MLMVAGKQLLLEELSSDLRRELSDLKKKGEIICVQGIIKKASKYICQRCG
NIEQRLFASFLCKRCSKVCTYCRKCITMGRVSECAVLVRGIHERKGEREL
HSLQWKGSLSLGQELAAQGVIEAIKQKESFFIWAVCGAGKTEMLFYGIEE
ALQKGERVCIATPRTDVVLELAPRLQEVFPSINVAALYGGSVDREKDAAL
VVATTHQLLRYYRAFHVMIVDEIDAFPYHADQMLQYAVQQAMKEKAARIY
LTATPDEKWKRNFRRGKQKGIIVSGRYHRHPLPVPLFSWCGNWKKSLHHK
KIPRVLLQWLKMYVNKKYPIFLFVPHVRYIEEIGLLLKGLDHRIDGVHAE
DPMRKEKVAAFRKGDIPLLVTTTILERGVTVKNLQVAVLGAEEEIFSESA
LVQIAGRAGRSFEEPYGEVVYFHYGKTESMVRAKKHIQSMNKSAKEQGLI
D
>BCE33L4035 deaD, ATP-dependent RNA helicase, DEAD/DEAH box family
MTQQTFTQYDFKPFLIDAVRELRFTEPTGIQQKIFPVVKKGVSVIGQSQT
GSGKTHAYLLPTLNRINPGREEVQLVITAPTRELAQQIYEEIVKLTKFCA
EDQMITARCLIGGTDKQRSIEKLKKQPHIVVGTPGRIKDLVEEQALFVHK
ANTIIVDEADLMLDMGFIHDVDKIAARMPKNLQMLVFSATIPQKLKPFLK
KYMENPEHIHINPKQVAAGNIEHYLVPSKHRNKIDLVNKMLLQFKPYLAV
VFTNTKKMADQVADGLMERGLKVGRIHGDLSPRDRKKMMKQIRDLEFQYI
VATDLAARGIDIEGISHVINYELPSDLDFFVHRVGRTARAGHSGIAVTIY
DPANEEALDSLEKQRHIEFKHVDLRGDEWADLGERRRRKSRKKPNDELDV
MATKVIKKPKKVKPNYKRKLATERDKVKRKYSNKKR
>BCE33L0221 deaD, DEAD/DEAH box helicase
MTTFRELGLSDSLLQSVESMGFEEATPIQAETIPHALQGKDIIGQAQTGT
GKTAAFGLPLLDKVDTHKESVQGIVIAPTRELAIQVGEELYKIGKHKRVR
ILPIYGGQDINRQIRALKKHPHIIVGTPGRILDHINRKTLRLQNVETVVL
DEADEMLNMGFIEDIEAILTDVPETHQTLLFSATMPDPIRRIAERFMTEP
QHIKVKAKEVTMPNIQQFYLEVQEKKKFDVLTRLLDIQSPELAIVFGRTK
RRVDELSEALNLRGYAAEGIHGDLTQAKRMSVLRKFKEGSIEVLVATDVA
ARGLDISGVTHVYNFDIPQDPESYVHRIGRTGRAGKKGIAMLFVTPRESG
QLKNIERTTKRKMDRMDAPTLDEALEGQQRLIAEKLQSTIENENLAYYKR
IAEEMLEENDSVTVVAAALKMMTKEPDTTPIALTSEPPVVARGGGSKKRG
GNGGGYRDGNRNRSRDGRGGDGRNRDRNRDGRNRDGNRDRNREGSRDGNR
GRRGEGQGRPGSSNGRGERKHHSRKPQA
>BCE33L2223 deaD, ATP-dependent RNA helicase
MVYLKNFLELGISETFNHTLRENGITEATPIQEKAIPVILSGKDIIGQAK
TGTGKTLAFVLPILEKIDPESSDVQALIVAPTRELALQITTEIKKMLVQR
EDINVLAIYGGQDVAQQLRKLKGNTHIVVATPGRLLDHIRRETIDLSNLS
TIVLDEADQMLYFGFLYDIEDILDETPGSKQTMLFSATMPKDIKKLAKRY
MDEPQMIQVQSEEVTVNTIEQRVIETTDRAKPDALRFVMDRDQPFLAVIF
CRTKVRASKLYDNLKGLGYNCAELHGDIPQAKRERVMKSFREAKIQYLIA
TDVAARGLDVDGVTHVFNYDIPEDVESYIHRIGRTGRAGGSGLAITFVAA
KDEKHLEEIEKTLGAPIQREIIEQPKIKRVDENGKPLPKPAPKKSGEYRQ
RDSREGSRSGSKGRPRNDSRNSSRNENNRSFNKPSNKKGSTKQGQQRRGR
>BCE33L5149 deaD, ATP-dependent RNA helicase
MSKKSFSNYALSKEVRRALTGLGYEHPTEVQGEVIPVALQKKDLVVKSQT
GSGKTASFGIPLCEMVEWEENKPQALVLTPTRELAVQVKEDITNIGRFKR
IKAAAIYGKSPFARQKLELKQKTHIVVGTPGRVLDHIEKGTLSLERLKYL
VIDEADEMLNMGFIDQVEAIIDELPTKRMTMLFSATLPEDVERLSRTYMN
APTHIEIKAAGITTDKIEHTLFEVREEEKLSLLKDVTTIENPDSCIIFCR
TQENVDHVYRQLDRVNYPCDKIHGGMVQEDRFGVMDDFRKGKFRYLVATD
VAARGIDIDNITHVINYDIPLEKESYVHRTGRTGRAGNSGKAITFITPYE
DRFLEEIEAYIGFAIPKANAPSKEEVMKGKAAFEEKIQAKPTIKKDKSAD
INKGIMKLYFNGGKKKKIRAVDFVGTIAKIQGVSAEDIGIITIQDNVSYV
EILNGKGPLVLKIMRNTTIKGKQLKVHEANK
>BCE33L1425 dinG, ATP-dependent DNA helicase
MSKRYVVVDLETTGNSWKDGKDKITQIAAVVVEDGEILEIFSSFINPKRE
IPPFITELTGIDESMVKQAPLFQDVAPMVVELLQGAAFVAHNVHFDWNFL
TEELRQAGYTEIHCPKVDTVELAQILLPTADSYKLRDLAKQHELEHDQPH
RADSDALATAELFLQFLNEIEKLPLVTLQSLYELSDVFQSDIADVLSENI
LKKMMHGKKVAEEYEVYRNIALRKRNYSLNLSETCSSKFDAFLHKTIDKL
ALNMPKFEKRESQQIMMKEIYTALRDSRFSLIEAGTGTGKTLAYLLPSIY
FAKKKEEPVIISTQTVQLQQQILEKEIPLLQKIMPFSFEVALLKGRKHYL
CLHKFEYALQEEEKNYDMALTKAKILVWLLQTETGDRDELNIPEGGKLLW
NRICSDVHSPGGMQSNWFSRCFYQRAKNKALFADIVITNHALLFQDFSSE
EPLFASCEHIIFDEAHHIEEAASRTLGEQFSCMYFQLVLSRLGTLETDDV
LSKVYKMMKKSEQASRSTFRMVSHSLKELKFDADELFQMLRAFIFKQTKQ
EQGISNMPLIYRYNTEVEKGKLWDSITELTNRFVYDIRKLLTTLEKQVDI
LQSKIEWEMHVVTGEFMHLIELLRKMAQALQLLILEKNSYVTWMETETKG
TIHSTVLYGQPVHIAERFADEFLTEKKSVIFTSATLTVNDSFAYIKEELG
LHDFAPNTLQVPSPFRFEEQMKLMVSTDVPFIKKASNEEYIESVSTHIAK
IAKATKGRMLVLFTSYEMLKEAYTNLKNDEELEGYLLLTQSVNNKSRSRL
IRKFQEFDKSILLGTSSFWEGIDIPGDALSCLVIVRLPFTPPYQPMIEAK
GEWLKNQGEDVFAKLALPQAILRFKQGFGRLIRTSTDTGTVFVLDRRLTS
SSYGKYFLQSIPTVPLYEGPLEELLEQVKERPTE
>BCE33L1443 dinG, ATP-dependent helicase, DinG family
MFTEKRLPFEVGKQDNFYDKLNEWIGDVFYDILPEKGFEERDEQIFMAFQ
LERAFQEKKVMFAEAGVGTGKTIVYLLYAICYARYTGKPAIIACADETLI
EQLVKEEGDIAKLSEALGLSVDVRLAKSMDNYLCLRKLEDVMSGRAPEVI
EDVYYELPQFVFDHGTMQNFTHYGDRKEFPLLNDEEWSKVNWDYFQDCFT
CDSRHRCGQTLSREHYRKAADLIICSQDFYMDHIWTYDARKREGQIPLLP
ESSCVVFDEGHLVEYAAQKALTYRLKQTMMEQLLTRLLQNDIREEFAHLV
EETIWQTERFFDVLQENKKEIAGSDRLEITVTEKVTAEAKRLYAKIGEVG
DALVFESEMHTVNTYDLNIVDEHLDVLEHSLRLFMHEKNVITWGEEDDGA
FTLVIMPRAVEEVLQEKVFSKKIPYIFSSATLSNNDSFSFTANSLGVKDY
LSFSVASPFDYEEQMAVNLLSHTKENEWERKCQYTLENIQKTNGRTLVLF
RTTQELAAFKEYVSKEQMSVPFLYEGDQEISQLVSRFQNEEETVLCAVHL
WEGLDIPGSSLSHVIIWSLPFPPNDPVFEAKRKHVNDPFWDVDVPYMILR
LRQGIGRLIRTSDDKGAISIFLSDTEDEKVVEAVKNVLPVEGKEL
>BCE33L3897 dinP, possible DNA-damage-inducible protein P, DNA polymerase IV
MSTMREMYPKKGRVILHVDMNCFFASVEIAHDSSLQGKPLAVAGNEKERK
GIIITCSYEAREYGIRTTMPLWEAKRLCPQLIVRRPNFTLYREASFQMFQ
ILSRFTEKIQPVSIDEGYLDITDCYALGSPLEIAKMIQQALLTELQLPCS
IGIAPNLFLAKTASDMKKPLGITVLRKRDIPEMIWPLPVGAMHGIGEKTA
EKLNDIHIQTIEQLAKGNEHIIRAKIGKHGVDLQRRAKGMDDRQVDPSQM
GQHKSVGNSMTFSKDMDEEKELLDMLERLSKSVSKRLQKRTLVSYNIQIM
IKYHDRRTVTRSKQLKNAIWEERDIFQAASRLWKQHWDGDSVRLLGVTAT
EIEWKTESVKQLDLFSFEEDAKEEPLLAVIDQINDKYGMPLLQRGSQLLR
KQEKSFQQKLESKFM
>BCE33L0001 dnaA, chromosomal replication initiator protein
MENISDLWNSALKELEKKVSKPSYETWLKSTTAHNLKKDVLTITAPNEFA
RDWLESHYSELISETLYDLTGAKLAIRFIIPQSQAEEEIDLPPAKPNAAQ
DDSNHLPQSMLNPKYTFDTFVIGSGNRFAHAASLAVAEAPAKAYNPLFIY
GGVGLGKTHLMHAIGHYVIEHNPNAKVVYLSSEKFTNEFINSIRDNKAVD
FRNKYRNVDVLLIDDIQFLAGKEQTQEEFFHTFNALHEESKQIVISSDRP
PKEIPTLEDRLRSRFEWGLITDITPPDLETRIAILRKKAKAEGLDIPNEV
MLYIANQIDSNIRELEGALIRVVAYSSLINKDINADLAAEALKDIIPNSK
PKIISIYDIQKAVGDVYQVKLEDFKAKKRTKSVAFPRQIAMYLSRELTDS
SLPKIGEEFGGRDHTTVIHAHEKISKLLKTDTQLQKQVEEINDILK
>BCE33L0840 dnaB, replicative DNA helicase
MAAMSIQNVEAEKTVLGSLLLDGELIKECRLTEQYFSMPVHKSIFQLMRK
MEEEGQPIDLVTFTSRIDPKFLKGIGGMEYFIGLMDGVPTTSNFSYYEGL
VRGAWKMYQAGVLGHKMGERLIAEKNEKIIGETITALCELEEKDCVCEFD
LKDALVDLYEELHQDAKEITGIETGYTSLNKMTCGLQEGDFVVLGARPSM
GKTAFALNVGLHAAKSGAAVGLFSLEMSSKQLLKRMASCVGEVSGGRLKN
PKHRFAMEDWERVSKAFAEIGELPLEIYDHAGVTVQDVWMQTRKLKRKHG
DKKILIIVDYLQLITGDPKHKGNRFQEISEISRKLKVLARELNVCVVALS
QLSRSVESRQDKRPLLSDLRETGQIEQDADVIMLMYREDYYDKETMQKEM
TEIHVAKHRNGPVGSFKLRFLKEFGRFVEGK
>BCE33L4321 dnaB, DNA replication protein
MEKQSWMELLPIDRYKVSAKGLLHNYDRKVLTMLYQPLIGSRAFSLYMTL
WGELEQDRVFGKENTHHSLMVTMQMQLPEVYEERVKLEAIGLLKVYIKKE
KDIRMFIYELQPPLSPKQFFDDIVLSIFLYNRLSRTKYNQVKHYFLEEEF
DFASYENVTRSFNDVFGSFNPGQLEHAQEDLRIPKTTAMPSNEKGNAPKV
WNDFFDFSLFVDGLSALVPKKAITDQVRECVITLAYVYDVDVLSMQNIVL
GAVTEMQTIDMERLRKGARDWYQFENGQALPVLSERVQPHAARMMKEKEP
STQEEMLIKQLEEISPKQLLKEISGGAEATKADLQIVEDVMINQKLTPGV
VNVLIYYVMLRSDMKLAKTYVEKIAGHWARKKVGTVAEAMALAKEENRQY
QEWAETKKKGRTSKKTVRKEMVPDWLKEEPKEKEKETVKKDASAEKGAST
LEDERKRLEEVLKKYKRD
>BCE33L5165 dnaC, replicative DNA helicase
MSDVMADRTPPHNIEAEQAVLGAILIDQDALTSASELLVPDSFYRTKHQK
IFEVMLGLSDKGEPIDLVMMTSAMADQGLLEEVGGVSYLAELAEVVPTAA
NVEYYARIIAEKALLRRLIRTATHIVSDGYEREDDVDGLLNEAEKKILEV
SHQTNAKAFQNIKDVLVDAYDKIELLHNQKGEVTGIPTGFTELDKMTAGF
QRNDLIIVAARPSVGKTAFSLNIAQNVATKTDENVAIFSLEMGADQLVMR
MLCAEGNIDAQRLRTGSLTSDDWAKLTMAMGSLSNAGIYIDDTPGIKVNE
IRAKCRRLKQEQGLGMILIDYLQLIQGSGKSGENRQQEVSEISRTLKGIA
RELQVPVIALSQLSRGVESRQDKRPMMSDIRESGSIEQDADIVAFLYRED
YYDRETENKNTIEIIIAKQRNGPVGSVELAFVKEFNKFVNLERRFEDGHA
PPA
>BCE33L1428 dnaD, DNA replication protein
MKKKMMLQWFEQGSIAIPKLLMMHYKKLGLNETEFMVVLHVHTFLESGNS
FPTPSEISERMTITEMKCMEVIQTLIQKGFLSLEGGQKSEAMMCESYSLQ
PLWEKILHFLMNESIEEEQKEIKQLQVNLYTVFEKEFGRPLSPFECETLG
MWEDQDQHHPNLIQAALREAVMSGKLNFRYIDRILFEWKKNGIKTVDQAQ
NQGRKFRANQQRTQQTTKQETKFTGKVPFYNWLEQ
>BCE33L3316 dnaE, DNA polymerase III, alpha subunit
MGNISLPLNYVVIDFETTGFNPYNDKIIQVAAVKYRNHELVDQFVSYVNP
ERPIPDRITSLTGITNYRVSDAPTIEEVLPVFLAFLHTNVIVAHNASFDM
RFLKSNVNMLGLPEPKNKVIDTVFLAKKYMKHAPNHKLETLKRMLGIRLS
SHNAFDDCITCAAVYQKCASIEEEAKRKLNKEALDETVVYEAVKEILVRN
KRDIEWIRCMNVGSYLDIKAFYPVMRLKVKGRKKYVLTDILEDDVKEICT
SLACEPALKSEVGNTRIMLNSLEDVLKLESYILEQYDFVLQALRDYKQSE
MNADEKLKEYLNIMV
>BCE33L4345 dnaE, DNA polymerase III, alpha subunit
MKFVHLQCQTVFSLLKSACKIDELVVRAKELGYSSLAITDENVMYGVIPF
YKACKKHGIHPIIGLTASIFSEEEERSYPLVLLAENEIGYQNLLKISSSI
MTKSKEGIPKKWLAHYAKGLIAISPGKDGEIEQLLLEDKESQAEEVARAY
QNMFGNFYMSLQHHAIQDELLLQEKLPVFMSRINIPVVATNDVRYINQSD
ALVHECLLSVESGTKMTDPDRPRLKTDQYYLKSSDEMEALFSHAEEAIYN
TVEIAERCRVEIPFHVNQLPKFPVPSNETNDMYLRRVCEEGLQKRYGTPK
EVHINRLNHELNVISSMGFSDYFLIVWDFMKYAHENHILTGPGRGSAAGS
LVSYVLEITDIDPIEYDLLFERFLNPERVTLPDIDIDFPDIRRDEMIRYV
KDKYGQLRVAQIVTFGTLAAKAAIRDIARVMGLPPRDIDIFSKLIPSKLG
ITLKDAYEESQSLREFIQGNLLHERVFEIAKRVEGLPRHTSIHAAGVIMS
QEPLTGSVAIQEGHNDVYVTQYPADALEELGLLKMDFLGLRNLTLLENII
KFIVQKTGKEIDIRNLPLQDEKTFQLLGRGDTTGVFQLESSGMRNVLRGL
KPNEFEDIVAVNSLYRPGPMEQIPTFIESKHGKRKIEYLHPDLKPILERT
YGVIVYQEQIMQIASKLAGFSLGEADLLRRAVSKKNRDILDQERKHFVQG
CLQNGYDETSAEKIYDLIVRFANYGFNRSHAVAYSMIGYQLAYLKANYTL
EFMTALLSSAIGNEDKIVQYIRETKRKGFHVLPPSLQRSGYNFQIEGNAI
RYSLLSIRNIGMATVTALLEEREKKMFEDLFEFCLRMPSKFVTERNLEAF
VWSGCFDDFGVSRTNLWKSLKGALEYANLARDLGDAVPKSKYVQGEELSF
IEQLNKEKEVLGFYLSSYPTAQYVKLAKELEIPSLAQAMRHKKKVQRAIV
YITSVRVIRTKKLQKMAFITFCDQNDEMEAVLFPETYIHFSDKLQEGAIV
LVDGTIELRNHKLQWIVNGLYPLEEMDAYEEKKDASVYVKLPSQYEKKIL
NQVTKILFDYSGFAKVLIYYEKEHKMVQLSRSLSIHPSEECLGALREIVG
EENVVVKI
>BCE33L4043 dnaG, DNA primase
MGNRIPEEVVEQIRTSSDIVEVIGEYVQLRKQGRNYFGLCPFHGENSPSF
SVSSDKQIFHCFGCGEGGNVFSFLMKMEGLAFTEAVQKLGERNGIAVAEY
TSGQGQQEDISDDTVIMQQAHELLKKYYHHLLVNTEEGNEALSYLLKRGI
TKEMIEKFEIGYASPAWDAATKILQKRGLSLSSMEQAGLLIRSEKDGSHY
DRFRGRVMFPIYTLQGKVIAFSGRALGDDTPKYLNSPETPIFHKSKLLYN
FHQARPFIRKRGQVVLFEGYADVLAAVKSGVEEAVATMGTALTEEQAKLL
RRNVETVVLCYDGDKAGREATMKAGQLLLQVGCQVKVTSLPDKLDPDEYV
QQYGTTAFENLVKSSISFVGFKINYLRLGKNLQDESGKEEYVKSVLKELS
LLQDAMQAESYLKSLSQEFSYSMETLLNQLHQYRKEQKVQQKQVKQVSKP
SQIVQTKPKLTGFERAEREIIYHMLQSPEVAVRMESHIEDFHTEEHKGIL
YELYAYYEKGNEPSVGTFLSWLSDEKLKNIITDISTDEFINPEYTEEVLQ
GHLETLRRHQEKLEKMEIIFKIKQMEKTDPVEAAKYYVAYLQNQKARK
>BCE33L4320 dnaI, DNA replication primosomal protein
MEHIQNSFAKLMENENFKNRYEVLKAEVMAHPRVKEFIDEHRGEVTTSMI
ERSLVKLYEYIGQSVGCADCPDLGSCKNMLQGYEPKLVIQGKMIDIQYDR
CVRKVAYDERKKYEKLVQSVYMPTDILQATMENLDPSDLDARIDAIGAAN
EFLSTYEPGKKVQGLYLYGKFGVGKTYLLGAIANELARKKISSMLVYFPE
FLREIKSSIQDNSIGEKIDAVKRVQVLMLDDIGAEAMSSFVRDDVLGAIL
QFRMLENLPTFFTSNFDFKQLEHHLTYTQRGEAEEMKAARIMERIKYLAK
PIPIGGKNRRHK
>BCE33L0002 dnaN, DNA polymerase III, beta subunit
MRFTIQKDYLVRSVQDVMKAVSSRTTIPILTGIKVVATEEGVTLTGSDAD
ISIESFIPVEEDGKEIVEVKQSGSIVLQAKYFSEIVKKLPKETVEISVEN
HLMTKITSGKSEFNLNGLDSAEYPLLPQIEEHHVFKIPTDLLKHMIRQTV
FAVSTSETRPILTGVNWKVYNSELTCIATDSHRLALRKAKIEGIADEFQA
NVVIPGKSLNELSKILDESEEMVDIVITEYQVLFRTKHLLFFSRLLEGNY
PDTTRLIPAESKTDIFVNTKEFLQAIDRASLLARDGRNNVVKLSTLEQAM
LEISSNSPEIGKVVEEVQCEKVDGEELKISFSAKYMMDALKALDSTEIKI
SFTGAMRPFLIRTVNDESIIQLILPVRTY
>BCE33L2430 dnaN, DNA polymerase III, beta subunit
MEFIVNHKHFTQALSDVSKAISAKAIIPILSGIKITADQSGITLIASNSN
IFIEKFIPSAIDDKQITTILQAGTIVVPAKYFIEIIKKMPSDIVIKSKNE
QTITIQSGEITLNLNGFPANEFPNVPQIDGHTEIQIETKQLIDAFKQTVF
AVAKNESRPVLTGVHIELDHNKLICAATDSHRLAIRETLISTNMKANCIV
PSATINELLKLMNSNLEFVSIYLSESHIIFTFGTTTLYSRLIEGKYPNIS
TLIPNEFQTVINIDRQRMLQGVDRSSLLASERANNNVNLEIVNESTIQIS
SNASQIGKISETQQIDVIQGKKQLNISFDGRFMLDALRAIKEETVTLSFS
GSMRPILIEAGTQSAAIHLISPVRAY
>BCE33L0019 dnaX, DNA polymerase III, gamma and tau subunits
MSYQALYRTWRPQKFEDVVGQKHVTKTLQNALLQEKVSHAYLFSGPRGTG
KTTIAKVFAKAINCEHAPVAEPCNECPSCLGITQGSISDVLEIDAASNNG
VDEIRDIRDKVKYAPSAVEYKVYIIDEVHMLSMGAFNALLKTLEEPPGHV
IFILATTEPHKIPPTIISRCQRFEFRKISVNDIVERLSTVVTNEGTQVED
EALQIVARAAEGGMRDALSLIDQAISYSDERVTTEDVLAVTGSVSQQYLG
NLVECIRENDVSRALRIIDEMMGKGKDPVRFMEDFIYYYRDMLLYQTSPQ
LEHMLERVMVDEQFRMLSEEMQPEVIYEIIHTLSKGQQEMKWTNHPRIFL
EVVMVQLCQQFMMQANGADRLQAIMNRMQQLEKELEQVKKNGVPVGVQPE
VRETRTAPKPVRTGSMKIPVGRVNEVLKQAKRQDLEQLKAVWGELLGRLK
AYNKVAFAVLLENSEPVAASDDTYVLAFQYEIHCKMASENREAMDTLEQA
LFELLSKRLNMIAIPKSEWGKIREDFLQREGGSSEESPEKKEDPLIEEAV
KLVGQELIEIKE
>pE33L466_0313 dnaX, DNA polymerase III, gamma and tau subunits, N-terminal region
MTNYVALYRAYRPNSFNDLVGQDHIKTTIMNAIKLEKVAHAYLLSGLRGT
GKTTVAKIIGKDVNCLTPLNNGEPCQKCANCSDINNNKFADILEFDAASN
NVVEEIRQIRDQVHLAPVTGKYKVCTLSMN
>BCE33L3496 exoA, exodeoxyribonuclease III
MKFISWNVNGLRAVIAKGGFLEYLEESNADIFCLQEIKLQEGQIDLNVED
YYTYWNYAVKKGYSGTAIFSKKEPLSVTYGLGIEEHDQEGRVITLEFEGF
YIITLYTPNAKRGLERLDYRMKWEDDFRAYIKRLDEKKPVIFCGDLNVAH
KEIDLKNPKSNRKNPGFSDEEREKFTCILEEGFIDTYRYLYPDQEGAYSW
WSYRMGARAKNIGWRLDYFVVSERIKNQIKDAKINSEVMGSDHCPVELHI
NF
>BCE33L1044 gerPC, spore germination protein PC
MNQDIYTYLHQLQQALQVQQATILNLEDQVRQLQEELNELKNRPSSSIGK
VEYKFDQLKVENLNGTLNIGLNPFSTKEQQIEDFQVDTETLKVNPETDTN
PDFYQGILQEMHRYLDEEAYNRILHFEQEERTPLDEMYRQMMVDDIKKQM
EHRLPYYLSQAQSYEGISTDPDYLRDIIIQAMKHDIDKAFLSFIQHIPGN
FRKE
>BCE33L0006 gyrA, DNA topoisomerase II, ATP-hydrolyzing (DNA gyrase, subunit A)
MSDNQQQARIREINISHEMRTSFLDYAMSVIVSRALPDVRDGLKPVHRRV
LYAMNDLGITADKAYKKSARIVGEVIGKYHPHGDSAVYETMVRMAQDFSQ
RYMLVDGHGNFGSVDGDSAAAMRYTEARMSKISMELIRDISKNTIDYQDN
YDGSEREPIVLPARFPNLLVNGTTGIAVGMATNIPPHQLGEVIDGVLALS
HNPDITIAELMEFIPGPDFPTAGLILGRSGIRRAYETGRGSIILRAKVEI
EEKSNGKQSIIVTELPYQVNKARLIEKIAELVRDKKIEGITDLRDESDRN
GMRIVMEVRRDANANVLLNNLYKHTALQTSFGINMLSLVNGEPQVLNLKQ
NLYHYLEHQKVVIRRRTAYELEKAEARAHILEGLRIALDHLDEVITLIRS
SKTADIAKQGLIERFGLSEKQAQAILDMRLQRLTGLEREKIEQEYQDLMK
LIAELKAILADEEKVLEIIREELTEVKERFNDKRRTEITIGGMEFIEDED
LIPEQNIAITLTHNGYIKRLPASTYKTQNRGGRGVQGMGTNDDDFVEHLL
TTSTHDHILFFTNKGKVYRTKGYEIPEYSRTAKGLPIINLLGVDKGEWIN
AIIPIREFGDDQFLFFTTKQGISKRTPLSSFANIRTNGLIAISLREEDEV
ISVRLTSGDKDIIVGTSNGMLIRFNEQDVRSMGRNAAGVKAITLGEEDQV
VGMEIVEEDVNVLIVTKNGYGKRTPIDEYRLQSRGGKGLKTCNITDKNGK
LVAVKSVTGEEDIMLITAAGVIIRMPVDQISQMGRNTQGVRLIRLEDEQE
VATVAKAQKDDEEEASEEVSSEE
>BCE33L0005 gyrB, DNA topoisomerase II, ATP-hydrolyzing (DNA gyrase, subunit B)
MEQKQMQENSYDESQIQVLEGLEAVRKRPGMYIGSTSGKGLHHLVWEIVD
NSIDEALAGYCDEINVSIEEDNSILVTDNGRGIPVGIQEKMGRPAVEVIM
TVLHAGGKFGGGGYKVSGGLHGVGASVVNALSTELEVFVHRDGKIHYQKY
ERGIPVADLKVIGDTDKTGTITRFKPDPEIFKETTEYEFDTLATRMRELA
FLNRNIKLTIEDKREHKQKKEFHYEGGIKSYVEHLNRSKQPIHEEPVYVE
GSKDGIQVEVALQYNEGYTNHIYSFTNNIHTYEGGTHEVGFKTALTRVIN
DYGRKNSILKDADSNLTGEDVREGLTAIVSIKHPNPQFEGQTKTKLGNSE
ARTITESVFSEAFEKFLLENPNVARKIIDKGTMAARARVAAKKARELTRR
KSALEVSSLPGKLADCSSKDPAISEIYIVEGDSAGGSAKQGRDRHFQAIL
PLKGKIINVEKARLDKILSNDEVRTIITAIGTNIGGDFDIEKARYHKVII
MTDADVDGAHIRTLLLTFFYRYMRQIIECGYIYIAQPPLFKVQQGKKIQY
AYNEKELEKILAELPAQPKPGIQRYKGLGEMNPTQLWETTMDPEVRSLLQ
VSLQDAIEADETFEILMGDKVEPRRNFIQENAKYVKNLDI
>BCE33L0027 holB, DNA polymerase III, delta prime subunit
MTKTWEQLSAIQPIGVKMLMNSIAKERISHAYLLEGGKGTGKFATAIQMA
KSFLCSQRNRVEPCHICTNCKRIDSGNHPNLHIVKPDGLSIKKQQIHDLQ
EEFSKTGLEANKKVYIIEHADRMTANAANTLLKFLEEPSSDTTAILLTEQ
SHQILNTILSRCQVVTFRPLPTESLIRRLQDEGITVSLSTLAAQLTNSFD
EALTLCNDEWFAQARALVIKLCEALEKDKASIFFVQEKWGKHFGEKEQLQ
QGLDMLLLIYKDLLYVQLGEEDRLVFREQKEMFESFSYAQKRIVSALFNI
LEAKNRINANVNAQLVFEQLVLRLQEG
>BCE33L3486 hup, DNA-binding protein HU
MNKTELIKNVAQNAEISQKEATVVVQTVVESITNTLAAGEKVQLIGFGTF
EVRERAARTGRNPQTGEEMQIAASKVPAFKAGKELKEAVK
>BCE33L1392 hup, DNA-binding protein HU
MNKTDLINAVAEASSLSKKDATKAVDAVFDSILEALKQGDKVQLIGFGNF
EVRERAARKGRNPQTGEEIEIAASKVPAFKPGKALKDAVK
>BCE33L2136 hupA, DNA-binding protein HU
MNKTELIKNVAQSADISQKDASAAVQSVFDTIATALQSGDKVQLIGFGTF
EVRERSARTGRNPQTGEEIQIAAGKVPAFKAGKELKEAVK
>pE33L466_0194 insK, transposase
MKALVELAHIPRSTYYNLVKKMNRPDVDADLKAEIKAIYEENEGRYGYRR
IRDELTNRGQKVNHKKVQRIMKELGFKCVVRMKKYKSYKGKVGKIAPHIL
ERNFSADAPNQKWVTDITEFKLFGEKLYVSPVLDLYNGEIITCTIGSRPT
YSLVSEMLEKALERLPENHQLLMHSDQGWHYQMRQYVRTLESRGIVQSMS
RKGNCYDNAVIENFFGIMKSEFLYIKEFESVEHFKIEFEKYIEYYNTKRI
KAA
>BCE33L4642 kapD, sporulation inhibitor
MDEQRFLFLDFEFTMPQHRKKPKGFFPEIIEVGLVSVVGCKVEDTYSAHV
RPKTFPSLTDRCKKFLGIKQEVVDKGISFPELVEKLAEYEKRCKPTIVTW
GNMDMKVLKHNCEKAGVDFPFLGQCRDLSLEYKKFFGERNQTGLWKAIEA
YGKVGTGKHHCALDDAMTTYNIFKLVEKDKEYLVKPAPPTLGELVDFSKV
LKKVSTQ
>BCE33L0279 ligA, DNA ligase (NAD+) (NAD-dependent DNA ligase)
MSKEIAKKRIEELRDLLNTFNYQYHVLDNPSVSDAEYDRNMQELIKLEAE
NPEFMSEDSPSVRVGGTVLDIFEKVTHKSPMLSLGNAFNEGDLRDFDRRV
RQGIDDANVRYICELKIDGLAVSLHYEKGRFIQGATRGDGVTGEDITQNL
KTIKAIPLRLNEEVTLEARGEAYMPKRSFVKLNEEKEQNGEDVFANPRNA
AAGSIRQLDPKIAAKRNLSMFVYGLANVEEKTIPSHSESLDFLGELGFKT
NPNRRTCETIEEVIAYVEEWQEKRPHLDYEIDGIVIKVDDVALQESLGTT
AKSPRWAIAYKFPAEEVVTRLTGIELSVGRTGVVTPTAELEPVRVAGTIV
RRASLHNEDLIREKDIRIGDYVVVKKAGDIIPEVVNVIFDKRTGEEEEYH
MPTHCPACESELVRLEEEVALRCINPTCPAQIREGLIHFVSRNAMNIDGL
GERVITQLFDADYIRTFADLYSLTKEQLLQLERFGEKSATNLVQAIENSK
ENSLERLLFGLGIRHVGAKAARTFAEHFETMDALVKATEEELKTINEIGE
KMAQSVVAYFDNEDVLELLQQFKEYGVNMTYKGIKIADLQNVESYFAGKT
VVLTGKLEVMGRSEAKKKIEALGGKVTGSVSKSTDLVVAGEAAGSKLAQA
EKHNVEVWNEERFLQELNK
>BCE33L0048 mfd, transcription-repair coupling factor
MIGLLEQFYKNEEIQSVINGLEDGLKEQLVSGMATSSRSLLMAALYKKTK
KSQLIVTHNLYQAQKVHEDLVALLGEKDVWLYPVNELIASELGVASPELK
AQRIEVLNRLAAGEHGIIVAPVAGLRRFLPMKELWKQRQIEINLGQEIDL
DTFLHTLHHIGYERKSMVEAPGEFSLRGGILDIYPLTEELPFRIEFFDTE
VDSIRLFDVDEQRSQDKKESVRFGPATEFLFSQEELKSGIKHLEEGLTKT
MQKLSDDKLKTTVLETVSHEIEMLKNGQSIEQMFKYLSIFYNEPASLIDY
LPEDGVVILDEISRIQETASHLETEEAEWYISLLSEGTIIQDLSFSHSFE
EFLHHKKRSFVYLTLFLRHIAHTHPQNIVNVTCKTMQDFHGQMQLLKTEI
DRWNEGHFTTVVLGTDDERVKKLQHILSDYDIDADIVEGTDILLPGRLQI
AVGDLHAGFEMPMQKLVVITEKELFHKKVKKSQRKQKLSNAERIKSYSEL
KVGDYVVHVNHGIGKFLGIETLEINGVHKDYLNIKYQGNDKLYVPIEQID
QVQKYVGSEGKDPKVYKLGGNDWKKVKTKVEKSVQDIADDLIKLYAEREA
SKGYAYTPDTAEQQEFESSFPYQETEDQLRSIEEIKKDMERGRPMDRLLC
GDVGYGKTEVAIRAAFKAIMDEKQVAILVPTTILAQQHYETIRERFQDYP
INIGLLSRFRTRKQQNETIKGLKDGTVDIVIGTHRILSKDVTYKDLGLLI
IDEEQRFGVTHKEKIKQLKANVDVLTLTATPIPRTLHMSMLGVRDLSVIE
TPPENRFPVQTYVVEYNPALMREAIERELARGGQVYFLYNRVEDIERKAD
EISMLVPDARVTYAHGKMNESELESVMLSFLEGQHDVLVSTTIIETGVDI
PNVNTLIVFDADRMGLSQLYQLRGRVGRSNRVAYAYFAYKRDKVLSEVAE
KRLQAIKEFTELGSGFKIAMRDLSIRGAGNLLGAEQHGFIDSVGFDLYSQ
MLKDAIEQRRGTDGVENTVNVEIDLEVDAYLPDAYISDSKQKIMMYKQFR
GVSAIEDIEELQEEMIDRFGDYPQEVGYLLQIANIKVLAMKEQIELIKQN
KFEVTILFSEQASQNIDGGKLFMLGNSFGRMIGLGMEGSQLKIVMKTNGL
ETSKWLTIAENLLKGLPDVKKEVINA
>pE33L466_0002 min, possible DNA-invertase, C-terminal region
MQEFNDKEIHFVSIKDGIDTSTTMGRFLFHIFGAMAEMEREVINERVVSG
VAAAKERGKQGGRKRAHTSEQIEGMLKMGEQGLPKVDICKMFNVSRATLY
RYINEEEKNTKNLK
>BCE33L3527 mutL, DNA mismatch repair protein, MutL family
MGKIRKLDDQLSNLIAAGEVVERPASVVKELVENSIDANSTSIEIHLEEA
GLSKIRIIDNGDGIAEEDCIVAFERHATSKIKDENDLFRIRTLGFRGEAL
PSIASVSELELITSTGDAPGTHLIIKGGDIIKQEKTASRKGTDITVQNLF
FNTPARLKYMKTIHTELGNITDIVYRIAMSHPEVSLKLFHNEKKLLHTSG
NGDVRQVLASIYSIQVAKKLVPIEAESLDFTIKGYVTLPEVTRASRNYMS
TIVNGRYVRNFVLMKAIQQGYHTLLPVGRYPIGFLSIEMDPMLVDVNVHP
AKLEVRFSKEQELLKLIEETLQAAFKKIQLIPDAGVTTKKKEKDESVQEQ
FQFEHAKPKEPSMPEIVLPTGMDEKQEEPLAVKQPAQLWQPPKQEWQPPQ
SLVREEQSWQPSTKPIMEEPIREEKSWNSNDEDFELEELEEEVQEIEEIE
MNGNDLPPLYPIGQMHGTYIFAQNDKGLYMIDQHAAQERINYEYFRDKVG
RVAQEVQELLVPYRIDLSLTEFLRVEEQLEELKKVGLFLEQFGHQSFIVR
SHPTWFPKGQETEIIDEMMEQVVKLKKVDIKKLREEAAIMMSCKASIKAN
QYLTNDQIFALLEELRTTTNPYTCPHGRPILVHHSTYELEKMFKRVM
>BCE33L4327 mutM, formamidopyrimidine-DNA glycosylase
MPELPEVENVRRTLENLVTGKTIEDVIVTYPKIVKRPDDAEIFKEMLKGE
TIENIKRRGKFLLLYVTNYVIVSHLRMEGKFLLHQEDEPIDKHTHVRFLF
TDGTELHYKDVRKFGTMHLFKKGEEMNQMPLADLGPEPFDAELTPQYLHE
RLQKTNRKIKVVLLDQRLLVGLGNIYVDEVLFRSQIHPEREASSLTAEEI
ERIYEATVTTLGEAVKRGGSTIRTYINSQGQIGSFQELLNVYGKKGEPCV
TCGTILEKTVVGGRGTHYCPICQPRI
>BCE33L3528 mutS, DNA mismatch repair protein, MutS family
MTQYTPMIQQYLKVKADYQDAFLFFRLGDFYEMFFEDAVKAAHELEITLT
SRDGGSSERIPMCGVPYHAAKNYIEQLVEKGYKVAVCEQVEDPKTAKGVV
RREVVQLITPGTMMEGRTIDEKENNFLAALTHFEDGSYALACNDLTTGQN
TVTLLTGSVEDILLEVYATGSKEIVVDSSFSKDELNKLTETLKMTISYED
ATAIPEGLEHLVKNVSQAKLIKAVGRLFNYVIRTQKRSLDHLQPVEIYYT
NQFMKIDVHSKRNLELTETLRTKEKTGSLLWLLDKTKTAMGGRMLKQWME
RPLIQKERIEERLEMVETFVNDYFLREDLKEKLKEVYDLERLAGKVAFGN
VNARDLLQLRRSLLQVPAILEAISLLDNAYAARLIQGADPCESLTELLGR
SIQENPPLSIKDGDIIKDGYNDKLDQYRYVSKNGKTWIAELEKRERDITG
IKSLKIGYNRIFGYYIEVTKANLGALPEGRYERKQTLANAERFITDELKE
KETLILEAEEKIVQLEYDLFTALREEVKVFIPKLQHLAKVISELDVLQSF
ATVSEEEQFVKPVLTTKREIFIKDGRHPVVEKVLNGKLYVPNDCIMPEKM
DVFLITGPNMSGKSTYMRQLALVTVMSQIGCFVPATEAVLPVFDQIFTRI
GAADDLISGQSTFMVEMLEAKNAIANASERSLILFDEIGRGTSTYDGMAL
AQAIIEHIHDQIGAKTLFSTHYHELTVLEDSLDQLKNVHVSAIEENGKVV
FLHKIQDGAADKSYGIHVAQLAELPDSLIARAKEVLAQLEGQEEIIIPKR
VEVKAQEQEQEVIPEPIVVKEEPVEIEETKVDNEEESQLSFFGAEHSSKK
QDKPVLDAKETAVLSQIKKIDLLDMTPLEAMNELYRLQKKLKKG
>BCE33L3205 mutS, DNA mismatch repair protein
MNTMTFEKLQYNELKDIVKSYCVSGLGKELINKLEPSTSIKVVRNRLNET
TEARAILDAEGHVPFFGISNIASTIQKLEKGMILDPEELVSVSDFLRGCR
KIKKFMLDKEFFAPVLASYANSMTEYKSIEEEINFSIKGNSIDSAASKEL
KRIRNNIDSVDGKIKERLTKFLNSSANKKYIQEFFISKKDDRYTIPIKSS
YKNQVAGSIVEASAKGSTVFIEPHTVTKLNAELASLKAEEAMEEYQILAT
LSGMVVENIYHIKINMELISQYDMVFAKAKFSKSIDGIEPKLNDHGYVHL
VNCKHPLLSGKVVPLNFEIGQNYRSLIITGPNAGGKTIVLKTIGLLTLAT
MSGLHIAGDKETEIAIFENVFVDIGDNQSIENALSTFSSHMKNLSEIMRM
SNNNTLLLFDEIGSGTEPNEGAALAISILEEFYLAGCITVASTHYGEIKR
FSEMHDDFMNAAMQFNSETLEPLYKLVIGKSGESNALWIANKMNVREHVL
KRAKAYMGNKEYTLEKVNESKIRKPKFLQEKRENHYEYKIGDRVNLLDHD
DFGIIYKEKDNFYNVVVYYNGEFIEVNVKRITLEVAAKELYPEGYDLNTL
FVDYKERKMQHDIERGSKKALRNIQKEMRKNRG
>BCE33L4296 mutSB, DNA mismatch repair protein, MutS family
MLERTLRVLEYNKVKEQLLEHTASSLGRDKVKHLVPSTDFEEIVEMQDTT
DEAAKVIRLKGSAPLGGITDIRSNVKRAKIGSMLSPNELLDIANTMYGSR
NMKRFIEDMVDNGVELPILETHVAQIVSLYDLEKKITNCIGDGGEVVDSA
SDKLRGIRTQIRTAESRIREKLENMTRSSNAQKMLSDSIVTIRNERYVIP
VKQEYRGVYGGIVHDQSASGQTLFIEPQVIVELNNALQEARVKEKQEIER
ILLMLTEEVAVEADIVLSNVEVVANLDFIFAKAFYAKRIKATKPIVNNER
YMDLRQARHPLIDPEVIVPNNIMLGKDFTTIVITGPNTGGKTVTLKTVGI
CVLMAQSGLHIPVMDESEICVFKNIFADIGDEQSIEQSLSTFSSHMVNIV
DILEKADFESLVLFDELGAGTDPQEGAALAISILDEVCNRGARVVATTHY
PELKAYGYNREQVINASVEFDVNTLSPTYKLLIGVPGRSNAFEISKRLGL
SDRVIDQARNHISTDTNKIENMIAKLEESQKNAERDWNEAEALRKQSEKL
HRELQRQIIEFNEERDERLLKAQKEGEEKVEAAKKEAEGIIQELRQLRKA
QLANVKDHELIEAKSRLEGAAPELVKKQKVNVKNTAPKQQLRAGDEVKVL
TFGQKGQLLEKVSDTEWSVQIGILKMKVKESNMEYINTPKQTEKKAVATV
KGRDYHVSLELDLRGERFENAMARVEKYLDDAQLASYPRVSIIHGKGTGA
LRQGVQDYLKKHRGVKTFRYGDMGEGGLGVTVVELK
>BCE33L4849 mutT, MutT/Nudix family protein
MQRVTNCVLIRDNEVLLLQKPRRNWWVAPGGKMERGETVRDSVVREYREE
TGIYLKNPALKGVFTFVIQEGDKVVSEWMMFSFLATDFAGENKLESEEGI
IGWHTFDKIDDLAMAPGDYHIIDYLIKGNGIIYGTFVYTPDFELLSYRLD
PS
>BCE33L0450 mutT, MutT/NUDIX family protein
MKYSLLLGGLHVEHKTPKHIVAVAGYLTNEKDEVLLAKVHWRADTWELPG
GQVEEGEALDQAVCREIKEETGLTVKPIGITGVYYNASMNILAVVFKVAY
VSGEIKIQHEEIKEAKFVALNEENIDEYITRPHMKSRTLDAMRSSHFIPY
ETWEVQPYNLIGRL
>pE33L80004 mutX, phosphohydrolase, MutT/Nudix family protein
MSVNWKSVEHRIYTMCMIQNGDEILLINRPNHLGFPGYLAPGGKIEFPES
IVEGAAREVKEETGLTVSNLIFKGLDEYVNPKANVRYMVFNYWTDTFKGE
LLKNPPEGELLWIPIDEVLNLPMQDWFKERLNLYLEPGTFEIQRLWDDDL
DKQVDIKITRT
>BCE33L0430 mutY, A/G-specific adenine glycosylase
MTLEILNNFNIEQFQNDLIGWFEKEQRDLPWRKNKDPYRVWVSEIMLQQT
RVEAVKPYYANFMGKFPTLEALANADDEEVLKAWEGLGYYSRARNLHAAV
KEVKEVYGGIVPSDVKKIEKLKGVGPYTKGAILSIAYGIPEPAVDGNVMR
VLSRILSVWDDIAKPKTRKVFEEIVREIISAENPSYFNQGLMELGALICI
PKNPACLLCPVREHCRGYAEGVQKELPVKSKAKAPTMVPIVAGVLQTEDG
RYVINKRPSTGLLANMWEFPNIELGEGIRNQKEQLIDYMKEKFELSISIE
EYAMNVQHTFTHRTWDIFVFYGKVTGDIVETDTLKFVSKEAFEQLPFSKS
HRTIYESCVEKITMQ
>BCE33L4034 nfo, deoxyribonuclease IV, phage T4-induced (endonuclease IV)
MLKIGSHVSMSGKKMLLAASEEAVSYGATTFMIYTGAPQNTRRKPIEELN
IEAGRKHMEQNGIEEIIVHAPYIINVGNTTKPETFQLGVDFLRMEIERTS
ALGVAKQIVLHPGAHVGAGADAGIQQIIKGLNEVLTPDQTVNIALETMAG
KGTECGRSFEEIAKIIDGVKYNEKLSVCFDTCHTHDAGYDIVNDFDGVLN
EFDKIVGIDRLQVLHINDSKNVRGAGKDRHENIGFGHIGYKALHHIVHHP
QLTHVPKILETPYVGEDKKDKKPPYKLEIEMLKNGTFDEGLLEKIKAQ
>BCE33L1698 nheA, enterotoxin; possible non-hemolytic enterotoxin lytic component L2
MKKTLITGLLVTAVSTSCFIPVSAYAKEGQTEVKTVYAQNVIAPNTLSNS
IRMLGSQSPLIQAYGLVILQQPDIKVNAMSSLTNHQKFAKANVREWIDEY
NPKLIDLNQEMMRYSTRFNSYYSKLYELAGKVNEDEQAKADFTNAYGKLQ
LQVQSIQESMEQDLLELNRFKSVLDKDSNNLSIKADEAIKTLQGSSGDIV
KLREDIKRIQGEIQAELTTILNRPQEIIKGSINIGKQVFTITNQTAQTKT
IDFVSIGTLSNEIVNAADSQTREAALRIQQKQKELLPLIQKLSQTEAEAT
QITFVEDQVNSFTELIDRQITTLETLLTDWKVLNNNMIQIQKNVEEGTYT
DSSLLQKHFNQIKKVSDEMNKQTNQFEDYVTNVEVH
>BCE33L1429 nth, endonuclease III
MLNKTQIRYCLDTMADMYPEAHCELIHDNPFELVIAVALSAQCTDALVNK
VTKNLFQKYKTPEDYLSVSLEELQQDIRSIGLYRNKAKNIQKLCRMLLDD
YNGEVPKDRDELTKLPGVGRKTANVVVSVAFGIPAIAVDTHVERVSKRLA
ICRWKDSVLEVEKTLMKKIPMDEWSVTHHRMIFFGRYHCKAQRPQCEECP
LLEVCREGKKRMKGK
>BCE33L2453 ntpA, dATP pyrophosphohydrolase
MRAPYQVLIFPYIKTDDSIQYAIFNRSDYGYWQGIAGGGEDGEIPIESAK
REAFEEAGITRDCPYIQLDSVSSLPVEDVVGGFLWGDEVYVIKEFSFGVK
VPNKNISLSKEHLHYKWLCFEEAVKFLKWDSNKTALWELNKRLLK
>BCE33L3851 nudF, ADP-ribose diphosphatase
MSNLAERTVKTEPIFDGRVIKVRVDDVVLPNGAMSKREIVNHPGAVAIIA
ITDEGKIVLVEQYRKALEKAIIEIPAGKLEPGEKPEVTAVRELEEETGYV
CENMELITSFYTSPGFADEILYVYKATGLTKKENKAALDEDEFVELMEVS
LEEATTLMKDLRIHDAKTMFAVQYLQLQK
>BCE33L3377 nudG, phosphohydrolase, MutT/Nudix family
MERWIGCAAVCVNERNEVLMVLQGQKGEEKRWSVPSGGLEKGETLEECCI
REVWEETGYNVEVVSKIYEKEGITYGVPVNVHYYVVKKMGGSMKIQDPDE
LIHEIAWKGIEEIKQITLSFPEDYEILNKYINKKASV
>BCE33L3498 ogt, methylated-DNA--protein-cysteine S-methyltransferase
MNSYKNNYIYWTLLTHANWKFHIAATETGLCFIGSQNETFEEVNIWARKK
LPQHILIHSPDYLQVYTKEIIEYLENKRETFTSPIDAYGTAFQLSVWSTV
REIPYGKTYSYTEIADRIQKPTAVRAVASAIAANPILITIPCHRVIGKNG
KLTGFRGGLEMKKELLALEKLQVEFI
>BCE33L1844 ogt, methylated-DNA--[protein]-cysteine S-methyltransferase (6-O-methylguanine-DNA methyltransferase)
MYQAYYESELGLLEITANDKGITSVIFVDERQEERTNEMIDQCINELDEY
FKGKRKEFTVPLSAEGTSFQKNVWDALYTIPYGVSASYLDIAEKVGNTKA
VRAIGGANSRNPISIIVPCHRVIGKSGKLVGYAGGLWRKEWLLKHEGILK
>BCE33L3303 parC, DNA topoisomerase IV, subunit A
MQAEKFHDLPLEDVLGDRFARYSKYIIQDRALPDARDGLKPVQRRILYSM
YVEGNVHDKAFRKSAKTVGNVIGNYHPHGDSSVYEAMVRLSQTWKVRNVL
VEMHGNNGSVDGDPAAAMRYTEARLSPIASELLRDLDKETVEFVSNFDDT
SEEPVVLPAAFPNLLVNGSTGISAGYATEIPPHHLGEVIDATMMRIDKPN
STVDDLLTVMKGPDFPTGGIIQGIDGIKKAYETGKGKIIIRGKAEVETVR
GGKQQIVITEIPYEVNKANLVKKMDELRLDKKLDGIAEVRDETDRTGLRI
VVELKKEANSEGILNYLYKNTDLQIPYNFNMVAINNRRPTLMTLPKILDA
YIGHQKEVVTRRSQYELRKAENRQHIVEGLKKALSILDQVIETIRASKDK
RNAKDNLSAKFGFTEAQAEAIVSLQLYRLTNTDITALQQEADELNKKIIE
LQAILQSEKRLLQVIKTDLKRVKKTYSDDRRAIIEDQIEEIKIDVEVMIP
QEDVIVTVTKEGYVKRTGWRSHNASNGKDFGMKEGDILLERFDTNTTETV
LLFTNKGNYIYLPVYEMPDIRWKDLGQHVANIVSLDRDETIIWATVVPNF
EEEKRFIVFVTKNGMIKKTELNQYKVQRYSRAFVAVNLKKDDEVVDIFAT
DGTSDIVLATHGAYALIFHEDEVSPVGVRAAGVKAINLKEDDYVASGKPL
NADKDQLILVTQRGAVKRLKASEIEKSTRAKRGLVIFKELKRNPYRIVGI
EIVRDDELVYMKTEKHIVEEIDPKAYRNKDRYSNGSLVLDVNDTGEVIET
WTKKRPE
>BCE33L3304 parE, topoisomerase IV, subunit B
MAKHQFQYNEDAIQVLEGLEAVRKRPGMYIGSTDSRGLHHLVYEIVDNSV
DEALAGFGDEISVVIHKDNSISVIDKGRGMPTGMHKLGKPTPEVILTVLH
AGGKFGQGGYKTSGGLHGVGASVVNALSEWLVVTIKRDGNIYEQRFENGG
VPVTTLEKIGKTKESGTTMHFKPDTTIFSTTNYNYETLCERLRESAFLLK
GMKISIKDERNDLEDVFHYETGIEAFVSYLNEEKDSIHPVVYFTGEQNGI
EAELAFQFNDGYSENILSFVNNVRTKDGGTHEAGFKTAMTRVFNEYARKV
SLLKEKDKNLEGTDIREGVAAIVSVRVPEEVLQFEGQTKGKLGTSEARSS
IDAIVSEHLAYFLEENPDVATLLVRKAIKAAQAREAARKAREEARTGKKK
KKSEGTLSGKLTPAQSRNPQKNELYLVEGDSAGGSAKQGRDRRFQAVLPL
RGKVINTEKAKLADIFKNEEINTIIYAIGGGVGNEFDVEDINYDKVVIMT
DADTDGAHIQVLLLTFFYRYMKPLIEAGKVFIALPPLYKVSKGKGKSEVI
EYAWSDEELDSVTKKVGKGYMLQRYKGLGEMNADQLWETTMNPETRTLIR
VKIDDAARAERRVTTLMGDKVEPRRKWIERNVQFGMQEEGNILENEMIME
TEVE
>BCE33L1120 pcrA, ATP-dependent DNA helicase
MTQEKFSQKSLTHADIPRATYHIPKTSTHLIMEEDADAAYFRALEQQNVF
LNEKQLEAVRTTEGPVLTLAGAGSGKTSVLTTRVGYLVNVKQVHPRNILL
LTFTQKAAEEIRNRVANLPGMNHAASSYVVAGTFHSVFLKLLRSQGYNQQ
ILANEKHKQIMIKKILKELRLKDDYDAETMLAMISLEKNKLNRPKDVKAK
TPVEQEFKEVYERFEEVKQRYNYIDFDDILLETYYMLENNAPLLTQLQQR
FHYIEVDEFQDTSYAQYGIVKLLASPRNNLFIAGDDDQAIYGWRGASHQI
ILSFPKEFDNTTIIALNTNYRSNPFIVGLGNEVIKLNQERFDKELYSVRE
EGVQPFYARPATTLDEANQILQLIQEKVDSGERNYKDFCLLYRTHSVSRS
LLDQLTIHKIPFIKHGASQSFYEHSLIKPVLDHLRLVIEPFRLESLSNIL
PTMYIGRDDCISFIEREQWKYGEGRFPSLLHFLLLNPSLKPFQVKKVNER
IDFIKFIKELEPKKALKEIIHGKGKYLEYLQSNDRSSFTMHKDIQEEMLE
ELMESANRFTDIPAYLQFVDEAIQGQKEMEALKTMPQKDAVSLMSIHNAK
GLEFPCVFLLGASDGILPHTSSLKDANDRVTETSEALEEERRLLYVAITR
AKEELYISSPQFFRGKKLDISRFLYTVRKDLPEKTSTK
>BCE33L0278 pcrA, ATP-dependent DNA helicase
MQAYMSMTDRLLNGLNPQQQKAVQTTNGPLLLMAGAGSGKTRVLTHRIAY
LLGEKGVAPWNVLAITFTNKAAREMRERIDTLVGPEAEDIWISTFHSMCV
RILRRDIDRIGINRNFTILDSGDQLTVVKKIMKERNIDPKKFEPRSILAG
ISNAKNELLSADKYAKKITIADPYEKLTSDVYTEYQKRLLKNNSLDFDDL
IMTTIQLFERVPEVLEFYQRKFQYIHVDEYQDTNKAQYLLVKHLAARFKN
LCVVGDSDQSIYRWRGADISNILSFEKDYENAQVILLEQNYRSSQNILNA
ANAVIERNTNRKPKKLWTDNEVGSKISYYRAATEKDEAYFVAKKIRDDIQ
MGKRKYTDFAVLYRTNAQSRMVEEIFLKSNIPYKIVGGTKFYDRKEIKDI
LAYLRLIGNPDDEISFARIINVPKRGIGATSIDKIINYGVQNGISLTAVF
DEIEHVGVSAKVTKAVKEFAGLLHNWVNMQEYLSVTELVEEVIEKTGYRD
MLKNERTLEAEGRLENLDEFLSVTQTFESQSEDKSLVAFLTDLALVADID
RVDEDPTAGEEVILMTMHSAKGLEFPVVFIVGLEEGIFPHTRSLMEEDEM
QEERRLAYVGITRAEEELYLSNAQMRTLFGRTSMNAASRFITEIPTELVE
SLNETAPKRETSFGAKGRTASSSKTTTTTRSRSAFARPAAKTTGGEQIGW
AVGDKASHQKWGVGTVVSVKGEGDAKELDIAFPSPIGVKRLLAKFAPVTK
Q
>BCE33L2452 pknB, serine/threonine protein kinase
MMSINCFVGLIQNELLKEVSIRSEDESEPVVVKDIPRLWKCLGKGNYAAV
FMHKEYKDWVVKVYAREGEGIEKESEVYRRIGNHPSYSKLIYKGENFIVL
KRLKEITLYDAVHKGIKIPKQVILDINKALEYARKQGLTPCDVHGKNVMM
EKGRGYVVDVSDFLKTKEDSKWKDLEKAYFTFYLPFIYNFPFPVKIPYFM
LNIVRRSYRKYKKLKKKFKL
>BCE33L4328 polA, DNA polymerase I
MKKIISNLYGEVSDLEKKVVLVDGNNIAYRAFFALPLLNNDKGIHTNAIY
GFTMMLMRILEEEKPTHMLVAFDAGKTTFRHKTYSEYKGGRQKTPPELSE
QFPFIREMLDAFNVPRYELENYEADDIMGTLAKEASEQGASVKVISGDKD
LLQLVSDNTLVCIPRKGITEVDEYTEEALFEKYSLSPKQIIDMKGLMGDQ
SDNIPGVPGVGEKTAIKLLTQFGTVEEVYENIDQVSGKKLKEKLEANKDQ
ALMSKDLATIITDAPITVNVDDMEYKGYEASDVIPMFENLGFTSLLNKLG
VTPEETAPAELDDITFDIVEEVTEEMLQQDSALIVEVQEDNYHKADIQGF
GIQNENGCYFIQTDIALKSDAFKEWLADGEMRKYTFDAKRAIVALKWNGI
DMQGIDFDLLIAAYLLDPADTDKDFRTVAKMKETHAVKSDEEVYGKGAKR
AVPELEIVAEHVARKVHVLYDVKQTFVEELEKNEQYELFTELELPLARVL
ADMEVKGVKVDTERLRNMGEELAGRLKEMEQEIYKLAGTEFNINSPKQLG
VILFENLNLPVIKKTKTGYSTSADVLDKLMDHHEIIPNILHYRQLGKLNS
TYIEGLLKVVHEDSSKIHTRFNQVLTQTGRLSSTDPNLQNIPIRLEEGRK
IRQAFVPSEEGWIMYAADYSQIELRVLAHIANDKGLVEAFQHDMDIHTKT
AMDVFGVEKDEVTSNMRRQAKAVNFGIVYGISDYGLSQNLGITRKAAAEF
IEKYLESFPGVQEYMDDIVKDAKQKGYVATLLNRRRYIPEITSRNFNLRS
FAERTAMNTPIQGTAADIIKKAMIIMADRLEEEGLQARLLLQVHDELIFE
APKEEIEKLEKLVPEVMEHAIELAVPLKVDYSYGPTWYDAK
>BCE33L3577 polC, DNA polymerase III, alpha subunit
MSLTNEQKERFQILLQQLQIPDDLINQYLQGGGIERLVIDKANKSWHFNL
QVPRILPTELYELLETKLKQSFSHIARTTFALETENKQFTEEEVRAYWPL
CTERITFSPMFAYLKKQLPQVNGVKLLINVNNELESTALKKNVAKPVGDQ
YEAFGFPRFQLDTHIQQNTEEMQKFREQTQQEDRERVIQAMEEMAKKQAE
ESSVVHEGPITLGYLIKPDEEITPMREIQDEERRKTVQGYVFHVETKELR
SGRTLLTLKITDYTDSIMIKMFSRDKEDIPMLQSLKKGMWVKARGSVQND
TFVRDLVMIANDINEITGPSRKDKAPEGEKRVELHLHTPMSQMDAVTPVS
KLVAQAGKWGHEAIAVTDHAVAQSFPEAYSAGKKAGVKVIYGVEANLVND
GVPIAYNEEHRLLADETYVVFDVETTGLSAVYDTVIELAAVKVKGGEIID
RFESFANPHQPLSATIIELTGITDDMLTDAPEVDEVFKKFEEWMGDHTLV
AHNASFDMGFINVGFKKAGLEKTKNPVIDTLELARFLFPEMKNHRLNTLC
KKMDIELTQHHRAIYDTEATGYLLVKMLKDVIEKGFEYHDQLNDSMGQGD
AYKRGRPSHMTLLATSDVGLKNLYKLVSYSHLNYFYRVPRVPRSLLKKYR
EGILVGTACDKGEVFEAMMQKAPEEVEEIAQFYDYIEVMPPEVLRHLVER
ELVRDEGQLKTIISNLVKLGETLDKPVVATGNVHYLDPEDAMYRKILVSS
QGGANPLNRHSLPPVHFRTTDEMLECFSFLGEDVAKEIVVTNTQKIASLI
GDVHPVKDDLYTPKIEGADDETRDMSYKMARSIYGEELPEIVEARLEKEL
KSIIGHGFAVIYLISHKLVKKSLVDGYLVGSRGSVGSSFVATMMEITEVN
PLPPHYVCPKCKQSEFFNDGSVGSGFDLPDKECPTCNIPYVKDGHDIPFE
TFLGFKGDKVPDIDLNFSGEYQPRAHNYTKVLFGEDYVYRAGTIGTVAEK
TAYGYVKGYANDHNLTIRNAEIDRLVAGCTGVKRTTGQHPGGIIVVPDYM
DIFDFSPIQYPADSIGAEWRTTHFDFHSIHDNLLKLDILGHDDPTVIRML
QDLSGIDPKTIPTDDPEVMKIFSGPESLGVTEEQINCKTGTLGIPEFGTK
FVRQMLEETKPTTFSELVQISGLSHGTDVWLGNANELIYNGTCTLSEVIG
CRDDIMVYLIYQGLDPSLAFKIMESVRKGKGVPEEWEEDMKSNNVPGWYI
DSCKKIKYMFPKAHAAAYVLMAVRIAYFKVHFALLFYAAYFTVRADDFDV
EAMAKGSASIRARIDEIAQKGLDAAPKEKSLLTVLEMTLEMCERGYSFQK
VDLYRSHATDFIIDGDSLIPPFNAVPGLGTNAALSIVEARKNGEFLSKED
LQQRSKVSKTIIEYLDSQGCLGDLPDQNQLSLF
>BCE33L3627 priA, primosomal protein N'
MKFASVIVDVPARQTDRPFDYIIPKKWEDIVQTGMRVVVPFGPRKLQGFI
IGIKNSVEVESKKLKTIHEILDVTPVLNEELLKLGYWLTSETLCYMISAF
QVMLPTAIKATYKKRLQLRKQEEVAPELLFLFQDKEAIDWDAIETQPHLY
RTIQQEIKHGTIEVVYQVKDKVQKKKQRVVQPELPEDKLELAAFELKSKK
QQDVLYYFVENYKSVPLKVITEELQITDAPIKALVKKGLISEKYVEVYRN
PYDDDDFEQTKPFPLTEEQKQVITPILSSITNETYNPFLLYGVTGSGKTE
VYLQSIAAVLEKGKEAIVLVPEIALTPQMVDRFKGRFGSQVAVLHSALSV
GEKYDEWRKILRKEVKVVVGARSAVFAPFENLGIIIIDEEHESSYKQEDN
PRYHARDVAVWRGQYHKCPIVLGSATPTLESFARAKKGVYELLTMEKRMN
EQALPTVEIVDMREELRDGNRSMFSKALHEKIADRLEKKEQMVLFLNRRG
HSTFVMCRDCGYVVQCPHCDISLTYHKMNHRLKCHYCSYEENMPTACPAC
QSTYIRFFGTGTQKVEEEITKLFPEARVIRMDVDTTSRKGMHEKLLKAFG
EEKADILLGTQMIAKGLDFPKVTLVGVLTADTMLHLPDFRASEKTYQLLT
QVSGRAGRHELPGEVIIQTYTPEHYSIELAKNQQYDVFFDQEMQMRRTRQ
YPPYYYVVLVTVSHPELLKAVQVTEKIVGHLRSHCSQQTMVLGPVASAIP
RIKDRYRYQCMIKYKREPNLKNVLKMVNEHYQAEMQKELQISIDFNPTML
M
>pE33L466_0244 radC, possible DNA repair protein
MGKRIMVQSVKLVKESNKIYDVERKRITCPDDINILARAVLCIDEMPHEV
FGVFNLNTKNEIIGCSIVSQGSINSSVVHPRETFRTAILNNAASIVCFHN
HPSGNPDASPEDREVTKRLAECGKILGIELLDHVIIGEHRFVSLREKGCI
R
>BCE33L4198 radC, DNA repair protein
MNGIRDVVKEEQPRERLLLEGAGSLSNRELLAVLLRTGSKEESVLKLSDK
ILHHFDGLRMLKDATLEELVSIHGVGVAKASQLIAAFELGRRMVRLEYQN
RYSIRNPEDCARYMMEEMRFLQQEHFVCLYLNTKNQVIHRQTIFIGSLNS
SIVHPREVFKEAFRRAAASIICLHNHPSGDPAPSREDIEVTKRLVECGRI
IGIEVLDHIIIGDHKFVSLKEKGHI
>BCE33L3539 recA, recA protein (recombinase A), N-terminal region
MSDRQAALDMALKQIEKQFGKGSIMKLGEQAERRVSTVSSGSLALDVALG
VGGYPRGRIIEIYGPESSGKTTVSLHAIAEVQRQGGQAAFIDAEHAMDPV
YAQKLGVNIDELLLSQPDTGEQGLEIAEALVRSGAVDIIVIDSVAALVPK
AEIEGDMGDSHVGLQARLMSQALRKLSGAINKSKTIAIFINQIREKVGVM
FGNQLQVA
>BCE33L3538 recA, recA protein (recombinase A), C-terminal region
MPETTPGGRALKFYSTVRLEVRRAEQLKQGNDIVGNKTKVKVVKNKVAPP
FRVAEVDIMYGEGISREGEILDMASELDIVQKSGAWYSYNEERLGQGREN
SKQFLKENTDLREEIAFFIREHHGISEDSGAEGMEDPNLLD
>BCE33L4137 recD, exodeoxyribonuclease V, alpha subunit
MGNQHAMDLFEEEKKFIKAQVLHTIFHNEENLYSVVSMKVIETNETYDEK
KVMINGHFPRMHEDEVFTLTGHFKDHPKYGKQYLVETFKKELPQTKAGMV
QYLASDLFKGIGKRTAEKIVDHLGEHAISKIMDDPEALNGVVNKQKAQEI
YETIVEHQGLEKVMSFLNGYGFGTKLSIKIYQQYKEMTLEVIRNNPYQLI
EEVDGIGFGRADDIGRALGISGNHDDRVRAGCFYTLENVSLQLGHVYMRK
DQLIRETMSLLNNQEGRVTEEDIISCIEMMQSEGKVIIEEERVYLASLFY
SEKGVVKSIRRLMNQEETPSFPEAEVLKTLGEIEEQLNVQYAPLQQEAIQ
TALHKPMMLLTGGPGTGKTTVIKGIVEMYASLHGLSLNPNEYSDDNPFPI
LLTAPTGRAAKRMSESTGLPACTIHRLLGWTPEGSFQRNETDPVQGKLLI
IDEFSMVDIWLANQLFKSLPTNIQVIVVGDEDQLPSVGPGQVLKDLLNAG
AVPTVKLTEIYRQAEGSSVIQLAHAIKNGTLPPDLAQNQKDRSFIGCTGA
QIVEVVKKVCENAKTKGFSARDVQVLAPMYRGPAGINVLNEALQEVFNPK
REKSKEIAYGDVVYRRGDKVLQLVNQPESQVFNGDIGEIVSVFYAKENVE
QQDMIIVSFDGIEVTYTKPDLNQITHAYCCSIHKSQGSEFPIVIMPIVKS
YNRMLRRNLIYTGITRSKKFLIICGEEAAFQSGVNRLDDAMRQTTLASRL
QESQGEVQMVTVNGEEMDVENISPYDFM
>BCE33L0004 recF, DNA replication and repair protein
MFISEIQLKNYRNYEKLELSFEDKVNVIIGENAQGKTNLMEAIYVLAMAK
SHRTSNDRELIRWDEDFGQIKGKLQKRNSSLSLELNISKKGKKAKLNQLE
QQKLSQYIGVMNVVMFAPEDLNLVKGSPQVRRRFLDMELGQIAPVYLYEL
SQYQKVLTQRNHLLKKMQGNSKNEETMLDVFTLQLIEHGAKILQKRFEFL
HLLQEWAAPIHRGISRGLEELEIVYKPSVDVSESMDLSKIKEVYYESFQS
VKQREIFRGTTLLGPHRDDLQFFVNSKNVQVFGSQGQQRTTALSLKLAEI
ELIYSEVKEYPILLLDDVLSELDDYRQSHLLNTIQGKVQTFVTTTSVDGI
EHETLKEAKTIHVTNGTVDCEIDRE
>BCE33L3614 recG, ATP-dependent DNA helicase
MNEVVQVPVTDVKGIGGETSELLHEMGIYTVSHLLEHFPYRYEDYAMKDL
AEVKHDERVTVEGKVHSAPLLQYYGKKKSRLTVRVLVGRYLITAVCFNRP
YYKQKLNLDETVTITGKWDQHRQTIAVSELHFGPVVRQQEVEPVYSVKGK
LTVKQMRRFIAQALKEYGDSIVEVLPNGLLSRYKLLPRYEALRALHFPTG
QEDLKQARRRFVYEEFFLFQLKMQTLRKMERENSKGTKKEIPSEELQEFI
DTLPFPLTGAQRRVVDEIMKDMTSPYRMNRLLQGDVGSGKTVVAAIGLYA
AKLAHYQGALMVPTEILAEQHYQSLAETFSHFGMKVELLTSSVKGVRRRE
ILAKLEQGEIDILVGTHALIQDEVIFHRLGLVITDEQHRFGVAQRRVLRE
KGESPDVLFMTATPIPRTLAITAFGEMDVSIIDEMPAGRKVIETYWAKHD
MLDRVLGFVEKEINKGRQAYVICPLIEESEKLDVQNAIDLHSMLTHHYQG
KCQVGLMHGRLSSQEKEEIMGQFSENKVQILVSTTVVEVGVNVPNATVMV
IYDAERFGLSQLHQLRGRVGRGSEQSYCLLIADPKSETGKERMRIMTETN
DGFVLSEKDLELRGPGDFFGSKQSGLPEFKVADMVHDYRALETARQDAAL
LVDSEAFWHNDQYASLRTYLDGTGVFQGEKLD
>BCE33L4153 recJ, single-stranded-DNA-specific exonuclease
MLQPKTRWKEKEYNGERVSELASKLQLSPLVVSLFLGRGLDTEDKILDFL
NTENQEFHDPFLLEGMDRTVERVNKAIQNGEQILIFGDYDADGVSSTTVL
YLALQELGADVEFYIPNRFTEGYGPNEEAFRWAHSAGFSLIITVDTGIAA
VHEAKVAKELGIDLIITDHHEPPPELPEALAIIHPKLDGGVYPFHYLAGV
GVAFKVAHALLGRVPEHLLEIAVIGTVADLVSLHGENRLLVKRGLKHMRM
TKNIGLKALFKVANVSQSEITEESIGFSIAPRINAVGRLEDATPAVHLLL
SEDPEEAKELAEEIDELNKLRKDIVKQITEEAIAEVENNFPPEDNKVLVL
AKEGWNPGVIGIVASKLVERFYRPTIVLCIDPVKETAKGSARSIAGFDLF
ANLSDCRELLPHFGGHPMAAGMTLHMNDVDELRRRLNEQAEVILTEEDFI
PITAVDAFCKVEDVTLAAIEDMQKLAPFGVGNPKPRIAVKDAELESIRAI
GSDGSHLKMALRDGQATLDTIGFGFGAYAKEISPVAKVSVIGEASINEWN
NFKKPQLMVQDIAVEAWQLFDWRSMRNVEANLAELPKEKITMVYFSKEVL
HKFSLEDYKEHMMHASEVTELDEQYIVLLDLPTGTDELRDLFKVGFPSRI
YTLFYQENNHLFSTVPTRDHFKWYYSFLNQKSPFSLRQYGEQLCQHKGWS
KDTVNFMTQVFFELEFVTIKDGVIFMADKKQKRDLIESNTYREKMNHLQL
EKELVYSTYQQLYTWFETIRNHKEVEQLG
>BCE33L3927 recN, DNA repair protein
MNGALLSELSIRNFAIIEALNISFQKGLTVLSGETGAGKSIIIDAISLLV
GGRGSAEFVRYGTEKAEIEGLFYVEDDKHPCIEKAEELDIEIEDGMIILK
RDIAANGKSVCRVNGKLVTLSVLKEIGKTLVDIHGQHETQDLMNEERHLF
MLDHFDGERIVKQLDIYQNVYADYEKLKKQLKSLSENEQQMAHRLDLIQF
QHEEIRKADLKMDEENNLTEERLQISNFEKIYKALGDAYRSLSADGQGLD
NVRSAMGQMESITHLDEVYQENHDSIANSYYLLEEVAYQLREKLDMMEYD
PNRLDEIETRLNEIRMLKRKYGNTVEEILAYADKIEQEIFTIENKDVHIE
TTKKQLKELESVILKEATLLSNMRHELAEHLTNAIHQELKELYMEKTKFE
VRIIKREGNAEEPLVEGAPVRLTADGYDHVEFYISTNPGEPLKPLSKVAS
GGELSRIILALKSIFSKHQGVASVIFDEVDTGVSGRVAQAIAEKIYRVSV
NSQVLCITHLPQVASMADSHLFIRKQVANDRTITSVTVLTMEDKVTEIAR
MISGVEITDLTTEHAKELLTQAHHFKQTAEAIQ
>BCE33L4046 recO, recombination protein O (DNA repair protein O)
MFQKVEGIVIRTTDYGETNKIVTIFSRELGKVSAMARGAKKPKSRLASVS
QLMTHGHFLIQMGSGLGTLQQGEIISTMKEIREDIFLTAYASFIVELTDK
ATEDKKHNPYLFEMLYQTLHYMCEGVDPEVLSLIYQTKMLPVLGMRPYFD
TCAICHQETDFVAFSVREGGFLCSRHAEQDQYRIPVGEAVHKLLRLFYHF
DLHRLGNVSVKDSTKKQMRLVLNTYYDEYCGIYLKSRRFLEQLDKFQI
>BCE33L1366 recQ, ATP-dependent DNA helicase Q
MKLEEYLYRWFGYSEFRPGQKGVITDLLEGKDVIAMLPTGRGKSMCYQLP
GLMQEGTVLVVSPLLSLMEDQVTQLKYVVKNRVIAFNSFRTLQEKREAMK
RLASYKFIFVSPEMLQSELLIRELKKVHISLFVVDEAHCISQWGYDFRPD
YKKLNVVIENIGSPTVLALTATATKDVLRDIAESLNLENVTQHVYSIDRP
NIAMEVQFVETIEEKKEALLDQVMYLQGPGIVYCSSRAWTERLTEYLRGK
GVTGVAFYHGGMEHEERMLIQQQFMNDQLQLVICTSAFGMGVNKSNTRYI
IHFHYPTNIASYLQEIGRAGRDGEPSIAILLCSPLDHDLPISIIEDELPS
QSQIQFLFSLLQERMFQTKELPIEEVEEICYNAARFNEQYWRFIRYHLEQ
LGIIQQRKLILESLSDEIMNRLIAEVEIRLHNKYSELENMKSWIQVKGCR
REYLLQQFGYRKEGELKNCCDYCHITKTDYKKRQAQQSDFDYNWETELQK
LFGLEKMGE
>BCE33L2543 recQ, ATP-dependent DNA helicase Q
MFTKAQELLASYFGYSSFRRGQDETIKNVLDGKDTVCIMPTGGGKSICYQ
IPALVFEGTTLVISPLISLMKDQVDTLVQNGISATYINSSISITEANQRI
QLAKQGHYKLLYVAPERLDSMEFVDQLIDMKIPMIAIDEAHCISQWGHDF
RPSYLHIHRILDYLPEKPLVLALTATATPQVRDDICNTLGINQENTIMTT
FERENLSFSVIKGQDRNAYLADYIRQNQKESGIIYAATRKVVDQLYEDLM
KSGVSVSKYHAGMSDHDRNEQQELFLRDEVSVMVATSAFGMGIDKSNIRY
VIHYQLPKNMESYYQEAGRAGRDGLDSACILLYASQDVQVQRFLIDQSTG
ESRFSNELEKLQNMTDYCHTEQCLQSFILQYFGEEPKEDCGRCGNCTDNR
ESIDVTRESQMVLSCMIRTNQRFGKQMIAQVLTGSKNKKVIEFNFHTLPT
YGLLSNRSVKEVSEFIEFLISDELIAVEHGTYPTLKVTEKGKEVLLGKEN
VLRKERVETRQIVQDHPLFEVLREVRKEIAQGEGVPPFVIFSDQTLKDMC
AKMPQSDSELLTVKGIGEHKLVKYGSHFLQAVQHFIEENPNYAETIKTEV
VSERKKSGKVSANSHLETYEMYKQGIDLDEIAKERGLSRQTIENHLIRCF
EDGMEVDWNSFVPAEYEQLIETAVQNAEGGLKSIKEQLPNEVSYFMIRAY
LQIRK
>BCE33L0021 recR, recombination protein
MHYPEPISKLIDSFMKLPGIGPKTAVRLAFFVLDMKEDDVLGFAKALVNA
KRDLAYCSVCGHITDRDPCYICNDSHRDQSVVCVVQEPKDVIAMEKMKEY
QGVYHVLRGAISPMEGIGPEDINIPQLLKRLHDETVQEVILATNPNIEGE
ATAMYISRLLKPTGIKVTRIAHGLPVGGDLEYADEVTLSKALEGRREV
>pE33L54_0039 res, resolvase
MKFGYMRVSTLDQNLDRQKKQLEEFGCDRIFFEKITGTKRNRPELNSMLE
FLRPEDTVVVTDLTRLSRSTKDLIEITEQISQKGAHLKSLKESWLDTTTA
HGKMLFTIFAGIAQFERDLTSERTKEGIEAARKRGKHPGRPKTDEEKVDY
ALYLINQGMSRTDAAEKAGISRMTLYRKMQQ
>BCE33L4300 rnh, ribonuclease HII
MSNSIVIQTNSTVIEDMKQQYKHSLSPKTPQGGIFMAKVPSCTITAYKSG
KVMFQGGRAEAEAARWQTGSQTPKTAVKKAVDSHRYTPPASIGTMSIVGS
DEVGTGDFFGPMTVVAVYVDAKQIPLLKELGVKDSKNLNDEQITAIAKQL
LHVVPYSSLVLHNEKYNELFDKGNNQGKLKALLHNKAITNLLAKLAPTKP
EGVLIDQFTQPDTYYKYLAKQKQVQRENVYFATKGESVHLAVAAASILAR
YSFVKQFNELSKKAGMPLPKGAGKQVDIAAAKLIQKLGKERLPEFVKLHF
ANTEKAFRLLK
>BCE33L1469 rnhA, ribonuclease HI
MIEVYIDGASKGNPGPSGAGVFIKGVQPAVQLSLPLGTMSNHEAEYHALL
AALKYCTEHNYNIVSFRTDSQLVERAVEKEYAKNKMFAPLLEEALQYIKS
FDLFFIKWIPSSQNKVADELARKAILQN
>BCE33L3596 rnhB, ribonuclease HII (RNase HII)
MQKVTIQEAEHLLQEIMSEEDDRFQILIKDERKGVQKLISKWYKQKELAQ
KEKEKFLEMSKYENALREKGLTYIAGIDEVGRGPLAGPVVTAAVILPEDF
YIPGLNDSKKLSEAKRERFYDEIKAQAIAIGVGIVSPQVIDEINIYQATK
QAMLDAVANLSCTPEYLLIDAMKLPTPIPQTSIIKGDAKSISISAASIIA
KVTRDRMMKELGEKYPAYGFEQHMGYGTKQHLEAIEVHGVLEEHRKSFAP
IKDMIQK
>BCE33L0036 rnmV, ribonuclease M5
MKIKEIIVVEGKDDTVAIKRAVDADTIETNGSAIGDHVIEQVKLALQKRG
VIIFTDPDYPGERIRKIISDKVPGCKHAFLPKEEALAKRKKGVGIEHASN
ESIRRALENIHEEMEAYTSEISWSDLVDAGLVGGEMAKSRRERMGKLLKI
GYTNAKQLHKRLQMFQVSKESFAEAYKQVIQEERK
>BCE33L4165 ruvA, Holliday junction DNA helicase, subunit A
MFEYVTGYVEYVGPEYVVIDHNGIGYQIFTPNPYVFQRSKQEIRVYTYHY
VREDIMALYGFKTREERLLFTKLLGVSGIGPKGALAILASGQTGQVVQAI
EHEDEKFLVKFPGVGKKTARQMILDLKGKLADVVPDAFVDLFSDEERFDE
KKGSSAELDEALEALRALGYAEREVSRVVPELLKESLTTDQYIKKALSLL
LNGKR
>BCE33L4164 ruvB, Holliday junction DNA helicase, subunit B
MSIMDERLLSGESAYEDADLEYSLRPQTLRQYIGQDKAKHNLEVFIEAAK
MREETLDHVLLYGPPGLGKTTLANIIANEMGVNVRTTSGPAIERPGDLAA
VLTSLQPGDVLFIDEIHRLHRSIEEVLYPAMEDFCLDIVIGKGPSARSVR
LDLPPFTLVGATTRAGALSAPLRDRFGVLSRLEYYTVDQLSAIVERTAEV
FEVEIDSLAALEIARRARGTPRIANRLLRRVRDFAQVRGNGTVTMEITQM
ALELLQVDKLGLDHIDHKLLLGIIEKFRGGPVGLETVSATIGEESHTIED
VYEPYLLQIGFLQRTPRGRIVTPLAYEHFGMEMPKV
>BCE33L2122 sbcC, exonuclease
MRPIQLIMTAFGPYKQKEVIDFDDLGEHRIFAISGNTGAGKTTIFDAICY
VLYGEASGEERSDTSMLRSQFADDNVYTSVELTFQLKGKRYEIKRQLGHK
KQGNKTITGHAVELYEVIDEEKVPAVDRFHVTDVNKKVEDLIGLSKHQFS
QIVMLPQGEFRKLLTSETENKEEILRRIFKTDRYKLMRELLDQKRKQWKD
VLQEKQKERELYFRNVFKLPIRDGALLETLVEQEHVNTHQVVEALEQETA
VYKAEVEQLQVEQDVQTKQLKDAETRFHAAKSVNEKFIDLQQKNEKYNTL
QENRTVIEMKETSFKRAEQAKRLLPFEQWHEEAMQNEQKAESLLKQIIAK
KENIMNNFELAQEKYEVVKNKESERENVKKLVQRLEELQPIIASLAEKQL
NLQNAEIQIGKLKESMQNLDRQLEEHTNQKQLMTGELQQLERALEQYVDK
VEELTNMREDAKVLKQAYDVWQEKQKFEKEKEAAYSKMQLAVNAYENMER
RWLSEQAGILALHLHDGESCPVCGSTTHPKKATEQSGAIDENELNGLRDK
KNIAEKLHVQLEEKWNFYHHQYEQVIEEVKKRGYQSEELVETYSALVQKG
KQLATEVNTLKASEETRKQIAVKIKSVEEKVDALQKQKREVETEQHRIEM
DCMQLRTSYEHDKKNIPENLQTVQAWKVQFDQAMHELKLMEDEWKKVQEA
YQHWQNENIRIQAEQEGATNQFESAKLKKEETFTRFMKELEQSGFTDQST
YKEAKLSDAEMEMIQKEIQSYYSSLEVLARQIEELQAELKDKEYMDITAL
GEHIKELEINLDIIKEKRQRAQNAVTYISDLHENIRRIDEQIHEEEKAFQ
ELVDLYEVMKGDNESRISFERYILIEYLEQIVQIANERLRKLSNGQFYLK
RSERVEKRNRQSGLGLDVYDAYTGQTRDVKTLSGGEKFNASLCLALGMAD
VIQAYEGGISIETMFIDEGFGSLDEESLTKAVDALIDLQKSGRFIGVISH
VQELKNAMPAVLEVTKQKDGCSQTRFVVK
>BCE33L2121 sbcD, exonuclease
MKLFHTADWHLGKLVHGVYMTEDQKIVLDQFVQAVEEEKPDAVIIAGDLY
DRAIPPTEAVDLLNDVLQKIVIDLQTPVIAVAGNHDSPDRIHFGSSLMKK
QGLFIVGQFQFPYEPIILKDEHGEVHFHLVPYADPSIVKHVLKNEDVRSH
DDAMRIFMNELSETMDKEARHVFVGHAFVTSAGEAEENTSDAERPLSIGG
AEYVNSHYFDKFHYTALGHLHQAHFVRNETIRYSGSPLAYSISEEKHKKG
YYIVELNEQGEATIEKRLLTPRRKMRTVEAKIDELLQHPVSEDYVFVKLL
DENPVLQPMEKIRSVYPNAMHVERSIQRREFTESNEATVSRHKTDDLSLL
KAFYKEMKGLDLSEEKERLFVEVLQTVQEREGERG
>BCE33L3593 smf, nucleotide-binding protein, Smf family
MKRERLLHLHYLLADHWKAIERLLHVDPELKGIYTFNAKQMECYTGISSK
KSSELVNFLQSSNLPQYISYLEKNRIFYMTIWDKDYPKLLREIQDPPFVL
YGKGEKDFLNKANKLAVVGTREPTLYGYESLKFILHPLLEKEWLIVSGFA
RGIDTMSHEITVRRHCPTIAILGHGLSYIYPRENRYLYEAWNEYILLLTE
YPPHYAPKKWYFPKRNRIISGISKGVLVVEAKSRSGTLITADLALEQNRE
VFALPGPIFTDSASGTNHLIQQGAKLVRNAEDILEEILN
>BCE33L3959 splB, spore photoproduct lyase
MKPFMPKLVYFEPKALEYPLGKELYEKFTKMGLEIRETTSHNQIRNLPGE
NDLQKYRNAKATLVVGVRKTLKFDTSKPSAEYAIPLATGCMGHCHYCYLQ
TTLGSKPYVRVYVNLDEIFEKAQQYMDERAPEITRFEAACTSDIVGIDHL
THALKRAIEFIGESEHGRLRFVTKYSHVDHLLDAKHNGKTRFRFSINSRY
VIKNFEPGTSPFEERIEAARKVAGAGYPLGFIVAPLYMHEGWEQGYRELF
ERLYNALKDLSIPNLTFELIQHRFTKPAKKVIQERYPNTKLEMDEEKRKY
KWGRYGIGKYVYKKDDAEVLEETIRGYIYEFFPDAEIQYFT
>BCE33L5170 ssb, single-stranded DNA-binding protein
MNRVILVGRLTKDPDLRYTPNGVAVATFTLAVNRAFANQQGEREADFINC
VIWRKQAENVANYLKKGSLAGVDGRLQTRNYEGQDGKRVYVTEVLAESVQ
FLEPRNGGGEQRGSFNQQPSGAGFGNQSSNPFGQSSNSGNQGNQGNSGFT
KNDDPFSNVGQPIDISDDDLPF
>BCE33L1967 ssb, single-stranded DNA-binding protein
MMNRVVLIGRLTKEPELYYTKQGVAYARVCVAVNRGFRNSLGEQQVDFIN
CVVWRKSAENVTEYCTKGSLVGITGRIHTRNYEDDQGKRIYITEVVIESI
TFLERRREGASQ
>BCE33L4306 sspI, small, acid-soluble spore protein I
MSFNLRGAVLANVSGNTQDQLQETIVDAIQSGEEKMLPGLGVLFEVIWKN
ADENEKHEMLETLEQGLKK
>BCE33L0035 tatD, TatD related DNase
MLFDTHSHLNAEQFEEDLQEVIARMKEAGVTYTVVVGFDEATIKKAIELA
EAYDFIYAAVGWHPVDAIDMTEEHLAWLEELAAHPKVVALGEMGLDYHWD
KSPKEIQKEVFRKQIALAKKVKLPIIIHNRDATQDIVDILEEENAAEVGG
IMHCFSGSVEVAERCVDMNFLISLGGPVTFKNAKKPKEVATEIPLEKLLI
ETDCPYLTPHPFRGKRNEPSYVKLVAEEIANLKEISYEEVAKITTKNAKA
LFGVE
>pE33L466_0183 tnp, transposase
MINKAYKFRIYPNQAQAILINKTIGCSRFVFNHFLSLWDHAYKETGKGLT
YGTCSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPCF
KSKKNNVQSYTTKQTNENIAVIGNKIKLPKLGLVRFAKSREVKGRILNAT
VRRNPSGRYFVSLLVETEVQELPKTHSYIGIDVGLKDFAILSDGKTYKNP
KFFRLLEDKLAKAQRVLSRRTKGSTRWNKQRVKVARIHEYISNARKDYLD
KISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRTMLEYKAK
WYGKQVIVVSKTFASSQLCSCCGYQNKDVKNLNLREWDCPSCRTHHDRDI
NASINLKNEAIRLLTARTAGLA
>pE33L54_0026 tnp, transposase
MKALVELANIPRSTYYDLVKKMKRPDVDADLKAEIKAIYEENEGRYGYRR
IRDELTNRGQKINHKKVQRIMKELGLKCVVRMKKYKSYKGKVGKIAPNIL
ERNFYADAPNQKWVTDITEFKLFGEKLYVSPVLDLYNGEIITYTIGSRPT
YSLVSEMLEKALERLPENHQLLMHSDQGWHYQMGQYVRTLESRAIVQSMS
RKGNCYDNAVIENFFGVMKSEFLYIKEFECIEHFKIELEKYIDYYNKKRI
KAKLKMSPIQYRAHLDQVA
>pE33L466_0091 tnp, transposase
MIINKAYKFRIYPNQAQAILINKTIGCSRFVFNQFLSLWDHAYKETGKGL
TYGTCSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPR
FKSKKNNAQSYTTKQTNENIAVVGNKIKLPKLGLVRFAKSREVEGRIVNA
TVRRNPSGRYFVSLLVETEVQELPKTHSYIGIDVGLKDFAILSDGKPYEN
PKFFRSLEDKLAKAQRILSRRMKGSSGWNKQRVKVARIHEYISNARKDYL
DKISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRTMLEYKA
KWYGKQVIVVSKTFPSSQLCSCCGYQNKDVKNLNLREWDCPSCRTHHDRD
INASINLKNEAIRLLTARTAGLA
>pE33L466_0034 tnp, transposase, N-terminal region
MPSIQDTIYPRIKHNLSTEDLRSVYTPTRSEIEWTSMKTKGTLQQLALLI
LLKTVQNLGFFTRISDIPPIIIKHIAQSAQLPIPIETEWEAYSKTRTIKR
HYHFIRQYLKIQQFDHNARQIMLDTMKHVAGSKDDPADLINAAIEELIHQ
RYELPVYNTFKEAANEIRHKSYRFIYEQVYESLHEQQLQQIDCLFQTGPD
TFYSPWNRLKEDAKRASLFHLKELILHYDWLIHQKIPVYLLESFSSTKIQ
QMATEAKTLDAYRMSEIEKKKRYTLAISLVSVQTSRILDNIGEMLIKRMM
SIHKKGREFLQDYKKQTQKRTDSLVATLQHVLMAYQTEGIPEERLQAIQQ
VLGDKEDQVLQDCEDHLALSGDNYYLFLQKFFKSHRSTLFKILEVVPLRS
TNQDSSIVEAIQFLLSHQHTRKEWISVAHIKKIGLWKREVTPLLNLSWIP
DGWWRWLSPKKKREVVLEELNRRHFEVCIFSQIMWGLKSGDLYIEGSEKY
ADYRKQLISWEEYEENLEEFCNQLNLPSWGRTSISE
>pE33L466_0308 tnp, transposase, possible orfA ISRSO11-related
MAKFTADEKIQIVLRYLNGNESYREMGRSLGISDTIILNWVNQYKQNGLE
AFLKRCTNYTQQFKLDVLNFMIENGMSLFETAAIFNIPAPSTISVWKKQL
ETQGIDALQSKKKGRPSMKKDSNKQLKQPLAEGSVEALEARIKQLEMENE
YLKKLNALVQNKEKSQNKTKRK
>pE33L466_0180 tnp, transposase
MLINKAYKFRIYPNKAQATLINKTIGCSRFVFNHFLSLWDHAYKETGKGL
TYGTCSVKLPALKKKFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPR
FKSKKNNVQSYTTKQTNENIAVVGNKMKLPKLGLVRFAKSREVKGRIVNA
TVRRNPSGRYFVSLLVETEVQELPKTNSYIGMDVGLKDFAILSDGTTYKN
PKFFRLLEEKLAKAQRVLSRRMKGSSRWNKQRVKVARIHEYMANARKDYL
DKISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRAMLEYKA
KWYGKQVIVVSKTFASSQLCSCCGYQNKDVKNLNLREWECPSCRTHHDRD
INASINLKNEAIRLLTARTAGLA
>pE33L466_0307 tnp, transposase orfB, probable IS150-related
MKALVELATIPRSTYYDLVKKMNRPDVDADLKAEIKAIYEENEGRYGYRR
IRDELTNRGQKVNHKKVQRIMKELGLKCVVRMKKYKSYKGKVGRIAPNIL
ERNFHTDAPNQKWVTDITKFKLFGEKLYVSPVLDLYNGEIITYTIGSRPT
YSLVSDMLEKALERLPETHQLLMHSDQGWHYQMRQYVRTLESRAIVQSMS
RKGNCYDNAVIENFFGIMKSEFLYIKEFENVEHFKIELEKYIDYYNTKRI
KAKLKMSPVQYRTHFYQAA
>pE33L466_0378 tnp, possible transposase fragment
MIRLLSYAISDNHLNKDFQAPKPNEKWETDITYLIFNGQRLYLSAIKNLY
NNEIVGYEIHRRNNLKPVLDALKRKEKTRNVKGIFLHSDQGFSTHPSI
>pE33L466_0430 tnp, IS231 transposase
MSVSVSDELQLFAQEIQSFLSPNTLRNLARDVGFVQRTSKYQAKDLVALC
VWMNQNVATTSLTQLSSCLEASTEVLISPEGLNQRFNQAAVQFLQHILAE
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHT
AGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQ
IDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEI
LFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKS
HRYEKKTVFDILGVVYNCTMSDNQAA
>pE33L466_0392 tnp, probable transposase
MLINKAYKFRIYPNKAQATLINKTIGCSRFVFNHFLSLWDHAYKETGKGL
TYGTCSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPR
FKSKKNNVQSYTTKQTNENIAVVGNKIKLPKLGLIRFAKSREVKGRIVNA
TVRRNPSGRYFVSLLVETEVQELPKTHSYIGIDVGLKDFAILSDGKPYKN
PKFFRSLEAKLAKAQRVLSRRMKGSSRWNKQRVKVARIHEYISNARKDYL
DKISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRSMLEYKA
KWYGKQVIVVSKTFPSSQLCSCCGYQNKDVKNLNLREWECPSCRTHHDRD
INASINLKNEAIKLLTARTAGLA
>pE33L466_0053 tnp, possible transposase fragment
MARKKAVKVLRKQKKRETMQRFTQKQNIGRACLTAKEFRLLQRMSHSSKA
LRNVGLYTMKQIYLNNNRMATVKEVDTAMQADINYPGVQSNSVQAIRRAL
FTEVKSFFKALEQWKKKPEKFTGRPKFPNYSRSTDKRIIEIYQVPKVDDN
GYWIIPMNVAFRKKFGSIKIRMPKNLRNKKISYIEIVPKQKGRFFEVHYT
YEMHVSQMKKPSTTTSNALSCDLGVDRLVSCVTNTGDTFLIDGKKLKSIN
QYFNKTIRNLQQKNMENGLSKRVVTNQMAELWHKREQQINGYISQTVGLL
FKKVKVFNIDTVVVGYNAGWKQESDMGKKNNQKFVQIPFHKLIAAIENKC
VKEGIRFLKQEESYTSKPVFLIKIRFPFGLRMIGRIIALVANESLMVCTK
VKQEHVFMLILMVR
>pE33L466_0060 tnp, transposase
MSISVSDELQLFAQEIQSFLSPNTLRNLARDVGFVQRTSKYQAKDLVALC
VWMNQNVATTSLTQLSSCLEASTEVLISPEGLNQRFNQAAVQFLQHILAE
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHT
AGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQ
IDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEI
LFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKS
HRYEKKTVFDILGVVYNCTMSDNQAA
>pE33L466_0114 tnp, transposase for insertion sequence element IS231
MNMNQKQELSLFAEELYRYMSPATLNQLAIEAGGMKRKRKCHGHHFLSLC
VWLNQQVATTSLTQLCSQLETSTGVLLSPEGLNRRFNSASVAFFRTVFTT
LLQAKIGGVSKISHSLSSYFERIRILDSTTFQVPDRFAAIYPGAGGCSHK
AGVKIEPGKRSDQAYGATRTDMIQKNELYIRDLGYFRLQDFKSIQDKQGY
YLSRLKLPTKIYRKEFETVVFKTKPPQLKPVYTQIHLENIMKQLQPGQVY
ELHDVYVGSKDKLPTRIVVYRCTEEQKQKRLRDQTIREKKKGITYTERTK
LLQGITVYMTNIPTEWVPKEKIYALYSLRWQIELLFKIWKSWFQIHRCKS
IKQERLECHLYGQLISILFCSSTMFKMREFLLRKKQKELSEYKAMYIIKD
YFSLFYQSLHKNTQELSKVLLRLFNLLQHNGQKSHRYEKKTVFDILGVVY
EYTTSNHQVA
>pE33L466_0401 tnp, transposase
MINKAYKFRIYPNQAQAILINKTIGCSRFVFNHFLSLWDHAYKETGKGLT
YGTCSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPRF
KSKKNNVQSYTTKQTNENIVVVGHKIKLPKLGLIRFAKSREVKGRIVNAT
VRRNPSGRYFVSLLVETEVQELPKTHSYIGIDVGLKDFAILSDGKTYKNP
KFFRLLEDKLAKAQRVLSRRTKGSTRWNKQRVKVARIHEYMANARKDYLD
KISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRTMLEYKAK
WYGKQVIVVSKTFASSQLCSCCGYQNKDVKNLNLREWDCPSCRTHHDRDI
NASINLKNEAIRLLTARTAGIA
>pE33L466_0450 tnp, IS231 transposase
MSISVSDELQLFAQEIQSFLSPNTLRNLARDVGFVQRTSKYQAKDLVALC
VWMNQNVATTSLTQLSSCLEASTEVLISPEGLNQRFNQAAVQFLQHILAE
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHT
AGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQ
IDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEI
LFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKS
HRYEKKTVFDILGVVYNCTMSDNQAA
>pE33L466_0247 tnp, transposase
MENTITLFSYGNHSDDKGQEIRKFYQFYVGIDIGASFHVASCIKFDAFLD
PKGIAWKRTKTMKFNSDSTGIAEFLKALKQIEDQFNITRADFLILLEPTG
GHYSYLVQQVLLNEGYQLFQVENKAVGEFRKNNLGISEKSDSMDAKVMSY
MGWHKQLHPHMQGVTLIKPQTVLQSLFRTVMRDRWYLNVQLTRRKNQVQQ
LLKVTHPDLNKAFKKLGTSSVMKLVLEYPTGLHMKKSSEDELYKAISKAG
AKNVAKRAAKTLSEIMPYTVAVPVEHLVGRQKWVIEEALRLEESIQLIDV
EIHSLLRGDLEKGIEPHPYTELLLSFPFVSENIACTLIGVIGDIDRFNTY
KEFKKYLGVSAENSQSGTSVRSTKQTYSGVRDARRVLYQMSMMMLANGKR
KPTVFKAYYDRKVEEGMIKKKAIGHLCGKIANLIYTILKSSQKYDPKIHA
AACGIEWDEMYKLEKQQETLPLN
>pE33L466_0334 tnp, transposase
MINKAYKFRIYPNQAQAILINKTIGCSRFVFNHFLSLWDHAYKETGKGLT
YGTCSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPRF
KSKKNNVQSYTTKQTNENIAVVGNKIKLPKLGLIRFAKSREVKGRIVNAT
VRRNPSGRYFVSLLVETEVQELPKTHSYIGIDVGLKDFAILSDGKTYKNP
KFFRLLEDKLAKAQHVLSRRRKGSTRWNKQRVKVARIHEYMANARKDYLD
KISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRTMLEYKAK
WYGKQVIVVSKTFASSQLCSCCGYQNKDVKNLNLREWDCPSCRTHHDRDI
NASINLKNEAIRLLTARTAGIA
>pE33L466_0328 tnp, transposase
MIRSVIMKYNLRNMVSYLCKIAGVSRSGYYNYFSISSQEQRKQKSDRDEI
LKETILKALRFRNRKKGARQIKMTLAGQFQVVYNLKRIRRIMKKYEIVCP
VRKANPYKRMLKATKEHRIVPNQLNREFKQNTPGKTLLTDITYLVYGKNQ
RAYLSTILDGSTNEILAYHVSEQMTLELVTTTLHKLKRNPRIRLIEGAYI
HSDQGAHYTSPTYQKLVKKLNLGQSMSRRGNCWDNAPQESFFGHLKDEAH
IKPCASFNELKQEIKKYMTYYNHYRYQWNLKKMTPVGYRNHLLDVA
>pE33L466_0108 tnp, transposase for insertion sequence element IS231
MSISVSDELQLFAQEIQSFLSPNTLRNLARDVGFVQRTSKYQAKDLVALC
VWMNQNVATTSLTQLSSCLEASTEVLISPEGLNQRFNQAAVQFLQHILAE
LLNQKLVSSMPISSPYTSIFKRIRILDSTAFQLPDPFSFVYPGAGGCSHT
AGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVTANDLCIRDLG
YFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEYIQ
IDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQDQ
AVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWQIEI
LFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLLMK
KKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGRKS
HRYEKKTVFDILGVVYNCTMSDNQAA
>pE33L466_0437 tnp, conserved hypothetical protein
MIIIPLLDPKGITMKNMFLFHFFLNINLSKIILIIPLRKVVKTMARKKAV
KVLRKQKKRETMQRFTQKQYIGRVCLTAKEFRLLQRMSHSSKALRNVGLY
TIKQSYLNDNKMATVKEVDTAMQADINYPGVQSNSVQAIRRALYAEVKSF
FKALEQWKKNPEKFTGRPKFPNYSRSTDKRIIAIYQVPKVDENGYWMVPM
SVAFRKKFGSIQIRMPKNLRNKKNLLH
>pE33L466_0072 tnp, transposase
MQNEHPPLTAYSEEQRREAMVKYKIIAPYLTDEKTLTVIIEETGIAKRTL
QYWIQDYKQFGLKGLIRKTRSDAGKTHLEPEVVVSIEQLILKYKRNSLTS
IHRMICEQCQKKGWEKPSYYQVHKVSQSLSPSLKKLAHDGQKAYNNQYDL
IHRREASYPNEIWQADHTPLDIIVLNEKGKPERPWLSIILDDYSRAVAGY
FLTFENPSAIHTSLVLHQAIWKKSNPDWQICGIPEIFYTDHGSDFTSNHL
EQVAVDLKINLVFSTIGVPRGRGKIERFFLTINQLFLQDLPGYLGNQTST
SLLTIKELDEKLSNFIISNYHHRVHSTTKKEPIQAWNNSGFLPNVPESLE
SLDLLLLNVAKPRKVHSDGIHFQGLRYIDTNLAAYIGETVIIRYDPRDIA
EIRVFYQDKYLCTAISPEISNYTVDLKEIVSARNKIRRSLKKQLDSGKAI
VEEIALSKKKDLEDKTNEPTKKSKLKRYFNE
>pE33L466_0410 tnp, IS231 transposase
MAFFMSVSNELQQFAQEIQSALSPNVLRDLARDVGFVQRTSKYQAKDLVA
LCVWVSQNVAMTSLTQLSSCLEASTEVLISPEGLNQRFNKAAVQFLQHLL
AELLNQKLTSSMPISYPYTSVFKRIRILDSTAFQLPDTFSSIYPGAGGCS
HTAGVKIQLEYDLLSGQFLHIHTGPGKQHDRTYGSLCVPTVAANDLCIRD
LGYFHLKDLQHIQDQKAYYISRIKSNTRIYQKNPNPDYFRDGRIKKGTEY
IQIDMEVLMNSLQPGQTYEISDAYVGMTDKVPTRVIVHRLTKEQQQKRLD
DQTVREKKKGMKYSPRSKRLSGINVYMTNTPIDIVPMGQVHDWYSLRWQI
EILFKTWKSFFHIHHCKKIKRERLECHLYGKLIAILLCSSTMFQMRQLLL
MKKKRELSEYKAIYMIKDYFFLLFQSIQKDTPELSKVLLRLFNLLQQNGR
KSHRYEKKTVFDILGVVYNCTMSDNQAA
>pE33L466_0359 tnp, IS231-related transposase
MNLSIQDEFYLFAKELKRYLSPHVLQQLAQKTGFVKRKSKYGARDLTALC
IWINQHVASDSLTQLCSQLYANTATLISPEGLNQRFNPAAVVFLREVFTS
LLTQKLCSSHSLSAYIISTFNRIRILDATVFQLPNQFATDYQGSGGSSNK
AGVKIQLEYDLLSGQFLNVQLGPGKNNDKTYGTFCLKTIDAGDLCLRDLG
YFDLGDLQAIHDKEAYYISRLKLNTRIYTKNPEPEYFNNGTLKKQTEYIQ
IDMTQILSGLIPGETMEIPEAYIGQNQKLPARVIIHRLTDDQKQTRLKNQ
AIREKKKGIVMKDKSKRLMGMNVYITNTSLEEVPTDYVHSLYSLRWQIEI
LFKTWKSFFEIDECQNIKRERLECHFYGQLIGILLCSSTMFQMRQFLLKK
KKQELSEYKAIYMIKDYFPLLFQAIAVGTEELLKILHCLYQSLKKNGRKC
HRYKKMTVFDILGIVYKTTVKHRQAA
>pE33L466_0129 tnp, possible transposase, IS605 family
MIIIPLLDPKGITMKNMFLFHFFLNINLSKIILIIPLRKVVKTMVRKKAV
KVLRKQKKRETMQRFTQKQNIGRACLTAKEFRLLQRMSHSSKALRNVGLY
TMKQSYLNVNKVATVKEVDNAMQADMNYWGVQSNSVQAIRRALFTEVKSF
FKALEQWKKNPEKFTGRPKFPNYSRSTDKRIIEIYQVPKVDENGYWMIPM
NVAFRKKFGSIKIRMPKNLINKKISYIEIVPKQKGRFFEVHYTYEMHVSQ
MKKQSTTTSNALSCDLGVDRLVSCVTNTGDAFLNDGKKLKSINQYFNKTI
RNLKWENMENELSKRVVTNKMATLWHKREKQINGYISQTVGLLFKKVKAF
NIDTIVVGYNAGWKQKSDMGKKNNQTFVQIPFHKLMAAIENKCIKEGIRF
LKQEESYTSKSSFLDKDPVPVWSKDDRTHYRFSGKRISRGLYQSKAGTCI
HADINGALNTLGKSKVVELDDNLKVKTPILLEVQKRKAVASRIA
>pE33L466_0040 tnp, transposase, fragment
MACLLFSFTSFETRVMTYLVIYLVSFMAMRQDPIRQTYEIKEAYIGKDQK
LFTRVIIYRLTEKQIQERRKKQNYTESKKGITYSEKSKRLTGINIYVTNT
PWEIVPMEQIHDFYSLRWQIEITFKTWKSLFQIHHWHNIK
>pE33L54_0029 tnp, possible transposase fragment
MYVQFPRNDMVTPHSKRLSGINVYMINTSTDIVPMGQVHDWYSLRWQIKI
LFKTWKLFFHIHHCRKIKQERLEYYLYGLSLIAIWLNPK
>pE33L466_0445 tnp, transposase
MINKAYKFRIYPNQAQAILINKTIGCSRFVFNHFHSLWDHAYKETGKGLT
YGTCSAKLPAMKKEFVWLKEVDSIAIQSSVRNLADAYTRFFKKQNSAPRF
KSKKNNVQSYTTKQTNENIAVVGNKIKLPKLGLIRFAKSREVKGRIVNAT
VRRNPSGRYFVSLLVETEVQELPKTHSYIGIDVGLKDFAILSDGKTYKNP
KFFRLLEDKLAKAQRVLSRRRKGSTRWNKQRVKVARIHEYMANARKDYLD
KISTEIIKNHDVIGIEDLQVSNMLKNHKLAKAISEVSWSQFRTMLEYKAK
WYGKQVIVVSKTFASSQLCSCCGYQNKDVKNLNLREWDCPSCRTHHDRDI
NASINLKNEAIRLLTARTAGIA
>pE33L466_0287 tnp, transposase
MTNNKRERRTFTAEFKQQMVQLYQNGEPRKDIIKEYGLTPSSLDRWINQN
HTSGSFKEKDNKTAEQLELEALRKQNKQLLMENDILKQAALMLGQK
>pE33L466_0062 tnp, transposase
MGHHTSLSTSDKFISIIHRKRYSFCRNIQKGWRFCMNLSIQDEFHLFAEE
LQRYLSPHILQQLAQETGFVKRKSKYGARDLAALCIWISQHVASDSLTRL
CSQLYANTATLMSPEGLNQRFNPAAVTFLREVFTSLLTQKLCSNHSLSAH
IISTFNRIRILDATVFQLPDQFATDYQGSGGSSNKAGVKIQLEYDLLSGQ
FLNVQLGPGKNNDKTYGTICLETVEAGDLCLRDLGYFDLGDLQTIHDKEA
YYISRLKLNTRIYIKNPEPEYFNNGTLKKQTEYIQLDMTQMMSGLMPGET
MEIPEAYIGQNQKLPARVIIHRLTDEQTQTRLKNQAIREKKKGIVMKDKS
KRLMGMNVYITNTSFEEVPTNYVHSLYSLRWQIEILFKTWKSFFEIDECQ
NIKRERLECHLYGQLIGILLCSSTMFQMRQFLLEKKKQELSEYKAIYMIK
DYFPLLFQAIAVGTEELLKVLHRLYQLLKKNGRKCHRHKKMTVFDILGIV
YKTTVKHRQTA
>pE33L466_0070 tnp, transposase
MGYFHLKDLQHIQDKKAYYISRIKSNTRIYQRNPNPDYFQDGRIKKCTEY
IQIDMEVLMNSLQPGQTCEISNAYVGMTDKVPTRVIVHRLTKEQQQKRLQ
DQAVREKKKGMKYSPRSKRLSGINVYMTNTSADIVPMEQVHDWYSLRWRI
EILFKTWKSFFHIHHCKKIKRERLECHLYGQLIAILLCSSTMFQMRQLLL
MKKKRELSEYKAIYMIKDYFLLLFQAIQKDTQGLSKILIRLFNLLQQNGR
KSHRYEKKTVFDILGVVYNCTMSDNQAA
>pE33L466_0434 tnp, possible IS231 transposase, fragment
MAHHKKMKKKMPKTVRKVNLLVTNTSSEKLPATEVYIFYSLRWQVEILFK
IWKSIFRIHISKRMELEQFQCHLYGQRLRLCLVASVTYQMRRLL
>pE33L54_0024 tnp, transposase
MDLSISDELELFSKELQRYISPHVLEQLAREIGFVQRKSKYRAQDLVALC
VWLSQNIAHTSLTQLCSRLETNTGISMSPEGLNQRFNSQAVHFLQQILTY
LLHQQLCTSSKISTLYTNYFRSIRVLDSTHFQIPDKFASTYQGSGGSGHS
AGVKIQLEYDLLSGQFLHVHVGSGKHNDKTYGSTCLTSLQRHDVCIRDLG
YFDLRDLHTIDECGAYYISRLKLNTRIYQKNREPEYFQNGTIKKHSEYVQ
LDMEQFIDQLQSGETYEIPEIYIGMYQKLPARLILYKLTETQMKRRLKDL
ASKEHKKQITYKERSKRLSAINFYITNIPSEYLPREQVYDFYSLRWQIEL
IFKTWKSFFRIHHCNSVKLERLECHLYGQLISILLCSSTMFQMRQLLLTK
KKQELSEYKAVYIIKDYFLLLYQALQKDTQEISKILLRLFNFLQKNGRKS
HRYEKKTVFDILGVIYNCFVTHNHVA
>pE33L54_0025 tnp, possible transposase fragment
MNGLLLNLRLMGMWRYPGNESYREMGRAIGISGTIILNWVNQYKQNGVEA
FLKRCTNYTQQFKLDVLNYMIENGMSLFETAAIFNIPAPSI
>pE33L466_0438 tnp, possible transposase, C-terminal region
MHYTYEMHVSQMKKQSTTTSNALSCDLGVDRLVSCVTNTGDTFLIDGKKL
KSINQYFNKMIRNLQLKNVENGLSKRVVTNKMAALWHKRERQINGYISQA
VGLLFKKVKEFDIDTIVVGYNTGWKQKSDMGKKNNQTFVQIPFHKLMAAI
ENKCVKEGIRFLKQEESYTSKASFLDKDPVPVWSKDDRPQYRFSGKRITR
GLYQSKAGTCIHADINGALNTLQKSRVVELDDNLTVKTPILLEVQKRKAV
ASHIA
>pE33L466_0288 tnp, transposase
MKNNLHKYSISAMYNVLQLSRATYYYEAKQKQDTGDELSPLVKEIFRESR
QNYGIRKIRVELQKLGYVISRRRIGRIMKNPGLISNYTIAQFKPKRVSCN
EENISNKLDRKFNQKEELAVVVSDLTYVRVNKKWNYVCLLVDLFNREIIG
HSVGEKKDAELVSQAFATVKTNLNRIILFHTDRGNEFKNKLIHDTLEAFK
IKRSLSAKGCPYDNAVAEATYKIFKTEFIRNRHFTSLEKLTLELNDYVNW
FNNVRIHGTLDYLSPVQYKQEHLKKIV
>pE33L466_0404 tnp, transposase
MPKLGFVRFVKSREVKGRIVNATVRRNPSGRYFVSLLVETEVQELPKTNS
YIGIDVGLKDFAILSDGTTYKNPQFFRSLEEKLAKVQRVLSRRTKGFSSW
NKQRVKVARIHEYMANARKDYLDKISTEIIKNHDVIGIEDLQVSNMLKNH
KLAKAISEVSWSQFRAMLEYKAKWYGKQVIVVSKTFASSQLCSCCGYQNK
DVKNLNLRKWDCPSCRTHHDKDINASMNLKNEAIRLLTARTAGGVQAPYL
GAYKQIDLP
>pE33L466_0272 tnp, transposase
MTKHNKAYKFRLYPTEEQAYLMRKTFGCVRFVYNRMLAERKEAYEKYKDD
KEQLKKQKPPTPAKYKAEFEWLKEVDSLALANAQLNLQTAYKNFFRGQND
FPTFKRKKDRKSYTTNVVNGNIMLLNRHIKLPKLKMVRIKQHREIPQDHV
IKSCTISMTPTGKYYVSILTEYEKEIVQKEVETVVGLDFAMDRLYVSSED
ERANYPKFYREMLDRLAKAQRVLSRRTKGSIRCNKQRIRVAKLHEKVANQ
RKNFLHHKSKELATHFDVVAIEDLNMKGMSQALHFGKSVADNGWGMFTSF
LAYKLHEQGKQLVKIDKWFPSTKTCSSCGNVKNISLSERVYSCICGVNLD
RDYNAAINIKNEAIRLLALA
>pE33L466_0195 tnp, transposase
MAKFTANEKIQIVLRYLNGNESYREIWKAIGIRDTIILNWVNQYKQNGVE
AFLKRYTKYTQQFKLDVLNFMIENDMSLFETAAIFNIPAPSTISVWKKQL
KTQGIDALQSKKKGRSSMKKDLNKQLKQALAEGSIEALEARIQQLEMENE
YLKKLNALVQNKEKSRNETKRK
>pE33L54_0012 tnpA, possible transposase fragment
MLVLPLINAITVWNTVYLTEATKILKEKGLLKEELLPHISPLGWEHINLL
GEYSFDSKKVPKSNELRPLKI
>pE33L54_0016 tnpA, Tn5044 transposase fragment
MKYLSDEELRSTIQAATNKSESFNGFTKWLFFGGDGIIAENHRERQRKII
KYNHLIANCVIFYNVFQLTRILHEYIQEGNELDEEVLSDLSPYLTFHINR
FGKYGLDENRQPPDIQFDMAISPNGLKAAN
>pE33L466_0069 tnpA, transposase
MRGKELLTPVQREELLHISVETEHELALHYTFSTEDLEIINQHRRDHNRL
GFAVQLCILRYPGCTVTNMPTIPEGLLKFVAKQISVDHTVYEAYAKREPT
RREHLEEIRKEYGYRNFTIRDYRRISKFLQPYALENGNTMYLIQTALQEL
RKEKIILPAIPTIERAVWEVRKRTEEKIFKVLTSSLTSSQQEKLDRLLHP
MPNTSKTYLSWLREIPGQFSPDAFLKVIERLDYIRRLTLKLDTKGLHPNR
IRQLSRIGARYEPFSFRRFHSTKKYAIMIAYLIDLTQDLVDQAFEIHDKQ
IMNLQLKGRKQQEEIQKQNGKSVNEKINHYANLGTALIKARQENLDPFLL
LETVMPWDTFVASVEEAKKLSRPMNYDYLDLLESRYNYLRKYTPTLLRTL
EFQSTKYANPVLLALDTIHELNEAGKRKVPEGAPLSFVSKRWERYVYDED
GSINRHFYELAAFTELRNYVRSGDISIVGSRQHKDFDEYLIPQQEWTKSK
NIGTRLAVPIQVEEYIKERTETLLHRIQSFSKNVNSLEGVDLEKGILRIH
RLERDVPEDAKKLSAKLYNMLPRIKLTDLLLEVSNWTNFEQQLIHASTNK
APKGDEIIISLAAMMAMGTNIGLTKMADATPGISYHQLAHASQWRMYDDA
FQRAQSILVNFQHKIPLSSYWGDGTTSSSDGMRVQIGVSSLSASFNPHYG
TGKGATIYRFVSDQFSSFYTKVINTNARDAVHVIDGLLHHESDLVIEEHY
TDTAGYTDQVFGLAHLLGFRFAPRLRDLATSKLYTIGSPKEFSNIESLIR
GQINMKLICDNYDDVLRLAHSIREGKVSSALIMGKLGSYTRQNKVAKALR
EIGRIEKTIFILDYLSDKTMRRRIQRGLNKGEAMNALARAIFFGKHGELR
ERALQDQLQRSSALNLLINAISVWNTVYLSEAINVLKRKEKFDEELLKHI
SPLGWEHINFLGEYRFSKKEIAPLDSLRPLQIT
>pE33L466_0042 tnpB, transposase
MLKAYKYRIYPNKEQRTFFSKTFGCVRFVYNKMLADRINSYHESQTSIDK
SIKYPTPAQYKKEFPFLKEVDSLALANAQMNLNKAYAHFFRDKSIGFPKF
KNKKENRCSYTTNNQKGTVCIENGYMKLPKLRTTVRIKLHRPFFGLIKSV
TICKTTTNKYFASVLVQEKEQLFPELKTRVGIVMGLKGFATLSSGKKYET
PKWVRKTENRLAFLQKSLLRKKKGSKNQNKIRLQIARLHEKIANQCNDFL
HKISNEITNENQVIVMEDLKMKDMQNKHKLAKSISEASWANFREYQTYKA
MWKGRDLIITPKNYANDQLCSCCGYKNKDLKKINLYEWDCPMCNSHHDRD
VNASINLLKLTL
>pE33L466_0037 tnpI, integrase-recombinase protein
MKVLRVKKSDIELGEFERFLFEQGKRPNTVHDYSRHICNFHRWLVTEGSS
IHDITRYDIQQYINFLSIQGNQATTITPKYSAIVAYMKFVGKEKLLNHIK
RLEVRHIRNISPKSLSKKQRNQLLREVEKSQNLRNIAIVYTMLYTGVRVF
ELVALNRDDVEMKERSGFIIIRDGKGGISRKIPLPAESRYHLQNYLQKRT
DLEVPLFLSNFRKRLSKRSAQRIFEQYGIGAHMLRHTYGRELVASGIDLA
TVADLMGHNDVNTTKRYAAPSMSDLEQAVEKIFNS
>pE33L466_0073 tnpR, resolvase
MLIGYARVSTGLQNLDLQTDALTQYGCKKIFHDKMSGTKKQRPGLEEAIH
YAREGDTIVVWRLDRLGRNMQDLIQIVNNLNKRGIGFHSLQENLTMDKSN
ATGQLMFHLFAAFAEFERNLIEERSAAGRAAARARGRLGGRPEKFGLKDI
EMMKSLIESGTPIKDVAEKWGVSRTTIYRYLERQ
>pE33L466_0068 tnpR, resolvase
MIFGYARVSTEEQNLDMQIDALQQYGVERLYQEKMTGIKKERPQLEELLK
VLRKGDKIVVYKLDRISRSTKHLIELSEKFKELGVDFISIHDNIDTSNAM
GKFFFRMMASIAELERDIISERTKTGLNAARARGKKGGRPQKHTDKVEMA
LKMYQSKEYSIKQITEATGLSKTTLYRYINK
>BCE33L3592 topA, DNA topoisomerase I
MSDYLVIVESPSKAKTIEKYLGKKYKVVASMGHVRDLPKSQMGIEVKNNF
TPKYITIRGKGPVLKDLKSAAKKAKKVYLAADPDREGEAIAWHLANTLNV
DVESDCRVVFNEITKDAIKESFKHPRAINMDLVDAQQARRILDRLVGYNI
SPLLWKKVKKGLSAGRVQSVAVRLIIEREREIQSFEPEEFWTIKTEFVKG
KDTFEASFYGVDGEKVQLTNETQVNEIIEQLKDNAFSVENVTRKERKRNP
ALPFTTSSLQQEAARKLNMRAKKTMMLAQQLYEGIDLGKQGTVGLITYMR
TDSTRISETAQTEARTYITEAYGTEYIGAEKKKETKKSNAQDAHEAIRPT
SVMRKPEELKSFLSRDQLRLYKLIWERFVASQMASAIMDTVTARLINNNV
QFRASGSVVKFPGFMKVYVESKDDGAEEKDKMLPPLEVGETVFSKDLEPK
QHFTQPPPRYTEARLVRTLEELGIGRPSTYVPTLETIQKRGYVGLDNKRF
VPTELGEIVIELILEFFPEIINIEFTANMEQSLDEVEEGNANWVKIVDDF
YVGFEPRLEKAEKEMREVEIKDEPAGEDCELCNHPMVFKMGKYGKFMACS
NFPDCRNTKPIVKEIGVTCPKCDKGQIIERRSNKKKRLFYGCGTYPECDF
VSWDKPIGRKCPKCEGMLVEKKLKKGVQVQCISCDYEEEQQM
>BCE33L0347 topB, DNA topoisomerase (DNA topoisomerase I)
MAKSVVIAEKPSVARDIARVLKCDKKGNGYLEGSKYIVTWALGHLVTLAD
PESYDVKYKKWNLEDLPMLPERLKLTVIKQTGKQFNAVKSQLLRKDVNEI
IVATDAGREGELVARWIIDKVRINKPIKRLWISSVTDKAIKDGFANLKPG
KAYDNLYASAVARSEADWYIGLNATRALTTRFNAQLNCGRVQTPTVAMIA
NREDEIKNFKAQTYYGIEAQTTNQLKLTWQDANGNSRSFNKEKIDGIVKG
LDKHNATVLEIDKKQKKSFSPGLYDLTELQRDANKKFGYSAKETLNIMQK
LYEQHKVLTYPRTDSRYISSDIVGTLPERLKACGVGEYRPLAHKVLQKPI
KANKSFVDDSKVSDHHAIIPTEGYVNFSAFTDKERKIYDLVVKRFLAVLF
PAFEYEQLTLRTKIGNETFIARGKTILHAGWKEVYENRFEDDDVTDEVKE
QLLPRIEKGDTLTVKLIMQTSGQTKAPARFNEATLLSAMENPTKYMDTQN
KQLADTLKSTGGLGTVATRADIIDKLFNSFLIEKRGKDIHITSKGRQLLD
LVPEELKSPTLTGEWEQKLEAIAKGKLKKEVFISEMKNYTKEIVSEIKSS
DKKYKHDNISTKSCPDCGKPMLEVNGKKGKMLVCQDRECGHRKNVSRTTN
ARCPQCKKKLELRGEGAGQIFACKCGYREKLSTFQERRKKESGNKADKRD
VQKYMKQQKKEEEPLNNPFAEALKKLKFD
>BCE33L1725 topB, DNA topoisomerase I
MKLIIAEKPDQGLALVSQFKYRRKDGYLEVEANELFPSGAYCTWAIGHLT
QLCNPEHYHAEWKKWSLDTLPMIPERFQFEVTKSKYKQFNVVKQLLHNPQ
VTEIIHAGDAGREGELIVRNIINLCNVQKPMKRLWISSLTKQAIYQGFKN
LLDESDTINTYYEAYTRSCADWVVGMNASRVFSILLKKKGMNDVFSAGRV
QTPTLALIVKREKEIENFKSEPFWEVFATFNIEGKKYDGKWEKDNESRLQ
DPDMANKIAAFCQGKPAVVKEMKTERKEFQPPLLFNLSSLQATANKAFKF
SPKKTLDITQALYQKGIVSYPRSDSNYVTQGEAATFPDILQKLSQFDEYK
GLLPAPVESIMNNKRYVNEKKVTDHYAIIPTEQVTNPSRLSGDEKKIYDM
IVRRLIAAHYEVAIFDYTTIVTLVDERAEFISKGKQQIQEGWRKVIFQDD
KDDETILPIVAEGEEGKVVKVKVKEGKTQPPKRYTEGQLITLMKTAGKYL
ENEELEKVLKKTEGLGTEATRAGIITMLKDRKYIDVKKNQVYATDKGKVL
ITAIGDKILASPEMTAKWEQRLAEIGEGTASPATFMEQTKKLSAKIIEDA
VEMSEKWDFTGLHVESIERKGSKFTTGKKVGSCKKCDGDVIDKSTFYGCS
NYNTTQCDFTISKKILSKTISQKNMTKLLKGEKTDLIKGFKKGEKTFDAK
LEWKDNKINFVFEN
>BCE33L1996 tpn, transposase, N-terminal fragment
MQVAKGSKLPRNPVFHAYYQKKLKEGKTKGQALVCIMRRLVNIIYGMMKY
KTAYELPIVEEKEVV
>BCE33L1995 tpn, transposase, IS110 family
MHNRQNYLYVGVDLHKEHHTAVIINCWQEKLGEIQFENKPSAFSKFLLEV
ETYVSAGVSVVFGLEDVGGYGRALAKYLVDHEQVVKEVNPVLSFLERKSH
VTTQKSDSWDAECVARILINKLNQLPDAKPNDLFWSIQQLVSRRNALVKA
QSALKNQLHIQLNHHYPSYKKFFSELDGKTALAFWQQYPSPSCLEGTNIK
QLTAFLLDVSNNTCSVKKASDILKLVKEDGHTMKEYQETRDFLVRSIVRD
IEFKKKEMKYIERELKQLVNLLDYQLETMPGIELVTASALIAEIGDVRRF
SNANKLARFAGIAPVYFGSGGKGKTHKSKQGTTRFIL
>BCE33L1412 ung, uracil-DNA glycosylase family protein; possible DNA polymerase, bacteriophage-type
MQYPDHLVKQVKERSAPYQLEGFLSGQGPENPKFMLLGEAPGETEIHNGI
PFSGRAGKQLMGFLERIHVTREEVYITSAVRSRPYKWREKKERNGSIIQK
KYNRTPNQGEIVAHAPLLDYELEKLNPKLIVTLGNIGLQRLTGKGKKITD
VHGQLLKQPIQKLKDMQSAEFTWSEKEYHIFPTFHPASIFYNRSLLELIY
EDLEKLKQYVIKN
>BCE33L5096 ung, uracil-DNA glycosylase
MEKVLKNDWGPLLAPEFEKEYYRKLADFLKEEYSTHVVYPKKEDIFNALE
YTSYENTKVVILGQDPYHGPNQAHGLSFSVQPGIKTPPSLLNMYKELRDE
YGYDIPNNGYLVKWAEQGVLLLNTVLTVRQGEANSHKGKGWEHFTDRVIE
LLNEREKPVIFILWGRHAQAKKKLITNTKHHIIESVHPSPLSARRGFFGS
KPYSKVNTILANMGEREIDWEIPNL
>BCE33L4859 uvrA, UvrABC system protein A (excinuclease ABC, subunit A)
MSKSKDFIVVKGARAHNLKNIDVTIPRNQLVVVTGLSGSGKSSLAFDTIY
AEGQRRYVESLSAYARQFLGQMDKPDVDTIEGLSPAISIDQKTTSRNPRS
TVGTVTEIYDYLRLLFARIGTPICPNHGIEITSQTVEQMVDRVLEYPERT
KLQVLAPIVSGRKGAHVKVLEDIKKQGYVRVRVDGEMLDVSEDIALDKNK
KHSIEVVIDRIVVKEGIASRLADSLESALKLGGGRVLIDVMGEEELLFSE
HHACPHCGFSIGELEPRMFSFNSPFGACPSCDGLGSKLEVDLELVIPNWD
LSLNEHAIAPWEPTSSQYYPQLLQSVCNHYGVDMDVPVKNIPKDLFDKVL
YGSGEEKVYFRYVNEFGQVKENEILFEGVIPNIERRYRETSSDYIREQME
KYMAEQACPKCKGGRLKPESLAVFVGGKTIADVTKYSVQEVQEFFSNVEL
TEKQQKIAHLILREIKERVGFLVNVGLDYLTLSRAAGTLSGGEAQRIRLA
TQIGSRLTGVLYILDEPSIGLHQRDNDRLIRTLQEMRDLGNTLIVVEHDE
DTMMAADYLLDIGPGAGIHGGQVVSAGTPAEVMQDENSLTGKYLSGKEFI
PVPLERRKGDGRKVEIVGAKENNLKNAKMSFPLGTFVAVTGVSGSGKSTM
INEVLYKSLAQKLYKAKAKPGTHKEIKGLEHLDKVIDIDQSPIGRTPRSN
PATYTGVFDDIRDVFAQTNEAKVRGYQKGRFSFNVKGGRCEACRGDGIIK
IEMHFLPDVYVPCEVCHGKRYNRETLEVKYKDKNISEVLGMTIEDGVEFF
ANIPKIKRKLQTLVDVGLGYMKLGQPATTLSGGEAQRVKLASELHRRSTG
RTLYILDEPTTGLHAHDIARLLEVLQRLVESGETVLVIEHNLDVIKTADY
IVDLGPEGGDKGGQIVASGTPEQVVKEERSYTGKYLKEILNRDKARMKEK
IKEVELSQ
>BCE33L4860 uvrB, UvrABC system protein B (excinuclease ABC, subunit B)
MKRQFEIVSAYSPQGDQPVAIEKLVEGINSGKKKQVLLGATGTGKTFTIS
NVIKEVQKPTLVMAHNKTLAGQLYSELKDFFPNNAVEYFVSYYDYYQPEA
YVPQTDTFIEKDAQINDEIDKLRHSATSALFERDDVIIVASVSCIYGLGS
PEEYRELVVSLRVGMEKDRNQLLRELVDVQYGRNDIDFKRGTFRVRGDVV
EIFPASLDEHCIRIEFFGDEIDRIREVNALTGEVLAERDHVAIFPASHFV
TREEKMKVAIENIEKELEERLKELNDNGKLLEAQRIEQRTRYDLEMMREM
GFCSGIENYSRHLTLRPAGATPYTLLDYFPEDFLIVMDESHVSVPQVRAM
YNGDQARKQVLVDHGFRLPSALDNRPLTFDEFEEKTNQVIYVSATPGPYE
LEQSPEVIEQIIRPTGLLDPPIDIRPIEGQIDDLLGEIQDRIAKNERVLI
TTLTKKMSEDLTDYLKDVGIKVNYLHSEVKTLERIEIIRDLRLGKFDVLV
GINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARNENGRVI
MYADRITRSMGIAIEETQRRRSIQEAYNEEYGITPKTIQKGVRDVIRATT
AAEEPETYEATPAKKMTKKEREKTIAKMEAEMKEAAKALDFERAAELRDL
LLELKAEG
>BCE33L4267 uvrC, excinuclease ABC, subunit C
MHEHLKEKLAILPDQPGCYLMKDKQGTVIYVGKAKVLKNRVRSYFTGSHD
GKTLRLVGEIVDFEYIVTSSNLEALILELNLIKKHDPKYNIQLKDDKTYP
FIKITAEKQPRLLITRNVKKDKGKYFGPYPNAQSAHETKKLLDRMYPLRK
CSNMPDKVCLYYHMGQCLAPCVKEVTEEQNKEIVDEIIKFLNGGHKEVRS
ELETKMYEASEKLEFERAKELRDQIAHIDAIMEKQKMIMSDLVDRDVFGY
AVDKGWMCVQVFFVRKGKLIERDVSMFPIYDEPEEGFLTFIGQFYENSSH
FKPKEIVVPGSIDSELVERFLEVEATQPKRGKKKDLVELANKNAKIALEE
KFYLIERDEERTIKAVENLGKQLGIETPYRIEAFDNSNIQGTNPVSAMIA
FIDGKPAKKEYRKYKIKTVQGPDDYESMREVVRRRYTRALKEGLPLPDLI
IIDGGKGHLAAASDVLENELGLYIPMAGLVKDDKHKTSHLIIGDPPEPVM
LERNSQEFYLLQRVQDEVHRFAITFHRQLHGKSVIQSALDDIPGIGDKRK
KVLLKHFGSLKKMKEASIEEFVEAGMPKNVAETIYTYLTDKKTL
>BCE33L2446 uvrC, excinuclease ABC, C subunit, N-terminal region
MNLIKISIPEVDVTITERKQVIRGDEPRITPINGFIDFHLFPRDKGGIFM
FYNINDELLFVGKARKIRQRIKKHFEDNVSPIKNHRDEVYRIDACIVEDP
TEREIYETYIINEYKAKYNVDKVFYK
>BCE33L0031 uvrC, excinuclease ABC, C subunit
MEKNKHCFYVVECSDGSYYAGYTNHIEKRIGTHNSGRGAKYTRARLPVVL
KYVEFHEDKRTAMQAEYYFKQLNRKQKEEYMQKGERYVATKKLSTK
>BCE33L2848 uvrC, UvrC-like protein
MSELHFMSLEKLDYELEKDDSGIYFIKDYNDNIIYIGKAFSIKSRVLAHF
NSYSNIKEYVHLFNKVAYLIEDSLLKRSLLQVTYMIKYKPVLNKEVQKEF
PELYTQYIKQTNKKSMLLEMDEAKEKRGELKNRLVKLVGGKTMFYDIISL
LNNGYNYHVLAKVLSIELQTLIIMKEHRNKFPIPHNYKRTIKHQDIMYAL
SGKKNLSTSRLNT
>pE33L466_0018 xerC, tyrosine recombinase
MDYNKQRQKSVHRKKLYENLNEMPFYIEEFVEYKELHDASPSTLLNYVYD
FRVFFKWLLSEQIIELKPIKDISFSDLENLKKKDVENFMRFLKLQQNMQN
SSVNRKISALKSLFKYLTSLSENEDGECYFYRNVMAKIEIHKDKETLNAR
AKRMRSKIFHNDDDQEFLNYVKYEHEKSLTKHQLFYFLRDKDRDVAILSL
FLGSGIRVSELADLRMEDINLKERLIDVIRKGNKEDSVWITPIALNDLEK
YMGIRDNKYAPGKELKNVFLSKYKHTAQPLSVRAIQDIVEKYTKAYGKKM
SPHKLRHTLANKLYMEEKDSLQVMQQLGHTSQDTALLYTQLGETTIKDSL
GRIGKNKE
>BCE33L3846 xerD, integrase/recombinase (tyrosine recombinase)
MEDQLKDFIHYMVVEKGLAKNTVVSYERDLKSYVKYLQKVEQAKSFHEVT
RLHIVNFLQHLKENGKSSKTLARHIASIRSFHQFLLRERAVEHDPSVHIE
TPQGERKLPKVLSVDEVEALLQTPKMTSAFGVRDKAMLELLYATGLRVSE
LIALNLEDVHLTMGFVRCIGKGNKERIIPLGSLATEAIQKYIEKGRRELM
GKKVVDALFLNHHGNRLSRQGFWKILKRLAKEANIEKELTPHTLRHSFAT
HLLENGADLRAVQEMLGHADISTTQIYTHVSKTRLKDVYKQFHPRA
>BCE33L3590 xerD, site-specific integrase/recombinase XerD protein
MNVKKLLQLFVGYLQIERNYSKYTIASYQNDLEHFVQFMEREGISSFLDI
TYADVRLYLTTLHDEKLARKSVARKVSSLRSLYRFLMREGYRKDNPFALA
SLPKKELSIPKFLYAEELEELFEVSDTATPLGQRNQALLELMYATGIRVS
ECVNLQLTDIDFAVGTILVMGKGKKQRYIPFGSYAQDALITYIENGRKQL
AEKTEEQSHMVFLNAKGTPLTSRGVRYVLNELIKKASLTMRISPHMLRHT
FATHMLDEGADLRTVQELLGHENLSTTQIYTHVSKERLRSVYMKHHPRA
>BCE33L4231 xerD, integrase/recombinase
MSKRRDSLTVSEDLSNIFDRKLERKQVKGLTIKKALSTVIRQMRATGLRD
RTISDYELHIGHFMSVTGAEFLQELTVEHIYLWLSSMNVSNQTKLTRLKC
LKAFLGRCFDNGWIEINFWKSIKIKVDSPVKEGATDREINLLLSILDLTR
FIELRDAASVLLMYQTGIRVGTLSQLEHKHVNLENKVLRIDGGIIKNHES
IHLPFDDVLARVLGALMKQNDIIRKECHIKNDYLFITKNGGRIATSPTNN
NITKRLSKHSKDYGLKNINPHALRRGFAKNLLKKGANIALISKALGHSDL
AVTTRYLHLDKEEVAESLRNFL
>BCE33L3933 xseA, exodeoxyribonuclease VII, large subunit
MEKQYLTVTALTRYIKTKIEYDPHLQSVWLKGEISNFKNHSRGHMYFTLK
DENARIAAVMFAGHNRNIKFRPENGMKVLVKGKISVYEASGSYQIYIQDM
QPDGIGNLHLAYEQLKVRLEEEGLFSQVYKKTIPPYAKTIGVITSPTGAA
IRDIITTIKRRYPIGNVIVFPVLVQGESAAPSIVQAIRTANEMEEIDVLI
VGRGGGSIEELWAFNEEMVARAIFKSEIPIISAVGHETDFTIADFVADLR
APTPTAAAELAAPNIIELQEKVLQRTLRLQRAMRELVHKKEEKLQVLQKS
YAFRYPRQVYEQKEEQLDRALEQLVLAKERYIDKKVNQLKQLSFYLEKHH
PSQKIMQTKVAVETLQKQLQREMQTLLQTKEFAFVRAAQKLEALSPLKVM
MRGYGLVYDEEKQVLKSVKDVSLGDAVSVQLQDGILDCSVSGIEERELNN
GK
>BCE33L3932 xseB, exodeoxyribonuclease VII, small subunit
MENKLSFEEAISQLEHLVSKLEQGDVPLEEAISYFKEGMELSKLCDEKLK
NVQEQMAVILGEDGELEPFTALGDEA
>BCE33L2796 yhaZ, conserved hypothetical protein
MGKYVPLKFLFNEELAEKMADSICKHDPTFSKRNFVSSVTCKVENLELKQ
RIEVMADELHNALQKDFNEAIHILLKTLGPENTTEVGTFTNGYMYMPIAK
YVEKYGLNDFDSSFNAMYEITKRNTAEYAIRPFLEIYHEDTINILQKWIH
DKNSHIRRLVSEGTRPRLPWAKKIGALKGDFQYNLQLLDPLTNDPSKYVQ
KSVANHINDITKEDKELVFQWLQQLRSKQHPVNPWIIKHGLRTVIKHDTL
PKDFSF
>BCE33L2067 ywkC, conserved hypothetical protein
MEYKTPFIAKKLGVSPKAVVRIAQQLNLTIEKNKYGHFIFTQDDLDQMLE
YHRSQIEQSQNTHPTQKTSSNDVEELKTQVNTIVQNISSHDFEQLAAQLN
TITRRLDRMEEQMQDKANDVVTYQLLQHRREMEEMLERIQKLEAGLKKEE
PIYITPDTKPTYEREKKPKRRKMIFSIFGL