TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Staphylococcus aureus subsp. aureus MRSA252, MRSA252
Gene type: CDS

Number of genes found: 173

Free access
Sort by:

 



# Staphylococcus aureus subsp. aureus MRSA252, MRSA252

>SAR0804 putative helicase
MDNVTRYKITESSQSSSQAYYHLSFELSEQQSYASEHIVRAIRMRQTILL
YAVTGAGKTEMMFQGIQYARRQGDNIAIVSPRVDVVVEISKRIKDAFLNE
DIDILHQQSSQQFEGHFVVCTVHQLYRFKQHFDTIFIDEVDAFPLSMDKS
LQQALKSSSKVEHATIYMTATPPKQLLSEIPLENIIKLPARFHKKSLPVP
KYRYFKLNNNKIQKMLYRILQDQINNQRYTLVFFNNIETMIKTFSIYKQK
ITKLTYVHSEDVFRFEKVEQLRNGQYDVIFTTTILERGFTMANLDVVVID
AHQYTQEALIQIAGRVGRKLECPTGKVLFFHEGVSMNMIQAKKEIQKMNK
LALKRGWIDE
>SAR0489 conserved hypothetical protein
MDSHFVYIVKCSDGSLYTGYAKDVNARVEKHNRGQGAKYTKVRRPVHLVY
QEMYETKSEALKREYEIKTYTRQKKLRLIKER
>SAR0480 putative insertion element protein
MCRILGISRASYYKWVHYQSSELELENEQLKREIESIYHKYNGIYGYRRI
YIYIRLKLGKQVNRKRIYRLMKELNLKAVIRQKRKPYRRSTPQITSENKL
NRQFDIDTPNKVWLTDVTEFKIKEGSKIYLSAIYDLGAKRIVSYELGPSN
NNQLVFKTFNQAIEKVENTKGILFHSDRGFQYTSKTFKHMLDECGMIQSM
SRVSKCIDNGPMEGVWGTIKSEIFRGNKHFKFNSVEEATKTIHDFILFFN
HERITLKMADSV
>SAR1790 conserved hypothetical protein
MAEQQTIMERLFHSLDEKAKTLNNENGQSFIENLGLAMEQVYTNERGLLE
QSTLQDRRKAFQFAYLSLMQEEKIQANHQITPDSIGLILGFLVERFMNNQ
EELHIVDIASGTGHLSATVKEVLPEIAVMHHLIEVDPVLSRVSVHLANFL
EIPFDVYPQDAIMPLPLEEADIVIGDFPVGYYPIDERSRDFKLGFEEGHS
YSHYLLIEQAINALKDAGYAFLVVPSNIFTGEHVKQLEKYIATETEMQAF
LNLPPTLFKNEKARKSILILQKKKSGETKSVEVLLANIPDFKNPSQFQGF
MTELNQWMDTNRPKK
>SAR0222 staphylocoagulase precursor
MKKQIISLGALAVASSLFTWDNKADAIVTKDYSKESRVNENSKYDTPIPD
WYLGSILNRLGDQIYYAKELTNKYEYGEKEYKQAIDKLMTRVLGEDHYLL
EKKKAQYEAYKKWFEKHKSENPHSSLKKIKFDDFDLYRLTKKEYNELHQS
LKEAVDEFNSEVKNIQSKQKDLLPYDEATENRVTNGIYDFVCEIDTLYAA
YFNHSQYGHNAKELRAKLDIILGDAKDPVRITNERIRKEMMDDLNSIIDD
FFMDTNMNRPLNITKFNPNIHDYTNKPENRDNFDKLVKETREAIANADES
WKTRTVKNYGESETKSPVVKEEKKVEEPQLPKVGNQQEDKITVGTTEEAP
LPIAQPLVKIPQGTIQGEIVKGPEYLTMENKTLQGEIVQGPDFPTMEQNR
PSLSDNYTQPTTPNPILKGIEGNSTKLEIKPQGTESTLKGTQGESSDIEV
KPQATETTEASHYPARPQFNKTPKYVKYRDAGTGIREYNDGTFGYEARPR
FNKPSETNAYNVTTNQDGTVSYGARPTQNKPSETNAYNVTTHANGQVSYG
ARPTQNKPSETNAYNVTTHANGQVSYGARPTQNKPSKTNAYNVTTHADGT
ATYGPRVTK
>SAR2029 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRLMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR1297 conserved hypothetical protein
MEAVVDWVQVTFHITPISAVIEDVIGLPITLFKKRNSGIYFYNRGYEFSN
IKLYYSSDDESMGIHLQLTGTGCREFEHHLQQLNKTWQDFFDKCLSVNAN
FTRIDIAIDDYKTYLKVPLLIKKAEKAECVSRFRSGSAINGFNLSDGRSK
GATFYIGSKQSNLYCRFYEKNYEQAFKRHCDVEDIGLWNRYEIQMRKAYA
VNCAKVLSRTDNISEIVKSILHNNLRFISPPKDGNDKNRKRWPLYRPWAL
FIKDTEKLNLTTRPTLKSIEDNLDWLCKQVATTLDTVLTAESMAQSEGLL
TDTDFLDKILAHSQFNDEHTNRINHYLEALKQKKHLSKDKC
>SAR0962 putative insertion element protein
MKTLKGALFMSIKPKVDIDVLLYMFQEYEKGVSFQYLIDTLQLDINVNTL
YPKYKYYKTHGIETLLTQKQYNHYSKELKLKVVNEYLNSNQSTQDIAIKY
NIRSSTQVKNWIKMYTVGKEIKNQSPKTGVYIMKARKTTFEERVTIVEYY
LNHKQSYREVAEHFNISYGQFTSGFISIKHMVKMD
>SAR1463 putative endonuclease
MVSKKKALEMIDVIANMFPDAECELKHDNPFELTIAVLLSAQCTDVLVNR
VTTELFKKYKTPEDYLAVSDEELMNDIRSIGLYRNKAKNIKKLCQSLIDQ
FNGEIPQTHKELESLAGVGRKTANVVMSVAFDEPSLAVDTHVERVSKRLG
INRWKDNVRQVEDRLCSVIPRDRWNRSHHQLIFFGRYHCLARKPKCDICP
LLEDCREGQKRYKASLKEA
>SAR0286 hypothetical protein
MSNKGEIRRQIANKEREKASKEAQLTDLKEDLRRLKDASKKLDTAGEDFN
KGQSSYNKVEISTSDWKGERRTKSDSKKKDVDSELKKVEQDFDDAKKAIK
KDIQDKEEEIKGVEGEISTINAAIDALKSKL
>SAR1228 putative integrase/recombinase
MNHIQEAFLNTLKVERNFSEHTLKSYQDDLIQFNQFLEQEHLQLNTFEYR
DARNYLSYLYSNHLKRTSVSRKISTLRTFYEYWMTLDENIINPFVQLVHP
KKEKYLPQFFYEEEMEALFKTVEEDTSKSLRDRVILELLYATGIRVSELV
NIKKQDIDFYANGVTVLGKGSKERFVPFGAYCRQSIENYLEHFKPIQSCN
HDFLIVNMKGEAITERGVRYVLNDIVKRTAGVSEIHPHKLRHTFATHLLN
QGADLRTVQSLLGHVNLSTTGKYTHVSNQQLRKVYLNAHPRAKKENET
>SAR1977 hypothetical protein
MNYVERYIEQFLRATVRNNIKHYLLMLDEKMKNLDDYMHYLITKKEQLSK
LIDSLMLTLENKYIDIVEAFQIQCAREINNQEIENIKSELNKVEAYYAQI
ETQIQQTSTEKIATEKTSYLINYMNAVA
>SAR1289 putative exported protein
MKQKIIMFTLVTVILFCAVLIGYQIPKQQVKMKQNQIEDLQEEQRILRDK
NGELNKLVKRQSKTVISDEEKQIREVSSNFVKQMFEMKKDSSFKSKAPQI
KPLVTKDYYDTLFKDSKDKYDLYDDITVNDIHVYFDTYDPKKDSYKVFVQ
FDERIETDGDDKIEHRQTSAQLDLVRTAEGWRIDNLKRFNLKPLGR
>SAR1523 hypothetical phage protein
MAIDFKPHSYQKYAIDKVIDNEKYGLFLDMGLGKTVSTLTAFSELQLLDT
KKMLVIAPKQVAKDTWVDEVDKWNHLNHLKVSLVLGTPKERNDALNTEAD
IYVTNKENTKWLCDQYKKEWPFDMVVIDELSTFKSPKSQRFKSIKKKLPL
INRFIGLTGTPSPNSLQDLWAQVYLIDRGERLESSFSRYRERYFKPTHQV
SEHIFKWELRDGSEEKIYKQIEDICLSMKAKDYLDMPDRVDTKQTVVLSE
KERKVYEELEKNYILESEEEGTVVAQNGASLSQKLLQLSNGAVYTDEEDV
RLIHDKKLDKLEEIIEESQGQPILLFYNFKHDKERILQRFKEATTLEDSN
YKERWNSGDIKLLIAHPASAGHGLNLQQGGHIIVWFGLTWSLELYQQANA
RLYRQGQNHTTIIHHIMTDNTIDQRVYKALQNKELTQEELMKAIKARIAK
HK
>SAR0061 putative membrane protein
MTQTNPSFNPSPRYKSKKGWYKDKPPKEKGGMPTEVEIAGPIVIENKFID
PKTNTEKVIITDEDQKEIVESSDILTTQKLPSLMKYGFSINEKYTKDLGY
ALQQMRNQLPISYLYEGVGILETPFGPIVSLSEIYTTKEFDNKSPSDAIC
DNPYDLTPKGTFDNWFNMYLKEVKGSLLLELAVVFGISALVTSFLKHKHE
TEFAGIIFSFTGQSSTGKSTAASLAVSVAGNPTKGNETLFRNWNATRNAL
EGYLSNNFGIPIVFDELSSATFRDTTGLLYSITEGQGRQRSNVHGEVKTP
KNWGTSVISTSEYSIFNDSAQNDGLRVRTIEINEQFTTNATNADNIKKAV
ALNYGHVLPLVAKYLINREDEVIQWFYKEVDWFEAKLKDETNNTGIRMFK
RYAVITTSAKILGRVLSTDIDIANIRDYFIDYHTHTVSERSLADKAIDVI
IQFVAQNRGKFSDEGALKNMFENYGLISLKDDHIEVKMIANVFKQMLNNH
QFQDVNNVVNALRDKGFILADRGRQTTKRSVKDNSGKKQSLVFYHLKLDV
EFASILGLTKDKSLLQNWTPSNDNKAAKELFKSANEGIGLSGVHEDF
>SAR1377 ImpB/MucB/SamB family protein
MYNYHLLEDRDVLCIDQKSFFASVSCIEKGLDPLETKLAVVADTKRQGSV
VLAATPKLKELGIKTGSRLFEIPHRNDIYIINPSMRKYLNVSVAISKIAL
RYIPPEDLHQYSIDEFFMDVTDSYHRFSSTVHAFCERLKREIYEETGIYC
TVGIGSNMLLSKIAMDVEAKHNQNGIAEWRYQDVPTKLWPIQPLRDFWGI
NRRTEAKLNKRGIFTIGDLAKYPYKFLKKEFGILGVDMHLHANGIDQSKV
REKHKISNPSICKSQILMRDYHFDEAKVVMQELIEDVASRVRARKKVART
IHFAFGYSDEGGVHKQYTLKDPTNLEKDIYKVVMHFADKLCNKQALYRTL
SISLSQFINEDERQLSLFEDEYQRKRDECLAKTIDQLHLKYSKGIVSKAV
SFTEAGTKHGRLGLMAGHKM
>SAR1562 phage integrase
MWFEKFKNKNNETKYRYYEKYKDPYTDKWKRVSVVLNKNTKQSQKEAMFR
LEEKIKEKLNNKSSSELKTLTFHALLDEWLEYHIKTSGSKLTTLNNIKIR
IRNIKRYSSENLLLNKLDTKYMQIFINKLSDIYSQNQVTRQLGDMKGAIK
YAVKFYNYPNEYLLTNVKIPKRRKTIEDIEKDESKMYNYLEMNQVLQIRD
HILNDNKLHKRNRILIASILEVQALTGMRIGELQALQEKDIDLLNKTINI
TGTIHRIKYEEGFGYKDTTKTISSKRSISINSRTVEIFKKIILENKMLKR
WNSSYVDRGFIFTTKKGNPLCNNQIAGVLKKTTKALNMNKKVTTHTFRHT
HITLLVEMNVSLKAIMKRVGHVDEKTTIRIYTHVTEKMDRELTQKLENIP
S
>SAR2575 putative NUDIX hydrolase
MKKVINVVGAIIFSDNKILCAQRSEKMSLPLMWEFPGGKVEKNETEKDAL
IREIREEMKCDLIVGDKVITTEHEYDFGIVRLTTYKCTLNKELPTLTEHK
SIEWLSINELDKLNWAPADIPAVNKIMTEG
>SAR0963 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIFETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRMMWLAQDQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDKSIFIDGTKVEASANRYT
FVWKKSIQNYESKLNENSKALYRDLVEEKIIPEIKENGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPRYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFKTQNWKYDELNDEFICPNNKRIG
LKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR2429 putative 3-methylpurine glycosylase
MDFVNNDTRQIAKNLLGVKVIYQDTTQTYTGYIVETEAYLGLNDRAAHGY
GGKITPKVTSLYKRGGTIYAHVMHTHLLINFVTKSEGIPEGVLIRAIEPE
DGLSAMFRNRGKKGYEVTNGPGKWTKAFNIPRAIDGARLNDCRLSIDTKN
RKYPKDIIASPRIGIPNKGDWTHKSLRYTVKGNPFVSRMRKSDCMFPEDT
WK
>SAR2471 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRMMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR2185 putative single strand DNA-binding protein
MLNKIVIVGRLTKDAQIFEKEDRKIATFCVATHRNYKDENGEIVCDYLFC
KAFGKLASNIERYTNQGTLVGITGQMRSRKYDKDGQTHFVTELYVETIKF
MSPKSQNNEILSDSILDIDSQNIDNHDLLEI
>SAR0955 putative transposase
MSYNHLTLTERARIEVLRQENYSLRSIARKLKRSVSTISREISRNNLNQS
YQAETAQKNYETKRKLCGRPTRFTPELGNIIKYYLKCHWSPEQIVGRLLQ
NQICFKTIYRWINSNMINFELISCLRQKGKRQKPKETRGKFNIGRPISQR
PKEIKKRNTFGHWEADTIVSSRGKSKGCIATFAERKSRYYYCVLMPDRSS
NSMETAINNLIKHLPKGAVKTITVDRGKEFSCYQNIENQFNINVYFADPY
SAWQRGTNENTNGLLREFFPKKTDLAKVNQEQLNYALDSINYRPRKCLNW
KFPYEVLCDELLHLN
>SAR1305 putative transposase
MSYNHLTLTERARIEVLRQENYSLRSIARKLKRSVSTISREISRNNLNQS
YQAETAQKNYETKRKLCGRPTRFTPELGNIIKYYLKCHWSPEQIVGRLLQ
NQICFKTIYRWINSNMINFELISCLRQKGKRQKPKETRGKFNIGRPISQR
PKEIKKRNTFGHWEADTIVSSRGKSKGCIATFAERKSRYYYCVLMPDRSS
NSMETAINNLIKHLPKGAVKTITVDRGKEFSCYQNIENQFNINVYFADPY
SAWQRGTNENTNGLLREFSPKKTDLAKVNQEQLNYALDSINYRPRKCLNW
KFPYEVLCDELLHLN
>SAR0366 putative integrase
MWVREITKNNSTAYRYLERYTDPLTGKYKTVSVTRNKNNVRSQKDAQLEL
NKIIEQRLKHYSTKQLENLTFHDACDEWLEHYKTHSGSKPTTIKEKKSNT
NTVKNAIDSKVLISKITHTYLQNIINEWAKSHSIGHVQSLVIVIRSVFKY
AFKYYDLHDISVLDKIDIPKKAQTRNEFQAKRNNYLEDSEVKELLECFDY
LIKHKRHATRKRNYEMVKALVEFQINNGMRIGELLAIKTDNVDVENKTLE
IDGTINWVTDVVTGAFGVKETTKTSKSYRTIGLTAQSINLLKKLMLENKK
ENQWNDKFIDRGYIFTNTAGSPIDLNKVNNIIKEATDISSINKRVTTHTL
RHTHISTLAQLGINLKAIQERVGHSDYKTTLEIYTHVTDKMAKDMMNKLE
GIGS
>SAR1358 putative insertion element protein
MKTLKGALFMSIKPKVDIDVLLYMFQEYEKGVSFQYLIDTLQLDINVNTL
YPKYKYYKTHGIETLLTQKQYNHYSKELKLKVVNEYLNSNQSTQDIAIKY
NIRSSTQVKNWIKMYTVGKEIKNQSPKTGVYIMKARKTTFEERVTIVEYY
LNHKQSYREVAEHFNISYGQIYQWVHKYQAHGKNGLVDGRGKGKPKSMMT
PEEQKEAEIQALKAQNRLLEMENDVLKKFQALEREMIQRENKSRHTKRSK
R
>SAR2343 conserved hypothetical protein
MSLENQLAELKYDYVRLQGDLEKRESLNLDTSALVRQLKDIENEIRNVRA
QMQD
>SAR0520 putative insertion element protein
MKTLKGALFMSIKPKVDIDVLLYMFQEYEKGVSFQYLIDTLQLDINVNTL
YPKYKYYKTHGIETLLTQKQYNHYSKELKLKVVNEYLNSNQSTQDIAIKY
NIRSSTQVKNWIKMYTVGKEIKNQSPKTGVYIMKARKTTFEERVTIVEYY
LNHKQSYREVAEHFNISYGQIYQWVHKYQAHGKNGLVDGRGKGKPKSMMT
PEEQKAAEIQALKAQNRLLEMENDVLKKFQALEREMIQRENKSRHTKRSK
R
>SAR0698 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRMMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKTNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR1911 putative exported protein
MKKEILKKQWVLWTILVTSFFSIGTPGIAIIPFSLSIYALSKLILVNKFV
EPDLVTLQNLKEKNKSINIENQKIKREIQELKNLKKDLISSIEEGTKELE
HITSYLNDELFKYDIELTYPFDLVEVDSSQINTYIKKLQMKEKELLNLEE
VKIFNVSTENKRHQNAQAKQIIRLFNAETSQLINKVNSKNIESMQNKIFK
SYEGINKIFETDNVRIPETLLDIKLEMLDLMHKYQVKIEDEKIIRREERA
RLKEIEQAEKEMEKKLKELDKDIRHHNNEIKKLTMYLNNTDLQVEKELYI
EKIRELDQSLKNLNSERENVEDRKDNAQSGFVYIISNIGSFGENVYKIGV
TRRLEPMDRINELSSASVPFEFDVHALIFSENAFELKNKLHEYFKKYKVN
KVNGRKEFFKVNINEIKDKVLSEHNSTVQFIDEPKAIQYRETLRLTSL
>SAR1356 putative exonuclease
MKIIHTADWHLGKILNGKQLLEDQAYILDMFVEKMKEEEPDIIVIAGDLY
DTTYPSKDAIMLLEQAIGKLNLELRIPIIMISGNHDGKERLNYGASWFEN
NQLFIRTDFTSINSPIEINGVNFYTLPYATVSEMKHYFEDDTIETHQQGI
TRCIETIAPEIDEGAINILISHLTVQGGKTSDSERPLTIGTVESVQKGVF
DIFDYVMLGHLHHPFSIEDDKIKYSGSLLQYSFSEAGQAKGYRRVTINDG
IINDVFIPLKPLRQLEIISGEYNDVINEKVHVKNKDNYLHFKLKNMSHIT
DPMMSLKQIYPNTLALTNETFSYNEENNAIEISEKDDMSIIEMFYNHITD
KELSDIQSNKIKNILENELRKED
>SAR1117 MutS family DNA mismatch repair protein
MRQKTLDVLEFDKIKSLVANETISDLGLEKVNQMMPATNFETVVFQMEET
DEIAQIYNKHRLPSLSGLSKVSAFIHRADIGGVLNVSELNLIKRLIQVQN
QFKTFYNQLVEEDEGVKYPILDDKMNQLPVLTDLFQQINETCDTYDLYDN
ASYELQGIRSKISSTNQRIRQNLDRIVKSQANQKKLSDAIVTVRNERNVI
PVKAEYRQDFNGIVHDQSASGQTLYIEPSSVVEMNNQISRLRHDEAIEKE
RILTQLTGYVAADKDALLVAEQVMGQLDFLIAKARCSRSIKGTKPIFKEE
RTVYLPKAYHPLLNRETVVANTIEFMEDIETVIITGPNTGGKTVTLKTLG
LIIVMAQSGLLIPTLDGSQLSVFKNVYCDIGDEQSIEQSLSTFSSHMTNI
VEILKNADKHSLVLFDELGAGTDPSEGAALAMSILDHVRKIGSLVMATTH
YPELKAYSYNREGVMNASVEFDVDTLSPTYKLLMGVPGRSNAFDISKKLG
LSLNIINKAKTMIGTDEKEINEMIESLERNYKRVETQRLELDRLVKEAEQ
VHDDLSKQYQQFQNYEKSLIEDAKEKANQKIKAATKEADDIIKDLRQLRE
QKGADVKEHELIDKKKRLDDHYEAKSIKQNVQKQKYDKIVAGDEVKVLSY
GQKGEVLEIVNDEEAIVQMGIIKMKLPIEDLEKKQKEKVKPTKMVTRQNR
QTIKTELDLRGYRYEDALIELDQYLDQAVLSNYEQVYIIHGKGTGALQKG
VQQHLKKHKSVSDFRGGMPSEGGFGVTVATLK
>SAR1698 conserved hypothetical protein
MSDPTLFDYSMIKGTVEAILFQNSDNFYTVLKVDTIETNEDFDTMPTVVG
FLPNIVEGDVYTFKGQVVDHPRYGKQLKAETFEKEMPQTKEAIISYLSSD
LFKGVGKKTAQNIVNTLGDNAINDILDDHSVLEKVSGLSKKKQKQIAEQI
SANQESEKIMIRLHDLGFGPKLSMAIYQFYLGDTLTILDRNPYQLIYDIK
GIGFNKADQLARNIGIAYNDNERLKAALLYTLEEECIKQGHTYLPINVVI
DLTVDVLNYQDEEVIEPEKLDEMLQYLNEEKRLIIDNEQVAIPSLYYSEI
KSVQNLFRIKTHTNKLTEIEQSDLQMHIGEIEDANQVNYAASQKEALQTA
INSKVMLLTGGPGTGKTTVIKGIVELYAEIHGLSLDYDDYVNDDYPVVLA
APTGRASKRLQESTGLEAMTIHRLIGWNQDTKPEDILENEINARLIIIDE
MSMVDTWLFHQFLSAVPLDAQLIFVGDEDQLPSVGPGQVFKDLIESKAIP
RVNLTEVYRQQDGSSIIELAHRMKLGQKIDITQRFHDRSFINCQANQIPT
VVEKVVTSAVNKGYTMADIQVLAPMYKGNAGIKRLNQVLQDILNPKKKDT
REIEFGDVVFRKGDKVLQLVNRPNDNIFNGDIGVIVGIFWAKENALNKDV
LVVDFEGNEITFTKQDMMELTHAYCTSIHKSQGSEFPIVIMPIVKQYFRM
LQRPILYTGLTRAKTSLVLLGDSEAFDIGLKTNGQARLTQLCTLLKNYFN
SEDEDMLENTATNDTGASQTTIDDQVEAPMSSNASEEVTSDSTKTDTNVL
TEATMFKIDPMINMGEITPYDFIER
>SAR2083 putative single-strand DNA-binding protein
MLNRTILVGRLTRDPELRTTQSGVNVASFTLAVNRTFTNAQGEREADFIN
IIVFKKQAENVNKYLSKGSLAGVDGRLQTRNYENKEGQRVYVTEVVADSI
QFLEPKNSNDTQQDLYQQQVQQTRGQSQYSNNKPVKDNPFANANGPIELN
DDDLPF
>SAR1986 ImpB/MucB/SamB family protein
MTERRIIHIDMDYFFAQVEMRDNPKLKGKPVIVGGKASSRGVVSTASYEA
RKYGVHSAMPMSQAHKLCPNGYFVTSNFGAYRETSAQIMSIFRSYTDKVE
PMSLDEAYLDITELVRPDLPASKIAQYIRKDILEQTHLTASAGVSYNKFL
AKLASGMNKPDGMTVIDYRNVHDILMTLDIGDFPGVGKASKKVMHDNGIF
NGRDLYEKTEFELIRLFGKRGRGLYNKARGIDHSEVKSTRVRKSVGTERT
FATDVNDDEEILRKVWELSGKTAERLNKLQKSAKTVTVKIKTYQFETLSK
QMSLRDSVSSEEDIYNIAYLLYNDLKDPDVPIRLIGVTVGNLEQSTYKNM
TIYDFI
>SAR0034 putative transposase
MNYFRYKQFNKDVITVAVGYYLRYTLSYRDISEILRERGVNVHHSTVYRW
VQEYAPILYQIWKKKHKKAYYKWRIDETYIKIKGKWSYLYRAIDAEGHTL
DIWLRKQRDNHSAYAFIKRLIKQFGKPQKVITDQAPSTKVAMAKVIKAFK
LKPDCHCTSKYLNNLIEQDHRHIKVRKTRYQSINTAKNTLKGIECIYALY
KKNRRSLQIYGFSPCHEISIMLAS
>SAR1768 formamidopyrimidine-DNA glycosylase
MPELPEVEHVKRGIEPYVINQKIEHVIFSDKVIEGKAQGKETIIKGIELD
TFKTLSEGYTITNVERRSKYIVFQLDNKREQRTLISHLGMAGGFFIVDEL
EDIMIPNYRKHWHVIFELSNDKKLIYSDIRRFGEIRNVASVASYPSFLEI
APEPFTNEALTYYLNRIHQQSNKNKPIKQVILDHKVIAGCGNIYACEALF
RAGVLPDKKVKDLTHQQQEMVFYYVREVLEEGIKHGGTSISDYRHADGKT
GEMQLHLNVYKQPVCKVCGSQIETKIIATRNSHYCPVCQK
>SAR1466 conserved hypothetical protein
MGMATYAVVDLETTGNQLDFDDIIQIGITFVRNNQIIDTYHSMIRTNLEI
PPFIQALTSIEENMLQQAPYFNQVAEEIYDKIKDCIFVAHNVDFDLNFIK
KAFKDCNIQYRPKKVIDTLEIFKIAFPTDKSYQLSELAEAHGITLANAHR
ADEDAATTAKLMILAFEKFEKLPLDTLKQLYYLSKQLKYDLYDIFFEMVR
QYDAKPLDKSYEKFEQIIYRKQVDFKKPTTNYNGSLKSLYSKAVDQLGLT
YRPQQLYLAETILDQLMHSEKAMIEASLGSGKSLAYLLAALMYNIETGKH
VMISTNTKLLQSQLLEKDIPAMNEALNFKINALLIKSKSDYISLGLISQI
LKDDTSNYEVNILKMQLLIWITETPSGDIQELNLKGGQKMYFDQKIETYV
PARHDVHYYNFIKRNAQNIQIGITNHAHLIHSDVENSIYQLFDDCIVDEA
HRLPDYALNQVTNELSYADIKYQLGLIGKNENEKLLKAIDQLEKQRILEK
LDIAPIDIFGLKASMNEIHELNEQLFSTIFTIINDSDVYDDDIHRFHNVF
TFETKDILKDLHAIIDKLNKTLEIFNGISHKTVKSLRKQLLYLKDKFKNI
EQSLKAGHTSFISIKNLSQKSTIRLYVKDYAVKDVLTKQVLEKFKSLIFI
SGTLKFNHSFDAFKQLFNKDVHFNTFEVNTSLQSAKNTSVFIPSDVASYQ
YKNIDEYVASIVSYIIEYTTITSSKCLVLFTSYKMMHMVQDMLNELPEFE
DYVVLTQQQNQNYKIVQQFNNFDKAILLGTSTFFEGFDFQANGIKCVMIA
KLPFMNKHNAKYWLMDSEFTSTFKEYVLPDAVTRFRQGLGRLIRNENDRG
IIVSFDDRLINSNYKNFFEQTLENYRQKKGDIQQFGKLLRQIQKKKK
>SAR1138 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRLMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR0405 hypothetical protein
MADITVVNDTGELYNVINQKKSEGYLESELTIISKSKLHLNDLHDSEISL
ISTSGTFSDKMTKLLTGEDGEHAVLSRYNLAPDELEKYKQLILDDKMLVV
GVRDHSSHQEVLENNSAYEEVDITHFAEASKGPKA
>SAR2666 hypothetical protein
MSNKLDGINKMITAKHKQMDDLYDEKQEVKALIDESDALNHSIEQLYQHL
GERYYSSNMASRMEQFRDEFHFAKRRSTEALYEQQQQIQHDIRKAEEEMI
DLEMRRNIEIEMVTKEENKWKQ
>SAR1359 putative transposase
MCRILGISRASYYKWVHYQSSELELENEQLKREIESIYHKYNGIYGYRRI
YIYIRLKLGKQVNRKRIYRLMKELNLKAVIRQKRKPYRRSTPQITSENKL
NRQFDIDTPNKVWLTDVTEFKIKEGSKIYLSAIYDLGAKRIVSYELGPSN
NNQLVFKTFNQAIEKVENTKGILFHSDRGFQYTSKTFKHMLDECGMIQSM
SRVSKCIDNGPMEGVWGTIKSEIFRGNKHFKFNSVEEATKTIHDFILFFN
HERITLKMADSV
>SAR0867 hypothetical protein
MAIVNKVIIVEGKSDKKRVQQVIAEPVNIICTHGTMSIDKLDDMIESLYD
KQVFVLADSNDEGDRIRNWFKRYLSESEHIFIDKTYCQVANCPKQYLAHV
LSKHGFTCKKETPFLPNINNERLVLVNE
>SAR2306 putative insertion element protein
MKTLKGALFMSIKPKVDIDVLLYMFQEYEKGVSFQYLIDTLQLDINVNTL
YPKYKYYKTHGIETLLTQKQYNHYSKELKLKVVNEYLNSNQSTQDIAIKY
NIRSSTQVKNWIKMYTVGKEIKNQSPKTGVYIMKARKTTFEERVTIVEYY
LNHKQSYREVAEHFNISYGQIYQWVHKYQAHGKNGLVDGRGKGKPKSMMT
PEEQKEAEIQALKAQNRLLEMENDVLKKFQALEREMIQRENKSRHTKRSK
R
>SAR1707 putative ATPase
MSTEPLASRMRPKNIDEIISQQHLVGPRGIIRRMVDTKKLTSMIFYGPPG
IGKTSIAKAISGSTQYKFRQLNAVTNTKKDMQLVVEEAKMSGQVILLLDE
IHRLDKAKQDFLLPHLENGKIVLIGATTSNPYHAINPAIRSRAQIFELYP
LNDEDVRQALTRAIEDEENGLKTYQPKIDEDAMTYFSTQSQGDVRSALNA
LELAVLSADNDKDGYRHVTLQDAKDCLQKGAFVSDKDGDMHYDVMSAFQK
SIRGSDVNAALHYLARLIEAGDLPTIVRRLLVISYEDIGLASPNAGQRTL
AAIESAERLGLPEARIPLSQAVIELCLSPKSNSAMSAIDSALSDIRNGHV
GQIPNHLKDGHYQGAKDLGRSIGYKYPHQYVNGYVSQQYLPDKLKNKIYY
EPKTTSKSERQLKEIYNNLLKQRP
>SAR1874 conserved hypothetical protein
MRVKFRDKDNRQVNLTFKKDNEIADGNHVLAIPTFKNQLLFTKHNLRGIE
FPGGKRERGESSAEAITRELYEETGAKVKNIHYIAQYTIETHDQTDFVKD
VYFIEVESLVSKNDYLETAGPVLFNCINDIELAQRSFLLQDSTILKCVER
VQSLGFYQT
>SAR1897 hypothetical protein
MEKLNEAIKNYIDSDELFALFIDGQWGVGKTYYIKNKILKEFDDKIEIRY
ISLYGLQNLYEIKSIIISKLINTSTKNLRKVLKSMKNFQVVPYLNMNNFS
DNILRYIDDNSLKKIKNSLMKNNKSALLIIDDVERMSDKVSFEEFLGFIR
NDLIDYLDCKVLFIGNLDELNKKADFHKNSEKIISRILKFPNNREVAMDI
VKSNLPKLFVKNISTKELFNGFFDISDEKKEIIFEKSKQNNLNYKYYHLN
MRTLNLVVSNFKLIINKMQDEIKYKSKDFKITLYISLFISLFILYNEFRS
GNLGESEIKDLQFTYNDFNNLNTKTSIALFKYLYKNNELANYVFFDIELK
NLLLYGIFDESIFLKNITNNFKEEDAKADLLNVLKEFFKLSESELLEKEI
QAIEIIKDKNNENFNYKVKAYLSLLNLKGKNLYLIDGIKISELEDELVDN
YDLHQNNLIVGDTINRLDLVSFSNEQLYEEIKSLKDRLINKRTTLMTSIY
NQYLETILNGDFTNRTKLKEAYIGMKDIDTIEIIYNNLNKVSNELKSSNE
KIINLSAFIKDNHCNNDDNIQLEYIIGEMKKLKDGTKDRVTQMCFSMLID
TLKDI
>SAR1312 hypothetical protein
MIPGVNAPPMHPWCRSTTVPHVGNWRDKFFKEREGKYQVEVKEAKLQEKA
KNQMKEMIESGKIKIEINPEKQNRHSLGHKLYLKNKIYALQNDERFPSYT
ILSIEELNDLLKKYSMTGKILVDKFGFNRKEIINFEKTIGKAYAGGKYIN
TAYGKIHYSKTGSHIVPFVSKEK
>SAR0375 hypothetical protein
METGKSDVLDKIEKINKKDSALQEIIPKGYEIEHHQCGVALYQLIPSKKE
GEPDKKVFITNTIPQITERFEDIESNEVSFNMLFYDNKTPVNIAVSAEEI
SDSRQLLKLVNKKLDVTSSTSTKLVDYINISKRYNPPLNVKVATRLGHVK
GYFIYPYQEVMKDSNVKLFSNDKGFQKLIDSFRSKGTLQGYSKKVFAQIK
DLPMVMVMLYASLGSVLLREFGLQPFIVEISGSTSTGKTFTLNLVSSVWG
TSDLITTWSSTQNSIESMASFLNSFPMFKDDTRNTHPKFVASATYNFSSG
ESKSRSNINLTLNAKKEWRNILISTGESSIANMADEKAGVSARVVTLQDP
PYPDNFDFTTLDKSFRENYGTLGLAFIKQYESKKDVYKNAFESYQRYFNQ
KGSNEIMQRLGRAFALLQVTGEVLNDIDGFEHDHFKIIEQAYDSMVKNNK
TIDKPKQLLEELLQYLDANRNNIVGDGYDSVNYGDVKAVYKHDFLCIKNE
TVKNKLGHEMQTITGQWDKKGYLIKDKKRIQKQVKHKSQRHLGYAIKKEI
IEELGFDFSISHNPYTESY
>SAR0961 putative transposase
MCRILGISRASYYKWVHYQSSELELENEQLKREIESIYHKYNGIYGYRRI
YIYIRLKLGKQVNRKRIYRLMKELNLKAVIRQKRKPYRRSTPQITSENKL
NRQFDIDTPNKVWLTDVTEFKIKEGSKIYLSAIYDLGAKRIVSYELGPSN
NNQLVFKTFNQAIEKVENTKGILFHSDRGFQYTSKTFKHMLDECGMIQSM
SRVSKCIDNGPMEGVWGTIKSEIFRGNKHFKFNSVEEATKTIHDFILFFN
HERITLKMADSV
>SAR0481 putative insertion element protein
MKTLKGALFMSIKPKVDIDVLLYMFQEYEKGVSFQYLIDTLQLDINVNTL
YPKYKYYKTHGIETLLTQKQYNHYSKELKLKVVNEYLNSNQSTQDIAIKY
NIRSSTQVKNWIKMYTVGKEIKNQSPKTGVYIMKARKTTFEERVTIVEYY
LNHKQSYREVAEHFNISYGQIYQWVHKYQAHGKNGLVDGRGKGKPKSMMT
PEEQKEAEIQALKAQNRLLEMENDVLKKFQALEREMIQRENKSRHTKRSK
R
>SAR1575 putative ADP-ribose pyrophosphatase
MDLNEKTIDRTVIYNGKIVDVEIHTVTLPNGETSTRELVYHNGAVAVCAL
TPKKEVVLVKQYRKPVEKPLLEIPAGKLEDDEDRVEAAKRELEEETGYIA
KELTHVVDMYGSPGFCDEQLSIYFTDNVEEGTVHLDEDEFVEVIKVPIEN
VKSMLMNKEIEDAKTIIALQHLLLNYNHSK
>SAR0493 conserved hypothetical protein
MKINEFIVVEGRDDTERVKRAVECDTIETNGSAINEQTLEVIRNAQQSRG
VIVLTDPDFPGDKIRSTITEHVKGVKHAYIDREKAKNKKGKIGVEHADLI
DIKEALMHVSSPFDEAYESIDKSVLIELGLIVGKDARRRREILSRKLRIG
HSNGKQLLKKLNAFGYTEADVRQALEDE
>SAR1097 putative methylase
MRVIAGKHKSKALESMEGRNTRPTMDKVKEGIFNSLYDVSGIGLDLFAGS
GALGIEALSRGIDKVIFVDQNFKAVKVIKSNLANLDLEAQSEVYKNNADR
ALKALSKRDIQFDVIFLDPPYNKGLIDKALKLISEFNLLKENGIIVCEFS
NHEEIDYQPFNMIKRYHYGLTDTLLLEKGE
>SAR1635 putative helicase
MAKHPFEQFNLESSLIDAVKDLNFEKPTEIQNRIIPRILKRTNLIGQSQT
GTGKSHAFLLPLMQLIDSEIKEPQAIVVAPTRELAQQLYDAANHLSQFKA
GVSVKVFIGGTDIEKDRQRCNAQPQLIIGTPTRINDLAKTGHLHVHLASY
LVIDEADLMIDLGLIEDVDYIAARLEDNANIAVFSATIPEQLQPFLNKYL
SHPEYVAVDSKKQNKKNIEFFLIPTKGAAKVEKTLNLIDILNPYLCIVFC
NSRDNANDLARSLNEAGIKVGMIHGGLTPRERKQQMKRIRNLEFQYVIAS
DLASRGIDIEGVSHVINFDVPNDIDFFTHRVGRTGRGNYKGVAITLYSPD
EEHNISLIEDRGFVFNTVDIKDGELKEVKAHNQRQARMRKDDHLTNQVKN
KVRSKIKNKVKPGYKKKFKQEVEKMKRQERKQFSKQQNRQKRKQNKKG
>SAR0687 putative transposase
MNYFRYKQFDKDVITVAVGYYLRYALSYRDISEILRERGIYVHHSTIYRW
VQEYAPILYQIWKKKNKQAYYKWHIDETYIKIKGQWNYLYRAIDADGHTL
DISLRKKRDNHSAYTFIKRLIKQFGKPQMIITDQAPSTKVAMSKVIKDFK
LTPNCHCTSKYLNNLIEQDHRHIKVRKISYQSINTAKNTIKGIECIYGLY
KKNRRSLQIYGFSPCRVISIMLAS
>SAR1433 putative transposase
MSYNHLTLTERARIEVLRQENYSLRSIARKLKRSVSTISREISRNNLNQS
YQAETAQKNYETKRKLCGRPTRFTPELGNIIKYYLKCHWSPEQIVGRLLQ
NQICFKTIYRWINSNMINFELISCLRQKGKRQKPKETRGKFNIGRPISQR
PKEIKKRNTFGHWEADTIVSSRGKSKGCIATFAERKSRYYYCVLMPDRSS
NSMETAINNLIKHLPKGAVKTITVDRGKEFSCYQNIENQFNINVYFADPY
SAWQRGTNENTNGLLREFFPKKTDLAKVNQEQLNYALDSINYRPRKCLNW
KFPYEVLCDELLHLN
>SAR0929 conserved hypothetical protein
MTIPEKPQGVIWTDAQWQSIYETGQDVLVAAAAGSGKTAVLVERIIQKIL
RDGIDVDRLLVVTFTNLSAREMKHRVDQRIQEASIADPANAHLKNQRIKI
HQAQISTLHSFCLKLIQQHYDVLNIDPNFRTSSEAENILLLEQTIDEVIE
QHYDILDPAFIELTEQLSSDRSDDQFRMIIKQLYFFSVANPNPTNWLDQL
VTPYEEEAQQAQLIQLLTDLSKVFITAAYDALNKAYDLFSMMDGVDKHLA
VIEDERRLMGRVLEGGFIDISYLTGHEFGARLPNVTAKIKEANEMMVDAL
EDAKLQYKKYKSLIDKVKSDYFSREADDLKADMQQLAPRVKYLARIVKDV
MSEFNRKKRSKNILDFSDYEHFALQILTNEDGSPSEIAESYRQHFHEILV
DEYQDTNRVQEKILSCIKTGDEHNGNLFMVGDVKQSIYKFRQADPSLFIE
KYQRFTLDGDGTGRRIDLSQNFRSRKEVLSTTNYIFKHMMDEQVGEVRYD
EAAQLYYGAPYDESDHPVNLKVLVEADQEHSDLTGSEQEAHFIVEQVKDI
LEHQKVFDMKTGSYRSATYKDIVILERSFGQARNLQQAFKNEDIPFHVNS
REGYFEQTEVRLVLSFLRAIDNPLQDIYLVGLMRSVIYQFKEDELAQIRI
LSPNDDYFYQSIVNYINDEAADAILVDKLKMFLSDIQSYQQYSKDHPVYQ
LIDKFYNDHYVIQYFSGLIGGRGRRANLYGLFNKAIEFENSSFRGLYQFI
RFIDELIERGKDFGEENVVGPNDNVVRMMTIHSSKGLEFPFVIYSGLSKD
FNKRDLKQPVILNQQFGLGMDYFDVDKEMAFPSLASVAYKAVAEKKLVSE
EMRLVYVALTRAKEQLYLIGRVKNDKSLLELEQLSISGEHIAVNERLTSP
NPFHLIYSILSKHQSASIPDDLKFEKDIAQIEDSSRPNVNISIIYFEDVS
TETILDNDEYRSVNQLETMQNGNEDVKAQIKHQLDYQYPYVNDTKKPSKQ
SVSELKRQYETEESGTSYERVRQYRIGFSTYERPKFLSEQGKRKANEIGT
LMHTVMQHLPFKKERISEVELHQYIDGLIDKHIIEADTKKDIRMDEIMTF
INSELYSIIAEAEQVYRELPFVVNQALVDQLPQGDEDVSIIQGMIDLIFV
KDGVHYFVDYKTDAFNRRRGMTDEEIGTQLKNKYKIQMKYYQNTLQTILN
KEVKGYLYFFKFGTLQL
>SAR2101 putative exonuclease
MKKYDIAVLDFETMNEHMNSPCEVAVSLIKDLSIVKVYSSYINPPNNRYN
LKNAKIHKIPEDVILKAPKYPDIYQEILYLLKESHLIIAHNALFDISVLK
NTNNYYDLPVPNFMYVDSINIFRSFHAISSFKLENLCSLYDIDKEKLHSA
KFDVLALSKMLISLAKNNKHYSVLKLIHYMPKQYIRFSKYSNSPTKLFDS
GFQKIHMKISEINKIEVESVIPILKDKNVVFTGNFDTEKQDLMILTRKKG
AYIRSDVTAKTDILVEGVQDDKYKDVNGLVSKQRKAREYVGNGAKIQFLN
EEDLINLIKE
>SAR2251 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRMMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR1958 HhH-GPD superfamily base excision DNA repair protein
MYQQSSFKENLIHWFDENQREMPWRQTTNPYYIWLSEVMLQQTQVKTVID
YYHRFVERFPTVEVLSQASEDEVLKYWEGLGYYSRARNFHTAIKEVHDKY
EGLVPKDPDQFKALKGVGPYTQAAVMSIAYNVPLATVDGNVFRVWSRLND
DYRDIKLQSTRKSYEQELLPYVTTEAGTFNQAMMELGALICTPKNPLCLF
CPVQENCEAFDKGTFEKLPVKSKNVSKKVIEQSVFLIRNNQGQYLLQKRS
EKLLHGMWQFPMFKSEHARREMTEKIGHDIQPVETPIFELKHQFTHLTWK
IKVYAVSGTINIERLPDDMIWFDLSDRDQYTFPVPMSKIYQFING
>SAR2705 putative transposase
MSYNHLTLTERARIEVLRQENYSLRSIARKLKRSVSTISREISRNNLNQS
YQAETAQKNYETKRKLCGRPTRFTPELGNIIKYYLKCHWSPEQIVGRLLQ
NQICFKTIYRWINSNMINFELISCLRQKGKRQKPKETRGKFNIGRPISQR
PKEIKKRNTFGHWEADTIVSSRGKSKGCIATFAERKSRYYYCVLMPDRSS
NSMETAINNLIKHLPKGAVKTITVDRGKEFSCYQNIENQFNINVYFADPY
SAWQRGTNENTNGLLREFFPKKTDLAKVNQEQLNYALDSINYRPRKCLNW
KFPYEVLCDELLHLN
>SAR2085 hypothetical phage protein
MTENNKLQTIEQQLVQEKNVSDNVLNKVRVLESQGNLELPNDYSPSNAMK
QAWLQISQDNKLMSCNDTSKANALLDMVTQGLNPAKNQCYFIPYGNKMQL
QRSYHGNVMMLKRDAGAQDVVAQVIYKGDTFKQEMGETGRIKAIKHEQDF
FNIDKENIIGAYCTIVFNDGRDNYIEVMTIEQIKQAWMQSSMIKDEKALQ
NSKTHNNFKEEMAKKTVINRAAKRYINTSTDSNIFKYAQESEQRQRKEVL
DAEVEENANQEQLDFEQPVLEEAQYTELENDKPIDVSDFEEIKEPATEKE
SEEEPF
>SAR1464 conserved hypothetical protein
MDKYQLKARPVVIRRELLDHYSDLGLDEQDLVILLKLIYASETSNKQPSI
ELLQKGSTMQPRDITMVIQNLIQRELLELQVQKDEEGRFTEYMNLDPFFE
KLSHILKQQSMETKEQNSKEKFKQLFRVLEDTFARPLSPYEIETLNQWID
VDKHDTAIIQAALDEANSLNKLSFKYMDRILLNWKKNNVKTIDDSRKIRE
KFNKPKMTHTVKTVPKFDWLNGENLDGK
>SAR1541 putative DNA polymerase
MRFMNIDIETYSSNDISKCGAYKYTEAEEFEILIIAYSIDGGAISAIDMT
KVDNEPFHADFETFKIALFDPAVKKYAFNANFERTCLAKHFNKQMPPEEW
ICTMVNSMRIGLPASLDKVGEVLRLQNQKDKAGKNLIRYFSIPCKPTKVN
GGRTRNLPEHDLEKWQQFIDYCIRDVEVEMTIAHKIKDFPVTAIEQAYWV
FDQHINDRGIKLSKSLMLGANVLDKQSKEELLNQAKHITGLENPNSPTQL
LAWLKNDQGLDIPNLQKKTVQEYLKEATGKAKKMLEIRLQMSKTSVKKYN
KMHDMMCSDERVRGLFQFYGAGTGRWAGRGVQLQNLTKHYISDTELEIAR
DLIKEQRFDDLDLLLNVHPQDLLSQLVRTTFTAEEGNELAVSDFSAIEAR
VIAWYAKEQWRLDVFNTHGKIYEASASQMFNVPVESITKGDPLRQKGKVS
ELALGYQGGAGALKAMGALEMGIEENELQGLVDSWRNANPNIVNFWKACQ
EAAINTVKSRKTHHTHGLRFYMKKGFLMIELPSGRALAYPKASVGENSWG
SQVVEFMGLDLNRKWSKLKTYGGKLVENIVQATARDLLAISIARLEASGF
KIVGHVHDEVIVEIPRGSNGLKEIETIMNKPVDWAKGLNLNSDGFTSPFY
MKD
>SAR0382 putative terminase small subunit
MSELTAKQARFVNEYIRTLNVTQSAIKAGYSANSAHVTGCRLLKKPHIKQ
YIQEQKDKIIDENVLTAKELLHVLTNAAVGDETETKEVVVKRGEYKENPQ
SGKVQLVYNEHVELIEVPIKPSDRLKARDMLGKYHKLFTDKHDINGDVPI
FINIGEWDGDDEELDKTVKDVSNANPNHTVIVDDIPLED
>SAR1597 putative DNA repair protein
MLQTLSIKQFAIIEELEIQFSDGLTVLSGETGSGKSIIIDAIGQLIGMRA
SSDFVRHGEKKAVIEGIFDIDESKDAIHILKNMDIDVDEDFLLVKREIFS
SGKSLCKINNQTVTLQDLRKVMQELLDIHGQHETQSLLKQKYHLTLLDNY
AESRYQDLLDKYHQTFQNYKAKKQGLEDLESADQALLQRLDLMKFQLEEL
SEAHLKEGEIEQLEIDIKRIQNSEKLSLALNNAHMTLTDENAITDRLYEL
SNHLLTINDIVPNKYDKLKEDIDQFYYILEDAKHELYDEMANTEFDEQVL
NEYESRMNLLNNLKRKYGKDISELIAYQEKLNNEINKIENYEQSTSQLRE
EINALYNQVIEVGQALSKQRRIVARELRDHIVSEIQNLQMKDANLEISFK
KLEEPNIDGIEFVEFLISPNKGEPLKSLNKIASGGELSRIMLALKSIFVK
SRGQTAILFDEVDSGVSGQAAQKMAEKMRDIAEYIQVICISHLPQVASMS
DHHLLISKSSKDDRTTTQVQELIGDDKVDEIARMISGASVTDLTRENARE
MIQHNQRHR
>SAR0026 hypothetical protein
MSDNLSLFIDYINDNIIYGSEIKREKLENLFNQFAIKNVEKNIVYDELKS
LDITIIESQDSYKNKLKRLFSVLLQSKKI
>SAR0091 putative insertion sequence protein
MSFSNKIASAAIIDRFVHHSKVFKITGESNRLKDYKTEKSLNI
>SAR0703 plasmid replication initiation protein
MTGETVVYKNEMNLVPLRRFTATEINLFFAMCNKLKEQDTNTLRLSFDEL
KKLSNYSPETRNINRFANDLDNVYKKMLNLTIRYEDDDVIERFVLFNHYR
IHKREQYLEISTSSNLKHILNSITNNFTKFELKEMTRLKSTYSKNMFRLL
KQYKHTGYLKIHIDDFKNRLDIPKSYRMTDINKNVFKPIIIELGSIFNNL
TINKIKAKKGRKIEWIEFTFDAEKRIHNKRQPQMSKIDKSRQYVRREKTP
KWLEERSYEKQPQKDYDPQLEKEREDFLKQLELNWE
>SAR2574 putative helicase
MSRLLNDFNQSLHKGFIDKHISHKGNYTPKLLVNNKNEKVLSTIIDELQK
CETFYFSVAFITESGLASLKAQLLDLSNKGVKGKILTSNYLGFNSPKMYG
ELLKLKNVEVRLTDIAGFHAKGYIFEHKDYSSMVIGSSNLTSNALKVNYE
HNVLLSTMKNGDLVDSVKNEFDLLWQNSTQLTQQWINSYKESFEYRSLEK
LAEVEQTQMLLADKVKKSVEIVPNLMQAEALRSLKAIRDKAKDKALIISA
TGTGKTILCALDVREVNPNKFLFIVHNEGILNRAKEEFKKVLPTKNDSDF
GLLTGKHRDVDAKYLFATIQTLSRDDNFKQFDEKEFDYIVFDEAHRSAAS
TYQRVFNYFKPKFMLGMTATPERSDELSIFEMFDYNIAYEIRLQAALESD
ILCPFHYFGVTDYVHQGIKEDDVTKLRYLTSDERVNYIIQKTDYYGYSGE
ILQGLIFVSSKKEAYDLADKLSSKGIKSVALTGDDSVNYRQIVIEKLREG
KINYIITVDLFNEGIDIPEVNQVVMLRPTESSIIFIQQLGRGLRKSSNKE
YVTVIDFIGNYKTNYLIPIALSGDQSQNKDNYKKFLTNNDSINGVSTINF
EEVAKKQIYNSLDAVSLNQNKLILKAYEEVENRLGHMPLLMDFIQQHSID
PSVIFSKFSNYYEFLVRYKKIDALLTENESKNLVFFSRQIAPGLKRIDSI
VLEELLKNELTYDELKNKMLNEVKDITEDDIDTSLRILDFSFYNAGIEKI
YGSPIIECNERMIRLSDAFTNALSNQTFKIFLEDLIELSKYNNEKYQKGK
NGLILYNKYSREDFSKIFNWSKNGSSVIMGYMIRSQEMPIFITYDKHEDI
SDSTKYEDEFLSQDELKWFTKSNRTLKSKEVQKILSHRAKGIKMYIFVQK
KDDDGIYFYYLGTAGYIEGSEKQDKMPNGSNVVTMDLALDKAVRDDIYRY
ITN
>SAR0081 transposase
MEFHLIMTRERRLFSSEFKLQMVRLYENGKPRNEIIREYDLTPSPLGKWI
KQYQNTGTFNHQDNLSDEEKELIKLRKEVQHLKMENDILKQAALIMGRK
>SAR1600 putative exodeoxyribonuclease VII small subunit
MTKETQSFEEMMQELERIVQKLDNETVSLEESLDLYQRGMKLSAACDTTL
KNAEKKVNDLIKEEAEDVKNDESTDE
>SAR0629 phage integrase family protein
MNKVEVIKCNDDINKMYDALKARSDRDYLFFKLAIHSGMKITELLTITVE
DTKRLIEKGTLSELCKAHYHSLIKIRLPETLSNELLHYIENKSLSNEDVL
FQSLRTNQVLSRQQAYRIIHQAAIKAGIENVGLTTLRKTFAYHAYQKGIP
IPVIQKYLGHQSAIETLNFIGLENECEHSIYISLQL
>SAR1827 transposase
MKNKEKYLTNFSEAKRKEATQKYNIIKPFILGKQSLSSISKSKGIALSTL
YRWNKLYKEQGLTGLIHNTRVDKGEHKLKQNIIDEIKRLALKNKRNSIAT
IHRKIANYCIENNFYKPSYKQVYSIIKAMPKSVIDFSHQGEKYYQNKYDL
IQIRESSRPNEIWQADHTLLDIYILDQKGNINRPWLTIIMDDYSRAIAGY
FISFDAPNAQNTALTLHQAIWNKNNTNWPVCGIPEKFYTDHGSDFTSHHM
EQVAIDLKINLMFSKVGVPRGRGKIERFFQTVNQTFLEQLPGYINNNDTS
SDLIDFQNFEEKLRYFLIEDYNQKEHSAIQSTPINRWNSNHFFPNMPSSL
EQLDLLLLEIPKSRKIHSDGIHFQGFRYSNTNLTAYVGEYVLIRYNPNDM
AEIRVFYRDEFLCTAISPDLADYSIDIKEIQHARSQRRKHLKQNIASPST
TDLIKEEKSYGYSPQETTKNVKKLKRYRND
>SAR0719 putative resolvase
MKIGYARVSTGLQNLNLQEDRLNAYGCEKIFNDHMSGSKSKRPGLDKAIE
FARSGDTIVVWRLDRLGRNMEDLITLVNELNERGVSFHSLEENITMDKSS
STGQLLFHLFAAFAEFERNLILERSSAGRIAARARGRYGGRPEKLNKQDL
KLLKTLYDNGTPIKTIAEQWQVSRTTIYRYLNKLEDEKSDK
>SAR0928 conserved hypothetical protein
MTLHAYLGRAGTGKSTKMLTEIKQKMKADPLGDPIILIAPTQSTFQLEQA
FVNDPELNGSLRTEVLHFERLSHRIFQEVGSYSEQKLSKAATEMMIYNIV
QEQQKYLKLYQSQAKYYGFSEKLTEQIQDFKKYAVTPEHLESFIADKNMQ
TRTKNKLEDIALIYREFEQRIQNEFITGEDALQYFIDCMPKSEWLKRADI
YIDGFHNFSTIEYLIIKGLIKYAKSVTIILTTDGNHDQFSLFRKPSEVLR
HIEEISNELNISIERQYFNQLYRFNNQDLKHLEQEFDALQINRVACQGHI
NILESATMREEINEIARRIIVDIRDKQLRYQDIAILYRDESYAYLFDSIL
PLYNIPYNIDTKRSMTHHPVMEMIRSLIEVIQSNWQINPMLRLLKTDVLT
TSYLKSAYLVDLLENFVLERGIYGKRWLDDELFNVEHFSKMGRKAHKLTE
DERNTFEQVVKLKKDVIDKILHFEKQMSQAATVKDFATAFYESMEYFELP
NQLMTERDELDLNGNHEKAEEIDQIWNGLIQILDDLVLVFGDEPMSMERF
LEVFDIGLEQLEFVMIPQTLDQVSIGTMDLAKVDNKQHVYLVGMNDGTMP
QPVTASSLITDEEKKYFEQQANVELSPTSDILQMDEAFVCYVAMTRAKGN
VTFSYSLMGSSGDDKEISPFLNQIQSLFNQLEITNIPQYHEVNPLSLMQH
AKQTKITLFEALRAWLDDEIVADSWLDAYQVIRDSDHLSQGLDYLMSALT
FDNETVKLGETLSKDLYGKEINASVSRFEGYQQCPFKHYASHGLKLNERT
KYELQNFDLGDIFHSVLKYISERINGDFKQLDLKKIRQLTNEALEEILPK
VQFNLLNSSAYYRYLSRRIGAIVETTLSALKYQGTYSKFMPKHFETSFRR
KPRTNDELIAQTLTTTQGIPINIRGQIDRIDTYTKNDTSFVNIIDYKSSE
GSATLDLTKVYYGMQMQMMTYMDIVLQNKQRLGLTDIVKPGGLLYFHVHE
PRIKFKSWADIDEDKLEQDLIKKFKLSGLVNADQTVIDALDIRLEPKFTS
DIVPVGLNKDGSLSKRGSQVADEATIYKFIQHNKENFIETASNIMDGHTE
VAPLKYKQKLPCAFCSYQSVCHVDGMIDSKRYRTVDETINPIEAIQNINI
NDEFGGEQ
>SAR2307 putative transposase
MCRILGISRASYYKWVHYQSSELELENEQLKREIESIYHKYNGIYGYRRI
YIYIRLKLGKQVNRKRIYRLMKELNLKAVIRQKRKPYRRSTPQITSENKL
NRQFDIDTPNKVWLTDVTEFKIKEGSKIYLSAIYDLGAKRIVSYELGPSN
NNQLVFKTFNQAIEKVENTKGILFHSDRGFQYTSKTFKHMLDECGMIQSM
SRVSKCIDNGPMEGVWGTIKSEIFRGNKHFKFNSVEEATKTIHDFILFFN
HERITLKMADSV
>SAR0079 putative protein kinase
MNEEILREVADIFIGDDRDSIYDYKTGNELVRFFNHYFNKGDIYQAPFPS
RWLYVVKHLQTLIQERKINQFFTLILSNHYIKYELKIDEVEAAKQAAKAL
KLFNKRLNHYGYYITGTNNARYFMDKDEDTESIGYGGYANIYLQKSTGLA
VKKLKEEYLTDSSIKSRFKREFDLTKSFDTNPLFINVFEFNESDYSYTME
LADETLKDYIESKTISELEKVKIIMKILKAMSQAHSENKIHRDISSKNVL
MFRGKVKISDLGLGKNLDEIHSHQTFDTNGVGQYKYCAPEQMYSLKQADK
QSDVFSLGRLINFIMTGNVVNNHHLFRGVSDKATNSSKEYRFEDANEMLK
MLQRILEYHSSAKHVEKCQEKLKRGVFDDESEEFIMTRSDEQLCQMVLSS
NNNEQACLIRYMQKNESSACDLIESINRKYQEFCGRFEDYDPFAKLAYMI
LCNNFSYRVNETAARVLNYVAWSVNRFSAQDLIKGLINRGVEPLIEEKLK
DN
>SAR0466 MutT domain containing protein
MSKMIKCVCLVEETADKILLVQVRNREKYYFPGGKIEEGESPVHALLREV
KEELNLTLTMDEIEYIGTIVGPAYLQQDMLTELNGFRALTKIDWENVTIN
NEITDIRWIDKDNDALIAPAVKVWIETYDGKHDK
>SAR1601 putative exodeoxyribonuclease VII large subunit
MSDYLSVSALTKYIKYKFDQDPHLQSVLIKGELSNFKKHSSGHLYFNVKD
KESVISAMMFKGSASKLNFEPKEGDEVLLEARVSVFERRGNYQIYVNKMQ
LDGIGNLYQKLEALKKKLTEEGCFDKANKKSIPKFPKKIAVLTASTGAAI
RDIHSTINSRFPLAEQIQISTLVQGEKAKDDIIEKIEYADSLGVDTIIVG
RGGGSIEDLWNFNEEAVVRAIYNCKTPIISAVGHETDFTLSDFAADIRAA
TPTQAAVIATPDQYELLQQIQQYQFTLTRFIKKHLEQQRKHVEHLSSYYK
FKQPTLLYDQQIQRRDDLEKRLKQQIQATFEQQRHRLMLLQQRYNLKALL
SSVNQEQQNNLQLTNQLVKLLNSKILSYKNDLKNKVENLNNLSPTNTMLR
GYAIVNKKDEVITSTKDLTENDQLTLTMKDGLVDAKVTKVRCNND
>SAR1667 putative membrane protein
MYQFLLRYKDFLAQWKLYIISAVVLIMVLIGFIFWRQDDYTSRDFENKDT
VLKQSTSENSSLSKLEDVQVKDGDNSKNKGPVYVDIKGAVKHPNVYKMTS
KDRVVDLLDKAQLLDDADVSQINLSEKLTDQKMIFIPHKGQKNVEPQFGA
NSVHVKNGNTNNTKVNLNTASVSELMSVPGVGQAKANAIVEYRNQQGAFQ
EIDDLKKVKGFGSKTFDKLNSYFTT
>SAR0617 putative DNA repair protein
MLGTDELYKLLYRHMGPQNWWPADNDIEMMLGAILVQNTRWRNAEIALNQ
IKEHTHFNPNHILELPIETLQSLIRPSGFYKSKSLTIKTLLTWLARHHFN
YQEINERYKGELRKELLSLKGIGSETADVLLVYIFGRIEFIPDSYTRKIY
NKLGYENTKSYDQLKKVVTLPNHFTNQDANEFHALLDVFGKHYFRDKDIK
NCDFLEPYFKK
>SAR0416 putative transposase
FKKVGYFITTRERRSFSSEFKLQKVRLYENGKPKNEIIREYDLTTSTFSN
PIKQHQNTGSFNHQDNLKSDEKELIKLRKEVQHLKMENDVLKQILLITRR
NRNHLTECVSIFNIH
>SAR0082 transposase
MGNVLKILRSTYYDSIKRKDNKITKDDSNVERAVINIFNANRKVFATRRI
KNHLNDKGHTVSRRKIGRIMKKYNLVSVYTKAKYKNHPKETNEKLIKNHL
NRAFNREQPMETLVSDLTYVKVAGTWHYICLFIDLFNREIVGYSAGKNKD
ANLVSKAISRINHNLEQIKLFHTDRGKEFDNHLIDEVLETFKIKRSLSTK
GCPYDNAVAEATMKAMKTEFVKQMQFENLEQLKTELFDYVNWYNNFRPHS
SLQYLTPVAFKNLHMKTV
>SAR1375 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRMMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR0075 hypothetical protein
MEKILYQTDEFKLKPSGWYKTIPPKKDGGTEFEIMLSGPIAFTDRFIDPA
TRKEKVFLSDLNNIELVEKASILTALQLPSLIEYGFTINEKHIRDLGFVL
QQMRSTTPLSTIYSGVGMLHTLLGPLISLDQPYFSNEITNSTSIICDNKY
DLIPKGNLSEWLQMYKEEVHGNLSLELDVLFGVSSLVTAFLKYHNNVEFS
GTIFSFTGQSSTGKSTAAMLAASVAGNPTKGTENLFRSWNATRNALEGYL
SGNYGVPIVLDELSAATFHDTTGLLYSFAEGQGRQRANINGDVKTPKN
>SAR1490 DEAD/DEAH box helicase family protein
MLHDILRNKFGFENFKPGQQEIIESIMSQQHTLGILPTGSGKSLCYQIPT
YLSGKPTLIISPLISLMDDQVMQLKINGEKRVTCIHSGMDEIEKKHNIKC
LRHSRFIFLSPEFLLQPSNFKLISMIDFGMIVLDEAHCLSEWGYDFRPHY
ALIGKVTKHFKEAVVLALTATAPPHLQDDLTEMLEIQFNVIKTTMNRPNI
SFKHLNFHDDEDKIEWLLPFLQQSGPTIIYVSSKKMCLNLAQLIYDSGFL
TGIYHGDMNYQERHTVQQQFLNNDIPIIVATSAFGMGINKKDIRTIIHFH
LSTSPSNYIQEIGRAGRDGELSQAISLFQPDDKYILETLLFADMITEEDV
QNFEIGEFLAPDKQDVLTTLHSFYSIGALKQVFKKSFKRKQLGFFRMIGY
CKLDQCRRKYLLEFFGEYPPAQDRCCDNDSNITDIAILNKKKVIRSIGFD
EKLQNLFLR
>SAR0277 putative exported protein
MSKKLKIIIPIIIVLLLIGGIAWGVYAFFANTPKNTYLKSEQQTAKMYKD
YFNDRFENEVKFQEKMKDNSFLSSLELSADASDEIVKGLGIPKSVVNASK
IKMSYGHDPKKEKSMINLEPTIADSALGKFQLAADKDKHYFESPLFKGKY
SVNNSDLLSTYSKLTGEDEETAKENGITNQQLNLNTLFSNAQAQQSDYSK
IAEKYSELIVDKLDDDNFDKGKKEEIKVNGEKYKVRPVTLTLSRADTKKI
TLAVLEEAKKDKDLKKLMEEQGTTKDFEKDIKKAIDDVKETKKDEFAKIQ
SKIYTEKHTIVKREITITDKENNKTKIKGTNTLEDDKLKLDYALDFDQDK
YTYAEAKYTIKGVSSKEKDNKYSDKYEFGKKTEYDESKIKLDNQEKVDGT
KRQDKGKITVALDKYSDENEFTFENNIDSDVKNNTQKSTLNIGIKYAEEP
INFILKSSTKLKADIDFDDSGAKDFNSLSSKDREKLEKEIEKNGGKMFES
ILKKASK
>SAR1456 conserved hypothetical protein
MFQLLAVCPMGLEAVVAREIQELGYETNVENGRIFFEGDASAIVKANLWL
RTADRIKIVVGRFNATTFDELFEQTKALPWESIIDKEGNFPVQGRSVKST
LHSVPDCQAITKKAIVERLRRAYNEKGWLNESGAKYPVEVAILKDNVLLT
IDTSGSGLNRRGYRLAQGEAPIKETLAASLIRLANWKGDTPLIDPFCGSG
TIAIEACLIAQNIAPGFNREFVSEQWNIMPANIYDDYRDEADKMADYDKE
IEVYASDIDPEMVEIAKRNAEEVGLSDIIKFSVKDVNTLTIDTEEPVALI
GNPPYGERIGDREEVEEMYRYIGKLMKQHPFLSTYILTSNKEFEYLVDRK
ATKRRKLFNGYIECTYYQYWGKKTERKTIEN
>SAR0726 putative transposase
MNYFRYKQFDKDVITVAVGYYLRYALSYRDISEILRERGIYVHHSTIYRW
VQEYAPILYQIWKKKNKQAYYKWHIDETYIKIKGQWNYLYRAIDADGHTL
DISLRKKRDNHSAYTFIKRLIKQFGKPQMIITDQAPSTKVAMSKVIKDFK
LTPNCHCTSKYLNNLIEQDHRHIKVRKISYQSINTAKNTIKGIECIYGLY
KKNRRSLQIYGFSPCRVISIMLAS
>SAR0197 hypothetical protein
MDNRNMINRVFSQKILHQIAIKNKSDVVDEAYDFYIQGPKNINVIQKMKS
LYNYLKKSYRNEYFYKNTMLNKLLLGLHSVNTTTALSEMPIGNSIADFIL
LNGKGVVYEIKTELDKLDRLDNQINDYYEVFNYVVVITNDKHLNKVMARY
KDTTVGILVLTSRNTLSEVQKPKENNSLLNSKAMYNFLRKEERKRVIAQN
HMDVPTYNDFTEYDVLFDVFKEIPMTKLHNNMIFELKKRGNMKEYKDEFL
AAPTEIKFLLYFAKMTKKDKNKLYHFLKDTNNPPHFT
>SAR2168 putative helicase
MQNFKELGISDNTVQSLESMGFKEPTPIQKDSIPYALQGIDILGQAQTGT
GKTGAFGIPLIEKVVGKQGVQSLILAPTRELAMQVAEQLREFSRGQGVQV
VTVFGGMPIERQIKALKKGPQIVVGTPGRVIDHLNRRTLKTDGIHTLILD
EADEMMNMGFIDDMRFIMDKIPAVQRQTMLFSATMPKAIQALVQQFMKSP
KIIKTMNNEMSDPQIEEFYTIVKELEKFDTFTNFLDVHQPELAIVFGRTK
RRVDELTSALISKGYKAEGLHGDITQAKRLEVLKKFKNDQINILVATDVA
ARGLDISGVSHVYNFDIPQDTESYTHRIGRTGRAGKEGIAVTFVNPIEMD
YIRQIEDANGRKMSALRPPHRKEVLQAREDDIKEKVENWMSKESESRLKR
ISTELLNEYNDVDLVAALLQELIEANDEVEVQLTFEKPLSRKGRNGKPSG
SRNRNSKRGNPKFDSKSKRSKGYSSKKKSTKKFDRKEKSSGGSRPMKGRT
FADHQK
>SAR2339 putative DNA topoisomerase
MKSLILAEKPSVARDIADALQINQKRNGYFENNQYIVTWALGHLVTNATP
EQYDKNLKEWRLEDLPIIPKYMKTVVIGKTSKQFKTVKALILDNKVKDII
IATDAGREGELVARLILDKVGNKKPLRRLWISSVTKKAIQQGFKNLKDGR
QYNDLYYAALARSEADWIVGINATRALTTKYDAQLSLGRVQTPTIQLVNT
RQQEINQFKPQQYYTLSLTVKGFDFQLESNQRYTNKETLEQIVNNLKNVD
GKIKSVATKHKKSYPQSLYNLTDLQQDMYRRYKIGPKETLNTLQSLYERH
KVVTYPRTDSNYLTTDMVDTMKERIQATMATTYKDQARPLMSKTFSSKMS
IFNNQKVSDHHAIIPTEVRPVMSDLSNRELKLYDMIVERFLEALMSPHEY
DAITVTLEVAGHTFVLKENVTTVLGFKSIRQGESITEMQQPFSEGDEVKI
SKTNIREHETTPPEYFNEGSLLKAMENPQNFIQLKDKKYAQTLKQTGGIG
TVATRADIIDKLFNMNAIESRDGKIKVTSKGKQILELAPEELTSPLLTAQ
WEEKLLLIERGKYQAKTFINEMKDFTKDVVNGIKNSDRKYKHDNLTTTEC
PTCGKFMIKVKTKNGQMLVCQDPSCKTKKNVQRKTNARCPNCKKKLTLFG
KGKEAVYRCVCGHSETQAHMDQRMKSKSSGKVSRKEMKKYMNKNEGLDNN
PFKDALKNLNL
>SAR0066 putative transposase
MKRVSYSVETKYKAVEMKAAGFSTKEIMKELNIKNRTQVETWWRWYRNGE
SYRFSQHVGKQYTYGKGLEELSEVEQLKLENKRKDIELDILKKYKALERK
WYQQ
>SAR0504 putative transcription-repair coupling factor
MTILTTLIKEDNHFQDLNQVFGQANTLVTGLSPSAKVTMIAEKYAQSNQQ
LLLITNNLYQADKLETDLLQFVDVEELYKYPVQDIMTEEFSTQSPQLMSE
RIRTLTALAQGKKGLFIVPLNGLKKWLTPVEMWQNHQMTLRVGEDIDVDQ
FLNKLVNMGYKRESVVSHIGEFSLRGGIIDIFPLIGEPIRIELFDTEIDS
IRDFDVETQRSKDNIEEVDITTASDYIITEEVISHLKEELKTAYENTRPK
IDKSVRNDLKETYESFKLFESTYFDHQILRRLVAFMYETPSTIIDYFQKD
AIIAVDEFNRIKETEESLTVESDSFISNVIESGNGFIGQSFIKYDDFETL
IEGYPVTYFSLFATTMPIKLNHIIKFSCKPVQQFYGQYDIMRSEFQRYVN
QNYHIVVLVETETKVERMQAMLSEMHIPSITKLHRSMSSGQAVIIEGSLS
EGFELPDMGLVVITERELFKSKQKKQRKRTKAISNAEKIKSYQDLNVGDY
IVHVHHGVGRYLGVETLEVGQIHRDYIKLQYKGTDQLFVPVDQMDQVQKY
VASEDKTPKLNKLGGSEWKKTKAKVQQSVEDIAEELIDLYKEREMAEGYQ
YGEDTAEQTTFELDFPYELTPDQAKSIDEIKDDMQKSRPMDRLLCGDVGY
GKTEVAVRAAFKAVMEGKQVAFLVPTTILAQQHYETLIERMQDFPVEIQL
MSRFRTPKEIKQTKEGLKTGFVDIVVGTHKLLSKDIQYKDLGLLIVDEEQ
RFGVRHKERIKTLKHNVDVLTLTATPIPRTLHMSMLGVRDLSVIETPPEN
RFPVQTYVLEQNMSFIKEALERELSRDGQVFYLYNKVQSIYEKREQLQML
MPDANIAVAHGQMTERDLEETMLSFINNEYDILVTTTIIETGVDVPNANT
LIIEDADRFGLSQLYQLRGRVGRSSRIGYAYFLHPANKVLTETAEDRLQA
IKEFTELGSGFKIAMRDLNIRGAGNLLGKQQHGFIDTVGFDLYSQMLEEA
VNEKRGIKEPESEVPEVEVDLNLDAYLPTEYIANEQAKIEIYKKLRKTET
FDQIIDIKDELIDRFNDYPVEVARLLDIVEIKVHALHSGITLIKDKGKII
DIHLSVKATENIDGEVLFKATQPLGRTMKVGVQNNAMTITLTKQNQWLDS
LKFLVKCIEESMRISDEA
>SAR0085 putative transposase
MSYNHLTLTERARIEVLRQENYSLRSIARKLKRSVSTISREISRNNLNQS
YQAETAQKNYETKRKLCGRPTRFTPELGNIIKYYLKCHWSPEQIVGRLLQ
NQICFKTIYRWINSNMINFELISCLRQKGKRQKPKETRGKFNIGRPISQR
PKEIKKRNTFGHWEADTIVSSRGKSKGCIATFAERKSRYYYCVLMPDRSS
NSMETAINNLIKHLPKGAVKTITVDRGKEFSCYQNIENQFNINVYFADPY
SAWQRGTNENTNGLLREFFPKKTDLAKVNQEQLNYALDSINYRPRKCLNW
KFPYEVLCDELLHLN
>SAR1716 putative single-stranded-DNA-specific exonuclease
MIKPKYKWKLTKPAEYISDELTSKLKLTPIVKKILESKSIIDEQAIESII
SDTDINHDALQLSDMTKTIERIKRAIANDEKILVYGDYDADGVTSTTILV
TTLQLLGAQVGWHIPNRFTEGYGPNELAFRNAHDEGITLIITVDNGIQGH
NEIKMVQDLGVDVIVTDHHEIGSTLPEAYAIVHPMHPSFNYPFQQLCGAG
VAYKLAQALIENVPDYFKALVAIGTIADLVSLTDENRSLVKQGLKVLNDQ
CPTSVKALLKEAGYNDNIDEETIGFIIGPRLNAVGRLDDASLACELLMTD
DEEEAAFLAEQVEQFNRERKDIVATITEEAMAMAEMKVKNGDLFLLLAKE
NWHEGVLGIVASKIVETFALPTLILNIDREQNHAKGSARSIDQVSMFEIL
SAHQELIAKFGGHHMAAGMTMDIENIESLAEGLNKWMKELSKTTSLDPVK
PVDVLLTENDITIKNIRDMNRLRPFGTDFSRPIFEMDDLSVSSVKAIGQQ
KNHLKLTLGESNIAALFWQNGHLEPELQDEQPINILGSVQINEWNGNQSP
QLIIQDIAMNEQQILDYRSKRKSLPFTENDENIVVLIHPKSDKVNANEYY
YGEEIKQQTDKVVLRDLPTSMEDLSNSLQQLQFSQLYIVLQHNHSIYFDG
IPNMDVFKKCYKALITKQETNIQKEGMLLCQHLSVKPDTLKFMLKVFLDL
KFVTQEDGLIRINQQPDKRSIDSSKVYQLRQQRMDVEKQLLYQDFSEIKN
WIKSQLS
>SAR1357 putative exonuclease
MKPLHLKLNNFGPFLKEEIDFSKIDNNELFLISGKTGSGKTMIFDAMTYA
LFGKASTEQREENDLRSHFADGKQPMSVTFEFQLNNRIYKVHRQGPYIKE
GNTTKTNAKFDVFEMVDGKYEIRESKVISGTQFIIELLGVNADQFRQLFI
LPQGEFKRFLISNSREKQGILRTLFDSEKFEAIREILKEEVKKENAQIEN
RYQQIDLLWQEIESFDDDKIKGLLEVATQQIDKVIENIPLLQARSKEILA
FVNESKETAIKDYEIIEKKTLENNILKDNINQLNKNKIDFVQLKEQQPEI
EEIEAKLKLLQDITNLLNYIENREKIETKIANSKKDISETNNKILNLECD
KRNIDKEKRMLEENGDLIESKISFIDKTRVLFNDINKYQQSYLNIERLRT
EGEQLADELNNLIEGLEKVEDSIGNNESDYEKIIELNNAITNINNEINVI
KENEKAKAELDKLLGSKQELENQINEETSTLKNLEIKLDRYDKSKLDLND
KESFISEIKSAVKIGDQCPICGNEIQDLGHHIDFDSIAKRQNEIKEIEAN
IHTMKSNIAVHNSEIKFVNEKISNINIKTQSDFSLEVLNKRLLENENALN
NQRELNKFIEQMKEEKDNLTLQIHNKQLRLNKNESELKLCRNLITEFETL
SKYNNITNFEVDYKKYVQDVNQHQEHSNQIEDKLIQLSQRKLIEQNNLNH
YEKQLETYNNDLELNEQSIEMEMSRLNLTDNNDINEIIAWRGEQEELEQK
RDTYKKRYHEFEMEIARLESLTKDKELLDSDKLKDEYELKKGKMNTLIDE
YSAVHYQCQNNINKTQSIVSHINYLNQELKDQQEIFQLAEIVSGKNNKNL
TLENFVLIYYLDQIIAQANLRLATMSDNRYQLIRREAVSHGLSGLEIDVF
DLHSNKSRHISSLSGGETFQSSLALALGLSEIVQQQSGGISLESIFIDEG
FGTLDQETLETALDTLLNLKSTGRMVGIISHVSELKNRIPLVLEVKSDQY
QSSTRFKRN
>SAR0794 putative lipoprotein
MKKIVIIAVLAILFVVISACGNKEKEAQHQFTKQFKDVEQKQKELQHVMD
NIHLKEIDHLSKTDTTDKNSKEFKALQEDVKNHLIPKFEAYYKSAKNLPD
DTMKVKKLKKEYMTLANEKKDAIYQLKKFIGLCNQSIKYNEDILDYTKQF
EKNRYKVESEIKLADNKSEATNLTTKLEHNNKALRDTAKKNLDDSKENEV
KGAIKNHIMPMIEKQITDINQTNISDKHVNNARKNAIEMYYSLQNYYNTR
IETIKVSEKLSKVDVDKLPKKGIDITHGDKAFEKKLEKLEEK
>SAR0027 putative transposase
MNYFRYKQFNKDVITVAVGYYLRYTLSYRDISEILRERGVNVHHSTVYRW
VQEYAPILYQIWKKKHKKAYYKWRIDETYIKIKGKWSYLYRAIDAEGHTL
DIWLRKQRDNHSAYAFIKRLIKQFGKPQKVITDQAPSTKVAMAKVIKAFK
LKPDCHCTSKYLNNLIEQDHRHIKVRKTRYQSINTAKNTLKGIECIYALY
KKNRRSLQIYGFSPCHEISIMLAS
>SAR2627 putative 6-O-methylguanine DNA methyltransferase
MVYKSYYDSPVGRLELVSDGVSLTAVLFENQQDDGTREENTSLAIFKEAT
QWLDAYFKGDNPEITIPLKPTGSHFQQCVWNELRQIPYGTLTTYGAIAKK
VGKLLDKPKMSAQAVGGAVGSNPLSIIVPCHRVVGKTGSLTGFGGTINNK
IKLLELENIDMSKLYVPKHSTKP
>SAR1935 putative DNA repair exonuclease
MVKFIHCSDLHLDSPFKSKSHISPKIFEDVQKSAYESFKNIVDIALQQDV
DFVIIAGDLFDSENRTLRAEIFLKQQFERLQNEQIFVYVCHGNHDPLSSK
ISSNWPDNVSVFSNKVETYEAITKSGETIYIHGFSYENRASYENKIDEYP
SSQGQKGIHIGVLHGTYSKSSVNERYTEFILEDLNSKLYHYWALGHIHER
QQLSDMPVINYSGNIQGRHFNEQGEKGCLLIEGDHLKLKTKFYPTQYIRF
EEATIETDKTSKQGLYEVIQNFKEQVREEGKAFYRLTLVINSETLISPQD
LLQVEEMITDYEENENQFVYIDELKIQYAQNDESPLVNEFSAELLVDQTV
FDKAMSDLYLNPRASKFLDDYGTFDHTALVNRAEEILKAEMRGEQNDN
>SAR1225 SMF family protein
MIRLFLLKLYWAHFSTKQIHQFLMAYPNVIKEGGRKKDSYLCEWVNREEN
VHLLRKYYAFIKLDHNDIIKELQKLKVSYITYMDTEYPVLLKEIYQFPLL
LFYRGNIKLINNMHHLAVVGARDSTSYTQQSLEFLLSNDKSKYLTIVSGL
AQGADAMAHQIALKYNLPTIAVLAFGHQTHYPKSTLALRNKIEEIGLVIS
EYPPHTPIAKYRFPERNRIISGLSKGVLITEAKEQSGSHITIDFALEQNR
NVYVLPGSMFNPMTKGNLLRIQEGAKVVLNANDIFEDYYI
>SAR0774 putative ATP-dependent DNA helicase
MMQQTLSHYFGYETFRPGQEEIISKVLDHRNVLGVLPTGGGKSICYQVPG
LLLGGTTIVISPLISLMKDQVDQLKAMGIQAAFLNSSLTQKEQQRIEKAL
SNGEIQFLYVAPERFENRYFLNMLQRIKIHLVAFDEAHCISKWGHDFRPS
YQNVISKVFTLPQDFTIIALTATATVEVQQDIREKLNIAQTDQIKTSTKR
RNLIFKVNPTYQRQKFILDYIKTHDEDAGIIYCSTRKQVEELQEALESQK
IESVIYHAGLSNKEREEAQNDFLFDRVKVVVATNAFGMGIDKSNVRFVIH
YNMPGDLESYYQEAGRAGRDGLKSECILLFSERDINLHEYFITVSQADDD
YKDKMGEKLTKMIQYTKTKKCLEATIVHYFEPNEKLEECEQCSNCVQQDK
SYNMTQEAKMIISCIARMKQQESYSVIIQVLRGESTDYIKYKGYDQISTH
GLMKGYTTSELSHLIDELRFKGFLNENDEILMCDTSIKKLLSNEVEVFTT
PFKQKATEKVFINTVEGVDRVLFSQLVEVRKKLSDKLTIAPVSIFSDYTL
EEFAKRKPASKQDMINIDGVGSYKLKHYCPAFLETIQNYKAKV
>SAR0492 putative TatD related DNase
MLIDTHVHLNDEQYDDDLSEVITRAREAGVDRMFVVGFNKSTIERAMKLI
DEYDFLYGIIGWHPVDAIDFTEEHLEWIESLAQHPKVIGIGEMGLDYHWD
KSPADVQKEVFRKQIALAKRLKLPIIIHNREATQDCIDILLEEHAEEVGG
IMHSFSGSPEIADIVTNKLNFYISLGGPVTFKNAKQPKEVAKHVSMERLL
VETDAPYLSPHPYRGKRNEPARVTLVAEQIAELKGLSYEEVCEQTTKNAE
KLFNLNS
>SAR1695 conserved hypothetical protein
MLQHKILGLDVGSRTVGIAISDIMGWTAQGLDTLRINEENNELGIDQLVD
IIKKHNVGTVVIGLPKNMNNSIGFRGEASLTYKEKLLEAYPSIEIVMWDE
RLSTMAAERSLLEADVSRQKRKQVIDKMAAVFILQGYLDSLH
>SAR0435 exotoxin
MKLKNIAKASLALGILTTGMITTTAQPVKASEQSRLSVTSNDTQELKKYY
SGTGYNFQNVSGYREKDKMNIIDGTQLNVVTLLGTDKERFKDYDYDYEGL
DVFVVREGSGKQAENISIGGITKTNKNDYKDFVNNVGLEITKPTGHNTAT
RQAETYRINKEEISLKELDFKLRKHLIENHELYKTEPKDGKIRITMKGGG
YYTFELNKKLQPHRMGDVIDGRNIEKIEVDLY
>SAR0932 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRLMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSAIFNTQNWKYDELNDEFICPNNKRIG
FKRYAYLNDRYGFKRDFKLYECDDYSACSLRQQCMKPNSKSNKKSMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVEPVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIVARRAVYYQIHLKKADFYQIINRNQLFYIA
>SAR1170 putative transposase
MSYNHLTLTERARIEVLRQENYSLRSIARKLKRSVSTISREISRNNLNQS
YQAETAQKNYETKRKLCGRPTRFTPELGNIIKYYLKCHWSPEQIVGRLLQ
NQICFKTIYRWINSNMINFELISCLRQKGKRQKPKETRGKFNIGRPISQR
PKEIKKRNTFGHWEADTIVSSRGKSKGCIATFAERKSRYYYCVLMPDRSS
NSMETAINNLIKHLPKGAVKTITVDRGKEFSCYQNIENQFNINVYFADPY
SAWQRGTNENTNGLLREFFPKKTDLAKVNQEQLNYALDSINYRPRKCLNW
KFPYEVLCDELLHLN
>SAR1634 putative endonuclease
MLLGSHVSMSGKKMLEGSAIEAHEYGETTFMIYTGAPQNTRRKSIEDLNI
TKGHEVMEKYGLSNIVVHAPYIINIANTTKPETFNLGVDFLQQEIERTQA
IGAKDIVLHPGAHVGAGVDAGINKIIEGLNEVLTNDNNVRIALETMAGKG
TEIGRSFEELARIIDGVHNNERLSVCFDTCHTHDAGYNVKEDFDGVLNEF
DKIIGVDRIKVVHVNDSKNDRGAQKDRHENIGFGYIGFDALNYIVHHDSF
KDIPKILETPYVGEDKKNKKPPYKLEIEMLKQQHFDPELKNKVMQQ
>SAR0103 hypothetical protein
MSQLLNDTLSAWLLIESLSPGEVNFTAEDILSAENFKNGAKQAQLQSFDE
YFEIWNDERFIISEEKSEIGEIIFKFYRHCFRYNEINLKIQDIFDDYSDI
HNPNGTHCYGYTFNIDKHGQVIVDSIHIPMIMSALKEIEKNKNANIEEKF
NDSVEKFVQKVKEILADEPINEFKLKKMDIAYDEYFSVLNSKKDGLFAHY
VAIEYVKDSDLPQPEFNSFFISDIEKARKSPNQTLIDYIEGVEESKRIEV
DENKEMFDKFLHPSRLPDGRWPSQTEFRLSLMQQLAVNQITSGNERISSV
NGPPGTGKTTLLKDIFAHLVVERGKELAKLNNPKDAFVKTKTHETDDKYV
YLLKESIAKYKMVVASSNNGAVENISKDLPKIEEIIRNPEKCKFPKYEQN
YANLAHELKDFAEIAEDLIGESAWGLFSGVFGKSTNINQVLSHMLKQDAN
DIGFAKLLQNENNRMSYNELMSEWQSHQRAFLEELRHVEMLKEESIRAYD
VYKNCESYSKVEHELNSKKMNVKEKLNHLEIQISCDNKEIEDLDDRINYN
TKQLETLNELIKSIRDSNKGFVNKLKAIFNSEEDERYKKHNAEKQQLLGQ
QIELEKCKKIKNEDLVSKLKEKEKLIKQLTKVQLQLDELNSQLQELEAYR
IESKITIPEKDFWSDNNYDERQVTNLWTSDELQYRRAMLFLRAMILHKLL
LIANNTTIYYAINDFKSRRKLIDANPDKVHNAWNVMHLIFPVVSTTFASF
KSMYGGIPKDFIDYLFIDEAGQAIPQAAVGALFRSKKVVAVGDPIQIEPV
VTLESHLIDNIRKNYNVPEYLVSKEASVQSVADNANQYGFWKSDATDSNQ
KTWIGIPLWVHRRCLKPMFTIANQIAYNNKMVLPSNITNVGKTGWYDVKG
NSVQKQFVKEHGEKVLELLANDWFEAIKEGKNEPSSFVISPFSAVQKQVK
RILKQQLPTRIDIERTKINQWVDKSIGTVHTFQGKEAQKVYFVIGTDNTQ
DGAVNWSCEKPNLLNVAVTRAKKEFYVIGDMQRIQSKPFYETIFKERNVK
>SAR1643 putative recombination protein O
MLMRQKGIIIKAVDYGESDKIITILNEHGAKVPLMARRAKKVKTGLQAQT
QLFVYGLFIYNKWRGMGTLNSVDVISQHYKLQMDLYVSSYASLAAETIER
SMDEGDVAPYNYQLLQFVLEKIESGTSAQLMSVVVMLKCMKRFGFTASFN
RCAVSGNDTQADLIGYSFKYDGAISRQEASKDVHAVILSNKTLYLLDVLQ
KLPIDKMNSLNIHQEIIDEMSDIILMLYREYAGMFFKSQKLINQLKRLEQ
>SAR0983 hypothetical protein
MDISLNFNHYIDLKKSIKNLCLMKKFEPYVTNENYDFIEMPLVVDGNYVL
LKIYQEKDALIKLHYQEVLNSNRKSVIQESDIVRISRSILNEALDINSNI
KYKSDTFGLTIISDADNYTTLIRTILAQQVSIEQSNKLFLLFINTYGEIF
ELNNKEYTFLNWDKIFSRIYKEQKVLVGTTNNRKQAIITLAELHIKNGLR
ENVLQVISSIKGIGSWTVQMFMLFQFPHVAFENIPVADIGLHRAIEKNHD
LPHKSIDKSKGESYFQNMKSIDVFKYWYIYMVE
>SAR1452 putative 5'-3' exonuclease
MPNKILLVDGMALLFRHFYATSLHKQFMYNSQGVPTNGIQGFVSHIFSAI
HEIRPTHVAVCWDMGQSTFRNDMFDGYKQNRSAPPEELIPQFDYVKEISE
QFGFVNIGVKNYEADDVIGTLAQQYSTDNDVYIITGDKDLLQCINDNVEV
WLIKKGFNIYNRYTLHRFNEEYALEPQQLIDIKAFMGDTADGYAGVKGIG
EKTAIKLIQQYQSVENVVENIDALSAGQRNKINDNLDELYLSKRLAEIHT
QVPIDSEALFEKMSFATTLNHILSICNEHELHVSGKYISSHF
>SAR1113 putative ribonuclease
MANIVFKLSDKDITTLMSRISFDTENLPQGMKARAKYQNTTVNIYQSGKV
MFQGNHAEAVSEELLPQHSQLNTNKTKKKNTANSSLEQTLMYDQFNCIGS
DEAGSGDYFGPLTVCAAFVTKEHVPILKTLGVDDSKKLTDTKIVELAEQL
VTFIPHSLLTLHNEKYNIQQAKGWTQVKMKAVLHNEAIKNVLEKIDSSQL
DYIVIDQFAKREVYSHYALSDIPLPKKTKFETKGESKSLAIAVASIISRY
AFITYMDQISKNINMTIPKGAGAKVDVIAAKIIKNYGISRLDSISKKHFK
NREKAQKILKPL
>SAR1664 conserved hypothetical protein
MSDNIVAIYGDVPELVEKQSAEIISQFLKSDRDDFNFVKYNLYETEIAPI
VEETLTLPFFSDKKAILVKNAYIFTGEKSPKDMAHNVDQLIEFIEKYDGE
NLIVFEIYQNKLDERKKLTKTLKKHARLKKIEQMSEEEIKKWIQSKLNEN
FKDIKRDALDLFIELTGINFNIVSQEIEKLILFLGDRPTINKQDVNQIIN
RSLEQNVFLLTEYIQKRKKEQAIHLVKDLITMKEEPIKLLALITSNYRLF
YQCKILSQKGYSGQQIAKTIGVHPYRVKLALGQVRHYQLDELLNIIDACA
ETDYKLKSSYMDKQLILELFILSL
>SAR1962 conserved hypothetical protein
MEQKKLSEMSEPELRHEIQLYKEKMRKAEMNGILNEYDVYQSKVIVAESY
LVDRKKIEIGKIYKLTDGSNQYFKVERLKGIFAWGFRFNSDEPEEGLPIA
LLQL
>SAR2135 putative membrane protein
MSTNQTFLIFVIAIIILTSVIGIVGRYMSRQRLLKSMETLWQTISPLESF
IRPNSHFDYEYKLYKDKFESHSLVDDKTWSDLNMNAIFHKMNYNLTAIGE
MKLYACLRGMLSITNKSLLSLFNDNAEFRKNVTYHLALIGKTVYPTFPDQ
ITPVKRHNILFLCPFLPVISFAVIFINSQVGILLFLMSCLFNIILSATLK
RTYEDDLKSIFYASNVLKQGYTISKIKHAPQPEVNFKQFRTARHLTSVLA
EVNDEDIGAMVIKLVKLIFMLDYVLFHSIQKSYTTHINELKNCFDYIAEL
DNHYALAMYRRTLECYTEPQIDDSNDGIVFSELTHPLIADAVANDFSLSQ
NILLTGSNASGKSTFMKSIAINIILASAIQTVTASKFVYQPGIVFTSMAN
ADDVLSGDSYFMAELKSIKRIVEIPDNQKIYCFIDEIFKGTNTTERIAAS
ESVLSFLHEKSNFRVIAATHDIELAELLKQRYENYHFNEVIENNNIHFDY
KIKPGKANTRNAIELLKITSFPAKIYERAKDNVSNG
>SAR0722 putative transposase
MYKNYNMTQLTLPIETSVRIPQNDISRYVNEIVETIPDSEFDEFRHHRGA
TSYHPKMMLKIILYAYTQSVFSGRRIEKLLHDSIRMMWLAQNQTPSYKTI
NRFRVNPNTDALIESLFIQFHSQCLKQNLIDDNSIFIDGTKVEANANRYT
FVWKKSIQNHESKLNENSKALYRDLVEEKIIPEIKEDGDSDLTIEEIDLI
GSHLDKEIEDLNHSIQNEDCTQIRKQTRKKRTEIKKFKKKFDDYSERKSK
YEEQKSILKDRNSFSKTDHDATFMRMKEDHMKNGQLKPGYNLQIATNSQF
VLSYDLFQNPTDTRTLIPFLTMIQNTFGYLPEYIVADAGYGSEQNYMAII
DDFNKTPLITYGMFIKDKTRKFKSDIFNTQNWKYDELNDEFICPNNKRIG
FKRYANRNDRYGFKRDFKLYECDDCSACSLRQQCMKPNSKSNKKIMKNYN
WEYFKAQINQKLSEPETKKIYSQRKIDVESVFGFMKAILGFTRMSVRGIN
KVKRELGFVLMALNIRKIAAQRAVHYKINIKKADFHQIINRNQLFYIA
>SAR0744 putative DNA photolyase
MAIAVLLNRMFRMEHNPLFEYIYQQKEDIDACYFIIPEEDMSSASDLKAQ
FYRGTLQRFYQSLHAEKLTPYVMSYDDIISFCKENNISEVVIAGDIMSYH
LEEYDILHQRSLFNEARIAVTLIRGNHYFKASKTMNQQGEPYKVFTSFSK
KWRPYLRHRDVYHYDLKSFEDFVIASPDDLVFDDIAFGSSQIIEQNKWQH
FLDQDIQNYESGRDYLPEVLTSQLSVALAYGLLDIIEIFNDLLARYDEDE
ANYEAFIRELIFREFYYVLMTQYPETSYQAFKSKYRQIKWSQNEADFNAW
CEGQTGFPIIDAAIMELTQTGFMHNRMRMVVSQFLTKDLFIDWTWGEKFF
RKHLIDYDAASNIHGWQWSASTGTDAVPYFRMFNPIRQSERFDPKALYIK
TYLPIFNQIDAKYLHDTQRNESNLFELGIELGRHYPRQMVDHQEKRTQVL
AAFKALD
>SAR0403 putative DNA-binding protein
MLTKEFAQRVELSEKQVRKIVQHLEERGYQLSKTEYRGREATDFKEEDIE
LFKDIADKVKQTNSYDLAFDELEKEKDFLQVIVKNDDKNLPTNQNVAQLV
EDLRLEIQKMREERHLLGQMMNQVHQQQQELKELQNQLTTKIDSNSESLK
AIQTSQEAIQEAQASQAKVLAESSNKVENNTATDEKAESKDSKVAGVNTS
TDAKTDTKAENAGDGTATKVDKEDQISATEAIEKASVEQPKNDKAAETSN
KEATVDAEAQHDAEQQVAEAHAEASKQATSNDSLEAKAENDSTASQSEMS
EPKPQEEKKGFFARLFNL
>SAR1459 conserved hypothetical protein
MNDVVESLIYEVNNMQQNFENVKSQQQDHDFYQTVKPYTEHIDSLLNEIK
LHREFIIEVPYMNSRKFELLIANIEQLSVECHFKRTSRKLFIEKLKSVQY
DLQNILDGVTKEGTYG
>SAR0519 putative insertion element protein
MCRILGISRASYYKWVHYQSSELELENEQLKREIESIYHKYNGIYGYRRI
YIYIRLKLGKQVNRKRIYRLMKELNLKAVIRQKRKPYRRSTPQITSENKL
NRQFDIDTPNKVWLTDVTEFKIKEGSKIYLSAIYDLGAKRIVSYELGPSN
NNQLVFKTFNQAIEKVENTKGILFHSDRGFQYTSKTFKHMLDECGMIQSM
SRVSKCIDNGPMEGVWGTIKSEIFRGNKHFKFNSVEEATKTIHDFILFFN
HERITLKMADSV
>SAR1446 conserved hypothetical protein
MAKINFDAATKGNPGISTCAIVIKEDEQHYTYTHELGEMDNHTAEWAACI
YALEHARELNVSNALLYTDSKLIADSVNAGYVKNAKFKPYFDQLEIFEQD
FDLLFVKWIPREQNKEANQHAQQALYKFIKKNK
>SAR1985 putative exonuclease
MIQDAFVALDFETANGKRTSICSVGMVKVIDSQITETFHTLVNPQDYFSQ
QNIKVHGIQPEDVENAPTFDYVFPYMMQFIADLPVVAHNAAFDMNVLHQS
IQNIGLPTPNLTYFCSYQLAKRTVDSYRYGLKHMMEFYQLDFHGHHDALN
DAKACAMITFRLLKNYENLTYVTNIYGKNLKDKG
>SAR0060 ccrA, site-specific recombinase
MKQVIGYLRQSTMKQQSLAAQKQAIEAITEKHHIQHINFYSDKQSGRKDN
RSGYRQITQLIQQGQCDILCCYRLNRLHRNLKNALKLIKLCQTYRVHILS
VHDGYFDMDQAFDRFKLNIFISLAELESDNIGEQVRNGLQEKAKQGRLIT
THAPFGYEYHNGTFIINQNESPTVKAVFNYYIKGHGYKKIAQLLEEDNTY
INRQPYQVRNIIINPNYCGRVNNQYGQFDNMFPSIVSTSIYEQAQRLRLQ
KQTKQTPSDNQLKQKIKCPCCNATLTNMTIRKKNHTLRYYVCPKNMNASR
FVCDFKGINAQTLEDKVLEVCRDFYQNQRIYTKIKSAIDKRIKRQRNIEK
HHTLTQKQLIEKLAQGIIDAETFREQTQSLRQQPQRTTSINGHQIQNTIQ
NIIQKRFTLNMLYPYIDEILITKSKTLMGIYFKNEPLNIVNQTTQSSIA
>SAR0059 ccrB, site-specific recombinase
MQQLKTKRVGIYVRVSTEMQSTEGYSIDGQINQIKEYCDFHHFEVKDIYA
DRGISGKSMNRPELQRILKDAKDGYIDCVMVYKTNRLARNTSDLLKIVED
LHKQNVEFFSLSERMEVNTSSGKLMLQILASFSEFERNNIVENVFMGQTR
RAQEGYYQGNLPLGYDKIPNSKHELMINQHEANIVKYIFESYAKGHGYRK
IANALNHKGYVTKKGKPFSISSITYILANPFYIGKIQFAKYKDWSEKRRK
GLNDKPVIAEGKHSPIINQDLWDKVQMRKKQVSQKPQVHGKGTNLLTGII
HCPQCGAPMAASNTTNTLKDGTKKRIRYYSCSNFRNKGSKVCSANSVRAD
VIEDYVMKQILEIVKSDKVIQRVVTHVNQENQVDGAALHHDIAYKQQQYD
EVQIKLNNLIKTIEDNPDLTSVIRPSIQKYEKQLNDITNQINQLKNQQNE
DKPLFDAKEISKLLQHIFHDIKHIEKSRLKALYLSVIDRIDIKKDGNHKK
QFYVTLKLNNEIIKQLFNNKQLDEVHLSTSSLFLPQTLYLTI
>SAR0001 dnaA, chromosomal replication initiator protein DnaA
MSEKEIWEKVLEIAQEKLSAVSYSTFLKDTELYTIKDGEAIVLSSIPFNA
NWLNQQYAEIIQAILFDVVGYEVKPHFITTEELANYSNNETATPKEATKP
STETTEDNHVLGREQFNAHNTFDTFVIGPGNRFPHAASLAVAEAPAKAYN
PLFIYGGVGLGKTHLMHAIGHHVLDNNPDAKVIYTSSEKFTNEFIKSIRD
NEGEAFRERYRNIDVLLIDDIQFIQNKVQTQEEFFYTFNELHQNNKQIVI
SSDRPPKEIAQLEDRLRSRFEWGLIVDITPPDYETRMAILQKKIEEEKLD
IPPEALNYIANQIQSNIRELEGALTRLLAYSQLLGKPITTELTAEALKDI
IQAPKSKKITIQDIQKIVGQYYNVRIEDFSAKKRTKSIAYPRQIAMYLSR
ELTDFSLPKIGEEFGGRDHTTVIHAHEKISKDLKEDPIFKQEVENLEKEI
RNV
>SAR1764 dnaB, chromosome replication initiation/membrane attachment protein
MGRQAFEFGLRPKDQFKVMQHFDLNTNHLEVLNRLYTPLIGTQAVGLYHF
MTQFVKESHNETLILSHYIFMNELKINLLEFRQQMDLLEAIGLLKAFVKH
DEQETQFVYQLIQPPSAHLFFNDPMLSIFLYSEVEHRRFHELKKYFEYQQ
IDLSEFKQVTRQFTDVFKVPSTKIDIDTSDIPINEPYQGIDLSNESFDFE
MLRQMLGKHFISQDIVTKDAKRLITQLATLYGLTADGMKHVILNSITSGQ
QLSFEEMRKQARSYYLMEHENQMPKLQVKSPATSSSTAKSTEANPKPQSD
EWFELLEQTSPIDMLASWSESEPTISQKTLVEELIEREKMSFGVINILLQ
FVMLKEDMKLPKAYILEIASNWKKKGIKTAKEAYNYAKKVNQPKNEGSSG
NYQKRGSYYGQRNRISKEKTPKWLENRDKPSEQDSAKDNSVDDQQLEQDR
QAFLDKLSKKWEEDSQ
>SAR0016 dnaC, DnaB-like helicase
MDRMYEQNQMPHNNEAEQSVLGSIIIDPELINTTQEVLLPESFYRGAHQH
IFRAMMHLNEDNKEIDVVTLMDQLSTEGTLNEAGGPQYLAELSTNVPTTR
NVQYYTDIVSKHALKRRLIQTADSIANDGYNDELELDAILSDAERRILEL
SSSRESDGFKDIRDVLGQVYETAEELDQNSGQTPGIPTGYRDLDQMTAGF
NRNDLIILAARPSVGKTAFALNIAQKVATHEDMYTVGIFSLEMGADQLAT
RMICSSGNVDSNRLRTGTMTEEDWSRFTIAVGKLSRTKIFIDDTPGIRIN
DLRSKCRRLKQEHGLDMIVIDYLQLIQGSGSRASDNRQQEVSEISRTLKA
LARELECPVIALSQLSRGVEQRQDKRPMMSDIRESGSIEQDADIVAFLYR
DDYYNRGGDEDDDDDGGFEPQTNDENGEIEIIIAKQRNGPTGTVKLHFMK
QYNKFTDIDYAHADMM
>SAR1781 dnaE, DNA polymerase III alpha subunit
MVAYLNIHTAYDLLNSSLKIEDAVRLAVSENVDALAITDTNVLYGFPKFY
DTCIANNIKPIFGMTIYVTNGLNNIETVVLAKDNYGLKDLYQLSSEIKMN
ALEHVSFELLKRFSNNMIIIFKNVADEHRDIVRVFDSHEDTYLDHRSVLV
QGIKHVWIQDVCYQTRHDADTISALAAIRDNTKLDLIHDQEDFGAHFLTE
NEIHQLDVNPEYFTQADRIAQKCNAELKYHQSLLPQYQTPNDESAKKYLW
RVLVTQLKKLELNYDVYLERLKYEYKVITNMGFEDYFLIVSDLIHYAKTN
DVMVGPGRGSSAGSLVSYLLGITTIDPIKFNLLFERFLNPERVTMPDIDI
DFEDTRREKVIQYVQEKYGELHVSGIVTFGHLLARAVARDVGRIMGFDEV
TLNEISSLIPHKLGITLDEAYQIDDFKKFVHRNHRHERWFSICKKLEGLP
RHTSTHAAGIIINDHPLYEYAPLTKGDTGLLTQWTMTEAERIGLLKIDFL
GLRNLSIIHQILTQVKKDLGINIDIEKIPFDDQKVFELLSQGDTTGIFQL
ESDGVRSVLKKLKPEHFEDIVAVTSLYRPGPMEEIPTYITRRHDPSKVQY
LHPHLEPILKNTYGVIIYQEQIMQIASTFANFSYGEADILRRAMSKKNRA
VLESERQHFIEGTKQNGYHEDISKQIFDLILKFADYGFPRAHAVSYSKIA
YIMSFLKVHYPNYFYANILSNVIGSEKKTAQMIEEAKKQGITILPPNINE
SHWFYKPSQEGIYLSIGTIKGVGYQSVKVIVEERYQNGKFKDFFDFARRI
PKRVKTRKLLEALILVGAFDAFGKTRSTLLQAIDQVLDGDLNIEQDGFLF
DILTPKQMYEDKEELPDALISQYEKEYLGFYVSQHPVDKKFVAKQYLTIF
KLSNAQNNKPILVQFDKVKQIRTKNGQNMAFVTLNDGIETLDGVIFPNQF
KKYEELLSHNDLFIVSGKFDLRKQQRQLIINEIQTLATFEEQKLAFAKQI
IIRNKSQIDMFEEMIKATKENANDVVLSFYDETIKQMTTLGYINQKDSMF
NNFIQSFNPSDIRLI
>SAR1639 dnaG, DNA primase
MRIDQSIINEIKDKTDILDLVSEYVKLEKRGRNYIGLCPFHDEKTPSFTV
SEDKQICHCFGCKKGGNVFQFTQEIKDISFVEAVKELGDRVNVAVDIEAT
QSNSNVQIASDDLQMIEMHELIQEFYYYALTKTVEGEQALTYLQERGFTD
ALIKERGIGFAPDSSHFCHDFLQKKGYDIELAYEAGLLSRNEENFSYYDR
FRNRIMFPLKNAQGRIVGYSGRTYTGQEPKYLNSPETPIFQKRKLLYNLD
KARKSIRKLDEIVLLEGFMDVIKSDTAGLKNVVATMGTQLSDEHITFIRK
LTSNITLMFDGDFAGSEATLKTGQHLLQQGLNVFVIQLPSGMDPDEYIGK
YGNDAFTAFVKNDKKSFAHYKVSILKDEIAHNDLSYERYLKELSHDISLM
KSSILQQKALNDVAPFFNVSPEQLANEIQFNQAPANYYPEDEYGGYIEPE
PIGMAQFDNLSRQEKAERAFLKHLMRDKDTFLNYYESVDKDNFTNQHFKY
VFEVLHDFYAENDQYNISDAVQYVNSNELRETLISLEQYSLNDEPYENEI
DDYVNVINENGQETIESLNHKLREATRIGDVELQKYYLQQIVAKNKERM
>SAR1763 dnaI, putative primosomal protein
MKQFKSIINTSQDFEKRIEKIKKEVINDPDVKQFLEAHRAELTNAMIDED
LNVLQEYKDQQKHYDGHKFADCPNFVKGHVPELYVDNNRIKIRYLQCPCK
IKYDEERFEAELITSHHMQRDTLNAKLKDIYMNHRDRLDVAMAADDICTA
ITNGEQVKGLYLYGPFGTGKSFILGAIANQLKSKKVRSTIIYLPEFIRTL
KGGFKDGSFEKKLHRVREANILMLDDIGAEEVTPWVRDEVIGPLLHYRMV
HELPTFFSSNFDYSELEHHLAMTRDGEEKTKAARIIERVKSLSTPYFLSG
ENFRNN
>SAR0002 dnaN, DNA polymerase III, beta chain
MMEFTIKRDYFITQLNDTLKAISPRTTLPILTGIKIDAKEHEVILTGSDS
EISIEITIPKTVDGEDIVNISETGSVVLPGRFFVDIIKKLPGKDVKLSTN
EQFQTLITSGHSEFNLSGLDPDQYPLLPQVSRDDAIQLSVKVLKNVIAQT
NFAVSTSETRPVLTGVNWLIQENELICTATDSHRLAVRKLQLEDVSENKN
VIIPGKALAELNKIMSDNEEDIDIFFASNQVLFKVGNVNFISRLLEGHYP
DTTRLFPENYEIKLSIDNGEFYHAIDRASLLAREGGNNVIKLSTGDDVVE
LSSTSPEIGTVKEEVDANDVEGGSLKISFNSKYMMDALKAIDNDEVEVEF
FGTMKPFILKPKGDDSVTQLILPIRTY
>SAR0477 dnaX, DNA polymerase III, tau subunit
MNYQALYRMYRPQSFEDVVGQEHVTKTLRNAISKEKQSHAYIFSGPRGTG
KTSIAKVFAKAINCLNSTDGEPCNECHICKGITQGTNSDVIEIDAASNNG
VDEIRNIRDKVKYAPSESKYKVYIIDEVHMLTTGAFNALLKTLEEPPAHA
IFILATTEPHKIPPTIISRAQRFDFKAISLDQIVERLKFVADAQQIECED
EALAFIAKASEGGMRDALSIMDQAIAFGDGTLTLQDALNVTGSVHDEALD
HLFDDIVQGDVQASFKKYHQFITEGKEVNRLINDMIYFVRDTIMNKTSEK
DTEYRALMNLELDMLYQMIDLINDTLVSIRFSVNQNVHFEVLLVKLAEQI
KGQPQVIANVAEPAQIASSPNTDVLLQRMEQLEQELKTLKAQGVSVAPTQ
KSSKKPARGIQKSKNAFSMQQIAKVLDKANKADIKLLKDHWQEVIDHAKN
NDKKSLVSLLQNSEPVAASEDHVLVKFEEEIHCEIVNKDDEKRSSIESVV
CNIVNKNVKVVGVPSDQWQRVRTEYLQNRKNEGDDMPKQQAQQTDIAQKA
KDLFGEETVHVIDEE
>SAR1367 grlA, topoisomerase IV subunit A
MSEIIQDLSLEDVLGDRFGRYSKYIIQERALPDVRDGLKPVQRRILYAMY
SSGNTHDKNFRKSAKTVGDVIGQYHPHGDFSVYEAMVRLSQDWKLRHVLI
EMHGNNGSIDNDPPAAMRYTEAKLSLLAEELLRDINKETVSFISNYDDTT
LEPMVLPSRFPNLLVNGSTGISAGYATDIPPHNLAEVIQATLKYIDNPDI
TVNQLMKYIKGPDFPTGGIIQGIDGIKKAYESGKGRIIVRSKVEEETLRN
GRKQLIITEIPYEVNKSSLVKRIDELRADKKVDGIVEVRDETDRTGLRIA
IELKKDVNSESIKNYLYKNSDLQISYNFNMVAISDGRPKLMGIRQIIDSY
LNHQIEVVANRTKFELDNAEKRMHIVEGLIKALSILDKVIELIRSSKNKR
DAKENLIEVYEFTEEQAEAIVMLQLYRLTNTDIVALEGEHKELEALIKQL
RHILDNHDALLNVIKEELNEIKKKFKSERLSLIEAEIEEIKIDKEVMVPS
EEVILSMTRHGYIKRTSIRSYNASGVEDIGLKDGDSLLKHQEVNTQDTVL
VFTNKGRYLFIPVHKLADIRWKELGQHVSQIVPIEEDEVVINVFNEKDFN
TDAFYVFATQNGMIKKSTVPLFKTTRFNKPLIATKVKENDDLISVMRFEK
DQLITIITNKGMSLTYNTSELSDTGLRAAGVKSINLKAEDFVVMTEGVSE
NDTILMATQRGSLKRISFKILQVAKRAQRGITLLKELKKNPHRIVAAHVV
TGEHSQYTLYSKSNEEHGLINDIHKSEQYTNGSFIVDTDDFGEVIDMYIS
>SAR1366 grlB, topoisomerase IV subunit B
MAMNKQNNYSDDSIQVLEGLEAVRKRPGMYIGSTDKRGLHHLVYEIVDNS
VDEVLNGYGNEIDVTINKDGSISIEDNGRGMPTGIHKSGKPTVEVIFTVL
HAGGKFGQGGYKTSGGLHGVGASVVNALSEWLEVEIHRDGNIYHQSFKNG
GSPSSGLVKKGKTKKTGTKVTFKPDDTIFKASTSFNFDVLSERLQESAFL
LKNLKITLNDLRSGKERQEHYHYEEGIKEFVSYVNEGKEVLHDVATFSGE
ANGIEVDVAFQYNDQYSESILSFVNNVRTKDGGTHEVGFKTAMTRVFNDY
ARRINELKTKDKNLDGNDIREGLTAVVSVRIPEELLQFEGQTKSKLGTSE
ARSAVDSVVADKLPFYLEEKGQLSKSLVKKAIKAQQAREAARKAREDARS
GKKNKRKDTLLSGKLTPAQSKNTDKNELYLVEGDSAGGSAKLGRDRKFQA
ILPLRGKVINTEKARLEDIFKNEEINTIIHTIGAGVGTDFKIEDSNYNRV
IIMTDADTDGAHIQVLLLTFFFKYMKPLVQAGRVFIALPPLYKLEKGKGK
TKRVEYAWTDEELNKLQKELGKGFTLQRYKGLGEMNPEQLWETTMNPETR
TLIRVQVEDEVRSSKRVTTLMGDKVQPRREWIEKHFEFGMQEDQSILDNS
EVQVLENDQFDEEEI
>SAR0006 gyrA, DNA gyrase subunit A
MAELPQSRINERNITSEMRESFLDYAMSVIVARALPDVRDGLKPVHRRIL
YGLNEQGMTPDKSYKKSARIVGDVMGKYHPHGDLSIYEAMVRMAQDFSYR
YPLVDGQGNFGSMDGDGAAAMRYTEARMTKITLELLRDINKDTIDFIDNY
DGNEREPSVLPARFPNLLANGASGIAVGMATNIPPHNLTELINGVLSLSK
NPDISIAELMEDIEGPDFPTAGLILGKSGIRRAYETGRGSIQMRSRAVIE
ERGGGRQRIVVTEIPFQVNKARMIEKIAELVRDKKIDGITDLRDETSLRT
GVRVVIDVRKDANASVILNNLYKQTPLQTSFGVNMIALVNGRPKLINLKE
ALVHYLEHQKTVVRRRTQYNLRKAKDRAHILEGLRIALDHIDEIISTIRE
SETDKVAMESLQQRFKLSEKQAQAILDMRLRRLTGLERDKIEAEYNELLN
YISELEAILADEEVLLQLVRDELTEIRDRFGDDRRTEIQLGGFEDLEDED
LIPEEQIVITLSHNNYIKRLPVSTYRAQNRGGRGVQGMNTLEEDFVSQLV
TLSTHDHVLFFTNKGRVYKLKGYEVPELSRQSKGIPVVNAIELENDEIIS
TMIAVKDLESEDNFLVFATKRGVVKRSALSNFSRINRNGKIAISFREDDE
LIAVRLTSGQEDILIGTSHASLIRFPESTLRPLGRTATGVKGITLREGDE
VVGLDVAHANSVDEVLVVTENGYGKRTPVNDYRLSNRGGKGIKTATITER
NGNVVCITTVTGEEDLMIVTNAGVIIRLDVADISQNGRAAQGVRLIRLGD
DQFVSTVAKVKEDADEENEDEQSTVSEDGTEQQREAVVNDETPGNAIHTE
VIDSEVNDEDGRIEVRQDFMDRVEEDIQQSSDDDEE
>SAR0005 gyrB, DNA gyrase subunit B
MTALSDVNNTDNYGAGQIQVLEGLEAVRKRPGMYIGSTSERGLHHLVWEI
VDNSIDEALAGYANQIEVVIEKDNWIKVTDNGRGIPVDIQEKMGRPAVEV
ILTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSQDLEVYVHRNETIYHQ
AYKKGVPQFDLKEVGTTDKTGTVIRFKADGEIFTETTVYNYETLQQRIRE
LAFLNKGIQITLRDERDEENVREDSYHYEGGIKSYVELLNENKEPIHDEP
IYIHQSKDDIEVEIAIQYNSGYATNLLTYANNIHTYEGGTHEDGFKRALT
RVLNSYGLSSKIMKEDKDRLSGEDTREGMTAIISIKHGDPQFEGQTKTKL
GNSEVRQVVDKLFSEHFERFLYENPQVARTVVEKGIMAARARVAAKKARE
VTRRKSALDVASLPGKLADCSSKSPEECEIFLVEGDSAGGSTKSGRDSRT
QAILPLRGKILNVEKARLDRILNNNEIRQMITAFGTGIGGDFDLAKARYH
KIVIMTDADVDGAHIRTLLLTFFYRFMRPLIEAGYVYIAQPPLYKLTQGK
QKYYVYNDRELDKLKSELNPTPKWSIARYKGLGEMNADQLWETTMNPEHR
ALLQVKLEDAIEADQTFEMLMGDVVENRRQFIEDNAVYANLDF
>SAR0485 holB, putative DNA polymerase III, delta' subunit
MDEQQQLTNAYHSNKLSHAYLFEGDDAQTMKQVAINFAKLILCQTDSQCE
TKVSTYNHPDFMYISTTENAIKKEQVEQLVRHMNQLPIESTNKVYIIEDF
EKLTVQGENSILKFLEEPPDNTIAILLSTKPEQILDTIHSRCQHVYFKPI
DKEKFINRLVEQDMSKPVAEMISTYTTQIDNALALNEEFDLLALRKSVIR
WCELLLTNKPMALIGIIDLLKQAKNKKLQSLTIAAVNGFFEDIIHTKVNV
EDKQIYSDLKNDSDQYAQKLSFNQLILMFDQLTEAHKKLNQNVNPTLVFE
QIVIKGVS
>SAR1482 hup, DNA-binding protein HU
MNKTDLINAVAEQADLTKKEAGSAVDAVFESIQNSLAKGEKVQLIGFGNF
EVRERAARKGRNPQTGKEIDIPASKVPAFKAGKALKDAVK
>SAR2105 int, integrase
MKTRCYDGKKWQYEFKHEGKRYRKKGFRTKREANSAGLDKLNELRSGFNI
DNYITLEEYFENWIKTYKQPVVKENTYRHYRNALQHIQKHKIGKMELSKI
NRQVYQKFINDYSKEHAKETIRKTNGAIRSALDDALYDGLIFKNPAYKVN
YKAGKPTKSEQEKFISVTEYEILKDHVRKKRTRSSLALFIMICTGCRVSG
ARNIKIEHINQVKNTIFIDERKTNTSPRYISIAKSDMKHIMDVISTFAIS
YDGYIFKEGGSIINLHAINNALKSACRVNNIPIITSHALRHTHCSYLLAK
GVSIHYISKRLGHKNIAITTSVYSHLLEEKFNEEDKKTTKILESM
>SAR1996 lig, DNA ligase
MADLSSRVNELHDLLNQYSYEYYVEDNPSVPDSEYDKLLHELIKIEEEHP
EFKTVDSPTVRVGGEAQASFNKVNHDTPMLSLGNAFNEDDLRKFDQRIRE
QIGNVEYMCELKIDGLAVSLKYVDGYFVQGLTRGDGTTGEDITENLKTIH
AIPLKMKEPLNVEVRGEAYMPRRSFLRLNEEKEKNDEQLFANPRNAAAGS
LRQLDSKLTAKRKLSVFIYSVNDFTDFNARSQSEALDELDKLGFTTNKNR
ARVSDIDGVLEYIEKWTSQRESLPYDIDGIVIKVNDLDQQDEMGFTQKSP
RWAIAYKFPAEEVVTKLLDIELSIGRTGVVTPTAILEPVKVAGTTVSRAS
LHNEDLIHDRDIRIGDSVVVKKAGDIIPEVVRSIPERRPEDAVTYHMPTH
CPSCGHELVRIEGEVALRCINPKCQAQLVEGLIHFVSRQAMNIDGLGTKI
IQQLYQSELIKDVADIFYLTEEDLLPLDRMGQKKVDNLLAAIQQAKDNSL
ENLLFGLGIRHLGVKASQVLAEKYETIDRLLTVTEAELVEIHDIGDKVAQ
SVVTYLENEDIRALIQKLKDKHVNMIYKGIKTSDIEGHPEFSGKTIVLTG
KLHQMTRNEASKWLASQGAKVTSSVTKNTDVVIAGEDAGSKLTKAQSLGI
EIWTEQQFVDKQNELNS
>SAR1272 mutL, DNA mismatch repair protein MutL
MGKIKELQTSLANKIAAGEVVERPSSVVKELLENAIDAGATEISIEVEES
GVQSIRVVDNGSGIEAEDLGLVFHRHATSKLDQDEDLFHIRTLGFRGEAL
ASISSVAKVTLKTCTDNANGNEIYVENGEILNHKPAKAKKGTDILVESLF
YNTPARLKYIKSLYTELGKITDIVNRMAMSHPDIRIALISDGKTMLSTNG
SGRTNEVMAEIYGMKVARDLVHISGDTSDYHIEGFVAKPEHSRSNKHYIS
IFINGRYIKNFMLNKAILEGYHTLLTIGRFPICYINIEMDPILVDVNVHP
TKLEVRLSKEEQLYQLIVSKIQEAFKDRILIPKNNLDYVPKKNKVLHSFE
QQKIEFEQRQNTENKQEKTFSSEESNSKPFMAENQNDEIVIKEDSYNPFV
TKTSESLIADDESSGYNNTREKDEDYFKKQQEILQEMDQTFDSNDDTSVQ
NYENKASDDYYDVNDIKGTKSKDPKRRIPYMEIVGQVHGTYIIAQNEFGM
YMIDQHAAQERIKYEYFRDKIGEVTNEIQDLLIPLTFHFSKDEQLVIDQY
KNELQQVGIMLEHFGGHDYIVSSYPVWFPKDEVEEIIKDMIELILEEKKV
DIKKLREDVAIMMSCKKSIKANHYLQKHEMSDLIDQLREAEDPFTCPHGR
PIIINFSKYELEKLFKRVM
>SAR1271 mutS, DNA mismatch repair protein MutS
MSNVTPMMQQYLKIKSEYQDCLLFFRLGDFYEMFYEDAKEASRVLEITLT
KRDAKKENPIPMCGVPYHSADSYIDTLVNNGYKVAICEQMEDPKQTKGMV
RREVVRIVTPGTVMEQGGVDDKQNNYILSFVMNQPEIALSYCDVSTGELK
VTHFNDEATLLNEITTINPNEVVINDNISDHLKRQINMVTETITVRETLS
SEIYSVNQTEHKLMYQATQLLLDYIHHTQKRDLSHIEDAVQYAAIDYMKM
DFYAKRNLELTESIRLKSKKGTLLWLMDETKTPMGARRLKQWIDRPLISK
EQIEARLDIVDEFSAHFIERDTLRTYLNQVYDIERLVGRVSYGNVNARDL
IQLKHSISEIPNIKALLNSMNQDTLVQVNQLEPLDDLLDILEQSLVEEPP
ISVKDGGLFKVGFNMQLDEYLEASKNGKTWLAELQAKERQRTGIKSLKIS
FNKVFGYFIEITRANLQNFEPSEFGYMRKQTLSNAERFITDELKEKEDII
LGAEDKAIELEYQLFVQLREEVKKYTERLQHQAKIISELDCLQSFAEIAQ
KYNYTRPSFSENKTLELVESRHPVVERVMDYNDYVPNNCRLDNETFIYLI
TGPNMSGKSTYMRQVAIISIMAQMGAYVPCKEAVLPIFDQIFTRIGAADD
LVSGKSTFMVEMLEAQKALTYATEDSLIIFDEIGRGTSTYDGLALAQAMI
EYVAETSHAKTLFSTHYHELTTLDQALPSLKNVHVAANEYKGELIFLHKV
KDGAVDDSYGIQVAKLADLPEKVISRAQVILSEFEASAGKKSSISNLKMV
ENEPEINQENSNLSVEETTDTLSQKDFEQASFDLFENDQESEIELQIKNL
NLSNMTPIEALVKLSELQNQLK
>SAR0847 nuc, thermonuclease precursor
MTEYLLSAGICMAIVSILLIGMAISNVSKEQYAKRFFFFATSCLVLTLVV
ASSLSSSANASQTDNGVNRSGSEYPTVYSATSTKKLHKEPATLIKAIDGD
TVKLMYKGQPMTFRLLLVDTPETKHPKKGVEKYGPEASAFTKKMVENAKK
IEVEFDKGQRTDKYGRGLAYIYADGKMVNEALVRQGLAKVAYVYKPNNTH
EQLLRKSEAQAKKEKLNIWSEDNADSGQ
>SAR1334 nucI, thermonuclease
MKSNKSLAMIVVAIIIVGVLAFQFMNHKGPFKKGTNHETVQDLNGKDKVH
VQRVVDGDTFIANQNGKEIKVRLIGVDTPETVKPNTPVQPFGKEASNYSK
KTLTNQDVYLEYDKEKQDRYGRTLAYVWISKDRMYNKELVEKGLAREKYF
SPNGKYRNVFIEAQNKAKQQKLNIWSK
>SAR1997 pcrA, ATP-dependent DNA helicase
MNALLNHMNTEQSEAVKTTEGPLLIMAGAGSGKTRVLTHRIAYLLDEKDV
SPYNVLAITFTNKAAREMKERVQKLVGDQAEVIWMSTFHSMCVRILRRDA
DRIGIERNFTIIDPTDQKSVIKDVLKNENIDSKKFEPRMFIGAISNLKNE
LKTPADAQKEATDYHSQMVATVYSGYQRQLSRNEALDFDDLIMTTINLFE
RVPEVLEYYQNKFQYIHVDEYQDTNKAQYTLVKLLASKFKNLCVVGDSDQ
SIYGWRGADIQNILSFEKDYPEANTIFLEQNYRSTKTILNAANEVIKNNS
ERKPKGLWTANTNGEKIHYYEAMTERDEAEFVIREIMKHQRNGKKYQDMA
ILYRTNAQSRVLEETFMKSNMPYTMVGGQKFYDRKEIKDLLSYLRIIANS
NDDISLQRIINVPKRGVGPSSVEKVQNYALQNNISMFDALGEADFIGLSK
KVTQECLNFYELIQSLIKEQEFLEIHEIVDEVLQKSGYREMLERENTLES
RSRLENIDEFMSVPKDYEENTPLEEQSLINFLTDLSLVADIDEADTENGV
TLMTMHSAKGLEFPIVFIMGMEESLFPHIRAIKSEGDHEMQEERRICYVA
ITRAEEVLYITHATSRMLFGRPQSNMPSRFLKEIPESLLENHSSGKRQTI
QPKAKPFAKRGFSQRTTSTKKQVSSSDWNVGDKVMHKAWGEGMVSNVNEK
NGSIELDIIFKSQGPKRLLAQFAPIEKKED
>SAR1769 polA, DNA polymerase I
MNKLVLIDGNSLSFRAFYALPLLSNKAGIHTNAVYGFAMLLEKILKEEKP
NHFLVAFDAGKTTFRHEKYSEYKGGRQKTPPELSEQFPYIRQLLDAYHIK
RYELDNYEADDIIGTLSKEADKAGFQTIIITGDRDLTQLATDNVTIYYTK
KGVTDVDHYTPDFIAEKYNGLTPNQIIDMKGLMGDTSDNIPGVAGVGEKT
AIKLLNQFDTVEGVYEHLDEISGKKLKEKLQNSKENALMSKELATINVDS
PIEVKLEDTLMTHQDEQQEKIELFKKLEFKQLLADIDQSASGEDAIEKTF
EIETSFDNVDFTSLKEATIHFELDGGNYLRNNILKFSLFTGEKHIVINAD
DIINYAELVSWLENPNTKKVVYDAKKTYVASHRLGIDIQNISFDIMLASY
IIDPSRTISDVQSVVSLYGQSFVKDDVSIYGKGKKFKVPEDDVLNPYVAS
ITDAIYFAKPNMDKQLEEYNQVELLADLELPLAKILSEMEEIGIYTDVHD
LEEMEKEIQEKLDVLIRNIHDAAGEDFNINSPKQLGVVLFETLQLPVIKK
TKTGYSTAVDVLEQLQGEHPIIDYILEYRQLSKLQSTYVEGLQKVISDDQ
RIHTRFNQTLAQTGRLSSVDPNLQNIPVRLEEGRKIRKAFKPTSKDSVIL
SADYSQIELRVLAHITQDESMKEAFINGDDIHTATAMKVFGVEADQVDSL
MRRQAKAVNFGIVYGISDYGLSQSLGITRKKAKAFIDDYLASFPGVKQYM
SDIVKDAKALGFVETLLHRRRYIPDITSRNFNLRGFAERTAMNTPIQGSA
ADIIKLAMVKFAQKMKETTYQAKLLLQVHDELIFEVPKSEVDSFSEFVEE
IMENALQLDVPLKVDSSYGATWYDAK
>SAR1240 polC, DNA polymerase III PolC-type
MAMTEQQKFKVLADQIKISNQLDAEILNSGELTRIDVSNKNRTWEFHITL
PQFLAHEDYLLFINAIEQEFKDIANVTCRFTVTNGTNQDEHAIKYFGHCI
DQTALSPKVKGQLKQKKLIMSGKVLKVMVSNDIERNHFDKACNGSLIKAF
RNCGFDIDKIIFETNDNDQEQNLASLEAHIQEEDEQSARLATEKLEKMKA
EKSKQQDNNESAVDKCQIGKPIQIENIKPIESIIEEEYKVAIEGVIFDIN
LKELKSGRHIVEIKVTDYTDSLVLKMFTRKNKDDLEHFKALSVGKWVRAQ
GRIEEDTFIRDLVMMMSDIEEIKKATKKDKAEEKRVEFHLHTAMSQMDGI
PNIGAYVKQAADWGHPAIAVTDHNVVQAFPDAHAAAEKHGIKMIYGMEGM
LVDDGVPIAYKPQDVVLKDATYVVFDVETTGLSNQYDKIIELAAVKVHNG
EIIDKFERFSNPHERLSETIINLTHITDDMLVDAPEIEEVLTEFKEWVGD
AIFVAHNASFDMGFIDTGYERLGFGPSTNGVIDTLELSRTINTEYGKHGL
NFLAKKYGVELTQHHRAIYDTEATAYIFIKMVQQMKELGVLNHNEINKKL
SNEDAYKRARPSHVTLIVQNQQGLKNLFKIVSASLVKYFYRTPRIPRSLL
DEYREGLLVGTACDEGELFTAVMQKDQSQVEKIAKYYDFIEIQPPALYQD
LIDRELIRDTETLHEIYQRLIHAGDTAGIPVIATGNAHYLFEHDGIARKI
LIASQPGNPLNRSTLPEAHFRTTDEMLNEFHFLGEEKAHEIVVKNTNELA
DRIERVVPIKDELYTPRMEGANEEIRELSYANARKLYGEDLPQIVIDRLE
KELKSIIGNGFAVIYLISQRLVKKSLDDGYLVGSRGSVGSSFVATMTEIT
EVNPLPPHYICPNCKTSEFFNDGSVGSGFDLPDKTCETCGAPLIKEGQDI
PFETFLGFKGDKVPDIDLNFSGEYQPNAHNYTKVLFGEDKVFRAGTIGTV
AEKTAFGYVKGYLNDQGIHKRGAEIDRLVKGCTGVKRTTGQHPGGIIVVP
DYMDIYDFTPIQYPADDQNSAWMTTHFDFHSIHDNVLKLDILGHDDPTMI
RMLQDLSGIDPKTIPVDDKEVMQIFSTPESLGVTEDEILCKTGTFGVPEF
GTGFVRQMLEDTKPTTFSELVQISGLSHGTDVWLGNAQELIKTGICDLSS
VIGCRDDIMVYLMYAGLEPSMAFKIMESVRKGKGLTEEMIETMKENEVPD
WYLDSCLKIKYMFPKAHAAAYVLMAVRIAYFKVHHPLYYYASYFTIRASD
FDLITMIKDKTSIRNTVKDMYSRYMDLGKKEKDVLTVLEIMNEMAHRGYR
MQPISLEKSQAFEFIIEGETLIPPFISVPGLGENVAKRIVEARDDGPFLS
KEDLNKKAGLSQKIIEYLDELGSLPNLPDKAQLSIFDM
>SAR1188 priA, primosomal protein n'
MIAKVIVDVASKSVDYKFDYIIPEQLESVIQPGVRVIVPFGPRTIQGYVM
EVTAEPDAQLDVSKLKKIIEVKDIQPELTSELIALSEWMGSTHVIKRISM
LEVMLPSAIKAKYKKAFKMKDDIEVPSTLLQKFDKHGYYYYKDAQKNNDI
QLLMKLLKDDIVEERTILTQNITKKTKRAVRVIEGYHPDEVLAKLEKVIK
QYDLYAYLSEEQHKTIFLTDIEDMGFSKSSLDGLIKKGYVEKYDAVVERD
PFKDRVFEQESKQQLTEDQYKAYEAIKAKIVSQEQETFLLHGVTGSGKTE
VYLQTIEDVLSQGKQAMMLVPEIALTPQMVLRFKRRFGDDVAVLHSGLSN
GERYDEWQKIRDGRARVSVGARSSVFAPFKNLGLIIIDEEHESTYKQEDY
PRYHAREIAQWRSEYHHCPVILGSATPCLESYARAEKGVYHLLSLPNRVN
QQALPEIDIVDMREELSEGNRSMFSKDLREAIQLRLDRQEQVVLFLNRRG
YASFMLCRDCGYVPQCPNCDISLTYHKTTDLLKCHYCGYQETPPNQCPNC
ESEHIRQVGTGTQKVEELLQQEFEDARIIRMDVDTTSKKGAHEKLLTEFE
KGNGDILLGTQMIAKGLDYPNITLVGVLNADTMLNLPDFRASERTYQLLT
QVAGRAGRHEKAGQVIIQTYNPDHYSILDVQKNDYLTFYRQEMEYRKLGK
YPPYYYLINFTISHKEMKKVMEASQHVHKILLQHLTEKALVLGPSPAALA
RINNEFRFQILVKYKSEPGLLQAIQFLDDYYHEKFIKEKLALKIDIDPQM
MM
>SAR1261 recA, recombinase A
MDNDRQKALDTVIKNMEKSFGKGAVMKLGDNIGRRVSTTSTGSVTLDNAL
GVGGYPKGRIIEIYGPESSGKTTVALHAIAEVQSNGGVAAFIDAEHALDP
EYAQALGVDIDNLYLSQPDHGEQGLEIAEAFVRSGAVDIVVVDSVAALTP
KAEIEGEMGDTHVGLQARLMSQALRKLSGAISKSNTTAIFINQIREKVGV
MFGNPETTPGGRALKFYSSVRLEVRRAEQLKQGQEIVGNRTKIKVVKNKV
APPFRVAEVDIMYGQGISKEGELIDLGVENDIVDKSGAWYSYNGERMGQG
KENVKMYLKENPQIKEEIDRKLREKLGISDGDVEETEDAPKSLFDEE
>SAR0004 recF, DNA replication and repair protein RecF
MKLNTLQLENYRNYDEVTLKCHPDVNILIGENAQGKTNLLESIYTLALAK
SHRTSNDKELIRFNADYAKIEGELSYRHGTMPLTMFITKKGKQVKVNHLE
QSRLTQYIGHLNVVLFAPEDLNIVKGSPQIRRRFIDMELGQISAVYLNDL
AQYQRILKQKNNYLKQLQLGQKKDLTMLEVLNQQFAEYAMKVTDKRAHFI
QELESLAKPIHAGITNDKEALSLNYLPSLKFDYAQNEAARLEEIMSILSD
NMQREKERGISLFGPHRDDISFDVNGMDAQTYGSQGQQRTTALSIKLAEI
ELMNIEVGEYPILLLDDVLSELDDSRQTHLLSTIQHKVQTFVTTTSVDGI
DHEIMNNAKLYRINQGEIIK
>SAR1203 recG, ATP-dependent DNA helicase
MAKVNLIESPYSLLQLKGIGPKKIEVLQQLNIHTVEDLVLYLPTRYEDNT
VIDLNQAEDQSNVTIEGQVYTAPVVAFFGRNKSKLTVHLMVNNIAVKCIF
FNQPYLKKKIELNQTITVKGKWNRVKQEITGNRVFLNSQGTQTQENADVQ
LEPVYRIKEGIKQKQIRDQIRQALNDVTIHEWLTDELREKYKLETLDFTL
NTLHHPKSKEDLLRARRTYAFTELFLFELRMQWLNRLEKSSDEAIEIDYD
LDQVKSFIDRLPFELTEAQKSSVNEIFRDLKAPIRMHRLLQGDVGSGKTV
VAAICMYALKTAGYQSALMVPTEILAEQHAESLIALFGDSMNVALLTGSV
KGKKRKILLEQLENRTIDCLIGTHALIQDDVIFHNVGLVITDEQHRFGVN
QRQLLREKGAMTNVLFMTATPIPRTLAISVFGEMDVSSIKQLPKGRKPII
TTWAKHEQYDKVLMQMTSELKKGRQAYVICPLIESSEHLEDVQNVVALYE
SLQQYYGVSRVGLLHGKLSADEKDEVMQKFSNHEIDVLVSTTVVEVGVNV
PNATFMMIYDADRFGLSTLHQLRGRVGRSDQQSYCVLIASPKTETGIERM
TIMTQTTDGFELSERDLEMRGPGDFFGVKQSGLPDFLVANLVEDYRMLEV
ARDEAAELIQSGVFFENTYQHLRHFVEENLLHRSFD
>SAR0479 recR, putative recombination protein
MHYPEPISKLINSFMKLPGIGPKTAQRLAFHTLDMKEDDVVQFAKALVDV
KRELTYCSVCGHITENDPCYICEDKQRDRSVICVVEDDKDVIAMEKMREY
KGLYHVLHGSISPMDGIGPEDINIPSLIERLKNDEVSELILAMNPNLEGE
STAMYISRLVKPIGIKVTRLAQGLSVGGDLEYADEVTLSKAIAGRTEM
>SAR1220 rnhB, putative ribonuclease HII
MTLTIKEVTQLINAVNTIEELENHECFLDERKGVQNAIARRRKALEKEQA
LKEKYVEMTYFENEILTENPNAIICGIDEVGRGPLAGPVVACATILNSNH
NYLGLDDSKKVPVTKRLELNEALKNEVTAFAYGIATAEEIDEFNIYKATQ
IAMQRAIDGLSVQPTHLLIDAMTLDNALPQVSLIKGDARSVSIAAASIMA
KVFRDDYMTQLSKDYPEYGFEKNAGYGTKQHLLAIDDIGIMKEHRKSFEP
IKSLL
>SAR1722 ruvA, Holliday junction DNA helicase RuvA
MYAYVKGKLTHLYPTHVVVETAGVGYEIQTPNSYRFQKHLDHEVLIHTSL
IVREDAQLLYGFSSEEEKDMFLSLIKVTGIGPKSALAILATSTPNEVKRA
IENENDTYLTKFPGIGKKTARQIVLDLKGKVKITEEDSDSLLQVDATSTE
QDQFVQEAMLALEALGYSKRELAKVEKTLNKNKYDSVDEAVKAGLQLVVS
>SAR1721 ruvB, Holliday junction DNA helicase
MNERMVDQSMHSEETDFELSLRPTRLRQYIGQNSIKSNLEVFIKAAKLRH
EPLDHVLLFGPPGLGKTTLSNIIANEMEVNIRTVSGPSLERPGDLAAILS
GLQPGDVLFIDEIHRLSSVVEEVLYPAMEDFFLDIIIGKGDEARSIRIDL
PPFTLVGATTRAGSLTGPLRDRFGVHLRLEYYNESDLKEIIIRTAEVLGT
GIDEESAIELAKRSRGTPRVANRLLKRVRDFQQVNEDEQIYIETTKHALG
LLQVDQHGLDYIDHKMMNCIIKQYNGGPVGLDTIAVTIGEERITIEDVYE
PFLIQKGFLERTPRGRKATPLAYEHFAKSNEERE
>SAR2725 sasF, putative surface anchored protein
MAKYRGKPFQLYVKLSCSTMMATSIILTNILPYDAQAVSEKDTEISKELL
SKQDLLDKVDKANRQIEQLKQLSASSKAHYKAQLNEAKTASQIDEIIKRA
NELDSKDNKGSQIEMNGRSDIDSKLDQLLKDLNEVSSKVDRGQQSDEDDL
NAMKNDMSQTATTKHGEKDDKNDEAMVNKALEDLDHLSQQIHKSEDSAVS
TTDNNHEVAKTPNNDGSGHVVLNKFLSNEENQSHSNRLTDKLQGSDKINH
AMIEKLAKSNASTQHYTYHKLNTLQSLDQRIANTQLPKNQKSDLMSEVNK
TKERIKSQRNIILEELARTDDKKHATQRILESIFNKDEADKILKDIRVDG
KTDQQIADQITRHIDQLSLTTSDDLLTSLIDQSQDKSLLISQILQTKLGK
AEADKLAKDWTNKGLSNRQIVDQLKKHFASTGDTSSDDILKAILNNAKDK
KQAIETILATRIERQKAKLLADLITKIETDQNKIFNLVKSALNGKADDLL
NLQKRLNQTKKDIDYVLSPIVNRPSLLDRLNKNGKTTDLNKLANLMNQGS
NLLDSIPDIPTPKPEKTLTLGKGNGLLSGLLNADGNVSLPKAGETIKEHW
LPISVIVGAMGVLMIWLSRRNKLKNKA
>SAR0363 ssb, putative single-strand DNA-binding protein
MLNRVVLVGRLTKDPEYRTTPSGVSVATFTLAVNRTFTNAQGEREADFIN
CVVFRRQADNVNNYLSKGSLAGVDGRLQSRNYENQEGRRVFVTEVVCDSV
QFLEPKNAQQNGGQRQQNEFQDYGQGFGGQQSGQNNSYNNSSNTKQSDNP
FANANGPIDISDDDLPF
>SAR1744 tag, DNA-3-methyladenine glycosylase I
MNECAFGTKDPVYLDYHDHVWGQPLYDSKALFKLLALESQHAGLSWLTIL
KKKEAYEEAFYDFEPEKVAQMTAHDIDRLMTFPNIVHHRKKLEAIVNQAQ
GYLKIEQAFGSFSKFLWSYVNGKPKDLQYEHASDRITVDDTATQLSKDLK
QYGFKFLGPVTVFSFLEAAGLYDAHLKDCPSKPRHN
>SAR0054 tnpA1, transposase A 1
MKVQRIEVENKPYPLYLLLDKEYQLIEPVMKFIKYLDNTGKSPNTIKAYC
YHLKLLYEFMEQRGVILNDINFELLADFVGWLRYPSASNVIDLQSKKAIR
EETTVNTILNVVMSFLDYLSRLGEFKSIDVFKQAKGRNFKGFLHHVNKGR
YQKNVLKLRVKKKQIRTLRSKEVKQIIDACHTKRDKLILMLMYEGGLRIG
EVLSLRLEDIVTWDNQIHLTPRDVNVNEAYIKLRKERTIHVSKELMSLYT
DYLIYEYSEELEHDYVFISLKEGYFGKPLKYQSVLDLVRRIVKRTGIEFT
SHMLRHTHATQLIREGWDVAFVQKRLGHAHVQTTLNTYVHLSDQDMKNEF
NKYLERKEHKK
>SAR1739 tnpA2, transposase A 2
MKVQRIEVENKPYPLYLLLDKEYQLIEPVMKFIKYLDNTGKSPNTIKAYC
YHLKLLYEFMEQRGVILNDINFELLADFVGWLRYPSASNVIDLQSKKAIR
EETTVNTILNVVMSFLDYLSRLGEFKSIDVFKQAKGRNFKGFLHHVNKGR
YQKNVLKLRVKKKQIRTLRSKEVKQIIDACHTKRDKLILMLMYEGGLRIG
EVLSLRLEDIVTWDNQIHLTPRDVNVNEAYIKLRKERTIHVSKELMSLYT
DYLIYEYSEELEHDYVFISLKEGYFGKPLKYQSVLDLVRRIVKRTGIEFT
SHMLRHTHATQLIREGWDVAFVQKRLGHAHVQTTLNTYVHLSDQDMKNEF
NKYLERKEHKK
>SAR0053 tnpB1, transposase B 1
MNASSKRKIISQSEISKKIAVMNEEMQGFWANNSWDIRKCPHPSAIELSK
NPALRNRWVRFERVKNLWLRTELKYFYFYHLNNGIWNAKTVWIRKGTVIN
KMLDFLDLKYPSITSITEVPIEKAMTEYRTYLTKRGVRITTTNYKITANQ
EKTPVKANSYYVTNLKQFMEFYENFYFDGEEWDKDVWDRRNLPLPDDKVN
PTQYEYTINFKGFRNTYFKQLVKRYCKLRLNVDSFSYVSDIAQRLKEFFN
FLDMKFKQVQRVHQLTRVEIEAYLSELNMMGIKPSTITGRISILEGLFST
LLRLEWDDVPSKILIYSEDYPKIPRAKPRFIDEFVLEQLNSHLDKLPEYI
ATMTMIVQECGMRISELCTLKKGCLLEDKDGDFFLKYYQWKMKKEHIVPI
SKEVALLIKVREDKVSEEFPDSEYLFPRKDGSPLKQETFRGELNKLAYEQ
NIVDKSGEIYRFHAHAFRHTVGTRMINNGMPQHIVQKFLGHESPEMTSRY
AHIFDETLKNEFTKFQEKLVTNNGDVLDLDEDNEVDDVELQWFKKNINAQ
VLPNGYCRLPVVAGGCPHANACLDCTHFCTSKQFLPQHEEQLERTEELLA
IAKDKQWQRQVETNSRVKERLEQIIGSLTG
>SAR1738 tnpB2, transposase B 2
MNASSKRKIISQSEISKKIAVMNEEMQGFWANNSWDIRKCPHPSAIELSK
NPALRNRWVRFERVKNLWLRTELKYFYFYHLNNGIWNAKTVWIRKGTVIN
KMLDFLDLKYPSITSITEVPIEKAMTEYRTYLTKRGVRITTTNYKITANQ
EKTPVKANSYYVTNLKQFMEFYENFYFDGEEWDKDVWDRRNLPLPDDKVN
PTQYEYTINFKGFRNTYFKQLVKRYCKLRLNVDSFSYVSDIAQRLKEFFN
FLDMKFKQVQRVHQLTRVEIEAYLSELNMMGIKPSTITGRISILEGLFST
LLRLEWDDVPSKILIYSEDYPKIPRAKPRFIDEFVLEQLNSHLDKLPEYI
ATMTMIVQECGMRISELCTLKKGCLLEDKDGDFFLKYYQWKMKKEHIVPI
SKEVALLIKVREDKVSEEFPDSEYLFPRKDGSPLKQETFRGELNKLAYEQ
NIVDKSGEIYRFHAHAFRHTVGTRMINNGMPQHIVQKFLGHESPEMTSRY
AHIFDETLKNEFTKFQEKLVTNNGDVLDLDEDNEVDDVELQWFKKNINAQ
VLPNGYCRLPVVAGGCPHANACLDCTHFCTSKQFLPQHEEQLERTEELLA
IAKDKQWQRQVETNSRVKERLEQIIGSLTG
>SAR1828 tnpR, resolvase
MKIGYARVSTGLQNLNLQEDRLNAYGCEKIFSDHISGSKSKRPGLDKAIE
FARSGDTIVVWRLDRLGRNMEDLITLVNELNERGVSFHSLEENITMDKSS
STGQLLFHLFAAFAEFERNLILERSSAGRIAARARGRYGGRPEKLNQKDL
NLLKTLYDNGTPIKTIAEQWQVSRTTIYRYLNKLEEKEDEKQGEVSN
>SAR1226 topA, DNA topoisomerase I
MADNLVIVESPAKAKTIEKYLGKKYKVIASMGHVRDLPRSQMGVDTEDNY
EPKYITIRGKGPVVKELKKHAKKAKNVFLASDPDREGEAIAWHLSKILEL
EDSKENRVVFNEITKDAVKESFKNPREIEMNLVDAQQARRILDRLVGYNI
SPVLWKKVKKGLSAGRVQSVALRLVIDRENEIRNFKPEEYWTIEGEFRYK
KSKFNAKFLHYKNKPFKLKTKKDVEKITAALDGDQFEITNVTKKEKTRNP
ANPFTTSTLQQEAARKLNFKARKTMMVAQQLYEGIDLKKQGTIGLITYMR
TDSTRISDTAKAEAKQYITDKYGESYTSKRKASGKQGDQDAHEAIRPSST
MRTPDDMKSFLTKDQYRLYKLIWERFVASQMAPAILDTVSLDITQGDIKF
RANGQTIKFKGFMTLYVETKDDSDSEKENKLPKLEQGDKVTATQIEPAQH
YTQPPPRYTEARLVKTLEELKIGRPSTYAPTIDTIQKRNYVKLESKRFVP
TELGEIVHEQVKEYFPEIIDVEFTVNMETLLDKIAEGDITWRKVIDGFFS
SFKQDVERAEEEMEKIEIKDEPAGEDCEVCGSPMVIKMGRYGKFMACSNF
PDCRNTKAIVKSIGVKCPKCNDGDVVERKSKKNRVFYGCSKYPECDFISW
DKPIGRDCPKCNQYLVENKKGKTTQVICSNCDYKEAAQK
>SAR0586 ung, putative uracil-DNA glycosylase
MEWSQIFHDITTKHDFKAMHDFLEKEYSTAIVYPDRENIYQAFDLTPFEN
IKVVILGQDPYHGPNQAHGLAFSVQPNAKFPPSLRNMYKELADDIGCVRQ
TPHLQDWAREGVLLLNTVLTVRQGEANSHRDIGWETFTDEIIKAVSDYKE
HVVFILWGKPAQQKIKLIDTSKHCIIKSVHPSPLSAYRGFFGSKPYSKAN
TYLESVGKSPINWCESEA
>SAR0813 uvrA, excinuclease ABC subunit A
MKEPSIVVKGARAHNLKDIDIELPKNKLIVMTGLSGSGKSSLAFDTIYAE
GQRRYVESLSAYARQFLGQMDKPDVDTIEGLSPAISIDQKTTSKNPRSTV
ATVTEIYDYIRLLYARVGKPYCPNHNIEIESQTVQQMVDRIMELEARTKI
QLLAPVIAHRKGSHEKLIEDIGKKGYVRLRIDGEIVDVNDVPTLDKNKNH
TIEVVVDRLVVKDGIETRLADSIETALELSEGQLTVDVIDGEDLKFSESH
ACPICGFSIGELEPRMFSFNSPFGACPTCDGLGQKLTVDVDLVVPDKDKT
LNEGAIEPWIPTSSDFYPTLLKRVCEVYKINMDKPFKKLTERQRDILLYG
SGDKEIEFTFTQRQGGTRKRTMVFEGVVPNISRRFHESPSEYTREMMSKY
MTELPCETCHGKRLSREALSVYVGGLNIGEVVEYSISQALNYYKNINLSE
QDQAIANQILKEIISRLTFLNNVGLEYLTLNRASGTLSGGEAQRIRLATQ
IGSRLTGVLYVLDEPSIGLHQRDNDRLINTLKEMRDLGNTLIVVEHDDDT
MRAADYLVDIGPGAGEHGGQIVSSGTPQKVMKDKKSLTGQYLSGKKRIEV
PEYRRPASDRKISIRGARSNNLKGVDVDIPLSIMTVVTGVSGSGKSSLVN
EVLYKSLAQKINKSKVKPGLYDKIEGIDQLDKIIDIDQSPIGRTPRSNPA
TYTGVFDDIRDVFAQTNEAKIRGYQKGRFSFNVKGGRCEACKGDGIIKIE
MHFLPDVYVPCEVCDGKRYNRETLEVTYKGKNIADILEMTVEEATQFFEN
IPKIKRKLQTLVDVGLGYVTLGQQATTLSGGEAQRVKLASELHKRSTGKS
IYILDEPTTGLHVDDISRLLKVLNRLVENGDTVVIIEHNLDVIKTADYII
DLGPEGGSGGGTIVATGTPEDIAQTKSSYTGKYLKEVLERDKQNTEDK
>SAR0812 uvrB, excinuclease ABC subunit B
MTMVEHYPFKIHSDFEPQGDQPQAIKEIVEGIKAGKRHQTLLGATGTGKT
FTMSNVIKEVGKPTLIIAHNKTLAGQLYSEFKEFFPENRVEYFVSYYDYY
QPEAYVPSTDTFIEKDASINDEIDQLRHSATSALFERDDVIIIASVSCIY
GLGNPEEYKDLVVSVRVGMEMDRSELLRKLVDVQYTRNDIDFQRGTFRVR
GDVVEIFPASKEELCIRVEFFGDEIDRIREVNYLTGEVLKEREHFAIFPA
SHFVTREEKLKVAIERIEKELEERLKELRDENKLLEAQRLEQRTNYDLEM
MREMGFCSGIENYSVHLTLRPLGSTPYTLLDYFGDDWLVMIDESHVTLPQ
VRGMYNGDRARKQVLVDHGFRLPSALDNRPLKFEEFEEKTKQLVYVSATP
GPYEIEHTDKMVEQIIRPTGLLDPKIEVRPTENQIDDLLSEIQTRVERNE
RVLVTTLTKKMSEDLTTYMKEAGIKVNYLHSEIKTLERIEIIRDLRMGTY
DVIVGINLLREGIDIPEVSLVVILDADKEGFLRSNRSLIQTIGRAARNDK
GEVIMYADKMTDSMKYAIDETQRRREIQMKHNEKHGITPKTINKKIHDLI
SATVENDENNDKAQTVIPKKMTKKERQKTIDNIEKEMKQAAKDLDFEKAT
ELRDMLFELKAEG
>SAR1119 uvrC, putative excinuclease ABC subunit C
MEDYKKRIKNKLNVVPMEPGCYLMKDRNDQVIYVGKAKKLRNRLRSYFTG
AHDAKTTRLVGEIRRFEFIVTSSETESLLLELNLIKQYQPRYNILLKDDK
SYPFIKITKEKYPRLLVTRTVKQGTGKYFGPYPNAYSAQETKKLLDRIYP
YRKCDKMPDKLCLYYHIGQCLGPCVYDVDLSKYAQMTKEITDFLNGEDKT
ILKSLEERMLTASESLDFERAKEYRDLIQHIQNLTNKQKIMSSDKTIRDV
FGYSVDKGWMCIQVFFIRQGNMIKRDTTMIPLQQTEEEEFYTFIGQFYSL
NQHILPKEVHVPRNLDKEMIQSVVDTKIVQPARGPKKDMVDLAAHNAKVS
LNNKFELISRDESRTIKAIEELGTQMGIQTPIRIEAFDNSNIQGVDPVSA
MVTFVDGKPDKKNYRKYKIKTVKGPDDYKSMREVVRRRYSRVLNEGLPLP
DLIIVDGGKGHMNGVIDVLQNELGLDIPVAGLQKNDKHQTSELLYGASAE
IVPLKKNSQAFYLLHRIQDEVHRFAITFHRQTRQKTGLKSILDDIDGIGS
KRKTLLLRSFGSIKKMKEATLEDFKNIGIPENVAKNLHEQLHK
>SAR1573 xerD, integrase/recombinase
METIIEEYLRFIQIEKGLSSNTIGAYRRDLKKYQDYMTEHHISHIDFIDR
QLIQECLGHLIDQGQSAKSIARFISTIRSFHQFAIREKYAAKDPTVLLDS
PKYDKKLPDVLNVDEVLALLETPDLNKINGYRDRTMLELLYATGMRVSEL
IHLELENVNLIMGFVRVFGKGDKERIVPLGDAVIEYLTTYIETIRPQLLK
KTVTEVLFLNMHGKPLSRQAIWKMIKQNGVKANIKKTLTPHTLRHSFATH
LLENGADLRAVQEMLGHSDISTTQLYTHVSKSQIRKMYNQFHPRA