GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Leptospira interrogans serovar Lai str. 56601, 56601
Gene type: CDS

Number of genes found: 228

# Leptospira interrogans serovar Lai str. 56601, 56601

>LA4353 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNPD
>LB303 putative transposase
MKFMKSNCLEHSIGSMARVLGVSRSGYYKYLNRHKDRSVAPELTNFLQEK
WLKSRKNYGFKRLFQEVKKSNLPYGARKVRKAMKHCKISGKQRKQFRPLT
TNSKHGGRIAPDLVQRKFHPNEQNRIWVSDVTFIRTSFGWSYLCVILDLY
SRKTVGWSLSDRNDSQLVCDTVLKAVLSRNPRKGLIFHSDRGSNFCSKET
RRLLITNGIRRSNSRKGNCWDNAVAESFFSYLKREIEYNTFYNIEEAEHL
LFDFIEVYYNRFRFHSTLGYISPEEFETNIA
>LA3608 conserved hypothetical protein
MFLVQCSAHRLTKAEGTAIGTQNEPNVINERKISYTAYVTLNVGNLEESR
NKIKSLIKNYKGFITRNSKKNALVRVPSESFEVFLSELKQLGDVENEEVI
GLDITDSYRDNLIKLESLKKIKIKYQDLIAKAVNVQDMLAIEKELERINV
EIEKLEGSKRASDMMVQYSSIYIDFNTEKPGPLGWIFYLGYKAIKWLFIW
E
>LA1461 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LB295 putative transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVIIRTGKLFDSMPENILNRK
LTQYGLM
>LA1286 conserved hypothetical protein
MILCFMSWNFPKFKASLHRRWEYLLPTGSVNETERIARKVVSESYLPKTQ
IPFLKIQHGKILLENANFRQALELLVRIPWVRDFRLNLGMFPFKKSFTFE
GFENALRKSNILPEEWGLRFRSQVKGQNQWNSGNLQTLWESSIPLSSRIK
TTELSALLVENEIILNLSLSGEPLNQRGNFIPLSKSAPIREDLARFIIQK
MHHILPDPDGIFVPFAGTGTFVREAVDSILGIGFTHYTRNYLFQDMEEFP
NTTWDFLKKKIPKNSLENQIRVWWNELEDEIFEYSKNRIKLYQDFLNNLG
FSGSFRFIQDKGDFFHFSAEDIWNASGRCERVWMLLNPPYGLRLERGSSI
GFYKKLGERLKHWWGLPCELSGLILCPDEDCWSVLIRNMGVKTDTIHVTH
GGLDLRVVYF
>LA3435 ATP-dependent RNA helicase
MKFEELSIHPKLLSAIQEIGYTELTPIQEKSIPHGLEGKDITGLAQTGTG
KTVAFLIPVIHNILTKGIQGIAALVLAPTRELTMQIAEEAKKLLKHSEGI
RSVPIIGGTDYKSQNKDLEGLNGIIVATPGRLIDMIKSGSIDISNVEFFV
LDEADRMLDMGFIQDIRWLLHKCKNRKQTLLYSATLSVEVMRLAYRFLNE
PVEIQINPEKIITERIDQKIVHLGREEKIPYMTNLIINSKEEGQGIIFTN
YKANIPKIVYTLRKYGVPVTGISSELDQKKRLRLLRDFKSGKYRYMVATD
VASRGIDVENIDIVYNYDLPQDTENYVHRIGRTARAGRKGKAIGFCSESD
YVELEKIEKYLKQKIEILEVNEEYIQFPAGDFQSFIGGDSYDREKETHFK
QNGKRPHDNDRAPHKHDRNRKRDKRHTSHTHSTAKDSHKKKPAAAIQEAE
LFLQKADSVLSSEPKGKKSAKDQHRFQNKDKQSRTGRGAQNSRNQNKNQQ
FGKQYDKNKRNLFDINDTVKEDTKKKKVSIWQKIKSIFGG
>LA1051 hypothetical protein
MNFEYIESIMNKMISEVVGMGQSIPLILADQAVLPSSCPYGTYKVIQLVQ
DPISNASRSIQKLDPTNFKEIIRINQSVLINITFLHDSSIAVCWELSEKS
MDWFDSKEGTIECDKFGITPVLISPNIQDKTVLTESGIYIYKVSFDVLLK
LRKYNEQPGESTASAPTVEYQEEA
>LA2163 endonuclease III
MLNDFDGKIPKTIPELITLPGFGRKTANVVLSEVHGLVEGIVVDTHVNRL
SKVLGLTTKNDPVQVEKDLMSLLPEKYWRDISLYLIFLGRKSCKAHRRFC
EDCILKKDCPSSSIISGV
>LA1944 putative lipoprotein
MLLARIVMAGIIYRMKTGCQWRVIPNEFGSGQTCHRRFQEWERAGVFKKI
YKSILKYYDVKNKIA
>LA2649 Deoxyribodipyrimidine photolyase
MFQKENLVRVREIKQNPILEEKPYILYWMSMARRLVWNHSLDYSIHLSQK
YKKELLIYEPLKMNYPWSSLRLHKFILEGMSYNIKEAKRLGLTYRAFVET
PGNRIEEAFEKISSEAALVITDDYPAYIIPELLEQVSKKIKCKFLAVDSN
SIIPLTFYGEFVSAARILRPRVHKLFPEVWKFRSFHKPNKPFREKGDSWL
EKNPNSPLKKNIWFEGDVDQISEICKKCNFNFQNIPPVPGKKGGREEGIK
LLQKFLKRGLSGYAELRSNPKPPEESYSSFLSPYIHFGHISQEEIVSEVL
NWNLDGSWTPGVIIPENKNRKEGYFHPDPNVNSFLDELITWRDVGFLMFW
KKPSFRKDLSILPDWIQKNLDFHKNDVRPYIYTKEQLENSKTHDVIWNAA
QKELVLSGCMQNYLRMLWGKKVIEWSPDYQTAFEILEEFNNKYAYDGRDP
NSYNGILWCFGLFDRPWFPERNVFGNIRTMSSDSTKKKFKLQTYLDYVQS
LEERNDKLDSQLLFPT
>LB223 putative transposase
MMFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIR
ETWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRI
ATTDSNHGNRVAPNLVQRNFKPNQKNRIWVSDITFLRSSFGWIYLCVILD
LYSRKVVGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSY
ETRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAE
ELLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LB122 site-specific integrase/recombinase XerD related protein
MSELEVPKKNKHSIERLIKIIRQRNYKKATAYTYMKYNLDFLHFADKPAE
KITVKDLNRYMDHLKKRKVSSSTIQINVSSLKMFFEDVMKMNLFQDFQRP
VREYNNPNAITYKEMQNILKTASSNAKHELMCGLVYFGGLRVGELISLRW
AYIDTKRKSIQIKSPILSQSRTVEIPVELGALVKKYEREIVNSSSNTYLF
PGKSLGSHTTSRNVERIISEIGKNSGISSPVTVFTLRHSRALHLIADGSS
LNQVKDFLGHKTLASTESYLPVKKNLRAAVREKSRQDALKNIRKKFKTR
>LA0451 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2470 hypothetical protein
MLQNFLKINGILFILIISLGIFLLECFKTEQRSLEVSIAIPGNDKEKSLI
INESSDKISFHVLVKNVSNKEIRIWKDWNSWGYDNISFQIQTDQESISIY
KEGKAWDKNFPDFWSVKPNEFVILNVNFDVKTWPKLRELKFKNDKVKIKC
VYEIKEDPYSKEYNIWTGKIESNFVEASIYSNF
>LA1123 conserved hypothetical protein
MDLLYFNSDFSSYFRVTNRKEKLMLRHTFCHLPGIDSKEEKNLWEKGIYD
WKDLEIYLKTEPAPIRNLILDALEFSKKELERENFFYFFHVFSPKHHWRL
FPTIRKKLLYMDIETTGLGNDDRTTVIGTFDGYEYRSYIRGFNLDFFLEN
LSQDQIFVSYNGIAFDVPFLEREFNVRFRNNHIDIMFFLRSLGIRGGLKG
CEKTLGVHRPETAAITGIEAVNLWKQYVDYDDMDALKILEEYNREDTVNL
EILFIKGYNLKIKETPFYGEVIQEPLQLR
>LA0775 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA4351 Integrase core domain containing protein
MGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSYETRRYL
LNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAEELLFDH
IEVYYNRHRSHSSLDFVSPVQFEVNAA
>LA4251 ADA regulatory protein
MSHYQKIAEAIQFIQKNAISQPELDEVAKSVNLSPFHFQRLFTEWAGVSP
KQFLQYITLQNAKSILSKPQTTLFDAAFETGLSGTSRLHDLFVKIEGMTP
GEFKNGGENIKIRYSFQRSVFGNYLIASTEKGICNLFFYDIPEEQIVSEL
KEQWNRADIIEQMDENQNRVIRFFDKTLNGHEKIKLHLKGTEFQIKVWEA
LLKIPEGQLSSYSDIADLIGQENASRAVGTAIGKNPIGYLIPCHRVIKST
GGIGEYRWGSERKMAMIGWEASKVKI
>LA3956 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINLGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0683 conserved hypothetical protein
MFIIVRYHVETITQEGRARLRKVAKTCESHGPRVQNPFLNANWNQPIIYN
SKQNFLKL
>LA3253 hypothetical protein
MATQQSRPRRKLNISLFTNCSMGVLVLSPDFVKSFRLFHFRRKEMESQEV
KYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTLNDIVGL
DFLASLRANSNASLLFGNLAHHESLDWISIF
>LA3668 Uracil-DNA glycosylase
MDEEIVMSKEEKLKRLGLMQSEVSACKLCKLETTRTQTVFGEGNPDAELV
FIGEGPGKQEDLTGRPFVGKAGELLTRIIEKGMGVPRESVYIANIVKCRP
TVDMKFEKDRPPEEEETRACAPYLLRQLEIIQPKAIVTLGNPSTRFILNT
KEGITKLRGTWGSFFGIPVMPTYHPSFVIRNGGENSPLKRDVWEDIKKVM
DLLGWKRPS
>LA2858 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2493 transposase
MNRSIGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSLKIA
RLIQRHPIEELPTVPIPNDEEEDNRRLCSEHENWTKQLTQGKNRLHSLFT
QAGLTHITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVELNLKLI
EKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYIGDCKRFSSAKQAAY
YAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGKVKE
FYQRLYLKKGAKKSIIATSRKMIEVLYAMIRTGKLFDSMPENILNRKLTQ
YGLM
>LA0307 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRHPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTHITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYIGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYAMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3059 hypothetical protein
MGDCKRFSSAKQAAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQA
AWSLVRCQHGGKVKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKL
FDSMPENILNRKLTQYGLM
>LB231 putative transposase
MKFMKSNCLEHSIGSMARVLGVSRSGYYKYLNRHKDRSVAPELTNFLQEK
WLKSRKNYGFKRLFQEVKKSNLPYGARKVRKAMKHCKISGKQRKQFRPLT
TNSKHGGRIAPDLVQRKFHPNEQNRIWVSDVTFIRTSFGWSYLCVILDLY
SRKTVGWSLSDRNDSQLVCDTVLKAVLSRNPRKGLIFHSDRGSNFCSKET
RRLLITNGIRRSNSRKGNCWDNAVAESFFSYLKREIEYNTFYNIEEAEHL
LFDFIEVYYNRFRFHSTLGYISPEEFETNIA
>LA3181 conserved hypothetical protein
MDQYGNIRAMGIHSLLYCERLFYLEEVEGILVADDRVYAGRTLHEELEPN
EDSSGRIESFHYTSEKLEVSGKVDRIQKRDGDWIPYEHKRGRARIGTNGP
EAWESDQCQVTVYALLLEEATGRNISEGKIRYHGSKDLVKIEIDEELRSK
ALKTIDRAKGLSTSTNRPPVAQNENLCKNCSLAPVCLPEETRVITENEYE
PIRLFPEKREKTTLHVFGHDSRIKKSDNVLLVEKVTETGEKSKSEKIPIQ
EIESVNIHGNCQISSQMIKFLVSEEIPVHWFSGGGNYIGGININPSGVQR
RIRQFKALTKETIRLNLAKKLVSAKCESQLRYLLRATRGKDETRNETESY
LATIRSGLKNIESADSPSQLLGIEGSSARAYFSGLPALLKNSDPFLVPNG
RSKRPPKDPFNATLSFLYSLLYKSVRQAIIAVGLDPSFGFYHTPRSSAEP
LVLDLMELFRVSLCDMTLIGSINRKSWIDEDFEITKNKVWLSESGRKKAT
QLYETRLDDTWKHPVVNYSLSYYRMIELEVRLLEKEWSGEANIFAQARLR
>LB164 unknown protein confirmed by proteomics
MACHEIAALRLGMMNLIGIKDETTIRHEQSEIGTVLESPGPIRSLAEAKD
FESLIKFYEISLTDLEEKISKMKKDDPKMAYYRSLLILTKKIEFELKNSV
FIFQNFFRDLEEIHDFVHEIFPA
>LA2436 hypothetical protein
MAASRTWRRGESNPRPFECHSNALPTELRPRLRSRFLPISSKSIQIQECL
RMSESIEDLQKKLKIQNDIIKGYEKVLRLNEQELENADEIIRMYEGIIQY
SGKELKDVKEAFDATNVVTNLSREELMSALSRIKELENVNKKLREEALKF
QTG
>LA1883 conserved hypothetical protein
MKQKTLFLTIFSLLCAINCNRDSDVLASFKSGTVTREELRAYYKLRGIEP
DPNTASITTQAKIVEEIGIQKIAETNNQNTNIVTKDEYDRIMNFVEPQVA
FNDYRKKFSEKLITSGMLEFAFGRILFVKADPDKPQVNEERAKTLFQEIQ
KLKSDREISEFITKNTDEIQRKAIGGRLEPHCINCGNDPFTSILKEAAQN
KGTFILKQPSDPDPSIPKQTAKDYYIIKVERIEKIYPKKIDKFFQNELDK
LKTLALKYVSKEGITEEEKNSAKFYSELAVNERANQTAEHYGNRFLREAW
KNEMDSLKAKSGLKIVDLTPKFIKDLTSETILFEDQKGSKFSFKDLIAEF
NKIAPILQKRKGSPEEDKNDQLSFYAQIYLPIRISSESKEIQSLKDTKEF
KKTLPLLGRSVLFMLIRNRSIDSEVNITEKEIRDTYEAGKLYAYSKPSST
NPNERIPEDFGKVRERIKQELVESKKQSIFQDYISKLKSENEFKISSESL
KAGQI
>LA3204 hypothetical protein
MNFHSTFPKKEVSFSIQSPETIRTAIEILNIPVSALGGKRKIVKHTNEYF
HLFHLYQFLQNKNFTNI
>LA0636 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3195 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCPEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINLGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA4036 ATP-dependent RNA helicase
MEEVLQLSLDFEPKSSSGYRGDFCYLTNSPESGIGKIVSSAEGKFTIEFS
VSQTKKTVSENSTYLRILPGYPSSLRNVEEYSSLMNLALTAYELKLTHAF
DKLSALSNSRTRLLPHQIESTYIVVNSLRPRFILADEVGLGKTIEAALVM
KELIFRRGYKKVLIVAPSPLLVQWQQELKNKFNEDFKIVKRKNFHTDGEK
NWRNFQHVITSIDFIKNPKYAEEILKTKWDIVIFDEAHRLRRDYHKITRA
YLFAEKISKKCECLLLLTATPFRGKLEELYYLMHLIDPNILGPYHTFVND
YILGNKADLKDKISKVLLRRRKVEVGGFTKRFAKTVRIELSSVEREFYEE
TTNYVRREYNLAMRTQNRAIGFVMIVFQKLLDSSVFALLSALTKRKFLLE
NKFHHIQKMESNLEEWDLDETEDVEEFVSGLDESVQLDLQSLKRELLSLN
RLILLGKKIKEDKKSIKLKETILKLQKEGHSKFIIFTQFRTTQDFLASVL
SDFQVTLFHGSLSADAKEKAIVEFKTKTEILICTEAGGEGRNLQFANVLF
NYDLPWSPLKIEQRIGRIHRFGQKDNVFIFNFASKDTVAERILEVLTNKI
RLFEESIGSSDELLGAIEDELDFNSSFMKFVTGNKSKTEMEEEIDNRIKI
AKKGFEKLGALVTPKLIDFNLQDYYSHTLEERSFNNTHLEEFVSRFTKIF
PEEAGFQLLRKKPQIYEIDSPQYKGKYGTFDSELALQNDSLEFLAFGHPL
IDKTVSYLIQNQKGWSTAFHSVSNKEYYVFLVEFQFTLNRTELFYFETNP
NTGSVKQIETLPEELREYNYSVRPSDISSHPELPSRLEENLIRTFISLDE
IVESRKKELGDQTLDLFQKEEFKIRTSNQNTLRQLEEKLMRQEAAFKWEG
KPEKKSAMNRTRNEIQKVKEDFDRELRKVRNGKTIQHRFQLFQVYLPS
>LA2204 hypothetical protein
MVFKNMKKGIFSILFLISCSIKPNISSEMVQKGKNLEKIPLVSLDEFFQL
WLRNQKYPKMAGINFEKLFEDKEFQYFGKIEWNRFIPISKWRFFKIQKEI
LSKEFPNYESVFRQDFSGHFQNQVLPESDRKLYLDIKAKVIDKEYCIDPY
QYSYSLVENKIVLTIKWNVESCEELILLKDKTYRLVYDLRKKQFEK
>LA0561 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2871 conserved hypothetical protein
MNANEEPMNTPTPESQISPVSIESAITIVVSTDQLSATITVRPYNLKGTI
LSKDKLWGVIQDWGIHRDRVLTDEIRKILNLLERAAQKGDFTPIKMEVAK
GVAPVPGENGWVRFYHPMSKRVKLLEDGRADFRNIDRYINVKVGEKLATK
FEGIPGIPGFDVFGNIIPAPSIKKPKLVVGNHIEERNVIEEGKNLQEYFA
SSNGVIFATEVSITVSPELQIAGNVGLSTGNIQFDGNVIVRGDVEPGSVV
ECTGSLVVYGNLESNQIKVGQDLIVHGGIKGSGTEIILVTGRIQTKFIEN
SYLETEGDIIVEGAILNSTIDTLGSIILNGQNGNLVSSKIRTNEGISVVS
LGSSAELDVTVELGFHFKNDRSFQEISRKIQMGEKEMEKILPKIQQIKHM
VQRFRGNLPEDKKIAFKVVFEDYNKKLKILNMLKFKQDSLKSSRFNPGAV
RLAVQKGAYPGAVIHYRRQIEKITKFQSSFMMVFEPGQEKAIMVALQTK
>LA1479 unknown protein
MRRDQLKILLSGILVLLLGSLIVYSYLFREDISKFLKKKESEEVSNNSKI
DRVILSSDVNSESHNDLNTLPTEKKVPGEFSNHESESLPPNKAVQEDWAE
ETKQGVIPEEKLREETNNYKNVNKGKAKEKTQEVSTNVPKEETNVYKEEP
KKERMKDWENKFEKVDRKTERRFPKKSKMKKHSTSKTGKKIRSLENRVDR
LERKLGISHTKKKHKTHKVDRKSLEKRVQKLEREMERLKTKE
>LA0294 conserved hypothetical protein
MVEYLGPSPEDADSILDSKPFLDAKFDYLALGHIHSERSEKIGSLLIAYP
GSPRIVSSGEFGPRSVNIVTLGKNGTPVLQKKIISSAGEFKEFSLSANLA
GQIPELSKIPALISELDFVRIKISGIVEDEHIVSETLDNFSKSLICRKLE
IKTNDLKTSSALIDNPIAKIFYDKLMNKKSNWNGQNSPDWNEVLVLGLEQ
IEEFSGKK
>LA3774 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVNILKKFAAMLSREQNPD
>LA4049 ATP-dependent RNA helicase
MKGTSMKKLKFSELNLSAEIQNAILEMGFEEASPIQSEAIPVILKGKDII
GHAQTGTGKTAAFAIPTIELLEVESKHLQALILCPTRELVIQVSEQFRKL
IKYKGNFEVVPIYGGQEIERQLRALRKNPQIVIATPGRMMDHMRRGSIHL
DEIKIVVLDEADEMLDMGFREDMEFILKDTPADRQTIMFSATMTDDVLTL
MKKFQNHPQIIDVTHQKLSAPKIEQIYYEIQENAKGEALARLIEYRNVKL
ALVFCNTKAQVDTVVELLKSRGYFAEALHGDLNQKQRDKVMSGFRKGSIE
ILVATDVAGRGIDVNNVEAVFNYDLPRDGEDYVHRIGRTGRAGKKGIAFS
FIVGKQIYNLKKIERINGIKIEAGKIPTLDDLEETKIHSYTSKVRSIVDA
GHIGKYVNQVEKLMGDDYTALDIAAALFKMTILKDSVTFDDSVQFESDFK
YEEKHSTKKKSGGGRYRDRNFGRSGSGSRSKSNSNSHFGNNSKDGSRSKF
SKGDKNQKQGGGATSFKKKKK
>LA3482 transposase
MRPCFPANRIPIEVHEIECLEHSIGSMARVLGVSRSGYYKYLNRHKDRSV
APELTNFLQEKWLKSRKNYGFKRLFQEVKKSNLPYGARKVRKAMKHCKIS
GKQRKQFRPLTTNSKHGGRIAPDLVQRKFHPNEQNRIGCLMLHSFGLRLV
GVISV
>LA2668 DNA modification methylase
MSTALKRKERKIQIGEFWTSRQRQSHSIHYSVSYRASFKPELPAFFLDKY
LSKHKGIVLDPFGGRGTTSIQANLDGHSAIHNDISPMSLFLAKSRQTIPS
LESMEKILNRLDLKKKTKEEKEDKDLLAFYHKDTLTEIKNLKRILLTDLS
PEIQYLGVTALSRLHGHSDGFFSVYSFPQISIPPEAQRRNNQKRGIKPEY
KEIKSRILQKMKRDLKIPLPPFYHEFSGRNLYTNHSSLYLESLQDGTVDI
VITSPPFLDKVNYEEDNWLRYWFLDIKLPDHKKPSIFSTLNAWTDFIHDT
LKELSRVLKSEGICVMEVGDIKKGPTVFNLDEYVIQAASGSGLEWETTFI
NDQKFTKLSNCWNVSNNEKGTNSNRCVVFRNFK
>LA3182 conserved hypothetical protein
MKHWRLVSYDIREPKRLRRVAKIMEGFGERIQYSVFRIYSTDKELEKLRW
KLAKVTEEEDNIFYLTLCTKCASGAHTQEKKSAWPEAPKTLKIL
>LA2602 unknown protein
MGSLIPFLQDQSKELDRIQNEQVEVYGDTLENSLRLAAKHLKKQVHELDY
VVLKRGKKKLFGHEPWHIRVSILPEDNFLDELTELDQKLTGGSGKLVSKD
LKDLIQPKDKNGKALVQILRNGVYLTTFAPLGDGHPVDLDEVFKKLSLKG
VSGEDGKLVRKIVKEAKGEPILISQQKPRPGMEAKLILDISPDKMKAKVT
ILPPRPGGRDFEVRDVVNHLKNAGVKYGFKEEEIQRKLEEEFYNQPFIGA
EGDYPINGKNAQIIYHVRTSKNISFREDESGRVDFKDLDLIENVVVGQLL
AEKIPAEKGKYGRTLFNELLPAKDGADTDLKQGKGTILSEDRSKLTAEVN
GQVVYATGRLSVETVYRVNGDVGIKTGNVTFLGSIVITGNVEDNYSVKAA
GNIEIYGTVQKARVEADGDIIIRQGISGREEAHVESTGGNVIAKFIQSAT
VITEKDVMVQEGVLHSFVSAGGKILCNGKRGQIVGGTVRASELIAARSIG
SSANPATELVVGIDPKVLKQIADYEAKMHESQAKHEQVFKSLKTLQARKE
SDPASFTEEHENQLSKMQKAVDKLDSRIKEFETEINNLKNYMEEKSSHGK
ISIEKVLYGGVTMRIRNSDFKTRNEIKNKTFVEENGMIRQVPYEDPEPDK
KDWRKKRNRGN
>LA0226 conserved hypothetical protein
MRSSNFISPKRFSHIYVEESAKNHPKTLEILSKFPKSYTIPINSYKEVFN
PSAQNFQAQKRSPKLILAKRKEQFLYSGSGVAPDFGYRFFYYNALVLNCL
YNCSYCYLQGMYPSANIVIFVNNEDFILETKEQLTFSKPLYLCISYDTDL
LALENTLGYCKEWILFASSYPDLIIEIRTKSANFKSIADLKPVSNVILAW
TLSPDSVIQEHEPLTPRLSSRLKNIKEALSSGWQVRLCIDPILNVPDWKS
VYLEFIHKIFEEIPGEKLREISLGVFRMNLDYFKNSKKRRPDSYLFYLPM
NTDSGMKSYPEDLEKEMFAVVEKELELFVSKEKIHRLFANEIGFK
>LA1589 MutT/nudix family protein
MNSSIGHAVKALIYRNDQRILLQQRDYTPGIIFQGYWTFFGGQVEFGENL
KDALCRELKEELGCLPGSIGEELFHWEWRGEQITCNHCLPVYFEVKEDVL
TLNEGLAMKWFLWEELDERLPLVPGVSENLYKIKSFLDKIFLNR
>LB224 putative transposase
MNMNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYL
KKSEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNP
D
>LA1547 modification methylase
MKRTIHRIHFRDSRETFPLDSESVDLVLTSPPYPMIEMWDELFFGFSREI
QKNFLIDPNLSYEQMHFELDKVWKESFRVLKNGGFLVINIGDATRNTAFG
FRIYMNHARILQGCNSIGFQSLPGILWKKQTNSPTKFMGSGTLPAGAYVT
LEHEHILIFRKNNRRKFSTKSERLSRMESAFFWEERNFWFTDVWDFKGKK
QGLSSLLAGRERSAAYPLELANRIILMYSLKGDIVLDPFLGTGTTTLAAI
GNCRNSIGFDLEPGLLKVQLENLHSIKDKLNRIIEKRKNDHDVFVQNRQN
EGKSFLHFNQNLQTPVVTKQEKFLNLERITKLFRNSGNEIEAEYFPLLQT
ELLPQFESIPTVHP
>LA4093 MutT/nudix family protein
MRLDFQSLKERLIIPQENFIGIPIPPIGQEKSRASSVILSIYEESDRSQG
IILQKRNSNLKTHPGQISFPGGAYSPKDKNLLNTALREWEEEMGESSSFL
EVLGEYNGIFTFTGFHISPFIAHYKGSFLFNTNPEEVERFILLDLNLLES
SPFYSIRIRRSGANEIEIYYFDLKEGLLWGATGRIIVNFLREHANFNREP
IFVEPNLSSPPFFDPSRKFSKKN
>LB346 putative transposase
MNMNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYL
KKSEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNP
D
>LA0087 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3680 conserved hypothetical protein
MSKKEYSKNHTDQIEILKEFQTLPGVGKSVSMDYWNLGFRSLEEIRIADP
EKLYVLCCKLHGGYVDRCMLYVFRCVHYSLNTKKPNSEKLKWWNWKDTEK
NQKSKR
>LA2652 Integrase core domain containing protein
MMFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIR
ETWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRI
ATTDSNHGNRVAPNLVQRNFKPNQKNRIWVSDITFLRSSFGWIYLCVILD
LYSRKVVGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSY
ETRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAE
ELLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LB345 putative transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQSKNRLHS
LFTQAGLTHITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYAMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3900 hypothetical protein
MATQQSRPRRKLNISLFTNRSMGVLVLSPDFVKSFRLFHFRRKEMESQEV
KYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTLNDIVGL
DFLASLRANSNASLFFGNLAHHESLDWISIF
>LA1128 unknown protein
MAQSIRIKNILNKTKRRDPWFLDDYTINPYSGCSFRCLYCYVKGSKYGLN
VEDKLSIKENAVEILDKQLSNLAKKNRHGIIVLSSATDPYLQLEKERGLT
RELLKIILKYKFPVHILTKSDLILRDLDLLSEIEKNAILPVDLQNQLSRK
SFITFSFSILDDSIAKIFEPGATSPSLRISALKETLKEGFFSGVSLMPLL
PHISDTGENLEFMFQTFQEIGIRYIFPASLTLFGGNDPLDHKNLIFKAIE
NHFPHLLSKYQKFFSKNFRMPNFYQNALYHKTSELCSKYGLQKGILTTEF
>LA1749 transposase
MFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIRE
TWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRIA
TTDSNHGNRVAPNLVQRNFKPNHKNRIWVSDITFLRSSFGWIYLCVILDL
YSRKVVGWSISNSNDSKLVCTALSKVIECRNPPKGLVFHSDRGSNYCSYE
TRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAEE
LLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LB048 putative transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTHITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVELNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYAMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2679 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRREYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNPD
>LA2298 conserved hypothetical protein
MNPFMPDLLKDLYSKNVLERIAISFSKEIPSISEKEWIQKFKQKDWKQLE
LKQRIRRIGEVLAKVLPKPFPQSLLKITDSLEKSFEGKEIFLTIFLGDVV
EILGIDYPKKSMQAIERITKLISCEFSIRPFLIRHPELTWKQMLEWSSHE
HPGVRRLSSEGSRPRLPWGMGIPGLKQDPEKTLPILENLKDDKDEVVRRS
VANHLNDISKDHPDLVLKIAQQWIGFSKERDLLLKHALRGLLKSGNPKAL
AIFKFDSDVKVKISNLKLKSKTVKIGEDLFFSFTVRSEQSKQTRLRIEYK
IQYAKISGKTSKKVFQVEERLFQPNESTSYQKKQSFKQMTTRVHISGKHV
LEIYINGNLKSKTDFQVIV
>LA1361 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0702 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0478 hypothetical protein
MDKTEFSSGLTSRYLKIQETLSSLQGEFTREQMKLGILNEGNTPNSELIN
ILFGETPLFRELAENPEIDQNTLKEKIQEKKNKLTDTIRNLEVESENIFS
VGMLKDQNHFSKSLETISGKSVQMKQLSEKTIERLIKE
>LA1441 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3619 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AGLLRWFGSES
>LA3453 unknown protein
MEIYINEHLIDSSLENEKKLGEVFGEVNKWVESKGKYLLSCTVDGVEFQT
SIMNDQEIESVAKMEFFVGEEMDFLVSTLTELDLYVDKVGSTLFGRDSLT
EKESRELSDGIKWIGSVLDSASNLLHLKLDQIKPMGTGNTVSQILAEISS
NCGSLDNTETIENFLEHLRDLKLFIMDLIARTQVLDLDLPTLKEILNTFI
ENIGGLKEAFVKVNESYQSGKDEVAIELLTQSISQINVLLTSFITLKLKK
PDLDFSEIEINGIGFEEKTGELNEILASIAVALEEKDIIRAGDSIEYELP
GTLDEILPFLKLIREKIS
>LA2098 putative helicase
MKDRNFSDSISELEFVKTELEREKKEEDSIFSKDWLSRPIPDRVRQGITL
YPIVYEEQTLGREGNWILTFRFSNQEEYPIKFQTGAPVQLGKNEDRAKAI
LVSLHKEKIKISIEEVPEWAEEGKCFLDLLPDETSYKEMFNALDAVRLAT
KGTRLYVNREFLLGYGKPDLISTRDSDRSRILGRISTSLNESQKNAVIHS
VLSEDVMIIHGPPGTGKTTTLTEIVSQLVAEEKKILVSAPTNSACDLLVE
SISARGILVLRLGHPARINEIAIHSTLDYKLFHHPDGKLLNEYRKDVIEI
SKQAKKFKRNFGEKEREERKKLFTEVKELKKTIRSMEIGLIDSLVSSHPV
IVSTPVASARGILENRTFDFCVLDESSQALEPAFWIPILKSDRVILAGDH
KQLPPTLFSEKNYLETTLFEKAVENLESYGRVFLLDTQYRMKDEISAFPS
KEFYSGLLKSGRSEKERKSNFPKTFPFLNAFQWIDTSGTDSEEVILDDSI
SNPFEADLQVRLCFLLKENDWPEDEITILSPYRAQVRLISEKLRDVGLTK
INVSTIDSFQGRENRCILLGFVRSNLEGRSGFLKESRRINVGMTRARDLL
LCIGDSSTLSQDPFLSKLIRFAEEKEVFRTAWEF
>LA3882 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA1676 Single-stranded DNA-binding protein
MANDINRVTLVGRFTRDPEFKSINGTSLVNFSLANGRTYVSNGEKREESH
FFDCEVWGKPADIIQQYCKKGKQIAIEGRLKQDTWETPEGKKASRIRIVV
ENFQLLGSKDDSSSGSSVSSSPASAGGNSYPSSPEYYNAAADGGDDDIPF
>LA1778 conserved hypothetical protein
MICLKDNLERPHEALEYKTPEKIYMPSERVFPLRIPEIAYATNIVVETVL
DDGTAKYGPYRIFFGSPLIGERVGFEEVSERLCKIYFTNAFWVIDTFTGK
VLQYKNPMPIH
>LA3832 DNA-3-methyladenine glycosidase I tag
MWIKMKKQKEPKRCAWVTEDSDYVKYHDQEWGVSVHDDRLLFEFLVLEGA
QAGLSWITILRKRENFRKAFDNFDVVQVAAYKENKIQSLLKDKGIVRNEL
KIRSVIKNAQEFLNIQKEYGTFDRFIWGFVNHKTIYNSWKTIKDVPNKST
ESDAMSKALKKRGFKFVGSTICYAFMQATGMIMDHTTDCFCLLTKNLKKL
L
>LA3830 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3718 transposase
MNSCSVSDLPWLWFLITENLIFVFLWKFSPKQVLHVFKILKVQWSHFLKL
EKKKGRPSLRKYWEKYYHIKDMLVNNITWGASRIHSELLLLGYDISLSSV
KRIIRRIQKRNNPFKGHLTTWLNLIYQIKEYTVATDFCRIQTIYGTTLYS
LSFIHIASRKIVHLNITTNPTRDWVLKQIQEAKSLFPEFAILLWDNDTLF
SGRKLLNGLESLGIQSLHTPMSAPWCNGIMERWFGSVRRECLNHIPIFSL
GHAQAITSEYVNYYNFWRPHLALNKDSPCGRAVTFSSYTSKVIKRKVLGG
LHHIYINVEAPFQNVA
>LA0684 conserved hypothetical protein
MIRKAQFKKSEIEKFRLEIARSIVAGKLQNCRSVLSRTARKSKNESEKQD
IKEAIGKIEKNISLLEKAESIESIRGYEGASAKTYFSVFDYCIIQQKEDF
QFHKRTRRPPRSRTNALLSFLYSLLTNDCIAVCQAVGLDPYIGFLHDERP
GRPSLALDMMEEFRPFIDRLVFTLINRKQIQVSDFLEKPGSVFFINDDSR
KELIKSYQERKKEEIFHPWLNIKSTVGELPYLQARIFARTLRGDLKYYIP
FIWK
>LA3249 transposase
MKSQEIKYVGIDCGKKTLEVIRIGDNSLHQRQQFSTTEIGISKLINWLNP
NDVVGLEAGSQSFRIAKSILNKGIQVIVLNPGDLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTRQLTQSKNRLHS
LFTQAGLTHITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEEEIKEALKKNKAYAQTIMSMPGVGMITSLAIKANSISHSLWVVR
>LA4314 hypothetical protein
MSFFHPGIFEGHFMSQITGFFSELKTSFDNLSQSIQSFLNTIEAIRSFLK
ILFSIIPLDLFLVLIFSLVLVYLFNTISPTTTRLNYTLGVLIISVLRAFF
HQTLSQTWNLGPVSLTAIFLLIPAYLVSSLRFGFYFLKKIQKRKNELNPK
NFEAGLNNIQKSFYTLMAKSYEELRSTDGKSSLDLNVLKEQITELERTIQ
GLKNLLDSEKK
>LA1748 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNPD
>LA2513 transcription-repair coupling factor
MKDLLNMIGDGVFSRFESSLKKKNSTSKIKSSFADSSNLKTQSNLTLKTQ
SASASEKKISVNTEVVGSVYSVTTGSHSILASSLFQKLNQTILVVSENNT
SAEFLFREALSFLPSNDLIYLPGQEVLPYEYMRYPSEMKRERIKAIAKIL
SGEPVLIFTSVSGFLKTLPPIQTMQGRAIVLKKGKEIDLESLLIQLIDLG
YKRVQVCETFGEFSLKGGILDIFSSYSTEPVRIDLFGEEIESIRTFDPDS
QRSMTDLDQAVLLPADEYILSEEQKKEYQNFLKSSDSSLHLPEIPEGNYG
IYYEELIPLVRENHGILSYFSEPPILIFPSANSVKERLFHLEKEYLSLFE
KRSREVLCAPPEKLLSFGEEFKVLSESIGLSFVGLPPRNENDLVSLLKEA
PSFKGKIREVREKISELRAKGGWKIVLTSSFEAQTKRLQGLFEKEGVILL
NEDSTEPLPFHLGNHKSDTFLVLSELRNGFILENQKILILSENDIFGREY
KRKTRFKKQNSKALQSFIDLKEGDYVVHIHHGVGKFLKIERTSAGGKERD
FLKLEYSGGDSLFVPLDQISLVQRYIGGTESPRLDSLGKSTWKKTKDRVQ
KAVEALAEDLVQMYSNRLKLQGYAFPPDTIYQEEFEAEFEYEETPDQIEA
IEAVKKDLESSRPMDRLVCGDVGYGKTEVAIRAAFKVAMAGRQIMMLAPT
TILALQHYNTFKNRFENYPVRVELVSRFKTPAEIRDILADFSAGKVDMVV
GTHAILSSKLKPKNLGLLIIDEEQRFGVNHKETIKKFKNLVDVLTLTATP
IPRTLHMALTGIRELSIIATPPKNRQSVETYVLEEDDDLISDAIRNEIQR
GGQVFYLYNRVETIEEETNYLSKLVPEVSIGILHGQMTEDEIEETLLDFY
NRKYDILVTTTIIESGIDMPNVNTLFVKRADLFGLSQLYQIRGRVGRSDR
KAFAYMLLPKDRVVTEQAEKRLNTIFEYQELGSGFKVAMRDLEIRGAGNL
LGKEQSGDIMEVGFDLYVRMLEDAIARIKGEEIVVEVRTSVTLNTNFFIP
ETYISDTRQKIEFYKKLEGARDLDEIEEIYSEMLDRFGEPPEDAKTFILL
EKIRTLASNLGFEFVTEMKDEIKMKSGSYFRGDHTKIIQLISARTGLTLN
PKEPNVLIFQTEKKLEKEKLDTLIFLLSEMLPSKKV
>LA3899 transposase
MNRSIGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSLKIA
RLIQRFPIEELPVVPIPNDEEEDNRRLCTEQENWTRQLTQNKNRLHSLFT
QAGLTHITKKQSLSFNRISCYKLRLLF
>LA1129 DNA alkylation repair enzyme-like protein
MILSKLSDSDESLWIFIIESEFLKKKIKFPLLEFVGKELYFKIPEMNQIY
FTDQIIKLGHMGGYVISAIILQLRMEKHFEQSLNKAVEYILLGNEWYVCD
IIGERIMGYFLLKEPEKTLPILKNYINDKNGWIVRSVGVASHYAVKKGLG
KKYVEVTFYLLLSKTDTKDFHTKKGIGWALKTISKFYPDIIQKFESNLLA
NPSIQPFFRKKIEIGLSRSSKYGSKYSD
>LB022 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRHPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQSKNRLHS
LFTQAGLTHITKKHLRTKANREISVALLPSRYQKEAERILQVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3167 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LB304 putative transposase
MRMKGRRKYSLEFREQAVNRTLSGSFTIKEVAESLGVSYFVLRQWKAEHL
KKKETEQPLPDKQLKESEELKRLRKENLRLKDENAILKKFAAMLSREQNP
D
>LA1593 conserved hypothetical protein
MINYNLQTKVSEEVYHSTQNFLGIQKGDIQELIEIAKGTKRQRARICSHL
NGNELLQEMFIVHPKGVYVRPHKHIEKPESMMILEGEVDFVTFNDNGEIE
NTMSMGAYHTGKIFYDSMRSSTYHTLMIRSEWLVFLEITKGPFRKEDTIF
APWSPEENDAEKVREFMNKIEQRLK
>LA0821 Putative methylase
MKILKVQTGKFKGKSIETPPAIAGNTNFTPAILKKSVFDIIGSLVLKGRL
IEEESAFVDFFAGSGQMGLEAVSRGFARVVLYELAWERSDNLRKLFSKFG
NNCQVYRKDVFRFYDKLDIPEMSRIFFLDPPYSFWDKKDEKIRTISDLLL
SEDTTVAVFVQSPVNPGWSGFETRRFGKNFLTFQIKIT
>LA1849 hypothetical protein
MKFFNNSKEYDKVKNILITKNIQKKKEWLKEYANTKGNLFSLRFVCSRYK
DGQVGIFVTDLPGSEFPREDIVFLYGKRWNIETHFSFEKYSLELENVASK
TSIRFLQEYYAKILTFNLTSL
>LA0269 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRHPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVELNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYAMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2914 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRHPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQSKNRLHS
LFTQAGLTHITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVELNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYIGDCKRFSSAK
>LA2381 conserved hypothetical protein
MSKLKKRKGDEGESIASNFLISLDHEILKRNYRFLHCEIDIISVKEEVLY
FSEVKFWKEFKFFDPRFTFNLAKQTKMRKAAKGFLAENLSFQNHFVSFCL
VSVNEKKGCKYYLDLF
>LB230 putative transposase
MKGRRKYSPEFREQAVNRTLNGSFTIKEVAESLGVSYFVLRQWKAEHLKK
KETEQPLPDKQLKESEELKRLRKENLRLKDENAILKKFAAMLSREQNPD
>LA2518 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAESILNKGVQVIVLKPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNKRLCTDPENWTEQLTQGKKRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2129 DNA repair helicase
MSKPLIVQSDRTMLLEVDNPEFEACQSVVSKFAELEKSPEYLHTYRISPL
SLWNAASIKMSADEIVECLEKFSRYSVPKNIVNEIREQISRYGKVKLVKE
ESGELAILSNEKGFLQEIGNHRAVQPFIESTFPDKIYIKKEYRGHIKQAL
IKIGFPVEDLAGYDEGNKYGFNLRPTSISGKKFGMRDYQRACVEVFHAGG
GNEGGSGVVVLPCGAGKTIVGIGVMQIVGAETLILVTNTLSIRQWRNEIL
DKTDIPPEDIGEYSGEVKEIRPITIATYNILTHRKKKGGDFTHFHLFGAN
NWGLIVYDEVHLLPAPVFRMTSELQAKRRLGLTATLVREDGLEEDVFSLI
GPKKYDVPWKELESKSWIAEAKCKEIRVNMEDDLRLKYSIADDREKFRLA
SENPEKMKAIGLIMKKHSESHLLVIGQYINQLEEISKKFNIPLITGKTPL
PERQTLYDAFRSGKIKSLVVSKVANFSIDLPDANIAIQVSGTFGSRQEEA
QRLGRILRPKGHDNTAVFYSLISRDTNEERFGQNRQLFLTEQGYEYEIYT
LDQFREAQEELAQLQLNNV
>LA1566 hypothetical protein
MATQQSRPRIGVLVLSPDFVKSFRLFHFRRKEMESQEVKYVGVDCGKKSI
EVVRINSENSLERRQFSTTESGINNLLQWLTLNDIVGLEAGSQSFRIAKS
ILTKGFK
>LA1082 MutT/nudix family protein
MKQFLPEEYDPHSNLWSKINRKDLVDTPIFKLVSWNITSPDKKISKDFFH
LESLDWVNIIALTPDNKIVLVDQYRHGIHRFSLEIPGGIAEKNSLLESAQ
AELVEETGYVSQDWEYLGKVTGNPAILDNWCHTFIAKNAHRLHKQNLDDS
EQIEIFETPIENIPKLIADHILHHGMVVAAFGMYFIKNPIRY
>LA0716 transposase
MKSNTKSFYHLTFSSCTTDFRTPLSTLRVGSAWIGIKHKKTTRYSPWQNC
YAERWIKTCRNEFLDFFIPINQYHLEKNLTEFVHFYNHHRTHLALNKDSP
VSSPVLKPPPGAKLGATPIIGGLYHTYSYNKAA
>LA0686 conserved hypothetical protein
MSNPIQNRYEFVYLFDVKDGNPNGDPDAGNQPRVDPETGNGLITDVSLKR
KIRNYVTIVKSATPPNDIYIKEKAVLIETHEKAYVAVGAKLETSKKEEKE
KRTGGDQVGKAREWMCKNFYDVRTFGAVMALKVNAGVVKGPIQFTFARSI
DPVINLEHSITRMAVATKKEAEVQDGDNRTMGRKHTISYGLYRAHGFISA
HFANDTGFSEEDLELFWSSLQNMFDHDRSAARGEMNCRGLYVFKHVGDEK
NTNQAKLGVAPAHKLFNLISVSKKDNSTPARDFSDYSVKIQESDLPAGVE
LIKKVS
>LA4025 hypothetical protein
MRDQNFDNLNKGKSMNIRIFSFLIILTIFGLGAAPNKVLTLEEKEELRQI
ETVRKGGFTDIEVDNLHASIAGNILKINNLLGNETYKKALRYIEDEPREA
AKFLFQDKENKQYLQLDLGLGQSFADYPKTYLYQSKIYIYPGTDGKSLEK
IILQFKRTNAKGEVFIREMRRLINNSPKGPTFLGDGKRTPNNNSEILLEF
FSSHDTDFLWPDNPIQPVPASVTTKLNEAVNPLPYNKQKQIILQYKRYLR
KVDKMVSLRLHTMELDQKMMISKMLEFQ
>LA1812 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNPD
>LA0489 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA4071 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2524 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0119 unknown protein
MSKHGFFQITQKLFLRKGDELLILRDRKSGLGDLPGGRMNENEFFEDWSL
SMQREIEEELGSQVQIRVSTKPLFIHKHKVNEGNFPCIIIAYHADYLGGD
IILSDEHDYISWEKVQTYEPSPLFTEYMLDAVNLYLKEYAPLVH
>LA0923 transposase
MGILRCFRGVDYLTAMFLLCEVCDFKRFKTAGSFMSFLGLVPGEYSSGSK
RKQTGITKTGSLRHFDGSCLAASLPRKIVAARRTGQPALVVALAEKASLR
LHKKFRNLQLRGKSPQVMITAVSRELSGFIWAAMNLAA
>LA2729 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3329 transposase
MFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIRE
TWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRIA
TTDSNHGNRVAPNLVQRNFKPNQKNRIWVSDITFLRSSFGWIYLCVILDL
YSRKVVGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSYE
TRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAEE
LLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LA3883 Uracil-DNA glycosylase
MKSRKLEYSNHIERLLSCRKCPNMQGNPVHGCVPVSKIISLGQAPGIHEE
RFGRPFAYTAGKTLFGWFKKIGIEEENFRSKVNMSAVCRCFPGKAKSGDR
KPDSIEVKNCSQFLEFEVRFHKPELLIPIGKLAIDQVFELGKYKLEDVIG
RSFSREFYGVQLDWIPLPHPSGLNVWNQTETGKKLIQKALELLKDHPVIR
KEFFR
>LA0433 conserved hypothetical protein
MKIFALSILVLVVLLVAFLFYMGAFNRVLVQEEMKGPFYVLSHERIGDYR
NVGLTFEALQKELPEKGIRNFKLFSIYLDNPNEVPKEKLRCEVGALFSEP
LEKIPNGLSLELKIRTIPSKKYLTAEFPLRNFLSIFLGIYKVYPKLFRAC
EERGCDLKGRASIEIYEPLTEHKTTYLLPLD
>LA2294 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0418 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3954 hypothetical protein
MWKEKYRGVRINSENSLERRQFSTTESGINNLLQWLTLNDIVGLEAGSQS
FRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSLKIARLIQRHPIE
ELPTVPIPNDEEEDNRRLCSEHENWTKQLTQGKNRLHSLFTQAGLTQITK
KHLRTKVSREASVTLLSDRYKKEAERILKVLDLVELNLKLIEKEIQEALK
KNKAYVQTIMSMPGIGMITSLAIKANSISHSLWVVR
>LA0585 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA2483 Integrase/recombinase xerD
MTSSHNNLLQNFQEYLSVEKGLSDNSIYSYGYDLNKFKNFLEKEHIDFLK
VQADDIMRFLNEEKDRKISSKTIAREVVAIRQFYKFLKDEKKLDTNPTEK
IETPEVMRSIPDYLTQDEIEELFASIKEDNLYELRDKCIFELLYSSGLRI
SEACNLRLNDMDLEGMTLTVEGKGGRQRLVPFGEKSLDILNRYLKQSRPF
ILKSRNCEYLFVSKKGSYINRKSVWRLLNHYIKRTSILKKVTPHTLRHSF
ATHLLENHADLKSVQELLGHIDIATTQIYTHMANKTLREVHKKFHPRG
>LA1565 transposase
MTQITKKHLRTKANREISVALLPSRYQKEAERILQVLDLVEQNLKLIEKE
IQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQAAYYAG
LVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGKVKEFYQ
RLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRKLTQYGL
M
>LA2691 conserved hypothetical protein
MTNFKKPIPKKSLRLKSKRSDRKNFPNDPRQKQNPLKSEWDFSKPDSFEY
HASCPDGLSGLLREEVIEAGLKIISENRGGIFFQGPAKALKEFILSTGIA
SGISISLKYWRVENPEDLYDQALQFPFEKILNSEHSLRIDSTTKDSLKDS
RYATYKLKDAIFDRFRSKGKEPPKISREEPDFLFYLRSHSDHAKLSLGLN
TRPLQQRGHGRIGGEAPIREILASALIRYSGWDAKSQLYDPFCGSGTVVV
EAALKLLYGGHTNYRSLASSLPFQKLFGQPNFKENLNLSSEIKIFASDLD
PKTLGLARLNAKNAGVDHLIHFFESDAKISENEEKISEGFIVTNPPYGVR
LGTKEEAKETYLAWGKKLKDHFSGNVLAIVCGDTSLLGFLKLKKDKEQSL
TIGKLKGKLVAYTLGR
>LA3328 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNPD
>LA3749 hypothetical protein
MQQETSTGQNAVGQFLAQSPVQNLISRYRSERGKIRSFMEKLPLYSNALK
DHEFQNADSMFRKELASRISYLKESIRRLEETFVLERRMELIGSGEIAVV
LIDKLIHTIQSAGYGLNGLGTGLKATREELEKLAEFDFSLFQEVEGVESK
IQILGINSNSSIQEVRDKIGEIRSSLDELENAFRSRKELFLKL
>LA3227 conserved hypothetical protein
MNMSKWYDPAELEDFLGSLPKFRVRLRLASEYKNRQEKVPKELRYMILIQ
RLYLQKKILLRRNEWMKGELRSIFSEKVQIESEFKVLEKLLKEIRNENAD
LICG
>LA1758 hypothetical protein
MISFFNAINIRNKERLLEILKNFNIPYEDFLEAVQKLNCFEIVEINYDYV
KISEQNLSTFFFYLAFIKNKQLSFGTLLTHYYTDYMSRFSDCIIPANNTF
GSEKVMDVVLPDLKKHFDTIYDDSEKSYKFLSVFWFYLRSETLEYLYNEI
NTYSEHESINTKRMILEKKSTLSGKDQTLELLGKFYVGCPELKDAIELSF
EYIEKCPFLTHKLISQFKELINFEAKDQHNNFRRQTILLKTLIDKIEKGS
CSCLQVFYGIAKLFLKFKFQYVNGYKDRTIHFHTYTIPLSKKIKDIRKMI
WDTLDLYFLENQDECFQVLKDYSAVGGEISKEILEYDLLFIFNIIDNHLK
NEFFEHCLYVQKLIRWLQRHNIQSSKFERYRNDFINPMYDLFTKVNMCGY
EHKEDYEFDDYGEFLRLKELEIRSAYIFKDQADMDSFHSMFTDIVNVHKP
ETIHLESLDFILEENFKRDYNIGFKFLELLAKRNDKLLFIPTRSLKQILV
IEENICLVWELIERISFRSKPLWKISFFTEIDSALIKNEHIDMILEIFRE
IENLKFMSLDWVERYLNFDYELYDKILTIVTERNREPNVKIGLQIHYFEK
TFKMLSKNMPLIQEAYLQQVKIDSHFDYNKNGLFRIIEMNPGFLKDYFDY
FYFSDDIEFTERKADWGFIWEIEGMGPVFSEIFKCINEKNIFSGFSSHFL
NNFFSNLKEDKKAKANEFLFELLKANYKDIRIVNLIVNIARYARKEIYEN
ILLLYISLNQAPDVFGKIWWRGNGGEYNGGDISGEIEANDWKKILSIIER
SEHGTNLIPIKKVIKDRIYSCLRFAESEKTNLFLDR
>LA3645 Single-strand binding protein
MKNIAHIILDGNLTSDPEIKTLNSGKSVATFTLAVNHDYKSTLEEPGEVS
FVEIELWDRQAVNAHEYLKKGKKATVIGELRQDRWKAQDGSNRSKLKVVG
QMIRFDGLPGKKEREVA
>LA1747 transposase
MARVLGVSRSGYYKYLNRHKDRSVAPELTNFLQEKWLKSRKNYGFKRLFQ
EVKKSNLPYGARKVRKAMKHCKISGKQRKQFRPLTTNSKHGGRIAPDLVQ
RKFHPNEQNRIWVSDVTFIRTSFGWSYLCVILDLYSRKTVGWSLKEAEHL
LFDFIEVYYNRFRFHSTLGYISPEEFETNIA
>LA2386 ribonuclease HII
MINRFTQSKFPLKLLIDGNYNFNRYPEWMNLKDCSTFYIKGDLRIVSIAA
ASILAKVSRDRYMISVSKKYPIYRFDQHKGYGTKLHEELILLHGLSDIHR
RSFTGKFLEQISESNL
>LA1125 putative outermembrane protein
MKTLQQKITFPILVGLISVTLIMLIWFVFFSGSKITTSSSASLDPESSSG
SEGWILNQAIINTSRKIFDENGNWLSFDELIQYASNGEINLISELSDLRR
QCPENIHYEQCNEIIRAFIADHYFGKDAEYLMKLFSSYLKYETKMKELEI
SDKLSRAEKYELIKKQRREFFSDKDTKLIFGLEEAEETYLDSLGGFLKDT
ETLSGEQRMQKYEEFRKNVYGQYYNTIKKREPKYNTYETEMFLREKELER
MSSSERNSKTRHIREKYFGKDGADRMETVYQESEEKEKKEKQTAQEESDW
IRKNPNVKVETKEKALMEIRIKNLGKEEAEEYSRRLKYEEEIKK
>LA1768 Integrase core domain containing protein
MVITFEKKKTGRPNIPWEIIKLIRRVAKENKIWGATKLHGLLLKLGHTIC
ERTVSKYIPKPPPNTKKRISWKEFYLLHADAMIVSDTFTAYSSNFKEIFR
IVFFLHLGSRQVLHFDIHTNPTTKWMRKVLKFAIRKQKQAGKKFHYFLSD
NDSVFGKRFTKYLIRFGIKHKKTSFHSPWQNCYAERWIKTCRNEFLDFFI
PLNQFHLEKKLEEFIHFYNHHRTHLALNKDSPIPSPVFIRPPDGSKKLVS
TPVLGGLYHIYSYKKVA
>LA3775 transposase
MFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIRE
TWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRIA
TTDSNHGNRVAPDLVQRNFKPNQKNRIWVSDITFLRSSFGWIYLCVILDL
YSRKVVGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSYE
TRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAEE
LLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LA3483 Integrase core domain containing protein
MILDLYSRKTVGWSLSNRNDSQLVCDTVLKAVLSRNPRKGLIFHSDRGSN
FCSKETRRLLIANGIRRSNSRKGNCWDNAVAESFFSSLKREIEYNTFYNI
EEAEHLLFDFIEVYYNRFRFHSTLGYISPEEFETNIA
>LA3961 unknown protein
MSIMKVMKSIFILLAVLGLNLSVLAQQNNQGGNQQANESVEKIDELLKGE
LVPEDDDKNLTEEQKRRKKAIQEQEALWKNPDFKGYDKNFQELHQLSKAF
ANNKFRLALSNYQSGVNTILKMREAIEQYRKEEAEKKRLDEKWYWQKVDR
KAREDRVVSRDKLVAKQQALNYFTKAINHLDEIKNPDLRERPEFKRLLSD
TYRSWILTEYDLQNLPQCIPILELYIEIDENEKEYPAHKYLASCYAFEEN
MIKKNGGASEDQMFKYRYKKNVHLLRATELKYGKDSPEYKHIVNLVNKDE
VISVRP
>LA0131 methylated-DNA--protein-cysteine methyltransferase-related protein
MKHKKVNSEISFFKEVYVLVKKIPKGKVTSYGRIAALLGKPRAARAVGYA
LNALSKDQEQKVPWQRVINSQGKISFRGDTGRSILQRKILEDEGVKFDFA
DKIDWKVFGWPDMIPSKSLLKKRKLTSISVVKQKKRK
>LA3252 transposase
MNRSIGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSLKIA
RLIQRHPIEELPTVPIPNDEEEDNRRLCSEHENWTKQLTQGKNRLHSLFT
QAGLTHITKKQLRTKANREISVALLPSRYQKEAERILKVLDLVEQNLKLI
EEEGSLSTRW
>LA3060 hypothetical protein
MNKPSTPGKDRLHSLFTQAGLTQIAKKHLRTKANREISVALLPSRYQKEA
ERILKVLDLVEQNLKLIEEEIQEALKKNKAYVQTIMSMPGIGMITSLAIK
ANSISHSLWVVRLHG
>LA2066 putative outermembrane protein
MKLQKLFLAVLIAISTAVFSQQNSGSDQKSQPSSAQLGQSILETERKLDE
KIFELNQRLTRHTVLMKMKVRVLPFRTVLFKGKANNDECTPAINQEDPAN
NCIRVEVYDFIRDEERGLNKNVQGALAKYMEIYFEGQNSNDPEPRTEPPR
NINKLKSKIYKNNMVLEDKIISEVMDRGPNTQPSHNDKVEVFFQKDNYPE
YGRPETPAEKGVGKYILAGVENTKTHPIRNSFKKEFYIKHLDQFDRLFTK
IFDYNDQLGNENYKENVDALKDSLRY
>LA3618 hypothetical protein
MVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGKVKEFYQ
RLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRKLTQYGL
M
>LA0846 conserved hypothetical protein
MNRSSQKIKRVLENLFLEYQTSDYLRSDPVEFPHLFSDLRDREISGFISA
LFSYGNITAIKDHLKRIFAICGNSPYRFLLNEDLKSIRKNLKSYRFQKPE
DTYLFLQTIQNKLRTTEGHGLESLFSLPEEGEFQFSSRDQKSFLEGGSLR
RRILSFQLRFLKESSKIDSKQTKSYGYKFLVGQGIRSTSLKRYSMFLRWM
VRKEFPDFGIYSSISPSELQFPLDVHIQRIASVLNLSSRRSSDWKKAEEV
TDFFRAIFPDDPVKADFALSRLGILKKCKSKYLRELCESCGIRSICKVYE
GFSK
>LA1757 unknown protein
MDELKRKIAEKDFIIVTGAAGVGKTKLVLESIREFIRENKTYKSYCISYK
YDSILNDIYQYVSDGDDYILFADDANRIDHFNQLIAYYQSKQFGKLKILI
TVRDYAYSDLYLNCPAELTEVIKLKKLSDNQLIDIVKGEPFGITNPNYQD
VITRISDGNPRLAVMLSRLAVEKKDISVLSDVSNLFETYFNTFIKDQKEL
ANPINIKSLG
>LA0293 purine NTPase, putative
MISAIQLLNFGKFKGKEFELSNSATVFLGKNESGKTTIFDALRLAIGSRF
LTASQEPKKSILSRYGETCLEGYNILGKVPDLSKDSAPQFVHCVSLREGE
LEFAFNNDKLIKPDFLRSKLLNNGVNLEGISSSFKKIHSPKTGSKDADHF
NDLKEEIVNLQTKRSKLILEIENLHSRNKNNSEKEEKHFKDQNKEIEIKN
KLAQIEKNSALDLKIQKKIQILECISEIQRLKSIEESIRKNLLYSKDESA
GFDSFQKEIEKSKNNASSFELLLKDKENTINSKKKELNDYKTQLSILQKL
KQKAEEWFDKIDTILKEDGFSEEIKTTHSNPLHQLLGLILSGLGFCGVLG
SFALFLFSKLSLTMFLSGVFLSGGLIILGLWLFSHKKESVNFRYSSEKEK
NFVLKISGQWNLTFPEYSIPLMEKIENLRQFFSKQIQNFDLKTAQVESIE
KEIRFLTEELDPIRSNLKLEAEKISNLESKRNSWLNDRRSATIQDYHKQV
AEFQTQSKYFSEGLKKILTEHSSHSLEDLEIKQKTLIATMEDVPNEFPND
PERQFRDSKKRELEKELQSLDNELKKLNTAIQVEDARIQDLLPEKEKDLK
DTILSLFEKELEFSKIESKRKSAKIAQELFEEISKDQSTQFVLIASEIGK
EIDLLLPKRSVILEAIDKKDSIKMQDEAGTFRSIDHLSGGTLATFYLIFK
LFLARKTVPKKGILLLDEPFVHLDRGRIESAFFYLKKFQEETEYQICFFT
KQEEIADTVLRFFKDSKKVSL
>LB085 MutT/nudix family protein
MVVPTSKPDSPRFSYDELTLSKKTSFSKSRFLGSKDDNLNRMDFFFKKKG
LRVRVAALIENSQNEVLLIQQKKKDSYYWLLPGGGIEFGESAEDALKREL
KEELSLEMKSASFLLLNESIEPGGKRHLIQLVFSVNVRKEVPELNLNEKA
ITGFGYFSSAAIREMDLRPDIKSFLLEGDFKSAPFIKSIWVSEKK
>LA2279 hypothetical protein
MNFLSDLNSIHPDQSLIVLYGDKILLLDQLISNQKRQIEVFGFGDGEGAA
KIEDSNLKIIHQLCSLDRLIEKTEEAVPQTSQLIELTEILFQKMEESRLL
HSQTEKKMKEILKEYQKELNQVQVQIQLKRHLRQDYWKTGTC
>LA3898 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQPAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3955 transposase
MGRSISYMGDCKRFSSAKQAAYYAGLVPRVDISGDTVRYGRIINRGCHSI
RRVIVQAAWSLVRCQHGGKVKEFYQRLYLKKGAKKSIIATSRKMIEVLYV
MIRTGKLFDSMPENILHRKLTQYGLM
>LA1847 hypothetical protein
MNLENYKKKIGKKVIQEIKSDSFFKLKCFQQFVYLIDELGFDSVKTINHG
KESELSFFKTKKNISIEVRITYEMGTLPDCMIFLENKPPIKIEADKIEYP
IHNKINSIILKDFASSKDYINEIRKTWEDNFSEVSTELNSCIKRLSDKVR
LSLTKQRNETV
>LA4352 conserved hypothetical protein
MMFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIR
ETWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRI
ATTDSNHGNRVAPNLVQRNFKPNQKNRIWGFRYYFFKILFRVDLSLCNTG
SLFSKSSGLVDFKF
>LA3583 SMF protein
MNRLVLVDSQVSKFCFKNQIFRKLNSWESLNEYLKNHLPASVLGKSLFHS
DKISKDLDLTGFKVLSFYDPQYPSLLKEIYDPPLVLFYKGNLNILNLLYG
AVVGTRDPSPISVFAAELFPSYLKNKGFSGIVSGFAKGIDAVSMNSALDE
DLAVIGVMGTGPEKEYPFENKMLYQRINSSQRTLILTEYPPGQKILKYTF
PKRNRIITGLCSSVFIVEAPEKSGAISSAYNAIEQNRNVFIFSDPRQTKN
FGGEILIREGAEILNLKDISLGMKEVFHMNDLLPDSQSKIPGMLAELSKK
RFSGEWKPIGAGYYARKTYFQPILPGF
>LA3720 Staphylococcal nuclease family protein
MKREDTNSLAQEIASIFESIRENTYKGGNRFLLTGHLEIGALLNREFNSY
ILNEKSKQRMKTLTEKIDKVVKINFSKRTLYHALKFYQAYHGKKLDFRLS
WSHYRILSAISNVETRKKLEKEAGDKGWSRDLLERYARESGYYGGSKSLK
WNRPNGENYHYKIVKNEISSQKKLWIDLGFRCYRELDAKSFKEGEIVQLT
FTKKTWRIQKVSLDSFLYHYLGILERVVDGDTLVAQIDLGFGLTARQKIR
LLGINAPELNAPGGQESFESLKKKLKPGTNLLIRTHTQDKYGRYLGDVLY
LSKKSSYETLREKGIHLNEELLVEY
>LA2680 transposase
MFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIRE
TWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGQPEKRFRIA
TTDSNHGSRIAPDLVQRNFKPIQKNRIWVSDITFLRSSFGWIYLCVILDL
YSRKVVGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSYE
TRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQETEE
LLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LA2774 conserved hypothetical protein
MEIWEQLFMKTTIRVFAGIVLFLSVGFLLRKNREQIRSIFQSEAIAASIH
GNVVHPGVYRLHKGDTLDDLVRLAGGLKKPSTLELDLDREILDGQTIELK
E
>LA2992 conserved hypothetical protein
MISIRKLVIYSGISISLIVLVWILFSSEDSSEKERKSKEADSVALLLGGG
SPSSGSGSSGGKTNESIFDSSFYKSGKGEYIESDKGEPKEADPNAADADN
PVNPQTNKPYTNEEMERFSQLRERFPDNSLIPKKLSPAEKEAKRQEDSRI
AEAARNVFARTATHDQIRSYYQNMEKQTQDRMDIINYLVDLQKGSGEEET
EKKLNNIQESIKNQLQQVQKEKENAFKQAGIL
>LA0394 Probable integrase/recombinase
MKQERSLFMNLRKTNPENEEYKNILTDLEIRSLINASRDKPDHFVRIRFL
IMFGLKPEELISIRCGDVDLENQLLMVRGDRGRKDRHLKISPFLLGDFYG
TVRHKNPEEYLFPGRNGKLHPKTIQKFFEKLEHKTGIQVSCSKIRETIAV
RLSSRGFSIQFIADFLGLKTRRAVYQLIGKKSKSKAVKKFSLDEILDIET
>LA3481 transposase
MRMKGRRKYSPEFREQAVNRTLNGSFTIKEVAESLGVSYFVLRQWKAEHL
KKKETEQPLPDKQLRKRRVEKTS
>LA0381 hypothetical protein
MNLSYSSVGTHIKLRFICKIMWELPQITILKTNSKIVGTHTFRKFSFIFP
TNSYYISFFETFFVRTILIFVLKNPRAKDRSGVKLLSFKFLSFDQIIFQ
>LA0460 hypothetical protein
MAARIFYYLSTGIILIGLALAAYSPDLFQWETLEWVYQKRTFFLFSLIFI
TSVILIYLIYWKAKKGILHSKSKTEIHLQESLNELVEDNQSLFSFLKAAT
ESLGKQIETSKQNLSPEFFSACSTEYLKLTREFETSSEIFKSIPMTPEED
PKKNKINFKIYEYSEIINRHRKLSKNLEKLREDLTRLRNKVSR
>LA0903 unknown protein
MSQEVNRKNKFLGQFFTPERVAHFLVDWVLGAERITSSEGLKRILDPAIG
NGVFFESVLNRLPDLNAEWVGFDLDIECLSSSRAVLENRISDSSILSFYD
RDFLLQEENQKFDVILCNPPYRKINDKNYSKELIQQFEGKSDRKLPGTAN
LYVFFLLKCLNLIHVGGRAAFLVPQDFFNSGYGVFIKSVLQESGLLHSLF
LFSPQDILFDEAITSSCILLFENSEREKKSGFYWTRLKPGFFSEESKLPL
CSVESIQTDWISFPDPEAKWSPIFHRLEKKTYQSDKKADFEKERLGHFVP
LTEFGKFTRGIATGDNDFFLFTKEMVETSGIPEKHFKTCIPKSQYARNRI
FLHSDWEELSQKGAKVWLLDVKLEFDLKDSLALRNHLQFGITRGVHQRFL
PSRRKNWYTQESKSPCPILASSFHRTEIRIVRNFSNVVHLTCFHGFNPVA
NMEEWVDPLYAYLISSVSKKDLETRRREYARGLWKAEPGDLNSLWVPDFR
KLDPQIQNELKDLVLDLKQARFSSEKESFILNRIDSIFKEWSV
>LA1179 Uncharacterized ATPase related to the helicase subunit of the Holliday junction resolvase
MSDLFTKKPIPPLAHKIRPSSFEEVIGQSKATKQLANYRFPVSIILYGPP
GTGKSTLAGILCRKWNLPFVEYNAVSTGVAEIKKLLERAEREGTILLFLD
EIHRFSASQQDSLLKGVETGHLILIGATTENPAFRITRPLLSRCQILKIE
PLSLEEQSSLLERGIQNIEYSINLNQDAKETLIRFSGGDGRKLLSNLEGL
SFSFPENHTISKSDVEEYLESRVIEYDKSGESHYDVISAFIKSVRGSDPD
AALYYLAVLLEGGEDPLFIMRRLIILASEDVGNASVNGLPLAVSGLHALD
AIGMPEGRLILAHVTTFLASCPKSNASYKGIGAALAFVRKYGTGIKIPNR
LRNAPTFLHKKEGASKDYIYPHDFGGFKEQNYFPDEFAENPPKFYFPTGN
GAELKLKEYLDKIWEKTPWKKGN
>LA0014 conserved hypothetical protein
MATKTKEYKNSIEFLNDWNQKLPQIVFVAAKESYEFEILAEKYKDSIRKT
GESHEIVIFVSEPGDFERFQSEAFNLDIFSNRKLFIIKSGLEFFKPISTG
KGKNNESLQKQFSNFPDSIQLLIHYNHWEVPNKVLQIFGGKANLIKTKNF
YPNETRGGLLQACKEIGVQLDEDAIDEFLHKIPPSMGAYLQSLSKLKLYL
SKKIFTKQDIEDVLLFSGEFNSSGLVDFFMESDRIRFFKELKKFQSGKDS
LLLFFSILKEKIDQLRKYKIISKKYETSLSDEELYEFLGIQSYSPARKNF
VRNRLKKEATFFSDKTIGELYDFLIDMNIRIKTNSEKEESLFYFKRRMED
FFLQLRRKDRIL
>LA2474 conserved hypothetical protein
MSDSLKNFTDSLLKDLEENENGFFKIENEDGLAYLSVFPAGKKGKPVDAK
EILRRIELFQITESSPISIKEIANKSDGLTHLIGKWPGKPESSRIEIEIS
EDRMKAFLIFHPPKYGGKILNSEQIQESIRERGIKFGIRNEVLNLLSEEP
EYGKKILIAEGESPVPGKNGDIRVLFIHPAAPNLEEDEYGRVDFKNIQII
QSVAKDQKLAEKISPLPGKEGKNVLGEILPYDSGKEAEWKLGLNVRLSSD
GISVHSLINGRPILDRQGTIRVDEICHLENVDFSTGNVNFPGTIIVEGSI
ADGFTLETEGSIIVKKSVGKVFLKAGGDVVLSGGFMGRNGGLIESGADIY
TRFVEQGRLIAKNTIFIEEASMHSELVAGESVVIRGGRGELIGGSCVAGK
SIICTKLGAIAETKTSVSVGIRPELLDDLEKLRLEIQKNKEILKKVEQSL
IKLNEDSQRRQLTIEEKESLPKLSAIKQKYSGILNNLLGQEQSMIMGFEP
DKDSYVEVEQEIFPGVDIYPGKGKNFKVRLKEIPGPSFVFLGNDGNPQIT
KVKPKRLGILQEEN
>LA1875 ATP-dependent DNA helicase
MNRSLEYFDKLSSLWDDFEPRKVQIQLSQKIEDSLYNGTHLIAEAGTGVG
KSLAYLIPAALISIETEEPVVISTETKSLQQQLLSKDIPMVSKILGIDLK
AEVAMGASNYVCKRKMNHVFRDGTFGPEMIPHLDSFNQWIQTTESGRKQE
FNGTASYDFWNRITREADNCLGRNCPNFSHSFYFLEREKWKRSNILIINH
HLLAAHIASDFKILPEFNRIILDEAHNFPDIIGSSFRREIRSQEIQKLLQ
QIWLPNKNSGVAVSIGSSPLKDLATQAGEALTIFFNALSGEVPLNFYSPQ
RIKRPLKLDRGKFAEILFEIVEILQKHLSKLSKDNEDITEKESALVLEML
AGRLNEIASSLETFRQVDDTNLVYWIEPPDQNSKEIYYKICMEPLSPDEI
IRDQFASRMQSIVFTSATLSTSGNDFKYFQKKIGNLNTSNLSVPSPFPYQ
KNALLYVPKEIRDPVADPDGYHADLAKQILWLVELTQGNTFVLFTSFKSL
KLVYDAIRPHTDLPLFSQSDLGPDGAKQMYLQTPNSVLFGVSTFWQGIDI
RGDKLKSVIIAKLPFQVPNDPVLETKSEKLKESGGNPFVELQLPYACTVL
KQGFGRLIRSGTDTGIVSILDPRMFTKTYGRDLLKSLPPAKLIQNREDLR
REFSNLPK
>LA3983 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINLGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0798 hypothetical protein
MAISHEQILELQKYQKMIHQLEKIAKESKNDEQRYRVSRDLEKYKTKMKD
ISPEGIPDNLDVTAEQIKRYKENPNEAGRVLAKYPIMKISPNSNDPEVNQ
IGTWINVMDREYLPVLNETHVRFDFSHTNEKDGVVKYMENIRRNVKVLTE
TIEEFHAAEKQEFREQLSRMKNKQTRIFIAEAYEMFQKFNEFLNKVTKEA
KEVGGVIMNLEDTIRFNPRFERATELEGKSIMDALKEFQEFTSEALDRIN
VPNIR
>LA1233 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVDLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA3282 transposase
MLVYHNSGLLRPSAPLKTLGERNEKSRNQIRRNRLWKKNFRSHRIGDTSL
HQSHQFSTPEIGVSKLINWLNPNDVVGLEAGSQSFRIAKSILTKGIQVIV
LNPGDLATIYQSLKKSDKEDSLKIARLIQRFPIEELPTVPIPNDEEEDNR
RLCTEQENWTRQLTQSKNRLHSLFTQAGLTHITKKHLRTKANREISVALL
PSRYQKEAERILKVLDLVEQNLKLIEEEIKEALKKNKAYAQTIMSMPGVG
MITSLAIKANSISHSLWVVR
>LA0685 conserved hypothetical protein
MLDVKIQFGAIYHIQSKKRHDVEFSDSLRTLTIQTIEEIRKILRDKILPP
PVSNRSLCKNCSLFDTCMPSSYSDLNFKEYLFRIKDSGGY
>LB040 hypothetical protein
MFSLKKEKQNHGKDSTYQISLMEFKQIYFIESLTDSKNLALTTRTHENST
KLLTAEFTKKIVVFKNTANSGFTCILKQDQNIQIFETI
>LA3542 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA4312 conserved hypothetical protein
MNSITIQFQATVWIYPGKGGWHFLTLPIKTSKEVRTILDKMPRSWGMVPV
IAQIGVTNWNTSIFPEKNSIKYVLPLKADIRKKEKITVNQKIRVSITIQF
>LA3250 transposase
MGRSISYMGNCKRFSSAKQAAYYVGLVPRVDISGDSAYYGRIVNRGCHSI
RRVIVQEAWSLVRCQHGGKIKEFYQRLYLKKGAKKSIIATSRKMIEVLYV
MIRTGKLFDSMPENILNRKLTQYGLM
>LB319 hypothetical protein
MNFWKRLFSSKREYRSAAKIWIEDPEKLQKSIHRIFDLLFDSTLKELMPE
KIFFSLESLNGIKIPEDKIKTILKEINITETFPFHSASDVMSLETQEAIL
KFATGTVAAPSKFFLMKILNQSPIQEMFSLGLEKILLEFNKKLNPLASMF
QAAGFEKQISGFLSTLLPGFTEKIAELIHNSSESETGKIFISNTLKILFQ
TGFSDLGHLESSDFRKHLEKLKEAISKDPILEKNLEKLYESFRDTILNEY
RGETLKEFLGYSEKEYILFRDSVSKTTAENILAIHKQKPLTELVAGLLED
VLG
>LA2651 transposase
MNKKGKYSPEFKEQAVKRTLSGSFTIKEVAGSLGISYFVLRQWRGEYLKK
SEDQLPLTDKQLKESEELKKLRKENLKLKEEVTILKKFAAMLSREQNPD
>LA1811 Integrase core domain containing protein
MMFMETHRFEYSIQSMANVLGVSRSGFYQFLKRSKNELEKYNPELVEFIR
ETWLTSRKNYGLVRLLREVKKVYSIYGARTVRKVMKLCEIQGKQEKRFRI
ATTDSNHGNRVAPNLVQRNFKPNQKNRIWVSDITFLRSSFGWIYLCVILD
LYSRKVVGWSISNSNDSKLVCTALSKAIECRNPPKGLVFHSDRGSNYCSY
ETRRYLLNNKLRRSNSRKGNCWDNAVAESFFGSLKREMEYNYFYKIQEAE
ELLFDHIEVYYNRHRSHSSLDFVSPVQFEVNAA
>LA1006 transposase
MESQEVKYVGVDCGKKSIEVVRINSENSLERRQFSTTESGINNLLQWLTL
NDIVGLEAGSQSFRIAKSILNKGVQVIVLNPGNLATIYQSLKKTDKEDSL
KIARLIQRFPIEELPTVPIPNDEEEDNRRLCTEQENWTKQLTQGKNRLHS
LFTQAGLTQITKKHLRTKANREISVALLPSRYQKEAERILKVLDLVEQNL
KLIEKEIQEALKKNKAYVQTIMSMPGIGMITSLAIMSYMGDCKRFSSAKQ
AAYYAGLVPRVDISGDTVRYGRIINRGCHSIRRVIVQAAWSLVRCQHGGK
VKEFYQRLYLKKGAKKSIIATSRKMIEVLYVMIRTGKLFDSMPENILNRK
LTQYGLM
>LA0281 DNA-3-methyladenine glycosylase
MSSPNSSQSTNFKKRSSNVLENREVRLKKASSWLRKKDPITKKLIDSIGL
CKLKTIGTPYQVLIKSVLGQQLSVKVALTFERRLISLVGSKKIPSPEQIL
KIPNDEMRKIGVSQAKTETIKRIAEAYLKRSITDSKLHKLEDSDVLKLLC
SIKGVGPWTAEMVLIFALDRWDHFSINDLILRKSVEKHYGISKDNKKEIQ
LFLNTYSPYRTILSWYLWADIDGGEGWG
>LA3281 transposase
MGRSISYMGNCKRFSSAKQAAYYVGLVPRVDISGDSAYYGRIVNRGCHSI
RRVIVQAAWSLVRCQYGGKIKEFYQRLYPKKGAKKSIIATSRKMIEILYT
MIKTGELFDSMPEKVLNRKLTQYGLM
>LA3192 devR, fruiting body developmental protein R-like protein
MSLHIFGTILTPHGVAANNRGENEGNLSTLQKLLWNGEVHSTVSGEAIRY
ALRETWMEDHENLLNRKIKSEGYDWQDETFKKPDRYLDDTILGFMDPKKE
TNKRRASLEITRAVSTRPWIGDVSFNVASVGAQRTNKNPIPYATEIHATR
YQYSFALSGDDVKSEWKGLALDGLASLRRVAGNHSRFLYDFSPESIVLRV
THDPAPRILYCFREEGDTIDLKNLAKRVASGDVKAEELILGGEIASEQDA
IELKTKGATVFPGIKAAVEDAKKRIGKSA
>LA3193 devS, fruiting body developmental protein S-like protein
MDPLILYYEIPIASFRPYQSREYQDSYPVPPPSTLIGMSLSLCGLSMDHR
KFFSSCEMCSAVADSNPARSTVLRRMRRLGDSGRDPKQRRPEYQELVTGI
RGIVAYRGLPAELAKPMKTVLTTPEKIIRYGGLSLGESSFLVDVIRLFEL
PDTTQSNWSWLIPDMKGSLDLPVWIDTISPSLTTKFRFSFQSAEGIPENA
WFKLRPS
>LA0503 dinP, DNA damage-inducible protein
MIMETRKIIHVDMDAFYASVEQRDFPEYKGKPLIVGGPPNSRSVVSAASY
EARKFGVRSAMPCSKAAQLAPQAIFVFPRFEVYKEVSKQIREIFLEYTDL
VEMLSLDEGYLDVTFNKKNIPYAVTIAKEIRTEIFKRTELTASAGVGNSK
FISKLASEKNKPNGLTVVLPDDVISFIDPLPVSSFHGVGKVTARKMKELG
IYTGKDLRTKSIDELVQHFGKMGIYYYKISRGEDERMVQSSRERKSLGAE
STFDRDKLDYDDLLKQLKDVAVVVERRLEKKDFAGKTLTLKIKFYDFSLK
TRSKTLSEPIFKADELYSTAIELFEEFFEIKYGKKSAIKAIRLLGISLSH
PNSENEDPNLFLNL
>LA0001 dnaA, chromosomal replication initiator protein
MFLEEKLNLVWNKILEEVSKKISPQYYERFIDTLKLETINSEKCTIIAPS
ATIKTHVERKYQSIIENAILETCGDKIPVEILIETKAASPLQSILEKSFD
QKDFQFNPDYTFETFIVGDCNRLAYTAAKECVRKPAEINPLYIFGSVGVG
KTHLLHAIGSELTKKDPWKTVCYVDISSFMNEFRFALQSRELIESFKIKY
QSYNCLIVDDIQLLSTNAEKTQDEFFALFNFLFERKRQIVIASDRPSSEL
AIHERLKSRFVTGVQADIQYPDREIRKGIVTHHSKIMDLGLSEDILDFLA
DQIEEDTRLLLGALNDIYLYKKSYSLLFLNLDKVKEIVKNRLYRKKNIEF
SHDRIIEAVAKEFNLNTAEIMGKSRKKELIIPRHICFYLLHNVFNVNKSQ
VGRLFQTQHTTVIHGVRKTEELLSNNKEMRFLVERISSKYKLQ
>LA1679 dnaB, Replicative DNA helicase B
MQSDSLYEPESEKAFLGFLLLKGADNLIDIPVAPEDFYVDLHRRVYRAIG
DLVDKRITIDPVSVLNFLKENSLLKDEEKEFNYIYSLYRDTVVTQPLAYY
AVRIKRFSERRMYSKILQESLELIRKEPGDNESVFNTVEKNLTEISRNID
AKGLLPVSSDKAALSDYIMEIMKNRGQITGLRTNFTKLDEATSGLKEHEL
MILAARPGNGKTTFALNIASNVALVYNQPVVIFSLEMSRIELLLKMVCAD
SQVESMKLKKSELTRADAPKLLESIVRVTAAPIYIDDSGGLTIDDFKGRV
RKLLTTEKIGLIVVDYLQLMSDPKNKDGGRQQEVASISRSLKQMAKEARC
PIIALSQMSRAVEQRSKDQKPQLSDLRESGAIEQDADIVSFIYREEKVKG
EDEISPEMRGKAEIIIAKNRSGPIGSFHLAFRPELSRFDNID
>LA3960 dnaC, DNA replication protein C
MILKKYTPIQEGSPDCSHCGGVGFFLTENVVGTSSGVLSICHCISENCPC
NGKPPWRVYDESLGKMVPCVCHNARMELGYLETIFKKSGIPPKYRYRTLD
QADHSAAVGISFTIAHDWANELVHRWSDSNIHSQGLYLWGGPGTGKTLLA
CIILNELIFRYKTNCKYAKINRDFLNTLRETYQKDSETHGMEKTIETLFA
EVEVLVLDDFGVQKESDWSNSKLYDLIDARYEQEKLTILTSNTSPAEWKD
KAEGRIYSRLKEMTQEIHLECADYRLKLSESGERK
>LA0258 dnaE, DNA polymerase III, subunit alpha
MSLTMQDFAHLHLHTNYSMLDGAIRIKELMQHVKECGMSSVAMTDHGNMF
GAVEFYNEAIKQGIKPIIGSEFYVSPNRKQETEMVKIADGNAYHLILLAK
NEEGYKNLIRLSSKSYTEGFYKKARIDYDLLDRYSEGLVCLTACLAGEVN
RKILEGKIDESFQLAGKLNEIFRKEDFYMEIQNHGIPEQMTVAKQVYEFG
KKTGIPLVVTNDSHFLKKNDQEAQDILLRIGMQKRITDPMEFGFNGEFYV
KNGDEMAKIFPEIPEAFYNTLEISNKVNLKLQFGNYLLPEFEVPEGYDAD
SYLEKLIWEGIERKYPNLSPEIKERVIFELNTIKNMKFAGYFLIVQDYIN
YAKRNGIPVGPGRGSAAGSIVAYALGITNVEPLQHNLLFERFLNPDRKDM
PDIDTDFCVERREEVINYIRRRYGEEKVGQIITFNSLAAKAALKDVARVL
NLPFGEANEMTKAFPNKLGMSISEALSTSSELKNFSEKDDINHKIFAIAQ
RLEGNYRQPGRHAAGVVISPYPLEEVVPLSTVAEKEKPGVRAIVTQYDKN
NLESIGLIKMDILGLKNLTTLDYAIKLIEQRRGIRINLDEISYDDANTYS
LLRKANTLGIFQLESTGITDLVAKSQVSNFDEIVALIALYRPGPMGEGML
DEYLDRKSGKKQVTYPHPSCEPILKETFGVPVYQEQVMSISRVVGGFSVG
DSDVLRKAMAKKKADLMDKLKVQFVEGAVKQGIQEKVAKDLFEQLERFGG
YGFNKSHSVAYAIITYQTAYLKANYTIEYLTALLASDHGKTTDIVKYINN
AREMGIQILNPDVSESQASFSVIDNTTIRFGLSAMKGVGETAANSIIQAR
TKVGSFRTLQDFALNIDTRLINKKVFEALIQAGALDSFGYTRKCLFESVD
SILTFAQKEQERANEGQFSLFGNEESSFSLNLPKDALEWEIDEKLKREKA
IAGLYLSGHPLDKYEKQLKSLKTIPIEKFDDLKSGTKVEVAGVISSKNIK
LSKRNEEFANFKLEDRTGEIECVAFAKTYQKYKEFIKEDQAIFIKGDLDK
IEVGDAELRGQVKVNSIEILDDATIEDKLEKSLHLRLEERHTEDPELVPK
LYALLACYKGESSVYFHIVENQEEKRVIRAHDTYSIQPINELFLRLADLL
GDRSVFYSVGEQLKVINKSQAAG
>LA2231 dnaG, DNA primase
MSNKKDFIDRIHREVPIESYISRFIPLKKRGKNFIGLCPFHQEKSPSFNV
SAEKQFYYCFGCKASGDLIRFVMSYERVDFSRSLEILSEYSGIPLEEKSS
KNSEFSDFLYKINLKVSEYYQHLLHTPTGKNALDYLKSREIEDSEIRLFG
LGYAPEGFDTLAKEVLKTKEEISGAIQLGLLREKEAGNGRPYDFFRDRIM
FPVLDLSGRTVAFSGRILGPGKEAKYINSQASLIFDKSRTFYNFFKAREG
VRKSGEAMLVEGYLDVIGLVRRGYENVIASMGTAITENHIRTLKKFAERV
TFVLDGDIAGKKGALRAAEICLKEGMECSIVLLPEGKDPFDLSKSLSRPE
LHEILSDRIQGSEFVVEELLENADSRALPEKKRKSLQNLYSFIQTLGRET
DKQFLLGLGANKLGISMDAVLRDFKGGTGAKNGPSNADTSSNLKEVQGIS
GPALDCERKIVSMLVKHTGLFSYSEEISSMEFMDIASSYLWDYLYTIYTG
EEEISPVGILTSELPEDLKQILAPYLLEEDSEKTEPGELHKVFRILLLQQ
KKFRIEEKIRELDQKRERFFTPEIFTELSFYRKEKEKILEHIRSQSTT
>LA0002 dnaN, DNA polymerase III beta subunit
MKIKVNTSEFLKAIHAVEGVITTREIKSILSNLKIEAEGKETFLSATDLE
ISIKTSVPADVTQEGNVSLPAKQLSSFFKTIHFEDTNLSLEESDSNSSIV
YITDASGKNDYKSKISGMDADEIKTISKIDSSQVSSFPSTLINDMIRKTS
YAIAHEDQRFIFNGLYMIPNGDKLIFVGTDGRRLCKIERTLPSPLQFKDS
IIVPAKAIREISKMIATSEVGNIGLIDGQIYVSANNIELLCKLIEGNFPN
YEQVIPKNTKFSTSISKEEFQVSLRQVLTAAEEPSRQVRLTFSKNNLNLF
AQTLGASEASINKPIEYSGDEVTIAFKGEYLMDIFRSIDDNEVKIEFSDA
NSPVIFKDPSDPEFISVIMPMKL
>LA4331 dnaX1, DNA polymerase III subunit gamma
MCMLGKHVCQKHPGFESRLLRFHIMAGTHEVLSRKYRPQKFRDVIHQDLA
IGALQNALKSGKIGHAYIFFGPRGVGKTTIARILAKRLNCQNPIDNEPCN
ECNSCSEITRGISSDVLEIDAASNRGIENIRELRDNVKFAPMGGKYKVYI
IDEVHMLTDQSFNALLKTLEEPPAHIVFVLATTEFHKIPETILSRCQDFI
FKKVPLSVLQDYSEKLCKIENVQYDQEGLFWVAKKGDGSVRDMLSFMEQA
IVFTDSKLLGAAIRKMIGYHGIEFLTSFIKSLVDPDNHSKSLEIIESLYQ
EGQDIYKFLWDSIEFTHTLNLIRDSLADPESVNFPKEDLVKMKSDFENVD
SSKLNFLSGKLFEIYERIKTIRLRNSFEIKVFTEIQIKKLVEELTYPSLA
GLIDKINHLILMVQGSKNVLSDVNQNTAFALKDTLQPETSKKKDKLSSDV
ILESQFESNQQDSNLENTKPAELSSRKFDTSTEIKKKFLGTEVDPNQTPK
LDS
>LA4361 dnaX2, DNA polymerase III subunits gamma and tau
MKSFQLDEILGQEVALTFLKRYISKPETIPPLLIFHGPDGTGKESASERF
IKNVLCFEGTSCGVCASCKSFIHNSHPDYIRFPEESGKIIPIGSEDNPEE
FSIRWLIRSRLNYRPHLSKFRFIVFPDASLIGNEAETALLKSLEEAPPFS
RFIFIVNNLDKLKETIVSRAICVPFQYLNQEDLRKINTNLGLSTFPFQGG
SLISSECPKEVLDLVQEKIKDRLETGLDLLKLESWILSYKDEHPEWKENF
SYKEFLELVSLILMYEYTRVDYESNLSKLEAIFQFKTELHKKIIGIDSIA
LSRLFFKLSI
>LA0878 dshA, DshA protein
MEPNSDSNRRSNMERAISNEMTRLELSEFLSDPRSRKEFFELMKLKNKIG
HLEMNLKLGSDENKSRTFYIRNSLLAAACVLLLSALAFYFRFFSSEQNEF
EITKSVTTGQCNVSINKENIILKSGKDSYCDYTISGELGLTLRILPESIF
SASKKGDEVNLSLSSGKVLFTTNKKKISLKIRSKVDTLSSELLGTTLVLI
ADQHSKKYQIMVLEGAIRVDSTKSKMDILPGYSVLKDGSSESSTQSSGQE
VEVMKIEPKEFTKYQALSENSKKVLNENFTHHNRETDFLIKSEIEENSYP
IYRITLKNKQVVSGTIEETEKFYLLKDKDGNIKEIEKEDIIELELVQPKN
>LA2516 exoA, Exodeoxyribonuclease
MLIKGIPPIYTLLMKFISLNCNGIRSSLEKGLADYIRNTKPDFICFQETK
ANQDQVPPSLWEEGGYTPVFHSAEKKGYSGVAVLYKKPPEKITIGIGDPF
FDKEGRSIYLEYPNFALWNLYFPSGTTGDIRQAAKMKFLDLFQKESSKRR
KKQPNIIVCGDVNIAHTPQDIHDPKGNAKSSGFLPEEREWLSEFLNKGWV
DTFRYLYPDKQEYSWWTFRAGARAKNKGWRIDYFFVTEELKKNVKSHSIF
RDKPFSDHAPLEFEIKL
>LA0006 gyrA1, DNA gyrase subunit A
MSQEMENETKVLSYNIAGKPDIADALKNGVRVIPVEIEDQMKEAYLGYAM
SVIVGRALPDVRDGLKPVHRRILHAMNERAWRSDRPYVKCAKIVGEVIGN
YHPHGDASVYEALVRMVQEFSLRVPLIDGQGNFGSIDGDNPAAYRYTEAR
LEKVAEELLRDIEKETVSFSPNYDDTKEQPDVLPANFPNLLVNGSSGIAV
GMATNIPPHNLRETIDAVIAVIRNPEITIPEILKIIPGPDFPTSGIIIGG
EGLISAYSTGKGSIRIRSKVEIEEKKNGREVIVVTEIPYQVNKKVLLEKI
GDLVNEKQIEGISEILDLSDRKGIRVEIHIKKDANAQVILNQLYKMTQLQ
VSYGITMLAILDNKPKIFNIKEILTAYAAHRREVIVRRTQFDLDKAEKRA
HILEGLKIALENIEDVIKVIRASKNPPEAKQQLMIRFSLSEVQSDAILEM
RLQRLTSLEVQKIIDELEEVRKLIIDLKDILAKPSRVNEIVCTELQEVGD
KYGNKRKTEISIETIESSSFNAEDLIADEEIVIQITYDQFIKRLPIDTFK
RQKRGGKGIQGLSQKRDDVIKIMKAAMTHDNIMFFSNIGKVYVMKAYELP
IASKEARGKSLKAIINLREDEYISSVFTFRGEDMDKDLLLVTRKGFIKRI
QLKEFGNVKKSGIIAIGLREGDELIKVESMTDKDEVMIFSKKGLALRIEG
NIIRAQGRTASGVTGMRLAEDDAIVGLSKYKEGEDIFVVSEEGYGKRLGF
EEFAAKGRGGKGMAYLKITEKNGFSVGTGSVGSEDEIILITQQGMTIRIN
AFDISKLGRTAVGVRIVDLKDNDKVQDFTVLGEN
>LA4193 gyrA2, DNA gyrase subunit A
MKNSNPPKSKESFPKRPFEDQVNDDQRKYSRYVCDSRAIPHEIDGLKPVQ
RRILWAMWNSDARNRYTKTVKVAGLAMGYHPHGDKSIQDALSQMAQDFTF
ANNIPLVSGEGTFGDVLDPSAIASPRYTEVKLSDFVKDLGFFESLPDIDY
VKNYDETEDEPIHFVGKVPIVLLNNIQGIATGFRCFIPGHRLADIVKSQI
NYLKSKKPLSLKPWYKDFNGEVKMAETETGNITMSTTFAFKWEGDTLYLT
DSPMNWNREKVISLLDDILERKDSWLKDYVDYSSQTFRVELQYKKGEKPS
QKEIMALFNKEDVQTLAMNVITYDGKLKNFKPEEIIKRFCDFRKTHLIRR
FKRLSGLEEEKIERNSELIRFIKEKWNEKVIGIKSKKDFEEKLKTSKFKY
FEWLSTIPVYRMTIEEVKKCEEAIVEAKTTLARYQGLVKEDKKLTEFMII
ELEELQNKWDKV
>LA0005 gyrB1, DNA gyrase subunit B
MSQEETSYSAGQIKILEGLEAVRKRPGMYIGTQDETGLHKMVYEVVDNSV
DEAMAGHCTEIKISILPDNIVEVKDNGRGIPVDIHPDKKISTIEVVMTIL
HAGGKFENDAYKVSGGLHGVGVSVVNALSEYLEVEVHQKGKVYTQKYEKG
VPVSSVQIQGDSSERGTIVRFKPDATIFTTVDFQFDVLSARFRELAFLNK
GLVLVVEDHRRGKENILRNEFQFSGGIVSFVEHINENKHPMHKVIHFERN
KDDVLAEISIQYSETYTENIFCFTNNINNNLGGTHLEGFRAALTRTLNDF
LKKDSTLSKKHPTGLSGEDVKEGLTAVISIKIPQPQFNSQTKEKLVNAEI
KGIMQTLSSEGLTLFFEENPNITKKILEKCILSAKAREAARKARDLTRRK
TVLEGGGLPGKLADCSEKDPAHSELYLVEGDSAGGSAKQGRDRNTQAILP
LKGKILNVEKARLDKILSSEEIRVLVSALGTGIGEDEFNIDKIRYHKIMI
MTDADIDGSHIRTLLLTFFFRYMRPVIERGYLYVAQPPLYLIKHGKNSTY
VYSDKEKEELLKNIGAEKVVIQRYKGLGEMNPEQLWETTMDPSNRVVLKV
KLDDFVEAEETFNVLMGDEVQPRKQFIEVNAAKVANLDL
>LA4194 gyrB2, DNA gyrase B subunit
MSTDKTKVKEKSSQERNFKKLSNVEHVRMRTGMWLGQNSASTFEQHFFRK
NNEGKYEIVHEELEDVPAKLKCLDEACMNAVDEYRKNQKDKSIPEKDKMS
KLIVQLSSDRKCVTIADNGRGIPATNAEGVYLHLMYGENFDDHVKQDHVA
GQNGVGISLVRMVSNYFKVKTVNNGSSFKKLFTVHDDVKKQIRSYKLSKE
DTERVFLYFDEHGKFTDCNLLTKDQIDKLSPLLKKTNMQELIEKASKEDH
GTSVEFELNPKYFNNLDISFNVDLMKQYLQDIAMTNPGLEVQFVFKGKKE
KYKFKKGLDEIFSHSDLVYYKMDYVAPGSASQLHLETYLVIGQNKNLTWV
NSIFAPQGGSAIEYLENRICDEVRKKSQIVALEKKLKTSSTRNDVRNCFH
MYVNARLLNPRFKSQDKSYLINDLNEDIRNAVDKHLDKFIKKTGLLEEIK
LQMEKRTQLKAFEDAQKGLKKASRMNIPKLMPPTGKQGDPGRVLFVAEGD
SAIAGLRPARNPKLHGLFPLRGKPLNCKGLSIAKAIANEELKNIVAIVGL
PLDQKVKSLDELNYEKVSIITDADFDGYAIRSLMLSFFYEYWPELFELGF
IHISSAPLYEVDVKFGDKKKETIFCIDDKDYEDLIKRVEKGGGEITRKKR
NKGLGETGKEAMKFAVEECMTKITIGNKKEASKIQSLWFHKDYAEQRRDA
ISEYAMSVIED
>LA3977 invA, invasion-associated protein A
MDKPYRKNVGMVVFNSRGEVLVGERLNFLGSWQFPQGGIDDDEDPIKAAM
RELYEEVGIDSGKIVAEYPDWISYDFPENLPLNRHLQKYRGQLQKWFLIY
WDGEVDQCDLDIHEREFGTVRFIPIKNTLNTVVPFKKDVYYKIVNDFGPK
IQNFLQDIGNRS
>LA4119 ligA, DNA ligase
MPKKKEDSQKTLSEKEAKGLIAKLSDEIRHHQYLYYVKNDPKISDFDFDQ
LFRRLQDLEEEFPQFKDLASPTLVVGSDLDKDFEKFQHKLPVLSLINTYN
DNELLEWVNKTDPEGLYSVEWKIDGASIVLYYENGILKNGVTRGSGGIGD
DVTDNIRTIRNIPLRLPKPITVYLRGEVFMTFKDFEEFNALSSGKYANPR
NLSAGSIKQKNSSDTAKRPLRIFTYDATFPNMEKKFKTHQEIFSKLEKLT
FPVPPNTAFVSGSKIAKTIQEFKKQKDSLGFPTDGLVIKLNDISKRDALG
YTSHSPRWARAYKFDAIMKESKIVDITYAVGRTGKITPRAEIEPISLAGT
TVTFATLHNQDYIDELGVGIGAIVRVAKRGEIIPAVEEVVTPGKEVFKIP
DRCPSCNTQTIKKESLVDLFCPNPDCPDRVKNGIIFYCQRKQMDIEGLGD
KQIEFLYDHDYIKSIADLYDLKDQKEKLMEEEGFGEKSVNIILKGIEQSK
QKDFRFLLPSLGLSELGHKVTELLIEHGIDSIDEILSIAKDQKKIESLLE
IPGIGPSTIQAFQENFSDKRILKLIERLKKAGLKMKADPIQVADQQPFAG
QSWCVTGSFENFQPRDKAMDLIVYYGGRKVSAVSSKTTHLLAGPGAGSKL
EKANELGVSVYDEKQFLDLLKSLKIDFKNLI
>LA1370 mag1, DNA-3-methyladenine glycosylase
MKDKILKFEKKEFYSICDQLSRKDRGLHSILLKHGYPPFWSRKPNFETLV
HIILEQQVSLASARAALVKLKNKIGSVTARKILLLSDIELRECYFSRQKT
SYVRDLAEFVFSKRIILGDLASKSDQMIRGDLITVKGIGNWTVDIFLIMA
LHRADIFPLGDLAAVKSLKKIKKLPVDTSNDKILSVSKSWRPFRSIATML
LWHSYIQENNIKF
>LA1175 mutL, DNA mismatch repair protein
MTGLRIVPTLETSMGKIQELSPELINQIAAGEVIESAHSVVKELMENSMD
ASATQVDVESKDGGLSLLRITDNGTGIEPEDLEPALKRHATSKIQDYKDL
ESVLSYGFRGEALASIASVSRLTLESGTKEQKTAWKTRSVAGKISEKEEI
PGFIGTKILVEELFFNTPVRRKFLKSIRSEDKKIRDRVTTQALAREDVRF
RLFQDGKEVFVLPTRENKKERIIDLFGENFRDHLLEVSLERGGIQATGYI
SDPDFYKSNRTGQFIFINGRPIEIKYSSVLLKKAYDELLPPNGHPYCFLF
FEIDPSRVDVNVHPAKREIRFLDEDGFNGFFLALIQKELRSSTPVSFLEL
KKRLLKPAPETHSTTSFYQARSSGKNPLLGRELFSGVSKQEGFELDRMGP
GVSLSELTDERVKHSSFVPKKHFGVLFETFILAEAEDGFYIIDQHTAHER
IRYEEVLRKLEKRNYGIQPLLTPIRIDVSKQEQEDILNRKKEYEEVGIFL
DPLGEDSIVLREIPAYMEPGQEKEIVLDFLNRTEGKETSEPELYDLMAKC
VACRSAIKKGDQLSDPILAEILNRLSYCENPSRCPHGRPTLVKLSRDDLE
RMFHRK
>LA2146 mutS1, MutS protein
MNLESTATSAEYWSDLADALNTPMMKQFLAIKKDFPDTILFFRMGDFYEM
FLEDAKIASSILDIALTKRQNAVPMCGIPYHSKDNYISRLLNAGKKIAIC
EQSKPEEAGSKLMTRDVVRIITPGTVIEENLLSGFQNNYLAVLHLKKSLI
YFAIADFSTGEVFYSSVSVTGLERLIAELEKFKPSEICVPKSEHTFFQEL
EYFKNREFTVLKNQIETSEKDSFQVLSKYLNEYIRETYRDNKLVLREPKI
LSSGKFLEMDRETIRNLELVENEKEKNNTLYSIFNFCNTAKGKRLLKQRI
LFPECDPVVLYSRWEKQDILLKTVLAPYITALKDFGDLERILTRFRGNHA
YPRDFRSLLNSISSGIKLKEELEKVSYPFLIPIEELKKISDFIQERLHPG
DDLPVILGNGIFLKKGFSQKLDQAREAGVKGKDWILDLETKEKKRTGLNT
LKIRYNKIVGYFIEISRAQAEQAPKDYLKKQTLVGSERFTMPKLEEIERT
ILEADEIIQEIERTEFNRMVEEVLKFSSSLLSFSEEIGDLDFQISLLTAK
DKFGWIRPKLSEDRSLDLSDSRHPVVEATLPPGQEFIPNSVYLDTQDKAI
AVLTGPNMAGKSTFMRQIALNQILFQIGAFVPAKSAKLPIVDKLFTRIGA
GDNLTAGESTFFVEMKETANILNHYTEDSLILFDEVGRGTSTYDGMSIAW
SILEYLSSLSVRPKTIFATHYHELTELSRLGGIFNLYLETLEKEDRVLFL
RKVKVGKAKKSFGIYVAKIAGVPEPIVKRAAELLTDLESKKKEIKIQEAQ
PTLFTEPETKNFNSQTEESILKLKLEEMTPIEALKTLEDFQKKLRKQK
>LA2351 mutS2, DNA mismatch repair protein MutS
MFSVLLNKKHWYRRRVSDCMKKKSVKFGYSGAKTIYIRKLRFEEAQWKLE
KEIQEAFLAGETLIEIVHGIGEGILKKLTLDTIRSHDFLKEVDYSQFGIS
NPGSTLVEVLGPDKDVLKRYLR
>LA4236 mutS3, MutS-like mismatch repair protein, ATPases
MTSFTRIDRLKRRAEKLNKLHDKISVLLSRLSLFRLIFFSVFLLWISIFY
YLHSSIFYHLPSLVFLFLFFFFVRRYKKTLLTRKKIQLWIFVLERESARI
SIKGFGKKFGTQIVLEKVSPLARDLDLFRENGLFSWLDTTFTSNAEQKLI
ALLDLEDSSSYNNRENVLLRQSIVRSISEKTLAIPKILRLASYLKENQDV
FQARTKTETKSIRDENTSKFWESYPWLKKIYRPITILVLAFIPANVFLGV
PFPASVLFLNLILFGLYRSRSLDIFRRYYSISGSISGLQKILIYLKGLKI
RDKNGRFLLEDTSKDELRSAYKDLDLILKRVSLTEAPLLHLILNNLFLYD
LWVLQKISKWKEKHSVLLEKSIEDLTLFDSLFSFANLKWMFPDYCFPEIL
SENSKEGISGKGLFHPLIPSDSRVSNPLDFIEEQNVVLITGSNMSGKTTY
LRTIGVASILSMAGGPVPASKFSLPVLKIHTSMRNEDNLEEGISFFYAEV
RRLSEIVKKIRDKNSSHLVLLDEILKGTNTRERSLACKGILKELKKNRTI
VFVTSHDLELAKVEGVILKHFQEEVLDGTMYFDYKIREGLVETSNALRIL
VQEGLDLDFT
>LA2328 mutY, A/G-specific DNA glycosylase
MKLNKNSIDPKLILELRKNLLSWFHKNKRELPFRINKNAYRIWVSEIMLQ
QTRVTAMLPIYETFLKRFPDPNSLSEASEEEVMKYWKGLGYYSRAKNLKK
GARLLVEKYQSRFPENYEEALLIPGVGSYTASAVLSIAYGKPHAVLDGNV
KRVLSRLFLVESDPSLTSTNQTLADLAKEFLTPQSPGDHNEAVMELGALV
CVPIPNCSACPLQNHCEARSVGKEKEIPASKSVENWIDLDLNFLFLKSED
KVLLVKYTTRRFFKTIYSLPFRLEGKHPYEKDEWIEELFEDSRIVPNFLQ
TKHSITNHRIRLKFCDLDEKNISKVEKNLKKNKHIEFKWVPESELKEEFP
SSISGKLIKLRNKNKKQPELPVGKL
>LA0488 pcrA1, ATP-dependent DNA helicase pcrA
MKNKVQYSSAQQKVINENTRFVQVVAAAGSGKTSTMVGIIERILVENLFP
KESVLVLTFSRKAAIEISNRIQKVTDKNSIRVQTFHAYCLYALSQWHPKF
TLKKPKILSPEEKNQFYRGFLKKERNKIGGIPYDFFWAENIPFIQENFSE
LKKDLEFAYQKFKHNNGFLDFEDLVKMFLDGLKNEEEWTSEPRSLLQKII
VDEFQDTDLEQLEFLKLLSQRASIVVVGDDSQAIYSFRGTSPEAFLNFQH
LFQPCKVHFLNTNYRSLPEIIHTSSIPIQKNHHKINKEVFPFRHEKGFVG
KIFIEEAADLIPFLNRAILTSKDDFKILCRSNFRISEYIREGIPKRYLMT
IHASKGLEFHTVFVDVADGWNARLDSTLKTIEEERRILYVGLSRAKDRLL
ILGTSKNSRRETIENTFFHYFKKLKNIVPEDLI
>LA1085 pcrA2, ATP-dependent DNA helicase pcrA
MKLSPEQEKAVRHVDGPILIFAGAGSGKTRVISNRIAHLIENAGVPAGKI
VALSFTNKSAREMEERVRKMIPRQKLKGIVLSTFHSLGLNILKKHIGLLG
YKHPFLLMNQNDQEGFLTTLLIANKVELKKAKVSEILGKISRIKNSGLAY
REYLDSSLVESDQVANLVYDSYQKSLKEQNSLDFDDLILLPGILLREFSE
VREEYHKKYQYFMVDEFQDTNQTQYIFLRALMGENRNLCVVGDDDQSIYA
FRGSDLSLILNFEKDFPEANVVRLLENYRSTSVIIQGANSLIKNNLSRRS
KELFSSIPGGRKIRYLERMDEKDEAAYVVDCIREEMIKDARVGSQISILF
RTNFQTRPFEEELRSRSIAYKLVGGYNFFDRKEVRDMISYIRLIANTRDD
ASLLRILNYPKRGIGPGSISLIHEKASQMGESLYEILFRVCESPDFIPNL
QKKIQSEIYNFVNLIERTKKKFSTAPKMYYAFREFIQEVGIEKEILLEEK
DEKVAKARTFNLSELVNMMSYFEENHDSPEKPTLFDFINRLNLLMEDENP
SDEDKEDNRVQLLTIHQSKGLEFESVYVPGMEEGILPNSRVLTEESSVDE
ERRLLYVAMTRARKHLCLTGAANRRKFGEQTATQASRFLTEIDPETMDWV
SNDEVRQQETEDFFAELEKLKTGS
>LA2193 pcrA3, ATP-dependent DNA helicase pcrA
MGITIIELSWKEELNPAQMEAVLTLDGPVLVLAGAGTGKTKTIVSRLAQL
VASGIPASSILLLTFSRKAAREMILRASMIGNKKCSEVQGGTFHSFCNGV
LRKFAPVLDISSGFTILDESDCLDVFQFLRNEKNFGKTKSRFPSNETLVS
IHSEIQNTGKPLSSILEKDYPLFLQKANDISQIFEDYKSYKKEQSLLDYD
DLLYFTRELLTNHPGVRNALSEKYRFIMVDEFQDTNKIQAHIACLLASEH
SNLMVVGDDAQCIYTFRGASVRGILDFPKIFPNTKTIFLEKNYRSTPSIL
NLANEVLKNFSEKYDKYLFTDNENGPLPQVLQFEDELEEAEGISKILLQK
REEGIPFQKMCVLFRAGWNSNQLELVLAKRNIPFVKFGGKKFIETAHIKD
LLSLLKLLVNPLDSVSWIRTLKLIPGIGNAKANSILDKIRKSSGSFEVLS
EENGTTIDKYISPLYHLYQKYKETHSEVKKMVSEFIDYYRVLLEKNYDDS
KRRSEDLDAVLGFSLKYNSLSDFLSDLTMDPTSLSLDKMKPDDTDSDLLN
LSTVHSAKGLEFDLVFVLNTTEGIFPSSKNTNIEEERRLFYVAITRARKE
LYLTRPSLAQSRSGPYYTKLSRFLSEIQFPEKVYELKLMSGKSVSKNPFA
NQGSFVKNTNDSFSRIQDYFGN
>LA2317 pcrA4, ATP-dependent DNA helicase pcrA
MDSSFLSDLNEEQKKAVLQVNGPVLILAGAGSGKTRVITHRIANLLINHG
IDRICAVTFTNKAAAEMVERVKKIVPFLPANVQIKTFHSLCLYILRREAS
FFGFDNGFTIYDTTLQESLLKQVIKDLSLDPKFYKPSTLGNYISGLKDKM
LSPESYLEKEGRNDFSKAVSAIYKEYEKRKDANYAFDFGDLIWKTVQLFQ
KSSDAISKYRHKWEYVMVDEYQDTNKVQYELVLLLAGEKRNLCVVGDDDQ
SIYSWRGADIGNILNFEKDFPESVVIKLEENYRSTSNIILAASNVISNNT
QRKEKEIFTNNPEGAPVVLNEFENESEEAHGVITRMRSAYSGGTEYKNIA
IFYRTNSQSRYFEEALRNVGIPYKIFGGFRFFDRAEIKDLIAYLNVVSNP
LDSVSLLRIINYPPRGIGDSGVEKIREFSLEKGISILEVLGQEDIPLKKA
AKSKGKELYNLFCDLIEKSEKGLSPSEIALELLNRSGLMSHLKDEGTEES
VARLENLQELVNSIEEYEKNSDSPSLEEYLNQISLITSEEDSKELTDYVN
LMTVHNAKGLEFEVVFLSGLEEGTFPHSMSLEESHFGDEEERRLFYVALT
RAREELFLSYCRTSRKFGKVEDRIPSRFLSEIPRECFGNRGYVSRERTAR
KPQGPPVASKIRQANNELESHHAPPPDPNNLYLKPGDRVKHKQFGIGTIL
TISGREKNTKVAIRFGNVEKNFFVAYTPLEKL
>LA3625 polA, DNA polymerase I
MKRLLIIDGHAFVFRAYYAFGASNLTNSKTGKPSGATFGFFKMLFKLIQD
YTPSHIAMTFDPGGTLERGKIFQDYKANRKPMPEDLRPQIQEVMDTLEKI
GFKILKVEGQEADDVIGTLCETYRSTAKEILIFSGDKDLYQLLEKKNIKM
LRGKKGVTEFVEIDSAWVKEELGVDVKQIPDYMGIVGDTSDNIPGVKGIG
DKGASKLLQEYKSLDGVYKNLEKIKNPGLKTKLSEQKENAYLSKQLATIR
RDLQLGITEKDIEIPDYKSDAAILYFKSQGYNVLSKDLAKSAGKEVPKDP
EPAADSSENVTIPTAEKGSYKLISSIDELSKICRGLLKSRVLSVDTETTS
PNPAMAELVGISFSNQEKTGFYVSVKNNASLFQDKSLNLDEVREHLGPVL
SSQVPKVGQNIKYDLIVLENHGFVLNNIQFDTMLASYVLRPEGRRHNMDD
LAKEFLNYNTITYEDLVGTGKKKKELTDIDPEQVAEYAAEDADVTFRLYQ
IFRKSIKDSGVEPILREMEMPLISVLAKMEKTGIALDVPYFEELARDFDR
EIRHLESEIHRQAGGPFNIASTKELQKILFDNLKLRIVKKTQTGFSTDHE
VLEELVGEHPIIEKLLDYRKYTKLKSTYVDALPKMVNPKTGRIHTSYNQT
IAATGRLSSTDPNLQNIPIRDREGRLLRKGFTVDSNDYEILSLDYSQIEL
RIMAHISKDPAMLEAYNHGLDIHKRTAAALYGVPETDVTHEMRDKAKVVN
FSVIYGVTPYGLSRNLRIPRDEAKSFIERYMTQYPGVKSYMDSMVEFAEK
NGYVQTLTGRRRPVTDINSTHKSAKEAAKRIAINSPIQGTSADMIKIAMI
KIHEDIEKKHYKSRMLLQVHDELVFEVHKKEKDDFKASMKKHMETAMSLD
LPIVVEGKFGVNWDEAH
>LA2398 priA, Primosomal protein N'
MIYYAEVAFDLPIEEDTFTYEIPPNVQIGVRVLVKLRNREEEGIIVSIHQ
NEPNYKVFQIEKIIDKIPIVLQEQIDLAHWMKNQYIASLGECIYKMIPAG
RRQVKLETFPSDAEGKPVTLNEEQQIATQNILSTFGTAAVHLLFGITGSG
KTEVYIHLIQKVLETPNRSVILLVPEISLTFHIIRKLELIFPGQLAVLHS
ALKVSEKFKAYNELLTGKKRIAVGTRSAVFAPVSNLGLVIIDEDHDSSFK
EHSSPRYHARQIALQRCRTNEAVLVMGTATPSLEIYHLAKEGKIHLHTLT
KRPQGVLPPTVRIVENQKESNVLSSELSFAIKQRLEKKEQIILLLNRRGY
SPLIYSPSTSSYVPCPNCTTNLCYHKKGTTICHLCGHTETLDSLQKRMGE
ALTLKGTGTQKLEENLLEAFPQTRVERLDQDSIQDRSLLNEVISRLLGGE
IDILTGTQMIAKGLDASRVTLVGVLNAGIGLGLPDFRANERVFSLLTQVA
GRAGRSKLKGEVLIETNAPNHPVIQMAMNQNYIQFYESEIPVRKELFYPP
FSRLVRVVSRSKEEQISLETIELVFGVLKKFFPSKDTILLGPAPCPFYRI
DSNFRNHIILKTSSLNIWREIFKKEIRPLKLSKKVYLEIDFDPLDLV
>LA1456 radC, DNA repair protein radC homolog
MKSSGKLSEKLSPFPDPRTRIAYEAESLEDWELLAVLLGRGNRAQPIEEL
SREILHRSKGFGGLLQKQVSDLRKIPGVGIAKATTLLAAIEIARRLKWEA
LKGKRYSSEQLLNFLATSLIPKNRECFVLITLSPEGAVLRAEVVAVGSLE
EVGVQTRDLLKIILNDAASAVIIAHNHPESSSKPSKEDLWIYKNFGSLLA
NIGLELLDQWIFGIDGIYSCKKGKVLQARAKC
>LA2179 recA, recA protein
MGESIMKKAKEDAPSVDDSKKLAIEQAMSQIEKQFGKGSIMKLGSDSAKQ
TVQVIPSGSLDLDIALGIGGYPIGRIVEIYGPESSGKTTLTLSAIAEAQK
RGGVAAFIDAEHALDPSYAKKLGVNIDELLVSQPDNGEEALEICESLVRS
NAIDLIVIDSVAALVPKAEIEGDMGDSHMGLQARLMSQALRKLTGTIAKS
KTVVIFINQIRMKIGVMFGSPETTTGGNALKFYCSVRLDIRKIETIKEKE
ESVGNRVRVKVVKNKCAPPFKQAEFDIIFNAGISREGSLVDLGVKHDIIH
KAGAWYSYNTEKIGQGKEAAKEYLKNNPEIALTIENMVRDLNSLPLLVQE
NNKKSRKEEKLEQAAG
>LA0966 recB, exodeoxyribonuclease V, beta chain
MNVSRTYKFKSSFIEASAGTGKTYTIMEIVIDLILEHKIPLTQILILTYT
EKAAGELKERLRKKLISSGLTKEARELDQVTISTIHGFCNTILKEYPVET
ETHTNWILTDALERLNIALYKLQHEEWNSWVDPEKLEDFILSSKYRFKKE
NILISASKLLSGKKYKYSNETTTMTSETFLQKTALIIADMVLKEFKTSEW
MSYDQMILKTRDSLENPRLRKALQSRYRVGILDEFQDTDGAQYEIFKRLF
LESNDDRALYLIGDPKQSIYGFRGADIGIYLQAKEELKKHKAEEISLNVN
YRSVPELIRAYNEIFGGKSGKQSFFPILEQSIPIQYEPVFAPKENIKVLL
SDKQKQGPIQIVRFTGKDFWNTQDAKNAWGQFISEEILKLIHKEDPFTYQ
VAEGDLYYVEKKLKLREIAVLVKSKSEGKLAEQFLKLRGIPCSFYKQEGI
YQSAESYQISNIFECLLDPNKPSSYRKLLLGDLFQIHPAHLPYFDEHSID
SYEKSILDRWKTLSMDRKFSELFRSIEEDSRIFLTEDATDIDWERKRTNY
RQIFRRLLQFQIANQADLEEILEELKLLQKSFKNEEELPLFEKETEKDAV
QILTLHASKGLEWPVVFLFNLSGDYIPEVYDYPFVDKDGKLSWKLSLWDN
EEEKKISKEIYSNQSLNENKRLLYVGITRSKVRIYLPFYTPLNNWKTRDS
AYYKILYPRLQSILENEIDTDLFHVVSWAPTPFDSTDPKRIDWNLDSEIC
FTPLLYEEPETSKTIRLNSYSSLRTSMNFSEEVSLSVLEEKTRIQSDDVE
NSETVIKDALPSSASIGSFLHSLLEELDFSIFKTTTSKELLKNEKIISRM
DFHLDYFRILKREEGRPILEKETVQKRTVEILWNLLNANILDQKGALFRL
VDLPKENRVSEMDFYLDLDTNPGNFLRGSIDLVFQIDDKFYLADYKSNLL
EDYSTISLKRNVENLESRYDLQRDIYALVLYEYLKNLFGPKEALQKIGGV
YYFFLRGMIYGESSGIYSDFFWSLERIESIRKTVLESTTFRWEKSN
>LA0965 recC, exodeoxyribonuclease V, gamma chain
MSITHITSLSLEDITSELSQNILKERKNFPLRSITIVVPSVNMRSWLNLN
LARISGLCANLRFLFLEKALEEYFHFRAGLDYDPFQRTFPSQDAIQRKIL
TFLIENLNSEETKFLGSFLESIPRAFSLSAKLTSLYKDYELNRSSWIQSW
ANEKGLDIPSISHRPTPFPKEDEYYLFQKKLYQKVFLNSNQPSTLIQFFL
KEVFKNPRRSPQDSLHLFCLSNLADTYLGILESISKKDKLPIYLYQFHTG
ASTKTESLGPQRWSNPQIHISSKIVSIPGTISKNLEDTRIYPEKLSALKN
LLKGEIRGHNVENFSGDFSVRFWNAPSSYREIESVANDILYKMNQDRTLT
YLDFAVLVTDMKVYRPAVEWVFDGGILLQTKVDADPIRKKIPYSLTDINA
NEASLLYRGLMNFWEICSGNFVRKNDLLKLLRNPLLQKKIRIHSEDVQEL
EKLIETSGVRYEESGRENDTFQISNGLKRIRLSSILSQEAAWTKYKISQI
PLESEEYSLHLTLFWETVLKVKKDILSFFANETIRWTSEYLKVVQTSIEE
LFEFSEEYEQEAKLFHSWLQSLSEWEGIQLQNPEQGISLLKFLTEQVFNQ
IPYRKGAYLTGGVTISLLQPMRPIPFKHIYVLGLGEGKFPGSDDISQLNL
RKHFREEWDISKREIQEFLLWETIHSAKESITFSYVGKNLQEDKTFEPCS
HFLEIMEFLEVKDVVRLPLHSYSIKYEHTLRELKQGLVSYDFARVWVNGK
RKDHPVLDRFQNPDELIKNHSDLSRSTIDVKELSQFLSDPLDTYLKRKLA
MYLEEEYAEENEKEPFYLDAIEETHILKKVHALMIPDLVLEKPWVWDQEK
IVQAVTPILEKEKFSAKFPASVFGKIQEVDLVQYLVKTSEHLSEWKPLFQ
GGKYYPYLSLGDTGLPDAICKKLPALKVPLESGDFFLQGEWEHVIEKEGN
LYWLFSKSLEEKPSEDYFGYKDYWKVMSFPFLTGVAFASSNENFKIYSFK
PRPSEESKKKNILEFEYDIKSSSLGMEYLTKIVTEYLKEEPIFFPRRAFL
SYYVKNIQVSSGKNKTVEDLSKFEDESVWIRFLKEELNGIKENLSPLVKL
YPKTPELILQSRIRWAKYFYKPLLNWKKIYER
>LA0967 recD, ATP-dependent exoDNAse, alpha subunit
MGEIELIFIEKLRTDILELIQSSEFEKKKLIKVSEEDSNYILEILNSIWE
ATQEGSLCVPVKQEWKKILKIKLPGLVVDRFENTEWVYFEKTYHSKIELE
RLLKERIENNVSVNVDTDRVEKILKDLESKSFSLKKNQKKTIFSCLNSSF
QIISGGPGTGKTTVVAFLLQILNELKQLPSPEKIALVAPTGRAAQRLTES
VQENLKKISTSLENGFLRGQTIHGLLNYKQSLGGFYYNRERYLPHRLIIV
DEVSMVDLDLMLSLWNSIPKNETIQEGTIPFRFILIGDPHQLPSVEKGAV
LSDFLSVLESKRFHFVSKLEESNRQQPNTNGEVSKIVTLAEEILKYSPEN
LNLGTKIDESFPKTNQIKENILYKSEVVWLQNIESSTSDYFSRDELVEKL
WKEIFYPQIDRISSWKIQDSFCFDKPDFIQKFQEELKRFRCLTIFRSGYW
GVDAIQTKIMNLAAQNLFFGKGGNFFAKRLSKGIYFVGLPILITRNDKSR
KLFNGDIGIVLKTESTGELRAVFPIEGKLFQFALDTLPEHEPAFVMSVHK
SQGSEYDTILIYLPDSIEEEKSNGLLNRQVLYTAITRAKKQVILAGNPQT
WEIGIANFQNRNTGFRI
>LA0003 recF, RecF protein
MFLKHLTIQNFRNHEELSLDFDSRLIFFVGDNGEGKTNLLEAICILSWLK
SFRESEDSNLIRWGSENYFLRGKIKDNLKESVLEIGFTSKPSVKRKLKFN
QEEIKKRTDLIGKFITVLLTPMDLKIIEGGPAERRKFIDAFISSFDPFYL
ESLLEYNKILKHRNALLKSGNPDISHLSIWDKKIVEKGIFILNKRREVVL
ELNSFYRVNLDKLSGGKDGLELIYKPNVKDQDEFLEKLNHNLSRDLRLGY
TSVGIHRDDLFIGSDQRDITEFGSQGQKRSTVIALKAATFNYYKNILNTI
PVLLIDDVIRELDVKRREYFVDLVVTAGQAFFTTTDLEGIQDYVGKLKDQ
KQIFLIRQGKVESIK
>LA3945 recG, ATP-dependent DNA helicase recG
MKNSVSKTETQSNGLLLPVTVIKGVGPSKAAALASIGIDTLQDLLNFFPR
RYLDRNLTDNVLLKTGETVTLIVEVIDAYLAHGKKSRLVVGTKTRNNERI
SIVFFRGVNFFQKIFQPGTTLVATGKLEYFRGFQLIHPDYEILTSAIKTT
YPISSTGSKKKKQEQEPEEELSELPEMIHAGRIIPLYPSGEVLKSEGLDS
RGFRKILYSALEKLKGKIPEILPNEIVKRRGLILREESYREIHFPTDENS
LDTAKYRLKYEELFYFNLLIEHKKKEREKIKRVLWPLPESETANKVRKNL
PFQLTEDQNSALQKIKELTNKEQPIAVLLQGDVGSGKTLVALLTALRYID
NQIQVCMVAPTEILARQHYQTILSFLGNMPFLGIELLVGKEPKKNRYEKL
YRIKKGDTLFVIGTHSVFQEDVYFSELGLVIIDEQHKFGVDQRETLRSKG
KNPDILAMTATPIPRTLCLTLYGDLDLLTIKSKPKGRMPIQTKWFQEDRR
EGIYKSIRKYVSSGRQCYIVYPLVEESEKVDLKSCIEAYEYLKHEIFPDF
EVGLVHGKMEVEEKDRVMREFSKNRIQILVSTTVIEVGIDVPNSTVMVIE
HADRFGISQLHQLRGRVGRGDQESFCILMTDSKVTEDAKVRLEAMVNFSD
GFALSEIDLQLRGPGELMGVRQSGLPDFKIADLRKDSNLIELTREDATLF
GNPGDLEKEEIRGRFSEGRLLFSN
>LA2999 recJ, Single-stranded-DNA-specific exonuclease recJ
MIQPSPFSHGPGLKELHPAGLRPHFSPLQHRFYETHLREKSHPEHLLYHG
LRELPSPFLLPDLEPALDLLKEFIQKEKKILLFGDRDCDGVSSTSLLGSF
LKKIHRGELILKTSNEEDYGLCPAALDFVKRIKPDLLITLDFGTTNHIQI
DELASIGIKVIVLDHHEIPERIPDCYLISPKRSDSIYPNEKICTSVLALK
FIQAYLYSSLEEYNRATWIGDGNSLFSGFLIYRGKLLFQGDRQEAESKFS
LTIQDESYSFQSSYPEREWFYQEFLKYPAILEQYLQNFDLASIGTVSDMM
PLYGENRIIVREGCKILFKLHKKETSHREGLFQLLQLMEFADNRVTSKDL
GWGLGPMINSAGRMNRTDVALNLLLEENPELAKSGAKELQKLNEERKERT
KRNIFKVDSFLKRKKERTERSVLFCYEPDFEPGVSGIVATRLVEEYKRPV
LFITPDHGHAKGSIRAYGKENVLNLLKKAESVFLQFGGHKEAGGFSLEID
KIPELAKLIFDNADNWLAEEQMTSTSEQTESLISLKPEELNPKIFQELSI
FEPFGHENPIPLYSIKNAKIYHTKPMTDGKHVRFRILGAPESIQCLIWNR
GKDFLELISKSVSLDLWGSLEESTFRSKTSLQFIVNYFQESEN
>LA2321 recN, DNA repair protein RecN
MLKTLNIRDFALIEEACIDFQKGMTVITGETGAGKSLILDAISSLLGGKS
SPMEIRTSAPRYVLEGVFDLSKNPAAVEWLKEKGFPFESKELTLHRECGR
DGKSRILINQSLASSTTLRGLGELLAEVHNQNDQILLLDRGEQLDIIDLH
AGLVPLRNEVKECFLTYRSLKKRLEELRKSEEEKSKRIEFLNFQIREIRD
VDLKEGEEEGLNQEEHLLAHGELLAENYEILSSHLADSESAILPLFPKLL
SAAEKIKSIQPDFSKTLDSLQEIYIQLKEINSSILDEKEEIFFSPDRLQF
VQSRLDLISKMKKKYGSDLAEILDCKNKAEQELEAMEKNSKNKESMEVEI
EKIASRLASLSIQLSKSRRESLIRFESSLKLELEQLGMPGAAVQVVLRWE
PNPEGEVSASGKSYIVNESGLDQLEFYFSPNPGEKPRPLRKIASGGEVSR
VMLAIRSILGRQSNLRVLIFDEIDSGLGGEIAMDVARKLRNLAENHQLIL
ITHLQQIASAANDHLKITKSVEGGRTFSKAEFLSLEERTLELARMISGQR
VSKGALEHAKELLKKQAV
>LA1685 recO, putative recombination protein O
MSGNSPGALKKIRGIVLESKTIQEGDALIRLLPEAGSVENFRIRGIRKSK
TRPIASVEPGSLSDVDYYHSKNKETHNVKEISLINRFDRAKSGYLGTVLV
SYLVELASSFTPDGAEHPGEFRLLFGALEELEENGISILILPFFKLRLLV
SGGFLSKELICHSCGAELKEMTFVTLQTTPLELICGNCLYGDRNDLGCVQ
WIQTFLMLRFRDLKEREISVENILDLDRICNQMLEPILRKKLKSAPTLYE
ALGENLGKFS
>LA0625 recQ1, DNA helicase RecQ
MFTISNPMNSEQKQHIQNLEYTLKKKWGLSKFRSGQKNAIESLMAGKDTL
AILPTGGGKSLIYQFPAVLDETSLTLVISPLIALMKDQVDSLKAKGIAAE
YCNSTQDDLEQLRILSRAVTGKIRILYLSPEKALGRQVLEILPKFPLARI
AVDEAHCVSQWGHDFRPEYRKIHELREKYPKSIPVIALTATATSRVIKDI
SDSLGLKNPILIKGSFYRENLSFSVRFPQNEISRENELLKLLAQGNFQKI
SSGRAIVYCATRQKVESVYGFLKKNGFKVGKYHAGRTDSSRERAQDGYNN
GKTNILVATNAFGMGLDQPDVRLVIHYQIPASLESYYQEAGRAGRDGKPS
DCILFYHPSDLVTQGFIIGKENNRKGGETLLSHLKEYSIANKCRQQALCS
YFGEEILPCKICDICLEKESVESNFADERNQFLEREKTKRLKQEVKERYS
FSKEELETIEKVLEQIPGKFGKRMIVGILRGSRSNEILRKKLDRLIGYGS
LRSVAEEAILKILDEWISEKKVKIVGDKYPKLILSSTVILKVPRKKKSLD
VESEKKVIPAKNLIQELKNFRDREARRRKWKKFMVLQNPVIVQIAKVMPE
TPEELSCVKGMGVAKVEKFGNDILRILEKWK
>LA0964 recQ2, DNA helicase RecQ
MTSLSELKTLFGISSFRTSQEKIITDVLSGKNCMVIMPTGMGKSICYQIP
ALILEGLTIVISPLIALMQDQVLKLKQLGIEAGYINSSLSKQDRLQSYQY
LKEGKYKIIYVSPERFRKNEFLDCLKNRKISLLAIDEAHCISQWGHDFRP
DYTKISEFREILGKPITIALTATATTEIQKDMILQMGLENSEIIIYNEGI
CRPNLFLDVRTFVDEPSKSNAILELLKKQNGSTIVYFNLIQNLEKFCEKL
DIQKIEYQVYHGKLTTDQRKKVQNQFLKSNDKILLATNAFGMGVDKPNIR
TIIHAELPSSLESYYQEIGRAGRDGKPSDCHVFYNQDDLSVLMDFIEWQN
PDAAFISRTFQTLKRLGEELSSIDYEDLQSKIVFKNRGDHRLQTVLNLFD
RYGVTSGELEKNSLKLISTLPEALCSAELLELKKKTSLKRLYQMLLYLKS
EKCRREFVYEYFDAKFSECGNCDICKNSSESK
>LA4333 recR, Recombination protein recR
MAENLANHLLDEMIEALSSLPGIGRKSAFRISFHLLRLEQGLFNQFIHQL
TDTKNKIKFCKRCGSYAETEICEICVSEKRDSHTFCVVEQPEDIFFIENT
REFQGKYHVLNGVISPLEGIGPRDLRIKELLERIEPEQVKEVLIATNPTL
EGDATADYLANQLKPISVNVTRIAYGITVGGSIELADQYTLGRAIRSRLQ
L
>LA1972 rnhA, ribonuclease H-like protein
MITIYCDGASKGNPGPSSIGIVAYIHEKEEFRISERIGETTNNVAEWSAL
KKGIEECISRKFDSIHAYMDSELVVKQVNGKYKVKHPNLLEYKKEVDKLI
SSLHSFQITHVPREKNSVADKLANEAFRK
>LA2903 ruvA, Holliday junction DNA helicase ruvA
MISGLKGTLKKLEVGFVHIETGGITYEVTISFKTYLELKNLPLLKEVQLQ
IFHSINERGQKLFGFLTEQDKEFFKVMKGLQGIGELTALKILSFFSAEDL
YRIVQSGEAKELEKIPKVKGKTSEKIFFEVKQNLKKLELFLSGSSKESSV
ILTSLLQSPEEMAFSKKRETVILGLVQLGFEEKTASKEVDKVLKIFSSND
PGEIIREILKSL
>LA0810 ruvB, Holliday junction DNA helicase RuvB-like protein
MAKSHTLNPEEEFEEESGLRPSLLSEFIGQKEVLNNLTVYVQAAKNRKRA
LDHVLISGPPGLGKTTLAGIISNELGTRLTITSAPVITKGADLARLLTSM
GENEILFIDEIHTLPKKLEEILYPAMENYMIDLVIGEGVTAQMVQIPLKP
FTLVGATTRSGLISEPLKSRFGIQLRLDYYNDEEMKQIVLRSSKILGVLI
EDDAALEIGKRSRKTPRIANHLLKRIRDFSEVEGNLSVKKNLCLKAFEKM
GIDDLGLDGMDRQILDCMIDRYKGGPVGLKAIAVVVGEEEKTIEDTYESF
MVRIGLINRTPAGRVATEKAYRQLKRMEDFSVHHGQDPTLF
>LA0723 ruvC, Holliday junction resolvasome endonuclease subunit
MDTIFMRSSSSSLRIIGIDPGSHRAGYAVLEKNASKIKILNYGTVEVPSG
TPSPDNLLMLRKGLREILEEFNPSIASVEEMFFAKNKKTASRVFESRGVL
LVTLAEMNIRILEPTVSQIKKGTTGSGTADKKQIHQALKLLLNVELLKGH
DDSWDAIAAAYVGLSMSSSPLLSKLR
>LA4340 tatD, Putative deoxyribonuclease, tatD family
MVSIVDTHCHLDIIQSQGLEIADSLKNAAESGVKKIVQIGIDLESSIRAR
SIANEYSNDSLEIRYSIGCHPTETHEFPNKEEILKFVYENLGDPKLSAIG
EIGLDYYHTADTKKQQKDILESFLECSSKSGLPVVIHSRDAKEDTISILK
NFRDQAFGVIHCFTYDYLTAKTLVDIGYYISFSGIVAFKNATEIQEAAQK
LPLECILIETDAPFLAPPPFRGKRNEPSYMKFILDKMFSLRKESNSDVEN
KLFENSIKFMNRKAYHYNA
>LA1721 topA, DNA topoisomerase I
MSLLIIVESPSKAKTIAGYLGKEFRILATLGHVADLPKTTLGLDLKNRFE
PEYVILPGKKKILSEIIKTAKQSQKVFLATDPDREGEFISAYIRDRLQKK
SNVFRIRFTEVTKQAILNSLQNPDTINEFLVDAQKTRRIGDRLIGYFISP
VLWKQIGPGLSAGRVQSVALKWICEREEEIRNFKIEIYYNILLHGTDQKG
IVGIFSRTGDRIFSKEKADQILQNVQKEKEFRISEKKETLGKLFPPPPFQ
TASLQQEAFKKLRFSSKKTMSLSQRLYEGMDLGNGQRNGLITYMRTDSVR
LSSEFVERARSWILSELGETFASPLERKIRKSAKKIQDAHEAIRVTDPFL
TPKEIKKFLGKEEAALYDLIWKRTISSLLPAEEFIKIEYSIFAAGECFQL
ETKKTIFPGYKILNEADVKTNPSWEKGELFILQKVECEKKQTEPPSRYSE
GTLVAKLEKQGIGRPSTYATVSETLLKRKYVYEEKKFFYPFSLGEKVNFF
LQSSFGELFREKFTAELESDLDRIEKKEIDSNSILNRLWLDLQTQIQNSK
FILFQKEWATVLQKKKETGWGICPVCRNGILQKKKSSRKKEFYQCNRFPD
CEFVSYELPESLE
>LA2212 uvrA, Excinuclease ABC subunit A
MQEIRIRGAREHNLKNINVDIPRDQLVVITGLSGSGKSSLAFDTIYAEGQ
RRYVESLSAYARQFLGQMEKPDLDLIEGLSPAISIEQKTTHRNPRSTVGT
VTEIYDYLRLLYARVGKPHCPECGTPIQSMSIDQITARVLAFPQGSKLQI
LAPVISGKKGEHKDVLEKIRKDGFNRVRINGEIRTLEEEIVLKKNFKTSI
EIVVDRIVMKEGIRSRLADSVETALKQSEGLVILDDGSKDHILSQKMACP
NGHDIGFTELSPRMFSFNSPYGACETCDGLGSLLEFDEDLLVNDPELSLV
DGCIEAWAGSKSNGFWFMATLKSLSDSLKFKMNTPWKDLPEKTRQTILYG
DKKIKIEYDFRGANSHYEFTKEYEGVIPNLQRRYKETKSDSMRQWFESYM
TNHPCPSCKGKRLKRESLSVKVHNVPVDEFTSYSIEKALNFVQNLKVTGA
EEIIAKPILKEIHQRLSFLNDVGVGYLTLERSAGSLSGGEAQRIRLATQI
GSRLMGVLYILDEPSIGLHQRDNTKLVSTLKNLRDLGNTVLVVEHDQETM
EESDWLIDMGPGAGVHGGSIVCAGTPAEVSKHKNSLTGKYLSGRLKVPIP
AKLREGNGSKLQIIGAKENNLKNIDVNIPLGKLVVITGVSGSGKSTLIND
ILYNAAAHKVMKMKTLAGKHKTIKGFENIDKIINIDQSPIGRTPRSNPAT
YTGLFTPIREMFAGLEEAKLRGYGPGRFSFNVSGGRCETCEGDGILKIEM
HFLPDVYVTCEVCKGKRYNQETLEVRYKGKNIFDVLEMTVEDANQFFENI
PIVKRKLETLLEVGLGYIRLGQPATTFSGGEAQRIKLATELSKRPTGKTL
YILDEPTTGLHFEDVRRLSEVLHTLVDRGNSMIVIEHNLDVIKQADWIVD
MGPEGGDGGGLVIAEGIPKDIAKIKNSYTGQYLKKIFTSSEKISRKTK
>LA0649 uvrB, Excinuclease ABC subunit B
MASVFKIHSAYQPAGDQVKAIQNIADSFQKGEKKVTLVGVTGSGKTFTMA
QVIQNLGLPTLVLSHNKTLAAQLFREFKEFFPENAVEYFVSYYDYYQPEA
YVPSSDTFIEKDSSINEEIDKLRLRATSSLLEREDVVIVSSVSCIYGLGS
PEEYTNSVVALKVGDTIERDTVIRKLLHIQYNRNDLDFSRGNFRVRGDSI
EIYPAYHTDGIRIEFFGDEIDSISRINPVTAQTIFKLEKAYIYPAKHFIT
SGPKVKEAVENIRAEVDAQTDFFRKNNKLLEAERILSRTNYDMEMLQEMG
YCNGIENYSRHLTGRKPGERPACLIDYFQGEFLLIVDESHVTIPQIGGMF
AGDRARKQTLVDFGFRLPSALDNRPLNFQEFETLTPRTLYVSATPAEYEI
EKSSKVVEQIIRPTGLLDPIVDVRPTKNQIEDLLVEIRKRIDAGERVLVT
TLTKKMSEDLTDYYEEIGLKVAYLHSEVETLDRVGIIRDLRKGIYDVLIG
INLLREGLDIPEVSLVAILDADKEGFLRNYKSLIQTIGRAARNVNGTAIL
YADKTTDSMAKAIEETKRRRKIQEDHNLKFGITPLTIKKEVGDIIEREEK
ERTSEDLVLEDVEKKFNSKKFPNKEVLKEKLREEMMKAAKELDFERAAIL
RDKMLSIQTEDSSAKN
>LA2166 uvrC, excinuclease ABC, subunit C
MPEILNHTLILEKIKNLGASPGCYLWKSKKGEVLYVGKAKNLDKRVRSYL
KENHPDVKTKVLQREIFDLDWIATGTEKEALILEATLIKKHNPRFNVRLK
DDKKYPYICVSLSEPYPMVYVTRKLKDNGDRYFGPYSDVKSTRETLDIIL
RIFPVRKTRQVLPLPRPRRPCLNFDMGRCLGPCQGNIPVEDYKVIIDQVI
QFLEGRKESLVSDLNIKMSNASERLDFEKAARYRDMLQRIQNFREKQTVV
SLEGGDEDVIGFARKQDEGQVILLEIRGGRLETKKSFPIQGVLDAENSEI
LGAFFRDYYLNASLVPPCIFIPADIQDEVIPVIDVLQEKTGFRPKIKFPK
GGDKRSLLKIAEKNAELGLSERLLATHYRDQTASLKEIQEMFSLERLPHI
IECYDISHFQGSQPVASGVMFVEGKPFKQGYRKYNIQGYEGINDPGMIHE
VISRRLQRIINEEGVFPDLIVIDGGLTQLTKACEAAVEAGAEGIPMVGLA
KKREEIFFPGENEPFIFDMNSPGMKLLRHLRDEAHRFGVSHHRSRRNKET
MRSLIQEVPDIGFKRSKLLLQHFSGEKKIEEATKEELLLVPGIGENLAEK
ILKQLQKKE
>LA2347 xerC, Integrase/recombinase xerC
MILWRISLGDYPFQFPEFSSESLNETAKKFINYLKIEKNYSQNTINAYSI
DLKFFFEFCEKEQLDIFQIEPVDIRSYFAYLAKKHEIDRRSQSRKLSSLR
TFYKVLLREDLVKSNPATQLSFPKVRKEVPKNFRINETEEILEFESENAS
EVSEIRDRAMIEVLYSSGLRVFELVNAKLNSLSKDLTVLKVLGKGRKERF
VYFGKEAVSSLQKYLEYRNVSFPDAEEIFLNQRGKKLTTRGVRYILNERR
KKMGWEKTITPHKFRHTFATDLLDAGAEIRAVQELLGHSSLSTTQIYLSV
SKEKIKEVYRKAHPHARK
>LA2356 xseA, Probable exodeoxyribonuclease VII large subunit
MEDSKPLSVSEVTRIIKNLISGSKDLKNIWVRGEISNYSKASSGHIYFSL
KDAGSLIRCTFFNYSNKNYSGKPLSDGKEIQVYGTITLYEAGGSYNLNVT
RVEELGQGDILLQIEKLKQKLAVEGIFDPEKKRRIPSFPKTLGIATSPTG
AAIEDIIKISRSRFPGINILIAPCIVQGEDAPDSIVAAIEELNHPNWKVD
VIIAGRGGGSFEDLMAFNDEKVVRAYANSRVPIISAVGHQTDVLLSDFAA
DHFTPTPTAAAEYAIPKEEDVLQFLSQLEGRIKSSLVTKISSNRDRLRLL
SGKFIFKEPMQLLNQRSQRVDEIGIRLQKALSNKLNLARVRLERYQNLTS
RIQNILFHKKQKAEFWTSKVEDLSPAATMKRGYSILRNENGKIIRSPEET
KPEEELQVLLSGGTMQVIRKGK
>LA2357 xseB, Probable exodeoxyribonuclease VII small subunit
MVETKSKISFEDALMELEQIAEKLERQDFSLEESLKAYERGMELKKICQG
ILDTAEGKIEALTKDESKKTNKTGFRGESKTTETKNNTAQEEDLF