TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Porphyromonas gingivalis W83, W83
Gene type: CDS

Number of genes found: 166

Free access
Sort by:

 



# Porphyromonas gingivalis W83, W83

>PG1276 DNA-binding protein, histone-like family
MLFFKRRQSCIAHKATGKKLWYPQTVINGSIASTLHIAEQISELSGASPG
DVFGILRDLGIVMRRELASGKKIKLDGIGCFRLIAQAKGSGVEKKEDVKA
SQFNSVRVNFRAECRYNTVTRERDCTLIAPDLKFAEYGKPLPAGASANAG
DSNSQTGGGDQGSGGGGL
>PG0050 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVNKHSKHFTGWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG0820 integrase
MRSTFSLLLYINRNKVRVDGTTSVLCRISIDGKNTVITTGISCKPQQWNA
KNAETSDARTNNRLKKFRSDAERLYEDLLKRYGVVSAELLKNEIAGHVVV
PIHLLQMGERERERLAVRANEIGSNSTYRSSRYYQSYIREFLESKGMSDI
AFLDITEEFGREYKVYLKRYKNFGASQTNHCLCWLNRLVYLAVDHEIIRA
NPLEEVEYEKKPPAKRMHISKAELKQLLELKLPQNDPLKELARRTFIFSC
FTGLAYVDTQLLYPHHIGKTAEGRRYIRINRRKTKVESFIPLHPIAEQII
NLYNTTDDTQPVFPLPSRDMMWFEIHELGVIIGRKENLSYHQARHSFGSF
LISEGICTESIAKMMGHASITSTQNYAKISEKKISEDMDRLMERRKNNEY
>PG1177 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG2072 UvrD/REP helicase domain protein
MKRSKQLRIYTASAGSGKTHTLTGEYLRLALRTRGAFRYIQAVTFTNKAT
AEMKERILEELYSLAVGGSSPFAEELMQELALTTEQLQVRAQEVLTEILN
DYSSLRVKTIDSFFQEVMRAFSHELGLPGGFRIEMEQKAVLEQAVVRLLH
SLGEKDTSDVENWIRRLAEDLIEEGRGHNIRREIVSLGDELFKEQLLLLS
EEGKLPTKAAIHRYQTEMNKLMEGFEQRRLSIARRAEEIVATAGISFYDF
KGGTKGGILEFAKVLKGGEVKPPTKTFMAMAEGDPETTLYAKTTPATTQA
AILSAYQSGLKECLTEMATLYLGREWQEYSTAKQSLPFLNRLGIISDLWR
QIEGIRQEENKMLISDAPSLLHRIIDGSETPFVYDKIGVRIEHEMIDEFQ
DTSRLQYENFKPLLSESLAHGKYNLLVGDAKQSIYRFRNADRRLLTEVVS
RDFAETSERVNLPYNWRSTPEIIEFNNSLYKHLPQILCEAMTREAETMAM
PDPKLPEEINRTFMQTYADVEQLVPPAKVDRHGSVCIYLPSPASSEEAEA
NLSWEEQILQDLPRFIIGLQKRGYAPSDIAILVRKTYQAREIARAMLSYQ
PEPDEEDYPLIPMSDESLSLSGAASIRFLSNLLKFISRPQSDALRQIAYL
SYEELRKEKGLAPTEEGNFSAAELAEFANLRRRSLYELAEGLVSFFHSYL
PEREMPYLIAWLDLINDFGHERSADLHSFLQWWDETGHAKSRISSAPNSQ
AVTLMTIHKAKGLGFRVVLIPFLDWNLDDEAAHRHILWCKIDPARSPFNI
LPVVPIRYKKEMAQTMFATDYFRERADILLDNLNLLYVATTRAKDEMHLW
LHPSQKPESLSTVGDLIHLALASLDENKTESDMYCWGSPVISARQKTTDH
PSSGLPFTLPKGSLSAVADRLAIRPEGSEFYRRHKPLYHGHVMHRILADI
VLAKDIEPALERYVSGGIVTRDEAIELVDRLSVVTSDSRLSRWFDGSGRV
LNEQDILLPEGEQRRPDRIILYDDHTDIVDYKFGAVRKVHHAQMENYIRL
LVSMGYPSVRGYLWYLPNNEIVGVRSVRGITRSKGEERKEGKGERGIERK
AK
>PG0062 TPR domain protein
MPRRKTKKREEGLSAIDLANEEMGYVLSLIERTDQSLFLTGKAGTGKSTL
LRHICATTHKKFVVLAPTGLAALNVGGQTLHSFFKLPLRPIPPNDPDLST
RDRRVFDVFRYTNEHKKLIRSLDLIIIDEVSMVRADVIDAIDKLLGIYRG
RQERPFGGVQMLFVGDLYQLEPVVTADDADILKRFYPNPFFFSSEALRAT
PPVTVELTKVYRQTEQAFVQILDRIRSGRLTEQDLQDINKCVNRHYEPPK
DEPVITLATKRRHVSYINERQLEQLQGDPVTLYGTIEGDFPSSSLPTEEE
LTLKQDAQVIFVSNDKDRRWVNGTLGIVAGFDQESETIDVCLSTGEVVTV
EPCIWDNKRFSYNEQANKVEEEVLGRFVQYPLKLAWAITVHKSQGLTFDR
VIIDFSEGTFAGGQAYVALSRCRSLEGMVLRAPLAGRDVIVRREIIDFYS
RANDLEIINGSLLRAEAEREYAQALRLWNARHYTEAIGAFSQALARKNEL
DDPLFRRLLVSKLFEMEYARRREEELRRELAEQREVMRRLAKEYVLMGND
CITEAHDAHAAIRCYDKALTLYPDYVEALVRKGKTLAGLNEITTSLSTLN
QAIAIAPMDFTARFTLGQVLEKFAYYEEALDAYRQADGLNSQSRSLLKRL
IALCEELDEEDEANLYRIRLRKNRDQTGHK
>PG0222 DNA-binding protein, histone-like family
MLNLSVKSRLMKIGKHKDKTMYYAQVDKPRVIEYEDVIKDIAEMSSLTTG
DVRNAIDRLAYYLQRELTEGNTVRLGQIGTFRVSVPSKYVETEKEVNASI
LKKPRIRFYINNTLSAVADNIRLAVYRNGQKVDDTTSPSTPSEGGGEQTG
GSGGL
>PG0838 integrase
MNIKRNIIFALESRKKDGVLIVENVPIRMRVNFASQRIEFTTGYRIDVAK
WDADKQRVKNGCTNKLKQSASEINAALLGYYTELQEVFKRFEVAEVMPSP
ADVKSAFNARHRSDDVQTEQTSDKPDASAEFYKAFDEFVRVCGRQNDWTH
ATYEKFAAVKNHLRSFRADLSFSFFNEVGLTEYVRYLREVRKMRNSTIGK
QLSFLKWFLRWSFGQGIHSNNAYDTFRPKLKDTQKKIIFLTWEELTKLRE
FEIPPAKQSLDRVRDVFLFQCFTGLRYSDVYNLRRSDIKEDHIEVTTIKT
SDSLVIELNKHSKAILEKYQDVVFEHDKALPVITNQKMNEYLKELAELAG
INEPIRQTYYKGNERIDEVTPKYALLGTHAGRRTFICNALALGIPPQVVM
KWTGHSDYKAMKPYIDIADDIKANAMSKFNQL
>PG2194 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVNKHSKHFTGWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG0277 ISPg2, transposase
MYHSLFESLCMVPDPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYA
EEYEEELKSLYEMLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIE
GFISATSGKHICIDGKTMRGVKKLSFDTQSHVVSAFSPQDMCSLAQLYID
RKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQTAIAEQIIDKGGNYVLCVK
ANQSLSLQEIEAYFCPLFQKHILLDEQTELSHGRIETRRYESILNPLEIE
ANEVLTRWKGLRSIHKVVRKRRDKKSDKTSEEVAYYISSLTDVSSLKQAI
RGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT
NMKSSIPRIQKKLARMKPQQLITIQF
>PG1248 conserved hypothetical protein
MDHSQLTAQAVTAHFESIGNPERAGQMQRFFKTAPGEYAEGDLFLGISVP
DIRAYEKTHRPWRVEILALLLQQPYHEVRHLALIGMTELYRRARSEAVRE
GLLASYLSHTTMINNWDLVDVSAPGIVGEYVHAHPIEGNALLDRLADSCL
LWEQRIAMVANWRLIRYGEYEATIRIAERLLHHPHDLIHKAVGWMLREMG
KKQERLLLAFLDKHAATMPRTALRYAMEKLPSDLRSYYLTKGK
>PG1281 hypothetical protein
MWSTNYEYHLFRTMSTNNITIGSRVRFLNSQGGGIVRRISGRIAWVEGED
GFEIPTPIQECVTVGDNDTFIPAYKSPMEKRREEESKKEERNKRHKEVAV
ASDRKPTSSEEKPHLLHTELAGHDKLNVYLAYLPMNEQRLGTEPYECFLI
NDSNYCLQYNYLSLMSTGWKIRATGSIDPNTKLFIEEFTAADLQDMERIC
LQLTAYKERKTFLLKPALSVELRFDTIKFVKLHCFIENDFFEDKALVYPM
VEDDRPVVEKVFDPESIEQAMKAKELIDRTTPTPARKTSTEKKPNIIEID
LHAGELLETTAGMKNGDILQYQLDKFHEVMKQYASCKGQKIVFIHGKGEG
ILRQAIEKELRTRYKQHRFQDASFREYGFGATMVIIH
>PG1874 conserved hypothetical protein
MADHNDRGRQGEEIALKHLRQQGYQIEALNWQSGRRELDIVASTSRELVV
VEVKTRTEGFLLAPEEAVDARKRRLISESAHHYVRMYAIDLPVRFDVISV
VLSADGSCKRIEHRENAFPLLLKRSQRSTPRRRRL
>PG1746 ISPg2, transposase
MYHSLFESLCMVPDPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYA
EEYEEELKSLYEMLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIE
GFISATSGKHICIDGKTMRGVKKLSFDTQSHVVSAFSPQDMCSLAQLYID
RKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQTAIAEQIIDKGGNYVLCVK
ANQSLSLQEIEAYFCPLFQKHILLDEQTELSHGRIETRRYESILNPLEIE
ANEVLTRWKGLRSIHKVVRKRRDKKSDKTSEEVAYYISSLTDVSSLKQAI
RGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT
NMKSSIPRIQKKLARMKPQQLITIQF
>PG1985 CRISPR-associated protein, TM1792 family
MKTAAYMIHCLTNMHVGKGDATYEVVDKCVQRDVTTGMPCIYSSSLKGAL
RQFFSKQGFAELNYVFGKDQSRDKKNNGNEKNEEEHSVGNFTFFQANLLA
YPVQTTQEGATYILKTNKTLQDPIKELANLFSAPSNIEEMLPKETEKDLS
NETLEQLPVIARNQLDNGKSENLWYEEVVPAESRFVFFVSYDDEEIFTKF
DKIIQEKVIQIGANGSIGYGFCKIANVTNPFKK
>PG0223 exonuclease
MLDFAAIDFETANAERTSVCSVGIIVVRGGQIVHRYYSLIRPEPDYYNYH
NTRVHGLTAADTESARIFPDVWAEVEPLIAGLPLVAHNKPFDEGCLKAVF
RMYRMDYPDYPFFCTLQAARRQLRHLSDHRLPTVAQCCGYTLAQHHQALA
DAEACAAIALKLL
>PG1710 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLAQHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVEQA
VRMYNEARPHQALGAKTPMQVIAPESKNPLLARIEHWPEIAPELYRRMNV
RQRANFARVNRN
>PG0864 site-specific recombinase, resolvase family
MIYGYIRVSSDKQTVENQRFEITNFCDRHQLVIDDWIEETISGTKAYNKR
QLGRLLRKVGKDDIIICSELSRLGRNLFMIMEILNICMTKECRVWTIKDN
YRLGDDIQSKVLAFAFGLSAEIERNLISQRTKEALARKRAEGVVLGRPKG
AKTAPEKHKLYRKRTLIAELLKQRISQRKIAELIRVDRGTLGRFIQSRKE
LKDLIH
>PG0932 DNA polymerase III, delta prime subunit, putative
MDRLQLYESKKLRRETPLSPLLRIFAGQSVHNHRPMYFRDVAGLEDLKTY
LRRTAQTRQIAHAQLFAGEEGGGAFPLALAYARYLNCQMPTDTDACGHCP
SCVKYDALAHPDLFFVYPVVNASSSPAPSDDYIRQWREMLGSESYFTPAD
WLEYIKAGNSQPIIYSKEAEAVEQKLSFRIYEASYRVVMIWQPERMNEAM
ANKLLKLIEEPPEHTLFFMISSEPDKVLGTIRSRTQLINVRLLHEIEIVE
ALSRNNQGNTADIIRIAHLAEGNYRRAMDLYRGEWADRDNFVLMGRMMGS
IIKGDPSKMRPVADELAALGRVSQIGFLTYCLRLFRELYISRVGVAKLNY
LSPEEESFVDMLSGGITGQNIRPVMEEVELAIRHIRQNGNGRMIFFDLLL
RLTAALAPALRAKGLK
>PG2057 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYAGKSYTDLLH
GRGCRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVDRA
VRMYNEARPHQALRAKTPMQVIAPESKNPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG2020 CRISPR-associated protein, TM1814 family
MRIELSIKADDSLVSFSHQHLLVGTLQKWLGENDMHGKSISYSFSRLNGG
KLVSELNSILFADWANMFVSAHDPELIRRMLAGIRQDPEMFKSLCVREVT
VIEDPDMTDREIFFPASPILLKRWREDNGFDHIVYTDEAANALLTENLRK
KLQAVGIDDPTATASFVPDQGKAKVMLIDYRGVKNKASWCPIRIIGNAET
KLFAWNAGIGNSTGIGFGAIK
>PG0330 DNA-binding protein, histone-like family
MSLKYVIKKSVAKIGPKAGQTIYYAQPAAQDSVTFHSLCKRIAEESALTS
ADVKGILDRLVNILSEELPNGKTVRIGELGSFRLSFGSKQLDDPKNFSVD
QIQKHRLVFIPSAELKSIPARGKLRPGSSALAFDYERVEPQPKKKPGTGD
NQTPSPSTPDGGGGQGGTGGL
>PG1691 conserved domain protein
MKDSFHFGKSESIISILLLLMVIVGLFIFKARPASGNAPVAEAVQTSADE
PGRKEEKKNYVRPYSSSEQSRKKSRDSIVTDADYVGPPVREAYARSADKF
PRGTVIDLNAADSATLTRIPGIGPTFARRIVSYRRQLGGYYTVLQLQEVY
GMDYERFCALRPWFKIGIKPDRIDLKGVVGDSILRHPYINYKQRAEIKRL
LRSSQWQGSWVQLLKLPTFTKEDSIRLSPYFDIH
>PG0426 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGLREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVEQA
VRMYNEARPHQALGAKTPMQVIAPESKNPLLARIEHWPEIAPELYRRMNV
RQRANFARVNRN
>PG0875 mobilizable transposon, tnpA protein
MTASKIPIKKVCEHCKQEFIALKTTTKYCSHRCNSRAYKAQKREERVRRT
ELSEQEKKVSDIIDKPYLSISEAGRLLGISRHTIYRYIYSGNLKAYNLSS
QRSIVKREDIERMLAERPYEKRQPRDAIVITELYTTDEVCDIHNISRSSL
FAIAKRENIPRTYNRGKTYWSKRHIDAYFAKQAPASHIEEWCTAVEIQER
FDMTLTAVYNFISDHNIPRKKVKGKSYYSTKHVEAAKGVLDKAAPSYYTV
REAMEKFGLTRDQLYQYTKRYNVPKLKQGRNVLISQQELDEVLKPPSITR
>PG2176 ISPg2, transposase
MYHSLFESLCMVPDPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYA
EEYEEELKSLYEMLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIE
GFISATSGKHICIDGKTMRGVKKLSFDTQSHVVSAFSPQDMCSLAQLYID
RKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQTAIAEQIIDKGGNYVLCVK
ANQSLSLQEIEAYFCPLFQKHILLDEQTELSHGRIETRRYESILNPLEIE
ANEVLTRWKGLRSIHKVVRKRRDKKSDKTSEEVAYYISSLTDVSSLKQAI
RGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT
NMKSSIPRIQKKLARMKPQQLITIQF
>PG1735 MutT/nudix family protein
MGGKINAEHPLAPFRYCPRCGNDGFAERNPKAKSCPKCGLIYYANPSAAT
ACFITDSAGRLLAVRRAKDPAKGTLDLPGGFMDMDETAEEGIIREIREET
GIEVEAVSYLFSLPNIYPYGGMRVHTADLFFAAQVSDFSSAIASDDAAEL
VILAPDDITPEDFGLESIHQAVGRWLARKKNQNR
>PG1496 hypothetical protein
MAYHKKETLRGNIEAIRTLLAVEKEHRLPTSVEREVLSRYNGFGGLACIL
KPVDTLADRTAASNCPLSPILSDVCLCKQQNIMFTKNVFYP
>PG0819 integrase
MKIEKFKVLLYLKKSRPDKSGKAPIMGRITVNRSMVQFSCKLSCTPDLWN
PRESRLNGKSNEAVEVNAKLDKLLLSIHAAFDTLVERKADFDAEAVKNLF
QGSLETQMTLLAMTDIVCEELRKRIGIDRAKGTYPAYFYTRRTLAEFIQK
KFHSKDIAFGQLTEQFIHDYQFFVVDDKGLTIETSRHYLAIIKKVCRKAY
KEGYADKCFFAHFSLPKQEEKTPKALSRESFEKIRDLVIPEHRSSHILAR
DLFLFACYTGTSYADAVSVTRDNLFTDDEGSLWLKYRRKKNELQACVKLL
PEALELIEKYNDDTRPTLFPMLYHPNLRRLMKCLAVLADIKEDLTYHAGR
HSFASLITLEAGVPIETICKMLGHSNLQTTQRYAKVTPKKLFEDMDKYIE
ATKDLKLIL
>PG1983 CRISPR-associated protein, TM1791 family
MNFGYWYYREYFNTIKLNSEGIVTNFSTFNKGKNDKLIKGATLPPCDKEN
DIKDVTGFELKTCYPGLLCGVGYHHEINKPADEKGKKVEGDKEDDAPEVY
NLGMYFDYTSGLPVIPGSSIKGMLRSAIEEWDFLADYELNNGVTREEIIE
KVFVGKEYSIYDRDIFLDAIPIRVDNTLFGEDYITHHPNPLQNPNPVRFL
RVEPGVTYQFRFILKDHGEKLTVDFRTKLFKAIICTFGLGAKTNVGYGQF
VEP
>PG2047 helicase, putative
MNTKEYVLDFINRTNCNLFLTGNAGTGKTTLLRHIVQHTFKNCIVAAPTG
IAALNAGGVTLHSLLQLPPGTFIPYGLSLESATGVNFLTPTSFWKQTRMH
ASKRKLLRNMELLIIDEVSMLRADTLDLIDFVLRRVRSNPRSFGGVQVLF
IGDLMQLPPVVKPHEWELMSAIYQGVFFFHSMVIRQHPPVFLELETIYRQ
TDVRFTSLLNNLRHNRLPADDLTYLKDHVNPTFDSTCHEGYITLTTHNHK
ADQINRRALEALPALPHSYQAIIKGDFPEHLYPLEPTMTLKEGARVMFVR
NDTRQPRRYYNGKIAVVHSLSPSSVVVRTDGGELIEVEPHEWTNVRYSVN
PETNAPEQEVIGSFRLFPLRAAWAITVHKSQGLTFEHAAIDLEGVFVPGQ
AYVALSRMTGPEGMILLSPPDLRGLETPQELVEYAQTKPDEKELRTSLQE
NSLIYWKQQSDEAFDWQSLLNIWHKHSLSYREESELSSKSHYSDRAAEQS
AKIDAITEVAGRFRRQLAGLWGQSPLGFGAVKERIDKAVDYFLPQLSAIA
GELSSTIQEVGMLKKAKQFAKELLELQDNLHTAVYRLLRLRAVFGALSVG
QSLSRQELRTEELMAEWVRSISPKDEALPTAAGNRKSKEKKAKEKKKTTH
ETTLELLLAGMSIEEVATERKLSPSTIEGHLALLIEKGRLAPDAYIDAGI
VSELHPYFTNPARIFELKSLYERLNGQYSYTLLRLYRAYFILMDAKEKGS
RAREKAIEQAL
>PG0019 ISPg4 transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTGWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG1906 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG1072 MutS family protein
MKLREQINSISGFRYVIDELCIHSSVGRRCLMEQEFLTEASDIEVLLSRV
EIAISYQADQRKQKGLDEIAHKLMQLRDIQGTIYSLSRHVVCTDIDFFEI
KFLAILSEDIRDLIRFYQLDDLSSPLPDLSHIVSVLDPEEKKIPHFYIYD
AYSETLRELRDRLKKETNEDARIEIRNESLQEEDIVRKRLSRELSPYAGG
LATALELLGAIDLLLAKVKLFIQLGWSKPGSGHSVTNYMGLVHPHVLSLL
GKKGEKFQPVDIALPSLPTLITGANMAGKSVLLQGVALAQILYQYGFYVP
AQKAEICPVEKVMLSLGDAQDIRQGLSSFGAEMMCLSSIADEARQGKQLL
VLVDEPARTTNPVEGQAIVSGLLAILSRYKIRSLVTTHYGSIDIPCRRLK
VRGFREDKVNLPLQVNSLSKCVDYTLEEVSENDVPHEAIRIAEILGVNEA
LMTECKQFLNNTK
>PG0194 ISPg3, transposase
MKTNIVDVFCIIDDFSKLFDEAIKKKTLEEADKKRRNRKFKMSDSEVMTI
LILFHLSRYRDLKAFYLQYITYSCRSEFPHLVSYNRFVELQSRVGFKLIA
FLNMCCLGQCTGISFIDSTPLKACHIKRAHGHRTMRGWAQKGKSTMGWFY
GFKLHIVINDRGEIINYQITPGNCDDREPLKDGTFTKNLFGKLIADRGYI
SQNLFDRLFVDDIHMITKIKKNMKNSLMHLYDKVLLRKRALFETVNDMLK
NVCQIEHTRHRSANNFVTNLISGIIAYNILPKKPELNIEIIRNPNFPISA
>PG0908 G/U mismatch-specific DNA glycosylase, putative
MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAK
LLGKPLPDDYAERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIR
SLIERNPRLHTIAFNGRKAEAMFRKHFPETLAIRCLLMPSTSPANAGKTL
DLLVKDWNRIFSL
>PG1480 conjugative transposon protein TraI
MKNKILILGMVLAVCCGSVHAQWVVTDPTNFAGNIANTVKEIATASKTVK
NTLNNFKEVEKVYHQGKKYYDALKTVNNLIGDAYKVKEIILMVGDITDIY
VTGYRSMLKDPNYSPEELSAMASGYAKLLEMSGESLKELKTLLKNNALSM
NDKERMELINRIYDEVREYRAVTSYFTQKNISVSFVRAAEKGELERVNSL
YGSGSSRYW
>PG1852 exonuclease
MKLNLKNPLIFFDLETTGVDLVRDRIVEISILKVMPDGSEECKTRRINPE
RPIPPESTAIHGIRDEDVKDCPPFRSVAKSLAQWIEGCDLAGFNSTRFDV
PMLVEEFLRAGVDIDLRHRKLIDVQTIFHKMEPRTLEAATRFYCNRTLEN
AHSAEADTRATYDVFKAQLDRYEGTLENDMAFLADFSRQSRNVDFAGRLV
YDDNDNVIINFGKYRGRKALDVLRTDSGYYGWIMDADFTLNTKQEFTRLR
MSLNNPETK
>PG0184 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG0915 conserved hypothetical protein
MKNNMKNNILLNKFLLYKSEGLAYGITEMFSRNLAELLSDKIMIVYPDFD
RDSFIQSIENHVVGKTYTRRIPIFASLLKEHLPEDYERALSVLTGIWGEE
NPNETGMFNHSIGSCLSGNLPKIIVHIVSL
>PG2128 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTTNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVDRA
VRMYNEARPHQALRAKTPMQVITPESENPLLARIEHWPEIAPELYRRMNV
RQKANFARVNRN
>PG1697 type II restriction endonuclease, putative
MATFESSLKPRLIYVFAIADARHEGSLKIGETTLNDDVGSASTEPNSEVL
NKAAKARIDQYTKTAGIGYELLYTELTIYFSGGRVCSFNDKQVHSVLERS
GVKRKSFAGATEWYSCDLATVKRAISAIKEGKDSLGASEVTLSDNPIILR
PEQKEAIERTLKQFRKGNKMLWNAKMRFGKTLCALRVAKEMEAVRTIIIT
HRPVVDASWFEDFGKTFYDRPEWHYGSRSKGESFASLEKLASQGKKCVYF
ASMQDMRGSKDVGGKFDKNNEVFSTSWDLVIVDEAHEGTQTELGKAVLGQ
LMGKDTKALHLSGTPYNLFDQHKEEEVFTWDYVMEQQAKIDWEINHLGDT
NPYASLPAIHIYTYDLGRLISEYSDEEKAFNFREFFRTREDGSFVHEGDI
DRFLSLLCREDEEALYPYSNEHFRQIFRHTLWILPGVRAAKALSKKLSKH
PIFGLFKVANVAGDGDDEEEESRDALELVNQAIGKDPDETYTITLSCGRL
TTGVSIKPWTGVFMMAGSYSTSASGYMQTIFRVQTPYTHNGYMKTECYAF
DFAPDRTLRVLAEAAKVSYKAGKQSESDRKLLGDFLNFCPIISIEGSRMK
PYDVNTMLGQLKRAQIEKVVQDGFENGALYNDELLKLTEVELHDFDELKG
IIGKTKAMAKSGDIDINHQGLTHEQYEEKERLEKRKKKGLTPEEKKRLDE
LKAKGDQRREAISILRGISIRMPLMLYGAEMKDEDKELTIDNFANLVDEQ
SWQEFMPRGVTKAVFRRFKRYYDPDIFREAGKRIREMARMADKFTIEERI
GRIASIFSTFRNPDKETVLTPWRVVNMHLGDSIGGYCFMSEDFSTPIALP
RYIEHQGITNEVFHPKSIILEINSKSGLYPLYAAYNIYRTRVEEAREKYG
DVTRAFALQLWDRTLEENIFVVCKTPMARYITMRTLRGFRDVVTHTEHYP
DLIENITSQPDSVVNMLRSGKRFWKINNDENMKIDAIIGNPPYQVMNQGK
GNGSDPIYHKFFDLAMVLAPQGTLIHPARFLFNAGKTPKEWNEKMLSDKH
YRVVDYWPNSAEVFPTVDVKGGIATSYWNKKMILGPIGMFSAFDELHHIL
YKVEQTNPLPFSNLVAPRELYRITDELYQEHPDLNGRQSAGHKYSFGANI
FDVFPEVFFDEYPQGREEKMACIYGRANKQRCYKWCKRSYITHPENFAKY
KVIIPKTNGSGAIGEVLSTPIIGTPIMGYTDTFISIGAFDTRNEAEACLR
YVKTKFARTMLGILKATQDNPKETWRLVPLQDFTAESDIDWTQPVAEIDR
QLYRKYGLNESEIAFVEEKVRPMD
>PG0949 conserved hypothetical protein
MASYSDIVDSVRKCQFSPLYLLAGEEPFYIDELATLLETHVVPVDEWDFN
RVILYGDKTSVADIANEARRFPMMGRRQLIVVREAQLVDNIDLLEAHYGT
FPDTTILVIAYKKKPDKRKAFYTKAEKFGKVFVSETIPDYKMPDFILSAA
AGKKLSVSPEVAYMLADYLGNDLEKLMNELDKLILITQDSRGVVTSEIVE
QHIGVSKSFNNFELLRAIVNRETGKTFRIAHYFARNEKEHPIQATLPVLF
NYFSNLMIVCYLPQKKPDAIMKALSIRNFQVRDYMTGLKMYSTRKVFDII
HEIRMTDARSKGVDTTGTFAGSGDLLRELLHFIFH
>PG1388 hypothetical protein
MDPPVCSPWGRFGCYLLLFACPIIFASIMKKEFIGRTAKSLSLALAIFSG
CIFGSVLPAWSQEIVAGELERCFLAMPESVLPIVTMEERNDLCRRAGHLS
GFTHTASLESSLGGTVTFLLNRNFIRIQTSTVGEVFMRILPFSDSSSVIC
VVTTVLHPVADSRIDFYTTEWKPLKTDRFWQQPRIEDFFLPHTDRQSYAY
QAIYASLTPSYMQVSLSEESDTLSIRQTVTETLAEEEKPLAAIFLSPEPL
VYRWQSGRFVRQVR
>PG0368 DNA topoisomerase IV, B subunit, putative
MEQEKQLQTEYIESDIKTLEWDEHIRRRPGMYIGKLGDGTHADDGIYVLL
KEVLDNSVDEFVMGAGKSIDITIENGTVTVRDYGRGIPLGKLVDATCKMN
TGGKIDSKAFKKSVGLNGVGIKAVNALSTEFVVTVWRDGQTKRVRYSRAE
LLEETDPVPSGEPSGTEVRFTPDDSLFRNYRYQEEFVVSMIKNYTFLNTG
LSFFYNGKKFLSRKGLEDLLAENMTGDALYPIIHLSDTDIEIVVTHTQQY
GDEFYSFVNGQNTTQGGTHLSAFREAVARTIKEFYSKNFEYSDIRSGMAA
AISIKVEEPVFESQTKTKLGSRDMGPDGPTIQKFVGDFLKKELDDYLHIH
TETAEEMLRKIQQSERERKAMSGITKLARERAKKSNLHNKKLRDCTVHLN
DPKAKEPERTSIFITEGNSASGSITTCRDANYQAVFSLRGKPLNCFGLTK
KVVYENEEFNLLQAALNIEEGLDGLRYNRVIIATDADDDGMHIRLLIITF
FLQFFPDLVRKGHVYVLETPLFRVFLPPESKKQTFGATPKRGRKKQESDM
PKPVTDIYCYSEAEKQAALKQLGKKADVTRFKGLGEISPEQFKDLIDEEN
IRLEPVSLKREDKIKELLTFYMGKNTSERQEFIIDNLVVDEDVL
>PG2099 ATP-dependent RNA helicase, DEAD/DEAH box family
MRFDELNLGDEVLDGLDAMNFIETTPVQAATIPPILEGRDVIACAQTGTG
KTAAYLLPILDRLSAGEFASDVVNAVIMAPTRELAQQIDQQVEGFSYFMP
VSAVAIYGGTDGVAWEQQRRGMAMGADIVIATPGRLISHLNLGSADLSHV
SYFVLDEADRMLDMGFFDDIMQIYKQLPSSCQTVMFSATMPPKIRKLAAS
ILRDPIEVEIAISRPPESIMQSAYICHEAQKLPILRKLFEQSAPKRTIIF
ASAKLKVRELTSTLRKMGFNVADMHSDLEQSQREQVMRDFKNGYVDVLVA
TDIVARGIDIDNIRVVINYDIPHDPEDYVHRIGRTARGTNGEGLAITFVS
EEEQSDFHKIETFLGKSVYKLPVDLEFGEVPAYEPAKRRPRRLGRSGEAR
SGRSETNKRQGKTASNGRRRGGTRQRR
>PG1362 conserved hypothetical protein
MNNDLQFTMVAKTLYGLEDVLAAELTALGAEEVTVGRRMVSFRGDKRMLY
LANLRLRTALRILKPVITFHAKTTDEIYERLRLFDWTTVISSDQTFSIDS
VVYSDSFKNSQYISYRTKDALVDFFSDREGKRPSVRLSNPDILLNIHVSH
EEVTLSLDSSGDSLHKRGYRVAETTAPLNEVLAAGILLKAGWDGSTDLID
PMCGSGTFLIEAALIACNIAPGIYRRGFAFQRWADFDLDLYDELFHDDSA
ERVFDHIIYGSDILPQAVAAARSNVERAGLGRYISLSVLPMQQRPKPESK
AMLVMNPPYGERIKVEDMQQLYTMIGERLKHNYAGCSAWILAFKPEHFNH
IGLRQSHREKLMNGALECELRGYELFEGRRDSFAERKSRRAEGEQGVGRR
IDRRDVSAGREKRSNSMDRENKSPYRSPRPDKPFRTSDKRKKEHNDEQQR
EARWPNDRFRSSDESERGPRKSSSKRIQVIRNDE
>PG0865 ISPg2, transposase
MYHSLFESLCMVPDPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYA
EEYEEELKSLYEMLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIE
GFISATSGKHICIDGKTMRGVKKLSFDTQSHVVSAFSPQDMCSLAQLYID
RKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQTAIAEQIIDKGGNYVLCVK
ANQSLSLQEIEAYFCPLFQKHILLDEQTELSHGRIETRRYESILNPLEIE
ANEVLTRWKGLRSIHKVVRKRRDKKSDKTSEEVAYYISSLTDVSSLKQAI
RGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT
NMKSSIPRIQKKLARMKPQQLITIQF
>PG1982 CRISPR-associated protein Cas1
MPLSTDSLPIFLSDFTAYHFSVRFRAERAIAFERKWYFMPRFALGNALKN
SEQYAYLYGQIFKPQEEDTDESKGPGNTSRLIIRADKPSRKSLEAGEAMD
LYITVVTRDPLLVGDFLSFLPEWQAYNFFRENDLTYDSYRLYNPTTQKYE
SGLRVEDAALTVDFFSRQAIRWGEILSVRFLSPASIKVDQILSAEIPYSR
LMNRLSRRLYELYTQYLSRGETSVERYIFPDHDGLIYSQISMPRKATIKE
NRQYDMSGILGQLFYRVPYDPVAALMLSMAHWVHIGNHTIVGNGQIESTP
GNDTLYRKWLSSLAADRDLPMAEAERQDLLEALRICSYIPQPYHSVNIPK
GDGSYRQLHIPSAVDLHLQRSLAGILYPITESLSIAQSYAYRKGKGAVAA
VRRVQHLLDSLDENHTVVRCDIDNFFDSIPVPSLLQKVQRTTEDPFLTRM
LSLWMKSGVVDRKQQYARASSGIPQGSPLAPLLSNLYLEDTDRYIAGHIT
TEFIRYADDLLLFLPEKVDPLNALQDLSEHLKYRKGLKLNRDFVVSSIKS
SFSFLGITFCADGSRSMSRDKKEGLKRKITLALHRDTENFSALSETIHGM
EQYYRKLLEKVDIEAIDEVAATVYATHIASLPTSEARKSAKDNLLRLGFL
SSETAKQTLREAMRQTVVSSADNFPIKKESEILREQQKRQLQERGEIFDL
VVTEPGAFIGISRNHVLVRKYGKTICKQPAAQIEQISIISDGVSLSSNVT
KYCRKKNIRVIFYNATGQAYASLNGMNTILPSVMEAQMRLSEEKKREFIL
TLIKNKVRNQGKLLRYYHKYYRHDKELKEPLSNAIAELKQLEGIPIAEGS
SLADFRQHAMLHEARCAQVYWRAFALLVHRSGHEFEGREHKGAEGLVNQM
LNYGYAILRSYVMKTIVLWQLNPNIGILHSTQDNKPALCFDLMEQYRAFV
VDRSILALLAKGEDVGQNSKGLLDMPTRSRIISKINERWFATEYYRSGEK
LFSDIMKLQTKDVSAFCCGKVKRIKFYTPKW
>PG0460 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVERLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGTITIEVADDREDNRSEAEKEAEEDYQKQVVRRRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIPLLQG
ANIPQGTAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQG
NKAISPIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSSLG
>PG1316 hypothetical protein
MGTNGFQRMIERHSHHHLSGMKRIFSLTIILAALLPTLSQAAMPPYSVAT
KFQESEQQGRKREFMNQRNAFFILELELTQEETDAFLPLYNELDTKRYEL
WKDVRQKRALMERKAELTDADMELIINKSLDNKIAEARLEKEYYYKFKRV
LPMRKVMALKVAERKFARLFLKSSYD
>PG0172 exonuclease
MTLWMAVALLVLIAGMAFMTLWKPKDPWQADNLPPEGSSDQFRRKGIIDA
DASSLLVPVPAAVDSLPCFLVIDTETTGLPADETQFISGVDAPAPVSVGW
QLLDFRFRCIEEVVCRLSTDEPVSPEAAALHGITDASLHGDDPHEVYARL
LGAVSKAKVLSAHNLAFHRSVLVYDMRRRGFDPSPLFSLDDYCTMEAGVD
FTHLYGSCGTWKLPRLTELFGVLYFGLPGVRTTYREKVRNDIRLVAACLH
RMNPTTPSLE
>PG1205 DNA-binding protein, histone-like family
MASKPLQFVVVERKLNVGKNAGKVMQIARPTGRHRVDFRSFCERVSKSTT
FNRQEVEAVLNYATEIAKDIVSNGDIVEFGDLGTLMPSFKSKAVEQGVKF
NANVHIEKPVVLFQPSKKYFTLTDVSYEQTTARPKKGTKPAPKPDTGSGG
NSGEGI
>PG0874 mobilizable transposon, int protein
MRHTCPKVTLRQRAIRNGRISLYLDYYPAVRNPETMQMSRREYLGIYIYA
HPKNEMEREFNMDMLNKAEAIRCIRVQSLINEEFGFLDRTKMKTDFLAYF
LKMCRKKDQKWRIVYQHFYNYVQGHCTFGDITIELCQGFREYLLNAKQLK
RKGKVSTNSASGYYSTFRGLLKIAYRDKWLRENINDFLDKIEAKEVKKEY
LTLDEVKILAQTPCEHDVLKRASLFSCLTGLRISDILNLRWEDFTLAPDQ
GYCIRIRTQKTSTEATLPISYEAYELCGEPSSGKVFKGLQRSMINYPLKK
WIKQAGIMKHITFHCFRHSYAVIQISLGTDIYTVSKMLTHKNVSTTQIYA
DLVNSKKRETAEKISLK
>PG1350 ISPg2, transposase
MYHSLFESLCMVPDPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYA
EEYEEELKSLYEMLTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIE
GFISATSGKHICIDGKTMRGVKKLSFDTQSHVVSAFSPQDMCSLAQLYID
RKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQTAIAEQIIDKGGNYVLCVK
ANQSLSLQEIEAYFCPLFQKHILLDEQTELSHGRIETRRYESILNPLEIE
ANEVLTRWKGLRSIHKVVRKRRDKKSDKTSEEVAYYISSLTDVSSLKQAI
RGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT
NMKSSIPRIQKKLARMKPQQLITIQF
>PG1389 DNA-binding protein, histone-like family
MIQYTVKERRMKVGKHAGKTMYYAEAQKSKVIDFEEVIRDVAEMSSLTTG
DVRNAVDRLAYYLKRELTEGNTVRLGQIGTFRLYAPGRFMEHPEEVNATT
IKGAKIQFIQNRHLREASSLIKVAVDNPYLTKKKDLTATGTESAEGDGSS
GL
>PG2040 DNA-binding protein, histone-like family
MAVSFKVKERKVMINGQPAKIRYAQTLKTGDMDLSEICDLTSKISAVSEG
DVRSIINTVTGLVITGLKQGRAISLGELGRFRISLSSKAAKEGEEFTVEN
IRRARVLFLPGGDIRRACRQIRLKGINMLRPGEQQGGTPSVPDSPEGGGE
QGSGGGGL
>PG1230 hypothetical protein
MKKRKQLSLALAWSLALLPVGGYTAFAQVNTTAQTVKPQNINPMQKRMSS
FRQEMLSELTDLLVQRYGKEVHFRADRGTSQAAALWRAEDGDVEAFRTFV
LENFEPTIDGQRRMYRTLERNLEILNGYFNKIDLALKEPLHLKGPEISSL
DMIFGGYSASAHLSDDLYANKAGLIAALNFPYYSLAEKMELGAKWSREEW
AFARMGDRFASRIPAEVQQSVSETLTRGDAYISEYNICMGYLQRPDGSTL
FPRNMKLITHWGLRDEIKSNYADTKGGLEKQRMIYAIIKRIIDQSIPREV
IDKTDFQWDPLSNMLYDPAGTAIRGHAEPDTRYEYFLANFRAMQAQDAYM
PHQPNQILRAFEGEMEIPQEDVKQLFETLLSSPQVKKVAQLISKRLGRPL
EPFDIWYNGFRSQSGMPESELDKITAAKYPTPEALKADLPNILRKLGFES
KDAERIASLVMVDPSRGAGHAAGSMMRNDFARLRTRIADTGMDYKGYNIA
VHEFGHNTEQTITMNDVDYYMLNGVPNTSFTEAVAFLFQKRDLELLGVSK
PDAQKEYNEALTNFWNCYEIMGVSLVDMAVWEWMYTHPEADAHELKLAVM
DAAKTVWNKYYAGILGEKDETILGIYSHMINYPLYLPNYPMGHIIDFQIE
QYIKGKSLAVELKRMLVQGRLVPQYWMKQAVGSPISVLPILEAVDEALTK
VK
>PG0086 ATP-dependent RNA helicase, DEAD/DEAH box family
MKTFEELGIAKPFLKAIAELGFEQPMPVQEEVIPYLLGEDIDLIALAQTG
TGKTAAFGLPLLQKIDLSGGRPQALVLCPTRELCLQIADDLNSYSKYLSD
VHILPVYGGTPIDAQIRALRRGVQIVVATPGRLLDLMRREAVSLSDIHTV
VLDEADEMLNMGFAEDLEQILADIPSERHLLLFSATMPKEISKIATSYMK
DPKEIVIGNKNEGNANIKHLYYMVSARHKYLALKRIADYYPNIYGIVFCR
TRKETQEIADQLIQDGYNADSLHGDLSQAQRDYVMQKFRVRNLQLLVATD
VAARGLDVNDLTHVIHFGLPDDVESYTHRSGRTARAGKTGLSIAICHVKE
KGKIKNIERTMQKQIERARIPSGAEICEKQLFNLADRIERVDIENTTAID
SVLNEVNKKLEWIEKDELIRRVMALEFNRMLDYYQRAEDVEEVVERKKDE
VSGDRKNRKRGAGEAEEGMTRLFINFGKMDQMFPNRLIELINRCIPGRVN
IGRIDLMPRFAFFDVDEFEAKNVVETLNRYEVDGRRIHVEYADTKKDYAG
GKKGGDKNFSARRKDFDGRRSENPTSRKNKGRKEKPTEGRFYDKFDSKQR
KRRK
>PG1000 hypothetical protein
METDNVLKRILEIEHGFVHILDAAKEILSSSSEERSFAIASEFFDHEAYQ
PRMLATAILGHLAGTNSEALLFLKDRVSLDMNWRVQEMLAKAFDTFCKEQ
GYEKSLPVIREWLDSDNANVVRAVTEGLRIWTSRPFFKENPCLAIEWLSR
HRGHESEYVRKSVGNALKDISKKHKESVAEELKKWDLSDPLVLFTYKHAS
KHPDKDIKKQ
>PG1500 conserved domain protein
MERFNEVERGREISGRMLKGLEIRKKNLEAKLTRIADDIAERKDDTIDFK
RMGIDHLFVDESHKFKNLTFTTRHDRIAGLGNSEGSQRALNLLFAIRTIQ
ERTGKDLGATFLSGTTISNSLTELYLLFKYLRPQALEKQGIGTFDAWAAV
FARKSTDYEFSVTNEIIQKERFRTFIKVPELGAFYAEICDFRTAKDIGID
RPEKHEILHHIPPTPEQEAFIQKLMEFARTGNAELLGREKLSDREEKAKM
LIATDYARKMSLDMRMISPSYEDHIDNKASHCARLIHDYYRKYEREKGTQ
FVFSDLGTYKPDEWNVYSEIKRKLTDDYGIPASEIRFIQECKTEAAKKAM
IAGMNAGSIRVLFGSTEMLGTGVNAQQRAVAVHHLDTPWVPSALEQRDGR
AIRKGNEVARFHAGNKVDVIIYAVEKSLDSYKFNLLHNKQLFINQLKTNS
LGSRSIDEGGMDEATGMNFSEYVAVLSGNTDLLEKARLDKKVMAMESERK
NFLYERDTARSQLAKLQGAVEFHEKKIAEAGKDRTVFEERVKKNEDGTYQ
NPIRIDGVEDGRDIKTIARRLKEYEEKARTGGEHMPIGELYGFQLSVKSE
ASMKESFDFVDNRFFVKGCGSIYYTYNNGHLAEDPKLACMNFLNALEKIP
RVVESHRKELAATQAKIPTFETMVSAVWKKEEELQELKRQAAELDRKIAL
SLKKEDNSEVKPEETVAPDATIAVDAPLREELKRERSSERNHTDWKEIRE
DEKIKPCIKPGRW
>PG0861 helicase, SNF2/RAD54 family
MLLRNTLIDNSSPELSMSSCLKQCFRLDWVNRVRIATGYWDVPGMALVIK
ELSAFLEREGTMLQILIGKDPYVYSSLLKNPKYQDASFPHDFIRTDIHNL
ELHEEYMQVIKLLLKHCESSKIQIRIYLRNAQGEIEFLHSKCYICSGVDD
SLGIIGSSNFTSRGLIGNAELNYLESDSRVVTAKPQKGSAAKGHSHWFDE
KWTIAEDWSQEFLEQVIKTAPIAQETMKSAKQEMQEQSLSPYELYIKLLQ
YKFSSLVDKDLNEILTGYLPSTFDAFEYQLDAVKQCYSIMQEHGGFMLSD
VVGLGKTVVGTLLVKHFLSMPEDDGRNKRVLIITPPAILETWRETIDLFD
VDKPESIAPSIDFVTMGSIGKLVDDIEDEDELNLEELDSGEFIEPLPCAN
YGLILIDESHRLRNNHTQMYQSLDTLIEQIVLREGAYPYIGLLSATPQNN
RPDDLKHQIYLFQRNHTDSTLRKANGGNLESFFADIAREYQSVIYSRYDE
GPTAEQIQQSRKLLNDLSSRVRDCVLQDILVRRTRTDIRKYYPETKLTFP
EISGPHSFEYRMSKSLAKLFARTMDCIAPKENFQFDSSTSLCYYRYRAIE
YLRDESTRKLYSGRNMDAERFSHQLARIMQIGLVKRIESSFSAFKVSLKN
LKRYTQNMVDMWEHDTIFICPQIDVNAELNPRKHWDSTRKLFSFEECAED
LRKKINKLNSSGSNEKGRNREYRREDFAPEYIDLLRQDLALIDQLDEQWS
IYSDDPKLDDFKRQLLPSLLSLERNPEQKLVIFTEAIDTVRAIERAIESV
DDRLSVLSITAKNRREREEDIRANFDANYKGEQRDDYQIIITTEVLAEGI
NLHRANSILNYDTPWNATRLMQRIGRVNRIGSQAPCVYVYNFMPSAEGDA
EINLVKKAYTKLQSFHTLFGEDSQIFSEEEEVVHHELKTQIEGAESALEQ
YLYELKTYKAQHPERYDYIARQSEGLELAVSETEGQALFVVRSPKCPAMF
VRYDALEDKCSMLSAPQMYEAFRSATFGAQASFALPKDWQARRDAAVLAV
NQALVKRNLNMKRSARATEAKAIIDQMKEMPMEAHSRKLLASARKLVDKG
NPDIIKRIIGIGHLLKEREGSLLPITQDEIDTIIHKEIEILVSGIAKKFG
QAEVFIGLSL
>PG1113 integrase
MARSTFKVLFYVNGSKEKEGIVPIMGRVTINGTVAQFSCKRTIPKELWDV
KGNRAKGKSREAIATNLSLDNIKAQIIKHYQRLSDREAFVTAEMVRNAYQ
GLGSEYDTLLKAFDRDCASLLKRVGKDRSMGTYKVMLRARNNTARFIRHK
YNRSDMSMLELTPDFIRDFAVYLSTVKGNRNATIWINCMWLKGVVMRAHF
NGKIPRNPFAQFHVSPNTKERAFLTEDELKTLMSHEFKDSHSAFVRDLFV
FASFTALSFVDLKELTIDEIVEVNGEKWILAKRHKTQVPYQVKLLDVPLQ
IIDRYRPFQKDNSIFGDINYWTVCKKLKKVISECGITKDISFHCARHGFA
TLALSKGMPIESVSRVLGHTNIVTTQLYAKITTEKLDTDLSMLGSKLNAS
FGYIKMA
>PG0851 conserved hypothetical protein
MEHLEKYKVLNLSLPLSGAKSEKDISDFFALGNGANDLKDLLAKMFSDMY
SQTMMMLRSCEIDYDNPPDASKSVVAVNGVPLGTQDNLFCITGGEGTGKS
NYISAILSGTLREERLSAEQTLGLEITANPNGLAVLHYDTEQSEAQLHKN
LGRTLRRASLTAVPEFYHSLYLASLSRKDRLKLIRESMDLFHHKHGGIHL
MIIDGIADLIRSANDESESIAIVDEMYRLAGIYNTCIICVLHFVPNGIKL
RGHIGSELQRKAAGILSIETDDNPEYSVVKVIKVRDGSPLDVPMMLFGWD
KEADMHVYRGEKSKEDKEKRKTDELIAVVKEAFRNKITLSYQELCEVLMR
EMEIKDRTAKKYIAYMKEQRILAQDSNGNYQKGELCHT
>PG0549 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVERLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGTITIEVADDREDNRSEAEKEAEEDYQKQVVRRRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIPLLQG
ANIPQGTAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQG
NKAISPIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSSLG
>PG1103 ATPase, AAA family
MKQIPLAERMRPKTLADYVGQQHLIGSGAVLRQMIEQGQTPSMILWGPPG
VGKTTLAEIIAHEVDAPFYTLSAVSSGVKEVREVIADIESNRGNLFDKGG
RAILFIDEIHRFSKSQQDSLLAAVERGIVTLIGATTENPSFEVIRPLLSR
CQVYVLKPQSDEDLLLLAHRAIDRDELLAAKHPVLEETEALLLYAGGDAR
KLLNILDLLATSEVEDRLVITNEKIRQRLQENPAAFDKGGELHYDIASAF
IKSIRGSDPDAAIYWLARLIDGGEEPSFIARRLIISASEDIGLANPNAML
IAMACAEALDRIGWPEGRIPLAEATIYLATSEKSNSAYLAIDSALEYVRQ
SGNLPVPLHLRNAPTRLMADLDYGKGYKYSHDFPEHFVSQQYLPDKAANT
SFWTPQMHTVHEAKLGERMKRWWKREKSQKDTAE
>PG0461 ISPg7, transposase
MIKKPHHTPSLFSSLSDMLNQSHPLYKLADKIDWEKFDTAFRPLYCQHNG
RPSKPIRLMCGLLILKHLRNLSDESVVEQWSENAYYQYFCGMQEFAPGAP
CASSELVHFRHRIGEKGIELIFQESIRVNNEDDDEHHHDTAFIDSTVQEK
NITFPTDAKLHKKIVRKILDIVHKLNLPLRQSYTFVLKRIYRDQRFRHHP
KNRKKALKADNKLRTIAGRLVRELKRNLGDNSLYAELIERFEAILSQRRN
SPQKIYSIHEPEVQCISKGKEHKKYEFGNKVSVIRSATGIILEARSFRNE
YDGHTIEASPEQVERLTHRKIKIPAGDRGYRGRKEVNGTRILIPDTPKQS
DSRHQRCKKHKLFCKRAGIEPTIGHLKSDHRLGCNFYKGLAGDAINILLA
AAAYNFKRAMKALWDFIKIISQMPFANGFSLKEVF
>PG0574 hypothetical protein
MADTAKNNISEEEDILCEKEGSSPEWETSTEEEAFLHNSYSAASKRKNSF
WNVMGGSFLDHPWIASNWKLGLVIVVMSVINIWNGYQAIEQVREIGRLEE
QVKDYRYRALFKASEVTAMSQKLNVEKAIQSQNLELTLSQTPPYILYRPV
DTDRRKK
>PG0816 hypothetical protein
MKSTEKKELSHFRLKLETYLNEHFPEMSGNNPFITARSDEALTAYCDAVA
QGFSHPEAESMASEVLYQGLHFSRYDTLVSVLEREFEQELPSPLPERLAP
ILLKNKAIQSVFAKYDLTDDFEASPEYEHLYTELTGTIVLLIESNHLPTI
GGGNDTV
>PG2202 conserved hypothetical protein TIGR00250
MGRILAIDYGRKRTGLAVTDPLKIIPGGLTTVPTHTLLDFLRDYVSREPV
ERFVLGLPRRMNYEESESMTYIRPFAVKLAQAFPSIPITYVDERFTSRMA
QRTILEAGIGKMKRRDKALVDEVSAVIILQSYLDNPDR
>PG0177 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTSWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG1300 conserved hypothetical protein
MRVIRGKYGHRRFDVPKSFNARPTTDFAKENLFNILSNRFDFEGLSAIDL
FSGTGSIALELVSRGCSSVTSIEKRREHAAFIRNLIKHLNEENCWRVFET
DVFLFLERNKVCHRYDLVFADPPYALTELEQLPTKVLESNILAEDGLFIL
EHPKDFSFTEHPRFEEHRAYGSVNFTFFR
>PG1673 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTGWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG1038 ATP-dependent DNA helicase UvrD/PcrA/Rep Family
MSEDYLSSLNDSQKAAVLYNDGPALVIAGAGSGKTRVLVYKLLHLIHSGY
DPARLMALTFTNKAAKEMKERVASEIGPAAYRIQMGTFHSVFSRILRENA
THLGYTRDFSIYDTNDTKSLLRHVMKRMNIDDKVYRLNAVQHRISMAKNQ
LISPESYAANKDLSRYDIDCRMPRMAEIYSLYTILCKQNNAMDFDDLLFK
INVLFRDFPDVLQTYRDRIDYLLIDEYQDTNFAQYLIARQLMGEKGKVFV
VGDDAQSIYSFRGAKVENILGFSKSFPGSKLFKLEENYRSTQSIVNAANS
LIAHNEGRIPKQVFSNKQVGERIRLTGCLSGYLEAYTVADSIVERRMQEH
CPYSDFAILYRTNAQSRVFEEALRKHNIPFRIYGGLSFYSRKEIKDVLAY
FRLIVNPNDDEALRRVINYPKRGIGDTTLSRLNEIATASSRSLWDVLSES
QKGLPDLSATARRRLGEFVSLIRELQESEYESLYEQAADVVKRSGISAEI
FSDKSIEGISRQENLKELLNGIEEYGTSYAEERGEKPSLGTFLNEVVLLT
DQDTEGVVGDYVTLMTVHAAKGLEFKHIFIVGMEENLFPSMMNATEQGLE
EERRLFYVAITRAKESCHISFAAERSRNGRTERSRPSRFLQELDDAYVER
RVPQEMLGGHSQGDELPIHFSRSDAFERLPQPEPIRRRLVRVGSSPVHEE
RMVHQQIGDLCVGDTVAHARFGIGIIESLEGEGDNAKAEVSFQQVGRKRL
LLRFAKLSKVEKDSI
>PG1622 DNA topoisomerase IV, A subunit, putative
MTDSFDNEQPDIESSVSDLLPEAGNRLPVDHPAMATHHLGGMYRNWYLDY
ASYVILERAVPHIEDGLKPVQRRILYTMHHWFDNGRMNKVAKVTGQTMAL
HPHGDASINDALVQLGQKGYLIETQGNWGNILTGDEAAAGRYIEAKLSAL
AQETLFNDKITHWKRSYDGSEDEPVALPVKFPLLLAQGTEGIAVGLNSKV
LPHNLGELLMACIAHLRGESFDLYPDFPTGGMMDASRYNDGRRGGQIKSR
AKIRKLDNRTLSITELPCGKTTSSLIESILKANEKGKIKIKRIDDMTAAE
ADIRIHLAAGVSSDKTIDALYAFTDCEVSQSPNACVIMDDKPVFLGVTDL
LRYSAERTRELLKQELTIRLEEKREQYLAASLERIFIEERIYKDREFEEA
ESEQDALMHVEARLEPYAHRFIRPITYDDLKKLLEIRMARILRFNLPKHE
AMMLQLEKDIAELEKNIREITAYTIRWFEYLHEHYASRFPRRTQLIGFST
IQATKVAEANAKLYINREEGFIGMGLKGDEFVCNCSDLDDVIIFFRDGTY
LITKVEEKKFIGNREVIYIDRFERNDKRTIYNVIYRDGKSAPFYIKRFSV
TGVTRDKEYNLTQETAGSRVMYFTANKNGEAETVRIILKPKARQRVLSFE
KDFSNVAIKGRSSKGNLLTKAEVHKILFKQQGASTLGGRKVWFDRDVMRL
NYDEQGEYLGEFQANDSMLVILDNGDCYTTSIAENNHFDPNMIRIEKYRP
EKIWTAIYHNREAGFPYLKRFKIENGSRDNMLGGEAEGNSLLLLSDTRYP
RFALTFAGIDATRPELIVEGEDFVAVKGIHAKGKRLTTYKLDGAKELSPV
RDEEPDEETTDPDESFAEVAPEAIASDEDIRDELLGIQRLQFDDDENE
>PG0798 ISPg3, transposase
MKTNIVDVFCIIDDFSKLFDETIKKKTLEEADKKRRNRKFKMSDSEVMTI
LILFHLSRYRDLKAFYLQYITHSCRSEFPHLVSYNRFVELQSRVGFKLIA
FLNMCCLGQCTGISFIDSTPLKACHIKRAHGHRTMRGWAQKGKSTMGWFY
GFKLHIVINDRGEIINYQITPGNCDDREPLKDGTFTKNLFGKLIADRGYI
SQNLFDRLFVDDIHMITKIKKNMKNSLMHLYDKVLLRKRALIETVNDMLK
NVCQIEHTRHRSANNFVTNLISGIIAYNILPKKPELNIEIIRNPNFPISA
>PG0566 DNA-binding protein, histone-like family
MLFFKRRQSCIAHKATGKKLWYPQTVINGSIASTLHIAEQISELSGASPG
DVFGILRDLGIVMRRELASGKKIKLDGIGCFRLIAQAKGSGVEKKEDVKA
SQFNSVRVNFRAECRYNTVTRERDCTLIAPDLKFAEYGKPLPAGASANAG
DSNSQTGGGDQGSGGGGL
>PG1031 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG1420 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPQTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYASKSYTDLLH
GRGCRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVDRA
VRMYNEARPHQALGAKTPMQVIAPESKNPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG1320 ISPg1, transposase, internal deletion
MACQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLPKTWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFCSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVEIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSSVG
>PG0970 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTSWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG1454 integrase
MKIEKFKVLLYLKKSRPDKSGKAPIMGRITVNRSMVQFSCKLSCTPDLWN
PRESRLNGKSNEAVEVNAKLDKLLLSIHAAFDTLVERKADFDAEAVKNLF
QGSLETQMTLLAMTDIVCEELRKRIGIDRAKGTYPAYFYTRRTLAEFIQK
KFHSKDIAFGQLTEQFIHDYQFFVVDDKGLTIETSRHYLAIIKKVCRKAY
KEGYADKCFFAHFSLPKQEEKTPKALSRESFEKIRDLVIPEHRSSHILAR
DLFLFACYTGTSYADAVSVTRDNLFTDDEGSLWLKYRRKKNELQACVKLL
PEALELIEKYNDDTRPTLFPMLYHPNLRRLMKCLAVLADIKEDLTYHAGR
HSFASLITLEAGVPIETICKMLGHSNLQTTQRYAKVTPKKLFEDMDKYIE
ATKDLKLIL
>PG1262 ISPg3, transposase
MKTNIVDVFCIIDDFSKLFDETIKKKTLEEADKKCRNRKFKMSDSEVMTI
LILFHLSRYRDLKAFYLQYITHSCRSEFPHLVSYNRFVELQSRGGGKLIA
FLNMCCLGQCTGISFIDSTPLKVCHIKRAHGHRTMRGWAQKGKSTMGWFY
GFKLHIVINDRGEIINYQITPGNCDDREPLKDGTFTKNLFGKLIADRGYI
SQNLFDRLFVDDIHMITKIKKNMKNSLMHLYDKVLLRKRALIETVNDILK
NLCRIEHTRHRSVNNFVTNLISGIIAYNILPKKPELNIEIIRNPNFPISA
>PG0458 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVEQA
VRMYNEARPHQALGAKTPMQVITPESENPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG0825 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG0199 TatD family protein
MTPFRKKFSSLNSSYMVFLDIHTHSAARNDSEIIRVRNLRPNETISPNTF
CSIGIHPWSVPENYSGELLAVREGLNHDEVVALGECGLDKVCSTPYEKQR
KAFLDQIDLSEEFGMPVLLHIVRAWDDLLAIKKQVKPSQPWIVHGFRSSE
EQARQLIRAGLFLSFGCRFSPEALRLSYESGIALLETDENIKDIREWYYE
AAECIGCSLQELQRHTLALGAGLFPRLRLHKHFSAIR
>PG0225 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTSWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG0943 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPQTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYASKSYTDLLH
GRGCRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVDRA
VRMYNEARPHQALGAKTPMQVIAPESKNPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG0841 mobilizable transposon, excision protein, putative
MQLHDIKQVSIVDYLAQAGFEAKLIKGVNYWYCSPLRSELTPSFKVNAER
NQWYDFATGDHGDIIDLVCTLQHCSTAEAMKRLSALKGVRLAPSFSFGGI
TPVSLQRSSMELISVQAVKHPKLLLYLSERGLQPSDASPFLSEVYYKVSR
KCFFALGFPNDAGGWELRNPYFKGCFAPKAITTIKGTDSHKLLLFEGFMD
FLSWRKLHPEGQAESIILNSLTLLPKLMPTLHPYPMIESLLDNDEAGDRA
TKQLIDAGLPVKDMRACYAPYKDINEYLTLAEQRKKILTPHKRGLRR
>PG1494 conserved hypothetical protein
MDEAGRVKTVEPTEANQTAFMKFKKNDSVLKNFLSNLVKQFKAPTHFGVY
RLVSDRAVESVAELRNMLAGREEPRNKAMLESIRMNLDEFAPAQKAPAID
ESKVDWKELERLGLSRDRLEQSGDLNKLLNRQKTGLVGISVPFGETSIYT
EARIALRQSEDGSLGLAIHSIRKEPRLDFPYMGYRFSEDEKATLLHSGNL
GKQVELTPKNGEPFKGYVSIDPLTYELVALRADRVIIPQEIKSVLLTEQQ
HRDLTEGKPVKVEGMLSRGGKPFDATLQVNAEKRGIEFIFNDNLSFKERR
QLSQKPQADSPYGVPATLCKYQLTERQQKALSEGRTLYLKNMVDKEGEIF
SAYVRYDKEQQRPRFYRYNPDQKQEQVKAVAEGHKTQVAVNNEGKTNEAT
GRVKEPLKSGQVRPTEKQQAKQENEEQKQPRKRGRRM
>PG1658 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTSWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG1433 hydrolase
MSENKTYTEAMRRLEEIVRVIEHESPDVDELTKLAEEAIALIGFCREKLT
VADKQIEELMAKLSWSKNAML
>PG0459 ISPg5, transposase Orf1
MQRSLMGKHLSNFERLAILEDYLSGAQSQGAIGRKYGISRGLIPQWLRKF
GLEDKVHPVPMKASQSPQSELTLNEKEELEQLRKENRVLKSRLKREELGH
QAYKLLVELAEETYGIRIRKNSEAK
>PG0261 ISPg3, transposase
MKTNIVDVFCIIDDFSKLFDEAIKKKTLEEADKKRRNRKFKMSDSEVMTI
LILFHLSRYRDLKAFYLQYITYSCRSEFPHLVSYNRFVELQSRVGFKLIA
FLNMCCLGQCTGISFIDSTPLKACHIKRAHGHRTMRGWAQKGKSTMGWFY
GFKLHIVINDRGEIINYQITPGNCDDREPLKDGTFTKNLFGKLIADRGYI
SQNLFDRLFVDDIHMITKIKKNMKNSLMHLYDKVLLRKRALFETVNDMLK
NVCQIEHTRHRSVNNFVTNLISGIIAYNILPKKPELNIEIIRNPNFPISA
>PG0517 hypothetical protein
MFAPRTTNLKDLLLGLSPYKELDRTALLFELYNTYKEIWPQAEPFDQFIF
WGDIILKDFNEIDKHLVSAKALYSNLKDFREMENDFSFLSERQVAAIRSF
WESFSPASGQMENGCQQSFLDFWKLLSPLYIRFNQRLADQGNGYHGMILR
HTVDRLRQRETSVRELLSSADRGGNAHPDKYVFVGLFALSPAEEYILVRM
KAEGICDFCFDDDLSLLHSAGNLTGTILDHNKEIFGKQTPWQESSENGQT
SSLEDKPAPDIRIIRTASEIVQAKLLPQLIEELYPEGYSDKEGIETAIIL
PDTGMLMPVLNSLAETVGRINVTMGYPLSQSAISIFVEKWIRVQAEIRLI
QKVPHFRTDAVLDLLNSVLLTPLLTDHSRDLLSNREVTKQYYVPEERLWG
DELTDLLFSRPDSGQQLLDRLFLLLEKIAESMLFQPADSEYDQSEETNNM
ELEQIYHYRNMLNRLRGLVESYAMDMSVRSAALLLQGLVSGVSIPFEGEP
LVGLQIMGIDQTRSLDFKHLIILSVNEGKLPSRVYETTMIPYTLRRGYGL
PVNEVNEATQSYDFFRLIQRAESVTMLYDARSDQLGGGEESRYIRQMHFL
FDMPLKVQELHLTGSLPHSPAICVRKEGEVLTRLHRFLECDNEESLDPEE
RLSALSASSINTYVACPLRFYFENVRGIREENASDELMAANDFGTVVHRS
MELIYKPCCGGGIVSSDILSRWLDPKNATIARIVRQAYTEEYLRSANTQG
ALSGLNHLYCIMIEKYVRRILEHDKSLAPFRYIDSERRVKGSFSLSNGSR
VRLHGIIDRIDELNEECRRIVDYKSGDTTTELGSWDLMFQHPIAGKSKKH
PTAIAQTLLYALMAKMQMENEGEGSELPLCPTIYGFKELYKQKEDYTGVV
VLKEVGKAIEQITDFDQICEMFEERLRTCLDELFDPDIPFTETEDIRTCS
YCPFASLCGK
>PG1244 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVERLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGTITIEVADDREDNRSEAEKEAEEDYQKQVVRRRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIPLLQG
ANIPQGTAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQG
NKAISPIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSSLG
>PG1644 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLAQHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTTNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPQHNALAERMNNTLKNSWHISSSKQSFDQALLSVEQA
VRMYNEARPHQALRAKTPMQVIAPESKNPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG1303 helicase, putative
MDNYLAEQILKNLPFTPTQSQDSAIRSLAKYLFDREPYSVFLLRGYAGTG
KTQLIASVVQTILEQGADCELLAPTGRAAKVLTTYTRHQAYTIHRQIYQA
TAAGIEEGGAYRIRRSSGRSTVFIVDESSMIGNESVEPTPFGSGSLLNDL
LAYVNETDGCRLILAGDMAQLPPVGSVVSPALDAGVMETSYGLRIHECTL
TEVVRQQKESAILSLATSLRRLLSNGISEKIKLNIRDSGDVSAISGTELI
EALDASFRTVGMDETIIVSYSNKRALAYNLGIRSQVLYYEEELIRGDRLV
VTRNNYRYCDRRDKTDFVANGEIVEILRLGKRYELYGFRFADATISLVEQ
GREIEARLLLDGLTAETAGLTHAQRQKLYDAVAEDYNSMASIPARRKAIK
EDAFFSALEVKYAYAITCHKAQGGQWKHVYVDMGMLSYLPHDEQLCRWLY
TAVTRASERLFLVNTPKDMLP
>PG1453 integrase
MRSTFSLLLYINRNKVRVDGTTSVLCRISIDGKNTVITTGISCKPQQWNA
KNAETSDARTNNRLKKFRSDAERLYEDLLKRYGVVSAELLKNEIAGHVVV
PIHLLQMGERERERLAVRANEIGSNSTYRSSRYYQSYIREFLESKGMSDI
AFLDITEEFGREYKVYLKRYKNFGASQTNHCLCWLNRLVYLAVDHEIIRA
NPLEEVEYEKKPPAKRMHISKAELKQLLELKLPQNDPLKELARRTFIFSC
FTGLAYVDTQLLYPHHIGKTAEGRRYIRINRRKTKVESFIPLHPIAEQII
NLYNTTDDTQPVFPLPSRDMMWFEIHELGVIIGRKENLSYHQARHSFGSF
LISEGICTESIAKMMGHASITSTQNYAKISEKKISEDMDRLMERRKNNEY
>PG1061 ISPg6, transposase
MDVQVYFSKVEDPRVVGRCKHKLSDILVIALASYLCGGEDYESMHELCLE
RGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEG
KHIAIDGKRLKGSKKKTGSTHILSAWVDEVSLSLAQETVAEKRNELQAIP
EVLDSLDLSGAVISINAMGTQTNIAEQIIQSEADYILSLKSNQKHLYEDV
QDYFTGQYRCHRYETLEKDHGRIEKRTYTTLLASEVFEEGEYSQWQGLRS
LIQVEREISSLEGETRIDRQYYISSLPPEDCQLIGQYIRGHWGIENRLHW
HLDVTFREDTCRARKDYSATNLNTLRKFALAIVSGHKDKLSLRKRLFKAA
LNIDYLKKLLKI
>PG0041 ISPg5 transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVEQA
VRMYNEARPHQALGAKTPMQVIAPESKNPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG0591 ISPg5, transposase Orf2
MLGFSRQAFYKRHLNDLAQHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTSNMIHHSDRGIQYAGKSYTDLLH
GRGCRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVEQA
VRMYNEARPHQALGAKTPMQVIAPESKNPLLTRREHGPEIAPELYRRMNV
RQRANFARVNRN
>PG1624 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG1448 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG0008 ISPg5 transposase Orf2
MLGFSRQAFYKRHLNDLARHEEDVLCSSIIQYCWHLRQAEHLPQAGFREL
MVLCQQYFGPKFTLGRDRFCALLRRHGMMLRKRSVRPRTTNSRHRLYKYE
DLLNTEPKFVPQRPGELLVADITYVAYQDGFAYLSLLTDAYSRCIVGYCL
HPTLEVEGCLNALHQAFAFYDQHQIDTTNMIHHSDRGIQYAGKSYTDLLH
GRGWRISMTQTGDPLHNALAERMNNTLKNSWHISSSKQSFDQALLSVDRA
VRMYNEARPHQALRAKTPMQVITPESENPLLARIEHWPEIAPELYRRMNV
RQKANFARVNRN
>PG1533 Toprim domain protein
MNHIKEQILERTDGGLLVFNYYMPFPFKPKKKFQNPLYEDKRASCYIYKV
AKTGVYRMNDFGDPNYSGDCFWFVAAMYGMDVQKDFVHILKKIVRDLSLP
IAIPAHSKESFPTRNRKLPYLEQTLSEPKDASMRPYKITEKPYTRSELSF
WEQYGTGQNILDRYHVKSLQTFQSENAEGKSYCFTNTSKEPIFGFLRKDY
VKIYRPFSQCRFVYGGILPETYVFGMEQLPQRGDILFITGGEKDVLSLAS
HGFHAICFNSETGNIEESVIEMLARRFRHIFFLYDMDETGIKASTRWCER
FSHHKLQRIELPLSGNKQEKDISDYFKLGNSTEDFRKLISDHLEQLYTQT
IMLLSSCEINYNNPPDRSKTVISVNDVPLGTYDNLFCITGGEGTGKSNYI
AAIISGVLHTEKRDFPIDLLGLDVCPNMRSKAVLHYDTEQSEYQLHKNIG
KTLRRASLTSVPDFYHPIFLAALSRKERLQLIKDSIDLYHHRYGGIHLVV
IDGIADLIRSANDESESIAIVDELYRLAGIYNTCIICVLHFVPNGIKLRG
HIGSELQRKSAAILSIEKDENPALSVVKALKVRDGSPLDVPLMLFGWDRE
KEMHVYRGEKSPEDKEKRKLTELGQIAKEVFHAQEHLSYNELVERIMQTV
DVKDRTAKSYISYMRNNGIIEQQTNNLYNLKL
>PG0787 hypothetical protein
MSQIAGYFSSDSGVYPNPVKDVLNIKHEGDFGVRIFDFSGRLVLSMENTR
MIGVKVLTAGAYVIKLMTQGSTPAERFIKL
>PG2009 DNA repair protein RecO, putative
MIIVSRAIVLHNTAYNDSYSIAHLFSRESGRVSYLIPRSSKRGKSGGSLR
LLISPLNELEITAEHKQHRDLHFIKEAKLCSLHGRIQSDPVRNSIALFLA
EFLYLILRLPEADTNLYDFVAFSIDKLEEMDGPMANFHLAFLFRLLVPLG
LIPDLQFGGSVIPRWFDPADGRFVPNAPAHGRGIPPHQSTYLQLFSRITF
DNMKAFRLSRAERRQVLDYLVDYYRFHLPPFPLLKTPDILSTLFD
>PG0783 hydrolase, putative
MPLIRSCLMILIDTHTHVYEPQFDDDVEQVILAAQEAGLIHLVMPNIDVE
SIARMQGVLSRHPGYVSEAMGLHPTSVRDDFREQLTFIRHELDTRSFVAI
GEIGIDLYWDKTFEAEQVEAFLTQIEWSMAYDLPIIIHSREAWDVVFACL
NRFPSDKIRGVFHSFSGDETDLRRALSYPHFYIGINGTVTFKKNTLPALL
PLIPLDRLLLETDSPYLAPIPKRGRRNEPAYLVHTATFIAHILGLDPDIL
AEKTARNACRFFGFQFQEGKIIRPI
>PG1197 ISPg1, transposase
MAYQSKNTDEHVTFADALLSKRYRKAQNDFLNQVDRLIDWRPIRTLINKK
YTKRQNAIGAPAYDVILLFKMLLLETWYNLSDCALEERINDSITFSRFLG
LKMEEVSPDHSTISRFRSALTELGLMDKLLAQFNKQLSRHHISVREGVLV
DASLVETPHKPNGSITIEVADDREDNRSEEEKEAEEDYQKQVVRQRKGTD
EEARWVYKQKRYHYGYKKHCLTNVQGIVQKVITTAANRSDTKEFIALLQG
ANIPQGSAVLADKGYACGENRSYLQTHHLQDGIMHKAQRNRALTEEEKQR
NKAISRIRSTIERTFGSIRRWFHGGRCRYRGLAKTHTQNILESIAFNLYR
TPGIIMSSFVG
>PG0128 conserved domain protein
MLSYHTDIPTDLPLLRQAVEAIRREESGGAVPDSDRPRVIYEARNRLYAI
RTAQGEQVVKSFRIPIAIQRVVYSFFRPSKAARSYRNAIRLGRCGIGTPR
PSGYAIEREKGLLCRSYYVCESMHDCRDIRLSMQGVEGGEALLRALAGFI
ARMHRAGIHHIDLSPGNVLYRTDEKGEYSFYLIDLNRMKFYDKPIVGRKA
YANFARLSFCPAVSKQLAEYYAEAQGLNTDGVVQGVQSESDRFFRSKVRK
YARKALVREQKRMSRSAFRRAYIRYRSVRLIRKLTGCTRLFRIENDLYTS
YLEVGDLRHTLKRAEGYSSPKNVQ
>PG0384 MutS2 family protein
MIRMETYPHNFEEKVGFDEIRRLLIGHCHSPMGSDRVIQMHALARHNEVS
RLLAETEEMQIILREEDLFPDLRLADVREALNRIRPAGTYLEERELQDVA
TALRTIEALIRFFHVGEEEEGKDTPYPHLQTLLSEVIVFPDLEKRISSLF
DRFGKMKDNASPELMNIRRELSSIEKNISRTLQGILRLAQSEGWVDQGVQ
PSVRDGRLVIPVAPAHKRKVRGIVHDESGTGKTVFIEPAEIVEANNRIRE
LEAAERREIIRILIEVCDALRPHIHDLIDCYEWIGLFDFISAKARLCGEW
NAIRPILSKKPEIRWEKAVHPLLLRSLRAHGRDVIPLDISLTAPDKRILV
ISGPNAGGKSVCLKTVGLLQYMLQSGLPIPMSPDSTAGIFGKLFIDIGDE
QSIEDDLSTYSSHLRNMKHFARHTDKNTLLLIDEFGGGTEPQIGGAIAEA
LLHRFNEQEGFGVVTTHYQNLKAYAEETPGLVNGAMLYDRHEMRPLFRLS
IGRPGSSFAIEIARKIGLPEEVIAEATDKVGTGYIDMDKYLQDIVRDKRY
WETKRTNIRKEEKRLEGAAAEYESRLESIKKERKQILDEARQQASVMLSQ
SSAQIEKTIRDIKEAQAEREKTRRARQELNDFRETINKEEIEKEERINRE
IEKIKRRKKRKQEKAASRSAETPVQPKLQEVPQPPVIQVGDTVRIKGQTA
IGSIIDMNDREATIALGMIKTTVPIDRLEPAKPVKERKSEPVSGASARMI
IDRIHEKRLDFKQDIDLRGMRVNEAVQAVMYFIDDAIQLGIPRVRILHGT
GTGALRTVIREYLATVNGVRHFADEHVQFGGAGITVVELG
>PG1032 ISPg3, transposase
MKTNIVDVFCIIDDFSKLFDEAIKKKTLEEADKKRRNRKFKMSDSEVMTI
LILFHLSRYRDLKAFYLQYITHSCRSEFPHLVSYNRFVELQSRVGFKLIA
FLNMCCLGQCTGISFIDSTPLKACHIKRARGHRTMRGWAQKGKSTMGWFY
GFKLHIVINDRGEIINYQITPGNCDDREPLKDGTFTKNLFGKLIADRGYI
SQNLFDRLFVDDIHMITKIKKNMKNSLMHLYDKVLLRKRALIETVNDILK
NLCRIEHTRHRSVNNFVTNLISGIIAYNILPKKPELNIEIIRNPNFPISA
>PG0487 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTSWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEQHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG2101 hypothetical protein
MSLKSNAMKYRSIILSFFLLMPVSNCFADMQGYCMPQSWVGKIGQKAKEK
VEKRVEEKVDKAMDKTLDKAEEEATRGQKRATTAPAPKTARKTISLQELE
QETDRKTHHIGNTGKSVGRATGKNGCEILFPVKQGTRREMTIYEANGKVS
GTIRQQVQSVTNTAKGMKITTAQEMYDKKGKQIFSSVANMWCDGDRFYVD
AQSLLNEQTLKMFKDIKYKVTGVDIAYPSRMSAGQSLPDAEVTITAEAAD
FPLPPITLRTIGRKVQGIERITTPAGTFECYKISYSIVMESMITVQMSAV
EWMSKDVGCVKSESYDKKGKLVGSTLLTKLE
>PG1261 ISPg4, transposase
MSTNISLFAQVIRLLPRPLIKKLSTEFQVDKHSKHFTGWQHLVSMIFSQF
SSCISLREISNGLRSATGNLNHLGICTAPSKSNLSYQNEHRTSDFFRACY
YALLDYFGQQGMGQGRKFRFKQPVKLLDSTTITLCLALYDWAKYTHTKGA
VKLHTLLDFKTLLPEYVHISDGKGHDGKMANSIPIPAGSIVVADRGYADT
ALLNSWDSTQVSFVVRHPRSLKYEVIQELELPEHGHQQILVDQRVRLTGV
QTQGKYTKPLRHIALYNEKHGDVVELLTNIDTLAASSIALLYRSRWLIEI
FFRNLKQRLSMKAFLGTTRNAVEVQIWTALITMLLMVYLKSIAKYRWCLS
NLVSSLRINTFTKMDLMQWLNEPFTPPPEPENQLF
>PG2014 cas1, CRISPR-associated protein Cas1
MKKTYYLFNPGELSRKDNTIRFVPIQEGENGQEQAGQARYIPVEGISDFY
VFGSLRANSSLYNFLGSNDIAVHFFDYYENYTGSFMPRDFLLSGKMLLAQ
ASAYKNKKKRLFLARKFIEGAASNMQKNLAYYNNRGKDMQPMMELIDKYS
LRLKETTTIEALMGIEGNIRQAYYDAFNLIIDPFEMGVRSKQPPQNEVNA
LISFGNMMCYTLCLKAIHQSQLNPTISFLHTPGERRYSLCLDISEVFKPI
LVDRTIFKVMNKRIIQAKHFDKQLNKCILNPSGKKLFVQAFEERLSETIR
HRKLNRSVSFRHLVKLECYKIAKDILGIEEYQPFKMYW
>PG1981 cas2-1, CRISPR-associated protein Cas2
MNDKKLLVAYDISSNRRRRKVARILEQCGIRINKSVFICSLRELTMDKLV
EAVTSQTAKRDKVFFLPLCQHCYTAAWMSGHPTLPKSRRKRKSIVV
>PG2013 cas2-2, CRISPR-associated protein Cas2
MYIILVYDIGEKRVGKMLKLCRKYLNWIQNSVFEGEISEVKLLELKSRAA
GIMEKEEDSLIIFSSRQERWLEKEIIGKERSATDIFL
>PG2015 cas4, CRISPR-associated protein Cas4
MNTISGTHFNYHQVCRRKLWLFSAGITMEHTSDLVYEGKLIHETTYQQRP
ERYQELELDGIKIDFYDAKNRVVHEVKKSNKISPAHRLQLLYYLYVLERN
GVLGATGILEYPTLRKKEEVILSDIDRERIREIEQEILQIISHEDCPPVI
DSGICKNCSYYEFCFSGEEQ
>PG0001 dnaA, chromosomal replication initiator protein DnaA
MNYHSTNVNEIWDACLRILQDIVDERAYRTWFLPIIPVSIEGDTLTLQVP
SQFFCEFLEGNFVEQLRTVLGRVIGPNASLQYNALVDNSSPKYPGTVTLA
GCADGGQAAEQFDVNLLHRHMPNAATHSEAQDFDTQLNSRLNFRNFYQSE
CNYVARSVAEAIAASPGNTPMNPFFIYGASGVGKTHLCHALGLRVREMHP
RLKVLYVSSHLFEMQFTTAARMGTINDFIAFYQQVDVLIIDDIQWLIGKK
KTQLAFFQVFNHLYMLGKQIVLTSDKPPVDLNGMEERLVTRMAGATCVKI
ERPDLKLRREILQQRTLQSGVRLDESVLNFIAENVCDNVRELEGTLVSLI
TNSVVVGKEIDLTFAKRIVRQAVRLEKKEVTIECIQQAVSRVFQVQIEQM
KSKSRKQDIVQARQVVMFLSKKHTAQSLSAIGELMGGRNHATVLHGCRCV
TNEMEMNASFRSSVERAEQLIAN
>PG1242 dnaB, replicative DNA helicase
MATTKSRTKRPAPESITAPLEGRVPPQAPELEEAVLGAILLEKDAYMQVG
EMLRPSTFYLKTHELIYEAITQLALNQKPVDMLTVTEQLKKNGNLDAVGG
PSYIAGLTLKVASSANLEFHAKILAQKALSREVIGFSSEVLKKAYDDTED
IEDQLQQAEGRLFEISQHNMKQDVQPIDPIIKEALGEIQIAANRKEGLSG
QPSGFPAIDKLTAGWQASDLIIIAARPAMGKTAFVLSMAKNMAIDYNIPV
AIFNLEMSSVQLVKRLMSNVCEIPGEKLKTGRLESHEWVQLDTKLKDFEN
KPLYIDDTPSLSVFELRTKSRRLVREYGVKVIIIDYLQLMNASGMSFGNR
EQEVSTISRSLKVLAKELKIPIIALSQLNRSVETRQGDINSKRPQLSDLR
ESGAIEQDADMVCFIHRPEYYKITEDQQGNSLLGIGEFIIAKHRNGPVDD
VRLRFRSEFAKFLPLEAESMVKRHSRIGGSPVEMGVGNGSFAPPLPPPED
NPLLSGPYSSGTSEADFLAEASGIDNPF
>PG0035 dnaE, DNA polymerase III, alpha subunit
MEPFVHLHVHSQFSLLDGQAAINDLVDKAIADGMPGIALTDHGAMFGIKE
FYNYVEKKNSGHNATLKDCRRELDQLNDSKSADMLTDEERNRIVKLQQKI
EETKKLLFKPILGCEAYCARRTRFDKDNQIPDPYHPRRSIDASGWHLILL
AKNLRGYKNLIKMVSYSWTEGQYYRPRIDKELLQKYHEGIIVSSACLGGE
IPQHIMAGEIAKAEEAILWFKELFGDDYYLEIQRHETHNPLGNQDVFPQQ
QRVNRVILELGKKLGVKVIATNDVHFCNEEDAEAHDRLICLSTGKDLDDP
NRMRYTKQEWMKTTAEMSAIFEDLPETLSNTLEILDKVELYSIDNKALMP
DFPIPPEYKDDDDYLRFLTYEGARRKYGEDLSDEIKERIDFELETIKGMG
FPGYFLIVQDFIAAARSMGVSVGPGRGSAAGSAVAYCLGITDIDPIKYHL
LFERFLNPDRISMPDIDVDFDDDGRAEILRWVTEKYGKERVAHIITYGTM
ATKSSIKDVARVQRLPLLESNRLAKLVPDKIPGEKKVNLKKAIEFVPELK
QASLSSDKVMRDTLKYAQMLEGNVRNTGVHACGIIIGKTDISDVVPVSTA
PDKDTKEELLVTQYEGSVIEQTGLIKMDFLGLKTLSIIKEALVNIKRRHG
IDLNIDTIPLDDPLTYKLYSDGRTIGTFQFESGGMQKYLRELQPSAFEDL
IAMNALYRPGPMDYIPSFIARKHGREPIDYDLPEMEEYLKETYGVTVYQE
QVMLLSRKLAGFTRGQSDELRKAMGKKLIEKMNVLKVKFLEGGNKNGHPE
EVLEKIWTDWEKFASYAFNKSHATCYSWVAYQTAYLKANYPAEYMAGALS
RNLNNITEITKLMDECKSMKIGVLVPDVNESEMKFSVNANGDIRFGLSAV
KGVGSGAVEQIIAEREANGLYKDIFDFVERINLSACNRKTMESLALAGAF
DSFALSREEYMAPPLTGREESYIQALMRYGSVVQEEKHSQSNSLFGEEED
LMIPRPQPSPTEPWNDLERLNKERELVGLYLSSNPMAQYQVILDHYCNTH
AADLTDPDSLVGKNLVLAGIVTKAFQGISRSNNPYSKITLEDLSGTGEIP
LFGQAHVNFGNYCKEGLYLLIRASVQPHKWKEGEMELVVTSIELLPQVAD
TLIKKMTILLPASKIDNELIEMLSDELTNNSGKTMLFIKVYDHTETFDVE
LAQQKSLIKVSPELVNRLKTYEINFALE
>PG1814 dnaG, DNA primase
MIDDLTRERILDAANIVEVVSDFVSLRKRGVNYLGLCPFHSDRNPSFSVS
PAKNLCKCFACGEGGSPVHFIMKIEQLSYSEALRYLARKYGIKIHERELT
DEEKKLKSDRESMFILNEFANDFFKNNLLNTIEGQTIGMTYFRQRGIRPE
TVQKFQLGYAPEKRSAFSDEAIRKGFKPEYLVDTGLSIRYEDSKALDDRF
RGRVIFPVQTVSGKIVAFGGRILGKKDKAAKYVNSPESIIYSKSKELYGL
FLAKKEIARRDKCFLVEGYTDVISMHQSGIENVVASSGTALTQQQINQIH
RFTSNITVLYDGDAAGIKAALRGINLLLEQGMHVKVALLPDGEDPDSFAR
NHTVAEFEEYIEQHETDFIRFKTQLYLSDMERDPIRRAQLITDIIGSIAL
IPDDITRRVYVQETGQTLSMDERLLAREVQKMRFRRSTYSSPQPATQQPA
NGSERKNDSDNLVGAENHSAETPTDNRSAEVEQPYYPAKYEEELLRLIIR
YGERKLLIYRHEEGAEKQTPSEVALAYYIKSDLNADGVDIGTDVFRQILD
EAAEQSFDSGFVAAKYFRDHPNAQISHIAVELLTNRYALSRIHYGKGDES
NAEEDLQMRVERELLTIKNVFIQVQIRALQKEIAQAQKAGDIELQLEIMN
ELRSMNEMKSELAHRLGDRTVLP
>PG1853 dnaN, DNA polymerase III, beta subunit
MKFEVASNVLLQHLQLIARVIASRSTLPILESVLFELEGDQLRLTAADMA
NRMSTELTVNNVGGENGSFAVPERILLEPLKELPDQPISFEINMETKAAE
IAYSNGHYSFVVQDVSTYPVAASLSPEAIVSIVPAEALLSGLSATLFATS
QDERRPIMTGVYLDFFEDKLVFVGSDGQILVKQEDANVQSRRRSAFCLPR
KACLLLRNVLPRLEGDVTLTYDSNYLHIELGNYTLRARLLEGRYPNYNSV
IPTSNPFSVKVDRAQLLSGAKRVSIFSNPATSMLRMEFTPAGIRLSANDI
DFSVAAEEHVPAECPADINMRIGFKSDVFQTILQGMPSEEVIMTLADQTR
AGLILPAENAPGISLCNLLLPMKLIGE
>PG1418 dnaX, DNA polymerase III, gamma and tau subunits
METKYIVSALKYRPDSFASMVGQEALSATLKSSIVQQKTAHAYLFCGPRG
VGKTSCARIFARAINCLERLPDGEACGRCESCKAFDEQRSMNIYELDAAS
NNSVDDIRLLIEQANVPPQIGKYKIYIIDEVHMLSQQAFNAFLKTLEEPP
SYVIFILATTEKHKILPTILSRCQIFDFKRIPPARIEQHLRYVADSEGIK
AEAEALAIIATKADGGMRDALSLFDRIARFSEGNITYANTIENLNILDYD
YYFRLTDFFLQGDYRSVLIVLDELLAKGFDGQVIVSGLASFFRDLMMAQD
ASTLPLLEQSEVVIQRYVDMAARCPTSFLYQSLKILNACDQQYRQSNGKR
LLLELSLMNIAALFSKGLTAPPPTHGAQHANPVVSGSVSPRPAPAKEQPS
VAPSQSSVPPQPSIGEQRRQPETTAPTITEDKPEQTKVVPPTRPMATSTV
GSEGRFTLRGIRNKQEQLQNNIQEVRTEMTESFDEEQLQLAWIEFAETKL
SEEIHLKETMANCLPKLRPGTSAFEVEVLNSRQEEEINNVSARLLSFLAD
KLCNTTIRMNVRVSESTSLNRVPVSLDEKIERLNQENPLFDELRMRLGLS
LV
>PG0295 dprA, DNA processing protein DprA, putative
MDDRRPQLTEEQLLFRIALTHVKGVGSVLARQLLSAMGSPEAIFSDRKEL
VQRLPKAPRRLLDAIFSPSVMEEARRKLDQALKAGLNMYFITDDNYPYRL
KECVDAPILLYSKGNVDLSPRRVLSIVGTRNITAYGRTATERIVSGLAET
IPDLLIVSGLAYGVDVAAHKAALDNGLPTVAVLAHGLDRIYPSGHRSIAM
EMLRNGGLLTDYPMGTEPERFNFVGRNRIVAGLSDATLVIESAEKGGSLI
TAGLAFGYNREVLALPGRATDSRSAGCNALIRDQKAALVSSAQDVLTLLD
WSSTIDAKPQTLNFRPDSWPDTPVAECLLRAGTASVDELTRATGLPINDV
SAQLFDLELDGLVQSQPGGIYSVI
>PG1386 gyrA, DNA gyrase, A subunit
MTDREDRIINISIEEEMKTAYIDYSMSVIVSRALPDVRDGFKPVHRRVLY
AMNETGNVYTNPTRKCANAVGEVLGHYHPHGDSSVYMALVRMAQPWSLRY
PLVDGQGNFGSVDGDSPAAMRYTESRLSRIAGEMLQDIDKETVDFQNNFD
DTRQEPTVLPTRIPNLLINGASGIAVGMATNMPPHNLSEAIDGCVAYIEA
DGDIDVEGLMQYVKAPDFPTGGFIYGYSGVKEAFETGRGRVVIRSRAEIE
QHNNHERIIITEIPYLVNKAELVSNIAQLINEKRLDGISNISDESNRKGM
RIVVEIKRDANASVVLNKLYKMTALQSSFSVNNIALVKGRPRLLNLKDLI
GEFVDHRQEVVTRRCRFELRKARERAHILEGLLIAVDNIDEVISIIRSSK
DAAEAMSRLIERFDLSDIQARSIVDMRLRALTGLERDKLRAEFEEIMASI
NHLEALLADRALLMELVKSELLEIKEKYGDTRKSEIIYASEEFNPEDFYA
DDDMIITLSHMGYIKRTPLSEFRTQARGGVGAKGSDTREEDFVEYIYSAS
MHATIMLFTAKGRCYWLKVYEIPEGAKNAKGRAIQNLLNIDPDDKVNAFI
RIKNLTTDKEFVNSHYLLFCTKRGIIKKTLLEAFSRPRANGVIAIDLRDN
DGLVSVRLTNGKCDMVIANRGGRAIRFHESVVRPSGRTAMGVKGMTLDDD
GQDEVVGMISIKHPEEETILVVSEKGYGKRSNIDDYRITNRGGKGVKTLN
ITEKTGKLVDIRAVTDANDLMIINKSGVAIRVKVADLSIIGRATQGVKLI
DLSKRGDEIASVCSVVSEEEENKAEEHADHTHEVLPQDDSSSSVENGDVS
DMTDAIEPE
>PG1702 gyrB, DNA gyrase, B subunit
MNEDIKKNTSASEYSASNIQVLEGLEAVRKRPAMYIGDISEKGLHHLVYE
VVDNSIDEALAGYCDNVEVIIEEDNSITVRDNGRGIPVDYHEKEGKSALE
VVLTVLHAGGKFDKGSYKVSGGLHGVGVSCVNALSTYLRAEVYRNGKIHM
QEFSCGKPLHDVEVIGSTERTGTTIQFKPDSSIFSVTEYQYSILAKRLRE
LSYLNAGITLTLTDKRTLKEDGSGYKQDVFRSEEGLKEFVRHLDRMKEPL
VDNVIHIVTEKQGIPVEVAMTYNTSYLENVYSYVNDINTIEGGTHLAGFR
RGLTRTLKKYATDSKLLDKVKVEITGDDFREGLTAVISIKVAEPQFEGQT
KTKLGNNEVTGAVDMAVSEALEYYLEEHPKEAKLIVDKVVLAATARQAAR
KAREMVQRKSPLSGGGLPGKLADCSSKDPEQCELFLVEGDSAGGTAKQGR
DREFQAILPLRGKILNVEKAMQHKVFESEEIRNIYTALGVTIGTEEDSKA
LNLSKLRYHKVVIMTDADVDGSHIATLILTFFFRNMRTLIENGYVFIATP
PLYLCKKGKEQEYCWTEQQRQAFVDRYADGNESRVHVQRYKGLGEMNEEQ
LWETTMDPEKRTLRKVTIENAAEADAIFSMLMGDEVGPRREFIEENATYA
RIDA
>PG0121 hup-1, DNA-binding protein HU
MNKTDFIAAVAEKANLTKADAQRAVNAFAEVVTEQMNAGEKIALIGFGTF
SVSERAARKGINPKTKKSISIPARKVVRFKPGSTLELK
>PG1258 hup-2, DNA-binding protein HU
MTKADVVNAIAKSTGIDKETTLKVVESFMDTIKDSLSEGDNVYLRGFGSF
IVKERAEKTARNISKQTTIIIPKRNIPAFKPSKIFMSQMKQD
>PG1253 ligA, DNA ligase, NAD-dependent
MEKIVPPAVRIEELRRILREHEYRYYVLSSPTIDDFEYDAMMKQLEELER
EYPEWDSPDSPTHRVGSDKTEGFASVRHDRPMLSLSNTYNYDEIGDFYRR
VSEGLQGAPFEIVAELKFDGLSISLIYEDGMLVRAVTRGDGIMGDDVTAN
VRTIRSVPLRLRGDDYPRMLEVRGEILLPFKEFDRINAQREAEGEPLFAN
PRNAASGTIKQLDPHIVAGRNLDAYFYYLYSDEPLAENHYDRLMQARQWG
FKVSDAVTLCCSKEEVYAFIDRFDTERLTLPVATDGIVLKVNAPAQQDLL
GFTAKSPRWAIAYKYQAERVRTRLQHVSYQVGRTGAVTPVANLDPVLISG
TVVRRASLHNADFIAEKDLHEGDFVYVEKGGEIIPKIVGVDTDARSIDGR
PIVFTVLCPDCATPLVREQGEAAYYCPNAEGCPQQQKGRLEHYCGRKTAD
INIGPETIELLYSRNMIRNVADFYALTEEQLLTLPGFKKRAAAKLLDSIE
ASKARPYQAILFGLGIRFVGETVAKKLAAVYPSIDALAAATSEELVQIDE
IGERIAAAVLHFFSLRQNRELIERLRLAGVSLEAETVSVAVSNRLAGKTV
VISGTFEKRSRDEYKAMVEDNGGRMAGSVSSKTSFILAGSDMGPSKREKA
EKLGVRLMSEEEFLRLIEE
>PG1774 mfd, transcription-repair coupling factor
MIKSPQPIDILSLFRRHNGVQALMKALDREKCVALDGLCGSSAALIVKLL
HESGRPVLCIASDMEEAGYLFSDLEQLGGEGSALFFPSSYKRAIKYGHTD
AAQQVLRAEALAALSMEGSCPLVVSYPEAVAERVVAGDILEKEMHSIRQG
DRLDRDFLRDLLLEWGFERTDYVYEPGQFAVRGSLLDVFSFSRELPVRID
FFDDEIESIRLFEVESQLSVGTLSEVVLMPDVAGDVRAEECLLGLFPKNT
IIVLPDRPFLEDRLQMVFTDAPLFDDGEGFESLVAMQERLTSPKELMDKL
NDFTTIATGSGRSGNYRIGFKTQPQPLFHKNFDLLIDQLEQWRDAGYRML
LATASDKQYERVEEILSERGSSSLLPKRVSLTLHEGFSDEALRIVLLTDH
QLFDRYHKYNLKSDKARSGKVTLSLKELNQFSQGDYIVHIDHGIGRFGGL
ITTDVGGKRQEVIKLIYRNNDIIFVNLHSLHKLSKYKGGDSDAQVELSRL
GTGAWQKLKERTKKRVKDIARDLIRLYAQRKEERGFAFSPDSYLQHELEA
SFLYEDTPDQERATAEVKADMESDRPMDRLICGDVGFGKTEVAVRAAFKA
ATDGKQVAILVPTTVLAYQHYQTFRDRLQNFPVRIEYISRARSAKDIKAI
LHDLAEGRIDIIIGTHRLVSNDIRFHDLGLLVIDEEQKFGVAVKEKLRKL
QVNVDTLTMSATPIPRTLQFSLMGARDLSNINTPPPNRYPVATELARFSP
DIVREAVNFEMSRNGQVFIVHNRIDNIEEIAGIVQREVPDARVAVGHGRM
SPTELERLILDFVHYEYDVLVATTIIENGIDVPNANTIIIDDAHRYGLSE
LHQLRGRVGRSNRKAFCYLLSPPLSVLSDDSRRRLQAIENFSDLGSGIRI
ALQDLDIRGAGNVLGAEQSGFIADLGYETYSKVFNEAVSELKADEFADLY
AESQEAIPSASRFVVETTVESDLELSFPEEYVPLDSERILLYRELDNLST
DEELDAFRRRMQDRFGKIPPEGEELIRVPRLRRLGRSLGIEKIVLRGDQM
SFHLVGKEDSPYYQSEVFGMLLEYIAAHTRRCEIRQSGGKRIVRLREVPD
VLTACELCTAISTRSSAERIEL
>PG0412 mutL, DNA mismatch repair protein MutL
MSDVIRLLPDSIANQIAAGEVIQRPASVVKELLENALDAGASIIRLDVRE
AGRELIRVTDNGKGMSQSDARMAFERHATSKIASFQDLFSLRTMGFRGEA
LASIAAVAQVELLTRRAEDELGTRLTINGSEVGEVATVTSPLGCILCVKN
LFYNVPARRKFLKSNETEFRHILTEYERVALVNPQVAFSIYHSGELVQDL
PPSPLKKRILDVFGKRMEKDLIPIGMKSPITNISGFVGRPDGARKRGALQ
YFFVNGRFMRHPYFHKAVMAAYEAIIPQGTMPNYFLYFDLEPSQIDVNIH
PTKTEIKFSDEQAIFKLIGVVIREALSSSNAVPAIDFDRKELIDIPAYQG
PGKNVVRPPVDLDPSYNPFKETGLTEPIRSSRRQSPDMGWNELFKQFEAK
RDAEKMAEPPIRSEELFASTDFTPSAVSATPSTDMLCYVHRGRYLVTTLS
RGLALVDFHRAHKRILYDRFMADESRRHIEQQQLLFPELLEFNPSDASAV
KAAVDELQSVGFDLSPLGVSSYSLLAAPVQIIDCAADVVRDVIHTTLEDG
RSSHEQMLELIATQIAEYQAIPCGKTPTAEEASDLLAELFASNDSTYTPD
GKLIVSIIEEADIARRFE
>PG0095 mutS, DNA mismatch repair protein MutS
MAKPVVETPLMRQYFQIKQKHPDAILLFRVGDFYETFSEDAIVASEILGI
TLTRRANGAAQFVELAGFPHHALDTYLPKLVRAGKRVAICDQLEDPKKTK
TLVKRGITELVTPGVSTNDNVLSHKENNFLAAVSCGKEVFGISLLDISTG
EFMAGQGNADYVEKLLTNYRPKEILVERSERSRFNDLFHWSGFIFDMEDW
AFSSENNRLRVLKHFDLKSLKGFGLEELSMAVTAAGAVLNYLDLTQHHQL
QHITSLSRLDENRYVRLDKFTVRSLELLSPMNEGGKSLLDIIDHTITPMG
ARRIRQWIVFPLKDPARIQARQRVVEFFFRHPEERAIIAEHLTEIGDLER
LVTKGAMGRISPREMVQLRVALQALEPIKEVCTHADEENLRTLGGKLELC
KELRDKILREVMPDAPAALGRGPVIAHGVDATLDELRALAYSGKDYLIKL
QQQEIERTGIPSLKVAYNNVFGYYIEVRNTHKDKVPAEWIRKQTLVSAER
YITEELKEYEAKILGAEEKIAALEGQLYALLVAELQRYVAPLQQDSQAVA
SLDCLLSFAESARRYRFICPVVDESFTIDIKAGRHPVIEQQLPADEPYIA
NDIYLDTDRQQVIIVTGPNMSGKSALLRQTALISLMAQIGSFVPAESARI
GMVDSIFTRVGASDNISMGESTFMVEMQEASNILNNLTPRSLVLFDELGR
GTSTYDGISIAWSIVEYIHDNPKAHPRTLFATHYHELNELEGQLDRVHNF
NVSAREVDGKMLFLRKLEPGGSAHSFGIQVARLGGMPHHIVQRATDILHR
LEQEREKIEEEEPKTKDTKRGPSEKVKNASPTLPRDEKGRSIDGYQLSFF
QLDDPVLSQIREEILDLNIDNLTPLEALNKLNDIKRILRGY
>PG1378 mutY, A/G-specific adenine glycosylase
MSYKDAKNYIHQFISLHKQHFLFQSCMTNSSSRTRSLPSESKIDPLPYFP
ELRKLLAEWYDANKRDLPWRQTDDPYRIWISEVILQQTRVEQGRDYYHRF
IECFPDVHSLSLASEDEVLKQWEGLGYYSRARNLHRAARMIVSDFGGCIP
RTRQEILRLPGIGDYTAAAVLSFAYDLPFAAVDGNIFRVISRLMNLDTPI
DTPAGKKLFSFWADALLDREAPARHNQAIMEFGALHCTPTSPSCLLCPVR
RFCMADTAGCVDALPVKKGGLRITNRYLYFIYIRVITPTGVYTYIRRRPS
GDIWQGLYEFPCVELSDHAVLETLLLSPELGNLLRSISGSMDSLPFKTFK
HQLTHRNLWIHGYTLTARLDKAPDLDGYRCIREEQLDDFAFPRALNLLLD
ALSLSDK
>PG1772 nth, endonuclease III
MRKEERYKAVIDWFAENMPVAETELRYRDPFQLLVAVILSAQCTDKRVNM
VTPALFSAYPTAKDMAGSTVEDLLSYIGSISYPNSKAKHLVGMAQMLCSD
FGGVVPDEVSELTKLPGVGRKTANVIASVVYGKPAMAVDTHVFRVSERIG
LTTGSKSPLETERELVRYIPDVLIPKAHHWLILHGRYVCLARKPKCADCG
IAPFCRYYSKVFKKNSTALPKKGE
>PG0486 ogt, methylated-DNA--protein-cysteineS-methyltransferase
MIKYSMETICIQHYQSPCGDLILGSYGHQLCMCDWVHKERKEIIDMGLQK
RLCSSYEIALSPVLKETIAQLDEYFSHKRETFDIPLLMAGTEFQQTVWGE
LLNIPYGTTISYATLARRIGNPKAVRAVARANGANPISILVPCHRVIGSD
NTLTGYGGGLDKKEFLLRHEMSLPV
>PG1794 polA, DNA polymerase type I
MTERLFLLDAYALIFRAYYAFIRSPRIDSTGRDTGAVFGFALTLLDILEK
ESPEHIAVVFDPPGGSFRHREYAEYKAQREETPEGIRIAVPLIKEILAAF
RIPAVEVPDFEADDTIGTLAKQAEEQGLAVRMVTPDKDFGQLVSERIKIY
RPKSGGGYETWGPAEVCEKFGLSIPGQMIDYLGLVGDSSDNIPGCKGIGA
KTAEKLLAEYGSIDGIYAHQDELKGAVAKKIQEGEEQTRFSRYLATIRTD
APIVFDSEAYRRTSPDMAAVRECFAALEFRTLLKRLESTPTDAPATDLFA
GMVQAQEPPTDLFGEGTDATGLPLKKLTDVPHEYTILKTEEEIADCIRMF
SATPCFSFDTETDSKDALRANIVAITLCAESGRAFFIPLPEDEEIGKRRL
DLLRPLFADTAIGKVGQNMKYDIQVLSRYGIEVRGQLFDTMIAHYLLFPD
LRHNMDEMAETLLGYCTVHYSDLVGSDKQEVHIRQVPLQNLADYAMEDAD
ITWQLYERLNAMLSEAGMTSLFESIEMPLVPVLANMERSGVKLDTEVLRR
TASGLGEEMQRIEDEIYRLAGHSFNINSPSQVGTVLFEELQITEKPKKTK
SGSYSTNEEILVKLQEKHPIVRLILDYRGIKKLLSTYVEALPEMRYPDGK
LHTSFNQTVATTGRLSSSNPNLQNIPIRTEVGRGLRAAFVPDNDECIFMS
ADYSQIELRLMAHLSEDESLIQAFLHGEDIHRATASKIYRLPLVEVTDDM
RRRAKTANFGIIYGISAFGLSERLNISRTEAKALIEGYFASYPGVKAYMD
RSIAEAKRQGYVTTLFGRKRFLRDINSANAVVRGYAERNAINAPIQGSAA
DLIKLAMIRIHEEITERKLRSRMILQVHDELNFNVLRPEAAEVRELVRSC
MEGVMPSLRVPLIAEIGEGANWLEAH
>PG2032 priA, primosomal protein N'
MRYAEVLIPLALEGSFHYRLAEGLAEKAIVGMRCVVPFGAKRYYTGIIIG
LSDKRPNLQISFKEVLFLPDDKPSVTASQLSLWQWLSAYYICTQGEVLRA
ALPAALLPESHTVIHYNTDFEADSRLSRDEEELLDILESAKGRTYTLDAL
QKAVGKRAIRAFTSLVERGAIRLEEEVKSRYKPKSEVFVRLAEPFRTEKT
FASLLDSLHRAPKQSALLLHWAELITEHSLPYSSPMPQRLLAESDPHATV
TLSALKKKGIFLSESVTHSVMYSAGGGEYRLWEQPQEEEKAIQSEESSTD
SAVSASLPQKPLSLLYTHDFRRKEKQLLEWTEEVVRSGGQVLYLSPEANK
RGGSDTLSTRMAERLGSCLLSYHAFESDAKRVEVWNRLATTEYPCVVLGV
RSALFLPFRRLRLIIVDEEQEYLYKQQDPAPRFHTRQVAARLGRIHDCPV
VLASATPSAEVLHQVRHKACELITWPDDRVRPRFDLEVIDMGKMRRQRQV
GAGELLSFPLVSAIEETIRQKKMAVVLQNRRGFAPYIICSSCGEKLRCIH
CDVSLTYHKHSCMLVCHYCGYSRPLPRICPSCKRASGVGEPSSLQPVGYG
AERIEEELKRRFPTVSILRMDSDMAMSRTKMDEALARLEAKEVDILVGTQ
LIKGQVYNEHVGLVAVTQLDSILGFPDFRAYERAYQLLYQLMLRSGASRL
YIQTNNPANSFPDLLREGDYKAFIGKQLEERQMLFFPPFSRLIRIEFRAG
EESLVERIASDYAASLAAHLPEQSLSPVLVPPISRLRNAFIRELLLRLPL
SSSVSATRNILDTIRTTLQTRTQEYKRVRILFDVDPL
>PG0894 radC, DNA repair protein RadC
MTIKQLHESDRPREKMIRLGARCLTDAELLAILIGSGNDRQTAIQLAQEI
LAKMDNSLPRLAKCDIQELTGSFRGMGPAKAVTVMAAMELSRRIPSQEMP
RRESITDSRMAYRTISPFLTDLPQEEMWVLLLNQSGKIISMENLSRGGVS
ETSADVRLIMHKAVSHLASAIILAHNHPSGTVRPSEQDIQLTQRVQKAAT
LLGFRLNDHLIIGDDGAYFSFADEGLL
>PG0881 recA, recA protein
MAEEKIPTVQDEKKLQALRMATEKIEKTFGKGAIMNMGANTVEDVSVIPS
GSIGLDLALGVGGYPRGRIIEIYGPESSGKTTLAIHAIAEAQKAGGLAAI
IDAEHAFDRTYAEKLGVNVDNLWIAQPDNGEQALEIAEQLIRSSAVDIIV
IDSVAALTPKAEIEGEMGDNKVGLHARLMSQALRKMTGAISKSNTTCIFI
NQLREKIGVLFGNPETTTGGNALKFYASIRIDIRKSTPIKDGEEIMGHLT
KVKVLKNKVAPPFRKAEFDIVFGEGISRSGEIIDLGVELDIIKKSGSWFS
YGDTKLGQGREAAKEMIRDNEELAEELTEKIREAIRNKHS
>PG0398 recF, recF protein
MIIEELHIVNFKSIAAADCRFSPKVNCLVGNNGMGKTNLLDALHFLSFCR
SHLSVPDNMVVRHGEEMALLQGLYRDESGDGIELLLSIRPGKHKVLRRNK
KEYERLSDHIGRFPLVIVSPQDYQLILGGSDERRRFMDQQLCQQDPRYLS
ALIQYNRHLQQRNTMLKQDRHDDALMDVLELQMGSYAAEICNKRSRFIED
FLPVFNDLYSDISGSAEKVSLSYRSHLADGIPLEELLRRSRPKDYLLGFS
SCGVHKDELEMLLGGVLIRKIGSEGQNKTFLISMKLAQFRHQQLHGDETP
ILLLDDIFDKLDATRVERIIRLVGGNGFGQIFITDTNRKNLDEIIASWSE
DYRLFKIENGQIFQ
>PG0348 recG, ATP-dependent DNA helicase RecG
MDILSTKITYLTGVGPKRAEVLKEEIEVRTYLDLLHYFPFRYVDRSRFYA
IREIRSDMPYIQLRGVLRNFSEVGEGRRKRLTATFSDGTGSIELVWFKGI
KYIRDKLQEGRRYIVFGKPVFFASGYNIAHPEIDAEEKAEQVAGGLTPIY
HTTERMKSMGLGSKQLQQLLYVLLNQVSATLTETLPPYILSSYGFVSYQE
AIRQIHFPQGVAQLEAARTRLKFEELFYVQLHLIGSKLERKARFQGIVFA
QVGALFNTFYKEHLPFELTGAQKRVIREIRQDTLSGHQMNRLVQGDVGSG
KTLVALLSMLLALDNGCQACLMAPTEILARQHHHTLSELLRPLGIEVGLL
IGSCTARQRERLLPRLADGSLSIVVGTHALLEQGVAFRRLGMAVIDEQHR
FGVQQRARLWEKNLDTLPHILIMSATPIPRTLAMTLYGDLDISIIDELPP
GRKPIQTLHHFDNDMAPVFRFLRSQLAAGRQVYVVYPMIEGSETTDLKNL
EDGFELFSSIFPDEGVTMVHGKMKAKEKEARMADFVSGRSRILLATTVIE
VGVNVPNATVMVVENADRFGLSQLHQLRGRVGRGGEQSYCILITGTKTGE
DSRRRIQVMVETNDGFEIAEEDMRLRGFGDLEGTRQSGRQISLRIANPAR
DTELIALSRSIAEQLLERDPELRHPDNQMLALRLNKLFPKEEDWSVIS
>PG0054 recJ, single-stranded-DNA-specific exonuclease RecJ
MNYNWNLEPFTAEERSLGTIFCRELRLAPVVGCLLARRGIASVEEAKRFF
RPRLEDLHDPFLMKDMDKAVARLNRAVGRKEKIMIYGDYDVDGTTAVALV
YKYLRATGCSETQLDYYIPDRYDEGYGISYRGIDLAHSLGTKLIIVLDCG
IKAVEKVAYAKSLDIDFIICDHHNPDETLPDAVAVLDAKRADNTYPYEHL
SGCGVGFKFMQAYARSNNLPESKLLPLLDLVAVSIASDIVPIMGENRILA
YYGLRQLNKRPCLGLQAIIDVCGLKKRRIDMNDIVFKIGPRINASGRMMN
GGKAVELLLSQDAVEAQSRTANIDEYNDQRRELDKDVTEQAIDILGELSD
VDKKILVIYRPEWHKGVIGIVASRMTERYSRPTIVMTKSGDFISGSARSV
GGFDVYKAIEHCKDLLVNFGGHPFASGLTIKEENLETFRKMITDYAEEAV
SPELLVPQIDVDAEISIEEVNYKLLDNLKRMGPFGPENSKPVFISRQLYD
AGGSRAVGKASEHLKIDVRVGSGERHPVSGIAFNQAGHCDEIKNGSFRLC
YTIEENEFNGNKSLQLLVRDIKPEEESAINGKTR
>PG1849 recN, DNA repair protein RecN
MLASLHIANYVLIDRLDIDFAPHFSVITGETGAGKSILLGALGLLVGGRA
DTSAIAPGTDRCIVEGRFTGFVPEMKAVLDRYDLDFDPDECTIRREISSK
GKSRAFVNDTPAPLTALRELADFLIDIHSQHKNLLLGDSLFQLNVLDAYS
GKPDLYAHYSKAYRVYAERKQKLEDLRKAAAATASEYDYWQFRFEQLDKA
GLESGEEARLQEEQAMLTHALDIKRELGHSYSLLSDDERGLLSGLNKVED
ALATIESYYPDSASFRQRVRDVRIELADIASDLGRRSDDVSYEPERLNAV
TDRLDEILSLLHRYNADSSDALIAIRDDLAERLSRISTDEEEISRLEQEV
LAFYKEIEAQASLLTEERIRAASALETSLCESLRKLNMPHVRFVVDIRST
EYGPHGADKVVFLFSANKQMEPEPVSEIASGGEIARLMLCLKALIADKRS
LPAIVFDEIDTGVSGEVADRMGEIMAHMGQGMQVLAITHLPQIAARGERH
YFVYKDETGERARTFIRELTPEERIREIARMQSGNNLTDVALAAAKELLA
R
>PG0416 recQ-1, ATP-dependent DNA helicase RecQ
MDIVLAEKLQQYFGFDRFKGNQEAIIRNVLAERNTFVLMPTGGGKSLCYQ
LPAMLMTGTTIVVSPLIALMKNQVDAMRSFSAKDGVAHFMNSSLNKTQLE
RVKNDVRDGLTKLLYVAPESLTKEENIAFLREIDISFYAIDEAHCISEWG
HDFRPEYRRIRPIINEIGPRSIIALTATATPKVQHDIQKNLGMMDADVFK
SSFNRPNLFYHIRPKTQDVDRDVVKYILSQPGKSGIVYCMSRNKVTTFAQ
VLQANGIKALPYHAGLDAGERASNQDAFLNEEISVIVATIAFGMGIDKPD
VRYVIHYDMPKSLEGYYQETGRAGRDGGEGECIAFYRPKDLQRLEKFMQG
KPISEQEIGRQLLAETAAFAESRVCRRKLLLHYFGEDYTQENCGACDNCT
STYKQVEAKELLLNVLETVSALKEQFKTEYVVNVLMGELTPDIESFGHQD
LEVFGIGSDETETTWTAVIRQALLSGYLSRDIENYGLLKVTAKGKKFALK
PVSFKIVDENEEDEDDDAPKVGGRGSGGAMDPVLFAMMKDLRKKLGASLK
LPPYVIFQDPSLEAMATFYPVTIEELQNIPGVGVGKATRYGKEFLDLIRR
HVEENEIERPEDMRVRTLANKSKMKVSIVQQIDRKVALDDIAVSHGLDFP
ELLSEVETIVYSGTRINIDYFINEVMDEDHLEDIFEYFKESTTDSLEEAM
QELGKDYSEEEIRLVRIKFLSEMAN
>PG1831 recQ-2, ATP-dependent DNA helicase RecQ
MPEEVSNEIEAPSLHTPEEVLLHYWGYPSFRPVQLPIIESVLAGKDTLGL
LPTGGGKSITFQVPGLLLPGLTLVVTPLIALMRDQIMGLRQKGIKATAVH
AGMTREQIITTLDNCIYGRYKFLYVSPERLGSELFLSRLHALRVSLLVVD
ECHCISQWGYDFRPAYLSIADIREALPDVPVLALTATATRPVIDDIQRIL
RFPEPNVLRKSFFRPNLSYSIRRTADKETMLLHILSRVDGSAVVYCRNRD
KARDLARFLGENGFSADFYHAGLNHVTREIRQKSWMEGETRIIVCTNAFG
MGIDKPDVRLVLHMEMPSSPEEYFQEAGRAGRDGERAYAVLLAGEDDIFN
LKRRVSNEFPPREYIATVYNRICNYLQIGEGEGFERSFDFDIDAFCRNFR
MFPTQVLAAIRILDVAGIWEYREEKTRSRLTIQVQRDELYRMRSEQASDS
VLTALMRTYDGLFADYVSIVESELAEKTGLSVDQVYQQLLLLNKAGIVNY
IPQKNLPRIYFLTRREDAELLQIPRAAYEDRRDRLKARIDQSLRYIEEEN
TCRSRMLLAYFGEEQSHNCKLCDVCLRRKDGELHHHEVDDLLHFLEQRLT
EETPYVLIADICRELHHHPDVVLKAIRFVMKESWQYSTDGDTVFLTHKLP
GGLNL
>PG1255 recR, recombination protein RecR
MIQKYSSRLLEKAIDQFATLPGVGRKTALRLALYLLRQPVENTRQFAAAL
VDLREHISYCRHCHNISDSDVCTICADPTRDQSTLCVVENIRDVMAIENT
SQYRGLYHVLGGVISPMDGIGPGDLQIDSLVHRVASEQIHEVILALSTTM
EGDTTNFFLFRKLEPTGVRVSVIARGIAIGDEIEYADEITLGRSILNRTD
FSDSVKF
>PG0736 rnhB, ribonuclease HII
MLLSRYIDDDLRECGCDEAGRGCLAGPVYAAAVILPADFSHPLLNDSKQL
SEKQRYTLRPVIESETIGWGIGIVSPQEIDEINILRASFLAMHRAIEQLP
FRPERLLIDGNRFDPFEQIPHHCIVGGDARYRSIAAASILAKTYRDDSML
RLNKDFPMYGWERNKGYPSPAHKSAIRRFGVSPHHRLTFRGVVDADRPTT
E
>PG0811 ruvA, Holliday junction DNA helicase RuvA
MIEYLKGAIVGLTPTNLVIECAGVGYDVNVSLTTYSAYQGKKEGLIWITQ
LIREDAHLLYGFSTKEERTLFGQLTSVSGVGPTTAQLILSSYAPQELAAL
ITTGQADALKAVKGIGLKTAQRIIVDLKGKIQLETSSDEILSARTAVGDA
ALNTIASGEEAISALKMLGFADPAIRKAVKSILSEDSSLAVEDIIKRALR
ML
>PG0488 ruvB, Holliday junction DNA helicase RuvB
MTEEFDIRQERYRGNDGEREVENKLRPLTFDSFSGQDKVVENLSIFVAAA
RLRGEALDHTLLHGPPGLGKTTLSNIIANELGVGLKITSGPVLDKPGDLA
GLLSSLESNDVLFIDEIHRLSPVVEEYLYSAMEDYRIDIMLDKGPSARSI
QINLSPFTLVGATTRSGLLTAPLRARFGINLHLEYYDVHTITGIVERSAR
ILEVSCSHDAAVEIAGRSRGTPRIANALLRRVRDFAQVKGSGAIDKPIAC
YALEALNIDRYGLDNVDHKLLATIIDKFAGGPVGLSTIATALGEDPGTIE
EVYEPFLIKEGFLKRTPRGREVTELAYTHLGRNPRPHRPSLFD
>PG1324 ruvC, crossover junction endodeoxyribonuclease RuvC
MSPKERIIMGVDPGTILMGYGMLHVVGNTPRLMAMGVIRLEKFDNHYIRL
KRIFDRITGLIDEFLPDEMAIEAPFFGKNVQSMLKLGRAQGVAMAAALAR
DIPITEYAPMRIKQAITGNGNASKEQVAGMLQRYLRIPDEQMLPEMDATD
GLAAAVCHFFQTSGPMARSGGSAVKNWKDFVNRNPDKVR
>PG0271 ssb, single-stranded binding protein
MSLNKIILIGRTGKDPEIRYFDSNSAVANFSLATSERGYKLANGTEVPER
TEWHNVVAYRELAIFAEKWIKKGSLLYVEGKIRYRTYVDNTGVRRQVTEI
LAEKINFFESGSSNRDESRTSQTPSSTQDTTPLASSSSVRDTAKEESSEP
PSDLPF
>PG0754 topA, DNA topoisomerase I
MKNLVIVESPAKAKTIGRFLGSDYTVLSSYGHIRDLKPNKFSVDIQNNYE
PEYEIPADKRPVVKELKSQADRSDFIWLASDEDREGEAIAWHLYEALGLK
NKQTKRIVFHEITETAIRAAIENPRDIDINLVDAQQARRVLDRIVGFELS
PVLWRRIRPSLSAGRVQSVALRLIVEREREINAFVPEASFRCTIEFVLPD
GRMLTAELQKRFKTKEEARYFLEQCMDAHFHITDVTKRPGKRSPATPFTT
STLQQEAARKLGYGVAQTMRIAQKLYEEGLITYMRTDSVNLSDMALGALK
KEITEHWGEQYYRFRRYKTKTKGAQEAHEAIRPTYIHRAEIDGTPQEQKL
YQLIRRRTIASQMADAILEKTTITIGTDKFAETLSSQGEVIVFDGFLGVY
REDSDEEHGSANTEEQLLPSVKAGDTLSLHHAKATESFTQRPARYTEASL
VRKMEELGIGRPSTYAPTIQTIQNREYVVRGDKPGKTREYILLEYHKGKA
ITETIKTELNGQDRNKLLPTDMGLVVNDFLVASFPQVIDYNFTAKVEKEF
DQIAEGKLQWQKQIGRFYNKFHPLVAEACEFDPDQKIGERMLGTDPVTGE
SVVAKMGRYGAMVQKGRTDKENGIKAQFASLQPGQSIESITLEEALELFL
LPKKLGQYEDADVMVAVGRFGPYIKHAGKFVGLPKDTEPLSVSLDDAIKY
IADKREKEEKSLIKGFAEDPEMEIRTGRFGVYIKYKGKNYKVPKTVEDPE
KLTLEECLKYVEEGETKPAKGKKKAPAKKTSAKKTAKK
>PG0104 topB-1, DNA topoisomerase III
MIVCIAEKPSVGREIAAVLGATKAYKGYMEGNGYQVTWTFGHLCALKEPH
DYAPEWKRWSISSLPMIPLRFGIKLIDSDSIREQFGTIERLVHEADMVVN
CGDAGQEGELIQRWVLQKTGCTCPVRRLWISSLTEESIREGFARLRDSEE
FHSLYEAGLARAIGDWLLGMNATRLYTLRFGGNRQVLSIGRVQTPTLALI
VRRQHEIEHFTPEVYWEVKTIYRDTVFNATKGRFSSIEDARREVEVVAGS
PFAVTSVATKKGRELPPRLFDLTGLQVECNKRFAMTADATLRTIQSLYEK
KITTYPRVDTTYLSDDVFEKVPNILQGLSDYTLLTAPLRSSKLKKSKKVF
DNSKITDHHAIIPTGQPSHGMSEDERRVFDLVARRFIAVFYPDCIFSQTT
VLGQAAKVEFKTTGKQILEPGWRTVFTDPQTDDDGENKDEEKKLPIFSEG
ESGVHTPEVQEKTTQPPKFYTEATLLRAMETAGKSVKDEELRDALKENGI
GRPSTRAAIIETLFKRNYIYKEKKNLKATPTGMSLIATIDYDLLKSAELT
GQWEYKLRRIERGDYSAADFINELKVLLTHLIPNVLKSNSSVLLSEPIPA
ASPAPGKKKKADKPMQKQLDLTCPVCGKGIIVRGREAFGCNAFREGCTFR
LPYSEYPASLSDNELVDLLSSQKGH
>PG1495 topB-2, DNA topoisomerase III
MKCIIAEKPSVARDIARIVGATHKEEGCFSGNGHVVTWAFGHLVTLAMPE
AYGFPTYSREQLPIIPEPFRLVVRQIRKGTTYTDDPSALKQLNIIRKCFD
RSDHIIVATDAGREGELIFRLIYAYLDCRKPFDRLWISSLTDKAIREGLS
RPADGRQYDRLYLSGKARSEADWLVGINASRALSIARGGAYSLGRVQTPT
LSMICRRYIEHRDFRSVPFWKIHALPTGLSVKAIGETTYESRRAADRAMK
DLDPREALIVSSVSVQTNPVPPPLLYDLTALQKEANKRYGYTAEKTLSLA
QSLYEKKCTTYPRTGSRYISEDIFEEIPSLLESLKDDPDYGDYVEKLTAG
TLNRRSVDDTKVTDHHAILLTGEKSDSLTRDEEVLYRMVTVRMLESFSEA
AIEETLTAILSQREHRFGIKAKRRVKSGWKAIRGSMEEPVEEGETVVDSF
PEWQEGDRLDVFGFEMTEHQTKPKPLYTEAMLLSAMEHAGREVADEEARK
ALAGCGIGTPATRAAIIETLILREYIRREKKMLIPTEKGLSVYKLVAGRK
IADAEMTGAWEVALAAIEAGAMDERTFGKSIEVYTRQICEELLTTAAGNT
DAHYKTYRCPLCGNDSVRVFPKIVKCVTDGCGFKVFRELCGTLLSKEYIH
ALMTDGCTPLLCRLTGKSGKTFNARLKLDKDGGTSFVFDSKPRKPQT
>PG1338 umuC, umuC protein
MYALLDCNNFFVSCERVFDPSLRNRPVVVLSNNDGCIIARSNEAKALGIG
MGQPFFKVQDLIRRHNVAFFSSNFILYGDMSRRIMSLVSELVPRMSIYSI
DECFMDLRGVKDYMSLCHMIVDRVRQCTGIPVSIGVADTMTLSKVADKYA
KRYAGYKGVCAIDSEEKRRKALQTFDVSDVWGIGRRLSRKLYYYGINTAA
DFASMREGRVYRLAGTSGVRTWQELRGEACKEMKLPQPRQSVCTTRSFGH
PCRDFDSLLRHLAVFADSCCTKIREEKSRARRLSIFISCSRFDQENDYSG
RNEMLLPVATSDPSELIPKIRTLLQEIYRPNLPYKQAGVILSDLVAEAYQ
LNLFDPIDRMRQERFLSSVDAIRQRFGRDSLTVATQETEAIASVSSIKHR
SRRYTTDMNEVIDVVLPDQRKSGEKKSTINSR
>PG0237 ung, uracil-DNA glycosylase
MKEVRIEAGWKKVLQEEFDKFYFEKLTDFVREEYRQSPIYPPARFIFRAF
DTCPFDRVKVVILGQDPYHEPGQAEGLAFSVPTGIPIPPSLRNICEEIRT
DTGQPAHIDGGSLLPWVEQGVLLLNATLTVRASQAGSHQGHGWETFTDAA
IEALAKRREHLVFLLWGSYARRKSAMIDPRCHLILEAPHPSPLSAHRGFF
GCKHFSRTNAYLRQHGIAPIVW
>PG1036 uvrA-1, excinuclease ABC, A subunit
MHDTVINVKDKIARHDAVEVYGARVHNLKNIDVCIPRHSLTVITGMSGSG
KSSLAFDTLFAEGQRRYIETFSAYARNFLGGGMERPDVDEIKGLSPVISI
EQKTTNRNPRSTVGTVTEVYDFFRLLYARVAEAYSYAGGKKMVKYTEEQI
FHLILEKYDGRKIALLAPVVRSRKGHYKELFERLRKKGYLHVRVDGEIRE
ILHGMKLDRYKMHDVEVQIDKLCVASNQSKRIAESLALAMREGEGLVMVL
DSEKNEVGHFSRMMMCPDTGISYSDPAPHNFSFNSPHGYCPRCKGLGEVN
LPDMDKIIPSREKSIYEGGIEPLGKYKNNLFFWQIEALCEKYDVTIKTPL
RDLPEELIEDILYGTDELLTINNKALGQSRYALSFDGVAKYILMQAEESD
ASATAQKWADQFLKTTTCPDCAGKRLNKEALSYRLAGKDIAEVNAMDIKT
LIEWVDSLDEHLLDTQRAISIEILKEIRTRLGFLKDVGLEYLTMNRAAAS
LSGGESQRIRLATQIGSKLVEVLYILDEPSIGLHQRDNLRLIHSLQDLRD
IGNTVVVVEHDQDMMLHADYVIDLGPRAGRHGGEVVFAGSPEEMVQANTL
TADYISGRKRIEKSIGRRDGSGKTIKLFGAKGNNLQNIDVIFPLGVFICV
TGVSGSGKSTLINKTLFPAISQKLYRSLQDPMPYDRIEGLKHIDKIIAVD
QSPIGRTLRSNAATYTGLFTDIRALFVGLPESKARGYKPGRFSFNVKGGR
CEVCKGNGYKTIEMNFLPDVFAPCEGCRGKRYNRETLEVRYKGKSIADVL
DMTINKAVEFFEHAPHILSKLSVLQEVGLGYIKLGQPSSTLSGGECQRVK
LATELSKRDTGNTLYVLDEPTTGLHFEDVRVLLGILNRLIERGNTVIVIE
HNLDVIRCADYLIDIGPEGGAGGGQLLYQGKMEDIIECKNSYTAQFVKAE
LEKGRIDTVHMNSADNI
>PG2210 uvrA-2, excinuclease ABC, A subunit
MKQQESEHPTSGEAIIIKGARVNNLKNISLTIPRGKLVVVTGLSGSGKSS
LAFDTLYAEGQRRYVESLSAYARQFLGRMRKPECDLIAGVPPAIAIEQRV
VSRNPRSTVATSTEIYEYLRLLFARVGRTISPTSGEEVKKHTVADLVAYV
ADRPIGSKLYLLVGLQAPQGRSLREHLQIQQQQGYTRIFVAGEMKRIEDI
LSAETDFSDTASCFLLIDRLVIAEDKADYESRLADSAETAFFEGNGACLL
RIESPEGTVEERMFSNVFEADGRTFQEPSPEMFSFNNPIGACPTCEGFGK
VMGIDEDLVVPNKSLSVYEECVACWIGAKSQMWKDYFIQKSVPLGFPVHK
PYKELSDIERDMLWRGVPTGEPDYPNIGIDDYFSMLQRDMHKIQNRVRLA
HFRGKATCPDCRGMRLKPDALCVRIGGRNISELTALTVEETSAFFEGLQL
SEDDLHISRRLLEEIGKRLRFLLEVGLGYLTLDRLSNTLSGGESQRISLA
TQLGSSLVGSLYVLDEPSIGLHQRDTHRLIGVLKRLRDLGNTVVVVEHDE
ETIRSADYIIDIGPKAGRQGGEVVYAGEYDRIDKDTPGYTAAYLTGREKI
ELPRLRRPWNSYIEVREASKHNLKGVNVRFPLHVLTVVTGVSGSGKSTLV
RDLFYEGVKRILEGGSTQGLACEGIVGDIKSVRDIQYVDQNNFGRSTRSN
PVTYIGAYDDIRKLYSALPLSKQMGYQPYFFSFNKEGGRCEVCKGEGSIV
VEMQFMADIVLECEECHGKRFRKEILDVEYCGANIYDLLEMTVNQAVEFF
TDHPKASYTDKIVEKLECLREVGLGYLKLGQSSSTLSGGENQRVKLAAYL
GQAKPAPTLFIFDEPTTGLHIHDIRTLLHALSALIDKGHSVVVVEHNMEI
IKSADCIIDLGPEGGGAGGYLVATGTPEEVMRCDASYTGKWLKEILGNEQ
RG
>PG0380 uvrB, excinuclease ABC, B subunit
MDYKLTSRFKPTGDQPEAIRQLVQGINEGMPAQTLLGVTGSGKTFTVANV
VAAVNRPTLVLSHNKTLAAQLYGEFKAFFPENAVEYFVSYYDYYQPEAYL
PVTDTYIEKDMAINAEIEKLRLRATASLLSGRKDVLVVSSVSCLYGMANP
EAFSEKVISLHTGQRADRDHFIRLLVESYYTNNKVEFESGNFRVKGDSVD
IFPAVEGYDGVAYRVEFWDGEVERLSTFDPRTGREYGLLSELKIYPANLF
VTTKEQVDRAVGKIDVDLGAQVDFLKEIGKPYEAKRLYERVTYDLEMIRE
LGYCSGIENYSRYFDGRDAGERPFCLLDYFPEDFLLVIDESHVTIPQIRA
MYGGDRSRKENLVEYGFRLPAALDNRPLRFDEFEALTPRTLYISATPADY
ELNRSEGVIVEQLIRPTGLLDPIIDVKPTANQVDDLMEEIARCIEKKERV
LVTTLTKRMAEELSEYLLRHGISTGYIHSDVDTLERVRIMEDLRKGVYDA
LIGVNLLREGLDLPEVSLVAILDADKEGFLRSHRSLTQTAGRAARHIHGR
VIFYADKITDSMQLTMDETARRRAKQLAYNEAHGITPQQIVKNSAAIWGE
GDVSALQSDTESGAYIEESSMVAADPLADYLSKPKLEALIASTKKQMLAA
AKELDFLEAARLRDEAARLEKKLEQLTA
>PG1993 uvrC, excinuclease ABC, C subunit
MTPDELNIILPTLPEKPGCYQYFDEDGKVIYVGKAKNLRRRVSSYFYKEH
ADRKTRILVRQIRSIKYIVVDSEGDALLLENSLIKEYQPRYNVLLKDGKT
YPSIVIKREPFPRIFATRDIKKDGSEYFGPYPGALIAKGMLRLVKEIYPI
RTCKLDLREEKIRQGRYRVCLQYHIKKCKGPCIGNQTSNEYESNVSEIRD
LLRGNLHRLVRMYRDRMQVYSEGLRFEEAQICKERIELLERYEAKHTVVP
RNIDNVDVFSYDEDEHTAYINYMHIEHGGINRVYTLEYRKQIEESKEELL
AAAITELRQRFESNAHEIVLPFDTGWQTGESITTTIPRRGDKRKLLELSE
KNVAQYKLDKLKRAEKLNPEQRALHIVHGIQKDLHLDRPPKHIECFDNSN
IQGTSPVAACVVFKMGKPSKKDYRKFHVKTVEGPNDFASMREIISRHYTR
LTEENLPLPDLIVVDGGKGQLSAAYETLDKLGLIGKIPIIGLAERLEEIF
FPKDPVPLILDKKSETLKVIQHLRDEAHRFGIGFHRDVRSKKQIQSELDN
IKGIGKKTKEDLLRHFKSVKRIRSAEEEELSALIGRNKAKLLYEGLRKK
>PG1732 xerD, integrase/recombinase XerD
MIEDSLSKRYKSYLSLELHLTPNSVDAYLKDLSKLDSYLTAENISFREVS
YENLQHFVAELYDLGISARSIARIISGVKSFFRFLVLEEYIEADPTELLE
GPRIGVHLPTVLTIEEVDRLIGSIDPAAQGAQRNRAILEILYSCGLRVSE
LTSLKFSNLFLNESFLRIDGKGRKQRLVPMSETAITELKRYLSDPERPTP
VLGQEEYVFLSNRGKAISRIMVFVMIKKLAEEAGITKSISPHTFRHSFAT
HLLEGGANLQAIQLMLGHENIATTEIYTHIDRETLRHEIETYHPRNQSYR
RSGSEADQ
>PG1128 xseA, exodeoxyribonuclease VII, large subunit
MPSYTLTQLASAVRNCLESGFPGRYWVSAETSGVRVGISGHCYLELLDKD
TSGSQVTARMKAMIWASDYATLSGRFQRETGETFDSGLHVLVLVSVSYHE
QYGLSLRILDIDPSYTMGAMARKRKEIIEELRRQGLYDLNRSLSLPRPTQ
RIAIVSSGAAAGFEDFIAHLSHSAEHFCFYPVLFQAVMQGAQTEASVLGA
LERIAYHRDSFDVVVIIRGGGAVSELAAFDSLAIGQACARFPLPILTGIG
HDRDETVVDLVAYRSLKTPTAVADFLVNCQREEWKLIDDLRSRAAEGLRM
MMMYCHERLIQLSLRTPAILKSSVREEHHRIKSVEDRIRLAAKQRIAFGL
QQLQIASRSLPALMKSELKQNTGQLDQVAARLPLLVTANLKNYNRRLETN
EQAIRLLHPNATLRRGFAIVLKDDKAIRSHSELHKGDHLVAQFADGSVSA
VVDQPQKGKH
>PG0269 xth, exodeoxyribonuclease III
MKIISYNVNGLRAAMKKDLIGWLREENPDVLCLQETKMQNDQFEKEEFEA
LGYRSYLFSAQKKGYSGVAIITKHQPDHIEYGMGMEEYDAEGRFIRADFG
DLSIVSVYHPSGTSGDERQAFKMVWLEHFQSYVNELRKSRPNLILCGDYN
ICHEPIDIHDPIRNAKNSGFLPEEREWMSRFLADGYVDTFRHTHPELVLY
SWWSYRFQARSRNKGWRIDYCMVTNNLADRIKGADILNEAVHSDHCPIVL
EIAE