TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Xanthomonas axonopodis pv. citri str. 306, 306
Gene type: CDS

Number of genes found: 271

Free access
Sort by:

 



# Xanthomonas axonopodis pv. citri str. 306, 306

>XACb0050 ISxac2 transposase
MVRPSQRREMAQSAVTSGRTSIRHACQTFAVSETCFRYQAKASEENAHIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPVAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNRQG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIGCDNGPEYISGALLSWAQR
HGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTINRCRTKPPAGY
GRTTTSARIWRSAASRQR
>XACa0029 transposase
MLVGYMRVSSDSDRQSTNLQRDALLAAGVDARHLFEDRISGAKDDRAGLT
KALEFVRPGDVLVVWKLDRLGRSLSHLLAIVTSLKDKQVAFRSLTEGMDT
STASGELLFHVFGALAQYERALIQERVVAGLAAARRRGRIGGRPLAIVGE
KLEAIIAALNGGMSKAAVCRNFGVKRTTLIETLERVGWRGTGSTADGQQE
>XACb0008 Tn5044 transposase
MTTLHETAYPRLKPDPTAKELQDIYTPTAAELQCVRNIATGPATRLALLL
HLKLFQRLGYFTTLIEVPERIVQHVAQTLGMRRVPADRLAGYDASGAKRG
HLAQLRAFLNVCPLDAAGRDWLGTVAETAARTKHIVPDIVNVMLEELVHH
RFELPAFSTLERIAIAARERVHDAHYRQIADALSPTMRTLIDNLLLTPPG
SHHSDWHTLKREPKRPTNKEVRHYLRHIQRLRILAEQLPPIDVSVPKLKQ
FRAMARALDAAELAELVPIKRYALAAIFIRSQYRKTLDDAADLFIRLIQN
LENTAQQKLIAYQLEHSKRADALIGQLREILQAYQVEGTDTERVGAIAGV
LVADIALLTAECDEHMAYAGRNYLPFLLAPYGTLRPLLFNCLEIMGLRAA
SQDPSMERMIGAVLALRSQRRETIDAASLGVATTDLTWLSSAWRKHVMPK
ALAAASPGWIHRKYFELAVLAQIKDELKSGDLYIPHGERYDDYREQLVDE
ATLAQELDAYGEVSGVATDAADFVQGLRTELTTLADAVDARFSDNLHASM
LDGRLVLKRLQGAQVTQAIATVDSAITDRLPPTSIVDVLVDTTRWLDLHV
HFRPIAGTDARVDDLLRRVITTLFCYGCNLGPTQTARSVKGFSRRQISWL
NLKYVTDETLDKAIVQVINMYNKFELPGYWGSGKSASADGTKWNVYEQNL
LSEYHIRYGGYGGIGYYHVSDKYIALFSHFIPCGIHEAVYILDGMLANRS
DIQPDTVHGDTQAQSFPVFGLAHLLGINLMPRIRNIKDLVFSRPEPGRTY
ENIQALFGDSIDWTLIETHVHDMLRVAISIKLGKITASTILRRLGTYSRK
NKLYWAFRELGKAVRTLFLLRYIDDVEVRKTIHAATNKSEEFNGFVKWAF
FGGEGIIAENVQHEQRKIVRYNQLVANLVILHNVEQMTRVLAELRDEGSN
ISPEVLAGLSPYRTSHINRFGDYTLDLKRQVEPIDFSRRILAATTR
>XACb0062 ISxac3 transposase
MGYGRKPHFHGGTPCKAAANLLDRQFDVTEPDTAWASDFTFIRTHEGWMY
LAVVIDLFSRQVVGWAMRDRADTELVVQALLSAVWRRKPSAGCLVHSDQG
SVYTSDDWRSFPGVPWLGVQHESAWQLP
>XACb0009 integrase-like protein
MISVAQYLEAATRPNTERAYAAATRHFEIEWGGHLPTTAEQVARYLAAYA
GQLALNTLRHRLAALAQWHQAHGFADPTRAPVVRKVLKGIQTVHPSQEKR
ATPLQLTQLGQVVDWLDGAATAAGTRDDAAGRLRHMRDRAFVLLGFWRGF
RGDELVRLQVQDLELVAGEGMTCFLPHSKSDRQHAGTTYKVPALSRWCPV
AATMAWVAAAALHEGPLFRAVNQWGGIAAAPLHPNSLVPLLRRIFREAGL
SSPNDYSGHSLRRGFAGWANANGWDVKALMEYVGWRDVHSAMRYLDGADP
FARQRIEASVSPATPPLLALAAPAPDPVPTTAVEATVTLTRFNSRVRGLA
KAHRLIEQICLQPHQAQRLNADGTRYRLAIAAVDEAAFEETIAMLLDEMH
RIADNHQCFLAVALRDEAGGRHWD
>XACb0013 ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>XACb0014 ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>XACa0009 ISxac3 transposase
MKNFCADSELAHFWLFFTQIPASPLVCAMSAVGHCGDNAACEGLFGMLKR
KRIYRTKYPTLDAARTDVFE
>XACb0066 ISxac3 transposase
MASGSVYGHRKIAKDLRDLGERCSRHRVHRLMRTEGLRAQVRGGRDLCKK
QPKVG
>XACa0005 ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>XACa0004 ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>XACb0072 resolvase
MLPEPRIELAKARDKYRGRPADQAIHARIVALRGRGETIANTARLAGTSV
TTVKRVWAAHQAQQES
>XACb0051 ISxac2 transposase
MKKSRFTDSQIIAVLKQAEAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>gid:108818  Ada  DNA methylation and regulatory protein
MQTATPDRTQCDRARLARDARFDGLFFTAVRSTGIYCRPVCPAPPPNPSN
ISYYPTAAAASAAGYRPCLRCRPELSPQAQQHLGEESVQRALAMIADGAL
QEQPVQTLADAVGLSARQLQRQFVQQLGATPIQVHGTHRLLLAKQLLTET
ALPVTEVALAAGFNSLRRFNAAFLQGCGMPPSALRKQRAEVPGGELTLRL
GYRPPLDLPAMLAFLQRRAIPGIEQVDANGYRRVIGAPGNASVIEVSAAP
QRAELLLRIGATDPRQIPQIVRRVRRLFDLDADLQAVHATLAPEPLLAEA
IARRPGLRVPGGWDGFEVAVRAVLGQQISVAGAATLAARLVDRHGGHHAD
MPPGLDRSFPMPEQLADAPLEQLGLPRARAATLRALASACAQGQLHFGAG
QRLPDFIAACTALPGIGPWTAHYVAMRALSHPDAFPAGDLILQQVLGAPG
RLSERATDARSQAWRPWRAYAVLHLWHLANDRKDTRA
>gid:106013  XAC0017  conserved hypothetical protein
MQKNSFPSASAAAERDVRSSRTVIKGRGTTAYVPGRFSVTTAVEIDDGWG
ALDADGKTAMRPKTVVAEEIARSIISRNNSPDIGFSQSVNPYRGCEHGCV
YCFARPSHAYLDLSPGLDFETRIFAKPNAAALLRAEIARPSYQCSPIALG
INTDAYQPAERRLRISRQCLEVLAEARHPVSFVTKAALIERDIDVLAEMA
QHNLVSVYFSVTTLDNRLAAKMEPRAAAPHARLRAMQALHAAGVPVGVLV
APVIPMINDKELERILDAAAAHGARSAGYVLLRLPHELTQLWREWLELHY
PDRAKHVMSLVQQMRGGKDYDSRFGKRMVGEGPFAELIAQRFSIAYRRLG
FGRLPRLDTTHFRPPRPDTPQLNLF
>gid:106031  XAC0035  conserved hypothetical protein
MASAAAVAQAKATARAAGLTYVNDQQPGISRRKAGKSFSYRDADGQRIAD
AETLQRIRSLAIPPAYTEVWICAKPNGHLQATGRDARRRKQYRYHADWAL
VRGEGKFERVIAFGTALPKLRRRLRRDLLLPGFPREKVLAIVVALLADTL
VRVGNAEYARSNRSYGLTTLRNRHMEFLKGGRARLKFRGKSGQDHDIEVD
DKQLVKLIRQCQQLPGQSLFQYRDDDGQLQPVDSGEVNDYLREAMGEDFT
AKDFRTWGGTLAALQRLARLPLPERSSERALTQVQNDVIREVADALGNTP
SVCRKAYIDPCVFEGWRAGQLQAMATGVRGERQWEAATLRFLSASRSKLK
AQAKPASKRKSA
>gid:106073  XAC0077  conserved hypothetical protein
MRHKPALPSTVDVDVDVDVDVDVDAIIAVDADGASHWPTARGTTTASPAS
MSPAHPAQRRTDIAGLRKMIGLRERAVSAHAPVRAPSTDRSLPGNEIAPG
LHLIQAFLPQAIPTEALSLAFAKRDDAVDPMDLLFFDTETTGLAGGTGTR
AFMIGVADWVVDAVQGSGLRVRQLMMSTMAAESAMLDLFRTWLTPRTVLS
SYNGRCYDAPLLKTRYRLARRGDPLSALDHVDLLFPTRRRYRGTWENCKL
ATIERQLLRVVREDDLPGSEAPAAWLSYLRGGSARNLRRVAEHNHQDVVT
LSLLMQRLVAVDAQERNALTLACRP
>gid:106086  XAC0090  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:106087  XAC0091  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:106143  XAC0147  conserved hypothetical protein
MAILSEAALDFAKAHIEKYYDSDFFPKAFEFEALWHQWPQVKQELRSKNI
SKMLVPNPYNSTIQKARGGYRVVHQLEPMEAIAYTAMAYEVGGSIEAMRA
PATHRIACSYRINIANGNFFVGGSGWGDFKAKSQELANENKFAAITDISD
FYNQIYLHRLQNAIEASGPAHKSLASDIEDFLLALNNKASQGVPVGPAAS
IVMAEAVLIDIDSFISQCGVLHTRYVDDIRIFSNSAAKLAETLEKLTLYL
YENHRLTLATDKTKLMPSEDFIENHLHNPYLDDKDELMEELDNISPYGGE
DEDEDEGLDDDDIGETLAGALYKILEFDYIDLGLARSIIRTARRNKIPNI
ITILLENFEYFAPVTPDIFLYLEELFDASTAAIIRAKLEKVCDMQEANCA
LVRIWLEHFIASHSSLLANQKLRIFVMTGPNIENKARAAITLGDIAWVRT
HKGQIYNVGARARRSIINSARVLPSDERNPWLTLVAQNAALPLERWLAGW
VKETA
>gid:106148  XAC0152  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:106149  XAC0153  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRIYPTKDAARAEVFDYIEMFYNPQR
RHGSTGDLSPVEFERRYAQRGS
>gid:106338  XAC0342  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRIYPTKDAARAEVFDYIEMFYNPQR
RHGSTGDLSPVEFERRYAQRGS
>gid:106339  XAC0343  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:106536  XAC0540  ribonuclease
MALTYPTSGTRFSLSTNTALVGGERCTLAITASAIRDASGLSPAANQSIA
FTVATASGGGTGYYARVNTTNASQLRCSLHATIKGHTVYPYSGSGTSTWT
ILEMADEDPNNSGRILDAYRNRSYAKVSDRAGTGSGLTYNREHTWPNSLG
FGSATGDRGLPYAPYTDTHMLYLTDTSFNADRGNKPYATCTSSCGERVTE
VNDGSGGGSGRYPGNSNWVRTPDGNSGTFEVWGRRKGDMARAVMYMAIRY
EGGLDATTGQSEPDLELTDDRSKIVQTAASPAYMGLLSTLLAWHQADPPD
DAERARNEVIFSFQGNRNPFVDHPEWATASLFNSARPASCQLLN
>gid:106574  XAC0578  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRIYPTKDAARAEVFDYIEMFYNPQR
RHGSTGDLSPVEFERRYAQRGS
>gid:106575  XAC0579  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:106602  XAC0606  endonuclease
MPEGPSLVILREDTQAFVGRKIVRVSGNSKQDIARLDQQKVLALRSWGKH
FLIEFAHFSVRIHFLLFGSYRINEDKPNAVPRLCLEFSKGQRLNFYACSV
QFIERPLDEIYDWTADVMNPLWDAAQARRKLRAAPALLAADALLDQTIFA
GVGNIIKNEVLHRIRVHPESTVGALPARKLGELVTQARDYSFDFYTWKKA
FVLKKHYQVHTKTRCPRDGAPLQYRKHLGKAGRRAFFCEVCQRLYRADEA
>gid:106672  XAC0676  conserved hypothetical protein
MDEFDQRSWWTPVPDDAAWALPDDLRAADPLGGRDCGWVNQIRPFVRHFS
RPGEQVFDPFCGFGSSLLAAALEGRNAHGMEIDPARAQLARARLQRHAVE
APVVVGSLAVNAPAGPIDLCLTNVPYFGCHWRGDVLPGQLYASSDYASYL
TGMRAVFHALRKQMRPDGVGVAMVQNVVVGGRVIPQAWDLGRILASLFTL
REERVLCYRRPAAALAHAETQSNRSHEYALIFQHCRARLDLQQAAQLLQA
VQAQGLPVVVHGSYARWLASPTLVPDGPADLDLMLHGEQALWDRLTLWLQ
QQGFALSLWGEPCRAPVALAVVRAHRYLRAERIGADGRRLQIDLQLPVDE
PPLP
>gid:106760  XAC0764  conserved hypothetical protein
MPAARQQRGAAVEAAARALLEQAGLRLVVGNANYRGGELDLVMHDGPSLV
FVEVRYRRDDRFGGGAASVDWRKRRKLVLAAQLFLGAHPALAALPCRFDV
VDASGEPPVLHWIRDAFRADDC
>gid:106907  XAC0911  MutT-like protein
MREPALRPAHPAPSAGRQHRRPRTVSSRFPGTRSPPMNRHDTPPTVVYEG
KYQRMVVRGTWEYSERVHAGGLAAIIVAVTPDDAMLFVEQFRVPLQARTI
EMPAGLVGDVHADESIELSAIRELEEETGWTAEHAEVLMIGPTSAGASSE
KIAFVRATGLRKVGDGGGDASEDITVHAIPRAQVGAWLVQKMAEGYQMDP
KLWAGLYLVDHALDGTPRG
>gid:107048  XAC1052  ISxac3 transposase
MCAMCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGH
RKIAKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAA
NLLDRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRD
RADTELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVC
SMSRRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCS
TTSRCSTTHNAVMVQLATCPL
>gid:107049  XAC1053  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:107066  XAC1070  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:107067  XAC1071  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRIYPTKDAARAEVFDYIEMFYNPQR
RHGSTGDLSPVEFERRYAQRGS
>gid:107091  XAC1095  conserved hypothetical protein
MVSFDATEALAPYREGRGYGAILFDRERLRQADASLFSPQSWGDRARPVD
AGGRGGAWFVDAPFGHSVLRQYLRGGMAARVSRDRYLWKGAGRTRSFAEF
RLMRELIKRKLPVPRPLAACYLREGLGYRAALLMERLENVRSLAEHAQVA
GRGAPWEATGQLIARFHRAGLDHADLNAHNILFDAGGHGWLIDFDRGVLR
IPATRWRERNLKRLHRSLLKLRGNRSREDVDKDYARLHRAYELAWGRGY
>gid:107098  XAC1102  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:107099  XAC1103  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:107103  XAC1107  integrase
MPVDLQPIVGRQTFKRTLHTYDMREARLRALTLAARYAQVFIVLRERRMQ
SRDDDLDALLARLTGTERPQELTLNRTRSADGSITEQWQIDSPEDVTLYQ
QLMALTAPQPSALGELIQQHVPPLPTPRRTTKPAIETITLGKARDAWLTS
LKGSTLPKTWTIKRTAIELLTSFLGEKTKLHTVTRSDLARWYQDMRDKGA
STPTLTNKQSYIGGKGGFFEWAQASGHYHRGDNPASGHVSYSTREKRARR
KFGFKAYDAHQVQALFAPAAFEGLPLAARWASLIGLYTGARASEVGQLLT
ADVVEEGGLPCIQISDEGEHQKVKTDVSLRTVPVHPDLLALGFLSWVEQA
RAEGQERLFPAAKADAKNGQGNWISKAFSRHLAEVGKNWPTAKRGFHSLR
KTLIQELQGAGVVSELRAQLVGHELDDEHHVTYSRAFTAQEKLDGLRGVS
PGLSVLAYGLSLDAIRPLLSEAPGNRKASGARAPR
>gid:107153  XAC1157  conserved hypothetical protein
MANIDSPGHRALRRGRRSSPNSVYLLTTTTWQRRAVFADFLLAATACRAF
TAATPCDARLLAWVLMPDHAHWLLQLGSVTSLADAVSRMKACAARSVNDQ
RQHQAPVWSRSYHDHALRKDDDLHAAARYLIANPLRAGLVTHIGDYAFWD
AIWL
>gid:107177  XAC1181  conserved hypothetical protein
MDLFNAPIAPLQVLHDQQGGVRYWPHLLPPALAQEAFDALRDDADWRSQR
REMYDRVVDVPRLLASYRLDDALPAGLPLQRLLDAVQAVLPAPYNAVGLN
LYRDGRDSVAMHHDALHTLVAPHPIALLSLGTPRRMQLRAKDGSTRAVAL
ELAPGSLLAMSHASQHTHVHGIAKSTRAVGERISVVFRVRPAQRMAAGQH
GPHWEPVQS
>gid:107193  XAC1197  conserved hypothetical protein
MALPLDALLAARTVWRAGQGTATANGGESTGHVALDALLPDGGWPRRALT
ELLLPAHGIGEIALLLPTLARMTASSSRVVLVAPPFIPYAPAWQAGGVVL
EHLEVVQAEPREALWAFEQCLRSGACAAVLGWPQTGDARALRRLQVAADS
GNCCAFALRDRRHAVNASPAALRLEFLPERDAWQVRKCRGGQVPSQPLRL
AH
>gid:107194  XAC1198  conserved hypothetical protein
MLWACIVLPQLALDAVLRRLEAPQAPLVLVEGPAQLRSLHSVNAAAGAAG
LKAGMRLSAAHALMTQVRTIDYDAQNEARTQRFLAAWAYRHSSLVSQQWS
RAIVLEAGASFRLFGPWPRFERRLRDELQALGFQHRLALAPTPRAARVLA
GLRDGMVVTQLPALHSTLDKVPVRRAALPDDAGERLQHMGVRTLAAVRGL
PRDGLRRRFGAGLLDHLDRLYGQVDDPLECYAPPDHFDHRVELGYEVETH
PALLFPLRRLIGDLCTYLSIRDGGVQRFLLRLEHEEGATDVEIGLLTPER
APTLLFELARNRLERVEIPRPVVAMRLLARQLPPFVPAMRDLFDQRAQQS
VEWPQLRERLRARLGDDAVYRVLPADDPRPERAWRRAIGDAVREVAAPPR
PPRPTWLLPQPVPLHDPHLRIVSGPERLESGWWDDDEARRDYYVVETARG
RRGWAFAAPGRVDGWMLHGWFA
>gid:107228  XAC1232  DNA-3-methyladenine glycosylase
MEAAFAHVSRRDRALGAWMKRIGPIAPQPGWHKPFDPVDALARAILFQQL
SGKAASTIVARVEVAIGAQRLHADTLGRIDDAALRACGVSGNKALALRDL
ARRESEGEIPSLRRLAFMDEEAIVQALVPVRGIGRWTVEMMLMFRLGRPD
LLPIDDLGVRKGAQRVDKQEQMPTPKELAVRGERWGPYRTYAAFYLWKIA
DFSVATKVPTPRSQE
>gid:107465  XAC1469  conserved hypothetical protein
MTYPRANRLRGLVARMPLQHLLLETDAPDQPDAGIRGQRNEPAYLRTVLD
CIAQLRGQDPAHIAAQTSANARNLFGLPH
>gid:107500  XAC1504  ISxcd1 transposase
MKTSRFTDSQIIAVLKQAEAGASVPELCREHGISSATFYKWRSKFGGMDA
SLMSQLKELQDENRRLKKMYAESQMTAEVLREAMAKKW
>gid:107501  XAC1505  ISxcd1 transposase
MVRPSQRREMARWAVANKAMSIRHACQAFEVSQTCYRYQAKASEENVQIA
DWLVRLTTTYRDWGFGLCFLHLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLSVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLDQIIDWRGKPNAIRCDNGPEYISGALLAWAQQ
RGIRIEHIQPGKPQQNAYVERYNRTVRYAWLATTLFDTIEQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:107582  XAC1586  MutT-nudix family protein
MLPARALTARLPEYAQLRRALYPLDTAPAGPGWNHAELIDLTDGDASAEA
AVLCGLVPREQGTTVLLTRRTDSLRHHAGQVSFPGGRMEPSDADAAAAAL
RESCEEIALGAQQVHAIGYLDPFLTVSGFRVTPVVAVIAPGFVAVPQPDE
VADVFEVPLGYLMDPNNLRSVELEFRGRPRRVLEYDWPAHRIWGATAAIL
LNLRRRLEQVA
>gid:107610  XAC1614  hypothetical protein
MGMSEDAAAGGQGGNLELIQTLEQMLERDQTGKLLQPLSGMAVARPIYQV
PAIALAEPLKTLVATIFRSAGVYDAVPESRHGENVHGSMRDVHQNNDNGR
RYLPVLDYPVAPTGRAGRYHDYIVQRSKADVQNRQRQRSGISGVLTGYVE
YTVVLEGCSNVRLIFDYLNTRFFISPNHYKPWARPEALVARRNLGADPAR
YRLAPHEAGMDEQSWRDDELYGPHLLIML
>gid:107656  XAC1660  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:107657  XAC1661  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRIYPTKDAARAEVFDYIEMFYNPQR
RHGSTGDLSPVEFERRYAQRGS
>gid:107839  XAC1843  conserved hypothetical protein
MDATLQQQLAVYRLRWPEEAALTEQFAQLLDDATDPFVRERVEGHFTGSA
WVVSADGTRTLLTHHRKLQRWLQLGGHADGDRDLAQVALREAEEESGLGG
LRLADGQLFDLDRHWIPARGDVAGHWHFDARYVVVAGADEAFEVSEESLA
LAWRPIAELLAEPELDPSLRRMAEKWLESRD
>gid:107844  XAC1848  conserved hypothetical protein
MRRVAVVVAVGVLAACSVQQAPTQAATPAPVACTDPKVEEEWLQHPAGLC
GMPEEVRKLVEDYDTCEHFAGEEPYDADRRHEIEVAVAQFCTPAPARLAK
LMQQYRNDAHVSQWLRQYAKQADLQPAG
>gid:107867  XAC1871  transposase
MRDMHIFRALSEVREQTEHWLADYNQQIPHDSLGGLTPAEFRDQHQPQTS
SFGWH
>gid:107868  XAC1872  transposase
MEAADVQRLRDLETEHSKLKRVYAELAMENHALKDVIVKKAVDPTHKRPL
LAWLVEQHGWSERRACSVVGLARSTARYRCRPDRDEEVIALLSELAERFP
SVVLESSFRSSAVVDMYGITKGSGVCALAIEVDLNLPAARVIRTLERSAA
WRGYPNKLRLDNGPEFVALDLTEWAERFHRTLQRQLQARRAGHAYIPRAE
RGPRTDRTLACRLQPTDPTRQPGRTNAR
>gid:107908  XAC1912  serine/threonine kinase
MSAQLNLDLVAEQLCQQCGFEFKGLLGKGAFKSAYLSVHKSYPFALKIAA
IAGDANRLVREADALKECSHASVAKLLQAFSYAHGTHHLWVVYEEYLGGG
TLEARLKLGALPPAVVRALGVRLAEVLEHLESKRLVHRDIKPANIMFRDD
RDPHPVLTDFGIVRMLDQPTLTHAFMQMGPGTPAYAAPEQLTNDKALIDW
RTDQFGVAIVLAECLLGHHPFLEPGKTINDAIISVAAKHEMPATNAQRLT
AMGFGCITRALKPWPVSRYRRPSDFIKSLTQA
>gid:107916  XAC1920  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:107917  XAC1921  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:107920  XAC1924  transposase
MQHPIHGRRGRARPWVGSADGRPCDRPGAAPNQNAFIERFNRTFREEVLD
LNLFACLDEVREAAHWWMIDYNQARSHDSLGGMTPVEYRNKYAESSTFEL
PA
>gid:108006  XAC2010  ATPase
MRPRTLDEMVGQKRLLAPDSALRRAVESGRVHSMILWGPPGCGKTTLALL
LAHYADAEFKAISAVLSGLPDVRQVLAEAAQRFAGGRRTVLFVDEVHRFN
KAQQDAFLPHIERGTILFVGATTENPSFELNSALLSRCRVHVLEGVSPQD
IVEALQRALRDAERGLGAETIQVSEASLLEIASAADGDVRRALTLLEIAA
ELAAGEGGEITPQTLLQVLADRTRRFDKNGEQFYDQISALHKSVRSSNPD
AALYWLTRMLDGGCDPAYLARRLTRMAIEDIGLADPRAQSMALEAWDIYE
RLGSPEGELAFAQLVLYLASTAKSNAGYAAFNQAKAEVRATGTQEVPLHL
RNAPTKLMKTLGYGQDYQYDHHAEGGIALDQTGFPDAMGERVYYNPVPRG
MEIKLKEKLDRLREARAQARADKAGKPTA
>gid:108079  XAC2083  conserved hypothetical protein
MHAAADRAVNGAARPLAQRSPRFHGVTAMVRSIFSGASPMRKGIALIVGV
TGISGYNLANVLLADGWTVYGLARRPLPHDGVIPVAADLLDAESTNNALR
GLPITHVFFCTWTRRDTERENVQANGAMMRHLCDALSDAPLQHMALVTGT
KHYLGAFENYGSGKAETPFRESEPRQPGENFYYTLEDLLFAHAEQHGFGW
SVHRSHTMVGMANGSNAMNMGVTLAVYASLCKHTGQPFVFPGSQAQWNSL
TDLTDAGLLGRQLAWAGLSPAARNQAFNTVNGDVFRWRWMWGEIAKFFEL
DAAPCPAVPEPLEARMSQTAPALWAEVAAQHTLVESDVNRLASWWHTDAD
LGREIECVNDMTKSRELGFLDFYDSRASFFELFTRLRALRIIP
>gid:108095  XAC2099  ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:108096  XAC2100  ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>gid:108097  XAC2101  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSV
>gid:108098  XAC2102  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:108099  XAC2103  DNA recombinase
MGDFVPSTEVRKRTVGDAIDRYLEVTLPAKRQKDAAKQVQMLAWWKAEIG
DVPLVGLTPAKIAAIRWLTKNPVPNVTRMQESKGRERFLSEPERLALLAA
CDASDCAPLAPLVRLALATGARRGELLGLQWEHVDLDRRTARFIDTKNGE
NRTVPLATGVTQMLQAMARTTGPIFPITGPMLDKPWRAACVSAGLGD
>gid:108127  XAC2131  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:108128  XAC2132  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:108170  XAC2174  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:108171  XAC2175  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:108175  XAC2179  RadC family protein
MSQLSFSSFDSLLHVRDTQGRYRLASVEQILEAAREAIDQKMQRGTEFSS
PAVVKEYLRAKLAGFEHEVFAVLFLDTQHRLIEYTEMFRGTIDSASVHPR
EVVKEALRTNAAAVILAHNHPSGHPEPSTADRALTRQLKAALELVDVRTL
DHIIVAGGANTSFAERGLL
>gid:108209  XAC2213  cytosine-specific DNA methyltransferase
MNPSPPTSNDIAYGSVCSGIEAVSLAWEPLGLKPAWFSETDAFASAVLAH
RYPHVPNLGDMTRLAQRIRDRSVPAPDILVGGTPCQSFSVAGARQGLADP
RGALTLAYVEIANAIDQVRNRAHRPPATLVWENVPGVLSDRGNAFGCLLG
ALAGEDHALEPPGKRWTHAGCVSGPRRRIAWRVLDAQYFGVAQRRKRVFL
VASGRGGVDPAEILFEREGLPRDPSPGGAPWQGAAHATGAGAPAAGRSSG
LMSPYGKVSISVGFGNAQGPADVAACLLGAPPRFDLSTETMLVQSVAGAI
SHTLDTANGGKGSGEDGTGKGVPIIAFTAQGSGADADVGRSPTLRADAHR
ASHANAGVVPAIAFAQNSRSELRWESGHGQISGTLSTGGGTPGQGRPMVL
QAAAYEDHYVCAPEGADTAWGMQWRVRRLMPRECERLQGMPDDHTLVPYR
GKPAADGPRYRAIGNSMAVPCIAWIGERLRRSMVGKS
>gid:108220  XAC2224  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:108221  XAC2225  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:108277  XAC2281  RadC family protein
MEQPTSDVLASTSTPAFSVQENRLIYRAIEVLERKLFQREAYIPSPEALF
DYLRLKLAREPNEIFGAVFLDNKHRVIAFEALALGTINQAIVHPRVVVRR
AMELNAAAVILAHNHPSGETQDSTADRMLTERLRSALDFVDIRVLDHVIV
GKGTPYSFAQAGLL
>gid:108367  XAC2371  IS1479 transposase
MRFLIEVHARPLRRSCACVGLSRAAWYAPPLDWTVRDAELIAEIARYVEA
HPSRGFWKCSDYLRKQQLGWNPKRIYRVYKAMKLNLRRAAKRRLPKVCLR
LPRPSASPNVSCMDHSST
>gid:108368  XAC2372  IS1479 transposase
MRTSKFTETQIIATLKQADAGVPVKDICRQVGISTATYYQWKSTYGGLEA
SELRRVKELESENAKLKRMYAELALDNAAMKDLIAKKL
>gid:108419  XAC2423  IS1478 transposase
MEELLAHTINTAHAVKAVDARELSRVIVDTTVQEKAIAHPTDNRLLEVAR
KKLVLLAKRHGIALRQTYARQGPALSRKSGRYAHARQFKRMRKVLRRQRT
ILGRVLRDLQRKLAQRESSVRERIGVWLERAHRLLTQRPKDNQKLYALHA
PEVECISKGKARNPYEFGVKVGIAVSARKGLIVGARSFPGNPYDGDTLAE
QLEQARGLLQDVNVIPQMAIVDLGYRGREVEGVQILHRGKARTLTRRQWR
WIKRRQAVEPVIGHLKQDCRLNRCHLNGAQGDALHVLGCAAGYNLRWLLR
WIAFLRAWLQVVRARSSTSSSPPWLANMAFGA
>gid:108420  XAC2424  ISxcd1 transposase
MVRPSQRREMARWAVANKAMSIRHACQAFEVSQTCYRYQAKASEENVQIA
DWLVRLTTTYRDWGFGLCFLHLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLSVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLDQIIDWRGKPNAIRCDNGPEYISGALLAWAQQ
RGIRIEHIQPGKPQQNAYVERYNRTVRYAWLATTLFDTIEQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:108421  XAC2426  ISxcd1 transposase
MKTSRFTDSQIIAVLKQAEAGASVPELCREHGISSATFYKWRSKFGGMDA
SLMSQLKELQDENRRLKKMYAESQMTAEVLREAMAKKW
>gid:108426  XAC2430  Tn5044 transposase
MADVFRYVNGQCQFLSALTPLQPRYAKKVADADSLMAVIIAQAMNHGNLV
MARTSDIPYHVLESTYQQYLRQASLHAANDCISNAIAVLPIFPHYSFDLG
ALYGAVDGQKFGVERPTVKARYSRKYFGRGKGVVAYTLLCNHVPLNGYLI
GAHDYEAHHVFDIWYRNTSDIVPTAITGDMHSVNKANFAILHWFGLRFEP
RFTDLDAQLQDLYCADEPALYETCLIRPVGQIDRQLIVSEKSNIDRIVAT
LGLKEMTQGTLIRKLCTYTAPNPTRRAIFEFDKLVRSIYTLRYLRDPQRE
RSVHRSQNRIESYHQLRSTIAQVGGKKELTGHTDIEIEISNQCARLMANA
IIYYNSAILSRLLTKYEASGNAKAVLLITQMSPAAWRHILLNGHYTFQSN
GNLIDLDALVAGLELG
>gid:108428  XAC2432  transposase
MMGFIDAQREHFGVESICKALQVAPSGYWRRAARRSNPALLPARAKGDAE
LVPQIERVWTSNLQVYGYRKVWRQMLREGTAVARCTVERLMRLKGLQGAR
RGKKILTTVPDAKAPCPLDRVNRQFKADKPNQLWVSDFTYVSTWQGMVYV
AFVVDVYARFIVGWRVSRSMRTDFVLDALEQALYARKPERDGALIHHSDR
GLQYVSIRYSERLAEAGIEPSVGSKGDSYDNALAETINGLYKTELIHRRA
PWKSKEAVELATLEWVSWFNHHRLMESLGYIPPAEAEANYHWQLAGQAMS
V
>gid:108445  XAC2449  3-methyladenine DNA glycosylase
MPAKPLPRTFYAHDARQVAPRLLNKVLVSADGRCGRITEVEAYCGSDDPA
AHSFRGMTPRTRVMFGAPGHLYVYFIYGMHWAINVVCGGAPGHAVLIRAL
EPLDGIDRMQAARGAAPFTALTTGPGRLAQAFGVTAADNGLDLSTAAARL
WIEDDGAPPPSNPVATPRIGIRKAVDAPWRWVVADSRYLSRPLPRVTGKG
TVLAGD
>gid:108504  XAC2508  transposase
MRGVDKQTEHWLADYNQQIPHDSVGGLTPAEFRDQHQPQTSSFGWH
>gid:108523  XAC2527  conserved hypothetical protein
MSRPQRPAPRGAEGQVRIVGGRWRNTRLAVPSLPGLRPSSDRVRETVFNW
LMPRLPGARVLDLFAGSGALGLEAVSRGAAHATLIERDPGLVQRLREHVT
RLDAATQVQVLQEDAVRWLERAPAALADIVFVDPPFAAGLWPAVLERLPA
HVAADAWLYLEAPADAPPLLPAGWHLHREGATREVRYALYRRAAATLKSD
PTPVVSV
>gid:108554  XAC2558  excinuclease ABC subunit C homolog
MARSERVRRALGAVEYAYPDHLRSTLATLPTTAGVYLFHGKDSGLPLYIG
KSINLRMRVMDHFRTPREASLLRQTRRISVFEMAGDLGAQLLESQLIKSM
RPLYNQKLRRVPRQFSIRLYCGEVSIVHSAERDPAVTPWLYGLYSSPRAA
KEALRRLADRDRLCYGLLGLERATDGRPCFRAMLERCAGACHGAESLGDH
ETRLRAALQHLEQVVWPFPGAVALMEEGKQLRQFHVLRDWHYLGSATSLA
TARKLQSTPGDFDRDCYRILRKGLQTHLHNVVLL
>gid:108560  XAC2564  conserved hypothetical protein
MIVPGGSYAPDAGGAVAAIHQLPTLRLLSLDAHGRVLDWINWQDAACLYA
RDAVSWTLGEPCMQIHGGISRLTGERSTLELHPIIAARGHARSRALDPTP
TLSNTALFARDSQLCMYCGQHFSRPHLTRDHVMPVSKGGRDSWENVVTAC
FQCNSRKANRTPQQAHMPLLAVPYRPSWIEHLILSNRNILSDQMAFLRAQ
LPKRSKLSL
>gid:108599  XAC2603  ISxac4 transposase
MRTSKFTETQIIATLKQADAGVPVKDICRQVSISTATYYQWKSKYGGLEA
SELRRVKELESENAKLKRVYAELALDNAAMKDLIAKKL
>gid:108600  XAC2604  ISxac4 transposase
MRFLIEVHARPLRRSCACVGLSRAAWYAPPLDWTVRDAELIAEIARYVEA
HPSRGFWKCSDYLRKQQPGWNPKRIYRVYKAMKLNLRRAAKRRLPKRERV
PLYVPRLPDTVWSVDFMSDALACGRRFRTFNVVDDFNREALHIEVDTSIN
SQRLVRVRPDQTRSWLAAGGALGQRPRVPGRSVHQLAQDQWRRTAVHPAG
KPNQNAFIERFNRTFREEVLDLNLFACLDEVREAAHWWMIDYNQARSHDS
LGGMTPVEYRNKYAESSTFEVPA
>gid:108629  XAC2633  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:108630  XAC2634  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:108635  XAC2639  site-specific DNA-methyltransferase
MKNQLLQGDALTILPTLEANSFDALITDPPYASGGLTAAARARPPSTKYC
RDGGHADFVGDERDQRSHLKWMHLWLSECARVLKDGAPVLLFTDWRQLTL
TTDALQIAGFTWRGITVWDKTEGVRPQLGRFRNQAEYIVWGSKGNMPLDR
RAPVLPGVIRESVRKADKHHLTGKPTELMRQLVRICEAGGRVLDPFAGSG
TTLVAAQLEGFEAVGIEMTDQYATVTRDRLTAL
>gid:108657  XAC2661  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRNPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:108658  XAC2662  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:108727  XAC2731  conserved hypothetical protein
MIPSATGNPQVRIRSIDVLSDNWYVLRKVTFDFQRKDGRWQSLSREAYDR
GNGATILLYSRARQTVMLTRQFRLPTLLNGNPDGMLIEACAGLLDQDDAL
TCIRKETEEETGYRIDNVRKVFEAFMSPGSVTERLYFFVGEYFDADKVGD
GGGLEEDGEEIEVLELSLDAALAMIGTGEIADAKTIMLLQYAKLHGVLES
IHANAR
>gid:108745  XAC2749  methylated-DNA-protein-cysteine S-methyltransferase related protein
MSKSRHLASRPSTSPAVRAGSKGESLSGEQARLRIVQIIRTIPAGEVAGY
GEVARRAGLPGRARLVARVLSGNDDPKLPWHRVLRSDGRIALPEGSAGYR
EQCQRLRAEGVQVERGRVRRASAAQRLDAAVWGPS
>gid:108794  XAC2798  conserved hypothetical protein
MTKAKKKSAQQALQSPAPLYQLHVALTGSAPPIWRRLLVSGAVRLATLHR
VLQPVMGWNGAHPYEFDFGGGRYGESGLDVPERPRLKHAGRVTLESAVGE
LNGFDYFYGAGPGWQHRLQVEALLPPDASLRVARCVDGAHACPPDSSGGI
ADYRVLVQIIADPDHPQHVQELATLGGRFEPGHFDLAEVNRLLARVRE
>gid:108885  XAC2889  ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:108886  XAC2890  ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>gid:108897  XAC2901  conserved hypothetical protein
MPKRIITKATLPKIMRELDRWQGKLTWPLFCERVAKVLNVAAISKHTMYL
YPAIKERFQQRQKDLREARDALPRDFTLDSATRRIADLEAQVKRLEETNN
RLLDQFRRWQYNAYANNVRMDLLALDKPLPEVNRSGRPRGIPRPRVSKKT
>gid:108899  XAC2903  conserved hypothetical protein
MVRVKPKQVDAERKSIARAVSKDGYPFEPTDDHWRLNKDVQIALGLPGAI
DAIAEAGFRATLLRYAEEASARHTRNMETRFKRYLRDTGASRVTVSDLIN
WRASLATDEQYQLGGLKGFLLAWYDYGFEGITDEVVDLLQGWRIQGNEKG
VAVASGCPESGPYTDLEMAAILDWANLAAARKDIAFEDYAYLLTLAMTAR
RPVQIAALRGRDLVRETGEGTPMFRLNIPRAKQRGLAFRGAFRSLAILED
LYLVLRQQHRQSVAAVSEAIGRTVDPVLAGEVPIFLNRKRIEGVEHVDEL
TDLLMGSAPDQLHAKIDSFDSALQRCAKASTARSERTGEFIRLSAIRFRH
TRGTKLRREGFGAVIIADLLDHSDTQNVRVYTENTAQEAVVINELVGAQL
APFAQACLGKLVRSEREAIRGGDPRSRVPNDHQHAVGTCGNYGFCASGYR
SCYTCYHFQPWIDGPHEEVLVDLYAEKERTREAGCADVIVNANDQLILAV
EHCVSMCKKARDRMPEATLLEADAHG
>gid:108913  XAC2917  conserved hypothetical protein
MPEAGAIRPDGTVLGFDVGSRRIGVAVGSALGAGARAVAVINVHANGPDW
VALDRVHKQWRPDGLVVGDPLTLDDKDQPARKRAHAFARQLRERYALPVV
LIDERSSSVEAAQRFARERADGRKRRRDAEALDAMAAAVIVERWLAAPDQ
ATLLP
>gid:108952  XAC2956  replication related protein
MSVPQLPLALRAPPDQRFDSYIAAPDGLLAQLQALAAGQVSDWLYLSGPA
GTGKTHLALSLCAAAEQAGRTPAYLPLQAAAGRLRDALEALEGRSLVALD
GVESIAGQRDDEVALFDFHNRARAAGITLLYTARQMPDGLALVLPDLRSR
LAQCIRIGLPVLDDAARAAVLRDRAQRRGLALDEAAIDWLLTHSERELAA
LVALLDRLDRESLAAKRRITVPFLRRVLEDRGS
>gid:109008  XAC3012  conserved hypothetical protein
MRILLRHDPGGNAPLRYVQLTLQPDLFGGWELLRETGEIGGRTQLRRDQY
LQQDEADRAFDKARDSQLKRGFQLITGGADDAPR
>gid:109198  XAC3202  conserved hypothetical protein
MDALKPWHLYLLLCRNGSYYAGITNDLERRFQAHLRGTGARYTRANPPVQ
MLASHPYPDRASASRAECALKRLPRARKLAWLQAQPRTVHESQPADASIT
RV
>gid:109218  XAC3221  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:109219  XAC3223  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:109222  XAC3226  Tn5044 transposase
MGLNLSMTTLHETAYPRLKPDPTAKELQDIYTPTAVELQCVRNIATGPAT
RLALLLHLKLFQRLGYFTTLIEVPERIVQHVAQTLGMRRVPADRLASYDA
SGAKHVHLAQLRAFLNVCPLDAAGRDWLGTVAETAAQTKHIVPDIVNVML
EELVHHRFELPAFSTLGRTAIAAREHVHEAHYRQIADALSPTMRTLIDNL
LLTPPGSHHNDWHRLKREPKRPTNKEVRHYLRHIQRLRILAEQLPPIDVS
VPKLKQFRAMARALDAAELAELVPIKRYALAAIFIRSQYRKTLDDAADLF
IRLIQNLENTAQQKLIAYQLEHSKRADALIGQLREILQAYQVEGTDTERV
GAIAGVLVADIALLTAECDEHMAYAGRNYLPFLLAPYGTLRPLLFNCLEI
MGLRAASQDPSMERMIGAVLALRSQRRETIDAASLGVATTDLTWLSSAWR
KHVMPKALAAASPGWIHRKYFELAVLAQIKDELKSGDLYIPHGERYDDYR
EQLVDEATLAQELDAYGEVSGVATDAADFVQGLRTELTTLADAVDARFSD
NLHASMLDGRLVLKRLQGAQVTQAIATVDSAITDRLPPTSIVDVLVDTTR
WLDLHVHFRPIAGTDARVDDLLRRVITTLFCYGCNLGPTQTARSVKGFSR
RQISWLNLKYVTDETLDKAIVQVINMYNKFELPGYWGSGKSASADGTKWN
VYEQNLLSEYHIRYGGYGGIGYYHVSDKYIALFSHFIPCGIHEAVYILDG
MLANRSDIQPDTVHGDTQAQSFPVFGLAHLLGINLMPRIRNIKDLVFSRP
EPGRTYENIQALFGDSIDWTLIETHVHDMLRVAISIKLGKITASTILRRL
GTYSRKNKLYWAFRELGKAVRTLFLLRYIDDVEVRKTIHAATNKSEEFNG
FVKWAFFGGEGIIAENVQHEQRKIVRYNQLVANLVILHNVEQMTRVLAEL
RDEGSNISPEVLAGLSPYRTSHINRFGDYTLDLKRQVEPIDFSRRILAAT
TR
>gid:109243  XAC3247  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:109244  XAC3248  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHDNAPVESFFGLLKRERIRRRIYPTKDAARAEVFDYIEMFYNPQR
RHGSTGDLSPVEFERRYAQRGS
>gid:109265  XAC3269  RadC family protein
MLTPGCGRAGRFACRFPRGAQRAGGEEPMKRTQDRAVQYQLQMDEEGILL
AAATILEQRLQRQGRIHSPDQAGDYLVARCAHLPHEVFGVVFLDNQHHII
STEHLFTGTVDGCDVHPRVVAKRALELSAAAVILFHNHPSGNPEPSEADR
KVTERLKQALSLLDIRVLDHLVIGGQQHVSLASRGWA
>gid:109278  XAC3282  integrase
MTNFWAVLSLLPLFQWKQAVSSTAQQQLRDYITAPFGLLQIKDVHVGNIR
ASEINITGREAAIHALELFDRADERSRRVRLSKNRTHYPEPLPLCSPARL
LSVEISDYLGHRDRCCLAKETIDATARTLKLLRIACGDIPVSRIDHAHIY
KLWDLMRWAPPLLLSDPKYRDYTFEQAVALGKELGVSPPAPATLEKHRRF
LVSFFGKLVKAKAIPMSPMDAFPEIKKDLVVDMNKPERLFNEEELQRIFS
PKTFPAWAKKYPHRWWLPMISLYTGARINELAQLKVADIVEEAKVWCIRI
QKTVDADLRHKDRNKSRQSLKGKAAVRTLPIR
>gid:109279  XAC3283  ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>gid:109280  XAC3284  ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:109281  XAC3285  hypothetical protein
MDFLEDIKACGHPRLFPHLSAGVNRETGETNARYSQGAVNQFSSYMKTLG
FGKGIGAHAFRHTLATELHHKNVSDQDIALITGHSLRKNVPVLHDAYFHK
KPKLARAKQIKILAKYKPPVELPKYERGQFKESLADPSKFYP
>gid:109294  XAC3298  integrase
MMSRSAGLYARFFVPTDLRASIGSRYLVRSLGNRRGDHARLAAATMGVAL
SMAFDAMRKGMTVDLEELLEKVRSGDIKELTLKDVMLPDGTRIAQAQLDN
PADAVIFGDLMERSRRVEPAEVTSARVAKRRDQLRDASQSISRHAPDALM
LSKAMADHLGDLAGARLHQKTVLESRHTLRLFAGIIGEDVPVASLTQAHV
RAFFDGVRYWPSNATKRPAYRELSVPEVIKLAKKNQEPEPAAWTMAKHRQ
RLSVFLVSLVDGKHLAVNPLAGIRAIATPDSEDTGSPFTDAELKAIFDPV
EFPKWASKYPHRWFGPILGLYSGARVNEIAQLRLEDIDTIDGVPGFFVRK
IGKQQSIKNKHSRRFIPLAQPVIDCGFLTYVEEARQAGVERLFPDLPNST
GLGYGRQLSRQFSVYIKRQGVSEKGQGFHGFRHTIASKLDEAGVSASAIA
AITGHGTGQTVLEKFYIDRRSLPDRVATLGKFTVPISLPIYSGFKGYREN
SREIK
>gid:109316  XAC3320  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:109317  XAC3321  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:109428  XAC3432  conserved hypothetical protein
MSHDPAANLPLKSDTFGRILLVRDGERLFVRRDVGVAPWLLRGVAWWLAR
REALALRQLDDVPRTPRLLHWDGRHLDRSYLDGAAMYQRPPRGDLAYFRS
ARRLLQQLHRRGVAHNDLAKEANWLVQEDGSPAVIDFQLAVRGNPRARWM
RLLAREDLRHLLKHKRMYCPAALTPVERRVLKRTSWVRELWFATGKPVYR
FVTRRVLHWEDNEGQGPKP
>gid:109499  XAC3503  ISxac4 transposase
MAADVFVLPGKFTETRIIAALKQADAGVPVKDICRQVGISTATYYQWKSK
YAGLKASELRRVKEIESENAKLKRMYADLALDNAAMKDLIAKKL
>gid:109500  XAC3504  ISxac4 transposase
MRYLIEVHARPLRRSCACVGLARAAWYAPPLDWTVCDAGLISAIARVVED
RPSRGFWKCSDVLRRTRPDWNPKRIYRVYKAMRLNLRRAAKRRLPKRERV
ALYVPRLPDTVWSVDFMSDALACGRRFRTFNVVDDSNREVLHIEVDTSIN
SHRLVRVFEQIKHDHGLPQIVRSDNGPEFLGEAFTSWLKVNGVAIKYIQP
GKPNQNAFIERFNRTFREEVLDQHLFTCLDDIRQAIHWRMIDYNEERPHD
SLSGLTPTEYRNQHARRATFGVSA
>gid:109560  XAC3564  conserved hypothetical protein
MSDSMSEPIWSDLQVSYLQALGHTVYLDRDAVDALPAPVEIAERAPMAAP
ARVERALQPATANAPVARRNAPAAPAEAPRAPSTAPASVPQRRSRVGMPD
RLQMALLRASGCNPGDPATQALMASWPLAELRGNPAAKRALWPQLRALRR
RRDPA
>gid:109659  XAC3663  conserved hypothetical protein
MTEKHLSRGLPAHIRSDCRVLLLGSMPGVASLEAARYYAHPRNRFWPLMH
ALLGIDTAASYALRLHALLDHRVGLWDVIGQCERRGSLDTSIVAASIVVN
PLPALLATLPQLRMVACNGTAAAQAWRRHVQPLLSAQRCALPVVALPSTS
PANAAWSLPRLAAAWQPLCDAVR
>gid:109662  XAC3666  hypothetical protein
MSRQRPRKRVADTAGDCDGARAASHARPCRSDAFAIPDGFTREQIQPFHD
LERQYATLFQTSHVCAVQSARDIQASRTMDVEMDECAVINLAHDGFDSIG
THGLSSCVCICAKGKNPRGHDILGLLHYSGIQDAQDALSEIRDDMREEGV
QEPEIFLVGGMISNQDELGSFEIERDLLALQRPFNIVGAKLHPSMSDRNG
EENAINLVMTASGIYYYKSW
>gid:109700  XAC3704  DNA polymerase related protein
MFSAEVEPAWSFEAWRALARAGWCAQVEPDDVAWNGGAQGGLLMGQGLLA
MPAVVPPPRVSADFMQLAASVLCHRDPQRHAVLYRLLWRVASGEKALLER
ATDVDVHRLRQWQKAVQRDTHKMKAFVRFRRLPGEEEDFMSWFEPEHWIV
DRVAPFFARRFAGMRWAILTPYRSVRWDGQALTFGEGAVRTQVPADDAQE
TLWRTYYAHIFNPARLNPTMMRQEMPQKYWKNLPEATLLPELIRDAGMRV
REMAERAPEPVRRRVPAAPAPLPEAATQSLAQLRTAARDCRRCELWQPAT
QTVFGEGSDDASVMVIGEQPGDEEDLSGRPFVGPAGRLFNMALGELGVDR
ETFYVTNAVKHFRFEQRGKRRLHRNPERTHVQACSGWLQAERAHLRPAQI
VCLGATAAQAVLGSRFRLMQQRGQWQRLDDGTPVLTTVHPSWVLRQPSTD
AREEGYRGFVEDLRQLLDPPPPSSDQSR
>gid:109749  XAC3753  conserved hypothetical protein
MQRFVPANWDVMFRTLVHVCVFLVGLLAVCWVGIGYLGSNPLGACVAAVI
GACYVAGALELYRYRQATATLEAAVDGLTTAPSALSDWLQAVHPGLRNAV
RLRIRGERVALPAPVLTPYLVGLLVLLGMLGTLLGMMATLKGTGFALQSA
TDLQAIRGSLAAPVEGLAFAFGTSIAGVATSAMLGLLSALCRRERLQAVQ
RLDLKIASELHPHSDAHRRGEAFKLQQQQSALMPALIDRLQTLMHTIEQH
SMAAEQRLTAQQADFHAKSEAAYARLATAMEHALQAGVAESARAVGAALQ
PAMEATMASIVRDTAALQAHLTQAVQQQLDGITHGFQTSASTTAQHWSMA
LAAQERAQQTHNAQLQATLEQIAAHSVAVQDSVSSAVHQQLQGVSAAIEQ
NARTATEHWQVALSAQERTQATLAEQLQGTLEQIDQRSVAVQDSVTQAVQ
SN
>gid:109760  XAC3764  ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>gid:109761  XAC3765  ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:109836  XAC3840  conserved hypothetical protein
MSAIAHRYEFVYLFDVANGNPNGDPDAGNLPRLDPETNRGLVTDVALKRK
IRNYVALEKDNAPGYTIYMQEKSVLNNQHKQAYTALGIEHEAKKLPKEGD
KARQLTAWMCENFFDVRTFGAVMTTEVNTGQVRGPVQLAFATSVEPVLPL
EVSITRVAVTNEKDLEKERTMGRKHILPYGLYRAHGFVSAKLAERTGFSE
EDLQLLWRALTNLFEHDRSAARGEMAARKLIVFEHEHPMGNAPAHVLFDK
VKVERIDQADQGPARSFSDYRVVIDHALPAGVSVTELF
>gid:109837  XAC3841  conserved hypothetical protein
MAHTQRRRGVRTVTAMPLLALELGITGKADVVEFHHDPAGEFAFPVEYKR
GRPKSHRADEVQLCAQALCLESMLKRPVDAGALFYGQPRRRKDVVFDPAL
RELTQRTIAETRALLSHGLTPGARYDSKRCDACSLIDLCQPRLLGRGSVD
TWLRRQLDAEEE
>gid:109838  XAC3842  conserved hypothetical protein
MTMRRQLNTLYVTTDGAWLHKDGANVVLNVERQERTRLPVHMLESIVCIG
RVGVSPQLLGFCAEQGISICYLTAQGRFLARVEGPVSGNVLLRREQYRCS
DDPVRCAAIVRHMLAGKIHNQRAVLARGWRDHGDRMIDIPAFQHALKRLK
RIPQRLLIETSVDVLRGLEGEAAQSYFGVFGQLVRTESPLLRFGGRNRRP
PRDAFNALLSFLYTLLTHDCRSALETVGLDPAVGFLHRDRPGRPSLALDL
AEEFRPLLGERLALSLINRRQLNQRDFQVFDNGAVLLKDDARKTVLIAYQ
ERKREQLQHALLGEKIDIGLLPFVQAQLMARHLRGDLDGYPSFFWK
>gid:109839  XAC3843  conserved hypothetical protein
MMILVSYDVSTSSPGGEKRLRKVAKACRDRGQRVQFSVFEIEVEPAQWTE
LRQQLCDLIDPALDSLRFYHLGAKWETRVEHIGAKPSLNLKGPLIF
>gid:109923  XAC3927  conserved hypothetical protein
MNGQPAVVKDYGRYRRTLLAPIARLMVRHEASMLRQLQGWRHAPALLGTL
GGLALGMEFIPGDTLSASTVVGQEVFQQLQQALRRLHAFGITHNDLHGTN
VVVSAGVPVLIDFTSAWRFPRWLRRGTLARQLQRSDVANFQKMRHRLAGI
APSVDESARSAEPRWIHVIRSGWKRLYPRLKGKA
>gid:109931  XAC3935  IS1389 transposase
MVSAPARRALVCEWIGRGASERRALAAIGMSASALRYCPRQDRNGELRER
ILALAHRHRRYGVGMIYLKLRQEGRLVN
>gid:109932  XAC3936  IS1389 transposase
MEQLYREQQLQVRRRKRKKVPVGERQPLLRPAQASQVWSMDFVFDRSAEG
RVIMCLVIVDDATHEAVAIDVERAISGHGVTRVLDRLVHSRGLPQVIRTD
NGKEFCGKAMVAWAHARGVQLRLIQPGKPNQNAYVESFNGRPRDECLNEH
WFPTLLHARTEIERWRRGYKRGPTQESNWRDDASCLRPTSCQHRYHQPRT
LNPTATQGGGTSVLFPRMRC
>gid:109934  XAC3938  ISxac3 transposase
MFDYIEMFYHLQRRHGSTGDLSPVEFERRYAQRGS
>gid:109935  XAC3939  ISxac2 transposase
MFDTIKQVQDKATRWLWTYNHERPNMALGGITPAMKLAMAA
>gid:109937  XAC3941  DNA helicase
MPAWPEEGRGRMSTDTPSLEQGAIITAPLQPMSVIACAGSGKTFTAVRRL
AHIRRCLGDHRGRVALLSFSNIAVETFRREYQALAVGSLKIPSHQRVEID
TLDGFITTNILRPHAHRTMGCTTTPFLLSGSESFLLGKDYRFWVESPRTG
NYPVPQAHIHRVVIGPSGAGTDFFYQNSGMLTRINNGPKAAAALGKLGAY
THELARYWADLTLREQPEILRVLARRYPHILIDEAQDIGILHQLIIERLS
QAGSQISLIGDPNQGIYEFMGATGSFLSSYHSKPGVAPAALTRNYRSVPA
VLDLANAISARTDEPDRHAPLTLHGAYFVPYTTTSTQNVLAAFQATLMSA
GLDPGRSAVLCRARKWANELAGSEGAPGVGTTKMLAQAAMVRDIHSDYST
AFRLVATAIIGLTDNAPKGIVARLTQPARYPEERALRRAVWAFTRDSVAG
LPSASLPAVSDWQPTLLQRVRVLMTNLSTAHGLNHTSSKLGSRLSKKSLS
SSPLMSPPDLGADQPLRIRISTVHQVKGESLDAVLYMATKAHVSALLDGG
STEDGRIGYVAVTRARDLLWLGVPATALKILRADLEAKGFKDATAKIAAP
PAGHGIGTP
>gid:109938  XAC3942  conserved hypothetical protein
MGGLPLDSDDIHRPKTGSPSGSIVFKYIFRGLCSDDEADFMSALKPAKDG
QMEAHITISFSEADKTGRLRAKRWCGDHDDVGLTADMMENLRGVYLQPLR
DASQALRPGRMSQLSRLFQLLVDDNGKIEINGALKLLDDELKKKAPIVST
HKAITTRHSEMLGPQLSQLLELGVSVSDFQRLASRLSLTVDLFELEQNGL
GFNNLIFMAVVLSELTKNPQASYRGLIIEEPEAHLHPQLQAVLLGYFETI
KAVDKEKPVQLFVTSHSPNFASIANLDSLTCLVDTGIKVETFFPRDIKFA
VGKREKLERYLDVTRAELFFARRAILVEGAAELVLVSVLAEKCGHDLRKH
GVSLISVEGLNFDSFIPLFGENALKIPVSVLTDADPTGPSEEDEEAGSDG
SVVADDLASTDDASGGKKKAAPVYPSIGDKVQLSANTLSMIKFEDSYLKV
FHGLKTLEYDLALYACNRKMMLAALSELHPKISASLAKKVDAATGETEKA
RVLFSGMFERKNGNVQKGRFAQALAHLIAKAKAEDFIVPGYIKSAVEHAC
QPGPKKGEDE
>gid:109939  XAC3943  ISxac2 transposase
MVRPSQRREMAQSAVTSGRTNIRHACQTFAVSQTCFRYQAKASEQNARIA
DWLVRLTSAHRDWGFGLCYLYLRNVKGFGWNHKRVYRIYRELELNLRIKP
KKRLVRERPEPLAVPEAINQVWSMDFMHDQLADGRSFRLFNVLDDFNREG
LGIEVDLSLPSARVIRSLEQIIEWRGKPAVIRCDNGPEYISGALLSWAQR
LGIRVEHIQPGKPQQNAYVERYNRTIRYAWLARTLFDTIDQVQDKATRWL
WTYNHERPNMALGGITPAMKLAMAA
>gid:109940  XAC3944  ISxac2 transposase
MKKSRFTDSQIIAVLKQAQAGAPVPELCREHGISSATFYKWRSKFGGMDV
SMVARMKELEEENRRLKKMYAEAQLSTDLLKEALAKKW
>gid:109941  XAC3945  recombination related protein
MAELTITNFRKINNAVLHFQSGLNVLVGANNAGKTAIVDALRSLLAGHDE
PYPRLGNPPAH
>gid:110112  XAC4116  serine/threonine kinase
MHEDTATVTELVTAMRRGALDLNAVLAALGRRAAVPEAEYRAGVETLWTL
QRQHLLDDATVTTLVSRLDALRDRAPAIAPAPAPSAAQEPTPTPTVVDDD
ATVVMPRPGPARMAENDITRVQPAQLVSGDAPSLTSLGLATHPNLGTQTG
TGTGTGTAGTASVSSWQHLAEAAGGDHAGVGSLLKGRFQLERELGRGGMG
VVYLARDERKVEARDRDPWLAVKVLSDEFRRHPDALVALQREARRSQKLA
HDNIVRVYDFDKDRTLVFMTMEYIDGCDLKTLIREQAYNGMPLKQAWPLI
DGMGRALMRAHAAGIVHSDFKPGNVMVTRDGVAKVFDFGIARAGKHAAEV
AGEQTVFDAAALGALTPAYASLEMIRGQAPTPADDLYALGCVCFELLTGK
HPFDKLSAEVALREGRRPRPVPGLTRRQYATLCAAVAFPAAQRLTRVDQL
LDGLRQVPLRERALPLIGYGATALALVGVGGWGASRYLHQRHLDEVIARF
GAQDPQHYRDETQAMQALSSLSADDRRRVLMDRSDLIEDFLLRRLDALWN
PANGRDDYRGALQVFALRDRLRLYSPALDARRQAIEREKNQRLNALDTTL
NRQLDAGLLFRTQPDNALATLDQVRRIDPGSSLLRHSALEVRLDAAIAEA
VAAGQLTTARTEVEQARAAFPDSLRLQLRSAEVGVAEQQVRRTPVAATPR
DADSARTALAADLANPSTDPAWRARIDAELAALPAAERSSQGSALAEAIS
TAVAVQSDPAQLAGAQALVDFGLGLAPRSASLLAQRMRLQTLEHQFEQAL
ARESAEAELAARIESLRRAAAANDLGKTRDALARIRTLQPGHPLLRSEGP
QLLSQVYRNTAGDAFAQGRYAQAADLLRQGLQTLGQRAELRSAQTRYAVA
AAVMTAGALPPSERQQLAGRLQTLYRTDAIALAQLEAQMKAAGQLPEGSL
SARLQRARAADAPADDSVPPGSATGTAAPAVGAKPARDNSAARPAAATPT
TARDPAAAAALPANVDDADSLPPVPTGPDPCAGLAGRGAACFDTLGQTRA
PMLVVIPAIAGGKPYALSRGEVAVADFNLFCQATGRCTAQAAATPELARA
PVRNISLTLAQAYLRWLTIGSGGWRYRLPSDAEWLHAAQAASNWRQAEDS
NCVPPNASAGDAARAPVSARGRDANPWGLINLTGNVWEWVSTGTGLQARG
GSFSSYWSDCTVQASRADNGSAQPDVGLRVLRELP
>gid:110203  XAC4207  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:110204  XAC4208  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:110219  XAC4223  conserved hypothetical protein
MFFRNLTLFRFPTTLDFSQIDTLLPPVQLKPVGPLEMSSRGFISPFGRDE
QEVLSHRLEDFLWLTVGGEDKILPGAVVNDLLERKVAEIEEKEGRRPGGK
ARKRLKDDLIHELLPRAFVKSSRTDAILDLQHGYIAVNSSSRKSGENVMS
EIRGALGSFPALPLNAEVAPRAILTGWIAGEPLPEGLSLGEECEMKDPIE
GGAVVKCQHQELRGDEIDKHLEAGKQVTKLALVLDDNLSFVLGDDLVIRK
LKFLDGALDQLEHSEDDGARAELDARFALMSAEIRRLFLLLETALKLSKA
E
>gid:110318  XAC4322  ISxac3 transposase
MSSKRYTDEFKIEAVRQVTDRGFKVAEVAERLGVTTHSLYAWLRKFGKPG
VVQRAEADQSAEVRRLKIELRRVTEERDILKKAAAYFAKG
>gid:110319  XAC4323  ISxac3 transposase
MCRVLRVNRAGYYAWLKSPDSERAKEDERLLGLIKHHWLASGSVYGHRKI
AKDLRDLGERCSRHRVHRLMRTEGLRAQVGYGRKPRFHGGTPCKAAANLL
DRQFDVTEPDTAWASDFTFIRTHEGWMYLAVVIDLFSRQVVGWAMRDRAD
TELVVQALLSAVWRRKPSAGCLVHSDQGSVYTSDDWRSFLASHGLVCSMS
RRGNCHEREACPWGTTHRWRASLVCSNASGSGGGSTPPRTPHAPRCSTTS
RCSTTHNAVMVQLATCPL
>gid:109570  alkB  DNA repair system specific for alkylated DNA
MSGMRIRVALPGAEINWWRGWLQPAQADALMQALLAQAQWEVHRIRMFGR
MVDSPRLSSWIGDPEASYRYSGTRFSPQPWLDVLQPVRIRLEDETGHRFN
SVLVNRYRSGSDAMGWHSDDEPELGAQPLIASVSLGATRRFAFKHRDDAA
VKQTLELGHGDLLLMGGDTQRHYKHALPRTARPMGERINLTFRQIAPAEA
GSG
>gid:108947  comEA  DNA transport competence protein
MKSFTVVLKSLLLALLLSSNAYALDKVDINTASAEELDKVLVNVGRSKAE
AIVEHRQANGPFKSAEELALVKGIGLKTVERNRDLIEVGATMAPAKKAAK
GAAVKPVGRR
>gid:108386  dbpA  ATP-dependent RNA helicase
MNEFSALPLSPALAPGIDALGYTVLTPIQAHSLPPILQGLDVIAQAPTGS
GKTAAFGLGLLQKLDPALTRAQALVLCPTRELADQVGKQLRKLATGIPNM
KLVVLTGGMPLGPQLASLEAHDPQVVVGTPGRIQELARKRALHLGGVRTL
VLDEADRMLDMGFEEPIREIASRCDKHRQSLLFSATFPDSIRTLARELLK
EPVEITVEGADSAPEIDQQFFEVDPTYRQKAVAGLLLRFNPESSVVFCNT
RKEVDEVAGSLQEFGFSALALHGDMEQRDRDEVLVRFVNRSCNVLVASDV
AARGLDVEDLSAVVNYELPTDTETYRHRIGRTARAGKHGLALSLVAPRET
ARAQALETEQGQPLKWSRAPLATARPAQLPQAAMTTLRIDGGKTDKLRAG
DILGALTGEAGLSGAAIGKIAIYPTRSYVAIARAQVAKALAHLHAGKIKG
RRFRVSKL
>gid:106573  dcm  cytosine methyltransferase
MKKKIGEPKIKIAGPSVISLFSGCGGMDFGIKAAGGRIVFSNDILADACK
TLEKYFPDSTVSCGDIAALHEFPEADVVVGGYPCQSFSMAGNRDPKKDAS
DTPRVP
>gid:108809  deaD  ATP-dependent RNA helicase
MTQESSAPLLFADLGLSDAVMKAVANVGYESPSPIQAATIPALLAGRDVL
GQAQTGTGKTAAFALPVLSNADLNQVKPQALVLAPTRELAIQVAEAFQKY
AEAIPGFRVLPVYGGQPYAQQLSALKRGVHVVVGTPGRVIDHLDRGTLDL
SQLKTLVLDEADEMLRMGFIDDVEAVLKKLPEQRQVALFSATMPPAIRRI
AQTYLKDPAEVTIAAKTTTSANIRQRYWWVSGLHKLDALTRILEVEPFDG
MIIFARTKAATEELAQKLQARGMAAAAINGDMQQAAREKTIAQLKDGKLD
ILVATDVAARGLDVERISHVLNYDIPYDTESYVHRIGRTGRAGRNGDAIL
FVTPREKGMLRSIERATRQPIEEMQLPSVDAVNDTRVARFMTRITETLAG
GQIDMYRDLLQRYEAENNVPAIDIAAALAKLLQGDAPFLLTPPVRGVREE
SAPRQRNERADRGERPRFEPTFERGPRADGERAARPPRGDDTGERPRREP
APRGEPEFGMESYRIEVGHSHGVKPANIVGAIANEAGLESRYIGRIDIQD
DYSILDLPADMPRELLAHLKKVWVSGQQLNMRKLEDGEAAAAASKPRFPR
GGKPSGRPNGPGRPMDRAGAPHRKGPPKPRGE
>gid:110027  dinG  ATP-dependent helicase
MSEVAPPAAPSAPVPTQRSLTDPIKASIRDAYAKLQANTPGFATRRAQSQ
MIGLVSRALATSGGIGIAEAPTGVGKSLGYLTAGVPIALATKKKLVISTG
TVALQSQLVERDIPAFLKATGLDATVALAKGRTRYLCTRNAAELEGDTSQ
NAMFEDEQALYDRPLSPVDADLAKRLAKAYAARTWNGDLDDAPEQISVPL
RMRITTPASGCAGRRCSYAAQCPVLRARTDVREAQIVVTNHALLLSSLSL
GDAENGQPLIAPPADMLLVLDEGHHIAGVAIDQGAANLPLDDMARRTGRM
QILIAAAYRAVDKDKLGNLLPNEAIEVAARVSKLLKAFHAEVERVWKPEP
GERDPLWRAANGKLPPQWGPAIEELGEETRALFNWVHAAHSAIAKGKQDD
SARERLQRSLGLALEMAEQQHNLWSGWRREDKEGQPPMARWITLSRDGDL
ICHCSPVSAAQVLRTMIWNEVDSVVMTSATLTGGGDFQSFAIDNGLPDHA
EMASLASPFDLPNQAELIVPNFPVTPDDREGHPKEVAKYLVRELDWNAKG
SIVLFTSRWKMEKVADLMPLAQRNRVLVQGEGNKSQLITEHLRRIGAGEG
SVLFGLNSFGEGLDLPGEACTTVVITQVPFAVPTDPQTSTLSEWLESRGH
NAFNLIAIPHALRTLTQFAGRLIRSSSDHGRVIILDSRLLTRRYGKRIID
ALPPFKRVIGR
>gid:109618  dinP  DNA polymerase IV
MRKIVHVDMDAFYASVEQRDDPSLRGKPVVVAWRGARSVVCAASYEARIF
GIRSAMPAVRAERLCPDAIFVPPDFTRYKAVSRQVREIFHRHTDLVEPLS
LDEAYLDVTHAKTGMQLATEVAQLIRTQIREETQLTASAGIAPNKFLAKI
ASDWRKPDGQFVIAPSRVDAFLLPLKVNRIPGVGKVMDGKLAALGIVTVA
DLRQRPLEELQAHFGSFGQSLYRRARGIDERPVEPDQEVQSVSSEDTFSE
DLALDALAPHILRLAEKTWLATRRTERIGRTVVLKLKTSNFRILTRSYTP
EQPPTSQEALAQIALALTRRVELPAQTRYRLVGVGLGGFSDVENGAVQGQ
LFGQMPPLE
>gid:105997  dnaA  chromosomal replication initiator
MDAWPRCLERLEAEFPPEDVHTWLKPLQAEDRGDSIVLYAPNAFIVDQVR
ERYLPRIRELLAYFVGNGDVALAVGSRPRAPEPAPAPVAVPSAPQAAPIV
PFAGNLDSHYTFANFVEGRSNQLGLAAAIQAAQKPGDRAHNPLLLYGSTG
LGKTHLMFAAGNALRQANPAAKVMYLRSEQFFSAMIRALQDKAMDQFKRQ
FQQIDALLIDDIQFFAGKDRTQEEFFHTFNALFDGRQQIILTCDRYPREV
EGLEPRLKSRLAWGLSVAIDPPDFETRAAIVLAKARERGAEIPDDVAFLI
AKKMRSNVRDLEGALNTLVARANFTGRSITVEFAQETLRDLLRAQQQAIG
IPNIQKTVADYYGLQMKDLLSKRRTRSLARPRQVAMALAKELTEHSLPEI
GDAFAGRDHTTVLHACRQIRTLMEADGKLREDWEKLIRKLSE
>gid:107473  dnaB  replicative DNA helicase
MSARPGFRSKRSRDRDDDDYDRPEPRLDQLRVPPHSVEAEQAVLGGLMLA
PDAFDRVNDQLTENDFYRRDHRLIYRAIRELNEKDRPFDAVTLGEWFESQ
GKLEQVGDGAYLIELASTTPSAANIAAYAEIVRDKAVLRQLIEVGTTIVN
DGFQPEGRDSVELLSSAEKAVFKIAEAGARGRTDFVAMPGALKDAFEELR
NRFENGGNITGLPTGYTDFDAMTAGLQPTDLIILAARPAMGKTTFALNIA
EYAAIKSKKGVAVFSMEMSASQLAMRLISSNGRINAQRLRTGALEDEDWA
RVTGAIKMLKETKIFIDDTPGVSPEVLRSKCRRLKREHDLGLIVIDYLQL
MSVPGNSENRATEISEISRSLKGLAKELNVPVIALSQLNRSLETRTDKRP
VMADLRESGAIEQDADMIVFIYRDDYYNKENSPDKGLAEIIIGKHRGGPT
GSCKLKFFGEYTRFDNLAHDSVGSFE
>gid:107402  dnaE1  DNA polymerase III alpha chain
MSTSRFVHLHVHTEFSLADSTIRVPEKPDQADPKKAKQANLLSRAVELDM
PALAVTDLNNLFALVKFYKAAEGVGIKPIAGADVMIATPDMTPWRMTLLC
RDREGYLSLSRLLTRAWMEGHRPEGGVAIHPEWLQAGHANLFALAGRDSL
AGRLFNEGRADLAEQQLADWQRVFGDGLHLELTRTGRDGEERFNQFALHA
AGVRGLPVVASNDVRFLYASDFNAHEARVCISSGRVLDDPKRPRDYSDQQ
YMKSSEEMAALFADIPDAIDNTHALAQRCNIEMRLGTYFLPAYPVPEDET
LDTWIRSQSRQGLAARLEKNPIAPGKTRQDYVDRLEFELDTIIKMGFPGY
FLIVADFIQWGKNQGIPIGPGRGSGAGSLVAWALQITDLDPLPYNLLFER
FLNPERVSMPDFDIDFCMDRRDEVIDYVARKYGRERVSQIITYGTMAAKA
VVRDAGRVLGFSYGLVDSVAKLIPNILGITLKDAMGEGKDSEMASPDLIQ
RYQVEDDVRDLMDLARQLEDLTRNAGKHAGGVVIAPDPLSEFCPLFAEHD
EDGRGKNPVTQFDKDDVEAVGLVKFDFLGLRTLTIIDWAVKAINVRHARA
GIDPVDITAIPLDDVPTYKGVFASGNTGAVFQFESSGMRRLLKDARPDRF
EDLIALVSLYRPGPMDLIPDFNARKHGQQEIVYPDPRTEVILKDTYGIMV
YQEQVMQMAQIVGDYSLGGADLLRRAMGKKVPAEMAKHREIFREGAAKGG
VSAAKADEIFDLMEKFAGYGFNKSHAAAYALVSYQTAWLKRHYPAEFMAA
TLSSDMDNTDKVVGFLDEVRNLGLTVKPPRVNESAYMFEAASPDTIQYGL
GAIKGVGQGACEAIVEERQRGGPYTTLLDFCTRVGTAKLNRRTLEAMINA
GAMDGLGKNRASLMLQLPEVMKATEQMARERASGQNSLFGGPDPSAPAMR
LDLPESKEWPLGQLLTGERETLGFYLSGHPFDPHRDEVRELVGCDLGALE
KILASQQRGGGGGGGDGEKRAWRPEVGAILAGQVVGVRRKGDSQVFVQLE
DGRGRVECSAFSDAMAEFGHLLTRDRILIVKGGLREDEFNGGYSLRIRQC
WDYEQICADHAQRLSLRLDLREKQAFKRIDALLAKHRPGKTPLRLDLLLR
APSGGVAGMLDLNGNHSVRIDQQLMDSLRADPAVRTVKIKYSPPWA
>gid:107195  dnaE2  DNA polymerase III alpha chain
MPRAWNVAARLRAANDDIVHAQQADGLPAYAELHCLSDFSFLRGASSAEQ
LFARAQQCGYSALAITDECSLAGIVRGLEASRATGVRLIVGSEFTLVDGT
RFVLLVENAHGYPQLCGLITTARRAASKGAYRLDRAEVQAQFRDVAPGVF
ALWLPGAQPQAEQGAWLQQVFGERAFLAVELHREQDDVARLHVLQALAQQ
LGMTALASGDVHMAQRRERIVQDTLTAIRHTLPLAECGAHLFRNGERHLR
TRRALGNIYPDALLQATVELAQRCTFDISKISYTYPRELVPEGHTPTSYL
RQLTEAGIRRRWPGGITAKVREDIEKELALIALKKYEAFFLTVQDVVRFA
REQNILCQGRGSSANSAVCYALGITAVNPDETRLLMARFLSEKRDEPPDI
DVDFEHERREEVLQYVYSKYGRERAALAATVICYRGKSAVRDVAKAFGLP
PDQIALLANCYGWGNGETPMDQRIEEAGFDLANPLINKILAVTEHLRDHP
RHLSQHVGGFVISDEPLSLLVPVENAAMANRTIIQWDKDDLETMKLLKVD
CLALGMLTCIRKTLDLVRGHRGRNYSIATLPGGDAPTYKMIQRADTVGVF
QIESRAQMAMLPRLKPAAFYDLVIEVAIVRPGPIQGDMVHPYLRRRQGRE
EVNYPSPAVEDILKPTLGVPLFQEQVMELLMHAADYSEDEADNLRRSMAA
WRRGGDMEQHRTRVRERMQGKGYASSFIDQIFEQIKGFGSYGFPQSHAAS
FAKLVYASCWLKRHEPAAFACGLLNAQPMGFYSASQIVQDARRGSPERER
VEVLPVDVLHSDWDNTLVGGRPWRSAADPGEQPAIRLGMRQVAGLSQVVA
QRIVAARTQRAFADIGDLCLRAALDEKARLALAEAGALQGMVGNRNAARW
AMAGVEARCPLLPGSPEERPVEFEAPRAGEEILADYRSVGLSLRQHPMAL
LRPQMRQRRILGLRELQGRRHGSGVHVAGLVTQRQRPATAKGTIFVTLED
EQGMINVIVWSHLALRRRRALLESRLLAVRGRWERVDGVEHLIAGDLYDL
SNLLGDMQLPSRDFH
>gid:109872  dnaG  DNA primase
MARIPDAFIDELLARTDIVEVVGGRVPLKRQGKEYSARCPFHDERSASFT
VSPTKQFYHCFGCGAHGTAISFLMNYDRLEFLDAVDELAKRAGMDIPRET
QQRTPQQQDDSRELYSALEAATKFFQRQLEGSGRARDYLDGRGVDADNRA
RFQIGYAPDGYSALKDTLGTDARRMSVLERAGLFSKNDRGHVYDKFRDRV
MFPIFDRRGRVIAFGGRIMGAPADGRDPGPKYLNSPETALFHKGRELYGL
WQVRQANQKIERLIVVEGYMDVVSLFQFGVTQAVATLGTATTPEHAELLF
RNAPDVYFCFDGDNAGRKAGWRALESVLPRMKDGRQAFFLFLPDGEDPDT
IVRKEGAQAFDQRLKQATPLSQFFFDEMSREINLHTLDGKARLAERARPL
LAQIPEGAFGDLMKQELARLTGVGANASAQQPAPRPRPPARMGAPTQKRS
LVRASIAILLQRPSLAMSLEGVHDFSGLRLPGIDLLMELLDLVRQRPEIS
TGALLEHFAEREELAALQKLAAQELPGDEHSWTLELHDVVAQLDKQLLRQ
RVEELQAKQRAQGLDDTDKYEMRELLKALAAL
>gid:105998  dnaN  DNA polymerase III beta chain
MRFTLQREAFLKPLAQVVNVVERRQTLPVLANLLVQVKDGQVSLTGTDLE
VEMISRTLVEDAQDGETTIPARKLFDILRALPDGSRVTISQTGDKVTVQA
GRSRFTLATLPSNDFPSVDEVEATERVVVPEAGLKELIERTAFAMAQQDV
RYYLNGLLFDLRDGLLRCVATDGHRLALCEMELEKAGGAKRQIIVPRKGV
TELQRLLEGADREVELEVGRSHIRVKRGDVTFTSKLIDGRFPDYEAVIPI
GADREVKVDREALRASLQRAAILSNEKYRGVRVEVSPGQLKISAHNPEQE
EAQEEIEADTKVDDLAIGFNVNYLLDALSALRDEHVVIQLRDANSSALVR
EASSEKSRHVVMPLRL
>gid:107086  dnaQ  DNA polymerase III epsilon chain
MRQIILDTETTGLEWRKGNRVVEIGAVELLERRPSGNNFHRYLKPDCDFE
PGAQEVTGLTLEFLADKPLFGEVVDEFLAYIDGAELIIHNAAFDLGFLDN
ELALLGDHYGRIVERATVVDTLMMARERYPGQRNSLDALCKRLGVDNSHR
QLHGALLDAQILADVYIALTSGQEEIGFASADAGQQADAASGMIAFDPAL
LLPRPRVAVTASESQAHEARLAQLRKKAGRALWDPPVEEIAVAG
>gid:107105  dnaX  DNA polymerase III tau and gamma subunits
MSYLVLARKWRPKRFAELVGQEHVVRALSNALDSGRVHHAFLFTGTRGVG
KTTIARIFAKSLNCETGTSADPCGTCPACLDIDAGRYIDLLEIDAASNTG
VDDVREVIENAQYMPSRGKFKVYLIDEVHMLSKAAFNALLKTLEEPPEHV
KFLLATTDPQKLPVTVLSRCLQFNLKRLDEDQIQGQMTRILAAEPIESDP
SAIVQLSKAADGSLRDGLSLLDQAIAYAGGALREDVVRTMLGTVDRTQVG
AMLHALTDGDGARLLQVVAALAEFSPDWSGVLEALAEALHRVQVQQLVPS
VAFVGDGIDPTPFAAQLRPEVVQLWYQMALNGRRDLYLAPSPRAGFEMAV
LRMLAFRPAAAVPAGGSDDGRGATAGGGARSAAAGGQLAAPVAAAPVTSA
PAVVAAPTAVTDTMPVAAAEASLPASGTRATPAPLVVLSPQESSPASRAS
AANARNDDTPPWAVDDAPVRARATPTPDTMGVLAPEAAMAPPSAPVELAQ
PDGIASDAAVEAPASVVVPSPAALVEPSSPEEPAVVASAATAGAASDAAT
DAAALLDDGRIADAEQWLELVTRSGLNGPSRQLAANAAFIGHRDGVLRLA
LAPGFEYLNSERSIANLAQALAPVLGNAPRIVVETGSADVETLHERASRQ
KGERQSAAETAFMNDPTVQLLVQQQGARIVPDSIRPYDE
>gid:106908  exo  exodeoxyribonuclease IX
MTTPTNPLDAAPLAIQRPTPLYLVDASLYVFRAWHSIPDEFQDAQGWPTN
AVHGFARFLLDLLEREDPQYITIAFDEALDSCFRHAIYPAYKGNRTPAPD
ALRRQFAHCKALCAALGLSVLAHRDYEADDLIGSALHSVRAHGLRGVIVS
ADKDLSQLLLEHDEQWDYARNQRWGMDGVKARHGVHAHQIADYLALCGDA
IDNIPGVTGIGAKSAAVLLAHFGSLDALLERIDEIPFLRLRGAAQMAVRL
REQREHALLWRQLTTIALDAPLALTQTGFTRAHADADMLTGLCESLRFGP
LTRRRLLAASTGVPAPFPPASLAQGPRP
>gid:109898  exoA  exodeoxyribonuclease III
MRIISFNANGLRSAASKGFFAWFAAQDADVLCVQETKAQEHQLAGPDFLP
TGYKAWFRDASTKKGYSGVAIYSRHEPDEVRTALGWPEFDEEGRYIEARF
GNLSVVSFYIPSGSSGELRQDYKFQVMQWLRPILDEWLASGRQYVLCGDW
NIVRTALDIKNWKSNQKNSGCLPPERDWLNGLCADAPEDASAADGRGWVD
SYRVLHPQGQDYTWWSNRGAARANNVGWRIDYQLVTPGLRDALKACSIYR
EERFSDHAPYIVDYAQ
>gid:109254  exoI  succinoglycan biosynthesis protein
MATRQDTRSAVCRLERKCLLRLLKRWMSLLVVLVLPVSAAELVGRATVTD
GDTITVAHQRIRLWGIDAPESAQQCSAHDGNAWPCGRRAAAALDGYLLDK
TVRCQPKDTDRYGRTVAECFVQGQSVNAWMVRSGWAVAYRQYATAFEADE
RIAQQQRRNLWQGAFQMPADYRRDRKARTATTRQAAASVPGNCTIKGNIS
SDGKKIFHLPGQRDYAKTRISPAKGERFFCSVNEAATAGWRPAHR
>gid:106518  fis  DNA-binding protein
MNAAPSRPDTSRGAPKSPLREHVAQSVRRYLRDLDGSDADDVYEIVLREM
EIPLFVEVLNHCEGNQSRAAAMLGIHRATLRKKLKEYGLT
>gid:107627  gyrA  DNA gyrase subunit A
MAETAKEIIQVNLEDEMRKSYLDYAMSVIVGRALPDARDGLKPVHRRVLY
AMHELGAHSNKAYFKSARIVGDVIGKYHPHGDQSVYDTLVRMAQPFSLRY
MMVDGQGNFGSVDGDSAAAMRYTESRMSRLAHELMADIDKETVDFQPNYD
EKEQEPTVMPTRFPSLLVNGSAGIAVGMATNIPPHNLTEAINACIALIDT
PELDVEGLMEYIPGPDFPTAGIINGTAGIAAGYRTGRGRVRIRAKADVEV
ADNGREAIVVTEIPYQVNKARLIEKIAELVKEKKLEGISELRDESDKDGM
RIYIEVKRGESAEVVLNNLYQQTQMEAVFGINMVALIDGRPQLMNLKQML
EAFIRHRREVVTRRTIFELRKARARAHVLEGLTVALANIDEMIELIKTSA
NPQEARERMLAKTWQPGLVGALLGAAGAEASKPEDLPPGVGLIQGFYQLS
EVQAAQILEMRLHRLTGLEQEKLTDEYKQLLGVIEGLIRILENPDVLLQV
IRDELINVREEYGDERRTEIRHSEEDLDILDLIAPEDVVVTLSHAGYAKR
QPVSAYRAQRRGGRGRSAAATKEEDFIDQLWLVNTHDTLLTFTSSGKVFW
LPVHQLPEAGSNARGRPIINWIPLESGERVQAVLPVREYADNRYVFFATR
NGTVKKTPLSEFAFRLARGKIAINLDEGDALVGVALTDGDRDVLLFASNG
KTVRFGESTVRSMGRTATGVRGIRLTKGEEVVSLIVAERAGGVEEEIEGD
EVDDVVEATDGGESAVIEVADDGNVAYILTATENGYGKRTPLTEYPRKGR
GTQGVIGIQTTERNGKLVRAVLLGATDEVLLISDGGTLVRTRGSEISRVG
RNTQGVTLIRLSKGEKLQAVERLDASLDEVDEGVDEASASVSETPPTDA
>gid:106000  gyrB  DNA gyrase subunit B
MTDEQNTPPTPNGTYDSSKITVLRGLEAVRKRPGMYIGDVHDGTGLHHMV
FEVVDNSVDEALAGHADDIVVKILADGSVAVSDNGRGVPVDIHKEEGVSA
AEVILTVLHAGGKFDDNSYKVSGGLHGVGVSVVNALSEHLWLDIWRDGFH
YQQEYALGEPQYPLKQLEASTKRGTTLRFKPSVAIFSDVEFHYDILARRL
RELSFLNSGVKITLIDERGEGRRDDFHYEGGIRSFVEHLAQLKSPLHPNV
ISVTGEHNGIMVDVALQWTDAYQETMYCFTNNIPQKDGGTHLAGFRAALT
RVLSTYIEQNGIAKQAKVALTGDDMREGMIAVLSVKVPDPSFSSQTKEKL
VSSDVRPAVENAFGARLQEFLQENPNEAKAITGKIVDAARAREAARKARD
LTRRKGALDIAGLPGKLADCQEKDPALSELFIVEGDSAGGSAKQGRNRKN
QAVLPLRGKILNVERARFDRMLASDQVGTLITALGTGIGRDEYNPDKLRY
HRIILMTDADVDGSHIRTLLLTFFYRQMPELIERGYIYIGLPPLYKLKQG
KSELYLKDDAALNAYLASNAVEGAALIPATDEPPITGEALEKLLMLFTSA
NEAIARNAHRYDPALLTALIDLPPLDVEKLQAEGDQHPTLDALQAVLNRG
TLGTARYQLRFDPGSDNAPATLVAIRRHMGEEFTQVLPMGAFESGELRPL
REVSLALHDLVREGAQIVRGNKSHPITSFAQAHAWLLDEAKKGRQVQRFK
GLGEMNAEQLWETTVNPDTRRLLQVRIEDAVAADQIFSTLMGDVVEPRRD
FIEDNALKVSNLDI
>gid:108261  helD  helicase IV
MQDRAPSWFLPRWLDRGHVVLTYRSGKEEYQRSFVLRGAARAQARKDILQ
RAHRELTRALDAADRALVPIRSDIERTYRSDRYVRHSQAARITSNHADMA
RKVAPVLDALHRHPLLAPSDRELARSLKDELADLAKRVNKPEASRTEHNE
QYLAYQIEAEADYFNAVESSPLTPEQTAAALVFEDATLVVAAAGSGKSSC
IVGKIGFALKSGLFQDHEILALAYNKKAAKSLDERLSKKLSKAIGRKVAV
ASKTFHSFGLSVMVEANGEGIRPRVLKEEAGEEGRFLRTVIDRLKDEDLP
FQQALADWLLFAPFEDPSPSGIAGNLEDCEKRYEECCRQRIKAKREEGKK
SYEPTIPTLDPAIHVRSLEERSIVNWLFLRGIPVGYETPDWDGAKLLGLG
VSESTKRQRPYKPDFTYCQTQQLSNGTQREVRVIHEHFALDGNGRAPAWM
GGAEYEAQARSKRAMFKKRVAGGHRNGKPVVFFETTSAQMRDGTLWNHLE
HSLKAAGIAVGPRSQDVYRRAIASFGPIEGLEQLIMDFVLRFKDSGLTEG
EVLQEAKQQPNSWRALLFLKVAFPVFHAYQQALRDAGKIDYADMLREALA
ALRDGRVKTPYRFVLVDEFQDIARLRADLVKSVLDQAPYESIVFCVGDDW
QTINRFSGSDVGIFKNIGNHFGRHERQLMLSRTFRCSSGIATLARELVLK
NDNQFDKPVQARPDRLSHCVRVVLHKPGAEHRMPALETQLAALIETGKDL
GIALPSVQILARTTVKTTAPLGLDSKDAINALKARYTGQLEVEVMSLHGS
KGLEADFVVLVGFDSGFRGFPDERAPEPLLDLVLPKLSHENEEERRLLYV
GLTRAKHQVIVLANGDAPSEYVLELSALSEHHAFIDWINLGAQRTDCPRC
KVGSLQPPRNSELKLVCSRSARCGYRGQIPRRQAPTVNS
>gid:108584  himA  integration host factor alpha subunit
MALTKAEMAERLFDEVGLNKREAKEFVDAFFDVLRDALEQGRQVKLSGFG
NFDLRRKNQRPGRNPKTGEEIPISARTVVTFRPGQKLKERVEAYAGSGQ
>gid:108775  holA  DNA polymerase III delta subunit
MELRPEQLAGQSSQPLQPVYLIAGPETLRVLEAADAVRARARAEGIGERE
VFDADGREFDWNQLDASFNAPSLFSPRRLVEVRLPSGKPGKDGAEVISRF
CANPPPDVVLLITANDWSKAHHGKWAEAVGRIGTIAVAWAIKPHELSDWI
ERRLRAHGLRADAAAVQRLSERVEGNLLAAAQEIDKLALLADGKTLDLEA
MESLVADAARYDVFRLAETTFSGQPAAVIRMLAGLRGEGEAVAALMPILI
KELLRTASLAKVQANGGNLAAEMKAQGLWESRQAPFKRALQRHPEPHRWE
RFVAEAGLVDRMAKGRADGDAWVGLERLLVAVAEARAVRLLA
>gid:107128  holB  DNA polymerase III delta' subunit
MTAAFSPWQQRAFDQTVAALDAGRLGHGLLICGPEGLGKRAVALALAEHV
LASSPDPALAQRTRQLIAAGTHPDLQLVSFIPNRTGDKLRNEIVIEQVRE
ISQKLALTPQYGIAQVVIVDPADAINRSACNALLKTLEEPSPGRYLWLIS
AQPARLPATIRSRCQRLEFKLPPAHEALAWLLSEGVSERAAQEALDAARG
HPGLAAQWLREDGLAVRRAVAQDLEQIANGRAGAVDVAQRWTNDGQADQR
LRHAADLALAQASAGLTDPSRLHKLATWFDAANRTRDLLRTTVRADLAVT
ELLLSWREGERQPRSRGTR
>gid:109554  holC  DNA polymerase III holoenzyme chi subunit
MPRADFYLIAKPRFLNEPLRLVCELARKANDANLSTLILARDAEQAEALD
DLLWAFDDEAYVPHQIAGTDEEDELAPVLIAAPEFAAPSRPLVINLRDDP
YLGACDRVLEVVPADPAAREPLRERWKQYKALGLELTKYDM
>gid:109118  hrpA  ATP-dependent RNA helicase
MSRDRGRLLGLWSRWQGKPGNPQVRDAFEQALAASQAQRQARAEQQPAIT
LDDQLPIAREAERIIALIRDHQVVVIAGETGSGKTTQLPKLCLAAGRGAA
GMIGCTQPRRIAARAVAARVADELKTPLGTTVGFQVRFTDRVSDQSRIKF
MTDGILLAEIASDRWLSSYDTIIVDEAHERSLNIDFLLGYLKQLLHKRSD
LKLIVTSATIDTERFAQHFDNAPVISVEGRTFPVEVRYRPLEGDTGDCDE
GEHGSGRDGERSVNDAIVAAIDEITRIDPRGDVLMFLPGEREIRDAHQSL
ERRKYRETEVVPLYARLSAADQDHVFNPGPRRRLVLATNVAETSLTVPRI
RYVVDPGLARVKRYSPRQKLDRLHIEPISQASANQRMGRCGRIAEGICYR
LYAEADFAARPAFTDPEIRRSSLSGVILRMLQLGLGRIEDFPFLEAPDER
AVADGWQQLLELGAIDTQRRLTATGRQMARLPVDVKLARMLVAAQQHGCL
REMIIIAAFLGIQDPRERPPEAREAADNAHALFADARSEFVGILRLWDAY
RQVHEDLTQSKLRDWCNRHFLGFLRMREWRELHRQLRLLCEELGWSEEPA
NAMLAPLLAGASAPAREDGHNANRPTRGQLHRAARLAREGKPDPNAAPAT
PAPAAVNQPAEAAEASARSSERERAAAYQALHRALLAGLPTQIGHRTEKG
DYLAARQRRFVPFPGSALARKPPPWILAATLLDTQKVWGMTNAAIEPDWA
IAELPHLLARKHFDAHWSRAQGQVVASEQISLFGLVLAPKKPVHFGKIDP
AAAHDLFVRQGLVTGEINTRAAFVADNLKVLEQAREEEAKLRRAGIVADE
GWQARWYLDRIPAELHSASGLDAWWKTLPPDKRRSLHWSLNDLLPGEGSE
ADRFPKYFPLGDARLPLHYRFEPGAVDDGVTLDVPLHLLNALDPSRLSWL
APGFVADKASALIRSLPKAQRRNYVPAPDFGRAFYEAYSVASADDIRGEL
ARFLTKATGTQVTALDFDEQALDTHLLMNLRLRDDDGKVLAESRDLDGLR
ARFGERAGQAFAARAGRALAVEGLRDFPSAPIPEQVAGEAGVPAYPALVD
QGDNAALRIFADRNEAARAHPRGVRRLLEIALADKIKQARKQLPVSPKTG
LLYAAIESQERLRGDLVDAALNAVLADGLGAIRDPGAFAQRRDDASKRLF
GEAMERLKLAESILSAVAELKPLLEAPLMGWARGNLDDMEQQLRALVHAG
FLRDTPADALANYPRYLRAMILRTERAKRDPARDQARMLELKPFVDAVEE
AAARGLQNRQEWQALRWDLEELRVSVFAQELGAKSGVSAKKLSQRVAALR
S
>gid:106289  hrpB  ATP-dependent RNA helicase
MTDPAFPISPLLPRMRDSLAAHPRLVLEAPPGAGKTTQVPLALLDAPWLA
GRKIVMLEPRRVAARSAAQFMARQLGEPVGETVGYRIRFENKTSARTRIE
VVTEGILTRLIQDDPMLESVGVLLFDEFHERHLAGDLGLALALDVQAQVR
DDLRIVAMSATLDGERLAGFLEAPRLSSAGRSFPVEIAHFPARRDEALEP
QTRRAVEHALSTHPGDVLVFLPGQREIARVHGALQDVLDPAVQVLALHGE
LSVEAQSQVLQPGPQGRRRVVLATNVAESSVTLPGVRVVIDSGLAREPHY
DPNSGFSRLDVAAIAQASADQRAGRAGRVASGWAYRLWPQSQRLEPQRRA
EITQVELTGLALELAAWGSSALRFVDAPPSGALAAAHELLQRLGALTASG
GITALGRRMLALGTHPRLAAMLAQAGEATRVALACDLAALLEARDPLRQG
GDGLAARWRALAAFRQGRSAADANRGGLAAIDSAARQWRRRLRCDSVPPS
SVEAHALGDLLSHAFPDRIAARHPADPLRYLLANGRSARLFDHSDLRGEP
WLVASELRYEAKDALLLRAAPVDEAYLRRSLPERFVQQDVVQWDADKRAL
VARRQSSFDRIVLDSRPAGRVDPAHAAGALTDAVRQLGLDALPWTENLQQ
WRARVQSLRRWMPELALPDLSDAALLETLDTWLRPAFAGKTRLDALDEAS
LGEALKSALPWERRQSIDRHAPTRISVPSGMERPISYALDHAGQPLPPVL
AVKLQELFGLAETPRIADGRIPLTLHLLSPGGRPLQVTQDLKSFWATTYP
DVKKEMKGRYPRHPWPDDPWTAAATHRAKPRGT
>gid:108923  hup  histone-like protein
MAKTAAKKAAPKKAVKKVAATKTAKPAKAAAAKSAAPKPIKEALSKTGLV
AHIAESTQLAPKDVRAVLASLEATAHASLSKKGVGTFTIPGMLKLTTVHV
PAKPKRKGINPFTKEEQIFAAKPATTKLKSRMMKRLKDAAL
>gid:107077  hupB  histone-like protein
MNKTELIDGVAAAADISKAEAGRAVDAVVSEITKALKKGDAVTLVGFGTF
QVRERAERTGRNPKTGDSIKIAASKNPAFKAGKALKDAVN
>gid:108293  ihfB  integration host factor beta subunit
MTKSELIEILARRQAHLKSDDVDLAVKSLLEMMGQALSDGDRIEIRGFGS
FSLHYRPPRLGRNPKTGESVALPGKHVPHFKPGKELRERVSSVVPVDVAD
TAD
>gid:108900  int  integrase/recombinase
MLSVHVLGEGERVPMLHDEQGMPLFYPTLFATSQLRNAGAAVNTIRNKLA
DLLVLLRWEQANGRDLITEFRSGRFLSVADIVSLRDFAKLDMRELSSAGD
GEKERGVVVDFLEARVSSSRALATIGGQQHFNRISTFADYLEFTASVVTQ
HQNPSRTAQEIARMARTIRKHRPRGLAKQRADESDLRSPPSELVERFMAI
GAEGNPRNPFRHPEVQLRNAIIFGLLRHTGMRRGELLSLRLDQFDLGHEP
HVWVRRNQDDKHDSRRYQPVAKTKERPLPLPEMLANQIDRYMLKVRPKIG
PARRHPYLLVSHRKGSTWGKPLSASALSSQIFSRMRAVDSAFEAIHPHAF
RHHFNYELSVSIDKHNAKSRSGEDAEASPISEAKELDVRAFLNGHRSKTS
GATYNRRHIREASDKAARRVQAGRPKTSDNSKEDGDGPR
>gid:108218  int  phage-related integrase
MAKIKLTKSAVDAAQPQDKAIELRDTVVPGFLCKITPAGRKVFMLQYRTN
AGERRKPALGQYGELTVDQARTMAQEWLAEVRKGGDPSAAKNAARKAPTM
REFCHTFMEDYSKQRNKPSTQRGYQGVIDRCIIPIMGRMKVQDVKRPDVA
ALMKRLAYKQAEANRTFGVLRKMFNLAEVWGLRPDGTNPCRHVPMYPPGK
ETRLIVDDELVRIFRHLEHLEAEGLENYVIPLAIRLQFEFAARRSEICPL
EWAWVDLENRRVVWPDSKTGGISKPMSEEAYRLLSTAPRREGCPYVLPSP
NDPTRHMTHGEHYGGWTRVLKAAGVPHVGTHGIRHRATTDIANSGVPTKV
GMKLTGHKTVAMFMHYVHTEDKPVRDAAELVANRRLAITGASRSMEATA
>gid:107506  int  phage-related integrase
MPKPYFLRRPAGLYVRFFVPTDLQALVGSRYLVRPVRLCLGDAARLAVAR
TAVALSEAFDRMRRGASSMKDDLLSQALAALKGNEARPYTIKVGGMELSA
NGAEDHARLLEALKNLPLPSSTEVRKKDGPLFSERAAIHVREMKRVGRSA
TNIFDTEYSLGLFVALVGDKPVEDYKADDVRSFLEAIEHYPSNASKKAEF
AGLAPADILKKAKKGSYAKLGMRTKEKHRDRLAAFFNALASEDLISKAPH
KAIINRSKASTDEPSRNPFSKDELNALFEPSKFKIWASKYPHRWFGTLLG
LTTGARINEVAQLYVDDLDHVGGCWGIHIRAARADQRLKNAHSSRFVPLP
SALIEAGILIYRDEVKAAGFERLFPHLPYHSEHGYGDALGDQFRSYAKKC
GLTQRLKSFHCFRHTISNALINEHGVSVMTTQQITGHDLTLPPGLKHYVD
PPTVPARLQALEKFGPPVDLPAYQPKQFDLSFKQVRHMERRRQTAKQEVV
AVKNKTGWKN
>gid:106340  int  phage-related integrase
MTPPTPKLLDQVRGRLRLRHYSLRTEQAYVSWIRRFILASGQRHPGQMGQ
VEGEAFLTKLAARGQVSAGTQEVARLLSMLEGNCRLVAGLLYGSGMRLLE
CLRLRSKAMDIVRGEIAVRDGWDASICGMWSAGRAQPS
>gid:107494  int  integrase
MKKDQIENNLNERQCSVSRVISGNLVLEGFTVQGPNADQRFVRMIKQLRM
AQGLTNSSSPNTSGDLHLLEDEIKKYRRDIKREKKGLGGIVGLTQTLSIT
ASVIGNIPVDQINQDHIRSVLDSFQWWPKNAKSRKEFRGMSTTEVLAIGR
SRKPVTIAASTYNNHLARICAFFQRQIVMGVIKNSPCIGVATRIDTSTER
KSRRPLYIEELAAIFEPIEFKRWVKDRPERWWVPQLCLYTGARASEIAQP
RLADIATIDGISCITIRVTQKEQRVKNKPSVRVIPLAQPLIDAGFLIYVE
EVRATKHPRLFPHLDAGYKLYEGEAFYLGYGDKVIRDFCKRLKQMGFDKG
IGSHAFRHTFSTNLTKHGIGVAEVALVTGHAVQGSKPDSVHVPGLQTYID
RDQCPPELKAIYSTVSRFNPGVELPAYTPGQFAKALSNKKLFKA
>gid:108624  int  phage-related integrase
MGRGRKRKFNPDIPRHIDQDALPKGVYWADGRWYIIEPHPEGGTTRKRTI
AYADARLSDLHAAREASRGAGLVGSLQYLANAFKLSTEYRDLSRSTRDDY
DRHAEVACGYVLKDGSLFGRLLVDRLSVPLVQRLVEALAKGREASAVQPA
LLARPSTANHTLRYLHRLFAWGIRIGHCKTNPASGVRGAKERADAKMPDP
QSFMAVLEFAKSRAALPLHAKGSVPPYMPAVMVLAYNARLRGVEVTDLTD
ADALQQGVRCTRRKGSRDNITAWNDDLRWAWIWLRDYRAQRIQAHRRPVP
LRPEQRGLLVTQTGTRLARSTLKTAWQRLITAAIEAGVIAEESRFTLHGL
KHRGITDTRGTRAHKQDAAGHVTSQMTHRYDHELQVIAPPALPTDDALAG
ALAFVDLVKPSDT
>gid:107660  intA  phage-related integrase
MPRTHHRLSPTFVNAAKKGNAKPGRYSDGGNLALRVGEGGAASWTFEFVR
AGQRRELGLGSVGVVGLADARARAASLRSDLAAGITPQTARARTQANIGL
TFRQATEKYLSGKVSAMKNDKHKAQWRSTLETYAYPYIGDLDVKQIDTPH
VLAALDPIWITKNETASRVRGRIERVLAWATVSGHRTGDNPARWRGHLSE
ALAAPAQAKKVEHHAALPYAELPAFLSALQGQIGIGAHALRFAILTATRT
GEVIGARWDEFDLEENVWTIPVERMKAGKEHRVPLSSAALEILSAMRNFA
TGSGDGFVFPGAKNGSPLSNMAMLATLKRMGRTDITAHGFRSTFRDWAGQ
ETHHPREVIEHALAHQLKSKTEAAYARGDLLRKRRALMDEWATYAA
>gid:108179  intS  phage-related integrase
MLTDTALRNLKPKSKIYKVFDRDGMYVAVSTAGTITFRYDYRLNGRRETL
TLGRYGPAGMSLAMARERLLDAKRSVVQGLSPALEKQRAKRRLTAARTFG
EMMERWLADARMADSTRAMRKHIIDRDILPVFQNRKLKEVTPDDLRALCN
KVKARGAPATAIHVRDIVKQVYAFAILHGERLANPADDVKAASIATFVPK
DRALSPAEIRLAFHQLETIASYPTIRLALRLILLTMVRKSELVEATWNEV
DFENATWTIPKVRMKGRKPHVVYLSRQAMDIFVALHTCAAGSRFVLPSRY
DPDRCMSHATLNRVTQLIGSRAKEAGLPLDPFTVHDLRRTASTLLNEVGF
NGDWIEKCLAHEEGSRTSRSVYNKAEYAGPRRHMLQEWADMVEAWLDGRT
HVPKLVPEDVSVPVLSPAL
>gid:108282  intS  phage-related integrase
MSRMRAKNMLTDTKLRNLKPRDRLYKENDRDGLYVAVTPAGAISFRYNYS
IHGRQETITFGRYGVGGITLAEARERLGEAKKMIAAGKSPAKEKARDKAR
VKDAETFGAWSEKWLRGYQMADSTRDMRRSVYERELKPKFGNQKLVEITH
EDLRALTDAIVERGAPATAVHAREVMSQVYRWAIERGQKVANPAELVRPT
TIAKFEPRDRALTPDEIGLMYQYMERIGTTPSIRAAAKLLLLTMVRKSEL
TNATWSEINFSDALWTIPKERMKRRNPHLVFLSRQALDIFIALKTFAGGS
DYVLPSRYDSDLPMSSATLNQVLTLTYRLAQKEGKSLPKFGPHDLRRTAS
TLLHEAGYNTDWIEKCLAHEQKGVRAVYNKAEYREQRTAMLQDWANMIDE
WTMKRPSA
>gid:107623  lig1  DNA ligase
MTASPDPAQRIDALRRRIEDANYRYHVLDEPQMADADYDKLMRELEALER
AHPELASADSPTQRVGHLAASRFAEVRHAMPMLSLGNAFSDEEVTEFVRR
ISERLEVRQPLFSAEPKLDGLAISLRYENGEFVQGATRGDGATGEDVSAN
LRTVKAIPLRLRGEGWPRVLEVRGEVYMPRAAFEAYNAQMRAQGGKILAN
PRNGAAGSLRQLDARITAQRPLSFFAYGVGEVSEGALPQAHSAILAQLRA
WGFPVSALVEVVQGSDGLLAYYQRIGEARDGLAFDIDGVVYKLDDLAGQR
EMGFVSRAPRWAIAHKFPAQEQSTTVEAIEIQIGRTGAATPVARLKPVHV
AGVIVTNATLHNADQIARLDVRVGDTVIVRRAGDVIPEVAAVVADQRPPG
TQAWQMPTQCPVCGSEIVREEGQAVWRCSGELTCPAQRKEAFRHFVSRRA
MDVDGLGEKFIEVLVDSGLVKGVADLYLLSVDQLLQLRLISTADSPHAFL
REAREHLASGAYAQLEASVVGIGVDLAGERDVPQTWQADLLRAGLPSFDW
NRKKIATKWAENLIEAIEISRDTTLERFLFALGIEHVGESTAKALSAWFG
DLELIRHLPWPLFKRVPDIGGEVARSLGHFFDQAGNQKAIDHLLARKVRI
GDTHPPSPKLRGELSLANLLEDLEIPKVTPIRAAQIATAFGSIDALRNGG
PEPLVEAGVPQSVAESLAAWLLVPANDTLAVNAQRKLSELLAMLPEAGEE
KTGPLDGQTVVITGTLAALTRDAAKQRLEALGAKVAGSVSKKTAFLVAGE
EAGSKLDKAQSLGVEIWDEARLLAFLGDHGQQP
>gid:107337  lig2  DNA ligase
MKRFAALYRTLDRATGTLDKRAALVAYFRDAPALDAAWALYLLAGGKVAS
ARMRIASSGELREWITDTAGIADWLVADSYDHVGDLAETLALLLDDPVTE
AADLPLAQWIEQRLLPIANQDVDVRKACIVQAWRSLAFDERLAFNKLLTG
ALRVGVSQRLVQQALAELSGVDIARIAQRMLGSWRPHPTYLAELLTSEEL
PGDRQQPYPFFLASPLEAEVETLGAIDDWLLEWKWDGIRLQLIRRAGEAA
LWSRGEERLDGRFPEIEHAAMQLPDGTVIDGELLAWQPEQPLPMPFTALQ
TRIQRLKPGPKTLAAAPARVVAYDLLELAGEDLRERPLRERRALLEGVLF
ALADPRIVASPLVQVSDWQAAAHVRVEARERGVEGLMLKRASSMYQSGRR
RGDWWKWKIDPLTIDAVLLYAQAGHGRRSTLYTDYTFGLWHEGQLVPIAK
AYSGLDDKEILQLDRWIRANTTERFGPVRAVTAHHVFELGFEAVNRSARH
KSGIAVRFPRILRWRHDKPMAEADHLSTLQALAR
>gid:108410  lig3  ATP-dependent DNA ligase
MSLSEYRRKRSFDKTREPEPGKTLPPGQRAIFVVQLHHASRRHYDFRLQV
GDALKSWAVPKGPSYDPKVKRMAVEVEDHPVDYASFEGEIPRGEYGGGHV
AQFDHGVWATAGDPEAQLAKGHLRFELFGNKLKGGWHLVRSGKPARQPQW
LLFKEDDAYAGTLEADDLLADVAATPADDVRRAGAGKTQRKGLTTVPPPA
AKKRGTWAKQALALSGARRAEMEDAPFAPQLAKLVQSPPEGAQWLHEIKW
DGYRILATVTNGKVRLWSRNALEWTDKTPEIADAIQSLGLRNAQIDGELI
AGRGTKEDFNLLQATLSGERQVPLALAVFDLLHVDGVDISEAPLRERKQL
LQQVLQAAPHMHLAYSSHVEGDGTEAFRVAGEQHFEGIISKRADRPYRDG
RSDDWRKTKQLASQEYAVVGYTAPKGSRSGFGSLLLATPDPVHGWLYVGR
VGSGFSDALMREVTQQLEGGGRKPTAHIPTPDTDLRGATWFAPRFVVEVF
YRGIGGQQLLRQASFKALRPDKRIADLADSDAGNGRSTSSSTQRSAKTRA
AKNAATQVAKRAATRATAPARKSAVATPSSAALPTLSSPTKLIYPDIRAT
KGDVWDYYQAVMDHLLPQIVGRPLSIIRCPSGAEKPCFFQKHHTAGLERV
SSVKLTEETGTNAYYLVIEDAPGLLELVQFNALEFHPWGSHADRPDVADR
VVFDLDPGPDVPFAEVKRAANDIRKLLAQLELESFLRVSGGKGLHVVVPL
NPGCDWEVTKRFAKGFADALAQAEPQRFIATATKRLRNKRIFVDYLRNGR
GATAVASYPLRGRPGAPVALPLAWSDLSKLQRADAFTLRDVPEKLRRRRK
DPWADMDRIRQNLARWAEQDED
>gid:106557  mdcD  delta subunit of malonate decarboxylase
METLRYRFDGQRGARAGLEHALVGVVASGNLEVLVERVPLEGAMEIDILT
AARGFGAIWQAVLDDFAARHPLRDVRISINDVGATPAVVSLRLEQALDVL
QGADA
>gid:108854  mfd  transcription-repair coupling factor
MPSPTFPSPPLPKSGQLRAYWRAPSSPTALAWSIARAAEAHAGPLLVIAR
DNQSAHQIEADLHALLGEHSALPVVPFPDWETLPYDQFSPHPEIISQRLA
ALHRLPGLTRGVVIVPVQTLLQQLAPLSYIVGGSFDLTVGQRLDLDAEKR
RLESAGYRNVPQVMDPGDFAVRGGLLDVFPMGADTPLRVELLDEDIDSIR
VFDPESQRSLDKVDAVKMLPGREVPMDDASIERVLACLRERFDVDTRRSA
LYQDLKSGLAPSGIEYYLPMFFAKTATLFDYLDKRVLPVIATGVSNAADA
FWTQAQNRYEQRRHDVERPLLPPDELYQSPDALRERLNKLARIEVWASDH
ARIDEAAPLGDQPLPPLPVAAKDAPAGQALASFLSHYPGRVLVAADSAGR
REALMEVLAAAQLKPELVADVPAFLAGTLRFGITVAPLEDGFALDQPQIA
LLTERQLFPERANQPRRARRVGREPEAIIRDLGELSEGAPIVHEDHGVGR
YRGLIVLDAGGMPGEFLEIEYAKGDRLYVPVAQLHLISRYSGASAETAPL
HSLGGEQWTKAKRKAAEKVRDVAAELLEIQARRRARAGLALQVDRAMYEP
FAAGFPFEETGDQLAAIDATLRDLGSSQPMDRVVCGDVGFGKTEVAVRAA
FAAASAGKQVAVLVPTTLLAEQHYRNFRDRFADYPMKVEVLSRFKSTKEI
KAELEKVASGDIDVIIGTHRLLQPDVKFKDLGLVVVDEEQRFGVRQKEAL
KAMRANVHLLTLTATPIPRTLNMAMAGLRDLSIIATPPPNRLAVQTFITA
WDNTLLREAFQRELSRGGQLYFLHNDVESIVRMQRDLSELVPEARIGIAH
GQMPERELERVMLDFQKQRFNVLLSTTIIESGIDIPNANTIIINRADRFG
LAQLHQLRGRVGRSHHRAYAYLVVPDRRSMTSDAEKRLEAIASMDELGAG
FTLATHDLEIRGAGELLGEDQSGQMAEVGFSLYTELLERAVRSIRQGKLP
DLDAGEEVRGADVELHVASLIPEDYLPDVHTRLTLYKRISSARDTDALRE
LQVEMIDRFGLLPDPVKHLFAIAELKLQANALGVRKLDLGENGGRLVFEV
KPAIDPMTIIQMIQKQPKIYTMDGPDKLRIKLPMPEGADRFNAARGLLAA
LSPG
>gid:106309  mttC  type V secretory pathway protein
MQLIDIGANLTHDSFDRDRDAVLQRARDAGVAQLVITGASREHSPLALQL
AQQHPGFLYATAGVHPHHAVEFTAECEAEMRTLQAHPQVVAVGECGLDYF
RDFAPRPAQRKAFERQLQLAADNGKPLFLHQRDAHEDFIAIMRSFEGRLG
AAVVHCFTGTRDELFAYLDRDYYIGITGWLCDERRGAHLRELVRNIPANR
LMIETDAPYLLPRTLKPMPKDRRNEPMFLSHIVEELARDRGEDVAVTAAN
STAAARAFFRLSAPASTTSG
>gid:108401  mutL  DNA mismatch repair protein MutL
MAIRQLPEILINQIAAGEVVERPASVVKELVENALDAGATRVDIELEEGG
VRLIRIRDNGGGITPDELPLAVSRHATSKIASLDDLETVATLGFRGEALP
SIASVSRFTLTSRRHDAEHGSALEIDGGRLGEVVPRAHAPGTTVEVRELF
FNVPARRKFLRAERTELGHIEEWLRSLALARPDVELRVSHNGKPSRRYKP
GDLYSDARLGETLGDDFARQALRVDHSGAGLRLHGWVAQPHYSRASTDQQ
YLYVNGRSVRERSVAHAVKMAYGDVLFHGRQPAYVLFLELDPARVDVNVH
PAKHEVRFREARLIHDFVYRTLQDALAHTRAGATPNSIGGDGTGYTAATS
GGMGGIASGGVPGNGGASIGSGGAYSYASWTPSQTPLGLRVDEARAAYSA
LYAPPPSSAQQSAGMPNMAGTGLPATAQDSGVPPLGYAIAQLHGIYILAE
NAEGLIVVDMHAAHERIGYERLKHAHDSIGLHAQPLLVPMTLAVGEREAD
TAEREAETLATLGFEITRAGPQSLHVRSIPALLANAEPEALLRDVLSDLR
EHGQSRRIASARDELLSTMACHGAVRANRRLTVPEMNALLRDMEATERSG
QCNHGRPTWARFTLSDIDRWFLRGR
>gid:110282  mutM  formamidopyrimidine DNA glycosylase
MPELPEVETTLRGLSPHLVGQRIHGVILRRPDLRWPIPEQIERLLPGATI
TNVRRRAKYLLIDTDAGGSVLLHLGMSGSLRVLPGDTLPRAHDHVDISLQ
SGRLLRFNDPRRFGCLLWQSGTQAHDLLAALGPEPLSDAFTGDYLHALAQ
GRRAAVKTFLMDQAVVVGVGNIYAAESLHRAGISPLREAGKVSLERYRRL
ADAVKDILAYAIQRGGTTLRDFISPDGAPGYFEQELFVYGREGEACKQCG
RVLKHATIGQRATVWCGSCQR
>gid:107299  mutS  DNA mismatch repair protein
MGRRNRRATRHGESSKSALIDCHIPEEHFLQSTDPKEKTKNAGGAVEHTP
LMKQFFAAKSDYPDLLLFFRMGDFYELFYDDARKAARLLDITLTQRGSSG
GAPIPMAGVPVHAYEGYLARLVALGESVAICEQIGDPALAKGLVERKVVR
IVTPGTVTDEALLDERRDTLLMAISRSKQGYGLAWADLAGGRFLVNEVDS
ADALEAEIARLEPAELLVPDEDNWPEFLRGRIGVRRRPPWLFDADSGRRQ
LLAFFKLHDLSGFGIDDKPCATAAAGALLGYVEETQKQRLPHLTSIAMEV
ASEAISMNAATRRHLELDTRVDGDTRNTLLGVLDSTVTPMGGRLLRRWLH
RPLRLRDVLVQRHHAVGTLIDAAADADLREAFRALGDLERILTRVALRSA
RPRDFSTLRDGLALLPKVRAILAPLDSPRLQALHAELGEHDATAHLLISA
VAETPPLKLSDGGVIATGYDADLDELRRLSTNADQFLIDLEQRERASSGI
ATLKVGYNRVHGYYIEISKGQAEKAPLHYSRRQTLTNAERYITEELKSFE
DKVLSARERSLSREKLLYEGLLDALGTELEGLKRCAGALSELDVLAGFAE
RAQALDWSQPELDSAPCLRIERGRHPVVEAVREQPFEPNDLDLHSDRRML
VITGPNMGGKSTYMRQNALIVLLAHIGSYVPASRAVIGPIDRILTRIGAG
DDLARGQSTFMVEMAETSYILHHATPQSLVLMDEIGRGTSTYDGLALADA
VARHLAQTNRCYTLFATHYFELTALADASHAGGASGIANVHLDAVEHGER
LVFMHAVKDGPANRSFGLQVAALAGLPKAAVTQARRRLAELEQRGGESHS
AQMAPTALDAPQQFGLFTAPSSAAQEALQALDPDELTPKQALEALYRLKA
LL
>gid:108040  mutT  7,8-dihydro-8-oxoguanine-triphosphatase
MPYTPIVATLGYLLSPDGTQVLMIHRNARPGDQHLGKYNGLGGKMEPDED
VLACMRREIREEAGVDCGRMQLRGTISWPGFGKQGEDWLGFVFLIHSFEG
TPHTSNPEGTLEWIAIERMDQVPMWEGDRNFLPLVFDGDPRPFHGVMPYR
DGRMQSWTYSRI
>gid:107995  mutT  7,8-dihydro-8-oxoguanine-triphosphatase
MSVTEQRWHPDVTVATVVVRDGRFLQVEEAIGGRLLLNQPAGHLEPNESL
LDAAVRETLEETGWDVRLTQFIGTYQWVAPNGQCFLRFAFVADALTHHPD
RSLDTGVVRALWMTPDELRASIERLRSPLVWEVVADYLAGQRHPLSLVRH
VA
>gid:108549  mutY  A/G-specific adenine glycosylase
MPADHATPSAHAFVDRLLHWFDGHGRHDLPWQHPRAPYRVWLSEIMLQQT
QVAVVIPYFHKFMARFPALADLAAADNDTVMAQWAGLGYYARARNLHAAA
KQCVALHGGQVPRDFDALLALPGIGRSTAGAILSQAWNDRFAIMDGNVKR
VLTRFHGIAGYPGLPAIEKQLWQHAIIHVAHVPAGRLADYTQAQMDFGAT
LCTRAKPACVLCPLQTDCIARRNGLVDALPTPKPGKQLPEREATALLLQN
AEGHILLQRRPPTGIWASLWTLPQAETESGMRAWFAAHIDGNYERADEMP
PIVHTFSHYRLHLQPWRLRKVALRPAVRDNDDLRWVAPADLASLGLPAPI
RKLLDAL
>gid:108871  nfi  endonuclease V
MQTNDPVFAGWDGSVIQARQLQRQLARRVVLDDAVSATPQLLAGFDVGFE
DDGQTTRAAAVLLDAQTLLPLETHIARVPTSMPYVPGLLSFRELPALLQA
LALLSRTPDLVFIDGQGIAHPRKLGIAAHFGVVTGLPCIGIAKQRLAGSF
AEPGPERGDHTPILLGGSQIGWALRSKPRCNPLIVSPGHRVSMQGALDWT
LRTLRAYRLPEPTRLADRLASRRGEVTVPAGYSGHLL
>gid:107578  nth  endonuclease III
MRKPEIQEMFERLRELNPHPTTELEYTTPFELLIAVLLSAQATDVGVNKA
TRKLYPVANTPRDILDLGEEGLKRYISTIGLFNAKAKNVIATCRILLERY
GGDVPHDRAALEALPGVGRKTANVVLNTAFGEPAMAVDTHIFRVSNRTGL
APGKDVRAVEDKLVKVIPSEFLHDAHHWLILHGRYVCKARKPDCPGCVIH
DLCRYRDKTPPAPERQTKTRS
>gid:106499  nudC  NADH pyrophosphatase
MSESLFSASGFAFTHAPLDRGDVLRDDPDALARLWPQGRVLLIDAKGTAL
ADADGQPLLLDGAELGDGPEAAIFLGLRDAVGWFCLPADIVAVQAPQRID
LRQAAADWPAEIATAFAYARAMLHWQSRTRFCGVCGGAIAFRRAGFIAHC
TQCQTEHYPRVDPAIIVAVSDGARLLLGRQASWAPGRYSVIAGFVEPGES
LEQTVVREVYEETRVHVQDCRYLGAQPWPFPGALMLGFTARAAATEVPQV
TGELEDARWVSHAQVSAALAGEGDIGLPPRISIARALIEHWHRAHG
>gid:109063  nudE  ADP compounds hydrolase
MRRMSSRLPIIHKITDLGDGPFRRQQLDLEFSNGQRRIYERQLSQGHGAV
VVVPMLDAQTVLLVREYAAGVHRYELGLVKGRIDAGETPEQAADRELKEE
AGYGARQVQVLRAMTLAPTYMSHQSWLVLARDLYPERLVGDEPEEMDVVP
WPIARLDELMLREDFSEGRSLAALFIAREWLERNP
>gid:106487  nudH  probable (di)nucleoside polyphosphate hydrolase
MIDPDGFRPNVGIVLMRQDGQVFWARRVRRDGWQFPQGGMNTDETPVEAM
YRELREETGLLPEHVELLGATPGWLRYRLPSRAVRRNERQVCIGQKQVWF
LLQFTGDESHLKLDHTDTPEFDHWRWVDFWYPVEHVVMFKRGVYARALRH
LAPLAQSLAGPAAVGAMPERALEAWLPGSSAAGHDSPRKRPRKRNGARAM
RINND
>gid:108819  ogt  6-O-methylguanine-DNA methyltransferase
MSTLHYDTFPSPIGALSVAADDKGVHHILFAQNRYDAIGRARWLHDPDAP
LVREAREQLLDYLHGGRRSFDLPLAPVGTPFQLTVWRTLAQIPFGQTWSY
AQLAQAVGKPAASRAVGAANGRNPLPIVLPCHRVIGANGALTGFGGGLPT
KQALLQLEGWSPRSAARADDLFAPASAMRTG
>gid:108239  orf8  plasmid-related protein
MPHFSWSITMSFEIDTAADNAPVQGDLLGAEPSPLELSLTEFVAEFGDEL
LESLNRANPPVYSGQPRAHRQRIVAGLKRRLFPAQAEVVHAAAELLIDRG
ERAAIVNGEMGCGKTTVGIATAAVLHAEGYRRTLVLSPPHLVYKWRREIL
ETVAGARVWVLNGPDTLVKLIKLREQLGERPTGQEFFVLGRVRMRMGFHW
KPVCAVRRTRHGDVATCPDCGHVVTDLDGEPVHPAMLDEDEHRRKCGHCA
APLWTLMRPESLSGNAQSSAVAKALQRIPTIGEATAQKLMRKFGDGFLAS
MLGDNIHEFINLMDANGELVFSDRQATRMERAMANMEFGFGEGGYQPTEF
IKRYLPQGTFDLLIPDEAHEYKNGGSAQGQAMGVLAAKCRKALLLTGTLM
GGYGDDLFYLLFRALPGRMIEDGYRPAKNGSMTSAAMAFMRDHGVLKDIY
SESAGTAHKTAKGSKTSVRTVKAPGFGPKGVLRCVLPFTIFLKLRDLGRD
VLPPYDEEFREVPMAPAQASAYSALSIALTSALRQALARRDTTLLGVVLN
VLLAWPDTCFRAENVVHPRTRDTLAFVPAQFHDAQAMPKERELIDICRQE
KAQGRKTLVYTVYTGTRDTTSRLKRLLEVEGFKVAVLRASVDASRREDWI
AEQLDRGIDVLITNPELVKTGLDLLEFPTIVFMQSGYNVYSLQQAARRSW
RIGQTQCIKVIYLGYAASSQMSCLKLMAQKIMVAQSTSGDIPESGLDVLN
QDGDSVEVALARQLVN
>gid:109224  orfS  cointegrate resolution protein S
MISVGQYLEAATRPNTQRAYAAATRHFEVEWGGHLPATAEQVARYLAAYA
GQLALNTLRHRLAALAQ
>gid:109223  orfS  cointegrate resolution protein S
MLKGIQTVHPSQEKRATPLQLTQLGQVVDWLDGAATAAGTRDDAAGRLRH
LRDRAFVLLGFWRGFRGDELVRLQVQDLELVAGEGMTCFLPHSKSDRQHA
GTTYKVPALSRWCPVAATMTWVAAAALHEGPLFRAVNQWGGIAAAPLHPN
SLVHLLRRIFREAGLSSPNDYSGHSLRRGFAGWANANGWDVKALMEYVGW
RDVQSAMRYLDGADPFARQRIEASLPPATPPLLALAAPAPDPVPTTAVEA
TVTLTRFNSRVRGLAKAHRLIEQICLQPHQAQRLNTDGTRYRLAIAAVDE
AAFEETIAMLLDEMHRIADNHQCFLAVALRDEAGERHWD
>gid:108429  parA  resolvase
MVARVYLRVSTDAQDLERQEGIITATKAAGYYVAGIYREKASGARADRPE
LLRMVADLQPGEVVIAEKIDRISRLPLPEAERLVASIRAKGARLAVPGVV
DLSDLAAEAQGVAKIVLESVQDMLLKLALQMARDDYEDRRERQRQGIYLA
KKAGRYKGRRADPKLRAQVIALRCIGRSIADTAKLAGCSMAQVKRIWATR
DVSQAEAARHGAFVEDALSEPDARAAADVEQGAAELPEPAFVDDEGPMAP
PAHVSTVRLRDVLASLTDAELAALPGRVSKEEIERRRHRVGAQRESAPAL
FR
>gid:107437  parC  topoisomerase IV subunit A
MTDLTRPTFHGFEQLPLREYAERAYLDYSMYVVLDRALPFLGDGLKPVQR
RIVYAMSELGLNAAAKPKKSARTVGDVIGKYHPHGDSACYEALVLMAQPF
SYRYPLIEGQGNFGSTDDPKSFAAMRYTESKLTPIAEVLLGELGQGTTDW
APNFDGTLEEPTWLPARLPHLLLNGTTGIAVGMATDVPPHNLNEIVSALL
RLLDDPDASVADLCEHVLGPDYPTTAEIITPAADLRNIYETGHGSVRARA
TYKKEHANIFIDALPYQVSPSKVIEQIAQQMRAKKLPWLEDIRDESDHTS
PVRVVLVPRSNRVDAEQLMGHLFVTTDLERSYRVNLNVIGLDGRPQVKNL
KHLLSEWLSFRSDTVTRRLNHRLQKVERRLHLLEGLLIAFLNLDEVIRIV
RSEDEPKPVLISRFALSEEQAEYILETKLRQLARLEEMKIRGEQEALAEE
RAQIMAILESKTKLKKLIKDELTADAKKFGDARRSPLVQRGAAQAIDETE
MVASEPMTVVLSQKGWVRAAKGHEVDPAGMSYRDGDGLLAAVRSRSTHQV
AFLDSDGRAYSTAVHTLPSARGNGEPLTGRFSPASGAAFQVMASADNATR
FVLASSHGYGFVTRFENLTGRNKAGKAMLNLTTGSHVLTPAQVSNPQTDR
IVAVTSAGNLLAVPATDVPELDKGKGNKIIEIPKAKLGTERVVAVVAVAP
GNTLLVRSGARTMSLSFKDLDTYVGARASRGSLLPRGWQKVDGLEVQ
>gid:107710  parE  topoisomerase IV subunit B
MNTRYNAADIEVLSGLDPVKRRPGMYTDTARPNHLAQEVIDNSVDEALAG
HAKQVEVTLYKDGSCEVSDDGRGMPVDIHPEEKIPGVELILTRLHAGGKF
SNRNYTFSGGLHGVGVSVVNALSTKVELFIKREGSEHRMEFRDGNAASKL
EVVGTVGKKNTGTRLRFWADPKYFDTPKFNVRALRHLLRAKAVLCPGLTV
KLHDEATGEQDSWYFENGLRDYLKGEMAEHELLPADLFVGSLKKDTEIVD
WAAAWVPEGELVQESYVNLIPTAQHGTHVNGLRSGLTDALREFCDFRNLL
PRGVKLAPEDVWDRVTFVLSLKMTDPQFSGQTKERLSSRQAAGFIEGAAH
DAFSLYLNQNVEIGEKIAQIAIDRASARLKTEKQIVRKKVTQGPALPGKL
ADCISQDLSRTELFLVEGDSAGGSAKQARDKDFQAILPLRGKILNTWEVA
SGSVLASEEVHNLAIAIGCDPGKDDITGLRYGKVVILADADSDGLHIATL
LTALFLQHFPALVAAGHVFVAMPPLFRVDVGKQVFYALDEEEKTTLLDKI
AREKMKGQISVTRFKGLGEMNPQQLRESTIHPDTRRLVQLTIDDGEQTRS
LMDMLLAKKRASDRKQWLETKGDLASLEV
>gid:108043  phaE  PHA synthase subunit
MASSGSDNGNFDDKARQYWAAWGDAMRHGQAGAAAQQPPPTSGSDAPQDW
RKAVDWWSQLLPTQAAPQAQEAIDRFRTQAGDWFGTMQQVAAQFAGRDTS
ANEVADAWRQAVQGQGEQLMQWTLGSLRGGSPGGFDPWLQDAAQALQKWR
EENAPWLDMPAFGLNRNHQSRLQKLARAQQEFQAQSEAYGEQLKAAIEQA
FARFASRLSEHESSGSQLTSARALFDLWIEAAEESYADVALSEQFRKVYG
GFANAHMRLRAALQEEVEQLSERFGMPTRSEMDAAHRRIAELERLVRRML
RNAASPASKPAAAAQPAPVTPAAGKRASSAGASAVARPAGAERAVKQGAS
KKAPVDKTATKKAAPKKAPAVSKPTKTTSPKKHGGATSSPELAARKPVAA
KKRGAQA
>gid:107474  phr  photolyase
MLRSLIVCCRVRMSYAIVWFRRDLRLEDNPALRAALDAGHHPIPLYIDAP
HEEGEWTPGAASRTWRHRSLAALDGALRALGSGLVIRVGNSAQVLDEVIA
QTGAVAVYWNRKYEPATQPRDAQIKRDLRERGIEVQSCNCALLFEPWQLS
TQQGGPYKVFTPFWRNALTQLQLPAAIGAPRSLPPLPASLKTDALDTLQL
VPGLQWDQGFWEHWQPGEAGAHEMLEIFIDGALSGYRENRDRPDRVGTSQ
LSPHLHFGEIAPWRIANTLETHRTARNGAEIDAYIRQLGWRDFAYHLLHH
FPDTTNQNLNPRFAGFDWATVDPVALQAWQRGRTGIPIVDAGMRQLWHTG
WMHNRVRMIVASFLCKHLRIHWIEGARWFWDTLVDADLANNTLGWQWVAG
TGADAAPYFRVFNPVTQAEKFDPQAAYITRWVPELGKLAVKERFAPWLHP
LSLARLAPEYPRTPIIGLAEGRDAALAAYAKTRG
>XACa0006 pin, invertase/recombinase like protein
MALIGYARVSTAEQDTALQTDALRKAGCERVFEDTASGAKADRPGLADAL
AYLRNGDVLAVWRLDRLGRSMPHLIETIGALEARGVGFRSLTEAIDTTTP
GGRLIFHVFGALGQFERDLIRERTKAGLSAAAARGRKGGRKPVITADKLQ
QARKHIANGINVREAATRLKVSKTALYAALQSTSAGVIPPLLAVARSGAK
RPSPTSWQA
>XACb0061 pin, invertase recombinase-like protein
MALIGYARVSTAEQDTALQTDALRKAGCERVFEDTASGAKADRPGLADAL
AYLRNGDVLAVWRLDRLGRSMPHLIETIGALEARGVGFRSLTEAIDTTTP
GGRLIFHVFGALGQFERDLIRERTKAGLSAAAARGRKGGRKPVITADKLQ
QARKHIANGINVREAATRLKVSKTALYAALQSTSAGDLQK
>gid:110123  pknB  serine/threonine kinase
MRAPLPELPAGTRFGAWAIDRLIGAGGMGQVYLGHRADGAYEREVAIKLV
AADALDAQGRALFEFECRLLAQMVHPAIAQIHDVGTDAHGQPYLVMEYLR
GEPITWWCDEHRLSLHARVLLMLRVGEAVQHAHQKGVIHRDLKPSNVLVS
EIDGRPMPGVIDFGIAVDATNPGMTYAHDRGTPGYMSPEQARGAQDVDAR
SDIYALGAMFYELSCGLAPVAGRDGVPQPPSQRVAAVPADARARICAARA
TTYQKLHEQLRDGLDAIVLRALEPQPGARYASVSALLDDLHRWLDGYPPR
ALQASRWLRLRKFTQRHRSGVAAAGLAAIAVVAGLGATLWSLQQAARDAQ
RAQVTGDFMSSVLSSVDPGVARDLDKTLMLRVMQQASQRAARELADQPDA
RTQVELTLGRTLIALANFPQAVEHLRIAQTLAQQTAGAGSLPALKAASML
GQALTGDGKPGDAERVLRQGIAQAARGDAQQQAMGNELRAWLAWALREQG
RMRDALAESRHAYVAALANASSSADQRIDAGNSYAAMLADNGQLVPAIAL
QRSLLRERMAMRGPEHPLVVSMRNSLAVFLLMQRDFAGAEAELVPLLRVT
RTLYGESAADTLMIEGNLAGALRQQGKVAEAGPYYRRALERARAAFGNDA
PVTIAYRSNHAFWLLDDGQVDASLAEQRAALTASERVLGRAHPQTAEILR
GMSEAERRLGQTVAARDHARQALQILTGIFGDADAPLRAARQTLASATPP
ADSREAEATTAVAQR
>gid:110106  polA  DNA polymerase I
MSRLVLIDGSSYLYRAFHALPPLTNAQGEPTGALFGVVNMLRATLKERPA
YIAFVVDAPGKTFRDDLYADYKANRPSMPDDLRAQVQPMCDIVHALGIDI
LRIDGVEADDVIGTLALQGAADGLAVTISTGDKDFAQLVRPGIELVNTMS
GSRMDSDAAVIAKFGVRPEQIVDLLALMGDTVDNVPGVEKCGPKTAAKWL
AEYDSLDGVIANADKIKGKIGDNLRAALPRLPLNRTLVTIKTDVTLASGP
RALDLREPNAEALAVLYARYGFTQALRELGGAAAAQAGLLAEPVALGAAA
ATARTEPGRARGTGFVSGPVNAAVDLDPSLSAPGQYDTILTQEQLDSWIA
RLRAAGQFAFDTETDSLDPLQADLIGLSVAAEPGQAAYLPFGHNFPGVPV
QLDRTQALAQLAPLLTDPAVRKLGQHGKYDLHVMRRHGVELAGYADDTLL
ESFVLNSGSARHDMDSLAKRYLGYDTVKYEDVCGKGAKQIPFAQISLEDA
TRYAAEDADITLRLHRVLGPRLASEPGLERVYRDIEMPLVDVLARIEANG
VCVDAAELRRQRADLSKRMLAAQQKATELAGRTFNLDSPKQLQALLFDEL
KLPAVVKTPKGQPSTNEEALEAIADQHELPRVILEYRGLTKLRSTYTDKL
PEMIHPQSGRVHTSYHQAGAATGRLSSSDPNLQNIPIRTEDGRRIRRAFV
APPGRKLIACDYSQIELRIMAHLSGDPGLVGAFESGADVHRATAAEVFGR
TIDTVSADERRAAKAINFGLMYGMSAFGLARQLGIGRGEAQDYIALYFSR
YPGVRDFMETTRQQARDKGYVETVFGRRLYLDFINAGSQGQRAGAERAAI
NAPMQGTAADIIKRAMVKVDGWIADHAQRAKMILQVHDELVFEADIDFVD
TLLSEVTTRMAGAAELRVPLVVDSGVGDNWDQAH
>gid:109814  priA  primosomal protein N'
MSSTVATLRVALPVPLPQLFDYLPPADAAPTDPARVGCRVRVPFGPRELV
GVVVEIGQLPAADGLRPALAWCDQAPLLVDELARSLHWLARYTHAPLGEA
QASALPGPLRRGEPLADTHAWAWQLTEAGRTGSTSLRAGSRPALLAALLR
TGAVGEEQLDPLLPQWREAARSLAKREYAERVAVPADTIPPRPGNGPPLN
DEQQAATAAIRAHSGFATYLLDGVTGSGKTEVYLQAIADCLAAGKQALVL
VPEIGLTPQTLGRFRARLGVPVHALHSGLSDGERARVWAAAWRGQAQLIV
GTRSAVFTPLPNAGLIVIDEEHDGSYKQQDGIRYHARDFALVRGKALDVP
VILGSATPSLESLHNAYAGRYQHLRLSQRAGEARPPRVRVLDVRKRPLKD
GLSPEVLAGIGTTLARGEQVLVFKNRRGYAPVLLCHDCGWTAACQRCSTP
LHQTPMTVHAGGRRLQCHHCGARQPAPLACPACASLALQPQGIGTERLEE
RLTEAFPDVPVVRIDRSTTQRRDALETQLARLGTEAGILVGTQILAKGHD
LPRLTMVVVVGIDEGLFSADFRAAEKLAQQLIQVAGRAGRADRPGEVWLQ
THHPEHPLLQTLVNGGYHAFADAELQQREAAGFPPFAHLALFRAEAKDVA
AANQFLMAVRGLTTADSTAQSPAFAAVECYGPMPAPMPRRAGFQRSQLLL
SAQQRSALHRLLDAQLPAIYALPQARRVRWSLDVDPIDLY
>gid:109911  radC  DNA repair protein
MHIHDWPTHERPREKLLARGATALSDAELLAILVGSGLRGQDAVQTARDL
LHRHGPLRLLLDRPAKALTRLPGLGPASACKFAAAMELAQRHLMSALERG
EALSDPPSVGRYFSQRLRARAYEVFAVLFLDNRHRAIAFEELFTGTIDGA
DIHPREVVRRALLHNAAAVIVGHNHPSGNPEPSEADRAVTKRLLDSLELV
DIRLLDHFVIGDGRPVSFAERGWLE
>gid:108414  rci  shufflon-specific recombinase
MFVSRTEAESTTLCEALDRYEREIVSLKKGKAQEVSLLKTWRATPVAQRS
MASVRSSDIAKLRDDWLRQTNPALKPASVLRRLAILSHVFSIARKEWGME
SLSNPLELVRKPPANNARTRRVRETEGDSRDDRRAPDGELKRVVSASSSD
VLPTIISLAVETAMRRGEIVDLRWEHVDLKRHVAHLPHTKNGSSRDVPLS
PRAVEALKEWRKVVGKNPGDGRVFSIRGDAVTRAFERAVDRARKTYEQEC
RDAGRTPDAKYLIDLRFHDLRHEATSRLASIFPMHELAKITGHRDPRMLM
RYYHPSAEDLAKRLR
>gid:107736  recA  RecA protein
MDENKKRALSAALSQIEKQFGKGSVMRMGDRVIEAVEVIPTGSLMLDIAL
GIGGLPKGRVVEIYGPESSGKTTLTLQAIAECQKNGGTAAFIDAEHALDP
IYAAKLGVNVDDLLLSQPDTGEQALEIADMLVRSGSVDIVVVDSVAALTP
KAEIEGEMGDQLPGLQARLMSQALRKLTGNIKRSNTLVVFINQLRMKIGV
MMPGQSPEVTTGGNALKFYASVRLDIRRIGAIKKGDEIIGNQTKIKVVKN
KLAPPFKQVVTEILYGEGISREGELIDMGVEAKLVDKAGAWYSYGDERIG
QGKDNARTYLRDNSQVATRLEAELREKFQPAEAPREAGDDEDKE
>gid:110332  recB  exodeoxyribonuclease V beta chain
MSISPVTDPYLQLPLHGVRLIEASAGTGKTFTLATLFTRLVVERGLRIGQ
ILAVTFTEAATQELRRRIRERLVLAASLVPDAPPPFVGAALAAGDLADDP
SRPGPLLHATPDVLLTGAILATHLAGASETPAALRRRLQQAVEEIDLAAV
FTIHGFCARVLREHALESGQAFAAPELLANDRELLGEVAADLWRQRAADA
VMAEDLIALWSGGPEALASDLRALVRHPQLLPRPSTPAPDPTPIRQAAAQ
ALVATLRAHGDTAYDAIASAFDNKVFDGRRARRASFDKAFEELWNGTADA
HWILDEKAHLDKLLPARMREFCRDGAHDRVPCSPLFDALQVWRQADIQVQ
QWQQNRRIALLHALRDDAIARLALLKRQRRVQTYDDLVDGVAHALQGAQA
DALLKRLRTQYAIALVDEFQDTDDRQWSIFSNVFGEGPLAQAAGLEPALF
LIGDPKQAIYGFRGGDVRTYLAAAVTAERAPPLSHNFRSRPGVLAAIDAL
YAQAGYADAFLTDGIAFHPVRAGTKRVDEDLQRDGVTAPALTLWRAPEPP
PPAKGKPKPWSAGRARELATAACVAAIRGWLAGGRDGTATVCGRPVQAGD
IAVLVRSHGEATRIQQALGAVGIPAVAAGKQSLFATEEALELLALLQALL
DPGDDSRLRAALATVLIGEDAIAIAALERDGERHRRWQQQALDWRERWQR
GGPLALIGDLGATHGQRLLALVDGERRLTNYLQLAELLQEADARALGPHG
LVDWLSRRIANADDNDDAQQLRLESDARRVQIVTLHKSKGLEYPLVFLPY
VGIGRKPPDQKPGKRVIVYRDGKRCLHWPDDVDSMEWKAIKEEWKKEQDA
EDARLLYVGLTRAEHALWIASGPFHQHERTALSAMLRALDALQGTAGEGG
VVVDTTTPPAHLPRLPAPSEAQVPLARNPQRHIAPEWWVYSFTQLANADA
GAATDPMASATVVGSGGSDEPSGSEAVAVAVATDAEPFDPRFAGNRFGVV
MHDVFERCDFAAWQAWRPGQTAPDGQAAPILEALQRGGYAQDDLDDGLRM
LTALIGHTLTVALPEGTRLADVPEPQRRNEMEFHFAMRPTRVDALLALLH
RFGVVGDRQAFGARQRLEGLMTGLIDLTYSVDGRWYVLDYKSNRLPSYDA
DALARAMAHSEYELQALIYTVALHRWLRFRLGDAYDYARDFGGVRYLFCR
GLDAARPAVEGMPAPGIHAWRFAPALVQSLDALFAGAPPEALA
>gid:110333  recC  exodeoxyribonuclease V gamma chain
MHATSAPDFRLYPSNALDTLAALLAQELRRPMPGQPLLQPEVVLIPQVAM
RRWLQSTLAAEHGVAANLEFLTPGEFVARALERNLGAADDDLDMATTQWR
LYAALQSDLGSDAALAPLAGYLSDGDALKPWALAGELGSVFEKYQAWRRD
WLLRWEAGADPDDPQARLWRSIAAGRQYRARRIGQYLDRYTRADGPLPQG
LPPRVFAFAILNISPDVLRVLATQARVGTLHFYLPTPTQGYWGDLQTLRQ
RRREDGAVQLFAEQVQENPLLQAWGAAGRDFMALVGDYEVVHPLAEIAAY
ADPLDSGRRTLADGGLGDSLLRRMQSDLFHRRGPAADARLPAVNLHDPSL
QVHACHTRLRELQVLHDQLRALLDDARFDPPLQPREIAVLSPNIDPYVPY
LDAVFGSHGNDDALPYALADASPLASEPLAEVFLALLGLPIARFGLHEIL
DLLASAPIAEAAGLDEAGLERLRNWLHAAGARWGLDAAHRRQHQAPGDDA
YTWRFALDRLLLGHASGADEDIDGVAPWPQLEGSALAALDTLLRLLRVLE
RHQAALADAMTPVEWRERLLGLLEALIPATPTAPRAQRALDRLRTLIDQF
ARDAVRAEYAGKVPAEVVRAHFAAVLGDSDTRAPLLTGGISFGRMVPMRL
LPFRVICLLGMNDGDFPRRDPAAGLNRLTAELGTERRRHGDRSTREDDRF
LFLQLFASAQEVFYLSYLGADARDGSVREPSVLVSELLASAAQYHAASNA
LDTLVVRHPLQPFAAAAFGALGDDGADPRRFSYRRQWRPAVDSLVGQRQP
LAPWVAGALPAEAIALPASMSIDALRRLLTDPAGQFLRHRLGMRLPDPAG
EDSDLEPLLAPTRGLDHYGLQQQVFEAALAGDTEGLYERLRARALLPSGP
LGRRQLDERVAQLRPYADAFRQWRGESPAQSRRLQVQIGEIELHGRVPGW
YASGVGRVQVGALSGRSAIRHGLEWLLLRAAGEHAPYVRFFEHDDGLGPH
PIDAQPLSQTQAHTALAELLQLYRQGVQTPLAFAPYSSWKYHQAARGGDL
DKAIKEAAGQWQSGFGWSESHSPELRLVTRGRDPFADAQRFAAFATTSHR
VYALLEQGNTGAPLETERVIDSWRHWHGAREDAE
>gid:110331  recD  exodeoxyribonuclease V alpha chain
MSLLADLQAKSHLRTLDHAFAQSLRRLRPDTPDTVLLAAALASLAVANGH
AGLDPRRPRLLVDADVAWPAADAWMAQLQASPWIAQPDDPLAAAASDAPL
VLEHGLLYLRRYREYEARLAAGLTRIASDTLPATESAVLAAVFAALFPSA
TTGQDRQAQAAALALRRALLLITGGPGTGKTTTIARLLLLRLAQAHAAGQ
PPPRIALAAPTGRAAERMAESLRIALARAIEQGLPAEHQGASPLPPAGEG
ARRAGEGTAPTDFSWHSALPTGASTLHRLLGVIPDSPHFRHDADNPLPFD
LIVVDEASMVDLPLMCKLVDAVADGTQLILLGDADQLPSVEAGDVLAAIL
RAAGPGDTLPPDDARALQPLLGNAPVTAPAASTPVGHLAGHRVHLLRGYR
QAADFALAPLADAVRIGDAETALSLLRSGTLPGVHFHEDGDDPLSLGREA
LLAHWQALTATNDPADALRQAGRLRLLTAVRAGPQGARGLNARIEQLLAE
SGSGARRLGAASPWFHGRLLLITENSYRHGLFNGDVGICLRSEASPASVR
DPQAALVAWFEGDGDGQVRGFHPAALPAHESAFAMTVHKAQGSEFDEVWL
QLPTRDARVLSRELLYTGITRARRALHLAGSEAVIRSALARHAARISGLA
WRLGAEQMQPTADAASPLRATTPTALGNPIQGSLF
>gid:105999  recF  DNA replication and repair RecF protein
MHVVRLSIHRLRRFQTVELHPSSALNLLTGDNGAGKTSVLEALHLMAYGR
SFRGRVRDGLIQQGANDLEVFVEWKEGGGAAVERTRRAGLRHSGQEWTGR
LDGEDVAQLGSLCAALAVVTFEPGSHVLISGGGEPRRRFLDWGLFHVEPD
FLTLWRRYARALKQRNALLKQGAQPRMLDAWDNELAESGETLTSRRMRYL
ERLQDRLVPVADAIAPALGLSALTFAPGWKRHEVSLADALLLARERDRQN
GYTSQGPHRADWMPSFHALPGKDALSRGQAKLTALACLLAQAEDFAFERG
EWPVIALDDLGSELDRHHQGRVLQRLASAPAQVLITATETPPGLADAAAL
LQQFHVEHGQIARQATVN
>gid:109387  recG  ATP-dependent DNA helicase
MSRTRVVTPSLAVAGQARLSSLPGVGPKVADRFAARGILSVQDLWLHLPL
RYEDRTRLTTIAQLQGGVPAQIEGRVDAVERGFRFRPVLRVAVSDASYGT
LVLRFFHFRAAQVAQFAVGTRVRVFGTPKPGQHGWEIVHPSYRVLAPDED
AGLGDSLDPVYPVLEGVGPATLRKLIGQALERLPPESALELLPPHWLQDE
RLPSLRAALLTMHRPPVGTDPQQLLAGGHPAQQRLAIEELLAHQLSLRRQ
RIALQRLHAPSLPGNGTLVQQLRKALPFQLTGAQQRVFEQIAHDLAQPSP
MLRLVQGDVGSGKTVVAALAAMLAVEQGKQVALAAPTELLAEQHLTNLRG
WLEPLGIRIVWLAGKVTGKARAAAMAEVASGQAQVVVGTHALMQEAVVFH
DLALAIIDEQHRFGVHQRLALRDKGAAAGSVPHQLVMTATPIPRTLAMSA
YADLDVSAIDELPPGRTPVQTIVLSAERRPELVERIRAACAEGRQAYWVC
TLIEESEEPGKGAQGQQGGPPRIEAQAAEVTFETLSAQLPGVRVALVHGR
MKPAEKQKAMLDFKQGRSDLLVATTVIEVGVDVPNASLMIIENAERLGLA
QLHQLRGRVGRGAAASSCVLLYQAPLSMMARQRLETMRQTNDGFVIAEKD
LQLRGPGELLGTRQTGLASFRIADLARDAGLLPRVQLLAERLLDEAPDIA
DRVVERWIGGAVRYAAA
>gid:107861  recJ  single-stranded-DNA-specific exonuclease
MTSSPRIVRRSSGQAGSWPDTMLPLLRRIYAARGVTDAQGAQPRLGQLLS
PELLHNSGVAAELLADAIAQQRRILVVGDFDCDGATACAVGVRGLRMLGA
LDVHHAVPNRMVHGYGLSPALVDELAGLQPDVLVTVDHGIACHAGVAAAK
ARGWTVLVTDHHLPGEVLPPADAIVDPNLAEDTFPSKTLAGVGVIFYVLL
ALRQVLRARGAFAQRAEPDLSVLLDLVAVGTVADLVPLDTNNRALVSAGL
RRLREGKGCIGLRALIDASGRDVARLSASDIGFALGPRLNAAGRLEDMAL
GIELLLSEDWSQAREIAGTLEEINAERRAVQQLMTDDAEHAVTKVVLDGD
GALPIAACLFDAEWHPGVIGLVASKLKDRLHRPVIALAPAEPGSSQLRGS
ARSIPGLHIRDVLAAVDARHPGLIQKFGGHAMAAGLSIDHAALATFEQAF
QAQVTAMVDASLLHAQLHSDGELAAHELDHLHAEALRVAGPWGQGFPEPL
FDGQFEVLQHRVLKERHLKLTLRCPGRAEPLNAIHFNGWRGSDPARLVRL
AYRLVADDYRGGTAVQLIVEHCEPVALA
>gid:107515  recN  recombination protein N
MLRHLSIKDFAVVRATELEFGPGMTVVSGETGAGKSLMVDALGFLSGLRA
DSGVVRHGAERAELSAEFQLPAEHPGLRWLADNELDDEAQCQLRRIIRAD
GGSRAWINGRPVTSSQLAELASKLVEIHGQHEHQALMARHSQLALLDAYA
RNSAQREQVRQASQRWQALLDERDALSAQGDVSDRIGFLEHQLAELERED
LDPAAIAALDVNHRRQAHATALIGACDSVAQQLNGDDGASALGLLQDSRH
DLSRVAEHEPRLGEVDALLDSAVIQIEEALALLDRVRDDLEADPAQFEAM
ERRLGRLHDLARKHRVTPDELAAHRDHLSAEVESLRGADERLQQLDKHIE
TATGAWRSAAGALSDSRTAAAEALSAATTALIGELGMGGGQFLIQLQPHE
SARPDPNGAERVEFLVAANAGQPPRALRKVASGGELSRISLAIEVAALGL
DSVPTMVFDEVDSGIGGAVADIVGQKLRALGEERQVLCVTHLPQVAAKGH
AHYRVSKAPVDGMTQSAVELLGPQTRQEELARMLGGVEVSKEARAAARKL
LQSA
>gid:107323  recO  DNA repair protein
MLIEHERGFVLHVRAWRETSLLVEVLTEQHGRVGLLARGVQGPRKQALRA
ALQPLQLIQFSAVQRGELAQLRQAEALDTAPRLAGDTMLAGFYIGELLLR
LAPRHDPVPELYACYAQARRHLASDLPLAWGLRRFERDVLEGLGFAFDLQ
HDSEGQPIDPAARYRLDPQDGALRVLSERLAQDRRETVTGAALLALGQDV
MPDADDMPGLRRSMRGVLLHHLGGRGLKSWEMLEDLARRR
>gid:109120  recQ  DNA helicase
MSSPAHELLSRVFGYDDFRGPQQAIVEHVAAGNDALVLMPTGGGKSLCYQ
VPALLRDGIGIVVSPLIALMQDQVEALRQLGVRAEFLNSTLDAENAQRVE
RALLSGDLDLLYVAPERLLTQRFLSLLERSRIALFAIDEAHCVSQWGHDF
RPEYRQLTVLHERWPHVPRMALTATADPPTQREIAERLDLVDARHFVSSF
DRPNIRYTVVQKDNARKQLQEFLGRHRGSAGIVYAMSRRKVEETAQQLCA
QGFNALPYHAGLPAEVRAENQRRFLREDGIIMAATIAFGMGIDKPDVRFV
AHVDLPKSLEGYYQETGRAGRDGDPAEAWLCYGLGDVVLLKQMIEQGEAA
EERKRLERAKLDHLLGYCESMQCRRQVLLAGFGETYPKPCGNCDNCLTPA
AAWDATVASQKALSCVYRSGQRFGVGHLIDILRGSENERIKQLGHDQLST
YGIGRDLDERTWRGVFRQLVAASLLEVDSEGHGGLRLTDASRQVLKGERQ
VMMRRENPAAGRERSAQRTGLPVQPQDLALFNALRGLRADLAKEQNVPAF
VIFHDSTLRNIAEQRPTSIDALSRVGGIGGGKLARYGAQLIEIVREQG
>gid:107107  recR  recombination protein RecR
MSTLLEQLIEAFRVLPGVGQKSAQRMAYHVLEREREGGRRLAAALGSAVE
KVGHCVQCRDFTESEICTICASSSRDRQQLCVVESPADRLAIEHATGYRG
LYFILQGRLSPLDGIGPRELGLDRLSERLAAGEVTEMIIATNATVEGEAT
AHYLAQLARQHAVRPSRLAQGMPLGGELEYVDRGTLSHAFGTRSEVL
>gid:110277  rep  ATP-dependent DNA helicase
MYGLNPPQSAAVLHCEGPLLVLAGAGSGKTRVIVEKIAHLIAIGRYPAKR
IAAITFTNKSAKEMRERVAKRIRGDGAEGLTICTFHALGLKFLQIEHAAA
GLKRGFSIFDSDDAAAQIKDLMHGAKPDAIEDAKNLISRAKNAGLSPEQA
MAAARSTREKEAASLYERYQARLTTFNAVDFDDLIRLPVQILEANEDIVM
GWRERIGYLLVDECQDTNDAQYRLLKMLAGPRGNFTCVGDDDQSIYAWRG
ANPENLQQMARDYPALEIIKLEQNYRCSNRVLRAANALIAHNPHEHLKTL
WSDQADGERIRVWECRDSEHEAEKVAAEISFLGTAKQVPWSDFCILFRGN
FQSRPLEKALQLLRVPYHLTGGTAFLERQEVKDLLSWLRLIVNPDDDAAF
LRAVQSPKREVGATSLARLAELASAKSVPMSRAAESMGALQHLPPRAANG
LSAFTDILRDMREHSSSMPAGELVRLLADKSGLLNDLRNQSKDETGFQRR
KRNLDELAEWFEGGPRGASASDLAAQLALLSRNDKDDGGNQVRMMTMHAS
KGLEFRYVFIVGCEDGVLPHEVSLEEGNLQEERRLLYVGITRAKEQLWMS
HSKLTRKFGEHVRLKPSRFFDEIPAEELQRDGADPVADAERKKERASAGL
AAIQALFD
>gid:109825  rhlB  ATP-dependent RNA helicase
MSDKPLTDVTFSSFDLHPALIAGLESAGFTRCTPIQALTLPVALPGGDVA
GQAQTGTGKTLAFLVAVMNRLLIRPALADRKPEDPRALILAPTRELAIQI
HKDAVKFGADLGLRFALVYGGVDYDKQRELLQQGVDVIIATPGRLIDYVK
QHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPERGTRQTLLFSA
TLSHRVLELAYEHMNEPEKLVVETESITAARVRQRIYFPSDEEKQTLLLG
LLSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLL
NRFQKGQLEILVATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTAR
LGEEGDAISFACERYAMSLPDIEAYIEQKIPVEPVTSELLTPLPRAPRVP
VEGEEADDDAGDSVGTIFREAREQRAAEEQRRGGGRGGPGGSRSGSGGGR
RDGAGADGKPRPRRKPRVEGQAPAAAASTEHPVVAAVAAQAPSAGVADAE
RAPRKRRRRRNGRPVEGAEPALASTPVPAPAAPRKPTQVVAKPVRAAAKP
SGSPSLLSRIGRRLRSLVSGN
>gid:106438  rhlE  ATP-dependent RNA helicase
MSFSDLSLSPQLQPFFDTALAKAGLRTPTPIQQQAIPPMLDGRDLIAMAQ
TGSGKTLAYALPLLQQRCLAPDTAPRVLGALVLVPTRELAAQVEDALRQL
AAHLPRRLKSVVATGGSSINPQLLALRGGADLVVATPGRLLDLVEHNALR
LNGVTTLVLDEADRLLELGFGAELDRILALLPAQRQTVLFSATFPPAIAS
LAKRRLRDPLRVTIDATPEQAPAIAQCAIAVDAGQRTQLLRHLLQEHAWP
QLLVFVASRHSADKVAEKLSKTGIAALPLHGELSQGRRERTLRAFKQADV
QVLVATDLAGRGIDIDALPAVLNYDLPRSTVDYTHRIGRTARAGASGVAI
SFVTADSAQQWRLIEKRQGLRVPTSVIEGFEPTPVEAPASDNAPGTAMRA
PNDNGGIKGKRPSKKDKLRAAAQAQTGKPG
>gid:109606  rhlE  ATP-dependent RNA helicase
MSFESLGLAPFLLRALAEQGYETPTPIQQQAIPLVLAGHDLLAGAQTGTG
KTAAFGLPLLQHLGTAPQTVNGPRKPRALILTPTRELATQVHDSLRGYSK
YLRIPSAVIYGGVGMGNQLDALRRGVDLLIACPGRLIDHIERRSVDLSGI
EVLILDEADRMLDMGFLPSIKRILTKLPRQDRQTLLFSATFEENIKQLAL
EFMRNPVQIQVTPSNTVAESITHRVHPVDGARKRDLLLHLLAQDSREQTL
VFARTKHGSDKLALFLEKSGIKTAAIHGNKSQGQRMRALSDFKAGRVTVL
VATDIAARGIDIDQLPKVINYDLPMVAEDYVHRIGRTGRNGSTGEAISLV
AQDEAKLLRQIVRMLGRDVEIRDVPGYEPQTPIRWGNSAPGRAEQPGGDR
APRKSHARRPHGDAPRQAHAHAGPKKPGGQRSNGPRQANTGAGAGRRDGG
RGGQRPSGTR
>gid:107085  rnhA  ribonuclease H
MKSIEVHTDGSCLGNPGPGGWAALLRYNGREKELAGGEAVSTNNRMELMA
AIMALETLTEPCQIVLHTDSQYVRQGITEWMPGWVRRNWKTAGCDPVKNR
ELWERLHAATQRHRIDWRWVKGHNGDPDNERVDVLARNQAIAQRGGLATS
>gid:107403  rnhB  ribonuclease HII
MRRSTSNGAVVVPATQDGLFHDSPFPIPDSRLIAGVDEAGRGPLAGPVAV
AAVVFDPGKPRINGLDDSKQLTAERREQLYARIVDRALAWSVVLIDSEEI
DRINIYQATMLGMRRAVEGVAHVAGFARIDGNRVPKGLPCPAEALIGGDA
LDRAIMAASIVAKVTRDRLMHELHTRHPEYRFDQHKGYSTPVHLAALQTH
GPCPQHRRSFAPVRLALQGREGLEPDPGTRDLEQLQVGQLVAPL
>gid:107567  rnt  ribonuclease T
MRMNEPVDAQPAPSFLPMSRRFRGYLPVVVDVETGGFDWNKHALLEIACV
PIEMGADGRFFPGETASAHLVPAPGLEIDPKSLEITGIVLDHPFRFAKQE
KEALDHVFAPVRAAVKKYGCQRAILVGHNAHFDLNFLNAAVARVGHKRNP
FHPFSVFDTVTLAGVAYGQTVLARAAQAAGLDWNAADAHSAVYDTEQTAR
LFCKIANAWPGPV
>gid:109145  ruvA  holliday junction binding protein DNA helicase
MIGRLRGILAYKQPPWLVIDVGGVGYELEAPMSTFYDLPDVGRDVILFTH
YAQKEDSVSLYGFLREGERRLFRDVQKVTGIGAKIALAVLSGVSVDEFAR
LITSGDITALTRIPGIGKKTAERMVVELRDRAADFSSGAPITGQLGPDAV
SEATVALQQLGYKPAEAARMARDAGAEGDEVATVIRKALQAALR
>gid:109143  ruvB  holliday junction binding protein DNA helicase
MTEQRIIASSSTREDDAADASIRPKRLADYLGQQPVREQMEIYIQAAKAR
GEAMDHVLIFGPPGLGKTTLSHVIANELGVSLRVTSGPVIEKAGDLAALL
TNLQPHDVLFIDEIHRLSPVVEEVLYPAMEDFQIDIMIGDGPAARSIKID
LPPFTLIGATTRAGLLTAPLRDRFGIVQRLEFYSPQELTRIVIRSAAILG
IDCTAEGAAEIARRARGTPRIANRLLRRVRDYAQVKAAGHIDLPVAQAAM
QMLKVDPEGFDELDRRMLRTIVEHFDGGPVGVESLAASLSEERGTLEDVI
EPYLIQQGFLIRTARGRMVTPKAYLHLGLKPPRERAPGIGEPGDLF
>gid:109146  ruvC  holliday junction resolvase
MTRILGIDPGSQRTGIGIIDIDEGGRSRHVHHAPLMLLGEGDFSQRLKRL
LQGLGELIEIYRPDEVAIEKVFMGKSAASALKLGHARGAAICAVVMRDLP
VHEYAAKEIKLALVGKGGADKVQVQHMVGIMLNLNGKLQPDAADALAVAI
THAHVRATAQRLGVNTQQAWSRKK
>gid:107595  sbcB  exodeoxyribonuclease I
MPDSFLFYDLETFGQDPRRTRIAQFAAVRTDAQLRVIEEPISFFVQPADD
LLPSPYATMVTGITPQHALREGVTEAEAFARIAEQMGRPQTCTLGYNSIR
FDDEFIRCGLFRNFYDPYEREWRGGNSRWDLLDVLRLVHALRPDGIVWPQ
REDGATSFKLEHLADANDVREGDAHEALSDVYATIGMARKFQQHQPKLWD
YALRLRDKRFAAALLDVIAMQPVLHVSQRYPASRLCAAAVLPLTRHPRID
SRVIVFDLEGDPEVLLRLSPDDIADRLYIRAADLPEGEQRIPLKEVHLNK
SPALVAWQHLRAADFERLGVDHDAVVAKAARLRALGPELAEKVRQVYGSE
RAAAAAVNDADASLYDGFLAEGDKRLLAQVRASAPDELGALEARFRDPRL
IELLFRYRARNWPQTLSFEEQERWNVYRRQRLLQDRGLGEVTLEQYHAQI
ADLRGAHPDDATKQALLDQLAAWGSDLQRTL
>gid:109799  smf  DNA processing chain A
MAQTAPDLRALLILLLAGGRSPPRRKLLTSYGSASEMLAAGPGAWRAAGC
DEVQAALLQSPDPHALEAALRWCEQPGHHLIGWRDPDYPALLRHIANPPM
ALFVDGDPDALWHPGVAVVGSRAATAGGRDHTRAFASSLACAGLSIVSGM
AAGVDAIAHEGALAQHDGITVAVVGTGVDVAYPERHQGLRDRIAERGAVV
SEYLPGTAPVAAHFPARNRIIAGLALGTLVVEAAMRSGALITARLAAEAG
REVFALPGSLHNPLARGCHHLIRQGATLAQEPAQVIEGLGQLSGELATAL
RERLAAPTELPGKATTSAGPGPARSDPDYQRLWQALDHDPTPMDSLVQRT
GLTAAALSSMLLIMELEGDVVTEHGRYTRNP
>gid:108207  ssb  single-stranded DNA binding protein
MSTHFHGEGNIGSPPKFDEYPNGNEEPRRVLRLNVYFDNPVQNKDGAYDD
RGGFWAPVDWWHRDAERWATLFAKGMRVAVRGHLERDDWTDGDGEPHTTY
KVDARSVGILPYRLDAVHLSPKPGTTGE
>gid:108901  ssb  single-stranded DNA binding protein
MARGINKVILVGNLGNDPDTKYTQAGMAITRVSLATTSMRKDRDGNNQER
TEWHRVVFFGKLGEIAGEYLRKGSQVYVEGELRYDKYTGQDGVEKYSTDI
VANEMQMLGGRGEGGGGGMGGDRPQRSAPRQQGGGGGQGGGGYGGGGGGG
GQDYAPRRQQPAQQQSAPPMDDFADDDIPF
>gid:108915  tag  DNA-3-methyladenine glycosylase I
MSGYCSIAPGHPVHGHYHDHEYGFPQRHERELFERLVLEINQAGLSWETI
LRKRVNFQQAYDGFDVDTVAAYGDQDISRLMGDAGIIRNRLKVLAAIHNA
HVIRTLRATHGGFAQWLDAHHPLDKPAWVKLFKKTFRFTGGEITGEFLMS
LGYLPGAHHAQCPVFAQIRALAPPWLQAQKPAKARTVQRG
>gid:108427  tnp  transposase
MRKSPKFSPEVVERSVRMVLESQGQYESQWAAIESIASKIGCTSETLRRW
VR
>XACb0067 tnpA, Tn5045 transposase
MESRLWRLLVHGVTPEQRQRLDDLLKLVEGSRQSWLDRLRKGPVRVSAPA
LVAALLRIETVRGLGIKLPGTHVPPSRIAALARFASTAKVSAVARLPEVR
RIATLVAFVHCLEASAQDDAIDVLDLLLRELFTKAEKEDRKVRQRSLKDL
DRAASTLAEACRMLLDPALPDGELRERVYAAIGHDELAQALNEVRGLVRP
PNDVFYTELEARKATVSRFLPALLRVIRFDANPAAQPLAQALQWLHEKPD
HDPPTAIVGKAWQRHVVQDDGRINATAYSFCALDKLRSAIRRRDVFISPS
WRYADPRAGLLAGAEWEASRPIVCRSLSLSAQPEATLSELTRELDETYRR
VAARLPQNDAVRFENVGDKTELVLSPLEALEEPPSLIALRNEIKARMPRV
DLPEILLEVAGRTGCMEAFTHLTERTARAADLTTSLCAVLMAEACNTGPE
PLVRPDTPALKRDRLMWVDQNYVRDDTLTACNAVLVAAQSRIALARTWGG
GDVASADGMRFVVPVRTIHAGPNPKYFNRGRGVTWYNLLSDQRTGLNAIT
VPGTLRDSLILLAVVLEQQTELQPTQIMTDTGAYSDLVFGLFRLSNYRFC
PRLADVGGTRFWRVDPDADYGDLNALARQRVNLDRITPHWDDVLRLVGSL
KLGLVPAMGIMRTLQVDERPTSLAQAIAEIGRIDKTIHTLNFIDDEARRR
ATLLQLNLGEGRHSLAREVFHGKRGELFQRYREGQEDQLSALGLVVNMIV
LWNTLYMDAVLTQLRSEGYPVKPEDEARLSPFGHEHINMLGRYSFSVPEA
VARGEPSRRNSWLGHAALEHDCIRIVPHSGERRCLPSRATRRCGWGRTR
>XACa0035 tnpA, Tn5045 transposase
MPVSFLSTTQRERYGRYPDTLSSEELARYFHLDDDDREWIATKRRDSSRL
GYALQLTTARFLGTFLEDPTAVPSPVLHTLSSQLGIADPSDCVIDYRTTR
QRWQHTSEIRTRYGYREFTGTGVQFRLGRWLCALCWTGTDRPSALFDYAN
GWLVGHKVLLPGVTLLERFIA
>XACa0034 tnpA, Tn5045 transposase
MESRLWRLLVHGVTPEQRQRLDDLLKLVEGSRQSWLDRLRKGPVRVSAPA
LVAALLRIETVRGLGIKLPGTHVPPSRIAALARFASTAKVSAVARLPEVR
RIATLVAFVHCLEASAQDDAIDVLDLLLRELFTKAEKEDRKVRQRSLKDL
DRAASTLAEACRMLLDPALPDGELRERVYAAIGHDELAQALNEVRGLVRP
PNDVFYTELEARKATVSRFLPALLRVIRFDANPAAQPLAQALQWLHEKPD
HDPPTAIVGKAWQRHVVQDDGRINATAYSFCALDKLRSAIRRRDVFISPS
WRYADPRAGLLAGAEWEASRPIVCRSLSLSAQPEATLSELTRELDETYRR
VAARLPQNDAVRFENVGDKTELVLSPLEALEEPPSLIALRNEIKARMPRV
DLPEILLEVAGRTGCMEAFTHLTERTARAADLTTSLCAVLMAEACNTGPE
PLVRPDTPALKRDRLMWVDQNYVRDDTLTACNAVLVAAQSRIALARTWGG
GDVASADGMRFVVPVRTIHAGPNPKYFNRGRGVTWYNLLSDQRTGLNAIT
VPGTLRDSLILLAVVLEQQTELQPTQIMTDTGAYSDLVFGLFRLSNYRFC
PRLADVGGTRFWRVDPDADYGDLNALARQRVNLDRITPHWDDVLRLVGSL
KLGLVPAMGIMRTLQVDERPTSLAQAIAEIGRIDKTIHTLNFIDDEARRR
ATLLQLNLGEGRHSLAREVFHGKRGELFQRYREGQEDQLSALGLVVNMIV
LWNTLYMDAVLTQLRSEGYPVKPEDEARLSPFGHEHINMLGRYSFSVPEA
VARGELRPLTKPNDP
>XACb0068 tnpA, Tn5045 transposase
MPVSFLSTTQRERYGRYPDTLSSEELARYFHLDDDDREWIATKRRDSSRL
GYALQLTTARFLGTFLEDPTAVPSPVLHTLSSQLGIADPSDCVIDYRTTR
QRWQHTSEIRTRYGYREFTGTGVQFRLGRWLCALCWTGTDRPSALFDYAN
GWLVGHKVLLPGVTLLERFIA
>XACb0071 tnpR, Tn5045 resolvase
MKIGYARVSTREQNPALQVDSLKAAGCERIYQDVASGAKTARPALDELLG
QLRGGDVLVIWKLDRMGRSLKHLVELVGSLMERKVGLLSLNDPIDTTSAQ
GRFVFNLFATLAEFERELIRERTQAGLTAARARGRVGGRPKGLSPQAEAT
ALAAETLYRERKLSVAAIAQKLHLSKSTLYSYLRHRGVEIGPYKQSAQSP
INVSVVESGNADQPKVATILLTLRIENNSKFVRGKKRSIEHVEFFDLAQY
DATRRPNGEYELKVPYDTDEELDEAVNDLLTDIACGADDRHCFSESHARM
EGTDRHW
>XACa0038 tnpR, Tn5045 resolvase
MKIGYARVSTREQNPALQVDSLKAAGCERIYQDVASGAKTARPALDELLG
QLRGGDVLVIWKLDRMGRSLKHLVELVGSLMERKVGLLSLNDPIDTTSAQ
GRFVFNLFATLAEFERELIRERTQAGLTAARARGRVGGRPKGLSPQAEAT
ALAAETLYRERKLSVAAIAQKLHLSKSTLYSYLRHRGVEIGPYKQSAQSP
INVSVVESGNADQPKVATILLTLRIENNSKFVRGKKRSIEHVEFFDLAQY
DATRRPNGEYELKVPYDTDEELDEAVNDLLTDIACGADDRHCFSESHARM
EGTDRHW
>gid:109803  topA  DNA topoisomerase I
MPKHLLIVESPAKAKTINKYLGKDFTVLASYGHVRDLVPKEGAVDPDNGF
AMRYDLIEKNEKHVEAIARAAKGADDIFLATDPDREGEAISWHIAEILKE
RGLLKDKPMQRVVFTEITPRAIKEAMAKPRMIAGDLVDAQQARRALDYLV
GFNLSPVLWRKVQRGLSAGRVQSPALRMIVEREEEIEAFIPREYWSIDAH
CRHPSQAFNARLIKLDGQKFEQFTVTDGDTAEAARLRIQQAAQGVLHVTD
VASKERKRRPAPPFTTSTLQQEASRKLGFTTRKTMQVAQKLYEGVALGDE
GSVGLISYMRTDSVNLSQDALAEIRDVIARDFGTASLPDQPNAYTTKSKN
AQEAHEAVRPTSALRTPAQVARFLSEDERRLYELIWRRAVACQMIPATLN
TVSVDLSAGSEHVFRASGTTVVVPGFLAVYEEGKDTKSSEDEDEGRKLPL
MKAGDNVPLDRIVTDQHFTQPPPRFTEAALVKALEEYGIGRPSTYASIIQ
TLQFRKYVEMEGRSFRPTDVGRAVSKFLSGHFTRYVDYDFTANLEDDLDA
VSRGEAEWIPLMEKFWGPFKELVEDKKDSLDKTDAGSVRVLGTDPVSGKE
VSARIGRFGPMVQIGTVEDEDKPTFASLRPGQSIYSISLEDALELFKMPR
ALGQDKDQDVSVGIGRFGPFARRGSTYASLKKEDDPYTIDLARAIFLIEE
KEEIARNRVIKEFDGSDIQVLNGRFGPYISDGKLNGKIPKDREPASLTFE
EVQQLLADTGKPVRKGFGAKKATLKKNTVKDSAPKKPAVKKTATKTAASK
TAVKKAPAKKTATKKAAKRVVKKTVSKAAG
>gid:108208  topB  DNA topoisomerase III
MRVVLCEKPSQGKDIARVLGAHRREAGCLGGPGITVTWGSGHLIEAAPPE
AYGEHFQRWTLPIVPPRWQWVVKARSAAQFKVVKRLLGEASELIIATDAD
REGEMIAREMIDYCGYRGPVQRLWLSALNDASIRKAFGALRASSETLSLY
HCALARSRADWLVGINLTRLFTLLGRHAGHDGVLSVGRVQTPTLKLVVDR
DREIAAFVPVPFWSIDVSLSAANVPFSAHWLAPRTATDDAGRCLRQSVAR
HAADAIRAAGTAQVVHTETERVREGPPLPFDLSTLQEVCSRRLGLEVQET
LDIAQALYETHKATTYPRTDAGYLPENMLAEVPTVIQSLVKTDPALQPLI
DRLDFRQRSRAWNDAKITAHHAIIPTLEPADISAMSAKELAVYRLIRAHY
LAQFLPHHEFDRTVVRFRCGGETLQAVGKRIAVPGWQHAFSAPSLPHVAA
DGADTEGSDEAPREQMLPVLQEGISCQVSALDLKSLSTKPPRPYTQGELV
KAMKTVARLVTDPRLKQKLKDTTGIGTEATRANIISGLLSRGFLIRKGRA
VRASDTAMMLIDAVPAAIADPGTTAIWEQALDMIEARQMGLESFVEKQAA
WVTQLVRQHLGTTLAIRSPEGPPCPLCGAATRQRTGRSGPFWSCSRYPDC
KGTAPVAPAQAGPDGQKRSRSRGARAQR
>XACb0031 trwC, TrwC protein
MLNVTPIRGNNQYAAAHYFSAADDYYAKENPGEWQGQGAQVLGLTGPVEQ
AQLSRLLDGRLPNGERIQTTFDPTDNKKRMGLDLTFSAPKSVSMQALVAG
DKDVTAAHDRAVTRALEQVERLAEARKKVKGKSYRERTGNMVIGKFRHEM
SRAKDPQLHTHSVVLNMTQRADGAWRALSNEDIFRVQHEVDALYKAELAR
GLQALGYAIRLVDDQGNFELDHISRDQIEAFSARSRLIEEALANEGKTRA
TATTLEKQIISLATRPRKDESDRDLVKQYWVEKSREFGIDYGPRSQLDGR
TYEAGDSFGGRGRERGDAGDRIAATSLPARLTPAQAVVQYAINHLTEREA
VVRETALTATALRRAVGLAGPDEVRAEIKRLVGQGALIEAMPAYRMADRK
DGPALSSAGWKSYLQDLKGWSDKQAQQYVSNAIKQGSLVPAEKRYTTQKA
LAREKAILAIERTGRGAIEPIMTAAAVKTALESSALNAGQRFAVETIVST
KNRFVGIQGDAGTGRTYTVNQAVALIKQASAVSEGYRTVALAPYGNQVKA
LKNEGLEAHTLASFLHTKDKPIDGKTIIVLDESGVVGARQMEQVMRIVEK
SGARMVLLGDTKQTEAIEAGKPFAQLQQDGMQTARISEIQRQKDHELKTA
VEQAAEGRVTQSLAHIKHVEELKEPTERHRAIVNDYIQLTEPERRETLIV
AGTNEARREINRMVRESLDLTGKGREFETLTRVDLTQAQRRFAPSYQPGM
VIQPEKDYQKAGLTRGQTYRVKEALPGNALVLQRQDGTTTTINPRKATQL
SVYHLERAELSIGDTVRINRNDPGRDLTNGDRMRVAGVIGGTVKLESVEQ
RDGRPARALELPTNRPLHLEHAYASTVHSAQGLTNDRALIALDTKSRTTS
MNLYYVAISRARQEARVYTNNRGELPAAIARRFDKTTALAIQRERQLQRR
AAGMQPKGTADGKQALQRQQLQQQRKQPASGKKPSEYGRFG
>gid:109821  ung  uracil-DNA glycosylase
MTEGEGRIQLEPSWKARVGDWLLRPQMRELSAFLRQRKAAGARVFPPGPQ
IFAAFDATPFEQVKVVILGQDPYHGEGQAHGLCFSVLPGVPVPPSLLNIY
KEIQDDLGIARPDHGYLMPWARQGVLLLNAVLTVEQGRAGAHQNKGWEGF
TDHVVETLNREREGLVFLLWGSYAQSKGKVIDQTRHRVLKAPHPSPLSAH
RGFLGCQHFSKTNDHLRRRGLSPIDWSLPPRSALDLTSAGA
>gid:107243  uvrA1  excinuclease ABC subunit A
MAMDFIRIRGARTHNLKNIDLDLPRDKLIVITGLSGSGKSSLAFDTIYAE
GQRRYVESLSAYARQFLSVMEKPDLDHIEGLSPAISIEQKSTSHNPRSTV
GTITEIYDYLRLLYARVGQPRCPDHGFPLEAQTVSQMVDHMLAQDQEQRY
MLLAPVIRDRKGEHAQVFEQLRAQGFVRVRVDGELYEIDAVPPLALRQKH
TIEAVIDRFRPREDIKQRLAESFETALKLGEGMVAVQSLDDPAAAPHLFS
SKYSCPVCDYSLPELEPRLFSFNAPVGACPSCDGLGVAEFFDPDRVVVHP
ELSLSAGAVRGWDRRNAYYFQLIASLAKHYKFDVDAVWNTLPAKVRQAVL
FGSGDEVISFTYFTDAGGRTTRKHRFEGILPNLERRYRETESPAVREELT
KYVSQQPCPACNGTRLNRAARNVFVADRPLPELVVLPVNEALSFFRGLSL
PGWRGEIAAKIVKEIGERLGFLVDVGLDYLTLERKADTLSGGEAQRIRLA
SQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLTRLRDLGNTVIVVEHDE
DAIRLADHVLDIGPGAGVHGGEICAQGTLDDILKSPRSLTGQYLSGKRRI
EIPKQRHKPNPKMMLHLRGATGNNLKNVDLDIPAGLLTCITGVSGSGKST
LINDTLFTLAANEINGASHTVAPHREVENLDLFDKVVDIDQSPIGRTPRS
NPATYTGMFTPLRELFAQVPEARARGYSPGRFSFNVRGGRCEACQGDGMI
KVEMHFLPDVYVPCDVCHGKRYNRETLEIRYKGFNISDVLQMTVEDALRL
FEPVPSIARKLETLVDVGLSYIKLGQSATTLSGGEAQRVKLSKELSRRDT
GRTLYILDEPTTGLHFHDIEALLGVLHKLRDEGNTVVVIEHNLDVIKTAD
WIVDLGPEGGHRGGTILVSGTPEEVAAHKASYTGQFLAKMLPSVKARETR
PAAMANKPDARPPRKVKPEKVAKAAKSATKKTAKKAS
>gid:107182  uvrA2  excinuclease ABC subunit A
MSNATSSSGLVRVRGAREHNLKNVDVDVPRDALVVFTGVSGSGKSSLAFG
TIFAEAQRRYLDSISPYARRLIDQVGVPEVDSIDGLPPAVALQQARGAPS
ARSSVGSVTTISNSLRMLYSRAGQYPPGQEIIYADGFSPNTPAGACPTCH
GLGRIYDATEATMVLDRSLSIRDRAVAAWPGAWHGQNQRDILTTLGHDVD
APWHTLPKKTRDWILFTDEQPVVPVYAGYNVAEVRRALKRKEEPSYMGTF
TSARRYVLHTFATTQSAQMKKRVAQYLLSTQCPQCDGKRLRREALSVTFA
GLDIGELSRRPLDEVAELLRPAAEATPSKGTRKGAHQHPEQVIAAQRIAA
DLRARIAVVQALGLGYLSLERSTPTLSPGELQRLRLATQIRSQLFGVVYV
MDEPSAGLHPADAQALLGALDQLKAAGNSVFVVEHEVDVIRHADWIVDVG
PAAGAQGGQVLYSGPPAGLEQVQASSTRRYLFGTPPQVHRPARAANGWLQ
LRGITRNNVRALDVDLPLGVFTTVTGVSGSGKSSLVSQALVELLAAHLGQ
AQAEDEEALDPLERGTQVPLEGAIVAGLDRVRRLVRVDQKPIGRTPRSNL
ATYTGLFDPVRKLFAATPAARRRRYDPGQFSFNVAKGRCATCEGEGSVYV
ELLFMPSVYAPCPTCHGARYNAKTLEIALRGRNIAQVLEMTVDQAAIFFA
DDAGVLRPLQVLREVGLGYLRLGQPATELSGGEAQRIKLATELQRAQRRD
TVYVLDEPTTGLHPSDVDTLMAQLQGLVDAGNTVIVVEHDMRVAASSDWV
LDMGPGAGGVGGHVVVAGPPDVVARHRGSRTAPFLAQVLRGE
>gid:108621  uvrB  excinuclease ABC subunit B
MTDRFQLVSPYSPAGDQPAAIDKLVANFEAGLAKQTLLGVTGSGKTYTIA
NVVQQVQRPTLVMAPNKTLAAQLYGEFKSFFPHNAVEYFVSYYDYYQPEA
YVPSSDTFIEKDSSINEHIEQMRLSATKTLLSRRDSLVVATVSAIYGLGA
PEDYLSLRLILSVGEHIDQRQLIRHLTDLQYTRNEFELTRGAFRVRGEVL
DVFPAESDTEALRIELFDGDIEQLTLFDPLTGETLRKLQRYTVYPKTHYA
TTRERTLSAVDTIKEELKERLEQLYSQNKLVEAQRLAQRTQFDLEMMAEV
GFCNGIENYSRHLTGKAPGEPPPTLFDYLPPDALLVIDESHVTIPQIGAM
YKGDRSRKETLVEFGFRLPSALDNRPLRFEEWEARSPRSIYVSATPGPYE
VRESAGEVTELVVRPTGLIDPVVEIRPVGTQVDDLMSEVHERIKLGDRVL
VTTLTKRMAENLTEYLGEHGIRVRYLHSDIDTVERVEIIRDLRLGKFDVL
VGINLLREGLDMPEVSLVAILDADKEGFLRSTGSLIQTIGRAARNLRGKA
ILYADKMTRSMQAAIDETDRRREKQVEYNLEHGITPKSVERPIADIMEGA
RDDAAEKKSGKGRSKSRHVAEETPDYRAMKPAEIAGKLKSLEQKMYQHAK
DLEFEAAAQIRDQIQKLKAASLG
>gid:108088  uvrC  excinuclease ABC subunit C
MSVRPQNDFDGKAFAAQLSTAPGVYRMYAGDDTLLYVGKAGALRKRVGSY
FNGTPKNARLTSMLSQVARMDVTVTRSEAEALLLENQLIKSLSPRYNVSL
RDDKSYPYVLLTREQWPRIALHRGPRAVQGRYFGPYTGVTGVRETLSLMH
KLFKLRSCEDSVFRNRSRPCLQYQIGRCSGPCVDLVAAPDYAESVRRATM
FLEGKSDQLGEEIMHSMQQASEALEFERAARLRDLLSSLRSMQNRQYVDG
RAADLDVLACATQSSQACVLLLSFRDGRNLGTRSFFPKTNGEDSADEILG
AFVSQYYAEHSPPREILLDREIPEAELIEAALSTAAEHKVALKWNVRGER
AGYLLLATRNAQLTLVTELTSQSAQHARSEALRELLGLAEPVKRVECFDI
SHTMGEATVASCVVFDASGPVRGQYRRFNISGITPGDDYAAMRQAIERRF
RRAVEENGVIPDVLLIDGGAGQLAQAQAALADLGVENVLLVGVAKGEERR
AGHEALIMADGRELRPGAASPALQFIQQVRDEAHRFAITGHRGRRQKARM
TSKLEDIPGIGPRRRASLLKHFGGLVGLKAAGEAEIARVEGVNAALAARI
YANLHGLALPDAAGEASPQ
>gid:110147  uvrD  DNA helicase II
MPVDVSHLLDHLNPAQREAVSAPPGHYLVLAGAGSGKTRVLIHRIAWLNE
VQGVPNHGIFAVTFTNKAAGEMRHRTDLQLRNGSRGMWIGTFHGLAHRLL
RLHWQDARLPEGFQVMDSDDQLRLVKRVVQALELDESKYPPKQMSWWINE
QKDEGRRPQHIQPEPNDDWTEVRRQVYAAYQERCDRSGLLDFAELLLRAH
ELLRDTPALLAHYRARFREILVDEFQDTNAIQYAFVRVLAGESGNVFVVG
DDDQAIYGWRGAKVENVQRFLKDFPGAQTVRLEQNYRSSANILGAANAVI
AHNPDRIGKQLWTDSGNGDPIDLYAAYNEVDEARYVVERARQWVRDGGSY
GEVAVLYRSNAQSRALEEALISEQLPYRVYGGMRFFERAEIKDALAYLRL
LTNRSDDAAFERAVNTPTRGIGDRTLDEVRRLARASALSLWEAAMLCTQQ
NTLAARARNALATFLSLIGQLHAETGEMELAERIDHVLMRSGLREHWAKE
SRGGLDSESRTENLDELVSVASRFTRPDDEDSQGMTELVAFLAYASLEAG
AGQAQAGEEGVQLMTLHSAKGLEFPIVFLVGLEDGLFPSARSLEESGRLE
EERRLAYVGITRARQKLVLCYAESRRIHGQDNYNVPSRFLREIPRDLLNE
VRPKVQVSRTASLGAARGGPVHGIVETAPIKLGANVEHPKFGGGVVVDYE
GAGAHARVQVQFDEVGAKWLVMAYANLTVV
>gid:108432  xamIM  XamI DNA methyltransferase
MNVQTEQELVAFCLALIGGHGGLSAAERKLVKTAPASALKLRDIEAIRKA
ISRGTDPLGEAFSSIRSAAERRAVGAVYTPAPIVRSMMTWLASQGTPARI
VDPGAGSGRFILAAGLAFPDAQLVAVEMDPLAALMLRANLSARGWTDRAT
VLVKDYREVKLPRCAGMTAFIGNPPYVRHHDIAEDWKAWYASNFADFGIK
ASALAGLHLHFFLQTRLLAKAGDVGAFITSAEWMDVNYGSALRRLLLDEL
GGIALHVLEPTVEAFPGTATTAAITCFRVGETAEPVRVRAVDELERLNGL
TKGTDIPRERLHAAPRWSIIVRPSEPAAAGDIELGELFRVHRGQVTGAND
IWIAGEHANGLPDRVKLPAVTKAKDLIQAGAHLHSAEVLRRVIDLPAELD
DFTKEERRRISAFLSWAKINGADQSYIAQHRKAWWSVGLKAPAPILCTYM
ARRPPQFTLNACDARHINVAHGLYPRQPLADGVMARLVTWLNKNINTGSG
RTYAGGLTKFEPKEIERLRIPSLETLLA
>gid:106632  xerC  site-specific recombinase
MSSVDEFLTYLQVERQVSAHTLDAYRRDLAALVVWASEQKTDDGVQDAAV
PAETAQFDSAHLRQFVAAEHRRGLSAKSLQRRLSACRSYYAWLLKRGRIS
ASPAAALRAPKAPRKLPQVLDADEAVRLVEVPTDAPLGLRDRALLEVFYS
SGLRLSELCALRWRDLDLDSGLVMVLGKGSKQRLVPVGSHAIAALREWRR
DSGASADSHVFPGRAGGAISQRAVQIRIKQLAVRQGMFKDVHPHMLRHSF
ASHILESSGDLRGVQELLGHSDIATTQIYTHLDFQHLAKVYDAAHPRARR
KKAAE
>gid:109547  xerD  integrase-recombinase XerD
MQPADASVIERFLDRFWAEQGVARQTLESYRRDLEGLARWRDGAGGGLLG
IDRAALFDYLRWRTRANYSPRSTARLLSTLRAFYGLCLRDGARSDDPTAL
IDPPHLPRSLPKALTESQIEALLAAPDLDTPAGLRDRAMLELMYAAGLRV
SELVNLPAVGVNLRQGVLRVTGKGSKDRLVPLGEESQHWLERYLREARPL
LAANKPVAAVDGQVPLFIDVSRQPLSRQQFWALVKRYAAVAGIDPATVSP
HGLRHSFATHLLNHGADLRALQMLLGHSSLSTTQIYTLVARQHLQKLHAS
HHPRG
>gid:108406  xseA  exodeoxyribonuclease VII large subunit
MADRNEQILTPSQLNSLARDLLEGGFPLVWVEAELSSVTRPASGHLYFTL
KDARAQIRCAMFKPKSTWLKFQPREGLRVLARGRLTLYEARGDYQLVLDH
MEEAGEGALRRAFDELRARLTAEGLFDAERKQALPAHVRRLAVITSPSGA
AVRDVLSVLARRFPLLEVDLLPSLVQGDSAAAQITSLLQRADASGRYDVI
LITRGGGSLEDLWAFNDERLARAIAAAQTPVVSAVGHETDFSLSDFVADV
RAPTPSVAAELLVPDQRELVARVRRAQARMTQLQQHALGNAMQRADRLAL
RLRAQSPQARLQLLHRRQEDAGRQLRARMMHVLERLQARVQRGQAQLQSH
NPQRHLAGLQQRLRALHPQAAMQRRLQHDQLQLRRIARSLEAVSPLATVA
RGYAIVTRPANGSVVRSAAEVVTGERLRAQLADGSIEVRVESGES
>gid:108757  xseB  exodeoxyribonuclease VII small subunit
MAKKSLNETSPVARFEQSLEELEQLVQKMEVGDLSLEQSLTAYERGIGLY
RDCQQALEQAELRVRLLTDPARPELAEAFEPPSLDG
>gid:110167  xthA1  exodeoxyribonuclease III
MKIASWNVNSLNVRLPHLQQWLAAFAPDVVGIQETKLEDHKFPDAALAAL
GYRSVFCGQKTYNGVAILSRSPAIDVQMGIPGLDDVQQRVIAATVDGVRI
INLYVVNGQDVGTDKYAYKLRWLAAAHDWIAQELQRYPQLVVLGDFNIAP
DARDVHDATVWNEHHILTSTDERAALDKLLALGLHDAFRLHNQDADHFSW
WDYRQAGFRRNLGLRIDLTLVSDALRTRAVESGIDREPRTWERPSDHAPA
WVRLAEVGA
>gid:108026  xthA2  exodeoxyribonuclease III
MSGTSRKIATFNVNGIASRLPHLLEWLQRDQPDIVGLQELKSTQVAFPEQ
AIRDAGYGVIWQGEKSWNGVALLARGIEPVEIRRGLPWDPRDTQSRYLEA
AIHGVVVACLYLPNGNPQPGPKFDYKLAWFERLIRHAKTLVDLPHPVALI
GDFNVVPTDADIYDPKGWRKDALLQPESRAAYQTLLAQGWTDSLLAIHGQ
TPIYTFWDYFRQHFARDRGLRIDHLLLNRTLAPGLQDAGVDKWVRALEKA
SDHAPTWISVRVPDAPAEAAPDTTPMAKVRKPGAAKKAAVPKAAAKKTSS
VTAKKAVKKIVAKRATAGSATPRKSAKKNS
>gid:109103  yoaA  ATP-dependent helicase
MSQLATASIQALSEGGALARQLDAFAPRAAQLRLTGTIAEAFEQRDVLLA
EAGTGTGKTYAYLVPALLSGLKTIVSTGTRALQDQLFHRDLPRVRAALGV
GLRSALLKGRANYLCKYRTQQARGEPRFATPEQVSQFQRIVAWSGRTQFG
DMAELEALPDDSPLLPMVTSTVDNCLGTDCPFYSECFVVQARQRAQAADL
VVVNHHLLLADLALKQEGFGEILPGAQAFVIDEAHQLPELAANFFGESFG
MRPWQELARDCMVEARLVAGAQASLQAPILALDEALRSLRAGMEGLPPRG
TQWRALAKPQVREGFDAVLSSLARLGESLLPLREASPGFDGCTARAQEAL
SRLSRWLGEDLPVADFAQDPPEETVDNDVLWYELSPRGFRCQRTPLDVSG
PLREHREKSQAAWVFTSATLAVGGEFDHIAQRLGLSDPVTLLQPSPFDWA
RQALCYLPPNLPDPAARGFGTALIGALTPVLEASNGRAFLLFASHRALRE
AAEALRGAPWPLFVQGEAPRATLLQRFRDSGNGVLLGSASFREGVDVVGD
ALSVVVIDKLPFAAPDDPVFEARLDAIRREGGNPFRDEQLPQAVIALKQG
VGRLIRSETDRGVLVLCDPRLVNKSYGRTFMKSLPPFARTHAIADVQAFF
GVEPAVGDNGALALPVGGDRSSSS