TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS

Number of genes found: 121

Free access
Sort by:

 



# Chlorobium chlorochromatii CaD3, CaD3

>Cag_0763 Exodeoxyribonuclease V, RecC subunit
MSAFLSIKSIHDKNSSLTFFLISNQFQSGTMALHLYTSNRMEMLVDSLAE
VVRQPLASVFEHEVIVVQSRGMQRWLSMELAGRFGVWANGRYPFPNAMVQ
ELFKQLLPSVAQSDAFKKEVMSWRVMRLLPHLLEMAEFLPLRRYAADDSD
GLKLFQLSEKIADTFDQYTLFRPDMLALWEAGGGVAEGGEAWQPLLWRAL
VEGAGLHRGQLRELLFRQLSRSSSKISELPERITLFGISYLPQFHLELFA
AVARLTEVHLFLLSPTQEYWGDIVSRKAMARLSEAEQALRSEGNPLLASL
GRIGRDFSEMVLEMSDEALDSQEFYDDPPEDSLLHALQWDILHLQGAGEM
DETPRLLQPHDRSVQIHACHTPLREVEVLYDAILGLLEAHPHISLRDIIV
MTPDIESYSPYIATVFGTAREAGKEGKGVVALPFSIADRRMMHEGEIASA
LLKLLALHGSRLTASMLFDFLASPPVSRAFGFDAEALRLIRGWIEGSGIR
WGMDEEDRRERNLPAYRDHSWRAGLERLLLGYAMPEEEQLFQGVLPYGDI
AGSAAEMLGRFAEAVEALERFVSSSEPSRTLEAWRQQYAMWLTTFFAPDE
DSEREFATLATLGEELAEYGINAGFEENISPLVFFTWLRSRLEEQEQGLG
FMTGGITFCAMLPMRSIPFRVVMLIGMNDGAFPRQSRAPSFDLITRQPQK
GDRSLRNEDRYLFLESILSARELLYISYVGQSIRDNSEIPPSVLVSELLD
AVRRAFVLPNESSIEQHLVVRHRLQPFHHDYFSEHSPLASYSSENYYALI
ASEQSLQAVPPIRSFISTALSEPTAEWRTVQLEQLLHFYDNPSAFFLEQR
LGIKPEGLLLPLQDSEPFAVESLERYRLQQELLEAQLRGQPAEALLPLFK
SRGMLPPAQHGELLFATVMQEVDDFAATLRQHLAGEVALAPLEVDIEVGE
FRIVGRLDGIWANAMLRYRPARMKVRDRFRWWIEHLLLCALQPTGYPLTT
HMLMSDGEWSYPPIDNPHQHLTTLLQRYWQGLCEPLPFFPRSAYAFVLKG
MDANHHLDVGKGIDAAYREWRDDTFTNRKGEGSDSAIQRCFGAAANPFSD
TFIELALELFTPMMEAMGAMGDGKRSG
>Cag_1296 conserved hypothetical protein
MIISASRRTDIPAFYGEWFINRLRVGEVLVRNPMQPKQVSHIALTPETID
ALVFWTKNPNPFFRYLAEIDAFGYPYYFLFTITPYDTTIEPHVPTLEKRI
AHFQYLAKRIGAERVVWRYDPILFTKTLSPTWHIAAFRHIANALSGYTKR
CIISFIDNYRKVRRNMASLPLITPNEGMITQLLQTFTNIAEQQQINLQVC
REEIDVTHYGIANGSCIDRSLVEQLCGRPLVGIGKDKNQRKTCGCIASRD
IGRYDTCLHGCRYCYAVSNHAKAAAAYKNFNPDTPLLCNELCGNETITCA
PKQNQSKLECLPLFEKT
>Cag_1447 SMF protein
MDILNFLMLSQVPGIGAARIKALLTHWGNLSFLQHATIADLTHINGIGET
LATELYNTFHNAAKNDTVRRAAEAQLLALERCNGQVLTLLDEGYPPLLRE
IYDPPPCLFIRGTLPPNTEKSLAVVGTRHASAYGKQVTTHFCHAIAKQEM
PIISGLAYGIDMAAHQAALDAGGTTVAVLASGIDTIYTDPKGLLWPKILE
HGAIVSEEWIGSHITPAKFPKRNRIISGIAKGTLVVESDLKGGALITATT
ALEQNREVFAVPGSIFSHTSRGTNKLIQQGQAKAIMEVDDILMELQPSQP
HQAKPIHPTKATANATTTTATTQLPLLNPLESQIYQALSSSDPTHIDTLA
ATLQLDLSTLFLHLFELELQGVIEQQPGQLFLRKA
>Cag_0770 Exodeoxyribonuclease V, alpha subunit
MITYNERPIDRHFAKMLLQHCGNSKHELLPLLFSMVSNAIGQGSVCLNLA
DIAAQSVTYGNRTVQLPPLAELMRLFSTLPVVSRNGAEFRPLVIDNVGRL
YLYRYWRYEHDLAEALRQKASTKSCTIEKKSEAVQVLLQQLFPEGSDAQQ
KQAAEVALHRRFCIISGGPGTGKTTTVVRIVALLLEQAGGERLRIALAAP
TGKAAARLKQSISTIRGTLSCSQTLQQAIPSEVVTIQRLLGAIPNSTRFR
YHQRNPLPYDVLIVDEASMVSLSLMHALLMALKPECRLILLGDRHQLASV
EEGAMLGDLCSAVGEATPHSPLAGTLVMLEKSYRFQTGGAIAELSRAMNQ
GEGEQALALLQSNQSAALRWQPLPTPDALPSALGRAAVAGYRAYCEATTP
AEAMERFERFRILAALREGIYGVSGLNRFVEQALAREGLLAPTSLWYAYR
PVLITVNDYNVRLFNGDTGLLLPDAENGGVSAWFTTPDGGLRRLPPERLP
AHETAFCMTIHKSQGSEFDNVLLILPPTDTPLLSRELLYTGVTRAKSRVE
VWGDPTFVQAACKRTTIRHSGFREALALE
>Cag_1136 conserved hypothetical protein
MNNMQYNRRSIRLQGYDYSQSGAYFITICTQNRECLFGKIVDGNMILNDA
GEMIKNIWHKIPTYHPYSYLDAMCIMPNHFHAIIMTVGADSISAPIDSIS
APIDSISAPTIGAEMDSAPTLGNIVQTFKRYTTIEYIKMVKQNKLPSFNK
RIWQRNYYEHIIRNESDYTHIYDYIQNNPQQWEMDTLYPNTL
>Cag_0075 DNA topoisomerase I
MASSVAALSAKNKTLIVVESPSKAKTINKYLGSNYTVFASVGHIKDLPKK
EIGLDFEHNYSPRYEIIPGKEKVVKQLKKLATEASNILIATDPDREGEAI
AWHIANEIEHAKAPVARVLFNEVTKKAILEAIEKPRHIDLRLVHSQQTRQ
GLDKIVGYKISPFLWKVVLRGLSAGRVQSVALRLICEREEEIERFVIQEY
WTIAADFLTANKESFRARLVRLDGDKPEITNVEQAEAIAAIAKKGNYSVR
EITPRIQQRKQPLPFTTSLLQQAASNQLGFGAQRTMRTAQQLYEGIELGA
EGAMGLITYMRTDSTRISPEAVGEARNYIERNFGKDYVGAGSSGKPGKNA
QDAHEAIRPTSLLKTPEQVKPYLSADQFKLYELIWKRFLAAMMAPAKIEQ
TKVDVEEQSGKFLFRANGSRVLFPGFMRVYDDQQELAYEAQTSTKEEVEN
EMVVKLPEKLAVNDPLGLGALEQKQSFTRPPARYSEASLVKDLDHFGIGR
PSTYASIFSTLQDRRYVALEKRKIMPTDLGRDVAKILVANFPELFNVGFT
AFMEDELDKVASGDDAYEKVLDSFYKPLTSALALRSATPLIPQNNEAETC
DKCGTGKMILKWTASGKFLGCSNYPKCKNIRTISSNREKPASTGVHCPSC
EDGEMVLRKGRLGPFLACSNYPKCNTLLNLNKQRHIEPPKTPPVVTDMAC
PKCGAPLYLRSGKRGLWLGCSKFPKCRGRLAWTALEPAAQERWERVMAAH
QKAHPPVTLKMVDGSTVSMTSSIDDIIMKADAAGLIAPAMDLVPEAEG
>Cag_2007 DNA-directed DNA polymerase B
MENLINHIVTNNLLFGKDKEERIVGAYQLSDTHIRLFNRNGDTVTFHDEP
FYPYFFLSDSSLLETFVPENQEKFWLVPLAGSNYYTALAIFKSSRNHKNA
VDFLNRKWNGNQAAQGEAAGKNSMESNPFMYNKGDTITQYLMQSGKTMFK
GMLFDDIYRMQLDIETNYNGEKKGFYDDEIIIISLSDNRGWEQPLHSKGR
NEKELLQELIAVIQEKDPDVIEGHNIFNFDLPYIQRRCERHSIPFTIGRN
QTIPRTYPSSIRFGERTIDFPYCDIPGRHVIDTLFLVQGYDVAKRSIESY
GLKNVARHFGFASANRTYIEYKDIARLWQEEPNTLLAYALDDVRETQALS
SLLSGSNFYMTQMLPYSYAMTARLGQAAKIEALFVREYLREKHSLPKPTS
GQQQSGGYTEVFLKGILGPIVYADVESLYPSIMLSYNVCPKSDALRVFPN
VLRSLKELRFKAKDQAQQELQAGNKRNADNFDAMQASFKIIINAMYGYLG
YSGGIFNDYGEADRVTTTGQGIARKMIAEFEKRGCKIIEVDTDGIFFIPP
ASIASEQEEKALVEEVSQQMPDGINIGFDGRFKKMISYMKKNYALLSYNN
VMKLKGSSLNSRSAEKFGREFIRRGFQMLLAEDIKGLHLLFAEYKEKILN
HQLSIEEFSRSESLKQTKEQYLEDVASAKRSKSITYELAIRKGMEIRKGD
KISYYITGSGSSNFSWDKGKLAAEWDPNKPDENSAFYLKRLDEYSQKFLP
FFKPQDYSMIFSTGSLFAFSEEGIELLKEIPNTDSQTE
>Cag_1762 site-specific recombinase, phage/XerD family
MFMSNSALHQPLPRLLQESALPIQAFLEHVAQRRGLSPNTVVAYRGDLIQ
FFTFLAQHLELLDLRAFQPESVTPMDVRLFMGFLLEQGVKQRSIARKLVA
VKVFYRYLQEHGIITTCLFSSLGSPKFPQRVPNFLTEEQTSKLFELLETV
PNGAVSDSQPANSALHAFTAARDCSILELLYSSGLRVSELVNLRMDELDV
ERGYVKVHGKGNRERIVPVGAAAIEALKKYFEVRRNFFRMNKEVEPFTSV
FVTQKGAKIYPMLVQRVTARHLSLVTEQKKKNPHLLRHTFATHLLNSGAD
LESVSEMLGHSNLATTELYTHVTFERLKEVYRKAHPNA
>Cag_0543 conserved hypothetical protein
MQLPCIVPSTLRRYLPTLFVTSLALLPLTPISVYGEMAEDLAALSSSSES
DYNSEVALALLEELRHHPLSINRATANELRQLPWLSAADVHAIIKYRTQK
GAFRSLSELETILGKERATWLSPYLTVEAAPVPAKTTVRPKATSTSKKTK
KVATTGSLYSRYFTEMPPRKGILTEKYEGGNSKMYHRAQFYAPHVSASVV
QEKDIGEAAITDFTSLSVSVADVGMMERVVLGNYRLTLGQGLMIGQGRFF
SKGAEVGGRLTTKTLMPYASASEEGFLQGAAATLQIQPIALTLFYSANQR
DAIINKEGVITSLSSSGYHRTTLEVSRKDNITENVMGAHLRYRTAVAGME
ATLGGGMMNYSYPYPFDELEPNEPVSTVLGATLTNVDATLSFGSGALFAE
AAFASDPHDMAWFAGAEYEPLRGVTAVAALRRYGENFYSPFANAFAERGG
GSNEEGLYTAVQAAFSKKVTLGAYYDRFTFPQLGSHYQQAADGFDARAWF
SWQQSSLLCWNVQVQHKEKPEEKNQGTTKNPIWTPLPILTDRLQLNCEVT
PHKGISLRTRFELKNVDKEYLLATQSFTGKMWYQQVGYRTENFSLKGRFT
RFTTTDYAAAIYAYEDDLPLTSSLGMYSGDGSSLFAVATWQPMKQMKVAA
RYEVTRYNDRDVYSSGNDERATNAPSSLHVGCMLSF
>Cag_1507 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_1317 MutS 1 protein
MAKEQSGTKEHSPMMRQYLEVKERYPDYLLLFRVGDFYETFFDDAITVST
ALNIVLTKRTADIPMAGFPYHASEGYIAKLIKKGYKVAVCDQVEDPADAK
GIVRREITDIVTPGVTYSDKLLDDRHNNYLAGVAFLKEGKTLMAGVAFID
VTTAEFRITTLLPEELPHFLAGLHPSEILFSTQEKERTLLLKKSLPSETL
ISLLEPWMFSEEQSQTVLLRHFKTHSLKGFGIETAGGNRAALVAAGVILQ
YLEETRQNSLSYITRIGELHHTEFMSLDQQTKRNLEIISSMQDGSLSGSL
LQVMDRTRNPMGARLLRRWLQRPLKKLTNIQERHNAVEELVENRTLRESV
AEQLAAINDLERSLARIATLRTIPREVRQLGISLAAIPTLQALLSDVTAP
RLQALTAALQPLPKLAEQIESAIDPDAGATMRDGGYIRAGYNEELDDLRS
IASTAKDRLMQIQQEEREATAISSLKVSYNKVFGYYIEISRANSDKVPAY
YEKKQTLVNAERYTIPALKEYEEKILHAEEKSLLLEAELFRNLCQQIATE
AATVQANAALLAELDALCSFAECAVAFDYTKPTMHEGTTLSITAGRHPVL
ERLLGAEESYIPNDCHFDDKQTMLIITGPNMAGKSSYLRQIGLIVLLAQA
GSFVPAESASLGVVDRIFTRVGASDNLTSGESTFLVEMNEAANILNNATE
RSLLLLDEIGRGTSTFDGMSIAWSMCEYIVHTIGAKTLFATHYHELAELE
ERLKGVVNYNATVVETAERVIFLRKIVRGATDNSYGIEVAKMAGMPNDVI
SRAREILAGLEKRDVEIPRQKAPKVNTMQISLFEETDNQLRNAVEAVDVN
RLTPLEALLELQKLQEMARSGGY
>Cag_0088 DNA gyrase, subunit A
MQRERIVPISIEEEMRGSYLDYSMSVIVSRALPDVRDGLKPVHRRVLFGM
HELGLQAGKPHKKSARVVGEVLGKFHPHGDTAVYDSLVRLVQDFSLRYPL
IDGQGNFGSVDGDSPAAMRYTEVRMKSIAGEMLKDLEKETVDFALNFDDS
LEEPTVLPSAIPNLLVNGASGIAVGMATNLAPHNLREVVNGIIALIEQPE
IEIQELMKHVIAPDFPTGGIIYGYEGVRQAYLTGRGKVVIRARALVEVTQ
KNGRESIIVTELPYQVNKVRLIEKIVELVHDKKVEGIADIRDESDREGMR
LVIELKRDAVAKVVLNNLYKHTPMQDTFGVINLALVDGVPKILNLKEMMQ
YYVKHRNEIVLRRTRFDLAAAERRAHILEGLKICLDNLDEVISTIRQSPD
TATAQERLIERFGLSEIQAKAILEMRLQRLTGMERQKIDTEYIEVLALIE
ELRFILNSPEKQMEIIREELLKVKDVYGDERRTEIVPQEGDFSIEDMIAQ
EDVVITITHDGFIKRFPVSGYRRQARGGKGVTGAQAKNDDFIEHMFIAST
HNYILFFTTSGRCYWLKVYEIPEAGRAARGRSLANIMELPPGEKIRTYIN
IRNFEEPGFIVMATTHGIVKKTALEEFSHPRRTGIAAITIDEGDELLDAR
LTDGDHQIILAKNSGFVVRFPENEVRPMGRTAMGVKGITLDEDEKCIAMV
TTRRMDTALLAVTDNGFGKRSRVEDYRLTRRGARGVITLKPHEKIGALVG
LLDVNDEDDLILITVNGIVNRQHVSDIRITGRNTSGVRLIRLMQGDSISA
LARVPKSDEEGDGDFPLEDADGQIPLFE
>Cag_1341 Excinuclease ABC, C subunit
MEPLDALEKHGDIKKVLTEKLATLPTSPGIYQFKNSAGRIIYVGKAKNLR
NRVRSYFRNSHQLFGKTLVLVSHIDDLEVIITSSEVEALILENNLIKELK
PRYNVNLKDDKTYPYLVITNEPYPRILFTRHRRNDGSIAFGPYTEARQLR
SILDLIGSIFPVRSCKLRLTPDAIASGKYKVCLDYHIHKCKGACEGLQPE
DEYRQMIDEIIKLLKGKTSALIRSLTENMHLAATELRFEQAAEIKAQIES
LKRYAERQKVVAADMVDRDVFAIAAGEDDACGVIFKIREGKLLGSQRIYI
NNTNGESEASMQLRMLEKFYVESIEPVPDEILLQEALSEEEEETLRAFLL
VKAKNEGQEKKGIRLVVPQIGDKAHLVGMCRQNARHHLEEYLIQKQKRGE
AAREHFGLTALKELLHLPTLPQRIECFDNSHFQGTDYVSSMVCFEKGKTK
KSDYRKFKIKTFEGSDDYAAMDEVLRRRYSGSLTESLALPDLIVVDGGKG
QVNTAYKTLQELGVTIPVIGLAKRIEEIFTPHSSDPFNLPKTSPALKLLQ
QLRDEAHRFAITYHRKLRSDRTLQTELTTIAGIGEKTAFKLLEHFGSVES
VAQASREELQAVIGAKAGETVYTFYRPEG
>Cag_1695 reverse transcriptase family protein
MKRKGKLVEQIADLHNLYEAFYKAQKGKQAKRYVCAYRKQLQENLQLLRH
QILSGAIQTGKYHAFTIYDPKERVICATPFSQRVLHHAIMNVCHPFFEKH
QIAGSFASRKGKGTYAALDKAREYNCCYRWFLKLDVRKYFDSINHTVLQK
QLTRLFKDKTLLLIFEQIIDSYSTADHKGVPIGNLTSQYFANHYLSVADH
YAKEGLRVPAYVRYMDDMVLWHNEKEELLAMGYMFQTFIAKELLLELKPF
CLNATHKGLPFLGYLLFENQARLAPRSKKRFLAKYQRYENNLQSGVWTQQ
EFAKHALPLFAFTEYAQAREFRKKSLHSFCSLEGVFVRSSKGID
>Cag_1918 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_0179 DNA polymerase A
MAMLYRAFFALQRTGMSSPSGLPTGALYGFTTALLKIFENYHPHYLVAAF
DSREKTFRHHLLESYKANRAAPPEELLQQLEKLFELLKAFGVPVIKQAGY
EADDLIGAMVTQFADVCRIGIVTPDKDLAQLVREGVQILKPGKNQHELEP
LGCNEVKAHFGVPPKQFTNFLTLTGDTSDNIVGAKGIGPKTAATLLEKYQ
TLDKLYQHLDELTPKVRKSLEDFAPNRELVLQLVTICCDAPLHVTLEELA
CKNPARDVVLPLLQELGFRTIAARLQAASVALTCACNDGGESAPPMQSDP
NSSNLLNGSDGNTSATDTAPPPSFPDVPRHYTLVETREQLQALLEELQQV
THIAVDTETTSLDVFEAELAGISLCAEAGKAFFIATTPDALERKEVVKQL
KPLLENPAITKSGQNLKYDMLVLKKYGIELAPISFDTMLASYVLNPDEHH
NLDDMALRYLGRTTTKYDELTGTGKQRRHIFEVEKEALTNYACQDADVAF
QLEEVLQAQLQAEPQLLALCTTMEFPLVRVLATMEYAGIAIDTEHLARVA
ETTELELQSLTDNIYAAAGSSFNIDSPKQLSHVLFTDLSLPTGKSTKTGF
STDVGVLEELAATYPIASDLLSYRTLQKLKGTYIEALPKIINPRTGRIHT
SFNQHITATGRLSSSNPNLQNIPVRTALGKEIRRAFIPSTPEHWLLSADY
SQIELRIAAELSGDERLIAAFRNGEDIHTATAQVIFGTEEISSDMRRKAK
EVNFGVLYGIQPFGLAKRLNIPQKEAKVIIETYKAKYPQLFNVLRHIIEE
GKEKGYVTTLLGRRRYIADLNSRNGTVQKAAERAAMNTPIQGTAADIIKC
AMNLCYQQMQASGMASEMLLQVHDELLFETTDSEKEALTKLVENAMKEAA
VLCGMKQVPVEVDCGVGKNWLEAH
>Cag_1744 ribonuclease H
MKKQVTIYTDGACSGNPGPGGWGALLMFGSITREVSGSSPATTNNRMELG
AAIEALALLKEPCLVDLYSDSSYLVNAINNGWLQRWQRNSWQTAAKKSVE
NIDLWQKLIKLLKVHEVRFHKVKGHSDNAYNNRCDQLAREAIKKTS
>Cag_0599 hypothetical protein
MENFEKIKKILTSNFEIELFEAALASLNDKSNRLRFNNFAYSIRELSRHF
LYSLSPELNIKNCRWYKTETNDDKPTRAQRVRYAIQGGISDELLEDWSFD
ILGLADTIKSVVSSINSLNKYTHINPEVFDLKDEEVKEKSILVLETFSKF
VETIKEYREELKKFLDGHIENHMINSVISNFFKNVDCLAPHHSLEYCEVS
DYHISEINDKKIVVNVTGDLHVVLQYGSSSDRREGDGLDLNENFPFETKI
RYEISEDFPSDNHEVDDYDVDTSKWYE
>Cag_0769 Exodeoxyribonuclease V, beta subunit
MHHQPLNHTTVTLAGINLIEASAGTGKTYAIASLYVRLLLEKQLLPEQIL
VVTYTEAATQELRGRIRSRIREVLEVFEGAATSDAIVQRLYDQALEQGDD
MVERARMALVQALALFDTAAIFTIHGFCLRVLQEHAFESGSLYDTTLVTD
QRALLLEIVEDFWRTHFFGEASPLLAYTLQCGGSPESFLALLQKLHVSGG
ATIIPTFCDEEREALHATCLVAYAELCRLWQSDGAAVRELLSTDKGLSRA
ADYYRADKLELLFAGMEEFIAGGNPFNLFADFQKFATSGIAAGTKPKGTS
PDHPLFACAEKLLQAVQKRYVALKSELVQFYQRELPKRKRKANFRFFDDL
LSDLADALQAPERGVALAQRLRSTYQAALIDEFQDTDPVQYIIFQTMYAD
SDAPLFLIGDPKQAIYSFRGADIFAYMQAARAVEASRRFTLSENWRSTPQ
LLNAFNQLFSNERLPFIYPDIIYHPLQAGNPDVANGEESAPALQFYLLEG
DDAKGDVLSVEQGEALAAEATAGELYRLLQAGEIIGGKQVAAGDCAVIVR
THAQAAQMVAALQRRGIAGVVRSDKSVFATREAEELRQLLIALADPAHEV
KVRSALITDILGRSGDDCAELLADEVAWLQVLRRFRHYHHVWQHRGVMVM
SRELMADEGVRGRLLASPDGMGERRLTNVLHCIELLHRQEHEHGFGCEEL
LQWFSERISLQDELQEEYQLRLESDEAAVRIVTVHASKGLEYPIVFCPFL
WNSVGNRRDEVVSFHNEVWQLVKDFGSPERDRHRVLAGRESLAEQLRLLY
VALTRAKYRCTVLLARIKSEASAFNYLLHASDATRQSNKVVLELEQEMKG
ISSEERKVRLHDIAKQSAGAIGVRQLSRVEIEALKEQPRLVRQRSAEPLH
LRHFAGTVDGSWRVASFTSFSRHESTSTHFASPELPDRDEVRSSTSASTM
QPTLPSEQSIAAFPKGARAGILLHALFEELNFANPTDEAIAERVTEELAR
SIYPLSWQSTLITIVQAVLQTPLAALDGSTFQLGTLHAKSWITELEFFFP
LRFINSKELSALLTRHGVLPGGIALADMVEVLDFKPVRGMVMGFMDMVFE
SGGRYYLLDWKSNYLGASPADYTLEAMGRAMQEHLYPLQYLLYMVALHRY
LALRIPNYRYSTHIGGVIYVFLRGVTPEFGEARGFYRDLPSEALIEELTA
LLVDFEG
>Cag_1981 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_1872 exodeoxyribonuclease, small subunit
MTSSQPNEPSLEELLQRLDEITHTLENPDTGIERSIKLYEQGLLIAERCR
KRLEYARSKIEKLKPNSSSSLPQFPLTDDLFN
>Cag_0603 RecJ exonuclease
MKRYRWKCFMPHEETVAALSESINVSQPIARALCNRGISTYNEAKEFFRP
VLSTLHSPWLFNDMERAVERLVRALKNGETILLYGDYDVDGTTGVALLLL
FLRHHGVEPLWHINDRFAEGYGLSPEGIDRVIASGTTLLITVDCGIKDHA
AIRRCGEHGVEVIVCDHHEADVTPEAYAILNPKVVGSGYPFRELCGCAVA
FKLVQALAERLGDSEAVWHQFLDLVAVATAADLVSLTGENRTLVIEGLQQ
MRSKPRKNFSEMFRVMKVSLGDVRMFHLAFGIAPRINAAGRMHSAHLALE
WLLASAPDAVEQHTEALERVNVQRRSLDSTIMSQADKMVESHCASYCSSI
VLYDEAWHLGVLGIVASKLIDKYYLPTVVLGGMNGLVRGSVRSIEGLNIH
AVLQHCSHHLEQFGGHHQAAGLTLKPENLAVFRKAFDEQCANQLTIEQRQ
KVMEIDAVVELEQITDKFIAVLEQFAPYGIGNREPLFMSERLQLAEPARL
LKERHVKFAVRDKQKRRFEVIGFNRPDIYNDLRAVKHPTITMLYTIERRQ
WNGMWQVQLLLKDLEVQR
>Cag_1545 NUDIX/MutT family protein
MASRFRGVFKQSGVIPLFDDKVVLITARKSDRWIIPKGYIELGMSAADSA
AKEALEEAGLVGKVGEHPIGKYRYNKSGRHFVVLLYPFFVETMLDVWDEV
HERERCVVSPDVAATMVAHSDVGRLIRSYCASLDDDEAVLVPPHVASAIT
G
>Cag_1387 ATP-dependent endonuclease of the OLD family-like
MNSILYGVGNKFIQTNTFERNDLHNLDYTNQIRIRIELQGSDFTCPQYWD
RQSNSYRTTKSITGTYEITTEIDDSELKSGMQPSMFGMNKHYNIFYINFH
NIKDEIKTQRTSWGNLTSFLAKHIKSIVDTDTSMAAKKEDYENEVELATD
KVLKNSQLSAFIDKIKENYSTNLRNNSCEVKFGLPDYEDIFLQMIFKVGL
NGDNANLIPIDHFGDGYISMFVMAVIQAIAESNTDDRCLFLFEEPESFLH
ENHQEYFYKTVLCNLAEKGHQVIYTTHSDRMVDIFNTKSIIRIELEEQDK
QTVVKYNNVGEFSPTMPTNSNGQEIISFANFNSYIKSVEPNLNKILFSRK
VVLVEGPNDILAYKIAIEREVEKAHGDKKYAETYLSFLNIAFVVHHGKAT
AYLLIELCKHFGLDYFVINDWDFETDFVTDLANFQDENTLKQDNLYLKDG
ADDRSSNSKAMITTNWKLLNNSGIDKIHFNIPKLERVLGYQSDDKDSLGI
LNTVQKLIYYTETFLPTKLKEFLELDKLTNLTENVVETANSEVDTDELPF
>Cag_0029 DNA gyrase, B subunit
MPPAAYGATNIQVLDGIEHVRMRPAMYIGDIHSRGLHHLVYEIVDNSIDE
TLGGFNDYIFVALNADGSITVIDHGRGIPVDMHPEKQKSALELVMTVIGA
GGKFDKGAYKVSGGLHGVGASVVNALSEWCEVEVYRDGKAYYQRYERGVP
QGDVKVIGDSDQRGTKTTFMPDGTIFKTTEFRKEIIIDRMRELAFLNKNL
RIIVQDTNGEQEEFHFEGGICEFVRFTDQNRLNLLREPIYLYGERDGTVV
EIALQYNDSYQENVFSYVNNINTHEGGTHVTGFRKALTRTLNSYAQKNDL
LKNLKLTLTGDDFKEGLTAVISVKVAEPQFEGQTKTKLGNSETQSIVETV
VNDQLAEFAESNPNTLKLIIEKVKGAAMSREAARKAKELTRRKSVLESSG
LPGKLADCSINDPEHCELYIVEGDSAGGSAKQGRDRSFQAILPLKGKILN
VEKARLHKMLENEEIKTIILALGTSFGDEEFAVEKLRYGKIIIMTDADVD
GAHIRTLLLTFFFRHMRPVIEAGRVYIAQPPLYLVKSGKDQHYAWDDDER
NSIVDNMKKMQKSKANIHIQRYKGLGEMNPEQLWSTTMDPAHRSLLLVSV
ENAMEADQVFSTLMGDKVEPRREFIEKNARYVRRLDV
>Cag_0332 type II DNA modification methyltransferase M.TdeIII
MEMSMRNDLLTIAEASQWASNYLGKQVTTSNIAYLIQYGRVKKFGHNGST
KISKEHLCNYYATINRQREHSWKEQLGSDLNWSLSFDQYKEAETTKHVHR
LHPYKGKFIPQLVEYFLDDHTDDFKQQMYFTKGDIVLDPFSGSGTTIVQS
NELDIHAIGIDVSAFNTLIGNCKISSYNLKDLQQEINRITVVLKTYLKNS
SVVAFEEHLLQELALFNKKYFPVPEYKYQLRQGIIDEKKYGIQKEQDFLN
FYNSLVHEYGIILYQKNNHHFLDKWYLAPVRAEIDVVFQEIKKVQCKEIK
KILTIILSRTIRSCRATTHADLATLVDPISAPYYCAKHGKICKPLFSILS
WWETYTKDTIKRLAEFDRLRTNTYQICLTGDSRTINIIEVLEQRHPLLAA
LVKQQKVRGIFSSPPYVGLIDYHEQHAYAYDLFGFTRHDELEIGPLYKGR
GKEAKQSYINGISAVLNNCKHVLADDYDVFLVANDKFGMYPIIAENAGMK
IVNQFKRPVLNRTEKDKNAYSETIFHLKEK
>Cag_0321 Exonuclease VII, large subunit
MEAINAMDALSVTELTAHIKSELESLFPFVRVRGEISNCKQHSSGHIYLT
LKDSGAQLPAVIWKSTASLLSIRPKDGMEVVAEGRLELYPPSGRYQLICR
HVAQAGVGALQQAFAELVQKLAALGYFDENRKKTLPTIPTTIGIITSPTG
AVIEDMSKVLARRFPAARIALYPVKVQGAGAAEEIAQALDFFNHTKKKQW
KPQVIIVARGGGSLEDLQPFNEEIMAHAIYRSAIPVISAVGHETDITIAD
MVADVRAGTPSIAAELVVPDSAQVLRDVEQMVAYAQQILNNKIEGAEREL
HSLCNSYAFNRPILKMQQCYENLDRFEASMMRSVETTYRQQIQRCTASIQ
QLNLLDYHKTLERGYALIKKNGRFVTSAKALQPNDTIELLLHDGVRKASV
KPPDAFA
>Cag_1214 conserved hypothetical protein
MKITEAPNCPHCGATMQKCAPPPFNFGDGLGWCTPYMYVCFNDECKLYAN
GWNNLKNNFNKTASYRCICYPDNGVFEAMCVFSPDGMKGQIIEE
>Cag_0555 conserved hypothetical protein
MSIWMLHHKHHRHSIRLPEYDYSTCGAYFITICTQNRACWFGEIINGEMI
LNNVGKMVKDEWLKTEQLRTNVQCGAFVVMPNHLHGIIVINETVGAIHEL
PLQMSQKQRRNMILPKIIGRFKIQSSK
>Cag_0014 Holliday junction resolvase YqgF
MPLYQRIVAIDYGTKRIGVAKSDPLGMFAQPIGTVDRAGLSKLLSPMVEA
GEVQLVVVGYPLNRHGEQTAMTEVIDRFIESLRLEFPALPIETINEHCSS
KSAMQLLVASGTSRKERKTKGRLDTAAACLLLSDYLEQQK
>Cag_0202 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_1749 DNA repair protein RecN
MLASLSIKNIALIEELTVVFHPSLTIITGETGAGKSILMDSLSLVMGDRA
SSSMIRTGANKAVIEAILTDVHSETIEALLADAAIDSRQGELILRRELAA
NGQSRCFMNDTPCTLSLLRQAAEELIDLHGQHEHQLLLRSATHEGLLDDF
AQAHHERATYSRCYQHLQQLQAQRSALVEKAQSLRDKKEFLDFQLQELQS
AQLQEGEEINIEQEITLLENAEQLFTLTTLLHETLYNSDNSAYSNLTAAL
HTLEKLATIDQRFASAIEEARAATTIVDELARFARSYSADVEFNPERLEE
LRERQLLLQRMCRKYGRTHAELIAFEQELCAEQAGAESLDDELRQLEMAI
VTEKKQLSQLAIILSEKRQKAATLLEAHLQQELALLGMPHARFAISITQQ
EKADGDIAVAGNHFAATRTGYDTVEFLLSANQGETARPLTKVASGGEISR
VMLALKSALATSTHLPILVFDEIDTGISGRIAESVGKSLKKLSRLHQIIA
ITHLPQIAAMGDLHLSVQKSVRENRTTTSVTPLDGESRLHAIASLMSGEQ
ISATSLNLAAELLAHGQAVNLPSI
>Cag_1025 conserved hypothetical protein
MKPLPVGIQTFSEIIKQDYLYIDKTSLANELIKKHKYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKDLLIYNQWNWTVTYPVIKISFSGGIRDTESLRE
NLFYILKDNQERLNITCEEKSNANLCFAELIKKAFQHYQQKVVILIDEYD
KPILDNIENIPSALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPDFGNVCGYTQNDVDTTFAPYFDGVDMEEVKRWYNGY
NFLGDKVYNPFDILLFIKNKYVFDSYWFETGTPKFLIDLIKKNNYFIPNF
LDIKVDKSLVNSFDIENINLQTILFQTGYLTIKQFLPSGMGIGYKLGFPN
KEVQISFNNYILQVLTSDSDKEPIRHELFDIMNNGKVANLEPVIKRLFAS
IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRVDLTLKT
LDKTYIFEFKVIAEEPLEQIKKMKYYEKYDGERYLIGIVFDPKARNVSRF
EWERV
>Cag_1301 Excinuclease ABC, A subunit
MNAHGQLTDTSLPDIVLKGINTHNLRNISVRIPRNKFIVITGVSGSGKSS
LAFDTLYAEGHRRYVESLSAYVRQFLERMPRPDIEHVEGIAPAIAIEQKA
LPKNPRSTVGTVSEIYDYLRLLYARIGKIYSRDTNELVLKHTPDDVSLQA
GFIEDGKKFYVGFFFPHHHTAQQLDCSPEEEIANLLKKGFFRLLAGDELL
DLNQEADYQKVLDMPAKVRAELLVVVDRFVARNNDKLFSRISQAAESSFM
ESGGHAVLKVVDGKTYRFSDRLELHDIEYQEPSPQLFAFNSPIGACTTCQ
GFGRIMGIDEDAVIPDKSLSIEEGAIACWNSEKYRWNLLELMHYAPKFGV
PLREPYEKLTFEQKEIIWKGTPDGSFNGIRAFFAEIEKDAGYKMHYRVFL
SRYRGYAICPDCEGSRLNPDALQVKISGRHIGEVTRMSIGEVAEFFRNLN
ISPFDRSVAEVILQEINRRLGYLLDVGLDYLTLDRLTHTLSGGEFQRINL
STSLGSPLVGTMYILDEPSIGLHQSDSARLIALLRKLRDLGNTVVVVEHD
REIIEAADEVIDLGPFAGRLGGEVVFQGSMEAMRSSGTSLTAQYMNGEQQ
IEVPQQRRTVDFSACITISGAMQNNLKNIDVQIPLKVMTCITGVSGSGKS
TLINDILCKGILREKHGSRGTVGTHRSLTGAWLIDRIEHVDQSPIGKSSR
SNPVTYMKIFDDIRTLFANTPDARKKKVKAGYFSFNIPGGRCEVCSGEGS
VHIEMQFLADIEAVCEACNGLRYQPEALAIKFNGKSIAEVLDMTVSEALS
FFKGEKNIVKKLSVLDQVGLGYIRLGQSSSTFSGGEAQRLKLATFIAHAD
TTHTLFVFDEPTTGLHFEDIKKLILCFEKLLEQNNSLIIIEHNLDIIKQA
DWVIDLGPGAGDKGGHLVEQGTPEEVAQCTESLTGQYLRGVV
>Cag_0765 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_1625 Helix-hairpin-helix DNA-binding, class 1
MKWLNSLATKLSLTKAEITLITALLGFLLLGGVVKNFQDVEERTTLIKRA
EAARLDGAEVDSLLRLASLKEGDLSAEPVAEQAEEGEVAPSTKKKSKSAR
SEKKEFHGTVAFNKASAAQLQKIPGVGTVMAERMIAFRLLKGGKVSDMKE
LLEVKGIGAKKLEQLQPYLTLD
>Cag_1630 hypothetical protein
MDNTKQDFEKRKKEIESYFNFLLIFDDDKTKIRYIKDGILVNEKINPVFQ
ITLIANSFLILYNLIESTIRNSIIEIYEKIEADEITYETLSENLKKIWIK
QKTDKLKENNFKQDTLRGYIAEIANDILNRETIRFDKDNLEFSGNLDARK
IRDLADSIGFQKTVNGQNLVDIKNKRNRLAHGEHTFYDVGKDYTVNDVIE
FKTETFNYLSDIITNIDHFISTQAYKIKN
>Cag_1041 hypothetical protein
MNLQEAYKQKAETELELAHTRLVEFRAKVKNLNAEAHLNYAKQLDDFEHG
ITTAKEKLHELGEAGEDAWEKLKDGVESALRSSSKTLQEIADKFKD
>Cag_0658 transposase
MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HNLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEYDSRIWRIIHHYLDEVLEQQ
DLSSVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPEAHITFDKFHIVQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGGKF
EFALPALTHSK
>Cag_1772 hypothetical protein
MLTNITIENFKKLERISFPLSQSVVIIGPNNSGKSTIFQALCLWEIGVKN
YIAAYQKNDLNRQGTITINRRDLLNSPIADARFLWKSKKVTQRNISGAGQ
KHVPLSIELEGDNNGVQWSCQAEFTFSNSESFSCKICTGLQQMVELYENE
HGLHFGFLQPMSGISTTEDKLTKGSIDRKLGEGKTAEVLRNICFEILNPE
TASKNRNNAENNWLQLCNVIKVMFGVILQKPEFIKATGLISLEYIENNIK
YDISSGGRGFQQTLLLFSYMFANPNTILLLDEPDAHLEVIRQREVFQKIN
DIATITNSQLLVASHSEVVLDEAAEASKVIALIENQTFEVNTSTNSKSIQ
YIKKALTEIGWEKYYLAKSKGHILYLEGSTDLQMLLAFATALNHNVAALL
RFANVSYTSDNVPNTAVANFVALKEIFPELKGLAIFDKIEKNLNDIKPLT
VVCWQKRELENYFARPYLLIKYAQSLHEKYEQFSLEQLEKAMKKAIEDFT
LRAYLNDLNHNWWNSAKLSSEWLDNIFPEFYKQLNVPLNFYKRDYYQLIA
LMERQDIADEIVDKLDLIYEILK
>Cag_0507 hypothetical protein
MSREVKIHLISESKSLVNSFFKKTIHNLPNNDSLISSGITITSIKTSDLK
FFPQNSALENTVLHFWDLSIDSSIPQSIYPLFMTPNSVYLLLLDNFNQNE
KFWLKLIKTHGKSSPIMILKDKSKGVFSIEEKSLNLEFPLIDNQFINVNF
DDDDDKGIVDFMENFAQLLMFKQNKCIIELQNSWLLIKEAIFKETEQVKF
ISKKKYKQVCYDKGVYNQTDSELLFEYLKNLSIILFFKEIPFADIYIINF
SDNSSNLCWLIDGINRILTSKKINNGCLYWRDLDFMLEDDEEKNIYDTKD
LYYILELLILLNVCYEIDKGCYLFPNKMPSDFVLSLPTNRSQTCFIMQYN
YLPFDIISRLMIMMKKDIIDDQYWVYGILLKSHNFSKLHASINPEAVNDV
TALIIADPDNKQIRITVYGPDRYRRHYFQVIWNHLHDINKKYDDLEVKEL
VPLPDRPDKLINYQDLLGYELHNIKKYPVLQSYRNYLVSDLLDTVIDNKK
VNKKEIVINNIVNNQNSESQVEYEKLQKDVLKLINAISEKISTLPNNEDE
KKLKSILNNTSNDLENIESPTYKKQLRQFIEMLQSNPHITNLVKFIANGP
DKVNAIIDLYNHIM
>Cag_0564 Histone-like DNA-binding protein
MGNTTTKADLVAVIAHKTGLTKNETEAVVDGLFESIIESLKAGRRIEIRG
FGSFNIRQKNFRKARNPRTGESVEVDPKQVPAFKISKEFKLAVSESLKGG
DV
>Cag_1966 conserved hypothetical protein
MLYEDTDNVTPNGGQECPPSFSPSCLPFLNPDCEIAMTHHRLPHWQQGDV
WVFVTWRLADSLPKVTLDEWTETRKIWLSLHPEPWDEKTEKEYHQRFSLQ
RDEWLDQGCGSCLLKDTVNAKIVVDALLHFNGLRYQLASFVVMPNHVHVL
FRPFGKYSLSEIVKSWKGFTAREINKRLGTKGVLWQDGYWDRLIRNERHF
FKVVAYIRHNPINAIQKEGGHSCPPFQCFVE
>Cag_1347 TatD-related deoxyribonuclease
MFIDSHCHLSFPDFDADRNDVLQRLQAAKVSLLIDPGTDVTTSKNSIALA
QEVDCVYANVGLHPHEATQPIGDDVFAQLEALAHQPKVVGLGEIGLDYHY
PDCNASAQQAAFREMLRMAIRLDIPVVIHSRDAWSDTLRLLDEEQHSALR
GIMHCFSGDVAIAKECIQRGFKLSIPGTLTYKKSLLPEVVAQVALDDLLT
ETDAPYLAPVPHRGKRNEPAYVALVTETIARIRSLSVEDAATAIYRNTLS
VFEKINGNGLSVKIADNK
>Cag_0554 conserved hypothetical protein
MSIWMLHHKHHRHSIRLPEYDYSTCGAYFITICTQNRACWFGEIIDGEMI
LNNVGKMVKDEWLKTEQLRTNVQCGTFVVMPNHLHGIIVINETVGAIHEL
PLKMSQKQRRNMILPKIIGRFKMQSSKQFNQLHNTPGQQFWQRNYYEHII
RNEQDYHRIHDYIVNNPLKWECDSLHP
>Cag_0911 hypothetical protein
MIITPKVDETQEFIEIANDFSNPLDLVREAISNSFDANANKIYLSFDMVK
EYMDTNLRIRIVDDGEGMTLDGLQSFFDLGNSTRRGIDGTIGEKGHGTKV
YLNSSKISVKTIRDGKQYVAVMIEPIKKLYVREVPTVEVIESNVDELSGT
TIEIIGYNSNRRGKFTHEQLKDYILWFTKFGSFESFFEKKENSHKRLFLK
GLNASEYEEICFGHSFPNESQPVQRLFEEFLVSAPDYYCKRFVKRGQLKN
SPEISFEAIFSVEGNRVKLAHNTMIQRQGRPSIAGNYKVAERYGVWVCKD
FIPIQRKNEWVNYKGSEFIKLHAFFNCQGLRLTANRGSIDNTPSEVLSDI
QEEIKKIYDEITSSDDWTQLSWLEQEAESYKTTEKEKKDFEFRLKKANKA
NICEFENTIIVEPQRESGVYALVLQLKMLKPDLFPFFIVDYDTHSGIDVI
VKADDTQPIISSKLYYVEFKHYLTEEFNHSFVNLHSIICWDTTIKHNDIL
KDINGEERKMQIIPPESDGDYTKYFLDRPSSAHKIEVFVLKDYLKQKLGI
EFRPRTAKDIL
>Cag_0528 AP endonuclease, family 2
MKRVGAHVSASGGVEQAPLNATAIGAKAFALFTKNQRQWKAPKLSKATIE
AFQKACADGGFQPQHILPHDSYLINLGSPDPEKLERARSAFIDEMQRVAD
LGLQLLNFHPGSHLKEISEEASLLLIAESINMALEATNGVTAVIENTAGQ
GTNLGYRFEQIAFLIDRIEDKSRVGVCLDTCHLFASGYDLSSTEAIETTF
NEFDSTVGLHYLRGMHLNDAMQPLGSRVDRHASLGKGTIGMAAFTFIMNH
PACEEIPLILETPNPDIWSEEIALLYSLQQVD
>Cag_1993 ribonuclease HII
MHTHYEEPLWQHYEFICGIDEVGRGPLAGPVVAAAVVFPRWFQPTEALLT
LLNDSKKLSAKERESLVPAIKAQALHWALAEVQHNVIDEVNILQATMLAM
NNAVKALPIIPSLLLVDGNRFTTDLAIPYKTIVKGDSHVFSIAAASVLAK
VHRDALMCVYATHYPHYGFERHAGYPTSAHIEAIRQHGRCPIHRQSFKLR
QLGEKV
>Cag_1628 C-5 cytosine-specific DNA methylase
MKMQNNISAIDLFCGIGGLTYGLKKSGIQVKAGIDIDESCRYSFEENCGT
KFINKDIQKLQKEELNSIYGNAEIKILVGCAPCQPFSSYTYKKDKNKDKK
WQLLYDFSRLIKETKPAIISMENVPTLLNFKKAPVFYDFIQELTANSYKV
WFNIVYSPDYGIPQKRRRLVLLASKLGDIELLPPTHNPDNYITVKDAIGN
LEAIKSGETSQNDFIHKAAQLSEINLSRIKQSIPGGSWKKDWDDELKLVC
HTKEKGKTYVSVYGRMMWNEPSPTMTTFCTGIGNGRFGHPEQNRAISLRE
AAILQSFPADYKFAENEATLKFGKTSKHIGNAVPPKLGEIIGKSILQHLE
KYNYGKENK
>Cag_1529 conserved hypothetical protein
MEKFKGLYRIESARMQGWNYGWAGLYFITICTKDRVCWFGEMVNHKLSLS
DIGTIVEMEWRNTFEMRPDMNLYMGEFVIMPNHFHAIIGIGTNRYNIQYD
DHRRDAMHCVSTHHCVSNTPPKTTISSQSNNLASIVRGFKASVTKQARML
HVDFAWQSRYYDHIIRDEKSFHAISTYIINNPAQWAKDELYL
>Cag_1005 Protein of unknown function DUF48
MKKFLNTLYVTSQGAYLSKEGECAVISIEKEVKTRIPLHMLDGIICFGAV
TCSPFLLGHCAEQGVTVTFLTQYGKYLCQVQGATRGNILLRRAQYRIADN
EAQSAALSRSFVIGKIGNARITLARTLRDHPDKVDALRLKQAQHHLAECI
QHLQHETNQERIRGIEGEAAKAYFEVFNECITSPDSHFQFKGRSRRPPLD
RVNCLLSFFYTLLTHDVRSALEACGLDPAAGFLHKDRPGRPSLALDMVEE
FRSYIADRLTLTLINRGQIHANDFTVSETGAVLLKDDARKKLLTAWQERK
QEVIEHPFVKEKMEVGLLWHMQAMLLARHIRGDLDVYPPFVWK
>Cag_1017 HhH-GPD
MLKEFLITHNKELEIEKSLFSGQSFLWKKHQSNLDSFVTVMDKRLVIISQ
LSPYTIRVHCDSEVLYGQKISAFISHYFTLDVPFQKIFSSSFKSNYSEVW
RLLDGYKSIALLRQHPFETLISFMCAQGIGMRLIRQQINRLCERYGEFYE
AEMEGEMLCFSGFPAPEQLACLNAEELSYCTNNNRERAANIIAVARKVVE
GRLDLSSLSYPNMAFEEVQARLTQERGIGLKIADCVALFGLGYFEAFPID
THVHQFMAQWFKVPAASRSLTPATYRQLTLEAREILGSHYTGYAAHLLFH
CWRCEVKKLCWF
>Cag_0597 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_1882 Transcription-repair coupling factor
MKTQLASEESSVAVVRHNPSWLFDILRQSAPYNQLTSLLSKSNAQQKGDI
LLPLAGLYGSFSSLLAATLFADATAPLMVVCSSNSFERYENDLEVLLPKG
SLCNSADELSHTIEALATKRRSLLLSLFDDLDVPLCSPSEVESRMFHVTL
DATIGYDALKHFLTANGFEQRDFVEDEGEFSLRGSIIDVYPFGAAEPLRI
ELFGDTITSLRLFDSNSQLSGKNLQQATLTANFTTPNSPITLLDYLAPET
VVLVDDVAELIAQSDGKELLERLCYFRCLSINHAEVQALNFGGEAQQKLQ
GNFRTLATLLHTAHHEARQPLFAMSSKREIGELNDFLAQESSQEALPQSG
WLPVTLHSGFRFGSLDLYTESDIFGKMHTHKVHRKRKVRGISLKELQKLK
VGDYVVHEDYGIGIFKSLETITAGNSEQESVLIEYANGDQLFVNVQNIHL
LSKYTASENSSPTLSKLGSSKWAAKKEKVRKKIRDIAINLIKVYAQRKMQ
PGFAFAPDSIFMREFEAAFIFDETPDQLRAINDVKKDMQASHPMDRLICG
DAGFGKTEIAMRSAFKAVESKKQAAVLTPTTILAHQHADSFTRRFANFPI
SIAVLSRFVSRKEQLSLLKKIEEGKIDIVIGTHRLVSKDVHFKDLGLLVI
DEEQHFGVEVKEKLREQFPTIDTLTMSATPIPRTLQFSMLGARDLSIVST
PPKNRQPVETIITDFDAALIQSAIQRELQREGQVFFLHNRIAGLETIAES
LRELVPSARIVYAHGQMPTRELEKIMMDFMQQEVDVLISTTIIGSGLDIS
NANTIIINRADLFGLSDLYQLRGRVGRSERKAYCYLITPPMKTLKKDALQ
RLAVIESFTELGSGFNIALRDLDIRGAGNLLGAEQSGYIHELGFDLYQKM
LEETVAELKTNEFSHMFEEEGNKPLRQQKPCDLLFFFDALIPDYYVAATQ
ERFAFYNRIAKATRNEQLDAIASELCDRFGKLPEEVTNLLMITKLKLIGT
LLGLEKIDIQPQSTMLYLPDQASEHVAQRHYLQYLFTAVQAEWMAEYKPG
FKMEKKMKLQLHHPTHADTTSAGLMERYSALLHKVYEEAKSEVEAAMVG
>Cag_0002 DNA polymerase III, beta chain
MKILSSIRQLQEPVAKVAQAIPSKSVDGRYDNIHFTLEPNALTLFGTDGE
LSITAKIEVESTDSGHIGINARTLQDFLRSMYDTPVTLSIERQEISDHGM
VEVTTDKGRYKIVCLFESKPERYDKVYDITLDLPTSELLGLVQKTLFACS
IDGMRPAMMGVLFELEGTTITAVATDGHRLVRCRKESSLDIAEKQKIVLP
ARVLSILQKLAQHESITMCVSTDRRFVRFISGHMILDAALIVEPYPNYNA
VIPVEHDKNVVINRQSFYDSVRRVGRFSSIDDIRLILENDRLTVMAENTS
DGEAAQEELPCSYNGEPMTIGFNAKFVEAALAHLDDEEILIELKSPTTAV
IFTSSKIEDRDKLIILVMPVRINS
>Cag_0828 Primosomal protein n
MYARCVADRFFRGEPFSLVVPEAFCEELQAGCMVLLLSLKGQGLMSIGYV
LSLSPDAPPDMVNEELPSFEMVDLLNGSQPVLNGELLKLTSWIADYYLTR
PIDAIHTALPVAIRTTVHDVVEAAGFTLQAEPTKVMNTALRRSILKLLAT
NKQLTVTQLQRRLGKKQLYKTISQLEKGGYLTLSKKFSTKKPKYKSAYRL
TAPLQDGVLESVASAKKQHATLSTLADLYPETAFLNELEVSHAVIQVLLN
KGLVEKVQKRIESNFSSGYRESAQPAKKPTAQQQKVLNELCSASRQGHYQ
TFLLHGVTGSGKTLVYIEFLKEVLAAGKTAIVLVPEIALTPQTAGRFREH
FHHDIAILHSAMSLQEKYDAWHSLKSGRCRIALGARSTLFAPLENLGAII
VDEEHDGAYKQDRSPRYHARDTAVMRAMLSNAICLLGSATPSFESYQNAQ
NGKYHLLRMAERIDGATMPTISLIWMRESPRRTTSISEMLYQQIAQRIEK
NEQVILLQNRRGFAGSILCLECGHIPLCPHCNIPLVYHATHNHLRCHYCG
HTERYKAMCSACKSTGLFYKGSGTERIEEELQKLFPDEKILRMDVDTTAK
KGAHGRILREFHERKARILLGTQMVAKGLDFPAVTLVGVLMGDIGLNIPD
FRASERTFALLMQVAGRAGRAAIPGEVLIQVYNKESDVFTALLHGDYERF
FQQELESRRTLLYPPAARLIKFECSADDEVQAEAAATFCKEIVQQHLPEK
QGMVLGPAPACIAKIRNRFRYHVLVKLMLGKLSPLFIREMSDTIHSRFRS
ANVLLTVDVDPQSLM
>Cag_1099 DNA modification methylase-like
MNELQDESVHLIVTSPPYWQLKDYGTENQIGFHDDYETYINHLNLTWQEC
YRVLHKGCRLCINIGDQFARSTYYGRYKIIPIHSEIIKFCEIIGFDFMGQ
IIWQKTTTMNTSGGASIMGSYPNPRNGIVKLDFEYILLFKKQGTSPKPTK
EQKDNSVMTNEEWNTYFNGHWYFSGAKQDQHLAMFPEELPRRIIKMFSFP
NETVLDPFMGSGTTALAARNLNRNSIGYEINPTFIPIIKNKIGMDDVFMK
VETSVIKQPEITIDFNECVNRLPYQFIDTHKLDKKIDVKKIQYGSKIDSE
STGKREDFFSVKEIISPELLKLNNGLIVRLIGIKQNPAINGKATEFLFNK
VRGKKVFLRYDAIKHDKENNLMVYLYLENKTFINAHLIKNGLVLVDNSID
FKYKAKFNSLTNG
>Cag_1093 putative exonuclease
MFTFLHVADLHLDSPLKGLEEYPDAPLKQLRHATRRAFDNVVQMALDERV
AFVVVAGDLYDTDWRDYNTGLFFVSRMAKLREAGIPVIIVSGNHDAASQI
TRSLRLPDNVKILSHTHPESYLLEPYNVAIHGQSFATRFVRDDLARNYPQ
ADPSLFTIGLLHTSLETSGDVYAPTTLDLLRSKGYNYWALGHMHRHEVVH
RNPWVVYTGNIQGRHIREGGAKGCMLVTVENDAVVQTEWRAVDVLRWARC
AVLLEGCDSMEQVYHLVRERMEELRQQAEGRPLALRVQLRGATPLHHTLH
TKIGHVMEEIRAIAVSFGDCWLEKVELELSAPHAKSDLLGAASPLASLLE
AVDALELPDGSLTSLLPDFEKLRHKLPHELISDGDPFAPPADELEILRDE
VKQLLSATIEETIGGTTERAIRGSMGGTIGGRNA
>Cag_0708 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFALPALTHSK
>Cag_0847 Helicase RecD/TraA
MSVQDGQESYYYSPKERLSGAVERVTFHSQKNGFSVLRIKVKGRRDLVTV
VGATPSIAPGEFVECLGEWHNDSTYGLQFRATELTVVPPETIDGIEKYLA
SGMVKGIGPHFAKTLVYAFREDVFTVIEEEPERLLELPGIGQKRMEMVTS
AWADQKVIRDIMVFLQSHGLGTSRAVRIFKTYGNESILRVKENPYRLVLD
IYGVGFKTADALAMQLGIAPDSLIRAQAGVHHVLQEIASSGHCAAPREQL
VAEASRLLSIPEERTHEAIDAELRAGNLVREELRGVETLYLLSLHRAELG
VATSLMRLLEGEIPWRHLAIEEALPWVEAQNNITLSPSQKEALHTALTNK
VTVITGGPGVGKTTLVKSILLILQAQKVRVALCAPTGRAAKRLSESTGLE
AKTIHRLLEFDPLTGGFKHQRDNPLECDLVVVDESSMVDVVLMNRLLAAV
PEKAALLLIGDVDQLPSVGAGAVLADIIRSETIPTIRLTEIFRQAASSRI
IMNAHRINKGELPLRDESNTLSDFYLIAANTPEEIYNRLLTVITERIPAR
FGLHPVRDVQVLTPMNRGGLGARALNVELQKVLNGQVEPSVTRFGTRYAA
GDKVIQMVNNYDKEVFNGDIGHISAVEREDGAVLVDFDGTLVSYEFGELD
ELSLAYATSIHKSQGSEYPAVVIPLAMQHYNLLERNLIYTAVTRGKKLVV
IIGETRALAMAVKNHKAMRRLTGLAERLSALARYEANL
>Cag_0368 hypothetical protein
MHAKDVVSKDILKRIALDIARILLHLKVDHAELLETEHQRVEERRADVVV
LVQGESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAHKGYDIKQYLIYI
GKAPLSMPTGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDF
KGRSEREVVRYIIQRLQELTAENESRYHDYMRMLEILSANRSLEKIIEEE
EAMLSVVDQTRLPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHH
VARLDKLNIEQLEELSDALLDFNTVTDFDVWLENRKN
>Cag_1007 CRISPR-associated protein TM1801
MSTLNQKIDFAIIMRVTNANPNGDPLNGNRPRTDLDGHGEMTDVCLKRKI
RNRIMELKDKEQKYQFDIFVQPDDSKRDSHTSLKARFESEIGKNVKDKDD
AAKKACKKWFDVRAFGQLFAFDGEESSGLSIPVRGPVSIHSAFSVEPVNV
SSIQITKSVSGNEGKNGKRSSDTMGMKHRVDYGIYVTYGSMNPQLAERTG
FSDEDAKVIMEILPKLFENDASSARPDGSMEVVSVIWWKHGSKAGKHSSA
KVHKSLHVNEDGTYRLDDLEGLTPECINGF
>Cag_0042 hypothetical protein
MLQKLIIKRFRGFSTLEVDIPKVLLLMGPNSSGKTTALHAIRISCQAAWI
AVTNNIAWKVEDTVIIFKDFIIRDISQLMPIADWQALFVNQIVGEHTHFS
IEIIFEKTDALSSILIEGKYARNENLKITATIGAETLINNLKNISNRSSQ
YKNIAFEFFQKHLPKAILIPPFYGVIRDEEYRAKAVVDAMVGSADQSHVV
RNMISRLSTTQLEQLNAFIKDMVGATLVQRTQGDDIEKISPLRVTFRDTN
GELELSAAGAGLINLIALYSSLARWESETIDRQIIFLLDEPEAHLHPRLQ
GYTADRLATIITNDFNAQLIMATHSIEIINKIGERDDATIFRTDRLNKEK
GGQQLIGQTPLLDDLSQWADLTPFSIINFLASKRILFYEGKSDGIILTKC
AEILFRNNPDKKKKFEKWTLIQLEGSGNKNIAQLLAHLIDSSTFASVAEK
KDFKIVVQLDKDYNDEVEQLKLITNRDISTFYNIWSKHSIESLFCESATL
YQWLKPKYPDIQEETIEKAIIAANQDNELNQYAREQRQATLLKPLQKISE
NITATNRQADNDIAATPEIWQRGKDRSKVILHHIKTALSTSANSLSTSLT
KVIEKADVNLFPAGNRAVVPSEIKQLLDWMVTNA
>Cag_1004 Protein of unknown function DUF196
MMVLVTYDVNTESPDGKRRLRRIAKTCQNYGQRVQFSVFECNVDPAQWTK
LRAKLLREMDPNRDSLRFYFLGSNWQNHIEHEGAKEPRDLEGVLIL
>Cag_1796 Ankyrin
MKILEYTGFDSSSVAESYRKVATALAQGDFRAAQVKKLVNLTHGKFYRAK
LDAANRLLFTFVRYGDEVCLLMLEVIMGHNYHKSRFLRGAPLEEEKIPDV
DASEALNDAEQLRYLHPNHTEIHLLDKPISFDDAQQAVYLHKPPLIIVGS
AGSGKTALMLEKLKHVEGEVLYVTHSQYLAQNARNIYYAYGFEHPAQEAH
FLSYREFVESIRVPTGREATWRDFAAWFYRMRSNFKEIDPHQAFEEIRGV
ITAPEDGCLSRKNYLQLGVRQSIFSKEQRSILYDLFLKYRHWLTDSGLFD
LNLIAHEWKASPRYDFVLIDEVQDMTVAQLSLVLKSLKKAGHFLLCGDSN
QIVHPNFFAWSHVKTLFWKDPNLAGKKQLQVLTANFRNGREATRIANQLL
KLKHQRFGSIDRESNFLVEAIGGAEGQAQLMADTDATKREFNKKISHSTR
FAVLVMRDEEKQEARKYFSTPLLFSIHEAKGLEYDNIVLFRFVSSCRREF
NDIAEGVSLTDLEAIDSLEYCRAKEKGDKSLEVYKFFINALYVALTRAVK
NLYLIESDTKHRLFELLGLAVAGKVEVAAEESSLEEWQKEARKLELQGKQ
EQAEAIRRDILKEVPPPWQVCNETRLDELIHKVFKEKAPGNKFKQQLYEY
ATCHVEPVLAQALEKQTDYRSPHGSFWEHLDTIGRKSYLPYFSQQTKAIL
RQCEQHGPNHRLPMNQTPLMAAAAAGNIALTEALLERGADPTLNDHYGYN
ALHWAMRQAFRDNRFARTTFGTLYERLAPAAVDISSGERMIRLDRHLAEY
LLFQTCWVLFKSRFTTLELNGEYPAFDTSLILEAWEHMPDNVVPTERKRR
TYLSSVLARNEVSRNYAYNRSLFERLATGWYQFNPALHVRTSVTEEGQSP
WIPIFQAVNLPLISKFCHSHTIATIVQCFRKACMAVIPELEAEIAQQQAT
KAAKEQHLQTLVKQVKKKITPSSDSLAAKLLKQHKLSKKLDDELLVPFLK
FVREKELEEIRQQKMKKKLEREERQQIKAAEQAKRDEQVQQQLGFDF
>Cag_0893 DNA polymerase III, alpha subunit
MDFIHLHTHTHYSMQSSPIFPSELFKAAKAFGMPTIAVTDYGAMFNMPEL
FSEAKQVGIRLIIGSEVLLLEHDEHQTSRHTVSPSLVLLVKNETGYRNLC
ILLSRASREGFVNGMPHVESRLLEQYHDGLLCLSAYGAGRIGRALMAGSL
DEAANFSAYYQEIFGSNFYLELQRHNTSFDALLNEQIIGLAQKFSIELVA
TNNVHYLRQNDAGCYRALVANRTKEKLSGPVSAALPGSEHYLKSAEEMQQ
LFSNEYGELENTLRIAEQCTFTFSDKEPALPRFPLPDGFSDEASYLRHLT
WEGAAEKYAKSEEEGISQEEVKERIELELGVIEKMGFSSYFLIVSDLIAA
SRRMGYSVGPGRGSAAGSIVAYLTGITRVDPLRYKLLFERFLNPERLSMP
DIDIDFTPVGKQKVLEYTVEKYGADSVAKVIAIGTLGAKAAIRDAGRVLE
VPLPLVDKLAKLVPTKPGITLEKALTDSRDLREMAESTPELKTLMQYARS
MEGRARNVSMHAGAVVITDGALEEQVPLYVSNKIETEERRFADELDLDQP
DNGKAKAGESNDEKQVVTQFDKNWIETAGLLKIDYLGLETLAVIDETLRM
IKRRHGLDIDLEKVPMTDRKTFRIFQEGKMAGIFQFESSGMQSYMTRLQP
TQIGDIIAMSALYRPGALNARVDEHRNAVDLFVDRKHGREAIDYMHPMLE
GILKETYGVIVYQEQVMQISQVMGGFSMAKADNLRKAMGKKKPEIMEKFK
ADFIAGGVAQGVHDTLATRVFDLMSEFAGYGFNKSHSAAYGVLAYWTGYL
KAHYTIEFVTAVLNSEIGDTERMKHLTDEAKSFGIATLPPSINKSDALFS
VENSSNGRSAIRVGLSAIKQVGGAARAIVTSRMRRKRDFLNIFDLTASVD
LRVMNRKALECLILAGACDDFDPHRARLLANIDKAIKFGQMQNRTVTMGQ
CGFFSNEEGQEGDIHYPELDNADMMPDGEKLLHEKKLVGFYLSRHPLSAY
RRDWQAFANLPLNTKEIVKNKQYKVIGVVVSLKPYQDKKGKQMLFGAIED
FTGKADFTIFASVYEQFGHLIKPEEVLMLVVEAELGGGMLKLLVREVLPI
KKVRKSLVKKLVLTIDADEQGQLDKLSSIKELFNKHKGGTAVEFEMKAQA
GDNIETLTLFARATPIEPEEELIEQLELLLGPDNVRIAG
>Cag_0362 Histone-like DNA-binding protein
MSKAELVEKIAKQADLTKADAERALTAFVDVMTASLKAGDDVALVGFGTF
SVGDRAERQGRNPQTGETITIAAKKVVKFKPGKALKDEIGG
>Cag_1580 transposase
MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HNLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEYDSRIWRIIHHYLDEVLEQQ
DLSSVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPEAHITFDKFHIVQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGGKF
EFALPALTHSK
>Cag_0181 conserved hypothetical protein
MVKTIMAMKIDKPKFNPNIHHRRSIRLQGYDYSQSGFYFITIACQDRICR
FGYVENGEMVLNKYGIVAYNEWVRLRTRFPNIELDVFQIMPNHMHGIIVL
NEISVEDVGAGFTPAQNNALSNIRAGASPAPTVSEIVGTYKSLVANGCLK
IYTTKNETMGKLWQRNYYEHIIRNEQSYQSFSEYIINNPAKWEDDTFYVI
>Cag_1608 DnaB helicase
MISKKAAPIIDFSKDIDFSQESRIPPYSTEVEQEVLACVLLEDEPIEQVI
QIFGESSEEVFYERRHQTIFRAMMQLYHKRQAIDIITVSEELLRMGELEV
VGGRHYLAELSGKVISAANIEYYARLAKEKFLYRRLISIATKISGVAYNS
SMDIFDLVEHASQQFFTISQAGVKKKASPIKELVKTGIRMLENLRASQSS
VTGVASGFSELDQFTAGFQPSDMIIIAARPSAGKTAFSLALARNAAVDFN
TPVLFFSLEMAEVQLAIRLMCAEAYVESQLVRTGRITPEMMGRIINSMDK
LNEAKLFIDDTPGISIMELAAKTRRMKQEQNIGMVVVDYLQLVTPVRDGR
TNREQEIAQISRSLKALAKELNIPIIALAQLNRSVEQRSGDRRPQLSDLR
ESGSIEQDADVVMFLSRPEMYGKNTFEDGTSTKDIVEIVIGKQRNGPIGD
IRLLFLKNYGRFQSTANVYITANAEAESAPQAEPERYLQPSQEFPPPASG
GAFIAQDDAPF
>Cag_1392 putative type II DNA modification enzyme (methyltransferase)
MNNLLIHGDNIAGLDYLLHQKQLKGKIDLVYIDPPFATGGNFTITNGRAS
TISNSRNGDIAYSDKLTGDDFINFLRKRILLLRELMSEKASIYVHIDYKI
GHYVKIMMDEVFGIDNFRNDITRIKCNPKNFTRIGYGNIKDLILFYTKSS
NPIWNEPTEKYSENDIVNLFPKITTNGRRYTTVPIHAPGETVNGKSNKPF
KGMLPPQGRHWRTDVITLEHWDKEGLIEWSSTGNPRKIIFADEREGKRVQ
DIWEFKDPQYPIYPTEKNSDLLDLIITTSSNPNSIVLDCFCGSGTTLKSA
HFLQRQWIGIDQSPHAIEATINKFSDIKADLFIESPQYDFIALTDELINQ
S
>Cag_0457 DNA recombination protein, RuvA
MYAYFRGTLISFTADEAIIELQGVAYHFLISATTSRQLPNSGSEVQLFAH
LYVREDAMLLYGFYSEEERQLFRLLLQASGVGPKLALSVLSGLPVHEVHD
AIVSNIPERLYGISGVGKKTAARIILELRDKILKLSPVLPTATARRPHNA
AQQLRDDAITALVTLGFPRAAAQKTVTSLLDENSNCTVEEVVKSALLLIH
NAQL
>Cag_1137 HhH-GPD
MEDWLPSKRQIEQLQAKVFAFYGEHGRSFPWRNTTDRYAVMVSEVMLQQT
QAERVVERFEAWLVAFPTVQALADAPLREVLALWSGLGYNSRAERLQRCA
QTIVADFGGVVPALPEVLLQLPGIGAYTSRSIPIFADNFDVATVDTNIRR
IVLHEFGLPETLKPRELQMVADRLLPHGQSRKWHNALMDYGALHLTSQKS
GIRPLTRQSKFQGSRRWYRGQMLKALLKTEALPLEALEATWADSPYCLRD
IASDLVREGLVEYHPSASADDSPLLRIRGSG
>Cag_1123 conserved hypothetical protein
MNQYNPNIHHRRSIRLKDYDYTQVGLYFITICCQDRTCRFGRIENGEMIL
NEHGKIAHNEWMKTREIRPNVELGEFIVMPNHIHAIIRFLRRGELHSPNN
NVVFDTPLPFDNGGVFKTPNNTGECNSPLRSPSQTVGAIVRGYKSSVTKQ
LGLMGFTEKLWQRNYYEHIIQNEQSYQTISEYIINNPAKWQDDKFYVE
>Cag_0086 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_0224 DNA polymerase III, delta subunit
MEKLKKAIQSKKIAPIYFFTGSESYLKEEFATLIQGALFASEEDAVANTH
LLHGHDMTLRELLSRASEYPMFTERQLLVVRHFEKIKKPTTKEQQKQQYA
AFGNYLANPATFTVLLLDADELDKSDFEKQPFSLLKSVRHDFPAIKHPDL
FASERAAQAGWEFEPDALKAFAAYIDPSSREICQELDKIILYASERQSAK
RITAADVLDCVGVSRTYNVFELEKALVARNLRLCSGMSLMIMDQEGQKEG
LMAIVRYLTTFYMRLWKISMPEVQRMAQSDIAKVLGMSPRQEFLIKSYLT
YTRQFSLQQTEAALCALRDVDASLKGLRPYSDEKYLLLQLMQRLLG
>Cag_0314 Excinuclease ABC, B subunit
MENRTDNEYQLVSPYQPAGDQPKAIEALVQGVRDGRHWQTLLGVTGSGKT
FTISNVIAQLNRPVLVMSHNKTLAAQLYGELKQFFPHNAVEYFISYYDFY
QPEAYLPSLDKYIAKDLRINDEIERLRLRATSALLSGRKDVIVVSSVSCI
YGLGSPEEWKAQIIKLRAGMEKDRDEFLRELISLHYLRDDVQPTSGRFRV
RGDTIDLVPAHEELALRIEFFGSEIESLQTFDIQTGEILGDDEYAFIYPA
RQFVADEEKLQVAMLAIENELAGRLNLLRSENRFVEARRLEERTRYDLEM
MKELGYCSGIENYSRHISGRPAGERPICLLDYFPEDYMVVVDESHVTLPQ
IRGMYGGDRSRKTVLVEHGFRLPSALDNRPLRFEEYEEMVPQVICISATP
GEHELMRSGGEVVELLVRPTGLLDPPVEVRPVKGQIDNLLAEIRHHISIG
HKALVMTLTKRMSEDLHDFFRKAGIRCRYLHSEIKSLERMQILRELRAGD
IDVLVGVNLLREGLDLPEVSLVAILDADKEGFLRNTRSLMQIAGRAARNL
DGFVVLYADVITRSIQEVLDETARRRAIQQRYNEEHGITPRSIVKSVDQI
LDTTGVADAEERYRRRRFGLEPKPERVLSGYADNLTPEKGYAIVEGLRLE
MQEAAEHMEYEKAAYLRDEITKMEQVLKKDG
>Cag_0876 DNA photolyase, class 2
MLHSPVDPRRVRLLNHHIDGNGVVIYWMSRDQRVRHNWALLFARWKAAML
QQPLMVVFTLAPSFLGAPLRHYDVLFNGLQEVETELRALNIPFMVLQGEP
SEELPRYAMHHNASMVVADYSPLHLTRCWKNQVAEALSVPLYEVDAHNIV
PCRVASPKQEYAARTIRPKINKLLGEFLTPFPELEALPQPLTEPPVNWQK
LRSHFHADASVAPVGWLTAGEAGAHATLQCFVQQKLNGYATQRNDPSLEA
TSRLSPYLHFGQISTQFVALQVKAAHAPQEDKDAFLEELIVRRELSDNYC
HYNASYDRLSGIPAWAQETLARHATDPRDYIYSHEAFEQAKTHDPLWNAA
QHELLQSGIIHGYMRMYWAKKILEWSRTPEEAFEIALWLNDRYALDGREP
NGYVGVAWSIGGVHDRPWRERPVYGTIRYMNANGCARKFDVKRYIHNVTS
RRPQQVGLF
>Cag_1559 transposase for IS1663
MKSNSLSVKTSPVKYSFGLDVSKAKIDVSFCTLDDQQQVKVYGSHSFSNT
NKGFVELLLWCHKKCKETLPTVYILEATGVYHEHVAWFLHDHDCAVSIVL
PNKACHYKKSLGLRSKTDSIDAFGLAKMGAEQNLPIWETPDKTLRELRII
TRHREDLVTDKTIILNRLEAFEFCHNGSALMIKQLKKQLSLIEKQIEEID
QLVKETVEENAELKARFDKILAIKGVGLITLATIISETDGFSLITNQRQL
TSYAGYDIIENQSGNHTGKTRISKQGNSHIRRILHMPAFLVVKYEPQFAN
LFERVYERTKIKMKAYVAVQRKLLILIYALWKNGTVYQSTAQPIIASKLC
A
>Cag_0682 transposase
MMHPSPDHMVHYGVEGNCECGLALSESAISIGECRQQWDIPAPRIEVTEH
RQLIATCRCSKVHKGEFPSSLPPYISYGARLKAYTVGLVQGHFISLARVT
EIVSDQYGVKPSDGSVQRWISQASKNLTTTYTAIGETISNSAVAHFDESG
IRAQGKTQWLHVAATTEAVYYTAHAKRGQEAMSAAGILPLFNGVAVHDHW
KPYFRFDHVLHSLCGAHLLRELNAFDETLQHRWPVQLKQVLIDAKNAVAQ
AKKAKQTSLPPEQIADLKQRYEQWLNYGLLIFSERPKINKQQGKGKQHPA
RNLLCRLRDFKDSVLRFIERFDVPFDNNTAERAVRPVKVKLKVAGGFRAM
GGAEAFCVIRSVWQTDKLQQQNPFETLRLVFR
>Cag_1992 Protein of unknown function UPF0102
MNPPNSTCELGRQGEALAATYLQNEGYQILERNYRFRHNEIDLIALDGST
LCFVEVKARLSNKAGSPLDAVTVAKQREIIRAAQAYLTFSGQECDCRFDV
IGVNVHAMHEARISSFTIEHIKDAFWVEQ
>Cag_1513 DNA repair protein RadC
MKLHDIDPDNRPRERFLQHGAAALSPAELLALILRSGSQQYNILDTCHHI
INRFSLEKLSDVSLKELQQIKGIGESKAMQIVAIFELNRRLHYSRNQLRK
IMAAGDVFEYMSGRIPDESKEHLFVLHLNTKNQVIKNELISIGTLNTAVI
HPREIFKSAIRESAHSIIVVHNHPSGDVNPSNADKKITNELKQAGAFMQI
EMLDHVIMSKTEWYSFRERGLL
>Cag_1116 DEAD/DEAH box helicase-like
MSEQQLPLENNFFSLQLPELLMKALEEVGYESPTPIQAQTIPFLLAGRDV
LGQAQTGTGKTAAFALPILASIDIQQAEPQALVLAPTRELAIQVAEAFQR
YAEYLKGFHVVPIYGGQDYGIQFRMLRRGVQVVVGTPGRVMDHIRRGSLN
LTHLKTLVLDEADEMLRMGFIDDVEWILEQTPAGRQVALFSATMPPPIRR
IAQKYLDQPAEVTIQTKTTTVDTIRQRYWVVGGSHKLDILTRILEVEPFD
GMIIFSRTKTMTIELAEKLQARGYAAAALNGDMPQNQRERTIEQLKNGNI
NIVVATDVAARGLDVERISHVVNYDIPSDTESYVHRIGRTGRAGRAGDAI
LFVAPREKNMLYAIEKATRSRIEQMVLPTTEVINNKRIAKFNQRISDTIA
AEDLGFFTRMIEQYCNEHNVPMLDAAAALASLVQGETPLLLADKPERSRS
SERDSYGSSRDRGFEREGRDSRSGGREGRSDRFERDGRSGRDDRGGRDER
SAPRKRGRSEVYGEEPKDRYRLEVGSTHGVKAGNILGAILNEAGLAPESV
GHISISDTYTTIELPKQMPDTMFHELRKIRVCGRQLRLSRMEEHEGGHST
HSSHGTGAYGGGAKKSFRKPNKSANDEGEFFAGFKKKRKG
>Cag_0626 ATPase
MTESAIQPDLFGFSTPSSSVTSTTEKSSRFVPLAERVRPRMLDEVAGQQH
LVGANAPLRRFLESGQMPSVIFWGAPGCGKTTLAEICASTLQCHFEQLSA
VDAGVKEVRKALDIATRVRQAGQRCLLFIDEIHRFNKSQQDTLLHALEQG
LILLIGATTENPSFEVNGALLSRMQVYTLKPLTAEELEQVIRRALATDAL
FRERSIELADLEVLWHYCAGDARKALNAIEAAFALFPTNQSSVQLTREHF
EAALQQKAPLYDKSGENHYDVISAFIKSMRGSDPDAALFWLARMIEGGED
AKFIARRMVIFASEDVGNADPYALTLALSVFQAVSVIGLPEARINLAQGV
TYLASAPKSNASYQAINEAMAEVKSTTATTVPLHLRNAPTKFMKNEGYGA
GYCYPHNYPSHFVEQHYFPEGMEPKAYYRPTAEGREKMAQERLHQLWKER
YRK
>Cag_1802 Methylpurine-DNA glycosylase (MPG)
MEPLPKQFYQCSTIELTEKLLGKCFVRILPNGTRLAGRIVETEAYLGEGD
EACHAWRSRTPRNEIMFREAGTLYVYFTYGAHYMLNIVSEPEERAGAVLI
RAMEPLEGIEFMQQQRNTTKFPNLMSGPGKLTQALAIERSCNGRTLFDGE
FFVADAPAIPSHQIGTSGRIGISRSTELPWRKFIMGNAHVSGGKVGGVVS
SLQ
>Cag_1363 serine/threonine protein kinase
MRKRLFIKKQKFDKWELKRFLGGGGNGEVWECCDEEGNKGAIKLLKHVKS
KSYARFCDETKIMEQNFDIEGIIPILDKFLPEKLDGSIPYYVMPMAESAE
KVFKAKNIVSKIDSIIEICKTLAKLHERGIAHRDIKPPNLLVFNSRLALA
DFGLVDYPDKKDISLQNEEIGAKWTMAPEMRRESSKADSLKSDVYSLAKT
IWIILTENPKGFDGQYSIDSIIELKRFYNKTYTTPIDNLLTKCTDNDPNQ
RPTVNEIILELENWKVLNKDFHERNQEQWFEIQTKLFPMTFPKRVIWENI
EDIVKILKVVCTYDNLNHMLYPNGGGMDLEDVRLSHEKSCIELDCQLINI
VKPKRLLFESFGYTAEWNYFRLELYELEPSGAYENDEYYENIQYYEEYDG
YVSPENTMQLLRWFRGSFVIFNKRSVYNRISSTYDGRHNKMNTEEFRDYI
QEMVSHTIEMNKKKSAMATIESKRRKTR
>Cag_1778 DNA polymerase III, delta subunit, putative
MSWSSIIGQQQQLRVLQHALETGRFAHAYLFMGAEGCGKEAVAFEIAALL
NCRNASASPQVGACNTCPDCEKVHALNHPNVEYIFPVEAVLLEGGGDLAK
KENKRFTEAKERYDALIERKKENPYFAPAMERSMGILTEQILSLQQKALF
MPSVGSKKIFIISQAERLHPSAANKLLKLLEEPPEHVLFILISSRPEALL
PTIRSRCQAVKFSRITTMQLREWLAQHRPDIVEPERSFVVNFSRGNLRLA
WDLLSNRSSDMAEAPALQLRNQALDYLRYVLTPNRFHEAIVACEQYAKSL
SRRELTLFLAALLLFFQDACHRRINPSVADLNNPDLSDNVNRFAKNFPNT
NYFALSQAIEDAISSLERNVAPLLVMATLTTELRQQLQRRG
>Cag_0890 Methylated-DNA-(protein)-cysteineS-methyltransfe rase
MPTTQPPPHKSRLLVQPTAIGRIAIAERNGNIVQLLFEGERVPFVYEEGE
SALLLEAFQQLDEYLLGKRTNFTLSLAPMGTPFMQAVWKALTTIPYGTTL
SYGALAVQLGSPKAARAVGMANHRNPLPIFLPCHRVVGSNGRLVGYRGGM
ALKQQLLELERRVVGNTALHL
>Cag_0145 DNA mismatch repair protein
MPIITRLPDSVANKISAGEVVQRPASVVKELLENAIDAGATKISVTIKDA
GKELIRIADNGVGMNRDDALLCVERFATSKIKSADDLDALHTLGFRGEAL
ASICSVSHFELKTRQADATLGLLFRYDGGSLVEELEVQAEQGTSFSVRNL
FYNVPARRKFLKSNATEYHHLFEIVKSFTLAYPEIEWRMVNDDEELFNFK
NNDVLERLNFYYGDDFASSLIEVAEQNDYLPIHGYLGKPALQKKRKLEQY
FFINRRLVQNRMLLQAVQQAYGDLLVERQTPFVLLFLTIDPSRIDVNVHP
AKLEIRFDDERQVRSMFYPVIKRAVQLHDFSTNISVIEPFASASEPFVGS
SSQPIFSSTSSQAPRMGGGSRRFDLSDAPERAITKNELYRNYREGAFSSP
SVASYDAPSPLQQGGLFALASAEESLFGAQAVHEASENIEAFQLSPLDNI
VEHKEVEPKIWQLHNKYLICQIKTGLMIIDQHVAHERVLYERALEVMQQN
VPNAQQLLFPQKVEFRAWEYEVFEEIRDDLYRLGFNVRLFGNRTVMIEGV
PQDVKSGSEVTILQDMITQYQENATKLKLERRDNLAKSYSCRNAIMTGQK
LSMEEMRSLIDNLFATREPYTCPHGRPIIIKLSLDQLDKMFGRK
>Cag_1189 RecR protein
MRFPSVALDTLIDEFAKLPGIGRKTAQRLAMYILHEPKIEAEQLAKALLD
VKEKVVRCTICQNITDVGTDPCAICASKARDRTVICVVESPVDMLAFEKT
GHYKGLYHVLHGVISPLDGVGPDDIKVRELLARIPVGEASGVREVVLALN
PTIEGETTSLYLARLLKPLGIAVTKIARGIPVGAELEYVDEATLSRAMEG
RTVV
>Cag_0241 RecA DNA recombination protein
MTMDNPKVEQAGHAVDSAKLKQLNLAVDALEKQFGKGTIMRMGDGSAGLT
VQAISTGSMALDFALGVGGLPRGRVTEIYGPESSGKTTLALHVIAEAQKE
GGITAIVDAEHAFDPSYARKLGVDINALLISQPESGEQALSIVETLVRSG
AVDVVVVDSVAALVPQAELEGEMGDSSMGLQARLMSQALRKLTGAISKSS
TVCIFINQLRDKIGVMYGSPETTTGGKALKFYSSVRLDIRKIAQLKDGDE
LTGSRTRVKVVKNKVAPPFKMAEFDILYGEGISALGELIDLGVEFGVIKK
AGSWFSYGTEKLGQGRESVKKILREDPVLYQKIHMQVKELMTGHTEIISS
PTE
>Cag_0548 DNA primase
MSMIPPAIIDEVRQAADIVDVVSDYVALQPSGRNYKALSPFTQEKTPSFI
VSPDKQIYKCFSTGKAGNVFSFIMEMEKVPFMEALKLVAQRAGIDISRYT
EPKGKQEGEEEQGSGAALRWAARMFHSLLKQPAGAEGWRYFVEERGLREE
TINRFGLGYAPESWDFLLREARREGIKSEQLVELGLLVSHREKQSLYDSF
RHRVIFPIFSRGGQVVGFGGRALVSDERSPKYLNSPESAMFAKSKLLYGL
HFAKNEIRRQERAILVEGYMDVLALHQAGLTNAVASCGTALTRYQAKMLR
HYSEHVLFVYDADKAGQKSMMSGIDILVSEQMVPQVLMLPEGDDPDSFVR
REGRQGFLQYAESHTMGFQDFQLAFFEAAGAFSTPEQKAEALRVMVRTIA
LIPKRAQRELYAQELSKKVGLTVTALRELLGNATSAVAKQQSCTPSKASA
TAPTSSSATNATSAPTIPHAPNLPNAQALPPLSVLEKTFLKALLESTQYG
TAVLGFAASHQSMLELRHPLAQEIFAHLIHRYHNIAADPEATIDMVSEIS
AFTNPETRDLASTLLLDPPISPKWQQQNDLFSEQARRCLAMFLDAFKNLV
LEPLLDEKNKLMEQIRVEENVEREIELSRQKIVLDKKIREENRSLQQMIK
AILDSTQQVG
>Cag_1509 conserved hypothetical protein
MANNLLLPNQRDDYDSPWKEAIELYFPEFMAWYFPNAYAAIDWSKPYHFL
DQELRSILPEAENGKRIVDKLVQVHLLDGKERCLYIQIEVQGNRETDFPR
RIFICNYRIFDKYGKPVASFVILTDSDSSWRPTAYSYEFAGSKMTLEFDM
VKLLDFEPRMKELLASDNAFALVTAAHLLTQKTREKSLERLDAKSQLIRL
LYNKQWTKERVRELFRVIDWFLELPKELEQQLRTEIYNIEEEQKMKYISS
IERYAMEKGILEGMERGMVAGKEVGVLEGMERGLEEGLLKGRLEVAQRLV
ASGMSKAEAASFAGVSVEML
>Cag_0164 Crossover junction endodeoxyribonuclease RuvC
MIVLGVDPGSLKTGYGVVQHHNGSFSVLAAGVIRLQAAWSHPERIGIICR
ELEQVIAEFQPERVALETAFLSHNVQAALKLGQVRGAVIGLVVRYALPIY
EYAPREVKSAITGKGAATKEQVAFMVSRMLSLHTVPKPHDVTDALGIALC
DILRGESRQSGVPPRTNSRRKSGTGGSWEQFVRQSPNVVVRS
>Cag_0848 Histone-like DNA-binding protein
MGNTTTKIDLVTTIARNTGLTKYETEAVVNCLFESIIESLKAGRRIEIRG
FGSFNIRQKNVRKARNPRTGEKVMVESKQVPSFKISREFKLAVSESLKSS
EL
>Cag_0095 Single-strand binding protein
MAELKMPEINSVIIAGNLTKDPVFRQTNSGGTPVVNFSIACNRRFRDSNH
QWQEDVCYVGIVAWNKLAESCRDNLRKSSAVLVDGELQSRTWKAQDGSSR
TVVEIKARRIQFLNKKHKNGEDDVEGFIEDECPDQHHETLQDEDADYLYD
CK
>Cag_1402 DNA modification methylase-like
MKFPDDYINTIICADSLTVMEQMPDKCIDIAVTSPPYNLKNSTGNGMKAN
TKSGKWAGNALQNGYSHYNDNIPNDEYAEWQYNCLKAMYRLLKDDGAIFY
NHKWRVQNGLIQDRTDIIRDLPVRQIIIWKRKGGINFNPGYFLPTYEVIY
LIAKPSFKLLPKANAYGDVWEFTQEMKNNHPAPFPVALIDRIISSTSAQI
ILDPFMGSGTTAVAALQLQRNYIGIDISPDYCEMAKERILNLNPAKRFIK
KNGLETISLFEKIV
>Cag_1148 hypothetical protein
MRHQGASILLYNQQHEVLLVLRDNLPFIACPNTWDAPGGHLDAHETPLHC
IVREMMEEMELDVSTCSHFKSYEFSNRTEHIFTMQTDVLNTATTPLHEGQ
MIRWFTVADALQLSLASDMEVVLHDVGIWLEQQNNGTEDCGNV
>Cag_1527 conserved hypothetical protein
MNHEKYNRRSIRLKGYDYWQVGAYFITICTQNKECLFGKITDGKMVLNDA
GNIIQEFNAITESHFKNIAISPFVVMPNHYHAIITVGAGSPRPNNPHNEN
DHICDDGRVRVDDGRVRVDDGRVRVDDGRGNPAPTLGQIVGYFKYQTTKR
INTICQTGGKKLWQRNYYDHIIRDEKSFHAISTYIINNPAQWAKDELYL
>Cag_1415 DNA helicase II
MSNFLHDLNEVQRSAVEATSGPVMVLAGAGSGKTRVITYRIAHLINNEGI
APRNILALTFTNKAAGEMRERVDTLLHHGASRGLWIGTFHSIFARLLRNS
IDRIGYDRNFSIFDADDSRSLIRQSMAELDISADAVPLNTLQSIISRAKN
SFVMPAEFQRNANDYNQQKAAQVYSLYCKKLKENNALDFDDLLIKPLELF
NAHPDVLHELQELFRYIMIDEYQDTNRVQYLVAKMLGARHRNIFVVGDDA
QSIYSWRGADISNILNFQDDYHDAQTFKLVENYRSTGNILKAANSVICRN
QRQIKKELVSHRHAGEPLTVMEAFNERNEAEKVADRIRTMRMSGTNDYRS
FAIFYRTNAQSRVLEDIMRQQRIPYRLFGSVSFYKRKEIKDAVAYLRFIV
NERDSESLLRIINFPPRKIGDVSIAKLRDFAEVRHISLYEAIHRAAEAGF
PARLLNALASFTSVIEALREMATRGTVYDVLNELFTLTSIPLLLQAENTP
ESLARHENLQELLSMARDFADHNPDGGSLGDFLENISLASDYDETQESDN
YVSLMTVHASKGLEFPVVFITGLEERLFPLHTYEPEELEEERRLFYVAMT
RAQEKIFLSYAKSRYQYGQLHQSIASTFISEIDASIVQSEGGRLLSDRRA
PREATTPQNHAAAPAMRRPTTSGMAPSSASSPTAESSSPSISNGTLVHHP
LFGQGVVLEVQGKGSKQKVRIAFRNAGEKTLMVQYANLKIQTS
>Cag_1609 hypothetical protein
MGKLKAHQKLPAEWNEREILRDKGQFWTPSWVAEAMVAYVTENTDLVFDP
ATGRGAFYEGLLKLNKQNISFLGTDIDPDVLSDEIYNKENCFVENRDFIK
DPPNRKFKAIVANPPYIRHHRIDEATKILLKKIAISITGNSIDGRAGYHI
YFLIQALNLLEKDGKLAFIMPADTCEGKFAKNLWEWISEKFCIECVVTFD
ERATPFPNVDTNAIIFLIKNTKPQQTLQWIRANQAYSDDLLQFVTSNFKL
IEFDSLEITTRQLKEGLTTGLSRPEQNHNGFKFHLNDFANVMRGIATGSN
EFFFLTSEQVKELNIPKDFLKRAVGRTKDASESVLSLKNIEDLDRENRPT
YLLSINGQESFPKPISDYLKVGEEMGLPTRSLIQQRKPWYKMEQRKVPQI
LFAYLGRRNTRFIKNEAGVLPLTGFLCVYPIYDDQEYIDNLWQALNHPDT
LENLKLVGKSYGSGAIKVEPGNLNKLPIPEHIVANFNLKRPYKNAYEQLE
IFREPKTKYGLKKRKTAGNKC
>Cag_1969 Recombination protein O, RecO
MYRIESRGAVYNTIQQYIKVHSVIVKTRAVVLRETNFRDQSRICSLYSRD
FGRLSVIIKGARNPKNRLCGLFSAGNILDVVIYRKSGRELQLASDATLVA
SPLMAEPDMERFAALYRIIDVVKQATAEHEHNPQLFTLLAATLQSLYQQG
SNNLLLTAWFLLRLVSLLGFQPSLRQCVFSNHQLATEVVAMKLSELLFVM
NPGGLALPAAGGISVGKQWRVPVALALQIAPLAEARTPADISLQVEDAEL
ELLCAILYDYCAIHLEHTPKRRHLAITAQLAEA
>Cag_0934 ATPase
MSYQVIARKYRPAKFSDITAQEHVTRTIQNALRSGRIGHGYIFSGLRGVG
KTTAARIFARALNCQKLIDDADYLQQVTEPCGECESCRDFDAGTSMNISE
FDAASNNGVDDIRTLRENVRYGPQKGRYRVYIIDEVHMLSIAAFNAFLKT
LEEPPPHAIFIFATTELHKIPPTISSRCQRFNFKRIPLEAIQQQLQQICE
AEHIQVEADALQLVARKAQGSMRDAQSILDQVIAFSSENALEGSITYRGV
ADLLNYIDDDTMFAVTDAVLANNPVAMLEVAHFVLKNGYDEQDFLEKLLE
HLRNLLVVLNLSSTRLVERPDAVRERYQRDAAKFSPHTIMQMAELLLQTQ
KELKFLFEYQFRFELALLKLLEIAHPPASAAALTIAPEKKKPLSNQ
>Cag_0788 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFALPALTHSK
>Cag_0223 Single-strand binding protein
MPFLQLLSELDAMARSLNKVMLIGHLGTDPELRTTTSGQSVANFTLATNE
NYKDSSGNLQERTEWHRIIAWGKLAEICNQYLKKGRQVYVEGRLQTRSWD
DQKTGEKKYTTEIVCSDMQMLGSPREQMGGESTMQPYDQSTLPSQSSAPS
VMPPATPTVPTMIDTDKDDLPF
>Cag_1767 Holliday junction DNA helicase RuvB
MRIELLNTPPDAAESRFEEQIRPIRMEDFAGQQRLTDNLKVFISAAKMRG
DALDHVLLSGPPGLGKTTLAYIIASEMGSSIKATSGPLLDKAGNLAGLLT
GLQKGDILFIDEIHRMPPMVEEYLYSAMEDFRIDIMLDSGPSARAVQLRI
EPFTLVGATTRSGLLTSPLRARFGINSRFDYYEPELLTRIIIRASSILGI
GIEPDAAAEIAGRSRGTPRIANRLLRRARDFAQVDGISTITRTIAMKTLE
CLEIDEEGLDEMDKKIMDTIVNKFSGGPVGIASLAVSVGEERDTIEEVYE
PYLIQAGYLARTTRGRVATRKAFSRFADHTLLGGNFGGHKGSLPLFDESE
AD
>Cag_0154 Excinuclease ABC, A subunit
MSFSHISIRGARVHNLKNISLDIPRNQFVVITGLSGSGKSSLAFDTIYAE
GQRRFMETLSPYARQYIGNIERPDVDFIEGLSPVIAIDQKSTSRSPRSTV
GTITEIHDFIRLLYAKAGRRYNPETGAMVQAQSADNILATILALPEGSKV
QILSPLVTGRKGHYRELFERLRSKGFLRVRVDGELQEMVPNMQLERYKSH
TIELVVDRLVLAPESEARVREAVMLAISISEHKSSVICTPFEGGFTELAF
TLSKGDNEDALPTSTLAPNHFSFNSPYGACPTCNGLGELMQLSGELMIPD
PSLSLNQGGLDPFGKAGKRNHWQVIRAIAKEFDFTLDTPMSKIPKSALKI
LLNGSGKRTFEVAYTSSGHTSLYPQPFQGAVAYVQEILNNATTSKVREWA
EAYMLHQPCPVCLGARLKPESLQVKIHGLNIAELEALPLPETLAFFNNLP
PNLSQKELIIATPVLHEITKRLQFLLDVGLGYLSLDRSSHTLSGGEAQRI
RLASQLGSQLSGVLYVLDEPSIGLHQRDNHKLITSLKHLRDLGNTVLVVE
HDKDTMLEADTIVDLGPGAGAYGGEIVAFGAARELDPSSLTAGYLNGTNR
VFYASEASSEKTDADADATPLFLTLKGCKGNNLKNIDAQIPLRKLVSITG
VSGSGKSTLINETLYPILARHFYRSKVVTAPFDAIEGIELLDKVVNVDQS
PIGRTPRSNPATYTGAFTFIRDFFTRLPEAQIRGYKAGRFSFNVKGGRCE
VCQGAGTRKIEMNFLPDVYVQCENCKGERYNRETLMVKYRGKSIADVLEM
SITEAAEFFTDFPRIRRILNTMQSVGLGYLKLGQPSPMLSGGEAQRIKLS
AELAKIQTGKTLYILDEPTTGLHFQDTQHLLEVLRKLVEKGNSVIIIEHN
LDIIKNSDWVIDLGAEGGFEGGTIIAEGTPQQIADTPHSHTGRFLKMEMG
G
>Cag_0275 NAD-dependent DNA ligase
MTIIDASERIAQLRQEIERHNYLYFNEAKPELSDYEFDKLLEELMALERE
FPDLLTPDSPSQRVGGTITKEFPVVTHREPMLSLANTYSAGEVAEFYNRV
AKLLAAENVHKQEMVAELKFDGVAISLLYRDGVLVRGATRGDGVQGDDIT
PNIRTIASIPLRLHQPLAGEVEVRGEIFMRKEDFEQLNDNRPEEERFANP
RNATAGTLKLQDSAEVARRRMNFVAYYLKGLKDETLDHVSRLHKLEALGF
TTGGHYRRCKTIEEINTFINEWEEKRWKLPYETDGVVLKLNNVQLWEQLG
ATAKSPRWAIAYKYPAQQARTQLCNVVFQVGRIGTITPVAELTPVLLAGS
TVSRSTLHNFDEIERLSVMILDYVMIEKSGEVIPKVVRTLPDERPADAHA
IAIPTHCPECDTPLIKPENEVSWYCPNEEHCPAQIRGRLLHFASRNAMDI
KGLGDALVEQLVAWGLVHDVGDLYLLQEPQLERMERMGKKSAQNLIRALD
ESRTRSYDRLLYALGIRHVGRATARELAHAFPTLDALMQANEERLAEVPD
IGTTVAQSIVDFFAKPSSRQLVDKLREARLQLAASASKIEQVNRNFEGMS
LLFTGTLERYTRQQAAELVVERGGRVVESISKKTSLLVAGRDGGSKLDKA
HKLGVRVISEDEFMGMM
>Cag_0832 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_1438 conserved hypothetical protein
MQILAGQFRGQKIGRSASAAVRPCSSRVKKSLFDTLAVRMDLEDAHVLDI
FAGFGSLGFEALSRGAASVTFVDRFHESLKALKSTAAKLGVTNKVSIVNA
DALAFLGRTTNQFDLLFCDPPYAWADYHALLELIFRRSLLAEDGLMLMEH
STQHNFSHTPEYLFHKDYGMTRVSFFQPPPLNQP
>Cag_1694 HRDC
MQIKLFTIPISDSGAPEEELNAFLKTHKIVSVDSELANNKDGAWWCFCVR
YLEQAMNALPERKVKVDYRQVLDDVTFQKFVKLREIRKRVASEEGLSAFI
VFTDEELAELAKLDEISVKSMLSIKGIGEKKIERFARYFITTPESDEAQG
EIG
>Cag_0416 Endonuclease III/Nth
MNPQEKIIALHDLLSKQFPNPKSELEYLSPFQLLIATILAAQATDKQVNV
ITRELFKRAPDAITMSRMELEEITGYVRTINYFNNKAKNILEVSRRLVEH
FGGEVPQEREALESLPGVGRKTANVVLANAFGMPVMAVDTHVHRVSNRIG
LVSTKKVEATEEALMAIIPEAWVADFHHYLLLHGRYTCKAKKPACPTCTV
AHICDFAE
>Cag_0699 DNA mismatch repair protein MutS-like
MNPSTLKKLEFTKIAAYAAQLCLSPMGRDRLLNARPLREREALMAELERV
LELRMLLQEGLTLPFSHLPDTRVLLKKLEIEHLALEPLELLDLYHLLYSS
VQLRRFMYGNRERYGRLNDLTIMLWMERSLQAMIQRCVDERGLVRDSASD
GLLLIRHDLAESRELLRRRMERLLRRASANGWLMEETVAVKNGRLTLALK
VEYKYKIPGYIQDYSGTGQTVFIEPAETLETSNRIQDLEISERREVERIL
QEVSAALRGELENIHHNQQLMAEFDALYARARFAVETNAVLPTVTEGNEL
RLIKAYHPWLLLSHRERTVQPLDLHLSAEEQVLVISGPNAGGKSVTMKSV
GLLCCMLVHGYLLPCSESSCIPLFNNIFIEIGDDQSIEHDLSTFSSHLSA
IRSILERAGTRDLVLIDELCGGTDVEEGGAIARAVIEELLASVAKSIVTT
HLGDLKAYAHQRDGVVNGAMAFDRAELQPTFRFIKGLPGNSFAFAMMQRM
GFSPALVERARHFMAHERIGLEQMVDDLSHIMEEQQRQRQQLDDEQRTFA
ERERTVLEVEATLKQQQRELKQQISRAVQKEVEHARKEIRAIVQEVKAAP
TNPQVVQAAREKLGIKRQEVEERHTTAAPTTASEPTIDRTITIGDMVRLL
DTNATGEVERFNGDNVVVRCGTIRLQTHLKNLEKSSKTKARTAQRDTSNS
KVRSWSTVTNEVSSTQLDVRGMSGNEAVPHIERFLDTLRLHRIHFATILH
GKGTGSLRKRTAECLKLHTAVKSFRLGGLGEGGDGVTIVELGE
>Cag_0748 transposase
MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HNLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEYDSRIWRIIHHYLDEVLEQQ
DLSSVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPEAHITFDKFHIVQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGGKF
EFALPALTHSK
>Cag_0809 probable transposase
MAGTYSQIYIQYVFAVKGRENLLQKPWRDDVFKYIAGIIKGKNQKPIIVN
GIEDHIHVFVGLKPSMSIADLVRDIKNNSSNFINEQKFLPRKFAWQEGYG
VFSYAHSQIEYVYQYISKQEEHHKTKTFKNEYLEFLQKYEISYDEKYLFE
WLD
>Cag_1063 transposase
MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ
DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA
VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA
RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF
EFTLPALTHSK
>Cag_0001 chromosomal replication initiator protein, DnaA
MHDHSPILVTDPHSLKGQKQSSMEQQVWDTCLAVIKESINPLAFKTWFLP
IRPLGFVGGELTIEVPSQFFYEWIEENYSLLLKQTLRDVIGSEARLMYSI
VMDKSQGQPVTIELPQQTTSPFTYEQAPLKVDRIEEQRHESYERNVSRFE
SHLNTKYIFDTLIRGDCNSLAFAAAKAVSQNPGQNAFNPLVIYGGVGLGK
THMMQAVGNSVRENRLTDRVLYVSSEKFAIDFVNAIQNGKIQEFSSFYRS
IDVLIIDDIQFFSGKEKTQEEIFHIFNTLHQSNKQIILSADRPIKDIKGI
EDRLISRFNWGLSADIQPPDYETRKAIILSKLQHNGVTLDDAVIEFIATN
VTENVRELEGCIVKLLAAQSLDNRDIDLAFTKSTLKDIIRHTTKQLTLDT
IEKGVSSYFSITSNDLKGKSKKKEIAVGRQIAMYLAKMLTDSSLKTIGLH
FGGRDHSTVIHAVSTISKRVEQISEERKRIEEIKKRIEILSM
>Cag_0328 ATP-dependent DNA helicase RecG
MDGTTSVAFLKGVGSRKAVVLGEVGIVTVDDLLAYYPRRYLDRRSIKRVR
ALVDGELTTVVGTIVRTQLEQPTSGKARFKAWLDDGSGLLELTWFRSVRY
FSRFFTKGESLAVHGKVSFFGNQAQMQHPDYDRLTPENAVGGEKGSDDFA
LFNTGAIIPLYHTTEAMKQAGLASRQLRVLIKRALEEVPFREQENLPLSI
IRQYGLIPQWEAEREIHLPSSPEKLEQARYRLKWTELFYAQLLFALRRST
LRRNRAAVRFTHSGELTRKLHESLPYQLTEGQKQAVRDIYRDLRSGSPMN
RLLQGDVGAGKTMVAMFAMALAVDNGLQAMVMAPTEILAVQHALVMKRFF
APLGIELGLLTGKQGKKERRATLEKLRTGDMQLVVGTHALLEPDVQYANP
GLVIIDEQHRFGVLQRKALQEKAANPHVLLMTATPIPRTLSMGMFGDLDL
SIIRDKPVGRQPIKTVLKKEQDKPSVYHFVREQIAAGRQGYIIYPLVEES
EKMDLKAAVESYEELSTAIFPDLSIGLIHGQMSPDEKEHVMERFRQREFS
ILVGTTVIEVGVDVPNATVMIIEHAERFGLAQLHQLRGRVGRGEHPSTCI
LLTAKMTADARERLLAMVSTNDGFVLSELDAKIRGVGNLLGKEQSGTLSG
LRIADLNTDEAIMAAARQAAFTLVEADAQLRATEHRMVREHYMRYYHERF
SLADIG
>Cag_0003 RecF protein
MKLQRTIFSGFRNHTSLLFEPSEGVTIIYGANGSGKTSLLEGIHYGALTK
GLLGAPDSECLSFDTEAFTLDSHFLSDSNIPIHVLVTYQLEGEKQVIVDR
QEVKPFSSHIGRIPTITFSPYEISLVSGPPAERRRFLDSAISQLDHRYLD
RLITYRRILQQRNALLAQLSSGEKSNRNTLPLWTTQLAELSAWLVERRLL
FLTSFSPYFQHYYRYIIKGEEPSINYRCTSCPLHGNTTFQELYQLFLQRY
SDIEAQEIQRGQTLFGAHRDDVLFFLNEKEIKRYASQGQLRSFLIALKIS
QAHLFADHLHEQPMCLFDDLFSELDGGRIEQILALLKECGQTIITAVEPR
YTEGITLCDIQALR
>Cag_1118 Tyrosine recombinase XerD
MSTLSSSYQTTLNSFLNYLIVERNFSANTRSSYHNDLHRYLLFVQEQATP
IAEITSKVIDRFLAELVALGLETTSMARNISTIRSFHKFLHNERLSSNNP
AERLHLPKKAHYLPAVLNLSETLALLEAPSIMQPAPTYALRDRAMLELLY
ATGVRATELISIQQEHLYSDAGFIRIFGKGSKERLVPIGASATLWVQRYQ
KELRVQLVKAHSNDFLFLNSRGGKLSRMSLFEMVKTYSVVAGITKSISPH
TLRHTFATHLIEGGADLRAVQEMLGHSSIVTTQIYTHLDRSFIKEVHKTF
HPRG
>Cag_1006 Protein of unknown function DUF83
MYPESDFIAISALQHFAFCPRQCALIHLEQIWSENMYTAEGRELHERVDE
GKTSYKSGVRITRSEPLRNATLGIAGVADVIEWHKQPNGKELPFPVEYKR
GKPKKHNADKIQLCAQALCLEEMLGIHIPSGALFYGETMHRLEVEFTPPL
REQTRGAAEGIHELFERGLTPPPDYSAKCKQCSLLEVCQPNLLAQHNTAR
NYLASLVQTLSAEDA