Gene list
Applied filters:
COG category: Replication, recombination and repair
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS
Number of genes found: 121
# Chlorobium chlorochromatii CaD3, CaD3 >Cag_0763 Exodeoxyribonuclease V, RecC subunit MSAFLSIKSIHDKNSSLTFFLISNQFQSGTMALHLYTSNRMEMLVDSLAE VVRQPLASVFEHEVIVVQSRGMQRWLSMELAGRFGVWANGRYPFPNAMVQ ELFKQLLPSVAQSDAFKKEVMSWRVMRLLPHLLEMAEFLPLRRYAADDSD GLKLFQLSEKIADTFDQYTLFRPDMLALWEAGGGVAEGGEAWQPLLWRAL VEGAGLHRGQLRELLFRQLSRSSSKISELPERITLFGISYLPQFHLELFA AVARLTEVHLFLLSPTQEYWGDIVSRKAMARLSEAEQALRSEGNPLLASL GRIGRDFSEMVLEMSDEALDSQEFYDDPPEDSLLHALQWDILHLQGAGEM DETPRLLQPHDRSVQIHACHTPLREVEVLYDAILGLLEAHPHISLRDIIV MTPDIESYSPYIATVFGTAREAGKEGKGVVALPFSIADRRMMHEGEIASA LLKLLALHGSRLTASMLFDFLASPPVSRAFGFDAEALRLIRGWIEGSGIR WGMDEEDRRERNLPAYRDHSWRAGLERLLLGYAMPEEEQLFQGVLPYGDI AGSAAEMLGRFAEAVEALERFVSSSEPSRTLEAWRQQYAMWLTTFFAPDE DSEREFATLATLGEELAEYGINAGFEENISPLVFFTWLRSRLEEQEQGLG FMTGGITFCAMLPMRSIPFRVVMLIGMNDGAFPRQSRAPSFDLITRQPQK GDRSLRNEDRYLFLESILSARELLYISYVGQSIRDNSEIPPSVLVSELLD AVRRAFVLPNESSIEQHLVVRHRLQPFHHDYFSEHSPLASYSSENYYALI ASEQSLQAVPPIRSFISTALSEPTAEWRTVQLEQLLHFYDNPSAFFLEQR LGIKPEGLLLPLQDSEPFAVESLERYRLQQELLEAQLRGQPAEALLPLFK SRGMLPPAQHGELLFATVMQEVDDFAATLRQHLAGEVALAPLEVDIEVGE FRIVGRLDGIWANAMLRYRPARMKVRDRFRWWIEHLLLCALQPTGYPLTT HMLMSDGEWSYPPIDNPHQHLTTLLQRYWQGLCEPLPFFPRSAYAFVLKG MDANHHLDVGKGIDAAYREWRDDTFTNRKGEGSDSAIQRCFGAAANPFSD TFIELALELFTPMMEAMGAMGDGKRSG >Cag_1296 conserved hypothetical protein MIISASRRTDIPAFYGEWFINRLRVGEVLVRNPMQPKQVSHIALTPETID ALVFWTKNPNPFFRYLAEIDAFGYPYYFLFTITPYDTTIEPHVPTLEKRI AHFQYLAKRIGAERVVWRYDPILFTKTLSPTWHIAAFRHIANALSGYTKR CIISFIDNYRKVRRNMASLPLITPNEGMITQLLQTFTNIAEQQQINLQVC REEIDVTHYGIANGSCIDRSLVEQLCGRPLVGIGKDKNQRKTCGCIASRD IGRYDTCLHGCRYCYAVSNHAKAAAAYKNFNPDTPLLCNELCGNETITCA PKQNQSKLECLPLFEKT >Cag_1447 SMF protein MDILNFLMLSQVPGIGAARIKALLTHWGNLSFLQHATIADLTHINGIGET LATELYNTFHNAAKNDTVRRAAEAQLLALERCNGQVLTLLDEGYPPLLRE IYDPPPCLFIRGTLPPNTEKSLAVVGTRHASAYGKQVTTHFCHAIAKQEM PIISGLAYGIDMAAHQAALDAGGTTVAVLASGIDTIYTDPKGLLWPKILE HGAIVSEEWIGSHITPAKFPKRNRIISGIAKGTLVVESDLKGGALITATT ALEQNREVFAVPGSIFSHTSRGTNKLIQQGQAKAIMEVDDILMELQPSQP HQAKPIHPTKATANATTTTATTQLPLLNPLESQIYQALSSSDPTHIDTLA 
ATLQLDLSTLFLHLFELELQGVIEQQPGQLFLRKA >Cag_0770 Exodeoxyribonuclease V, alpha subunit MITYNERPIDRHFAKMLLQHCGNSKHELLPLLFSMVSNAIGQGSVCLNLA DIAAQSVTYGNRTVQLPPLAELMRLFSTLPVVSRNGAEFRPLVIDNVGRL YLYRYWRYEHDLAEALRQKASTKSCTIEKKSEAVQVLLQQLFPEGSDAQQ KQAAEVALHRRFCIISGGPGTGKTTTVVRIVALLLEQAGGERLRIALAAP TGKAAARLKQSISTIRGTLSCSQTLQQAIPSEVVTIQRLLGAIPNSTRFR YHQRNPLPYDVLIVDEASMVSLSLMHALLMALKPECRLILLGDRHQLASV EEGAMLGDLCSAVGEATPHSPLAGTLVMLEKSYRFQTGGAIAELSRAMNQ GEGEQALALLQSNQSAALRWQPLPTPDALPSALGRAAVAGYRAYCEATTP AEAMERFERFRILAALREGIYGVSGLNRFVEQALAREGLLAPTSLWYAYR PVLITVNDYNVRLFNGDTGLLLPDAENGGVSAWFTTPDGGLRRLPPERLP AHETAFCMTIHKSQGSEFDNVLLILPPTDTPLLSRELLYTGVTRAKSRVE VWGDPTFVQAACKRTTIRHSGFREALALE >Cag_1136 conserved hypothetical protein MNNMQYNRRSIRLQGYDYSQSGAYFITICTQNRECLFGKIVDGNMILNDA GEMIKNIWHKIPTYHPYSYLDAMCIMPNHFHAIIMTVGADSISAPIDSIS APIDSISAPTIGAEMDSAPTLGNIVQTFKRYTTIEYIKMVKQNKLPSFNK RIWQRNYYEHIIRNESDYTHIYDYIQNNPQQWEMDTLYPNTL >Cag_0075 DNA topoisomerase I MASSVAALSAKNKTLIVVESPSKAKTINKYLGSNYTVFASVGHIKDLPKK EIGLDFEHNYSPRYEIIPGKEKVVKQLKKLATEASNILIATDPDREGEAI AWHIANEIEHAKAPVARVLFNEVTKKAILEAIEKPRHIDLRLVHSQQTRQ GLDKIVGYKISPFLWKVVLRGLSAGRVQSVALRLICEREEEIERFVIQEY WTIAADFLTANKESFRARLVRLDGDKPEITNVEQAEAIAAIAKKGNYSVR EITPRIQQRKQPLPFTTSLLQQAASNQLGFGAQRTMRTAQQLYEGIELGA EGAMGLITYMRTDSTRISPEAVGEARNYIERNFGKDYVGAGSSGKPGKNA QDAHEAIRPTSLLKTPEQVKPYLSADQFKLYELIWKRFLAAMMAPAKIEQ TKVDVEEQSGKFLFRANGSRVLFPGFMRVYDDQQELAYEAQTSTKEEVEN EMVVKLPEKLAVNDPLGLGALEQKQSFTRPPARYSEASLVKDLDHFGIGR PSTYASIFSTLQDRRYVALEKRKIMPTDLGRDVAKILVANFPELFNVGFT AFMEDELDKVASGDDAYEKVLDSFYKPLTSALALRSATPLIPQNNEAETC DKCGTGKMILKWTASGKFLGCSNYPKCKNIRTISSNREKPASTGVHCPSC EDGEMVLRKGRLGPFLACSNYPKCNTLLNLNKQRHIEPPKTPPVVTDMAC PKCGAPLYLRSGKRGLWLGCSKFPKCRGRLAWTALEPAAQERWERVMAAH QKAHPPVTLKMVDGSTVSMTSSIDDIIMKADAAGLIAPAMDLVPEAEG >Cag_2007 DNA-directed DNA polymerase B MENLINHIVTNNLLFGKDKEERIVGAYQLSDTHIRLFNRNGDTVTFHDEP FYPYFFLSDSSLLETFVPENQEKFWLVPLAGSNYYTALAIFKSSRNHKNA VDFLNRKWNGNQAAQGEAAGKNSMESNPFMYNKGDTITQYLMQSGKTMFK GMLFDDIYRMQLDIETNYNGEKKGFYDDEIIIISLSDNRGWEQPLHSKGR 
NEKELLQELIAVIQEKDPDVIEGHNIFNFDLPYIQRRCERHSIPFTIGRN QTIPRTYPSSIRFGERTIDFPYCDIPGRHVIDTLFLVQGYDVAKRSIESY GLKNVARHFGFASANRTYIEYKDIARLWQEEPNTLLAYALDDVRETQALS SLLSGSNFYMTQMLPYSYAMTARLGQAAKIEALFVREYLREKHSLPKPTS GQQQSGGYTEVFLKGILGPIVYADVESLYPSIMLSYNVCPKSDALRVFPN VLRSLKELRFKAKDQAQQELQAGNKRNADNFDAMQASFKIIINAMYGYLG YSGGIFNDYGEADRVTTTGQGIARKMIAEFEKRGCKIIEVDTDGIFFIPP ASIASEQEEKALVEEVSQQMPDGINIGFDGRFKKMISYMKKNYALLSYNN VMKLKGSSLNSRSAEKFGREFIRRGFQMLLAEDIKGLHLLFAEYKEKILN HQLSIEEFSRSESLKQTKEQYLEDVASAKRSKSITYELAIRKGMEIRKGD KISYYITGSGSSNFSWDKGKLAAEWDPNKPDENSAFYLKRLDEYSQKFLP FFKPQDYSMIFSTGSLFAFSEEGIELLKEIPNTDSQTE >Cag_1762 site-specific recombinase, phage/XerD family MFMSNSALHQPLPRLLQESALPIQAFLEHVAQRRGLSPNTVVAYRGDLIQ FFTFLAQHLELLDLRAFQPESVTPMDVRLFMGFLLEQGVKQRSIARKLVA VKVFYRYLQEHGIITTCLFSSLGSPKFPQRVPNFLTEEQTSKLFELLETV PNGAVSDSQPANSALHAFTAARDCSILELLYSSGLRVSELVNLRMDELDV ERGYVKVHGKGNRERIVPVGAAAIEALKKYFEVRRNFFRMNKEVEPFTSV FVTQKGAKIYPMLVQRVTARHLSLVTEQKKKNPHLLRHTFATHLLNSGAD LESVSEMLGHSNLATTELYTHVTFERLKEVYRKAHPNA >Cag_0543 conserved hypothetical protein MQLPCIVPSTLRRYLPTLFVTSLALLPLTPISVYGEMAEDLAALSSSSES DYNSEVALALLEELRHHPLSINRATANELRQLPWLSAADVHAIIKYRTQK GAFRSLSELETILGKERATWLSPYLTVEAAPVPAKTTVRPKATSTSKKTK KVATTGSLYSRYFTEMPPRKGILTEKYEGGNSKMYHRAQFYAPHVSASVV QEKDIGEAAITDFTSLSVSVADVGMMERVVLGNYRLTLGQGLMIGQGRFF SKGAEVGGRLTTKTLMPYASASEEGFLQGAAATLQIQPIALTLFYSANQR DAIINKEGVITSLSSSGYHRTTLEVSRKDNITENVMGAHLRYRTAVAGME ATLGGGMMNYSYPYPFDELEPNEPVSTVLGATLTNVDATLSFGSGALFAE AAFASDPHDMAWFAGAEYEPLRGVTAVAALRRYGENFYSPFANAFAERGG GSNEEGLYTAVQAAFSKKVTLGAYYDRFTFPQLGSHYQQAADGFDARAWF SWQQSSLLCWNVQVQHKEKPEEKNQGTTKNPIWTPLPILTDRLQLNCEVT PHKGISLRTRFELKNVDKEYLLATQSFTGKMWYQQVGYRTENFSLKGRFT RFTTTDYAAAIYAYEDDLPLTSSLGMYSGDGSSLFAVATWQPMKQMKVAA RYEVTRYNDRDVYSSGNDERATNAPSSLHVGCMLSF >Cag_1507 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK 
HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_1317 MutS 1 protein MAKEQSGTKEHSPMMRQYLEVKERYPDYLLLFRVGDFYETFFDDAITVST ALNIVLTKRTADIPMAGFPYHASEGYIAKLIKKGYKVAVCDQVEDPADAK GIVRREITDIVTPGVTYSDKLLDDRHNNYLAGVAFLKEGKTLMAGVAFID VTTAEFRITTLLPEELPHFLAGLHPSEILFSTQEKERTLLLKKSLPSETL ISLLEPWMFSEEQSQTVLLRHFKTHSLKGFGIETAGGNRAALVAAGVILQ YLEETRQNSLSYITRIGELHHTEFMSLDQQTKRNLEIISSMQDGSLSGSL LQVMDRTRNPMGARLLRRWLQRPLKKLTNIQERHNAVEELVENRTLRESV AEQLAAINDLERSLARIATLRTIPREVRQLGISLAAIPTLQALLSDVTAP RLQALTAALQPLPKLAEQIESAIDPDAGATMRDGGYIRAGYNEELDDLRS IASTAKDRLMQIQQEEREATAISSLKVSYNKVFGYYIEISRANSDKVPAY YEKKQTLVNAERYTIPALKEYEEKILHAEEKSLLLEAELFRNLCQQIATE AATVQANAALLAELDALCSFAECAVAFDYTKPTMHEGTTLSITAGRHPVL ERLLGAEESYIPNDCHFDDKQTMLIITGPNMAGKSSYLRQIGLIVLLAQA GSFVPAESASLGVVDRIFTRVGASDNLTSGESTFLVEMNEAANILNNATE RSLLLLDEIGRGTSTFDGMSIAWSMCEYIVHTIGAKTLFATHYHELAELE ERLKGVVNYNATVVETAERVIFLRKIVRGATDNSYGIEVAKMAGMPNDVI SRAREILAGLEKRDVEIPRQKAPKVNTMQISLFEETDNQLRNAVEAVDVN RLTPLEALLELQKLQEMARSGGY >Cag_0088 DNA gyrase, subunit A MQRERIVPISIEEEMRGSYLDYSMSVIVSRALPDVRDGLKPVHRRVLFGM HELGLQAGKPHKKSARVVGEVLGKFHPHGDTAVYDSLVRLVQDFSLRYPL IDGQGNFGSVDGDSPAAMRYTEVRMKSIAGEMLKDLEKETVDFALNFDDS LEEPTVLPSAIPNLLVNGASGIAVGMATNLAPHNLREVVNGIIALIEQPE IEIQELMKHVIAPDFPTGGIIYGYEGVRQAYLTGRGKVVIRARALVEVTQ KNGRESIIVTELPYQVNKVRLIEKIVELVHDKKVEGIADIRDESDREGMR LVIELKRDAVAKVVLNNLYKHTPMQDTFGVINLALVDGVPKILNLKEMMQ YYVKHRNEIVLRRTRFDLAAAERRAHILEGLKICLDNLDEVISTIRQSPD TATAQERLIERFGLSEIQAKAILEMRLQRLTGMERQKIDTEYIEVLALIE ELRFILNSPEKQMEIIREELLKVKDVYGDERRTEIVPQEGDFSIEDMIAQ EDVVITITHDGFIKRFPVSGYRRQARGGKGVTGAQAKNDDFIEHMFIAST HNYILFFTTSGRCYWLKVYEIPEAGRAARGRSLANIMELPPGEKIRTYIN IRNFEEPGFIVMATTHGIVKKTALEEFSHPRRTGIAAITIDEGDELLDAR LTDGDHQIILAKNSGFVVRFPENEVRPMGRTAMGVKGITLDEDEKCIAMV TTRRMDTALLAVTDNGFGKRSRVEDYRLTRRGARGVITLKPHEKIGALVG LLDVNDEDDLILITVNGIVNRQHVSDIRITGRNTSGVRLIRLMQGDSISA 
LARVPKSDEEGDGDFPLEDADGQIPLFE >Cag_1341 Excinuclease ABC, C subunit MEPLDALEKHGDIKKVLTEKLATLPTSPGIYQFKNSAGRIIYVGKAKNLR NRVRSYFRNSHQLFGKTLVLVSHIDDLEVIITSSEVEALILENNLIKELK PRYNVNLKDDKTYPYLVITNEPYPRILFTRHRRNDGSIAFGPYTEARQLR SILDLIGSIFPVRSCKLRLTPDAIASGKYKVCLDYHIHKCKGACEGLQPE DEYRQMIDEIIKLLKGKTSALIRSLTENMHLAATELRFEQAAEIKAQIES LKRYAERQKVVAADMVDRDVFAIAAGEDDACGVIFKIREGKLLGSQRIYI NNTNGESEASMQLRMLEKFYVESIEPVPDEILLQEALSEEEEETLRAFLL VKAKNEGQEKKGIRLVVPQIGDKAHLVGMCRQNARHHLEEYLIQKQKRGE AAREHFGLTALKELLHLPTLPQRIECFDNSHFQGTDYVSSMVCFEKGKTK KSDYRKFKIKTFEGSDDYAAMDEVLRRRYSGSLTESLALPDLIVVDGGKG QVNTAYKTLQELGVTIPVIGLAKRIEEIFTPHSSDPFNLPKTSPALKLLQ QLRDEAHRFAITYHRKLRSDRTLQTELTTIAGIGEKTAFKLLEHFGSVES VAQASREELQAVIGAKAGETVYTFYRPEG >Cag_1695 reverse transcriptase family protein MKRKGKLVEQIADLHNLYEAFYKAQKGKQAKRYVCAYRKQLQENLQLLRH QILSGAIQTGKYHAFTIYDPKERVICATPFSQRVLHHAIMNVCHPFFEKH QIAGSFASRKGKGTYAALDKAREYNCCYRWFLKLDVRKYFDSINHTVLQK QLTRLFKDKTLLLIFEQIIDSYSTADHKGVPIGNLTSQYFANHYLSVADH YAKEGLRVPAYVRYMDDMVLWHNEKEELLAMGYMFQTFIAKELLLELKPF CLNATHKGLPFLGYLLFENQARLAPRSKKRFLAKYQRYENNLQSGVWTQQ EFAKHALPLFAFTEYAQAREFRKKSLHSFCSLEGVFVRSSKGID >Cag_1918 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_0179 DNA polymerase A MAMLYRAFFALQRTGMSSPSGLPTGALYGFTTALLKIFENYHPHYLVAAF DSREKTFRHHLLESYKANRAAPPEELLQQLEKLFELLKAFGVPVIKQAGY EADDLIGAMVTQFADVCRIGIVTPDKDLAQLVREGVQILKPGKNQHELEP LGCNEVKAHFGVPPKQFTNFLTLTGDTSDNIVGAKGIGPKTAATLLEKYQ TLDKLYQHLDELTPKVRKSLEDFAPNRELVLQLVTICCDAPLHVTLEELA CKNPARDVVLPLLQELGFRTIAARLQAASVALTCACNDGGESAPPMQSDP NSSNLLNGSDGNTSATDTAPPPSFPDVPRHYTLVETREQLQALLEELQQV THIAVDTETTSLDVFEAELAGISLCAEAGKAFFIATTPDALERKEVVKQL 
KPLLENPAITKSGQNLKYDMLVLKKYGIELAPISFDTMLASYVLNPDEHH NLDDMALRYLGRTTTKYDELTGTGKQRRHIFEVEKEALTNYACQDADVAF QLEEVLQAQLQAEPQLLALCTTMEFPLVRVLATMEYAGIAIDTEHLARVA ETTELELQSLTDNIYAAAGSSFNIDSPKQLSHVLFTDLSLPTGKSTKTGF STDVGVLEELAATYPIASDLLSYRTLQKLKGTYIEALPKIINPRTGRIHT SFNQHITATGRLSSSNPNLQNIPVRTALGKEIRRAFIPSTPEHWLLSADY SQIELRIAAELSGDERLIAAFRNGEDIHTATAQVIFGTEEISSDMRRKAK EVNFGVLYGIQPFGLAKRLNIPQKEAKVIIETYKAKYPQLFNVLRHIIEE GKEKGYVTTLLGRRRYIADLNSRNGTVQKAAERAAMNTPIQGTAADIIKC AMNLCYQQMQASGMASEMLLQVHDELLFETTDSEKEALTKLVENAMKEAA VLCGMKQVPVEVDCGVGKNWLEAH >Cag_1744 ribonuclease H MKKQVTIYTDGACSGNPGPGGWGALLMFGSITREVSGSSPATTNNRMELG AAIEALALLKEPCLVDLYSDSSYLVNAINNGWLQRWQRNSWQTAAKKSVE NIDLWQKLIKLLKVHEVRFHKVKGHSDNAYNNRCDQLAREAIKKTS >Cag_0599 hypothetical protein MENFEKIKKILTSNFEIELFEAALASLNDKSNRLRFNNFAYSIRELSRHF LYSLSPELNIKNCRWYKTETNDDKPTRAQRVRYAIQGGISDELLEDWSFD ILGLADTIKSVVSSINSLNKYTHINPEVFDLKDEEVKEKSILVLETFSKF VETIKEYREELKKFLDGHIENHMINSVISNFFKNVDCLAPHHSLEYCEVS DYHISEINDKKIVVNVTGDLHVVLQYGSSSDRREGDGLDLNENFPFETKI RYEISEDFPSDNHEVDDYDVDTSKWYE >Cag_0769 Exodeoxyribonuclease V, beta subunit MHHQPLNHTTVTLAGINLIEASAGTGKTYAIASLYVRLLLEKQLLPEQIL VVTYTEAATQELRGRIRSRIREVLEVFEGAATSDAIVQRLYDQALEQGDD MVERARMALVQALALFDTAAIFTIHGFCLRVLQEHAFESGSLYDTTLVTD QRALLLEIVEDFWRTHFFGEASPLLAYTLQCGGSPESFLALLQKLHVSGG ATIIPTFCDEEREALHATCLVAYAELCRLWQSDGAAVRELLSTDKGLSRA ADYYRADKLELLFAGMEEFIAGGNPFNLFADFQKFATSGIAAGTKPKGTS PDHPLFACAEKLLQAVQKRYVALKSELVQFYQRELPKRKRKANFRFFDDL LSDLADALQAPERGVALAQRLRSTYQAALIDEFQDTDPVQYIIFQTMYAD SDAPLFLIGDPKQAIYSFRGADIFAYMQAARAVEASRRFTLSENWRSTPQ LLNAFNQLFSNERLPFIYPDIIYHPLQAGNPDVANGEESAPALQFYLLEG DDAKGDVLSVEQGEALAAEATAGELYRLLQAGEIIGGKQVAAGDCAVIVR THAQAAQMVAALQRRGIAGVVRSDKSVFATREAEELRQLLIALADPAHEV KVRSALITDILGRSGDDCAELLADEVAWLQVLRRFRHYHHVWQHRGVMVM SRELMADEGVRGRLLASPDGMGERRLTNVLHCIELLHRQEHEHGFGCEEL LQWFSERISLQDELQEEYQLRLESDEAAVRIVTVHASKGLEYPIVFCPFL WNSVGNRRDEVVSFHNEVWQLVKDFGSPERDRHRVLAGRESLAEQLRLLY VALTRAKYRCTVLLARIKSEASAFNYLLHASDATRQSNKVVLELEQEMKG ISSEERKVRLHDIAKQSAGAIGVRQLSRVEIEALKEQPRLVRQRSAEPLH 
LRHFAGTVDGSWRVASFTSFSRHESTSTHFASPELPDRDEVRSSTSASTM QPTLPSEQSIAAFPKGARAGILLHALFEELNFANPTDEAIAERVTEELAR SIYPLSWQSTLITIVQAVLQTPLAALDGSTFQLGTLHAKSWITELEFFFP LRFINSKELSALLTRHGVLPGGIALADMVEVLDFKPVRGMVMGFMDMVFE SGGRYYLLDWKSNYLGASPADYTLEAMGRAMQEHLYPLQYLLYMVALHRY LALRIPNYRYSTHIGGVIYVFLRGVTPEFGEARGFYRDLPSEALIEELTA LLVDFEG >Cag_1981 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_1872 exodeoxyribonuclease, small subunit MTSSQPNEPSLEELLQRLDEITHTLENPDTGIERSIKLYEQGLLIAERCR KRLEYARSKIEKLKPNSSSSLPQFPLTDDLFN >Cag_0603 RecJ exonuclease MKRYRWKCFMPHEETVAALSESINVSQPIARALCNRGISTYNEAKEFFRP VLSTLHSPWLFNDMERAVERLVRALKNGETILLYGDYDVDGTTGVALLLL FLRHHGVEPLWHINDRFAEGYGLSPEGIDRVIASGTTLLITVDCGIKDHA AIRRCGEHGVEVIVCDHHEADVTPEAYAILNPKVVGSGYPFRELCGCAVA FKLVQALAERLGDSEAVWHQFLDLVAVATAADLVSLTGENRTLVIEGLQQ MRSKPRKNFSEMFRVMKVSLGDVRMFHLAFGIAPRINAAGRMHSAHLALE WLLASAPDAVEQHTEALERVNVQRRSLDSTIMSQADKMVESHCASYCSSI VLYDEAWHLGVLGIVASKLIDKYYLPTVVLGGMNGLVRGSVRSIEGLNIH AVLQHCSHHLEQFGGHHQAAGLTLKPENLAVFRKAFDEQCANQLTIEQRQ KVMEIDAVVELEQITDKFIAVLEQFAPYGIGNREPLFMSERLQLAEPARL LKERHVKFAVRDKQKRRFEVIGFNRPDIYNDLRAVKHPTITMLYTIERRQ WNGMWQVQLLLKDLEVQR >Cag_1545 NUDIX/MutT family protein MASRFRGVFKQSGVIPLFDDKVVLITARKSDRWIIPKGYIELGMSAADSA AKEALEEAGLVGKVGEHPIGKYRYNKSGRHFVVLLYPFFVETMLDVWDEV HERERCVVSPDVAATMVAHSDVGRLIRSYCASLDDDEAVLVPPHVASAIT G >Cag_1387 ATP-dependent endonuclease of the OLD family-like MNSILYGVGNKFIQTNTFERNDLHNLDYTNQIRIRIELQGSDFTCPQYWD RQSNSYRTTKSITGTYEITTEIDDSELKSGMQPSMFGMNKHYNIFYINFH NIKDEIKTQRTSWGNLTSFLAKHIKSIVDTDTSMAAKKEDYENEVELATD KVLKNSQLSAFIDKIKENYSTNLRNNSCEVKFGLPDYEDIFLQMIFKVGL NGDNANLIPIDHFGDGYISMFVMAVIQAIAESNTDDRCLFLFEEPESFLH 
ENHQEYFYKTVLCNLAEKGHQVIYTTHSDRMVDIFNTKSIIRIELEEQDK QTVVKYNNVGEFSPTMPTNSNGQEIISFANFNSYIKSVEPNLNKILFSRK VVLVEGPNDILAYKIAIEREVEKAHGDKKYAETYLSFLNIAFVVHHGKAT AYLLIELCKHFGLDYFVINDWDFETDFVTDLANFQDENTLKQDNLYLKDG ADDRSSNSKAMITTNWKLLNNSGIDKIHFNIPKLERVLGYQSDDKDSLGI LNTVQKLIYYTETFLPTKLKEFLELDKLTNLTENVVETANSEVDTDELPF >Cag_0029 DNA gyrase, B subunit MPPAAYGATNIQVLDGIEHVRMRPAMYIGDIHSRGLHHLVYEIVDNSIDE TLGGFNDYIFVALNADGSITVIDHGRGIPVDMHPEKQKSALELVMTVIGA GGKFDKGAYKVSGGLHGVGASVVNALSEWCEVEVYRDGKAYYQRYERGVP QGDVKVIGDSDQRGTKTTFMPDGTIFKTTEFRKEIIIDRMRELAFLNKNL RIIVQDTNGEQEEFHFEGGICEFVRFTDQNRLNLLREPIYLYGERDGTVV EIALQYNDSYQENVFSYVNNINTHEGGTHVTGFRKALTRTLNSYAQKNDL LKNLKLTLTGDDFKEGLTAVISVKVAEPQFEGQTKTKLGNSETQSIVETV VNDQLAEFAESNPNTLKLIIEKVKGAAMSREAARKAKELTRRKSVLESSG LPGKLADCSINDPEHCELYIVEGDSAGGSAKQGRDRSFQAILPLKGKILN VEKARLHKMLENEEIKTIILALGTSFGDEEFAVEKLRYGKIIIMTDADVD GAHIRTLLLTFFFRHMRPVIEAGRVYIAQPPLYLVKSGKDQHYAWDDDER NSIVDNMKKMQKSKANIHIQRYKGLGEMNPEQLWSTTMDPAHRSLLLVSV ENAMEADQVFSTLMGDKVEPRREFIEKNARYVRRLDV >Cag_0332 type II DNA modification methyltransferase M.TdeIII MEMSMRNDLLTIAEASQWASNYLGKQVTTSNIAYLIQYGRVKKFGHNGST KISKEHLCNYYATINRQREHSWKEQLGSDLNWSLSFDQYKEAETTKHVHR LHPYKGKFIPQLVEYFLDDHTDDFKQQMYFTKGDIVLDPFSGSGTTIVQS NELDIHAIGIDVSAFNTLIGNCKISSYNLKDLQQEINRITVVLKTYLKNS SVVAFEEHLLQELALFNKKYFPVPEYKYQLRQGIIDEKKYGIQKEQDFLN FYNSLVHEYGIILYQKNNHHFLDKWYLAPVRAEIDVVFQEIKKVQCKEIK KILTIILSRTIRSCRATTHADLATLVDPISAPYYCAKHGKICKPLFSILS WWETYTKDTIKRLAEFDRLRTNTYQICLTGDSRTINIIEVLEQRHPLLAA LVKQQKVRGIFSSPPYVGLIDYHEQHAYAYDLFGFTRHDELEIGPLYKGR GKEAKQSYINGISAVLNNCKHVLADDYDVFLVANDKFGMYPIIAENAGMK IVNQFKRPVLNRTEKDKNAYSETIFHLKEK >Cag_0321 Exonuclease VII, large subunit MEAINAMDALSVTELTAHIKSELESLFPFVRVRGEISNCKQHSSGHIYLT LKDSGAQLPAVIWKSTASLLSIRPKDGMEVVAEGRLELYPPSGRYQLICR HVAQAGVGALQQAFAELVQKLAALGYFDENRKKTLPTIPTTIGIITSPTG AVIEDMSKVLARRFPAARIALYPVKVQGAGAAEEIAQALDFFNHTKKKQW KPQVIIVARGGGSLEDLQPFNEEIMAHAIYRSAIPVISAVGHETDITIAD MVADVRAGTPSIAAELVVPDSAQVLRDVEQMVAYAQQILNNKIEGAEREL HSLCNSYAFNRPILKMQQCYENLDRFEASMMRSVETTYRQQIQRCTASIQ 
QLNLLDYHKTLERGYALIKKNGRFVTSAKALQPNDTIELLLHDGVRKASV KPPDAFA >Cag_1214 conserved hypothetical protein MKITEAPNCPHCGATMQKCAPPPFNFGDGLGWCTPYMYVCFNDECKLYAN GWNNLKNNFNKTASYRCICYPDNGVFEAMCVFSPDGMKGQIIEE >Cag_0555 conserved hypothetical protein MSIWMLHHKHHRHSIRLPEYDYSTCGAYFITICTQNRACWFGEIINGEMI LNNVGKMVKDEWLKTEQLRTNVQCGAFVVMPNHLHGIIVINETVGAIHEL PLQMSQKQRRNMILPKIIGRFKIQSSK >Cag_0014 Holliday junction resolvase YqgF MPLYQRIVAIDYGTKRIGVAKSDPLGMFAQPIGTVDRAGLSKLLSPMVEA GEVQLVVVGYPLNRHGEQTAMTEVIDRFIESLRLEFPALPIETINEHCSS KSAMQLLVASGTSRKERKTKGRLDTAAACLLLSDYLEQQK >Cag_0202 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_1749 DNA repair protein RecN MLASLSIKNIALIEELTVVFHPSLTIITGETGAGKSILMDSLSLVMGDRA SSSMIRTGANKAVIEAILTDVHSETIEALLADAAIDSRQGELILRRELAA NGQSRCFMNDTPCTLSLLRQAAEELIDLHGQHEHQLLLRSATHEGLLDDF AQAHHERATYSRCYQHLQQLQAQRSALVEKAQSLRDKKEFLDFQLQELQS AQLQEGEEINIEQEITLLENAEQLFTLTTLLHETLYNSDNSAYSNLTAAL HTLEKLATIDQRFASAIEEARAATTIVDELARFARSYSADVEFNPERLEE LRERQLLLQRMCRKYGRTHAELIAFEQELCAEQAGAESLDDELRQLEMAI VTEKKQLSQLAIILSEKRQKAATLLEAHLQQELALLGMPHARFAISITQQ EKADGDIAVAGNHFAATRTGYDTVEFLLSANQGETARPLTKVASGGEISR VMLALKSALATSTHLPILVFDEIDTGISGRIAESVGKSLKKLSRLHQIIA ITHLPQIAAMGDLHLSVQKSVRENRTTTSVTPLDGESRLHAIASLMSGEQ ISATSLNLAAELLAHGQAVNLPSI >Cag_1025 conserved hypothetical protein MKPLPVGIQTFSEIIKQDYLYIDKTSLANELIKKHKYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKDLLIYNQWNWTVTYPVIKISFSGGIRDTESLRE NLFYILKDNQERLNITCEEKSNANLCFAELIKKAFQHYQQKVVILIDEYD KPILDNIENIPSALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPDFGNVCGYTQNDVDTTFAPYFDGVDMEEVKRWYNGY NFLGDKVYNPFDILLFIKNKYVFDSYWFETGTPKFLIDLIKKNNYFIPNF 
LDIKVDKSLVNSFDIENINLQTILFQTGYLTIKQFLPSGMGIGYKLGFPN KEVQISFNNYILQVLTSDSDKEPIRHELFDIMNNGKVANLEPVIKRLFAS IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRVDLTLKT LDKTYIFEFKVIAEEPLEQIKKMKYYEKYDGERYLIGIVFDPKARNVSRF EWERV >Cag_1301 Excinuclease ABC, A subunit MNAHGQLTDTSLPDIVLKGINTHNLRNISVRIPRNKFIVITGVSGSGKSS LAFDTLYAEGHRRYVESLSAYVRQFLERMPRPDIEHVEGIAPAIAIEQKA LPKNPRSTVGTVSEIYDYLRLLYARIGKIYSRDTNELVLKHTPDDVSLQA GFIEDGKKFYVGFFFPHHHTAQQLDCSPEEEIANLLKKGFFRLLAGDELL DLNQEADYQKVLDMPAKVRAELLVVVDRFVARNNDKLFSRISQAAESSFM ESGGHAVLKVVDGKTYRFSDRLELHDIEYQEPSPQLFAFNSPIGACTTCQ GFGRIMGIDEDAVIPDKSLSIEEGAIACWNSEKYRWNLLELMHYAPKFGV PLREPYEKLTFEQKEIIWKGTPDGSFNGIRAFFAEIEKDAGYKMHYRVFL SRYRGYAICPDCEGSRLNPDALQVKISGRHIGEVTRMSIGEVAEFFRNLN ISPFDRSVAEVILQEINRRLGYLLDVGLDYLTLDRLTHTLSGGEFQRINL STSLGSPLVGTMYILDEPSIGLHQSDSARLIALLRKLRDLGNTVVVVEHD REIIEAADEVIDLGPFAGRLGGEVVFQGSMEAMRSSGTSLTAQYMNGEQQ IEVPQQRRTVDFSACITISGAMQNNLKNIDVQIPLKVMTCITGVSGSGKS TLINDILCKGILREKHGSRGTVGTHRSLTGAWLIDRIEHVDQSPIGKSSR SNPVTYMKIFDDIRTLFANTPDARKKKVKAGYFSFNIPGGRCEVCSGEGS VHIEMQFLADIEAVCEACNGLRYQPEALAIKFNGKSIAEVLDMTVSEALS FFKGEKNIVKKLSVLDQVGLGYIRLGQSSSTFSGGEAQRLKLATFIAHAD TTHTLFVFDEPTTGLHFEDIKKLILCFEKLLEQNNSLIIIEHNLDIIKQA DWVIDLGPGAGDKGGHLVEQGTPEEVAQCTESLTGQYLRGVV >Cag_0765 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_1625 Helix-hairpin-helix DNA-binding, class 1 MKWLNSLATKLSLTKAEITLITALLGFLLLGGVVKNFQDVEERTTLIKRA EAARLDGAEVDSLLRLASLKEGDLSAEPVAEQAEEGEVAPSTKKKSKSAR SEKKEFHGTVAFNKASAAQLQKIPGVGTVMAERMIAFRLLKGGKVSDMKE LLEVKGIGAKKLEQLQPYLTLD >Cag_1630 hypothetical protein MDNTKQDFEKRKKEIESYFNFLLIFDDDKTKIRYIKDGILVNEKINPVFQ 
ITLIANSFLILYNLIESTIRNSIIEIYEKIEADEITYETLSENLKKIWIK QKTDKLKENNFKQDTLRGYIAEIANDILNRETIRFDKDNLEFSGNLDARK IRDLADSIGFQKTVNGQNLVDIKNKRNRLAHGEHTFYDVGKDYTVNDVIE FKTETFNYLSDIITNIDHFISTQAYKIKN >Cag_1041 hypothetical protein MNLQEAYKQKAETELELAHTRLVEFRAKVKNLNAEAHLNYAKQLDDFEHG ITTAKEKLHELGEAGEDAWEKLKDGVESALRSSSKTLQEIADKFKD >Cag_0658 transposase MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HNLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEYDSRIWRIIHHYLDEVLEQQ DLSSVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPEAHITFDKFHIVQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGGKF EFALPALTHSK >Cag_1772 hypothetical protein MLTNITIENFKKLERISFPLSQSVVIIGPNNSGKSTIFQALCLWEIGVKN YIAAYQKNDLNRQGTITINRRDLLNSPIADARFLWKSKKVTQRNISGAGQ KHVPLSIELEGDNNGVQWSCQAEFTFSNSESFSCKICTGLQQMVELYENE HGLHFGFLQPMSGISTTEDKLTKGSIDRKLGEGKTAEVLRNICFEILNPE TASKNRNNAENNWLQLCNVIKVMFGVILQKPEFIKATGLISLEYIENNIK YDISSGGRGFQQTLLLFSYMFANPNTILLLDEPDAHLEVIRQREVFQKIN DIATITNSQLLVASHSEVVLDEAAEASKVIALIENQTFEVNTSTNSKSIQ YIKKALTEIGWEKYYLAKSKGHILYLEGSTDLQMLLAFATALNHNVAALL RFANVSYTSDNVPNTAVANFVALKEIFPELKGLAIFDKIEKNLNDIKPLT VVCWQKRELENYFARPYLLIKYAQSLHEKYEQFSLEQLEKAMKKAIEDFT LRAYLNDLNHNWWNSAKLSSEWLDNIFPEFYKQLNVPLNFYKRDYYQLIA LMERQDIADEIVDKLDLIYEILK >Cag_0507 hypothetical protein MSREVKIHLISESKSLVNSFFKKTIHNLPNNDSLISSGITITSIKTSDLK FFPQNSALENTVLHFWDLSIDSSIPQSIYPLFMTPNSVYLLLLDNFNQNE KFWLKLIKTHGKSSPIMILKDKSKGVFSIEEKSLNLEFPLIDNQFINVNF DDDDDKGIVDFMENFAQLLMFKQNKCIIELQNSWLLIKEAIFKETEQVKF ISKKKYKQVCYDKGVYNQTDSELLFEYLKNLSIILFFKEIPFADIYIINF SDNSSNLCWLIDGINRILTSKKINNGCLYWRDLDFMLEDDEEKNIYDTKD LYYILELLILLNVCYEIDKGCYLFPNKMPSDFVLSLPTNRSQTCFIMQYN YLPFDIISRLMIMMKKDIIDDQYWVYGILLKSHNFSKLHASINPEAVNDV TALIIADPDNKQIRITVYGPDRYRRHYFQVIWNHLHDINKKYDDLEVKEL VPLPDRPDKLINYQDLLGYELHNIKKYPVLQSYRNYLVSDLLDTVIDNKK VNKKEIVINNIVNNQNSESQVEYEKLQKDVLKLINAISEKISTLPNNEDE 
KKLKSILNNTSNDLENIESPTYKKQLRQFIEMLQSNPHITNLVKFIANGP DKVNAIIDLYNHIM >Cag_0564 Histone-like DNA-binding protein MGNTTTKADLVAVIAHKTGLTKNETEAVVDGLFESIIESLKAGRRIEIRG FGSFNIRQKNFRKARNPRTGESVEVDPKQVPAFKISKEFKLAVSESLKGG DV >Cag_1966 conserved hypothetical protein MLYEDTDNVTPNGGQECPPSFSPSCLPFLNPDCEIAMTHHRLPHWQQGDV WVFVTWRLADSLPKVTLDEWTETRKIWLSLHPEPWDEKTEKEYHQRFSLQ RDEWLDQGCGSCLLKDTVNAKIVVDALLHFNGLRYQLASFVVMPNHVHVL FRPFGKYSLSEIVKSWKGFTAREINKRLGTKGVLWQDGYWDRLIRNERHF FKVVAYIRHNPINAIQKEGGHSCPPFQCFVE >Cag_1347 TatD-related deoxyribonuclease MFIDSHCHLSFPDFDADRNDVLQRLQAAKVSLLIDPGTDVTTSKNSIALA QEVDCVYANVGLHPHEATQPIGDDVFAQLEALAHQPKVVGLGEIGLDYHY PDCNASAQQAAFREMLRMAIRLDIPVVIHSRDAWSDTLRLLDEEQHSALR GIMHCFSGDVAIAKECIQRGFKLSIPGTLTYKKSLLPEVVAQVALDDLLT ETDAPYLAPVPHRGKRNEPAYVALVTETIARIRSLSVEDAATAIYRNTLS VFEKINGNGLSVKIADNK >Cag_0554 conserved hypothetical protein MSIWMLHHKHHRHSIRLPEYDYSTCGAYFITICTQNRACWFGEIIDGEMI LNNVGKMVKDEWLKTEQLRTNVQCGTFVVMPNHLHGIIVINETVGAIHEL PLKMSQKQRRNMILPKIIGRFKMQSSKQFNQLHNTPGQQFWQRNYYEHII RNEQDYHRIHDYIVNNPLKWECDSLHP >Cag_0911 hypothetical protein MIITPKVDETQEFIEIANDFSNPLDLVREAISNSFDANANKIYLSFDMVK EYMDTNLRIRIVDDGEGMTLDGLQSFFDLGNSTRRGIDGTIGEKGHGTKV YLNSSKISVKTIRDGKQYVAVMIEPIKKLYVREVPTVEVIESNVDELSGT TIEIIGYNSNRRGKFTHEQLKDYILWFTKFGSFESFFEKKENSHKRLFLK GLNASEYEEICFGHSFPNESQPVQRLFEEFLVSAPDYYCKRFVKRGQLKN SPEISFEAIFSVEGNRVKLAHNTMIQRQGRPSIAGNYKVAERYGVWVCKD FIPIQRKNEWVNYKGSEFIKLHAFFNCQGLRLTANRGSIDNTPSEVLSDI QEEIKKIYDEITSSDDWTQLSWLEQEAESYKTTEKEKKDFEFRLKKANKA NICEFENTIIVEPQRESGVYALVLQLKMLKPDLFPFFIVDYDTHSGIDVI VKADDTQPIISSKLYYVEFKHYLTEEFNHSFVNLHSIICWDTTIKHNDIL KDINGEERKMQIIPPESDGDYTKYFLDRPSSAHKIEVFVLKDYLKQKLGI EFRPRTAKDIL >Cag_0528 AP endonuclease, family 2 MKRVGAHVSASGGVEQAPLNATAIGAKAFALFTKNQRQWKAPKLSKATIE AFQKACADGGFQPQHILPHDSYLINLGSPDPEKLERARSAFIDEMQRVAD LGLQLLNFHPGSHLKEISEEASLLLIAESINMALEATNGVTAVIENTAGQ GTNLGYRFEQIAFLIDRIEDKSRVGVCLDTCHLFASGYDLSSTEAIETTF NEFDSTVGLHYLRGMHLNDAMQPLGSRVDRHASLGKGTIGMAAFTFIMNH PACEEIPLILETPNPDIWSEEIALLYSLQQVD >Cag_1993 ribonuclease HII 
MHTHYEEPLWQHYEFICGIDEVGRGPLAGPVVAAAVVFPRWFQPTEALLT LLNDSKKLSAKERESLVPAIKAQALHWALAEVQHNVIDEVNILQATMLAM NNAVKALPIIPSLLLVDGNRFTTDLAIPYKTIVKGDSHVFSIAAASVLAK VHRDALMCVYATHYPHYGFERHAGYPTSAHIEAIRQHGRCPIHRQSFKLR QLGEKV >Cag_1628 C-5 cytosine-specific DNA methylase MKMQNNISAIDLFCGIGGLTYGLKKSGIQVKAGIDIDESCRYSFEENCGT KFINKDIQKLQKEELNSIYGNAEIKILVGCAPCQPFSSYTYKKDKNKDKK WQLLYDFSRLIKETKPAIISMENVPTLLNFKKAPVFYDFIQELTANSYKV WFNIVYSPDYGIPQKRRRLVLLASKLGDIELLPPTHNPDNYITVKDAIGN LEAIKSGETSQNDFIHKAAQLSEINLSRIKQSIPGGSWKKDWDDELKLVC HTKEKGKTYVSVYGRMMWNEPSPTMTTFCTGIGNGRFGHPEQNRAISLRE AAILQSFPADYKFAENEATLKFGKTSKHIGNAVPPKLGEIIGKSILQHLE KYNYGKENK >Cag_1529 conserved hypothetical protein MEKFKGLYRIESARMQGWNYGWAGLYFITICTKDRVCWFGEMVNHKLSLS DIGTIVEMEWRNTFEMRPDMNLYMGEFVIMPNHFHAIIGIGTNRYNIQYD DHRRDAMHCVSTHHCVSNTPPKTTISSQSNNLASIVRGFKASVTKQARML HVDFAWQSRYYDHIIRDEKSFHAISTYIINNPAQWAKDELYL >Cag_1005 Protein of unknown function DUF48 MKKFLNTLYVTSQGAYLSKEGECAVISIEKEVKTRIPLHMLDGIICFGAV TCSPFLLGHCAEQGVTVTFLTQYGKYLCQVQGATRGNILLRRAQYRIADN EAQSAALSRSFVIGKIGNARITLARTLRDHPDKVDALRLKQAQHHLAECI QHLQHETNQERIRGIEGEAAKAYFEVFNECITSPDSHFQFKGRSRRPPLD RVNCLLSFFYTLLTHDVRSALEACGLDPAAGFLHKDRPGRPSLALDMVEE FRSYIADRLTLTLINRGQIHANDFTVSETGAVLLKDDARKKLLTAWQERK QEVIEHPFVKEKMEVGLLWHMQAMLLARHIRGDLDVYPPFVWK >Cag_1017 HhH-GPD MLKEFLITHNKELEIEKSLFSGQSFLWKKHQSNLDSFVTVMDKRLVIISQ LSPYTIRVHCDSEVLYGQKISAFISHYFTLDVPFQKIFSSSFKSNYSEVW RLLDGYKSIALLRQHPFETLISFMCAQGIGMRLIRQQINRLCERYGEFYE AEMEGEMLCFSGFPAPEQLACLNAEELSYCTNNNRERAANIIAVARKVVE GRLDLSSLSYPNMAFEEVQARLTQERGIGLKIADCVALFGLGYFEAFPID THVHQFMAQWFKVPAASRSLTPATYRQLTLEAREILGSHYTGYAAHLLFH CWRCEVKKLCWF >Cag_0597 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW 
YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_1882 Transcription-repair coupling factor MKTQLASEESSVAVVRHNPSWLFDILRQSAPYNQLTSLLSKSNAQQKGDI LLPLAGLYGSFSSLLAATLFADATAPLMVVCSSNSFERYENDLEVLLPKG SLCNSADELSHTIEALATKRRSLLLSLFDDLDVPLCSPSEVESRMFHVTL DATIGYDALKHFLTANGFEQRDFVEDEGEFSLRGSIIDVYPFGAAEPLRI ELFGDTITSLRLFDSNSQLSGKNLQQATLTANFTTPNSPITLLDYLAPET VVLVDDVAELIAQSDGKELLERLCYFRCLSINHAEVQALNFGGEAQQKLQ GNFRTLATLLHTAHHEARQPLFAMSSKREIGELNDFLAQESSQEALPQSG WLPVTLHSGFRFGSLDLYTESDIFGKMHTHKVHRKRKVRGISLKELQKLK VGDYVVHEDYGIGIFKSLETITAGNSEQESVLIEYANGDQLFVNVQNIHL LSKYTASENSSPTLSKLGSSKWAAKKEKVRKKIRDIAINLIKVYAQRKMQ PGFAFAPDSIFMREFEAAFIFDETPDQLRAINDVKKDMQASHPMDRLICG DAGFGKTEIAMRSAFKAVESKKQAAVLTPTTILAHQHADSFTRRFANFPI SIAVLSRFVSRKEQLSLLKKIEEGKIDIVIGTHRLVSKDVHFKDLGLLVI DEEQHFGVEVKEKLREQFPTIDTLTMSATPIPRTLQFSMLGARDLSIVST PPKNRQPVETIITDFDAALIQSAIQRELQREGQVFFLHNRIAGLETIAES LRELVPSARIVYAHGQMPTRELEKIMMDFMQQEVDVLISTTIIGSGLDIS NANTIIINRADLFGLSDLYQLRGRVGRSERKAYCYLITPPMKTLKKDALQ RLAVIESFTELGSGFNIALRDLDIRGAGNLLGAEQSGYIHELGFDLYQKM LEETVAELKTNEFSHMFEEEGNKPLRQQKPCDLLFFFDALIPDYYVAATQ ERFAFYNRIAKATRNEQLDAIASELCDRFGKLPEEVTNLLMITKLKLIGT LLGLEKIDIQPQSTMLYLPDQASEHVAQRHYLQYLFTAVQAEWMAEYKPG FKMEKKMKLQLHHPTHADTTSAGLMERYSALLHKVYEEAKSEVEAAMVG >Cag_0002 DNA polymerase III, beta chain MKILSSIRQLQEPVAKVAQAIPSKSVDGRYDNIHFTLEPNALTLFGTDGE LSITAKIEVESTDSGHIGINARTLQDFLRSMYDTPVTLSIERQEISDHGM VEVTTDKGRYKIVCLFESKPERYDKVYDITLDLPTSELLGLVQKTLFACS IDGMRPAMMGVLFELEGTTITAVATDGHRLVRCRKESSLDIAEKQKIVLP ARVLSILQKLAQHESITMCVSTDRRFVRFISGHMILDAALIVEPYPNYNA VIPVEHDKNVVINRQSFYDSVRRVGRFSSIDDIRLILENDRLTVMAENTS DGEAAQEELPCSYNGEPMTIGFNAKFVEAALAHLDDEEILIELKSPTTAV IFTSSKIEDRDKLIILVMPVRINS >Cag_0828 Primosomal protein n MYARCVADRFFRGEPFSLVVPEAFCEELQAGCMVLLLSLKGQGLMSIGYV LSLSPDAPPDMVNEELPSFEMVDLLNGSQPVLNGELLKLTSWIADYYLTR PIDAIHTALPVAIRTTVHDVVEAAGFTLQAEPTKVMNTALRRSILKLLAT NKQLTVTQLQRRLGKKQLYKTISQLEKGGYLTLSKKFSTKKPKYKSAYRL TAPLQDGVLESVASAKKQHATLSTLADLYPETAFLNELEVSHAVIQVLLN KGLVEKVQKRIESNFSSGYRESAQPAKKPTAQQQKVLNELCSASRQGHYQ 
TFLLHGVTGSGKTLVYIEFLKEVLAAGKTAIVLVPEIALTPQTAGRFREH FHHDIAILHSAMSLQEKYDAWHSLKSGRCRIALGARSTLFAPLENLGAII VDEEHDGAYKQDRSPRYHARDTAVMRAMLSNAICLLGSATPSFESYQNAQ NGKYHLLRMAERIDGATMPTISLIWMRESPRRTTSISEMLYQQIAQRIEK NEQVILLQNRRGFAGSILCLECGHIPLCPHCNIPLVYHATHNHLRCHYCG HTERYKAMCSACKSTGLFYKGSGTERIEEELQKLFPDEKILRMDVDTTAK KGAHGRILREFHERKARILLGTQMVAKGLDFPAVTLVGVLMGDIGLNIPD FRASERTFALLMQVAGRAGRAAIPGEVLIQVYNKESDVFTALLHGDYERF FQQELESRRTLLYPPAARLIKFECSADDEVQAEAAATFCKEIVQQHLPEK QGMVLGPAPACIAKIRNRFRYHVLVKLMLGKLSPLFIREMSDTIHSRFRS ANVLLTVDVDPQSLM >Cag_1099 DNA modification methylase-like MNELQDESVHLIVTSPPYWQLKDYGTENQIGFHDDYETYINHLNLTWQEC YRVLHKGCRLCINIGDQFARSTYYGRYKIIPIHSEIIKFCEIIGFDFMGQ IIWQKTTTMNTSGGASIMGSYPNPRNGIVKLDFEYILLFKKQGTSPKPTK EQKDNSVMTNEEWNTYFNGHWYFSGAKQDQHLAMFPEELPRRIIKMFSFP NETVLDPFMGSGTTALAARNLNRNSIGYEINPTFIPIIKNKIGMDDVFMK VETSVIKQPEITIDFNECVNRLPYQFIDTHKLDKKIDVKKIQYGSKIDSE STGKREDFFSVKEIISPELLKLNNGLIVRLIGIKQNPAINGKATEFLFNK VRGKKVFLRYDAIKHDKENNLMVYLYLENKTFINAHLIKNGLVLVDNSID FKYKAKFNSLTNG >Cag_1093 putative exonuclease MFTFLHVADLHLDSPLKGLEEYPDAPLKQLRHATRRAFDNVVQMALDERV AFVVVAGDLYDTDWRDYNTGLFFVSRMAKLREAGIPVIIVSGNHDAASQI TRSLRLPDNVKILSHTHPESYLLEPYNVAIHGQSFATRFVRDDLARNYPQ ADPSLFTIGLLHTSLETSGDVYAPTTLDLLRSKGYNYWALGHMHRHEVVH RNPWVVYTGNIQGRHIREGGAKGCMLVTVENDAVVQTEWRAVDVLRWARC AVLLEGCDSMEQVYHLVRERMEELRQQAEGRPLALRVQLRGATPLHHTLH TKIGHVMEEIRAIAVSFGDCWLEKVELELSAPHAKSDLLGAASPLASLLE AVDALELPDGSLTSLLPDFEKLRHKLPHELISDGDPFAPPADELEILRDE VKQLLSATIEETIGGTTERAIRGSMGGTIGGRNA >Cag_0708 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFALPALTHSK >Cag_0847 Helicase RecD/TraA MSVQDGQESYYYSPKERLSGAVERVTFHSQKNGFSVLRIKVKGRRDLVTV 
VGATPSIAPGEFVECLGEWHNDSTYGLQFRATELTVVPPETIDGIEKYLA SGMVKGIGPHFAKTLVYAFREDVFTVIEEEPERLLELPGIGQKRMEMVTS AWADQKVIRDIMVFLQSHGLGTSRAVRIFKTYGNESILRVKENPYRLVLD IYGVGFKTADALAMQLGIAPDSLIRAQAGVHHVLQEIASSGHCAAPREQL VAEASRLLSIPEERTHEAIDAELRAGNLVREELRGVETLYLLSLHRAELG VATSLMRLLEGEIPWRHLAIEEALPWVEAQNNITLSPSQKEALHTALTNK VTVITGGPGVGKTTLVKSILLILQAQKVRVALCAPTGRAAKRLSESTGLE AKTIHRLLEFDPLTGGFKHQRDNPLECDLVVVDESSMVDVVLMNRLLAAV PEKAALLLIGDVDQLPSVGAGAVLADIIRSETIPTIRLTEIFRQAASSRI IMNAHRINKGELPLRDESNTLSDFYLIAANTPEEIYNRLLTVITERIPAR FGLHPVRDVQVLTPMNRGGLGARALNVELQKVLNGQVEPSVTRFGTRYAA GDKVIQMVNNYDKEVFNGDIGHISAVEREDGAVLVDFDGTLVSYEFGELD ELSLAYATSIHKSQGSEYPAVVIPLAMQHYNLLERNLIYTAVTRGKKLVV IIGETRALAMAVKNHKAMRRLTGLAERLSALARYEANL >Cag_0368 hypothetical protein MHAKDVVSKDILKRIALDIARILLHLKVDHAELLETEHQRVEERRADVVV LVQGESGRFILHLEIQNDNQANIAWRLLRYRSDIGLAHKGYDIKQYLIYI GKAPLSMPTGIHQTGLDYRYHVIDMHSVDCQALLTQDTPDALVLAILCDF KGRSEREVVRYIIQRLQELTAENESRYHDYMRMLEILSANRSLEKIIEEE EAMLSVVDQTRLPSFRIGMRHGIEQGVQQGTLSLVKRQLTRRFGTLSYHH VARLDKLNIEQLEELSDALLDFNTVTDFDVWLENRKN >Cag_1007 CRISPR-associated protein TM1801 MSTLNQKIDFAIIMRVTNANPNGDPLNGNRPRTDLDGHGEMTDVCLKRKI RNRIMELKDKEQKYQFDIFVQPDDSKRDSHTSLKARFESEIGKNVKDKDD AAKKACKKWFDVRAFGQLFAFDGEESSGLSIPVRGPVSIHSAFSVEPVNV SSIQITKSVSGNEGKNGKRSSDTMGMKHRVDYGIYVTYGSMNPQLAERTG FSDEDAKVIMEILPKLFENDASSARPDGSMEVVSVIWWKHGSKAGKHSSA KVHKSLHVNEDGTYRLDDLEGLTPECINGF >Cag_0042 hypothetical protein MLQKLIIKRFRGFSTLEVDIPKVLLLMGPNSSGKTTALHAIRISCQAAWI AVTNNIAWKVEDTVIIFKDFIIRDISQLMPIADWQALFVNQIVGEHTHFS IEIIFEKTDALSSILIEGKYARNENLKITATIGAETLINNLKNISNRSSQ YKNIAFEFFQKHLPKAILIPPFYGVIRDEEYRAKAVVDAMVGSADQSHVV RNMISRLSTTQLEQLNAFIKDMVGATLVQRTQGDDIEKISPLRVTFRDTN GELELSAAGAGLINLIALYSSLARWESETIDRQIIFLLDEPEAHLHPRLQ GYTADRLATIITNDFNAQLIMATHSIEIINKIGERDDATIFRTDRLNKEK GGQQLIGQTPLLDDLSQWADLTPFSIINFLASKRILFYEGKSDGIILTKC AEILFRNNPDKKKKFEKWTLIQLEGSGNKNIAQLLAHLIDSSTFASVAEK KDFKIVVQLDKDYNDEVEQLKLITNRDISTFYNIWSKHSIESLFCESATL YQWLKPKYPDIQEETIEKAIIAANQDNELNQYAREQRQATLLKPLQKISE NITATNRQADNDIAATPEIWQRGKDRSKVILHHIKTALSTSANSLSTSLT 
KVIEKADVNLFPAGNRAVVPSEIKQLLDWMVTNA >Cag_1004 Protein of unknown function DUF196 MMVLVTYDVNTESPDGKRRLRRIAKTCQNYGQRVQFSVFECNVDPAQWTK LRAKLLREMDPNRDSLRFYFLGSNWQNHIEHEGAKEPRDLEGVLIL >Cag_1796 Ankyrin MKILEYTGFDSSSVAESYRKVATALAQGDFRAAQVKKLVNLTHGKFYRAK LDAANRLLFTFVRYGDEVCLLMLEVIMGHNYHKSRFLRGAPLEEEKIPDV DASEALNDAEQLRYLHPNHTEIHLLDKPISFDDAQQAVYLHKPPLIIVGS AGSGKTALMLEKLKHVEGEVLYVTHSQYLAQNARNIYYAYGFEHPAQEAH FLSYREFVESIRVPTGREATWRDFAAWFYRMRSNFKEIDPHQAFEEIRGV ITAPEDGCLSRKNYLQLGVRQSIFSKEQRSILYDLFLKYRHWLTDSGLFD LNLIAHEWKASPRYDFVLIDEVQDMTVAQLSLVLKSLKKAGHFLLCGDSN QIVHPNFFAWSHVKTLFWKDPNLAGKKQLQVLTANFRNGREATRIANQLL KLKHQRFGSIDRESNFLVEAIGGAEGQAQLMADTDATKREFNKKISHSTR FAVLVMRDEEKQEARKYFSTPLLFSIHEAKGLEYDNIVLFRFVSSCRREF NDIAEGVSLTDLEAIDSLEYCRAKEKGDKSLEVYKFFINALYVALTRAVK NLYLIESDTKHRLFELLGLAVAGKVEVAAEESSLEEWQKEARKLELQGKQ EQAEAIRRDILKEVPPPWQVCNETRLDELIHKVFKEKAPGNKFKQQLYEY ATCHVEPVLAQALEKQTDYRSPHGSFWEHLDTIGRKSYLPYFSQQTKAIL RQCEQHGPNHRLPMNQTPLMAAAAAGNIALTEALLERGADPTLNDHYGYN ALHWAMRQAFRDNRFARTTFGTLYERLAPAAVDISSGERMIRLDRHLAEY LLFQTCWVLFKSRFTTLELNGEYPAFDTSLILEAWEHMPDNVVPTERKRR TYLSSVLARNEVSRNYAYNRSLFERLATGWYQFNPALHVRTSVTEEGQSP WIPIFQAVNLPLISKFCHSHTIATIVQCFRKACMAVIPELEAEIAQQQAT KAAKEQHLQTLVKQVKKKITPSSDSLAAKLLKQHKLSKKLDDELLVPFLK FVREKELEEIRQQKMKKKLEREERQQIKAAEQAKRDEQVQQQLGFDF >Cag_0893 DNA polymerase III, alpha subunit MDFIHLHTHTHYSMQSSPIFPSELFKAAKAFGMPTIAVTDYGAMFNMPEL FSEAKQVGIRLIIGSEVLLLEHDEHQTSRHTVSPSLVLLVKNETGYRNLC ILLSRASREGFVNGMPHVESRLLEQYHDGLLCLSAYGAGRIGRALMAGSL DEAANFSAYYQEIFGSNFYLELQRHNTSFDALLNEQIIGLAQKFSIELVA TNNVHYLRQNDAGCYRALVANRTKEKLSGPVSAALPGSEHYLKSAEEMQQ LFSNEYGELENTLRIAEQCTFTFSDKEPALPRFPLPDGFSDEASYLRHLT WEGAAEKYAKSEEEGISQEEVKERIELELGVIEKMGFSSYFLIVSDLIAA SRRMGYSVGPGRGSAAGSIVAYLTGITRVDPLRYKLLFERFLNPERLSMP DIDIDFTPVGKQKVLEYTVEKYGADSVAKVIAIGTLGAKAAIRDAGRVLE VPLPLVDKLAKLVPTKPGITLEKALTDSRDLREMAESTPELKTLMQYARS MEGRARNVSMHAGAVVITDGALEEQVPLYVSNKIETEERRFADELDLDQP DNGKAKAGESNDEKQVVTQFDKNWIETAGLLKIDYLGLETLAVIDETLRM IKRRHGLDIDLEKVPMTDRKTFRIFQEGKMAGIFQFESSGMQSYMTRLQP 
TQIGDIIAMSALYRPGALNARVDEHRNAVDLFVDRKHGREAIDYMHPMLE GILKETYGVIVYQEQVMQISQVMGGFSMAKADNLRKAMGKKKPEIMEKFK ADFIAGGVAQGVHDTLATRVFDLMSEFAGYGFNKSHSAAYGVLAYWTGYL KAHYTIEFVTAVLNSEIGDTERMKHLTDEAKSFGIATLPPSINKSDALFS VENSSNGRSAIRVGLSAIKQVGGAARAIVTSRMRRKRDFLNIFDLTASVD LRVMNRKALECLILAGACDDFDPHRARLLANIDKAIKFGQMQNRTVTMGQ CGFFSNEEGQEGDIHYPELDNADMMPDGEKLLHEKKLVGFYLSRHPLSAY RRDWQAFANLPLNTKEIVKNKQYKVIGVVVSLKPYQDKKGKQMLFGAIED FTGKADFTIFASVYEQFGHLIKPEEVLMLVVEAELGGGMLKLLVREVLPI KKVRKSLVKKLVLTIDADEQGQLDKLSSIKELFNKHKGGTAVEFEMKAQA GDNIETLTLFARATPIEPEEELIEQLELLLGPDNVRIAG >Cag_0362 Histone-like DNA-binding protein MSKAELVEKIAKQADLTKADAERALTAFVDVMTASLKAGDDVALVGFGTF SVGDRAERQGRNPQTGETITIAAKKVVKFKPGKALKDEIGG >Cag_1580 transposase MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HNLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEYDSRIWRIIHHYLDEVLEQQ DLSSVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPEAHITFDKFHIVQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGGKF EFALPALTHSK >Cag_0181 conserved hypothetical protein MVKTIMAMKIDKPKFNPNIHHRRSIRLQGYDYSQSGFYFITIACQDRICR FGYVENGEMVLNKYGIVAYNEWVRLRTRFPNIELDVFQIMPNHMHGIIVL NEISVEDVGAGFTPAQNNALSNIRAGASPAPTVSEIVGTYKSLVANGCLK IYTTKNETMGKLWQRNYYEHIIRNEQSYQSFSEYIINNPAKWEDDTFYVI >Cag_1608 DnaB helicase MISKKAAPIIDFSKDIDFSQESRIPPYSTEVEQEVLACVLLEDEPIEQVI QIFGESSEEVFYERRHQTIFRAMMQLYHKRQAIDIITVSEELLRMGELEV VGGRHYLAELSGKVISAANIEYYARLAKEKFLYRRLISIATKISGVAYNS SMDIFDLVEHASQQFFTISQAGVKKKASPIKELVKTGIRMLENLRASQSS VTGVASGFSELDQFTAGFQPSDMIIIAARPSAGKTAFSLALARNAAVDFN TPVLFFSLEMAEVQLAIRLMCAEAYVESQLVRTGRITPEMMGRIINSMDK LNEAKLFIDDTPGISIMELAAKTRRMKQEQNIGMVVVDYLQLVTPVRDGR TNREQEIAQISRSLKALAKELNIPIIALAQLNRSVEQRSGDRRPQLSDLR ESGSIEQDADVVMFLSRPEMYGKNTFEDGTSTKDIVEIVIGKQRNGPIGD IRLLFLKNYGRFQSTANVYITANAEAESAPQAEPERYLQPSQEFPPPASG GAFIAQDDAPF >Cag_1392 putative type II DNA modification enzyme (methyltransferase) 
MNNLLIHGDNIAGLDYLLHQKQLKGKIDLVYIDPPFATGGNFTITNGRAS TISNSRNGDIAYSDKLTGDDFINFLRKRILLLRELMSEKASIYVHIDYKI GHYVKIMMDEVFGIDNFRNDITRIKCNPKNFTRIGYGNIKDLILFYTKSS NPIWNEPTEKYSENDIVNLFPKITTNGRRYTTVPIHAPGETVNGKSNKPF KGMLPPQGRHWRTDVITLEHWDKEGLIEWSSTGNPRKIIFADEREGKRVQ DIWEFKDPQYPIYPTEKNSDLLDLIITTSSNPNSIVLDCFCGSGTTLKSA HFLQRQWIGIDQSPHAIEATINKFSDIKADLFIESPQYDFIALTDELINQ S >Cag_0457 DNA recombination protein, RuvA MYAYFRGTLISFTADEAIIELQGVAYHFLISATTSRQLPNSGSEVQLFAH LYVREDAMLLYGFYSEEERQLFRLLLQASGVGPKLALSVLSGLPVHEVHD AIVSNIPERLYGISGVGKKTAARIILELRDKILKLSPVLPTATARRPHNA AQQLRDDAITALVTLGFPRAAAQKTVTSLLDENSNCTVEEVVKSALLLIH NAQL >Cag_1137 HhH-GPD MEDWLPSKRQIEQLQAKVFAFYGEHGRSFPWRNTTDRYAVMVSEVMLQQT QAERVVERFEAWLVAFPTVQALADAPLREVLALWSGLGYNSRAERLQRCA QTIVADFGGVVPALPEVLLQLPGIGAYTSRSIPIFADNFDVATVDTNIRR IVLHEFGLPETLKPRELQMVADRLLPHGQSRKWHNALMDYGALHLTSQKS GIRPLTRQSKFQGSRRWYRGQMLKALLKTEALPLEALEATWADSPYCLRD IASDLVREGLVEYHPSASADDSPLLRIRGSG >Cag_1123 conserved hypothetical protein MNQYNPNIHHRRSIRLKDYDYTQVGLYFITICCQDRTCRFGRIENGEMIL NEHGKIAHNEWMKTREIRPNVELGEFIVMPNHIHAIIRFLRRGELHSPNN NVVFDTPLPFDNGGVFKTPNNTGECNSPLRSPSQTVGAIVRGYKSSVTKQ LGLMGFTEKLWQRNYYEHIIQNEQSYQTISEYIINNPAKWQDDKFYVE >Cag_0086 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_0224 DNA polymerase III, delta subunit MEKLKKAIQSKKIAPIYFFTGSESYLKEEFATLIQGALFASEEDAVANTH LLHGHDMTLRELLSRASEYPMFTERQLLVVRHFEKIKKPTTKEQQKQQYA AFGNYLANPATFTVLLLDADELDKSDFEKQPFSLLKSVRHDFPAIKHPDL FASERAAQAGWEFEPDALKAFAAYIDPSSREICQELDKIILYASERQSAK RITAADVLDCVGVSRTYNVFELEKALVARNLRLCSGMSLMIMDQEGQKEG LMAIVRYLTTFYMRLWKISMPEVQRMAQSDIAKVLGMSPRQEFLIKSYLT YTRQFSLQQTEAALCALRDVDASLKGLRPYSDEKYLLLQLMQRLLG 
>Cag_0314 Excinuclease ABC, B subunit MENRTDNEYQLVSPYQPAGDQPKAIEALVQGVRDGRHWQTLLGVTGSGKT FTISNVIAQLNRPVLVMSHNKTLAAQLYGELKQFFPHNAVEYFISYYDFY QPEAYLPSLDKYIAKDLRINDEIERLRLRATSALLSGRKDVIVVSSVSCI YGLGSPEEWKAQIIKLRAGMEKDRDEFLRELISLHYLRDDVQPTSGRFRV RGDTIDLVPAHEELALRIEFFGSEIESLQTFDIQTGEILGDDEYAFIYPA RQFVADEEKLQVAMLAIENELAGRLNLLRSENRFVEARRLEERTRYDLEM MKELGYCSGIENYSRHISGRPAGERPICLLDYFPEDYMVVVDESHVTLPQ IRGMYGGDRSRKTVLVEHGFRLPSALDNRPLRFEEYEEMVPQVICISATP GEHELMRSGGEVVELLVRPTGLLDPPVEVRPVKGQIDNLLAEIRHHISIG HKALVMTLTKRMSEDLHDFFRKAGIRCRYLHSEIKSLERMQILRELRAGD IDVLVGVNLLREGLDLPEVSLVAILDADKEGFLRNTRSLMQIAGRAARNL DGFVVLYADVITRSIQEVLDETARRRAIQQRYNEEHGITPRSIVKSVDQI LDTTGVADAEERYRRRRFGLEPKPERVLSGYADNLTPEKGYAIVEGLRLE MQEAAEHMEYEKAAYLRDEITKMEQVLKKDG >Cag_0876 DNA photolyase, class 2 MLHSPVDPRRVRLLNHHIDGNGVVIYWMSRDQRVRHNWALLFARWKAAML QQPLMVVFTLAPSFLGAPLRHYDVLFNGLQEVETELRALNIPFMVLQGEP SEELPRYAMHHNASMVVADYSPLHLTRCWKNQVAEALSVPLYEVDAHNIV PCRVASPKQEYAARTIRPKINKLLGEFLTPFPELEALPQPLTEPPVNWQK LRSHFHADASVAPVGWLTAGEAGAHATLQCFVQQKLNGYATQRNDPSLEA TSRLSPYLHFGQISTQFVALQVKAAHAPQEDKDAFLEELIVRRELSDNYC HYNASYDRLSGIPAWAQETLARHATDPRDYIYSHEAFEQAKTHDPLWNAA QHELLQSGIIHGYMRMYWAKKILEWSRTPEEAFEIALWLNDRYALDGREP NGYVGVAWSIGGVHDRPWRERPVYGTIRYMNANGCARKFDVKRYIHNVTS RRPQQVGLF >Cag_1559 transposase for IS1663 MKSNSLSVKTSPVKYSFGLDVSKAKIDVSFCTLDDQQQVKVYGSHSFSNT NKGFVELLLWCHKKCKETLPTVYILEATGVYHEHVAWFLHDHDCAVSIVL PNKACHYKKSLGLRSKTDSIDAFGLAKMGAEQNLPIWETPDKTLRELRII TRHREDLVTDKTIILNRLEAFEFCHNGSALMIKQLKKQLSLIEKQIEEID QLVKETVEENAELKARFDKILAIKGVGLITLATIISETDGFSLITNQRQL TSYAGYDIIENQSGNHTGKTRISKQGNSHIRRILHMPAFLVVKYEPQFAN LFERVYERTKIKMKAYVAVQRKLLILIYALWKNGTVYQSTAQPIIASKLC A >Cag_0682 transposase MMHPSPDHMVHYGVEGNCECGLALSESAISIGECRQQWDIPAPRIEVTEH RQLIATCRCSKVHKGEFPSSLPPYISYGARLKAYTVGLVQGHFISLARVT EIVSDQYGVKPSDGSVQRWISQASKNLTTTYTAIGETISNSAVAHFDESG IRAQGKTQWLHVAATTEAVYYTAHAKRGQEAMSAAGILPLFNGVAVHDHW KPYFRFDHVLHSLCGAHLLRELNAFDETLQHRWPVQLKQVLIDAKNAVAQ AKKAKQTSLPPEQIADLKQRYEQWLNYGLLIFSERPKINKQQGKGKQHPA 
RNLLCRLRDFKDSVLRFIERFDVPFDNNTAERAVRPVKVKLKVAGGFRAM GGAEAFCVIRSVWQTDKLQQQNPFETLRLVFR >Cag_1992 Protein of unknown function UPF0102 MNPPNSTCELGRQGEALAATYLQNEGYQILERNYRFRHNEIDLIALDGST LCFVEVKARLSNKAGSPLDAVTVAKQREIIRAAQAYLTFSGQECDCRFDV IGVNVHAMHEARISSFTIEHIKDAFWVEQ >Cag_1513 DNA repair protein RadC MKLHDIDPDNRPRERFLQHGAAALSPAELLALILRSGSQQYNILDTCHHI INRFSLEKLSDVSLKELQQIKGIGESKAMQIVAIFELNRRLHYSRNQLRK IMAAGDVFEYMSGRIPDESKEHLFVLHLNTKNQVIKNELISIGTLNTAVI HPREIFKSAIRESAHSIIVVHNHPSGDVNPSNADKKITNELKQAGAFMQI EMLDHVIMSKTEWYSFRERGLL >Cag_1116 DEAD/DEAH box helicase-like MSEQQLPLENNFFSLQLPELLMKALEEVGYESPTPIQAQTIPFLLAGRDV LGQAQTGTGKTAAFALPILASIDIQQAEPQALVLAPTRELAIQVAEAFQR YAEYLKGFHVVPIYGGQDYGIQFRMLRRGVQVVVGTPGRVMDHIRRGSLN LTHLKTLVLDEADEMLRMGFIDDVEWILEQTPAGRQVALFSATMPPPIRR IAQKYLDQPAEVTIQTKTTTVDTIRQRYWVVGGSHKLDILTRILEVEPFD GMIIFSRTKTMTIELAEKLQARGYAAAALNGDMPQNQRERTIEQLKNGNI NIVVATDVAARGLDVERISHVVNYDIPSDTESYVHRIGRTGRAGRAGDAI LFVAPREKNMLYAIEKATRSRIEQMVLPTTEVINNKRIAKFNQRISDTIA AEDLGFFTRMIEQYCNEHNVPMLDAAAALASLVQGETPLLLADKPERSRS SERDSYGSSRDRGFEREGRDSRSGGREGRSDRFERDGRSGRDDRGGRDER SAPRKRGRSEVYGEEPKDRYRLEVGSTHGVKAGNILGAILNEAGLAPESV GHISISDTYTTIELPKQMPDTMFHELRKIRVCGRQLRLSRMEEHEGGHST HSSHGTGAYGGGAKKSFRKPNKSANDEGEFFAGFKKKRKG >Cag_0626 ATPase MTESAIQPDLFGFSTPSSSVTSTTEKSSRFVPLAERVRPRMLDEVAGQQH LVGANAPLRRFLESGQMPSVIFWGAPGCGKTTLAEICASTLQCHFEQLSA VDAGVKEVRKALDIATRVRQAGQRCLLFIDEIHRFNKSQQDTLLHALEQG LILLIGATTENPSFEVNGALLSRMQVYTLKPLTAEELEQVIRRALATDAL FRERSIELADLEVLWHYCAGDARKALNAIEAAFALFPTNQSSVQLTREHF EAALQQKAPLYDKSGENHYDVISAFIKSMRGSDPDAALFWLARMIEGGED AKFIARRMVIFASEDVGNADPYALTLALSVFQAVSVIGLPEARINLAQGV TYLASAPKSNASYQAINEAMAEVKSTTATTVPLHLRNAPTKFMKNEGYGA GYCYPHNYPSHFVEQHYFPEGMEPKAYYRPTAEGREKMAQERLHQLWKER YRK >Cag_1802 Methylpurine-DNA glycosylase (MPG) MEPLPKQFYQCSTIELTEKLLGKCFVRILPNGTRLAGRIVETEAYLGEGD EACHAWRSRTPRNEIMFREAGTLYVYFTYGAHYMLNIVSEPEERAGAVLI RAMEPLEGIEFMQQQRNTTKFPNLMSGPGKLTQALAIERSCNGRTLFDGE FFVADAPAIPSHQIGTSGRIGISRSTELPWRKFIMGNAHVSGGKVGGVVS SLQ >Cag_1363 serine/threonine protein kinase 
MRKRLFIKKQKFDKWELKRFLGGGGNGEVWECCDEEGNKGAIKLLKHVKS KSYARFCDETKIMEQNFDIEGIIPILDKFLPEKLDGSIPYYVMPMAESAE KVFKAKNIVSKIDSIIEICKTLAKLHERGIAHRDIKPPNLLVFNSRLALA DFGLVDYPDKKDISLQNEEIGAKWTMAPEMRRESSKADSLKSDVYSLAKT IWIILTENPKGFDGQYSIDSIIELKRFYNKTYTTPIDNLLTKCTDNDPNQ RPTVNEIILELENWKVLNKDFHERNQEQWFEIQTKLFPMTFPKRVIWENI EDIVKILKVVCTYDNLNHMLYPNGGGMDLEDVRLSHEKSCIELDCQLINI VKPKRLLFESFGYTAEWNYFRLELYELEPSGAYENDEYYENIQYYEEYDG YVSPENTMQLLRWFRGSFVIFNKRSVYNRISSTYDGRHNKMNTEEFRDYI QEMVSHTIEMNKKKSAMATIESKRRKTR >Cag_1778 DNA polymerase III, delta subunit, putative MSWSSIIGQQQQLRVLQHALETGRFAHAYLFMGAEGCGKEAVAFEIAALL NCRNASASPQVGACNTCPDCEKVHALNHPNVEYIFPVEAVLLEGGGDLAK KENKRFTEAKERYDALIERKKENPYFAPAMERSMGILTEQILSLQQKALF MPSVGSKKIFIISQAERLHPSAANKLLKLLEEPPEHVLFILISSRPEALL PTIRSRCQAVKFSRITTMQLREWLAQHRPDIVEPERSFVVNFSRGNLRLA WDLLSNRSSDMAEAPALQLRNQALDYLRYVLTPNRFHEAIVACEQYAKSL SRRELTLFLAALLLFFQDACHRRINPSVADLNNPDLSDNVNRFAKNFPNT NYFALSQAIEDAISSLERNVAPLLVMATLTTELRQQLQRRG >Cag_0890 Methylated-DNA-(protein)-cysteine S-methyltransferase MPTTQPPPHKSRLLVQPTAIGRIAIAERNGNIVQLLFEGERVPFVYEEGE SALLLEAFQQLDEYLLGKRTNFTLSLAPMGTPFMQAVWKALTTIPYGTTL SYGALAVQLGSPKAARAVGMANHRNPLPIFLPCHRVVGSNGRLVGYRGGM ALKQQLLELERRVVGNTALHL >Cag_0145 DNA mismatch repair protein MPIITRLPDSVANKISAGEVVQRPASVVKELLENAIDAGATKISVTIKDA GKELIRIADNGVGMNRDDALLCVERFATSKIKSADDLDALHTLGFRGEAL ASICSVSHFELKTRQADATLGLLFRYDGGSLVEELEVQAEQGTSFSVRNL FYNVPARRKFLKSNATEYHHLFEIVKSFTLAYPEIEWRMVNDDEELFNFK NNDVLERLNFYYGDDFASSLIEVAEQNDYLPIHGYLGKPALQKKRKLEQY FFINRRLVQNRMLLQAVQQAYGDLLVERQTPFVLLFLTIDPSRIDVNVHP AKLEIRFDDERQVRSMFYPVIKRAVQLHDFSTNISVIEPFASASEPFVGS SSQPIFSSTSSQAPRMGGGSRRFDLSDAPERAITKNELYRNYREGAFSSP SVASYDAPSPLQQGGLFALASAEESLFGAQAVHEASENIEAFQLSPLDNI VEHKEVEPKIWQLHNKYLICQIKTGLMIIDQHVAHERVLYERALEVMQQN VPNAQQLLFPQKVEFRAWEYEVFEEIRDDLYRLGFNVRLFGNRTVMIEGV PQDVKSGSEVTILQDMITQYQENATKLKLERRDNLAKSYSCRNAIMTGQK LSMEEMRSLIDNLFATREPYTCPHGRPIIIKLSLDQLDKMFGRK >Cag_1189 RecR protein MRFPSVALDTLIDEFAKLPGIGRKTAQRLAMYILHEPKIEAEQLAKALLD VKEKVVRCTICQNITDVGTDPCAICASKARDRTVICVVESPVDMLAFEKT 
GHYKGLYHVLHGVISPLDGVGPDDIKVRELLARIPVGEASGVREVVLALN PTIEGETTSLYLARLLKPLGIAVTKIARGIPVGAELEYVDEATLSRAMEG RTVV >Cag_0241 RecA DNA recombination protein MTMDNPKVEQAGHAVDSAKLKQLNLAVDALEKQFGKGTIMRMGDGSAGLT VQAISTGSMALDFALGVGGLPRGRVTEIYGPESSGKTTLALHVIAEAQKE GGITAIVDAEHAFDPSYARKLGVDINALLISQPESGEQALSIVETLVRSG AVDVVVVDSVAALVPQAELEGEMGDSSMGLQARLMSQALRKLTGAISKSS TVCIFINQLRDKIGVMYGSPETTTGGKALKFYSSVRLDIRKIAQLKDGDE LTGSRTRVKVVKNKVAPPFKMAEFDILYGEGISALGELIDLGVEFGVIKK AGSWFSYGTEKLGQGRESVKKILREDPVLYQKIHMQVKELMTGHTEIISS PTE >Cag_0548 DNA primase MSMIPPAIIDEVRQAADIVDVVSDYVALQPSGRNYKALSPFTQEKTPSFI VSPDKQIYKCFSTGKAGNVFSFIMEMEKVPFMEALKLVAQRAGIDISRYT EPKGKQEGEEEQGSGAALRWAARMFHSLLKQPAGAEGWRYFVEERGLREE TINRFGLGYAPESWDFLLREARREGIKSEQLVELGLLVSHREKQSLYDSF RHRVIFPIFSRGGQVVGFGGRALVSDERSPKYLNSPESAMFAKSKLLYGL HFAKNEIRRQERAILVEGYMDVLALHQAGLTNAVASCGTALTRYQAKMLR HYSEHVLFVYDADKAGQKSMMSGIDILVSEQMVPQVLMLPEGDDPDSFVR REGRQGFLQYAESHTMGFQDFQLAFFEAAGAFSTPEQKAEALRVMVRTIA LIPKRAQRELYAQELSKKVGLTVTALRELLGNATSAVAKQQSCTPSKASA TAPTSSSATNATSAPTIPHAPNLPNAQALPPLSVLEKTFLKALLESTQYG TAVLGFAASHQSMLELRHPLAQEIFAHLIHRYHNIAADPEATIDMVSEIS AFTNPETRDLASTLLLDPPISPKWQQQNDLFSEQARRCLAMFLDAFKNLV LEPLLDEKNKLMEQIRVEENVEREIELSRQKIVLDKKIREENRSLQQMIK AILDSTQQVG >Cag_1509 conserved hypothetical protein MANNLLLPNQRDDYDSPWKEAIELYFPEFMAWYFPNAYAAIDWSKPYHFL DQELRSILPEAENGKRIVDKLVQVHLLDGKERCLYIQIEVQGNRETDFPR RIFICNYRIFDKYGKPVASFVILTDSDSSWRPTAYSYEFAGSKMTLEFDM VKLLDFEPRMKELLASDNAFALVTAAHLLTQKTREKSLERLDAKSQLIRL LYNKQWTKERVRELFRVIDWFLELPKELEQQLRTEIYNIEEEQKMKYISS IERYAMEKGILEGMERGMVAGKEVGVLEGMERGLEEGLLKGRLEVAQRLV ASGMSKAEAASFAGVSVEML >Cag_0164 Crossover junction endodeoxyribonuclease RuvC MIVLGVDPGSLKTGYGVVQHHNGSFSVLAAGVIRLQAAWSHPERIGIICR ELEQVIAEFQPERVALETAFLSHNVQAALKLGQVRGAVIGLVVRYALPIY EYAPREVKSAITGKGAATKEQVAFMVSRMLSLHTVPKPHDVTDALGIALC DILRGESRQSGVPPRTNSRRKSGTGGSWEQFVRQSPNVVVRS >Cag_0848 Histone-like DNA-binding protein MGNTTTKIDLVTTIARNTGLTKYETEAVVNCLFESIIESLKAGRRIEIRG FGSFNIRQKNVRKARNPRTGEKVMVESKQVPSFKISREFKLAVSESLKSS EL >Cag_0095 Single-strand 
binding protein MAELKMPEINSVIIAGNLTKDPVFRQTNSGGTPVVNFSIACNRRFRDSNH QWQEDVCYVGIVAWNKLAESCRDNLRKSSAVLVDGELQSRTWKAQDGSSR TVVEIKARRIQFLNKKHKNGEDDVEGFIEDECPDQHHETLQDEDADYLYD CK >Cag_1402 DNA modification methylase-like MKFPDDYINTIICADSLTVMEQMPDKCIDIAVTSPPYNLKNSTGNGMKAN TKSGKWAGNALQNGYSHYNDNIPNDEYAEWQYNCLKAMYRLLKDDGAIFY NHKWRVQNGLIQDRTDIIRDLPVRQIIIWKRKGGINFNPGYFLPTYEVIY LIAKPSFKLLPKANAYGDVWEFTQEMKNNHPAPFPVALIDRIISSTSAQI ILDPFMGSGTTAVAALQLQRNYIGIDISPDYCEMAKERILNLNPAKRFIK KNGLETISLFEKIV >Cag_1148 hypothetical protein MRHQGASILLYNQQHEVLLVLRDNLPFIACPNTWDAPGGHLDAHETPLHC IVREMMEEMELDVSTCSHFKSYEFSNRTEHIFTMQTDVLNTATTPLHEGQ MIRWFTVADALQLSLASDMEVVLHDVGIWLEQQNNGTEDCGNV >Cag_1527 conserved hypothetical protein MNHEKYNRRSIRLKGYDYWQVGAYFITICTQNKECLFGKITDGKMVLNDA GNIIQEFNAITESHFKNIAISPFVVMPNHYHAIITVGAGSPRPNNPHNEN DHICDDGRVRVDDGRVRVDDGRVRVDDGRGNPAPTLGQIVGYFKYQTTKR INTICQTGGKKLWQRNYYDHIIRDEKSFHAISTYIINNPAQWAKDELYL >Cag_1415 DNA helicase II MSNFLHDLNEVQRSAVEATSGPVMVLAGAGSGKTRVITYRIAHLINNEGI APRNILALTFTNKAAGEMRERVDTLLHHGASRGLWIGTFHSIFARLLRNS IDRIGYDRNFSIFDADDSRSLIRQSMAELDISADAVPLNTLQSIISRAKN SFVMPAEFQRNANDYNQQKAAQVYSLYCKKLKENNALDFDDLLIKPLELF NAHPDVLHELQELFRYIMIDEYQDTNRVQYLVAKMLGARHRNIFVVGDDA QSIYSWRGADISNILNFQDDYHDAQTFKLVENYRSTGNILKAANSVICRN QRQIKKELVSHRHAGEPLTVMEAFNERNEAEKVADRIRTMRMSGTNDYRS FAIFYRTNAQSRVLEDIMRQQRIPYRLFGSVSFYKRKEIKDAVAYLRFIV NERDSESLLRIINFPPRKIGDVSIAKLRDFAEVRHISLYEAIHRAAEAGF PARLLNALASFTSVIEALREMATRGTVYDVLNELFTLTSIPLLLQAENTP ESLARHENLQELLSMARDFADHNPDGGSLGDFLENISLASDYDETQESDN YVSLMTVHASKGLEFPVVFITGLEERLFPLHTYEPEELEEERRLFYVAMT RAQEKIFLSYAKSRYQYGQLHQSIASTFISEIDASIVQSEGGRLLSDRRA PREATTPQNHAAAPAMRRPTTSGMAPSSASSPTAESSSPSISNGTLVHHP LFGQGVVLEVQGKGSKQKVRIAFRNAGEKTLMVQYANLKIQTS >Cag_1609 hypothetical protein MGKLKAHQKLPAEWNEREILRDKGQFWTPSWVAEAMVAYVTENTDLVFDP ATGRGAFYEGLLKLNKQNISFLGTDIDPDVLSDEIYNKENCFVENRDFIK DPPNRKFKAIVANPPYIRHHRIDEATKILLKKIAISITGNSIDGRAGYHI YFLIQALNLLEKDGKLAFIMPADTCEGKFAKNLWEWISEKFCIECVVTFD ERATPFPNVDTNAIIFLIKNTKPQQTLQWIRANQAYSDDLLQFVTSNFKL 
IEFDSLEITTRQLKEGLTTGLSRPEQNHNGFKFHLNDFANVMRGIATGSN EFFFLTSEQVKELNIPKDFLKRAVGRTKDASESVLSLKNIEDLDRENRPT YLLSINGQESFPKPISDYLKVGEEMGLPTRSLIQQRKPWYKMEQRKVPQI LFAYLGRRNTRFIKNEAGVLPLTGFLCVYPIYDDQEYIDNLWQALNHPDT LENLKLVGKSYGSGAIKVEPGNLNKLPIPEHIVANFNLKRPYKNAYEQLE IFREPKTKYGLKKRKTAGNKC >Cag_1969 Recombination protein O, RecO MYRIESRGAVYNTIQQYIKVHSVIVKTRAVVLRETNFRDQSRICSLYSRD FGRLSVIIKGARNPKNRLCGLFSAGNILDVVIYRKSGRELQLASDATLVA SPLMAEPDMERFAALYRIIDVVKQATAEHEHNPQLFTLLAATLQSLYQQG SNNLLLTAWFLLRLVSLLGFQPSLRQCVFSNHQLATEVVAMKLSELLFVM NPGGLALPAAGGISVGKQWRVPVALALQIAPLAEARTPADISLQVEDAEL ELLCAILYDYCAIHLEHTPKRRHLAITAQLAEA >Cag_0934 ATPase MSYQVIARKYRPAKFSDITAQEHVTRTIQNALRSGRIGHGYIFSGLRGVG KTTAARIFARALNCQKLIDDADYLQQVTEPCGECESCRDFDAGTSMNISE FDAASNNGVDDIRTLRENVRYGPQKGRYRVYIIDEVHMLSIAAFNAFLKT LEEPPPHAIFIFATTELHKIPPTISSRCQRFNFKRIPLEAIQQQLQQICE AEHIQVEADALQLVARKAQGSMRDAQSILDQVIAFSSENALEGSITYRGV ADLLNYIDDDTMFAVTDAVLANNPVAMLEVAHFVLKNGYDEQDFLEKLLE HLRNLLVVLNLSSTRLVERPDAVRERYQRDAAKFSPHTIMQMAELLLQTQ KELKFLFEYQFRFELALLKLLEIAHPPASAAALTIAPEKKKPLSNQ >Cag_0788 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFALPALTHSK >Cag_0223 Single-strand binding protein MPFLQLLSELDAMARSLNKVMLIGHLGTDPELRTTTSGQSVANFTLATNE NYKDSSGNLQERTEWHRIIAWGKLAEICNQYLKKGRQVYVEGRLQTRSWD DQKTGEKKYTTEIVCSDMQMLGSPREQMGGESTMQPYDQSTLPSQSSAPS VMPPATPTVPTMIDTDKDDLPF >Cag_1767 Holliday junction DNA helicase RuvB MRIELLNTPPDAAESRFEEQIRPIRMEDFAGQQRLTDNLKVFISAAKMRG DALDHVLLSGPPGLGKTTLAYIIASEMGSSIKATSGPLLDKAGNLAGLLT GLQKGDILFIDEIHRMPPMVEEYLYSAMEDFRIDIMLDSGPSARAVQLRI EPFTLVGATTRSGLLTSPLRARFGINSRFDYYEPELLTRIIIRASSILGI GIEPDAAAEIAGRSRGTPRIANRLLRRARDFAQVDGISTITRTIAMKTLE 
CLEIDEEGLDEMDKKIMDTIVNKFSGGPVGIASLAVSVGEERDTIEEVYE PYLIQAGYLARTTRGRVATRKAFSRFADHTLLGGNFGGHKGSLPLFDESE AD >Cag_0154 Excinuclease ABC, A subunit MSFSHISIRGARVHNLKNISLDIPRNQFVVITGLSGSGKSSLAFDTIYAE GQRRFMETLSPYARQYIGNIERPDVDFIEGLSPVIAIDQKSTSRSPRSTV GTITEIHDFIRLLYAKAGRRYNPETGAMVQAQSADNILATILALPEGSKV QILSPLVTGRKGHYRELFERLRSKGFLRVRVDGELQEMVPNMQLERYKSH TIELVVDRLVLAPESEARVREAVMLAISISEHKSSVICTPFEGGFTELAF TLSKGDNEDALPTSTLAPNHFSFNSPYGACPTCNGLGELMQLSGELMIPD PSLSLNQGGLDPFGKAGKRNHWQVIRAIAKEFDFTLDTPMSKIPKSALKI LLNGSGKRTFEVAYTSSGHTSLYPQPFQGAVAYVQEILNNATTSKVREWA EAYMLHQPCPVCLGARLKPESLQVKIHGLNIAELEALPLPETLAFFNNLP PNLSQKELIIATPVLHEITKRLQFLLDVGLGYLSLDRSSHTLSGGEAQRI RLASQLGSQLSGVLYVLDEPSIGLHQRDNHKLITSLKHLRDLGNTVLVVE HDKDTMLEADTIVDLGPGAGAYGGEIVAFGAARELDPSSLTAGYLNGTNR VFYASEASSEKTDADADATPLFLTLKGCKGNNLKNIDAQIPLRKLVSITG VSGSGKSTLINETLYPILARHFYRSKVVTAPFDAIEGIELLDKVVNVDQS PIGRTPRSNPATYTGAFTFIRDFFTRLPEAQIRGYKAGRFSFNVKGGRCE VCQGAGTRKIEMNFLPDVYVQCENCKGERYNRETLMVKYRGKSIADVLEM SITEAAEFFTDFPRIRRILNTMQSVGLGYLKLGQPSPMLSGGEAQRIKLS AELAKIQTGKTLYILDEPTTGLHFQDTQHLLEVLRKLVEKGNSVIIIEHN LDIIKNSDWVIDLGAEGGFEGGTIIAEGTPQQIADTPHSHTGRFLKMEMG G >Cag_0275 NAD-dependent DNA ligase MTIIDASERIAQLRQEIERHNYLYFNEAKPELSDYEFDKLLEELMALERE FPDLLTPDSPSQRVGGTITKEFPVVTHREPMLSLANTYSAGEVAEFYNRV AKLLAAENVHKQEMVAELKFDGVAISLLYRDGVLVRGATRGDGVQGDDIT PNIRTIASIPLRLHQPLAGEVEVRGEIFMRKEDFEQLNDNRPEEERFANP RNATAGTLKLQDSAEVARRRMNFVAYYLKGLKDETLDHVSRLHKLEALGF TTGGHYRRCKTIEEINTFINEWEEKRWKLPYETDGVVLKLNNVQLWEQLG ATAKSPRWAIAYKYPAQQARTQLCNVVFQVGRIGTITPVAELTPVLLAGS TVSRSTLHNFDEIERLSVMILDYVMIEKSGEVIPKVVRTLPDERPADAHA IAIPTHCPECDTPLIKPENEVSWYCPNEEHCPAQIRGRLLHFASRNAMDI KGLGDALVEQLVAWGLVHDVGDLYLLQEPQLERMERMGKKSAQNLIRALD ESRTRSYDRLLYALGIRHVGRATARELAHAFPTLDALMQANEERLAEVPD IGTTVAQSIVDFFAKPSSRQLVDKLREARLQLAASASKIEQVNRNFEGMS LLFTGTLERYTRQQAAELVVERGGRVVESISKKTSLLVAGRDGGSKLDKA HKLGVRVISEDEFMGMM >Cag_0832 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR 
DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_1438 conserved hypothetical protein MQILAGQFRGQKIGRSASAAVRPCSSRVKKSLFDTLAVRMDLEDAHVLDI FAGFGSLGFEALSRGAASVTFVDRFHESLKALKSTAAKLGVTNKVSIVNA DALAFLGRTTNQFDLLFCDPPYAWADYHALLELIFRRSLLAEDGLMLMEH STQHNFSHTPEYLFHKDYGMTRVSFFQPPPLNQP >Cag_1694 HRDC MQIKLFTIPISDSGAPEEELNAFLKTHKIVSVDSELANNKDGAWWCFCVR YLEQAMNALPERKVKVDYRQVLDDVTFQKFVKLREIRKRVASEEGLSAFI VFTDEELAELAKLDEISVKSMLSIKGIGEKKIERFARYFITTPESDEAQG EIG >Cag_0416 Endonuclease III/Nth MNPQEKIIALHDLLSKQFPNPKSELEYLSPFQLLIATILAAQATDKQVNV ITRELFKRAPDAITMSRMELEEITGYVRTINYFNNKAKNILEVSRRLVEH FGGEVPQEREALESLPGVGRKTANVVLANAFGMPVMAVDTHVHRVSNRIG LVSTKKVEATEEALMAIIPEAWVADFHHYLLLHGRYTCKAKKPACPTCTV AHICDFAE >Cag_0699 DNA mismatch repair protein MutS-like MNPSTLKKLEFTKIAAYAAQLCLSPMGRDRLLNARPLREREALMAELERV LELRMLLQEGLTLPFSHLPDTRVLLKKLEIEHLALEPLELLDLYHLLYSS VQLRRFMYGNRERYGRLNDLTIMLWMERSLQAMIQRCVDERGLVRDSASD GLLLIRHDLAESRELLRRRMERLLRRASANGWLMEETVAVKNGRLTLALK VEYKYKIPGYIQDYSGTGQTVFIEPAETLETSNRIQDLEISERREVERIL QEVSAALRGELENIHHNQQLMAEFDALYARARFAVETNAVLPTVTEGNEL RLIKAYHPWLLLSHRERTVQPLDLHLSAEEQVLVISGPNAGGKSVTMKSV GLLCCMLVHGYLLPCSESSCIPLFNNIFIEIGDDQSIEHDLSTFSSHLSA IRSILERAGTRDLVLIDELCGGTDVEEGGAIARAVIEELLASVAKSIVTT HLGDLKAYAHQRDGVVNGAMAFDRAELQPTFRFIKGLPGNSFAFAMMQRM GFSPALVERARHFMAHERIGLEQMVDDLSHIMEEQQRQRQQLDDEQRTFA ERERTVLEVEATLKQQQRELKQQISRAVQKEVEHARKEIRAIVQEVKAAP TNPQVVQAAREKLGIKRQEVEERHTTAAPTTASEPTIDRTITIGDMVRLL DTNATGEVERFNGDNVVVRCGTIRLQTHLKNLEKSSKTKARTAQRDTSNS KVRSWSTVTNEVSSTQLDVRGMSGNEAVPHIERFLDTLRLHRIHFATILH GKGTGSLRKRTAECLKLHTAVKSFRLGGLGEGGDGVTIVELGE >Cag_0748 transposase MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HNLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEYDSRIWRIIHHYLDEVLEQQ 
DLSSVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPEAHITFDKFHIVQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGGKF EFALPALTHSK >Cag_0809 probable transposase MAGTYSQIYIQYVFAVKGRENLLQKPWRDDVFKYIAGIIKGKNQKPIIVN GIEDHIHVFVGLKPSMSIADLVRDIKNNSSNFINEQKFLPRKFAWQEGYG VFSYAHSQIEYVYQYISKQEEHHKTKTFKNEYLEFLQKYEISYDEKYLFE WLD >Cag_1063 transposase MKDTVLFQQALCLPMPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLNFFQHECYLTARVPRISCPTCGVKAITDLPWARR DSGFTLLFEAMIIALVPSMPCKTIANYVGEHDSRIWRIIHYYLDEALEQQ DLSAVTKVGLDETASKRGHNYVTSFVDLESSKVLFVTEGKDATTVEKFHK HLLAHKGKAENIKEICCDMSPAFIKGVTTNFPETHITFDKFHIIQVLTKA VDEVRREEQKERPELAKSRYLWLKNQVHLNQSQQVKLEKLQLKKLNLKTA RAYQVKLNFQEFFKQAPAYAQSFLNQWYYSASHSRLEPIKEAARTIKRHW YGILRWFTSNITNGKLEGLNSMIQAAKARARGYRTTNNLIAMIYFIGSKF EFTLPALTHSK >Cag_0001 chromosomal replication initiator protein, DnaA MHDHSPILVTDPHSLKGQKQSSMEQQVWDTCLAVIKESINPLAFKTWFLP IRPLGFVGGELTIEVPSQFFYEWIEENYSLLLKQTLRDVIGSEARLMYSI VMDKSQGQPVTIELPQQTTSPFTYEQAPLKVDRIEEQRHESYERNVSRFE SHLNTKYIFDTLIRGDCNSLAFAAAKAVSQNPGQNAFNPLVIYGGVGLGK THMMQAVGNSVRENRLTDRVLYVSSEKFAIDFVNAIQNGKIQEFSSFYRS IDVLIIDDIQFFSGKEKTQEEIFHIFNTLHQSNKQIILSADRPIKDIKGI EDRLISRFNWGLSADIQPPDYETRKAIILSKLQHNGVTLDDAVIEFIATN VTENVRELEGCIVKLLAAQSLDNRDIDLAFTKSTLKDIIRHTTKQLTLDT IEKGVSSYFSITSNDLKGKSKKKEIAVGRQIAMYLAKMLTDSSLKTIGLH FGGRDHSTVIHAVSTISKRVEQISEERKRIEEIKKRIEILSM >Cag_0328 ATP-dependent DNA helicase RecG MDGTTSVAFLKGVGSRKAVVLGEVGIVTVDDLLAYYPRRYLDRRSIKRVR ALVDGELTTVVGTIVRTQLEQPTSGKARFKAWLDDGSGLLELTWFRSVRY FSRFFTKGESLAVHGKVSFFGNQAQMQHPDYDRLTPENAVGGEKGSDDFA LFNTGAIIPLYHTTEAMKQAGLASRQLRVLIKRALEEVPFREQENLPLSI IRQYGLIPQWEAEREIHLPSSPEKLEQARYRLKWTELFYAQLLFALRRST LRRNRAAVRFTHSGELTRKLHESLPYQLTEGQKQAVRDIYRDLRSGSPMN RLLQGDVGAGKTMVAMFAMALAVDNGLQAMVMAPTEILAVQHALVMKRFF APLGIELGLLTGKQGKKERRATLEKLRTGDMQLVVGTHALLEPDVQYANP GLVIIDEQHRFGVLQRKALQEKAANPHVLLMTATPIPRTLSMGMFGDLDL 
SIIRDKPVGRQPIKTVLKKEQDKPSVYHFVREQIAAGRQGYIIYPLVEES EKMDLKAAVESYEELSTAIFPDLSIGLIHGQMSPDEKEHVMERFRQREFS ILVGTTVIEVGVDVPNATVMIIEHAERFGLAQLHQLRGRVGRGEHPSTCI LLTAKMTADARERLLAMVSTNDGFVLSELDAKIRGVGNLLGKEQSGTLSG LRIADLNTDEAIMAAARQAAFTLVEADAQLRATEHRMVREHYMRYYHERF SLADIG >Cag_0003 RecF protein MKLQRTIFSGFRNHTSLLFEPSEGVTIIYGANGSGKTSLLEGIHYGALTK GLLGAPDSECLSFDTEAFTLDSHFLSDSNIPIHVLVTYQLEGEKQVIVDR QEVKPFSSHIGRIPTITFSPYEISLVSGPPAERRRFLDSAISQLDHRYLD RLITYRRILQQRNALLAQLSSGEKSNRNTLPLWTTQLAELSAWLVERRLL FLTSFSPYFQHYYRYIIKGEEPSINYRCTSCPLHGNTTFQELYQLFLQRY SDIEAQEIQRGQTLFGAHRDDVLFFLNEKEIKRYASQGQLRSFLIALKIS QAHLFADHLHEQPMCLFDDLFSELDGGRIEQILALLKECGQTIITAVEPR YTEGITLCDIQALR >Cag_1118 Tyrosine recombinase XerD MSTLSSSYQTTLNSFLNYLIVERNFSANTRSSYHNDLHRYLLFVQEQATP IAEITSKVIDRFLAELVALGLETTSMARNISTIRSFHKFLHNERLSSNNP AERLHLPKKAHYLPAVLNLSETLALLEAPSIMQPAPTYALRDRAMLELLY ATGVRATELISIQQEHLYSDAGFIRIFGKGSKERLVPIGASATLWVQRYQ KELRVQLVKAHSNDFLFLNSRGGKLSRMSLFEMVKTYSVVAGITKSISPH TLRHTFATHLIEGGADLRAVQEMLGHSSIVTTQIYTHLDRSFIKEVHKTF HPRG >Cag_1006 Protein of unknown function DUF83 MYPESDFIAISALQHFAFCPRQCALIHLEQIWSENMYTAEGRELHERVDE GKTSYKSGVRITRSEPLRNATLGIAGVADVIEWHKQPNGKELPFPVEYKR GKPKKHNADKIQLCAQALCLEEMLGIHIPSGALFYGETMHRLEVEFTPPL REQTRGAAEGIHELFERGLTPPPDYSAKCKQCSLLEVCQPNLLAQHNTAR NYLASLVQTLSAEDA