Gene list
Applied filters:
COG category: General function prediction only
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS
Number of genes found: 262
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Chlorobium chlorochromatii CaD3, CaD3 >Cag_1709 conserved hypothetical protein MNGYRKRVVDDELNELIAALPAIALEGAKGVGKTATAEGRCRTIFRFDDP AQRAIAEADMGVVLNQDTPLLIDEWQRVPSVWDAVRRAVDRDQTSGRFLL TGSASPTTPPTHSGAGRIVTLRMRPMSLAERGVGVPTVSLRELLFGHRPE IVGKTNIALADYVREIVHSGFPGIRPLSGRALRLQLDGYLRRIIDTDFPE QGYMVRRPEVLRRWLAAYAAATATTASLETIRDAAFGGDKEKPSKTTTQP YREILERLWIVDPLPAWLPSRNHLKRLAQPPKHHLADPALAVRMLGLDES ALLAGEESMLSIPRDGSLLGHLFESLVTLGIRVFAQAAEAHVSHLRLHGG RQEVDLIVERGDQRVIAIEVKLSSTIKESDVRHLLWLREQLGDELLDMLV IHTGTQAYRRPDGIAVIPAALLGA >Cag_1497 chlorobiumquinone synthase BchC related protein MPLTAQAIVLQKASKLKMVTASFEVPTVNGLLVQTIASTITPGYDRLLIT NKPVTNKVFKYPIMPGSEAIGQVLEVGSGVSDIAVGDFVFVFAAQGWQGI EAYAGCHANIIPTTRDGVLPLGRLPIHRDLLTGLLAYAMSGIEKIPLNPS QRVLVLGLGSVGLMVTEYLHHLGYQHVDGVETFGLRGQLSRAENIALDIA DFTEAFNNCYDIVVETTGRILMLEKAIRLMKPHAKVLLMGNYEVMACDYR LIQHKEPHLISSNITTMEHHRKASELLESGLLDTEKFFTGVYPVEQFELA YRHALDDKPSIKTVLTWL >Cag_0239 peptidase, M16 family MALTTSTTVHLATLPNGITVITDSVPYVESITLGIQINAGSRDDPAHAAG LAHFMEHALFKGTRTRSYLDIARSVEQHGGYLDAYTTKEQTCVYLRCLAA HLEPSFELLADLVSNPTFPPEEMEKEKEVVLEEISSINDTPEELIFEEFD QRSFPNHPIGNPILGTEKSVEAFSQNDLHLFLQQHYIPQKMVVTATGNVS HHAIMQLCERFLNHLANPAESTETRQPLSVATYKPFSLTLKKRIYQAQIV MGTAIERNDRHFYSLMVLNTLLGSGMSSLLNLELREKRGLAYNVYSSLAF FDDLTALNIYAGTDGNKVATTLTLIKELLQSDALHHPIHEELQAAKTKLL GSHIMGMEKMTRRMSNTASDYVYFRRHISPDEKSAAIEAVTASDVTEAAE LLLRQATYSTLVYKPSRQG >Cag_0608 hypothetical protein MAYLRSGISLIIAGVSIMHFSHQFWYWIIGIACIPTGIVTGFFGVWRYIT ISKSITIVRRELPLADQREAEQMK >Cag_0151 conserved hypothetical protein MRLLPAEREIIRTLATRIFGDGTRVLLFGSRVDDSVKGGDIDLYVQSPDA EQALTKKREFVVALKLALGDQKIDVVISSNPSRFIEQEALKHGVAL >Cag_1617 conserved hypothetical protein MKIVQFVAMLLLFAGVNGAVVYRLWRLMPPLRWFRGSVTLLLLAVIAAPF VVMAWGNALPLPLVSLLYMVGLSWLILLIYLILLFLLIDGLSLLGFFGVQ RLKSLKLWTHESWVGTVGVVVLMAVLAVYGNYNYHQKERVELTMRVEKAM AQPLRIVVVSDLHLGYSIGREELERWVVLINREEPDVVLLVGDVIDTSLR PLEVERMAEVLRRLSSRYGVYAVAGNHEHYATLAKSAPFFSDAGIRLLRD EVLLIDNRCYFVGRDDYMNKQRKPLSVLLSGVDVAKPIVLLDHQPRALGE ARAAGADIHFAGHTHRGQIWPISLLVEQMYEQAYGYRRFGAMQSYVSSGL GIWGGKFRIGTKSEYVVVTLQGR >Cag_0985 putative plasmid maintenance system antidote protein, XRE family MMILMYLKNKGGARNSCLIFLTFHYAEAFGTSPEFWLNLQATYDLSLHKP TKHIQPLVAVSAQHEIQERSCIVGLFLLCVLG >Cag_0115 CrtF-related protein MMNTNELLNYNHRANELVFKGLVEFGCIKASLELDLFTHLAGEAKDTETI AANVGAIPQRLVILLETLAQIGVLAKNDGKWSLTPFAATMFLPNNELPNL YMMPVTKAMAHLSENFYLKIADAVRGNHIFKAEVPYPPMTREDNWYFEEI HRSNAHFSILLLLEEANLSNVKTLVDVGGGIGDISAALLQKYPQMDSTIL NLPGAVELVNENATEKGLADRLRGSVVDIYKEEYPKADAVMFCRILYSAN EQITSMMCKKALDALQEGGKVLILDMIIDEPEDPNFDYLSHYILGAGMPF SVLGFKQQERYKELLEAIGFRDVRIVRKYGHLLCEAVK >Cag_0693 sepiapterin reductase MQHIILITGAGKGIGRAIALDFAKATSPTFQPVLVLVSRTLSELESLAAE CHALGAETHLCAADISNLQQIDAMVNDVVARYGTIHCLINNAGVGRFKPF AELTPDDYEFVMDTNLKGTFFLTQKVFPFMEKQQLGHIFFITSVAAETAF STSALYCMSKFGQKGFVEVMRLYARKCNVRITNVLPGAVLTPMWGEVPDA MQRVMMQPEDISQPIVQAYLLPQRTSIEELVIRPVAGDINE >Cag_0632 TPR repeat MVQSSNEGGGVSATAPYYTSYKQALAYVDEQRYEEALQLFDHCIAIERRH AALLYGRAVTLLALGTYRQACCDLFKSLALDKAQPEAWKHLAYLLFMLGK DEPAEKTLKKALERFPDYAPLYCVLADIYLDLGEFDKAHEAIEQALRLDP QNPEPHSKLAMYYVARGNMEGLQQECKTLEQLDAALAEQIRTLFFENQ >Cag_1073 Phosphatase kdsC MLTLSQNELVRRAQAIRLVIADNDGVFTDTGVYYSERGEELKRYSIRDGM GVERLRNAGIETCIMTGERSPNVQKRAEKLCMKWLYLGVKDKRSMLATLL AETGMERHELAYIGDDVNDCGIMEEIAPFGLVAAPRDATRFVEPYLHYRA TADGGHGGFRDIAEWLLELKNS >Cag_0098 YgfB and YecA MNAPDPMMQPLTLQEFTILEEFLVSERTPEEALSSLEMLDGYMTAAIIGP QAFEPKDWYALMWDKNKQLEPQFSSADEADMISELIVRHNNSIEAVFLED PESFVPLFDRVAYENEEIHKLAVEEWCMGFLIGMELAYEAWQPLFDNEDA AVMTMGFFMLSKVSDEFAHMTEREIEEITSTVGDAVIGIYLYWHGDDEMD EEDDDELFRE >Cag_0398 conserved hypothetical protein MLLMKRYQWLGQLVATTLLWLLLYNNLELGADQLLRLIGLTRAVPFGEAL HFFVYEVPKVLLLLTGVVFVMGVIHTFISPERTRALLSGRRTGVGNVMAA TLGIVTPFCSCSAVPLFIGFLQAGVPLGVTFSFLISAPMINEVALALLFG MFGWQVALLYMGMGLAIAIVAGLIIGKLGMERYLEEWVQQLQNSGMADEN NEDNAMEVPERLAYGWKHVQEIVGKVWFYIVLGVGLGAGIHGYVPENFMA SLMGNNVWWSVPIAVLLGVPMYSNAAGILPVIQALLGKGAALGTVMAFMM SVIALSAPEMIILRKVLKPQLIAVFASVVALGIIIVGYVFNAVL >Cag_0437 conserved hypothetical protein MTILDRYILKKQIAPFFFAFITIVALLQLQFFSTFAERFIGKGITFVAIV ELLALQSAWMVSFALPMAVLVAVVMSFGTLTTTSEMTVCRASGISLYRVM VPVIVVSLLLSFTVERFNNVLLPQANYQAKSLMAEIARSKPAFGLTEQAF STLVDGYSMYVRSSDERHGELRGVVIHDMTRPEYRTTITATRGRVEFTPD YQYLVMTLRNGAIHQLQQPEKSGYRSMNFERYRFVFESSLSGFTPSSGNR MRADANELSAGELHAIGLEFRRREAVALLHVQAPLVALERLAANTDNSKM AASPPTLRQETSAIAATKALAVIEGEIARVASELEVASTNRTLYNRYMAA YHKKYSLSLACVVFVLVGAPLGVLARRGGFGVGAAISLLFFVLYWMLMIS GEKMAERGVLDPMIAMWMADGVMALIGVGLVTKLTQALFSTSR >Cag_0723 oxidoreductase, Gfo/Idh/MocA family MKIGVIGVGKLGEFHTKLLTELAHERTDLHVAGIFDLNTQRAEEMAQKYN VPRFNSVEELAKTCDAAVLATTTSTHFALASALLNEGLHLFIEKPITTTV EEADELIRLEAENNVRIQVGHIERFNPALRTVEQWIGRPMYIQAERLSGF SLRVTDVSVVLDLMIHDIDLVLSLIQSDIKHIAASGVKVFSNELDMATAR IDFVNGATANVTASRLSRSKMRKLRFFCTEPKSYASLDLTSGKSEIYRLV PPDMASSKNPLKSFAARKILEQFGEIQESLNGKVLDYIHPEVPKVNALRD ELEYFINAVRDNAPTVVSALDGRRALFVAGKITDEINASTALLHD >Cag_1954 iron-sulfur cluster-binding protein, GltD family MKVESNPILDFAINYQFPPFEELTGTHKIVAFGDHSHKCPVYVRQTPPCQ AECPAGENIRGYHRFLNGIDKSEDEWKSAWETLVEINPFPAVMGRICPHP CQSACNRQYHDESVAINAVEQAIGNYGIQAGLQLPEPAPATGKRVAVIGG GPAGLSCAYQLRRRGHAVTLYDANEKLGGMVLYGIMGYRVDRKVLEAEIQ RIINLGIETKMGVRVGSDVTLDELEQEFDAVFIGIGAQAGRSLPVAGAAE TQGVTNAIEFLRSYEVEGDNITIGKKVLVIGDGNVAMDVARLALRLGSEA AVVAGVPREEMACFKEEFDDADHEGAVMHFMSGALELLKNDDGSVRGLRC AKMVKKAKGEEGWNSPIPFFRYKNSDETFDIEADTVVAAIGQTTNMQGFE AITNGAPWLKVDRSFRIPGREKLFGGGDALKVDLITTAVGHGRKAAEAID AFLKGEPMPDQGYREVTKVSRQDVLYFPVTPPAKRDTIKIQEVVGNHDEL LVALTPEQAKAESGRCMSCGLCFDCKQCVSFCPQEAISRFRDNPVGEVVY TNYDKCVGCHLCSLVCPSGYIQMGMGDGL >Cag_0951 conserved hypothetical protein MPQSQQLTNEQQAALQQALIYRFLGLVFAYPNDAFLPTLQNALQKISDNA ARFQPLLDAFAAEPQEQLQAEYTRLFLNGYPNTPCPPYESVYREERMMGE SSLAVQKLYQQWEIAIDANLSDHLATELEFLAFLSAATTLTEVATDALAT REYFLEEHVRQWLPQFCRDLQKEATVEAYRLLSQLLANVLLH >Cag_0725 drug resistance protein, putative MKRSPLFILLLTVLLDLIGFGIVLPLLPTYTKSLGANPFMIGLIAAIFSI MQFIFSPLWGKLSDKIGRRPVMLSSILLTSLSYLMFAQATTLPLLILARA LAGVGSANVSAAQAYITDVTDAKGRSGAMGMMGAAFGIGFIVGPLLGGVL MHNYGIAVVGYVASLLIAIDFILAIFFLPESNKAAIPFTHLLKGNENKGT RHGKKQSLGTTLATKVQEYNEHFRTTFSSRPLALLMVANFIYTLAIVNMQ TASILLWKEYFKASDEQIGYLFAYVGIWSVVVQGGLIGKLTKKVGEHNIF LWGHLFTFFGVFFMPFLPSYSLFSMGLTVLFFFAIGTSLVAPINLSMISL YSYNQQQGQIMGLSQSVNAFARILGPFSGSILYGMSFHAPYIVAGILTLV GAVIALRLFRYRIEAHD >Cag_0981 conserved hypothetical protein MNFIIKQATVSNRKDFLKILQCWNMQNGFLHDETELDYSNFFIAEVNNQV VGMAGFMPIDGERYRTRLLAVYPEFRGTEIGKALQDRRLEEMYKRGAKIV ETSVDNLEMKHWYKKHYGYTEIYKTKKEYEISFIDVDVVDVLYLNLIEYM KNKIAFDSKKLRYMEKYEPHPLSPYPPLIINVALTGVIPTKTLTKYIPIS VNEIIEEAINVYDAGASIVHLHAKDENGKACSDAKYYEKIISGIKKERPE LICCATTSGRDGQSVEQRAEVLSLTGNAKPDMASLTLGSLNFLSGASINS IDTVTELAYIMKEKGIKPELEIFDTGMVNLAQYLERHNIINGKKYFNILL GNLNTAGATIKDLSHIYTSLPDNSIWAAAGLGHFQLPMNMASIVAGGHVR VGLEDNIYYDLNKTKLATNITLVNRIKKIANELERPISTAQKTREILGI >Cag_0293 putative cytoplasmic protein MSKNKTSHVSTNKEPANIRSSAAEYLTYIAAIGERATSVEMRYEAENIWL TQKMMATLYDVSVPAINQHLKRIFDDNELTREATVKQYLIVQTEGNRQVE RMVDHYNLQAIIAVSFKIENERAVQFRKWANQIVKDYTIQGWVMDVERLK HGGTILNNEFFERQLEQIREIRLSERKFYQKITDIYATALDYDPSATASK RFFAAVQNKIHYAIHGLTAAEVIVNRADHRKNNMGLTHWEGAPSGKIHKY DVSIAKNYLSEFEIAQMERIVSAYLDMAELQTMRKIPMTMEDWEKRLAGF LTLWDREILQDAGKVSAELAKVHAESEFEKYRIIQDRLYESDFDRLLKQI EHCNTTQAEKQ >Cag_0857 conserved hypothetical protein MSGIKYLLDTNIILGLLKATSTVLEAIGFRSIQAAECGYSAITRISNCYV LFVS >Cag_1651 Small GTP-binding protein domain MKPLLALVGRPNVGKSTLFNRILRQRSAIVDPTPGVTRDRHIAEGEWQGK QFKLMDTGGYNTDGDVLSKAMLEQTLHALADADSILFITDARAGLSYEDL ELARILQRSFQHKQLFFVVNKVESPQLVIEAESFIKTGFTTPYFVSAKDG SGVADLLDDVLEALPEAPEGEVKGDTAVHLAIVGRPNVGKSSFVNALLGT NRHIVSNIPGTTRDAIDSRLMRNQQEYLLIDTAGLRKRTKIDAGIEYYSS LRSERAIERCEVAIVMLDAEQGIEKQDLKIINMAIERKKGVLLLVNKWDL IEKDSKTSIRYEEQLRMAMGNLSYVPVLFVSAMTKKNLYRALDTALQISR NRSQNVSTSQLNKFLEQTLAQVHPATKSGRELKIKYMTQLKSAWPVFGFF CNDPLLVQSNFRKFLENKLREAYNFEGVPISLRFLHKNKVKED >Cag_1931 Hit family protein MSHNQEQDCLFCRIVRGEIPATIVYRNEHVVAFKDISPTAPHHVLIIPVQ HVASLNALSPEHEAVAGQLLLAAAPVAEALGIKESGYRFVINTGADAMQT VFHIHAHLIGGQAMGWPPFPV >Cag_0424 drug:proton antiporter MALASSPIVSFTFLRLLSFRFLIVLSYQMLAIVAGWHIYELTHNALALGF IGLAEVIPYFASALFAGHAVDHYSRRLFGVMASVMVMASALMLTAVSAGM VVGNPVWWLYGAIAFNGLARAFISPSYSAMFALVLPREAYAKASGIGSSV FQLGLVTGPALGGLLAGWFGNTAGYAVAAVLAFGAALALFSVRVKEPPSA ESMPIFASIASGIRFVFGNQIILGAQSLDMFAVLFGGAVALLPAFIKDVF HFGPEAFGLLRAMPAIGAVITGLYLARHPLNHHAGRWLLGAVAGFGVCII GFALSTTIWMAGLLLLLSGICDGVSVVMRTAIMQLLTPDDMRGRVAAING IFIGSSNELGAFESGVAAHVMGLVPSVIFGGFMTLGVVAVTAKLAPKLRR LDLQQLY >Cag_1887 von Willebrand factor, type A MCFTHPEKLLLLLLLVPIAGLLIGRFIKEKRLRQALMANKMAATMMPLQH LLPYAFRSLMLFVASGLLLLALAEPRWCGGTKPVLRHGADVLFILDVSRS MQATDVAPNRLMRAKQEIAAISQNVQGGRRGLLIFAASPLLHCPLTTDRD GFATLLNMAAPELIEEQGTRLQPAFALASTIFDVANESNAASTRGVQVIV LLSDGEDHDSNVQRAAQQLAKQSVQLFVIGVGSLKPSPIPLADGSFKRDA SGQVVMSRFRPQMLQAFARQAKGLYRHSHAEVWASADVVNRINRYAADSR IVMEPATNDSSLLRLMVGIAVALLFMETLLRKSS >Cag_0290 conserved hypothetical protein MKSPFPLFFPLLQKKRDCSKQAQCSKRRLIMALLAVSALAPANLYAASKA KRQPPLVAHVYPDTLALPYNRYALKPFMRPARKSVAVALSGGGANALAQI GVLKAFEEAHIPVDAIAGTSMGAIIGGLYSCGYSAAELEQLALTMPWSSI LALQEDYSRSSLFVEQQRIRDRATIALRFDGLKLLLPQSLNSAQAFTRTM DMLVLHALYHPHSNFSSLPIAFRAVTTDLVSGERVTLESGSLSEAMRASS TVPILFEPIHRAEQQLVDGGLVANLPVDELAHFGADCKIAIDTHGSMYAT GKELDLPWKAADQAMTILITLQYPAQRAQASLVIEPETGKHKATDFKNIP QLIAAGYVAGKQQVPTLQRLLAITSPSNSSAPQTSSVPPSSIVPSVATPP PISPILTANKKEMRNFSLATYTKRWSISPTSTELERLVGEKVASALELHA LLRDLLATDYFARVSAEVHQEDRTVTVKLEALPSVTVVTVQGELADELSS AELNECFAPLMGRLYTNHQATAALEALVRRLRAKGYSLAAIEQVHVENER LTITFSSGKAAMLTISLNKGRTLLTPIQRELKLDATKPLRLRAAEESVKN LYETGVFNRVSLFAEPITQTEAIAPISSTTPNQTIHLSLEEKPASVLRLG LRYDETNNAQLLLDVRNENVGGTTNTMGGWVKAGRKGYLANMELNMPRIG ATHLIFATRLFFDSYLFDYTNSDGSLAPYNIQKYGITSSFGTRLRKNGHF LTDVSYYNSQAFTDEAHRPLFSTTNNNVLTIGTHLTIDSRNNALMPTRGS YSYLTYAFTPLSLDDGLRYWQFSGTHQVNLPLGRETTLQLSAMTGVSSKA LPLSEQYFLGGIGNSYSARFIGLQPHALATNNVATAGVQLSYEPSFPILF PTTLQLHYNAGRGWNAMENVRLDGALQGALQAVGASMVWKTPLGPTRFTL AKVLVNNDDNSLMLPHRDDDPVFYFSIGHDF >Cag_0126 CBS MDQLIKLRTLPVSALMQKDFHVIKGSCTVAEALQLMKQTRESGLIVEPRN EDDCYGMVTEKDILEKVIDPGEDVHRDPWNTPVFQVMSKPVISVNPSLRI KYALRMMKRTNVRRLTVMEGNKVIGVLNMADVLHAVEELPVHDEHVAL >Cag_0350 oxidoreductase, short-chain dehydrogenase/reductase family MKVALITGASMGIGEAFARSLAEQGRTMVLVARSVDALHRLATELEQRYR VAVYVMPADLSQHESATAVYNYCRQQQLEVELLINCAGFSVAGAFADIPP ERIAEMVQVNSTSLALLTRHFLPNMLQRKSGTIINVASLGGLQGVPGMGL YSATKSFVITFSEALAHEVRPYGIKVVALCPGFIATGLMESAGQNTKAIR LPISQTDVVVKAMQRAFVTRYVRLYPTWLDSLLAFSQRLVSRSLAVRLAA FFAGVLKKG >Cag_1975 putative 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase MCAASSFTLNHYKPLSGFQFFGNDPSQADPSSRIDQALKGIQALLFPVDG ILNGSKITFDHSGNELCTISVRDAIAIKEAVKLGLRIGVLSSRNAEGYRP MLEALGVQDLYLNGEHVFYSYDAFRHRHSLSNEECAYIGDDIGDIDVLAK VGLPATSIDGADYLRNRVAYISGFEGGKGCIRELVEEILTRQGKWPYIER PDEEDETAEE >Cag_0927 Beta-phosphoglucomutase hydrolase MSSSFKGAIFDLDGVITGTAKVHSLAWESMFNSFLQNYAEANNEPFVPFD PIHDYHKYVDGKPRMEGVKSFLFSRDIELPFGELDDNPENETICGLGNRK NSLFTEILEKEGPEVFSSSIELIEQLIERGIKIGIASSSRNCQLILRLAN LEYLFETRVDGEVSIHLGLKGKPNPDIFVVAAKNLGLEPHECVVVEDAIS GVQAGARGNFGMVLGIAREIEGARLIEQGADIVVNDLGEITPEDIEEWFT KGLEFEGWNLTYTEFSPKDEKLRETLTATGNGYLGVRGAYEGSKSSHNHY PGTYIAGIFNRVPSLVHGQTIYNNDFVNTPNWLPIEFRIGGGAFIDPFRQ KILSYRQNLDLRSGLMERDLVVQDNLGRITSITSSRFASMANPHQCAVKF TLKPVNYSADIEFRCSIDGTVQNRNVARYSELTSDHLEHVEAQHNGATML LHVRTSVSKYEIVTAAKTRIIMHGKEVTAERQPLHSNRFIGEQFTLSLGP SKGCTIEKMVSIYTSLDQNSTSPLTAAKTSLQDCSSFDELLTPHVAAWEA LWEKANLQIEGDRFSQKVLRLHTYHMLCTASPHNASIDAGMPARGLNGES YRGHIFWDEIFILPFFNRHFPDISKSLLLYRYHRLDAAREYARENGYKGA MFPWQTADDGKEDTQSVHFNPKSGSWGPDHSCLQRHVSIAVFYNTWRYIY DSDDTTFLNEYGAEMMFEIARFWASIATLNPATGRYHIEGVMGPDEFHES VPGSKKEGLKDNAYTNVMCVWLFEKATEIAAKLSPEALERLQKTIGYTPE EAEEWHKIGHHLNVLIDHDGIMEQFDGYKSLKELDWKHYRTKYGNIHRMD RILKAEDDTPDNYKVAKQPDVLMMFYTLSPGEVAELLTKIGYMVPDALTL VRNNYAYYEPRTSHGSTLSKVVHSIISSYLHDGRDMAWSWFLDALKSDIN DTQGGTTHEGIHCGVMAGTLDTVARYFAGIAFYNEKLNVHPNLPDQWKKL SLTVCFRANRYALTIEKAAITVTLLESDSNEAPACIAGHHLTLQKGVPYN SALHA >Cag_0895 hypothetical protein MMQSIIRNATLSDLSRCAELLAILFSQEKEFAPNATAQLHALTMIAESPA SGQIFVAEVNGKVEGMVLLLFTISTYLGKRVALLEDMIVTPEWRGKGIGS QLLRHALAYAQSNGMGRVTLLTDLDNEAGHQFYKAHGFVRSSMVPFRYVW E >Cag_0185 Glutamate synthase, NADH/NADPH, small subunit 1 MGKIKGFMEYKRALPADRQPLERIKDWQEFHEKMAPEALCEQGARCMDCG TPYCHSGIMLSGMTTGCPIHNLIPEWNDYVYRGFWHEAYDRLNKTNNFPE FTGRVCPAPCEGSCVLGIINPPVTIKNIEYSIIEHAFAEGWVTPKTIANR TGKRVAVVGSGPSGLACADQLNKAGHSVTVFERDDRCGGLLMYGIPNMKL DKVQVVERRIALMKQEGITFMEKTEVGVDYPAGKLLEEFDAVVLCTGATK PRDLAEEGRQLAGIHFAMEFLTASTKALLDGTEPKLSAKGKKVVVIGGGD TGTDCVATSLRQKCASVVQLEIMPKPPMERQADNPWPEWPKVFRVDYGQE EAEALQGSDPRRYAMMTKKFLTTGNGNVSGVEVCSIAWENIDGRFVPTPI PGTEEIIEADMVLLALGFIGAEENLLQQLQVAQDERSNIKANTQNYRTNH ERIFAAGDARRGQSLVVWAIQEGRAAARECDRFLMGGTNLP >Cag_1088 hydrolase, alpha/beta hydrolase fold family MLHYKTYLHHNPKAAWVVFVHGAGGSSSIWYLQIKEFMQHCNVLMVDLRG HGRSKDMGIPEGMRRYDFNCVTRDIIEVLDFLRIEKAHFIGISLGTILIR NICELAPERVASMIMGGAIIRLNLRSTVLVTLGNTFKHVMPYMWLYRFFA WIIMPRARHKKARNLFVNEAKKVEQKEFLRWFSLTYDLTPLLRYFEEKDA ATPTLYLMGDEDHMFLPFAKRIVTRHTYATLEVIANSGHVCNVDQAREFN QRAIRFIKLHS >Cag_1544 phenylalanyl-tRNA synthetase, beta subunit MKISISWLREFLPNFSCETVSLVERLTFLGFEVEGVEESASLDRRIVVGR VLETEPHPNAERLTLCLVDVGREEPLRIVCGAPNVRAGMVVPVATEKAKL QFPDGQTLTIKPSKIRGERSQGMICAADELGLSNDHSGVMELESSWEIGK PFADYLESDVVLDIAVTPNRPDVLSHLGIARELADGAPLQYPSQQSLTYQ PAGERIAINDAVACPYYTGVIIRGVTIRESPEWLRKRLQAIGLNPKNNIV DITNYMLHALGQPMHAFDCAKLAGERIAVRSDCQAEVVALNNLTYKVEGG MPVICDGSGAIAAIAGVMGGMASAVTESTTDIFLESALFHPSMVRRTAKK LALASDSSYRFERGVDSRMVQQASATAVALILELAGGTVECAMEQGSVAA DLQLLALRPERTNKLLGTALSGEQMVELLERIGFRCVEQTTEQLLFAVPS FRVDVTAEIDLIEEVARLYGYNAIESSRQMATIYPTKRQHPAYFPDFLRG ELITLGFREILTNPLIKRNDAALASEQLVDVLNPISEGLEVLRPSLLPGL LKVISHNIRHGNRDQKLFEVAHVFEAKPQVQQTQQPLEGYCEQERLVMAI TGSRYLRRWNHPTDMVDFYDLSGAVEMLLEQLNILDKSVVNIYTPSALSI DVFLTEKGKRTTHRLGIMQPVNAAWLKHFDIEQEVYCAELDVALLERCYQ PTSAYEPPSRFPVVERDISFIIPEGVSAQSLVELVQSSNPLIKTVTVFDR FERNHESGKECSIALSLTIADAKATLQDEKINDILATISRNAESKLGAVI RQV >Cag_1480 Acetyltransferase (isoleucine patch superfamily)-like MNPWKNLQKRYFPALCYWLGNHFFMNWTPYPVRHWFLRKYCNVKIGKDSS ICMGCFITGQKIEIGLNTVINRFTYLDGRVALRIGNNVNISHYTLIQTLT HDPQSSNFTCQEKPVTIGDNVWIGARAIICPGVAIGEGAVIAAGAVVIKD VPPYTIVGGNPARYIKTRTNDLHYKTRYFPLLDTDIQ >Cag_0111 magnesium chelatase, subunit I, putative MQLAAELEALGAEIVDASRFLVRVEEELSHRIVGQREVVRRVFIALLVNG HILLEGVPGLAKTLIVSSFAEAMALKFQRIQFTPDMLPADLVGTLIYNPK DLTFFPRKGPLFTNIVLADEINRSPAKVQSALLEAMQEHQVTIGDESYQL PAPFLVLATQNPIEHEGTYLLPEAQMDRFMMKVEVDYPSYDEELEIMLRS ATNAPRPSIQAVAQPEDIERARTLIDRIYVDPRVQRYIVDLVVATRSPAQ YGMENLNGMIECGASPRASIYMLLAAKAHAFLQQRPYITPEDVKAVVYDV LRHRIRPGYEAEAENMRSTDIIRQILQHVQVP >Cag_0824 Dhh family protein MIIPSYGRTLHAEEWQPLLEPLLAAQHLVLTTHENSDGDGLGCEVALALA LTALGKEVSIVNPTEVPPNYQFLRQLYPIVQFNPKSEEAIQELSLCDAVV LLDANLSDRMGTLWPHVRFARELGSLKLLCVDHHLEPNDFTDVMISESYA SSTGELVYGLILAMEQSVGRALFTPNIAQALYVAVMTDTGSFRFSKTTPY VYQLAGDLVARGANPEKAYDLIFNSLTPQALKLLGLSLSAISLVEGGKLS WLLISQEMLKATESKLFDTDIIVRYLLSVPSVAIAVLLVEMQDGRTKASF RSRGKLPVNKLAKEFGGGGHMNAAGALFPYTPEKVQQVLPQAVRRFIKEH EALL >Cag_1768 TPR repeat MLMLAGCSSSSSTVSTQKIQAPLPKPLPETVAYELATASLLMAQGEYQQA LERYRALLTTESNNAALHHALAKAYTANGEFVAARQHSQQSVTLEGTNVW YLRLLIALTHNESDYAQAVALSKKLVTLEPDNREALTMLAYEHLAARQPN EALEVFQRLLQLDPANAEVLLSSAEVALELGRRSDALRFFNQLLHYGIES DSIHFFIGDLQQQQGLHEAALASYRNALKLNPHLLPAWYRRLELVALSPN LSQSSKPTLFAEELQHFYKQSGTTLEQQLGLLQLFTNRATRNPAFISATQ SMIKALQQRYSSHSLVRFTVQIAQGRLFVAQGQHAQAITLLRQALRSPHA TRQPNVALDAESTLALAYERSGKVTESIRLYEKMLRRTPNNALLANNLAY LLATQHRELPRALELAKKAVAAEPNNPIYLDTLGWVHFAMQQYEPARELL EKALQGEPNEPEVIEHLIAVYEKLGNQSKVQELQERLRRVCL >Cag_0732 TPR repeat MNPPSSSSQVIPLQVRQLFDHARQLRKQGMLNEAIEAFREVIELQPDYVA AYNNLANALQAQGDSDGAEAVYQQALHYAPMLPVLHCNYGSLLLARQEYD AAIKSYQKALTLQADFFLAYTNLAKAYSVRGNFFAALQTYKAALRLKPQD AELYLDCGQLYQQYGFIPQAVKYYRRSLQLAASARGYNALGAALQDWGNL KLARASYHRALKLQPDFDLPQYNLAQLYENLGELETARRYYEQTLTVDAE NAKLLLHLEMIKRRQADWSNYTERVEQLRHALERHVENDKGEAVPMLSVL SSSLSPALYRALAEQMARQLTRNAQALNATFTFPNNVAPERLKIGYLSPD FRGHAVGTLIADLFQYHERPDFEVFAYSLLPHHDEWTERVKAGCDHFIDV SHKSPLAIAQQIHADGIHILVDLAGYTSYARPLVLALKPAPIQLQYLGYP GTLGAEYVPTIIADKHLIPENHQSYYTEQLCLLPHAWVAAPMQIASLSLT RAEFGLPEKGMVYCCFNGVYKLEPHVFSLWMEILSKVPNSVLWLIDGEES GSNERLRAVAQEAGIAPERLVFAKKRSHEEYLALYRLADLFLDTLSYNAG ATAVGAFSAGLPLLTCQGEHYATRMGSALCYAVGLPELVAPTPADYVEFA VQLGSSPKKRAALKRKLAKKLPTAPLFQPQQFVVALEQQYRSLWNNYCEL TNNMLG >Cag_0192 conserved hypothetical protein MSLRTLQQQIITLYKSRYKATPQSITQLQGDASTRRYFRVEYNSLGTIAC YDPAFVGADPERYPFLVLQKLLKQHDILVPRTCVYNAALGLLLLEDCGDL LFQNYVLEVLHTKKYDVLQQIYQNVVELMVAVQSIKGNEHELPFNLSFDR EKLLFEFDFFLQHALRGYFASEIDAALIPLLRQEFEAITDLLVQPEHFVL NHRDYHSRNIMVTYDGYFLIDFQDARMGLPQYDAVSMLRDSYVVLPDELV AAMKEFHYKQLLEHHLTTMTYYEYCYYFDLMGFQRAIKALGTFCYQAVVK QNRSYEQYIAPTLGYIVNYIAERPQELGKAGGLLQPLLEKALHQ >Cag_0510 conserved hypothetical protein MSNAPRRFHFIVNPAANKGRATRHIAKLQQRLLGRNEAKVHVTQTAGDAA VCAQSAAQAGDTIVACGGDGTLHEVVNAVAAMNATVGVLPLGSANDFIKS LYTNPAEASNIDALWSAQAKAVDLGRVTYGSTQRYFVNSMGIGFTGNIAR HVRENRWLKGDLTYLYALFRVLITLQPRTFTLTLTTPQGVRHEQEPLMVF SIMNGKIEGGKFTIAPEAAIDDGLLDVCLLKAVPKWKIFRYLMRYIRGTH INDAQVLYYKASRIELTLNEPTTMHMDGEVYENVCGNVTIEVVPKALRML IIN >Cag_1230 Molybdenum-pterin binding protein MEQMEEQYGGQIEKQKGDAIGLEGDVWFQKAESCFLGGDRIALLEKIAAL GSITSAAKAVGISYKTAWQLVDMMNNLASRPLVERTTGGKGGGGTIVTNE GRKVIEQFRVVQEEHRHFLQQLEVRLGESQNVCQLLKRIAMRISARNIFA GTVEHLTKGAVNAEVVLRLTGGQRIVSIITNTSADNLGLHEGMSAYAIIK ASSILIGQEISPASLSASNILQGTITRVVEGAVNSEVDVAIGGGNSISAI VTQSSLQHLALCEGAQVSAIFKASSVIVGTQ >Cag_0840 TPR repeat MSIELKAKYLGKTPKDKAKILIAQSSWEKTQLARDENQCYKLSLTEENEK FIDSILELGNGKEVDLIVFPEFSIPEKYLEKIREWTFHNQIIVIAGSANL QREEKYYNTSSIFFEGIPYKTEKHDLSPLETSNLLGGYGPSSGTNQFYFT ETPIGELGVMICADEFDRQTRNEFLKHNIDILCVIAFQQKGKDHHQSINE IVKESNNGIYVAYANALCNSWTDGHSAFFSNEYREGRVEYVETGLTKDDG LEMKLVEMPSNAGCLIVECNLKSKKPVIRNLDPNRALVNAELPYVFENGG LRQFTKEELKKPDEKSNKVKAEYQPSIPPIKAFVADYIGRQTDVEYLSEF LNNPQKHFCLLYGVGGMGKSHLLYCCMKDYKQKTFFYHVVSPNEEFTLNK LFEVCLLPKPDAKLSLEEKQNHFVKKFQENNIHLILDDYYEVQLDEVKSI LPKLTGIGKGKLLLLSRIIPSNISYIKADYLNHKILPLTEPDFKQVIQNF ILDKNLTLTDEEIHLIYEKAQGYPLGGQLIIDAKPYSKNLLELLTNLGKF EAEIDPDGKIYSGRILDNIFKKGNNKEIKLLCEFSALFGVSDIETVRQLP SYNLNLFQGLHSRKSFVDMDVQGKFSSHAMIRDFAYHRLQNKEALHLKLA KYFENNINGRTDDDWKWLNEAILHYTKSPKAEHFAFINRVERNFESRNIK EQIDKNSILKTIRNYTTLLNLYPDKPAYYNELGIAYRMNRQQRNAIETFE RALVIDPKDLPSLNELGITFRENNQKTKAIETFERALVIDAKHLPSLNEL GITFRENNQKTKAIETFERALVIDAKHLPSLNELGITFRENNQKTKAIET FERALVIDAKHLPSLNELGITFRENNQKTKAIETFERALVIDPKNLPSLN ELGITFRENNQKTKAIETFERALVIDAKHLPSLNELGITFRENNQKTKAI ETFERALVIDAKHLPSLNELGITFRENNQKTKAIETFERALVIDAKHLPS LNELGITFRENNQKTKAIETFERALVIDPKDLPSLNELGITFRENNQKTK AIETFERALVIDAKHLPSLNELGITFRENNQKTKAIETFERALVIDAKHL PSLNELGITFRENNQIEEAIKVCKRALNISKDRQLYLNLLQIYLFFKSDK QISKEIYDILLMPPRLHAFSASRKKYENIIRDMDYLLSISFDDVKQYESF LFLAIQYKAYEKVLFILEKLNDQFPDNSKIKSRLGKTLSNQVIGEHEKGG RFLKQAIGLFKKENNIQQLQGHIIYYFYNLLNQNQIELIEKEMMTYEKDL IYDANYFRFMANFSFVKNSNINDAISYFEKAIEISEVLMDKKEFAESLLR FLSEQKSLHYKTYFVKYEKYI >Cag_0666 hypothetical protein MNKKIMNLPHKVNKKIDLIEFNKIKAIFRREFKRGTFKLLDVGSGLCSFP NYIKNEFENAKIYCIDINKDLVDLATKSGYNAQEGDLTKLNYNDNTFDVV HCSHVVEHLPYPHVIEAIDELVRVCKSNGLIIIRSPLWANHRFYNDIDHI RPYPPNSILNYFANQQQQKVSKHRIVEEDRWYTKIYYEINPLRFNYKIIK YINFFLKISWLFLSFPIDRPNNYGIVLRKRSGL >Cag_0297 conserved hypothetical protein MENSSNIARQRKAVKSRACEPNENLLAADGWRVFKIMSEFVNGFETMSSC GAAVSMFGATRALPESKEYQLAEELGELLAREGFAVITGGGPGIMEAGNK GAQKAGGVSIGFNIKLPEQQHPNHYIDQEKLLHFDYFFVRKMMFLKYAQA FIALPGGFGTLDEVSEAIALIQTGKSERFPIILVGKSFWQGFYDWIRQTL LEEKGYINTFDLDFIYLEDDPKEVVAIITRFYPEGYTLNF >Cag_0277 Sel1-like repeat MKRKFFTSILCLLLFSGTVHAESPQEIQQLRIAAEQGNAAAQFNLGIKYQ FGKGVRQDYVEVIKWFRLAAEQNHVYAQLMLGTMYRNGEGVRQDYIEAIK WFRLAAEQRYADSQYSLGLMYAGGKGVSKDYVEAIKWFRLAAEQGNVEAQ AMLGSIFYVGKNVQRDEFEAIKWFKLAAQQNYAYAQMMLGTMYATGEGVR QDYVEAIKWYRFAAEQGNVEAQYDLGLLYLNGYGVRQNKAIAKEWFGKAC DSGSQEGCNQYRALNIPQKHR >Cag_0302 conserved hypothetical protein MKILERYIFQQFIKAFLFTALVFVSLFIIINMIEKLGNFMDHHVSALEIA RYYLLSIPSIFLVTSPVSALLASILVAGKLATQNELPAIRSAGVSMRQLL TPFAWGALLLFLFNFFNAGWLAPTTYSHNRTFEQLYLGKNAGDQETRNLH LLDSGNRFISIGAFNPINESLNNVSIERLSGATMISRIDADSMHYNRRTK RWTMWRVTERYFSNGYQSFTTKPTATIRLALRPKALHEMRLQPDEITLPR HYQFLREKEEAGFSGLERSAVKFHNKIAMPFASLIITLIGVPLAARKKRG GIAAEIAITLFIGFLYLGIQRTIAIAGYQGVLPPIVAAWLPNLLFLVVGW VLYKKSTDS >Cag_1487 putative plasmid maintenance system antidote protein, XRE family MATLRNIHPGEILMEEFLLPFGISLRKLSYDIVISQQQLEAIVEGRERIT ADIALRLSQYFGNSAKFWLGLQDDYDIEEGMEQNKVDILSILRFKQPVST SVE >Cag_0251 ATPase MNSILSSHKLKKSYNKHPVVTNSSIEVRQGDIVGLLGPNGAGKTTTFYMI VGLVRPDSGEVRLDDKPITHLPMYKRARLGVGYLPQEASVFRNLSVEENI AAVLEFTALSKKERQERMEKMLEELNITHIRKSMGYALSGGERRRTEIAR ALALNPKFILLDEPFAGVDPIAVEDIQHIVAGLAKRNIGVLITDHNVHET LSITNRAYLLFDGSIFMKGTPEEIADNPEVRKMYLGEKFTLERY >Cag_1785 conserved hypothetical protein MKQPLKIAHISDIHLSGANDRSHAARLTRLLQHLRNEQFDHLVITGDLGN HADPDEWRVVQQLLKQTEWYHWERCTILPGNHDLMNLEEEMRLYNALNPI QWFRQKAFQRKRQLFCELFYEIMGGKNQTFPFLKILNYPTLRLALVALDS VAAWHPSTNPLGARGFIEPQQLTALQQPQIAEALRSCVVIGLCHHAYKVY GTDSLIDQAFDWTMELQNRDAFFSLMQQLGASIVLHGHFHRFQSYQKEGI TFINGGCFRYNPYRYSELLLEADGSFQQQFLSLEEK >Cag_2009 conserved hypothetical protein MKQRKEPKPFGLLISGVYRSLGLEEPYQQFKALQVWREVVGEAIAEVTTL ERFTAGQLYIKVNNAAWRLELNFRKRDIIQRLNKELGSPLVQEIIFR >Cag_0266 Protein of unknown function DUF132 MSKYQIVVDTNVFVTALRSQYGASYKLFSLIDKDIYQLNISVPLVLEYEA VAKRMIDKILLNEEEVDNILNFVIQNSNRWEIYYLWRPQLKDPCDDMVLE LAITAACNYIVTYNINDFKGIEGFGIEAITPKAFLKLIGEL >Cag_1623 conserved hypothetical protein MKVIGINGSPRKDGNTAKLINMVFEPLQAEGIECELIQVGGTLIRGCLSC YQCVKLKDKRCSTKNDSFNEIFEKIIAADALILGSPVYFADITPELKALI DRTGFVARVNGHMLRHKVGAGVVSLRRGGAIHAFDSINHLFQISSMFTVG STYWNVAFGGRTGNEVEGDVEGVENMHDLGSSMAFLLKKLHCCE >Cag_0679 Protein of unknown function DUF132 MSYRIVFDTNCIISALLFSRQKMARLRYSWQSDAVIPLVCKETVSELLRV LTYPKFKLTRDERLLLLADFLPYAETITVLDVPSNLPVIRDSADQIFLTL AVVGNADALVTGDNDLLTIKDSFKMLPIMSLNEFNQWLK >Cag_0904 conserved hypothetical protein MAETDTPYLRFADISLDISRGNYSAARQKLEVLEPLMPESYHLNLLYARA LAGMERYAQACKYLQACCTLAPANEVAWYELATMQALAENDTSNAMESSS AYDPVVDELEQLSAALMKAGPILASDSSEPTSIAEQKQPFADDTEIAVPT ESLATLFIAQGAYKKAIRMYSHLIQLKPNNARFYQDEIDRLLDRL >Cag_1498 ATPase MSVAIELKHITKKFGSFTANHNISLEIAEGSIHAFVGENGAGKSTLTKLL YGMHQPTSGDIVLHGTSTHFSSPRDAIRAGIGMVHQHFMLIEELTVTENI MLGYEQASLFGSLPLKKAKERIAADALASGLTINPDARISSLSVGEQQRV EILKLLFRNASIMLFDEPTAVLTPAETEQLFTTLRALRAQSKTIVLITHK LDEVLSVADTVSVMRQGEIVATKSVAGVTREELARLMVGRDVLLRVENSP HATNAPVLEINNLTYRALNGNEKLRQLTLTLHAGEVYGIAGVEGNGQSEL LQALWGLVPEGVKVSGNITMQGSSLLGKSAAAIAALGVSHAPENRLHHAI IGDYSVSDNLIFGRHREATFHHGMGFNRATVERFSNAMIADFDIRCTNAL RQPISALSGGNQQKVVIARELTRPNLKLLILAQPTRGVDIGAIETIHKKI LAARTSGIAILLISSELEEIIALSTRIGCIYKGTIRHQFSAEEVERNRQH GRAFSERIGTYIT >Cag_0629 Hit family protein MQSFKEESYLCQSGKSIFADLAPEEDEQHFVLHRAKKCFIIMNLYPYNCG HLMVIPYLQTPEFSDLDRETWLEVMELTDLSIRALKKVMRPHGFNTGANL GRIAGGSVDNHIHFHIVPRWDGDTNFMPVLADVKVLSNDMISTYKNLKAA ITELLAQEAQ >Cag_0665 probable polysaccharide biosynthesis protein MRYLKNTSWLFGEKVLKLFVGLFVGVWVAKYLGPKQFGLFSYAQSFVGLF STIATLGLDGIVVRELVKDEKRRDELIGTAFWLKVIGAIAVLLVLAIAIN FTSNDSYTNILVFIIASATVFQSFNVVDFFFQAKVMGKYITYTNTITLFT SSIVKITLILSNAPLIAFAWTVLFDSIVLALGFIYFYLQQTKYSKPHFTF RKEIAVSLLKDSWPLILSGFVISIYMKIDQVMIQEMIGSEAVGQYAAAVR ISEAWYFIPMVVSSSLFPAIINAKNQSDDLYYARLQKLYNLMVWMAIAVA LPMTFLSDWIVNLLYGELYNEAGDVLMIHIWAGLFVSLGVARGSWIMAEN LQLFSTYFIGIAGAINVIGNYMFIPTYGINGAAFTTLVSYAISVIVAPYF FRATRHSVYMLMKAMLMFNLFRSLDE >Cag_1938 Sel1-like repeat MNVMKKFITSIVIASLTLLAINGFCETPSQKQISQWQQAAAQGNSEAQLN LGYAYDHGEGVKQDYAEAIKWYRLSAAQGDVKAQFNLGVMYYNGEGVKQD YAEAIKWFRLLATQGDAIAQFNLGVMYYNGEGVKQDYTDALKWFQLSAAQ GNAMAQNNLGVMYAKGEGVQQDYAEALKWHRLSAAQGNAMAQNNLGAMYY KGEGVEQDYVEALKWYRLSAAQGDAVAQWILGLMYYEGQGVRQDYGEAIK WYRLSAAQEDAKAQYNLGLMYYNGEGVKQDYAEALKWHRLSAAQGNAMAQ NNLGAMYAKGEGVQQDYAEALKWHRLSAAQGDATAQGILGLMYCEGYGVR QNYGEALKWYRLSAAQGNAGAQYNLGLMYYNGTGVRQSKAIAKEWFGKAC DNGFQDGCDAYRELNEAGAKTNRSR >Cag_0740 Type I secretion system ATPase, PrtD MKKPEVKSPLREALWAQLPALLKTFYFSIVVNVLVLAPSVYMMEVYDRVV NSRSHNTLLMLTLLVVGAYLLLEALEWVRRQIMQSAALQLDGKLREEVFS AIFAARLQNIPSAGAQALRDLKSIREFLPSQALLAMVDTPLALLVLILLF LMAPLFGWFSVAGAVVQFGIGFFNERRIRKPLQEANRSAMVAQGYADGVI RNAQVIESMGMLPHIHRRWMERQQEFLVNQATASDHAGTNAALSKLLQSL LSSLLLGVGCWLTLKGEVFGSAMIVASILGGRVLAPLVQIIGSWRQVEGV MEAYHRLEAMLRELPMPQKGMPLPAPTGQLSVEGIIAGAPRSPMPILKGV SFRVMPGGTLAVVGPSASGKTTLARLLVGIWPSTQGKVRLDGQDIYLWDK EELGRYVGYLPQNVELFEGTIAENIARFGEPELEKVEAACRLVGLDALMV NWPKGYDTQIGEDGAFLSGGERQRVALARAVYDMPKLVVLDEPNASLDEA GDAALINTVKKLRENGTTVIVMTHRLNILAAIEYMLVLVDGQVQKFGTVK EVMEALQNPQQAGGAQQQPQPKPAPQPKPMPSTPRLA >Cag_0383 oxidoreductase, short-chain dehydrogenase/reductase family MTKRSYLHSISYSGYSYAIPSGLKTREKTIGNMQQRHNSLGIVITGGSKG LGFALAARFLAEGDRVVLCARNGERLEAALAALRQQVPTGEVYGIACDVA DTAAPPLLAQFAVAKLGNIDRWINNAGTAGLQKRPLWQLAGSDIAETCTT NLAGTMAMCAEAVRVMQRQPSAPQACYHIFNMGFSAVGASFSRSAVPHKA SKRGVAEITHFLARELHEAAIRSIGVHELSPGLVLTDLLLRDAPADTRRF LQVVAQTPEAVAAVLAPKIRKVRGLNRTVRYEPLVAMVFRMVAGLPRLLR SASATS >Cag_0196 conserved hypothetical protein MQQPTPQQSLTTMPVEGRLFALALCSLLIVLTTTVPYLTLINVLFFSGIF WSGFIALHQTILRYQVPLSLRNAFVLGSLAGFVGGLASELLGIILMVLFD YRPGIESLSLIVEWATQQAMQNPELQEQVNMLQEAEKLAKTPITLGITDV LFNLAVTGMVYAPIAGLGGMFAVRWLKFQAARK >Cag_0039 Drug resistance transporter EmrB/QacA subfamily MANAPSLSASPKLLGTTEEHYETGWRKLIITLTVIVSAMLELIDTTIVNV AITQISGNLGASIEDTAWVVTSYAIANVIVIPLSGFLGNLLGRRNYYIGS ILLFTVASLLCGVATDIWTLVFFRFVQGIGGGALLPTSQAILYETFRPEE RGKATGIFSMGLVLGPTIGPLLGGYLVDYFNWEWCFFVNIPIGLLAAWSS FIFLKEPKVTHTVSKIDWAGIGLLAVGIGSLQFILERGESKDWFETPYIT WFTIIAVLSLIAFVWHELHTKEPAVDLRVLARSHNLPIAAVLTFIVGFGL YGSLFVFPVFVQGLLGFTAVLTGLVLFPSAMVTGMISMPLGMALQKGASP KHLMLFGMLTFSLFCWLLGQQTLQSGAENFFWILLLRGIALGFIFIPVTM LAISGLHGKDIGQATGLNNMVRQLGGSFGIAIANTYIAKRVAAHRTELLS HLSPYDPEAMNRIHAIAAKATAEHGLPPASAELAALKALEGTVTVQSTHL AFMDAFMLIALLFLCAVPLLFFIRLHKGEQASAMGGH >Cag_1179 hypothetical protein MQIPLNQFENYIDETILKRGLSYFKNGYVHEPEEIKQGEFEALVEGTEDY TVRLTIEKGIITDYSCTCPYDYGPICKHITALIFYLQQEELGLEVQPPKS KTTTTKAKKTTKRKTIAEQVDELLDKVNHDELKQFVKKTVLADKKLRQNL ILHFFHLTTNEDSKDFYAQQIKTILQIAKDSDGYISYSAVHNVCNITDQY LAAAQNEIANNKYNKAISICTAVIEEMTKGLQFTDDSDGYIGDAIFEALE ILQNIAKSNPPETVRIQLLEYAISAYKKSCFDGWNDHFDMIHFATLLVKK DVEINNLIDILKNSIHTAHYKEKVQEIIYELLVKFKKLEEAETYLEQNIS NSSFRLKVLETAYQNHNYSKAKLLANDAIKQENGISNSTWYEWLLKIAQA ENNIKSIIEVARYLLLQGYDSECDTYYALLQQHIAPEEWNGFVEKMIAEI KKKPYHLHYWLLPKFYIKQEQWERLLNFVQSTERIDILMTYDKYLVNNYF TEIMDMYKKYILHSLNRAAQRNEYQKACEYIQRVVTLGGVRTAEIIISYL KNNYPRRPALMEELSHIKLR >Cag_1769 Protein of unknown function DUF132 MQKDNAIRVVIDTNIWIGFLIGKTFSDLYKAIINDQIKILFSDELFAELI EVLQRPKFHKYFSQNDIAELISLIHLKTEFVEITDQFNDCRDPKDNFLLD VCVSGHADYLLTGDDDLLILNPFHEVKIINYRKFTDILKRV >Cag_1313 ExsB MDSLVTTAIAQQAGFELAAMHVNYGQRTMQRELNSFRAICSHYSIQQRLE INADFLGKIGGSSLTDLSMPVSVANLESHAIPASYVPFRNAGFLSMAVSW AEVIGAERIFIGAVEEDSSGYPDCRKIFYEAFNRVIELGTKPETHIEVVT PLIALKKWEIVRKGIELHAPFAFSWSCYKNEGQACGVCDSCALRLRAFEQ AGMEDPIDYETRPHYIDC >Cag_0909 conserved hypothetical protein MTITFSQKARLQIEENVRFIAADKPNAARKWAAGVKQAVYKLKEFPYLGR QVPEYANDTLRELIYGEYRIVYQVNIELSRIEVLSLFHSKQLL >Cag_1040 hypothetical protein MIDIFFTNPMRLIRSCTSAVEDDSHHSKQHVNRRKTMKKTIWLAAGVAGM LLGNPTVNAQAETRDIIQPRSDHSFVIDARPSFIYLPDQGFAVSVDSPYD IISGDDHYYMNQKGSWYRSSSYRGPWKLRKEKNLPSKIKKHRLEDIRMYR DAEYNKIINQRNSQQQRMDNNRPR >Cag_0100 tRNA modification GTPase TrmE MRNSLSFQDEPIVALATPLGVGALAVVRMSGQGVFDIARKVFHKQGAPDF HLASSKGFQAHFGTIHDAQGVVDEVIALVFRSPRSFTMEDMVEFSCHGGP VVVQHLLKALIDAGCRLAEPGEFTRRAFLNGRIDLLQAEAIGEMIHARSE SAFRTAVTQMQGRLSRQLEEMREKLLHSCALLELELDFSEEDVEFQNREE LREDVQRLQGEINRLLDSYQHGRLLKEGVATVLVGSPNAGKSTLLNALLG EERSIVSHQPGTTRDYIEEPLLLGSTLFRLIDTAGLREGEEEVEHEGIRR SYRKIAEADVVLYLLDVSHPDYCNELSDITSLLEQASPNVQLLLVANKCD AITNPTERLAQLQAAMPQATVCGIAAKEGDGLEALKQQMSNMVAGLDKLH EASVLITSMRHYEALRRASDALENGACLVAEHAETELVAFELRSALEAVG EITGKVVNDEILSLIFERFCIGK >Cag_1720 thioesterase, menaquinone synthesis protein MIMQPPLHIEIIGNKALPKIVFLHGFLGSGRDWLPLAEMLTSHYCCVLVD LPGHGSATLSASDEHHAYFTATVEALATVIQPISPEPCRLVGYSMGGRIA LALMLTHPELFHQAVIVSASPGLPTEEERAKRRAGDEGIARKIERNFPDF LEAWYQQPLFSTLKNHPLFQEIERKRAINNSESLAAALRLLGTGQQPSFW DALSKCAVPTLFIAGEKDERYVAIARQMVKLAPHATLSIVPNCGHTLHIE NKESFVEQLHTFFNQ >Cag_0886 conserved hypothetical protein MRKKRAGRPQHCRCVQDLPKVTCFTPSGVAPEAVEQVLMTVDELEAMRLA DRDGLYHADAAMQMKVSRPTFGRILESGRRKVADALVGGKQICIKGGTVL AVCDSIPTERPDICLCPTCGREFPHIKGVPCRNSICPDCNEPLQRKGGCL SDEESDEQENEQRTVGYPESEEELEIE >Cag_0045 Aminodeoxychorismate lyase MKSPRSFITRLILAVTLLIAAFPLGFLLIPGLNSKSKPTQLVVHREMRFS DVLDKLQASGAIRERWQPELIARMVPKFRTIKAGRYTIPPNTSNFGLLWY LRTHPLDEVRVTLPEGIDRRKMARILSRKLDFDSTQFMAATENPRLLAKY GIRASHAEGYLLPGTYDFAWGSSPDEAASFLIRQFKKLYTTERQQRAAAL GFNEHSLLTLASIVEAETPLDKEKPTVASVYLHRLRIGMRLQADPTVQYA LGGTTRRLYYKDLAIASPYNTYRNKGLPPGPICNPGKASIIAVLNAPQSG YLYFVATGTGGHYFGASLQEHHANVQKYKQARSSNE >Cag_1169 TPR repeat MKPFFTFLRYAFIGILTLSLVTPEVVDAAKKSKKKSSSRKKSSKRNARAK KGSNKKTSARQARLRVVDGVETERNSINLTASPSSASRQLNKRAMGFYEQ GRYAEAEPLYRELLTLDEKQLGSRHPEVAVTLNNLASLLQQQGRYNEAEP LYRRALSIREENFGADDASVAQSLNNLGSLLQDQGRYYEARQLYSRSLAI DEKVLGTDHPDVAADLNNLASLLQAQGRYAEAEPLYRRSLAIREQRFGAE HTLVAMSLNNLGVLLQAQGRYSEAEPLYRRSLAIREAQYPANNHSIVATS LNNLASLLQARGKLTEAEPIYQRALSINEQTLGENHPSVATSLNNLAGLL RAQGRYADAEPLYRRSLTIREEQLGENHPDVAMSLNNLGVLLQAQGRASE AEPLYRRALLIDEKVLGATHPQTIRLRNNLNALLNPSAIPLTTQ >Cag_0596 acetyltransferase, GNAT family MKHMNPKIVIRNEANADIRAISEVTAAAFQTLEISNHTEQFIIEALRATK ALTVSLVAEIDGLVIGHIAFSPVTISDGTRNWYGLGPVSVLPAYQQQGIG KALIWEGLLRLKDMNAQGCCLVGHPDYYIKFGFKNLPGLVHEGVPLEVFF ALPFDGHIPQGTVTFHEGFKADG >Cag_0537 Glutamate synthase (NADPH), homotetrameric MTMDAKTITTKQRLAIPRQAMPAQDPAVRTHNFLEVNLGYTPELAQQEAL RCIQCKDPVCIKGCPVNIKIDQFIKLIAEGDFLGAARKIKEDNVLPAICG RVCPQEDQCEKVCVLTKKYTPIAIGNLERFAADYEREHGDIELPSVKAPT GKRVAVIGSGPAGLSCANDLIQLGHDVTVFEALHELGGVLMYGIPEFRLP KEIVRTEIDGLKKLGVKFVTNTVVGRSVTVDELMEEENFDAIFIGVGAGL PWFMGIPGENLLGVYSANEFLTRVNLMKSYNFPDNDTPVFNCEGKNVAVF GGGNTAMDAVRTAKRLGAKNAYIVYRRSEAEMPARIEEVHHARSEGIEFL MLMNPLEFIGNDEQWLTGAKCLRMELGEPDDSGRRRPVPIKDSEFILPID MAVISIGNGSNPLIKQTTPDISVSKRDTIVVDLNTMATSKENVYAGGDIV TGGATVILAMGAGRTAAAAIHEKLMGGSAS >Cag_0481 Phosphoesterase PHP-like MPYSTSSVGHNGFQKADLHIHTKCSDGLFTPEEIVEKAARIGLNAISITD HDSVLGIDKAKPLALEKGVELIAGVEMSSTYKGYDIHILGYFFDYQHSEL KDYLDHCRQLRTDRAERMVSKLAKMGVKIGIEQIIVKAQNGSVGRPHIAA VLQDGGYVKSFSEAFSKYLGAHSPAYVKSVETHPADIIRLINKASGLSFL AHPAQNVPDEVLKQLITLGLDGIEIIHPSHDTYRQNYYREIANEYFLLFS GGSDYHGIRERDEDLFGKVTIPYDWVKKMKSRLMLA >Cag_1886 von Willebrand factor, type A MSEWLASLPTLTFAAPWWLVLLPLVAIVLWLKERWQRQVAAISFPDVQRF ERAKLVAPRWMVRMPQWFRWAALAVGMLLLAEPHLTLRSTTAAARGIDMV LAIDISESMMQSQTDTQSRFEIARQAARNVVEQRSNDRIGLVVFRGEAYT LSPLTRDHTVLSLLLDNLSSRIIQDDGTAIGSALLVALNRLQASESELQM VILLTDGENNAGEVSPLTAAALAARRGVRFYVLNVAFESVKDENAPRSAL YAAELQEVARRTGGSYFTVNNKTELETTIASIAARAKNGQGNMVVVQHNA VTQPLLLLLLSLLGLELLVSATRLLKIPS >Cag_0081 Biotin biosynthesis protein BioC MAQCVDKLLVGERFRKALATYREHAVVQHAMAEDLAAMLARHLPSSPTIK RLFEIGAGSGALMEALLHRFCIDHYFANDLVAESEGCLRPLLAPYREEAF TFLMGDIEVLAEWPSSLEVVISNATVQWLEQPAHFFQQAAKALQPHGLLL LSSFGASNMQELSSLLGVGLRYHAPDELIALASHSFDLLEVKEEQKELLF CSPEAVLHHLRCTGVNGVVRTQWTKSDYKRFLSSYRERFSTTDGKVVLTY HPYAIALQRKG >Cag_1932 CBS MEIFLLFVLILVNGAFAMSEIALVTAKRSRLSRLADDGDKSATTAMKLGE DSTSFLSTIQIGITSIGILNGIVGEGALAVPFSLFIHSATGIELETAQLI ATVVVVLGITYVTIVVGELVPKRLGQLNPEQIACLVARPMQILATITRPF GRLLSFSTNTLLRLMGVKPQITPSVTEEEIHAMLEEGSEAGVIEQQERDM VRNVFRLDDRQLGSLMVPRADIVFLDVTQPLEENICRVTESEHSRFPVCN GNLQSLLGVVNAKQLLLKTLRGGLTEFATLLQPCVYVPETLTGMELLDHF RTSGTQMVFVVDEYGEIQGLVTLQDLLEAVTGEFVPRNLEDSWAVERADG SWLLDGLIPVPELKDTLKLKEVPDEDKGLYHTLSGMIMWLLGRMPHTGDV LVWEEWNLEIVDLDGQRIDKVLASPLNNAPKASQKEEKPPVKSDDNAALC SIRPTP >Cag_0284 oxidoreductase, Gfo/Idh/MocA family MSSNIKIGFIGSGWSRIAQAPAFSLMDNVSMAAVASPTADHRQKFMKMFD VEHGFADWRDLLQCDLDFVCVTTPPFLHKEMVEGVLRSGKGVLCEKPFAL SVADATEMADVATRTPLLALVDHQLRFHPAVRHMKQMIDGGEIGKVFEVR AVVNLASRNKTDLPWSWWSDATRGGGALGAIGSHLIDLNRYLVGEIAEVS CNLGTSIAHRPDGNGNLLPVTSDDHFAMMMKFGRSSVALGSSSLMHVTTV GAYTWFSFEVVGSLKTIRLDGGGRLWEVYNDNAQRGRSLIDMPRWKQIEP ILPWDELVLQEKIKQSSLAVHGIFAVGFAFLAHRIVKALQCGEKAILDAA TFADGLMIQRVLHAGTESHQQRNWVKI >Cag_0486 TPR repeat MLDNNSTHIQPAGGFAAISKYNPHLWSAEQLRAIFVARTNELADLVQTLR MVQPDTVAQHVLLVGARGMGKSTLMRRLALAVEDDPSLSANWLPLRFPEE QYTVATLGQFWANVLDSFADTLQHLGESVIALDAAAERIAALPVTQQPEA YIDAINHFADERKQRLLLLVDNTDMLLHNIGKDAHWGLRATLQSNPRLFW IGGSYQSLEAESNYHDAFLDFFRVINLRPLKVEEMRQALLALAETFGGAT ARNAMVHQLDLQPERLPTLRQLSGGNPRTTVMLYEILANGQNGNVRSDLE ALLDNMTPLYKARMDSLADLQRKLLAHILEHWAPISFGELAAVSQVAKGT ISPQLQRLEIEGLIEKTSLHGTTRSGYQAAERFFNIWYLMRFSPRRQRNR LVWLVEFMRLWFSGDELCSLAKQRMSVGSNDLRSTHDLEYDRALADALPQ FAPERHALRWSLLKLLQENNSQLVELFDFDGEDKEFKGATDYLRRLAALP SLLRQCPHAVTEQEKTHWVETVLGSISLTLEEKEIVAQKAEYLTLFQYDE LLKVFSEEQKRWEKQFGVAALQTVRSAVLSQDFFSDMPDSHLAYEQVRVC FANNKEALRFVSLLFYSKHKDEWSYKAQKLALNLLHDDSKSDWFLREKLV RYEETEKAYCKAIEFNKEDAVTWNHVGNLLKDYLGRYEEAEAAYRQAIAI DKKFAYPWNNLGQLLHYNLNRYEESEAAYRQAIALDEKYAYPWFNLGQLL HYKLERYEESEAAYRQAIAIDENNAYPWNNLGQLLHEWLGRYEEAETAYR QAIALDEKYVYPVTNLARLLAQRNRKAEAETYYREAVLKDTQDTQQLFLQ AHLFLGNRQLAMDALQALAEKAQNGNQYAFYRLKEQVWECYELGLGERLA DWMAESNVAEFLTPFIQALYTLAGVNEKLRDLPMESQHMVDEIVRKARLR QEKREACNMRAKSIH >Cag_0418 hydrolase, haloacid dehalogenase-like family MLKKLVLFDIDGTLLSMTSANRRILADALLAVYGTAGSSYTHNFAGKMDS AIIYEVLAADGLTRKEVASRFDLAKEMYITFMQKVARVDDVTLMPGIVEL LDALAERDDVLLGLLTGNFEGSGRHKLHLASINHYFPFGAFADDAEHRNQ LPAIAVTKAHQRTGITFSEHNIIIIGDTEHDIACARAVQAKSIAVATGTY APHELEAHEPHLLYHNFSDTQAVLNDILSH >Cag_1196 adenine specific DNA methyltransferase MSMTIQEYVAALNRRYKTGIATEHSYRADLQALLSDLLPKVEVTNEPRRS ACGAPDFILTRKNIPIGYIEAKDIGKSLDNKLYREQFDRYRHALQNLVIT DYLEFRFYREGDLVTSLVLAEMHKGKVVIRQENVAAFADLMQDFGGYEGQ TIGSASKLSKMMAAKARLLADVLEKALDGYSDENNGAIDEASNTLYDQLK GFRDVLIHDITPKQFGDIYAQTIAYGMFAARLHDPTLENFNRKEAAELIP KTNPFLRKLFQYIAGYDLDDRLVWIVDALADIFKATDVNSLLKDFRNATQ QNDPIIHFYETFLAEYDPTLRKSRGVWYTPEPVVNFIVRAVDDILKTEFD LRDGLTDTSKITVEIDKATTDKNFKSKHIKQKQEVHKVQILDPAVGTGTF LAEIIKHIHKQFEGQEGMWNNYVSHHLIPRLNGFEILMASYAMAHLKLDL LLAETGYTSTTDQRFRVFLTNSLEEHHPETGTLFASWLSQEANEANYIKR DTPVMVVLGNPPYSGHSANKSKWIEELLRDYKQEPNGGKLQEKNPKWLND DYVKFIRYGQYFVEKNGEGILGFINNHSFLDNPTFRGMRWHLLSTFDAIY LIDLHGNAKKKEACPDGSSDKNVFDIQQGVSINLFVKTGKKKKGALAEVF HYDVYGDRPFKYDFLSKKSLSTVDFTKLTVAAPNYLFVPKDFSVLASYNQ GFAINELFSLNSVGIVTARDRFVIDSSKLALTQRIKNFFSLDKDELLRMY GLKENSAWRIHTIKQSTIRYSADFIQMLAYRPFDNRYVYYDPLFIERSRS DVMKHFLQENIVLTVCRQVKAGESYHHVFLANKIFESCLISNKTSEIGYG YPLYRYLNINNQIPTNGEPQRIPNLNAGIVNQIAIVLGLTFTPEKEETDG TFAPIDILDYIYAVLHSPAYREKYKEFLKIDFPCVPYPQEPQQFWQLVAL GSELRQLHLLESPTVEQRLTSYPIAGNDVIEKITYLDGKVYINEKQYFEG VPLVAWEFYIGGYQPAQKWLKDRKGMALCYDDVRHYQRIIIALTETARIM EEIDTVGVE >Cag_0575 conserved hypothetical protein MATRLIWSPEALEDIELIASYIERDSLWYAKVVASKIFAVAETIAKNPKA GRVVPEIANPCIRERLVYNYRIIYRIDVELILVVAVIHGARLLPSLTHRF EFEQ >Cag_0160 TPR repeat MNILLNKLVIIFEFVVGVTTLLLGADFLGVEWKQYPFLQRLDGQQPLLLL LTFCSLLSILVIKLKNNSPAFEAVQTKDGTTIVKATLSGFEQELDDAADE IKKKAQEYFEAGERDFANEQYAQAASNYKASFELLPTMAASLNAGIAFSI TSDYQAAEELYQQGLRIAQHKRDKEYEAAFLGNIGVGYRSSEKFADALDY QQKAFKLNQTIRNQRGQANQLCNIGLIYNDKGLLDAALGYFKQSLELYET IGDTSGQAQNLRSIGIIYRRKGKLNEALSCHQQALNIDKANKNASGEAEN LNNIGIIYKGKKEFDKALNCCYQALDICKRIGYKFGEGLALSAIGVIYYS KGELDNALDYCNQALKLYKSIDSHLIAQAENLNNIGLIYQDKGQLDQALI YLGQSYLLFQKTGVTLQLKKTEANIQEVKRLQAAKGKN >Cag_1421 conserved hypothetical protein MKAIILYDSKSQGGSTDRLVDAIGVKLAEAGHYVEKARCKSNGDYSFVKE FDMVIMGSPIYYLMVSTELLGSMFQSNLKSCVEGKQIGLFLLCGSPEIMG NLLYLPQLKLHLLGQSLVAEKIFAPDQASNPEAISAYANKLLAALKNH >Cag_1348 TPR repeat MEQTAKPSAEEQLLYIVIKYKKALIGALVVIVTLGAGAFFGTRYQEQREQ EAALQLSRVSPALEQGNFTLAINGTKQTAGLQKIANEYSGRFIGTPSGNM AKLLLANAWYSFGKYETALQHFNEVTIAHEDLAAAALAGSGDCYLNLNKL KEAAEAYQKAANKTDNRILKAQYLTHEATAYHYAKDFPKATELYKTVIAS YPGTTAASIAQHGLWQLSGSL >Cag_1662 3-oxoacyl-(acyl-carrier-protein) reductase MLTGKIAVVTGAARGIGQAIATNLAARGADIVLCDIKAEWLTETADKVEA IGSKAFCVELDVSNAASVQEVFNKIAEETGRIDILVNNAGITRDGLLMRM SEEDWDAVLTVNLKGTFACTKAVSRIMMKQRSGSIINIASIIGLMGNAGQ ANYAASKGGVISFTKSIAKELSSRNIRVNAVAPGFIASKMTDALSEEVKG KMLEAIPLARFGEPEDVANVVSFLAGDESSYITGEVINISGGMVM >Cag_1706 Protein of unknown function UPF0011 MASPSLQPATLYVVATPLGNLEDITLRAIKILQQVEIIACEDTRRASILL KHLAISGKRLISYHTQNEPRAIAQIVALLEEGNNVALITDAGTPAVSDPG FALLRAVHERGIVALPIPGASAVTAALSVCPLPLNTFLFGGFLPHKKGRK TKLAQLSAIGQPFVLYESPYRIHKLLDELEAILPNAQLFIGREMTKLHEE YLTGSIEEMRQHLTSSKTKGEFVVIVHPTAEKTINPESDTDADY >Cag_0469 polysaccharide biosynthesis protein MAALSKLKLLFKDTVIYGASTILARSLNYVLVPVYANTLSTFENGIQTII YANIALANVLFSYGLETSYLKVAADTHREGSEGEKPLFSTAVLTLLATST LFALLIVLLAPWIGALVGLDSGAAPFVRYAALILWLDTMLVIPFAELRLR RKALHFATARLLGVVAVVLCALLFIVVMKVGLSGVFLAEAAGSVVSLLVV LPLFRGFRFGFSGQQLREMLRIGLPYVPTGIAGLLIHLIDRNILIRIAPS DIERLYGAGYQPSDIVGIYGRIAAFGVVLQLVIQVFRFAWQPFFLQHGKE PDAQQLFHHVLSISTLLTMVLALSATFFVPDLVRYHYGGAFYLLPPPYWI GLSILPAIFLSYVFDMVSTNLSAGLLLTGNTRYLPVVTFAGAVVTALSCW WLVPLYGMDGAAYAIVAGTVVMSVVMGYYSLRFFPVSHDWAKLLLLLLTG IGMVLVQRQSEALLSAPAQVIGVKVGLVLLYLALALFLFRNKATRLLKQV RHRNSSPHVVS >Cag_1702 TPR repeat MLKLGKGVSTQPSIGRIGGEWYIGGMKQSFKHQALRRIKKGALLLCIFAA TTLTACSNNELEKLQQEAWKNPNDAALTLQLGYKYAQEGRYMEANESFQK VLALDPKRDEALQALGATAFRQKQYSQAISYFQQHLERAPADSARLYNLG NAYMQLKQYDKATELYNKAIDNSTAFIDAHYNLAVCYAKTGRRNEAQAIY EWLLTKNNYLATSLQKHLDKENQAPK >Cag_1363 serine/threonine protein kinase MRKRLFIKKQKFDKWELKRFLGGGGNGEVWECCDEEGNKGAIKLLKHVKS KSYARFCDETKIMEQNFDIEGIIPILDKFLPEKLDGSIPYYVMPMAESAE KVFKAKNIVSKIDSIIEICKTLAKLHERGIAHRDIKPPNLLVFNSRLALA DFGLVDYPDKKDISLQNEEIGAKWTMAPEMRRESSKADSLKSDVYSLAKT IWIILTENPKGFDGQYSIDSIIELKRFYNKTYTTPIDNLLTKCTDNDPNQ RPTVNEIILELENWKVLNKDFHERNQEQWFEIQTKLFPMTFPKRVIWENI EDIVKILKVVCTYDNLNHMLYPNGGGMDLEDVRLSHEKSCIELDCQLINI VKPKRLLFESFGYTAEWNYFRLELYELEPSGAYENDEYYENIQYYEEYDG YVSPENTMQLLRWFRGSFVIFNKRSVYNRISSTYDGRHNKMNTEEFRDYI QEMVSHTIEMNKKKSAMATIESKRRKTR >Cag_1777 PIN (PilT N terminus) domain MNKRFLLDTNVLSELMKNNPEKKVIQWFDAHQHERFFTSSITKAEILLGI ALLPKGRRQKQLYEATFKLFSTTFLEYCFPFCERAAVVYSDIVAHQKRIG RGICTEDAQIAAIALTENLTLATRNVKDFHFIQGLEISNPWHG >Cag_0043 conserved hypothetical protein MPPAIKIIILANVAVFMLQRLPWGGELLSAFASLWPIGTGNFYIWQPVSY MFMHGGLTHLFFNMFALWMFGAEIENYWGTRQFTIYYFICGIGAALINLI ATMHSPYPTIGASGAVYGVLLAFGMMFPNRYIFLYFFFPIKAKYFIAGYA LLEFVSGLGSREMGSGSNIAHFAHLGGMLIGFIYIILKRNDWALDDVVQK MRSLRSSGGKKSSPYQANRNSSNSNKLTPTDDEINAILDKISAHGYASLT DEERRKLLRAGGNG >Cag_1325 nucleic acid-binding protein,contains PIN domain MITYLLDTNIVIYTIKRRPIEVLETFNQHATRMAISAITLSELFYGAEKS SNVSANLSVIEDFCSRLQVLPYGAKASQHYGAIRAILAKSGQQIGMNDLH IAAHARSEGLILVTNNVKEFVRVPALQVENWVE >Cag_0195 sodium:solute symporter family protein MQPLDTALVLLFLVANIAFGLWQSKSNKSTGDYFLGGHSVPWIVAMLSIV ATETSVLTFVSVPGLAYRGDWSFLQLPLGYIVGRVLVSMFLLPLYFREGV SSIYEIIGRRFGTGMQKLASVAFLITRILGDGVRFLATGVVVQAVTGWSL PLSIVLIGVVTLIYTISGGLKSVVWLDSFQFGLYFLGGVISISYLLQQLD APFPTLFATLHEAGKLQVFQFSNDLLVNPMAFGAAFLGGVFLSFASHGVD FMMVQRVLGCRSLSNARKAMIASGFFVFFQFAIFLLAGSLMFLFMEGREV EKDREFAFFIVHHLPTGLKGILLAGILSAAMSTIASSINSLAASTVTDMA GGKVSLTGSRWISFGWSLVLIAIALLFDESNKAIIMVGLEIASFTYGGLL SLFLLSRSSRAFHPVSLAVGFLASMAVVVLLKYVGLAWSWYILLSVLLNV LLVYGIDIVTNTISPKRL >Cag_0015 conserved hypothetical protein MQRDEILSILSNHKAEFSERYRFKKLGIFGSVARNSASTTSDIDIVVDME PNILLRAQFKEELEQLLGSKIDLIRYWAKMNAYLKARIDQEAYYV >Cag_1215 conserved hypothetical protein MEQILQKLGIELNEQTRLSLDSTFQFGCHSKLSCYNSCCSNLDIFLTPYD VLRMKNKLGITSTQFLSEYVEPVIQQESKLPFLRLKFQEGGQCRFVAPEG CTIYSDRPVACRYYPVGFGIHKSQNAHGNDFYFLVREDHCKGFEETQEWT VREWRKNQGIDEYDDNNRIWMDIILHKKLVSPDLEPDEKSLKMFFMASYD IDSFKEFMFESRFLEIFEVEEEELELLKSNEAELMLFAHKWLQYALFKQP TMTVRQQPKA >Cag_0896 methyltransferase, putative MPKTVPNSPHTAQSTQWFEAWFNHPLYMEVYRHRDHNEATQCIRTILQHT ALEQATPATTTVLDIACGAGRHAIELARRGYNVTGNDLSTTLLNEAAKAA KQEKLPLQLTNYDMRHVPTHQRYQLVVQLFTSFGYFDSKAEDGAVVQKVW ELLHKNGWYVLDLLNPDYLAANFIAESQRQVGELTIKEKRTLEANRVRKE LCILSPSGETLHFSEAVWLYSADEIVDILHNVGFHTTEIVGNYDGSTFNA TSPRMMLFCHKA >Cag_1759 conserved hypothetical protein MKLEILESFENKYPNRDYTIEIVNPEFTSVCPITGLPDFGTITIRYVPNQ RCVELKSLKYYFFEFRNAGIFYENITNKVLDDMVALLEPRSISVITEWKA RGGITETVSVHYTSQS >Cag_0903 Peptidase M20D, amidohydrolase MKQEESHHPIAEAIQHKAAELFPEVVALRRDIHAHPELSLQEHRTTALIT SYLMQLGITPEKPLLDTGVIALIRGTSPHHHGKVIALRADIDALPLQEEN STDYCSIEAGKMHACGHDMHTAMLLGAAKILSGMKEQLAGDVLLIFQPSE EKAPGGARPLLDAGLFATYKPILILGQHCFPTIECGSVAFCRGAFMAAAD ELYITVNGKGGHASAPHKAADPVLAAAHMVTAVQQLVSRVVPPHEAAVVT ISAINGGHATNVIPRQVTMMGTMRSMNEEVRAILQERLQQAITHTAQAFG VEAELTIVKGYPVLYNNQTITDQASCICAEYLGHHQVQHCQPLMTAEDFA YYLQECPGTFWQIGTGVREGETANTLHSPTFNPNEEALQVGTGLLAYNAY RFLASLHGE >Cag_1588 glutamine synthetase MNSDSKVPVSTYFGAFTFDHKAMRAKLPKEEFVALQETIKAGKKITAEIA GVVAHGMKEWAMEHGATHYTHWFQPMTGSTAEKHDAFLSIDRDGTPIERF SGEQLIQGEPDASSFPSGGMRTTFEARGYTAWDPSSPAFLMKGGNGLTLC IPTVFISYHGAALDAKTPLLRSMDAASKSALRLLGVLGVEGVKRVKTYAG CEQEYFLVDKKFYTARPDLVMCGRTLLGALPPKGQQLEDHYFGSIPDRVL EFMQEVEHELFLLGIPAKTRHNEVAPHQFEIAPIFEEANIAVDHNMLVME VMRKIADKRGFALLLAEKPFAGINGSGKHNNWSIGTDTGINLLDPGDTPE KSILFLIILVSVLKAVHKRADLLRMSIASMGNDHRLGANEAPPAVVTVFL GDLLERVLDAIESGQVDLKTEKQVLNLGLSQVPEVNKDYTDRNRTSPFAF TGNKFEFRAVGSSQAPSVANMVLNTIMAEALDEMSEAIEAKIQAGSDKDV AVLETIREQITVTKPIRYPGDNYSEALQVAAHERGLPNLKNTPHSLRILL KKDVQDMFIKYGVLSHEEIDARLHIRLERYIKGIDIEARTLLLMLKTYVI PNVSEYQGDLGNSFNSLFAVSEAIGLSDKALDSQAKHLKMVAENLATLLD MTNELEEAIEQIESCKSEFDKADYCADKLLPFMEEVRVVADRLEQVVDRS RWQLPTYLEMLFEH >Cag_0294 conserved hypothetical protein MNSEILIYQNQTGDITIDVRLEEETVWLTQAQLCQLFQKSKATISEHIKN VFEEGELDEKVVVRKFRITTQHGAMVGKTQEMEVNGYNLDVIISVGYRVK SQQGTQFRIWATKRLKEYIIKGFVLNDERFKSGNAMNYFTELQERIREIR LSERFFYQKIKDIYTTSIDYEPRAEKTIEFFKVVQNKLLWAISKQTAAEL VYRRANAELPLMGMQSYDKKGAASIKKAEVSVAKNYLNEDEIKLLGLLVE QYLAFAETMAQQRTPMYMKDWIDRLDTILQLNGRELLTHAGNISHDMALK KSEVEFEKYRLSLKAVEKEESLRELEEDLKQLTKSAS >Cag_1642 oxidoreductase, short-chain dehydrogenase/reductase family MQNLWNDAELQGFVSNVCHEPDDHPELAALVYASRLLGRERSLVMHGGGN TSVKCGLTDMVGNHAEVLLIKASGIDLSNVTCRDYTPLRLGPLSKLVELC SSNDPIHAERVERFSTKEFKHLLMLNMFSLTDHMAEKRLTPSIETLLHAF LPHRYILHTHSFALLTMSNQPNGEALCRETLGEAFGSVPYIKPGLGLARA AAGVYEAHPAIEGLVLQKHGLVTFGETAQEAYNRMIDAVTKLEERIALAG RKPFTTVPLPEEIAKVEDVAPIIRGACAEEKEVGRRDYQHLILDFRTSDE ILTYVNSADVVRMSQKGSMTPDFIIRTKNKPLVVPAPDAADLNGFKAAVD EAVQRYRDAYIAYFNAQQQASGMEVTMLDPMPRVVLVPGLGLFGLGKSAA AAAVNADIATCTATAILDAESVGSFESISEREAFDIEYWDMEQAKINKVY HGTFAGKVVMVTGGASGIGLATAKAFRQRGAELVVLDLSQEALDKAAEEI GGNPLTLTCNVTSRADIRAAYDAVCKRYGGVDVIVSNVGAAIQGRIGDVS DELLRKSFEINFFSHHYIAQEAVRVMRLQGTGGVLLFNVSKQAVNPGPDF GPYGLPKAATMFLVRQYALDHGRDGIRANGINADRIRTGLLTEEMIKSRS AARGLSEHEYMAGNLLQLEVYAEDVAEAFVHLAQEIRTNAAIITVDGGNI AATLR >Cag_1268 Elongator protein 3/MiaB/NifB MAEIPAWLHTTNDANALASLLAPNATRSLESLAAEASAITRRRFGRTITL YAPLYLSNHCSNGCAYCGFASDRTTPRRRLEMEEIRREIAAMKALGISDI LLLTGERTPAADFDYLRQSVALAAEEMQRVAVEAFPMSVAEYRALAESGC TSVTIYQETYNRKQYEALHRWGAKKDFLYRLETPARALEAGIKHVGLGVL LGLSDPIEDALCLYRHVRHLERRYWRAGFSISFPRLRPESGGYQPPFPVD DRQLARLIMAFRIALPNIELVLSTRESARFRDGMATLGITRMSVESRTTV GGYAENETIKSSAGQFEICDDRNVEEFCAALRTQQIEPIFKNWERAYNAP SMSCFL >Cag_1302 ATPase-like MIKQLTLTNWKSFAEATLYIDPLTILIGTNASGKSNTLDALLFLQRVSSG IPIFQAIAGDVNLTPLRGGMEWVCRKPFNTFTLTVLTDGLSKDEEYRYTL TVQVNGTKAEILHEELTLLIYGTNRTTSKEKRLFKTELDEINHPSIPTYC YTGTQGRGRRFDLLRSHTILNQTETLNVRKEVQEGAKLVMTQLQRIFVFD PIPSHMRNYAPLAETLLSDGSNLAGVLAGLEPSRKIEVEKTLTTYLKALP ERDIKRVWTEHVGKFQSDAMLYCEEGWSNETTQEIDARGMSDGTLRYLAI VTALLTRQSGSLLVIEEVDNGLHPSRAHLLIRMLKELGKQRGIDLIITTH NPALLDAAGNRMIPFITVVHRNSSTGTSSLTLLEDIEQLPKLIAQGTLGD LTSDGRLEEALQQKRGNGE >Cag_1433 possible NtrR protein MGMQNNRYMLDTNIASHIIKGDIPVVRERLIALPMEAIMVSSVTKGELMY GLAKRDYPKVLTQKVNEFLLRIQVLAWDQDVAVVYGKFRSACETIGVTLS PLDLMIAAHAHASNAILVTADKAFSRVPNLVIENWASEPS >Cag_0112 von Willebrand factor, type A MGREWFSLTKHSLEQTPQESLAELQRRIRRIEIRSRRKATELFSGEYHSS FKGKGIEFSHVREYHYGDDVRSIDWNTSARNQDLYVKLFTEERERSLLLM VDGSASMFFGSNQQSKKELAFELAAVLAFSALDNNDKVGLLIFTDQVELY LPPRKGRRHVLLLLDKLSRHKPQSKQTNINAALSFLRYTLRRQEIVFLIT DLIDSDYEKGMKQLNQRHDFILVHLRDALDTKLPLSGLLTLQDPESGERC VVDMATPQQCERYKAMQERSIEELRQRMRRMRIDAIYLETDHSFFGALNA FFRYREQKV >Cag_1316 glycosyl transferase MEPSPTVEIIIPHYRRRDMLERCLASLSRCPFYASPSLSILVICNGTADA ALQKLIANYPTVKLLALAENRGYAGGCNAGLQQSNAEYLIFLNDDTEHEA DWLEALLAIAQSNQNIAALQPKILSLEHAKQGKRRFEYAGAAGGMIDKLG YPWCWGRTFFRVETDGGQYDKARNIFWASGVALCVRRSVALEVGGFDEDF FMQMEEIDLCWRIQLAGYPIYSAPSSVVYHAGGASLAEGSAEKIYYNHRN NITMLLKNRSLVALLWIMPIRMVMELGAALFYLTKADGLKKSGMVFRALR DQLRAMPTTLRKRRTIQTNRTVSDRQLFHHQPFFHLLNHLIPQLYTFAQP LKNQQHHQ >Cag_0172 conserved hypothetical protein MRIQRRQFEIIQEQAFRELPYECCGLLVGKQQKDHRGNIENIVYEVAPCQ NCLYYGRESGFEIAHHEYLAVEAEAKQHGYQIVGSYHSHINSPAVPSLHD VDFALKGHSLLIIAMIYGQPKEVTAWLRHHSGSGVNQEQIRVIE >Cag_1075 carbon-nitrogen hydrolase family protein MQNATLRIAQIDCTLANFQENLATHCTLIEAAIADGMDAIAFPELSLTGY NLQDAAQDIAMHINDERFAPLCELSRHITIICGGVELSNEYGVYNAAFMF EDGRGETIHRKIYLPTYGMFEELRYFSAGKQIRAITSRRLGRIGVAICED FWHISVPYLLAHQGAQLLLVLMSSPMRLKPGSGEPAIVQQWRSIAATCSF LFSGYVACVNRVGNEDSFTFWGNSSVTNPEGTIIGAAPLMQPHMLDVSLE AAAIKRARLHSSHFLDEDVRLLSSGLREIM >Cag_0451 Tryptophan synthase, beta chain-like MNADVTKILLSEEDMPRQWYNIQADLPTPMPPPLAPDGTPITPEQLAPVF PMNLIEQEVSTERWITIPQEIQAILKIWRPSPLYRAHRLEAALQTPAKIF YKNEGVSPAGSHKPNTAVAQAWYNKEFGIKHLITETGAGQWGSALAMSCK LVGIDCKVFMVRISFDQKPFRKMMMNTWGAECIASPSMQTNIGRKILEET PDTPGSLAIAISEAIELAVQRDDTRYALGSVLNHVMLHQTIIGLEARTQL EKVNLYPDVVIGCAGGGSNFAGISFPFIGDKIHGRDVQIIAVEPEACPTL TRAPYSYDSGDVAKMTPLLPMHSLGHGFIPPAIHAGGLRYHGMAPLVSHV KQLGLIDAVALPQTECYEAALLFAHTEGFIPAPETSHAIAQTIREAKQAK EEGKEKVILMNWSGHGLMDLQGYDAFLSGRISDYPLPEEYLLRSLAAIKD HPQPPQA >Cag_1239 VCBS MSNTVVFIDSRISDVNALISRFAVGTEYYVLDSERDGILQITEALAGKSG YSSLQIFSHGTAGSLMLGSTVLNNAALSNYTAQLAAIGGALTASGDILLY GCNVAAGDVGQQFIAALAEATGADVAASDDLTGSAALGGDWDLEYATGAI ESGVDTNVVQEYDGVLEPPTTTITITLPTEAEIRNNSNIDQGYITESTLG SNVTLYLQDILAVDTTNLLSTTAPTSFVISVSYGKISFLKTKVDTIPIVS GNGSNTVTLTGTITQIRTLLTNNTISYVGNTGFSGIETLSAGISGVLANN SGNASGSANSELSIVGINDAPTILDAKATLTVSEGSLLNDVFEGSFTLSD PDIAHYIMQADITVLHGTLKLSGDDLDSVTGNETKSVTITGSRANIELAI TALQYTPDENYNGSDTLTVTVNDLGTTNVNQGNPDEDKTDIHQVTISVVN HAPVLENDYTPILSEISEDINSSITETEDGDNAGTLVKDIIPENDPTDSI TLDAITDEDVTSALQSIYITAVDSTNGKWQFKLDGQTTWSTITLTGDTAL LLSENNSLRFIPNNNWSGTATFTFGAWDGTGRDINNNNALYEAGDYVVIT QRGVLNAPFSLDVDTATITVNPVNDAPTFTAFSAPITPAITTNQPSLALE DGGNTVGEPSTTPITITFDDLATKGNEADANDAAYGGAVTAFVVKRVLSG TLTIDNTDYTAGTEDLNLTISESQDASWTPELNKNGILNAFTVVAKDNDT TDSKESTTPSVTVTINLTPVNDAPTVIPATQSATLTAGSDDVTSAYISMA MSDVDTGDIVKVDGDWLADESAANDGEEHWTSSDGGKTYTFDGSYGTATL YRVATTDGHSAGEVTYSFDYADTDLDSLAANATATDSFTIVVSDDAKVTA TADAVFTINGANDDPIITSGTQSGSVIENSSETATGDVNATDIDYGTTLS YSGDATGDYGSLDVTEIDGTWTYTLTSNALDDGESDIESFTITVSDGDGG SATQDVTITVWGDNDPPTITESAQSPDYVTEDGGINNGTAGTSSAHIDVT LSDADGGDTVSYVTTGWSGSGNTYTKTGTYGTATLHPNTHVIDYVLNNSD TDTQALDTNDVVSDSFSITVTDGTDFTSETIAFTIHGTNDAPTIDVSDTG ESFTESDDASMQTLSATGSITFDDVDASDSVSLTFAESTDISWSDSNGGG DYDNELDSDYSSVKAALLNGFSTTATGWSYSTSGGSSLIGSEFGAIPSNY FLPPPFSFTSLITGVLGDQNDVTEGQFKYDVYTLSNVVSGTLVFAAIESS AFPEYANVYTSANLQLRQPITPRIISNTTDGNARILAWFYLPGDTLWVGS DAPQQSGAYNLYLGGTIAESGDGVDLDFLDEDEQLTWSYTVSATDGTAST TSTESVSFTITGENDAPTISISDDSATITENSGSDVSDLQTIDVSGDIEI DDPDTHDDTVTVTSSLDSISWSGGDTTDSVFDTLTDGFTADEDGWSYSVT DGVDLNFIGEDETVVLTYNVTASDGTESDTDTVTITIEGTNDTPTVDVVE TSITFAEDDTLSDSGTVSFSDLDTNDVIDVTESYNNDIAWSYSGGTLTDE MIGEDISTLTNGFTAWGEDLSSDETVWNYSATLGTDDLDFLDAGETLTFS YTVTATDNNGASATDTITVTITGSNDTPHFNQDSVDNGSVTDTSASDTFS NDIIGTLTADDVDHNDTITYGVVNGTGTYGTLTVDSSTGVYRYDGNDNTI NALNAGTWSDTFTVTASDGTISESATVVITIHGANDNPTVSINDDSLSFT EDVSASAQDLTDSGSITFDDVDTSDNVDIIATYKNNISWSGGTLANQMSG EDSGKLTSGFHASATDSTSDSITWDYSATDVDLDFLAEDETITLGFTITA RDSHNAFATDDVTITITGTNDDVDITGGTTSGSVTELADKYSENGVLEND YLHSITGSIDISDPDVNDSHTATFTDNSESEQVTYLGSFDVADNGEDWTF TVSDEDLDYLDDDDDPLIQTYTITISDGHGSSDTQDVTITIHGSDDNASV AGRVYYWSNFDDIDDVATEMYAQDSMHTDGEDSGIEFRNIEKHDNGTYTL DIYKTADTDEADSFLIKLQLAKGSVATWEQSHDLFDGEGNELPDLTEFGF VTYASSLRTGECNIGGYSASLLSLPNDQEVKLGTLTITAPIFDAKLLSGS YIGDTAIDAGDIFSEMKLNVTEGEDITYYNGDGLYDYLNQLNSGEDSFYY YDSVDPDDYNFDAIKEVTTEDANEVTPADALLALKLSMGINTALPDLIAL FEESDTIPLIPYMYMAADVNKDGEVTIQDALNILKMSVNYACAPEQEWIF SPLPHNEQNILENMMNGFYVTYVNDLGEDVKVDWLDIKGISGDETGIILS IGDNAIPIEEWNIGVDWEQASPELHDITVTDNDLQFVDLIGILKGDVNGS WGDPINFNPPN >Cag_0569 GTP-binding protein Era MTPSHFASGFAMIIGQPNAGKSTLLNALLDFKLSIVTPKPQTTRKKITGI YNSERCQIIFLDTPGILKPRQKLHESMLGVVRSTVTDADVLIALLPYTGT AELFDRSFAAELFTEWILPSGKPVVAVLNKSDLASQEEQKAAEAFVWEQW KPTNVLSVSALKRKNLTPLISALYPFLPLTEPLYPDEALSTAPERFFVSE LIREKIFMLYGAEIPYSTEVVVDEFREQHDDDPSRKEFIRCSIVVERDTQ KQIIIGKGGAALKKLGQLARKAIEEMLGRPVYLELFVKVRPDWRKKSGML KSFGY >Cag_0882 sulfide-quinone reductase, putative MATVVVLGAGVAGHTCASFLKKKLGKRHEVIVVTPNAYYQWIPSNIWVGV GQMTIDDVRFELRKVYNRWGIILKQAKAIEIHPEGNRDINRGFVTIEYTD STRAGEIEYVEYDYLVNATGPKLNFEATPGLGPDKHTFSVCTYSHAAHAW ENLQEVMKKMQAGQKQRILIGTGHAMATCQGAAFEYILNVAYEISKRGLS KMAQITWISNEYEVGDFGMAGAFILRGGYVTPTKVFTESILAEYGIKWIR RAGVHHVEPGKVFYETLEGEETSIEFDFAMLIPAFSGVGMLAFDKNDNEI TDKLFAPNKFMKVDADYTPKSFDQWDAEDWPSVYQNPLYDNIFAPGIAFA PPHPISKPMSSPNGRQIFPAPPRTGMPAGVMGKIVALNIAERINGAPDFR HNASMSKMGAACIVSAGFGSFDGLGASMTLFPVVPDWNKYPDWGRDMNYS LGEAGLAGHWLKFILHYLFFHKAKGYPFWWLIPE >Cag_1251 Nitrogenase cofactor biosynthesis protein NifB MKQDITKHPCFNDSARHTFGRIHLPVAPKCNIQCNYCSRKFDCMNENRPG VTSKVLSPQQALYYLDQAMELSPNIAVVGIAGPGDPFANPDETMETLRLV RAKYPEMLLCVATNGLDLLPYIDELARLQVSHVTITINAIDPEIGQEIYA WVRYNKKMYRGKDAAKVLINNQLEALKRLKEVGVTAKVNSIIIPGINDAH VITVASKVAELGADILNCLPYYNTKETVFENIDEPSPELVFEIQKATSEF LPQMKHCARCRADAVGIIGEINSPEIMEKMAEVAAMAKNPFEQRPYIAVA SMEGVLVNQHLGEADRLLVYGIDEQGDCVLVDSRQTPPAGGGNERWEALA NLLSDCRTVLVSGIGNSPKKVLNNNGVEVLVMEGVIAEAVYALFNGHDMR HLIKTELAHACGTNCSGTGAGCG >Cag_1339 putative sugar transport protein MARNVVNAEQTEEVSSTRRVIAASSVGTLIEWYDFYIFGSLAKIISEQFF PKDNPTAALLATLATFAAGFVVRPFGALFFGRLGDLIGRKYTFLVTLVIM GGSTFAIGLVPGYATIGFAAPAIVFVLRLLQGLALGGEYGGAATYVAEHS PNGKRGFWTSFIQTTATFGLFLSLGVILIVRQTLGVETFQDWGWRVPFIL SAFLVGVSIYIRMKMSESPMFAKMKKEGKTSANPLAESFKQKDNLKMVLL ALLGATAGQGVVWYTGQFYALSFLQNACNIEFEQSNLIILIALVIGTPFF VIFGALSDKIGRKYIMMAGMFIAVLAYRPIYTMMYNDANLKNKIEIVDQT TVETKEEVKGTDNVITTVTKKTFEDGTTYKEIKKETIPLDAAKKAELAAA DKLKPETKKEVVLPQHLYYKMIGLVLIQVIFVTMVYGPIAAFLVEIFPTR IRYTSMSLPYHIGNGVFGGLVPLISTRLVEATRPAAGLPPADPLAGLWYP IIIAGVSFVIGMLYISNNTNNMDVE >Cag_0889 transporter, putative MSSNNSYKLGPITLAPSVLPRHALTYLYAAFFSIGLVTFVSIGQTYILNE HLKIPTSQQGAISGDLVFWTEVVTLLFFVPAGMLMDRIGRKPVYSAGFLL VALSYALYPLSRSIEEMTIYRMIYALGIVALTSALSTVMIDYAAERSRGK LIAITGFLNGIGIVVINSFFGGLPQKLMAQGFSGIEAGLYTHFGIAAIAV VAAVVVGLGLKGGTEVRKEDRPPLRSLFTSGIKCAKNPRILLSYAAAFVA RGDQSIIGTFVPLWGTTTGIALGMEPAEAVKQGMMMFIISQAAALLWAPV IGPLIDRWNRVTALFVCMALASVGYLSLGFIGNPHDANAYIFFILLGIGQ ISSFLGAQSLIGQEAPKAERGSVVGMFNISGAIGILIITTLGGRLFDSWS PKAPFLVVGAINVLVMLAAIYVRIKAPGKNLHVAEEG >Cag_2010 transporter, putative MTQPPDTPPFMDTESSSPQGKSAITRSLLKAFPAFANPDFRRYFPGQVIS MIGTWMQMVAQGWLVYELTGSAFDVGMAAAATTFPTLFLSLFGGLLVDRY PRRTILFWTQSSAMLLAFILGIVTMTGTVTMGIILLLSFLLGCVNAINVP ALQAFLSEIVRRDHLPSAIAMNSAIYNSSRVIGPALAGWLIAYSGAGIAF IVNGFSFFAVLLSLFTMKTKRRAPTVIESNPLLAIREGVLYAWNHKLIRL CIYYIAIVSVFSWAYVSMLPVIAKQRFGMDASGMGSLFGISGIGSVMGTI MVSMLANKIQPLRFIAIGSLIFAVALLGFTLTENLPLAMVGLFFAGFGLV AAVSTLSATIQGAVEDRFRGRVMSLYMMIFMGFMPLGNVTIGYLSDLFGT GFAIQLNCIVTIIAALLLLVHSKQFLRIG >Cag_0021 conserved hypothetical protein MTSEELQQLLTPEAQAMLQAHQHDNPTTFALRYSNRHDLPIRALAEQLAC RRKAERKLPTLSRHNLLYTTLSLEQASSERTARFKCTFMQGKRCIDLSGG LGIDAIFLAAHFEELLYCERNELLCNVVRHNMVRCGIGNVRLQQGDSLSF LASQPDNAFDWIMVDPARREEGKRSIGLEAASPNVVASQELLLAKAPHIC IKASPALEISNLKMLLPALHTILVVSVSGECKEILLLLKRGAEAEHPITK AICLQADNNAVVEIVGTHEQHRSLAESLQCYLYEPDAAIIKARLSGVVAK QEGLEFLNKSVDYLTSNHVVASFAGKVFQVIESVPYKPKEFRKFLDRHAI SAASIQRRDFPLSADELRKKFRLREDEKHFLIFTRNRNAEPICIYAERC >Cag_1626 conserved hypothetical protein MEQQDKFYKGLFWGAALGAAMGTVMGLLFAPYRGRETQQRISGKVKSMLD KATDLYESSEHDGMYNNDAKTRAQGVIDTARDEAKKILSEADSLMRDIKG HPAKTRES >Cag_1354 nucleic acid-binding protein contains PIN domain-like MMKKFAVVPDTNIFLASEKSVHSTSPNKEFVARWKREEFEVLYSEDTLLE YITKLRQKGISETSIKKLLATLFALGREIKIEFYHLLHYPLDPDDIAFLL CAENGKATHIITYDRHLKAIESSYTFRVCKPVEFLLELRHQYGIQPKA >Cag_1645 regulatory protein RecX MKNTPPENTLAAATGFALKLLGIRNHSHEELERKLRKKGFQAELCATVLE QLVARGLLNDRTFGEEMLQSRSQRKPSGKLKMRAELLNRGIASDIADELL SDYKSHELCHQAAAKKIASLRIADEAIKKRKVETFLRNRGFGWQEISTTL HHFFPTTSTMDDDIE >Cag_0760 DEAD/DEAH box helicase-like MPDIVKIEYHQTGESVKNTANGMRPMAARAYEERNSQYLLIKAPPASGKS RALMFIALDKIRNQGLRKVVVAVPERSIAGSFAKANLKKENNFFANWEPN DEYNLCTPGMEGSKSKVSAFKNFIDNDEEILICTHATLRFAFEELDESKF NNMLLAIDEFHHVSADGDSRLGELLRSIMQKSNAHIVAMTGSYFRGDSVP VLLPEDEVKFTKVTYNYYEQLNGYNFLKSLGIGYHFYQGKYTSAILEILD TNKKTILHIPNVNSGESTKDKHNEVDTILDAIGDVQKVDSETGIIFLERH HDKKIIKVADLVNDNPKDREKVITYLRNIKSVDDMDLIIALGMAKEGFDW PYCEHALTVGYRGSLTEVIQIIGRCTRDSANKSHAQFTNLIAQPDAADDL VKLSVNNMLKAITASLLMEQVLAPNFKFRTKLSDDDKADAGEIKIRGFKS PSSKRVKDIIESDLNDLKATILQDDTVMKAIPGNLDPEVINKILIPKIIE IKYPDLSADEREEVRQHVLVDSLVKGGEIKEVGDKRFIRMADQFINIDDL HIDLIDRINPFQKAFEILSKEVTTKVLKVIQDVIESTRIQMTMDEALLLY PEKVKAFMDKNGREPSVTSLDPLEKRMAEAIIYLKDLKRKKQSGQQ >Cag_1275 conserved hypothetical protein MLDVQVSHNSVGTLAHHTSEQGSYTFAYHKAIDIGQEVSLTMPWSLASYH YRKGLHPIFQMNLPEGRLRYTLERAFRKQAQGFDDLMLLDIIGHSQIGRL HCTSNPQLPKSVPLQSINELLAYNGTEDLLRDLLERFSATSGISGIQPKV LICDPNQAALGAKFPTHHSPQLTNAQARITVKGATHIVKGWDENEYPHLA LNEWFCMKAAKQAGLEVPRIFLSENYQLLILERFDLLEDGTYLGFEDFCA LHGLSTFEKYDGSYERVAKRITQFVSQEHRQKAFEEYFKIVALSCAVRNG DGHLKNFGVLYSNTTSDVWLSPAYDIVSTTPYIPRDSLALMLDGSKRFPS RKKLLNFARQHCNLQHEQATEMMEKIGDAVNETMAEIKVQIKEYSPFASI GNRMLSTWNEGIIDLNGKSTISFST >Cag_1600 ATPase MRIEHLIVKNFKGFVSKEFTFHPNFNLIVGMNGTGKTSMLDALAVAIGSW FLGFYVDSLKMRQIRHDDVLLKYIQHSWEHIYPCEVEAYGVVMDRHIKWS RELNTINGRTTYGNALAIKELALQATRSMLNGDDIILPLISYYGTGRLWQ EPREAFKVSDPRKVANKETQSRRTGYFNSIEPRLSVNQLTQWIAQQSWIA YQEQGQVFPVFNTVQDAIIGCIEDAKKLYFDAKLGEVIVEFSSGTQPFSN LSDGQRCMLAMVGDIAHKAAKLNPHLGSDVLKETNGVVLIDELDLHLHPR WQRRVIEDLRNVFPKIQFICTTHSPFLIQSLRSGEELVMLDGQPFATLGN LSLEEIAHGIQQVKNPEVSLRYESMKATAKSFLTMLDEASLAPKEKLKQL ADKLRPYADNPAFQAFLEMERIAKLGE >Cag_1488 conserved hypothetical protein MIVSFGSKECERIWDGFQVKSLPCEIQDIARRKLRMINNALTLVDLRIPP ANRLEKLSGDLKDFYSIRINKQWRIIFRWHNGEASMVEIIDYH >Cag_1384 sodium:solute symporter family protein MPTLTLLDYSFIGGYMLLTLFIGLWFSKRASENVGEFFLSGRQLPWWIAG TGMVATTFAADTPLAVAGFVAKHGIAGNWVWWTFVSGGMLTVFFFARLWR RAEILTDLEFIELRYSGAPARFLRGFKAIYFGLFINAVIIGWVNLAMFKI IRIMVPELPPEITIVALVLFTTFYSGLSGLWGVSITDAVQFVIAMVGCII LAVLAVQSPAVVSAGGLTGALPAWMFDFFPNFSHSAEESNSVTGAMSLPL LSFVAMAFVQWWASWYPGAEPGGGGYIAQRMMSAKDEKHSLLATLWFTVA HYCLRPWPWILVGLASLVMFPNLPANQKEDGFVYVMQAVLPPGLKGLLIA AFLAAYMSTLSTHLNWGTSYLINDFYQRFVKRDGTPQHYVLASKITTFLT AAFALYITFFVLETITGAWEFIIQCGAGTGFVLIMRWFWWRLNAWSEITA MVAPFIAFTLLQQFTTITFPISLFIIVGVTITATLVVTFATKPTEPAQLE TFYRTTRVGGRLWKKVSDTLPDVQSDSGFGMLLVDWALGVVMVYTILFGT GRVIFGEIGTGILFLAIGAIAGTLIFVDLNRRGWNNLQ >Cag_0200 competence protein MLTLMSPLVTPRATQLVQRKVLPPLHALRQLLPSGSIPPLRNVRHLLFPN VCLVCEQLLQPHEEHVCGACYASFDAFASPELAEYYVRRTITDHFCFPTF FERAWSRYKFHKESDLQELLHSLKYQGIFTLGVTLGKQLGEWLHSADLPD DIECIVPIPLHPLKKIERSYNQAEKIAEGISQLLNRPVRSSLLTRQRYMV SQTGLSATERQQNAEGAFCAKAPLRIGHVLLVDDVLTTGATMVAAAQALH DAGVAKVSIVTVAVAAKEM >Cag_1502 hypothetical protein MDNDDEILDSAPISKSDEELEKGFVAAIVEQPKLMDELAKQLVALSLAIP GIYATALKLLAGDDAVASSLPCIIGAFVFWALSLVFAFVSLTPREWHVER TLLRRNGASKNGAPLSIEEFFKVSARYKRALLIAATACCFVGICLACVAV FTVTPPNLSVPQTVQP >Cag_0334 ATPase MFEARNLSLSIGTKQLLNDTSFRIGDTDRVALVGLNGTGKSTLMRLISNT SPDSSTLRVGGDFIKSADTTIGYLPQEISFEDDLEKSALHYALQANKELF DLSETITRFEHELALPEHDYESEAYHRLIERFSDAMHNFERLGGYTMQSD AEKVLAGLGFSEIDFHKKVKAFSGGWQMRLHIAKLLLQNPTLLLLDEPTN HLDIDSLRWLENYLTNYEHSYIIISHDRFFLDKLTTRTLEIAFERINEYK GNYSTYEKEKVERYELLMSKYQNDLKKMAELNSFVERFRYKATKARQAQS RLKQMEKLEKNLVAPEEDLSQISFRFPKAQPSGREVMRLDGVKKSYTLPD GSRKEVLKRIDLEIMRGDRIAIVGSNGAGKSTFCKILANELDYEGKLTTG HNVSLNYFAQHQTDTLATEKSIYIEMMDSAPNSEAQKKVRDILGCFLFSG DTVNKKIKVLSGGEKSRVALAKILLQASNLLIMDEPTNHLDMRSKEMLIE SLENYDGTLLLVSHDRYFLDSLVNKVVEIKNGTLQLYLGTYAEYLEKSEK TRQAEEQAEALQRQKEQAAAKAAIKAEEQRAAAATPAPAKAKNSKKLEAI EKKINQLEQQKEEMERIMATEDFYKKSKEENARTLEHYHKLCDELNALFA EWETLG >Cag_0957 conserved hypothetical protein MSTSYSVLWTKVAERDIKEIITFIANDNPSNALHVLEKIKDKAAALSMAP ERGRIVPELHSKGIFIYRELIISPWRLLYRIANHEVYIMAVLDSHRNVED ILFHRLIQS >Cag_0834 hypothetical protein MLPQSKVIIHPSFKKKIPMKSLLLAFAICVTITALCFVTLEAVGMPQDIS KTISVMVLGAFPKLREMLEKMEGERSGGAVAVAKVQSFGDFNVSTSRALL YVTIVGFVALEFASGITGVVLALLGAQLSNIGVALQLLTMIIAYPIIFLA GRWIGRRCSQQPYMVAALAGLTIRLSTTIFDIAMVPMEQLIQIYQGQMQL SMIIVSQVGGSVLFALLLMAGAFVGSRQRLVVYVQYLLSRISPQSRLALV DLAHEEAVKMQKESGGK >Cag_1077 conserved hypothetical protein MKALDTNILVRFLVRDNQEQAERVYRLFKAAEADKTLFFISIPVLLELIW VLDSVYGIARYDILNAIEELLLLPILSFDGQPAVRSFIAEARNNNLDLSD LLIACDAALSGCEQMLTFDKKAAKSELFILLEI >Cag_1752 conserved hypothetical protein MNIEVRYHKLAENELHDAAKYYESRCSGLGRAFLTEITQAINQISAFPES APMILDIVRQKVIHRFPYSIMYVNDDDGVMILAIANHHRRPFYWGNRISN FHE >Cag_0513 putative DNA-binding protein MCMDGTKNEIVLYQSNELTSHIEVKVEDDTVWLNRQQIATLFGRDVKTIG KHINNVFLENELNKSSTVANFATVQNEGGRVVERQVEYYNLDVIISVGYR VKSKQGTQFRIWANQVLKDYLLKGYVLNQRMNCIENSVENLACKVKEIEL QITSNAIPNQGVFFDGQVFDAYELASRIIRSAKQSIVLIDNYIDESTLTH LTKKEKGVRVLLLTKNITKQLALDVQKANEQYGNFELKSFAKSHDRFLII DTNEVYHIGASLKDLGKKWFAFSQMDKSSVSTILTSIDTML >Cag_0125 hydroxyacylglutathione hydrolase, putative MSASQLVVKQIRTGGDRNFAYIAACTFTQEAMVVDASYNPAMVATVAANE GFTIRYIFSTHSHVDHTNGNAELSQLCGVPALLYGDMVPDLQRSVLDGTV LPLGKLNIQILHTPGHTPDSISLYCDNALFTGDTLFVGKVGGTYSDEDAR TEYESLWQKLMVLPDATMVYPGHDYGVAPTSTLAHERQTNPFLQQKSFND FLSLKKNWAAYKKAHGIV >Cag_0295 transcriptional regulator-like MNPLLHHHFFNQNSNAVIFPCEKAVWYYLPQIRADLAIELVATGMTQSSA AKKLGVTPAAISQYIHKKRGMQPNKSAEYLAQIKQAVAVICKGTAPADLQ RLVCSCCHLLQKASDEHAEACGGGQD >Cag_0756 conserved hypothetical protein MHTLNLKPKEHYRLQKGHLWVFSNELAQIPRDIASGETIKLLSHDGKFLG IGFFNPHSLIAVRLLTRRDEAIDHAFFKRKFAEAIALRTKLYSKEVTNAM RLVHGEADGLPGLVVDRFNHAIVVQTFSAGMEIHLPLICNVLQELLEPRV IIVRNESPLRELEGLTLYKEVVRGDAAEAIQQIYDYGVNYRVDLLEGQKT GFFLDQRENRRMVRAFAAGASVLDVFTNDGGFALNALAAGASSAMLVDAS KEALVRADYNGQLNKFSNYSLVAADAFDTLETMVEAKESFGLVVLDPPGF TKSRKNLPGALKAYKRLNKLGLQLVQSGGFLATASCSHHVSEEDFLGVIQ QAALAAGCNIRLIHKNTQPFDHPVLLAMPETSYLKFACFYVTR >Cag_0099 sulfide dehydrogenase, flavoprotein subunit, putative MSKKIVVLGAGTGGTIISNNLRRHLPHDWEITVIDRDDHHIYQPGLLFVP FGLQKVSTLVRSRKKYILSGINFVIDEITRIEPDKRVVTTKKHSFPYDFL VISTGCRVVPEDNDGLMEAWGKNAFSFYTIASAELLHRRLQEFQGGKLVL NIAEVPFKCPVAPIEFVFLMDWMCRKKGIRNKTEIELVTPLTGAFTKPKA SAVFNESAKAKNIKITPGFSLNEVHGKEGYIQSVQGDKVNFDTLVIIPST QGDEVISSSGLDDGIGFVPTHHHTLQALKHERIYVVGDATNVPTSKAGSV AHYEADVVAFNIMAEIHGIKPEEIYDGHSTCFIVYSKGTSSLIDFNYKIE PLPGQFPMPKFGPFSLLKETKMNWYGKLGFEWLYWNVLLAGHNLGAPPTL VMAGKELG >Cag_1987 Protein of unknown function UPF0079 MREEFFSTSESETLLLAERFAAALPPRSVVALLGTLGAGKTLFMRGICRA FHCEAQLSSPTFSLMNIYEGELNGQAVSVHHFDLYRLESERELEAIGFDD YLTSADLSVVEWADLFPHYKGRYTATVLLEYAGERERRIIIERGN >Cag_0156 ABC transporter, permease protein MESELLLLLLQIARLSVPYVLTSVGATFSERGGVVNLALEGLMLAGAFGA AYGEYLSGSPLAGVAMALLFGTAVALLFAFVTVTLKANQIVAGIAINLLV MGATRFGLTLLFGSAMNSPRLEGFAAPFLLLDPLFLTALLSVGVGQWVLF QTPYGLRLRSTGESAATADSAGVSVSSMRYSGVVVSGALAALAGAFLLFQ QHTFTDGMTAGRGYMALAAMIIGKWTPIGAALASILFAAAESMEMWFQSG VIPSQIIQTLPYVVTLVVLAGFVGKAQAPREVGVPFENGRGE >Cag_0792 basic membrane protein A MAAQRVHRFSPFTILLLLLTQLLVVGCSKQEQTASLPSSASAPMRIGLVF DVGGRGDKSFNDSAYNGLELAKQQHGVDFVYVEPQGEGADREAALREMAA NPDINLVVGVGLLFSEDITRIAADFPDKKFICIDYIHQPNVTIPANLQGI AFEELKGSYLAGALAGLTTKSNTVGFIGGMESGIIKKFETGFIKGVKAVN PNAQVISGYIGMTGAAFANPAKGKELALGQYGRGADIIYQAAGASGLGVI EAARETKKLVICTDRDQEPDAPGFVLSSMVKAVDRALLKSVESVLDGTFK GGEVKVYGLADRYTDYVYNEKNAPLIGEATHKKVEELRNNIISGKIELSE ALHQ >Cag_0257 ATPase MRLKSMRLENFRAVEHAVIEFGNRLTLLIGANGSGKTTILDGIAIALGAA LTYLPTLSGRSFKKGDLHQRHNSIAPYTRIALETTTGLKWDRIQRRDKSK STSKLVPAADGIRALEQFLDATILEPMNQGSDYLLPLFIYYGVSRALLDV PASRKGFTKKQHRFDALVHCLHADSRFRSAFMWFYNKELEENRLQKEKKS FEVTLRELDVVRSAITAMFPDISEPHIALNPLRFVVRQQGELMDIAQLSD GYKTLLGVVIDLSSRLAMANPHLDDPLAAEAIVMIDEVDLHLHPSWQQHV VGDLLRTFKNTQFIITTHSPFIVEAINNHIKRQQIEGLPINNNEVNQLLP LRSSDVKAYLMSDTIEMLMNNDVALLDDKLLEYFNSQNQLYDKMRDLEWE HKG >Cag_1504 TPR repeat MVEPLVLTPSVDIVTQHPALIRQTAELSRLYKDKEVLVTDEVLQAIGSIL WRLLDADEALANAKQRAGQHIVPLLLSSNDAAIQQLPWETIYHPDYGFLA RHEGFTFSRTIPSVQKALPDAAKGPLRILLFSSLPDDLTEKEQLQIEVEQ AAVLEALGEWRQSGHVVLEMPDDGRFSEFTQVLKSFKPHVVYLSGHGMFQ HDALNHTTTGYFFFEDEVSGKSKAFSEAEIAAALTATAVQAVIVSACESG KAASDSLVNGLTYRLLQQGIPHVIGMRESILDRAGIQFAQAFFSALMERH GIAEALQQARNAIVLPLQEDEEFKDTVEASISLGQWCLPMLFSHQYNRAL LDWEFTPQPMRAENRRNKSFKQIKSLPNRFIGRRRELRKWQRKFRSGKQN ALLLTGAGGIGKTALSYKLIMGLKHDGYEVFCLSFRPEDNWRKYLTSEIP FSLDEHRKNEFKDRIADNSDIVFQAECLFTLLLEQFNGKVAILFDNLESV QDSVTRHLIDAELQQLIDMALALESDGLRMLLTSRYALPHWDNSLVYPLG NPVYRDFLAVAQQQKLPKEFFKDDKGKRTYKRLRQAYEVLGGNFRALEFF AAALQTMNAAEEQDFLNGLKSATEQIQTNMLLEKVWSYRNQEEQELLCAL TAYQNAVALDGIKALNLPTMQQSEEFVRALVAVSLVEQYENKVWDVKEEF LVAPLVRDWLQKQGVATLPIELLQRAARYQQWLLENERRTFEQATITHAA LMAAGLNDEAHRVTLDWIVTPMNMAGLYQALLDSWLLPACYAVDKQILSE TLGQTGKQYHHLAQYSTALDYLKRSLAIVEEIGDKSGEGTTLNNISQIYD ARGDYDTALDYLKRSLAIVEEIGDKARVGAALNNISQIFKARGDYDTALD YLKRSLNIRQEIGDKSGEGVTLNNISQIFKAWSDYDTALDYLKRSFAIRQ EIGDKKGEGTTLDNIGKIYLAKGDYDTALDYLKRSFTITHEIGDKKGEGT TLNNISQIFQARGDYDIALDYLKCSLVIQQEIGDKSGEGTTLNNISQIYD ARGDYDTALDYLKRSLAIQQEIGDKSGEGTTLNNISQIYDARGDYDTALD YLKRSLAIQQEIGDKSGEGTTLNNISQIYDARGDYDTALDYLKRSLAIRQ EIGDKSGEGTTLNNISALYHARGDYDTALDYLKRSLAIAQEIGDKSGEGT TLNNISALYHARGDYDTALDYLKRSLAIRQEIGDVAGLCATLINMGHIYL QNNEIQDAVSAWVTAYTLARKIGYAQALDALENLAQQLGLPNGLAGWEML ARQMGEVNSFNRE >Cag_0236 putative plasmid maintenance system antidote protein, XRE family MNNTYTSQEDIAIARELLSCPGDTLAEHLDYIGITKMELAKQLQCSEQTI NEIIKGTAPITTAIALQLEQCIGIPANFWIERDRQYWLQLAEINEAENRL ALSNKALQLNIPLIMKKRTSCSCN >Cag_1186 hypothetical protein MQSQKPYNKQLWWRLILIIGLLLAVLFFIMTPWNIEHLPSRPNPAKSYNE ALARTEALRLAQAQPMNPRCELQLMTHKHKTEHAIVLVHGYTSCPRQFQA LGKQFYNAGYNVLIAPLPHHGYANRLSREHGKLTAEELATYADRTIDIAH GLGNKVVMMGLSAGGVTTAWAAQNRPDIERAIIISPAFGFKQIPLPFTAA AMNLFSALPDEFEWWNPTLREHETPTYAYPQYSRHALTQILRLGFIAKFD ALHHAPATKKIALVVNHNDTSVSNEAALQFIAVWQKKQNQTIAIIEFADS LKLPHDLIDPEKVNQRTNVVYPRLLQITGGER >Cag_0924 oxidoreductase, short-chain dehydrogenase/reductase family MHDKVFLITGASTGIGEATARRAVEAGFRVILVARSTDRLANLVAELGAT HAHAIPCNVAEWQEQEQMVVQALERFRRIDVVFANAGFSKGSPFFGGENK LEEWKEMVLVNMFGAAATARLTLPELVKNKGHFLVTGSVAGRSTSIRNLY SATKWGVSGMAYAIRNEMAEHGVRVTLVQPGVVDTPFWDNLQKLGTPELQ ADDVARAVLYAVSQPPHVDVNEVVIRPVGQPH >Cag_0024 conserved hypothetical protein MTNIAPLRFGVITDIHYTLDGSIATEQLAAAIRTCFASWQKRGITQALHL GDCIRGDEQFKYEELRQVLALLQEFQGEMFHVAGNHCLLMPRQELLAALG LQSTFYSFAMQGFRFIVLDGLDVSLFHPQADAEDAALLAHYLQHPQLHDY CGAIGKMQQAWLQAELASAERARETVIILSHLPLLPEVSAEPYGLLWNHQ EIAALLSASSTVKACLSGHYHHGAYAVRNGIHFMTLPAFSHQAQNPLALG MVLELEPSMLRMYNQYNEVVFCCTLR >Cag_0948 hypothetical protein MKYFIEKASQHDYADILDIMQYWNMHHIPSVEMEELDLSCFFVARISNII GGAGGYKVLSQKTGKTTLLGIRPEFLGMGIGKSLQEAMLVAMFNAGVKHV ITNTDRTETILWYKKHYGYYEIGQLKKQCDFSLSDVDSWTTLEMNLEEFI QKKLQR >Cag_0022 patatin family protein MKKILSIDGGGIRGLIPALVLAEIEAQSGKAIGATFDLIAGTSTGGLLAL GFAKNDGNGKAQYSANNLADIYLSRGNEIFSKSFLKSVASVEGLRDELYS ANGIEHVLDDYFGDDPLSSCITKSLVTCYDIQNREPLFLKSWREEYQSVL MKHAARATSAAPTYFEPALIPIGGATKALVDGAVYINTPSVSAYAEALKL FEDEQDFFVLSLGTGELIRPISYDKSKNWGKAEWVVPLLSCMFDGMADAA NYQMKMLLDDKYVRLQTNLSVASDDLDNVTANNLENLILESQKLIRTHRQ VIDMVCSLL >Cag_0536 Hydrogenase expression/synthesis, HypA MLISGIRIAQCLLLVNNCKYSPANTMHEMSIALSIVEAVEEQARKEGAQK IIALELVVGKLAAIQVESLTFCFAAAAKGTLAEHAALIIEEPEGIGKCEE CGKEFPVNFYYAECPQCRSLRINIVSGEEFRIKAMEIC >Cag_1997 TPR repeat MWGTFLFCAVMNFSLSHLARTIALLLLTASPAFAESADELFNRGFALHMQ GKLQEAVSCYSDAIDEVPTFAMAFQMRALAYQQLKKFPKAVNDYSSAIEQ GDASFKVVGYYNRGVVKNIMGDFVGAVDDFSQAIVLNKKMATAFFHRGIA RHQLGDNDGRFEDFRQAALLGDRTAEQWLNTYHPNWKPVPPSPPSIPPSI PSSLPATAPPIQPSSPPTAPTDAPKPASVQPAPSEPNDSTRTSATTPA >Cag_0068 putative lipoprotein MMLCSFLFKNLSILAMTFSRFIASVCLIALPVSALSLSGCSSSRQPTTAS EQVSDGYARAEALIKKGDYRSAVLVLEPILFTSRATALEDDVLFRLGQAY YHTEQYLLAADMFTKVQQLPASPYAATAQFMVASSYEKMSPPFELDQAYT QKAIEEFALYRELYPLTDSVRSAEQAAFWKEMLKVDAANETYKKNYAQAM VGMSRSDSVRYAGKAITTLREKLAHNAYSVALHYQQLGKLKAATIFLDEV IARYPDTSYYKLAMREKVDLLVKREKWYDAALALAQYQQLAPENGGALQS LQEQIARNTKK >Cag_1817 GTP-binding MNITSASFVASYTSLHALPEAVLPEIVFAGRSNVGKSSLLNSLTGLKGLA KTSAKPGKTRQINYFLINELFYFVDLPGYGYAAVSQSEKAAWGQLLANYI ERRDAISLVVVLVDSRHPFMENDVAMLEFLEFHGRPYGIVMTKSDKLNQS EKSKCQRVAKTYAAKAKFVVNYSSFSGAGKALLLSHIDHSIISQ >Cag_0213 WD-40 repeat MGFLSNIFGKKEVELKRPQVKEDENLIKTMEGHLDRVLCVKYSSDGKKLV SGSFDETAMLWDVASGKPLHTMKGHSTWVECVDYSRDSKLLASGSTDSTV RIWDAATGQCLHLCKGHDTAVRMVAFSPDSKVLASCSRDTTIRLWDVANG KQLAVLNGHTSYIECVAYSRDGKRLASCGEETVIRIWDVASGKNIANYDT GDRLSHAVQFSPDDKLIAFGGRDAMVKILDAESGNMVKVMKGHGDAVRSV CFTPDGRKVVSAANDETVRVWDVQSGNELHMYRGHVLEVQSVDVSPDGTV IASGSDDRKIKLWRLL >Cag_1551 RNA-binding region RNP-1 (RNA recognition motif) MNIYIGNLAYSVTENDLRDAFGQFGQVESASIITDKFSGRSKGFGFVDMP NDSEAREAIGAMNEKELNGRPIKVNEAKPREERPARRDRY >Cag_0259 3-oxoacyl-(acyl-carrier-protein) reductase, putative MQKNISQKTCFMTGATGVLGSAIAEAIAKQGYSLFFTWNGSEAKALLLLE RLQAISPHSAMVRCDVAQPSAIAEAFIEFRERYQRLDLLVASASNFFRTT LPEVTEAEWDALVNTNLKGTFFTMQEAARMMQQQSFVSRIITMTDISANL AWRGFAPYTASKAAIQHITRLFAKTFAPTILVNSIAPGTITLNPEHATEA ALDAVTNVPLRRTGEPADIVRTVLFLLEQEYMTGQILAVDGGRLLA >Cag_0071 Beta-phosphoglucomutase hydrolase MFIAMQRSAFIFDMDGVLTDNMRLHANSWIELFRDFGMEGMDADRYLKET AGMKGVDVLRYFLGQSISAEEAERLTEFKDFLYRVTSRNKITPLTGLQPF LEQAQQQAIPMGIGTGASPKNIDYVLELLELEQTFQALVDPSQVSNGKPH PDIFLRVASLLGAEPQHCIVFEDALPGIEAARRAGMQCVAITTTNNADEF RHFDNVLAIVNHFQELTPQGLLMLLTEKQNTLVA >Cag_0585 conserved hypothetical protein MNVVYSAEAVDDLVRLREFIAVHNPQAAHRISNELVSRIEQLCAFPEMGK QIPQFPTPSIRDFIFGNYIVRYAIHSDAITILKIWHHYENRIK >Cag_0301 TPR repeat MNMLQPPVVVLMTDFGITDTFIGQMKGVILSLCPIAQLIDLTHAVLPQNV VQGAFLLGKSLPFLPDGSVVVAVVDPGVGSTRRIIAVQTSRHTFLAPDNG LLTPMLASGDVQQCVSVTNERYMLPQRSSTFHGRDIFSPVAAHLAAGVPL AELGKSMPMAECVQLEVLRANVLDNGNCIESTILYTDHFGNAVTTIEREL LAEKHDWLIHVNELRLPLSTTYSDVAEHQPIAYIGSSGTLEIAIRNGNAA AALGLHAGVAVRMERGEWEVESGMWEEVVQAIPDLLKQGVTLHQSGKHNE AEACYQQILKQQPHHIDALHLLGVLFYHKKEYSKALDLLNQAIALKPTFT EAYSNRGAVLKELKRFDEALASYNKALELKENYAAAWYNRANLLKEWKQF SEAIESYNKAIEFQPNYPEAYSNRGVVLKELKQFDAAFASYNQAIALKPT YVEAYSNKGTVLKELKQLDAAIESFNKAIALKPDYAEVQWNKSLVLLLSG NFIDGWMLYEWRWKKADFTSPKRNFTQPLWLGKESLEHKTILLHSEQGLG DTLQFCRYATLVAKRGARVILEVPDILIPLLKQLEGVEQIIAKGKKIPPF DYHTPLLSLPLAFTTRLENIPSPSKYLFIDNNKIEEWKQRLHTIPHPRIG LVWSGRAEHKNDHNRSIALADLLRYLPNKYHYVSLQKEVRDSDKKTLDVT SNMVHFGNELHDFADTAALCELMDLVISVDTSVAHLSASLGKPTWILLPF IPDWRWLLDRNDTPWYASATLYRQHTRDDWESVLKNIATDLYDYFTTDNK VAVSHKATKAIQALLKEAIKLHQSGKQNEAAICYKNIIQLQPNHVDALHL LGVVAFQKEQYNEALNLLNQAIALNTDFASAYFNRGLVFKNLYHFDKALE DFDRALRLKPNYAEAYHKRGNILKELGLITAALSSYNNALALKADYAGVY LDKAIILLLLGNFADGWDIYEWRWKCKDLPLVQRNFTQPLWLGQKDIQSK TILLHSEQGLGDTIQFCRYTQLVAERGALVILEVPASLASLMQSLEGVTE IVVKGKKLPPFDCHCPLLSLPLACNTTLENIPSPSKYLSSNTKKRNKWKD RLQAIPQPRIGLVWSGSTQHKNDRNRSIELSELLQYLPDAYHYISLQKEL RESDKATVEATSNIVHFGDALHDFADTAALCELVDIVISVDTSVAHLSAA LGKPTWILLPYIPDWRWLLDRNDTPWYASATLYRQAHRDDWLSVFKRLQE DLQQRCMVESAQQQLPIHTSQNNHIATLLQQGVQLHKSGKQNEAELCYQK ILQLQPNHADALHLLGVLSFQKENYSQSLELLNQAIAIKSDFASAYFNRG LVLKNLSQFEKAIEDFNKAIEQKPEYASAYHSRGTVQKELKQFDAALKSY EKAIALKPDYTEAYCNRGNALQLLKRFNEAIDSYNKAIALKPQYAEAYSN RGVVFRELKELDTSLDNFNKAIELKADYAEAYSNRGVVFRELKMLDNALA DFNKAIELKKDYAIAYWNKSLVLLLLGNFAEGWQCYEWRWKKADFTSPKR NFTQPLWLGEESIADKTILLHSEQGLGDTIQFCRYAPLVAELGARVVLEI PSSLALLLQPLDGVAEVIVKGKTLPPFDYHCPLLSLPLAFKTTLETIPFP TKYLSLPSHKIKQWQQRIGNIAKPRIGLVWSGSTKHKNDHNRSIELSKLL EYLPDHYHYISLQKELRESDKATLEATANMVHFGDELHDFTDTAALCELV ELVISVDTSVAHLAAALGKPTWILLPFIPDWRWLLDRNDTPWYASATLYR QHTRDDWTAALERLHEDLRQRFLVSEM >Cag_1194 putative plasmid maintenance system antidote protein, XRE family MKNNYKSKEDIAVAREIISCPGDTLAEHLEYMGMSQAELAERMGRPKKTI NEIIQGKAQITPETALQLERVVGISATFWMNLEHNYRLLLAELDEAEKRI VDAEWAKQFPLQEMIDKGWITVDNGCDNAINTILSFFKVATPQAYQNYCH NQLYATAYRMSETCSKDPHAVAAWLRQGERQAEYLKAVLFDRKKFEEMLL TIKKLIVQDDNFFEALQDCCLQAGVKVVHTPCLKKAPLNGSTRWINDSPL IQLSNRFNRNDIFWFTFFHEAAHIIKHNKKDVFIEGMDYSFDGKKKEDEA NMYAEEYLISRKEENELLASTSFQKDDIQHFAEKFSTHPAVVIGRLVNKG KVKAELGHLYGFYKKVELH >Cag_1022 hypothetical protein MDNTFLSNGGIVTTDFGGSDSGSSIALQTDGKIIVAGESSGGGDGGFTVV RYNADGSLDITFDGDGKVTTDFGGLEYATSVALQGDGKIVVAGYKGISSS GGGDFALVRYNADGSLDITFDGDGKVTTDFGGWDEAESVTIDSNGKIVVV GYTGISSSGGGDFAVVRYNVDGSLDATFGAGGKVITNVGGEEYAHSVIVQ SDNKIVVIGDTGISSSGASDFALVRYNNDGSLDTTFGVSGKVTTNLDFGD FVGGATMQSDGKIVVVGESYSLADMAVSGDQDFVLVRYNTDGSLDTSFAD DGTLVADFGGGESATSVAVQADGKIVVTGDSFPAGGSGDSNVIVVRYHTD GTLDTTFSENGFVKTVVGESEGNSVVVQSDGRILVAGQSNGDFSLVRYNS NGSMGLGIDFDGTPIGTPSSYESFADLYFDETQATIAGYIGSALVTLAPP GVVHCFDEDHDGFADHFTRTWVDGNGTQSISGTTVWLDNNIFKSSGSALI GGIPYTVNQYGRAAYDADGDVVGMYFLTVNPVFTLTADTTVGNELVATFT IPNQAETWSLLDSDLNGVVDHVNRWNSWIDQNSVTQIRNFTYLLTWSDIT HFTARQVGVITGTSFDALGRPLGITFSNSTPPAHILPITWLDTKGDDNVV ATFDIPSTIFGQLLDTNDDNLPDQAVFIETSSSGQKDTATATIQGWSSWS DISTQQVTMEIQTSTAPWNFFTGTINGTSSNPTTVIMPSYFMGNNVVVPE TTFPSMTNNSLTFDLGTTGASLASSGSITLWSSATGTITIPVTSLTFDGS HLTIPLSGTDVATNQPYHLYPNSVDNFRVQIPAGVVIGEPTIDKAWFVGE WNNIGYELSPMMIVYNGDGTTDADWVLGTSGNDSVAAGAGDDLMKWSAGN DTIDAGDGYDKLYMPKAVPSANYITKTDSQGVLHIGEVNAATNTIIADAY RITRLAAGSFQIQKMDSTGTTVTQTMLLNNAEVLHIGPPSNYTSVALTIN YANEFIYGTPWRDTIYLNASNISTLSQIWAYSSTDTLAIDVGAGYSKIEV VREGSTSLLKGTLIADGTVVDLGSFSKALPSQYNYTATMSIGTGESAHSF TINNIEAYRFTSGDVILTVDPIPPTVISFTPSDNATAIAVGSNIVLTFSE TVQAGTGNIVITDGTDVRTIAMTDSSQVSIAGNTLTINPTADLAKGMHYF VMLDAGSIEDLAGNDYAGTTSYDFTTIVGGVITTDFGGDSFGCGVTIQAD GKILVVGGGSNGDIALARYNMDGSLDTSFGNETGKLTTDFGYEDAALSTI IQSDGKILVIGESVINGGSYGKCIIARYNIDGSLDTSFDGNGKVITDFLD GLNVDGFYTTEAILQSDGKILIVGGGYQSGNSLVTLFRYNSNGSLDSSFG NNGMVITPSISSLNSSDFPYGVVQQVDEKILVAVRSDNNAAITLIRYNSN GTVDTSFNADSMQITGLKENDMDGGLVLQVDGKILVSGSSNGNIILVRYH SDGTLDSTFNGNGNIVTDLGGNDGVGAITLQPDEKILVSGYSNNELALLR YNTDGTPDTHFCNNGVVLTNIGSDSFHEITFAGWGITVQADGKILVTGQS NGDFALVRYNTDGSLDTSFDGVSEPTPPTHDLSGHITFWKTGNAISNVQA TLATMPMQPASDDVAFRNLQHQSNGGYTVELWATTTQTELQSIQLAMQFS DNVTAQWQQSSAVPTGWLSVINNTQAGHLEIGTIGQATMQGDEEIMLGTL TFSAPDNPNNFTLAVTSGWLGDNSIAPTSILCTATDSEGNYSFETIADGW YQITGESNTAKLADAVTAQDALAALRMAVALNPDGGNNLDEVSPFQYLAA DINRDGKVRANDALNILKMAVGIESAPADEWIFVAESAASKTMDRSHVDW SFAEQPIDVYGDMELDLVGVVKGDVDGSWGMVG >Cag_0580 glutamate synthase (NADPH) small chain MTSISLTINNISVSVPQGSTILAAAEAAGVTIPTLCFLKELEERGACWMC IVEIKDKNRFVPACNTAAAEGMVIETENPTLSAMRRQNLERIIVEHSGDC NAPCELACPAGCNIPDFIAAIERGDNAKALEIIKEDIPLPAILGRICPAP CEEACRRHGVDEPLSICALKRYAADRDSEQAERYLPPCEPSSGKHVAIVG AGPAGLSAAWFLLRKGHKVTIFEAAPQAGGVMRYGIPRFRLPESVIESDV APLLAMGLELRCNTRFGRDVTFDNIRTQYDALLLAAGTEEAASMGIAGEE LEGVISGITFLRNVALGTQSTLGSKVIVTGGGNTAIDAARTALRLGAEHV TILYRRSRADMPANASEIGEALAEGITLREWAAPLSIHAVNGALEMQAIA MQAGELDASGRRKPVPIAGSKFTLQANTIISAIGQQLNPALAEAAALTTT RNGLAVNPDTLQSTSDASLFACGDCVTGSDTAIRAVAQGKLAAHSIHSYL TNQPVEAPTQPFNSSYGNREQAPKAFYAQKEAAPRVALPELPLSERQGNF HEVAIGYNNELARTEAARCLRCKCNAINNCRLRDLATHYLFGKVEQHPEH LGFYKAANSAISMEREKCVDCGICVRLLEEHNGNVEIAVMRQSCPTGALS MPM >Cag_1469 Death-on-curing protein MMEQDTNQLAVYQAENGALELRADSALDTIWASLDQIAELFGRDKSVISR HIKNIYLESELEKTATVAFFATVQKEGSRIVTRNIEYFNLDTILSVGYRV NSKVATRFRQWATKTLKQYITQGFSINQKRLEENKTQFLKTLEDLKILTQ SSQQVETKDILTLIQNFSHTWFSLDSYDKNEFPKQGTQEAIQTSAEELYQ DLMQLKAELVAKDEATELFAQEKNIGNLKGIFGNVFQSVFGQDAYPSVEE KAAHLLYFIIKNHPFNDGNKRSGAFSFIWLLQKAGYQFRDKISPETLTTL TILIAESKPSDKDKMIGIVKLVLSVEP >Cag_1862 polysaccharide efflux transporter, putative MSRNSLVAGQAGFAFAGLLFGQLMRFGYNLVVARLLGVEALGIYALAIAV MQVAEVVALAGCDASLLRFVNLYHNDAARQRQVIGFAAKSSLLFSLAVMA LLMLFANQLSALFHGNELLTLALSCYAAALPFNVLTQVTAHALQAFQHLK PKIIATQLLSPLLLLLFTLLFYYTVGIQAALLMPFLLSACGALLWILLPF ATTTGIRFIDIVRARHDNAMLTYALPLMAVSLFSMLSHWLDVMMLGIFSD AVTVGLYHPAARTAGLLRSVLLAFAGIAAPLFAELHAQGNKAEMARLYKL VTRWSVILLIPPLLIFMVLPQQVLSLFGAHFADSGAVALQLLSAAYFVQC VFGIASTLLAMSGYAQLSLINAVVALALQAGLNWLFIPTMGLQGAAVASL VLFLLLSALRWLEVRLLLQMNPLSTMLWKPLVAGAVTFLLLMLMHSWLLM LPSLLALGVGTVIAFSCYVALMLMLKLEVDEKEIIFKYLPFMRKDG >Cag_1344 TPR repeat MIHMPDFFDDDRFEFSSNNGELPPDLDGLDSIFDSEELVERIMQYMEDGF PLEALAVARRLEQIAPYNSETWFYLGNCLTMNAFFDEALEAFHKALLLSP TDSEMQLNLALGYFNNSMYEEALEQIERVMVDFAFEKEYHYYRGIILQRL DRYDEAEKAFLMALELDNEFADAWYEIAYCHDVCGRLEESTTTYNTALDH DPYNINAWYNNGLVLSKMKHYDEALFCYDMALAIADDFSSAWYNRANVLA ITGRIQEAAESYEQTLELEPEDINALYNLGIAYEELERYPDAMECYRRCI TIVPEFGDAWFALACCHEVLEEFDEAYSATLEALKTSADCVEFLLLKAEI EYTLNKAEESIHTYERIIELEPDNPQIWVDFAIVLREAGMVNASIEALHC SLKLQPMSADAHFEIAAAYFALGDKLSTLKALSKAFKIDPDKKELFQSTF PELYQQDSVRRMLGILEMPNE >Cag_1977 MazG MPATIDELKAAILKEHERSVPEGFQRVLDLVRVLRQECPWDRKQTAESLA HLLLEESYELVHAIDQQETDELKKEIGDLFMHLCFQVQLADEQGHFSFNE VFDALCKKLIHRHPHVFGSTEATTEKEVLQNWEKLKLSEGRKSLLDGVPS AMSELLRAYRVQKKVAGVGFDWQSDEGVIDKIVEEIQELKQAATQNEREE EFGDLLFTLVNYSRFIGTNPEDALRKATNKFMQRFRTVELLVAESERPWQ EFTPEELDTLWQQAKEK >Cag_1871 Small GTP-binding protein domain MKFVDSASVFVQAGDGGRGCVSFRREKFVPKGGPDGGDGGRGGHVWLETN SHLTTLLDFKYKNKYIAERGVHGQGARKTGKDGVEVVIQVPCGTIVRNAA TGEVIADLTEDAQKILIARGGRGGRGNQHFATSTHQAPRHAEPGQKGEEF TLDLELKLMADVGLVGFPNAGKSTLISVVSAARPKIADYPFTTLVPNLGI VRYDDYKSFVMADIPGIIEGAAEGRGLGLQFLRHIERTKVLAILIAVDSP DIEAEYQTILGELEKFSATLLQKPRIVVITKMDVTDEPLALQLAGEQTPI FAISAVAGQGLKELKDALWRIIVAERAVPTNQVPQGGE >Cag_0401 2-desacetyl-2-hydroxyethylbacteriochlorophyllide MKSMKAKAIVFSGVRQIELADVKLKPLSSTDVLVETWWSSISTGTEKMAW NGLIPSPPFIFPFIPGYETVGKIIAVGAHVNDNLIGRFAYVAGSFGYEGV NAAFGGASEFIACPVDSLTVLDNIEHPEAGIALPLGATALHIVDLAHVEA KKVLVLGQGAVGILAAELAKLMGAKLVAVTEPNCNRLKLSAADLKVNPDR QDVSAALAGHEFDVLIDSTGIMSAIDTGLRFLKFQGTVIFGGYYQRINID YSQAFQKELSFIAAKQWAKGDLERVRELIASHKLNAERIFTHHHTVGSGN ITDAYQQAFTDQDCLKMVLHWKQANEEPTTSN >Cag_1299 dihydrolipoamide acetyltransferase, putative MAKDHYFTWQLSAEISAKIRYQEYGHEHHGKTPILFLHGYGAMLEHWDLN IPHFAEQHKMYAMDLIGFGKSQKPNVRYSLELFAQQIQTFLLYKKLESVI IVGHSMGAASSLYFAHHQPEPIKALVMANPSGLFADTMDGVASMFFGLVA SPVIGDVLFTAFANPMGVSQSLTPTYYNQNKVDDKLIRQFTQPLHDVGAQ YSYMSPSKRPLDFRLDHLPKPCNYQGPAYLVWGADDMALPPQKIIPEFQQ LIPHAGAFIIPKAAHCIHHDAHEAFNQRLAFILQELEG >Cag_0105 conserved hypothetical protein MQEGFNHYQQQRRAYTLYQQHSYLQAEQAFHTLAAQAPSPKEKASAHFNE ACALAMQGNHTQALPLFTLSRKGTTLTEPLRLQALFNEGTLLAAQAKKSS ARQEKMTLYQRSLHHFKQVLLQSPTDVDAKINYEIVRRHMAALQPKPPQS PKQQPNRAAITPAGGIGNDVAQRLLEQAARNESSLMREMAQQGKSSTPRS TKNLRDW >Cag_1403 Methionyl-tRNA synthetase, class Ia MSTMPHFSRTLVTTALPYANGPVHLGHLAGVYLPADLYVRYKRLKGEDII HIGGSDEHGVPITISAEKEGISPRDVVDRYHAMNLDAFTRCGISFDYYGR TSSAVHHATAQEFFSDIEQKGIFQQKTEKLFFDLQAGRFLSDRYVTGTCP VCNNPEANGDQCEQCGTHLSPLELLNPKSKLSDATPELRETLHWYFPLGR FQAALEEYVNSHEGEWRPNVVNYTRTWFKQGLNDRAITRDLDWGVAVPLQ SAEAVGKVLYVWFDAVLGYISFTKEWAALQGNAELWKTYWQDPETRLIHF IGKDNVVFHTLMFPSILMAWNEGKTTDCYNLADNVPASEFMNFEGRKFSK SRNYAVYLGEFLDKFPADTLRYSIAMNYPESKDTDFSWQDFQNRTNGELA DTLGNFIKRSIDFTNTRFEGVVPASVTKEEWDNLGIDWQATLEQLDSAYE GFHFRDAATLGMEIARAANRYLTSSEPWKVIKVDREAAATTMALSLNLCH ALSIALYPVIPETCNRIRAMLGFSEPLEATIQRGTSLLSSLLTPTLQQGH KLREHSEILFTKIEDSAIAPELEKIAKLIAEAEKREAALAESRIEFKPAI SFDEFQKVDLRVATVVAAEPVAKANKLLKLRVQVGSLTRQVLAGIAKHYT PEEMVGKQVLLVANLEERTIRGELSQGMILAVENSDGKLFIVQPSGEGIN GQSVQ >Cag_1010 CRISPR-associated helicase Cas3, core MDAFDNKLYAHTLEGVKNKSQWQTLREHALSTAHLASDYATSFGLAECGY WLGLIHDLGKSLPQFQQRLEDDRVKADHKHAGGLFLWDKLNNGTKPSHLA AQCLALCVISHHGGLVDCLNQLGEDNFINTIENKLYQANLKDSLENLQLD TELEDNIKKISGARSLVQDEIDFFFQNILQQAKKYWPTEDDKNKRKKLEL FRIGLMTKMLFSCLIDADHTDTANFHDEERKNKNLPHLPKWDELRDMVER YLETLPQTSSVDIERKRISDKCIQASVCESGTYLLTVPTGGGKTLASMRF ALHHAVNREPYIPFKRIIYVIPYTTIIEQNAQAIRKVFVSQLNEDVLNEM ILESHSNVLPNEENRNNRVLAENWDAPIIFTTNVQFLEAFYGVGTRNARK LHNLANSIIIFDEAQTLPVRCLHLFCHAVNFLVEHCNCTAILCTATQPLL HEIPAEHGALWLSKNFQILPDKFRKDSADSLKRVTVIDECKPQGWRLEEV ADKVSCIHKQGNSCIIILNTKADTRELYTILRKRHGEELTYHLSTAMCAA HRMDILSEVKTLLRNNQPVICVSTQLIEAGIDIDFDTGIRALAGIDSIAQ AAGRINRNGKKPADSALYIQNISGENLKNLQDIAVAQVEAQKVLREFKEN PNEFGNSLLSEAVMKRYFKFYIFNRKDEMTYKIKSDNLVNLLSSNVNAVG EYKRTHKNQPYPNILRQSFATAAREFKVINSDTQGIFVPYNDEARGLLNQ LRNTKSSEFQRYLFRRLQRYTVNVYPYMLKKLTKIHALEPLCENSGILAL YEIFYDSRFGVNINSTISPDMLIQ >Cag_1691 serine esterase MLTRHHHDLTYLEYATPTLGNNAPLLVMLHGYGSNEKDLISLAPMLPDGL RIVSVRAPLTLAPEMYAWFSLEFLADGIRVDEAEARAACERFVLFLRDLI TRYQPAGSKVFLMGFSQGSVMSYLTAFLAPELLHGVIACSGQLPEKNMPS ESAFALLRTIPFVVLHGIYDDILPIEKGRHAHAWLQQQVDDLTYREYPIA HQIADDGIALISSWLTERLEKVGNSRL >Cag_0091 conserved hypothetical protein MFTQHNFSLNLAVRDYECDLQGIVNNSVYLNYLEHVRHEYLKHVGIDFAT LTREGIHLVVIRAELDYKASLTSGDSFCVGLTFLRESPLRFSFLQDIYRL PDNKLILKAKIIGTALNEHGRPFLPLQLEQLFQST >Cag_1141 conserved hypothetical protein MVASPAFGQMRYTALRLTLVALTFGTVQELNAAESSNFRQQMSMADQELW KARYPQADSLYNVLLRQNPSNTEVNWKLARLQISLGESLPPSQQPVRLRH YRQAENYARTAIAIDSTEAPAHIWLAASLGLMADKIGPQEKLKRAEEIKR ALDTAVRLNPDDATAHSLLGTYYYEASKIGWFRRMIGNTFVGTMPQGNKE LAEKEFRRSIALDPRMIRNYHDLAKLYLDMGRKAEAITLLKTALNKPILV ESDKRRLEQIRELLRKHNGDGE >Cag_0273 conserved hypothetical protein MKKIILRQAFNELNDAIAYYEEQQPGLGVKMKDEVDQHVHWILNHPLIPR LRHGGYRRVNLKVFPYYIAYLVHQETLWILAIAHTHRKPKYWIKRKNKI >Cag_1173 Protein of unknown function UPF0054 MSLELCNSTRQAIPNKRLLQAIRMVVQGEGYEIATITGVYCGNRMSQRIN RDYLNHDYPTDTITFCYSEGKAIEGEFYISLDVIRCNARHFNVTFEEELL RVTIHSALHLTGMNDYLPEERVAMQAKEDYYLQLLKTQKSISPQKSTDNA IFSCNS >Cag_0004 conserved hypothetical protein MNNHIVITGATGVIGVELAQKLIKRGEKVVLLARSPNAAQQKIPGAAAYV RWDSDMQEGEWKSTISGAKAVIHLAGKPLLESRWNEEHKQECYQSRIIGT RHIVAAIAEAAEKPQVFISSSAIGYYGSFDKCSDTAPLTESGNKGSDFLA HICIDWEEEARKAENLVPRLVFLRTGIVLSTRGGMLQKMMTPFQYFAGGP IGTGLQCISWIHMDDEVNAIIASLDNSAYKGAINLVAPTPVSMKEFASKL GAVMGRPSLLQVPEFAVKMLMGEGGEYAVRGQKVLPTFLEKQGFTFRYPD LSNALGDLIKHGK >Cag_1591 conserved hypothetical protein MQNKAIPHSPAEQVALLDSNECSLPRFHERLASEGVSQLQADGIEILQLN IGYRCNLRCTHCHVNGSPERHELMSREVMEQCLVALDKSNATTVDITGGA PEMNPHFRWFIGELRATKPDARILVRTNLTLLTDNKTYSDIPELLKAHRI ALIASLPCTTKKTVDAVRGDGVFDRSIAALKLLNSIGYGTSDSALELNLV FNPSGAFLPEAQQQLEHHYRTELQNKYGITFSHLFTITNMPVSRFLENLC TNGTYCDYMKLLVDSFNPTSVKNVMCRTTLSVGWDGTLYDCDFNQMLRLP VECSVPQHINAFDAEVLSKRHIVTNQHCYGCTAGAGSSCQGCLV >Cag_0489 IMP dehydrogenase MTKILYEALTFDDVLLVPAYSAVLPKETTTACRLTNTISLNIPLVSAAMD TVTESRLAIALARAGGIGIIHKNLSIEQQAREVAKVKRYESGIIRNPFTL YDDATVQDALDLMHRHAISGIPVIERPQNEGDASRILKGIVTNRDLRIKL QPNAPIAQIMTSQNLITAREDVGLQQAEEMLLANRIEKLLITDNAGNLKG LITFKDIQKRKQFPNACKDSHGHLRAGAAVGIRENTLDRVQALVDAGVDV VAVDTAHGHSQAVLDMVKKIKSHYPDLQVIAGNVATPEAVRDLVKAGADC VKVGIGPGSICTTRIVAGVGMPQLTAIMKCAEEAAKTNTPIIADGGIKYS GDIAKALAAGADSVMMGSIFAGTDESPGETVLYEGRKFKTYRGMGSLGAM SEPEGSSDRYFQDSSSEAKKYVPEGIEGRIPAKGTLDEVVYQLIGGLKSS MGYCGVATIDELKQNTRFVRITSAGLRESHPHDVKITKEAPNYSTSM >Cag_1541 metal dependent phosphohydrolase MGIVINLLLLVLAALVAFVAGFFIGRYFLERIGTTKVLEAEERAVQIVQE AQKEANEYKELKVSEVNQEWKKKRREFEQDVLIKNNKFAQLQKQLQQREA QLKKQSQDVRDAERKLQDQRKEVEQLSDSVKLRATELERVIVEQNQRLES ISNLQADEARQMLIDNMVTQAREEASNTIHRIHEEAEQQATRMAEKTLIT AIQRISFEQTTENALSVVHIQSDELKGRIIGREGRNIKAFENATGVDIIV DDTPEVVILSCFDPLRRELAKLTLKKLLADGIIHPVAIEKAYADATKEID DVVYSAGEEVAASLQLNDIPTEVIALLGKMKFHTVYGQNLLQHSREVAML AGVMAAELKLDARMAKRAGLLHDIGLVLPESDEPHAITGMNFMKKFNESD QLLNAIGAHHGDMEKESPLADLVDAANTISLSRPGARGAVTADGNVKRLE SLEEIAKGFPGVLKTYALQAGREIRVIVEGDNVSDSQADMLAHDIARKIE SEAQYPGQIKVSIIREKRSVAYAK >Cag_0825 conserved hypothetical protein MTHTTPLLPLAPPINLVDLRYNELHNAITAFGEPPFRAKQIHEWLFSHHA NSFAAMSSLPLRLREKLAERFTLQRPEVVEVQESCESGCLRPTRKILLKL SDGALIECVLIPAEERMTACLSSQAGCPMQCTFCATGTMGLQRNLSAGEI WEQLYALNGLALQEGKTITNVVFMGMGEPLLNTDNVLEAIATMSSRNYNL SLSQRKITISTVGIVPEIERLSRSGLKTKLAVSLHSARQEVRQQLMPIAA ERYPLPLLSKSLEAYSKATGEAITIVYMMLNGVNDSKEDAHLLARYCRHF SCKINLIDYNPILTIRFGSVQESQKNEFQAYLMAQKFHVTVRKSYGASVN AACGQLVTQQQRRTIK >Cag_1111 conserved hypothetical protein MDHTKINLLVRLQYLDNQIENIVSLQKGLPEEIEALEDDLAFTTRQIESR KKIADEQLRQRTRLNEQINECNNKINSFKEKQTLARNNKEYDALSKQIEY EEKEIANANMQLQDIAQTTQRLQELQKKGAQLITENRYDEITEEMMPDDV LLQQLKDLTAQVAQKREELESIVIETAADVATLEEKVVAQRALITKEAKR LIDKYDHLRSGTLRNAVVKLNRNACSGCNTRVPTNRHTMIVQGGFYLCES CGRIVVHERLFEEAKD >Cag_0885 methyltransferase, putative MMSSSYSLEKFNREAATWDEKPRRRLVAKAVAHAIIAHAKPQPTMRALEF GCGSGLVTMPIAPLVGSLVAVDTSPEMVKMVQQKAEEAALTTLTTLVDDL FAEAEAYREPFDLIFSSMTLHHIADTATVLQRVAQLLVTGGVLALADLEL EDGFFHDDPHEEVHPGFERSALEAALSAAGLQVRSYHIAHTIHKCNRAGV DAAYPIFLLVAEKV >Cag_1216 conserved hypothetical protein MQQKLLLISTIGPEHPEKATLPFVIATAAQALDVKVVMFLQSNGVILAKK GEAETIAAPGLVPMKELLDTFLEMGGTLMLCSPCLKERYITPNDLIEGAE IGAAGTLVSEIMSATSVVTY >Cag_0902 putative transcriptional regulator, XRE family MLQSQFIEAHLETDCPVFYDDDFVADVLPIMQRQHVSCAPVLSGGKPERL VTLPDLLAAEQTTDSDTLRLKELPLPQASGVDAGEHLFDIFRRLPHFPCD VVPVADDKGMFAGVIDKQQVIEQVARIFHVGDDSLTLELEVPKSGVKLSE IIALLERNEATILSFGMYTATSDNHESIILSFRLQTHDFFRLVQNLEHYG YQVHYTSQMFNAEDEVLREKAREILYLIDL >Cag_0197 conserved hypothetical protein MTHPPFHRLIAFLLFLGAEVLYLATMAPTFSFWDCGEFIATAVTLGIPHP PGAPLYLLLGRLFAMIPFVSDIGARVNVISTLASSATVMLTYLITVRFIT LYRKHPINEWSRSEQIAAYGGAAVGALALAFSDSFWFNAVEAEVYALSSL FTALVVWLMLRWHEEAPKAGNERWLLLVMYIIGLSIGVHLLSLLAVFAVA LAYYFKKHQVTLVSFGWLVVVSLALFFLIYVVIIKGLPVLFQVASWWGLL AGLLLLCAAIWYSQRHRKAMMNTLLMSLLLLVIGYTSYGMIYVRAQANPP INENNPSTTESFYAYLNRDQYGDMPLFPRRWSPEPIHQYFYEQYSSDFDY FTRYQMQKMYLRYFGWQFIGREHDMEGAGVDWSVLWGLPFLVGLVGAISH FRRDWQMGVVVTALFVLTGAALVIYLNQTEPQPRERDYSYAGSFFAFALW IGIGVESLWQWLAGRMKASSEKLPVVALSVVGLALLLVDGRMLMANYRTH DRSGNYVSWDWAWNMLQSCERNAILFTNGDNDTFPLWYLQEVEGIRRDVR VVNLSLANTGWYLEQLKNSSPRGAKPVNFSMSDGELATISYMPIDSVDAI LPSSTARRSLLRDTWRSGNNLPSAPLDTMVWPLKPGLTYDGQGYLRPQDL AVYDIVMSNFEDRPIYFALTVDPESMIGLDTFLRLDGLVCKVVPVKSSDP MSYTDPLILYQRLMQVYRYRNLANKHVYLEETSLRLSSNYTPLFVRLALE LATQPEETLAVTDGNGVPRLVRRGALALQVLDSSERFMPLSRYPVNPELA ASIIALYVQLGEKQKSSPYISYLEALSHTNSPLMEPRLFLILARAYYSLG REAEAKAIVQQLARELDQPELLKTFETTKK >Cag_1750 hypothetical protein MKILLHIGCGIKDKTQTTLAFHNEEWDEIRYDIDPDVAPDIVGNMTDLIT LAPASVDAIYASNCLETLYPHEVPQALAEFRRVLTEDGFVVINSPDLQAV CTLVAKGKVLDPAFVSPENGLVTPFDLLFGHRPSLAEGNMFMAHRCGFTS QMLSGALQAAGFSMVASMVRPKHYDLWTVASKSQRTEPEMRALASEHFPG LGVM >Cag_0291 Survival protein SurE MTHHDAQPSTDAEQSSNATLPHILICNDDGIEADGIHALATAMKKVGRVT VVAPAEPHSAMSHAMTLGRPLRIKEYQKNGRFFGYTVSGTPVDCIKVALS HILTEKPDILVSGINYGSNTATNTLYSGTVAAALEGAIQGITSLAFSLAT YENADFTYATKFARKLTKKVLAEGLPADTILSVNIPNVPESQIAGVIIAE QGSSRWEEQAIERHDMFGNPYYWLSGSLQLMDHSMKKDEFAVRHNYVAVT PISCDLTNYAALAGLEKWKLKK >Cag_1963 conserved hypothetical protein MPLMALLLLAFSLPAFAADAAPLAVQSDWWIWVLGLFTFSFFLGIIAVIA GVGGGVLFVPIVSSFFPFHIDFVRGAGLLVALAGALSAGAPLLRKGLANL KLALSMALIGSISSIAGAMVGLALPENIVQLSLGATILFISVIMLLSKNS AYPDISKPDSLSQALHIHGIYYDEQLKKDVSWQIHRTPIGLLLFIVIGFM AGMFGLGAGWANVPVFNLVLGAPLRVSVATSVFVLSINDTAAAWVYLHQG AVLSLIAVPSVAGMMLGTKIGAKLLTKVHTSVVRWIVIALLAGAGLKAFL KGLGI >Cag_1499 membrane protein MPSLRSISRYPFAIPLLSILAALLLSSLIIVAAGRDPLMIFQKMLRSVAG SPYGMGQVLFRTTTLVCVGLAVALPFHLKLFNIGGEGQLLMGTFAAAMAG LFLPQTVPPMVAIALCTLAAMAAGSLWALTAALLKVRFGVNEVIGTIMLN FIAQGITGYLLTWHFAVPSTVHTAPIIDSATIPTFSVLTGWFASSPANPS IIFVLLVALMLHLLLYHSRMGYEMRAAGLQPDAARYGGINATMHTLTAFA LGGAIAALGATNMVLGYKHYFESGMSGGLGFTGIAVALLAGAHPLWLLLS ALFFATLEYGGLTVNIWIPKDIFMIIQALTILIFISLSALGKRAN >Cag_0588 conserved hypothetical protein MIHSIRLDNLLSFASGNPTLPLQKLNVFIGTNGAGKSNLIEALDLVRATP RSPSNNDFQRVISRGGTIMEWIWKGSPDTPATIELIMDNPYNSHTNEKQP IRHLFSFKGEQQRVIFVDEIIENESPYHSNNEPYFYYRSYNGKPVINSAI AGERKLQRDSINEELSILAQRRDPEQYPEITKLAEIYEEFRLYREWTFGR NTIFRNPQRSDLRNDRLEEDFSNKGLFLNRLKTHKPKAKTAILEGLKDLY QGIDDFNISIEGGTVQVFFTEGEFSIPATRLSDGTLRYLCLLALLCDPEP PPLLCIEEPELGLHPDIIPKLADLLIDASQRTQIIVTTHSDILIDALTEI PESVVVCEKNEGKTTMQRLNSNDLAEWLKHYRLGQLWTRGDIGGTRW >Cag_1389 nucleic acid-binding protein,contains PIN domain MSYLIDTNIIIYYFNGLTNDESLHSILANNFKISIITKIEFLGWGQFLSN QNLYIKAKSFIHYATIFDINDAIAEQTILLRQQFKTKTPDAIIAATAMVK NLTVVTNNTDDFNRLGIKTISVTMQ >Cag_0143 Competence-damaged protein MRAEIISVGDELLRGQRVNTNAAVIARMLSAIGVSVSHIVACSDDEADIM ATCSAALGRAEVVLVTGGLGPTRDDRTKHAIQQLLGRGTVLDEASYRRIE ERMAARGSAVTPLLREQAVVIEGSHVIINSRGTAAGMLLDCGEPFAHHHL ILMPGVPVEMEAMMHEGVIPFLTSLSNSVICQTPLKIVGVGETAIAAMLV EIEDAMPPATMLAYLPHTAGVDLMVSSRGNSREAVEAEHQQVVDAIMERV GTLVYATREISLEEVIGEMLLRQTFTVAVAESCTGGLLASRFTDISGAST YFQQGFVVYSNEAKERALGVPHETLVAHGAVSEEVAQGMALGCLEKSGAD FALATTGIAGPTGGTPEKPLGTLCYAIAVKGGGVVVCRKVVMQGTREQRK VRFSTAVLREFWMLLKEREASEE >Cag_0303 peptidase, M16 family MAKPRKFFPVLLFAAMVLFFLHLTACSPTKTLMNSNSAYPYTTIQGDSLH TRIYKLKNGLTVFMSPCYDEPRIYTSIAVRAGSKNDPAETTGLAHYLEHM LFKGTDAIGSLDYHKEHPQLEKITALYEEYRSTANPEKRAAIYKMIDSLS NVAASYTVPNEYDKLLSSLGATGTNAYTWVEQTVYINDIPSNKLDQWLTI EAERFRNPVMRLFHTELETVYEEKNMTMDSDSRKIWENLFAQLFQKHQYG TQTTIGKAEHLKNPSIKNVMEYYRSHYVPNNMALCIAGDFDPDATIRLID EKFSVLESQPLARFTVEAEEEITAPRVMHVKGPESEELVMGYRFKGVNSS DADYLTLIDKILFNHTAGLIDLNLNQQQKVLDASSMLVLMKDYSAHLLTG KPREGQSLEEVQQLLMEQIELLKQGEFPEWLLEAAINDLYTEQLKQYETN RGRVEAYVDSFIWGMEWQAYMQQIERLHKITKADIVAFARKHYSTNNYVA VFKEHGTPESEAKIQKPPITPLTVNRDTISTFAQNLLERPSALTQPRFLD YSKDISFYNVTDDITLHYVHNNENDLFSLFYVFDIGKNHSKKIDLALDYL SYLGTSKLSPKAYSQEMYKIGASFSAYTADNYVYLKLSGLHKNAEAAIRL LEELLMDAQPDEEALGKLKAGTLKERADDKLSKKKILFEAMANYGKYGAH SPFTNVLSNREVEQVRSQELLDELRNLLNYRHRVLYYGPESAENVLSELR SVRHYPATFMATPSLDLFKPLEVTENLVYVVDYDMTQAEVMMLMKDETYN SATLPIVTLFNEYYGGGMSSVVFQELREAKALAYSVFSVYRTPKQKGEHN YIISYIGTQADKLPEALEGIGDLMKTLPESPQLFETAQKGIEQKIATERL IKTEILFNYEEALRLGHSHDVRKDIYDATQRMSLEDVKAFHKKHFSNKKQ VMLVLGNRKNLDMATLRKYGTVRELTLKEIFGY >Cag_1273 conserved hypothetical protein MKPPFLITTLNKAHDRNAFYSGSEMLDRYLKQQVTQDIRRNLTACFVALN NEKQIAGYYTLSSASIALDALPESLIKQLPRYTSLPAARMGRLAVAKTYQ GMGLGATLLTDAIMRAKQLNREIGMYALLVDAKDEHAAMFYLHHGFIRFT NSPQTLFLPLSQISLQN >Cag_1174 GTP-binding protein HflX MNTVTPENQREKAFLVGIYSPPEVPRSLVEEYLAELAFLADTAGADVLDT FIQERKVRDPSYCIGRGKVDEMEAYIKSEKIDVVIFDDDLSPGQARNLER AWGCKVIDRTGLILHIFAIRAQSTQAKMQVELAQLEYILPRLSGAWTHLS KQKGGIGNKGPGETQIETDRRLVRNRIALLKKKLREVERQHYTRTRSRQN VSRVSLVGYTNAGKSTLMNALCPQAEAFAENRLFATLDTKTRRLELKINK LVLLSDTVGFIRKLPHTLVESFKSTLDEVLQADFLLHVIDISHPSFEEQI AVVRDTLREIGVQHDQIIEVFNKIDALEEPTLLREMGDKYPNAVFISAVR GINLSLLKEIIGEQLARDYTERHVRLHVSNYRLISYLYDHTEVVEKKHED EMVELTIHVRNHQLPQIDAMIQAAAEAPHEP >Cag_0380 DEAD/DEAH box helicase-like MSFQTILEKYRRISFSERDKGNRFERLMQAYLQTDRQYATQFKKVWLWNE FPGRHDLGGSDTGIDLVALTHGGDYWAIQCKCFEASATIDKASLDSFLAT SSREFKNEQMQTVRFAERLWISTTNKWSSNAEEAIKNQNPPVTRITLQNL VNAPIDWEKLENGVHGEFARREKKKLYPHVLEVRDKVVDYFKEHERGRLI MACGTGKTMTSLKIAEKLTNHKGTVLFLVPSIALIGQTLREWTSQADETI NPICICSDPEITKKKNTTDQDLTSTIDLAWPASTDANYILKQFQHYKNKS NNGMTVVFSTYQSIEVIAKAQKVLLKNGFSEFDLIICDEAHRTTGYTEPG MDDSAFVKVHDGNFIKSKKRLYMTATPRMYNVDARSQAAKQAIPLWSMDD EEYFGKEIHRIGFGEAVEKGLLTDYKVIILTLNDKDVPPAVQKMISNGKT EIKTDDLTKLIGTVNALSKQFLGNESIIVDGDELPMKRAVAFCQSISNST TIAASYNLASENYLDALPENKKAKMVTIQAQHMDGTMAAPQRDQMLNWLK EETSGNECRIITNVRVLSEGVDVPSLDAVLFISAKNSQVDVVQSVGRVMR KSDGKKYGYIIIPVFVRSDEEPENALDDNERYKVVWTVLNALRAHDDRFN ATVNKIELNKKRPNQIIVGGADTAFDGDGNPIDKRRDGYDQSKEIGQQIA IQFEQLQDVVFARMVQKVGDRRYWEQWAKDVAVIAERQIERINYLINEKK EQRAAFDKFLLGLQKNINPSINEEQAIEMLAQHIITQPIFDALFEGYSFV KSNAVSVAMQSMIDALEKGSNLAEQDETLQRFYDSVRKRAEGIDNAEGKQ RIIIELYDKFFKTAFPKMVEKLGIVYTPVEVVDFIIHSVNDILKKEFNRT ISDENIHILDPFTGTGTFIVRLLQSGLIDINDLERKYKHELHANEIVLLA YYIAAINIENAYHDAISGYRNLGLGFGEENLVTHRYLNTNANFQRTNCLA GSDEFGRDDLQNNKELSERGDVWLDESNKESSEFNSGKHSRRIWEKEQGR ISTISGNSERITYGVRDTCFDSTENSNSECDGNGNNIGTNSNSGKIIDSS SEISNTQTLNPIPYEPFDGIVLTDTFQLGETKEGEIQYEEMLKKNSDRVE KQKKAPLRVIIGNPPYSVGQKSANDNAQNQKYEKLDARIAETYAAGTNAT NKNSLYDSYIKAFRWSSDRLSKEHGGIIAFVSNGAWLDGNSNDGFRKCLE KEFTSIYVFNLRGNARTQGELRRKEAGNVFGGGSRTPIAITLLVKKGKKD A >Cag_0329 GTPase EngC MKQVVEHVLVGTVTEVAGTSYIVQGDDGTLYRACTVPSTKSANSDASLVA VGDRVELKASVSGHAGYEAIITNVLARRTTLARQRDVRRNRSKERVQVIA ANIDQLVAVVSAFEPPLNRRLIDRYLVFAESEQLPILLVVNKCDLDDEED GSSYVREMMHPYHALGYSVLYTSAENGEGVEELRQALAHKLSAFSGHSGV GKSTLINMLSGQERLRTAETNVKTGKGLHTTTNAVMLVLPDGGAIIDTPG LREFTLADITRDNLRFYFREFLPVMAQCAYSSCTHTVEPECAVRNAAESG TIDPERYESYLALYDSIAE >Cag_1074 xanthine/uracil permease family protein MQKYFEFERLGTNYRQEIIAGITTFFTLAYIIIVNPAILEAAGIPKEASL TATILTSIFGTLLMGLYAKRPFAVAPYMGENAFVAYTVVHTLGYSWQTAM AAIFISGVLFTLITIGGLRQWLAEAIPATLKHSFSVGIGLFLAFIGLNDM GVVALGVAGTPVKLADVTQLPVMLSLAGMLVTALLLIRRVTGALLIGMAF ITAAFLLLGLTPLPTALFSFPPSIAPIFMQIDWHGALTWGFVGVIISVLV MDFVDTMGTLFGLSSRADLLDENDNLPDIQKPMLVDALSTIAASIFGTTT AGVFIESAAGIEQGGKSGFTAVVVALLFALALFFAPILTIVPPYAYGPVL LLVGMFMMQSVTRFNFNDYSELFPAFLTIALMVFTFNIGVGITAGFIAYL LLKLLSGQFRDIKSGMWILALLSLSFYLFYPYH >Cag_1148 hypothetical protein MRHQGASILLYNQQHEVLLVLRDNLPFIACPNTWDAPGGHLDAHETPLHC IVREMMEEMELDVSTCSHFKSYEFSNRTEHIFTMQTDVLNTATTPLHEGQ MIRWFTVADALQLSLASDMEVVLHDVGIWLEQQNNGTEDCGNV >Cag_1052 conserved hypothetical protein MKERWVVVIGGGAAGMAAAVSAAEQARYLGVDCHITVIEKTHQVGSKIRI SGGGKCNVTHVGTSAELLEKGFLRAAEQRFLRSALYAFSNNELRALLQQQ GVATTEREDGKVFPVAGEASVVAEAFRTLLQRLKINCELHAPVQAIKVHG QQFHLITLHGDIVADAVIVATGGVSYRHTGTTGDGLRLARALGHTVVEPS AALSSIMVQPHSLVALAGAALRGVAAVARAGKLRAERQGDILFTHRGFSG PAMLSLSRDVANMQRSQREAVHLAADLYPQQLHDELEALLLQHSKKQGGQ LVRKFLQVSPIGMLLLKSETMPYGTIPNAMVPLLMRQAALDDEVTFATLS REHRHQLVVTLKQFQLGTVHNVSLDAGEVSAGGVALSEVNPKSMESRLVP NLYFCGEVLDYVGEIGGYNLQAAFSTGWMAGKSAVNKLLTAL >Cag_0970 conserved hypothetical protein MRVEKLSTLLLCRIALAISWIYQGAVPKLMCQSSGELELLGHIIPIYKWA CIAMQWMGYGEILFGVFLLIARWQWAFWLNIIALIGLLFFVGIFEPTMLT LPFNPLTLNVALIALSLIAIRELRQ >Cag_0837 hydrolase, alpha/beta hydrolase fold family MNKVDNNNHSEWPAEAISQFATINGFNVHYRIAGKGEPLVMLLHGSFLSI RSWRLVFGELAKHTTVVAFDRPAFGKSSKPRPSTTTGANYSPEAQSDLVI ALMRHVGFQKAMLVGNSTGGTLALLAALRHPNNVAAIALAGAMVYSGYAT SGIPAPLKPLFKAASPLFARLMGKMITKLYDRTMYGFWHNKERLSPDVVA AFRNDFMQGEWARGFWELFLETHHLHFEERLKGIVVPSLVITGDNDLTVK TAESERLANELPGAALAVIANCGHLPQEEQPEAFVQALLPFIEKVRLHL >Cag_0188 conserved hypothetical protein MNADNFRLQLGSKEYVPIIIGGMGVNISTAELALEAARLGGIGHISDAVV TYICDQLFKTSYVSRKRKQYAAYSNSPDKSAVLFNLEELAEAQKKYIENT IARKKGDGAIFLNCMEKLTMNNSAETLKVRLSAAMDAGIDGLTLAAGLNI RTLDLISDHPRFRDVKIGIIISSVRALSIFLKRAVRLNRLPDYIIVEGPL AGGHLGFGADNWHMFDLKTIFTEVVDFLAKEDLHIPVIPAGGIFTGTDAV EYLQMGAGAVQVATRFTISKEAGLPCDVKQHYLNAREEDIVVNMASTSGY PMRMLVQSPTLRYAIRPNCEGLGYLLENGGKCSYIDAYYEALENRKEGEP LSVKIKTCLCTGMANYDCWTCGQTTYRLKETTNRLPDGKWQLPSAEDIFL DYQFSSDHAIQRPKPEA >Cag_0598 conserved hypothetical protein MEENKSQIIIYQTENGETKLDVRFQDETVWLTQKLMAELFQTTSQNITIH LKNIFEERELEEDATCKDFLQVQKEGNRKVKRNQKFYNLDAIISVGYRIK SHVATKFRQWATQHIKEYIVKGFVLDDERLKNPDLPFDYFEELERRIQDI RTSEKRFYRKITDIYATSVDYDPTFDISIDFFKTVQNKLHWAITGQTAAE IISLRADSTKENMGLTNWRGDKIRKADVLIAKNYLNEEELTSLNNLVEQY LIFAQGQAMRRIPMHMKDWIEKLNGFLTLNDRDILNNAGSISHTLAKENA EREYEKFKDSEQKMITEDDFEKTIKNIEQKKKK >Cag_1773 conserved hypothetical protein MKPLRQQRWGATTQGIAIWLIALLWFLPIGMLQALTVPALTSRVNDYANM ISPNVRAELEAKLAALETTDSTQLVILTVPSLEGDPIEDFSIRVAEAWKI GQKGTDNGVLLIVSQADRKVRIEVGYGLEGKLTDLQAGRIIRNNIAPAFK MGEYDLGFVQGTNSIIAAVRGEFIASDKKSQKSNKPSMPLLFVILFVFYI LSQLMRGHRQSGPMAYGGPGGFYGGGFGGGGSGFGGGGFGGGGGGSFGGG GSSGDW >Cag_0485 hypothetical protein MKLGSPATGNDFFGREQELRDLWRYLESDHIRFPGVRRLGKTSILKRLEA DAAEHGLLAKWVDVSNIDSAKGFVALLEQAFPENTIKRFLSDKTKQVADW FKIIRKVEVTLPDEAGGGGFGIELGEVLLEWQHAANHLHHRLSNQPLLIL LDEFPVMLEKLIQRNRQEAEQLLTWLRIWRQSQGACRFVFTGSIGLQSLL ERHRLGETMNDCFAYPLGPYKPSEARDLWKYFAQNADENTWQVTDLVIDH ALSRVGWLSPYFLCLLLDESIRAARERQEEWPSKASGAASIEVEDVDDAY EQLLAERSRFHHWEKRLKDALAPAELDFCLSLLTHLSRKPEGLTLNQLSS RLAKREPDPDCRAQRIQELLVRLTDEGYTSSPDSNKRVQFLSFPLRDWWN RNHVR >Cag_0450 hypothetical protein MNITIDTSSLIAVIGNEESKEKIIKITEGSSLCSPLSVHWEIGNALSNMF KKGRILLEQAQLALFAYNEIPIKFIDVSLVKAINLSHSLNIYAYDAYVIQ CAKQTGTPLLTLDNGLKVAAQKSGINLLELQS >Cag_0702 probable phage-related lysozyme MQTSDNGLNIIRQYEGLRLKTYFCPAGKLTIGYGHTGTDVTSGMSITEAQ ANELLQEDVKRFATSVNKMVTTEVTQGMFDALISFSYNIGAGNLQKSTLL KKLNAGDKQGAADEFLKWNKSNGKPLAGLTARRTAERELFLA >Cag_1350 Protein of unknown function UPF0001 MESIASNLAAIHAQVAAACKQAGRKPESVRLIAVSKTKSAELVREAFDAG QLEFGESYMQEFLEKYESRALQGCPIQWHFIGHLQSNKVRSLVGKVSLIH GIDKLSTAEELSKRAVQQNLTVDYLLEVNTSGEASKYGMAPHTVLSEASA FFALPNVRLRGLMTIATYEREAARREFQELRELLEQLQAIAPDPTLVTEL SMGMSGDFEEAIQEGATMIRVGSAIFGWR >Cag_0656 nucleic acid-binding protein, containing PIN domain MTEKSEKYLIDSNIIIYHLNGENSATEFLRSNVSQSYISRITFIEVLSFD FMKDEKEDVLNLLRRFEIIDTTDAIAMRAIENRKLKKIKLADNIIASTAQ VDDLVLVTKNIKDFNGLNVRVLNIFA >Cag_1760 conserved hypothetical protein MTDLSHETLLRLHSELAAEYELAESSYQFGGKPFSFLHVRDSYALLDRIS PEEFVKDEQMPYWAEIWPSASALSTFFMDEVALEGKHLLELGAGIGVVSI VAAWRGAQVVATDYSIEALRFIRYNSLKNSVALTAERLDWRQVQRSDRFD YVVAADVLYERVNLLPVVLALDKLLKADGVAFIADPRRRMAEQFVELATE NGFVVTTHARRCQIGEAPVMVNIHQISRL >Cag_1471 conserved hypothetical protein MQHFSQSKRRFVLFWHTTAALAVLMLLYPLIGNFFIAWFVGAELERGALL QPEMLRSFVPSLRIVQMVGQVLLLAFPALLLAGWQRGCRAPWSPEVRMWM GLQPPFDGGVLVAGAGGIILLQPLLSLIAALQERYLWPALGEAGREVLQQ QESMELFLRTIADAHSLPEALFVLAVLAVTPAITEELLFRGYVQRNYMQV LSPAMAIVLSGLLFAFFHLSAANLLPLALLGCYIGYIYYCSGNLFVPMVA HFTNNALALIVLWFTPQEHTMAMQAERIALLLSPAWWFLVVGCTLLFWRL MRWLTDRIVRH >Cag_0092 conserved hypothetical protein MKPLKQVGESYFLLSQGEKQIEGAAFEEAEQSYRLAMTMARTIPTEEAFD YDGFDAIAHAGLSSALIGLGRYNEALVSVAEALRYFNRRGDLHSAEGSLW IAVICNKARALESLGRKDEAIKYYRMAGEMIAEKKGEIKQRDLLTELIEQ GLQRLEGAKPATAKQGYKAWWEFWS >Cag_0331 conserved hypothetical protein MNPANAHIDNQLRQVLERHPLIRLAILFGSVAKGTAGFESDLDLAVGAAR PLTVQQMMALIEDLVEMSGRPIDLIDLSTVGEPLLGQIIAHGRRIIGSDT DYVNVVLKHIYNEADFVPLQKRILKERREAWIGK >Cag_1639 oxidoreductase, short-chain dehydrogenase/reductase family MLTLHQKVAIITGSTRGIGKAIAQEFVRQGARVVITSSSPQNVEAACKEF PAGSVHGIACNVTSPADMERLVRESVAHFGQLDCFINNAGISDPFTNITE SDPEAWGRVIDTNLKGTYNGCRAALIYFLTNNKQGKIINMAGSGTDKGSN TPWISAYGSTKAAIARFTYAIAAEYRHTNISIMLLHPGLVRTGMVSTEHP TPELERQLRTFNTILDIFAQPPTVAASLAVKMASPWSDGKNGIYLSALSS LRKKKLLISYPFRKLFKKIDRQTY >Cag_0203 conserved hypothetical protein MSNNHSTNQRILLLLLLVISALFFTMIRYFLLVVVLAAIFSALAMPVYNR FERGLRGKRSLSAIMTLLTLLFIVVLPLAILLGLVVKQAIRLSNVAVPFV QEQLLTPSQFDHHLQSLFFYPELVLYREEILQKVSELATKFGTLLFNAIS SFTYSAVTEIVLFFVFLYTMFFFLRDGKQMLQSMLALLPLSHTDQYRLLD KFLSVTRATLKGSLVVGMVQGSLAGMALYMAGIESALFWGTVMSFLSLIP VLGSALVWIPAVIYLATIGSYPQALGVLLFCMIVVGQIDNIIRPILVGRD TQMHELLIFFGTLGGIGMFGFFGVILGPIVAALFTTIWEMYAESFGDYLS TIQKNRTSTLKD >Cag_0689 PucC protein MKKFNLVRLSLFQMGFGIMLGFLLDTLNRVMTTELRISATIVFGLISLKE LLAIFGVKVWAGNLSDRSQIFGLRRTPYILLGLVSCVFSFIMAPTAAYEV TVGGVGFAEMIPAMLQDVGLLKLALIFLLFGFGLQVATTAYYALLADTVD EANLGKITGASWTLMVLTTIVSTRIVGSYLDHFTPERLLFVAEVGGFIAL ALGLIAVLGVEQRNGEIKEGKEKHALSFAQSLQLLTSSPKTLFFASYIFI SIFALFANEIVMDPFGAHVFDMSVGTTTKLFRPTMGGMQLLFMLIVGFLL GRIGQKRGALIGNIMCMIGFGLLIGAAFSRDEQFLRIALVVTGIGLGASS VSNISMMMAMTAGRSGVYIGLWGTAQSLAIFIGHLGAGMIIDVVYHFTGQ YVWAYAAIFAMEIVAFAIATLMISHVSKEEFEAESKAKLAELALSAKG >Cag_1453 TPR repeat MSDATEYQENLREAERFFHYNKHGFLFAVSNDELVQRNLNSSLQQLLRGK GKTLLTYTWDTNPEALHPVKQLRQFQQNHLELNGLILNGLEPALEHNPNF LVQLNFGREGLAELRIPLLFWVSNRTLQRVNREALDLYNQRVSANLYFEH DPTLQQSDNSALRYIAQETVRANKSLAGVEERMKLLQQQLDEAEKQHVEP KTIANEIVLELLELYSQILGAEPLIHTLLNKYEAFIDRENPENCFKLARV LYEIGMRTEAMDLYQKALQTLRELAVRNPDIYLPHVATTLNNLGALQYTT NDYTAALASFTEALTIRRELAAKNPDVYLPDVADTLNNLGALQSDMNDYA AALASFTEALTLYRELAVKNPDVYLLYVAGTLNNLGILQYNTNDYAAALA SYNEALTLYRELAAKNPDVYLPDVATTLYNLGNLQYNTNDYAAALASYNE ALTLYRELATKNPDVYLPDVAMTLSNLGNLQYTTNDYAAALASYTEALTI RRELAVKNPDVYLPDVATTLNNLGALQSETNDYAAALTSFTEALTLYREF ATKNPDVYLPYVAGSLINMAVWYYKAPQANQQQSLAFVTEALHIALSLVE KIPNVQLNIDSAYNLLRAWGIEPEEFVKQVGTSGASGGE >Cag_1866 putative DNA-binding protein MQMKNNQSSFILFTTEDAKIAVDVRFEEETVWLTQEQMAVLFGKARTTIT EHIQNVFKEGELNEEVVCRNFRHTTQHGAIEGKTQETWVKHYNLDVIISV GYRVKSLRGTQFRQWATKRLNEYIRKGFTMDDERLKNIGGGGYWKELLQR IRDIRASEKVFYRQVLDIYATSIDYDPKDEVSLAFFKKVQNKIHYAVHGQ TAAELIFNRADAEKDFMGLMTFSGSRPYLKDVVVVKNYLNEKELRALGQI VSGYLDFAERQAEREQAMTMKDWAEHLDRILTMSGEKLLQEAGTISHEKA VEKATTEYKKYQQKTLSEAEYNYFESLKILESKIH >Cag_0341 TPR repeat MKVRNLVLFTLMAVSAEGYAAGTKAKSIVKTPSAATQAAETMQALPLDRQ TALNLAQSYLANGSSRQAELILSKLVLLYPDDEEILRETISLYEKSNRAE QTLPLYQHLLQLRPNDLELTLASARAYSWTGRKAESIALYEKVLKAGNAS EKVVTEYADFLYADKQYQKAIDLYKSVGQKGKLSKQHMLNTVNGFIALKK FDEAAKICNANVPLYPQDTDFLRLAADINFNAKQFEEAAGHYRQLLLKNP DDPGAYSKLADIAMAKNDFTEVARLSHKILALIPDHKTAMLSLARVSSWQ GDFTTSLTYYDKLIASPNPEPFYYREKARVLGWMGDFKQALSVYKSAVQK WPDDKAISAEAEAKKNYYNHTYRPAVKAYNAWLLAEPQQPEALFDLAQLY AQYGKWNNGLNTYNSLLSQIPAHRQAALAKQKIDFAASRMFVRSGVEYFS AKTKFIDATNHRQADTKSTSIYSSLTYPINERVSAFVNLDSKSYDFRIAK PNTPKNPVTYGLMAGAEYRNMPNIALSAGLGMRMNPGDVDNGLTGFINAN SQPVDNLHVGVTLRNDDIVTNTSSFNNQLEATRLQGRVAYNGYRRWQAGM DIAFDSYADNQSYDNSSLTVGADVVAHLLYEPQRLSVSYRLQEYGFDKNH ANHPQYYNYFWTPKSYTTHTFGLEWQHYLNRERFHGSNNTYYDIAFRVGL EQEGDISRQIHASINHDWNSRLATSLEGQYTWGTSAEIYQDSMVKAEFRW FL >Cag_0737 nucleotidyltransferases MIDSIEIVKKQIVDALMPLNPEIIILFGSYAYGVPNKNSDLDICIVEKEY SNKWKEKEKIRKLLNKIDMPMDILNPKLDEFEFYKNEINSVYYDADKKGI WLWKKNS >Cag_0591 proline iminopeptidase, putative MSYFASTRCRLYYEDSAEGDPSALSKPTIFFVNGWAISSRYWKPLVSILS DRYRCIIYDQSGTGQTLIKGYNPTFTIQGFTDEASELLEHLELHHSRNVH IVGHSMGGMVATDLCMRYPDALVSSTIIACGIFEETPFTSVGLMMLGGLI DVSMNLRSIFLMEPFRSMFINRAVAKAISKEYQDVIIDDFTKSDHAATNA VGKFSIDRNVLRTYTRHVLAIQAPLLCSVGMADQTIPPEGTLTLYEKRKA KSELQTSLARFEDLGHLPMLEATELFAQVLDKHFQQAQQLL >Cag_1545 NUDIX/MutT family protein MASRFRGVFKQSGVIPLFDDKVVLITARKSDRWIIPKGYIELGMSAADSA AKEALEEAGLVGKVGEHPIGKYRYNKSGRHFVVLLYPFFVETMLDVWDEV HERERCVVSPDVAATMVAHSDVGRLIRSYCASLDDDEAVLVPPHVASAIT G >Cag_0467 KpsF/GutQ MNTTPQTETATLTGIGRQILEQEAQAIAHIAEHLDHHFAEAIQVMVACKG KVIVSGMGKSGIIAQKIAATMASTGTTALFLHPADAAHGDLGVVAAEDVV LCLSKSGSTEELNFIIPPLRQLGAKIIVMTGNPRSFLAQNADITLNTGVA KEACPYDLAPTTSTTAMLAMGDALAITLMQQKKFTQHDFALTHPKGSLGR RLTVKVSDIMATENAVPMVRTNAAVTELILEMTSKRYGVSAVVNENGELA GIFTDGDLRRLVQSGRKFLALQAGEVMTARPKTVPPDMLARECLDILEEY RITQLLVCDNHQRPIGVVHIHDLLTLGL >Cag_1491 TPR repeat MLKDALGSYRGSLAELNKMVEVQPHNAELWFARANERSGCGDYAGAISDY TTALLLGLRFREAVNAYGNRGMARLAMGDMRGAMEDFSTIIARQPKNRSL LRTAYLKRAHLREKQGDEVGAQNDRKAADCINGK >Cag_1921 Twin-arginine translocation pathway signal MFFQCVMTNHQLSRRDFAKLLLSSTAGALLGVGVPSSRTYAATNRVVIIG GGFGGATAAKYLRKLDPSVAITLVEPKCQFYTCPISNWVIAGLKPMHAIA QNYNALRVRYGVNVVHATAVAIDALKNSVTLHSGKKLFYDRLIVSPGIDF RWNAIPGYSQKVAESVMPHGFQAGEQTLLLRKQLLAMPNGGTVIMCPPNN PHRCPAAPYERASLIAHYLKQHKPKSKVLILDCKEKFSKQELFLQGWERL YSGMIEWRAATAGGKVEAVNSAAMTVTTEFGDEKGDLINIMPPQQAGRIA FEAGLTDAAGWCPVHPITFESTLHPGIHIIGDACHAGDMPKSAFASSSQG KVASSAIAALLQGRVPVAPSLVSTCYSLLKPDYAISVANVFRLTIDGIVD VKGSGGVTPLDASVEHLQHEADFAWGWYENITRDTWG >Cag_1939 lysophospholipase L2, putative MPEPKPHTLAVLVLHGFSGTLESVKALREPLQALGLPVAMPLLAGHGEHS PEALRGVTWETWLADAEEALLQLSNQALQVIVIGHSMGALLAVQLAYRYP TLVDSLVLAAPALRIASIFAPGRLFHSIAPIVSRLVKNWRLQSEEDRRNA VYGDLHYEWVPTKTVLSFFELVKKTERLLPHITHPALILHCRCDNTVLPE SAEIAHSSLGSMPAAKSLVWFDKVGHQLFCATERDCVVLEIVGFVKSRFR >Cag_0150 hypothetical protein MEREQASTLRTIFSMPVIVAALGYFVDIYDLVLFSIVRVPSLKSLGLSGQ ELIDYGVYLLNMQMIGMLLGGFLWGWLGDKKGRLKIMFASILMYSLANIA NGFVTTLPMYAALRFIAGVGLAGELGAGITLVAEILPTKIRGYGTMLVAS IGVSGAILANYVATTFEWHNAFFIGGALGLLLLAARFKVSESGMFQAMAD HRGTNRGNMAALFTDRSRFLRYLNSIMIGVPIWFVVGVLITFSPEFGEKL SISAPVSAGNAVMYCYLGLVFGDLSSGLLSQLLKSRKKVVLLFMVLTVAG VALYFTQHGQTPQFFYMVCAFLGFASGYWAIFVTVAAEQFGTNLRATVAT TVPNLVRGMVVPITMLFQYFRGMFGMELGALVVGVICIVGGFLSLMALSE TFHKDLDFYEEFL >Cag_1138 putative metal-dependent hydrolase MLQIGNYRIAALLVQEFALDGGAMFGVVPKVFWQQQAPADALNRVTLAAR LLLISGAGRNMLVDVGLGDAWNDKQRSIYAISPFRLREELQRFQLTADDI TDIILTHLHFDHIAGAFAVENGGLVSLFPAATFHVQERNLQVACQPHIKE KGSYLSPYIDALMQQCNVSLKQGECELCEGVSLLVSNGHTQAQQLVKISD GKQTLLHGGDLLPSAAHLPLAWITSYDVEPLQAINEKTALLETAMDEEWL LFFGHDPRYAAARIRRGEKGAEVAEYFEEL >Cag_0378 helicase domain protein MSNAKIFYHDIGDYYSREEKLALIKKYHSLAHPNMQWQQLQPNEHGDWIS QRNDLFETFIPLGDKENKKADTFFVPFYSRGLASARDSWCYNSSKTTLEN NIRTLIEFYNQQRIAYFNTIDKDSKITVENFIDYDSSKITWNRGLKNDLE KNKAIDFNRDYIITGLYRPFNKQKIYFARELNDMVYQIPKIFPASNSKNY VICVSGVGASKDFSVLITNCIPDIQLQFNGQCFPLYYYEKQEKSNPTLFD AAKEPDYIRRDGVSNFILEQAQKRYGNRVTKEDIFYYVYGILHSPDYRTR FASDLKKMLPRLPLVENVKDFWHFSKAGRELAELHINYEAVPPAKGVILL YNNIPTEEIEKGLQSSKMQEINYMVTKMRFPKKDQKDTIHYNNQITITNI PLKAYDYIVNGKSAIEWIMERYQITTHKESGITNNPNDWATEVGNPRYIL DLLLSIINVSLQTVEIVNNLPKLEF >Cag_1627 3-oxoadipate enol-lactonase, putative MLRCITNVTAETAGRYPITVLLLHAFPLSAAMWQPQIEALEKAGYGVIAP HAYGIEGSPEIAEWNFTDYAVELAQLLESLHIASVTVVGLSMGGYQAFEF YRLYSNKVKSLVLCDTRAEADAPAARATREEFMKAVASTGSAEAIRRMVP NYFSPAAYGANSTLVAQVEAIINKQSPEVINAAMRAIMLRADATPLLGSI SCPTLILNGEEDSMTTKETAATIQAGINGSTLQLIAGAGHIANLEQPELF NQALLEHLSLLQ >Cag_1400 chloride channel, putative MNILSHKKGRLARRIIAFFYILFRRSRYFKGSSQNFIKLTLEYILVQLNL NQDIPFLFVAVIVGLVTGYVAVLFHEAIKAISNFSFNDLRLLGDISFIEQ YWVFFLPFIPAIGGLFVGLYNTFIIKKSSRHALASVIKSVAHNDGIIDRK LWFHKTITSVVCIGTGGGGGREAPIVQVGSAIGSTIAQWLRFSPEKTRTL LGCGAAAGLAAVFNAPIGAVMFAIEVLLGDFSVKTFSPIVIAAVIGTVLS RSFLGNRPTFDVPDYTLVSNIELLFYCVLGVLAGLSAVMFIKTYFAIEEW FDKLQIRRNLPVWIMPAIGGFLSGIICIWLPGLYGFSYNVISNAVYGNET WYNLIGIYLLKPVVAGLSIGSGGAGGMFAPAMKMGAMLGGMFGIVVHQFF PLITATSGAYALVGMGALTAGVMRAPLTVILILFEITGQYEIVLPIMFAA VTSAVVARLAYRHSMETYVLEKQGIKVGFGIALSVAEQVVVSDILDKKRT QFVSTTPMKKILEVFYSTPETNFLIVDKQGVFIGNISLDDIRILLKNGCN DDLIADDIVNKNVPVLYTNSRLDEALKLFELSDYDILPVLDTKNNILQGV LRQEKAFASYRKQLNLYGSDYSDKSVHQGVK >Cag_0101 conserved hypothetical protein MKTAFVKIWGELVGAVAWDDATGYATFEYDAKFKSKGWELAPLQIPVNAT KSNFSFPALRKKADPALDTFKGLPGLLADMLPDRYGNELINLWLAQKGRP LDSMNPVETLCFIGTRGMGALEFEPTTLKESKKAFSLEIDSLVEITQKML TKKEAFVTNLQENEEKAILEILRIGTSAGGARPKAVIAYNERTGEVRSGQ TNAPQGFEHWLLKLDGVSEVQLGASHGYGRVEMAYYNMAVACGIQIMPSR LLEENGRAHFMTKRFDREGGAAKHHIQTFCAMKHFDYNLVTNFSYEQLFQ TMRELKLSYPDAEQLFRRMVFNVVARNCDDHTKNFAFRLKKDGKWELAPA YDVCHAYQPKHQWVSQHALSINGKRTNITKDDLLTIGKSIKNKKAAETIE EISNTISQWKTFADEVKVLPKLRDEIAATLIRL >Cag_0034 conserved hypothetical protein MLLSLPNWIIHISSSLEWGIGAALLFHYGQLTERRDIRTFALAMLPHWIG SFCVLAYHISGDTIPLLLDMSELINLVGSTALLWATYKLFQSTGGWKAAH GIVPSIAPIGYLSAIVIAGKPQSWLGEDIFDTILQLSSIVYLAFLLLLIV IYRRDKTIFSGLTVAGFWFVLVFISITIFCMYLATQMRGYPTLSHDDLLH GMAESLLSISNLMIVLGAHRQIKAFKGQRG >Cag_0976 HAD-superfamily hydrolase subfamily IA MNIKALIFDLDGTLLNTLEDIANTLNATLARHHFPTHSLDECRFLVGAGL RELIRKALPSEAAADDNMVDKLLSEFIEMYRTSWNQLTRPYEGIIEMLAA IAERNLPMAILSNKADHFTQQCAEELLPRPLFSVVLGHRDGMAHKPDPAG ALFVAAELGVEPTSVVYVGDSSIDMLTATRAGMYAVGVCWGFRPESELRA HGAQSLIHHPLELVTLIDTLREANA >Cag_0873 TPR repeat MSNELLDEKLRLLVKALRADTFHFVLIINNHPSVYNDVVEWLKQHITDRE IRELRLTGKHYREVSDVLQAAKQDIVTIPDFDELFTKENDDVRVALNQRR DFLAFQRMNLVCFLSPDTFRLLPKKIPDLWSLRSLELDIAYDIKEPLFTI PTTPFISSLGGTTIAEKEAEIRRLTYQLSQIDPANIALRKELEAQLVTLQ MEVPQRFEEATSLHDTSQKNIIAAEVTATQDETVTNISTSILRSIFAVLP SEPITLIALQELLPNIDNLETALQNLVAENVLNYNSTTKSYKCSPVVQEV TRKQQSDHLFADIEQLISRLIDRLAYEPNTGHVTEVSYETAALYVRYGET ILRNCTDVEYQLAALADRIGNYHTATGNLDKALSFHAECLRLSKELYEAY PNNVFFKNGLAISYEKLGNTHTSLGNLDKALTYYEQYYKLSKELYEAYPN NVSFKFGLAVSYSKFGNTHTSLGNLDKALSYYDNETRLFEELYEAYPNNV SFKNGLAISYSALGQFYRDHRNDSDIVKNYFQQAEKVWAELVSSSPQHAE FKQNLSWVKNQLQSLHS >Cag_0375 ATPase MVLLTVEGLEKKYGLKHLFEDVSFGVDERDKIGIIGANGSGKSTLLKILA GVEQPDKGKLMVANHKRIAYLPQDSPYNPNDTVLQAILASSGKVMDLIYE YELVCKKLEEHQGDSVALMERMSTLAHELDVCGAWELESNAKTVLGRLGL YDLTARMGTLSGGQRKRVALAHALVVPSDGLILDEPTNHLDADSVEWLEQ YIRRYQGAVLLITHDRYFLDRVATRMLELDGRTATTFTGGYSSYLQQKAE LEEQAVRDERKRQALVRQELDWMRSGCKARTTKQKARMQRAESLVYSPKQ EKSKELEIGFGAGRLGDKIIEFHKVSKSFGDKLLLKNFEYHLQKGDRIGI IGANGSGKTTLFEMIAARTTPDSGHIEIGKTVRLGYYDQESRELDDSKRV IESIQEVAEQITLKDGVVLSASKMLERFLFPPSTQYSLVKTLSGGERRRL YLLRQLIASPNVLLLDEPTNDLDIPTLRVLEDYLDNYQGCLLVVSHDRYF LDRTVEYIFAFEENGQVRRYPGNYTVYLEMKASIASAGEPKVAKKTTEPP KPVAVQSTKPAALSSKEKRELEKLEVAIAAAESRQAEIAVQLSAAGNDFA MQQQLGTELQALQQQLELDMERWSELAEKAG >Cag_1875 TPR repeat MSIQFVKHNPAFLDAGHFLEQVVARRADVAHLLGHLGSCEPIGTVRHLFI TGQRGSGKTFVVRRVALAVEQHNALRSRYYPLFFSEESYSVSSSAEFWLE ALFHLARQTANEQFAQTYQALREEVDEERIRQIVLPLLLDFADNQGKTLL LIIENFSMLLADMASSREGEVLAQTLLQEPRFQLLATGTFTFDSLEVPFG KYFSSITHHALEPLSNADCNALWQLYSGTPLADGQIRAMNILAGGNARLL VTLARVANGRTFSQLPEILALALDEHTEYCKSYLDVMAPVERKVYLSIAE LWAMVSAREVSLAARIDINKTSAYLNRLINRGAISVERQAKRNKLYGVTE RIYSIYYLMRRHGWQSGRVRALLDCMLAFYDPASFPDRLSDSERERCTSV AEALTMLPEREHVEQCHQNFLDAKRHSRFVPSLAYFVRFVQKSDVSHADE SSSPMLGESFRQAFELLETESYTEALPIFDAIIMVSRHSESEQAIGQRYG AMIGRGVALGNLERYEEGFRLLDEVAATCQERSVRRRLKWGLLALLGKAS VLERAGRIDEAVTLYDELVSRYRRQQELECSTLVAAALLHKSLLVSKSKG GEEEIAMCDTLLELYSERVELPLVELVCAAWRNKAIAFEALNRNDDALLA YGKLLALCRQRSEPHMMQHTAHALQNMGVVYGKMHRYSDAEHCFMEVQSL APQQARAHLMLLKLLVKMEEQQHAVLSELRNYLAATSLALRALPQTIELF ITCAVAGYAAEALELLVASPLAVSLEPVQAALQHATGDEVRSAPTIVEVA NDIVAAIEARRNA >Cag_1496 Nuclear protein SET MPPLFSAECLTPDSFNCLLIIAAVVLLLGIVLGFVVALKAPWIHRKTVIA KASTVSGRGVFALVNFREGDIIERCPALEVRDRDVDGELLNYVFYGSTEQ HRLVAMGNGMLFNHDNNPNVAYYREDTPLGAELVLYALRNIRKGEELFYS YGEAWWATRQNG >Cag_0707 conserved hypothetical protein MKIEYHPAIEDELRQIIKYYKESSAGLGTEFLNEFERQVLKIADNPRRWV AVKGTIRRSLMRRFPYVIYFRLVNDETLRVTVVKHQRRHPKKGVNRR >Cag_1557 conserved hypothetical protein MQYSEAKSGRVFVLRLEDGDVVHECLEQFAHKHGIERASFIAVGGADKGS VLVVGPEDGRTSPVVAMTHELYDVHEICGTGTIFPDDSGRPMVHAHFACG REENTVTGCIRSGVKVWHVMEIVLTELLDNHASRKTDAATGFKLLAME >Cag_1367 hypothetical protein MKLTLYIIYTIFFIAMGCGIYIGFQTADGLVDNNYYHNSTNYFQTKAREE KLGIVINKPDTLTIGTNTFTVAVTSHGKPFEQGNISLLLGNVSTNNNDTT LTMQETAPGIYQTTISIPYKGKWFTRLELYHQQQLITTKQWFFSVQ >Cag_0681 conserved hypothetical protein MSGLKYILDTNIIIGLLKANPTAIALAESVRLDLGECAISQITRMELLGC KGISEAEESSIHQFLACCVVLMIDETVECEAIRFRKHSSLKLPDAIIAAT AQVHNLNLLSLDERLVSQYERAVKDNR >Cag_1051 hypothetical protein MRKIILSKRASKRLEKLLEYLEFEWSFKVKNDFIKQLDKSLKRIQKYPES CEQTRFVKGLHMLVVTKQTSLFYQFDSETITIVTLFDNRMNPDTLKKETA >Cag_1107 ComEC/Rec2-related protein MTEPSKPHSAVKPKRAIGLSLAPYPAVRLLFFVIIGIVVGVVAPFSLTEW LWSVALSFALLLLTWLYERIRYHQAAVPHFGMAIMYCFVVVSVFATLSAY RLHYAPRNGLTQYAGRTVILYGSIESRPERSKGGASWVMEVQELFEHGKT VTLRDRTKVFMRMSADAHLAVQKGDMVRVKGKLDLLPEAANAGEFNPRHY GAMQQISVQLYAAGPWQVLYEGEKRLHPFEQYMVQPTYRYIMQALAALLP DGEERKLAAGVLTGERETMSEEVFEAFKRTGTAHILAVSGMNVGLLALII QVFLQRLKITPFGRWTAFLLFVFLLILYSNVTGNSASVTRAAFMALVLIA GETVGQKTYPLNSLAVADLIILLINPLDLLNPGFLMTNGAVLALFLVYPL LHFPRPKNRTLLLSIVWFLLDSIIITLAASIGVSPVIAYYFGTFSLISFV ANIPVVFFSTLLMYALVPMLVVYGLSQALASVFAAGAFWLARMTLQSALW FSNFSFASIPLKLDAVEVWLYYIVLAAVLLLATRKAWSRVAITFLLGVNL FVWYSLLFRPNPIAPTLLTVNLGRNLATIVSNGSESVLIDVGKKPKDYQR ISAQFERFGIVEPTAVVQFYSPDSLILATPTRHHFLRSDSLLRLSSMVIT RPDEKMVKLWSRNQSYFLASGTSRLKAGEPYCGDVACIWIYRFGEKQRIE LERWLTATKPKEALLVPSSFLSRVQLVALHRFAAAYPHVEVRSKTKQVVV NGGER >Cag_1561 zinc protease, putative MNQIATTSFPPSYTLQVNARAKYPRLKMVPHKGLVVVIPVGFSKKHIPDL LKQHEEWIRKIEHHFEAHRQTAEEAFEVLPTTITFSTFNEAWQLNYHQAA RNSVRLTMQGEGQLLLSGNIGETALCRQVLTKWLNRRADVLLSPRLTQLA ASVGMCFSTTTMRCQQSRWGSCSSKGAITLNSKLLFLPEELVRHVMLHEL CHTLHMNHSAAFWAEVARFDPQWKHHKREMKDAWKFVPRWLTAL >Cag_0936 MFS transporter family protein MQRAFRAFTLRRKRLQAQRNYQILFWLLFDFANTAFSVMMVTFVFPLYFK NVICSAQPYGDALWGLSISVSMLLVALVSPFLGAAADVLGRRKHFLLMFT LAAVVGTALLSLTGAGMATVAVGLFIMANMGFEGGIVFYDAYLKELASER SVGRLSGYGFAMGYLGALSILLLVSPLLADGINVANAAKVQQSFLVAATF FALFAAPLFFVIRDRRSLSTSPHPSTKKILSTATSVKNVVVATHHVERKI FGQGAWRNLLQTVQHIRRYPDLARFLLACFFYNDAILTVIAFASLYAEQT LGFSSRELMHFFMVVQIAAMVGALFIGFIADTIGAKRALVVTLILWIGVI AAALFAESKELFFYTGMLAGISMGSSQAASRSMMTRLTPQEHVTEFFGFY DGTFGKASAIVGPFLFGVISSQAGSQKVALSSLLIFFAIGLVLLTKVKSS STNVPSLQ >Cag_1936 conserved hypothetical protein MQILDLSHTIEPTMPLYLGTPSPSFQPIASIAHDGFAEQLLTFSSHTGTH VDAPSHLFKQGATVEAMDVSRFVGRAVVLDVRSLLGEEIGLELLLPHEAL VRECQFVLLYTGWSCFWGKEAYFGHYPCLSLEAAQWLTSMELHGIGVDAL SVDSADSHELPIHRILLERGMVIVENLRGLEPLLHQRFLFSALPLKLAGG EASPVRAIAKVDGVF >Cag_2004 sulfide dehydrogenase, flavoprotein subunit MSNGLSRRDFNKLLLSGVAGSTIGLFGNSGTLFGATSKRVVVIGGGFGGA SAAKYLRKLDPTIQVTLVEPKSVYHTCPFSNWVLSGLKNMEDIAHFYDVL RNRYKVNVIADTAVSIDADKSSVTLQTGKTLYFDRLIVAPGIDFKYDSVQ GYSENVANSVMPHAWQAGPQTILLHKQLQAMPNGGKVFISAPANPFRCPP GPYERASLIARYLKEQKPLSKVIIFDAKESFSKQGLFKQAWERLYPGMIE WRASTMGGKVVSVDAATMTVTTEFGAEKGDVINIIPAQKAGKIAVDAGLT DASGWCPINPISFESTLHPGIHVIGDAAIAGAMPKSGFAASSQGKVAAAA IVRLFQGKVPAPPSLVNTCYSLIDKNYAISVAGVYKLAMTGIVEIKGSGG LTPMNADADQLEQEAMFAQGWYDNISQDVWG >Cag_1575 conserved hypothetical protein MDTLTKLRILSGAARYDASCASSGSNRSGASCGIGNTSQSGICHSWSDDG RCISLLKILLSNDCCYNCAYCVNRATNPVERASFTAREVVDLTLDFYRRN YIEGLFLSSAVMQSPDATMERMVAVAETLRSEERFGGYIHLKIIPGASSE LVRKAGLYADRISVNIELPSQVSLERLAPQKHRAAILEPMALIGREINTS LVERQHSHRAPRFAPAGQSTQMIIGATPESDFQILRLSQGLYKKMNLKRV YYSAYVPVSEDNRLPVLAAPPLLREHRLYQADWLLRFYGFSAEEILSEEL PHLDEQFDPKTAWALRHPEFFPVDINRADYATLLRVPGIGVTSAKRIVAA RRFSLITFEGLKKIGVVIKRARYFITMQGRRVECTDFSPTLIRRQLLLSE STEKPASRQLVLPGLEPILA >Cag_0922 conserved hypothetical protein MFFDPLYLILALPPMLLGLWAQFRVKSAFKKYSGVPTQSGINGAEAARRI LQRGGLTNVSIEPSHGMLSDHYDPTQKALRLSDEVYGYASIAAVGVAAHE AGHALQDKTGYAPLQLRSIMVPAVTVGGNVGPILFMIGMFMAGSLGTTLA WAGVLLFAATSLFALVTLPVEFDASRRAKELLVSQGIVSSAEMKGVNAVL DAAALTYVAAATQSIMQLLYFVMALNRREE >Cag_0628 conserved hypothetical protein MESTSCPICNTNSFTPWLHVVDRFEPSTLWNIVQAVDSGLLMLHPRPTEA EMAPYYAHAGYEPFLNSNKKSSLAERTLLFARSLLLHYRAMLIAKAREHP LCKAHILEVGCSNGELLHCLQQKHHIPTAQLLGVEPDAASAEYARKRFGL QVVDGVEKLPTTLFDTIILWHTLEHIHRVNETLAMLRERLTVNGIMVIAL PNPLSYSARHYREAWIAWDAPRHLYHFTPTTLAALLKKHKLHIVKQQPYL PDTLFNTLYSEQLQRQHNNAPSTPLPFANALAQVTTAIKISTKELREPNN TSGIMYVVTHDA >Cag_1867 glycosyl transferase MKGDIEGTLAIVVLNWNGAADTIACLHSIIPTLDASVHLLVVDNGSTDCS VERIRAAFPHIEVLELPHNLGFAAGNNAGFRRVQALGAEYLLFLNNDTVV APYFYRPLLNLLQQHPDVGIAVPKIFYHHQPQRLWYAGGEVNLATALIRH VGLRQFDAPQFNVATSTDYATGCSLAIRVADFEQLGGFDERFTMYAEDVD LSLRVRAQGKRIAYEPSSMVWHKVSASLGNNSLQKLWMKSKAMVRLCIKH RAWSGLLLYFVLLPFRLVRSVGGSLLFKIGKMR >Cag_0249 acetyltransferase, CysE/LacA/LpxA/NodL family MLMGKILPYKGIVPQLHESVFLTDGAFVIGDVHIGANSSVWFNAVVRGDV CPIRIGEKTNVQDNVTLHVTHDTGPLTIGNCVTIGHGAVLHACTVQDHVL IGMGAVLLDDCVVEPWSVVAAGSLVKQGFRVPSGMLVAGVPAKVMRPITE AERQTITESPENYVRYVQNYRAEDAQG >Cag_0509 conserved hypothetical protein MVIKRAQKALLTERLENEPRNFIQVLYGPRQVGKTTIAQQFMQTTSLPVH FVSADYVAVEQSHWISQQWETARMKLRQSEQQQAVLIIDEIQKINNWSEV VKKEWDSDTANQLSLKVVLLGSSRLLLQQGLTESLAGRFETLYVGHWSYS EMREAFNVTPEEFVWFGGYPGAASLIYDEERWQRYITDSLIETSISKDIL MLTRVDKPALMKRLFELGCSYSGQILSYTKILGQLQDAGNTTTLAHYLRL LDSAGLLGALEKYSIETVRRRASIPKFQVHNSALLSAQQPLAFRDVVSNP ALWGRWVESAIGSHLLNYTRTHNLELYYWREGNHEVDFVLVHKGRAIGLE IKSEHSQQTAGMGAFAKQCKPYKVLLVGDSGIAWQEFLTLNPLELF >Cag_0594 conserved hypothetical protein MNEFGITDSHLHIIRSIFKQYQAINKVLIYGSRAKGNYSERSDVDLVICD TTFDRKTIGKILLAINNSDFPYTVDLQIMENIKNKNLQEHIKRVGKEFYT KM >Cag_0942 carbon-nitrogen hydrolase family protein MSKESVSIAVVQSECKGDAVANRAEATAKIREAAALGAQIICLQELFVTR YFCQTEAYEPFGEAEAIPDGATTRLMQELAAELGVVIIASLFERRARGLH HNTAVVIDADGSYLGMYRKMHIPDDPGFYEKFYFTPSDLGYKVFKTRYAT IGVLICWDQWYPEAARLTALKGAEILFYPTAIGWATDEDSAEVRHAQQNA WITMQRSHAIANGVFVAAANRVGTEENLEFWGNSFISDPFGQMVAEAPHQ HETILLAQCDLSRINFYRSHWPFLRDRRIETYGGLQQRFLDNNQ >Cag_0511 metal dependent phosphohydrolase MIAEHLLFHSDGGFIRIPVWGHIPLSKPLKSILSHPLFLRLKGIRQLSFS QQVYPGATHTRFEHSVGVYHLMKLILQRMVTSSLAQKLQTEHFRFDDASC RLLLASALLHDIGHFPHAHIIEEQIPRVGNEVVFSHHEELCRYFLEEEHP NHPSLATLLMEEWRVDPNDVVALISGKHRLSKLISGTLDPDKMDYLMRDA HHCNIPYGSIDIERLIESFVPDPERQRFAITEKGIAPLESLLFAKYMMMR NVYWHHTSRALSAMLRRLLQDIAEAELLPAATLRELFYRNADDRVLYELK LLLPEATHPLVALLEDVLMRRVYKRAITVQPYLQSSGKEDERWFLYSNNS ALRRSMEVEICELLNKRYQLNLHGYEVLIDSPSRKDIFDYADLQELRVYP TRSEHIHYAMHCASEYVRFDELNESVFQSNFILSFERYTKKFRLLCRPDL VAHIVELRHDIMSLLAHDYPLFHSTVSSSATEHS