GenColors

Gene list

Applied filters:

COG category: General function prediction only
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS

Number of genes found: 262

Free access

# Chlorobium chlorochromatii CaD3, CaD3

>Cag_1709 conserved hypothetical protein
MNGYRKRVVDDELNELIAALPAIALEGAKGVGKTATAEGRCRTIFRFDDP
AQRAIAEADMGVVLNQDTPLLIDEWQRVPSVWDAVRRAVDRDQTSGRFLL
TGSASPTTPPTHSGAGRIVTLRMRPMSLAERGVGVPTVSLRELLFGHRPE
IVGKTNIALADYVREIVHSGFPGIRPLSGRALRLQLDGYLRRIIDTDFPE
QGYMVRRPEVLRRWLAAYAAATATTASLETIRDAAFGGDKEKPSKTTTQP
YREILERLWIVDPLPAWLPSRNHLKRLAQPPKHHLADPALAVRMLGLDES
ALLAGEESMLSIPRDGSLLGHLFESLVTLGIRVFAQAAEAHVSHLRLHGG
RQEVDLIVERGDQRVIAIEVKLSSTIKESDVRHLLWLREQLGDELLDMLV
IHTGTQAYRRPDGIAVIPAALLGA
>Cag_1497 chlorobiumquinone synthase BchC related protein
MPLTAQAIVLQKASKLKMVTASFEVPTVNGLLVQTIASTITPGYDRLLIT
NKPVTNKVFKYPIMPGSEAIGQVLEVGSGVSDIAVGDFVFVFAAQGWQGI
EAYAGCHANIIPTTRDGVLPLGRLPIHRDLLTGLLAYAMSGIEKIPLNPS
QRVLVLGLGSVGLMVTEYLHHLGYQHVDGVETFGLRGQLSRAENIALDIA
DFTEAFNNCYDIVVETTGRILMLEKAIRLMKPHAKVLLMGNYEVMACDYR
LIQHKEPHLISSNITTMEHHRKASELLESGLLDTEKFFTGVYPVEQFELA
YRHALDDKPSIKTVLTWL
>Cag_0239 peptidase, M16 family
MALTTSTTVHLATLPNGITVITDSVPYVESITLGIQINAGSRDDPAHAAG
LAHFMEHALFKGTRTRSYLDIARSVEQHGGYLDAYTTKEQTCVYLRCLAA
HLEPSFELLADLVSNPTFPPEEMEKEKEVVLEEISSINDTPEELIFEEFD
QRSFPNHPIGNPILGTEKSVEAFSQNDLHLFLQQHYIPQKMVVTATGNVS
HHAIMQLCERFLNHLANPAESTETRQPLSVATYKPFSLTLKKRIYQAQIV
MGTAIERNDRHFYSLMVLNTLLGSGMSSLLNLELREKRGLAYNVYSSLAF
FDDLTALNIYAGTDGNKVATTLTLIKELLQSDALHHPIHEELQAAKTKLL
GSHIMGMEKMTRRMSNTASDYVYFRRHISPDEKSAAIEAVTASDVTEAAE
LLLRQATYSTLVYKPSRQG
>Cag_0608 hypothetical protein
MAYLRSGISLIIAGVSIMHFSHQFWYWIIGIACIPTGIVTGFFGVWRYIT
ISKSITIVRRELPLADQREAEQMK
>Cag_0151 conserved hypothetical protein
MRLLPAEREIIRTLATRIFGDGTRVLLFGSRVDDSVKGGDIDLYVQSPDA
EQALTKKREFVVALKLALGDQKIDVVISSNPSRFIEQEALKHGVAL
>Cag_1617 conserved hypothetical protein
MKIVQFVAMLLLFAGVNGAVVYRLWRLMPPLRWFRGSVTLLLLAVIAAPF
VVMAWGNALPLPLVSLLYMVGLSWLILLIYLILLFLLIDGLSLLGFFGVQ
RLKSLKLWTHESWVGTVGVVVLMAVLAVYGNYNYHQKERVELTMRVEKAM
AQPLRIVVVSDLHLGYSIGREELERWVVLINREEPDVVLLVGDVIDTSLR
PLEVERMAEVLRRLSSRYGVYAVAGNHEHYATLAKSAPFFSDAGIRLLRD
EVLLIDNRCYFVGRDDYMNKQRKPLSVLLSGVDVAKPIVLLDHQPRALGE
ARAAGADIHFAGHTHRGQIWPISLLVEQMYEQAYGYRRFGAMQSYVSSGL
GIWGGKFRIGTKSEYVVVTLQGR
>Cag_0985 putative plasmid maintenance system antidote protein, XRE family
MMILMYLKNKGGARNSCLIFLTFHYAEAFGTSPEFWLNLQATYDLSLHKP
TKHIQPLVAVSAQHEIQERSCIVGLFLLCVLG
>Cag_0115 CrtF-related protein
MMNTNELLNYNHRANELVFKGLVEFGCIKASLELDLFTHLAGEAKDTETI
AANVGAIPQRLVILLETLAQIGVLAKNDGKWSLTPFAATMFLPNNELPNL
YMMPVTKAMAHLSENFYLKIADAVRGNHIFKAEVPYPPMTREDNWYFEEI
HRSNAHFSILLLLEEANLSNVKTLVDVGGGIGDISAALLQKYPQMDSTIL
NLPGAVELVNENATEKGLADRLRGSVVDIYKEEYPKADAVMFCRILYSAN
EQITSMMCKKALDALQEGGKVLILDMIIDEPEDPNFDYLSHYILGAGMPF
SVLGFKQQERYKELLEAIGFRDVRIVRKYGHLLCEAVK
>Cag_0693 sepiapterin reductase
MQHIILITGAGKGIGRAIALDFAKATSPTFQPVLVLVSRTLSELESLAAE
CHALGAETHLCAADISNLQQIDAMVNDVVARYGTIHCLINNAGVGRFKPF
AELTPDDYEFVMDTNLKGTFFLTQKVFPFMEKQQLGHIFFITSVAAETAF
STSALYCMSKFGQKGFVEVMRLYARKCNVRITNVLPGAVLTPMWGEVPDA
MQRVMMQPEDISQPIVQAYLLPQRTSIEELVIRPVAGDINE
>Cag_0632 TPR repeat
MVQSSNEGGGVSATAPYYTSYKQALAYVDEQRYEEALQLFDHCIAIERRH
AALLYGRAVTLLALGTYRQACCDLFKSLALDKAQPEAWKHLAYLLFMLGK
DEPAEKTLKKALERFPDYAPLYCVLADIYLDLGEFDKAHEAIEQALRLDP
QNPEPHSKLAMYYVARGNMEGLQQECKTLEQLDAALAEQIRTLFFENQ
>Cag_1073 Phosphatase kdsC
MLTLSQNELVRRAQAIRLVIADNDGVFTDTGVYYSERGEELKRYSIRDGM
GVERLRNAGIETCIMTGERSPNVQKRAEKLCMKWLYLGVKDKRSMLATLL
AETGMERHELAYIGDDVNDCGIMEEIAPFGLVAAPRDATRFVEPYLHYRA
TADGGHGGFRDIAEWLLELKNS
>Cag_0098 YgfB and YecA
MNAPDPMMQPLTLQEFTILEEFLVSERTPEEALSSLEMLDGYMTAAIIGP
QAFEPKDWYALMWDKNKQLEPQFSSADEADMISELIVRHNNSIEAVFLED
PESFVPLFDRVAYENEEIHKLAVEEWCMGFLIGMELAYEAWQPLFDNEDA
AVMTMGFFMLSKVSDEFAHMTEREIEEITSTVGDAVIGIYLYWHGDDEMD
EEDDDELFRE
>Cag_0398 conserved hypothetical protein
MLLMKRYQWLGQLVATTLLWLLLYNNLELGADQLLRLIGLTRAVPFGEAL
HFFVYEVPKVLLLLTGVVFVMGVIHTFISPERTRALLSGRRTGVGNVMAA
TLGIVTPFCSCSAVPLFIGFLQAGVPLGVTFSFLISAPMINEVALALLFG
MFGWQVALLYMGMGLAIAIVAGLIIGKLGMERYLEEWVQQLQNSGMADEN
NEDNAMEVPERLAYGWKHVQEIVGKVWFYIVLGVGLGAGIHGYVPENFMA
SLMGNNVWWSVPIAVLLGVPMYSNAAGILPVIQALLGKGAALGTVMAFMM
SVIALSAPEMIILRKVLKPQLIAVFASVVALGIIIVGYVFNAVL
>Cag_0437 conserved hypothetical protein
MTILDRYILKKQIAPFFFAFITIVALLQLQFFSTFAERFIGKGITFVAIV
ELLALQSAWMVSFALPMAVLVAVVMSFGTLTTTSEMTVCRASGISLYRVM
VPVIVVSLLLSFTVERFNNVLLPQANYQAKSLMAEIARSKPAFGLTEQAF
STLVDGYSMYVRSSDERHGELRGVVIHDMTRPEYRTTITATRGRVEFTPD
YQYLVMTLRNGAIHQLQQPEKSGYRSMNFERYRFVFESSLSGFTPSSGNR
MRADANELSAGELHAIGLEFRRREAVALLHVQAPLVALERLAANTDNSKM
AASPPTLRQETSAIAATKALAVIEGEIARVASELEVASTNRTLYNRYMAA
YHKKYSLSLACVVFVLVGAPLGVLARRGGFGVGAAISLLFFVLYWMLMIS
GEKMAERGVLDPMIAMWMADGVMALIGVGLVTKLTQALFSTSR
>Cag_0723 oxidoreductase, Gfo/Idh/MocA family
MKIGVIGVGKLGEFHTKLLTELAHERTDLHVAGIFDLNTQRAEEMAQKYN
VPRFNSVEELAKTCDAAVLATTTSTHFALASALLNEGLHLFIEKPITTTV
EEADELIRLEAENNVRIQVGHIERFNPALRTVEQWIGRPMYIQAERLSGF
SLRVTDVSVVLDLMIHDIDLVLSLIQSDIKHIAASGVKVFSNELDMATAR
IDFVNGATANVTASRLSRSKMRKLRFFCTEPKSYASLDLTSGKSEIYRLV
PPDMASSKNPLKSFAARKILEQFGEIQESLNGKVLDYIHPEVPKVNALRD
ELEYFINAVRDNAPTVVSALDGRRALFVAGKITDEINASTALLHD
>Cag_1954 iron-sulfur cluster-binding protein, GltD family
MKVESNPILDFAINYQFPPFEELTGTHKIVAFGDHSHKCPVYVRQTPPCQ
AECPAGENIRGYHRFLNGIDKSEDEWKSAWETLVEINPFPAVMGRICPHP
CQSACNRQYHDESVAINAVEQAIGNYGIQAGLQLPEPAPATGKRVAVIGG
GPAGLSCAYQLRRRGHAVTLYDANEKLGGMVLYGIMGYRVDRKVLEAEIQ
RIINLGIETKMGVRVGSDVTLDELEQEFDAVFIGIGAQAGRSLPVAGAAE
TQGVTNAIEFLRSYEVEGDNITIGKKVLVIGDGNVAMDVARLALRLGSEA
AVVAGVPREEMACFKEEFDDADHEGAVMHFMSGALELLKNDDGSVRGLRC
AKMVKKAKGEEGWNSPIPFFRYKNSDETFDIEADTVVAAIGQTTNMQGFE
AITNGAPWLKVDRSFRIPGREKLFGGGDALKVDLITTAVGHGRKAAEAID
AFLKGEPMPDQGYREVTKVSRQDVLYFPVTPPAKRDTIKIQEVVGNHDEL
LVALTPEQAKAESGRCMSCGLCFDCKQCVSFCPQEAISRFRDNPVGEVVY
TNYDKCVGCHLCSLVCPSGYIQMGMGDGL
>Cag_0951 conserved hypothetical protein
MPQSQQLTNEQQAALQQALIYRFLGLVFAYPNDAFLPTLQNALQKISDNA
ARFQPLLDAFAAEPQEQLQAEYTRLFLNGYPNTPCPPYESVYREERMMGE
SSLAVQKLYQQWEIAIDANLSDHLATELEFLAFLSAATTLTEVATDALAT
REYFLEEHVRQWLPQFCRDLQKEATVEAYRLLSQLLANVLLH
>Cag_0725 drug resistance protein, putative
MKRSPLFILLLTVLLDLIGFGIVLPLLPTYTKSLGANPFMIGLIAAIFSI
MQFIFSPLWGKLSDKIGRRPVMLSSILLTSLSYLMFAQATTLPLLILARA
LAGVGSANVSAAQAYITDVTDAKGRSGAMGMMGAAFGIGFIVGPLLGGVL
MHNYGIAVVGYVASLLIAIDFILAIFFLPESNKAAIPFTHLLKGNENKGT
RHGKKQSLGTTLATKVQEYNEHFRTTFSSRPLALLMVANFIYTLAIVNMQ
TASILLWKEYFKASDEQIGYLFAYVGIWSVVVQGGLIGKLTKKVGEHNIF
LWGHLFTFFGVFFMPFLPSYSLFSMGLTVLFFFAIGTSLVAPINLSMISL
YSYNQQQGQIMGLSQSVNAFARILGPFSGSILYGMSFHAPYIVAGILTLV
GAVIALRLFRYRIEAHD
>Cag_0981 conserved hypothetical protein
MNFIIKQATVSNRKDFLKILQCWNMQNGFLHDETELDYSNFFIAEVNNQV
VGMAGFMPIDGERYRTRLLAVYPEFRGTEIGKALQDRRLEEMYKRGAKIV
ETSVDNLEMKHWYKKHYGYTEIYKTKKEYEISFIDVDVVDVLYLNLIEYM
KNKIAFDSKKLRYMEKYEPHPLSPYPPLIINVALTGVIPTKTLTKYIPIS
VNEIIEEAINVYDAGASIVHLHAKDENGKACSDAKYYEKIISGIKKERPE
LICCATTSGRDGQSVEQRAEVLSLTGNAKPDMASLTLGSLNFLSGASINS
IDTVTELAYIMKEKGIKPELEIFDTGMVNLAQYLERHNIINGKKYFNILL
GNLNTAGATIKDLSHIYTSLPDNSIWAAAGLGHFQLPMNMASIVAGGHVR
VGLEDNIYYDLNKTKLATNITLVNRIKKIANELERPISTAQKTREILGI
>Cag_0293 putative cytoplasmic protein
MSKNKTSHVSTNKEPANIRSSAAEYLTYIAAIGERATSVEMRYEAENIWL
TQKMMATLYDVSVPAINQHLKRIFDDNELTREATVKQYLIVQTEGNRQVE
RMVDHYNLQAIIAVSFKIENERAVQFRKWANQIVKDYTIQGWVMDVERLK
HGGTILNNEFFERQLEQIREIRLSERKFYQKITDIYATALDYDPSATASK
RFFAAVQNKIHYAIHGLTAAEVIVNRADHRKNNMGLTHWEGAPSGKIHKY
DVSIAKNYLSEFEIAQMERIVSAYLDMAELQTMRKIPMTMEDWEKRLAGF
LTLWDREILQDAGKVSAELAKVHAESEFEKYRIIQDRLYESDFDRLLKQI
EHCNTTQAEKQ
>Cag_0857 conserved hypothetical protein
MSGIKYLLDTNIILGLLKATSTVLEAIGFRSIQAAECGYSAITRISNCYV
LFVS
>Cag_1651 Small GTP-binding protein domain
MKPLLALVGRPNVGKSTLFNRILRQRSAIVDPTPGVTRDRHIAEGEWQGK
QFKLMDTGGYNTDGDVLSKAMLEQTLHALADADSILFITDARAGLSYEDL
ELARILQRSFQHKQLFFVVNKVESPQLVIEAESFIKTGFTTPYFVSAKDG
SGVADLLDDVLEALPEAPEGEVKGDTAVHLAIVGRPNVGKSSFVNALLGT
NRHIVSNIPGTTRDAIDSRLMRNQQEYLLIDTAGLRKRTKIDAGIEYYSS
LRSERAIERCEVAIVMLDAEQGIEKQDLKIINMAIERKKGVLLLVNKWDL
IEKDSKTSIRYEEQLRMAMGNLSYVPVLFVSAMTKKNLYRALDTALQISR
NRSQNVSTSQLNKFLEQTLAQVHPATKSGRELKIKYMTQLKSAWPVFGFF
CNDPLLVQSNFRKFLENKLREAYNFEGVPISLRFLHKNKVKED
>Cag_1931 Hit family protein
MSHNQEQDCLFCRIVRGEIPATIVYRNEHVVAFKDISPTAPHHVLIIPVQ
HVASLNALSPEHEAVAGQLLLAAAPVAEALGIKESGYRFVINTGADAMQT
VFHIHAHLIGGQAMGWPPFPV
>Cag_0424 drug:proton antiporter
MALASSPIVSFTFLRLLSFRFLIVLSYQMLAIVAGWHIYELTHNALALGF
IGLAEVIPYFASALFAGHAVDHYSRRLFGVMASVMVMASALMLTAVSAGM
VVGNPVWWLYGAIAFNGLARAFISPSYSAMFALVLPREAYAKASGIGSSV
FQLGLVTGPALGGLLAGWFGNTAGYAVAAVLAFGAALALFSVRVKEPPSA
ESMPIFASIASGIRFVFGNQIILGAQSLDMFAVLFGGAVALLPAFIKDVF
HFGPEAFGLLRAMPAIGAVITGLYLARHPLNHHAGRWLLGAVAGFGVCII
GFALSTTIWMAGLLLLLSGICDGVSVVMRTAIMQLLTPDDMRGRVAAING
IFIGSSNELGAFESGVAAHVMGLVPSVIFGGFMTLGVVAVTAKLAPKLRR
LDLQQLY
>Cag_1887 von Willebrand factor, type A
MCFTHPEKLLLLLLLVPIAGLLIGRFIKEKRLRQALMANKMAATMMPLQH
LLPYAFRSLMLFVASGLLLLALAEPRWCGGTKPVLRHGADVLFILDVSRS
MQATDVAPNRLMRAKQEIAAISQNVQGGRRGLLIFAASPLLHCPLTTDRD
GFATLLNMAAPELIEEQGTRLQPAFALASTIFDVANESNAASTRGVQVIV
LLSDGEDHDSNVQRAAQQLAKQSVQLFVIGVGSLKPSPIPLADGSFKRDA
SGQVVMSRFRPQMLQAFARQAKGLYRHSHAEVWASADVVNRINRYAADSR
IVMEPATNDSSLLRLMVGIAVALLFMETLLRKSS
>Cag_0290 conserved hypothetical protein
MKSPFPLFFPLLQKKRDCSKQAQCSKRRLIMALLAVSALAPANLYAASKA
KRQPPLVAHVYPDTLALPYNRYALKPFMRPARKSVAVALSGGGANALAQI
GVLKAFEEAHIPVDAIAGTSMGAIIGGLYSCGYSAAELEQLALTMPWSSI
LALQEDYSRSSLFVEQQRIRDRATIALRFDGLKLLLPQSLNSAQAFTRTM
DMLVLHALYHPHSNFSSLPIAFRAVTTDLVSGERVTLESGSLSEAMRASS
TVPILFEPIHRAEQQLVDGGLVANLPVDELAHFGADCKIAIDTHGSMYAT
GKELDLPWKAADQAMTILITLQYPAQRAQASLVIEPETGKHKATDFKNIP
QLIAAGYVAGKQQVPTLQRLLAITSPSNSSAPQTSSVPPSSIVPSVATPP
PISPILTANKKEMRNFSLATYTKRWSISPTSTELERLVGEKVASALELHA
LLRDLLATDYFARVSAEVHQEDRTVTVKLEALPSVTVVTVQGELADELSS
AELNECFAPLMGRLYTNHQATAALEALVRRLRAKGYSLAAIEQVHVENER
LTITFSSGKAAMLTISLNKGRTLLTPIQRELKLDATKPLRLRAAEESVKN
LYETGVFNRVSLFAEPITQTEAIAPISSTTPNQTIHLSLEEKPASVLRLG
LRYDETNNAQLLLDVRNENVGGTTNTMGGWVKAGRKGYLANMELNMPRIG
ATHLIFATRLFFDSYLFDYTNSDGSLAPYNIQKYGITSSFGTRLRKNGHF
LTDVSYYNSQAFTDEAHRPLFSTTNNNVLTIGTHLTIDSRNNALMPTRGS
YSYLTYAFTPLSLDDGLRYWQFSGTHQVNLPLGRETTLQLSAMTGVSSKA
LPLSEQYFLGGIGNSYSARFIGLQPHALATNNVATAGVQLSYEPSFPILF
PTTLQLHYNAGRGWNAMENVRLDGALQGALQAVGASMVWKTPLGPTRFTL
AKVLVNNDDNSLMLPHRDDDPVFYFSIGHDF
>Cag_0126 CBS
MDQLIKLRTLPVSALMQKDFHVIKGSCTVAEALQLMKQTRESGLIVEPRN
EDDCYGMVTEKDILEKVIDPGEDVHRDPWNTPVFQVMSKPVISVNPSLRI
KYALRMMKRTNVRRLTVMEGNKVIGVLNMADVLHAVEELPVHDEHVAL
>Cag_0350 oxidoreductase, short-chain dehydrogenase/reductase family
MKVALITGASMGIGEAFARSLAEQGRTMVLVARSVDALHRLATELEQRYR
VAVYVMPADLSQHESATAVYNYCRQQQLEVELLINCAGFSVAGAFADIPP
ERIAEMVQVNSTSLALLTRHFLPNMLQRKSGTIINVASLGGLQGVPGMGL
YSATKSFVITFSEALAHEVRPYGIKVVALCPGFIATGLMESAGQNTKAIR
LPISQTDVVVKAMQRAFVTRYVRLYPTWLDSLLAFSQRLVSRSLAVRLAA
FFAGVLKKG
>Cag_1975 putative 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase
MCAASSFTLNHYKPLSGFQFFGNDPSQADPSSRIDQALKGIQALLFPVDG
ILNGSKITFDHSGNELCTISVRDAIAIKEAVKLGLRIGVLSSRNAEGYRP
MLEALGVQDLYLNGEHVFYSYDAFRHRHSLSNEECAYIGDDIGDIDVLAK
VGLPATSIDGADYLRNRVAYISGFEGGKGCIRELVEEILTRQGKWPYIER
PDEEDETAEE
>Cag_0927 Beta-phosphoglucomutase hydrolase
MSSSFKGAIFDLDGVITGTAKVHSLAWESMFNSFLQNYAEANNEPFVPFD
PIHDYHKYVDGKPRMEGVKSFLFSRDIELPFGELDDNPENETICGLGNRK
NSLFTEILEKEGPEVFSSSIELIEQLIERGIKIGIASSSRNCQLILRLAN
LEYLFETRVDGEVSIHLGLKGKPNPDIFVVAAKNLGLEPHECVVVEDAIS
GVQAGARGNFGMVLGIAREIEGARLIEQGADIVVNDLGEITPEDIEEWFT
KGLEFEGWNLTYTEFSPKDEKLRETLTATGNGYLGVRGAYEGSKSSHNHY
PGTYIAGIFNRVPSLVHGQTIYNNDFVNTPNWLPIEFRIGGGAFIDPFRQ
KILSYRQNLDLRSGLMERDLVVQDNLGRITSITSSRFASMANPHQCAVKF
TLKPVNYSADIEFRCSIDGTVQNRNVARYSELTSDHLEHVEAQHNGATML
LHVRTSVSKYEIVTAAKTRIIMHGKEVTAERQPLHSNRFIGEQFTLSLGP
SKGCTIEKMVSIYTSLDQNSTSPLTAAKTSLQDCSSFDELLTPHVAAWEA
LWEKANLQIEGDRFSQKVLRLHTYHMLCTASPHNASIDAGMPARGLNGES
YRGHIFWDEIFILPFFNRHFPDISKSLLLYRYHRLDAAREYARENGYKGA
MFPWQTADDGKEDTQSVHFNPKSGSWGPDHSCLQRHVSIAVFYNTWRYIY
DSDDTTFLNEYGAEMMFEIARFWASIATLNPATGRYHIEGVMGPDEFHES
VPGSKKEGLKDNAYTNVMCVWLFEKATEIAAKLSPEALERLQKTIGYTPE
EAEEWHKIGHHLNVLIDHDGIMEQFDGYKSLKELDWKHYRTKYGNIHRMD
RILKAEDDTPDNYKVAKQPDVLMMFYTLSPGEVAELLTKIGYMVPDALTL
VRNNYAYYEPRTSHGSTLSKVVHSIISSYLHDGRDMAWSWFLDALKSDIN
DTQGGTTHEGIHCGVMAGTLDTVARYFAGIAFYNEKLNVHPNLPDQWKKL
SLTVCFRANRYALTIEKAAITVTLLESDSNEAPACIAGHHLTLQKGVPYN
SALHA
>Cag_0895 hypothetical protein
MMQSIIRNATLSDLSRCAELLAILFSQEKEFAPNATAQLHALTMIAESPA
SGQIFVAEVNGKVEGMVLLLFTISTYLGKRVALLEDMIVTPEWRGKGIGS
QLLRHALAYAQSNGMGRVTLLTDLDNEAGHQFYKAHGFVRSSMVPFRYVW
E
>Cag_0185 Glutamate synthase, NADH/NADPH, small subunit 1
MGKIKGFMEYKRALPADRQPLERIKDWQEFHEKMAPEALCEQGARCMDCG
TPYCHSGIMLSGMTTGCPIHNLIPEWNDYVYRGFWHEAYDRLNKTNNFPE
FTGRVCPAPCEGSCVLGIINPPVTIKNIEYSIIEHAFAEGWVTPKTIANR
TGKRVAVVGSGPSGLACADQLNKAGHSVTVFERDDRCGGLLMYGIPNMKL
DKVQVVERRIALMKQEGITFMEKTEVGVDYPAGKLLEEFDAVVLCTGATK
PRDLAEEGRQLAGIHFAMEFLTASTKALLDGTEPKLSAKGKKVVVIGGGD
TGTDCVATSLRQKCASVVQLEIMPKPPMERQADNPWPEWPKVFRVDYGQE
EAEALQGSDPRRYAMMTKKFLTTGNGNVSGVEVCSIAWENIDGRFVPTPI
PGTEEIIEADMVLLALGFIGAEENLLQQLQVAQDERSNIKANTQNYRTNH
ERIFAAGDARRGQSLVVWAIQEGRAAARECDRFLMGGTNLP
>Cag_1088 hydrolase, alpha/beta hydrolase fold family
MLHYKTYLHHNPKAAWVVFVHGAGGSSSIWYLQIKEFMQHCNVLMVDLRG
HGRSKDMGIPEGMRRYDFNCVTRDIIEVLDFLRIEKAHFIGISLGTILIR
NICELAPERVASMIMGGAIIRLNLRSTVLVTLGNTFKHVMPYMWLYRFFA
WIIMPRARHKKARNLFVNEAKKVEQKEFLRWFSLTYDLTPLLRYFEEKDA
ATPTLYLMGDEDHMFLPFAKRIVTRHTYATLEVIANSGHVCNVDQAREFN
QRAIRFIKLHS
>Cag_1544 phenylalanyl-tRNA synthetase, beta subunit
MKISISWLREFLPNFSCETVSLVERLTFLGFEVEGVEESASLDRRIVVGR
VLETEPHPNAERLTLCLVDVGREEPLRIVCGAPNVRAGMVVPVATEKAKL
QFPDGQTLTIKPSKIRGERSQGMICAADELGLSNDHSGVMELESSWEIGK
PFADYLESDVVLDIAVTPNRPDVLSHLGIARELADGAPLQYPSQQSLTYQ
PAGERIAINDAVACPYYTGVIIRGVTIRESPEWLRKRLQAIGLNPKNNIV
DITNYMLHALGQPMHAFDCAKLAGERIAVRSDCQAEVVALNNLTYKVEGG
MPVICDGSGAIAAIAGVMGGMASAVTESTTDIFLESALFHPSMVRRTAKK
LALASDSSYRFERGVDSRMVQQASATAVALILELAGGTVECAMEQGSVAA
DLQLLALRPERTNKLLGTALSGEQMVELLERIGFRCVEQTTEQLLFAVPS
FRVDVTAEIDLIEEVARLYGYNAIESSRQMATIYPTKRQHPAYFPDFLRG
ELITLGFREILTNPLIKRNDAALASEQLVDVLNPISEGLEVLRPSLLPGL
LKVISHNIRHGNRDQKLFEVAHVFEAKPQVQQTQQPLEGYCEQERLVMAI
TGSRYLRRWNHPTDMVDFYDLSGAVEMLLEQLNILDKSVVNIYTPSALSI
DVFLTEKGKRTTHRLGIMQPVNAAWLKHFDIEQEVYCAELDVALLERCYQ
PTSAYEPPSRFPVVERDISFIIPEGVSAQSLVELVQSSNPLIKTVTVFDR
FERNHESGKECSIALSLTIADAKATLQDEKINDILATISRNAESKLGAVI
RQV
>Cag_1480 Acetyltransferase (isoleucine patch superfamily)-like
MNPWKNLQKRYFPALCYWLGNHFFMNWTPYPVRHWFLRKYCNVKIGKDSS
ICMGCFITGQKIEIGLNTVINRFTYLDGRVALRIGNNVNISHYTLIQTLT
HDPQSSNFTCQEKPVTIGDNVWIGARAIICPGVAIGEGAVIAAGAVVIKD
VPPYTIVGGNPARYIKTRTNDLHYKTRYFPLLDTDIQ
>Cag_0111 magnesium chelatase, subunit I, putative
MQLAAELEALGAEIVDASRFLVRVEEELSHRIVGQREVVRRVFIALLVNG
HILLEGVPGLAKTLIVSSFAEAMALKFQRIQFTPDMLPADLVGTLIYNPK
DLTFFPRKGPLFTNIVLADEINRSPAKVQSALLEAMQEHQVTIGDESYQL
PAPFLVLATQNPIEHEGTYLLPEAQMDRFMMKVEVDYPSYDEELEIMLRS
ATNAPRPSIQAVAQPEDIERARTLIDRIYVDPRVQRYIVDLVVATRSPAQ
YGMENLNGMIECGASPRASIYMLLAAKAHAFLQQRPYITPEDVKAVVYDV
LRHRIRPGYEAEAENMRSTDIIRQILQHVQVP
>Cag_0824 Dhh family protein
MIIPSYGRTLHAEEWQPLLEPLLAAQHLVLTTHENSDGDGLGCEVALALA
LTALGKEVSIVNPTEVPPNYQFLRQLYPIVQFNPKSEEAIQELSLCDAVV
LLDANLSDRMGTLWPHVRFARELGSLKLLCVDHHLEPNDFTDVMISESYA
SSTGELVYGLILAMEQSVGRALFTPNIAQALYVAVMTDTGSFRFSKTTPY
VYQLAGDLVARGANPEKAYDLIFNSLTPQALKLLGLSLSAISLVEGGKLS
WLLISQEMLKATESKLFDTDIIVRYLLSVPSVAIAVLLVEMQDGRTKASF
RSRGKLPVNKLAKEFGGGGHMNAAGALFPYTPEKVQQVLPQAVRRFIKEH
EALL
>Cag_1768 TPR repeat
MLMLAGCSSSSSTVSTQKIQAPLPKPLPETVAYELATASLLMAQGEYQQA
LERYRALLTTESNNAALHHALAKAYTANGEFVAARQHSQQSVTLEGTNVW
YLRLLIALTHNESDYAQAVALSKKLVTLEPDNREALTMLAYEHLAARQPN
EALEVFQRLLQLDPANAEVLLSSAEVALELGRRSDALRFFNQLLHYGIES
DSIHFFIGDLQQQQGLHEAALASYRNALKLNPHLLPAWYRRLELVALSPN
LSQSSKPTLFAEELQHFYKQSGTTLEQQLGLLQLFTNRATRNPAFISATQ
SMIKALQQRYSSHSLVRFTVQIAQGRLFVAQGQHAQAITLLRQALRSPHA
TRQPNVALDAESTLALAYERSGKVTESIRLYEKMLRRTPNNALLANNLAY
LLATQHRELPRALELAKKAVAAEPNNPIYLDTLGWVHFAMQQYEPARELL
EKALQGEPNEPEVIEHLIAVYEKLGNQSKVQELQERLRRVCL
>Cag_0732 TPR repeat
MNPPSSSSQVIPLQVRQLFDHARQLRKQGMLNEAIEAFREVIELQPDYVA
AYNNLANALQAQGDSDGAEAVYQQALHYAPMLPVLHCNYGSLLLARQEYD
AAIKSYQKALTLQADFFLAYTNLAKAYSVRGNFFAALQTYKAALRLKPQD
AELYLDCGQLYQQYGFIPQAVKYYRRSLQLAASARGYNALGAALQDWGNL
KLARASYHRALKLQPDFDLPQYNLAQLYENLGELETARRYYEQTLTVDAE
NAKLLLHLEMIKRRQADWSNYTERVEQLRHALERHVENDKGEAVPMLSVL
SSSLSPALYRALAEQMARQLTRNAQALNATFTFPNNVAPERLKIGYLSPD
FRGHAVGTLIADLFQYHERPDFEVFAYSLLPHHDEWTERVKAGCDHFIDV
SHKSPLAIAQQIHADGIHILVDLAGYTSYARPLVLALKPAPIQLQYLGYP
GTLGAEYVPTIIADKHLIPENHQSYYTEQLCLLPHAWVAAPMQIASLSLT
RAEFGLPEKGMVYCCFNGVYKLEPHVFSLWMEILSKVPNSVLWLIDGEES
GSNERLRAVAQEAGIAPERLVFAKKRSHEEYLALYRLADLFLDTLSYNAG
ATAVGAFSAGLPLLTCQGEHYATRMGSALCYAVGLPELVAPTPADYVEFA
VQLGSSPKKRAALKRKLAKKLPTAPLFQPQQFVVALEQQYRSLWNNYCEL
TNNMLG
>Cag_0192 conserved hypothetical protein
MSLRTLQQQIITLYKSRYKATPQSITQLQGDASTRRYFRVEYNSLGTIAC
YDPAFVGADPERYPFLVLQKLLKQHDILVPRTCVYNAALGLLLLEDCGDL
LFQNYVLEVLHTKKYDVLQQIYQNVVELMVAVQSIKGNEHELPFNLSFDR
EKLLFEFDFFLQHALRGYFASEIDAALIPLLRQEFEAITDLLVQPEHFVL
NHRDYHSRNIMVTYDGYFLIDFQDARMGLPQYDAVSMLRDSYVVLPDELV
AAMKEFHYKQLLEHHLTTMTYYEYCYYFDLMGFQRAIKALGTFCYQAVVK
QNRSYEQYIAPTLGYIVNYIAERPQELGKAGGLLQPLLEKALHQ
>Cag_0510 conserved hypothetical protein
MSNAPRRFHFIVNPAANKGRATRHIAKLQQRLLGRNEAKVHVTQTAGDAA
VCAQSAAQAGDTIVACGGDGTLHEVVNAVAAMNATVGVLPLGSANDFIKS
LYTNPAEASNIDALWSAQAKAVDLGRVTYGSTQRYFVNSMGIGFTGNIAR
HVRENRWLKGDLTYLYALFRVLITLQPRTFTLTLTTPQGVRHEQEPLMVF
SIMNGKIEGGKFTIAPEAAIDDGLLDVCLLKAVPKWKIFRYLMRYIRGTH
INDAQVLYYKASRIELTLNEPTTMHMDGEVYENVCGNVTIEVVPKALRML
IIN
>Cag_1230 Molybdenum-pterin binding protein
MEQMEEQYGGQIEKQKGDAIGLEGDVWFQKAESCFLGGDRIALLEKIAAL
GSITSAAKAVGISYKTAWQLVDMMNNLASRPLVERTTGGKGGGGTIVTNE
GRKVIEQFRVVQEEHRHFLQQLEVRLGESQNVCQLLKRIAMRISARNIFA
GTVEHLTKGAVNAEVVLRLTGGQRIVSIITNTSADNLGLHEGMSAYAIIK
ASSILIGQEISPASLSASNILQGTITRVVEGAVNSEVDVAIGGGNSISAI
VTQSSLQHLALCEGAQVSAIFKASSVIVGTQ
>Cag_0840 TPR repeat
MSIELKAKYLGKTPKDKAKILIAQSSWEKTQLARDENQCYKLSLTEENEK
FIDSILELGNGKEVDLIVFPEFSIPEKYLEKIREWTFHNQIIVIAGSANL
QREEKYYNTSSIFFEGIPYKTEKHDLSPLETSNLLGGYGPSSGTNQFYFT
ETPIGELGVMICADEFDRQTRNEFLKHNIDILCVIAFQQKGKDHHQSINE
IVKESNNGIYVAYANALCNSWTDGHSAFFSNEYREGRVEYVETGLTKDDG
LEMKLVEMPSNAGCLIVECNLKSKKPVIRNLDPNRALVNAELPYVFENGG
LRQFTKEELKKPDEKSNKVKAEYQPSIPPIKAFVADYIGRQTDVEYLSEF
LNNPQKHFCLLYGVGGMGKSHLLYCCMKDYKQKTFFYHVVSPNEEFTLNK
LFEVCLLPKPDAKLSLEEKQNHFVKKFQENNIHLILDDYYEVQLDEVKSI
LPKLTGIGKGKLLLLSRIIPSNISYIKADYLNHKILPLTEPDFKQVIQNF
ILDKNLTLTDEEIHLIYEKAQGYPLGGQLIIDAKPYSKNLLELLTNLGKF
EAEIDPDGKIYSGRILDNIFKKGNNKEIKLLCEFSALFGVSDIETVRQLP
SYNLNLFQGLHSRKSFVDMDVQGKFSSHAMIRDFAYHRLQNKEALHLKLA
KYFENNINGRTDDDWKWLNEAILHYTKSPKAEHFAFINRVERNFESRNIK
EQIDKNSILKTIRNYTTLLNLYPDKPAYYNELGIAYRMNRQQRNAIETFE
RALVIDPKDLPSLNELGITFRENNQKTKAIETFERALVIDAKHLPSLNEL
GITFRENNQKTKAIETFERALVIDAKHLPSLNELGITFRENNQKTKAIET
FERALVIDAKHLPSLNELGITFRENNQKTKAIETFERALVIDPKNLPSLN
ELGITFRENNQKTKAIETFERALVIDAKHLPSLNELGITFRENNQKTKAI
ETFERALVIDAKHLPSLNELGITFRENNQKTKAIETFERALVIDAKHLPS
LNELGITFRENNQKTKAIETFERALVIDPKDLPSLNELGITFRENNQKTK
AIETFERALVIDAKHLPSLNELGITFRENNQKTKAIETFERALVIDAKHL
PSLNELGITFRENNQIEEAIKVCKRALNISKDRQLYLNLLQIYLFFKSDK
QISKEIYDILLMPPRLHAFSASRKKYENIIRDMDYLLSISFDDVKQYESF
LFLAIQYKAYEKVLFILEKLNDQFPDNSKIKSRLGKTLSNQVIGEHEKGG
RFLKQAIGLFKKENNIQQLQGHIIYYFYNLLNQNQIELIEKEMMTYEKDL
IYDANYFRFMANFSFVKNSNINDAISYFEKAIEISEVLMDKKEFAESLLR
FLSEQKSLHYKTYFVKYEKYI
>Cag_0666 hypothetical protein
MNKKIMNLPHKVNKKIDLIEFNKIKAIFRREFKRGTFKLLDVGSGLCSFP
NYIKNEFENAKIYCIDINKDLVDLATKSGYNAQEGDLTKLNYNDNTFDVV
HCSHVVEHLPYPHVIEAIDELVRVCKSNGLIIIRSPLWANHRFYNDIDHI
RPYPPNSILNYFANQQQQKVSKHRIVEEDRWYTKIYYEINPLRFNYKIIK
YINFFLKISWLFLSFPIDRPNNYGIVLRKRSGL
>Cag_0297 conserved hypothetical protein
MENSSNIARQRKAVKSRACEPNENLLAADGWRVFKIMSEFVNGFETMSSC
GAAVSMFGATRALPESKEYQLAEELGELLAREGFAVITGGGPGIMEAGNK
GAQKAGGVSIGFNIKLPEQQHPNHYIDQEKLLHFDYFFVRKMMFLKYAQA
FIALPGGFGTLDEVSEAIALIQTGKSERFPIILVGKSFWQGFYDWIRQTL
LEEKGYINTFDLDFIYLEDDPKEVVAIITRFYPEGYTLNF
>Cag_0277 Sel1-like repeat
MKRKFFTSILCLLLFSGTVHAESPQEIQQLRIAAEQGNAAAQFNLGIKYQ
FGKGVRQDYVEVIKWFRLAAEQNHVYAQLMLGTMYRNGEGVRQDYIEAIK
WFRLAAEQRYADSQYSLGLMYAGGKGVSKDYVEAIKWFRLAAEQGNVEAQ
AMLGSIFYVGKNVQRDEFEAIKWFKLAAQQNYAYAQMMLGTMYATGEGVR
QDYVEAIKWYRFAAEQGNVEAQYDLGLLYLNGYGVRQNKAIAKEWFGKAC
DSGSQEGCNQYRALNIPQKHR
>Cag_0302 conserved hypothetical protein
MKILERYIFQQFIKAFLFTALVFVSLFIIINMIEKLGNFMDHHVSALEIA
RYYLLSIPSIFLVTSPVSALLASILVAGKLATQNELPAIRSAGVSMRQLL
TPFAWGALLLFLFNFFNAGWLAPTTYSHNRTFEQLYLGKNAGDQETRNLH
LLDSGNRFISIGAFNPINESLNNVSIERLSGATMISRIDADSMHYNRRTK
RWTMWRVTERYFSNGYQSFTTKPTATIRLALRPKALHEMRLQPDEITLPR
HYQFLREKEEAGFSGLERSAVKFHNKIAMPFASLIITLIGVPLAARKKRG
GIAAEIAITLFIGFLYLGIQRTIAIAGYQGVLPPIVAAWLPNLLFLVVGW
VLYKKSTDS
>Cag_1487 putative plasmid maintenance system antidote protein, XRE family
MATLRNIHPGEILMEEFLLPFGISLRKLSYDIVISQQQLEAIVEGRERIT
ADIALRLSQYFGNSAKFWLGLQDDYDIEEGMEQNKVDILSILRFKQPVST
SVE
>Cag_0251 ATPase
MNSILSSHKLKKSYNKHPVVTNSSIEVRQGDIVGLLGPNGAGKTTTFYMI
VGLVRPDSGEVRLDDKPITHLPMYKRARLGVGYLPQEASVFRNLSVEENI
AAVLEFTALSKKERQERMEKMLEELNITHIRKSMGYALSGGERRRTEIAR
ALALNPKFILLDEPFAGVDPIAVEDIQHIVAGLAKRNIGVLITDHNVHET
LSITNRAYLLFDGSIFMKGTPEEIADNPEVRKMYLGEKFTLERY
>Cag_1785 conserved hypothetical protein
MKQPLKIAHISDIHLSGANDRSHAARLTRLLQHLRNEQFDHLVITGDLGN
HADPDEWRVVQQLLKQTEWYHWERCTILPGNHDLMNLEEEMRLYNALNPI
QWFRQKAFQRKRQLFCELFYEIMGGKNQTFPFLKILNYPTLRLALVALDS
VAAWHPSTNPLGARGFIEPQQLTALQQPQIAEALRSCVVIGLCHHAYKVY
GTDSLIDQAFDWTMELQNRDAFFSLMQQLGASIVLHGHFHRFQSYQKEGI
TFINGGCFRYNPYRYSELLLEADGSFQQQFLSLEEK
>Cag_2009 conserved hypothetical protein
MKQRKEPKPFGLLISGVYRSLGLEEPYQQFKALQVWREVVGEAIAEVTTL
ERFTAGQLYIKVNNAAWRLELNFRKRDIIQRLNKELGSPLVQEIIFR
>Cag_0266 Protein of unknown function DUF132
MSKYQIVVDTNVFVTALRSQYGASYKLFSLIDKDIYQLNISVPLVLEYEA
VAKRMIDKILLNEEEVDNILNFVIQNSNRWEIYYLWRPQLKDPCDDMVLE
LAITAACNYIVTYNINDFKGIEGFGIEAITPKAFLKLIGEL
>Cag_1623 conserved hypothetical protein
MKVIGINGSPRKDGNTAKLINMVFEPLQAEGIECELIQVGGTLIRGCLSC
YQCVKLKDKRCSTKNDSFNEIFEKIIAADALILGSPVYFADITPELKALI
DRTGFVARVNGHMLRHKVGAGVVSLRRGGAIHAFDSINHLFQISSMFTVG
STYWNVAFGGRTGNEVEGDVEGVENMHDLGSSMAFLLKKLHCCE
>Cag_0679 Protein of unknown function DUF132
MSYRIVFDTNCIISALLFSRQKMARLRYSWQSDAVIPLVCKETVSELLRV
LTYPKFKLTRDERLLLLADFLPYAETITVLDVPSNLPVIRDSADQIFLTL
AVVGNADALVTGDNDLLTIKDSFKMLPIMSLNEFNQWLK
>Cag_0904 conserved hypothetical protein
MAETDTPYLRFADISLDISRGNYSAARQKLEVLEPLMPESYHLNLLYARA
LAGMERYAQACKYLQACCTLAPANEVAWYELATMQALAENDTSNAMESSS
AYDPVVDELEQLSAALMKAGPILASDSSEPTSIAEQKQPFADDTEIAVPT
ESLATLFIAQGAYKKAIRMYSHLIQLKPNNARFYQDEIDRLLDRL
>Cag_1498 ATPase
MSVAIELKHITKKFGSFTANHNISLEIAEGSIHAFVGENGAGKSTLTKLL
YGMHQPTSGDIVLHGTSTHFSSPRDAIRAGIGMVHQHFMLIEELTVTENI
MLGYEQASLFGSLPLKKAKERIAADALASGLTINPDARISSLSVGEQQRV
EILKLLFRNASIMLFDEPTAVLTPAETEQLFTTLRALRAQSKTIVLITHK
LDEVLSVADTVSVMRQGEIVATKSVAGVTREELARLMVGRDVLLRVENSP
HATNAPVLEINNLTYRALNGNEKLRQLTLTLHAGEVYGIAGVEGNGQSEL
LQALWGLVPEGVKVSGNITMQGSSLLGKSAAAIAALGVSHAPENRLHHAI
IGDYSVSDNLIFGRHREATFHHGMGFNRATVERFSNAMIADFDIRCTNAL
RQPISALSGGNQQKVVIARELTRPNLKLLILAQPTRGVDIGAIETIHKKI
LAARTSGIAILLISSELEEIIALSTRIGCIYKGTIRHQFSAEEVERNRQH
GRAFSERIGTYIT
>Cag_0629 Hit family protein
MQSFKEESYLCQSGKSIFADLAPEEDEQHFVLHRAKKCFIIMNLYPYNCG
HLMVIPYLQTPEFSDLDRETWLEVMELTDLSIRALKKVMRPHGFNTGANL
GRIAGGSVDNHIHFHIVPRWDGDTNFMPVLADVKVLSNDMISTYKNLKAA
ITELLAQEAQ
>Cag_0665 probable polysaccharide biosynthesis protein
MRYLKNTSWLFGEKVLKLFVGLFVGVWVAKYLGPKQFGLFSYAQSFVGLF
STIATLGLDGIVVRELVKDEKRRDELIGTAFWLKVIGAIAVLLVLAIAIN
FTSNDSYTNILVFIIASATVFQSFNVVDFFFQAKVMGKYITYTNTITLFT
SSIVKITLILSNAPLIAFAWTVLFDSIVLALGFIYFYLQQTKYSKPHFTF
RKEIAVSLLKDSWPLILSGFVISIYMKIDQVMIQEMIGSEAVGQYAAAVR
ISEAWYFIPMVVSSSLFPAIINAKNQSDDLYYARLQKLYNLMVWMAIAVA
LPMTFLSDWIVNLLYGELYNEAGDVLMIHIWAGLFVSLGVARGSWIMAEN
LQLFSTYFIGIAGAINVIGNYMFIPTYGINGAAFTTLVSYAISVIVAPYF
FRATRHSVYMLMKAMLMFNLFRSLDE
>Cag_1938 Sel1-like repeat
MNVMKKFITSIVIASLTLLAINGFCETPSQKQISQWQQAAAQGNSEAQLN
LGYAYDHGEGVKQDYAEAIKWYRLSAAQGDVKAQFNLGVMYYNGEGVKQD
YAEAIKWFRLLATQGDAIAQFNLGVMYYNGEGVKQDYTDALKWFQLSAAQ
GNAMAQNNLGVMYAKGEGVQQDYAEALKWHRLSAAQGNAMAQNNLGAMYY
KGEGVEQDYVEALKWYRLSAAQGDAVAQWILGLMYYEGQGVRQDYGEAIK
WYRLSAAQEDAKAQYNLGLMYYNGEGVKQDYAEALKWHRLSAAQGNAMAQ
NNLGAMYAKGEGVQQDYAEALKWHRLSAAQGDATAQGILGLMYCEGYGVR
QNYGEALKWYRLSAAQGNAGAQYNLGLMYYNGTGVRQSKAIAKEWFGKAC
DNGFQDGCDAYRELNEAGAKTNRSR
>Cag_0740 Type I secretion system ATPase, PrtD
MKKPEVKSPLREALWAQLPALLKTFYFSIVVNVLVLAPSVYMMEVYDRVV
NSRSHNTLLMLTLLVVGAYLLLEALEWVRRQIMQSAALQLDGKLREEVFS
AIFAARLQNIPSAGAQALRDLKSIREFLPSQALLAMVDTPLALLVLILLF
LMAPLFGWFSVAGAVVQFGIGFFNERRIRKPLQEANRSAMVAQGYADGVI
RNAQVIESMGMLPHIHRRWMERQQEFLVNQATASDHAGTNAALSKLLQSL
LSSLLLGVGCWLTLKGEVFGSAMIVASILGGRVLAPLVQIIGSWRQVEGV
MEAYHRLEAMLRELPMPQKGMPLPAPTGQLSVEGIIAGAPRSPMPILKGV
SFRVMPGGTLAVVGPSASGKTTLARLLVGIWPSTQGKVRLDGQDIYLWDK
EELGRYVGYLPQNVELFEGTIAENIARFGEPELEKVEAACRLVGLDALMV
NWPKGYDTQIGEDGAFLSGGERQRVALARAVYDMPKLVVLDEPNASLDEA
GDAALINTVKKLRENGTTVIVMTHRLNILAAIEYMLVLVDGQVQKFGTVK
EVMEALQNPQQAGGAQQQPQPKPAPQPKPMPSTPRLA
>Cag_0383 oxidoreductase, short-chain dehydrogenase/reductase family
MTKRSYLHSISYSGYSYAIPSGLKTREKTIGNMQQRHNSLGIVITGGSKG
LGFALAARFLAEGDRVVLCARNGERLEAALAALRQQVPTGEVYGIACDVA
DTAAPPLLAQFAVAKLGNIDRWINNAGTAGLQKRPLWQLAGSDIAETCTT
NLAGTMAMCAEAVRVMQRQPSAPQACYHIFNMGFSAVGASFSRSAVPHKA
SKRGVAEITHFLARELHEAAIRSIGVHELSPGLVLTDLLLRDAPADTRRF
LQVVAQTPEAVAAVLAPKIRKVRGLNRTVRYEPLVAMVFRMVAGLPRLLR
SASATS
>Cag_0196 conserved hypothetical protein
MQQPTPQQSLTTMPVEGRLFALALCSLLIVLTTTVPYLTLINVLFFSGIF
WSGFIALHQTILRYQVPLSLRNAFVLGSLAGFVGGLASELLGIILMVLFD
YRPGIESLSLIVEWATQQAMQNPELQEQVNMLQEAEKLAKTPITLGITDV
LFNLAVTGMVYAPIAGLGGMFAVRWLKFQAARK
>Cag_0039 Drug resistance transporter EmrB/QacA subfamily
MANAPSLSASPKLLGTTEEHYETGWRKLIITLTVIVSAMLELIDTTIVNV
AITQISGNLGASIEDTAWVVTSYAIANVIVIPLSGFLGNLLGRRNYYIGS
ILLFTVASLLCGVATDIWTLVFFRFVQGIGGGALLPTSQAILYETFRPEE
RGKATGIFSMGLVLGPTIGPLLGGYLVDYFNWEWCFFVNIPIGLLAAWSS
FIFLKEPKVTHTVSKIDWAGIGLLAVGIGSLQFILERGESKDWFETPYIT
WFTIIAVLSLIAFVWHELHTKEPAVDLRVLARSHNLPIAAVLTFIVGFGL
YGSLFVFPVFVQGLLGFTAVLTGLVLFPSAMVTGMISMPLGMALQKGASP
KHLMLFGMLTFSLFCWLLGQQTLQSGAENFFWILLLRGIALGFIFIPVTM
LAISGLHGKDIGQATGLNNMVRQLGGSFGIAIANTYIAKRVAAHRTELLS
HLSPYDPEAMNRIHAIAAKATAEHGLPPASAELAALKALEGTVTVQSTHL
AFMDAFMLIALLFLCAVPLLFFIRLHKGEQASAMGGH
>Cag_1179 hypothetical protein
MQIPLNQFENYIDETILKRGLSYFKNGYVHEPEEIKQGEFEALVEGTEDY
TVRLTIEKGIITDYSCTCPYDYGPICKHITALIFYLQQEELGLEVQPPKS
KTTTTKAKKTTKRKTIAEQVDELLDKVNHDELKQFVKKTVLADKKLRQNL
ILHFFHLTTNEDSKDFYAQQIKTILQIAKDSDGYISYSAVHNVCNITDQY
LAAAQNEIANNKYNKAISICTAVIEEMTKGLQFTDDSDGYIGDAIFEALE
ILQNIAKSNPPETVRIQLLEYAISAYKKSCFDGWNDHFDMIHFATLLVKK
DVEINNLIDILKNSIHTAHYKEKVQEIIYELLVKFKKLEEAETYLEQNIS
NSSFRLKVLETAYQNHNYSKAKLLANDAIKQENGISNSTWYEWLLKIAQA
ENNIKSIIEVARYLLLQGYDSECDTYYALLQQHIAPEEWNGFVEKMIAEI
KKKPYHLHYWLLPKFYIKQEQWERLLNFVQSTERIDILMTYDKYLVNNYF
TEIMDMYKKYILHSLNRAAQRNEYQKACEYIQRVVTLGGVRTAEIIISYL
KNNYPRRPALMEELSHIKLR
>Cag_1769 Protein of unknown function DUF132
MQKDNAIRVVIDTNIWIGFLIGKTFSDLYKAIINDQIKILFSDELFAELI
EVLQRPKFHKYFSQNDIAELISLIHLKTEFVEITDQFNDCRDPKDNFLLD
VCVSGHADYLLTGDDDLLILNPFHEVKIINYRKFTDILKRV
>Cag_1313 ExsB
MDSLVTTAIAQQAGFELAAMHVNYGQRTMQRELNSFRAICSHYSIQQRLE
INADFLGKIGGSSLTDLSMPVSVANLESHAIPASYVPFRNAGFLSMAVSW
AEVIGAERIFIGAVEEDSSGYPDCRKIFYEAFNRVIELGTKPETHIEVVT
PLIALKKWEIVRKGIELHAPFAFSWSCYKNEGQACGVCDSCALRLRAFEQ
AGMEDPIDYETRPHYIDC
>Cag_0909 conserved hypothetical protein
MTITFSQKARLQIEENVRFIAADKPNAARKWAAGVKQAVYKLKEFPYLGR
QVPEYANDTLRELIYGEYRIVYQVNIELSRIEVLSLFHSKQLL
>Cag_1040 hypothetical protein
MIDIFFTNPMRLIRSCTSAVEDDSHHSKQHVNRRKTMKKTIWLAAGVAGM
LLGNPTVNAQAETRDIIQPRSDHSFVIDARPSFIYLPDQGFAVSVDSPYD
IISGDDHYYMNQKGSWYRSSSYRGPWKLRKEKNLPSKIKKHRLEDIRMYR
DAEYNKIINQRNSQQQRMDNNRPR
>Cag_0100 tRNA modification GTPase TrmE
MRNSLSFQDEPIVALATPLGVGALAVVRMSGQGVFDIARKVFHKQGAPDF
HLASSKGFQAHFGTIHDAQGVVDEVIALVFRSPRSFTMEDMVEFSCHGGP
VVVQHLLKALIDAGCRLAEPGEFTRRAFLNGRIDLLQAEAIGEMIHARSE
SAFRTAVTQMQGRLSRQLEEMREKLLHSCALLELELDFSEEDVEFQNREE
LREDVQRLQGEINRLLDSYQHGRLLKEGVATVLVGSPNAGKSTLLNALLG
EERSIVSHQPGTTRDYIEEPLLLGSTLFRLIDTAGLREGEEEVEHEGIRR
SYRKIAEADVVLYLLDVSHPDYCNELSDITSLLEQASPNVQLLLVANKCD
AITNPTERLAQLQAAMPQATVCGIAAKEGDGLEALKQQMSNMVAGLDKLH
EASVLITSMRHYEALRRASDALENGACLVAEHAETELVAFELRSALEAVG
EITGKVVNDEILSLIFERFCIGK
>Cag_1720 thioesterase, menaquinone synthesis protein
MIMQPPLHIEIIGNKALPKIVFLHGFLGSGRDWLPLAEMLTSHYCCVLVD
LPGHGSATLSASDEHHAYFTATVEALATVIQPISPEPCRLVGYSMGGRIA
LALMLTHPELFHQAVIVSASPGLPTEEERAKRRAGDEGIARKIERNFPDF
LEAWYQQPLFSTLKNHPLFQEIERKRAINNSESLAAALRLLGTGQQPSFW
DALSKCAVPTLFIAGEKDERYVAIARQMVKLAPHATLSIVPNCGHTLHIE
NKESFVEQLHTFFNQ
>Cag_0886 conserved hypothetical protein
MRKKRAGRPQHCRCVQDLPKVTCFTPSGVAPEAVEQVLMTVDELEAMRLA
DRDGLYHADAAMQMKVSRPTFGRILESGRRKVADALVGGKQICIKGGTVL
AVCDSIPTERPDICLCPTCGREFPHIKGVPCRNSICPDCNEPLQRKGGCL
SDEESDEQENEQRTVGYPESEEELEIE
>Cag_0045 Aminodeoxychorismate lyase
MKSPRSFITRLILAVTLLIAAFPLGFLLIPGLNSKSKPTQLVVHREMRFS
DVLDKLQASGAIRERWQPELIARMVPKFRTIKAGRYTIPPNTSNFGLLWY
LRTHPLDEVRVTLPEGIDRRKMARILSRKLDFDSTQFMAATENPRLLAKY
GIRASHAEGYLLPGTYDFAWGSSPDEAASFLIRQFKKLYTTERQQRAAAL
GFNEHSLLTLASIVEAETPLDKEKPTVASVYLHRLRIGMRLQADPTVQYA
LGGTTRRLYYKDLAIASPYNTYRNKGLPPGPICNPGKASIIAVLNAPQSG
YLYFVATGTGGHYFGASLQEHHANVQKYKQARSSNE
>Cag_1169 TPR repeat
MKPFFTFLRYAFIGILTLSLVTPEVVDAAKKSKKKSSSRKKSSKRNARAK
KGSNKKTSARQARLRVVDGVETERNSINLTASPSSASRQLNKRAMGFYEQ
GRYAEAEPLYRELLTLDEKQLGSRHPEVAVTLNNLASLLQQQGRYNEAEP
LYRRALSIREENFGADDASVAQSLNNLGSLLQDQGRYYEARQLYSRSLAI
DEKVLGTDHPDVAADLNNLASLLQAQGRYAEAEPLYRRSLAIREQRFGAE
HTLVAMSLNNLGVLLQAQGRYSEAEPLYRRSLAIREAQYPANNHSIVATS
LNNLASLLQARGKLTEAEPIYQRALSINEQTLGENHPSVATSLNNLAGLL
RAQGRYADAEPLYRRSLTIREEQLGENHPDVAMSLNNLGVLLQAQGRASE
AEPLYRRALLIDEKVLGATHPQTIRLRNNLNALLNPSAIPLTTQ
>Cag_0596 acetyltransferase, GNAT family
MKHMNPKIVIRNEANADIRAISEVTAAAFQTLEISNHTEQFIIEALRATK
ALTVSLVAEIDGLVIGHIAFSPVTISDGTRNWYGLGPVSVLPAYQQQGIG
KALIWEGLLRLKDMNAQGCCLVGHPDYYIKFGFKNLPGLVHEGVPLEVFF
ALPFDGHIPQGTVTFHEGFKADG
>Cag_0537 Glutamate synthase (NADPH), homotetrameric
MTMDAKTITTKQRLAIPRQAMPAQDPAVRTHNFLEVNLGYTPELAQQEAL
RCIQCKDPVCIKGCPVNIKIDQFIKLIAEGDFLGAARKIKEDNVLPAICG
RVCPQEDQCEKVCVLTKKYTPIAIGNLERFAADYEREHGDIELPSVKAPT
GKRVAVIGSGPAGLSCANDLIQLGHDVTVFEALHELGGVLMYGIPEFRLP
KEIVRTEIDGLKKLGVKFVTNTVVGRSVTVDELMEEENFDAIFIGVGAGL
PWFMGIPGENLLGVYSANEFLTRVNLMKSYNFPDNDTPVFNCEGKNVAVF
GGGNTAMDAVRTAKRLGAKNAYIVYRRSEAEMPARIEEVHHARSEGIEFL
MLMNPLEFIGNDEQWLTGAKCLRMELGEPDDSGRRRPVPIKDSEFILPID
MAVISIGNGSNPLIKQTTPDISVSKRDTIVVDLNTMATSKENVYAGGDIV
TGGATVILAMGAGRTAAAAIHEKLMGGSAS
>Cag_0481 Phosphoesterase PHP-like
MPYSTSSVGHNGFQKADLHIHTKCSDGLFTPEEIVEKAARIGLNAISITD
HDSVLGIDKAKPLALEKGVELIAGVEMSSTYKGYDIHILGYFFDYQHSEL
KDYLDHCRQLRTDRAERMVSKLAKMGVKIGIEQIIVKAQNGSVGRPHIAA
VLQDGGYVKSFSEAFSKYLGAHSPAYVKSVETHPADIIRLINKASGLSFL
AHPAQNVPDEVLKQLITLGLDGIEIIHPSHDTYRQNYYREIANEYFLLFS
GGSDYHGIRERDEDLFGKVTIPYDWVKKMKSRLMLA
>Cag_1886 von Willebrand factor, type A
MSEWLASLPTLTFAAPWWLVLLPLVAIVLWLKERWQRQVAAISFPDVQRF
ERAKLVAPRWMVRMPQWFRWAALAVGMLLLAEPHLTLRSTTAAARGIDMV
LAIDISESMMQSQTDTQSRFEIARQAARNVVEQRSNDRIGLVVFRGEAYT
LSPLTRDHTVLSLLLDNLSSRIIQDDGTAIGSALLVALNRLQASESELQM
VILLTDGENNAGEVSPLTAAALAARRGVRFYVLNVAFESVKDENAPRSAL
YAAELQEVARRTGGSYFTVNNKTELETTIASIAARAKNGQGNMVVVQHNA
VTQPLLLLLLSLLGLELLVSATRLLKIPS
>Cag_0081 Biotin biosynthesis protein BioC
MAQCVDKLLVGERFRKALATYREHAVVQHAMAEDLAAMLARHLPSSPTIK
RLFEIGAGSGALMEALLHRFCIDHYFANDLVAESEGCLRPLLAPYREEAF
TFLMGDIEVLAEWPSSLEVVISNATVQWLEQPAHFFQQAAKALQPHGLLL
LSSFGASNMQELSSLLGVGLRYHAPDELIALASHSFDLLEVKEEQKELLF
CSPEAVLHHLRCTGVNGVVRTQWTKSDYKRFLSSYRERFSTTDGKVVLTY
HPYAIALQRKG
>Cag_1932 CBS
MEIFLLFVLILVNGAFAMSEIALVTAKRSRLSRLADDGDKSATTAMKLGE
DSTSFLSTIQIGITSIGILNGIVGEGALAVPFSLFIHSATGIELETAQLI
ATVVVVLGITYVTIVVGELVPKRLGQLNPEQIACLVARPMQILATITRPF
GRLLSFSTNTLLRLMGVKPQITPSVTEEEIHAMLEEGSEAGVIEQQERDM
VRNVFRLDDRQLGSLMVPRADIVFLDVTQPLEENICRVTESEHSRFPVCN
GNLQSLLGVVNAKQLLLKTLRGGLTEFATLLQPCVYVPETLTGMELLDHF
RTSGTQMVFVVDEYGEIQGLVTLQDLLEAVTGEFVPRNLEDSWAVERADG
SWLLDGLIPVPELKDTLKLKEVPDEDKGLYHTLSGMIMWLLGRMPHTGDV
LVWEEWNLEIVDLDGQRIDKVLASPLNNAPKASQKEEKPPVKSDDNAALC
SIRPTP
>Cag_0284 oxidoreductase, Gfo/Idh/MocA family
MSSNIKIGFIGSGWSRIAQAPAFSLMDNVSMAAVASPTADHRQKFMKMFD
VEHGFADWRDLLQCDLDFVCVTTPPFLHKEMVEGVLRSGKGVLCEKPFAL
SVADATEMADVATRTPLLALVDHQLRFHPAVRHMKQMIDGGEIGKVFEVR
AVVNLASRNKTDLPWSWWSDATRGGGALGAIGSHLIDLNRYLVGEIAEVS
CNLGTSIAHRPDGNGNLLPVTSDDHFAMMMKFGRSSVALGSSSLMHVTTV
GAYTWFSFEVVGSLKTIRLDGGGRLWEVYNDNAQRGRSLIDMPRWKQIEP
ILPWDELVLQEKIKQSSLAVHGIFAVGFAFLAHRIVKALQCGEKAILDAA
TFADGLMIQRVLHAGTESHQQRNWVKI
>Cag_0486 TPR repeat
MLDNNSTHIQPAGGFAAISKYNPHLWSAEQLRAIFVARTNELADLVQTLR
MVQPDTVAQHVLLVGARGMGKSTLMRRLALAVEDDPSLSANWLPLRFPEE
QYTVATLGQFWANVLDSFADTLQHLGESVIALDAAAERIAALPVTQQPEA
YIDAINHFADERKQRLLLLVDNTDMLLHNIGKDAHWGLRATLQSNPRLFW
IGGSYQSLEAESNYHDAFLDFFRVINLRPLKVEEMRQALLALAETFGGAT
ARNAMVHQLDLQPERLPTLRQLSGGNPRTTVMLYEILANGQNGNVRSDLE
ALLDNMTPLYKARMDSLADLQRKLLAHILEHWAPISFGELAAVSQVAKGT
ISPQLQRLEIEGLIEKTSLHGTTRSGYQAAERFFNIWYLMRFSPRRQRNR
LVWLVEFMRLWFSGDELCSLAKQRMSVGSNDLRSTHDLEYDRALADALPQ
FAPERHALRWSLLKLLQENNSQLVELFDFDGEDKEFKGATDYLRRLAALP
SLLRQCPHAVTEQEKTHWVETVLGSISLTLEEKEIVAQKAEYLTLFQYDE
LLKVFSEEQKRWEKQFGVAALQTVRSAVLSQDFFSDMPDSHLAYEQVRVC
FANNKEALRFVSLLFYSKHKDEWSYKAQKLALNLLHDDSKSDWFLREKLV
RYEETEKAYCKAIEFNKEDAVTWNHVGNLLKDYLGRYEEAEAAYRQAIAI
DKKFAYPWNNLGQLLHYNLNRYEESEAAYRQAIALDEKYAYPWFNLGQLL
HYKLERYEESEAAYRQAIAIDENNAYPWNNLGQLLHEWLGRYEEAETAYR
QAIALDEKYVYPVTNLARLLAQRNRKAEAETYYREAVLKDTQDTQQLFLQ
AHLFLGNRQLAMDALQALAEKAQNGNQYAFYRLKEQVWECYELGLGERLA
DWMAESNVAEFLTPFIQALYTLAGVNEKLRDLPMESQHMVDEIVRKARLR
QEKREACNMRAKSIH
>Cag_0418 hydrolase, haloacid dehalogenase-like family
MLKKLVLFDIDGTLLSMTSANRRILADALLAVYGTAGSSYTHNFAGKMDS
AIIYEVLAADGLTRKEVASRFDLAKEMYITFMQKVARVDDVTLMPGIVEL
LDALAERDDVLLGLLTGNFEGSGRHKLHLASINHYFPFGAFADDAEHRNQ
LPAIAVTKAHQRTGITFSEHNIIIIGDTEHDIACARAVQAKSIAVATGTY
APHELEAHEPHLLYHNFSDTQAVLNDILSH
>Cag_1196 adenine specific DNA methyltransferase
MSMTIQEYVAALNRRYKTGIATEHSYRADLQALLSDLLPKVEVTNEPRRS
ACGAPDFILTRKNIPIGYIEAKDIGKSLDNKLYREQFDRYRHALQNLVIT
DYLEFRFYREGDLVTSLVLAEMHKGKVVIRQENVAAFADLMQDFGGYEGQ
TIGSASKLSKMMAAKARLLADVLEKALDGYSDENNGAIDEASNTLYDQLK
GFRDVLIHDITPKQFGDIYAQTIAYGMFAARLHDPTLENFNRKEAAELIP
KTNPFLRKLFQYIAGYDLDDRLVWIVDALADIFKATDVNSLLKDFRNATQ
QNDPIIHFYETFLAEYDPTLRKSRGVWYTPEPVVNFIVRAVDDILKTEFD
LRDGLTDTSKITVEIDKATTDKNFKSKHIKQKQEVHKVQILDPAVGTGTF
LAEIIKHIHKQFEGQEGMWNNYVSHHLIPRLNGFEILMASYAMAHLKLDL
LLAETGYTSTTDQRFRVFLTNSLEEHHPETGTLFASWLSQEANEANYIKR
DTPVMVVLGNPPYSGHSANKSKWIEELLRDYKQEPNGGKLQEKNPKWLND
DYVKFIRYGQYFVEKNGEGILGFINNHSFLDNPTFRGMRWHLLSTFDAIY
LIDLHGNAKKKEACPDGSSDKNVFDIQQGVSINLFVKTGKKKKGALAEVF
HYDVYGDRPFKYDFLSKKSLSTVDFTKLTVAAPNYLFVPKDFSVLASYNQ
GFAINELFSLNSVGIVTARDRFVIDSSKLALTQRIKNFFSLDKDELLRMY
GLKENSAWRIHTIKQSTIRYSADFIQMLAYRPFDNRYVYYDPLFIERSRS
DVMKHFLQENIVLTVCRQVKAGESYHHVFLANKIFESCLISNKTSEIGYG
YPLYRYLNINNQIPTNGEPQRIPNLNAGIVNQIAIVLGLTFTPEKEETDG
TFAPIDILDYIYAVLHSPAYREKYKEFLKIDFPCVPYPQEPQQFWQLVAL
GSELRQLHLLESPTVEQRLTSYPIAGNDVIEKITYLDGKVYINEKQYFEG
VPLVAWEFYIGGYQPAQKWLKDRKGMALCYDDVRHYQRIIIALTETARIM
EEIDTVGVE
>Cag_0575 conserved hypothetical protein
MATRLIWSPEALEDIELIASYIERDSLWYAKVVASKIFAVAETIAKNPKA
GRVVPEIANPCIRERLVYNYRIIYRIDVELILVVAVIHGARLLPSLTHRF
EFEQ
>Cag_0160 TPR repeat
MNILLNKLVIIFEFVVGVTTLLLGADFLGVEWKQYPFLQRLDGQQPLLLL
LTFCSLLSILVIKLKNNSPAFEAVQTKDGTTIVKATLSGFEQELDDAADE
IKKKAQEYFEAGERDFANEQYAQAASNYKASFELLPTMAASLNAGIAFSI
TSDYQAAEELYQQGLRIAQHKRDKEYEAAFLGNIGVGYRSSEKFADALDY
QQKAFKLNQTIRNQRGQANQLCNIGLIYNDKGLLDAALGYFKQSLELYET
IGDTSGQAQNLRSIGIIYRRKGKLNEALSCHQQALNIDKANKNASGEAEN
LNNIGIIYKGKKEFDKALNCCYQALDICKRIGYKFGEGLALSAIGVIYYS
KGELDNALDYCNQALKLYKSIDSHLIAQAENLNNIGLIYQDKGQLDQALI
YLGQSYLLFQKTGVTLQLKKTEANIQEVKRLQAAKGKN
>Cag_1421 conserved hypothetical protein
MKAIILYDSKSQGGSTDRLVDAIGVKLAEAGHYVEKARCKSNGDYSFVKE
FDMVIMGSPIYYLMVSTELLGSMFQSNLKSCVEGKQIGLFLLCGSPEIMG
NLLYLPQLKLHLLGQSLVAEKIFAPDQASNPEAISAYANKLLAALKNH
>Cag_1348 TPR repeat
MEQTAKPSAEEQLLYIVIKYKKALIGALVVIVTLGAGAFFGTRYQEQREQ
EAALQLSRVSPALEQGNFTLAINGTKQTAGLQKIANEYSGRFIGTPSGNM
AKLLLANAWYSFGKYETALQHFNEVTIAHEDLAAAALAGSGDCYLNLNKL
KEAAEAYQKAANKTDNRILKAQYLTHEATAYHYAKDFPKATELYKTVIAS
YPGTTAASIAQHGLWQLSGSL
>Cag_1662 3-oxoacyl-(acyl-carrier-protein) reductase
MLTGKIAVVTGAARGIGQAIATNLAARGADIVLCDIKAEWLTETADKVEA
IGSKAFCVELDVSNAASVQEVFNKIAEETGRIDILVNNAGITRDGLLMRM
SEEDWDAVLTVNLKGTFACTKAVSRIMMKQRSGSIINIASIIGLMGNAGQ
ANYAASKGGVISFTKSIAKELSSRNIRVNAVAPGFIASKMTDALSEEVKG
KMLEAIPLARFGEPEDVANVVSFLAGDESSYITGEVINISGGMVM
>Cag_1706 Protein of unknown function UPF0011
MASPSLQPATLYVVATPLGNLEDITLRAIKILQQVEIIACEDTRRASILL
KHLAISGKRLISYHTQNEPRAIAQIVALLEEGNNVALITDAGTPAVSDPG
FALLRAVHERGIVALPIPGASAVTAALSVCPLPLNTFLFGGFLPHKKGRK
TKLAQLSAIGQPFVLYESPYRIHKLLDELEAILPNAQLFIGREMTKLHEE
YLTGSIEEMRQHLTSSKTKGEFVVIVHPTAEKTINPESDTDADY
>Cag_0469 polysaccharide biosynthesis protein
MAALSKLKLLFKDTVIYGASTILARSLNYVLVPVYANTLSTFENGIQTII
YANIALANVLFSYGLETSYLKVAADTHREGSEGEKPLFSTAVLTLLATST
LFALLIVLLAPWIGALVGLDSGAAPFVRYAALILWLDTMLVIPFAELRLR
RKALHFATARLLGVVAVVLCALLFIVVMKVGLSGVFLAEAAGSVVSLLVV
LPLFRGFRFGFSGQQLREMLRIGLPYVPTGIAGLLIHLIDRNILIRIAPS
DIERLYGAGYQPSDIVGIYGRIAAFGVVLQLVIQVFRFAWQPFFLQHGKE
PDAQQLFHHVLSISTLLTMVLALSATFFVPDLVRYHYGGAFYLLPPPYWI
GLSILPAIFLSYVFDMVSTNLSAGLLLTGNTRYLPVVTFAGAVVTALSCW
WLVPLYGMDGAAYAIVAGTVVMSVVMGYYSLRFFPVSHDWAKLLLLLLTG
IGMVLVQRQSEALLSAPAQVIGVKVGLVLLYLALALFLFRNKATRLLKQV
RHRNSSPHVVS
>Cag_1702 TPR repeat
MLKLGKGVSTQPSIGRIGGEWYIGGMKQSFKHQALRRIKKGALLLCIFAA
TTLTACSNNELEKLQQEAWKNPNDAALTLQLGYKYAQEGRYMEANESFQK
VLALDPKRDEALQALGATAFRQKQYSQAISYFQQHLERAPADSARLYNLG
NAYMQLKQYDKATELYNKAIDNSTAFIDAHYNLAVCYAKTGRRNEAQAIY
EWLLTKNNYLATSLQKHLDKENQAPK
>Cag_1363 serine/threonine protein kinase
MRKRLFIKKQKFDKWELKRFLGGGGNGEVWECCDEEGNKGAIKLLKHVKS
KSYARFCDETKIMEQNFDIEGIIPILDKFLPEKLDGSIPYYVMPMAESAE
KVFKAKNIVSKIDSIIEICKTLAKLHERGIAHRDIKPPNLLVFNSRLALA
DFGLVDYPDKKDISLQNEEIGAKWTMAPEMRRESSKADSLKSDVYSLAKT
IWIILTENPKGFDGQYSIDSIIELKRFYNKTYTTPIDNLLTKCTDNDPNQ
RPTVNEIILELENWKVLNKDFHERNQEQWFEIQTKLFPMTFPKRVIWENI
EDIVKILKVVCTYDNLNHMLYPNGGGMDLEDVRLSHEKSCIELDCQLINI
VKPKRLLFESFGYTAEWNYFRLELYELEPSGAYENDEYYENIQYYEEYDG
YVSPENTMQLLRWFRGSFVIFNKRSVYNRISSTYDGRHNKMNTEEFRDYI
QEMVSHTIEMNKKKSAMATIESKRRKTR
>Cag_1777 PIN (PilT N terminus) domain
MNKRFLLDTNVLSELMKNNPEKKVIQWFDAHQHERFFTSSITKAEILLGI
ALLPKGRRQKQLYEATFKLFSTTFLEYCFPFCERAAVVYSDIVAHQKRIG
RGICTEDAQIAAIALTENLTLATRNVKDFHFIQGLEISNPWHG
>Cag_0043 conserved hypothetical protein
MPPAIKIIILANVAVFMLQRLPWGGELLSAFASLWPIGTGNFYIWQPVSY
MFMHGGLTHLFFNMFALWMFGAEIENYWGTRQFTIYYFICGIGAALINLI
ATMHSPYPTIGASGAVYGVLLAFGMMFPNRYIFLYFFFPIKAKYFIAGYA
LLEFVSGLGSREMGSGSNIAHFAHLGGMLIGFIYIILKRNDWALDDVVQK
MRSLRSSGGKKSSPYQANRNSSNSNKLTPTDDEINAILDKISAHGYASLT
DEERRKLLRAGGNG
>Cag_1325 nucleic acid-binding protein, contains PIN domain
MITYLLDTNIVIYTIKRRPIEVLETFNQHATRMAISAITLSELFYGAEKS
SNVSANLSVIEDFCSRLQVLPYGAKASQHYGAIRAILAKSGQQIGMNDLH
IAAHARSEGLILVTNNVKEFVRVPALQVENWVE
>Cag_0195 sodium:solute symporter family protein
MQPLDTALVLLFLVANIAFGLWQSKSNKSTGDYFLGGHSVPWIVAMLSIV
ATETSVLTFVSVPGLAYRGDWSFLQLPLGYIVGRVLVSMFLLPLYFREGV
SSIYEIIGRRFGTGMQKLASVAFLITRILGDGVRFLATGVVVQAVTGWSL
PLSIVLIGVVTLIYTISGGLKSVVWLDSFQFGLYFLGGVISISYLLQQLD
APFPTLFATLHEAGKLQVFQFSNDLLVNPMAFGAAFLGGVFLSFASHGVD
FMMVQRVLGCRSLSNARKAMIASGFFVFFQFAIFLLAGSLMFLFMEGREV
EKDREFAFFIVHHLPTGLKGILLAGILSAAMSTIASSINSLAASTVTDMA
GGKVSLTGSRWISFGWSLVLIAIALLFDESNKAIIMVGLEIASFTYGGLL
SLFLLSRSSRAFHPVSLAVGFLASMAVVVLLKYVGLAWSWYILLSVLLNV
LLVYGIDIVTNTISPKRL
>Cag_0015 conserved hypothetical protein
MQRDEILSILSNHKAEFSERYRFKKLGIFGSVARNSASTTSDIDIVVDME
PNILLRAQFKEELEQLLGSKIDLIRYWAKMNAYLKARIDQEAYYV
>Cag_1215 conserved hypothetical protein
MEQILQKLGIELNEQTRLSLDSTFQFGCHSKLSCYNSCCSNLDIFLTPYD
VLRMKNKLGITSTQFLSEYVEPVIQQESKLPFLRLKFQEGGQCRFVAPEG
CTIYSDRPVACRYYPVGFGIHKSQNAHGNDFYFLVREDHCKGFEETQEWT
VREWRKNQGIDEYDDNNRIWMDIILHKKLVSPDLEPDEKSLKMFFMASYD
IDSFKEFMFESRFLEIFEVEEEELELLKSNEAELMLFAHKWLQYALFKQP
TMTVRQQPKA
>Cag_0896 methyltransferase, putative
MPKTVPNSPHTAQSTQWFEAWFNHPLYMEVYRHRDHNEATQCIRTILQHT
ALEQATPATTTVLDIACGAGRHAIELARRGYNVTGNDLSTTLLNEAAKAA
KQEKLPLQLTNYDMRHVPTHQRYQLVVQLFTSFGYFDSKAEDGAVVQKVW
ELLHKNGWYVLDLLNPDYLAANFIAESQRQVGELTIKEKRTLEANRVRKE
LCILSPSGETLHFSEAVWLYSADEIVDILHNVGFHTTEIVGNYDGSTFNA
TSPRMMLFCHKA
>Cag_1759 conserved hypothetical protein
MKLEILESFENKYPNRDYTIEIVNPEFTSVCPITGLPDFGTITIRYVPNQ
RCVELKSLKYYFFEFRNAGIFYENITNKVLDDMVALLEPRSISVITEWKA
RGGITETVSVHYTSQS
>Cag_0903 Peptidase M20D, amidohydrolase
MKQEESHHPIAEAIQHKAAELFPEVVALRRDIHAHPELSLQEHRTTALIT
SYLMQLGITPEKPLLDTGVIALIRGTSPHHHGKVIALRADIDALPLQEEN
STDYCSIEAGKMHACGHDMHTAMLLGAAKILSGMKEQLAGDVLLIFQPSE
EKAPGGARPLLDAGLFATYKPILILGQHCFPTIECGSVAFCRGAFMAAAD
ELYITVNGKGGHASAPHKAADPVLAAAHMVTAVQQLVSRVVPPHEAAVVT
ISAINGGHATNVIPRQVTMMGTMRSMNEEVRAILQERLQQAITHTAQAFG
VEAELTIVKGYPVLYNNQTITDQASCICAEYLGHHQVQHCQPLMTAEDFA
YYLQECPGTFWQIGTGVREGETANTLHSPTFNPNEEALQVGTGLLAYNAY
RFLASLHGE
>Cag_1588 glutamine synthetase
MNSDSKVPVSTYFGAFTFDHKAMRAKLPKEEFVALQETIKAGKKITAEIA
GVVAHGMKEWAMEHGATHYTHWFQPMTGSTAEKHDAFLSIDRDGTPIERF
SGEQLIQGEPDASSFPSGGMRTTFEARGYTAWDPSSPAFLMKGGNGLTLC
IPTVFISYHGAALDAKTPLLRSMDAASKSALRLLGVLGVEGVKRVKTYAG
CEQEYFLVDKKFYTARPDLVMCGRTLLGALPPKGQQLEDHYFGSIPDRVL
EFMQEVEHELFLLGIPAKTRHNEVAPHQFEIAPIFEEANIAVDHNMLVME
VMRKIADKRGFALLLAEKPFAGINGSGKHNNWSIGTDTGINLLDPGDTPE
KSILFLIILVSVLKAVHKRADLLRMSIASMGNDHRLGANEAPPAVVTVFL
GDLLERVLDAIESGQVDLKTEKQVLNLGLSQVPEVNKDYTDRNRTSPFAF
TGNKFEFRAVGSSQAPSVANMVLNTIMAEALDEMSEAIEAKIQAGSDKDV
AVLETIREQITVTKPIRYPGDNYSEALQVAAHERGLPNLKNTPHSLRILL
KKDVQDMFIKYGVLSHEEIDARLHIRLERYIKGIDIEARTLLLMLKTYVI
PNVSEYQGDLGNSFNSLFAVSEAIGLSDKALDSQAKHLKMVAENLATLLD
MTNELEEAIEQIESCKSEFDKADYCADKLLPFMEEVRVVADRLEQVVDRS
RWQLPTYLEMLFEH
>Cag_0294 conserved hypothetical protein
MNSEILIYQNQTGDITIDVRLEEETVWLTQAQLCQLFQKSKATISEHIKN
VFEEGELDEKVVVRKFRITTQHGAMVGKTQEMEVNGYNLDVIISVGYRVK
SQQGTQFRIWATKRLKEYIIKGFVLNDERFKSGNAMNYFTELQERIREIR
LSERFFYQKIKDIYTTSIDYEPRAEKTIEFFKVVQNKLLWAISKQTAAEL
VYRRANAELPLMGMQSYDKKGAASIKKAEVSVAKNYLNEDEIKLLGLLVE
QYLAFAETMAQQRTPMYMKDWIDRLDTILQLNGRELLTHAGNISHDMALK
KSEVEFEKYRLSLKAVEKEESLRELEEDLKQLTKSAS
>Cag_1642 oxidoreductase, short-chain dehydrogenase/reductase family
MQNLWNDAELQGFVSNVCHEPDDHPELAALVYASRLLGRERSLVMHGGGN
TSVKCGLTDMVGNHAEVLLIKASGIDLSNVTCRDYTPLRLGPLSKLVELC
SSNDPIHAERVERFSTKEFKHLLMLNMFSLTDHMAEKRLTPSIETLLHAF
LPHRYILHTHSFALLTMSNQPNGEALCRETLGEAFGSVPYIKPGLGLARA
AAGVYEAHPAIEGLVLQKHGLVTFGETAQEAYNRMIDAVTKLEERIALAG
RKPFTTVPLPEEIAKVEDVAPIIRGACAEEKEVGRRDYQHLILDFRTSDE
ILTYVNSADVVRMSQKGSMTPDFIIRTKNKPLVVPAPDAADLNGFKAAVD
EAVQRYRDAYIAYFNAQQQASGMEVTMLDPMPRVVLVPGLGLFGLGKSAA
AAAVNADIATCTATAILDAESVGSFESISEREAFDIEYWDMEQAKINKVY
HGTFAGKVVMVTGGASGIGLATAKAFRQRGAELVVLDLSQEALDKAAEEI
GGNPLTLTCNVTSRADIRAAYDAVCKRYGGVDVIVSNVGAAIQGRIGDVS
DELLRKSFEINFFSHHYIAQEAVRVMRLQGTGGVLLFNVSKQAVNPGPDF
GPYGLPKAATMFLVRQYALDHGRDGIRANGINADRIRTGLLTEEMIKSRS
AARGLSEHEYMAGNLLQLEVYAEDVAEAFVHLAQEIRTNAAIITVDGGNI
AATLR
>Cag_1268 Elongator protein 3/MiaB/NifB
MAEIPAWLHTTNDANALASLLAPNATRSLESLAAEASAITRRRFGRTITL
YAPLYLSNHCSNGCAYCGFASDRTTPRRRLEMEEIRREIAAMKALGISDI
LLLTGERTPAADFDYLRQSVALAAEEMQRVAVEAFPMSVAEYRALAESGC
TSVTIYQETYNRKQYEALHRWGAKKDFLYRLETPARALEAGIKHVGLGVL
LGLSDPIEDALCLYRHVRHLERRYWRAGFSISFPRLRPESGGYQPPFPVD
DRQLARLIMAFRIALPNIELVLSTRESARFRDGMATLGITRMSVESRTTV
GGYAENETIKSSAGQFEICDDRNVEEFCAALRTQQIEPIFKNWERAYNAP
SMSCFL
>Cag_1302 ATPase-like
MIKQLTLTNWKSFAEATLYIDPLTILIGTNASGKSNTLDALLFLQRVSSG
IPIFQAIAGDVNLTPLRGGMEWVCRKPFNTFTLTVLTDGLSKDEEYRYTL
TVQVNGTKAEILHEELTLLIYGTNRTTSKEKRLFKTELDEINHPSIPTYC
YTGTQGRGRRFDLLRSHTILNQTETLNVRKEVQEGAKLVMTQLQRIFVFD
PIPSHMRNYAPLAETLLSDGSNLAGVLAGLEPSRKIEVEKTLTTYLKALP
ERDIKRVWTEHVGKFQSDAMLYCEEGWSNETTQEIDARGMSDGTLRYLAI
VTALLTRQSGSLLVIEEVDNGLHPSRAHLLIRMLKELGKQRGIDLIITTH
NPALLDAAGNRMIPFITVVHRNSSTGTSSLTLLEDIEQLPKLIAQGTLGD
LTSDGRLEEALQQKRGNGE
>Cag_1433 possible NtrR protein
MGMQNNRYMLDTNIASHIIKGDIPVVRERLIALPMEAIMVSSVTKGELMY
GLAKRDYPKVLTQKVNEFLLRIQVLAWDQDVAVVYGKFRSACETIGVTLS
PLDLMIAAHAHASNAILVTADKAFSRVPNLVIENWASEPS
>Cag_0112 von Willebrand factor, type A
MGREWFSLTKHSLEQTPQESLAELQRRIRRIEIRSRRKATELFSGEYHSS
FKGKGIEFSHVREYHYGDDVRSIDWNTSARNQDLYVKLFTEERERSLLLM
VDGSASMFFGSNQQSKKELAFELAAVLAFSALDNNDKVGLLIFTDQVELY
LPPRKGRRHVLLLLDKLSRHKPQSKQTNINAALSFLRYTLRRQEIVFLIT
DLIDSDYEKGMKQLNQRHDFILVHLRDALDTKLPLSGLLTLQDPESGERC
VVDMATPQQCERYKAMQERSIEELRQRMRRMRIDAIYLETDHSFFGALNA
FFRYREQKV
>Cag_1316 glycosyl transferase
MEPSPTVEIIIPHYRRRDMLERCLASLSRCPFYASPSLSILVICNGTADA
ALQKLIANYPTVKLLALAENRGYAGGCNAGLQQSNAEYLIFLNDDTEHEA
DWLEALLAIAQSNQNIAALQPKILSLEHAKQGKRRFEYAGAAGGMIDKLG
YPWCWGRTFFRVETDGGQYDKARNIFWASGVALCVRRSVALEVGGFDEDF
FMQMEEIDLCWRIQLAGYPIYSAPSSVVYHAGGASLAEGSAEKIYYNHRN
NITMLLKNRSLVALLWIMPIRMVMELGAALFYLTKADGLKKSGMVFRALR
DQLRAMPTTLRKRRTIQTNRTVSDRQLFHHQPFFHLLNHLIPQLYTFAQP
LKNQQHHQ
>Cag_0172 conserved hypothetical protein
MRIQRRQFEIIQEQAFRELPYECCGLLVGKQQKDHRGNIENIVYEVAPCQ
NCLYYGRESGFEIAHHEYLAVEAEAKQHGYQIVGSYHSHINSPAVPSLHD
VDFALKGHSLLIIAMIYGQPKEVTAWLRHHSGSGVNQEQIRVIE
>Cag_1075 carbon-nitrogen hydrolase family protein
MQNATLRIAQIDCTLANFQENLATHCTLIEAAIADGMDAIAFPELSLTGY
NLQDAAQDIAMHINDERFAPLCELSRHITIICGGVELSNEYGVYNAAFMF
EDGRGETIHRKIYLPTYGMFEELRYFSAGKQIRAITSRRLGRIGVAICED
FWHISVPYLLAHQGAQLLLVLMSSPMRLKPGSGEPAIVQQWRSIAATCSF
LFSGYVACVNRVGNEDSFTFWGNSSVTNPEGTIIGAAPLMQPHMLDVSLE
AAAIKRARLHSSHFLDEDVRLLSSGLREIM
>Cag_0451 Tryptophan synthase, beta chain-like
MNADVTKILLSEEDMPRQWYNIQADLPTPMPPPLAPDGTPITPEQLAPVF
PMNLIEQEVSTERWITIPQEIQAILKIWRPSPLYRAHRLEAALQTPAKIF
YKNEGVSPAGSHKPNTAVAQAWYNKEFGIKHLITETGAGQWGSALAMSCK
LVGIDCKVFMVRISFDQKPFRKMMMNTWGAECIASPSMQTNIGRKILEET
PDTPGSLAIAISEAIELAVQRDDTRYALGSVLNHVMLHQTIIGLEARTQL
EKVNLYPDVVIGCAGGGSNFAGISFPFIGDKIHGRDVQIIAVEPEACPTL
TRAPYSYDSGDVAKMTPLLPMHSLGHGFIPPAIHAGGLRYHGMAPLVSHV
KQLGLIDAVALPQTECYEAALLFAHTEGFIPAPETSHAIAQTIREAKQAK
EEGKEKVILMNWSGHGLMDLQGYDAFLSGRISDYPLPEEYLLRSLAAIKD
HPQPPQA
>Cag_1239 VCBS
MSNTVVFIDSRISDVNALISRFAVGTEYYVLDSERDGILQITEALAGKSG
YSSLQIFSHGTAGSLMLGSTVLNNAALSNYTAQLAAIGGALTASGDILLY
GCNVAAGDVGQQFIAALAEATGADVAASDDLTGSAALGGDWDLEYATGAI
ESGVDTNVVQEYDGVLEPPTTTITITLPTEAEIRNNSNIDQGYITESTLG
SNVTLYLQDILAVDTTNLLSTTAPTSFVISVSYGKISFLKTKVDTIPIVS
GNGSNTVTLTGTITQIRTLLTNNTISYVGNTGFSGIETLSAGISGVLANN
SGNASGSANSELSIVGINDAPTILDAKATLTVSEGSLLNDVFEGSFTLSD
PDIAHYIMQADITVLHGTLKLSGDDLDSVTGNETKSVTITGSRANIELAI
TALQYTPDENYNGSDTLTVTVNDLGTTNVNQGNPDEDKTDIHQVTISVVN
HAPVLENDYTPILSEISEDINSSITETEDGDNAGTLVKDIIPENDPTDSI
TLDAITDEDVTSALQSIYITAVDSTNGKWQFKLDGQTTWSTITLTGDTAL
LLSENNSLRFIPNNNWSGTATFTFGAWDGTGRDINNNNALYEAGDYVVIT
QRGVLNAPFSLDVDTATITVNPVNDAPTFTAFSAPITPAITTNQPSLALE
DGGNTVGEPSTTPITITFDDLATKGNEADANDAAYGGAVTAFVVKRVLSG
TLTIDNTDYTAGTEDLNLTISESQDASWTPELNKNGILNAFTVVAKDNDT
TDSKESTTPSVTVTINLTPVNDAPTVIPATQSATLTAGSDDVTSAYISMA
MSDVDTGDIVKVDGDWLADESAANDGEEHWTSSDGGKTYTFDGSYGTATL
YRVATTDGHSAGEVTYSFDYADTDLDSLAANATATDSFTIVVSDDAKVTA
TADAVFTINGANDDPIITSGTQSGSVIENSSETATGDVNATDIDYGTTLS
YSGDATGDYGSLDVTEIDGTWTYTLTSNALDDGESDIESFTITVSDGDGG
SATQDVTITVWGDNDPPTITESAQSPDYVTEDGGINNGTAGTSSAHIDVT
LSDADGGDTVSYVTTGWSGSGNTYTKTGTYGTATLHPNTHVIDYVLNNSD
TDTQALDTNDVVSDSFSITVTDGTDFTSETIAFTIHGTNDAPTIDVSDTG
ESFTESDDASMQTLSATGSITFDDVDASDSVSLTFAESTDISWSDSNGGG
DYDNELDSDYSSVKAALLNGFSTTATGWSYSTSGGSSLIGSEFGAIPSNY
FLPPPFSFTSLITGVLGDQNDVTEGQFKYDVYTLSNVVSGTLVFAAIESS
AFPEYANVYTSANLQLRQPITPRIISNTTDGNARILAWFYLPGDTLWVGS
DAPQQSGAYNLYLGGTIAESGDGVDLDFLDEDEQLTWSYTVSATDGTAST
TSTESVSFTITGENDAPTISISDDSATITENSGSDVSDLQTIDVSGDIEI
DDPDTHDDTVTVTSSLDSISWSGGDTTDSVFDTLTDGFTADEDGWSYSVT
DGVDLNFIGEDETVVLTYNVTASDGTESDTDTVTITIEGTNDTPTVDVVE
TSITFAEDDTLSDSGTVSFSDLDTNDVIDVTESYNNDIAWSYSGGTLTDE
MIGEDISTLTNGFTAWGEDLSSDETVWNYSATLGTDDLDFLDAGETLTFS
YTVTATDNNGASATDTITVTITGSNDTPHFNQDSVDNGSVTDTSASDTFS
NDIIGTLTADDVDHNDTITYGVVNGTGTYGTLTVDSSTGVYRYDGNDNTI
NALNAGTWSDTFTVTASDGTISESATVVITIHGANDNPTVSINDDSLSFT
EDVSASAQDLTDSGSITFDDVDTSDNVDIIATYKNNISWSGGTLANQMSG
EDSGKLTSGFHASATDSTSDSITWDYSATDVDLDFLAEDETITLGFTITA
RDSHNAFATDDVTITITGTNDDVDITGGTTSGSVTELADKYSENGVLEND
YLHSITGSIDISDPDVNDSHTATFTDNSESEQVTYLGSFDVADNGEDWTF
TVSDEDLDYLDDDDDPLIQTYTITISDGHGSSDTQDVTITIHGSDDNASV
AGRVYYWSNFDDIDDVATEMYAQDSMHTDGEDSGIEFRNIEKHDNGTYTL
DIYKTADTDEADSFLIKLQLAKGSVATWEQSHDLFDGEGNELPDLTEFGF
VTYASSLRTGECNIGGYSASLLSLPNDQEVKLGTLTITAPIFDAKLLSGS
YIGDTAIDAGDIFSEMKLNVTEGEDITYYNGDGLYDYLNQLNSGEDSFYY
YDSVDPDDYNFDAIKEVTTEDANEVTPADALLALKLSMGINTALPDLIAL
FEESDTIPLIPYMYMAADVNKDGEVTIQDALNILKMSVNYACAPEQEWIF
SPLPHNEQNILENMMNGFYVTYVNDLGEDVKVDWLDIKGISGDETGIILS
IGDNAIPIEEWNIGVDWEQASPELHDITVTDNDLQFVDLIGILKGDVNGS
WGDPINFNPPN
>Cag_0569 GTP-binding protein Era
MTPSHFASGFAMIIGQPNAGKSTLLNALLDFKLSIVTPKPQTTRKKITGI
YNSERCQIIFLDTPGILKPRQKLHESMLGVVRSTVTDADVLIALLPYTGT
AELFDRSFAAELFTEWILPSGKPVVAVLNKSDLASQEEQKAAEAFVWEQW
KPTNVLSVSALKRKNLTPLISALYPFLPLTEPLYPDEALSTAPERFFVSE
LIREKIFMLYGAEIPYSTEVVVDEFREQHDDDPSRKEFIRCSIVVERDTQ
KQIIIGKGGAALKKLGQLARKAIEEMLGRPVYLELFVKVRPDWRKKSGML
KSFGY
>Cag_0882 sulfide-quinone reductase, putative
MATVVVLGAGVAGHTCASFLKKKLGKRHEVIVVTPNAYYQWIPSNIWVGV
GQMTIDDVRFELRKVYNRWGIILKQAKAIEIHPEGNRDINRGFVTIEYTD
STRAGEIEYVEYDYLVNATGPKLNFEATPGLGPDKHTFSVCTYSHAAHAW
ENLQEVMKKMQAGQKQRILIGTGHAMATCQGAAFEYILNVAYEISKRGLS
KMAQITWISNEYEVGDFGMAGAFILRGGYVTPTKVFTESILAEYGIKWIR
RAGVHHVEPGKVFYETLEGEETSIEFDFAMLIPAFSGVGMLAFDKNDNEI
TDKLFAPNKFMKVDADYTPKSFDQWDAEDWPSVYQNPLYDNIFAPGIAFA
PPHPISKPMSSPNGRQIFPAPPRTGMPAGVMGKIVALNIAERINGAPDFR
HNASMSKMGAACIVSAGFGSFDGLGASMTLFPVVPDWNKYPDWGRDMNYS
LGEAGLAGHWLKFILHYLFFHKAKGYPFWWLIPE
>Cag_1251 Nitrogenase cofactor biosynthesis protein NifB
MKQDITKHPCFNDSARHTFGRIHLPVAPKCNIQCNYCSRKFDCMNENRPG
VTSKVLSPQQALYYLDQAMELSPNIAVVGIAGPGDPFANPDETMETLRLV
RAKYPEMLLCVATNGLDLLPYIDELARLQVSHVTITINAIDPEIGQEIYA
WVRYNKKMYRGKDAAKVLINNQLEALKRLKEVGVTAKVNSIIIPGINDAH
VITVASKVAELGADILNCLPYYNTKETVFENIDEPSPELVFEIQKATSEF
LPQMKHCARCRADAVGIIGEINSPEIMEKMAEVAAMAKNPFEQRPYIAVA
SMEGVLVNQHLGEADRLLVYGIDEQGDCVLVDSRQTPPAGGGNERWEALA
NLLSDCRTVLVSGIGNSPKKVLNNNGVEVLVMEGVIAEAVYALFNGHDMR
HLIKTELAHACGTNCSGTGAGCG
>Cag_1339 putative sugar transport protein
MARNVVNAEQTEEVSSTRRVIAASSVGTLIEWYDFYIFGSLAKIISEQFF
PKDNPTAALLATLATFAAGFVVRPFGALFFGRLGDLIGRKYTFLVTLVIM
GGSTFAIGLVPGYATIGFAAPAIVFVLRLLQGLALGGEYGGAATYVAEHS
PNGKRGFWTSFIQTTATFGLFLSLGVILIVRQTLGVETFQDWGWRVPFIL
SAFLVGVSIYIRMKMSESPMFAKMKKEGKTSANPLAESFKQKDNLKMVLL
ALLGATAGQGVVWYTGQFYALSFLQNACNIEFEQSNLIILIALVIGTPFF
VIFGALSDKIGRKYIMMAGMFIAVLAYRPIYTMMYNDANLKNKIEIVDQT
TVETKEEVKGTDNVITTVTKKTFEDGTTYKEIKKETIPLDAAKKAELAAA
DKLKPETKKEVVLPQHLYYKMIGLVLIQVIFVTMVYGPIAAFLVEIFPTR
IRYTSMSLPYHIGNGVFGGLVPLISTRLVEATRPAAGLPPADPLAGLWYP
IIIAGVSFVIGMLYISNNTNNMDVE
>Cag_0889 transporter, putative
MSSNNSYKLGPITLAPSVLPRHALTYLYAAFFSIGLVTFVSIGQTYILNE
HLKIPTSQQGAISGDLVFWTEVVTLLFFVPAGMLMDRIGRKPVYSAGFLL
VALSYALYPLSRSIEEMTIYRMIYALGIVALTSALSTVMIDYAAERSRGK
LIAITGFLNGIGIVVINSFFGGLPQKLMAQGFSGIEAGLYTHFGIAAIAV
VAAVVVGLGLKGGTEVRKEDRPPLRSLFTSGIKCAKNPRILLSYAAAFVA
RGDQSIIGTFVPLWGTTTGIALGMEPAEAVKQGMMMFIISQAAALLWAPV
IGPLIDRWNRVTALFVCMALASVGYLSLGFIGNPHDANAYIFFILLGIGQ
ISSFLGAQSLIGQEAPKAERGSVVGMFNISGAIGILIITTLGGRLFDSWS
PKAPFLVVGAINVLVMLAAIYVRIKAPGKNLHVAEEG
>Cag_2010 transporter, putative
MTQPPDTPPFMDTESSSPQGKSAITRSLLKAFPAFANPDFRRYFPGQVIS
MIGTWMQMVAQGWLVYELTGSAFDVGMAAAATTFPTLFLSLFGGLLVDRY
PRRTILFWTQSSAMLLAFILGIVTMTGTVTMGIILLLSFLLGCVNAINVP
ALQAFLSEIVRRDHLPSAIAMNSAIYNSSRVIGPALAGWLIAYSGAGIAF
IVNGFSFFAVLLSLFTMKTKRRAPTVIESNPLLAIREGVLYAWNHKLIRL
CIYYIAIVSVFSWAYVSMLPVIAKQRFGMDASGMGSLFGISGIGSVMGTI
MVSMLANKIQPLRFIAIGSLIFAVALLGFTLTENLPLAMVGLFFAGFGLV
AAVSTLSATIQGAVEDRFRGRVMSLYMMIFMGFMPLGNVTIGYLSDLFGT
GFAIQLNCIVTIIAALLLLVHSKQFLRIG
>Cag_0021 conserved hypothetical protein
MTSEELQQLLTPEAQAMLQAHQHDNPTTFALRYSNRHDLPIRALAEQLAC
RRKAERKLPTLSRHNLLYTTLSLEQASSERTARFKCTFMQGKRCIDLSGG
LGIDAIFLAAHFEELLYCERNELLCNVVRHNMVRCGIGNVRLQQGDSLSF
LASQPDNAFDWIMVDPARREEGKRSIGLEAASPNVVASQELLLAKAPHIC
IKASPALEISNLKMLLPALHTILVVSVSGECKEILLLLKRGAEAEHPITK
AICLQADNNAVVEIVGTHEQHRSLAESLQCYLYEPDAAIIKARLSGVVAK
QEGLEFLNKSVDYLTSNHVVASFAGKVFQVIESVPYKPKEFRKFLDRHAI
SAASIQRRDFPLSADELRKKFRLREDEKHFLIFTRNRNAEPICIYAERC
>Cag_1626 conserved hypothetical protein
MEQQDKFYKGLFWGAALGAAMGTVMGLLFAPYRGRETQQRISGKVKSMLD
KATDLYESSEHDGMYNNDAKTRAQGVIDTARDEAKKILSEADSLMRDIKG
HPAKTRES
>Cag_1354 nucleic acid-binding protein contains PIN domain-like
MMKKFAVVPDTNIFLASEKSVHSTSPNKEFVARWKREEFEVLYSEDTLLE
YITKLRQKGISETSIKKLLATLFALGREIKIEFYHLLHYPLDPDDIAFLL
CAENGKATHIITYDRHLKAIESSYTFRVCKPVEFLLELRHQYGIQPKA
>Cag_1645 regulatory protein RecX
MKNTPPENTLAAATGFALKLLGIRNHSHEELERKLRKKGFQAELCATVLE
QLVARGLLNDRTFGEEMLQSRSQRKPSGKLKMRAELLNRGIASDIADELL
SDYKSHELCHQAAAKKIASLRIADEAIKKRKVETFLRNRGFGWQEISTTL
HHFFPTTSTMDDDIE
>Cag_0760 DEAD/DEAH box helicase-like
MPDIVKIEYHQTGESVKNTANGMRPMAARAYEERNSQYLLIKAPPASGKS
RALMFIALDKIRNQGLRKVVVAVPERSIAGSFAKANLKKENNFFANWEPN
DEYNLCTPGMEGSKSKVSAFKNFIDNDEEILICTHATLRFAFEELDESKF
NNMLLAIDEFHHVSADGDSRLGELLRSIMQKSNAHIVAMTGSYFRGDSVP
VLLPEDEVKFTKVTYNYYEQLNGYNFLKSLGIGYHFYQGKYTSAILEILD
TNKKTILHIPNVNSGESTKDKHNEVDTILDAIGDVQKVDSETGIIFLERH
HDKKIIKVADLVNDNPKDREKVITYLRNIKSVDDMDLIIALGMAKEGFDW
PYCEHALTVGYRGSLTEVIQIIGRCTRDSANKSHAQFTNLIAQPDAADDL
VKLSVNNMLKAITASLLMEQVLAPNFKFRTKLSDDDKADAGEIKIRGFKS
PSSKRVKDIIESDLNDLKATILQDDTVMKAIPGNLDPEVINKILIPKIIE
IKYPDLSADEREEVRQHVLVDSLVKGGEIKEVGDKRFIRMADQFINIDDL
HIDLIDRINPFQKAFEILSKEVTTKVLKVIQDVIESTRIQMTMDEALLLY
PEKVKAFMDKNGREPSVTSLDPLEKRMAEAIIYLKDLKRKKQSGQQ
>Cag_1275 conserved hypothetical protein
MLDVQVSHNSVGTLAHHTSEQGSYTFAYHKAIDIGQEVSLTMPWSLASYH
YRKGLHPIFQMNLPEGRLRYTLERAFRKQAQGFDDLMLLDIIGHSQIGRL
HCTSNPQLPKSVPLQSINELLAYNGTEDLLRDLLERFSATSGISGIQPKV
LICDPNQAALGAKFPTHHSPQLTNAQARITVKGATHIVKGWDENEYPHLA
LNEWFCMKAAKQAGLEVPRIFLSENYQLLILERFDLLEDGTYLGFEDFCA
LHGLSTFEKYDGSYERVAKRITQFVSQEHRQKAFEEYFKIVALSCAVRNG
DGHLKNFGVLYSNTTSDVWLSPAYDIVSTTPYIPRDSLALMLDGSKRFPS
RKKLLNFARQHCNLQHEQATEMMEKIGDAVNETMAEIKVQIKEYSPFASI
GNRMLSTWNEGIIDLNGKSTISFST
>Cag_1600 ATPase
MRIEHLIVKNFKGFVSKEFTFHPNFNLIVGMNGTGKTSMLDALAVAIGSW
FLGFYVDSLKMRQIRHDDVLLKYIQHSWEHIYPCEVEAYGVVMDRHIKWS
RELNTINGRTTYGNALAIKELALQATRSMLNGDDIILPLISYYGTGRLWQ
EPREAFKVSDPRKVANKETQSRRTGYFNSIEPRLSVNQLTQWIAQQSWIA
YQEQGQVFPVFNTVQDAIIGCIEDAKKLYFDAKLGEVIVEFSSGTQPFSN
LSDGQRCMLAMVGDIAHKAAKLNPHLGSDVLKETNGVVLIDELDLHLHPR
WQRRVIEDLRNVFPKIQFICTTHSPFLIQSLRSGEELVMLDGQPFATLGN
LSLEEIAHGIQQVKNPEVSLRYESMKATAKSFLTMLDEASLAPKEKLKQL
ADKLRPYADNPAFQAFLEMERIAKLGE
>Cag_1488 conserved hypothetical protein
MIVSFGSKECERIWDGFQVKSLPCEIQDIARRKLRMINNALTLVDLRIPP
ANRLEKLSGDLKDFYSIRINKQWRIIFRWHNGEASMVEIIDYH
>Cag_1384 sodium:solute symporter family protein
MPTLTLLDYSFIGGYMLLTLFIGLWFSKRASENVGEFFLSGRQLPWWIAG
TGMVATTFAADTPLAVAGFVAKHGIAGNWVWWTFVSGGMLTVFFFARLWR
RAEILTDLEFIELRYSGAPARFLRGFKAIYFGLFINAVIIGWVNLAMFKI
IRIMVPELPPEITIVALVLFTTFYSGLSGLWGVSITDAVQFVIAMVGCII
LAVLAVQSPAVVSAGGLTGALPAWMFDFFPNFSHSAEESNSVTGAMSLPL
LSFVAMAFVQWWASWYPGAEPGGGGYIAQRMMSAKDEKHSLLATLWFTVA
HYCLRPWPWILVGLASLVMFPNLPANQKEDGFVYVMQAVLPPGLKGLLIA
AFLAAYMSTLSTHLNWGTSYLINDFYQRFVKRDGTPQHYVLASKITTFLT
AAFALYITFFVLETITGAWEFIIQCGAGTGFVLIMRWFWWRLNAWSEITA
MVAPFIAFTLLQQFTTITFPISLFIIVGVTITATLVVTFATKPTEPAQLE
TFYRTTRVGGRLWKKVSDTLPDVQSDSGFGMLLVDWALGVVMVYTILFGT
GRVIFGEIGTGILFLAIGAIAGTLIFVDLNRRGWNNLQ
>Cag_0200 competence protein
MLTLMSPLVTPRATQLVQRKVLPPLHALRQLLPSGSIPPLRNVRHLLFPN
VCLVCEQLLQPHEEHVCGACYASFDAFASPELAEYYVRRTITDHFCFPTF
FERAWSRYKFHKESDLQELLHSLKYQGIFTLGVTLGKQLGEWLHSADLPD
DIECIVPIPLHPLKKIERSYNQAEKIAEGISQLLNRPVRSSLLTRQRYMV
SQTGLSATERQQNAEGAFCAKAPLRIGHVLLVDDVLTTGATMVAAAQALH
DAGVAKVSIVTVAVAAKEM
>Cag_1502 hypothetical protein
MDNDDEILDSAPISKSDEELEKGFVAAIVEQPKLMDELAKQLVALSLAIP
GIYATALKLLAGDDAVASSLPCIIGAFVFWALSLVFAFVSLTPREWHVER
TLLRRNGASKNGAPLSIEEFFKVSARYKRALLIAATACCFVGICLACVAV
FTVTPPNLSVPQTVQP
>Cag_0334 ATPase
MFEARNLSLSIGTKQLLNDTSFRIGDTDRVALVGLNGTGKSTLMRLISNT
SPDSSTLRVGGDFIKSADTTIGYLPQEISFEDDLEKSALHYALQANKELF
DLSETITRFEHELALPEHDYESEAYHRLIERFSDAMHNFERLGGYTMQSD
AEKVLAGLGFSEIDFHKKVKAFSGGWQMRLHIAKLLLQNPTLLLLDEPTN
HLDIDSLRWLENYLTNYEHSYIIISHDRFFLDKLTTRTLEIAFERINEYK
GNYSTYEKEKVERYELLMSKYQNDLKKMAELNSFVERFRYKATKARQAQS
RLKQMEKLEKNLVAPEEDLSQISFRFPKAQPSGREVMRLDGVKKSYTLPD
GSRKEVLKRIDLEIMRGDRIAIVGSNGAGKSTFCKILANELDYEGKLTTG
HNVSLNYFAQHQTDTLATEKSIYIEMMDSAPNSEAQKKVRDILGCFLFSG
DTVNKKIKVLSGGEKSRVALAKILLQASNLLIMDEPTNHLDMRSKEMLIE
SLENYDGTLLLVSHDRYFLDSLVNKVVEIKNGTLQLYLGTYAEYLEKSEK
TRQAEEQAEALQRQKEQAAAKAAIKAEEQRAAAATPAPAKAKNSKKLEAI
EKKINQLEQQKEEMERIMATEDFYKKSKEENARTLEHYHKLCDELNALFA
EWETLG
>Cag_0957 conserved hypothetical protein
MSTSYSVLWTKVAERDIKEIITFIANDNPSNALHVLEKIKDKAAALSMAP
ERGRIVPELHSKGIFIYRELIISPWRLLYRIANHEVYIMAVLDSHRNVED
ILFHRLIQS
>Cag_0834 hypothetical protein
MLPQSKVIIHPSFKKKIPMKSLLLAFAICVTITALCFVTLEAVGMPQDIS
KTISVMVLGAFPKLREMLEKMEGERSGGAVAVAKVQSFGDFNVSTSRALL
YVTIVGFVALEFASGITGVVLALLGAQLSNIGVALQLLTMIIAYPIIFLA
GRWIGRRCSQQPYMVAALAGLTIRLSTTIFDIAMVPMEQLIQIYQGQMQL
SMIIVSQVGGSVLFALLLMAGAFVGSRQRLVVYVQYLLSRISPQSRLALV
DLAHEEAVKMQKESGGK
>Cag_1077 conserved hypothetical protein
MKALDTNILVRFLVRDNQEQAERVYRLFKAAEADKTLFFISIPVLLELIW
VLDSVYGIARYDILNAIEELLLLPILSFDGQPAVRSFIAEARNNNLDLSD
LLIACDAALSGCEQMLTFDKKAAKSELFILLEI
>Cag_1752 conserved hypothetical protein
MNIEVRYHKLAENELHDAAKYYESRCSGLGRAFLTEITQAINQISAFPES
APMILDIVRQKVIHRFPYSIMYVNDDDGVMILAIANHHRRPFYWGNRISN
FHE
>Cag_0513 putative DNA-binding protein
MCMDGTKNEIVLYQSNELTSHIEVKVEDDTVWLNRQQIATLFGRDVKTIG
KHINNVFLENELNKSSTVANFATVQNEGGRVVERQVEYYNLDVIISVGYR
VKSKQGTQFRIWANQVLKDYLLKGYVLNQRMNCIENSVENLACKVKEIEL
QITSNAIPNQGVFFDGQVFDAYELASRIIRSAKQSIVLIDNYIDESTLTH
LTKKEKGVRVLLLTKNITKQLALDVQKANEQYGNFELKSFAKSHDRFLII
DTNEVYHIGASLKDLGKKWFAFSQMDKSSVSTILTSIDTML
>Cag_0125 hydroxyacylglutathione hydrolase, putative
MSASQLVVKQIRTGGDRNFAYIAACTFTQEAMVVDASYNPAMVATVAANE
GFTIRYIFSTHSHVDHTNGNAELSQLCGVPALLYGDMVPDLQRSVLDGTV
LPLGKLNIQILHTPGHTPDSISLYCDNALFTGDTLFVGKVGGTYSDEDAR
TEYESLWQKLMVLPDATMVYPGHDYGVAPTSTLAHERQTNPFLQQKSFND
FLSLKKNWAAYKKAHGIV
>Cag_0295 transcriptional regulator-like
MNPLLHHHFFNQNSNAVIFPCEKAVWYYLPQIRADLAIELVATGMTQSSA
AKKLGVTPAAISQYIHKKRGMQPNKSAEYLAQIKQAVAVICKGTAPADLQ
RLVCSCCHLLQKASDEHAEACGGGQD
>Cag_0756 conserved hypothetical protein
MHTLNLKPKEHYRLQKGHLWVFSNELAQIPRDIASGETIKLLSHDGKFLG
IGFFNPHSLIAVRLLTRRDEAIDHAFFKRKFAEAIALRTKLYSKEVTNAM
RLVHGEADGLPGLVVDRFNHAIVVQTFSAGMEIHLPLICNVLQELLEPRV
IIVRNESPLRELEGLTLYKEVVRGDAAEAIQQIYDYGVNYRVDLLEGQKT
GFFLDQRENRRMVRAFAAGASVLDVFTNDGGFALNALAAGASSAMLVDAS
KEALVRADYNGQLNKFSNYSLVAADAFDTLETMVEAKESFGLVVLDPPGF
TKSRKNLPGALKAYKRLNKLGLQLVQSGGFLATASCSHHVSEEDFLGVIQ
QAALAAGCNIRLIHKNTQPFDHPVLLAMPETSYLKFACFYVTR
>Cag_0099 sulfide dehydrogenase, flavoprotein subunit, putative
MSKKIVVLGAGTGGTIISNNLRRHLPHDWEITVIDRDDHHIYQPGLLFVP
FGLQKVSTLVRSRKKYILSGINFVIDEITRIEPDKRVVTTKKHSFPYDFL
VISTGCRVVPEDNDGLMEAWGKNAFSFYTIASAELLHRRLQEFQGGKLVL
NIAEVPFKCPVAPIEFVFLMDWMCRKKGIRNKTEIELVTPLTGAFTKPKA
SAVFNESAKAKNIKITPGFSLNEVHGKEGYIQSVQGDKVNFDTLVIIPST
QGDEVISSSGLDDGIGFVPTHHHTLQALKHERIYVVGDATNVPTSKAGSV
AHYEADVVAFNIMAEIHGIKPEEIYDGHSTCFIVYSKGTSSLIDFNYKIE
PLPGQFPMPKFGPFSLLKETKMNWYGKLGFEWLYWNVLLAGHNLGAPPTL
VMAGKELG
>Cag_1987 Protein of unknown function UPF0079
MREEFFSTSESETLLLAERFAAALPPRSVVALLGTLGAGKTLFMRGICRA
FHCEAQLSSPTFSLMNIYEGELNGQAVSVHHFDLYRLESERELEAIGFDD
YLTSADLSVVEWADLFPHYKGRYTATVLLEYAGERERRIIIERGN
>Cag_0156 ABC transporter, permease protein
MESELLLLLLQIARLSVPYVLTSVGATFSERGGVVNLALEGLMLAGAFGA
AYGEYLSGSPLAGVAMALLFGTAVALLFAFVTVTLKANQIVAGIAINLLV
MGATRFGLTLLFGSAMNSPRLEGFAAPFLLLDPLFLTALLSVGVGQWVLF
QTPYGLRLRSTGESAATADSAGVSVSSMRYSGVVVSGALAALAGAFLLFQ
QHTFTDGMTAGRGYMALAAMIIGKWTPIGAALASILFAAAESMEMWFQSG
VIPSQIIQTLPYVVTLVVLAGFVGKAQAPREVGVPFENGRGE
>Cag_0792 basic membrane protein A
MAAQRVHRFSPFTILLLLLTQLLVVGCSKQEQTASLPSSASAPMRIGLVF
DVGGRGDKSFNDSAYNGLELAKQQHGVDFVYVEPQGEGADREAALREMAA
NPDINLVVGVGLLFSEDITRIAADFPDKKFICIDYIHQPNVTIPANLQGI
AFEELKGSYLAGALAGLTTKSNTVGFIGGMESGIIKKFETGFIKGVKAVN
PNAQVISGYIGMTGAAFANPAKGKELALGQYGRGADIIYQAAGASGLGVI
EAARETKKLVICTDRDQEPDAPGFVLSSMVKAVDRALLKSVESVLDGTFK
GGEVKVYGLADRYTDYVYNEKNAPLIGEATHKKVEELRNNIISGKIELSE
ALHQ
>Cag_0257 ATPase
MRLKSMRLENFRAVEHAVIEFGNRLTLLIGANGSGKTTILDGIAIALGAA
LTYLPTLSGRSFKKGDLHQRHNSIAPYTRIALETTTGLKWDRIQRRDKSK
STSKLVPAADGIRALEQFLDATILEPMNQGSDYLLPLFIYYGVSRALLDV
PASRKGFTKKQHRFDALVHCLHADSRFRSAFMWFYNKELEENRLQKEKKS
FEVTLRELDVVRSAITAMFPDISEPHIALNPLRFVVRQQGELMDIAQLSD
GYKTLLGVVIDLSSRLAMANPHLDDPLAAEAIVMIDEVDLHLHPSWQQHV
VGDLLRTFKNTQFIITTHSPFIVEAINNHIKRQQIEGLPINNNEVNQLLP
LRSSDVKAYLMSDTIEMLMNNDVALLDDKLLEYFNSQNQLYDKMRDLEWE
HKG
>Cag_1504 TPR repeat
MVEPLVLTPSVDIVTQHPALIRQTAELSRLYKDKEVLVTDEVLQAIGSIL
WRLLDADEALANAKQRAGQHIVPLLLSSNDAAIQQLPWETIYHPDYGFLA
RHEGFTFSRTIPSVQKALPDAAKGPLRILLFSSLPDDLTEKEQLQIEVEQ
AAVLEALGEWRQSGHVVLEMPDDGRFSEFTQVLKSFKPHVVYLSGHGMFQ
HDALNHTTTGYFFFEDEVSGKSKAFSEAEIAAALTATAVQAVIVSACESG
KAASDSLVNGLTYRLLQQGIPHVIGMRESILDRAGIQFAQAFFSALMERH
GIAEALQQARNAIVLPLQEDEEFKDTVEASISLGQWCLPMLFSHQYNRAL
LDWEFTPQPMRAENRRNKSFKQIKSLPNRFIGRRRELRKWQRKFRSGKQN
ALLLTGAGGIGKTALSYKLIMGLKHDGYEVFCLSFRPEDNWRKYLTSEIP
FSLDEHRKNEFKDRIADNSDIVFQAECLFTLLLEQFNGKVAILFDNLESV
QDSVTRHLIDAELQQLIDMALALESDGLRMLLTSRYALPHWDNSLVYPLG
NPVYRDFLAVAQQQKLPKEFFKDDKGKRTYKRLRQAYEVLGGNFRALEFF
AAALQTMNAAEEQDFLNGLKSATEQIQTNMLLEKVWSYRNQEEQELLCAL
TAYQNAVALDGIKALNLPTMQQSEEFVRALVAVSLVEQYENKVWDVKEEF
LVAPLVRDWLQKQGVATLPIELLQRAARYQQWLLENERRTFEQATITHAA
LMAAGLNDEAHRVTLDWIVTPMNMAGLYQALLDSWLLPACYAVDKQILSE
TLGQTGKQYHHLAQYSTALDYLKRSLAIVEEIGDKSGEGTTLNNISQIYD
ARGDYDTALDYLKRSLAIVEEIGDKARVGAALNNISQIFKARGDYDTALD
YLKRSLNIRQEIGDKSGEGVTLNNISQIFKAWSDYDTALDYLKRSFAIRQ
EIGDKKGEGTTLDNIGKIYLAKGDYDTALDYLKRSFTITHEIGDKKGEGT
TLNNISQIFQARGDYDIALDYLKCSLVIQQEIGDKSGEGTTLNNISQIYD
ARGDYDTALDYLKRSLAIQQEIGDKSGEGTTLNNISQIYDARGDYDTALD
YLKRSLAIQQEIGDKSGEGTTLNNISQIYDARGDYDTALDYLKRSLAIRQ
EIGDKSGEGTTLNNISALYHARGDYDTALDYLKRSLAIAQEIGDKSGEGT
TLNNISALYHARGDYDTALDYLKRSLAIRQEIGDVAGLCATLINMGHIYL
QNNEIQDAVSAWVTAYTLARKIGYAQALDALENLAQQLGLPNGLAGWEML
ARQMGEVNSFNRE
>Cag_0236 putative plasmid maintenance system antidote protein, XRE family
MNNTYTSQEDIAIARELLSCPGDTLAEHLDYIGITKMELAKQLQCSEQTI
NEIIKGTAPITTAIALQLEQCIGIPANFWIERDRQYWLQLAEINEAENRL
ALSNKALQLNIPLIMKKRTSCSCN
>Cag_1186 hypothetical protein
MQSQKPYNKQLWWRLILIIGLLLAVLFFIMTPWNIEHLPSRPNPAKSYNE
ALARTEALRLAQAQPMNPRCELQLMTHKHKTEHAIVLVHGYTSCPRQFQA
LGKQFYNAGYNVLIAPLPHHGYANRLSREHGKLTAEELATYADRTIDIAH
GLGNKVVMMGLSAGGVTTAWAAQNRPDIERAIIISPAFGFKQIPLPFTAA
AMNLFSALPDEFEWWNPTLREHETPTYAYPQYSRHALTQILRLGFIAKFD
ALHHAPATKKIALVVNHNDTSVSNEAALQFIAVWQKKQNQTIAIIEFADS
LKLPHDLIDPEKVNQRTNVVYPRLLQITGGER
>Cag_0924 oxidoreductase, short-chain dehydrogenase/reductase family
MHDKVFLITGASTGIGEATARRAVEAGFRVILVARSTDRLANLVAELGAT
HAHAIPCNVAEWQEQEQMVVQALERFRRIDVVFANAGFSKGSPFFGGENK
LEEWKEMVLVNMFGAAATARLTLPELVKNKGHFLVTGSVAGRSTSIRNLY
SATKWGVSGMAYAIRNEMAEHGVRVTLVQPGVVDTPFWDNLQKLGTPELQ
ADDVARAVLYAVSQPPHVDVNEVVIRPVGQPH
>Cag_0024 conserved hypothetical protein
MTNIAPLRFGVITDIHYTLDGSIATEQLAAAIRTCFASWQKRGITQALHL
GDCIRGDEQFKYEELRQVLALLQEFQGEMFHVAGNHCLLMPRQELLAALG
LQSTFYSFAMQGFRFIVLDGLDVSLFHPQADAEDAALLAHYLQHPQLHDY
CGAIGKMQQAWLQAELASAERARETVIILSHLPLLPEVSAEPYGLLWNHQ
EIAALLSASSTVKACLSGHYHHGAYAVRNGIHFMTLPAFSHQAQNPLALG
MVLELEPSMLRMYNQYNEVVFCCTLR
>Cag_0948 hypothetical protein
MKYFIEKASQHDYADILDIMQYWNMHHIPSVEMEELDLSCFFVARISNII
GGAGGYKVLSQKTGKTTLLGIRPEFLGMGIGKSLQEAMLVAMFNAGVKHV
ITNTDRTETILWYKKHYGYYEIGQLKKQCDFSLSDVDSWTTLEMNLEEFI
QKKLQR
>Cag_0022 patatin family protein
MKKILSIDGGGIRGLIPALVLAEIEAQSGKAIGATFDLIAGTSTGGLLAL
GFAKNDGNGKAQYSANNLADIYLSRGNEIFSKSFLKSVASVEGLRDELYS
ANGIEHVLDDYFGDDPLSSCITKSLVTCYDIQNREPLFLKSWREEYQSVL
MKHAARATSAAPTYFEPALIPIGGATKALVDGAVYINTPSVSAYAEALKL
FEDEQDFFVLSLGTGELIRPISYDKSKNWGKAEWVVPLLSCMFDGMADAA
NYQMKMLLDDKYVRLQTNLSVASDDLDNVTANNLENLILESQKLIRTHRQ
VIDMVCSLL
>Cag_0536 Hydrogenase expression/synthesis, HypA
MLISGIRIAQCLLLVNNCKYSPANTMHEMSIALSIVEAVEEQARKEGAQK
IIALELVVGKLAAIQVESLTFCFAAAAKGTLAEHAALIIEEPEGIGKCEE
CGKEFPVNFYYAECPQCRSLRINIVSGEEFRIKAMEIC
>Cag_1997 TPR repeat
MWGTFLFCAVMNFSLSHLARTIALLLLTASPAFAESADELFNRGFALHMQ
GKLQEAVSCYSDAIDEVPTFAMAFQMRALAYQQLKKFPKAVNDYSSAIEQ
GDASFKVVGYYNRGVVKNIMGDFVGAVDDFSQAIVLNKKMATAFFHRGIA
RHQLGDNDGRFEDFRQAALLGDRTAEQWLNTYHPNWKPVPPSPPSIPPSI
PSSLPATAPPIQPSSPPTAPTDAPKPASVQPAPSEPNDSTRTSATTPA
>Cag_0068 putative lipoprotein
MMLCSFLFKNLSILAMTFSRFIASVCLIALPVSALSLSGCSSSRQPTTAS
EQVSDGYARAEALIKKGDYRSAVLVLEPILFTSRATALEDDVLFRLGQAY
YHTEQYLLAADMFTKVQQLPASPYAATAQFMVASSYEKMSPPFELDQAYT
QKAIEEFALYRELYPLTDSVRSAEQAAFWKEMLKVDAANETYKKNYAQAM
VGMSRSDSVRYAGKAITTLREKLAHNAYSVALHYQQLGKLKAATIFLDEV
IARYPDTSYYKLAMREKVDLLVKREKWYDAALALAQYQQLAPENGGALQS
LQEQIARNTKK
>Cag_1817 GTP-binding
MNITSASFVASYTSLHALPEAVLPEIVFAGRSNVGKSSLLNSLTGLKGLA
KTSAKPGKTRQINYFLINELFYFVDLPGYGYAAVSQSEKAAWGQLLANYI
ERRDAISLVVVLVDSRHPFMENDVAMLEFLEFHGRPYGIVMTKSDKLNQS
EKSKCQRVAKTYAAKAKFVVNYSSFSGAGKALLLSHIDHSIISQ
>Cag_0213 WD-40 repeat
MGFLSNIFGKKEVELKRPQVKEDENLIKTMEGHLDRVLCVKYSSDGKKLV
SGSFDETAMLWDVASGKPLHTMKGHSTWVECVDYSRDSKLLASGSTDSTV
RIWDAATGQCLHLCKGHDTAVRMVAFSPDSKVLASCSRDTTIRLWDVANG
KQLAVLNGHTSYIECVAYSRDGKRLASCGEETVIRIWDVASGKNIANYDT
GDRLSHAVQFSPDDKLIAFGGRDAMVKILDAESGNMVKVMKGHGDAVRSV
CFTPDGRKVVSAANDETVRVWDVQSGNELHMYRGHVLEVQSVDVSPDGTV
IASGSDDRKIKLWRLL
>Cag_1551 RNA-binding region RNP-1 (RNA recognition motif)
MNIYIGNLAYSVTENDLRDAFGQFGQVESASIITDKFSGRSKGFGFVDMP
NDSEAREAIGAMNEKELNGRPIKVNEAKPREERPARRDRY
>Cag_0259 3-oxoacyl-(acyl-carrier-protein) reductase, putative
MQKNISQKTCFMTGATGVLGSAIAEAIAKQGYSLFFTWNGSEAKALLLLE
RLQAISPHSAMVRCDVAQPSAIAEAFIEFRERYQRLDLLVASASNFFRTT
LPEVTEAEWDALVNTNLKGTFFTMQEAARMMQQQSFVSRIITMTDISANL
AWRGFAPYTASKAAIQHITRLFAKTFAPTILVNSIAPGTITLNPEHATEA
ALDAVTNVPLRRTGEPADIVRTVLFLLEQEYMTGQILAVDGGRLLA
>Cag_0071 Beta-phosphoglucomutase hydrolase
MFIAMQRSAFIFDMDGVLTDNMRLHANSWIELFRDFGMEGMDADRYLKET
AGMKGVDVLRYFLGQSISAEEAERLTEFKDFLYRVTSRNKITPLTGLQPF
LEQAQQQAIPMGIGTGASPKNIDYVLELLELEQTFQALVDPSQVSNGKPH
PDIFLRVASLLGAEPQHCIVFEDALPGIEAARRAGMQCVAITTTNNADEF
RHFDNVLAIVNHFQELTPQGLLMLLTEKQNTLVA
>Cag_0585 conserved hypothetical protein
MNVVYSAEAVDDLVRLREFIAVHNPQAAHRISNELVSRIEQLCAFPEMGK
QIPQFPTPSIRDFIFGNYIVRYAIHSDAITILKIWHHYENRIK
>Cag_0301 TPR repeat
MNMLQPPVVVLMTDFGITDTFIGQMKGVILSLCPIAQLIDLTHAVLPQNV
VQGAFLLGKSLPFLPDGSVVVAVVDPGVGSTRRIIAVQTSRHTFLAPDNG
LLTPMLASGDVQQCVSVTNERYMLPQRSSTFHGRDIFSPVAAHLAAGVPL
AELGKSMPMAECVQLEVLRANVLDNGNCIESTILYTDHFGNAVTTIEREL
LAEKHDWLIHVNELRLPLSTTYSDVAEHQPIAYIGSSGTLEIAIRNGNAA
AALGLHAGVAVRMERGEWEVESGMWEEVVQAIPDLLKQGVTLHQSGKHNE
AEACYQQILKQQPHHIDALHLLGVLFYHKKEYSKALDLLNQAIALKPTFT
EAYSNRGAVLKELKRFDEALASYNKALELKENYAAAWYNRANLLKEWKQF
SEAIESYNKAIEFQPNYPEAYSNRGVVLKELKQFDAAFASYNQAIALKPT
YVEAYSNKGTVLKELKQLDAAIESFNKAIALKPDYAEVQWNKSLVLLLSG
NFIDGWMLYEWRWKKADFTSPKRNFTQPLWLGKESLEHKTILLHSEQGLG
DTLQFCRYATLVAKRGARVILEVPDILIPLLKQLEGVEQIIAKGKKIPPF
DYHTPLLSLPLAFTTRLENIPSPSKYLFIDNNKIEEWKQRLHTIPHPRIG
LVWSGRAEHKNDHNRSIALADLLRYLPNKYHYVSLQKEVRDSDKKTLDVT
SNMVHFGNELHDFADTAALCELMDLVISVDTSVAHLSASLGKPTWILLPF
IPDWRWLLDRNDTPWYASATLYRQHTRDDWESVLKNIATDLYDYFTTDNK
VAVSHKATKAIQALLKEAIKLHQSGKQNEAAICYKNIIQLQPNHVDALHL
LGVVAFQKEQYNEALNLLNQAIALNTDFASAYFNRGLVFKNLYHFDKALE
DFDRALRLKPNYAEAYHKRGNILKELGLITAALSSYNNALALKADYAGVY
LDKAIILLLLGNFADGWDIYEWRWKCKDLPLVQRNFTQPLWLGQKDIQSK
TILLHSEQGLGDTIQFCRYTQLVAERGALVILEVPASLASLMQSLEGVTE
IVVKGKKLPPFDCHCPLLSLPLACNTTLENIPSPSKYLSSNTKKRNKWKD
RLQAIPQPRIGLVWSGSTQHKNDRNRSIELSELLQYLPDAYHYISLQKEL
RESDKATVEATSNIVHFGDALHDFADTAALCELVDIVISVDTSVAHLSAA
LGKPTWILLPYIPDWRWLLDRNDTPWYASATLYRQAHRDDWLSVFKRLQE
DLQQRCMVESAQQQLPIHTSQNNHIATLLQQGVQLHKSGKQNEAELCYQK
ILQLQPNHADALHLLGVLSFQKENYSQSLELLNQAIAIKSDFASAYFNRG
LVLKNLSQFEKAIEDFNKAIEQKPEYASAYHSRGTVQKELKQFDAALKSY
EKAIALKPDYTEAYCNRGNALQLLKRFNEAIDSYNKAIALKPQYAEAYSN
RGVVFRELKELDTSLDNFNKAIELKADYAEAYSNRGVVFRELKMLDNALA
DFNKAIELKKDYAIAYWNKSLVLLLLGNFAEGWQCYEWRWKKADFTSPKR
NFTQPLWLGEESIADKTILLHSEQGLGDTIQFCRYAPLVAELGARVVLEI
PSSLALLLQPLDGVAEVIVKGKTLPPFDYHCPLLSLPLAFKTTLETIPFP
TKYLSLPSHKIKQWQQRIGNIAKPRIGLVWSGSTKHKNDHNRSIELSKLL
EYLPDHYHYISLQKELRESDKATLEATANMVHFGDELHDFTDTAALCELV
ELVISVDTSVAHLAAALGKPTWILLPFIPDWRWLLDRNDTPWYASATLYR
QHTRDDWTAALERLHEDLRQRFLVSEM
>Cag_1194 putative plasmid maintenance system antidote protein, XRE family
MKNNYKSKEDIAVAREIISCPGDTLAEHLEYMGMSQAELAERMGRPKKTI
NEIIQGKAQITPETALQLERVVGISATFWMNLEHNYRLLLAELDEAEKRI
VDAEWAKQFPLQEMIDKGWITVDNGCDNAINTILSFFKVATPQAYQNYCH
NQLYATAYRMSETCSKDPHAVAAWLRQGERQAEYLKAVLFDRKKFEEMLL
TIKKLIVQDDNFFEALQDCCLQAGVKVVHTPCLKKAPLNGSTRWINDSPL
IQLSNRFNRNDIFWFTFFHEAAHIIKHNKKDVFIEGMDYSFDGKKKEDEA
NMYAEEYLISRKEENELLASTSFQKDDIQHFAEKFSTHPAVVIGRLVNKG
KVKAELGHLYGFYKKVELH
>Cag_1022 hypothetical protein
MDNTFLSNGGIVTTDFGGSDSGSSIALQTDGKIIVAGESSGGGDGGFTVV
RYNADGSLDITFDGDGKVTTDFGGLEYATSVALQGDGKIVVAGYKGISSS
GGGDFALVRYNADGSLDITFDGDGKVTTDFGGWDEAESVTIDSNGKIVVV
GYTGISSSGGGDFAVVRYNVDGSLDATFGAGGKVITNVGGEEYAHSVIVQ
SDNKIVVIGDTGISSSGASDFALVRYNNDGSLDTTFGVSGKVTTNLDFGD
FVGGATMQSDGKIVVVGESYSLADMAVSGDQDFVLVRYNTDGSLDTSFAD
DGTLVADFGGGESATSVAVQADGKIVVTGDSFPAGGSGDSNVIVVRYHTD
GTLDTTFSENGFVKTVVGESEGNSVVVQSDGRILVAGQSNGDFSLVRYNS
NGSMGLGIDFDGTPIGTPSSYESFADLYFDETQATIAGYIGSALVTLAPP
GVVHCFDEDHDGFADHFTRTWVDGNGTQSISGTTVWLDNNIFKSSGSALI
GGIPYTVNQYGRAAYDADGDVVGMYFLTVNPVFTLTADTTVGNELVATFT
IPNQAETWSLLDSDLNGVVDHVNRWNSWIDQNSVTQIRNFTYLLTWSDIT
HFTARQVGVITGTSFDALGRPLGITFSNSTPPAHILPITWLDTKGDDNVV
ATFDIPSTIFGQLLDTNDDNLPDQAVFIETSSSGQKDTATATIQGWSSWS
DISTQQVTMEIQTSTAPWNFFTGTINGTSSNPTTVIMPSYFMGNNVVVPE
TTFPSMTNNSLTFDLGTTGASLASSGSITLWSSATGTITIPVTSLTFDGS
HLTIPLSGTDVATNQPYHLYPNSVDNFRVQIPAGVVIGEPTIDKAWFVGE
WNNIGYELSPMMIVYNGDGTTDADWVLGTSGNDSVAAGAGDDLMKWSAGN
DTIDAGDGYDKLYMPKAVPSANYITKTDSQGVLHIGEVNAATNTIIADAY
RITRLAAGSFQIQKMDSTGTTVTQTMLLNNAEVLHIGPPSNYTSVALTIN
YANEFIYGTPWRDTIYLNASNISTLSQIWAYSSTDTLAIDVGAGYSKIEV
VREGSTSLLKGTLIADGTVVDLGSFSKALPSQYNYTATMSIGTGESAHSF
TINNIEAYRFTSGDVILTVDPIPPTVISFTPSDNATAIAVGSNIVLTFSE
TVQAGTGNIVITDGTDVRTIAMTDSSQVSIAGNTLTINPTADLAKGMHYF
VMLDAGSIEDLAGNDYAGTTSYDFTTIVGGVITTDFGGDSFGCGVTIQAD
GKILVVGGGSNGDIALARYNMDGSLDTSFGNETGKLTTDFGYEDAALSTI
IQSDGKILVIGESVINGGSYGKCIIARYNIDGSLDTSFDGNGKVITDFLD
GLNVDGFYTTEAILQSDGKILIVGGGYQSGNSLVTLFRYNSNGSLDSSFG
NNGMVITPSISSLNSSDFPYGVVQQVDEKILVAVRSDNNAAITLIRYNSN
GTVDTSFNADSMQITGLKENDMDGGLVLQVDGKILVSGSSNGNIILVRYH
SDGTLDSTFNGNGNIVTDLGGNDGVGAITLQPDEKILVSGYSNNELALLR
YNTDGTPDTHFCNNGVVLTNIGSDSFHEITFAGWGITVQADGKILVTGQS
NGDFALVRYNTDGSLDTSFDGVSEPTPPTHDLSGHITFWKTGNAISNVQA
TLATMPMQPASDDVAFRNLQHQSNGGYTVELWATTTQTELQSIQLAMQFS
DNVTAQWQQSSAVPTGWLSVINNTQAGHLEIGTIGQATMQGDEEIMLGTL
TFSAPDNPNNFTLAVTSGWLGDNSIAPTSILCTATDSEGNYSFETIADGW
YQITGESNTAKLADAVTAQDALAALRMAVALNPDGGNNLDEVSPFQYLAA
DINRDGKVRANDALNILKMAVGIESAPADEWIFVAESAASKTMDRSHVDW
SFAEQPIDVYGDMELDLVGVVKGDVDGSWGMVG
>Cag_0580 glutamate synthase (NADPH) small chain
MTSISLTINNISVSVPQGSTILAAAEAAGVTIPTLCFLKELEERGACWMC
IVEIKDKNRFVPACNTAAAEGMVIETENPTLSAMRRQNLERIIVEHSGDC
NAPCELACPAGCNIPDFIAAIERGDNAKALEIIKEDIPLPAILGRICPAP
CEEACRRHGVDEPLSICALKRYAADRDSEQAERYLPPCEPSSGKHVAIVG
AGPAGLSAAWFLLRKGHKVTIFEAAPQAGGVMRYGIPRFRLPESVIESDV
APLLAMGLELRCNTRFGRDVTFDNIRTQYDALLLAAGTEEAASMGIAGEE
LEGVISGITFLRNVALGTQSTLGSKVIVTGGGNTAIDAARTALRLGAEHV
TILYRRSRADMPANASEIGEALAEGITLREWAAPLSIHAVNGALEMQAIA
MQAGELDASGRRKPVPIAGSKFTLQANTIISAIGQQLNPALAEAAALTTT
RNGLAVNPDTLQSTSDASLFACGDCVTGSDTAIRAVAQGKLAAHSIHSYL
TNQPVEAPTQPFNSSYGNREQAPKAFYAQKEAAPRVALPELPLSERQGNF
HEVAIGYNNELARTEAARCLRCKCNAINNCRLRDLATHYLFGKVEQHPEH
LGFYKAANSAISMEREKCVDCGICVRLLEEHNGNVEIAVMRQSCPTGALS
MPM
>Cag_1469 Death-on-curing protein
MMEQDTNQLAVYQAENGALELRADSALDTIWASLDQIAELFGRDKSVISR
HIKNIYLESELEKTATVAFFATVQKEGSRIVTRNIEYFNLDTILSVGYRV
NSKVATRFRQWATKTLKQYITQGFSINQKRLEENKTQFLKTLEDLKILTQ
SSQQVETKDILTLIQNFSHTWFSLDSYDKNEFPKQGTQEAIQTSAEELYQ
DLMQLKAELVAKDEATELFAQEKNIGNLKGIFGNVFQSVFGQDAYPSVEE
KAAHLLYFIIKNHPFNDGNKRSGAFSFIWLLQKAGYQFRDKISPETLTTL
TILIAESKPSDKDKMIGIVKLVLSVEP
>Cag_1862 polysaccharide efflux transporter, putative
MSRNSLVAGQAGFAFAGLLFGQLMRFGYNLVVARLLGVEALGIYALAIAV
MQVAEVVALAGCDASLLRFVNLYHNDAARQRQVIGFAAKSSLLFSLAVMA
LLMLFANQLSALFHGNELLTLALSCYAAALPFNVLTQVTAHALQAFQHLK
PKIIATQLLSPLLLLLFTLLFYYTVGIQAALLMPFLLSACGALLWILLPF
ATTTGIRFIDIVRARHDNAMLTYALPLMAVSLFSMLSHWLDVMMLGIFSD
AVTVGLYHPAARTAGLLRSVLLAFAGIAAPLFAELHAQGNKAEMARLYKL
VTRWSVILLIPPLLIFMVLPQQVLSLFGAHFADSGAVALQLLSAAYFVQC
VFGIASTLLAMSGYAQLSLINAVVALALQAGLNWLFIPTMGLQGAAVASL
VLFLLLSALRWLEVRLLLQMNPLSTMLWKPLVAGAVTFLLLMLMHSWLLM
LPSLLALGVGTVIAFSCYVALMLMLKLEVDEKEIIFKYLPFMRKDG
>Cag_1344 TPR repeat
MIHMPDFFDDDRFEFSSNNGELPPDLDGLDSIFDSEELVERIMQYMEDGF
PLEALAVARRLEQIAPYNSETWFYLGNCLTMNAFFDEALEAFHKALLLSP
TDSEMQLNLALGYFNNSMYEEALEQIERVMVDFAFEKEYHYYRGIILQRL
DRYDEAEKAFLMALELDNEFADAWYEIAYCHDVCGRLEESTTTYNTALDH
DPYNINAWYNNGLVLSKMKHYDEALFCYDMALAIADDFSSAWYNRANVLA
ITGRIQEAAESYEQTLELEPEDINALYNLGIAYEELERYPDAMECYRRCI
TIVPEFGDAWFALACCHEVLEEFDEAYSATLEALKTSADCVEFLLLKAEI
EYTLNKAEESIHTYERIIELEPDNPQIWVDFAIVLREAGMVNASIEALHC
SLKLQPMSADAHFEIAAAYFALGDKLSTLKALSKAFKIDPDKKELFQSTF
PELYQQDSVRRMLGILEMPNE
>Cag_1977 MazG
MPATIDELKAAILKEHERSVPEGFQRVLDLVRVLRQECPWDRKQTAESLA
HLLLEESYELVHAIDQQETDELKKEIGDLFMHLCFQVQLADEQGHFSFNE
VFDALCKKLIHRHPHVFGSTEATTEKEVLQNWEKLKLSEGRKSLLDGVPS
AMSELLRAYRVQKKVAGVGFDWQSDEGVIDKIVEEIQELKQAATQNEREE
EFGDLLFTLVNYSRFIGTNPEDALRKATNKFMQRFRTVELLVAESERPWQ
EFTPEELDTLWQQAKEK
>Cag_1871 Small GTP-binding protein domain
MKFVDSASVFVQAGDGGRGCVSFRREKFVPKGGPDGGDGGRGGHVWLETN
SHLTTLLDFKYKNKYIAERGVHGQGARKTGKDGVEVVIQVPCGTIVRNAA
TGEVIADLTEDAQKILIARGGRGGRGNQHFATSTHQAPRHAEPGQKGEEF
TLDLELKLMADVGLVGFPNAGKSTLISVVSAARPKIADYPFTTLVPNLGI
VRYDDYKSFVMADIPGIIEGAAEGRGLGLQFLRHIERTKVLAILIAVDSP
DIEAEYQTILGELEKFSATLLQKPRIVVITKMDVTDEPLALQLAGEQTPI
FAISAVAGQGLKELKDALWRIIVAERAVPTNQVPQGGE
>Cag_0401 2-desacetyl-2-hydroxyethylbacteriochlorophyllide
MKSMKAKAIVFSGVRQIELADVKLKPLSSTDVLVETWWSSISTGTEKMAW
NGLIPSPPFIFPFIPGYETVGKIIAVGAHVNDNLIGRFAYVAGSFGYEGV
NAAFGGASEFIACPVDSLTVLDNIEHPEAGIALPLGATALHIVDLAHVEA
KKVLVLGQGAVGILAAELAKLMGAKLVAVTEPNCNRLKLSAADLKVNPDR
QDVSAALAGHEFDVLIDSTGIMSAIDTGLRFLKFQGTVIFGGYYQRINID
YSQAFQKELSFIAAKQWAKGDLERVRELIASHKLNAERIFTHHHTVGSGN
ITDAYQQAFTDQDCLKMVLHWKQANEEPTTSN
>Cag_1299 dihydrolipoamide acetyltransferase, putative
MAKDHYFTWQLSAEISAKIRYQEYGHEHHGKTPILFLHGYGAMLEHWDLN
IPHFAEQHKMYAMDLIGFGKSQKPNVRYSLELFAQQIQTFLLYKKLESVI
IVGHSMGAASSLYFAHHQPEPIKALVMANPSGLFADTMDGVASMFFGLVA
SPVIGDVLFTAFANPMGVSQSLTPTYYNQNKVDDKLIRQFTQPLHDVGAQ
YSYMSPSKRPLDFRLDHLPKPCNYQGPAYLVWGADDMALPPQKIIPEFQQ
LIPHAGAFIIPKAAHCIHHDAHEAFNQRLAFILQELEG
>Cag_0105 conserved hypothetical protein
MQEGFNHYQQQRRAYTLYQQHSYLQAEQAFHTLAAQAPSPKEKASAHFNE
ACALAMQGNHTQALPLFTLSRKGTTLTEPLRLQALFNEGTLLAAQAKKSS
ARQEKMTLYQRSLHHFKQVLLQSPTDVDAKINYEIVRRHMAALQPKPPQS
PKQQPNRAAITPAGGIGNDVAQRLLEQAARNESSLMREMAQQGKSSTPRS
TKNLRDW
>Cag_1403 Methionyl-tRNA synthetase, class Ia
MSTMPHFSRTLVTTALPYANGPVHLGHLAGVYLPADLYVRYKRLKGEDII
HIGGSDEHGVPITISAEKEGISPRDVVDRYHAMNLDAFTRCGISFDYYGR
TSSAVHHATAQEFFSDIEQKGIFQQKTEKLFFDLQAGRFLSDRYVTGTCP
VCNNPEANGDQCEQCGTHLSPLELLNPKSKLSDATPELRETLHWYFPLGR
FQAALEEYVNSHEGEWRPNVVNYTRTWFKQGLNDRAITRDLDWGVAVPLQ
SAEAVGKVLYVWFDAVLGYISFTKEWAALQGNAELWKTYWQDPETRLIHF
IGKDNVVFHTLMFPSILMAWNEGKTTDCYNLADNVPASEFMNFEGRKFSK
SRNYAVYLGEFLDKFPADTLRYSIAMNYPESKDTDFSWQDFQNRTNGELA
DTLGNFIKRSIDFTNTRFEGVVPASVTKEEWDNLGIDWQATLEQLDSAYE
GFHFRDAATLGMEIARAANRYLTSSEPWKVIKVDREAAATTMALSLNLCH
ALSIALYPVIPETCNRIRAMLGFSEPLEATIQRGTSLLSSLLTPTLQQGH
KLREHSEILFTKIEDSAIAPELEKIAKLIAEAEKREAALAESRIEFKPAI
SFDEFQKVDLRVATVVAAEPVAKANKLLKLRVQVGSLTRQVLAGIAKHYT
PEEMVGKQVLLVANLEERTIRGELSQGMILAVENSDGKLFIVQPSGEGIN
GQSVQ
>Cag_1010 CRISPR-associated helicase Cas3, core
MDAFDNKLYAHTLEGVKNKSQWQTLREHALSTAHLASDYATSFGLAECGY
WLGLIHDLGKSLPQFQQRLEDDRVKADHKHAGGLFLWDKLNNGTKPSHLA
AQCLALCVISHHGGLVDCLNQLGEDNFINTIENKLYQANLKDSLENLQLD
TELEDNIKKISGARSLVQDEIDFFFQNILQQAKKYWPTEDDKNKRKKLEL
FRIGLMTKMLFSCLIDADHTDTANFHDEERKNKNLPHLPKWDELRDMVER
YLETLPQTSSVDIERKRISDKCIQASVCESGTYLLTVPTGGGKTLASMRF
ALHHAVNREPYIPFKRIIYVIPYTTIIEQNAQAIRKVFVSQLNEDVLNEM
ILESHSNVLPNEENRNNRVLAENWDAPIIFTTNVQFLEAFYGVGTRNARK
LHNLANSIIIFDEAQTLPVRCLHLFCHAVNFLVEHCNCTAILCTATQPLL
HEIPAEHGALWLSKNFQILPDKFRKDSADSLKRVTVIDECKPQGWRLEEV
ADKVSCIHKQGNSCIIILNTKADTRELYTILRKRHGEELTYHLSTAMCAA
HRMDILSEVKTLLRNNQPVICVSTQLIEAGIDIDFDTGIRALAGIDSIAQ
AAGRINRNGKKPADSALYIQNISGENLKNLQDIAVAQVEAQKVLREFKEN
PNEFGNSLLSEAVMKRYFKFYIFNRKDEMTYKIKSDNLVNLLSSNVNAVG
EYKRTHKNQPYPNILRQSFATAAREFKVINSDTQGIFVPYNDEARGLLNQ
LRNTKSSEFQRYLFRRLQRYTVNVYPYMLKKLTKIHALEPLCENSGILAL
YEIFYDSRFGVNINSTISPDMLIQ
>Cag_1691 serine esterase
MLTRHHHDLTYLEYATPTLGNNAPLLVMLHGYGSNEKDLISLAPMLPDGL
RIVSVRAPLTLAPEMYAWFSLEFLADGIRVDEAEARAACERFVLFLRDLI
TRYQPAGSKVFLMGFSQGSVMSYLTAFLAPELLHGVIACSGQLPEKNMPS
ESAFALLRTIPFVVLHGIYDDILPIEKGRHAHAWLQQQVDDLTYREYPIA
HQIADDGIALISSWLTERLEKVGNSRL
>Cag_0091 conserved hypothetical protein
MFTQHNFSLNLAVRDYECDLQGIVNNSVYLNYLEHVRHEYLKHVGIDFAT
LTREGIHLVVIRAELDYKASLTSGDSFCVGLTFLRESPLRFSFLQDIYRL
PDNKLILKAKIIGTALNEHGRPFLPLQLEQLFQST
>Cag_1141 conserved hypothetical protein
MVASPAFGQMRYTALRLTLVALTFGTVQELNAAESSNFRQQMSMADQELW
KARYPQADSLYNVLLRQNPSNTEVNWKLARLQISLGESLPPSQQPVRLRH
YRQAENYARTAIAIDSTEAPAHIWLAASLGLMADKIGPQEKLKRAEEIKR
ALDTAVRLNPDDATAHSLLGTYYYEASKIGWFRRMIGNTFVGTMPQGNKE
LAEKEFRRSIALDPRMIRNYHDLAKLYLDMGRKAEAITLLKTALNKPILV
ESDKRRLEQIRELLRKHNGDGE
>Cag_0273 conserved hypothetical protein
MKKIILRQAFNELNDAIAYYEEQQPGLGVKMKDEVDQHVHWILNHPLIPR
LRHGGYRRVNLKVFPYYIAYLVHQETLWILAIAHTHRKPKYWIKRKNKI
>Cag_1173 Protein of unknown function UPF0054
MSLELCNSTRQAIPNKRLLQAIRMVVQGEGYEIATITGVYCGNRMSQRIN
RDYLNHDYPTDTITFCYSEGKAIEGEFYISLDVIRCNARHFNVTFEEELL
RVTIHSALHLTGMNDYLPEERVAMQAKEDYYLQLLKTQKSISPQKSTDNA
IFSCNS
>Cag_0004 conserved hypothetical protein
MNNHIVITGATGVIGVELAQKLIKRGEKVVLLARSPNAAQQKIPGAAAYV
RWDSDMQEGEWKSTISGAKAVIHLAGKPLLESRWNEEHKQECYQSRIIGT
RHIVAAIAEAAEKPQVFISSSAIGYYGSFDKCSDTAPLTESGNKGSDFLA
HICIDWEEEARKAENLVPRLVFLRTGIVLSTRGGMLQKMMTPFQYFAGGP
IGTGLQCISWIHMDDEVNAIIASLDNSAYKGAINLVAPTPVSMKEFASKL
GAVMGRPSLLQVPEFAVKMLMGEGGEYAVRGQKVLPTFLEKQGFTFRYPD
LSNALGDLIKHGK
>Cag_1591 conserved hypothetical protein
MQNKAIPHSPAEQVALLDSNECSLPRFHERLASEGVSQLQADGIEILQLN
IGYRCNLRCTHCHVNGSPERHELMSREVMEQCLVALDKSNATTVDITGGA
PEMNPHFRWFIGELRATKPDARILVRTNLTLLTDNKTYSDIPELLKAHRI
ALIASLPCTTKKTVDAVRGDGVFDRSIAALKLLNSIGYGTSDSALELNLV
FNPSGAFLPEAQQQLEHHYRTELQNKYGITFSHLFTITNMPVSRFLENLC
TNGTYCDYMKLLVDSFNPTSVKNVMCRTTLSVGWDGTLYDCDFNQMLRLP
VECSVPQHINAFDAEVLSKRHIVTNQHCYGCTAGAGSSCQGCLV
>Cag_0489 IMP dehydrogenase
MTKILYEALTFDDVLLVPAYSAVLPKETTTACRLTNTISLNIPLVSAAMD
TVTESRLAIALARAGGIGIIHKNLSIEQQAREVAKVKRYESGIIRNPFTL
YDDATVQDALDLMHRHAISGIPVIERPQNEGDASRILKGIVTNRDLRIKL
QPNAPIAQIMTSQNLITAREDVGLQQAEEMLLANRIEKLLITDNAGNLKG
LITFKDIQKRKQFPNACKDSHGHLRAGAAVGIRENTLDRVQALVDAGVDV
VAVDTAHGHSQAVLDMVKKIKSHYPDLQVIAGNVATPEAVRDLVKAGADC
VKVGIGPGSICTTRIVAGVGMPQLTAIMKCAEEAAKTNTPIIADGGIKYS
GDIAKALAAGADSVMMGSIFAGTDESPGETVLYEGRKFKTYRGMGSLGAM
SEPEGSSDRYFQDSSSEAKKYVPEGIEGRIPAKGTLDEVVYQLIGGLKSS
MGYCGVATIDELKQNTRFVRITSAGLRESHPHDVKITKEAPNYSTSM
>Cag_1541 metal dependent phosphohydrolase
MGIVINLLLLVLAALVAFVAGFFIGRYFLERIGTTKVLEAEERAVQIVQE
AQKEANEYKELKVSEVNQEWKKKRREFEQDVLIKNNKFAQLQKQLQQREA
QLKKQSQDVRDAERKLQDQRKEVEQLSDSVKLRATELERVIVEQNQRLES
ISNLQADEARQMLIDNMVTQAREEASNTIHRIHEEAEQQATRMAEKTLIT
AIQRISFEQTTENALSVVHIQSDELKGRIIGREGRNIKAFENATGVDIIV
DDTPEVVILSCFDPLRRELAKLTLKKLLADGIIHPVAIEKAYADATKEID
DVVYSAGEEVAASLQLNDIPTEVIALLGKMKFHTVYGQNLLQHSREVAML
AGVMAAELKLDARMAKRAGLLHDIGLVLPESDEPHAITGMNFMKKFNESD
QLLNAIGAHHGDMEKESPLADLVDAANTISLSRPGARGAVTADGNVKRLE
SLEEIAKGFPGVLKTYALQAGREIRVIVEGDNVSDSQADMLAHDIARKIE
SEAQYPGQIKVSIIREKRSVAYAK
>Cag_0825 conserved hypothetical protein
MTHTTPLLPLAPPINLVDLRYNELHNAITAFGEPPFRAKQIHEWLFSHHA
NSFAAMSSLPLRLREKLAERFTLQRPEVVEVQESCESGCLRPTRKILLKL
SDGALIECVLIPAEERMTACLSSQAGCPMQCTFCATGTMGLQRNLSAGEI
WEQLYALNGLALQEGKTITNVVFMGMGEPLLNTDNVLEAIATMSSRNYNL
SLSQRKITISTVGIVPEIERLSRSGLKTKLAVSLHSARQEVRQQLMPIAA
ERYPLPLLSKSLEAYSKATGEAITIVYMMLNGVNDSKEDAHLLARYCRHF
SCKINLIDYNPILTIRFGSVQESQKNEFQAYLMAQKFHVTVRKSYGASVN
AACGQLVTQQQRRTIK
>Cag_1111 conserved hypothetical protein
MDHTKINLLVRLQYLDNQIENIVSLQKGLPEEIEALEDDLAFTTRQIESR
KKIADEQLRQRTRLNEQINECNNKINSFKEKQTLARNNKEYDALSKQIEY
EEKEIANANMQLQDIAQTTQRLQELQKKGAQLITENRYDEITEEMMPDDV
LLQQLKDLTAQVAQKREELESIVIETAADVATLEEKVVAQRALITKEAKR
LIDKYDHLRSGTLRNAVVKLNRNACSGCNTRVPTNRHTMIVQGGFYLCES
CGRIVVHERLFEEAKD
>Cag_0885 methyltransferase, putative
MMSSSYSLEKFNREAATWDEKPRRRLVAKAVAHAIIAHAKPQPTMRALEF
GCGSGLVTMPIAPLVGSLVAVDTSPEMVKMVQQKAEEAALTTLTTLVDDL
FAEAEAYREPFDLIFSSMTLHHIADTATVLQRVAQLLVTGGVLALADLEL
EDGFFHDDPHEEVHPGFERSALEAALSAAGLQVRSYHIAHTIHKCNRAGV
DAAYPIFLLVAEKV
>Cag_1216 conserved hypothetical protein
MQQKLLLISTIGPEHPEKATLPFVIATAAQALDVKVVMFLQSNGVILAKK
GEAETIAAPGLVPMKELLDTFLEMGGTLMLCSPCLKERYITPNDLIEGAE
IGAAGTLVSEIMSATSVVTY
>Cag_0902 putative transcriptional regulator, XRE family
MLQSQFIEAHLETDCPVFYDDDFVADVLPIMQRQHVSCAPVLSGGKPERL
VTLPDLLAAEQTTDSDTLRLKELPLPQASGVDAGEHLFDIFRRLPHFPCD
VVPVADDKGMFAGVIDKQQVIEQVARIFHVGDDSLTLELEVPKSGVKLSE
IIALLERNEATILSFGMYTATSDNHESIILSFRLQTHDFFRLVQNLEHYG
YQVHYTSQMFNAEDEVLREKAREILYLIDL
>Cag_0197 conserved hypothetical protein
MTHPPFHRLIAFLLFLGAEVLYLATMAPTFSFWDCGEFIATAVTLGIPHP
PGAPLYLLLGRLFAMIPFVSDIGARVNVISTLASSATVMLTYLITVRFIT
LYRKHPINEWSRSEQIAAYGGAAVGALALAFSDSFWFNAVEAEVYALSSL
FTALVVWLMLRWHEEAPKAGNERWLLLVMYIIGLSIGVHLLSLLAVFAVA
LAYYFKKHQVTLVSFGWLVVVSLALFFLIYVVIIKGLPVLFQVASWWGLL
AGLLLLCAAIWYSQRHRKAMMNTLLMSLLLLVIGYTSYGMIYVRAQANPP
INENNPSTTESFYAYLNRDQYGDMPLFPRRWSPEPIHQYFYEQYSSDFDY
FTRYQMQKMYLRYFGWQFIGREHDMEGAGVDWSVLWGLPFLVGLVGAISH
FRRDWQMGVVVTALFVLTGAALVIYLNQTEPQPRERDYSYAGSFFAFALW
IGIGVESLWQWLAGRMKASSEKLPVVALSVVGLALLLVDGRMLMANYRTH
DRSGNYVSWDWAWNMLQSCERNAILFTNGDNDTFPLWYLQEVEGIRRDVR
VVNLSLANTGWYLEQLKNSSPRGAKPVNFSMSDGELATISYMPIDSVDAI
LPSSTARRSLLRDTWRSGNNLPSAPLDTMVWPLKPGLTYDGQGYLRPQDL
AVYDIVMSNFEDRPIYFALTVDPESMIGLDTFLRLDGLVCKVVPVKSSDP
MSYTDPLILYQRLMQVYRYRNLANKHVYLEETSLRLSSNYTPLFVRLALE
LATQPEETLAVTDGNGVPRLVRRGALALQVLDSSERFMPLSRYPVNPELA
ASIIALYVQLGEKQKSSPYISYLEALSHTNSPLMEPRLFLILARAYYSLG
REAEAKAIVQQLARELDQPELLKTFETTKK
>Cag_1750 hypothetical protein
MKILLHIGCGIKDKTQTTLAFHNEEWDEIRYDIDPDVAPDIVGNMTDLIT
LAPASVDAIYASNCLETLYPHEVPQALAEFRRVLTEDGFVVINSPDLQAV
CTLVAKGKVLDPAFVSPENGLVTPFDLLFGHRPSLAEGNMFMAHRCGFTS
QMLSGALQAAGFSMVASMVRPKHYDLWTVASKSQRTEPEMRALASEHFPG
LGVM
>Cag_0291 Survival protein SurE
MTHHDAQPSTDAEQSSNATLPHILICNDDGIEADGIHALATAMKKVGRVT
VVAPAEPHSAMSHAMTLGRPLRIKEYQKNGRFFGYTVSGTPVDCIKVALS
HILTEKPDILVSGINYGSNTATNTLYSGTVAAALEGAIQGITSLAFSLAT
YENADFTYATKFARKLTKKVLAEGLPADTILSVNIPNVPESQIAGVIIAE
QGSSRWEEQAIERHDMFGNPYYWLSGSLQLMDHSMKKDEFAVRHNYVAVT
PISCDLTNYAALAGLEKWKLKK
>Cag_1963 conserved hypothetical protein
MPLMALLLLAFSLPAFAADAAPLAVQSDWWIWVLGLFTFSFFLGIIAVIA
GVGGGVLFVPIVSSFFPFHIDFVRGAGLLVALAGALSAGAPLLRKGLANL
KLALSMALIGSISSIAGAMVGLALPENIVQLSLGATILFISVIMLLSKNS
AYPDISKPDSLSQALHIHGIYYDEQLKKDVSWQIHRTPIGLLLFIVIGFM
AGMFGLGAGWANVPVFNLVLGAPLRVSVATSVFVLSINDTAAAWVYLHQG
AVLSLIAVPSVAGMMLGTKIGAKLLTKVHTSVVRWIVIALLAGAGLKAFL
KGLGI
>Cag_1499 membrane protein
MPSLRSISRYPFAIPLLSILAALLLSSLIIVAAGRDPLMIFQKMLRSVAG
SPYGMGQVLFRTTTLVCVGLAVALPFHLKLFNIGGEGQLLMGTFAAAMAG
LFLPQTVPPMVAIALCTLAAMAAGSLWALTAALLKVRFGVNEVIGTIMLN
FIAQGITGYLLTWHFAVPSTVHTAPIIDSATIPTFSVLTGWFASSPANPS
IIFVLLVALMLHLLLYHSRMGYEMRAAGLQPDAARYGGINATMHTLTAFA
LGGAIAALGATNMVLGYKHYFESGMSGGLGFTGIAVALLAGAHPLWLLLS
ALFFATLEYGGLTVNIWIPKDIFMIIQALTILIFISLSALGKRAN
>Cag_0588 conserved hypothetical protein
MIHSIRLDNLLSFASGNPTLPLQKLNVFIGTNGAGKSNLIEALDLVRATP
RSPSNNDFQRVISRGGTIMEWIWKGSPDTPATIELIMDNPYNSHTNEKQP
IRHLFSFKGEQQRVIFVDEIIENESPYHSNNEPYFYYRSYNGKPVINSAI
AGERKLQRDSINEELSILAQRRDPEQYPEITKLAEIYEEFRLYREWTFGR
NTIFRNPQRSDLRNDRLEEDFSNKGLFLNRLKTHKPKAKTAILEGLKDLY
QGIDDFNISIEGGTVQVFFTEGEFSIPATRLSDGTLRYLCLLALLCDPEP
PPLLCIEEPELGLHPDIIPKLADLLIDASQRTQIIVTTHSDILIDALTEI
PESVVVCEKNEGKTTMQRLNSNDLAEWLKHYRLGQLWTRGDIGGTRW
>Cag_1389 nucleic acid-binding protein, contains PIN domain
MSYLIDTNIIIYYFNGLTNDESLHSILANNFKISIITKIEFLGWGQFLSN
QNLYIKAKSFIHYATIFDINDAIAEQTILLRQQFKTKTPDAIIAATAMVK
NLTVVTNNTDDFNRLGIKTISVTMQ
>Cag_0143 Competence-damaged protein
MRAEIISVGDELLRGQRVNTNAAVIARMLSAIGVSVSHIVACSDDEADIM
ATCSAALGRAEVVLVTGGLGPTRDDRTKHAIQQLLGRGTVLDEASYRRIE
ERMAARGSAVTPLLREQAVVIEGSHVIINSRGTAAGMLLDCGEPFAHHHL
ILMPGVPVEMEAMMHEGVIPFLTSLSNSVICQTPLKIVGVGETAIAAMLV
EIEDAMPPATMLAYLPHTAGVDLMVSSRGNSREAVEAEHQQVVDAIMERV
GTLVYATREISLEEVIGEMLLRQTFTVAVAESCTGGLLASRFTDISGAST
YFQQGFVVYSNEAKERALGVPHETLVAHGAVSEEVAQGMALGCLEKSGAD
FALATTGIAGPTGGTPEKPLGTLCYAIAVKGGGVVVCRKVVMQGTREQRK
VRFSTAVLREFWMLLKEREASEE
>Cag_0303 peptidase, M16 family
MAKPRKFFPVLLFAAMVLFFLHLTACSPTKTLMNSNSAYPYTTIQGDSLH
TRIYKLKNGLTVFMSPCYDEPRIYTSIAVRAGSKNDPAETTGLAHYLEHM
LFKGTDAIGSLDYHKEHPQLEKITALYEEYRSTANPEKRAAIYKMIDSLS
NVAASYTVPNEYDKLLSSLGATGTNAYTWVEQTVYINDIPSNKLDQWLTI
EAERFRNPVMRLFHTELETVYEEKNMTMDSDSRKIWENLFAQLFQKHQYG
TQTTIGKAEHLKNPSIKNVMEYYRSHYVPNNMALCIAGDFDPDATIRLID
EKFSVLESQPLARFTVEAEEEITAPRVMHVKGPESEELVMGYRFKGVNSS
DADYLTLIDKILFNHTAGLIDLNLNQQQKVLDASSMLVLMKDYSAHLLTG
KPREGQSLEEVQQLLMEQIELLKQGEFPEWLLEAAINDLYTEQLKQYETN
RGRVEAYVDSFIWGMEWQAYMQQIERLHKITKADIVAFARKHYSTNNYVA
VFKEHGTPESEAKIQKPPITPLTVNRDTISTFAQNLLERPSALTQPRFLD
YSKDISFYNVTDDITLHYVHNNENDLFSLFYVFDIGKNHSKKIDLALDYL
SYLGTSKLSPKAYSQEMYKIGASFSAYTADNYVYLKLSGLHKNAEAAIRL
LEELLMDAQPDEEALGKLKAGTLKERADDKLSKKKILFEAMANYGKYGAH
SPFTNVLSNREVEQVRSQELLDELRNLLNYRHRVLYYGPESAENVLSELR
SVRHYPATFMATPSLDLFKPLEVTENLVYVVDYDMTQAEVMMLMKDETYN
SATLPIVTLFNEYYGGGMSSVVFQELREAKALAYSVFSVYRTPKQKGEHN
YIISYIGTQADKLPEALEGIGDLMKTLPESPQLFETAQKGIEQKIATERL
IKTEILFNYEEALRLGHSHDVRKDIYDATQRMSLEDVKAFHKKHFSNKKQ
VMLVLGNRKNLDMATLRKYGTVRELTLKEIFGY
>Cag_1273 conserved hypothetical protein
MKPPFLITTLNKAHDRNAFYSGSEMLDRYLKQQVTQDIRRNLTACFVALN
NEKQIAGYYTLSSASIALDALPESLIKQLPRYTSLPAARMGRLAVAKTYQ
GMGLGATLLTDAIMRAKQLNREIGMYALLVDAKDEHAAMFYLHHGFIRFT
NSPQTLFLPLSQISLQN
>Cag_1174 GTP-binding protein HflX
MNTVTPENQREKAFLVGIYSPPEVPRSLVEEYLAELAFLADTAGADVLDT
FIQERKVRDPSYCIGRGKVDEMEAYIKSEKIDVVIFDDDLSPGQARNLER
AWGCKVIDRTGLILHIFAIRAQSTQAKMQVELAQLEYILPRLSGAWTHLS
KQKGGIGNKGPGETQIETDRRLVRNRIALLKKKLREVERQHYTRTRSRQN
VSRVSLVGYTNAGKSTLMNALCPQAEAFAENRLFATLDTKTRRLELKINK
LVLLSDTVGFIRKLPHTLVESFKSTLDEVLQADFLLHVIDISHPSFEEQI
AVVRDTLREIGVQHDQIIEVFNKIDALEEPTLLREMGDKYPNAVFISAVR
GINLSLLKEIIGEQLARDYTERHVRLHVSNYRLISYLYDHTEVVEKKHED
EMVELTIHVRNHQLPQIDAMIQAAAEAPHEP
>Cag_0380 DEAD/DEAH box helicase-like
MSFQTILEKYRRISFSERDKGNRFERLMQAYLQTDRQYATQFKKVWLWNE
FPGRHDLGGSDTGIDLVALTHGGDYWAIQCKCFEASATIDKASLDSFLAT
SSREFKNEQMQTVRFAERLWISTTNKWSSNAEEAIKNQNPPVTRITLQNL
VNAPIDWEKLENGVHGEFARREKKKLYPHVLEVRDKVVDYFKEHERGRLI
MACGTGKTMTSLKIAEKLTNHKGTVLFLVPSIALIGQTLREWTSQADETI
NPICICSDPEITKKKNTTDQDLTSTIDLAWPASTDANYILKQFQHYKNKS
NNGMTVVFSTYQSIEVIAKAQKVLLKNGFSEFDLIICDEAHRTTGYTEPG
MDDSAFVKVHDGNFIKSKKRLYMTATPRMYNVDARSQAAKQAIPLWSMDD
EEYFGKEIHRIGFGEAVEKGLLTDYKVIILTLNDKDVPPAVQKMISNGKT
EIKTDDLTKLIGTVNALSKQFLGNESIIVDGDELPMKRAVAFCQSISNST
TIAASYNLASENYLDALPENKKAKMVTIQAQHMDGTMAAPQRDQMLNWLK
EETSGNECRIITNVRVLSEGVDVPSLDAVLFISAKNSQVDVVQSVGRVMR
KSDGKKYGYIIIPVFVRSDEEPENALDDNERYKVVWTVLNALRAHDDRFN
ATVNKIELNKKRPNQIIVGGADTAFDGDGNPIDKRRDGYDQSKEIGQQIA
IQFEQLQDVVFARMVQKVGDRRYWEQWAKDVAVIAERQIERINYLINEKK
EQRAAFDKFLLGLQKNINPSINEEQAIEMLAQHIITQPIFDALFEGYSFV
KSNAVSVAMQSMIDALEKGSNLAEQDETLQRFYDSVRKRAEGIDNAEGKQ
RIIIELYDKFFKTAFPKMVEKLGIVYTPVEVVDFIIHSVNDILKKEFNRT
ISDENIHILDPFTGTGTFIVRLLQSGLIDINDLERKYKHELHANEIVLLA
YYIAAINIENAYHDAISGYRNLGLGFGEENLVTHRYLNTNANFQRTNCLA
GSDEFGRDDLQNNKELSERGDVWLDESNKESSEFNSGKHSRRIWEKEQGR
ISTISGNSERITYGVRDTCFDSTENSNSECDGNGNNIGTNSNSGKIIDSS
SEISNTQTLNPIPYEPFDGIVLTDTFQLGETKEGEIQYEEMLKKNSDRVE
KQKKAPLRVIIGNPPYSVGQKSANDNAQNQKYEKLDARIAETYAAGTNAT
NKNSLYDSYIKAFRWSSDRLSKEHGGIIAFVSNGAWLDGNSNDGFRKCLE
KEFTSIYVFNLRGNARTQGELRRKEAGNVFGGGSRTPIAITLLVKKGKKD
A
>Cag_0329 GTPase EngC
MKQVVEHVLVGTVTEVAGTSYIVQGDDGTLYRACTVPSTKSANSDASLVA
VGDRVELKASVSGHAGYEAIITNVLARRTTLARQRDVRRNRSKERVQVIA
ANIDQLVAVVSAFEPPLNRRLIDRYLVFAESEQLPILLVVNKCDLDDEED
GSSYVREMMHPYHALGYSVLYTSAENGEGVEELRQALAHKLSAFSGHSGV
GKSTLINMLSGQERLRTAETNVKTGKGLHTTTNAVMLVLPDGGAIIDTPG
LREFTLADITRDNLRFYFREFLPVMAQCAYSSCTHTVEPECAVRNAAESG
TIDPERYESYLALYDSIAE
>Cag_1074 xanthine/uracil permease family protein
MQKYFEFERLGTNYRQEIIAGITTFFTLAYIIIVNPAILEAAGIPKEASL
TATILTSIFGTLLMGLYAKRPFAVAPYMGENAFVAYTVVHTLGYSWQTAM
AAIFISGVLFTLITIGGLRQWLAEAIPATLKHSFSVGIGLFLAFIGLNDM
GVVALGVAGTPVKLADVTQLPVMLSLAGMLVTALLLIRRVTGALLIGMAF
ITAAFLLLGLTPLPTALFSFPPSIAPIFMQIDWHGALTWGFVGVIISVLV
MDFVDTMGTLFGLSSRADLLDENDNLPDIQKPMLVDALSTIAASIFGTTT
AGVFIESAAGIEQGGKSGFTAVVVALLFALALFFAPILTIVPPYAYGPVL
LLVGMFMMQSVTRFNFNDYSELFPAFLTIALMVFTFNIGVGITAGFIAYL
LLKLLSGQFRDIKSGMWILALLSLSFYLFYPYH
>Cag_1148 hypothetical protein
MRHQGASILLYNQQHEVLLVLRDNLPFIACPNTWDAPGGHLDAHETPLHC
IVREMMEEMELDVSTCSHFKSYEFSNRTEHIFTMQTDVLNTATTPLHEGQ
MIRWFTVADALQLSLASDMEVVLHDVGIWLEQQNNGTEDCGNV
>Cag_1052 conserved hypothetical protein
MKERWVVVIGGGAAGMAAAVSAAEQARYLGVDCHITVIEKTHQVGSKIRI
SGGGKCNVTHVGTSAELLEKGFLRAAEQRFLRSALYAFSNNELRALLQQQ
GVATTEREDGKVFPVAGEASVVAEAFRTLLQRLKINCELHAPVQAIKVHG
QQFHLITLHGDIVADAVIVATGGVSYRHTGTTGDGLRLARALGHTVVEPS
AALSSIMVQPHSLVALAGAALRGVAAVARAGKLRAERQGDILFTHRGFSG
PAMLSLSRDVANMQRSQREAVHLAADLYPQQLHDELEALLLQHSKKQGGQ
LVRKFLQVSPIGMLLLKSETMPYGTIPNAMVPLLMRQAALDDEVTFATLS
REHRHQLVVTLKQFQLGTVHNVSLDAGEVSAGGVALSEVNPKSMESRLVP
NLYFCGEVLDYVGEIGGYNLQAAFSTGWMAGKSAVNKLLTAL
>Cag_0970 conserved hypothetical protein
MRVEKLSTLLLCRIALAISWIYQGAVPKLMCQSSGELELLGHIIPIYKWA
CIAMQWMGYGEILFGVFLLIARWQWAFWLNIIALIGLLFFVGIFEPTMLT
LPFNPLTLNVALIALSLIAIRELRQ
>Cag_0837 hydrolase, alpha/beta hydrolase fold family
MNKVDNNNHSEWPAEAISQFATINGFNVHYRIAGKGEPLVMLLHGSFLSI
RSWRLVFGELAKHTTVVAFDRPAFGKSSKPRPSTTTGANYSPEAQSDLVI
ALMRHVGFQKAMLVGNSTGGTLALLAALRHPNNVAAIALAGAMVYSGYAT
SGIPAPLKPLFKAASPLFARLMGKMITKLYDRTMYGFWHNKERLSPDVVA
AFRNDFMQGEWARGFWELFLETHHLHFEERLKGIVVPSLVITGDNDLTVK
TAESERLANELPGAALAVIANCGHLPQEEQPEAFVQALLPFIEKVRLHL
>Cag_0188 conserved hypothetical protein
MNADNFRLQLGSKEYVPIIIGGMGVNISTAELALEAARLGGIGHISDAVV
TYICDQLFKTSYVSRKRKQYAAYSNSPDKSAVLFNLEELAEAQKKYIENT
IARKKGDGAIFLNCMEKLTMNNSAETLKVRLSAAMDAGIDGLTLAAGLNI
RTLDLISDHPRFRDVKIGIIISSVRALSIFLKRAVRLNRLPDYIIVEGPL
AGGHLGFGADNWHMFDLKTIFTEVVDFLAKEDLHIPVIPAGGIFTGTDAV
EYLQMGAGAVQVATRFTISKEAGLPCDVKQHYLNAREEDIVVNMASTSGY
PMRMLVQSPTLRYAIRPNCEGLGYLLENGGKCSYIDAYYEALENRKEGEP
LSVKIKTCLCTGMANYDCWTCGQTTYRLKETTNRLPDGKWQLPSAEDIFL
DYQFSSDHAIQRPKPEA
>Cag_0598 conserved hypothetical protein
MEENKSQIIIYQTENGETKLDVRFQDETVWLTQKLMAELFQTTSQNITIH
LKNIFEERELEEDATCKDFLQVQKEGNRKVKRNQKFYNLDAIISVGYRIK
SHVATKFRQWATQHIKEYIVKGFVLDDERLKNPDLPFDYFEELERRIQDI
RTSEKRFYRKITDIYATSVDYDPTFDISIDFFKTVQNKLHWAITGQTAAE
IISLRADSTKENMGLTNWRGDKIRKADVLIAKNYLNEEELTSLNNLVEQY
LIFAQGQAMRRIPMHMKDWIEKLNGFLTLNDRDILNNAGSISHTLAKENA
EREYEKFKDSEQKMITEDDFEKTIKNIEQKKKK
>Cag_1773 conserved hypothetical protein
MKPLRQQRWGATTQGIAIWLIALLWFLPIGMLQALTVPALTSRVNDYANM
ISPNVRAELEAKLAALETTDSTQLVILTVPSLEGDPIEDFSIRVAEAWKI
GQKGTDNGVLLIVSQADRKVRIEVGYGLEGKLTDLQAGRIIRNNIAPAFK
MGEYDLGFVQGTNSIIAAVRGEFIASDKKSQKSNKPSMPLLFVILFVFYI
LSQLMRGHRQSGPMAYGGPGGFYGGGFGGGGSGFGGGGFGGGGGGSFGGG
GSSGDW
>Cag_0485 hypothetical protein
MKLGSPATGNDFFGREQELRDLWRYLESDHIRFPGVRRLGKTSILKRLEA
DAAEHGLLAKWVDVSNIDSAKGFVALLEQAFPENTIKRFLSDKTKQVADW
FKIIRKVEVTLPDEAGGGGFGIELGEVLLEWQHAANHLHHRLSNQPLLIL
LDEFPVMLEKLIQRNRQEAEQLLTWLRIWRQSQGACRFVFTGSIGLQSLL
ERHRLGETMNDCFAYPLGPYKPSEARDLWKYFAQNADENTWQVTDLVIDH
ALSRVGWLSPYFLCLLLDESIRAARERQEEWPSKASGAASIEVEDVDDAY
EQLLAERSRFHHWEKRLKDALAPAELDFCLSLLTHLSRKPEGLTLNQLSS
RLAKREPDPDCRAQRIQELLVRLTDEGYTSSPDSNKRVQFLSFPLRDWWN
RNHVR
>Cag_0450 hypothetical protein
MNITIDTSSLIAVIGNEESKEKIIKITEGSSLCSPLSVHWEIGNALSNMF
KKGRILLEQAQLALFAYNEIPIKFIDVSLVKAINLSHSLNIYAYDAYVIQ
CAKQTGTPLLTLDNGLKVAAQKSGINLLELQS
>Cag_0702 probable phage-related lysozyme
MQTSDNGLNIIRQYEGLRLKTYFCPAGKLTIGYGHTGTDVTSGMSITEAQ
ANELLQEDVKRFATSVNKMVTTEVTQGMFDALISFSYNIGAGNLQKSTLL
KKLNAGDKQGAADEFLKWNKSNGKPLAGLTARRTAERELFLA
>Cag_1350 Protein of unknown function UPF0001
MESIASNLAAIHAQVAAACKQAGRKPESVRLIAVSKTKSAELVREAFDAG
QLEFGESYMQEFLEKYESRALQGCPIQWHFIGHLQSNKVRSLVGKVSLIH
GIDKLSTAEELSKRAVQQNLTVDYLLEVNTSGEASKYGMAPHTVLSEASA
FFALPNVRLRGLMTIATYEREAARREFQELRELLEQLQAIAPDPTLVTEL
SMGMSGDFEEAIQEGATMIRVGSAIFGWR
>Cag_0656 nucleic acid-binding protein, containing PIN domain
MTEKSEKYLIDSNIIIYHLNGENSATEFLRSNVSQSYISRITFIEVLSFD
FMKDEKEDVLNLLRRFEIIDTTDAIAMRAIENRKLKKIKLADNIIASTAQ
VDDLVLVTKNIKDFNGLNVRVLNIFA
>Cag_1760 conserved hypothetical protein
MTDLSHETLLRLHSELAAEYELAESSYQFGGKPFSFLHVRDSYALLDRIS
PEEFVKDEQMPYWAEIWPSASALSTFFMDEVALEGKHLLELGAGIGVVSI
VAAWRGAQVVATDYSIEALRFIRYNSLKNSVALTAERLDWRQVQRSDRFD
YVVAADVLYERVNLLPVVLALDKLLKADGVAFIADPRRRMAEQFVELATE
NGFVVTTHARRCQIGEAPVMVNIHQISRL
>Cag_1471 conserved hypothetical protein
MQHFSQSKRRFVLFWHTTAALAVLMLLYPLIGNFFIAWFVGAELERGALL
QPEMLRSFVPSLRIVQMVGQVLLLAFPALLLAGWQRGCRAPWSPEVRMWM
GLQPPFDGGVLVAGAGGIILLQPLLSLIAALQERYLWPALGEAGREVLQQ
QESMELFLRTIADAHSLPEALFVLAVLAVTPAITEELLFRGYVQRNYMQV
LSPAMAIVLSGLLFAFFHLSAANLLPLALLGCYIGYIYYCSGNLFVPMVA
HFTNNALALIVLWFTPQEHTMAMQAERIALLLSPAWWFLVVGCTLLFWRL
MRWLTDRIVRH
>Cag_0092 conserved hypothetical protein
MKPLKQVGESYFLLSQGEKQIEGAAFEEAEQSYRLAMTMARTIPTEEAFD
YDGFDAIAHAGLSSALIGLGRYNEALVSVAEALRYFNRRGDLHSAEGSLW
IAVICNKARALESLGRKDEAIKYYRMAGEMIAEKKGEIKQRDLLTELIEQ
GLQRLEGAKPATAKQGYKAWWEFWS
>Cag_0331 conserved hypothetical protein
MNPANAHIDNQLRQVLERHPLIRLAILFGSVAKGTAGFESDLDLAVGAAR
PLTVQQMMALIEDLVEMSGRPIDLIDLSTVGEPLLGQIIAHGRRIIGSDT
DYVNVVLKHIYNEADFVPLQKRILKERREAWIGK
>Cag_1639 oxidoreductase, short-chain dehydrogenase/reductase family
MLTLHQKVAIITGSTRGIGKAIAQEFVRQGARVVITSSSPQNVEAACKEF
PAGSVHGIACNVTSPADMERLVRESVAHFGQLDCFINNAGISDPFTNITE
SDPEAWGRVIDTNLKGTYNGCRAALIYFLTNNKQGKIINMAGSGTDKGSN
TPWISAYGSTKAAIARFTYAIAAEYRHTNISIMLLHPGLVRTGMVSTEHP
TPELERQLRTFNTILDIFAQPPTVAASLAVKMASPWSDGKNGIYLSALSS
LRKKKLLISYPFRKLFKKIDRQTY
>Cag_0203 conserved hypothetical protein
MSNNHSTNQRILLLLLLVISALFFTMIRYFLLVVVLAAIFSALAMPVYNR
FERGLRGKRSLSAIMTLLTLLFIVVLPLAILLGLVVKQAIRLSNVAVPFV
QEQLLTPSQFDHHLQSLFFYPELVLYREEILQKVSELATKFGTLLFNAIS
SFTYSAVTEIVLFFVFLYTMFFFLRDGKQMLQSMLALLPLSHTDQYRLLD
KFLSVTRATLKGSLVVGMVQGSLAGMALYMAGIESALFWGTVMSFLSLIP
VLGSALVWIPAVIYLATIGSYPQALGVLLFCMIVVGQIDNIIRPILVGRD
TQMHELLIFFGTLGGIGMFGFFGVILGPIVAALFTTIWEMYAESFGDYLS
TIQKNRTSTLKD
>Cag_0689 PucC protein
MKKFNLVRLSLFQMGFGIMLGFLLDTLNRVMTTELRISATIVFGLISLKE
LLAIFGVKVWAGNLSDRSQIFGLRRTPYILLGLVSCVFSFIMAPTAAYEV
TVGGVGFAEMIPAMLQDVGLLKLALIFLLFGFGLQVATTAYYALLADTVD
EANLGKITGASWTLMVLTTIVSTRIVGSYLDHFTPERLLFVAEVGGFIAL
ALGLIAVLGVEQRNGEIKEGKEKHALSFAQSLQLLTSSPKTLFFASYIFI
SIFALFANEIVMDPFGAHVFDMSVGTTTKLFRPTMGGMQLLFMLIVGFLL
GRIGQKRGALIGNIMCMIGFGLLIGAAFSRDEQFLRIALVVTGIGLGASS
VSNISMMMAMTAGRSGVYIGLWGTAQSLAIFIGHLGAGMIIDVVYHFTGQ
YVWAYAAIFAMEIVAFAIATLMISHVSKEEFEAESKAKLAELALSAKG
>Cag_1453 TPR repeat
MSDATEYQENLREAERFFHYNKHGFLFAVSNDELVQRNLNSSLQQLLRGK
GKTLLTYTWDTNPEALHPVKQLRQFQQNHLELNGLILNGLEPALEHNPNF
LVQLNFGREGLAELRIPLLFWVSNRTLQRVNREALDLYNQRVSANLYFEH
DPTLQQSDNSALRYIAQETVRANKSLAGVEERMKLLQQQLDEAEKQHVEP
KTIANEIVLELLELYSQILGAEPLIHTLLNKYEAFIDRENPENCFKLARV
LYEIGMRTEAMDLYQKALQTLRELAVRNPDIYLPHVATTLNNLGALQYTT
NDYTAALASFTEALTIRRELAAKNPDVYLPDVADTLNNLGALQSDMNDYA
AALASFTEALTLYRELAVKNPDVYLLYVAGTLNNLGILQYNTNDYAAALA
SYNEALTLYRELAAKNPDVYLPDVATTLYNLGNLQYNTNDYAAALASYNE
ALTLYRELATKNPDVYLPDVAMTLSNLGNLQYTTNDYAAALASYTEALTI
RRELAVKNPDVYLPDVATTLNNLGALQSETNDYAAALTSFTEALTLYREF
ATKNPDVYLPYVAGSLINMAVWYYKAPQANQQQSLAFVTEALHIALSLVE
KIPNVQLNIDSAYNLLRAWGIEPEEFVKQVGTSGASGGE
>Cag_1866 putative DNA-binding protein
MQMKNNQSSFILFTTEDAKIAVDVRFEEETVWLTQEQMAVLFGKARTTIT
EHIQNVFKEGELNEEVVCRNFRHTTQHGAIEGKTQETWVKHYNLDVIISV
GYRVKSLRGTQFRQWATKRLNEYIRKGFTMDDERLKNIGGGGYWKELLQR
IRDIRASEKVFYRQVLDIYATSIDYDPKDEVSLAFFKKVQNKIHYAVHGQ
TAAELIFNRADAEKDFMGLMTFSGSRPYLKDVVVVKNYLNEKELRALGQI
VSGYLDFAERQAEREQAMTMKDWAEHLDRILTMSGEKLLQEAGTISHEKA
VEKATTEYKKYQQKTLSEAEYNYFESLKILESKIH
>Cag_0341 TPR repeat
MKVRNLVLFTLMAVSAEGYAAGTKAKSIVKTPSAATQAAETMQALPLDRQ
TALNLAQSYLANGSSRQAELILSKLVLLYPDDEEILRETISLYEKSNRAE
QTLPLYQHLLQLRPNDLELTLASARAYSWTGRKAESIALYEKVLKAGNAS
EKVVTEYADFLYADKQYQKAIDLYKSVGQKGKLSKQHMLNTVNGFIALKK
FDEAAKICNANVPLYPQDTDFLRLAADINFNAKQFEEAAGHYRQLLLKNP
DDPGAYSKLADIAMAKNDFTEVARLSHKILALIPDHKTAMLSLARVSSWQ
GDFTTSLTYYDKLIASPNPEPFYYREKARVLGWMGDFKQALSVYKSAVQK
WPDDKAISAEAEAKKNYYNHTYRPAVKAYNAWLLAEPQQPEALFDLAQLY
AQYGKWNNGLNTYNSLLSQIPAHRQAALAKQKIDFAASRMFVRSGVEYFS
AKTKFIDATNHRQADTKSTSIYSSLTYPINERVSAFVNLDSKSYDFRIAK
PNTPKNPVTYGLMAGAEYRNMPNIALSAGLGMRMNPGDVDNGLTGFINAN
SQPVDNLHVGVTLRNDDIVTNTSSFNNQLEATRLQGRVAYNGYRRWQAGM
DIAFDSYADNQSYDNSSLTVGADVVAHLLYEPQRLSVSYRLQEYGFDKNH
ANHPQYYNYFWTPKSYTTHTFGLEWQHYLNRERFHGSNNTYYDIAFRVGL
EQEGDISRQIHASINHDWNSRLATSLEGQYTWGTSAEIYQDSMVKAEFRW
FL
>Cag_0737 nucleotidyltransferases
MIDSIEIVKKQIVDALMPLNPEIIILFGSYAYGVPNKNSDLDICIVEKEY
SNKWKEKEKIRKLLNKIDMPMDILNPKLDEFEFYKNEINSVYYDADKKGI
WLWKKNS
>Cag_0591 proline iminopeptidase, putative
MSYFASTRCRLYYEDSAEGDPSALSKPTIFFVNGWAISSRYWKPLVSILS
DRYRCIIYDQSGTGQTLIKGYNPTFTIQGFTDEASELLEHLELHHSRNVH
IVGHSMGGMVATDLCMRYPDALVSSTIIACGIFEETPFTSVGLMMLGGLI
DVSMNLRSIFLMEPFRSMFINRAVAKAISKEYQDVIIDDFTKSDHAATNA
VGKFSIDRNVLRTYTRHVLAIQAPLLCSVGMADQTIPPEGTLTLYEKRKA
KSELQTSLARFEDLGHLPMLEATELFAQVLDKHFQQAQQLL
>Cag_1545 NUDIX/MutT family protein
MASRFRGVFKQSGVIPLFDDKVVLITARKSDRWIIPKGYIELGMSAADSA
AKEALEEAGLVGKVGEHPIGKYRYNKSGRHFVVLLYPFFVETMLDVWDEV
HERERCVVSPDVAATMVAHSDVGRLIRSYCASLDDDEAVLVPPHVASAIT
G
>Cag_0467 KpsF/GutQ
MNTTPQTETATLTGIGRQILEQEAQAIAHIAEHLDHHFAEAIQVMVACKG
KVIVSGMGKSGIIAQKIAATMASTGTTALFLHPADAAHGDLGVVAAEDVV
LCLSKSGSTEELNFIIPPLRQLGAKIIVMTGNPRSFLAQNADITLNTGVA
KEACPYDLAPTTSTTAMLAMGDALAITLMQQKKFTQHDFALTHPKGSLGR
RLTVKVSDIMATENAVPMVRTNAAVTELILEMTSKRYGVSAVVNENGELA
GIFTDGDLRRLVQSGRKFLALQAGEVMTARPKTVPPDMLARECLDILEEY
RITQLLVCDNHQRPIGVVHIHDLLTLGL
>Cag_1491 TPR repeat
MLKDALGSYRGSLAELNKMVEVQPHNAELWFARANERSGCGDYAGAISDY
TTALLLGLRFREAVNAYGNRGMARLAMGDMRGAMEDFSTIIARQPKNRSL
LRTAYLKRAHLREKQGDEVGAQNDRKAADCINGK
>Cag_1921 Twin-arginine translocation pathway signal
MFFQCVMTNHQLSRRDFAKLLLSSTAGALLGVGVPSSRTYAATNRVVIIG
GGFGGATAAKYLRKLDPSVAITLVEPKCQFYTCPISNWVIAGLKPMHAIA
QNYNALRVRYGVNVVHATAVAIDALKNSVTLHSGKKLFYDRLIVSPGIDF
RWNAIPGYSQKVAESVMPHGFQAGEQTLLLRKQLLAMPNGGTVIMCPPNN
PHRCPAAPYERASLIAHYLKQHKPKSKVLILDCKEKFSKQELFLQGWERL
YSGMIEWRAATAGGKVEAVNSAAMTVTTEFGDEKGDLINIMPPQQAGRIA
FEAGLTDAAGWCPVHPITFESTLHPGIHIIGDACHAGDMPKSAFASSSQG
KVASSAIAALLQGRVPVAPSLVSTCYSLLKPDYAISVANVFRLTIDGIVD
VKGSGGVTPLDASVEHLQHEADFAWGWYENITRDTWG
>Cag_1939 lysophospholipase L2, putative
MPEPKPHTLAVLVLHGFSGTLESVKALREPLQALGLPVAMPLLAGHGEHS
PEALRGVTWETWLADAEEALLQLSNQALQVIVIGHSMGALLAVQLAYRYP
TLVDSLVLAAPALRIASIFAPGRLFHSIAPIVSRLVKNWRLQSEEDRRNA
VYGDLHYEWVPTKTVLSFFELVKKTERLLPHITHPALILHCRCDNTVLPE
SAEIAHSSLGSMPAAKSLVWFDKVGHQLFCATERDCVVLEIVGFVKSRFR
>Cag_0150 hypothetical protein
MEREQASTLRTIFSMPVIVAALGYFVDIYDLVLFSIVRVPSLKSLGLSGQ
ELIDYGVYLLNMQMIGMLLGGFLWGWLGDKKGRLKIMFASILMYSLANIA
NGFVTTLPMYAALRFIAGVGLAGELGAGITLVAEILPTKIRGYGTMLVAS
IGVSGAILANYVATTFEWHNAFFIGGALGLLLLAARFKVSESGMFQAMAD
HRGTNRGNMAALFTDRSRFLRYLNSIMIGVPIWFVVGVLITFSPEFGEKL
SISAPVSAGNAVMYCYLGLVFGDLSSGLLSQLLKSRKKVVLLFMVLTVAG
VALYFTQHGQTPQFFYMVCAFLGFASGYWAIFVTVAAEQFGTNLRATVAT
TVPNLVRGMVVPITMLFQYFRGMFGMELGALVVGVICIVGGFLSLMALSE
TFHKDLDFYEEFL
>Cag_1138 putative metal-dependent hydrolase
MLQIGNYRIAALLVQEFALDGGAMFGVVPKVFWQQQAPADALNRVTLAAR
LLLISGAGRNMLVDVGLGDAWNDKQRSIYAISPFRLREELQRFQLTADDI
TDIILTHLHFDHIAGAFAVENGGLVSLFPAATFHVQERNLQVACQPHIKE
KGSYLSPYIDALMQQCNVSLKQGECELCEGVSLLVSNGHTQAQQLVKISD
GKQTLLHGGDLLPSAAHLPLAWITSYDVEPLQAINEKTALLETAMDEEWL
LFFGHDPRYAAARIRRGEKGAEVAEYFEEL
>Cag_0378 helicase domain protein
MSNAKIFYHDIGDYYSREEKLALIKKYHSLAHPNMQWQQLQPNEHGDWIS
QRNDLFETFIPLGDKENKKADTFFVPFYSRGLASARDSWCYNSSKTTLEN
NIRTLIEFYNQQRIAYFNTIDKDSKITVENFIDYDSSKITWNRGLKNDLE
KNKAIDFNRDYIITGLYRPFNKQKIYFARELNDMVYQIPKIFPASNSKNY
VICVSGVGASKDFSVLITNCIPDIQLQFNGQCFPLYYYEKQEKSNPTLFD
AAKEPDYIRRDGVSNFILEQAQKRYGNRVTKEDIFYYVYGILHSPDYRTR
FASDLKKMLPRLPLVENVKDFWHFSKAGRELAELHINYEAVPPAKGVILL
YNNIPTEEIEKGLQSSKMQEINYMVTKMRFPKKDQKDTIHYNNQITITNI
PLKAYDYIVNGKSAIEWIMERYQITTHKESGITNNPNDWATEVGNPRYIL
DLLLSIINVSLQTVEIVNNLPKLEF
>Cag_1627 3-oxoadipate enol-lactonase, putative
MLRCITNVTAETAGRYPITVLLLHAFPLSAAMWQPQIEALEKAGYGVIAP
HAYGIEGSPEIAEWNFTDYAVELAQLLESLHIASVTVVGLSMGGYQAFEF
YRLYSNKVKSLVLCDTRAEADAPAARATREEFMKAVASTGSAEAIRRMVP
NYFSPAAYGANSTLVAQVEAIINKQSPEVINAAMRAIMLRADATPLLGSI
SCPTLILNGEEDSMTTKETAATIQAGINGSTLQLIAGAGHIANLEQPELF
NQALLEHLSLLQ
>Cag_1400 chloride channel, putative
MNILSHKKGRLARRIIAFFYILFRRSRYFKGSSQNFIKLTLEYILVQLNL
NQDIPFLFVAVIVGLVTGYVAVLFHEAIKAISNFSFNDLRLLGDISFIEQ
YWVFFLPFIPAIGGLFVGLYNTFIIKKSSRHALASVIKSVAHNDGIIDRK
LWFHKTITSVVCIGTGGGGGREAPIVQVGSAIGSTIAQWLRFSPEKTRTL
LGCGAAAGLAAVFNAPIGAVMFAIEVLLGDFSVKTFSPIVIAAVIGTVLS
RSFLGNRPTFDVPDYTLVSNIELLFYCVLGVLAGLSAVMFIKTYFAIEEW
FDKLQIRRNLPVWIMPAIGGFLSGIICIWLPGLYGFSYNVISNAVYGNET
WYNLIGIYLLKPVVAGLSIGSGGAGGMFAPAMKMGAMLGGMFGIVVHQFF
PLITATSGAYALVGMGALTAGVMRAPLTVILILFEITGQYEIVLPIMFAA
VTSAVVARLAYRHSMETYVLEKQGIKVGFGIALSVAEQVVVSDILDKKRT
QFVSTTPMKKILEVFYSTPETNFLIVDKQGVFIGNISLDDIRILLKNGCN
DDLIADDIVNKNVPVLYTNSRLDEALKLFELSDYDILPVLDTKNNILQGV
LRQEKAFASYRKQLNLYGSDYSDKSVHQGVK
>Cag_0101 conserved hypothetical protein
MKTAFVKIWGELVGAVAWDDATGYATFEYDAKFKSKGWELAPLQIPVNAT
KSNFSFPALRKKADPALDTFKGLPGLLADMLPDRYGNELINLWLAQKGRP
LDSMNPVETLCFIGTRGMGALEFEPTTLKESKKAFSLEIDSLVEITQKML
TKKEAFVTNLQENEEKAILEILRIGTSAGGARPKAVIAYNERTGEVRSGQ
TNAPQGFEHWLLKLDGVSEVQLGASHGYGRVEMAYYNMAVACGIQIMPSR
LLEENGRAHFMTKRFDREGGAAKHHIQTFCAMKHFDYNLVTNFSYEQLFQ
TMRELKLSYPDAEQLFRRMVFNVVARNCDDHTKNFAFRLKKDGKWELAPA
YDVCHAYQPKHQWVSQHALSINGKRTNITKDDLLTIGKSIKNKKAAETIE
EISNTISQWKTFADEVKVLPKLRDEIAATLIRL
>Cag_0034 conserved hypothetical protein
MLLSLPNWIIHISSSLEWGIGAALLFHYGQLTERRDIRTFALAMLPHWIG
SFCVLAYHISGDTIPLLLDMSELINLVGSTALLWATYKLFQSTGGWKAAH
GIVPSIAPIGYLSAIVIAGKPQSWLGEDIFDTILQLSSIVYLAFLLLLIV
IYRRDKTIFSGLTVAGFWFVLVFISITIFCMYLATQMRGYPTLSHDDLLH
GMAESLLSISNLMIVLGAHRQIKAFKGQRG
>Cag_0976 HAD-superfamily hydrolase subfamily IA
MNIKALIFDLDGTLLNTLEDIANTLNATLARHHFPTHSLDECRFLVGAGL
RELIRKALPSEAAADDNMVDKLLSEFIEMYRTSWNQLTRPYEGIIEMLAA
IAERNLPMAILSNKADHFTQQCAEELLPRPLFSVVLGHRDGMAHKPDPAG
ALFVAAELGVEPTSVVYVGDSSIDMLTATRAGMYAVGVCWGFRPESELRA
HGAQSLIHHPLELVTLIDTLREANA
>Cag_0873 TPR repeat
MSNELLDEKLRLLVKALRADTFHFVLIINNHPSVYNDVVEWLKQHITDRE
IRELRLTGKHYREVSDVLQAAKQDIVTIPDFDELFTKENDDVRVALNQRR
DFLAFQRMNLVCFLSPDTFRLLPKKIPDLWSLRSLELDIAYDIKEPLFTI
PTTPFISSLGGTTIAEKEAEIRRLTYQLSQIDPANIALRKELEAQLVTLQ
MEVPQRFEEATSLHDTSQKNIIAAEVTATQDETVTNISTSILRSIFAVLP
SEPITLIALQELLPNIDNLETALQNLVAENVLNYNSTTKSYKCSPVVQEV
TRKQQSDHLFADIEQLISRLIDRLAYEPNTGHVTEVSYETAALYVRYGET
ILRNCTDVEYQLAALADRIGNYHTATGNLDKALSFHAECLRLSKELYEAY
PNNVFFKNGLAISYEKLGNTHTSLGNLDKALTYYEQYYKLSKELYEAYPN
NVSFKFGLAVSYSKFGNTHTSLGNLDKALSYYDNETRLFEELYEAYPNNV
SFKNGLAISYSALGQFYRDHRNDSDIVKNYFQQAEKVWAELVSSSPQHAE
FKQNLSWVKNQLQSLHS
>Cag_0375 ATPase
MVLLTVEGLEKKYGLKHLFEDVSFGVDERDKIGIIGANGSGKSTLLKILA
GVEQPDKGKLMVANHKRIAYLPQDSPYNPNDTVLQAILASSGKVMDLIYE
YELVCKKLEEHQGDSVALMERMSTLAHELDVCGAWELESNAKTVLGRLGL
YDLTARMGTLSGGQRKRVALAHALVVPSDGLILDEPTNHLDADSVEWLEQ
YIRRYQGAVLLITHDRYFLDRVATRMLELDGRTATTFTGGYSSYLQQKAE
LEEQAVRDERKRQALVRQELDWMRSGCKARTTKQKARMQRAESLVYSPKQ
EKSKELEIGFGAGRLGDKIIEFHKVSKSFGDKLLLKNFEYHLQKGDRIGI
IGANGSGKTTLFEMIAARTTPDSGHIEIGKTVRLGYYDQESRELDDSKRV
IESIQEVAEQITLKDGVVLSASKMLERFLFPPSTQYSLVKTLSGGERRRL
YLLRQLIASPNVLLLDEPTNDLDIPTLRVLEDYLDNYQGCLLVVSHDRYF
LDRTVEYIFAFEENGQVRRYPGNYTVYLEMKASIASAGEPKVAKKTTEPP
KPVAVQSTKPAALSSKEKRELEKLEVAIAAAESRQAEIAVQLSAAGNDFA
MQQQLGTELQALQQQLELDMERWSELAEKAG
>Cag_1875 TPR repeat
MSIQFVKHNPAFLDAGHFLEQVVARRADVAHLLGHLGSCEPIGTVRHLFI
TGQRGSGKTFVVRRVALAVEQHNALRSRYYPLFFSEESYSVSSSAEFWLE
ALFHLARQTANEQFAQTYQALREEVDEERIRQIVLPLLLDFADNQGKTLL
LIIENFSMLLADMASSREGEVLAQTLLQEPRFQLLATGTFTFDSLEVPFG
KYFSSITHHALEPLSNADCNALWQLYSGTPLADGQIRAMNILAGGNARLL
VTLARVANGRTFSQLPEILALALDEHTEYCKSYLDVMAPVERKVYLSIAE
LWAMVSAREVSLAARIDINKTSAYLNRLINRGAISVERQAKRNKLYGVTE
RIYSIYYLMRRHGWQSGRVRALLDCMLAFYDPASFPDRLSDSERERCTSV
AEALTMLPEREHVEQCHQNFLDAKRHSRFVPSLAYFVRFVQKSDVSHADE
SSSPMLGESFRQAFELLETESYTEALPIFDAIIMVSRHSESEQAIGQRYG
AMIGRGVALGNLERYEEGFRLLDEVAATCQERSVRRRLKWGLLALLGKAS
VLERAGRIDEAVTLYDELVSRYRRQQELECSTLVAAALLHKSLLVSKSKG
GEEEIAMCDTLLELYSERVELPLVELVCAAWRNKAIAFEALNRNDDALLA
YGKLLALCRQRSEPHMMQHTAHALQNMGVVYGKMHRYSDAEHCFMEVQSL
APQQARAHLMLLKLLVKMEEQQHAVLSELRNYLAATSLALRALPQTIELF
ITCAVAGYAAEALELLVASPLAVSLEPVQAALQHATGDEVRSAPTIVEVA
NDIVAAIEARRNA
>Cag_1496 Nuclear protein SET
MPPLFSAECLTPDSFNCLLIIAAVVLLLGIVLGFVVALKAPWIHRKTVIA
KASTVSGRGVFALVNFREGDIIERCPALEVRDRDVDGELLNYVFYGSTEQ
HRLVAMGNGMLFNHDNNPNVAYYREDTPLGAELVLYALRNIRKGEELFYS
YGEAWWATRQNG
>Cag_0707 conserved hypothetical protein
MKIEYHPAIEDELRQIIKYYKESSAGLGTEFLNEFERQVLKIADNPRRWV
AVKGTIRRSLMRRFPYVIYFRLVNDETLRVTVVKHQRRHPKKGVNRR
>Cag_1557 conserved hypothetical protein
MQYSEAKSGRVFVLRLEDGDVVHECLEQFAHKHGIERASFIAVGGADKGS
VLVVGPEDGRTSPVVAMTHELYDVHEICGTGTIFPDDSGRPMVHAHFACG
REENTVTGCIRSGVKVWHVMEIVLTELLDNHASRKTDAATGFKLLAME
>Cag_1367 hypothetical protein
MKLTLYIIYTIFFIAMGCGIYIGFQTADGLVDNNYYHNSTNYFQTKAREE
KLGIVINKPDTLTIGTNTFTVAVTSHGKPFEQGNISLLLGNVSTNNNDTT
LTMQETAPGIYQTTISIPYKGKWFTRLELYHQQQLITTKQWFFSVQ
>Cag_0681 conserved hypothetical protein
MSGLKYILDTNIIIGLLKANPTAIALAESVRLDLGECAISQITRMELLGC
KGISEAEESSIHQFLACCVVLMIDETVECEAIRFRKHSSLKLPDAIIAAT
AQVHNLNLLSLDERLVSQYERAVKDNR
>Cag_1051 hypothetical protein
MRKIILSKRASKRLEKLLEYLEFEWSFKVKNDFIKQLDKSLKRIQKYPES
CEQTRFVKGLHMLVVTKQTSLFYQFDSETITIVTLFDNRMNPDTLKKETA
>Cag_1107 ComEC/Rec2-related protein
MTEPSKPHSAVKPKRAIGLSLAPYPAVRLLFFVIIGIVVGVVAPFSLTEW
LWSVALSFALLLLTWLYERIRYHQAAVPHFGMAIMYCFVVVSVFATLSAY
RLHYAPRNGLTQYAGRTVILYGSIESRPERSKGGASWVMEVQELFEHGKT
VTLRDRTKVFMRMSADAHLAVQKGDMVRVKGKLDLLPEAANAGEFNPRHY
GAMQQISVQLYAAGPWQVLYEGEKRLHPFEQYMVQPTYRYIMQALAALLP
DGEERKLAAGVLTGERETMSEEVFEAFKRTGTAHILAVSGMNVGLLALII
QVFLQRLKITPFGRWTAFLLFVFLLILYSNVTGNSASVTRAAFMALVLIA
GETVGQKTYPLNSLAVADLIILLINPLDLLNPGFLMTNGAVLALFLVYPL
LHFPRPKNRTLLLSIVWFLLDSIIITLAASIGVSPVIAYYFGTFSLISFV
ANIPVVFFSTLLMYALVPMLVVYGLSQALASVFAAGAFWLARMTLQSALW
FSNFSFASIPLKLDAVEVWLYYIVLAAVLLLATRKAWSRVAITFLLGVNL
FVWYSLLFRPNPIAPTLLTVNLGRNLATIVSNGSESVLIDVGKKPKDYQR
ISAQFERFGIVEPTAVVQFYSPDSLILATPTRHHFLRSDSLLRLSSMVIT
RPDEKMVKLWSRNQSYFLASGTSRLKAGEPYCGDVACIWIYRFGEKQRIE
LERWLTATKPKEALLVPSSFLSRVQLVALHRFAAAYPHVEVRSKTKQVVV
NGGER
>Cag_1561 zinc protease, putative
MNQIATTSFPPSYTLQVNARAKYPRLKMVPHKGLVVVIPVGFSKKHIPDL
LKQHEEWIRKIEHHFEAHRQTAEEAFEVLPTTITFSTFNEAWQLNYHQAA
RNSVRLTMQGEGQLLLSGNIGETALCRQVLTKWLNRRADVLLSPRLTQLA
ASVGMCFSTTTMRCQQSRWGSCSSKGAITLNSKLLFLPEELVRHVMLHEL
CHTLHMNHSAAFWAEVARFDPQWKHHKREMKDAWKFVPRWLTAL
>Cag_0936 MFS transporter family protein
MQRAFRAFTLRRKRLQAQRNYQILFWLLFDFANTAFSVMMVTFVFPLYFK
NVICSAQPYGDALWGLSISVSMLLVALVSPFLGAAADVLGRRKHFLLMFT
LAAVVGTALLSLTGAGMATVAVGLFIMANMGFEGGIVFYDAYLKELASER
SVGRLSGYGFAMGYLGALSILLLVSPLLADGINVANAAKVQQSFLVAATF
FALFAAPLFFVIRDRRSLSTSPHPSTKKILSTATSVKNVVVATHHVERKI
FGQGAWRNLLQTVQHIRRYPDLARFLLACFFYNDAILTVIAFASLYAEQT
LGFSSRELMHFFMVVQIAAMVGALFIGFIADTIGAKRALVVTLILWIGVI
AAALFAESKELFFYTGMLAGISMGSSQAASRSMMTRLTPQEHVTEFFGFY
DGTFGKASAIVGPFLFGVISSQAGSQKVALSSLLIFFAIGLVLLTKVKSS
STNVPSLQ
>Cag_1936 conserved hypothetical protein
MQILDLSHTIEPTMPLYLGTPSPSFQPIASIAHDGFAEQLLTFSSHTGTH
VDAPSHLFKQGATVEAMDVSRFVGRAVVLDVRSLLGEEIGLELLLPHEAL
VRECQFVLLYTGWSCFWGKEAYFGHYPCLSLEAAQWLTSMELHGIGVDAL
SVDSADSHELPIHRILLERGMVIVENLRGLEPLLHQRFLFSALPLKLAGG
EASPVRAIAKVDGVF
>Cag_2004 sulfide dehydrogenase, flavoprotein subunit
MSNGLSRRDFNKLLLSGVAGSTIGLFGNSGTLFGATSKRVVVIGGGFGGA
SAAKYLRKLDPTIQVTLVEPKSVYHTCPFSNWVLSGLKNMEDIAHFYDVL
RNRYKVNVIADTAVSIDADKSSVTLQTGKTLYFDRLIVAPGIDFKYDSVQ
GYSENVANSVMPHAWQAGPQTILLHKQLQAMPNGGKVFISAPANPFRCPP
GPYERASLIARYLKEQKPLSKVIIFDAKESFSKQGLFKQAWERLYPGMIE
WRASTMGGKVVSVDAATMTVTTEFGAEKGDVINIIPAQKAGKIAVDAGLT
DASGWCPINPISFESTLHPGIHVIGDAAIAGAMPKSGFAASSQGKVAAAA
IVRLFQGKVPAPPSLVNTCYSLIDKNYAISVAGVYKLAMTGIVEIKGSGG
LTPMNADADQLEQEAMFAQGWYDNISQDVWG
>Cag_1575 conserved hypothetical protein
MDTLTKLRILSGAARYDASCASSGSNRSGASCGIGNTSQSGICHSWSDDG
RCISLLKILLSNDCCYNCAYCVNRATNPVERASFTAREVVDLTLDFYRRN
YIEGLFLSSAVMQSPDATMERMVAVAETLRSEERFGGYIHLKIIPGASSE
LVRKAGLYADRISVNIELPSQVSLERLAPQKHRAAILEPMALIGREINTS
LVERQHSHRAPRFAPAGQSTQMIIGATPESDFQILRLSQGLYKKMNLKRV
YYSAYVPVSEDNRLPVLAAPPLLREHRLYQADWLLRFYGFSAEEILSEEL
PHLDEQFDPKTAWALRHPEFFPVDINRADYATLLRVPGIGVTSAKRIVAA
RRFSLITFEGLKKIGVVIKRARYFITMQGRRVECTDFSPTLIRRQLLLSE
STEKPASRQLVLPGLEPILA
>Cag_0922 conserved hypothetical protein
MFFDPLYLILALPPMLLGLWAQFRVKSAFKKYSGVPTQSGINGAEAARRI
LQRGGLTNVSIEPSHGMLSDHYDPTQKALRLSDEVYGYASIAAVGVAAHE
AGHALQDKTGYAPLQLRSIMVPAVTVGGNVGPILFMIGMFMAGSLGTTLA
WAGVLLFAATSLFALVTLPVEFDASRRAKELLVSQGIVSSAEMKGVNAVL
DAAALTYVAAATQSIMQLLYFVMALNRREE
>Cag_0628 conserved hypothetical protein
MESTSCPICNTNSFTPWLHVVDRFEPSTLWNIVQAVDSGLLMLHPRPTEA
EMAPYYAHAGYEPFLNSNKKSSLAERTLLFARSLLLHYRAMLIAKAREHP
LCKAHILEVGCSNGELLHCLQQKHHIPTAQLLGVEPDAASAEYARKRFGL
QVVDGVEKLPTTLFDTIILWHTLEHIHRVNETLAMLRERLTVNGIMVIAL
PNPLSYSARHYREAWIAWDAPRHLYHFTPTTLAALLKKHKLHIVKQQPYL
PDTLFNTLYSEQLQRQHNNAPSTPLPFANALAQVTTAIKISTKELREPNN
TSGIMYVVTHDA
>Cag_1867 glycosyl transferase
MKGDIEGTLAIVVLNWNGAADTIACLHSIIPTLDASVHLLVVDNGSTDCS
VERIRAAFPHIEVLELPHNLGFAAGNNAGFRRVQALGAEYLLFLNNDTVV
APYFYRPLLNLLQQHPDVGIAVPKIFYHHQPQRLWYAGGEVNLATALIRH
VGLRQFDAPQFNVATSTDYATGCSLAIRVADFEQLGGFDERFTMYAEDVD
LSLRVRAQGKRIAYEPSSMVWHKVSASLGNNSLQKLWMKSKAMVRLCIKH
RAWSGLLLYFVLLPFRLVRSVGGSLLFKIGKMR
>Cag_0249 acetyltransferase, CysE/LacA/LpxA/NodL family
MLMGKILPYKGIVPQLHESVFLTDGAFVIGDVHIGANSSVWFNAVVRGDV
CPIRIGEKTNVQDNVTLHVTHDTGPLTIGNCVTIGHGAVLHACTVQDHVL
IGMGAVLLDDCVVEPWSVVAAGSLVKQGFRVPSGMLVAGVPAKVMRPITE
AERQTITESPENYVRYVQNYRAEDAQG
>Cag_0509 conserved hypothetical protein
MVIKRAQKALLTERLENEPRNFIQVLYGPRQVGKTTIAQQFMQTTSLPVH
FVSADYVAVEQSHWISQQWETARMKLRQSEQQQAVLIIDEIQKINNWSEV
VKKEWDSDTANQLSLKVVLLGSSRLLLQQGLTESLAGRFETLYVGHWSYS
EMREAFNVTPEEFVWFGGYPGAASLIYDEERWQRYITDSLIETSISKDIL
MLTRVDKPALMKRLFELGCSYSGQILSYTKILGQLQDAGNTTTLAHYLRL
LDSAGLLGALEKYSIETVRRRASIPKFQVHNSALLSAQQPLAFRDVVSNP
ALWGRWVESAIGSHLLNYTRTHNLELYYWREGNHEVDFVLVHKGRAIGLE
IKSEHSQQTAGMGAFAKQCKPYKVLLVGDSGIAWQEFLTLNPLELF
>Cag_0594 conserved hypothetical protein
MNEFGITDSHLHIIRSIFKQYQAINKVLIYGSRAKGNYSERSDVDLVICD
TTFDRKTIGKILLAINNSDFPYTVDLQIMENIKNKNLQEHIKRVGKEFYT
KM
>Cag_0942 carbon-nitrogen hydrolase family protein
MSKESVSIAVVQSECKGDAVANRAEATAKIREAAALGAQIICLQELFVTR
YFCQTEAYEPFGEAEAIPDGATTRLMQELAAELGVVIIASLFERRARGLH
HNTAVVIDADGSYLGMYRKMHIPDDPGFYEKFYFTPSDLGYKVFKTRYAT
IGVLICWDQWYPEAARLTALKGAEILFYPTAIGWATDEDSAEVRHAQQNA
WITMQRSHAIANGVFVAAANRVGTEENLEFWGNSFISDPFGQMVAEAPHQ
HETILLAQCDLSRINFYRSHWPFLRDRRIETYGGLQQRFLDNNQ
>Cag_0511 metal dependent phosphohydrolase
MIAEHLLFHSDGGFIRIPVWGHIPLSKPLKSILSHPLFLRLKGIRQLSFS
QQVYPGATHTRFEHSVGVYHLMKLILQRMVTSSLAQKLQTEHFRFDDASC
RLLLASALLHDIGHFPHAHIIEEQIPRVGNEVVFSHHEELCRYFLEEEHP
NHPSLATLLMEEWRVDPNDVVALISGKHRLSKLISGTLDPDKMDYLMRDA
HHCNIPYGSIDIERLIESFVPDPERQRFAITEKGIAPLESLLFAKYMMMR
NVYWHHTSRALSAMLRRLLQDIAEAELLPAATLRELFYRNADDRVLYELK
LLLPEATHPLVALLEDVLMRRVYKRAITVQPYLQSSGKEDERWFLYSNNS
ALRRSMEVEICELLNKRYQLNLHGYEVLIDSPSRKDIFDYADLQELRVYP
TRSEHIHYAMHCASEYVRFDELNESVFQSNFILSFERYTKKFRLLCRPDL
VAHIVELRHDIMSLLAHDYPLFHSTVSSSATEHS