TitleGenColors Logo

Gene list

Applied filters:

COG category: Function unknown
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS

Number of genes found: 151

Free access
Sort by:

 



# Chlorobium chlorochromatii CaD3, CaD3

>Cag_1970 conserved hypothetical protein
MPTYQYRCSTCGHELEVMQKMSDAALTLCPSCQQEALQRVISADGGFMLK
GSGFYKTDYNKSAASPCSTGSCSTGSCPLAS
>Cag_1190 conserved hypothetical protein
MAMPNLGDMMKQIQQAGEKMQEVQKQLERLVAHGEAGGGMVKATVSGKQK
LLSLAIDPEIMDDYEMVQDLVVAAVNSALDASLKLAQDEIGKVTGGMMNP
TELLKNLNLGQ
>Cag_0312 hypothetical protein
MNTHTILIDFPSDILLALNETEAELKLRIKTTLAMRLYQLQKLTIGKAAQ
LSGLSRIEFETLLSENEIPISNLTIDDVIDDYKKLK
>Cag_1864 conserved hypothetical protein
MNTAIVNPLFGEIRELINQSRRQVAVEVNSAITMLYWQIGKRINEEILQN
KRAVYGKEVIVTLSRELTTEYGNGWSTKHLRHCLRLAEMFPDFEIVSTLW
RQFSWSHIKELMYIEEPLKRAFYLEICKLEKWSVRTLRERINAMLYERTA
ISKKPELTIQNDLELLKNEQRLNPDLVFRDPYFLDFLGLSDTYSEKELES
AILVELQKFIIELGSDFAFMARQKRITIDHDDYYLDLLFFHRRLKCLVAI
ELKLGKFEASFKGQMELYLRWLEKNEMVEGENPPIGLILCAYKNQEHIEL
LQLENSNIRVAEYLTKLPDLKLLEQKLHHAIEIARHKSEFDN
>Cag_1256 TPR repeat
MNYSRHTEEVVHSGYQSVEASQAAEKTLELLHQALQLHQDGRLEEANALY
LQTLELQPKNLDAVELLAAAALQKSHFDDSDAYRLLTALSQQQSSFLDIV
ALFESSATGESAPEQSLNELGAALHQLKFYKEALVRFERALQHNPVYLEA
HFNRGNTLFVLERYDEALQSYSKAIELKKDYAEAYYNAGTLLFLLKRYEE
ALAHFDAALAIRPDYAEARTNRDYVQKELDALDAKTHHVHTAKLKAIAGT
HPTSRLIDVLTFKNIFSLPGLVLLLGTFFCLSPWASPPVALLLGIVCAQL
FDHPFMHLGHKVSSMLLKASVVGLGFGVNFNNAVKAGSDGFVATVVSIAF
TLLLGYVFGKIFSVEKKTSYLISTGTAICGGSAIAAIAPVIDAEENEVSV
AVGTIFILNAVALLIFPEIGQWLHMTQHQFGLWSAIAIHDTSSVVGAASK
YGHEALLVATTVKLARSLWILPLVIASSFLFKSKIKKLKAPWFIAMFIGA
SLFHTYFPEFHPYTEFMVPLAKSALTVTLFMIGTGLSWTILKGVGYRPLV
QGLLTWIAVSIVTLIMVMSMASF
>Cag_2003 fic family protein
MKFEEFTAGYWQQRYQYKSFEPSLINHEWTWDEPTINTLLEQANCALGEL
NAFSLIVPNIDLFIQMHVVKEAQTSSKIEGTQTGIDEALLSEEQISPEKR
DDWREVRNYIDAVNSAITTLHDLPLSNRLLKQTHKILLSGVRGEHKLPGE
FRVSQNWIGGSNLTDASFIPPHPESVAELMSDLEKFWHNQDIAVPHLIRI
ALSHYQFETIHPFLDGNGRIGRLLIPLYLVSHGVLAKPSLYLSDFFERHR
SSYYDALMHVRTSNNLIHWLKFFLNGVAQTATKGRDIFQQILTLREEVEQ
AVLSLGKRATLAREALHLLYRQPIVEATDFSTMLKVSAPTANALIQALID
KAILVEITGQQRGRIYSFERYVKLFME
>Cag_0604 conserved hypothetical protein
MLAKIKLTPKQLLKPLLQIVVTAIALYVVFQKIDIAQLTTLIRTAHPLYL
LCALLFFNLSKVINAFRLNRMFKAIGIELSTTYSLKLYYLGMFYNLFLPG
GVGGDGYKVYILQKNYGIRMLNVFHAVLWDRIGGIFALTVLALALLLPSN
FATLYPMLIPWAWGGLLLLYPIAWLVNKLFYKQFLHLFAVTSFDSMLVQV
TQTIAAWFILEAMNLPAHHIDYLAIFLLSSVATILPITVGGAGAREITFL
YLLNVVELNTNPGVALSLIFFVISAISSLAGILSRIKHEKADVVQ
>Cag_0609 conserved hypothetical protein
MVVGIAGGYEIIIYWSTEDNAFVVEVPELPGCMADGKTYEEALNNVQVII
EEWVETARVLGRAVPEPKGRLMYA
>Cag_1070 conserved hypothetical protein
MSKGREFDTREITVFQKNEITEHFIYKNLSTRVTGVKNRRILAQIADDEM
RHYNVWKNYTRKDVAPDRVKIWFYTMISMLFGFTFGIKLMEKGEHSAQMV
YSRMPSSYREINGIISDEEEHEQALMGLLNEERLKYTGSVVLGLNDALVE
LMGVLAGLTFALQNATTIALTASITGIAAALSMAASGYLSTKSEDDGRNP
VIASFYTGMAYLLTVLALIAPYILIGDRFISLGVAFFIAICIIGFFNYYV
SIARDLPFKSRFFEMVGISFTVAALSFLAGYAINYYFPGLTM
>Cag_1756 conserved hypothetical protein
MPVRKLRDFLDRHKVHYFVVSHSPAYTAQEIAASAHVPGNELAKTVMVML
DGEFAMAVLPASRLLDLRLLQEVSGATHVALAKEDEFAELFPECEVGAMP
PFGNLYGMKVFVSEELEADDDIAFNAGGHRELLILSYKRYKELVQPIVAK
LS
>Cag_1570 virulence-associated protein D
MFAIAFDLVVADTSQNHPKGVAQAYSDIGSTLAAFGFTRVQGSLYTTNSE
DLANLFSAISALKALPWFSASVRDIRAFRVDQWSDFTNFIKL
>Cag_1044 conserved hypothetical protein
MKKIQRMVIKGVAALSLLGCTTLSAFALDKETARAKGLAGEVDNGLMAIP
PGASEEAKELIITINNGRRAEYAKIAATNNLPFDTVGTMMAEKIYERLPA
GTWVQQKGVWVQKKP
>Cag_1283 conserved hypothetical protein
MSNILVPLTTANGRQSIVSEHFGSAPYFAVVESESGKCSIIENGGCHHPQ
GECSHGDVFAQHQATVLLCKGIGGRAASRVEASGVAIYVVPQANSLDEAL
QLLQNGSLQQFAAGDACRGHNCH
>Cag_0210 conserved hypothetical protein
MAATFFISDLHLALQASNVEAQKLDKLQVLFERIGNEGGALYLLGDILDY
WLEFHHVIPKEFSRFFCMLSSLVQQGVQVFYVAGNHDFELGSYFMQSLGI
TTAYGINEVTIEGKRFFVAHGDGLGKGDTGYKIFARLIRTRFSRFVLQWL
HPDLTIGFMKWVSQLSREHKPTNSSFEVDRLLHFAHSLRAEQEFDYFVCG
HNHVQGVHELAEGSRYLNLGTWINGNYSYGVFKEGKFDFFDL
>Cag_2025 Protein of unknown function DUF205
MLTLLAILVVSYIVGSIPTSLIAGKMLKSIDIRDFGSGNAGGTNAFRVLG
WKTGLTVTLIDIIKGVVAAVSVVAFFRHHPIDVFPDINEVALRLLAGMSA
VIGHVFTVFAGFRGGKGVSTAAGMLIGIAPVSMLMVIGVFLLTVYISRHV
SVASILAAIAFPLIIAIRKYLFELGAGLDYYIKLFNAKFFFHDSLDYHLI
IFGLIVAIAIIYTHRANIKRLLAGTENRISFGKH
>Cag_1360 conserved hypothetical protein
MFEWDENKRLKNLEKHKLDFVAAVPLFDGRSTVTATSNSANETRYVTTGF
IDGKFYTVIWTWKGSIRRIISFRRARHAEERAYCTLYGI
>Cag_0694 conserved hypothetical protein
MSFKEKIDSDLKVAMKSGDKNRLNAIRSIRAALLEKEVSIRVGGTAVLSE
DQELEVLVSLAKKRRDAIEQFIAGNRPDLAETEQLELQVLEEYLPAPVSD
DEVQAVIQEIITKSGAISMKDMGKVMGEAMKALKGKADGGKVQNMVKTLL
SA
>Cag_0508 Rab family protein
MKDLDLIKLIEDKVNIKLALFENLSSINTGYKLNAQGDVSELSLSKCKIS
SLYLFIDILSQFKHLQILYLNDNNLSDVALLSKLTQLKALVLLNNPINSI
PFILTNLPLEFEWKNTDYGHIGFITLYNNPLTDPPPEIVAQGKEAVRSYL
LTRQKAEAEGQQMQVLHEVKVHLVGDGMAGKTSLLKQLQGLAFDKNESQT
HGINVVSLQAPQLKGCKINNELKESIFHFWDFGGQEIMHASHQFFMSSRS
IYILLLDSRTDSNKYYWLRHIEKYGAKSPIIVVMNKIDENPSYNIQQQSI
NQQFPAINNRFHRVSCNSGEGLDGVVLSLIAVLNDEGCLYGAEFPPAWLA
VKKALVTATAQERHISRNRYEELCSEQGINDAHERDTLLGYLDNLGIVFY
FKESHFTNNIYVLDPHWVTVGVYRIVNSAKTANGIFKLADLEYILNQEEI
SSSSYTPAQKKHFIYKGDEQRYLVDIMKQFELCYEEENGRFILPSNLSKE
PQTALPNLEDETPLRFIMQYDYLPAPIIPRLMIAMKDDVVEELRWRYGMV
LQSKNLEGVQASVIANQEKKEITIIVKGPDRYRRSYFTIIWNHLRTINQR
FENLQVKEFIPLPGYPNEFVEYEELLGHARNGRDNYFSGKLGQAFSVSTI
LNSIISTEERYRKSENMEVQITFSPIIKVEPPVHTQNVTVQLELHQHIEE
LQGRFRILKEDLLDEVEIELEDPKEQKRLFNELNKVENAITEMAEAANSE
QPKRLSSAVRERLDGFLESLHNPDSRLNKVLQAVEKGAERVKTLSDIYQK
CKPFFEQFPIS
>Cag_0433 Pyruvoyl-dependent arginine decarboxylase
MSFVPTRVFFTKGTGRHKEYLSSFELALRDAKIEKCNLVTVSSIFPPHCK
RISVEEGLRSLAPGQITFAVMARNSTNEFNRLIAASVGVAIPADETQYGY
LSEHHPNGESAEQAGEYAEDLAATMLATTLGIEFDPNKDWDEREGIYKMS
HKIINSYSITESAEGENGMWTTVISCAVLLP
>Cag_0721 conserved hypothetical protein
MSNEFDKQLFPNLVQIIEQGKKQLAVQVNSTIVLTYWQVGKTINEHILNN
ERAGYAKEIVATVATQLVEQFGKSFETKNLYRMMQFAELFHDFEIVVPLA
RQLSWSHFLALLPLKSNDARIFYAQKAIEANWGKRELRHQIDRKAYERQE
IVNTQLQNTSEFTDATGVFKDPYFLDFLGLKDGYLEKDLESAIIKELENF
ILELGKGFTFVERQKRMIIDGEDFYLDLLFYHRKLQRLVAIELKYGKFKA
SYKGQMELYLKWLDKYERHDNENSPIGLILCAGKSNEQVELLEMHKDGIM
VADYWTELPSKAQLENKLHQLLIEARNRIEQRKALEE
>Cag_0660 conserved hypothetical protein
MIILDTHIWLWWVNGDTEKLDQRRLEQITSSDIVAVSAISSFEVAWLVHH
GRIVLPIGARDWFDKALDGSGIHLIPITPEIACKAVELPDHHSDPQDRII
IATALVNNASLMSSDRKFSCYTELGKMLL
>Cag_1618 conserved hypothetical protein
MRVFKNKWFNHWASREGISDDVLFGAAKEIIIGNVEANLGGYLFKKRLPR
QGKGKSGGYRVIVGFKKQNNDRIIYLYGFSKSQAATISKKEEAALKMVSS
EFVAYSDEQISRLIQQQYIMEVLSNE
>Cag_1677 Ric1 protein
MIDIRRWILVFIMPPAAVLNKEAGTIMLAGLLTVAGWIPGVVFALFLMVQ
EMLQAKKQVTA
>Cag_0061 Protein of unknown function DUF152
MQLAGYVTPQIFSAFPELVAIQSTRVGGVSSGAFASLNVGHNTTDNPTCV
LENRERLCAALGIEYQNLVTADQVHGTNIYVAYEAGHYSGYDAFITNQPN
RYLCIFTADCYPVLLYDPHHKAVAAIHAGWKGSAGKIVLKTLHAMQTHFG
TVPGNCLAYIGAGISGSAYEVGMDVAHHFANDVLCSGCAIEGTNEHKALL
DLRKENYRQLLEAGIPSMHIEQSPYCTFRESDLFFSYRRDNGVTGRMVAL
IGLRH
>Cag_0281 conserved hypothetical protein
MSQFTNVAITKEANIYFDGNVTSRSVHFADGTKKTLGIMLPGDYEFNTGA
KELMEILSGDLEIQLVGEEWRKISAGESFEVPANSSFKLKIYKITDYCCS
FLG
>Cag_1312 conserved hypothetical protein
MNKKRKAMYYQERITINPNICHGKPCIRGLRYPVEHILELLAGGMNIEEI
LADYEDLEREDILAALAYAARLAHIKSTKEIAA
>Cag_1592 conserved hypothetical protein
MPCVTQRIMAPSECLLAILTRNPELGQVKTRLAKAIGKEAALHIYELLRH
RTAEVAQALASERMVFYSNYLPTSDCFSPTHFHYSLQAGADLGERMHHAL
ASGLTAGFRSVVLIGTDCYDITPEILQAAFVALERYEVVIGPATDGGFYL
IGMKQPMPHLFFQRKWSTSSVLKESCIRLQQAGTKYALLKELSDIDTLED
LQQSSLWLTPELDALRSLFEEKAAQPTRQQP
>Cag_0280 conserved hypothetical protein
MDYLIDTHALIWFINGDTQLPDKAQKIIKNIDNKCYISIASIWEIAIKIS
LDKLDLNGGFDEISKIIVRYDFELLPISIEHIAEIIGLEFHHRDPFDRMI
IAQGLVENISIITKDKIFTNYNVKIIWD
>Cag_0720 conserved hypothetical protein
MKKVIAFHIAESIDIKRFWKGFTDTENHMSSLDIFYTNDKDQYLYLLAYG
VVVFVGYDELKMSDMIDYLKPLCKNLLTEKMREEFIINTTTNKDAFEYNE
IHISNSNPNVIRIIMLNVAQSVALDYFSKLAEDLMIETTIYTQQLEKYGK
INISIKRLQMFIGKVLNIKNRIAENFYILDSPEETWEDEYLSKIDFGLRK
TFDVKIRFREIDYQLQIIRDNLDLFKDLIQHWKSNMLEWVIILLILVEVV
NLFVEKFSH
>Cag_1307 conserved hypothetical protein
MTTPIQILEYVDQHGNCAFRDWFNYLDSVSAARVAMYLERVAQGNFSNVA
PIGAGLSEIKINVGPGYRVYFAKKGSTILILLGGSNKKDQSQAIEKAKQL
WEEYKALVKQEKNKL
>Cag_0170 conserved hypothetical protein
MPLPTIVSRHLLPQVLKLLYRSLRISVTMPKHGLPQDGGIVAFWHGNMMV
GWLLAKKLFPHKNVAAVVSQSGDGTILADALGTLGFTLIRGSSSTDGDMV
KQRMYEHVQQGQMVAITPDGPRGPNHQFKYGTIRLASRQHIPLLFATIGY
KRSWQLASWDSFAIPKPFSKVTITLHILAIPLFTSEEELRHFSTTLSARF
SHE
>Cag_0325 conserved hypothetical protein
MTVMNFSTANQTYRQLMGNGLIYSVPRFQRDYSWTDQEWEDLWADILELL
TSDGEQAHYMGYLVLQSKDSKNFDIVDGQQRLTTLSILALSVLAHLARLV
EKNFDADNNRLRQEQLRNSFIGYLDPVSLVPRSKLNLNRNNNAFFQNYLV
PLQKLPQRGLKATEHLMRKAFEWFNERIKQEYGNRKDGAAIAGILDSLAD
KLFFTVITVTDELNAYKVFETLNARGVRLSSTDLLKNYLFSVVHHENNNE
YELNSLDERWEILVSKLGSESFPDFLRTHWNSHHRFVRHADLFKTIRANV
LNRKHVFELIRSMETDADFYVALSSHEDQLWNPTEQKFIEELRMFNVRQL
YPLLMAAWRSCERQDFTDILRACAIISFRYNVIGNLPTHEQERVYTNVAE
QISKEKLTSFYDILIAMRSIYPNDEAFSNAFSDKQLKTTQSRNKRIVRYI
LFKLETLLSGTEYDFDSDRYNIEHILPEHPENNWEQFSDRDYEQSVYRLG
NMTIMNTAANRDIGNTSYEEKRPRYAESEMMLTKKIAEENQSWTIERIGT
RQRSMAKQAKTIWRISQLD
>Cag_0438 conserved hypothetical protein
MSIGIKSRFFNNARKMAVESIRNPEKMRRLIASALELTTKAGRNAKLQAL
SNKVQTLIRFVQASISREYNVMPWRSLILSVAALIYFVNAFDAIFDFIPL
LGFVDDAAVLTAVLTSINNDLAKFIEWENSVKPQRTNVVDAEFEEVKEGS
LQ
>Cag_0121 conserved hypothetical protein
MVLMEQTTSQAQDVIVRLKKVSGQVDGLIKMLEREDECMRIITQFQAVKA
ALDSTFSLILHRNLRECVSRDNTESMERILKLISKQ
>Cag_1728 pentapeptide repeat family protein
MRLTFPPAVSLQFFNMNNQSMISSFFQRMAFTALVASALPSSLFAYDRAH
VTLLQQGVAVWNNQRQATMGQTLDLSRAPLAKAQLGEANLAHVSLSSAFL
QAANLRGANLQAANLRWSVLDGADLRDAVLVGAHLFEASLVKADARGANF
KSATSLEQADLSGALVSNNTIVPSGERAHGQWALRHHATFVQEPERPIAS
IASAISFSPERTITSPPNSAPTTVSQSAVTAHPSNVVPSPQASAQAPITK
EYARATLNGVNWSNADLAGANFYKADMKGAQLQGANLQGAHCDRAFLLQA
NLQGANLTKALLFGATLDKADLRNANLTEASLFGANCEGADLRGAILTRA
NVTDAVLTNALISSTTVLPSGKAATRQWALMQQAIFSQD
>Cag_1160 Pyruvoyl-dependent arginine decarboxylase
MSFVPTKVFFTKGVGRHKEYLSSFELALRDAKIEKCNLVTVSSIFPPKCK
RISVEEGIKELSPGQITFAVMARNSTNEFNRLIAASVGVAIPADDTQYGY
LSEHHPFGESAEQSGEYAEDLAATMLATTLGIEFDPNKDWDEREGIYKMS
GKIINSYNITQSAEGENGMWTTVISCAVLLP
>Cag_2020 hypothetical protein
MNRVVFIVDGFNLYHSLKAAQAVQRPASTKWLDLCSLFSSQLYHFGKDAV
LHKVFYISALAVHLEASNPNLTKRHQAYIKCLQANGVITLLNRFKRKDVF
CKLCKRSFYKYEEKETDVMLATTLFEQLATDNCDTVVLVTGDTDLAPAIR
SGQKLFPHKLILFAFPYGTKTTELKTLAPASFTLSSAAYAKHQFPNPFIL
SDGTTIPKPIKW
>Cag_0762 hypothetical protein
MTTDQLPTTQQPDDYDSPWKEAIEHYFPEFMAFYFPNAYTAIDWSTPYHF
LDQELRTIVPQSAQGKRVVDKLVKVQLLDGKERWLYIHIEVQGRREANFP
RRVFICNYRIFDQYGVPVASFVILTDTHYNWRPTSYSYEFAGCKHTLEFP
IVKLLDYEPRMEELLASDNAFGLITAAHLLTQKTKNQSKPRYEAKKLLMQ
LLLQRQWDQERIEELLRVIDWFLRLPKALRKKLKTEIHNMEEAQKMKYIT
SFERDAMEEGIEKGKELGVLEGIEMGKAEGLEEGLMKGRLEVAQRLVAGG
MSKAEAASFAGVSVDLL
>Cag_0592 DedA family
MTDAFLFLTDFILHIDSHLQTLAAEYGLWLYLVLFLIVFGETGLVVLPFL
PGDSLLFAAGSLASMPNSALDPNVLFVVFFAAAVLGDTLNYHIGNKFGNK
LVHGGYTRFFKAEHLEKTNAFFTKHGGKTIIIARFVPIIRTFAPFVAGIG
NMPYRTFLLFNVIGAFVWVGFFCYSGYYFGQLMFVQENFKLLIIAIIAIS
LLPPIIEFLKHKFSATKQR
>Cag_1091 conserved hypothetical protein
MTTTDRYINPFTDFGFKKLFGSEMNKDLLIAFLNTLLPIEAGTIADLTFL
PNDRVGRSEFDRRAIFDLHCKNEKGEYFIVEMQQAKQDYFKDRSVFYASF
PIQEQAQKGKWNYCLQPIYMVGILDFIFDENKADDTIVHHEIKLVNLSTG
KVFYEKLTFIYLELPKFTKSVDELESDFDKWCYLLSNLPDLTDRPARLQE
KVFLKVFELAEIAKYTPEEAREYEKSLKVYRDLKNVIDCAYDEGKAEGIE
EGIEKGKEIGVLEGMVKGKELGLQEGLQKGMEAGLLKGKLEIARKLMVKG
MSADEAAGIAGVDVERLSSNDE
>Cag_0037 conserved hypothetical protein
MQGMVEQYTRNANTENRAYKPHLGWHKKTALLVTLFLTLTALFSLPQTTL
AAVNTVPPNFVLIRGGEFTMGSPESESERDRDEMPHRVKVGDFYIARYEV
TTAEFRTFVQETGYRTDAEKTNPSLVFWSGLWPGKAGLNWRYGTNGKERS
GAENNHPVILVSWNDAVAYCKWLSKKHGMNFRLPTSAEWEYACRAGTSTV
FNYGDNLSTTQANYDGNYPYSNHPKGIYRKNTVPVNSFTPNAWGLYNMHG
NVAEWCSDWYSEPYYESSKANGTVTNPTGPATGSNRVMRGGSWYDDARYC
RSADINDSTPSYRYINVGFRVVLEP
>Cag_0980 conserved hypothetical protein
MNTFIYQQENWPHFTWQNEEIVNLLSEARHLQGKLIGKMESLGFDLRNEA
LLDTLTLDVVKSSEIEGEYLNSDQVRSSIARKLGIEIACSVESDRNIDGV
VEMMFDATQNCYNQLTTERLFDWHAALFPTGRNGMYKINVADWRKDTTGP
MQVVSGAMGKEKVHFQAPDSSLVEKEMNLFMDWFNSQVTTDLVLKAAIVH
LWFVTIHPFEDGNGRIARALTDMLLAQSDKSHLRFYSMSAQIRIERKEYY
EILEKTQKGFLDITEWIKWFLSCLINSLKASESVLLNVLFKASFWDKQSK
TLINERQRKLLNKLLEGFDGKLTSSKWAKIAKCSKDSAIRDINDLIDKNI
LQKESAGGRSTNYALKQ
>Cag_0894 conserved hypothetical protein
MIKAILENRTLKNMADKHLMTPSTTTLSGLLGNGLTYKVPLFQRDYSWKI
DNWTDLWEDIKILLNTGKDHYMGAVVLQKVGEKQFLIIDGQQRFTTLSLL
ALATIKKIQDLIDAGIESENNTERINELRRGFLGQKDPASLHYSSKLFLN
ENNDPFYQRNLLQLISPQNQKTLIDSNRLLWQGFDFFYKRIGEHFHNATG
AELATFLSKSIGDKTMFIKIEVEDEFSAYTLFETLNYRGVELTVSDLLKN
YLFSLITPSDLRIAKEIWKKIATSVGMENFPIFLRHFWISRKPLVRQEQL
FKTIRTEVASNQQVFDLLNNLDAYSEMYLALQDPYNINWQGNRERIKRIR
EMKLFGVKQQLPLLMVAKEKFSDQEFDKTLKVISIISFRYNVIGSRQANR
MEEVYNIVSQKIFNTTITSAQGVFNNLKELYISDADFRNDFSTLELYSYG
QSKKLARYILFELENNMMQGGDRDYELDPATIEHILPENAGSHWDIDFPQ
IIQPSYIYRLGNYTLLEDHLNRDCETLLFDKKKPFYVRSQYEMAKAINYN
EWNPLTLDARQSDFAKKATSIWRISYAD
>Cag_0201 conserved hypothetical protein
MELFYTPTEQIHPATNQVSIEGEEFHHLVRVLRKREGELILVTDGNGLRC
EVRIASISKHDVQGEIISTTTIEPPSTRVTVALSLLKNPQRFEWFLEKAT
ELGISTIIPMVTARTVVQPSNDRVHNKLKRWRTIVLSAARQSKRYYLPQV
VEPQRFSDVLNRSGYDERCMPFEAATSSPTLHCAGKNILFLIGGEGGFSA
EEVQQAEATGCRTMSLGTTILRAETAALFAVAYVRSQLLTEAPSAWL
>Cag_1224 Uncharacterized low-complexity proteins-like
MTTPDPQVKPSTLPNDPLTLLEVANHSSERLAVQHTAFIAACVYVLIIVF
GTTDLDLLIGKGVRLPFVDVEVPIVGFFAFVPFILVLVHFNLLLQLQLLS
RKLFAFDATVPQDDGIGGLRDRLHIFAFTYYLAGNPSRLVKPFLAIMVSI
TLVLLPLFVLFAMQLQFLAYQDEVITWMQRFALWLDIALINIFLPTMLHP
KDDWKSYWRNVIACYVPHRRVWLSFLLLYVGTNICLFASKKEILLIGIAL
LVLSLLLLPILRGWKATHKVQKIIIPILIIVTFAIIALLFLVEVRDWIEI
TITSFISTETIREKVFPLSFILYALIIVLTVLWQQSAPRGSFALVVTLFL
GTLFPLAFMVDGEHLEKIIAKGENATFLSNVLQDKRRLNLSEQHLFAKAL
KPEIITLISDGKWKEALPQIEPINLQGRHLRHAELNQAMLLGADLRFADL
QGAYLSDADLQGAYLSDADLQGAHLRQAELQGAHLRQANLQGAYLRQADL
QDANLSYTNLQGADFIGADLQGADLRFAHLQGANLFGAHLQGAYLFVAHL
QGAYLSGAHLQGADLSAAHLQGADLFGANLYAADIRRANTTLVDAQNIRL
EPLSEKEATELRTTLKPLIKDNEDYNEVAERIKKATAPHGEIPYFESILA
EKNTPLRYKKCYNAENSAERRAFTKQLHPYLVSLASQSPEIARGIIQQIP
ISEPNTSSRKGLAAELAKHLNDPKCKGLYELRDDEKEELRNWKEE
>Cag_1790 conserved hypothetical protein
MAILTRSMVLQRQASPYQFRFALIRCIAVLMLCAPISLYAAEEVQQAESA
AWKRDVALRLEKYCITVFRSPGSGKTRENNLRVARFYATRSYQPLWSSTT
MTQELATSLNAAFEHGLTPAEYDVAGELPRWMALTNRSAAAQARYDVLAT
RAFLTLATHLRYGKLDPVRFEPTWNFSSPPNLFHFDELLARTLQRTSPSE
VLNGLLPRDPGYDVLKKELARYREIAKNGGWSAIPAGTLLQEGSRDARVP
LLRQRLAASGDISSSAVADTTTLYNPDVTKAVKRFQQQHGLWSDGVVGAT
TLRAINVSADERIGQLRVNLERCRWLLHDISPTSVIVNIPAYTLHYFEQG
DRRWSTRVIVGQPKRPTPVFRADMQLLILNPRWVVPSTVLAKDVLPAVIK
DPAYLRKKKLRVVDENGTIIDPATIKWSSYSASTLPYRLQQKSGDDGALG
RIKFLMPNRYTIYLHDTPDKALFQKTQRAFSSGCIRVQHPEELARLVLRH
SNRESRPSLESRIKSGATSTIRLPQQIPVYLIYLTALPCNNKAEFREDIY
HRDPQILKALDAK
>Cag_0818 hypothetical protein
MSKRVVTDTMAIVLRLEQRKLPQQVRNIFLKAEQEECTIIIPTMVFAEIG
YLSERGRIDVTLDDVRTYCTQHPNIVESALTQAIVAHSFTINDIPELHDR
LIAGTASYQQLPLLTNDPIITQSQHLTVIW
>Cag_0262 hypothetical protein
MMPTNEPLSSIQARVEELQSTIAEKEEHIKASARHFKDELKEEVSPLRLV
RRYPLQSVGVAALAGVALGRRLRRGNRASSVVAAAPVVQSNVLLSTTQTL
LSVVGMELMRSLKEVGLSYLKQYIEKKIR
>Cag_0148 conserved hypothetical protein
MLLLHKLLPLFVLPLGVVLIAIIVGTLVRKQVLLWYAALVLWAFSIPTVA
DGLMHFVEGNRTVALPQTLHQADAIVVLGGMIRRVEGAAEGEWNDAADRF
EAGVTLYRAGKAPLLLFTRGRMPWSPDAVPEGELLVERALQRGVPEAALG
LTAPVANTEDEAADVARLLRERGAERIILVTSAYHMRRSQLLFEHTGLSV
EPYPVDFRVDSYPEPAVLRFAPSAEALYRSETALRELIGWLYYRVKLFFV
K
>Cag_0697 conserved hypothetical protein
MKILLDTHYLIWSFTNPEKLPQGTAELLMAEENEIFFSQASLWEISIKFN
LGKLVLMGITPEKLYNEIKLSYFQCLPLQNEELISFHRLPIKHRDPFDRI
MIWQCICHNISFLTVDNVIPNYKQYGLKIVTSS
>Cag_1988 conserved hypothetical protein
MMQWHCKAFLILRHSFNEQCNLFMPEIKPFSGVLYHPELLKQADKLICPP
YDIISSAQQQSLYHRSPLNAIRLELPLEENPYGTAAARLTQWLQSGELQR
DSEPAIYPYFQTFEDLEGKSHVRHGFFTAMRLHEFSENKVLRHEKTLSAP
KADRLNLFRATRTNISPIYGLYADEHRTLDQLMVAYSETHEPLLDANVQG
IRNRLWRITEPTLLEQFRQTLLNRQVYIADGHHRYDTGVTYRNERMAANP
THNGNEPYNFIFSCLTNIYDEGLIVFPLHRVLHSVADFNAERLKEQLAEF
FTITDLNSQDELKAYLAASTSSFSYGVVTSGALYGMTLKGEAAPLLDAQC
AHCPEAVAQLGVVVLHQVIFHKLLGISHEAMEAQRNLLYVTDVNEVFHAV
ACRTAQAGFVVKPTTVQQVLDVSESGEVMPQKSTFFYPKLMTGLLFNPLD
>Cag_1441 conserved hypothetical protein
MVAFIHQKTNWPYFTWNNDEIVNALSEARNLQGRVIGKMESLGFDLRNEA
LLDTLTLDVLKSSEIEGEYLNPEQVRSSIARRLGMEIAGSVESDRNVDGV
VEMMLDATQNCFKPLTVERLFDWHAALFPTGRSGMLKITVGDWRKDTTGP
MQVVSGALGKEKVHFQAPDSIVVEKEMNQFLEWINNNVKIDLVIKAAIAH
LWFVTIHPFEDGNGRITRALTDMLLAQSDNSNQRFYSMSAQIRIERKQYY
DILEKTQKGNLDITEWIQWFLNCLINALKSTDATLFNVLLKANFWSKHSK
TLINERQKKLLNKLLDGFDGKITSSKWAKIAKCSKDTAIRDINDLIEKNI
LQKEAGGGRSTNYELKI
>Cag_1188 phytoene desaturase
MSSEKKSVVILGGGLAGLTAAKRLTDLGFQVKLLEKRNIFGGKVSSWKDE
EGDWIESGTHCFFGAYDVLYDLLREIGTYHAVQWKDHQLTYTLAGGNAFT
FKTWDMPSPLHLLPAIASNGYFSAGEMAAFAKSLIPLAFLKADYPPTQDH
LTFAEWAQEKKFGTRLMDTMFRPMALALKFIPPEEISAKIILDVTETFYR
IPDSSCMGFLKGAPQEYLHQPLVDYLTERGAELQTNVTVDELLFDGSDIK
GVQLLNGEILTADYYLCALPIHNLNKVLPQNLKDYDFFFNRLENLEGVPV
ISVQIWYDTEITSANNVLFSPDGVIPVYANLARTTPEYQTLRGKPFTGKS
RFEFCVAPARELMGLSKYEIIRMVDQSIRNCYPKTSRGAQILKSTVVKIP
HSVYAPLPNMEQHRPTQQTPVSNLFLAGGFSRQLYYDSMGGAVMSANLAA
KALAKAAGCEE
>Cag_1495 conserved hypothetical protein
MLRLLLQWLINALAVYATAQILEGIHIRGFATAIAVALVFGLINTLVRPV
LLFFSFPVIVLTLGLFLLVINALLLQLAALLVGGFSIDGFWWAVAGSVVI
SAISWLLSSLFRVG
>Cag_0715 conserved hypothetical protein
MTDAEKPRLIVVAGPNGSGKTTITDKLLRHEWMSDCHYINPDLIAQQEFG
NWNSPEAILQAANKAKALREEYLRQRCSMAFETVFSTNEKVEFLWRAQKQ
GFFIRLFFVCTNDPTINAQRVAQRVIDGGHDVPITKIIQRYYRSLSNSIA
AMKWCNRSYFYDNTAANQDPQLLFRVNDGCLKKVYAPIAPWAEIMYQTAI
NQGTISDTTSHNPNPSTITFYEY
>Cag_1042 conserved hypothetical protein
MKKISLYLTALLLTVATPLHAEEEQVCTTQADQKGSLTITITNFRAIKGS
LGISLYNTKKGFPSGYEQAYANQIKKVNGSSECVTFQNIPYGVYAVSVVH
DENENGKLDTTFIGIPKEGVGASNNPKMSFGPPSFNDSKVLLNKDVLEVM
VSMKYF
>Cag_1140 conserved hypothetical protein
MNEHSHHRPLTKIGILLMLIGRYLFAGFFIYGFWHKLTRGWLSSDITQRH
FIKRLGELPIDSWQAAYLEYFAIPLAMPIAWIVTVGELLIGVSLVVGLMT
RINAGFALFLLLNFAAGGYYNLSLPPFIIFAVVMMLLPTSHWLGMDKQLH
NRFPHSIWFR
>Cag_0080 conserved hypothetical protein
MEAVWLKQSGADELLLFFNGWGMDRRSAEYLYHIVIRDGWRGDCLSFFNY
KDFAIEPSLIDAISNYKRCNLLAWSFGVWAARHVALPPIECAIALNGTIF
PLDAERGIAPELVAATCNGWSESNRQRFERRMCYSRQLHEQFADITSQRT
VADQQAELATLQPLMLVSQAAALIPSPKSSPLPQASSQLSRSASAAIAPL
STWHYQHAIIGGRDLIFPPQAQQTAWQGTPTTFIADMPHLPFFHLPALTE
LLAWHNV
>Cag_1184 conserved hypothetical protein
MLSRVAESLFWMSRYFERAENTARFLDVNFNLLLDLNKITHVDNPNYWIA
LILVTSDKERFNNLYDSYSAESVTDYLVFNKSNPNSIISCIGLARENARS
VIESISSEMWEQVNNLYHYLQSVTPQMVHNDPYIFYKEIKNASHLFQGIT
DNTFSHNEGWDFIQIAKYLERADNIARLIDVKYHMLLSDAHHQAEIVAGS
EDIIQWMAVLKSCSALEAFRKVYLAKIDPDNILRFLILDRSFPRSINFSV
CAAEDALWRISGSSRHRYANNADRLIGKLEAELIYTTVEDIHEKGLHDYL
EDMEQRLIKVGEQVHLTYFAYHTPEIEPDSSEEALPFTGVAGGRPNWSQA
QQQQQQQ
>Cag_1812 conserved hypothetical protein
MPSCVYSADEGLIDTDYGGGVIKQRIPRQGEGKSGGLRSIILYKKADKAF
FVYGFAKNAQQNIKDSEVKGFKKLAKNIFELTDTQLEKLIISKEFTRVQC
HEE
>Cag_1717 conserved hypothetical protein
MAITFLQMAEKVLEEEKQPLTASEIWQIATEKGYDKLVESKGKTPWATLG
ARIYVEVRDNPSTDFIPLATRPKRFSLKTQMSILGGKIPETTKAPQLHTP
KIEFLEKDLHALMVYYGFYYLKAYLKTISHNKSDKKGFGEWVHPDIVGCY
FPYKDWKAEVVEVSSLMGNTAVKLYSFELKRELSIANIRESFFQAVSNSS
WANEGYLVAAHIDNDEDFRSELKRLSTAFGIGIIQLDIDDPDSSGIILPA
NSKDVIDWDTVNKLAGINPDFNDFLKRVRNDMKNQEIRKELYDVVVEKEE
LKKLFTKKKSSS
>Cag_0935 Ribonuclease BN
MLMKKVPHIIHRANQSGGKRYERMVAFVAFFRTNLLHDRIFISAGSLAFQ
TLLSIVPVLAVVLSVLNLFEVFTPFQHSLETFLVENFMPATGRLLHGYLL
EFVGKTGNIPLLGSLLLFVIALSLLSTVDQTLNDIWGIRAPRKALQGFTL
YWTVLTLGPLLIVSSLAASSYVWYTIFTDEGALFELKTRLLALFPFINSI
VAFFLLYMLVPKRRVRIAHAFAGALVASLLLELSKRWFLFYVTHVATFEH
IYGALSVVPMLFFWVYLAWVVVLVGAEFVYCLGAFAPSTPTAEAQGSLRY
LSLPLAVLATLHNAIEKGAPLSLKSLSRELSTLPFNLLRDMVDLLLDRQV
LHLSTRGELALSRNLHAMSLYELYQIIPQPINATEKSLLFSESKSVHLAP
LSLEVEACLQERMATPIAELLQHSFLKASTVD
>Cag_0962 conserved hypothetical protein
MKKAGLIILVLCGIVLLFDMLLMPLYTTQGRSERVPNVVGMEFEDAERKL
EMAGFEAVRSYNAGYEVDVPANVVLSQTPEATMEVKPGRAVYVVVNRGAK
PAVQMPNFLGLSEGEARQEAARLELFPVDVVGTPVANSSDDGRVLNQSLP
AMTLVQSGMPLTLFVGRYDAEAVNAERIELPNLLGMSLGQAQQTLAEAGL
IIGHVVTERSRLLLPNTVISQRPAVGTLLAPGQAVDLTIVGE
>Cag_2011 conserved hypothetical protein
MSAHIYKKVEMVGSSPNSIEDAINNAVAKAAETMHSIRWVEVAETRCHVE
NQKVAYFQVTVKIGATLEENETL
>Cag_0822 conserved hypothetical protein
MVSTALRHHSQFRSTLLLFAALLMLVSSCTPQQSEKDERQQHVESMMEIL
AQVQKNLARIQQKEAVVERLSTGVERSQGENVEQMGRDIAASIRYIDSTL
AASRNLVQKLEEQNRTSTYRVESLDRLVAELQVAINVRDREVKALKGQVK
KLGGQVASLVATVDVLDEYIDTQESEYYTAYYIAGSAEDLMKKGVLVPIN
PLQKIFGTNYRLASDFNINLFRKIDMMESRDFFFEKPLQSLTIITPHTRG
SYELAGGKTSSLLVIRNEREFWQKSRCLVIVTE
>Cag_0085 conserved hypothetical protein, phage-related
MSYELEYFHPRIQKEIADWPKTIRMDYARLVELLLEFGADLKMPHSKAMG
DGLFELRAKGKEGIGRAFFCFMKGKRIVILHTFIKKTQTTPQRELDKARQ
RMKEVINAKE
>Cag_1324 conserved hypothetical protein
MTISTVFINNHSQAVRLPTCVRLPDTIKKVSVRVNGNERIIAPVGQMWNS
FFLGSSKVTDDFMEERNEQAQPEREEL
>Cag_0194 conserved hypothetical protein
MIVTGLDVLLRNLDMLRHRSVGLLVNQTSLTASMEYSWQLLQKQGITIRR
IFSPEHGLFATEQDQIAVSYQPELGCDMVSLYGDSAATLVPDMALLDDLD
VVIFDIQDVGARYYTYVNTLALFMEAIAGRDIELMVLDRPNPLGGEIVEG
PMLDMAFGSFVGVFPVPVRHALTAGELAVLYRDVMQLDVNLRIIKMEGWK
RTMLYGETGLPWIPPSPNMPTVATAEVYPGMCLFEGLNVSEGRGTTTPFQ
LSGAPFIHPIELAERCHSYGLEGVRFRPVWFKPTFHKFAGEVIGGIWQQV
TDARRYRSFATAVAMTAALRELYGEQVTFLRGVYEFNDTIPAFDLLAGNA
TIRTAIESGNTIHTLLTLWQKDEAQFAETKTRYHLY
>Cag_1696 conserved hypothetical protein
MEKIAEELLKLIPFSKLFELFGLSKEISLALSTIFSIGIVALLWYGVKMI
YLRFQIAINAKDIKPYFNNANDIEKKLSLFIETWGQDKLPAREEEPIYTH
ESALNKSRLISHFIKKVFSADKSGEKFFLVLGDSGMGKTTFMVNLYVRCQ
SFINFRRKNKVKVKFFPFGYKGEILDKIKEIPQDEKINTILLLDAFDEYY
KLLPPDIPDGLSDDKRFRKVLDEVIDVVQDFRKVVITGRTQYFPGEDDKS
YILEIPTFDDNGFHKLNKFYISPFTEDDIRHYLYKKYGYIRFWNFKKREK
ALKLIFENLKETKFLLVRPMLLSYIDLFVNSNQIYKNIWDIYEALINEWI
EREGNKRKHDSIACQQLKENLHNYSQKIAVTIYENRKGMQIVSLTKEEAT
ENINDALKHYEVTGQSLLTRDAENKWKFAHRSILEFFIAKEAVKNQEFAN
KLDVTNLDMAKKFCIEKGLGYLFDYVPIKGGEFTMGSPDGEVDRSSTETQ
HQVKLHNFYIAKYVVTVAEFRKFIEECGYKTDAEKANSSRIWTGKEWKYK
AGVNWRCGVGGQLRLQNEENHPVIHVSWNDAKAYCDWLSKKTGKKYRLPT
EAEWEYACRAGTTTPFNTGDNLTTAQANYDGNYPYNGNAKGKYRQTTVPV
DSFAPNAWGLYNMHGNVWEWCSDWYNDKYYEVCKAKGVVENPECTEEQSY
RVLRGGSWGNDARSCRSAIRNLLRPRPPLQQRWLPPGFRPVASGVAHSTC
F
>Cag_1489 conserved hypothetical protein
MFASSSSRKKSLPVVLVSKKTKSRIMQVVRRMVVVLAVVVGIAAMSFVSI
GLLLSLPASKPQKADVIVVLGGDKGLRVQKGAELYNDGFSKKVLLTGIDR
RYYRPSRPNWRERRIRDLGVKKSAIVVDTKSQTSWEEAMNTAATMERRKW
ESAIVVSDPPHMLRLWLTWHHAFAGTSKRFTLVATKPDWWHPIFWWKHKT
NYKFVVSELKKNIAYVVFHYIKWSDDPNQDVLTER
>Cag_1791 conserved hypothetical protein
MGTLLLIGFATIAPTTQSIAADNHIAVSASASIPVKPDMAEFTVIISADA
KQADKAATEVAEKYAAVQASLRKAGIAADDAPTTAYTVAPRWEWNGTLQK
NVLKGYSARHTLKVKVRSLSAIGQAVDAAVQAGANEVQEVHFVVSRYEAF
RQQALEQAVAKARADAAIMAQAADCKLGTLLEASVTQQSNMPRPMYDAMT
LRVAAAPKAETTMVAAEQEVEVTVHSRWQIRPLVGSK
>Cag_1371 conserved hypothetical protein
MNSAPFLALFLSGLAGGFGHCISMCGPVVAAFSMGETRKGILHHLLYNLG
RITTYTILGAIVGLSGSFLVLATAIEKFQNVIVILAGLSIIIMGLATAEW
LPMPKQLNSCTSGVSLLQRLMSFFKPPRSLGSYYPMGIVLGFLPCGLTYT
ALLAAARAAMDTHHPFIGMMTGALMMLLFGIGTTPALLLVGKVINTISNK
TRKWFYRIASIIMIATGVWFVLSAL
>Cag_0794 conserved hypothetical protein
MDAFWLSLVMIFLAELGDKTQLVALTLATCYNTSVVLWGIFWATLAVHVF
SAAIGWFIGDQLPTEWILFVAGVAFIAFGFWTLRGDSLDEEEESCKRGIN
PFWLVFTTFFMAELGDKTMLSTITIASTHPFLPVWLGSTVGMVLSDGLAI
VLGKMVGKQLPETLIKRGAAAIFFLFGAYSMYDGGATFSPLIWAIAGMVV
LLFGYFFLRKPKA
>Cag_1457 conserved hypothetical protein
MLESMTGYGSAESVMDGVRALVELRSVNNRFAEISVKLPRQLLAYELEVR
EMIRAHFQRGKISAFIQLQLDEEQPIPVAINPAKVKAYTALLRSLQQEAE
IDAPLQLDHLLRFSEIFDSTASALDKGEMLWPSVKTLVLEAIERLQAMRR
REGEELSGDFRQRIATIEELLATITTLAAGNLDAVRAKLTARVEAVAGKD
VAYSRDRLEMELVIAADKLDITEELTRFASHNKFFIQELESNESGAGRKL
NFLLQEQLREANTIASKSQNAAISQHVVHIKEELEKIREQLQNIE
>Cag_0977 conserved hypothetical protein
MQPTALTTYYTVIYPQLASFFCALPLPHGWRLEAEHPTSDVWSFYDTFSW
HAFLQDVAIVKKGKILALYHMKSGVSNATISLPTTPTSFFASELPPCALQ
EELATCTTLRAYLQLCSVEVNNIAYRLLDNAHKTLAIINYLQFQLPEQSE
QDTLSIITVTPLRGYQKESGMVEALIKADEAIGTYSFNELYRSIMQAYNL
NVGAYSTKLRVTLDADAPAAISIAKILQWTFSIMQRNEEGMCNNFDTEFL
HDYRVAIRRSRSLLKQTRSIFLNNEIEPILTFLTTLGKRTNALRDCDVLL
LHQKAFTNALPPLLQPALQQLFTLTINEKEKLQKELSHYIKSNTYIKERQ
EWQQKLLPPYSFFDSHTRETLTTKIVAITSLRKAWKKMLRYGRTLHNNST
DTELHTLRIHAKKVRYLLECFQSLFVSAQLAPLLQQLKALQESLGTCADK
AAHLQLLEKKLSHPKLTQESAAALGALLAILYQEHQTARLIAQEAFAVFD
SNEKSALFNTIVTNVKL
>Cag_0319 conserved hypothetical protein
MVELQEKNGSVCIAVRAQPRSSKSMVSGEWNGALKVHLQSPPVDDAANEE
CCRLLARLFQVPPSRVHLVAGHSSRNKRVMVEGVSAAMATELLQPFLHT
>Cag_1923 Twin-arginine translocation pathway signal
MGMTRRQFFSTIAGAAASAALVAALPERLLAAWSDKAFTASTLANAIAGK
YGNLPIEDSTAIQVKAPEIAENGAVVPITVATNIAGATNISIFTEANFAP
MVASFDLLPRSLPEVSLRMRMAKTATVVVLVQAGNKLYRATREVKVTIGG
CGG
>Cag_0403 conserved hypothetical protein
MGKSEQRYGKQLISVLSAQLTQKYGSGFSVTNLKYFRTFYVTYPDRFDTI
GYPMGSQLPQEEKSRPLGDQLPEAEKSYPIGSGFSPQLTWSHYRALMRVQ
NEKAREFYENEAIDGGWDKRTLERQIHTQYYERCLHSQQPEKIIAEGRKL
QKEVPLATDILKNPYVLEFLGYPNFAELRESDVERAIITHLQRFLLELGN
GFAFVARQKHIRIDEDDRFIDLVFYHCRLKFYLLIDLKLGKLTHADVGQM
DGYVRMFDGLFTALDDNPTIGLILCTEKCDTVARYSVLNDRKQIFASKYL
PSLPSEEQLQIEIEKERRLIEAALEEQKACKHE
>Cag_1912 TPR repeat
MKKHILPFLALPFVLLNACASKQELNVVQYDVTRLKSEASNLKNEAQAIK
SQTAVSYADMQQVRNDIARLNGSLEEVSHRITEQTNKNNNVFKRLGTEDS
LLVHQLSGLETKAAVLEKKLATLDSRLLALEGIVGTGTEALRKDSVATTS
IKPAVASTLAVEPTPATVNDASMFQEGVTLFGKKNYGAARQTFMALIKRF
PTSLLVGDAQFYIADSFFSEKRYEQAIVEYQEVIAKYPKNSKRPAALYRQ
ARSFELIGDVANAKTRYKDVVNVYPTSPEAALAKKKL
>Cag_0165 Protein of unknown function DUF28
MSTSSLITIFYIMSGHSKWATIKRKKAATDQKRGNLFTKLVKEITIAAKM
GGGDPTGNPRLRLAIDTARSNSMPMDNIQRAIKKGTGELDGVSWDEITYE
GYGPAGIALIIETATDNRNRTVADLRHIMSRNNGSLGESGSVAWMFQRKG
SLDVPHSAVSEEQLMELLLDAGLEDLDQDDENYFTVITDIKDLESAKKAL
DEAGIAYENAKIDLIPDNYIELDADDAAKVMKLIDALESNDDVQAVYSNM
ELSESAMNSLSE
>Cag_1774 LemA family protein
MRYMKSTHYALVKIFMALLLISSLSGCGYNSIQQNEEAVNRAWGDLESQL
QRRADLVPNLVATVKGAANFEKETLTAVIEARSKATSIQLSPEMLSDPAA
MAKFRAAQGALTSSLSRLMVAVERYPDLKANQNFLDLQNQLEGTENRISV
ARQRYNGAVEIFNVSIRQFPNSLTNSVLLQLKAKEYFKADEAAKAVPDVK
F
>Cag_0701 conserved hypothetical protein
MEKRDLKKELRHLYNPSPKAVEMVDVPTMNFLMVDGMGDPNTSQTYADAI
EALFSLSYTIKFMVKKGEMALDYGVMPLESLWWCEDMSTFSPDDKSNWQW
RAMIMQPSWITEDLVNQAIEEVKRKKGLAALPFVRFEAFEEGVAAQIMHI
GHFAEEAPTIEKLHHFIAANQKQRRGKHHEIYLSDIRKAKPEAWKTIIRQ
PLA
>Cag_1961 conserved hypothetical protein
MADTKKLAIIASQGTLDWAYPPFILASTAAAMDMEAVIFFTFYGLPLLKK
EIDLKVSPHTNPAMPMKMPFGSKEFQGVNWSIPNLISGNVPGFDNMATML
MKETFKKKGVATIEQLRSMCQEFGVRFIACQMTMEVFGFEKDQFIDGVEY
GGAATFLEYAADANISLFI
>Cag_1020 YjeF-related protein-like
MQPVLTAQEMQAADRAAIETLHISEARLMELAGRECLRLILDMLERKKLD
GCGFLILCGKGNNGGDGFVLARHLLNYGAAVDVVLLYPPSILQGVNREGF
ATLQAYEAEQAPLRIFEGIEEALPFVEENHYTMLIDAMTGTGLRLARRGM
ELAPPLSDGIELLNRMRHESNATTLAIDIPSGLEATTGFAAQPVVEADVT
VTMAFLKRGFLLNDGPECAGDVKVAEISIPTFLTESASCRLIDQEFAAEH
FLLREPSSAKQHNGKVLMIVGSQSAQHSMLGAAILAAKAAIKSGIGYLCC
SLPQELVGAMHLAVPEAVLIGRDVDVLTEKIAWADSVLIGCGLGRNAEAL
ELVEMLLQSETLQSKKLILDADALFALSTLDAITALQKCNHVLLTPHYGE
LSRLCNIPIADIAANPIEIAHECACNFSATMLLKGNPTVIANGKYPILLN
NSGTEALATAGSGDVLAGLIASLAAKGATLPHAAAAATWFHGRAGDLAHD
VASLVTATMVADAIAQAIGEVFEVE
>Cag_2031 Protein of unknown function DUF37
MSIWKIINAIPIVLIRLYRTFLSPLLGPSCKYVPTCSSYALEAFERHNFF
YALWLTIWRILRCNPFSKGGYDPVPPLQGTQQSSSHHQESSHHG
>Cag_0016 conserved hypothetical protein
MYDTTLLLEKLEQIDLAIEKIKRRFTTIKRPDDFLDSEQGIDMLDAIAMM
LIAIGENFKIIDKATNGSLFVPYPHINWAGVKGLRDILSHQYFNIDAEEI
FEICQKHLDDLHEVVKHMMEALPS
>Cag_1985 Cold shock protein
MVNNRDLSLTRIGVFYDGNYFLHISNYYNYFHERKARISISGLHHFVRNY
IAQQEGSDEQLCQIVDAHYFRGRLNAYEAAQEGNALFYDRLFDDILSSEG
VTTHYLPVKTSQTGVRYEKGIDVWLALEAFEQAFYKRFDVLVLIASDGDY
VPLIRKLNTLGTRVMVLSWDFEYTSDNGKQMTTRTSQDLLEEVTYPVAMH
EIIDNRIKKNEPLINNLFVPPSKKRIFEKPLDREYDESEQQTGTILALKD
GYGFIKFPPNNLFFHYSNLNGVDFNDLKVDDAVQFVIGKNDRGDEVAKDI
VLVAEESAIS
>Cag_1874 conserved hypothetical protein
MLYKYLFLMKFISFLKSSALALFLTLPLALPAEAGWDPAAEDRARVSVEY
FKTQWTELDRYFSQAYGYAVFPDVYKGGLFFIGGAHGKGYVFEQTRLVGT
STITQLNAGPQLGGQSFSEIIFFKGQEDLERFKQGNFEFGAHMSAIVVNQ
SIATNTDYSNGVAVFVFPKAGVMVEASVGGQKFSYHPN
>Cag_0182 Fic family protein
MKSTYDPPLSITNTVIHLIADISAQIERYAIRMEQDDGLLWRKINRIKTI
QGSLAIEGNNLGTDQITALLEGKQVIGTMREIQEARNALKTYDAFSTFDP
YKQVDLLRAHGLLMEALVDNHGKYRRGNVGVFAGDSPIHVAPPAHIVPKL
MDDLFEWLTFNNNHLLIKSCVFHYEFEFIHPFMDGNGRIGRLWQSLLLAK
LHPIFEYLPVETMVFKNQQRYYQAINDSTNVADSAIFVEFMLGEILTTLK
NRQGEPLQCLPTNSMSGAVNGVVSGVVKTVYDFIEQNPGCRKPQIAKKTE
IPIKTLEKHITKLKAMNKIIFVGSPKSGGYYIKIGEQKTVWTTLAVAQNQ
TQ
>Cag_0836 conserved hypothetical protein
MKSFESEIIIEATPAEVWQMLTAFAAYGAWNPFLRRVQGSATNGAQLYVE
AKLPALPAIRFTATITTMQLLHRLGWHARFAGGLFRAHHFFRLEPQTSGG
CRLLHGEEFSGLLAAPILLLLGSQFRAGYTAMNEALKRQVETNSDR
>Cag_2017 hypothetical protein
MKLFMKAKIVAQHATKAALTAQAVKPEEATEVATVTDMGDSAGSFRVILF
NDEEHTFEEVIQQLMIALSCTRSKAERLTWVVHTRGRCMVFAGSLEEALQ
VSAVLEVIALRTEIQSVG
>Cag_1361 conserved hypothetical protein
MQERGEDKSDWKQAALLTSGEIEAAVASDEDEAGMVVDWSNVSVELPRPK
SVLNMRIDYEVLEFFRSQGKGYQKKINAVLRSYMEKQKHSAMS
>Cag_1792 hypothetical protein
MISAWINLSPLKNIFKKELNEGLPAHYGLRTLGQGIATYNKQKRVMAAFD
FGTTVPPEQFSHFRHHPVQLDIQRALREGWELFRNNPSEFLGFTLVCFAA
WLVGLLFDGGSSIIFSSIAAPLYSGYTMAVFRIAKGESLEFSDFFKGFNY
LLPLFLASLACTLFVSFGLMLFILPGIYLAVSYMFTTFFIIDYKMNFWQA
METSRTIITRSWFSFFAFAIVLFIINVAGAFLFGIGMIVSAPVTACAAAV
AYRNCMGVQISSIDEE
>Cag_1396 conserved hypothetical protein
MQLFFLLLFLLPSIAFGKQPLQCGIDRLDASQFRELAGLRVGLITNAAAL
TQRGEQNYKAMQRCGVNLRFLMAPEHGLAANVEAGKKVGHSVSADSLPVY
SLYGATRKPEQRHLHDIDLLLFDLQDVGVRCYTYISTMKLAMEACAEAGI
PFMVLDRPNPIAPLQPQGFMLQAGYESFVGLVSVPFVHGMSVGEIALLLQ
RAHYPNLDVRIITMTGYQPHCFGDELEGFTFRSPSPNIRDVATALLYPAT
VLLEATTVSEGRGTQAPFQQFGAPFIKSNELLQALERYHLAGVKLRAVRF
TPTASKWKGEECKGIGITVTNRRSFSPFAMSVALLRELQRLYPAQLGLDI
KRNATFFDRLAGTPRLRELIVQQASLQDILAESEREVARFRAVRLYR
>Cag_0843 conserved hypothetical protein
MSEPLEMGFVADDRLAGFRLQRLEIFNWGTFDGRVWTLKLGGKNGLLTGD
IGSGKSTVVDAVTTLLVPSQRIAYNKAAGADNKERTLRSYVLGYYKSERQ
ENLGGGAKPVALRDLNSYSVILGVFHNEGYDKTVTLAQLFWMKDAAQPAR
LYAAYEGDLSIATDFSNFGAEIATLRKRLRGAGVELFESFPPYGAWFRRR
FGIENEQALELFHQTVSLKSVGNLTDFVRLHMLEPFTVEPRIAALIHHFE
DLNRAHEAVLKAKRQIEMLAPLVADCDNHHAMAQRTEELRASRDCLRPWF
ASLKLELLEKRHTSLNEELSRHQIAIERLDGERRTLQGRDRELRRTIAEN
GGDRIESIAAEIRQHQQELDRRTQKSTRYKELLRQLGGQLGEHPATSAEE
FYQQRAEHAAMHESAAEAEVQVQNNLNEAGVLFTQGRQEYEQLSTEIKSL
KARMSNIDEKQIAMRHALCKALNLPEVEMPFAGELLQVREEETAWEGAIE
RVLRNFGLSLLVPDHHYPKVAEWVERTNLKGKLVYFRVRPQSRNEQLADH
PASLARKLAIKADSTYYDWIEREVSHRFDLICCTTQEEFRREKKAITPAG
QIKSPGERHEKDDRHRLDDRSRYVLGWSNAAKIAALEAKAKVQQRELTKL
AERISTLQQEQKALKERLTILSKLDEYSDFNDLDWQPLAVAIARLEKEKE
ELEKTSNILQTLTEQLALLEQELQKTEQQLDDRKDKRSKIEQKISSITEL
QQQTAALLAEAGDEVSNRFALLQAMRQEAFGDQSLTVESCDNREREMREW
LQNKIDSEDRKLSRLGEKIIRAMTEYKEEWKLETRDVDVNIAAGKEYRAM
FEQLQADDLPRFEGRFKELLNENTIREVANFQSQLARERETIKERIVRLN
ESLTQIDFNPGRYITLEAENSLDADIRDFQTELRACTEGTLTGSDDAQYS
EAKFLQVRRIIDRFQGREAYADLDRRWTAKVTDVRNWFVFAASERWREDD
IEHEHYADSGGKSGGQKEKLAYTVLAASLAYQFGLEWGAVRSRSFRFVVI
DEAFGRGSDESAQYGLQLFAQLNLQLLIVTPLQKIHIIEPFVASVGFVHN
QEGRCSVLRNLTIEEYRSEKERAIA
>Cag_1336 conserved hypothetical protein
MINHLFLPNQRDDYDSPWKEAIERYFPEFMALYFPTAYAAIDWSKPYHFL
DQELRTIVPQAENGKRIVDKLVQVELLDGKESWLYIHIEVQGNRETDFEK
RMFTCNYRIFDKYGKPVASFAILTDKDCNWRPTSYSYAFAGCKLTLEFEV
AKLLDFEPRMEELLASNNAFGLVTAAHLLTQKTRENMLQRLDAKSQLIRL
LYNKQWTKERVRELFRVIDWFMELPQELEHQLQTEIYNIEEEQKMKYISS
IERYAMEKGWSEGIEKGIAEGLEIGMEKGIEKGKLEVAERLLGVGMNIEQ
VAELTGVSVAQLRNKR
>Cag_1604 conserved hypothetical protein
MALRKNIGESYTPTYFLASLGNGGLAVTFFMFLMFMIPHKGRPMPVFEDI
VAALQSTLPIQFLTIVSLVGIIWFSAQHYRMLIWNIRQYLAFKHTPAFNR
FQTTDAQVQLMAIPLTYAMAINVMFILGAVFVPQLWSVVEYLFPMAMGAF
FIVGIYSISIFYTFFSRVIAHGGFDCEKNNSLSQMLSIFTFSMVAVGFAA
PGAMSHNVIVSGVGIIMATFFLALVTTLGVIKIVLGFRSMLAHGINYEAS
VSLWIVIPILTLVGITIYRIAMGLVHNFDAVIHPWAHVIMFTALCGIQIF
FGLLGYGVMKELGYFNEFIHGESKSAVSFAAICPGVAFVVLGNFFINRGL
VAAGLIEMFSVAYFVLYIPLLAIQAQTIIVLMRLTRKLLKA
>Cag_0874 conserved hypothetical protein
MNYQQLLALFKETHCELQHKAARSVDTALVIRNWLFGWYIVEFEQGGAER
TVLYGKSLINRLSQELKSLGLKGISPTSLKQCRTFYLAYEKIGQALPDQC
RGKLRISEVLQALPAKSENELPEIQQALPVISFEVLHNIPQIVHELSTTL
AGSFKLGWTHYVALLTISNADERSFYEIEAYHNSWGARELERQIAASLFE
RLALSRDKEEICQLAQKGQVVEKATDIIKNPFVLEFLGLEEKSSYSEHAL
ETAIINHLEHFLLELGKGFLFEARQKRFTFDNDHFYVDLVFYNRLLRCYV
LIDLKRDKLTHQDLGQMQMYVNYFDRYVKTDDELPTIGIVLCHRKNDALV
ELTLPKDSNIFASKYQLYLPSKEELKRELEEAAGIKH
>Cag_0521 conserved hypothetical protein
MEDNMSKEKAMVLDEYESEILEAFENGKLKPVKSKTDFQAIARDTMKKKR
EINIRISENDLSALQRRAEKDGIPYQTLIGSVLHKFACGFLKEA
>Cag_1605 conserved hypothetical protein
MAQKTINLLFIGDVVGTPGLQMVSRMLKSFITKHRVDFVVCNGENAHQGK
GLSAEALNQLLEAGVDVVTGGNHTWSNFNFFETLKTHPKVLRPQNYPKGT
YGKGYAIYKLPNGLGDIAVVNLQGRTFMYTIDCPFRTADWVLKQIKEQSK
ESVKCIFVDFHAEATAEKVALGWYLDGRVSAVIGTHTHVPTADERLLPKG
TAYCTDAGMSGPHQSVIGMQIKSATDRMLYQTPHKYECAEDDVHFSAVLL
TLDLGSGKALGIERIFYPEFERGTVVR
>Cag_1134 conserved hypothetical protein
MPQKASSNPAWIGYAGLALGLLLIGLLFSQLDLQRSFALIADTGWYALLI
LLPFGALHLLETFAWQHLFPQTSGRVPFVRLLKIQIIAETVSMTFPAGVA
VGEPLRPFLCHRLLGIPVPLGVASIAVRKLLLSVAQGVYTLVGSLFGFAL
LQQLSPTILRFNGLGYIMVTMGLSVLLLFLFVLILLLNGNVAEKVHSLLM
RIPFEKVRQKLLAHEAGFLATDKALQAFRGNHHPRLWLVLLLYVTAWCML
ALESYLILQALGIQIPFMQVLTIDVALTMLRTIFFFIPSGLGVQDVGYLL
FFQALGIPEAVVVGGAFVLFRRLKELLWYALGYVLMFFSGIHLGDAASLQ
GEAE
>Cag_1542 conserved hypothetical protein
MEKIDVNVYGNSFPLRSARRELTEKAARDVDGVMRLFAEKAPTFGEAKLA
VLAAIQFAERKIELEEEIGALRQQLGRLNLFIGQHVE
>Cag_1436 DedA family protein
MLDAVVLWLQSADPSLLLFILFITAFLENVFPPIPGDVPIAFAGYLLYLQ
GGEGFTQSLLWSSLGSSAGFMVVFWLSKTVGQKWYGSEAQPHSSRLAKQV
LRLFPPSDMELLRQKFAAHGYVAVLANRFLFGSRAVIAVMAGMMHLHAAG
VFAASLVSAVLWNLLLLSGGFLLGSQWQDIGNYLVLYSAPVSLLFLGVIA
FSVWSFMKQRKKHHHSDH
>Cag_0844 conserved hypothetical protein
MSWTTPAELKRQVQKLWDRGMLLATFCNGKALFPRRLMLKAPDARQLSTS
FPEVREWIAQLSNAAKHYRIVWRTINHRILGANELPAEIWIDSLDNALLL
IGKQREAQQFAAMVTLTRTMQPALLPWLEKRPLRALELAPEWHRLLSIVA
WRITHPKPAIYLRQIDLPGIHSKFIEQHRGVLGELFDLVLPPEEIDTTAI
GVGGFCRRYGFQDKPLRVRFRILDPALALLPTVSDHDITVTQATFACLEI
AVTKVFITENEINFLAFPNVPQAMVIFGAGYGFENLASVKWLHDCAIHYW
GDLDTHGFAILNQLRRFFPHATSFLMDSKTLMEHQALWGIEPSPETGELT
RLTAEESALYDQLRQNELGHHIRLEQERIGFEWLVGALGRGTEKAAV
>Cag_1153 conserved hypothetical protein
MLYFAHPEEVLSHWVSRVCGGSPSDIAALYHPNAVLIPTFSPHTVTTPEG
ILNYFSQLATRQGMGVRLHNKALRTQPLSETLHTISGIYSFEFEVDTVLL
SFPSRFTFVVDCALKTPIIHHHSSQVPRNLS
>Cag_1404 conserved hypothetical protein
MSNVVCSCLLGDNAHIRLRLSQLLHSEVEALYSNEPPLGEGQPEICEVEL
QGCRRDFFVNASGVPCVVGQQVVVESDGGYDYGLVYSTGAIARKKLQLKG
LDKQGIEWSSVVRVADEHDARAIEELQRRQAEIREVCLAKIKRHELDLKL
VDVELRMDQQKLSVYYTSSHRVDFRMLVRDLAGEFKARIQMVQITTREEA
RRANAFGPCGNLLCCSSWIQKIQANPFADKTHYSENPSNNDSHTFNMTGL
CNRPKCCIGFTTRQDKNGGRIGGSCCSQQQPLPTVGTLLSTPDGQAQIAF
VDAQKKLVVIRYQHNNQTRRFPLDKFNALFTRQ
>Cag_1590 zeta-carotene desaturase
MKVAIFGAGVAGLSAAIELVDRGHSVEIYEKRKVLGGKVSVWKDSDGDSV
ESGLHIVFGGYEQLQSYLKRVGAEDNYQWKDHALVYAEQDGKQVAFRKAL
NMPSPWAEVVGGMRTDLLTFWDKISLLKGLYPAITGDEAYFRSQDYMTYS
EWHRRNGASEHSLQRLWRAIALAMNFIEPNVISARPMITIFKYFGTNYSA
TKFGFFRKNPGDSMIEPMRQYIQSKGGRIFVDAKLSRFELNSDETIKEAV
LRDGHKIEADAYISALPVHSIKKIVPTTWLKHKYFRNLHEFVGSPVANCQ
IWFDRKITDTDNLMFSQGTIFATFADVSLTCPEDFQQGIGSANGGSVMSL
VLAPAHQLMDMPQEVIIDLVVKDLHDRFPASRNAKVLKSTLVKIPQSVYK
AVPDVDQYRPDQISPVRNFFLAGDYTDQHYLASMEGAALSGKQAAEKLMS
KIGNS
>Cag_1240 conserved hypothetical protein
MNHGSRRDVAIIIQNQKFKIHNHYQMARTLISFDWAMKRLLRSKANFDIL
EGFLSELLGEDITILDILESESNKENKSGKFNRVDLKVKNSKGEFIIIEL
QYDREYDYLQRMLYGTSRVITEHQKESESYDTVVKVISVNILYFELGQGT
DYVYHGKTSFVGIHDHDTLQLDSRQKERFGKLQIHSLYPEYYIIRINNFD
SIARNTLDEWIYFLKNSEIPDNFKAKGIQKAKESFSVLRMSVEEYQAYQA
FQDELRDEASYVETKRIDALYQGREEGFAEGMEKGKEIGVLEGMEKGKEI
GVLEGKLEIARRLLASGISKKEVAAITGITVDLL
>Cag_0292 conserved hypothetical protein
MNTFQLDYRHIQTPKKGFSALFRDYTADAPERETLIAECFHLDYRKSADY
YRQLGLLSARTFQRESLVNMLLRQNSRFGGGERQQQAIEKLRSPRCMAVV
TGQQLGLFTGTLYTIYKALTAIIVAEQQKSLFPDYDFVPIFWLEGEDHDY
DESASTTIFAENQLKHFTHQPYRRLPDQTVANSSFSDDIRTIIDEMVALL
PNTPHRTMVAEMLHECYYPGCTFEISFASTMLRLFRNYPLILLSPQESDF
KKLAMPIFFKEIESAPAVSYQVIAQSTRLESLGYSAQTKPRAVNFFYVNQ
HGQRQKVEQPSSDTFQIVPEKQRLSRHQMLELCQDHPERFSPNVVLRPIV
QDYVLPTFATIVGPGEINYMAQYRPIYEHFGITMPFLVPRGSFTLVEPKI
SRVMDKLIHVTGQPGFSRKNIYNAVFSNLQQVKKNAIGEAEHPQLATMFE
QAKDEMRQALERLNHTLSTIDPTLEPLLAASIVQSAKLVETIEQKTWKAS
RRKHEELLEQIQKAETALFPEGVPQERVVNIFYFIAKYGMGILDDLSNLL
KGYASDAHIIAELQG
>Cag_1026 C-type lectin
MSIIQEHESNDLIANATSLTLVAESLGSTTSVADGVGTQSNPTQYNSWSD
PDYWRIELLAGYQISFSILTPSSSLQPYAELRDAANNTVAYATADASGES
VHLVPYTLTASGSYYVVVGKNYYTSDGGDYELHVETAPFIKPEDDDNNTI
DKATELVLSENPASSGLLLGVGMGEQDPATMYNHWSDPDYWRIEVLAGDL
VSITVQTPDSELNPYVELRNAADGNLVGSNDEGAGNDAFISSYEIKDSGS
YFVLVGKDYYSGGGTYSVQVDVARGIQMESDANYDNGNLTQANALHLSAE
ITNTDRYQVATVAGAIMLPEYLTVDVDVFALGRLNAENTVELTSLLPSTS
NLTPLVTLLDSAGNVVTDSDGNSADGNFSATLTKDDDYYVQVERGYQYNG
HTYLLTNNGMNWTAAQEYAESLGGHLVTIDDASEQQWLFSQFGSTNSWIG
LNDEVTESIWQWGNGATSTYRYWGDGHPYGGEYYNYAYIATDGKWYSGNE
TWGYYALIEIENTASAANSASSSTSNYLLDVRIEDSVAPRVESITLPANG
SSVDDPIGATITVTMSEKLEPATVKAGLREVWVRDGHYYTVTDAAVSWTD
ANTAATALGGQLVNIESAEEQAWLLSMLDGRYGDVWLGLSDTATEGTWVS
ANGETVWVYGAETNSAYANWGDSQPYYQWDENYDYAAMNGSGKWYASYGS
NAMRGVIEIVDNDSDSDGLSDALDPYDNDPLNAWDLREAGADGVFDTPDD
VIHRLLLKEAYNGGTAVNLLIEDGTLGAGSYRFTANTTLTDIVGNVLDGD
SAKIGSEPYVHYFTITHPAGVTAEGGRNNILQNATTLSLNEDPAGKGLWL
AHGVGNQDPATIYNHWSDPDYWKLELQQGDLVSIYVNTPDSSLDAYVELR
NANDGYVASSNDDGASNDSFISRYVVTESGTYYVLVGKGYYSGGGAYELQ
VDVAKSIQMESDANYSNNPLSGANLLSFVADGNNQVATVAGAIMESEGQP
DIDVYALGRFNAGNTITLTANLPSTSGLSPIVTLLDEAGHLLLDADAHYA
DGTCTITLEDNGNYYAQVEKGYQYNEYTYIVSSTTMSWEAAHIYADMIGG
HIVTINDADEQQWLTEQFGWTSSWIGMNDAALDGTWVWDDGTTVEYQNWG
SGHPYTWSNDYNYGYLATDGKWYSSYNGYTYRALIEIEIPHTAEASTNSL
NTSYLLDVSIEDDVAPRVESTTLPTNNSTVDNLVGATFSVTMSEKLDAAT
VKAGLREVWVRDGHYYTVTDAAMSWQDAAIAAQALGGQLVNIESADEQAW
VQSMLDGRYGDVWLGLNDAATENTWVSADGSTTWIYGAETNSAYTNWGDS
QPYYQWDENYDYAAMNSSGKWYATYNNSSYNLMRGVIEIVGSDSDNDGMP
DALDPYDNDLYNAWDLREAGADGVFDTTDDIIHRLLLNGTYVDSTTVNLL
IEDGSLNAGSYRFTVNTTVTDIVDNTLNGNVLNGDGDSTAGDRYEHFFTI
APPAGVTTEGGRNNISPNATALTFSEDPSGKGLWLAYGMGNQDPATMYNH
WSDPDYWKIELQAGDYVSVYVNTPESDLDPYVELLNTNNGGVAWSNDDGA
HEDSLISRYAVTESGTYYVLVGKGYYSGGGSYELQVDVARGIDMESDANY
SNSSFGNANRVTLVDAGVEQRATVAGNIMAPESTFDRDIYALGRLNAGNR
VELNTAMPSSSDLMPVVTLWQADGTMVADSDGNYTDGQFNALLAADSDYY
TQVERGYTFGEHTYLLSSSNMTWSAAKTYAESLGGHLVTIETAEEQAWIN
ELLGSSTSWIGIYDAADNGTWVLLDGTQPTYTNWEASQPSTWDNYNYGHI
NYNQLWYAGLESWGLPVLVEIDTVGTLPSSSALGSEYILDITVTDGVAPR
VESTTLPTNNSTTDNLVGATFSLTMSEKLDAATVKAGAFEVWKYNGNYYA
LTESAMTWVQAEAAAVALGGHLASVLDANEQAWVQSMLDGRYGNVWLGLN
DAATEGSWVYSDNNLALYTNWASNEPYYQWDGNYDYAYMHTNGEWRNTDG
SSTMRGLIKLNDTDSDKDGLPDAFDLYLTDSRNAWDLREAGADGVFDTAD
DIIHRVLLNGAYENGTTVNLLIEDGSLGAGSYRFTANATLTDIVGNALDG
NGDGTAGDYYQQFFTIAPPTGITVENGRDNISTNATALALHEDPSGKGLW
LAHGMGNQDPATMYNHWSDPDYWKIELQAGDLLSVYVNTPESNLNPYVEL
RNATDSQISYNNDDGAKEDAFISRHLIEQSGTYYVLVGKDYYSYGGSYEL
LVDVARGIDMEYDPQYRNDSLGGSQSLGFSAAQNYQLSTVAGAIMGYEYG
YDYDVYNLGRFNAGNTIELTITQPSTSSLVPLVTLYNAAGTPVTDANWNP
ADGTFNATLTHNDDYYAKVEHGYTFAGHTYVLTHDNMYWSDAEAYAEALG
GHLVTINNAFEQQWLANTFSWANPWIGISDSSNTNEWHWSDGSQSLYRNW
GDSQPDNYYDYGYLNPNGYWYTGANNWNYRALIEFDSMGIIPAATDPTVT
NYLLDVRVEDSVAPRVEMVSLPANNSTIEHLVGASITVTMSEKLAPATVQ
AGMFEVWEHNGHYYTLTDRAKSWQDAEASAVALGGHLVSINDATEQAWLQ
SMLDGRYGNVWIGLSDAAAEATWQLTDGTTSTYANWASNEPYYQWESNYD
YAYMNSNGQWGASYNTNSMRGVIELVGTDSDNDGMPDQLDTYRNDPYNAW
DLREAGADGTFDTADDVIHRLILNKGYSSGSTFVNLLIEDGSLNAGSYRF
TANATLTDIVGNHLDGNANSIGGDAYMHYFTIAPPAGVIAEGGRNNTMQN
ATVLPLTQDPAGRGLWLGHGIGNQDPGFAYEYGIDVDYWKVELQKDDLVS
ISVNTPDSNLDPNIALYDANGTYFDYSNNEGPDYDAFISRYVVTTTGTYY
IKVDKDYYSGPGSYELQVDVARGIQMEYDRNYDNDPLSGANVLTFTQAGT
QRIATVAGNMMSAGDGQVDDDTYALGAIEAGNTILLGITIPDMGDLRPVV
EIYNAQEQLVGLDPNPSSGVARFDVITTGTYFARVLPFTGSGSFGHYLLD
AAITPTVEAQFADLAIDSKSVIVPATPQSGSTITIAWSVGNYGKIATEQS
TWQDRVVLSLNSRLGDADDLLLATVEHNGVLDPATSYNASVDVVLPTLLE
GNARIFVTSDVADVVEESFFEINNTAEKEIVVSLTPYADLHVAEASMPST
LQADTTFIVTATIANNGTGAPGTGIPNESVNTWVDKLVLSGNAVLGDADD
EILETLAHTGGLDAGSSYEVSFDVTLSAEQLQDHLFIVSDSGDAVFEAYN
SGMNERRVNHLPEGSVVINGNALQSTTLTVTQTINDADGMGDLLYQWYAD
DEAIAGATQTTLWLDTSLIDKQISVVAHYEDGYGVEEAVESTPTEAVVAD
TTAPTVVSFAPTDNATEVGLNSSITVEFSEAIERGSGTISLHTGSPQGTL
VESYDVSSSYNLHIAGSTLTITPNNRLDDSTHYYVVFEEGSMQDMAGNDY
AGTVEYDFTTVVNHAPIIRIPHELSFADKVDYATGDEPYDISTGYFNNDE
WLDVAVVNSRSKTFAIYNNNGNGSFTLGESYNTPHMSFSIASGEVTGDDY
VDLIVSNYYNNTVSVWSNSEIGTFTETSSCVTANAPSDVRLADTDGDTDL
DIITLHQDSNSISIIKNNGDNTFANYVVYATGNHPSSLAVSDLNNDGFVD
LMVTNTVGDSVSVLINDTYGAFSEKVDYSIVNPSIVISRDVDADGDADMV
VGRALFGYVSVLKNNGDGTFTAQADYRLADNPASLNSVDVDGDGMLDIIV
GYRDELSTISVLKNNGDGTFGTPIDYPAGTKSYSIASGDFNNDEQSDLVV
VHYDTDTFTLHLNNSVEKTATAFTEQTPVAVSSNITINDPDGDASWNGGC
LQVQITANAESLDRLLLPTVAGNGVWLDSANNNALMAGELRIGEANVVGV
QGSAAWHFSFNEHATNALVQEVTRSIMFNNNSNTPSELERTITFTVTDTF
GDWAAVDQQITVTADDDPAPITHDLYGNITFWKSGNPLNDVQPTLASKPM
ENHDQEVAFRNLQQQSDGSYTVELWASTTQTDIHDFQLQLYFPESTTSVS
WQASSTLNGWISVLSDQTTGQVVLGGVSTAQTLQTGSVQIGTLSFSAPDI
PDNFTLTAASGWIGTENIVPTSILCTATDTTGDYRFEDVTDGWYRVAGES
NTNMLANAVTTEDALAALKMAVELNPNEPNASGLLDPVSPYQFLAADINR
DGKVRANDALNILKMAVNYANAPTDEWIYVRQDHQLADMDRSHVDWSFAE
RAIDIYGDMNVNFVGVVKGDIDGSWGMVP
>Cag_1967 conserved hypothetical protein
MPTHPDAIGYIRDLGHSEGKLWFEMICDLAAYRTTNLSPTDFEILVQLFT
KQVSYLRQPPPIIATTVSVETVFPSFERLETIGPFNGFKRLGNSLVACFP
KRATIIFGANGSGKSSLCEALKILASNDAPRRPLHDVHCSTTVTPSFKFK
FTTDTTAQTWSSTDTYGTRLSVIKYFDTGIAIHNIKNSVQPGRIIELAPF
KLDVFEIAKNHCEVLRTERQKRQRENTDQLTIVIEQIRAKFESFEGTILA
GLQISAKSILEAEIKLGENYSEENGLDEKLKKKSDLEKATSEEGIKLLKG
EIAALKALNAEIDPILTASEKLVEIDPVAKSNSLKDKETELKVLAEALIP
SGATLDKLMELIRPANEISILNSSELEECPLCKQPLQTRELELFKQYYTL
LTGELDAAITELRKILKTSEKNLKVISDSTPDEWAKGSVLSQDLIDAIKD
SGRAIQKYFKFGENISQNCKDAAVLLRKFSIKLSIKTEEKETLIDKSGKD
REELLKQLTQISNECKKLLYAKCIADNMDLVKNAHGKMLNATFWDTNLPN
FSPVLRKITSTAKKAHKELVVEDFKNRLNAEYLALAEKDMSAFGVELKDV
GSDVAVTVDHHVAGQRIESVLSEGEQRIHALALFFAELETCEQQVIVFDD
PISSFDYNYIGNYCNRLRDLIQQYPDRQIIVLTHNWEFFVQIQTTLNTAR
LNQHMSVHVLESCVAIDEYSENIDELKTNIDAILLGSGEPTKQQKEAMAG
KMRRLIEAVVNTHVFNKQRHQFKQKNQQVSAFDDFTKVVPLLLSEAQTLR
DLFSKLSITEHDDPRNAYVNTDKSMFLTRYNAIKSIETAIIGRK
>Cag_0831 regulatory proteins, AsnC family
MLNTSTIHITPEMLSLIAHIDEFKGAWRALGVLAPERLSALRRVATIESI
GSSTRIEGSKLSDQEVERLLSNLTINTFETRDEQEVAGYAELMELLFTSW
QYIPFNENHIKQFHQLLLSHSSKDARHRGTYKTTSNSVAAFDENGKQLGI
VFQTASPFDTPFLMQELIAWVNQEREAKQLHPLLIIAIFVVVFLEIHPFQ
DGNGRLSRALTTLLLLQAGYAYVPYSSLESVIESNKEAYYVALRQTQGTI
RSEVPNWQAWLLFFLRSLVEQVHRLQNKIEREHVVLAALPELALQIVEFV
HQHGRITIGEAVKLTDANRNTLKVHFRKLVEQGYLKQQGSGRGVWYERG
>Cag_1637 hypothetical protein
MDMQKDINYRLALAQGFLQEAEDSYTTQHWRACVSSAILVIENAGLAVLM
LFGVSPMTHKPGMHLKHLVSEGTLHADLAELIAQLLPYLEQYDSHEKMLA
KYGDETTYELPWQLYDAAKATTALDAARNAVRISTTMAERGV
>Cag_1562 conserved hypothetical protein
MEQKEVVAHATFVQLVESIRNVHQELIAQANRAVNVSLTLRNWLIGYYIA
EYELQGKDRAEYGDRLFSELARALKSLSNCNRRQLYRYYRFYTFYPIIVE
LLPPQFKSLSLLSSIEIVGTVSPLSRPSSTASLNIAKKLSYSHFEELIAL
DDPTKRAFYEVECIRGNWSVRELKRQIGSLYYERTGLSFNKTKLAELTLQ
EREMQPLFNIRDPYIFEFLGLKPVEVMSESHVEQQLIEKLQDFLLELGHG
FCFEARQKRLLIGDEYFFIDLVFYHRLLKCHVLVELKLDHFKHEHLGQLN
TYVSWYRQHVMSKGDNPPIGMLLCTSKNNSLVEYALAGMDNQLFVSQYQL
ELPKKEEMQEFIATQLRELGE
>Cag_1092 conserved hypothetical protein
MKIRRFELNAFGPFSGNVLDFNSPTPGLHIVYGANEAGKSSAMRALYAWF
FGYPLRTTDDFLHKKSNLLLSGTLENKQGEVLTFSRRKRKERDLFDGNDQ
PLEAQTLEHWLLGMDRELFQALYAMSHESLALGGQGILDEEGEIGKALFA
AGAGLASLRPMLAHLQSEADELFRPQGARQQLNEALARHRTLQQQLREAT
LSGSVWQEKKEALEQAEAKRNALQVRKQELETEKHRLERLQHALPELADR
KHVVEQRAALGKVPLLPADFAAQREALQKQLHLAQHNYEREQERITALQQ
SISSHHVNHALLEQAAVLDELHQRLGEYRKGKNDLPQRQSQRAAALQAAM
DILRPLWADLAGSEEAMDATDDSPTLMQRLQKALLKKKEVQRLATHFEAL
VSSGKSARQQVQESEQALEQLQRDLAALPMQGDSNQLEQTLRIAERNAAL
DRDIAELEQSLRHSEQECHAMLQRLTLWHGTLEQVPTLPLPLPETISRFD
EAFQRLQSDTLALRAQAEGVEKRLQEITTELEQLAAESHVPSVEELQHSR
AERNKGWELLKRQWLQQEDVTAESNAYSAAHPLHEAYEIMVNAADQLADQ
LYREVERVRRHTALTAEAKKLHHQHTHLHERLATLATEEAALHTAWQEQW
RTTEIEALPPREMVAWVATFEALRQHVRERDKLLLERNVRHKRRQEAHEQ
LHQAVEAVAPPFPIKNNELAPLLQYAQQQLSRMQAVEKRGENLLNRQRDI
THNLESSRQLLNRAEEEHREWRKEWIAVTSALELTGQAQPMEIVDSVEAM
QQALTKLKEAEEFRKRIEGIERDMRQFELDVATATATLAPDAQESDGAKR
VAMLHERLDEARREQTLLQREKDEVTQHKEALRRHAATLQEGELQLTAMC
QQAECATPTDLPIAEARSQQAQELHEKLMAVETRLVRIAGSASPEALEAL
ETEAATVERDALPSHIETITTEIHQEIEPEIDQLNELRGRLRNELKQMEK
EDGNAADLADAAQRELARIRRLTNRYIRLRLAETMVRNATERYRSSSERP
VLSLASTYFATLTLQSFVALDTESDDNGHIALMGVRTNGNRIGVEAMSSG
TRDQLYLALRLATLQWRMQQSEPMPIIADDILITFDDARSRSTLQALAKL
GESCQIILFTHHRTIADMASHRAFKGTVFLHTLGTTNESEHNNAETSAQP
PKPENLTLFG
>Cag_1596 conserved hypothetical protein
MPTNQPPLYALAFGAHPDDVELSCGATLLKIMREGKSVAVCDLTRGEMGT
LGTPESRKAEAEAATALMGYSARTTLDLGDGKLHYCDENLDRIISVIRHF
RPSVVFANPPDERHPDHIKASRLVTDACYYAGLRQRPTTFEGTLQEPHRP
RHLLYYIQFRHLEPHVVVNVSETFEASRHAILAFASQFYREGMSDAPQTM
INRPEFLTSLEARARYFGEQIGVLYGEGFRLSAMQGVAHFSSLFPEL
>Cag_0892 internalin-related protein
MALRLFFSIFKELKSSAVPQSPLVPLSSDGIFWYQKALLPIAIKKMEVRQ
STNQLVISPTLNALLIVEQHTYNQCPVCGFPLSMNSAICPRCGNDILEDI
SSLDQQSLERYHKHLENKKAEWYARCLTDQITGGDNPPLSAEHQECPAGR
QKPHALFNSDDELAFFTSLNRADILRDTNLRKKWWQSITADWQDVVRFTL
KINHDPSDSDLLAFFDSTNLRCDDRRIHSLLPIRVLEKLQQLRCDESPIE
SLEPLAHLTLLQRLYAFDCDFTSLEPLRNLTHLKLLWISSTEITSLEPIS
NLINLEELYCSETDITDLEPLRKLINLEKLSCYKTSITSLEPLAELENLI
ELGINHSDINDLTPLAGLINLEYLRCSKTAISSLEPLRNMVELRELSIAH
TNVDSLEGLQGLENLEELDITNTLVSSIEPLMGLEYIEKLELSVGTIPDE
ELERFVELHPDCNVVAK
>Cag_0785 conserved hypothetical protein
MHQNSDENLQADIVSEQSLQVTKSLQAFSGPLPHPDLLREYENILPGCAE
RILVMAELEQTKRHEITERESKGLLDHLARGQHYAFAAVIITFASAIYLA
MNGHEFIASTIVTLDIIGLVVAFITGKALSLKQTKHNES
>Cag_0452 fic family protein
MWEELYILKKRYLEMGLSEAIDYEKFSMISIVYNSTKIEGCSLTESDTQL
LLENGITAKGKPLADHMMVKDHYAAFLFLKEIAKQKQKISIELIKKVAAL
VMKHTGGLVNTISGTFDTSWGDFRKAQVYVDRKYFPDFSKVENLLLKLID
NVNQRLDTVFGNEILKLSADIHYNLVNIHPFGDGNGRTSRLMMNYIQMYH
QEPLIKIFTEDRAEYIDALNKTEELEDISIFRDFICSQQIKFYKAEIKKF
EQKDKGFSLMF
>Cag_1594 hypothetical protein
MAAEEQKTAAGLGTPTTPAKPASGGVGDGDMAHLIGNMGILIDSTIVTVQ
SAVSTVSSTTEQIMQSVTTAINSEPVQGVINSVNSVTDKLVSGVANSVNP
NSLQEVINSVSNALSSSPVQGVLDSVSSATGQLLEGVTTTINSGQNSFQE
LGKLWTGLLENLNAMDGANQVQNLFNNVSAGLGQLLGNLPQMPMIPNMGQ
NRDALPKKDRTEIHFTPTTPVAQAAPMAPKPAPAPQAPAAQAAPVAPKPA
PAPQAPAAQAAPVAPKPAPAPQAPAAQATPVAPKPAPAPQAPAAQAAPVA
PKPAPAPQASAAQAAPVAPKPANPSQQGGPRVIVAAPQKKK
>Cag_1996 putative segregation and condensation protein A
MFTINLDEFEGPLDLLLFFIKRDELDIYNIPIARITTDFISYIHAARQLD
LEVAAEFIYMASMLMSIKARMLLPRPPDDPTAESDEFDPRTELVERLLEY
QRIKEAAEALRLRADERALLLARGWSELQEAVSAGNEQEGEDELLQQPTL
YHLMLACNAMLKRRPRPMTRSVADVPVTIEEQSAMILERLRYEPQLSFTV
LLASFSENLVVVVTFLAVLELCKSQRISVLINDAYNDVWLTLRVSQSN
>Cag_1464 conserved hypothetical protein
MQDRVERVTRYIDNVMANTEGTRAEGVYVVSVALKGRAGQQKLEVLVDSD
KGIAVEQCAWVSRRILEKLEEDDEPLSEEIAIEVSSPGLGTPLQLPRQYY
RHLGKLLHVRYRTPEGAEAEIEGYLQEAQLDGVNGGDSADSVIVLKPKVQ
GKRPRNAQPLENIRLPLSRIIKAVPEAEL
>Cag_1417 Protein of unknown function UPF0047
MKILTHTLTIPTCKPIELIEVTDQIKDLLMASELQQGQVTIISRHTTAFV
NINEYEERLLEDMEIFLKRLVPKDGNYLHNISPLDGRHNAHSHLMGLFMN
SSESIPFADGKLLLGQWQSIFFVELDGPRPKRELLMQIAGV
>Cag_0087 Iojap-related protein
MKSEHESNEAMVQEIEESELLAQRIAELALEKKCEVVKILDVRGLTSITD
FFVIATADSERKAKASADHILDELRTEEGERPLHVEGLDSQHWILLDYVD
VVVHLFLPDERRFYDLESLWSDAPTTLVG
>Cag_1243 hypothetical protein
MANMQLPPNQRDNYDSPWKEAIEHYFPEFMAFYFPNAHKAIDWAKGYHIL
NEELRSLIPDAEISNRVVDKLVQVHLLDGNESWLYIHIEVQSFWEADFPK
RVFIYFYRIFDKYGKAVANFVVLADMNPHWLPTSYNMETIGSKLTLDFSV
VKLLDFEPHMQELLASNNVFGLITAAHLLTQRTHRNYEARLEAKKLLIQL
LLQRQWEQERIEQLIRVIDWFLSLPKELRQKLKTEIYQMEEEQKMKYITS
FERDAKEEGILEGREIGVLEGMENGKLEVAMRLLGIGMSVEQVAELTGVT
ANVLTAKLKS
>Cag_1295 MCBG protein (microcin resistance protein)-like
MFNHFIAMECYQQTFEKKDFYENPLTMGTYEECHFHGCTFINVDLSHYIF
INCTFDGCDLSMVKLNNTSLQEVLFCNCKLLGVPFSDCRQLLLSFRLERC
MARLALFCRLKLKGSLWSECMLQEADFSEADLSNALFERCEFPQATFFHT
NLEGADFRTSWHYSINPATNRVRKARFSLAGIAGLLESFDVVIE
>Cag_0462 conserved hypothetical protein
MQKIQVDILGLSTSPHANGAYALILYEVDGKRKLPIIIGGFEAQAIALKL
ENIKPPRPFTHDLFKSVADVFDLHVSEVIIDELHHETFYAKVVVEMDGEV
HEVDARPSDAIAIAVRFRAPIYVTDDIMEEAGIQEEQTVPRSAAGPVAAV
LSSPTSATAQHLRAEQRKATLKELQAHLEEAINNEAYEEAARLRDEIARL
KP
>Cag_1144 conserved hypothetical protein
MKCPACHTELLLAERQGIEIDYCPSCRGVWLDRGELDKIIERSASYSSEK
RYEKEAYHQHDKESHYRYHNDDRDDYIDPKSGKRKRKGGPLGFLEDLFD
>Cag_0578 conserved hypothetical protein
MSKELSHRADYQKLLGSISTLYTSGQRRAYQAVNSVITETYWQIGCYMVE
FEQCGNIRAEYGKALLDNLSRDLTLRHGKGFSRSNIIRFRQFYLAYPKGA
KPSHLLSWSHWVELLKLDDPLERSFYEQQAIREKWSVPELQRQKKSSLFL
RLAAGKDKAAILQLAEQGQIVEQPADLLRDSFVFEFLKIPESSEMAELDL
ESRLCDHLQPFLLELGKGFTFVGRQYRIPINNSNYRVDLVFYHRILRCFV
LIDLKINEVEHHDIGQMNLYLGYFAAEENTPDDNPPIGIILTRQKDELLV
EYATYQMNSQLFVQKYQLYLPDREELRREIERALWDIEESNSNKEKKNE
>Cag_0287 conserved hypothetical protein
MDNALHWINCLQLQPHPEGGYYRETYRSSGNYSFSDSAPQTSESTFFQGE
RSYATAIYYLLQSGERSRLHRIHSDELWFYHAGAPLTVHIFPETGEPSCF
TLGLDVAQGEVPQAWVPAGAWFGASGASLKNASVDDYALVSCVVAPGFDF
RDFTFADRHELLRKFPQYSTTIERLT
>Cag_1094 conserved hypothetical protein
MLETLSFFLLLALFGLLIFLAIRLLSAAPKQAELARLQALEADALRNMER
LTALAEEHEALKIRYARLEVAYQNEQQSVAEKEALMLASEERLKKEFELL
SLRILEERGKALGAEQRERLDTLLLPLRQQLEAYRQRIEEVHHADTLLSG
QLIEEVRQLQALSSRVSNDAQQLAHAIKGDSKVQGNWGEIIIERMFEASG
LEKGREYLAQESFRDSDGALKRPDFMVLLPDNKAIIVDSKVSLTAFERYS
ALSDPDEQQIALREHLQSVRRHITELQAKNYHELGGNRTLDFVLLCIPIE
AAWQAAMQADPALLYTLAGRNVVVCSPTTLMMTLKLIAQLWRREHENRNA
ELIAEKAGRIYDQVALLAHSMLEAQKKLSNVNDSFEQVLKQLKTGRGNLI
GRVEEIRKLGAKVNRQMPLDVTAEALEE
>Cag_0144 ATPase
MITKIQIKNFRQIRDQTLELKQVAVVIGPNNGGKTTLLQAISLFALGLRA
WGMQRINKKSKAQKRTGVAITLEEVLNIPISDFKELWSDLNVREGIINEE
GKPTSKNIRIEIHAEGYTQNVFWKIGFEFDFGRDSLIYVRLTQDENGELY
DFPEVLLEEKIGYLPSVAGLKPIEDKLEIGSVLRNIGNGNTSDVLRNICY
ILYNASDKELWQNFVKQIDELFKIELNPPQYYSLTGLLKMSYNEGAKKHI
DLSSLGSGAKQAILLFAYILAFPNTVNLLDEPDAHLEVIRQSNIYDRISD
IAKKNNSQIIIASHSESVLNRAFTKDQVISGIFGEFEEVSNKKYITNALR
TYGYEEFIIARQRPYIFYFEGTTDLDFIKAFCKRLGRSDVFRFIEDHVYP
YPVANDVVRVRNHFDTLKKFIPTLRGFALFDNLHKNLESNQPDLLLRQWG
RNEIENYIPIPQTLFAFIESADYGELWKNRFKELVESNVPPAAFIDMNHS
FWKKTKMSDDFLTPLFEGFYAEAQMHKGLMDKSKFYQLVDYVDVALLQQE
VIDLISAMYVHFCVK
>Cag_0749 conserved hypothetical protein
MKKLIKLHIGGNEAKEGWEILNTVGKDNVDYIGDIRNLSQFESESCSVIY
GSHVLEHVSQREMLPTLNGIKRLLIPGGKLMISVPDMDILCKLFVHQNMN
IQGKFHVMRMMFGAQIDSFDFHYIGLNFDMICYYLSKSGFVRITRVKDFG
IFNDTSTYAPFGVPISLNVICYK
>Cag_0913 conserved hypothetical protein
MQKFQTRFLEEADKFISELDSKAAKKIFYNIDLAEQTNDPKLFKKLQNDI
WEFRTVFAGLQFRLLAFWDKSDNTDTLVFATHGFIKKVDKVPKNEIDRAV
RIKEQYFENKLKK
>Cag_1399 membrane protein, putative
MSKMTNIFNIKSDCESFDTIYKTVNLDSTFNGSRLSILICAIILASVGLN
MNSTAVIIGAMLISPLMGPINGIGYSIATYDFLLLRKAIKNLLFAIIASL
ITSSLYFAVSPVSTAHSELLARTSPTIYDVLIALFGGFAGTISLVTKLKG
NVVPGVAIATALMPPLCTAGYGLATFQFSFFFGALYLFTINLVFIALATI
LFSQLIPFPLKNIINDDKKKKINNFITSITLITLVPSIYFGYVLSEKEKF
NENAKRFIQSVTFFDGEYLLHHEINAGNRLITLIYNGQGLTEKEKAELQK
RANDFGLKNTTLTFQQGIKFSPISKEKSENEALKTEINRLAIALKQQTTK
QDSLQKQIYRGSIFLRELQPLFPQVTTCAFMESYRFSNKSNTLPVKTAYV
VITTNNSSLKKTDKEKITMWLQARTQNDSLRVIFE
>Cag_1449 conserved hypothetical protein
MCINMWGCSNPETERRTSNSMLSGADHPSQEGWNIYMVLSESGREKAIIQ
AGHGAEFEEAQHLDNGVTLQLFDSNGSNRTTITANKAVIFNNQDIEAEGD
VTIRSTSATGQTTRITTEYMKRTANDQMIRSDRLVTITRGEEVLRGNGFE
SDQYLKRFRIFRGSGEAVKQ
>Cag_1183 conserved hypothetical protein
MSSFFNCYEADTARFFDEVIAHDGTPRQHYHKMLQRFDQFSKEDIKARRE
VINIFFRNQGITFTVYGENEGVERLFPFDLIPRVVPAHEWQRIEKGLVQR
ITALNEFLHDIYHSQKILKDKIIPPELILGSQHFRREFVGVNPPLGIYIH
VTGSDIIRDHNGNYLVLEDNLRTPSGVSYMLQNRQAMKRAFPVLFDKYKI
RPIENYPQELLRTLQEIGQSARRNPNVVLLTPGIYNSAYFEHSFLARQMG
IELTEGKDLVVNNNRVYTRTTRGLEQVDVIYRRIDDAYLDPLVFRPDSKL
GVAGLVNAYRKGNVTLANAIGTGVADDKVIYSFVPKIIKYYLGEDPILEN
VPTWLASNPDDLKYILANLGSLVVKAANESGGYGMLIGPESTVAEQERFA
EMLVANPRNYIAQPTISLSRHPSFFHDTELWGCHIDLRPYVLYGKNITIV
PGGLTRVALKRGSLVVNSSQGGGSKDTWVVDE
>Cag_0316 conserved hypothetical protein
MAMAIEKTVRLAVLIDADNTQANIIDGLLAEVAKYGTASVKRIYGDWTSP
SLRSWKEMLLEHSIQPIQQFGYTKGKNATDSAMIIDAMDLLYTGKFHGFC
IVSSDSDFTKLASRIREAGLVVYGFGEKKTPSAFVSACDKFIYIEVLRAK
INDNEQIARKSMAELRKDARLVRLLRNAVEASSDENGWAHLATTGNNIAK
QSPEFDPRNYGYAKLRELVAATKLFDFDERVIGDGQSKAIYVRDKRMRDK
LKKADEKADSEIPF
>Cag_0858 conserved hypothetical protein
MLKSERFFTKFNCRRLIMTTVEKLYKTVQELPEPVISEILDFAEFLSKKR
PVINKGSNISLLDLVGGLENSIAFNGNPLAIQKQLRNEWD
>Cag_1211 conserved hypothetical protein
MPNNSPLTIFGASCKSGTYLLLIELVEPCRIVFGNFRKGEMFSLSAGMYL
YVGSALGNRSGYPLARRLIRHASRSGNNPPHAMRQTLLHYFSTTCNTTFT
PSPKKLRWHVDYLLEQPEATLSEVVIIQSPLRLEYALATLLEAMQETSHI
APHLGAQDCRDSTHLLRLRNREALFTELQQAIPAMLENT
>Cag_1629 conserved hypothetical protein
MEKKISKFSEEQLNKIISEFKKESKLYEYRTKDYPFEVIHAKFGDEEDLT
KSLYVPAYQREFVWKPDKQSLFIESVLLGVPLSPFLVSDNINDNENAGRL
EIVDGSQRIRTLIAFYENKLRLIKLDKLKEINSAKFKDLPKTLQSYLYNR
DFRIIVVENAEIEIRKDLYKRINTTSEPLSDAEIRRGSYSGDFYDLVIEL
SKNVLFKEICPISEQKNDRGEGEELVLRFFAYIDNYLQFKHDVAFFLNDY
LDDKNEKGFDESFYKSAFLSMVNFVKENYPLGFRKEENSNSTPRVRFEAI
SVGTYLAIKENDSLQKPNMEWLYSSGFKIQTTSDASNNPDRLKNRIEFVR
DGLLGKLDTNRLQNG
>Cag_1311 hypothetical protein
MSALSLRLPKTLHEQLRELAQEEGISVNQFVMLAVAEKVASLSTIDYLQK
RAERGSREKFLEILNKVPDIEPADFDKL
>Cag_0455 hypothetical protein
MKFTIEVDQEIDGRWIAEILEIPGVLKYGNSQHEAIAQAEALALRVLAER
IEEGEQLVEPISITFAA
>Cag_0364 putative signal transduction protein with Nacht domain
MVFTIRRHVVSLFIRIKRMSCHSGFLVLATLLFGYAGIAVAQEPSAIDSV
GVVTKQLKEFGWSVEDITLIGGPFGAVILGVWAFFKFFYPDRKKKTEERN
TVNQYAERYKETLKAQIEKQTLSTTAFESVSVSLADTFVPLRLSEKWRNE
TRFMPESLFSAKNSDAILTPEEVLQLALKKTYKRLLVIGEPGSGKTTLLY
YYALLCLEPNRAKELGVRIPELVFYLPLRELSKTKDRYNLLPENLWAFSE
KHSHSIPIEVISGWLRSTTTLVLLDGLDEISELEERIQVCDWIDNAVTSF
PKAYFIVTSRTTGYRKVDNVELKHFVRVDIMDFTLKEQKDFLHRWFEAAF
LEAETPFADTTEAWKKSQQEKANAKAEAIIAFLTADENKSLQVLAGIPLL
LQMMAMLWKNCDDLPRSRVELYREALNYILDYQNKPKKKKPLLPAEQALA
VLTPVSLWMQEELHKDEVKKEAMQQQMQIALCELRDHPNPPTPLTFCENL
IDRAGLLVEYSDKEYLFRHKSFREYLVAVQLVNNIKQRNKSLDGGVDYLS
ILVAHFGEAWWKEPICFFAAQADADMFNTFLQKLFDSPVSKDFSQEQQDL
LERVIQEGSKEELVALHTKLLDSEMNANQQRYLLQCLEINKHPSTGDVVR
QFVEKKLAKDDDIRRRAGDFTVTGRVDKQGAQYLLIQGGQFIYSVTKKQE
TVPDIYVARYTVTNKLYRRFIAYLDGKEESYARLLPLERYRKNLEKMALG
IKGFRRYLRGNENLATLFRSRYDDDRRFNGDEQPVVGVSWYAARAYCLWL
SLLDGGNTSLYRLPTEIEWEYAAGGKEQRRYPWGNEEPTSTRANYGDNEG
ATTPVGRYPEGKTPEGLYDMAGNVWEWMENSYVFGLVSFIVFIIATLIAS
ISDSRIYSVFSTRSLRGGSWENVSDNLRCSSRSNNGFPRFRLYFYDGFRV
IHSSHSSLKI
>Cag_1771 conserved hypothetical protein
MTFADFSEKKFPIKIQTMKHPEEKFLTQAERQRIEERIAAAEKRTSGEIV
VKVVAESYHYPLEAMQGSLLLAIMVGIAVALLISEETMWLFFTFFGFSFF
SAPHTESMWFFLAIFSVSFMVFHELLKRVSFLKRIFVRKANMREEVEEGA
TYSFFRRNIHHTVNRTGILIYISLFEHRVRVVADQGINEKVTQSDWDEVV
TIIINGIKTGKQADAIATAVDRCANILAAHFPITTGDRNELSNKVILGRN
G
>Cag_1292 conserved hypothetical protein
MGVLTMIVMDSHVWFWWINLEHQRLSGIILDTIATANRIGVSPVSCFELA
LAHKKGRLNLPLPLDKWFRFALDGSDVELLPFDEKIATRAVKLSNIHHDP
FDRIIIATALQLNGKLASVDNRFSFYEEFSEILLS
>Cag_0379 conserved hypothetical protein
MHNILTKQYGSFLQEIKDKIRNAQYEAAKAVNTTIITLYWEIGCQLSEKL
KTGWGKAVIQTLADDIQKEFPGIKGFSTTNLWYMKQFFEEYSASEFLQPL
VGEISWTKHLLIMSKCKDLQERQFYIVATKKYGWTKNVLNHKIELKSFEK
FALGQSNFEQTLPSSIKNQAILALKDEYNWNFSELDDEHSERELEEAIIK
NIRAFLMDFGPDFSFIGNQYRIQLEEKEYFIDLMLYHRAMQCLIAIELKT
GEFLPEYKGKMEFYLNVLNNTVKLPHENPTIGIIICKSKSRTIVEYALKE
CAKPIGVATYTLTDTLPENYRKQLPSAEELAIKLEAFIEMTKKK
>Cag_0861 conserved hypothetical protein
MINLAEALCHPEAYPHAPQSVEMVQTHCSWVFLAGAWAYKVKKPLDLGFL
DFSTLELRRHFCYEELRLNQRLCSTLYLSVVPIVAVRQQIKVIDKENNTD
EHWNEEENNEHGTIIDYAVKMVRFDRTQELDRLLAHHKLDVKQMEQLART
IAAFHNSLPAAPMDSALGHPDTIIKPMLHNFTLLEDIVVESEEQQELATL
HQATLSDHQRLYQRLLQRKADGFIRQCHGDLHTGNMVMWQGRITLFDCIE
FNPTLNTIDCISDLAFLFMDLRHSGETALAWRLLNGYLMETGDYHALALL
PFYERYRAMVRAKVTAIHASQSKDAPEVSSLMAEHRSYVAHATNCTKHNQ
PMLLIVCGLSGSGKSTLAASIASELPAIHLRSDVERKRLAGLRPLERSPK
SDLYSHSMTNNTYAHLLGLARFCLLEGYCVVVDATFLRQSNRALFTTLAN
ECNVPYRLLHCTAPKQVLMERVQLRNLEGNDASDADAEVVAMQLEQQEAL
TDDEKKITITIDTTHPINATALTGMYQLKREH
>Cag_0410 conserved hypothetical protein
MPTIEAICISHEKGVIKEQVPSALLRTNWGIEGDAHAGEWHRQVSILASE
SIERMKALMPEIAYGMFAENLVTTGVDVTALRVGDQLRIGSEVVLCITQI
GKECHNGACAIEQATGKCIMPTEGLFCRVLHGGTVQAGMLVEVMPQAL
>Cag_1103 conserved hypothetical protein
MKSVRFEWDEEKNKANQKKHQVSFALAQHAFLDPKRIIAEDIKHSNEENR
FYCVGRVNEEIMTVRFTYRGNVIRIFGAGYWRKGRKIYEEQHQLYR
>Cag_1034 hypothetical protein
MTSNQLPTTQQPDDYDSPWKEAIEHYFPEFMAFYFPNAYTAIDWSTPYHF
LDQELRTIVPQSVQGKRVVDKLVKVQLLDGKERWLYIHIEVQGRREANFP
RRVFICNYRIFDQYGVPVASFVILTDTDYNWRPTSYSYEFAGCKHTLEFP
IVKLLDYEPRMEELLASDNAFGLITAAHLLTQKTSDNAFHRLDAKKQLIL
LLYEREWERDRIKELFRVLDWFLELPKELNQQLQTEIQQIEEGQKMKYIS
TFERYAKEEGLLEGIEKGKEIGVLEGIEMGKAEGLEEGLMQGRLEVAQRL
VASGMGKAEAALLAGISVDLL