Gene list

Applied filters:

COG category: Unclassified
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS

Number of genes found: 360

# Chlorobium chlorochromatii CaD3, CaD3

>Cag_0253 hypothetical protein
MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ
HDLKAYDTAEKQWRHLTSNDVVNGISILCTIPSPLINLAKNFAERCNSCK
LFLLRSSVVETDDCLTSNTDCEILLSSEHRHNREKLRDCLLLYAIEPTNH
FPHNAIPRFYAKPRNICHALYCSVLKAHYGKRNNTPYSPPLTVNTLKGQS
REMV
>Cag_0367 conserved hypothetical protein
MSWGVEDPQKRKDIRELMDIASKIVQENPNHVNEVKKSHNNCWMEQIYLI
QRCDFCDLAPDCPTREEKEWQEYIKANNIMVIKDSFPTNPQ
>Cag_1066 hypothetical protein
MNYIAVSEATYLDGYRISLTFNTGESGEVDLGDLIHRYAIAEPLRNPQNF
ARFYLDSWPTLAWECGFDVAPESLYARATGKLFPLPQPSNSPL
>Cag_0305 conserved hypothetical protein
MEIIFRPEAEAELFEAQAWYESRSQGLGIEFAQAVTVAVESVLRMPFAYP
RQFAT
>Cag_1558 conserved hypothetical protein
MASTQPPSNRRDDYDSPWKEAIELYFPEFMAWYFPKAYAAIDWSQPYHFL
DQELRSILPEAENGKRIVDKLVQVHLLDGNESCLYIQIEVQGNRETDFPR
RIFICNYRIFDKYGMPVASFVILTDTDSSWRPTTYSYEFADSKMTLEFDM
VKLLDFEPRMEELLASDNAFALVTAAHLLTQKTRENSLERLDAKTQLIRL
LYNKQWTKERVKELFRVIDWFMELPKELEQQLQTEIYNIEEEQKMKYISS
IERYAMEKGWSEGKELGVLEGMEKGKAEGLEEGLMQGRLDVARRLVASGM
SADEAAGIAGVDVYLL
>Cag_0684 conserved hypothetical protein
MDNLLLKKSMLMPPVQRVALAELLLASLDYEAEDIREEWINEVQARMNAV
SEGRSKLLDFDLLWQ
>Cag_1195 killer suppression protein HigA, putative
MKLRYSTRKLEKSVESFSVIKKNYGEWAKKVVLRLEQLSQAPNLAAMRTV
PSAHCHELKADKISELAVDISPNHRILFQLAHNPIPLKDDGGLDWREVTC
IIITAIGEDYH
>Cag_1284 hypothetical protein
MVQNERYFLFCVREYNHDLKRRVTMMQGVKQFGKCQGNATKNGQVRRVGV
PQGVVAAQAGQGGFALNQAVNGGHQTLRGGCNGQGRSQGQGMGQGQGQRC
RCTAQNSVSL
>Cag_0576 conserved hypothetical protein
MKTITVQLPDEVHKGMTMVAATQHISISKLYEYISQNMLRGYAAEVRFRE
RVAQGSRKKGLAVLDKLDECYGE
>Cag_1353 hypothetical protein
MKSQITEQFQQIGLDDLKIHPELLTDILSNKQDLSLILEKRGDIIRYAYL
RTYDSDSRKILEEAKAEYCLKKEDGYSREEAFQDFEDVQHDIATQLASRV
R
>Cag_0918 cytochrome c family protein
MMRKKVISATVALLATVASFAHAEEDARLIKARALADQLPPKLGAVLKAE
IAKSGPEGAISVCRDEAPKIAKELSKESGTRIRRVSLQQRSCKAKPDKWE
KAVLEEFDRRAAAGEPLTTLEKGEQVGSKYRYMKALPVQDRCLNCHGSLE
TMKPAVKAALEQHYPKDKATGYREGQIRGAISVRL
>Cag_0519 hypothetical protein
MLTVSYLTMQSLSIFLSSTCYDLKSLREHLHSEIAKLGHDPILSDYPSFP
VSPDLSTVENCKKVVRDRADVFVLVVGGKRGSLDLETERSVVNSEYREAR
AAGLDCIVFVDRQVWDLRHLFQKNPQADFSPTVDFPEVFGFIEEIENDTK
WIFQFHRTDDILTILLQQLSIRFRDLLLRHRTNRLLVPSEFAYERSDIAR
IVQDKDSLWEYRLTSALLADRMERLESKFNDLDSGYIVKRTKFLPARDTL
NYIQDLISDFTNVIKASVKVLEHQLTPAFGPPGVAGDAIQIRRACDNLFS
LFLALYEWEMDIRFVRPHELFENLFLRMHGWTSELLQDFRRIPKELDELL
AVPNLTGEHNIQLVINAPAGLALLTVEFERMSHDPDVVAALAGG
>Cag_1937 hypothetical protein
MQRKAWFVVLVSLLIGLCANKGFAESPTQPSTLVATRFSTNEIAFTTGYG
YSLRRKAFEEHNFSIYPFAVRYGWNLNRPLGLSGASSALYATVEPFVNMI
VGKEQGREVGCGVGVRYRRAVSQHANFFAEGSVAPMELTINTPEQGAAGF
NFLDQFGVGLQHEVGQRTHLFVGYRFRHISHAGLIDRSNGGINSHGVMIG
ISLIQ
>Cag_0645 hypothetical protein
MKLLKQTSIAALLGMMALPSLTASAAPYSSTLYMPNSHGKSVTTPTAWGA
SGNVGFVGLGGTYQSPYTDDADGAAVFGVGLGDSKENLGVQIALISLDIS
EWEEYSSAFHVFKELGDADAIGIGVENVMLTDGGDSEKSFYVVYSRGVQN
DWALNKNSNQTKFHYSIGAGTSRFGDKSPADIADGKGKHGTYVFGNVAYE
IAEEFNVIADWNGVNFNIGASKSFIINNKIPVGVSVGLADLTTNSGDGVR
LVAGAGFGFKL
>Cag_0402 2-vinyl bacteriochlorophyllide hydratase
MPRYTPEQLARRNASKWTTVQAILAPIQFLMFIAGLTVTYLYKEGIWIDD
FTWITIFVTLKTFMLVLIFVTGGFFELEVFGQFAFVHEFFWEDFGSAIAM
IVHIGYFVLFFMGLDESTLIWTALLAYLSYLINAAQFVIRLLLEKHNEKK
LKQQNAL
>Cag_1405 conserved hypothetical protein
MAFKIRDIEVALEKKGFKRVESDHSYFIYFTIENKKSRVRTKTSHGHKGQ
ALSDNLFSVMAKQCKLNKNQFSELIQCPLSRNDYEKILDMQGMVK
>Cag_1121 hypothetical protein
MANELSHQHIGLFEKIRQTDENGNEFWSARDLSKVLEYSEFRHFLPVIER
GKEACINSGQQIADHFEDILEMITTDKTEHREIEGIKLSRYACYLIVQNA
DHGKEVVALGQTYFANLSNIQLLNKSRISKFIYTIEGQQIILDRDLAMLY
QTDTRTLKQAVKRNIERFPSDFMFELSEQQIETMVSQLVIPSKSYFGGAK
PFAFTEQGVAMLSAVLRTSVAVEISLQIIRAFVEMRKMINNNALILQRID
RIEIKQIETEQKFEQLFQALEQKNSKPQQGVFYKDSIFDAHSFVCDLIRQ
AQTSVILIDNYVDDTILTILSKRKNGVRATIYTSKKDKQLELDIKKYNSQ
YPEIMVIEFKEAHDRFLIIYEKELYHFGASLKDLGKKWFAFSRMDSFVNE
VLAKLKNNGNNE
>Cag_1776 hypothetical protein
MTEVEEIVHRVQKLSKDDFAHFKQLVQDIDNDYWDQQIATDFRQGKFEQL
IKKARQEFAEGKARAL
>Cag_0867 hypothetical protein
MIESNLDFYRPIVEQIVERWAVGKPPLPTTGKPSGYYRLTNYLLNYLVEH
DAFPTGIHQMPEGLDAQQQVEPSFPVDFNVVIGETRLPKISVNKGEKL
>Cag_0724 conserved hypothetical protein
MQQIPFGIQTFKKIRQNNLVYIDKTADIANLVAQHNAVFLSRPRRFGKSL
LIDTIQDLFEGNKTLFEGLYIADKWNWTTTYPVIKIDFAAGVLHSVDDLK
SRIRKILFDNKQRLQITCEFLDDRDLAGCFADLISKAHEKYQQPVVVLVD
EYDKPILDNIENVDIAIQMREGLKNIYSVLKAQDAHLRFVMLTGVSKFSK
VSLFSGLNNLDDITLDATYATICGYRQVDLETSFAEHLEEVDWERLKLWY
NGYNFLGESVYNPFDILNFIKKQHTYRSYWFETGTPTFLMKLFAKERYFL
PNLENVEVGDEILDSFDVEDIQLETLLFQTGYLTIKQRVEMFGNLRYQLK
IPNQEVRVALNNHFINVYTAQASVQKYAQQKRFYTYLMAVDMLGLQQALQ
ALFAGIPWNNFTNNDLPQFEGYYASVLYAFFCSLNAMVIAEDTTNQGQVD
LTIIFDTLIYIIEIKRDTSETYQVSPENVALQQLLQKRYFEKYQGQGKEI
VQVGMIFNTVQRNLVQLDWAKP
>Cag_1287 hypothetical protein
MLQTISGCGIASNATSLYMSVKNTALLFELIAPLEHYGSMNVPYETHSIP
IAIKSNAQIVEVFLKNTIPLIEKQQEARRSSAKILRN
>Cag_0680 hypothetical protein
MVEVRIHFFVTICLIIRDHCLRRNDKLSMLFSVINHNYEMKENMMLQTLE
AEILPNGHIHFLENFSTNRKVKAYVTILSQQPLNKPKADWHHFVGALKES
SLFQGDPVEIQKTMRDEWA
>Cag_0013 conserved hypothetical protein
MKPIYYFMAATFSILLSIYVFIFGTSTNHEMLGIFIGLWAPTIICVGIFN
TLIGILDEMCCAHKRIEEGQSCSHNRH
>Cag_1643 conserved hypothetical protein
MNYSFKTLWNRAFYFISPLWCLLVWIIWSTDQLQDPADKIVFISIVIPGF
FAVYVSGFLIEKWHNNKQQKLK
>Cag_1104 conserved hypothetical protein
MKNNINYTDEPVGELVVVKDFLPSPDQLILKEDNVKITIALKKSSIDFFK
NEAKKHHTSYQKMIRELVDWYAVNNAKNA
>Cag_1860 hypothetical protein
MHEERSKLDEYSLKHGPTIGKMYEGLASDVLGRAIPESLGLQLQHGIIHD
GKGAMSGEIDCMLVKGEGEKIPYTDSYKWHVKDVIAVIEVKKTLYSADLK
DAFGHLRGVADNYGSYVQSGEGSEKFDINPAKKAFSETTGLIPPDHSKVD
SLGIEMEMLYHTFVMEHISPIRIVLGYHGFKKESSLREALVSFINENQMT
QGFGVGSFPQIIVCGNYSLIKMNGYPYSAPMDSDFWNFYASSNANPILLI
LELIWTRLSYQIKVENLWGEDLSIENFTLFLSAKIKKVGDLTGWEYKYTP
ISEDLLKERAPEDEWSPTIVSSTQFVIFNRLCNEAEESIGDKSLREYVEK
EGENFDHFIESLTSTGLIALKDDKLQLTTYECQCVILPDGRFAVADNNSG
RLTRWVNKQIEKNKA
>Cag_0032 hypothetical protein
MRLLPVIQNYLKTIVELKTMKQITAADALGLSIPERIQLVEDIWDTIAMK
SDELEFTVEEKRIIDKRLNAYHRNPEVGSPWEDVYKRILSKQ
>Cag_1202 conserved hypothetical protein
MTPDEIDDQKKIEFYAASVSAWYESSLEHDKSLLTLSAGGIGLLITLLTT
VGLGTAEALVLYVGAIISFVISLVSVLFVFRGNKKHIEDILSGKNQGTDP
VLSKLDGTAIWSFGIGVVFTAVIGISAAIHSFTSKENTMANETTKTTQAV
PLHESFNGAANLQSGTDLGQSFNGAGNLQPQQTTQPATPSTTPANSGNSQ
NQSDKGK
>Cag_1272 hypothetical protein
MSTTTLSSRRSKKSWHHANPIQKAHHVVREISIAPIIEEQINLSVADQTL
LISTLLNPPAANEALQKAFAHHQRLVQKY
>Cag_0991 hypothetical protein
MRLMTSDQLPTTQQPDDYDSPWKEAIEHYFPEFMAFYFPNAYTAIDWSTP
YHFLDQELRTIVPQSAQGKRVVDKLVKVQLLDGKERWLYIHIEVQGRREV
NFPKRVFICNYRIFDQYGVPVASFVILTDTDYNWRPTSYSYEFAGCKHTL
EFPIVKLLDYEPRMEELLASDNAFGLITAAHLLTQKTSDNAFHRLDAKKQ
LILLLYEREWERDRVKELFRVLDWFLELPKELNQQLQTEIQQIEEGQKMK
YISTFERYAMEEGIEKGKELGVLEGMERGKVEGKLEGLEEGLMKGRLEVA
QRLVAGGMSKAEAASFAGVSVDLL
>Cag_1568 hypothetical protein
MATYSSFEQLNFYRSMQLTITLPDILPDEISRVIKKVKEIFSQEGIAAEI
TPEPLSTDAWDSLNFDEIAVDTGRVDFAENHDHYLYGIAKRS
>Cag_0736 hypothetical protein
MIRDLFFNIAPAISTLFKLGFEPKEECFYELTIDQYEQLRKDGEDITETL
YMILPEESKYMDNDIIVVNEQEKNSLLKAKKVIENYCEKGGKVFNSYQDK
LTYVSNLLPSVFTEDSNFRKCHLKLVEPNQSK
>Cag_0881 conserved hypothetical protein
MNRIKAYYDEAYPPVPSKRVLYWRKNLPWQIIRFFVLNFKIMRIVVGGHS
>Cag_1884 hypothetical protein
MAQQQRKLTIMVSSTVYGIEELLDRIYTLLTAYGYEVWMSHKGTLPVHSG
LTAFDNCLRAVDESDLFLGIITTSYGSGQNPADNKSRSITHQEILKAIEL
NKPRWLLAHDHVVFARTLLTNLGFKGKSGRQSLKLQKNTIFGDLRILDLY
EEATIDHESPDDVPLAERRGNWVQKFRTHEEGSLFVNAQFGRYQEVEAFL
QENFERGFSLLKKGGNA
>Cag_0661 hypothetical protein
MQAVKALYKEGNIEFRANSKEEFMALGLGSFFDTDEDNNVDWEAMFVGAT
LLEKTQEYFHRETVLL
>Cag_0492 hypothetical protein
MPSRPSKRFVLNWLKALLLSPGMLFPATYLLLYFSTTLYLNTALKSEVIK
TLMPMGHISVRTVTTDMTLERITLHNVTLQPSAIGESKKPQHIRQLTIAC
PKLIPQLFTKQGRTQTICQVEQALSPIAQ
>Cag_0734 hypothetical protein
MKLTFNDYTLETVYLHISDAQRLEVMDLWQTENAITSAAERERRSYEVVV
MVRHVSGAVVGVSTVTVRTASNGKRYYYYRMFIHSSHRVPYLMRAVTNAS
RDFLATFRHPDGEVESFVVVTENPKLMRQGMRQLFERHGYTYKGKTSQGL
DCWEYSFLQ
>Cag_0300 hypothetical protein
MRKLFLLVLLPLVFLLPVNKAFGANLVILEPTDGASVTWRPLVKGSVSGA
KHVWVVVRSVENARFYVQPKAAVRKNGSWKTSVFIGKQADTANNRDFEIM
AVGNPTEKLSDGMELEDWPSGVVTSNIVRVVRTKMN
>Cag_1201 hypothetical protein
MSFDATSIEYAFAKLIGNTTGARSTSHDADVFRGGNPKNLAKALSDAADA
LEEKVKSVPFAAADTEPGGAKARIDVAISRLHKIAESMSKSATVSREDYH
WEIIGCLVSTIADLLEKAKC
>Cag_0311 hypothetical protein
MLTKDFYSIIDTELDALIIKYKDDKLIKKHKSAINNQKSYVLLIWFLDFY
GRISNYSNFITDGDNDSSCDIVFDSLNNQGNKVFYVVQSKWNNADNSKKE
TKKDDILKALNDFETILRGEKKNVNEKLKSKLEELDLHLKANGEVKFIFL
SLSQYKGDADENIEAFRSNDVKTKFEVIDISRIRVDYIDRKYKKIDPINP
LENYQNPEESPINIEIVQKGGSVKIEKPFEAYMFLLRPKAIWELFKTYGF
ALFYKNVRNPLLQSQFNVDIERTALENPSYFWYYNNGITAITYLLPEIGK
KAEKVPLTGLQIINGAQTVYSIYRAYESASPTKRIQMDSEALVTLRLLKS
GGKDFDLNVTRYTNSQNPVQDRDFCANDDIQVALQNASYQTNVWYEKRRD
EFRETPTGVKKVPNFIFANVYLAYHLQDPVSVLKNHTQRFKTHKDLNFIS
HKDHKDGLYEKIFNSNTSFEDMLCAFYIFDTIDDYTPFSYEETFKTNLYH
LLALFKVAFTKYIIAKKAMYGKGKNGEKEINVNKQIIEIYEKDEKEIILK
TFKFINQFVEKQIEVADNEEKTTDRMFKFLFTLSHYQKIYDALEDTEISV
KDIDDIVLQDNDDIVEGDKDTEVTSEQE
>Cag_1795 hypothetical protein
MAELPEASSSNKTGERMAELPESSTPSKPSRRARFFEEDNGSLSSMRLMS
FVALIAAILFGALTLTFDTSENNGTGLYITLMFLVSAFAPKAVQKFAEQK
LNER
>Cag_0445 hypothetical protein
MSIYTSPETSPSPIAILCSEYVQSVEAMAESLPFIMSTLIEAEDTFDKKL
DAFIDFHAIDVEKLEDGRRYGLKLEDKSAHDRLHRRIRVFREALGVTPRS
FLVALVSAYDAFLGRLIRSLFYARPELLNSSERVLTFAQLQDLQTLDAAR
EYLVEKEVESVLRKSHSQQFEWLEKTFSVPLRKGLECWPRFIELTERRNL
FVHADGIVSSQYLNVCGEHGVTHSEVLTSGTRLHVERSYFQLSAHCLMEI
GIKLAHVLWRKLVPTDREKADENLIEIVYDLLIKQKYRLAADLASFGTNT
IKSHGSDQTRRILVVNLAIAHKFGGDAEKCTAVLDAEDWSATADDFRLAI
AALRDNFDEAAKLMKSIGKDNRLGMFEYREWPVFRDFRKSYQFAAAYKEV
FGEEFVLKAQEPKASESPQTTAKGENVLDEE
>Cag_1043 conserved hypothetical protein
MKKRSALTLLLLPIAASALLAGCSPTVKIEASDKPITINMNIKIDHEIRV
KVDKELDSLLNQKSALF
>Cag_1501 hypothetical protein
MTFFNLLFFDTRHHNSTMLKKSFSTMWSSLSLLFAGLWLVVRIILEYFGI
ISDGNDRTTGIKDLREEYKKANYR
>Cag_1944 polysulfide reductase, subunit C, putative
MIEKALKGSPTYWLWILLLLALMGFGYSSYATQYAHGLAVAGLGGSAIWG
LKVMQFATMATFAASSVLVVALAYLQASKPATSLLVTSQFIGVSAASTAL
VSLVADCGKPELLFELVRSASLSSPYFLSILSLKLYIITSLVSSWATLGA
EAKGVPAASWVKSVALLSLPFGLLTPLFVALMIADAVAILQLLAASIAGG
MSLLLLVVALLKQRATFSVDANAPKMVATLSLYGGVAHLLLTAVAWFRAS
SDNSGMVAAAFVAALLAVVLFALSLKKGNTTMPALAMVISVVLAVAGTGS
FVTYAPSGVELSIVAGVFSCGLLVATLLLKTATAIRREA
>Cag_0296 hypothetical protein
MIFQAKSIASWFIIVLMLLQFVPLPRRNPNERQPLRAPVAVVTVLRNSCY
QCHSHETRWEFPLGTVAPFAWWMSQRVEHGRRALNFSTTASLSVYEKERV
TVMLRNPKEHQPLYYVLHSNVLPDSVGVATVQLWATHR
>Cag_0755 conserved hypothetical protein
MKKLPVGIQTFSKVVEDDYLYIDKTDIARSIIEKYQYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKDLLIYKQWNWDVTYPVIKISFSGGIRDKESLQK
NLFYILNDNQERLTITCKEKSDPNQCFAELIKKTFQKYQKSVVILIDEYD
KPILDNIENIAEALIIRDGMRDFYTRIKESDQYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPDFGNICGYTQKDVDTSFAPYLKGVDMEEVKRWYNGY
NFLGDKVYNPFDILLFIKNKCVFDNYWFETGTPKFLVDLIKKKNYFIPDM
LTLRVNKSVVNSFDIENINLETILFQTGYLTIKQVLPLGMGVGYELGFPN
KEVQISFNDYILQIMTIVADKEPIRYELFDIINNGNVASLEPIIRRLFAS
IAYNNFTNNYIESYEGFYASILYAYFASLGFDIIAEDVSNKGRIDLTLKN
QDKIYLFEFKVSNQEPLEQIKKMKYYEKYNGERYLIGIVFDPKERNVSQF
AWEKI
>Cag_0841 hypothetical protein
MGMPVRIDDTLYGQARAQAKAEHRTIAGQIEFWAMIGRAALDNPDLPIDF
VRDLLIARREGEAHSTPFVPEGHRS
>Cag_1820 hypothetical protein
MGEAYLQEIYINKLSEGLSRLIACELFLDRYYSSDSIFEAEAAILQIRKA
MECVAYAAVAPNKSKYAEFRSQADKAIDFTKDFHAGTILKMLSKINPDFY
PKPVSAPLNVSLGKWHFDRRNDKSLSQKQFESFYDRLGKLLHADNPWGNE
KGLRNLLADIPSTIESIRLLLSLHFTVIRTSEFNGVWIVESPNNGQQPRV
IVGQAIGEFAVEE
>Cag_1736 hypothetical protein
MNERVFLKILLTLLSSGAIAEVMVVLLLGWHGEALLFVFFMSCFAVAIAL
VLHKLYGTAEASGAIESVSARRVRAMQSEELRSRLGAYSVDDEFLAGTPL
RAKSTSSTYEKSDVEAMIRKFAPHVGGLSRLLQMVQERDEASFAAVAKQA
GLANVERQTVIDYIHIMLNAEKECESNTKSGEATSPLTEFTMERESFDSY
IQRCMSGEGDDGDLSDELSVGLENPLTPKSVGIPPSDFSHSPTSIMESLK
KRAGRVP
>Cag_0811 conserved hypothetical protein
MELAQILEGNWLFRVEHRGITIHSSTIQDSPIQGFKGEVELQVSLKRLLS
AFYDMENYKRWVHQLAELTVIDKPDPTEYIIRQVINTPWPLQQREVIMRS
RLEGVGENGVALSMQSEPDYLPLHAQCHRVRHAQGMWVFTPNGHGVVQVM
FIMHLDPGPDVPPPVSNAGLFEVPFYTLKNLKALLDDAKYQPMWPEELEH
YLAIVEEDNLDTL
>Cag_1574 conserved hypothetical protein
MNRYLYDGTADGLLSAISWILEEEQEPEQVVLAEREDTLFEEGIFLNTDV
ARSEALFSRFRKQLPDVAQTLYFFMLAESNGMATNLLHYMALALQYGDRV
NGYLTHPAVKAIVHLSRKASRELHRMKGLLRFEQLCDGAYLAQMSPDHNI
LHPLSHHFRHRLKAEHWFIVDRKRHTAAHWHQGSLEFGTIEQFNVPALSE
QEQKVQTMWRTFFATIAIQERKNPALQRSNMPMKYWKYLTEKQ
>Cag_0670 conserved hypothetical protein
MSSIKTLVKDWVPPIAIRLLQSFRRKGVIFEGDYVTWEEASAQCSGYDAK
NILDKVLDATLKVKRGEAVYERDSVLFDKIEYVWPVLTALMWIAAQSKGK
LDVLDFGGALGSSYFQNRTFLADLKQVRWSVVEQSHYVDAGKTHIADERL
QFYKTIEHAVSAGSPNIVLLSSVLQYLPNPLIILKQLTSLNADCLIIDRT
PFIKDKDLSVIKIQHVPSSIYEASYPCWFFNMDKFLEYIESLGYRKIEIF
QSLDRLSNEAIWQGIIFKKKNEL
>Cag_0267 hypothetical protein
MNTLSLTLPESLHKSAREIAEKENITINQLITSALAEKISALGAEDYLEM
RARHASKAKFLNAMAKVAKIEPPDYDRL
>Cag_1385 hypothetical protein
MVIFPISLSFPRKRESNSLILLGFRRGNETTIIILLSGFQKKSQKTPQQE
IDKAERLKKEYFDGKNKQ
>Cag_1644 2-vinyl bacteriochlorophyllide hydratase
MPRYNPEQLAKRNASVWTDIQIILAPIQFFFFIGGLTVTILYANDLFPGG
FYWVSLAILFKTLFFALLFITGMYFEKEIFNQWVYSKEFLWEDVGSTVAA
FFHLLYFVLAFAGYPRDILILDAFLAYFTYVLNALQYLVRIILEKLNERK
LRMDGQI
>Cag_0780 conserved hypothetical protein
MPLLSVGNILVEPDVLQARFACNLQECRGACCIEGELGAPLQPEEAAQLD
HLPEELFRLLPEKGLRYLRRHGAVELYQGVHYTRTVKSRECVFTVVRDGI
TLCAIEIAFREGLLPFDKPISCRLFPIRVRKKFGLDYLVYEQHAMCRSAR
EAGREQGVRLIDYLTPSLTARFGEALVQELQRFHDSSSNNSHG
>Cag_1343 conserved hypothetical protein
MKPIVYLESSVISYLTSRPHRDVVIAGRQAITQEWWEYQRHQFELRISIL
VEEEISRGDAEAAAQRLASIADIPSLTLSDDAVMIAHLLLAKCAIPKGSE
DDALHIGIAAAQGVDFLLTWNFKHINNAVTRGYVTHIVEACGYNCPQLCS
PEELMGRYYEYD
>Cag_1003 hypothetical protein
MVLPESRPSRARGLKPRIGNLFTDGMESRPSRARGLKLGYADRAAIEVFE
VAPFAGAWIETAAEGLKKKPVQQVAPFAGAWIETPEDVDDYEKAASRPSR
ARGLKHVRCQSQSSALCRALRGRVD
>Cag_0698 conserved hypothetical protein
MKTVSVSEACSTLSTLLKEVELGDEIGISFEHQQHTIAVLVPIAKYKKIK
DRKLGSLAGKVKVEFSNDWQITDEELFNL
>Cag_0377 conserved hypothetical protein
MMNKLFNLYCDESTHLQNDGMPYMMIAYIRSPYNEIEQHKEYLKFLKAKH
KFKGETKWTSVSAGQYLYYADLIDYFFSTDLCFRSIIVDKSQINENCPEF
SYDDFYFKMYYQLIHHKVDLGYHYNIYIDIKDTRSNKKLAKLHEILKLNT
SIKNCQFMRSHESSLMQLTDLIMGAINYKLRGYNRVIAKNKIIEKIEQHS
KVPITRSTPKHADKFNLFFIDLK
>Cag_0583 conserved hypothetical protein
MLAQAQQIYDEALTLSPIEKVELIEHLYFSLDSKNSRQEVDKLWAEEAED
RLTAYENGEIKTTPASEVFAEINSMRPQ
>Cag_1322 conserved hypothetical protein
MIIEFLDPAKVDLLEAVKYYNNEQENLGFEFSEEVERSLSRVVNFPHAWC
KLSVRTRRCQTNRFPYGVIYQKRGNLILIISIMHLHQEPNSWRKRIPKKE
Q
>Cag_1191 hypothetical protein
MMDCRGNPCGCPVVVFSIMNYACLLIKIELLNQFYFYSPNHSQRLQIGAI
NMKKRWILLLLALGTNVPESKADYTIHGYTSPDGQSTFHVRENPLQNPLQ
KLADDLERNNNKSERPSDAFYRGYEQAQKMKMMIEQRRLMEEQRKLLEEQ
RKLLEQTRLEQQTRLEQQTRLEQQTKE
>Cag_0374 conserved hypothetical protein
MSNPNPSPLSHAMPEAIRQLPAEAQQEVLAWVEKLLATQNEGVDELYNAI
SSIVKFIPNFMVIPLMVEQIHPRIAAGVCVKMGVEKATGYANDLPVEYLS
SVTHHLPNPMVGEILTAMKRYAAEKLITYEIEHHRTDLQALMPSVSESHQ
ALITKHLST
>Cag_1503 hypothetical protein
MPEQMKRKKIRCYNCGEIFTLLMDIAGEPTRSITCPFCGASLTVTLAKYP
KKVITVYRAAVGESSASEITVYDLPDVLESTESSSQS
>Cag_1667 conserved hypothetical protein
MALPSNLSFSIFFFVHWFSFFKRLHMRKNNERVGLRIVSLIQGSNRLELC
CQAADFGERETHLLEAGFDGNIAVSIMAEKSDNKIVVTLSAQTTAHCTCD
RCLALLSLPIHGTATVIFTCETVVDEAAITLDDYRSYNRQSEYLDLADDV
CDALLLALPMKITCTNNPYCRVFQAGESDATHNDASHPINSEWQEALDKL
KQKYSS
>Cag_0434 hypothetical protein
MEIVCSKWLSDFIYIMTEYLYPFQEFLKIVPGLLIFPISLYLGLQKIGTK
VNARISFSSNPTSPSTVRYIDLRNMKERTIVIYSIFASVNDEEILLELKQ
CNPPIILKAHQSTIVEIPTYSFMQLDGSRLDVSKLPLKKTTIYLELAHTT
IKCHTLNYQKSVSRKLSKYQILEKEDHYPDALPYNKHAIYAITYSKNSRI
KTATINEAGTFHGDWTFSLNQLPISALISKESVESYLQSMNFSNEVEWFK
VYYLNHCSS
>Cag_1671 conserved hypothetical protein
MKPLPVGIQTFSEIIKQDYLYIDKTSLANELIKRYKYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKDLLIYKQWNWEVTYPVIKISFSGGIHSKADLEE
DLIHILNANEKRLELKCENRSKAKYFFAELIQQAFQKYQQSVVILIDEYD
KPILDNIENIAEALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPDFGNICGYTQNDVDTAFAPYFEGVDMEQVKRWYNGY
NFLGDKVYNPFDILLFIKNHKMFKNYWFETGTPKFLIDLIKKNQYFVPEF
NGLKADESLINSFDIEKLALETLLFQTGYLTIKQLLLSDVGVSYELGFPN
KEIQISFNNYILQSITQNSQKESIRHELLAIVKAGDVANLEPIIKRLFAS
IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRIDLTLKT
IDKTYIFEFKVIKQEPLEQVKKMEYYEKYDGERYIIGIVFDPKDRNVSKF
EWERV
>Cag_0011 conserved hypothetical protein
MAVGNKRNDWFDATGHSKGDWMFQGNFQRTINYLSEHQKILLFNPFCHSV
DLLQGSENVYKWLFRVNDPQNNPFEVIFFVEQLEELLLDVPSEINCSDPS
TLTPEMIELYTTGKKITWRHYDAQEEVDDPSRYLFEGKVFADMLMEAKDN
DRTRVQIDLRVDVRFVLYPAFRIIPDPVLHAMVNGGMSILMQTATNRMFQ
AISKDFHSITPT
>Cag_0826 conserved hypothetical protein
MESNGFFPFRSELLKASIGLATTITETSEQPIFDELKIRRYNVPADEIAS
FILTKLDHWFGWNMLSDRPSKNNTRLIRADVGSILLFGLKIKVTYGLYEE
KDANERPITSVHAHAETSIESKGDLGESRRVIRMMLSALDFNYLTEQLHD
EEYQSRSLDCAATRYILEQMFVVEPEPEPAKPAPSKVPKATVIELRKPNP
VQTIPLITKPKSNEADVVALLEPMETEQPAANVAALEEETYQSSATSAAG
DDKPAKPKIIVVTSKKNQ
>Cag_0998 hypothetical protein
MRVSIHAPAKGATKVKVKREIEGMFQSTRPRRARPKVNDRRNKNTVSIHA
PAKGATKKQLVSYTAFTVFQSTRPRRARHRLMLFACYSPRFNPRARGGRD
SEILRRNHELQSFNPRARGGRDAVQTQQTTINGVSIHAPAKGATVIRSLF
ENEAAVSIHAPAKGATLMQ
>Cag_1949 conserved hypothetical protein
MERLLVLYGGKIAANSFVTDSLSKALADVLHYGGKECNELSEALKDVTCI
LAVQAMPPSQTLALFFELPAVLRDYAVKGAPKQLLKESDMDALQKRLEEL
TLKAFDSYMMHRETISQLKVDEMKRQLFMQLRRAEA
>Cag_0561 hypothetical protein
MPQNDAKRFVEKLRADDGYRGRIAEIMRYEGYSGTADDLKKLTNEEGDDK
RYPNKSYCSSLSWHQGDKKQQDGASQGYHHWAG
>Cag_0265 conserved hypothetical protein
MALLESIRQSAPALHRAFASLADGEQHLQRLVELDGYWERLKQPPARVVL
PASALPCNAAISGEYDLIYAGGTLSMFHAAVMARRYGHSVMVFDRHTPAT
STRDWNISWEELLRLRDVGLFSEAELDSVVVRRYRDGWVEFYQPDGKQKR
LTIEHVLDCAVETSTLLGMAKVKLLEVPNAAIFGGYTFQRCYQLPDGVIV
EIIDSKGERLFYKCRLLLDVMGILSPIAMQLNEGRPQTHVCPTVGTIASG
FEGVDMEVGEILASTRPADVENGTGRQLIWEGFPAKGSEYITYLFFYDSV
ESANNKSLISLFDTYFRLLPEYKQMGKNFTIHRPVYGIIPAYFHDGVSCK
RTIAADNILLLGDAASLSSPLTFCGFGSLVRNLHRLTAGLEQALAANQLS
QEQLTTISAWEPNVAAMANLMKYMCFNPETDSPNFVNDLMNEVMIVLDSL
PHRYRQAMFRDEMKIEELVEVMLRVAWRYPKVLSATWTKLGVTGSIGFFK
NLAGWALGK
>Cag_0472 hypothetical protein
MKQHRHNIEAEVAKTMSLLDKPAAIEVSAPFRARLMQRLEAEKNNGLQGN
HAFHVDYRVAFMALLLVANLASSLLLFRQENRTNSPTQNVAATLNVDALA
EQELLGGDEQGEWYENILP
>Cag_0532 conserved hypothetical protein
MHPETAQIVHAIESLKQAPNFIKDYIFPIANAFFTSLLGAGIAYFTLRYQ
EVIQIEKDKMDTVNKWTLLVDEARSSLLAIKSNYHGNLTDSPIQRALAIR
TVLFTATPINEEYLHLFFLIPKATEKKCEYQKWSQISIIRTMVLNYNNLL
KLWIKRNEVERPIKEKLLQKYSQNAYADVNTDQIIECIGAANFASLVDLT
EHVIKLTDDLLVEFDNFLHEFPNYAKTLINTKRLKRYGSILTYSNNSNKK
LLSMLEKSPTSNYESVIKLFGMTVEQLREKYKTGYEL
>Cag_1197 conserved hypothetical protein
MIIPEKYKVWVEARKKYKLSDAQIQMARELNLNPKKFGKIANHKHESWKV
PLPEFIEELYIEHFGKNKPDVVKSIEQIIESKKS
>Cag_1149 hypothetical protein
MLRKKLPLLLCAAALLSPMNSLYAGKPTAQAKSSVADGAAAASSNQASMS
AIVPVADHTKGIYLVLTEADAMTQMMALVLATQHLEQGKTVQVLLCGPAA
KLAVKNCKEEPTIFKPINKSPQMLLAVLLSRGVQVEVCPLFLPNSNMTQE
QLIAGVTVAKPPVVASQMRADGIKTMNF
>Cag_1427 hypothetical protein
MPLFAFTKYAQVREFRKKSLHFTLGMVLWILNIGFWMGATARVARTGLSV
ETPKLGVFTTYGMVGRRRGEIHFRPHERADIESAPTRDGSCVGTLNPKET
LRSLFSDREKPHSPARHH
>Cag_0333 type II restriction endonuclease TdeIII
MSLNQQQIQKVETVLRNSLRHKFQNYNPEPAVMPFHTRLLGKDRMALFSF
IHSLNTNFGTTIFEPVAQALALSRFGSVELQKVAGNQISLQAQQVIQEIM
DGLTTATNLPCKSQEIEAIRAVCQSGEMRKVKPTKVDVKLISYDGTLFLI
DIKTAKPNKGGFQEFKRTLLEWVAVTLATNPEATIETFIAIPYNPYEPKP
YSRWTMRGMLDLEAELKVAEEFWDFLGGEGTYPQLLDCFERVGCELRSEI
DAYFAQYINS
>Cag_0766 conserved hypothetical protein
MTIESISQRQSARNELISSLLARCPMNVEATGSHRSFIMDKRGEGVPIII
TESEKLSGRRPEYQLLDDAELKLTIFATPSPHGDE
>Cag_1916 conserved hypothetical protein
MTTPAYPPTQRDDYDSPWKEAIELYFSEFMALYFLKAYAAIDWSKPYHFL
DQELRSILPQAENGKRIVDKLVQVHLLDGKERCLYIQIEVQGNRESNFEK
RMFTCNYRIFDKYGKPVASFVILTDTDSSWRPTSYSYEFAGSKMALEFQV
AKLLDFEPRMEELLASNNAFALVTAAHLLTQQTRENSLDRLDAKTQLIRL
LYNKQWPKERVRELFRVIDWFMELPKELEQQLQTEIYNIEEEQKMKGTSI
YL
>Cag_0617 conserved hypothetical protein
MQQVIFQEKYPLFTLELQKNETTYTNVNDILAYFRQKIDEHPITVFIANF
DHYSHTMSLPEHAMNPAIKDAKNIIFCFGKDLNDPLMIGARPRSIGVIEF
ENSFLISFLEAPNPMATNTMEAWAKGLKKS
>Cag_1130 conserved hypothetical protein
MRVFFLKYAQQELDDTAHCYEMELKGLGKIFKDEVKKAISRIIKYPEAWT
IERTTIRKCTLHKFPYVILYSIEKNHIVIIAISHQHRKPYYWIDRKPT
>Cag_1397 conserved hypothetical protein
MNTKSCIQVIEQGTRGFYIRNSLYLPFHCEILSIWVGREMSFIAAPELLC
DMMDSEVLALREGDRYTNLVFRKWGDMAKELGNNKGHVILFAAEKGSDLF
QAEQRFYIRITFELETRELSFELLDNPFYL
>Cag_1655 FtsK/SpoIIIE family protein
MATTIKQRLKNLFVRALDRSMIKRELAGIALMLGSLFMVSAIISYHPDDE
ALYSALRWFDVFSNPARDTADAIHNHFGLFGARMANFLIHFVLGYPVLLL
ISSFFFWGLSLVRARSLKPALFFFLYSVVMAIDIATMFGLTSLAFSDVMS
GSIGRMLAAFLITTIGFSGAWVLLLSVGLLLTFYMGRSFFIPAFHALMAM
VPRLSSLWDNIRARISAIQKKKPLQSP
>Cag_0683 hypothetical protein
MNDGIVIPTSQFSIINKNPTHSTHPLIAKNLVQDKRQPQGLPLHFYNSPP
LEGCPKGGVVFRANSNCGQKGLPQRAAPYFPLSTQYFSTPPFSINVIRYK
KRLRVRVIRGIRGEWFVRGLFGVARVHHNTQHYPFYTSYVYKQYNERLYS
HCFCGLHENCYGIRS
>Cag_1761 conserved hypothetical protein
MSKQLALCWRYHAGNNALIWQIMFTESGLLVGQKRLVAEKKALFFALNET
SGEVAVDDFVLMEHGEDSTIPAGEGWFTGIVTTRHALTYCYATAPESPEH
LGMWAIDFREGKVVWSKAGASFVAHSKNAILAVATTFFAGFPERHFVLLE
PTTGNEQPATLTIEQVNAIRAAAEPEEVRQGVMMPDLANELLLAAFPIIT
EHVGNKQLLCCELLTHGNWLIATLHYPSATPNCCDSYLAMWQGDTLRYHD
YMEQAATRPLLNSVIVHNKHLFYIKAKNELCCLCLSTHHANHTDA
>Cag_0696 conserved hypothetical protein
MKASELDKKFDDNQEEVLDYFDISKIKMLNEEPTRVYIDFPSWMVDSLDR
EAKHIGVSRQAVIKMWLAEKLQSLNSQAEVI
>Cag_0999 conserved hypothetical protein
MVWLLGSFNPRAREGRDSLICLSFRFRSRFNPRAREGRDTGTRAKMLSKI
CFNPRAREGRDLWATVKRTTTKFQSTRPRRARLAFYLYSAISKAFQSTRP
RRARRGIEEESSNGGMFVSIHAPAKGATINSHHLCEIVGVSIHAPAKGAT
VDAYPDFTELPFQSTRPRRARHKNQECEISVYQFQSTRPRRARPIWQKLG
LGCDMFQSTRPRRARPHIRTNVFSK
>Cag_0219 chlorosome envelope protein A
MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGS
MGGGTSLRGSSPEVSGYAVPSKAVESKFAK
>Cag_0426 conserved hypothetical protein
MLTNSKHVVSRPNGGWAVKTAGTTRAGRVFENKIDAIKYARDAAKKIQGE
LYVHNTDGTIMEQRSYGNDPFPPRDKK
>Cag_0691 hypothetical protein
MSKREELLQSYENGNFLETVYACSSNDHNDRSSVVFDLVALNNEGLIDVV
GAFQSLKNESSNSPDFFLTRHVFEKALPELEASVPAVMHCVLQLYLDAGQ
DLAASTVINSFFDFCTKKASRPHEALEVIKASPGKLAHLLPATLIAGSQI
DSSFYLCETMRLCKDENIELKRWALFSIGKLNLPEDIKKFGDALSALEYA
AVQETDDQILSSVIKSAFPLLQRDKSQEPRAIAIIISALRKGDDYVLHAA
SEIFGFYTGELPTTLREAFFVDLLRVKPTHKGTLDNVDYGISHLLKNGNS
EQAIQFLEALLLRHSGELTIEVFDSAISEIVSNKAFISKVLTRWFLRGDR
VLCEAVHEIVWAHHGSGLLLEIDVTELNPSDSGHILFIARKAIGYLFMQP
LSAASVLISLMRNATDDKVLKKLGELLLDPLLLNYTGKARDYIIKQSGSE
SGKVKETIDNALKDINTYLEELRSVGSLSALHPSEAQREAYNRHYSQLMA
ESWKAAEAKSVVLNLFPKSVLLYGKKLINYVYGSDGQSHRQEIPLQCLGS
EMEYARMHIFDPFGMDYMLRVFRNEQLKT
>Cag_0926 conserved hypothetical protein
MHNSILRERSLTTCNSLITLQDIRTLKALYQLKEQTRILRLPVVNNIIKQ
RVVGQGCIESLKNALYSLQTIYIDDDTGQRRLQLDEAKDIAVDLTYERQE
LQKDIFYLEYGEDKFIEYLSKFSPNFIDYVNKGIEMFKGKHFNAFITDRD
GTTNNYCGRYRSSVQPIYNSVFLSRFAKNRCDYPIIITSAPLKDFGILNV
SINPAHTFVYAGSKGREFIDLDENFHSYPIDEKKQQLIERLNGRLETLLG
KEDFEKFNFIGSALQMKFGQTTVARQDITRSIHEDESVAFLEKVASMVRE
IDPKGENFRIEDTGLDIEIILTIDADAGHQEAKDFDKGDGLAFIAQTINI
KPNGNPVLVCGDTRSDIPMLTKAMEMYDDVWSVFVTRDERLIEDVMNICP
NTLIVPYPDILLTILGLLSL
>Cag_1321 hypothetical protein
MLIYRHLIWTEIKNSRLMRISLINVRRKMKTTLSLFFLLLAFSGTCRADT
ETLFTIRDLKGEMQVYSGAIGSVLPQIKKNDDAQSDAIYIQREKEGEKEK
FYIAHNNGRHPVILIQGEKSISFLESYGDNNFIWTVCLDKRNPDGSSLAI
VANIKAAGVAGYTTSSIMSGGAYTLLQPRTNK
>Cag_1786 conserved hypothetical protein
MSETFRQLLDESIQLELNLAKLYTAYNDLFEEDEDFWWDLAMEERGHASL
LQHEKNSPQQEPFFPENLLANDLDVLKAANKRILDLVAACKSTPPSRLEA
LRTAYELETSAGESHFQRFMESPASSFAANIFQQLNQGDRDHAERIQQYI
DELE
>Cag_0343 photosystem P840 reaction center, large subunit
MAEQVNPAGVKPKGTVPPPKGNAPSPKGNGAAGAPSVIKEQDAAKMRRFL
FQRTETRSTKWYQIFDTDRVDDEQVVGAHLALLGFLGYLMAIYYISGVQV
FPWGAPGFQDNWFYLTIKPRMVSLGIDTYSTKTEDLQWAASNLLGWALFH
IISGSILIFGGWRHWTHNLTNPFTGRAGNFRDFRFLGKFGDVVFSGTSAK
SYKDALGPHTVYMSLLFLGWGFVMWFVLGFAPIPDFQTINSETFMSFIFA
VIFFAAGIYWWNNPPNAATHLNDDMKAAFSVHLTAIGYINIALGCIAFVA
FQQPSFADYYKALDSLVFYIYGEPFNRVSYDYVEAGGRIVSGSKEFADFP
AYAILPKDGAAFGMSRTVINLIVFNHIICGVLYVFAGVYHGGQYLLKIQL
NGLYSQIKSVFITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICE
LNLFGTNIMMSFYWLKPLPMFQWMFNDPSINDWVMVHVITAGSLFSLIAL
VRIAFFSHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLW
GIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSVAGLQHHYTSGIFYYF
WTETVTIFSSPHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIR
WLGKKFLNRDVTFRFPVLTISDSKMAGTFLYFGGVFMLVFLFIGNGFYQT
DSPLPPQVGDASVSGQMMLTQVVDYLLKLIA
>Cag_0038 hypothetical protein
MTTAKLPPNQPDDYDSPWKEAIEHYFPEFMALYFPEAYAAIDWSKGYHFL
DQELRTIVPEAKTRKQVVDKLVQVQLLDGKESWLYIHIEVQGNRESGFPK
RLFIYNYRTYDKYDKAVASFVILADSDPTWRPSSYSYEFVGCKMTFAFET
VKLLDFEPRMEELLASDNVFGLITAAHLLTQKTKNKVKQRYEAKLLLMQL
LLQRQWEQARIDELLRVIDWFLRLPKELRQKLKIEIHKMEEAKKMKYVTS
FERDAKEEGIVIGIEKGMEKGREIGVLEGMEKGKAEGLEEGLMQGRLEVA
RRLVSGGMSKAEAALLAGVSVEML
>Cag_1217 conserved hypothetical protein
MPNVFYYPVSYPAWWLDYYYLATWALSLLFLGGVWAIFFRFGKFSYGIDL
GCFWKSALLVVCTTISLGGPMYYNTRFVGEHGQDGDSVRLADGKVVYSDR
NGNLRQLAIGDITTIYQESVTYNPPPKIFIVAKTAQGKDSLFVTTNLPNY
RQFIEELSKQSGVTATVR
>Cag_0633 conserved hypothetical protein
MSIFNDAEKRKRLMKGGLPLILAVAWMPIVGMVVLVIIGAPLATLVGWPF
TFVLVGAVTLSLLWLFLKLFRKSGHKIKQGNP
>Cag_1597 hypothetical protein
MSTNTAPRISIPIELSFATPIRYTTGDKPSDVVIGLLGADTYPDLIVANA
GSKSITVHFNNGLGEFTSSISYASFYNSPPLALSVAKINNDAYGDVVAIT
NSSVSIFLSNENGTLQTPTFYTNNGWQSLTAVATGKIDTDTDIDVVVTDA
TTNKLYVVQNDGTAVLNNQTPPSYATGNYPIAVTLGDVDKDGWLDALVVN
NDNTSTPTLSVLINNKIGGFKTKVDYTLATGALDVTTADLNGDGWLDIIV
GQSQEQGNTLVLLNKGDGTFGNTQSYSAGAYPLGVAVGLLGNDNRADIAV
ATSGEKTFAVLQNQGNATFNAPNTFAVVSTIDTPKPTDIAVGDLNGDGKN
DVAITSEHLDSVSLLLNTTFQTRNFTEQTPLLIAPTILIEDPENNWIDGW
LHVAITKNGEAGDDLVLQTSFNEESDLTDYIWFDKVNNGVRAGSSIILGR
FENKVTSTLGKEVGKIEIHFTNPNYKTNEWVQKIAQSILFTNESDTPSTA
TREVTITAIDASHQTSSITQQIAINAVNDAPQELFIEGVPIVGQTLHADT
STFSDADGLNKANITWQWLRDSVNITGATNSTYTVTNNDLGKSLSLQAKY
RDGAGNNEVFTVTTDTVKALNEDPLIARPSTVAFQTSTTLKSFTDASSLP
VIGTLDLNHDGILDLLVAKSSSAPSNQLVVLRGSSNGTFTQDSLTYTLGN
TPSAIAFGDVDNDTFTDIAITNKDSNSVSLLRVVNGVIKNPFTFSCGSKP
TALAIADFNDDGFEDIVTANSGENTLSFLQGNGNGTFAPTNSITTASAPY
GVVAADFNGDGKSDIAYSDSGNDRIVVCTYNNESWSEITNALVGDVPHTL
VATDFNGDGKSDIATINSGSNNVSVLLNNGDGTVATAKTYTVKSNASSLT
ALLASDVDGDGFADLAVSHTTGVSLLMNNGTGGNGTFALAQEIVSNRITA
PPTLASADVNGDNLTDILFPGSYTSINAQLNSQSSSATLFTEQTPIEVTP
NLTLRDPNGDASWRDGKLQVQINYNTTAYDVLALPTEKPELGGVWIDSEA
SNALMVYDQQTDTNLQIGTADNTSVSNNSTWTFTFNRYATNALVEKVAQS
ITFSNNRDNPSLETRTILFTATDSLGASSSATQHITVQPVDDAPLTLISA
TPVDGAGNVEINSNLSFTFSENIVFGSGFIELHRDAPNGALIEHYDVATH
TNLGLNGTTLTIDPYNQLAYGTRYFVTFGEGAIDNGYGTTFSGSEYDFLT
ATDPYVPPTPNNDGSDGGSSTGTILIGTGSLALLALAFL
>Cag_1552 hypothetical protein
MPISLVYLSAALLLLSLHPLLKAYVDIIYMVVWGTFAWGVYANVLKKKLL
LPLAFLPFVVLFNPIDPPALPLYADIAMKIGGATLLLFTRKHIAV
>Cag_1391 putative type II restriction enzyme
MNQWIELSIEYANQRSYLDDLFSVYSTIPDSIRTINEKLWSNVERAFYEK
DNLSLIKELLLLDLFPIKDSYIAYLKKDITAIDRNPRTINRICGRLYEMG
LNKIYEKSSEPKETNRQIGPMFRNWMRKKSLGIEPVDLSTFMNNEDDAVL
DASDKVMMDFAREYLGYQHNKGLDLIARFNGTYVIGEAKFLTDFGGHQNA
QFNDAINTIEAKGVNAVKVAILDGVLYIKGKNKMYKAITSFYKDHNIMSA
LVLRNFLYSL
>Cag_0601 hypothetical protein
MVAITTTELRKNFKKYFDIAHSERVIVHYGKNKSYEIIPTQKECENDAYF
SNPKLLAALKEAEEDIAAGRFTEIKDPKNLWDSIK
>Cag_0600 hypothetical protein
MVITLTPEFEQALHKIAEHNGTTVELLVLKTLQENLLFCKPQKTLFRKSE
KTLADFLVGYVGVFDSDELVKSGAQMSTNVRKQFGDILLEKRQQQKL
>Cag_1922 sulfur oxidation protein SoxX
MTHLITLAVTTALLLPAIVNGAPRAESIAKGKQLSFDGSKGNCLACHLIA
DGEMAGNLGPPLIQMKQRYPKRSVLKAKLTDATASNPQTLMPPFGKHGIL
SNEELEQVVDYLYTL
>Cag_1263 hypothetical protein
MSKQDLKLMPRFLKGLPNIWTILGWILIISSFLKQPVVAGILIALAGHLL
TQAKHRDDLKEKQSLFYLDAWVKAYEEAQSLLKDGNNDRVEWIAAARALL
HAEQLEKKITEDAHLELLDLYKLKYRHFFYSIIDNKPESFFHDENDRDKA
LSEPSVYAIWKAAQWSEEYNDHSKDPLKRKFADDEVEKTKFASIGLYRFL
KKKRTR
>Cag_0250 chlorosome envelope protein E
MATNISGAFTNGAAAYGRFLEVFIDGHWWVVGDALENVGKTTKRLGANAY
PHLYGGGAGSAGSLRGSSPTVSGYAQPSKPTESRFND
>Cag_1528 restriction modification system, type I
MEKIESIKTLYQQSHNNLETLYNALSQKAFKGELDLSRVAVPEETADGKR
RD
>Cag_1606 hypothetical protein
MSTSGAIASLDAFLHRWKAKAGNYTGDYITTPEGLVRNNMDDEQGRGGYY
QEYACTSESQVMMARGYLRAYQATGESRYLQNARTAMQALIRYFFFGKVP
STATAWRSHWIVNAGAPFKSKENGRTTDTIAFGEAYECWPTWRKLRPNEF
ATAGDSMHWFIENFHLFSQLETEDQKGQWLAARDAMFREFKLLLSPKWQA
KYKGAIPFEYTNKGDNLTVRSTSIFRGPYYTGYQNPLPWLYMQDYTAAAN
MLQLLVESQVAYTKSTGVKGPFAPVYHYDASLLGSAKKNVFTWNGPDPNT
FWGGFQYRPFADVAHFWYHCKRSNIQNAAVSNASKVCMSFLSWLDGWLTA
HPNNEYVPTEFREATQPSAPPANGDNDPHMIALALKGALFCKNAGADAAM
VGRVIARLYAMVMKRQSKAGDMAGAFMHDPYSHIFKGFWAGDIMEALALY
IMHHEKG
>Cag_0237 killer suppression protein HigA, putative
MILLLLFCTLHMNLRYSTRKLEKTVETFSAIKKHYGIWGKQISQRLADLT
SAKNLADMYTIRPAHCHELKADRATEFAVSISRNHRIIFIPDHDPIPRKE
DGGVDVNQVNSVIITAIGEDYH
>Cag_1579 hypothetical protein
MITTRKAMYLNPQFIEKAGKKEFVVLPYEEYQAIEKMMEDYMDLIDVRET
KAETQNQPSVPLDEVITMLKKRMNV
>Cag_1571 hypothetical protein
MPHCIHRAVHSFAEITNLPHPADLVGTWRRFGLLGPVYEIICMGNTLPNG
DVMMRVRVVESGEELDYRFADILDDPKER
>Cag_0968 hypothetical protein
MGAAWSGAFSFNHYKRTFVMQLTGKLIAILPEQTGAGKNGPWKKQDIVLE
TSGQYPKKVCVSFWGDKLDRQMLQLGTMLSISFDVESREYNGKWYTDVKG
WKAEVAGRAESAPYGGGDDAGSWEPPAFEPTSSNEECPF
>Cag_1532 conserved hypothetical protein
MLFAGLALAAFGLLITIASKAGATNWLSWFGNLPLDLRIEKENFNLYFPL
GSMVLISLALNLLIYVFNKLFR
>Cag_1024 conserved hypothetical protein
MKPLPVGIQTFSKIIEDNYLYIDKTDIAKSIIEKYQYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKDLLIYNQWNWAVTYPVIKISFSGGIHSKADLEE
DLIQILKANEKRLDLKCENRSKAKYFFAELIQQASEKYQQSVVILIDEYD
KPILDNIENIPEALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNSDFGNICGYTQNDVDTTFAPYFEGVDMEEVKRWYNGY
NFLGDKVYNPFDILLFIKNQKMFRNYWFETGTPRFLIELIKKNNYFVPNL
NKLRINESLANSFNLENLNLETILFQAGYLTIKRLISTNKGVSYELGFPN
KEVQISFNDYLLQELTTVSENELICDDLFELFNNGDIANLEPVIKRLFVS
IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRIDLILKT
FDKTYIFEFKVIAEEPLEQIKKMKYYEKYDGERYLIGIVFDPKARNVSQF
AWERV
>Cag_1329 bacteriochlorophyll A protein
MALFGTKDTTTAHSDYEIVLESGSSSWGKVKCRAKVNVPPALPLLPADCN
VKINVKPLDPAKGFVRFSAVIESIVDSTKSKLVVEADIANETKERRICVG
EGSVSVGDFSHTFSFEGSVVNIFYYRSDAVRRNVPNPIYMQGRQFHDIIM
KVPLDNPDVIDTWENTLRAIQSTGAFNDWIRELWFIGPAFTALNEGGQRI
SKIEVNSIGTQSGDKGPVGVTRWRFSHGGSGIVDSISRWMELFPVDKLNK
PASVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGM
VGKFNDFTVDSQLRIVLPKGYKVRYAAPQFRSQNLEEYRWSGGAYARWVE
HVCKGGTGQFEVLYAQ
>Cag_1638 conserved hypothetical protein
MRIFAWLLVIVAVVLQGCYSFSTENRLTHLHTIAVPIFNDHSGAGIAQSR
SELTKALIDRLERESALRLIPSLSLADALLEGTLVAYSDVPAQLSATTGR
AATNRITLIVQVELEERSTHELLFSERFVGSAEYAIGNMVAQQEARRRAQ
HQIAESIADRIISGW
>Cag_0551 conserved hypothetical protein
MALHGEEQRPTSPTRLPKRHPARRGYHLDMTPMVDVAFLLLTFFMLATTF
TPFYSMELSQPQKQRRLAVQAEEVLTMQVANNGIVHYRLGSNALSSMPLY
QETATTQEPSATPMLNPALHHFLRSLKAEQPNITVVLSMNNQARYRDLVA
VVDLLNSLQITRFSLGEIDGEEKQKAGVK
>Cag_1304 conserved hypothetical protein
MAFIKQISNAMDAQLLQITAQVLFEKLQSLRSKMQDDVATPDDRVAAAML
EEVVALARNRYCVAGNEAEFQQGKSVWVERTAATHFPHIRLHGGALEDDP
IA
>Cag_1009 CRISPR-associated protein, CT1134
MKNSIEFKVYGREAMFTDPVSRIGGEKCSYHIPTYEALKGIVKSIYWKPT
IIWVVDKVRVMKPIRTKTKCLKPLKYHEKGNSLAIYNYLCDVEYQVLAHF
EWNLHRPELAQDCINGKHYSVALRALDKGGRQDIFLGTRECQGYVEPCKF
GEGKGEYDNISELAFGFTFHGFDYPDETGQKMFRSRFWNPKMVKGVIDFI
RPEECLPEHCKPIHEMSMKPFGIDTNQRSVIKEEEVWQ
>Cag_1036 uncharacterized conserved coiled coil protein
MHLKDAYKKKAEAELELAQAKLVELRAKSKNFGAEAHLNYAKHLDDFEHA
ITNAKHKLHELGEAGEDAWEKLKDGVESALRSSSNKLRDIANKFKD
>Cag_1895 hypothetical protein
MLLGEMMIQGVAMSAYSWSGYNGTGYESLLQQAVEVGATSVLLGSVSIID
LNNGAVSAWVRDDGFTTTASMGDVEAAIQQAQAHGLQVFLKPQIHSYNPA
SAAFGGNPYNNLINPDPSNPLIIPNLDLFFEGYKAYIVEWAELAERYQVP
LFSVGNEMVAVTSAEFTPYWEDIIASVRNVYHGQLTYAAMTDVKWDSNDE
VSHIEFWDKLDYVGVDMYPDFDTGATIPTTPTVEQLNDIWVEQKWQSYLS
AIAEATGKPLLFTETGVASFLGGANRSRYTDALISQMGTVRDDATQTNWF
QSFAETWMGENQPEWFGGMYFWNNDPPYNAGLQDITGYTFFGKPAEVVVS
SLFDAVNSLDFDQTLFLASDSDDRIALYKYIAEADANPLTRAQSYHSTVI
IELNGTILEGAEAVTPTIHFYLNGKDYGAVTLSNVESEYSINKGIEAASK
GEAYYPHSTLIPFLFEIDELEVRDIHIVRDSVQVENSEVYISRVTIVPDM
GAATVNTTVNSLQNAWLAFEEPSQAWGGATGYQFPNGAIPYDVPSVTIDT
SPYKKTLATMSGTPDNPITVKGYEGFDTVYLLGSPEQYTITLEGDMLMVA
ESSGLGQNSQLGGVERLLFAEADYALLFGGMGNDTLYGGAGNDRFNGGDG
NDVVLLSGYATEYEVSDNEASATYTITDSVAGRDGSYQLSNMEALQFGAS
PMQWNLEEFRAALAASQLLPQQEEPFNVTGSVTFWKNGAAISNVATTLSL
HSVTNNGEELLFQHLQHHADGGYSVEVWANATDALHSLQFEFQLPTNAQA
AWHFSEEVPQGWQTGVNNQGADALLIGGMGATALPSGLVQLGTLSFVAPT
DADRLEIALTRGELGKQWLVPATITLESNVLASNGGYQHNALWQGSYHLS
VQHESTEEPTNMVTMSDAYAALQIAAGHNPNESEAPLQSWQFLAADINRD
GKVRASDALTILKMALNYHDAPSEELIFLPEWVGKSEMTRSSVDWSATEI
MLDVENYQIVNLIGVIQGDVDGSFS
>Cag_0468 conserved hypothetical protein
MSWLVLLAMALTLFFSFVVLFRKFLGYMKKEQNLVIEPLKDALIDKDNPV
GLNPEELQKLKQQQQEAQRHLSEVIAKIPVIQKDGRFQIDQEAIQKRKEE
LLKTENISKN
>Cag_1881 hypothetical protein
MNIINSLIEKLKGAEIKMAHEKGAFDLFALFMLDVIPEQWDVVAAASWIT
DENYDASLRYIINCIQPLLSSKELFSISGVVLIDQYNPGLDAVLEAIHVE
HGLVEVRDTTFFGLDIKHGFVITSCTRHGCTAKSA
>Cag_0476 hypothetical protein
MEKLVDYFAHHPVVFFIAVVFSFFVVFAFFRKIVQTLFVIGALMVLYAAY
IHFTGSPIPDIFQHIWQWMVNLYQTILGLILRILKKEPEEGVEAFIIFFA
VPTCHLLQGIQQRCCASGDGKHGF
>Cag_0376 hypothetical protein
MPLNLLKKYPELLEIAHMSEADRNSSLYAIFNRDFVQNDNLYFQGKKVRP
IKGEDGAIAMDVLFQHLTTSSDKEKSSLNRNARTFEMARSCRLHWIRYHI
DKSAGKGVKIFSCEERDQKRRKDVIRTYIFDVDQKYVIILEPQRSGNDYY
LLTAYYLDRDDGKKQIEKKFKQRMKEVL
>Cag_0271 hypothetical protein
MYPSVKKVVPNENYNLTIDFNNGESGTLDMKPFLNFGVFKRLKDMNHFRQ
VRVSFDTIEWPSGIDMDPEFVYSKCKKSTPPQVEPADA
>Cag_1697 hypothetical protein
MRDAQIGHLDVGVGSLMLNYGFWIIVRATLAVAPNKTIHTKQEFAQFFCV
GADSISALSSVRAKMDFAPTPYAPEIWINRF
>Cag_1615 conserved hypothetical protein
MQSFKHISLKALAGLLLSVLLVALMALVLLNSGSVDRAARALAMQLFQKE
LHGRLEIDELHLTFPNHVTLLHPRVYAPNEREPVVEAARMTARFHFLALL
QPDIKKLAFQSLEAQRLKVRLVQNEQGSLNIERAFASRYPDTTKTGIEEY
FCKQLSLKQASFSYSFIKDGKEIPLARANNINAQLRSFTAGKALVKGEIQ
EFQSNIASHALVVQKMQGRFFFSDKRSEVLDLQTRIGNSHAILSATLEGV
SLFKPSLLQQVAQSNAFVAIEEIDLHSNDVKRLFPSLPLLEGIYQVKATA
KQQNGTLELREAQLVYRKSKLALQGTIQHPFESNKLQYNLQCDSSKVSSE
LLTALITNKEDQQLVTSLKSIGDITLAGKLSGNLSALQADVQSLTNVGMV
GFKGAIERQPNNSFAAHGNVALNALKPHLILGMADVKSQLNATGTMELLV
EPNALPEVALALQLQNSFWQHLNVTKGTLSFRHKKQLYEGALSLSNGSEN
LAVQGTVNLNSAQPTYDLTGTTYKLNVGQLLQSKNFSTDLNSRFTLQGEG
FDLRQLNLQSSVVCAPSVINDVVLPNGTAATLSIAQQGTASRVKVTSDFF
DVTAEGHYTFEDLTALGWMALSGISHEVARLNIWGETAQLNPPANMLTPQ
PFTVTYQLALHNIAPLLSIIAPLQQIAMQGTAQGKAEHNGGAYTIAGTID
LNNLIVEEEFAAKRIHLQGSLRGNSNGILEAQGRGAIAALRVGKQKVRNT
NVTAAYVPTTLTSSIDVDVADVVQRVSTSFAMKQQGSGYLLDVQQLNVQD
REGSWQANNNLPIVLDKEVLRFNNFTLMRGAQKAVLQGELSNNRASAFTC
TLSSLQMNELQHFMLNAGLEKLQGIVSATLHISGVPGAKQSSISVRADNV
AYDDLMIGIVQGSARHSNNLLHFELQSEAPAMVNGIQSAQRSNALLSTIE
GSGTIPLELKYYPFKLRVNEQQNVHATFRSDNLSARFLEYLLPFFSAAEG
TIPTLCTIEGSAAKPLISLHSRLQDTRITVKPTHVSYRLDGDIYATPQAL
ELRNITLSDNNNGKGSIRGFVHLEKLQPSRLELAASCNNLLFYNKKDQQD
DTSFGSIVGSTRNFTLTGSLRSPIVEGEVQIDRADYSLYSAGANESAQYV
GIDNTISFVARNPKPKAPKAKELKSGGSKEFYYSLIDILTIHNLRISSPM
PLKYTTIFDRIRGEELETTLSNLSLVVNKNSQRYRMFGSVQVTSGTYRFS
NASFELQPGGSITWNNVDMRSGVLENLYGRKYINALNPQSGERDDVHLLL
AMTGTLNEPQVAMGYYLNDQTQPYASSTTIGTETSKVDPNAELNVISMLL
SRQWYSKPGDGGTQENVALTSASFSAGTGILSSRISRVIQTIGGLESFNV
NVGMDKKGELSGLDLYFAVNVPGTDGKVRVSGSGSANDPRTANASTAYGS
NQKVEYRVTPKVYLEASHSSGQNSGISSSSSTLQKPTDTWGVSLSYKERF
HHWDQFWKKLLPFSSDKASDKTPNKVPDKASNKKENKPNE
>Cag_0709 hypothetical protein
MFYRKNFLGIPEQVLHGDGFTIELSRNEVVLIDIYNADLLLSKLADEVLT
AKAS
>Cag_0877 hypothetical protein
MIKTAVFVEGQAELIFTRELILKFFEYKNIWVECYTLFNDQELNPTEYSY
KSDTVNFYFQILNIGNDNKVLSSILKREKYLFGGDKAFHKIVGLRDMYSK
EYRDIVKKSTIDSMLNQKFIENHNNTILEKSRYHKKISFHFAIMELEAWL
LGIQGLFEKMDHRLTNEKIAEACRIDLFKADPETAVFHPAHLINDILHII
GASYTKKKDEINKFMSYIERDDFARLLNSEKCQSFTSYCNALVIK
>Cag_1280 conserved hypothetical protein
MDIKAIAQALGMAIFRYPALWRKLHHEPASNDGSMLRNYAVPIIALVQLL
KFPLIGEPRPAMFLGIVSMLVDSAVLYVLAGGVLALLPIPRTEEAKGQVM
TVFCYALTPCWLAELAYGHGVWSILIALFALLHALASSREGLVRLLSLEV
QSASGALTRSAFFMVLISSISFFILSAATLLVSF
>Cag_1578 conserved hypothetical protein
MINQKNIGSSFDEFLEEEALLDEATAVAVKRVIAWQIAQEMKAKHLTKSL
MASKMQTSRAALNRLLDATDTSLTLTTLSSAASALGKKFRIELVS
>Cag_1962 conserved hypothetical protein
MSEQSHADKVQLAYAAVLGKTSTIGIGLIVVGYALYVMQILPATASPEVV
ASHWHLRASELHQAINVPNGWDWLGNMGYGDVLSFASLAYLATVTTICLI
TVIPVLLKENDKIYAVITTLQVLVLLFAAAGIVSGGH
>Cag_1946 hypothetical protein
MSKRQSFIIGGVLIALVASLWSFWPTLQAEKVPDKPSVTASVPIDSTVCI
APTEYMRSHHMQILQDWKRTGARDPRPHTTPDGRKFQKSLNTCLGCHSTN
SYFCIMCHDYTHAKPNCWNCHVAPFK
>Cag_1917 hypothetical protein
MDFGWLSIHTSVAEAWSLILRNARSKTPESIYRNLNFQDARISRIFCVGA
KTYVFTCLLRILGNRKGLYLHCKRNHGSDFSGNMRFNTK
>Cag_1678 hypothetical protein
MFISVHPDAVLDDVDEAFDEPNYNNYNTSPEPPSPYADLEKKYRKNRFNR
QRQHSNSNPLKDVTAVNGNRITSTQKANGSKPQKPKAYKPKSKPSNSTVA
AAPKSAQATYSKHSAHPTKSTSPAKASTNGKTFAKVERTPLSPRAAHTPS
ASHTPHKPHASNSPHKTHTTHSPKTAHAEKSSQLTNQAKSAPTNQQAVRP
SRSPQLSQLPPLPKSSQSSQSPQSPQSPQSSKTSKSPHSTTKPHSSQTTH
SPRQSRPQQNKKRPV
>Cag_1340 hypothetical protein
MSAIKKLFGILWALMGVGIIPLAIQQAMKEIAEKPSEENWIFWSIVMVVL
MPIIAFSLITFGIFALKGEYDTIE
>Cag_0761 conserved hypothetical protein
MDNNKKLQQLFENDPLGLLDVKPSNSSARNENERLVASFQEINEFFEQNK
REPKADNGIQEHQLYSRLKSIRENPTKSEILISHDIHELLNTKPKAPVSI
DDIIENDPLGLLDDDTAGLFELKNIKPNEKSRAETDFVARRKPCKDFDKY
EQLFKTVQKDLKEGKRKLINFKLGNLRQGSYYIHNGILFFVEKIEITKKD
HYKPDGTRVREDGRTRSIFENGTESNMLKRSIEKILYENGQVVTEHSDQS
NLNYVESLFAITDEDKEAGFIYILTSKSEKKEIKEIDNLYKIGFCKTTVE
ERVKHASQEPTYLMADVRIIKAYQCYNMNPQKLEQLLHNFFGNSCLNIDV
SDKEGNRHTPREWFIAPLGIIEQAINFIISGDIIYYRYDAINQEIVEK
>Cag_1797 hypothetical protein
MDNTTQIGIQYSGFRFKSFFFQGLFDEQENEAFEFQTSLDIRTGSDRVII
GVMVLVNRKSDAQTYAKAETESLFLVEGVERTKDESCSLIIPQVLLITLV
SLAISTSRGALLVKGAGSFLEKIPMPIVDPKVFVSEIQFLGQS
>Cag_1372 hypothetical protein
MFINLLKFMPSRSVQAFRDNIVDVDRLIVSHAQLRDGSPGKKGLGHITRS
GIVMLCAAWELYLELICVEAAKYFCLKCQSPDQLPIRVQKELSKMAKESK
HELKPLEFAGNGWKNVFITHVEDLCNTINTPKAGPINELFNRSIGLELIS
DSWSCGKDQINNFVSIRGDIAHRGRHADYIKISLLQDYRALIYNATIETD
NTVSEYLALKTPGKHKPWRVTS
>Cag_1002 conserved hypothetical protein
MCSTKQPTSFNPRAREGRDSTPTFLRLATKRFNPRAREGRDCVPCCVPFK
VIVSIHAPAKGATGKADEEFDAKIVSIHAPAKGATQRR
>Cag_1511 hypothetical protein
MSMKLILTVFALFLATQIADAKDVYVNGYYRQNGTYVRPHIRSSPDAYKS
NNYGPSKNSYELMNPKARDADRDGIPNYQDNDDNNNGISDNNE
>Cag_1738 hypothetical protein
MIDKNELLKRISAIEQSEESVISIYSSHIQHVLRYSNINKESQAKIIEML
KQLDSDLEEHKIVTKQLVDAIAKSEKSIF
>Cag_0975 chlorosome envelope protein B
MANETNDFAGALNNLMQTATSIGQKQIELVTNTAQNLVQLAEPLAKTAVD
LIGSITNTAGQLFQNIASAIAPKQ
>Cag_1100 hypothetical protein
MALIRECQPKTIQEWEEWYFKNATTAGKNNFKITRESLQELGERLYEKIT
EVVIPEWQEAFNALTIEDCYNYIFNLTINRTFDGYLREKSVVNDGLAKEF
PQIRFDESPSELDHAGDIDYLGFVSENKAFGIQIKPVTAQSNFGNYSVSE
RMKASFHSFKEEFGGNVFIVFSLDGEIANTQVIEQIQMEIERLQSEQ
>Cag_0260 conserved hypothetical protein
MLFKAEKNFTVNFLPLVIMEQEVPVTIHQNQESADEAARQHGTYRAPAKD
FNETISEAWTQFRESEAGEALSKGSSAAKEYIQQHPTQAMLLSVGAGALL
GLLLKRR
>Cag_0526 conserved hypothetical protein
METTNSSNQKTKEPGFGIWLGITLLWGSVFFWSSVLALQFVTGWMGEGMF
QPAGSGLMRVYGVHVMVLVLFALLAMIFKRMVDPGATRQATRRQEIDAGK
GERIFISLLGSIATSFFFTLLTALTFALAAGAVGVPVALTLPVVFVAGLF
NIVAGLAASLLVGILFIVAKVGKK
>Cag_1291 hypothetical protein
MPKHVAKEFIPSLTNKDNIMQAIEFESTIHNGIIQLPNECQQWNKKLVKV
IVLEKTRASIISKPRRMPHPAIAGKGKTIGDLLEPIVNKSDWECLQ
>Cag_1913 hypothetical protein
MYQTLINRGALKYISSIERYAMEKGWSEGMERGKEQGKAEGLEEGLLRGR
LEVAERLVASGMSKSEAAVLAGVSVDMLE
>Cag_0110 hypothetical protein
MGKANNQPNKSKPKVSVSIGRLLMVLIGANFIMLLAPNFGMLNSFLYIPD
LYTWPAFVLGFVLIFFGFKGLYKKP
>Cag_1894 photosystem P840 reaction center protein PscD
MQSTLSRPYTGNEQVRANVAGPWSGNAAHKAEKYFITSAKRDDYGKLQLT
ISPASGRRKLLPTKEMIGKVASGEIELYVLTTQPDIGINLQQKVLDNENR
YVIDFDNRGVKWTMRDIPVFYDSLRQQLCIEIDRRTYTLNEFFK
>Cag_0269 hypothetical protein
MERVALTTDEQNLWDQIYFSEKTIQIDHDKPRESIEPAYQLAQSLLKRKV
IPQIRMRYFTDPKLNIGGRDKSRKEVFERNGTSGDMILRHPHFHKYLRYF
VLGPDIPLTAINEFVVLANDCDPITSGDTKEFCNLAWKQIRNSGQDTKYA
AEEYFKLGLELELGEDVAYAIRDTIMRMR
>Cag_1722 hypothetical protein
MKLDELYQVFLPTFTMVEEEWNNFLAEKNTALSEAQNYLNAIFSITGYNL
PTYEEISSTRYLGVYENGAQVTFEGSGIDKLFGGDMTTGTLSLSRIALDS
YANNIHMAMLGYNNGITVDLNSGAVSGMFNEYSLTTPWFELSAVGTVAVS
GASILSEMNIEAELTELSFTYPDDGVIVKLLGDIDYYQDELGNMEYSGDV
YTAYFTGWGADITLNGDFQCDFDINNTFILFSDELTGELYVSELSLVIPS
QHVVANFYNSLTGLSYDITSGSLGGSFNSFHFGTSLFDVYAHGSIYVEPV
SSSSTLGIHGILDSIEITYPESTFAITVVGDVEYIQIDQGEYTYFGTMSE
VYLENPTTTVSVLGDFSGSYNDSNGLHLAGNLYEFHWQREEAFISFVGDI
VFGEDQLVVNEVTTLEVYGDGRYYDASSLSVMNIVTDVVGNELLALGEDA
NWDISAALDELLWDVINKLNGDAEGVSSVNFDSVPASSTPVNAEFLDFYL
DLSKVGEAGYYASFRVGHLYDTNGDGLPDYVDEIHDSPATITWNNGMFTV
LSLDDSSTRATGSLAYDGNGNAVGLYAFDRASGDSETTPPTLIAATPSDN
AMGIEVESDLSFIFSENVQFGNGTIEIHRGSATGEFVESYNIGTPLSTNL
NIVGSTLTINPTSDLASNTHYFVTFSEGSIRDLDGNNYVASQPYDFTTGA
DPYPTHTLTGNITFWKTGEAITDVTTTLTTLPTNGTHAIELKNIHVQANG
SHTIEVWATTPNSTTGSFECEFALPTGTSVTWQDAAKLPSGWMTTNNVIA
TGAFRVASIGTHALAEGAVQLGTLTISQSANPGTFELAMTHAQLGNNDVA
GYAISSVSSTTGSGNEYQYHSLTDGHYALTGDKAAGDAGSAVHANDALAA
LKMAVELNPNEANANGLLGPVSPFQYLAADINRDGKVRANDALNILKMAV
GIESAPTDEWIFVAESVTGKTMDRSHVDWSDISPIVDFNQTAIELDLIGI
VKGDVDGSWVMVG
>Cag_0750 hypothetical protein
MNFGAWDGKHFSNCNHLIANQWKGCFIEGNIDRYRELVATYSENKDVVCL
NFFIKYQSRLLLIEFNPTIPNDVIFIQEKSNNVHQGSSLLALIILGKEKG
YELVCCTTCNAFFVKKELYSFFNLKSNSIYSLYQPLCDGRIFHGYDSKIF
VVGMSKLLWSNISIDSSDFQVLPKSMRYFNDAQ
>Cag_1337 conserved hypothetical protein
MKKMLSLAALLAAISYATPASAELKIGGDASLRMRNEFNAVDPGNNATDD
VMWQSRVRLNASADLGDGYYFKTLIMSEGGAAGWLNTTGNENYALAASQV
YFGRNMENCNYKFGRIPLNSFSNPIYDLTLYPAQPTDTPVNNLNFDRLYG
ASYGTKMGGGMLNTTLVVLDNSSTTAGTAAYDGMLNDGYALSVAYTTTWG
NVTVEPQIFAVLTNANVTTLGNNVTPLTFGLNASGKVGDGKLSGAAFYTS
AGDEADYSGYLLRVKGETGPYMAWVDLTSTTNDNAAGATVKDYTNTFVWA
QYKYTAYKSAAGSLTLQPTLRYRASSTETAAGTEADTSVLRGEFSATVTF
>Cag_0746 hypothetical protein
MNFDKDVSQQQLLTNIFIISWAGQHENAIFIANQISFVTNKITIVYSDPN
SDFLLDVPCVLIKRPNDLFWGDKFEASLHACKDDFMLVIHADCKCDDWKG
LVIRCNEIFSKNKDIGVWAPKIEGTPYYLERTKIASIEYNALSLSLVAQT
DGIVFALSLPVVNRMKKINYMNNKYGWGIDWIFCCTAYALNLMVVVDEKH
TVIHPLHRGYDTRQAVMEMNTFLKQLTTVEFIQYRLLSSYLKLSDIKTIA
KV
>Cag_0535 conserved hypothetical protein
MKPLPVGIQTFSEIINQDYLYIDKTGLASNLINKYKYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKNLLIYNQWNWDVTYPVIKISFSGGIRDKESLRK
NLFYILKDNQKRLNIICEEKEDPNQCFAELIQQAFEKYQKKVVILIDEYD
KPILDNIEKIPEALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPDFGNVCGYTQDDVDTIFAPYLEGVDMAQVKRWYNGY
NFLGDKVYNPFDILLFIKNQRMFKNYWFETGTPRFLIELIKKNNYFIPKL
GKIQVNEFLVNSFNLENLNLETILFQTGYLTIKQLLLSDVGVSYELGFPN
KEVQMSFNDYLLHDITTVSEKEPIRHELLAIIKAGDIANLEPIIKRLFAS
IAYNNFTNNYIESYEGFYASVLYAYFASLGFDIIAEDITNKGRIDLTLKT
FDKTYIFEFKVIAEEPLEQIKRMKYYEKYDGERYIIGIVFDPKERNVSRF
AWERV
>Cag_0685 conserved hypothetical protein
MAVTLKIHELAHQELLDAIAWYNEIQSGLGKRFQETIMLQIQKIKQHPTW
FPRETIEVFKAYVPRFPYKIIYSVNDEAITIWAIAHLHRKPSYWQSREKS
>Cag_1524 DNA-damage-inducible protein D
MQSQEIQQLKEQFDALSHTIPDEDVEFWFARDLMEPLGYTRWENFMTAIK
RALESCETTGYAVDDHFRGVTKMIGIGKGGQRPVEDFMLTRYACYLIAQN
GDPRKEAIAFAQSYFAILTRKQELLEDRMRLQARLDARERLRESEKTLSQ
NIYERGVDDAGFGRIRSKGDAALFGGHTTQAMKERYGITQTRPLADFLPT
LTIAAKNLATEMTNHNVSQDDLHGEHAITREHVQNNQSVRTMLSQRGIKP
EQLPPEEDIKKLERRIKTEEKQLVKHSGKLPVAKNQD
>Cag_1021 conserved hypothetical protein
MKQLPVGIQTFNKIIEGDYLYIDKTDIAKNIIEKYQYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKDLFIYNQWNWNVTYPVIKISFSGGIRDKESLRR
NLVYVLKDNQKQLNITCEEKDDPNLCFAELIQQASEKYQQKVVILIDEYD
KPILDNIENIAEAIIVRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPDFGNVCGYTQNDVDTIFAPYFEGVDMEEVKRWYNGY
NFLGDKVYNPYDILLFIKNKYVFDSYWFETGTPRFLIELIKKNNYFIPDF
LTLKVKKSIVNSFNLENLNLETILFQAGYLTIKRLISTNKGISYELRFPN
KEVQISFNDYLLQELTTISENELICDDLFDLFNNGDIANLEPVIKRLFAS
IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRIDLTLKT
LDKTYIFEFKVIAEEPLEQIKKMRYYEKYDGERYLIGIVFDPKARNVSRF
EWERV
>Cag_1210 hypothetical protein
MIAISPTELKRNLYKYLEQAQSEQVIIQCKNAETYAIVPTGKTSETDRLF
LHQNIKDRLRHSLEQVKEGKTYQLTKAEINSFLGHYDDK
>Cag_0502 conserved hypothetical protein
MAPKQAKSEESAMPLSTINYIMIACGVVVIAATYWGMALERSVDGFFSLV
VSPILLIGSYLWIIVGILYRGKSSSNAKKR
>Cag_1535 conserved hypothetical protein
MEQPNSDNIVKTAVGVAGGSALLAPALPLALPALPLAMPVIHGLAGIALI
GAGVFAVVQAAGAISSLDNPFQPKKPK
>Cag_0593 Nucleotidyltransferase substrate binding protein, HI0074
MQQDIRWKQRLQNYSRAIKLLQEVPELDREKLSFLEKEGIIQRFEYTLEL
AWKTLKDKMEEDGIILDKISPKMVLKEAYKAKYIDNIELWIEMVNDRNLL
SHTYDFETFEEIIIDIQYRYTQLLSDLYINLIESQL
>Cag_0385 conserved hypothetical protein
MTMRDHTPDFRMHELSQENKALIGSTVKQLLEKLAVDGRLCSEALLEFWV
EVAGAQRPRGTYRNGCLMPDSFIYIRDYFRASESGTLLAGESYVKDGTHD
LESAWDDMLDELFYQIEIFTSPVSTGKGITLELWAGCRQRPEGDWVYAVD
TKVELE
>Cag_1277 hypothetical protein
MSLTLQKEDAHKLIDQLPMDATWDDLIHEIYVREAIEHGLRDSQTGATKD
VHEIRAKYGLPL
>Cag_1745 conserved hypothetical protein
MEPKRSLKRGAGLLMSLSALSILSLVFMAIWVNWEKPRQAAEPLTPEVKN
LVDRIPSTTDALIYIGMKDIRQSRLWQEVIPDSLKQAPLFQPTGELATLL
ERSTINPSKDIDTLLISFKRHGYKEQLFLAIASGNLQTKLPKAMQAGNHE
TLGGHSCYSFGSSLWFSQLNSRRVVLSNSKELLGNFLQPQGSFLQRDSLT
TTLIDKARYKSHLWFALPSAAWTSGALQSLTSSNKDVKSIGNLNRIKHLT
LSVNFKDGIEAESEWLYESNQAAYFASTFLWGAIQLPRLSEKNEQTRALL
DNIAIQQNLNSVIIHTALPLQIFQTAKEQPAP
>Cag_0833 hypothetical protein
MLTISTTYDVQSDRLVTIKLPQEVYPGKHELLIVVEQQKKEKRIGTTIAN
SIMRFAGTVPAFRSLDGVSFQQSIRMKWE
>Cag_1576 hypothetical protein
MLVSTSIRINQELYEQAKQDAKLEHRSIAGQIEFWARVGRAALDNPDLPV
SFIAESLASLAEPREHATPFIPRSSKQ
>Cag_1737 hypothetical protein
MFINKDLLEFFGSMMEIKKQKRDIFNALAADVDDPEIRNTLLRIGADEQR
HVDQIQQSINLVNSGSTAEPMVPEAAPAPQVAPAPAPPAPTIAPATLQPA
IAIAQPAIVRPEPPQPVAPMPVPEPVAAPPTYVQPVQPIAQPITQQVVVS
EPVAPTVSQLQQPAPPQPLTYLTPTIATPAAPAEPAYSEPAEQSISSFAS
PLSSGTQRYPVQPPTSKTFENMTTLHHPLGEVFGFAATDQSPKAQRYRSH
RHCPFNNKSPNCTNSHTENPLGVCSILHNNKAIITCPIRFREDWLITDDA
ASFFFEPGVRWSSLTDVRLADANGTSAGNMDVMLVAYDKEGKIIDFGAIQ
IQTAHIDGNVREPFECYMKDPKTNAMMDWTRQPNYPEPDFLSAMRTSVVP
ELLYKGGILHSWNKKMAIAINKSMFETLPPLTRVKKDEADIAWLLYELEA
VNDGEKEAYQLKKSEVVYTAFQPTLLALTAIAPGNVNDFMKFIPELGA
>Cag_1388 hypothetical protein
MLRNNNKRILWTFLNYLFMTVNELLPSVTTLSHVDKIRLVQIMLEQLAND
AVNSAQQKSLSSETFNPRYFFGADHQSKQIIDDYIASSREEWH
>Cag_0153 hypothetical protein
MTVLPQIIANKIDVKRDFPLLSERELDYINEAKGLFDSGFYSYSLLAIWN
AAVNNLKRKVEAYGVELWSSVVKDESGRKKYDKDAETIAERWSNVDDLVL
ITGATRLGLLNPKAGKSLEMINWMRNHASPAHDSDNRVEMEDAVGLILLL
QKNLFEQPFPDPGHSVSAIFEPIKNKTHTPDELSILRDHISSYKNQDIRN
VFGFFMDLLTKGDEPAKTNVTELFPVVWEKANEDLRKTLGVKYHTFVIDP
DSDDSPDKGAKTRVFELLVKLDAVNYIPDGTRARVFRRAAEKLAEAKNTG
YGWRLEESASRNLAQLGISVPSIAFEYVYQEILAVWCGNYWGRSDSYVTL
RPFVDSLNTDQIRLVLKMFRENERVKDELSQSRPNKIAVSLLKEFETKLT
IEAHKQELRETIDIVKDI
>Cag_0124 conserved hypothetical protein
MKFIAIMGHEETRPQVRALFQKYQVHLFSNLSIKGCSCEQKGGEQPTWWP
SNEMPTSYTSLCFAILEDEKAEALMTELEKNPIAIEKDFPARAFLMNVER
TA
>Cag_0710 hypothetical protein
MKKSAIKKALCKLDKETIEKQMEALMELVAEPRYICRKCARVASTKRHLC
KPVAITNSNGSKKRAAKVLNNGVVPPNALT
>Cag_1809 hypothetical protein
MPSIAPIIPFLLLFLWLLQGCASDRAPSGGSADTTPLRLLASTPINGTQN
FKGNQLQLYFSHEVSSRALLRALRTFPDIGQFELTVNGKRADIQLLDTLQ
ANQTYTLLLNRHLNDFRGQLLHAPTTLAFSTGNNVNNGTIRGTVVQYNGT
PASNALLLAFASAEKGATVNLLENKPTQIAQCDASGSFAFNHLPHGSYHV
VAINDRNHDLAWAPSSEEYATPSQPLMATNSANQLLRLSPPLKSPKPLKI
PLEASSAPTNSTIATGSLSGMCTVRGNPPSVIIEAISPSATYYTVAVRKK
AGSYTYHFNQLPVGDYTITASIPTASYQPNQAWQWNAGSVAPFVPSDSFT
FYPETVTIREEWLTERINITFPTILQ
>Cag_1443 conserved hypothetical protein
MVKNKKNEVIGREITLYSDKSEDYISLTDMARYRDTERSDYILQNWMRTR
STIEFMGLWEQFNNPNFNSIEFDGIKNMAGSNSFSLTPKRWIAATNAVGV
VSKTGRYGGTFAHRDIAFEFATWISAEFKFYLIHEFQRIKEQETNRQKLD
WNLQRTLAKINYTIHTDAIKERLIPEKLTAKQTSLVYASEADLLNMALFG
TTAADWHTENPNAKGNLRDDATLEQLVVLSNLESINAVLIRQGLTQSERL
MQLNQIAITQMTSLVKNAHLKKMQ
>Cag_0897 conserved hypothetical protein
MAKKQSFVDKTKKAGASDFKTAKVIFSVRSEKTNAWRFIEKNVRIPNGEN
DQEVISKAIAGFSK
>Cag_0405 conserved hypothetical protein
MKKAAFLVALAALFGGTQANATDWNWKGDVRYRYQSDLASDPAVTGENSR
DRHRTRVRLGVYPWISEELTGGLQFSTAGAGDETTSRNETFGDQFVPDQL
YLNEAFINFHPKAFDSKVNIILGKREVANTMVVLSDLVWDGDLTFEGMTL
QYGKDENGKNKDGWNAMLGYYPLNEINDLKEVKAQDAYLLAGQVAYKGKT
SAVTYHMGAGYYDYTHFDVSNKKVNASQLAAAPYTYTAAKSATYSPEYDY
TGKDFNIIELFGTVGGKLTENTPWTLTLQYAFNTAKQDAKHINIDDDERT
SYLAGVKIGDAKNVGQWAVGADYVRIEKDAMTVLTDSDRNGGTATNLEGM
KLGVTYHMVKNMTVGATYFNFNTIDNDATAVDESATKRHTLMLDTVVKF
>Cag_0854 hypothetical protein
MLCACKLCLMHKEWLVESEMSGKAYNKMSYQPFLFIYSFFFSGNNCKFAN
FTCILILNAFFSVICNGSAAAQA
>Cag_1133 conserved hypothetical protein
MRKKILFICGSMNQTTQMHQISEWLGDYDHFFCPFFSDGLLGVATKLGLL
EFTIMGKKRSSKALEYLHSHHLQVDESGAAHPYDLVVTCTDLIVPKIFQQ
RKMVLVQEGMTEPETLFYHLARNVKWIPRWIAGTSTTGLSDAYQKFCVAS
EGYRQLFIRKGVNPNKIEVTSIPNFDNCAHYLQNDFPYKDYVLVCTSDNR
ETFIYENRRKNIEKYVAMAAGRQLIFKLHPNENVERATREIKQYAPGSLV
FSEGKTEEMIANCAMMIAQFSSTIFVGSALGKEVHCGLPTDELKALTPLQ
NNSAAKNIADVCREVVEQ
>Cag_0969 conserved hypothetical protein
MGINYLLTNNHKSLQNMASYSQSHENLSGVARLFLVVSASVIAVASIAGG
AGTLLQSELLLTLHPYLFFIGFGNLAILILNRYLTAAIYPELTIDPARQR
SYIALVLLALGMITIAVALKLPLLKAATGLLLMAVVSVPLREIFSKLSIP
AIWKEVSVRYYIFDVIFLMVANLGLFTLGLKEAFPDFSIIPFFVTQSSYF
LGSSFPLSISVMGFLYAYAWSRSPKRELAKQLFSLWFYIFVGGVLFFLVV
ILIGHYWSMMLISHFLMFGVMAMLASFAVYLNNFFHSKFHHPALAFLLSG
LSLLFATSGYGIMNIYFMQGITFGTRPPLPFEQMWIYHSHTHAALVGWIS
FSFMGMMYIVIPSILRSGSLETLRSDNALSALLDAESMKRAFAQLTIMVL
AAMAMLLAFFLQQQLILGVAGVLFGVAVAYVMLNMHSSR
>Cag_0574 hypothetical protein
MNTTKNEVATLLQTLSDDVSFDEIHYHLYVLEKVNRGIKRAETEGAISHE
DAKKRLSKWLLD
>Cag_0673 putative glycosyltransferase
MEHYVTLFDSLFLPQGMALHISMERHIKDYTLWILCVDDEAYDVLTKLQL
ANVRLLQLSTLETEELLRVKPTRSKGEYCWTLTPFAPRFVFEADATVHRV
TYLDADLWFRKHPKPIFDEFEASGKHVLITDHAYAPEYDQSATSGQYCVQ
FMTFSRHAGEEVRKWWEERCIEWCYARHEDGKFGDQKYLDDWPDRFANSV
HVLANKEYALAPWNATRFPYSAAIFYHFHGLRIFKKRKKYYVFNGTYFIP
KPTYRYIYKLYLNDLQGSIFLFLKMGAILRNQKNMWFGYNFFTFLKVLYS
KLFRLNVYASLCYVYLKPNLKAVNYNN
>Cag_1686 hypothetical protein
MKELSLLLRQIHPNFVQDGHLSSQAFRPTPKDEQQLSVYDGDMILPLDAW
EHYNNILGLTSCGVMAVNVAECTVLELPVMSDPQPFPEHVLIDFSAYNKR
EIEKKAKLLKAKAEVRGWLYKKAQL
>Cag_1455 conserved hypothetical protein
MSVKPVDLNALRATHGNLYETVVAMSRRARKLHEEERSELEERLLPYKEM
IRNPASEAESDKVFPEQIAISLDFEVRQKASHRAVADYFDGKYDYMVEKP
VEKKIVLPTNDDDEADGH
>Cag_1236 hypothetical protein
MKKIYTTLRQVVKATAFLGMLATTSPVQAQEVTYNTEGWYGTAALSKIIN
TESSGMQANLGSGVIRPGEIDYNGNFVGMLAVGHENSFCRKNNTPIYLRT
EGDYLMGSADRKSATVDQYHTVLDDSVDFRALFANALLGIKDTQHTRWWL
GGGIGYGWVDRPAITGCSSTCSFAAATTDGFAWQLKAVVERTISKDAALF
AEARYVALPGESNSTSQCYDDINVATLGIGFRSYF
>Cag_1926 conserved hypothetical protein
MKRLTIPTALLTVTIAIASPLYAAAPPTTQELIAQAEATRKEAAAIGYEW
RNTAQLIKQANDALTANNEPEAQKLASAALLEAEQAVKQGKWMQANWQTL
IPTL
>Cag_0674 hypothetical protein
MNYMVNTRSFYYYSLLYALIAHFGFLGYGIDVYDAYSIAYGWGIGTFEPI
GWYLSTFRLYSANDIYLGVFFVSLIVSSGLIYASFYFLGNENKFSKIEII
FIIFFMHFVHVTVFSSVNALRQGLAMSFFMFGIVKLLSGSFRKTLFLFLL
SILCHNAVLFIIVPLLTLINVNKKLLQLGIGFIFIVLTPFALNLGVAEKT
QVSTTLNYSIIYFILFFAYSFFYWQSFSNSTKFKFSEVRRRYQFGFFVVM
LMMLFLLNRESHLQRMVMFIMIPLVYEIFVFLPAINPLRNVIVFSFLLLW
VVITLTSSAFSSFREYSTMPL
>Cag_1530 hypothetical protein
MEKIESIKTLYQQSHNNLETLYNVLSQKAFKGELDLSRVALVVEEKQKNL
>Cag_0739 hypothetical protein
MKKIYTKLQKTVTATTIGALLFIAPKVQAQEVTYNTEGWYGTAALSKIID
TESSGMQANLGSGVIRPGEIDYNGNFAGALAVGHENSFCRKNNTPIYLRT
EGEYLMGSADRKSATVDQYHAVLDDSVDFRALFANALLGIEDTQHTRWWL
GGGIGYGWVDRPAITGCSSTCSFAAATTDGFAWQLKAVVERTISKDAALF
AEARYVALPGESNSTSQCYDDINVATLGIGFRSYF
>Cag_1045 hypothetical protein
MRKKALSIALASLLLLGTSLPIAAWLALPRYVEPLLQRALIGKPVQIAIK
DVRPSLHGVAFSSLQATITTPPDECNNYERTIYHVTIKNGTIGWLITDLS
ASHRSPFIPSLLDVKLHLQADTLHLQPTPNTFAFSDSQPEITVNLKLFRN
EKQVLSVVPLDAAYAIHDGTVTREQMRFEGIAYNVAVSSSNKWQQLPDSL
FVARMVNEGKVQPVGNFRAIVGSKGDPLHPCRITLSNCSAEIVDWNASSP
FVHFDRKTKAGDLTLCINDFPLQSLSSIALQAAQQQPKAPSRLAAKAPLP
PMVAGKINATIPLSFRDSTIVIRNASVIAKAGAKVVLYNKQQQPMLFVVA
NKSGMDERIVDKLYVTATLNHAGKTTQSVALQNLSATIFDGSIRSTPLTV
KTDGSSPLDVTVTFDNLKLFDHLILPDNEQSSFQGALSGKLPIRYAKNQL
TIRNASLLASEGTQVKLVTKEQKPLVTIIAGKKGGKETVLDKLNVRARFN
QTPNQTASITLQEFSTTLFGGSVNVTPLTFKTDASSPLVATVTLDKVKLF
EHLILPANLHGSLYGDLSGKVPLTYQNDQLSISNATLRSSGGGSFTLNNA
QQSSNNNLSRSDQQTTYAFSEPALTFSHLANGATTVDFTLNEFRQKSGSN
DFKFGNPKGTIHFAENPREPDVMRLSNFSTNFFGGKIALNEFVYDIKKQE
GETIVQLSNMPLQKLLDLQGTKKVYATGALKGNIPIKLKKGTVEIPDGAL
LAQESGQIIYATSPEERAAAHQSLRTTYEVLSNFLYQQLSTSLTMTPDGQ
STFAIRLKGTNPDMYGARPVELNLNVQQNLLDLMRTLSISSEIEQAISDK
TTQQQKK
>Cag_0659 hypothetical protein
MEQSATKSFDGERRIMYAEKLIVETDLSGMLKKVPKLPPNKQLEAIFLVL
SESSAKVAVVRTPHPDIAGKVIIKGDIINCATSSDWDLPQ
>Cag_1465 hypothetical protein
MLYPISSIPNVLTQERMLKFLLLLVVTFLAIRLVFRLLRNGIFLFKSQNS
VNPYPKSSPFQRGQRVEEADFEVIETQLGESEKRRDVA
>Cag_1285 hypothetical protein
MVFEISFHLFIISKLMKLTNYQNNQISIAMKKLFAFLFLLSSVSFVGCAK
KAEEAPVEEPAAVEAPAAPAAEAPAAEAPAAPAAEAPAAEAPAAPAK
>Cag_1374 hypothetical protein
METTMQTNANNNYTTDSIIREVRCLKEDNAAEYGFDIRMIAAAVQLKQRQ
HPERIVTRILSDVEQKYGKQPLTRLVTENEL
>Cag_1721 conserved hypothetical protein
MKPLPVGIQTFSEIIKQDYLYIDKTSLANELIKRYKYVFLSRPRRFGKSL
FLDTLKNIFEGKQELFKELLIYKQWNWNVTHPVIKISFSGGIRDKESLRD
NLFYILKDNQERLNINCEEKNNQNLCFAELIKKVYQKYQQKVVILIDEYD
KPILDNIENIPEALIVRDGMRDFYSKIKESDEYLRFVFLTGVTKFSKVSL
FSGLNNLEDISLNPDFGNVCGYTQHDVDTIFAPYFEGVDMEEVKRWYNGY
NFLEDKVYNPFDILLFIKNQRMFKNYWFETGTPRFLIELIKKNNYFIPKL
NKLKVNESLVNSFNLENLNLETILFQAGYLTIKRLLPSGMGVGYELGFPN
KEVQISFNDYILQVMTIVSDKEPIRYELFDIINNGDVANLEPIITRLFAS
IAYNNFTNNYIESYEGFYASILYAYFASLGFDIIAEDLTNNGRIDLTLKN
YEKTYLFEFKVSNQEPLEQIKKMKYYEKYDGERYLIGIVFDPKARNVSQF
VWEKV
>Cag_1924 sulfur oxidation protein SoxZ
MRVKATLQNNVVSVKMLLQHVMETGRRKDEAGALVPAHYITEVTATHKGE
TVFHAELGAGVSQNPYLSFQFTGASAGESLTISWVDSKGMSETADSVISA
V
>Cag_0582 hypothetical protein
MKSGSVYQVHDRVRFRVRERKPEAVVMRDGRYFKLIIDGFDEPLICVQIV
EPGRRSSSGATTSNVIHSYIDGDFEGWEGETIFKLDNGQIWQQSSYAYMY
HYAYHPEVMIINDGGTWKMKVEDVDEMIEVTRLK
>Cag_0735 hypothetical protein
MGQFLAIGLVTQIGVLKKELAAAQLTTDQLQERMKAELPYNPELYLLHEH
TDYYSFDLRDEIFYAQLLPLLEEFYPSFYNSPEMYESILAKLRKLPPSEW
FAWAKRKPEEAFQFDPYGMRETIEEGFTDISLHYEAILLTMNGKIVMEAY
GSLFRFLNYTMKQTFKQYSLASALRLYITG
>Cag_1401 conserved hypothetical protein
MKIYTKEELIQSLKIIAAQGWIENARHGNHGGIGNTLEDLLGIAENNLPI
PNAAEWELKAQRLNTSSLITLFHIEPSPRAIKFVSQVLLPNYGWKHQQAG
KKYPENEMSFRQTIHGLSTSDRGFQVNIDRKNQKVVISFDWNCVAEKHHK
WLQSVKNRIGLEQLNPQPYWGFDDLSNKAGTKLLNCFYVQAEVKKEAGKE
FYKFSKVMMLQKFNFDGFLSQIEQGNILVDFDARTGHNHGTKFRMRQNCL
PTLYEKMTIIV
>Cag_1050 hypothetical protein
MTIAELQEQPLAERLMLMEELWETLCNEKHHIQSPAWHQEILEERINLIN
SGEAEYLSIEELYVLPKPREIRKGFPSLNYSRATFLSFPRRRESRIV
>Cag_1207 conserved hypothetical protein
MLARIQSLYLFVVALLAVASMALPIWSFNATPQLIVRDLASAPLDNALYN
LASTAGMVLSPLTAIVAGAAIFLFTNRALQTKLIMLAMLLFAGDLVAALA
AAHMMNEHFVALGNVVVHQPQAGLFILLPEPLLLFLALKGVKTDDKIANA
YKRL
>Cag_1428 conserved hypothetical protein
MRAIIVTLPQKIIPYIPMRRITVLLVFILTTIIAGTSLQAATTPFTGSMD
MALTMPNGRGTVTYLFGKGAQRMDMSVQMENIPSLLRTTVLTQANQPDNA
TIINHQTKSYSQVNLTHAAQSALLMDFNSVYRVTRLGRTTLRGYNCEHLR
LQSASETVELWVTGDLGNFSTFQILQAQNPRLATTQLAAAFRNNNIEGFP
VKMVQEVQKQRYSMELLKLTKKAIAASQFRVPAGYKRVDASEPTLNSEQK
QQLKNLMEKMKQVE
>Cag_1861 hypothetical protein
MGLFIKEFKMDKAILFNELLDAVDHLSLDDQESLIDVVRHRIAECHRQEI
FSLISSARKEYQQSKLSPETPQDIMNSILS
>Cag_1113 hypothetical protein
MKNNTGLWIDHKTAILVNIKGDYTHVQHVESNAESNLKPSGGWKANGSVV
AQAVANEHTADERRKHQYHTYYQKVIALLANSTEIALFGPGEAKIELAKE
IEKNSDMHKKVSIVETCERMTENQLIAKIKSSFSAKS
>Cag_1181 hypothetical protein
MQPFEHIPKIIAVKALDAQHLMVTFEGNIIKRYDCTTLLAMQEFKLLKTY
AFFKAAQVDAGGYGIAWNDGMDVSGYELWKNGVLQ
>Cag_0655 hypothetical protein
MTMLREIIKPTTDFYSVHIPKKYINQEVEILVLPFSYKNRQEIEDNVSCD
VFSKTSGILKPKNIDPLQWQEEIRNDREI
>Cag_1406 hypothetical protein
MEMQENNIRQLRLHFDGLATVEHKLPASLLVQALSKFQRVVHLIAMADEG
REVLQRARITREIERRFPLICEVPQKGGYALPITIGGEADQLFDEQACEN
IAKKTREVIVAIDRSDVKELGNIIPDMFYRRSILEELKAMQPASHSCFFI
DIEDCYNQPILNGSTATEKIKTLLMPPTNETSSSDFGYVTGALIEMKFNE
RRLVMKLLGSNKQLSVTYAEDFEPMLLDNPRELIQIHGNIVWNDDGLPQS
ISDVDEVVAIDETPLDIHVVEFDTIFLQPKKTLQSEVVFDRESALFQASG
PFDIYLCAATRAELEEQLYNELAMLWQEYAKPPSSDLTLDAQELQKELLY
AFEEVIRGI
>Cag_0522 conserved hypothetical protein
MADETKTTGQGGVKGDFATILVGVGTILDNTIEPLSKILVQTLDSLTVVA
KQILEGVNSSLGCKK
>Cag_0577 hypothetical protein
MILLDLGKPYPLDDVFKAAARKHQSHYRATELNVGYSDKYGTKLNEDDAK
KLLNYYDSLNVREELQNRFQKGDEYSFSLKRDGDLLRSEHIPFNLFAPLC
ADTKLAQNLIKNVFGLDCAKNLSIKFEYAPKPKGKYLDDATAFDAFFKFD
DNNGKRIGIGAEVKYTEKSYPIGKKEKKYVHDPKSCYWKVSCKSGAFLEP
SYSPSSALITDELRQIWRNHLLGLAMCQQNELDDFYSITLHPAGNHHFQR
VKPNQGVIPEYQAQLTDSYRSKVFGRTYEEYIAAIDGDSEILKWKQYLHD
RYIVKNTDDQPQ
>Cag_0031 hypothetical protein
MNETIHHQILEQKPSEKINLVTIILESLDKPEPEIQKIWVDESQKRFDAF
KAGKIKLYTY
>Cag_1308 conserved hypothetical protein
MPLSRSYKETIQDRAQHDPEFRVALFDEAINALLEGETNVGKALLRDLVH
TTVGFEGLASELAKSSKSLHRMLAPSGNPSMENLFQIINAVKKHAGISVQ
VASSCIQQNTQQIVA
>Cag_1067 hypothetical protein
MVKKQKYMPEISRFLGIIISMYFDEHNPPHIHVQYNEYRAAMDIYDFNII
AGSLPAKVRGLVAEWMELHSEELLKMWETKEFHRITPLV
>Cag_0279 conserved hypothetical protein
MNNVQLYTEISLLPASLKQEVKDFVDFLKTKSQSKSKITEREFGCAKGLF
TIHDDFDEPLDDFKEYM
>Cag_1177 conserved hypothetical protein
MKRKIYVETSVISYLTARPSKTILGAAHQQLTLTWWEKRSDYDLYVSQAV
WQECAAGHPEAAERRLSVLAELDILVVTEPMITLANTLVEQGIIPTKAIE
DALHIAIATLHHVDFLLTWNCRHIANPIIQEKISLYLEQQGLYLPIICTP
EELIGEKNDD
>Cag_0274 hypothetical protein
MQRTIEDITSELIGLPKNERLEIVRFLLFLDNRSSDNNDTDSVWEHEIAD
RVLAVEDGTAIGIDYEEAMKKINAQFAS
>Cag_1971 conserved hypothetical protein
MNKKTWQGIFLLVLFVIAPNFLQNSIVRAEKKKIILKQAEIVEGGENAKG
SFRRLSGSVELSDGSITLRCNRATEYEASRSIVLEGKVMIADQRAEVYAD
GGTYYPDKEIGDLNGKVRLRTLDGALVAIANTAHLNHAANQITLYGNVVA
WHEAQQVSGNEMVITLRASSGKQEHQVEKVDIRGNAFLAAKDTLSKPIAV
YNQFSARRMTMHFNEASLLQNALLQGQSESLWHLYSEENRPSAIHYSSGN
TMQLAFREGALYTMKVSGHCEGKHYPASFWENKKINLPFFVWREKEYPFP
KKK
>Cag_0984 hypothetical protein
MLCIPASYVFQMPTVLIVSASPLDQDRLRLNAEFRDIRHALQRSRNREDW
VIESNEAVTVDDLRRALLDFRPTIVHFSGHGGGLDGLCFESTEGRTNSAD
AESLAKLFHHFKDDLKCVVLNACYSKVQGDVIRQEIDYVVGMSSAVEDKS
AANFAVAFYDAVFAGTDFRTAFDLGCTALDLNNLPDADVPIFMTGSHLKP
TDLHDSAYIAEIEKVLYSYINTPFSERWRYTTTGELLRAVMEKHYAGNMH
RLVPKVSVISMKQIADEHWVVAVDVVSSLMYMRIKNRSVSVEWEASVGLW
SVPVKTYLALGSREPLLARVNAELDTYYNFEFANEQHRFQSVSLCAVSGP
MLHGYVERGSKVYEELIDILSDGNEHAITLEIEQATDHTDMPLIKRVLSR
TWICSQPQDTKNA
>Cag_1000 conserved hypothetical protein
MGFNPRAREGRDFEPICLARKPLSVSIHAPAKGATLTGSHCVLSGDGFNP
RAREGRDLAFPCSISITCKFQSTRPRRARRSCSKRIWSALSFNPRAREGR
DPIVPRPFRGFSRFNPRAREGRDRLEGL
>Cag_0771 hypothetical protein
MKKRTLFFAALCTVGLTMPLSNAHAEWTLHINSRNEHPPTLVNNATIQPD
SRALTMSSQPPIAVAPPAPVFITPQPVAIAPPPPVVVAAPRPNYQVVVYE
NSYYNRRPDGWYRSYHPQGTWVRVQQRHLPPRFAVAPRAPEPRFAPPHRR
FDDRRGVELRVRY
>Cag_0496 conserved hypothetical protein
MGKRQIIYQADRIRGNQELLNREINLVTREARVWHGRITAISSNDVELKN
SRMGKHRFNIDQIESIYCDITTDY
>Cag_1323 conserved hypothetical protein
MLAHAQQIYDEALTLSPIEKVELIEHLYFSLDSKNSRQELDKLWAEEAED
RLTAYENGEIKTTPASEVFAEINSMRPQ
>Cag_1259 hypothetical protein
MSQKHEAWQIDVALANQAAFRHFKKRHEREYISCFNNLNKIKRLLEEGKK
LSELHYHPSFFRHETDGIFRIGQSGVSGAKESRLYIYPDNQHRIIYILEI
GTKETQQADIAAAQKAIQQIFLR
>Cag_1001 conserved hypothetical protein
MRPFSSSFNPRAREGRDGTHVRNLQGDSCFNPRAREGRDVPLLFIKRIHR
VSIHAPAKGATVSARDFGYTVLVSIHAPAKGATNVGTACRKHRS
>Cag_1269 hypothetical protein
MGATHVTVTIRNPANPEKFWEGLFLVDSGAIDSLVPRDALESIGLKPKAQ
RSYELADGTEIKMDITTGDIEFMGEIVGGTIIFGASDTEPILGVTALESV
GIDIDPRNQQLKRMPSTRLKKLKPIASC
>Cag_1610 hypothetical protein
MKTEMHPFERILSVSTEDIELELRGITEINWADFWLNPRKLRGSDFLMRW
SQGVWSEKRLIDAVNNTGEFYAIAYGPSGTAPTDDVRAFELYFERLEAAG
LGNIKRPDLLVFKISDREFVDDFLSKNGGEEELPFITEDKLQALVQKAII
AVECENSLWVAEKMPDYRTPMKAQRRLGGKMGLAKNAVLPTVIIKEEDRI
PLNKWQEENRIPIHVWHVFFDRAYGLSFDEAQRLVTEGLILPTEQVFQAP
GGATTKKAIYKYYYHYAYPLGVASERPQLIPAFIEDKNGHILPYVKFEGG
SLTIADEAINVLNQL
>Cag_1857 conserved hypothetical protein
MKKQILLFSAIFSGYTFLLLLLYFPLVFQSQVLTAPDSLIPQASSMALDK
LQAESGSYPLWQPWIFSGMPTVEAFSYLSGLYYPNLLFNLFHTDGVLLQL
LHLAFAGAGTFLLLRDLRLSLLASIAGGLIFLCNPFFSAMLVHGHGSQLM
TTAYMPWMLWAAMRFMDRGGVAEAGIFALIAGLQLQRAHVQMAYYSWLMM
LLLVVVLFATRRWVVPQAVQRGGLFVIASVTAIAMAAAIYLPASHYAEAS
VRGAAVGGGGAAWEYATLWSLHPLEAITFLFPGFFGFGGVTYWGFMPFTD
FPHYAGLVVLLLALMGLIMRRREPMTWLFAGVGFLALLLAFGRFFSPIFD
LFYSFAPLFSRFRVPSMALIMLYFALAALAAIGLHELLERKPQRLLKVLR
LSSIVVALLLLIFLALEEVAEHAARSLFPLPQVDSFELVSAINSIRWEQL
SSSVIVTLTLLLLVAGVLWLLLSGKISSKYSASLLVLLAVGDLLWVTVQV
IYPSAHSLRTPLFADKQQVAPAFQHDDVTRFLASQPKPFRIYPAGNFFTE
NKFALFGIESVGGYHPAKLKSYDDLLQVSDNLASIALLRMLNVHYIVSPA
PIEHPTLTLATSGTLQRANGSAQAFVYRLQEPAPRAWFVSRVVPFSNKQE
LYSHLLDDTASLSVAYVEAQQWQGAQRFSEGTIQSVTTQPESIKLNVNAP
NSSFLVLSEIYYPNGWQVMLDGKATSMLRVNGVLRGVNVPAGNHAIHFSY
NRHLFEQSQWIALAGFIIALLMIAGGLLWKHLLLSGEKRVVRGFHTIR
>Cag_0786 hypothetical protein
MGSLTVKDYVMLKMAFSSKQHAFLAGFGSLFDFTGHKLNAKHFGNQLTDR
SALQADWYAISNDIHKASNAVVTEMANTKATRNASK
>Cag_0396 conserved hypothetical protein
MKPDSKVVLINAVVIISFGLLSAWHNSRDTTPSVTPVTQEVSVEPTETSP
SLTPNVPVTTTPEPAAPAEVVPAKPSVSVTPTKPKSSHVLKPRVLIAKKP
RPAASAEVVLAEPSVSVTPAKPTASPVPEPQVSVPAKPEPAAPAVAVPAE
PSVSVTPPTPTTANVSEPQVPAPTTSQPAQ
>Cag_0323 conserved hypothetical protein
MIDSSLTQKLMASLDCDEKEALRLLKNCGAAMVHYILASKKIAIKGLGVL
TVRHIPLKKERQASGVTFVPPSNNLVYERREVGEGDIARLAISALSLSEH
QAIRFSEVLASYFTAAFTAKQEVALPALGAFYADADGLYGFHVAPSFTAL
LNREYCDLADIVVPVGNRWGLWQERFRALRPAFITVGAVGVLFTASLLLY
RWFSEHPLQIVVPSTLSSAKAVKQSLHAVAATASSMLESSTERPVTVLPT
TPSFADSLQLERGAYAVVLATFQTERTAYEQVAVMRQAGIEAFVWPVFME
GSRYSRIMTGMFTTREAAEAHLKMLPEAFIKGAYVQKAKRNVVLYAKKRV
>Cag_0270 conserved hypothetical protein
MPTISMFYGIIIRMYFVPTEHPPPHFHVYYAEHTATVDIRICEVIQGHLP
KKQTKLVLAWAELHQEDLMADWELVMNGEEPFKIQPLQ
>Cag_0686 conserved hypothetical protein
MINNLFLTNQRDNYDSPWKEAIEHYFLEFMAFFFPAAYASIDWSKPYHFL
NEELRAIIPDAEVSNRVVDKLVQVQLLDGMESWLYIHIEVQSFWEVNFPE
RIFVYFYRIYDKYGKAVANFVVLADQHSNWRPTSYTMETIGSKLSLDFSV
VKLLDFEPRLQELLVSDNVFGVITAAHLLTQKTKNKVKQRYEAKKLLMQL
LLQRQWEQERINELIRVIDWLLKLPKELRQKLKAEIHNMEEEQKMKYVTS
FERDAMEEGREMGLVEGMEKGKAEGLEEGLLKGRLEVAERLVASGMSKAE
AALLAGVSVEML
>Cag_1909 conserved hypothetical protein
MVAVQHKQNAQKQSKPFAFWLSVATLSHFALLLALLLYQQFSNRQQEPPP
VVNVMLVSLPGRVGSAAVPAPTLEAPQQVPAEQAKEASVVKGSPSAVPTK
VPVATPPASTTKKVPEAQPVVDRQQQMNQALERLKQKVGKSASPSVTASP
SAAPSPLAPSSSNSLTNALAKLQAKVKASGQATTSAPSTTSPTTSPTAKS
GGNIAAASRTTGSGSGSGSPASYKAEVASIIQNNWAFSNPMLRGEGMEAY
VRIHVLPNGTISQIVFDRRAASEYLNNSIKRALEKSSPLPVIPQEAGGRD
MWIGFLFSPEGIER
>Cag_1690 hypothetical protein
MGRGLVAKFVITILFSQRDVVLRFSLGSSIISCCKSVYKYSALNGNQQFF
ESSLSVPDSGKAIFSRILNDNKINMKTIGAC
>Cag_0993 hypothetical protein
MGISSRDVSKLNKTIQHTYNGGSTMTREEIIKRLDEIFQQKDIRHGVEVR
KLHVELFGVEPVFTGYYYECEKGALEWMIESMLDGKPFVEPELEDGEFT
>Cag_1368 hypothetical protein
MVFQRSMKVFTTFALFAGMMMASSNLHAVTVDDSIHEKACSVVAGERTVT
LSIDPKPVKHMKELTFTVSVTPCDKLPDMLLLDLSMPGMQMGKNQVTLKK
ISSCKWQGNGIIVRCMSGRKLWQATVLSNELNNPAFAFNVRD
>Cag_0718 hypothetical protein
MIKPRSRNVAPVTPSLDDFIRQPEQPAARELEPNASRKFKTVSLPMNEYE
YSQLHATCKKTGRSEKNLLRYAMMLYAKEVLAE
>Cag_1573 hypothetical protein
MNRMTSSQQNPEPNATCPICKASYHCARSSSCWCSTRKVPQQLSDYLADK
YKSCICPDCLDSMIAEANAGKQFC
>Cag_1335 conserved hypothetical protein
MIKPFLQRTLPLLATLPFFATPTAQAATPLHAAKSAHFAPLLADPLEPRV
AVEPFLGEKSLQLDIGTTEELYRNDKGTFAAGVDFATWSLLRRSNNFKFP
VDAIDYLFGVNASWKMPLQNSSLPFDDFNVRARLSHISAHFEDGHYQNGQ
WLQQAEWQGTIPFVYSREFVNVVLALSAPEHRIYTGYQYLYNALPSGINP
HSWQAGVEIATTNTTYVAADMKLLPIWQTKQAETEGFRASWNFQAGMRLK
GKQADKVRLVANYYTGMSRHGMYFYHPENYSTIGAIIDF
>Cag_0690 conserved hypothetical protein
MKLILKEYLSSLRERGELDAIFPDLLSQLGLNVYSRPGRGTRQDGVDVGA
VGRIDGGLEKVYLFSIKPGDLTRKDWDGDSVQSLRPSLNEILDAYIPNRL
PAEHRGKDIVICIGIGGDVQEQVRPQLTGFITKNTTTKITFEEWNGDKIA
SFIQSCFLREDLLPKGARSCLRKSLALLDESESSYRYFAELISSLSAGAD
ELKNSERITAIRQMGICLWILFAWSREAENMESAYLSSELTLLHGWDIIK
RYAEKTGKTAQAVETAFFSIFSAYQQISSEFLLKNVLPHVGKLHGLSSAV
HSSCAFDINLKLFDLLGRLATHGIWAYWITSRFSDEQAEVKKKSLEETLK
LMKSIKELISNNPVLLLPAKDDQAIDIFIATSLLAFNKENYNYINEWFAE
ILGRASFAYQTHGNYPCILNSYTELLSHPKSGDDEYRKTVTSGSVLYPVI
ALWTALLGNEEMYNNVAQFKQAHLSHCNFQFWYPDEYSEAHFWKNSDSHG
AVLSHVPVDRPKEEFLGQVFGECDQSPHYKDLSAIKFGWWPLVIVACRHY
RLPLPLQLLEGLWKT
>Cag_1038 hypothetical protein
MKKTRWLIAGLMGVILGSLPLSAQEGFCMSPKKSDTSIVINSQQSFIELP
DQGFSVSVGSPYDIINYDNRYYIYQDGSWYRSSNYRGTWTVIRDSDLPDR
IRRHRPEDIRRLRDNESRRYENDNRQYRRDENNRK
>Cag_0268 conserved hypothetical protein
MNIYTERLPHRYRQAMFRDEMKIEELVEVMLRVAWCYPKVLSATCTKLGV
TGSIGFFKNLAGWAFGK
>Cag_1318 hypothetical protein
MLQAKVSLSPPLYEFLKNYKDFGFKDKSSMVQNALERLKNELEVLKMQQS
AQWYAELYDQDLETQELTESAITGWPE
>Cag_1227 conserved hypothetical protein
MIRLHVTAEGQKYMEHDQPIKNLLQMVGEQNPELINDGWETAPSKRIINE
IPEYDKVSSGVLVTEKIGLSILRKKCRHFHEWLIRLEQLGETM
>Cag_0880 conserved hypothetical protein
MLHCMKIYLDVCCLNRPFDDQTQDKIHLESEAVLTIIRHLEKKDWEWISS
SVVLYEVQKIPNRDRKQRILRLCDKSSEVILLNKEIYRFAEILNKKGIAS
YDALHLACAHFANVDFFLSTDERLIKKAQKNIDIFNMVIDNPLYWLQTIW
>Cag_0189 conserved hypothetical protein
MKQKALLFFYGAALALLLVLFVVVSQQFLSLFAFLSALHPYVGMGFLALS
GIILLFTLVTALLFFARPSEPSLPDNDVSPAMAAYVRYRVARVPTHPKHP
EGSNAPKDQRWLRTNLKLLDGDAMEITREIATKNFFVGAFAQNTSYGTTT
SLLNNIRMLWRIYTLHYRQHHFREFVALARDVYETLPLSDFRKEELPEHI
KPIIQCSFSNTLASLLPGGNLLTPFFMNLFLSGSTNSYITCLTGIAATRY
VQASTQEERHEVMQQSMFEASFMLKEVVRECNPILSVTISKAVKKAGMDS
LDTMQQPSASSGVAQDIVAHLANSLRTILRDDG
>Cag_0790 hypothetical protein
MREIIMNKKIALALLGLALPLSAQAVEFRTPGTALGIGGAGVARNNGGLT
SYWNPAAGAFKDSPFAVGAGVGAGLKINNGLAENVDNLSKLDFDDITKFN
NSVDDVGNFTKAVTIMDDISKSGGNIGITGQVPIGVSINQFSFGIYGNMS
GYIMPIADITNIVPTANAGGANITVNDLNTSLGANTYTPSGYFTTAQLAA
LSAAITANQTGALPAGAADNLANAIDNQLKESGIPADQALATLTTTALPV
LNAASANTFNQNTTSVLTKAIQYVEIPVSYGHPIKLGKKSTLGVGITGKV
ISGTVYQSQVLLVNNNNVDASDIIEDIDTNKKTSSAFGIDLGLLYKYDKW
LNVGLVAKNINSPEFDAPDYNAPKYDTISGQVLINDLKKGDAVKLKPQVR
AGVSADVLPIVNVSADLDITENETVAPSVVGLTAPKSQNLGGGVEVHPAS
WLKIRGGAYKNLSASKGGTVLTAGFKIFMLDVDGAFATDTMEFDGNEIPQ
EAQVNASLNFSF
>Cag_0134 hypothetical protein
MVERIQNWIKTMLGLTAMNTPAHPNQDLPTWLRWLSSLLGVGLSIGSLVM
LYCPPEKASRELDGKGTVIKVLLESTDVTTPFLSIFLAGVALVVFGINGI
RFAKITAAGVSAEAPDATAAATNYYKAPSEDRPQTEVQVAEKESPDPTDV
PAGYLEAEDGGKYAVYKLNEVPSSVITDALASWPTEDSKPEDLSGFEFAT
RKTGKGNHPWTLKFKGKKAVIVSYGGFAKPGATVSHPE
>Cag_1727 conserved hypothetical protein
MNRFYAGDKIIYRKPKSSFSPGPRARDIYPLAHGEAYHYIVDKYWKVEKV
YADGTLEVVTRTGKTNRLQANDPNIHKAHLLQRLFYKKRFPSSNVAAQS
>Cag_1200 conserved hypothetical protein
MCNSFSFVAYYKYSDISSMTTNNRIKLKSTLACHTQGTVDLASWLEQHGI
SYGLQKHYRKSGWMESVGTGALKRPGEEVTWQGALYTLQTQAKLPLHAGA
LTALALQGFAHYVPLGKQTVYLFSPIKTLLPAWFRNYDWPQLILHEKTSF
LPNETGITDLKLPLFSLCISSPERAILECLYLSPDTLNLVECYQIMEGLT
TLRPQMVQDLLEQCRSIKVKRLFLYMAEKAGHEWYKRLDHTKLDLGKGAR
SVIKGGVYVEKYSLNLPEELVKL
>Cag_1724 conserved hypothetical protein
MQTIYADGVANIALIDGIIRFDLVNITKMEKENVNLRPVAPVAMSVTGLL
RMHDQLSQAINKMVEDGILKKNEQPPVVIDGGQ
>Cag_0446 conserved hypothetical protein
MKKQATITTEELDDKFDAGEDISQYLDWSQSQRPLLDHKRINVDLPQWML
NSLDFEAKRVGVKRQAIVKMWLSERIKAEQVAAGNAVR
>Cag_0610 conserved hypothetical protein
MKKTAKLLSLAVALFAGVSGTAQAEGFKLGADVVSSYVWRGTQVTTSPAI
QPALSYTFKNSDIVVGAWGSYAISEHTGAVANQETDVYVTVPVGPVSVTL
TDYYNQTATSRTFDFSDDSNNIVELSVAYAKDNVSLMGAMNVAGTDTDNA
MYLEAGYKFYEKDGYTAKACLGAGNEAYTSDQDFTLVNTGISVSKDRYTA
SCIYNPDTEASSFVFMASF
>Cag_0714 hypothetical protein
MATETQRAFDSELDSEAAELHLAQLIHEAGEEVRRCWAKRQELHMEKLHA
TVAESQATLNKLLQNDRC
>Cag_1995 hypothetical protein
MQVTIQLLSSAINDLLDARRFYEQQRNGLGAYFFDSIFADIDKLTLYAGC
HPKYFGYYRMLAKKFPYAIYYKMNDTSVAVVWRILDMRRSPYKIKQLLP
>Cag_0884 Fibrobacter succinogenes major paralogous domain
MRYSKSALLASCYVLLLVAIAGCGKKLSPPVNDRDGNSYPVVELASKTWM
AKNLEVEHYRNGDLIPQVQNAEEWAQLTTGAWCYAGNNPEEGKKYGKLYN
WYAVADPRGIAPEGWHVATDAEWQALCEAFGGLDAAGAALKATGEWKNST
PENATNSSGFNALPGGARRDTDGYFMPTGEYSRLWTSTEIAEGSAWAVSL
GYYDAAVRRGKASKKTGFSLRCIKD
>Cag_0152 hypothetical protein
MKQEQQHFLQEKLRECDRHVEKITIAQEHMRSVLPLTPQVYAQLDDVALS
FLDQIVFRYSKLQDTLGDKVFPLLLLATGEEVKRKTFLDILNRLEELELV
DRMTWLQLREARNEVTHDYSSEVGETVDAINAIIVASDTLQKLYSTIRHF
CNHRLQVL
>Cag_0704 hypothetical protein
MFITNLQDTVCRQTQLEQYYAKDVLRDKHFVCRCFDKCRASHAGTYYEGQ
VHYVGSNYDILVGPQPLRVVVVGQEYGHGPALVDSLMRAKMFQDSAHKSR
GFLDRNPHMRGTTTALRILFGIEPGEDKAGEWLETSTGRIHLFDAFSLVN
FLMCSATDGSSKGKATSTMLSNCSKHFVKVLEILQPTVLVCQGKGFFTYL
AESLGVSKQQKEMLFHYRFNGVDGVGVCLNHPSTPRWDSGWAQLTQPYLT
SRVLPLLNDVRCELGLDQVIWNL
>Cag_0992 hypothetical protein
MEISLLQVVIDEHVAVFGVEPVFTGWSAFLSEDEIATNVCAAIDKGEPYV
EEEVPDGVDI
>Cag_1799 hypothetical protein
MKFGFEIVPVERGHGGALYSLRFEAEEKTELDKFLDNEEIQACKEYESLV
ARLYDMVDSLGFRDYFFKLKEGSINDSVAAFHYNHGTLRLYCLRWSSILL
IVGSGGPKTTRTYQDDPLLSDAVGKLQMVDRLFDERQKSREIIIDPNTGI
ITGNLVFTSD
>Cag_1008 CRISPR-associated protein, CT1133
MSWMQRLCETYDNAHNKVGDYNDDAILLPLYHTTMTCNIEVTLNENGEFV
QAKPSEKKKIIIIPCTESSAGRSGDSPKAHPLSDKLQYIAGDFSHYGGEV
TSGFKNDPEEPFRQLYQQLTEWSEASPDKYKLRAVKRYLEKKRLIEDLIE
AGVLHLDEDRKLLKKWDSKGKKKTDKPPIFESITNVNDATVGWHVEKKGE
PTEPLWKDKDIHKAWQAYYESRKINPKLCFISGRSDVAPAEQHPKKILQG
ASNAKLISSNDKKGFTFRGRFTTAEEACTISAVASQKMHNALSWLVERQG
YNKGTLNIVAWAVSGGNIPDPMKETAPYDYDDLGDDYNAAQAFGLAFKKR
IAGYRARISSTDSILVLAFDAATSGRASLTYYRELTGSDFLDRLERWYAR
HEWLYSKPEKKRGFFIQVRSPEQIAKDIYEHNKTEDKEKKTDDIIRSVVQ
RLLPCIIDGQKVPFDLVVAARNRASKPMSFKKYKEDWKNREDWENTLSTA
CALFRGYYYTNFQEEYSMSLDPNRTTRDYLYGRLLAVAESLEKSALGLAD
EGRSSTAERYMQQFAERPFNTWKTIELSLSPYIARLQSNAPGLKKFYTDK
LDEIHCLFNPDDFENNEPLTGEFLLGYHCQRLKNYEGSSKKD
>Cag_0945 conserved hypothetical protein
MKFPHCDYAPSEDGFYSQCASCPIDASLAGCALEECGDGVYQSEGETPHG
GSKALFYFTDNDGNRVAKRQAHHVEVHEMNAENQIIAVLYGMVDPEGIIY
LKKSSASSSNN
>Cag_0427 hypothetical protein
MAFNQNVFVNCPFDKTFYPLLRPLLFTIIYLGLKPRIATERLDSGEARIT
KIVELIEDSKYAIHDLSRIKATKKGEFYRLNMPFELGIDVGCRLFKGGEH
EHKKCLILVAEPYNYQAAISDLSNSDVANHHKETPEDVVIEVRNWLSATC
GLEADGPSRIWDAFNVFMGDNYSALIARGFSKQDIEKLPVQELIQSMERW
VTENV
>Cag_0252 hypothetical protein
MIQCLENGTLALEHLRQSGELPPTWQSMAELEAQVNQLHIRFLEERWRIA
EKRGWGDLAIELLDRAWRTNDREYMDEEHVSEAEKVTVMQALDRQNRLMD
IYNRSANMLLALCREVPNQPQRPIRVLELACGSGGLALALAEMAQRHHLS
LEITASDAVLAYCEEGNAQAKAQQLPVTFRQLDAFHLTDYANEQYDITVM
SQSLHHFTAGQLAVIIAQAMSQTTTAFVGTDAQRSVLLAGGVPLVASLQA
IPAFALDGFISARKFYSEPELALIAESATRRCNYTISRDWPLSVLTVRGG
E
>Cag_0322 hypothetical protein
MGAAWAFPSIPSLIARNITAIFVVPTATPLSSVEHVTAGATITPFKPLAV
YGGTMPYIYEVTSGVLPAGMQLDLQTGMVSGTPESIGRHQTVTITVRDAN
NAVAATRGNLTFVVSAPPVAKAQPSVQPLAKGVAVKPFKPLEAIGGTGAH
VYSVVEGQLPEGLMLDAQTGVISGVPSTTYRQSAIVVGVRDVNNVSANQT
SRVTFTTQSSSLAKRVPVKRELQVIARVVKKAPVKSVQIAKVVRSTEKVV
AANETPKKTSQNLEKSVVVSLVPNNLLLPQWANNVFVMEVTAADLERAKH
MSASVKSVAQEVRQVAEETVQSPVRSTVVKPIRASNDVMTAADEVIASPV
ENEPFKPQSPVVEIHYPSADVDGTPAASDAPSQVYLSSLSSLSLADCSSA
SSSIAYSLN
>Cag_1703 hypothetical protein
MTFAEFKVQLENAATEEAVKAAYATYFKIKYDTSNYHDLYTEQVFFEFKK
EKNFHNIKALATILAQSLYYIRRLKFVEVEKIIPFFICLADKDEACLTEV
RKWSSYYSNDSYDWERPASKPDPKLIDHLVKEPEINNIHIYNVTKKQEHD
AFKKNLENALKPQMVLDFGDKKVINEENFEAVFEHWKNVIGHYIVNGYKP
SFYFLSNIQKERIEIDRENNRIVFHFEDKNSKVQKVLMKDYDYFWSMYDY
VASPDTINGIHAKLDRLTDDSQRRFEGEFYTPLIFAKKAIHYFTKLLGKN
WYKSGKYRIWDMAAGTGNLEYHLPAEAYKYLYMSTLHASEADHLKKVFPE
ATCFQYDYLNDDVEYVFNCKNFLFEDNWKLPQKLRDELADPNITWIVYIN
PPFATAQDAKQKESKTGVSKTKIEKLMDIEKIGHAKRELFAQFMFRIAHE
LPKKSYLGMFSTLKYINAPDSVEYRNHYFNFKYEQGFLFHSKCFHGVTGN
FPIAFLIWNLAEQCHSEIIKIDISDDNAHTIGTKYLRFIDKSIVLNNWFT
RPKNSKTYILPPLSNGITVKNENTDRRHRARPDFLASICSNGNDLQHAKY
VVILSSPNASAGAFTVIKENFEQALVLHAVKKIPKPTWLNDRNQFIIPHT
QPSQEFINDCIVWSLFSHSNETTALRNVHYLGRTYQIKNNFFPFMLEEIK
EWEIKEHDFYVQMLDDTNRFVAEWLVTHQYSNEARAVLEKGKNVYKMYFS
HLHQMITKHWKIDTWDAGWYQMRRCLAEHNIAVDELRELYVANEKLANKI
LPQIEEYGFLDKDEIYEQL
>Cag_1199 conserved hypothetical protein
MIDPMYRQQVDLLLQILPLVAKEKVFALKGGTAINLFVRDMPRLSVDIDL
TYLPLDDRDTAMKGISEALNHIRQKINHAMPGIKAYLVQQSSGQEAKLTC
QSSSAQLKIEVNTIIRGHVFPPRIMDIAKSVEAEFQKFVTMPVVSHAELF
GGKICAALDRQHPRDIFDIHQLFAHEVFTDEIRLGFIAMLISHSRPIHEL
IRPNLLDQRTVFQHQFTGMTFTAFSYDDYESTRKRLVKEIHEHLSDTDKR
FLLSFKSGTPDWELLPMDNLRLMPAVQWKLANIVKLKAQNSAKHKAQLKA
LDNALKDC
>Cag_1633 conserved hypothetical protein
MTVESRKRQFIVEGKIKPSFCEGCGQLTAKIFVGEWMPSDKPKEEDVLAP
LTSKKIKEQKAQAQSPSAAENQYWVRCAECNQIYLLKEWQIQIDREVDIN
QLTPEECQVYSPHGIYAKGAAVYHQALGEVGIVREKQATGSGAFVIIVEF
AKSGRKQLLENVQLSSGNGQSSTELLKLKLRRQA
>Cag_1554 conserved hypothetical protein
MAENNLQAWEKVLEYASVPLHGTMSRKIRKGVKLQINGGDVYEDAVLFIS
DLFLRVTQESDGASINTYYDMKAIASIRTYSTKE
>Cag_0520 conserved hypothetical protein
MAWFLRCLILISMKIFRWNTEKNELLAKDRGITFEEIVEIIESGAKIIEV
DHPNKKKYPNQRILIVDVRGYAYMVPFVKDGNEYFLKTIIPSRKATKKHL
GG
>Cag_1362 hypothetical protein
MYMTSTFMKGIHICRKIYFIIMEINNGHRIAIITGIFSIITAIISGIFLQ
TKENISDKGTTINGNQNAIINGNNNIISNTRENSSKNVIVVATTKNVIEK
NIIGKIKPYITKNYLLTCLGVSQEQEIKADCTYGGKDIVLYFYSFDNLYI
TALISNDIVIGLNFELKNRTTEFPINTIQGKTWILGKISFGDVLYDDDKI
LYTQGGNKFTSTLTCGTSYPQYIGAGPYYYIWNSNDFKNISNKFENEIHR
YDKKHGTEFFDEKKITIDVVKKCRITSLTILGYEYKELSAYLNSPMSQFH
ADIEFSTEVPR
>Cag_0669 sugar transferase
MMLAPVALFVYARPDHTRKTVEALQKNELAKETDLIIFSDAARIPDKESV
VNEVRAYLATISGFRSVTIHHRPYNFGLAKSIIEGVTQVLSEHERIIVLE
DDMVTSPYFFSYMNDALKLFANDDRVISIHGYVYPVKQQLPEAFFLRGAD
CWGWATWRRGWVLFNRDGQVLLDELKQCKLTREFDFNGSYPYTKMLEAQI
KGQNDSWAIRWYASAFLANKLTLYPGRSLVHNIGNDSSGTHCGNDTTHDV
DLSSMPINITNIDVLPSIEVRQVFESFFAKSKGSFLNKLHISFKKAFV
>Cag_0278 hypothetical protein
MEMEKSEKNQYLEQLDAYVHALMQELRIMEQRKAILEPLLFDEDLKSSLN
MKFKDTDGAVAYNHFVPLLAQDLIRDISRLFLDEGKKAGSFTNLCRKISN
KKQLGWLRERYCESQVINPNELASEFHNIWDNVKKGKEKIMHDPNSEKLK
TFRDKYYAHLEMTPMGNEPGPFNIKALGLTYCDIFNFLDTHQNVIYNVAL
MITGTNYDNEEFLGIHRKSANEMWRLLAGE
>Cag_1046 conserved hypothetical protein
MKPLPVGIQTFSKIIEDDYLYIDKTDIAKSIIEKYQYVFLSRPRRFGKSL
FLDTLKNIFLGNKELFQNLHIYNQWNWNITYPVIKISFSGGIRNNESLRK
NLFYILKDNQKRLNITCEENDEPNLCFAELIQQAFEKYQQKVVILIDEYD
KPILDNIENIPEALVIRDGMRDFYTKIKENDEYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPNFGNICGYTQHDVDTVFAPYLEGVAMEKVKRWYNGY
NFLGDNVYNPFDILLFIKNQKTFKNYWFETGTPTFLMKLFAKERYFLPNL
EHLEVGDEILDSFDIEKIQLATLLFQTGYLTIEKRFETFERLRYQLKIPN
QEVRLALSDHFINVYTEQPNELKYAQQNRFYTYLTQVDMLGFQQTLQALF
AGIPWNNFINNSLPEFEGYYASVLYAFFISLNATVIPEDTTNQGQVDLTI
MVENKVYIIEIKRDTVKSYEISQQNIALQQIQRKGYATKYKGQGKTIIQI
GMIFNIYQRNLVQMDWEVVG
>Cag_0118 hypothetical protein
MSFLKEAGGLAGGALLLTPLTPLGVPLLLHGVAGIVVGGAGLFVADAVLK
QVAETTKPQSGDEPEDELVD
>Cag_1624 conserved hypothetical protein
MSGSVVSCAFMPHTTALVRVKPSSGSGYVVSTAKRFPFGLVRIAADRDGS
LLAKIGKELQRWHDDLLALNFTPPLYRSLPAFLPSDATPEEQATYQRLEA
SNFLHQPNSYWCSALESAEPCTASDFQAYFLLYYPAEPLRMVRNALSAHC
ALAVCSTPVEAFCRLTVSTQDVHILLEIEEAHVALAVAHQGKLMRFVCHP
IHRREEREYFALRELLNTPACRDHVVQVSGSHATKNMLELLRRETGLSLR
LPTLPAPHFVTNSTRQLLTEPDMYHALSAAMFSL
>Cag_1819 hypothetical protein
MKVIISRKQVNTNKSNHWFRDIAMLEICDAKYVGDYKIYLVFNNGREGIA
NLEKALFNDIRSVFSQFRDKERFANFKVDHGTVIWSDEFDLASEYLFYLA
FQDNPELQTKFKEWGYVA
>Cag_1373 conserved hypothetical protein
METVFIETTIPSYYVARRPRDIIQAARQELTIEWWDKHSSRYELLSSQIV
IDELARGEEIMAAKRIELLANIPLLLINEPVIKIAEELLRDRVVPQKAAD
DAFHIACAGVHQVDFLLTWNCTHIANPHNRHRIERCFAKHGIIIPIICTP
QEFIGDDYAN
>Cag_0276 conserved hypothetical protein
MFISKRFMKRKVYIETSVISYVTARPSKTILGAAHQQLTLAWWETRSQYD
LVVSELVLRECGAGNPDAAKKRLTVLHDVPLILITEQALKIANSLIEKGI
VLAKAAEDALHIAIATVHGVDYLLT
>Cag_1994 conserved hypothetical protein
MPNTLTLNHLSREEKLQMMDLLWDDLSFNQEALDSPNWHREALQETEARV
NAGAEQLMEWSAVKKILRNECK
>Cag_1904 hypothetical protein
MLNVTEIHPEYVTDMNGVKKSVILSLSDFYALLENLDDLAAIAERKDEPT
MSHQQVVEELVLDSSLRSE
>Cag_0767 possible abortive infection phage resistance protein
MNINASIIDQRITGIVDEHPEWMAESNDRNKKKSVAFVLLSIAMCLDIPL
DEAAELITDGGNDAGVDGLHIGEVEDGEFMVTIFQGKYKVELSGEANFPE
NGVQKAVDTVQVLFDPYRNVALNKKIAPKIEEIRSLIRDAYIPNVRVILC
NNGAKWTRQAENWIDNAKKDYGDKVDFIHFNHDSIVSILQRSKKVDTTVT
LSGNAIIEDMNYMRVLVGRVSVQEIHRLFNEHGDKLLERNIRRYLGLHTN
RVNTAIHQTLCDPQKSDKFYFYNNGITVVCDKFDYNAFQKADYKVQLKNM
QVINGGQTCKTIQETLNSDVSNMIGESAYVMIRIYQLAETHQNFVQEITY
ATNSQNPVDLRDLRSNDDIQKQLEIGISDFGYVYKRQREEGGGGSHVVTS
SIVAESVLAIWRQRPHQAKFRRKEHFGKLYENIFKDLSAAQALLAVLIFR
AVENERKRPTSLTPPDFLPYASHYIAMVVGRTLLQDMNISLANVSHQNFN
EILKKFEANEAAYYAHAVSDVKEALTACYGEREVSLQQLSATFRRGDLLE
MLGAVGLSGDYCFVSQS
>Cag_0048 conserved hypothetical protein
MVLGLLQVYNALSINELAKKSEKLREQIRLNNSMITTQKLTADELQSIHN
IEQEALLLGLEASHEPPIEIERTIEP
>Cag_0662 hypothetical protein
MIFFVSFGGIWASGGGCSMWLAERDELVTMFQRDVVEVKPLRNRRLVLTF
RDGLVATLCLDDIVHHYNGVFLPLLDVAYFNQVAINRDLGTIVWPNGADV
CPDVLYAVASGKPIVCE
>Cag_0133 hypothetical protein
MENLMSKNAIIYGIRKLNVVERLNIITDIWDEIKDSQELEIVSENDKKVL
LDRLANYRANPEFATDWDELKQKIHDRYAD
>Cag_1505 hypothetical protein
MATIPHYTPNDFMHEIEQVPPQYLPQLFQIVHIYKESITKKACLDSFEQS
WQQAIAGNTMPISELWEDIDAE
>Cag_1069 conserved hypothetical protein
MSVTLMIFIYWAIAIAIGFIFFKKDILSFEPKFDGRRIGLLIASLLIIAL
NAWVYSHSTSDGGRSLDPLTLLVFSVGNGIAETFMFYAVFRFGTSLVGRF
TQNAVATFLVGFLCFVVYSGLIHGLFWINILPEHVVQTSPYKPFFMPVQM
LIAGSWALNFFWYRDIRTVIFLHGLVDLTMAWNVKFDMF
>Cag_1147 hypothetical protein
MPKEEKEQQSPIEQPKNPPINLPIEPLIAAPAVLGAPAMLGVPLVIHALA
GAAIGAFTFVTGSLLLKMTKDKALAAKPDPNEVIPPPMSVPNFPRHYSRN
DPHIDSPLLSSLRK
>Cag_0958 conserved hypothetical protein
MLTAYCGLDCEKCEAFLATQENDDAKRITIAQKWSAQYHADIKPEHINCN
GCKSDGVKFFYCTNMCEIRQCCISKGVDNCAKCSDYICDILSNFIKVAPE
AGVVLAKLRAS
>Cag_0261 conserved hypothetical protein
MTRTSQHSERQQATEASTHNIQQLVDEAATSLYKDIMAIVEARLELLKIE
LTQKISLIGAAVVVGVIVIIGATFLLATIALFLGELMGHTFLGFFAVSLI
FFVGFWFFTHYRPTLLQHFIQNLLLSTYDADK
>Cag_1178 hypothetical protein
MMTDEIITEVRVIKDEIAAQYNYSFQDRFIAIKKGEAELAAKGMRLIYPP
NNAVMTPATALQRGKLKTHKIFQKLSASQ
>Cag_2001 hypothetical protein
MNSFKSISRTLFALSVFTATPLFAETATSQPSTQQEPLNISGTLEVEGYG
ERVAGKTTSEFVLATFETRLERQLSKHVFAHATLLFEEDGDNELLVDEAF
ITYKALNSPWYVKAGRFTQPFGWFESGMISDPLTLELGETKHHAALLLGT
ESEQLSAALGMFRGDVQYDNNPSIDSFVAAVATNFTLGKEMRGTLGASFT
NNMGDTDGLQDLFDDGAGNLVGTEKVSGYSFYGSLAQGALTLRAEYVAAA
NSFVDGTLAGLKPSASNVEVSYAVREGVDVTARYGSGSDMDIANQYGAAL
ACTVDGAATLGVEYLHNSMDSAADANRLTVQLAVEF
>Cag_0654 hypothetical protein
MWLAERDELVTMFQRDVVEVKPLRNRRLVLTFCDGLVATLCLDDIVYHYK
GVFLPLLDAAYFNQVAINRDLGTIVWPNGADVCPDMLYAVASGKPIVCE
>Cag_0820 conserved hypothetical protein
MATRKLITFDWAMKRLLRSKANFDILEGFLSELLGEDITILDILESESNQ
ENKEYKYNRLDLKVKNSKGELIIIEVQYEREYDYFQRMLYGASRVITEHQ
KLSEPYSTIPTVISINILYFTLGEGSDYIYYGTTKFVGMHNRDVLHLSKE
QREKYGKREVSDIYPKYYLLQINNFNNVAKTSLDEWIYFLKNAEIPENFN
AKGIKKAKESFDFISMTPEEQEAFLSYQDALRDQASYFETTYEIPFEQGL
KKGIRKGRKEGKEQGLREGKKLGLQEGVLKGKELGLQEGKELGLQEGVLK
GKLEIARKLMAKGMSAEEAAGIAGVNIGLLERND
>Cag_0063 hypothetical protein
MKPLFDFLLKLCILSLVLWGAILYALDPSSVDYYSIFIAWVMMFSNTLVG
YLLFEYAIDKDSVVFNKIVFGGLALRLLALMVLVAIFIVGKLVAVNDFVF
SVFAFYCIYVVVEILGYQKKNKQKKN
>Cag_0256 hypothetical protein
MIDVKPFENCVCQKMNNHLSLCKSELTLSEKGKSVTLRIRSSEEAKVLVL
DGCVFMDNSSRCDGLYLYKKGNKRYALLVELKGACDIPHGFEQLAYVKKN
RQEYRQIVDHFWVEAGGVQPIEKAFLVSNGSLSKPDLETLEKQHNIRVTA
ILHCEATSQIPDLRKYL
>Cag_0120 conserved hypothetical protein
MEVVAKSKAHVVLDACFQESGQFFTDHERLMRCNPFCSNVRYLPAHNIFQ
WIFQVDDPRNNPFMAIFFVRQQEHPFSLDDEYGKNFVEKRIKNRERYNGN
GKRIYWEPVQEPPADVVLPKPAENGRSFVGTASSDICLLHHHDNKTSVYF
DTNITMDFDISFPLNLMPEGVLRFMTEAVMSQVMQQATEAMLCKVQADMG
CASSGLLPTE
>Cag_1500 conserved hypothetical protein
MAKKITENEVDLKKESKKKTSTKSETSKSTATKTKKSKATAEEAALTPEM
REEAIRLAAYYAWEQKGCPDNSHMDDWFEAENTLP
>Cag_0830 conserved hypothetical protein
MVDVVARVHKHRVKLRDEGMRPIQLWVLDTRREGFAEECHRQSALLANDS
HEDEMMLFLSEVADTEGWTA
>Cag_0966 conserved hypothetical protein
MKLTRYIHRVVLLAVLGLQLAGCSGNEASRLRKDDIRFAAFYTDYLLLSG
VEPATKDEQLVALSPAMVDQLLEKHHLTRQSMSSLVANYRRNPEQWQVVL
EQVRGNLRNKAREANGE
>Cag_0272 hypothetical protein
MHDNPEKSLESVTKLALQFLAEAIGQCPAGLEQSTNQDVVFAVVGFQYGA
VQSAAYVAGLGIEAWNSMAGEVIGRLNGIEKEKVAQFLSVMPMLARKKYP
PISIGGQAIMRFYNATSEEEKLTAAASLREILRQIDEGN
>Cag_1810 Bacteriochlorophyll 4-vinyl reductase
MSEPSKIGPNSIIQTVTALEENYGKSKAETILRKIGQGYLIGNLPKEMVE
EIKFHTLVGALNKEIGSTATANILKESGERTARYLMRVRIPAPFQKLVKL
LPPRLAFRMLLFAISKNAWTFAGSGEFRYSMSTPPEISVKVTFPSQPVVG
NFYLGTFTALLKEMVNPKTSIKADIQKAGSDIQCTYRCEI
>Cag_1432 possible virulence-associated protein
MRLPKEFRVSTKDLFIRQDEVSGDIILSQRPHSWNGLFELDKLEKSPIDF
MNNNDRNLALHNRDPFNGYAE
>Cag_0567 conserved hypothetical protein
MDKFDLFFDQLGDIQQMVNEKKRLQAEVAQMEQECAAKMQPLKDELNAIY
RQLEARIPKFMNQGGSTPRTSSRIPRGKLGESIKNLLRSNPEKAFKPREI
AEALDIKGTAVSLWLNKSGQEDPELKRIPTGPEGKRFVYTVN
>Cag_0994 hypothetical protein
MDDLLIDLVINEHIAVFGVEPVFTGRSAFLSQEEIIANIDAAIRKGEPYV
EDDVPDDVDI
>Cag_1098 hypothetical protein
MTRNVVHIYTREIDFNLELDIEFLGIRMLTGTAKELLWENNHTEKNDSPH
LAIVIQNQLELFESVTDGMAYSRPRLLIAFGVLSYFTQQIFTPFETYASS
SYVGKFDKECKNRFIFKETELIEDYIQFETIIKYHKDKEFIYSLLDRWRK
GLYMETESEDNMIYDDETLISYFHILELLTTKYEDKQKKELKDKIKDFSK
SIFEESFLFEGNQLKSEINAKSKIIEGLLLPDLSVSSKIFYIFKEQGILT
YRLKSFITNFVKDRNSVAHGRQVYQDRVIFPVPPFFPLIKNRDYPEEFYR
ILTAKAIANFIGVNLYSQEWDEMSQSIIPSFDELREFQREKKYLELTNQD
FCDGKENDITPFVVSHYLISKKLKAEEALEILTPFINNYNKTEDETMMSI
WAIIIILDLLEEGELKEKCIEIIKIAEKNNWHPNYFKMRDEMYKLEYFGF
EINGLKDLIRKKEIR
>Cag_1076 conserved hypothetical protein
MSTITRPMTTMILKQRFPFRFGTSSYIIPADIIPNVEYLKDKVDDIELVL
FESDEFSNLPSAEDIQTLKQLAEEWALTYCVHLPLDVYLGHTDRAERERS
VGKCLRIVELTRTLPTSGYVVHFEAGNGVDINGFNDADQQQFTDSLRDSL
AMLLAGANVPAAHFCVENLNYPYELVWAIVQEFGLSVTLDVGHLEYYGFP
TADYLKRYLSKAKVLHVHGTVDGKDHNSLCYMKPATLAILMQALAASPNP
QRVFTMEIFSEEDFLSSCKVMEGYVFLPPT
>Cag_1531 Preprotein translocase SecG subunit
MHVFIVVLALLAAILLIGVVLLQNPKSGSGLTGGISSLGTVQTLGVRRTG
DFLSKTTAILAAVVMGLCFLAQFTLPNKESDRKEASSVLQKSAPLKPLQP
APSTPNVPVAPAATPAK
>Cag_0997 conserved hypothetical protein
MFQSTRPRRARQRGRRKIRLKKVSIHAPAKGATFHAYLYVSVPMFQSTRP
RRARLSSMRYAYELGLVSIHAPAKGATAQKLRLFYVERVSIHAPAKGATC
FVIISNHLILPFQSTRPRRARLIDIEEEGKQ
>Cag_0819 hypothetical protein
MTAIDIKKTLAIEVEKLSVDALQEVLDFVQFLKIKQWRNREQVSFSQQRI
ADDLHAFDINSVVHLEEEFADYKKEFPYE
>Cag_1049 conserved hypothetical protein
MTIAELQEQPLAERLMLMEELWETLCNEKHHIQSPAWHQEILEERINLIN
SGEAEYLSIEELKKY
>Cag_1145 hypothetical protein
MLKSLLIQRALPLSISYILLIIAGIALDYLLHVAHLVWIGRYFGIVGTLF
LALSFGYSARKQKLIKNGALKFFLKFHCYSGWVGTLMILVHSGIHFNALL
PWIATALMMVVTASGHVGQYLVKKLKEEMKQKMKQLGITTSVDNEFEQQH
FWDSLTVKALDQWRGLHMPLVSFLLALTTIHILAILFFWNWR
>Cag_1278 hypothetical protein
MNQIDWDQLDRQMQQFSSLFITEVKIPKEKTNKIASIIADDINKIPAKGK
KEIVNSISNPIPIQDRLNELTAFQGWMDIAHDFKNPYISRAQVIVQNYIC
FVYLGEACFKTLKQHLKPESVAKKCCNFLTNNPVRAFRNAVAHSNWKYKD
DFSGIIFYARKGHQASDSIIEWQVEDKSLAFWQALSRCTAYTAFLCLK
>Cag_0839 conserved hypothetical protein
MGMMNHTESEKSSLTAYNQELALVVIPLLKGVIYQEENPSLWEVLRNELA
GVRDYVAVLGLELILDEAEGYAFLRSRSEGETEGANNAPRLMARRQLSYP
VSLLLALLRKKLAEFDAGAGDTRLILSRDEVVELIRIFLPPASNEVKLID
QVDATLNKIADLGFIRRLRGERQMIEVRRIIKAFIDAQWLAEFDERLNEY
LQRPVNAMERENE
>Cag_1569 hypothetical protein
MNPRVKSVLVLNDYKLSLVFTNGERGIYDCSEFIEFGVFKEFKKYGYFQL
AKVEHGTVIWPHEQDICPDTLYLDSEKVSAEG
>Cag_0430 conserved hypothetical protein
MRNLRNLGFCYGQNMMWLHRRKHPRLAQLLRSALPIQLNESSKIVILSDL
HMGDGSRFDEFRTNAELVYTMLHNYYHPRHFSLVLNGDIEELLKFPLYAI
ETQWNEFYPLFRSFQHNGFFWKTWGNHDAPLLDEKEYQLSDYLLESLKFH
YKEESLLLFHGHQASVFMWETFPMVSHLAVLILRYLAKPTGIKNFSVAHN
SRRRFAVEKAIYEFSNREKIVSIIGHTHRPLFESLSKLDFLNYKIEDLCR
HYPTTIGEERSTVRQQMQTMKKELEACYQQGKKIGLRSGIYHTLTIPSLF
NSGCTIGKRGITALEIEGNTIRLVGWYNSKEQPRFATNHNQTPHQLASTN
YYRTILNEEPLDYIFSRLHLLA
>Cag_1976 hypothetical protein
MKKWQQFLQDGSGVFSSTRLAFLLWVVGTLVVWIFGCIEVINAHATLDMA
GKIKAIQFPVIPENVMLIIGALMTGKVWQSFSENSADKSTTSLTVQQTSS
AQQGTSENVASETVAK
>Cag_1880 hypothetical protein
MELSYEWLLTSITTFFTEQFYLLPEEYQVGLNIFFLIVAFLAASGLLLIA
LKKLWSFIRTVLTQRKVESGYRLQFPGSYSWGNALMRLPLVLIGTDIFCF
AQLSQKGKLLRYIKSASVLIPTLWSVFGLVVFMASTGRAYQTHELYLAAL
PVLLVGGTIFFLDLSIISAGGTIKAKSIRFFLAMVTGYIFSSVPLNYYFN
ADISSYMLKHDDQIAAVESSYGKRIAAIEQSGWYQQYYSLLRQEEALRQD
LMDERKGIEGAGLSGKPNALNPTLNVHYNAIESELQQLQQTRADYVRQYQ
PKLDELQQLITLKSKKVVEIAKNNEGSHIKRHHALWEYALSSTSTALFFL
AAMILFWAIDSLSILCSYIDESEYNFLCQERNEEMREFFGSRTVRQRFNP
VQTSELR
>Cag_1708 hypothetical protein
MQRIIRIFFASPGDLEEERHLTKEILRQMSERSRYTFEFYGFERALATTA
CRPQDVINNFVDECDVFIAVFHRRWGQPPQDTVVYSSYTEEEFERAKRRF
VSTGAPEIFCFFKQVDLPSIADPGEQLRKVLAFKRRLEESHQVLYRTFAT
AAQFVADIEQHLFAFAEGKLPTPRSPKHRFHIPIIEDQQPDSQRSYDLTK
VHQALNAATSGCVEEAVILMAGVSQTTRDIELLDVIKEFFINTNNLDAAQ
AVVEKKLTLLQDRRLAAHAYVAVLMSEHWLNDLVASMSKTVSPEKQSVAE
HTTRKLFTGIRFHELMIEYLSKYFTVGELLSLTRFYQGEGASITAKFGRT
IGIMIPEINAILMAENPELFEG
>Cag_0663 hypothetical protein
MVGEIYLAQIYFTDLSEYKIRPVLIVKELGDDCMCLQLTSQLNYDGILIT
NNDLFDGYLKKDSMILMPKNFTLHKSILKKYLARIKLDLIERIMNQLCKA
LGCV
>Cag_1431 conserved hypothetical protein
MGCITALTLSSALHAASSSPQKIGILWWNVENLFDTQDDPAKRDEDFTPN
GKLQWSEKKLYLKQMRIRDLLGALAADKQMGSLPDIIGFAEVENKTVFEQ
TLQGVKTGSYKSVYYNSRDPRGIDVALAYNSATLKLQHSKAYSVPLKHPT
RPVVVASFMVGRHPLHLLLNHWPSRAFDAELSEPNRIAAATIARHIVDSL
LTANPKADVVVMGDMNDEATNRSLANTLGSSMDGVQVKAAKGKLLYNCWS
GYNGIGSYYYRSKWQKIDHMLLTHGMLDRTGFYVTKEAFRCIDYPALLKS
SGKGTWSTYEKRVYKGGYADHLPLYLKVSVE
>Cag_0891 metacaspase
MAQRALLVGINDYAPIGPGGPDLRGCVNDVQDMANTLSVLGIIPASPVNM
RILTDGRATKAAILDGLQWLTAGASPGDTLVFHYAGHGSQVLDISDDEPD
GKDETICPHDFATAGMILDDDLAAILGTVPTGVNFDVIIDACHSGTGARE
LSALTALSDDEAVAYRFIEPPIDWGFFLDSAPSLPVRGILKRNTTRGKAK
ATAAKNEDQGVGQLNHILWAGCQSNQTSAEATVNGQKRGLFTATFCKILR
SANGNITRKNLEVQVSRNIRAMGYSQIPQLEGASTHLKKKAFT
>Cag_1129 hypothetical protein
MSNQEIVEQARTLQPMDRLWIIEQLLQSLDEPDATIAEIWAEEAEKRLEA
YRNGTLEAIPMENIFHD
>Cag_1687 hypothetical protein
MNTPIEFMKPRLVGDRFSGHAIPFEMLKNLSVLEELVIEAAKWKYLKAHP
DRQRVPRGFTEGVSLQLTEVRDGSAIPIIVLTFMTTTPLFPEVGTHVTYF
EQGRDAIFATVSAAEEQVQNPNSLPPHLLGYFDQLGRALRDDEALELDPT
NQLKPARLTKETRRRILLQSDKIQELTEEVTLRGTVPEMDQEKSSFEFQV
IAGSRIKAPLEPQYFDVILNAFTAYRDNRKIVIRGIGRYDRNEKLLGLTL
VEHVSLLDELDPGARLNEFKSLKNGWLDGKGIAPTHKQLEWLVDAFERHY
PDELRLPYLYPTADGGVQAEWSLGGWEISLEINLDTQQGEWQALQVCDEH
EEFYVLNLNQPDAWQWLSAEIMKKSGVEA
>Cag_1112 conserved hypothetical protein
MTTVSTLYGTLSGVEFRSLFSNGKVDGCLVTEPNTLSTPYGALMPQYEAE
DMGRRSVKPLYFYKDGALKSIALQTQTMLTTPIGTIPAELVSFYKNGTIK
RIFPLDGKLSGFWGWKNEFALAIDITFSSPLGLLTAKVIGFQFYESGALK
SITLWPGETLKLPTPVGTISVRKGVAFYESGALRSCEPARKIEVTTPIGT
ITAYDNEPNGIHGDINSLQFYENGSLEALSTIDQSVEVTCSNNCQELFEP
GVKRNVCGDERRISVPMPIRFSKTWVMFNNSPTASFNLQECSFLVQKAEL
KTEAPSYSCAG
>Cag_1062 hypothetical protein
MMQNYGMYKIYQWEAIPYSSHNDKWNCDQIDLINRPIESVYFHRNQDLSI
SMIAYIKANALNPKATYQLGELHSNFESIKIHHTSGAQGTATITHIPKNT
TKFDNNGNPINESIINTLELEIKYNNLPISYTIEWITNLKSEGVWHWPNV
VNTKLQGKFNMDFSSKSHTISIEKDNFLEFNNSLSCLQFSFEDELIFIGE
TKTDEIEKKYHPGFILYQGNPSQEKMERIRLALSFLLGRFLPSLGYTAFD
EKWDIVSYKCITPYDLDGSVYNSSTMPPIKLKTINFFINTNIVTRSVNAF
YENFLKFDLKYISYLYWNAINSPAHIKASQFGATLEAIQRNYRQVHANDF
QTALFKHEEWKIIQKALLDSLDKVLNSNSDNKEIDEKKIIKNKIYSLNQT
PQSKLTNRFFDLIDISLSEIEENAFKQRNYSAHGIKTTQEDFSVIKNNYI
LMTLIHRILIKILNISQNYIDYYAIGHPSRNIKEPIGG
>Cag_0646 conserved hypothetical protein F56H9.1
MKKKIISIGTIAATCVASPIFAVSNFSDNTASQGAVVAAQAATNALTVLN
FNFTPPVVAPPIVPVVPVITTLPPVQLPGGFTVTTSVQPPVNGVSTSTAV
TTNNINNTTVSTTVTTTAPTPSGGTTTTAITTDVNGATTTTTTVRDATGN
VLSTETN
>Cag_0678 conserved hypothetical protein
MLAKITRKNQLTLPKSIVTSLPKTDYFQVEVVSGRIMLTPVRMQQADAVR
AKLDVLGINDQDILDAIEWAREG
>Cag_0393 hydroxyneurosporene synthase CrtC
MNITTRPEEELWHNVSSAGAYEWWYFDAVDVESGISFVVIWFCGFPFSPS
YATHYEQWKRGAINHPPHPSDYSAFSFQCYEQGQELINFIKESDRSAFSS
NPSSVGVTFEQCSFTYQPDNDSYQLSIAFDFPARNRSVKASFCFAVQQRV
ALEQQDGNNGGKVPRHQWLLGAPYARVSGDMRLMNSGGQHLRTITVQGAH
GYHDHNLGELPMQEYMKRWYWGRAFSSRYYLVYYLIFYRNTAYAPRFFVL
LHDVELGSTTHYQPATLREEQLHHGLFAPLHSRQLFLSGNGASLTIEHHH
PLDAGPFYLRFPATISLKQEGQAVITLQGISEFLNPQRLNSHFFRFFTRS
RILRSNQPSLMYNVYNRFKQIVG
>Cag_1383 hypothetical protein
MRLSPIAIQQIKEQVNRFFGQQAVIWLFGSRLDDNKRGGDIDLYVQTEQF
NLLDELRCKVALQEQLDIPIDLIVRSKNDTSVITTHAIKNGVPL
>Cag_1543 conserved hypothetical protein
MQDSDGQNIRVELSLLEKNIEQLVLQLTDCRKENEALRSELASLQNILRS
CKLPGSGSAQSPTDGSMSEGALSGSEKMQFKQRLVLLLQKIEMELRNSQP
L
>Cag_1877 conserved hypothetical protein
MEYQPSGMRALDSVERAKLGMKVFNLPFDEAEGVIDDYVSGGNYDPASVE
LFKDQLDTQRHIQEKAYELFDTGAQILRLVVGAVLKNMPSPLDDDKSTSR
E
>Cag_0177 conserved hypothetical protein
MYWNLELARYIADAPWPVTKDELISYANRTGAPQQVIDNLEDLPDSDEMY
ESLDEVWPDYPTDEDFGYGDEDPLN
>Cag_1180 hypothetical protein
MRIINIHSPDIKQYQEFYTMPIIARFYGIIIKMFFIQSEHQPPHFHAIYG
EYNAIFAIESLEMIEGDLPKRAYAFIAEWAQEHQQELLDMWNTQEFKQLP
GLE
>Cag_1037 conserved hypothetical protein
MVKNHKEVLAATHQKLIEFNELNKLHGPELAWEKMLEGLPEKQKKRMATF
LAEPTLFKAFTRAIPFFESAGMEMEIVDLSNKGSDAVLEIQKYCPYLQIC
KEYNIETPCHIICDIEIESTRQAFPEMKGEILARQAFGSCVCLFKYERPA
K
>Cag_0062 conserved hypothetical protein
MFFGLQAMNFKKVTMSEPQNERFPEYFGRSVRAMSDYIGIGLQIAVSFAL
FVLGGYWVDARFGTSPLLLFVGVLLGMVGMVLVLMKVIRQANAKK
>Cag_1303 hypothetical protein
MKRVLILDTSILCVWLEVPNMKQCGADNDRWDKPRIDAKINAELQNQQTT
LVLPLASIIETGNHIAKAPHSNYERAGALAELIRKSADAQTPWAAFSEQS
TLWSQEQLKALANSWPILAAQKLSLGDVTIKDVAEFYANSGYSVEILTGD
NGLKAYEPIVPIEKPRRR
>Cag_0184 conserved hypothetical protein
MEQQKLRELLQALHQELEQLQSVDESTTAVLTTLRNDTQRLLSNKEEPME
EEEGSLSERMQQALEHFEEKHPSLSISIQHVLDSLARMGL
>Cag_0218 conserved hypothetical protein
MATVSVYVSGQTEQNDVIEFFQKGMIGADEHPIAFFEGVFYESHQERVGN
IAFQDYLVYTNKAIYLWARGASKDYLDRFNLGAVSINSRNKDRDFATLNL
KVRREDKEPIYVIFDMVELREAELITRLHTLVETIIEDRLGLNYRQQIPD
EIAVYILHSAKSLCPPQSITFSAGEPNAPQQDSQIGYGQDLLEQYKASLG
YPSPEPSPTQAQSRATAAAPEGFSPADALKGLEHLLPTDPAAIKKIAESL
KEVIGDAPFKLRDQLKNDLQHVPGMLSAVTELLTSIADNPQAERFVLNLV
KTAVKNDGVLGSVSKLMKLSSTFGGDNNSKRRSSSSQQASGRSEQGSASS
KRRNESFDDDMPKRKSIHIKQEDDEVILPDCFSGLDLPFEESAPPTPAKK
REAEEISGTKISPRKPIVIKADEDAIPSIVKTMSASDTPLPNANNSNDKL
>Cag_1117 conserved hypothetical protein
MTIKELVPLLQTAIGPMILVSGLGLLLLSMTNRLGRIIDRSRTLLGCIEA
SAEPQVHRINREVAILWQRAHYIRLSILLACVACFGASMLILLLFLSALL
MLEVSLVLATIFVLTMLCLSCSLLFFFLEVNMTLSALKIEMEHYDKKHQL
MESMEWGR
>Cag_1375 conserved hypothetical protein
MAIEIRHISNSDKKARKEFIKFAWQIYRNNPELNRNWVPPVIEDYMKTLD
TTIFPLYDHADLAMFTAWQDGKMVGTIAAIENRRHNQVHNDKVGFWGFFE
CVNNQQVANALFGAAAAWLRKKGLNAMRGPVSPSMNDQCGMLVRGYDSPP
VFLMLYNPPYYNDLVRNYGHRIGQELLAWYIDQTLIDIERLRRIAAHVMK
REELTVRILDMKHFDRDIEIVRNIYNKAWEKNWGFVPMTDKEFDMLAKSL
KPIANPHYVYFVEDRNKRTVGFSLSLPDVNQALKHVNGNPFSPIGLLKYL
WYSRNITMVRTIVMGVLPEYRNKGIDSIMNVQIADYGGQHGVFASEMSWV
LKANEAMSKLAQVIGGKPYKEYIIYEADI
>Cag_1080 conserved hypothetical protein
MKPLPVGIQTFSKIIEDDYLYIDKTDIAKNMIEKYQYVFLSRPRRFGKSL
FLDTLQNIFEGKQELFKNLLIYNQWNWSRTYPVIKISFSGGIRDKESLHK
NLFYILKDNQERLNITCEEKNDPNQCFAELIKKTFQTYQKSVVILIDEYD
KPILDNIENIAEALIIRDGIRDFYTKIKESDQYLRFVFLTGVSKFSKVSL
FSGLNNLEDISLNPDFGNICGYTQHDVDTSFAPYLEGVDMEAVKRWYNGY
NFLGDKVYNPFDILLFIKNHKMFKNYWFETGTPKFLIDLIKKNQYFVPEF
NGLKADESLINSFDIEKLTLETLLFQTGYLTIKQLLLSDVGVSYELGFPN
KEIQISFNNYILQSITQNSQKESIRHELLAIVKAGDIGNLEQIIKRLFAS
IAYNNFTNNYIESYEGFYASVLYAYFASLGFDIIAEDITNKGRIDLTLRS
LDKTYIFEFKVIAEEPLEQIKKMKYYEKYNGERYLIGIVFDPKERNVSQF
AWEKI
>Cag_1770 conserved hypothetical protein
MNVSLAIDFNQLKSLIAQCGIEEKTQIVQMLEKDTFPLRFNALLEKVKTD
QLTLHDITTEIETVRQQRYSAKR
>Cag_0587 conserved hypothetical protein
MVKSIIYLEGGGDSKELRSRCREGFRKLLERNGFKDKMPRFVACGGRNTA
FSDFKVAHEQKIYTFVALWVDSEEPLEDIHKTWEHVQKRDGWEKPHNSID
EQLLFMTTCMETLIAADRETLQQVFKPLQESALPSLYNLEKQPRHELYQK
LKKATQGCAAPYEKGKISFEVLGKLNAETLQQHLPSVARTWHILQQKL
>Cag_1485 conserved hypothetical protein
MQVTYIVLDDHNPLHRELSIYRTGIIQRICMDDAAYKTYGSLEVDGHNYA
ACFHYGLVESLNRLPFLSESGSGLESGEEALLHRSRLAEFLCIVKEALAT
LDNTHRETILVGWQQEPVAIAYLRALDAERFATFLISLLHFVEESELQQY
DLEFLW
>Cag_0545 hypothetical protein
MSTINIQLPNSLHIKMQEVARQNGVSLDQFIATAIAEKLAALMTVNYLRE
RTERSSQEDFERALSEIPDVAPEEFDKL
>Cag_0838 conserved hypothetical protein
MPFDYSTLNLLRQNHPAWRLLCAQHAPLVAGFLHRVFIVPNVRILSQADL
VEALEDELFALRQQLGADQFPHTAQSYLNEWAENDKGWLRKFYPDGTDEP
HFDLTASTEKALAWLESLTERAFVGTESRLLTLFELLRQMSTGSQTDPEV
RIAELQKRRDDIDAEIERIRAGEIELLDDTALKDRFQQFLQLARELLTDF
REVEHNFRTLDRRVRERIALWEGAKGALLEQIMGERDAIADSDQGKSFRA
FWDFLMSQSRQEELSLLLEEVLALPPILSMRPDNRLRRVHYDWLEAGEHT
QRTVARLSEQLRRFLDDKAWLENRRIMDILHNIETQALDLRDDFPSGGFM
PLNAASATIELPFERTLYRPPFKPLLAGVALDEGDAEIDTAALYAQVIID
KAELLRNIRFELQMRNQVTLAEVVERHPLRNGLAELVAYLQLAGEWQQST
VDEAVEEQVQWQSATGITRAATLPRIILLK
>Cag_1241 conserved hypothetical protein
MKEIPFGLQTFSDLRQQNFIYVDKTAEIYNLTRVKSYIFLSRPRRFGKSL
LIDTIKELFEGNKALFDGLYIADKWNWTTTYPVIKIDFAAGTIHSIDAFE
KRVKDMFITAQEKLAIKCRVDTDLAGCFADLIRKAHEKYRQTVVVLIDEY
DKPILDNIEDTAIALQIREGLKNIYSVLKAEDAHLRFVMLTGVSKFSKVS
LFSGLNNLNDITLHPAYATICGYRQIDLETSFAEHLQGVDWEKLKRWYNG
YSFLGEAVYNPFDILNFIEKQHTYRSYWFETGTPTFLMKLFAKECYFLPN
LENIEVGDEILDSFDVERIQLTTLLFQTGYLTLKQRIESFGRIRYLLKMP
NQEVRLALSDHFINVYTAQQSVQKYAQQERFYNYLMQIDMLGLQQALQAL
FAGIPWKNFTNNDLPQFEGYYASVLYAFFCSLNATVIAEDITNQGQVDLT
IIFDSIIYIIEIKRDTSENYQVSSENVALQQLQQKRYFEKYQRQGKEVIQ
VGMIFNTVQRNLVQLDWAR
>Cag_0933 DNA polymerase III, gamma/tau subunit
MGAWQHKLANFATQPHFKPPTAPVPTQADASHASTVTTPIVAMPSTITLE
ALKVEWQQFLEHLTHHGHTVLATHLQSCELASCSATGLVELACCRKFSCE
EVQQERDMLQQEMVRFYQQPLQLRIRYDAAKDACTKEKSRFTLFQELSQQ
NEVIRFIVQEFGGELMY
>Cag_1342 hypothetical protein
MNTIDPIVDEIRMYRKEHAALYGYNLHTIVEVLRKKEQESKRIFLNPGPK
PLGIETSSTSVATMPHV
>Cag_1556 conserved hypothetical protein
MESTVRPLGTVMQVLEELGHKVTYAYDDLVFTEHNDFLLQFTNHAPELSL
FFNTSCPRQQAEKVEQQLIPAADRVGLSVITKGRYSVTGNEDEENLKIEF
FNN
>Cag_1536 hypothetical protein
MLRGVELVQFLWIHEFFVMSASYNESVMPSQGINGLAMLTPIIGFPIFFH
ALSGMVVAGIGVTAYNNVVAPLAGKLVEFTQDTLPQLLPPLTPSILSIIP
AQEVPVVIPITIKAKETSLLEA
>Cag_1047 hypothetical protein
MNCPYYNRPPTSSLPFPTAVETTHALSLQSIKKFPTLTLYNLHAFFHVNL
LTIMATNSSSPLVRRELFIFDASVSNLSTLSSALSANSSYFVLDSTRDGL
VQIADLLAGQTDIDSLHIFSHGSAGSLQLGNSSLSLVNLNNYELPLSVIG
SSLSSSGDILLYGCNVGAGDEGLAFVDKLAKMTGADVAASDDLTGATALG
GDCELEVESGVIDEASFYYAPEYAGLLGAVGPEFHVDTSDIQVWSYEPSV
AALANGGFVVTWISETLETLSSDTHTDIHGQLYNSEGAMVGSEFQVNTYT
QYGQYTPSITALADGGFVVTWISETLETLSSDTHTDIHGQLYNSEGAMVG
SEFQVNTYTQYGQYTPSITALADGGFVIIWRCVNNDDYNCNYIHGQRYNA
DGIMVGSEFQVNTYTQIGAYEPSVAALADGGFVVTWESGIVTTWKSGYQD
TSNSDIYGQIFNVDGAMVGSEFRINTYTKGFQGCPSVTSLTN
>Cag_0304 conserved hypothetical protein
MNALVEDIKKLPVVERIELVEEIWNSIPQCSLELSAEECTELHRRYAAHQ
AHPSTAITWEEVRSKMLSTSQR
>Cag_1865 hypothetical protein
MASYRTKLDSTYFSDAAHALRWQKTLAFLRESEVVGTNCSLGLDLGDRTP
LTTALEELFACTFHNSTIDLDVGSLFGSYNVVTAFEVLEHLYNPLHLLLQ
VRNVLRGNDARLFVSMPLWKPHILASPDHFHEMTRRAALSLFERAGFAVV
RRAEFRIREPLFYVTGIKPLLRAWYEKIQIYELAMQVETVPCNEAALVF
>Cag_0866 hypothetical protein
MSANLLIIGDTKARALYDVDADALYLPISGRHLESAKALSLPLVMVDDEG
IFLAPSHWKRLLPESSSTIDTIERGLLKMGRDARQEL
>Cag_1508 hypothetical protein
MGHIYLQNNEIQDAVSAWVTAYTLARKIGYAQVLDALENLAPQLGLPGGL
EGWEMLARQMGGEE