Gene list
Applied filters:
COG category: Unclassified
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS
Number of genes found: 360
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Chlorobium chlorochromatii CaD3, CaD3 >Cag_0253 hypothetical protein MKDTVLFQQALCLPAPWFVKSSAFDIEQKRLTIQLDFQKGSTFSCPTCGQ HDLKAYDTAEKQWRHLTSNDVVNGISILCTIPSPLINLAKNFAERCNSCK LFLLRSSVVETDDCLTSNTDCEILLSSEHRHNREKLRDCLLLYAIEPTNH FPHNAIPRFYAKPRNICHALYCSVLKAHYGKRNNTPYSPPLTVNTLKGQS REMV >Cag_0367 conserved hypothetical protein MSWGVEDPQKRKDIRELMDIASKIVQENPNHVNEVKKSHNNCWMEQIYLI QRCDFCDLAPDCPTREEKEWQEYIKANNIMVIKDSFPTNPQ >Cag_1066 hypothetical protein MNYIAVSEATYLDGYRISLTFNTGESGEVDLGDLIHRYAIAEPLRNPQNF ARFYLDSWPTLAWECGFDVAPESLYARATGKLFPLPQPSNSPL >Cag_0305 conserved hypothetical protein MEIIFRPEAEAELFEAQAWYESRSQGLGIEFAQAVTVAVESVLRMPFAYP RQFAT >Cag_1558 conserved hypothetical protein MASTQPPSNRRDDYDSPWKEAIELYFPEFMAWYFPKAYAAIDWSQPYHFL DQELRSILPEAENGKRIVDKLVQVHLLDGNESCLYIQIEVQGNRETDFPR RIFICNYRIFDKYGMPVASFVILTDTDSSWRPTTYSYEFADSKMTLEFDM VKLLDFEPRMEELLASDNAFALVTAAHLLTQKTRENSLERLDAKTQLIRL LYNKQWTKERVKELFRVIDWFMELPKELEQQLQTEIYNIEEEQKMKYISS IERYAMEKGWSEGKELGVLEGMEKGKAEGLEEGLMQGRLDVARRLVASGM SADEAAGIAGVDVYLL >Cag_0684 conserved hypothetical protein MDNLLLKKSMLMPPVQRVALAELLLASLDYEAEDIREEWINEVQARMNAV SEGRSKLLDFDLLWQ >Cag_1195 killer suppression protein HigA, putative MKLRYSTRKLEKSVESFSVIKKNYGEWAKKVVLRLEQLSQAPNLAAMRTV PSAHCHELKADKISELAVDISPNHRILFQLAHNPIPLKDDGGLDWREVTC IIITAIGEDYH >Cag_1284 hypothetical protein MVQNERYFLFCVREYNHDLKRRVTMMQGVKQFGKCQGNATKNGQVRRVGV PQGVVAAQAGQGGFALNQAVNGGHQTLRGGCNGQGRSQGQGMGQGQGQRC RCTAQNSVSL >Cag_0576 conserved hypothetical protein MKTITVQLPDEVHKGMTMVAATQHISISKLYEYISQNMLRGYAAEVRFRE RVAQGSRKKGLAVLDKLDECYGE >Cag_1353 hypothetical protein MKSQITEQFQQIGLDDLKIHPELLTDILSNKQDLSLILEKRGDIIRYAYL RTYDSDSRKILEEAKAEYCLKKEDGYSREEAFQDFEDVQHDIATQLASRV R >Cag_0918 cytochrome c family protein MMRKKVISATVALLATVASFAHAEEDARLIKARALADQLPPKLGAVLKAE IAKSGPEGAISVCRDEAPKIAKELSKESGTRIRRVSLQQRSCKAKPDKWE KAVLEEFDRRAAAGEPLTTLEKGEQVGSKYRYMKALPVQDRCLNCHGSLE TMKPAVKAALEQHYPKDKATGYREGQIRGAISVRL >Cag_0519 hypothetical protein MLTVSYLTMQSLSIFLSSTCYDLKSLREHLHSEIAKLGHDPILSDYPSFP VSPDLSTVENCKKVVRDRADVFVLVVGGKRGSLDLETERSVVNSEYREAR AAGLDCIVFVDRQVWDLRHLFQKNPQADFSPTVDFPEVFGFIEEIENDTK WIFQFHRTDDILTILLQQLSIRFRDLLLRHRTNRLLVPSEFAYERSDIAR IVQDKDSLWEYRLTSALLADRMERLESKFNDLDSGYIVKRTKFLPARDTL NYIQDLISDFTNVIKASVKVLEHQLTPAFGPPGVAGDAIQIRRACDNLFS LFLALYEWEMDIRFVRPHELFENLFLRMHGWTSELLQDFRRIPKELDELL AVPNLTGEHNIQLVINAPAGLALLTVEFERMSHDPDVVAALAGG >Cag_1937 hypothetical protein MQRKAWFVVLVSLLIGLCANKGFAESPTQPSTLVATRFSTNEIAFTTGYG YSLRRKAFEEHNFSIYPFAVRYGWNLNRPLGLSGASSALYATVEPFVNMI VGKEQGREVGCGVGVRYRRAVSQHANFFAEGSVAPMELTINTPEQGAAGF NFLDQFGVGLQHEVGQRTHLFVGYRFRHISHAGLIDRSNGGINSHGVMIG ISLIQ >Cag_0645 hypothetical protein MKLLKQTSIAALLGMMALPSLTASAAPYSSTLYMPNSHGKSVTTPTAWGA SGNVGFVGLGGTYQSPYTDDADGAAVFGVGLGDSKENLGVQIALISLDIS EWEEYSSAFHVFKELGDADAIGIGVENVMLTDGGDSEKSFYVVYSRGVQN DWALNKNSNQTKFHYSIGAGTSRFGDKSPADIADGKGKHGTYVFGNVAYE IAEEFNVIADWNGVNFNIGASKSFIINNKIPVGVSVGLADLTTNSGDGVR LVAGAGFGFKL >Cag_0402 2-vinyl bacteriochlorophyllide hydratase MPRYTPEQLARRNASKWTTVQAILAPIQFLMFIAGLTVTYLYKEGIWIDD FTWITIFVTLKTFMLVLIFVTGGFFELEVFGQFAFVHEFFWEDFGSAIAM IVHIGYFVLFFMGLDESTLIWTALLAYLSYLINAAQFVIRLLLEKHNEKK LKQQNAL >Cag_1405 conserved hypothetical protein MAFKIRDIEVALEKKGFKRVESDHSYFIYFTIENKKSRVRTKTSHGHKGQ ALSDNLFSVMAKQCKLNKNQFSELIQCPLSRNDYEKILDMQGMVK >Cag_1121 hypothetical protein MANELSHQHIGLFEKIRQTDENGNEFWSARDLSKVLEYSEFRHFLPVIER GKEACINSGQQIADHFEDILEMITTDKTEHREIEGIKLSRYACYLIVQNA DHGKEVVALGQTYFANLSNIQLLNKSRISKFIYTIEGQQIILDRDLAMLY QTDTRTLKQAVKRNIERFPSDFMFELSEQQIETMVSQLVIPSKSYFGGAK PFAFTEQGVAMLSAVLRTSVAVEISLQIIRAFVEMRKMINNNALILQRID RIEIKQIETEQKFEQLFQALEQKNSKPQQGVFYKDSIFDAHSFVCDLIRQ AQTSVILIDNYVDDTILTILSKRKNGVRATIYTSKKDKQLELDIKKYNSQ YPEIMVIEFKEAHDRFLIIYEKELYHFGASLKDLGKKWFAFSRMDSFVNE VLAKLKNNGNNE >Cag_1776 hypothetical protein MTEVEEIVHRVQKLSKDDFAHFKQLVQDIDNDYWDQQIATDFRQGKFEQL IKKARQEFAEGKARAL >Cag_0867 hypothetical protein MIESNLDFYRPIVEQIVERWAVGKPPLPTTGKPSGYYRLTNYLLNYLVEH DAFPTGIHQMPEGLDAQQQVEPSFPVDFNVVIGETRLPKISVNKGEKL >Cag_0724 conserved hypothetical protein MQQIPFGIQTFKKIRQNNLVYIDKTADIANLVAQHNAVFLSRPRRFGKSL LIDTIQDLFEGNKTLFEGLYIADKWNWTTTYPVIKIDFAAGVLHSVDDLK SRIRKILFDNKQRLQITCEFLDDRDLAGCFADLISKAHEKYQQPVVVLVD EYDKPILDNIENVDIAIQMREGLKNIYSVLKAQDAHLRFVMLTGVSKFSK VSLFSGLNNLDDITLDATYATICGYRQVDLETSFAEHLEEVDWERLKLWY NGYNFLGESVYNPFDILNFIKKQHTYRSYWFETGTPTFLMKLFAKERYFL PNLENVEVGDEILDSFDVEDIQLETLLFQTGYLTIKQRVEMFGNLRYQLK IPNQEVRVALNNHFINVYTAQASVQKYAQQKRFYTYLMAVDMLGLQQALQ ALFAGIPWNNFTNNDLPQFEGYYASVLYAFFCSLNAMVIAEDTTNQGQVD LTIIFDTLIYIIEIKRDTSETYQVSPENVALQQLLQKRYFEKYQGQGKEI VQVGMIFNTVQRNLVQLDWAKP >Cag_1287 hypothetical protein MLQTISGCGIASNATSLYMSVKNTALLFELIAPLEHYGSMNVPYETHSIP IAIKSNAQIVEVFLKNTIPLIEKQQEARRSSAKILRN >Cag_0680 hypothetical protein MVEVRIHFFVTICLIIRDHCLRRNDKLSMLFSVINHNYEMKENMMLQTLE AEILPNGHIHFLENFSTNRKVKAYVTILSQQPLNKPKADWHHFVGALKES SLFQGDPVEIQKTMRDEWA >Cag_0013 conserved hypothetical protein MKPIYYFMAATFSILLSIYVFIFGTSTNHEMLGIFIGLWAPTIICVGIFN TLIGILDEMCCAHKRIEEGQSCSHNRH >Cag_1643 conserved hypothetical protein MNYSFKTLWNRAFYFISPLWCLLVWIIWSTDQLQDPADKIVFISIVIPGF FAVYVSGFLIEKWHNNKQQKLK >Cag_1104 conserved hypothetical protein MKNNINYTDEPVGELVVVKDFLPSPDQLILKEDNVKITIALKKSSIDFFK NEAKKHHTSYQKMIRELVDWYAVNNAKNA >Cag_1860 hypothetical protein MHEERSKLDEYSLKHGPTIGKMYEGLASDVLGRAIPESLGLQLQHGIIHD GKGAMSGEIDCMLVKGEGEKIPYTDSYKWHVKDVIAVIEVKKTLYSADLK DAFGHLRGVADNYGSYVQSGEGSEKFDINPAKKAFSETTGLIPPDHSKVD SLGIEMEMLYHTFVMEHISPIRIVLGYHGFKKESSLREALVSFINENQMT QGFGVGSFPQIIVCGNYSLIKMNGYPYSAPMDSDFWNFYASSNANPILLI LELIWTRLSYQIKVENLWGEDLSIENFTLFLSAKIKKVGDLTGWEYKYTP ISEDLLKERAPEDEWSPTIVSSTQFVIFNRLCNEAEESIGDKSLREYVEK EGENFDHFIESLTSTGLIALKDDKLQLTTYECQCVILPDGRFAVADNNSG RLTRWVNKQIEKNKA >Cag_0032 hypothetical protein MRLLPVIQNYLKTIVELKTMKQITAADALGLSIPERIQLVEDIWDTIAMK SDELEFTVEEKRIIDKRLNAYHRNPEVGSPWEDVYKRILSKQ >Cag_1202 conserved hypothetical protein MTPDEIDDQKKIEFYAASVSAWYESSLEHDKSLLTLSAGGIGLLITLLTT VGLGTAEALVLYVGAIISFVISLVSVLFVFRGNKKHIEDILSGKNQGTDP VLSKLDGTAIWSFGIGVVFTAVIGISAAIHSFTSKENTMANETTKTTQAV PLHESFNGAANLQSGTDLGQSFNGAGNLQPQQTTQPATPSTTPANSGNSQ NQSDKGK >Cag_1272 hypothetical protein MSTTTLSSRRSKKSWHHANPIQKAHHVVREISIAPIIEEQINLSVADQTL LISTLLNPPAANEALQKAFAHHQRLVQKY >Cag_0991 hypothetical protein MRLMTSDQLPTTQQPDDYDSPWKEAIEHYFPEFMAFYFPNAYTAIDWSTP YHFLDQELRTIVPQSAQGKRVVDKLVKVQLLDGKERWLYIHIEVQGRREV NFPKRVFICNYRIFDQYGVPVASFVILTDTDYNWRPTSYSYEFAGCKHTL EFPIVKLLDYEPRMEELLASDNAFGLITAAHLLTQKTSDNAFHRLDAKKQ LILLLYEREWERDRVKELFRVLDWFLELPKELNQQLQTEIQQIEEGQKMK YISTFERYAMEEGIEKGKELGVLEGMERGKVEGKLEGLEEGLMKGRLEVA QRLVAGGMSKAEAASFAGVSVDLL >Cag_1568 hypothetical protein MATYSSFEQLNFYRSMQLTITLPDILPDEISRVIKKVKEIFSQEGIAAEI TPEPLSTDAWDSLNFDEIAVDTGRVDFAENHDHYLYGIAKRS >Cag_0736 hypothetical protein MIRDLFFNIAPAISTLFKLGFEPKEECFYELTIDQYEQLRKDGEDITETL YMILPEESKYMDNDIIVVNEQEKNSLLKAKKVIENYCEKGGKVFNSYQDK LTYVSNLLPSVFTEDSNFRKCHLKLVEPNQSK >Cag_0881 conserved hypothetical protein MNRIKAYYDEAYPPVPSKRVLYWRKNLPWQIIRFFVLNFKIMRIVVGGHS >Cag_1884 hypothetical protein MAQQQRKLTIMVSSTVYGIEELLDRIYTLLTAYGYEVWMSHKGTLPVHSG LTAFDNCLRAVDESDLFLGIITTSYGSGQNPADNKSRSITHQEILKAIEL NKPRWLLAHDHVVFARTLLTNLGFKGKSGRQSLKLQKNTIFGDLRILDLY EEATIDHESPDDVPLAERRGNWVQKFRTHEEGSLFVNAQFGRYQEVEAFL QENFERGFSLLKKGGNA >Cag_0661 hypothetical protein MQAVKALYKEGNIEFRANSKEEFMALGLGSFFDTDEDNNVDWEAMFVGAT LLEKTQEYFHRETVLL >Cag_0492 hypothetical protein MPSRPSKRFVLNWLKALLLSPGMLFPATYLLLYFSTTLYLNTALKSEVIK TLMPMGHISVRTVTTDMTLERITLHNVTLQPSAIGESKKPQHIRQLTIAC PKLIPQLFTKQGRTQTICQVEQALSPIAQ >Cag_0734 hypothetical protein MKLTFNDYTLETVYLHISDAQRLEVMDLWQTENAITSAAERERRSYEVVV MVRHVSGAVVGVSTVTVRTASNGKRYYYYRMFIHSSHRVPYLMRAVTNAS RDFLATFRHPDGEVESFVVVTENPKLMRQGMRQLFERHGYTYKGKTSQGL DCWEYSFLQ >Cag_0300 hypothetical protein MRKLFLLVLLPLVFLLPVNKAFGANLVILEPTDGASVTWRPLVKGSVSGA KHVWVVVRSVENARFYVQPKAAVRKNGSWKTSVFIGKQADTANNRDFEIM AVGNPTEKLSDGMELEDWPSGVVTSNIVRVVRTKMN >Cag_1201 hypothetical protein MSFDATSIEYAFAKLIGNTTGARSTSHDADVFRGGNPKNLAKALSDAADA LEEKVKSVPFAAADTEPGGAKARIDVAISRLHKIAESMSKSATVSREDYH WEIIGCLVSTIADLLEKAKC >Cag_0311 hypothetical protein MLTKDFYSIIDTELDALIIKYKDDKLIKKHKSAINNQKSYVLLIWFLDFY GRISNYSNFITDGDNDSSCDIVFDSLNNQGNKVFYVVQSKWNNADNSKKE TKKDDILKALNDFETILRGEKKNVNEKLKSKLEELDLHLKANGEVKFIFL SLSQYKGDADENIEAFRSNDVKTKFEVIDISRIRVDYIDRKYKKIDPINP LENYQNPEESPINIEIVQKGGSVKIEKPFEAYMFLLRPKAIWELFKTYGF ALFYKNVRNPLLQSQFNVDIERTALENPSYFWYYNNGITAITYLLPEIGK KAEKVPLTGLQIINGAQTVYSIYRAYESASPTKRIQMDSEALVTLRLLKS GGKDFDLNVTRYTNSQNPVQDRDFCANDDIQVALQNASYQTNVWYEKRRD EFRETPTGVKKVPNFIFANVYLAYHLQDPVSVLKNHTQRFKTHKDLNFIS HKDHKDGLYEKIFNSNTSFEDMLCAFYIFDTIDDYTPFSYEETFKTNLYH LLALFKVAFTKYIIAKKAMYGKGKNGEKEINVNKQIIEIYEKDEKEIILK TFKFINQFVEKQIEVADNEEKTTDRMFKFLFTLSHYQKIYDALEDTEISV KDIDDIVLQDNDDIVEGDKDTEVTSEQE >Cag_1795 hypothetical protein MAELPEASSSNKTGERMAELPESSTPSKPSRRARFFEEDNGSLSSMRLMS FVALIAAILFGALTLTFDTSENNGTGLYITLMFLVSAFAPKAVQKFAEQK LNER >Cag_0445 hypothetical protein MSIYTSPETSPSPIAILCSEYVQSVEAMAESLPFIMSTLIEAEDTFDKKL DAFIDFHAIDVEKLEDGRRYGLKLEDKSAHDRLHRRIRVFREALGVTPRS FLVALVSAYDAFLGRLIRSLFYARPELLNSSERVLTFAQLQDLQTLDAAR EYLVEKEVESVLRKSHSQQFEWLEKTFSVPLRKGLECWPRFIELTERRNL FVHADGIVSSQYLNVCGEHGVTHSEVLTSGTRLHVERSYFQLSAHCLMEI GIKLAHVLWRKLVPTDREKADENLIEIVYDLLIKQKYRLAADLASFGTNT IKSHGSDQTRRILVVNLAIAHKFGGDAEKCTAVLDAEDWSATADDFRLAI AALRDNFDEAAKLMKSIGKDNRLGMFEYREWPVFRDFRKSYQFAAAYKEV FGEEFVLKAQEPKASESPQTTAKGENVLDEE >Cag_1043 conserved hypothetical protein MKKRSALTLLLLPIAASALLAGCSPTVKIEASDKPITINMNIKIDHEIRV KVDKELDSLLNQKSALF >Cag_1501 hypothetical protein MTFFNLLFFDTRHHNSTMLKKSFSTMWSSLSLLFAGLWLVVRIILEYFGI ISDGNDRTTGIKDLREEYKKANYR >Cag_1944 polysulfide reductase, subunit C, putative MIEKALKGSPTYWLWILLLLALMGFGYSSYATQYAHGLAVAGLGGSAIWG LKVMQFATMATFAASSVLVVALAYLQASKPATSLLVTSQFIGVSAASTAL VSLVADCGKPELLFELVRSASLSSPYFLSILSLKLYIITSLVSSWATLGA EAKGVPAASWVKSVALLSLPFGLLTPLFVALMIADAVAILQLLAASIAGG MSLLLLVVALLKQRATFSVDANAPKMVATLSLYGGVAHLLLTAVAWFRAS SDNSGMVAAAFVAALLAVVLFALSLKKGNTTMPALAMVISVVLAVAGTGS FVTYAPSGVELSIVAGVFSCGLLVATLLLKTATAIRREA >Cag_0296 hypothetical protein MIFQAKSIASWFIIVLMLLQFVPLPRRNPNERQPLRAPVAVVTVLRNSCY QCHSHETRWEFPLGTVAPFAWWMSQRVEHGRRALNFSTTASLSVYEKERV TVMLRNPKEHQPLYYVLHSNVLPDSVGVATVQLWATHR >Cag_0755 conserved hypothetical protein MKKLPVGIQTFSKVVEDDYLYIDKTDIARSIIEKYQYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKDLLIYKQWNWDVTYPVIKISFSGGIRDKESLQK NLFYILNDNQERLTITCKEKSDPNQCFAELIKKTFQKYQKSVVILIDEYD KPILDNIENIAEALIIRDGMRDFYTRIKESDQYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPDFGNICGYTQKDVDTSFAPYLKGVDMEEVKRWYNGY NFLGDKVYNPFDILLFIKNKCVFDNYWFETGTPKFLVDLIKKKNYFIPDM LTLRVNKSVVNSFDIENINLETILFQTGYLTIKQVLPLGMGVGYELGFPN KEVQISFNDYILQIMTIVADKEPIRYELFDIINNGNVASLEPIIRRLFAS IAYNNFTNNYIESYEGFYASILYAYFASLGFDIIAEDVSNKGRIDLTLKN QDKIYLFEFKVSNQEPLEQIKKMKYYEKYNGERYLIGIVFDPKERNVSQF AWEKI >Cag_0841 hypothetical protein MGMPVRIDDTLYGQARAQAKAEHRTIAGQIEFWAMIGRAALDNPDLPIDF VRDLLIARREGEAHSTPFVPEGHRS >Cag_1820 hypothetical protein MGEAYLQEIYINKLSEGLSRLIACELFLDRYYSSDSIFEAEAAILQIRKA MECVAYAAVAPNKSKYAEFRSQADKAIDFTKDFHAGTILKMLSKINPDFY PKPVSAPLNVSLGKWHFDRRNDKSLSQKQFESFYDRLGKLLHADNPWGNE KGLRNLLADIPSTIESIRLLLSLHFTVIRTSEFNGVWIVESPNNGQQPRV IVGQAIGEFAVEE >Cag_1736 hypothetical protein MNERVFLKILLTLLSSGAIAEVMVVLLLGWHGEALLFVFFMSCFAVAIAL VLHKLYGTAEASGAIESVSARRVRAMQSEELRSRLGAYSVDDEFLAGTPL RAKSTSSTYEKSDVEAMIRKFAPHVGGLSRLLQMVQERDEASFAAVAKQA GLANVERQTVIDYIHIMLNAEKECESNTKSGEATSPLTEFTMERESFDSY IQRCMSGEGDDGDLSDELSVGLENPLTPKSVGIPPSDFSHSPTSIMESLK KRAGRVP >Cag_0811 conserved hypothetical protein MELAQILEGNWLFRVEHRGITIHSSTIQDSPIQGFKGEVELQVSLKRLLS AFYDMENYKRWVHQLAELTVIDKPDPTEYIIRQVINTPWPLQQREVIMRS RLEGVGENGVALSMQSEPDYLPLHAQCHRVRHAQGMWVFTPNGHGVVQVM FIMHLDPGPDVPPPVSNAGLFEVPFYTLKNLKALLDDAKYQPMWPEELEH YLAIVEEDNLDTL >Cag_1574 conserved hypothetical protein MNRYLYDGTADGLLSAISWILEEEQEPEQVVLAEREDTLFEEGIFLNTDV ARSEALFSRFRKQLPDVAQTLYFFMLAESNGMATNLLHYMALALQYGDRV NGYLTHPAVKAIVHLSRKASRELHRMKGLLRFEQLCDGAYLAQMSPDHNI LHPLSHHFRHRLKAEHWFIVDRKRHTAAHWHQGSLEFGTIEQFNVPALSE QEQKVQTMWRTFFATIAIQERKNPALQRSNMPMKYWKYLTEKQ >Cag_0670 conserved hypothetical protein MSSIKTLVKDWVPPIAIRLLQSFRRKGVIFEGDYVTWEEASAQCSGYDAK NILDKVLDATLKVKRGEAVYERDSVLFDKIEYVWPVLTALMWIAAQSKGK LDVLDFGGALGSSYFQNRTFLADLKQVRWSVVEQSHYVDAGKTHIADERL QFYKTIEHAVSAGSPNIVLLSSVLQYLPNPLIILKQLTSLNADCLIIDRT PFIKDKDLSVIKIQHVPSSIYEASYPCWFFNMDKFLEYIESLGYRKIEIF QSLDRLSNEAIWQGIIFKKKNEL >Cag_0267 hypothetical protein MNTLSLTLPESLHKSAREIAEKENITINQLITSALAEKISALGAEDYLEM RARHASKAKFLNAMAKVAKIEPPDYDRL >Cag_1385 hypothetical protein MVIFPISLSFPRKRESNSLILLGFRRGNETTIIILLSGFQKKSQKTPQQE IDKAERLKKEYFDGKNKQ >Cag_1644 2-vinyl bacteriochlorophyllide hydratase MPRYNPEQLAKRNASVWTDIQIILAPIQFFFFIGGLTVTILYANDLFPGG FYWVSLAILFKTLFFALLFITGMYFEKEIFNQWVYSKEFLWEDVGSTVAA FFHLLYFVLAFAGYPRDILILDAFLAYFTYVLNALQYLVRIILEKLNERK LRMDGQI >Cag_0780 conserved hypothetical protein MPLLSVGNILVEPDVLQARFACNLQECRGACCIEGELGAPLQPEEAAQLD HLPEELFRLLPEKGLRYLRRHGAVELYQGVHYTRTVKSRECVFTVVRDGI TLCAIEIAFREGLLPFDKPISCRLFPIRVRKKFGLDYLVYEQHAMCRSAR EAGREQGVRLIDYLTPSLTARFGEALVQELQRFHDSSSNNSHG >Cag_1343 conserved hypothetical protein MKPIVYLESSVISYLTSRPHRDVVIAGRQAITQEWWEYQRHQFELRISIL VEEEISRGDAEAAAQRLASIADIPSLTLSDDAVMIAHLLLAKCAIPKGSE DDALHIGIAAAQGVDFLLTWNFKHINNAVTRGYVTHIVEACGYNCPQLCS PEELMGRYYEYD >Cag_1003 hypothetical protein MVLPESRPSRARGLKPRIGNLFTDGMESRPSRARGLKLGYADRAAIEVFE VAPFAGAWIETAAEGLKKKPVQQVAPFAGAWIETPEDVDDYEKAASRPSR ARGLKHVRCQSQSSALCRALRGRVD >Cag_0698 conserved hypothetical protein MKTVSVSEACSTLSTLLKEVELGDEIGISFEHQQHTIAVLVPIAKYKKIK DRKLGSLAGKVKVEFSNDWQITDEELFNL >Cag_0377 conserved hypothetical protein MMNKLFNLYCDESTHLQNDGMPYMMIAYIRSPYNEIEQHKEYLKFLKAKH KFKGETKWTSVSAGQYLYYADLIDYFFSTDLCFRSIIVDKSQINENCPEF SYDDFYFKMYYQLIHHKVDLGYHYNIYIDIKDTRSNKKLAKLHEILKLNT SIKNCQFMRSHESSLMQLTDLIMGAINYKLRGYNRVIAKNKIIEKIEQHS KVPITRSTPKHADKFNLFFIDLK >Cag_0583 conserved hypothetical protein MLAQAQQIYDEALTLSPIEKVELIEHLYFSLDSKNSRQEVDKLWAEEAED RLTAYENGEIKTTPASEVFAEINSMRPQ >Cag_1322 conserved hypothetical protein MIIEFLDPAKVDLLEAVKYYNNEQENLGFEFSEEVERSLSRVVNFPHAWC KLSVRTRRCQTNRFPYGVIYQKRGNLILIISIMHLHQEPNSWRKRIPKKE Q >Cag_1191 hypothetical protein MMDCRGNPCGCPVVVFSIMNYACLLIKIELLNQFYFYSPNHSQRLQIGAI NMKKRWILLLLALGTNVPESKADYTIHGYTSPDGQSTFHVRENPLQNPLQ KLADDLERNNNKSERPSDAFYRGYEQAQKMKMMIEQRRLMEEQRKLLEEQ RKLLEQTRLEQQTRLEQQTRLEQQTKE >Cag_0374 conserved hypothetical protein MSNPNPSPLSHAMPEAIRQLPAEAQQEVLAWVEKLLATQNEGVDELYNAI SSIVKFIPNFMVIPLMVEQIHPRIAAGVCVKMGVEKATGYANDLPVEYLS SVTHHLPNPMVGEILTAMKRYAAEKLITYEIEHHRTDLQALMPSVSESHQ ALITKHLST >Cag_1503 hypothetical protein MPEQMKRKKIRCYNCGEIFTLLMDIAGEPTRSITCPFCGASLTVTLAKYP KKVITVYRAAVGESSASEITVYDLPDVLESTESSSQS >Cag_1667 conserved hypothetical protein MALPSNLSFSIFFFVHWFSFFKRLHMRKNNERVGLRIVSLIQGSNRLELC CQAADFGERETHLLEAGFDGNIAVSIMAEKSDNKIVVTLSAQTTAHCTCD RCLALLSLPIHGTATVIFTCETVVDEAAITLDDYRSYNRQSEYLDLADDV CDALLLALPMKITCTNNPYCRVFQAGESDATHNDASHPINSEWQEALDKL KQKYSS >Cag_0434 hypothetical protein MEIVCSKWLSDFIYIMTEYLYPFQEFLKIVPGLLIFPISLYLGLQKIGTK VNARISFSSNPTSPSTVRYIDLRNMKERTIVIYSIFASVNDEEILLELKQ CNPPIILKAHQSTIVEIPTYSFMQLDGSRLDVSKLPLKKTTIYLELAHTT IKCHTLNYQKSVSRKLSKYQILEKEDHYPDALPYNKHAIYAITYSKNSRI KTATINEAGTFHGDWTFSLNQLPISALISKESVESYLQSMNFSNEVEWFK VYYLNHCSS >Cag_1671 conserved hypothetical protein MKPLPVGIQTFSEIIKQDYLYIDKTSLANELIKRYKYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKDLLIYKQWNWEVTYPVIKISFSGGIHSKADLEE DLIHILNANEKRLELKCENRSKAKYFFAELIQQAFQKYQQSVVILIDEYD KPILDNIENIAEALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPDFGNICGYTQNDVDTAFAPYFEGVDMEQVKRWYNGY NFLGDKVYNPFDILLFIKNHKMFKNYWFETGTPKFLIDLIKKNQYFVPEF NGLKADESLINSFDIEKLALETLLFQTGYLTIKQLLLSDVGVSYELGFPN KEIQISFNNYILQSITQNSQKESIRHELLAIVKAGDVANLEPIIKRLFAS IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRIDLTLKT IDKTYIFEFKVIKQEPLEQVKKMEYYEKYDGERYIIGIVFDPKDRNVSKF EWERV >Cag_0011 conserved hypothetical protein MAVGNKRNDWFDATGHSKGDWMFQGNFQRTINYLSEHQKILLFNPFCHSV DLLQGSENVYKWLFRVNDPQNNPFEVIFFVEQLEELLLDVPSEINCSDPS TLTPEMIELYTTGKKITWRHYDAQEEVDDPSRYLFEGKVFADMLMEAKDN DRTRVQIDLRVDVRFVLYPAFRIIPDPVLHAMVNGGMSILMQTATNRMFQ AISKDFHSITPT >Cag_0826 conserved hypothetical protein MESNGFFPFRSELLKASIGLATTITETSEQPIFDELKIRRYNVPADEIAS FILTKLDHWFGWNMLSDRPSKNNTRLIRADVGSILLFGLKIKVTYGLYEE KDANERPITSVHAHAETSIESKGDLGESRRVIRMMLSALDFNYLTEQLHD EEYQSRSLDCAATRYILEQMFVVEPEPEPAKPAPSKVPKATVIELRKPNP VQTIPLITKPKSNEADVVALLEPMETEQPAANVAALEEETYQSSATSAAG DDKPAKPKIIVVTSKKNQ >Cag_0998 hypothetical protein MRVSIHAPAKGATKVKVKREIEGMFQSTRPRRARPKVNDRRNKNTVSIHA PAKGATKKQLVSYTAFTVFQSTRPRRARHRLMLFACYSPRFNPRARGGRD SEILRRNHELQSFNPRARGGRDAVQTQQTTINGVSIHAPAKGATVIRSLF ENEAAVSIHAPAKGATLMQ >Cag_1949 conserved hypothetical protein MERLLVLYGGKIAANSFVTDSLSKALADVLHYGGKECNELSEALKDVTCI LAVQAMPPSQTLALFFELPAVLRDYAVKGAPKQLLKESDMDALQKRLEEL TLKAFDSYMMHRETISQLKVDEMKRQLFMQLRRAEA >Cag_0561 hypothetical protein MPQNDAKRFVEKLRADDGYRGRIAEIMRYEGYSGTADDLKKLTNEEGDDK RYPNKSYCSSLSWHQGDKKQQDGASQGYHHWAG >Cag_0265 conserved hypothetical protein MALLESIRQSAPALHRAFASLADGEQHLQRLVELDGYWERLKQPPARVVL PASALPCNAAISGEYDLIYAGGTLSMFHAAVMARRYGHSVMVFDRHTPAT STRDWNISWEELLRLRDVGLFSEAELDSVVVRRYRDGWVEFYQPDGKQKR LTIEHVLDCAVETSTLLGMAKVKLLEVPNAAIFGGYTFQRCYQLPDGVIV EIIDSKGERLFYKCRLLLDVMGILSPIAMQLNEGRPQTHVCPTVGTIASG FEGVDMEVGEILASTRPADVENGTGRQLIWEGFPAKGSEYITYLFFYDSV ESANNKSLISLFDTYFRLLPEYKQMGKNFTIHRPVYGIIPAYFHDGVSCK RTIAADNILLLGDAASLSSPLTFCGFGSLVRNLHRLTAGLEQALAANQLS QEQLTTISAWEPNVAAMANLMKYMCFNPETDSPNFVNDLMNEVMIVLDSL PHRYRQAMFRDEMKIEELVEVMLRVAWRYPKVLSATWTKLGVTGSIGFFK NLAGWALGK >Cag_0472 hypothetical protein MKQHRHNIEAEVAKTMSLLDKPAAIEVSAPFRARLMQRLEAEKNNGLQGN HAFHVDYRVAFMALLLVANLASSLLLFRQENRTNSPTQNVAATLNVDALA EQELLGGDEQGEWYENILP >Cag_0532 conserved hypothetical protein MHPETAQIVHAIESLKQAPNFIKDYIFPIANAFFTSLLGAGIAYFTLRYQ EVIQIEKDKMDTVNKWTLLVDEARSSLLAIKSNYHGNLTDSPIQRALAIR TVLFTATPINEEYLHLFFLIPKATEKKCEYQKWSQISIIRTMVLNYNNLL KLWIKRNEVERPIKEKLLQKYSQNAYADVNTDQIIECIGAANFASLVDLT EHVIKLTDDLLVEFDNFLHEFPNYAKTLINTKRLKRYGSILTYSNNSNKK LLSMLEKSPTSNYESVIKLFGMTVEQLREKYKTGYEL >Cag_1197 conserved hypothetical protein MIIPEKYKVWVEARKKYKLSDAQIQMARELNLNPKKFGKIANHKHESWKV PLPEFIEELYIEHFGKNKPDVVKSIEQIIESKKS >Cag_1149 hypothetical protein MLRKKLPLLLCAAALLSPMNSLYAGKPTAQAKSSVADGAAAASSNQASMS AIVPVADHTKGIYLVLTEADAMTQMMALVLATQHLEQGKTVQVLLCGPAA KLAVKNCKEEPTIFKPINKSPQMLLAVLLSRGVQVEVCPLFLPNSNMTQE QLIAGVTVAKPPVVASQMRADGIKTMNF >Cag_1427 hypothetical protein MPLFAFTKYAQVREFRKKSLHFTLGMVLWILNIGFWMGATARVARTGLSV ETPKLGVFTTYGMVGRRRGEIHFRPHERADIESAPTRDGSCVGTLNPKET LRSLFSDREKPHSPARHH >Cag_0333 type II restriction endonuclease TdeIII MSLNQQQIQKVETVLRNSLRHKFQNYNPEPAVMPFHTRLLGKDRMALFSF IHSLNTNFGTTIFEPVAQALALSRFGSVELQKVAGNQISLQAQQVIQEIM DGLTTATNLPCKSQEIEAIRAVCQSGEMRKVKPTKVDVKLISYDGTLFLI DIKTAKPNKGGFQEFKRTLLEWVAVTLATNPEATIETFIAIPYNPYEPKP YSRWTMRGMLDLEAELKVAEEFWDFLGGEGTYPQLLDCFERVGCELRSEI DAYFAQYINS >Cag_0766 conserved hypothetical protein MTIESISQRQSARNELISSLLARCPMNVEATGSHRSFIMDKRGEGVPIII TESEKLSGRRPEYQLLDDAELKLTIFATPSPHGDE >Cag_1916 conserved hypothetical protein MTTPAYPPTQRDDYDSPWKEAIELYFSEFMALYFLKAYAAIDWSKPYHFL DQELRSILPQAENGKRIVDKLVQVHLLDGKERCLYIQIEVQGNRESNFEK RMFTCNYRIFDKYGKPVASFVILTDTDSSWRPTSYSYEFAGSKMALEFQV AKLLDFEPRMEELLASNNAFALVTAAHLLTQQTRENSLDRLDAKTQLIRL LYNKQWPKERVRELFRVIDWFMELPKELEQQLQTEIYNIEEEQKMKGTSI YL >Cag_0617 conserved hypothetical protein MQQVIFQEKYPLFTLELQKNETTYTNVNDILAYFRQKIDEHPITVFIANF DHYSHTMSLPEHAMNPAIKDAKNIIFCFGKDLNDPLMIGARPRSIGVIEF ENSFLISFLEAPNPMATNTMEAWAKGLKKS >Cag_1130 conserved hypothetical protein MRVFFLKYAQQELDDTAHCYEMELKGLGKIFKDEVKKAISRIIKYPEAWT IERTTIRKCTLHKFPYVILYSIEKNHIVIIAISHQHRKPYYWIDRKPT >Cag_1397 conserved hypothetical protein MNTKSCIQVIEQGTRGFYIRNSLYLPFHCEILSIWVGREMSFIAAPELLC DMMDSEVLALREGDRYTNLVFRKWGDMAKELGNNKGHVILFAAEKGSDLF QAEQRFYIRITFELETRELSFELLDNPFYL >Cag_1655 FtsK/SpoIIIE family protein MATTIKQRLKNLFVRALDRSMIKRELAGIALMLGSLFMVSAIISYHPDDE ALYSALRWFDVFSNPARDTADAIHNHFGLFGARMANFLIHFVLGYPVLLL ISSFFFWGLSLVRARSLKPALFFFLYSVVMAIDIATMFGLTSLAFSDVMS GSIGRMLAAFLITTIGFSGAWVLLLSVGLLLTFYMGRSFFIPAFHALMAM VPRLSSLWDNIRARISAIQKKKPLQSP >Cag_0683 hypothetical protein MNDGIVIPTSQFSIINKNPTHSTHPLIAKNLVQDKRQPQGLPLHFYNSPP LEGCPKGGVVFRANSNCGQKGLPQRAAPYFPLSTQYFSTPPFSINVIRYK KRLRVRVIRGIRGEWFVRGLFGVARVHHNTQHYPFYTSYVYKQYNERLYS HCFCGLHENCYGIRS >Cag_1761 conserved hypothetical protein MSKQLALCWRYHAGNNALIWQIMFTESGLLVGQKRLVAEKKALFFALNET SGEVAVDDFVLMEHGEDSTIPAGEGWFTGIVTTRHALTYCYATAPESPEH LGMWAIDFREGKVVWSKAGASFVAHSKNAILAVATTFFAGFPERHFVLLE PTTGNEQPATLTIEQVNAIRAAAEPEEVRQGVMMPDLANELLLAAFPIIT EHVGNKQLLCCELLTHGNWLIATLHYPSATPNCCDSYLAMWQGDTLRYHD YMEQAATRPLLNSVIVHNKHLFYIKAKNELCCLCLSTHHANHTDA >Cag_0696 conserved hypothetical protein MKASELDKKFDDNQEEVLDYFDISKIKMLNEEPTRVYIDFPSWMVDSLDR EAKHIGVSRQAVIKMWLAEKLQSLNSQAEVI >Cag_0999 conserved hypothetical protein MVWLLGSFNPRAREGRDSLICLSFRFRSRFNPRAREGRDTGTRAKMLSKI CFNPRAREGRDLWATVKRTTTKFQSTRPRRARLAFYLYSAISKAFQSTRP RRARRGIEEESSNGGMFVSIHAPAKGATINSHHLCEIVGVSIHAPAKGAT VDAYPDFTELPFQSTRPRRARHKNQECEISVYQFQSTRPRRARPIWQKLG LGCDMFQSTRPRRARPHIRTNVFSK >Cag_0219 chlorosome envelope protein A MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGS MGGGTSLRGSSPEVSGYAVPSKAVESKFAK >Cag_0426 conserved hypothetical protein MLTNSKHVVSRPNGGWAVKTAGTTRAGRVFENKIDAIKYARDAAKKIQGE LYVHNTDGTIMEQRSYGNDPFPPRDKK >Cag_0691 hypothetical protein MSKREELLQSYENGNFLETVYACSSNDHNDRSSVVFDLVALNNEGLIDVV GAFQSLKNESSNSPDFFLTRHVFEKALPELEASVPAVMHCVLQLYLDAGQ DLAASTVINSFFDFCTKKASRPHEALEVIKASPGKLAHLLPATLIAGSQI DSSFYLCETMRLCKDENIELKRWALFSIGKLNLPEDIKKFGDALSALEYA AVQETDDQILSSVIKSAFPLLQRDKSQEPRAIAIIISALRKGDDYVLHAA SEIFGFYTGELPTTLREAFFVDLLRVKPTHKGTLDNVDYGISHLLKNGNS EQAIQFLEALLLRHSGELTIEVFDSAISEIVSNKAFISKVLTRWFLRGDR VLCEAVHEIVWAHHGSGLLLEIDVTELNPSDSGHILFIARKAIGYLFMQP LSAASVLISLMRNATDDKVLKKLGELLLDPLLLNYTGKARDYIIKQSGSE SGKVKETIDNALKDINTYLEELRSVGSLSALHPSEAQREAYNRHYSQLMA ESWKAAEAKSVVLNLFPKSVLLYGKKLINYVYGSDGQSHRQEIPLQCLGS EMEYARMHIFDPFGMDYMLRVFRNEQLKT >Cag_0926 conserved hypothetical protein MHNSILRERSLTTCNSLITLQDIRTLKALYQLKEQTRILRLPVVNNIIKQ RVVGQGCIESLKNALYSLQTIYIDDDTGQRRLQLDEAKDIAVDLTYERQE LQKDIFYLEYGEDKFIEYLSKFSPNFIDYVNKGIEMFKGKHFNAFITDRD GTTNNYCGRYRSSVQPIYNSVFLSRFAKNRCDYPIIITSAPLKDFGILNV SINPAHTFVYAGSKGREFIDLDENFHSYPIDEKKQQLIERLNGRLETLLG KEDFEKFNFIGSALQMKFGQTTVARQDITRSIHEDESVAFLEKVASMVRE IDPKGENFRIEDTGLDIEIILTIDADAGHQEAKDFDKGDGLAFIAQTINI KPNGNPVLVCGDTRSDIPMLTKAMEMYDDVWSVFVTRDERLIEDVMNICP NTLIVPYPDILLTILGLLSL >Cag_1321 hypothetical protein MLIYRHLIWTEIKNSRLMRISLINVRRKMKTTLSLFFLLLAFSGTCRADT ETLFTIRDLKGEMQVYSGAIGSVLPQIKKNDDAQSDAIYIQREKEGEKEK FYIAHNNGRHPVILIQGEKSISFLESYGDNNFIWTVCLDKRNPDGSSLAI VANIKAAGVAGYTTSSIMSGGAYTLLQPRTNK >Cag_1786 conserved hypothetical protein MSETFRQLLDESIQLELNLAKLYTAYNDLFEEDEDFWWDLAMEERGHASL LQHEKNSPQQEPFFPENLLANDLDVLKAANKRILDLVAACKSTPPSRLEA LRTAYELETSAGESHFQRFMESPASSFAANIFQQLNQGDRDHAERIQQYI DELE >Cag_0343 photosystem P840 reaction center, large subunit MAEQVNPAGVKPKGTVPPPKGNAPSPKGNGAAGAPSVIKEQDAAKMRRFL FQRTETRSTKWYQIFDTDRVDDEQVVGAHLALLGFLGYLMAIYYISGVQV FPWGAPGFQDNWFYLTIKPRMVSLGIDTYSTKTEDLQWAASNLLGWALFH IISGSILIFGGWRHWTHNLTNPFTGRAGNFRDFRFLGKFGDVVFSGTSAK SYKDALGPHTVYMSLLFLGWGFVMWFVLGFAPIPDFQTINSETFMSFIFA VIFFAAGIYWWNNPPNAATHLNDDMKAAFSVHLTAIGYINIALGCIAFVA FQQPSFADYYKALDSLVFYIYGEPFNRVSYDYVEAGGRIVSGSKEFADFP AYAILPKDGAAFGMSRTVINLIVFNHIICGVLYVFAGVYHGGQYLLKIQL NGLYSQIKSVFITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICE LNLFGTNIMMSFYWLKPLPMFQWMFNDPSINDWVMVHVITAGSLFSLIAL VRIAFFSHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLW GIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSVAGLQHHYTSGIFYYF WTETVTIFSSPHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIR WLGKKFLNRDVTFRFPVLTISDSKMAGTFLYFGGVFMLVFLFIGNGFYQT DSPLPPQVGDASVSGQMMLTQVVDYLLKLIA >Cag_0038 hypothetical protein MTTAKLPPNQPDDYDSPWKEAIEHYFPEFMALYFPEAYAAIDWSKGYHFL DQELRTIVPEAKTRKQVVDKLVQVQLLDGKESWLYIHIEVQGNRESGFPK RLFIYNYRTYDKYDKAVASFVILADSDPTWRPSSYSYEFVGCKMTFAFET VKLLDFEPRMEELLASDNVFGLITAAHLLTQKTKNKVKQRYEAKLLLMQL LLQRQWEQARIDELLRVIDWFLRLPKELRQKLKIEIHKMEEAKKMKYVTS FERDAKEEGIVIGIEKGMEKGREIGVLEGMEKGKAEGLEEGLMQGRLEVA RRLVSGGMSKAEAALLAGVSVEML >Cag_1217 conserved hypothetical protein MPNVFYYPVSYPAWWLDYYYLATWALSLLFLGGVWAIFFRFGKFSYGIDL GCFWKSALLVVCTTISLGGPMYYNTRFVGEHGQDGDSVRLADGKVVYSDR NGNLRQLAIGDITTIYQESVTYNPPPKIFIVAKTAQGKDSLFVTTNLPNY RQFIEELSKQSGVTATVR >Cag_0633 conserved hypothetical protein MSIFNDAEKRKRLMKGGLPLILAVAWMPIVGMVVLVIIGAPLATLVGWPF TFVLVGAVTLSLLWLFLKLFRKSGHKIKQGNP >Cag_1597 hypothetical protein MSTNTAPRISIPIELSFATPIRYTTGDKPSDVVIGLLGADTYPDLIVANA GSKSITVHFNNGLGEFTSSISYASFYNSPPLALSVAKINNDAYGDVVAIT NSSVSIFLSNENGTLQTPTFYTNNGWQSLTAVATGKIDTDTDIDVVVTDA TTNKLYVVQNDGTAVLNNQTPPSYATGNYPIAVTLGDVDKDGWLDALVVN NDNTSTPTLSVLINNKIGGFKTKVDYTLATGALDVTTADLNGDGWLDIIV GQSQEQGNTLVLLNKGDGTFGNTQSYSAGAYPLGVAVGLLGNDNRADIAV ATSGEKTFAVLQNQGNATFNAPNTFAVVSTIDTPKPTDIAVGDLNGDGKN DVAITSEHLDSVSLLLNTTFQTRNFTEQTPLLIAPTILIEDPENNWIDGW LHVAITKNGEAGDDLVLQTSFNEESDLTDYIWFDKVNNGVRAGSSIILGR FENKVTSTLGKEVGKIEIHFTNPNYKTNEWVQKIAQSILFTNESDTPSTA TREVTITAIDASHQTSSITQQIAINAVNDAPQELFIEGVPIVGQTLHADT STFSDADGLNKANITWQWLRDSVNITGATNSTYTVTNNDLGKSLSLQAKY RDGAGNNEVFTVTTDTVKALNEDPLIARPSTVAFQTSTTLKSFTDASSLP VIGTLDLNHDGILDLLVAKSSSAPSNQLVVLRGSSNGTFTQDSLTYTLGN TPSAIAFGDVDNDTFTDIAITNKDSNSVSLLRVVNGVIKNPFTFSCGSKP TALAIADFNDDGFEDIVTANSGENTLSFLQGNGNGTFAPTNSITTASAPY GVVAADFNGDGKSDIAYSDSGNDRIVVCTYNNESWSEITNALVGDVPHTL VATDFNGDGKSDIATINSGSNNVSVLLNNGDGTVATAKTYTVKSNASSLT ALLASDVDGDGFADLAVSHTTGVSLLMNNGTGGNGTFALAQEIVSNRITA PPTLASADVNGDNLTDILFPGSYTSINAQLNSQSSSATLFTEQTPIEVTP NLTLRDPNGDASWRDGKLQVQINYNTTAYDVLALPTEKPELGGVWIDSEA SNALMVYDQQTDTNLQIGTADNTSVSNNSTWTFTFNRYATNALVEKVAQS ITFSNNRDNPSLETRTILFTATDSLGASSSATQHITVQPVDDAPLTLISA TPVDGAGNVEINSNLSFTFSENIVFGSGFIELHRDAPNGALIEHYDVATH TNLGLNGTTLTIDPYNQLAYGTRYFVTFGEGAIDNGYGTTFSGSEYDFLT ATDPYVPPTPNNDGSDGGSSTGTILIGTGSLALLALAFL >Cag_1552 hypothetical protein MPISLVYLSAALLLLSLHPLLKAYVDIIYMVVWGTFAWGVYANVLKKKLL LPLAFLPFVVLFNPIDPPALPLYADIAMKIGGATLLLFTRKHIAV >Cag_1391 putative type II restriction enzyme MNQWIELSIEYANQRSYLDDLFSVYSTIPDSIRTINEKLWSNVERAFYEK DNLSLIKELLLLDLFPIKDSYIAYLKKDITAIDRNPRTINRICGRLYEMG LNKIYEKSSEPKETNRQIGPMFRNWMRKKSLGIEPVDLSTFMNNEDDAVL DASDKVMMDFAREYLGYQHNKGLDLIARFNGTYVIGEAKFLTDFGGHQNA QFNDAINTIEAKGVNAVKVAILDGVLYIKGKNKMYKAITSFYKDHNIMSA LVLRNFLYSL >Cag_0601 hypothetical protein MVAITTTELRKNFKKYFDIAHSERVIVHYGKNKSYEIIPTQKECENDAYF SNPKLLAALKEAEEDIAAGRFTEIKDPKNLWDSIK >Cag_0600 hypothetical protein MVITLTPEFEQALHKIAEHNGTTVELLVLKTLQENLLFCKPQKTLFRKSE KTLADFLVGYVGVFDSDELVKSGAQMSTNVRKQFGDILLEKRQQQKL >Cag_1922 sulfur oxidation protein SoxX MTHLITLAVTTALLLPAIVNGAPRAESIAKGKQLSFDGSKGNCLACHLIA DGEMAGNLGPPLIQMKQRYPKRSVLKAKLTDATASNPQTLMPPFGKHGIL SNEELEQVVDYLYTL >Cag_1263 hypothetical protein MSKQDLKLMPRFLKGLPNIWTILGWILIISSFLKQPVVAGILIALAGHLL TQAKHRDDLKEKQSLFYLDAWVKAYEEAQSLLKDGNNDRVEWIAAARALL HAEQLEKKITEDAHLELLDLYKLKYRHFFYSIIDNKPESFFHDENDRDKA LSEPSVYAIWKAAQWSEEYNDHSKDPLKRKFADDEVEKTKFASIGLYRFL KKKRTR >Cag_0250 chlorosome envelope protein E MATNISGAFTNGAAAYGRFLEVFIDGHWWVVGDALENVGKTTKRLGANAY PHLYGGGAGSAGSLRGSSPTVSGYAQPSKPTESRFND >Cag_1528 restriction modification system, type I MEKIESIKTLYQQSHNNLETLYNALSQKAFKGELDLSRVAVPEETADGKR RD >Cag_1606 hypothetical protein MSTSGAIASLDAFLHRWKAKAGNYTGDYITTPEGLVRNNMDDEQGRGGYY QEYACTSESQVMMARGYLRAYQATGESRYLQNARTAMQALIRYFFFGKVP STATAWRSHWIVNAGAPFKSKENGRTTDTIAFGEAYECWPTWRKLRPNEF ATAGDSMHWFIENFHLFSQLETEDQKGQWLAARDAMFREFKLLLSPKWQA KYKGAIPFEYTNKGDNLTVRSTSIFRGPYYTGYQNPLPWLYMQDYTAAAN MLQLLVESQVAYTKSTGVKGPFAPVYHYDASLLGSAKKNVFTWNGPDPNT FWGGFQYRPFADVAHFWYHCKRSNIQNAAVSNASKVCMSFLSWLDGWLTA HPNNEYVPTEFREATQPSAPPANGDNDPHMIALALKGALFCKNAGADAAM VGRVIARLYAMVMKRQSKAGDMAGAFMHDPYSHIFKGFWAGDIMEALALY IMHHEKG >Cag_0237 killer suppression protein HigA, putative MILLLLFCTLHMNLRYSTRKLEKTVETFSAIKKHYGIWGKQISQRLADLT SAKNLADMYTIRPAHCHELKADRATEFAVSISRNHRIIFIPDHDPIPRKE DGGVDVNQVNSVIITAIGEDYH >Cag_1579 hypothetical protein MITTRKAMYLNPQFIEKAGKKEFVVLPYEEYQAIEKMMEDYMDLIDVRET KAETQNQPSVPLDEVITMLKKRMNV >Cag_1571 hypothetical protein MPHCIHRAVHSFAEITNLPHPADLVGTWRRFGLLGPVYEIICMGNTLPNG DVMMRVRVVESGEELDYRFADILDDPKER >Cag_0968 hypothetical protein MGAAWSGAFSFNHYKRTFVMQLTGKLIAILPEQTGAGKNGPWKKQDIVLE TSGQYPKKVCVSFWGDKLDRQMLQLGTMLSISFDVESREYNGKWYTDVKG WKAEVAGRAESAPYGGGDDAGSWEPPAFEPTSSNEECPF >Cag_1532 conserved hypothetical protein MLFAGLALAAFGLLITIASKAGATNWLSWFGNLPLDLRIEKENFNLYFPL GSMVLISLALNLLIYVFNKLFR >Cag_1024 conserved hypothetical protein MKPLPVGIQTFSKIIEDNYLYIDKTDIAKSIIEKYQYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKDLLIYNQWNWAVTYPVIKISFSGGIHSKADLEE DLIQILKANEKRLDLKCENRSKAKYFFAELIQQASEKYQQSVVILIDEYD KPILDNIENIPEALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNSDFGNICGYTQNDVDTTFAPYFEGVDMEEVKRWYNGY NFLGDKVYNPFDILLFIKNQKMFRNYWFETGTPRFLIELIKKNNYFVPNL NKLRINESLANSFNLENLNLETILFQAGYLTIKRLISTNKGVSYELGFPN KEVQISFNDYLLQELTTVSENELICDDLFELFNNGDIANLEPVIKRLFVS IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRIDLILKT FDKTYIFEFKVIAEEPLEQIKKMKYYEKYDGERYLIGIVFDPKARNVSQF AWERV >Cag_1329 bacteriochlorophyll A protein MALFGTKDTTTAHSDYEIVLESGSSSWGKVKCRAKVNVPPALPLLPADCN VKINVKPLDPAKGFVRFSAVIESIVDSTKSKLVVEADIANETKERRICVG EGSVSVGDFSHTFSFEGSVVNIFYYRSDAVRRNVPNPIYMQGRQFHDIIM KVPLDNPDVIDTWENTLRAIQSTGAFNDWIRELWFIGPAFTALNEGGQRI SKIEVNSIGTQSGDKGPVGVTRWRFSHGGSGIVDSISRWMELFPVDKLNK PASVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGM VGKFNDFTVDSQLRIVLPKGYKVRYAAPQFRSQNLEEYRWSGGAYARWVE HVCKGGTGQFEVLYAQ >Cag_1638 conserved hypothetical protein MRIFAWLLVIVAVVLQGCYSFSTENRLTHLHTIAVPIFNDHSGAGIAQSR SELTKALIDRLERESALRLIPSLSLADALLEGTLVAYSDVPAQLSATTGR AATNRITLIVQVELEERSTHELLFSERFVGSAEYAIGNMVAQQEARRRAQ HQIAESIADRIISGW >Cag_0551 conserved hypothetical protein MALHGEEQRPTSPTRLPKRHPARRGYHLDMTPMVDVAFLLLTFFMLATTF TPFYSMELSQPQKQRRLAVQAEEVLTMQVANNGIVHYRLGSNALSSMPLY QETATTQEPSATPMLNPALHHFLRSLKAEQPNITVVLSMNNQARYRDLVA VVDLLNSLQITRFSLGEIDGEEKQKAGVK >Cag_1304 conserved hypothetical protein MAFIKQISNAMDAQLLQITAQVLFEKLQSLRSKMQDDVATPDDRVAAAML EEVVALARNRYCVAGNEAEFQQGKSVWVERTAATHFPHIRLHGGALEDDP IA >Cag_1009 CRISPR-associated protein, CT1134 MKNSIEFKVYGREAMFTDPVSRIGGEKCSYHIPTYEALKGIVKSIYWKPT IIWVVDKVRVMKPIRTKTKCLKPLKYHEKGNSLAIYNYLCDVEYQVLAHF EWNLHRPELAQDCINGKHYSVALRALDKGGRQDIFLGTRECQGYVEPCKF GEGKGEYDNISELAFGFTFHGFDYPDETGQKMFRSRFWNPKMVKGVIDFI RPEECLPEHCKPIHEMSMKPFGIDTNQRSVIKEEEVWQ >Cag_1036 uncharacterized conserved coiled coil protein MHLKDAYKKKAEAELELAQAKLVELRAKSKNFGAEAHLNYAKHLDDFEHA ITNAKHKLHELGEAGEDAWEKLKDGVESALRSSSNKLRDIANKFKD >Cag_1895 hypothetical protein MLLGEMMIQGVAMSAYSWSGYNGTGYESLLQQAVEVGATSVLLGSVSIID LNNGAVSAWVRDDGFTTTASMGDVEAAIQQAQAHGLQVFLKPQIHSYNPA SAAFGGNPYNNLINPDPSNPLIIPNLDLFFEGYKAYIVEWAELAERYQVP LFSVGNEMVAVTSAEFTPYWEDIIASVRNVYHGQLTYAAMTDVKWDSNDE VSHIEFWDKLDYVGVDMYPDFDTGATIPTTPTVEQLNDIWVEQKWQSYLS AIAEATGKPLLFTETGVASFLGGANRSRYTDALISQMGTVRDDATQTNWF QSFAETWMGENQPEWFGGMYFWNNDPPYNAGLQDITGYTFFGKPAEVVVS SLFDAVNSLDFDQTLFLASDSDDRIALYKYIAEADANPLTRAQSYHSTVI IELNGTILEGAEAVTPTIHFYLNGKDYGAVTLSNVESEYSINKGIEAASK GEAYYPHSTLIPFLFEIDELEVRDIHIVRDSVQVENSEVYISRVTIVPDM GAATVNTTVNSLQNAWLAFEEPSQAWGGATGYQFPNGAIPYDVPSVTIDT SPYKKTLATMSGTPDNPITVKGYEGFDTVYLLGSPEQYTITLEGDMLMVA ESSGLGQNSQLGGVERLLFAEADYALLFGGMGNDTLYGGAGNDRFNGGDG NDVVLLSGYATEYEVSDNEASATYTITDSVAGRDGSYQLSNMEALQFGAS PMQWNLEEFRAALAASQLLPQQEEPFNVTGSVTFWKNGAAISNVATTLSL HSVTNNGEELLFQHLQHHADGGYSVEVWANATDALHSLQFEFQLPTNAQA AWHFSEEVPQGWQTGVNNQGADALLIGGMGATALPSGLVQLGTLSFVAPT DADRLEIALTRGELGKQWLVPATITLESNVLASNGGYQHNALWQGSYHLS VQHESTEEPTNMVTMSDAYAALQIAAGHNPNESEAPLQSWQFLAADINRD GKVRASDALTILKMALNYHDAPSEELIFLPEWVGKSEMTRSSVDWSATEI MLDVENYQIVNLIGVIQGDVDGSFS >Cag_0468 conserved hypothetical protein MSWLVLLAMALTLFFSFVVLFRKFLGYMKKEQNLVIEPLKDALIDKDNPV GLNPEELQKLKQQQQEAQRHLSEVIAKIPVIQKDGRFQIDQEAIQKRKEE LLKTENISKN >Cag_1881 hypothetical protein MNIINSLIEKLKGAEIKMAHEKGAFDLFALFMLDVIPEQWDVVAAASWIT DENYDASLRYIINCIQPLLSSKELFSISGVVLIDQYNPGLDAVLEAIHVE HGLVEVRDTTFFGLDIKHGFVITSCTRHGCTAKSA >Cag_0476 hypothetical protein MEKLVDYFAHHPVVFFIAVVFSFFVVFAFFRKIVQTLFVIGALMVLYAAY IHFTGSPIPDIFQHIWQWMVNLYQTILGLILRILKKEPEEGVEAFIIFFA VPTCHLLQGIQQRCCASGDGKHGF >Cag_0376 hypothetical protein MPLNLLKKYPELLEIAHMSEADRNSSLYAIFNRDFVQNDNLYFQGKKVRP IKGEDGAIAMDVLFQHLTTSSDKEKSSLNRNARTFEMARSCRLHWIRYHI DKSAGKGVKIFSCEERDQKRRKDVIRTYIFDVDQKYVIILEPQRSGNDYY LLTAYYLDRDDGKKQIEKKFKQRMKEVL >Cag_0271 hypothetical protein MYPSVKKVVPNENYNLTIDFNNGESGTLDMKPFLNFGVFKRLKDMNHFRQ VRVSFDTIEWPSGIDMDPEFVYSKCKKSTPPQVEPADA >Cag_1697 hypothetical protein MRDAQIGHLDVGVGSLMLNYGFWIIVRATLAVAPNKTIHTKQEFAQFFCV GADSISALSSVRAKMDFAPTPYAPEIWINRF >Cag_1615 conserved hypothetical protein MQSFKHISLKALAGLLLSVLLVALMALVLLNSGSVDRAARALAMQLFQKE LHGRLEIDELHLTFPNHVTLLHPRVYAPNEREPVVEAARMTARFHFLALL QPDIKKLAFQSLEAQRLKVRLVQNEQGSLNIERAFASRYPDTTKTGIEEY FCKQLSLKQASFSYSFIKDGKEIPLARANNINAQLRSFTAGKALVKGEIQ EFQSNIASHALVVQKMQGRFFFSDKRSEVLDLQTRIGNSHAILSATLEGV SLFKPSLLQQVAQSNAFVAIEEIDLHSNDVKRLFPSLPLLEGIYQVKATA KQQNGTLELREAQLVYRKSKLALQGTIQHPFESNKLQYNLQCDSSKVSSE LLTALITNKEDQQLVTSLKSIGDITLAGKLSGNLSALQADVQSLTNVGMV GFKGAIERQPNNSFAAHGNVALNALKPHLILGMADVKSQLNATGTMELLV EPNALPEVALALQLQNSFWQHLNVTKGTLSFRHKKQLYEGALSLSNGSEN LAVQGTVNLNSAQPTYDLTGTTYKLNVGQLLQSKNFSTDLNSRFTLQGEG FDLRQLNLQSSVVCAPSVINDVVLPNGTAATLSIAQQGTASRVKVTSDFF DVTAEGHYTFEDLTALGWMALSGISHEVARLNIWGETAQLNPPANMLTPQ PFTVTYQLALHNIAPLLSIIAPLQQIAMQGTAQGKAEHNGGAYTIAGTID LNNLIVEEEFAAKRIHLQGSLRGNSNGILEAQGRGAIAALRVGKQKVRNT NVTAAYVPTTLTSSIDVDVADVVQRVSTSFAMKQQGSGYLLDVQQLNVQD REGSWQANNNLPIVLDKEVLRFNNFTLMRGAQKAVLQGELSNNRASAFTC TLSSLQMNELQHFMLNAGLEKLQGIVSATLHISGVPGAKQSSISVRADNV AYDDLMIGIVQGSARHSNNLLHFELQSEAPAMVNGIQSAQRSNALLSTIE GSGTIPLELKYYPFKLRVNEQQNVHATFRSDNLSARFLEYLLPFFSAAEG TIPTLCTIEGSAAKPLISLHSRLQDTRITVKPTHVSYRLDGDIYATPQAL ELRNITLSDNNNGKGSIRGFVHLEKLQPSRLELAASCNNLLFYNKKDQQD DTSFGSIVGSTRNFTLTGSLRSPIVEGEVQIDRADYSLYSAGANESAQYV GIDNTISFVARNPKPKAPKAKELKSGGSKEFYYSLIDILTIHNLRISSPM PLKYTTIFDRIRGEELETTLSNLSLVVNKNSQRYRMFGSVQVTSGTYRFS NASFELQPGGSITWNNVDMRSGVLENLYGRKYINALNPQSGERDDVHLLL AMTGTLNEPQVAMGYYLNDQTQPYASSTTIGTETSKVDPNAELNVISMLL SRQWYSKPGDGGTQENVALTSASFSAGTGILSSRISRVIQTIGGLESFNV NVGMDKKGELSGLDLYFAVNVPGTDGKVRVSGSGSANDPRTANASTAYGS NQKVEYRVTPKVYLEASHSSGQNSGISSSSSTLQKPTDTWGVSLSYKERF HHWDQFWKKLLPFSSDKASDKTPNKVPDKASNKKENKPNE >Cag_0709 hypothetical protein MFYRKNFLGIPEQVLHGDGFTIELSRNEVVLIDIYNADLLLSKLADEVLT AKAS >Cag_0877 hypothetical protein MIKTAVFVEGQAELIFTRELILKFFEYKNIWVECYTLFNDQELNPTEYSY KSDTVNFYFQILNIGNDNKVLSSILKREKYLFGGDKAFHKIVGLRDMYSK EYRDIVKKSTIDSMLNQKFIENHNNTILEKSRYHKKISFHFAIMELEAWL LGIQGLFEKMDHRLTNEKIAEACRIDLFKADPETAVFHPAHLINDILHII GASYTKKKDEINKFMSYIERDDFARLLNSEKCQSFTSYCNALVIK >Cag_1280 conserved hypothetical protein MDIKAIAQALGMAIFRYPALWRKLHHEPASNDGSMLRNYAVPIIALVQLL KFPLIGEPRPAMFLGIVSMLVDSAVLYVLAGGVLALLPIPRTEEAKGQVM TVFCYALTPCWLAELAYGHGVWSILIALFALLHALASSREGLVRLLSLEV QSASGALTRSAFFMVLISSISFFILSAATLLVSF >Cag_1578 conserved hypothetical protein MINQKNIGSSFDEFLEEEALLDEATAVAVKRVIAWQIAQEMKAKHLTKSL MASKMQTSRAALNRLLDATDTSLTLTTLSSAASALGKKFRIELVS >Cag_1962 conserved hypothetical protein MSEQSHADKVQLAYAAVLGKTSTIGIGLIVVGYALYVMQILPATASPEVV ASHWHLRASELHQAINVPNGWDWLGNMGYGDVLSFASLAYLATVTTICLI TVIPVLLKENDKIYAVITTLQVLVLLFAAAGIVSGGH >Cag_1946 hypothetical protein MSKRQSFIIGGVLIALVASLWSFWPTLQAEKVPDKPSVTASVPIDSTVCI APTEYMRSHHMQILQDWKRTGARDPRPHTTPDGRKFQKSLNTCLGCHSTN SYFCIMCHDYTHAKPNCWNCHVAPFK >Cag_1917 hypothetical protein MDFGWLSIHTSVAEAWSLILRNARSKTPESIYRNLNFQDARISRIFCVGA KTYVFTCLLRILGNRKGLYLHCKRNHGSDFSGNMRFNTK >Cag_1678 hypothetical protein MFISVHPDAVLDDVDEAFDEPNYNNYNTSPEPPSPYADLEKKYRKNRFNR QRQHSNSNPLKDVTAVNGNRITSTQKANGSKPQKPKAYKPKSKPSNSTVA AAPKSAQATYSKHSAHPTKSTSPAKASTNGKTFAKVERTPLSPRAAHTPS ASHTPHKPHASNSPHKTHTTHSPKTAHAEKSSQLTNQAKSAPTNQQAVRP SRSPQLSQLPPLPKSSQSSQSPQSPQSPQSSKTSKSPHSTTKPHSSQTTH SPRQSRPQQNKKRPV >Cag_1340 hypothetical protein MSAIKKLFGILWALMGVGIIPLAIQQAMKEIAEKPSEENWIFWSIVMVVL MPIIAFSLITFGIFALKGEYDTIE >Cag_0761 conserved hypothetical protein MDNNKKLQQLFENDPLGLLDVKPSNSSARNENERLVASFQEINEFFEQNK REPKADNGIQEHQLYSRLKSIRENPTKSEILISHDIHELLNTKPKAPVSI DDIIENDPLGLLDDDTAGLFELKNIKPNEKSRAETDFVARRKPCKDFDKY EQLFKTVQKDLKEGKRKLINFKLGNLRQGSYYIHNGILFFVEKIEITKKD HYKPDGTRVREDGRTRSIFENGTESNMLKRSIEKILYENGQVVTEHSDQS NLNYVESLFAITDEDKEAGFIYILTSKSEKKEIKEIDNLYKIGFCKTTVE ERVKHASQEPTYLMADVRIIKAYQCYNMNPQKLEQLLHNFFGNSCLNIDV SDKEGNRHTPREWFIAPLGIIEQAINFIISGDIIYYRYDAINQEIVEK >Cag_1797 hypothetical protein MDNTTQIGIQYSGFRFKSFFFQGLFDEQENEAFEFQTSLDIRTGSDRVII GVMVLVNRKSDAQTYAKAETESLFLVEGVERTKDESCSLIIPQVLLITLV SLAISTSRGALLVKGAGSFLEKIPMPIVDPKVFVSEIQFLGQS >Cag_1372 hypothetical protein MFINLLKFMPSRSVQAFRDNIVDVDRLIVSHAQLRDGSPGKKGLGHITRS GIVMLCAAWELYLELICVEAAKYFCLKCQSPDQLPIRVQKELSKMAKESK HELKPLEFAGNGWKNVFITHVEDLCNTINTPKAGPINELFNRSIGLELIS DSWSCGKDQINNFVSIRGDIAHRGRHADYIKISLLQDYRALIYNATIETD NTVSEYLALKTPGKHKPWRVTS >Cag_1002 conserved hypothetical protein MCSTKQPTSFNPRAREGRDSTPTFLRLATKRFNPRAREGRDCVPCCVPFK VIVSIHAPAKGATGKADEEFDAKIVSIHAPAKGATQRR >Cag_1511 hypothetical protein MSMKLILTVFALFLATQIADAKDVYVNGYYRQNGTYVRPHIRSSPDAYKS NNYGPSKNSYELMNPKARDADRDGIPNYQDNDDNNNGISDNNE >Cag_1738 hypothetical protein MIDKNELLKRISAIEQSEESVISIYSSHIQHVLRYSNINKESQAKIIEML KQLDSDLEEHKIVTKQLVDAIAKSEKSIF >Cag_0975 chlorosome envelope protein B MANETNDFAGALNNLMQTATSIGQKQIELVTNTAQNLVQLAEPLAKTAVD LIGSITNTAGQLFQNIASAIAPKQ >Cag_1100 hypothetical protein MALIRECQPKTIQEWEEWYFKNATTAGKNNFKITRESLQELGERLYEKIT EVVIPEWQEAFNALTIEDCYNYIFNLTINRTFDGYLREKSVVNDGLAKEF PQIRFDESPSELDHAGDIDYLGFVSENKAFGIQIKPVTAQSNFGNYSVSE RMKASFHSFKEEFGGNVFIVFSLDGEIANTQVIEQIQMEIERLQSEQ >Cag_0260 conserved hypothetical protein MLFKAEKNFTVNFLPLVIMEQEVPVTIHQNQESADEAARQHGTYRAPAKD FNETISEAWTQFRESEAGEALSKGSSAAKEYIQQHPTQAMLLSVGAGALL GLLLKRR >Cag_0526 conserved hypothetical protein METTNSSNQKTKEPGFGIWLGITLLWGSVFFWSSVLALQFVTGWMGEGMF QPAGSGLMRVYGVHVMVLVLFALLAMIFKRMVDPGATRQATRRQEIDAGK GERIFISLLGSIATSFFFTLLTALTFALAAGAVGVPVALTLPVVFVAGLF NIVAGLAASLLVGILFIVAKVGKK >Cag_1291 hypothetical protein MPKHVAKEFIPSLTNKDNIMQAIEFESTIHNGIIQLPNECQQWNKKLVKV IVLEKTRASIISKPRRMPHPAIAGKGKTIGDLLEPIVNKSDWECLQ >Cag_1913 hypothetical protein MYQTLINRGALKYISSIERYAMEKGWSEGMERGKEQGKAEGLEEGLLRGR LEVAERLVASGMSKSEAAVLAGVSVDMLE >Cag_0110 hypothetical protein MGKANNQPNKSKPKVSVSIGRLLMVLIGANFIMLLAPNFGMLNSFLYIPD LYTWPAFVLGFVLIFFGFKGLYKKP >Cag_1894 photosystem P840 reaction center protein PscD MQSTLSRPYTGNEQVRANVAGPWSGNAAHKAEKYFITSAKRDDYGKLQLT ISPASGRRKLLPTKEMIGKVASGEIELYVLTTQPDIGINLQQKVLDNENR YVIDFDNRGVKWTMRDIPVFYDSLRQQLCIEIDRRTYTLNEFFK >Cag_0269 hypothetical protein MERVALTTDEQNLWDQIYFSEKTIQIDHDKPRESIEPAYQLAQSLLKRKV IPQIRMRYFTDPKLNIGGRDKSRKEVFERNGTSGDMILRHPHFHKYLRYF VLGPDIPLTAINEFVVLANDCDPITSGDTKEFCNLAWKQIRNSGQDTKYA AEEYFKLGLELELGEDVAYAIRDTIMRMR >Cag_1722 hypothetical protein MKLDELYQVFLPTFTMVEEEWNNFLAEKNTALSEAQNYLNAIFSITGYNL PTYEEISSTRYLGVYENGAQVTFEGSGIDKLFGGDMTTGTLSLSRIALDS YANNIHMAMLGYNNGITVDLNSGAVSGMFNEYSLTTPWFELSAVGTVAVS GASILSEMNIEAELTELSFTYPDDGVIVKLLGDIDYYQDELGNMEYSGDV YTAYFTGWGADITLNGDFQCDFDINNTFILFSDELTGELYVSELSLVIPS QHVVANFYNSLTGLSYDITSGSLGGSFNSFHFGTSLFDVYAHGSIYVEPV SSSSTLGIHGILDSIEITYPESTFAITVVGDVEYIQIDQGEYTYFGTMSE VYLENPTTTVSVLGDFSGSYNDSNGLHLAGNLYEFHWQREEAFISFVGDI VFGEDQLVVNEVTTLEVYGDGRYYDASSLSVMNIVTDVVGNELLALGEDA NWDISAALDELLWDVINKLNGDAEGVSSVNFDSVPASSTPVNAEFLDFYL DLSKVGEAGYYASFRVGHLYDTNGDGLPDYVDEIHDSPATITWNNGMFTV LSLDDSSTRATGSLAYDGNGNAVGLYAFDRASGDSETTPPTLIAATPSDN AMGIEVESDLSFIFSENVQFGNGTIEIHRGSATGEFVESYNIGTPLSTNL NIVGSTLTINPTSDLASNTHYFVTFSEGSIRDLDGNNYVASQPYDFTTGA DPYPTHTLTGNITFWKTGEAITDVTTTLTTLPTNGTHAIELKNIHVQANG SHTIEVWATTPNSTTGSFECEFALPTGTSVTWQDAAKLPSGWMTTNNVIA TGAFRVASIGTHALAEGAVQLGTLTISQSANPGTFELAMTHAQLGNNDVA GYAISSVSSTTGSGNEYQYHSLTDGHYALTGDKAAGDAGSAVHANDALAA LKMAVELNPNEANANGLLGPVSPFQYLAADINRDGKVRANDALNILKMAV GIESAPTDEWIFVAESVTGKTMDRSHVDWSDISPIVDFNQTAIELDLIGI VKGDVDGSWVMVG >Cag_0750 hypothetical protein MNFGAWDGKHFSNCNHLIANQWKGCFIEGNIDRYRELVATYSENKDVVCL NFFIKYQSRLLLIEFNPTIPNDVIFIQEKSNNVHQGSSLLALIILGKEKG YELVCCTTCNAFFVKKELYSFFNLKSNSIYSLYQPLCDGRIFHGYDSKIF VVGMSKLLWSNISIDSSDFQVLPKSMRYFNDAQ >Cag_1337 conserved hypothetical protein MKKMLSLAALLAAISYATPASAELKIGGDASLRMRNEFNAVDPGNNATDD VMWQSRVRLNASADLGDGYYFKTLIMSEGGAAGWLNTTGNENYALAASQV YFGRNMENCNYKFGRIPLNSFSNPIYDLTLYPAQPTDTPVNNLNFDRLYG ASYGTKMGGGMLNTTLVVLDNSSTTAGTAAYDGMLNDGYALSVAYTTTWG NVTVEPQIFAVLTNANVTTLGNNVTPLTFGLNASGKVGDGKLSGAAFYTS AGDEADYSGYLLRVKGETGPYMAWVDLTSTTNDNAAGATVKDYTNTFVWA QYKYTAYKSAAGSLTLQPTLRYRASSTETAAGTEADTSVLRGEFSATVTF >Cag_0746 hypothetical protein MNFDKDVSQQQLLTNIFIISWAGQHENAIFIANQISFVTNKITIVYSDPN SDFLLDVPCVLIKRPNDLFWGDKFEASLHACKDDFMLVIHADCKCDDWKG LVIRCNEIFSKNKDIGVWAPKIEGTPYYLERTKIASIEYNALSLSLVAQT DGIVFALSLPVVNRMKKINYMNNKYGWGIDWIFCCTAYALNLMVVVDEKH TVIHPLHRGYDTRQAVMEMNTFLKQLTTVEFIQYRLLSSYLKLSDIKTIA KV >Cag_0535 conserved hypothetical protein MKPLPVGIQTFSEIINQDYLYIDKTGLASNLINKYKYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKNLLIYNQWNWDVTYPVIKISFSGGIRDKESLRK NLFYILKDNQKRLNIICEEKEDPNQCFAELIQQAFEKYQKKVVILIDEYD KPILDNIEKIPEALIIRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPDFGNVCGYTQDDVDTIFAPYLEGVDMAQVKRWYNGY NFLGDKVYNPFDILLFIKNQRMFKNYWFETGTPRFLIELIKKNNYFIPKL GKIQVNEFLVNSFNLENLNLETILFQTGYLTIKQLLLSDVGVSYELGFPN KEVQMSFNDYLLHDITTVSEKEPIRHELLAIIKAGDIANLEPIIKRLFAS IAYNNFTNNYIESYEGFYASVLYAYFASLGFDIIAEDITNKGRIDLTLKT FDKTYIFEFKVIAEEPLEQIKRMKYYEKYDGERYIIGIVFDPKERNVSRF AWERV >Cag_0685 conserved hypothetical protein MAVTLKIHELAHQELLDAIAWYNEIQSGLGKRFQETIMLQIQKIKQHPTW FPRETIEVFKAYVPRFPYKIIYSVNDEAITIWAIAHLHRKPSYWQSREKS >Cag_1524 DNA-damage-inducible protein D MQSQEIQQLKEQFDALSHTIPDEDVEFWFARDLMEPLGYTRWENFMTAIK RALESCETTGYAVDDHFRGVTKMIGIGKGGQRPVEDFMLTRYACYLIAQN GDPRKEAIAFAQSYFAILTRKQELLEDRMRLQARLDARERLRESEKTLSQ NIYERGVDDAGFGRIRSKGDAALFGGHTTQAMKERYGITQTRPLADFLPT LTIAAKNLATEMTNHNVSQDDLHGEHAITREHVQNNQSVRTMLSQRGIKP EQLPPEEDIKKLERRIKTEEKQLVKHSGKLPVAKNQD >Cag_1021 conserved hypothetical protein MKQLPVGIQTFNKIIEGDYLYIDKTDIAKNIIEKYQYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKDLFIYNQWNWNVTYPVIKISFSGGIRDKESLRR NLVYVLKDNQKQLNITCEEKDDPNLCFAELIQQASEKYQQKVVILIDEYD KPILDNIENIAEAIIVRDGMRDFYTKIKESDEYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPDFGNVCGYTQNDVDTIFAPYFEGVDMEEVKRWYNGY NFLGDKVYNPYDILLFIKNKYVFDSYWFETGTPRFLIELIKKNNYFIPDF LTLKVKKSIVNSFNLENLNLETILFQAGYLTIKRLISTNKGISYELRFPN KEVQISFNDYLLQELTTISENELICDDLFDLFNNGDIANLEPVIKRLFAS IAYNNFTNNYIESYEGFYASVLYAYFASLGFDMIAEDITNKGRIDLTLKT LDKTYIFEFKVIAEEPLEQIKKMRYYEKYDGERYLIGIVFDPKARNVSRF EWERV >Cag_1210 hypothetical protein MIAISPTELKRNLYKYLEQAQSEQVIIQCKNAETYAIVPTGKTSETDRLF LHQNIKDRLRHSLEQVKEGKTYQLTKAEINSFLGHYDDK >Cag_0502 conserved hypothetical protein MAPKQAKSEESAMPLSTINYIMIACGVVVIAATYWGMALERSVDGFFSLV VSPILLIGSYLWIIVGILYRGKSSSNAKKR >Cag_1535 conserved hypothetical protein MEQPNSDNIVKTAVGVAGGSALLAPALPLALPALPLAMPVIHGLAGIALI GAGVFAVVQAAGAISSLDNPFQPKKPK >Cag_0593 Nucleotidyltransferase substrate binding protein, HI0074 MQQDIRWKQRLQNYSRAIKLLQEVPELDREKLSFLEKEGIIQRFEYTLEL AWKTLKDKMEEDGIILDKISPKMVLKEAYKAKYIDNIELWIEMVNDRNLL SHTYDFETFEEIIIDIQYRYTQLLSDLYINLIESQL >Cag_0385 conserved hypothetical protein MTMRDHTPDFRMHELSQENKALIGSTVKQLLEKLAVDGRLCSEALLEFWV EVAGAQRPRGTYRNGCLMPDSFIYIRDYFRASESGTLLAGESYVKDGTHD LESAWDDMLDELFYQIEIFTSPVSTGKGITLELWAGCRQRPEGDWVYAVD TKVELE >Cag_1277 hypothetical protein MSLTLQKEDAHKLIDQLPMDATWDDLIHEIYVREAIEHGLRDSQTGATKD VHEIRAKYGLPL >Cag_1745 conserved hypothetical protein MEPKRSLKRGAGLLMSLSALSILSLVFMAIWVNWEKPRQAAEPLTPEVKN LVDRIPSTTDALIYIGMKDIRQSRLWQEVIPDSLKQAPLFQPTGELATLL ERSTINPSKDIDTLLISFKRHGYKEQLFLAIASGNLQTKLPKAMQAGNHE TLGGHSCYSFGSSLWFSQLNSRRVVLSNSKELLGNFLQPQGSFLQRDSLT TTLIDKARYKSHLWFALPSAAWTSGALQSLTSSNKDVKSIGNLNRIKHLT LSVNFKDGIEAESEWLYESNQAAYFASTFLWGAIQLPRLSEKNEQTRALL DNIAIQQNLNSVIIHTALPLQIFQTAKEQPAP >Cag_0833 hypothetical protein MLTISTTYDVQSDRLVTIKLPQEVYPGKHELLIVVEQQKKEKRIGTTIAN SIMRFAGTVPAFRSLDGVSFQQSIRMKWE >Cag_1576 hypothetical protein MLVSTSIRINQELYEQAKQDAKLEHRSIAGQIEFWARVGRAALDNPDLPV SFIAESLASLAEPREHATPFIPRSSKQ >Cag_1737 hypothetical protein MFINKDLLEFFGSMMEIKKQKRDIFNALAADVDDPEIRNTLLRIGADEQR HVDQIQQSINLVNSGSTAEPMVPEAAPAPQVAPAPAPPAPTIAPATLQPA IAIAQPAIVRPEPPQPVAPMPVPEPVAAPPTYVQPVQPIAQPITQQVVVS EPVAPTVSQLQQPAPPQPLTYLTPTIATPAAPAEPAYSEPAEQSISSFAS PLSSGTQRYPVQPPTSKTFENMTTLHHPLGEVFGFAATDQSPKAQRYRSH RHCPFNNKSPNCTNSHTENPLGVCSILHNNKAIITCPIRFREDWLITDDA ASFFFEPGVRWSSLTDVRLADANGTSAGNMDVMLVAYDKEGKIIDFGAIQ IQTAHIDGNVREPFECYMKDPKTNAMMDWTRQPNYPEPDFLSAMRTSVVP ELLYKGGILHSWNKKMAIAINKSMFETLPPLTRVKKDEADIAWLLYELEA VNDGEKEAYQLKKSEVVYTAFQPTLLALTAIAPGNVNDFMKFIPELGA >Cag_1388 hypothetical protein MLRNNNKRILWTFLNYLFMTVNELLPSVTTLSHVDKIRLVQIMLEQLAND AVNSAQQKSLSSETFNPRYFFGADHQSKQIIDDYIASSREEWH >Cag_0153 hypothetical protein MTVLPQIIANKIDVKRDFPLLSERELDYINEAKGLFDSGFYSYSLLAIWN AAVNNLKRKVEAYGVELWSSVVKDESGRKKYDKDAETIAERWSNVDDLVL ITGATRLGLLNPKAGKSLEMINWMRNHASPAHDSDNRVEMEDAVGLILLL QKNLFEQPFPDPGHSVSAIFEPIKNKTHTPDELSILRDHISSYKNQDIRN VFGFFMDLLTKGDEPAKTNVTELFPVVWEKANEDLRKTLGVKYHTFVIDP DSDDSPDKGAKTRVFELLVKLDAVNYIPDGTRARVFRRAAEKLAEAKNTG YGWRLEESASRNLAQLGISVPSIAFEYVYQEILAVWCGNYWGRSDSYVTL RPFVDSLNTDQIRLVLKMFRENERVKDELSQSRPNKIAVSLLKEFETKLT IEAHKQELRETIDIVKDI >Cag_0124 conserved hypothetical protein MKFIAIMGHEETRPQVRALFQKYQVHLFSNLSIKGCSCEQKGGEQPTWWP SNEMPTSYTSLCFAILEDEKAEALMTELEKNPIAIEKDFPARAFLMNVER TA >Cag_0710 hypothetical protein MKKSAIKKALCKLDKETIEKQMEALMELVAEPRYICRKCARVASTKRHLC KPVAITNSNGSKKRAAKVLNNGVVPPNALT >Cag_1809 hypothetical protein MPSIAPIIPFLLLFLWLLQGCASDRAPSGGSADTTPLRLLASTPINGTQN FKGNQLQLYFSHEVSSRALLRALRTFPDIGQFELTVNGKRADIQLLDTLQ ANQTYTLLLNRHLNDFRGQLLHAPTTLAFSTGNNVNNGTIRGTVVQYNGT PASNALLLAFASAEKGATVNLLENKPTQIAQCDASGSFAFNHLPHGSYHV VAINDRNHDLAWAPSSEEYATPSQPLMATNSANQLLRLSPPLKSPKPLKI PLEASSAPTNSTIATGSLSGMCTVRGNPPSVIIEAISPSATYYTVAVRKK AGSYTYHFNQLPVGDYTITASIPTASYQPNQAWQWNAGSVAPFVPSDSFT FYPETVTIREEWLTERINITFPTILQ >Cag_1443 conserved hypothetical protein MVKNKKNEVIGREITLYSDKSEDYISLTDMARYRDTERSDYILQNWMRTR STIEFMGLWEQFNNPNFNSIEFDGIKNMAGSNSFSLTPKRWIAATNAVGV VSKTGRYGGTFAHRDIAFEFATWISAEFKFYLIHEFQRIKEQETNRQKLD WNLQRTLAKINYTIHTDAIKERLIPEKLTAKQTSLVYASEADLLNMALFG TTAADWHTENPNAKGNLRDDATLEQLVVLSNLESINAVLIRQGLTQSERL MQLNQIAITQMTSLVKNAHLKKMQ >Cag_0897 conserved hypothetical protein MAKKQSFVDKTKKAGASDFKTAKVIFSVRSEKTNAWRFIEKNVRIPNGEN DQEVISKAIAGFSK >Cag_0405 conserved hypothetical protein MKKAAFLVALAALFGGTQANATDWNWKGDVRYRYQSDLASDPAVTGENSR DRHRTRVRLGVYPWISEELTGGLQFSTAGAGDETTSRNETFGDQFVPDQL YLNEAFINFHPKAFDSKVNIILGKREVANTMVVLSDLVWDGDLTFEGMTL QYGKDENGKNKDGWNAMLGYYPLNEINDLKEVKAQDAYLLAGQVAYKGKT SAVTYHMGAGYYDYTHFDVSNKKVNASQLAAAPYTYTAAKSATYSPEYDY TGKDFNIIELFGTVGGKLTENTPWTLTLQYAFNTAKQDAKHINIDDDERT SYLAGVKIGDAKNVGQWAVGADYVRIEKDAMTVLTDSDRNGGTATNLEGM KLGVTYHMVKNMTVGATYFNFNTIDNDATAVDESATKRHTLMLDTVVKF >Cag_0854 hypothetical protein MLCACKLCLMHKEWLVESEMSGKAYNKMSYQPFLFIYSFFFSGNNCKFAN FTCILILNAFFSVICNGSAAAQA >Cag_1133 conserved hypothetical protein MRKKILFICGSMNQTTQMHQISEWLGDYDHFFCPFFSDGLLGVATKLGLL EFTIMGKKRSSKALEYLHSHHLQVDESGAAHPYDLVVTCTDLIVPKIFQQ RKMVLVQEGMTEPETLFYHLARNVKWIPRWIAGTSTTGLSDAYQKFCVAS EGYRQLFIRKGVNPNKIEVTSIPNFDNCAHYLQNDFPYKDYVLVCTSDNR ETFIYENRRKNIEKYVAMAAGRQLIFKLHPNENVERATREIKQYAPGSLV FSEGKTEEMIANCAMMIAQFSSTIFVGSALGKEVHCGLPTDELKALTPLQ NNSAAKNIADVCREVVEQ >Cag_0969 conserved hypothetical protein MGINYLLTNNHKSLQNMASYSQSHENLSGVARLFLVVSASVIAVASIAGG AGTLLQSELLLTLHPYLFFIGFGNLAILILNRYLTAAIYPELTIDPARQR SYIALVLLALGMITIAVALKLPLLKAATGLLLMAVVSVPLREIFSKLSIP AIWKEVSVRYYIFDVIFLMVANLGLFTLGLKEAFPDFSIIPFFVTQSSYF LGSSFPLSISVMGFLYAYAWSRSPKRELAKQLFSLWFYIFVGGVLFFLVV ILIGHYWSMMLISHFLMFGVMAMLASFAVYLNNFFHSKFHHPALAFLLSG LSLLFATSGYGIMNIYFMQGITFGTRPPLPFEQMWIYHSHTHAALVGWIS FSFMGMMYIVIPSILRSGSLETLRSDNALSALLDAESMKRAFAQLTIMVL AAMAMLLAFFLQQQLILGVAGVLFGVAVAYVMLNMHSSR >Cag_0574 hypothetical protein MNTTKNEVATLLQTLSDDVSFDEIHYHLYVLEKVNRGIKRAETEGAISHE DAKKRLSKWLLD >Cag_0673 putative glycosyltransferase MEHYVTLFDSLFLPQGMALHISMERHIKDYTLWILCVDDEAYDVLTKLQL ANVRLLQLSTLETEELLRVKPTRSKGEYCWTLTPFAPRFVFEADATVHRV TYLDADLWFRKHPKPIFDEFEASGKHVLITDHAYAPEYDQSATSGQYCVQ FMTFSRHAGEEVRKWWEERCIEWCYARHEDGKFGDQKYLDDWPDRFANSV HVLANKEYALAPWNATRFPYSAAIFYHFHGLRIFKKRKKYYVFNGTYFIP KPTYRYIYKLYLNDLQGSIFLFLKMGAILRNQKNMWFGYNFFTFLKVLYS KLFRLNVYASLCYVYLKPNLKAVNYNN >Cag_1686 hypothetical protein MKELSLLLRQIHPNFVQDGHLSSQAFRPTPKDEQQLSVYDGDMILPLDAW EHYNNILGLTSCGVMAVNVAECTVLELPVMSDPQPFPEHVLIDFSAYNKR EIEKKAKLLKAKAEVRGWLYKKAQL >Cag_1455 conserved hypothetical protein MSVKPVDLNALRATHGNLYETVVAMSRRARKLHEEERSELEERLLPYKEM IRNPASEAESDKVFPEQIAISLDFEVRQKASHRAVADYFDGKYDYMVEKP VEKKIVLPTNDDDEADGH >Cag_1236 hypothetical protein MKKIYTTLRQVVKATAFLGMLATTSPVQAQEVTYNTEGWYGTAALSKIIN TESSGMQANLGSGVIRPGEIDYNGNFVGMLAVGHENSFCRKNNTPIYLRT EGDYLMGSADRKSATVDQYHTVLDDSVDFRALFANALLGIKDTQHTRWWL GGGIGYGWVDRPAITGCSSTCSFAAATTDGFAWQLKAVVERTISKDAALF AEARYVALPGESNSTSQCYDDINVATLGIGFRSYF >Cag_1926 conserved hypothetical protein MKRLTIPTALLTVTIAIASPLYAAAPPTTQELIAQAEATRKEAAAIGYEW RNTAQLIKQANDALTANNEPEAQKLASAALLEAEQAVKQGKWMQANWQTL IPTL >Cag_0674 hypothetical protein MNYMVNTRSFYYYSLLYALIAHFGFLGYGIDVYDAYSIAYGWGIGTFEPI GWYLSTFRLYSANDIYLGVFFVSLIVSSGLIYASFYFLGNENKFSKIEII FIIFFMHFVHVTVFSSVNALRQGLAMSFFMFGIVKLLSGSFRKTLFLFLL SILCHNAVLFIIVPLLTLINVNKKLLQLGIGFIFIVLTPFALNLGVAEKT QVSTTLNYSIIYFILFFAYSFFYWQSFSNSTKFKFSEVRRRYQFGFFVVM LMMLFLLNRESHLQRMVMFIMIPLVYEIFVFLPAINPLRNVIVFSFLLLW VVITLTSSAFSSFREYSTMPL >Cag_1530 hypothetical protein MEKIESIKTLYQQSHNNLETLYNVLSQKAFKGELDLSRVALVVEEKQKNL >Cag_0739 hypothetical protein MKKIYTKLQKTVTATTIGALLFIAPKVQAQEVTYNTEGWYGTAALSKIID TESSGMQANLGSGVIRPGEIDYNGNFAGALAVGHENSFCRKNNTPIYLRT EGEYLMGSADRKSATVDQYHAVLDDSVDFRALFANALLGIEDTQHTRWWL GGGIGYGWVDRPAITGCSSTCSFAAATTDGFAWQLKAVVERTISKDAALF AEARYVALPGESNSTSQCYDDINVATLGIGFRSYF >Cag_1045 hypothetical protein MRKKALSIALASLLLLGTSLPIAAWLALPRYVEPLLQRALIGKPVQIAIK DVRPSLHGVAFSSLQATITTPPDECNNYERTIYHVTIKNGTIGWLITDLS ASHRSPFIPSLLDVKLHLQADTLHLQPTPNTFAFSDSQPEITVNLKLFRN EKQVLSVVPLDAAYAIHDGTVTREQMRFEGIAYNVAVSSSNKWQQLPDSL FVARMVNEGKVQPVGNFRAIVGSKGDPLHPCRITLSNCSAEIVDWNASSP FVHFDRKTKAGDLTLCINDFPLQSLSSIALQAAQQQPKAPSRLAAKAPLP PMVAGKINATIPLSFRDSTIVIRNASVIAKAGAKVVLYNKQQQPMLFVVA NKSGMDERIVDKLYVTATLNHAGKTTQSVALQNLSATIFDGSIRSTPLTV KTDGSSPLDVTVTFDNLKLFDHLILPDNEQSSFQGALSGKLPIRYAKNQL TIRNASLLASEGTQVKLVTKEQKPLVTIIAGKKGGKETVLDKLNVRARFN QTPNQTASITLQEFSTTLFGGSVNVTPLTFKTDASSPLVATVTLDKVKLF EHLILPANLHGSLYGDLSGKVPLTYQNDQLSISNATLRSSGGGSFTLNNA QQSSNNNLSRSDQQTTYAFSEPALTFSHLANGATTVDFTLNEFRQKSGSN DFKFGNPKGTIHFAENPREPDVMRLSNFSTNFFGGKIALNEFVYDIKKQE GETIVQLSNMPLQKLLDLQGTKKVYATGALKGNIPIKLKKGTVEIPDGAL LAQESGQIIYATSPEERAAAHQSLRTTYEVLSNFLYQQLSTSLTMTPDGQ STFAIRLKGTNPDMYGARPVELNLNVQQNLLDLMRTLSISSEIEQAISDK TTQQQKK >Cag_0659 hypothetical protein MEQSATKSFDGERRIMYAEKLIVETDLSGMLKKVPKLPPNKQLEAIFLVL SESSAKVAVVRTPHPDIAGKVIIKGDIINCATSSDWDLPQ >Cag_1465 hypothetical protein MLYPISSIPNVLTQERMLKFLLLLVVTFLAIRLVFRLLRNGIFLFKSQNS VNPYPKSSPFQRGQRVEEADFEVIETQLGESEKRRDVA >Cag_1285 hypothetical protein MVFEISFHLFIISKLMKLTNYQNNQISIAMKKLFAFLFLLSSVSFVGCAK KAEEAPVEEPAAVEAPAAPAAEAPAAEAPAAPAAEAPAAEAPAAPAK >Cag_1374 hypothetical protein METTMQTNANNNYTTDSIIREVRCLKEDNAAEYGFDIRMIAAAVQLKQRQ HPERIVTRILSDVEQKYGKQPLTRLVTENEL >Cag_1721 conserved hypothetical protein MKPLPVGIQTFSEIIKQDYLYIDKTSLANELIKRYKYVFLSRPRRFGKSL FLDTLKNIFEGKQELFKELLIYKQWNWNVTHPVIKISFSGGIRDKESLRD NLFYILKDNQERLNINCEEKNNQNLCFAELIKKVYQKYQQKVVILIDEYD KPILDNIENIPEALIVRDGMRDFYSKIKESDEYLRFVFLTGVTKFSKVSL FSGLNNLEDISLNPDFGNVCGYTQHDVDTIFAPYFEGVDMEEVKRWYNGY NFLEDKVYNPFDILLFIKNQRMFKNYWFETGTPRFLIELIKKNNYFIPKL NKLKVNESLVNSFNLENLNLETILFQAGYLTIKRLLPSGMGVGYELGFPN KEVQISFNDYILQVMTIVSDKEPIRYELFDIINNGDVANLEPIITRLFAS IAYNNFTNNYIESYEGFYASILYAYFASLGFDIIAEDLTNNGRIDLTLKN YEKTYLFEFKVSNQEPLEQIKKMKYYEKYDGERYLIGIVFDPKARNVSQF VWEKV >Cag_1924 sulfur oxidation protein SoxZ MRVKATLQNNVVSVKMLLQHVMETGRRKDEAGALVPAHYITEVTATHKGE TVFHAELGAGVSQNPYLSFQFTGASAGESLTISWVDSKGMSETADSVISA V >Cag_0582 hypothetical protein MKSGSVYQVHDRVRFRVRERKPEAVVMRDGRYFKLIIDGFDEPLICVQIV EPGRRSSSGATTSNVIHSYIDGDFEGWEGETIFKLDNGQIWQQSSYAYMY HYAYHPEVMIINDGGTWKMKVEDVDEMIEVTRLK >Cag_0735 hypothetical protein MGQFLAIGLVTQIGVLKKELAAAQLTTDQLQERMKAELPYNPELYLLHEH TDYYSFDLRDEIFYAQLLPLLEEFYPSFYNSPEMYESILAKLRKLPPSEW FAWAKRKPEEAFQFDPYGMRETIEEGFTDISLHYEAILLTMNGKIVMEAY GSLFRFLNYTMKQTFKQYSLASALRLYITG >Cag_1401 conserved hypothetical protein MKIYTKEELIQSLKIIAAQGWIENARHGNHGGIGNTLEDLLGIAENNLPI PNAAEWELKAQRLNTSSLITLFHIEPSPRAIKFVSQVLLPNYGWKHQQAG KKYPENEMSFRQTIHGLSTSDRGFQVNIDRKNQKVVISFDWNCVAEKHHK WLQSVKNRIGLEQLNPQPYWGFDDLSNKAGTKLLNCFYVQAEVKKEAGKE FYKFSKVMMLQKFNFDGFLSQIEQGNILVDFDARTGHNHGTKFRMRQNCL PTLYEKMTIIV >Cag_1050 hypothetical protein MTIAELQEQPLAERLMLMEELWETLCNEKHHIQSPAWHQEILEERINLIN SGEAEYLSIEELYVLPKPREIRKGFPSLNYSRATFLSFPRRRESRIV >Cag_1207 conserved hypothetical protein MLARIQSLYLFVVALLAVASMALPIWSFNATPQLIVRDLASAPLDNALYN LASTAGMVLSPLTAIVAGAAIFLFTNRALQTKLIMLAMLLFAGDLVAALA AAHMMNEHFVALGNVVVHQPQAGLFILLPEPLLLFLALKGVKTDDKIANA YKRL >Cag_1428 conserved hypothetical protein MRAIIVTLPQKIIPYIPMRRITVLLVFILTTIIAGTSLQAATTPFTGSMD MALTMPNGRGTVTYLFGKGAQRMDMSVQMENIPSLLRTTVLTQANQPDNA TIINHQTKSYSQVNLTHAAQSALLMDFNSVYRVTRLGRTTLRGYNCEHLR LQSASETVELWVTGDLGNFSTFQILQAQNPRLATTQLAAAFRNNNIEGFP VKMVQEVQKQRYSMELLKLTKKAIAASQFRVPAGYKRVDASEPTLNSEQK QQLKNLMEKMKQVE >Cag_1861 hypothetical protein MGLFIKEFKMDKAILFNELLDAVDHLSLDDQESLIDVVRHRIAECHRQEI FSLISSARKEYQQSKLSPETPQDIMNSILS >Cag_1113 hypothetical protein MKNNTGLWIDHKTAILVNIKGDYTHVQHVESNAESNLKPSGGWKANGSVV AQAVANEHTADERRKHQYHTYYQKVIALLANSTEIALFGPGEAKIELAKE IEKNSDMHKKVSIVETCERMTENQLIAKIKSSFSAKS >Cag_1181 hypothetical protein MQPFEHIPKIIAVKALDAQHLMVTFEGNIIKRYDCTTLLAMQEFKLLKTY AFFKAAQVDAGGYGIAWNDGMDVSGYELWKNGVLQ >Cag_0655 hypothetical protein MTMLREIIKPTTDFYSVHIPKKYINQEVEILVLPFSYKNRQEIEDNVSCD VFSKTSGILKPKNIDPLQWQEEIRNDREI >Cag_1406 hypothetical protein MEMQENNIRQLRLHFDGLATVEHKLPASLLVQALSKFQRVVHLIAMADEG REVLQRARITREIERRFPLICEVPQKGGYALPITIGGEADQLFDEQACEN IAKKTREVIVAIDRSDVKELGNIIPDMFYRRSILEELKAMQPASHSCFFI DIEDCYNQPILNGSTATEKIKTLLMPPTNETSSSDFGYVTGALIEMKFNE RRLVMKLLGSNKQLSVTYAEDFEPMLLDNPRELIQIHGNIVWNDDGLPQS ISDVDEVVAIDETPLDIHVVEFDTIFLQPKKTLQSEVVFDRESALFQASG PFDIYLCAATRAELEEQLYNELAMLWQEYAKPPSSDLTLDAQELQKELLY AFEEVIRGI >Cag_0522 conserved hypothetical protein MADETKTTGQGGVKGDFATILVGVGTILDNTIEPLSKILVQTLDSLTVVA KQILEGVNSSLGCKK >Cag_0577 hypothetical protein MILLDLGKPYPLDDVFKAAARKHQSHYRATELNVGYSDKYGTKLNEDDAK KLLNYYDSLNVREELQNRFQKGDEYSFSLKRDGDLLRSEHIPFNLFAPLC ADTKLAQNLIKNVFGLDCAKNLSIKFEYAPKPKGKYLDDATAFDAFFKFD DNNGKRIGIGAEVKYTEKSYPIGKKEKKYVHDPKSCYWKVSCKSGAFLEP SYSPSSALITDELRQIWRNHLLGLAMCQQNELDDFYSITLHPAGNHHFQR VKPNQGVIPEYQAQLTDSYRSKVFGRTYEEYIAAIDGDSEILKWKQYLHD RYIVKNTDDQPQ >Cag_0031 hypothetical protein MNETIHHQILEQKPSEKINLVTIILESLDKPEPEIQKIWVDESQKRFDAF KAGKIKLYTY >Cag_1308 conserved hypothetical protein MPLSRSYKETIQDRAQHDPEFRVALFDEAINALLEGETNVGKALLRDLVH TTVGFEGLASELAKSSKSLHRMLAPSGNPSMENLFQIINAVKKHAGISVQ VASSCIQQNTQQIVA >Cag_1067 hypothetical protein MVKKQKYMPEISRFLGIIISMYFDEHNPPHIHVQYNEYRAAMDIYDFNII AGSLPAKVRGLVAEWMELHSEELLKMWETKEFHRITPLV >Cag_0279 conserved hypothetical protein MNNVQLYTEISLLPASLKQEVKDFVDFLKTKSQSKSKITEREFGCAKGLF TIHDDFDEPLDDFKEYM >Cag_1177 conserved hypothetical protein MKRKIYVETSVISYLTARPSKTILGAAHQQLTLTWWEKRSDYDLYVSQAV WQECAAGHPEAAERRLSVLAELDILVVTEPMITLANTLVEQGIIPTKAIE DALHIAIATLHHVDFLLTWNCRHIANPIIQEKISLYLEQQGLYLPIICTP EELIGEKNDD >Cag_0274 hypothetical protein MQRTIEDITSELIGLPKNERLEIVRFLLFLDNRSSDNNDTDSVWEHEIAD RVLAVEDGTAIGIDYEEAMKKINAQFAS >Cag_1971 conserved hypothetical protein MNKKTWQGIFLLVLFVIAPNFLQNSIVRAEKKKIILKQAEIVEGGENAKG SFRRLSGSVELSDGSITLRCNRATEYEASRSIVLEGKVMIADQRAEVYAD GGTYYPDKEIGDLNGKVRLRTLDGALVAIANTAHLNHAANQITLYGNVVA WHEAQQVSGNEMVITLRASSGKQEHQVEKVDIRGNAFLAAKDTLSKPIAV YNQFSARRMTMHFNEASLLQNALLQGQSESLWHLYSEENRPSAIHYSSGN TMQLAFREGALYTMKVSGHCEGKHYPASFWENKKINLPFFVWREKEYPFP KKK >Cag_0984 hypothetical protein MLCIPASYVFQMPTVLIVSASPLDQDRLRLNAEFRDIRHALQRSRNREDW VIESNEAVTVDDLRRALLDFRPTIVHFSGHGGGLDGLCFESTEGRTNSAD AESLAKLFHHFKDDLKCVVLNACYSKVQGDVIRQEIDYVVGMSSAVEDKS AANFAVAFYDAVFAGTDFRTAFDLGCTALDLNNLPDADVPIFMTGSHLKP TDLHDSAYIAEIEKVLYSYINTPFSERWRYTTTGELLRAVMEKHYAGNMH RLVPKVSVISMKQIADEHWVVAVDVVSSLMYMRIKNRSVSVEWEASVGLW SVPVKTYLALGSREPLLARVNAELDTYYNFEFANEQHRFQSVSLCAVSGP MLHGYVERGSKVYEELIDILSDGNEHAITLEIEQATDHTDMPLIKRVLSR TWICSQPQDTKNA >Cag_1000 conserved hypothetical protein MGFNPRAREGRDFEPICLARKPLSVSIHAPAKGATLTGSHCVLSGDGFNP RAREGRDLAFPCSISITCKFQSTRPRRARRSCSKRIWSALSFNPRAREGR DPIVPRPFRGFSRFNPRAREGRDRLEGL >Cag_0771 hypothetical protein MKKRTLFFAALCTVGLTMPLSNAHAEWTLHINSRNEHPPTLVNNATIQPD SRALTMSSQPPIAVAPPAPVFITPQPVAIAPPPPVVVAAPRPNYQVVVYE NSYYNRRPDGWYRSYHPQGTWVRVQQRHLPPRFAVAPRAPEPRFAPPHRR FDDRRGVELRVRY >Cag_0496 conserved hypothetical protein MGKRQIIYQADRIRGNQELLNREINLVTREARVWHGRITAISSNDVELKN SRMGKHRFNIDQIESIYCDITTDY >Cag_1323 conserved hypothetical protein MLAHAQQIYDEALTLSPIEKVELIEHLYFSLDSKNSRQELDKLWAEEAED RLTAYENGEIKTTPASEVFAEINSMRPQ >Cag_1259 hypothetical protein MSQKHEAWQIDVALANQAAFRHFKKRHEREYISCFNNLNKIKRLLEEGKK LSELHYHPSFFRHETDGIFRIGQSGVSGAKESRLYIYPDNQHRIIYILEI GTKETQQADIAAAQKAIQQIFLR >Cag_1001 conserved hypothetical protein MRPFSSSFNPRAREGRDGTHVRNLQGDSCFNPRAREGRDVPLLFIKRIHR VSIHAPAKGATVSARDFGYTVLVSIHAPAKGATNVGTACRKHRS >Cag_1269 hypothetical protein MGATHVTVTIRNPANPEKFWEGLFLVDSGAIDSLVPRDALESIGLKPKAQ RSYELADGTEIKMDITTGDIEFMGEIVGGTIIFGASDTEPILGVTALESV GIDIDPRNQQLKRMPSTRLKKLKPIASC >Cag_1610 hypothetical protein MKTEMHPFERILSVSTEDIELELRGITEINWADFWLNPRKLRGSDFLMRW SQGVWSEKRLIDAVNNTGEFYAIAYGPSGTAPTDDVRAFELYFERLEAAG LGNIKRPDLLVFKISDREFVDDFLSKNGGEEELPFITEDKLQALVQKAII AVECENSLWVAEKMPDYRTPMKAQRRLGGKMGLAKNAVLPTVIIKEEDRI PLNKWQEENRIPIHVWHVFFDRAYGLSFDEAQRLVTEGLILPTEQVFQAP GGATTKKAIYKYYYHYAYPLGVASERPQLIPAFIEDKNGHILPYVKFEGG SLTIADEAINVLNQL >Cag_1857 conserved hypothetical protein MKKQILLFSAIFSGYTFLLLLLYFPLVFQSQVLTAPDSLIPQASSMALDK LQAESGSYPLWQPWIFSGMPTVEAFSYLSGLYYPNLLFNLFHTDGVLLQL LHLAFAGAGTFLLLRDLRLSLLASIAGGLIFLCNPFFSAMLVHGHGSQLM TTAYMPWMLWAAMRFMDRGGVAEAGIFALIAGLQLQRAHVQMAYYSWLMM LLLVVVLFATRRWVVPQAVQRGGLFVIASVTAIAMAAAIYLPASHYAEAS VRGAAVGGGGAAWEYATLWSLHPLEAITFLFPGFFGFGGVTYWGFMPFTD FPHYAGLVVLLLALMGLIMRRREPMTWLFAGVGFLALLLAFGRFFSPIFD LFYSFAPLFSRFRVPSMALIMLYFALAALAAIGLHELLERKPQRLLKVLR LSSIVVALLLLIFLALEEVAEHAARSLFPLPQVDSFELVSAINSIRWEQL SSSVIVTLTLLLLVAGVLWLLLSGKISSKYSASLLVLLAVGDLLWVTVQV IYPSAHSLRTPLFADKQQVAPAFQHDDVTRFLASQPKPFRIYPAGNFFTE NKFALFGIESVGGYHPAKLKSYDDLLQVSDNLASIALLRMLNVHYIVSPA PIEHPTLTLATSGTLQRANGSAQAFVYRLQEPAPRAWFVSRVVPFSNKQE LYSHLLDDTASLSVAYVEAQQWQGAQRFSEGTIQSVTTQPESIKLNVNAP NSSFLVLSEIYYPNGWQVMLDGKATSMLRVNGVLRGVNVPAGNHAIHFSY NRHLFEQSQWIALAGFIIALLMIAGGLLWKHLLLSGEKRVVRGFHTIR >Cag_0786 hypothetical protein MGSLTVKDYVMLKMAFSSKQHAFLAGFGSLFDFTGHKLNAKHFGNQLTDR SALQADWYAISNDIHKASNAVVTEMANTKATRNASK >Cag_0396 conserved hypothetical protein MKPDSKVVLINAVVIISFGLLSAWHNSRDTTPSVTPVTQEVSVEPTETSP SLTPNVPVTTTPEPAAPAEVVPAKPSVSVTPTKPKSSHVLKPRVLIAKKP RPAASAEVVLAEPSVSVTPAKPTASPVPEPQVSVPAKPEPAAPAVAVPAE PSVSVTPPTPTTANVSEPQVPAPTTSQPAQ >Cag_0323 conserved hypothetical protein MIDSSLTQKLMASLDCDEKEALRLLKNCGAAMVHYILASKKIAIKGLGVL TVRHIPLKKERQASGVTFVPPSNNLVYERREVGEGDIARLAISALSLSEH QAIRFSEVLASYFTAAFTAKQEVALPALGAFYADADGLYGFHVAPSFTAL LNREYCDLADIVVPVGNRWGLWQERFRALRPAFITVGAVGVLFTASLLLY RWFSEHPLQIVVPSTLSSAKAVKQSLHAVAATASSMLESSTERPVTVLPT TPSFADSLQLERGAYAVVLATFQTERTAYEQVAVMRQAGIEAFVWPVFME GSRYSRIMTGMFTTREAAEAHLKMLPEAFIKGAYVQKAKRNVVLYAKKRV >Cag_0270 conserved hypothetical protein MPTISMFYGIIIRMYFVPTEHPPPHFHVYYAEHTATVDIRICEVIQGHLP KKQTKLVLAWAELHQEDLMADWELVMNGEEPFKIQPLQ >Cag_0686 conserved hypothetical protein MINNLFLTNQRDNYDSPWKEAIEHYFLEFMAFFFPAAYASIDWSKPYHFL NEELRAIIPDAEVSNRVVDKLVQVQLLDGMESWLYIHIEVQSFWEVNFPE RIFVYFYRIYDKYGKAVANFVVLADQHSNWRPTSYTMETIGSKLSLDFSV VKLLDFEPRLQELLVSDNVFGVITAAHLLTQKTKNKVKQRYEAKKLLMQL LLQRQWEQERINELIRVIDWLLKLPKELRQKLKAEIHNMEEEQKMKYVTS FERDAMEEGREMGLVEGMEKGKAEGLEEGLLKGRLEVAERLVASGMSKAE AALLAGVSVEML >Cag_1909 conserved hypothetical protein MVAVQHKQNAQKQSKPFAFWLSVATLSHFALLLALLLYQQFSNRQQEPPP VVNVMLVSLPGRVGSAAVPAPTLEAPQQVPAEQAKEASVVKGSPSAVPTK VPVATPPASTTKKVPEAQPVVDRQQQMNQALERLKQKVGKSASPSVTASP SAAPSPLAPSSSNSLTNALAKLQAKVKASGQATTSAPSTTSPTTSPTAKS GGNIAAASRTTGSGSGSGSPASYKAEVASIIQNNWAFSNPMLRGEGMEAY VRIHVLPNGTISQIVFDRRAASEYLNNSIKRALEKSSPLPVIPQEAGGRD MWIGFLFSPEGIER >Cag_1690 hypothetical protein MGRGLVAKFVITILFSQRDVVLRFSLGSSIISCCKSVYKYSALNGNQQFF ESSLSVPDSGKAIFSRILNDNKINMKTIGAC >Cag_0993 hypothetical protein MGISSRDVSKLNKTIQHTYNGGSTMTREEIIKRLDEIFQQKDIRHGVEVR KLHVELFGVEPVFTGYYYECEKGALEWMIESMLDGKPFVEPELEDGEFT >Cag_1368 hypothetical protein MVFQRSMKVFTTFALFAGMMMASSNLHAVTVDDSIHEKACSVVAGERTVT LSIDPKPVKHMKELTFTVSVTPCDKLPDMLLLDLSMPGMQMGKNQVTLKK ISSCKWQGNGIIVRCMSGRKLWQATVLSNELNNPAFAFNVRD >Cag_0718 hypothetical protein MIKPRSRNVAPVTPSLDDFIRQPEQPAARELEPNASRKFKTVSLPMNEYE YSQLHATCKKTGRSEKNLLRYAMMLYAKEVLAE >Cag_1573 hypothetical protein MNRMTSSQQNPEPNATCPICKASYHCARSSSCWCSTRKVPQQLSDYLADK YKSCICPDCLDSMIAEANAGKQFC >Cag_1335 conserved hypothetical protein MIKPFLQRTLPLLATLPFFATPTAQAATPLHAAKSAHFAPLLADPLEPRV AVEPFLGEKSLQLDIGTTEELYRNDKGTFAAGVDFATWSLLRRSNNFKFP VDAIDYLFGVNASWKMPLQNSSLPFDDFNVRARLSHISAHFEDGHYQNGQ WLQQAEWQGTIPFVYSREFVNVVLALSAPEHRIYTGYQYLYNALPSGINP HSWQAGVEIATTNTTYVAADMKLLPIWQTKQAETEGFRASWNFQAGMRLK GKQADKVRLVANYYTGMSRHGMYFYHPENYSTIGAIIDF >Cag_0690 conserved hypothetical protein MKLILKEYLSSLRERGELDAIFPDLLSQLGLNVYSRPGRGTRQDGVDVGA VGRIDGGLEKVYLFSIKPGDLTRKDWDGDSVQSLRPSLNEILDAYIPNRL PAEHRGKDIVICIGIGGDVQEQVRPQLTGFITKNTTTKITFEEWNGDKIA SFIQSCFLREDLLPKGARSCLRKSLALLDESESSYRYFAELISSLSAGAD ELKNSERITAIRQMGICLWILFAWSREAENMESAYLSSELTLLHGWDIIK RYAEKTGKTAQAVETAFFSIFSAYQQISSEFLLKNVLPHVGKLHGLSSAV HSSCAFDINLKLFDLLGRLATHGIWAYWITSRFSDEQAEVKKKSLEETLK LMKSIKELISNNPVLLLPAKDDQAIDIFIATSLLAFNKENYNYINEWFAE ILGRASFAYQTHGNYPCILNSYTELLSHPKSGDDEYRKTVTSGSVLYPVI ALWTALLGNEEMYNNVAQFKQAHLSHCNFQFWYPDEYSEAHFWKNSDSHG AVLSHVPVDRPKEEFLGQVFGECDQSPHYKDLSAIKFGWWPLVIVACRHY RLPLPLQLLEGLWKT >Cag_1038 hypothetical protein MKKTRWLIAGLMGVILGSLPLSAQEGFCMSPKKSDTSIVINSQQSFIELP DQGFSVSVGSPYDIINYDNRYYIYQDGSWYRSSNYRGTWTVIRDSDLPDR IRRHRPEDIRRLRDNESRRYENDNRQYRRDENNRK >Cag_0268 conserved hypothetical protein MNIYTERLPHRYRQAMFRDEMKIEELVEVMLRVAWCYPKVLSATCTKLGV TGSIGFFKNLAGWAFGK >Cag_1318 hypothetical protein MLQAKVSLSPPLYEFLKNYKDFGFKDKSSMVQNALERLKNELEVLKMQQS AQWYAELYDQDLETQELTESAITGWPE >Cag_1227 conserved hypothetical protein MIRLHVTAEGQKYMEHDQPIKNLLQMVGEQNPELINDGWETAPSKRIINE IPEYDKVSSGVLVTEKIGLSILRKKCRHFHEWLIRLEQLGETM >Cag_0880 conserved hypothetical protein MLHCMKIYLDVCCLNRPFDDQTQDKIHLESEAVLTIIRHLEKKDWEWISS SVVLYEVQKIPNRDRKQRILRLCDKSSEVILLNKEIYRFAEILNKKGIAS YDALHLACAHFANVDFFLSTDERLIKKAQKNIDIFNMVIDNPLYWLQTIW >Cag_0189 conserved hypothetical protein MKQKALLFFYGAALALLLVLFVVVSQQFLSLFAFLSALHPYVGMGFLALS GIILLFTLVTALLFFARPSEPSLPDNDVSPAMAAYVRYRVARVPTHPKHP EGSNAPKDQRWLRTNLKLLDGDAMEITREIATKNFFVGAFAQNTSYGTTT SLLNNIRMLWRIYTLHYRQHHFREFVALARDVYETLPLSDFRKEELPEHI KPIIQCSFSNTLASLLPGGNLLTPFFMNLFLSGSTNSYITCLTGIAATRY VQASTQEERHEVMQQSMFEASFMLKEVVRECNPILSVTISKAVKKAGMDS LDTMQQPSASSGVAQDIVAHLANSLRTILRDDG >Cag_0790 hypothetical protein MREIIMNKKIALALLGLALPLSAQAVEFRTPGTALGIGGAGVARNNGGLT SYWNPAAGAFKDSPFAVGAGVGAGLKINNGLAENVDNLSKLDFDDITKFN NSVDDVGNFTKAVTIMDDISKSGGNIGITGQVPIGVSINQFSFGIYGNMS GYIMPIADITNIVPTANAGGANITVNDLNTSLGANTYTPSGYFTTAQLAA LSAAITANQTGALPAGAADNLANAIDNQLKESGIPADQALATLTTTALPV LNAASANTFNQNTTSVLTKAIQYVEIPVSYGHPIKLGKKSTLGVGITGKV ISGTVYQSQVLLVNNNNVDASDIIEDIDTNKKTSSAFGIDLGLLYKYDKW LNVGLVAKNINSPEFDAPDYNAPKYDTISGQVLINDLKKGDAVKLKPQVR AGVSADVLPIVNVSADLDITENETVAPSVVGLTAPKSQNLGGGVEVHPAS WLKIRGGAYKNLSASKGGTVLTAGFKIFMLDVDGAFATDTMEFDGNEIPQ EAQVNASLNFSF >Cag_0134 hypothetical protein MVERIQNWIKTMLGLTAMNTPAHPNQDLPTWLRWLSSLLGVGLSIGSLVM LYCPPEKASRELDGKGTVIKVLLESTDVTTPFLSIFLAGVALVVFGINGI RFAKITAAGVSAEAPDATAAATNYYKAPSEDRPQTEVQVAEKESPDPTDV PAGYLEAEDGGKYAVYKLNEVPSSVITDALASWPTEDSKPEDLSGFEFAT RKTGKGNHPWTLKFKGKKAVIVSYGGFAKPGATVSHPE >Cag_1727 conserved hypothetical protein MNRFYAGDKIIYRKPKSSFSPGPRARDIYPLAHGEAYHYIVDKYWKVEKV YADGTLEVVTRTGKTNRLQANDPNIHKAHLLQRLFYKKRFPSSNVAAQS >Cag_1200 conserved hypothetical protein MCNSFSFVAYYKYSDISSMTTNNRIKLKSTLACHTQGTVDLASWLEQHGI SYGLQKHYRKSGWMESVGTGALKRPGEEVTWQGALYTLQTQAKLPLHAGA LTALALQGFAHYVPLGKQTVYLFSPIKTLLPAWFRNYDWPQLILHEKTSF LPNETGITDLKLPLFSLCISSPERAILECLYLSPDTLNLVECYQIMEGLT TLRPQMVQDLLEQCRSIKVKRLFLYMAEKAGHEWYKRLDHTKLDLGKGAR SVIKGGVYVEKYSLNLPEELVKL >Cag_1724 conserved hypothetical protein MQTIYADGVANIALIDGIIRFDLVNITKMEKENVNLRPVAPVAMSVTGLL RMHDQLSQAINKMVEDGILKKNEQPPVVIDGGQ >Cag_0446 conserved hypothetical protein MKKQATITTEELDDKFDAGEDISQYLDWSQSQRPLLDHKRINVDLPQWML NSLDFEAKRVGVKRQAIVKMWLSERIKAEQVAAGNAVR >Cag_0610 conserved hypothetical protein MKKTAKLLSLAVALFAGVSGTAQAEGFKLGADVVSSYVWRGTQVTTSPAI QPALSYTFKNSDIVVGAWGSYAISEHTGAVANQETDVYVTVPVGPVSVTL TDYYNQTATSRTFDFSDDSNNIVELSVAYAKDNVSLMGAMNVAGTDTDNA MYLEAGYKFYEKDGYTAKACLGAGNEAYTSDQDFTLVNTGISVSKDRYTA SCIYNPDTEASSFVFMASF >Cag_0714 hypothetical protein MATETQRAFDSELDSEAAELHLAQLIHEAGEEVRRCWAKRQELHMEKLHA TVAESQATLNKLLQNDRC >Cag_1995 hypothetical protein MQVTIQLLSSAINDLLDARRFYEQQRNGLGAYFFDSIFADIDKLTLYAGC HPKYFGYYRMLAKKFPYAIYYKMNDTSVAVVWRILDMRRSPYKIKQLLP >Cag_0884 Fibrobacter succinogenes major paralogous domain MRYSKSALLASCYVLLLVAIAGCGKKLSPPVNDRDGNSYPVVELASKTWM AKNLEVEHYRNGDLIPQVQNAEEWAQLTTGAWCYAGNNPEEGKKYGKLYN WYAVADPRGIAPEGWHVATDAEWQALCEAFGGLDAAGAALKATGEWKNST PENATNSSGFNALPGGARRDTDGYFMPTGEYSRLWTSTEIAEGSAWAVSL GYYDAAVRRGKASKKTGFSLRCIKD >Cag_0152 hypothetical protein MKQEQQHFLQEKLRECDRHVEKITIAQEHMRSVLPLTPQVYAQLDDVALS FLDQIVFRYSKLQDTLGDKVFPLLLLATGEEVKRKTFLDILNRLEELELV DRMTWLQLREARNEVTHDYSSEVGETVDAINAIIVASDTLQKLYSTIRHF CNHRLQVL >Cag_0704 hypothetical protein MFITNLQDTVCRQTQLEQYYAKDVLRDKHFVCRCFDKCRASHAGTYYEGQ VHYVGSNYDILVGPQPLRVVVVGQEYGHGPALVDSLMRAKMFQDSAHKSR GFLDRNPHMRGTTTALRILFGIEPGEDKAGEWLETSTGRIHLFDAFSLVN FLMCSATDGSSKGKATSTMLSNCSKHFVKVLEILQPTVLVCQGKGFFTYL AESLGVSKQQKEMLFHYRFNGVDGVGVCLNHPSTPRWDSGWAQLTQPYLT SRVLPLLNDVRCELGLDQVIWNL >Cag_0992 hypothetical protein MEISLLQVVIDEHVAVFGVEPVFTGWSAFLSEDEIATNVCAAIDKGEPYV EEEVPDGVDI >Cag_1799 hypothetical protein MKFGFEIVPVERGHGGALYSLRFEAEEKTELDKFLDNEEIQACKEYESLV ARLYDMVDSLGFRDYFFKLKEGSINDSVAAFHYNHGTLRLYCLRWSSILL IVGSGGPKTTRTYQDDPLLSDAVGKLQMVDRLFDERQKSREIIIDPNTGI ITGNLVFTSD >Cag_1008 CRISPR-associated protein, CT1133 MSWMQRLCETYDNAHNKVGDYNDDAILLPLYHTTMTCNIEVTLNENGEFV QAKPSEKKKIIIIPCTESSAGRSGDSPKAHPLSDKLQYIAGDFSHYGGEV TSGFKNDPEEPFRQLYQQLTEWSEASPDKYKLRAVKRYLEKKRLIEDLIE AGVLHLDEDRKLLKKWDSKGKKKTDKPPIFESITNVNDATVGWHVEKKGE PTEPLWKDKDIHKAWQAYYESRKINPKLCFISGRSDVAPAEQHPKKILQG ASNAKLISSNDKKGFTFRGRFTTAEEACTISAVASQKMHNALSWLVERQG YNKGTLNIVAWAVSGGNIPDPMKETAPYDYDDLGDDYNAAQAFGLAFKKR IAGYRARISSTDSILVLAFDAATSGRASLTYYRELTGSDFLDRLERWYAR HEWLYSKPEKKRGFFIQVRSPEQIAKDIYEHNKTEDKEKKTDDIIRSVVQ RLLPCIIDGQKVPFDLVVAARNRASKPMSFKKYKEDWKNREDWENTLSTA CALFRGYYYTNFQEEYSMSLDPNRTTRDYLYGRLLAVAESLEKSALGLAD EGRSSTAERYMQQFAERPFNTWKTIELSLSPYIARLQSNAPGLKKFYTDK LDEIHCLFNPDDFENNEPLTGEFLLGYHCQRLKNYEGSSKKD >Cag_0945 conserved hypothetical protein MKFPHCDYAPSEDGFYSQCASCPIDASLAGCALEECGDGVYQSEGETPHG GSKALFYFTDNDGNRVAKRQAHHVEVHEMNAENQIIAVLYGMVDPEGIIY LKKSSASSSNN >Cag_0427 hypothetical protein MAFNQNVFVNCPFDKTFYPLLRPLLFTIIYLGLKPRIATERLDSGEARIT KIVELIEDSKYAIHDLSRIKATKKGEFYRLNMPFELGIDVGCRLFKGGEH EHKKCLILVAEPYNYQAAISDLSNSDVANHHKETPEDVVIEVRNWLSATC GLEADGPSRIWDAFNVFMGDNYSALIARGFSKQDIEKLPVQELIQSMERW VTENV >Cag_0252 hypothetical protein MIQCLENGTLALEHLRQSGELPPTWQSMAELEAQVNQLHIRFLEERWRIA EKRGWGDLAIELLDRAWRTNDREYMDEEHVSEAEKVTVMQALDRQNRLMD IYNRSANMLLALCREVPNQPQRPIRVLELACGSGGLALALAEMAQRHHLS LEITASDAVLAYCEEGNAQAKAQQLPVTFRQLDAFHLTDYANEQYDITVM SQSLHHFTAGQLAVIIAQAMSQTTTAFVGTDAQRSVLLAGGVPLVASLQA IPAFALDGFISARKFYSEPELALIAESATRRCNYTISRDWPLSVLTVRGG E >Cag_0322 hypothetical protein MGAAWAFPSIPSLIARNITAIFVVPTATPLSSVEHVTAGATITPFKPLAV YGGTMPYIYEVTSGVLPAGMQLDLQTGMVSGTPESIGRHQTVTITVRDAN NAVAATRGNLTFVVSAPPVAKAQPSVQPLAKGVAVKPFKPLEAIGGTGAH VYSVVEGQLPEGLMLDAQTGVISGVPSTTYRQSAIVVGVRDVNNVSANQT SRVTFTTQSSSLAKRVPVKRELQVIARVVKKAPVKSVQIAKVVRSTEKVV AANETPKKTSQNLEKSVVVSLVPNNLLLPQWANNVFVMEVTAADLERAKH MSASVKSVAQEVRQVAEETVQSPVRSTVVKPIRASNDVMTAADEVIASPV ENEPFKPQSPVVEIHYPSADVDGTPAASDAPSQVYLSSLSSLSLADCSSA SSSIAYSLN >Cag_1703 hypothetical protein MTFAEFKVQLENAATEEAVKAAYATYFKIKYDTSNYHDLYTEQVFFEFKK EKNFHNIKALATILAQSLYYIRRLKFVEVEKIIPFFICLADKDEACLTEV RKWSSYYSNDSYDWERPASKPDPKLIDHLVKEPEINNIHIYNVTKKQEHD AFKKNLENALKPQMVLDFGDKKVINEENFEAVFEHWKNVIGHYIVNGYKP SFYFLSNIQKERIEIDRENNRIVFHFEDKNSKVQKVLMKDYDYFWSMYDY VASPDTINGIHAKLDRLTDDSQRRFEGEFYTPLIFAKKAIHYFTKLLGKN WYKSGKYRIWDMAAGTGNLEYHLPAEAYKYLYMSTLHASEADHLKKVFPE ATCFQYDYLNDDVEYVFNCKNFLFEDNWKLPQKLRDELADPNITWIVYIN PPFATAQDAKQKESKTGVSKTKIEKLMDIEKIGHAKRELFAQFMFRIAHE LPKKSYLGMFSTLKYINAPDSVEYRNHYFNFKYEQGFLFHSKCFHGVTGN FPIAFLIWNLAEQCHSEIIKIDISDDNAHTIGTKYLRFIDKSIVLNNWFT RPKNSKTYILPPLSNGITVKNENTDRRHRARPDFLASICSNGNDLQHAKY VVILSSPNASAGAFTVIKENFEQALVLHAVKKIPKPTWLNDRNQFIIPHT QPSQEFINDCIVWSLFSHSNETTALRNVHYLGRTYQIKNNFFPFMLEEIK EWEIKEHDFYVQMLDDTNRFVAEWLVTHQYSNEARAVLEKGKNVYKMYFS HLHQMITKHWKIDTWDAGWYQMRRCLAEHNIAVDELRELYVANEKLANKI LPQIEEYGFLDKDEIYEQL >Cag_1199 conserved hypothetical protein MIDPMYRQQVDLLLQILPLVAKEKVFALKGGTAINLFVRDMPRLSVDIDL TYLPLDDRDTAMKGISEALNHIRQKINHAMPGIKAYLVQQSSGQEAKLTC QSSSAQLKIEVNTIIRGHVFPPRIMDIAKSVEAEFQKFVTMPVVSHAELF GGKICAALDRQHPRDIFDIHQLFAHEVFTDEIRLGFIAMLISHSRPIHEL IRPNLLDQRTVFQHQFTGMTFTAFSYDDYESTRKRLVKEIHEHLSDTDKR FLLSFKSGTPDWELLPMDNLRLMPAVQWKLANIVKLKAQNSAKHKAQLKA LDNALKDC >Cag_1633 conserved hypothetical protein MTVESRKRQFIVEGKIKPSFCEGCGQLTAKIFVGEWMPSDKPKEEDVLAP LTSKKIKEQKAQAQSPSAAENQYWVRCAECNQIYLLKEWQIQIDREVDIN QLTPEECQVYSPHGIYAKGAAVYHQALGEVGIVREKQATGSGAFVIIVEF AKSGRKQLLENVQLSSGNGQSSTELLKLKLRRQA >Cag_1554 conserved hypothetical protein MAENNLQAWEKVLEYASVPLHGTMSRKIRKGVKLQINGGDVYEDAVLFIS DLFLRVTQESDGASINTYYDMKAIASIRTYSTKE >Cag_0520 conserved hypothetical protein MAWFLRCLILISMKIFRWNTEKNELLAKDRGITFEEIVEIIESGAKIIEV DHPNKKKYPNQRILIVDVRGYAYMVPFVKDGNEYFLKTIIPSRKATKKHL GG >Cag_1362 hypothetical protein MYMTSTFMKGIHICRKIYFIIMEINNGHRIAIITGIFSIITAIISGIFLQ TKENISDKGTTINGNQNAIINGNNNIISNTRENSSKNVIVVATTKNVIEK NIIGKIKPYITKNYLLTCLGVSQEQEIKADCTYGGKDIVLYFYSFDNLYI TALISNDIVIGLNFELKNRTTEFPINTIQGKTWILGKISFGDVLYDDDKI LYTQGGNKFTSTLTCGTSYPQYIGAGPYYYIWNSNDFKNISNKFENEIHR YDKKHGTEFFDEKKITIDVVKKCRITSLTILGYEYKELSAYLNSPMSQFH ADIEFSTEVPR >Cag_0669 sugar transferase MMLAPVALFVYARPDHTRKTVEALQKNELAKETDLIIFSDAARIPDKESV VNEVRAYLATISGFRSVTIHHRPYNFGLAKSIIEGVTQVLSEHERIIVLE DDMVTSPYFFSYMNDALKLFANDDRVISIHGYVYPVKQQLPEAFFLRGAD CWGWATWRRGWVLFNRDGQVLLDELKQCKLTREFDFNGSYPYTKMLEAQI KGQNDSWAIRWYASAFLANKLTLYPGRSLVHNIGNDSSGTHCGNDTTHDV DLSSMPINITNIDVLPSIEVRQVFESFFAKSKGSFLNKLHISFKKAFV >Cag_0278 hypothetical protein MEMEKSEKNQYLEQLDAYVHALMQELRIMEQRKAILEPLLFDEDLKSSLN MKFKDTDGAVAYNHFVPLLAQDLIRDISRLFLDEGKKAGSFTNLCRKISN KKQLGWLRERYCESQVINPNELASEFHNIWDNVKKGKEKIMHDPNSEKLK TFRDKYYAHLEMTPMGNEPGPFNIKALGLTYCDIFNFLDTHQNVIYNVAL MITGTNYDNEEFLGIHRKSANEMWRLLAGE >Cag_1046 conserved hypothetical protein MKPLPVGIQTFSKIIEDDYLYIDKTDIAKSIIEKYQYVFLSRPRRFGKSL FLDTLKNIFLGNKELFQNLHIYNQWNWNITYPVIKISFSGGIRNNESLRK NLFYILKDNQKRLNITCEENDEPNLCFAELIQQAFEKYQQKVVILIDEYD KPILDNIENIPEALVIRDGMRDFYTKIKENDEYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPNFGNICGYTQHDVDTVFAPYLEGVAMEKVKRWYNGY NFLGDNVYNPFDILLFIKNQKTFKNYWFETGTPTFLMKLFAKERYFLPNL EHLEVGDEILDSFDIEKIQLATLLFQTGYLTIEKRFETFERLRYQLKIPN QEVRLALSDHFINVYTEQPNELKYAQQNRFYTYLTQVDMLGFQQTLQALF AGIPWNNFINNSLPEFEGYYASVLYAFFISLNATVIPEDTTNQGQVDLTI MVENKVYIIEIKRDTVKSYEISQQNIALQQIQRKGYATKYKGQGKTIIQI GMIFNIYQRNLVQMDWEVVG >Cag_0118 hypothetical protein MSFLKEAGGLAGGALLLTPLTPLGVPLLLHGVAGIVVGGAGLFVADAVLK QVAETTKPQSGDEPEDELVD >Cag_1624 conserved hypothetical protein MSGSVVSCAFMPHTTALVRVKPSSGSGYVVSTAKRFPFGLVRIAADRDGS LLAKIGKELQRWHDDLLALNFTPPLYRSLPAFLPSDATPEEQATYQRLEA SNFLHQPNSYWCSALESAEPCTASDFQAYFLLYYPAEPLRMVRNALSAHC ALAVCSTPVEAFCRLTVSTQDVHILLEIEEAHVALAVAHQGKLMRFVCHP IHRREEREYFALRELLNTPACRDHVVQVSGSHATKNMLELLRRETGLSLR LPTLPAPHFVTNSTRQLLTEPDMYHALSAAMFSL >Cag_1819 hypothetical protein MKVIISRKQVNTNKSNHWFRDIAMLEICDAKYVGDYKIYLVFNNGREGIA NLEKALFNDIRSVFSQFRDKERFANFKVDHGTVIWSDEFDLASEYLFYLA FQDNPELQTKFKEWGYVA >Cag_1373 conserved hypothetical protein METVFIETTIPSYYVARRPRDIIQAARQELTIEWWDKHSSRYELLSSQIV IDELARGEEIMAAKRIELLANIPLLLINEPVIKIAEELLRDRVVPQKAAD DAFHIACAGVHQVDFLLTWNCTHIANPHNRHRIERCFAKHGIIIPIICTP QEFIGDDYAN >Cag_0276 conserved hypothetical protein MFISKRFMKRKVYIETSVISYVTARPSKTILGAAHQQLTLAWWETRSQYD LVVSELVLRECGAGNPDAAKKRLTVLHDVPLILITEQALKIANSLIEKGI VLAKAAEDALHIAIATVHGVDYLLT >Cag_1994 conserved hypothetical protein MPNTLTLNHLSREEKLQMMDLLWDDLSFNQEALDSPNWHREALQETEARV NAGAEQLMEWSAVKKILRNECK >Cag_1904 hypothetical protein MLNVTEIHPEYVTDMNGVKKSVILSLSDFYALLENLDDLAAIAERKDEPT MSHQQVVEELVLDSSLRSE >Cag_0767 possible abortive infection phage resistance protein MNINASIIDQRITGIVDEHPEWMAESNDRNKKKSVAFVLLSIAMCLDIPL DEAAELITDGGNDAGVDGLHIGEVEDGEFMVTIFQGKYKVELSGEANFPE NGVQKAVDTVQVLFDPYRNVALNKKIAPKIEEIRSLIRDAYIPNVRVILC NNGAKWTRQAENWIDNAKKDYGDKVDFIHFNHDSIVSILQRSKKVDTTVT LSGNAIIEDMNYMRVLVGRVSVQEIHRLFNEHGDKLLERNIRRYLGLHTN RVNTAIHQTLCDPQKSDKFYFYNNGITVVCDKFDYNAFQKADYKVQLKNM QVINGGQTCKTIQETLNSDVSNMIGESAYVMIRIYQLAETHQNFVQEITY ATNSQNPVDLRDLRSNDDIQKQLEIGISDFGYVYKRQREEGGGGSHVVTS SIVAESVLAIWRQRPHQAKFRRKEHFGKLYENIFKDLSAAQALLAVLIFR AVENERKRPTSLTPPDFLPYASHYIAMVVGRTLLQDMNISLANVSHQNFN EILKKFEANEAAYYAHAVSDVKEALTACYGEREVSLQQLSATFRRGDLLE MLGAVGLSGDYCFVSQS >Cag_0048 conserved hypothetical protein MVLGLLQVYNALSINELAKKSEKLREQIRLNNSMITTQKLTADELQSIHN IEQEALLLGLEASHEPPIEIERTIEP >Cag_0662 hypothetical protein MIFFVSFGGIWASGGGCSMWLAERDELVTMFQRDVVEVKPLRNRRLVLTF RDGLVATLCLDDIVHHYNGVFLPLLDVAYFNQVAINRDLGTIVWPNGADV CPDVLYAVASGKPIVCE >Cag_0133 hypothetical protein MENLMSKNAIIYGIRKLNVVERLNIITDIWDEIKDSQELEIVSENDKKVL LDRLANYRANPEFATDWDELKQKIHDRYAD >Cag_1505 hypothetical protein MATIPHYTPNDFMHEIEQVPPQYLPQLFQIVHIYKESITKKACLDSFEQS WQQAIAGNTMPISELWEDIDAE >Cag_1069 conserved hypothetical protein MSVTLMIFIYWAIAIAIGFIFFKKDILSFEPKFDGRRIGLLIASLLIIAL NAWVYSHSTSDGGRSLDPLTLLVFSVGNGIAETFMFYAVFRFGTSLVGRF TQNAVATFLVGFLCFVVYSGLIHGLFWINILPEHVVQTSPYKPFFMPVQM LIAGSWALNFFWYRDIRTVIFLHGLVDLTMAWNVKFDMF >Cag_1147 hypothetical protein MPKEEKEQQSPIEQPKNPPINLPIEPLIAAPAVLGAPAMLGVPLVIHALA GAAIGAFTFVTGSLLLKMTKDKALAAKPDPNEVIPPPMSVPNFPRHYSRN DPHIDSPLLSSLRK >Cag_0958 conserved hypothetical protein MLTAYCGLDCEKCEAFLATQENDDAKRITIAQKWSAQYHADIKPEHINCN GCKSDGVKFFYCTNMCEIRQCCISKGVDNCAKCSDYICDILSNFIKVAPE AGVVLAKLRAS >Cag_0261 conserved hypothetical protein MTRTSQHSERQQATEASTHNIQQLVDEAATSLYKDIMAIVEARLELLKIE LTQKISLIGAAVVVGVIVIIGATFLLATIALFLGELMGHTFLGFFAVSLI FFVGFWFFTHYRPTLLQHFIQNLLLSTYDADK >Cag_1178 hypothetical protein MMTDEIITEVRVIKDEIAAQYNYSFQDRFIAIKKGEAELAAKGMRLIYPP NNAVMTPATALQRGKLKTHKIFQKLSASQ >Cag_2001 hypothetical protein MNSFKSISRTLFALSVFTATPLFAETATSQPSTQQEPLNISGTLEVEGYG ERVAGKTTSEFVLATFETRLERQLSKHVFAHATLLFEEDGDNELLVDEAF ITYKALNSPWYVKAGRFTQPFGWFESGMISDPLTLELGETKHHAALLLGT ESEQLSAALGMFRGDVQYDNNPSIDSFVAAVATNFTLGKEMRGTLGASFT NNMGDTDGLQDLFDDGAGNLVGTEKVSGYSFYGSLAQGALTLRAEYVAAA NSFVDGTLAGLKPSASNVEVSYAVREGVDVTARYGSGSDMDIANQYGAAL ACTVDGAATLGVEYLHNSMDSAADANRLTVQLAVEF >Cag_0654 hypothetical protein MWLAERDELVTMFQRDVVEVKPLRNRRLVLTFCDGLVATLCLDDIVYHYK GVFLPLLDAAYFNQVAINRDLGTIVWPNGADVCPDMLYAVASGKPIVCE >Cag_0820 conserved hypothetical protein MATRKLITFDWAMKRLLRSKANFDILEGFLSELLGEDITILDILESESNQ ENKEYKYNRLDLKVKNSKGELIIIEVQYEREYDYFQRMLYGASRVITEHQ KLSEPYSTIPTVISINILYFTLGEGSDYIYYGTTKFVGMHNRDVLHLSKE QREKYGKREVSDIYPKYYLLQINNFNNVAKTSLDEWIYFLKNAEIPENFN AKGIKKAKESFDFISMTPEEQEAFLSYQDALRDQASYFETTYEIPFEQGL KKGIRKGRKEGKEQGLREGKKLGLQEGVLKGKELGLQEGKELGLQEGVLK GKLEIARKLMAKGMSAEEAAGIAGVNIGLLERND >Cag_0063 hypothetical protein MKPLFDFLLKLCILSLVLWGAILYALDPSSVDYYSIFIAWVMMFSNTLVG YLLFEYAIDKDSVVFNKIVFGGLALRLLALMVLVAIFIVGKLVAVNDFVF SVFAFYCIYVVVEILGYQKKNKQKKN >Cag_0256 hypothetical protein MIDVKPFENCVCQKMNNHLSLCKSELTLSEKGKSVTLRIRSSEEAKVLVL DGCVFMDNSSRCDGLYLYKKGNKRYALLVELKGACDIPHGFEQLAYVKKN RQEYRQIVDHFWVEAGGVQPIEKAFLVSNGSLSKPDLETLEKQHNIRVTA ILHCEATSQIPDLRKYL >Cag_0120 conserved hypothetical protein MEVVAKSKAHVVLDACFQESGQFFTDHERLMRCNPFCSNVRYLPAHNIFQ WIFQVDDPRNNPFMAIFFVRQQEHPFSLDDEYGKNFVEKRIKNRERYNGN GKRIYWEPVQEPPADVVLPKPAENGRSFVGTASSDICLLHHHDNKTSVYF DTNITMDFDISFPLNLMPEGVLRFMTEAVMSQVMQQATEAMLCKVQADMG CASSGLLPTE >Cag_1500 conserved hypothetical protein MAKKITENEVDLKKESKKKTSTKSETSKSTATKTKKSKATAEEAALTPEM REEAIRLAAYYAWEQKGCPDNSHMDDWFEAENTLP >Cag_0830 conserved hypothetical protein MVDVVARVHKHRVKLRDEGMRPIQLWVLDTRREGFAEECHRQSALLANDS HEDEMMLFLSEVADTEGWTA >Cag_0966 conserved hypothetical protein MKLTRYIHRVVLLAVLGLQLAGCSGNEASRLRKDDIRFAAFYTDYLLLSG VEPATKDEQLVALSPAMVDQLLEKHHLTRQSMSSLVANYRRNPEQWQVVL EQVRGNLRNKAREANGE >Cag_0272 hypothetical protein MHDNPEKSLESVTKLALQFLAEAIGQCPAGLEQSTNQDVVFAVVGFQYGA VQSAAYVAGLGIEAWNSMAGEVIGRLNGIEKEKVAQFLSVMPMLARKKYP PISIGGQAIMRFYNATSEEEKLTAAASLREILRQIDEGN >Cag_1810 Bacteriochlorophyll 4-vinyl reductase MSEPSKIGPNSIIQTVTALEENYGKSKAETILRKIGQGYLIGNLPKEMVE EIKFHTLVGALNKEIGSTATANILKESGERTARYLMRVRIPAPFQKLVKL LPPRLAFRMLLFAISKNAWTFAGSGEFRYSMSTPPEISVKVTFPSQPVVG NFYLGTFTALLKEMVNPKTSIKADIQKAGSDIQCTYRCEI >Cag_1432 possible virulence-associated protein MRLPKEFRVSTKDLFIRQDEVSGDIILSQRPHSWNGLFELDKLEKSPIDF MNNNDRNLALHNRDPFNGYAE >Cag_0567 conserved hypothetical protein MDKFDLFFDQLGDIQQMVNEKKRLQAEVAQMEQECAAKMQPLKDELNAIY RQLEARIPKFMNQGGSTPRTSSRIPRGKLGESIKNLLRSNPEKAFKPREI AEALDIKGTAVSLWLNKSGQEDPELKRIPTGPEGKRFVYTVN >Cag_0994 hypothetical protein MDDLLIDLVINEHIAVFGVEPVFTGRSAFLSQEEIIANIDAAIRKGEPYV EDDVPDDVDI >Cag_1098 hypothetical protein MTRNVVHIYTREIDFNLELDIEFLGIRMLTGTAKELLWENNHTEKNDSPH LAIVIQNQLELFESVTDGMAYSRPRLLIAFGVLSYFTQQIFTPFETYASS SYVGKFDKECKNRFIFKETELIEDYIQFETIIKYHKDKEFIYSLLDRWRK GLYMETESEDNMIYDDETLISYFHILELLTTKYEDKQKKELKDKIKDFSK SIFEESFLFEGNQLKSEINAKSKIIEGLLLPDLSVSSKIFYIFKEQGILT YRLKSFITNFVKDRNSVAHGRQVYQDRVIFPVPPFFPLIKNRDYPEEFYR ILTAKAIANFIGVNLYSQEWDEMSQSIIPSFDELREFQREKKYLELTNQD FCDGKENDITPFVVSHYLISKKLKAEEALEILTPFINNYNKTEDETMMSI WAIIIILDLLEEGELKEKCIEIIKIAEKNNWHPNYFKMRDEMYKLEYFGF EINGLKDLIRKKEIR >Cag_1076 conserved hypothetical protein MSTITRPMTTMILKQRFPFRFGTSSYIIPADIIPNVEYLKDKVDDIELVL FESDEFSNLPSAEDIQTLKQLAEEWALTYCVHLPLDVYLGHTDRAERERS VGKCLRIVELTRTLPTSGYVVHFEAGNGVDINGFNDADQQQFTDSLRDSL AMLLAGANVPAAHFCVENLNYPYELVWAIVQEFGLSVTLDVGHLEYYGFP TADYLKRYLSKAKVLHVHGTVDGKDHNSLCYMKPATLAILMQALAASPNP QRVFTMEIFSEEDFLSSCKVMEGYVFLPPT >Cag_1531 Preprotein translocase SecG subunit MHVFIVVLALLAAILLIGVVLLQNPKSGSGLTGGISSLGTVQTLGVRRTG DFLSKTTAILAAVVMGLCFLAQFTLPNKESDRKEASSVLQKSAPLKPLQP APSTPNVPVAPAATPAK >Cag_0997 conserved hypothetical protein MFQSTRPRRARQRGRRKIRLKKVSIHAPAKGATFHAYLYVSVPMFQSTRP RRARLSSMRYAYELGLVSIHAPAKGATAQKLRLFYVERVSIHAPAKGATC FVIISNHLILPFQSTRPRRARLIDIEEEGKQ >Cag_0819 hypothetical protein MTAIDIKKTLAIEVEKLSVDALQEVLDFVQFLKIKQWRNREQVSFSQQRI ADDLHAFDINSVVHLEEEFADYKKEFPYE >Cag_1049 conserved hypothetical protein MTIAELQEQPLAERLMLMEELWETLCNEKHHIQSPAWHQEILEERINLIN SGEAEYLSIEELKKY >Cag_1145 hypothetical protein MLKSLLIQRALPLSISYILLIIAGIALDYLLHVAHLVWIGRYFGIVGTLF LALSFGYSARKQKLIKNGALKFFLKFHCYSGWVGTLMILVHSGIHFNALL PWIATALMMVVTASGHVGQYLVKKLKEEMKQKMKQLGITTSVDNEFEQQH FWDSLTVKALDQWRGLHMPLVSFLLALTTIHILAILFFWNWR >Cag_1278 hypothetical protein MNQIDWDQLDRQMQQFSSLFITEVKIPKEKTNKIASIIADDINKIPAKGK KEIVNSISNPIPIQDRLNELTAFQGWMDIAHDFKNPYISRAQVIVQNYIC FVYLGEACFKTLKQHLKPESVAKKCCNFLTNNPVRAFRNAVAHSNWKYKD DFSGIIFYARKGHQASDSIIEWQVEDKSLAFWQALSRCTAYTAFLCLK >Cag_0839 conserved hypothetical protein MGMMNHTESEKSSLTAYNQELALVVIPLLKGVIYQEENPSLWEVLRNELA GVRDYVAVLGLELILDEAEGYAFLRSRSEGETEGANNAPRLMARRQLSYP VSLLLALLRKKLAEFDAGAGDTRLILSRDEVVELIRIFLPPASNEVKLID QVDATLNKIADLGFIRRLRGERQMIEVRRIIKAFIDAQWLAEFDERLNEY LQRPVNAMERENE >Cag_1569 hypothetical protein MNPRVKSVLVLNDYKLSLVFTNGERGIYDCSEFIEFGVFKEFKKYGYFQL AKVEHGTVIWPHEQDICPDTLYLDSEKVSAEG >Cag_0430 conserved hypothetical protein MRNLRNLGFCYGQNMMWLHRRKHPRLAQLLRSALPIQLNESSKIVILSDL HMGDGSRFDEFRTNAELVYTMLHNYYHPRHFSLVLNGDIEELLKFPLYAI ETQWNEFYPLFRSFQHNGFFWKTWGNHDAPLLDEKEYQLSDYLLESLKFH YKEESLLLFHGHQASVFMWETFPMVSHLAVLILRYLAKPTGIKNFSVAHN SRRRFAVEKAIYEFSNREKIVSIIGHTHRPLFESLSKLDFLNYKIEDLCR HYPTTIGEERSTVRQQMQTMKKELEACYQQGKKIGLRSGIYHTLTIPSLF NSGCTIGKRGITALEIEGNTIRLVGWYNSKEQPRFATNHNQTPHQLASTN YYRTILNEEPLDYIFSRLHLLA >Cag_1976 hypothetical protein MKKWQQFLQDGSGVFSSTRLAFLLWVVGTLVVWIFGCIEVINAHATLDMA GKIKAIQFPVIPENVMLIIGALMTGKVWQSFSENSADKSTTSLTVQQTSS AQQGTSENVASETVAK >Cag_1880 hypothetical protein MELSYEWLLTSITTFFTEQFYLLPEEYQVGLNIFFLIVAFLAASGLLLIA LKKLWSFIRTVLTQRKVESGYRLQFPGSYSWGNALMRLPLVLIGTDIFCF AQLSQKGKLLRYIKSASVLIPTLWSVFGLVVFMASTGRAYQTHELYLAAL PVLLVGGTIFFLDLSIISAGGTIKAKSIRFFLAMVTGYIFSSVPLNYYFN ADISSYMLKHDDQIAAVESSYGKRIAAIEQSGWYQQYYSLLRQEEALRQD LMDERKGIEGAGLSGKPNALNPTLNVHYNAIESELQQLQQTRADYVRQYQ PKLDELQQLITLKSKKVVEIAKNNEGSHIKRHHALWEYALSSTSTALFFL AAMILFWAIDSLSILCSYIDESEYNFLCQERNEEMREFFGSRTVRQRFNP VQTSELR >Cag_1708 hypothetical protein MQRIIRIFFASPGDLEEERHLTKEILRQMSERSRYTFEFYGFERALATTA CRPQDVINNFVDECDVFIAVFHRRWGQPPQDTVVYSSYTEEEFERAKRRF VSTGAPEIFCFFKQVDLPSIADPGEQLRKVLAFKRRLEESHQVLYRTFAT AAQFVADIEQHLFAFAEGKLPTPRSPKHRFHIPIIEDQQPDSQRSYDLTK VHQALNAATSGCVEEAVILMAGVSQTTRDIELLDVIKEFFINTNNLDAAQ AVVEKKLTLLQDRRLAAHAYVAVLMSEHWLNDLVASMSKTVSPEKQSVAE HTTRKLFTGIRFHELMIEYLSKYFTVGELLSLTRFYQGEGASITAKFGRT IGIMIPEINAILMAENPELFEG >Cag_0663 hypothetical protein MVGEIYLAQIYFTDLSEYKIRPVLIVKELGDDCMCLQLTSQLNYDGILIT NNDLFDGYLKKDSMILMPKNFTLHKSILKKYLARIKLDLIERIMNQLCKA LGCV >Cag_1431 conserved hypothetical protein MGCITALTLSSALHAASSSPQKIGILWWNVENLFDTQDDPAKRDEDFTPN GKLQWSEKKLYLKQMRIRDLLGALAADKQMGSLPDIIGFAEVENKTVFEQ TLQGVKTGSYKSVYYNSRDPRGIDVALAYNSATLKLQHSKAYSVPLKHPT RPVVVASFMVGRHPLHLLLNHWPSRAFDAELSEPNRIAAATIARHIVDSL LTANPKADVVVMGDMNDEATNRSLANTLGSSMDGVQVKAAKGKLLYNCWS GYNGIGSYYYRSKWQKIDHMLLTHGMLDRTGFYVTKEAFRCIDYPALLKS SGKGTWSTYEKRVYKGGYADHLPLYLKVSVE >Cag_0891 metacaspase MAQRALLVGINDYAPIGPGGPDLRGCVNDVQDMANTLSVLGIIPASPVNM RILTDGRATKAAILDGLQWLTAGASPGDTLVFHYAGHGSQVLDISDDEPD GKDETICPHDFATAGMILDDDLAAILGTVPTGVNFDVIIDACHSGTGARE LSALTALSDDEAVAYRFIEPPIDWGFFLDSAPSLPVRGILKRNTTRGKAK ATAAKNEDQGVGQLNHILWAGCQSNQTSAEATVNGQKRGLFTATFCKILR SANGNITRKNLEVQVSRNIRAMGYSQIPQLEGASTHLKKKAFT >Cag_1129 hypothetical protein MSNQEIVEQARTLQPMDRLWIIEQLLQSLDEPDATIAEIWAEEAEKRLEA YRNGTLEAIPMENIFHD >Cag_1687 hypothetical protein MNTPIEFMKPRLVGDRFSGHAIPFEMLKNLSVLEELVIEAAKWKYLKAHP DRQRVPRGFTEGVSLQLTEVRDGSAIPIIVLTFMTTTPLFPEVGTHVTYF EQGRDAIFATVSAAEEQVQNPNSLPPHLLGYFDQLGRALRDDEALELDPT NQLKPARLTKETRRRILLQSDKIQELTEEVTLRGTVPEMDQEKSSFEFQV IAGSRIKAPLEPQYFDVILNAFTAYRDNRKIVIRGIGRYDRNEKLLGLTL VEHVSLLDELDPGARLNEFKSLKNGWLDGKGIAPTHKQLEWLVDAFERHY PDELRLPYLYPTADGGVQAEWSLGGWEISLEINLDTQQGEWQALQVCDEH EEFYVLNLNQPDAWQWLSAEIMKKSGVEA >Cag_1112 conserved hypothetical protein MTTVSTLYGTLSGVEFRSLFSNGKVDGCLVTEPNTLSTPYGALMPQYEAE DMGRRSVKPLYFYKDGALKSIALQTQTMLTTPIGTIPAELVSFYKNGTIK RIFPLDGKLSGFWGWKNEFALAIDITFSSPLGLLTAKVIGFQFYESGALK SITLWPGETLKLPTPVGTISVRKGVAFYESGALRSCEPARKIEVTTPIGT ITAYDNEPNGIHGDINSLQFYENGSLEALSTIDQSVEVTCSNNCQELFEP GVKRNVCGDERRISVPMPIRFSKTWVMFNNSPTASFNLQECSFLVQKAEL KTEAPSYSCAG >Cag_1062 hypothetical protein MMQNYGMYKIYQWEAIPYSSHNDKWNCDQIDLINRPIESVYFHRNQDLSI SMIAYIKANALNPKATYQLGELHSNFESIKIHHTSGAQGTATITHIPKNT TKFDNNGNPINESIINTLELEIKYNNLPISYTIEWITNLKSEGVWHWPNV VNTKLQGKFNMDFSSKSHTISIEKDNFLEFNNSLSCLQFSFEDELIFIGE TKTDEIEKKYHPGFILYQGNPSQEKMERIRLALSFLLGRFLPSLGYTAFD EKWDIVSYKCITPYDLDGSVYNSSTMPPIKLKTINFFINTNIVTRSVNAF YENFLKFDLKYISYLYWNAINSPAHIKASQFGATLEAIQRNYRQVHANDF QTALFKHEEWKIIQKALLDSLDKVLNSNSDNKEIDEKKIIKNKIYSLNQT PQSKLTNRFFDLIDISLSEIEENAFKQRNYSAHGIKTTQEDFSVIKNNYI LMTLIHRILIKILNISQNYIDYYAIGHPSRNIKEPIGG >Cag_0646 conserved hypothetical protein F56H9.1 MKKKIISIGTIAATCVASPIFAVSNFSDNTASQGAVVAAQAATNALTVLN FNFTPPVVAPPIVPVVPVITTLPPVQLPGGFTVTTSVQPPVNGVSTSTAV TTNNINNTTVSTTVTTTAPTPSGGTTTTAITTDVNGATTTTTTVRDATGN VLSTETN >Cag_0678 conserved hypothetical protein MLAKITRKNQLTLPKSIVTSLPKTDYFQVEVVSGRIMLTPVRMQQADAVR AKLDVLGINDQDILDAIEWAREG >Cag_0393 hydroxyneurosporene synthase CrtC MNITTRPEEELWHNVSSAGAYEWWYFDAVDVESGISFVVIWFCGFPFSPS YATHYEQWKRGAINHPPHPSDYSAFSFQCYEQGQELINFIKESDRSAFSS NPSSVGVTFEQCSFTYQPDNDSYQLSIAFDFPARNRSVKASFCFAVQQRV ALEQQDGNNGGKVPRHQWLLGAPYARVSGDMRLMNSGGQHLRTITVQGAH GYHDHNLGELPMQEYMKRWYWGRAFSSRYYLVYYLIFYRNTAYAPRFFVL LHDVELGSTTHYQPATLREEQLHHGLFAPLHSRQLFLSGNGASLTIEHHH PLDAGPFYLRFPATISLKQEGQAVITLQGISEFLNPQRLNSHFFRFFTRS RILRSNQPSLMYNVYNRFKQIVG >Cag_1383 hypothetical protein MRLSPIAIQQIKEQVNRFFGQQAVIWLFGSRLDDNKRGGDIDLYVQTEQF NLLDELRCKVALQEQLDIPIDLIVRSKNDTSVITTHAIKNGVPL >Cag_1543 conserved hypothetical protein MQDSDGQNIRVELSLLEKNIEQLVLQLTDCRKENEALRSELASLQNILRS CKLPGSGSAQSPTDGSMSEGALSGSEKMQFKQRLVLLLQKIEMELRNSQP L >Cag_1877 conserved hypothetical protein MEYQPSGMRALDSVERAKLGMKVFNLPFDEAEGVIDDYVSGGNYDPASVE LFKDQLDTQRHIQEKAYELFDTGAQILRLVVGAVLKNMPSPLDDDKSTSR E >Cag_0177 conserved hypothetical protein MYWNLELARYIADAPWPVTKDELISYANRTGAPQQVIDNLEDLPDSDEMY ESLDEVWPDYPTDEDFGYGDEDPLN >Cag_1180 hypothetical protein MRIINIHSPDIKQYQEFYTMPIIARFYGIIIKMFFIQSEHQPPHFHAIYG EYNAIFAIESLEMIEGDLPKRAYAFIAEWAQEHQQELLDMWNTQEFKQLP GLE >Cag_1037 conserved hypothetical protein MVKNHKEVLAATHQKLIEFNELNKLHGPELAWEKMLEGLPEKQKKRMATF LAEPTLFKAFTRAIPFFESAGMEMEIVDLSNKGSDAVLEIQKYCPYLQIC KEYNIETPCHIICDIEIESTRQAFPEMKGEILARQAFGSCVCLFKYERPA K >Cag_0062 conserved hypothetical protein MFFGLQAMNFKKVTMSEPQNERFPEYFGRSVRAMSDYIGIGLQIAVSFAL FVLGGYWVDARFGTSPLLLFVGVLLGMVGMVLVLMKVIRQANAKK >Cag_1303 hypothetical protein MKRVLILDTSILCVWLEVPNMKQCGADNDRWDKPRIDAKINAELQNQQTT LVLPLASIIETGNHIAKAPHSNYERAGALAELIRKSADAQTPWAAFSEQS TLWSQEQLKALANSWPILAAQKLSLGDVTIKDVAEFYANSGYSVEILTGD NGLKAYEPIVPIEKPRRR >Cag_0184 conserved hypothetical protein MEQQKLRELLQALHQELEQLQSVDESTTAVLTTLRNDTQRLLSNKEEPME EEEGSLSERMQQALEHFEEKHPSLSISIQHVLDSLARMGL >Cag_0218 conserved hypothetical protein MATVSVYVSGQTEQNDVIEFFQKGMIGADEHPIAFFEGVFYESHQERVGN IAFQDYLVYTNKAIYLWARGASKDYLDRFNLGAVSINSRNKDRDFATLNL KVRREDKEPIYVIFDMVELREAELITRLHTLVETIIEDRLGLNYRQQIPD EIAVYILHSAKSLCPPQSITFSAGEPNAPQQDSQIGYGQDLLEQYKASLG YPSPEPSPTQAQSRATAAAPEGFSPADALKGLEHLLPTDPAAIKKIAESL KEVIGDAPFKLRDQLKNDLQHVPGMLSAVTELLTSIADNPQAERFVLNLV KTAVKNDGVLGSVSKLMKLSSTFGGDNNSKRRSSSSQQASGRSEQGSASS KRRNESFDDDMPKRKSIHIKQEDDEVILPDCFSGLDLPFEESAPPTPAKK REAEEISGTKISPRKPIVIKADEDAIPSIVKTMSASDTPLPNANNSNDKL >Cag_1117 conserved hypothetical protein MTIKELVPLLQTAIGPMILVSGLGLLLLSMTNRLGRIIDRSRTLLGCIEA SAEPQVHRINREVAILWQRAHYIRLSILLACVACFGASMLILLLFLSALL MLEVSLVLATIFVLTMLCLSCSLLFFFLEVNMTLSALKIEMEHYDKKHQL MESMEWGR >Cag_1375 conserved hypothetical protein MAIEIRHISNSDKKARKEFIKFAWQIYRNNPELNRNWVPPVIEDYMKTLD TTIFPLYDHADLAMFTAWQDGKMVGTIAAIENRRHNQVHNDKVGFWGFFE CVNNQQVANALFGAAAAWLRKKGLNAMRGPVSPSMNDQCGMLVRGYDSPP VFLMLYNPPYYNDLVRNYGHRIGQELLAWYIDQTLIDIERLRRIAAHVMK REELTVRILDMKHFDRDIEIVRNIYNKAWEKNWGFVPMTDKEFDMLAKSL KPIANPHYVYFVEDRNKRTVGFSLSLPDVNQALKHVNGNPFSPIGLLKYL WYSRNITMVRTIVMGVLPEYRNKGIDSIMNVQIADYGGQHGVFASEMSWV LKANEAMSKLAQVIGGKPYKEYIIYEADI >Cag_1080 conserved hypothetical protein MKPLPVGIQTFSKIIEDDYLYIDKTDIAKNMIEKYQYVFLSRPRRFGKSL FLDTLQNIFEGKQELFKNLLIYNQWNWSRTYPVIKISFSGGIRDKESLHK NLFYILKDNQERLNITCEEKNDPNQCFAELIKKTFQTYQKSVVILIDEYD KPILDNIENIAEALIIRDGIRDFYTKIKESDQYLRFVFLTGVSKFSKVSL FSGLNNLEDISLNPDFGNICGYTQHDVDTSFAPYLEGVDMEAVKRWYNGY NFLGDKVYNPFDILLFIKNHKMFKNYWFETGTPKFLIDLIKKNQYFVPEF NGLKADESLINSFDIEKLTLETLLFQTGYLTIKQLLLSDVGVSYELGFPN KEIQISFNNYILQSITQNSQKESIRHELLAIVKAGDIGNLEQIIKRLFAS IAYNNFTNNYIESYEGFYASVLYAYFASLGFDIIAEDITNKGRIDLTLRS LDKTYIFEFKVIAEEPLEQIKKMKYYEKYNGERYLIGIVFDPKERNVSQF AWEKI >Cag_1770 conserved hypothetical protein MNVSLAIDFNQLKSLIAQCGIEEKTQIVQMLEKDTFPLRFNALLEKVKTD QLTLHDITTEIETVRQQRYSAKR >Cag_0587 conserved hypothetical protein MVKSIIYLEGGGDSKELRSRCREGFRKLLERNGFKDKMPRFVACGGRNTA FSDFKVAHEQKIYTFVALWVDSEEPLEDIHKTWEHVQKRDGWEKPHNSID EQLLFMTTCMETLIAADRETLQQVFKPLQESALPSLYNLEKQPRHELYQK LKKATQGCAAPYEKGKISFEVLGKLNAETLQQHLPSVARTWHILQQKL >Cag_1485 conserved hypothetical protein MQVTYIVLDDHNPLHRELSIYRTGIIQRICMDDAAYKTYGSLEVDGHNYA ACFHYGLVESLNRLPFLSESGSGLESGEEALLHRSRLAEFLCIVKEALAT LDNTHRETILVGWQQEPVAIAYLRALDAERFATFLISLLHFVEESELQQY DLEFLW >Cag_0545 hypothetical protein MSTINIQLPNSLHIKMQEVARQNGVSLDQFIATAIAEKLAALMTVNYLRE RTERSSQEDFERALSEIPDVAPEEFDKL >Cag_0838 conserved hypothetical protein MPFDYSTLNLLRQNHPAWRLLCAQHAPLVAGFLHRVFIVPNVRILSQADL VEALEDELFALRQQLGADQFPHTAQSYLNEWAENDKGWLRKFYPDGTDEP HFDLTASTEKALAWLESLTERAFVGTESRLLTLFELLRQMSTGSQTDPEV RIAELQKRRDDIDAEIERIRAGEIELLDDTALKDRFQQFLQLARELLTDF REVEHNFRTLDRRVRERIALWEGAKGALLEQIMGERDAIADSDQGKSFRA FWDFLMSQSRQEELSLLLEEVLALPPILSMRPDNRLRRVHYDWLEAGEHT QRTVARLSEQLRRFLDDKAWLENRRIMDILHNIETQALDLRDDFPSGGFM PLNAASATIELPFERTLYRPPFKPLLAGVALDEGDAEIDTAALYAQVIID KAELLRNIRFELQMRNQVTLAEVVERHPLRNGLAELVAYLQLAGEWQQST VDEAVEEQVQWQSATGITRAATLPRIILLK >Cag_1241 conserved hypothetical protein MKEIPFGLQTFSDLRQQNFIYVDKTAEIYNLTRVKSYIFLSRPRRFGKSL LIDTIKELFEGNKALFDGLYIADKWNWTTTYPVIKIDFAAGTIHSIDAFE KRVKDMFITAQEKLAIKCRVDTDLAGCFADLIRKAHEKYRQTVVVLIDEY DKPILDNIEDTAIALQIREGLKNIYSVLKAEDAHLRFVMLTGVSKFSKVS LFSGLNNLNDITLHPAYATICGYRQIDLETSFAEHLQGVDWEKLKRWYNG YSFLGEAVYNPFDILNFIEKQHTYRSYWFETGTPTFLMKLFAKECYFLPN LENIEVGDEILDSFDVERIQLTTLLFQTGYLTLKQRIESFGRIRYLLKMP NQEVRLALSDHFINVYTAQQSVQKYAQQERFYNYLMQIDMLGLQQALQAL FAGIPWKNFTNNDLPQFEGYYASVLYAFFCSLNATVIAEDITNQGQVDLT IIFDSIIYIIEIKRDTSENYQVSSENVALQQLQQKRYFEKYQRQGKEVIQ VGMIFNTVQRNLVQLDWAR >Cag_0933 DNA polymerase III, gamma/tau subunit MGAWQHKLANFATQPHFKPPTAPVPTQADASHASTVTTPIVAMPSTITLE ALKVEWQQFLEHLTHHGHTVLATHLQSCELASCSATGLVELACCRKFSCE EVQQERDMLQQEMVRFYQQPLQLRIRYDAAKDACTKEKSRFTLFQELSQQ NEVIRFIVQEFGGELMY >Cag_1342 hypothetical protein MNTIDPIVDEIRMYRKEHAALYGYNLHTIVEVLRKKEQESKRIFLNPGPK PLGIETSSTSVATMPHV >Cag_1556 conserved hypothetical protein MESTVRPLGTVMQVLEELGHKVTYAYDDLVFTEHNDFLLQFTNHAPELSL FFNTSCPRQQAEKVEQQLIPAADRVGLSVITKGRYSVTGNEDEENLKIEF FNN >Cag_1536 hypothetical protein MLRGVELVQFLWIHEFFVMSASYNESVMPSQGINGLAMLTPIIGFPIFFH ALSGMVVAGIGVTAYNNVVAPLAGKLVEFTQDTLPQLLPPLTPSILSIIP AQEVPVVIPITIKAKETSLLEA >Cag_1047 hypothetical protein MNCPYYNRPPTSSLPFPTAVETTHALSLQSIKKFPTLTLYNLHAFFHVNL LTIMATNSSSPLVRRELFIFDASVSNLSTLSSALSANSSYFVLDSTRDGL VQIADLLAGQTDIDSLHIFSHGSAGSLQLGNSSLSLVNLNNYELPLSVIG SSLSSSGDILLYGCNVGAGDEGLAFVDKLAKMTGADVAASDDLTGATALG GDCELEVESGVIDEASFYYAPEYAGLLGAVGPEFHVDTSDIQVWSYEPSV AALANGGFVVTWISETLETLSSDTHTDIHGQLYNSEGAMVGSEFQVNTYT QYGQYTPSITALADGGFVVTWISETLETLSSDTHTDIHGQLYNSEGAMVG SEFQVNTYTQYGQYTPSITALADGGFVIIWRCVNNDDYNCNYIHGQRYNA DGIMVGSEFQVNTYTQIGAYEPSVAALADGGFVVTWESGIVTTWKSGYQD TSNSDIYGQIFNVDGAMVGSEFRINTYTKGFQGCPSVTSLTN >Cag_0304 conserved hypothetical protein MNALVEDIKKLPVVERIELVEEIWNSIPQCSLELSAEECTELHRRYAAHQ AHPSTAITWEEVRSKMLSTSQR >Cag_1865 hypothetical protein MASYRTKLDSTYFSDAAHALRWQKTLAFLRESEVVGTNCSLGLDLGDRTP LTTALEELFACTFHNSTIDLDVGSLFGSYNVVTAFEVLEHLYNPLHLLLQ VRNVLRGNDARLFVSMPLWKPHILASPDHFHEMTRRAALSLFERAGFAVV RRAEFRIREPLFYVTGIKPLLRAWYEKIQIYELAMQVETVPCNEAALVF >Cag_0866 hypothetical protein MSANLLIIGDTKARALYDVDADALYLPISGRHLESAKALSLPLVMVDDEG IFLAPSHWKRLLPESSSTIDTIERGLLKMGRDARQEL >Cag_1508 hypothetical protein MGHIYLQNNEIQDAVSAWVTAYTLARKIGYAQVLDALENLAPQLGLPGGL EGWEMLARQMGGEE