Gene list
Applied filters:
COG category: Unclassified
Gene type: CDS
Genomic element: chromosome
Number of genes found: 588
# Chlorobium tepidum TLS, TLS >CT2272 hypothetical protein MSVCDDRGLHIDMVSLFFQAKNSHFGRIFLKPLLYFKLRIFCRVSANAFA AYRYAVSYNFAE >CT0086 hypothetical protein MIDQRTSYPSSMTRHLSPFWLNRLLGIPSSTPLTPRLMASTLRVAFCPLQ EESPRLDHFITALRESFVECGVTIVEEAAGEGRDSRVEAGTALIAPGRFE DHQLPISRVSTLYNNLIVGVYDEPPPVHGGQTPQERLDAVVGKLAWEMVH LLIYVTENSWTVCSMNGGITTFDTPLPESRDVLESLIPKLTAQVVPPRDG ELELRTGALDTSAPEFLEFAADFVECGRIWAGNERFMNHTSRESLDYRNG FCRKIVSRYLDERNGMSYGFFARQLPVKVAPAIEADDLGGTSVGDALVPV TIAGKQLLVPVPGVRILTTRSGCRKTAIDPRHDLVEIGLDNGHAWLVTPA GLPEGLVSKPSFDTLTIIAHAIGNAMIASILLALAPGNRFPGLLARFGCG MTHWHYYLDEEMIPDGYVVHGFDNPPVSCSTPQSAAYSLLGKLDALERAL EGGTDYLGDVHIEPGHGTNVVGTRSLAKMAMFLNADSCVCSQREG >CT1108 hypothetical protein MEIKREFTNIRFPEYLFTLVHSHLNPDESYPDAE >CT0332 hypothetical protein MVKQLTKHGNSMAMVIDKPILELIGADADTPFEITTDGQALILTPLKNPK GGEAFGVALEKVNTRYARALKKLAE >CT1531 hypothetical protein MARRVIVSQAASENRASLPDVRLAFQDFLVSRQHWSDRF >CT1352 hypothetical protein MLRSKEEWQETAESVLPPEERYVDRNRMITARYAGWYLENPGTLKWAGMA AFASRQVGLAIMAADLMTAPERDGSGNPLLALHRFGVDWFMRADFEQIRR GNNNIYRDIAWAHAAYVGGGMAELEACASEPEDTLLVKGFGMIDRGRALC RRDAGSPAGERLIWEGNICLLRHEQVDVLQPIFDTLSVGGRIMASFGSEL DFSGALFPDSRFRTSFSLFYGYLETLTGLKSVANPDDRWRWVEQSVIPSW QAAERQMSAPCPTRNALQKMAACEQ >CT1690 hypothetical protein MLAQPFDDIPLKLRQLALLLLQQRFDVEVLELLQVVQAYSAVFIPVAGFS GMIFETTGRTTMGRRSCDGFFSPQIWQACACVSFVLMVVSR >CT1950 hypothetical protein MHTNRQRPDERTKNTALFIKKMKILIHFCFIAESAILLKPC >CT0025 hypothetical protein MPRRIKPCYLELLLEIGNFSHSAQLYRYIYLRKIYLRKHPFSRI >CT0926 hypothetical protein MVKVNCTTLVPYFFMLKVWEDWIDTLPIRDHELGSCLPMNRARRMRTFLP R >CT1941 hypothetical protein MTASGSQQQKRRKAATKEELRRKACKQKSSEGGKPVTASENRLNGLLIWQ TSIQLLW >CT0583 hypothetical protein MHLSIPLKQMSVEEKLQAIEEIWADLASTPANIPSPAWHADVLQVREERI AEGRAQFLDIEEAKKAVRERLG >CT1329 hypothetical protein MISKISFDRKTIPVVRSGSGIDNAASRPVLYSVLFAPNIKTIFKRFAIFC GFGVFVVSGQMRGTHSKKTEYYE >CT0220 hypothetical protein MTLVLNITAIVLFTGLSLLVKYRIKRWKQKQLRQQNDVWSIGLYEGPDPV TLSPAAGIRNPILTAKEVTDAPARFIADPFMIERDGAFHLFFELLNTKRK 
MGEIGHAVSDDLKTWRYSHVVLRERFHLSYPYVFEHDGEVYMIPECAKSK SIRLYRAASFPDDWRPIATLLSGNKREVALLDPSIIFHDGHWYLFSYMRK VNNLHLHVAETLTGPWREHPASPVVKNSDHFARPGGRVVKNGAALYRFAQ DGQPRYGSKVWGFRITELTPTAYREEAVSDTPVVQEGNEVWNGRGMHTVD PHRMPDGRWIALVDGLEDKLRS >CT1263 hypothetical protein MVKHIVMWRLCDEAHGNSAQVNARLMKEKLEALSGRIPGLLSIEVGLDFS CTDSSADVVLYSEFANKAALETYQSHPEHEALKPFIGGATRERRLVDYEC >CT0738 conserved hypothetical protein MRKGWEMFKGNIGEFIGFTLICFAASIVSSKMAAFGSLLFSAIAAPLYAG YTIAAFRIMTGQELQFSDFFKGFNYFLPLFLAGLASGILVAVGLVLLILP GIYLAIGYMLATFLIIDHGMEFWQAMETSRKIITKNWFAFFVFAVVLFLV NVLGALALGVGLLVSAPVTACATAIAYKEIVGLHSAEW >CT1884 hypothetical protein MRPPWFCVEPLMFTEHKFQSVEPARNEKKPARHVMDTGRNRP >CT0736 hypothetical protein MLILLGASIAALGLLIMLVQKSGGNGWLGWFGHLPFDIHIEKENFRLYFP LGSSIVLSIILSLVIGLINKFFR >CT0689 hypothetical protein MKTENIGTNFDDFLQEEGLLDEANAVAIKRVIVWQIGQEMKAQKLNIIAR NDDTEKEISF >CT1869 hypothetical protein MRFILTVIAAIPEQRPRKNTITVKQWHERKNPIKKRDTAAKRDALLSN >CT2031 hypothetical protein MLHYTIKLVFVISYEQGNKTNREMLFFQKFWLRRQSSNLYEALQNWTSPV SGTSGTFFTTSPAHPSSSTLLSSPKKESARY >CT0277 hypothetical protein MNRTTEQRNAEVMKTIGLLDQMPRVEVDHLFRVRLMQRIEAMEVKKTSWS ALPGGAFNPRLAFMALLLMLNIASALMLFMHGTPQATGSSGAIAESLTED YGGPALSYYDDQTTIDR >CT1047 hypothetical protein MLSIANSMQGQSSRSCLLPRPKRSARQAVQLRYLLAR >CT1385 hypothetical protein MMEWSSNGSSGIDSDATALRDRQITANLLAIAILEPLFYFAVTFLITYSG HRKLHIFAAPAHKNELANAKIRPFCSQGNRRPG >CT1372 hypothetical protein MKRVSIIAFPELETKKRQCLHNRVACCIFVAI >CT2120 hypothetical protein MPVMDRAAMRTMATLQPSCMRWLCTTPSILDAAKAGWPISWAATPDSPAT VTIRQFQASTCCRKKTFDVVVNTDVLEHVPEAELDCVLRDFRKLSTNAII IPHLAKATRILPNGENAHCTIKTPSEWAQVFKRHYAHVYELPHHSAVHAL FLCGDQERDVTALRGILEQYVAAKNEVRHHLLPLGKRIEKAIRLIRGKDI NR >CT0869 hypothetical protein MKKCEPPPINYGDGLGWCTPYMYVCFNDECKLYVNGWANLKNNYNKIASY RCMCYPDNGMFDAMCVFSPDGLKGQIVEE >CT1914 hypothetical protein MSFEGEFASYEPLRRLLDSEKVKSLQNRLKIRQQEEETEDFEGSIVKKSD LTESTLQPDLVLAIDGSNLAAKAENGFPGAEFGYITIASVLIDLKLIGEL EKKEFVEPKKFRETEKASTIESVFPGCNVILDTEKNAKSSLRRALFEELR 
SNTIFSDGESLLDTYEHLFRIKREHFQERNLPRSPIEGVEEEMTYDFGEY TCPHSGEPLFSTDALRLHELMNPGGSNGEMFGQIMSTLEKLWLVHILRAF ERKGWLATLRRVAFIMDGPLAVFSTSSWLTKVISHELTRLNDLQRKINGQ DLLIIGIEKSGTFFNHFIEIDTTKDGVTDKFPKQSALLLNDGYIKRNIIF SESIKPYGQDTYFGRKLFYKAASGQKIVPVVACFNEYQRNLNTANPDQFT RLADVMNLLDLLVSSRYPNSVSPLVSAHAEAAIPLNLGKRIFEDIAREIR EKSKE >CT0298 hypothetical protein MVSMDRPALIFPELAQILLLLPNAVRIFLNSMRFESALGISARLDFRKMR LTSQSAEII >CT1119 lipoprotein, putative MKTVMLMLTSLLMAGCSVLGKREAAEPPYELLKHDGAFEVRRYGPMVIAE TILDEKSYSAASGKGFNRLAGYIFGKNRSKTSISMTAPVLQERSSEKISM TAPVLQQPQKGGWSMAFVLPEGFTLQSAPEPLDPEVKLRELPPSTIAVVT FSGLHSAANLEKYSRQLQAWLKKQGYRALSEPKLASYDPPWTIPFLRRNE VQIRIEPDHGESGKE >CT1030 conserved hypothetical protein MTTNFLAILGGFALGLFYFGGLWLTVRKGLFSPHPALLFLTSTLLRTASV IGGFLLISSGDPVRLLFAVGGFVAAKVASIAFGRRNSAPEHRDKEETPCI >CT0486 hypothetical protein MLKGPEREFVANGCRAASPDAAEIARIAKHASGKPASAPPPLPAEFMLRV CTRDDVEAMAGIYREVFSTYPFPIHDSVWLLETMQRAISTTSASSTKVVS SRWPPRRWKHERLAQASGVKAGLFSVADSVPF >CT1940 conserved hypothetical protein MANISLYISGQTELDDVSEFFQKRLIGKGETPIAFFDGVFYESHQERVGN IVYQDYLIYTDKALYLWARGASKDFLDRFSLGAVSVNSRNKDSAFATMNL KIRREGKEPIYVIFDMVEIREADTIVRLQTLVESIIEDYLGINYRQEIPQ DTADRIFQAARTLCPPRQIALQLDTPQAPTPDAGIGYGQDLLEQYRASAG YEQPQMPYPPYQPSGTGGASPRSMGSQDAMRGLESMLPADPASLKRIAEQ IKNMVGDAPFKMRDQVMKDLQHVPGDMATVLNALNELLSNIAGNPMAERF VMNAIKTAVANDGVLGSLSKIIKMTGFGSGGGKKSSRPPANESPREERST AKERKSPFVDEEPDEDGSTIRRKKISIKDDDNHTGADLFAGNDEPITSRE SRPPVSRSSVPDDSDSGAEPGGRRKKLSIKMEDDNEIARKLMSYDEPERE APSPAPVEASSAVSTENGSGIRRKKIAIKAEEGSGAEADIARKLMDYDEA SRSAVTSALSGEVPGSVSSEEPSEPEPLRRKKIQIKADVEQESEPIVKAE PEEPTRKKIQIKTETEPELPVVSPELEIKSMLEPEPQLEVPVQKASQIEA EPEEEIDISEEVIFSALGEIEEMEGSSISQEYMIIESDIPVRRAVSEPVK EPVSEPVIEKKPEQVRHSSGKVSHGKRRGR >CT1684 hypothetical protein MFLLHEQFARNFLAAISAFHFLHPEKPIAFIVFSSKCIWNV >CT0428 hypothetical protein MLKMYVDYWVAVLSGFLQQYFGVKSQKGVTMIEYALIAALIAVAVIAVLL TVGSNLKTVFSYVGSNLTT >CT2108 hypothetical protein MCRLMTSIWQKNREPAPDVPRYARIIGTEPP >CT0573 hypothetical protein 
MIPALKLPEATPLQSWMLRFAQHDRKEWCSSNFYRFLRERFFSKTQPHKS SFHNQALQSLETRSRMVKYQRSPILVK >CT1668 hypothetical protein MYWNLDLARYIADAPWPVTKDELIDYANRTGAPQQVIENLENLPDSDELY ETLEEIWPDYPTDEDFGYSDEEPLN >CT0880 hypothetical protein MQFLKVYSVFGEEVVGCHVRIENLSVACSVVSAGGTHPSEKCVCNHEFSS VSLHQKKHPHLEKLLETSRPIKLDSSSKVLILSDLHMGNGGRRDEFRRNS ELVRSMLQDYYLPGGYSLVLNGDIEELFKFSVEDITKVWGHIYDLFLQFE KNGFFWKTYGNHDSDLFEERNYPLSKHLLESIRFQYGDEVMLLFHGHQAS ILLWETYPLVSRAVVLFLRYVAKPIGIRNFSTAYNSRRRFAIEKSIYEFS NQAKIVSIIGHTHRPLFESLSKVDHLKYKIEELCRQYPSALPEERLAIQE RIGELKAMLDACFTEGKKIGLRSGRYNNIAIPSVFNSGCAIGKRGVTALE IDGDRIRLVYWFKEKQGRRFVSDRNSEPEQLGDTGFYRIVLNEDHLDYVF SRIRLLA >CT1154 hypothetical protein MRKKILFVCGSMNQTTQMHQISEHLREYDQWFTPFFSDGLLGKASDLGML EFTIMGKKRASKAIDYLTSHNLQLDIGGTLHRYDLVVTCTDLIVPKHIKR TKIVLVQEGMTEPETILFHLARNFRWVPRWIAGTAMTGLSDTYEKFCVAS EGYRDLFISKGVRPEKIEVTSIPNFDNCERFLENDFEHRDYVLVCTSDNR ETFIYENRKRNIRKYLDMADGRQLLFKLHPNENVVRATREIELYAPGSIV YAEGKTEEMIANSQMMIAQFSSTIFVGSALNKPVYCGLEPDYLKRLTPLQ NRSAARKIAEVCREVIEK >CT1310 hypothetical protein MVSPFLRREKNFLMNAGIGGVASSYGVKKDPY >CT0442 hypothetical protein MAGNAMKTQAHWLNWKHLKVYSSLLLALFLIYGIGGVYFSKNMVDAGGHP LGLDFIAFWGASYLALAGHAQDAYNIPLLFKAQQIGVPAAKVSYPWFYPP SYFLVILPLALLPYLAAYGTFMLSTLGGYLLVFRRIIRGKTAMWCLAGFS GLWMNFFDGQNGFLTAALAGAALLNFERRPVLAGVFIGLLAIKPHLAMLF PVALLAIGAWRTLITAAVTAITFMAAGTAILGTAVLKAFLASLGDARLFM ENTHLLWNKVPSVFAFLRLLGTSATWAYAVQFAVAVVAVIIVWRVWRHCR NRNLRNAVLMTATFLVSPYAFYYDLAWLAFPIAWLALDGLRNGWLRGERE VLVAAWLLPLMMVLIAAMLKVQVGPLVLGSLLWMTYRRATTASMTGAPAS AAPAKISSRLYSKRCYSLANTSTMAKKSEHAAERKTMNADAHWLNRERLI FYSRIFLALFFGIGVGLVVTSKHMVTGDFVLAWAASHLALTGHALDAYSI PSLIKAQQIAEPGPQDVYGWFYPPSYYLLILPLALLPYAAAYWSFMLSTL GGYLLVFRRIIRDKTAMWCLAGFSGLWMNFFDGQNGFLTAALAGAALLNL ERRPVLAGVFIGLLAIKPHLAMLFPVALLAIGAWRTLITAAVTAITFMAV GTAILGTAVLKAFLASLGDARHLCLENGSLLWSKMPSVFAFMRLLGTPVT WAYVAHFIVAVVAVIAVWRVWRNCQNRNLRGASLMTATFLVSPYVLFYDL AWLAFPIAWLALDGLRYGWQRGERAVLAAAWLLPLLMMVQIIAHLNVQVG PLVLCSLLWMTYRRATTASMTGAPASAAPAKISL >CT1366 hypothetical protein 
MKKQQQLDLLEKALVSTGHKIRYEKGSFVGGDCRVKENMIVVVNKFLPIE GKIATLAAVLRKINPPALSPDVVKIIDTVVPTNLFSRENI >CT2027 hypothetical protein MLRTSWIKKDMKKAEGLFSEVCFLKNKFVDFRPEQ >CT0012 hypothetical protein MSLNFSGPNAQFIDFDSDFNFDRNLFADFSQ >CT1433 hypothetical protein MQEWIEYRCQTHKKASLKKRLFKAVCSYFSNLILSAAR >CT0045 hypothetical protein MVVVFRRFVALYDRFELLWEDQRTERMAANLLILAFAFSLALIELARQGL LPAEVAARVPKSHYYAINTALSMLLGLEVFGLVFSIAKSVSASVGKQFEI LSLILLRHSFSELVHFSEPLDATEASLPVLFMLAYAGGGLLIFLLLGVYY RLQHHRAITRDKEAAADFIVVKKIIALLLLLIFFVIGVIDGLKYLNGNAT NEFFSLFFTILIFSDVLLVLISLRYSSSYPVVFRNSGFAVATLLIRLALT APPWFSVLLGIGAIAFSIGITFAYNRFENLEIQRSAPI >CT2051 conserved hypothetical protein MEVVGRSRAKAIIETHLNDSLVYFSDHVRILKCNPYCTGVTYLKEHGVYQ WIFQVNDPRENPITAVFFVTQNEEHLDSSRTVPAGPVSEEESFPDSAVSG RCIQWVNAPQVPDVPLKEKNTFVGQANTRICLYPLEDRRTEVHFETDITL DFELSFPLNLMPEGILKFMTEAVMSKIMQQATESMLCQVQSDLCCCTTAE LDASGGKV >CT2017 hypothetical protein MMLIPYPFGLHESMYVSSNENTETILCIDLRFLQPKSTPNVTRHFKAAR >CT0475 hypothetical protein MEKKAVQTSKKGQHVFFLRSSRRIIPSSVAFTFVPCFPAESHSNPHITSN RKNNVQDCFRYFFSRKY >CT0135 hypothetical protein MKIGCRHFLVKIYCLGEVRNGLGVMTVHVFDYAAIVVTCCLEQGVTEFDN HGVVGDCSLELVQLLPYCGPHLVGHDVSVVIIYSLGV >CT1202 hypothetical protein MPSPMKFFTSIIESIKLLFAGIWLVFRIILEYFGIISDGNDRTTGIKDMR EEYKKANYR >CT0506 hypothetical protein MLSCAVSKQFLRREKNFERKLSGPRINKQEGNKEASEKRNKKTKSK >CT1562 hypothetical protein MVTRLTEFVIHYGIQVFLAGPGLDLPRNRFALFINPIKKTRKQVIDDISM >CT0533 hypothetical protein MKRSFSEAEPESIEYAFSMEHQGFLVIADNSV >CT0628 hypothetical protein MNSGNFGPLDEDLLIQSIKTSFPEAHIALKFLSCLR >CT0957 hypothetical protein MKFSSTSTLFLLLLSCLCAPSRNGMCKEQPQQILVMWWNVENLFDTKNDP KVDDQEFTPMGKAHWTEKKLLLKRLRIAQVFNAIRAEREYGKYPDIVAFA ETENRQVFAGTLAALDRATYAIDYHESPDPRGIDIGLAWNPATVKFTGSK PYKVRLNNRRGTRFVIAAGFTAASNHFTIVLNHWPSRSFDTQWSETNRIA AARVARHIVDSLRTCNPQSEIIVMGDFNDQPENHSVKDVLGSSFDRKAVR HASSRLLYNCWNEASSPGSYFYRNHWEQIDQMLVSAALLDEKGLSIDKTS FRVFSIPAMFDRFGKGLYSTYKQGKFKGGYSDHLPLLLKVRIKP >CT0022 hypothetical protein MILLLVSENLEHQINTVKRDEGYHEIDRLDHTQQIDDQHQRHDNQKPECD 
TAENDERENLRLLVKSVLKEQVPREAVENNHKPGNENRVDVNRIVCRAPV NTPP >CT1107 hypothetical protein MNSRFISMKKQGISEMFTIPMFCRFAYLSA >CT0074 hypothetical protein MKKNVGHQDRNSLFSVGVPSSEIIWRSALKMINFVSLNRHSLIEQQ >CT0793 hypothetical protein MSDQVSTAPCDHEWFDSYQRFSSFSPESVRDGCHPGIMEHLQSSQKAIVL VHGLSDSPYFMAAINPKIRGTRYLFSAVFDHAVTTMDAHRPFFLPIFPPT AESNSYASIATPAS >CT1212 hypothetical protein MMLTSDRGVMMNNGVLWIKPVMKWMGLTAFLVIAGCSQFKTVTREPGVLG DRVALTPEMTGLYEEVASNASTVRALDGYADLYLETPKRKAKAYCTVQIQ KSRDARMIVTAGILGWPVADLLIRPDSLFVNDMLNNRMLVGRNNGENMGK IIGVNAGFGRMIETLFGIADVPEPAKNIESVRKGSGRVSFTVKSGNGTKE LVVDPLTRELTGLVYFDQSGRKSVEFRFAAYQSQVDKNGAELRVPREIDM ILYREDDPEGSRSLKVVYDERVINPPDFNITFKWPARAKTVNLDEVERLP WL >CT2208 hypothetical protein MRHCSGKRLSRRCFRVERKSHWISENQMRSMCSAPCRKLVNPDFFENAPQ EGTDAAQAEKRRTDAKLRAAHYLKARARQKRFE >CT1024 hypothetical protein MLICPSENRPNAFFTFWTGTPLTGSENLASTEFPRFIASYHSASFRTG >CT1927 conserved hypothetical protein MSPTVFCEQGFRFFFFLREEKRMHVHVISGDGEAKFWLEPELELAKNHGY SRIQLKQIESIVEAHSDELVKAWRKHFSS >CT0059 hypothetical protein MAYASFDHRQVPLFEYVLRILPVGAWRRGRTNVNEVYRRQCSNRTAFMLP RVDEVAAVGALFILHVLHEAIIIHLSGKIMSRQVFSKTAKGLYLISLASI LGACNHKAPEQQNTAPKPVSAAADNATAVTIESGKGSVEITDAVKPWPDD APADVPRYPYGTIRKIIRTETPEGNSWDMAIERLPEHALLDYEAVLKAKG FETTSMIVPEKEGDRGSVTGIKGAITVVLIGSGGSMSLSIIQKQ >CT1133 CRISPR-associated protein, CT1133 family MSGKRSYQSARLGERGGEKMILQALYDYYQRKAADPESGIAPEGFEWKEI PFIIVIDREGNFVSLEDTREGDGKKKKAKPYLLPKSVGRTGSNSYKTSFL LWDHYGYVLGHSRSESDKDQAMAEKQMPSFIEKLRSLPENVKGDDGVLAV IRFYEKGEYKKVKESDNWGECTKIIGCNMSFRLDGEVDLVPCRDAVKRYI ETQIGESADDAVGLCLVTGKKAAIARIHSDTPINKDSKKFVSFQKNSGYD SYGKEQAFNAPISESAVFAYTTALNMLLGKNSKNKVQVGDATTVFWSEKQ DVFEEDFPAFFGYSKDDPDADVRAVKALYEGIKSGHAQMDSKTRFYVLGL APNSARISVRFWHTGTIAEFAGNIRQHFDDLEIIRSPKDSGHFSMFWLLS AMAHEGKVDNVPPNLSGQIFQSVITGGLYPATMLQQAIRRIRATQEVTRI QASILKACLNRFSRIYNTKAKEITVALDPTNNNPGYRLGRLFAVLEKIQE EASPGLNATIRDRFYGAASSTPVTVFPQLLKLKNHHLSKLDNAGRRVNFE RMLAGVFEGIGNEMPSHLSMEDQARFAIGYYHQRQDFFKKKDSENNN >CT0571 hypothetical protein 
MFAQELFRFPTLSAGTIMRSPLLRIVAISALFCAQPFQPEANAWHDKTHL TIAEAAGFDLWYSAAAPDVAKSKEMFSPVESPNHYYNNNANKRVTPEMVM AQVERYNRPNDDEGHLYGAIIGSVREYQSMKKSGKYAKYPLVYCAHYCGD LSMPLHNTRYDDFNKERHSINDGIIENSVRHNIGYIQRMMRPPVIDSEAD LAREIAAVAESARKLGMKMRKENRDMTVDEAYTQVTRSASLFNAILAWLE RTQKTAGERTVTVTN >CT0915 hypothetical protein MLPMVTCAMAPDGGIDPDGHVIRFFAMKMVQACWSLVWFCG >CT2268 succinate dehydrogenase, cytochrome subunit, putative MNFAGCAASSPVTSGLSVMDGSARRTFSSITSKVVMALAGLFLLVFLAVH LGINMLLLVDDGGKSFSAAAGFMSSYPVIRVFELALFGGFALHIAFGVIV SIRNRMSRPIRYQHRSRSETSPFSKYMLHSGIVVLIFLGLHFIDFYFIKL GIVAPPPGVARHDFYSRAVLLFSDRTSSSIYMVAFVFLGFHLNHALQAAI QTLGLNHTRHAAAIQAVSTVYAIVIAGGFMAIPLRFTLFN >CT1917 hypothetical protein MLLRFFRQSTPLQKPGIARFFAFLGGDEKGNPVASFPDLFARRNRFFTFN ECVG >CT0585 hypothetical protein MYYRVDGGIARVRAILDCRRDPEWITKRLG >CT0685 hypothetical protein MLGHGRIYTCEKGCLFSSRQLVKANGNPYGQSVMPVKLHQNGRITPAIRR AIQSSSLSASQLAARHGVANLKALMPVGVKYLPKMPDGPSRRFNVHSSP >CT0811 hypothetical protein METPTSRELVSLLFYLRIEISLNNPEVMDSTMKHRLENMLGYLESESYLM AYRTLNAIVTENEASGELPSLETSTALEVMQTCLRIIVGERVGHPEVAKH FAQTVSFYERLALLLTKKLLGDDSAAAEVDILLFCHDALAKHRRN >CT0885 hypothetical protein MGTLDGTGDIDCSNRGLTSLEGCPEIVEGSFNCSGNRLTTLEGAPRITGS FDCSGNEIVSLEGGPEKVNGDFNCSSNQLSCLKGGPSKVKGDFLCSGNRL ISLLGAPKKVKGYFDCSDNQLVSLYGGPIETGAFNCSGNRLRSLLGAPDE VHAGFDCSSNLLVSLDGAPEFVNGDFSCANNLLENLAHGPVEVSGNFNCS GNRLMNLKRFPKRVEGELDCSGNPILACDVTGPESDRNCIRVVHGGTVRC CCRKTDSSQLEALSS >CT0218 hypothetical protein MQQTFFYYNYENYLLTFLTIVTSVLDLTDSPLLGKGSSRLCYLHTPMTPT SASRSLIPVILMSKSRS >CT0228 hypothetical protein MRVSLRHFRYGASGDLLFGQAARLFAFGIGSARAAKSDKKAM >CT1458 hypothetical protein MPALPPPPQYPPTRCSRSRAQRYLAAFRFCCFILIPTPMKTEVKVIIISG IVTIIVAIINGNDNIHAGGNVTMHGAKK >CT1518 hypothetical protein MLSRRSFFDVNIRKKPSTHFWLDKIHKRGKKSGVK >CT0471 hypothetical protein MPGDSTNFIFMLSPTTSTFDMQKKATLLVSALMLSSTPLFAAMPLVTDDT GTQGAGHGQIEIGFESTSDKETEAGVSCKETGGAISATFSYGLTDNIDLV VGLPWEWDTVKENGLKVADENGIGDLALQIKWRFYELPDSGFNLAIKPGL TIPTGDENKGFGTGKVSGDVTLIATREAKLATFHVNLGYSRNAYKLDEIS 
ESSRKNIWHASMATELNVTDKLRAVGDIGIETNSDKDSDTDPAWILGGLI YAVNDNTDLDIGIKGGLNDAETDTTLLAGVTMRF >CT1796 hypothetical protein MKELSFGFALRVVKLCRFLEKEKKEYVLSRQLLKSGTAIGALIREAQQAE SRADFIHKLSIALKEAHETEYWIDLLYQSQLIEKKGYESIKSTKNRRDNG >CT1120 hypothetical protein MNPAKSDRRFAIGTRGIVSRINDLADQPDAKETLKI >CT1843 hypothetical protein MVLDNRVAVAAGDWVCRVLHPASPETQTVFLDELAEAMAGSGKAEVVTVA EAVEIISAVSSPYLSPRWLAANISNATRPGRGTAFSARLSEPGSIGSVCL GTL >CT0733 hypothetical protein MEAKNLKETPTVVKRVLESYTSLEGAVFFVQF >CT0490 hypothetical protein MYCSVVVYNGIHHPNKDVRVFLKSYVLLFEFIAKCFQVAASDWPFFIWSG RGTIG >CT2197 hypothetical protein MCLRSFDRNLKQTHSKSSNKKNNPAFILDR >CT0023 hypothetical protein MPPKEMTMLDNDRKKDKFSEQFGGTVRALSEYLGIGIQIAASFALFVFLG YWSDSKLGTSPLLLLAGVLVGMVGMALVLMKTIRQADREHDRLHQHTRNH EKDRRT >CT1393 conserved hypothetical protein, truncation MTSYKPEYLIPNLLDLVAEGEGLRIEFKRLIHSAPKIARSITAFANTSGG VILIGVDDDRRIVGIQSEKEALQVIDEAMRFHIEPKPRIEVHFEEFKRRM VLLVDIPKSPERPHFHIEPLIRRDTGKHGVERRVFIRDGSHNKAASDDRI ELMLSSREPLKVAFTGRERCLLDWLNEHDRITAEEFADSAGIPMKEARRI LVSLVRAGALRLDTANGDNSYTLAHR >CT0690 hypothetical protein MKYLDVVKKADTIGLFTIKNDIDMESVQKDYEAYKLNPE >CT1283 hypothetical protein MKKLFILFAFVFLAACGSTSSIQDKEGKSTKIDLSMYDNVVILDFTDATK KHNMPAFAGRNFADRIAASVKEKGVFKVVSREPLADKSIVVSGTITKYEE GNGALRLLIGFGAGSSYFNANVHFTDSLNQQELGKVFVDKQSWALGGIAA STQTVDGYMNEAAKKIAKELADAKNYHCEPNTSAQTETK >CT1617 hypothetical protein MQNGSNSIHFDTVVMRPSCRKKVSAKALGFRYWSQTQQPAGKRRQ >CT1912 hypothetical protein MSLQSLSSFLKVIADLLQTLKVCLRQFDLFLQVLQVWKSIRITLNDFSDI ELQPIVLEYYFLKLMM >CT1355 hypothetical protein MMVCLLCHTGAHEKSFFNGYDGRALYEQVQFLLLSIMVVQNFL >CT2226 hypothetical protein MLNSMKSLRLHRSCTVFFFVIEIIRQNMLCKPFYGVFPWSFAG >CT0491 hypothetical protein MLVKAKDYGIACVGITDYFLINSYKRLIELINDDSRLNALLNPPYADYAK QLLVLPNIEFRSSTIVRHVDIEGKTATREPIFTSFSPTQFRRKRLKRIFF ES >CT0456 conserved hypothetical protein MLEQIRNTHPLVYQNFSALPDGEKHLRSILAIDRYWEKLDLPVPDVILAG SRLPVDACVEEACDILYAGGTLGLLHAAVMSKKYGRKVLVIDRAEPGRTT RDWNISRGELLRLADTGVFTSEELDSTIVRMYKTGWVEFHAPAERRKRLY 
MDEVLDCAVDADRLLGMACKKVLAGGGSKVLGHTSFVCCYQFPDHLVVQV EELSGKPRYFRTQVLVDAMGIVSPVAMQLNRGRPQTHVCPTVGTIASGFE NADFEVGEILASTEDAEVSGKRGRQLIWEGFPAKGDEYITYLFFYDKVDS PNDKSLLGLFEAYFRKLPEYKKPGPNFTIHRPVFGIIPAYFHDGAGCTRV VSGERIALLGDAASLASPLTFCGFGSVVRNLDRMTSGLDRAMREGRLGAA ELANISAYEPNVASMATLMKYMCYDPETDEPGFVNEMMNEVMIVLDELPQ RYRQAMFRDEMKVEELVTVMLKVAWRYPKILKATWNKLGVGGSVGFVKNL AGWAISQNEKRG >CT0739 hypothetical protein MSKSAGSTFCPKSNGAMIFLLFFSSLLPMVAIRKNSLSFPIHRYRAQGYV LLDVIYSSLVRAQSVIPYSGGAALGSHYCRNCVPLYIVSGLRS >CT2143 hypothetical protein MVVLCFDFFIVYQNMSHCSIKSLQKNYNYATGSRILKTAGFIGILTVFQL PVRYLLTTLFMDMAFRYVRWPDCELVSMKGV >CT0765 hypothetical protein MKLFPDEDKKKNFMKRGLPVVLAVVWTPIIWMVLAAFLGPAMERVIGVWQ VTVAILAVATLLAMVALIRLFKTLGLKIFDNIG >CT0596 hypothetical protein MQQTLSLPSIHHNQRKNAALLAIRISNSFLALSPQQKSNCIVRFYPAKDE LIF >CT0382 hypothetical protein MRTDKLYRTRENRNWSKKRGIRDAVKGNFGQAQQRYGPFGQEQHDGGLGH FSGDESGSAVQPAFLRRFEWRCWLASRIENIHRIRSPQAAAMHCCLSAYR LAWQGLIINIATSSWACSIDFFPDMPFEAWTSNMTYLRTDSG >CT0871 hypothetical protein MFLFRECNHDNSSREFLQQHMTNIPPFFPVPGQEKRQTKPPK >CT2195 hypothetical protein MSDSHLLNTRQNLPRERFSLGIESGLRHFLKSVPPPRLVQRALWSDHAPG TMPVKKS >CT1014 hypothetical protein MTFMSVGTLKLKERQALCLRIDAEQIQPVLEGDFEDMELCGLLLGPMERC PKLPVERQGSRQGSDDACRRSPIPVLYLLYRDQMSSSSIVSEWAHWIPAL IMEALEASWFRRASTIMTC >CT0749 hypothetical protein MTVSVAQLILKYIEEDKFLDAIQCVQNEILKIEVKPELAGADRRQIKNLT AIMDKLSEAAMFGSEWDEGRRAKKAAIVKLQKVSAA >CT1407 hypothetical protein MVLFSRVLTLLPLDRYILFCCPGFQAGVRREKVSLKVETL >CT1071 conserved hypothetical protein MNAAHQVAFPDFGSLILFDLLVVITAIALSRDFNEDGINDLSGFGKDTLI VQGGVEPFEEKSFNHAFLDQALTKLPDGFGIENPVAGFESQKALETEPIG NLVFHLIIRKTVEALQDEELEHRCPVKRRSAHFAQIGRLLECDLKNWAEE IPVDMLFQFHQWIFELGQTLRKKILVEKAQGIDVLHGNEVD >CT0893 hypothetical protein MRSSMFMQSRHNTKQHNTQKRSIMKKTLFLLGLATAMGFNNAQAVDWNWN GDIRFRYDSSKTELASSPDKPADDRYRLRARFGVSPTINDELSAGLRLAT GDGKNPTSTNQTLGKDFADKAIWLDEAYINYHPKALEGKVNVLLGKRDIA KTFNVVKDLVWDSDVTIEGATLQYGKDVSGKQKSGPSLIAGYYTLENYAT ASDPCIFAVQGAYMGTVSGADFNLGASYFDYVHMKNVAWWNSPNGPANDG 
KDFRILEVFGTLGGKLGGSLPATLYAQYAHNTALNSDNNAVLAGLKLGSD KKPGGWTLDGGYFYIEKYAVTPLTDGDRPMSSKYSTDIKGLKIGATYQLV QNMTLGATYFHVNPVDSSLTGSTSDHKNLVQADVAVNF >CT0921 hypothetical protein MLPQIALVESRLPHRDDEGPSVGSPEIVHSPSGDLPIAASLQSTPACSNS RTGGGNSLEAPSMHQQLLKASRRRRFRGSAR >CT0210 hypothetical protein MFPMFFPVSYLQKGFALRIAYSGAVRPFLTVFRDKG >CT0190 hypothetical protein MTVTHILLDDQNPLHRELPIYRSGKINTVRLADKSYKIYDSMEISAHDYT ALFYYGVIEQLNALPFISESNNGLDSWDEAFLPSGAIGRMIEIIDECVGE IRGKSPEKVMLGWQDDPERIAYWREIDPAETLGFLRDFQKFAAKAAKEGY DLEFIL >CT0492 hypothetical protein MITQILNNVNDYMLFSAFLRTKSRRDGNWVG >CT0234 hypothetical protein MWTQDRETAFSGEPCRIVWPRAAEPKKGRVAKNGENPRTARLPQVLQTWF RQTEDHDTVNRGVPLKTKPEYKKNILLCTGINR >CT0027 hypothetical protein MLSSLSKLTPEQLEEIRSLEQQLGKTLLSFSEYDVVSDDLSEADLATIKA LEEKLGTMLVAVRGLSQNA >CT0513 hypothetical protein MTLSACTQRRKQTSQAATLHASLRPKLRKTLEHRALAIRL >CT1916 hypothetical protein MCHLRVGCLFYTDAQLAYMRYSIAPYIPYQTNKRNKVARYCPVFSCRVYY KSSNGLLILSKLCWLHSPHPHPRQPNDRNSAKRLTAAALHLRIRAARSLW KYLREAFAVGER >CT0575 hypothetical protein MSKTALLPTVTAVRMVWAMSGKFSNDAPSQTMATYVEDVLKGIEEQAPQI SSDERSYVTSAVATMKASLRSIDVISKGRDLNFKENEKLRSAYLESVKES LDFGNKAQDFLKSLPAMTIAGAGGVTVAQYFFKASTFELWGFGLILTGIG YFFNQYIVLWVRRKKQMLYVTQDYERGLYYEQYLDRVRLVLLALFLDLER IHKRVFRENYEADTTSVAVQSIIDDILSGVRSTFCPYAYKHMAEKKVTPE LWTLCESGIRKAVENCPLWEGGQQSENRIDSL >CT1438 hypothetical protein MKFQALRYGEFFQGKISLYHHKRPLIVRFLECINKGTQEWLVDFYPGNGL ASGMVLAIDHDNQHVLCEITGKKGRGRYIRVLNPKITIEELWQQSELALA DRLER >CT1496 hypothetical protein MQRISHRPMFTIRNSLFIFLCVMAGMSLPCQQPAAMAAKISYASEIVKDV REDKVYLLEKIRKQLTKPSEKILVEALLTEDGPKAAKLYRKQLEEHPDPQ LDPISSSRLAAYEFAVSTTPGLPVMQARASSESRPALMTIAQPLQPKQPV SKPDSSLKRTPPPAGPAHAASKGDTVSTRLAPPPAQASGGGFTLQFGSFD SITNAEQMVAQLQYTAPARVQQINGVYKVRLRRTFTTQQEAAAFARTLPI ESIVVPPQP >CT1160 type III restriction system endonuclease, putative MTTQAPFSLRGRNPDVLTCIANLSNDEVFTPPELANRMLDTLTEAWAANH NGANLWADKTVTFLDPFTKSGIFLREITRRLVEGLTEEIPDLQERVNHIL TRQVFGIGITRLTAMLARRSVYCSKHANGPHSVCKTFTTESGNIWFKRVE 
HTWKDGRCIYCGASQSTLDRGEERETHAYAFIHADDIRTRINEIFGGDMQ FDVIIGNPPYQLDDGGFGRSASPIYQNFVEQAKKLEPRYLVMIIPSRWMG GGKGLREFRATMLKDKRIRKLVDYENAQDAFPGVDLAGGVCYFLWDRDCP GLCEVTNISGGESVTTVRQLDEFSTFIRNSAAISIIRKVMATNEPRMSEQ VSNSKPFGLRTFVRPEKKGDLILRWEKGEGPYPREKVTAGHDMIDKWKVI TSYVGYDHAGNPGKDGRRRVFSKIDILPPGTICTETYLVVGSYGSKTEAE NLVAYMKTRFFRFLVAQFMYSHHLTKSAYELVPILDMNETWTDAKLAARY GLTDDEVQIIESKIRPFDNGNGAG >CT0143 hypothetical protein MYNFFRILQNLFSDFFHLLGGHHFSQGKTLASTSIAQFPTISKVK >CT1999 hypothetical protein MLCFDMSQDTAIPSHNRLGVLNVSLKGVAIVRVMMLAQRRISV >CT1284 hypothetical protein MIRKTVKTVGTAFVFSPFGMPLLVHGAAGLLVGAVGLNLLNGVINDVKSA GDILQKEMSKPSDRQQDEEPE >CT1157 hypothetical protein MNRIKVSGLFLIFLVITATLQLSGCATVPSTPTDPRMLGYDERVEVTVQA LAAPDAPSNDKSFFIVPGMQNLSENDLEFMEVSRYITNALSKKGYIRANS VKSAAILIRLSYGIGDPQTSSRTVEISPGYSYPVGWMWFTQPPQTQTVKE TTYQRNLILEAYDLKDPNRKSQLWKTIVKSEGSYSDLNRILAYMIAASSE YFGTNTGRQIDLTIYGHDPRLLDIWK >CT2020 photosystem P840 reaction center, large subunit MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFL FQRTETRSTKWYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQV FPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFH FLVGSVLIFGGWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAK SYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINSETFMSFVFA VIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVA FQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFP AYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQL NGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICE LNIFGTNITMSFYWLKPLPIFQWMFADPSINDWVMAHVITAGSLFSLIAL VRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLW GIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYF WTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIR WLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQT NSPLPPPVSHAAVSGQQMLAQLVDTLMKMIA >CT1468 hypothetical protein MPEWQNLFTCRKREAVGNVEADKPLIKLFSP >CT0542 hypothetical protein MAALRKLLRLRLLLLPIRLLRLLLLPIRLLRPIRLLRPLLRPIRLLRLLL RPIRLLRLPTKLLRLLLPKRKNKFLNRDLSTKETETTLSLFLYHVLWNYV KYWPKALRTLNITGPDVKGGTG >CT0984 hypothetical protein MKKTAKLIALAAVLFAGFGSTSAKADEGFKIGADVVSSYVWRGAEIGDSP 
AIQPNLSYTFKNGLNVGLWGSYAIEKNTPRINNSDYRYKEVDLTVSMPVG PVTFAVTDYYVPVEGGETNTFDFGKDSANTVEVSGTYTYKNASLMAGVFV GGNDYDNAWYCEANYKFYDKNGYTAKATAGLGNEGYYGDGEGKKLALVNT GISISKDRYTASAIYNPDTEKSYLVFMASF >CT1519 conserved hypothetical protein MRTRFWIFSMIALLTLAGCSNYRVVSDYDRTIPFERYKTYRWSDKGSAGI SDDILANNPLIYKNIKSVVDRELATKGFVLKASGPVDFTVFPHARVRERV VIEPSGFFGYGCGYCPGWGWRSYPPYWYDPYPYPVFSHYEEGTLIIDIID SRSGEVAWAGIARGILKDYDSSVQMNRDLDEVLTKIMAQFPPMVK >CT0510 hypothetical protein MKRLSRIILIQSRITLIQISKTLMTSYLVFEKPLKYAISSAMVHSLHQIK IKTEHQFHFS >CT0528 hypothetical protein MTSDDRNKRNPETLGAIGKVTLEKLASLSSKIRDDVATEADRVNHEILSE IVALASNRVSVEKNESILKEGGSAWTERTESHYPHIRLHGGALEDDPLKM MIR >CT1280 hypothetical protein MEREPIQGNVLKTAATVAGAGALLSPAGLPILQGIAGIAVVGLGIFAAGT AAMKVGEMISSGFGQSKPQQEEEDSPFL >CT0794 hypothetical protein MYPSERVVTPRSFRGNPLRNPVLIHLQIHDYQVKRK >CT1400 hypothetical protein MIGHSEVIRCFGMSGKVYFFNDSGNIQKGVHDLPPKGLACFVKIKNMMMQ AKPAEVSMVDALVDPWLGAIGRDFEGYRNHCRRVFIFACTLAGAEGESRE KIAIAAAFHDLGIWTDNTFDYLEPSKRLASAYLASTSKAEWTDEIKAMIE QHHKVTPWSCKPGWLVEPFRKADWIDVTLGARNFELNRSYIREIQRRYPN AGFHATLARLSFERMKTHPKDPLPMMRW >CT0505 hypothetical protein MNLNEIKTSLCPNGRNIQVGDPDGVSGITHASHDDAPCRRFTSVAEREIA VTRACRLPNSFSRVSASELSDDDGGSF >CT1600 hypothetical protein MQFKTRHPKHSMHLDTFPGLFKLERKVLYLPVLFYLSAHIAA >CT0750 hypothetical protein MVKREKVNFLSAYGTYTTRSIYESVRCKYIRYKQKLLIVFYNLEITAAGN NFFLSRSRDSANNFLFLIS >CT2149 hypothetical protein MSAPLAFQGAWWSKMMVEGKFGLVYPYCSVFSFFCFFAIFLRKDLEERGW CVTLGAFT >CT1186 hypothetical protein MKENGNKGRKKIAHSKFIIHNEKKPVVQTGFFFHDMHPAVSR >CT1172 hypothetical protein MCLIMFIFFIVFVFICRLLLLFCVVYWFFYCFYFNVSVLILCFLFFLNLF VFLQALIFL >CT1711 hypothetical protein MRDHTPNFKLLELSDASKALVRETVTQLLEKLAGDGQLTPDARLEFWVEI PGVKHPRGTFRGGCLMPDSYLCLSDWFATGSSAIEPAAEYASSENPLDAA WADFLGELYYQIEIFTSVASANQGITVELWAGTRGRPECEWMYAVDKKIE LP >CT0122 conserved hypothetical protein MLEMKATGKSKGQWMFYRNFDDTVDYLSDHTRILSYNPFCHKVEPLDRDE AYRWHFRVTDPQNNPFDVIFNIQQETEILVDLPDEVASMDPEEMSDEMIR QFTVGRKITWRPLAQDKTFTMPEKYLFEGQVTADMLIVPVQQEQTRVDFD 
LWVNVAFLLYPAFRIVPEKVVRTMVSTGMSLIMQTATNHMFQKISKEFGK IRKL >CT2122 restriction endonuclease-related protein MLAVKTTCKDRWRQVLNEANRIGKKHLLTVQQGISLNQFREMRAHDVQLV VPADIIKLYHKDIRSEIMTLEGFLGEVKTLVEKPRKRS >CT0677 hypothetical protein MSAHEKDLTAFINMSTRPKTTRAKNDLLFTWALPGPAGQEYMPLIKRKAV FV >CT1563 conserved hypothetical protein MNNKTSIVAGWIIIASILIQFIPLDRVEHPSKPILGIPASVLAQLEAHCF DCHSSRTRWPQSAYIAPLSWYVTAKVRQARKAIDFSNFDALPDDGRRNIK RATSSLARSKGLSAHGEIPGFPKIKMTERERQALTEWAADNNRK >CT1866 hypothetical protein MAMKKDILERYDRLDDGRVVIDVYASKVEELYEDFDKQAPFHRKDLDEEL AAYLFDCVREIGRVDFIIRITLDAVPSAELQERIRTSLKKFFIYQRGLES ASMQQLLRKSLLFFLSGMALLFFSLWFGGSMIPEVRQLVYERVLVEGVTI ASWVSIWESLSILMFNWWPARLRIRLNSRIADAEVQFQSHPGIRR >CT1320.1 hypothetical protein MHERKAAIGDIMKKHLLLATLASGLLFFSPSGQALADVDLHVNVGGPGFV VDYNPEFFYVPDLGYSISYGGPYDIIMYGGYYYLYHNGYWYRSHHYRHGP WVIVDYRRLPYRIRRYRWDDIRRYREVYYRRIHPDRFREHRDRDWRDRWD DRRDRRDDRWDRHDDRHDDRRDGERRF >CT0164 hypothetical protein MSWGIENSKKRKDILDLMDIGVRKTMANPNYVNEVPKSTNGCWMDQIYGL MRCDICDLSSQCPVREEEEWQAWLKEHNIVIEKKKAE >CT0522 hypothetical protein MIERFLYFVTNTLHRHPATGKKIHNSYRKSPHFSSQQNGNAIFNTL >CT2231 hypothetical protein MLWASRKTECAGRPVSLASLNRLLRFSLKPVQNRSCSFSRISRRWRFRPG REMTIMRITVR >CT0407 hypothetical protein MNHTKEATKNRNGQPKAKLRPELAEAQPEARLKTYRGWRPDEQVSRRQMG EARFSHESLPETINP >CT1515 hypothetical protein MLLTPSEKITREYNRLWIVSIARILLALAIGFTLYEATIHHPILPPRMDY GDKVLHAAAFFALTMLTEISFPGLKSLLPKLLFLLGFGIFIEWIQSFLPW RSSDVSDFLADCAGIALCFVPVLLTRLTLRLSDH >CT0983 conserved hypothetical protein MIHIGWQESIFVTGFLLFNPRHYPIQTIMSIEIRRVNTSRERKQFIKFAW KVYRKDPELNRNWVPPVISDYMKTLDTERYPLYEHADLAMFTAWKDGVMV GTIAAIHNHRHNEVHQDKVGFWGFFECVNDQKVADALFEAAAMWLKSKGL DTMRGPVSPSMNDQCGMLTKGYDSPPVFLMLYNPPYYNDLCLNSGHKIGQ ELLAWYIDQKMIDIGRLSRIAQHVLKREGLTVRDMDMKKYDSEVEKIREI YNKAWEKNWGFVPMTDKEFEFMAKSLKPLADPHFIYFVEDKNGKAIGFSL TLPDINQALKHVNGNPFTPWGLVKYLWYKRNISMFRTITMGVLPEYRNKG IDSIMNARISEYGGKYGLFASEMSWVLKSNEAMSKLAKVIGGVPYKEYVI YEKAI >CT2110 hypothetical protein MVSNLLKSLKKEVLEYAVLPFLFYTPAIRPYTVISP >CT0599 hypothetical protein 
MFKMLMTITTIKELIPVLQTAIGPVILISGIGLLLLTMTNRLSRVIDRSR ELLDEADKLFGVDRARIDREIDVLWRRARYVRSAIMLAVASCLGAATLII LLFLTSLLQIDVPLLASIVFIVSMVSLIGSLIFFLFDVNLTLSALHIEFE GHRKKS >CT1996 hypothetical protein MVSGKPSASGFAFLPACPIPGKGDFLSSRLLFRPNKTKIILIY >CT0710 hypothetical protein MTIMYEKGGGMAGSALQTWEKVLEYASVPLHGTMSRKIRKGVKLQINEGT VYENAVLFISDLFLRVTEDSADTSVNTYYSIDSIASIRTYSTKE >CT1044 hypothetical protein MLTEEVILIAEKQCGSRIFAFCRNGDKSGYRWYITIMFGKVMLCMNNLAA GMKVWISIRSSLAVSN >CT0731 hypothetical protein MIQPGFMDAANEHNDHQLLAGLEKNVGRLVEQLSECRKENELLKSEVLSL QNILRSFKLPGTEGPEPKVSGTSGEGFSYADKLQIKQKLVMILQKIEREL RGEKAGF >CT0504 hypothetical protein MATAIYVFFDNDYITDRAGIQKLFEQRPLVEVIVQSIPYVWLFALFLFIV AAFYGFRHTRKGYRYPMFRVIGGSLLVSFLLCGLLNVFDIGKYVHRYLID NVEGYGSLVYTNDVLWAQQEKGLLGGKVVRYTPGDSTLVIRDYRHHFWTV DLSRARARPGTKIVTGKYLKITGLKTGQSTFKALTIRPWVKKSHHRHPKA PKPTPVKKSSASGKPASPLSPAQQLK >CT0395 hypothetical protein MVMKQYVMATALLCYAAPAYAAYPLTTDDTGTQGAGGWQIELHTEFSTSS RTDGGVRIKDREDDATTVISYGVAKRMDVIVTLPYQWYQHRQGQLVTDDE SGIGDMTVELKWRFLENEKSGLSLAVKPGISLPTGDADRGLGTGRVTGGA VLIATKEFGALTLHANAGYHRNAYALDADDAACNKDIWNASLAGEYAFSE KLRAVADIGLETATEKGSRTHPAFLIGGLTYSITKDFDFDFGIKGGLNDA EPDTAVLLGLAARFN >CT0681 hypothetical protein MCRLISTPRKMARRVQFRISQEVSIMAKDYLSIASEIKELEDLLAAIPED NVIERLSLESRLESARAALTVLPQQIAPKARLTFRGRPVFGSHGIAADFG SKAAGAFSDAFAAVCAGLSEGLRYMGPIPNRDENQLLITGTAIGSFGFEF ELPAPDPSLFPPETEKTQEAMVKIEELFRLSAEGTDDEIAEVIEEVHPRA IKKVYEFLELLVQQEAYCGLEFADRFFRFADYKQIKASCERLKSDNIQER EETYRGEFQGVLPTARTFEFQVMDQKGPIKGKIDLTIADPDVLNREWLHK PVTVKFNVMQVGQGRPRFTLMTLDDLRP >CT0589 hypothetical protein MVKDSNHFMNSTNMNFVIRLKDVVDAIDQPDEERRAFLNIRTGRIVTFSR DALDAVELGSAVRAREEALVREAGEALLSGDYRELPDQFDINDCSIMRRF CQTVENDELRRGLLRSIQGRGASMRIRSTVDAFGVVEAWSAFRNEALQAI AIDWLGNLGIAYSGE >CT2217 hypothetical protein MDNKIKPAKSCNATPAYNVRFSSLKTRIASERKKPKNQLVGR >CT0580 hypothetical protein MCRISIRRKNIVSMYQKIFKVFLTAIGLVKMRLGVLFSIFLQYSQN >CT0860 conserved hypothetical protein MNQGKPEKLYADSVQLAYAKTLEFVSHAIIILMAIGFILYVFRLLPLTVP 
VETVAANWHLNATKLQVKIHHHCGWSCFEDVHTFMHGDAVSYASVVFLSL ATMICLATSTMAFFREKNRIYLVITILQILVLLVAASGKLTSGH >CT0465 hypothetical protein MPAIFFLTEELNDMKYRSGKGTGMISDARLRRVCRLTW >CT1443 hypothetical protein MKARSQELVEKSVSAMVSAIEVYNKPDFKYREETFSILAINAWELLLKAK WLKDNGNKVRMLYVTEKKLRPNGKPYKHAKVKMTGAGNPLTHSLDYLAKR MAEKKTLADAAHKNIIALCEIRDSSVHFYNKSGVFAVRLQEVGSASVRNY AKAAQSWFDIDFSQYNFYLMPLAFVNPQQPGDAIILSKEEKNVATFISSL EAAGDPEADYAVSVNVELKFLKSKADDAIKVQVTNDPSAPKVQLTEEHLK DKYPLNYAALTKACSARYFDFKQNQKYHDLRKPLKSDKRYCHVKKLDPDN PKSAKQAWFSNAVFNVLDKHYTVKG >CT0253 hypothetical protein MPSQESWNVTLVVSDNGRTKTIVSARHAAEYRQGEKQEIRLDGGINVQLI NRDGSVTLITAGRGIVHDNQDIEAFDNVVIRSADNTVIRTEHISRSSSNR MIRSDKYVTISGPSRTIRGYGFESDDAMKRYRIFHASGEALSK >CT0912 hypothetical protein MTGLSQSQASPMQIQPGNAAFNPWTDAALDTIRDVNQALTLYAEMRVVPA HHDAFLAAIDTVSAKLRVLPGFLSLALKQMSGDSTMVKNYPETYKGVLAT AYLDGVAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAMAPR GGDGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPVELPERETVTV ENHVMVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYR KALSTEILRNAHADGGLRAYIMHGVWESVWDHENSHLDPRFLAAAGPVGA AAVVGPVEPFYLTRRLVVAD >CT1079 hypothetical protein MIGAVPFNNVPFSSLSVHSGHVFHLAIAGIIVRIGYIREVFFNFSSDTAA >CT0787 hypothetical protein MTRQRRASAPIQYRPHNRNKMDGFWSQRHEIRHAKKPNRPKESTSVRAIC NSGGCERATSTR >CT2079 hypothetical protein MLEKMANNNVEYRSTREPLDAIENNDEQDAIQKSIERRAISTEALWRALG RRITKKWKDEGKACRSGVVIQSDDKPLQASFSEQIASKYNLNANYC >CT2147 conserved hypothetical protein MTNAFKGIPEELLKPHASPCVSIFMPTSRTFPDNTQDPVRFKNLVSRAEA DGIAFSTKREMAPLIERLRLLQDDASFWNHTLDGLAVFISPDYFRIFRLQ QSVLEQAHVTDAFYIRPLIRIYQIVERFQVLALTRSEVKLYEGTRDHLDE IELAPEVPKTMTDALGTEITPPHMTIASYGGTGTAMRHGHSSRKDEEALD NERFFRAVDQGINEYHSSSSGLPLVLVALPEHQGLFRSISRNQRLVAEGI EIDPAALGLEAMRQKAWQVMEPYRERKIDQMIARFREAEGGKLGSDNPYA IAIAAVAGNVSHLLLDGQKFWPGQIDPVSGDILLDEASQASGRDVFEELG AAVLARGGEVLVLPSERMPSASGVAAIFRHD >CT1623 hypothetical protein MLMADRFGWTTPGKNHPVCPSMKSFAPIDHHPALAP >CT1764 hypothetical protein MYSRIQTLDKNRPGNEKIRFTAAGYAMQLSMHHLNNA >CT1134 CRISPR-associated protein, CT1134 family 
MEHWNKTFCLEVKGDYACFTRPEMKVERVSYDVITPSAARGIFEAIFWKP AIRWRIRKIEVLNPIKWISVRRNEVGQTASERSDGIFIETARQQRAGLFL RDVAYRLHAELEFVPPSERPDAKRPVPESLQDGRETSELRKDENPGKYYA IFERRARKGQCFNQPYLGCREFSCEFRLVDDLANEPPPISETRNLGFMLY DLDFQKNLKEPPPAFFPACLEKGVIKVPDWESEEVRK >CT0248 hypothetical protein MSVKPVDLNKLRSTHENLYETVVAISKKAREIQEEERAELEERLLPYKEM IRNPASESESEKVFPEQIAISVAFECREKPTQQALAQYLDHQYDYVLEKS PETKVAQNEDEDESDRD >CT1820 hypothetical protein MPALFFPPRQSKSISPFHPDKQTSLPVARCGGILYHYERVDFTKIAHMTD YHYPSQALPFVR >CT1877 hypothetical protein MEHLNQLPVPALHRFERVPEGTTLGSYRDTETPLLVIDNEERDVTSRSLD FSGQRIITKLPVVPSMFIRNVRVVMQDCLGYIMEPYKVKEKV >CT0078 hypothetical protein MTMFTRLILWLLPLLLVLPAASAAETAGGKLFIESTLDNATPWVGQEVKL TYTLFFSGTAPQIEDKSQPEHSGIWVRELAPENYINSAPVSKNGELFRKA VIKQLRLVPLQSGKLPVTGYRLRCLVPQQGEASLDSRNDTETIVTAPTAI IQARALPKPAPADFSGAVGHFTLSVSPENSTVHAGEPLSLSVGISGKGNL DTLPPLKVLLPEGIRQEVSVASPDTASAKGSTSSVNTKMLLIASKEGTFR FVPLKLTVFDPETGRYETIASNAIVIKVIAGRTAMMPPQSPLPGVMPPPA DPDPLGAVIRPIIMSMGLAVLVLIFGLHLRYIKRYKRTGVQQKTSEAAEP IRPTAAPAPATQTSTGGKSPQSLRNELYGAVKKTGIMNPAGLTTKELGKL LKEKGVKAQTISALTELLSGIDHALYSPGQISPEKLETMNRDASRIIADL TRS >CT0223 hypothetical protein MFALLRWLFNGNQATKDMAINQRKRGGLPLRFAAFHELTEEEDGHKTRSY NRE >CT0693 hypothetical protein MIFRYFIRRLNGYDWLVRKIKISHDALLMFDALTMSFRKMTVR >CT2046 hypothetical protein MKLIALIGDEETRPLIRKMFTAHQVTLFSSIAIRGCSCETGGEPVAWWPA GKDIPTVYSSLCFAILEDEKAEKIMKDIEANPLAADPAFPAKAFMMNVEK MI >CT0229 transposase, degenerate MRPKRHESLCETSEATWRHLNFFQHKACLTARVPQISSPECGLLKLQSVS LCSWPGQWRSRRSPR >CT0186 hypothetical protein MTHNDHPGEGTPKTVVCPMCGEPFTCGMSTSCWCATRVVPDSVRNYLAER YETCVCSTCLDRLIAEAKEELRGA >CT1944 hypothetical protein MPTAGAATGYWFYGEIYIYLNLIKKSAMLKQNQATRYALLYSLVR >CT0894 hypothetical protein MTPVSANAPEELREAVDLRGCLHYLCISAMISQ >CT0503 hypothetical protein MAVAIDTAPVMTEIRAKAQKTERFRMKCQRGIGTSFLFSISSMILSELIF GMTVQN >CT1061 conserved hypothetical protein MRHRITTLRCLFMAIVAIYGSLALFPAKASCQNDKKLEVPVRTQQSDNDH RGITVQTSDLDDGVTGVVGKVYIEASPKHVWAAITDYNNHKSFVPKLIDS 
GLISDNGREQVMFERGKTGIFLFRKTVYIKLSLQGEYPKRLDFHQIEGDF KVYEGDWLIERASDGKGSILTFRAKIKPDFFAPAMFVRKVQQNDLPMVLA AMKKRAESAEGSLRVARTSSLKQSTQPSADSAIAD >CT1905 hypothetical protein MGREWQGNKKAAVFDSSFWLVFRSVLSTWGGQPGALHEALLRSP >CT0944 hypothetical protein MKKYPFRLGTSSYIIPDDILPNVRYLADKVEDIELALFESDEFSNLPSPE VIAELVALAGEHGLTYSVHLPLDVYLGSPFRDERERSVGKCRRIIDLTEA LPKSAFVMHFEAGKGVDINAFSDEERQIFVESLGDSARMLLEGCGEPVSM FCAENLNYPFEIVWPVVEQFGFSVALDVGHLEYYGFPTADYLDRYLSRAK VLHMHGTTGGRDHNSLACMRPEALDLVVEALRKVEGEPKVFTLEIFSEAD FLSSVETLERFSS >CT0888 hypothetical protein MTVAIELLMLWQQRRLKVFGIGRVHIVRTGAGCPSF >CT0577 hypothetical protein MAPGTELGDPFNDRNLVIPRSFFVSSPKLFPQYGGVVYIALSNLTVTIRN GILCVLTSGKSWNKKDLSINNQLSILETPHENRPPVHFPW >CT0712 hypothetical protein MTRRINPNRRSVTINGFYVTSSAPNETVNWTVSTGGGGTTAGPVPEPATV MLLGIGGLLAGGRKLYESRKEEVAF >CT1582 hypothetical protein MEKENKKEAARKKSLGELGELFAIKALVDKKFDRIRNLNDKLMNETFADI ECEKEGKNYIISVKARNKYQKNGKVNTRYNLGSDVYTKAVMAEKKYDAIA HWIAIQFDKNSFSIYFGSLEELQGSKAIPVDKCEKGIIGEIWEHDKRHFF DFDYYTNQKK >CT0844 hypothetical protein MPSSVLFLFLFRNLTIVFIEEHLGSAFRESVTHIDIKGGA >CT0537 hypothetical protein MRHELDNPHAHATNNPCIRCLRHADRYARAVVSMLEVFASDKAEAKHNLF LKEGLCDIDKTLRSNDKRDTLRERPRNQTSSAMTLSPEKRARVMQGEILI DLNWLPDGVIGAKGSVFVEAEPPVVWRMLTDYDHLHETMPKVISSRLLET NNQTRIIAQSGKSGIFIFEKTVNFTLKVEEVFPEHLWFSQIGGDFQVYEG EWQLEAVEGKNGHATLLSYQAEIKPDFFAPQFVVSFVQSQDLPTILRAIR SYCEARAKG >CT0635 hypothetical protein MQRRVELKLEQRRFYVWLGIAAAAHVAVVAAVVVLQLLYVRMHPPMKIVN VSLVQMPGLPGPAGGPKSPETPPAAIEKQAETPELASAKKVAEPPPQPVK KIAKPVAAVKKIPEKPPVKAPVKAPEPASKTQSAADERKKIAEPVAAVKK IPEKPPLKAPVKAPEPAPKTQSAADERKNLQEALERLKSKSASQKAETGK SAAPSNLSSTLANLQKKVASGGGGPARSGSGSGAGGGRYGTGGGGAFDSY KARIADIIQNNWSFSSQMVRSTSGMEVYVSLLILPDGSVNEIRYDRKSSS EYLNNSVKNALAKSMPFPSLPREYGAKGIWVGFVFTPEGVGR >CT0379 hypothetical protein MLSINTQRTTHHNLTQEHSTQKLIISFNSSLTIIETMMLKLPCALFYFFS KNNFALSNPAAPHS >CT1694 hypothetical protein MHSAPDRKTSAIGNHNAMKTIYQQVASIYLNI >CT0365 hypothetical protein MEKERQRLVEEEKREESEQQRCRKQQVLFLISFIYHGVFVCFDDVYLAFS 
KPFGTIFGGSFSSIMMIFPTVSPYRSYGCYRRK >CT1594 hypothetical protein MSRSSGNTPHYRHRSLSCAISNCSTVTPTSSTLMSTSPTIAALLDESIQL ELNLAKLYTLFNDHFEEDEEFWWQLSMEERSHAALLQQEKKQPQPLQFFP ENLLAKDLDALKANNARIIAETERFAISPFSREEALNLALHIEMSAGEAH FQEFMESETGSLTADLLQQLASEDQNHAKRIREYMKEQGVKEKKQA >CT0688 hypothetical protein MPGERSRDFSALIVLKAYISGECKYAVILLHRPMRHERAYAGIPVMIETI TWRSDRG >CT1377 hypothetical protein MGDAARQAADRLDLLGLKKLMFQLGSLLFGLFPAADVPEKDKRAMLTGKN EGNG >CT1098 hypothetical protein MFLAEKNIQQSGARPDVSGFGNTDSRHNNNKVLMKAHLSMLLKNLDNNNA FKLLQALQR >CT0406 hypothetical protein MLLVLILWSLARFLCSPFDRWACKSEKDLVFKALNKFFLAMTGAGLKAGE >CT1445 hypothetical protein MGKRQIIYTASQIGGARELLDKEINLVTKERRVWHGYVTAIDQDKIELRD SRFWKHTFKVADIDKIYGEVVTDY >CT1026 hypothetical protein MERLGNNVSKRYQISLEVADGCRVHAVVIPVGEDNITNVTKDGGGVSYAV WSAFSNI >CT1290 hypothetical protein MLERYLPQPLTTLVLCDRQLHPGFGRVVDAVRGMEEDEKNLCNAN >CT0620 transposase, internal deletion MTRKKDKTPDIQGELIGQLGYPKHLPIVPNTANSRNEHGTSVRDMQAMLL WSSTRSKSPKRSSAA >CT0182 conserved hypothetical protein MNKYFYDGTPEGLVSAIGAILESGDDPEQTVLSIRQDTLFEEGLFLRTDS AVAEALFQRLRERAPDAVQTLWYFTMTEVDGLATSLLRYIALAFEHGDQV NGYLTHPDVKAVVATARKVGRELHRMKGLLRFEQLRDGTWLARMEPDHNV IQPLARHFSRRLRTQEWFIYDARRHSAAHWDGHALSFGTLERFSRPELSP EERVMQQLWQTFFKTIAIPERKNPRLQQSNMPAKYWKYLTEKQGE >CT1434 hypothetical protein MASKDLDKVEEELKAAPNREIPPINASYIEHQPALFSHSWPLLAHPDKGM IFFSGDSANTMNVFDQFMVSRGLYYGTSGLKAQPGSMRIFTTAEMAPGAK GRPKAFDKETKKGFSDHFPVEMVVDIV >CT1599 hypothetical protein MNDGRLQRFFWICAGTPVEIIEKYPTEHAKYFGIGATIFFTALFAALSGG YALYFVFAGAPFDWFASILFGIF >CT1081 hypothetical protein MTVFMMRYTDKTHQQLLNHAAPAIKHSMMSPGACRLRIISSVQ >CT1389 hypothetical protein MEQEIMRTWVWSTSSPVPIILLNVFLWAFYFVPSLLAWTRKHRSLPAIIA LNILLGWTGLGWIGAFVWSLSWPGHDNSQPPAAPTQTASEPDQEG >CT0101 hypothetical protein MGSAALRGYIPLLQYPFFVHITRFQQPLSLFVSKAVDGPNKLKGQLITMC FRGGYETGTYGIDLLFSTFRPALPSVTNDC >CT0171 hypothetical protein MKISTLYTLFAILATAANIEVQDISIRYYSGQYAIAISVGLGTLAGLLLK YMLDKRYIFRFKAENPIHDTRTFLLYSMMGAVTTLIFWGFEFGFNHLYHT 
KESRYLGAVIGLAIGYASKYQLDKRFVFKQEGAS >CT2053 hypothetical protein MACFHDNEIKLIYFKQLSKRGDVKKSIDNRFCFQDYCRQSGDVWDASAME LQAGNTGVFAFDRFQERWPWSVALAGVI >CT0485 hypothetical protein MKEYKVLTQKDRFFGGTFDPEKLEKAINSYATEGWVVVSVATASIPSLTG AREEMIVVMEREK >CT1319 hypothetical protein MPRLSSRGQSINKKPGFSPISFRPHPYPLVISTITGDTAIHAIHFRSRIR >CT1991 hypothetical protein MDCDYSFRLPVLMNTGRKNDDDTPHEHPFFRSSSV >CT1478 hypothetical protein MFLKKFDLILYGFIDSIPKVRTDSAVFCSSVDFTYYYSSKSIPVLRKKKA GESSQMFNQSQNHP >CT0715 hypothetical protein MTMSSGCFSPVSLEFCQGFVQELSGKRCRTGAGTVNWLIMYDFHRIINGS VKIFEVHLRLFSWYLTLKQT >CT0146 hypothetical protein MTKYLTELWDLIRLNPKKFVIRALLVLAAIGFIFGDFGLVTRISMELENR KLEKLLAEEQEKIVELRSTIKNAYQPDSVEKVARERFNFHKKGETVFIIR EK >CT1459 hypothetical protein MMKHNTIKIIAASAVFSISTGIFAPSFCLSNAATGATNIQGNQNIVAGGN VLIYHGLSAKQ >CT0761 hypothetical protein MRFQPKPGSLPKQICLVSDGDFETVNLRINACKS >CT0627 hypothetical protein MSEQMVVWWDSESEPERWRVEVQRKNSLRQYESV >CT0119 hypothetical protein MLILYSQVESHFVIQLLRLAMLVPGVSNSKQE >CT0642 hypothetical protein MHRKERALELFSNRCNCSQAVFAAFRQTKVLDEASALRLATMFGGGVAGS GGGMCGAVTGALMVLSMRYGMGGVEELVNRKKTYELGRQFIEEFEKRMGS ARCESILGLCIGEPENLQKARELKLFETVCVSAVATASDILEEMLCAEG >CT0353 hypothetical protein MNLIAYLNLLRPSKQYPLDDTVGGALEAITMLLP >CT2100 hypothetical protein MMEGIFCTVLDLKNELLNIKKVGSSQNLVGGMKDAHMTIPDEQV >CT0998 hypothetical protein MVIVKLICSGFPYQRLSTMAHNDFIVQKTTIKKANTATALRLYDGF >CT0515 hypothetical protein MPSSSFRRDALPSVMTVMDRRMVLFPIFCIIRTRTGIGRKSNRPSPPGAP DNRLFAQWLSPRPELAEKKQALNRCTFYK >CT1966 hypothetical protein MQKVYKHVLSGLLLLMLCILAAGSFGVGAAGNSDDGATTTNYKIGETAHV GYMSYAVWKAFYRNQLSDNPYINQPPDAAYLFVDITVRNDDKEARTIAPF KLIDENGAEYETSSNAWSVDGSIGILDSLNPGVEKRGYIVFDVPRGKHYK LEVSGGYWSSDKALVDLGLK >CT2252 hypothetical protein MIFIKMSLLNDLLEKTLSEIWEIMPWNLVDQMAENSELIVISMFEVRRCS KNEFCENEQR >CT2198 hypothetical protein MLGQFARFWTFTPASNGKALARAFGFYDNALRGGEVGPGNGFAVRCVKD >CT1439 hypothetical protein MTGIAVRIGETEIVTGTTGGIMTIEEIPATGMTRTAIRGNPNTDPGQA >CT0489 hypothetical protein 
MGNSELQPVGKSAGGQAWLLLALDELLAIVRECINPKVSRSGLDRCLRRH GRAL >CT1804 hypothetical protein MSSIRITQPNGEHTMKKMLSLAAMFAVLAYASPASAELKLSGDASVRLRD VSYFGDADQFSFTGSADDDVVYQYRVRLNAAADLGNGYFFKALVMNEDRN YAGGWQSVRHGNTETISLDISNFYFGRMLENSHWMVGRLPLNSFDNPIFD LTLYPAQPLANPVYNINFDRVFGGNYGVKLGNGMLNATLCVLDNDSHNNT SADGDGLFNDGYALHLDYKVNVGNVTLEPQFLSVLTNSDIWYQDITGRVT TLAYKVTPYTFGALVGVPAGNAKLSFGGFYTTCDDTTPNGGPHVKYDGYL LRVKGEIGNFMAWYDYNHTTVKPGGNDIKLNNHFVWAQYKIPVYSSAMGS VTLQPTLRYLASKRDDGFNNYSGERLRSELWATVTF >CT0549 hypothetical protein MLCLKSTKIGGELKINIKRVQSCYHLIFPVH >CT1687 hypothetical protein MNLQAHGFQLREVQFFYCRGFGWLLSKDTSGIPAVFVTGRCL >CT0654 hypothetical protein MALTELLNLFIRDGHVSILHVTHEILRRTENKNEFLFKTT >CT1281 hypothetical protein MTASKESSAQRRHVKSTLVDDEKPTIEVMAADLTMPETSK >CT0932 hypothetical protein MGMSKRVEDLMHLREITCKIVQGVFQGKRFSWGGDWRRVGGAGLCIALLF EKFFRNQLTFWFFAEFRE >CT0061 hypothetical protein MSARCKSKLAALLMAAGLGTAIPPASVSATLRPADVRVSAEPDSLFAGER LRYVITVQHDHRDSISVVSLKAGQGTPFEITGTKSFSKNLPDGRAEFRMD TELAVFGSGRKPLPGFTVVSKHASAAEPERLVITPSESVTVLSMTDSTVT ELRPIAPPVSAPFPTWLLVPVLLSLAALGLAGYFVKLLITALRRHLADPG RAARNRLRAIHRQLSKGLQPAAGYESLSNILREFLQKRYQFGALEMVTQE IADELAARRISIRQELIKLLDEADLVKFADRRPDIEECRRSLRIAEVLVA TAAEAETTEKEPLMEQSE >CT0723 hypothetical protein MDTVPKALLRELSPELAERFPVSRLSLRDIHLHEKTASANRRTPAVLVVR LTKNRKGNYVTFKVLLYLGISFRRPSGQGGRSDFRRRAG >CT0366 hypothetical protein MPSSPKKQKRKAEAAAKQPPAAAKPAPATMPLGAMNYLFIALGATVLALS YAVMYIEKSVDGFFALDIAPFTLVGAYAWILFAIFYRSKKKKN >CT0827 hypothetical protein MGELIMKKTIITSLVAIAAFGFAGTAHADSFATYSSLNTLSAGTDDPNGV SYGYSYDWGGVSSSCCNGSSAVGSYTEIDAFGVGDGSLSAVSGFQGKAEQ GANSSFATAAGIAASSNSGLTNYGYPVVDVSVDGGAYAQSSSYASYSWMT VWGGSTSW >CT0623 hypothetical protein MKNDRSIDSTILTLAGAAKHVARHLANVMNRIPEL >CT1122 hypothetical protein MGKRYEIGADFFREEILAAMLFGFRNVKNPSTVTVHPELMVKIRESFMDK VTSPKQLGDVEVFFGLTVIEDATKAKDYISVN >CT0354 hypothetical protein MLIGEIKSRRKCFRFNDHDAIVISSILIVIFGKTGDIDCSSFASILYF >CT0586 hypothetical protein MRLEPEVVLGAFTGLVEKHLEGILCTEKAIAVSKEKR >CT0572.1 conserved hypothetical 
protein MKNGFTVSKLLCSMSRFAVVPIRRVSVLFIFSIILLFADGGNAMSRMQPP DGVVAGVVNAFGSRDAVRLNRFVHPKQGVVVIYRQGVFNVFKAVSRIDFR KPVPEYFPYPKIRGGAPLRYAALPVYDCGREAWSKTGLFCDPKHRDVLLS TMAINLKRSGLKEISQETIDRFRALEAKSVRVVLVDVNGNDLVFYLTRIG ERWYLTILDRVSSDCSA >CT1460 hypothetical protein MESGLSALRQRSLAEQLLRVGIRLHHLIEIV >CT1093 hypothetical protein MSLIILFYPGRHEYTTVEKIGTYFVIYQYSSSFPATSYPSES >CT1314 conserved hypothetical protein MLSVFVVNLRSNSAPFSFSRLLMHTTIVNERSLRTCNFPITLQDIRTLKE LYRLKAETRDLRKPIVRNIMKQRVVGKGCLESLKNALYSLETIYIDDYTG QRLLRIDGMKQIEVDLTYEIRELQKDIYYLEYGEDRFIEYLAKFIPGFTD YVTEGVEMLRGKSFNAFITDRDGTTNNYCGRYRSSIQPIYNSVFLSRFAK NCCRYPMIVTSAPLKDFGILNVSINPEHIFVYAGSKGREFIDIDGQFHSF PIEPGKQELIRLLNERMQLLLLDPSFEKFNFIGSALQMKFGQTTIARQDI TRSVNEAESAAFLEKIKGIVRDIDPEGKNFRIEDTGLDIEIILTIDVDPK TGMIRDFDKGDGLEFICRKMNIDHTGEPVLVCGDTSSDIPMLKKAMEMYD DVWAIFVTRDEKLMQRVREICPKSYMVPYPDILLTILGLLSL >CT2103 hypothetical protein MKPMYYLVAAALSIMLSIYVFIFGTWANSQLVAIFIGLWAPTIICLGIFN ILMNIHDEMCCAHKRIEGRQTGHDRCGGG >CT1956 hypothetical protein MHIRKHSVCRNILRTFEIALFFAISVLGYGLLLKASSFNTKSKKRDRESI VVQNAVDLNGRHRELKVLSGTLLFPNDTKAALPNRYTFTGQSFLAISPLP ALALHLLCEKELTVNYRNRNSRISRPYNLFEQNPVLLN >CT1789 hypothetical protein MSQRVEKGFELFRKLFFYKAFQHSYPNKGDQ >CT0448 hypothetical protein MFSSNGYGSAPNIEYLSLNATDDGLERYRGLKQ >CT1984 hypothetical protein MASFCRAVSRDGEEGAAVIMERYFFPTSTSTSSAHQRRFCMYSMLPDFC >CT1803 hypothetical protein MLDKQRMAILNTQGADSQVDRLSRRMIHKQFTLQ >CT1613 hypothetical protein MPSRKTPNLSRPMSGNESMPATPQMQAPKKKGPKVIATLFVIFVIIAAAG WFWINWEKPRIAPEPLSPELTTIIQNMPGISDAMIYVGLKDIRESKFWNE VVPDSIKNSPLLSLGKRTDSLMKAGNINLTNDLDTLLVGFQRSGRKQQNY IGIACGPVARKAQAPFLKSASLQTAEVAGRQAYEIDSTLWVSPMGTNRLA IASSSNMLEKFFKPSGHLFERDSTTASLIRKTPYKSHVWFALASPQWTAG ALQSITSQNRDVKSVGNLNRLQQISMSVKFDDNGLKGQSEWVYKDRQAAF FASTFLWGAIKLSSISGTRTSESTKELLKHLKVSQNLESVIVTADLPETI FKKSGKKE >CT0208 hypothetical protein MDRVKKFSLFVMTAASAVVIVLCIAAALVLNSGMVDLFAKKQLLSMFNNE YRGRLELKEVKLRFPDEVTLVGPGIFEEGAAKPAARADRLTLKFNFLSLL RPKITLLSFREVDVDGGHASIAEYPDGQLNIGKIFTRRHPELPEMLAIEK 
FRARRLKLRNSTVSWKPANAPAYRLQNLQLDMSRAFVAKYEFMGTIKQMQ FTMPDRGLTLKKGSGSLAFSSVRSDVLGLDLETAKSHAKLSVSIDGLDIF SGISKKSLLNKKTFIHIESLGIDTSELNRFIPIPALPSGVYRIKGDAKGT FSDLEMLPVSIEHDGSSVALQGKILNLLDPESLSFNLQIDKSKISSALLT KVLTDERYRSLAKEAGDVNFSGMLRGRLDQWMTGIDFKTGLGSGSTKFDT KRLGGGKYQLDGDFNIEKTEPHRLLGIRGVKSGFSGSGSFNGTGSASGIE NAHLETSVKSAFWQQQTISSGSVTLDLKGKKADLSSDLKSPDGGSLIMAG LIDFSSLAPSYSVGGSVKKLDLSKATGLQDYRSDLNGRFDLKGRGFDPAS LNIKASFVLEPSSFSDFHFRERSAISASIAQSAGSSAVSLESEAVDLAVQ GSASMSQMIEALQMAAACIARETGSTAAIRLPRGPSPWTFNYKLAVRDLT PLKPLLPAKEFRFKGSASGKATLSGGRLSMDTALSSTTLSNGPSFQLNNT AMTGSMQCTAAGVAAARLSGTAGTVNTFGRELKNLRLVSSFDNGRLAASL DLAIPRFSEKLSAAFTARRSGNAAAVSIDRLAFTTPSGVWQTAPGGTLDV AKEFIRFNRVRFAKGTQSLQLNGLLSNSVSGTFRGTLSGINLTEAKYFLP DSAQKPMSGTINADFTVSGAPGSKTSDLDLRGSGVTWDQLNVGAVHLTAR HAGEQLRFEYNSRGAATPAGTTATVPVNTITGSGSIPLVLRYSPFEARIP DNRPVSISMRSDDLSASIITYIVPIFDHAEGVIPTDLRVTGRMPKPEIFL TTRLRGTDVRIAPTQVTYRVNGQIIGTPSRIDFGRLEVKDAENGTGAVSG MIGLDGLKPVTVNLSGSFSNLLLYNKKDMKDDTSFGTIRGTTGNLRFYGE LSAPVAEGDLVLNSVNFSLYRKGSNESAKYIGVEKFIKFVPRRPAPKPVE AAAPPEKLEFHYNLLDILQIKNLRLSNSAPVKGTMIFDRIRGERVEAQMN NLSLVVNKTGQRFSLFGSIDITGGKYTFSNTSFELENSGRVAWNNEEIRD GRLIDIYGVKQVTATDVQTGERDNVRLLIAVSGTIEKPDVRMGYYLNDDT QPYSSANMIGRQSSHVDPNADLNVISMLFSRQWYLNPERQGSRGVSPVSS VGISAGTGLLSSQVSSIVQNLAGLESFNVNLGAGANGNLSGLEVSYAMLV PGTGGKVRFVGTNTTPVAGSRTTTNYYNGSSQKIEYRVTPKVYVEAFRSY GMTTSDATYTNLQKPTENWGVSVSYREKFHTWSQFWDHLFGGKKKGREKD KDKKE >CT0405 hypothetical protein MINMNDSNFPARITPALSLQRQHFYAKRNHNCH >CT1294 lipoprotein, putative MLRANRLTPIMLILALILSACSGITDSGLSEKQVEKLVRASRPNNPSPDI WARAIKESLEELGQPVDKEHVSAVCAVISQVSAFSISPKNSRMASILRKK IEAAESNEVLRLLIETRLDQTASNGRTFRENIDSIQSELDFEKWYDEFTS ASVTKPILLVLKKDASDLITTAGSMQVSVKFAEEYPKKPRNAGGGSVRDM LYTCKGGVFYGTAYLLDYKHNYDDWKYVFADFNAGHYTSRNAGFQKMLGR LTHRMVDTDGDLLSYENGNATPSVTYVTFINFLKDKGIGFDEKKVMKDFQ QEKSYDFEETWSYKTLSELYKKKYGHPIYAVLPDIPLNSPKFVSKNLSTK WFAERVKSRYNHCMRTSI >CT0788 hypothetical protein MRLTRKCRTGRRDCNDCNLLLINYIKLKFFLNYNGSIFKLMETIYFSKIV LVSIQKTISNLPQ >CT2228 hypothetical protein 
MMVCVVFNGRFSIALQALSPESKKKMEDRMMKMRGCSDDAASLDPETAGN VRLGIIDRIRPADGSARIDFVGA >CT0165 hypothetical protein MASSLKNRKFRAENCCGWLFAAHRSSTHKGVAGLPFMCFPMPASYR >CT1688 hypothetical protein MAFSCGAQTDGLFTVSGVRILFLGLEKHTAAFSDDILTEAGTDGRIRGID GKALTLLALLTGLCFLFVHDLHIRNTVYRFNNG >CT0891 hypothetical protein MATSAGRMMRRLEWPLFRFLLLIGEPYIVRGQLLLPISDEPHNR >CT0426 hypothetical protein MLKMYVDYWVAVLSGFLQQYFGVKSQKGVTMIEYALIASLIAVAVIAVLL TVGSNLKTVFSYVGSNLTT >CT1118 hypothetical protein MVGFDQLSVSMMTLPCDGSCETLRQHYKSHTIAAMKQHRAFFGNATVRRI EKTFFSHPLLHYTEKEGTRS >CT1585 hypothetical protein MSKIKAKRVGFRLDMTPMVDVAFLLLTFFMLTTKFRPPEAVTIDLPSSHS NMKLPESDVLTVTIAKDNSIYMGVSSQRTRERLFDMVVRPKLENAGVSKA AVADSLSRFRLDDSFKIEKEELARYIMMSRFADQRLRPVIRADNKADYEA VNYVIKVFRKMNLLNFNLVTVLEKEVR >CT1322 hypothetical protein MHQDRTPGRGKVMARVNGSVWFEMKNPARRKVLRDFFCGKTITVW >CT0741 hypothetical protein MRGEKIQVVARLRDRSLPKFFLCFLTLILATKRIKKIYYRMIKVKKHVI >CT2096 hypothetical protein MTTRRSIAIAAWCLLPVALWVTSLPDNTVSAENKKTILEHADQIEGGEKA GPSGTAIPYRSAVGNVKFLHAETTLECDRATDWPDSERIDLEGHIIIKDK NVETRADRGVYHTDSETGELSGNVRGRVTGDSLTIKSGRAAFDQHKNELW LFDDAVAWQLGRQLSGDSIRVHFHEVGGKKKVDEIQVFGHAFLAVRDTLS ASPALHDQLSGKKLTANLDDNSRLQKVIAIGKARSLYHIYDDKNQPSGVN FTSGERIRMFFAEGKLDRILVTGGPLGKEYPNYMRNDPEINLPGFRLRDK EKPVFAP >CT1413 hypothetical protein MITFIDHVTAMKELSRKFSADEPDAPNSPGRLISRPVRDSRPAFQDLRFI VKESLNLGFRFFDEVDRMLQKFKGEEPEKQNSEEKADDPS >CT2203 hypothetical protein MPHERRRVTIEEPPMKNKQPVPHPLHGHAHKRKYEKRLFRSRHKSIGQPP GSLIHIGE >CT1858 hypothetical protein MRLFSFTRECHAPAWHELKVNMSCLEELSMKAIS >CT0520 hypothetical protein MNSRGSLRGCPASGGTGGNLIGQKNFHISVA >CT1775 hypothetical protein MNYSFKTLWNAMFLAVGPVWFVLVWMIWSSGQLKTAEDHTLFLGLVIPGF ILIYVSGFLIQKRHAKKIQGQHS >CT0080 hypothetical protein MYNKIHPDVVLDEADEALDEPNYNNFNTSPDPPSPYADLEKKYRKNKFNK TRTNTSPGLYGTHGLVPVNGTKPAGKNIQKKRGKKAVASKPFRASNGNKS A >CT1864 hypothetical protein MELMAKDIRRAIGWSSRVTERLVAGKAMNSISVNAIPLFFIHMFLIAQEM SPRHSDR >CT1404 hypothetical protein MIVFDAAKRSSFAGENNRPEPVIAHFSYIGRKSTAG >CT1952 hypothetical 
protein MRQIMPLITIKALRNPQGFFFCEQTISKTTYFGVCVNSP >CT0925 hypothetical protein MHLSKKILHQITSCLTIQSCSPGTPSAPGRLYSLLELSQRLTAFDGFAGG GQDGLDGSGGVGGDVDAAHLFIRQIVRLWFCALEPGGGSGCRDWMVETGC GNEAGVRPSSPSITSTFTTYFFPIYGQS >CT0691 hypothetical protein MRHISSTQNKSHNKALHRAAIPLRSIAAGELGR >CT0069 hypothetical protein MMKTGRHQALAQLMQRSRIDLRQGNNISISGKQ >CT1004 hypothetical protein MLPRSISGISADGKQFIDLIDGLTLPPLVRNKYLRYILDEHHSAVYAYGV LALRGDSDMGEIEEVLDVVAANAEHYIMGHGKVIRGENGNVSGPKP >CT0737 preprotein translocase, SecG subunit MLNSFVVIFALLAALLLIVSVLLQSPKAGSGLTGGISSLGTVQTLGVRRT GDFLSKTSAILAGLVMVLCFIAQFTLPARHQEGTGSSILQKSAPASLPVN NLPQSLPTGNIQPAAAPAEQPAAPAK >CT0445 hypothetical protein MSILYLAGHLFLHFQGKVMTFHHGIGVNIACGGTPPGVKVNLGFGSDDGN VFIFPGMGSDVIGNFKGLWKGIQAVGKAAGSVRAHPKEKCTLLEQGESHD KSGNEHPDAEPAQVRHAALQDSCQRIHAQTSPY >CT2068 hypothetical protein MFGLSIEPEWFFMPQHACYNLELGRVCEQVVFDELVA >CT0276 hypothetical protein MGVLWWQNYVNQRYQSVEITRFYSGTSPRGGNKLALTDQQKTAFRKLRQE HFRKTMPAVQKIIEFKKEMISEAVKPDPDLQKLSAIADSLGKRQAWLEKD LALHFHELAMLCTLTQRDSLKKLLSNIYTVRYQKMTLWKGRPHREDREDN HRGPIPPSAPEP >CT2013 hypothetical protein MKEFPGLKILAALLIPLLFCACAVDRPPTGGPPDRSPLSVTSTLPASASV NTSPQTIRIAFNHYVGRNDLSKSIFFAPRIDDYEVSIHGKEADIRLYSPL QQNRTYTLTLRTPLKSLDGNHQLDRSWVLAFSTGPVIDQGTIEGRVWTNR LAPMQNATVMAYNASRSNAVPERRPDYIAQSGPSGEYRFEYLAPGSYRIV AITDNNGNLQFDPETEVFAVAATPTVQTGMAGVGLRFAPEDYSARSLQSC RIINNREIEITFKNAIPARSFELSAIRIENTATGASLPVLGYFSLSRSSE DTTYRILTAPMEDRAFYRLRFSPGDAESQTSELTFSGNAHTERYPELSVS IVPANGADNVITETIRPESGSSIELQCNLPVVESSVKPAVTLSLSEKGQQ IPVPFTISRIDSRTFAIVASQGFQHSKDYLVQVKPGILKGLVGEPSKTAL VQSRFSTAGPDAYGEISGSGRANAPAVVVEARRTGSEASRRMVAKTDASG TFRFDFHDLPAGEYTIAGFIPSASGAISPMTRWNSGSVAPFAPSDPFAAL TITVRGGWTTEDVRLDIPSARRSGPDDAKSPEKP >CT1517 hypothetical protein MVSNERISTILAIKNGGSFQRALTAGFRPDYQSLLALCDGEEGRNLSFFY QIKTE >CT2090 hypothetical protein MVVKFRLWLKLCSRKESFFSMKTNPAGSVVRSGRNKMLTGGGKSGTFHIF MNEKQGFSVLLSFMMMAEACSRIVLPVSGGTV >CT0783 hypothetical protein MLTTSIPLLPFHFHHQMLYRSLSPNAEEYGVFSCLNAETYNWQAGLN >CT1636 hypothetical protein 
MNSEASENRPAHSATGTLWSWHGGNGALIWQLMFSSDAVMGIKRFPQERK AAFFCLESSTGRVLRDDFVLTAGDENETPVGDGWMIGLETVHGSRLVCHT YQPGSPEHLGIWAINLPEARVVWSRPDLTFTANLGDAFLAYRSIVFAGFP ERDYVLIDPLSGCELEHLGTAHERPNQLRDAAQSEEERQRILLPDTVFDE AGHVENINHGATSVTVFHRMEPVAEGVPGWVSTLSVSECERLVHEDVMAS GEPMPVFNSFLIKDDRLYYIREREFLVSFVVS >CT0875 hypothetical protein MHLVVRYSVDQGRKHLFDLRRSVDHIGQCFVIDPVLRDFRSYRNNSEGRF LITLYVLENRCETARRVTALRRGTNEVLPKKTSSKISIFFPENDQIIAGF TLKSLFNGSF >CT1086 hypothetical protein MMFPASPQFHRNVSSIALNGISNSSFRPGPACCSLFSGLIGI >CT1924 hypothetical protein MSRSQTACAFDANSTTIARVKSTGHGSFTMTMCRTIEGGLDDLGSPRGSK LAGKLLSALKEWRNEPVALSFSPAELMTLPAWFPSGSSAEYCDSLCRIEA GYFLHEADRWQWHDMVLEPTPDQPSGLDRRMLLFYPVKPAQFIENELLKH ARVGWRGVHVEAVARLSSVTGETLAVLELEERYAALSISTNGKISYFRYW PVKDGSEREYFAIRELTSAPIDGAPVKVTGSAASAKVIERIGRETACAIE PLELHPWVSVEKGASKGKSPTATIRAVSTAIMALNGG >CT0408 hypothetical protein MRQPSHHHHTMAGGPDGETEMVSATLDSALAEKRGGLMQKKA >CT2093 hypothetical protein MFDGHRRRGSSGGRCDYWQKHTEQEKKQMPCWFHGALVSDGGKFWGEFPA SLAEQCPAISDNFP >CT1169 conserved hypothetical protein MDVASALEGNWEFRVDHKNIKIFSSKIRGSQVLGFKGEAVFEASLRKLIS LFHDFGNYGKWVHQLSEMEVLHKSDELDYVVRQVLNTPWPIPKREMIVRT ALHASEEGALALTMTGIPDYVPLKPDFHRVREARGGWILMPVDGGKVHVT FVMHLDPGSDIPPALSNAALFEVPFYSLLKMRDLAQNPSYKPAWPSVVDN HVTIIEDVPDKH >CT1522 hypothetical protein MKTIMEQALLDQALAMSPNERVEFAQLILASIEHEDEKIRQKWITEVKDR MAALKSGKAKLIDFDSLYHED >CT0096 hypothetical protein MFRIHEVSRSWKYSKSEPENDPETIKIQFFRLRVW >CT1226 hypothetical protein MLPLVRYLKKQCIRAIRKHKLKLFYLKSSFPSRSVINKQF >CT1619 hypothetical protein MLDLFLSRDKPDSHRDRAAGWAFRFHLSAVVGTGAVSS >CT1974 CRISPR-associated protein, CT1974 family MIATMLTLSRKDVKALRITDSYSLHRVIYSLFEDVRSEAEKRSSVPSGFL FADKGGDAKGRKILILSDRPPLQPAHGELVSRPVPEEFLQHRFYKFEVTL NPTRKENKSGKRVPIKTREEVAAWFGGKSQTSWGFSVDPARLDVRMLPVM QFSKQGDRTVTHGAARVSGMLRVENRDLFIESFNKGIGRGRAFGFGLLQI EPLKDNSNH >CT2063 hypothetical protein MNIADVAKKGFKALFCFPIHLRLKGYRRMSTFLQSNTRAVDF >CT1269 hypothetical protein MPFFRFIKEMKQSFMLHPVAAHFSNGVIPVAVLYLVLFLPTGNPFFEHTV 
VHLLLVSLLAVPFSFYSGIRDWKTKYKGAKAPVFQTKIRLSILLLVAGIL AAAIRLAVPDVMHEGGPLSWLYVATLLVMLPTVVLLGHHGGKLAAGQRSE RFR >CT1121 hypothetical protein MSNRNLTTDPRWKSILRPATTCNHQNISDMAMTVEIRDNKLCIEIDLEKP TPSSSGKTLVVASTHGNAVTDVMIEGKPVTIGLNAYIKK >CT0212 hypothetical protein MNAGKKGVIYTCITGGYDELLNHTFISPEWDYVCFSDDMGINNEKNAQWE IRPLRFEKLDDVRNQRWHKLHPHLLFPESGLSLWVDGNVDILDGEIFHDI DRALNANLLIAPSLHPERNCIYDEFDACRQLGKDDPDVMGRQEYLIKKDG FPKAKGLFETNIIFRCHSHPMVITIMEEWWYWVEQYSRRDQLGFTYVLWK NNYTVEPLSPVSYRFSPGVRFRYGAFHITKEQLIKEKAALEIKVQRFEAL ICGRLVKVLYKIRKSTVKRWCRVKMQLLNSLCCRK >CT0507 hypothetical protein MKKHFIISMIVALGMAGFTGVTFAADAPAAKPAATAPAGEKKAEAPKAEA KKKAVKKKKAAKKKVAKKAEEKKAEEAK >CT1980 hypothetical protein MKRFDTNIAIHEKILLKKELFRQLRANSAYQRSNSFSFPGFFYNIVIF >CT0222 hypothetical protein MSPDVIHPKEFREGVPDRALNQRQFQMVLASRPEKMILTRTGHFEFLKET LAGAGFKSPMEAVSAQERRALVGKISGCYDPIVTSDFFRLPLDRKIRYAG SLASTFLKRLLNKRKDCGAVFRPSTGILALVFAIAEHGRTADYVICGIGV RKRNEYLSGKQVKGHDLPHHVFADVKVLRKLARRYNLFTTEPELEHLVPR YRSG >CT1844 hypothetical protein MLPMRLCSFRVQKKQLFVALSYISKKANGLKSPQQ >CT0901 hypothetical protein MRKSAFKHNQNEKIPPPEPPCPMVGFISSAPLHQRLI >CT2277 hypothetical protein MRYDGIPFRGASFAAAIFMGLSAACHFEAVSEVKPRRPRRKRDNEMCSLP LSFRYFHFQSGFRSQTGSTTLASKKRLRIQKTVQRRV >CT1900 hypothetical protein MKKILLSLALLSAAMLSTRPSFAFGHELLDEPLAIIADQQASLEQTAKNS TYELLVAKRSITLKNGVFKAGDNPDNFIEARLVRSVICDLNKDNKPDIAV IIEHHGMGSAGFFELSALLSGAKGFTQTRPVLLGENIEIKEFSVSSNMWR PEELDIVYLGHQESDSHANPTEQKRARYFLDDDGQLSNDFSHIQIVKKPA LYLYPVRTTKIEVRLSPKGKVIRTIPDYNNRWRVTVQKDGMIDGQYHYLF YEAALDKKIELPRRGWSVRYGDLAGWFDSHLHEMGLNRAEAEDLKEYWLK NLPDSPYYTIRLIEPDVVNKRLGLKIHPKPDSELRVLLNFTPTEKPEKIK APKLTSFRRKGFTAVEWGGILDDGRMAENVH >CT2218 hypothetical protein MQGHAISIQPRCVITPMLYAFEKLATGQCPAYSAFDSIYDTYFPWRY >CT0902 hypothetical protein MKLVHRFPVFKSILAAFALLVVQLRAAVAEPLSFSTGQTVSVPSYSHIFV GNRLKTFDLTTSLAIRNSDPETPITVTRVDYFDASGRFVRAMMKTPLVIR PVSTLVYVIDESDKTGGVGASFLVS >CT1073 hypothetical protein MRMQLKNRSKMATALLASATMLLPSAKNALADAAPEEGIFSLKYLNYHDT 
QTGDTNLTAGMSMDRMTVNALSFYGMVPIAGKWSIAGTFIEDSVTGASPA YHGWGFPSESKNDSTSGASGELRHAGDISVTRYFSRGTLSLGTSYSQESD YISRGLSLNGTLSTENKNTTFSLGAAYSSDTVYLDKPAVIESKQSDTPGR KRIVSVLLGVTQVMSQNDVMQVTATYTHGDGYYSDPYKDPDLRPGKRRMF TLMTRWNHHFDGPDGTARLSYRYYTDTFGIEAHTFTAEYIQPLPHGWEIT PTVRYYSQSSARFYVPTEDDPRAKTPTDGMEYYSEDQRLSAFGAFSYGVK VLKELGWSWSADVKYEHCEQRYDWGINGHGDPGIPAFSFRSLQVGLSRKF >CT0060 hypothetical protein MIKRRICHMFGAMIPLLLATLFMAACNGNTPKRVTDIDGNTYGTVNIGGH VWMAENLRVTRYRNGDPIAEVKEGASWTAQTAGARCSYDNSPENGKTYGF LYNWYAVSDPRGLAPEGWHVATDKEWQALADALGGEQEAGAGLKAPGKWG NSSGETQSSGFNALPSGARRDADGVFLMLGQFARFWTSTPASNGKALARA LGFYDNALRVGEVVPRNGFAVRCVKD >CT1306 hypothetical protein MGCFQSGQIPGFSIDLGHCFVNPQNRLFPMAMMDPASWMFVLVVSIVCSI ACAIVSFNKGYRGSPVFAWFGAGFVFSVFALIAIILAPHHHEV >CT0709 hypothetical protein MEQTPLRPLGEVMAMIEALGHEVTYAYDDLVFINHNDFLLQFDAAEPNAL ALFFNTECNAAEADHVAARMIPEGIEKGLIIRRKGTYTMTEAESDNLQIT FNP >CT1770 hypothetical protein MRKTTGTLFVLLTLVTLILQGCYSFSGGALPPHLHTVAVPLFDDTTQAGI AEFREGITRSLINKIESQSTLSIEPDPSRADAVLKGAIVSYSDEPSQLGS ATERAVTNRITIVLQADFDDQVKNSKLFSQTFVGFADYQTGNYTAQQTAI QSAYNMALDDLFNQMISNW >CT1403 hypothetical protein MGRGVACSGAVCRGVTGSHAMVIDNLLWSVAIGWWIRELIAIRTGGRRAR SPSS >CT0907 hypothetical protein MNAAEVIAALELPPGARVDRRIPKTLLVERGARTATDKRRINEGVEEVQW VATLKPSTIGVPAFRDEVREYLEINVLSATLRGGAKAARFAELIHRAVPY PVFLLMAEGTRLTLSLAHTRWSQGEAGATVLDGEPIAVAVTEAETEGLPS SFRQALSLARQPRADLYQLYQGWIDTLLALKAAEVTGRFAVPTSADQAAA RREALRECARLDAEIARLRKAAAKERQVPRQVELNLELKRAEAARAAALT RL >CT1370 hypothetical protein MRISVRFIMLLHGERWRFRKMSMDGSEREALLSTSSI >CT0953.1 hypothetical protein MAPGRIYLCEKGCLFSSRQLHEGERETLPVYGQSVMLVKLHQNGRITPAI RWAIQSSSLSVSQLAARHGIGKAHSPAMEKPRPG >CT1452 hypothetical protein MKAAGRYVTIFFLYFLILLFYARRSCGEAVV >CT0341 hypothetical protein MLHCLCVIFNRAIHPRVARGSCPETVKPLLAPYLLLFPCYIRHPKFPAST LAASAQLVQACFLWFMKSIRESG >CT1679 hypothetical protein MLLRVCVWEDELFMVWYKKGDAFENSRFSNICGRDLMIKKFFAALQRRDP FLLFVVVLLLVTGCVKRDKRIRSITIGEQVWMAENLATDCYRNGDPIRHA KSVEEWNDAISRQEGAWCDYDNDPASGRLYNWFAVADPRGLAPVGWHVPN 
DEEWRELEAATGGRGFETAFTGSRNCLGLFFGQGSTAFFWAATPSGEFDA WNREISKTGGKMQRVSVAKGLGLSVRCVKDN >CT0419 hypothetical protein MKFSVTCIMTAMLMSVAPADVVRASGRGDEFGAEFDGLESRSWYRETTTS NSDSDRDFFSRLRVKSAGQTLKRQVSQIELNAGSSSNLPTFRQRYNETST TRSNPDLQRERAVKTGLALSFAPSEKLTFTWKPAAWLELTPSYLYQHARN EETGLWLPPFHSIPCRDRSCSNLSAVSPFAPT >CT0882 hypothetical protein MLPEWLPQKPAQGRYDTIAIPSIFNSGCCIGKRGITAIEIEDGMIRLVYW TRDVQKYRYSGERLHTVEELGNSGIYRAVLNEDYLDYVFSRIRLLA >CT1356 hypothetical protein MTQQTNHHSGTANGSRCVVLDTASYCLLLKGDSPPFSAEERSLLIAESPE EACDFQCLLGDACQPFLSQEYRRIHPGLYEKLEAIAISGWNNDAPFGIDT AAHLMKKLPLDLNCLINSPIALSPNDPLPALIQKNLVHKHVEANVLVSEP FTAGRLRYFNIFSETAELKFDHQSPHVQGLLILEALRQAGIASAHYQGLP LDGKLALLNYNTSFYHFLEQESPIICRCYTDFTSSKTSDDAEACIYMQVF QWGRLCADAILKGFACTNAERYEQKEQRLKKIIERHKTNFDSKLKRMYES MVSTQCM >CT1254 hypothetical protein MAVTGFFFLFVVYSGRILVFLIFFPVVPSITVPYL >CT1589 hypothetical protein MSNTKKGLATAGYNDDLAKLEEIFLPNFFYKKMQYYFSSSGLISNDVLFV HYTSTESALDIIREKRVLMRNALHMPDRQEVQDGFNIMDGLLSNENNHWV EFRNRIEVVLPGVVDRVMKIYSDHSHGRNDGTYFLSVLEHDESEKELGRL SMWRAFCGQSQPVAMFLRLPALSAVSQVLRIFFNPVLYKGKGQQHLELAE VIKNVENHKSFLERLDPDLVTSAIVSMILINVLCVKHKVFKEEREWRCVY LPKCFTSETSARLIEPGVEEQVGASRNVYKMPLNAAIDPVLSDIDLSKIF DSLIIGPSKSPYATYEVFCDELKKIGVSDVESKVRVTEIPVR >CT0349 hypothetical protein MPPDSALLAIPDGAALTGGRLISVPQFVLYRLFRQDAIFTGNCLTIHRKL VS >CT0314 hypothetical protein MNSSTKKTPSQSVWTWIVITLLWGSVFFATSTWILGIVSSWFDGGAFSPD RAEALRVYAMYVPALLVVALSAMVIQSRLDPGWQKQREREKAVRAGKREQ LFVSFAASIATSSLFTLLTAAAHMLAAPVIGTAVSFSVKTVLVAAGLNIA FGIAASLFVGMIFLVFGVAKGGSKA >CT0962 hypothetical protein MMAENKKWCQKNAGNRPSENLKVKSGRLIYQVMLRQLAVFFF >CT1216 hypothetical protein MDQPFFLESTELFCRVFRYEFHQNVFLNLKSVLLERSWFF >CT1083 hypothetical protein MHRHHGKCKYLPKNLTNQPAQTNKRFLTDEHPNRILLKNGRDFNGSFSGI EKDMIQFCFLRT >CT1904 hypothetical protein MFMLFKGLGALIEREKLPNNQKHKSLESLWPNRCPESL >CT0819 hypothetical protein MKLRRLACMINIGSSQNFVGGMKDAHMTIPDEQV >CT1141 hypothetical protein MTAMYFDSLVILYMVVGFSFCDNLYKKILNKCANDFDFYVTLLV >CT1739 hypothetical protein 
MKRQPVGRPRINSDRGRFPDAGVSKFEYNINQEKHMQEQSNKNNEQKAPA CISTIGVSRCRCGAYHLRYRYVDVAIPRETLYLIMEECFRYEEECAKREG QRPEAMVFSLGVVTLAILPLDFAAFSKAVGDAVNEDLGIGRLFAGAEAGD NGLADGTQN >CT0272 hypothetical protein MRIMSWIFIFGFALSVFFLLMYFLSKFVNHMKMEQNMEIESFKDSLIDKD NPVGLTGEELEKMKQQQAEAQAHLREVISKIPVIQKDGKFQVDMDAVRQQ KAAAAKTNGSTGPATGKN >CT1604 hypothetical protein MSANGLSGRCGRTAFTQAIKPVPLYRPRSNELLAAFKKAKTDYLRRKILY RFFHSISSVLWITRLKYG >CT1768 hypothetical protein MSRMKQEPTNVNRKEERERQRKGIPGLIDSTVSSTIDDLKAIIDAKLELF KIELTEKVALVSAFVLLLVVLMIGVAYLITTIALLFGELFGHVWLGYLLV SMVFILTFAFFTKVKPNALKNFIHKILLSAND >CT2074 hypothetical protein MKSKLSEEKDKWDSSDVVDIGQEKSCYHKGFTGKPVTAR >CT1138 hypothetical protein MTVIGIQYWRKNIHALFGISSGRKPQVGKDDKDSGLLRSFVRRDHRRRSG GSN >CT1476 putative addiction module component, TIGR02574 family MKLSVFERIQLVEDIWNSIAAEASDTIELLSQTQKDELHRRVAEHRADPS TAVPREQVKSRLFSGKS >CT1653 hypothetical protein MSEEKSCCCQKPEMLKGTPEKCSPETIKQCHGDQPTHPCVPEEKNAKEDK E >CT1035 hypothetical protein MKQSWLWVLLPLGIVLAVLALFFALFFGNLINGELQFAPIALFTLSLSAG SYALVRGFRLGWNLSTIISAVIALFAFLASIAGLTLELQGMRIGGIIAGL LSVTCLAVLFVSNGMDEISVGKRKKTEIAAGPAEWADRIEAIGRRCTKPD VRTKVLRLGGETRFLTPGTGQADLMVNQTIGRAIDELAEAVKLGNDSAAL SMLPGIRSLFAQRENQLKP >CT2246 hypothetical protein MMDDMIQMVEKAIETSLHWQETGWPVTFGNRQVEVSNLKAAEALPRNAVY RDEAINYWRQVRLTGEDTAAAGKKALEALKNGDICAAYDALYLCQYLEIP FEADAKTWRPVYEAFMAKCA >CT1104 hypothetical protein MRSKDWERTNGWSGFVKVSKQKVIQGMSEIRIVRAVTRKPSRVIPHYRSA TFSFKPPS >CT0356 hypothetical protein MAVFADMIKAEKSYGRVKVESAKVRKQRRILKDTDTIHRRPSG >CT1801 hypothetical protein MNKSTSLVSAAMLGALCATAPLSTASAESAIKPTFDTLFEPLLADPMEPR IAVMPKLNKKQLQLDIGTSADLYQNSSKTFAVGIDFATWSLLNRTSNFKF PVDCIDYMFGINTTFRHQFKDKLLSFDEASVRVRLSHISAHFEDGHTDDH GNWLNPGDSPFGIPFTYSREFVNVTGALSAPGRRVYLGYQYLYHTLPDEI SPSSFQAGVEIGLPANAYVAADFKLLPKWDWNEGKTDGYRGTWNLQAGMR LTSIGLKNVRVAANYFSGMSRQGMYFYKPESYTTLGMIVDL >CT2280 hypothetical protein MPHSFHYTRRKRSDMEQQTSGQRILDPIERAKLGVKVFNLPYSQAEALID DYVSGKNYDQASVDYFKDQVATQIHIREKSAELLVTGGEIIKLITRSFMQ NLPKSIDRS >CT1876 conserved 
hypothetical protein MQFTNDSSMSSNIAKTEFSEQDFRQFQQNLRKETLMLMEWFSEDVFENRQ VMCGFELEGWLVDQNCNPAARNEELLARVNNPLVMAGLSKYNFELNVAPH PLNHCLPEFLRGELQTLWDSCSRHAREMGCQTLMVGILPTLQDRMLTLQN MSSMQRFHALNREILRTRSCHPLKINIEGPNDRLEVVHNDVMAEAAATSL QIHFQVPLSKTAAFYNVAHVLSAPMAALSANSPFLFGRELWDETRIPLLE QAAHTPSFVDPTGRPVSRVTFGRDYVRDSLKEVFLENLDGYPVLLPVTFN HDPGMMNHLRLHNGTIWRWNRPLIGFGENGRPHLRIEHRVPAAGPSIPDI IANILFFYGAMLHLQPEVPQASISFEEARTNFYAAARSGLDAQVRWTSGN SMPVETLILQHLIPGAILALAAAGFRSSDLRYYLVDILAQRVASHRNGAW WQKAFVKKHGPDFRMLTQAYLENQNLGTPVHEWSI >CT0671 hypothetical protein MLNYFLLVCLLIARFFPLSRQDSGKDVFRVEFSLCRKEYIDNHVNCQKEN SRLNPAR >CT2254 conserved hypothetical protein MTYLLDANVFIQAKNLHYGLDFCPAFWEWLIESNASGKVFSIDKVAEEIA TGADELTDWMHNHASDLFLNTDSGTVEKFGQVSTWATSQKYEPTAINTFL NAADFYLVAHALSGGYVLVTHEVSSNSQRKIKIPDACRGLQLQCMTPYEM LRREQARFILR >CT0519 hypothetical protein MLQRLLDCSNSGRIYINAFYSRRSIQESDSH >CT0581 hypothetical protein MKSQMKMDYQKIKKRLGYSFAFVFGFFFGISTCVNFTIDAKWTDFVSLAF TAAGVALGYITFFRWWRNKKKDDSYRVSKDYLNALNEVQEVIREIDFQYF YLCPAPGLLVEGDEVSFKRIKQVDQLSHQLYLCRVNLVNAKSELNFWDVN LSAAFEKEHEELLKCLANLKVVMTGLSSQLFHYYKNHSNEYMTEIDRHKK MFNGYLKSIRDILNKRRSLKFDGIFTFK >CT1227 hypothetical protein MPLGKMFIILVFATGLDLKSLFSDAAFALPVTMPERV >CT1767 hypothetical protein MEHQIPVNDTSQEQPKTGPAVHDEIPEPFRKISQKVSEAFSEFKESETWE KMLDARDKARDYITENPVNSFFYALGAGMFLGFLLKRK >CT1394 hypothetical protein MSARPIVTGDYLVEKAIRLRVILIEQSGHGR >CT0886 hypothetical protein MTGIESSTFRNDAHIQRYYGHARWYGRYRLLEQGTDLA >CT0439 hypothetical protein MRSLNPLHAQYPGDSFFENQIFVKKLAPLKQGGDSFFLSKSEPVETDTMK SDKHWLNRERLTVYPRIFLALFLILGLVWVLMSKNMLDIKGKPLGYDFMT FWAASHLALTGHAQDAYKIPLLFKAQQLAISASKVAYAWFYPPTFYLVVL PLALLPYVTAYWTFMLSTLWGYLLVFRRIVRGNIAMWCLAAFSGLWINFF CGQNGFLTASLAGLALLTVERRPVLAGVFIGLLAIKPHLAMLFPVALLAI GAWRTLVTAAVTAVTFMAIGMATFGIAVLKGFLASIGDARLFLENGILEW IKMPSVFVFMRLLGMPVAGAYIAHCAVAIAAVIVVWRVWRRCEDRNLRGA ALMTATFLVSPYVFDYDLAWLAFPIAWLSLDGLRNGWLRGEREVLVAAWL LPLLMAVIAEAVKVQIGPLVLCSLLWVTYRRATAASMAGALATDAYDDQL GTVP >CT1910 hypothetical protein 
MWLIKHWFSASTFVVKIATFAKPQNVMGNNKKQKL >CT1435 hypothetical protein MEQLLSSLISVWSDISPMQKVIVLSAVGVMSVVWWIRHLDKEAGAD >CT0791 hypothetical protein MKMLLSSINKKGVKMDKQVDHSNGKRRSGSDKGERMIAIGQKLGVAIPAG LLLFCGVNASAMTRDAVNISFSPDAVRENEVAQHLTALSANPQGALLADN DNPLHGNTHVNKFDPNIHNDYSDSGVHTDSHGNEHCNTHGDANRY >CT0582 conserved hypothetical protein MRGEIDFSEYPEKGPVFAPLKEPGFFRKAFIEGGTIAWPNGADIAPESLY EKLLQKEQNRDSVLH >CT1704 cytochrome c, putative MSMKIGKVLTVVLVVLILIQLIPGPSHENPPVTGTPKWDSPRTEELFKRS CANCHSNETIWPWYSTIAPLSWIINLDVSVGRSKFNVSEWGRPGKNDGDE AAGELRHGKMPPWFYMPAHPEAKLTAAEKDELVKGLAATFGDKSAEKEKK EEK >CT1872 hypothetical protein MPLTELLNLFIRDGHVSILNATHEILRRTIFLIKMRFRIVQSAERSFFPV FPLDRYFAYPMEWLS >CT0863 hypothetical protein MTVFRETGIPCFFISLISNGFVNRGHSLQIFG >CT0516 hypothetical protein MTIELTASNLLMPSILFGPISSYHQFSPITHGFS >CT1463 hypothetical protein MPTSIGQKGKMKKALLITGLVASLLAVLGLWLHRSYTILKTKPPAPLTTD VKLEQPSSLFNLPISIEHTVLADYLNGKIRGNFLNADLWLQKKHKERVSL ALTREENITISSNGHKLFCTFPVSAEARLTDSRFGKFLAKLLVWPIHAKA VVTFSTPIALDRNWHLKTRFKIENIRWEEEPVLKIGPFRKHIRADVDTLL SDNKRGLTALLDAEIDKEASLYPTVSDVWKDLQKPIVLTRKPVPVWLRFH CNDITGHISLNKRAIVCNARIMTNMRMLTDTTAISPPTPLPRFRQTPRDS ISTISDVNFYALVPFASINRNLNDVFMNRRFSRSGYDIVVRSVEAYGSSS GLSVAIMTDLDLKSHIVISGRPRYDIPTHTLSIDHFDYSIDTGNPIIRTR ELILHDAIRDSISTRLDVQIGSLVDRLPTIITRAVSKAKAGRTIDLTIDS LAIRKCDIRVGRNNIYLLVNATAKNALRIKRIKSGKVIRIRKQAETKDQN PTSQLPRPPDTDNKSRLLPVTARPLQLTNHF >CT1733 hypothetical protein MLGFKDIPLTKKVIHIINDIERPFMIRWSNSHDVQQIHDWLQEEEALEVH GNFLCNWNLTRQCHEEGRLLVLIDEIKGIPVAYQWGQLLSSGILQVRNGW RGNGLGRLVVEHCVELALQQDEMVLQVECKPSSSIPFWEAMGFTIVEGEF GKNAKGFRVLSKNLALPPGGRPILATISSFPEERNWQDNVPAIASYHLNA IVADDGKVYLAERASFPKCFRRMSRDPVIEIIVEGKLVYRDKAKYQGAQD HGVKWCRNGFYIDVVTI >CT1184 hypothetical protein MDLSSFLPFRDEMVKVYHCLTTNATHTSEKPVFSELKVRRYSCPLEDVSN FITNKIESWVGWELKNQKTAVGGMKTIRAEVSSFALLGMKIDVTFGLVEE TDINGRKITTVNGKAHTRIDSKGDLGESRRMLRMMLASLDFEFRPQIVHE DEYVHRSIDPKNSNAAFQQLFDESTLEHRPSTPKAKSIELKKPVKKQIEF KSSKNSGETVKAPISSQAIPVATNGAQTTTAPDSDVEEVKKPAKPKITVI SLKKNS >CT2003 hypothetical protein 
MDRHPEIVTRERVNKTKKSGPMMNSGRSFFMKSGQS >CT0539 hypothetical protein MTTEAMMKSDSLPVEITRNGDCHFAANRYVRSTCNIHHQPEFHPLTR >CT1587 hypothetical protein MSQTRKHQENRLEKHFFHKNRHKTSSIETRLARHCDFFIFNLS >CT1390 hypothetical protein MARLTRCGPFPLRSQPNLKKSFAQKIIAIESLAKSAFVWYPHTEKIHCKY CHDYQSTDPIQ >CT2009 hypothetical protein MSNNQSRMTHRNANRIIAAGIFLVAEAVYLSTMAPTFSFWDCGEVIATSY TLGIPHPPGAPLYLLVGHLFSLLPFFQDIGARLNFFSTLISSTTIMLTYL IIVRLIALYRDSKPDGWSLHEQIAAYGGGVVGALALAFSDSFWFNATETG LWAASSLLTATIFWMMLCWYDEDPAPGSERWLLGVMYLIGLSIGVHLLCL LALFALVLIYYFKKYTVDLKSFSLMTLFSLGLFFLIYKLIIKGIPVLLVT TSWWGMSLLVAALASGIWYSHKKRLVLLNLGLFSVVLLILGYTSYMLIFV RAHAGPPINENNPSTLQAFFSYVNREQYGEWPLWPRRWSPEPVYQYFYQK YSSEWDYFWRYQLNQMYLRYFGWQFIGRSADVEGAVVDWGKLWGIPFLVG LFGAGAHFRKNWKMALPVATLFLMTGVILVLYLNQPEPQPRERDYSYVGS FFAFALWIGIGVERLFTWFSGRLKSLDPKQLVWLAVAVVASGLLSINGRM LMANYRTHDRSGNYVPWDWAWNMLQSCEKDAILFTNGDNDTFPLWYLQEV ERIRTDVRVVNLSLANTGWYLLQLKHDSPRGAKPVNIEMRDDDLANISYV PVDSVNVAVPAGMEARKLYDDARRSGVALPGAPSDSLRWTLKPALTYQGQ GFLRPQDIAVYAIVVDNFGKRPIYFALTVDPAEMTGLDRNLRLDGLVYRL VPLKSDSALSFADPGTLYGNLFNVYRYRNTGNLAVHIDETSRNLLGNYPP LFARLAITLSASPEQAVMVPDASGAYKTVRRGELALEVLDRYTRLFPLSR YPVTPKLAGSVVAMYAAGGANEKAYPYIHYLETLAAQSGAEQEPDLYFTL AQTYRAVGRVHEADRIMKELETALPELRKRLDSLKQ >CT1718 hypothetical protein MSHHHQTKNLGLSIVINAFIFMVIHQLLQPIVINVQPKPKHTPGTRIVHC SIPGRPLCTSAFPSPFFRSGRTSARIEKTRSRSSMLVQIPWRPRNSFEIS SRDFVLNVMADISI >CT1311 hypothetical protein MFPPRRKHHIWPMSLYRTYNLSHRDERRALMNRTVVVAVAVVVCCGLSGC TGPTDQKVNLPHTDLNGAIVSPANTPQSSLVSDMPPWIDLYRERNLVNVS SVDHVVIVIFETQGSREEVYHHYFDKFNGEENFSSFRYNRDIISFVKDGY GIKITLLDSTKNLWSLEYHRQMI >CT2205 hypothetical protein MKKAPWPSLVAIALLALLLVVPFGPLLTLRFVPGSPDSVAPMALDKALEA LQAQSGRYPLWQPWTFSGMPTVEAFSYLSELYLPNLLFGFLHFDPMYIQL LHLVFAGMGGFVLARRLGLGSIPAFLSGSAFMLNPYMTAMLVYGHGSQLM TAAYMPWVFWAALRLSEKGRLADAGLLALMLGLQLQRAHVQIAWYTWMLA VPLLVVKILIDTKPPGVSKGKVGVLALAALALGGAIALQVYLPALGYLPF SARSGAGDAAEAYRYATLWSMHPLELITYLMPGAFGFGGITYWGFMPFTD FPHYAGLVVLGFAIAGVVAGRKKPMVLFLSAMTALALLLSFGNFFSPVYD 
LFYYFAPKFSSFRVPSMALVVVALCLALLAGYGLQAWLDRPLVESSPVFK WGGLVIGVAAVFFLAFEGELKQLLRAAFPAVQIDNYDLVPMVGNLRWELW SGSLFVLIVVAAAIAGLLWIAARGMIGARAVAIVLVALSCADLGWIDHRI VSPDDHSLRVSPLVERTALDRALEGDEITRFLASRPGVFRIYPAGRLFTE NKFSLAGIESVGGYHAAKLGVYQELLARTDNLANLDVLRMLNVGYVLSPA PIDNPALKAVAAGKLNLISGEVPVAVYELAGSMPRAWFAPWAVAVQSDDE AIAAVMAGRGADGGAFVTGVPWQGMERFSTGTVLSMQRSAESIAMKVRAE GDALLVLSEVFYPERWKLTVDGREQPTLKIDGIIRGIAVPPGEHEVRFVY DRSRFETGRTVSLVATLLSIGLIAAGIVTGRTSSKTIKSSDKP >CT2270 hypothetical protein MKKKFLRFFTTLLFVCSFIPGKLNAAPTATHDGVYLDEIAIGSGYAWGHL KFSEADYNAVPIFARFGFNMNSVFGMKESKSTLQLALEPFCNPVTEPDSG VETGLNVFIRYLQPVAPSVKLVGEIGSGPMYLSINSAEQGKAGFNFLNQF GLGAQVAVSPKSAITVGYRFRHLSNAGTSEPNRGINSNAVVVSYSLLY >CT1979 conserved hypothetical protein MLFHNQTIDFTMLATNENRLVEILLQCQPGQPRTRGTWEVDHQGTPFILP SIGGITLNLQVGDPAFGWEGDHIEPGVSCTADTHKPFEHPNVTVQMLSCV GNTATIVSGEAKGESGVVIGHHGGSEHIIVDFPREVKEKMAYGDTIMVRS KGQGLKLTDFPDVSLFNLDPALLAKMKINIAEDGVLEVPVTTLVPAYCMG SGIGSAHVAKGDYDIVTSDPGAVEEFGLDRIRFGDFVALLDQDNRYGRAY RKGAVTIGVVVHSDCREAGHGPGVTTIMTCATRGIRPVIDPKANIADLLG IGTRL >CT1006 hypothetical protein MAGVSVSEKYTTAFFGYETGYLRSGAVFQRGGASGGPAFCRGVTLLK >CT0850 hypothetical protein MTIVFRHEASCSRLPQGNQKRKECALKAEKIGKKKSTLNSMVPG >CT1178.1 conserved hypothetical protein MAEVRQNPVVIVPGVLFWDSLYEVMREALSTWIPAEKIAIVPVNLLDWLG FPPSPERSTNRVMAALDRTVRAMASRFPGEPVTIVAHSGGGTVAMIYLLE RPFQGDVYAVNGLVGRLVTLGTPFHTHEHFAKIKTDFIFKHLGPEFFQKY QVVSVVSNQYKGSLDGGMIEKMCYMFYRGVTDDGNLAGDGVVPARSCFLD GAKNVTILECEHLPAPHTKWYGTKDGVEQWIEWL >CT1041 hypothetical protein MAKKQTFGDKQKKGTVDFKMAKLVYSVKSEKTNAWKFVEKSVRIPNGENE LDVLKKAMAGQGK >CT0858 hypothetical protein MYMIYPPLFLSEAGKYVSEIIEKNAFLYKRSSISYYVPYKFSAKTAPANR ATLIPVRQQK >CT0826 hypothetical protein MLLTFPGKLENLNRIDRKIAAMALAFRPAWRHMIQPTLFRRKLKTLPV >CT1972 CRISPR-associated protein, CT1972 family MHNRFNLIDEPWIPAIGKGLVSLADIFSDPRIPALGGNPVQKIALTKLLL AIGQAACTPETTEALEQLDAETFRRACRAYLEKWRDRFWLFGDKPFLQMP AILDWMESQRAAGILSETENAKQIGPGFYPSLPSENDSILSQFQTLKAQT DAEKALFIVSVMNFAFGGTQINKNIYPSEEKVKGKGKPAKPGPSLGRNGY 
LHTFLFGSTIIDTLIMNLLSQEEIDNLPFWEKGIGTPPWENMPVSRECDA ALSLKKSYMGTLVSLSRFVLLHDDGIYYIDGLPYPSHQEGWLEPSMTIDN QQNPPKAILVNPEKRPWRELVSILAVFDSNKNNKFVCLFIKYGLSRWPKR YNKPGDKIGVWSGGLQVSFQTGEQYAKATNDFVESSVELDPDMWNNLWYD KFFGEISILEIMANKVKNGVINYYDSFEPKKEKKPKERASTIMGKKAVEL FWQLCERRFPELVDACGEPDKLPAIHEAINLLALQSYDAYCPKETARQID AWAECQKDLKKFIRELMEADRRVGGVPSEF >CT0661 hypothetical protein MKDSFSGVAHSGQNANDFMQATVKSRCRFHFQTEAQCSRSIPSEKTGRVF K >CT1001 hypothetical protein MLCVSLQIAILFLCCLVLHKELYKTKYLLDLKKYEAASNSRFILKLFREK GCFKVREVQPLQMIRDGLPESGCSCGGGKKL >CT0287 hypothetical protein MFFRVITHNHKNRQPKSLATDGKSSERKENINVRNNNAGNAEAESAGQKK SQDFRSLRTSRA >CT1973 CRISPR-associated protein, CT1973 family MDNEKEKKTGRQKQFVEFVIGLCQRDKGAAAALRRADNPATEYQSWEYLA GFNIDLEKPFERIPYAAIAAAIARAKAERNGSAGIGKAIAFCYEDRSKSD QAKARLRRLLACNSVEEACRILRPLFSLIDSKAAVTLDYAELLSQLLWFN DDSNRIKTDWATDFYRHAAKTENEEVKA >CT0118 hypothetical protein MGNYKFKAYYDEAYPPVPDKATLFWRKFIPWQLFRFFILNIKMIRIVVGG HS >CT0751 hypothetical protein MKAKNTMPFENAFSYTPNKWSIVFNGTFIIHTRSIGISGITKTQKSRASD IPNCPFPQNTDYVKGQSNLLPAARTISKNIYSCIGTCLFKPVKGLQYIKN RQPQGGVTINLAARKGTADRKKLTLRDFCLMCSNNQEFVHA >CT2083 conserved hypothetical protein MLVCDFKTTCLNWVNVDDVVMGGVSNSAMQLTQDGTAVFAGNLSLENSGG FASVRTVLERRNYADFAGFRIRVKGDGKRYSFRARNDERFDGVVYKFDFE TVPDEWMEIDLSFAGFIPSFRGRTLVDVPPLDSSNIVQIGLLVSNKQAGA FWLEIAWIEAYRADTVASSFR >CT0639 hypothetical protein MQKEKWQEPTKDVWFSSWIDIWFMNNKTGGWA >CT1136 hypothetical protein MMSNSGKMLDQFVFNPLTRFILPEVGRRIFFVTHALPI >CT0682 hypothetical protein MSIDWCPGIREACGYWRDAPMLQQTFEAMERNLEQNNDACIDCAKTVVEV VCRVVVESFHTQQAPLKLTEETPSLSNWLTAAIRALKLGDVRDDRFKKLV SSHHKLADALNDLRNKAGPASHGKDPYLARLAEHHRRSAVLAADAIVAFL PQAYLDAQLDPISSREPWERFAADNALIDAHVGLAVDAEDGDTPTLRFLL PSGDEIPINIEVSRLLYLLDRDAYVEALNAARGAPAPAAEIVEGQGESA >CT1902 hypothetical protein MKKAERKSFLLFDKPVSELDLTLNRWLSRPAPDALPRLE >CT1142 hypothetical protein MPIVQKLNLKNRKNKVAIVLQNGLQNAVTGLFHSGGSQKGAPRVDLFEDV DSAVQWIAA >CT0905 hypothetical protein MKKQVLFMPTIRALSNSYRFFFSGDASDKDIEALLSRLRAFKEKEFALKN 
QIAENPLDNLFHETGRAFSADKIFDEEERINLKPATGEAIVKELEKYNLS DTSEDLKGVAFEPFPGRTFRGEIGRFFTPRTIVRHTQAGPCGEQRRGHTR RSPVHGLAQRLRIDLASTEVSAP >CT1860 hypothetical protein MLISFPEPRFEALSIQAALVKMGGKYRFITLENFSDTATSSSKTVMGKLT ADNGHMNLGSKNYDDLNSFQNDPDKLLSL >CT0497 hypothetical protein MFQPLVSNQKSPYWHHHKRMKIVSSMLFSFLEIAATVIHKKI >CT1911 hypothetical protein MCLRSNEPPLLPNRCYQQYFYLSMSKYFLVAKPDFGSPVSLSMYEIRRCL CLISITVTCCPSLTRPIIFEFFLSEYLKLTLVLAVLVLLSLFSSDSIFDE NSLLIFSISLLVSMITSELLLSVGITKGL >CT0835 hypothetical protein MVASDRKAYFRSVSRELQAAGLPVDMNKAL >CT1889 hypothetical protein MNQRLTAALLVGISVAFIAAEILVAVFGSFDQGWMVLFLSLYAGFVGLLF GLSTLLEGRREEVESVSERRARARRDGLVGNLLDDYEIDEEFLGRGVRKP RSKKPSPSSSSGASKERIPDDEELKAAVTAYAGMVGGIVTLRETIESMDD SAFLSMARKAGMGGVTRERVLALVVEMVSAQGPTKSDESPALSLSIDKES FDDYIKRCMTEPEVCIDDDATDSEGFSVGLDASDLSSRPGTPPTEFSHDP KAVMERFKRSTEKR >CT0664 hypothetical protein MDINDLFDKIMKSINQFADEIAEQRLQEKMQDTGRGPKNGGKNAENFEAK EKQGVTSFPKRE >CT0508 hypothetical protein MLYGYGYEHPGFLDCGKLKERPLSMRGLSFYI >CT0500 hypothetical protein MIYPATHWQYSGYSAKTLISKIYFLLKFNFYT >CT1117 hypothetical protein MDTFTKADDFIAALTGFRQMVQSVELLQQFYDEAGAALLYDCHTASPAGV IRTAEFFTLTGD >CT0608 hypothetical protein MNVMVLVAVGKMDKELFSFHNPSVSILRCQLSIINYSSYLPVLEPQPTNS FIPS >CT0904 hypothetical protein MFFQFLLSMSLKKAGLARLLKVFNALTQQKCRAIAQEISAQV >CT0128 hypothetical protein MITSGCGAQGWVITVHSSGAISYIKKRQCVN >CT1282 hypothetical protein MTKKKHAKVWRLEKKRLFQRQYRKSLKSINKKSSSSPSG >CT0673 hypothetical protein MAVLCGRTSIPAWICWMLRVENGTRLDSGVWHRA >CT1976 CRISPR-associated protein, CT1976 family MSNPFILLWLEAPLQSWGADSRFGRRDTLDFPTKSGLLGLLCCALGAGGE QRELLDEMAELRQTVLAFQRERGERPPLLRDFQMVGSGYNEKDKWETLLI PKKRDGGGAVGGGTKMTYRYYLQEAAFAAALEVPAARAGEFAEALKAPVW DIYFGRKCCAPTDMVFRGEFDSEVAALEAASSIAKEKRLREAFRVRDYAP GDEGEAEVVALNDVPVRFGPKKKYRQRRVTIIHHNDEE >CT0347 hypothetical protein MKRGLYSTFDQPLFSKHQFFLVHFEMDARSVMVHSPIKQTPTAMKQRFLQ AFFIVFAISSFFGGVSLAADSSPAVSKKAATSPESGKTVSKDISKPGVWS KFEMLSFTDRVVFRDTMAGLTGVGYEPLVVRKQIVEGVNYEFFCNARAVY PGTDWHPAMVLIYKPLKGNAVIKKISKIDGR 
>CT1651 hypothetical protein MLEMCDWEENASGGGNFLRSERTELIDSEDWPFVCSVAIGHTVLY >CT0352 conserved hypothetical protein MHSTALKVALTLLLGSIMLLTVAAAGLRFDSGKVSEEAVNGAKIALADYL AAHPEAPPRALAVIDYSQPSYVKRMAIIDLKTGRQSFYRVAHGKNSGELY ARRFSDVPESNMSSLGLFRVGERYLGDHGLALRLDGLDSLRNGNAAKRDI VLHKAGYVSIPFILLNVVTGYGPMIGRSNGCFVVSENDIDEVVQKLAGGG FIYAWATPDDNSRK >CT1648 hypothetical protein MTISNAQKRNRMKALPILKGNKFIVYYLTL >CT0427 hypothetical protein MFPPFVLLKSLVNQLIEIVSLFPANLIAVSATQKKDYTRLLAGQHEPVLQ AKPRDNSWTLITKS >CT0779 hypothetical protein MMIENEALITGAGRVSALWLFSKPKLECFFHNFHNLLVFPL >CT1115 hypothetical protein MKKGVDMRYFYDSHCHMMNLSHPNLSAIIKRIYNDSIKPLLLKYSVYLKA ALLLLLFVIPVVVITLLLTGHFVVIKWILYAVSLIAVILFVFVVVKFGDK KKREIEISKIKSNLLDKVKEKLANVMNLLAAMETDIGDCLIQMEEELRKK IPLNNVLVISGNGEKKEYDKIVLTPLIMDFGLKDSGKTNLIYKVRWKPIV AQVEDLCIGIRDYYLYRDKYITGHAEPLFQIIPFMGINTQNYYSEKDNTT GKSISVSLVQLLDKNFSEFKYDTSPQMRRKKIDAVNWRQFNGDIESIGSY YFLGIKVYPPLGFDPWPEDDIERAKVCYLYQYCIDHNIPITAHCSPGGFL VDDDFKNFSSPYKWEQVLDYTDEKNNKPFERLRLNLAHFGGADSKVWRKK IADMILKKDTVSSKYKYENLYTDISYQGVDKKSYDALKDLLDRYDSAERA RLIERLIFGSDFMINLQDINSYSQYLDFFFKTNALTLEEKDMLCNKNAER FLFVG >CT0702 hypothetical protein MPVFLTVTLFNQAMQAALCLAQAEAAQATANADNAKARRMEKFKELKKMG LSNNQLDNFIP >CT0784 hypothetical protein MLVVNIKNARVVLSGSGNFRFTRRDSGVAKISFTKKTI >CT2157 hypothetical protein MESKRYSEAFKLKVVLEIESGKFRIGEAARHCIGKATALQWKNRDQVDDL PNTPHRFNKTMSDLEALVVIELSKSLLLPLDDLLAIVRECINLKVSRSGL EPLSAPAWRRQPERVDACRRGRAQAQKERQRPRAGSCACRCQIPAQDAR >CT0425 hypothetical protein MLKMYVDYYVAVLSGFLQQYFGVKSQKGVTMIEYALIASLIAVAVIAVLL TVGSNLQTVFSYVGSNLTT >CT2242 cytochrome DsrJ MQVAHRGSLPAIAEESPVVAAATVKPGGAPIDSSKCILPTEYMRAHHMQI LNKWRHDSVREGNRTFVNPQGEHFDKSLNTCLGCHGSNPMFCFMCHEYAN VKPTCWNCHLSPMEVSQ >CT0296 hypothetical protein MNAADSVTTKYLFLTSIIMQYLILLITGIAAGLLSGMFGVGGGVIIVPAL IFFLGMSQETASATSLIALLLPVGLLGVYEYYQAGKITTEHIWFGLIIAL GLFAGAFFGAKLAIELSNDLLRRMFAVFLVLVAIRLWY >CT0910 hypothetical protein MLVEEPSMEEALRHLLPKIIGNRAGWKVINMGSKGRLMKELPNRLRGYKQ RMDKGEKIKIIVLIDRDNDNCHDLKRQLEDMARKAGLQTKTAAGTGGAAF 
QVVNRIAIEELEAWFMGDTAALQCAFTSLRGVRFPNSFNNPDNDGTWERL HHFLKQNGIYRKSYPKIDAARTIAKHMDPGRNRSRSFQYFVQGVEACL >CT1097 hypothetical protein MVLSKQLYINKPAQVTAKFHTFTFGELVTV >CT1262 hypothetical protein MPRLHEKAFLSLLGLLLQPIKKEALSTASFFIWKTEGVNQP >CT1521 putative addiction module component, TIGR02574 family MTASAEKIMNDALRLTPVERAEMIERLFQSFDNHRKAEIDAAWAAEFESR LDAYKEGKIKASPVEEVMARINKR >CT0238 hypothetical protein MVNLNTSFRLYGLFFEKKCMIPSIITNPNR >CT0041 hypothetical protein MNVLKSAEKPFAGFVGRPGKILGKEDGSNMASYELGVVDVRERRPFGFDP EPFRKASPQANSPEIFDIDSLLRLRTFVYLLLGTVLFLVQINNTLAINDL AKRNERLREQLRISTSISTAEKLKSRELQSIRYISGYAKNLGLDSSFIPP VEIEP >CT1776 bacteriochlorophyll c3(1) hydratase MCFSGYPDFTIFPASYSSGGCFNRPNQLQAKDFMPRYTPEQLAKRNASVW TDIQIILAPIQFFIFLGGITLNTLYYFNLAGIDFYWISIAILFKTLFFAI LFITGMFFEKEIFNHWIYSKEFLWEDVGSTVAAFFHLLYFVMAWMEYPEH VLVVEAYIAYLTYVLNALQYLVRIILEKNNERKLKGQGAI >CT1244 hypothetical protein MLHRLMTADVMVSWKLPDVSNAGEPFGCGDTVRFYVKVSGVPAVFG >CT1409 hypothetical protein MTELFQYPMSYPGWWLNNYYYFTWTLAMLFLAGGWAIFYRYGKFSYGVDF GCFWKTALLVIMTTIALGVPSYYNTKFVAQHGNDGDSVLLTPDRIEYRYR NGEKKMFLLKDIVSIYQEPVTYNPPPKIFIVAKNAGLRDSITVTEGKYGL PDVDKLLAALSARTGLQIKRP >CT0174 hypothetical protein MKYPILVFLLILSFLMTSYKITYSEESLSPEYISSTGEEFVFATTYPGGE KYGYPLWHKWPMVLGGELSYQDYVGRKGKLEDRIIYEPSGISKFRKAVME NGEVLYLDVAGNIPPNGIYFQLATLNLSE >CT0667 hypothetical protein MLLWEKDSRNSGTRMGFFMVHLLSGRTADRAARGETGFSSKEEG >CT1374 hypothetical protein MLKANDEIDFCMIHYDTGLEIKKKQKSTSMEGACLVFKIISKQQTGKFLN SDHRAHENLA >CT1379 conserved hypothetical protein MEIEQTVLVQCPYCAQSFEVLVDLLAGHQEYIEDCEVCCRPVSLVIDVAE DGTATVQAQGEDV >CT1933 hypothetical protein MRVSPFPVSRKAWIGTESIPEILKPELYYFFRLF >CT2199 hypothetical protein MPSLDWIGKKAVVNHHKKVAFDRLRRLALRCSPGC >CT2282 hypothetical protein MSQNLTASVFVCQSVVHFFINKLRVLWLFDQAISLFWLAKASL >CT0526 hypothetical protein MLDSREYCRCSVMFLIPGYSFTASYQRFSCYPPSAPTHNHRKQI >CT0752 hypothetical protein MTALSIAFHLRSAIGLRNKVPHRQQIFNQEEFSPKRQMAHS >CT2066 hypothetical protein MDFHGAMLYGVWKNADLTGVPFVFPCKQRLMLKLNEKERESIRLFPILAA SFL 
>CT1065 hypothetical protein MNGEFPKGVHIMPEGRDRFGQLKPSFPVDFEPLTAGRVLPKL >CT0724 hypothetical protein MVMDGVNGLFSRVMRVFCGKTLPDNNNNHL >CT0544 hypothetical protein MSALRSMGAILADETHIMISREMIMNDYSMVITANGNNRNGFCLEKSKKL K >CT2201 hypothetical protein MVYQARPDAGKPDRTEIHKHIRTYFFLLPTK >CT0645 hypothetical protein MFVFSRFGLWRSLITSVIETSKDVPKRFFHFDSLCV >CT1178 hypothetical protein MDRVALKPFRRIAVILLAGAVIATSTSCSRDERKKEEEKHLESMMSILVQ VQKNLGRIRQKEAVVVRLSSDVEGRKPKSAEQIGREINTNIRFIDSTLSA SKNLVATLEKQNHESQYRIAALDRMTGQLKGELDKKELELGAMKREIAKL NRQIARLTNTVDVMDEAISDQEDQMVKAYYVVGTVDQLVSKGILVPPGPF SRFFGMRPVLANDFDLRPFRQVDITETKDIYFDKPVSRLHVITPHTKGSY ELVGGKTSSLLLIRNEAEFWKKSRCLVIVVE >CT1392 hypothetical protein MKAGVVSRSSRVNSAIERMFPDPIYVRLSNKRFREHITKPYASVAVNASG F >CT1200 hypothetical protein MMNVQKQSYVFVLTRASIVILNVSSCPVFRNIHGLNLPLACF >CT1875 hypothetical protein MLDHIGAHLAEAYKAYLHMKSPFLGVVTPFVDYWFRPPVNNPKTSATETS NQTLHLVCHVILNEVKNPVPGK >CT0579 hypothetical protein MPARRFFVGGFEFDIRFECKIDPDSFALCSD >CT1217 hypothetical protein MSVHSTLQLASDAIEDARKRLERARVDADDDYEIRQALRHLEEASDYLRK VSTELKQHG >CT2148 hypothetical protein MYNFFRILQNLFSDFFHLLGGHHFSQGKTLASTSIAQFPTISKVK >CT0955 hypothetical protein MDKSSIFFPVSHQEKTISPKQSQNKIMIITTYIDTIAETLFREYMLHKEI HNHLG >CT1238 hypothetical protein MQARSGQGITRNLHTPSPGSAERRAILDAMRLKIKELHGIDVIFVVKTMN VSGGWAWVHTLPRSVDGFFHYEDFSALLHNDGKQWLVDEIACTEPDNPHC IGSPGYFRKLSHRFPCAPLSIFPTASFLR >CT1274 hypothetical protein MAYIMATSLIFIYSTDSGAVSTLLDTGHKMLSPSTTVGMEVAKTS >CT2082 hypothetical protein MHIASQVAILADFPYEVFMSADPITIFRKTWGTYQKVISHNLMFHRREIT TAVAKLFESRNAKRYDRNV >CT0849 hypothetical protein MTQNHETAVTCSTVSQLFAAPGQSMLHGIGMAVVSGSP >CT1307 hypothetical protein MTRKKEKMDNAATQPGAVINQFSMKFEGRVTL >CT1581 hypothetical protein MRGCPLSTNTNAGAGYELSIMYFTRHYFIAFVMGWFSFS >CT0431 hypothetical protein MNRMHANTAAMRHAQAPKHAQFQKGAVMVEFAFILPIFLLLLFGMVTFSI ALYDKTVLCIASRQGARTGALYYASNYDSNGNLINANVQQRACDAANAVC QQDLINFGPNMNLQIQCQVLGGTVHGQRSVSVTTGIDYTGIYILSDVLHL SSTTIMRLEED >CT1723 hypothetical protein 
MLYFGKNRRHRHWASACRCIGIEMNPIQTRNHMNRPTARTLSAAMILLSA GLVIGGCSPTVKVEAPDKPITINMNIKIDHEIRIRVDRDIDNAIGKRTDI F >CT1863 conserved hypothetical protein MSLAINSIEQNSDIDPQALRLALQGYFNLKRQGLIRREGLITLIDFDKPS DCKRLFIIDINSGTVIQTALVAHGRGSGDIMATSFSNQPGSNKSSLGFYL TENTYIGNNGYSLVLKGLDQGINDKAEQRGIVIHGADYVSEEYIRQKGRL GRSLGCPALSMDQCREVIDLIKDGTCLFIYHQGEDYASRSVVLNPKLALG SGKSKNPA >CT1080 hypothetical protein MKKLFAVFLVFVAMALLPVLASAKSASGPLAFGLYRYEQHSKINAETYWK CDYPVFERSKAGDIINAAILKAVISQAPSPDSKPAAASIEAAASAFIKEC DEQMKDAQAHSWAWQSETSGEVLLDRPGMVTVSIFTYAFTGGAHGMSVTQ YLIFDTATGRPLGLNDLFKPGFEAMLDKLIERRFRQMRGLSATDPLNGEK GGLFENKITHNENFAVTGSGIRFLYNQYEIAPYAAGQITVDLSFDELKGI LKPLPALKPIKP >CT0396 hypothetical protein MWCPPWKHTGLKGNPVRVRNSTRCCNSALAARLATRFADNATVPFRDGKA GRIRESQKTCLIFFGFGNKATYYHETQLRSAFRNRLGSGIDPVDAQPTVP LAFQSIRSSCLSRSVPLAFRRPGAIRAVFHRPPSILSHSSYRLLFAFFAS IRSTLSTRSSRA >CT1570 hypothetical protein MNAGECSRQKKYPLLRTLSERYVKHDIDSTKKLAMHIT >CT1686 hypothetical protein MKKGSDGPTPHKKALPLRDAYQSKLPIAVFVLIASRR >CT1978 CRISPR-associated protein, CT1978 family MLVVVANDLPPAVRGRMKLWFIEPRANVFVSGVRDSLARKVVDYLHQHCP PKSGLMIFNSSNTCPGYEIFGLGDTRKEITEISGLPLVIEKSAASPPENQ NRLTPEAPKVQ >CT0584 hypothetical protein MRVQLLDEATVDLADGYRFYERQAEGLGEYFLDSLWSDI >CT0449 hypothetical protein MPERYRKSGGHGALDAETVWSLPWWERAGREGYTGEARMVSSLSETPRL >CT1926 conserved hypothetical protein MSNISQHGIWLLAHGKELFLSYADFPWFRDQTVKSILNVKEQSPGHFWWP DLDVDLTEEIIENPERFPLVADARVIYK >CT1429 hypothetical protein MNLLILLLAVIILLLLVIITMLATGWPGKQREEVERLGNSLRREILEQRS GNLQLMKSLRIVIEDAVRESVEKEMMAVAPRGRSRRNSRKKIQEAVDLGS ELFIAGDEDADNGSYESPLQAMQLSLFSEMTERVQAAAVPDASPDKTKER EPEGETIHMGYVDDIPDVE >CT0214 hypothetical protein MDTAINPHSLWAKTIPTIFIVFFVKYVKAYML >CT1009 hypothetical protein MLKEAAAVMGGALLLPPLGVSMVACGIPGLLVAGAGFFAFDAMMQERRAS AQQSSSDPANGGDESWQTMPPEETERYR >CT1547 hypothetical protein MRWSSASQKASTLPVDFSNEPLDIVLHRGHRKISPIMVPAAHSGSNKTLH YEKSGKVTGKNRRPNRS >CT1022 hypothetical protein MRRVSSVDEAVAETKSRLMLVFVLNGGLECLSSGKQQIFAVEYLVFFLKE PVGGLLLF >CT1685 conserved 
hypothetical protein MKKNMGQKDRAVRAILGVAMLLYSIVFQNLVGLVGLIPIVTAIIGYCPLY EVLGVTTNKYAD >CT1137 hypothetical protein MVTPFTSNKKQYATGVWLKKQAIEIARKIQAGECSRDGFR >CT2043 hypothetical protein MNARVPQQAVGPFSNQPACHSADRQGFPIVYETI >CT2076 hypothetical protein MYLLLIMCPFFILLACAVRGVSPGAGSFLASIAVESGY >CT0231 hypothetical protein MKNDCIIAGRKIPYFRQSRQWAKTAFVFILNETLFSQKADYL >CT1116 hypothetical protein MSQGDLLFITLAVKNFLAHFDRCSREYFGLVQRCKR >CT0574 hypothetical protein MERLTEAFESQVMLIAELVELLRKREEKRGE >CT1224 hypothetical protein MSNNFTGYMTLISYGWTNRMVKCNFQNVRWKFPETIPRRKIQ >CT0567 hypothetical protein MNKYLLYTIALVILAFVQRFLVSKLLILHASPDILAIFIAFISMSTGQRT GTNFGFGAGLIAGILSGDLGLSALLGTVQGFVAGFFHVPQKSHATSVKKK RMFYAASATALIAGNLLQSLLSDPLSLPLYVRVPETVILGTLMSMMLAVL VYHFALKKLLKD >CT0271 hypothetical protein MPSCVLVACSFIVAHVVSAAAIRFAAKSWKYSFLIRYLRYNPENS >CT1592.1 conserved hypothetical protein MIVSGGQTGVDRSALDAAIAAGHAHGGWCPRGRRAEDGVIPEKYRLVETP FSRYAVRTAWNVRDSDATLVLTSGLVAGGTKLTVECAQRYGRLCLIVDLC GETDAGTVAEWIWAHGIGVLNVAGPRESERPGIGENARRFVAEIIDRDRW RTPAGS >CT0338 hypothetical protein MQPIILIMRNMEQNVGANPCGRPRTMLADPGLPTRLRIHQSNQATTEMGY GKNLFISFLPNPDFDLDSEQGNPPIPER >CT1469 hypothetical protein MLVLVTMKGCVYIADMWSDYYQISHMPGGMEWVSL >CT1865 hypothetical protein MIVESFHKAGDKQTGKEGIDHASALWPCMRSPLAFL >CT0692 hypothetical protein MIVKDQNGAVVTICRYLLNTRQTNSKMNLRSIRPFVDPHLGDDFVSFKRY YDPFVFFVFQ >CT1657 conserved hypothetical protein MAGNPISDEPNRLTAEGNGYGDPQAQLIDDRKFQRLMKAYETTVETRKLE IELFWSRSLFFWGFIASAFVASATLRRYSSDISVVVACFGFVCSVAWSLG NRAGKFWQESWEMKVERIEPSVTRAMFAQPEAVQTNKNFWLRGRRFSVSK LAIALSDYTIILWVAVVV >CT0359 hypothetical protein MSLFITKSFRRLLGGRFSRGFRGTECKGGLVAVAPGSGNGLEIYTIFKGI PTPRDSGLKTVFL >CT1028 transposase, truncation MQSLSSHYHQLLGLPSNWEVENVNLSMSSRQVEIRLAFTGKQGECPICGQ SCLIYDHAAEQRWLHLDTMQFETILVARLPRCQCKEHGVKTVQAPWAARH SRFTLLFESFAVELLLHCANIKAASRLLRLNWHTVNQIMRRAVQRGLVRR KAETVEYLGIDEKSFKAGQHYVTTLTDLGERRVLEVVEHRTTEATKELLA SLNDRPAVAMETIAMMVSVSMIVSRLVNLLIFSAFWSPPRYRPWPGC >CT1466 TIR domain protein 
MSPKVFVSHASEDKDRFVLQFAERLRQKGIDAWLDKWEMLPGDSLVDKIF EEGIKEAKAVIVVLSKFSVEKPWVREELNAAFVKRINNGSKLIPIVIDDC EVPEALKSTLWEPIADLSAYDKSFDRIVASIYGANDRPPIGPQPEYVQSF VQAIGNLNNIDSLVLRFSCEEVLKTGNAFVNPERVFLKDDKPILPEDELK DSLEILDGGGYIKLMRTLGGGFFPYQITTYGFDVYANASIPDYQGKIAAV VSAIVNEKLMSNAKIQERLKENKIIVDHILNVLENKGHIKQSKMIGGLSE IFNVSPSLKRALSGG >CT1656 hypothetical protein MLVACAKSLATALFIFFSAIYEVPLFIAGRSTKRADH >CT0355 hypothetical protein MANYRQIASGADGRDYSDILIKLDRRTVSQVTSSTVIMLADNLIKNTPST VPVANAPRLSKIDDSKSSSISSKLR >CT0556 hypothetical protein MQSKNRHNGRASLQVQIKSGFKIKFVFLSLK >CT0914 hypothetical protein MAGDSGLSQSFAKAFQFVVLHSGSFLVPYMLL >CT0848 hypothetical protein MMTDHHVSELFRKPGFQKAIQKTPGQTQDVQKRRSPRPEAKKRAPRRIIF S >CT0808 hypothetical protein MLLESILASESFRASELPAVPDRMRKPQKATSGGLFDFEE >CT0997 hypothetical protein MFFIGYYIQIYGLLYMYLMSRITFLLDFIYLMFL >CT1522.1 hypothetical protein MKIRIHELAAHELDEAIEWHELQSRDLGKHFRRIVREQVKTLARNPIWYL RKSDDIYKAFIPKFPYKNIVHRRRK >CT1012 hypothetical protein MSLFKKAVGVAALYCLLAMGGEVRAADSGAPYTVKAAYESLSLPAGESMG MLGLGVERQFNENFSGGVGTWTAVRGERGGFITIGFQGTARVPLSETFGL EAGAFVGAGGGRGGATLSGGGLMLRGYTGLTADLGELGRIGAGVSYVDFP NGGAIDSTQPTVFYSIPFGSSSRQFDGLAYERNSLAVVSKLVRVRSGARD LSGKVQDDFTLLGVEWRSYFDNEMFVRFEAAGAAGGSSTGYMQVLVGAGL QIPLSDNFWIDGSLGLGGGGGGDVDTGGGFLVDAGADLRCALDDDLFAAA GVSYLRAPNGSLSAFCPSLEVGGTFGKESQKHDKLPVRVRMVSQRYFNGS DGWRTHDADKDVDNLGVQFDYFAKPWAYVTGQALAAYDGQAGAYMIGLVG GGLHQTIAGPLFVEAEGLIGAAGGGGLAMGSGLAWQVNGGVGVQVSKDVA LMATLGRLDAFNGPFKADVVGLSLAFGGR >CT1476.1 conserved hypothetical protein MRIVVRPEAEQELLEAHARYESKAQGLGYEFARAADAAVASALRTPFGYG TRIAEGFRRVLFGTQSPQCDPRQSFPT >CT0837 hypothetical protein MQERLPFSHRELKKNKAGKPESSRFRNPLVSAKFADADEP >CT1273 hypothetical protein MKPIRIDDYCFDRIHLWLQYDPFVPEFIEMVIEVFYPANRKANGLVMFNH GFLIGNDLLWYPKKIAGMLLDDNPLFGINPSAYYNYSEAIVEKNWAMAFV SASHAQVDWMPWTDIGGNPRVGQETFAAASYLIRYGLTEFFWLAESRGHN SKNFDAQLASKAKFLVSNNVIFAGHSVGGAHAQAAACGFDTLSQIGRQQC RPFNPVIYNRELLPTFSMPMTDWPEADRANPVGLLMLSPVDQHVPIFMPG MSDYRAALASRQMPMAMVVGQCDCACLDMSQPPAWSGTPGVESQFSQLTG 
DGSWVVASQVERGSHCGYLTNKSPLCSVAELPSQCKRCPGVEVYKPMGAE TAFTAEMLGKFINLYPNGGGFEGGFNDWIGSEFITWLNRQSPCCDLNLMP MPGGGYIDNVPPA >CT2069 hypothetical protein MQGYTPLNQREGGVMSQFRVGDSIIYHKPKSSVSPGPRARQVYALEHGEH YHYVVDKFWKVTAVNGDGTIEVITRTGKTHRLPVNDPNISKAQPLQQLFH RKRFPN >CT0686 hypothetical protein MMNPSKIFALAIACGIVLLTFNWMAQAQVMYEPLQNKAPLSIYKYPRVKL GADLEANLTLIKADNGRLNITGTVTNVGKSSCKTASVAELIMNLGYAPQY SYAKTGVSDILVSRSFNNLKAGDSIVVNAVYQIPDFGGWASANLPGNAKR LFTLRVIKQDASSYKPDEDSNIENNVADDVVFYRDLTH >CT1380 hypothetical protein MKIMADTALLQSIVKLTRPLFIILSLALCGCIQMHTTVHVRKDGSGTIEE KMLFSEMLSGIMKEKGEGLPALPKKDQLREMSAEFGPDVKVVNVKKVENS SGSGFIVTYAFDDIEKVRIGNVQKMSKKLTADSTAVKSDSTVVQKPETWF TFTMKRGANPELTINKEAMLNSSSRGEVAKKPVSTQEKEQMLDMISAFLK GMKLEIDVVVDGRVISSDASYRADNTITLYAMDFDQLMTHRDILTGKYDG LSDRDFARRSGKDSGLKFEFKDKVHVIFN >CT1488 hypothetical protein MMDLVLMGLFWQIGFYRTMLVEDTLFQMNKG >CT1745 hypothetical protein MNLSFSKASSLVLAGMLCSAPTFAAMPLETDDTGTQGAGKFQIEAGMEYA RDHETVNGDSVREKEWELATTFSYGLSDTIDLVAGVPWSWSKVRVNGQTV RDENGIGDLSLQLKWRFFESDDKRTSFALKPGISLPTGDDEKGFGNGRVG GDVTLIATHTVDRGALHLNLGYEYNNYSIAEVRESSRKSIWRASLAGEVE VAKRLKAVADIGVETNEERDSDTNPAYILGGLIYGVSDDVDLDFGVKGGL NDAETDTTWLAGITMRF >CT1201 hypothetical protein MAKKSTAKEAPEVTPEKKTAKKAAASAETKPKSAKSKTAKIAAPEEPKHA KTAKPRKKASAKPMAPETPAASPETVEEHIRVAAYYRWVERGMTDGGHEE DWIAAEKQIKG >CT0109 hypothetical protein MKTITRRLFAAVLVPGLLLLSACGKKDSSAPDSAGVEHAAPAIAGPFTGV LTMKTTIPKAGTSDMKLYIGPKGMRAESKTNIGAHGGEVSMTILSLKDSP DKIYMINGATGACMELDVSKVKKQPGGDPYKNAKIENLGRERVNGYDCNH VRISWPDKQNTVDLWVSKDILDYFAYAKMQGSDDQTDTQLAEKLRAAGLD GFPVKTLLSPEGVVTELVKAERTTPDDKLFEVPANCTKMEIPAIPASPQG MSKEDVKKMQDWARKMQQQMPKQ >CT1532 hypothetical protein MFELFFCRSGNLFSVNELSMRQLVEAMLNSGSVFYVGR >CT1467 hypothetical protein MKLVIQMVREDDEKYEEPCRKQRGILKAILEYFTP >CT1741 hypothetical protein MIGMIGGRRSQILVSRDFRAAVSQPFSKNHKLFIIMSLTDVREYLQRERS ASLKQISSHFKADSSLVESMLDQWILKGRVVVKQRDVFGAACCGKCGGKE HIHYEWVYEWVE >CT0300 hypothetical protein MDRQTGECPILRLAGGDEMTGKRYLDEIFTVSLR >CT0870 hypothetical protein 
MIGGGSHFFIFAPQWGHSGASVIFMVFLIESKFG >CT1752 hypothetical protein MFAVPLITSSQRESVMSLITVDLLKVALPDCKKPEEWVAALIPALEKYAI NSEARVASFLTQYRA >CT0832 hypothetical protein MMGNFTGTALRFSRIVLRDGNDKADCSRRVHNLRSLQACRL >CT1388 hypothetical protein MLTCAKRGLSGARSNAETVQSAGLKEERFERFVAKTGSNRADKPIPANKT RRHEAPGKS >CT1920 hypothetical protein MKVESRKRQYILEGKIKPGFCDGCGCVTERVFVGEWKPSDKPKDEDPLFG LSDKKSKEKNPAAEEEASAAENQYWIRCTSCNQVHLLKEWQIQIDKELSP DELKPEDCQLYTPHGIYAQGDALYHKSLDEVGVVREKHATGSGAHVIIVE FCKSGRKQLLENVQLNQGKPKSTESVTDIIKLKLRR >CT1975 CRISPR-associated protein, CT1975 family MNNNPFKGQRIEFHILQSFPVTCLNRDDVGAPKTAMVGGSTRARVSSQCW KRQVRLEMHELGVRLGIRSKKVADYVAKACVALGADDEAAKACGEKIAAA FSNDTLFFFSETEASAYAQYAAEKEFDAAKFNDKELAKLSKKTLDPAKDG LDIALFGRMVAQAAELNVEAAASFAHAISTHKVSNEVEFFTALDDLAEEP GSAHMGSLEFNSATYYRYVSLDLGQLSANLGGADIADAVEAFTKALFVAV PSARQTTQSGASPWEFAKIYIRKGQRLQVPFETPVKAERGGGFLQPSIKA LTDYLTKKEQQAGSLFGKEKEFTFGGEDETFSIDTLVSEIRNFIEAKS >CT2052 hypothetical protein MNKASTMTDTFNYTTIFAPLGFFIGGIFLVLLLNKFIGKSQNKKSS >CT1223 hypothetical protein MSVHSSLNGKRFFASRLLITFRKGVFALNLHTMAHEAA >CT1363 hypothetical protein MNQSQNVLKLPKKRHLLHDLQTAIRKKSRLF >CT1056 hypothetical protein MYDQLTNLIVLKTYYKNKRIDRTAYSVTALFKVTCPVYPVRSSVYRPVRY RQIVNLLLNTPFCYYQLSIVFRIFPHRTILWALCPATS >CT1292 hypothetical protein MNPPIYESEHGLTTLRSMRSVWQVIFLCLKRGVEF >CT1836 hypothetical protein MYFSFHDSIVYCWIQASSAFRKVCKKLAPANSALRPTWGEGRKQQAIKAG >CT1520 hypothetical protein MQGQVSRLPCPPFCDLHHDGSLDIFSFARTKIPEDKQEY >CT1954 hypothetical protein MKRVKDQESGGAGRLWRAWKAVKAVMISGVIRNGNNVIGLKNKGNRYLAG FFQVGFIGW >CT0115 hypothetical protein MGFAKSTGDDVNNASLRLKRLRKARPPHYLSDHNGYSQLFCKRRAVLSFF MIPLESKFF >CT1230 hypothetical protein MVGVNLFQEGTNYDRLHTVSYRLPFFACSTSPVLFSVIIFPLAF >CT1490 hypothetical protein MASPRTNAICKSNEMKADNIVQPEPFTLRIFSSSP >CT0828 hypothetical protein MPRCLWRKVFRHCLQDTPLLAAGFFIKKIKYRRAMKKITPLLLLASCMLS SPVLADEPTTTVGVDNNQDVSNCSSVSQAGATSVAGVGVGNWSQIFEASH PIPYLPGTPGVTANAPTLFSMQGLPAQVKGLSLLTQNLYNANYHDVAIGS 
SQGTKIIFNASYPAPKPEKKNRNVYVNLDGVARGEVVGSLTVQSRKDKAE EVDFATLLYDARQYIAANHKLDGYDVTLLTVPNTVSYSMGVDGKASGMTV APLVSGLINGPLGAMTALSTGFSRNGGITVPTARIGVTFLVLVDSGKSQV VDLREYYNMLEKGSTNGNGNGNNKKKYEAIQPKESAE >CT1746 hypothetical protein MLKGCDKVPENGMTRKDRERAYKKHDFSGTIKAVSGLMRYFQKGGTSKKQ NQTQQNKSTRSAKPCHNKLMTTSIMTSQCRCSGTIFRN >CT0809 hypothetical protein MNQATAEKINTLFSNFDPTRCFIPYISMYEIEALYFSDPPTLATTSGAPL KAIEHILAECGEPEKINDHTTTAPSKRLEKLSNRSRKTTTGIAIATAIGI PKMRDACPLFNNWVTELEKLAC >CT0431.1 conserved hypothetical protein MARLRSIKRLHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQN AADAAALAGAGSLIDTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQE QDVNVAIQPGYWNLITPSFTSNTGLVTHTGDGNIPAVQVTITLSHLKFFF APILGIPEGTVQATAIAAVSPPTGGTGLFPMAIGGCLFNLFWDSVHNTPK LDPATGQPYEIQVYSVYSGGAGASCDSGQWTSFQTDANNVPFIRDLIKNG NSIPLSIGDSIWIQPGTEATVYDSVPTNVDVAVPVVDNVATHSSQTVIAI AGFHITGVVKHGNKSVVTGHLIPQSMVPSLHPGNGTGIPYGAYTPPFLVK >CT1031 ATP synthase, putative MNEREPEKLSAKDKPLEKRVGDSELRKIRARKNATRSIWEGFAMFGIIGW AVAIPTLIGVAVGIWLDRHYPSPHSWTLTMIVVGVVIGCLNAWHWVSEEN RNIDKEE >CT1642 hypothetical protein MMMLTSCAPSALGKLFFEAGDSCLDRFGSYDRG >CT0559 hypothetical protein MGLKWIFTALFSVLFRPEVFWSDARERFREVNAMKDYAAPVIAIVQFVKL PFIGTPRMAMLLAIISFTIDVAVLYLLTGVMDSVAEAERSAPVQHEIMTA LSFSLTPIWLAEPFGFAGTWRWLFIAAALAYTVFISRTGLQAMLGSDESG VEAFSGKSAFLVGAMAMISSLLQNGLIRFFISI >CT0728 hypothetical protein MQKIQDFSFFPCLFHYDVTMIFYRTKARKHNTTCKNAISPIT >CT1027 hypothetical protein MMVFGVLHERILGIEQVISIDMTVKFYSQKMKIIVYCEHWAGVQ >CT1621 hypothetical protein MKNLVSRTAGVALLSVMTVISGCSKSDNPVSEAVSSLSGE >CT1720 hypothetical protein MQSVEGSMKLTRRFSLLSPLFSPDAKELSAHLKV >CT1251 hypothetical protein MKMDTKRGRKRSLFLCKSFLILHDFPHPNIILIFRHH >CT0713 hypothetical protein MTFLLIHAERKIFKEYDFLVFLSRRHNVIVILYSLDIKLVF >CT0482 hypothetical protein MGNSELQPVSKSAGGQAWLLLALDELLAIVRECINPKVSRSGLDRCLRRH GRAL >CT1057 hypothetical protein MRKSSQDWSIRALAMAQSVNAEQSVLKALFDKTKKRQYRMPEKRRLVITD ELICRRN >CT0680 hypothetical protein MSVNTVTVPAWNNAGVLPPIRPNASGNSGDRSPYVVDLATVFDYFSTSPE 
RKTILDGLLRFRADLHTAGITSGFQWLDGSFFEQIETLEKRPPKDMDVVT FFHLPQGWDQRSLVQHHGSLFDQKLVKKNYAMDAYFIVLGQPTNNWHVKN ITYWYSMWSHRRDGLWKGFVQVDLDPAQDGPARAV >CT1343 hypothetical protein MKIPVCLQTHNRLGEKYVGDGGRELVRKNCTGFSGIESESEEHFLWYIQE VRVSAKGTA >CT1140 hypothetical protein MSNDELKGRALKPALSASGITGSPFLSRYDAFKHHQFSTKMKNKQDTTEF QSDDYDQRQVKKS >CT1643 hypothetical protein MDDPAIMWNAGNIRGRHPQLRLTKGFKSGEKSRVEVAVAVARTIGETNVS GADSGKDATMPSIQGHLALSTHSSSRRNRQPSLFQAITNRRNGIRRLTKR MKLSTRGHACSNCRCHWATSCFLPVSFSPERISTITGKASVRAATAPKPS VRTEAGLPCATRQARRRR >CT1481 hypothetical protein MSLTPKCSLIIRHYFFTAKSGGAKLINHSSHKSHKKNALYNAEGPPFQAA LLITNFRETGSYTNFE >CT1421 bchF, 2-vinyl bacteriochlorophyllide hydratase MPRYTPEQLEKRNASKWTTVQAILAPIQFLIFLAGLTVTYLYSQGIWVTD FWWVTFFVALKTFMLVLIFVTGGFFELEVFGKFAFAHEFFWEDFGSAIAM IVHISYFILFFWIKPAEHILILTAYLAYLSYLVNAAQFVIRLLLEKHNEK KLKASGAV >CT2014 bchJ, bacteriochlorophyll synthase, 23 kDa subunit MSSSPSRIGPNSIIQTVGALETAYGKNETEKLLKKIGQGYLINNLPSEMV EESKFHALVTALQKELGETATAGILKESGERTAKYLLKVRIPGPFQTIVK LLPAGLAFKVLLFAISKNAWTFAGSGEFSYGSKPSPNVMVKVTFPSHPVV SNFYLGTFTALLRELVSPKTEIKADIRKEGSAIRCNYLCKI >CT1826 bchY, chlorophyllide reductase, BchY subunit MHPQSMCPAFGGLRVLMRIDGAQVCMAADQGCLYGLTFVSHFYAARRSIV SPELMNAQISGGTMIDDVRCTIEKIAEDPSVRFIPVVSTCVAETAGIAEE LLPKRVGNADVLLVRLPAFQIRTHPEAKDVAVSSLVKRFGAFGEPKKGKT LVVLGEIFPVDAMMIGGVLQKIGVESVITLPGADLDDYVQAGRASACAVL HPFYERTAALFESAGVKIVGGNPIGANATGQWIERIGEALDLDPETVKTV AEEERQKAKGMMAGFAERMHGSVIVAGYEGNELPLVRLLLEAGLDVPYAS TSIARTALGEEDHRLLTMLGTEVRYRKYLEEDMEAVLEHKPDLVIGTTSL DSFAKEHGIPAIYYTNNISARPIFFASGAASVLGMIAGLLEKREIYGRMK EYFMPSA >CT0301 crtC, hydroxyneurosporene synthase CrtC MNITTDSLQQAWHRLDAPGSYEWWYFDAEDESEGISVVFIWFVGFAFSPY YLSHYEEWKAHRRDDQPYPLDYGGFSFQLYQDGRETINFIKEGGRELFAS EDGGIGVRFEGNRFVYDPLRDEYRLSIDFSFPARDRSVQASFSFRPLHRF DYHFDTDLHAGVDFRHQWVLSVPKAEVHGLLDITSLSSDKRQVLQFRGRG YHDHNLGTVPMYESIDRWYWGRTFSRRCDLIYYVVFLRGCSAEPQAVLML LDHKTGRQSTFDAVRVSESRFTRGLFAPVHGKTLRLEAEGVSVEVQHQKA LDTGPFYLRYTSLLSMMIGEEAQEEVRGISEFLNPAPLKSRLMQFFTASR VWRAGKQSAMYVLYNFFKHRFERVHRINRKKF 
>CT1942 csmA, chlorosome envelope protein A MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGS MGGGSLRGSSPEVSGYAVPTKEVESKFAK >CT2054 csmB, chlorosome envelope protein B MLSNNSKHIRIMSNGTNIDVAGAINTLAETFGKLFQMQIDVANTALKALA DVAEPLGKTATDLIGSFTGAATQVLQSVSSAIAPKK >CT2064 csmD, chlorosome envelope protein D MADEEKIDTMKSFDFAVKSITEAGVNQLNLISNTIQSAVPAVTNAAQSLT NAVSVSVKTVSEAAGALAGALGELGGAVANLAGALTNSAVSIAQSGVSAV TNAIGSVLQAKKI >CT2062 csmE, chlorosome envelope protein E MNNPRGAFVQGAEAYGRFLEVFIDGHWWVVGDALENIGKTTKRLGANAYP HLYGGSSGLKGSSPKYSGYATPSKEVKSRFEK >CT1046 csmF, chlorosome envelope protein F MANESGNIGVFGDLFTAVGDLAQQAVDMAGSALKTATDTVQPVTNACVQL CTTSINSATQLVEGATKAITTAIAPKQ >CT1499 fmoA, bacteriochlorophyll A protein MALFGSNDVTTAHSDYEIVLEGGSSSWGKVKARAKVNAPPASPLLPADCD VKLNVKPLDPAKGFVRISAVFESIVDSTKNKLTIEADIANETKERRISVG EGMVSVGDFSHTFSFEGSVVNLFYYRSDAVRRNVPNPIYMQGRQFHDILM KVPLDNNDLIDTWEGTVKAIGSTGAFNDWIRDFWFIGPAFTALNEGGQRI SRIEVNGLNTESGPKGPVGVSRWRFSHGGSGMVDSISRWAELFPSDKLNR PAQVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGM VGKFNNFNVDAQLKVVLPKGYKIRYAAPQYRSQNLEEYRWSGGAYARWVE HVCKGGVGQFEILYAQ >CT1639 pscC, photosystem P840 reaction center cytochrome c-551 MDKNSNGKLIALAVGGAVLMGALFFSVSFLTGYIPAPNHSAILTPLRSFM GWFLLIFCASIIIMGLGKMSSAISDKWFLSFPLSIFVIVMVMFLSLRVYW EKGRTTTVDGKYIRTTAELKEFLNKPAATSDVPPAPAGFDFDAAKKLVDV RCNKCHTLDSVADLFRTKYKKTGQVNLIVKRMQGFPGSGISDDDAKTIGI WLHEKF >CT0641 pscD, photosystem P840 reaction center protein PscD MQPQLSRPQTASNQVRKAVSGPWSGNAVHKAEKYFITSAKRDRDGKLQIE LVPASGRRKLSPTPEMIRRLIDGEIEIYILTTQPDIAIDMNKEIIDMENR YVIDFDKRGVKWTMREIPVFYHEGKGLCVELHNKIYTLDQFFK >CT1018 soxZ, sulfur oxidation protein SoxZ MKIKAVVQNNIVSVKVLIPHPMDTGRVKDQAGALIPAHFITEVTATIGGD TVFHAELGSGVSKDPYLSFQFKGAKAGDMLKVSWVDNKGASETAEAAITA M