TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 588

Free access
Sort by:

 



# Chlorobium tepidum TLS, TLS

>CT2272 hypothetical protein
MSVCDDRGLHIDMVSLFFQAKNSHFGRIFLKPLLYFKLRIFCRVSANAFA
AYRYAVSYNFAE
>CT0086 hypothetical protein
MIDQRTSYPSSMTRHLSPFWLNRLLGIPSSTPLTPRLMASTLRVAFCPLQ
EESPRLDHFITALRESFVECGVTIVEEAAGEGRDSRVEAGTALIAPGRFE
DHQLPISRVSTLYNNLIVGVYDEPPPVHGGQTPQERLDAVVGKLAWEMVH
LLIYVTENSWTVCSMNGGITTFDTPLPESRDVLESLIPKLTAQVVPPRDG
ELELRTGALDTSAPEFLEFAADFVECGRIWAGNERFMNHTSRESLDYRNG
FCRKIVSRYLDERNGMSYGFFARQLPVKVAPAIEADDLGGTSVGDALVPV
TIAGKQLLVPVPGVRILTTRSGCRKTAIDPRHDLVEIGLDNGHAWLVTPA
GLPEGLVSKPSFDTLTIIAHAIGNAMIASILLALAPGNRFPGLLARFGCG
MTHWHYYLDEEMIPDGYVVHGFDNPPVSCSTPQSAAYSLLGKLDALERAL
EGGTDYLGDVHIEPGHGTNVVGTRSLAKMAMFLNADSCVCSQREG
>CT1108 hypothetical protein
MEIKREFTNIRFPEYLFTLVHSHLNPDESYPDAE
>CT0332 hypothetical protein
MVKQLTKHGNSMAMVIDKPILELIGADADTPFEITTDGQALILTPLKNPK
GGEAFGVALEKVNTRYARALKKLAE
>CT1531 hypothetical protein
MARRVIVSQAASENRASLPDVRLAFQDFLVSRQHWSDRF
>CT1352 hypothetical protein
MLRSKEEWQETAESVLPPEERYVDRNRMITARYAGWYLENPGTLKWAGMA
AFASRQVGLAIMAADLMTAPERDGSGNPLLALHRFGVDWFMRADFEQIRR
GNNNIYRDIAWAHAAYVGGGMAELEACASEPEDTLLVKGFGMIDRGRALC
RRDAGSPAGERLIWEGNICLLRHEQVDVLQPIFDTLSVGGRIMASFGSEL
DFSGALFPDSRFRTSFSLFYGYLETLTGLKSVANPDDRWRWVEQSVIPSW
QAAERQMSAPCPTRNALQKMAACEQ
>CT1690 hypothetical protein
MLAQPFDDIPLKLRQLALLLLQQRFDVEVLELLQVVQAYSAVFIPVAGFS
GMIFETTGRTTMGRRSCDGFFSPQIWQACACVSFVLMVVSR
>CT1950 hypothetical protein
MHTNRQRPDERTKNTALFIKKMKILIHFCFIAESAILLKPC
>CT0025 hypothetical protein
MPRRIKPCYLELLLEIGNFSHSAQLYRYIYLRKIYLRKHPFSRI
>CT0926 hypothetical protein
MVKVNCTTLVPYFFMLKVWEDWIDTLPIRDHELGSCLPMNRARRMRTFLP
R
>CT1941 hypothetical protein
MTASGSQQQKRRKAATKEELRRKACKQKSSEGGKPVTASENRLNGLLIWQ
TSIQLLW
>CT0583 hypothetical protein
MHLSIPLKQMSVEEKLQAIEEIWADLASTPANIPSPAWHADVLQVREERI
AEGRAQFLDIEEAKKAVRERLG
>CT1329 hypothetical protein
MISKISFDRKTIPVVRSGSGIDNAASRPVLYSVLFAPNIKTIFKRFAIFC
GFGVFVVSGQMRGTHSKKTEYYE
>CT0220 hypothetical protein
MTLVLNITAIVLFTGLSLLVKYRIKRWKQKQLRQQNDVWSIGLYEGPDPV
TLSPAAGIRNPILTAKEVTDAPARFIADPFMIERDGAFHLFFELLNTKRK
MGEIGHAVSDDLKTWRYSHVVLRERFHLSYPYVFEHDGEVYMIPECAKSK
SIRLYRAASFPDDWRPIATLLSGNKREVALLDPSIIFHDGHWYLFSYMRK
VNNLHLHVAETLTGPWREHPASPVVKNSDHFARPGGRVVKNGAALYRFAQ
DGQPRYGSKVWGFRITELTPTAYREEAVSDTPVVQEGNEVWNGRGMHTVD
PHRMPDGRWIALVDGLEDKLRS
>CT1263 hypothetical protein
MVKHIVMWRLCDEAHGNSAQVNARLMKEKLEALSGRIPGLLSIEVGLDFS
CTDSSADVVLYSEFANKAALETYQSHPEHEALKPFIGGATRERRLVDYEC
>CT0738 conserved hypothetical protein
MRKGWEMFKGNIGEFIGFTLICFAASIVSSKMAAFGSLLFSAIAAPLYAG
YTIAAFRIMTGQELQFSDFFKGFNYFLPLFLAGLASGILVAVGLVLLILP
GIYLAIGYMLATFLIIDHGMEFWQAMETSRKIITKNWFAFFVFAVVLFLV
NVLGALALGVGLLVSAPVTACATAIAYKEIVGLHSAEW
>CT1884 hypothetical protein
MRPPWFCVEPLMFTEHKFQSVEPARNEKKPARHVMDTGRNRP
>CT0736 hypothetical protein
MLILLGASIAALGLLIMLVQKSGGNGWLGWFGHLPFDIHIEKENFRLYFP
LGSSIVLSIILSLVIGLINKFFR
>CT0689 hypothetical protein
MKTENIGTNFDDFLQEEGLLDEANAVAIKRVIVWQIGQEMKAQKLNIIAR
NDDTEKEISF
>CT1869 hypothetical protein
MRFILTVIAAIPEQRPRKNTITVKQWHERKNPIKKRDTAAKRDALLSN
>CT2031 hypothetical protein
MLHYTIKLVFVISYEQGNKTNREMLFFQKFWLRRQSSNLYEALQNWTSPV
SGTSGTFFTTSPAHPSSSTLLSSPKKESARY
>CT0277 hypothetical protein
MNRTTEQRNAEVMKTIGLLDQMPRVEVDHLFRVRLMQRIEAMEVKKTSWS
ALPGGAFNPRLAFMALLLMLNIASALMLFMHGTPQATGSSGAIAESLTED
YGGPALSYYDDQTTIDR
>CT1047 hypothetical protein
MLSIANSMQGQSSRSCLLPRPKRSARQAVQLRYLLAR
>CT1385 hypothetical protein
MMEWSSNGSSGIDSDATALRDRQITANLLAIAILEPLFYFAVTFLITYSG
HRKLHIFAAPAHKNELANAKIRPFCSQGNRRPG
>CT1372 hypothetical protein
MKRVSIIAFPELETKKRQCLHNRVACCIFVAI
>CT2120 hypothetical protein
MPVMDRAAMRTMATLQPSCMRWLCTTPSILDAAKAGWPISWAATPDSPAT
VTIRQFQASTCCRKKTFDVVVNTDVLEHVPEAELDCVLRDFRKLSTNAII
IPHLAKATRILPNGENAHCTIKTPSEWAQVFKRHYAHVYELPHHSAVHAL
FLCGDQERDVTALRGILEQYVAAKNEVRHHLLPLGKRIEKAIRLIRGKDI
NR
>CT0869 hypothetical protein
MKKCEPPPINYGDGLGWCTPYMYVCFNDECKLYVNGWANLKNNYNKIASY
RCMCYPDNGMFDAMCVFSPDGLKGQIVEE
>CT1914 hypothetical protein
MSFEGEFASYEPLRRLLDSEKVKSLQNRLKIRQQEEETEDFEGSIVKKSD
LTESTLQPDLVLAIDGSNLAAKAENGFPGAEFGYITIASVLIDLKLIGEL
EKKEFVEPKKFRETEKASTIESVFPGCNVILDTEKNAKSSLRRALFEELR
SNTIFSDGESLLDTYEHLFRIKREHFQERNLPRSPIEGVEEEMTYDFGEY
TCPHSGEPLFSTDALRLHELMNPGGSNGEMFGQIMSTLEKLWLVHILRAF
ERKGWLATLRRVAFIMDGPLAVFSTSSWLTKVISHELTRLNDLQRKINGQ
DLLIIGIEKSGTFFNHFIEIDTTKDGVTDKFPKQSALLLNDGYIKRNIIF
SESIKPYGQDTYFGRKLFYKAASGQKIVPVVACFNEYQRNLNTANPDQFT
RLADVMNLLDLLVSSRYPNSVSPLVSAHAEAAIPLNLGKRIFEDIAREIR
EKSKE
>CT0298 hypothetical protein
MVSMDRPALIFPELAQILLLLPNAVRIFLNSMRFESALGISARLDFRKMR
LTSQSAEII
>CT1119 lipoprotein, putative
MKTVMLMLTSLLMAGCSVLGKREAAEPPYELLKHDGAFEVRRYGPMVIAE
TILDEKSYSAASGKGFNRLAGYIFGKNRSKTSISMTAPVLQERSSEKISM
TAPVLQQPQKGGWSMAFVLPEGFTLQSAPEPLDPEVKLRELPPSTIAVVT
FSGLHSAANLEKYSRQLQAWLKKQGYRALSEPKLASYDPPWTIPFLRRNE
VQIRIEPDHGESGKE
>CT1030 conserved hypothetical protein
MTTNFLAILGGFALGLFYFGGLWLTVRKGLFSPHPALLFLTSTLLRTASV
IGGFLLISSGDPVRLLFAVGGFVAAKVASIAFGRRNSAPEHRDKEETPCI
>CT0486 hypothetical protein
MLKGPEREFVANGCRAASPDAAEIARIAKHASGKPASAPPPLPAEFMLRV
CTRDDVEAMAGIYREVFSTYPFPIHDSVWLLETMQRAISTTSASSTKVVS
SRWPPRRWKHERLAQASGVKAGLFSVADSVPF
>CT1940 conserved hypothetical protein
MANISLYISGQTELDDVSEFFQKRLIGKGETPIAFFDGVFYESHQERVGN
IVYQDYLIYTDKALYLWARGASKDFLDRFSLGAVSVNSRNKDSAFATMNL
KIRREGKEPIYVIFDMVEIREADTIVRLQTLVESIIEDYLGINYRQEIPQ
DTADRIFQAARTLCPPRQIALQLDTPQAPTPDAGIGYGQDLLEQYRASAG
YEQPQMPYPPYQPSGTGGASPRSMGSQDAMRGLESMLPADPASLKRIAEQ
IKNMVGDAPFKMRDQVMKDLQHVPGDMATVLNALNELLSNIAGNPMAERF
VMNAIKTAVANDGVLGSLSKIIKMTGFGSGGGKKSSRPPANESPREERST
AKERKSPFVDEEPDEDGSTIRRKKISIKDDDNHTGADLFAGNDEPITSRE
SRPPVSRSSVPDDSDSGAEPGGRRKKLSIKMEDDNEIARKLMSYDEPERE
APSPAPVEASSAVSTENGSGIRRKKIAIKAEEGSGAEADIARKLMDYDEA
SRSAVTSALSGEVPGSVSSEEPSEPEPLRRKKIQIKADVEQESEPIVKAE
PEEPTRKKIQIKTETEPELPVVSPELEIKSMLEPEPQLEVPVQKASQIEA
EPEEEIDISEEVIFSALGEIEEMEGSSISQEYMIIESDIPVRRAVSEPVK
EPVSEPVIEKKPEQVRHSSGKVSHGKRRGR
>CT1684 hypothetical protein
MFLLHEQFARNFLAAISAFHFLHPEKPIAFIVFSSKCIWNV
>CT0428 hypothetical protein
MLKMYVDYWVAVLSGFLQQYFGVKSQKGVTMIEYALIAALIAVAVIAVLL
TVGSNLKTVFSYVGSNLTT
>CT2108 hypothetical protein
MCRLMTSIWQKNREPAPDVPRYARIIGTEPP
>CT0573 hypothetical protein
MIPALKLPEATPLQSWMLRFAQHDRKEWCSSNFYRFLRERFFSKTQPHKS
SFHNQALQSLETRSRMVKYQRSPILVK
>CT1668 hypothetical protein
MYWNLDLARYIADAPWPVTKDELIDYANRTGAPQQVIENLENLPDSDELY
ETLEEIWPDYPTDEDFGYSDEEPLN
>CT0880 hypothetical protein
MQFLKVYSVFGEEVVGCHVRIENLSVACSVVSAGGTHPSEKCVCNHEFSS
VSLHQKKHPHLEKLLETSRPIKLDSSSKVLILSDLHMGNGGRRDEFRRNS
ELVRSMLQDYYLPGGYSLVLNGDIEELFKFSVEDITKVWGHIYDLFLQFE
KNGFFWKTYGNHDSDLFEERNYPLSKHLLESIRFQYGDEVMLLFHGHQAS
ILLWETYPLVSRAVVLFLRYVAKPIGIRNFSTAYNSRRRFAIEKSIYEFS
NQAKIVSIIGHTHRPLFESLSKVDHLKYKIEELCRQYPSALPEERLAIQE
RIGELKAMLDACFTEGKKIGLRSGRYNNIAIPSVFNSGCAIGKRGVTALE
IDGDRIRLVYWFKEKQGRRFVSDRNSEPEQLGDTGFYRIVLNEDHLDYVF
SRIRLLA
>CT1154 hypothetical protein
MRKKILFVCGSMNQTTQMHQISEHLREYDQWFTPFFSDGLLGKASDLGML
EFTIMGKKRASKAIDYLTSHNLQLDIGGTLHRYDLVVTCTDLIVPKHIKR
TKIVLVQEGMTEPETILFHLARNFRWVPRWIAGTAMTGLSDTYEKFCVAS
EGYRDLFISKGVRPEKIEVTSIPNFDNCERFLENDFEHRDYVLVCTSDNR
ETFIYENRKRNIRKYLDMADGRQLLFKLHPNENVVRATREIELYAPGSIV
YAEGKTEEMIANSQMMIAQFSSTIFVGSALNKPVYCGLEPDYLKRLTPLQ
NRSAARKIAEVCREVIEK
>CT1310 hypothetical protein
MVSPFLRREKNFLMNAGIGGVASSYGVKKDPY
>CT0442 hypothetical protein
MAGNAMKTQAHWLNWKHLKVYSSLLLALFLIYGIGGVYFSKNMVDAGGHP
LGLDFIAFWGASYLALAGHAQDAYNIPLLFKAQQIGVPAAKVSYPWFYPP
SYFLVILPLALLPYLAAYGTFMLSTLGGYLLVFRRIIRGKTAMWCLAGFS
GLWMNFFDGQNGFLTAALAGAALLNFERRPVLAGVFIGLLAIKPHLAMLF
PVALLAIGAWRTLITAAVTAITFMAAGTAILGTAVLKAFLASLGDARLFM
ENTHLLWNKVPSVFAFLRLLGTSATWAYAVQFAVAVVAVIIVWRVWRHCR
NRNLRNAVLMTATFLVSPYAFYYDLAWLAFPIAWLALDGLRNGWLRGERE
VLVAAWLLPLMMVLIAAMLKVQVGPLVLGSLLWMTYRRATTASMTGAPAS
AAPAKISSRLYSKRCYSLANTSTMAKKSEHAAERKTMNADAHWLNRERLI
FYSRIFLALFFGIGVGLVVTSKHMVTGDFVLAWAASHLALTGHALDAYSI
PSLIKAQQIAEPGPQDVYGWFYPPSYYLLILPLALLPYAAAYWSFMLSTL
GGYLLVFRRIIRDKTAMWCLAGFSGLWMNFFDGQNGFLTAALAGAALLNL
ERRPVLAGVFIGLLAIKPHLAMLFPVALLAIGAWRTLITAAVTAITFMAV
GTAILGTAVLKAFLASLGDARHLCLENGSLLWSKMPSVFAFMRLLGTPVT
WAYVAHFIVAVVAVIAVWRVWRNCQNRNLRGASLMTATFLVSPYVLFYDL
AWLAFPIAWLALDGLRYGWQRGERAVLAAAWLLPLLMMVQIIAHLNVQVG
PLVLCSLLWMTYRRATTASMTGAPASAAPAKISL
>CT1366 hypothetical protein
MKKQQQLDLLEKALVSTGHKIRYEKGSFVGGDCRVKENMIVVVNKFLPIE
GKIATLAAVLRKINPPALSPDVVKIIDTVVPTNLFSRENI
>CT2027 hypothetical protein
MLRTSWIKKDMKKAEGLFSEVCFLKNKFVDFRPEQ
>CT0012 hypothetical protein
MSLNFSGPNAQFIDFDSDFNFDRNLFADFSQ
>CT1433 hypothetical protein
MQEWIEYRCQTHKKASLKKRLFKAVCSYFSNLILSAAR
>CT0045 hypothetical protein
MVVVFRRFVALYDRFELLWEDQRTERMAANLLILAFAFSLALIELARQGL
LPAEVAARVPKSHYYAINTALSMLLGLEVFGLVFSIAKSVSASVGKQFEI
LSLILLRHSFSELVHFSEPLDATEASLPVLFMLAYAGGGLLIFLLLGVYY
RLQHHRAITRDKEAAADFIVVKKIIALLLLLIFFVIGVIDGLKYLNGNAT
NEFFSLFFTILIFSDVLLVLISLRYSSSYPVVFRNSGFAVATLLIRLALT
APPWFSVLLGIGAIAFSIGITFAYNRFENLEIQRSAPI
>CT2051 conserved hypothetical protein
MEVVGRSRAKAIIETHLNDSLVYFSDHVRILKCNPYCTGVTYLKEHGVYQ
WIFQVNDPRENPITAVFFVTQNEEHLDSSRTVPAGPVSEEESFPDSAVSG
RCIQWVNAPQVPDVPLKEKNTFVGQANTRICLYPLEDRRTEVHFETDITL
DFELSFPLNLMPEGILKFMTEAVMSKIMQQATESMLCQVQSDLCCCTTAE
LDASGGKV
>CT2017 hypothetical protein
MMLIPYPFGLHESMYVSSNENTETILCIDLRFLQPKSTPNVTRHFKAAR
>CT0475 hypothetical protein
MEKKAVQTSKKGQHVFFLRSSRRIIPSSVAFTFVPCFPAESHSNPHITSN
RKNNVQDCFRYFFSRKY
>CT0135 hypothetical protein
MKIGCRHFLVKIYCLGEVRNGLGVMTVHVFDYAAIVVTCCLEQGVTEFDN
HGVVGDCSLELVQLLPYCGPHLVGHDVSVVIIYSLGV
>CT1202 hypothetical protein
MPSPMKFFTSIIESIKLLFAGIWLVFRIILEYFGIISDGNDRTTGIKDMR
EEYKKANYR
>CT0506 hypothetical protein
MLSCAVSKQFLRREKNFERKLSGPRINKQEGNKEASEKRNKKTKSK
>CT1562 hypothetical protein
MVTRLTEFVIHYGIQVFLAGPGLDLPRNRFALFINPIKKTRKQVIDDISM
>CT0533 hypothetical protein
MKRSFSEAEPESIEYAFSMEHQGFLVIADNSV
>CT0628 hypothetical protein
MNSGNFGPLDEDLLIQSIKTSFPEAHIALKFLSCLR
>CT0957 hypothetical protein
MKFSSTSTLFLLLLSCLCAPSRNGMCKEQPQQILVMWWNVENLFDTKNDP
KVDDQEFTPMGKAHWTEKKLLLKRLRIAQVFNAIRAEREYGKYPDIVAFA
ETENRQVFAGTLAALDRATYAIDYHESPDPRGIDIGLAWNPATVKFTGSK
PYKVRLNNRRGTRFVIAAGFTAASNHFTIVLNHWPSRSFDTQWSETNRIA
AARVARHIVDSLRTCNPQSEIIVMGDFNDQPENHSVKDVLGSSFDRKAVR
HASSRLLYNCWNEASSPGSYFYRNHWEQIDQMLVSAALLDEKGLSIDKTS
FRVFSIPAMFDRFGKGLYSTYKQGKFKGGYSDHLPLLLKVRIKP
>CT0022 hypothetical protein
MILLLVSENLEHQINTVKRDEGYHEIDRLDHTQQIDDQHQRHDNQKPECD
TAENDERENLRLLVKSVLKEQVPREAVENNHKPGNENRVDVNRIVCRAPV
NTPP
>CT1107 hypothetical protein
MNSRFISMKKQGISEMFTIPMFCRFAYLSA
>CT0074 hypothetical protein
MKKNVGHQDRNSLFSVGVPSSEIIWRSALKMINFVSLNRHSLIEQQ
>CT0793 hypothetical protein
MSDQVSTAPCDHEWFDSYQRFSSFSPESVRDGCHPGIMEHLQSSQKAIVL
VHGLSDSPYFMAAINPKIRGTRYLFSAVFDHAVTTMDAHRPFFLPIFPPT
AESNSYASIATPAS
>CT1212 hypothetical protein
MMLTSDRGVMMNNGVLWIKPVMKWMGLTAFLVIAGCSQFKTVTREPGVLG
DRVALTPEMTGLYEEVASNASTVRALDGYADLYLETPKRKAKAYCTVQIQ
KSRDARMIVTAGILGWPVADLLIRPDSLFVNDMLNNRMLVGRNNGENMGK
IIGVNAGFGRMIETLFGIADVPEPAKNIESVRKGSGRVSFTVKSGNGTKE
LVVDPLTRELTGLVYFDQSGRKSVEFRFAAYQSQVDKNGAELRVPREIDM
ILYREDDPEGSRSLKVVYDERVINPPDFNITFKWPARAKTVNLDEVERLP
WL
>CT2208 hypothetical protein
MRHCSGKRLSRRCFRVERKSHWISENQMRSMCSAPCRKLVNPDFFENAPQ
EGTDAAQAEKRRTDAKLRAAHYLKARARQKRFE
>CT1024 hypothetical protein
MLICPSENRPNAFFTFWTGTPLTGSENLASTEFPRFIASYHSASFRTG
>CT1927 conserved hypothetical protein
MSPTVFCEQGFRFFFFLREEKRMHVHVISGDGEAKFWLEPELELAKNHGY
SRIQLKQIESIVEAHSDELVKAWRKHFSS
>CT0059 hypothetical protein
MAYASFDHRQVPLFEYVLRILPVGAWRRGRTNVNEVYRRQCSNRTAFMLP
RVDEVAAVGALFILHVLHEAIIIHLSGKIMSRQVFSKTAKGLYLISLASI
LGACNHKAPEQQNTAPKPVSAAADNATAVTIESGKGSVEITDAVKPWPDD
APADVPRYPYGTIRKIIRTETPEGNSWDMAIERLPEHALLDYEAVLKAKG
FETTSMIVPEKEGDRGSVTGIKGAITVVLIGSGGSMSLSIIQKQ
>CT1133 CRISPR-associated protein, CT1133 family
MSGKRSYQSARLGERGGEKMILQALYDYYQRKAADPESGIAPEGFEWKEI
PFIIVIDREGNFVSLEDTREGDGKKKKAKPYLLPKSVGRTGSNSYKTSFL
LWDHYGYVLGHSRSESDKDQAMAEKQMPSFIEKLRSLPENVKGDDGVLAV
IRFYEKGEYKKVKESDNWGECTKIIGCNMSFRLDGEVDLVPCRDAVKRYI
ETQIGESADDAVGLCLVTGKKAAIARIHSDTPINKDSKKFVSFQKNSGYD
SYGKEQAFNAPISESAVFAYTTALNMLLGKNSKNKVQVGDATTVFWSEKQ
DVFEEDFPAFFGYSKDDPDADVRAVKALYEGIKSGHAQMDSKTRFYVLGL
APNSARISVRFWHTGTIAEFAGNIRQHFDDLEIIRSPKDSGHFSMFWLLS
AMAHEGKVDNVPPNLSGQIFQSVITGGLYPATMLQQAIRRIRATQEVTRI
QASILKACLNRFSRIYNTKAKEITVALDPTNNNPGYRLGRLFAVLEKIQE
EASPGLNATIRDRFYGAASSTPVTVFPQLLKLKNHHLSKLDNAGRRVNFE
RMLAGVFEGIGNEMPSHLSMEDQARFAIGYYHQRQDFFKKKDSENNN
>CT0571 hypothetical protein
MFAQELFRFPTLSAGTIMRSPLLRIVAISALFCAQPFQPEANAWHDKTHL
TIAEAAGFDLWYSAAAPDVAKSKEMFSPVESPNHYYNNNANKRVTPEMVM
AQVERYNRPNDDEGHLYGAIIGSVREYQSMKKSGKYAKYPLVYCAHYCGD
LSMPLHNTRYDDFNKERHSINDGIIENSVRHNIGYIQRMMRPPVIDSEAD
LAREIAAVAESARKLGMKMRKENRDMTVDEAYTQVTRSASLFNAILAWLE
RTQKTAGERTVTVTN
>CT0915 hypothetical protein
MLPMVTCAMAPDGGIDPDGHVIRFFAMKMVQACWSLVWFCG
>CT2268 succinate dehydrogenase, cytochrome subunit, putative
MNFAGCAASSPVTSGLSVMDGSARRTFSSITSKVVMALAGLFLLVFLAVH
LGINMLLLVDDGGKSFSAAAGFMSSYPVIRVFELALFGGFALHIAFGVIV
SIRNRMSRPIRYQHRSRSETSPFSKYMLHSGIVVLIFLGLHFIDFYFIKL
GIVAPPPGVARHDFYSRAVLLFSDRTSSSIYMVAFVFLGFHLNHALQAAI
QTLGLNHTRHAAAIQAVSTVYAIVIAGGFMAIPLRFTLFN
>CT1917 hypothetical protein
MLLRFFRQSTPLQKPGIARFFAFLGGDEKGNPVASFPDLFARRNRFFTFN
ECVG
>CT0585 hypothetical protein
MYYRVDGGIARVRAILDCRRDPEWITKRLG
>CT0685 hypothetical protein
MLGHGRIYTCEKGCLFSSRQLVKANGNPYGQSVMPVKLHQNGRITPAIRR
AIQSSSLSASQLAARHGVANLKALMPVGVKYLPKMPDGPSRRFNVHSSP
>CT0811 hypothetical protein
METPTSRELVSLLFYLRIEISLNNPEVMDSTMKHRLENMLGYLESESYLM
AYRTLNAIVTENEASGELPSLETSTALEVMQTCLRIIVGERVGHPEVAKH
FAQTVSFYERLALLLTKKLLGDDSAAAEVDILLFCHDALAKHRRN
>CT0885 hypothetical protein
MGTLDGTGDIDCSNRGLTSLEGCPEIVEGSFNCSGNRLTTLEGAPRITGS
FDCSGNEIVSLEGGPEKVNGDFNCSSNQLSCLKGGPSKVKGDFLCSGNRL
ISLLGAPKKVKGYFDCSDNQLVSLYGGPIETGAFNCSGNRLRSLLGAPDE
VHAGFDCSSNLLVSLDGAPEFVNGDFSCANNLLENLAHGPVEVSGNFNCS
GNRLMNLKRFPKRVEGELDCSGNPILACDVTGPESDRNCIRVVHGGTVRC
CCRKTDSSQLEALSS
>CT0218 hypothetical protein
MQQTFFYYNYENYLLTFLTIVTSVLDLTDSPLLGKGSSRLCYLHTPMTPT
SASRSLIPVILMSKSRS
>CT0228 hypothetical protein
MRVSLRHFRYGASGDLLFGQAARLFAFGIGSARAAKSDKKAM
>CT1458 hypothetical protein
MPALPPPPQYPPTRCSRSRAQRYLAAFRFCCFILIPTPMKTEVKVIIISG
IVTIIVAIINGNDNIHAGGNVTMHGAKK
>CT1518 hypothetical protein
MLSRRSFFDVNIRKKPSTHFWLDKIHKRGKKSGVK
>CT0471 hypothetical protein
MPGDSTNFIFMLSPTTSTFDMQKKATLLVSALMLSSTPLFAAMPLVTDDT
GTQGAGHGQIEIGFESTSDKETEAGVSCKETGGAISATFSYGLTDNIDLV
VGLPWEWDTVKENGLKVADENGIGDLALQIKWRFYELPDSGFNLAIKPGL
TIPTGDENKGFGTGKVSGDVTLIATREAKLATFHVNLGYSRNAYKLDEIS
ESSRKNIWHASMATELNVTDKLRAVGDIGIETNSDKDSDTDPAWILGGLI
YAVNDNTDLDIGIKGGLNDAETDTTLLAGVTMRF
>CT1796 hypothetical protein
MKELSFGFALRVVKLCRFLEKEKKEYVLSRQLLKSGTAIGALIREAQQAE
SRADFIHKLSIALKEAHETEYWIDLLYQSQLIEKKGYESIKSTKNRRDNG
>CT1120 hypothetical protein
MNPAKSDRRFAIGTRGIVSRINDLADQPDAKETLKI
>CT1843 hypothetical protein
MVLDNRVAVAAGDWVCRVLHPASPETQTVFLDELAEAMAGSGKAEVVTVA
EAVEIISAVSSPYLSPRWLAANISNATRPGRGTAFSARLSEPGSIGSVCL
GTL
>CT0733 hypothetical protein
MEAKNLKETPTVVKRVLESYTSLEGAVFFVQF
>CT0490 hypothetical protein
MYCSVVVYNGIHHPNKDVRVFLKSYVLLFEFIAKCFQVAASDWPFFIWSG
RGTIG
>CT2197 hypothetical protein
MCLRSFDRNLKQTHSKSSNKKNNPAFILDR
>CT0023 hypothetical protein
MPPKEMTMLDNDRKKDKFSEQFGGTVRALSEYLGIGIQIAASFALFVFLG
YWSDSKLGTSPLLLLAGVLVGMVGMALVLMKTIRQADREHDRLHQHTRNH
EKDRRT
>CT1393 conserved hypothetical protein, truncation
MTSYKPEYLIPNLLDLVAEGEGLRIEFKRLIHSAPKIARSITAFANTSGG
VILIGVDDDRRIVGIQSEKEALQVIDEAMRFHIEPKPRIEVHFEEFKRRM
VLLVDIPKSPERPHFHIEPLIRRDTGKHGVERRVFIRDGSHNKAASDDRI
ELMLSSREPLKVAFTGRERCLLDWLNEHDRITAEEFADSAGIPMKEARRI
LVSLVRAGALRLDTANGDNSYTLAHR
>CT0690 hypothetical protein
MKYLDVVKKADTIGLFTIKNDIDMESVQKDYEAYKLNPE
>CT1283 hypothetical protein
MKKLFILFAFVFLAACGSTSSIQDKEGKSTKIDLSMYDNVVILDFTDATK
KHNMPAFAGRNFADRIAASVKEKGVFKVVSREPLADKSIVVSGTITKYEE
GNGALRLLIGFGAGSSYFNANVHFTDSLNQQELGKVFVDKQSWALGGIAA
STQTVDGYMNEAAKKIAKELADAKNYHCEPNTSAQTETK
>CT1617 hypothetical protein
MQNGSNSIHFDTVVMRPSCRKKVSAKALGFRYWSQTQQPAGKRRQ
>CT1912 hypothetical protein
MSLQSLSSFLKVIADLLQTLKVCLRQFDLFLQVLQVWKSIRITLNDFSDI
ELQPIVLEYYFLKLMM
>CT1355 hypothetical protein
MMVCLLCHTGAHEKSFFNGYDGRALYEQVQFLLLSIMVVQNFL
>CT2226 hypothetical protein
MLNSMKSLRLHRSCTVFFFVIEIIRQNMLCKPFYGVFPWSFAG
>CT0491 hypothetical protein
MLVKAKDYGIACVGITDYFLINSYKRLIELINDDSRLNALLNPPYADYAK
QLLVLPNIEFRSSTIVRHVDIEGKTATREPIFTSFSPTQFRRKRLKRIFF
ES
>CT0456 conserved hypothetical protein
MLEQIRNTHPLVYQNFSALPDGEKHLRSILAIDRYWEKLDLPVPDVILAG
SRLPVDACVEEACDILYAGGTLGLLHAAVMSKKYGRKVLVIDRAEPGRTT
RDWNISRGELLRLADTGVFTSEELDSTIVRMYKTGWVEFHAPAERRKRLY
MDEVLDCAVDADRLLGMACKKVLAGGGSKVLGHTSFVCCYQFPDHLVVQV
EELSGKPRYFRTQVLVDAMGIVSPVAMQLNRGRPQTHVCPTVGTIASGFE
NADFEVGEILASTEDAEVSGKRGRQLIWEGFPAKGDEYITYLFFYDKVDS
PNDKSLLGLFEAYFRKLPEYKKPGPNFTIHRPVFGIIPAYFHDGAGCTRV
VSGERIALLGDAASLASPLTFCGFGSVVRNLDRMTSGLDRAMREGRLGAA
ELANISAYEPNVASMATLMKYMCYDPETDEPGFVNEMMNEVMIVLDELPQ
RYRQAMFRDEMKVEELVTVMLKVAWRYPKILKATWNKLGVGGSVGFVKNL
AGWAISQNEKRG
>CT0739 hypothetical protein
MSKSAGSTFCPKSNGAMIFLLFFSSLLPMVAIRKNSLSFPIHRYRAQGYV
LLDVIYSSLVRAQSVIPYSGGAALGSHYCRNCVPLYIVSGLRS
>CT2143 hypothetical protein
MVVLCFDFFIVYQNMSHCSIKSLQKNYNYATGSRILKTAGFIGILTVFQL
PVRYLLTTLFMDMAFRYVRWPDCELVSMKGV
>CT0765 hypothetical protein
MKLFPDEDKKKNFMKRGLPVVLAVVWTPIIWMVLAAFLGPAMERVIGVWQ
VTVAILAVATLLAMVALIRLFKTLGLKIFDNIG
>CT0596 hypothetical protein
MQQTLSLPSIHHNQRKNAALLAIRISNSFLALSPQQKSNCIVRFYPAKDE
LIF
>CT0382 hypothetical protein
MRTDKLYRTRENRNWSKKRGIRDAVKGNFGQAQQRYGPFGQEQHDGGLGH
FSGDESGSAVQPAFLRRFEWRCWLASRIENIHRIRSPQAAAMHCCLSAYR
LAWQGLIINIATSSWACSIDFFPDMPFEAWTSNMTYLRTDSG
>CT0871 hypothetical protein
MFLFRECNHDNSSREFLQQHMTNIPPFFPVPGQEKRQTKPPK
>CT2195 hypothetical protein
MSDSHLLNTRQNLPRERFSLGIESGLRHFLKSVPPPRLVQRALWSDHAPG
TMPVKKS
>CT1014 hypothetical protein
MTFMSVGTLKLKERQALCLRIDAEQIQPVLEGDFEDMELCGLLLGPMERC
PKLPVERQGSRQGSDDACRRSPIPVLYLLYRDQMSSSSIVSEWAHWIPAL
IMEALEASWFRRASTIMTC
>CT0749 hypothetical protein
MTVSVAQLILKYIEEDKFLDAIQCVQNEILKIEVKPELAGADRRQIKNLT
AIMDKLSEAAMFGSEWDEGRRAKKAAIVKLQKVSAA
>CT1407 hypothetical protein
MVLFSRVLTLLPLDRYILFCCPGFQAGVRREKVSLKVETL
>CT1071 conserved hypothetical protein
MNAAHQVAFPDFGSLILFDLLVVITAIALSRDFNEDGINDLSGFGKDTLI
VQGGVEPFEEKSFNHAFLDQALTKLPDGFGIENPVAGFESQKALETEPIG
NLVFHLIIRKTVEALQDEELEHRCPVKRRSAHFAQIGRLLECDLKNWAEE
IPVDMLFQFHQWIFELGQTLRKKILVEKAQGIDVLHGNEVD
>CT0893 hypothetical protein
MRSSMFMQSRHNTKQHNTQKRSIMKKTLFLLGLATAMGFNNAQAVDWNWN
GDIRFRYDSSKTELASSPDKPADDRYRLRARFGVSPTINDELSAGLRLAT
GDGKNPTSTNQTLGKDFADKAIWLDEAYINYHPKALEGKVNVLLGKRDIA
KTFNVVKDLVWDSDVTIEGATLQYGKDVSGKQKSGPSLIAGYYTLENYAT
ASDPCIFAVQGAYMGTVSGADFNLGASYFDYVHMKNVAWWNSPNGPANDG
KDFRILEVFGTLGGKLGGSLPATLYAQYAHNTALNSDNNAVLAGLKLGSD
KKPGGWTLDGGYFYIEKYAVTPLTDGDRPMSSKYSTDIKGLKIGATYQLV
QNMTLGATYFHVNPVDSSLTGSTSDHKNLVQADVAVNF
>CT0921 hypothetical protein
MLPQIALVESRLPHRDDEGPSVGSPEIVHSPSGDLPIAASLQSTPACSNS
RTGGGNSLEAPSMHQQLLKASRRRRFRGSAR
>CT0210 hypothetical protein
MFPMFFPVSYLQKGFALRIAYSGAVRPFLTVFRDKG
>CT0190 hypothetical protein
MTVTHILLDDQNPLHRELPIYRSGKINTVRLADKSYKIYDSMEISAHDYT
ALFYYGVIEQLNALPFISESNNGLDSWDEAFLPSGAIGRMIEIIDECVGE
IRGKSPEKVMLGWQDDPERIAYWREIDPAETLGFLRDFQKFAAKAAKEGY
DLEFIL
>CT0492 hypothetical protein
MITQILNNVNDYMLFSAFLRTKSRRDGNWVG
>CT0234 hypothetical protein
MWTQDRETAFSGEPCRIVWPRAAEPKKGRVAKNGENPRTARLPQVLQTWF
RQTEDHDTVNRGVPLKTKPEYKKNILLCTGINR
>CT0027 hypothetical protein
MLSSLSKLTPEQLEEIRSLEQQLGKTLLSFSEYDVVSDDLSEADLATIKA
LEEKLGTMLVAVRGLSQNA
>CT0513 hypothetical protein
MTLSACTQRRKQTSQAATLHASLRPKLRKTLEHRALAIRL
>CT1916 hypothetical protein
MCHLRVGCLFYTDAQLAYMRYSIAPYIPYQTNKRNKVARYCPVFSCRVYY
KSSNGLLILSKLCWLHSPHPHPRQPNDRNSAKRLTAAALHLRIRAARSLW
KYLREAFAVGER
>CT0575 hypothetical protein
MSKTALLPTVTAVRMVWAMSGKFSNDAPSQTMATYVEDVLKGIEEQAPQI
SSDERSYVTSAVATMKASLRSIDVISKGRDLNFKENEKLRSAYLESVKES
LDFGNKAQDFLKSLPAMTIAGAGGVTVAQYFFKASTFELWGFGLILTGIG
YFFNQYIVLWVRRKKQMLYVTQDYERGLYYEQYLDRVRLVLLALFLDLER
IHKRVFRENYEADTTSVAVQSIIDDILSGVRSTFCPYAYKHMAEKKVTPE
LWTLCESGIRKAVENCPLWEGGQQSENRIDSL
>CT1438 hypothetical protein
MKFQALRYGEFFQGKISLYHHKRPLIVRFLECINKGTQEWLVDFYPGNGL
ASGMVLAIDHDNQHVLCEITGKKGRGRYIRVLNPKITIEELWQQSELALA
DRLER
>CT1496 hypothetical protein
MQRISHRPMFTIRNSLFIFLCVMAGMSLPCQQPAAMAAKISYASEIVKDV
REDKVYLLEKIRKQLTKPSEKILVEALLTEDGPKAAKLYRKQLEEHPDPQ
LDPISSSRLAAYEFAVSTTPGLPVMQARASSESRPALMTIAQPLQPKQPV
SKPDSSLKRTPPPAGPAHAASKGDTVSTRLAPPPAQASGGGFTLQFGSFD
SITNAEQMVAQLQYTAPARVQQINGVYKVRLRRTFTTQQEAAAFARTLPI
ESIVVPPQP
>CT1160 type III restriction system endonuclease, putative
MTTQAPFSLRGRNPDVLTCIANLSNDEVFTPPELANRMLDTLTEAWAANH
NGANLWADKTVTFLDPFTKSGIFLREITRRLVEGLTEEIPDLQERVNHIL
TRQVFGIGITRLTAMLARRSVYCSKHANGPHSVCKTFTTESGNIWFKRVE
HTWKDGRCIYCGASQSTLDRGEERETHAYAFIHADDIRTRINEIFGGDMQ
FDVIIGNPPYQLDDGGFGRSASPIYQNFVEQAKKLEPRYLVMIIPSRWMG
GGKGLREFRATMLKDKRIRKLVDYENAQDAFPGVDLAGGVCYFLWDRDCP
GLCEVTNISGGESVTTVRQLDEFSTFIRNSAAISIIRKVMATNEPRMSEQ
VSNSKPFGLRTFVRPEKKGDLILRWEKGEGPYPREKVTAGHDMIDKWKVI
TSYVGYDHAGNPGKDGRRRVFSKIDILPPGTICTETYLVVGSYGSKTEAE
NLVAYMKTRFFRFLVAQFMYSHHLTKSAYELVPILDMNETWTDAKLAARY
GLTDDEVQIIESKIRPFDNGNGAG
>CT0143 hypothetical protein
MYNFFRILQNLFSDFFHLLGGHHFSQGKTLASTSIAQFPTISKVK
>CT1999 hypothetical protein
MLCFDMSQDTAIPSHNRLGVLNVSLKGVAIVRVMMLAQRRISV
>CT1284 hypothetical protein
MIRKTVKTVGTAFVFSPFGMPLLVHGAAGLLVGAVGLNLLNGVINDVKSA
GDILQKEMSKPSDRQQDEEPE
>CT1157 hypothetical protein
MNRIKVSGLFLIFLVITATLQLSGCATVPSTPTDPRMLGYDERVEVTVQA
LAAPDAPSNDKSFFIVPGMQNLSENDLEFMEVSRYITNALSKKGYIRANS
VKSAAILIRLSYGIGDPQTSSRTVEISPGYSYPVGWMWFTQPPQTQTVKE
TTYQRNLILEAYDLKDPNRKSQLWKTIVKSEGSYSDLNRILAYMIAASSE
YFGTNTGRQIDLTIYGHDPRLLDIWK
>CT2020 photosystem P840 reaction center, large subunit
MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFL
FQRTETRSTKWYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQV
FPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFH
FLVGSVLIFGGWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAK
SYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINSETFMSFVFA
VIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVA
FQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFP
AYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQL
NGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICE
LNIFGTNITMSFYWLKPLPIFQWMFADPSINDWVMAHVITAGSLFSLIAL
VRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLW
GIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYF
WTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIR
WLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQT
NSPLPPPVSHAAVSGQQMLAQLVDTLMKMIA
>CT1468 hypothetical protein
MPEWQNLFTCRKREAVGNVEADKPLIKLFSP
>CT0542 hypothetical protein
MAALRKLLRLRLLLLPIRLLRLLLLPIRLLRPIRLLRPLLRPIRLLRLLL
RPIRLLRLPTKLLRLLLPKRKNKFLNRDLSTKETETTLSLFLYHVLWNYV
KYWPKALRTLNITGPDVKGGTG
>CT0984 hypothetical protein
MKKTAKLIALAAVLFAGFGSTSAKADEGFKIGADVVSSYVWRGAEIGDSP
AIQPNLSYTFKNGLNVGLWGSYAIEKNTPRINNSDYRYKEVDLTVSMPVG
PVTFAVTDYYVPVEGGETNTFDFGKDSANTVEVSGTYTYKNASLMAGVFV
GGNDYDNAWYCEANYKFYDKNGYTAKATAGLGNEGYYGDGEGKKLALVNT
GISISKDRYTASAIYNPDTEKSYLVFMASF
>CT1519 conserved hypothetical protein
MRTRFWIFSMIALLTLAGCSNYRVVSDYDRTIPFERYKTYRWSDKGSAGI
SDDILANNPLIYKNIKSVVDRELATKGFVLKASGPVDFTVFPHARVRERV
VIEPSGFFGYGCGYCPGWGWRSYPPYWYDPYPYPVFSHYEEGTLIIDIID
SRSGEVAWAGIARGILKDYDSSVQMNRDLDEVLTKIMAQFPPMVK
>CT0510 hypothetical protein
MKRLSRIILIQSRITLIQISKTLMTSYLVFEKPLKYAISSAMVHSLHQIK
IKTEHQFHFS
>CT0528 hypothetical protein
MTSDDRNKRNPETLGAIGKVTLEKLASLSSKIRDDVATEADRVNHEILSE
IVALASNRVSVEKNESILKEGGSAWTERTESHYPHIRLHGGALEDDPLKM
MIR
>CT1280 hypothetical protein
MEREPIQGNVLKTAATVAGAGALLSPAGLPILQGIAGIAVVGLGIFAAGT
AAMKVGEMISSGFGQSKPQQEEEDSPFL
>CT0794 hypothetical protein
MYPSERVVTPRSFRGNPLRNPVLIHLQIHDYQVKRK
>CT1400 hypothetical protein
MIGHSEVIRCFGMSGKVYFFNDSGNIQKGVHDLPPKGLACFVKIKNMMMQ
AKPAEVSMVDALVDPWLGAIGRDFEGYRNHCRRVFIFACTLAGAEGESRE
KIAIAAAFHDLGIWTDNTFDYLEPSKRLASAYLASTSKAEWTDEIKAMIE
QHHKVTPWSCKPGWLVEPFRKADWIDVTLGARNFELNRSYIREIQRRYPN
AGFHATLARLSFERMKTHPKDPLPMMRW
>CT0505 hypothetical protein
MNLNEIKTSLCPNGRNIQVGDPDGVSGITHASHDDAPCRRFTSVAEREIA
VTRACRLPNSFSRVSASELSDDDGGSF
>CT1600 hypothetical protein
MQFKTRHPKHSMHLDTFPGLFKLERKVLYLPVLFYLSAHIAA
>CT0750 hypothetical protein
MVKREKVNFLSAYGTYTTRSIYESVRCKYIRYKQKLLIVFYNLEITAAGN
NFFLSRSRDSANNFLFLIS
>CT2149 hypothetical protein
MSAPLAFQGAWWSKMMVEGKFGLVYPYCSVFSFFCFFAIFLRKDLEERGW
CVTLGAFT
>CT1186 hypothetical protein
MKENGNKGRKKIAHSKFIIHNEKKPVVQTGFFFHDMHPAVSR
>CT1172 hypothetical protein
MCLIMFIFFIVFVFICRLLLLFCVVYWFFYCFYFNVSVLILCFLFFLNLF
VFLQALIFL
>CT1711 hypothetical protein
MRDHTPNFKLLELSDASKALVRETVTQLLEKLAGDGQLTPDARLEFWVEI
PGVKHPRGTFRGGCLMPDSYLCLSDWFATGSSAIEPAAEYASSENPLDAA
WADFLGELYYQIEIFTSVASANQGITVELWAGTRGRPECEWMYAVDKKIE
LP
>CT0122 conserved hypothetical protein
MLEMKATGKSKGQWMFYRNFDDTVDYLSDHTRILSYNPFCHKVEPLDRDE
AYRWHFRVTDPQNNPFDVIFNIQQETEILVDLPDEVASMDPEEMSDEMIR
QFTVGRKITWRPLAQDKTFTMPEKYLFEGQVTADMLIVPVQQEQTRVDFD
LWVNVAFLLYPAFRIVPEKVVRTMVSTGMSLIMQTATNHMFQKISKEFGK
IRKL
>CT2122 restriction endonuclease-related protein
MLAVKTTCKDRWRQVLNEANRIGKKHLLTVQQGISLNQFREMRAHDVQLV
VPADIIKLYHKDIRSEIMTLEGFLGEVKTLVEKPRKRS
>CT0677 hypothetical protein
MSAHEKDLTAFINMSTRPKTTRAKNDLLFTWALPGPAGQEYMPLIKRKAV
FV
>CT1563 conserved hypothetical protein
MNNKTSIVAGWIIIASILIQFIPLDRVEHPSKPILGIPASVLAQLEAHCF
DCHSSRTRWPQSAYIAPLSWYVTAKVRQARKAIDFSNFDALPDDGRRNIK
RATSSLARSKGLSAHGEIPGFPKIKMTERERQALTEWAADNNRK
>CT1866 hypothetical protein
MAMKKDILERYDRLDDGRVVIDVYASKVEELYEDFDKQAPFHRKDLDEEL
AAYLFDCVREIGRVDFIIRITLDAVPSAELQERIRTSLKKFFIYQRGLES
ASMQQLLRKSLLFFLSGMALLFFSLWFGGSMIPEVRQLVYERVLVEGVTI
ASWVSIWESLSILMFNWWPARLRIRLNSRIADAEVQFQSHPGIRR
>CT1320.1 hypothetical protein
MHERKAAIGDIMKKHLLLATLASGLLFFSPSGQALADVDLHVNVGGPGFV
VDYNPEFFYVPDLGYSISYGGPYDIIMYGGYYYLYHNGYWYRSHHYRHGP
WVIVDYRRLPYRIRRYRWDDIRRYREVYYRRIHPDRFREHRDRDWRDRWD
DRRDRRDDRWDRHDDRHDDRRDGERRF
>CT0164 hypothetical protein
MSWGIENSKKRKDILDLMDIGVRKTMANPNYVNEVPKSTNGCWMDQIYGL
MRCDICDLSSQCPVREEEEWQAWLKEHNIVIEKKKAE
>CT0522 hypothetical protein
MIERFLYFVTNTLHRHPATGKKIHNSYRKSPHFSSQQNGNAIFNTL
>CT2231 hypothetical protein
MLWASRKTECAGRPVSLASLNRLLRFSLKPVQNRSCSFSRISRRWRFRPG
REMTIMRITVR
>CT0407 hypothetical protein
MNHTKEATKNRNGQPKAKLRPELAEAQPEARLKTYRGWRPDEQVSRRQMG
EARFSHESLPETINP
>CT1515 hypothetical protein
MLLTPSEKITREYNRLWIVSIARILLALAIGFTLYEATIHHPILPPRMDY
GDKVLHAAAFFALTMLTEISFPGLKSLLPKLLFLLGFGIFIEWIQSFLPW
RSSDVSDFLADCAGIALCFVPVLLTRLTLRLSDH
>CT0983 conserved hypothetical protein
MIHIGWQESIFVTGFLLFNPRHYPIQTIMSIEIRRVNTSRERKQFIKFAW
KVYRKDPELNRNWVPPVISDYMKTLDTERYPLYEHADLAMFTAWKDGVMV
GTIAAIHNHRHNEVHQDKVGFWGFFECVNDQKVADALFEAAAMWLKSKGL
DTMRGPVSPSMNDQCGMLTKGYDSPPVFLMLYNPPYYNDLCLNSGHKIGQ
ELLAWYIDQKMIDIGRLSRIAQHVLKREGLTVRDMDMKKYDSEVEKIREI
YNKAWEKNWGFVPMTDKEFEFMAKSLKPLADPHFIYFVEDKNGKAIGFSL
TLPDINQALKHVNGNPFTPWGLVKYLWYKRNISMFRTITMGVLPEYRNKG
IDSIMNARISEYGGKYGLFASEMSWVLKSNEAMSKLAKVIGGVPYKEYVI
YEKAI
>CT2110 hypothetical protein
MVSNLLKSLKKEVLEYAVLPFLFYTPAIRPYTVISP
>CT0599 hypothetical protein
MFKMLMTITTIKELIPVLQTAIGPVILISGIGLLLLTMTNRLSRVIDRSR
ELLDEADKLFGVDRARIDREIDVLWRRARYVRSAIMLAVASCLGAATLII
LLFLTSLLQIDVPLLASIVFIVSMVSLIGSLIFFLFDVNLTLSALHIEFE
GHRKKS
>CT1996 hypothetical protein
MVSGKPSASGFAFLPACPIPGKGDFLSSRLLFRPNKTKIILIY
>CT0710 hypothetical protein
MTIMYEKGGGMAGSALQTWEKVLEYASVPLHGTMSRKIRKGVKLQINEGT
VYENAVLFISDLFLRVTEDSADTSVNTYYSIDSIASIRTYSTKE
>CT1044 hypothetical protein
MLTEEVILIAEKQCGSRIFAFCRNGDKSGYRWYITIMFGKVMLCMNNLAA
GMKVWISIRSSLAVSN
>CT0731 hypothetical protein
MIQPGFMDAANEHNDHQLLAGLEKNVGRLVEQLSECRKENELLKSEVLSL
QNILRSFKLPGTEGPEPKVSGTSGEGFSYADKLQIKQKLVMILQKIEREL
RGEKAGF
>CT0504 hypothetical protein
MATAIYVFFDNDYITDRAGIQKLFEQRPLVEVIVQSIPYVWLFALFLFIV
AAFYGFRHTRKGYRYPMFRVIGGSLLVSFLLCGLLNVFDIGKYVHRYLID
NVEGYGSLVYTNDVLWAQQEKGLLGGKVVRYTPGDSTLVIRDYRHHFWTV
DLSRARARPGTKIVTGKYLKITGLKTGQSTFKALTIRPWVKKSHHRHPKA
PKPTPVKKSSASGKPASPLSPAQQLK
>CT0395 hypothetical protein
MVMKQYVMATALLCYAAPAYAAYPLTTDDTGTQGAGGWQIELHTEFSTSS
RTDGGVRIKDREDDATTVISYGVAKRMDVIVTLPYQWYQHRQGQLVTDDE
SGIGDMTVELKWRFLENEKSGLSLAVKPGISLPTGDADRGLGTGRVTGGA
VLIATKEFGALTLHANAGYHRNAYALDADDAACNKDIWNASLAGEYAFSE
KLRAVADIGLETATEKGSRTHPAFLIGGLTYSITKDFDFDFGIKGGLNDA
EPDTAVLLGLAARFN
>CT0681 hypothetical protein
MCRLISTPRKMARRVQFRISQEVSIMAKDYLSIASEIKELEDLLAAIPED
NVIERLSLESRLESARAALTVLPQQIAPKARLTFRGRPVFGSHGIAADFG
SKAAGAFSDAFAAVCAGLSEGLRYMGPIPNRDENQLLITGTAIGSFGFEF
ELPAPDPSLFPPETEKTQEAMVKIEELFRLSAEGTDDEIAEVIEEVHPRA
IKKVYEFLELLVQQEAYCGLEFADRFFRFADYKQIKASCERLKSDNIQER
EETYRGEFQGVLPTARTFEFQVMDQKGPIKGKIDLTIADPDVLNREWLHK
PVTVKFNVMQVGQGRPRFTLMTLDDLRP
>CT0589 hypothetical protein
MVKDSNHFMNSTNMNFVIRLKDVVDAIDQPDEERRAFLNIRTGRIVTFSR
DALDAVELGSAVRAREEALVREAGEALLSGDYRELPDQFDINDCSIMRRF
CQTVENDELRRGLLRSIQGRGASMRIRSTVDAFGVVEAWSAFRNEALQAI
AIDWLGNLGIAYSGE
>CT2217 hypothetical protein
MDNKIKPAKSCNATPAYNVRFSSLKTRIASERKKPKNQLVGR
>CT0580 hypothetical protein
MCRISIRRKNIVSMYQKIFKVFLTAIGLVKMRLGVLFSIFLQYSQN
>CT0860 conserved hypothetical protein
MNQGKPEKLYADSVQLAYAKTLEFVSHAIIILMAIGFILYVFRLLPLTVP
VETVAANWHLNATKLQVKIHHHCGWSCFEDVHTFMHGDAVSYASVVFLSL
ATMICLATSTMAFFREKNRIYLVITILQILVLLVAASGKLTSGH
>CT0465 hypothetical protein
MPAIFFLTEELNDMKYRSGKGTGMISDARLRRVCRLTW
>CT1443 hypothetical protein
MKARSQELVEKSVSAMVSAIEVYNKPDFKYREETFSILAINAWELLLKAK
WLKDNGNKVRMLYVTEKKLRPNGKPYKHAKVKMTGAGNPLTHSLDYLAKR
MAEKKTLADAAHKNIIALCEIRDSSVHFYNKSGVFAVRLQEVGSASVRNY
AKAAQSWFDIDFSQYNFYLMPLAFVNPQQPGDAIILSKEEKNVATFISSL
EAAGDPEADYAVSVNVELKFLKSKADDAIKVQVTNDPSAPKVQLTEEHLK
DKYPLNYAALTKACSARYFDFKQNQKYHDLRKPLKSDKRYCHVKKLDPDN
PKSAKQAWFSNAVFNVLDKHYTVKG
>CT0253 hypothetical protein
MPSQESWNVTLVVSDNGRTKTIVSARHAAEYRQGEKQEIRLDGGINVQLI
NRDGSVTLITAGRGIVHDNQDIEAFDNVVIRSADNTVIRTEHISRSSSNR
MIRSDKYVTISGPSRTIRGYGFESDDAMKRYRIFHASGEALSK
>CT0912 hypothetical protein
MTGLSQSQASPMQIQPGNAAFNPWTDAALDTIRDVNQALTLYAEMRVVPA
HHDAFLAAIDTVSAKLRVLPGFLSLALKQMSGDSTMVKNYPETYKGVLAT
AYLDGVAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAMAPR
GGDGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPVELPERETVTV
ENHVMVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYR
KALSTEILRNAHADGGLRAYIMHGVWESVWDHENSHLDPRFLAAAGPVGA
AAVVGPVEPFYLTRRLVVAD
>CT1079 hypothetical protein
MIGAVPFNNVPFSSLSVHSGHVFHLAIAGIIVRIGYIREVFFNFSSDTAA
>CT0787 hypothetical protein
MTRQRRASAPIQYRPHNRNKMDGFWSQRHEIRHAKKPNRPKESTSVRAIC
NSGGCERATSTR
>CT2079 hypothetical protein
MLEKMANNNVEYRSTREPLDAIENNDEQDAIQKSIERRAISTEALWRALG
RRITKKWKDEGKACRSGVVIQSDDKPLQASFSEQIASKYNLNANYC
>CT2147 conserved hypothetical protein
MTNAFKGIPEELLKPHASPCVSIFMPTSRTFPDNTQDPVRFKNLVSRAEA
DGIAFSTKREMAPLIERLRLLQDDASFWNHTLDGLAVFISPDYFRIFRLQ
QSVLEQAHVTDAFYIRPLIRIYQIVERFQVLALTRSEVKLYEGTRDHLDE
IELAPEVPKTMTDALGTEITPPHMTIASYGGTGTAMRHGHSSRKDEEALD
NERFFRAVDQGINEYHSSSSGLPLVLVALPEHQGLFRSISRNQRLVAEGI
EIDPAALGLEAMRQKAWQVMEPYRERKIDQMIARFREAEGGKLGSDNPYA
IAIAAVAGNVSHLLLDGQKFWPGQIDPVSGDILLDEASQASGRDVFEELG
AAVLARGGEVLVLPSERMPSASGVAAIFRHD
>CT1623 hypothetical protein
MLMADRFGWTTPGKNHPVCPSMKSFAPIDHHPALAP
>CT1764 hypothetical protein
MYSRIQTLDKNRPGNEKIRFTAAGYAMQLSMHHLNNA
>CT1134 CRISPR-associated protein, CT1134 family
MEHWNKTFCLEVKGDYACFTRPEMKVERVSYDVITPSAARGIFEAIFWKP
AIRWRIRKIEVLNPIKWISVRRNEVGQTASERSDGIFIETARQQRAGLFL
RDVAYRLHAELEFVPPSERPDAKRPVPESLQDGRETSELRKDENPGKYYA
IFERRARKGQCFNQPYLGCREFSCEFRLVDDLANEPPPISETRNLGFMLY
DLDFQKNLKEPPPAFFPACLEKGVIKVPDWESEEVRK
>CT0248 hypothetical protein
MSVKPVDLNKLRSTHENLYETVVAISKKAREIQEEERAELEERLLPYKEM
IRNPASESESEKVFPEQIAISVAFECREKPTQQALAQYLDHQYDYVLEKS
PETKVAQNEDEDESDRD
>CT1820 hypothetical protein
MPALFFPPRQSKSISPFHPDKQTSLPVARCGGILYHYERVDFTKIAHMTD
YHYPSQALPFVR
>CT1877 hypothetical protein
MEHLNQLPVPALHRFERVPEGTTLGSYRDTETPLLVIDNEERDVTSRSLD
FSGQRIITKLPVVPSMFIRNVRVVMQDCLGYIMEPYKVKEKV
>CT0078 hypothetical protein
MTMFTRLILWLLPLLLVLPAASAAETAGGKLFIESTLDNATPWVGQEVKL
TYTLFFSGTAPQIEDKSQPEHSGIWVRELAPENYINSAPVSKNGELFRKA
VIKQLRLVPLQSGKLPVTGYRLRCLVPQQGEASLDSRNDTETIVTAPTAI
IQARALPKPAPADFSGAVGHFTLSVSPENSTVHAGEPLSLSVGISGKGNL
DTLPPLKVLLPEGIRQEVSVASPDTASAKGSTSSVNTKMLLIASKEGTFR
FVPLKLTVFDPETGRYETIASNAIVIKVIAGRTAMMPPQSPLPGVMPPPA
DPDPLGAVIRPIIMSMGLAVLVLIFGLHLRYIKRYKRTGVQQKTSEAAEP
IRPTAAPAPATQTSTGGKSPQSLRNELYGAVKKTGIMNPAGLTTKELGKL
LKEKGVKAQTISALTELLSGIDHALYSPGQISPEKLETMNRDASRIIADL
TRS
>CT0223 hypothetical protein
MFALLRWLFNGNQATKDMAINQRKRGGLPLRFAAFHELTEEEDGHKTRSY
NRE
>CT0693 hypothetical protein
MIFRYFIRRLNGYDWLVRKIKISHDALLMFDALTMSFRKMTVR
>CT2046 hypothetical protein
MKLIALIGDEETRPLIRKMFTAHQVTLFSSIAIRGCSCETGGEPVAWWPA
GKDIPTVYSSLCFAILEDEKAEKIMKDIEANPLAADPAFPAKAFMMNVEK
MI
>CT0229 transposase, degenerate
MRPKRHESLCETSEATWRHLNFFQHKACLTARVPQISSPECGLLKLQSVS
LCSWPGQWRSRRSPR
>CT0186 hypothetical protein
MTHNDHPGEGTPKTVVCPMCGEPFTCGMSTSCWCATRVVPDSVRNYLAER
YETCVCSTCLDRLIAEAKEELRGA
>CT1944 hypothetical protein
MPTAGAATGYWFYGEIYIYLNLIKKSAMLKQNQATRYALLYSLVR
>CT0894 hypothetical protein
MTPVSANAPEELREAVDLRGCLHYLCISAMISQ
>CT0503 hypothetical protein
MAVAIDTAPVMTEIRAKAQKTERFRMKCQRGIGTSFLFSISSMILSELIF
GMTVQN
>CT1061 conserved hypothetical protein
MRHRITTLRCLFMAIVAIYGSLALFPAKASCQNDKKLEVPVRTQQSDNDH
RGITVQTSDLDDGVTGVVGKVYIEASPKHVWAAITDYNNHKSFVPKLIDS
GLISDNGREQVMFERGKTGIFLFRKTVYIKLSLQGEYPKRLDFHQIEGDF
KVYEGDWLIERASDGKGSILTFRAKIKPDFFAPAMFVRKVQQNDLPMVLA
AMKKRAESAEGSLRVARTSSLKQSTQPSADSAIAD
>CT1905 hypothetical protein
MGREWQGNKKAAVFDSSFWLVFRSVLSTWGGQPGALHEALLRSP
>CT0944 hypothetical protein
MKKYPFRLGTSSYIIPDDILPNVRYLADKVEDIELALFESDEFSNLPSPE
VIAELVALAGEHGLTYSVHLPLDVYLGSPFRDERERSVGKCRRIIDLTEA
LPKSAFVMHFEAGKGVDINAFSDEERQIFVESLGDSARMLLEGCGEPVSM
FCAENLNYPFEIVWPVVEQFGFSVALDVGHLEYYGFPTADYLDRYLSRAK
VLHMHGTTGGRDHNSLACMRPEALDLVVEALRKVEGEPKVFTLEIFSEAD
FLSSVETLERFSS
>CT0888 hypothetical protein
MTVAIELLMLWQQRRLKVFGIGRVHIVRTGAGCPSF
>CT0577 hypothetical protein
MAPGTELGDPFNDRNLVIPRSFFVSSPKLFPQYGGVVYIALSNLTVTIRN
GILCVLTSGKSWNKKDLSINNQLSILETPHENRPPVHFPW
>CT0712 hypothetical protein
MTRRINPNRRSVTINGFYVTSSAPNETVNWTVSTGGGGTTAGPVPEPATV
MLLGIGGLLAGGRKLYESRKEEVAF
>CT1582 hypothetical protein
MEKENKKEAARKKSLGELGELFAIKALVDKKFDRIRNLNDKLMNETFADI
ECEKEGKNYIISVKARNKYQKNGKVNTRYNLGSDVYTKAVMAEKKYDAIA
HWIAIQFDKNSFSIYFGSLEELQGSKAIPVDKCEKGIIGEIWEHDKRHFF
DFDYYTNQKK
>CT0844 hypothetical protein
MPSSVLFLFLFRNLTIVFIEEHLGSAFRESVTHIDIKGGA
>CT0537 hypothetical protein
MRHELDNPHAHATNNPCIRCLRHADRYARAVVSMLEVFASDKAEAKHNLF
LKEGLCDIDKTLRSNDKRDTLRERPRNQTSSAMTLSPEKRARVMQGEILI
DLNWLPDGVIGAKGSVFVEAEPPVVWRMLTDYDHLHETMPKVISSRLLET
NNQTRIIAQSGKSGIFIFEKTVNFTLKVEEVFPEHLWFSQIGGDFQVYEG
EWQLEAVEGKNGHATLLSYQAEIKPDFFAPQFVVSFVQSQDLPTILRAIR
SYCEARAKG
>CT0635 hypothetical protein
MQRRVELKLEQRRFYVWLGIAAAAHVAVVAAVVVLQLLYVRMHPPMKIVN
VSLVQMPGLPGPAGGPKSPETPPAAIEKQAETPELASAKKVAEPPPQPVK
KIAKPVAAVKKIPEKPPVKAPVKAPEPASKTQSAADERKKIAEPVAAVKK
IPEKPPLKAPVKAPEPAPKTQSAADERKNLQEALERLKSKSASQKAETGK
SAAPSNLSSTLANLQKKVASGGGGPARSGSGSGAGGGRYGTGGGGAFDSY
KARIADIIQNNWSFSSQMVRSTSGMEVYVSLLILPDGSVNEIRYDRKSSS
EYLNNSVKNALAKSMPFPSLPREYGAKGIWVGFVFTPEGVGR
>CT0379 hypothetical protein
MLSINTQRTTHHNLTQEHSTQKLIISFNSSLTIIETMMLKLPCALFYFFS
KNNFALSNPAAPHS
>CT1694 hypothetical protein
MHSAPDRKTSAIGNHNAMKTIYQQVASIYLNI
>CT0365 hypothetical protein
MEKERQRLVEEEKREESEQQRCRKQQVLFLISFIYHGVFVCFDDVYLAFS
KPFGTIFGGSFSSIMMIFPTVSPYRSYGCYRRK
>CT1594 hypothetical protein
MSRSSGNTPHYRHRSLSCAISNCSTVTPTSSTLMSTSPTIAALLDESIQL
ELNLAKLYTLFNDHFEEDEEFWWQLSMEERSHAALLQQEKKQPQPLQFFP
ENLLAKDLDALKANNARIIAETERFAISPFSREEALNLALHIEMSAGEAH
FQEFMESETGSLTADLLQQLASEDQNHAKRIREYMKEQGVKEKKQA
>CT0688 hypothetical protein
MPGERSRDFSALIVLKAYISGECKYAVILLHRPMRHERAYAGIPVMIETI
TWRSDRG
>CT1377 hypothetical protein
MGDAARQAADRLDLLGLKKLMFQLGSLLFGLFPAADVPEKDKRAMLTGKN
EGNG
>CT1098 hypothetical protein
MFLAEKNIQQSGARPDVSGFGNTDSRHNNNKVLMKAHLSMLLKNLDNNNA
FKLLQALQR
>CT0406 hypothetical protein
MLLVLILWSLARFLCSPFDRWACKSEKDLVFKALNKFFLAMTGAGLKAGE
>CT1445 hypothetical protein
MGKRQIIYTASQIGGARELLDKEINLVTKERRVWHGYVTAIDQDKIELRD
SRFWKHTFKVADIDKIYGEVVTDY
>CT1026 hypothetical protein
MERLGNNVSKRYQISLEVADGCRVHAVVIPVGEDNITNVTKDGGGVSYAV
WSAFSNI
>CT1290 hypothetical protein
MLERYLPQPLTTLVLCDRQLHPGFGRVVDAVRGMEEDEKNLCNAN
>CT0620 transposase, internal deletion
MTRKKDKTPDIQGELIGQLGYPKHLPIVPNTANSRNEHGTSVRDMQAMLL
WSSTRSKSPKRSSAA
>CT0182 conserved hypothetical protein
MNKYFYDGTPEGLVSAIGAILESGDDPEQTVLSIRQDTLFEEGLFLRTDS
AVAEALFQRLRERAPDAVQTLWYFTMTEVDGLATSLLRYIALAFEHGDQV
NGYLTHPDVKAVVATARKVGRELHRMKGLLRFEQLRDGTWLARMEPDHNV
IQPLARHFSRRLRTQEWFIYDARRHSAAHWDGHALSFGTLERFSRPELSP
EERVMQQLWQTFFKTIAIPERKNPRLQQSNMPAKYWKYLTEKQGE
>CT1434 hypothetical protein
MASKDLDKVEEELKAAPNREIPPINASYIEHQPALFSHSWPLLAHPDKGM
IFFSGDSANTMNVFDQFMVSRGLYYGTSGLKAQPGSMRIFTTAEMAPGAK
GRPKAFDKETKKGFSDHFPVEMVVDIV
>CT1599 hypothetical protein
MNDGRLQRFFWICAGTPVEIIEKYPTEHAKYFGIGATIFFTALFAALSGG
YALYFVFAGAPFDWFASILFGIF
>CT1081 hypothetical protein
MTVFMMRYTDKTHQQLLNHAAPAIKHSMMSPGACRLRIISSVQ
>CT1389 hypothetical protein
MEQEIMRTWVWSTSSPVPIILLNVFLWAFYFVPSLLAWTRKHRSLPAIIA
LNILLGWTGLGWIGAFVWSLSWPGHDNSQPPAAPTQTASEPDQEG
>CT0101 hypothetical protein
MGSAALRGYIPLLQYPFFVHITRFQQPLSLFVSKAVDGPNKLKGQLITMC
FRGGYETGTYGIDLLFSTFRPALPSVTNDC
>CT0171 hypothetical protein
MKISTLYTLFAILATAANIEVQDISIRYYSGQYAIAISVGLGTLAGLLLK
YMLDKRYIFRFKAENPIHDTRTFLLYSMMGAVTTLIFWGFEFGFNHLYHT
KESRYLGAVIGLAIGYASKYQLDKRFVFKQEGAS
>CT2053 hypothetical protein
MACFHDNEIKLIYFKQLSKRGDVKKSIDNRFCFQDYCRQSGDVWDASAME
LQAGNTGVFAFDRFQERWPWSVALAGVI
>CT0485 hypothetical protein
MKEYKVLTQKDRFFGGTFDPEKLEKAINSYATEGWVVVSVATASIPSLTG
AREEMIVVMEREK
>CT1319 hypothetical protein
MPRLSSRGQSINKKPGFSPISFRPHPYPLVISTITGDTAIHAIHFRSRIR
>CT1991 hypothetical protein
MDCDYSFRLPVLMNTGRKNDDDTPHEHPFFRSSSV
>CT1478 hypothetical protein
MFLKKFDLILYGFIDSIPKVRTDSAVFCSSVDFTYYYSSKSIPVLRKKKA
GESSQMFNQSQNHP
>CT0715 hypothetical protein
MTMSSGCFSPVSLEFCQGFVQELSGKRCRTGAGTVNWLIMYDFHRIINGS
VKIFEVHLRLFSWYLTLKQT
>CT0146 hypothetical protein
MTKYLTELWDLIRLNPKKFVIRALLVLAAIGFIFGDFGLVTRISMELENR
KLEKLLAEEQEKIVELRSTIKNAYQPDSVEKVARERFNFHKKGETVFIIR
EK
>CT1459 hypothetical protein
MMKHNTIKIIAASAVFSISTGIFAPSFCLSNAATGATNIQGNQNIVAGGN
VLIYHGLSAKQ
>CT0761 hypothetical protein
MRFQPKPGSLPKQICLVSDGDFETVNLRINACKS
>CT0627 hypothetical protein
MSEQMVVWWDSESEPERWRVEVQRKNSLRQYESV
>CT0119 hypothetical protein
MLILYSQVESHFVIQLLRLAMLVPGVSNSKQE
>CT0642 hypothetical protein
MHRKERALELFSNRCNCSQAVFAAFRQTKVLDEASALRLATMFGGGVAGS
GGGMCGAVTGALMVLSMRYGMGGVEELVNRKKTYELGRQFIEEFEKRMGS
ARCESILGLCIGEPENLQKARELKLFETVCVSAVATASDILEEMLCAEG
>CT0353 hypothetical protein
MNLIAYLNLLRPSKQYPLDDTVGGALEAITMLLP
>CT2100 hypothetical protein
MMEGIFCTVLDLKNELLNIKKVGSSQNLVGGMKDAHMTIPDEQV
>CT0998 hypothetical protein
MVIVKLICSGFPYQRLSTMAHNDFIVQKTTIKKANTATALRLYDGF
>CT0515 hypothetical protein
MPSSSFRRDALPSVMTVMDRRMVLFPIFCIIRTRTGIGRKSNRPSPPGAP
DNRLFAQWLSPRPELAEKKQALNRCTFYK
>CT1966 hypothetical protein
MQKVYKHVLSGLLLLMLCILAAGSFGVGAAGNSDDGATTTNYKIGETAHV
GYMSYAVWKAFYRNQLSDNPYINQPPDAAYLFVDITVRNDDKEARTIAPF
KLIDENGAEYETSSNAWSVDGSIGILDSLNPGVEKRGYIVFDVPRGKHYK
LEVSGGYWSSDKALVDLGLK
>CT2252 hypothetical protein
MIFIKMSLLNDLLEKTLSEIWEIMPWNLVDQMAENSELIVISMFEVRRCS
KNEFCENEQR
>CT2198 hypothetical protein
MLGQFARFWTFTPASNGKALARAFGFYDNALRGGEVGPGNGFAVRCVKD
>CT1439 hypothetical protein
MTGIAVRIGETEIVTGTTGGIMTIEEIPATGMTRTAIRGNPNTDPGQA
>CT0489 hypothetical protein
MGNSELQPVGKSAGGQAWLLLALDELLAIVRECINPKVSRSGLDRCLRRH
GRAL
>CT1804 hypothetical protein
MSSIRITQPNGEHTMKKMLSLAAMFAVLAYASPASAELKLSGDASVRLRD
VSYFGDADQFSFTGSADDDVVYQYRVRLNAAADLGNGYFFKALVMNEDRN
YAGGWQSVRHGNTETISLDISNFYFGRMLENSHWMVGRLPLNSFDNPIFD
LTLYPAQPLANPVYNINFDRVFGGNYGVKLGNGMLNATLCVLDNDSHNNT
SADGDGLFNDGYALHLDYKVNVGNVTLEPQFLSVLTNSDIWYQDITGRVT
TLAYKVTPYTFGALVGVPAGNAKLSFGGFYTTCDDTTPNGGPHVKYDGYL
LRVKGEIGNFMAWYDYNHTTVKPGGNDIKLNNHFVWAQYKIPVYSSAMGS
VTLQPTLRYLASKRDDGFNNYSGERLRSELWATVTF
>CT0549 hypothetical protein
MLCLKSTKIGGELKINIKRVQSCYHLIFPVH
>CT1687 hypothetical protein
MNLQAHGFQLREVQFFYCRGFGWLLSKDTSGIPAVFVTGRCL
>CT0654 hypothetical protein
MALTELLNLFIRDGHVSILHVTHEILRRTENKNEFLFKTT
>CT1281 hypothetical protein
MTASKESSAQRRHVKSTLVDDEKPTIEVMAADLTMPETSK
>CT0932 hypothetical protein
MGMSKRVEDLMHLREITCKIVQGVFQGKRFSWGGDWRRVGGAGLCIALLF
EKFFRNQLTFWFFAEFRE
>CT0061 hypothetical protein
MSARCKSKLAALLMAAGLGTAIPPASVSATLRPADVRVSAEPDSLFAGER
LRYVITVQHDHRDSISVVSLKAGQGTPFEITGTKSFSKNLPDGRAEFRMD
TELAVFGSGRKPLPGFTVVSKHASAAEPERLVITPSESVTVLSMTDSTVT
ELRPIAPPVSAPFPTWLLVPVLLSLAALGLAGYFVKLLITALRRHLADPG
RAARNRLRAIHRQLSKGLQPAAGYESLSNILREFLQKRYQFGALEMVTQE
IADELAARRISIRQELIKLLDEADLVKFADRRPDIEECRRSLRIAEVLVA
TAAEAETTEKEPLMEQSE
>CT0723 hypothetical protein
MDTVPKALLRELSPELAERFPVSRLSLRDIHLHEKTASANRRTPAVLVVR
LTKNRKGNYVTFKVLLYLGISFRRPSGQGGRSDFRRRAG
>CT0366 hypothetical protein
MPSSPKKQKRKAEAAAKQPPAAAKPAPATMPLGAMNYLFIALGATVLALS
YAVMYIEKSVDGFFALDIAPFTLVGAYAWILFAIFYRSKKKKN
>CT0827 hypothetical protein
MGELIMKKTIITSLVAIAAFGFAGTAHADSFATYSSLNTLSAGTDDPNGV
SYGYSYDWGGVSSSCCNGSSAVGSYTEIDAFGVGDGSLSAVSGFQGKAEQ
GANSSFATAAGIAASSNSGLTNYGYPVVDVSVDGGAYAQSSSYASYSWMT
VWGGSTSW
>CT0623 hypothetical protein
MKNDRSIDSTILTLAGAAKHVARHLANVMNRIPEL
>CT1122 hypothetical protein
MGKRYEIGADFFREEILAAMLFGFRNVKNPSTVTVHPELMVKIRESFMDK
VTSPKQLGDVEVFFGLTVIEDATKAKDYISVN
>CT0354 hypothetical protein
MLIGEIKSRRKCFRFNDHDAIVISSILIVIFGKTGDIDCSSFASILYF
>CT0586 hypothetical protein
MRLEPEVVLGAFTGLVEKHLEGILCTEKAIAVSKEKR
>CT0572.1 conserved hypothetical protein
MKNGFTVSKLLCSMSRFAVVPIRRVSVLFIFSIILLFADGGNAMSRMQPP
DGVVAGVVNAFGSRDAVRLNRFVHPKQGVVVIYRQGVFNVFKAVSRIDFR
KPVPEYFPYPKIRGGAPLRYAALPVYDCGREAWSKTGLFCDPKHRDVLLS
TMAINLKRSGLKEISQETIDRFRALEAKSVRVVLVDVNGNDLVFYLTRIG
ERWYLTILDRVSSDCSA
>CT1460 hypothetical protein
MESGLSALRQRSLAEQLLRVGIRLHHLIEIV
>CT1093 hypothetical protein
MSLIILFYPGRHEYTTVEKIGTYFVIYQYSSSFPATSYPSES
>CT1314 conserved hypothetical protein
MLSVFVVNLRSNSAPFSFSRLLMHTTIVNERSLRTCNFPITLQDIRTLKE
LYRLKAETRDLRKPIVRNIMKQRVVGKGCLESLKNALYSLETIYIDDYTG
QRLLRIDGMKQIEVDLTYEIRELQKDIYYLEYGEDRFIEYLAKFIPGFTD
YVTEGVEMLRGKSFNAFITDRDGTTNNYCGRYRSSIQPIYNSVFLSRFAK
NCCRYPMIVTSAPLKDFGILNVSINPEHIFVYAGSKGREFIDIDGQFHSF
PIEPGKQELIRLLNERMQLLLLDPSFEKFNFIGSALQMKFGQTTIARQDI
TRSVNEAESAAFLEKIKGIVRDIDPEGKNFRIEDTGLDIEIILTIDVDPK
TGMIRDFDKGDGLEFICRKMNIDHTGEPVLVCGDTSSDIPMLKKAMEMYD
DVWAIFVTRDEKLMQRVREICPKSYMVPYPDILLTILGLLSL
>CT2103 hypothetical protein
MKPMYYLVAAALSIMLSIYVFIFGTWANSQLVAIFIGLWAPTIICLGIFN
ILMNIHDEMCCAHKRIEGRQTGHDRCGGG
>CT1956 hypothetical protein
MHIRKHSVCRNILRTFEIALFFAISVLGYGLLLKASSFNTKSKKRDRESI
VVQNAVDLNGRHRELKVLSGTLLFPNDTKAALPNRYTFTGQSFLAISPLP
ALALHLLCEKELTVNYRNRNSRISRPYNLFEQNPVLLN
>CT1789 hypothetical protein
MSQRVEKGFELFRKLFFYKAFQHSYPNKGDQ
>CT0448 hypothetical protein
MFSSNGYGSAPNIEYLSLNATDDGLERYRGLKQ
>CT1984 hypothetical protein
MASFCRAVSRDGEEGAAVIMERYFFPTSTSTSSAHQRRFCMYSMLPDFC
>CT1803 hypothetical protein
MLDKQRMAILNTQGADSQVDRLSRRMIHKQFTLQ
>CT1613 hypothetical protein
MPSRKTPNLSRPMSGNESMPATPQMQAPKKKGPKVIATLFVIFVIIAAAG
WFWINWEKPRIAPEPLSPELTTIIQNMPGISDAMIYVGLKDIRESKFWNE
VVPDSIKNSPLLSLGKRTDSLMKAGNINLTNDLDTLLVGFQRSGRKQQNY
IGIACGPVARKAQAPFLKSASLQTAEVAGRQAYEIDSTLWVSPMGTNRLA
IASSSNMLEKFFKPSGHLFERDSTTASLIRKTPYKSHVWFALASPQWTAG
ALQSITSQNRDVKSVGNLNRLQQISMSVKFDDNGLKGQSEWVYKDRQAAF
FASTFLWGAIKLSSISGTRTSESTKELLKHLKVSQNLESVIVTADLPETI
FKKSGKKE
>CT0208 hypothetical protein
MDRVKKFSLFVMTAASAVVIVLCIAAALVLNSGMVDLFAKKQLLSMFNNE
YRGRLELKEVKLRFPDEVTLVGPGIFEEGAAKPAARADRLTLKFNFLSLL
RPKITLLSFREVDVDGGHASIAEYPDGQLNIGKIFTRRHPELPEMLAIEK
FRARRLKLRNSTVSWKPANAPAYRLQNLQLDMSRAFVAKYEFMGTIKQMQ
FTMPDRGLTLKKGSGSLAFSSVRSDVLGLDLETAKSHAKLSVSIDGLDIF
SGISKKSLLNKKTFIHIESLGIDTSELNRFIPIPALPSGVYRIKGDAKGT
FSDLEMLPVSIEHDGSSVALQGKILNLLDPESLSFNLQIDKSKISSALLT
KVLTDERYRSLAKEAGDVNFSGMLRGRLDQWMTGIDFKTGLGSGSTKFDT
KRLGGGKYQLDGDFNIEKTEPHRLLGIRGVKSGFSGSGSFNGTGSASGIE
NAHLETSVKSAFWQQQTISSGSVTLDLKGKKADLSSDLKSPDGGSLIMAG
LIDFSSLAPSYSVGGSVKKLDLSKATGLQDYRSDLNGRFDLKGRGFDPAS
LNIKASFVLEPSSFSDFHFRERSAISASIAQSAGSSAVSLESEAVDLAVQ
GSASMSQMIEALQMAAACIARETGSTAAIRLPRGPSPWTFNYKLAVRDLT
PLKPLLPAKEFRFKGSASGKATLSGGRLSMDTALSSTTLSNGPSFQLNNT
AMTGSMQCTAAGVAAARLSGTAGTVNTFGRELKNLRLVSSFDNGRLAASL
DLAIPRFSEKLSAAFTARRSGNAAAVSIDRLAFTTPSGVWQTAPGGTLDV
AKEFIRFNRVRFAKGTQSLQLNGLLSNSVSGTFRGTLSGINLTEAKYFLP
DSAQKPMSGTINADFTVSGAPGSKTSDLDLRGSGVTWDQLNVGAVHLTAR
HAGEQLRFEYNSRGAATPAGTTATVPVNTITGSGSIPLVLRYSPFEARIP
DNRPVSISMRSDDLSASIITYIVPIFDHAEGVIPTDLRVTGRMPKPEIFL
TTRLRGTDVRIAPTQVTYRVNGQIIGTPSRIDFGRLEVKDAENGTGAVSG
MIGLDGLKPVTVNLSGSFSNLLLYNKKDMKDDTSFGTIRGTTGNLRFYGE
LSAPVAEGDLVLNSVNFSLYRKGSNESAKYIGVEKFIKFVPRRPAPKPVE
AAAPPEKLEFHYNLLDILQIKNLRLSNSAPVKGTMIFDRIRGERVEAQMN
NLSLVVNKTGQRFSLFGSIDITGGKYTFSNTSFELENSGRVAWNNEEIRD
GRLIDIYGVKQVTATDVQTGERDNVRLLIAVSGTIEKPDVRMGYYLNDDT
QPYSSANMIGRQSSHVDPNADLNVISMLFSRQWYLNPERQGSRGVSPVSS
VGISAGTGLLSSQVSSIVQNLAGLESFNVNLGAGANGNLSGLEVSYAMLV
PGTGGKVRFVGTNTTPVAGSRTTTNYYNGSSQKIEYRVTPKVYVEAFRSY
GMTTSDATYTNLQKPTENWGVSVSYREKFHTWSQFWDHLFGGKKKGREKD
KDKKE
>CT0405 hypothetical protein
MINMNDSNFPARITPALSLQRQHFYAKRNHNCH
>CT1294 lipoprotein, putative
MLRANRLTPIMLILALILSACSGITDSGLSEKQVEKLVRASRPNNPSPDI
WARAIKESLEELGQPVDKEHVSAVCAVISQVSAFSISPKNSRMASILRKK
IEAAESNEVLRLLIETRLDQTASNGRTFRENIDSIQSELDFEKWYDEFTS
ASVTKPILLVLKKDASDLITTAGSMQVSVKFAEEYPKKPRNAGGGSVRDM
LYTCKGGVFYGTAYLLDYKHNYDDWKYVFADFNAGHYTSRNAGFQKMLGR
LTHRMVDTDGDLLSYENGNATPSVTYVTFINFLKDKGIGFDEKKVMKDFQ
QEKSYDFEETWSYKTLSELYKKKYGHPIYAVLPDIPLNSPKFVSKNLSTK
WFAERVKSRYNHCMRTSI
>CT0788 hypothetical protein
MRLTRKCRTGRRDCNDCNLLLINYIKLKFFLNYNGSIFKLMETIYFSKIV
LVSIQKTISNLPQ
>CT2228 hypothetical protein
MMVCVVFNGRFSIALQALSPESKKKMEDRMMKMRGCSDDAASLDPETAGN
VRLGIIDRIRPADGSARIDFVGA
>CT0165 hypothetical protein
MASSLKNRKFRAENCCGWLFAAHRSSTHKGVAGLPFMCFPMPASYR
>CT1688 hypothetical protein
MAFSCGAQTDGLFTVSGVRILFLGLEKHTAAFSDDILTEAGTDGRIRGID
GKALTLLALLTGLCFLFVHDLHIRNTVYRFNNG
>CT0891 hypothetical protein
MATSAGRMMRRLEWPLFRFLLLIGEPYIVRGQLLLPISDEPHNR
>CT0426 hypothetical protein
MLKMYVDYWVAVLSGFLQQYFGVKSQKGVTMIEYALIASLIAVAVIAVLL
TVGSNLKTVFSYVGSNLTT
>CT1118 hypothetical protein
MVGFDQLSVSMMTLPCDGSCETLRQHYKSHTIAAMKQHRAFFGNATVRRI
EKTFFSHPLLHYTEKEGTRS
>CT1585 hypothetical protein
MSKIKAKRVGFRLDMTPMVDVAFLLLTFFMLTTKFRPPEAVTIDLPSSHS
NMKLPESDVLTVTIAKDNSIYMGVSSQRTRERLFDMVVRPKLENAGVSKA
AVADSLSRFRLDDSFKIEKEELARYIMMSRFADQRLRPVIRADNKADYEA
VNYVIKVFRKMNLLNFNLVTVLEKEVR
>CT1322 hypothetical protein
MHQDRTPGRGKVMARVNGSVWFEMKNPARRKVLRDFFCGKTITVW
>CT0741 hypothetical protein
MRGEKIQVVARLRDRSLPKFFLCFLTLILATKRIKKIYYRMIKVKKHVI
>CT2096 hypothetical protein
MTTRRSIAIAAWCLLPVALWVTSLPDNTVSAENKKTILEHADQIEGGEKA
GPSGTAIPYRSAVGNVKFLHAETTLECDRATDWPDSERIDLEGHIIIKDK
NVETRADRGVYHTDSETGELSGNVRGRVTGDSLTIKSGRAAFDQHKNELW
LFDDAVAWQLGRQLSGDSIRVHFHEVGGKKKVDEIQVFGHAFLAVRDTLS
ASPALHDQLSGKKLTANLDDNSRLQKVIAIGKARSLYHIYDDKNQPSGVN
FTSGERIRMFFAEGKLDRILVTGGPLGKEYPNYMRNDPEINLPGFRLRDK
EKPVFAP
>CT1413 hypothetical protein
MITFIDHVTAMKELSRKFSADEPDAPNSPGRLISRPVRDSRPAFQDLRFI
VKESLNLGFRFFDEVDRMLQKFKGEEPEKQNSEEKADDPS
>CT2203 hypothetical protein
MPHERRRVTIEEPPMKNKQPVPHPLHGHAHKRKYEKRLFRSRHKSIGQPP
GSLIHIGE
>CT1858 hypothetical protein
MRLFSFTRECHAPAWHELKVNMSCLEELSMKAIS
>CT0520 hypothetical protein
MNSRGSLRGCPASGGTGGNLIGQKNFHISVA
>CT1775 hypothetical protein
MNYSFKTLWNAMFLAVGPVWFVLVWMIWSSGQLKTAEDHTLFLGLVIPGF
ILIYVSGFLIQKRHAKKIQGQHS
>CT0080 hypothetical protein
MYNKIHPDVVLDEADEALDEPNYNNFNTSPDPPSPYADLEKKYRKNKFNK
TRTNTSPGLYGTHGLVPVNGTKPAGKNIQKKRGKKAVASKPFRASNGNKS
A
>CT1864 hypothetical protein
MELMAKDIRRAIGWSSRVTERLVAGKAMNSISVNAIPLFFIHMFLIAQEM
SPRHSDR
>CT1404 hypothetical protein
MIVFDAAKRSSFAGENNRPEPVIAHFSYIGRKSTAG
>CT1952 hypothetical protein
MRQIMPLITIKALRNPQGFFFCEQTISKTTYFGVCVNSP
>CT0925 hypothetical protein
MHLSKKILHQITSCLTIQSCSPGTPSAPGRLYSLLELSQRLTAFDGFAGG
GQDGLDGSGGVGGDVDAAHLFIRQIVRLWFCALEPGGGSGCRDWMVETGC
GNEAGVRPSSPSITSTFTTYFFPIYGQS
>CT0691 hypothetical protein
MRHISSTQNKSHNKALHRAAIPLRSIAAGELGR
>CT0069 hypothetical protein
MMKTGRHQALAQLMQRSRIDLRQGNNISISGKQ
>CT1004 hypothetical protein
MLPRSISGISADGKQFIDLIDGLTLPPLVRNKYLRYILDEHHSAVYAYGV
LALRGDSDMGEIEEVLDVVAANAEHYIMGHGKVIRGENGNVSGPKP
>CT0737 preprotein translocase, SecG subunit
MLNSFVVIFALLAALLLIVSVLLQSPKAGSGLTGGISSLGTVQTLGVRRT
GDFLSKTSAILAGLVMVLCFIAQFTLPARHQEGTGSSILQKSAPASLPVN
NLPQSLPTGNIQPAAAPAEQPAAPAK
>CT0445 hypothetical protein
MSILYLAGHLFLHFQGKVMTFHHGIGVNIACGGTPPGVKVNLGFGSDDGN
VFIFPGMGSDVIGNFKGLWKGIQAVGKAAGSVRAHPKEKCTLLEQGESHD
KSGNEHPDAEPAQVRHAALQDSCQRIHAQTSPY
>CT2068 hypothetical protein
MFGLSIEPEWFFMPQHACYNLELGRVCEQVVFDELVA
>CT0276 hypothetical protein
MGVLWWQNYVNQRYQSVEITRFYSGTSPRGGNKLALTDQQKTAFRKLRQE
HFRKTMPAVQKIIEFKKEMISEAVKPDPDLQKLSAIADSLGKRQAWLEKD
LALHFHELAMLCTLTQRDSLKKLLSNIYTVRYQKMTLWKGRPHREDREDN
HRGPIPPSAPEP
>CT2013 hypothetical protein
MKEFPGLKILAALLIPLLFCACAVDRPPTGGPPDRSPLSVTSTLPASASV
NTSPQTIRIAFNHYVGRNDLSKSIFFAPRIDDYEVSIHGKEADIRLYSPL
QQNRTYTLTLRTPLKSLDGNHQLDRSWVLAFSTGPVIDQGTIEGRVWTNR
LAPMQNATVMAYNASRSNAVPERRPDYIAQSGPSGEYRFEYLAPGSYRIV
AITDNNGNLQFDPETEVFAVAATPTVQTGMAGVGLRFAPEDYSARSLQSC
RIINNREIEITFKNAIPARSFELSAIRIENTATGASLPVLGYFSLSRSSE
DTTYRILTAPMEDRAFYRLRFSPGDAESQTSELTFSGNAHTERYPELSVS
IVPANGADNVITETIRPESGSSIELQCNLPVVESSVKPAVTLSLSEKGQQ
IPVPFTISRIDSRTFAIVASQGFQHSKDYLVQVKPGILKGLVGEPSKTAL
VQSRFSTAGPDAYGEISGSGRANAPAVVVEARRTGSEASRRMVAKTDASG
TFRFDFHDLPAGEYTIAGFIPSASGAISPMTRWNSGSVAPFAPSDPFAAL
TITVRGGWTTEDVRLDIPSARRSGPDDAKSPEKP
>CT1517 hypothetical protein
MVSNERISTILAIKNGGSFQRALTAGFRPDYQSLLALCDGEEGRNLSFFY
QIKTE
>CT2090 hypothetical protein
MVVKFRLWLKLCSRKESFFSMKTNPAGSVVRSGRNKMLTGGGKSGTFHIF
MNEKQGFSVLLSFMMMAEACSRIVLPVSGGTV
>CT0783 hypothetical protein
MLTTSIPLLPFHFHHQMLYRSLSPNAEEYGVFSCLNAETYNWQAGLN
>CT1636 hypothetical protein
MNSEASENRPAHSATGTLWSWHGGNGALIWQLMFSSDAVMGIKRFPQERK
AAFFCLESSTGRVLRDDFVLTAGDENETPVGDGWMIGLETVHGSRLVCHT
YQPGSPEHLGIWAINLPEARVVWSRPDLTFTANLGDAFLAYRSIVFAGFP
ERDYVLIDPLSGCELEHLGTAHERPNQLRDAAQSEEERQRILLPDTVFDE
AGHVENINHGATSVTVFHRMEPVAEGVPGWVSTLSVSECERLVHEDVMAS
GEPMPVFNSFLIKDDRLYYIREREFLVSFVVS
>CT0875 hypothetical protein
MHLVVRYSVDQGRKHLFDLRRSVDHIGQCFVIDPVLRDFRSYRNNSEGRF
LITLYVLENRCETARRVTALRRGTNEVLPKKTSSKISIFFPENDQIIAGF
TLKSLFNGSF
>CT1086 hypothetical protein
MMFPASPQFHRNVSSIALNGISNSSFRPGPACCSLFSGLIGI
>CT1924 hypothetical protein
MSRSQTACAFDANSTTIARVKSTGHGSFTMTMCRTIEGGLDDLGSPRGSK
LAGKLLSALKEWRNEPVALSFSPAELMTLPAWFPSGSSAEYCDSLCRIEA
GYFLHEADRWQWHDMVLEPTPDQPSGLDRRMLLFYPVKPAQFIENELLKH
ARVGWRGVHVEAVARLSSVTGETLAVLELEERYAALSISTNGKISYFRYW
PVKDGSEREYFAIRELTSAPIDGAPVKVTGSAASAKVIERIGRETACAIE
PLELHPWVSVEKGASKGKSPTATIRAVSTAIMALNGG
>CT0408 hypothetical protein
MRQPSHHHHTMAGGPDGETEMVSATLDSALAEKRGGLMQKKA
>CT2093 hypothetical protein
MFDGHRRRGSSGGRCDYWQKHTEQEKKQMPCWFHGALVSDGGKFWGEFPA
SLAEQCPAISDNFP
>CT1169 conserved hypothetical protein
MDVASALEGNWEFRVDHKNIKIFSSKIRGSQVLGFKGEAVFEASLRKLIS
LFHDFGNYGKWVHQLSEMEVLHKSDELDYVVRQVLNTPWPIPKREMIVRT
ALHASEEGALALTMTGIPDYVPLKPDFHRVREARGGWILMPVDGGKVHVT
FVMHLDPGSDIPPALSNAALFEVPFYSLLKMRDLAQNPSYKPAWPSVVDN
HVTIIEDVPDKH
>CT1522 hypothetical protein
MKTIMEQALLDQALAMSPNERVEFAQLILASIEHEDEKIRQKWITEVKDR
MAALKSGKAKLIDFDSLYHED
>CT0096 hypothetical protein
MFRIHEVSRSWKYSKSEPENDPETIKIQFFRLRVW
>CT1226 hypothetical protein
MLPLVRYLKKQCIRAIRKHKLKLFYLKSSFPSRSVINKQF
>CT1619 hypothetical protein
MLDLFLSRDKPDSHRDRAAGWAFRFHLSAVVGTGAVSS
>CT1974 CRISPR-associated protein, CT1974 family
MIATMLTLSRKDVKALRITDSYSLHRVIYSLFEDVRSEAEKRSSVPSGFL
FADKGGDAKGRKILILSDRPPLQPAHGELVSRPVPEEFLQHRFYKFEVTL
NPTRKENKSGKRVPIKTREEVAAWFGGKSQTSWGFSVDPARLDVRMLPVM
QFSKQGDRTVTHGAARVSGMLRVENRDLFIESFNKGIGRGRAFGFGLLQI
EPLKDNSNH
>CT2063 hypothetical protein
MNIADVAKKGFKALFCFPIHLRLKGYRRMSTFLQSNTRAVDF
>CT1269 hypothetical protein
MPFFRFIKEMKQSFMLHPVAAHFSNGVIPVAVLYLVLFLPTGNPFFEHTV
VHLLLVSLLAVPFSFYSGIRDWKTKYKGAKAPVFQTKIRLSILLLVAGIL
AAAIRLAVPDVMHEGGPLSWLYVATLLVMLPTVVLLGHHGGKLAAGQRSE
RFR
>CT1121 hypothetical protein
MSNRNLTTDPRWKSILRPATTCNHQNISDMAMTVEIRDNKLCIEIDLEKP
TPSSSGKTLVVASTHGNAVTDVMIEGKPVTIGLNAYIKK
>CT0212 hypothetical protein
MNAGKKGVIYTCITGGYDELLNHTFISPEWDYVCFSDDMGINNEKNAQWE
IRPLRFEKLDDVRNQRWHKLHPHLLFPESGLSLWVDGNVDILDGEIFHDI
DRALNANLLIAPSLHPERNCIYDEFDACRQLGKDDPDVMGRQEYLIKKDG
FPKAKGLFETNIIFRCHSHPMVITIMEEWWYWVEQYSRRDQLGFTYVLWK
NNYTVEPLSPVSYRFSPGVRFRYGAFHITKEQLIKEKAALEIKVQRFEAL
ICGRLVKVLYKIRKSTVKRWCRVKMQLLNSLCCRK
>CT0507 hypothetical protein
MKKHFIISMIVALGMAGFTGVTFAADAPAAKPAATAPAGEKKAEAPKAEA
KKKAVKKKKAAKKKVAKKAEEKKAEEAK
>CT1980 hypothetical protein
MKRFDTNIAIHEKILLKKELFRQLRANSAYQRSNSFSFPGFFYNIVIF
>CT0222 hypothetical protein
MSPDVIHPKEFREGVPDRALNQRQFQMVLASRPEKMILTRTGHFEFLKET
LAGAGFKSPMEAVSAQERRALVGKISGCYDPIVTSDFFRLPLDRKIRYAG
SLASTFLKRLLNKRKDCGAVFRPSTGILALVFAIAEHGRTADYVICGIGV
RKRNEYLSGKQVKGHDLPHHVFADVKVLRKLARRYNLFTTEPELEHLVPR
YRSG
>CT1844 hypothetical protein
MLPMRLCSFRVQKKQLFVALSYISKKANGLKSPQQ
>CT0901 hypothetical protein
MRKSAFKHNQNEKIPPPEPPCPMVGFISSAPLHQRLI
>CT2277 hypothetical protein
MRYDGIPFRGASFAAAIFMGLSAACHFEAVSEVKPRRPRRKRDNEMCSLP
LSFRYFHFQSGFRSQTGSTTLASKKRLRIQKTVQRRV
>CT1900 hypothetical protein
MKKILLSLALLSAAMLSTRPSFAFGHELLDEPLAIIADQQASLEQTAKNS
TYELLVAKRSITLKNGVFKAGDNPDNFIEARLVRSVICDLNKDNKPDIAV
IIEHHGMGSAGFFELSALLSGAKGFTQTRPVLLGENIEIKEFSVSSNMWR
PEELDIVYLGHQESDSHANPTEQKRARYFLDDDGQLSNDFSHIQIVKKPA
LYLYPVRTTKIEVRLSPKGKVIRTIPDYNNRWRVTVQKDGMIDGQYHYLF
YEAALDKKIELPRRGWSVRYGDLAGWFDSHLHEMGLNRAEAEDLKEYWLK
NLPDSPYYTIRLIEPDVVNKRLGLKIHPKPDSELRVLLNFTPTEKPEKIK
APKLTSFRRKGFTAVEWGGILDDGRMAENVH
>CT2218 hypothetical protein
MQGHAISIQPRCVITPMLYAFEKLATGQCPAYSAFDSIYDTYFPWRY
>CT0902 hypothetical protein
MKLVHRFPVFKSILAAFALLVVQLRAAVAEPLSFSTGQTVSVPSYSHIFV
GNRLKTFDLTTSLAIRNSDPETPITVTRVDYFDASGRFVRAMMKTPLVIR
PVSTLVYVIDESDKTGGVGASFLVS
>CT1073 hypothetical protein
MRMQLKNRSKMATALLASATMLLPSAKNALADAAPEEGIFSLKYLNYHDT
QTGDTNLTAGMSMDRMTVNALSFYGMVPIAGKWSIAGTFIEDSVTGASPA
YHGWGFPSESKNDSTSGASGELRHAGDISVTRYFSRGTLSLGTSYSQESD
YISRGLSLNGTLSTENKNTTFSLGAAYSSDTVYLDKPAVIESKQSDTPGR
KRIVSVLLGVTQVMSQNDVMQVTATYTHGDGYYSDPYKDPDLRPGKRRMF
TLMTRWNHHFDGPDGTARLSYRYYTDTFGIEAHTFTAEYIQPLPHGWEIT
PTVRYYSQSSARFYVPTEDDPRAKTPTDGMEYYSEDQRLSAFGAFSYGVK
VLKELGWSWSADVKYEHCEQRYDWGINGHGDPGIPAFSFRSLQVGLSRKF
>CT0060 hypothetical protein
MIKRRICHMFGAMIPLLLATLFMAACNGNTPKRVTDIDGNTYGTVNIGGH
VWMAENLRVTRYRNGDPIAEVKEGASWTAQTAGARCSYDNSPENGKTYGF
LYNWYAVSDPRGLAPEGWHVATDKEWQALADALGGEQEAGAGLKAPGKWG
NSSGETQSSGFNALPSGARRDADGVFLMLGQFARFWTSTPASNGKALARA
LGFYDNALRVGEVVPRNGFAVRCVKD
>CT1306 hypothetical protein
MGCFQSGQIPGFSIDLGHCFVNPQNRLFPMAMMDPASWMFVLVVSIVCSI
ACAIVSFNKGYRGSPVFAWFGAGFVFSVFALIAIILAPHHHEV
>CT0709 hypothetical protein
MEQTPLRPLGEVMAMIEALGHEVTYAYDDLVFINHNDFLLQFDAAEPNAL
ALFFNTECNAAEADHVAARMIPEGIEKGLIIRRKGTYTMTEAESDNLQIT
FNP
>CT1770 hypothetical protein
MRKTTGTLFVLLTLVTLILQGCYSFSGGALPPHLHTVAVPLFDDTTQAGI
AEFREGITRSLINKIESQSTLSIEPDPSRADAVLKGAIVSYSDEPSQLGS
ATERAVTNRITIVLQADFDDQVKNSKLFSQTFVGFADYQTGNYTAQQTAI
QSAYNMALDDLFNQMISNW
>CT1403 hypothetical protein
MGRGVACSGAVCRGVTGSHAMVIDNLLWSVAIGWWIRELIAIRTGGRRAR
SPSS
>CT0907 hypothetical protein
MNAAEVIAALELPPGARVDRRIPKTLLVERGARTATDKRRINEGVEEVQW
VATLKPSTIGVPAFRDEVREYLEINVLSATLRGGAKAARFAELIHRAVPY
PVFLLMAEGTRLTLSLAHTRWSQGEAGATVLDGEPIAVAVTEAETEGLPS
SFRQALSLARQPRADLYQLYQGWIDTLLALKAAEVTGRFAVPTSADQAAA
RREALRECARLDAEIARLRKAAAKERQVPRQVELNLELKRAEAARAAALT
RL
>CT1370 hypothetical protein
MRISVRFIMLLHGERWRFRKMSMDGSEREALLSTSSI
>CT0953.1 hypothetical protein
MAPGRIYLCEKGCLFSSRQLHEGERETLPVYGQSVMLVKLHQNGRITPAI
RWAIQSSSLSVSQLAARHGIGKAHSPAMEKPRPG
>CT1452 hypothetical protein
MKAAGRYVTIFFLYFLILLFYARRSCGEAVV
>CT0341 hypothetical protein
MLHCLCVIFNRAIHPRVARGSCPETVKPLLAPYLLLFPCYIRHPKFPAST
LAASAQLVQACFLWFMKSIRESG
>CT1679 hypothetical protein
MLLRVCVWEDELFMVWYKKGDAFENSRFSNICGRDLMIKKFFAALQRRDP
FLLFVVVLLLVTGCVKRDKRIRSITIGEQVWMAENLATDCYRNGDPIRHA
KSVEEWNDAISRQEGAWCDYDNDPASGRLYNWFAVADPRGLAPVGWHVPN
DEEWRELEAATGGRGFETAFTGSRNCLGLFFGQGSTAFFWAATPSGEFDA
WNREISKTGGKMQRVSVAKGLGLSVRCVKDN
>CT0419 hypothetical protein
MKFSVTCIMTAMLMSVAPADVVRASGRGDEFGAEFDGLESRSWYRETTTS
NSDSDRDFFSRLRVKSAGQTLKRQVSQIELNAGSSSNLPTFRQRYNETST
TRSNPDLQRERAVKTGLALSFAPSEKLTFTWKPAAWLELTPSYLYQHARN
EETGLWLPPFHSIPCRDRSCSNLSAVSPFAPT
>CT0882 hypothetical protein
MLPEWLPQKPAQGRYDTIAIPSIFNSGCCIGKRGITAIEIEDGMIRLVYW
TRDVQKYRYSGERLHTVEELGNSGIYRAVLNEDYLDYVFSRIRLLA
>CT1356 hypothetical protein
MTQQTNHHSGTANGSRCVVLDTASYCLLLKGDSPPFSAEERSLLIAESPE
EACDFQCLLGDACQPFLSQEYRRIHPGLYEKLEAIAISGWNNDAPFGIDT
AAHLMKKLPLDLNCLINSPIALSPNDPLPALIQKNLVHKHVEANVLVSEP
FTAGRLRYFNIFSETAELKFDHQSPHVQGLLILEALRQAGIASAHYQGLP
LDGKLALLNYNTSFYHFLEQESPIICRCYTDFTSSKTSDDAEACIYMQVF
QWGRLCADAILKGFACTNAERYEQKEQRLKKIIERHKTNFDSKLKRMYES
MVSTQCM
>CT1254 hypothetical protein
MAVTGFFFLFVVYSGRILVFLIFFPVVPSITVPYL
>CT1589 hypothetical protein
MSNTKKGLATAGYNDDLAKLEEIFLPNFFYKKMQYYFSSSGLISNDVLFV
HYTSTESALDIIREKRVLMRNALHMPDRQEVQDGFNIMDGLLSNENNHWV
EFRNRIEVVLPGVVDRVMKIYSDHSHGRNDGTYFLSVLEHDESEKELGRL
SMWRAFCGQSQPVAMFLRLPALSAVSQVLRIFFNPVLYKGKGQQHLELAE
VIKNVENHKSFLERLDPDLVTSAIVSMILINVLCVKHKVFKEEREWRCVY
LPKCFTSETSARLIEPGVEEQVGASRNVYKMPLNAAIDPVLSDIDLSKIF
DSLIIGPSKSPYATYEVFCDELKKIGVSDVESKVRVTEIPVR
>CT0349 hypothetical protein
MPPDSALLAIPDGAALTGGRLISVPQFVLYRLFRQDAIFTGNCLTIHRKL
VS
>CT0314 hypothetical protein
MNSSTKKTPSQSVWTWIVITLLWGSVFFATSTWILGIVSSWFDGGAFSPD
RAEALRVYAMYVPALLVVALSAMVIQSRLDPGWQKQREREKAVRAGKREQ
LFVSFAASIATSSLFTLLTAAAHMLAAPVIGTAVSFSVKTVLVAAGLNIA
FGIAASLFVGMIFLVFGVAKGGSKA
>CT0962 hypothetical protein
MMAENKKWCQKNAGNRPSENLKVKSGRLIYQVMLRQLAVFFF
>CT1216 hypothetical protein
MDQPFFLESTELFCRVFRYEFHQNVFLNLKSVLLERSWFF
>CT1083 hypothetical protein
MHRHHGKCKYLPKNLTNQPAQTNKRFLTDEHPNRILLKNGRDFNGSFSGI
EKDMIQFCFLRT
>CT1904 hypothetical protein
MFMLFKGLGALIEREKLPNNQKHKSLESLWPNRCPESL
>CT0819 hypothetical protein
MKLRRLACMINIGSSQNFVGGMKDAHMTIPDEQV
>CT1141 hypothetical protein
MTAMYFDSLVILYMVVGFSFCDNLYKKILNKCANDFDFYVTLLV
>CT1739 hypothetical protein
MKRQPVGRPRINSDRGRFPDAGVSKFEYNINQEKHMQEQSNKNNEQKAPA
CISTIGVSRCRCGAYHLRYRYVDVAIPRETLYLIMEECFRYEEECAKREG
QRPEAMVFSLGVVTLAILPLDFAAFSKAVGDAVNEDLGIGRLFAGAEAGD
NGLADGTQN
>CT0272 hypothetical protein
MRIMSWIFIFGFALSVFFLLMYFLSKFVNHMKMEQNMEIESFKDSLIDKD
NPVGLTGEELEKMKQQQAEAQAHLREVISKIPVIQKDGKFQVDMDAVRQQ
KAAAAKTNGSTGPATGKN
>CT1604 hypothetical protein
MSANGLSGRCGRTAFTQAIKPVPLYRPRSNELLAAFKKAKTDYLRRKILY
RFFHSISSVLWITRLKYG
>CT1768 hypothetical protein
MSRMKQEPTNVNRKEERERQRKGIPGLIDSTVSSTIDDLKAIIDAKLELF
KIELTEKVALVSAFVLLLVVLMIGVAYLITTIALLFGELFGHVWLGYLLV
SMVFILTFAFFTKVKPNALKNFIHKILLSAND
>CT2074 hypothetical protein
MKSKLSEEKDKWDSSDVVDIGQEKSCYHKGFTGKPVTAR
>CT1138 hypothetical protein
MTVIGIQYWRKNIHALFGISSGRKPQVGKDDKDSGLLRSFVRRDHRRRSG
GSN
>CT1476 putative addiction module component, TIGR02574 family
MKLSVFERIQLVEDIWNSIAAEASDTIELLSQTQKDELHRRVAEHRADPS
TAVPREQVKSRLFSGKS
>CT1653 hypothetical protein
MSEEKSCCCQKPEMLKGTPEKCSPETIKQCHGDQPTHPCVPEEKNAKEDK
E
>CT1035 hypothetical protein
MKQSWLWVLLPLGIVLAVLALFFALFFGNLINGELQFAPIALFTLSLSAG
SYALVRGFRLGWNLSTIISAVIALFAFLASIAGLTLELQGMRIGGIIAGL
LSVTCLAVLFVSNGMDEISVGKRKKTEIAAGPAEWADRIEAIGRRCTKPD
VRTKVLRLGGETRFLTPGTGQADLMVNQTIGRAIDELAEAVKLGNDSAAL
SMLPGIRSLFAQRENQLKP
>CT2246 hypothetical protein
MMDDMIQMVEKAIETSLHWQETGWPVTFGNRQVEVSNLKAAEALPRNAVY
RDEAINYWRQVRLTGEDTAAAGKKALEALKNGDICAAYDALYLCQYLEIP
FEADAKTWRPVYEAFMAKCA
>CT1104 hypothetical protein
MRSKDWERTNGWSGFVKVSKQKVIQGMSEIRIVRAVTRKPSRVIPHYRSA
TFSFKPPS
>CT0356 hypothetical protein
MAVFADMIKAEKSYGRVKVESAKVRKQRRILKDTDTIHRRPSG
>CT1801 hypothetical protein
MNKSTSLVSAAMLGALCATAPLSTASAESAIKPTFDTLFEPLLADPMEPR
IAVMPKLNKKQLQLDIGTSADLYQNSSKTFAVGIDFATWSLLNRTSNFKF
PVDCIDYMFGINTTFRHQFKDKLLSFDEASVRVRLSHISAHFEDGHTDDH
GNWLNPGDSPFGIPFTYSREFVNVTGALSAPGRRVYLGYQYLYHTLPDEI
SPSSFQAGVEIGLPANAYVAADFKLLPKWDWNEGKTDGYRGTWNLQAGMR
LTSIGLKNVRVAANYFSGMSRQGMYFYKPESYTTLGMIVDL
>CT2280 hypothetical protein
MPHSFHYTRRKRSDMEQQTSGQRILDPIERAKLGVKVFNLPYSQAEALID
DYVSGKNYDQASVDYFKDQVATQIHIREKSAELLVTGGEIIKLITRSFMQ
NLPKSIDRS
>CT1876 conserved hypothetical protein
MQFTNDSSMSSNIAKTEFSEQDFRQFQQNLRKETLMLMEWFSEDVFENRQ
VMCGFELEGWLVDQNCNPAARNEELLARVNNPLVMAGLSKYNFELNVAPH
PLNHCLPEFLRGELQTLWDSCSRHAREMGCQTLMVGILPTLQDRMLTLQN
MSSMQRFHALNREILRTRSCHPLKINIEGPNDRLEVVHNDVMAEAAATSL
QIHFQVPLSKTAAFYNVAHVLSAPMAALSANSPFLFGRELWDETRIPLLE
QAAHTPSFVDPTGRPVSRVTFGRDYVRDSLKEVFLENLDGYPVLLPVTFN
HDPGMMNHLRLHNGTIWRWNRPLIGFGENGRPHLRIEHRVPAAGPSIPDI
IANILFFYGAMLHLQPEVPQASISFEEARTNFYAAARSGLDAQVRWTSGN
SMPVETLILQHLIPGAILALAAAGFRSSDLRYYLVDILAQRVASHRNGAW
WQKAFVKKHGPDFRMLTQAYLENQNLGTPVHEWSI
>CT0671 hypothetical protein
MLNYFLLVCLLIARFFPLSRQDSGKDVFRVEFSLCRKEYIDNHVNCQKEN
SRLNPAR
>CT2254 conserved hypothetical protein
MTYLLDANVFIQAKNLHYGLDFCPAFWEWLIESNASGKVFSIDKVAEEIA
TGADELTDWMHNHASDLFLNTDSGTVEKFGQVSTWATSQKYEPTAINTFL
NAADFYLVAHALSGGYVLVTHEVSSNSQRKIKIPDACRGLQLQCMTPYEM
LRREQARFILR
>CT0519 hypothetical protein
MLQRLLDCSNSGRIYINAFYSRRSIQESDSH
>CT0581 hypothetical protein
MKSQMKMDYQKIKKRLGYSFAFVFGFFFGISTCVNFTIDAKWTDFVSLAF
TAAGVALGYITFFRWWRNKKKDDSYRVSKDYLNALNEVQEVIREIDFQYF
YLCPAPGLLVEGDEVSFKRIKQVDQLSHQLYLCRVNLVNAKSELNFWDVN
LSAAFEKEHEELLKCLANLKVVMTGLSSQLFHYYKNHSNEYMTEIDRHKK
MFNGYLKSIRDILNKRRSLKFDGIFTFK
>CT1227 hypothetical protein
MPLGKMFIILVFATGLDLKSLFSDAAFALPVTMPERV
>CT1767 hypothetical protein
MEHQIPVNDTSQEQPKTGPAVHDEIPEPFRKISQKVSEAFSEFKESETWE
KMLDARDKARDYITENPVNSFFYALGAGMFLGFLLKRK
>CT1394 hypothetical protein
MSARPIVTGDYLVEKAIRLRVILIEQSGHGR
>CT0886 hypothetical protein
MTGIESSTFRNDAHIQRYYGHARWYGRYRLLEQGTDLA
>CT0439 hypothetical protein
MRSLNPLHAQYPGDSFFENQIFVKKLAPLKQGGDSFFLSKSEPVETDTMK
SDKHWLNRERLTVYPRIFLALFLILGLVWVLMSKNMLDIKGKPLGYDFMT
FWAASHLALTGHAQDAYKIPLLFKAQQLAISASKVAYAWFYPPTFYLVVL
PLALLPYVTAYWTFMLSTLWGYLLVFRRIVRGNIAMWCLAAFSGLWINFF
CGQNGFLTASLAGLALLTVERRPVLAGVFIGLLAIKPHLAMLFPVALLAI
GAWRTLVTAAVTAVTFMAIGMATFGIAVLKGFLASIGDARLFLENGILEW
IKMPSVFVFMRLLGMPVAGAYIAHCAVAIAAVIVVWRVWRRCEDRNLRGA
ALMTATFLVSPYVFDYDLAWLAFPIAWLSLDGLRNGWLRGEREVLVAAWL
LPLLMAVIAEAVKVQIGPLVLCSLLWVTYRRATAASMAGALATDAYDDQL
GTVP
>CT1910 hypothetical protein
MWLIKHWFSASTFVVKIATFAKPQNVMGNNKKQKL
>CT1435 hypothetical protein
MEQLLSSLISVWSDISPMQKVIVLSAVGVMSVVWWIRHLDKEAGAD
>CT0791 hypothetical protein
MKMLLSSINKKGVKMDKQVDHSNGKRRSGSDKGERMIAIGQKLGVAIPAG
LLLFCGVNASAMTRDAVNISFSPDAVRENEVAQHLTALSANPQGALLADN
DNPLHGNTHVNKFDPNIHNDYSDSGVHTDSHGNEHCNTHGDANRY
>CT0582 conserved hypothetical protein
MRGEIDFSEYPEKGPVFAPLKEPGFFRKAFIEGGTIAWPNGADIAPESLY
EKLLQKEQNRDSVLH
>CT1704 cytochrome c, putative
MSMKIGKVLTVVLVVLILIQLIPGPSHENPPVTGTPKWDSPRTEELFKRS
CANCHSNETIWPWYSTIAPLSWIINLDVSVGRSKFNVSEWGRPGKNDGDE
AAGELRHGKMPPWFYMPAHPEAKLTAAEKDELVKGLAATFGDKSAEKEKK
EEK
>CT1872 hypothetical protein
MPLTELLNLFIRDGHVSILNATHEILRRTIFLIKMRFRIVQSAERSFFPV
FPLDRYFAYPMEWLS
>CT0863 hypothetical protein
MTVFRETGIPCFFISLISNGFVNRGHSLQIFG
>CT0516 hypothetical protein
MTIELTASNLLMPSILFGPISSYHQFSPITHGFS
>CT1463 hypothetical protein
MPTSIGQKGKMKKALLITGLVASLLAVLGLWLHRSYTILKTKPPAPLTTD
VKLEQPSSLFNLPISIEHTVLADYLNGKIRGNFLNADLWLQKKHKERVSL
ALTREENITISSNGHKLFCTFPVSAEARLTDSRFGKFLAKLLVWPIHAKA
VVTFSTPIALDRNWHLKTRFKIENIRWEEEPVLKIGPFRKHIRADVDTLL
SDNKRGLTALLDAEIDKEASLYPTVSDVWKDLQKPIVLTRKPVPVWLRFH
CNDITGHISLNKRAIVCNARIMTNMRMLTDTTAISPPTPLPRFRQTPRDS
ISTISDVNFYALVPFASINRNLNDVFMNRRFSRSGYDIVVRSVEAYGSSS
GLSVAIMTDLDLKSHIVISGRPRYDIPTHTLSIDHFDYSIDTGNPIIRTR
ELILHDAIRDSISTRLDVQIGSLVDRLPTIITRAVSKAKAGRTIDLTIDS
LAIRKCDIRVGRNNIYLLVNATAKNALRIKRIKSGKVIRIRKQAETKDQN
PTSQLPRPPDTDNKSRLLPVTARPLQLTNHF
>CT1733 hypothetical protein
MLGFKDIPLTKKVIHIINDIERPFMIRWSNSHDVQQIHDWLQEEEALEVH
GNFLCNWNLTRQCHEEGRLLVLIDEIKGIPVAYQWGQLLSSGILQVRNGW
RGNGLGRLVVEHCVELALQQDEMVLQVECKPSSSIPFWEAMGFTIVEGEF
GKNAKGFRVLSKNLALPPGGRPILATISSFPEERNWQDNVPAIASYHLNA
IVADDGKVYLAERASFPKCFRRMSRDPVIEIIVEGKLVYRDKAKYQGAQD
HGVKWCRNGFYIDVVTI
>CT1184 hypothetical protein
MDLSSFLPFRDEMVKVYHCLTTNATHTSEKPVFSELKVRRYSCPLEDVSN
FITNKIESWVGWELKNQKTAVGGMKTIRAEVSSFALLGMKIDVTFGLVEE
TDINGRKITTVNGKAHTRIDSKGDLGESRRMLRMMLASLDFEFRPQIVHE
DEYVHRSIDPKNSNAAFQQLFDESTLEHRPSTPKAKSIELKKPVKKQIEF
KSSKNSGETVKAPISSQAIPVATNGAQTTTAPDSDVEEVKKPAKPKITVI
SLKKNS
>CT2003 hypothetical protein
MDRHPEIVTRERVNKTKKSGPMMNSGRSFFMKSGQS
>CT0539 hypothetical protein
MTTEAMMKSDSLPVEITRNGDCHFAANRYVRSTCNIHHQPEFHPLTR
>CT1587 hypothetical protein
MSQTRKHQENRLEKHFFHKNRHKTSSIETRLARHCDFFIFNLS
>CT1390 hypothetical protein
MARLTRCGPFPLRSQPNLKKSFAQKIIAIESLAKSAFVWYPHTEKIHCKY
CHDYQSTDPIQ
>CT2009 hypothetical protein
MSNNQSRMTHRNANRIIAAGIFLVAEAVYLSTMAPTFSFWDCGEVIATSY
TLGIPHPPGAPLYLLVGHLFSLLPFFQDIGARLNFFSTLISSTTIMLTYL
IIVRLIALYRDSKPDGWSLHEQIAAYGGGVVGALALAFSDSFWFNATETG
LWAASSLLTATIFWMMLCWYDEDPAPGSERWLLGVMYLIGLSIGVHLLCL
LALFALVLIYYFKKYTVDLKSFSLMTLFSLGLFFLIYKLIIKGIPVLLVT
TSWWGMSLLVAALASGIWYSHKKRLVLLNLGLFSVVLLILGYTSYMLIFV
RAHAGPPINENNPSTLQAFFSYVNREQYGEWPLWPRRWSPEPVYQYFYQK
YSSEWDYFWRYQLNQMYLRYFGWQFIGRSADVEGAVVDWGKLWGIPFLVG
LFGAGAHFRKNWKMALPVATLFLMTGVILVLYLNQPEPQPRERDYSYVGS
FFAFALWIGIGVERLFTWFSGRLKSLDPKQLVWLAVAVVASGLLSINGRM
LMANYRTHDRSGNYVPWDWAWNMLQSCEKDAILFTNGDNDTFPLWYLQEV
ERIRTDVRVVNLSLANTGWYLLQLKHDSPRGAKPVNIEMRDDDLANISYV
PVDSVNVAVPAGMEARKLYDDARRSGVALPGAPSDSLRWTLKPALTYQGQ
GFLRPQDIAVYAIVVDNFGKRPIYFALTVDPAEMTGLDRNLRLDGLVYRL
VPLKSDSALSFADPGTLYGNLFNVYRYRNTGNLAVHIDETSRNLLGNYPP
LFARLAITLSASPEQAVMVPDASGAYKTVRRGELALEVLDRYTRLFPLSR
YPVTPKLAGSVVAMYAAGGANEKAYPYIHYLETLAAQSGAEQEPDLYFTL
AQTYRAVGRVHEADRIMKELETALPELRKRLDSLKQ
>CT1718 hypothetical protein
MSHHHQTKNLGLSIVINAFIFMVIHQLLQPIVINVQPKPKHTPGTRIVHC
SIPGRPLCTSAFPSPFFRSGRTSARIEKTRSRSSMLVQIPWRPRNSFEIS
SRDFVLNVMADISI
>CT1311 hypothetical protein
MFPPRRKHHIWPMSLYRTYNLSHRDERRALMNRTVVVAVAVVVCCGLSGC
TGPTDQKVNLPHTDLNGAIVSPANTPQSSLVSDMPPWIDLYRERNLVNVS
SVDHVVIVIFETQGSREEVYHHYFDKFNGEENFSSFRYNRDIISFVKDGY
GIKITLLDSTKNLWSLEYHRQMI
>CT2205 hypothetical protein
MKKAPWPSLVAIALLALLLVVPFGPLLTLRFVPGSPDSVAPMALDKALEA
LQAQSGRYPLWQPWTFSGMPTVEAFSYLSELYLPNLLFGFLHFDPMYIQL
LHLVFAGMGGFVLARRLGLGSIPAFLSGSAFMLNPYMTAMLVYGHGSQLM
TAAYMPWVFWAALRLSEKGRLADAGLLALMLGLQLQRAHVQIAWYTWMLA
VPLLVVKILIDTKPPGVSKGKVGVLALAALALGGAIALQVYLPALGYLPF
SARSGAGDAAEAYRYATLWSMHPLELITYLMPGAFGFGGITYWGFMPFTD
FPHYAGLVVLGFAIAGVVAGRKKPMVLFLSAMTALALLLSFGNFFSPVYD
LFYYFAPKFSSFRVPSMALVVVALCLALLAGYGLQAWLDRPLVESSPVFK
WGGLVIGVAAVFFLAFEGELKQLLRAAFPAVQIDNYDLVPMVGNLRWELW
SGSLFVLIVVAAAIAGLLWIAARGMIGARAVAIVLVALSCADLGWIDHRI
VSPDDHSLRVSPLVERTALDRALEGDEITRFLASRPGVFRIYPAGRLFTE
NKFSLAGIESVGGYHAAKLGVYQELLARTDNLANLDVLRMLNVGYVLSPA
PIDNPALKAVAAGKLNLISGEVPVAVYELAGSMPRAWFAPWAVAVQSDDE
AIAAVMAGRGADGGAFVTGVPWQGMERFSTGTVLSMQRSAESIAMKVRAE
GDALLVLSEVFYPERWKLTVDGREQPTLKIDGIIRGIAVPPGEHEVRFVY
DRSRFETGRTVSLVATLLSIGLIAAGIVTGRTSSKTIKSSDKP
>CT2270 hypothetical protein
MKKKFLRFFTTLLFVCSFIPGKLNAAPTATHDGVYLDEIAIGSGYAWGHL
KFSEADYNAVPIFARFGFNMNSVFGMKESKSTLQLALEPFCNPVTEPDSG
VETGLNVFIRYLQPVAPSVKLVGEIGSGPMYLSINSAEQGKAGFNFLNQF
GLGAQVAVSPKSAITVGYRFRHLSNAGTSEPNRGINSNAVVVSYSLLY
>CT1979 conserved hypothetical protein
MLFHNQTIDFTMLATNENRLVEILLQCQPGQPRTRGTWEVDHQGTPFILP
SIGGITLNLQVGDPAFGWEGDHIEPGVSCTADTHKPFEHPNVTVQMLSCV
GNTATIVSGEAKGESGVVIGHHGGSEHIIVDFPREVKEKMAYGDTIMVRS
KGQGLKLTDFPDVSLFNLDPALLAKMKINIAEDGVLEVPVTTLVPAYCMG
SGIGSAHVAKGDYDIVTSDPGAVEEFGLDRIRFGDFVALLDQDNRYGRAY
RKGAVTIGVVVHSDCREAGHGPGVTTIMTCATRGIRPVIDPKANIADLLG
IGTRL
>CT1006 hypothetical protein
MAGVSVSEKYTTAFFGYETGYLRSGAVFQRGGASGGPAFCRGVTLLK
>CT0850 hypothetical protein
MTIVFRHEASCSRLPQGNQKRKECALKAEKIGKKKSTLNSMVPG
>CT1178.1 conserved hypothetical protein
MAEVRQNPVVIVPGVLFWDSLYEVMREALSTWIPAEKIAIVPVNLLDWLG
FPPSPERSTNRVMAALDRTVRAMASRFPGEPVTIVAHSGGGTVAMIYLLE
RPFQGDVYAVNGLVGRLVTLGTPFHTHEHFAKIKTDFIFKHLGPEFFQKY
QVVSVVSNQYKGSLDGGMIEKMCYMFYRGVTDDGNLAGDGVVPARSCFLD
GAKNVTILECEHLPAPHTKWYGTKDGVEQWIEWL
>CT1041 hypothetical protein
MAKKQTFGDKQKKGTVDFKMAKLVYSVKSEKTNAWKFVEKSVRIPNGENE
LDVLKKAMAGQGK
>CT0858 hypothetical protein
MYMIYPPLFLSEAGKYVSEIIEKNAFLYKRSSISYYVPYKFSAKTAPANR
ATLIPVRQQK
>CT0826 hypothetical protein
MLLTFPGKLENLNRIDRKIAAMALAFRPAWRHMIQPTLFRRKLKTLPV
>CT1972 CRISPR-associated protein, CT1972 family
MHNRFNLIDEPWIPAIGKGLVSLADIFSDPRIPALGGNPVQKIALTKLLL
AIGQAACTPETTEALEQLDAETFRRACRAYLEKWRDRFWLFGDKPFLQMP
AILDWMESQRAAGILSETENAKQIGPGFYPSLPSENDSILSQFQTLKAQT
DAEKALFIVSVMNFAFGGTQINKNIYPSEEKVKGKGKPAKPGPSLGRNGY
LHTFLFGSTIIDTLIMNLLSQEEIDNLPFWEKGIGTPPWENMPVSRECDA
ALSLKKSYMGTLVSLSRFVLLHDDGIYYIDGLPYPSHQEGWLEPSMTIDN
QQNPPKAILVNPEKRPWRELVSILAVFDSNKNNKFVCLFIKYGLSRWPKR
YNKPGDKIGVWSGGLQVSFQTGEQYAKATNDFVESSVELDPDMWNNLWYD
KFFGEISILEIMANKVKNGVINYYDSFEPKKEKKPKERASTIMGKKAVEL
FWQLCERRFPELVDACGEPDKLPAIHEAINLLALQSYDAYCPKETARQID
AWAECQKDLKKFIRELMEADRRVGGVPSEF
>CT0661 hypothetical protein
MKDSFSGVAHSGQNANDFMQATVKSRCRFHFQTEAQCSRSIPSEKTGRVF
K
>CT1001 hypothetical protein
MLCVSLQIAILFLCCLVLHKELYKTKYLLDLKKYEAASNSRFILKLFREK
GCFKVREVQPLQMIRDGLPESGCSCGGGKKL
>CT0287 hypothetical protein
MFFRVITHNHKNRQPKSLATDGKSSERKENINVRNNNAGNAEAESAGQKK
SQDFRSLRTSRA
>CT1973 CRISPR-associated protein, CT1973 family
MDNEKEKKTGRQKQFVEFVIGLCQRDKGAAAALRRADNPATEYQSWEYLA
GFNIDLEKPFERIPYAAIAAAIARAKAERNGSAGIGKAIAFCYEDRSKSD
QAKARLRRLLACNSVEEACRILRPLFSLIDSKAAVTLDYAELLSQLLWFN
DDSNRIKTDWATDFYRHAAKTENEEVKA
>CT0118 hypothetical protein
MGNYKFKAYYDEAYPPVPDKATLFWRKFIPWQLFRFFILNIKMIRIVVGG
HS
>CT0751 hypothetical protein
MKAKNTMPFENAFSYTPNKWSIVFNGTFIIHTRSIGISGITKTQKSRASD
IPNCPFPQNTDYVKGQSNLLPAARTISKNIYSCIGTCLFKPVKGLQYIKN
RQPQGGVTINLAARKGTADRKKLTLRDFCLMCSNNQEFVHA
>CT2083 conserved hypothetical protein
MLVCDFKTTCLNWVNVDDVVMGGVSNSAMQLTQDGTAVFAGNLSLENSGG
FASVRTVLERRNYADFAGFRIRVKGDGKRYSFRARNDERFDGVVYKFDFE
TVPDEWMEIDLSFAGFIPSFRGRTLVDVPPLDSSNIVQIGLLVSNKQAGA
FWLEIAWIEAYRADTVASSFR
>CT0639 hypothetical protein
MQKEKWQEPTKDVWFSSWIDIWFMNNKTGGWA
>CT1136 hypothetical protein
MMSNSGKMLDQFVFNPLTRFILPEVGRRIFFVTHALPI
>CT0682 hypothetical protein
MSIDWCPGIREACGYWRDAPMLQQTFEAMERNLEQNNDACIDCAKTVVEV
VCRVVVESFHTQQAPLKLTEETPSLSNWLTAAIRALKLGDVRDDRFKKLV
SSHHKLADALNDLRNKAGPASHGKDPYLARLAEHHRRSAVLAADAIVAFL
PQAYLDAQLDPISSREPWERFAADNALIDAHVGLAVDAEDGDTPTLRFLL
PSGDEIPINIEVSRLLYLLDRDAYVEALNAARGAPAPAAEIVEGQGESA
>CT1902 hypothetical protein
MKKAERKSFLLFDKPVSELDLTLNRWLSRPAPDALPRLE
>CT1142 hypothetical protein
MPIVQKLNLKNRKNKVAIVLQNGLQNAVTGLFHSGGSQKGAPRVDLFEDV
DSAVQWIAA
>CT0905 hypothetical protein
MKKQVLFMPTIRALSNSYRFFFSGDASDKDIEALLSRLRAFKEKEFALKN
QIAENPLDNLFHETGRAFSADKIFDEEERINLKPATGEAIVKELEKYNLS
DTSEDLKGVAFEPFPGRTFRGEIGRFFTPRTIVRHTQAGPCGEQRRGHTR
RSPVHGLAQRLRIDLASTEVSAP
>CT1860 hypothetical protein
MLISFPEPRFEALSIQAALVKMGGKYRFITLENFSDTATSSSKTVMGKLT
ADNGHMNLGSKNYDDLNSFQNDPDKLLSL
>CT0497 hypothetical protein
MFQPLVSNQKSPYWHHHKRMKIVSSMLFSFLEIAATVIHKKI
>CT1911 hypothetical protein
MCLRSNEPPLLPNRCYQQYFYLSMSKYFLVAKPDFGSPVSLSMYEIRRCL
CLISITVTCCPSLTRPIIFEFFLSEYLKLTLVLAVLVLLSLFSSDSIFDE
NSLLIFSISLLVSMITSELLLSVGITKGL
>CT0835 hypothetical protein
MVASDRKAYFRSVSRELQAAGLPVDMNKAL
>CT1889 hypothetical protein
MNQRLTAALLVGISVAFIAAEILVAVFGSFDQGWMVLFLSLYAGFVGLLF
GLSTLLEGRREEVESVSERRARARRDGLVGNLLDDYEIDEEFLGRGVRKP
RSKKPSPSSSSGASKERIPDDEELKAAVTAYAGMVGGIVTLRETIESMDD
SAFLSMARKAGMGGVTRERVLALVVEMVSAQGPTKSDESPALSLSIDKES
FDDYIKRCMTEPEVCIDDDATDSEGFSVGLDASDLSSRPGTPPTEFSHDP
KAVMERFKRSTEKR
>CT0664 hypothetical protein
MDINDLFDKIMKSINQFADEIAEQRLQEKMQDTGRGPKNGGKNAENFEAK
EKQGVTSFPKRE
>CT0508 hypothetical protein
MLYGYGYEHPGFLDCGKLKERPLSMRGLSFYI
>CT0500 hypothetical protein
MIYPATHWQYSGYSAKTLISKIYFLLKFNFYT
>CT1117 hypothetical protein
MDTFTKADDFIAALTGFRQMVQSVELLQQFYDEAGAALLYDCHTASPAGV
IRTAEFFTLTGD
>CT0608 hypothetical protein
MNVMVLVAVGKMDKELFSFHNPSVSILRCQLSIINYSSYLPVLEPQPTNS
FIPS
>CT0904 hypothetical protein
MFFQFLLSMSLKKAGLARLLKVFNALTQQKCRAIAQEISAQV
>CT0128 hypothetical protein
MITSGCGAQGWVITVHSSGAISYIKKRQCVN
>CT1282 hypothetical protein
MTKKKHAKVWRLEKKRLFQRQYRKSLKSINKKSSSSPSG
>CT0673 hypothetical protein
MAVLCGRTSIPAWICWMLRVENGTRLDSGVWHRA
>CT1976 CRISPR-associated protein, CT1976 family
MSNPFILLWLEAPLQSWGADSRFGRRDTLDFPTKSGLLGLLCCALGAGGE
QRELLDEMAELRQTVLAFQRERGERPPLLRDFQMVGSGYNEKDKWETLLI
PKKRDGGGAVGGGTKMTYRYYLQEAAFAAALEVPAARAGEFAEALKAPVW
DIYFGRKCCAPTDMVFRGEFDSEVAALEAASSIAKEKRLREAFRVRDYAP
GDEGEAEVVALNDVPVRFGPKKKYRQRRVTIIHHNDEE
>CT0347 hypothetical protein
MKRGLYSTFDQPLFSKHQFFLVHFEMDARSVMVHSPIKQTPTAMKQRFLQ
AFFIVFAISSFFGGVSLAADSSPAVSKKAATSPESGKTVSKDISKPGVWS
KFEMLSFTDRVVFRDTMAGLTGVGYEPLVVRKQIVEGVNYEFFCNARAVY
PGTDWHPAMVLIYKPLKGNAVIKKISKIDGR
>CT1651 hypothetical protein
MLEMCDWEENASGGGNFLRSERTELIDSEDWPFVCSVAIGHTVLY
>CT0352 conserved hypothetical protein
MHSTALKVALTLLLGSIMLLTVAAAGLRFDSGKVSEEAVNGAKIALADYL
AAHPEAPPRALAVIDYSQPSYVKRMAIIDLKTGRQSFYRVAHGKNSGELY
ARRFSDVPESNMSSLGLFRVGERYLGDHGLALRLDGLDSLRNGNAAKRDI
VLHKAGYVSIPFILLNVVTGYGPMIGRSNGCFVVSENDIDEVVQKLAGGG
FIYAWATPDDNSRK
>CT1648 hypothetical protein
MTISNAQKRNRMKALPILKGNKFIVYYLTL
>CT0427 hypothetical protein
MFPPFVLLKSLVNQLIEIVSLFPANLIAVSATQKKDYTRLLAGQHEPVLQ
AKPRDNSWTLITKS
>CT0779 hypothetical protein
MMIENEALITGAGRVSALWLFSKPKLECFFHNFHNLLVFPL
>CT1115 hypothetical protein
MKKGVDMRYFYDSHCHMMNLSHPNLSAIIKRIYNDSIKPLLLKYSVYLKA
ALLLLLFVIPVVVITLLLTGHFVVIKWILYAVSLIAVILFVFVVVKFGDK
KKREIEISKIKSNLLDKVKEKLANVMNLLAAMETDIGDCLIQMEEELRKK
IPLNNVLVISGNGEKKEYDKIVLTPLIMDFGLKDSGKTNLIYKVRWKPIV
AQVEDLCIGIRDYYLYRDKYITGHAEPLFQIIPFMGINTQNYYSEKDNTT
GKSISVSLVQLLDKNFSEFKYDTSPQMRRKKIDAVNWRQFNGDIESIGSY
YFLGIKVYPPLGFDPWPEDDIERAKVCYLYQYCIDHNIPITAHCSPGGFL
VDDDFKNFSSPYKWEQVLDYTDEKNNKPFERLRLNLAHFGGADSKVWRKK
IADMILKKDTVSSKYKYENLYTDISYQGVDKKSYDALKDLLDRYDSAERA
RLIERLIFGSDFMINLQDINSYSQYLDFFFKTNALTLEEKDMLCNKNAER
FLFVG
>CT0702 hypothetical protein
MPVFLTVTLFNQAMQAALCLAQAEAAQATANADNAKARRMEKFKELKKMG
LSNNQLDNFIP
>CT0784 hypothetical protein
MLVVNIKNARVVLSGSGNFRFTRRDSGVAKISFTKKTI
>CT2157 hypothetical protein
MESKRYSEAFKLKVVLEIESGKFRIGEAARHCIGKATALQWKNRDQVDDL
PNTPHRFNKTMSDLEALVVIELSKSLLLPLDDLLAIVRECINLKVSRSGL
EPLSAPAWRRQPERVDACRRGRAQAQKERQRPRAGSCACRCQIPAQDAR
>CT0425 hypothetical protein
MLKMYVDYYVAVLSGFLQQYFGVKSQKGVTMIEYALIASLIAVAVIAVLL
TVGSNLQTVFSYVGSNLTT
>CT2242 cytochrome DsrJ
MQVAHRGSLPAIAEESPVVAAATVKPGGAPIDSSKCILPTEYMRAHHMQI
LNKWRHDSVREGNRTFVNPQGEHFDKSLNTCLGCHGSNPMFCFMCHEYAN
VKPTCWNCHLSPMEVSQ
>CT0296 hypothetical protein
MNAADSVTTKYLFLTSIIMQYLILLITGIAAGLLSGMFGVGGGVIIVPAL
IFFLGMSQETASATSLIALLLPVGLLGVYEYYQAGKITTEHIWFGLIIAL
GLFAGAFFGAKLAIELSNDLLRRMFAVFLVLVAIRLWY
>CT0910 hypothetical protein
MLVEEPSMEEALRHLLPKIIGNRAGWKVINMGSKGRLMKELPNRLRGYKQ
RMDKGEKIKIIVLIDRDNDNCHDLKRQLEDMARKAGLQTKTAAGTGGAAF
QVVNRIAIEELEAWFMGDTAALQCAFTSLRGVRFPNSFNNPDNDGTWERL
HHFLKQNGIYRKSYPKIDAARTIAKHMDPGRNRSRSFQYFVQGVEACL
>CT1097 hypothetical protein
MVLSKQLYINKPAQVTAKFHTFTFGELVTV
>CT1262 hypothetical protein
MPRLHEKAFLSLLGLLLQPIKKEALSTASFFIWKTEGVNQP
>CT1521 putative addiction module component, TIGR02574 family
MTASAEKIMNDALRLTPVERAEMIERLFQSFDNHRKAEIDAAWAAEFESR
LDAYKEGKIKASPVEEVMARINKR
>CT0238 hypothetical protein
MVNLNTSFRLYGLFFEKKCMIPSIITNPNR
>CT0041 hypothetical protein
MNVLKSAEKPFAGFVGRPGKILGKEDGSNMASYELGVVDVRERRPFGFDP
EPFRKASPQANSPEIFDIDSLLRLRTFVYLLLGTVLFLVQINNTLAINDL
AKRNERLREQLRISTSISTAEKLKSRELQSIRYISGYAKNLGLDSSFIPP
VEIEP
>CT1776 bacteriochlorophyll c3(1) hydratase
MCFSGYPDFTIFPASYSSGGCFNRPNQLQAKDFMPRYTPEQLAKRNASVW
TDIQIILAPIQFFIFLGGITLNTLYYFNLAGIDFYWISIAILFKTLFFAI
LFITGMFFEKEIFNHWIYSKEFLWEDVGSTVAAFFHLLYFVMAWMEYPEH
VLVVEAYIAYLTYVLNALQYLVRIILEKNNERKLKGQGAI
>CT1244 hypothetical protein
MLHRLMTADVMVSWKLPDVSNAGEPFGCGDTVRFYVKVSGVPAVFG
>CT1409 hypothetical protein
MTELFQYPMSYPGWWLNNYYYFTWTLAMLFLAGGWAIFYRYGKFSYGVDF
GCFWKTALLVIMTTIALGVPSYYNTKFVAQHGNDGDSVLLTPDRIEYRYR
NGEKKMFLLKDIVSIYQEPVTYNPPPKIFIVAKNAGLRDSITVTEGKYGL
PDVDKLLAALSARTGLQIKRP
>CT0174 hypothetical protein
MKYPILVFLLILSFLMTSYKITYSEESLSPEYISSTGEEFVFATTYPGGE
KYGYPLWHKWPMVLGGELSYQDYVGRKGKLEDRIIYEPSGISKFRKAVME
NGEVLYLDVAGNIPPNGIYFQLATLNLSE
>CT0667 hypothetical protein
MLLWEKDSRNSGTRMGFFMVHLLSGRTADRAARGETGFSSKEEG
>CT1374 hypothetical protein
MLKANDEIDFCMIHYDTGLEIKKKQKSTSMEGACLVFKIISKQQTGKFLN
SDHRAHENLA
>CT1379 conserved hypothetical protein
MEIEQTVLVQCPYCAQSFEVLVDLLAGHQEYIEDCEVCCRPVSLVIDVAE
DGTATVQAQGEDV
>CT1933 hypothetical protein
MRVSPFPVSRKAWIGTESIPEILKPELYYFFRLF
>CT2199 hypothetical protein
MPSLDWIGKKAVVNHHKKVAFDRLRRLALRCSPGC
>CT2282 hypothetical protein
MSQNLTASVFVCQSVVHFFINKLRVLWLFDQAISLFWLAKASL
>CT0526 hypothetical protein
MLDSREYCRCSVMFLIPGYSFTASYQRFSCYPPSAPTHNHRKQI
>CT0752 hypothetical protein
MTALSIAFHLRSAIGLRNKVPHRQQIFNQEEFSPKRQMAHS
>CT2066 hypothetical protein
MDFHGAMLYGVWKNADLTGVPFVFPCKQRLMLKLNEKERESIRLFPILAA
SFL
>CT1065 hypothetical protein
MNGEFPKGVHIMPEGRDRFGQLKPSFPVDFEPLTAGRVLPKL
>CT0724 hypothetical protein
MVMDGVNGLFSRVMRVFCGKTLPDNNNNHL
>CT0544 hypothetical protein
MSALRSMGAILADETHIMISREMIMNDYSMVITANGNNRNGFCLEKSKKL
K
>CT2201 hypothetical protein
MVYQARPDAGKPDRTEIHKHIRTYFFLLPTK
>CT0645 hypothetical protein
MFVFSRFGLWRSLITSVIETSKDVPKRFFHFDSLCV
>CT1178 hypothetical protein
MDRVALKPFRRIAVILLAGAVIATSTSCSRDERKKEEEKHLESMMSILVQ
VQKNLGRIRQKEAVVVRLSSDVEGRKPKSAEQIGREINTNIRFIDSTLSA
SKNLVATLEKQNHESQYRIAALDRMTGQLKGELDKKELELGAMKREIAKL
NRQIARLTNTVDVMDEAISDQEDQMVKAYYVVGTVDQLVSKGILVPPGPF
SRFFGMRPVLANDFDLRPFRQVDITETKDIYFDKPVSRLHVITPHTKGSY
ELVGGKTSSLLLIRNEAEFWKKSRCLVIVVE
>CT1392 hypothetical protein
MKAGVVSRSSRVNSAIERMFPDPIYVRLSNKRFREHITKPYASVAVNASG
F
>CT1200 hypothetical protein
MMNVQKQSYVFVLTRASIVILNVSSCPVFRNIHGLNLPLACF
>CT1875 hypothetical protein
MLDHIGAHLAEAYKAYLHMKSPFLGVVTPFVDYWFRPPVNNPKTSATETS
NQTLHLVCHVILNEVKNPVPGK
>CT0579 hypothetical protein
MPARRFFVGGFEFDIRFECKIDPDSFALCSD
>CT1217 hypothetical protein
MSVHSTLQLASDAIEDARKRLERARVDADDDYEIRQALRHLEEASDYLRK
VSTELKQHG
>CT2148 hypothetical protein
MYNFFRILQNLFSDFFHLLGGHHFSQGKTLASTSIAQFPTISKVK
>CT0955 hypothetical protein
MDKSSIFFPVSHQEKTISPKQSQNKIMIITTYIDTIAETLFREYMLHKEI
HNHLG
>CT1238 hypothetical protein
MQARSGQGITRNLHTPSPGSAERRAILDAMRLKIKELHGIDVIFVVKTMN
VSGGWAWVHTLPRSVDGFFHYEDFSALLHNDGKQWLVDEIACTEPDNPHC
IGSPGYFRKLSHRFPCAPLSIFPTASFLR
>CT1274 hypothetical protein
MAYIMATSLIFIYSTDSGAVSTLLDTGHKMLSPSTTVGMEVAKTS
>CT2082 hypothetical protein
MHIASQVAILADFPYEVFMSADPITIFRKTWGTYQKVISHNLMFHRREIT
TAVAKLFESRNAKRYDRNV
>CT0849 hypothetical protein
MTQNHETAVTCSTVSQLFAAPGQSMLHGIGMAVVSGSP
>CT1307 hypothetical protein
MTRKKEKMDNAATQPGAVINQFSMKFEGRVTL
>CT1581 hypothetical protein
MRGCPLSTNTNAGAGYELSIMYFTRHYFIAFVMGWFSFS
>CT0431 hypothetical protein
MNRMHANTAAMRHAQAPKHAQFQKGAVMVEFAFILPIFLLLLFGMVTFSI
ALYDKTVLCIASRQGARTGALYYASNYDSNGNLINANVQQRACDAANAVC
QQDLINFGPNMNLQIQCQVLGGTVHGQRSVSVTTGIDYTGIYILSDVLHL
SSTTIMRLEED
>CT1723 hypothetical protein
MLYFGKNRRHRHWASACRCIGIEMNPIQTRNHMNRPTARTLSAAMILLSA
GLVIGGCSPTVKVEAPDKPITINMNIKIDHEIRIRVDRDIDNAIGKRTDI
F
>CT1863 conserved hypothetical protein
MSLAINSIEQNSDIDPQALRLALQGYFNLKRQGLIRREGLITLIDFDKPS
DCKRLFIIDINSGTVIQTALVAHGRGSGDIMATSFSNQPGSNKSSLGFYL
TENTYIGNNGYSLVLKGLDQGINDKAEQRGIVIHGADYVSEEYIRQKGRL
GRSLGCPALSMDQCREVIDLIKDGTCLFIYHQGEDYASRSVVLNPKLALG
SGKSKNPA
>CT1080 hypothetical protein
MKKLFAVFLVFVAMALLPVLASAKSASGPLAFGLYRYEQHSKINAETYWK
CDYPVFERSKAGDIINAAILKAVISQAPSPDSKPAAASIEAAASAFIKEC
DEQMKDAQAHSWAWQSETSGEVLLDRPGMVTVSIFTYAFTGGAHGMSVTQ
YLIFDTATGRPLGLNDLFKPGFEAMLDKLIERRFRQMRGLSATDPLNGEK
GGLFENKITHNENFAVTGSGIRFLYNQYEIAPYAAGQITVDLSFDELKGI
LKPLPALKPIKP
>CT0396 hypothetical protein
MWCPPWKHTGLKGNPVRVRNSTRCCNSALAARLATRFADNATVPFRDGKA
GRIRESQKTCLIFFGFGNKATYYHETQLRSAFRNRLGSGIDPVDAQPTVP
LAFQSIRSSCLSRSVPLAFRRPGAIRAVFHRPPSILSHSSYRLLFAFFAS
IRSTLSTRSSRA
>CT1570 hypothetical protein
MNAGECSRQKKYPLLRTLSERYVKHDIDSTKKLAMHIT
>CT1686 hypothetical protein
MKKGSDGPTPHKKALPLRDAYQSKLPIAVFVLIASRR
>CT1978 CRISPR-associated protein, CT1978 family
MLVVVANDLPPAVRGRMKLWFIEPRANVFVSGVRDSLARKVVDYLHQHCP
PKSGLMIFNSSNTCPGYEIFGLGDTRKEITEISGLPLVIEKSAASPPENQ
NRLTPEAPKVQ
>CT0584 hypothetical protein
MRVQLLDEATVDLADGYRFYERQAEGLGEYFLDSLWSDI
>CT0449 hypothetical protein
MPERYRKSGGHGALDAETVWSLPWWERAGREGYTGEARMVSSLSETPRL
>CT1926 conserved hypothetical protein
MSNISQHGIWLLAHGKELFLSYADFPWFRDQTVKSILNVKEQSPGHFWWP
DLDVDLTEEIIENPERFPLVADARVIYK
>CT1429 hypothetical protein
MNLLILLLAVIILLLLVIITMLATGWPGKQREEVERLGNSLRREILEQRS
GNLQLMKSLRIVIEDAVRESVEKEMMAVAPRGRSRRNSRKKIQEAVDLGS
ELFIAGDEDADNGSYESPLQAMQLSLFSEMTERVQAAAVPDASPDKTKER
EPEGETIHMGYVDDIPDVE
>CT0214 hypothetical protein
MDTAINPHSLWAKTIPTIFIVFFVKYVKAYML
>CT1009 hypothetical protein
MLKEAAAVMGGALLLPPLGVSMVACGIPGLLVAGAGFFAFDAMMQERRAS
AQQSSSDPANGGDESWQTMPPEETERYR
>CT1547 hypothetical protein
MRWSSASQKASTLPVDFSNEPLDIVLHRGHRKISPIMVPAAHSGSNKTLH
YEKSGKVTGKNRRPNRS
>CT1022 hypothetical protein
MRRVSSVDEAVAETKSRLMLVFVLNGGLECLSSGKQQIFAVEYLVFFLKE
PVGGLLLF
>CT1685 conserved hypothetical protein
MKKNMGQKDRAVRAILGVAMLLYSIVFQNLVGLVGLIPIVTAIIGYCPLY
EVLGVTTNKYAD
>CT1137 hypothetical protein
MVTPFTSNKKQYATGVWLKKQAIEIARKIQAGECSRDGFR
>CT2043 hypothetical protein
MNARVPQQAVGPFSNQPACHSADRQGFPIVYETI
>CT2076 hypothetical protein
MYLLLIMCPFFILLACAVRGVSPGAGSFLASIAVESGY
>CT0231 hypothetical protein
MKNDCIIAGRKIPYFRQSRQWAKTAFVFILNETLFSQKADYL
>CT1116 hypothetical protein
MSQGDLLFITLAVKNFLAHFDRCSREYFGLVQRCKR
>CT0574 hypothetical protein
MERLTEAFESQVMLIAELVELLRKREEKRGE
>CT1224 hypothetical protein
MSNNFTGYMTLISYGWTNRMVKCNFQNVRWKFPETIPRRKIQ
>CT0567 hypothetical protein
MNKYLLYTIALVILAFVQRFLVSKLLILHASPDILAIFIAFISMSTGQRT
GTNFGFGAGLIAGILSGDLGLSALLGTVQGFVAGFFHVPQKSHATSVKKK
RMFYAASATALIAGNLLQSLLSDPLSLPLYVRVPETVILGTLMSMMLAVL
VYHFALKKLLKD
>CT0271 hypothetical protein
MPSCVLVACSFIVAHVVSAAAIRFAAKSWKYSFLIRYLRYNPENS
>CT1592.1 conserved hypothetical protein
MIVSGGQTGVDRSALDAAIAAGHAHGGWCPRGRRAEDGVIPEKYRLVETP
FSRYAVRTAWNVRDSDATLVLTSGLVAGGTKLTVECAQRYGRLCLIVDLC
GETDAGTVAEWIWAHGIGVLNVAGPRESERPGIGENARRFVAEIIDRDRW
RTPAGS
>CT0338 hypothetical protein
MQPIILIMRNMEQNVGANPCGRPRTMLADPGLPTRLRIHQSNQATTEMGY
GKNLFISFLPNPDFDLDSEQGNPPIPER
>CT1469 hypothetical protein
MLVLVTMKGCVYIADMWSDYYQISHMPGGMEWVSL
>CT1865 hypothetical protein
MIVESFHKAGDKQTGKEGIDHASALWPCMRSPLAFL
>CT0692 hypothetical protein
MIVKDQNGAVVTICRYLLNTRQTNSKMNLRSIRPFVDPHLGDDFVSFKRY
YDPFVFFVFQ
>CT1657 conserved hypothetical protein
MAGNPISDEPNRLTAEGNGYGDPQAQLIDDRKFQRLMKAYETTVETRKLE
IELFWSRSLFFWGFIASAFVASATLRRYSSDISVVVACFGFVCSVAWSLG
NRAGKFWQESWEMKVERIEPSVTRAMFAQPEAVQTNKNFWLRGRRFSVSK
LAIALSDYTIILWVAVVV
>CT0359 hypothetical protein
MSLFITKSFRRLLGGRFSRGFRGTECKGGLVAVAPGSGNGLEIYTIFKGI
PTPRDSGLKTVFL
>CT1028 transposase, truncation
MQSLSSHYHQLLGLPSNWEVENVNLSMSSRQVEIRLAFTGKQGECPICGQ
SCLIYDHAAEQRWLHLDTMQFETILVARLPRCQCKEHGVKTVQAPWAARH
SRFTLLFESFAVELLLHCANIKAASRLLRLNWHTVNQIMRRAVQRGLVRR
KAETVEYLGIDEKSFKAGQHYVTTLTDLGERRVLEVVEHRTTEATKELLA
SLNDRPAVAMETIAMMVSVSMIVSRLVNLLIFSAFWSPPRYRPWPGC
>CT1466 TIR domain protein
MSPKVFVSHASEDKDRFVLQFAERLRQKGIDAWLDKWEMLPGDSLVDKIF
EEGIKEAKAVIVVLSKFSVEKPWVREELNAAFVKRINNGSKLIPIVIDDC
EVPEALKSTLWEPIADLSAYDKSFDRIVASIYGANDRPPIGPQPEYVQSF
VQAIGNLNNIDSLVLRFSCEEVLKTGNAFVNPERVFLKDDKPILPEDELK
DSLEILDGGGYIKLMRTLGGGFFPYQITTYGFDVYANASIPDYQGKIAAV
VSAIVNEKLMSNAKIQERLKENKIIVDHILNVLENKGHIKQSKMIGGLSE
IFNVSPSLKRALSGG
>CT1656 hypothetical protein
MLVACAKSLATALFIFFSAIYEVPLFIAGRSTKRADH
>CT0355 hypothetical protein
MANYRQIASGADGRDYSDILIKLDRRTVSQVTSSTVIMLADNLIKNTPST
VPVANAPRLSKIDDSKSSSISSKLR
>CT0556 hypothetical protein
MQSKNRHNGRASLQVQIKSGFKIKFVFLSLK
>CT0914 hypothetical protein
MAGDSGLSQSFAKAFQFVVLHSGSFLVPYMLL
>CT0848 hypothetical protein
MMTDHHVSELFRKPGFQKAIQKTPGQTQDVQKRRSPRPEAKKRAPRRIIF
S
>CT0808 hypothetical protein
MLLESILASESFRASELPAVPDRMRKPQKATSGGLFDFEE
>CT0997 hypothetical protein
MFFIGYYIQIYGLLYMYLMSRITFLLDFIYLMFL
>CT1522.1 hypothetical protein
MKIRIHELAAHELDEAIEWHELQSRDLGKHFRRIVREQVKTLARNPIWYL
RKSDDIYKAFIPKFPYKNIVHRRRK
>CT1012 hypothetical protein
MSLFKKAVGVAALYCLLAMGGEVRAADSGAPYTVKAAYESLSLPAGESMG
MLGLGVERQFNENFSGGVGTWTAVRGERGGFITIGFQGTARVPLSETFGL
EAGAFVGAGGGRGGATLSGGGLMLRGYTGLTADLGELGRIGAGVSYVDFP
NGGAIDSTQPTVFYSIPFGSSSRQFDGLAYERNSLAVVSKLVRVRSGARD
LSGKVQDDFTLLGVEWRSYFDNEMFVRFEAAGAAGGSSTGYMQVLVGAGL
QIPLSDNFWIDGSLGLGGGGGGDVDTGGGFLVDAGADLRCALDDDLFAAA
GVSYLRAPNGSLSAFCPSLEVGGTFGKESQKHDKLPVRVRMVSQRYFNGS
DGWRTHDADKDVDNLGVQFDYFAKPWAYVTGQALAAYDGQAGAYMIGLVG
GGLHQTIAGPLFVEAEGLIGAAGGGGLAMGSGLAWQVNGGVGVQVSKDVA
LMATLGRLDAFNGPFKADVVGLSLAFGGR
>CT1476.1 conserved hypothetical protein
MRIVVRPEAEQELLEAHARYESKAQGLGYEFARAADAAVASALRTPFGYG
TRIAEGFRRVLFGTQSPQCDPRQSFPT
>CT0837 hypothetical protein
MQERLPFSHRELKKNKAGKPESSRFRNPLVSAKFADADEP
>CT1273 hypothetical protein
MKPIRIDDYCFDRIHLWLQYDPFVPEFIEMVIEVFYPANRKANGLVMFNH
GFLIGNDLLWYPKKIAGMLLDDNPLFGINPSAYYNYSEAIVEKNWAMAFV
SASHAQVDWMPWTDIGGNPRVGQETFAAASYLIRYGLTEFFWLAESRGHN
SKNFDAQLASKAKFLVSNNVIFAGHSVGGAHAQAAACGFDTLSQIGRQQC
RPFNPVIYNRELLPTFSMPMTDWPEADRANPVGLLMLSPVDQHVPIFMPG
MSDYRAALASRQMPMAMVVGQCDCACLDMSQPPAWSGTPGVESQFSQLTG
DGSWVVASQVERGSHCGYLTNKSPLCSVAELPSQCKRCPGVEVYKPMGAE
TAFTAEMLGKFINLYPNGGGFEGGFNDWIGSEFITWLNRQSPCCDLNLMP
MPGGGYIDNVPPA
>CT2069 hypothetical protein
MQGYTPLNQREGGVMSQFRVGDSIIYHKPKSSVSPGPRARQVYALEHGEH
YHYVVDKFWKVTAVNGDGTIEVITRTGKTHRLPVNDPNISKAQPLQQLFH
RKRFPN
>CT0686 hypothetical protein
MMNPSKIFALAIACGIVLLTFNWMAQAQVMYEPLQNKAPLSIYKYPRVKL
GADLEANLTLIKADNGRLNITGTVTNVGKSSCKTASVAELIMNLGYAPQY
SYAKTGVSDILVSRSFNNLKAGDSIVVNAVYQIPDFGGWASANLPGNAKR
LFTLRVIKQDASSYKPDEDSNIENNVADDVVFYRDLTH
>CT1380 hypothetical protein
MKIMADTALLQSIVKLTRPLFIILSLALCGCIQMHTTVHVRKDGSGTIEE
KMLFSEMLSGIMKEKGEGLPALPKKDQLREMSAEFGPDVKVVNVKKVENS
SGSGFIVTYAFDDIEKVRIGNVQKMSKKLTADSTAVKSDSTVVQKPETWF
TFTMKRGANPELTINKEAMLNSSSRGEVAKKPVSTQEKEQMLDMISAFLK
GMKLEIDVVVDGRVISSDASYRADNTITLYAMDFDQLMTHRDILTGKYDG
LSDRDFARRSGKDSGLKFEFKDKVHVIFN
>CT1488 hypothetical protein
MMDLVLMGLFWQIGFYRTMLVEDTLFQMNKG
>CT1745 hypothetical protein
MNLSFSKASSLVLAGMLCSAPTFAAMPLETDDTGTQGAGKFQIEAGMEYA
RDHETVNGDSVREKEWELATTFSYGLSDTIDLVAGVPWSWSKVRVNGQTV
RDENGIGDLSLQLKWRFFESDDKRTSFALKPGISLPTGDDEKGFGNGRVG
GDVTLIATHTVDRGALHLNLGYEYNNYSIAEVRESSRKSIWRASLAGEVE
VAKRLKAVADIGVETNEERDSDTNPAYILGGLIYGVSDDVDLDFGVKGGL
NDAETDTTWLAGITMRF
>CT1201 hypothetical protein
MAKKSTAKEAPEVTPEKKTAKKAAASAETKPKSAKSKTAKIAAPEEPKHA
KTAKPRKKASAKPMAPETPAASPETVEEHIRVAAYYRWVERGMTDGGHEE
DWIAAEKQIKG
>CT0109 hypothetical protein
MKTITRRLFAAVLVPGLLLLSACGKKDSSAPDSAGVEHAAPAIAGPFTGV
LTMKTTIPKAGTSDMKLYIGPKGMRAESKTNIGAHGGEVSMTILSLKDSP
DKIYMINGATGACMELDVSKVKKQPGGDPYKNAKIENLGRERVNGYDCNH
VRISWPDKQNTVDLWVSKDILDYFAYAKMQGSDDQTDTQLAEKLRAAGLD
GFPVKTLLSPEGVVTELVKAERTTPDDKLFEVPANCTKMEIPAIPASPQG
MSKEDVKKMQDWARKMQQQMPKQ
>CT1532 hypothetical protein
MFELFFCRSGNLFSVNELSMRQLVEAMLNSGSVFYVGR
>CT1467 hypothetical protein
MKLVIQMVREDDEKYEEPCRKQRGILKAILEYFTP
>CT1741 hypothetical protein
MIGMIGGRRSQILVSRDFRAAVSQPFSKNHKLFIIMSLTDVREYLQRERS
ASLKQISSHFKADSSLVESMLDQWILKGRVVVKQRDVFGAACCGKCGGKE
HIHYEWVYEWVE
>CT0300 hypothetical protein
MDRQTGECPILRLAGGDEMTGKRYLDEIFTVSLR
>CT0870 hypothetical protein
MIGGGSHFFIFAPQWGHSGASVIFMVFLIESKFG
>CT1752 hypothetical protein
MFAVPLITSSQRESVMSLITVDLLKVALPDCKKPEEWVAALIPALEKYAI
NSEARVASFLTQYRA
>CT0832 hypothetical protein
MMGNFTGTALRFSRIVLRDGNDKADCSRRVHNLRSLQACRL
>CT1388 hypothetical protein
MLTCAKRGLSGARSNAETVQSAGLKEERFERFVAKTGSNRADKPIPANKT
RRHEAPGKS
>CT1920 hypothetical protein
MKVESRKRQYILEGKIKPGFCDGCGCVTERVFVGEWKPSDKPKDEDPLFG
LSDKKSKEKNPAAEEEASAAENQYWIRCTSCNQVHLLKEWQIQIDKELSP
DELKPEDCQLYTPHGIYAQGDALYHKSLDEVGVVREKHATGSGAHVIIVE
FCKSGRKQLLENVQLNQGKPKSTESVTDIIKLKLRR
>CT1975 CRISPR-associated protein, CT1975 family
MNNNPFKGQRIEFHILQSFPVTCLNRDDVGAPKTAMVGGSTRARVSSQCW
KRQVRLEMHELGVRLGIRSKKVADYVAKACVALGADDEAAKACGEKIAAA
FSNDTLFFFSETEASAYAQYAAEKEFDAAKFNDKELAKLSKKTLDPAKDG
LDIALFGRMVAQAAELNVEAAASFAHAISTHKVSNEVEFFTALDDLAEEP
GSAHMGSLEFNSATYYRYVSLDLGQLSANLGGADIADAVEAFTKALFVAV
PSARQTTQSGASPWEFAKIYIRKGQRLQVPFETPVKAERGGGFLQPSIKA
LTDYLTKKEQQAGSLFGKEKEFTFGGEDETFSIDTLVSEIRNFIEAKS
>CT2052 hypothetical protein
MNKASTMTDTFNYTTIFAPLGFFIGGIFLVLLLNKFIGKSQNKKSS
>CT1223 hypothetical protein
MSVHSSLNGKRFFASRLLITFRKGVFALNLHTMAHEAA
>CT1363 hypothetical protein
MNQSQNVLKLPKKRHLLHDLQTAIRKKSRLF
>CT1056 hypothetical protein
MYDQLTNLIVLKTYYKNKRIDRTAYSVTALFKVTCPVYPVRSSVYRPVRY
RQIVNLLLNTPFCYYQLSIVFRIFPHRTILWALCPATS
>CT1292 hypothetical protein
MNPPIYESEHGLTTLRSMRSVWQVIFLCLKRGVEF
>CT1836 hypothetical protein
MYFSFHDSIVYCWIQASSAFRKVCKKLAPANSALRPTWGEGRKQQAIKAG
>CT1520 hypothetical protein
MQGQVSRLPCPPFCDLHHDGSLDIFSFARTKIPEDKQEY
>CT1954 hypothetical protein
MKRVKDQESGGAGRLWRAWKAVKAVMISGVIRNGNNVIGLKNKGNRYLAG
FFQVGFIGW
>CT0115 hypothetical protein
MGFAKSTGDDVNNASLRLKRLRKARPPHYLSDHNGYSQLFCKRRAVLSFF
MIPLESKFF
>CT1230 hypothetical protein
MVGVNLFQEGTNYDRLHTVSYRLPFFACSTSPVLFSVIIFPLAF
>CT1490 hypothetical protein
MASPRTNAICKSNEMKADNIVQPEPFTLRIFSSSP
>CT0828 hypothetical protein
MPRCLWRKVFRHCLQDTPLLAAGFFIKKIKYRRAMKKITPLLLLASCMLS
SPVLADEPTTTVGVDNNQDVSNCSSVSQAGATSVAGVGVGNWSQIFEASH
PIPYLPGTPGVTANAPTLFSMQGLPAQVKGLSLLTQNLYNANYHDVAIGS
SQGTKIIFNASYPAPKPEKKNRNVYVNLDGVARGEVVGSLTVQSRKDKAE
EVDFATLLYDARQYIAANHKLDGYDVTLLTVPNTVSYSMGVDGKASGMTV
APLVSGLINGPLGAMTALSTGFSRNGGITVPTARIGVTFLVLVDSGKSQV
VDLREYYNMLEKGSTNGNGNGNNKKKYEAIQPKESAE
>CT1746 hypothetical protein
MLKGCDKVPENGMTRKDRERAYKKHDFSGTIKAVSGLMRYFQKGGTSKKQ
NQTQQNKSTRSAKPCHNKLMTTSIMTSQCRCSGTIFRN
>CT0809 hypothetical protein
MNQATAEKINTLFSNFDPTRCFIPYISMYEIEALYFSDPPTLATTSGAPL
KAIEHILAECGEPEKINDHTTTAPSKRLEKLSNRSRKTTTGIAIATAIGI
PKMRDACPLFNNWVTELEKLAC
>CT0431.1 conserved hypothetical protein
MARLRSIKRLHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQN
AADAAALAGAGSLIDTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQE
QDVNVAIQPGYWNLITPSFTSNTGLVTHTGDGNIPAVQVTITLSHLKFFF
APILGIPEGTVQATAIAAVSPPTGGTGLFPMAIGGCLFNLFWDSVHNTPK
LDPATGQPYEIQVYSVYSGGAGASCDSGQWTSFQTDANNVPFIRDLIKNG
NSIPLSIGDSIWIQPGTEATVYDSVPTNVDVAVPVVDNVATHSSQTVIAI
AGFHITGVVKHGNKSVVTGHLIPQSMVPSLHPGNGTGIPYGAYTPPFLVK
>CT1031 ATP synthase, putative
MNEREPEKLSAKDKPLEKRVGDSELRKIRARKNATRSIWEGFAMFGIIGW
AVAIPTLIGVAVGIWLDRHYPSPHSWTLTMIVVGVVIGCLNAWHWVSEEN
RNIDKEE
>CT1642 hypothetical protein
MMMLTSCAPSALGKLFFEAGDSCLDRFGSYDRG
>CT0559 hypothetical protein
MGLKWIFTALFSVLFRPEVFWSDARERFREVNAMKDYAAPVIAIVQFVKL
PFIGTPRMAMLLAIISFTIDVAVLYLLTGVMDSVAEAERSAPVQHEIMTA
LSFSLTPIWLAEPFGFAGTWRWLFIAAALAYTVFISRTGLQAMLGSDESG
VEAFSGKSAFLVGAMAMISSLLQNGLIRFFISI
>CT0728 hypothetical protein
MQKIQDFSFFPCLFHYDVTMIFYRTKARKHNTTCKNAISPIT
>CT1027 hypothetical protein
MMVFGVLHERILGIEQVISIDMTVKFYSQKMKIIVYCEHWAGVQ
>CT1621 hypothetical protein
MKNLVSRTAGVALLSVMTVISGCSKSDNPVSEAVSSLSGE
>CT1720 hypothetical protein
MQSVEGSMKLTRRFSLLSPLFSPDAKELSAHLKV
>CT1251 hypothetical protein
MKMDTKRGRKRSLFLCKSFLILHDFPHPNIILIFRHH
>CT0713 hypothetical protein
MTFLLIHAERKIFKEYDFLVFLSRRHNVIVILYSLDIKLVF
>CT0482 hypothetical protein
MGNSELQPVSKSAGGQAWLLLALDELLAIVRECINPKVSRSGLDRCLRRH
GRAL
>CT1057 hypothetical protein
MRKSSQDWSIRALAMAQSVNAEQSVLKALFDKTKKRQYRMPEKRRLVITD
ELICRRN
>CT0680 hypothetical protein
MSVNTVTVPAWNNAGVLPPIRPNASGNSGDRSPYVVDLATVFDYFSTSPE
RKTILDGLLRFRADLHTAGITSGFQWLDGSFFEQIETLEKRPPKDMDVVT
FFHLPQGWDQRSLVQHHGSLFDQKLVKKNYAMDAYFIVLGQPTNNWHVKN
ITYWYSMWSHRRDGLWKGFVQVDLDPAQDGPARAV
>CT1343 hypothetical protein
MKIPVCLQTHNRLGEKYVGDGGRELVRKNCTGFSGIESESEEHFLWYIQE
VRVSAKGTA
>CT1140 hypothetical protein
MSNDELKGRALKPALSASGITGSPFLSRYDAFKHHQFSTKMKNKQDTTEF
QSDDYDQRQVKKS
>CT1643 hypothetical protein
MDDPAIMWNAGNIRGRHPQLRLTKGFKSGEKSRVEVAVAVARTIGETNVS
GADSGKDATMPSIQGHLALSTHSSSRRNRQPSLFQAITNRRNGIRRLTKR
MKLSTRGHACSNCRCHWATSCFLPVSFSPERISTITGKASVRAATAPKPS
VRTEAGLPCATRQARRRR
>CT1481 hypothetical protein
MSLTPKCSLIIRHYFFTAKSGGAKLINHSSHKSHKKNALYNAEGPPFQAA
LLITNFRETGSYTNFE
>CT1421 bchF, 2-vinyl bacteriochlorophyllide hydratase
MPRYTPEQLEKRNASKWTTVQAILAPIQFLIFLAGLTVTYLYSQGIWVTD
FWWVTFFVALKTFMLVLIFVTGGFFELEVFGKFAFAHEFFWEDFGSAIAM
IVHISYFILFFWIKPAEHILILTAYLAYLSYLVNAAQFVIRLLLEKHNEK
KLKASGAV
>CT2014 bchJ, bacteriochlorophyll synthase, 23 kDa subunit
MSSSPSRIGPNSIIQTVGALETAYGKNETEKLLKKIGQGYLINNLPSEMV
EESKFHALVTALQKELGETATAGILKESGERTAKYLLKVRIPGPFQTIVK
LLPAGLAFKVLLFAISKNAWTFAGSGEFSYGSKPSPNVMVKVTFPSHPVV
SNFYLGTFTALLRELVSPKTEIKADIRKEGSAIRCNYLCKI
>CT1826 bchY, chlorophyllide reductase, BchY subunit
MHPQSMCPAFGGLRVLMRIDGAQVCMAADQGCLYGLTFVSHFYAARRSIV
SPELMNAQISGGTMIDDVRCTIEKIAEDPSVRFIPVVSTCVAETAGIAEE
LLPKRVGNADVLLVRLPAFQIRTHPEAKDVAVSSLVKRFGAFGEPKKGKT
LVVLGEIFPVDAMMIGGVLQKIGVESVITLPGADLDDYVQAGRASACAVL
HPFYERTAALFESAGVKIVGGNPIGANATGQWIERIGEALDLDPETVKTV
AEEERQKAKGMMAGFAERMHGSVIVAGYEGNELPLVRLLLEAGLDVPYAS
TSIARTALGEEDHRLLTMLGTEVRYRKYLEEDMEAVLEHKPDLVIGTTSL
DSFAKEHGIPAIYYTNNISARPIFFASGAASVLGMIAGLLEKREIYGRMK
EYFMPSA
>CT0301 crtC, hydroxyneurosporene synthase CrtC
MNITTDSLQQAWHRLDAPGSYEWWYFDAEDESEGISVVFIWFVGFAFSPY
YLSHYEEWKAHRRDDQPYPLDYGGFSFQLYQDGRETINFIKEGGRELFAS
EDGGIGVRFEGNRFVYDPLRDEYRLSIDFSFPARDRSVQASFSFRPLHRF
DYHFDTDLHAGVDFRHQWVLSVPKAEVHGLLDITSLSSDKRQVLQFRGRG
YHDHNLGTVPMYESIDRWYWGRTFSRRCDLIYYVVFLRGCSAEPQAVLML
LDHKTGRQSTFDAVRVSESRFTRGLFAPVHGKTLRLEAEGVSVEVQHQKA
LDTGPFYLRYTSLLSMMIGEEAQEEVRGISEFLNPAPLKSRLMQFFTASR
VWRAGKQSAMYVLYNFFKHRFERVHRINRKKF
>CT1942 csmA, chlorosome envelope protein A
MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGS
MGGGSLRGSSPEVSGYAVPTKEVESKFAK
>CT2054 csmB, chlorosome envelope protein B
MLSNNSKHIRIMSNGTNIDVAGAINTLAETFGKLFQMQIDVANTALKALA
DVAEPLGKTATDLIGSFTGAATQVLQSVSSAIAPKK
>CT2064 csmD, chlorosome envelope protein D
MADEEKIDTMKSFDFAVKSITEAGVNQLNLISNTIQSAVPAVTNAAQSLT
NAVSVSVKTVSEAAGALAGALGELGGAVANLAGALTNSAVSIAQSGVSAV
TNAIGSVLQAKKI
>CT2062 csmE, chlorosome envelope protein E
MNNPRGAFVQGAEAYGRFLEVFIDGHWWVVGDALENIGKTTKRLGANAYP
HLYGGSSGLKGSSPKYSGYATPSKEVKSRFEK
>CT1046 csmF, chlorosome envelope protein F
MANESGNIGVFGDLFTAVGDLAQQAVDMAGSALKTATDTVQPVTNACVQL
CTTSINSATQLVEGATKAITTAIAPKQ
>CT1499 fmoA, bacteriochlorophyll A protein
MALFGSNDVTTAHSDYEIVLEGGSSSWGKVKARAKVNAPPASPLLPADCD
VKLNVKPLDPAKGFVRISAVFESIVDSTKNKLTIEADIANETKERRISVG
EGMVSVGDFSHTFSFEGSVVNLFYYRSDAVRRNVPNPIYMQGRQFHDILM
KVPLDNNDLIDTWEGTVKAIGSTGAFNDWIRDFWFIGPAFTALNEGGQRI
SRIEVNGLNTESGPKGPVGVSRWRFSHGGSGMVDSISRWAELFPSDKLNR
PAQVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGM
VGKFNNFNVDAQLKVVLPKGYKIRYAAPQYRSQNLEEYRWSGGAYARWVE
HVCKGGVGQFEILYAQ
>CT1639 pscC, photosystem P840 reaction center cytochrome c-551
MDKNSNGKLIALAVGGAVLMGALFFSVSFLTGYIPAPNHSAILTPLRSFM
GWFLLIFCASIIIMGLGKMSSAISDKWFLSFPLSIFVIVMVMFLSLRVYW
EKGRTTTVDGKYIRTTAELKEFLNKPAATSDVPPAPAGFDFDAAKKLVDV
RCNKCHTLDSVADLFRTKYKKTGQVNLIVKRMQGFPGSGISDDDAKTIGI
WLHEKF
>CT0641 pscD, photosystem P840 reaction center protein PscD
MQPQLSRPQTASNQVRKAVSGPWSGNAVHKAEKYFITSAKRDRDGKLQIE
LVPASGRRKLSPTPEMIRRLIDGEIEIYILTTQPDIAIDMNKEIIDMENR
YVIDFDKRGVKWTMREIPVFYHEGKGLCVELHNKIYTLDQFFK
>CT1018 soxZ, sulfur oxidation protein SoxZ
MKIKAVVQNNIVSVKVLIPHPMDTGRVKDQAGALIPAHFITEVTATIGGD
TVFHAELGSGVSKDPYLSFQFKGAKAGDMLKVSWVDNKGASETAEAAITA
M