GenColors

Gene list

Applied filters:

COG category: General function prediction only
Gene type: CDS
Genomic element: chromosome

Number of genes found: 254

# Chlorobium tepidum TLS, TLS

>CT2223 conserved hypothetical protein
MPPRKAPFRATAPRVILLAIAAVVVTLLFVPGINSMSPKEEPARIAVHRG
EGFRRIVEKLHDAGVIRFRWPLLAAGALVPPLHKIKPGRYTISGNHSVFG
LLWYLHSRPQDEVRVMIPNGVEQRKIARIIAANLDIDSTAFIAASRDPRL
LASLGIKGESTEGYLFPGTYNFAWASTPKEVITFLTRRFRAFYSDSLKQE
AKQAGLDEHQLLTLASIVEAETPLDEEKPVIASVYLNRLKKNMRLQADPT
VQYAIPGESRPLHYKDLAIDSPYNTYRHAGLPPGPICNPGAASIRAVLSP
ANTGYLYFVATGQGGHAFATTLAEHARNVQRYRAARKEQQKNPGGGTP
>CT1765 molybdenum-pterin-binding protein
MNISARNIFKGSISSIVKGAVNAEVTITLASGTPIVSIVTIGAVERLGLQ
EGMAASAIIKASTVILGTNLHDAKMSARNILCGTVTRVIDGPVSCEVDLE
IGVGEVLSAVITHGSAEKLGFAEGSHACAIFKASSVIIGVD
>CT1561 conserved hypothetical protein
MIFPCEKAVWYYLPQIRADIAKELVKTGMTQSSAAKMLGVTPAAVSQYLH
KKRGGQTIKSRLYKQEIRNAVDKLRDGAAEPELYSIVCNCCQILKNDDKE
IGGSSYGDKTAG
>CT1556 conserved hypothetical protein
MRPLRKTVGLALSGGGANSIAQIGVLKALEEENVPIDFIAGTSMGALIGG
LYSSGYSASELESLAHSLPWQKLVSLDNEAPRTNTYLEQKSIRDRASIAI
RFEKFKLVVPKSLSASQTLTRTIDLLILNAPYHTAHSFSDLPVGFRAVST
DLLSGQRVTLTSGPLSEAMRASSTIPILNLPIKRNGQKLADGGLVANLPV
DELDHFDAGYKVAVDTHGRMYTDSSDIDVPWKAADQAMTILTQIQYPQQL
EKADMVITPDISNHKATDFSDIRELIAAGYAKGRLLAPIIKRNIQLAPRH
DIDIAGFSKSIEGIPNSAGYIEQARTARAIVRSDNRIRKILDELLQTGLF
TRVHARLDRRSRSVTFVLEPLPRIEKIEVTGGPANAIPEKEISDTFLPIT
GKLYTSEIATGSLEKLIRLYRDKGYSLVGIERASVSGETLTIRLTSGRIE
NVKIEQDRKLTKPLTVRRELAVDTTKAFRYAEAEKTISNLYGTGVFNRVS
LSTENPDEQSDNPNSTLRIKLDEKLPTVLRIGVRYDETSNAQLLLDFRNE
NLYGTGNSAGVWAKIGQKNNRFNLEFSMPRIGSTPLTMFTRAFFDQRDFE
TRQLALIGQSGLQATGEPRSLGIQRYGITTSFGTRIGKNGRLTADFTVQN
AQSYIRDNIDEPFATGNVNLASIGGQFTFDNRDSSFLPSGGRYTNIRYTS
TSALLNNADSFWQFSALHEENFSISSATTLQLTALAGTSSEEVPLSEKFF
LGGTGNAYSYRFIGLKDSDLIGNNIAVAGAMLRYKVPMQLLFPTSLTLSY
NIGNVWEKRKQMSISKLIQGVGAGLVWDTPIGPAQFTVAKPFAFERDDVN
DSARIDFTDTVFYFSLGHDF
>CT0270 carbohydrate isomerase, KpsF/GutQ family
MSERLDENFSRAIDLMLACTGKIIISGMGKSGIIGQKIAATLSSTGTTAI
FLHPAEAAHGDLGVVSEGDTVICLSKSGMTEELNFILPALRERKATIIAF
TGNPRSYLAMNADVVLDTGVEQEACPYDLAPTSSTTAMLAMGDALAICLM
KKKNFTDQEFALTHPKGSLGKQLTMRVGDVMATGDALPVVSEDAMLSDLI
LEMTSKRYGVSGVVDAEGKLTGIFTDGDLRRLVQTGESFLDKKAVEVMTP
NPKTVAPDMKAKACLELLETHRITQLMVCDEKRCPVGIVHIHDLVTLGL
>CT1700 conserved hypothetical protein
MLKDALGEWRGSEEEITRQIEADPVNAQAWASRAGVRSAAGDFEGALGDL
TMAIELGLRFRERIIAYGNRGIIRSETGDYDGAIEDFSAVIEARPRKSIM
KAALVQRALAKEKFGDKEGSAADRRLARILSPDLNKQTTKK
>CT1790 conserved hypothetical protein
MDGHSKNHCSPVGRYFPMPLDRRFLPGWLAFLALLAICVFPLMAKAAGVP
ALVGRVNDYAGMISPQARSIIDQKLKALEAEDGTQVAVLTVPSLDGQPIE
EFSIKVAEAWKIGQKGRDNGILFIVSKNDRAMRIEVGYGLEGRLTDLQAG
RITRDVIKPAFKSGDYDKGFIDGVDAIVASVKGEYKAPKRKNDDGAPSPF
LIFIILFVLFVASRFMRFFGGGGGPFGFGGPGGGFFPGGGFGGGGGSSDD
GGFSGGGGDFGGGGASDNW
>CT0328 conserved hypothetical protein
MPYSSSGVGHNGFDKADLHIHTKCSDGLFTPEEIVHKAIHVGLKAISITD
HDTVTGIDQAKPLALELGLELIPGVEMSSAYKGYDIHILGYFFDYQQSEL
KGYLDHCRLLRTERAERMVQKLAKMGVKIEIEQIIMKAQNGSVGRPHIAA
VLQDEGFVKSFSEAFSKYLGSHSPAYVKSIETHPEEVIRLINEAGGLSFL
AHPAQNVPDEILRQLISFGLDGLEIIHPSHDTYRQNYYREIANEYFLLFS
GGSDYHGPKDHEDNFGQVWIPYEWVTKMKSRLAPAVKE
>CT1494 carbon-nitrogen hydrolase family protein
MLKSKLRIVQADCTLANFEENLERHIKAIETAIRDGADAIAFPELSLTGY
NVQDAAQDMAMHIDDRRLDALRELSRDICIFCGGIELSDDYGVYNSAFMF
EDGAGRSVHRKIYLPTYGMFEELRYFSAGRQIETVTSRRIGKVGVAICED
FWHMSVPYLLAHQGAKLLLVLMSSPLRLSPGQGVPAIVTQWQTIASTSAF
LLSCYVACVNRVGNEDSFTYWGNSAVTTPDGSIAASAPMFSEHSFDATID
YSVVKRVRLQSSHFLDEDTKLFASQLETMLSAKHQG
>CT1264 conserved hypothetical protein
MRRVFIHTFEVPGSSIDGYGHVSNIEYLRWMQDGATAHTASEGWTLDRYR
QSRAIWVVRRHSIDYLMPAYASDRLDLHTWIEWVRDCQSVRRYLLTREGD
VRALARAETLWVCVDPESGRPKRVPEDFIQAFELVVGGEAEALRIVGKAS
DSAS
>CT0148 oxidoreductase, short-chain dehydrogenase/reductase family
MSFYTLITGASTGIGRRLAEEFASMGDNLVLVARSQDKLETLAAELRRSC
GIEVQVCCQDLAEVGAALKVFGFCEERGLPIDKLVNCAGFSIAGNFERMD
EETFVQMVLVNMVAVAALTRRFLPAMRARRRGVVINIASLAGFQGVPGMA
GYSASKAFVVNLTEALSVELQGTGVRIFAVCPSFLDNDLFYSRAGHDRSR
IVTPVSSPEVVVKAVHRGLDGKQVLILPTVLDRLMVFVQRFTPRKIVVLL
ADIFAGARERGGSNG
>CT1066 conserved hypothetical protein
MLQSERAHQPLSEEELNMLETFLASTEAPEECMNSIEMIDGFLTAVVIGP
EVVPEHRWIKYMLDPENQRENLFNSPEDESRITDLLNRHVNAIDAQFESE
PEGFLPIYEMFSYSEEEERQIAIEEWALGFILGMELSHEAWQPLFADEST
AMLAGPLFVLGKVTDDYDNMSQEEKDQMIDMLDESIIGIYAFWQQQAEEE
KGV
>CT0600 conserved hypothetical protein
MMSRSIQNPLAALVALLMLMPVMLRAGQKGEAMTFELTSTAFRNMGAIPA
LYTCEGKNISPPLTWKNIPKGTKSLVLIVDDPDAPDPAAPKFTWVHWVLY
NIPPGKTGFAEGAGNHPAETEMQEGFNSWNRGGYGGPCPPIGTHRYFFKL
YALDTVIDDLLSPLKADVEAAMQGHILGETVLIGTYKKRGK
>CT1493 xanthine/uracil permease family protein
MRNFFEFDRHGTSYQQEVLAGLTTFFTLSYIIVVNPAILEATGMPRGASM
TATILTAVFGTVLMGLYAKRPFAVAPYMGENAFIAYTVVKTLGYSWQTAL
GAILVSGVLFTLITLTGARKWLAEAIPMTLKHSFTVGIGLFLAFLGLSNM
GVVALGVPGAPVKLGDLTTIPALCGLGGLALTGALLVRKVTGALVIGMAA
TTMLMLSFGLLQLPQTLFSLPPSIAPLWLQVDITGALTWGFVGVILSVLV
MDFVDTMGTLIGLSARARLLDENGNLPEIEKPMLVDALSTTAAALFGTTT
AGVFIESASGIEQGGKTGFTALVVAGLFLLALFFAPILTIVPPQAYGPVL
VLVGMFMIESAGYFDFNDYTELLPAFLTIVMMLFTFNIGVGITMGFISWV
VIKALAGRFREINAGMTALAILSATFYIFYPYH
>CT1194 hypothetical protein
MLKMSVVSIGEVLVDRAVLDARFSCNLDLCHGDCCVEGELGAPIDDREAR
FLESAVEPLRSMLPERNLRYIRRHGCAEVYQGNLYTKTIDGRECVFVYHE
NGKALCAVETAWKKGLLDATKPLSCRLFPIRVRKKFGLDYLVYEQHTMCR
DARRQGAEQDVRLIDFLEAPLVEKYGHDWYMSLKEFVASI
>CT1272 conserved hypothetical protein
MIRSVTVYCSSSNLAPEPYFSEAESLGRGLAERGIDLVFGGGHVGLMGRT
ADAALKAGGTVKGIIPRFLEEREVAHPGLTELHVVETMHQRKMLLTDWAD
AFVILPGGLGTLDELMEILTWKHLGQHRKPIILLNTEGFWNQLLQFFERI
AAEKMVKPGYESYYDICNSASDVLAMIDRQNTSV
>CT1986 WD-repeat family protein
MRRRPATIKRSTLMGFLSKIFGKKEVELKRPQVREDAALIKTLVGHEDRV
LGVRFSPDGKKLVSGSFDEKVKLWDVETGNAIHTMSGHTTWVKCVDYSPK
GDKVASGSIDSTVRIWDVATGQCLHVCKGHDTEVRMIAFSPDGKTVASCS
RDTTIKFWDTETGNEVKTLFGHKSYIECIAFSADGKKLVSCGEEPVVKIW
DLETGKNIANYPTGDTLSHFVSFSPDGSQIALCGRDAKVKVLDAATGQML
KVLEGHEDGVRALCYNPAGTLIASAANDESVRLWDVAKGALVHTYRGHTH
EVQSVAFSPDGKVIASGSDDFKIKLWGVV
>CT1549 membrane protein, putative
MNDGKATPGEAGCRENQLLKLPGFGNFTIIDRYIARQFLTIFLFALASFA
ALFILINLVENLDRFLDRHISFGRILIYYLSGLPDTFLLTSPLSVLLASL
FVTGKLSMQSELPALKSAGMSLSRLMKPFLLVTMAIAALNTINSCFIAPA
MYDWSKGFEKRYLKKQQDNGEEPLHIRESNNRILTVAKIGPDKKSATTVS
LETFNGSQIVSRIDADSLRIITRHKYWIFYNTKQRTFSKGAETLVTRAGA
DTLKLSLAPNTFKMIDTDPDEMNIVQHIDFIWQKARSGLPGLERATVKLH
TKLALPLASMIIVLIGVPLSSKKKRSGLAVEISISLLIGLLYLGMLKTIG
SLGYDGLLNPVLAAWLPDILFIIAGTFLYRSADH
>CT1629 TPR domain protein
MASVSGNSTAPDSAAVVTVQDPLSDYVQGLFLDMKGDYWGAIDIFRKVLV
RKPADPAVHYSISQSYYRLAVLDSARVYGEAAVRLDPSNRYYLRYLAVVA
HDMRDYDRAAELYGQASLLEPDRTEIMYLQGLEYMAAKRLEPALEVFRKA
VRIDPYNEAAFAQTLALEIALKHYPEAIDTSKQLLKLGGNERKIGITLAE
LYTKTGQETLAVQTLQELIAGDRSNITYWIALFDHYIMVGRNDDFHRELA
VFLERASLPPESLHDLAKLYILRSGKDSLYVAPTIALLDELTARRPRDSE
LFMLKGMFGMMHGHQQEGVVLFRKAVQLDSSNATAWEYLISTQLDLGQKR
QAFALLAKARRRLPGQRFRWSVLEGSLLLSSHKLRRAVAVLETVAGTKRK
PGDPNLLIQANINLAMACDLLGMKKRSRSAYERVLDLDPHNTLAMNNLAY
LFTEEGITLRKALRLATNAVMLEPENGVYLDTLGWVHYKLGNFELARQFL
EKAAATGLDEPDIYRHLGEVYRKLGNEPKAREMLEKARTVEKTQGNKKSG
H
>CT0890 methyltransferase, putative
MSEQNWTDTGRFDNKAAEWDANAIRAALADAVARAIIAHMPVGKPANALE
FGCGTGLVTTRIAPHCLQLTAVDSSREMLRMLGEKIAASAIANVTPLHLD
FSRPEEAAGLDRDYDFVYSSMTLHHIPDTASFLRELIGHMSPGGALAIAD
LDAEDGLFHNDATEKVHHGFDRTELQALLESAGFAGVSFMTAHIIEKKNR
EGNLKNYPVFLVTAVKPKA
>CT0058 HIT family protein
MTRYDPDCIFCKIATGHIPANLVYKNDHVAAFHDINPVAPIHVLIIPLEH
IRSLSDLKDGDSEIAAQILLAARIVAEKTGVLESGYRLVFNNGEDALQSV
GHIHAHLIGGKTMGWPPFAGREVAHGQD
>CT2278 conserved hypothetical protein
MNSLFTIVSGIRFDEPWWLLLLPLTIAGSAVYRLFWRKNRQGVLFPSVSE
LRSSGFAALSLFSKFPEWLHWLVLVLIVLALSGPSAPFPPSSRDTVGIDI
MIALDVSDSMNTPDFGGKSRFAGARTAAMRFIDNRPADRIGLVVFSGGSF
TRCPLTLDHEVLGRLAETVAPGFFDEPGTAIGTAILTATNRLKASSSKEK
ALVLITDGENNAGEVTPETAARLAANYGIRIYTVFAGKEARAFENTSNTA
LNRKGRSELETVARISGGRMFSAGDVFGLMKSFRDIDRLEKTRLKGRMPS
RTMALYPWLLLSAVCLLLAEQALSATRFIRIP
>CT0028 C-20 methyltransferase
MSNNDLLNYYHRANELVFKGLIEFSCMKAAIELDLFSHMAEGPKDLATLA
ADTGSVPPRLEMLLETLRQMRVINLEDGKWSLTEFADYMFSPTPKEPNLH
QTPVAKAMAFLADDFYMGLSQAVRGQKNFKGQVPYPPVTREDNLYFEEIH
RSNAKFAIQLLLEEAKLDGVKKMIDVGGGIGDISAAMLKHFPELDSTILN
LPGAIDLVNENAAEKGVADRMRGIAVDIYKESYPEADAVLFCRILYSANE
QLSTIMCKKAFDAMRSGGRLLILDMVIDDPENPNFDYLSHYILGAGMPFS
VLGFKEQARYKEILESLGYKDVTMVRKYDHLLVQAVKP
>CT0117 sulfide-quinone reductase, putative
MAKVVVLGAGVSGHTCASFLKKKLGKQHEVVVISPNSYYQWIPSNIWVGV
GHMTIDDVRFKLKKVYDRWGIDYKQAKAVSIHPEGDANISKGYVTIEYTD
EEHAGYTETVDYDYLVNATGPKLNFEATEGLGPDKNSLSVCTYSHAAHAW
EELQKSIEKMKNGQKQRFLIGTGHAMATCQGAAFEYILNVAHEISRRGLS
HMAELTWISNEYELGDFGMGGAFIKRGGYITPTKVFTESLLAEYGIKWIR
RAGVYKVEPGVAHYETLDGEMLSQEFDFAMLIPSFSGVGLTAFDKSGNDI
TDKMFLPNKFMKVDADYTAKPFGEWGANDWPTIYQTPMYSNIYAAGIAFA
PPHSISKPMTSVNGRQIFPTPPRTGMPSGVIGKIIALNISEQIKGNHKEH
HHKASMARMGAACIVSAGFGSFDGLGASMTVFPIVPDWEKYPEWGRDMTY
SVGEVGLAGHWLKFMLHYLFFHKAKGYPFWYLIPE
>CT2138 hypothetical protein
MKPLKEVVGAYLALSDAQRQLVAGEYDEAAANCRRAMEISHTMPPEEAFD
HAGFDAFCHAGLAEALAGLRSFDEALHSADKALHYFNRRGELNQDEGKLW
ISAVYSRALALDGLGRGAEAMPEFKKVVEMIEERKGETPGKERMMEVAID
RIAQLGASNQQKKPGYKAWWEFWS
>CT1461 kinesin light chain-related protein
MDYLQGRNYELQINYPEAERYYKKAAAIEDENPLYLNARARILWEPGRYP
DAEPLIRHAFAIGEKALGPKHPNTVQCTKNLNALLEKKKQCGFIRCAAWR
TAAPTRRSDPSSLAGNPGCRGQFRRRRRRDRRETG
>CT1766 3-oxoacyl-(acyl-carrier-protein) reductase, putative
MSVSGKKVCFMTGASGKLGSEIALAIAGQGYSIFFTWQHSEKKAKETLEK
IRWVSPESQMVRCDVSNIAEIEKAFAIFSEHFNRLDLLITSASNFFRTPL
LDVTEPEWDSLVDTNLKGAFFTMQQASRIMLKQPFVSRIITMTDISANLV
WRNYAPYTVSKSGIQHLTRIFAKEMAPKILVNSIAPGTISAYSGRDEEPE
ADLVGKIPLERLGDPMDIVMAIRFLMETEYITGQVINVDGGRMLF
>CT1062 hypothetical protein
MLNTFLESLLSRLKSAFDLEHPGSLPGEVVIVLITISFMLTFFYLLWRIV
WFSWLLYGFSLAYRNIRYMGVSVQSRERSRAFGIRQNGKLAGYGLIGLCR
SGFKIGPLFAAPLSLPKRFSARSKARFLKAH
>CT0063 magnesium chelatase, subunit I, putative
MKQETDLELLGRQIAEESAFIDRVREVLASTIIGQGAVIDRVIIALLANG
HLLLEGVPGLAKTLIVRTFASAMNLSFHRIQFTPDMLPADLIGTMIYNPK
TMEFYPRMGPVFANVILADEINRSPAKVQSALLEAMQEKQVTIGDVTYPL
GEPFMVLATQNPVEHEGTYMLPEAQLDRFMMKVIVEYPTFEEELEVMQRA
SAVQAPIEVQAVVQPEEVFRSRSLVDRIYVDQRVQRYIVDLVTATRSPER
YGLDGLSTMIEYGASPRASIFLLLASKAHAFLQRRAYATPEDVKSIVYDV
LRHRVRPSYEAEAENMKAEDFIRNILEHVQVP
>CT0536 glycosyl transferase
MQASGKTTPAVTVIIPHLRNRPTLDACLDALRKTTFRDFAVLVVDNGGDA
SDLAGLESCYPEISVLHLPENAGYAGGCNAGLKLVISPYVVFLNDDTVVE
PEWLGCLVEAAECDPQIAALQPKILSLPEHRQGRRVFDYAGAAGGLIDRL
GFPYCLGRSFGGREVDAGQYDEMCDIFWASGVALFVQREVAEKLGGFETE
FFMHMEEIDLCWRMLLAGYRVRSVPQSVVWHEGGASLSEGSPLKVYYNHR
NAMLTLLRNRSTVPLVLLLPLRIALEAAAVLYYLAGGKAGVMRAGQVARA
FADVLRRLPETLRQRREIQRSRRVSDRELFRNTPLSIFLPRRPD
>CT0289 GTP-binding protein, Era/ThdF family
MQPPLFSCGHVTFVGAPNAGKSTLLNRLLDHKLSIVTPKPQTTRKKITGI
YHDDRSQIIILDTPGIMDPKQSLHESMLEITRRSLRESDVIVALIPFQKG
DEPIDRKFASELIEQWVKPTGKPFVIALNKADLVPEETAKEAQTEIISKY
KPVATLALSALTGGNIPELVELLRPLLPFDEPIWPDDILSTEPERFFVGE
IIREKIFLQYGREIPYSTEVVIDEFKEQHENNPSRKELIRCSVIVERNSQ
KQIIVGQKGAAIKKLGQAARKEIEELLDRPVYLEIFVKVRPDWRKKKNLL
KSYGY
>CT1895 hypothetical protein
MLLTSVVKPALIANGGSVLHVSSMDHNSSVFDPGNMQGERSWSGYEAYAR
SKLFNIMFTLDLSSGDNAVRSNSLDPGVITTKLLHAGWSLAGDEVSVGGD
DVFETVIEIARHGYNGEYFENARPAICSSVARAPAARQENEIPKQKPRFF
GSFVIENVA
>CT1275 alcohol dehydrogenase, zinc-containing
MVLERVVDLLHETQPLVLRELPVPEPGPGEILLRVATCGVCHTELDEIEG
RTPPPRFPVIPGHQVIGRVVACGDGVAGIETGSRRGVAWIYSSCGHCDLC
RSGNENLCAEFSATGRDADGGYAEYMVAPAAFTYSIPDVFSDAEAAPLLC
GGAIGYRSLMLANLKDGQILGLTGFGASGHQVLKLARYLYPKSPVFVFAR
SEKDCDFARSLGADWAGGTTDNSPSPCDSIIDTTPAWLPVLSALERLRPG
GRLVINAIRKESHDSELLAGISYERHLWMEKEIKSVANITRADVSAFLDI
AARMGLKPEVRSYPLEEANRVLLDLRHGKARGAAVLIP
>CT2275 conserved hypothetical protein
MVNSRPRPAGLAGQTGDIGILKDKPFIIEALRAAGALTISLVAALGGRVV
GHIAFSPVTMSDGSFGGFGLGLLSVLPEFQRQGIGGVLIRDGLAWLKALG
AIGCCLVGHPEYYRQFGFENPDGLGHEGVPLEFFFVLSFGGQVPQGSARF
HDAFMASGPAS
>CT1637 conserved hypothetical protein
MPFEKYTVPVSGRSDAAIEEELTRLRAELAAEYDLREVVHRFGESDFTFL
SVLDSYALLDRIDPAAFVKDEQMPYWAEIWPAAVTLSRQIVETGELAGKS
VLELGAGVGMASIAAARSGARVLCTDYSTEALRFVAYNAMKNRVPLDTAR
LDWRMVKGAEKFDAVIAADVLYERVNLLPIVTAIDALLAPGGAAYIADPR
RRLADQFLELVHENGFEVAETRMFDAEGDQTVAVTIYKLQRLKA
>CT2078 NADH oxidase, putative
MNKQVDVLVIGGSAAGIVAATTGKAFYASKSFLIVRKEPEAVVPCGIPYI
FGTLDGVHQNIVPTAPLANADVELLIDEVVSIDREAKSATTAGGVVISWD
KLVLATGSEPKTPDWLEGRDLDGVFVIPKNRDYLCRLRSRLEEPRRVAII
GGGFIGVELADELAKKGHDVTLVEILPHVLSMAFDSDLSLKAEELLVKRG
VKLKTGEKLKKLAGQASVSKVILESGEEIEVDIVILATGYAPNVELARSA
GIKINELGAIRVDEYMRTEDKNIFAVGDCAEKFSFITRIVKGLMLASTAC
SEARIAGMNLFGLSRLRTFSGTIAIFSTAIGGTTFAAAGVTEQLARERGF
EVVSAGFTGIDKHPGTLPETSNQYVKLIVNSENGLVLGGAVMGGQSAGEL
INVIGVIIETKMTVNELLTLQFGTHPLLTGPPTAYALIKAAEAVEMKLRH
FK
>CT2219 histone macro-H2A1-related protein
MPDNVLIHAIKADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLE
ACRELGGCLTGEAKITKGYRLPATFVIHTVGPVWHGGNHGEAELLASCYR
NSLKLAIEHHCRTIAFPSISTGIYGYPVEQAAAIAITTVREMLADERGIE
KVIFCCFSDRDLDVYQKALAAG
>CT0872 lipoprotein, putative
MENILDKLGIELNEQTRLTNDSKFTFNCHSGLSCFNTCCSNLDIVLTPYD
ILRMKNRLGLTSVEFISEYTEPVIQKESKLPFLKLKLGDEGKCRFVAAEG
CTVYGDRPAACRYYPLGFGIYKNEEAGDDFYFLIREDHCKGFEEEREWTV
GEWRKNQGVDEYDDKNRVWMEVILNKKLYSPELEPDEKSLKMFFMASYDL
DAFRDFVFESRFLDVFDIEEERLEMIKNDEAELMLFAHQWLLFALFKKPT
MNLKQV
>CT0541 CBS domain protein
MEIFFLFLLLILINGLFAMSEMALITAKRSRLAKLAEDGDKAAAAAIKLG
HEPTQFLSTVQIGLTVIGVLNGFIGESSFTPPLAAALELYGGWDPKTSHI
IATVLVVIVITYITIVIGELVPKRLGQTDPEGIARNVARPMQILATATRP
FVRLLSASTDAILRLMGKHEQTQPSVTEEEIHAMLEEGSVAGIIEQQEHE
MVRNVFRLDDRQLGSLMVPRADIIYLDTALPLEENMKRVVESEHSRFPVC
HNGLQSLLGVVNAKQLLAQTIKGGVTNLAEHLQPCVYVPETLTGMELLDH
FRTSGTQMVFVVDEYGEIQGLVTLQDMLEAVTGEFAPLNLEDAWAVQRED
GSWLLDGLIAVPELKDTLGLRAVPEEEKGVYHTLSGMIMWLLGRLPQTGD
ITFWENWRLEVIDMDSKRIDKVLATKIDNQPTEDPKPVA
>CT1574 conserved hypothetical protein
MKETLSGVVTRVTGASYIVETGDGLKVRCRTVPGTVSENEGSNLVAVGDR
VEFRPKASETDMAEGVITRVEERRTALVRRREVRRNRSKEKEQVIVANID
QLVLITSFDDPPFNSRLVDRYLVFAESEKLPLLIVVNKIDLDEEGMVEED
LEVYRQLDCNICLVSAEDGRGIEELRELLRDRVSAFSGHSGVGKSTLINL
LVGCEELRTAETSGKTGKGVHTTTSSAMFQLPGGGYVIDTPGIREFNLAG
ITRENLRFYYTEFLRYMPECTFSSCSHTVEPGCAVIAAVESGSIARERYE
SYLALLDSLAE
>CT0950 conserved hypothetical protein
MKNPRAGRPLHCRSVEELPGVTCFLPEGVPPARLRNVVLSVDEVEALRLA
DLEGMYHADAADKMKVSRQTFGRIIKSARKKVADALVGGKTICIEGGKIT
GSCLTGESEEPAVCICLHCGYEQPHVPGVPCRTANCPHCGKMLIRKGRYS
RVD
>CT1308 oxidoreductase, short-chain dehydrogenase/reductase family
MTTSDNGKVLLITGASTGIGRSTAIQAVEAGWRVVVAARSAEKLAALAAE
LGAERAMAVPCDVADWNQQEAMVQKTIGRFGRLDAVFANAGFSKGSSFIE
GEHRPDEWRDMVMTNVFGAAATARLTLPELMKTKGHFLVTGSVLGRVTSM
RNLYAATKWAVTGMAQAIRNEVASLGVRVTLVEPGIVDTPFWEGLQKPAA
PELLPEDVARAVLFALGQPPHVDVSEIIIRPTGQAH
>CT2271 lysophospholipase L2, putative
MTNRVTNGQPERLLTGVLILHGFTANLESVRALFGPLGRFDLKMATPLLR
GHGAASPDELRGVTWREWLDDAENAFETLTGTGGKAVVIGHSMGALLALQ
LAARRPELVDSVILATPPVRLTSPLGPGRPLHFLAPLVSHVVDRWDMEAR
FADPGSAIIPKQYDWAPTKTILSMFELLEETMRITGRVRVPALILQARHE
SVVLPESAEILTRAIATPPEAKSIVWFDKTDHQIFCDCERKAAVDAVVSF
VSKRFPAATNQSVKA
>CT1638 conserved hypothetical protein
MNKEIIEVFDNTYPDRDYTIEIINPEFTSVCPKTGLPDFGTITVNYVPDK
SCIELKSLKYYFLEFRNAGIFYENITNRILDDLVEACQPRRMTVKTEWNA
RGGITETVTVSYSKSKE
>CT1753 conserved hypothetical protein
MTTMKNIRKACSSLLMLVVLFSFCGLQAEPAQSGPGEVKVFGLVEHPLTL
TVESLRRMKPVEKGATAIVCDSGQTKQTMRSFKGVLLRDILDSTKVVMPN
PRERGEYYALVRSLDGYNVIFTMNELRYGVAGDGAWLVFEENGKPVETGG
PFVIFCDNDRANGPRHVKMVESIEVSKVNAAP
>CT2085 oxidoreductase, short-chain dehydrogenase/reductase family
MNLADSTAVVTGSSSGIGLAICRALLDAGASVFGLSRRETPIAHERFRWL
KTDVTVEAEIDQAFEAVFAESGRIDLLVNNAGIGFFRDIESIDPVEWRRL
IDTNLTAMFLCTRKVVPSMKAAGRGMIVNIGSVAGKRGIRGGTAYCASKF
AVNGFSESLMEELRGFGIRVSCINPGSVMTEFFDHAGIEPKKHMQSDDLA
QLIVSLVALPDGMLPDEMTVRPL
>CT1899 acetyltransferase, CysE/LacA/LpxA/NodL family
MHPEIHDSVFLAEGSYVIGDVKIGAHSSIWFNAVVRGDVCPITIGEKTSV
QDNATLHVTHDTGPLKIGSNVTIGHAATLHACTVEDNVLIGMSATLLDHC
VVEPWSIVAAGSLVKQGFRVPSGMLVAGVPAKVIRPITEEERANIAESPE
NYVRYVAHYREEGYEGR
>CT1277 conserved hypothetical protein
MPRPQKCRAIAQDPEYRVFGPFCVGKREDEALVMSFDEFEAIRLADVEGL
YQEEAARQMQISRQTFGNILASARKKLGEMLVLGKMLNVKGGNIMISQEE
RIFGCAACGHQWSLPYGIARPVECPSCSSQNIHRMSPGGGFGGGRRGGGK
CRGFRSGLDRGPGHGEGRCQGEGHGNGNGNGNGQGRMRRNQQEGGEV
>CT0459 hypothetical protein
MERKLQEILDSAIPLTQAMGIVVERYTGRELTIIAPLANNFNHLGTAFGG
SLYIACVLSAWGLLYLRLREAGIKGSIVIRKGNAEYLRPVTGDIVATGTL
PTEEEFAALIESFDRKGKAKMTICAVIEVEGKVAVKFEGEFAVVR
>CT0813 membrane protein, putative
MPDVMTVLINILSASWSVLLDAAPWVLFGFLVAGLVKAFVPEKLVAAHLG
RGFSSIVKASAIGVPIPLCSCGVIPAAAGLKKQGAGKGAVASFLVSTPET
GIDSIAITYALLDPLMTIFRPVAAFVTAIATGVAVSFTGTDEPAAAPASG
GGSSGSSCSCGCGHKKVEKPGVAQKIRSGFSFAFGELLGDVGVWLLGGVL
LAGLISVFVSGQFVERYLSNDVVAMVMMLAISVPMYVCATSSTPIVAALA
LKGISPGAALVFLLAGPATNAASLPVISKLLGKKGTVAYLVVIVLMSLLF
GILVNYLYAWLGLDTKNWVSRGAHEEGGVVAIVSAIVLVVLIARARFEAW
RSVREH
>CT2075 TPR domain protein
MPFVSAAGPAEELIDQGIRRGLEGNYQDAIDHFSRAIRLTPRNADAFYNR
GLARVSIGDLTGAIADYSMSISLDPRSSGAYNNRGFALAALGRYAEALAD
MSRAIALRPDMAQLYNNRGTIRMSIKAYALAIADFTRAIALDPLLAGAYN
NRGLARNLSGQLQGAVADYREAVRIDPRYKVAWYNLGNAHISLGDAKEAV
EDYSKVLVLDPGMLVARNNRAFARLSLGDYKGALEDLNLVISKSPQDAAG
WYNRGVVRKLAGDRQGAIEDLRRAAAFGDSLAVEALREITSRDSMPP
>CT1208 conserved hypothetical protein
MICDDARPNVVFEAIESSLLKGNPLGDPATRHVPVYLPPSYNGTERFPVI
YLLAGFASTGISFLNYGFGRQTLPEMIDSMIRRGEMPKTIVVMPDCMTRY
GGSQYVDSTATGPYETYLTSELMPHIDRKFRTLAHARHRAVVGKSSGGFG
ALRLGMRHPELFAAVGCHSGDMDFDLCYRPNFPVAARILEKYDGSVAAFF
TRWESLSKKPRGEFALLELMAMAACYSPDPSKPAPGNMRLPFEPRTCQLV
PEIWEQWKSFDPVTMLEETKNQDALGSLRLLFLDCGSQDEYNLQFGHRRF
SARATEIGIAHRYEEFPDTHTDTSYRYQVSLPLLARAISE
>CT2287 conserved hypothetical protein
MIMSKAKPPRELGGIIADVFCKIGMTEAYDEYKTLHAWKNVVGETIAKVT
SVEKMKDGNLYVKVKSPSWRMELNFRKRDITKRLNKAVGYEMIRTIIFK
>CT0281 hypothetical protein
MWPPCCLNIYDLPTMTTIINYLTAHPVVLGIAVIFSLLIVLSFARRVLRF
LIVLAALGILYVAWVSWHGGNPAEKAEKASKSVKQAVHKGEGAMKAVDWL
FKSEDHPRKEEDN
>CT1091 membrane protein, putative
MLMNPIIEISIPQLLLALLFIVVAQATSFVQKLGLNRDISIGTVRTVSQL
FLMGYALTFIFRAENLWLTLGIYVVMVFSAVFIVRGRVKEKQIAYEVPTF
LTMLSSYFLTALFVSWLVIGVHPWWDPRYFIPTAGMVIGNSMSALAISIE
RLFSQMRQQRELVEMKLCLGANYKEASLDIFRGAVKAGMIPSINAMMGVG
LVFIPGMMSGQILAGADPLIAIRYQIVVMFMLVGSTAMSTIIVTLIIRRR
CFGKSEELVVTAE
>CT1383 conserved hypothetical protein
MPLQIFNTTKRTIDETLLAEVIRLVIGEEGGAVGSIEAIYCGNKMIRRIN
RDFLGHDYVTDTITFGYNEGGEVDGEFYISLDVIESNARRFGVSFEDELL
RVTIHSALHLMGYDDETSELRAAMSLREDHYLYRLRH
>CT0398 conserved hypothetical protein
MNVNNFRLKLGGKEYIPIVIGGMGVNISTSELALSAERLGGIGHISDAEV
MFVCDQLFGTSYVADKRQMYASNVNNRDKSSVHFDLEQLAEAQKRFVSHT
ISQKTGNGAIFMNCMEKLTMNNSAATLKVRMEAAMEAGIDGITLAAGLHL
RSLDLISEHERFRDVKLGIVISSVRALSIFLKRAMRLQRPPDYIVVEGPL
AGGHLGFGPDDWQSQSLQSIVAETLAFLKKENLDIPIIPAGGIFTGTDAV
EYLQAGAAAVQVATRFTIARESGLPDEVKQHYINASPEDVEVNLSSPTGY
PMRMLKQSPTLYYSRKPNCEGLGYLLDNNGQCSYIDDYYEALESRNPAEG
RFVVKNHTCLCTGMARYDCWTCGHTVSRLKETVNRLPDGSWQLPSAEDIF
MDYLLSENHSIRKPEIKKQG
>CT0415 carbon-nitrogen hydrolase family protein
MERLNIALVHLAVRHGEPEHNRRELIRLNRQAAEAGARIIVNTELAVSGY
SFRSPKEVAAVAEPVDGPSVQAMAEIAEAAGCYIVLGYPEIDPCTGICYN
SAAVLGQDGKLVLNYRKVTAEARWACPGSHMQESLFETPWGRAAVLICSD
SYYGLIPRAAALRGADLLLVPANWPGGSLDPRELWRARACENGCALVACN
RTGKDRTMECYDAVSCAYGADGSVIAERSSPDSEVFHVELPLSRGKLRSS
SRERFASRTPERYRSIYLDMRYANDMTSWHSLPEPAPVQVHCLHELYFEP
GDVSVLDSLLHDREKAAGLVVVLPMLRVANREEAVGFLLDVARAHGAAFC
AGLVDTDGVAELICCRPDGSVHRRCPERDDFVLIDLDHLRLTLLSKEECR
HPEAVTALAKEGCDLAVVPETGFDEGDRAVLGSRSIEQLAIAACGRDVSF
ICLPPVDHYRWEEAVGEGVDGASMLIEVEKLRRKRFYDRIDVELLLARNG
RSPDACRADETGKREEETP
>CT2121 methyltransferase, putative
MPMSFYSKFSEYYERVFPFREDVWQFLKRYAGSPGNALLDVGCGPGHYCG
RFASDGFNALGIDLDEAMIDEAQRRYPEAAFRCLDMRRLEKTEGRFDCVW
SIGNVLAHLPTEALAPFISKIHNLLKPGGYWIMQVMNWDTLAELTNYDFP
VRTIEANGSTATFHRHYSSITPESLQFTFSLKDEDSVLFEESVTLYPVAI
ERYLKLHEDAGFLYEGMYSDFSGSALRSVPGTGLALVFGRG
>CT1726 membrane protein, putative
MQQHDRQSSAAPAWIAGVAAAVALWVPLYLNLEAGADLLVAALGLSRATP
LGEALHFFIYEAPKVLMLLTAVVFVMGVVHTFISPERTRAMLSGRRVGVG
NAMAATLGIVTPFCSCSAVPLFIGFLQAGVPLGVTFSFLISAPMVNEVAL
ALLFGMFGWKVALLYLSMGLLVAVISGMVIGKLGLERFLEEWVRQLQNSA
VSSEFSAEAVSWPERIREGLRHVREIVGKVWLFIVLGVGLGAGIHGYVPQ
NFMASLMGNQVWWSVPAAVLIGVPMYSNAAGIIPVVQALLGKGAALGTVL
AFMMSVIALSAPEMLILRKVLRPQLIAVFAGVVATGIMLVGFVFNAIF
>CT0717 epoxide hydrolase, putative
MDNVPEPSSEKFRAYRQKLLDQLETSSQGERHRAQYELELMRNSHFVKVG
GLLHHYHDSGPENPRGTVLLIHGWDCWWMWWHRIIRELNAAGYRTVAYDM
KGHGWSENDPENRYQIADFVRDLDELIRAIGLKDLHIAAFSFGPFVALDY
VNTYPNSVRSMVFFNFGYLPNSEFISKVAPATIIFIFNIMMRKLTWWLPA
YIFARLVLSRNSVMMHDIKVGFESLGFCASEAIEQTAQQITAMETTQMLP
DMVRAVRVPILFAAGEGDVIMTCENARKLQEMTPSGSYLCVPDCGHLITL
ELPQTAAEIVLQHISSNS
>CT1681 ABC transporter, permease protein
MESLAVVFILQVLRISVPYLFASVGAIFSERGGVINLALEGLILAGAFGA
MLGQYLTGSAWAGIGFALALGLVVSLLHAFVTITLRADQIVSGIAINILV
MGATRFGLGLLFGSAMNSARIAGMEVSVPLFDPLLVIAVFTVGVAQFVLF
RTPYGLRLRAAGESAKAVETAGLDVRRLRYSGVLISGALAALGGVFLAFQ
QHSFTDNMSAGRGYIALAAMIIGRWSPAGAALASLLFAAAEAMSMWLPSG
WLPSQLVQSLPYLITLLVLAGFVGKSAPPKELGVPYEPE
>CT1158 hypothetical protein
MSKKMICYCSSVTEETIVSAIRRGATTLKKIQDTTGACTVNRCKELNPSG
RCCSGDILDLIERETGSRPSSPCCCE
>CT0687 Nudix family protein, MutT subfamily
MVIGDVVCAIIERDGRFLIARRPEGKHLARKWEFPGGKVEAGESEAAALD
RELQEELGVRVEIIERLTPVEHSYPDRSLRLIAFRCRIVDGVPDAGEHEE
LRWIEIDEAGAYDFPEADLPILAEYRLKIAAVPPGAPDQLHKAD
>CT1808 TPR domain protein
MSLLDFFDDNLNPSEGFFPDRPEGASDPDSIHDPEELLDLIIQLNEEGLH
ETSLVAARRLEELAPYNAETWFHLGNSLTLNGLFDEALEAFQRAVLLSPA
DNEMALNLALAYFNTGRLDEALEEIERVVSDSTIARDICFYRGLILQRLE
RFEEAEKNFEQTLQLDPEFGEAWYELAYSQDILGKLDNSLVAYEKAIDLD
PYNINAWYNKGLVLSKLKRYPEALEAYDMALVISEDFSSAWYNRANVLAI
TGRIEDAAESYTKTLEIEPDDINALYNLGIAREELEQYSEAIACYKRCIE
LNPEFADAWFALACCFEALENYEASLDAIGHALVEMPECIEYLLLKAEIE
YNLGRLDQSLKTYEKIIPLDPDSPQIWLDYAMVLREAGAMDASIRALEES
ISLQPLSAEAHFEIAATYFAMGDNQSTLKALSKAFKIDPDKKQLFQSVFP
ELYQQDAVRRLLEIS
>CT1189 conserved hypothetical protein
MVQIAGTPVPLSAKSRGPNALSSLLAALFLLLSQPASGAEAKQCIDNAEL
DAADKAFNSLHYAKADSLYQSMLQTGDQSSTLYWKLARLNISIAEAIDPS
ERKKRIPFYNKAVEYARKSVQLDENNASAHTWLAAALALKADKIGAKEKL
NRAAEIKRELDKALALNPNDDVAWSMLGSYNFEASKIGWFSRFMGSTFVG
KMPKGSREEAEKDFKKAISLNPRVIRHYHELALLYLEEDRKQEALNTLRI
AETRPVLMKSDVRRLKEIKKLIAKLSKEIEEK
>CT1300 conserved hypothetical protein
MKVIGINGSPRRAGNTSIMLKTIFEVLEDEGIETELIQVGGTNIKGCRAC
YACIKNKNSECSTKGDGFNEIFAKMVEADGMILGSPTYFADITPELKALI
DRAGFVSRTNGQLFRHKVGASVVSLRRGGGIHAYDSINHLFQICQMFMVG
STYWNLGFGGRDGGEVVNDTEGMENMRDLGHSMAFLLKKLHN
>CT1510 carbon-nitrogen hydrolase family protein
MNTDQVRIALVQMSCVENPQENLRKAQERIRQAAAGGANIVCLQELFTTL
YFCQTEEYEPFGYAEPIPGPSTAALQELAAELGVVIVASLFEIRAKGVHH
NTAAVIDADGSYLGKYRKMHIPDDPGFYEKFYFVPGDLGYKVFDTKFGTI
GVLICWDQWYPEAARLTALRGADILFYPTAIGWATSETSQEVRASQRQAW
KTSHLGHAVANGVFVAAANRAGTEGELEFWGNSFVSDPFGQVIAEAAHNS
EEILYADCDLSKIGFYRSHWPFMRDRRIDSYGDITRRWIDE
>CT1901 ABC transporter, ATP-binding protein
MNVMGSILSCTNLKKVYNKREVVKSSTLEVKQGEIVGLLGPNGAGKTTTF
YMIVGLVKPDAGQVLLDQEEITKLPMYKRARKGIGYLPQEASVFRKLTVE
QNILGVLEFTTLSKAERQEKTEQMLEDLNIAHIRNSMGYALSGGERRRTE
IARALALDPRFILLDEPFAGVDPIAVEDIQQIVEDLVKRNIGVLITDHNV
HETLSITDHAYLLFDGSIFMHGTPEEIAENTEARKLYLGEKFSLNRY
>CT0232 membrane protein, putative
MNFPDDPKPHRRPRFFDTVLVLLFVLAAYPIVGALLTILVAGGNPFGNGF
EAATHSFVVRLLVAQAFGQMVVLALPVFWLARRFSGDGLFGNTTLEWLGI
RKHGGSRPALIAGAGMLLLQPALYSIVELQTLLLPYLGTFGKSLLQEQAT
LDIFLKKLAGGASIGGSVLSILVLVLTPAICEELFFRGYIQKSFVLSLSP
QRAVLFTGIVFALFHMEWFNFVPLTLLGWYIGYIYWKSDNLLVPAVAHGT
NNLAALVLLKSGIDSGSATDPSSGLLVSWPWWGLVVVSLSLFFLLIRYFP
VRPALQDADNPMPPGHRWKSQC
>CT0876 sulfide-quinone reductase, putative
MKTKPHVLILGGNFAGLQVARHIRDHVKPEDASVTVVDKRSYLLFIPNIL
MEILENKNPDSSMQLPLAPVLDKDETRFIQAEVLDIDVESKKVTIQPTER
PGTTTDVLTYDYLVIALGNRLAFDKIDGFAEHGHTVSDGFYGNKLRHYLH
EGGYKGGPIAIGSARFHQGTKGKLDFVPMAKAACEGPPVEIALSLASWMK
HHEMGGPEKVTIFTPADLIAEDAGKNIVKQLLEIAGGMGFGYKNKLEDIK
QIGKDGIEFANGESIEAELKIILPDWVPHEFLKGLPICDEKGFVITNKQL
KNPDYPEVYAAGDAAACTVPKLGSIGHHQSYIVARQLARDLGALSDEEAD
SELYSPEVICYGDMGDGKAFYIHSNVWYGGDIEILKMGKLYYDLKVAFKT
SYFAMGGKVPYWQWKMGSWMGDKIL
>CT0440 hypothetical protein
MKSDAHWLNWERLTIYPRIFLAGFFVFGLGCVFIAKKKFGLNGIPFGADF
ITFWGASHLALTGHAQDAYNNSLLLKAEQLVIPVFRFTYIWAYPPSFYFV
VLPLALLPYVAAYWTFMLSTLWGYLLVFRRIVRGNIAMWCLAAFSGLWVN
LICGQNGFLTAALAGAALLAVERRPVVAGLFIGLLAIKPHLAMLFPVALL
AIGAWRTLITAAVTAITFMAIGTATLGTAVLKGFFANLGYARLFLENGGL
PWKKMPSMFVFLRLLGTPVTWAYAIHFFVAAGAVIAVWRVWRHCRNWELR
NAALMTATFLVSPYVYDYDLAWLAFPIAWLAVDGLRNGWLRGEREVLVAA
WLFPVLMIPIAEAVKVQIGPIVLCSLLWMTYRRATKTSMTGATAIDDHAA
QFETVP
>CT1692.1 major facilitator family transporter
MAALGYFVDIYDLVLFSIVRVPSLKAFGLQGQELIDYGVFLLNMQMIGML
LGGILWGWLGDVKGRLKIMFGSILIYSLANIANGFAGSLETYAALRFIAG
VGLAGELGAGITLVSEILHTKVRGYGTMLVASIGVTGAILANAVATHYDW
RTAFFIGGALGLLLLVARFKVSESGMFQMMENKTAVSKGNLLALFTSRDR
FFRYLNSILIGVPIWFVVGVLITFSPEFATALGVKGAVSAGNAVMFCYLG
LVFGDLSSGLLSQVLKSRKKVVLMFLLLNIVSIAIYFMQRGATPTAFYWV
SFVLGFSGGYWAVFVTVAAEQFGTNLRATVATTVPNLVRGMVVPITMLFQ
FTRGHFGLEGGAIIVGAICVVAAFASLAALEETFHKDLDYFEEFM
>CT0532 exsB protein
MRAVLLVSGGMDSLVATALAHREGLELAAMHVNYGQRTWQKELECFRQIC
VHYGIVDRLEVDAMFLSAIGGSSLTDATIPVGPADLAGTDIPTSYVPFRN
ANFLSMAVSWSEVIGANRIYIGAVEEDSSGYPDCRKVFYDAFNQVIEHGT
RPETRIEIVTPLIAMSKAEIVRRGMELDAPFHFSWSCYKSEGKACGVCDS
CARRLRAFASVGMDDPVEYEVRPNYLQ
>CT0740 membrane protein, putative
MRLKKVNRHVMKIFDRYILKEHLGTFFFAFVTIMFVFILTFLTQFLERLI
GKGLDFRIILEVVALQSAWMVGLAVPMAVLVSTVMAFSSLTNSSELTVMR
AGGISIYRLVAPVLLAALALSLMMERFNNVLMPEANYKANALFADITRMK
PGLGIDKNAFSDVIQGYSIMVRDIDNETGELRDIVLYDRGRPDVRTVIMA
ARGRIQFSQDYSHLVLTLEDGQIHELSLPAMDRYRKMVFARNRYVFDATG
YGFERTDDGKRRRGSKELSAAELLSMAREFRMKDRMAESSIDKGIAGLRA
EIESIRKRSSTSPASSPALLPPAVTGRAIELVDTMIDNATERIGQMRENR
ESFYNYMIEYHKKYSLAFACVVFAMIGAPLGVMARHGGFGAGAALSLFFF
VLYWVLLIGGEKIAERGLLLPAISVWLPNVVLAVTGLFMIYRLSSSASGS
GR
>CT0014 hydrolase, haloacid dehalogenase-like family
MMIQNDTKPKAFIFDMDGVLVDNMRMHAQSWVDLFADYGLSGLDPERYLV
ETAGMKGLDVLRYFLDPSISPEKADRLTELKDILYRVMNRNAIVAMPGLE
TFLDRAANAGIRLGIGTGAGPKNIDYVLGLTGLTSRFEVVVGAHMVRHGK
PHPETFLQVAERLGADPASCIVFEDALPGAEAAAAAGMSCVAVTTTNRPE
AFAAFDNVITTIDHFDGLMPEALLELNRAVKTMS
>CT1856 serine esterase
MYRIHNTNPFPMFTRNNDAFTWLDYNRNPDKKSPLLVMLHGYGSNERDLI
MLAPTLDPRLRVVSVRAPLVLAPEMYGWFPIEFTPGGITVDREAARQVAE
KLVTFLEHLIEKLQPTGEKTFLMGFSQGSVMSYLTAFRNPELLHGVVALS
GQLPDARPEAGALPEALGDVPFLVQHGLFDDVLPIDRGRQANAWLRDRIA
DLTYREYPMAHQINQASLDFLASWLSERIDRVVS
>CT1673 conserved hypothetical protein
MKLPRRKFEIIQEHAVRDLPYECCGLLVGRKVVDHRGNIDNIVVEVAPCR
NVLYYGKENGFEIAYNEFIDVEREAHSLGLVVVGSYHSHINSTAVPSRND
IDFASAGHSMLIISLYGGVPREVTSWLRRDSGGFHQEQIKVIA
>CT1312 lipase, putative
MPIRSRYITLGGHRHRYIDTGGNAPVMLLLHGISSSADYYGPSMSLLARS
FRVLGLDLLGFGESDKPRTIPYTLQLYADLIHEFLWETDAFAHGEVYGTG
HSMGGKYLLATALLYPGTFKKMVLSNTDGFIVLPSFARAISLPGVRHVLK
PLVTGERIAAKMLDMAIHNRQAIDDETYRKVLQIARDHDAFETVMSLNRN
MLKLDLKRTGLRARLRELKQPVLIIWGEHDRYISPKIAHIVKRELPHAKL
LIFKDCGHSPMLEYPEQFSTAITEFIHQEPPLP
>CT1369 conserved hypothetical protein
MIKRTEMRTVISICLFFGMILASSSLFARTDAEPSTRALADTVGFAHRAW
QMDSVMARIRALNHDDLVRTQQPAGTAWRAAICPHDDYTYAGWLYPAVLQ
NIKAKTVIIFGVAHTAWRYHLENQLIFDSFSSWRGPYGNVKVSPLRDEIL
ERLPRGMAIVHDPMQSEEYSVEALIPFLQYQNRNVEIISILVPFMDFERM
QIVSQHFAKALFAVMKKNNLRWGKDVALLISSDAVHYGDEDWDGRNFAYY
GTGGKANALAEAHDREIISHSFESELTEKNIARFYASTTDPNDFKKSRWS
WCGRFAIPVGLLTALDLQKLEKSAPLSGVPIAYATSISQPHLQVDDLGMG
ETAIATQRHWVGYPAIGFK
>CT0760 HIT family protein
MQRMYSPWRDVYMQTFKEEKPFTPEEGKSVFADIPPEQDEERYVLCRGEF
CFAILNLYPYNCGHLMVIPYLQTPDFGDLDAQTMVEIFNMSNLCMKALKM
TIKPQGFNFGANLGRVAGGSIDEHIHFHIVPRWEGDTNFMPVLGETKVLS
NDLRQTYIQLREAIKKLQSEPKKP
>CT1237 hypothetical protein
MSVILSVDPLFQSTSIREPSMTPKQNNHPLALALFSLLSALLVGHAIYHY
VILPEEIATHFGFSGKPDAWGPKTVFFLWYFIITGLCIVMFVVVNRLLRP
GHLSWLNIPNKEYWLAPERIHDTLHYVRSGMLLFGSGTLLFVLDFINQSF
QVSLGNASRLDHPLTTLAMYLLFCVLWVSALYRRFGRKM
>CT0810 hypothetical protein
MQPYHFSNGSIRFICLATALMQPFPPSAIVIDKPELGLHPEAIRILGELI
RDAAKRTQIIIATQSPLLLDQFSIEDNRRNMPARPKS
>CT0854 iron-sulfur cluster-binding protein, gltD family
MNAESNPILDFATEYVYPAFSELTGTDKIVAFGDHSHKCPIYVPQTPPCT
AECPAGEDIRAINRFLNGTDPSDDPLKSAWETATDTNPFPAVMGRICPHP
CQSKCNRGVHDESVAINAVEQVLGNYGIEHNLKLKGPGADTGKRVAIIGG
GPAGLSAAYQLRRKGHAVTIYDANEKLGGMVLYGIMGYRVDRKVLEAEIG
RIIELGVETKMGVTIGKDITLEQLEAEYDAVFIAVGAQKGRALPVPGFEG
TPGATNAIDFLKSYEVLGDDIPVGKHVVVIGDGNVSMDVARLALRLGSQA
TIISGVPREEMACFENEFDDAKNEGTTMHFLTGTVEVLGGASGVTGLRCT
KMVKKEKGEEGWNSPIPFLRYKSNGESFEIEADMVVAAIGQATDLSGLGS
AASGPWLKVDRNFRIPGREKLFGGGDALKVDLITTAVGHGRKAAYAIDAF
LKGEPMPEEPYREITKPHKQDLLYFLHTPQAKRTSIKPEVVVGNHDELLE
ALTPEQAVTESKRCMSCGFCFDCKQCVSFCPQEAITRFRDNPAGEKVYTN
YAKCVGCHLCSLVCPCGYIQMGMGDGL
>CT2269 conserved hypothetical protein
MHIVDLSYPVTADMPLWPGTPAPNFSDLHTVGRDGFGERWLQLSSHTGTH
LDAPAHLFEGAVSLDRLPVDHFIGKGALLDLRDAQPEPLSLDQLLLQRAT
IESAEFLLLHTGWSRFWGTAAYDRGYPVFAEEAAAWLAGLGLKGVGIDAP
SFDDPDSEELPIHRRLLGAGFVLIENLTALDRLGGHEFFLSVLPLPIAGA
EACPVRAVALIASFAANQPI
>CT1696 hydrolase, haloacid dehalogenase-like family
MIEAILWDNDGLLVDSESLFFEMTRTFFAEAGLQVEAEYWGVEYLGNAKH
SYQIAAELGLAPELIPSLLDRRNEAFVQRLRHSVPLMPKVRETIEALAGT
VRLAIVTGSPRDKVLLMHGNNGLLDHFEVIVTDDEISNPKPHPEPYLKAM
EMLGVKPERCLAVEDSQRGLDSAVAAGLRCIAVPNALTKVQRFDRAHAVE
ADVSGVLKHVNATKRLAR
>CT0111 conserved hypothetical protein
MIIKQLSVFLENRAGRLTELTGILADNDINISAFSIADTTDFGILRVITG
KPELAEKVLKEQGFAVKITDVIGMIMPNKPGALHHALQILTDNGISIEYM
YAFTNGEGRATAVIRTDTPQKAIEVLQLHKMELLKTGDVYQL
>CT0786 conserved hypothetical protein
MFYFDPAYFLFALPPLLLGIWAQFKVKSAFKKYSQVATQNGVTGAQAALR
ILQRGGLENVNVEMTSGMLSDHYDPRQKVLRLSEEVYSLPSIASVGVAAH
EAGHALQDKVNYSPLAIRSAMVPVVSIGSNLGPILFMIGLFMQGVLGSSL
AWAGIILFAGTALFALVTLPVEFDASRRAKELLVSQGIVSQREMAGVNAV
LDAAALTYVAAAAQAIMQLLYYVMVMNRRKD
>CT1837 TPR domain protein
MQQTVWQNPEDARGYLNLGKEYARQQRFDDAIQAYRRAIKLEPGLDEAYS
ALGAAYFDKKEFNAALPWMQKRVDIAPDDSLRQFDLGNVYFQLNRYNDAV
ASYQKAIDNSYSFQEAWYSMAVCYIKMGKIDEARKIHKWLQTKNNYLAVS
LERHLQNDVPDKAGK
>CT0734 conserved hypothetical protein
MGIVINLFLIIAASIVFFVVGFYIGRFFLERIGTTKVLEAEERAVQVIQE
AQKEANDYKELKVNEVNQEWKKRKREFDSEVTIKNNKFAQLQKQIRQKEV
TLANQMRDIKETEKKLQEQREELKHQTQNVQNRSAELEKTILEQNQRLES
ISNLTAEEARQMLIDNMIAKAREEAAETVHQIHEEATQKADRIAEKIMLT
AIQRISFEQATESALSVVHIQSDELKGRIIGREGRNIKAFENATGVDIIV
DDTPEVVILSCFDPLRREMAKLTLQKLLVDGIIHPVAIEKAYQDAKKEIE
DVIMSSGEEAISSLQIPDMPAEIVNLIGKMRFHTVYGQNLLQHSREVAML
AGLMAAELKLDAKQAKRAGLLHDIGLVLPETEMPHALAGMEFLKKFNMSP
VVLNAIGAHHGEVEKASPIADLVDAANIVSLSRPGARGAVTAEGNVKRLE
SLEEIARTFPGVIKTYALQAGREIRVIVEGDNVSDSQADVLAHDIASKIE
SEAQYPGQIKVTILREKRSVAFAK
>CT1793 conserved hypothetical protein
MKKLILCVLTAFALLAPATSYCQPPSQKEFSELKKAAEQGDAQAQCMLGL
MYELGLGVRQDKRTAKEWYGKACDNGNQKGCDNYRRLNELGY
>CT1236 phosphatase, putative
MNYRLLVFDFDGTLADSEASIMGAMQLVAKEFGLSEVDCVKARPTIGLSL
LRTIEIGLGLEAGDAAAAVELYRRYYKEIAFDSTCLFPGVKETQEQLRQN
SLLAIASSKSRQGLLSRMRQFGIVDHFSFIAGAQDISGL
>CT2209 glycosyl transferase
MNGGKTAVIVLGWNGAADTLACLASLAKVRQPAFTVILADNGSTDGTVGL
VRQAFPEVEILELGRNLGFAAGNNAAFRSLRGRGFDRVVFLNNDTVVDPG
FLQPLLDELQKPWVGIAAPKILYMDDPGRIWYAGGVLESATGLIEHTGIR
QPDGPRFDTPEPVWYATGCCLAMRCRVFEEVGGFDERFRMYGEDVDLSMK
VRERGLIVMYQPASRIWHRVSASSGGEMNLGKQLRKSGAAMMLFAKHGMI
GGLVLYPLLLPFRALLGLLRFQFFRWTSSQEREEA
>CT0792 AslB/AtsB family protein
MTSREEFLEKLAQAGTLNLIFQLTDQCALSCRYCFAKGSHPSGGLRIADD
LLDAAIRSAFDTRHHQVSFEWTGGEPFFAGIDFYRKVDRLQKKYATKPYA
NTIQTSGYVHDRELIAWLAGHGFRISSTIDGPPELHDFQRPVNGGGPSLG
AVLATRETIIEHQGHCGCICTVTRNSLGKEGAILDYYRSLGIEAFHSNPY
YYFSKNLVGDESLALDADGYAAYFIAQFNAWFEGGRKLPMPGTLNYILRS
LTAGAGLKQSVCTFGGRCLTNFLAITPDGDAWLCPKFAGFDEMRLGNVGK
MAITDILSDANPAMARLIDERLDAIHTCEADECRFQYLCNAGCPYYSFIA
SGGRNIAVKDSLCTGKQLLFEYLESVVELIDPARLPEPSPLEHA
>CT0824 conserved hypothetical protein
MILLSPQEQQSRAARIRLVLSDNDGVFTDNGVYYSERGEEFKRYSIRDGM
GVERLREHGVETGIMTGEVSPSIVRRAQKLHIERLYLGVKDKQSRLADVL
SDTGLSKAEIAYIGDDVNDIGIMNAIAPFGLVACPGDAMPLVEPCVHYRC
TAQGGRGAFREYAEWLIALRAS
>CT0958 conserved hypothetical protein
MKRFVFSGFAVFMLAAPVTGHCRDYQMNDSSPEASVRRDADTDKDAAQKE
YLIAMKYYYGRDVDRDYAEALKWLRLSAAKGNDAAEYAIGFMYQKGLGVP
QDYAEAMKWYRLAAAKDNDNAQNQIGYLYHHGWGVETDYAEAMKWYRISA
AKGNFAAEDNIGVLYEHGQGVEQDYAEAMRWYRISAAKGNGEAELNIGNF
YQHGLGVELDLNKAVKWYRSAAAKGNEEAAQKLSAMKSGSVLPDVHHE
>CT1197.1 ABC transporter, ATP-binding protein
MATAVKLSGISKTFGSLKANDNVSLSIEAGSIHALVGENGAGKSTLSNII
YGLLHPDSGTIEIDGKAVKFSSVRQAIEAGIGMVHQHFMLVPTLSVTENI
ILGKEESRLALPTRRIGKEIRQLSQQHGLEVDPDALVSTLSVGEQQRVEI
LKLLYRRAKLLILDEPTAVLSPPETARLFATLRSLAAEGRTVLLITHKLD
EVLTVSDSVSVMRKGSLVGTVPTTSTSKEDLARMMVGRDVLLRTANAPQT
PGKTVLSIDKLTYRSPLGIDKLTGLTLHVRAGEIYGIAGVEGNGQSELLS
LLWGTFDRSGKTGGSITIDAQETLGKNPSEIAALGVSMIPEDRHKSAIIA
EYGIEENLILGRHREKAFHRGIGFDRDAVHKNATAMIEQYDIRCAAGTNP
PIASLSGGNQQKIVVAREMERPQLKLLVLAQPTRGVDIGAIEQIHKRIID
ARKSGLAILLISSELEEIVALSTRIGCLYKGAIRHEFSETEVWRGRDHES
GFEKEIGMHIT
>CT0417 AslB/AtsB family protein
MLVVTTACNLDCAYCYEGGGNGQGEMMDLDTALRALDLVAASNQPFHVQL
TGGEPLLAGELVFRILEYIRNNNLPATTAIQTNGVLLDHQTARRLHSFGT
AVGVSVDGLPAIQEQLRGQGAATWKALAMLDSERVPFSVTTVLSSRNTRD
LSTLGMALHSMPAASAIGLDLLVQKGSAAAKIGVQPPDAVTIRSGVTRLL
ATLDVLNAQRSRPLILRERQTVLKALGRDEARPYCQACTGSSLAVTPRGE
LYPCTQTLGDARFHLGTLDHPRLSGTALPDGRLPKREECAGCGLQGRCPG
DCPSRLHYNGADGAGLVCALYRTICDYCLYKGEIPS
>CT2279 conserved hypothetical protein
MCFSRPGYLYLLWLLVPLAVLVFYGVRRKLSAWAKIDSTGGGSGILPAVS
VRKLGLRRIMLLLSAALMIVSIAGPQLCRGQKAVRQKGIDIIFMLDISNS
MLARDTAPDRLTHAKTELLQISRRLGDGRKALLLFAGTPVVQCPLTDDEE
DFEILLDMAAPELITTQGTDYRRAFDAALKLTNSGGELSSNETVLVLASD
GEDHGNDLGDIATAMKTRGVHLHVIGVGGVQPVPIPMQGGPKRDQQGKIV
LTSFKPEPLASLIKTAKGKFYYSRPDAPVHDAVADDIAAEAAEARWIMAP
EQRVPVHGETILAALLLFVSGSMMTDVRRPPKQSA
>CT1135 CRISPR-associated helicase Cas3
MGKIDHSAAGAILASNKSKDIGRILGYLIAGHHTGLPDWDKDAGTKGDPL
SERLSNSEHLQHALKGNPPENILDTPLPSSLPCQTQNGGAELVHLWIRML
YSCLVDADFLDTERFMNPETSELRPNGVDLAMLKERFDQHMSLKESGASD
TPVNRARKEILRECRQKGVSLEPGLFSLTVPTGGGKTLASMAFALEHALK
HDKKRIIVVIPYTSIIEQTAAVYREVFGDDAVLEHHSNLDPSRETPASRL
ATENWDAPIIVTTNVQFFESLFAARSSACRKLHNIVNSVVVLDEAQMLPT
DFLQPIVSVIKMLSAHFRTSVVLCTATQPVLSGKVGTGKDILKGFDDGRV
RELMADPEKLFGIFQRVRVRMLGQADQRHEWAEIARRLCEPEQVLCIVNT
RKDCRELHDLMPDGTVHLSALMCPEHRSTVIAELKSKLVAGEPVRVVSTQ
LLEAGVDIDFPTVYRSFSGLDSIAQAAGRCNREGKLEHGDVVVFNPPNPS
PSGRLRKAEQAAQELFRTVPELAASLMPEAFRRYFMHYFSGLNGFDTKGI
MDLLASNDAARYFQIQFRTAARKFKLIDDTLQHGIIVRYRNGKANIDALI
DQLRFGGPNRKLMRQLQRYSVNVYDPDLKKLIENGLIEEVHGVWVQTSES
AYDPVFGLNIDANLNFYW
>CT2058 sodium:solute symporter family protein
MQPLDYAIIILFLAGNMMLGLWQGRSNKQTSDYFLGGHKLPWFAVMLSIV
ATETSVLTFVSVPGLAYRGDWSFLQLAFGYIVGRILVSFILLPTYFKHGV
TSIYEVIGMRFGHGIQKTASVIFLITRILGDGIRFLATGVVVQAVTGWPL
SLSVLVIGIVTLVYTISGGLKTVVWLDSIQFGLYLGGGVIAIAFILARLD
APLPDLLAPLLAAGKLKIIDTDPHIFTNPLSFVSAFSGGILLSLCSHGVD
YLMVQRVLGCDGLGSARKAMIASGVFVLFQFALFLLAGSLIYVFFHGAPL
VKDREFTSFIVRALPAGLKGLLLAGILSAAMSTLASSINSLAASTVTDLI
KGKASLSTSRLISVAWAAVLIGIALVFNENDKAIVMLGLEIASFTYGGLL
GLFLLSKSSRNFHSTSLIAGLLASMAVVFLLKLLGLAWTWYIAVSVTTNI
LTTAGVEALLPDRESFARE
>CT2044 CBS domain protein
MDQLITLRTLPASALMQKDYHTIKGSSTVAEALQLMKKTGESGLVVEPRN
EDDCYGIVTEKDILEKVIDPGEDLHRDPWNTPVFQIMSKPIISINPSMRI
KYALRLMKRTNVRRLTVMEGNKVIGVLNMTDVLHAVEELPVHDDHIAL
>CT0087 conserved hypothetical protein
MSSFQFFGMQPYQADPSSQINAALDSIKALVISLDGVLTSGVITLDGEGR
EMPTLFARDLAGLREALRLGMKVAIIAGRQAGAFRQMLEATGPIDLFLDG
EERLDAYEAFKSRHGLQDDECACIADDIDDLELLKKAGLPVTPINGAEYL
RNRVAYISVFEGGRGCVREIVEMVLDHQGRWKYSEKQAQG
>CT1197 chlorobiumquinone synthase BchC related protein
MQGEAQALVFLKANKLKLQSVKYVANRPRDILVRTIASTITPGLDRLLLT
NKPVSHKVLAYPVMPGSETIGQVMQVGPEVTSVKEGDFVYAFKGDCWVGI
DPYYGCHAEVIPTSEENVLALGRKPIHRDLLTGLVGYVLSAMEKVALDPS
MRVLLLGLGSVGLMVSEYLHYRGIRHVDALENFPLRGQLSHAENIGIEIV
DFTDDFNDRYDLVIETTGRILMVEKVMRLLKPKAKVLLMGSYEVLGYDYR
LIQHKEPVIVCSSVTDKQHLIEAKALLETEAFETEKFFTNVFPVSQYELA
YRIALDSKEAIKTVISWI
>CT2274 conserved hypothetical protein
MSDNVHYKSNIELTPQEYEELEDFLLHESGLKHPMNLDALDGFMTALIIG
PEPIMPSQWLPHVWSSAVVDEAPVFESDEQAKRIIGLIMQMMNALSHQFE
ESPEDYAPLPNLTTFDSDEDQRKAARLWCCGFIEGINMNQDSWKSLLKDE
KGAKTVFAISAASGLLREKLNLDEEKEYELWKLIPEAVLEIRDFWRPGHR
RKKPDEKQPKAEAPGRNDLCPCGSGKKYKKCSGQ
>CT1462 conserved hypothetical protein
MNTYRIVVVVGSIRRESLNRRLADALIRLAPADFAFHHLRIDDLPLYNQD
DEEHPTESMQRLRREIAEADGIMFCTPEYNRSIPGVLKNAIDVGSRPYGY
NAWQGKPAGIIGLSSGACGTAMAQQHLRNILAVLDVPTMAQPEAFIQFRD
DLFDADGGIGPGSRDFLQGWMDRFAAWVRLCAKPRNG
>CT1907 hypothetical protein
MENQDKYYKGLFFGAILGAAVGTIMGLLFAPRKGEDTQMIISGKVKRAID
RATELYEGSEHEAAYSNEAKHRSQEIIDSARDEARKILDEANNIIRDIRG
SQAKAQEN
>CT0104 multidrug resistance protein
MSSTATTIPGAQALEHHYETGARKWIITATVIIAAMLELIDTTIVNVALN
HISGNLGASIEDVSYVVTSYAIANVIVIPLSGFLGNLFGRRNYFVASIVL
FTGASFLCGISTNIWMLVFFRFIQGIGGGGLLPTSQAILYETFRPEERGA
ATGIFSMGLVLGPTIGPLLGGYLVDYFAWEWCFFVNIPIGIAAAWASLTF
VKEPKVKPVVEKIDWAGIGLLSVGIGSLQFVLERGEQKDWFETDYIVWFT
IIAVVSLIAFVWLELHTDHPAVDLSVLARSKNLAIGAVLTFIVGFGLYGS
LFIFPVFVQRLLGFTALLTGLVLFPSAMLTGIISMPLGIALQKGASPKLL
MTVGMVAFFWFCWELGNQTLMSGAENFFLVLLIRGFALGFIFIPVTLLAV
TGLHGKDIGQATGLNNMVRQLGGSFGIAITNTYVTQRVAAHRIDILSHLS
PYDPAAVQRLQDMKQALGQYMSSPVEAGQAAMAALEGIVVRQSYHLAYMD
AFKMIAILFAVCLPLLLFIRVDKKETVDMSSVH
>CT1806 sodium:solute symporter family protein
MSVQVXTYLIVGLTFAIYIGIAIWAKAGSTKEFYVAGAGVHPMINGMATA
ADWMSAASFISMAGLISFMGYDGSVYLMGWTGGYVLLALLLAPYLRKFGK
FTVPDFVGDRYYSNAARTVAVICAIFVSFTYVAGQMRGVGVVFSRFLEVD
INTGIIIGMAIVFFYAVLGGMKGITYTQVAQYWVLIFAYMVPAIFLSIMI
TNNAIPQLGLGGTTSDGVYLLDKLDNLSKELGFGAYTTGSKPMIDVFAIT
LALMVGTAGLPHVIVRFFTVPKVRDARISAGWALIFIALLYTTAPAIAAF
ARVNLIDTVSNKAYAELPGWFKKWEKTGLLAWMDKNGDGKIQYLGKKANG
GDPFEGKKPEFTKEIGKSGELLMSNKPTDNANELYIDKDIMVLANPEIGR
LPNWVIALVAAGGLAAALSTAAGLLLVISTSISHDLIKKQINPNISESAE
LMYARIAVAIAILVAGYFGINPPGFVAEVVAFAFGLAAASFFPIIILGIF
SKRMNKEGAIGGMITGLVFTAAYIVYFKFMNPAMNKPEFWFLGISPEGIG
TIGMLINFAVSFVVSRITPAPPEQIQELVDSLRYPKGAGEASAH
>CT1087 sulfide-quinone reductase
MKKVLILGGGIAGVAAAIAFRKRGFEVEVVSDREFLFIYPIAIWIPVGTE
RFRNVAFPLEKVARKHGFSLTLDTVTAINASRDSVTLEKAGQRSDFDFLV
IALGSDKMKHEGIEHTLSICGAPEHSIRLKEKIDALIERGHGKIAFGFGG
NPKDPSAVRGGPGFELFFNLHHKLTKLGIRDNFEMTFFAPMAQPGAKMGQ
KALDMMAVMFKAKNFRQRYGKKITRFEVDGVVFEDGSKLESDLTMFIPAG
SGHSVFRNSDLPLSEAGFVKIDDFCRVVGVDGWYAVGDSAALEGPEWKAK
QGHIAEFMADCAANNCLAEHFGHQEPMRGYQEHLNVLCVMDTGDGAGFVY
RTGHSEMFIPMPVVGHWLKKAWGYYYKLSKMKYIPRIPGM
>CT0952 conserved hypothetical protein
MTPLLFFIATIIFAAGAGAGAFLVNAFRNKQENVCRGSVRIGPSTVAGRG
AFALTPIKEGDIIERCPALEVTDKDIGGELLNYVFYGSAEDRRLIAMGYG
MMFNHSSNPNVAYYREDTPTGPELIIYALRNIAEGEEMYYNYGDDWWKTR
GEKSDF
>CT2140 conserved hypothetical protein
MNQQTYAFTLEMEVRDYECDMQGIVNNSVYQNYLEHARHVYLKTVGIDFK
EFTERGINLVVVRAELDYKLPLQSGDRFRVGLNMVRQSPLRFAFYQDILR
LPDMKPAVKARIIGTALNGRGRPEIPAELEALMQPGE
>CT1774 oxidoreductase, short-chain dehydrogenase/reductase family
MEAGRIDAADTPAELVELVYASRLLGKDDRLVMHGGGNTSVKCELTDFIG
NHVNVIFIKASGVNLASVDAGDFTPVRIDPLRKLQKMYESGQRHSEEDIR
RFSTREFKNFLYLNLFTLTDHMVSRSLSPSIETLLHAFLPHRYILHTHSL
ALLTLSNQTDGERLCREALGDGYGQVPYIQPGLGLANLAHDAYEKNPSIE
GLVLHKHGLVTFGDSAKEAYDRMIDGVNRIEERIASAARKVFASAPMPTA
IASVEEVAPIVRGACSFEKTPGEKDYQSFVLEFRTSPALLDYLKIADLEA
FSKKGAMTPDFIIRTKNHPLVAPAPDAADLEGFGKELRARAKRFTEEYRS
YFERQQQATGMDVSMIDPMPRVVLVPGLGLFGLGLSAADAKLTADIAEHS
AVAMLDAESIGCFESISEKEAFEIEYWDMEQAKVRKSHNGGEFAGKVALV
TGGAGAIGLATAKAFKAKGAEIVIMDIDPAALEKAAAELGSGTLTIPCDV
TNAAAVREAFDTVCRTFGGLDIVVSNVGVAWQGRIGDVSDELLRRSFELN
FFSHQTVSQNAVRIMRRQGIGGVLLYNVSKQAVNPGPDFGPYGLPKAATL
FLLRQYALDHGRDGIRANGVNADRIRSGLLTPEMIKARSAARGLSERDYM
AGNLLGLEVSAEDVADAFVHLALETKTTGSITTVDGGNIAAALR
>CT0524 dihydrolipoamide acetyltransferase, putative
MKTQSQDRWFIWQLSEELEAKIRYREYGPPDSPFTPLLFIHGYGGMIEHW
NDNIPSFDDRYRIYAMDLIGFGQSGKPNVRYSLALFAAQIKAFMHLKKLE
KVTLVGHSMGAASSIIYAHHNPDSVRALVLANPSGLYGDSMDGVAKIFFG
LVGSPLIGEMLFAAFANPVGVSQSLTPTYYNQKKVDLNLINQFSRPLQDR
GAIFSYLSPSKRPHDFMLDGLKPCNYKGDAWLLWGAEDTALPPHKIIPEF
QELLPQAGAYIIPKAGHCIHHDAHETFNNRLAQLLQRLE
>CT0177 proline iminopeptidase, putative
MSYLTSSRCKLFYEDTAEQNPSLKDKPAILFVNGWAISSRYWRPTIDLLR
QDFRCVTYDQSGTGKTSIDGCQPDLTIGGFADEAGALIEHLGLDKSRNLH
IVGHSMGGMVATELCLRYRDALLSATILACGIFEETPFTSLGLMFLGGLI
DVSMNFRNMFRVEPLRTLFIKRAATGHISKEYSDIIIEDFTTSDKAATNA
VGHFSIDPEALRTYTRSVIEIASPVLCCVGMADHTIPPEGTITLFEKRKA
SATSPTRLVQFMHLGHLPMLEDTPCFVEQLKKHFDFAEHFYKKTQPATPL
ADRVQIQ
>CT1812 hypothetical protein
MTRNDSGDLISRNDSTNHMNTTQPQKMLPEATFSDRMLEIGIKYKKQLTA
LVVVICLAAGGTLFWMQKTKVDEVQASLALAKITPWIEMGDVNKAVNGEG
SIKGLNSIIKTWGGTPSGKTARLYMAYILLNSGKPDDALSIYKGFSSDNK
DLQASALAGAAACHVQKKAFAEAAPEYEKASETAENETLKSMYLTKAAES
YSAAGQADKAAKLYDQVIKTWPATSSAGMAQRALLRLAGAGVQIPQI
>CT2087 esterase/lipase, putative
MQTRDRLFLPRDSMNEYYLETTVSGCYLVESPPGSGPFPLLAGFHGYGQT
AEDELELLRNIPGSDRWIRLPIEALHPFINAKGQPGSSWMTRRDRDRRIA
ENVRYVDAVIGRVMAEQPVDGRLVLHGFSQGAGMACRTAVLGCHPVVGVM
LLGGDIPPEIADCSRMRAVHIGRGDRDRFYPQKRFEADVARLREAGIEPV
VSQFRGGHGPTAEYFDAAGRFLNKIGRG
>CT0729 Nudix/MutT family protein
MANGTVKIAEQSGVLPIAGDKIVLITARGSGRWIIPKGYIEKGMSPAESA
AKEAWEEAGIVGSVRHEEIGTYSYRRPSGIFSVRIYPLEVESLLEQWDEM
HVRQRRLVTPSEAIEMICLKELRSLITDYLIKRFDF
>CT0373 oxidoreductase, Gfo/Idh/MocA family
MKQKIRIGFIGSGWAQVAQAPAFSLMEDVELAAVASPTERRRQKFQDRFG
IPEGYADWREMLDECPLDLVCVTTPTFLHDPMVTGALEKGVGVLCEKPFA
LTVEEAERMNALASKSPGLSLIDHQLRFHPSVRSMKQMIDSGEIGKVYEV
RAVVNLASRNRIDMPWSWWSDASKGGGALRALGSHLIDLNRFLVGEISEV
CCNLSTSIPQRPDASTSGKSLPVTSDDSFAMFMKFGPSSVALGASSLMHV
TTVGSYTWFSLEVVGSLKTIRLDGAGRLWEIVNSEVKGGRSLIDAPRWKQ
LEPMLPWDELVLQEKIKQSSLAVHGIFAVGFAFLAHRIVKALKSGDPVIL
QDAASFRDGLAIQKVMQAGLDSDREKRWVKV
>CT2206 polysaccharide efflux transporter, putative
MKRDVTIAREGSIAMAGFAFGQLFRFGYNFAAARLLGAEALGTYALVVAV
MQVGEVVAVGGFDAGLLRFVSQREGEKRKSIIASAMKRSVLASMLAGVLV
LLFSGDVAGLLHGGWLMQLALCSAALALPLSVMTIMAGVTMQAHHNLLPK
VLATQVIQPVLLVLTMVAARYALGVSAALALPFLLAPLAALVWILPGFRQ
VTGIGLSDVCRAYGNRELWQFSLPLLAVALFSILSHWIDVVMLGFLTDVR
TVGLYQSAARTAGLLRSVLLAFSGIAAPIIAGYHGRQENTGIRETYETVN
RWIVMLVMVPFLLLVLFPDEVLSVFGKGFGAGSTALVLLAVTSLLYASFG
LGNTVLAMSGGERLSMMNQAGALLLQTLLHWLLIPRFGLNGAAFSTLAVM
ALLTIVRMAELRSQLGIPALSGKLWKPLAAGIAVGAIMLAIRYGASSLPP
LMLLAVAGVAGGFVYILLIRALRLEREEMEVILNFMPFLKRQRTNAAP
>CT0859 membrane protein, putative
MVFKQIRNWIVPGVLGLLLLMMPDISFADGTTQIAATAWWVWVLVLFVFS
FLLGIVAVLAGVGGGVLFVPIVSGFFPFHIDYVRGAGLLVALAGALSAGP
PLLRKGIADLKLGLPMALVGSISSIAGALMGLAMPAKNVQLLLGIVILGI
TAIMLKAGKSGYPEVKEPDALSKMLGISGCYFEEFGQHEVSWQVHRTLVA
TVLFFIIGFIGGMFGLGAGFANVPVFNLLMGVPLKVAVATSGLVLSINGS
AAAWVYMFNGAVLPIIAVPAVGGMMLGSKIGAKLLPKVNTRTIRMIVITI
LALSGFRSLLKGMGG
>CT0198 MFS transporter family protein
MEMKQEKRFFGLSGNVFFAGLVSFCMDVSSEMIYPLVPLFLASVLGVNKA
MIGLIEGSAESMASLVKVFSGWYSDRLGKRKRLMLAGYSVSTLSRPLMAL
AGGWHQVLAARLVDRFGKGVRTAPRDAIIADSTEPSWLGRAFSFHRSMDT
MGAVVGPAIAFIGLQVYHSSYRQLFWLSSLPGVLAVLIIVFFIREKRDAP
VRADAEKPRSTGGGKLDRRAWFFIVIVAVFALGNSSDAFLILRAGQLGVP
AAMIPAVYLLFNLVYSITAIPAGIVADRYGRKRLILAGFVLFAALYAGFA
VAGSPLAVWVLFALYGVFMGLTEGIQKAFLATLTPEGLKGTAFGLYAGAV
GLAALPSSLIAGVLWDRVSPSATFWFGAATALLAAALFAVFIAGIKSKPS
FGE
>CT1213 conserved hypothetical protein
MHSLYLKPKEHRRLVSGHLWVFSNELREVPRDIAAGETVQLFTHDGRLLG
AGFFNPQSLIAFRLLTRGEEQPDRDFFRRKLLEALKLREKIYPESETNAW
RLVHGESDGLPGLVIDRFDRAFVLQSFSAGIDQHLPLFCELLRELFDPKA
IVVRNESPLRELEGLPLYRETVLGESSDMHQEIRDSGISYRVNILEGQKT
GFFLDQRENRRHIRKYAAGADVLDVYTNDGGFALNAMHAGAKSTTMVDIS
QEALQRAEQNARTNGFGNFSIVAADAFETLGQLRHENHTFDLVILDPPSF
TKSRKTVPTALKAYTKLNRLGLQLVRNEGYLATASCSHHVSEEDFLAAIH
LGAMQAGKHLRLISRAAQPPDHPVLLAMPETSYLKFACFYVTNL
>CT1315 hydrolase, haloacid dehalogenase-like family
MSMNFKGVIFDLDGVITGTAKIHSLAWEAMFNSFLQNYAEVNNEPYVPFD
PVHDYLKYVDGKPRMEGVKSFLASRGIEIPYGELDDTPEKETVCGLGNRK
NSLFTKILVKEGPEVFQTSVDFIKALKARGIRIGIASSSRNCQLILRLAK
LEELFETRVDGEVSMELKLKGKPNPDIFITAAANLGLEPYDCVVVEDAIS
GVQAGSKGNFGLVLGIAREIEGIKLKEQGADIVVRDLGEITIEEIDKWFD
TGLEHEGWNLHYDSWSPKDERLRESLTTTGNGYMGVRGAFESGMTSAHHY
PGTYLAGVFNKLPSEVHGQTVWNNDMVNAPNWLPIEFRIGNGAFINPLEQ
KILSYRQNLDMRHAVMEREMVIQDTLGNITRMKSKRFCSMDNPHIAAIRY
TIQPVNYSAEIEIRSTIDGRVQNRNVLRYNTLSTDHLEHVDHGRTGKDEG
IFLHVRTNHSKIDIVTHAKTTLRCGYHAKSVCEGNITSSPRWISEHFRLQ
VSADRSCSIDKVVSIHTSRDAGHNDPVAAGKESLASAGSFDQLLERHIEA
WDKIWQKADMKIDGDRFTQMVIRLHIYHLVSTVSPHNVNIDASIAARGLS
GEGYRGHYFWDEIYIMPFFIQHLPEVARALLMFRYHRLDAAREYARDNGY
HGAMYPWQTADDGREETPTIHYNPKSGAWDPDLSCRQRHVSIAVFYNAWR
YVHDTGDTEFLNSYGAEMMFEIARFWASIATFSPDDGRYHIEGVMGPDEF
HEELPGSGKPGLKDNSYTNIMTAWLLEKAIEISQRLDPAVMDGLMEKIGI
GHDEFMKWRDISGKMNVLIDQNGILEQFDGYMGLKELDWEHYKLKYGNIH
RMDRILKAENDSPDHYKVAKQADVLMTFYTLSPAEVCAILENLGYHVADP
LRFVRDNYAFYEPRTSHGSTLSKVVHSIISSYLPNGHEMAWNWFIEALRS
DIHDTQGGTTPEGLHCGVMAGTIDTVTRYFSGIAFHKDMLNIQPNFPSHW
RRLETNLTFQKSWYRIVITPKSVSVTLTESDANELPAFIGGRSVTLKKGE
ELTVQLG
>CT0909 hypothetical protein
MMNQFFEPPRVEALHVQNYRALQNVRLDSITPLTVLLGPNGSGKSTLVDV
FAFLSECFGEGLRKAWDRRGRFRELRSRDSVGPIVIELQYREKPGTPLIT
YHLEIDEKDRGPVVKREFLRWKRTHPGAPFYFLDYREGVGRVITGEQPES
QDKRIEKPLSGPDVLAVNTLGQLAENPRVIALRAFITSWHLSYLSADAAR
GNPEAGAEERLSQTGDNLANVIQYLGEEHPERLNKIFETLKRRVPRIEKV
TSRPLDDGRLLLQVKDAPFSSPVLARFASDGTLKMLAYLILLYDPEPPQL
IGIEEPENYLHPRLLPELAEECDMASERTQLIVTTHSPFFIDRLRPEQVR
VLYRGADGYTRAKRVADMRGIGEFLEAGASLGDLWMEGHFDVGDPLAGEE
GA
>CT1253 hydrolase, alpha/beta hydrolase fold family
MLHYKKHVIAEDAPWVVFVHGAGGSSAIWFLQIKEFVKHFNVLLVDLRGH
GRSKHITTSKEVRHYNFEVITRDIIEVLDDLQIQQAHFIGISLGTIIIRN
LGELAPERVASMVMGGAIIRLNVRAKVLVAVGNFFKSLVPYMWLYRFFAW
IIMPKARHRKSRIMFVNEAKKVAQKEFMRWFTLTYELNPLLKYFEEKDTG
IPTLYLMGDEDYMFLPAVKYIVKRHTNSYLEVISNSGHVCNIDQPQEFNS
RAIKFLCNVSLQSLPESVDTMPQLAAV
>CT1220 conserved hypothetical protein
MSHTTSTLLIAGAGASGMLAAIAARRVAREHGVADERLRIVLLERNPKPG
NKIAISGGGHCNLTHDADVKSLLEKGFLNKGERRFLRHAIHMFSNADLLK
LFGRYGLKTEVREDGRVFPVSGRAGEVLDLLRRMVEESAVTLVTGARVER
LECGAAGFVARAGERRFEADAAILATGGASWGSAGTTGDGNRLAVAVGHT
ITPVLPALAPNYFTVPPRPELVGITLRNILLVASVDGASDSRRGDVLISH
RGISGPACLSLSRSAAGFLASGKKVTIFVDLFPGHDEGKLSAFILDQAAR
HGSRQVRTFLQRCPLAPERLDAPVASSNAETIPNAFADEIMRQAGIDREV
TMSGLTKAQRQCLVSTLKRLALGAVHKVPLDRAEVSAGGVSLREIDPKTM
QSKIHPRLYCCGELLDYAGEVGGFNLQAAFSTGWVAGSHAARMIIETVHF
SLRRNRQDHQLCGRTA
>CT1203 conserved hypothetical protein
MAMQMIPFPADFSLRVLDLGAGTGLFAAMVAQAYPNATFHLTDISEAMLE
VARKRFAGNPRVSFAVQEHLELAEEPEFDLVIFAFSIHHLEHEAKRELLC
KIFHALRPGGAFINADQALGATTENEESYESQWFSDVSANGATAEAIHAA
KERMRADRNATLADQLAWLEEAGFGEVCCAYARFRFVVYGGRRG
>CT1749 moaA/nifB/pqqE family protein
MNRMAGCINDHLKRYGGLAAGREDEQKRYFAEKRLYALQIETTDACQQGC
IFCYAGSTPREHHGLTSDEIRGLLRDAAALEIRAIDWLGGDPLVRPDWYE
LMQYARSLGLVNNVWTSGLPLKSKEVAARVHEVSEGGFVSVHVDSITPEV
YAKLHRGGNPHFIEAIVEGVDNLLALGKPADMMINCITYTSLQGPEDAIK
TMRWWFCEKGLRTCLTMFNPAGMGAEWRSLEPQLDEVQRVYTERDRIDYG
GDNISIAAMDTDKYYCGTMATVTFTGDVTPCSVIREGVANIRTTPFRDIV
ARHLDTLVHAALHDVQNLPNPCNDCVNNAHCWGCRASAYHYSGDADGLDP
KCWLIRTALTSDSFSVNNNLQKSTDEEIGLKP
>CT1708 hydrolase, haloacid dehalogenase-like family
MSRTLVLFDIDGTLLKVESMNRRVLADALIEVYGTEGSTGSHDFSGKMDG
AIIYEVLSNVGLERAEIADKFDKAKETYIALFRERARREDITLLEGVREL
LDALSSRSDVLLGLLTGNFEASGRHKLKLPGIDHYFPFGAFADDALDRNE
LPHIALERARRMTGANYSPSQIVIIGDTEHDIRCARELDARSIAVATGNF
TMEELARHKPGTLFKNFAETDEVLASILTPKHS
>CT0704 zinc protease, putative
MSALSFRGNSPGIEYTVKVSQRARYARLKMSPVEGLTVVVPVGFDKKQVP
ALVESKREWILKVRRTFDKHRAAAPAQGDAALPTVIELAGIGESWRVRYR
SEPRQRITITEKGEGELEVSGPVSEHAMCFAALEQWLKHRAKLKLGAQLM
RLASINGFKVSGVSVKKQKSRWGSCSSRGNINLNLKLIFLPPLLVRYIMI
HELCHTLHMNHSARYWETVARFDPDCVVHDREMKHAWRFVPAWFSNAR
>CT1099 acetyltransferase, GNAT family
MIIGDSSAGTIFVCEMDESIIGMVSLLNLVSTALGKKVAMLEDMIVDPEW
RGQGIGAMLLDHACSWARENGYGRITLLTDGDNVPAQRFYGAHGFARSTM
VAFRKQL
>CT1703 hypothetical protein
MQDGHRPGAGYPMRLIACGDGGDNVEFDSGIEFPGRGQAVKRKISDALKA
RHQRYAANEQQTAEPVEANERRCAVPLRLGDHERELAGCEVVQALRNDHQ
ISRAGFDSFEVKRIQVSEA
>CT1731 conserved hypothetical protein
MNTFLIVAIIFALTVVMIMSGRGGGNFYVATLVLLGVQMHTASTTSQFIL
LASALVGAIVFGKARVMSWPLAIFFGSLNATMAFVGGFMAHSFTGTLLKF
ILSLLLFVAGVAMLFPEKQARKVAISRFGYWNIQEGDNLYVINLWVAVPL
TMATGFFSGMVGISGGSFLIPLMVVGCGVPVRTAVGTATAMLAATALTGF
AGNALHGGFDPELAIPCGAAAVVGGLIGSKIALKTKPKSLKIISGVLTIV
AAIAMLANAVSGK
>CT1143 oxidoreductase, short chain dehydrogenase/reductase family
MRKSLGVVITGGSAGLGLAMAREFLRAGDRVVICSRRESNLKSALQMLGS
DVPDRNVYGMVCDVSLPAQAADFAAFAAAKLGIIDRWINNAGTAGRKRRP
LWELDLSDIDETCRTNLSGSMMLCAEALRVMLRQPASADEPLYHLFNMGF
SSAGLRSSPTSVPHRASKRAVAIMSKLLRQELEAAGIRSVGIHELSPGLV
LTDLLLRDATPAQKRFFNAMAETSETVAATLVPAIRAITGRGSTLRYQPV
LFMFAKLAASAFGYRKERFFDSEGKPWG
>CT0992 drug resistance protein, putative
MKKSPLAILFLTVLLDLIGFGIVLPLLPTYAKELGASPFMIGLIASIYST
MQFIFSPIWGKLSDKIGRRPVMLSSIFLTLVSYVFFSKAVTIPLLILARS
LSGIGSANIAAAQAAITDVTDSKSRSGAMGMLGAAFGIGFIIGPLVGGVL
MTNFGISMVGLFAAGLNFINFTLALFLLNETNPHTEGFLSLFRKNPESVV
HTNNSLFASLAHKSSAYADKIHEVFSSRPVALLMIINFIYTLAIVNMQVS
AILLWSDVYHATEQQVGYLFAYVGFFTVIVQGVMLPKMTRNYGEHKLMVL
GHITSFIGVFFIPFIPVTSLFTVGLAILLFFAIGTSLVNPLNISMISLYS
YKQKQGQIMGFAQSVNALARILGPFSGSILYGYDHRMPYYVAGALTVVGT
FISMTLFKYEIEAFEPTTEMAE
>CT1814 conserved hypothetical protein
MESISSNLTAVREQIAEACRKAGRREDEVTLIAVSKTKSAAAIREAWDAG
QREFGESYVQEFLEKVEAPELSGLPVSWHFIGHLQSNKVRQIVDKVTMVH
GIDKVSTAKELSKRAGQHDLTVDYLLEVNVSRESTKYGFSPDSVLQAAEE
CFALPNVRLRGLMTIASPAPSEARREFAELRQTLDKLRQNAPEPSLLTEL
SMGMSGDFEEAILEGATMIRIGTAIFGWR
>CT0169 oxidoreductase, short-chain dehydrogenase/reductase family
MQNILIIGATSAIAEATAQQFAAKGHRFYLLARNEERLKTIASDLLVRGA
SAVETALFDANDTVNHRTVLEKAKATIGSFDIVLIAHGTLGNQKECETNP
ELALKELQTNALSTISLLTHTANTMEQQHHGTIAVISSVAGDRGRPSNYV
YGTAKAAVTTFCEGLRARLHKSGVHVLTIKPGMVETPMTEGISAPDMLLA
KPEQIASDIVNAIEKKKDVLYTPWFWKYIMMAIIHIPNTIFKKSSL
>CT0282 glutamate synthase, small subunit, putative
MVVETSNAELDAMRRQSLERLLARHCGDCLAPCELACPAGCDIPGFVSAI
AKGNDREALEIIRRTIPLPGILGRVCPAPCEEACRRHGVDEPVSICALKR
FAADQDMVEGSGLPERKPASGKRVAIVGAGPAGLTAAWYLLLDGHAVTVL
DANEKAGGMMRYGIPKFRLPEAVIDADVKPLVKMGAEFRFSTLFGKDANL
EELQQEHDAVLLTIGASQASKLGIPGEELDGVQSGIGFLANVADGKAAAP
GKSVIVIGGGNTAIDAARTALRLGAESVTILYRRGREEMPANRLEIEEAV
AEGVELRLLAAPVAIEKGANSLVVTAVEMQLGEPDASGRRRPVPVAGSEF
TLHADTVISATGQQVDLPAEAAAGIGVERNGTVKINGESMLTGAAGVFAA
GDCVSGPDLAIKAVRQGRLAAEAIDRYLNGGDPAATGMSMFNSSYGARDK
APHQFYDRARPAARVAVPELEAESRRQSFEEAVTGYDPEKAREEAKRCLR
CRCQAVDDCRLRNLATAFGVATPATEEAHEYFSIDRSGDMRFEREKCIDC
GICIRTLESASADAVTIREELIDHCPTGAISR
>CT1932 peptidase, M16 family
MHPNPSAIVRKNIVSPSIGNATHPAERLSTGIVESGTLPNGLRIVSNQVP
WIHSVTLGLWINAGSREDPEGFEGMAHFIEHALFKGTQKRDYVEIARCVE
ETGGYIDAWTTKEQTCLCVRCLREHLHLAFDLLADLCCNPVFPPDEIEKE
KEVVLEEIASVNDTPEELIFEDFDRRAFSRHPLGTAILGTEESVERLTGK
EIRDFMRRHYVPSKMLVTAIGNIEHDAVTGLAESFWGHLKDSPQEDSVRR
LFDLSAYRPFTKTLKKSVFQSQILLGTIFPRDDRRFWGLMVLNAMLSSGM
SSILNLELREKRGLVYQAYSSVSFYDEVTEFNVYAGTDKGKTSKTLDTIA
ELLTGNVLKEPDPFELAAAKSKMLGSMILGMEKMTRRMSHIAQDMFYFGR
YLSPSEKAGMIDGVTAEDVAVAAAALGIPEQISTLVYKPGGR
>CT0975 sodium:solute symporter family protein
MAQLTPLDISFIAGYLLLTLLVGLFFSRRASENVGEFFLSGRKLPWWIAG
TGMVATTFAADTPLAVAGFVAKNGIAGNWVWWTFVSGGMLTVFFFARLWR
RANILTDLEFIELRYSGKAASFLRGFKAVYFGLFINAVIIGWVNLAMFKI
IKIMIPGLDPQLTIVGLVIFTAIYSGMSGLWGVSITDAVQFVIAMAGCII
LAVLAVNSPQVTAAGGLKQALPDWMFDFFPSFSSTVNTSATGAYALPFTA
FAAMAFVQWWASWYPGAEPGGGGYIAQRMMSAKDEKNSLLATLWFTIAHY
CVRPWPWIVVGLASLVMFPNLPAGQKEDGFVHVMNAVLPAGLKGLLIAAF
MAAYMSTLSTHLNWGTSYLINDLYKRFIKREAEQNHYVLMSRIVTAVTAI
FALYITFYVLETITGAWEFIIQCGAGTGFVLIMRWFWWRLNAWSEITSMV
APIVAYSYINQFTTIVFPESIYIIVLFTITCTLAVTWLTKPTDREKLHAF
YRTTRVGGALWKPVADEMPDVKGDTGFPALFADWFFGIVLVYATLFGTGK
LIFGEPVAAALYYAAGALAGVMIYRDLSKRNWKTFD
>CT0131 conserved hypothetical protein
MEGHIVITGATGVIGSEVARRLIKSGREVVVFARSPQSAAAKVPGAADYV
RWDSDMAPDGWSSSIDGAYAVIHLAGRPLLETRWTEEHKVACYDSRIKGT
RALVAAMASASVKPKVFVSSSAIGYYGSFDRCEETDPLTEKAAPGKDFLA
KICFDWEKEARPAETLGTRVVLLRTGIVLSTKGGMLQKMMIPFSYFVGGP
VGSGDQCLSWIHLDDEVSIILQALDNADWSGPVNAVAPEPVSMKAFADSL
GLVMHRPSLFPVPKLAVQILLGEGADYAVKGQKVSPEFLKERDFHFAWPS
LNEALADLVSRGI
>CT0044 hypothetical protein
MAVIEKAQGFILGDWVDRRVSRQTAKDFFSSGAHVFPEHEKPVYIDMTPT
ALGKGPRLPAIPVSGTTPAKSRRSSLWENHEQLATLFDDWEVANRNDLFS
VVGLKKFNYIANSDFHERRHLLSWKTLLRCEKMSSRLRPPSERMSRFRYF
CAVVEKSGCDLRPQKRNDGRCFQALCRFV
>CT0592 hypothetical protein
MLSRIQSLYLFIAALLAFGSMAFPFWTFTTDHVILFGDFMDVQGAGLIVT
AGSIGGGILSPLTGIVALATIFLYKNRKLQQTLITLCFVLFAADLLAGLA
GGHFLKQYLETKASSVSFAPGSGLFMLLPEPVLFWLALLGVKKDEKIATA
YKRL
>CT1367 ComEC/Rec2 family protein
MRYFLSAYPSVRLLACAVAGIMAAIFLPVSPVAWFGVAIASCIVTALLLF
VSRKRHPAGTVSLFSAACYLTAVFSAFAFHASASYRLAPSPSLLSWVGRD
VILSGVVDGRPVAWNGTARLRLRVNEVFEDGQTTRVSDRVKVVVRMPGDE
DPQFQEGDFVRVKGRPALIPVASNRDEYDARFRERLKGTHVQIFCAGPWQ
LLREPPKPGFSVVPSIVNPVRNYLNSAIDRNFPEGGARHFIKGMVLGERD
LMPEELYEAFRRTGTAHVLAVSGLHVALLAYAVNLCLQRLKVTQAGRWLS
LFIIVAVIGLYSFVTGNAPSIKRASVMTAVIIAGGTIGRKSYPVNSLAAA
DLLILLFDPFDLLTPGFLMTNGAVLGILTIYPHLSGVVAHGKGVLRPLAH
FFWSAFSVGFSAMIGVSPVIAWYFGTFSPSGIAANLPVVLFSTLAMYASF
PMFLFHGFASGIASLFGAASWFFAGLALSCAELFSRLPLASVEVRPGLFD
VAVFYLTFAFAFYAIVNRAWGRLALFVLVGMNMLLWHQVLRPVQKPPQVV
TINMGRDVAVLFSSSGETCLVDAGRRDGSWERIRQQASVWNLATPVSVAS
LFSPASVIRSLPVSPTASGAAPHRSFVIRMLDDKVLRIDSKSHSLLLVSG
LKRLEAQRADGADVVFWIYRFTGKEWHRLDAWIAETRPRRMLLVPGPFMT
AEQRELLNRYAASRPQVEVRSRSRQTVWY
>CT1040 methyltransferase, putative
MSDHQSHSRGKNAREWFEEWFDHPLYLKVYHHRDAEEAERCVRTILDLTG
IDPAWQPPHSVLDIACGAGRHALSFARTGLRVTANDLSPYLLDQARKQAK
AEGINMEFSRQDMRTIRFERRFDLIAQLFSSFGYFETDQEDRDVIANIAS
LLNPGGWYVLDLINPVQLKSQFTPRTERNSESLSIIEERTLSERHVTKKI
TLHEANGRKHSFTESVRIYSPAEAFSLLESGGFAVERVVGDYEGSPFDEA
TSPRMMLLARLLVSRS
>CT0132 RNA-binding protein
MNIYIGNLPYSVTDEDLRDKFSEFGQVHSANIITDKFSGRSKGFGFVDMP
NESEAREAIDAMNDKDFKGRTIKVNEARPREQRPPRREQY
>CT0991 oxidoreductase, Gfo/Idh/MocA family
MRIGVAGVGKLGEFHTNLLKQIAAEASDVEFSGVFDLNPERLQEIGKKYG
VACFSTLEELAASCDAAVIATTTSSHYAIARQLLEARLHLFIEKPITATL
EEADKLIRIEQEKGLTIQVGHIERFNPALLAVEPYIGEPMYIQAERLSGF
SRRVTDVSVVLDLMIHDIDLVLSLIKSDIRSIAASGVKVFSNELDMATAR
IEFENGAIANVTASRLSRSKLRKMRFFTRNPKSYASLDFTSGKSEVFRLV
PPDQLSSKNPIKNFATKKILEQFGEIQESLKEMALDYVSPDVPKINALKE
ELEQFIDAIRKGKPAKVTSEEGRRALMVAGKITDEIRNNTADLD
>CT1052 peptidase, M20/M25/M40 family
MHNEAFSTIAARVREAAHNLYPEVAALRRHLHQHPELSYQEFQTTAFIKK
YLSGLGIEAEPPLMETGVIALLRGEGAPPSGERRTVALRADIDALPLQEE
NGHDFCSTVERCMHACGHDMHTAMLLGAATVLSGMKDALNGDVLLIFQPA
EEKAPGGAKPLIEAGLLKKYKPSAIFAQHCFPSVKSGSIAMCKGGFMAAA
DELYVTIHGQGGHASAPHKTRDPILASAHIITALQHLVSRVAPPHESAVL
SIASISGGHATNIIPGNVTMMGTMRTMNEELRALLHKKFEKTVRQVADAF
DVEAEVEIRRGYPVLYNDPAMTDLAWEAGKEYLGDGNVRQSEPVMTAEDF
AYYLQECPGSFWQLGTGLPDSAPGNLLHSPTFDPDEHALETGMGMMSYLA
LRFLAG
>CT0100 conserved hypothetical protein
MTLDELNRLLDPQVLALIDAHASDDPATFAMQFHGRGDLPVRAIAEQIAC
RKKAAAKLPSLSRFPMLYTRLGLEQASGERAAEWKASLMRGWRAIDLTGG
LGIDTLFLAQRFDSVVSCERNEALARLAEANRRMMGVTNVETLIGDSEEL
LAGYADDSFDWVLVDPARREHGGRSAGLSASSPDVVRLHDMLLRKARRVC
IKASPALEISGLETQLPTLSEVIAVSVDGECKEVLLLLYREREAGLTPEI
RAVCLGSETFEIVSSGGVPPARVVAEAPGTWLYEPDTAIIKARLTGELAR
QFHLEFLNRTVDYLTSDRLIEPFPGRSFRIEECRPFRQKSFRKELAELEI
TNAAIQRRDFPLSVEELRKRYKIGESSERYLFFTKNATGSLIWLSCRKP
>CT0916 hypothetical protein
MPEPFTLMSLFARQSPALIASRIALAFVWVYQGAVPKLVCPSPVELGLLS
YLGPLYGFMFSVMGYGEIAFGLLLLLTPWRWPFLLNIAAMLSLLGFVSLY
EPRLLAEAFNPVSLNAAVIALSLTAYWEMGKVSASNQ
>CT1646 tetrapyrrole methylase family protein
MNSRDEHKGTLYVVATPLGNLDDMTFRAVNTLRNAGAIACEDTRRTSILL
KHFGIEGKRLVSYHSFNEERAVRQVIELLEEGSDVALVTDAGTPAISDPG
YTMASAAHAAGLPVVPVPGASALTAALSVCPLPSSSFFFAGFLPHKKGRK
SRLEFLASIDSTIVLYESPHRIGRLMEEVKEHFPDAQVFAAREITKMHEE
YVTGTPDELANHFTGQKQRGEFVVVVHPPDKHSKKRQEHADHQ
>CT2055 conserved hypothetical protein
MELTESIRLLFPPEERAQLDISAIKGDASNRQYYRVTGQGSVSVVCADPA
FRATAVENYPFLIVRDLFARHGIRVPELLGMVHEQGLLRLEDCGDLMLQD
EVPLLDRNRLSARYRQVIDLLVRIQSIRPDKDALTTTLPFSLSFDHEKLM
FEFDFFIEHALNGYFAGRLGKPAIARLREEFINICDLLVLPKHFVLNHRD
FHSRNIMLFRAEPVVIDFQDARLGLPQYDAVSLLRDSYVRLDPGMVNELK
RYHFNQLVQLGLTSMGEAEYLRLFDLMAFQRNVKAIGTFCYQTTVAGNRT
FEPSIAATLSYLREYIEARPELAMAGRLLKPIIPEISR
>CT0454 hypothetical protein
MRLYIREYFRTLCDFCYSVKRRFWNFCVVRVIVLFSLFWSIILINLGINA
INPKNICVARRYGFEKLTDMLSRLFSVCFTGNNEFFLTLAKVSRRDGLSG
GMEAPDLTSCSAQLLPGNKRRFFLILQSARGRGV
>CT1595 conserved hypothetical protein
MSFDKKVTIAHLSDLHFASKNDRYLTARLDTMLGEFVRRKYDHLVMTGDL
IDTASPALWTIIRDALVRHGLFDWTKTTVIPGNHDLIDLEEEMRFYNALN
PDDRSRQHRVDDRLRQFNAIFRPLITDNGDALAGVPFVKVMRLGGISLSF
VAVNTVDPWSGLDNPAGARGSVSPETLRALQEPGVRQVLDDTFIIGLCHH
AYKVYGTGALVDQVFDWTMEFKNRDEYLKAMKNLGVRLVLHGHFHRFQVY
QANGINFINGGSFRYSPERYGELVINADGRWSHRFVNLALKK
>CT2225 hypothetical protein
MIMSSTYRETVVDGYNLIHKLGKAGPGASMADLRERLEAMLARYRQKARR
HVILVYDGGSGPKPLTLTGAVDVTFSGTVKSADRWIIDHVRSLGVRATMT
FVVSSDREIQRYSKAYGAKCVDSETFIDELAAMGIAIDKDGRRKGQQSGV
KMNKNASGLLSDKEVDYWLGLFARKR
>CT2247 iron-sulfur cluster-binding protein, gltD family
MNAESNPILDFATEYVYPAFSELTGTDKIVAFGDHSHKCPIYVPQTPPCT
AECPAGEDIRAINRFLNGTDPSDDPLKSAWETATDTNPFPAVMGRICPHP
CQSKCNRGVHDESVAINAVEQVLGNYGIEHNLKLKGPGADTGKRVAIIGG
GPAGLSAAYQLRRKGHAVTIYDANEKLGGMVLYGIMGYRVDRKVLEAEIG
RIIELGVETKMGVTIGKDITLEQLEAEYDAVFIAVGAQKGRALPVPGFEG
TPGATNAIDFLKSYEVLGDDIPVGKHVVVIGDGNVAMDVARLALRLGSQA
TIISGVPREEMACFENEFDDAKNEGTTMHFLTGTVEVLGGASGVTGLRCT
KMVKKEKGEEGWNSPIPFLRYKSNGESFEIEADMVVAAIGQATDLSGLGS
AASGPWLKVDRNFRIPGREKLFGGGDALKVDLITTAVGHGRKAAYAIDAF
LKGEPMPEEPYREITKPHKQDLLYFLHTPQAKRTSIKPEVVVGNHDELLE
ALTPEQAITESKRCMSCGFCFDCKQCVSFCPQEAITRFRDNPAGEKVYTD
YTKCVGCHLCSLVCPCGYIQMGMGDGL
>CT0181 conserved hypothetical protein
MWLPERAGRWSCSCAAPGSWGGYSGAFRATARVAPTIFRLLSYIRFMELQ
EKITILSGAARYDASCASSGCSNDTPYRGTGNTSQGGICHSWADDGRCIS
LLKILLSNDCRYDCAYCVNRSSNPVPRASFTAREVVDLTMEFYRRNYIEG
LFLSSAVWDSPDRTMEEMVRVAEILRNEERFGGYIHLKVIPGSSADLIRR
AGLAADRISVNIELPSNESLKRLAPQKSKESILTPMKLIGAEAGFSLVER
TKSRKAPRFAPAGQSTQMIIGASPETDLQILQLSQSLYRKMNLKRVYYSA
FVPVNDDNRLPVLAAPPLLREHRLYQADWLLRFYGFSAEEILSDDAPNLD
ESFDPKTAWALRHPEFFPVEINRADYATLLRVPGIGITSAKRIIAARRFA
PVTHEGMKKIGVVMKRAKYFITCSGRPFEKIDQQPARLRQRLLLGDGSEQ
KQPQQLVLPGLFA
>CT1181 DHH family protein
MIIPEYGRTLTPEEWRPVVDEMLEATHIIFTTHENSDGDGLGSQVALALA
LKALGKEVAIFNPTEVPPNYLFLKELHEINLFRDRDEESMQEFFLADLLV
VLDANLHDRIGRLWPHVEFARQMSRLKVLCIDHHLEPEDFADITVCETYA
SSTGELVCDLVTALELRTGQQLFTPEVASALYAAIMTDTGSFRFPKTTPY
TYRLAGMLVEKGADPELVYDRIYNALTPEALKLLGLSLSNIKIIENGLIS
WLFITQEMLEQTGSKLFDTDLIIRYLLSVPTVKVAVLLVEMQDGRCKVSF
RSRGKIYVNQLAKHYGGGGHMNAAGCLLRMSAEKAQLVILEDVRKFTLEQ
VDS
>CT1771 oxidoreductase, short chain dehydrogenase/reductase family
MKQLENRVAVITGSTKGIGRAIAREFVRQGAKVVITSSRQENVEAALREY
PKDLVHGHVSDVSSYASVESLVDAAVRRFGALDCFINNAGISDPFTSVCD
SDPAVWSRVIDTNLKGTYNGSRAAARYFLSVGRRGKIINMAGSGTDKGSN
TPFISAYGSTKAAIARFTFAMAEEYRNAGLSIMLLHPGLVRTEINHPERT
TPELQKQLKTFNIILDIFAQSPDLAVRYAVKMASSWSDGKTGLYLSALDG
KRKKMMLLSYPFRKLFNRIDRQTY
>CT0947 hydrolase, alpha/beta hydrolase fold family
MDSFLDIQRKKADADSKFIDCNGFRVHYKRYGSGKPPFIVLLHGSFLSIR
SWRDVAVPLAENATVLAFDRPAFGLTSRPVPSRSNAARYSPEAQSDLVVA
LMDKLGMDRAVIVGNSTGGTLALLTALRHPRRVQGLVLVGAMIYSGYANS
EVPAVMKPFMKAMSPVFSRLMKVIITKLYDKNIRGFWHVKSRLSDETLAA
FRNDFMVGDWSRGFWELFLETHRLYFNRRVSSAWAPSLVVTGEHDLTVKT
EESFRLARELPRAELLVIPDCAHLPQEEQPAAFVAGVKKFVEKLV
>CT1784 GTP-binding protein
MKPLIALVGRPNVGKSTLFNRILRQKSAIVDPTPGVTRDRHISPGEWQGK
QFLLMDTGGYAPENDTLSKAMLEQTMRAIEDADAVIFIVDARSGLTYLDL
DIAKILQKTFKDKKIFFVANKVDNPQVALEAQSLVKSGFTEPYLISARDG
AGVADMLEDVLNSLPCPEGEEIEEDDSIKLAVLGRPNVGKSSLVNALLGT
ERHIVSDVPGTTRDAIDSVLKRNGEEYVLIDTAGLRKRTKIDAGIEFYSS
LRTARAIERCDVALVLLDARLGLESQDMKIIHMAIERKKGVLILVNKWDL
VEKDSKTSKAFTDNLQNQLGNIGYIPVIFTSALTKKNCYRAIDTAAEIAL
NRRQKISTSNLNRFLQETLTMRHPATKSGKELKIKYMTQIDSDHPVFAFF
CNDPELLENNFRRFLEKRLRESFDFAGIPITMRFLRK
>CT1829 conserved hypothetical protein
MTSRMIKKIKAFFAGAGIGALGGLIGLGGAEFRLPLLLGMFAFPPLEAVI
LNKAMSLLVVASALPFRAATISWETLFAHWTIVVNLLAGSLAGAWAGASW
ATKLRSETLYRVIAILLLGIALVLLTGHQTTTTGTPLFDSPALLMTTGVI
AGLGIGIVAALLGVAGGELLIPVIVFLFGADIKLAGSLSLAVSLPTMLVS
FARYSKDSSFVVLGANKSFVLIMAIGSIAGAWLGSRLLGIVPDSYLLPML
AAILLISALKVWKHK
>CT1365 Nudix/MutT family protein, putative
MSRQINNDPGRWEVLESIYLHQRPWLTVRQDRVRLSSGKTIDDYYVQEFP
HWVNVLAITEERDVVLIRQYRHGIGEVSWELPAGVLDEGESLLDGAQREL
LEETGYSGGTWTPLMELSANPALQNNISYSFLAEGVSLSGTQHLDPTEEI
TVHLMPLDRLREIVFDGGMIQALHAAPILKYLLQNRWGESVKGEG
>CT1344 conserved hypothetical protein
MDIAKLLTTYKHIAIVGISEKPDRASHAVARYLIHAGYTIYPVNPTLSSV
LGLECWPSLSDIPAEKRERIEIVNIFRKPQDVPPVVDEAIAIGAKTIWMQ
LGITNEAAAEKARKAGLDVVQNRCISVEHMHLVS
>CT0928 conserved hypothetical protein
MIASKTATTRRYVITGGPGSGKSTLIEALEARGQRCYPEVSRELIRREAR
RPNGVMPWNDLEAFARLAFTEMLLQHDHAEEAGERCFFDRAIPDIFGYLL
ERGIDIQESWLDVHRRCRYERTVFILPPWPEIYVNDAERPQTLAEANALH
NAIHAVYESLGYELIEVPRMPVEARCEFVLGRLCCGKEEAIKYSRKA
>CT1960 conserved hypothetical protein
MERGINWIDTAAVYGLGHAEELVGKALRGLCEKPLVFTKCGLVWDENRAI
GVIVYSPMLSGMLTGAMTRERALNLPADDWQRNA
>CT0725 hypothetical protein
MMNIGGLFRKTGVVFALFVLLSGRPSVLWAEAWKFGVMGDTQWTTADPSG
QNPHTVPVSIIRQVNRQFIDAGVKFVIQVGDLSDDGKEISEEERVAAAQP
LIDAGIGFFAFRGNHEAKSAENGYGAPGFRQRYPQNRDGGFTKSDGGSFT
VGSNFSSPVKISRDLDGLSYSFDFGEGQERARFVIIDNWPLPGRLVANST
HYPSGYTIADQQPWISAQLDKHNRKTPHVFVLSHQPLIGEGHQDTLFSGF
ANEHPEWQNRFFESLQSNGVRLFICGHDHIHQRSVITSPDGKSKVEQLIV
QSNSSKFYTPKSLDDTNWFGQKSREISVSQERESVGYYIFTIDGPSVTVD
YWADDHGHWQSDANFPQGAGRADTGVTPQFHFVKKERWSYSLNGKQFLVP
QGASYTVVRDRFNGCEARILDGLNTSESRDASLDSTGQGRPLTKVVNTGW
IAAKPSRHSGKNPAIPVFQLSGLGESGSDHTDHYVLSMSYDPATVRQKNI
ASGGFGLVTQDAAGRWLNAVEANSGGAATFIDGPWRRGYALGSHGIDRKH
HRAWAVLNHEGAFAVANFSAH
>CT1971 CRISPR-associated helicase Cas3
MRHYPKSLKNADAPVEHPPLPLERCLAKSRKIDASRSVAGRTVLDHCRIA
GEVARELIARSPAFLRESFFPDGSALVAASHDIGKVSPTFQKKIYTAIGN
ADPTILDVLNDVDSEIEKNWGGHAGVSQCALEALEAGKDIAAIAGCHHGY
APKLAGKTADAEAFGGAAWQRQRARLLERLAEATGERFPKIRNLLHARVL
AGLTSVADWIGSGATFDDPGEAWQSCIADAVDAAGFTPPRLRPGLSFREI
FGFEARPIQRSLIEKACRPGAYILEAPMGIGKTEAALYAAYALVSAGKAR
GIYFALPTQLTSNRIHERVERFLDKVLEADSPHRNALLVHSNAGLQKFEF
GADAAPGCSWFSASKRGLLAPFAVGTIDQALMASMNVKHGFVRAFGLAGK
VVILDEVHSYDAYTGTILDRLVRELRQLHCTVIILSATLTGERRSALLGA
ARSAEAAYPLISALPAEAREPVEIAPETMPGNRVAIHQTLDIDEAMDEAL
NRADSGQQVLWIENTVTEAQEAFKLLAARSSGMAIECGLLHSRFIHCDRE
ALEEKWVTRYGADGAAARRERGRILVGTQVLEQSLDIDADFLVTRLCPTD
MLLQRIGRLWRHSFHQRPAGARCEAWIVSAQFEAAEQNPGEAFGKSAKVY
SPYVLLRTLQAWSGVEALSLPGDIRDLLERTYEARPESEAMAKHKADLQR
RREILESFALQGVSFGLDARPDTNVATRYSDIDTVELLLLQSVSHNHAAH
ETTVTLLDGQQLTLPHDFAAGRKREQRQLAARLATQTLKVAEYAAPADPG
RNTLTWLKPYFYLGDPSKSESLLRVAIVGEGGLLRLPGGGAAAEKYELSY
NPRLGYQYKKR
>CT1337 MFS transporter family protein
MGMNDTSQKARIFSWLLFDFANTSFSVMMVTFAFPLYFKNIICEGEPKGD
ALWGASVSISMLLVALISPVLGAQADYSGRRKRFLFAFTLISVLATALLS
FSGPGMVLFAAVLFILANIGFEGGLVFYDAWLPEITSPRSTGRVSGYGFA
MGYLGAFAILLINLPLLSKGIVPANIPNLKLSFLIVALFFAVFSAPIFVM
LRDTKGSVGDSSSGERRRERGSSFMHSIKEVGYTIRHIMSYPDLARFLLA
YFFYNDAILTVIAFSSIYAQNTLGFTTGELITFFMTVQTTAILGSVVFGF
VTDKIGPKRTIVITLFIWFAVILLAILSGSKETFIMTGLLAGMSMGSSQA
ASRSLMARLTPKEHVTEFFGFYDGSFGKASAIIGPLVFGVVSAQVGSQKV
ALASLLVFFCLGLLIITGVRTRATAEASPEASRIE
>CT0534 hypothetical protein
MLYVVGGMGVVVWRGASKGEWVEMYLAGGDLKKSLKWGGIAGGILFVLDL
VNTVIYYSKGAPPMVDMLGILVNMNFLFLFPILVLAEEFLWRGLMLSSMV
EKGFNPHLSVFVTALCYVLNHYAVAPVGMYERGLMAMMAFPIGILGGYIV
LKSKNVWGSVLVHMITMFSMVLDIFVIPNVVPSLFHL
>CT2004 membrane protein, putative
MNTPQFNKIAVLVATLLISALFLTMIRQFLVTILLAGIFTGLAYPLFSRF
ITLTRGHRSLSASMTLVIFFMMVFLPLLAVFTVVILQAVSLSSTAIPLIR
EQLRDPEGFLRMLSSLPFYKDIESYSDLILEKAAEILGNLGSSVLSSFSA
ITWTAIYDLVLFIIFWYTMFFLLRDGHELLERIKYYLPLNESDQRRLFDR
FVSVTRASLKGSLIIAVIQGTLAGLAFYVAGINQAVFWGAIMAMLSLLPL
IGSPIIWVPAVIILALSGNYAQAIGLFLFCSIIVGQIDNVLRPILVGRDT
SMHELFIFFGTLGGIGMFGLPGFIIGPVVAALFVTVWDIYGETFNESLIE
RRSGGGQAAGSDGSAP
>CT0493 conserved hypothetical protein
MTPLQKAYRYKFLSQCLAYPNEAFIPALNEVLEKIDADRDPRQTLVAAFE
REETEPLQAEYTRLFLNGYPHTICPPYESVYLEKRMHGDAAVSVAAAYTE
WEISVEPGLIDHLATELEFLAFLASAESLDNTVSENASKASKAFMQQHVT
RWVPQFIEDLKAGATMDCYRMLGEVMEKTLAPLSPKS
>CT1183 florfenicol resistance protein, putative
MEKNEISEERRTQEKEKQHGHRLNIRRLGRKELTELLTRLGEPAYRANQL
HRWLYSNQALRFEEMSTLSKQLRQKLASEWIIHPASLVGTERETTDASLV
TGNPTAKFLIKLEDNELVESVLIPSEERITACISSQIGCPLRCTFCATGH
MGFRRNLTASEITDQVFLLEKEAQKRHWRGLTNIVFMGMGEPLLNLDNVL
ESIGTLTEKDYQFSISERKITISTVGLPVEMDRIARSGLKTKLAISLHSA
DQLIRERMMPIAADITLDKLAKAINSYNSVTSQPVTLVYMLLEGINDSPE
DARKLVRFAKRVLCKINLIDYNSIVTLKFKPGCSSSKTMFIQQLLDAGLL
VTVRKSQGATINAACGQLATRPVR
>CT1967 hypothetical protein
MKKNRNTVIGIFSIVFGGMIYVLWRKQSLLMFSWFYSIGLKPFVALLREF
AHSYSLVMPEWVYFSLPNALWAFGGILLFYSIWKDACSERMFWVLLFSMT
AVGSEIGQLVGIVPGTYDTTDISLMLIFIPLAITIGNKNHAIKEVSDAKG
L
>CT2092 conserved hypothetical protein
MSETKHDTLRFGIVTDIHYNPESKTGNQTQAGLERCIEHWTREGAEFVIQ
LGDLISREGPEAESDLIAVRDMLARFPGKVYHVAGNHCLAVPPERYKTIM
GLDSLYYTFSSHGIRFIVLNGMDVSAVNDPQTKADRHLLEYYRDNVKAPF
YCGAIGARQLEWLVNELDLALKNEEPVIILSHLPLLEETTDEKHGLLWNH
EELTAILFRYPNIRACLSGHYHSAAHARSDGIHFIVLPAFAGWPPGECCL
TVKITGENINIGRQDAPPLFDIPLP
>CT1906 conserved hypothetical protein
MIAEPFLFRSEGGFIRIPIWGHIPLSAPLKKILAHPSFLRLKGIRQLSFA
QQVYPGANHTRFEHSIGVYHLMKMILQRMVSNPLALELQDERLQFDDETC
RTLLATCLLHDIGHYPHAHVLEEITPAGDSSAVFAHHESLTGQFLNEEHR
DTPSIAAILHDDWQVNPDTVTEIIAGKTAHRLGKLVSGTLDPDKMDYLMR
DAHHCNIPYGSIDIERLIESFVPDPERQRLAITEKGIAPLESLLFAKYMM
MRNVYWHHTSRTFSTMLRRLLQDVADESALPMETLRELFYFNSDDRVLFE
LERAIRGLGLPAAELLDAILERRVYKRAMTIRPYHEPSMEIDPVWFAYNT
SHRRRKEKELEICAMLAGKTGKRLAGHEVLIDAPPSKDVFDYSDFRELRI
WPTKSEHRHLVQPTDSNGYVRFDDFRESVFGSDFILSFEQYTKKFRLLCR
HDLVDTLSGLEEAVVEILRR
>CT1915 hypothetical protein
MSIEQGLNSRRENDKQVESQQWFQKLIQADQYVGDIYSINYETARVIIHD
FYREKVGGIPSLSFLIATRVDPSKTDIDFKKEDASFVLLRVMDAAALPQD
KEAERIRVETAQRISGETEKHWDDAGSMDLRTKNILGFAGVQCRIIGTFF
LEENGQNGDAPLNLKFGSDISNYYPNRGLKVYKPNGKALEQIVNYADPTS
IQAHTEKYGNTERVKLGFVRYASTNRKYQQVDDVPVYIYPADLLSQKSAL
FGMTRTGKSNTTKIIAKSVFELRKNENPNDRPLRIGQIIFDPNGEYANEN
VQDNNSALKNVWQLLPNGVKANEVITYGITRHPNDLERTLMLLNFFETSN
TQIGKSIIDSILSEDSTNYIKQFCQVSFDEPDPNDRSATTRYNRRLLAYR
SVLARAGFQVPPSLRASTRGLFNQDLITALQTGRNNNPPTPEYVSAAQVF
SNPNPAWGQLANAFEALDKFIRDSSSNYTAFENAYVSRPNGSGDRWADED
LKKIIGIFQYSNGTRKIGKAAEQHSADTTSDYAEDIYNHLVQGKLVIIDQ
SSGEPELNKSSATRIMTKIFKENQRKFVQGETNIPEILVYVEEAHNILPA
GNDLDLSDIWVRTAKEGSKYRIGMVYATQEVSSIQKNILKNTANWFISHL
NNTDETKELCKYYDFADFEPSIRRAQDKGFLRVKTLSNLFVIPVQVDRFE
V
>CT0468 membrane protein, putative
MSGRFANRPFGCRQNGIESMDNLILIAVSFTAGLLMRRFSRMPEQTPVAL
NMFIIYVSLPAMVLNYLHGLDFRPSMFLPASMPWLTFGVSALFFVVAGKL
LKLPRATVGSLILCGGLGNTSFFGFPMIEAFYGKQGIVHGIIIDQLGSFM
VVSILGVTVAGIYSHGSTDVRSIVKRVLLFPSFIALIAAVLLNDVIYSAS
FANLVKRLSDMLAPLALFSVGFQFNPGHIGKSRNTLALGLAFKLAVVPAV
MFVIYVMVAGMQGLPARITIFEAAMPTMITGGIIAAEHQLDPPLANLMVS
FGLLVSFVTLALWTVVLRGV
>CT0102 transporter, putative
MSTVPEEDFSGNHDDSLPLPSGGGLKSKLLAAFPPFASRNFRLYFVGQIV
SMIGTWLQMVAQGWLVLEMTGSAFWVGVTAATSTVPTLLLSLFGGMIVDR
YSRRTILLWTQALSMMLALILGAITLSGHITIPAILVLAFLLGSVGALAT
PAIQAFISEMVERKDLPSAVALNASIFNASRVVGPVLAGFMITWVGTGGA
FIANGVSYVAVIAALLAIRPTAAPPRPAVEERPIQSIRSGIAYTRRHPVI
RAIVLFVGVVSIFGWSFMSMLPVVARQTFGIGAAGMGYLYSAFGLGSLSG
TVVVSMTSGKVRADRLVIGGILTFAVALAAFTFATWLPLALFFLYLSGLG
MLSAFATMSATVQRLVDDRFRGRVMSIYLMVLLGLMPFGNLLMGLLSEHF
GTPAALRTGAMVTIAATIFLFMSRGEITKAWQEYRTTTEA
>CT1375 conserved hypothetical protein
MIENIMSLQKGLPEEISALEEDLAFTSRQIEARKKIADERQKLRERLNSV
IHDCKEKIKSFKEKQTLARNNKEYDALSKQIEYEEKEIAQAEIQLQDISH
AEHHAQELQKKGRELIAENRYDEISEEMMPDDVLQQQMEDLGQQVRQKKE
ELESIVVETAEEVAQLKAVLSEQRSVVAQQAKRLLDKYDHLKSGSIQNAV
VKLDRQACSGCNTRVPTNRHTLIVQGGFYVCESCGRIVVHERLFDEAAAS
GQQ
>CT1199 ABC transporter, permease protein
MSKRSVKPFIPALSLLFALLAGSLIIAATGSDPIEVYQKMLRSTFTSGYG
IGQVLFRATTLIFTGLAVALPFRVKLFNIGGEGQLLMGAFAAALCGIALP
AGTPALVAAPALILVASAAGAGWAMVAGWLKVRRGVNEVISSIMLNFIAL
AITGYLLTNRFAIPSTVHTPAIVAGGWLPDFDTLFGLGWHSPANLSLFIA
LAITAGAAVLLYRSRYGYDMIASGLNPQAARHAGIDTARHTLGAMAMGGA
MAGLAASNLVLGYKHWFEAGLSTGAGFMGIAVALLAGTNPTGIIIAAFLF
AWLDYGGLAVNTLVPKDIFMMVQAITILSIISIPALFKNRLKED
>CT1759 methyltransferase, UbiE/COQ5 family
MKGQSKKQEEADDFGKVSFPKLPCPECFRDGVQGTKKSWASAQNLARQPS
RVVVVRAITKRYKTMSYTMNAAEFNEKIMKGHFRKIYPVIAAQIVERTGV
RSGRCVDLGGGPGMLGVCLAKITSLTVTVVDLMPECVELARENSAEAGVA
ERVDVVQGVAEALPFDDASIDLVVSRGSIFFWEDQQKGLAEVYRVLRPGG
WAWIGGGFGTAELLREIEAAKADDPEWNRKRRERMTQNPPEHFRAILERL
GIDGVVEHQEAGTWIIFRKPAEVEA
>CT1564 conserved hypothetical protein
MADGWRVFKIMAEFVNGFETMSRIGPAVTVFGSTRVKEGDAEYQLGETMG
KLLAETGFAVITGGGPGAMEAANKGAQSKGGASVGFNIKLPNQQRPNRYI
DYDKLVTFEYFFIRKVMFLKYSQAFIVLPGGFGTLDELSEAITLIQTGKS
QKFPVIMMVADYWGDFYGWIKKRMLDEHGFIRESDLDFIFIEDDPAKVIE
IILSFYPEGYRINF
>CT1051 hypothetical protein
MSVTFSYLAETDYPVFTLGGSTADAARRLAASGCACAPVLDGERYLGMVH
LSRLLEGRKGWPTVKEKLGEELLETVRSYRPGEQLFDNLISVAAAKCSVV
PLADEDGRYEGVVSRKRILGFLAERIHSGEGGLTMEIEVPPTGAKLSEII
ETIEKNDASILSFTSWTTGEGRIIFFRVATHDFFRLVRNMENYGYLIRYH
SAFPNAGYDELREKALEFIHYMDM
>CT2029 competence/damage-inducible protein CinA
MKAIIISIGDELLKGHRVNTNAPFIARELGNIGIPVTRIITCSDDPQAIR
DSVTLALTEAEAVFVTGGLGPTNDDRTRDAVRALLGRGLALDEPSFERIA
DYFRRRNRPVTEVMKDQAMVIEGSIAIPNTKGTAPGMIIECAPRFAGRHL
VLMPGVPAEMEAMMRLTVVPFFAPLSGAFIRHTPVMTMGIGETQLADMIV
EVEDSLPSGTTLAYLPHAAGVSLMVSTSGARREDVDAENRRVVEAIVAKA
GRFVYATSEVTLEEVVVNLLLERKLTVAVAESCTGGLLGSRLTDVPGSSG
CFLEGLVTYSNQAKVRLLGVDPATIEAHGAVSEPVAKEMARGCLERSGAD
ISVSTTGIAGPGGGTPEKPVGTVCVGIASKLPDGAVRVEAARFVMHGDRH
QNKIRFSEAALRGLLVRLKEMEF
>CT2257 membrane protein, putative
MLLSPLNWLIHLSSSLEWGVALVMLYRYGQFIGRKDVRRFALFMLPHWIG
SWFVLLYHLSGDAIMRFLEISEAINLVGSIALLYATLKILKGDEKRESKP
AKAWMGSLFGGVILVAGGSTPYSFMMGSSWFDAVLQVSSMVYLTFLVLLL
KVRKKDPEVFSGLTVAGFWFVLVFISVTVVCMYIAIHVLGYPSLSHDDFL
HGFAESLLTVSNLMIVIGIHKQRKRAEERLRA
>CT1548 peptidase, M16 family
MTETKTNQNESYPYTTVPGDALQTRIYTLKNGLTVYMSPYHDEPRIYTSI
AVRAGSKNDPAETTGLAHYLEHMLFKGTDSIGSIDYAKEHTELEKIIELY
EQYRATSDPEHRAAIYRDIDSISNVAAQFTVPNEYDKLLNSIGAKGTNAY
TWVEQTVYINDIPSNELDRWLTIEAERFRNPVMRLFHTELETVYEEKNMT
MDSDSRKLWEELFKGLFTKHTYGTQTTIGKAEHLKKPSIKNVIDYYRSWY
VPNNMALCIAGDFDPDATIRLIDEKFSKLEPKPVPEFHPPVEPEITRPVV
KTVTGPEAEELVLGFRFGGADSDDADMLTLIDKILFNQTAGLIDLNLNQQ
QKVLEGGSMLVLMKDYSVHILSAKPRDGQSLDEVKALLLEQLDLVKKGEF
PDWLVTAVINDLKLEELKAFESNRGRSEAFVDTFVWGMDWARQVNRFKRL
EKITKAEIVEFAKQHYAQNYVAVYKKHGQRKSEAKIQKPPITPIKVNRDR
SSVFAKNLLAKKSSKVQPVFVDFKKDIGYYDITPEISLNYVPNRENELYS
LYFMFDAGSNLNRKIDTALDYLSYLGTSRLSPAEFSQELYRLGAQFTVQT
SDNYVYLKLSGLKENFPQAISLLDELLRDAQPDAPALEKLKEGIRKERAD
EKLSKRKILFEAMVNYGKYGPKSPFTNVLSDEEIDKLTPEELLGEIKHFM
NYRHRVLYYGPDSPETLMTELRTMHHFGQSFQPVPVTDPFEELKTAKNHV
YVVDYDMTQAEIIMLSRGAVYDASKVPLVTLFNEYYGGGMSSVVFQEMRE
AKALAYSVFSVYRLPKEKDRHSYVFSYIGTQADKLPEALDGFNELMQKLP
ESPELFASAKAGIDQKIRTERITKGDVLFALEEARRLGLDHDIRQDVFRE
VPGMSFSDIEQFHETRFRNKPQIMLVLGKKEQLDLETLRKYGDISFLTLR
EIFGY
>CT0017 hypothetical protein
MRHFRVFLRKPKRITSAAQPLETGSSGLHFSAVNPRQAPASFFPSLRRFT
VRLTAARQDQLVSYFFTCFKGVSMSQSVMKKSVLRILPGLLCLALPLSSC
SSSKSPKATMTATPVELRYREATEKIAKRKYNDAIVILESLMFSTRATAL
EDDVLKALADSYYKKKEYILAADTYRRLLQQTPDSPYARDAQFMLAKSYE
KLSPFHELDQEYTVKAINEFETYLDQYPSDDSAQAANDLELYKNLMKVNP
DNASYREKYEAAKEELASGSPARYSQKAISELRERLAHNRFSIARQYFKL
KKYRAAEIFYDVVINQYPDTKWLESAWIGKIDSEIKQNNWFEARQSIETF
QQLYPDKAKLIEPAAKRVTAHYSNKRDPKSKE
>CT2221 membrane protein, putative
MTRYDQPYSPGGFQVMPPAIKAIIITNVIVFLFQNSAFGPALTTFGALWP
IGSHNPAGYSFHLWQPITYLFLHGSFAHIFFNMFALWMFGVEIENYWGTR
NFVSFYFICGIGAALINLLATYGSPYPTIGASGAIFGVLLAFGMMFPDRY
IYLYFLLPIKTKYFVAGYALIEFIMGLGNRTMGSGSDIAYFAHLGGMLFG
YIYIVIRRNEWTIKRMFRDFSLPKKPKGPVLWQGGGKDDDTSEAEIDRIL
DKISSRGYDSLTAEEKRTLLKAGKR
>CT2111 conserved hypothetical protein
MVMSKEKALIEIRLAGLAQGTHEFDFTCKAADFADPALAGAGFSRDVSVN
VSVEKLDGEMIVTLNTSATANLTCDLCLAPITSELKGSYRIYYGYEQAGE
PQEERDEEYRLIDRNTLALDLTEDVRETLLLSVPMKVTCKDNPDCRVFHQ
EKLSEPGEDHLPDSDWQESLEKLKNKYR
>CT1234 hypothetical protein
MNRLDISNGKPAPDMVLLALQHFGIPAAQCLVVGDTVL
>CT0705 hypothetical protein
MMTIPHEPAYWLLTAGCLCIASIPALMRSIAKGKAIAIVALWLALCITTL
WQFGLLPGVATTLISAVWGVILLVASLIFSGIKSMPNQRFEKR
>CT1020 hypothetical protein
MKKVLSLLSMLVLTPSASLLLAEPAPAAPAASSSPLIEQAEAARKEADAL
GYEWRDTAKILDSAREALQRGDQAESDKLASKALFQARAAKAQAQFMDKN
WQMMIPKN
>CT2000 CBS domain protein
MSPQSAEAIIIILLILFEGVLSLAEFAIISSSPARLRELREAGYPSASVA
LKLQDNAARFLSSIKVTAILITTLTSVLGGLFLAEPLAALFSHLAILEPY
SHPLALTVVIASLAYLTHVIGGLLPKKFALRHPEAIAVRIAGFMNKLCTI
SSPAVLLADASAALLLRAFGIEANEKPQVSDEDVMLMIRQGAKKGVFESV
EYEMISRIFRMSDKRASALMTPRNEIEWLDLERPDEELVARIKASGRSRF
PVAKGSLDELQGVVRSLDLVNFSLSSKGSIREAIRASMKPPLFVPESVPA
FHVLELFKKNRAHMALVIDEHGSVQGAITLTDVLESIVGDVPADDMEGDQ
KTIVRRSERTWLVDGMVPVDEFLTAFNLDAEKFFEENEPRYDTMGGFMMT
RLGEVPSVSDTVKWGGLTFKVIKMNGKRVGRILVEQEAKNAEKKITKL
>CT0695 nucleotidyltransferase family protein
MEIPLDMLRTICKECSVRKLSIVGSIARGDEGPESDVDLLVEFKRQGSPL
RQYMETKKRFEKLFHRKVDLIERSAMRNKRFEASVLQDEKVIYEA
>CT1909 hypothetical protein
MTEKIYTEENGEYLKNNPTWHVEDSPWKAKQILKMLNSNPINPKSIAEIG
CGAGEILNQLHLSMPNYVSFTGYDISSDAIRLAKTREKERLEFKHENFLE
TSARFDLLLMIDVFEHVDDYLGFLKLSKSKAKNTIFHIPLDISVQAILRN
KLMSRLRYIDG
>CT1391 chloride channel, putative
MSWLFKNSRIKRRVIVLTYLILRKSRYFKGSSQQFVRMTWASFLAQLNLN
QDLPFLLVAVFVGLVTGYVAVIFHDAIKIISSYLFYGTTALGLPTFNNYL
RIFLLPLIPALGGLIVGLYNAFVVKARPEHGLPSVIKAVAQKNGKIPTKN
WIHKTITSVVSIGTGGGGGREAPIAQVGASIGSTVAQWLKFSPGRTRTLL
GCGAAAGLAAVFNAPIGGVMFAVEVILGDFSVKTFSPIVVAAVVGTVLSR
SYLGNYPTFQVPEYSLVSNTELVFYFILGVLAGLTAVLFIRTFYFIEEHI
QKIEKRFRIPAWLMPAIGGLLCGLISMWVPELYGFSYEVINKALIGQESW
ENMVAVYLLKPVVVALTVGSGGSGGMFAPTMKMGAMLGGMFGKVVNNLFP
AITAASGAYALVGMGAVTAGIMRAPLTVILILFEVTGQYEIVLPIMFAAV
TSALVARLAYPYTMETYVLEKENVRVGFGIALTIAGNISVLEVMQRKFVK
FFDVTKVENIIDAFYNTRDSHFFITTPEGTFVGIIGLDEMSLVLKDGIFP
GMIADDLVKKNVTVLYDTSKLDEALKIFEISEYSTLPVVEYHSRKLLGIL
KQDEAFSYYRKQMNLIGEDAGELADQRTA
>CT0187 conserved hypothetical protein
MKLTILTDNRAAPGLTCEHGFAVLIETGGKRILFDTGQLTAIDANCRALG
IDLSDIDIIVLSHGHYDHTGNLADVLRIADRATLYLHPSALIERYSIRND
KPKPIDMPETAKQAINGLPKERVVWVTEPTRLTDGAFLTGPVPRQTTFED
TGGPFFFDPDGKTPDPIEDDLSLWIEKPEGLIVLAGCCHAGIVNTLDYIE
SITGQKRIATLIGGMHLSAASPERLNRTVASLANRDISRLIACHCTGQAA
VERFSKELPYPVEAGYAGMVVESANE
>CT1346 hypothetical protein
MTSDPHLIFADVSLDIKCGAFQSALDKLKEVERWLPESYIFHLLTARAAR
GLKRYEEAIEHLGHCCRIAPANQVAWRELIEVKTLQSQAPEPARTPAIDE
VAVEFEELSKALAGFTPPRATECFEPTPIAEQKQPFPDDASIAVPTESLA
KLFVNQGAYKKAIRVYTSLIQLNPSKADHYRQSIDKVLEKL
>CT1748 methyltransferase, UbiE/COQ5 family
MLKSWQLLKSFHCRSNSFINQEREVFMAKYKLDVNIANVNEVYDGAGGIL
WEMLMGEQIHVGAEAETDVLARKAGVTAETHLLDVCSALGGPARYLAKNY
GCRVTGLDATQRMHAEAIRRTIEAGLSGKIDYVLGNALDMPFPASSFDVV
WGQDAWCYITDKQRLIGECARVLKPGGVLAFTDWLEAGPMTDEELTALNT
FMVFPYMETLDGYAMLAEQAGLTVIEKEDLTPDFAAHVQGYLDMVQNQYR
QAIVDNYGQEMYDAVEQGIMLWRDASAAGKVGRGRLVARK
>CT0274 carbon-nitrogen hydrolase family protein
MIRLATVQFTPRLGERQANLEAIRSLLDPVEADIVVLPELCSSGYFFTSR
EELAPFAESPGGVACSFFQGLADAKRAIIIAGMPETAQGCFYNSVFVFRP
GVADPLVYRKSHLFYKERFVFEPGDTGFPVIRDEQLDISIGIMLCYDWRF
PEVSRVLALGGADLIACPSNLVTDAWRKVMPARAIENKLYVAVANRCGTE
TRGDETLLFKGCSAVYDPYGETVALADADNDRVLLAEIDPRSCRDKSFNE
FNDIFADRRPELYGAICCPRR
>CT0375 ABC transporter, ATP-binding protein
MLEVRNLSLSAGTKVLLRNTSFRIGDKDRASLVGLNGTGKSTLLRLLSGQ
LKEDGPISEGQIMKSSTTTIGYLPQEISFEGDLDKTALQYALEANKTLHE
LSEKISRMEHELALPDQDHASDEYHKLIERFSDASQDFERLGGYRMQSDA
EKILSGLGFGSADFYKKVKEFSGGWQMRLLIARLLLQNPTLLLLDEPTNH
LDIDSLRWLEQYLLNYEHSYLIVSHDRFFLDKLTTKTLEIAFNEITEYKG
NYSFYEKEKAERYTLMMSRYENDLKKMADLKSFVDRFRYKATKARQAQSR
LRQMQKLEKELQAPEEDLSQISFSFPKARPSGREVLRLEGVSKSFTLPDG
TTKTVLKNIDLEIMRGDRIAIVGSNGAGKTTFCRILADEIDFEGKRQTGH
HVSMSYFAQHQTDNLAPEKSILQEMMDAAPTSEAQRRVRDILGCFLFSGD
AVEKKIAVLSGGEKSRVALAKILLQASNLLIMDEPTNHLDMRSKEMLIDS
LENYDGTLLIVSHDRYFLDSLVNKVFEIKNGGVQVYLGTYAEYLEKAEKS
WEEEKKQQAEAQAKEEAARKAATAKSVEKKPAAPKANSKKIAAIEKEIQR
LEESKKQHEDMMAQPLFYEQSAEETHKAIAEYEELCKELDALYQCWEDEA
G
>CT1841 acetyltransferase, GNAT family
MDVEAVASMVGELLSEIMQAIGVPVFDVASDETAARLRDFLETGRYVVFV
AVDGRDEPVGFIALYESCALYAGGVFGTIPELYVRPECRGLGVGQGLLKA
AREFGKSCGWKRLEVTTLPLPEFDRTLAFYEQEGFELTGGRKLKVLL
>CT0609 sepiapterin reductase
MKHILLITGAGKGIGRAIALEFARAARHHPDFEPVLVLSSRTAADLEKIS
LECRAEGALTDTITADISDMADVRRLTTHIVERYGHIDCLVNNAGVGRFG
ALSDLTEEDFDYTMNTNLKGTFFLTQALFALMERQHSGHIFFITSVAATK
AFRHSSIYCMSKFGQRGLVETMRLYARKCNVRITDVQPGAVYTPMWGKVD
DEMQALMMMPEDIAAPVVQAYLQPSRTVVEEIILRPTSGDIQDD
>CT2045 hydroxyacylglutathione hydrolase, putative
MSVQVEQIRTGGDRNFGYLCADKATGEAFAVDPSNSPKVLVDAAARKGWQ
LVRAFCTHGHADHTNGNEEFERLTGIRVLLFGDRDARLGIEVMHGASFPL
GEGVVEIIHTPGHTLDSICLLAGDALFTGDTLFVGKVGGTWSEADARLEY
RSLHERLMVLPAGTNVFPGHDYGTAPVSTIGHEKTTNPFLLQPDAESFID
LKNNWSAYKKAHGIS
>CT2155 GTP-binding protein
MNITTADFFCSYSSLNGLPSDGRPEIVFVGRSNVGKSSLLNSLCARKGLA
KTSSTPGKTRLINYFIINDNLYFVDLPGYGYAKVGQGERESWGKLLTGYI
QKRGEIALVVLLVDSRHPGMASDLEMMEFLDYCGRPFGIVLTKWDKLKQA
EKSKASRTIESCAPNARFIVNYSSLSGSGRDRLLASIDTFTQ
>CT1908 3-oxoadipate enol-lactonase, putative
MLTFNGAAGGDAGNVLLLHAFPVSSQMWEPQLAPLAESGYRVIAPAVYGF
ESTPSRPGWSMDDYAHDLARLMEALGWKSATIVGLSMGGYQAMAFYRLYP
ELTKSLVLCDTRANADTPQAFSVRQEFRKAVMEKGAEEAAARMVPNFFAK
ETYESNPSLVEKTRESIVRQAPEEISEAMRAIAEREDSTEMLTEITCPTL
IVNGMEDIVTTPEIAATMHALIPGSKLELIPDAGHLSNLDQPAIFNGILL
EHLRSL
>CT1475 transporter, putative
MTNSTPNAKIGPIELAASVMPRHAWTFLFASFFSIGMVTFVSIGQAYILN
EHLGIPVSEQGTISGDLVFWTEIVTLLLFGPAGAIMDRIGRRPVYAVGFI
ILAIAYLYYPLVTNVFQLTIARMIYAVGVVAVTSGLATVLVDYPKERSRG
KLIAVVGFLNGLGIVILNQFFGGLPKRLVANGMSGTEAGFWMHATIAGTA
ALTAIVLFFGLKGGTPVRHEERPPIRKLLTSGFRHARNPKILLSYAAAFV
ARGDQSIIGTFVPLWGMTTGLAMGLEPAEAVKKGTFIFIISQAAALLWAP
VIGVFLDRWNRVTALTICMGLAAIGYLSLAMIGNPLETYSLIFFVLLGIG
QISAFLGSQSLIGQEAPKEARGSVIGAFNISGAIGILFITTTGGRLFDGM
SPKAPFIIVGAVNLLVMLGGMWLRTREVKERGKFAA
>CT1885 conserved hypothetical protein
MSTTENVSERTGLAFGGGVVLGAAHIGVLKAMEETGFRAECVSGTSIGSF
IAAMYAFGKSWREIEAVALELDWSDLSGLTLSGYGLLSIRKFGKIVRAQL
GSRRIEDAPLPLAIVATDICTGNEVVLREGDVATAVMASSSIPGIFKPVE
QGEMLLVDGVLTENVPVSPLKEMGASRIACIDLFGRHSFRRPEHLSDLLL
NAFYSAMRAISQIQISKADLVIAPDLSRFSLVDMSAVPEILDTGYREALP
LLESWRDAHR
>CT0138 nucleotide-binding protein
MRGEFLSRSADETREYARRFASGLKPGDTVCLTGPLGAGKTEFMRGITEA
FGCEEQLSSPTFSLMNIYEGLLRGQPFELHHFDLYRLESEKELDSAGFDD
YLSGPFLSVVEWGERFASLDRRYTRRVQLFIAGESQRKIVIT
>CT0742 membrane protein, putative
MNNLELSLIVGVGALFAGMLGSLTGLGGGVVIVPLLTLGLGIDLRYAVGT
SLVAVIATSSGAAAAYVKEGFSNIRIGLFLEVATTVGALVGAFLAGMLAT
NIIAIIFGLVLLYSAYLSTKAKEDHSDDVNPDPLAIKFKLNSSYPTEEGV
KHYSVHNVGAGFGLMWLAGILSGLLGIGSGAVKVLAMDHAMRLPFKVSAT
TSNFMIGVTAAASAGVYFQRGYINAGLTFPVMLGILAGSFIGAKLLMVAK
TKWLRLIFGVVIFALGLEMIFNGITGRI
>CT0764 TPR domain protein
MNKQQQSMNTNRSAQEMSETSPEYEQKYQQAIDCIENQEYGQAISILDEL
AGEASRDAKLRYARAVALLSNGEYRRAGTDLAFTVALDRSNLEAYRHLGF
VLLTMGKEEAAIKVLEEALRRDPCFVEAWCVLADVHMDLGEHDKALDALD
RAHELQPGNAEVHCKLAMYYMSRGDMRGLRAEYEVLREIEPDVAAQIAEL
LP
>CT1303 ABC transporter, ATP-binding protein
MVLLTVEGITKRYGLKTLFEEVSFGIDDRDKVGIIGANGSGKSTLMKILA
GSETPDTGRVMVSKEKKISYLPQVSPYDADDTVLEAVLKSGDKVMALICE
YELALEALDHAEGDQTALIEKVTHLSHELDVSGAWELESNAKAVLGKLGL
NDLTAKMGTLSGGQRKRVALAHALVVPSDALILDEPTNHLDADSVEWLES
YIRRYAGAVILITHDRYFLDRVATRMIELDGKTAKTYTGGYASYLVQKEA
EEAQEIRDERKRNALAKQELEWMRTGAKARTTKQKARLQRAETLVYAPKK
AEKQEMEIGFGAERLGNKIVEFHDVSKSWGQKKLLRSFDYLLEKGDRIGI
IGPNGSGKTTLLEMIAGRTKPDTGRIEIGPTVKIGYYDQESRHLDDSKRV
IEYIKEEAEQIKTKDGTLVSAAKMLERFLFSGAAQYNPIGNLSGGERRRL
YLLRQLIGAPNVLLLDEPTNDLDIPTLRVLEDYLDTFPGCLVVVSHDRYF
LDRTVEHIFAFEGDGIVRRYPGNYSVYLEMKAAIAAEEQAAKQKPAPAAK
PASEAPKPTLSPKQRKLNSKEKRELEQLEQAIAEAEERQEAINAELAAAG
SDFDAVQKLGDELHKIQTKLDKDMERWAELAELA
>CT0625 Nudix/MutT family protein
MTHLAEQARFIIDFIIIFSALTTSYHIGNVVCAIIEREGRFLIARRPLGK
HLARKWEFPGSKVETGESEAEALERELIEELGVRMEIVERLMPVEHCYAD
RSLRLIAFHCRIAAGAPNAGEHEELRWIDIGEADDYDFPEADLPILAEYR
QKIAASVQSLPGKRRGTA
>CT0062 conserved hypothetical protein
MSRCPEDSRTGNGRNGAPDPGELSAMVRKLEIRSRRLVNELFSGEYHSSF
KGRGIEFSQVREYQYGDDVRTIDWNTSAHKNDLYVKIFTEERERILMLVL
DGSGSMLFGSGRLKKELAAEVSAILAFSAVQNNDMVGLLVFSDTVETYIP
PRKGRAHALVILNEIFSMRQCGRKTDIDAALSFLRRTQKRKSIIFLLTDL
LGSEYERGMKLLNARHEFVLIHIGDPLDHELPHSGLLDLVDPETGERLTI
DAGSRAFLARYAKEQRAKREAVQRQLSRMKVDAVFLDTGKSIIGGLNAFF
RHRERKV
>CT1025 sulfide dehydrogenase, flavoprotein subunit, putative
MSKKIVVLGAGTAGTIVSNNLRRHLPADWEITVIDRDDDHIYQPGLLFVP
FGVQKSSTLVKSRKKYITAGINFVMDEITHIDPEKKEVKTKNHTFTYDFL
VISTGCRIAPEENEGLMEAWGKNAFTFYYKEAADQLRLRLKEFDGGKLVM
NIAELPFKCPVAPIEFVFMADWFLKKKGVRNKSEIELVTPLPMAFTKPKA
AAVFTESAREKNIKITTSFELNRVDGKEKFIESVQGDKVKYDTLVIVPTT
IGDPVISNSGIDDGIGYVPTHHNTLKALKHDGVYVIGDATNVPTSKAGSV
AHYEADVVVFNIMAEIYGAKPEEIFDGHSTCFIVYSKGTASLIDFNYKIE
PLPGKFPMPKLGPFSLLKETKMNWYGKLAFEPLYWNVLLDGKHLGMPPTL
VMAGKEVG
>CT1336 conserved hypothetical protein
MEKRLPDACLSEVIGKEQLDEALDLYRGDGIDARIARAAAEVEGLYYGKL
TRAEEIVAFARRIGAKRIGLATCVGLAGEARVFAKILEANGFEPFSALCK
AGAVDKSQIGIAEELKITPGSHESLCNPVLQARVMNEQPTDLNVVIGLCV
GHDSLFTKHSAAPVTTLIVKDRVLGHNPAAALYASGSYYKRLLEPGREL
>CT0458 conserved hypothetical protein
MEPHYRVTIFGSARISEGDEAYRDVYDIARGLAAEGFDIVTGGGPGLMRA
ANSGSKSVSNGGQSIGLNIKLPHEQCPNPYLDIKEEFDRFSGRLDAFMAM
SDAVVVAPGGIGTMLELFYSWQLVQVQHLCETPIILFGEIWTSLLLWLET
EVLPRHLFERKDMHSIFHVMEASEVVDLIIKIHKARPETEHVCRNFNKYR
LDIEQAGKK
>CT1698 conserved hypothetical protein, truncation
MHNLGGLAAGYAVATVARCDVIDRRTIAIEIGMQNSGLGVTLANQFFQPL
AALPGALFSLWHNLSGIALARHWSRKATFVASEA
>CT0273 polysaccharide biosynthesis protein
MFSKLKLLAKDTVIYGASTILARSLNYVLVPLYANKLTTFDNGIQAVIYA
NIALANVIFTYGLETSYLKVASDVIKRNEDERPLFSTAFFSLFVTSILFS
ALMLLFAPSIAVAIGLAPESGVFIRYAAAILFLDTLLVVPFAELRLKRKA
IPFALAKVMGVVGGVISTFVLILGLHAGLSGVFIGEALGSVVSLLFILPV
LKNLKPTFSPGMCRQLLGIGLPYVPTGIAGLLIHLIDRNLLIRIPQQDID
RLYGAGFQASDITGIYGRVAAFGVALQMFIQIFRFAWQPFFLQHADDPEA
KPLFKQVMNLSGIAVIVLAVACTFFVPDLVRYHWGGKLYLLPPKYWMGMS
ILPWIFFSYVFDMISTNLSAGILITGKTKYLPVVTFAGAAVTTLGCWILI
PLGGMDGAAVAILLGAAVMCLCMGWYSVRFYPISYDWGRLSLLLGAGLAF
AVWHDDLLVWLAGFGISGLLAMMVKLLIVLLYLVLGTLIFRNEASAVVKM
VQRKLRPAGSSGSR
>CT0707 conserved hypothetical protein
MKYSEAQQGRVFVIRLEDGDIFHEEIERFAKEKGIERAYLNVVGGADKES
KLVVGPEESRTYPVNPMEHELYDAHEIVGTGTLFPDDTSAPVVHLHMACG
REENTVTGCVRNGVKVWHVMEVILVELLGTQARRLPDKATGFKLLVP
>CT1422 bchC, 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
MEAKKSKAIVFSGVNQIELREVTLKPVSSTDVLVETWWSSISTGTEKMAL
NGLIPSPPFIFPFIPGYETVGRIVEAGDHVNQGLIGKFAYVAGSFGYEDV
NAAFGGASQFIVCPVESLTVLDGIANPQCGIALPLGATALHIVDLAEVKN
RKVLVLGQGAVGILAAELAKRFGASLVAVTEPHQRRLDISTSDIKVNPEK
QDVSVALAGHEFDVLIDSTGIMSAIETGLRFLKFHGKVIFGGYYQRMNID
YSQAFNKELSFIAARQWAKGDLHRVRELIAAGKINAEKIFTHQCTVDDNL
MEAYMQAFSDSDCLKMIIHWKHGNEAGEHFPTCNTAN
>CT0049 bioC, biotin synthesis protein, putative
MNGVIDKQLVRRRFRRALPTYAGHAEVQRRMAVRLVALIENAGASTHLGR
VFEFGSGSAMLTSILFERYSANEFFANDLVAESRAFVEKAVTGRNVERLT
FLPGDVERLDPLPGNLDLAVSNATVQWLHDPARFFDRLATSVKPGGIVAF
STFGAENMHEIAALGEAALPYRSLDKIAALSGELFELVAIEDDIVRQEFD
TPEAVLRHIRKTGVNGVARRAWTRSQYLDFLQRYRSAYPSGEGVTLTWHP
VYCCFRKKKS
>CT1036 bmpA, basic membrane protein A
MTYRKYSSFFFRLSSILTMFMLLLTGCAGKKKVSESNPNAYKVGLVFDVG
GRGDKSFNDLAYNGLEQAKKKLGIQFDYIEPSGEGADREAALRQMAADPD
VKLIIGVGLLFTDDITAIAKEFPDKKFACIDYNPQPGAEIPSNLSGIVFE
EKKGSFLAGAIAALESKTGIIGFIGGMDSNIIRKFESGYIEGAKYVRPDI
KLITNFIGMTGSAFNDPAKGKEIALGQYSQGADIIYQAAGASGMGVIEAA
RESKKLVICTDMGLEWPAPENMLTSINKAINKAVLTTIDEAMHGKFEGGK
QRVFGLDNRYTDYVWNSDTEKLIDQSVHERIESIRKDILDGKIKVQE
>CT1567 bmrU, bmrU protein
MGRAMGDSFTFIFNPAADKGRAADKTALIERSLAHFEVASLETTRFAGHA
AEIARAAAGEGSTLIACGGDGTLNEVVNAVAGQPVKVGVLPVGSANDFLK
TFNPSAKEHEVRIRGFAGATSRKVDLGKVEFGGGESRYFVNSIGIGFTGR
IASTVKSVKWLRGELSYAWALVSVLLGYSAVKMHITLDTVEGKIELDEPV
FAFSVSNGRVEGGKFRIAPEADPFDGLLDVCILKAVSKWRVPGYVLKYLK
GSQIHDPNVIYCKARSVEVFLSVSEAMHMDGEVIEKVGGAIAITAEPLAV
EMLYEP
>CT2006 comF, competence protein
MHLLFPEVCILCQKPLGEGEEHICAGCFNDFNPFPSVLAGGAALKSTVRA
HFGEKAVPAAAWCLYPYRSRGSLHEAMHAMKYGGLFPLGELFGKRLGELI
CQGGVPVGFDAIVPVPLHHLKRIERTYNQAEALARGMAGLIGLPVATRSL
ERCVYTGTQTGLGLEARRENMAGAFRPGRERCPARVLLVDDVLTTGATMV
SAAKVLKAAGAVEVAFATVALTEKE
>CT1382 csmI, chlorosome envelope protein I
MNLIINDKTASSSVGQTIGKAARLNHAHVGYVCGGHGLCQACYITVQEGA
DCLAPLTDVEKAFLSPRQIAAGGRMACQATIAKEGTVKVLSRPEEVRRMV
FSNPFQLIGYAADMGKDTAQQIVPGVQNLIGRIQRGEMGGKDALGDMIES
IQGAAGLVVEAIQQGPMALPIPFKEQIADLISKLPLPQIQLPSISLPQLP
SISFPQLPFSLPKLPFSLPFLPQQPQATASLEKVTITVQPPAKD
>CT0333 doc, death on curing protein
MRFLDLHEVLHIHRDQITRYGGTLGVRDMGLLTSAIAMPTAMFKGDFLHT
DIYEMAAAYLFHLVRNHPFLDGNKRVGAVSAIVFLALNGYDFEAPENDLV
EMVYGVARSEFEKSDVALFMRRWSVKW
>CT2116 fabG, 3-oxoacyl-(acyl-carrier-protein) reductase
MFTGKTAVVTGAARGIGQSIALDLAAKGADLVIGDIKAEWLTETEEALKQ
LGAKVSCKELDVTSTDACQKVFDEVAKENGRIDILVNNAGITRDGLLMRM
SEEDWDAVLTVNLKGVFNCTKAVTRTMMKQRSGSIINIASIIGLMGNAGQ
ANYAASKGGVIAFTKSIARELASRNVRANAIAPGFISSKMTDALSEEVRQ
KMLEAIPLGVFGTPQHVADAVAFLASDQSAYITGQVLSVNGGMYM
>CT1015 fccB-1, sulfide dehydrogenase, flavoprotein subunit
MGNTISRRTFNRLLISGLAGSSLLMSGGPLMASAPKAHVVVIGGGFGGAT
VARYLRQLDPSISVTLVEPKKVFHTCPMSNWVIGGLFSMQNTAHTYHALR
SRYGVEVVQEMATGIDPVKKTVKLKGGRMLSYDRLVVSPGVDFIWDAIEG
YSRDVAESSMPYAWEAGPQTLLLRRQLLGMKDGENVIICAPKNPFRCPAA
PYERASLIAYYLKKSKPKSKVIILDDKEVFTKQDLFMLGWDRLYRGKIEW
RSASAGGKVERLDPAKMTVATEFGDEKGGVINVIPPQKAGRIAVETGLAD
TSGWCPVNPANFESLQHPGIHVIGDAALVGTMPKSGTAANTQAKALAAWL
VASFGGGNAGEHDLASLCYSLLAPGYAISVAGGYIQSPEGIKDNPDTVHL
TSMEATTAQLAGEAEQALQWYHNISQDTWG
>CT2081 fccB-2, sulfide dehydrogenase, flavoprotein subunit
MSISRRDFNKLLLAGAAGSAFGLFGSGNTAFAARKRVVVIGGGFGGAATA
KYLKKLDPTLAVTLIEPKPAFVTCPFSNWVLGGLRTMKDITHTYTALRTR
HGVNVIADRVVSVDAAKGTLRLAGGRVIGYDRLVVSPGIDFKYDTIPGYS
QKIAKSKMPHAWQAGPQTILLHRQLQAMKNGGTVVICPPDNPFRCPPGPY
ERASLIAYYLKQHKPKSKIVILDAKEKFSKQGLFTKGWESRYPGMIELRG
STGGGKVLGVDAKAMTVETDLGAVKGDVINVIPAQKAGKIAFEAGLTNEK
GWCPVNPSSFESTIHQGIHVIGDACIAGAMPKSGFAASSQGKVAAVAIIN
LLRGQEPAPPSLVNTCYSLIGPKYGVSVAGVYQLSPTGIVEIPGSGGRLR
PMPATNSWNRRRFSPKAGTPISARISGDKPLDSCGVPAVNHQAPLPLTHK
ATPDCRKR
>CT1411 glnA, glutamine synthetase
MSNESKKPVASYYGALTFGTEAMRAKLPKEVFKALQDTIKAGKKLPADIA
GVVAHGMKEWAMEHGATHYTHWFQPMTGTTAEKHDAFLTTQMDGTVIERF
SGEQLIQGEPDASSFPSGGMRSTFEARGYTAWDPSSPAFLMKGGKGMTLC
IPTVFISYHGEALDEKTPLLRSMDAVSKAAIRLLDTIGITGVTKVNTYAG
PEQEYFLIDKKFYAQRPDLIMTGRTLLGALPPKGQQLEDHYFGSIPDRVL
EFMQEVEEELFLLGIPAKTRHNEVAPHQFEIAPIFEQVNLASDHNLLVME
VMRKVADKKGFALLLFEKPFAGINGSGKHNNWSIGIDGGMNLLDPGDTPE
SNISFLVFLVAVLKGVLKRSAILRASVASIGNDHRLGANEAPPAVITVFL
GDLLEKVLDAIESGKVDLKTEKQILDLGLSHVPVLNKDYTDRNRTSPFAF
TGNKFEFRAVGSSQPISVPNMVLNTIMAEALDDLNAEILAKIEGGMAKED
AILAAVRDGIIATKAVRYPGDNYSEDLQRAAAERGLPNMKNTPESVRAWT
DKDTVSMFVKYGVLTAEEIESRYNVRIERYVKGIDIEARTLLLMIKTMVI
PDASEYQGDLASSFNNLAAAAESIGLSDAALQSQAGLLKTLAEDLSKLID
LTAILEETIEEMEEQESELDKADFCSARLLPCMNAIREVADKIEVQVDRS
RWQLPTYSEMLFEH
>CT0473 gltD-1, glutamate synthase, small subunit
MAIPRQKMPAQDPVERVGNFKEVNLGLTPEQAQQEALRCIQCKDPVCIAG
CPVNIKIDQFIKLIAEGDFMGAVRKIKEDNVLPSICGRVCPQEDQCEKVC
VIGKKHEPVAIGNLERFVGDYERTSGQKIDPKIAPPTGKKVAVVGSGPAG
LSCANDLAQYGHKVVVFEALHELGGVLMYGIPEFRLPKEIVREELDGLRR
MGIEFRTDVVVGRTITIDELMEEEGFDAVFIGVGAGLPWFMGIPGENLVG
VLAANEFLTRVNLMKAYDFLKSSDTPVFDCKGKNVAVFGGGNTAMDAVRT
AKRLGAEHAYIVYRRSEKEMPAREEEIHNAKEEGIEFLLLTTPLEFVGDE
KAWLTGAKCQKMELGEPDDSGRRRPVPVEGSEYILPIDMAIISIGNGPNP
LIHQTTPDIEVSKRETIVVDVNTMQTSKENVYAGGDIVTGGATVILAMGA
GRKAAAAINEKLGGTAKNFNEW
>CT0402 gltD-2, glutamate synthase, small subunit
MGKLKGFMEYRRALPVDREPLERIKDWNEFHEEMSAEQLSDQGARCMDCG
TPFCHSGFMLNGMTAGCPIHNLIPEWNDHVYRGFWRDAWERLMKTNNFPE
FTGRVCPAPCEGSCVLGIIQPPVTIKNIEYSIIEHAFAEGWVEPKQIAVR
TGKKVAIVGSGPSGLACADQLNKAGHTVTVFERDDRVGGLLMYGIPNMKL
DKRLVVQRRVDLMKEEGVSFVTGTEVGVNYPVDKLLSEYDAVVLCIGATN
PRDLNADGRNLDGIHFAMEFLRASTKAVLDGTEPVLSAKGKDVVVIGGGD
TGTDCVATSLRQGCKSVIQLEIMPKPADFRQEDNPWPEWPKVFKVDYGQE
EAAAVQGGDPRRYLMMTKKFIGENGRLSAVEVSKVEWIKQEGRTIPVPVS
GSEEIIPAQLVLLAMGFLGPEAQLLQSLGVEQDSRSNIKADEKSYRTSVD
KVFAAGDARRGQSLVVWAINEGRAAARECDRFLMGCTSLP
>CT0889 gph, phosphoglycolate phosphatase
MMNHSVTQKFSAVVFDMDGTLLDTLADISYSLNSVLEEEGYPTHPVEACR
AMVGFGMRELVRKALPESAHDEAITEPLLKKLQARYAEHWNDSSRPYDGV
VELLDAIDRLGLKKAILSNKPDRFTRQCAEELLAPWKFDVIMGFREGIAP
KPDPTGALLVAKELGVEPASILYVGDSGVDMKTANAAGMYPLGVTWGYRP
GDELLATGAAKLVSHPTEIIPLLTA
>CT1293 guaB, inosine-5'-monophosphate dehydrogenase
MDKILYDALTFDDVLLVPAYSNVLPKETVVKSRLTRQIEVNIPLVSAAMD
TVTEAELAIALARAGGIGIIHKNLSIDEQARQVAKVKRFESGIIRNPIHL
FEDATIQDAIDLMIRHSISGIPVVEHPTPEGCLLLKGIVTNRDLRMTASS
DEKITTIMTTNLVTAKEGIDLLTAEDILMRNKIEKLLIIDDNGYLKGLIT
FKDIQKRKQCPDACKDSQGRLRAGAAVGIRANTMSRVDALVAAGVDVVAV
DTAHGHSQAVLDMVATIKQKYPELQVIAGNVATPEAVRDLVKAGADAVKV
GIGPGSICTTRIVAGVGMPQLTAIMKCAEEAKKTDIPLIADGGIKYSGDI
AKALAAGADSVMMGSVFAGTDESPGETILYEGRRFKAYRGMGSLGAMSEP
EGSSDRYFQDVSAETKKYVPEGIEGRIPAKGKLDEVVYQLIGGLKSAMGY
CGVRTITELKENTRFVRITSAGLRESHPHDVMITKEAPNYSTSA
>CT0920 hdhA, 7-alpha-hydroxysteroid dehydrogenase
MRLQGKIALVTGAAGGIGSATARCFAREGATVVLVDIDLEACSRVCDDIA
QSIGQASCSGVDLTSEKQVVELFTNIRRDYGRLDIVVNIAGGDCEPAASV
ETIDMEMAMKNLDMNLKSCMLCCREAAKIMKPQAYGRIVNMSSLVWRGSP
NQFSYSASKGGIFAFTRSLALALGAFNITANALAPALVEVEAFTRALGPE
RWQALAKASAERYPLGRIATPDDVAKAALFLASDDASFITGQILEISGGA
RL
>CT1384 hflX, GTP-binding protein HflX
MTTIPSPEPRERAVLVGITSTPDIPRHLVEEYLDELKFLADTAGADVITS
IIQEKKQPDPATCIGSGKAEDLAGLVEADSIDIVIFDDDLTPVQVRNLER
ILKCKVIDRTGLILQIFAIRAKSAQARTQVELAQLEYLLPRLSGAWTHLS
KQKGGIGTKGPGETQIETDRRLVRNRIASLKKKLRAVSLQHDTQTRGRAA
VPRVALVGYTNAGKSTLMNALCPEAGAYAENRLFATLDTKTRRLELKINK
LVLLSDTVGFIRKLPHTLVESFKSTLDEVLQADFLLHVIDVSHPGFEEHM
QVVRETLKEIGVKHDHIIEVFNKIDALDDPAILTGLRGKYPDAVFISAVR
GLNLSALKETIANYVARDYKTRKVKTHVSNYKLIGYLYDHAEVIDKKHVD
EDVLLTIRVHRNNLKQIDAMLKASASKNHAAANLQHHETHD
>CT1799 hupA, hydrogenase expression/formation protein HupA
MHEMSIAMSVVEAVVDKAREEGGGKITGIDLVVGRLAGVEVESLKFCFGA
AARGTLAEGAELVIEEPEGRGRCEACGAEFPVTSFYAKCSACGQFRVKIE
SGRELAVRSFTIE
>CT2086 mazG, mazG protein
MKHEANPSIETLKESVLKHNAVTPAEHFERVVNLVRVLRSECPWDRKQTP
ESLAHLLLEESYELVHAIDTGDDPELKKELGDLFLHVCFQVLLADEAKKF
SFIDVFEALCHKLISRHPHVFGDVKAETEQDVLGNWENLKMKEGRTSLLD
GVPKAMSELLRAYRVQKKVAGIGFDWPSDEGVLDKLTEEIGELRNAASKQ
EREEEFGDLLFTIVNYSRFIDTNPEDALRKATNKFMDRFRKVEASVLASG
KSWKEFSAEELNGLWNEAKKAK
>CT1847 menC, o-succinylbenzoate-CoA synthase
MKPLHADICRYEMDFTAPVTVRGVLLARRQGLLLRLKSEGVTAYGEVAPL
IGLHTESLDEALQALATFIPELSRLDWNASDGRQRLLDEAALPPSVTTGI
EMALINLEATERSSLPSFTDEFPPASKIPVNALLAGDPQAVLNRAAKRYA
EGFRAFKLKVRKGELDGAVACIRALHEAFGDKAELRLDANQSLEFDEAVA
FGKALPPGCVAYIEEPLTDAALISDFHAATGLPSALDESLWQRPELLDEI
GPDPLGALVLKPNCIGGIAKSLDLAAKAHRMGLQAVYSSAFESSVSLGLY
ALMAAVSSPAPAASGLDTASFLARDLTATPFATPDGFADPAAAWRDSLRV
RPDMIETVKSWSL
>CT1845 menH, thioesterase, menaquinone synthesis gene
MTTISLHLTTVGDPALPKIVFLHGFLGSGSDWLSFARKLENRFCSILVDL
PGHGEAGIPADGDPKLFFMQTVEALKSNIRRLRAEPCVLVGYSMGGRIGL
ALALLYPELFSKAIIVSSSPGLQTDEKRASRRKSDEGIARKIERNFEGFI
GFWYDQPLFSTLKSHSLFREVEAQRKQGTPQNLARALRLLGTGNQPSFWD
KLPGNRLPMLFCVGEKDAKYVDIAKQVVELCPSSSLELFEHCGHTLHIEE
PERFLASVERFIETHPHNSISHDDL
>CT0969 metS, methionyl-tRNA synthetase
MTHIPKRTLVTTALPYANGPVHLGHLAGVYLPADIYVRYKRLCGHDVIHI
GGSDEHGVPITITADKEGISPQEVVDRYHTMNAEAFAKCGISFDYYGRTS
GPVHHQTAREFFLEIEKKGIFVKKTEKQFFDPKAGRFLSDRYITGTCPVC
KTPGANGDQCEQCGTHLSPTELIDPKSKLSDATPELRETLHWYFPLGRYQ
KQLEAFVERHTGDWRSNVVNYSRTWLNQGLADRAITRDLAWGISLPLDSE
EAKGKVLYVWFDAVLGYISFTKEWAEKQGDAELWRRYWQDPETRIINFIG
KDNVVFHTLMFPAILMAWNEGRSEGRYELADNVPASEFMNFEGRKFSKSR
NYAVYLGEFLERFPADTLRYSIAMNYPENKDTDFSWSDFQNRTNGELADT
LGNFIKRSIDFTNSRFGGQVPADIDLEAWDSLGIDWLASFGKLEAAYDGF
HFREATAQTMEIARFANRFLTESEPWKVIKVDPEAAGRTMAVSLNLCHTL
ALLFWPIVPETANRIWKMLGFEGTIDELVEPGNPVWRQALEPGLKKGHKL
LGSSEILFSKIEDKDIEPEMKKIEALLAEAEQREAAKQPVPMTFKPEITF
DDFQKIDLRVAKVVACEPVKKANKLLKLQLQVGSEQRQVLSGIAQYFTPE
QMVGKNVVLVANLADRTMRGELSQGMILTVEGADGRLFLLEPQGEGINGN
SVS
>CT1543 modE, molybdenum transport protein ModE
MSKTKNPIGIEGSIWFQKSQSRFLGGDRIALLEKIDELGSINSAAKAVGI
SYKTAWHLVNMMNNLSEKPLVDRMTGGKGGGGTVLTREGRQVIEKYRIVQ
EEHRKFVENLEERLGDTGNLYQFLRRISMRISARNTFSGVITELTRDAVN
AEIIITLNGGQQIVSTITNGAIDNLGLKKGMSAYAIVKSSSVMVGRDLQD
KKLSARNIICGTVQRVIEDSVNSEIDIEIGGGNSISAIITETSTSRLNLK
EGEQACAIFKASDVIIGVN
>CT1540 nifB, nifB protein
MTLNIKNHPCFNDSSRHTYGRIHLPVAPKCNIQCNYCNRKFDCMNENRPG
ITSKVLSPRQALYYLDNALKLSPNISVVGIAGPGDPFANPEETMETLRLV
REKYPEMLLCVATNGLDMLPYIEELAELQVSHVTLTINAIDPEIGQEIYA
WVRYQKKMYRDRQAAELLLENQLAALQKLKRYGVTAKVNSIIIPGVNDQH
VIEVARQVASMGADILNALPYYNTTETVFENIPEPDPMMVRKIQEEAGKL
LPQMKHCARCRADAVGIIGEINSDEMMAKLAEAALMPKNPDEHRPYIAVA
SLEGVLINQHLGEADRFLVYALDEEKKSCTLVDSRQAPPPGGGKLRWEAL
AAKLSDCRAVLVNSAGDSPQSVLKASGIDVMSIEGVIEEAVYGVFTGQNL
KHLMKSSQIHACKTSCGGDGNGCD
>CT2213 obg, GTP-binding protein Obg
MKFVDSAKISVKAGDGGRGCVSFRREKFVPKGGPDGGDGGRGGHVYLRAN
KQLTTLLDFKYRKSYIAGRGGHGLGARKSGKDGKDVIIGVPCGTVVRNVE
TGEVICDMVEDGQEIMIAKGGRGGWGNQHFATATRQAPRFAQPGEPGEEY
ELEMELKLMADVGLVGFPNAGKSTLISVLSAARPKIADYPFTTLVPNLGI
VRYEDYKSFVMADIPGIIEGAAEGRGLGIQFLRHIERTKTLLIMVPSNTE
DIAAEYATLLKELEKFDPSLLSKPRLVVITKMDIAPEDFTMPELEKGVKV
LAISSVAGQGLKALKDELWRQVSLQNQSPSEHAGS
>CT0730 pheT, phenylalanyl-tRNA synthetase, beta subunit
MKISVNWLKEFVPSLSFDCSGLVDYLTFLGLEVEDVFEQKLPDQKVIVGK
IVEVRPHPNADRLRICMVDTGEGELRQIVCGAPNVEAGMMVPVATIGAVL
TAVSGETFTIKPAKIRGEHSSGMICAADELGLSDDHDGVMVLDEACEIGQ
PLARYLETDTVLDIAVTPNRPDALSHLGVARELADCNEIVYPQAPVIEFT
RGGGLIEVQDEESCPYYTATVIKGVTVGPSPRWLARRLEQIGLRPKNNIV
DITNYILHSFGQPLHAFDYHQLAGSRIVVRSDAESSFMALNKVEYQLQPG
MTVVCDAREPVAIGGVMGGLHSAVTDKTTDILLEAAYFNPASVRKTAKQL
QLSSDSSYRFERGVDPCNVKRAAEYAIAMILEIAGGNVDSAEAWGDMPAA
QKIVSLRPKRVNAVLGSSITASRMVRLLEKICIKAVSQEAVSDDVDSIAF
SVPSFRVDIEQEIDLIEEVARLYGYNNLEPAPVMVSSYPVSRKVPEYFPD
YLRSIMIGLNFREVLTNPLIRKAEADCFSSMLVNVLNPISEELEVLRPNL
APSLLKVVGYNMRHGNRELRLFEVAHGFEKQPEAGRGNEGPLSAFLEKEL
LSMVITGRREPRSWNRQDENVDFYDLRGVVEMLLEKLNLLEKSAFNIYNA
RTIGIEITSTENGKTSVLKAGTVQQVNREVLDVFGLDQDVYLAELDVTLL
ERCFESGVIYEPPSKFPVVERDLSFVLPRHIPAQRLIDLAKASDPRVRSV
RIFDVFDRGTTQGEPSTRSVALSLELADRSGTMNEEAISAVISKVIDNAR
SELGAVIRQV
>CT1441 pucC, pucC protein
MERVRKKSRNPESEGTVKQDLEMRASSFLASALPASSYRGVISASVHGTW
RSTPKMIFDAHEEAPFENHDNQIHPDFQGWPQDAILEKARVLCKLWPCLM
KRLSDSSNPFPRFMNKFFRIFNLVRLSLFQIGFGIMLGFVQDILNRVMIK
ELFLPATIALGLISLKELLAILGVKVWAGNLSDRYAIFGYRRTPYVLIGL
VSCIVSFILAPTTAYEVRLDGTGSLVSIIFSALGDVGLWKLSAIFLVFGF
GLQVATTAYYALIADMVDEKDIGKIAGASWTLMVLTAIISNYSIGSYLKV
FTPERLTQVAEIGGLVALTFGLIAVLGVERRNAGVGVHKEKHSIPFSQAI
RLLASSPNTMLFALYIFISIFALFANEVVMDPFGAEVFGMQVSETTKLFK
PVMGGTQLIFMLLTGFLLSRIGTRRGAYFGNVFGAVGFGLIIAAGFMHDV
QFLRIALVVTGIGLGAASVSNITMMMNMTAGRSGIYMGLWGTAQSLAIFI
GHSSAGVIRDLVFHFSGNHMLAYAAIFVLEIIAFTISSLVLPHVSREAFE
AESAEKMLELEAAAEAG
>CT1778 recX, regulatory protein RecX
MDEGKKSSALDHALRLLAGRAHGRAELESKLKKKGFDSESIAKALARLDE
LNLTDDRAFAQSCTASMARRKPEGRLKTRARLKQKGLPDNIIDEALNGCD
QTELCRSAAEKKLRTLPASPDQKKKKLITFLKNRGFDWETIRETVKLVLG
EESARSDQLD
>CT1557 surE, stationary-phase survival protein SurE
MTTKPQKPHILVCNDDGIEGLGLHALAASMKKLGSVTVVAPAEPQSGKSH
GMTLGEPLRIRRYQKNNRFFGYTVSGTPVDCIKVALSHILDAKPDLIVSG
INYGSNTAMNSLYSGTVAAAREGAIQNVPSLAFSLTTYENADFTYAAKFA
RQLAREVLRRGMPPDTILSANIPNVPEKEIRGILFTRQGRSRWEESTIER
HDMYGNPYYWLAGSLQLHDNDLAEDEYAVRHNYVAVTPITCDMTDHRFRS
ELETWGLQNTIKK
>CT2084 thdF, thiophene and furan oxidation protein ThdF
MSPSDLHLPVPGHPIAAIATPVGVGALAIVRISGAGVLDLADRVFRKVHG
SGKLAEAAGYTAHFGRLYDGEEMVDEVIALVFRAPRSFTAEQMVEFTCHG
GPVVVGRVLRLMLDNGCRLAEPGEFTRRAFLNGRIDLLQAEAIGEMIHAR
TESAYRTAVSQMKGDLSVRLGGLREQLIRSCALIELELDFSEEDVEFQSR
DELTMQIETLRSEVNRLIDSYQHGRIVSEGVSTVIAGKPNAGKSTLLNTL
LGQERAIVSHMPGTTRDYIEECFIHDKTMFRLTDTAGLREAGEEIEHEGI
RRSRMKMAEADLILYLLDLGTERLDDELTEIRELKAAHPAAKFLTVANKL
DRAANADALIRAIADGTGTEVIGISALNGDGIDTLKQHMGDLVKNLDKLH
EASVLVTSLRHYEALRNASDALQNALELIAHESETELIAFELRAALDYVG
QITGKVVNEEVLNTIFDKFCIGK
>CT0697 thiH, ThiH protein
MIALPAWLTDERLSEDIEPLLRQTDNESLERLAAEAQAVTLRRFGRVISL
YTPLYLSNFCSSGCVYCGFASDRRSPRRKLDTDEIEKELLAMKALGVSDV
LLLTGERTNSVGFDYLRRAVDIAARHMPRVAVEAFPMSVAEYRGLAECGC
TGLTIYQETYDPDHYRELHRWGPKQDFLERLETPERAITGGIRSVGIGAL
LGLSEPVGEALAVLRHARYLCKTYWKAGVTVSFPRIRPQEGGFQPSFTVS
DRFLARMIFAFRIGMPDVDLVLSTRESSNFRDGMAGLGITRMSIASRTTV
GGYVEKETAGASQFEVSDNRSVEAFCAALRAKDLEPVFKNWDAAYNNPLP
AEECT
>CT0192 trpB-2, tryptophan synthase, beta subunit
MSTEPTKILLSEDEMPRQWYNIQADLPSPMPPPVGLDGNPIGPDALAKVF
PMNLIEQEVSTERWIDIPEEILGILKLWRPSPLYRARRLEAALGTPAKIY
YKNEGVSPAGSHKPNTAVAQAWYNREFGIKYLTTETGAGQWGSALAMSCK
LIGIECKVFMVRISFDQKPFRKIMMNTWGAECIPSPSPLTAVGRRILEED
PDTPGSLGIAISEAIEQAVERDDTRYALGSVLNHVMLHQTIIGLEARKQF
DKIGRYPDIVIGCAGGGSNFAGISFPFLYDKIHGKDVQVIATEPEACPTL
TRAPYAYDSGDVAMMTPLLPMHSLGHTFIPPAIHAGGLRYHGMAPLVSHT
KQLGLIEATALPQTECYEAALLFAHTEGFIPAPETSHAIAQTIREAKQAK
EEGKEKVILMNWSGHGLMDLQGYDAYMSGKISDYPLPEELLQRSIAASLE
GHPPVPGC