TitleGenColors Logo

Gene list

Applied filters:

Organism: Chlorobium tepidum TLS, TLS
Gene type: CDS

Number of genes found: 2255

Free access
Sort by:

 



# Chlorobium tepidum TLS, TLS

>CT0937 nitroreductase family protein
MTITTSERNALYKVIYSRRDVRGQYLPDPVPEEVLRRVLDAAHHAPSVGF
MQPWDFVVVHDLAVRKQIKEGFEVAHAEAAEMFEGEKREQYRTFKLEGIL
EAPVGICVTCDRSRSGEVVIGRTANPEMDLYSSVCAVQNLWLAARAENLG
VGWVSIIHHDHVRRVLGIPEHIVPVAWLCIGYVSFFHEAPELEQAGWLPR
LALDDLLHQEQW
>CT0041 hypothetical protein
MNVLKSAEKPFAGFVGRPGKILGKEDGSNMASYELGVVDVRERRPFGFDP
EPFRKASPQANSPEIFDIDSLLRLRTFVYLLLGTVLFLVQINNTLAINDL
AKRNERLREQLRISTSISTAEKLKSRELQSIRYISGYAKNLGLDSSFIPP
VEIEP
>CT1655 ferredoxin, 2Fe-2S
MQIQNESPYIVHVFVCTNDRGGERKSCADNNSQLVKDQLKKAVDGKGWKG
KVRVSTSGCMGVCGEGPNVMIYPQKLWFSRVSPDDVDAVLSVIERLMSEA
>CT1269 hypothetical protein
MPFFRFIKEMKQSFMLHPVAAHFSNGVIPVAVLYLVLFLPTGNPFFEHTV
VHLLLVSLLAVPFSFYSGIRDWKTKYKGAKAPVFQTKIRLSILLLVAGIL
AAAIRLAVPDVMHEGGPLSWLYVATLLVMLPTVVLLGHHGGKLAAGQRSE
RFR
>CT1168 carbohydrate kinase, PfkB family
MTPEKIEQIFRSFKEKKIAVIGDVMIDKYIFGHVSRISPEYPVPVVDVNR
ESSRLGGAANVAVNIHALGAEALLIGVTGDDTERRNLEALMTEHGLNPAM
LAADSTRPTTCKTRILSQNHHITRVDYESRTPVDAELEKRLLGMFMEIAN
AVDAVVLEDYNKGVLTPSLIVSLIAACRERNKPVLVDPKLKGFFSYGGCS
VFKPNLSELAASLGIPVANDDREVEQACLLLGEKLTVESLVVTRSEKGMT
VYDGSFTHIPALSLDVADVSGAGDTVIGTLALGLASGLDLVTSTRIANLA
ANTVCQEVGAVPVRPDKLFSACLAHLQ
>CT0552 ABC-type export system, ATP-binding subunit
MSGSVIIKASNLVKWFGEGEARTVAVKQASFEAQYGEMLYIVGPSGSGKT
TLLSMISGILRPNEGSVVVENQDIWTMKDDDLADLRLNKIGFVFQDYHLF
PRLTTVENVAIPLILKKMEWNAAMKLARDYLDIVGLKDRADLQPVKLSGG
EQQRVAIARAIAGQPDILIFDEPTASLDGDTGRKIVEFVKTNILNEKRAI
IIVTHDSRIYEYADRIMKMEDGKVVGYESGYENSQRHEK
>CT0426 hypothetical protein
MLKMYVDYWVAVLSGFLQQYFGVKSQKGVTMIEYALIASLIAVAVIAVLL
TVGSNLKTVFSYVGSNLTT
>CT0553 ABC-type export system, membrane fusion protein
MRNKLLIGIAIFGILLGLGAAFFMKLRPKSEPPAFTPASNPYRSGIYANG
IIESAQKNGTNVNIYPEVSGTVLKIFVHEGQQVKAGEPLLALDDAVQRST
TESAKASIAVAEATLANVQAQYAKLKASWNLDPRSVSKDALDTAANAVRA
GKASLELARKQYDAARALLGKYTLTAREDGTVLAINTSVGSYISSQGGYN
SYTGGYVPVIVMGNAENALQVRCYVDEILIQRLPKPDRMSAIMSIRGTTL
KIPLTFERFEPNVTPKIELSSQRTERVDVRVLPVIFSFKKPANLTLYPGQ
LVDIYIGEQAAKNQ
>CT0375 ABC transporter, ATP-binding protein
MLEVRNLSLSAGTKVLLRNTSFRIGDKDRASLVGLNGTGKSTLLRLLSGQ
LKEDGPISEGQIMKSSTTTIGYLPQEISFEGDLDKTALQYALEANKTLHE
LSEKISRMEHELALPDQDHASDEYHKLIERFSDASQDFERLGGYRMQSDA
EKILSGLGFGSADFYKKVKEFSGGWQMRLLIARLLLQNPTLLLLDEPTNH
LDIDSLRWLEQYLLNYEHSYLIVSHDRFFLDKLTTKTLEIAFNEITEYKG
NYSFYEKEKAERYTLMMSRYENDLKKMADLKSFVDRFRYKATKARQAQSR
LRQMQKLEKELQAPEEDLSQISFSFPKARPSGREVLRLEGVSKSFTLPDG
TTKTVLKNIDLEIMRGDRIAIVGSNGAGKTTFCRILADEIDFEGKRQTGH
HVSMSYFAQHQTDNLAPEKSILQEMMDAAPTSEAQRRVRDILGCFLFSGD
AVEKKIAVLSGGEKSRVALAKILLQASNLLIMDEPTNHLDMRSKEMLIDS
LENYDGTLLIVSHDRYFLDSLVNKVFEIKNGGVQVYLGTYAEYLEKAEKS
WEEEKKQQAEAQAKEEAARKAATAKSVEKKPAAPKANSKKIAAIEKEIQR
LEESKKQHEDMMAQPLFYEQSAEETHKAIAEYEELCKELDALYQCWEDEA
G
>CT1868 hypothetical protein
MRARAAVYRCVVDLDSKHHRDGIQWKVVDLPQMQSQALKACEDGWHSLAM
DEASGALDYYRSEWLGPSKECVRAGREASAGKYCWKYGNGHDAFRDLEPL
LRCGRRIFVFGEPFRNGKGVHNIHQNQGDPPGSRWAAENGPWQDGAVMVE
RPDASVAAFLSKFSTQRFCVDEVCA
>CT1452 hypothetical protein
MKAAGRYVTIFFLYFLILLFYARRSCGEAVV
>CT0725 hypothetical protein
MMNIGGLFRKTGVVFALFVLLSGRPSVLWAEAWKFGVMGDTQWTTADPSG
QNPHTVPVSIIRQVNRQFIDAGVKFVIQVGDLSDDGKEISEEERVAAAQP
LIDAGIGFFAFRGNHEAKSAENGYGAPGFRQRYPQNRDGGFTKSDGGSFT
VGSNFSSPVKISRDLDGLSYSFDFGEGQERARFVIIDNWPLPGRLVANST
HYPSGYTIADQQPWISAQLDKHNRKTPHVFVLSHQPLIGEGHQDTLFSGF
ANEHPEWQNRFFESLQSNGVRLFICGHDHIHQRSVITSPDGKSKVEQLIV
QSNSSKFYTPKSLDDTNWFGQKSREISVSQERESVGYYIFTIDGPSVTVD
YWADDHGHWQSDANFPQGAGRADTGVTPQFHFVKKERWSYSLNGKQFLVP
QGASYTVVRDRFNGCEARILDGLNTSESRDASLDSTGQGRPLTKVVNTGW
IAAKPSRHSGKNPAIPVFQLSGLGESGSDHTDHYVLSMSYDPATVRQKNI
ASGGFGLVTQDAAGRWLNAVEANSGGAATFIDGPWRRGYALGSHGIDRKH
HRAWAVLNHEGAFAVANFSAH
>CT0858 hypothetical protein
MYMIYPPLFLSEAGKYVSEIIEKNAFLYKRSSISYYVPYKFSAKTAPANR
ATLIPVRQQK
>CT0164 hypothetical protein
MSWGIENSKKRKDILDLMDIGVRKTMANPNYVNEVPKSTNGCWMDQIYGL
MRCDICDLSSQCPVREEEEWQAWLKEHNIVIEKKKAE
>CT1403 hypothetical protein
MGRGVACSGAVCRGVTGSHAMVIDNLLWSVAIGWWIRELIAIRTGGRRAR
SPSS
>CT2237 hypothetical protein
MIRLTASNNPDTRTEELIADVDTLDAWRVVLFNDDDHTFDEVIFQIIKAV
RCTRSIAEKHTWEVHTRGRAIVYAGEMSNCIRVSAILEEIALKTEIQTG
>CT0784 hypothetical protein
MLVVNIKNARVVLSGSGNFRFTRRDSGVAKISFTKKTI
>CT1966 hypothetical protein
MQKVYKHVLSGLLLLMLCILAAGSFGVGAAGNSDDGATTTNYKIGETAHV
GYMSYAVWKAFYRNQLSDNPYINQPPDAAYLFVDITVRNDDKEARTIAPF
KLIDENGAEYETSSNAWSVDGSIGILDSLNPGVEKRGYIVFDVPRGKHYK
LEVSGGYWSSDKALVDLGLK
>CT0580 hypothetical protein
MCRISIRRKNIVSMYQKIFKVFLTAIGLVKMRLGVLFSIFLQYSQN
>CT1541 ferredoxin, 2Fe-2S
MDKPKHHIFVCASFRAQGAPQGMCHKKESLNLIPYLESELADRGMSDVAV
SATACLNLCEKGPVLVVYPENFWYGEIDSEDKVDEILDALEEGQACEDHI
IN
>CT0838 glycosyl hydrolase, family 65
MTGSSLLDDGKPVDELDSLFELSPEEWLLRKKGFRKSPKAIQINETLLTT
GNGYLNVRGSLEELPPGHCGGMYLAGVYDKSEADVEELVKCPMWTDVSVW
HEGEKFCLSCNRALEHEQVLDMKKGILHRRTTFKNQHGKILTLETSRLVF
MHDPHRGYMRVKITPRNFSGQIRVLSGLNGEVYNRGFFPREQYKHLQLER
IERGRNFMYLEMKTRERGIRIAVGASWKMMNGQERKRRWEPRIYGEKFTS
EITIDASRGHTYAFEKLAVVMTNRDVPTERAHNMMREAICNLRCYVRTGV
PVEIGRHLDVWRELWKQADVRIEGDDTAQQALRYNIYQLLINGPSKPGPI
GAKFLSSEGYMGHVFWDTEIFILPFYIYNFPSMARNILMYRCNTLPGAMM
NASKSGCEGARFAWESATTGEDVTPRFASKLEKTIRLIYTGMEEEHIVSD
VIYGVERYFRVTGDESFLLHCGLEMVFLTARYWASRVTKVGEHYEIHKVI
GPDEFHEHVNNNAYTNWLVKWHLRLASMLFRHVRKTAPEALQEVAGKIAL
RDDEPARWLEISRKLKFSQEAETGLVEQFDGYFDLKDRVIERYDRSGNPV
LPAGVTYRNIGRTRLIKQADVLLMMLLFPHSFSFEEKKVNYDFYEPRTVH
KSSLSHCTYAMMGLAVSERNNAYRYFMKTAQFDLENLHNNTELGIHAASV
GGSWQTVIHGFAGLTLKSDRIVINPWLPKKWERLSFRVRWRERDVYLDIT
HSEVSIRIDAVSDVTLPCTLYGQNYKIRTNKPYTLQYCLSK
>CT1057 hypothetical protein
MRKSSQDWSIRALAMAQSVNAEQSVLKALFDKTKKRQYRMPEKRRLVITD
ELICRRN
>CT0589 hypothetical protein
MVKDSNHFMNSTNMNFVIRLKDVVDAIDQPDEERRAFLNIRTGRIVTFSR
DALDAVELGSAVRAREEALVREAGEALLSGDYRELPDQFDINDCSIMRRF
CQTVENDELRRGLLRSIQGRGASMRIRSTVDAFGVVEAWSAFRNEALQAI
AIDWLGNLGIAYSGE
>CT0012 hypothetical protein
MSLNFSGPNAQFIDFDSDFNFDRNLFADFSQ
>CT0488 membrane protein, putative
MLKSLLRILSLRQDAEAFDAIHNAVEADIMFSGARIWVLISAIILASVGL
NMNSSAVIIGAMLISPLMGPINGMGYSIATYDFPLLRKSFKNFSFAVISS
LVASTLYFAITPVSSAHSELLARTSPTIYDVLVALFGGLAGAISLTTKLK
GNVVPGVAIATALMPPLCTAGYGLATGHFTFFFGALYLFTINSVFIGLAN
VGFARVMRIPLRSSLPEEKRTGINRIITAVILVTLIPSVYFGYVLVKKEH
FIETATRFVQTVSLFKGNFLLRYDIDGDSRTISMIYAGEPLTSDDKVELG
RRAEKFGLEQVTLKFEQGLVISKDKEFQDKLSQMVETDRQKIEIARLNAA
LQANQRQQDSLRQVKYTGLKLLNELKPLFPQITSCLYAESYLFSDSTGKK
PLHRSYVLLSAAKPISRVDRTKMENWLKARLQNDSLRVVFE
>CT0348 ArsA ATPase family protein
MCRMSSRDLSEKQSPPRIIIYSGKGGTGKTTISSSTAVALARQGKRVLIM
SSDPAHSLSDVFGVQIGRNEPLKIEKNLYGLEVDTIYELKKNMSGFQKFV
SSSYKNQGIDSGMASELTTQPGLDEIFALSRLLDESQSGKWDTIVLDTSP
TGNTLRLLAYPEIIIGGNMGKQFFKLYKSMSSLARPLSGNNIPDDDFFNE
VNVLLKQMEDINEFILSPEVTFRLVLNPEKLSILETKRAYTFVHLYGINI
DGIVINKILPTSRTVGEYFEFWADLHSKYLMEIDNSFYPTPVFRCHLQRT
EPIGPDALYDISQIVFGEEAPDKTFYSGKNFWIESKKNQATSDNREVLCI
RIPFLKDAADVSVERMGTDIIVTVDRAQRNITLPRALYSLDMDRFVREDN
ILRVIFKEVKVDKEEMELNVNKNVLDKLRSMRRLKI
>CT1727 glutaredoxin family protein
MKKIKILGTGCAKCNQLTDAVKAVIAAENIEAEVQKVEDMQQIVSYGVMS
TPGLVVDGKVVCSGRIPSGDELREMLTAAPKPSCCCGDYANGTDRPNGTC
>CT1028 transposase, truncation
MQSLSSHYHQLLGLPSNWEVENVNLSMSSRQVEIRLAFTGKQGECPICGQ
SCLIYDHAAEQRWLHLDTMQFETILVARLPRCQCKEHGVKTVQAPWAARH
SRFTLLFESFAVELLLHCANIKAASRLLRLNWHTVNQIMRRAVQRGLVRR
KAETVEYLGIDEKSFKAGQHYVTTLTDLGERRVLEVVEHRTTEATKELLA
SLNDRPAVAMETIAMMVSVSMIVSRLVNLLIFSAFWSPPRYRPWPGC
>CT2247 iron-sulfur cluster-binding protein, gltD family
MNAESNPILDFATEYVYPAFSELTGTDKIVAFGDHSHKCPIYVPQTPPCT
AECPAGEDIRAINRFLNGTDPSDDPLKSAWETATDTNPFPAVMGRICPHP
CQSKCNRGVHDESVAINAVEQVLGNYGIEHNLKLKGPGADTGKRVAIIGG
GPAGLSAAYQLRRKGHAVTIYDANEKLGGMVLYGIMGYRVDRKVLEAEIG
RIIELGVETKMGVTIGKDITLEQLEAEYDAVFIAVGAQKGRALPVPGFEG
TPGATNAIDFLKSYEVLGDDIPVGKHVVVIGDGNVAMDVARLALRLGSQA
TIISGVPREEMACFENEFDDAKNEGTTMHFLTGTVEVLGGASGVTGLRCT
KMVKKEKGEEGWNSPIPFLRYKSNGESFEIEADMVVAAIGQATDLSGLGS
AASGPWLKVDRNFRIPGREKLFGGGDALKVDLITTAVGHGRKAAYAIDAF
LKGEPMPEEPYREITKPHKQDLLYFLHTPQAKRTSIKPEVVVGNHDELLE
ALTPEQAITESKRCMSCGFCFDCKQCVSFCPQEAITRFRDNPAGEKVYTD
YTKCVGCHLCSLVCPCGYIQMGMGDGL
>CT1145 NLP/P60 family protein
MPMRKEQQSRPDLRVLKSLMVCATIAASLILHSPSLMAAEEATGATSTAP
SACSINPSEKLKNLFTEVKQYLGIRYRFGGDTPSGFDCSGFVRFMFNKEF
NVNLPRSSREMATIGTRIDRNELRPGDLVFFKNAEDRINHVGIFVGNDTF
VHSSLSKGITRDTLNESYYSKRFATGVRILDIQGNRIPDDFNNLFDESNN
GNSPS
>CT0101 hypothetical protein
MGSAALRGYIPLLQYPFFVHITRFQQPLSLFVSKAVDGPNKLKGQLITMC
FRGGYETGTYGIDLLFSTFRPALPSVTNDC
>CT0849 hypothetical protein
MTQNHETAVTCSTVSQLFAAPGQSMLHGIGMAVVSGSP
>CT0090 glycosyl hydrolase, family 3
MKATLLAALLLFLSFTSALAAPSTPDSLSIKIGQMLMIGFRGMEAKDDSA
IAADIRERGIGGVVLFDYDVPSKSPIRNIESPEQLRRLTSELQKLSTTPL
FIAVDQEGGRVCRLKPSRGFPPTVSAAYLGTLNNSDSTWQAAGSTAALLK
SLGINVNLAPVVDLNVNPSNPVIGKLDRSFSADPVVVAQQARMVIDAFHQ
QGIIAALKHFPGHGSSTTDTHKDFTDVTATWSKKELDPYWALIREGYSDP
VMTAHVFNARLDSLYPATLSKATIDGLLRKQLGFRGVVLSDDMQMKAIAD
RYGLEEAIRLAIDAGVDVLIFGNNVSYDPEIASKATSIIRHLVEKGAISP
ERINESYRRIMTLKTRTIISRP
>CT0364 PAP2 superfamily protein
MGGLLLFAGFWKWNRWLSLSGLFLFSTVAASGLASDLAKVILCRSRPELF
LQQGIYGFDLFGWHFDHAWQSFPSGHSATALSAALTLSLLFPRFRPVFII
AGSTIAASRVVICQHYLSDIVAGSALGAVTVALLYQRYFRKLFDETATIQ
T
>CT1149 conserved hypothetical protein
MLEALFYVVVEVFCRVREVSQCKFQPCFFMISTLVILLLIILVAVLGFLL
LRSQRESNALSVENARLQVEVEHERERAAASLEARRSDEARLEATLENLA
NRIVEERGAALSEQHRQRLDGLLEPFRLQLDSFRQRIDEVHRSDTELSAR
LLEQVRQLQELNHQVSDEANNLARAIKGESKKQGDWGELIIERLFEASGL
VKGREYTVQESDRTDDGRLVRPDFMVHLPGEKAVIVDAKVSLTAYERWCN
EEHEARRSEALREHVQSVRRHVAELERKDYAALRGNRTLDFVIMCIPLEP
AYQAAMQADPSLFYELAGKNVVVTGPATLMITLRLIAQIWRRENENRNAE
LIADKAGRIYDQVVLVAEAMIEARKKLAGVSDTFELAMKRLTEGRGNLAG
RAEEIRKLGAKVSKKLPPEITTQQEDEES
>CT0314 hypothetical protein
MNSSTKKTPSQSVWTWIVITLLWGSVFFATSTWILGIVSSWFDGGAFSPD
RAEALRVYAMYVPALLVVALSAMVIQSRLDPGWQKQREREKAVRAGKREQ
LFVSFAASIATSSLFTLLTAAAHMLAAPVIGTAVSFSVKTVLVAAGLNIA
FGIAASLFVGMIFLVFGVAKGGSKA
>CT1354 hypothetical protein
MITSTPPAMRYQNAEEVLSEWITGICTGKIEKVEQLYHEQAVLIPTFSPH
TVSTTEGIRHYFEQLATRDNLQVRLYEKSLKKQFLGGSAWILSGTYAFEF
EVDQLLLTFPSRFTFAVNLEYERPIIHHHSSQLPRNLS
>CT1056 hypothetical protein
MYDQLTNLIVLKTYYKNKRIDRTAYSVTALFKVTCPVYPVRSSVYRPVRY
RQIVNLLLNTPFCYYQLSIVFRIFPHRTILWALCPATS
>CT1279 conserved hypothetical protein
MEKQPNKLIREKSPYLLQHAWNPVDWHPWGEEAFSRARETGRPIFLSSGY
STCHWCHVMEHESFENAETAALLNRHFVPVKLDREEHPDVDHLYMMFVQA
TTGRGGWPMSVWMTPDLKPFFGGSYFPATERWGMPSFRSVLEHLANLWEH
DRPRLLASAGSIMDQLSGLTRPQEGTDEVTDAHASACLAALERGFDAEWG
GFGGEPKFPRPAVLSFLFSHAVATGNRHALDMALLTLRKMAAGGIHDHLG
VAGLGGGGFARYSTDRFWHVPHFEKMLYDNAQLAASYLEAYQASGDELFA
NTARDIFHYVLCDMTSPEGAFWSAEDADSLDPYGSGEKREGAFYLWTEQE
ITGLLDPEEATLFIATYGIRSDGNAPFDPHGEFTGKNILIRTMSDNELAG
TFEIPIETVGKRLNSARKKLFEARKKRPRPGLDDKILTSWNGLMLSALAK
GSLVLGDTTLLEAAERAARFILDTLCDSKSGKLLRRYRDGQAAIEGKAAD
YACLILGLLDLYSASFDSDWLRAAIKLAEAQIERFFDQEAGVFYSTAVED
HSVPLRMIEDNDNAEPSANSVNALNYLRLAAITGRDEFRTIALRTIRHFS
GTLDANPSALPLLLVARQIATASPVQIIFAGKRGNPALAKLVATAFRHNR
PELTVIHADETCEALLPEAAAIGKMHKGEPAAYLCAGGSCQPAIRNAESL
DAALGSFAFD
>CT1376 sensor histidine kinase/response regulator
MAGGIAHDLNNLLTPVLGYSEMLSNSFPETDKRHKRIEVIHHAALRARGL
VQQLLAFSSRQTLEFRMLDLNRVVRDFEQLLRRTIRDDINIRYHLHDGRL
VIQGDVGQIKQIIMNLAVNAEDAMPSGGELTVKTSTMVIGKGQEKYFEGL
PSGDYAVLTVCDTGTGIDQETLACKGRGTGLGLSTVYGIVRQHGGIIQAG
SEAGSGASFRVCFPLRESEPEPLQPVKPKPHQKGEGAKVLVVEDDEIVRK
FVVQALDEEGFETREAENGQAALELLEKGDFKPELLLTDLVMRGMNGRML
YEKVCDFMPGIKVIYMSGYPKDIISRHGVLDTGLSFLLKPFPVPVLLDKV
KDVLKNGE
>CT0564 smpB family protein
MSKKQGKPEYVTAISNRKARFEYEILDTIEAGIELLGSEVKSVRLGKASL
SESYAMIHHGQVWLENMQITPYEHNTLDTLEPKRSRRLLLHKAEIMRLQS
KISEKGLTLIPLKAYFNKRGVLKIELGLARGKKLYDKRETIKNRDAKRQL
QQLRKQY
>CT1509 CAAX prenyl protease 1, putative
MMMNLFGQIILFTLIGTWLIKLAADLLNLRAASPTLPEAFRDVYDPADYR
RSQEYLRANTKFSLISSTFDLALLLVFWFAGGFNALDQLIRAWGFDPVIN
GVLYIGALLLLQSVADLPFSIYHTFVLEERFGFNQTTPKVFVIDLIKTLL
LAVLIGTPVLAAILWFFQSAGPLGWLWAWGGVTAFSLLLQYVAPTWIMPM
FNKFEPLEDGELRKSIMDYAAEVRFPLTGIYVMDGSKRSAKGNAFFTGFG
KNKRIVLFDTLIKNHSTGELVAVLAHEIGHFKKKHIFMSMGLSMLNLGVV
FYLLSLFMNNRMLFDAFAMQETSVYASLLFFMLLYNPVEFIISILMQMLS
RRNEFEADNYAVKTYRNGALLADALKKLSRQNLSNLTPHPFNVFLNYSHP
PVLQRVERIEAAARH
>CT2106 adhesion protein, putative
MMGKPDPLQRFGFSAVILFLQKRHFFPSTGNPDMTTRQSRFISSFFYPIR
PLVLLLLLFIPLLNGCAGKTESDKLQVVASIEPLAWFAERIGGDRVSVSV
MVPSGGNPHTYEPTPKQMAQVSHAALFVKAGSGVEFELDWMPRLVDLNKS
MTVCNASEGVTLLPMSAEKYEHAVPAEEHHDHGNFDPHFWLSPANARLIA
KNVERSLAAIDPAGKEYYAANAAALDRELQALDSEIRKQLSVVKNRRFLV
FHPAWGYFAHEYGLEQIAAEEEGKTLTPRQMERVIGEARSAGIRVVFVSP
QFSSAQADAIAGDIGGQTVTVDPLAHDYAANLRKATAAFVQSMQ
>CT1864 hypothetical protein
MELMAKDIRRAIGWSSRVTERLVAGKAMNSISVNAIPLFFIHMFLIAQEM
SPRHSDR
>CT0209 cytidylyltransferase family protein
MPPKVLTRDEIVLKTRNWQAAGEKVVFTNGCFDILHAGHVRYLSAARELG
DRLVVGLNTDASVRRLKGPNRPVVPEQDRADVLSALASVDAVTLFDDDTP
ETLIKLLLPDILVKGADWPVEKIAGAKAVIEHGGSVLTVPLLEGRSTTGI
IETIIQLHCPQQTGG
>CT1581 hypothetical protein
MRGCPLSTNTNAGAGYELSIMYFTRHYFIAFVMGWFSFS
>CT0484 NADH dehydrogenase I
MRTPPLLALIITGFAVSAAAPYLYRLLKARFVWFGVAFPLALFASFMLRY
PQAASGVPVRERWSWVPSLGLDLSFVLDGLSLTFVMLVTLIGAAVFLYAS
VYLRHHEEADRFFGFIGMFMTWMLGVVLADNMLLLFLFWELTSISSFLLI
GFNHHAASSRASALKALLVTGAGGLLALLAGMLLLGNVTGSFEISSFYAM
NDLITSHRLYPAIVALILVGAFTQSVQFPFHFWLPDAMAAPSPVSAYLHS
ATMVKAGIYLIARFNHEIGSTALWQDTILFTGAATMIFAGLLFYRQSDLK
RLLPLHLPGLVKPSIWYEEALSGMLRFAAWLTSVLQNGDLRRYLAVIIFS
ALIPVSLMLFSSGGFSVTLPADLSVASYEVALALIIVLATALLLTSDSRL
KAIVSMGVLGFGVGMIFIIYGAPDVALTTFVAIETLNVILFVLVLAHLPK
FTSRSRTTGRIRAAQTQNR
>CT0226 glycosyl transferase
MAARKLSEQHEVSLIYRKEAVGKHFDLPKFRLPCLSHIDLYTLFRMVAII
RQNGIEVVIPTKRKDYLLGGLAGKMTGAAMILRLGADRKLKWPWQRFMYH
TLSNGIIVNASKIKKTLLETGYIPEAKIRVIYNGLDIAELDRRRSEQTVA
KPFPVTITGLGRLTWNKGYDFLIRSFSRFVEASGNRDAGIVLIGDGAQKK
EFAKLADELGLADRVLFPGFQQNPYSWLAASDIFAVTSTNEGLPNALLEA
MYLGNAPISTRAGGVEEVIDDGRNGLLLDYGDEEALASALQLLVKSPERR
AEFARQATTRIVEQFSMERMASEIASFCREVAAKR
>CT0860 conserved hypothetical protein
MNQGKPEKLYADSVQLAYAKTLEFVSHAIIILMAIGFILYVFRLLPLTVP
VETVAANWHLNATKLQVKIHHHCGWSCFEDVHTFMHGDAVSYASVVFLSL
ATMICLATSTMAFFREKNRIYLVITILQILVLLVAASGKLTSGH
>CT0227 UDP-glucose/GDP-mannose dehydrogenase family protein
MDKTRIALIGLGYVGLPLAVEFAKKFPVAGFDIKQERIEELNNGCDSTLE
VDAEALGAVLTASNPLTADGDKGLFVTSSIAEIAKANIYIVTVPTPTDKH
NRPVLTPLIRASETVGQVLEKGDIVIYESTVYPGATEEDCVPVLERVSGL
TFNRDFFVGYSPERINPGDKEHTVTKIKKVTSGSTPEAARKIDELYASVI
TAGTHLASSIRVAEAAKVIENSQRDINIAFVNELAMIFNRMGIDTLDVLR
AAGTKWNFLPFRPGLVGGHCIGVDPYYLAQKAQEYGYHPEIILAGRRLND
NMGRYVASEVVKLMISRDLKVKGSKVLMLGITFKENCPDIRNTRAVDIVA
ELREYGAQVHIHDPWANPEEVRHEYGLDCIDAPVEGGYDAIILAVAHAAF
AEMAIGKLRSGDNAVIYDVKGVLPTDAIDGRL
>CT0480 hypothetical protein
MKPTYTTWLRRMTASAMLLASVASPVPALAFDSFEELLDRSDSMGNADEL
FSDLEELRRNKIPVNRATEEQLLKLPLLSAADAAKIIEWREQRGPIGSVA
ELEAVIGKDTARRFAAYLSFELPPEVKKKALPERVKGSVIGRVFWEDPPR
AGVTNGKYAGGNRHVYSRFQAATPHYGVHLLSDSDVGEPDIDDFLSFNVH
AEGIGMLTQAVIGNYRLSFGQGLVFGQGRYFFKGSDAVDGVLLFAPALRP
YISAGEDNFLQGVAATLAPGPFELTAFTSSNKVDATIKNDVVTSIATTGY
HRTATELNKKDNLTQEVKGVNLRYRYRAGELSTAIGATWAKYRYGLPLSW
LDDGKNGGQLGSVEASAVYRNTQVFGEAAYATEPGALSWICGVQSDLAKG
ITGVVSMRDYAFEYYSPFAGAFAERGDNASNESGLYLGLKAKVRDNLTLA
GLYDRFRFPVLDKRYYVYPSSGYEARFYATWKQNRWMTWDAMYQHKEKEE
AKKQFDSNIGTSRYIPVPKITNRVQLGLVVECSPGITLKSRAAFKSLRSH
FVTGAESEEGWLLYQQINLKRGPFTLKTRLARFDTDSYDVALYAYEDDLP
LVYMLNAYYGRGKAWFIVLDFEPVKNFNLTAKYETTWYDDRDVYSSGNDL
RNTSSPGSYHVGCMLKF
>CT0312 dnaK suppressor protein, putative
MTIMSKKTSPVQESPVESTEETRLTRTYLSDEELEHFRQLLLKRRDEVLR
DLDILRSSLSEESVEDSINSNYSMHMADHGTETMDREQRFMFIARDEKYL
HYIDQALDRIRNKTYGICTKSGKPIPKKRLEAVPHTSVRIEFKQTKK
>CT2110 hypothetical protein
MVSNLLKSLKKEVLEYAVLPFLFYTPAIRPYTVISP
>CT1621 hypothetical protein
MKNLVSRTAGVALLSVMTVISGCSKSDNPVSEAVSSLSGE
>CT2221 membrane protein, putative
MTRYDQPYSPGGFQVMPPAIKAIIITNVIVFLFQNSAFGPALTTFGALWP
IGSHNPAGYSFHLWQPITYLFLHGSFAHIFFNMFALWMFGVEIENYWGTR
NFVSFYFICGIGAALINLLATYGSPYPTIGASGAIFGVLLAFGMMFPDRY
IYLYFLLPIKTKYFVAGYALIEFIMGLGNRTMGSGSDIAYFAHLGGMLFG
YIYIVIRRNEWTIKRMFRDFSLPKKPKGPVLWQGGGKDDDTSEAEIDRIL
DKISSRGYDSLTAEEKRTLLKAGKR
>CT0436 TadC protein
MLDALQFLTIVLSGAAVASVVFALYNWWLSNTVKKRLSQIVRQRWAPADV
ASPRKPTPQKIDTVINILSKLSLPEEGWQSSTVRTRFLQAGIRNKNAPQY
YYAVKTLLVFGLPVMLFLFLRFAQPKLPVILFLIFMLLSAAAGYYLPEIV
MDFITKKRVERMRNSLPDMIELMVVCTESGMSIDAAIARISQEMARTHPD
LAQEFYLAGLEMRAGATRIDSLRNMALRTSLDELHDLVSMLIQAEKFGTS
MAESLRVQSEVMRTKRTQRAEELAAKIPVKLVLPLGLFIFPTLLIVMLGP
AILQLIDVLKHK
>CT1636 hypothetical protein
MNSEASENRPAHSATGTLWSWHGGNGALIWQLMFSSDAVMGIKRFPQERK
AAFFCLESSTGRVLRDDFVLTAGDENETPVGDGWMIGLETVHGSRLVCHT
YQPGSPEHLGIWAINLPEARVVWSRPDLTFTANLGDAFLAYRSIVFAGFP
ERDYVLIDPLSGCELEHLGTAHERPNQLRDAAQSEEERQRILLPDTVFDE
AGHVENINHGATSVTVFHRMEPVAEGVPGWVSTLSVSECERLVHEDVMAS
GEPMPVFNSFLIKDDRLYYIREREFLVSFVVS
>CT1811 Sec-independent protein translocase protein TatD, putative
MNTPGFCDIHCHLSFPDFDDDRDRVIGDLRAAGLQCVIDPGTGVETNRRS
IELANGYDFIYSNVGLHPHETNVPLSPELFDKLAGQARSAKVVGIGEIGL
DYHWPDHDPAHQQAAFREMLRIAVDLDLPVVIHCRDAWPDMLRILSEERS
SALRGAMHCFSGDLDMACRCISLGLKISIPGTITYKKSLLPEVAASVGLG
DLLSETDAPYLAPVPKRGKRNEPAFVVHTVRKIADHRQEPFEEVTEALVS
NARTLFGLP
>CT0717 epoxide hydrolase, putative
MDNVPEPSSEKFRAYRQKLLDQLETSSQGERHRAQYELELMRNSHFVKVG
GLLHHYHDSGPENPRGTVLLIHGWDCWWMWWHRIIRELNAAGYRTVAYDM
KGHGWSENDPENRYQIADFVRDLDELIRAIGLKDLHIAAFSFGPFVALDY
VNTYPNSVRSMVFFNFGYLPNSEFISKVAPATIIFIFNIMMRKLTWWLPA
YIFARLVLSRNSVMMHDIKVGFESLGFCASEAIEQTAQQITAMETTQMLP
DMVRAVRVPILFAAGEGDVIMTCENARKLQEMTPSGSYLCVPDCGHLITL
ELPQTAAEIVLQHISSNS
>CT1394 hypothetical protein
MSARPIVTGDYLVEKAIRLRVILIEQSGHGR
>CT1006 hypothetical protein
MAGVSVSEKYTTAFFGYETGYLRSGAVFQRGGASGGPAFCRGVTLLK
>CT0460 conserved hypothetical protein
MFEWDENKRLSNLEKHNLDFVDSRLLFDGKPAITVTSAVSSETRYLTTAE
ISGKFYSVVWTWRGDVRRIISFRRARDEEERA
>CT1717 tungsten formylmethanofuran dehydrogenase, subunit E, putative
MKTEITLLKGETVMVEPKEFLKAGQAFHGHKCPAMPMGLRVGAAAMNKLG
VERSKDGQLLDLVELGDGHCATCFADGIQMITGCTFGKGNIKKLHYGKWG
VTLIDTQTGRAVRVTPKAEAMLANKKSEFFTEYRMKHIPASKVPAEVVEP
LVEKVMNAPDDMLLNIGDVFDYEVPPKVESFSGFVCDICGEMVVEGYGRP
LDDKKVCQPCYEKAQREH
>CT1303 ABC transporter, ATP-binding protein
MVLLTVEGITKRYGLKTLFEEVSFGIDDRDKVGIIGANGSGKSTLMKILA
GSETPDTGRVMVSKEKKISYLPQVSPYDADDTVLEAVLKSGDKVMALICE
YELALEALDHAEGDQTALIEKVTHLSHELDVSGAWELESNAKAVLGKLGL
NDLTAKMGTLSGGQRKRVALAHALVVPSDALILDEPTNHLDADSVEWLES
YIRRYAGAVILITHDRYFLDRVATRMIELDGKTAKTYTGGYASYLVQKEA
EEAQEIRDERKRNALAKQELEWMRTGAKARTTKQKARLQRAETLVYAPKK
AEKQEMEIGFGAERLGNKIVEFHDVSKSWGQKKLLRSFDYLLEKGDRIGI
IGPNGSGKTTLLEMIAGRTKPDTGRIEIGPTVKIGYYDQESRHLDDSKRV
IEYIKEEAEQIKTKDGTLVSAAKMLERFLFSGAAQYNPIGNLSGGERRRL
YLLRQLIGAPNVLLLDEPTNDLDIPTLRVLEDYLDTFPGCLVVVSHDRYF
LDRTVEHIFAFEGDGIVRRYPGNYSVYLEMKAAIAAEEQAAKQKPAPAAK
PASEAPKPTLSPKQRKLNSKEKRELEQLEQAIAEAEERQEAINAELAAAG
SDFDAVQKLGDELHKIQTKLDKDMERWAELAELA
>CT1827 hypothetical protein
MIIRSRFFRNLRFFNLFPASGGGPTFGGDVLYSNLTFHSSSENAMAAQTA
ADILARLLGIGPDEAQHQLDRFAGGMTAELLDRKRLAVGGLGVFSVVHEQ
ATRQTTASGTRYQPPRDRVAFEARKELTGDAERIASVRLGMEASEARRLA
KALGELFEAMKKGTGRFELRGFGSFTLVSGALRFQPESSLEALLNIGYGD
LKEVVIADHSAEKETLPAPEKPGGLKKAALFVAAVLMLASGWFLYRQFAP
DGVDALPSAGSSAESGVAAPVASSALSAPLSSKAPESVQASPDSLILQKG
RYTVIVATFNTRKVVRQEIARLSEMGHRVWCWPVTSGGSRYYRLVIGDFE
TRKAALDSMKSMPKGLSSHSYIQQAPKNVVLYGEQGL
>CT1212 hypothetical protein
MMLTSDRGVMMNNGVLWIKPVMKWMGLTAFLVIAGCSQFKTVTREPGVLG
DRVALTPEMTGLYEEVASNASTVRALDGYADLYLETPKRKAKAYCTVQIQ
KSRDARMIVTAGILGWPVADLLIRPDSLFVNDMLNNRMLVGRNNGENMGK
IIGVNAGFGRMIETLFGIADVPEPAKNIESVRKGSGRVSFTVKSGNGTKE
LVVDPLTRELTGLVYFDQSGRKSVEFRFAAYQSQVDKNGAELRVPREIDM
ILYREDDPEGSRSLKVVYDERVINPPDFNITFKWPARAKTVNLDEVERLP
WL
>CT0868 heterodisulfide reductase, putative
MAQETVFTPDVKFVRELQKAGADTLKKCYQCATCSVVCPLAPDDKPFPRK
EMLMAQWGLKDELLKSSDIWLCHNCNDCSKYCPRGAKPGDVLSILRKAVI
QENAFPKFMGKIVGDPNNIWQALLIPVVLFLVILGVTGHLHIPEGPVVFS
KFVPVPVIDGVFVPLSLLAVGMFAISMSRFWKNMAEASGMQPKAEFMPSL
VETLKEIMTHSKFRKCDENKDRSVSHVLVFYGFIGLGITTAWAVFNLYIL
HWELPYAIDEHALSIFGGSAVVAWAYKIIFKLFANASAIMLLAGGTLVIM
NRLKERSVETTTSSFDWLFAAIVLLVGLTGFLAQVMRVTGFPPVLAYSTY
FIHLVLVFYIIVYLPYSKLAHFVYRTAAITYTKMLKRDVEM
>CT0410 iron(III) ABC transporter, periplasmic iron-binding protein
MMARFYISIRRFGALPVLGVILLQAALLAGCGTPSGGKQHSTSVASAPDS
LKSSIGYSRRFRLFRRGEVTVLEVMAGQKAYPDTLRYLLVPEGRPQPSGF
VGYTVIRTPVKRIAVFSTTHIGFLDMLGRSDGIIGVSRPEFVNTPAVRSR
IRSGAIKAIGMPFSPDVEALLNLEPDLVIAPALPAARKTDYQALVQAGIP
VLVVAEWLEPSPLGRAEWIKLFGALVGKEKLARERFAAIESSYLKIAALS
RNLPNRPTVLTGLPVKDAWYVPTGGSYVAAMLRDAGASYHWRDLPGTGSV
PLTIEAVWPVALEAEYWLNTGTVSTLDELLAVDSRFGQMRSVRERRVWNN
NRLVNAAGGNAYWESGVVRPDLILKDLVMILHPGLLRAHGIGEGELTFFQ
EVK
>CT0061 hypothetical protein
MSARCKSKLAALLMAAGLGTAIPPASVSATLRPADVRVSAEPDSLFAGER
LRYVITVQHDHRDSISVVSLKAGQGTPFEITGTKSFSKNLPDGRAEFRMD
TELAVFGSGRKPLPGFTVVSKHASAAEPERLVITPSESVTVLSMTDSTVT
ELRPIAPPVSAPFPTWLLVPVLLSLAALGLAGYFVKLLITALRRHLADPG
RAARNRLRAIHRQLSKGLQPAAGYESLSNILREFLQKRYQFGALEMVTQE
IADELAARRISIRQELIKLLDEADLVKFADRRPDIEECRRSLRIAEVLVA
TAAEAETTEKEPLMEQSE
>CT1449 hypothetical protein
MIVSPVSSIGNQQAVAMNIASGSGRKWLTALILSPGMIFASAYLLVWLSM
NLWLNHNFKHHLKQIFTAETGQRYRIDIGSLRPEPNLNSLTLKQLELTPV
GVAENQRASRSVFQIEELRIECADLSLFPFKPADELLTLRKVSRVILLNS
VQ
>CT2157 hypothetical protein
MESKRYSEAFKLKVVLEIESGKFRIGEAARHCIGKATALQWKNRDQVDDL
PNTPHRFNKTMSDLEALVVIELSKSLLLPLDDLLAIVRECINLKVSRSGL
EPLSAPAWRRQPERVDACRRGRAQAQKERQRPRAGSCACRCQIPAQDAR
>CT1135 CRISPR-associated helicase Cas3
MGKIDHSAAGAILASNKSKDIGRILGYLIAGHHTGLPDWDKDAGTKGDPL
SERLSNSEHLQHALKGNPPENILDTPLPSSLPCQTQNGGAELVHLWIRML
YSCLVDADFLDTERFMNPETSELRPNGVDLAMLKERFDQHMSLKESGASD
TPVNRARKEILRECRQKGVSLEPGLFSLTVPTGGGKTLASMAFALEHALK
HDKKRIIVVIPYTSIIEQTAAVYREVFGDDAVLEHHSNLDPSRETPASRL
ATENWDAPIIVTTNVQFFESLFAARSSACRKLHNIVNSVVVLDEAQMLPT
DFLQPIVSVIKMLSAHFRTSVVLCTATQPVLSGKVGTGKDILKGFDDGRV
RELMADPEKLFGIFQRVRVRMLGQADQRHEWAEIARRLCEPEQVLCIVNT
RKDCRELHDLMPDGTVHLSALMCPEHRSTVIAELKSKLVAGEPVRVVSTQ
LLEAGVDIDFPTVYRSFSGLDSIAQAAGRCNREGKLEHGDVVVFNPPNPS
PSGRLRKAEQAAQELFRTVPELAASLMPEAFRRYFMHYFSGLNGFDTKGI
MDLLASNDAARYFQIQFRTAARKFKLIDDTLQHGIIVRYRNGKANIDALI
DQLRFGGPNRKLMRQLQRYSVNVYDPDLKKLIENGLIEEVHGVWVQTSES
AYDPVFGLNIDANLNFYW
>CT1703 hypothetical protein
MQDGHRPGAGYPMRLIACGDGGDNVEFDSGIEFPGRGQAVKRKISDALKA
RHQRYAANEQQTAEPVEANERRCAVPLRLGDHERELAGCEVVQALRNDHQ
ISRAGFDSFEVKRIQVSEA
>CT1699 prismane family protein
MHCDQCQESIKGGCNVRGVCGKDELTAKLQDTLIYAAEGIALCAEGFEGK
IDRKYGQFISECLFVTVTNTNFDDVAIVEQIRKALAMRDEVRALAGTTPA
HDAANWSGSTKEEFLAKAAACSIDSLSADPDLRSLKSLILYGIKGLAAYT
DHAAVLGYHDDDIYAFYVKGLSALTKELPADELLGLVMECGATAVKAMAL
LDKANTETYGNPEITTVKTGVGTRPGILISGHDLRDMEDLLKQTEGTGVD
VYTHCEMLPAHYYPAFKKYDHFVGNYGNSWWSQDREFESFNGPILMTTNC
IVPVRESYRGRMFTTGMAGYPGLKHIPARPDGGSKDFSEIIELAKTCKPP
IEIENGEIVGGFAHTQVLALADKVVDAVKSGAIRRFVVMAGCDGRHNSRQ
YYTDVAETLPKDTVILTAGCAKYRYNKLELGDIGGIPRVLDAGQCNDSYS
LAVIAMKLQEVFGLENINDLPISYDIAWYEQKAVTVLLALLFLGVKGIRL
GPTLPEFLTPNIATTLVKLFDLKPIGTVEADVEAMMAGN
>CT0006 oxa1/60kDa IMP family protein
MDRNSVIGFALIAAIMIVWLQFMKPEQKLGLEKAAASREAVQKTPAAALP
APSAAVAAAARADSLGSFAQASVGTEKTITVSNDLFTATLSSKGATLKSL
VLKKHLDGNRKPFNLISASDKGALSMLFLSSDGKKIDTRDLYFRSLDAKT
TETVTGKEKLSVSYVLDVDATRSIQITYTFTGDSYVVDYDLKLNGFGSSI
AGNEYQLDWDGGLNYSEKDQVDESHNAIASAYLGGSVVKLDAKDAKKTWQ
DEESGKAQWVAVRNKYFVAAIMPQRTTDGIYLHGTKKDGSDFKNYVAALK
MSFPAGQQSVDDHYRLYVGPLDYNTVKSLNADLEKIMDFGWDWLTRPFAE
YLILPIFNWMNKYVTNYGLIIIIFAFLIKLVTWPLSLASTKSMKKMSALQ
PVMKELQEKYKDNPAKLQSELGRIYKEAGVNPLGGCLPTVIQMPLLFAMF
YVFRSSIQLRQHGFLWVKDLSVPDSVYHFAFKLPLYGDHIAIMPILMAVT
VFFQQKITPNAQSNEQTKIMMWLFPAMMLFFFNNMPAGLALYYLMFNIFS
VAQQAYMNATITDEEKAAAAMQVAAATKPAQSAKKGGKKK
>CT1866 hypothetical protein
MAMKKDILERYDRLDDGRVVIDVYASKVEELYEDFDKQAPFHRKDLDEEL
AAYLFDCVREIGRVDFIIRITLDAVPSAELQERIRTSLKKFFIYQRGLES
ASMQQLLRKSLLFFLSGMALLFFSLWFGGSMIPEVRQLVYERVLVEGVTI
ASWVSIWESLSILMFNWWPARLRIRLNSRIADAEVQFQSHPGIRR
>CT0826 hypothetical protein
MLLTFPGKLENLNRIDRKIAAMALAFRPAWRHMIQPTLFRRKLKTLPV
>CT0354 hypothetical protein
MLIGEIKSRRKCFRFNDHDAIVISSILIVIFGKTGDIDCSSFASILYF
>CT0409 ferredoxin, 4Fe-4S
MNDKNQRRKRRLTVPRAEIAWYPVIDAELCNSCLSCYNYCPKDVFASAKA
EEGLRRRPKMEVANPYRCIVLCSACEQECAAGAISFPSPEEFEQFVEYLD
>CT0122 conserved hypothetical protein
MLEMKATGKSKGQWMFYRNFDDTVDYLSDHTRILSYNPFCHKVEPLDRDE
AYRWHFRVTDPQNNPFDVIFNIQQETEILVDLPDEVASMDPEEMSDEMIR
QFTVGRKITWRPLAQDKTFTMPEKYLFEGQVTADMLIVPVQQEQTRVDFD
LWVNVAFLLYPAFRIVPEKVVRTMVSTGMSLIMQTATNHMFQKISKEFGK
IRKL
>CT0349 hypothetical protein
MPPDSALLAIPDGAALTGGRLISVPQFVLYRLFRQDAIFTGNCLTIHRKL
VS
>CT1106 phosphate-binding protein, putative
MNHRFRDVVIALILATGIAVPVWLWMRAPSPESPLGSAISGDTPISGTLA
VAVDQSLTAVAGIQAAAFSDQYPNAAIKLSQESSRPVLRLLEKKVDAALI
EGALSAREDSLLSTLKHPVRRQPVARNALVFVVNCANPVYSVSIANLKAI
FSGRLTDWKSLGGSRGTIVACLDGSNLRARTLLSEMLFGKTEALSASAEP
DLPTLLDRVSKDRYAAAIMTLPEYAASLRSGYGSSIKAVPVSADAGGKPV
AASPETIYTGEYPLSTDIFYLYDPYNPLATGFGAWLAKEGQKLFERGDMA
PYEQLVRTIILK
>CT2082 hypothetical protein
MHIASQVAILADFPYEVFMSADPITIFRKTWGTYQKVISHNLMFHRREIT
TAVAKLFESRNAKRYDRNV
>CT0483 conserved hypothetical protein
MKKVLVAGATGYLGRYAVQEFKNRGYWVRALVRNPEKFKKPGPFFAPEID
TLVDDVVFGDATKPETIAGLCDGIDVVFSSLGMIKPDFEHDNFDVDYQGN
MNILAEALKAGVKKFVYVSVFDAHRMMNIPNVQAHEKFVRELQAAKIEST
IIRPNGFFSEIGQFVARARRGFMLWIGDGYNRQNPIHGADLAKVCADAVD
SSEKEIEVGGPEVFTYREMVDLAIEIAGTQPVQVSLPFWLADGIVGVLGL
FNRDVHDVALFATTLSKMDFVSPKYGTHRLRDFFNECKLLPL
>CT0522 hypothetical protein
MIERFLYFVTNTLHRHPATGKKIHNSYRKSPHFSSQQNGNAIFNTL
>CT0742 membrane protein, putative
MNNLELSLIVGVGALFAGMLGSLTGLGGGVVIVPLLTLGLGIDLRYAVGT
SLVAVIATSSGAAAAYVKEGFSNIRIGLFLEVATTVGALVGAFLAGMLAT
NIIAIIFGLVLLYSAYLSTKAKEDHSDDVNPDPLAIKFKLNSSYPTEEGV
KHYSVHNVGAGFGLMWLAGILSGLLGIGSGAVKVLAMDHAMRLPFKVSAT
TSNFMIGVTAAASAGVYFQRGYINAGLTFPVMLGILAGSFIGAKLLMVAK
TKWLRLIFGVVIFALGLEMIFNGITGRI
>CT0811 hypothetical protein
METPTSRELVSLLFYLRIEISLNNPEVMDSTMKHRLENMLGYLESESYLM
AYRTLNAIVTENEASGELPSLETSTALEVMQTCLRIIVGERVGHPEVAKH
FAQTVSFYERLALLLTKKLLGDDSAAAEVDILLFCHDALAKHRRN
>CT1155 conserved hypothetical protein
MQARKKGKKSWTSYAGLLAGLALIVYLFSKIDLAGSMKLIASLGPSSLLI
LLPYLGLHLLETAAWQRLFPKESGPVPFFGLFKIQLVAETVSMTLPAGVA
VGEPLRPWLCRKFLGIPLPDGFASVAVRKLLLGLTQGVYTLLGAIAGFGM
LQTVSVQVVGFQGLGVIMIVVSLAIAVVFSLFLFLMTNGNAVQKLHRLLM
KIPFEKVRAWLLEREAGFAETDQKLQHIKASGMGVILPVMAIYVAAWMML
ALESYLILHVLGLKVTFFQVLAFDTALTILRAIFFFIPSGLGIQDLGYLA
FFHALGVPDYLAYGGAFVMLRRLKEVIWYSIGYGVMFMEGIHLRDAEQVS
EKSE
>CT0550 hypothetical protein
MLPPTINWKLKPEISIKKFSVFKKKACPRQNRQRNLPEKRMTARPNPEIP
IILMTGFGKDIDNASSVNKLGICKMLKKPVRLADLVSMINEILTRTG
>CT2051 conserved hypothetical protein
MEVVGRSRAKAIIETHLNDSLVYFSDHVRILKCNPYCTGVTYLKEHGVYQ
WIFQVNDPRENPITAVFFVTQNEEHLDSSRTVPAGPVSEEESFPDSAVSG
RCIQWVNAPQVPDVPLKEKNTFVGQANTRICLYPLEDRRTEVHFETDITL
DFELSFPLNLMPEGILKFMTEAVMSKIMQQATESMLCQVQSDLCCCTTAE
LDASGGKV
>CT1920 hypothetical protein
MKVESRKRQYILEGKIKPGFCDGCGCVTERVFVGEWKPSDKPKDEDPLFG
LSDKKSKEKNPAAEEEASAAENQYWIRCTSCNQVHLLKEWQIQIDKELSP
DELKPEDCQLYTPHGIYAQGDALYHKSLDEVGVVREKHATGSGAHVIIVE
FCKSGRKQLLENVQLNQGKPKSTESVTDIIKLKLRR
>CT2011 histidinol phosphatase-related protein, putative
MESAKVLFLDRDGTINRDIGRYVSSREEFILIDRADEAIAIAREAGFRIV
LITNQAGIARGIVTPQDVEDVNDYLNELLAERQTSFDRCYYCPAHPNYPH
PEYDRFIDHRKPSPRMVEQAIADLREEGFEVDRSASFFIGDKLIDVECGQ
RAGLKTILVRTGHNEESLCEQHQIFPDHVADDLYQAVTGYILGQPTSRD
>CT0762 conserved hypothetical protein
MQLDTRRFASLVSWIISPVVVAPAVYLLIVLYRYSATSSHLDWFLVLFLS
ATIVPMFLIYGLKKIGRVSDYNITFREQRFLPLLVLVGVNLVGYELMKQL
DAPRFLSAILLFNAVNTVFILLVTLQWKISIHLFSLTSSIALLVLSFGLP
ALVLLCFVPLLMWSRIYLKAHNFMQTLVGSIIGFLLMYGEFKWWLAL
>CT0820 conserved hypothetical protein
MKKQTKTVPEFSSEQEEREFWTSHDSSEYIDWQKAKPAIFPDLKPTMKTI
SLRLPEMLLNRIKTLANERDVPYQSLMKMYLRERIDSEYEVERNKQKKAN
S
>CT0519 hypothetical protein
MLQRLLDCSNSGRIYINAFYSRRSIQESDSH
>CT1365 Nudix/MutT family protein, putative
MSRQINNDPGRWEVLESIYLHQRPWLTVRQDRVRLSSGKTIDDYYVQEFP
HWVNVLAITEERDVVLIRQYRHGIGEVSWELPAGVLDEGESLLDGAQREL
LEETGYSGGTWTPLMELSANPALQNNISYSFLAEGVSLSGTQHLDPTEEI
TVHLMPLDRLREIVFDGGMIQALHAAPILKYLLQNRWGESVKGEG
>CT0439 hypothetical protein
MRSLNPLHAQYPGDSFFENQIFVKKLAPLKQGGDSFFLSKSEPVETDTMK
SDKHWLNRERLTVYPRIFLALFLILGLVWVLMSKNMLDIKGKPLGYDFMT
FWAASHLALTGHAQDAYKIPLLFKAQQLAISASKVAYAWFYPPTFYLVVL
PLALLPYVTAYWTFMLSTLWGYLLVFRRIVRGNIAMWCLAAFSGLWINFF
CGQNGFLTASLAGLALLTVERRPVLAGVFIGLLAIKPHLAMLFPVALLAI
GAWRTLVTAAVTAVTFMAIGMATFGIAVLKGFLASIGDARLFLENGILEW
IKMPSVFVFMRLLGMPVAGAYIAHCAVAIAAVIVVWRVWRRCEDRNLRGA
ALMTATFLVSPYVFDYDLAWLAFPIAWLSLDGLRNGWLRGEREVLVAAWL
LPLLMAVIAEAVKVQIGPLVLCSLLWVTYRRATAASMAGALATDAYDDQL
GTVP
>CT2268 succinate dehydrogenase, cytochrome subunit, putative
MNFAGCAASSPVTSGLSVMDGSARRTFSSITSKVVMALAGLFLLVFLAVH
LGINMLLLVDDGGKSFSAAAGFMSSYPVIRVFELALFGGFALHIAFGVIV
SIRNRMSRPIRYQHRSRSETSPFSKYMLHSGIVVLIFLGLHFIDFYFIKL
GIVAPPPGVARHDFYSRAVLLFSDRTSSSIYMVAFVFLGFHLNHALQAAI
QTLGLNHTRHAAAIQAVSTVYAIVIAGGFMAIPLRFTLFN
>CT1103 conserved hypothetical protein
MKAELYIARRFAFKPRSLSKPTFIVLAAVIGIAVGTAALILTLSIVNGFA
SVIEGKLIRFTSHLQVRQPDERLFLETRRDLQTLKSVPGIVSVSPFLEMN
VMLRSRAGKGDAGGEIAPAMIQGIDPRESSRFLQNGPELLESAPASADGS
LPILLGRTLADKLGVKAGDQLLIVGIAGKNGTSGITGKESVVELLSSLDL
HVARVAGIYDTGLSEGFDDIIVFADLGRLQQLYHPDMISGYEANVGNLRD
LQAISATVASTLGYPFYSYTVYERYANLFEWLKLQKNIMPLLIITITVVA
VFNIVSTLLVLIIEKTKEIGMLTALGLEPRAVSLVFMGQALMISLVGIGL
GNLLALGLSLFELRFHLITLPEKSYFIRHVPLEIDPLQYLAVSGGVAVLT
LLFAFIPSRVAASLQPATALTT
>CT2026 c-type cytochrome, putative
MPVCETHDYVQAMKHSNKTFGVVLAIGMGVLSTGTLPANASAAVDGKIVF
DKNCSVCHSIAPPPKSAPPILPISARYHQRFSSRAKGIKYMADFIKSPSK
EKVLADQQAITRFGLMPPVPLNAEELNAVAAWVWDQNTGGNWGPGRGARQ
GNGYQR
>CT1776 bacteriochlorophyll c3(1) hydratase
MCFSGYPDFTIFPASYSSGGCFNRPNQLQAKDFMPRYTPEQLAKRNASVW
TDIQIILAPIQFFIFLGGITLNTLYYFNLAGIDFYWISIAILFKTLFFAI
LFITGMFFEKEIFNHWIYSKEFLWEDVGSTVAAFFHLLYFVMAWMEYPEH
VLVVEAYIAYLTYVLNALQYLVRIILEKNNERKLKGQGAI
>CT0456 conserved hypothetical protein
MLEQIRNTHPLVYQNFSALPDGEKHLRSILAIDRYWEKLDLPVPDVILAG
SRLPVDACVEEACDILYAGGTLGLLHAAVMSKKYGRKVLVIDRAEPGRTT
RDWNISRGELLRLADTGVFTSEELDSTIVRMYKTGWVEFHAPAERRKRLY
MDEVLDCAVDADRLLGMACKKVLAGGGSKVLGHTSFVCCYQFPDHLVVQV
EELSGKPRYFRTQVLVDAMGIVSPVAMQLNRGRPQTHVCPTVGTIASGFE
NADFEVGEILASTEDAEVSGKRGRQLIWEGFPAKGDEYITYLFFYDKVDS
PNDKSLLGLFEAYFRKLPEYKKPGPNFTIHRPVFGIIPAYFHDGAGCTRV
VSGERIALLGDAASLASPLTFCGFGSVVRNLDRMTSGLDRAMREGRLGAA
ELANISAYEPNVASMATLMKYMCYDPETDEPGFVNEMMNEVMIVLDELPQ
RYRQAMFRDEMKVEELVTVMLKVAWRYPKILKATWNKLGVGGSVGFVKNL
AGWAISQNEKRG
>CT0108 sigma-54-dependent transcriptional regulator
MNSEIFRIFRKKYSEEIRNPPMNATTLPEALKLMQYIGDAIGTIRDPQEL
FRTVTDKLRLLFAFDSAVIITIDRERREASVFFEMLRFELPEQLRHQTRS
IAGTWLEGHLDDRTVTVASIARDIPSFGADGAPLLWTLHELGMRQIVLSP
LRSGGRVIGFLSFVSAEEKLWSDGDKSLLSGVSSSIAIAVSNALAYEELR
QREAETSMQLAINNALFTIKDRSQMLLTVCEQISRLVPCSFLGIRVVGSD
GRFHIYDNFMRGSGSSFAPFSPLEHLEMLPDDPVARESIEVISRPGIYSG
ERFDELCRDFRILELVRDRFGISSIIVLQLWDLPGSRAGLIISGVGVTLG
DEEARTVSLIVPQLALALQNYLAFDEIDRLRRKLEGERTYLVEEIRAAHN
FEEIVGNSAPLAEVLRRVSQVAPTDATVLIEGETGTGKELIARAIHNLSP
RKERVLVKVNCAALPASLIESELFGHEKGSFTGATERRIGKFELADGGTI
FLDEIGELPLELQAKLLRVLQEKELERIGGRRVIPVDVRVIAATNRDLKK
EVATGRFRQDLYFRLNAFPLSVPPLRERRDDIPVLALHFARKFAREFGKP
ERAIRERDMNELVSREWRGNIRELSHTIEQAVIVAEGDSLDFSTVLPLRN
ESVPLSAPSALMTMAELEEEMRGMERKLILDALDRAGGRVSGEGGAAELL
KINAKTLYSRIDKLGIRKRYGAG
>CT2138 hypothetical protein
MKPLKEVVGAYLALSDAQRQLVAGEYDEAAANCRRAMEISHTMPPEEAFD
HAGFDAFCHAGLAEALAGLRSFDEALHSADKALHYFNRRGELNQDEGKLW
ISAVYSRALALDGLGRGAEAMPEFKKVVEMIEERKGETPGKERMMEVAID
RIAQLGASNQQKKPGYKAWWEFWS
>CT1887 membrane protein, putative
MEWIFSPEAWIALLTLTTLEIVLGIDNIIFLTIIVSRMPAKQQKPGRILG
LGLAMLSRIALLLSITWVMRLTNELFTAFGHAVTGRDLILLGGGLFLLAK
STHEIHQSLEGTEEVVKERSASNFVMTLIQIALIDIVFSLDSVITAVGLA
KDIPVMILAIMIAVGIMMVAAQTIGEFVERHPTIKMLALSFLILVGATLV
AEGAGFEFPRGYIYFAMAFSVSVEMLNLRLRKKEAEPPVHLRKALEEEET
L
>CT0676 conserved hypothetical protein
MNPEALRKLIDTGETQAVEFKGEERTPLNDRTLVEAVVCLANRSGSDTGW
LLVGVEDDGRITGARQRHDHGTDPDRLAALIAARTRPSLSVRVYTVTLNG
VPVLAIEVPPQRVPVATSEGVFLRRALGGDGRPACLPMDAAAMQSLQADR
GLLDPSAQVVAAAGWHDLDPLEFERFRRSVRERRGRSDESLLDLPDLELA
KALGVVDANGQVRGVRLAALLLFGKEDALRRLVPTHEVAFQALRGLEVET
NDFFRWPLLRVMEEIEVRIRARNRERELMVGLLRVGVPDYAERALREALA
NALIHRDYQRLGAVHFQWHADHIEISNPGGLPEGVRLDNLLVTAPRPRNP
LLADAFKRAGIVERTARGIDTIFYEQLRNGRPAPSYARSDATTVVVVIPG
GEANLDFVRLLVTEAQSGRVLGLDELLILNALWQERTLATDAAARLIQKP
EADARATFRHLVEAGLVEERGQKKGRTWHLSAAAYRALGDRAAYVRQHGF
EPLQQEQMVLQYVRKHGSISRGEAADLCRLGPMQAYRLLKRLETEGKIAR
TGGSTKGVRYAMASK
>CT0512 glutamine amidotransferase, class I
MSGTILVVQNISHEGPGLLANLLEEHAINVELCDLSKSEPIPDPSGYAAM
VVLGGPQSANDATPQITGELKAIDKALDAGVPYLGICLGLQLLVKARGGS
VVKCHQKEIGFREPDGEPFMVELTGDGKQDALFLGMPERLRVFQLHGETV
EPAKGMTLLATGRGCKHQVVRVGSNAWGLQCHFEMTPAMFESWIGIDADL
KAMNRDELLAEFEAISEEYTETGRSILLNFLAVTGLVKP
>CT2120 hypothetical protein
MPVMDRAAMRTMATLQPSCMRWLCTTPSILDAAKAGWPISWAATPDSPAT
VTIRQFQASTCCRKKTFDVVVNTDVLEHVPEAELDCVLRDFRKLSTNAII
IPHLAKATRILPNGENAHCTIKTPSEWAQVFKRHYAHVYELPHHSAVHAL
FLCGDQERDVTALRGILEQYVAAKNEVRHHLLPLGKRIEKAIRLIRGKDI
NR
>CT1377 hypothetical protein
MGDAARQAADRLDLLGLKKLMFQLGSLLFGLFPAADVPEKDKRAMLTGKN
EGNG
>CT2087 esterase/lipase, putative
MQTRDRLFLPRDSMNEYYLETTVSGCYLVESPPGSGPFPLLAGFHGYGQT
AEDELELLRNIPGSDRWIRLPIEALHPFINAKGQPGSSWMTRRDRDRRIA
ENVRYVDAVIGRVMAEQPVDGRLVLHGFSQGAGMACRTAVLGCHPVVGVM
LLGGDIPPEIADCSRMRAVHIGRGDRDRFYPQKRFEADVARLREAGIEPV
VSQFRGGHGPTAEYFDAAGRFLNKIGRG
>CT1202 hypothetical protein
MPSPMKFFTSIIESIKLLFAGIWLVFRIILEYFGIISDGNDRTTGIKDMR
EEYKKANYR
>CT0909 hypothetical protein
MMNQFFEPPRVEALHVQNYRALQNVRLDSITPLTVLLGPNGSGKSTLVDV
FAFLSECFGEGLRKAWDRRGRFRELRSRDSVGPIVIELQYREKPGTPLIT
YHLEIDEKDRGPVVKREFLRWKRTHPGAPFYFLDYREGVGRVITGEQPES
QDKRIEKPLSGPDVLAVNTLGQLAENPRVIALRAFITSWHLSYLSADAAR
GNPEAGAEERLSQTGDNLANVIQYLGEEHPERLNKIFETLKRRVPRIEKV
TSRPLDDGRLLLQVKDAPFSSPVLARFASDGTLKMLAYLILLYDPEPPQL
IGIEEPENYLHPRLLPELAEECDMASERTQLIVTTHSPFFIDRLRPEQVR
VLYRGADGYTRAKRVADMRGIGEFLEAGASLGDLWMEGHFDVGDPLAGEE
GA
>CT1733 hypothetical protein
MLGFKDIPLTKKVIHIINDIERPFMIRWSNSHDVQQIHDWLQEEEALEVH
GNFLCNWNLTRQCHEEGRLLVLIDEIKGIPVAYQWGQLLSSGILQVRNGW
RGNGLGRLVVEHCVELALQQDEMVLQVECKPSSSIPFWEAMGFTIVEGEF
GKNAKGFRVLSKNLALPPGGRPILATISSFPEERNWQDNVPAIASYHLNA
IVADDGKVYLAERASFPKCFRRMSRDPVIEIIVEGKLVYRDKAKYQGAQD
HGVKWCRNGFYIDVVTI
>CT0121 potassium channel protein, putative
MSKDAQQISALRRFSVSIISVALLVISGTVGYMYLENMSLLDALYMTVIT
VATVGFSEVRPLDDVGKIFTMVLIVGGTGIFFFTLTSVAVFFVSGEWKEH
WEQQRNERMLRKLNDHFIICGYGRLGGSVAEELRAKAVPFVVIDNMIDNV
LRARDEGFLAIKGNAADEEVLADAGLHRAKGLIAAAGNDAENVFIVLTAR
NLKPNLYIVARADCDGSESKLRRAGAEKVVMLYRSAGKRMASLLTEPELE
EYLDELSNANNLNLRIAQYLVGDNSPLVGKSFQEVDLYNNHRINVVGYKL
PDGELHTTPRPAEIIQKNGTIIVIGKSDDLEMLCKLAQGETQR
>CT0954 hypothetical protein
MALVKPTALQWKNRDPVDNLPNTPRRLNKIKPAHPQTNGMVERCNGRIEE
IPRQTRFASANELATTIEHCAKLADARTPQTMSGHRTPLDTLRAWRKGSD
LFVQNIYNLAESDT
>CT1685 conserved hypothetical protein
MKKNMGQKDRAVRAILGVAMLLYSIVFQNLVGLVGLIPIVTAIIGYCPLY
EVLGVTTNKYAD
>CT0059 hypothetical protein
MAYASFDHRQVPLFEYVLRILPVGAWRRGRTNVNEVYRRQCSNRTAFMLP
RVDEVAAVGALFILHVLHEAIIIHLSGKIMSRQVFSKTAKGLYLISLASI
LGACNHKAPEQQNTAPKPVSAAADNATAVTIESGKGSVEITDAVKPWPDD
APADVPRYPYGTIRKIIRTETPEGNSWDMAIERLPEHALLDYEAVLKAKG
FETTSMIVPEKEGDRGSVTGIKGAITVVLIGSGGSMSLSIIQKQ
>CT0918 pentapeptide repeat family protein
MREKNPSMPTDLSGAALKGRRLRKIDFSQTSLAGADMRQSDFGRSEFRDA
DLSGAKLDGSVLAGSRFTGADMNQASLAGALCAGSDFSGAKMASTVLRRA
DCGEAKLRGTDLSGADLREANLEHADLSRADLRAANLWLARTGGTDFTGA
LISDETVLPNGKTGSAQWASEHGAMFAPVVAASKPEPAGVAGAALPQAVQ
SATPSASSQAVPSAAPQKATTLAKTLKPIRAWRPAPSSISYDADQLDQLK
SNVTKWNRLRTEKPAMNVRLKEAPLDNRVLAYADLHGADLEKASLKRSDL
EKADLKSANLRGADLRSANLQRADLRQADLRGANLWLANTGRAEFEGAIV
SSETVLDTGKKATPAWAQEHAVRFMDESEPLR
>CT0098 ribosomal protein L11 methyltransferase, putative
MQPPKTHNHIELAFEIDSDLYELYIAVLSQVGIEYFLEDDHKLLAYLPES
DWNADKEASINIMLQETFGSVPRFTASFMADRNWNAEWEAHLQPVEISDR
FLIIQHQKEYDVKPGQIVIAINPKMSFGTGYHATTRLMLRQMEELDLADK
KIMDIGTGTGVLAIAARKLGNRNPILAFDNNAWAAENAVENVAENDVADI
RVELLDAEEELAATLEEGYDLILANINKNVLDRILPTIRRHAPNAQVLLS
GVLVYDEPWLKQLLKRIKYTNVKTIYEDEWLSALVEPKN
>CT0406 hypothetical protein
MLLVLILWSLARFLCSPFDRWACKSEKDLVFKALNKFFLAMTGAGLKAGE
>CT1478 hypothetical protein
MFLKKFDLILYGFIDSIPKVRTDSAVFCSSVDFTYYYSSKSIPVLRKKKA
GESSQMFNQSQNHP
>CT0455 ABC transporter, ATP-binding protein
MAETILRLKGIRKELELSRDVRQTILPNLSLEIFEGEFVAITGPSGSGKS
TLLYIMGGLDKPTFGKVWLDGQEITGLDEAEMTVIRNRKIGFIYQFHFLL
PEFSAVDNVMMPMLIRRKYGKKEIRERAMKLLDMVGLEDKYTNKPNQLSG
GQQQRVAIARALANEPKVLLGDEPTGNLDSRSANNVYELFARLNCELNQT
VIVVTHDEDFANRAGRRIHLVDGKIESDSRTPRQATA
>CT1700 conserved hypothetical protein
MLKDALGEWRGSEEEITRQIEADPVNAQAWASRAGVRSAAGDFEGALGDL
TMAIELGLRFRERIIAYGNRGIIRSETGDYDGAIEDFSAVIEARPRKSIM
KAALVQRALAKEKFGDKEGSAADRRLARILSPDLNKQTTKK
>CT1914 hypothetical protein
MSFEGEFASYEPLRRLLDSEKVKSLQNRLKIRQQEEETEDFEGSIVKKSD
LTESTLQPDLVLAIDGSNLAAKAENGFPGAEFGYITIASVLIDLKLIGEL
EKKEFVEPKKFRETEKASTIESVFPGCNVILDTEKNAKSSLRRALFEELR
SNTIFSDGESLLDTYEHLFRIKREHFQERNLPRSPIEGVEEEMTYDFGEY
TCPHSGEPLFSTDALRLHELMNPGGSNGEMFGQIMSTLEKLWLVHILRAF
ERKGWLATLRRVAFIMDGPLAVFSTSSWLTKVISHELTRLNDLQRKINGQ
DLLIIGIEKSGTFFNHFIEIDTTKDGVTDKFPKQSALLLNDGYIKRNIIF
SESIKPYGQDTYFGRKLFYKAASGQKIVPVVACFNEYQRNLNTANPDQFT
RLADVMNLLDLLVSSRYPNSVSPLVSAHAEAAIPLNLGKRIFEDIAREIR
EKSKE
>CT2121 methyltransferse, putative
MPMSFYSKFSEYYERVFPFREDVWQFLKRYAGSPGNALLDVGCGPGHYCG
RFASDGFNALGIDLDEAMIDEAQRRYPEAAFRCLDMRRLEKTEGRFDCVW
SIGNVLAHLPTEALAPFISKIHNLLKPGGYWIMQVMNWDTLAELTNYDFP
VRTIEANGSTATFHRHYSSITPESLQFTFSLKDEDSVLFEESVTLYPVAI
ERYLKLHEDAGFLYEGMYSDFSGSALRSVPGTGLALVFGRG
>CT1632 peptide ABC transporter, periplasmic peptide-binding protein
MLGDADYLNPVIGASLTSSEITGLIYPALLQGEFDTKTGLLNYLALEKRL
RSSTGPDEKSPKGALAKTWTMSPDHRSITYILRDDAKWADGQPITSRDFK
FTYKLYGNPVIASPRQQFLAELVGADKGQVDFDRAIETPNDTTLIFHFYK
PVPEHLALFHTSLTPLPEHLWKNVKPEEFRESKLNQQPVGAGPYRLADWS
KQQSLTLSSNVSCNLPKPGNIKRIIYRVIPDYTVRLAQLQTGDVDVVENI
KPEDFAAVQKANPNVDIKTIGLRVYDYVGWQNIDGAYYNQTGKIRPHPLF
GDPVVRRALTMAIDRQSIIDGYLGEYGVLAKTDISPSLKWAYDDSIKPYG
YDPAQAVKLLEAAGWMPGPDGIRQKNGRKFSFVLYTNAGNARRNYACTII
QQNLREIGIDCKIEMQESNVFFQNLQDRKLDAWMAGWSIGLEIDPLDVWG
SDLKKSRFNFPGFINPRIDQLCELAKNKMRIEDARPYWIEYQKILHEQQP
VTFLYWIRETQGFSKRIQGAKLNISGTFYNIDDWTLKPSIAP
>CT0518 transposase
MQSLSSHYHQLLGLPSNWEVENVNLSMSSRQVEIRLAFTGKQGECPICGQ
SCLIYDHAAEQRWRHLDTMQFETILVARLPRCQCKEHGVKTVQAPWAARH
SRFTLLFESFAVELLLHCANIKAASRLLRLNWHTVNQIMRRAVQRGLVRR
KTETVEYLGIDEKSFKAGQHYVTTLTDLGERRVLEVVEHRTTEATKELLA
SLNDSQQAGVKAVSVDMWKPFIHAVQELLPKADLVHDRFHISKYLNEAVD
LVRRKECRQLDKAGDKRLIGSTYVWLRNPENMREPQQAELSELMEGEFKT
GQAWSLKNMFRFFWQLGCADAGTFFFEYWSKRVDEVGLVPLTKVKELLQR
HFGNLLTWFKHPITNAVSEGLNSKIQIVKASARGFHRFESYRIRILFYCG
KLNMAIGS
>CT1283 hypothetical protein
MKKLFILFAFVFLAACGSTSSIQDKEGKSTKIDLSMYDNVVILDFTDATK
KHNMPAFAGRNFADRIAASVKEKGVFKVVSREPLADKSIVVSGTITKYEE
GNGALRLLIGFGAGSSYFNANVHFTDSLNQQELGKVFVDKQSWALGGIAA
STQTVDGYMNEAAKKIAKELADAKNYHCEPNTSAQTETK
>CT0085 conserved hypothetical protein
MKFAIFVNTTREKALELARELTAWLDARSIDYVFDPQSAKALGCGKWEEK
ADLSQHCDAFVALGGDGTLLLASHYSRSKPVVGINVGDLGFLTEFSPDEM
WVAMDHLVSGNYSIHTRSQLEATLESGESLTSLNDVIFEKGSAARRLPAF
TILLDDEMLGSYRADGIIIATSTGSTAYSMSAGGPIIAPKSNVFVITPIC
PHMLTVRPIVISDDKTIKISVDSQSGEFPLKMDGIQKKLLAPGEVVTVKK
SPHHINLVANEKRNYCEILRKKLLWSHEHPTGE
>CT0026 conserved hypothetical protein
MNHTSGITPLHPSIFHNILGLVALQTTRVGGVSASPLDSLNLGTHVGDDP
ERVRENERRLCAFLEISCESIVTTGQVHGTEIAVVTGPGKLDGYDALITT
VRGLFVGILTADCYPILIYDRRTRACGAAHAGWQGTAGRIAEKTVNAMHN
SFGTRPEDCLAWVGTGISGECYEIGGEVAARFSSRYLKPSPSGEGRQLLD
LSAANRDQLIEAGIPPSQVQCSELCSYRDADRFFSYRRDNGKTGRMLALI
GLRHSD
>CT1679 hypothetical protein
MLLRVCVWEDELFMVWYKKGDAFENSRFSNICGRDLMIKKFFAALQRRDP
FLLFVVVLLLVTGCVKRDKRIRSITIGEQVWMAENLATDCYRNGDPIRHA
KSVEEWNDAISRQEGAWCDYDNDPASGRLYNWFAVADPRGLAPVGWHVPN
DEEWRELEAATGGRGFETAFTGSRNCLGLFFGQGSTAFFWAATPSGEFDA
WNREISKTGGKMQRVSVAKGLGLSVRCVKDN
>CT1343 hypothetical protein
MKIPVCLQTHNRLGEKYVGDGGRELVRKNCTGFSGIESESEEHFLWYIQE
VRVSAKGTA
>CT0357 conserved hypothetical protein
MSAKTANILFIGDVVGNPGLQIVSRMLKGFISKYGVDFVICNGENAHNGK
GMSLEALNLMLEAGVDVVTGGNHTWNNFNFFETLKTHERVLRPQNYPKGT
YGKGYGVYKLPRGLGNITVLNLQGRTFMSPIDCPFRTADWVIKQTKEQSS
LLVVDFHAEATAEKISLAWYLDGRASAVIGTHTHVQTADERIFPKGTAYL
TDVGMTGPYQSVIGMQVKSAVDRMLYQTPHKYECATDDVHFSAVLLTLDT
ETGKAVGIKRIFYPEFESVATQG
>CT1548 peptidase, M16 family
MTETKTNQNESYPYTTVPGDALQTRIYTLKNGLTVYMSPYHDEPRIYTSI
AVRAGSKNDPAETTGLAHYLEHMLFKGTDSIGSIDYAKEHTELEKIIELY
EQYRATSDPEHRAAIYRDIDSISNVAAQFTVPNEYDKLLNSIGAKGTNAY
TWVEQTVYINDIPSNELDRWLTIEAERFRNPVMRLFHTELETVYEEKNMT
MDSDSRKLWEELFKGLFTKHTYGTQTTIGKAEHLKKPSIKNVIDYYRSWY
VPNNMALCIAGDFDPDATIRLIDEKFSKLEPKPVPEFHPPVEPEITRPVV
KTVTGPEAEELVLGFRFGGADSDDADMLTLIDKILFNQTAGLIDLNLNQQ
QKVLEGGSMLVLMKDYSVHILSAKPRDGQSLDEVKALLLEQLDLVKKGEF
PDWLVTAVINDLKLEELKAFESNRGRSEAFVDTFVWGMDWARQVNRFKRL
EKITKAEIVEFAKQHYAQNYVAVYKKHGQRKSEAKIQKPPITPIKVNRDR
SSVFAKNLLAKKSSKVQPVFVDFKKDIGYYDITPEISLNYVPNRENELYS
LYFMFDAGSNLNRKIDTALDYLSYLGTSRLSPAEFSQELYRLGAQFTVQT
SDNYVYLKLSGLKENFPQAISLLDELLRDAQPDAPALEKLKEGIRKERAD
EKLSKRKILFEAMVNYGKYGPKSPFTNVLSDEEIDKLTPEELLGEIKHFM
NYRHRVLYYGPDSPETLMTELRTMHHFGQSFQPVPVTDPFEELKTAKNHV
YVVDYDMTQAEIIMLSRGAVYDASKVPLVTLFNEYYGGGMSSVVFQEMRE
AKALAYSVFSVYRLPKEKDRHSYVFSYIGTQADKLPEALDGFNELMQKLP
ESPELFASAKAGIDQKIRTERITKGDVLFALEEARRLGLDHDIRQDVFRE
VPGMSFSDIEQFHETRFRNKPQIMLVLGKKEQLDLETLRKYGDISFLTLR
EIFGY
>CT0586 hypothetical protein
MRLEPEVVLGAFTGLVEKHLEGILCTEKAIAVSKEKR
>CT0663 conserved hypothetical protein
MTNEFEILKPGKLLLASANLLDPNFKRTVLLMCEHNEEGSIGFILNKPME
FKVCEAISGFDEIDEPLHMGGPVQVDTVHVLHTRGDVIDGAVEVIPGLFW
GGDKEQLSYLINTGVIKASEVRFFLGYAGWSAGQLEAEFEEGSWYTADAS
SEQVFTDEYERMWSRSVRSKGGEYCYVANSPELPGMN
>CT2040 succinate/fumarate oxidoreductase, putative
MKRYAYYQSCINEAMTKEVDRSLGLWQHDLGIEMVKLHESVCCGGSNLDY
VSPKQFALVNARNIALAEKQGLDLVVSCNTCLMTIRTAKKKLDESPALRA
EVNEILKEEGLEYRGTSDVRHLLWVLIDDVGLDVIRKKVKTPLTKLRIAP
FYGCHILRPSTVLGKDDPLEPTSLDLLLDALGAKTIPYEHKNRCCGFHTL
LVAEEESLNVAAEALKEAMDKKADFIVTPCPLCHTVLDGYQAKALKHNGL
KGAIPVYHLSEVVGLALGYSDRQLGIKRHIVTA
>CT0660 Nudix/MutT family protein
MSCPRCGWVHYINPLPVAIALTVNRNNELLMIRRAHEPAFNEWALPGGFL
EAGERPEEGCLRELFEETSLEGTIDKLIGVWHLESGLYGSLIAVAYRVIA
AHERISINHEVFEAGFYRPDNMPPVRIPLHRQIIAESRWPERDLSLTL
>CT1219 conserved hypothetical protein
MTNFNTIHGELRGVEFRTLYSNGKTDGCLVKEENILNTPYGELIPQFETE
DMGRRQLKPFYFYKSGSIKSVPLQSQTLIRTPVGGIPAELVTFYESGALK
RIFPLDGKLSGFWTAKNEFALAEEITVESPLGSLRAKFIAIQFYESGALK
SLTLWPGEFLTLSTPAGQIRVRTGIAFYENGAIRSLEPAGVVKVETPAGT
MTAYDNDPQGVHGDLNSLEFNLDGVVMALKTVSEEVVITDQLGEEHLFRP
WLKDNPCGAERKVVVPLKVRFSEGRIIIDDRHESFNMAQCRVEIRPFDRK
AGDPAYSCAG
>CT0407 hypothetical protein
MNHTKEATKNRNGQPKAKLRPELAEAQPEARLKTYRGWRPDEQVSRRQMG
EARFSHESLPETINP
>CT0467 conserved hypothetical protein
MEHSRETIAMLIDADNSPSDKIDFILAEMAKYGVVNIRRAYGNWKSHSLK
GWEDKLHDYAIRPIQQFDYTKGKNATDMAMTIDGMDLLFSKKLDAFCIVS
SDSDFTPLVMRILSEGLNVYGFGQKSTPLPFVNACSTFLYLDSFAGEKEQ
TKSDSCRRKSSHELRSDGKLRTLLLNAVDSAQEEDGWSNLAKIGKNIANQ
ASFDPRNYGFKRLSDLVRAIDMFDIEFRNNNSELYIRDKRKGRKYETLPV
DAPATAPVSVVPEPVAVKPVEPQAEESQQPASEPALKSTSEPQVVQLSSP
GPESVAPEVPALKPVEPETPMPVTETAESAAELPVSTEAKPKRTARSRIT
KPRSTSSRKTKVAAEPTSETAKSTGDSAVVAPAPAKTEVVQATVETPEPE
KKTRRPARRRTTRKKSEGESDAVPAE
>CT0373 oxidoreductase, Gfo/Idh/MocA family
MKQKIRIGFIGSGWAQVAQAPAFSLMEDVELAAVASPTERRRQKFQDRFG
IPEGYADWREMLDECPLDLVCVTTPTFLHDPMVTGALEKGVGVLCEKPFA
LTVEEAERMNALASKSPGLSLIDHQLRFHPSVRSMKQMIDSGEIGKVYEV
RAVVNLASRNRIDMPWSWWSDASKGGGALRALGSHLIDLNRFLVGEISEV
CCNLSTSIPQRPDASTSGKSLPVTSDDSFAMFMKFGPSSVALGASSLMHV
TTVGSYTWFSLEVVGSLKTIRLDGAGRLWEIVNSEVKGGRSLIDAPRWKQ
LEPMLPWDELVLQEKIKQSSLAVHGIFAVGFAFLAHRIVKALKSGDPVIL
QDAASFRDGLAIQKVMQAGLDSDREKRWVKV
>CT1022 hypothetical protein
MRRVSSVDEAVAETKSRLMLVFVLNGGLECLSSGKQQIFAVEYLVFFLKE
PVGGLLLF
>CT0128 hypothetical protein
MITSGCGAQGWVITVHSSGAISYIKKRQCVN
>CT2220 conserved hypothetical protein
MHHSGMKSKKEGRIFERARKAAEEILRDPEKIRNVIDAALHMVGSASTSS
PFGELTDKFQAIIRLVRAYVNRQYRVVPWQTVILAVAALLYFINPFDAIA
DFLPLIGFFDDAAVLTAVLASINHDLNEFLDWEKMTAQNSGEPIPRVIDA
DFEEVKADSIDNG
>CT1910 hypothetical protein
MWLIKHWFSASTFVVKIATFAKPQNVMGNNKKQKL
>CT0661 hypothetical protein
MKDSFSGVAHSGQNANDFMQATVKSRCRFHFQTEAQCSRSIPSEKTGRVF
K
>CT1475 transporter, putative
MTNSTPNAKIGPIELAASVMPRHAWTFLFASFFSIGMVTFVSIGQAYILN
EHLGIPVSEQGTISGDLVFWTEIVTLLLFGPAGAIMDRIGRRPVYAVGFI
ILAIAYLYYPLVTNVFQLTIARMIYAVGVVAVTSGLATVLVDYPKERSRG
KLIAVVGFLNGLGIVILNQFFGGLPKRLVANGMSGTEAGFWMHATIAGTA
ALTAIVLFFGLKGGTPVRHEERPPIRKLLTSGFRHARNPKILLSYAAAFV
ARGDQSIIGTFVPLWGMTTGLAMGLEPAEAVKKGTFIFIISQAAALLWAP
VIGVFLDRWNRVTALTICMGLAAIGYLSLAMIGNPLETYSLIFFVLLGIG
QISAFLGSQSLIGQEAPKEARGSVIGAFNISGAIGILFITTTGGRLFDGM
SPKAPFIIVGAVNLLVMLGGMWLRTREVKERGKFAA
>CT1390 hypothetical protein
MARLTRCGPFPLRSQPNLKKSFAQKIIAIESLAKSAFVWYPHTEKIHCKY
CHDYQSTDPIQ
>CT0416 hypothetical protein
MNWLLNISTHGSDLEIAGPEWSEAVSLLRRTGMDGFELYPAGAYDCTAIP
KEIVGGVHLRFFVMLRQIWQNDREGLLRMFGSEQTVRHYYSGVNREAVIS
CYRSQLELAARFNVPYVVFHPVHYELEYVFNWQPPWGWRETVDLSTEIIN
EVVRDTPYDGWILFENLWWPGNFRLDSTDEIDRLLSKVRWPKCGIVLDTA
HILNKNQELRTEEEAIAFLLRQVERLGDYRQLIKAVHLSKSLSGEAAKAG
FANADPFCGAHDFWQRFSIALRHVRDLDRHEPFSHSMLGELFDLIEPDNV
VFELAFSSLDEWIGKIEQQQRALQRFFPECRIKENLY
>CT2078 NADH oxidase, putative
MNKQVDVLVIGGSAAGIVAATTGKAFYASKSFLIVRKEPEAVVPCGIPYI
FGTLDGVHQNIVPTAPLANADVELLIDEVVSIDREAKSATTAGGVVISWD
KLVLATGSEPKTPDWLEGRDLDGVFVIPKNRDYLCRLRSRLEEPRRVAII
GGGFIGVELADELAKKGHDVTLVEILPHVLSMAFDSDLSLKAEELLVKRG
VKLKTGEKLKKLAGQASVSKVILESGEEIEVDIVILATGYAPNVELARSA
GIKINELGAIRVDEYMRTEDKNIFAVGDCAEKFSFITRIVKGLMLASTAC
SEARIAGMNLFGLSRLRTFSGTIAIFSTAIGGTTFAAAGVTEQLARERGF
EVVSAGFTGIDKHPGTLPETSNQYVKLIVNSENGLVLGGAVMGGQSAGEL
INVIGVIIETKMTVNELLTLQFGTHPLLTGPPTAYALIKAAEAVEMKLRH
FK
>CT1638 conserved hypothetical protein
MNKEIIEVFDNTYPDRDYTIEIINPEFTSVCPKTGLPDFGTITVNYVPDK
SCIELKSLKYYFLEFRNAGIFYENITNRILDDLVEACQPRRMTVKTEWNA
RGGITETVTVSYSKSKE
>CT0309 mannose-6-phosphate isomerase/mannose-1-phosphate guanylyl transferase
MMIMPVILSGGSGTRLWPLSRAMYPKQLLPLFGEKTMLQDTVLRVGSIEG
VGPVICVCNDEHRFLVAEQLRQIGVAEQAIILEPFGRNTAPAAAIAALVI
AATHPGALMLLLPADHVILDRQGFVASIESARSAAESGSLVTFGIVPSTP
ETRYGYIRAVAGSTGVTRPVAEFVEKPSLERAEGYVVSGDYFWNSGMFLF
RPEIYLAELEASSPEILDACRKSLENARRDLDFLRLDPEAFAACPANSID
YAVMEKTSNAVVVPMQAGWRDVGAWSALWEAQERDAEGNIKRGDVLTHGV
RNSYIHATSRLVAAVGLEEHVIVETADAVLVASKERVQEVKAIVEQLKLQ
KREEPLIHRRVYRPWGSYETVDEGERFKVKRITVKPGAALSLQMHSRRAE
HWIVVTGRALVTVGKKQVPLEANQSIYIPVEELHRLENPGDEPLELIEVQ
SGGYLGEDDIVRFEDHYGRL
>CT1512 thioredoxin reductase, putative
MLDIHNPATDHHDMRDLTIIGGGPTGIFAAFQCGMNNISCRIIESMPQLG
GQLAALYPEKHIYDVAGFPEVPAIDLVESLWAQAERYNPDVVLNETVTKY
TKLDDGTFETRTNTGNVYRSRAVLIAAGLGAFEPRKLPQLGNIDHLTGSS
VYYAVKSVEDFKGKRVVIVGGGDSALDWTVGLIKNAASVTLVHRGHEFQG
HGKTAHEVERARANGTIDVYLETEVASIEESNGVLTRVHLRSSDGSKWTV
EADRLLILIGFKSNLGPLARWDLELYENALVVDSHMKTSVDGLYAAGDIA
YYPGKLKIIQTGLSEATMAVRHSLSYIKPGEKIRNVFSSVKMAKEKKAAE
AGNATENKAE
>CT1573 aldehyde dehydrogenase family protein
MIVTINPATGEQLAEYPVMIAGQIDSVLRQADADFRRWRSTSFGERRTCM
KRLAELLREQAEKHGRIITLEMGKPFSQAVAEVNKCAWVCDYFADHAEEF
LQPEQSEIDGAKGIVAFEPLGVILGVMPWNFPYWQVLRFAAPILMAGNGI
VVKHAPNVTGCAIAIEKLFRDAGFPEHLYRAVHIDLDEVDRLTGFMIDHP
VIKAVSVTGSTGAGQAVAAKAGKALKRSVLELGGSDPYIVLEDADLAQAV
DACVAGRLLNAGQSCIAAKRFIVRKEIIGEFTKKIVQRMQTAVMGDPFAK
STEVGPIAREDLRDLVHSQVQRSVEAGAELLWGGHVPNAPGWYYPPTVLS
GVKPGMPAYSEEIFGPVATIIEVADDDEAVAIANDSEFGLGSAVFSQDVE
RALGIARRLEAGSCFINTMVKSDPRLPFGGVKQSGYGRELSHHGIREFVN
IKTFYLP
>CT1174 thiamine-phosphate pyrophosphorylase, putative
MPEIAFMTEKHPSLPRLMIVSSGGEHFSQKGLVLAQAQTLARSAPVIFQV
REKMLDSASMWRLCSQIAPLVDNSGSILTINERFDIALASKAGGVHLPES
SCPADVVRKTARKLLVGQSVHSETTALKAASTGLDYLLFGPVFHTPSKAS
FGPPQGLDRLREICEKVRIPVFAVGGITPEKVPACIECGAWGVAALTPFL
DAGSLPETVNRFYSFMQS
>CT1118 hypothetical protein
MVGFDQLSVSMMTLPCDGSCETLRQHYKSHTIAAMKQHRAFFGNATVRRI
EKTFFSHPLLHYTEKEGTRS
>CT0836 internalin-related protein
MEQNIYKQCPVCGFPLTRQNAICPRCGNDILEDINTLDEQNHEKHYRIIE
EKKADWYIRCLAENLDTGKSPLVNFPNDYTGPMHSGFHSRLTAAEQEALA
TSRAVLLRDPQKRHNWFKALGNDWKEVVKNTLKIQRDPSDEELLDFLNAT
SLRCNNMRIHNLAPVSLLENLEQLRCDETPVESLEPLRNLRNLRRLYAFD
CDFTSLDPLRDILSLKLLWVSSTEISDLEPLSALVNLEELYCSETMVSDL
TPVSGLFRLEKLSCYKTEITSIEALRNLTNLSELGINSTGVSSLEPLARL
ENLEYLRCSRTAITSLEPLRNLVALKELSIEQTAIHTLEPLSGLVELEEL
FITGTLVDSIAPLMNLMSLEKLELSAGRIPQSEIDHFMQLNPACEVILKS
NG
>CT1234 hypothetical protein
MNRLDISNGKPAPDMVLLALQHFGIPAAQCLVVGDTVL
>CT0430 type II secretion system protein
MDFSAYAPSSYSVPEGESRVYRLAVPAKRVAVGDPAIADFIMISPSELYL
SGKKTGATNLIVWGKNGNFTTAPLVVSRNVKPIQDLLRAVLPKEHDIQLY
SSGDALVLAGSVSNALAAETAIRLVKTFLGGTVPDVTPEATLTKKSEAAT
GTSGGTSISGMTGVASAAGMSGAATVASSGIHGFINLLKIRDPQQVRLEV
RIAEVSKSYMEALGFSWTQGVGSTAGSSLMTGFVSNATLNLLLKNSGNLK
VEADSQKNWIKMLAEPTIVAMSGQEGYFLVGGKIYTPTPTGNGAVDYQER
TYGVGLRFTPTVLDAGRISLKVAPEVSEPDSQFQSAGSLYNLPAFKVSAA
STTVEMNEGQNLVIGGLLKDKLTETIEAVPLLGQLPLLGALFRHTSMDSE
KVEVIVIVRPTLVKASDTVPELPTDRFVPPGPNRLFLEGKLQGSK
>CT1378 sensory box protein
MSEMPAVEFQKQELLQLRQELEVANKRIEALEAELSRRIGQETKIRLRAD
AFRLCAHGTAIGAPGINVVLTCNEAFARMRGQSVKEIEGSSIVSLYAPED
QQMVKDKLKITDSTGFCSCQAKMMRKDGTIFPVQIDVVGLKDENGQIMYR
IVTVQDSTERLESQSALRESEERFRSVVESAPDAIFIQTGGRFAYLNHSA
IALFGASKAEEILGRRVADQIHPDYRDLVAERIRLLNERQ
>CT1790 conserved hypothetical protein
MDGHSKNHCSPVGRYFPMPLDRRFLPGWLAFLALLAICVFPLMAKAAGVP
ALVGRVNDYAGMISPQARSIIDQKLKALEAEDGTQVAVLTVPSLDGQPIE
EFSIKVAEAWKIGQKGRDNGILFIVSKNDRAMRIEVGYGLEGRLTDLQAG
RITRDVIKPAFKSGDYDKGFIDGVDAIVASVKGEYKAPKRKNDDGAPSPF
LIFIILFVLFVASRFMRFFGGGGGPFGFGGPGGGFFPGGGFGGGGGSSDD
GGFSGGGGDFGGGGASDNW
>CT0201 conserved hypothetical protein
MQFRAMQYTIFGQNSRQGSYILLINLDRAIKVAFGKFRKSEPLELQPSHY
LYVGSALGSAKGRFPLASRLLRHASRSDSKPPHAIRSALLELFMSWGYRA
PASRGDKKLHWHIDFLLDREEAELAHVFIFPGAERIEARLAATLAAMPET
TPVADRLGAQDAAAGTHLFRIDKLEPLLPCLEKAMPDLIRQG
>CT1692 conserved hypothetical protein
MYKFITDNVVVFDVEWVPDPASGRRIYKIPDTASDDEVLDVMWRKGGATP
DDPRPYLKTVMCRVISIAALVRKRVKDGVTLKLISLPGLNVSEQAEGEII
RQFLEAVGKQKAQIVGFNSSNADLPILYQRALANRVSAPTFCHRPDKPWE
GVDYFNRYSDFSIDLKDLVGGYGKAMPSLHELASSLGIPGKMGIDGADVI
DLWRSGDIRKIVEYNQFDALSTYLVWLRTAYFSGMLTEDEFVQEEHKLET
LLLQEIESGSEHLALFLDAWKALR
>CT1206 polyprenyl synthetase
MDINVVTFSVTEELKQFQERYKTVLHSSNSLVDKVTRYVLRQQGKQIRPT
LVILAAKVCGGVHDVTYRGAIMVELLHSATLIHDDVVDGAEMRRGIPSIN
ALWKNKISVLIGDYLLSKGLLYSLENKDYRSLHLVSEAVRRMSEGEILQI
QKTRSLDITEEDYLSVIADKTGSLIATSCAIGAASSTDSEDEIASLKSYG
EFLGLAFQIRDDLLDYTGDSKKTGKQLGIDIKDRKITLPLIYALRQSDKS
EQNKIKSILKSSRKRSVRSGEVIDFVTRKGGLDYAAEVAEGFADKALESI
AHFPESDAKRSLQLLVDFVMKRQH
>CT1391 chloride channel, putative
MSWLFKNSRIKRRVIVLTYLILRKSRYFKGSSQQFVRMTWASFLAQLNLN
QDLPFLLVAVFVGLVTGYVAVIFHDAIKIISSYLFYGTTALGLPTFNNYL
RIFLLPLIPALGGLIVGLYNAFVVKARPEHGLPSVIKAVAQKNGKIPTKN
WIHKTITSVVSIGTGGGGGREAPIAQVGASIGSTVAQWLKFSPGRTRTLL
GCGAAAGLAAVFNAPIGGVMFAVEVILGDFSVKTFSPIVVAAVVGTVLSR
SYLGNYPTFQVPEYSLVSNTELVFYFILGVLAGLTAVLFIRTFYFIEEHI
QKIEKRFRIPAWLMPAIGGLLCGLISMWVPELYGFSYEVINKALIGQESW
ENMVAVYLLKPVVVALTVGSGGSGGMFAPTMKMGAMLGGMFGKVVNNLFP
AITAASGAYALVGMGAVTAGIMRAPLTVILILFEVTGQYEIVLPIMFAAV
TSALVARLAYPYTMETYVLEKENVRVGFGIALTIAGNISVLEVMQRKFVK
FFDVTKVENIIDAFYNTRDSHFFITTPEGTFVGIIGLDEMSLVLKDGIFP
GMIADDLVKKNVTVLYDTSKLDEALKIFEISEYSTLPVVEYHSRKLLGIL
KQDEAFSYYRKQMNLIGEDAGELADQRTA
>CT1182 conserved hypothetical protein
MSLEPSKDKKDYNHQLLDKVIHARIRFAALAYLMSVPEASFVDIRDSIRA
TDGNLSIHLRKLEAARYVECMKSFESRKPLTNYRVTSEGREAFARYLRNV
EKFCEDMPDLQVQTA
>CT1005 hypothetical protein
MTSRKSLRQQLHHIIFDYDTIPAKAFELVLIAAIFMSVTVVMLDTVRSIH
DAWRPLLYGLEWFFTILFTIEYLIRLCATE
>CT1160 type III restriction system endonuclease, putative
MTTQAPFSLRGRNPDVLTCIANLSNDEVFTPPELANRMLDTLTEAWAANH
NGANLWADKTVTFLDPFTKSGIFLREITRRLVEGLTEEIPDLQERVNHIL
TRQVFGIGITRLTAMLARRSVYCSKHANGPHSVCKTFTTESGNIWFKRVE
HTWKDGRCIYCGASQSTLDRGEERETHAYAFIHADDIRTRINEIFGGDMQ
FDVIIGNPPYQLDDGGFGRSASPIYQNFVEQAKKLEPRYLVMIIPSRWMG
GGKGLREFRATMLKDKRIRKLVDYENAQDAFPGVDLAGGVCYFLWDRDCP
GLCEVTNISGGESVTTVRQLDEFSTFIRNSAAISIIRKVMATNEPRMSEQ
VSNSKPFGLRTFVRPEKKGDLILRWEKGEGPYPREKVTAGHDMIDKWKVI
TSYVGYDHAGNPGKDGRRRVFSKIDILPPGTICTETYLVVGSYGSKTEAE
NLVAYMKTRFFRFLVAQFMYSHHLTKSAYELVPILDMNETWTDAKLAARY
GLTDDEVQIIESKIRPFDNGNGAG
>CT1955 magnesium-chelatase, bacteriochlorophyll c-specific subunit
MFLACYIKAGFITLRGLKNLNQSTSQLFMGDKIRIAAVVGMEQCNQRVWR
EVKDLIGQNAELTQWTDQDLEHQNPEAGKAIREADCIFTTLIQFKNQADW
LREQIDQSKVRTIFAYESMPEVMQMTKVGNYVVSEDGSGMPDIVKKVAKM
LVKGRDEDALYGYMKLLKIMRTMLPLIPKKAKDFKNWMQVYTYWMHPTAE
NLASMFNYIMAEYFDVNVKADKVQEVPTMGFYHPDAPEYMKDLNHYEKWL
HKKSRDAKSRNNIAMLFFRKHLLQEKEYIDNTIRAIEAKGLNPLPVFVMG
VEGHVAAREWFTHTKIDMLINMMGFGFVGGPAGATTPGASAAAREEILGK
LNAPYVVSQPLFIQDINSWKTQGVVPLQSAMTYALPEMDGAVCPVVLGAI
KDGRLHTVPDRLDRLSTLAKKFSELRHTANRDKKVAFVVYDYPPGMGRKA
SAALLDVPKSIYKMLQRLQNEGYNVGELPESPEALLAMLDRATDYEIQAH
EPDCFAIDRATFNAITTERERERIEARWGGFPGEIAPVGVDKMFLGGLTL
GNVFIGVQPRLGIQGDPMRLLFDKENTPHHQYIAFYRWISREFGAHALVH
VGMHGTVEWMPGLQLGVTGDCWPDALLGEVPHFYIYPVNNPSEANIAKRR
GYATMISHNIPPLSRAGLYKELPAFKEMLNDYRERGLEKIVDVETEMAII
EKAENLNLADDCPRLEGEPFSDYISRLYIYLLELETKLISNTLHVFGETP
ELATQVTTISEYLKVRGNERSLPSVIMQAIGESETWGDYAALATKARKGD
PKALKVREKVDDITRNFIEQTIFGNANASNVFSVLTGGAKANEEMAAAIN
SALQEGVALKQGLQDNSHEMESFVRALNGEYLPSGPGGDLVRDGASVLPT
GRNIHAIDPWRIPSELAFKRGKQIADTIIQKHRDENNGEYPETIAQVLWG
LDTIKSKGEAVAVIIALIGAEPAYDAQNKISHYRLVPLEKLGRPRIDVLI
QISSIFRDTFGVLVDHLDKLVKDAAKAIEPAEMNHIKKHVDEALAQGKDF
ESATSRLFTQAPGTYGSQIDELVEDSAWESEEDLDNMFIKRTAFAYGGNR
YGDEQSDILKNLLGTVDRVVQQVDSAEYGISDIDRYFSSSGALQLSARRR
NPKGDNVKLNYVETYTADIKVDDAEKALKVEFRTKLLNPKWFEGMLAQGH
SGATEISNRFTYMLGWDAVTKGVDDWVYKEAAETYAFDPAMRDKLMKLNP
KAFKNIVGRMLEASGRGMWSADPDTIEKLQEIYSDLEDRLEGIEV
>CT1268 acetyltransferase GNAT family
MIPIELLPVCADDLDEMAGIFNYYVEHTLATYTETPVSVERFVSLMCFSP
GYPAFVARDSDGALAGFGLLRPYSPIPAFDRTAELTCFLSKGNTGRGIGS
AILQALESEAVELGIETIVATVSSLNEESLRFHLARGFVEQGRLVGIGSR
NGRCFDVIYLQKTF
>CT2095 UDP-N-acetylglucosamine 2-epimerase, putative
MSSVKKIVLIAQDRAAFLHVAPLVSVFRKNGVFESVLVRVLTPGNRAEHD
ALAAAFGLSDELRTIELEPCTPVAETASLMLALERVLSELEPAFVVPGGH
DSASLAGAFAAAKMGIPVVSLDAGLRSYDRAEPEEISRLVIDSVAALHFV
SEHSGIYNLMNEGVADERILFVGNTAIDSLVTLMAQANQSGVLETLSLAP
KKFVTVLLKPEPFGNRDLLCKVLESLAATSTVLLPGSQSPEDALVGVSGL
RMIDMPGYIDLLRLLKESALVLTDSAEFEAELTVMNVPCITLRQSTARPS
TVELGTNVLIDPDEAEILERATAILSGKQLKKTLIPEKWDGAASKRIAEV
LERGA
>CT1726 membrane protein, putative
MQQHDRQSSAAPAWIAGVAAAVALWVPLYLNLEAGADLLVAALGLSRATP
LGEALHFFIYEAPKVLMLLTAVVFVMGVVHTFISPERTRAMLSGRRVGVG
NAMAATLGIVTPFCSCSAVPLFIGFLQAGVPLGVTFSFLISAPMVNEVAL
ALLFGMFGWKVALLYLSMGLLVAVISGMVIGKLGLERFLEEWVRQLQNSA
VSSEFSAEAVSWPERIREGLRHVREIVGKVWLFIVLGVGLGAGIHGYVPQ
NFMASLMGNQVWWSVPAAVLIGVPMYSNAAGIIPVVQALLGKGAALGTVL
AFMMSVIALSAPEMLILRKVLRPQLIAVFAGVVATGIMLVGFVFNAIF
>CT0420 magnesium-chelatase, subunit D/I family
MKRKYFPFSAILGQEDLKKALLLNAVNPRIGGVLVRGEKGTAKSTAVRAL
GALLAAARNGSGGEGEVVVTLPLNATEEMVAGGIDFQQTMKEGRRVFQPG
LLAKAHKGILYVDEVNLLDDHLVDIVLDAASSGENRVEREGITLSHPSLF
VLAGTMNPEEGELRPQLLDRFGLCVEVRGEADPALRVDLMLRREAFDREP
EEFSERYRAEEERLGRTIADAQSLLPSVRIPSHLRGFISELCRNSNVAGH
RADLVIEQAARANAALRGSREVSVEDVTKVAGFALVHRRRDPVPPPEQKP
QEPERQEEQSRDEQSRQEKPEERGDEPQQPQEGDDRRDGNDDSGNEERSQ
PENRDAAEQQERPDNEGQDELFSIGQSFRVRSIATPKDRKMRRGSGKRSR
SRVSQKQGRYTKSTMPRGNNDIALDATLRAAAPFQRHRENPNGMAVVLQN
EDIREKIREKRLGNLLIFVVDASGSMGARGRMAASKGAVMSLLLDAYQKR
DKLAMVSFRKNEAFVNLPVTSSIELAARMLKEMPVGGRTPFSAGLLKGYE
IAQNYLRKEPGGRPLIILVTDGKANRAIGQGKPLDEVFTISEKISREERI
RFLVVDTEEPGLVTFGLAKKIAGLLDAWYYRIDDLRADTLVSIVKTMMP
>CT0104 multidrug resistance protein
MSSTATTIPGAQALEHHYETGARKWIITATVIIAAMLELIDTTIVNVALN
HISGNLGASIEDVSYVVTSYAIANVIVIPLSGFLGNLFGRRNYFVASIVL
FTGASFLCGISTNIWMLVFFRFIQGIGGGGLLPTSQAILYETFRPEERGA
ATGIFSMGLVLGPTIGPLLGGYLVDYFAWEWCFFVNIPIGIAAAWASLTF
VKEPKVKPVVEKIDWAGIGLLSVGIGSLQFVLERGEQKDWFETDYIVWFT
IIAVVSLIAFVWLELHTDHPAVDLSVLARSKNLAIGAVLTFIVGFGLYGS
LFIFPVFVQRLLGFTALLTGLVLFPSAMLTGIISMPLGIALQKGASPKLL
MTVGMVAFFWFCWELGNQTLMSGAENFFLVLLIRGFALGFIFIPVTLLAV
TGLHGKDIGQATGLNNMVRQLGGSFGIAITNTYVTQRVAAHRIDILSHLS
PYDPAAVQRLQDMKQALGQYMSSPVEAGQAAMAALEGIVVRQSYHLAYMD
AFKMIAILFAVCLPLLLFIRVDKKETVDMSSVH
>CT1266 conserved hypothetical protein
MRYLILSMIRWERVCMSKIDLVSDIRPLSEFRANTAALITQVRKTGRPLV
LTQHGKSAVVLLDVRHYQSMLSAFEQMHGLQSGAEASVLTGGEKS
>CT0359 hypothetical protein
MSLFITKSFRRLLGGRFSRGFRGTECKGGLVAVAPGSGNGLEIYTIFKGI
PTPRDSGLKTVFL
>CT0353 hypothetical protein
MNLIAYLNLLRPSKQYPLDDTVGGALEAITMLLP
>CT0939 aminotransferase, class II
MNHDHLLAHRHGDRSGDRLASAISLDFSVNLAPIEPPLQELFSTSIPLAP
YPSMDGHGVREFYAARFGLDPVCVLSTNGAIEGLYLIPRALGIKRVLVPQ
PSFFDYGRACRLAGAEVVPLALSESGSFAFPGIDKLADALAGCDALLAGS
PNNPTGTLIPKELLLALACRFPEKRFIVDEAFIQFTEAFPSNSLMTEIKA
FRNIAVVHSLTKFYAIAGLRLGAVIAHPSMIRQLYGYKEPWTVNAVAEHV
AGQLLHCGEYEREVHAIVREGRKQIADGLAGNSVITLHGGAANFFFASVA
DEFSLDALFDVLAERGILVRDCRNFEGVPPKYFRFCIRTSDENRRLIEAL
NDFAELSRIAKAAVEEAVS
>CT0032 ftsQ protein, putative
MARPKHEQRSEELRQDPALPDLDAPESVRPRSGKLRRLFGTTPVMMTFAA
LLLAAVAALSWYATQWKQQVTVHRVVVSGVNLIPTASIERRLNRFKGKNL
DEVRLDDVRRALAPEPWIKQMRISKELNGILRVGIDERRPAALMADAGQP
LIIDTEGNLLPDEAVSERFRLVPVYGARSTRPARPGGVRRLNDKDRNLLF
ELLVAFDQSTYARLMVSAIHLTPDNQTWFTVTGSPIRFVVGNDGNFKEKL
KKFEIFWQKVVAKKGIDCYESVDLRFRQRVFATSPVNEEASVDSTAAPVA
PPAGGQLPDEHH
>CT0898 phosphate ABC transporter, permease protein, putative
MNLGTRKILDRSFTAVGIGSIIVMAMALLVVIVPIIWNGSGAIFFKGTIE
HRKLLYNHFHRGDKAELDREIASTDKYRQKVYKMVADFQKELDAMPPEKS
ADLSTEFEDVKTSLRALLGPLPGDDEPVLTRFKYGQTRWDKAEDKLHDLL
YVTKWDSSNPNEMAKQYFADRAPDFNGTALYPLFGYVKENLRNMLLPKFT
FYWGFITDSSIDSHIFGGIWPEIKGTFFLAVGAMLFAFPLGIVAAVYFTE
YANEGWFTSMLRSANNTLAGVPSIVFGLFGLAFFINTLHVSHSKSVLAGS
LTLAIMILPTIIRAAEEAILAVPKTYKEASLSLGSTKWNTIMTVILPAAL
PGILTGGVISLGRAAGETAPIIFTAAVSVGAAVGLGDVLNTPTPALSWNI
YNLVSEHEMASQIRYVQYGMVLALISIVLFLNLSAIMWRARISKKLKG
>CT1412 transcriptional regulator, AsnC family
MSGHLDLIDLKIIESLGGNGRIRLSELAEVVGLSIPSVSERLDKLQKNGI
IKGFTMEVDERQLGFDIQAFVRVRVDSSKHYKSFVEHVMKEEEIMECYSV
TGEGSHILKVMTHNTASLERFLSRIQSWPGVLGTNTSIVLSQLKKNNRIC
AEIVRNNLQDRQVLITFDEKKSKK
>CT1641 magnesium-chelatase, subunit D/I family
MLTKLCAAALLGIDALKVEIETNVSGGLPAFTVVGLPDSAIKESRERILT
AIRNSGFELPSKKITVNLAPADVKKEGTAFDLGIAVGLLGSLGEIKGQFE
DTIVLGELALDGSVRRISGTLPMAIMAARENIRWMIVPKMNAAEAAVAIS
AMGAATKVFGVETLVEAAGLLSGNITPEPVSVNVAELFDCEPDYPVDFAE
IKGQLAAKKAIEISAAGGHNLLMIGPPGSGKTLLAKAIPGILPPLGFEES
LETTKIYSVSSLLERNRPLMITRPFRSPHHTTSNVALIGGGAQAKPGEVS
LAHNGVLFLDELPEFTRNALEVLRQPLEDREVTVARAALSTRYPAGFMLV
AAMNPSPAGPLKDRDGLPTASPEQIRRYLSKISGPLLDRIDIHIDVPKVE
NIELFSNSSAESSGEIRKRVIAARAIQHERFASLPSPRIFTNAQMNSKLI
RKFCKLDKESAEKLMEAMNRLNLSARAHDRILKVSRTIADLEGAEHIEMK
HLVQGIQYRSLDREFWSF
>CT1304 hypothetical protein
MATLEETLAWTEIPLPEALKNLKQEQQLALTSYIKAVVDNKTDGFDELYH
AIGSIVRFIPHFIVVPLMVEHIRPQISAGVCRTMGVDQAVNYANDLPLEY
FSEVSRHLDNDLMARILEKMKRNQAEKVILFELLHHRSHMLGIAEHLDRR
LLEFVAKNLEINGFSESDPELAKHKLLLEKIRNLR
>CT1009 hypothetical protein
MLKEAAAVMGGALLLPPLGVSMVACGIPGLLVAGAGFFAFDAMMQERRAS
AQQSSSDPANGGDESWQTMPPEETERYR
>CT2274 conserved hypothetical protein
MSDNVHYKSNIELTPQEYEELEDFLLHESGLKHPMNLDALDGFMTALIIG
PEPIMPSQWLPHVWSSAVVDEAPVFESDEQAKRIIGLIMQMMNALSHQFE
ESPEDYAPLPNLTTFDSDEDQRKAARLWCCGFIEGINMNQDSWKSLLKDE
KGAKTVFAISAASGLLREKLNLDEEKEYELWKLIPEAVLEIRDFWRPGHR
RKKPDEKQPKAEAPGRNDLCPCGSGKKYKKCSGQ
>CT1595 conserved hypothetical protein
MSFDKKVTIAHLSDLHFASKNDRYLTARLDTMLGEFVRRKYDHLVMTGDL
IDTASPALWTIIRDALVRHGLFDWTKTTVIPGNHDLIDLEEEMRFYNALN
PDDRSRQHRVDDRLRQFNAIFRPLITDNGDALAGVPFVKVMRLGGISLSF
VAVNTVDPWSGLDNPAGARGSVSPETLRALQEPGVRQVLDDTFIIGLCHH
AYKVYGTGALVDQVFDWTMEFKNRDEYLKAMKNLGVRLVLHGHFHRFQVY
QANGINFINGGSFRYSPERYGELVINADGRWSHRFVNLALKK
>CT0592 hypothetical protein
MLSRIQSLYLFIAALLAFGSMAFPFWTFTTDHVILFGDFMDVQGAGLIVT
AGSIGGGILSPLTGIVALATIFLYKNRKLQQTLITLCFVLFAADLLAGLA
GGHFLKQYLETKASSVSFAPGSGLFMLLPEPVLFWLALLGVKKDEKIATA
YKRL
>CT0891 hypothetical protein
MATSAGRMMRRLEWPLFRFLLLIGEPYIVRGQLLLPISDEPHNR
>CT0421 magnesium-chelatase, subunit I family
MKKNYTYPFTAIVGQEEMKLALILNIINPAISGVLIRGEKGTAKSTAVRA
LADILPEIEVIAGIPFNLAPDEDDETIRECFTVTGHAMPDPDNLEVTMHK
VQVVELPVGATEDRVVGTLDLEHALKTGEKRIEPGLLAAAHRGILYVDEV
NLLDDHVVDVLLDSAAMGVNTIEREGVSFSHPARFTLVGTMNPEEGELRP
QLLDRFGLCVHVGGIADPQDRVTVMERRFAFEQDQERFCSEWQGESSKLA
ERIVAARELYPSVTISREHLLGIAKSCLKVGVDGHRGDIIIMKTAKTIAA
WEGRHCVGSDDIDRAVALALPHRIRRQPLQDMVMDVGSLLGSKCTTN
>CT0554 ABC-type export system, outer membrane channel protein
MVNRPRKTSNRMRTVSKKIASRGVVLLAVPALLLSGCAAGPDFVKPEAPA
VKSYDQQPLPKPVDGGQRFDEGVGPAAYWWKSFGSTQIDSIVEEGLVGNP
GLQAAEASLKASRENLRAGAGIFYPQASATFSQTRETSSPATTAGSGAIV
NLSTLSASVSYALDLFGGQRRSVEALGAQVDVQRAQTLGVYMTLTGNIVN
TSIAIAAYREELDEYEQLIALEKDQLSIATKQYQAGVAPYSSVLALRSQI
ASLEASVPPVRQKLSQAEHLLATLAGKTPGEWVRPDLRLSGISLPERVPL
SLPSELVRQRPDILAAEAELHAANAEIGVATAALFPSFTLTGSYGRTASQ
PHELSDPLNRFWSIGGNIAAPIFNGGSLRAKRRAAIATRNQALALYRQTV
LAAFAQVADLIRALEHDAQQAEAEKQAVESAKLSLDLVQANYKAGMVNYE
QVILADIQYRQAKIGYLQARAQQLQDTAAMYVALGGGWPHDGVAREKAGL
>CT1917 hypothetical protein
MLLRFFRQSTPLQKPGIARFFAFLGGDEKGNPVASFPDLFARRNRFFTFN
ECVG
>CT1852 ATPase, ParA family
MGRVIAIANQKGGVGKTTTSVNIAASIAISEFKTLLIDIDPQANATSGFG
LETGDEIENTFYNVMVNGGEIRDAIKPSGLEYLDVLPSNVNLVGMEVELV
NMREREYVMQKALKQVRDQYDYIIIDCPPSLGLITLNSLTAADSVLIPVQ
AEYYALEGLGKLLNTISIVRKHLNPKLEIEGVLVTMFDARLRLATQVAEE
VKKFFKEKVYKTYIRRNVRLSEAPSHGMPALLYDAQSIGSKDYLDLAQEI
FERDGNIRKFKVRQQ
>CT0342 pterin-4-alpha-carbinolamine dehydratase
MTQLENKHCVPCEGTAAPMASEELQRQLSSLPEWTLVDDSGTSKLVRVFT
FKDFQSALDFTNRVGQLAEAEGHHPALLTEWGKVTVSWWTHAIGGIHLND
VIMATKTEKLV
>CT1329 hypothetical protein
MISKISFDRKTIPVVRSGSGIDNAASRPVLYSVLFAPNIKTIFKRFAIFC
GFGVFVVSGQMRGTHSKKTEYYE
>CT1814 conserved hypothetical protein
MESISSNLTAVREQIAEACRKAGRREDEVTLIAVSKTKSAAAIREAWDAG
QREFGESYVQEFLEKVEAPELSGLPVSWHFIGHLQSNKVRQIVDKVTMVH
GIDKVSTAKELSKRAGQHDLTVDYLLEVNVSRESTKYGFSPDSVLQAAEE
CFALPNVRLRGLMTIASPAPSEARREFAELRQTLDKLRQNAPEPSLLTEL
SMGMSGDFEEAILEGATMIRIGTAIFGWR
>CT0287 hypothetical protein
MFFRVITHNHKNRQPKSLATDGKSSERKENINVRNNNAGNAEAESAGQKK
SQDFRSLRTSRA
>CT1763 uroporphyrin-III C-methyltransferase
MVKMSQILTRKVLHLFFCQALYFLGNNESRKVIHQMAEHGENRVLIVGAG
PGDPELLTVRGAAAIREADVILYDCGTVEPVLALASERAAIVRVDRSPYE
TGEGRREQTPMIVVIREYRDRGLRVVRLKTGDPSLFGGEVDEGDVLTRLG
IPWAAIPGICAGAAAASAYALPISRKFESDAVLNLIAAAITDDFALIRDA
AHLLGHGATVVLYMATANLPGILRTFREAGVPDAMPVVAVSKAGWPDEAF
ARTTLGELTAAGCSIALPEPVVYHIGRYVRVRNVPAGSQEFFVKAPENAG
>CT0216 glycosyl transferase
MAVYNPKPEFFRQAVLSVLGQTMPVLELVLVNDGGSDEFKSVLPDDDRIR
VFTKPNEGVAATRNFAIKQCRGEYIAFLDQDDYWYSDKLAEQMAMISVKG
EACMVISPVDIVDNQGVPIIKQNRKKVTNRYFRKIAHPDIRFPLADGNFI
YSSTPLIHRRVFECVGLFDSFTQPHDDWDMYLRIALAGISVHCYRERALS
VWRMHDSNESQKRMAMMLSKCEVEWKILQGGVPEPVAKIMHSNLALDQIV
MDNLWYNEREFARFRETVRRDLPGLMRCAVNSGRHDRYVTGYRKRAASAI
IKSARRYLLSLFSRQR
>CT0148 oxidoreductase, short-chain dehydrogenase/reductase family
MSFYTLITGASTGIGRRLAEEFASMGDNLVLVARSQDKLETLAAELRRSC
GIEVQVCCQDLAEVGAALKVFGFCEERGLPIDKLVNCAGFSIAGNFERMD
EETFVQMVLVNMVAVAALTRRFLPAMRARRRGVVINIASLAGFQGVPGMA
GYSASKAFVVNLTEALSVELQGTGVRIFAVCPSFLDNDLFYSRAGHDRSR
IVTPVSSPEVVVKAVHRGLDGKQVLILPTVLDRLMVFVQRFTPRKIVVLL
ADIFAGARERGGSNG
>CT2219 histone macro-H2A1-related protein
MPDNVLIHAIKADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLE
ACRELGGCLTGEAKITKGYRLPATFVIHTVGPVWHGGNHGEAELLASCYR
NSLKLAIEHHCRTIAFPSISTGIYGYPVEQAAAIAITTVREMLADERGIE
KVIFCCFSDRDLDVYQKALAAG
>CT1375 conserved hypothetical protein
MIENIMSLQKGLPEEISALEEDLAFTSRQIEARKKIADERQKLRERLNSV
IHDCKEKIKSFKEKQTLARNNKEYDALSKQIEYEEKEIAQAEIQLQDISH
AEHHAQELQKKGRELIAENRYDEISEEMMPDDVLQQQMEDLGQQVRQKKE
ELESIVVETAEEVAQLKAVLSEQRSVVAQQAKRLLDKYDHLKSGSIQNAV
VKLDRQACSGCNTRVPTNRHTLIVQGGFYVCESCGRIVVHERLFDEAAAS
GQQ
>CT1225 N-acetylmuramoyl-L-alanine amidase, putative
MTKDEKQKLYTVRWYVRERWADANSQRDVLELLTELLDPESAPDVRPSVS
AGGQHLLWCPFAETDFAPAVSRGSYATGYPKGAVVHFTAGRRNGLKPAMK
YQTEQGYLYFVIDKDGNIGQNFPLDSWGYHAGKSSYPGLNGTVSDELVGI
EIECAGLLTKEGSTYKTWYQTAVAEEEVRYIKTKTDNQQAGYYEKYTQEQ
EEALTRLLLWLHKNNPDVFSLDFVVGHDEVSPGRKNDPGGSLSMSMPKYR
EFLKQQAG
>CT2229 conserved hypothetical protein
MSAHIYKKLEIVGSSATSIEEAVNNAVAKAAETIRNIRWVELVETRCHVE
NQKIAYWQVTCKIGFTLDEN
>CT1983 prokaryotic and mitochondrial release factors family protein
MICIARGVSVDDSEIEITTMRAQGAGGQNVNKVETAVHLRFDIAASFLPP
ELKDRLMQQRDRRITREGVIVIRAQRYRTQEKNRQDAIERFQGILTKALV
EQKMRKATKPPKSASVKRLEQKAKRAELKASRKEVSTED
>CT1287 multidrug resistance protein, AcrB/AcrD family
MFERFISRPVLATVISVILVILGLVSMRQLSITRFPDIAPPSVSVTASYP
GADAETVGRTVAPPLEEAINGVENMTYMTSTSSNDGSLSINVFFKQGTDP
DQAAVNVQNRVAQATSQLPTEVNQIGISTIKRQNSQIMLINLASNVKAFD
ATFLQNYAKINIVDDLARVPGVGQVSVYGKMDYSMRVWLRPAQMAAYGLS
PQEVSAAIQSQNLEAAPGQFGEMSGEPMQYVMKYRGKFQRPEEYENIVIR
ANPDGSLLRLKDIARVEFGAYSYTVNSKVNGKPGVVMAVYQSPGSNANEV
ETQLRKVLEKASQNFPTGIEYSIPFSSKKVVDESIAQVQHTLIEAFLLVF
LVVFLFLQDFRSTLIPAIAVPVAIIGTFFFMNLFGFSINVLTLFGLVLAI
GIVVDDAIVVVEAVHAKIEEKQWPAKIATVSAMREITTAIVTITLVMASV
FLPVGFLEGSTGVFYRQFAFTLAIAILISAVNALTLSPALCAIFLTDLHK
HSKEKHSRFGGFGGRFVKGFNAGFDAVRQRYTGILDYFMRNRKVAFIGLA
LVTAVSFWMFKTTPTGFIPDEDNGFVIVSVSTPPGASMARTQAVTDRADN
ILRTMPAIKKVISVTGINILTRSSSPSAGLMFIQLNDLKERGKVRDIKAI
LGQVNEKLAPIKDANFFVLSMPTVPGFGTVSGLEMVLQDRSGGDLHKFED
VAKGFIGALMQRPEIAVAFTTFKASYPQYELVVDNVKAADLGVNVKELMT
VLQAYYGSQQASDFNRFGKYFRVVMQAERGDRRTPDSLNGIFVKNASGEM
VPVSSIISLKRVYGPDAVDHFNLFNAISINAMPAPGYSTGQAIQAVDEVS
RTSLPAGFTYDWKGQSREEIESSGGLFFIFLLSVVFVYFLLSALYESYLL
PFAVMFSIPTGLLGVFLGIKLAGIENNIYVQVAIIMLIGLLAKNAILIVE
FALQRRVAGMPLVEAAMEGAKARLRPILMTSLAFVAGLIPLLFVTGPAAM
GNHSIGASAIGGMFVGMVLGIMVVPVLFVAFQGLQERFTGPAAEIVEAGT
LLEKQLKKDGSHE
>CT1901 ABC transporter, ATP-binding protein
MNVMGSILSCTNLKKVYNKREVVKSSTLEVKQGEIVGLLGPNGAGKTTTF
YMIVGLVKPDAGQVLLDQEEITKLPMYKRARKGIGYLPQEASVFRKLTVE
QNILGVLEFTTLSKAERQEKTEQMLEDLNIAHIRNSMGYALSGGERRRTE
IARALALDPRFILLDEPFAGVDPIAVEDIQQIVEDLVKRNIGVLITDHNV
HETLSITDHAYLLFDGSIFMHGTPEEIAENTEARKLYLGEKFSLNRY
>CT1953 ferric siderophore receptor, putative, TonB receptor family
MKKQQAKGSGMKRLALSATFLATTLAPVTTWGADGVIRGRVTDKADGEGV
VGAAVSIAGTNIATATDINGNFVLRNVPASKQQKVIVTSIGYAPTTQVIN
LGDGQTATLNIALGQTTIMASEVVVGAALYKQDRLDVPVTANIVTKEQIR
EEPNPTLDEVVQDVPGVVVSRAGGTSSSNLQIRGSNTYNGGGIGTRVNAF
YDGFPINSPDSGEIVWQSVNMNAADKVEVLKGAAATLYGSGAMGGVVNIT
GHLPDKFEVKAGSGIGFYDKTPSSDESEYRKGFTPVFWNTYAGFGNKSGK
WTYDFLYSHSDDDGYRQNAWNYMNDVKFKARYDIDSRQYLQLTSFYNSTV
GGYAYQWPYNATISTSTFTPILDQSYDVFVNARLFPTHTAAFSYAQTMYP
TSPFLWDILANAWSTYTKYDVYTDDLISRKNALVGINYVNLLSDKLSLDT
RLYYTYNASRIEYNRTDADQTYATGRIRTIGEFNETDDSRYGAGIKLDWR
ASDNHRLLFGVDGNIVDTRTTQVAVEYPVKNEFNNIQEKNFAVFLQDEWK
ITDKLTSLMSLRYDWSGVNKDEVEITPGVWIPINKKSVDALSPRVALNYR
ATDDMALRASWGRSFRAPSLYERFVHDAGFLTVVPNPDLDKETMTAWEAG
IFKQFSDKVSLDIAGFINNYDNLIESRPTAAPLTYMYGNITKARIWGIET
NLNYRPNTDWNLSVGYTYMNAKNRSFDASTATATELNNPDPEWLPYRPEH
TASASVTWKATKKLTLNVNGRYVGKYKAVTLYTNPDGKWYPGDFVVFNAG
LKYQFNKNVTATLACNNINNTQYSEAEWFRAPNRSFIAGIDLTY
>CT1653 hypothetical protein
MSEEKSCCCQKPEMLKGTPEKCSPETIKQCHGDQPTHPCVPEEKNAKEDK
E
>CT1867 conserved hypothetical protein
MTLALLGSSLPAASLQAGETGISVSASSTVTYNPDTAEFTTTVESTDKDA
AKAAAKVATLWSSLQQALRKVGISSADASSTSYTVSPEWEWNSANGKREF
KGYKARHVVRVVVRDLGKLGGAIDAVVGAGSGTVDGLSYSSSRFESLRTQ
ALENAVKSAKHDAEVMARAAGGRLGQPLELQYGQPQSDYPVMRFAMAEGF
KAAPAPTDVEPGEQKLTVSVSSRWQFVSGSGK
>CT1841 acetyltransferase, GNAT family
MDVEAVASMVGELLSEIMQAIGVPVFDVASDETAARLRDFLETGRYVVFV
AVDGRDEPVGFIALYESCALYAGGVFGTIPELYVRPECRGLGVGQGLLKA
AREFGKSCGWKRLEVTTLPLPEFDRTLAFYEQEGFELTGGRKLKVLL
>CT1789 hypothetical protein
MSQRVEKGFELFRKLFFYKAFQHSYPNKGDQ
>CT0794 hypothetical protein
MYPSERVVTPRSFRGNPLRNPVLIHLQIHDYQVKRK
>CT0901 hypothetical protein
MRKSAFKHNQNEKIPPPEPPCPMVGFISSAPLHQRLI
>CT0423 receptor, putative
MTTNSAIAACGRAEATRFRPAEANVPFSLRKRIVRRRGKRSAGCHIRDGP
CVDDLHNTQRNIPTTIRLIMKSNFVKPSKSFIRMAGLLAVLSTVCRPDAA
AKEVAKTDSADYVADEIVVSSTRTDEKLKNIPRKVEVITSKDIEALDPDN
ATELLQKTAGVDVIEYPGVLSGVSMRGFVPNYGSYLNPQYVTYLLDGRPL
GTYNLASVDMNMIERVEVIKGPSSALYGSKGMGGTINFITKKSRGPIKGT
ASLGYGSFETFEGNAAVGGSISDRFDFDIGFRYFNQGEDYKVGKNTLISN
PDPQILERDIDTMHNSTYSTNSGMMRVGYRLNDNFRVDLRGAFFNAPSVH
TPGSIWGYYGEGMKDVYRKTADLSLTGTAGRHSIKFMPYWSKDESNNLKQ
TEATQNKPSKIYPYYLGDFEEYGFQLQDVIALGNHRITGGLDYDNQTYKT
RRYSAPDVAQRPYSADSRTSDFGLFTQAALSFLDNRLIVTPGVRFDATTF
GLLDTPLVPNVNTEKEHDGFFSPSLAFQYSFVPELKVHSSIGRAFVAPSG
LQKAGEYVDSFGRTVRGNPDLEPETSRTWDVGLTWSDTKKGVRADVTYYD
TDFKDFITQVQRTDGSTTYMTYVNAGSAKIRGLEFELSYDFGALADYRYS
LRCFVNYTHQFENDVTMGGVTSPMKYVRDGLGSFGIEYDDFRLLSARLSG
RYLGTSYEDNYYRSYGRLPNVLVIKNEPALVFDATVGIKINAQNRVDLMV
KNLLDENYAEKTGYNMPGRWYGMKYIVTF
>CT0234 hypothetical protein
MWTQDRETAFSGEPCRIVWPRAAEPKKGRVAKNGENPRTARLPQVLQTWF
RQTEDHDTVNRGVPLKTKPEYKKNILLCTGINR
>CT0732 conserved hypothetical protein
MPMEKISVNVYGDSYPLRVENSELTGKAAKDVDGVMRRFAAKAPDLEAKK
LAVLAAIQFAEKKNELEEELSQLRQKMAHINEFIEQNLH
>CT1476.1 conserved hypothetical protein
MRIVVRPEAEQELLEAHARYESKAQGLGYEFARAADAAVASALRTPFGYG
TRIAEGFRRVLFGTQSPQCDPRQSFPT
>CT0701 trans-sulfuration enzyme family protein
MHFETIAIHDGNTPESCTGSVTPPVYQTSTFARPSLDERGEFFYSRIGNP
TRSALESTLALLENGKHATTFASGVAAMMAAMQVLKPGDHVVSSLDVYGG
SYRIFEQLMRPWGVETSYAASEATESYIECIRPETRMIWIESPSNPLLQI
CDIRALAEIANERGIVLAVDNTFASPYFQRPLDLGAHIVVHSTTKYLGGH
SDVIGGAVVASDDNLNLTIRNYQGAAGAIPGPWDCWLISRGIKTLKIRME
EHQKNALHLARALEKHPAVSRVIYPGLESHPQHELAKRQMSGFGGMLTIA
LKGGLPAVRKMIESLKLFVIADSLGGVESLVASPARMTLGPLSQAERDRR
ACTDDLVRLSIGLENAEDLEADLLQALATI
>CT0976 carbohydrate kinase, PfkB family
MSLLVIGSLAFDDIETPFGRSDNTLGGSSTYIALSASYFTDEPIRMVGVV
GSDFGKEHFDLLHAKNIDTRGIQVIEDGKTFRWAGRYHYDMNTRDTLDTQ
LNVFAEFDPHVPQYYRDSKFVCLGNIDPELQLKVLDQIDDPKLVVCDTMN
FWIEGKPEELKKVLARVDVFIVNDSEARLLSGDPNLVKTARIIREMGPKT
LIIKKGEHGALLFTDNGIFAAPAFPLESIYDPTGAGDTFAGGFIGHLARC
GNTSEAEMRKAVLYGSAMASFCVEQFGPYRYNDLDLLEVDDRYQSFLELS
RIED
>CT1975 CRISPR-associated protein, CT1975 family
MNNNPFKGQRIEFHILQSFPVTCLNRDDVGAPKTAMVGGSTRARVSSQCW
KRQVRLEMHELGVRLGIRSKKVADYVAKACVALGADDEAAKACGEKIAAA
FSNDTLFFFSETEASAYAQYAAEKEFDAAKFNDKELAKLSKKTLDPAKDG
LDIALFGRMVAQAAELNVEAAASFAHAISTHKVSNEVEFFTALDDLAEEP
GSAHMGSLEFNSATYYRYVSLDLGQLSANLGGADIADAVEAFTKALFVAV
PSARQTTQSGASPWEFAKIYIRKGQRLQVPFETPVKAERGGGFLQPSIKA
LTDYLTKKEQQAGSLFGKEKEFTFGGEDETFSIDTLVSEIRNFIEAKS
>CT1355 hypothetical protein
MMVCLLCHTGAHEKSFFNGYDGRALYEQVQFLLLSIMVVQNFL
>CT1568 protein-tyrosine-phosphatase
MPVRILFVCYENICRSPMAEGAFGHVASLLGAGGFFEVESAGTVCYQSGS
SPDHRAVRAAERYGIDISSIRARCIHDLDLGSFDRIFVMDAENHRDVLDA
LDGLPVTVHMMTDFALSDAGVEIEDPYYGSEEGFERTMERLLHSATGILS
ALREAYELPIVDREAGVFLQHSGTGDGR
>CT2010 oxygen-independent coproporphyrinogen III oxidase, putative
MICLYLHIPFCRERCPYCDFFLVTRPGFVERFFEALAVETAAKAALFAGQ
SIKAIHFGGGTPSLVPPSFIDGWLSQISGIARISAETEITLEANPEDLLP
ASLDALQSVGVNRLSIGVQSFADRKLRALGRAHSAADARRVVLEALERFP
SVSIDLMCGAEGEMLAEWEGDLRAALDLCPQHISIYMLTLEEKTRLWRDV
RKGLRDLPGEEAQAAMYRMAAELLGNAGYGHYEISNFALEGHHSRYNLAS
WMREPYLGFGPAAHSFLVDGDAETRFSNVSSLTRYLTDPVGAVDFREVLT
EAQRFDEEVFLSLRIRKPLSVGFLRKGHKLGHQHLDDILVTLQEKGWIGV
KKGTVTLTEEGFLFADHVAAELLSE
>CT0668 ABC transporter, ATP-binding protein
MFEVEALSAGVGSFRLENIFLSLNEGECQAVLGPSGSGKSTLLSAILGAT
PVTSGHIRLGDDEITYWPMEQRRLGYVPQHLGLFPHLSVRDNLRYSARAR
KLARNDFEPLLDKLVEITNIGKLLNRQIGTLSGGERQRVALVRALAANPR
LVLLDEPFTALNETLRKELWWLVKELQRERGLSALLVTHDLTEAYFLADK
ITVLINGRQEQSDNKTTIYQHPANLAVARFLGIKNLFPATVVKSSEEGIE
ADCPALAHSFRLQGHAPVGTAIRVGIRPENVMVCDEDHPPCPNDCVLSGT
IRLIDMGVNVAMHFHSPQLSSIIEIIAPRRLVNRFRIANDSPRLTIALPS
SAMFWVRDE
>CT0298 hypothetical protein
MVSMDRPALIFPELAQILLLLPNAVRIFLNSMRFESALGISARLDFRKMR
LTSQSAEII
>CT2029 competence/damage-inducible protein CinA
MKAIIISIGDELLKGHRVNTNAPFIARELGNIGIPVTRIITCSDDPQAIR
DSVTLALTEAEAVFVTGGLGPTNDDRTRDAVRALLGRGLALDEPSFERIA
DYFRRRNRPVTEVMKDQAMVIEGSIAIPNTKGTAPGMIIECAPRFAGRHL
VLMPGVPAEMEAMMRLTVVPFFAPLSGAFIRHTPVMTMGIGETQLADMIV
EVEDSLPSGTTLAYLPHAAGVSLMVSTSGARREDVDAENRRVVEAIVAKA
GRFVYATSEVTLEEVVVNLLLERKLTVAVAESCTGGLLGSRLTDVPGSSG
CFLEGLVTYSNQAKVRLLGVDPATIEAHGAVSEPVAKEMARGCLERSGAD
ISVSTTGIAGPGGGTPEKPVGTVCVGIASKLPDGAVRVEAARFVMHGDRH
QNKIRFSEAALRGLLVRLKEMEF
>CT1408 conserved hypothetical protein
MQSSRGARAADVPVRLVVDIGNTSTTLAIFTGDEEPSVESVPSALFADSS
TMREVFGNMARKHGEPQAIAICSVVPSATAVGSALLESLFSVPVLTICCK
LRFPFRLDYATPHTFGADRLALCAWSRHLFSEKPVIAVDIGTAITFDVLD
TVGNYRGGLIMPGIDMMAGALHSRTAQLPQVRIDRPESLLGRSTTECIKS
GVFWGVVKQIGGLVDAIRGDLVRDFGESTVEVIVTGGNSRIIVPEIGPVS
VIDELAVLRGSDLLLRMNMP
>CT1257 hypothetical protein
MIEIKKTPMDLIDDIYSLAYWMTGSEKASSELLNTTCLNADLKAPETEVL
KTFRECYLDTYGQHADLDMHETTKESVSLIDSLKQWTADIKLSVLLSELT
GLKHAQISDIIGKPVDTVRLWLFWGRKFFAHDNLMRASA
>CT1460 hypothetical protein
MESGLSALRQRSLAEQLLRVGIRLHHLIEIV
>CT0465 hypothetical protein
MPAIFFLTEELNDMKYRSGKGTGMISDARLRRVCRLTW
>CT2061 response regulator
MKILVIDDDPSVRKFITTTLKKENYAVTEAENGAEGLIKLQQEKDISIII
TDLIMPEKEGIETIMEVRKINPAVKILAISGGGKAGPENFLLLADVVGAN
ATLKKPFGGQELLMCLRMLA
>CT1358 sensor histidine kinase/response regulator
MVSGILGALANIFFYVVFKYGFHLPYENFWLRMVATLLCISLIFMHRLPE
SFSPYFPYFWHTFLIIILPFTFTVNLLMNNFHELWLYWEIFMVFVLMMFV
PNWLLFMFDLLTGVLGAILFHNLSVPYVPLNPTFNIPLYSIVISFSIVAG
SIFSFANRNTLKELERKKAEEKYRALEALAGSIAHEMRNPLSQVRQNLDE
ILLELPRSSTENDYASLPKKNIETIQKRAIQAQTAINRGLQVITVTLGNF
RNTDVSKKELTCLSATTITRKAIEEYGYASEHERQKIYLSPGEDFIFLGE
ENNYILLFHNLLENALQILQQVPDGRLAITIQRGESVNRILIRDNGPGIP
PNILPRIFEPFFTSGKKNGTGLGLAFCQRVMKSFNGQISCKSEVGIFTEF
TLEFPVLDKATINKFERNLYAEYTPFLAGKNVLMAAIPEAYVPSIRQQLM
PLKIGLDNAEDNNKALEMLAANHYDLVLSGISPLPAGTAKLGNIVKNKDR
NIPVVGCSFSPLPPVDTINGVASVIIMPPALPELLNAMKSSLEMARETLK
ESLSGKTVLVADDLDFNRRVIKLMLNKLGITIFEASNGLEALNILKSQPC
DLLVIDMRMPVLDGFETAQRIRAMPSPYRDIPILGLSGNLDNATLKLAKE
SGINDTLLKPVKLKPFLQKVTSMLKVNTPAA
>CT0757 ATPase, AAA family
MPDEHSQSDLFGFSDPGRSSAKSNRFQPLAERVRPRSIDELFGQEHLVGP
GGPVRSYLEQGRIPSMIFWGPPGSGKTTLAEICARSLNYRFEQLSATDAG
VKDVRRVLELAQKSRSIDGRQMLLFIDEIHRFNKAQQDTLLHAIEQGLIV
LIGATTENPSFEVNRALLSRMQVYILNPLSEAEVRRVVERAIESDPQLAA
AGVEMRDMEFLLAYAAGDARKALNALEAALSLAPRGTAPVVIDRTRLEQA
LQHRAPTYDKGGEAHYDTVSAFIKSMRGSDPDAALFWMAKMIDGGEDPKF
IARRMVIFASEDIGNADPYALTLAVSVFHAVELIGLPEARINLAQGATYL
ASCPKSNASYEGINEALSDVKKGAANAVPPLYLRNAPTDLMKEIGYGKGY
RYPHSYPGHFVEEHYFPEQMEPKAYYRPTGEGREKFIRERLEGLWKDRYG
D
>CT0715 hypothetical protein
MTMSSGCFSPVSLEFCQGFVQELSGKRCRTGAGTVNWLIMYDFHRIINGS
VKIFEVHLRLFSWYLTLKQT
>CT1680 es1 family protein
MKKIGVILSGCGVYDGTEIHEAVLTLLALDKAQAKAVCFAPDIAQRHVVN
HLTGEISENETRNVLVESARIARGSIRNLRDIDTMILDGLIIPGGYGAAK
NLSDYAVTGANCEVLPEVADAVMRFRKRGKPLGFLCIAPVIAAKLFGSEG
VEVTIGSDEQTAADIAAMGAKHVPAAVEEIIVSPDAKIVTTPAYMLGPGI
SDIAKGIDKLVAKVLELA
>CT1944 hypothetical protein
MPTAGAATGYWFYGEIYIYLNLIKKSAMLKQNQATRYALLYSLVR
>CT1976 CRISPR-associated protein, CT1976 family
MSNPFILLWLEAPLQSWGADSRFGRRDTLDFPTKSGLLGLLCCALGAGGE
QRELLDEMAELRQTVLAFQRERGERPPLLRDFQMVGSGYNEKDKWETLLI
PKKRDGGGAVGGGTKMTYRYYLQEAAFAAALEVPAARAGEFAEALKAPVW
DIYFGRKCCAPTDMVFRGEFDSEVAALEAASSIAKEKRLREAFRVRDYAP
GDEGEAEVVALNDVPVRFGPKKKYRQRRVTIIHHNDEE
>CT1026 hypothetical protein
MERLGNNVSKRYQISLEVADGCRVHAVVIPVGEDNITNVTKDGGGVSYAV
WSAFSNI
>CT1415 conserved hypothetical protein
MKSRNDELLIIFSKNPVAGRVKTRLASAIGDAEALRIYEQLRELTKQATT
GINASKAIAYSDFIPNTDLLLAPDTEAWLQQGSDLGERMHRAFVKGFSLG
FSRVALIGTDCPELSPFILDLAFRKLDACDVVLGPARDGGFYLAALKQPF
PELFLGRTWSTSSVLNDSQRIIREHGRSCDLLPALSDIDTFDDLRASGLW
TP
>CT0711 lipoprotein, putative
MKKITSLFFVLLFAILVGCSEKSAQESNSGVAAVVNGVEITNRQIDYFYQ
RTAMPGMSAEDSANLKRRILSDLIRIELLAGKAKEMKLDNNPDYSMALYA
AQKNVLAGLAERKLAGNQAPVTPDQAESVVQNAPQLFAGRKLYVFEEVIF
PGVDMPLLESLDAMATNGAPLSGLLDELKAKKKPFNSSLKALTSEQLPAP
ILAVLNKLKPNTPQVVRSGDKVSVILVLHDAIPAPLEGDPARRAAAAMIE
ANQRNQALSKAMQELLDNAKITYYNEYAKTADGKDKLSALPVPDANKATR
ELYKKIILGSGLTASFTLTIMMLTAVMRTFYSMLWLPRLWPGSANDAERT
ATFDIRHTTPLSRQIYLFALLLLIAVVIIFELVLVWSKLAILAMLAYIAG
GIIVGVFASYLLNVGISRGWSRKTYMLIASFIAFLILVCVVLTIKMSSLV
>CT0693 hypothetical protein
MIFRYFIRRLNGYDWLVRKIKISHDALLMFDALTMSFRKMTVR
>CT2153 conserved hypothetical protein
MYGYMSILSGLSKVAPDFSPGFMNNAKQTRALAQFYFGSGMALLEITVIP
LGTVGSGLSAWIAGLESLLEESCLPFRLNDMGTIVEGSADELFAIARKLH
EYCFVEGVARVYTVMKIDDRRDKSVAIGDKVASVEARKAVKPDK
>CT1542 TonB-dependent receptor, putative
MKKTSLIVLLTMLSATAKAEEDPELLALHTWTAPEITVSGKKGDLLQNVT
GKESAVLNPSQMSVYKVINMMPSISQQSVDPYGLADIVNYHESFRFRGVE
ATAGGAPGTTANVEGLPVTGRPGGGATIYDLENFENIAVYSGVMPANIGL
GIGDVGGKIDMEIKRPEEKFGVTFKQSRGSDNFSRTYLRIDTGDLGGGLK
AFASGSTSYAEKWKGYGASNRNNLMFGMTEQFSDKVKLEAFATYSKGNIH
PYKALSYAEIRNLDSAYETDYGNNPAKYDYYGYNKNEFEDWMVMANLEVK
TGEESKLNVKPYYWSDKGYYMETITLKDSNNNPYNRIRRWDIDHDLKGIL
AEYNTKLGKVDLDFGYLYHTQERPGPPTSWKNYSLNGNSLQFSNWAILSN
SSSHELHEPFIEAKYRTGGWLLEGGVKYVNYTLPSILTYNTTGIGDVSYD
DALATDPAIIAAKSAESTKTFSRLFPDVTITRSIGDNSSVHLAYGENYVT
HVDIYPYFISQYATFSAHHITFQQLWDERKMEISRNIELGLKMNGSGWSI
APTIYYAMHNNKQAVLYDPDLGISYPMNNANAKGYGFELEAEYKPSRALK
CYGSFSWNRFYFDQDIRSDAPGNPILAIKGNQVPDAPEFMAKGIVSYTIG
EFTISPIVRYSSARYGDVQHQQKVDGATLFDLDLTWSKAMLGFRNVDCSL
SFINLFDKKYVSLISTSDYKTLNSSYQPGAPFTVVASIALHY
>CT1267 amino acid efflux protein, putative
MIDFQTFFLFFPVALLLALSPGPDNLFVLAQSAQHGRPAGFAVTIGLCTG
LIGHTLAVAFGLAAVVKASALAFTVLKIAGALYLLWLAWQAWRAGGEVGE
SNAHALSGIELYRRGMVMNLTNPKVSLFFLAFLPQFTDPRHGSMTMQFIE
LGALFILATLIVFAGLSMVAGGLGERFRRSPAALRLVNRAAAMIFTGLAI
KLAITER
>CT2072 membrane protein, putative
MMQELVQHIIEHPASSLLVIFNLIVIEGLLSVDNAAVLATMVLDLPQKQR
PAALTYGILGAYLFRGLFLFFAAFLVSAWWLRPFGGLYLLYLVWNWWNNR
GSKDGDAMCTEKRDNRLYRFVSRRIGPFWATVLFVEMMDIAFSIDNVFAA
VAFTDNLILVCTGVFIGILVMRFVAYGFIRLMEEYPFLESCAYIVLAVLG
LRLTFSFFEHLWPGMVILGYLEGHQADIVMSVITVTIFLAPLVTSALFNV
PAKEGVE
>CT0995 conserved hypothetical protein
MKKVLVAGSTGYIGSHVVQEFKNRGYWVRALARDPEKAKKPGPHLEPVVA
DLADELFTADATKPENLAGVCDGIEIVFSSLGMTRPDFVHSSFDVDYKAN
LNIMREAMKAKVRKFVYISVFNAQKMMEIENIQAHEKFVDELRASGLEYA
VVRPTGYFSDMAQFLNMARNGFMFSLGDGQTRSNPIHGADLAKVCVDAAE
GDAKEIDAGGPEIFTYRQVAMMAADVVKKQPFNIELPTWLADGIAAVTGV
INRDIHDIALFAATVSKNDTVAPQYGTHRLREFFEEMAAKGS
>CT0050 conserved hypothetical protein
MRKGNTDLLLFFNGWGMDRRVADWLVSAWPDSAGRDIAVLYDYRNLSIPA
WLGEVMAEARAVDLVAWSLGVWAAVNSELEKIDRAVAINGTATPLDAERG
IPPEIFAGTLKSWNDANRKRFERRMTGGVPPKIVDATRSDRTSADQQAEL
LSLGEAVARFPAESTASWKFSKALIGGRDLIFSAENQRRAWSEAGVRVAE
IAAMPHFPFTHIAGWGELFT
>CT1231 transposase, internal deletion
MTRKKNKTPDIQGELIGQLGYPKHLPIVLPNTGNSRNEHGMSVRDMQAML
LELYQVEVSKALISSVTDAIKLIYLAMQNIAKKWTMPIKNWGAVINQFSI
TFEGRGASGPSMKTTVDTLN
>CT0927 propionyl-CoA carboxylase-related protein
MIARLVDRADFMEAQEHFARNIVIGFARIQGRSVDIVANQPAVMAGELDI
DASDKTARFIRFCNALNIPLVTLVDVPGFLPGVEQEYGGIICHGAKMLFA
YSAATVPKFTVVMRKAYGGAYLAMSLDILHSKREFRPQNKHGLIPL
>CT0470 membrane protein, putative
MHMSDALLSPVVGGAFWAVSAGMIGLSARKIPSESDGSKTVLMGVMGAFV
FAAQMINFTIPGTGYMAMSARGFRSALDSHPPIALMPGDLFAIAGAAALF
ALVRHLF
>CT1933 hypothetical protein
MRVSPFPVSRKAWIGTESIPEILKPELYYFFRLF
>CT1277 conserved hypothetical protein
MPRPQKCRAIAQDPEYRVFGPFCVGKREDEALVMSFDEFEAIRLADVEGL
YQEEAARQMQISRQTFGNILASARKKLGEMLVLGKMLNVKGGNIMISQEE
RIFGCAACGHQWSLPYGIARPVECPSCSSQNIHRMSPGGGFGGGRRGGGK
CRGFRSGLDRGPGHGEGRCQGEGHGNGNGNGNGQGRMRRNQQEGGEV
>CT1150 nitroreductase family protein
MIDFNVDSARCTRCGQCVADCPSRIIVMATGEYPSIAPGKEFSCLRCEHC
LAVCPEAAISILGFSPDGSTPLKGNLPETDQLETLIAGRRSVRRYCPENL
SPELIDKLITVASCAPTGVNARKVRFTVLDDREKTAHFRDEVMERLVRLL
EEEALPESKAYYARFVKVWQKHKIDLVFRDAPHLLIATAPKSLSTPRQDC
VIALTTFELYAQACGIGTLWNGIATWAIEEMLPEMRQRLGIPDDHAFGYA
MLFGKPAVRYARTVQHHPPEIYRVP
>CT2239 porphyrin biosynthesis protein, putative
MDTSMKVFLPLNIRVDNKKILFVGGGKIAHHKIQTIEKYTRDITIVSPEI
IDELKGKGFTEIYKEYDSSDLDGAFLVYASTNVEEVNRRVRDDAEARGIL
VNVVDNRELSGFISPAIIKQGEMTVAVSSNGQNVKKSVEWRNRLREFVSE
TWPEDNQ
>CT2230 toluene transport protein, putative
MIRKTACSVVALLVLLGATPAFATNGMNLEGYGAKSMAMGGTGSAYDTGN
SAVMNNPATLGFMKEGESEIGFGIRGLHPNIKLENGGASDKSDATSFFMP
SMSYMRRDGRFSYGVAMLAQGGMGTDYGDDSPLFSMGTSLKGAAGVSMSG
KDIRSEVGVGRIMFPVAYHLTDQTTLGASFDVVIASMDLQMDMDGQHLAG
MMAGNGGSISGSMASSLGAIIESGADINYARFDFSNHNPFIGKAVGFGTG
LKFGFTHQFSKVVSVGASCHTQTKLADLETDKATLSFAGDFGEQSVTGKI
KVRNFEWPATFAAGIALNPSDKWMIAGDVKYIDWSSVMDKFQMSFIADGS
NPAPFAGQNLDVTMDQKWKDQTVWSLGVQYKATDKLALRAGASFSTNPVP
DSNLNPLFPAITKNHYTCGFGYRVNDATSVSAAFSWAPKVTATNGDQTVI
SHSQTNWALNLTHRL
>CT0228 hypothetical protein
MRVSLRHFRYGASGDLLFGQAARLFAFGIGSARAAKSDKKAM
>CT1307 hypothetical protein
MTRKKEKMDNAATQPGAVINQFSMKFEGRVTL
>CT1785 ATP-binding protein, Mrp/Nbp35 family
MSSIQKSQIEAALGTVMEPDLGRDLMTLGMVENIAVDEAGNVSFTVVLTT
PACPMKEKIKNSCVEAIKAAVPEVGSIDVNMTSKVTSSCSHGGHGNHDGH
GHHGAQGGHGAPQKIDLPNVKNIIAVASGKGGVGKSTVSLNLAVSLAASG
AKVGLIDADLYGPSIPTMVGLQNVKPEVQNQKLMPIEKFGVKMMSIGFLV
DPETALIWRGPMASSAMRQLITDVDWQELDYLIFDLPPGTGDIQLTLVQN
LAISGAVIVTTPQEVALADVAKAVTMFRKVGVPILGLVENMSWYELPDGT
RDYIFGRQGGETFAKTNAITFLGSIPISSSVREGGDNGIPAIIANPDAPT
SQAASRVAGEIARQVSILNANCSMN
>CT1925 transposase
MQSLSSHYHQLLGLPSNWEVENVNLSMSSRQVEIRLAFTGKQGECPICGQ
SCLIYDHAAEQRWRHLDTMQFETILVARLPRCQCKEHGVKTVQAPWAARH
SRFTLLFESFAVELLLHCANIKAASRLLRLNWHTVNQIMRRAVQRGLVRR
KTETVEYLGIDEKSFKAGQHDVTTLTDLGERRVLEVVEHRTTEATKELLA
SLNDSQQAGVKAVSVDMWKPFIHAVQELLPKADLVHDRFHISKYLNEAVD
LVRRKECRQLDKAGDKRLIGSTYVWLRNPENMGEQQQAELGKLMDAEFRT
GKAWSLKNMFRAFWQLGCADAGTFFFEYWSKRIDEVGLVPLTKVKELLQR
HFGNVLTWFKHPITNAVSEGLNSKIQIVKASARGFHRFESYRIRILFYCG
KLNMAIGS
>CT1759 methyltransferase, UbiE/COQ5 family
MKGQSKKQEEADDFGKVSFPKLPCPECFRDGVQGTKKSWASAQNLARQPS
RVVVVRAITKRYKTMSYTMNAAEFNEKIMKGHFRKIYPVIAAQIVERTGV
RSGRCVDLGGGPGMLGVCLAKITSLTVTVVDLMPECVELARENSAEAGVA
ERVDVVQGVAEALPFDDASIDLVVSRGSIFFWEDQQKGLAEVYRVLRPGG
WAWIGGGFGTAELLREIEAAKADDPEWNRKRRERMTQNPPEHFRAILERL
GIDGVVEHQEAGTWIIFRKPAEVEA
>CT1616 Sec-independent protein translocase protein TatE, putative
MFGLGGQELVLILLIVLLLFGAQKLPELAKGLGKGIKEFKKAQNEIEEEF
NKATDDSSSKEKKETKA
>CT1108 hypothetical protein
MEIKREFTNIRFPEYLFTLVHSHLNPDESYPDAE
>CT0171 hypothetical protein
MKISTLYTLFAILATAANIEVQDISIRYYSGQYAIAISVGLGTLAGLLLK
YMLDKRYIFRFKAENPIHDTRTFLLYSMMGAVTTLIFWGFEFGFNHLYHT
KESRYLGAVIGLAIGYASKYQLDKRFVFKQEGAS
>CT0541 CBS domain protein
MEIFFLFLLLILINGLFAMSEMALITAKRSRLAKLAEDGDKAAAAAIKLG
HEPTQFLSTVQIGLTVIGVLNGFIGESSFTPPLAAALELYGGWDPKTSHI
IATVLVVIVITYITIVIGELVPKRLGQTDPEGIARNVARPMQILATATRP
FVRLLSASTDAILRLMGKHEQTQPSVTEEEIHAMLEEGSVAGIIEQQEHE
MVRNVFRLDDRQLGSLMVPRADIIYLDTALPLEENMKRVVESEHSRFPVC
HNGLQSLLGVVNAKQLLAQTIKGGVTNLAEHLQPCVYVPETLTGMELLDH
FRTSGTQMVFVVDEYGEIQGLVTLQDMLEAVTGEFAPLNLEDAWAVQRED
GSWLLDGLIAVPELKDTLGLRAVPEEEKGVYHTLSGMIMWLLGRLPQTGD
ITFWENWRLEVIDMDSKRIDKVLATKIDNQPTEDPKPVA
>CT0524 dihydrolipoamide acetyltransferase, putative
MKTQSQDRWFIWQLSEELEAKIRYREYGPPDSPFTPLLFIHGYGGMIEHW
NDNIPSFDDRYRIYAMDLIGFGQSGKPNVRYSLALFAAQIKAFMHLKKLE
KVTLVGHSMGAASSIIYAHHNPDSVRALVLANPSGLYGDSMDGVAKIFFG
LVGSPLIGEMLFAAFANPVGVSQSLTPTYYNQKKVDLNLINQFSRPLQDR
GAIFSYLSPSKRPHDFMLDGLKPCNYKGDAWLLWGAEDTALPPHKIIPEF
QELLPQAGAYIIPKAGHCIHHDAHETFNNRLAQLLQRLE
>CT0443 glycosyl transferase
MDKKPSILIFIVAYNAESTIENVLMRIPADLLDDFDAEVLVIDDQSSDDT
VLRCAETIQSGKIRFKTNVLVNPENQGYGGNQKVGYQYAIEHDFDCVALL
HGDGQYAPEYLRDLITPVTKGEAEAVFGSRMMTPFGALKGGMPAYKFVGN
KILTLFQNIMLKTSLSEFHSGYRAYSVKALKQIPFHLNTPDFHFDTEIII
QLILWGFRIAERPIPTYYGDEICYVNGLKYALDVVITTTKARMQRLGLFH
ECKYEVIHPHNSNGYPEAKLSIKSPSGNRI
>CT1273 hypothetical protein
MKPIRIDDYCFDRIHLWLQYDPFVPEFIEMVIEVFYPANRKANGLVMFNH
GFLIGNDLLWYPKKIAGMLLDDNPLFGINPSAYYNYSEAIVEKNWAMAFV
SASHAQVDWMPWTDIGGNPRVGQETFAAASYLIRYGLTEFFWLAESRGHN
SKNFDAQLASKAKFLVSNNVIFAGHSVGGAHAQAAACGFDTLSQIGRQQC
RPFNPVIYNRELLPTFSMPMTDWPEADRANPVGLLMLSPVDQHVPIFMPG
MSDYRAALASRQMPMAMVVGQCDCACLDMSQPPAWSGTPGVESQFSQLTG
DGSWVVASQVERGSHCGYLTNKSPLCSVAELPSQCKRCPGVEVYKPMGAE
TAFTAEMLGKFINLYPNGGGFEGGFNDWIGSEFITWLNRQSPCCDLNLMP
MPGGGYIDNVPPA
>CT2257 membrane protein, putative
MLLSPLNWLIHLSSSLEWGVALVMLYRYGQFIGRKDVRRFALFMLPHWIG
SWFVLLYHLSGDAIMRFLEISEAINLVGSIALLYATLKILKGDEKRESKP
AKAWMGSLFGGVILVAGGSTPYSFMMGSSWFDAVLQVSSMVYLTFLVLLL
KVRKKDPEVFSGLTVAGFWFVLVFISVTVVCMYIAIHVLGYPSLSHDDFL
HGFAESLLTVSNLMIVIGIHKQRKRAEERLRA
>CT0574 hypothetical protein
MERLTEAFESQVMLIAELVELLRKREEKRGE
>CT0714 sulfate transporter family protein
MSPESRTLPGIIRSGDPSICRPPVAGNCRSFTSDHFQQMFKPKLFTVLPE
LTKEQLGRDIVSGILVGIVALPLGIAFAIASGVSPEKGLISAVIGGFLIS
FLGGSRVQIGGPTGAFIVILYGIVQQYGLNGLMIATIMAGVILIIMGLSH
LGSLIKFIPYPVVVGFTSGIAIIIFSSEVKDFLGLHMASVPADFIDKWAA
YFKALPTASPETILVGAVALLIIVFWPKVSRKIPGPLIALIVTTVAVHML
HLQVDTIETRFGDISASIPAPSFPSLDMATIRHLIMPATTIALLGAIEAL
LSAVVADGMIGARHKSNMELVAQGVANIVSPLFGGIPVTGAIARTATNIK
NGGRTPIAGIVHAITLLGIMLVFGSWAKLIPMPTLAAILIVVAWNMSEHH
VFRQLLKSPKSDVAVLLTTFGLTVIFDLTIAIEVGMLLSVILFMKRMASL
ANVGVITGELKDEEDEADPNAIVNRSIPEGVDVFEISGPLFFGAASKFKD
AMHVVEKAPSIRILRMRKVMSIDATGLNMLKELFNDCRKSGTTLILSGVH
TQPLFAMQQYGLADEIGEENIFGNIDDALDRARSLLGLPVQGRPAGFVAS
VKREKELQAKIESKGSGK
>CT0109 hypothetical protein
MKTITRRLFAAVLVPGLLLLSACGKKDSSAPDSAGVEHAAPAIAGPFTGV
LTMKTTIPKAGTSDMKLYIGPKGMRAESKTNIGAHGGEVSMTILSLKDSP
DKIYMINGATGACMELDVSKVKKQPGGDPYKNAKIENLGRERVNGYDCNH
VRISWPDKQNTVDLWVSKDILDYFAYAKMQGSDDQTDTQLAEKLRAAGLD
GFPVKTLLSPEGVVTELVKAERTTPDDKLFEVPANCTKMEIPAIPASPQG
MSKEDVKKMQDWARKMQQQMPKQ
>CT0103 PilB-related protein
MRRVTTLLVMMLTVLSACQASPKASPTKPGNAMSNNHPQNPYYSRTDTTR
LDLPNEVWKKVLSPELYAVARLAETERAFTGKYWNYEGLGTYYCAACGNT
LFRSDAKFASTCGWPSFFETVRPGSVAYREDTSFGMVRTEVLCGRCGAHL
GHVFDDGPPPTGKRFCMNSICLDFEPDQPETAKPQ
>CT1141 hypothetical protein
MTAMYFDSLVILYMVVGFSFCDNLYKKILNKCANDFDFYVTLLV
>CT1173 conserved hypothetical protein
MISHEENLKNAGILLPEIPSAAGLYQPAVRIGDLIYTSGQLPLVGGKLME
PGGRGRVSEANEQEATKAARVAALNAVAVLRSVTGNLDTVARIVKLTIYV
SSADGFTNQHKVANGASELVYKIFGESGRHVRVAVGVLALPLDASVEVEM
IAYCPLIRH
>CT1705 zinc protease, putative
MPDFSYEETKLTTGRHYPIHLLLFVATLFTTTWAGAIWTGIPPFTANKAG
FIEALKTGVPFSLALLAFLTVHEFGHFFATVKHRIRATLPYYIPLPPLPF
LMSIGTLGAIIRIKEPIRSRRALFDIGAAGPLAGFTVALGLLIYGFLHLP
LAEYIYSVHPDYRAMGGIPPAPADTLYLGKNLLFILLDALIQPKGLPPMY
ELYHYPFLFAGWLACFVTALNLLPVGQLDGGHVIYAMFGSEGHRKISKFF
LAFITVIGAPSFILGVLELINPALVFPVPELLLRWSWPGWIIWAIILRRF
LGTKHPQAGSDHPLPPGRMTAGWLCIAVFILTFTPVPFAIS
>CT1292 hypothetical protein
MNPPIYESEHGLTTLRSMRSVWQVIFLCLKRGVEF
>CT2122 restriction endonuclease-related protein
MLAVKTTCKDRWRQVLNEANRIGKKHLLTVQQGISLNQFREMRAHDVQLV
VPADIIKLYHKDIRSEIMTLEGFLGEVKTLVEKPRKRS
>CT0183 ABC-type drug export system, membrane protein
MTASQTSATSTAPAGRPNGWADKFDPTGFLRDTAAIVATELAKLRRDPSE
ILSRSIQPALWLLVFGQVFGRMRAIPTGNLDYFSFLAPGILAQSVLFIAI
FYGINVIWEKDLGILQKLLASPAPRSSLVFGKAVSAGMRAIAQAAIVYLM
ALVAGVDLNWSPLALCGVLVAIVLGAALFSTFSLMIACLVRSRERFMGIG
QVMTMPLFFASNAIYPISIMPTWLKIIAHINPLTYQVDLLRALMVTAGTS
TFGIGTDFLMLTGMLIALIVVTSKLYPRIVQ
>CT1522.1 hypothetical protein
MKIRIHELAAHELDEAIEWHELQSRDLGKHFRRIVREQVKTLARNPIWYL
RKSDDIYKAFIPKFPYKNIVHRRRK
>CT1352 hypothetical protein
MLRSKEEWQETAESVLPPEERYVDRNRMITARYAGWYLENPGTLKWAGMA
AFASRQVGLAIMAADLMTAPERDGSGNPLLALHRFGVDWFMRADFEQIRR
GNNNIYRDIAWAHAAYVGGGMAELEACASEPEDTLLVKGFGMIDRGRALC
RRDAGSPAGERLIWEGNICLLRHEQVDVLQPIFDTLSVGGRIMASFGSEL
DFSGALFPDSRFRTSFSLFYGYLETLTGLKSVANPDDRWRWVEQSVIPSW
QAAERQMSAPCPTRNALQKMAACEQ
>CT2018 dolichol-phosphate mannosyltransferase
MLIVKTLVIIPTYNEAENIRPLVEDILDRYPEGLELLVIDDSSPDGTAGI
VKAIMKNEPRVMLLSRPSKLGLGTAYLTGFRYALERGYERVIEMDADFSH
DPASIASLIKAMDGADMVIGSRYMNNTVNVVNWPLSRLILSKSASIYSRW
ITGMPVSDPTSGFKCISAKALRFIALDRVRSQGYSFQIEIDFRVWKKNLV
IHEVPIIFVDRSVGKSKMTRKNIVEAVWMVWWLKFLSIIGRL
>CT1184 hypothetical protein
MDLSSFLPFRDEMVKVYHCLTTNATHTSEKPVFSELKVRRYSCPLEDVSN
FITNKIESWVGWELKNQKTAVGGMKTIRAEVSSFALLGMKIDVTFGLVEE
TDINGRKITTVNGKAHTRIDSKGDLGESRRMLRMMLASLDFEFRPQIVHE
DEYVHRSIDPKNSNAAFQQLFDESTLEHRPSTPKAKSIELKKPVKKQIEF
KSSKNSGETVKAPISSQAIPVATNGAQTTTAPDSDVEEVKKPAKPKITVI
SLKKNS
>CT1770 hypothetical protein
MRKTTGTLFVLLTLVTLILQGCYSFSGGALPPHLHTVAVPLFDDTTQAGI
AEFREGITRSLINKIESQSTLSIEPDPSRADAVLKGAIVSYSDEPSQLGS
ATERAVTNRITIVLQADFDDQVKNSKLFSQTFVGFADYQTGNYTAQQTAI
QSAYNMALDDLFNQMISNW
>CT1458 hypothetical protein
MPALPPPPQYPPTRCSRSRAQRYLAAFRFCCFILIPTPMKTEVKVIIISG
IVTIIVAIINGNDNIHAGGNVTMHGAKK
>CT1642 hypothetical protein
MMMLTSCAPSALGKLFFEAGDSCLDRFGSYDRG
>CT1836 hypothetical protein
MYFSFHDSIVYCWIQASSAFRKVCKKLAPANSALRPTWGEGRKQQAIKAG
>CT0211 acyltransferase, HtrB/MsbB family
MSIERNRRTKRVKRTMRSASNTVTYGAVMLLGALVRKLSRKQTRRLACVI
GDFMHRGIGLRRDLVYRNLSLTFPEKSSEEIGRIATAMYRNVAGTLLEVL
RLPLICNRDDAAALVDIKGDEAFWEWHRSGTTGAVMVSAHYGNWELMAMA
FGLLICPVTIIVKRLRNERVDRKMNEYRTMRGNSVVYPKQSVREGLRLLQ
SGGTLAILGDQSDPDEVNFGEFLGRRTTMFHGAAFFALRANVPLFVPTCT
SNGDGRYTINITRVNTADLTFNKADIATLAVRYTSAIEAQIRRRPEEWFW
LHDRWKRGRD
>CT0623 hypothetical protein
MKNDRSIDSTILTLAGAAKHVARHLANVMNRIPEL
>CT1691 hypothetical protein
MSENPDIEFNKNLTLGQRMADCILLNLVLSTLAAIQAPVIMMSQNRQEER
DRQRAMYDYKVNLKAELEIRQLHQNVVHLRSKQWERLVEIQAIQMELIHE
LRGRK
>CT1804 hypothetical protein
MSSIRITQPNGEHTMKKMLSLAAMFAVLAYASPASAELKLSGDASVRLRD
VSYFGDADQFSFTGSADDDVVYQYRVRLNAAADLGNGYFFKALVMNEDRN
YAGGWQSVRHGNTETISLDISNFYFGRMLENSHWMVGRLPLNSFDNPIFD
LTLYPAQPLANPVYNINFDRVFGGNYGVKLGNGMLNATLCVLDNDSHNNT
SADGDGLFNDGYALHLDYKVNVGNVTLEPQFLSVLTNSDIWYQDITGRVT
TLAYKVTPYTFGALVGVPAGNAKLSFGGFYTTCDDTTPNGGPHVKYDGYL
LRVKGEIGNFMAWYDYNHTTVKPGGNDIKLNNHFVWAQYKIPVYSSAMGS
VTLQPTLRYLASKRDDGFNNYSGERLRSELWATVTF
>CT1801 hypothetical protein
MNKSTSLVSAAMLGALCATAPLSTASAESAIKPTFDTLFEPLLADPMEPR
IAVMPKLNKKQLQLDIGTSADLYQNSSKTFAVGIDFATWSLLNRTSNFKF
PVDCIDYMFGINTTFRHQFKDKLLSFDEASVRVRLSHISAHFEDGHTDDH
GNWLNPGDSPFGIPFTYSREFVNVTGALSAPGRRVYLGYQYLYHTLPDEI
SPSSFQAGVEIGLPANAYVAADFKLLPKWDWNEGKTDGYRGTWNLQAGMR
LTSIGLKNVRVAANYFSGMSRQGMYFYKPESYTTLGMIVDL
>CT1692.1 major facilitator family transporter
MAALGYFVDIYDLVLFSIVRVPSLKAFGLQGQELIDYGVFLLNMQMIGML
LGGILWGWLGDVKGRLKIMFGSILIYSLANIANGFAGSLETYAALRFIAG
VGLAGELGAGITLVSEILHTKVRGYGTMLVASIGVTGAILANAVATHYDW
RTAFFIGGALGLLLLVARFKVSESGMFQMMENKTAVSKGNLLALFTSRDR
FFRYLNSILIGVPIWFVVGVLITFSPEFATALGVKGAVSAGNAVMFCYLG
LVFGDLSSGLLSQVLKSRKKVVLMFLLLNIVSIAIYFMQRGATPTAFYWV
SFVLGFSGGYWAVFVTVAAEQFGTNLRATVATTVPNLVRGMVVPITMLFQ
FTRGHFGLEGGAIIVGAICVVAAFASLAALEETFHKDLDYFEEFM
>CT1617 hypothetical protein
MQNGSNSIHFDTVVMRPSCRKKVSAKALGFRYWSQTQQPAGKRRQ
>CT0914 hypothetical protein
MAGDSGLSQSFAKAFQFVVLHSGSFLVPYMLL
>CT0065 conserved hypothetical protein
MSDTILALMLFAPPAGGATPNPFVQLVPLVLIFVVFYFFMIRPQQKKQKE
RESLLNDIKRGDRVVTIGGIHGTVAGIETEKKTVLVQVADNVKIKFERSA
IANIEKQETGDKLASKE
>CT2058 sodium:solute symporter family protein
MQPLDYAIIILFLAGNMMLGLWQGRSNKQTSDYFLGGHKLPWFAVMLSIV
ATETSVLTFVSVPGLAYRGDWSFLQLAFGYIVGRILVSFILLPTYFKHGV
TSIYEVIGMRFGHGIQKTASVIFLITRILGDGIRFLATGVVVQAVTGWPL
SLSVLVIGIVTLVYTISGGLKTVVWLDSIQFGLYLGGGVIAIAFILARLD
APLPDLLAPLLAAGKLKIIDTDPHIFTNPLSFVSAFSGGILLSLCSHGVD
YLMVQRVLGCDGLGSARKAMIASGVFVLFQFALFLLAGSLIYVFFHGAPL
VKDREFTSFIVRALPAGLKGLLLAGILSAAMSTLASSINSLAASTVTDLI
KGKASLSTSRLISVAWAAVLIGIALVFNENDKAIVMLGLEIASFTYGGLL
GLFLLSKSSRNFHSTSLIAGLLASMAVVFLLKLLGLAWTWYIAVSVTTNI
LTTAGVEALLPDRESFARE
>CT0628 hypothetical protein
MNSGNFGPLDEDLLIQSIKTSFPEAHIALKFLSCLR
>CT0221 heptosyltransferase
MDHIRTHNTHHARKSADHLCVITDGGVSAIDRVSQRNIRVRLGAEHHDQL
FCQAHGLKRLLPGKSIDLHKDKKWRTLICSITKNPETSMTPDAARKNSPE
MKPRKKKKRQFRQLFARSLQRLARTRSGSAEFSGPLRSVAILAQEKLGDC
VLLTPLVRNLRQAFPDLEIHLITFSRASANFFMNDPQVTAVHLVKKQPRR
YFREVLSRKFDLLFNTKDHPSTWFLLQSALIRARFKVGHNNPFHEGLYDR
LLDTEFHAHMAVKNCALLPLLGVTADTEACRPSLPAMPVSNEIWQLLSRL
AEDIRPIGVNISAGEPNRLWTEAKWRALLERFPGERFVVLSGPDDLDAKR
RLEEQCPNAVASPPTRNLYEASCIVAKLRLLVTPDTSMVHVASATGTPVA
GLYREAPQDISRFGPYAIPYEIVISPTGEVSGIKPESVADAVRRLMARAI
GREA
>CT2105 ABC transporter, ATP-binding protein
MSEPLIVCEQLCVNLGGAKILQGLSLSVYEGDFLAVLGPNGGGKTTLLKV
ILGLVKPTTGTVRVFGKEPGYASRRIGYVPQRLDFDRTFPISAMEVVLMG
RLSRKRLLQRYGHEDRRKALEALETTGLAELAQRRIGALSGGELQRVLIA
RALAGEPELLLLDEPTASVDPDMKTTIYDLLDQLKKSHTIVLVTHDTGTI
GRHVSRIACLNCTLDMHEPGSTLGRSALDKLYGYPVDVVEHRAPQGHATH
QNHRHA
>CT1437 6-pyruvoyl tetrahydrobiopterin synthase, putative
MPSMIDLSGYPDSSVFYGKIYVHFVNISVNTTLRDTMLISRKIEIDYGHT
LPNSFTFCNQLHGHRGVIVATVEGPVIDRAGDAEEGMVIDFKFLRQIMDE
HIHDQLDHGFAVWKEDKEDLEFILKRNTRVLVTDAPPTAECLARWAFNQI
SGKLPEGVILKNLRWYETPNNWADYTGG
>CT1260 ferredoxin, 4Fe-4S
MAHRITDECTYCAACEPECPVSAISAGDSIYVIDENVCVDCIGYHDEPAC
VAVCPVDCIIKV
>CT1337 MFS transporter family protein
MGMNDTSQKARIFSWLLFDFANTSFSVMMVTFAFPLYFKNIICEGEPKGD
ALWGASVSISMLLVALISPVLGAQADYSGRRKRFLFAFTLISVLATALLS
FSGPGMVLFAAVLFILANIGFEGGLVFYDAWLPEITSPRSTGRVSGYGFA
MGYLGAFAILLINLPLLSKGIVPANIPNLKLSFLIVALFFAVFSAPIFVM
LRDTKGSVGDSSSGERRRERGSSFMHSIKEVGYTIRHIMSYPDLARFLLA
YFFYNDAILTVIAFSSIYAQNTLGFTTGELITFFMTVQTTAILGSVVFGF
VTDKIGPKRTIVITLFIWFAVILLAILSGSKETFIMTGLLAGMSMGSSQA
ASRSLMARLTPKEHVTEFFGFYDGSFGKASAIIGPLVFGVVSAQVGSQKV
ALASLLVFFCLGLLIITGVRTRATAEASPEASRIE
>CT0487 hypothetical protein
MKSFIAAIRNGETGFVVHNSVFLPFHCEIIRIWIGKEMSLLSVPDEITDL
GDADVIFIREGESYTNLVFRKWGDLSRELGNHKGHIILRAAEKGDDIFKS
ENLHYIKIGFHDHRKELSFEIINNPFDL
>CT0931 Nudix/MutT family protein
MLMPKATVGAIIHPSESERSTILLTRRNVNPFKDHWCLPGGHIDDYESVE
NAVVREVKEETNLDFAPETFVGWFEEIFPEHNFHAVALVFAGTGSGALQS
QPEEVADMAWFALDDALSMPIAFTHNLVLQHYARSLEK
>CT0640 conserved hypothetical protein
MVPAIFLSACASQKDLSYVQGEVSQLKQESTVIKQQSAGSYSEMTQYREE
IASLNGKIDELQYDYRTSKKRLDMEDSLLVRKVDDLENRIARIEQYLGIE
STGKDKSLVPKVLPPPPSSSMSSNSSGSTSAQSKEEASAATGEALLSEGL
IKMKRGDYAGARESFNAFMTGNPKSPKVADAQFFLAETYYNEKWYEKAIL
EYQTVIARYTKSPKRPAALYKQGLSFAKIGDEANAKARYKDVLNLYPQSP
EAKLAQKNLDKK
>CT2284 DNA polymerase family B protein
MYSILLSGTMNNIVNFSFENELLFGKDPEENIVGAYQLNDSQIRLFFRKT
DVVTHRDEPFYPFFFLSDNELLEGFIPFNKEKFWLIPLDGTNYYRYLAIF
RSWKNYRAALDFVNSSSQDSSSRSDSQNSSSFHSHLTYSPGDAISQYLMQ
TGKTLFKGMLFDDLHRLQLDIETYYQPDKKRKGKGIGEDPIIIVSLSDNR
SWEHVIHSKGRSEKDLLEELVHIIRQKDPDVIEGHNIFGFDLPYLQRRCE
LNGVRFAIGRNGLVPRSYPATIRFAERSADFQFCDIPGRHVIDTYFLVQN
YDISKHTLPSYGLKAVAKFFGFASPNRTYVDYKNIASTWDDNPETLLAYA
LDDVRETRELAALLSGSNFSMTSMVPYSYANTARLGPAAKIEALIIREYL
KRKVSIPRPSIGQQQTGGFTEVFIKGILGPIVYADVESLYPSIMLTFDVC
PKSDSLKVFPTILKDLKELRFAAKKRAEEEKERGNASLSYNFDAMQSSFK
IIINAMYGYLGFGNGMFNDFEEADRVTTKGQEIAKKMIREFESRGAKVIE
VDTDGIFLVPPPYVVTEEEERQLVCEVSSTMPQGIRIGFDGRFKKMISYM
KKNYALLDYNEKLKLKGSAFVSRSGEKFGRDFVREGFILLLNDDIQGLHD
LYVKYRNDLINHRLHISDFSRTESMKTTLDQYVSDVRAGKRSKSITYEIA
LRQGLEIAKGDRITYYVAGTGNPASFVENGRLAEEWNKEQPDENTGFYLK
RLDEYAQKFLPFFKPQDFSSIFSADTLFAFSPEGIKVIKEIRHHETGDLY
RENSPF
>CT0139 protease, putative
MKILAIECTHGFASAAASNGERMVERRLAEWQKTAESLVPLVMQVMDEAG
LTAAELDGVAVSSGPGSFTALRIGLSVAKGIAFGADLPLVPVPTLLAMAD
AAAKHTATKYIVPVIPSRAGEYFYSMFALKDGALSEIESSRCLVSELPER
IAVLTGSLVMVSRPVDLLAEQAPSLAPYLFDASFFSAATLLSHACKSLAE
GAAGTATGTLPDYRQAFVPAQRQG
>CT0709 hypothetical protein
MEQTPLRPLGEVMAMIEALGHEVTYAYDDLVFINHNDFLLQFDAAEPNAL
ALFFNTECNAAEADHVAARMIPEGIEKGLIIRRKGTYTMTEAESDNLQIT
FNP
>CT1729 DNA methylase, putative
MMARKNKTESKRPIESYEHRDKERVNNPPVGLVTPDTDPDAGQKKKTYAY
DPHLDPQLVWTGKAEHTSFEVPTVSLHVHERIDPRTIIEAVRKRPSPHPS
PTGRGRQGEGMAQLPLFEEERKEPLREAVEFYRHAHGWSNRLIAGDSLLV
MNSLLEKEGMAGKVQMIYIDPPYGIKYGSNFQPFVNKRDVKDGKDEDLTA
EPEQIRAFRDTWELGIHSYLTYLRDRLLLARELLTESGSIFVQISDENVH
HVRELMDEVFGARNFQRVITIKKRSPQPDKFLSGVADYLIWFSKDRDRSK
YNQLYWLSEGEYNGNEFVTSDLTSSHEYHRTPFEHEGQVFSPGSRYWSTS
IEGLTNLARSGRLVVSGSTLRYKRFNSDWPCQLIGNIWDDVVFAPFLEDK
LYAVQTSVKILQRCLLMTTDPGDLVFDPTCGSGTTAYVAEQWGRRWITCD
TSRVALTLARQRLMTAVFDYYELAYPSPHPSPSGRGGEENPLPAGEGGRR
PGEGTMGVWNGFKYKTVPHVTLKSIANNPEIDGIYARWQAKLEPIRAQLN
ALLFPSPQPSPKGRGSKEEEAKGRGREENPLLPGEGGRRPGEGFEEWQIP
REPDAGWPEKAKALLADWWQLRRQRQEAIDAAIARHAPQETLYDQPFVDK
KKTRVTGPFTVEAVPAPAVKSVDEILSSPQPSLKGRGGEENPLLPGEGGR
RPGEGKKPLDPEFLEFARQLRKEQTDAEQLLWFLLRDRRLAGLKFRRQHP
VEPYVVDFYCHEARLAVELDGGQHNEPDERARDAKREAFLEGKGIRILRF
WNNDVLQNTEGVLQAIYDALVSLTPALSQGERELWADASIARSGETLRQG
EWRDELLRAGIRGKAGQHIRFARLEPLPGCRWLHADGETRPSDEGADRIR
ETGPAYSPMRVVVSFGPDHAPLEQRQVQQAWEEARMLDPKPKLLVFAAFQ
FDPEAAKDIDEMKPELAGMQFLKVQMNADLLTDDLKKKRATNESFWLIGQ
PDVEVRKEKDGRYVVEVHGFDYYNTKNGGVESGGEDKIAVWLLDTDYDSR
SLYPRQVFFPMTGEKDGWARLAKNLKAEIDEALIEAYRGTVSLPFAPGAH
KRIAVKIVDDRGIESLKVMEVE
>CT0525 conserved hypothetical protein
MLQVSRKFEYGLHAVTYLAMKGPEQVVTVKELAAEIGFSQVFLAKAMQSL
NKAGITRSVQGVKGGYTLARPAEQITVADIGVAIEGEPHLMRCSQENCQC
EIESNCTHKGYMLNLQKRIYDLLAETTVAVLLERYL
>CT1843 hypothetical protein
MVLDNRVAVAAGDWVCRVLHPASPETQTVFLDELAEAMAGSGKAEVVTVA
EAVEIISAVSSPYLSPRWLAANISNATRPGRGTAFSARLSEPGSIGSVCL
GTL
>CT0925 hypothetical protein
MHLSKKILHQITSCLTIQSCSPGTPSAPGRLYSLLELSQRLTAFDGFAGG
GQDGLDGSGGVGGDVDAAHLFIRQIVRLWFCALEPGGGSGCRDWMVETGC
GNEAGVRPSSPSITSTFTTYFFPIYGQS
>CT1984 hypothetical protein
MASFCRAVSRDGEEGAAVIMERYFFPTSTSTSSAHQRRFCMYSMLPDFC
>CT0533 hypothetical protein
MKRSFSEAEPESIEYAFSMEHQGFLVIADNSV
>CT1209 kinase, putative
MKARLPLLVGVTGGIGSGKSTVCAMLAEMGCELFEADRIAKELQVEDPEV
IRGIEKLFGPDVYSRDASGKLLIDRKAIAAIVFSEPEKLAALNRLIHPKV
REAFVNEVKRCAREGKRILCKEAAILFEAGADRDLDRIIVVAANDGLRLA
RAVARGLACEEARKRMQAQWPQEKLVERAHYVIFNDGTLDELRSQVEQVY
QSLLTVVE
>CT0682 hypothetical protein
MSIDWCPGIREACGYWRDAPMLQQTFEAMERNLEQNNDACIDCAKTVVEV
VCRVVVESFHTQQAPLKLTEETPSLSNWLTAAIRALKLGDVRDDRFKKLV
SSHHKLADALNDLRNKAGPASHGKDPYLARLAEHHRRSAVLAADAIVAFL
PQAYLDAQLDPISSREPWERFAADNALIDAHVGLAVDAEDGDTPTLRFLL
PSGDEIPINIEVSRLLYLLDRDAYVEALNAARGAPAPAAEIVEGQGESA
>CT0291 hypothetical protein
MFFKLYLLHKQIMNTMKKEVAPDKFDLYLLNVDEFKALNQQKNELKEKIA
EMQRQFEESIQPYEAELNAIIKKLESTLAKIEGGPVAKPATAKFGRGKLG
ISIKQLLQANPDKAFKPREIAAALQTKSTAVSLWFNKYGNQDPEIERIPV
GEGGKRFIYQIKK
>CT1968 hypothetical protein
MITLPKALVDKFNREFQEPRLSANLELVELLRRIGKQHNTSPGEVAIDWT
LRHPAVTAAIVGGRTAVQVEDTVRAASLALSEQKISEIEAFLASMPA
>CT1908 3-oxoadipate enol-lactonase, putative
MLTFNGAAGGDAGNVLLLHAFPVSSQMWEPQLAPLAESGYRVIAPAVYGF
ESTPSRPGWSMDDYAHDLARLMEALGWKSATIVGLSMGGYQAMAFYRLYP
ELTKSLVLCDTRANADTPQAFSVRQEFRKAVMEKGAEEAAARMVPNFFAK
ETYESNPSLVEKTRESIVRQAPEEISEAMRAIAEREDSTEMLTEITCPTL
IVNGMEDIVTTPEIAATMHALIPGSKLELIPDAGHLSNLDQPAIFNGILL
EHLRSL
>CT0894 hypothetical protein
MTPVSANAPEELREAVDLRGCLHYLCISAMISQ
>CT1151 DegT/DnrJ/EryC1/StrS family protein
MAGAELIGKEELAQIQELFSGEKTTLYRYAPGNYKAREFEEKFAAYMGVK
YAHAVSSGTAAIHCALAAAGVGPGDEVITTAWTFIAPVEAAAALGAVPVP
VEIDETYHLDPAEVEKAITPKTKAVVAIPMWAAPKMDEIAEICERRGVTL
IEDAAQSLGATYKGRKLGTIGKVGSFSFDAGKTLHVGEGGMVITDDKDIY
DRVAEFSDHGHMHLPGLLRGKDPRRAKGLNYRLSEVTAAIGVAQLAKIDY
ILSKARENKYKIKDRISHLSNLTMRPFTDEAGAQGDTLIFKVRDPKAALE
FEAHLMEHGFGTKILPEAIDWHYAAVWGHLLKAYDRYRDANLEELWPKTG
QLLQSSICLNIPVLMDDATIDKLVAAIISGAEKIG
>CT0537 hypothetical protein
MRHELDNPHAHATNNPCIRCLRHADRYARAVVSMLEVFASDKAEAKHNLF
LKEGLCDIDKTLRSNDKRDTLRERPRNQTSSAMTLSPEKRARVMQGEILI
DLNWLPDGVIGAKGSVFVEAEPPVVWRMLTDYDHLHETMPKVISSRLLET
NNQTRIIAQSGKSGIFIFEKTVNFTLKVEEVFPEHLWFSQIGGDFQVYEG
EWQLEAVEGKNGHATLLSYQAEIKPDFFAPQFVVSFVQSQDLPTILRAIR
SYCEARAKG
>CT1547 hypothetical protein
MRWSSASQKASTLPVDFSNEPLDIVLHRGHRKISPIMVPAAHSGSNKTLH
YEKSGKVTGKNRRPNRS
>CT0637 hypothetical protein
MQVQGMMSFGADQARMTTISYSEEKPFDLGHDETAGSKNRWAHFVEK
>CT2212 acetyltransferase, GNAT family
MPAADTCVRNARLADASAISRITEGYAGEGIMLKRSVENIIEHIRDFFVA
DYKGQVIGCCAIAFYTVKLAEIRSLAVLEEFRNKGIGRLLVEKAEAVLSE
EGVNEVFVLTLNSGFFKRMGYKEIEKEYFPQKIWRDCTNCPKRMACDEIA
MVKTL
>CT2045 hydroxyacylglutathione hydrolase, putative
MSVQVEQIRTGGDRNFGYLCADKATGEAFAVDPSNSPKVLVDAAARKGWQ
LVRAFCTHGHADHTNGNEEFERLTGIRVLLFGDRDARLGIEVMHGASFPL
GEGVVEIIHTPGHTLDSICLLAGDALFTGDTLFVGKVGGTWSEADARLEY
RSLHERLMVLPAGTNVFPGHDYGTAPVSTIGHEKTTNPFLLQPDAESFID
LKNNWSAYKKAHGIS
>CT1668 hypothetical protein
MYWNLDLARYIADAPWPVTKDELIDYANRTGAPQQVIENLENLPDSDELY
ETLEEIWPDYPTDEDFGYSDEEPLN
>CT1972 CRISPR-associated protein, CT1972 family
MHNRFNLIDEPWIPAIGKGLVSLADIFSDPRIPALGGNPVQKIALTKLLL
AIGQAACTPETTEALEQLDAETFRRACRAYLEKWRDRFWLFGDKPFLQMP
AILDWMESQRAAGILSETENAKQIGPGFYPSLPSENDSILSQFQTLKAQT
DAEKALFIVSVMNFAFGGTQINKNIYPSEEKVKGKGKPAKPGPSLGRNGY
LHTFLFGSTIIDTLIMNLLSQEEIDNLPFWEKGIGTPPWENMPVSRECDA
ALSLKKSYMGTLVSLSRFVLLHDDGIYYIDGLPYPSHQEGWLEPSMTIDN
QQNPPKAILVNPEKRPWRELVSILAVFDSNKNNKFVCLFIKYGLSRWPKR
YNKPGDKIGVWSGGLQVSFQTGEQYAKATNDFVESSVELDPDMWNNLWYD
KFFGEISILEIMANKVKNGVINYYDSFEPKKEKKPKERASTIMGKKAVEL
FWQLCERRFPELVDACGEPDKLPAIHEAINLLALQSYDAYCPKETARQID
AWAECQKDLKKFIRELMEADRRVGGVPSEF
>CT1483 hypothetical protein
MTPDNERQMRAIILYDSRSTGGSTDKLIDSIGQQLAETGAYVEKARCKAT
ADYSFVREFDVVILGAPVYYLVVSSQLLGALVQSNLKRYLRNKSVALFVT
CGSPEPMAQTLYLPQLKIHLIRNRILAEKVIHPHQIADEEIIADFVDEID
EGYRRASRPRSGFRNHKIQWSDEARELVNNLPPFFADKIKGALYAYAEAN
GITYITPEVLDAARSSPMGM
>CT0291.1 MiaB-like tRNA modifying enzyme
MKQKSVAAVTLGCKVNYAETSSIVDALVSQGWQLNAIDDGADVLIIHTCA
VTGEAERKSRQQIRKIIRNHPGSRVGVIGCYAQLDPKRIADIKGVSFVLG
TTDKFEIAWYDGESLPNDSEPLVKVSPVDKAITAHPACSMLSQPEKGRTR
AFLKIQDGCSFGCAYCSIPLARGRSRSVSLSTVLDRAQKIADAGYREIVL
TGINIGDYQDGDTRLSGLLRRLETIDVSRIRISSVEPQLLDDELIDIVAA
SGKIMPHFHLPLQSGSDTVLRAMRRHYDTAFYRERLMKALSLIRGCAIGA
DVMVGYPGESERDFEEMCRFIEELPVAYLHVFTCSPRPGTKLFAEIAEKK
LIRIPSAESSSRAARLGVIGERIERRFAEAFIGSTLKVLFEEASALPGGA
VRWSGYSEHYLRVSVDTSAGELRGQVREVIIDGFGEGLQLHGRLLS
>CT0578 ribonucleotide reductase family protein
MLRNTDGSKVFEMNNVEVPVSWSQVAADILAQKYFRKTGVPQRDAEGNLV
IGADGLPVTGSENSIKQVAHRMVGCWQDWGKRYNYFDSDEDAQAFYDEVV
YMLLAQMAAPNSPQWFNTGLQFAYGINGPAQGHFYVDPETGEIRESEDAY
SRPQAHACFIQSVKDDLVNEGGIFDLAVREARVFKFGSGSGTNYSNLRGS
GEKLSGGGSSSGLMSFLKIFDSAAGAIKSGGTTRRAAKMVIVDIDHPDVE
KFIEWKAKEEDKVASMVAGSKICSRFLKAIVDEALKGGTDRQENEMLNTL
IKNALHRGVPMSYILRVLALVEQGYTTLDFEEYDTHYESEAYQTVGGQNS
NNSVRVTNEFMKAVQNDEMWVLRERTTGKEARAVRARDLWEKIVMSAWKC
ADPGLQFDSTINEWHTCPKSGRINASNPCSEYMFLDDTACNLASLNLAHF
LDEETGKIKITELQHASALWTVVLEISVLMAHFPSKDIARLSYEFRTLGL
GFANLGRVLMVLGIPYDSPRALAIAGGIAAIMTGQAYVTSADMAKELGAF
ARYRENSDDMLRVIRNHSRAARNSSEEEYEGLVVKPRGIDSEYCPKELFE
AAGKVWDEALKKGKKYGFRNAQVSVIAPTGTIGLVMDCDTTGIEPEFAIV
KFKKLAGGGYFKIVNQSVHKALARLGYSDKQIEEIEKYCKGHGTLRGCPG
INHQWLKSRGFTDEKIEAVEKQLESVFDIRFAFNKWIIGEEFCHSLGFTE
EQLNEPSFDMLSELGATEEDIEAANDYVCGTMMIEGAPHLKPEHLPVFDC
ASTCGRKGKRYINHMAHVRMMSAVQPFISGAISKTVNMPSSATTAEIGDV
YEAAWQSMVKAITIYRDGSKLSQPLNISNASPQDEVIMLGTEEDLDETKG
PKEVQERIVERVYHRSERRMLPKRRKGYIREAYVGGHKVFLRTGEYEDGS
LGEVFIDMYKEGASFKGLLNCFAVLASKALQYGMPLEELVDSFTFTRFEP
AGAVQGHEVIKNSTSILDYVFRSIGYDYLGRKDFVHVKAVDEVPEVPANG
NGNGSGNGHAPKSKATELELAAHATKPHHDEKSVLKSQAAQAKMQGYTGE
QCENCGSVRVKQNGTCKVCEDCGMTTGCS
>CT0972 anti-anti-sigma factor, putative
MKHSISTRKELTILKLEEPIFDVRYADCFKATIDSMISTGTSKNIIIDFS
QVKAIDSSGIGSMLLAHQLANSSDGLAIFVSLCQQIKDLLKLANLDKQLY
IFSSINEVMTLIEPALKGKRGSRSRQQPVQDESIDEIGDELEIPDEAFES
EAYCEIEDEAEAIDEPEAIDENEKPHQKKNPSATAPTRKRGRPKKNPEAG
KTPLKS
>CT0943 iron compound ABC transporter, ATP-binding protein
MPGMTELGAPALAFRGVTAGYKGRTVLRDVDFGIAEGEFVSLIGPNGSGK
STLLKTATGLLKASEGKVEVFGREVSSLKPRQRASLIGVVPQKLDSPMAF
TVGEIVMLGRNLRGRWTGLEGHDYDSVEKAMIYTNVFDLRERRFNELSAG
EQQRTALAMALAQEPRIIMLDESIAHLDINHSQEVLRILMNINREERITV
LLVSHDLNLAAQVAGRLMLVQHGRLVKNGSPEEVMQPELLSRVYDCELRV
RRDPFSGNPVVSGALDELLRRPAVRKRLHIICGGGSGIELFRRLFIEGFE
LTSGVLNRLDSDAEAARALDIPCVLEQPFSAVGDEAFQQAAAMVSEADGV
VIGPVPIGSGNLVNLRLAAEALDAGKPVWIASGLDKRDYTPGKEAAEMSD
KLRKSGATEWKGIQELTAMLNRAWPESQH
>CT1569 hypothetical protein
MTQCPVSGLSVTEKEHWSFENRQGNYTRKYSLIGNDIIHVQELASQGIIP
DHLYSADFTSLIGEENLTGKPVFIMIDCEPIIDLKFSYKREFTNLVTAQE
SDLRLIVLYNIKPSVRLQLEMLQSLATEKLPVMLRESYKEAVTSILDFKS
GNVSSTKPDDGSDKAFRNAFLAEVSRILMLRQFSQLEIFPPENHASYPYF
EILDIMRRDLQALEEEHQQSIERIEQECRVLLASKNSLLDEQIEVCKRSE
QRFKAEESALLSRIAALEIEATRISTANAEKNAALRTLCEMIEKINIDPA
TMEKISAQCATLFETSDQAAMINTELTETDSVFLSKLQKKHPNLNQRELR
ISLMIKLDYHSRDIARSLGLSTRGIESIRYRLHKKIGLDKHNSLKTYLTN
LATESL
>CT0326 ABC transporter, ATP-binding protein
MIELRNVTLRYGEKVILDKVSLTVQDNTIKAILGPSGVGKSTIIKLMLGL
IKPNSGQVFVDGVDITPLKEADLYPIRRKMGIVFQGNALFDSMTISQNMS
FFLRENLQLPDDEIDRRVAEQIRFAGLEGYEDQLPESLSGGMQKRVAIGR
ALIFNPKMILFDEPTAGLDPVSSHKILNLIASLKKSNDLGAVFVTHIIDD
VFAIADEVAVLYQSKIIFDGPTGQLHDSQHPFIKSILSDKILEL
>CT1396 sensor histidine kinase/response regulator
MFGFSNSDISVSSGHTSHLDEVLPQSMYMVDADGRMVRWSPWFRDRIAGF
SDAEMASTDFLELIHPEDRALVRQRMRKILDQGVQDSLEVRIFLKGGPAF
RWFLLMANRYEYAGHYFITGTGIDITRLKNAGMAMTLGEQRYRSMFEQAD
VPVFITDTDGALVYVSPAFEKITGYSFSECEGRPFTECCKEIEPGGASLS
MFSDVISNDFNCRAGEITILKKDGSSCYVELKFQRYHDNRTAGAIGVLCD
LSQRKRFELLTEFRLELLQYADKMSIDEIMQKMLDKAEKLTDSLASFICF
LSHESGGVSDCILSSSFRDRMGATSRSGEQLPFDVMPFLNDAVKSGRASI
VNGYPEQGHSDVSFDHPAIFSSLAVPIIERGNVVAVLLVSNKRTPYTDND
AHWVGTLTDLVWEIVLQKRAAQAEINHQSVLLQIQKMELVGQLAGGIAHD
FNNMLGVILGNAELALSSDDLEASVEENLQEIYRAAERSAEMISQLLAFA
RKQTAMPKLVHINEVVQDSLPILQRVAGEKIEIELRPCGDDCRLHIDPSQ
IDQILMNLCLNARDAMNGTGKIVIEVKRINIAPSQYSSGDFRLPGDFIML
SVTDTGQGIADIHKSHIFEPFFTTKAQGKGTGLGLSSVYGIVKQNRGFVD
FESNVGVGTTFHIYLPLYKKHDVSGADNGTVSSNNEQTTILVVEDEPEIL
NLCQVMLQKSGFKVYAANSPSEAIVIAEENSGKIDLVLTDVVMPGMNGVD
LASKLLTISPGFRVLFMSGYPADVIASHGVDNPLVNVIRKPFTFKALVEK
VQESLAVE
>CT0670 ABC transporter, periplasmic substrate-binding protein
MKAVKPIFALVVGFCLWVSAFSAQAAERLLIYAGAASKPPTEEAAKAYEE
KTGVKVDVIFGGSGYVLSQMKLAKQGDLYFPGSSDYMDKAKREGDVFPET
EKVIVYLVPAINVQKGNPHNIHTLKDLTKPGLRVAIANPEGVCVGAYAVE
IVEKNFSPKELAAFKKNLVNYTGSCEKTATAISLKQADAVIGWRVFQYWD
PERIETIKLPKELIPRVGYIPIAVSKFTHDRAAAQAFIDFLTGPEGQKIF
AKYHYFATPKEAFAWLGEKKPVGGEYVVPADWLKK
>CT1259 phoH family protein
MTEKTIEFEGIEPVIIFGPYDSYLKKVREAFPDIRINARGAKITIGGDEP
DLTAIEKIFREIIFLADQHGEVLENDVNALVSLALSPVSHPSAALSGDKD
VIVETKDYVVKAKTDGQRRMVAEAAGNDIVFAIGPAGTGKTYTAVAIAVA
AWKAKKIKRIVLARPAVEAGESLGFLPGDLAQKIDPYLRPLYDALQDMLT
AEKLKFLTERRVIEIVPLAYMRGRTLNNSFIILDEAQNASSKQMMMCLTR
LGVNSKAIITGDVTQVDLPKEMESGLMNAQKILKGIKGISFVFLDKSDVV
RHRLVKDIINAYEHHEEGLRKGAPKEAEL
>CT0322 iron-sulfur cluster-binding protein
MSEKRLTMKETIRRKALSLGFCAVGFAAADTLNEAMKEYRAMIDEGRHGE
MGYLETGLEARANPELLLPGVKSVLSAALPWPAPAKPGAISGYAVIPDYH
RVVGELLKELLDFIRSICDHPINGQICVDSSPVLEKEWAEAAGIGRTGKN
TLLIVPGYGSRVFLGELLLDLELEPDTPLDWNPCGDCTACLDACPTGALN
TPGKLDARRCISYLTIELKRDFTDEEAAMTNGWFYGCDHCLDACPHNANI
EATGYPGFEQKKELVNLTPEKALELTHSQFRKRFAGTPALRLGLRRLKRN
ARAAVANKSQSKS
>CT1746 hypothetical protein
MLKGCDKVPENGMTRKDRERAYKKHDFSGTIKAVSGLMRYFQKGGTSKKQ
NQTQQNKSTRSAKPCHNKLMTTSIMTSQCRCSGTIFRN
>CT1840 hypothetical protein
MDALSLLIDQFRHMAWADALVLTTIIASPEAERDGYILGKLRHLHVVQKI
FLDVLRNSPINPQETNDLDVKALSNFSQDVHVGSMRLLAALTPVELERVI
KLPWSKTATKKLGFEVAEHSLADALVQVPEHSAYHRGQIAARLRELGVDP
PMTDYIAWIWRHKPDAAWPCSA
>CT0791 hypothetical protein
MKMLLSSINKKGVKMDKQVDHSNGKRRSGSDKGERMIAIGQKLGVAIPAG
LLLFCGVNASAMTRDAVNISFSPDAVRENEVAQHLTALSANPQGALLADN
DNPLHGNTHVNKFDPNIHNDYSDSGVHTDSHGNEHCNTHGDANRY
>CT0397 hypothetical protein
MDGKEQVRLSRLFRHHVPTRKQPPSNMKKQLLSWLRLSASFIILLLAAIA
VNQLYSLSVVLNGLLPYSGYAFLALWVLFFAVTLFTGLRLWSAPKAPPLA
DRVGGSNFEPYIAWLGKSLAVHHAHPDGGRKEHDLRWIKANIKLHEVDAL
NTTKTVATKNFFIGAFAQNTSYGTTTSLVNNIKLVWNIYARYHHKHSIRE
FLRLLRSVYECLPLSDFNKGELPAHIKPIIQCSFSNTLSSLLPGGNLLTP
FFLNLFLAGATNTYLTCLAGIIASKHCQVLSLEDKEEIVQQSMFEAAFML
KEIVKECNPILSVTISNAVKKAGIESLDTVTAPTSSSSLAQDIVSHLASS
IKHIIMESGKE
>CT2201 hypothetical protein
MVYQARPDAGKPDRTEIHKHIRTYFFLLPTK
>CT0193 hypothetical protein
MMIMKRYGYVAVGVMLSAGVFGGMFAHSAYAAESGNDALEFQKPAISLSQ
AVNAAETFTRGRAVRAELEKHGGQPVYDVEVVNGAKVLDVRVDKDNGRIL
AANVDKADHDDAGDAED
>CT0687 Nudix family protein, MutT subfamily
MVIGDVVCAIIERDGRFLIARRPEGKHLARKWEFPGGKVEAGESEAAALD
RELQEELGVRVEIIERLTPVEHSYPDRSLRLIAFRCRIVDGVPDAGEHEE
LRWIEIDEAGAYDFPEADLPILAEYRLKIAAVPPGAPDQLHKAD
>CT1144 Na+/H+ antiporter, putative
MKTEARILKECLAAAAPNCLHSILPPVVRDDQTRSMNKPMESYYHQFLEE
FRLPLTNPVLVFSLVLFIILLAPIVSKRFNIPGTVGLILSGVLIGPHSLN
LLEKSSAVELFSTIGLLYILLIAGLELDVNDFRKNRYKSALFGLLTFSIP
IVLGYPVCRYLLNYPMSTSLLTASMFATHTLVAYPVVSRMGISKNRTVAI
TVAGTILSDTAVLILLAVIIGYSRGDINHEFWLHLVIALTLFSAIVFIVL
PAIARWFFTKLENEKHAHYIFVLAALFFSAFLAKAAGLEPIVGAFAAGLA
LNPLIPGSSALMNRIEFIGNSLFIPFFLISVGMLVDLRIILSSPIALIIA
ALLTFVALAGKWLAALSTQKIFGYSAAHRRLIFGLSSSRAAATIAIALVG
YRARILDLNILNAIIILILVTCIVSSLVTEKAAKEIVLEENDAAPEKEHA
QGDADEQILLPIAENKPSERVLELAVMIREKRSLNPLTILTVVPNDHEAE
LNVKKAKKELAPTVDFAASFDTSLNIVATIDYNICSGISRAVKESQSNLI
IFDWPSRQGFLGRMINDATESIVECTCKTTMICHLTRPLAIHRRIVVICP
PFAEKEKGFGQWLRKMSRLSQELTIPLLFHCDRKSRLAIIETLKTSHSTS
PVLYEVFKNFREWEDIASRKLHFREDDIVTFVCARRESISYKPFFDLVPE
QLEKHVKDISKIMIYPEQFDSAIIEEEYADVVAPKSFPFGASTIRKIREE
ISGFMKKNQLQIRKKARGKTAATPKRFGFSRKEP
>CT1459 hypothetical protein
MMKHNTIKIIAASAVFSISTGIFAPSFCLSNAATGATNIQGNQNIVAGGN
VLIYHGLSAKQ
>CT0137 conserved hypothetical protein
MPEIAPFKGIVYGPDLSGDAANLICPPYDAIPPAMQQELYARSDYNAVRL
ELPSEADPYAAASSRLREWLVSGVLAQDGEPALYPCFQTFEDEHGVTRTR
KGVFVALRLYDFSEGEVLPHERTLSGPKADRLKMFRETGANISSIFGLYA
DSSRRADEAISEFAERNAPLIDATLQGVRNRLWRVADPALISVAQSVLAE
QKVYIADGHHRYETGIAYRNERAASNPGHTGREPYNYIMTYLSNIYDDGL
LILPIHRLVHGIESFDPESFIAQLDRWFTVWELPGRSALDEFLETGDSAK
VFGIVLPGMVLGISLDPKPSEVLSTPVPEALQSLDVVVLHDLVLGQILGI
SSEAMARQSNLTYTSKTADVFEAVVSGKAQLGIVLRSVRVEQVIDVSVSG
EAMPQKSTWFYPKVMTGMVFHSLETEA
>CT0486 hypothetical protein
MLKGPEREFVANGCRAASPDAAEIARIAKHASGKPASAPPPLPAEFMLRV
CTRDDVEAMAGIYREVFSTYPFPIHDSVWLLETMQRAISTTSASSTKVVS
SRWPPRRWKHERLAQASGVKAGLFSVADSVPF
>CT1764 hypothetical protein
MYSRIQTLDKNRPGNEKIRFTAAGYAMQLSMHHLNNA
>CT0707 conserved hypothetical protein
MKYSEAQQGRVFVIRLEDGDIFHEEIERFAKEKGIERAYLNVVGGADKES
KLVVGPEESRTYPVNPMEHELYDAHEIVGTGTLFPDDTSAPVVHLHMACG
REENTVTGCVRNGVKVWHVMEVILVELLGTQARRLPDKATGFKLLVP
>CT0782 6-pyruvoyl tetrahydrobiopterin synthase, putative
MNDIVEKPRKIYVTRQIEFNAAHRLFNPELSDEENQQLYGKCSGKYGHGH
NYLLEITLSGIIDRKTGYLFDLKELKKILEEEIVARFDHRHLNHEVNELA
GHVPTTEILAVIVWEILDSRLKTITKQEVSLHEVIIHETGKNSVTYRGE
>CT0713 hypothetical protein
MTFLLIHAERKIFKEYDFLVFLSRRHNVIVILYSLDIKLVF
>CT1939 ArsA ATPase family protein
MRIILYLGKGGVGKTTVSASTATAIARSGKRVLIMSTDVAHSLADAFGVE
LSSTPVEVEKNLFAMEVNILAEIRENWTELYSYFSSILMHDGTNEIVAEE
LAIVPGMEEMISLRYIWKAAKSGKYDAVVVDAAPTGETMRLLGMPESYGW
YSEKIGGWHSKAIGFAAPLLSRFMPKKNIFKLMPEVNEHMKELHGMLQDK
SITTFRVVLNPENMVIKEALRVQTYLNLFGYKLDAAVVNKVLPSNSSDPY
LQALIDQQAKYLRVIDNCFYPVPIFRAMQSSKEVISTDRLYDLSQELFSG
RNPMEVLYSNDKTQTLEKINGKYVLSLYLPNVEVDKLAVNIKGDELLVDI
NNFRKSIVLPNVLVGRKTEGADFEQGTLNITFAN
>CT1793 conserved hypothetical protein
MKKLILCVLTAFALLAPATSYCQPPSQKEFSELKKAAEQGDAQAQCMLGL
MYELGLGVRQDKRTAKEWYGKACDNGNQKGCDNYRRLNELGY
>CT0395 hypothetical protein
MVMKQYVMATALLCYAAPAYAAYPLTTDDTGTQGAGGWQIELHTEFSTSS
RTDGGVRIKDREDDATTVISYGVAKRMDVIVTLPYQWYQHRQGQLVTDDE
SGIGDMTVELKWRFLENEKSGLSLAVKPGISLPTGDADRGLGTGRVTGGA
VLIATKEFGALTLHANAGYHRNAYALDADDAACNKDIWNASLAGEYAFSE
KLRAVADIGLETATEKGSRTHPAFLIGGLTYSITKDFDFDFGIKGGLNDA
EPDTAVLLGLAARFN
>CT0213 hypothetical protein
MIAVSISHALINHVDEVYVVDHASTDQTFHGLNHLKKLWGERLHIITINN
IGFFQEAATNTIIQISKKSNPDWIYVFDADEFLLVDESTSLRKILSNTEK
CHQAIRYTVNNYISTRHFNDLILDNYKNIIHKSVPNPEIEKKANTRPYKS
LTKNKTIPELIYDGDATFFDIPFPSKVIIRLSDFLQITAGAHSAIHFKGN
VQGKHIKEIEAAHLTYPTKKRLQNKAEHGEFKIKNKFPPNHGWQNQLVYK
IAQEGRLDWFWEKHSMPDKPTSKSLGPAHIIDRRFSEIITKTTDFLEKGF
QSGDLSMIGEKTIEKDSGDETQISISEMIMLSETLRTQNTALIDNFYKTL
QEKPLYRLLKTAKHVEREIRKRTRF
>CT0086 hypothetical protein
MIDQRTSYPSSMTRHLSPFWLNRLLGIPSSTPLTPRLMASTLRVAFCPLQ
EESPRLDHFITALRESFVECGVTIVEEAAGEGRDSRVEAGTALIAPGRFE
DHQLPISRVSTLYNNLIVGVYDEPPPVHGGQTPQERLDAVVGKLAWEMVH
LLIYVTENSWTVCSMNGGITTFDTPLPESRDVLESLIPKLTAQVVPPRDG
ELELRTGALDTSAPEFLEFAADFVECGRIWAGNERFMNHTSRESLDYRNG
FCRKIVSRYLDERNGMSYGFFARQLPVKVAPAIEADDLGGTSVGDALVPV
TIAGKQLLVPVPGVRILTTRSGCRKTAIDPRHDLVEIGLDNGHAWLVTPA
GLPEGLVSKPSFDTLTIIAHAIGNAMIASILLALAPGNRFPGLLARFGCG
MTHWHYYLDEEMIPDGYVVHGFDNPPVSCSTPQSAAYSLLGKLDALERAL
EGGTDYLGDVHIEPGHGTNVVGTRSLAKMAMFLNADSCVCSQREG
>CT0433 ATPase, ParA family
MRAGVREVIDDSSERTLQQVIERAFLRTKGAIIRQKRVFGFVSAKGGDGG
SCIAANFAFALSQEPDIHVLAVDISLPFGDLDMYLSGNTHSQDLADISNA
SDRLDKSLLDTMVQHISPSLDLIPSPATFEKIVNIEPERVSDLIHIAASF
YDYIIVDFGASIDHVGVWVLEHLDELCIVTTPSLQSLRRAGQLLKLCKEF
EKPISRIEIILNRADTNSRITSDEIEKVIGRPISKRIPQDEDAMQESLLS
GQSVLKVAPKSQLSKTIVDWALHLNGVSRPNKRSIWERLKIK
>CT1895 hypothetical protein
MLLTSVVKPALIANGGSVLHVSSMDHNSSVFDPGNMQGERSWSGYEAYAR
SKLFNIMFTLDLSSGDNAVRSNSLDPGVITTKLLHAGWSLAGDEVSVGGD
DVFETVIEIARHGYNGEYFENARPAICSSVARAPAARQENEIPKQKPRFF
GSFVIENVA
>CT0881 hypothetical protein
MRVADDRNELEAVREFVNRTLDGVALAAAVGDRIVANAEWAGQVAQNVEE
RAAKNRAGFVQKRRCLMTVKEKEYLAFVAQRQRFSQGGSELVVVVIEQTV
IVVAEQLPEEAVAIESKQQLIQPVPCSGDGIEIEFEQLLDIAVQHKAEST
SKVPLAQHRFEQLGVVPEFIVAPAIPKMQIAEHHHPRGAINPDRFRGMQK
PFKIGIARHVPPAG
>CT1912 hypothetical protein
MSLQSLSSFLKVIADLLQTLKVCLRQFDLFLQVLQVWKSIRITLNDFSDI
ELQPIVLEYYFLKLMM
>CT0968 conserved hypothetical protein
MGTVLCSCLLGDNAKVRLKLSKMLEEEASGFYPDVLPEGSVSICEVEMKG
CRREFYADRRSEEIEIGTPVIVEADGGFDMGVVYSTGRIALRKLQLKGLV
SEVEKLPAIMRRATEEEVREFTEIRKREPEIREACLKRIERHQLDMKLVD
IDLRLDQQKLSVYYTATHRIDFRGLVRDLAGEFKARIQMVQITTREEARR
TNAQGSCGCALCCSTWMQKIHSNPFAEKPQMPDSMNGDNFSFNTIGLCNR
PKCCLGFSKTNGKNGNCHHGKVRQNGWPSVGTTFTVRDGNAVVEGVDQQK
KSIWIRYVSTGQKRRISLDQYNNMAGKRQGGQA
>CT1359 conserved hypothetical protein
MYPNISDCGVIGDTRTAALVNSNGSIDYCSLPYFDSPTVFAALLDERKGG
YFSLKPAEAFSSRREYLPDTCILCTSFTTRNGKAALYDFMPHQDDKTRER
TQGIHRCIRVDEGRVKFTLTLKLLTFQQTGAIVAAATTSLPESIGGKRNW
DYRFTWIRNASFTLKAFFALSHTSEADTFIRWLHDTYRKNGSRGFSQKLN
AFVQRFDTEILDASLLIMPLVDFLPVTDQRIQGTIEACQTHLMDNGFIRR
YRADDGLEEDEGGFLLCNFWMIECLALSGKSAEAEKLLGITMAAANDLGL
FSEEYDPYSREMLGNFPQAFSHIGYINAAATLIDSKLPLANP
>CT1713 exopolyphosphatase, putative
MSSEKLRVAAIDLGTNSFHMVIVEESEEKGIVEIDRVKDMICIGRGSIST
KRLDDGAMEAGVATLRNFIVLATQRGVALHNILAFATSAIREADNRDEFI
DMVRRETGLKIRVITGKEEAQFIYYGVRNAVTLRDKPDLLFDIGGGSVEF
IIADKSKVHLLESRKIGVARMLERFVTTDPVSAHELHLLQQFFAAEMYGG
AAEMAHELGVSRAIASSGTAQNIARMIRLGKHADGADVLNQSSFTRQEFE
SFYRQVIAMDASARRKLTGLDEKRVDLIVPGLILFDTIFRVFGIKDVVIS
DSALREGMVLHQIALIRGRDGSSQLDIRRQSVMELGYRCNWHKPHSEQVA
RLALMLFDELHPLHGLKERYRELLEYAAMLHNIGEFISISAHHKHSQYII
MNADLRGFSPTEIDIIGNVARYHRKQPPTERHPLYSQLKPSHRRVVDVLS
GILRIANGLERGHRQNVQSITARIDQERIVLEALTQFEPDIELWAACGLK
EWLEEVLGKPILIEARVR
>CT1013 cytochrome c peroxidase, truncation
MPTLINVKLTYPYFHDGAAQTLAQAVETMGQIQLGKKFTPKENAKIVAFL
KTLTGD
>CT1587 hypothetical protein
MSQTRKHQENRLEKHFFHKNRHKTSSIETRLARHCDFFIFNLS
>CT0587 hypothetical protein
MMPAKWDEVRPLLDRFSRDLPTNATVWFMMPDGSYYATAKGGFTDQNLKD
CTCFPKLIGGKEVLGELVISRSTGQRSIVVAAPVVADGKVVAAVGVSVDA
VKLIEVVESRMTLPENAYIYALDAKTKVTLHRYQARTFKTVFEIGNESLG
DTFKKVMHREQGVFNYSLEGKKMNSIFRKSPVHGWYFFIAQQCK
>CT1945 ArsA ATPase family protein
MRILTFTGKGGVGKTSVSAATAVRLSEMGHRTLVLSTDPAHSLSDSFNIQ
LGAEPTKIKENLHAIEVNPYVDLKQNWHSVQKYYTRIFMAQGVSGVMADE
MTILPGMEELFSLLRIKRYKSAGLYDALVLDTAPTGETLRLLSLPDTLSW
GMKAVKNVNKYIVRPLSKPLSKMSDKIAYYIPPEDAIESVDQVFDELEDI
REILTDNVKSTVRLVMNAEKMSIKETMRALTYLNLYGFKVDMVLVNKLLD
AQENSGYLEKWKGIQQKYLGEIEEGFSPLPVKKLKMYDQEIVGVKSLEVF
AHDIYGDTDPSGMMYDEPPIKFVRQGDVYEVQLKLMFANPVDIDVWVTGD
ELFVQIGNQRKIITLPVSLTGLEPGDAVFKDKWLHIPFDLEKQGQHHRTR
EYNKA
>CT1429 hypothetical protein
MNLLILLLAVIILLLLVIITMLATGWPGKQREEVERLGNSLRREILEQRS
GNLQLMKSLRIVIEDAVRESVEKEMMAVAPRGRSRRNSRKKIQEAVDLGS
ELFIAGDEDADNGSYESPLQAMQLSLFSEMTERVQAAAVPDASPDKTKER
EPEGETIHMGYVDDIPDVE
>CT2246 hypothetical protein
MMDDMIQMVEKAIETSLHWQETGWPVTFGNRQVEVSNLKAAEALPRNAVY
RDEAINYWRQVRLTGEDTAAAGKKALEALKNGDICAAYDALYLCQYLEIP
FEADAKTWRPVYEAFMAKCA
>CT0523 hypothetical protein
MSCSGEVSIKEKGFIYSCSDLHDEHHQRPLTQHDPPVKFTKSIKLLTLLL
LVQSFEGILPLRAGESASKPQKAKDGALPDSLLEKREDLDSTVVYTARDS
LIYNVSKRTADLFGKAKVNYKDSRIEGPRITIEQATSTARATASRDSLGR
PAELPVYTGKDGSFSAETIAYNYKTRIGTASDMSSKSDRDIYSGGSDQAI
YSGKEVKRMPSGELYIEDGVYTTCDLEEPHYWFAGKHMKIIPGERLISRP
FVMYIHPEIFHMRLPVLPVMYLPYMSAPISNKRASGFLFPRFGNSGDMGS
YFSNLGYFWAINDYADLRLDGDIAFKGGWRLGERFRYKNGDRYSGSISGE
YARIILNSPGDSNYARYINRDLRIEHHQQFDPTAVLDVNLQFLGGDRYYY
GYTSVDPENLVTDQATSYASFTKSWDENNRVLLGGYQRVDNLSTDELTQR
VTLSLYQNRIYPFRPRLSSSSSESPGWSSRLFVQPTLSGSGQFDAAGGVN
TDFYTGNAGLELGYLQDFSPGNRALFTQGLNMQALRKTITGEDDLNATSV
QLPFKIQSTLFKYLHLTPALTFTQYRVNSTVNKYYDHIAGKVVTQTINDS
DSYATTVFSLDAQTRLYGVMNTGFLDKLVGLTAIRHVFIPTVSFIYNPDY
TGSGYGIYGSYYDPVQMKNVQYNRFGESLYADVPEKRTFVGLSLQNIFQG
KFRSKKVSNEDGSNAGAGYKTVQLLSLTASSGYNFAADSFPIAPLVLTAS
SNAFAPALMFSAGATYDFYTYDPATGDRVNKLAMDDGKGLLRFVNGFLNM
SVSVSGSLHTSYASHDERDGEMSLVREKALPVEQAIYKERFNSDERTKFS
ASLPWSLRMSLYLISDKSNPLDPSSAALLNTAARLSLSRNWQVGLNTGYD
LRNSEFVYPALMLDRDLHDFWFSAQWVPSGEHKGYLFQIAMKPANLKYLK
LKAGSGHIVQSPE
>CT0579 hypothetical protein
MPARRFFVGGFEFDIRFECKIDPDSFALCSD
>CT0329 undecaprenol kinase, putative
MNLFQAIILGIIQGLTEFLPISSSAHLRIVPALMGWGDPGAAFTAIIQIG
TLAAVLIYFAKDIVSISGAVISGLVKGKPLGTDEARTGWMIAVGTIPIVV
FGLTFKHEIETVLRSLYIVSASMIGLALVLVVAEKHTANRARDGRRGKSI
NELSWTDAIIIGLAQAMALIPGSSRSGVTITGGLFRNLDRETAARFSFLL
SLPSVFAAGMLELYQTRQEIMSSTHNMLNLAVATIAAFIFGYLSIAFLLN
YLKRHTTGIFIAYRLILGIGLIVMIGTGHLLP
>CT1663 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase, putative
MTFGSLFRNQTRTVVKGHIFNLPNSLSILRILLIPWFIYYLDAGQTHIAL
IIMVVALLSDWFDGQMARWTNEVSDMGKILDPLADKLCLASVALYYLWKG
ELPVWFVVFVVFRDLVIFLGAAWIRHRHNVLTTSLWPGKWAVGFVSMMFI
SMVWPLPVFRQWPVKEFFMYLSAATLFYSFVEYCIRFYKIQKGAEFKA
>CT1146 conserved hypothetical protein
MKIFLKISLLFFLISSALCASLYLGLGFLVSMHSTKPEKADIIVILGGDD
GLRVSKGGDLYKAGYAKHVLLTGIDSRYYRPNHPNWRERKLMARGVPRKH
IIVDTWSETSWEEAENTSDLMDKNGWKSALVVSDPPHMLRLNKTWKKAFA
GTNKRFRLVATEPSWWNPLLWWRNPISYRFVINELKKNIYYLVTYY
>CT1634 hypothetical protein
MPEPFFQGISIFSREAGMTNSVTSDAVTVKVTLRHSNNHGSIEDYARNAV
SGLSKFYAGALNCHVILDHQKNDHDQNKLAEITVHVPQHDFVARESAPTY
EQAIENCIDVLERQLKKLKEKQRNI
>CT0504 hypothetical protein
MATAIYVFFDNDYITDRAGIQKLFEQRPLVEVIVQSIPYVWLFALFLFIV
AAFYGFRHTRKGYRYPMFRVIGGSLLVSFLLCGLLNVFDIGKYVHRYLID
NVEGYGSLVYTNDVLWAQQEKGLLGGKVVRYTPGDSTLVIRDYRHHFWTV
DLSRARARPGTKIVTGKYLKITGLKTGQSTFKALTIRPWVKKSHHRHPKA
PKPTPVKKSSASGKPASPLSPAQQLK
>CT0585 hypothetical protein
MYYRVDGGIARVRAILDCRRDPEWITKRLG
>CT2241 polysulfide reductase, subunit B, putative
MSQNRREFLKKAGLGALAGLGAAAGLFPAFALENTDMSSFTTKPTPGKVR
WGMLVDTRKCLGDCKECIVRCHHDHNVPDFGHTKNEVKWIWKSGYENAFP
TASTQFQNPEVLERQVLTLCNHCTEPPCTKACPTEATFKRWDGIVSIDYH
RCIGCRFCMAACPYGSRSFNWLDPRPHIKELSNTYPTRMRGVVEKCNFCS
ERLVKGELPACVVSCAENALTFGDLNDPNSEIRKMLATNETMQRKPELGT
LPSVFYII
>CT0497 hypothetical protein
MFQPLVSNQKSPYWHHHKRMKIVSSMLFSFLEIAATVIHKKI
>CT0594 hypothetical protein
MGCVYCLCKGLSNDPLRRFLNMPHRQLDHTADLCFEVTAGSYPELLGEAL
CAMTEWIGPEWKNSAVERPFRIEAPDRVALLVDLLNEALALSQIYREAYG
ALSIRSSGEEFIEGAFLGRAISGARNEIKAVTWHGAKVEQRADGSWLAIL
LMDI
>CT1907 hypothetical protein
MENQDKYYKGLFFGAILGAAVGTIMGLLFAPRKGEDTQMIISGKVKRAID
RATELYEGSEHEAAYSNEAKHRSQEIIDSARDEARKILDEANNIIRDIRG
SQAKAQEN
>CT0376 FusA/NodT family protein
MSVNVFINYIRIMRRLRRTFVIVAVLCFVCLPKAPVLAVETLDWAQCLAE
AAGHHPDLRSAAESVRQSEEQRNSVKGGMLPSVSASAGGERSGSSAAAPV
GAWSYGVNASQLLYDGAKTSSEVKSATETLKASKYNESKVSVATRYALRT
AFVQLLTAQKQVALAEEIATKRKQELRLVALLYKSGTENIGSLSKAQADL
GEAEFEVAQARRALDLAQMVLSTQLGRRSFTPIRVTGQFTASEQTRTPPD
FDSIARNNPAYLELTAQKSAAGYSLQAARSAFMPSVYLSTGFGNSAFRQL
PPDRTDWQAGIDVSVPIYAGGSGKAGVAKARAVVNQLSADEESLYLSLKR
SLAQAWKTFIDASGNVDVQKKYLDAASERARIADAQYSAGLISFNDWTII
EDNLVTAKKSYLNAQSNLLTAEAAWVQAKGGTLESR
>CT0583 hypothetical protein
MHLSIPLKQMSVEEKLQAIEEIWADLASTPANIPSPAWHADVLQVREERI
AEGRAQFLDIEEAKKAVRERLG
>CT2096 hypothetical protein
MTTRRSIAIAAWCLLPVALWVTSLPDNTVSAENKKTILEHADQIEGGEKA
GPSGTAIPYRSAVGNVKFLHAETTLECDRATDWPDSERIDLEGHIIIKDK
NVETRADRGVYHTDSETGELSGNVRGRVTGDSLTIKSGRAAFDQHKNELW
LFDDAVAWQLGRQLSGDSIRVHFHEVGGKKKVDEIQVFGHAFLAVRDTLS
ASPALHDQLSGKKLTANLDDNSRLQKVIAIGKARSLYHIYDDKNQPSGVN
FTSGERIRMFFAEGKLDRILVTGGPLGKEYPNYMRNDPEINLPGFRLRDK
EKPVFAP
>CT0758 outer membrane efflux protein, putative
MDRAGGGGIFGAPYNKQNAVMNLHRIIALLVLFFVCSTPVHAADGTEPTA
RPPLVISLDDAVRIGEQHNRELEIARLDKLMARQKVRESWAEVLPHLSSS
LTYTRTLKPSVLFLPSSIFGGTGGTQAIEISSDNSSIASLNLSQNIFKLS
AFAGIKAAGLVRQISDQSFRNTNAGVVTSIRRAYYDVLIATEKRKLVEQS
IARWEEARKDTNALFRQGVAADIDTLKAWLSVENLRPDLIRAQNNEAITA
TNLKRVMGIDQETPLILTSTLTFHEIEMPKSVAAAYSEALDKRPDVRSLS
LQAEAENAKVMAARSEGLPVLSAFGQLEAQTQFNDGTSLASTRWPVSSTA
GLQLSVPIFSGFATSARIQQAKITRLQTSTRYEDLKSQVRADVEVRLSNV
VEARKRIDVQSKTIAVAERSYKITMLRLREGIGSRLELTDAELQLDTAKA
NYLQAVYDYLVASAELEKALGRIEPTESKL
>CT0816 hypothetical protein
MLPELIRVTGYAVAVVVLFFGLSASGLLSSLTIDSKSSSGAFFLFGLIRS
LPTGFMTILLPCGFTMAAEGAAILSGNPWQGMTIMTFFVMGTMLPLLVIV
MSGSELSNKPSTSRVFMKTAGLLVIFFTLYNLNSQFGLAAKVAGRNEPPA
QLKTAATTGSARIIRTASSNSYFSTNRFDVKKGEKNRTRSPAPWVCPGGG
SSTSSNYSGVPAMQKIGLSAYAARMARTMVRDQ
>CT1413 hypothetical protein
MITFIDHVTAMKELSRKFSADEPDAPNSPGRLISRPVRDSRPAFQDLRFI
VKESLNLGFRFFDEVDRMLQKFKGEEPEKQNSEEKADDPS
>CT1686 hypothetical protein
MKKGSDGPTPHKKALPLRDAYQSKLPIAVFVLIASRR
>CT1010 natural resistance-associated macrophage protein family protein
MAVSFSKLIQSRWLPLKKLLGFLGPGFLVTVGFIDPGNWATNIEGGARFG
YELLWVITLSTLILIVIQHMAARLGIATGKSLAVNIRDHFPAPVSSLLGF
TIVAACVATDVAELVGGGIGFSLLFGMPLWAGALLTVVLEVFLVVSQRYH
RIETIIVGFLGIIALCYLAELWIVRPAWHEVLPATVTPVLGRESIYVAIA
ILGAVVMPHNVYLHSNVIHSRKWGMTDDEKKELLRYEKADTLFAMTLGWV
VNSAMIVVAAAVFHQHGVRVESIEQASATLRPLAGPLAGLLFAVALVFAG
VGSSITSSMAEANVITGFLGKPEDPESLLWRVSVFLTAIPSFIIILFKVD
TYKILIFSQVVLSLQLPFTLLPLLVLCRSEKVMGVFRSRGVEFVAAVLIT
MVVVALNLYLLSTTVTGDS
>CT1386 phytoene desaturase
MNYSYNGQTVLHDAGQKLSLPNAYDYCRQIARHHAKTFYLAAKFLPKRQQ
NPIFAMYALLRTVDDLVDLAQDKLSNGQLTRKEINDSIADWKMRLRACYD
GSPSNDPILMAWQDTLRHYSIPIELPLDLIDGVAMDIDFKTFETFDELYV
YCYKVASVVGLMTVEIFGYSNKEALQHAIDLGIAMQLTNILRDIGEDIDR
NRIYLPLEDLRRFNYSREEFMSRTMNNKFVDLMKFQIDRARKYYASADLG
IPMLEKNSRLAVGISSRNYSDILKAIEENSYDVFTQRAYRSFYQKLSTLP
SIWIKTMISA
>CT2042 succinate/fumarate oxidoreductase, flavoprotein subunit
MKPFDIVIVGGGGAGLYAAMEAMKTNPGLNIAVLSKVYPNRSHTSAAQGG
ANAALANKAKDDTVEMHIYDTIKGSDYLADQDAVEVLCSEAPKIIRELDN
MGTPWSRMDDKTIAQRPFGGAGRPRCCYCADKTGHTILQTLYEQCLRKGV
FFFNEYFALALSVDGSRSRGLIAMNIKTGKVEAFPAKTVIFATGGYAKMY
WNRSSNAAGNTGDGQAIAYRAGIPLKDMEFVQFHPTGLRKSGLLVTEGAR
GEGGYLVNALGERFMSRYAPEKMELGPRDLVSRSLETEILEGRGFDSPAG
KYLHLDLRHLGADLIKSRLPQIREMCMYFEGVDPIEEPVPVRPTAHYSMG
GIDTDNFGRTIMEGVYAAGECGCVSVHGANRLGGNSLLDILVFGRIAGRA
AAEECGKFNPSAIPASEVAEKEHELRSFMQPRGHYERYGTLREELGHTLG
ANVGIFRDSSKIKQGIADIESLQERFRHVRVFDTSDIYNTNLLQVLELKN
MLDLSETVAAGALAREESRGSHTRTDFPTRDDEKWHKHTIYTYENGRPKL
AYKPVTMGRYELQERTY
>CT0837 hypothetical protein
MQERLPFSHRELKKNKAGKPESSRFRNPLVSAKFADADEP
>CT1275 alcohol dehydrogenase, zinc-containing
MVLERVVDLLHETQPLVLRELPVPEPGPGEILLRVATCGVCHTELDEIEG
RTPPPRFPVIPGHQVIGRVVACGDGVAGIETGSRRGVAWIYSSCGHCDLC
RSGNENLCAEFSATGRDADGGYAEYMVAPAAFTYSIPDVFSDAEAAPLLC
GGAIGYRSLMLANLKDGQILGLTGFGASGHQVLKLARYLYPKSPVFVFAR
SEKDCDFARSLGADWAGGTTDNSPSPCDSIIDTTPAWLPVLSALERLRPG
GRLVINAIRKESHDSELLAGISYERHLWMEKEIKSVANITRADVSAFLDI
AARMGLKPEVRSYPLEEANRVLLDLRHGKARGAAVLIP
>CT0688 hypothetical protein
MPGERSRDFSALIVLKAYISGECKYAVILLHRPMRHERAYAGIPVMIETI
TWRSDRG
>CT0459 hypothetical protein
MERKLQEILDSAIPLTQAMGIVVERYTGRELTIIAPLANNFNHLGTAFGG
SLYIACVLSAWGLLYLRLREAGIKGSIVIRKGNAEYLRPVTGDIVATGTL
PTEEEFAALIESFDRKGKAKMTICAVIEVEGKVAVKFEGEFAVVR
>CT1701 iron-sulfur cluster-binding protein
MKRQIITIDEKKCTGCGDCIPACPEGALQVIDGKARLVSDLFCDGLGACI
GHCPTGAMQVETREAEPYDERRVMAESIVKAGPNVIAAHLCHLRDHGAND
YLREALAYLEEQGIPNPLEQPVAAAHPAHVQHGGGCPGSRTMDFRSSNGS
AQASSVAPVAGAVHALSELRQWPVQLHLVSPIAPCFEGSDLLLAADCVAF
AAGDFHSRLLRGKTLAVACPKLDSGLAVYVEKLAAMIDHARINTITVAIM
EVPCCGGLLSIVEEASRRASRKVPVKKIVIGVQGDTLSEEWM
>CT1079 hypothetical protein
MIGAVPFNNVPFSSLSVHSGHVFHLAIAGIIVRIGYIREVFFNFSSDTAA
>CT1228 UDP-N-acetylenolpyruvoylglucosamine reductase, putative
MTEPIFPCPFDERMPLSTVGYYGIGGEARWIVHPRSVGELALVLDRCRQL
GLPVIIAGKGSNMLFSDEEFPGVVIVLDAMNRMFQVSDELFFCEAGVENT
DAAIVLQEAGRCGGEWLYRLPGTIGATVRMNGRCYGREISAVARSVVTVG
LDGAVRWRRADEVFLGYKETRLMQSPEIVVGAMLEFAEHDEPEAIGKRMQ
EYGDDRDAKHQFDFPSCGSTFKNSYDAGRPSGQIFDALGFRGRREGGAQV
SDHHANFIFNTGGAKAADVLNLCAAMRTEAREKLGATLELELQCAGLFQT
ALLDACGIASTPEPSRPGYGWTGLLPFPDACDDAFPRVLLQGEALDYFCR
DAVFPAGIAVEVGQLIPLDEARKAPDRPFIRWTTRDESGVAFSLHPDAPV
GAFVDRLWEHNVSELFIGQGGGSGQYLEFEVTPEGHWLAIRFDAPRQRTA
GHEIPSEELWRSQATPFASEKGFGIELSYALLEPFIHDDTLRLQCAVSLG
DGRYGLFPWWRGEGAPDFHQPERYCVVRLG
>CT1181 DHH family protein
MIIPEYGRTLTPEEWRPVVDEMLEATHIIFTTHENSDGDGLGSQVALALA
LKALGKEVAIFNPTEVPPNYLFLKELHEINLFRDRDEESMQEFFLADLLV
VLDANLHDRIGRLWPHVEFARQMSRLKVLCIDHHLEPEDFADITVCETYA
SSTGELVCDLVTALELRTGQQLFTPEVASALYAAIMTDTGSFRFPKTTPY
TYRLAGMLVEKGADPELVYDRIYNALTPEALKLLGLSLSNIKIIENGLIS
WLFITQEMLEQTGSKLFDTDLIIRYLLSVPTVKVAVLLVEMQDGRCKVSF
RSRGKIYVNQLAKHYGGGGHMNAAGCLLRMSAEKAQLVILEDVRKFTLEQ
VDS
>CT0753 ABC transporter, ATP-binding protein
MLSTVRSLKQNTVSTIVNHVFEFSNVSAYRGGTKVFEGLDLSIRMGESTA
ILGPNGSGKTTLLKLISREIYPVHADDCRMTVYGRDRWNVEELRSRLGIV
SHDLQQNYGVYAKGREVMLSGYYSSIDTWPHQLFSATDIERAEELMRQLG
VEELADRPFGKMSTGQQRRFLLGRALVHGPEALLLDEPTSGLDISASFQY
LDTVRKLMQQGTQLILVTHHIHEIPPEVTRVIFMRKGRVVADGDKEAMLT
SSNVSSLFGCGVELIERNGYYQAVPAG
>CT1320 BchE/P-methylase family protein
MSLTNGVAPSLEKLAAEQKSRKKWLLVQPKSQTSMMVDSGAVSMPLNLIM
VATLASKYFDVTFLDERTGDTIPQDFSGYDVVAITSRTLNAKNAYRIGDR
AKAQGKIVLIGGVHPTMLTDEASLHCTSVIYGEIESVWEELAIDIFRGKM
KSVYKASNLKPMTTMTPPDFSFALNSPHAKKYSQLIPILATKGCPVGCSF
CTTPTVYGKSFRYREIDLVLDEMRAHQERLGKKKVRFSFMDDNISFRPKY
FMELLEGMAKLGVRWNANISMNFLQKPEVAELAGRSGCELMSIGFESLNP
DILKSMNKGSNRLQNYEAVVSNLHKHKIAIQGYFIFGFDDDSEKSFQATY
DFIMQNRIEFPVFSLLTPFPGTPYFEEMKDRVRHFDWDKYDTYHYMFEPK
KLGGEKLLENFIKLQREVYKGSAIMKRMQGKPLNWVWFVNFLMNRFTRKL
TPEMYL
>CT0622 transcriptional regulator, NusG/RfaH family
MTNALKKDGCWYAVYVRSRYEKKVHQYLLEKGLSSFLPLIETLRQWSDRK
KRVEEPLIRGYVFVNINYHKEHVHVLETDGVVKFIGIGKTPSVISERDID
WLKRLAHEPDAIGETVISIPVGKKVRVLAGPFKDMEGVVKKEGREERLLV
YFDSIMQGVEITISPELLAPIEKGASGQAVEGSKTGDHEVESAIRHLAHS
>CT1708 hydrolase, haloacid dehalogenase-like family
MSRTLVLFDIDGTLLKVESMNRRVLADALIEVYGTEGSTGSHDFSGKMDG
AIIYEVLSNVGLERAEIADKFDKAKETYIALFRERARREDITLLEGVREL
LDALSSRSDVLLGLLTGNFEASGRHKLKLPGIDHYFPFGAFADDALDRNE
LPHIALERARRMTGANYSPSQIVIIGDTEHDIRCARELDARSIAVATGNF
TMEELARHKPGTLFKNFAETDEVLASILTPKHS
>CT2206 polysaccharide efflux transporter, putative
MKRDVTIAREGSIAMAGFAFGQLFRFGYNFAAARLLGAEALGTYALVVAV
MQVGEVVAVGGFDAGLLRFVSQREGEKRKSIIASAMKRSVLASMLAGVLV
LLFSGDVAGLLHGGWLMQLALCSAALALPLSVMTIMAGVTMQAHHNLLPK
VLATQVIQPVLLVLTMVAARYALGVSAALALPFLLAPLAALVWILPGFRQ
VTGIGLSDVCRAYGNRELWQFSLPLLAVALFSILSHWIDVVMLGFLTDVR
TVGLYQSAARTAGLLRSVLLAFSGIAAPIIAGYHGRQENTGIRETYETVN
RWIVMLVMVPFLLLVLFPDEVLSVFGKGFGAGSTALVLLAVTSLLYASFG
LGNTVLAMSGGERLSMMNQAGALLLQTLLHWLLIPRFGLNGAAFSTLAVM
ALLTIVRMAELRSQLGIPALSGKLWKPLAAGIAVGAIMLAIRYGASSLPP
LMLLAVAGVAGGFVYILLIRALRLEREEMEVILNFMPFLKRQRTNAAP
>CT0442 hypothetical protein
MAGNAMKTQAHWLNWKHLKVYSSLLLALFLIYGIGGVYFSKNMVDAGGHP
LGLDFIAFWGASYLALAGHAQDAYNIPLLFKAQQIGVPAAKVSYPWFYPP
SYFLVILPLALLPYLAAYGTFMLSTLGGYLLVFRRIIRGKTAMWCLAGFS
GLWMNFFDGQNGFLTAALAGAALLNFERRPVLAGVFIGLLAIKPHLAMLF
PVALLAIGAWRTLITAAVTAITFMAAGTAILGTAVLKAFLASLGDARLFM
ENTHLLWNKVPSVFAFLRLLGTSATWAYAVQFAVAVVAVIIVWRVWRHCR
NRNLRNAVLMTATFLVSPYAFYYDLAWLAFPIAWLALDGLRNGWLRGERE
VLVAAWLLPLMMVLIAAMLKVQVGPLVLGSLLWMTYRRATTASMTGAPAS
AAPAKISSRLYSKRCYSLANTSTMAKKSEHAAERKTMNADAHWLNRERLI
FYSRIFLALFFGIGVGLVVTSKHMVTGDFVLAWAASHLALTGHALDAYSI
PSLIKAQQIAEPGPQDVYGWFYPPSYYLLILPLALLPYAAAYWSFMLSTL
GGYLLVFRRIIRDKTAMWCLAGFSGLWMNFFDGQNGFLTAALAGAALLNL
ERRPVLAGVFIGLLAIKPHLAMLFPVALLAIGAWRTLITAAVTAITFMAV
GTAILGTAVLKAFLASLGDARHLCLENGSLLWSKMPSVFAFMRLLGTPVT
WAYVAHFIVAVVAVIAVWRVWRNCQNRNLRGASLMTATFLVSPYVLFYDL
AWLAFPIAWLALDGLRYGWQRGERAVLAAAWLLPLLMMVQIIAHLNVQVG
PLVLCSLLWMTYRRATTASMTGAPASAAPAKISL
>CT1513 hypothetical protein
MKFPVCDFAESEEGFYSQCASCPIESSQPGCALDEQSDGTYLFIGETPHG
GVRAVFYFKNDAGKQVSKQEAVHVEIHEFDDQDRLVAILYGMVDPEGMIY
LKKSDS
>CT0131 conserved hypothetical protein
MEGHIVITGATGVIGSEVARRLIKSGREVVVFARSPQSAAAKVPGAADYV
RWDSDMAPDGWSSSIDGAYAVIHLAGRPLLETRWTEEHKVACYDSRIKGT
RALVAAMASASVKPKVFVSSSAIGYYGSFDRCEETDPLTEKAAPGKDFLA
KICFDWEKEARPAETLGTRVVLLRTGIVLSTKGGMLQKMMIPFSYFVGGP
VGSGDQCLSWIHLDDEVSIILQALDNADWSGPVNAVAPEPVSMKAFADSL
GLVMHRPSLFPVPKLAVQILLGEGADYAVKGQKVSPEFLKERDFHFAWPS
LNEALADLVSRGI
>CT1086 hypothetical protein
MMFPASPQFHRNVSSIALNGISNSSFRPGPACCSLFSGLIGI
>CT1931 dihydroflavonol 4-reductase family
MSGIPILITGATGYIGARLLVDMIARYGDSVRCRVTVREGSDASFLRNLP
VEIAQADMHDPIAVNEAVKGAEVVFHCAGLIAYTRNFRNRLYDTNVLGTR
HIVDACLEAGVKRLVATSSIAAVGSSDAKSGIRESNEQTPFTEWQRHNVY
MESKYLAELECRRGVAEGLDVVMVNPGVVIGKNSEPGMSGSSSNEVLRMI
YEGRLPLCPDGATGFVDVRDVADAHIAAWQKGKAGERYIIVGENLSFREL
FERIAALPGSRSGKVFRVNRVARMLAGVGGELFSLLTKRPSFISIESLRQ
AAHLSRYSNQRSVRELGMSYRPFEETLRSAIMQ
>CT1721 hypothetical protein
MKWLLRIIGTLLLLTILAVCGAWLAFPWYAQSLIDRLTAGKGITVKLHRP
GRPGLSGIGFGQLDATIRIKPDSCVTTPSAFTLKLLNGRLSWKHITGSAT
PTFRMQLDADSVEVLQTPSEIRFRQANPRLRARLDLLPSGGLLPSIAPDS
IAVAVRHGQAEAGQLHIEDISYDVLLTRSNKWVQQPALFRAESLFSGSTK
TPLSGFEATFGLQRHPSKPCTLTFSNCSLQLSGIKASTPKIEFNLRNKRM
AFVLKIGNVPLDRFSPDSGPASLTGKLSGSIPVEYLDSTIRISDGTIDAA
KGTAFAFKTDGTKISFDAGRKPGGPPLIENLNARVTLDATNGTVSVIRLD
SLSARLFGSRIASTPARYDLKSGATAATISIDKASILDRIRLTGEFSGGM
NGRLSGTIPVRIDRNGIAISKARLTAQGSSSIRQKLPKQSSGADELFSKT
AGREVLWEITDPSITLNRETKGTLTVNATLKSLKRKTGSGELLLTSPKGS
LTLFAYPGKPSLVTVSDFSAGLLDGTVAVNHMDYDLKSKHTETLVQLNGI
PIQSLLDLQGASKLSATGTIRGAIPVVLDNNLFSIPEGRMDAEKNGLIIY
ASTPEERAAAGAGMRLTYEALGNFFYSELVSNIVMTPDGNSTISIQLKGR
NPDFQNNRPVNLNLNIQQNLLDLFRSLTLSSDIEEAISKKVLENSGGKKK
KK
>CT0138 nucleotide-binding protein
MRGEFLSRSADETREYARRFASGLKPGDTVCLTGPLGAGKTEFMRGITEA
FGCEEQLSSPTFSLMNIYEGLLRGQPFELHHFDLYRLESEKELDSAGFDD
YLSGPFLSVVEWGERFASLDRRYTRRVQLFIAGESQRKIVIT
>CT0068 hemagglutinin-related protein
MSGTAWANGTEVPPPAPSTYTPPPPPPPVETPAPAPVVKQSNAGPYISGA
VGLGLPEKLEVLGEGDEKVKMDSGIALAGALGYNFNPVRLEAEVGYHRHD
ISDDYEIDGHVSLLTVMANAYYDIDAGSGIKPYLMGGAGWGHTNVSVTDK
SDDVFVWQVGAGVGAEVAHNTTLDLGYRYVKPNDFLVDNGGPKAKWAIHN
IMLGLRYQF
>CT0666 H+-transporting ATPase-related protein
MNEHEQRLADLCTAPVENTLDSLNTSLDGLSTKEAKKRLAEYGPNELTHQ
KRLGFRADMFNRLKSPLAVQLLVIALVSAVIGEHLPGEVDHVVKVEAPFA
PAGIRH
>CT1482 hypothetical protein
MTSTIDTPKKVIILYDSMAVGGSADRLIDAIGRNLAQAGAYVEKARCKPN
ADYSFVEEFDLVILGAPIYYLLVASKLSGSLSQSNLRTVLKGKKVALFVI
CGSPEPMAQFLYLPQLKMHLDNPVILAEKVFAPAETSDQNAITEFTKNIL
DAYGKAR
>CT0802 acetyl-CoA synthetase, truncation
MLDDSNPPFYRWFAGGVTNTCYNALDRHVDEGRGNQIAVIYDSPVTGTIE
KFTYREFRDKVALFAGALQARGVRKGDRWH
>CT1627 spoU rRNA methylase family protein
MNRKDHAEIPAQKPREQVVYGRNAVIELLQSKPESVEKIYLQFNTSHPKL
KEIVITARRLRIPCGKARMEKLTLLSGTAKNQGVCALLTSVTFYTLNEVL
AAPRNSSPLLVILQGLEDPHNVGAIIRTAEAVAADAVIMIEGKGAPISAT
VHKASAGALSHMRVCKVKSVVKALDLLRERGFSIVAADMEAEHNHTDIDW
KRPTVLVMGAEGSGLGSETRNRCDQIVRIPMAGCVESLNVSVAAGVMLYE
AMRQRLG
>CT1241 YjeF family protein
MNPVLTAREMSLADRAAIEELRTGETRLMELAGRETVRMIAERFESGRSL
DGLSAFVVCGKGNNGGDGFVVARHLLNKGARVDVLLVWPEDDLAGVNLEG
LHILKAYRRYNEGLRIFAGIDEARTAVGATEYQVVVDAIFGTGIRIDPDN
PQLPEPAKSAIELINFVNTHSKAVTVAVDIPSGLDATNGRCANPCVKADM
TVTMAFLKTGFFQNSGPSLCGEIQTAEISIPEFLVEPTSCLLVDETFAAE
SYLLRDPSSAKHLNGKVLLVTGSADGGGSMLGAALLASRAAVKTGAGYVC
CSLPDGQASVMHSFAPEVVVIGRDMISIIEKAKWADAIVIGCGLGRSEEA
QELVETLLCTPEIASKKLVLDADALYAIAERNLFNRVTALEDAVLTPHAG
ECSRLSGLSVEDIMLSPIDTARMLAEEWNANLLLKGTPTFVAAPSGMVLI
SNSGTEALASAGTGDVLSGMIGALAAKGLDTHEAAAAAAWLHGRAGDLAS
NVSSLVSSVDVLQAIPQAVLELFESEE
>CT0318 radical activating enzyme, putative
MSTEAPLNISEIFYSIQGESSFAGWPCAFVRLAGCGHGCRYCDTTYAEEP
GTAMTIDEIMHRVLAFDAPCVEVTGGEPLLQSGTFGLLSALCDRHPVVLL
ETGGFLPVDRVDPRVHAIIDIKAPSSGVMEHNCAANFTLALNEPERFEFK
IVVASEADYLWAKSYIAGHGILGKCSIIFGPVFGQLEPRLLAEWMLRDRL
PVRMQLQLHKYIWNPDARGV
>CT2287 conserved hypothetical protein
MIMSKAKPPRELGGIIADVFCKIGMTEAYDEYKTLHAWKNVVGETIAKVT
SVEKMKDGNLYVKVKSPSWRMELNFRKRDITKRLNKAVGYEMIRTIIFK
>CT2057 conserved hypothetical protein
MVINGLDILLQNPEVLRHRRVGLIANQTSVSMQLEYSWMLLKRAGVELTR
IFSPEHGLFAIEQDQIAVGHQPDIGCEIVSLYGDSEHSLAPARELLDGLD
LVLFDIQDVGSRYYTYVNTLALFMKAAEGLDIEIMVLDRPNPLGGSLIEG
PQLDPAFHSFVGVMPLPVRHGLTAGEIAHFYRNYKKLDLNLNVIPMQGWS
RQMLFPETCLPWVPPSPNMPSFEAAEVYPGMCLFEGLNVSEGRGTTTPFL
LSGATFVDPEALAERLASMPLEGVVFRPTWFRPTFHKYAGEAIGGLYLHV
TDHARFRPFATGVAMTCALYELYPEQLAFLDGVYEFRDDIPAFDLLAGSS
TIRSMILDNRDTTAIIASWEKDEAEFAAIKPNFHLYNN
>CT0500 hypothetical protein
MIYPATHWQYSGYSAKTLISKIYFLLKFNFYT
>CT0886 hypothetical protein
MTGIESSTFRNDAHIQRYYGHARWYGRYRLLEQGTDLA
>CT0229 transposase, degenerate
MRPKRHESLCETSEATWRHLNFFQHKACLTARVPQISSPECGLLKLQSVS
LCSWPGQWRSRRSPR
>CT1749 moaA/nifB/pqqE family protein
MNRMAGCINDHLKRYGGLAAGREDEQKRYFAEKRLYALQIETTDACQQGC
IFCYAGSTPREHHGLTSDEIRGLLRDAAALEIRAIDWLGGDPLVRPDWYE
LMQYARSLGLVNNVWTSGLPLKSKEVAARVHEVSEGGFVSVHVDSITPEV
YAKLHRGGNPHFIEAIVEGVDNLLALGKPADMMINCITYTSLQGPEDAIK
TMRWWFCEKGLRTCLTMFNPAGMGAEWRSLEPQLDEVQRVYTERDRIDYG
GDNISIAAMDTDKYYCGTMATVTFTGDVTPCSVIREGVANIRTTPFRDIV
ARHLDTLVHAALHDVQNLPNPCNDCVNNAHCWGCRASAYHYSGDADGLDP
KCWLIRTALTSDSFSVNNNLQKSTDEEIGLKP
>CT0843 conserved hypothetical protein
MVRFIELVRHCLLDVREILPWDLVDRLKENPGLLILDVREPNEFDAMHIA
GSLNVPRGILESACEWDFEETEPELVNARQREIVVVCRSGHRSILASHSL
QVLGYENVVSLKSGLRGWNDYEEPLVNKAGEVVDPDFADQYFTAKLRADQ
MRPKRAS
>CT1210 hypothetical protein
MRQLNLNFTSIQKGLALFRRLFFVALFGLLMAGCSLQRQDKLSADDLHFA
AFYADYLARSGVTDKGNAAPLTDLTPAGLDTLFVRHKLDQKTFDARLDAY
SRNPELWRKVLLEVRRNLDQESRK
>CT1072 thiol:disulfide interchange protein DsbE, putative
MKRSTLSTCRVALFALVLSVGLSANAHALDKGDKAPDFALPGKTGVVKLS
DKTGSVVYLDFWASWCGPCRQSFPWMNQMQAKYKAKGFQVVAVNLDAKTG
DAMKFLAQVPAEFTVAFDPKGQTPRLYGVKGMPTSFLIDRNGKVLLQHVG
FRPADKEALEQQILAALGGN
>CT0955 hypothetical protein
MDKSSIFFPVSHQEKTISPKQSQNKIMIITTYIDTIAETLFREYMLHKEI
HNHLG
>CT1014 hypothetical protein
MTFMSVGTLKLKERQALCLRIDAEQIQPVLEGDFEDMELCGLLLGPMERC
PKLPVERQGSRQGSDDACRRSPIPVLYLLYRDQMSSSSIVSEWAHWIPAL
IMEALEASWFRRASTIMTC
>CT2262 conserved hypothetical protein
MNAPQWLGAEGEKIAARHLAAKGYRIVARNYRFHRNEIDIIAFDGEALCF
IEVKTRASLGKGHPAESVTRSKQKEIARAAAGYLASLDDPWITCRFDVIA
VLALSIDERSIRKYEIEHIKAAFMVGDKG
>CT1064 conserved hypothetical protein
MMTRSALDFLVELKANNNREWFEANRTRYEQAAADFFDTVARFIQSLTSF
DPEIAEVMPDPKSCIMRIYRDVRFSKDKTPYKTGLFAYVSKGGRKGALAG
YYLHLEPGNSFGGGGLYLPEAPVLARTREAIETSFDEWNAIVTAPELLAA
FPDGVLPSGVLKRAPKGYDETSPAIEWLRYKGYVTQRFFSDGEVVDEPFI
EQLGDCCHAVMPMVAFLNSAIESDTP
>CT1346 hypothetical protein
MTSDPHLIFADVSLDIKCGAFQSALDKLKEVERWLPESYIFHLLTARAAR
GLKRYEEAIEHLGHCCRIAPANQVAWRELIEVKTLQSQAPEPARTPAIDE
VAVEFEELSKALAGFTPPRATECFEPTPIAEQKQPFPDDASIAVPTESLA
KLFVNQGAYKKAIRVYTSLIQLNPSKADHYRQSIDKVLEKL
>CT1875 hypothetical protein
MLDHIGAHLAEAYKAYLHMKSPFLGVVTPFVDYWFRPPVNNPKTSATETS
NQTLHLVCHVILNEVKNPVPGK
>CT2155 GTP-binding protein
MNITTADFFCSYSSLNGLPSDGRPEIVFVGRSNVGKSSLLNSLCARKGLA
KTSSTPGKTRLINYFIINDNLYFVDLPGYGYAKVGQGERESWGKLLTGYI
QKRGEIALVVLLVDSRHPGMASDLEMMEFLDYCGRPFGIVLTKWDKLKQA
EKSKASRTIESCAPNARFIVNYSSLSGSGRDRLLASIDTFTQ
>CT2271 lysophospholipase L2, putative
MTNRVTNGQPERLLTGVLILHGFTANLESVRALFGPLGRFDLKMATPLLR
GHGAASPDELRGVTWREWLDDAENAFETLTGTGGKAVVIGHSMGALLALQ
LAARRPELVDSVILATPPVRLTSPLGPGRPLHFLAPLVSHVVDRWDMEAR
FADPGSAIIPKQYDWAPTKTILSMFELLEETMRITGRVRVPALILQARHE
SVVLPESAEILTRAIATPPEAKSIVWFDKTDHQIFCDCERKAAVDAVVSF
VSKRFPAATNQSVKA
>CT1929 TIM-barrel protein, nifR3 family
MEQSAMRIGTIDIDRPIILAPMEDVTDRAFRQLCKQHGADIVYTEFISAE
ALRRGVEKSIRKITVADHERPVAVQIFGNTVESMVEAAAIAETYEPDYLD
INFGCPVKKVAGRGAGAALLKEPEKLSEIASAVVNAVSLPVTAKTRIGWD
HDSINITDTVRRLEDAGIRAVAVHGRTRSDMYKGRADWGRIAEAKQACSI
PVIANGDVWLPEDAVAMFERTGADGIMIGRGSIGNPFIFEQAKSLVKEGV
RLSAPSYRQRIQAAVDHLRLSLEYKGEKYGVLEMRRHYSTYLKGLPGVSK
VRNLLVREPDPAVIVELLREFEAACEALDREGLLKEGGENLNDHSPRLIL
NY
>CT0892 conserved hypothetical protein
MNRFLELLRRPLPHLSTRDAVLLRLLMVPNLLLVEVEEAEELRKTGSSPC
IYAFNHNNAFESLFVPALLIWLLGSWRVSFVIDWMFGQLPVVGWLMQRIE
PVYVWHKRSTVPFLERRRVKAASSSSRDECIRRLDAGASIGIFPEGTRNA
DPFRLLKAKPGIGYIALESVALPFRTEVATLAGFS
>CT1236 phosphatase, putative
MNYRLLVFDFDGTLADSEASIMGAMQLVAKEFGLSEVDCVKARPTIGLSL
LRTIEIGLGLEAGDAAAAVELYRRYYKEIAFDSTCLFPGVKETQEQLRQN
SLLAIASSKSRQGLLSRMRQFGIVDHFSFIAGAQDISGL
>CT1117 hypothetical protein
MDTFTKADDFIAALTGFRQMVQSVELLQQFYDEAGAALLYDCHTASPAGV
IRTAEFFTLTGD
>CT0919 glycosyl transferase
MSRSSNPLVTVTVPMYNNERYIAETIRSVLNQTMDDFELLIYDDHSTDRS
LEIVRSFDDPRITIFSSERNLGPEGNWNRAISDVRGRYVKLLCADDLLFP
ECLEKQLAAFELSENEGVSLVSAQRAIIDPEGKVLINKVNFIDGGLKAPD
EVVRKMVRMGTNIIGEPVCGLYPARLIALTSGYSARIPYTIDLDFWMQLL
RLGKLWMIDEPLCAFRISNLSWSSRIGEMRHKQFLAFMEEVAADELFQIN
DLDLFIGRFNSAVQSMTSLMGFKLFAGSAAEKRVIAVQ
>CT0895 phosphate ABC transporter, periplasmic phosphate-binding protein, putative
MNLFKKIIFGAAVMGLMAAGDASAAGKIVIDGSTTVGPIAKAFAAYFHKK
TGIDVTVSESGSGNGAKSLINKTCDIADMSRPMEPKEIAAAKANGVNPVE
HVVALDGLSMIVHPSNPVAKLTTAQIHDIYLGKITNWKQVGGPNAQIVFV
QRESNSGTQECFEKLVMKDDKIAPAAETAASNGAVKSRVASTPNAIGFVG
LGFVDKSVKPLIVNGVKPSIATVRNGSYVLHRPLFMYTNGQPKGDVAKFI
NLPKTPDGKRMISELGFVNK
>CT2046 hypothetical protein
MKLIALIGDEETRPLIRKMFTAHQVTLFSSIAIRGCSCETGGEPVAWWPA
GKDIPTVYSSLCFAILEDEKAEKIMKDIEANPLAADPAFPAKAFMMNVEK
MI
>CT0625 Nudix/MutT family protein
MTHLAEQARFIIDFIIIFSALTTSYHIGNVVCAIIEREGRFLIARRPLGK
HLARKWEFPGSKVETGESEAEALERELIEELGVRMEIVERLMPVEHCYAD
RSLRLIAFHCRIAAGAPNAGEHEELRWIDIGEADDYDFPEADLPILAEYR
QKIAASVQSLPGKRRGTA
>CT1038 CBS domain protein
MQTTTPSNIIVERVAADLGKYPPFDRLSLDKNRKIAASLAVEYHEEGETL
FRKGDELGRFSYMVMKGAVRLFDLIDGEEVLVDLCDEGDLFGVRAIFGTS
TYLLSAQVVEESLLFAIPVETIKELVQSEPSVAAYFTGAIARSVQHIEHS
LSEAIDTRRSLMETGGGSLLANETLVVDQVRNVITCAPGIPIREAAKIMS
ENNIGSIIVVAENRHPLGIITDTDLRKKVVAIAGQVNERPVSEIMTSPVY
TITAGKTVADMMMLMVRTKLRHFCITEDGTADTPVIGIISEHDIVTSEGI
NPAVIMKAIMQSESVEQLSQEREKAEKLLKMYIAQDVAIHFISNIFTELN
DALIVKAIEFSVADMKREGIDLPDIEFCWLSLGSEGRKEQLLRTDLDNAI
LFRDPENEASRETVQRVFLELGKRVTAILVACGFKPCPAEIMASNPEWCQ
PLSGWMNYFRKWIGTPEPKALMNSTIFFDFRPVYGSTALALEMKREINTE
IARGRGFLQFFAKNALQNPPPLGFFRNFLVEKSREHAHEFDIKARALMPL
SDAARVLACEFGVTDFLGTVERFRRIGQLLPSFQELAEEAAQGYEFLMRL
RTEHGLAEKSSGRYIDPKHLNKMQRQKLRDLFTTIGKVQSMLNLRYQLDY
IRA
>CT1883 conserved hypothetical protein
MPIRRLREFLDSHQVKYFVISHSPAYTAQDIAAAAHVSGNELVKTVMVSI
DGKMAMALLHAPRHLDFDLLRELCGSRDVTLAEEIAFSGLFPECEIGAMP
PFGNLYGMKVYADEELDESMDIVFNAGTHRELLRLSWFDYKRLVNPVMGR
IASIR
>CT1501 ABC transporter, ATP-binding protein
MKNLLALRPYLFRYRKSLAAGIACVIATNFFAVAAPRYLGEAIDLMGKPF
EMHEILHKIGLYILFAALSGIMLYLVRQLIIVTSRHIEFDLKNDYYSHLQ
TLSTSFYDTTGSGELISRGTNDLNAIRDFLGPGIMYSINTLFRLLFAIGA
MLAISPLLTLFALLPAPFLSWSVYKRGVSLRFQSKKIQENYASITNLVQE
NISGIRVVRNYNREEWETSRFEALNQDYYDKNLRLGRIQAGFMAVLTALT
ALSLIPVIWAGGLGVMNGTMTIGDIAQFVVYVTMLSWPIISIGWVTNIIQ
KAAAAQGRLDEIFNTKPDIVEPASRQTSETKKPLKGELAFEHVSFAYPSQ
PEREVLRDISFTVEPGTKVAIVGATGSGKSTLVNLIPRLYDPTSGRITIA
GQDIKSLSLGVIRRTIGFVPQSNFLFSDTIRNNIDFGSNGGGIDVIEASR
IAMLHDDVEDFPDGFDTMLGEKGINLSGGQKQRACIARALRWHPSILVLD
DALSAVDTATEASIFDGLLKKLPETTIVLISHRISTVRNCDRIIVLHDGA
IAESGTHEELLQNGRYYADLYMQQMLEEEISALS
>CT0044 hypothetical protein
MAVIEKAQGFILGDWVDRRVSRQTAKDFFSSGAHVFPEHEKPVYIDMTPT
ALGKGPRLPAIPVSGTTPAKSRRSSLWENHEQLATLFDDWEVANRNDLFS
VVGLKKFNYIANSDFHERRHLLSWKTLLRCEKMSSRLRPPSERMSRFRYF
CAVVEKSGCDLRPQKRNDGRCFQALCRFV
>CT1762 TonB-dependent outer membrane receptor, putative
MKKAVLMVLLLAAVRPAIAAETIASSDQSATPVFTAGEIVVTGKSANSSE
TVPAAEMEMLDKQNLSEAADLLPGVTISNVGGRNEGMIYVRGFDMRQVPL
YLDGVPLYVPYDGYVDPNRFLTMNLSEVSVSKGFTSVLYGPNTLGGAINM
VSRKPEKKLEGNVSAGGSMGNDEIDAGFESVNIGSNQGKWYVQAGLSVLS
RDFMPLSSSFEPVANEDGGERGNSDTRDRNATIKIGYTPNSTDEYSITLN
SQHSVKGVPLYTGSNPNNKSRFWRFTNWDKTSLYYIGRTSLGEKSYLKTR
AYFDNYYNTLESYDDATYTTQHSNKAFSSRYDDHTVGGSVEFGTSIAQSD
ILKVALHDKYDMHNEIGNTGEEPLKFADNTFSIAVENTWNASKNLTLIAG
VRQDFRSTIEAQEFTDNTHTAIHSFDLEDNRAANAQLAMVGHLDACNTLT
SYVARTTRFPTLKDRYSYRMGRAIANPGLDPEQSLNYGLDYAFTPSGKFS
LHASVYRSKLDDTIQQVNNVAYDTATSTWLWQMQNTGKSTFTGFEYSMDW
QPEAWLKAHVAYSYIDRQNDSNPDLVFTDVPKHKLNGSVQFLLNKETWVL
VESEYNTKRYSTSDGKYTASPYGLLNLHINAGLTDALSLHAAVNNLFDRN
YEVAEGYPEAGREYIASMNYAF
>CT0187 conserved hypothetical protein
MKLTILTDNRAAPGLTCEHGFAVLIETGGKRILFDTGQLTAIDANCRALG
IDLSDIDIIVLSHGHYDHTGNLADVLRIADRATLYLHPSALIERYSIRND
KPKPIDMPETAKQAINGLPKERVVWVTEPTRLTDGAFLTGPVPRQTTFED
TGGPFFFDPDGKTPDPIEDDLSLWIEKPEGLIVLAGCCHAGIVNTLDYIE
SITGQKRIATLIGGMHLSAASPERLNRTVASLANRDISRLIACHCTGQAA
VERFSKELPYPVEAGYAGMVVESANE
>CT1306 hypothetical protein
MGCFQSGQIPGFSIDLGHCFVNPQNRLFPMAMMDPASWMFVLVVSIVCSI
ACAIVSFNKGYRGSPVFAWFGAGFVFSVFALIAIILAPHHHEV
>CT0327 conserved hypothetical protein
MLRFKELVLTMQEFFYFALRAFATLPKAKRYWRDVLDQAFICGVESIPIV
LVSSISIGALMSMEVGNLLEEFGAKTMLGRSTSNAVLRELGPLLMGLMLS
ARYGARNGAELGAMQISEQIDALRAFGTDPIAKLVMPRLLAALIMFVPLI
ALSDFAGLQTGALVAQYYHKLDPGIFWNSIYPRLLPMDFVVSFLKAPVFA
IIITLVSSFNGFSARGGTAGVGRSTIKGIVASSGLVLVANFYVSKLVFDI
MH
>CT0315 cell division protein, putative
MKEGFSGIGRAKLPAFVTIAVGFFSLLLLGLFGTVSLSFYQVIQEVRSRV
ELEVFFDETVNDDQARSLGEKMQAIPGVAATHYISRDEAASRFRRDFGED
VVTILGMNPLPRSVTLNIDQRYALPDSIAVIRKKLEALHQGLDIRYNQEY
LTGIEKNARLFTLITAGVGGVIALATIILNAFTVRLAMYARRDRIKTMRL
VGATRWFISAPFLIEGAVLGLVSGGLAALGLWLIFEQALLRYEPAIYQIL
HPSTYVIYPGLVLLGIVLGFFGSTWSVARFLRVS
>CT1059 hypothetical protein
MLVGTEILGIACGNAGGGARAVKTPYSYNTISDFLLSQPLTVDVEVDDET
AGCFPLKCIELDATVLYVGMRNFARLTLDLSPTEILIYLNMFLVWMRDSL
AAERFCVVERFLDSSIVLLFSKKFGSEEPFFDAIRAARWMGEHDELLFAP
EMGIASGRVSVGFAGTPKEFTGSVFGRPVLLAAACAKMRPQDEAMASCIT
FPEEEWRGRSFEELFPPLEFDHPECGRVRQPSTWRLGEPRSVDFASHGRI
DLRDVGNFVHWTSKITAKEKAREWFAQIKAKGFYKYQK
>CT1035 hypothetical protein
MKQSWLWVLLPLGIVLAVLALFFALFFGNLINGELQFAPIALFTLSLSAG
SYALVRGFRLGWNLSTIISAVIALFAFLASIAGLTLELQGMRIGGIIAGL
LSVTCLAVLFVSNGMDEISVGKRKKTEIAAGPAEWADRIEAIGRRCTKPD
VRTKVLRLGGETRFLTPGTGQADLMVNQTIGRAIDELAEAVKLGNDSAAL
SMLPGIRSLFAQRENQLKP
>CT0720 serine cycle enzyme, putative
MKSDGTLGWSAISTYLDSLASADPTPGGGAAAAVTAAQGAALLSMVCNLT
IGKKKFADVEHEVHSILDQCETARRQMLNLGDRDMEVFGTIMQIYKMPKA
TPEEADARRAAIQQALKASAEVPFELFKRCLAMLPLADRLEKIGNPKVLS
DVIVGRYLLVAGLMSAMANVEVNLASIDDAEFCKAKREVMLPAIASMGVS
WRGLAGV
>CT1665 conserved hypothetical protein
MSGHSKWATIKRKKAATDQKRGSLFTKLVKEITIAAKMGGGDPTGNPRLR
LAIDTARANSMPMDNIQRAIKRGTGELEGATYEEITYEGYGPGGIAIIIE
TATDNRNRTVADIRHLMSRGGGSLGESGSVGWMFKRKGSIEVPRSAISED
DLMELLLDAGLEELESDDEQYFTVLTDVKDLEPVKKALEEAGIPFENAKI
DMIPDNYVELDVENARKALKLIDALENSDDVQAVYSNMDMSESVMSVLEE
E
>CT1215 thioredoxin
MQSRIPFDFQNDVIERSKTVPVLVDFWAQWCGPCRILAPVLEKLAERHAG
KWVLVKVNTEEFPEISAQYGIRSIPNVKLFSNGVVIDEFTGALPEYQIEQ
WLAKALPSPWAEEVARAAAEMAAGNDAVAITLLEGVLAKEPENRDAAAML
ARLILFSRPEEALRLAETLEAEPDYADLSESVRTLGALLVRPASAFPDGE
GREAYGEAVESLRKGDMDSALEHFISVLRTNRYYDDDGSRKACIAIFRFL
GEEHEITMKYRRAFDRAF
>CT1911 hypothetical protein
MCLRSNEPPLLPNRCYQQYFYLSMSKYFLVAKPDFGSPVSLSMYEIRRCL
CLISITVTCCPSLTRPIIFEFFLSEYLKLTLVLAVLVLLSLFSSDSIFDE
NSLLIFSISLLVSMITSELLLSVGITKGL
>CT1534 nitrogen regulatory protein, P-II family
MLMIRTIVRPEKVHDVMQGLLDAGYPAVTKISVVGRGKQRGLRVGDVVYD
ELPKEMLFLVVPDADKDFVIRAIMDNAKTGDGKFGDGKIFVSAVEEVYTI
SSGMKESETLLEAKEA
>CT1221 conserved hypothetical protein
MKPFSKTQRIVAISSIAVAILLFLVLRPAPLPVDAGVASYGPLEVSVEDE
GITRVIDRFTVSSPVNGKVLRSKLVEGDSVRAGMTVAAILPPEQNALEYR
ESAAQAGSASASVNEALARQHETSVRLAQARLKAGRYERLYSEGAVSKES
SELATEAASVLEKEARAAAAAVRAARMQATAAEARIDAGVASKAVDVRSP
VDGKVLRIIEKNERFVPAGTPLVEIGNPGLLEVVIDVLSSDAVRVRPGDR
VWIEDWGGGGALPGVVKRIEPAAFTKVSALGIEEKRVNIIAMLEKPEPRL
GDNFRIQAKIVTSKADRVLRVPVSALFRGDGGWEVFVIDAGRAKVRKIKI
GMRGADMAEVLGGLREGEKVVTHPPNELEPGMRVATKDE
>CT1494 carbon-nitrogen hydrolase family protein
MLKSKLRIVQADCTLANFEENLERHIKAIETAIRDGADAIAFPELSLTGY
NVQDAAQDMAMHIDDRRLDALRELSRDICIFCGGIELSDDYGVYNSAFMF
EDGAGRSVHRKIYLPTYGMFEELRYFSAGRQIETVTSRRIGKVGVAICED
FWHMSVPYLLAHQGAKLLLVLMSSPLRLSPGQGVPAIVTQWQTIASTSAF
LLSCYVACVNRVGNEDSFTYWGNSAVTTPDGSIAASAPMFSEHSFDATID
YSVVKRVRLQSSHFLDEDTKLFASQLETMLSAKHQG
>CT2071 hypothetical protein
MTAMTTTHSGFNHRLRSYILKKHGSVNAFCRAVGIKYPAQMTPYLKGKCL
PGKKMLERLEKDGADIDWILTGHAVGEQTSPISDALALSRYRMDIDNLLR
QVRMHIGRFEERLKPAIEAYAVLDDEEKIVDLSSSIEKFLDFAPGSLAGV
ALSEILHPEDYLHFKTALDRQKPHEAVLMFYSRFKTGDGSWLNVEWSLSI
KRKPMTDQYEYTIILRKTDNPLF
>CT2198 hypothetical protein
MLGQFARFWTFTPASNGKALARAFGFYDNALRGGEVGPGNGFAVRCVKD
>CT1563 conserved hypothetical protein
MNNKTSIVAGWIIIASILIQFIPLDRVEHPSKPILGIPASVLAQLEAHCF
DCHSSRTRWPQSAYIAPLSWYVTAKVRQARKAIDFSNFDALPDDGRRNIK
RATSSLARSKGLSAHGEIPGFPKIKMTERERQALTEWAADNNRK
>CT1392 hypothetical protein
MKAGVVSRSSRVNSAIERMFPDPIYVRLSNKRFREHITKPYASVAVNASG
F
>CT1012 hypothetical protein
MSLFKKAVGVAALYCLLAMGGEVRAADSGAPYTVKAAYESLSLPAGESMG
MLGLGVERQFNENFSGGVGTWTAVRGERGGFITIGFQGTARVPLSETFGL
EAGAFVGAGGGRGGATLSGGGLMLRGYTGLTADLGELGRIGAGVSYVDFP
NGGAIDSTQPTVFYSIPFGSSSRQFDGLAYERNSLAVVSKLVRVRSGARD
LSGKVQDDFTLLGVEWRSYFDNEMFVRFEAAGAAGGSSTGYMQVLVGAGL
QIPLSDNFWIDGSLGLGGGGGGDVDTGGGFLVDAGADLRCALDDDLFAAA
GVSYLRAPNGSLSAFCPSLEVGGTFGKESQKHDKLPVRVRMVSQRYFNGS
DGWRTHDADKDVDNLGVQFDYFAKPWAYVTGQALAAYDGQAGAYMIGLVG
GGLHQTIAGPLFVEAEGLIGAAGGGGLAMGSGLAWQVNGGVGVQVSKDVA
LMATLGRLDAFNGPFKADVVGLSLAFGGR
>CT1702 conserved hypothetical protein
MKVLNKETDYAVRALISLGMKSDGWVSAKVISDEQAIPYQFLRRILQELI
RNGLVESKEGAGGGVKLAKEPGDIGVADVIEIFQGKVQLSECMFRKQLCS
NRANCVLRHEIMRIEKMVNNEFSQVTIGKLIDDLKAVEAMNKTRQTEQQE
AR
>CT0632 membrane protein
MGVVMFFIPSYFSSNFYPLAAAFLFAVVGLVSLKAGILQSLHGEPVVTQE
GERVISYGPVLFPLVFFLQALFLWGEHVWILQISMLVLGIGDALAALVGT
AAGGRHIENLTKSRKSIEGSMAMFISSLVILSVSIFVFRDAFTGGLVGQP
IWKLLALALLLALLVTAVEALLSWGLDNLFIPLAIAYVLYVVDVNSMVTI
DGLLLGGLFALFIAIFSIKVKFLNNSGATATFLLGTTIFGVGGMVWTVPM
LTFYLLSSILSKLGHKRKAKFDLVFEKGSQRDAGQVYANGGVAWIMMVIY
SLTGDPYIFFAYLGTLAAVQADTWATEIGTMWPNAKARLITTFKDVPVGT
SGGVSIPGTLASFLGSLLICSSAVLMNVSWIDQVGIVTSLLVIGVSGLFA
SLVDSFFGATVQAQYYDPIRQKVTERTHSIASDGSRVANELLKGYDFVNN
DLVNTLCAISGSAVAYLVVRNLVSLSL
>CT1898 protoporphyrinogen oxidase, putative
MAQKEVVIIGGGISGLSMAFYCSQAGMKTTLIEKENAVGGSFFSPQFSSN
SDKFWLEMGAHTCYSSYQNLLGIVEGCGIADTIIPRAKVPFTLLKNGKVS
SIMSGLNIFELLRSAPNLFFLKKENMSVRAYYSKIVGENNYRNTVSHFFN
AVPSQPTDDFPADMMFKSREKRKDVLKNYTFKNGLQSVALTIAAQKGIEV
HTGSPAQSVSTSGGKYVVKTADGQEYAGDALVIALPSAPAARLVHDINPG
LADHFTKLRYADVDSVGVVIRKSNVSLKPVAAIISPNDIFFSMVSRDTVP
DDNFRGFAFHFRPGLDDNAKFKRITEVLGVSLSKIEHTITKKNVVPSLRV
GHKDWASKTDQMLGNVRNLYMTGNYYGGMAIEDCVSRSREEFDRMKISR
>CT0942 FecCD transport family protein
MSLTRLHKTNGGDGCRPMIVKGSPVNRRRALLLAAAIPLLLGLSLLLGPS
GIGLPDTSSISGRAIIDLRLGRFVMGMLVGAALSTSGTVFQAILRNPLAE
PYVLGVSGGAGLGAALSILVGGGILGAAGLPVTAFFFAVITLFAVYAIAA
QGGGMPSVYSLILSGVIVSAICSSVIMFLVSTANVEGMHTVMWWMLGNLQ
PSSWGNQFIALLFIAGASVALWLLAPALNVLTLGREMAHYQGLNAGTVTV
SSLLLATLLAATAVSMGGMIGFVGLIVPHVMRAIFGPDHRWLVPASAFGG
GAFLVLCDVIACVVMPPIEIPVGVVTALAGGPFFLIVLRKRMKYAWND
>CT1531 hypothetical protein
MARRVIVSQAASENRASLPDVRLAFQDFLVSRQHWSDRF
>CT1964 cation efflux family protein
MIIHIRRTDEKQRWLFFSTLLNATLAVAKLGWGWMMGSTLVVADGIHSIS
DVFGALLIFLALFFAAHKSERFPYGLHKLEDMAAVLGGLGILYAGYEIIH
SVFFEAGIKTPEAIWTTIGFILAIVFIQYTFYFFELKAAQRLGSPGVKAD
AVNWLGDIGAGMIVVIGLVAHHYKVPYAQEVSVIIIVLMILKGAYDVLKE
GLLSLLDAADIEMNDTIRNIVMAEPDITNIKRLNVRKSGSVYFADIELSI
AEVATAKAHKSIDEVVNRLHEEINTLEAVTIHYEPDHPPYRTVVRLLDKD
KKHLSIHFGQTAWLEITHIEADNKQISKEFVKNSAVIAPKGKAFRLVAYL
LSIHTDTIVMGNAELDENILTLFEALNIEVKKEKNDET
>CT0896 phosphate ABC transporter, periplasmic phosphate-binding protein, putative
MIRHRGRFTLLFYSLRGNTPLLAARFFLFIKKYGVTVSVSESGSGNGAKS
LINGECDIADMSRPMKPMEIEAARRKGVNPVQHIIALDGIAVVVHPANPT
RALTKAQLASIYLGRVTNWKQVGGPNARIVVIQRESNSGTQSSFEEMALG
KGMRVVRTAETAASNGAVKSRVASTPNAIGFVGLGYVDRSVKPLKIDGVL
PNVAAVKSRAYPLARPLFMYSKGAATGMVARFLALPSTPDGRKIISELGF
VNK
>CT1916 hypothetical protein
MCHLRVGCLFYTDAQLAYMRYSIAPYIPYQTNKRNKVARYCPVFSCRVYY
KSSNGLLILSKLCWLHSPHPHPRQPNDRNSAKRLTAAALHLRIRAARSLW
KYLREAFAVGER
>CT2108 hypothetical protein
MCRLMTSIWQKNREPAPDVPRYARIIGTEPP
>CT1007 hypothetical protein
MLLFRKSFTILAFALTAIFSFGATLNAASPEPQAAAVQPATKGLFVVVTS
DDPMTQMMAMVLSTQTLTQGRSVRVLVCGKAGELVLKGSKEKLFKPLDKS
PQMMLKGLIAKGVTVEICPLYLPDAGKQPSALIAGVTIAKPPVVAAAMAE
DGIKLLTF
>CT0023 hypothetical protein
MPPKEMTMLDNDRKKDKFSEQFGGTVRALSEYLGIGIQIAASFALFVFLG
YWSDSKLGTSPLLLLAGVLVGMVGMALVLMKTIRQADREHDRLHQHTRNH
EKDRRT
>CT1254 hypothetical protein
MAVTGFFFLFVVYSGRILVFLIFFPVVPSITVPYL
>CT0352 conserved hypothetical protein
MHSTALKVALTLLLGSIMLLTVAAAGLRFDSGKVSEEAVNGAKIALADYL
AAHPEAPPRALAVIDYSQPSYVKRMAIIDLKTGRQSFYRVAHGKNSGELY
ARRFSDVPESNMSSLGLFRVGERYLGDHGLALRLDGLDSLRNGNAAKRDI
VLHKAGYVSIPFILLNVVTGYGPMIGRSNGCFVVSENDIDEVVQKLAGGG
FIYAWATPDDNSRK
>CT0340 alpha-amylase family protein
MGVWRPSRYSAAIAASHRGLRSELLDYLKDLQAEDIVSSPYSIPAYEVSK
VIGGQEALVAFRKRLADMGIKLILDFVPNHMALDNRWLPEHPEYFVTISR
HEQCHDPSSCFEYTKGKYLAHGKDPYFPPWTDTLQLNYANPATHEMMIHN
LSHISDLCDGVRCDVAMLILKDVFNTTWQNLAGKMSEEFWPKAIEAIRHK
HPKFLFLAESYWNRESELQQLGFDFTYDKPFYDYLGASPVNVPKLKGHLL
ADWEYQQKLCRFIENHDEVRASERFGPNHAVAALVMLTSPGMNLIHQGQM
LGLKKKIPVQLIRHSKEPSDKALLHLYLKLFEFRREAVIQEGHTEWLELN
TNGQSLCFGYCRTIPDVYAFILANFSATGIDTQFSHPALAGLGDSAITAL
STRFPEHPHELHLHDSTLSIRLAPHEGVLLIAKNS
>CT1750 hypothetical protein
MKAKATAIGILFTALAWTGANAAVISHNDITGTNPGLQNPYTTGITSDYS
DDITASGIGRVGLTGKTADDRYNASGWSTGALDTGKYFTFTLDANDGYAI
NFSSFEYRAQRSNTGPTSFAFRSSIDGFTTNIGSPTATGATIDLTAPQFQ
HLTNPIEFRLYGYDSGSGNGTFSVNDYTFNGTVEAVPEPGTLALVGVGSL
LMLGHARHTRKRMDIMA
>CT1281 hypothetical protein
MTASKESSAQRRHVKSTLVDDEKPTIEVMAADLTMPETSK
>CT1263 hypothetical protein
MVKHIVMWRLCDEAHGNSAQVNARLMKEKLEALSGRIPGLLSIEVGLDFS
CTDSSADVVLYSEFANKAALETYQSHPEHEALKPFIGGATRERRLVDYEC
>CT0281 hypothetical protein
MWPPCCLNIYDLPTMTTIINYLTAHPVVLGIAVIFSLLIVLSFARRVLRF
LIVLAALGILYVAWVSWHGGNPAEKAEKASKSVKQAVHKGEGAMKAVDWL
FKSEDHPRKEEDN
>CT1218 hypothetical protein
MKKPFQAFLLFSLFIAPSIVKADRSSTGKIEVLIDNVRSTDGIVDVALFS
TKKGFPDKSALAIEGRSIPAEKCCKVIFENVPYGAYAISVLHDENGNGKM
DKGLFGIPKEGFGTSNNPEIKMGPPSFSDSRFDLKSKEITLHIDMKYLRK
PENQNQR
>CT0673 hypothetical protein
MAVLCGRTSIPAWICWMLRVENGTRLDSGVWHRA
>CT0468 membrane protein, putative
MSGRFANRPFGCRQNGIESMDNLILIAVSFTAGLLMRRFSRMPEQTPVAL
NMFIIYVSLPAMVLNYLHGLDFRPSMFLPASMPWLTFGVSALFFVVAGKL
LKLPRATVGSLILCGGLGNTSFFGFPMIEAFYGKQGIVHGIIIDQLGSFM
VVSILGVTVAGIYSHGSTDVRSIVKRVLLFPSFIALIAAVLLNDVIYSAS
FANLVKRLSDMLAPLALFSVGFQFNPGHIGKSRNTLALGLAFKLAVVPAV
MFVIYVMVAGMQGLPARITIFEAAMPTMITGGIIAAEHQLDPPLANLMVS
FGLLVSFVTLALWTVVLRGV
>CT1748 methyltransferase, UbiE/COQ5 family
MLKSWQLLKSFHCRSNSFINQEREVFMAKYKLDVNIANVNEVYDGAGGIL
WEMLMGEQIHVGAEAETDVLARKAGVTAETHLLDVCSALGGPARYLAKNY
GCRVTGLDATQRMHAEAIRRTIEAGLSGKIDYVLGNALDMPFPASSFDVV
WGQDAWCYITDKQRLIGECARVLKPGGVLAFTDWLEAGPMTDEELTALNT
FMVFPYMETLDGYAMLAEQAGLTVIEKEDLTPDFAAHVQGYLDMVQNQYR
QAIVDNYGQEMYDAVEQGIMLWRDASAAGKVGRGRLVARK
>CT2159 mannose-1-phosphate guanylyltransferase, putative
MKQIADKHVYAVVMAGGIGSKLWPLSRKKSPKQFLHFFDGGTMIRKTVER
IAGAVRRENILIITGKQHRRLVNDSLPDFDVDNIIIEPTIKDTATCIALA
TTFIRKRDPDAVTIVLPADHLVLDEEHFLKTVSKGVRLARKQRGLITVGI
HPDRPETRYGYIQVEPSVMIGEENDDPDDDDIALFRVKTFAEKPDLATAE
HFLQSQDFWWNSGVFIWHVDDISREYSRSLPDLYEDMQNVHAFIGTERQE
SVIEDVYSWVHPVSIAYGIMEKAEKVYMLAGNFGWTDLGCWDEVIKIGLG
LEGRDMDGSESIMIDSKNVFVRKPHGKAVVSIGVDDIIVVDTPDALLICK
AGESQKVVKAVEMMRREPGLEGFL
>CT1648 hypothetical protein
MTISNAQKRNRMKALPILKGNKFIVYYLTL
>CT0822 sodium:alanine symporter family protein
MQSFEALLSQVSDLVWGIPLLVALFGTHLFLTFRLGIIQKHLLYAIRISF
TRSHEGTGEISHFGALVTALAATIGTGNIVGVSTAVAMGGPGAVLWMWLT
GVFGIATKYGEAVLSVKYRTINDDGTVAGGPMYVLEKGLGMKWLGVIFAV
FTVVASFGIGNMVQANSIAKLLEAKYQVSPWLTGVVLTVFTGVVIIGGIK
SIARVCEYLVPLMAALYLIGCVVLLVMGHHSLGDTIGVIVSSAFTGQAAL
GGFAGAGAREAVRYGIARGLFSNESGLGSAPIVAAAAQTSHPVRQALISS
TGTFWDTVVVCAATGIVVVNSGAWTSGLKGVELTNXAFSGIPHIGPLVLS
FGLLTFVFSTILGWSYYGEKALEYLAGRGAIKPYRWLWVAAVMVGSVASL
QVVWTFADIANGLMAVPNLVALLLLSPVIASETKDYFSGRE
>CT0932 hypothetical protein
MGMSKRVEDLMHLREITCKIVQGVFQGKRFSWGGDWRRVGGAGLCIALLF
EKFFRNQLTFWFFAEFRE
>CT1327 rubrerythrin
MPTTHENLKNAFAGESQAFMKYTTFAEKAEKEGFKNVAKLFRTTAQAERI
HAQGHLAADDAIGSTAENLETAIGGETYEHNEMYPPMFEQAVAEGHKAKR
MFGFAVEAEKVHAALYRKALEAVKAGQDLAETEIWLCPVCGHIELGAPPE
HCPICNVKASAYIQIA
>CT0080 hypothetical protein
MYNKIHPDVVLDEADEALDEPNYNNFNTSPDPPSPYADLEKKYRKNKFNK
TRTNTSPGLYGTHGLVPVNGTKPAGKNIQKKRGKKAVASKPFRASNGNKS
A
>CT2093 hypothetical protein
MFDGHRRRGSSGGRCDYWQKHTEQEKKQMPCWFHGALVSDGGKFWGEFPA
SLAEQCPAISDNFP
>CT0009 RNA methyltransferase, TrmA family
MHTPLEQQEIRYRKGDIIELTITDHAEKDKCFGKTTEGMGVMVSGILAPG
DRVSAQIYKVKSRYLEARAIEVLEASPDRVEPVCPVFGSCGGCKWMHVSY
EAQLRYKHKKVTDSLEHIGGFESPDVRPVLAAPDALHYRNKVEFSCSNMR
YLLQSEIDSDQLAKPKTFALGFHAPGNFEKVLDLDTCYLAKECMNRVLNV
LRDFAIERGLEPYAAKAHEGYLRNLMLRYSERHEQLMVNIVTSWYDKALM
QALKERLEAAMPEQQMTLLNNVTTRKNTVATGEQEYVISGDGYVTERLGD
LDFRISANSFFQTNTRQAETLYDQIIAVGGITPEDTVYDLYCGTGTITLY
LARHCKQAIGIEVVESAVKDAEMNAELNGLSNTVFFQADLKNFHAMQEAL
EPYAKPRIIVTDPPRAGMHPKALDTMLKLQPERIVYVSCNPDNLARDGKE
IAARGYRMTSAQPVDMFPQTNHIETVACFERAE
>CT1227 hypothetical protein
MPLGKMFIILVFATGLDLKSLFSDAAFALPVTMPERV
>CT0045 hypothetical protein
MVVVFRRFVALYDRFELLWEDQRTERMAANLLILAFAFSLALIELARQGL
LPAEVAARVPKSHYYAINTALSMLLGLEVFGLVFSIAKSVSASVGKQFEI
LSLILLRHSFSELVHFSEPLDATEASLPVLFMLAYAGGGLLIFLLLGVYY
RLQHHRAITRDKEAAADFIVVKKIIALLLLLIFFVIGVIDGLKYLNGNAT
NEFFSLFFTILIFSDVLLVLISLRYSSSYPVVFRNSGFAVATLLIRLALT
APPWFSVLLGIGAIAFSIGITFAYNRFENLEIQRSAPI
>CT0751 hypothetical protein
MKAKNTMPFENAFSYTPNKWSIVFNGTFIIHTRSIGISGITKTQKSRASD
IPNCPFPQNTDYVKGQSNLLPAARTISKNIYSCIGTCLFKPVKGLQYIKN
RQPQGGVTINLAARKGTADRKKLTLRDFCLMCSNNQEFVHA
>CT0219 glycosyl transferase
MNKQQIRELPMIRILCLHLAENQASYRYRVEQFLPYWKEYGIDMQPLRIT
SKSYPEKLSIALRSGEYDYVWLQRKPLSTIFTSIIARKTRLIYDYDDALY
AVESYLNGKPKPTQPGSKQIIRRLNTALKYSSLVFAGSDALAEYARRFNP
EHTFVVPTAYRKLLEPPSPKTTDETVTIGWIGNTGNLYFLKMLDQAASTI
QQSHPSVRFSVMSGKPPEGLKTRWEFTPWSKEAEESWLDSIDIGIMPLED
DEWSRGKCAFKLLQYMAHGKPVVASKVGANTKAVIHGISGILADTEEEWA
KAFEVLILNPEQRKNMGQESLRHFLETYERQRVQEQMATILHDHFRQTKI
HHSR
>CT2195 hypothetical protein
MSDSHLLNTRQNLPRERFSLGIESGLRHFLKSVPPPRLVQRALWSDHAPG
TMPVKKS
>CT1756 conserved hypothetical protein
MKLVTISGPPSSGKTAVLAGVIRAMMASGLKVGALKYDCLATNDDDVFRA
IGVPVQTGLSGNLCPDHYFVSNIEECLDWGIREGFDFLLSESAGLCNRCA
PHIKGVLAVCVIDNLMGATTPKKIGPMLKYADIVVITKGDIVSQAEREVF
AFQVRRANPRAKIMHVNGITGQGAGELASQFLLAPETTCLDGSRLRFSMP
AALCSYCLGETRIGMEFQLGNVKKIKMSEEPAR
>CT1091 membrane protein, putative
MLMNPIIEISIPQLLLALLFIVVAQATSFVQKLGLNRDISIGTVRTVSQL
FLMGYALTFIFRAENLWLTLGIYVVMVFSAVFIVRGRVKEKQIAYEVPTF
LTMLSSYFLTALFVSWLVIGVHPWWDPRYFIPTAGMVIGNSMSALAISIE
RLFSQMRQQRELVEMKLCLGANYKEASLDIFRGAVKAGMIPSINAMMGVG
LVFIPGMMSGQILAGADPLIAIRYQIVVMFMLVGSTAMSTIIVTLIIRRR
CFGKSEELVVTAE
>CT1585 hypothetical protein
MSKIKAKRVGFRLDMTPMVDVAFLLLTFFMLTTKFRPPEAVTIDLPSSHS
NMKLPESDVLTVTIAKDNSIYMGVSSQRTRERLFDMVVRPKLENAGVSKA
AVADSLSRFRLDDSFKIEKEELARYIMMSRFADQRLRPVIRADNKADYEA
VNYVIKVFRKMNLLNFNLVTVLEKEVR
>CT1224 hypothetical protein
MSNNFTGYMTLISYGWTNRMVKCNFQNVRWKFPETIPRRKIQ
>CT1688 hypothetical protein
MAFSCGAQTDGLFTVSGVRILFLGLEKHTAAFSDDILTEAGTDGRIRGID
GKALTLLALLTGLCFLFVHDLHIRNTVYRFNNG
>CT1758 iron(III) ABC transporter, periplasmic iron-binding protein, putative
MRAASTRTLLWLLPFFLLLLAACQPSGGKKPDAATREVVDMAGRRMTVPV
TIDRAYATRPGSVTLYAVAPDLMVNRSLWMTDGAERFMSPAYLKLPFSDG
SAEEIVKLHPDVIISYFAINPQSIDQADRLSQRTGIPVFMVDMEMKRYPE
AFAAMGELLGRQEQAAKMTAYIEKWLLPVFAKAAQIPVSQRKRVYYAEGN
RGLNTDPSGSFHSQIIDLVGATNVAQVNRLSGKGMSAVSMEQVLQWNPDE
VIVWTGMGPSMTTYRAIAADPLWQRVPAVRKGRIHQIPFLPFGWFDRPPG
TNRILGALWLAQLVYPEVYHIDLETATREYFTIFFHRDLTDAELREVLHP
FAFTDVAPVQSAKSKPGHDETSR
>CT1816 translation initiation factor, eIF-2B, putative
MIDAISFKNGTFRYLDQRFLPLQEIHVETKNHQEAIEAIKTLAVRGAPLI
GASAGYTVVLGINEYTGDKAGFPEYFKNLIAEVNASRPTAVNLFFATKKM
QEVYDANFENDSLEALFAKMTDMAYKIHNDEIDNCDKIARHGVELLKQDF
ADVLKTRKLNVLTHCNTGTLACCGIGTALGVIRLAYQEGLIERVITSESR
PLLQGLRLTAWELEHDGIPFASISDSSSAILMQRGMIDFAITGADRITAN
GDTANKIGTYAHAISARYHGLPFYIAAPVSTIDITMAEGSEIPIEERNPD
ELRKIFGTQVATPTTPVVNFAFDVTPGTLIRGIITEKGAVVGNYNEGLAR
VVNG
>CT0557 hypothetical protein
MIATVSANRSDTMRMFSGSRSIPVFVIKIFCTCFLFVLSVTLLPAAGSAA
SPEMIRITRERNRVESTLKDLKKQLSEYQSKLNSTKKNERRSLAELNNIR
KQIKVYERLIVENQSLLNNLDNQIEQLQSQLQENRKTYQHVSSDFRQVVI
AAYKHGGDRDAELMLSSGSVNSAIVRARYLGFLSGAVR
>CT1067 conserved hypothetical protein
MDAKTTILEVLKKEGKPMSAGQIAEKSGLERKEVDKAMKSLKEEELIVSP
KRCYWTPKV
>CT0466 hypothetical protein
MGGKWSTPSFDEAARLVLGRGFHHVSLCAAGYFSDGNETIHRESVLSAID
PSVRVETIPCLNDSPHLIACLAGRVVAASRQILAFSGDLKGRSV
>CT0576 NAD(P)-dependent cholesterol dehydrogenase, putative
MAKTVLVTGASGFIGSHLVGRCLADGCRVKALVRKGNACIDSLRASGVEV
IEGDVRDATAVDAAVRESDLVLHAAALTSDWGKMQEFIDINVGGTRNVCE
ASLRHGVGRLVHVSSFECFDHHLLGRIDEQTPYKARKQSYPDTKIGGTNE
VWAAIKRGLSASILYPVWVYGPGDRTLFPLLADSILRRQLFFWARNAPMS
MIYIDNLVDLTMLAASRPEAVGEAFMACDGEAITFEEVCRRVAVAIGSPV
PSLHLPFGMVRSVAGVMEFVWRIAGSKKRPLLTRQAVDVLASRALADVSK
ARTMLGWQSHVPQEEGIRRTLEWLVTVDPARWKVK
>CT1592.1 conserved hypothetical protein
MIVSGGQTGVDRSALDAAIAAGHAHGGWCPRGRRAEDGVIPEKYRLVETP
FSRYAVRTAWNVRDSDATLVLTSGLVAGGTKLTVECAQRYGRLCLIVDLC
GETDAGTVAEWIWAHGIGVLNVAGPRESERPGIGENARRFVAEIIDRDRW
RTPAGS
>CT0396 hypothetical protein
MWCPPWKHTGLKGNPVRVRNSTRCCNSALAARLATRFADNATVPFRDGKA
GRIRESQKTCLIFFGFGNKATYYHETQLRSAFRNRLGSGIDPVDAQPTVP
LAFQSIRSSCLSRSVPLAFRRPGAIRAVFHRPPSILSHSSYRLLFAFFAS
IRSTLSTRSSRA
>CT0746 deoxycytidylate deaminase, putative
MQEESKSGCCCASSSGQHDSDGAEKRLGWHEYFMCVAHLISRRATCTRGH
VGAVIVRDNNILSTGYNGAPSGLPHCNETNCKIYRSIHPDGTVEENCVNT
IHAEINAIAQAAKHGVSIKDADIYITASPCIHCLKVLINVGIKTIYYDKP
YKIEHIAELLRLSGIRLVQVNAVNC
>CT1142 hypothetical protein
MPIVQKLNLKNRKNKVAIVLQNGLQNAVTGLFHSGGSQKGAPRVDLFEDV
DSAVQWIAA
>CT0440 hypothetical protein
MKSDAHWLNWERLTIYPRIFLAGFFVFGLGCVFIAKKKFGLNGIPFGADF
ITFWGASHLALTGHAQDAYNNSLLLKAEQLVIPVFRFTYIWAYPPSFYFV
VLPLALLPYVAAYWTFMLSTLWGYLLVFRRIVRGNIAMWCLAAFSGLWVN
LICGQNGFLTAALAGAALLAVERRPVVAGLFIGLLAIKPHLAMLFPVALL
AIGAWRTLITAAVTAITFMAIGTATLGTAVLKGFFANLGYARLFLENGGL
PWKKMPSMFVFLRLLGTPVTWAYAIHFFVAAGAVIAVWRVWRHCRNWELR
NAALMTATFLVSPYVYDYDLAWLAFPIAWLAVDGLRNGWLRGEREVLVAA
WLFPVLMIPIAEAVKVQIGPIVLCSLLWMTYRRATKTSMTGATAIDDHAA
QFETVP
>CT0004 ribonuclease P protein component, putative
MYPGMLSEKRTNALPRGEIARGKSAVSRLFSQGGRLKGGFLLLIYSASRP
VEQAPRIPVRVLFTVGKKLVPRAVDRNRIKRLMREAYRLEKNILTRALAF
DAGKGDHQVMLAFLYRARADAIPSLERFRAEIRHMLKNLLSNRLPQTREG
DRIE
>CT0544 hypothetical protein
MSALRSMGAILADETHIMISREMIMNDYSMVITANGNNRNGFCLEKSKKL
K
>CT1389 hypothetical protein
MEQEIMRTWVWSTSSPVPIILLNVFLWAFYFVPSLLAWTRKHRSLPAIIA
LNILLGWTGLGWIGAFVWSLSWPGHDNSQPPAAPTQTASEPDQEG
>CT1489 hypothetical protein
MPNTSNRLPPMRRLFSVLPALLILILAIPSSVHAAFTGEMDMKLTMPNGK
ADITYLFGVRDQKMEMTMMLDRIPEPLRTTVITRRSTPDEAIIVNHKSKS
WSIVNLRSAAESATLLDFDSNYRFTRVGLETVKGYPCEHVRLTSSTDTLD
LWMTKGLADFSTFRLLQSQNPRLSNTSLARVLKQNGVDGFPAKIVQKNGN
GLYIMQMQKVMPKPTPESSFRVPAGYRQTEPNQINIDKRQKEHLRQLMEK
MKKFEE
>CT1461 kinesin light chain-related protein
MDYLQGRNYELQINYPEAERYYKKAAAIEDENPLYLNARARILWEPGRYP
DAEPLIRHAFAIGEKALGPKHPNTVQCTKNLNALLEKKKQCGFIRCAAWR
TAAPTRRSDPSSLAGNPGCRGQFRRRRRRDRRETG
>CT1002 hypothetical protein
MPIILSGDETMSNAHAELIEAARKKFSRFRELSEKHGEAKAWEIMLEGFP
EIQKQRMGPLLSLPTLAEAFRQAIPQFKSIGMEMDVVDISNRGTDAVLEV
QRICPWLEVCKECGYAIPCHVICELDMAATRLAFPEIKGEILCRQALGAP
VCIFMYERAAKTSTRTDNATT
>CT0798 LAO/AO transport system kinase
MSGRHRHEPTVEEFVEGIRNGDRRLLSRAITLVESSRPEHEQLAHEILDR
CLGDGSASIRIGVTGAPGAGKSTFIEALGLDLVRQGKRVAVLAIDPSSSR
SKGSILGDKSRMERLAAHPEAFIRPTPSSGFLGGTSPRTHETILLCEAAG
YEVIIVETVGVGQSEIVVNSMVDFVLLLMLPGSGDQLQGIKRGIMEIADL
VAVNKADSGRQTIAENSKADFEAALRLLPEKHSGWKRKVLLTSALEGSGV
SEVWKTIEAFAAAMQQSGEWNEQRREQSRHLLHAIAEEQLKRQFYNAVRV
KEAKAEVEQQVLAGKLSPFTGAVELLKAFRSDRSV
>CT0689 hypothetical protein
MKTENIGTNFDDFLQEEGLLDEANAVAIKRVIVWQIGQEMKAQKLNIIAR
NDDTEKEISF
>CT1134 CRISPR-associated protein, CT1134 family
MEHWNKTFCLEVKGDYACFTRPEMKVERVSYDVITPSAARGIFEAIFWKP
AIRWRIRKIEVLNPIKWISVRRNEVGQTASERSDGIFIETARQQRAGLFL
RDVAYRLHAELEFVPPSERPDAKRPVPESLQDGRETSELRKDENPGKYYA
IFERRARKGQCFNQPYLGCREFSCEFRLVDDLANEPPPISETRNLGFMLY
DLDFQKNLKEPPPAFFPACLEKGVIKVPDWESEEVRK
>CT1835 heavy-metal-associated domain family protein
MKTEIHVSGMRCSGCEMLVSEALEEMEGVQKASASHQTGVVNVEYDESKA
DLELIKKVIEEQGFRVTA
>CT1082 conserved hypothetical protein
MKLDLDAFRIQPGKKPNLAKRPTRIDPVYRSKGEYHELLANHVAELSKLQ
NVLYADNRYAILLIFQAMDAAGKDSAIKHVMSGVNPQGCQVYSFKHPSAT
ELEHDFLWRTNCVLPERGRIGIFNRSYYEEVLVVRVHPEILEMQNIPHNL
AHNGKVWDHRYRSIVSHEQHLHCNGTRIVKFYLHLSKEEQRKRFLERIDD
PNKNWKFSTADLEERKFWDQYMEAYESCLQETSTKDSPWFAVPADDKKNA
RLIVSRIVLDTLESLNLKYPEPSPERRKELLDIRKRLENPENGK
>CT0573 hypothetical protein
MIPALKLPEATPLQSWMLRFAQHDRKEWCSSNFYRFLRERFFSKTQPHKS
SFHNQALQSLETRSRMVKYQRSPILVK
>CT1805 conserved hypothetical protein
MEKAKLQEYWKINLGYLVGLLVVWFVVSYGFGILLAEPLNAIRLGGFKLG
FWFAQQGSIYVFVVLIFVYVALMNRLDKKFDVHED
>CT0534 hypothetical protein
MLYVVGGMGVVVWRGASKGEWVEMYLAGGDLKKSLKWGGIAGGILFVLDL
VNTVIYYSKGAPPMVDMLGILVNMNFLFLFPILVLAEEFLWRGLMLSSMV
EKGFNPHLSVFVTALCYVLNHYAVAPVGMYERGLMAMMAFPIGILGGYIV
LKSKNVWGSVLVHMITMFSMVLDIFVIPNVVPSLFHL
>CT1464 hypothetical protein
MLYHLTLLEEQGNGGKLVVAISGDSETGKSGISHSLALLLKAQSIRIKIL
HTDNSYRVHPLDRWKHRISNSNNYELVTDFSKIQTLVIDGNANNLKMHAW
GKCHPQDCDWGEVNAYPYAPNVSSPIEPQAQAVSAVYTTNFSQTLVIVKP
AGPNMIRAEVFTRFTDNSNRSNYTEVYPFRRQLRLTPIRPFPGLLPMPGA
AVQEDCVSFNPVTTTVRKIDGSWKIVDGSHWLFDFGDKESEARTAYAIIK
KYGFTRSCFVGRPDPSFNYLRK
>CT1402 exodeoxyribonuclease V, alpha subunit, putative
MQKPADSHEERLSGSVERVTFHSEESGFTVLRLRVAGQREPVAVVGLATS
PAVGEFMECVGTWQNDRTHGLQFKASKITPAQPTTLEGKERYLASGQVKG
LGNFYAKKLIERFGDAVFEVIENDPERLLEIPGIGRKRLDMVVKSWAEQK
AIRDIMLFLQSHGIGTARAWKIYRRYREQAIATITENPYRLSLDIEGIGF
QTADAIATSLGVAKNSLMRAEAGISHVLGELSQNGHCAVPEETLIDEASR
LLQIDHPLVVEAVKNEKLTRRIVGETIDGVPCIFLESLHRAETGVASGLH
RLMRGPLPWNALDPHDALPWIEETTALELSPSQRSAVTLVLASKVSIITG
GPGVGKTTILKAILSVLQREKVSVALCAPTGRAAKRLTESTGIEAKTIHR
LLEFDPQAFDFKRGRGNPLDASFVVVDETSMVDVVLMQKLVAAIPDHAAV
VFVGDVDQLPSVGPGAVLADMISSEAIPTVRLTEIFRQAAESMIVVNAHR
INQGDMPLTAEGEELSDFYVVKAASAEEIHDKVIQLVTERIPKRFGLDAV
KDVQVLTPMNKGGVGVHALNAELQAKLNPASEPKVTRFGTTFAPGDKVIQ
TVNNYDKEVFNGDIGLIEAIDPEAEIATAFFFDHHRAEYDFNDLDELSLA
YATSIHKSQGSEYPAVVIPLAMQHFTLLERNLIYTAVTRGRKLVVIVAEP
KALTLAIKNCKSKRRLTGLVQRLRESGKQSWQNL
>CT0102 transporter, putative
MSTVPEEDFSGNHDDSLPLPSGGGLKSKLLAAFPPFASRNFRLYFVGQIV
SMIGTWLQMVAQGWLVLEMTGSAFWVGVTAATSTVPTLLLSLFGGMIVDR
YSRRTILLWTQALSMMLALILGAITLSGHITIPAILVLAFLLGSVGALAT
PAIQAFISEMVERKDLPSAVALNASIFNASRVVGPVLAGFMITWVGTGGA
FIANGVSYVAVIAALLAIRPTAAPPRPAVEERPIQSIRSGIAYTRRHPVI
RAIVLFVGVVSIFGWSFMSMLPVVARQTFGIGAAGMGYLYSAFGLGSLSG
TVVVSMTSGKVRADRLVIGGILTFAVALAAFTFATWLPLALFFLYLSGLG
MLSAFATMSATVQRLVDDRFRGRVMSIYLMVLLGLMPFGNLLMGLLSEHF
GTPAALRTGAMVTIAATIFLFMSRGEITKAWQEYRTTTEA
>CT0947 hydrolase, alpha/beta hydrolase fold family
MDSFLDIQRKKADADSKFIDCNGFRVHYKRYGSGKPPFIVLLHGSFLSIR
SWRDVAVPLAENATVLAFDRPAFGLTSRPVPSRSNAARYSPEAQSDLVVA
LMDKLGMDRAVIVGNSTGGTLALLTALRHPRRVQGLVLVGAMIYSGYANS
EVPAVMKPFMKAMSPVFSRLMKVIITKLYDKNIRGFWHVKSRLSDETLAA
FRNDFMVGDWSRGFWELFLETHRLYFNRRVSSAWAPSLVVTGEHDLTVKT
EESFRLARELPRAELLVIPDCAHLPQEEQPAAFVAGVKKFVEKLV
>CT1291 conserved hypothetical protein
MSGSIWPIIVFCSSGNHAADALDVEGDRRTGSRSLAIIFGARQAMRISAS
VFALVIAGSAVPFAAGWFGWLCLLPIIAFDAVVIWSVLQLLDSRTQQKLR
YIHSIYLAGTATVLALIVIRLVTG
>CT1232 geranylgeranyl hydrogenase BchP, putative
MQRYDAVISGAGPAGCSAALSLAQKGRRVLLIDKARFPREKICGDGVTAA
STELLEEMGVLELLRQRPGSLTEFRGATFVSPGGTVVQGRILRNGHLSGS
SYVIPRMVLDDSLVSRVKEHSSITFLDKTTVTGLVMDGDRARGVATSAGE
FCGRIIIAADGAYSPIAKQLDLWNRDKRQQGFAMRAWFSGVEGLGDSIEL
CYDKAMLPGYGWIFPAGEQRANVGVFLLPRFADQRSTKRLFKSFVKENAF
AAARLKNAVMEPGSLKSWPLPFGSFAGRRGRGNVLLAGDAGSFIDPLTGE
GIYYALKTGRYAAEAVANALERNDESAALTRYEQLWRGEFSGRVYFPGYA
FQHFMNNSWFVDTFMRYTAKKQHRADLLADVVAHNRKRRELFKLLNPFY
>CT1865 hypothetical protein
MIVESFHKAGDKQTGKEGIDHASALWPCMRSPLAFL
>CT0321 conserved hypothetical protein
MPWLSSRGQPSDKKFKASAQYLPLIMETSTILWLASALLLITGFAGLLLP
GLPGVLLIFAGLVFAAWAEGFVYVGWGTLTILGVLTAAAYLIDFLGGLLG
ARRFGAGKYGIIGAAIGTIIGLMFGLPGIIIGPFAGAVGGELYAQKDLRT
ASTAGFGVWIGMAIGIAARIAVAFTMVGVFVVVRFL
>CT1899 acetyltransferase, CysE/LacA/LpxA/NodL family
MHPEIHDSVFLAEGSYVIGDVKIGAHSSIWFNAVVRGDVCPITIGEKTSV
QDNATLHVTHDTGPLKIGSNVTIGHAATLHACTVEDNVLIGMSATLLDHC
VVEPWSIVAAGSLVKQGFRVPSGMLVAGVPAKVIRPITEEERANIAESPE
NYVRYVAHYREEGYEGR
>CT0408 hypothetical protein
MRQPSHHHHTMAGGPDGETEMVSATLDSALAEKRGGLMQKKA
>CT1871 oxidoreductase, Sol/DevB family
MTAMTHWISEPLDALLEKAVAFITDTAYRAVAERGRFTLVLSGGNTPRAL
HQKLARGIREERYLELGYKLPEEARRCTRNPDVIVPPWTHTLLFQGDERY
LPPSHPDSNYGMARQTLIRNVCLKPSNIHRMPTESGDPEADAQRYEMLLK
GLFHKRNSNNAPPSFDLILLGLGDDGHTASLMPDDKGALEEKERWVIAVN
APNGKPPGIRLTLTLPVINEARAVLFLVPPSRHELARSISNGERPELPAG
MVKPRSGDVWWFVEGVV
>CT0062 conserved hypothetical protein
MSRCPEDSRTGNGRNGAPDPGELSAMVRKLEIRSRRLVNELFSGEYHSSF
KGRGIEFSQVREYQYGDDVRTIDWNTSAHKNDLYVKIFTEERERILMLVL
DGSGSMLFGSGRLKKELAAEVSAILAFSAVQNNDMVGLLVFSDTVETYIP
PRKGRAHALVILNEIFSMRQCGRKTDIDAALSFLRRTQKRKSIIFLLTDL
LGSEYERGMKLLNARHEFVLIHIGDPLDHELPHSGLLDLVDPETGERLTI
DAGSRAFLARYAKEQRAKREAVQRQLSRMKVDAVFLDTGKSIIGGLNAFF
RHRERKV
>CT2031 hypothetical protein
MLHYTIKLVFVISYEQGNKTNREMLFFQKFWLRRQSSNLYEALQNWTSPV
SGTSGTFFTTSPAHPSSSTLLSSPKKESARY
>CT0869 hypothetical protein
MKKCEPPPINYGDGLGWCTPYMYVCFNDECKLYVNGWANLKNNYNKIASY
RCMCYPDNGMFDAMCVFSPDGLKGQIVEE
>CT1731 conserved hypothetical protein
MNTFLIVAIIFALTVVMIMSGRGGGNFYVATLVLLGVQMHTASTTSQFIL
LASALVGAIVFGKARVMSWPLAIFFGSLNATMAFVGGFMAHSFTGTLLKF
ILSLLLFVAGVAMLFPEKQARKVAISRFGYWNIQEGDNLYVINLWVAVPL
TMATGFFSGMVGISGGSFLIPLMVVGCGVPVRTAVGTATAMLAATALTGF
AGNALHGGFDPELAIPCGAAAVVGGLIGSKIALKTKPKSLKIISGVLTIV
AAIAMLANAVSGK
>CT2279 conserved hypothetical protein
MCFSRPGYLYLLWLLVPLAVLVFYGVRRKLSAWAKIDSTGGGSGILPAVS
VRKLGLRRIMLLLSAALMIVSIAGPQLCRGQKAVRQKGIDIIFMLDISNS
MLARDTAPDRLTHAKTELLQISRRLGDGRKALLLFAGTPVVQCPLTDDEE
DFEILLDMAAPELITTQGTDYRRAFDAALKLTNSGGELSSNETVLVLASD
GEDHGNDLGDIATAMKTRGVHLHVIGVGGVQPVPIPMQGGPKRDQQGKIV
LTSFKPEPLASLIKTAKGKFYYSRPDAPVHDAVADDIAAEAAEARWIMAP
EQRVPVHGETILAALLLFVSGSMMTDVRRPPKQSA
>CT1586 MotA/TolQ/ExbB proton channel family protein
MKQGFFTAILIVVTYAVSLGFYVWMGTTPPESMFHAVWKGGPIVSVLMAL
ILMVIAYIVERIVALNKASGKGSITEFVQSLKQDVDSGSIDQAISRCDDH
QSSLSAVLRAVLDRYKMLAVHNVTDREKRISEMQKAVEEATLMEMPLLEK
NLVAISTIASISTMVGLLGTTLGMIRAFSAMATSGAPDAVQLSLGISEAL
FNTALGILGGIMGIVTYNVFTSRVDRFSYQIDEAAFYIIQTLGSSKS
>CT0672 conserved hypothetical protein
MPAGPEAFQPVDGRQVHTITSDNGKEFLGHEKVASALRADFYFAHPYASW
ERRRCVGG
>CT1047 hypothetical protein
MLSIANSMQGQSSRSCLLPRPKRSARQAVQLRYLLAR
>CT0797 universal stress protein family
MITIKSILCPMDFSDASKKAYRYACEFAKSMGSKLILLNVIEPRPIAADM
TLNYVPLEEDLAAAAREDFVPMVDEAKAAGIDVSADVIIGIPAEVILQKT
LDFDVSLVIMGSHGRTGLSRLLMGSVAEAVVRKAQVPVLIVKAQEKEFIS
KE
>CT2094 hypothetical protein
MKEAIHTLSNLRLWTGTLPYSCKGRRSSPFGVKRSCNGAPVSSYHKGIDI
AAPSPPRYEIRQQGSWC
>CT1025 sulfide dehydrogenase, flavoprotein subunit, putative
MSKKIVVLGAGTAGTIVSNNLRRHLPADWEITVIDRDDDHIYQPGLLFVP
FGVQKSSTLVKSRKKYITAGINFVMDEITHIDPEKKEVKTKNHTFTYDFL
VISTGCRIAPEENEGLMEAWGKNAFTFYYKEAADQLRLRLKEFDGGKLVM
NIAELPFKCPVAPIEFVFMADWFLKKKGVRNKSEIELVTPLPMAFTKPKA
AAVFTESAREKNIKITTSFELNRVDGKEKFIESVQGDKVKYDTLVIVPTT
IGDPVISNSGIDDGIGYVPTHHNTLKALKHDGVYVIGDATNVPTSKAGSV
AHYEADVVVFNIMAEIYGAKPEEIFDGHSTCFIVYSKGTASLIDFNYKIE
PLPGKFPMPKLGPFSLLKETKMNWYGKLAFEPLYWNVLLDGKHLGMPPTL
VMAGKEVG
>CT1856 serine esterase
MYRIHNTNPFPMFTRNNDAFTWLDYNRNPDKKSPLLVMLHGYGSNERDLI
MLAPTLDPRLRVVSVRAPLVLAPEMYGWFPIEFTPGGITVDREAARQVAE
KLVTFLEHLIEKLQPTGEKTFLMGFSQGSVMSYLTAFRNPELLHGVVALS
GQLPDARPEAGALPEALGDVPFLVQHGLFDDVLPIDRGRQANAWLRDRIA
DLTYREYPMAHQINQASLDFLASWLSERIDRVVS
>CT2156 transposase, putative
MGLASPWQVSISSFDAEKKRLDIHLDVAPGSTFCCPECGILKLQSFPWSG
RDSGFTLLFEAMIMVMARSMAVKAIATIVGEHDTRIWRIVHHYVDQAREM
EDYSAVTTVGVDEPSSKRGHNSVTLFADLARSKVLFATEGKDASTVERFR
DDLVAHKGNPASVTECCSDMSPACISGIGSHFVNAHLSFDKFHVMQIINH
AVDEVRRQEQQERPELKKSRYLWLKNQQNLKAKQRNRLEELSLAKLNLKT
ARADRMRLAFQECFNQPVTLAESFLKKWCFRAANSISPYPLETARSQDND
GWTHRASWAGI
>CT1157 hypothetical protein
MNRIKVSGLFLIFLVITATLQLSGCATVPSTPTDPRMLGYDERVEVTVQA
LAAPDAPSNDKSFFIVPGMQNLSENDLEFMEVSRYITNALSKKGYIRANS
VKSAAILIRLSYGIGDPQTSSRTVEISPGYSYPVGWMWFTQPPQTQTVKE
TTYQRNLILEAYDLKDPNRKSQLWKTIVKSEGSYSDLNRILAYMIAASSE
YFGTNTGRQIDLTIYGHDPRLLDIWK
>CT0378 ABC transporter, ATP-binding protein
MIEIVNVTKTYRIGESSVKALDGVSLTIGQGEFVAIMGASGSGKSTLMHI
LGLLDVPDTGQYRLMGKEVSRMSDDELAGIRNNVAGFVFQQFHLLSRMST
IDNVVLPCIYSGQRGDFRKDALKRLEMVGLAQRSDHRPNQMSGGEQQRVA
IARALIRDPMLIFADEPTGNLDTKNSHEIMRILTDLHRQGKTIIMVTHET
DIAEFADRVITMKDGVVVDDRKKQDARLNPQMPQGGMEAAHSALFQPSRL
LGFVVQAFQSIASNKIRTFLSVLGILVGVASVIAMMALGTGAKASMEEQL
KSMGSNLLSVRGGSAKIGGASQGFGTVTRFTEKDAAAIQAIPNLIDHVSG
DVTGSGQLVYLDKNWSTSVEGVDYDYGEMRAAIPTVGRWFTREEIQERAK
VAILGTTVAMQLFGDADPVDKIIKINRINFRVIGVAPAKGFAGPRDQDDV
VYIPVSTAMYRVLGKLYLDGIYVEVSSAENIAPATQAIDALIRKRHKLAA
DDQDSFNIRDMTQFQQMLSATTQTMSMLLGSIAAISLVVGGIGIMNIMLV
SVTERTREIGLRKAIGARKGDIMLQFLIESVGMTLSGGIIGIVVGVGVSV
MLSAFAGWAVKTSMFSVVLATGFSVLIGLFFGLWPARKAAALKPVEALRY
E
>CT0680 hypothetical protein
MSVNTVTVPAWNNAGVLPPIRPNASGNSGDRSPYVVDLATVFDYFSTSPE
RKTILDGLLRFRADLHTAGITSGFQWLDGSFFEQIETLEKRPPKDMDVVT
FFHLPQGWDQRSLVQHHGSLFDQKLVKKNYAMDAYFIVLGQPTNNWHVKN
ITYWYSMWSHRRDGLWKGFVQVDLDPAQDGPARAV
>CT0542 hypothetical protein
MAALRKLLRLRLLLLPIRLLRLLLLPIRLLRPIRLLRPLLRPIRLLRLLL
RPIRLLRLPTKLLRLLLPKRKNKFLNRDLSTKETETTLSLFLYHVLWNYV
KYWPKALRTLNITGPDVKGGTG
>CT1897 conserved hypothetical protein
MNAETLIESWMMPVAATVLAVLLYLFGNAVLKRILERIRQTVAGSAPVLV
RLLMPIRLLFVLIALAGIVYYVRMPGEIKELLTHILSISSIVGVSWFALR
GFAVVEGVIQQRYRDAGKGDIEARRITTNVSLLRKIVNVLVVILAIAGVL
MTFQTVRQIGLSILASAGLAGIVIGFAAQRSLQTLIAGIQIAITQPVRLG
DIVVVENENGRVEEITLTYVVVRLWDERRIIVPITWFIDKPFQNWTRTSL
ELTGAVTLRVGYHVPVAEVRREFERIIAKSPLWDKRIARLDVTDSGDATM
ELRALVSAANATDLWNLRCHVREKLIAFIFPPQTLEGEEDAVVLPVE
>CT0300 hypothetical protein
MDRQTGECPILRLAGGDEMTGKRYLDEIFTVSLR
>CT0266 bacterial surface antigen family protein
MKTTPFPPDKLSALLIVLALLNIPSSSVFAKAKPSTPANGSVTARPAPAP
IQETRTFTVREITFSGLQTIKESELLKSLPIKINDRITVPGQEIPGVLQY
LWNLQYFSNIQIEQTDLGSGNIKLTFVVTELPVLQSVEFQGNHKFKTKEL
LNTANLNPGRRISNQELLNAENRIEKQYAAKGYLTAGVEYRLQNIGENKV
KAIFTIHEGSKVVIEKIRFHGNTAFKESKLRGVLKETSQNSWWRKIFGQP
KLDKDKFEEDKNLLVEFYRDNGYRDARIVKDTITYTDDKKGLILDIYVDE
GPKYHIRNITWIGNTKDFATTQILDTTFGIKKGDLYNAKKINERLNFSQD
HSDVSSLYLDRGYLSFRAQLEEQVVQPNQVDLVITLTEGDPYTLNIINIK
GNTKTKDHVIRRELYTIPGDTFSRKNVVRSIRELSMLNYFDPETLTPDIQ
PNAKNDTVDLTYNVTERQTDTFNAAIGYSASSGGTGSLGLTFHNFSFGDL
FNPSAYHPLPHGDGQTLALQWQFGSNHYRTLSLSASDPWAFGTHTSLGFT
AFKTKQTYDLTSSNLSTKTINQYGADITIGRRLTWPDDYFAISWRLGYLH
TKGGFVSFLNETNAPEEAEEFSITQTISRNSIDNPIYPRHGSKNVFTAQL
AGGVLPGTINFYKLIGKTSWFFPVSHNLVVNVAAQQGYLNTFSKDDYIPY
TDYFYMGGSGMSSLPTIPLRGYPDHSLGQQFEGETDLYGGRVYSKFTTEV
RYPLTLSSSASIYALAFAEAGNLWASASDFNLTDLKKSAGIGLRLYLPII
GQIGIDYGYGFDSVPSEPGKKPQGWNFTFSFGQPND
>CT1121 hypothetical protein
MSNRNLTTDPRWKSILRPATTCNHQNISDMAMTVEIRDNKLCIEIDLEKP
TPSSSGKTLVVASTHGNAVTDVMIEGKPVTIGLNAYIKK
>CT0888 hypothetical protein
MTVAIELLMLWQQRRLKVFGIGRVHIVRTGAGCPSF
>CT0962 hypothetical protein
MMAENKKWCQKNAGNRPSENLKVKSGRLIYQVMLRQLAVFFF
>CT0475 hypothetical protein
MEKKAVQTSKKGQHVFFLRSSRRIIPSSVAFTFVPCFPAESHSNPHITSN
RKNNVQDCFRYFFSRKY
>CT0414 conserved hypothetical protein
MKLGVFGNAEVHRNGKMISVRFLAPHRVVSTCRVHGGLRDDLDGVFNHQS
CEPAGHMRKDMKTIVAEPERYHQRLCQRYGVSELSASLGTAANMNHAAIA
TRSFRDLSVTAICTGGVEGNAGRAGDPASVWEGETGFEPLDKKGEELPGT
INMMLLINRELSHGAMVRSIVTATEAKSAVLQELAVSSRYSEGLATGTGT
DQIAVACALGGSRPLTSAGKHSKLGELIAQAVINALRDTLSRQNSLTPQS
QRSVYEHIRRFGVSKEEVMEMVAERLSPDESEVFRKNFGGLDRDSMTVAA
ACALVHLYDKHAWGLLPDSCMGEIFVMQGALFAAAVSHRQERIATYAEQL
EAVGWGAGREALLSLITTAWALGYADKWND
>CT0513 hypothetical protein
MTLSACTQRRKQTSQAATLHASLRPKLRKTLEHRALAIRL
>CT1045 pentapeptide repeat family protein
MNKKEPIADREPEPGSQTSKTTQAFEMLRRSVPEWNLYRQEHPGEAVEFN
KKDFSETDLSGANLSDASFKAANFSGASLSRVSFRDAKLNGADFAGADLR
QSDLSNADLSGANLVMADMSEANLDGANLSMADLNHANLKGAILTQSRLN
CANLNECDMREAHLSWADLSGVALSMANAGEADLHDTKLDEADLLGADLH
DTDLHEADLHEADLRDVNLHHANLHEADLSKADLSEADLSDADLSGADLG
EARLRWTNLKGANLSGADFSMANLNGANLTGASLCGVDLTGANLCDANLE
NANLCLANFRGTELNMTNLSGSETFHTCFMNVDLRTVKGLDAINHRGPSE
ISISTLYRSQGQLSDAFMRGCGVPEGMIDHVRSMSGKDFDYLSCIISHSS
RDKKFVERLFADLQKEGIRCWLCPDTLKSNRYLEQYLDRANQCCEKMLLV
VSDASMKGEWLKNTFFKAAQREARDGQRLLFPVSLVRESKLDEWDLTDHE
SGRNFGRELRKNFIPLFYGWERDNALYNSQLNQLVASLKSVDPGRSCQL
>CT1311 hypothetical protein
MFPPRRKHHIWPMSLYRTYNLSHRDERRALMNRTVVVAVAVVVCCGLSGC
TGPTDQKVNLPHTDLNGAIVSPANTPQSSLVSDMPPWIDLYRERNLVNVS
SVDHVVIVIFETQGSREEVYHHYFDKFNGEENFSSFRYNRDIISFVKDGY
GIKITLLDSTKNLWSLEYHRQMI
>CT2137 heptosyltransferase
MVGVLVVTAPDTVRTILVVRLSAIGDIVLTTPVLAELHKALPQARIDYCT
KAPFVPLVASNPAVASVVTPESLAAGSAYDLVLDLQNNRRSRAFVQKLRA
GKVTRYHKRNWKKLLLVQFKINVSAGYYSVVERYGEALDGLVPKITAPCA
LYPSLEERAFAAEALGADGPVLAVCFGANHFTKRYPLERFARIIELVTSQ
TPARVLLLGGKEDEPEAAKLIAMLPEAARSRVLSLAGKATLMQSAALLSG
VDAVLTNDTGLMHIASAFGKKLFVIFGSSVKEFGFMPWGVEYELFETSGL
KCRPCSHIGRAACPKGHFRCMTEIGPERIANRIVETLNEKSS
>CT0222 hypothetical protein
MSPDVIHPKEFREGVPDRALNQRQFQMVLASRPEKMILTRTGHFEFLKET
LAGAGFKSPMEAVSAQERRALVGKISGCYDPIVTSDFFRLPLDRKIRYAG
SLASTFLKRLLNKRKDCGAVFRPSTGILALVFAIAEHGRTADYVICGIGV
RKRNEYLSGKQVKGHDLPHHVFADVKVLRKLARRYNLFTTEPELEHLVPR
YRSG
>CT0186 hypothetical protein
MTHNDHPGEGTPKTVVCPMCGEPFTCGMSTSCWCATRVVPDSVRNYLAER
YETCVCSTCLDRLIAEAKEELRGA
>CT0907 hypothetical protein
MNAAEVIAALELPPGARVDRRIPKTLLVERGARTATDKRRINEGVEEVQW
VATLKPSTIGVPAFRDEVREYLEINVLSATLRGGAKAARFAELIHRAVPY
PVFLLMAEGTRLTLSLAHTRWSQGEAGATVLDGEPIAVAVTEAETEGLPS
SFRQALSLARQPRADLYQLYQGWIDTLLALKAAEVTGRFAVPTSADQAAA
RREALRECARLDAEIARLRKAAAKERQVPRQVELNLELKRAEAARAAALT
RL
>CT0067 hemagglutinin-related protein
MANVYYDFGKESSIRPYIMGGAGMAHVNRLWTDENEEVFAWQAGAGVGAK
AAKNTTLDCGYRYLKPDNVDTCGLGDGKLECHTIMLGLRYQF
>CT1008 hypothetical protein
MKQIDLSSNQRIVFFTGACCIEINPEPSGAIGHDLVIGEPASRALPELFA
RK
>CT1862 conserved hypothetical protein
MMVPFVLLWFALMPVISMVLPLQAYAAENTASRELPDQAVKTYLKSYLDR
LDRLDASSSSGRIEIDGQLQRFYKALDYRVAWTNRKAIARLVELVGECES
DGLNPSDYHYDQIKEFAANPPGSPALKAKADLLMTDALFSLMSHLRSGKV
MPRALDSNWNLPVPKPAANYDQMLMTAVMGSKFPEMISVLRNPSPEYVLL
RKNLLRYRKIAENGGWQPVYQGPNIEKVGQVDKRMPLIRQRLILSGDLAA
DKLVAPADSSLSSAAPVPSDQVYTQELFDAVLAFQQSHGLSADGIIGLET
LNAMNYPAELRADQIRVNLERERWYGGILGGTYVMVNIPAFSLKYVKDNE
VRWKSRVIVGKPDTQTPIFAAQIQSVTFNPHWVIPPGILSKEAIPAIRKD
IGYLAKHQLTVVDSNGKPVDPSRVNWYSDGGFPYRLVQASGDDGSLGRIK
FNMPNRFMVYMHDTPTKPLFERSMRAYSHGCVRVDRPYELAEQLMRDSKN
WSLSKIEAAINTGKTRTIPLTVKVPVYFFYQTAFADGNKIGFRNDIYDRD
KKLLDALNSNQWGRSVEEATR
>CT1471 hypothetical protein
MKSVTPDAYIGTSGFMKMAGEIDFLMTVLKTGYAEQEDLLRQIDAAAKDL
YGPFNVASIIKLTPVQKAWPPESAEQLKKFNNFERSLLNRFELIDRNLKI
DRRNLLMKPAAAPTGKPVMRLMPIRNLNQVNQPAVAAVGKSDAKQTIDLG
MQKYEPGFRTVVPVDKVGISHGLINAMKKQKLEELIGKFKAISPTLSYAE
GRISFLLSIVEDSISKVQGSNGCSCEQKTF
>CT0748 conserved hypothetical protein
MNNNDLGKLLIRLCVGGLMLFHGVYKLIHGYGFIAGKLKAAHLPVWLVAG
VPVGEVFAPLLIVLGIYTRPAALVEAFLMGVAVWLVHMGQLTALDQHGGY
ALELQAFYFFGSIAIFFLGAGRYSISRGTGKWN
>CT1683 proton transporting ATPase, E1-E2 family
MDGQGRKSKEYEQKPVEETLSELKVDRTLGLDDKAVSERRSRFGFNEIEE
KEEALWHRVFRRFWGPIPWMIEVAAILSAAVQKWEDFSIIFVMLLVNAGL
DFMQEHRALNALKTLKQRLSKEVTVRRNGQFVRVPVRELVPGDIVKIRIG
DIVPADVQLLDGDYLQIDQSALTGESLPVTRKTGAVAFANTIVKQGEMLA
VVLNTGMNTSFSSVVALVAEAQRQERSHFQKMVIQIGNFLIMVTLVLVLL
IVMVSLFRHEPLLDIIRFALVLSVAAIPVALPAVLSVTMAVGAMNLAKRQ
AIVSRLAAIEELAGVDIFCTDKTGTLTKNQMEVANPEVLEGFTEQELFLY
AALASRPENNDPVELPIFSYLDTKLKSVDWKSWKQTSFTPFDPVSKRTEA
DAEKDGHTVHVVKGAPQVVIEMAGLDEARTRKLNDSVNELASKGYRTLGV
GVKEGEGMFRMIGLIPLYDPPREDSKQVIDEMHKFGVKVKMVTGDNLAIA
REIGGILGLEQKTIRSSQLSGASANELLNLAEVLATAIYRKLKGDVELRE
AKAFASDVMEQVGKLYDTRLLEREFIHTHESAIVEMIEDVDIFAEVVPED
KYRIVDTLQKGGHIVSMTGDGVNDAPALKKADCGIAVSNATDAARAAADI
VLTAPGLSVINAAMQQARLTFARMKSYATFRIAETIRIILFMTLSIVVFN
FYPITPLMIILLALLNDIPILAIAYDNSTIHPTPVRWKMQELLIIASSLG
LFGVIASFLLFFLLQQYGFSEPMIQTLLFLKLIIAGHSTLYVTRSEGWFW
QRPWPSPLLFGATFGTEILGTIFAVYGLFVTPIGWTYALLIWAYALLEFV
INDAIKLAVKRVFLQRNHD
>CT1704 cytochrome c, putative
MSMKIGKVLTVVLVVLILIQLIPGPSHENPPVTGTPKWDSPRTEELFKRS
CANCHSNETIWPWYSTIAPLSWIINLDVSVGRSKFNVSEWGRPGKNDGDE
AAGELRHGKMPPWFYMPAHPEAKLTAAEKDELVKGLAATFGDKSAEKEKK
EEK
>CT1978 CRISPR-associated protein, CT1978 family
MLVVVANDLPPAVRGRMKLWFIEPRANVFVSGVRDSLARKVVDYLHQHCP
PKSGLMIFNSSNTCPGYEIFGLGDTRKEITEISGLPLVIEKSAASPPENQ
NRLTPEAPKVQ
>CT0508 hypothetical protein
MLYGYGYEHPGFLDCGKLKERPLSMRGLSFYI
>CT2066 hypothetical protein
MDFHGAMLYGVWKNADLTGVPFVFPCKQRLMLKLNEKERESIRLFPILAA
SFL
>CT0069 hypothetical protein
MMKTGRHQALAQLMQRSRIDLRQGNNISISGKQ
>CT0736 hypothetical protein
MLILLGASIAALGLLIMLVQKSGGNGWLGWFGHLPFDIHIEKENFRLYFP
LGSSIVLSIILSLVIGLINKFFR
>CT0177 proline iminopeptidase, putative
MSYLTSSRCKLFYEDTAEQNPSLKDKPAILFVNGWAISSRYWRPTIDLLR
QDFRCVTYDQSGTGKTSIDGCQPDLTIGGFADEAGALIEHLGLDKSRNLH
IVGHSMGGMVATELCLRYRDALLSATILACGIFEETPFTSLGLMFLGGLI
DVSMNFRNMFRVEPLRTLFIKRAATGHISKEYSDIIIEDFTTSDKAATNA
VGHFSIDPEALRTYTRSVIEIASPVLCCVGMADHTIPPEGTITLFEKRKA
SATSPTRLVQFMHLGHLPMLEDTPCFVEQLKKHFDFAEHFYKKTQPATPL
ADRVQIQ
>CT1178.1 conserved hypothetical protein
MAEVRQNPVVIVPGVLFWDSLYEVMREALSTWIPAEKIAIVPVNLLDWLG
FPPSPERSTNRVMAALDRTVRAMASRFPGEPVTIVAHSGGGTVAMIYLLE
RPFQGDVYAVNGLVGRLVTLGTPFHTHEHFAKIKTDFIFKHLGPEFFQKY
QVVSVVSNQYKGSLDGGMIEKMCYMFYRGVTDDGNLAGDGVVPARSCFLD
GAKNVTILECEHLPAPHTKWYGTKDGVEQWIEWL
>CT0310 hypothetical protein
MAEEMKSPNQNQPVGDVVKGDFATILVGLGTMLDGALTPLSKVVASALDS
MTVVAHQILNGISNSIDNKQQ
>CT0282 glutamate synthase, small subunit, putative
MVVETSNAELDAMRRQSLERLLARHCGDCLAPCELACPAGCDIPGFVSAI
AKGNDREALEIIRRTIPLPGILGRVCPAPCEEACRRHGVDEPVSICALKR
FAADQDMVEGSGLPERKPASGKRVAIVGAGPAGLTAAWYLLLDGHAVTVL
DANEKAGGMMRYGIPKFRLPEAVIDADVKPLVKMGAEFRFSTLFGKDANL
EELQQEHDAVLLTIGASQASKLGIPGEELDGVQSGIGFLANVADGKAAAP
GKSVIVIGGGNTAIDAARTALRLGAESVTILYRRGREEMPANRLEIEEAV
AEGVELRLLAAPVAIEKGANSLVVTAVEMQLGEPDASGRRRPVPVAGSEF
TLHADTVISATGQQVDLPAEAAAGIGVERNGTVKINGESMLTGAAGVFAA
GDCVSGPDLAIKAVRQGRLAAEAIDRYLNGGDPAATGMSMFNSSYGARDK
APHQFYDRARPAARVAVPELEAESRRQSFEEAVTGYDPEKAREEAKRCLR
CRCQAVDDCRLRNLATAFGVATPATEEAHEYFSIDRSGDMRFEREKCIDC
GICIRTLESASADAVTIREELIDHCPTGAISR
>CT0520 hypothetical protein
MNSRGSLRGCPASGGTGGNLIGQKNFHISVA
>CT2043 hypothetical protein
MNARVPQQAVGPFSNQPACHSADRQGFPIVYETI
>CT2076 hypothetical protein
MYLLLIMCPFFILLACAVRGVSPGAGSFLASIAVESGY
>CT1562 hypothetical protein
MVTRLTEFVIHYGIQVFLAGPGLDLPRNRFALFINPIKKTRKQVIDDISM
>CT1137 hypothetical protein
MVTPFTSNKKQYATGVWLKKQAIEIARKIQAGECSRDGFR
>CT0501 conserved hypothetical protein
MSQGKSWKYCTGDERGFCAQGVQRIFSAKDPFLTDVIWRIEMRKKVIRLS
LNSSASNARKTVRPVNCYNHSVQKIRGPLIHNFKGGLRAVSIFEAGKGIL
VLSLALLLSTFVSRDLPGIIAEIKTRWNVDPSSHMPHLAKMLMQDLTGAR
LHFLIMLAAIYALMRFIEAYGLWFARRWAEWFALVSGGVYLPIELYELAK
GFSWLKMGILTINLIIVAYMAWLLIRGKSVLKSDEGDA
>CT0232 membrane protein, putative
MNFPDDPKPHRRPRFFDTVLVLLFVLAAYPIVGALLTILVAGGNPFGNGF
EAATHSFVVRLLVAQAFGQMVVLALPVFWLARRFSGDGLFGNTTLEWLGI
RKHGGSRPALIAGAGMLLLQPALYSIVELQTLLLPYLGTFGKSLLQEQAT
LDIFLKKLAGGASIGGSVLSILVLVLTPAICEELFFRGYIQKSFVLSLSP
QRAVLFTGIVFALFHMEWFNFVPLTLLGWYIGYIYWKSDNLLVPAVAHGT
NNLAALVLLKSGIDSGSATDPSSGLLVSWPWWGLVVVSLSLFFLLIRYFP
VRPALQDADNPMPPGHRWKSQC
>CT0990 polyA polymerase family protein
MLTRTHEAIPEIMYRIGDLADETGLPCYIVGGYVRDLIMQRPCTDIDIMV
IGEPVPFAHALAEKLGGRNFVLFERFRTAQLELDDPKAGTFKLEIVGARK
ESYNPESRKPITSIGTLEDDLSRRDFTINALALRLNRKERGEVVDLFDGQ
RHIREKLLKTPLDPEQTFSDDPLRMMRAARFACQLDFQLDEATLTAMSTM
SSRIQIVSRERVSHEFFKIMEARKPSIGLKILYSTGLLKEIIPELTVMAG
IEQVDGLGHKDTLFHTFQVVDNLAEHSDKLWLRVSALFHDIAKPVTKRFH
PGSGWTFHGHEAVGVKIVARIFRAMKWPLEPMEYVQKMVRLHHRPIPLSK
EEITDSAVRRLMFDAGPDLDDLMTLCRADVTSKNPRKVQRIMKNFSNVEA
KIAEVGEKDLLAKWRPPVSGTDIMELLNLPQGRTVGIIKSRMENAIIDGD
IPYDRQAALDYIQQVYREMQEKGEA
>CT0874 conserved hypothetical protein
MDMLNSFDVTNACNSFLQAMNVANSIIITGAVKNVLSCSGEIGSYVANRQ
INSLDELDVKMGGLTLGDAGPAMILSASDGNIGILEINLMNLGEHWEQCH
APEKTAWGPSGGEDLGFLLERTISLKEAPSVMATVLAGTGFRQVSEFSNT
LSLFILQDV
>CT0107 outer membrane protein, putative
MIAVSFSAGEPLLAAETSGAPLTLDEALRMTREHNPKARQALEELNAADA
KVTESRSAWFPQISGKAGYTYLDPVSEMTFGGLAMKFMPNNNYNARFTAE
MMLYDFGRTASTVDLAKAARNSARLRQDMTLRDLSLATVQAFYSVLFLQE
AVRVQQKEVAALQTNLDHMQKRYDQGAATRFDLLTTQVRLAGAANREIDY
QNQLRNQEITLRRLCGLDEKAPLSLKGSFDITAADMDADKLAASALDHRP
EVMLARENFKAASYKKNLATREFLPKIVGSASWGSTNGYVPDLDKMRTNV
AVGVELQVPIFDGFRKSAALREATAMKRSAEQQQLDAEQVSQAEVRQSVN
DLKSSAEKIQTTRLQVSQADLAAKHARIRYDNGLATTLDLLDAEAALAEA
ELANLQARYEYVMNAYSVRRAAGDLIER
>CT1286 multidrug resistance protein, FusA/NodT family
MKKRNQVMKRAVPLLLMTLLATIGLTACSPVGEYRRPEVRLPESYPGQTA
AGKEMAAVPYRKFFADPDLLELIDSAVENNHDLLIAMKNIDYAGKTLDVA
KLWFLPEVDLSATYDYSRGSRNSASALKGQARTSRNYTAQLNVSWEADIW
AKLRNAKKAAVAEYLRSADVARAVRTTLVSNVAQGYWNLKMLDEQIDITK
RNIALADTTLTMMRLQFDAGNVTSLAVDQQESQLLSAKATLPKLEASRTA
QENALSILAGRMPGTPIRRSTGYAPFVMPDSLGAGVPLALLSNRPDVRAA
EESLMEAHALTGVNKAMMYPSITITAQGGLNAIESSKWFSTPGSLFNALQ
GSLLQPIFQHGQLKAQYEQSKIKRDQAELTFRKTLLVAVGEVSNALAQVE
KTGEQETFAKARVAALRKAAGNSRLLFNSGMATYLEVIAAESSLLQSELE
LADVQRSRLSAIADLYRAVGGGWRE
>CT1579 hypothetical protein
MEIFFGWIIFSFVAGIIGSGRKIGFWGAFLLSIFLSPLIGLIFALVSKSN
EKEEYEKRMLETQKSQQEALNKILQEKQNNASTISITDELEKLAKLKNDN
IISEEEYQKLKDKLLNS
>CT1675 conserved hypothetical protein
MAFVKPILLKSATCLLPRVLKLLFATLKINVGYAGEKLPDDQRGVMFAFW
HGKMIAGWLLARRLYPKREISAVVSLSGDGQILSDTLDRLGFHLIRGSSS
RGKEVVRRGIGVALSNGGVAAVTPDGPRGPHHRFKYGTLRLAAQHRTPVV
FAEISYDNARRLKSWDRFEIPLPFSRVRVTLHVVQVPEFPTEEAFRAWAD
ELSTGFDDA
>CT0194 DNA-binding response regulator
MRVLLVEDDPMIAEAVSVALKDAAYAVDWVRDGEKAADALRYGEHQAVLL
DLGLPGRGGLEVLAALREGGSSIPVIIITARDGLGERIAGLDSGADDYLV
KPFDLDELLARLRAVIRRQGGQASGLIGNGQITLDPATHTAFCGEERAVL
SSREFAVLHALLLQPGRILSRSALEERVYGWDEEVESNAIDYLIHQLRRK
LGATSIKNIRGAGWMVEKQP
>CT1177 UDP-glucose/GDP-mannose dehydrogenase family protein
MKITIFGSGYVGLVTGACFAEVGNEVMCVDIDENKINRLLNGEIPIYEPG
LDTIVSENLREGRLKFTTSITDGVDFGLYQFIAVGTPPDEDGSADLSHVL
SVAESIGRNMEDYRIVINKSTVPVGTADLVRETIRTVLEERGRAIDFDVV
SNPEFLKEGDAVDDFMKPDRIIVGVDNPRTKELLRNLYAPFNRSHERFIA
MDIRSAELTKYAANAMLATKISFMNEIANIAERAGADVEAVRRGIGSDSR
IGFSFIYPGVGYGGSCFPKDVRALERTSRKLGYNSRILQAVEAVNDDQKL
SLVSKIRDHFDGDLKGRVIAMWGLAFKPNTDDMREAPSRKVMEELWKAGA
IVRAYDPVAMDEARRIYGDRPDLVLVEHPDEALKGADALTIMTEWMVFRS
PDFELIRNSLKEPVIFDGRNIYNPDMVEQVGISYYSIGRRPRLVR
>CT0516 hypothetical protein
MTIELTASNLLMPSILFGPISSYHQFSPITHGFS
>CT1736 ferredoxin, 4Fe-4S
MSLKITEECTFCAACEPECPVNAISAGSDIYVIDESACTECEGYADSPAC
VAVCPAECIVKA
>CT1571 hypothetical protein
MIAPMQTCPVSNLPVTSKHNWKSIPAGADYVKRLELIGDNIFHHYVESDH
PVTLNSMSTELVHDVLEESGIKDSPLYLLWDMNNISNISFQYKQGINDIV
YRSGLDFAVVVFYNIDESCRIIIETFAAMAPNTMTVLLREDYRDAMTTLL
EYRSGKAPEKLEDQDVDEEISMKNEFLATIARMSWLNMLNQKIFLPPTDS
PFYPYFKAIDHMQEDLKAKEELHEKEMRQLKAENEQKLSQKIIQLNAQIE
LGKKEIQRFEQEKNALKSRIAAQEMELTRISTAIGEKTSSLHLICDQIKS
LNIESAAKQKILEQCHSMIETELNEKRLKTELTAGDSAFLSKLQKKHPNL
NQRELRVSLMVKLNYDTREIARSIGISTRGMESIRYRMHRKLGLDKHKSI
KTYLSELASEVS
>CT0076 conserved hypothetical protein
MDTELLGYAAGILTTIAFIPQAIQMIRTRQARDISMTWAVTMTAGVFLWL
CYGIMKQSFPMISANSITLLLLFIILFFKIRDQGNSNS
>CT0082 conserved hypothetical protein
MSSSSHKRIIGIDFGTKRIGVALSDPLRMFAQPLGTFDMEGLVRVLSRVR
DDEGIELVVVGYPMSDKGEENRMTGVIDRFVAELRESFPGTLIETFDEHR
SSRTAMKILAASGSSRKKRNEKGRLDTAAACLILQGYLDSHS
>CT1170 conserved hypothetical protein
MSLLEKINSSTCTLRFKNDWLKIFTCPVPSSDFLSFVGVATIDAPQHAVL
SLLYDIESATEWVWKTREMRVLQELADDNEGRVVYQVVSAPWPVSDREII
SRSTAYKDPETGEVFIKIESLPDFIPETSNCVRVRKLEGAWNILPLSENS
CRVVFRLHIEPAGEIPSWLANIAVIDTPYHTLNNMRDMVKREKYKNPVGA
PLKESTEGIIRNYNDFITE
>CT0119 hypothetical protein
MLILYSQVESHFVIQLLRLAMLVPGVSNSKQE
>CT1948 conserved hypothetical protein
MADLKKQIASGKIAPVYFFQGPESWLKEEIETQLKAAIFPSENEAALNTL
VLYGPDLTLGQIVSAASEYPMFTEKKLVVVRQFDKLKKVTDKTESQRQEK
SLLSYLSDPPSFTVLVLDADEIEKTELEKAPYKHLKAFRHDFGAIKNAEI
FAAERAREAGWEFEPEALKTFSGYIEPSSRLICQEIEKLTLYASNKRPER
RITLDDVCDCVGISRKYNVFELEKALVSKNLRQCSGIALMIMEQEGQKEG
LMNIVRYLTTFFLRIWKLQTPGAQQLSLQETAAMLGMYGRQEYFAKNFIG
YARQFSAAETENAILALRDADAALKGIIPSPDDRFTLLKLMQQLFN
>CT1419 conserved hypothetical protein
MTLTAPIQPVYALAFGAHPDDVELACGATLLKIMDEGKPVAVCDLTAGEM
GTLGTAETRRQEAALATERMGYVAREQLDLGDSELFYTKESLHKIIRIIR
KYRPDTVFCNPPDERHPDHMKASRLIYEACYYAGLRKIETFDGGLPQAAH
RPRHLLYYIQFKQLEPQIVVDVSSTFERSRAGIEAFGTQFHRKENSDEPV
TMINRKEFLPGLEARSRALGEQIGVMYGEGFRLPTALGIDHFTSVFPPGV
>CT2202 CorA family transport protein
MSVSDAAECFALKEKFKVLWINVDGLHETGVIEKLGELFGIDPLTQEDIL
HTGQRPKVEDFERYLFLSLQMLDFNRETGEISQEQLSIVLGPGWLITFQE
KPGDTFDALRQRIGSSNAKVRKHEADFLAYALVDSVVDHYFSILEEIENR
IDMLDADLMATFSQETFNALNALKRELIMFRKAVWPLREIIGSVARDDFA
VVGDLVEHYFRDVYDHVILVIDTVEVFREIVTSMHETWLAGVNNRMNEIM
KFLTMIATIFMPLSFIAGVYGMNFK
>CT1023 thiol:disulfide interchange protein, thioredoxin family
MKSIKTFFATSFMALSMLGAVPVARTFAAGPSASPAVAGKMAPAFTLKTL
EGKELNSSQLAGRPYIVNFFASWCPPCREELPGMVALQKKYAKQGFTFVG
IAFRDRPATLPDFLWEMGVEYPVGLTTPELEAAFGKFMPGGKIRVIPATF
VVGRDGKLINAVSGGLVKEDFESLILKAIGSRPSK
>CT1102 conserved hypothetical protein
MKAGIWIAGRYSFARKRFRVINIISGISLAGIVVGVSTLLVVMSVLNGFQ
KLARDLFVTVEGPVQIVPAQGRSLVVSDSLLAAIARLDGVETAHPFVEGQ
AIITASGKSELIMVRGLTAEAQRSLMQTTSTTQPYFSTDGISAGNLLAER
LRLYPDEPVRLFSPELISAGLQALSQPELMPALSMSDVRVQSSFSLQKLF
DDRYVLAPVEMAQNILLFGRGRYSGIDIRGKSGVSDETLEARLREWISAT
HREKTLRVVTLRERHRDMFAVMQLEKWASFAVLMLVILVALLSLAGSLAM
TVIDKRHELFYLRCLGLERPQFMTIFIVEGGLTGLAGTTLGSLLAWLICK
AQELWGIVQLPSKSAFIISAYPISMKTGDFLAVGAAALFFTLLVSLYPAS
KAAVIATSQSLENKME
>CT0613 3-isopropylmalate dehydratase, small subunit, putative
MDTIIQGKAYVLGKNIDTDQIIPAEHLVYSLSDPEEVKMYGKYALSGVPI
DQAGLPEGNIPFVEEGEFTSPYSIIIAGPNFGCGSSREHAPFALKVAGAK
AIIAESYARIFYRNCVDGGFVIPFETAQPLNKSIMTGDELSLDMENNTLT
NLTQNITYELRPLGDVINIVQAGGIFEYARKNNLMASTEA
>CT1550 conserved hypothetical protein
MFQPPLIAFLTDFGLEDAFVGVMKGVIATICREAQVIDLTHAISAQNVRQ
AAFHLDRSISYFPAETIFVCVVDPGVGTARRAIGVEAGPYRFIAPDNGLL
TPVFERWADAKCHELTNPAYQLANPSATFHGRDLFSPAAAHLATGVPLNA
FGSAIDVANCTRIELWQNKPCDNGKVWTGEVIATDHFGNLITSFEASMLS
DGEDWTVKAGNAEPLPILRTYGEVEQGKPLAYIGSSGMIEIAIRNGSASA
ELRVGGGDEVELRKG
>CT2227 hypothetical protein
MKTTHTIIHGDSRQMNLLPDRSVHLVVTSPFCWQIKGYCCECERYLIFFP
LGTRA
>CT1383 conserved hypothetical protein
MPLQIFNTTKRTIDETLLAEVIRLVIGEEGGAVGSIEAIYCGNKMIRRIN
RDFLGHDYVTDTITFGYNEGGEVDGEFYISLDVIESNARRFGVSFEDELL
RVTIHSALHLMGYDDETSELRAAMSLREDHYLYRLRH
>CT0739 hypothetical protein
MSKSAGSTFCPKSNGAMIFLLFFSSLLPMVAIRKNSLSFPIHRYRAQGYV
LLDVIYSSLVRAQSVIPYSGGAALGSHYCRNCVPLYIVSGLRS
>CT1651 hypothetical protein
MLEMCDWEENASGGGNFLRSERTELIDSEDWPFVCSVAIGHTVLY
>CT0120 nitroreductase family protein
MTNLHDLAESRKNVRSYDTSQPVTPQILERVLDAWRLAPSAKNLQPWTFL
LVSSPEMLAKIRSCYKRDWFQQAPHILIVTGDRNDAWVRPSDGWNSIETD
LGIAMDQLILAAHAEGLATCWISAFDPAMLREALDLKPSEEVFAITPIGY
ASADAQTRPKSRKPLNEVVRYL
>CT0220 hypothetical protein
MTLVLNITAIVLFTGLSLLVKYRIKRWKQKQLRQQNDVWSIGLYEGPDPV
TLSPAAGIRNPILTAKEVTDAPARFIADPFMIERDGAFHLFFELLNTKRK
MGEIGHAVSDDLKTWRYSHVVLRERFHLSYPYVFEHDGEVYMIPECAKSK
SIRLYRAASFPDDWRPIATLLSGNKREVALLDPSIIFHDGHWYLFSYMRK
VNNLHLHVAETLTGPWREHPASPVVKNSDHFARPGGRVVKNGAALYRFAQ
DGQPRYGSKVWGFRITELTPTAYREEAVSDTPVVQEGNEVWNGRGMHTVD
PHRMPDGRWIALVDGLEDKLRS
>CT2068 hypothetical protein
MFGLSIEPEWFFMPQHACYNLELGRVCEQVVFDELVA
>CT1468 hypothetical protein
MPEWQNLFTCRKREAVGNVEADKPLIKLFSP
>CT1584 hypothetical protein
MGMVDSPGDRRSSKRGGHQRKRLGFKLDMTPMVDVAFLLLTFFMLTTTFA
KSNTMEINIPPETGEVAVAELNVMTLRVPGDGFAYWSLGEAAPRRVPLYD
SAGTHASLSSELRQVLRQETGRNRKMVIVVRISGKAKYKALVDIIDEFNL
MKIDRFSLDDFTPKDEAEIQKAVTMR
>CT0434 type II secretion system protein
MGTLKDQIARWNLSLKRGADFSSSAPETADVPEKAPVVQAPVASVPVAAK
KNKESNGDPDYYSAKTKLHQKLLTRIDLNSIESLDADQLRNELGMLLTRL
IEEEALPLNHQERSKLVTDLKNEILGLGPLEPLLADPDISEIMVNGYQNV
YVEKKGCISLTDIRFSDDAHLMKIIDKIVSHVGRRIDESSPMVDARLPDG
SRVNAIIPPLALDGPALTIRRFAVVPLQMHDLIEKKTLTPTMAELLSALV
KVKCNIIISGGTGSGKTTLLNILSGYIPYNERIITIEDTAELQLQQDHVI
RLETRPPNIENKGEVTMRALVKNSLRMRPDRIVLGEVRSSEVIDMLQAMN
TGHDGSLTTIHANSPRDAIARLENLVGLGGINLPSKALRQLIASSVQFII
QVSRLSDGTRKITHIQEIIGMEGDVITTQDIFFFQRTGTNSDGSVQGVFK
ATGVRPRVYEKIRTFGLNLSEGIFDPDSKD
>CT1580 aminopeptidase C, putative
MNWYLQVLKKYAEFNGRARRKEYWMFALFNIIFLIAAMIIDNIAGTTIGV
LPYGLFYFVYALAVFIPGLAVGVRRLHDVGKSGWFYLIILIPIVGAIWLL
VLFCTDGVVGQNEYGINPKEVATN
>CT0952 conserved hypothetical protein
MTPLLFFIATIIFAAGAGAGAFLVNAFRNKQENVCRGSVRIGPSTVAGRG
AFALTPIKEGDIIERCPALEVTDKDIGGELLNYVFYGSAEDRRLIAMGYG
MMFNHSSNPNVAYYREDTPTGPELIIYALRNIAEGEEMYYNYGDDWWKTR
GEKSDF
>CT1152 alcohol dehydrogenase, iron-containing
MNFFLSTDLCIGVNEALSVHEHLQSLGIAKPGIIYDASLSENLYFQDVLR
NINACYANAVVYCNEFRGEPTYRHLENVAEFFRRELPDGLVAIGGGSTID
LGKGVALLLTNNVPALSLKGFPKDVSDPVPLVTVPSLLGSGAEVSYNAVF
IDEDEGRKLGINSRKNFPKRSVIDPKLSMSAPMESVIASAMDSLVHCVDS
FGSVKHTALSRIFSIEGFQRTFYALQQGQLDRAESRLDLALGSICGVTAL
MNSGDGPTNGFAYYVGVKHQVPHGLAGAIFLKEVMRYNVNKGYDKYALLN
PARKEGASARESALELLEQMDALYCQLAIPNLVPYGYGKGNVAELASKAS
QALSGSFGGNPVEFNEESARVVIDALT
>CT1735 hypothetical protein
MKVLIHHIYEYRKGLRSLVMHTLPARFGEEATRKLRHYGIDYHIYDFGKS
HINVFFGAPECVAVVRSICASKKLKALTPEEDFVLGSMLGYDIRKQCERY
LKKCESAAIADPAMHKCA
>CT0950 conserved hypothetical protein
MKNPRAGRPLHCRSVEELPGVTCFLPEGVPPARLRNVVLSVDEVEALRLA
DLEGMYHADAADKMKVSRQTFGRIIKSARKKVADALVGGKTICIEGGKIT
GSCLTGESEEPAVCICLHCGYEQPHVPGVPCRTANCPHCGKMLIRKGRYS
RVD
>CT2079 hypothetical protein
MLEKMANNNVEYRSTREPLDAIENNDEQDAIQKSIERRAISTEALWRALG
RRITKKWKDEGKACRSGVVIQSDDKPLQASFSEQIASKYNLNANYC
>CT1395 conserved hypothetical protein
MCGPEKRKGWAGFDPTHKTLFVGDWYIKIETGRDYGDVPPVRGTYKGTSS
EKLGVSIRASSLEHDGETPTTRHYHPA
>CT1613 hypothetical protein
MPSRKTPNLSRPMSGNESMPATPQMQAPKKKGPKVIATLFVIFVIIAAAG
WFWINWEKPRIAPEPLSPELTTIIQNMPGISDAMIYVGLKDIRESKFWNE
VVPDSIKNSPLLSLGKRTDSLMKAGNINLTNDLDTLLVGFQRSGRKQQNY
IGIACGPVARKAQAPFLKSASLQTAEVAGRQAYEIDSTLWVSPMGTNRLA
IASSSNMLEKFFKPSGHLFERDSTTASLIRKTPYKSHVWFALASPQWTAG
ALQSITSQNRDVKSVGNLNRLQQISMSVKFDDNGLKGQSEWVYKDRQAAF
FASTFLWGAIKLSSISGTRTSESTKELLKHLKVSQNLESVIVTADLPETI
FKKSGKKE
>CT2055 conserved hypothetical protein
MELTESIRLLFPPEERAQLDISAIKGDASNRQYYRVTGQGSVSVVCADPA
FRATAVENYPFLIVRDLFARHGIRVPELLGMVHEQGLLRLEDCGDLMLQD
EVPLLDRNRLSARYRQVIDLLVRIQSIRPDKDALTTTLPFSLSFDHEKLM
FEFDFFIEHALNGYFAGRLGKPAIARLREEFINICDLLVLPKHFVLNHRD
FHSRNIMLFRAEPVVIDFQDARLGLPQYDAVSLLRDSYVRLDPGMVNELK
RYHFNQLVQLGLTSMGEAEYLRLFDLMAFQRNVKAIGTFCYQTTVAGNRT
FEPSIAATLSYLREYIEARPELAMAGRLLKPIIPEISR
>CT0295 DNA-binding protein HU-alpha, putative
MGQTTTKADLVNVIAQRTGLTKNETESVVDCLFESIIDSLKAGKRIEIRG
FGSFNIRYKNLRQARNPRTGEKVTVEPKNVPTFKISKEFKHAVSESLKAN
K
>CT1806 sodium:solute symporter family protein
MSVQVXTYLIVGLTFAIYIGIAIWAKAGSTKEFYVAGAGVHPMINGMATA
ADWMSAASFISMAGLISFMGYDGSVYLMGWTGGYVLLALLLAPYLRKFGK
FTVPDFVGDRYYSNAARTVAVICAIFVSFTYVAGQMRGVGVVFSRFLEVD
INTGIIIGMAIVFFYAVLGGMKGITYTQVAQYWVLIFAYMVPAIFLSIMI
TNNAIPQLGLGGTTSDGVYLLDKLDNLSKELGFGAYTTGSKPMIDVFAIT
LALMVGTAGLPHVIVRFFTVPKVRDARISAGWALIFIALLYTTAPAIAAF
ARVNLIDTVSNKAYAELPGWFKKWEKTGLLAWMDKNGDGKIQYLGKKANG
GDPFEGKKPEFTKEIGKSGELLMSNKPTDNANELYIDKDIMVLANPEIGR
LPNWVIALVAAGGLAAALSTAAGLLLVISTSISHDLIKKQINPNISESAE
LMYARIAVAIAILVAGYFGINPPGFVAEVVAFAFGLAAASFFPIIILGIF
SKRMNKEGAIGGMITGLVFTAAYIVYFKFMNPAMNKPEFWFLGISPEGIG
TIGMLINFAVSFVVSRITPAPPEQIQELVDSLRYPKGAGEASAH
>CT0249 glutathione S-transferase, fosfomycin resistance protein, putative
MNLTGINQITLRVNDVRLSEEFYAGILGFRVDYRAGANISYLRINSDMLV
LVKAETPGTADARDIRVDHFGFRLASDAEVDEAAVYLDDRGVHLVTRPAH
RREGRAFFVMDPDGNLIEFYSMNATGLQPLAENIDTRTASDIASDSRREL
AAGRDLKKTRRSRK
>CT2111 conserved hypothetical protein
MVMSKEKALIEIRLAGLAQGTHEFDFTCKAADFADPALAGAGFSRDVSVN
VSVEKLDGEMIVTLNTSATANLTCDLCLAPITSELKGSYRIYYGYEQAGE
PQEERDEEYRLIDRNTLALDLTEDVRETLLLSVPMKVTCKDNPDCRVFHQ
EKLSEPGEDHLPDSDWQESLEKLKNKYR
>CT1572 hypothetical protein
MAFRGWKKLPIRIETGEMVEAIAPVIVSASRATDIPAFHAEWFMTRLRAG
YVQWCNPFNARQTQYVSFRKTRAVIFWSKNPAPLLPHLSELDAIGINYSF
QFTLNDYEAEGLEPYVLPLAKRIETFRRLASHIGPERVVWRFDPLVLTGR
LTADMLLERIGRLAEALGGQTRKLVISFADIECYRAVRLRLSRMGAGARE
FAPEEMDEFAARLVERNKDWGLELSACAEDSELAGIAKSRCIDGHVLASC
FGDDAELMEFLGGSTLFPGDAPVSVALKHQGQRKACGCIVSKDIGVYGTC
QHGCRYCYACR
>CT1802 Nudix/MutT family protein
MPESYWILPGGVVERGETLEEALRREVREETGLECEVGGMVFVKELLWPH
PGLPGQGERHHSVSLGFHCEVTGGRLVTGRDPELPDDRQMILQSRWLPLS
ELAEYRLYPPFLYDFIDSGLRRGFEALCPEFFDSTL
>CT0964 methylase, putative
MQIHAGRYRGRKIRHLPSRDIRPCTSRVKKSLFDTLAARLDFDDIEVLDL
FAGFGNLGFEALSRGAKSACFVEQNRQALETMKATAVDIGAGASARFVMA
DVTAFLKREEGSFDLVFCDPPYRWEDYEYLIRSILDTGLLAPDGLLLMEH
HASHDFSQSRGYLFHKDYGTTRVSFFTPQPEAHE
>CT2056 mannose-1-phosphate guanylyltransferase, putative
MNAFVLAAGFGTRLQPLTDTMPKPLVPVLNVPSLCYSLFLLKEAGIRKAI
INIHHHTESLRQFFDRHDFGSLEIVLSEEREILGTGGGLKKCEHLLDGEE
FVLINSDIISDINLRSLIDAHQRSGCGGTLALYETPLAAQIGYIGVRDGL
VLDFRNQRGTGLSSSFIYTGTAVFNPEIFRHLKTEFSGIVETGFYGLADN
GRLALFEHRGLWQDIGTLPNFYRANLDDNLRILQLAGRIQREIGFFPHMI
SDDASINPEAHVENSVLGANCAIAAEAIVEHSVLLPGTIIERGETLRNAI
AAPGIRIPL
>CT1950 hypothetical protein
MHTNRQRPDERTKNTALFIKKMKILIHFCFIAESAILLKPC
>CT0800 acetyl-CoA carboxylase, carboxyl transferase subunit alpha, putative
MPQTRSFIHSLPVKHFFLPFEKTKGFSYSADSIERLSEYEKYQLSFHPER
PKYLDYLTVFDNVEEFLANDLHGSRLIQTHRAELRRNGKVWPVMLIGQQS
GPTSDFGELTRITQDQDEMRRWNQGMPTPAAFDKAIEAIALAERERRTII
TVIDTAEADPTEESEAGGIAWKIGRCMQALAEATVPTISVIINRGCSGGA
IALTGCDAVLAMEYSTYLVISPEACSSILFRTRDKANLAAEISQITSKEG
MKNGIVDELLPEPAGPTHRFKNEALESFREVTGRWIEAFGKAPAESLQQR
RIERWQKIGQCETTTEEHIRTYEKKVSFFIDKPKKNLFISRHKPLLKR
>CT0422 cobN protein, putative
MQVPGRVALHPGLPESFSTVDDAELRTLRDYFGVLSQENYLNGLRRLAGL
DHEPCRVVSTTGIYHPGSDRFFESAEEYASWLRGTERWSENAPVAGIIMH
YNLLAETNHADVDALVEALSEAGIVPFCVFFDSEAAMLQENRYPWHRLFT
SGELRPDVVLNFLLGRLLATPEERHVLQSLDVPVIQLLRNFMLSPEEWLA
DPVGISAMSLTHSLVQPEMFGAIDPVMVSGMIRKPGDPFTLRTAVPIDDR
IAMIARRVRRWARLRHASNAEKRIAIMLHNNPCKGVESTIGMAAGLDTFE
SLLSLLRRMQQEGYDTGELPESGKALLDLIQDRKALLEFRWTTADEIMRK
GGVLHAMHEEEYREWFDGLDAGVRERVDADWGVFPGEGMALEVNGKPALL
ITGLRFGKVLVMPQPKRGCYGARCDGEVCRILHDPNITPPHHWLATYSWV
QKECDAVIHFGTSGALEYLPGKRAVLSGNCFSDISLGDLPNIYPYIMDVP
GEGMTAKRRGRAVIIDHLTPVYRPAALTPELQRFDELLNEFRRTVDGNEK
ARMEALQEELLELAVNLRFLPENATTAHLLDEVERLSRRIGLVKQSMVPD
GMHRFGEPPPVEGVASMLTAALPRLGDEYPSLAEVAGLHPGLGGDDFMKA
SELFAALLDPSQGEQAIRDAAEALPDALKAWCLKTAEGIGRAGDELTNVI
RALSGRYIEAGLGDAPSSGKADVLPTGRNFYAADIMTMPTETAWTIGSGM
ADMVLQKFHREEGRFPESIGMSLWSSDAFKSDGELLSQIFALMGVRPVRQ
SNGRVNGIEVIPLDELTVSIDGVSMPRPRIDVTIETSSIVRDMVPHFLAL
IDKAVAAVSALEEESPEMNFVRKHTLEQLAALQESHAEQMDASLMQRLSL
YRVFSSPPGAYANGVALALDASAWNDRRDLAETYINHSGYAYGGEQLDHG
VKAYGVFSRQLAKVEVSFIKQTSEEYDALDCGCYAASAGGMAAAAQVLSG
KPAKTWWADATRPGNPDIRDFREEAERAVRAKLCNESWIASMKEHGFQGA
QGFASRINNLFKWSATTGEVETWVFERVVETFVQNDENREWIRQQNPYAL
EEITRRLLEAEARGLWEARPDLLEAVRQAALSIEGDLEERIGDVDESFQG
GRIDIFTGKDVERWKQEWRLGVDSSEKKQ
>CT1122 hypothetical protein
MGKRYEIGADFFREEILAAMLFGFRNVKNPSTVTVHPELMVKIRESFMDK
VTSPKQLGDVEVFFGLTVIEDATKAKDYISVN
>CT0081 ric1 protein
MDIRRLILAFILPPAAVMNKEAGTIMLTGILTLWGWIPGVVAALIMISKE
QSGKTAEA
>CT0482 hypothetical protein
MGNSELQPVSKSAGGQAWLLLALDELLAIVRECINPKVSRSGLDRCLRRH
GRAL
>CT1040 methyltransferase, putative
MSDHQSHSRGKNAREWFEEWFDHPLYLKVYHHRDAEEAERCVRTILDLTG
IDPAWQPPHSVLDIACGAGRHALSFARTGLRVTANDLSPYLLDQARKQAK
AEGINMEFSRQDMRTIRFERRFDLIAQLFSSFGYFETDQEDRDVIANIAS
LLNPGGWYVLDLINPVQLKSQFTPRTERNSESLSIIEERTLSERHVTKKI
TLHEANGRKHSFTESVRIYSPAEAFSLLESGGFAVERVVGDYEGSPFDEA
TSPRMMLLARLLVSRS
>CT1711 hypothetical protein
MRDHTPNFKLLELSDASKALVRETVTQLLEKLAGDGQLTPDARLEFWVEI
PGVKHPRGTFRGGCLMPDSYLCLSDWFATGSSAIEPAAEYASSENPLDAA
WADFLGELYYQIEIFTSVASANQGITVELWAGTRGRPECEWMYAVDKKIE
LP
>CT1207 Sec-independent protein translocase protein TatC, putative
MSFLDHLEELRWRLIWSLIAFVVAAIVTAFFSDFLVNQVLIRPLKESGPN
IHLQNLVPYGQISLYLQVIVFAAFVLAFPFLVWQIWQFVEPGLHETEKAA
SRFIIFFISICFFSGIAFGYFVFLPISLKFFAGFGSELIANNIAIQDYIS
FFMGTLLTTGLVFELPFVSYVLSKIGLLTPAFMRFYRRHAVVALLIIAAI
VTPSTDMVTQAVIAIPMIVLYEISIYISAAVQKKRNKKMMEEGVA
>CT0078 hypothetical protein
MTMFTRLILWLLPLLLVLPAASAAETAGGKLFIESTLDNATPWVGQEVKL
TYTLFFSGTAPQIEDKSQPEHSGIWVRELAPENYINSAPVSKNGELFRKA
VIKQLRLVPLQSGKLPVTGYRLRCLVPQQGEASLDSRNDTETIVTAPTAI
IQARALPKPAPADFSGAVGHFTLSVSPENSTVHAGEPLSLSVGISGKGNL
DTLPPLKVLLPEGIRQEVSVASPDTASAKGSTSSVNTKMLLIASKEGTFR
FVPLKLTVFDPETGRYETIASNAIVIKVIAGRTAMMPPQSPLPGVMPPPA
DPDPLGAVIRPIIMSMGLAVLVLIFGLHLRYIKRYKRTGVQQKTSEAAEP
IRPTAAPAPATQTSTGGKSPQSLRNELYGAVKKTGIMNPAGLTTKELGKL
LKEKGVKAQTISALTELLSGIDHALYSPGQISPEKLETMNRDASRIIADL
TRS
>CT2016 conserved hypothetical protein, putative AefA
MESTSLRDLSHVTDWLVRPLVVIGHAEISIVKIVTIILSVALIVAAAKFL
KRMLVGRLLVSSVHDTGTRSALATILQYLIVFFGILIVLQSAGIDLSSLT
VLSGTIGLGIGFGLQNIADNFFSGLIILLERPVKVGDRIQVGEINGDVVR
IAIRSTTLLTNDNINIIIPNSEFVSKQVINWSHNDRSLRVSVPVGVAYGS
DPEQVKHVLLGVAENHPDILTKPSPAVLFSDFGNSSLDFELLVWTETRIQ
TPRFLRSELNYRIFDAFRKNGIEIPFPQTDLHIRSSDIPLWEPRERKNPT
G
>CT0472 conserved hypothetical protein
MSDLYRFGISLERKLIESFDRHIKAQGYQSRSEALRDLIREELLRKTTAE
GGLVAGAIVMTYDHHKRDLVNRLIDIQHDFHDLIISTQHVHLDHENCLEV
IAVKGNAPEIEKLSSALKVLVGVKHLDLSLSSAD
>CT0930 ABC transporter, periplasmic substrate-binding protein
MPVRSKSNRLTPLFILLALCLQTLAGCSKPQASDKTEPTKSQAPQRIVSL
APSLTEMLYAIGAGPQLVGRTSACDWPAEAKKVPVVGSFGRPSLEMLASM
NPDLVLDVDLADDQTAKKMEEMHIRREHIRCQDPKELPAALRKLGTLTGH
TRQADSLATVIEQGLAKYRKEADAKQHKMRIYLEIWNDPLWTGGSNSFVS
QLIALAGGRNIGDAVEKEYFEVSPEWVIRQNPDLIACMYMANQTPAADNV
KKRPGWQGISAVRNNRVYDNFDNRLYLRPGPRILEGIAGMKKLIESNEQ
>CT2075 TPR domain protein
MPFVSAAGPAEELIDQGIRRGLEGNYQDAIDHFSRAIRLTPRNADAFYNR
GLARVSIGDLTGAIADYSMSISLDPRSSGAYNNRGFALAALGRYAEALAD
MSRAIALRPDMAQLYNNRGTIRMSIKAYALAIADFTRAIALDPLLAGAYN
NRGLARNLSGQLQGAVADYREAVRIDPRYKVAWYNLGNAHISLGDAKEAV
EDYSKVLVLDPGMLVARNNRAFARLSLGDYKGALEDLNLVISKSPQDAAG
WYNRGVVRKLAGDRQGAIEDLRRAAAFGDSLAVEALREITSRDSMPP
>CT1024 hypothetical protein
MLICPSENRPNAFFTFWTGTPLTGSENLASTEFPRFIASYHSASFRTG
>CT0949 hypothetical protein
MDSDLIVTFGGGKKVNAEFRGFTIKTDQSVHSGGEGSAPEPFALFLASIG
TCAGIYVYSFCQSREIPTDGIRIVQSHEPKADGRGIGKITLTIEVPPTFP
EKYKDAVINAANLCAVKKHIMEPPAFEVKTKVVEG
>CT1136 hypothetical protein
MMSNSGKMLDQFVFNPLTRFILPEVGRRIFFVTHALPI
>CT0659 conserved hypothetical protein
MLDSLLLYIESFNVDRSIAVPVATLLAVLVAMLLIAIVELVTKKILLRSI
DHFVAKTASNLDDLLVRYGVFNWIAHLVPPLLAYRMAGTVLQFYPAAVPQ
VTDGLVIYFAVVVILLLLAVLDVVYEFYSGHPIGMKLPIKSIVQVIKTVV
VSIGVVIVISRLLGQSPVVFISGIGAFTAVLLLIFKDSILGLVAGVQLST
NDLVRIGDWITMPKYGADGEVIDISLVTVSVQNFDKTIVVLPAYTLISEG
FKNWRGMQESDGRRIKRSFNIDMQSIRFLDGELMKKIEAVQLLKPYLQER
FQEITSHNRAVGADEDSPVNGRRLTNLGTFRAYLQAYVHSQSRINTDMMI
MIRYLQPDAFGLPCEIYCFTRTKEWAEYEGVQADIFDHIFAVLPWFDLKA
YQLQGAAVPPPAVTA
>CT0788 hypothetical protein
MRLTRKCRTGRRDCNDCNLLLINYIKLKFFLNYNGSIFKLMETIYFSKIV
LVSIQKTISNLPQ
>CT0704 zinc protease, putative
MSALSFRGNSPGIEYTVKVSQRARYARLKMSPVEGLTVVVPVGFDKKQVP
ALVESKREWILKVRRTFDKHRAAAPAQGDAALPTVIELAGIGESWRVRYR
SEPRQRITITEKGEGELEVSGPVSEHAMCFAALEQWLKHRAKLKLGAQLM
RLASINGFKVSGVSVKKQKSRWGSCSSRGNINLNLKLIFLPPLLVRYIMI
HELCHTLHMNHSARYWETVARFDPDCVVHDREMKHAWRFVPAWFSNAR
>CT2063 hypothetical protein
MNIADVAKKGFKALFCFPIHLRLKGYRRMSTFLQSNTRAVDF
>CT2009 hypothetical protein
MSNNQSRMTHRNANRIIAAGIFLVAEAVYLSTMAPTFSFWDCGEVIATSY
TLGIPHPPGAPLYLLVGHLFSLLPFFQDIGARLNFFSTLISSTTIMLTYL
IIVRLIALYRDSKPDGWSLHEQIAAYGGGVVGALALAFSDSFWFNATETG
LWAASSLLTATIFWMMLCWYDEDPAPGSERWLLGVMYLIGLSIGVHLLCL
LALFALVLIYYFKKYTVDLKSFSLMTLFSLGLFFLIYKLIIKGIPVLLVT
TSWWGMSLLVAALASGIWYSHKKRLVLLNLGLFSVVLLILGYTSYMLIFV
RAHAGPPINENNPSTLQAFFSYVNREQYGEWPLWPRRWSPEPVYQYFYQK
YSSEWDYFWRYQLNQMYLRYFGWQFIGRSADVEGAVVDWGKLWGIPFLVG
LFGAGAHFRKNWKMALPVATLFLMTGVILVLYLNQPEPQPRERDYSYVGS
FFAFALWIGIGVERLFTWFSGRLKSLDPKQLVWLAVAVVASGLLSINGRM
LMANYRTHDRSGNYVPWDWAWNMLQSCEKDAILFTNGDNDTFPLWYLQEV
ERIRTDVRVVNLSLANTGWYLLQLKHDSPRGAKPVNIEMRDDDLANISYV
PVDSVNVAVPAGMEARKLYDDARRSGVALPGAPSDSLRWTLKPALTYQGQ
GFLRPQDIAVYAIVVDNFGKRPIYFALTVDPAEMTGLDRNLRLDGLVYRL
VPLKSDSALSFADPGTLYGNLFNVYRYRNTGNLAVHIDETSRNLLGNYPP
LFARLAITLSASPEQAVMVPDASGAYKTVRRGELALEVLDRYTRLFPLSR
YPVTPKLAGSVVAMYAAGGANEKAYPYIHYLETLAAQSGAEQEPDLYFTL
AQTYRAVGRVHEADRIMKELETALPELRKRLDSLKQ
>CT0339 conserved hypothetical protein
MNYHTATITCQTTRPIDIIDITADVRSALEESGLQQGTVTLLSRHTTACL
NINEREERLMQDMTTWLKRFIPKDGDWLHNIETIDGRDNAHSHLLGLFMN
SSETIPFSEGQLMLGKWQSIFFIELDGPRPKREVLVHIQGE
>CT0915 hypothetical protein
MLPMVTCAMAPDGGIDPDGHVIRFFAMKMVQACWSLVWFCG
>CT1011 universal stress protein family
MKAYQNILVAIDGSGADDALIEQVSALAAPLGSRVHLLHVVHSHTIDQER
ALREQAGEFLERYRAAMQQQGIEAEVLIRSGEPDREILKEIEERRYDLLA
MAAHGHRLFSRLLFGSVSRALRNKIDIPLLLVRGEAR
>CT0134 P-II family protein
MKLITAIIQPDRLDHVREALIQADITRITVSRVSGHGRQEDIEYYRGQKI
APNLLPKVRLDIAVNDQFVNVTVDTIVAAARHESGEIGDGKIFITPLEEC
VRIRTNERGGSAI
>CT0238 hypothetical protein
MVNLNTSFRLYGLFFEKKCMIPSIITNPNR
>CT1997 conserved hypothetical protein
MPDETLDRSSLERLVRALKQARVQRALSVDEISRLVKIRTVHIERLEEGD
FSFLPPLYIYSYLKKYAAELGVGDDALLDACRNELGVSASNFSILPPAQV
VTESPRESASEPRGKTRRWPVVAAVAAALILAVLIFLLFFSGIF
>CT1196 conserved hypothetical protein
MFRILIHWLISATAVYVTAHMLPGITIKSFGAALIVALVLGLINALIKPV
LVFFSIPLLLLTLGLFMLVINALMLQLAAVLVDSFGVQSFWWAVLGSVCI
SGVSWLMNAVLNI
>CT0096 hypothetical protein
MFRIHEVSRSWKYSKSEPENDPETIKIQFFRLRVW
>CT0140 iojap-related protein
MTGGNASLLNDFFSLKRLSMSRSEHVASSGVEAREVAMSELLARRVAELS
LEKKGEDVKILDVRGLTSVTDFFVIITADSERKAKAVTDYIVDEIKEEGE
RPMHIEGLDTLRWVLIDYVDVVVHIFQPDDRKFYDLESLWSDAPVTVVTA
PEPSDEQEELQEG
>CT0854 iron-sulfur cluster-binding protein, gltD family
MNAESNPILDFATEYVYPAFSELTGTDKIVAFGDHSHKCPIYVPQTPPCT
AECPAGEDIRAINRFLNGTDPSDDPLKSAWETATDTNPFPAVMGRICPHP
CQSKCNRGVHDESVAINAVEQVLGNYGIEHNLKLKGPGADTGKRVAIIGG
GPAGLSAAYQLRRKGHAVTIYDANEKLGGMVLYGIMGYRVDRKVLEAEIG
RIIELGVETKMGVTIGKDITLEQLEAEYDAVFIAVGAQKGRALPVPGFEG
TPGATNAIDFLKSYEVLGDDIPVGKHVVVIGDGNVSMDVARLALRLGSQA
TIISGVPREEMACFENEFDDAKNEGTTMHFLTGTVEVLGGASGVTGLRCT
KMVKKEKGEEGWNSPIPFLRYKSNGESFEIEADMVVAAIGQATDLSGLGS
AASGPWLKVDRNFRIPGREKLFGGGDALKVDLITTAVGHGRKAAYAIDAF
LKGEPMPEEPYREITKPHKQDLLYFLHTPQAKRTSIKPEVVVGNHDELLE
ALTPEQAVTESKRCMSCGFCFDCKQCVSFCPQEAITRFRDNPAGEKVYTN
YAKCVGCHLCSLVCPCGYIQMGMGDGL
>CT1921 cysteine synthase/cystathionine beta-synthase family protein
MTNDIFDISDNTPLVLMKQLTRQKRARFMAKLEYMSPCFSHYCRVSGAIV
RDAEERGAIHPGMTLVDWTFGTSGIALAMAAVSRGYKVLLAAPESICREK
QEVLRALGAELVLTPSEALPGDLQSCVDVAENLVRTLPNAWFANMYQNPV
SRQVHQDGTGPEIFRQTEGKVTHLFVPMASGAMISGIGRYFKSVNPEVQI
IGVEPEGSVYGDLHGGKSQGTPSAFQLEDIGAVRPTSFWDPNVIDRIVQV
SDADAFNCGRELLRAEAVFAGGASGAAMAAALREGEAYGDDALVVVMMTD
FGGYDLSRMYNDDWMRRQGFYRKSKTALDQITADDILQRKAHKDLIFAQP
EQTLAEVFETMKQNDVSQMPIVSFGAPIGSISENRILSILIENDSAMNAK
VVAFMEKPFPVCQPDATISELSEKLQTNASGVLISLSDGRLQLINKSDLI
DVLTRK
>CT2217 hypothetical protein
MDNKIKPAKSCNATPAYNVRFSSLKTRIASERKKPKNQLVGR
>CT0830 conserved hypothetical protein
MSQLSETELRHVAGFQKNEITEHHIYKNLAKKVDGVKNRRVFAQISEDEL
RHYHVWKKYTGREVEPDRFKIWLYSMIALLFGFTFAVKLMEGKEQNAQQE
YERVANMGVEIDGLIKDEEEHEQALIAVLDEERLQYTGSIVLGLNDALVE
LTGALAGLTFALQNTRLVALTAVITGFAAALSMASSEYLSTKTEAGVKSP
VKASIYTGVAYIIAVAVLIVPYLVLNDIYLSLALAFAGAFVVIALFNFYV
SVAQGVPFRSRFLEMAGLIVAVSGISFLAGLGIRYFFGVEV
>CT1089 citrate lyase, subunit1
MAKILEGPAMKLFNKWGIPVPNYVVIMDPKRLAQLGEANKWLRESKLVVK
AHEAIGGRFKLGLVKIGLNLDEAIQASREMLGAKVGTAEVRQVIVAEMLD
HDAEFYVSIIGNRDGAELLISKYGGVDIEDNWDSVRRIQIPLDEHPTIEQ
LTALAKEAGFEGEIAERVGKICSRLVLCFDNEDAQSIEINPLVIRKSDMR
FAALDAVMNVDWDARFRHADWDFKPVSEIGRPFTEAEQQIMDIDSRIKGS
VKFVEVPGGEIALLTAGGGASVFYADAVVARGGTIANYAEYSGDPPDWAV
EALTETICRLPNIKHIIVGGAIANFTDVKATFSGIINGLRESKSKGYLEG
VKIWVRRGGPNEAQGLAAIRKLQEEGFDIHVYDRSMPMTDIVDLALKS
>CT2277 hypothetical protein
MRYDGIPFRGASFAAAIFMGLSAACHFEAVSEVKPRRPRRKRDNEMCSLP
LSFRYFHFQSGFRSQTGSTTLASKKRLRIQKTVQRRV
>CT0170 conserved hypothetical protein
MNTSGWGKYPEIEAQVLTPSSEKALQDLIGTGSYDGVTPRGLGRSYGDSS
LGKLMVKTLRLNHMLSFELQSGLLHCQSGVSLAEILEVFVPRGWFLPVTP
GTKLVTVGGAIASDIHGKNHHIDGCFCDHVDFLDLLTGSGEIIRCSPHEQ
AQLFHATCGGMGLTGIILSAAIRLTPIKSAYIDQVTFKSANLGESFELFE
KQAEQPYSVAWIDCISSGDKLGRSLLMTGKHAQHGALAAHHNPGLSVPLD
LPSFILNPYSIKAFNTLYYHKARQRETHATVHYDPFFYPLDGIGHWNRMY
GKNGFTQYQFVIPKEAGLQAMDLILRTIVNSKKGSFLAVLKAFGKENRNL
LSFPMEGYTLALDFKIEPDLFPLLDRLDAMVLDHGGRLYLTKDARMNAET
FRKCYPKWEQFQNIRQQYGARHTFLSMQSQRLGLD
>CT0126 acylphosphatase
MTEKRVHIIVSGLVQGVGFRMFVLREASARSLSGWTRNLPDGTVEVEAQG
DSGRVDELIRQIRIGPSRSSVTSIKVKEIEVDTSCREFRILT
>CT0377 conserved hypothetical protein
MKAGKPQLIWGAVAVVIALVAGSVVYRTVFVGKPKISYREYRPEIGDIRQ
LVSTTATITPQNRLEIKSPVSGRIDKILVKEGDFVKKGQVLALVSSTERA
ALLDAATLKGQSEIDYWNKVYNQTALISPIDGQVIVSSLNPGQTITTSDA
VMVLSDRLIVRANVDETDIGNVKLGQKAVISLDAYPDIHVEGVVDHIYYE
STLVNNVNIYHVDIVPRKVPEVFRSGMSANVDIVVNEKDHVLMLPLAAVK
SRNSHSFVLKRVAAPDSVKRTPVRIGLSDDNNVEIVSGVSPSDVILVKNA
AYSLPKNSAGTNPFMPQRRSSQQNRQR
>CT0571 hypothetical protein
MFAQELFRFPTLSAGTIMRSPLLRIVAISALFCAQPFQPEANAWHDKTHL
TIAEAAGFDLWYSAAAPDVAKSKEMFSPVESPNHYYNNNANKRVTPEMVM
AQVERYNRPNDDEGHLYGAIIGSVREYQSMKKSGKYAKYPLVYCAHYCGD
LSMPLHNTRYDDFNKERHSINDGIIENSVRHNIGYIQRMMRPPVIDSEAD
LAREIAAVAESARKLGMKMRKENRDMTVDEAYTQVTRSASLFNAILAWLE
RTQKTAGERTVTVTN
>CT2131 hypothetical protein
MLCAVSSQTQSRERPMKQTLLQEKYPIYILELEKEETSFGSVDDIVAYFR
EKVESHPFAVLIGEFDHYGHTSLLPEGQIGEGITAAKNIVFCFGKELMNP
KMLAVRPRSIGVAEMDGHFVISFLEAPNPAANASMEEWTLALKNPVSA
>CT1264 conserved hypothetical protein
MRRVFIHTFEVPGSSIDGYGHVSNIEYLRWMQDGATAHTASEGWTLDRYR
QSRAIWVVRRHSIDYLMPAYASDRLDLHTWIEWVRDCQSVRRYLLTREGD
VRALARAETLWVCVDPESGRPKRVPEDFIQAFELVVGGEAEALRIVGKAS
DSAS
>CT1695 sensor histidine kinase/response regulator
MPTPLFDTCLSSDDLNAFDASQKPGTTTFNTLASLTELYDAIPASIIIID
ERMRLIGWNRFSRDTINGLSDNEMPGINPMKRVHPDDLPEMIRKMRNVID
LDIELSAEFRMYHKSGPPYKWAMCRGKRTIIEGKTCVVAVVTEITDLKEA
EEQRKKLQEQLLQLQKMELVGQLASGIAHDFNNALAAIIGNTELALKKLD
PANPAVSNITDIHTLATRSANLTRQLLAFARKQMAMPRVLNLTEEVSDCL
RLYQRLIDKNIHLEWHLCDEPIQVKLDPTQLEQILSNLLVNARDAIDGSG
CITVRCEGARFEPTDGKTCNPRLSPGDYARLSITDSGCGIEASVLPHIYE
PFFTTKEIGKGIGLGLSSVYGIVRQNNGHIECHSEPGKGTTFDIWLPLHQ
ESQQKTDVHADKPQTELKTKAKIMVVEDEPYILKLVQDILESHEFTVFTA
TDADQCLLAAKTHEYRIDLLVTDVVLPTMNGIEQSRVLQKKNPAMKCLFM
SAHAPDNVDGQKKLRVGVDFIEKPFGIDEFLRAIDQVLNSEVEKI
>CT1369 conserved hypothetical protein
MIKRTEMRTVISICLFFGMILASSSLFARTDAEPSTRALADTVGFAHRAW
QMDSVMARIRALNHDDLVRTQQPAGTAWRAAICPHDDYTYAGWLYPAVLQ
NIKAKTVIIFGVAHTAWRYHLENQLIFDSFSSWRGPYGNVKVSPLRDEIL
ERLPRGMAIVHDPMQSEEYSVEALIPFLQYQNRNVEIISILVPFMDFERM
QIVSQHFAKALFAVMKKNNLRWGKDVALLISSDAVHYGDEDWDGRNFAYY
GTGGKANALAEAHDREIISHSFESELTEKNIARFYASTTDPNDFKKSRWS
WCGRFAIPVGLLTALDLQKLEKSAPLSGVPIAYATSISQPHLQVDDLGMG
ETAIATQRHWVGYPAIGFK
>CT1659 hypothetical protein
MEVESNGVHQYTEQQKQAFKNEFANRRRNQIVLFGLLFILMILYVTADKS
TGLMLGTYLAAIFTPLLLVVVIGALVFFIYQLAVPGLQHKYLGRSMNPHF
CWKCGIALS
>CT1339 conserved hypothetical protein
MAFKAKIKVTLRPSILDVQGKAAQHALENLGYSSVESIRIGKYIEVIIGE
DFRAEAEQVCTEICQKLLSNPIMEDFSFELVPVN
>CT1290 hypothetical protein
MLERYLPQPLTTLVLCDRQLHPGFGRVVDAVRGMEEDEKNLCNAN
>CT1905 hypothetical protein
MGREWQGNKKAAVFDSSFWLVFRSVLSTWGGQPGALHEALLRSP
>CT1861 hypothetical protein
MPFTVTIEVSKQFETSTLPEKVFALLSDVPRSASYFPDVEKLEPLGSNAF
RWIMEKNAIGGHTLKQTIYACTYRSDRVTMSVAWVPVEGEGNARVEGNWQ
IEFAFIQKIDRYISNLKETFAK
>CT0935 TonB-dependent receptor-related protein
MFWCRFSPFPLSLSVDDSVNSTQDNTMNRRTLAIVMAAMLASPTLHAAET
GSDYLGNEIVVTSSRVPQPKKELTSNVTIITKEEINESSANDLSELLAEK
NLGQIQKYPGTLTSVGIRGFRTESHGNDLMGKVLVLLDGRRAGTGNLAKI
MTANVERIEIIRGPAAVQYGSAAIGGVINVITAKGSGAPSASIEQKIGSN
DFSMTEATIQGKSGRFDYSGSISKSDSGSYKTAKGETYQNTGYHDQKMAS
LNVGYEIADGQRIGMIVHSFDVDKAGSPGYFSLPDLNAYTVQNNHSVDFR
YDGTTSSRTLSWMARYFTGKDEYRYVDPNWGESTNNVDQQGAQAQVSWQP
GALRLTAGFDWLKYELDSTSAPTWASYENPAGFLLGKYGLFDERLILTAG
VRYDDYHVDMKEGEGTSKSRDNFAHQAGVAWQATDFLKLRGSYAEGFRMP
SARELAAHITSWGTTYIGNPDLKPESSETYEVGMDLTWKGLNGSLTWFST
DYKDMIQTKAIAPTTYTYINVGSSTVSGIEAELSKVFALEGSSWTLEPYA
GYTYLLRYRDNQTGDDLLYTPQWNASTGLRVHDGCGFNGAFNLAYTGKTL
VQNWETGFGEVVPKGGFTVVDLSVSKKFPFGGKGVKGPGITVRAEINNLF
DREYQYVKGYPMPGRTFVFGLKADI
>CT1138 hypothetical protein
MTVIGIQYWRKNIHALFGISSGRKPQVGKDDKDSGLLRSFVRRDHRRRSG
GSN
>CT0638 peptidoglycan-associated lipoprotein
MRRTLTGIIGAAMIILAGCSSKSAVSTDETSRAGYGSGMGGGTGAGAGVS
VEDIGQGGKAGSIIGDIFFDFDSSALSSEAQEQLNQNAAWMQKNPTSAVI
IEGHCDERGTDEYNIALGERRAEAARMYLVNLGVSGGRLSTVSYGEEKPF
DPGHNEEAWAKNRRDHFVVK
>CT1284 hypothetical protein
MIRKTVKTVGTAFVFSPFGMPLLVHGAAGLLVGAVGLNLLNGVINDVKSA
GDILQKEMSKPSDRQQDEEPE
>CT0655 peptide ABC transporter, ATP-binding protein
MEHILELKDLKTWYRTDSGIAKAVDGVSFSLAQNCIIGIVGESGCGKSVT
ALSIMRLVPMPPGYFAGGDILWKGRSIVKATEAEMRKIRGNEIAMIFQEP
MSSLNPVFTCGDQIMEQILNHRDVSKPEARRQAVELLNMVGIPNPSERID
SYPHEMSGGMRQRVMIAMALSCGPELLIADEPTTALDVTVQAQILELIGK
LREERGMSVMLITHDFGVVAELCEEVVVMYASRIAESGTVRHIFENPLHP
YTQGLLKSIPRLGAKKERLNVIEGNVPSATRLPDGCRFAGRCPLADDHCR
REQPPILEYEPGHRAACWKIT
>CT1932 peptidase, M16 family
MHPNPSAIVRKNIVSPSIGNATHPAERLSTGIVESGTLPNGLRIVSNQVP
WIHSVTLGLWINAGSREDPEGFEGMAHFIEHALFKGTQKRDYVEIARCVE
ETGGYIDAWTTKEQTCLCVRCLREHLHLAFDLLADLCCNPVFPPDEIEKE
KEVVLEEIASVNDTPEELIFEDFDRRAFSRHPLGTAILGTEESVERLTGK
EIRDFMRRHYVPSKMLVTAIGNIEHDAVTGLAESFWGHLKDSPQEDSVRR
LFDLSAYRPFTKTLKKSVFQSQILLGTIFPRDDRRFWGLMVLNAMLSSGM
SSILNLELREKRGLVYQAYSSVSFYDEVTEFNVYAGTDKGKTSKTLDTIA
ELLTGNVLKEPDPFELAAAKSKMLGSMILGMEKMTRRMSHIAQDMFYFGR
YLSPSEKAGMIDGVTAEDVAVAAAALGIPEQISTLVYKPGGR
>CT0980 ArsA ATPase family protein
MRNIIFTGKGGVGKTSVAAATALKAADMGYKTLIMSTDPAHSLGDSLDIE
LGPSPVKVAENLWGQEVSVFGDLNLNWDVVREHFAHLMASRGIEGVYAEE
MGVLPGMEELFSLSYIKRYNEEQKDFDLLVVDCAPTGETLRLLSLPETFG
WFIKMIRNIEKYMVKPVIRPLSKKVKKLDDFVAPEEVYEKVDNLFSSTEG
IIDLLADGTKTTMRLVMNPEKMVIKESMRALTYLNLYGITVDRITINRVM
PDQSPDPYFQQWRNIQQKYIDQINSAFAPIPVAEVPLFNNEVVGLEMLRK
VGEKVYGDENPLDIFFREDPINITKISDGHYKVRVKLPFMESMGLEPKIM
KLGDDLTIRIGDYQKIVALPIFLAGMESTGASFDSGWLNIDFTKE
>CT1200 hypothetical protein
MMNVQKQSYVFVLTRASIVILNVSSCPVFRNIHGLNLPLACF
>CT0994 cytidine and deoxycytidylate deaminase family protein
MSYQPSHIRFSLPEWLESYCGTYQPSASLEARMRFVVGASRKSVEEVSGG
PFAAAVFEIESGRLVSLGVNLVLTQNSSILHAEMVAIVLAQMKLGAYDLG
GFGMPAHELVTSTEPCVMCFGAVLWSGVRHLATGALSEDARAIGFDEGPK
PEKWIEELEARGIRVTTGVERDTARDVLQLYARMGGQIYNARKG
>CT1233 hypothetical protein
MSGCGRYGVMTSRWGSGRVPIPPVRLLTGTNRRTSFAGSNRAYEGECCCS
LLLSRRIYQEAPNHKLQMLVSYAALPQTGRFHRAQADAEMTAFLWMSMIE
RIRRHNMVSLQYSSTSSPNSRKSTGCMPTDIFLAWRRSERRLVLIRVFWY
PESYIFSLGGNLKPIDLCSVTMQ
>CT0893 hypothetical protein
MRSSMFMQSRHNTKQHNTQKRSIMKKTLFLLGLATAMGFNNAQAVDWNWN
GDIRFRYDSSKTELASSPDKPADDRYRLRARFGVSPTINDELSAGLRLAT
GDGKNPTSTNQTLGKDFADKAIWLDEAYINYHPKALEGKVNVLLGKRDIA
KTFNVVKDLVWDSDVTIEGATLQYGKDVSGKQKSGPSLIAGYYTLENYAT
ASDPCIFAVQGAYMGTVSGADFNLGASYFDYVHMKNVAWWNSPNGPANDG
KDFRILEVFGTLGGKLGGSLPATLYAQYAHNTALNSDNNAVLAGLKLGSD
KKPGGWTLDGGYFYIEKYAVTPLTDGDRPMSSKYSTDIKGLKIGATYQLV
QNMTLGATYFHVNPVDSSLTGSTSDHKNLVQADVAVNF
>CT0926 hypothetical protein
MVKVNCTTLVPYFFMLKVWEDWIDTLPIRDHELGSCLPMNRARRMRTFLP
R
>CT0113 ferredoxin oxidoreductase, beta subunit
MQKNIILAGVGGQGILSIAAVIDWAALHEGLNIRQAEVHGMSQRGGAVQS
HLRISDQELFADLIGLGTGDLILSVEPLEALRYLPYLAPDGRIVTSLDPF
VNMTGYPPMEEIEAELGKTGRPVLVHAADLAKDAGSARASNMVMLGAGAP
FAGIAPSKLEASIEALFSTKGQEVVDINIRAFRKGLQLSNTELHHP
>CT0885 hypothetical protein
MGTLDGTGDIDCSNRGLTSLEGCPEIVEGSFNCSGNRLTTLEGAPRITGS
FDCSGNEIVSLEGGPEKVNGDFNCSSNQLSCLKGGPSKVKGDFLCSGNRL
ISLLGAPKKVKGYFDCSDNQLVSLYGGPIETGAFNCSGNRLRSLLGAPDE
VHAGFDCSSNLLVSLDGAPEFVNGDFSCANNLLENLAHGPVEVSGNFNCS
GNRLMNLKRFPKRVEGELDCSGNPILACDVTGPESDRNCIRVVHGGTVRC
CCRKTDSSQLEALSS
>CT1462 conserved hypothetical protein
MNTYRIVVVVGSIRRESLNRRLADALIRLAPADFAFHHLRIDDLPLYNQD
DEEHPTESMQRLRREIAEADGIMFCTPEYNRSIPGVLKNAIDVGSRPYGY
NAWQGKPAGIIGLSSGACGTAMAQQHLRNILAVLDVPTMAQPEAFIQFRD
DLFDADGGIGPGSRDFLQGWMDRFAAWVRLCAKPRNG
>CT1414 zeta-carotene desaturase
MKVAIFGAGVAGLSAAIELVDRGHTVELYEKRKVLGGKVSVWKDSDGDSI
ESGLHIVFGGYTQLQKYLDKIGAGDNYLWKDHSLIYAESDGKQSFFKKAN
LPSPWAEVVGGLQADFLTMWDKISLIKGLWPALAGNEEYFRSQDHMTYSE
WHRLHGASEHSLQKLWRAIALAMNFIEPNVISARPMITIFKYFGTDYAAT
KFAFFRKNPGDSMIEPMRQYIQSKGGRIFIDARLSRFELNDDKTIKRAVL
RDGHTVEADAYISALPVHSVKKIVPNEWLEHDYFLNLHQFVGSPVANCQL
WFDKKITDTDNLMFSQGTTFATFADVSITCPDDFQAGMGTACGGSVMSLV
LAPAHQLLDLPNEVITEMVMKEIHDRFPKSRDAKLLKSTIVKIPQSVYKA
VPDVDQYRPDQVSPIRNFFLAGDYTYQHYLASMEGAALSGRQVAEKLHQR
MGR
>CT0890 methyltransferase, putative
MSEQNWTDTGRFDNKAAEWDANAIRAALADAVARAIIAHMPVGKPANALE
FGCGTGLVTTRIAPHCLQLTAVDSSREMLRMLGEKIAASAIANVTPLHLD
FSRPEEAAGLDRDYDFVYSSMTLHHIPDTASFLRELIGHMSPGGALAIAD
LDAEDGLFHNDATEKVHHGFDRTELQALLESAGFAGVSFMTAHIIEKKNR
EGNLKNYPVFLVTAVKPKA
>CT1989 conserved hypothetical protein
MPGLYFLSDLHLGLQEPQAEQEKLERLEKLFSLIREQGGALYLLGDILDY
WMEFRHVVPKGFTRFFCMLSGLVRSGVEVTWLAGNHDFYLGSFFDDELGV
KTCYGLQEVRYDGKLFLVAHGDGLGEGDLGYKLFARFIRNRFNLGLLTAF
HSDLSTALMKHFSLLSRKHKKVDMRAESTRLLDFAAALARERDFDYFVCG
HNHSERVQALHDSGSTYVNLGSWIEGRYHYGVYEQGQFRLEKL
>CT0025 hypothetical protein
MPRRIKPCYLELLLEIGNFSHSAQLYRYIYLRKIYLRKHPFSRI
>CT1619 hypothetical protein
MLDLFLSRDKPDSHRDRAAGWAFRFHLSAVVGTGAVSS
>CT0413 conserved hypothetical protein
MAVFSRRERRAGCYREPRRVVASRCPFVKREGGFMNHIKSSLRWALSLII
ALLLFSIQACKPSEHAKTASGTEANEAVPLRYAKLFTMKRENGCTRIDVF
AGEKNSSVPVARYLLVPRGKAVPEHDADMLVVKTPVRRLTCDNGLEVSFL
DLLGASKALVGVSGKVLISHDRVHRAIDDGTLAVTGYGRSTNMETLLALK
PDVTFIITSFGVDLPFRLRSYGITPGLFSAYHEEHPLGVVEWIRFMGAFL
GKESLAEAIFREKEAAYLAMQQRVRKVQKRPTVIAGYMRKGTWSTMTAPR
SFVTMLDHAGADYLFKELEPERGYLLSGEVAMQAGQQAAFWLNTHSQAST
LQEVLTEDGRYGEFRSVREGRVYNNNGSCFRNGKTRYWDIGMTEPDQILA
DLVAIFHPELMPGHTLRYYRRLE
>CT0779 hypothetical protein
MMIENEALITGAGRVSALWLFSKPKLECFFHNFHNLLVFPL
>CT0515 hypothetical protein
MPSSSFRRDALPSVMTVMDRRMVLFPIFCIIRTRTGIGRKSNRPSPPGAP
DNRLFAQWLSPRPELAEKKQALNRCTFYK
>CT2146 iron-sulfur cluster-binding protein
MGTTREIFWNIGHGTIVVMYLLALVAVGLLAEGFRRRRDIWRRGKPLGRT
DNLALRFSRFLAETVSQRKVAKVAEGGVPHAMLFWAFLLLFAGTLLVMVQ
ADFLTPFFRFSLLSGDFYRVYSLVLDLAGLLALLALGVLAARRYVIRPPG
LESTPADARIHLLLFSILLTGFIVEGVRISATELRDNPALALWSPVGYLF
ATLFSGMSVHAQRSVHQALWWLHLLLGLGFVAIIPWTKLRHIATTSGNSF
FEPHEPKGTLAPLDLEDESVERFGASVVGDLSWKDLFDADACTLCGRCQQ
RCPAYLTGKPLSPMKVINDIGAVAGTENDKSLIDTVSRDALWSCTTCGAC
EETCPASIEHVGKIIEMRRSLVLMEGEFPGDEARRACDAIEINGNPFGLS
HLSRGEWAKGLPVSAADGKIETDILYFTGCYASFDPRNRRVAESFIKICE
AAGVSVAMLGKAERCCGEPARKLGNEYLYRMVAGSNIAAIRAAKVKRIVT
ACPHCFNTLVRDYRELGLDVPVEHHSTFIRRLVGEGRLKLHQESFSATYH
DSCYLARYQDIIEAPRLVLEAAGGRLTEMEWSGSETFCCGAGGGRILAEE
KLGMPIVDERLRMAKMAGASTVATACPFCLSMFEDGLKREPAESRLDVLD
LAEIVARRIERH
>CT0296 hypothetical protein
MNAADSVTTKYLFLTSIIMQYLILLITGIAAGLLSGMFGVGGGVIIVPAL
IFFLGMSQETASATSLIALLLPVGLLGVYEYYQAGKITTEHIWFGLIIAL
GLFAGAFFGAKLAIELSNDLLRRMFAVFLVLVAIRLWY
>CT2240 polysulfide reductase, subunit C, putative
MIEKALKGGRGYWTWVAFLLVVIGLGVSAYARQMSIGLGVTGMGRDISWG
VYIAQFTFLVGVAASAVMLVLPYYLHNQKAFSKIVIVGEFLAVSAALMCM
LFILADMGRPDRVLNVLLYPSPHSMVFWDVMVLNGYLFLNLISGWAVLGA
ERKGVAPAPWVKVLIYISIPWAFSIHTVTAFLFAGLPGRHLWLTAVLAPR
FLASAFAAGTAILILITFILKKVAKFDAGQEARLKLAVLSSYAGLANLFF
LGTEFFTAFYSNIPAHKHSLQYLFFGLEGHAPLVPWMWFSLITGIISVGV
LLSPPLRRKNGALIAACIGLVVSIWIDKGMGLIFGGFVPTPLEAIVDYMP
TATEISVTLGIWAIGLLVLTMLLKTAIAVKTQE
>CT0654 hypothetical protein
MALTELLNLFIRDGHVSILHVTHEILRRTENKNEFLFKTT
>CT0197 membrane protein, putative
MLQNTMNKNDAAAQALSKVPQVTLMFWAIKILATTLGETGGDAVTMSMKL
GYAEGSLIFLAFFAVALLFQVFSVGYHPARYWAVVVATTTVGTTMSDYLD
RTLGLGYVNSSAILLLGVLLVLFAWNRIMGRIEFEGIADRRDELFYWLTI
LVSNTLGTALGDFVADDVGLGFQLGALLFGVLIVAVAIAYYKTSISPNIL
FWAAYVLTRPLGATLGDTLTKPLAHGGLSLGRISSSLVILALMVAAITLN
HRAEQRRAVAVAS
>CT1282 hypothetical protein
MTKKKHAKVWRLEKKRLFQRQYRKSLKSINKKSSSSPSG
>CT0355 hypothetical protein
MANYRQIASGADGRDYSDILIKLDRRTVSQVTSSTVIMLADNLIKNTPST
VPVANAPRLSKIDDSKSSSISSKLR
>CT0690 hypothetical protein
MKYLDVVKKADTIGLFTIKNDIDMESVQKDYEAYKLNPE
>CT1399 conserved hypothetical protein
MAVYSFLDLAYDVLKVATQPLTYQEVWQAGKENGLTDKIKTSGKTPWQSL
GAQLYVEVRDNEDSRFMKVGKRPARFFLKDRAAELASDAVAKIEKEESKK
KEKKTAYHERDIHPLLTYFAYANPSFNRGRSIFTKTIFHERSQRSGYNEW
IHPDIVGFYLPLDDWRPDVIEFNRLSDNNSLKLFSFEVKKSLTKANYREA
YFQAVSNSSWAHEGYLVAAEILQDDEFLAELERLASSFGIGIIHLDPIDI
DSSSILYPARVRDILDWETINKLCEQNTDFEKFLQDVKIDFESKRIHRAE
FDEIVKDIRKYIREKLKIEPEA
>CT1844 hypothetical protein
MLPMRLCSFRVQKKQLFVALSYISKKANGLKSPQQ
>CT2104 ABC 3 transport family protein
MWSSIARLKDTQPTRITAMPDILHFEFMRNAFLAAILSSVACGIIGTYVV
IRRLGFISGGIAHTAFGGIGLSYYLGLNPLTGIIPFSLAAAIGIGLLSRK
AKVAEDTAIGAFWAAGMSIGVILIGLTPGYAPNLFSYLFGNILTVPDSDL
QLILGLDCLIIAVVWLFDKEFLAISFDEEYARISGLKTLALDLLMLCLIA
LTVVIMVRIVGIVMVIALLTIPAAVARSFSHNLYRIMVIGALLAALFSIA
GLWLSWVFNLASGATIILVAALVFLVNALFGAGKARRAEC
>CT1322 hypothetical protein
MHQDRTPGRGKVMARVNGSVWFEMKNPARRKVLRDFFCGKTITVW
>CT0117 sulfide-quinone reductase, putative
MAKVVVLGAGVSGHTCASFLKKKLGKQHEVVVISPNSYYQWIPSNIWVGV
GHMTIDDVRFKLKKVYDRWGIDYKQAKAVSIHPEGDANISKGYVTIEYTD
EEHAGYTETVDYDYLVNATGPKLNFEATEGLGPDKNSLSVCTYSHAAHAW
EELQKSIEKMKNGQKQRFLIGTGHAMATCQGAAFEYILNVAHEISRRGLS
HMAELTWISNEYELGDFGMGGAFIKRGGYITPTKVFTESLLAEYGIKWIR
RAGVYKVEPGVAHYETLDGEMLSQEFDFAMLIPSFSGVGLTAFDKSGNDI
TDKMFLPNKFMKVDADYTAKPFGEWGANDWPTIYQTPMYSNIYAAGIAFA
PPHSISKPMTSVNGRQIFPTPPRTGMPSGVIGKIIALNISEQIKGNHKEH
HHKASMARMGAACIVSAGFGSFDGLGASMTVFPIVPDWEKYPEWGRDMTY
SVGEVGLAGHWLKFMLHYLFFHKAKGYPFWYLIPE
>CT0692 hypothetical protein
MIVKDQNGAVVTICRYLLNTRQTNSKMNLRSIRPFVDPHLGDDFVSFKRY
YDPFVFFVFQ
>CT1439 hypothetical protein
MTGIAVRIGETEIVTGTTGGIMTIEEIPATGMTRTAIRGNPNTDPGQA
>CT1410 conserved hypothetical protein
MKSYHKELWMEVPSRMDFVNITRDVSRALDESGIREGLCLVNAMHITASV
FINDDEPGLHADYKQWLEELAPHNPSHYQHNRTGEDNGDAHHKRQIMGRE
VVVAVTKGELHLGPWEQIFYGEFDGRRRKRVLIKIIGA
>CT1189 conserved hypothetical protein
MVQIAGTPVPLSAKSRGPNALSSLLAALFLLLSQPASGAEAKQCIDNAEL
DAADKAFNSLHYAKADSLYQSMLQTGDQSSTLYWKLARLNISIAEAIDPS
ERKKRIPFYNKAVEYARKSVQLDENNASAHTWLAAALALKADKIGAKEKL
NRAAEIKRELDKALALNPNDDVAWSMLGSYNFEASKIGWFSRFMGSTFVG
KMPKGSREEAEKDFKKAISLNPRVIRHYHELALLYLEEDRKQEALNTLRI
AETRPVLMKSDVRRLKEIKKLIAKLSKEIEEK
>CT1970 heat shock protein, Hsp20 family
MALTLYGKDPLKMFEDVFNERLTPFISSMGSMMAPAFKVDISEDEKAIYL
SADIPGVKKEDVKVSIEDDVISISAERTQEEEEKKKNYHRVERSWGSLSR
SFTIGDNVDSDNITANYDNGVLKVVIPKKEPEQKKSKEIAVS
>CT0824 conserved hypothetical protein
MILLSPQEQQSRAARIRLVLSDNDGVFTDNGVYYSERGEEFKRYSIRDGM
GVERLREHGVETGIMTGEVSPSIVRRAQKLHIERLYLGVKDKQSRLADVL
SDTGLSKAEIAYIGDDVNDIGIMNAIAPFGLVACPGDAMPLVEPCVHYRC
TAQGGRGAFREYAEWLIALRAS
>CT1336 conserved hypothetical protein
MEKRLPDACLSEVIGKEQLDEALDLYRGDGIDARIARAAAEVEGLYYGKL
TRAEEIVAFARRIGAKRIGLATCVGLAGEARVFAKILEANGFEPFSALCK
AGAVDKSQIGIAEELKITPGSHESLCNPVLQARVMNEQPTDLNVVIGLCV
GHDSLFTKHSAAPVTTLIVKDRVLGHNPAAALYASGSYYKRLLEPGREL
>CT2070 ArsA ATPase family protein
MRLILMTGKGGVGKTSTAAATGLRCAELGYRTLVLSTDPAHSLADSFDVP
LGHEPRKICENLWGAELDVLEELEHNWGSVKRYMTEVLQARGLDGVQAEE
LSILPGMDEIFGLVRVFRHHKEGNFDVLIIDSAPTGTALRLLSIPEVGGW
YMRRLYKPLEKMAVYLRPLVEPIFKPLAGFSLPDKEMMDLPYEFYEQIEA
LGKILTDNNVTSVRLVTNPEKMVIKESLRAQAYLSLYNIAIDMVVANRII
PDEVTDPYFQYWKENQKLYRQEIIDSFSPLPVREVPLYSREICGMATLEK
LKEMLYGDEDPSQVYFRRNPYQIKQSEGGYDLELLLPGLPDDSVQLSKSG
DELNIRIGNHRRNMVLPQALATLRTTGARWEGDRLIIGFSEELS
>CT0789 hypothetical protein
MSALRKSVLIAPSNRPCLNMKCSVQRSVRLISLLAVLGFFHGTAAATLWS
SGPRQTGEVVLDEMVAGAGGFVVRVGSNGCTSKNSFEVLVNKKPGLTDLA
PHYELTIVRKVPDECKAIVEDGTVIAYDLKQDLGISGHYTYTIANPVYSP
RPYADTSDYYLPNALKDAAPDFPKVTEVRPEPFEKYTARPDFFSCLLPVD
WKRSPADPEGDAKAGIYEVQLTKEELAKPEDGEKYYFPHPLIYVGYYAPG
NREGKTYANYLADYDRLMRKNTGSKQSSYSKPVKTTVAGLEATVTDYEVW
QELPRGPLFTTKYWLKARFVFVKARQGFYVLAFKSPKEFYI
>CT1374 hypothetical protein
MLKANDEIDFCMIHYDTGLEIKKKQKSTSMEGACLVFKIISKQQTGKFLN
SDHRAHENLA
>CT1869 hypothetical protein
MRFILTVIAAIPEQRPRKNTITVKQWHERKNPIKKRDTAAKRDALLSN
>CT0839 glycosyl transferase, group 1 family protein
MKKLRIAQVSPLIESVPPKKYGGTERVVYYLTEGLVERGHEVTLFASGDS
ATSARLIAPVKESLRLGRKIHSTTIMHMLMLSKVYEEMAGEFDIIHSHLE
YLTLPYASCSRTPTVLTMHGRLDLPDYADILKRYSSMAWVSISDSQRAPV
PDINWVGTIYHGYPENLFEFNPDPEDYFLYLGRFSEEKRPDEAIRLARAC
KIHLKLAAKIDTADKAYFKAKVEPLLDSPYIEYVGEVGDSRKGELLRNAK
ALLNTIDWPEPFGLVMIEALACGTPVIVRRCGSSPEVITHGVTGFICDSQ
LDFIRAIHNIGTISRIACRREFEQRFTLRHMVDNYETLYRKVIAASSATD
SLSSLP
>CT1208 conserved hypothetical protein
MICDDARPNVVFEAIESSLLKGNPLGDPATRHVPVYLPPSYNGTERFPVI
YLLAGFASTGISFLNYGFGRQTLPEMIDSMIRRGEMPKTIVVMPDCMTRY
GGSQYVDSTATGPYETYLTSELMPHIDRKFRTLAHARHRAVVGKSSGGFG
ALRLGMRHPELFAAVGCHSGDMDFDLCYRPNFPVAARILEKYDGSVAAFF
TRWESLSKKPRGEFALLELMAMAACYSPDPSKPAPGNMRLPFEPRTCQLV
PEIWEQWKSFDPVTMLEETKNQDALGSLRLLFLDCGSQDEYNLQFGHRRF
SARATEIGIAHRYEEFPDTHTDTSYRYQVSLPLLARAISE
>CT1623 hypothetical protein
MLMADRFGWTTPGKNHPVCPSMKSFAPIDHHPALAP
>CT2208 hypothetical protein
MRHCSGKRLSRRCFRVERKSHWISENQMRSMCSAPCRKLVNPDFFENAPQ
EGTDAAQAEKRRTDAKLRAAHYLKARARQKRFE
>CT0461 conserved hypothetical protein
MKKKEHSVRYTDEELRAMQERGESESDWKTAAAMTDEEIEAAISSDEDEA
GTVLDWSTIMVTPPQPKVVLNMRVDYEVMEFFRGQGKGYQKKINAVLRSY
VEHQRKSGGVKA
>CT0317 glycosyl transferase
MTSLSIIVPLYNERESLPEFCESLFAALKSSELKRCFGDEFSFEIIMVDD
GSTDGSDKVIGELMTDRPELRLISFRRNYGKTEALSAGFRAASGEVVVTI
DADMQDDPREIAGLVLKINEGFDLVSGWKRERNDPMSKTVPSKLFNLTTR
LFSGIPLHDFNCGLKAYSRELVASLDLHGEMHRYIPVLAKWKGFRVTEIP
VMHHARKFGKSKFGASRFLPGLFDFLSVMFITRYLKRPMHFFGMMSLGSF
FFGFCISLYVTIEKYLFDKPAGNRPILFLGILLIILGVQFFSTGLLGELL
STRPSRNGGWSIRKTANLPDEIIDRLTD
>CT0817 hypothetical protein
MLINLSGHTLVTVKTLTRKRSGNTNMKRKIYGVSGMHCASCEAIIEKRK
>CT0750 hypothetical protein
MVKREKVNFLSAYGTYTTRSIYESVRCKYIRYKQKLLIVFYNLEITAAGN
NFFLSRSRDSANNFLFLIS
>CT0752 hypothetical protein
MTALSIAFHLRSAIGLRNKVPHRQQIFNQEEFSPKRQMAHS
>CT1820 hypothetical protein
MPALFFPPRQSKSISPFHPDKQTSLPVARCGGILYHYERVDFTKIAHMTD
YHYPSQALPFVR
>CT1481 hypothetical protein
MSLTPKCSLIIRHYFFTAKSGGAKLINHSSHKSHKKNALYNAEGPPFQAA
LLITNFRETGSYTNFE
>CT0645 hypothetical protein
MFVFSRFGLWRSLITSVIETSKDVPKRFFHFDSLCV
>CT0366 hypothetical protein
MPSSPKKQKRKAEAAAKQPPAAAKPAPATMPLGAMNYLFIALGATVLALS
YAVMYIEKSVDGFFALDIAPFTLVGAYAWILFAIFYRSKKKKN
>CT1747 peroxiredoxin, putative
MPLLGDDFPELKVQTTHGPMNIPGDLKGSWFVLFSHPADFTPVCTTEFVA
FQQRVEAFEKIGCKLIGMSVDQVFSHIKWVEWIKENLDVDITFPIVAAND
RIANKLGMLHPGKGTNTVRAVFVGDPNGKVRLVLYYPQEIGRNMDEILRA
VKVLQISDSNKVAMPADWPNNLLIKDHVIIPPANNVEDAKKRKEQQYDCY
DWWFCHKPLDK
>CT0135 hypothetical protein
MKIGCRHFLVKIYCLGEVRNGLGVMTVHVFDYAAIVVTCCLEQGVTEFDN
HGVVGDCSLELVQLLPYCGPHLVGHDVSVVIIYSLGV
>CT1237 hypothetical protein
MSVILSVDPLFQSTSIREPSMTPKQNNHPLALALFSLLSALLVGHAIYHY
VILPEEIATHFGFSGKPDAWGPKTVFFLWYFIITGLCIVMFVVVNRLLRP
GHLSWLNIPNKEYWLAPERIHDTLHYVRSGMLLFGSGTLLFVLDFINQSF
QVSLGNASRLDHPLTTLAMYLLFCVLWVSALYRRFGRKM
>CT1796 hypothetical protein
MKELSFGFALRVVKLCRFLEKEKKEYVLSRQLLKSGTAIGALIREAQQAE
SRADFIHKLSIALKEAHETEYWIDLLYQSQLIEKKGYESIKSTKNRRDNG
>CT2275 conserved hypothetical protein
MVNSRPRPAGLAGQTGDIGILKDKPFIIEALRAAGALTISLVAALGGRVV
GHIAFSPVTMSDGSFGGFGLGLLSVLPEFQRQGIGGVLIRDGLAWLKALG
AIGCCLVGHPEYYRQFGFENPDGLGHEGVPLEFFFVLSFGGQVPQGSARF
HDAFMASGPAS
>CT0531 sensor histidine kinase/response regulator
MPDPSIDKRNEQQAPSPQPDGMPAVDTPQGNNFFKNIFENHAAVMMLIDA
ESGSIVDANQAAARFYGWPVSQLRSMKIDEINAPSWAHEHPWLKKLSEEK
EDRFFSMHRKADGTLSFVELNFGTIPLENKTLILAIIQDNSERYNFAALT
EFRHHLLEMADNASTEDLLTYTLDEAERLTGSTLGFFDLISDDQSIMRSA
CSSNAKKDNCGHARHPIVINFKAPADVLQKKRAIIHNDHATLRHCNSKSV
SHKEAIRELIVPVIRNGKVMAILEVGNKPANYDQNDIQLLNELSGVAWDI
IARKHAEDSAQKTMKAMQHTQNMDSIGRLAGGIAHDINNMLSTILGHTEI
VIEELNDKSPYVENLLNIRDSTMRAAQLIQQLLAFARKQTILPKILELDT
ALQDEVPMLQKLIGEKIHLELRPGSHGAKILIDPSQFEQLLTSLCANARE
ASNGTGSVIIETSSIKVYPADCYANHPCQTPGNFAMISVIDNGCGIDKNV
LPHIFEPFFTTKEVGNGSGMGLSTVYGIVTQNNGYLECESTPGKGSRFTI
YLPLIEETTVPDRKRLRKDTTSNGDQLTILVIDDNEAILYIVKTALEKRG
YRVLSTTSANEAINIVANSGKKIDLLLADVVMPEMNGKELSRKLRAISPH
LKTIFMSGFPQEINLFDEEDTEHERYFISKPFKLAEFTATIKAILTESHG
TQNPETETKS
>CT0143 hypothetical protein
MYNFFRILQNLFSDFFHLLGGHHFSQGKTLASTSIAQFPTISKVK
>CT1913 DNA methylase, putative
MIAKEKILKGFETAEARWARFGPYYAMFPLEFAFDVVEKYSKNGDYIIDP
FAGRCSSIYAGGVLGRNSLGLEINPVGWLYGTVKLHPAEKEEVIDRLLEI
YSKRNYYNRAIQKMPEFYRICYCDEVLKFLLAARTHLDWRNNKVDATLMS
ILLIYLHAKIGEGLSNQMRMTKSMGMNYSVQWWKKNNMTTPPEINPCDFI
LKKIEWRYEKGKPKVSESAVVFGDSSNKLISIAERAINNDIKFSLLFTSP
PYYSITDYHADQWLRLWLLGGSENPQSLKDKHKGRFVNKQEYYDLLDNVF
GLCAKMMKRESTVYVRTDKREFTFNSTLEILKRHFPNHSVKIVEKPLTKD
TKTQTKLFGDKTMKPGEVDIIMSRK
>CT1308 oxidoreductase, short-chain dehydrogenase/reductase family
MTTSDNGKVLLITGASTGIGRSTAIQAVEAGWRVVVAARSAEKLAALAAE
LGAERAMAVPCDVADWNQQEAMVQKTIGRFGRLDAVFANAGFSKGSSFIE
GEHRPDEWRDMVMTNVFGAAATARLTLPELMKTKGHFLVTGSVLGRVTSM
RNLYAATKWAVTGMAQAIRNEVASLGVRVTLVEPGIVDTPFWEGLQKPAA
PELLPEDVARAVLFALGQPPHVDVSEIIIRPTGQAH
>CT1037 3-beta hydroxysteroid dehydrogenase/isomerase family protein
MRKKIVVTGGTGFIGSRLVHRLAASGEDVYVLVRASSDLASLKECLDRIT
LVYGDVTDIASLSGAFEGAEEVYHCAGITYMGDRKNPLLQRINVEGTQNV
LDACRRAKVKRVVHVSSITAVGISGPNRKFNEESCWNFDTIDLEYARTKH
AAEKIVAAAVKKGMDCVIVVPAFVFGAGDINFNAGRIIKDVYKRKMPFYP
LGGICVVDVEIVVDCLIAAMKKGRTGERYIVGGDNVSFKELAQTIMDVTG
VHQRSFPLPIWAAHAVSFLLKFSPERKKISKLFNMTMFTVASKFLYFDSS
KAQRELGMRYEPFAESIRRTFEWYRERRMLN
>CT2089 alpha-amylase family protein
MKPEFVLPAPIRLDRPFDGRRRVVIDRVFPEIDGGRFPVKRVEGDRMTVE
ADIFTDGADTIVAELLYRRKGEAAWSAVPMTHIGNDRWKASFEVGAPGMC
EYTVQGWVDHFETWRKGLQKKLDAGQDVSLDLRIGATIVELGAARAKDED
AGTLHHYVGLLSVGGSDAAVEAALSEGLLAAMRRSPEKAMATLYDKILLL
QIDQKKAGFSTWYEFFPRSWSEEPGKHGTFRDCIKLLPRIARMGFDVIYL
PPIHPIGLTKRKGKNNALVAGPDDPGSCWAIGSADGGHTSVHPELGTMEE
FEAFVQEAEAQGISVALDIAFQCSPDHPWVKEHPQWFRWRPDGTVQFAEN
PPKRYEDILPIDFESEDWQNLWIALREVFLFWIGKGVKIFRVDNPHTKAF
GFWEWALGSIREQYPETMFLAEAFTRPKLMARLAKGGYTHSYTYFTWRNT
KHELQEYLTELTQTELREYMRPNFWPNTPDILHEELQGGSRARFIIRFVL
AATLSSNYGIYGPAYELCEHVPYPGKEEYLDSEKYEIKQWDLDRPGNIRS
EIAMVNRIRHNNPALQQTSDITFVKIEVSQGQEHDQLMGYVKCSPDGANI
ILTVVTLDDRNTQGGWLRFPLEKFGRPHTERFTVEDLISGRTFEWNGEWN
YVELNPHQMPAHIFRVNLPS
>CT2149 hypothetical protein
MSAPLAFQGAWWSKMMVEGKFGLVYPYCSVFSFFCFFAIFLRKDLEERGW
CVTLGAFT
>CT1906 conserved hypothetical protein
MIAEPFLFRSEGGFIRIPIWGHIPLSAPLKKILAHPSFLRLKGIRQLSFA
QQVYPGANHTRFEHSIGVYHLMKMILQRMVSNPLALELQDERLQFDDETC
RTLLATCLLHDIGHYPHAHVLEEITPAGDSSAVFAHHESLTGQFLNEEHR
DTPSIAAILHDDWQVNPDTVTEIIAGKTAHRLGKLVSGTLDPDKMDYLMR
DAHHCNIPYGSIDIERLIESFVPDPERQRLAITEKGIAPLESLLFAKYMM
MRNVYWHHTSRTFSTMLRRLLQDVADESALPMETLRELFYFNSDDRVLFE
LERAIRGLGLPAAELLDAILERRVYKRAMTIRPYHEPSMEIDPVWFAYNT
SHRRRKEKELEICAMLAGKTGKRLAGHEVLIDAPPSKDVFDYSDFRELRI
WPTKSEHRHLVQPTDSNGYVRFDDFRESVFGSDFILSFEQYTKKFRLLCR
HDLVDTLSGLEEAVVEILRR
>CT1645 peptide ABC transporter, ATP-binding protein
MQREQILSVSGLKVHFPVRSSGLAGGKQVARAVNGVSFEVFKGETLGLVG
ESGCGKTTLGRSIVRLVQPSKGGKIVFRGRDITGLGNREFRPLRTEMQMI
FQDPFGSLNPRLTVGQTLGEVLKVHGITKGTQATAKKIDQLLDTVGLNRD
YASRYPHEFSGGQRQRVGIARALAVNPSFIICDEPVSALDVSIQSQIINL
LKDLQRELGLTYLFIAHDLSVVEYISDRVAVMYLGKIVEIADAGTLYANP
KHPYTQALLSAIPMPELGHQRERIVLKGDLPGPLSIPEGCSFHPRCPFAM
EECRKREPELLSLQDEPSHKVSCLLYQ
>CT0231 hypothetical protein
MKNDCIIAGRKIPYFRQSRQWAKTAFVFILNETLFSQKADYL
>CT0168 ferredoxin, 4Fe-4S, putative
MRKRLLAPREEIAWYPAIDADICNGCEACAEFCRPGVFAPGAPEPDAAVV
KRPKMQVAHPMNCLVLCTRCVPVCPSGAITLPDPLDFERFVEYIE
>CT1771 oxidoreductase, short chain dehydrogenase/reductase family
MKQLENRVAVITGSTKGIGRAIAREFVRQGAKVVITSSRQENVEAALREY
PKDLVHGHVSDVSSYASVESLVDAAVRRFGALDCFINNAGISDPFTSVCD
SDPAVWSRVIDTNLKGTYNGSRAAARYFLSVGRRGKIINMAGSGTDKGSN
TPFISAYGSTKAAIARFTFAMAEEYRNAGLSIMLLHPGLVRTEINHPERT
TPELQKQLKTFNIILDIFAQSPDLAVRYAVKMASSWSDGKTGLYLSALDG
KRKKMMLLSYPFRKLFNRIDRQTY
>CT1156 long-chain-fatty-acid--CoA ligase, putative
MITFKAFLSRRPLLDILVDRLRQSRLPRFSSHHSGIKHSFMGLINPDFQT
LPELFTSVFSHFKGQPDKAPIARKINGAYSPISYDSLAEDCRHFAAYLKE
RGIEPGDRVAILSENRPGWYLADIAILSLGATDVPLYPSLPPNQIEYILN
NCSAKGIIVSNMLQLGKILSIWPKLPELNMVIVMNKLDEPVEDVIDLSQA
KTEGKKVLEAKPWLLDGIKSNPDDVATLIYTSGTTGLPKGVMLTHRNICE
NVKSCSTVIRIDQTDSSLSFLPLSHAYERTGGYYLMFACGARINLAESVE
TISLNISEAKPTIIFTVPRLFDRMKASILKSVTSEGGVKEKIFFWAVSTG
EKYHKQLATGKVSPLLAVQHNLADKLVYSKIRKKFGGKLRYFVSGGAALP
QKTGEFFQSIGITILEGFGLTETSPVTNVNRPEKVKFGTVGLPVKNVEVK
IAPDGEIMLKGPNIMKGYWKDEAASAEVLRDGWLYTGDIGEVDSEGYLKI
TDRKKHIIVTSGGKNIAPLQIENLILDSPYVDQAMIVGEKRPFLIALIVP
DFLKLRDFAAEHQIKASSTKELINTKEVIEVYEKLLKSISRQLATHEKVR
KFLLLEEPFSIENGMMTPTLKLKRKEITTKFSAEIDNLYNTLNMVYNTE
>CT1532 hypothetical protein
MFELFFCRSGNLFSVNELSMRQLVEAMLNSGSVFYVGR
>CT0075 cytochrome c-555
MSRFVSAALVGAALLVSGNAFAYDAAAGKATYDASCATCHKTGMMGAPKV
GDKAAWAPRIAQGMNTLVSKSIKGYKGTKGMMPAKGGNAKLTDAQVGNAV
AYMVGQSK
>CT0740 membrane protein, putative
MRLKKVNRHVMKIFDRYILKEHLGTFFFAFVTIMFVFILTFLTQFLERLI
GKGLDFRIILEVVALQSAWMVGLAVPMAVLVSTVMAFSSLTNSSELTVMR
AGGISIYRLVAPVLLAALALSLMMERFNNVLMPEANYKANALFADITRMK
PGLGIDKNAFSDVIQGYSIMVRDIDNETGELRDIVLYDRGRPDVRTVIMA
ARGRIQFSQDYSHLVLTLEDGQIHELSLPAMDRYRKMVFARNRYVFDATG
YGFERTDDGKRRRGSKELSAAELLSMAREFRMKDRMAESSIDKGIAGLRA
EIESIRKRSSTSPASSPALLPPAVTGRAIELVDTMIDNATERIGQMRENR
ESFYNYMIEYHKKYSLAFACVVFAMIGAPLGVMARHGGFGAGAALSLFFF
VLYWVLLIGGEKIAERGLLLPAISVWLPNVVLAVTGLFMIYRLSSSASGS
GR
>CT1095 membrane protein, putative
MGAFWLSLVMIFLAELGDKTQLVALTLATCYNTRVVLWGIFWATLAVHVF
SAGIGWFVGGLLPGDWIAFIAGISFIIFGFWTLRGDSLDDDETGECKTGV
NPFWIVFSTFFMAELGDKTMLTTISLATTNPFLPVWLGSTLGMVVSDGLA
VIVGRMMGKNLPEKAIRIGASVVFFLFGAWWMYEGGSKFSLPVWSAALLT
LAVAALFFFRDLLFRQRGNEA
>CT1999 hypothetical protein
MLCFDMSQDTAIPSHNRLGVLNVSLKGVAIVRVMMLAQRRISV
>CT0124 membrane-associated zinc metalloprotease, putative
MELLNTIFFFVIAIFILVTAHEFGHFITARMFGMRVDRFFIGFDFWGIKL
WQKKIGETEYGIGAFPIGGYVKIAGMIDESMDTDHVSQEVQPWEFRAKPV
WQRLIVLAGGVAMNMVLAAVIFIGITSIFGESRTPITTPAFIEPKSVFSS
MGMQSGDHLVAINGQKLHYWEEALDPERLASGKLQYTIERNGEELTLTAP
KDIISRINGNQSIGIRPTVPPVIDQVLPGDPAAKAGIMPGGLITAINGSP
VADWSEVVNIISANAGKKLTVTWMHLKNSTGEPLTAALIRKKGQTITTEV
TPNNSGKIGISLKQTIETERIKLSLPQAIASGLNQTWKTTVLTVQGFGKI
FSGKEDFRKSVGGPIKIARIANQSAEQGPISFMYFVAVLSISLAIINILP
IPALDGGQFVLNAIEGIMGREIPFEVKMRIQQVGMTLLLMLFAYFMINDL
LNP
>CT0510 hypothetical protein
MKRLSRIILIQSRITLIQISKTLMTSYLVFEKPLKYAISSAMVHSLHQIK
IKTEHQFHFS
>CT1757 hypothetical protein
MSRRYRAQNQNRDMMKPVDKMDKDGNVLAVCSQPPLHRRTTGNGVPLIKK
GAINFYMNMPCPLKVACKMAIGEFAALYNASHETPIYSPMLLDGDTKGIE
GELKAAMSEDELPEVLVASGLHTVMAKGFRERFIESGIYEGVTSQAALAR
MPESYRKLVTEHNIGLFSTGFWSVVCDLSLETIVPYPRRWTDLVDPLYKD
LITVHGYNGKASIAALLLLLREQLGSRAVTDFAGNIRNIWHFAEILKRID
SAEPRRTLFNLLPNAATVQMPSKKRAAILEFEDGPVLAPMLMYVKRSRKE
ECQPLVNFMHSNVIRQALRRGDFHLADEFDWTQPFSFPSWEFLLQNDYET
LSAALDVELKKGLREDVSSS
>CT0505 hypothetical protein
MNLNEIKTSLCPNGRNIQVGDPDGVSGITHASHDDAPCRRFTSVAEREIA
VTRACRLPNSFSRVSASELSDDDGGSF
>CT0093 membrane protein, putative
MLTFLAIIVVAYLIGSIPTSIIAGKLLKGIDIREFGSGNAGGTNAFRVLG
WKAGLVVTLIDIAKGTIAAVPVVGFFKAHPLGAFPDMNEIALSLIAGMAA
VIGHVFTVFAGFKGGKGVSTAAGMLIGIAPVSMLMVIGIFLLAVTISRYV
SVGSILAAIAFPLIIAIRKYVFDLGSGLDYRFFDHWFVHDSLDYHLLIFG
GIVAVAIIYTHRANIKRLFSGTENRLSFGRKN
>CT0172 noeC protein
MNKPLVVDLDGTLLRSDMLIESVLAYSKKGIRPVFHLPFWLLEGKAILKA
RLADRVDLSVEVLPYNTEVIDFIKQAKREQRQIVLATGTHKMFAEKIAEH
LGLFDLVIATENGINLSARSKRDCLIDQFGEKGFDYIGNSHDDVVVWNAA
DKTYLADPEPGVKKKMDLQGGVARVFQTRQSSLRTFFRALRPHQWLKNLL
LFIPLIASHQLTNITLIADALLAFVLFSMTASSGYLINDLLDLESDRYHP
RKRNRPLASGELPILIGLISAPLLFLASLSASLILLSATFTISLVTYYLL
SVVYSQYLKKIPLVDTIMLAGLYTVRILAGAFACSIIPTFWILAFSVFFF
LSLAMLKRYAELLDLQTKGDTAKTSGRGYYPSDIQIIASLGTASGYLSIL
VLALYIQDARTIAMYKTHELIWLACPILLFWISRMWMLTHRGQMHDDPVL
FAIKDRTSLITATLFACIFFLAILV
>CT1730 type III restriction enzyme, res subunit
MARTTIDRLIINSPYEEPARHWRYDRETRTFDLVEGRRPAGYVVASSDSR
AFDDPGIFVEIPLVNQIRPRVKAWREAGYAGVTGITKRLLEHWRDPEEFE
TRRFFFCQLEAVETLIWLMEAPAAERVGIEIPSDGGDFVRQCCKMATGSG
KTIVMAMTIAWHILNKVANPQDARFSKNVLVIAPGLTVKSRLAVLEPAGA
GNYYEAFNIVPSSMLDKLRQGKVLVRNWHALAWESEEQLKKRKSVDKRGA
KSDEAYTREVLGEMANARNILIINDEAHHAWRVNWEAEGKYLRARDLKDS
AEEATVWIGGLDRLNRSRGILTCYDFSATPFTPSGKKSSEEALFGWIVSD
FGLNDAIESGLVKTPRVVVRDDAVPDAKTYKSRLYHIYNDPDVKDDLNRR
AQPEEPLPDLVLNAFYLLGYDWRETWKTWQKAGLATPPVMITVCNRTETA
ARVKHAFDTRRIHIDELCDPERVLHIDSKVLDDAEAQEEPAAAVVAPEDD
GDAEEEAAAPVARKLTKAQQAELLRRTVDTVGKAGQPGEKIQKVISVGML
SEGWDAKTVTHIMGLRAFTSQLLCEQVVGRGLRRTSYEVNPETGLFEPEY
VNIFGVPFTFLPHEGGEDGPPPPPTPKTAVEPDPSKAQFEIRWPNVVRIE
RVFQPTLTLDWSKARVLELDAAQTAQVAELAPVLEGKPDVTKIERIELES
LARQFRTQRIIFETARDVFDQMKHTWQGSREVLLAQLVRIVEEFIRSDKI
AISPPLFYQDELRRRLIITLNMSRVVQHVWEAVRQENTERLTPVFDRDHP
IRSTDEMRTWYTGKPCERTRKSHINVCVYDSTWEAADAFALDNSDAVATW
VKNDHLGFEILYVYRGVVRKYRPDFLVRLADGEMLVLETKGQDTEQDRVK
RRYLDEWTQAVTAHGGFGRWRWAVAQHPGEIRDILMQGEGARAGG
>CT0427 hypothetical protein
MFPPFVLLKSLVNQLIEIVSLFPANLIAVSATQKKDYTRLLAGQHEPVLQ
AKPRDNSWTLITKS
>CT1744 conserved hypothetical protein
MVPLGMLGKGEVGEIINLRFGQHDAGAGFGRGHHYAHGRENGKRLAEMGF
SVGQTIEVIENSPGMPLLVQVHDGRVAIGRGVAMRIMVRRVAS
>CT1469 hypothetical protein
MLVLVTMKGCVYIADMWSDYYQISHMPGGMEWVSL
>CT2253 conserved hypothetical protein
MSTVPASIPVLQWAASRAWLSDTELKKRFRKWPLWLKGEASPTLKQLEDF
AKLTHTPFGYFFLPEPPEVTLPVPDFRTHRDNHLREPSTALLDTIYLCQQ
RQEWFREYALMQGLQPLRFVGSATLSDNPDAVAARMRQELSLSVDERQAL
PTWTEALRQLIAKAEEAGVLVMASSIVESNNHRKLDTQEFRGFALTDNVA
PLIFLNAADSKAAQMFTLAHELAHIWLAESGLSNPEAGLLPEQQIERWCN
RVAAELLVPHEKLHDVHNLHNPGITVDKEIQRLARFFKVSTLVVLRRLFE
AELIDRATMNQCYQKELDHILSPEGRKSTGGDFYRTLGARTGKRFARAIL
SSTLEGHTLFRDAFRLLGVQKSATFYKAAHELGVMP
>CT0181 conserved hypothetical protein
MWLPERAGRWSCSCAAPGSWGGYSGAFRATARVAPTIFRLLSYIRFMELQ
EKITILSGAARYDASCASSGCSNDTPYRGTGNTSQGGICHSWADDGRCIS
LLKILLSNDCRYDCAYCVNRSSNPVPRASFTAREVVDLTMEFYRRNYIEG
LFLSSAVWDSPDRTMEEMVRVAEILRNEERFGGYIHLKVIPGSSADLIRR
AGLAADRISVNIELPSNESLKRLAPQKSKESILTPMKLIGAEAGFSLVER
TKSRKAPRFAPAGQSTQMIIGASPETDLQILQLSQSLYRKMNLKRVYYSA
FVPVNDDNRLPVLAAPPLLREHRLYQADWLLRFYGFSAEEILSDDAPNLD
ESFDPKTAWALRHPEFFPVEINRADYATLLRVPGIGITSAKRIIAARRFA
PVTHEGMKKIGVVMKRAKYFITCSGRPFEKIDQQPARLRQRLLLGDGSEQ
KQPQQLVLPGLFA
>CT0271 hypothetical protein
MPSCVLVACSFIVAHVVSAAAIRFAAKSWKYSFLIRYLRYNPENS
>CT1588 DNA primase, putative
MIDEVRQSTDIVDVVSDYVRLHPSGRHFKALSPFTQEKTPSFIVSPDKQI
YKCFSSGKGGNVFTFVMEMEKVSFPEAVEMLAKRAGIDIGKYQQQKAKEK
DKREASQFDTLRWAAKLFHGTLQSEAGSAALAYLTGKRGLEPGTIRRFGL
GFAPESWDHLLHAAERDGAPVEHLVSLGLLTRHPKRNTLYDTFRNRIIFP
ILTVGGQVAGFGGRTLSNDAETPKYINSPESAFFEKSKLLYGMHAAKNEI
RRQETAILVEGYMDVIAMHQAGFTNTVASCGTALTRYQAKILKRYTSRVL
FLYDGDNAGKKSMLAGIDILLSEGLTPWVVMLPGNEDPDSFIRNYGKEAF
LGELESEKSSFQDFQLRCYQDAGWMESPDTASKAISAMTRSIALIADPVQ
KELYIDELSKKLDLSRQTLREVMAASDTGGSTRESRQRRSEPAPQPLARQ
APLSVTERTFLEALIESTFYGNEVLDFAASHESMFHLEHPAAQTIFSHLV
RRFREMNDRDGHLDINSEISSIGMEDASNLAFDILFRLPVNETTRLTPAE
LEQHARRCLSHFLVAVKALVLEPLQKEKQKIIAQLQSATSTHEQERLSRE
LLEHNKQFRAMEQEVDESIRGILGE
>CT0239 conserved hypothetical protein
MTDDLIKKAIGETIGELSATSGEEIYVVEAAIRAGGRKIELTVDTDKGVS
IDQCAKLSRAIRARLEACEEDSMLSSGEFDLMVSSPGIGEPIQVQRQYLR
HLGRLIRVNYLDEEEQPKEITGKLLEAAVGPEAEQPSITIEAVKEGRKKR
TAGEPPVTIRLADVVRAVVQTEL
>CT1604 hypothetical protein
MSANGLSGRCGRTAFTQAIKPVPLYRPRSNELLAAFKKAKTDYLRRKILY
RFFHSISSVLWITRLKYG
>CT0979 membrane-bound lytic murein transglycosylase, putative
MGAHLFFCLFAAAPCINGALWLCFQIILYCLKPEGASATSSVNPARLVKS
FLPLRHFLVCISLQVVALSSVSFGAEEPDSASNVRASRVSEMLDSLVTAT
YFQDDRFSSADRSGSAGYLPKEFIPQFSDSVYASRIAALASKSRFNLVYN
DHVKGFIRLYAVDKRKMVSKVLGLTHIYFPLFDEVFRQYDIPPEMKYLAI
VESALNPTAVSRAGARGLWQFMSGTGRMYGLQSSSFIEDRYDPTKATVAA
CQHLRDLYDMFGDWFLVLAAYNAGAGNVQRAIRRAGGAHDYWEIWPYLPQ
ETRGYVPAFIAVTYVMNYYREHNIRPAQPGYLYSETAMVPVKQALTFEQL
NETLGVSMDDLKFLNPQYKAGLIPAPASSPNMIRLPKQYVQLFLQKEQQI
YAYKPEMVEEKERLYAMVQENDRQDVVFSSGRGQKTHVVGKRETLASVAR
KYGCSVSQIIDWNNLKSAKLRPGQRLVVFKVVNERFSRNASATKLKGKKG
RLKSKASNAAGKSKVLKKSKSGKKAGKKSGTKKKRK
>CT1728 transcriptional regulator, ArsR family
MQRTLSDISYTTDEVTLAVIAKALGHPVRIKILRLLAGEACCFTGELTSL
IPMAQSTISQHLKALLEAGLIQGEINPPKVKYCIHRENWAKAQALFGEFF
ACEKSGGCNC
>CT0265 conserved hypothetical protein
MHKLQVDILGLSTSPHTNGAYALILYEVEGKRKLPIIIGGFEAQAIALKL
ENIKPPRPFTHDLFKQVADAFDLHVNEVLIDELHNETFYAKVICEMGGVV
HEIDARPSDAIAIAVRFSAPIFVSEEIMNEAGIVEERPKEDEEQPAAEEV
VEHQGAAPEPAQGESVAEELNRKLEEAINREDYEEAARIRDELLRLRKGG
KGEA
>CT1280 hypothetical protein
MEREPIQGNVLKTAATVAGAGALLSPAGLPILQGIAGIAVVGLGIFAAGT
AAMKVGEMISSGFGQSKPQQEEEDSPFL
>CT2017 hypothetical protein
MMLIPYPFGLHESMYVSSNENTETILCIDLRFLQPKSTPNVTRHFKAAR
>CT1456 SMR drug efflux transporter
MPWLYLILAIIAEVSGTTSMKLSAGFTKPVPSVFIFVFYATSLTFLTLAL
RTLPVGMSYAIWSALGTALITAIGVLWFGEGLNALKIISLILIIAGIAGL
HFSQEHMK
>CT2272 hypothetical protein
MSVCDDRGLHIDMVSLFFQAKNSHFGRIFLKPLLYFKLRIFCRVSANAFA
AYRYAVSYNFAE
>CT1001 hypothetical protein
MLCVSLQIAILFLCCLVLHKELYKTKYLLDLKKYEAASNSRFILKLFREK
GCFKVREVQPLQMIRDGLPESGCSCGGGKKL
>CT1294 lipoprotein, putative
MLRANRLTPIMLILALILSACSGITDSGLSEKQVEKLVRASRPNNPSPDI
WARAIKESLEELGQPVDKEHVSAVCAVISQVSAFSISPKNSRMASILRKK
IEAAESNEVLRLLIETRLDQTASNGRTFRENIDSIQSELDFEKWYDEFTS
ASVTKPILLVLKKDASDLITTAGSMQVSVKFAEEYPKKPRNAGGGSVRDM
LYTCKGGVFYGTAYLLDYKHNYDDWKYVFADFNAGHYTSRNAGFQKMLGR
LTHRMVDTDGDLLSYENGNATPSVTYVTFINFLKDKGIGFDEKKVMKDFQ
QEKSYDFEETWSYKTLSELYKKKYGHPIYAVLPDIPLNSPKFVSKNLSTK
WFAERVKSRYNHCMRTSI
>CT1433 hypothetical protein
MQEWIEYRCQTHKKASLKKRLFKAVCSYFSNLILSAAR
>CT0379 hypothetical protein
MLSINTQRTTHHNLTQEHSTQKLIISFNSSLTIIETMMLKLPCALFYFFS
KNNFALSNPAAPHS
>CT1120 hypothetical protein
MNPAKSDRRFAIGTRGIVSRINDLADQPDAKETLKI
>CT1288 multidrug resistance protein, AcrA/AcrE family
MNTIPALRNRFSLLALGALLGSFLLLPGCGKKQDGPKQMPPALPAMKVEL
SSASVTTDYSVRLEGLADAEIRPQVDGMLQAILVKEGDLVKKGQPLFKID
DSVYREQYNTALAAQRAAEAQAAVAKVNAEKLVPLVENKVVAPVQLTTAK
AQEQAARALAAQAAASARSAAINLDYTVIKAPVAGYIGRIPYRQGNLLTK
NQAAALTTLSDVNRMYAVFSISEAGFADFKKLYPGATLQQKIDAVPPVKL
TLSDGTVYQHDGKLESISGDFDASTGSIRLRVSFPNPEGFLRSGNSGTVS
LSSNYNHVILVPQSATVELQDKVMVTLLKPGNKVMKQVISVAGRSGPNYI
VSEGLKPGDVIVTAGIERLQDGMVIKPLSAAAPGAPAPAAQTPNAR
>CT0338 hypothetical protein
MQPIILIMRNMEQNVGANPCGRPRTMLADPGLPTRLRIHQSNQATTEMGY
GKNLFISFLPNPDFDLDSEQGNPPIPER
>CT1902 hypothetical protein
MKKAERKSFLLFDKPVSELDLTLNRWLSRPAPDALPRLE
>CT0825 sialic acid synthase
MVGDGHPVYVIAEIGINHNGSLEVAKKLIEGAAQAGCDAVKFQKRTPELC
VPMDQRLIERDTPWGRMTYMDYRYKVEFGFEEYSEIDAYCREKGIAWFAS
CWDEEAVDFMEQFNPPCYKAASASLTDLTLLKKTKATGRPLIISTGMSTM
EEVDAAVNELGRENLLIAHTNSTYPCPVEELNLRMIHTLQQRYPDNPIGY
SGHEVGLATTWAAVALGATFVERHVTLDRAMWGSDQAASVEISGMSRLVS
NIRDIEKALGDGVKRVYDGEAAARKKLRRV
>CT0951 alcohol dehydrogenase, iron-containing
MPRIHFGPGALAKLPALAAAFGRRMLLVTGRQTLRRSPVASRMLNELRHA
GMEVACLEVEREPSPELINEAAALGRQKPFEVVVAIGGGSALDAGKAISA
MLLQRDPVERFIEDQPGFTPHDGRKVPFIAVPTTSGTGTEATSNAVISRI
GPGGFKRSLRHPAFVPDVAIIDPELMVSLPREVTVSTGMDAFTQLLEAYL
SPFASAYTDALCCSGLEHFARSFAVASGDGAASVAVRADMAYAALMSGIV
LANAGLGIVHGFASSVGGLFEIPHGTLCATLLAESTRENIRQLRASEGGE
LGLHKFANASRILTGSATGNLAADCDRLVELLERWQERFAFPRLGEFGVR
ESDFAEIIAATRSKSNAVPLDASAMQRILAARV
>CT0913 hypothetical protein
MRQARIARHEMQQRFADRLGLSRPTLGKMESGDPGVSIGAWTKALELLDR
QDELDLLLASGDNLFEKFAICRTLHYRHLLRRLY
>CT1502 Bche/P-methylase family protein
MSAVIKRAQFAIPPLSLMILSSVEVLGVRQEICDLRFDKLPLDRSWDLVG
ISVQTGAVKPAVEVASLFRERGVPVVLGGPYVSIFPERCREHGDVLVIGE
ADDLWREVLEDLRRGALKPEYRQGAFPDLSAPRPVDKSALQINRYFTTNV
VQTTRGCPYSCDFCSVHVMNGHKLRHRPVGEVVREVEAFLREDKRIFFFL
DDTINADEAYALELFSKLIPFNIKWVGQATTLLGENPKLLDIFSRSGCGA
LLVGIESLADESNRAHHKFHNPAERQVRCIQEIRKAGICVYGSFIYGLDG
DTLAMPERIDAFIDEAGVDVPGINLLRPIPGTGVFDRLAAEGRLLHPSHD
HDAFRFSWGQEMLYYPKQMSLDQFIPSYTELTSKVFTPRRAIKRAMRAPT
LRSAVLMFNMLYVHMYGLSRKDLRRQLAELSSVR
>CT2140 conserved hypothetical protein
MNQQTYAFTLEMEVRDYECDMQGIVNNSVYQNYLEHARHVYLKTVGIDFK
EFTERGINLVVVRAELDYKLPLQSGDRFRVGLNMVRQSPLRFAFYQDILR
LPDMKPAVKARIIGTALNGRGRPEIPAELEALMQPGE
>CT0294 conserved hypothetical protein
MTCCSDFLKRSAVAAATTLIFSLTVPSVAFCASGGGEHSFLPPVWLAAPF
VVLLLMIATGPLFYPRFWEHHYQKVSIVLGVPVALWYAFMAEHGGHMLLH
TLEEYISFIALISSLFVVAGGILIKIERRGTPLLNALLLLFGAVLANVVG
TTGASMLLIRPYLRVNKGRLKPFHIVFFIFIISNIGGALTPIGDPPLFLG
FLRGVPFFWVIQHLWLPWLVTIGLLTAIFLVLDLKAGKNEAEQEYSGRIT
LIGRRNFLYLIPLIGSIFLDPAVIPGFPSLQEIFHVPFGIREVIMLTIAV
VAYKTANQDALGGNEFNFEPIKEVAFLFVGIFATMIPALQLIGAYAQSHA
AEFTVTKFYWFTGILSGVLDNAPTYLNFLAGALGKFGLDSNSIADVVKFS
RGMDSPIAGDVPSTLYLMAISVGAVFFGAFTYIGNAPNFMVKNIAEQTEA
DVPSFVEYIYRYSIPVLLPVFGIIWFVLFHW
>CT1325 ribonuclease BN, putative
MKENSKEERGRLHGMVWSSSAFFSFMWRHFVHDRVLMSAGSLAFQTLLSL
VPLMAVTLSILKVFPVFASLKQYIGDFLFQNFAPAQGSILKGYLWEFIDK
TSSLSTVGGLFLIVIVLFLISTIDQTLNDIWEVQTPRRRLQGFTLYWTVL
TLGPVFIGTSVLASSYVWYSVFAEGALLEMKARVLSYVPVLNSVIAFFLL
YMLVPNRKVRFTHALAGGVLAAVLFELAKRWFTFYVSSFATFEHIYGALS
VVPMLFFWIYLEWVVVLTGAEFVFSLGYFRPAVCPAREFDPLQGVPEVVA
VLRSVWRAQLSGSFMTGKKLLASENSGDRSKLGHVVDFLKQNGILHKTAD
GGLAISADLHAVTLYDLYTKLPRELFNGDGCVDEGPASREFEPLRAEVRE
ALRSAMQTPLITLVNDSTEKDS
>CT0936 hypothetical protein
MLSSTETGKSLPVMKQLMFVLLLLLFAGAASGSEGRIVSQSPYITDTLCA
LGLANRIAGVSRYGDLDLPKTGGVIDSDPAAIAALHPGSLFLSDWTSRDV
CKRVRPAGAKCVVLQKEGYECPVRIPEGDYQIVRLDGLHFVSPSPKILEG
LADLQWQFGSRNKKNESK
>CT1201 hypothetical protein
MAKKSTAKEAPEVTPEKKTAKKAAASAETKPKSAKSKTAKIAAPEEPKHA
KTAKPRKKASAKPMAPETPAASPETVEEHIRVAAYYRWVERGMTDGGHEE
DWIAAEKQIKG
>CT2223 conserved hypothetical protein
MPPRKAPFRATAPRVILLAIAAVVVTLLFVPGINSMSPKEEPARIAVHRG
EGFRRIVEKLHDAGVIRFRWPLLAAGALVPPLHKIKPGRYTISGNHSVFG
LLWYLHSRPQDEVRVMIPNGVEQRKIARIIAANLDIDSTAFIAASRDPRL
LASLGIKGESTEGYLFPGTYNFAWASTPKEVITFLTRRFRAFYSDSLKQE
AKQAGLDEHQLLTLASIVEAETPLDEEKPVIASVYLNRLKKNMRLQADPT
VQYAIPGESRPLHYKDLAIDSPYNTYRHAGLPPGPICNPGAASIRAVLSP
ANTGYLYFVATGQGGHAFATTLAEHARNVQRYRAARKEQQKNPGGGTP
>CT2085 oxidoreductase, short-chain dehydrogenase/reductase family
MNLADSTAVVTGSSSGIGLAICRALLDAGASVFGLSRRETPIAHERFRWL
KTDVTVEAEIDQAFEAVFAESGRIDLLVNNAGIGFFRDIESIDPVEWRRL
IDTNLTAMFLCTRKVVPSMKAAGRGMIVNIGSVAGKRGIRGGTAYCASKF
AVNGFSESLMEELRGFGIRVSCINPGSVMTEFFDHAGIEPKKHMQSDDLA
QLIVSLVALPDGMLPDEMTVRPL
>CT1409 hypothetical protein
MTELFQYPMSYPGWWLNNYYYFTWTLAMLFLAGGWAIFYRYGKFSYGVDF
GCFWKTALLVIMTTIALGVPSYYNTKFVAQHGNDGDSVLLTPDRIEYRYR
NGEKKMFLLKDIVSIYQEPVTYNPPPKIFIVAKNAGLRDSITVTEGKYGL
PDVDKLLAALSARTGLQIKRP
>CT0276 hypothetical protein
MGVLWWQNYVNQRYQSVEITRFYSGTSPRGGNKLALTDQQKTAFRKLRQE
HFRKTMPAVQKIIEFKKEMISEAVKPDPDLQKLSAIADSLGKRQAWLEKD
LALHFHELAMLCTLTQRDSLKKLLSNIYTVRYQKMTLWKGRPHREDREDN
HRGPIPPSAPEP
>CT0215 ABC transporter, ATP-binding protein
MSIYSILPLLNEVFSTNQTALKAEPATTTHPTLALQKTRPPQPDQRLSTT
DATGKNPGDKTLKSKAIQLKTWALEKFQAMFNAPTKEETLLKICLFLISA
FALKNLFIYLNKQIIFRIQTKTAKKLRDDVFSSIIEMQLDYFNKNRVGNL
MNYVYNDVENVNNSISATFVNFLQNPFSIIVFLAVLLTLSWQLTLFSAVT
SIFILVFVRMIGKKIRGLARRYQVNMGNMNSVLQEKFNGIKIIKSTAFEE
VEFNKFKAFTKEFRTLGIRISQLKNMIGPLNETLMIAAIAMVLWFGGLQV
FNGMMTANELLLFAFTLYSTMGPIKQLGDANTAIEAGLVSVERLFEVLDA
EPEIVNGNRSISTFNEAIRFEDVCFKYNKEDPNAPNILDHVSFEIKKGEM
VALVGQSGSGKSTAVDLLMRFYDVDSGRITIDNIDIREFDYKQLRKMIGV
VSQEVILFNDTIEQNIAYGVHEEIEHDQIEKAAKLANAHQFIMEKPEQYD
TMIGDRGIQLSGGQRQRIAIARAMVKNPELLIFDEATSALDNESEKVVQD
AIDHAMENRTALVVAHRLSTIKNADKIIVMERGKVVEAGNHAALLEKNGV
YKMLYDIQFTGQAANRQQTT
>CT0431 hypothetical protein
MNRMHANTAAMRHAQAPKHAQFQKGAVMVEFAFILPIFLLLLFGMVTFSI
ALYDKTVLCIASRQGARTGALYYASNYDSNGNLINANVQQRACDAANAVC
QQDLINFGPNMNLQIQCQVLGGTVHGQRSVSVTTGIDYTGIYILSDVLHL
SSTTIMRLEED
>CT0958 conserved hypothetical protein
MKRFVFSGFAVFMLAAPVTGHCRDYQMNDSSPEASVRRDADTDKDAAQKE
YLIAMKYYYGRDVDRDYAEALKWLRLSAAKGNDAAEYAIGFMYQKGLGVP
QDYAEAMKWYRLAAAKDNDNAQNQIGYLYHHGWGVETDYAEAMKWYRISA
AKGNFAAEDNIGVLYEHGQGVEQDYAEAMRWYRISAAKGNGEAELNIGNF
YQHGLGVELDLNKAVKWYRSAAAKGNEEAAQKLSAMKSGSVLPDVHHE
>CT0097 NLP/P60 family protein
MNPEHRKSSFTGPMNEITRFAPKAAKATSILRALALMVLASLMLTLGACQ
SIRPLSDRMESKYSLKKRKTSISRLRPQGPERCSVPVQVSARAFRAMLDS
IEEARGVKYRFGGTTPEGFDCSGFVQYLYNRSFQMILPRASNDLALVGPI
IHRDRLQPGDLVFFAAGDEITHVGVYLGNERFAHASSKAGISISTLSQSY
YATHFAFGTRIIRVE
>CT1043 fumarylacetoacetate hydrolase family protein
MKTFSSLSKPATHRSIYCVGKNYPDHAREMASWETDKPEPLHEKEPVIFM
KPGTALSTDGCTSIPRFEGQLVSRNLHYEGELVLLIGADADEVSLADASA
IIAGYAAGLDMTLRDVQLEAKNAGNPWLKSKGFRQSALVSEFIAPESAGP
WAELAISLRLNGEQKQYSKVSKMTFSPAYLVHYLSYIYGLRAGDLVFTGT
PAGVGSVLPGDRLDVSLETADDHSQAKILVSLQATVS
>CT1656 hypothetical protein
MLVACAKSLATALFIFFSAIYEVPLFIAGRSTKRADH
>CT0132 RNA-binding protein
MNIYIGNLPYSVTDEDLRDKFSEFGQVHSANIITDKFSGRSKGFGFVDMP
NESEAREAIDAMNDKDFKGRTIKVNEARPREQRPPRREQY
>CT0110 aminotransferase, class I
MPRFSKSVSALRSSAIRELMSLASRPDILSFAGGMPGNELFPIDEVEELF
RNLDTKTKQAAFQYGPTPGLPSLLESLGGFLERKGLPVKKNRLMITTGSQ
QALSIFARAFVDPGDRVLTEYPCFIGAIAAFRACGAEIVSLPVDEAGIDI
AMLRQEVENPDPAKFLYITPYFHNPAGMLYSTNRKRKLIKALQGRDIPLL
EDDAYGDLWFSDEHREALQPIKAIDPEDIDVCYMGSFSKILGPGLRLGWM
LAPEAIYEKCELIKQSADACSSSFTQVIADAFIRSGRIDTYVASVREEYK
RRAASMVKALRSNLPAYVRWNEPKGGFYIWLTLPEGADATEILKRAIEGG
AVFVTGSTFDPQGTRNNTMRLSYCNNTPEEIERGIPIITRAIREVCG
>CT1238 hypothetical protein
MQARSGQGITRNLHTPSPGSAERRAILDAMRLKIKELHGIDVIFVVKTMN
VSGGWAWVHTLPRSVDGFFHYEDFSALLHNDGKQWLVDEIACTEPDNPHC
IGSPGYFRKLSHRFPCAPLSIFPTASFLR
>CT0248 hypothetical protein
MSVKPVDLNKLRSTHENLYETVVAISKKAREIQEEERAELEERLLPYKEM
IRNPASESESEKVFPEQIAISVAFECREKPTQQALAQYLDHQYDYVLEKS
PETKVAQNEDEDESDRD
>CT0584 hypothetical protein
MRVQLLDEATVDLADGYRFYERQAEGLGEYFLDSLWSDI
>CT0804 transketolase, C-terminal subunit
MGEKITIEQTAKYTSRGNKATRTGFGEALLEAGRENPSVVALCADLTGSL
NMHLFRKEFPERFIQTGIAEANMISMAAGLATIGKIPVASSFAVFATGRV
FDQIRQSVCYSNLNVKICASHAGLTLGEDGATHQILEDIGLMRSLPRMTV
VVPCDYSETKRATKAIIEHEGPVYLRFGRPNVPDFTADEDGFEIGKSIEL
HPGKDVTVIACGIMVWKALEAARILEKEGVSVRVINMHTIKPIDTLAIVR
AANDTGAIVTAEEHQMYTGLGEAVANVCARNIPVPIEMVAVEDTFGESGK
PDDLLRKYKLTTEDILEKIYLVLRRKD
>CT0538 cation efflux family protein
MSNHEEKKQHVALSSVAASLLLTAMKLVVGLMTGSIGILSEAAHSAIDFG
AAALTWFAVRISDKPADKKHHYGHTKIESVSALIETGLLILTSVWIIYEA
VTRLLSGTTEIEVTWYAIAVIIISIIVDISRASALKKVAKETKSQALEAD
ALHFTSDIVSSAVVLAGLGFVALGMHWADAVAGIFVAVLVGKAAWELGRK
TIDVLIDTAPEGLSEQIEEIVVNAPGVIGINKMRIKPAGGPFVFIDLTIS
VSRTLPQEKVVAICAGVEQRLKAALPDSDITVNARPVVLDSETVTERIHT
IGLNHGLHAHNILTSLSGDHKQITFDVEVDSRLTIRQAHDAVTELENELH
REFNEEIDLCIHLDPINSEERNVQPADPENEARITALIRQAASTIPGILN
VHALKIHVLSYNKLCITLHCGFDDNTVLADVHPLTSRLECLIYRDIPETS
RIIVHAEPVNAGD
>CT1285 hypothetical protein
MTTIRTEIVIDALPEQVWAALTGFGSYGAWNPCVRRIDGNAGVNQILHIA
IRFGWLPPISFRARIDCFSRDTIFGWRASFLFGFLEGRHWFELHPLDAGR
TLFVHCETFSGVLASPFLALMSGLVRQSYEAMNRALETIVEEK
>CT1752 hypothetical protein
MFAVPLITSSQRESVMSLITVDLLKVALPDCKKPEEWVAALIPALEKYAI
NSEARVASFLTQYRA
>CT0277 hypothetical protein
MNRTTEQRNAEVMKTIGLLDQMPRVEVDHLFRVRLMQRIEAMEVKKTSWS
ALPGGAFNPRLAFMALLLMLNIASALMLFMHGTPQATGSSGAIAESLTED
YGGPALSYYDDQTTIDR
>CT1863 conserved hypothetical protein
MSLAINSIEQNSDIDPQALRLALQGYFNLKRQGLIRREGLITLIDFDKPS
DCKRLFIIDINSGTVIQTALVAHGRGSGDIMATSFSNQPGSNKSSLGFYL
TENTYIGNNGYSLVLKGLDQGINDKAEQRGIVIHGADYVSEEYIRQKGRL
GRSLGCPALSMDQCREVIDLIKDGTCLFIYHQGEDYASRSVVLNPKLALG
SGKSKNPA
>CT0063 magnesium chelatase, subunit I, putative
MKQETDLELLGRQIAEESAFIDRVREVLASTIIGQGAVIDRVIIALLANG
HLLLEGVPGLAKTLIVRTFASAMNLSFHRIQFTPDMLPADLIGTMIYNPK
TMEFYPRMGPVFANVILADEINRSPAKVQSALLEAMQEKQVTIGDVTYPL
GEPFMVLATQNPVEHEGTYMLPEAQLDRFMMKVIVEYPTFEEELEVMQRA
SAVQAPIEVQAVVQPEEVFRSRSLVDRIYVDQRVQRYIVDLVTATRSPER
YGLDGLSTMIEYGASPRASIFLLLASKAHAFLQRRAYATPEDVKSIVYDV
LRHRVRPSYEAEAENMKAEDFIRNILEHVQVP
>CT0163 alpha oxoglutarate ferredoxin oxidoreductase, alpha subunit
MSDTVILNNNDMVISKTNVSVLFAGDSGDGMQLTGTQFANTVAVYGSDLN
TFPNFPSEIRPPAGTVAGVSGFQLQFGTTGVYTPGAKFDVMIAMNAAALK
ANLKNLHHGGIIIADTDGFDAKNLNLAGYGETNNPLEDGTLTDYTVFKIP
VISLTRQALADTGLSTKIIDRCKNMFVLGVLYWLYSLPLETTIEALQSKF
KNKQDIAEANIKAVKAGYNFGDETEMFSQHGRFCVPPAQKKKGVYRRVTG
NEASAIGLAAAAQKAGLELFLGSYPITPASEILQTLAGLKKWGVKTFQAE
DEIAGILTSIGAAYGGALAATNTSGPGLALKTEGMGLAVILELPLVIINV
MRGGPSTGLPTKPEQSDLLMAMYGRHGEAPMPVIAAMSPVDCFYAAYEAA
KIAVEYMTPVLCLTDGYLALSSEPMLVPSPDELASITPMFSPERKADDPP
YLPYKRDERCVRPWGIPGTPGLEHRIGGLEKQNETGHVSHDPENHALMTR
LRAEKVAKVADIIPDLTIDNGPEKGDLLVLGWGSTYGAIKKAVEQAREGG
LDVAHAHLRYINPFPKNLGAMLGNFKKVLIPENNCGQLLSLIRDKFLIEP
VGFSKVQGLPFNEMEIEEKITDILKEL
>CT2049 lipD protein, putative
MTKKSTRRFFALLVAVVATCSQADAATMRLSLSDALKMAHERNTMLKAAR
AKNDQADARVVQSRQAYLPKVTLSETLLHSNDPGAVLVGKLQQEIAYYNF
NTTPPDFGDFSLHNLNHPSAITDFHTSLQVQQPIYNRDAIIGGKMARSAR
KAQGFMTDRAVESIDLNVKKAFYGLILAKKNLDAIEQSIRIMQGYSSEAA
RGYAAGLVTKSDKLSTEVRLAELRDQKLQIQDAIRNAQDALRTILALGPD
DTIQPIGDLAVDARIPASAAKASAEGRSDLKALAAFQEVAGYQHDMARAQ
YLPRVNAFAQQNWHDSSFLGTEGSSWTIGLNVQWNIFDGMATKGKVQETK
AQELEARYNYQAAKEQSEMEIDMARRALVTSRERIAVTQKSLEAAKVSFD
FIGEQFRTGMAMTFELLMREQAWTWAKMSLNQAKYDYCIAKSELEYYSAH
>CT0609 sepiapterin reductase
MKHILLITGAGKGIGRAIALEFARAARHHPDFEPVLVLSSRTAADLEKIS
LECRAEGALTDTITADISDMADVRRLTTHIVERYGHIDCLVNNAGVGRFG
ALSDLTEEDFDYTMNTNLKGTFFLTQALFALMERQHSGHIFFITSVAATK
AFRHSSIYCMSKFGQRGLVETMRLYARKCNVRITDVQPGAVYTPMWGKVD
DEMQALMMMPEDIAAPVVQAYLQPSRTVVEEIILRPTSGDIQDD
>CT1379 conserved hypothetical protein
MEIEQTVLVQCPYCAQSFEVLVDLLAGHQEYIEDCEVCCRPVSLVIDVAE
DGTATVQAQGEDV
>CT0764 TPR domain protein
MNKQQQSMNTNRSAQEMSETSPEYEQKYQQAIDCIENQEYGQAISILDEL
AGEASRDAKLRYARAVALLSNGEYRRAGTDLAFTVALDRSNLEAYRHLGF
VLLTMGKEEAAIKVLEEALRRDPCFVEAWCVLADVHMDLGEHDKALDALD
RAHELQPGNAEVHCKLAMYYMSRGDMRGLRAEYEVLREIEPDVAAQIAEL
LP
>CT0503 hypothetical protein
MAVAIDTAPVMTEIRAKAQKTERFRMKCQRGIGTSFLFSISSMILSELIF
GMTVQN
>CT0565 oxidoreductase, FAD-binding
MNFKNEPAAIAGFLEDTSNLKNGWTPGVFFPETPEEVASLLREACADGRR
YTIAGNGTGTTGARIPFGDYVIAMQKLDRIGEVEPTIDGRALLRVQGGAL
LQDVQAKAAAAGWFYPPDPTEKTCFIGSTISNNSTGSRSLKYGPTRNHVQ
ALQIALPQGDLLEITRGQHLADAAGNFTLDLPLAGRVTFRLPDYTMPKTS
KHNAGYWSKPGMDLVDLFIGSEGTLGVIVEATLLLRPAPERVIACLAWFR
SEEELLGFVGEARAGSGGVSPRALELFDRRALEFLRQSYPDIAKEMAGAV
YFEEETTVEREEACLEAWLELMEQCGSPVEKSWAALDSEGLQKLRDFRHQ
LPVLVNEWLSRQSESKVSTDMALPDERFAELFRLYRDACDREGFTYIIFG
HIGNAHLHLNILPHNHEEFVRAKTLYRQLVSKVLAMGGTLSAEHGIGKLK
SEYLVQMYGRKGIREMVRVKKAFDPYLVLNVGNMIPAEYYESET
>CT1398 conserved hypothetical protein
MTKTDERKPAIFLLSLGCSKNTVDSERLTAQAVASGLTFTDNVDEADIIL
INTCGFIKDAKQESIDETLAAIGKKEEGVVREVYVMGCLVELYRKELAEE
MPEIDGLFGTRELPEVLAAIGAKYREELFDRRELLTPPHYAFLKIAEGCN
RRCSFCSIPKIRGPYVSQPIEQLLREAALLQQQGVKELNLIAQDISVYGY
DLYGKSALNDLTLRLSDMGFNWIRLLYAYPLNFPLEVISTMRERPNVCNY
IDMPLQHINDRILKSMQRGIGRKATEQLIDDIRQKNPDIRLRTTMIAGYP
GETRAEFEELLDFIRQTRFDRLGCFPYRHEEHASAYALEDTVSDEEKEKR
VGELMELQEGISASLNRKLEGQTLKVLIDRIEESVAYARTEYDAPEVDND
VIIEIGDEAVEEGDFRQVMIEDSTAYELFGRISG
>CT0966 aspartate aminotransferase, putative
MSVESFERFLSRRVLSMQESQTMKITGLAKKMQAEGKDVVSLSAGEPDFP
TPENVCEAGIEAIRKGFTRYTANSGIPELKKAIIRKLQRDNGLEYAEDEI
IVSNGGKQALANTFLALCDEGDEVIVPAPYWVSFPEMARLAEATPVIVET
SIETGYKMTPEQLAAAITPKTRILVLNSPSNPSGAVYNEAEVRALMQVIE
GKEIFVLSDEMYDMICYGGVRPFSPARIPEMKPWVIVSNGTSKSYSMTGW
RIGYLAAPKWIINACDKIQSQTTSNANSIAQKAAVAALDGDQSIVEQRRA
EFEKRRDFMFRELNTISGIECTLPEGAFYIFPSIKGLLGKTFGGKVMKDS
TDVAEYLLTEHYVATVPGDAFGAPENLRLSYAASIEELAEAVNRIRKAFS
>CT1990 conserved hypothetical protein
MTAIAPTFFDLPVVWHNVLVMLLTIAYVFSVPLLMDWLVTNHGLPRDISR
KITHICAGSVIVFLPLFRDGDWSHYLNITVFAVWTVLLIQKGLFAADDDQ
AVKTMTRTGDKRELLKGPLYFVIVAMICGTLYYKQFAGVLAMAILGWGDG
LAPIVGTRMGKMKYKVFCERSVEGSIAFLAGSLAAGLFFVWLIVPQAFNP
AKIAMIAVAATVIEALSPKEVDNILIPAEVIALAAVL
>CT0810 hypothetical protein
MQPYHFSNGSIRFICLATALMQPFPPSAIVIDKPELGLHPEAIRILGELI
RDAAKRTQIIIATQSPLLLDQFSIEDNRRNMPARPKS
>CT0599 hypothetical protein
MFKMLMTITTIKELIPVLQTAIGPVILISGIGLLLLTMTNRLSRVIDRSR
ELLDEADKLFGVDRARIDREIDVLWRRARYVRSAIMLAVASCLGAATLII
LLFLTSLLQIDVPLLASIVFIVSMVSLIGSLIFFLFDVNLTLSALHIEFE
GHRKKS
>CT0876 sulfide-quinone reductase, putative
MKTKPHVLILGGNFAGLQVARHIRDHVKPEDASVTVVDKRSYLLFIPNIL
MEILENKNPDSSMQLPLAPVLDKDETRFIQAEVLDIDVESKKVTIQPTER
PGTTTDVLTYDYLVIALGNRLAFDKIDGFAEHGHTVSDGFYGNKLRHYLH
EGGYKGGPIAIGSARFHQGTKGKLDFVPMAKAACEGPPVEIALSLASWMK
HHEMGGPEKVTIFTPADLIAEDAGKNIVKQLLEIAGGMGFGYKNKLEDIK
QIGKDGIEFANGESIEAELKIILPDWVPHEFLKGLPICDEKGFVITNKQL
KNPDYPEVYAAGDAAACTVPKLGSIGHHQSYIVARQLARDLGALSDEEAD
SELYSPEVICYGDMGDGKAFYIHSNVWYGGDIEILKMGKLYYDLKVAFKT
SYFAMGGKVPYWQWKMGSWMGDKIL
>CT1720 hypothetical protein
MQSVEGSMKLTRRFSLLSPLFSPDAKELSAHLKV
>CT1987 glycosyl transferase
MHCKAAGGGLFFTMIIFYQLFILGSLLVFLGIVLKNLGDLRSLPEVTGEE
AYRPKVSLLVPARNEELNIEACVLSLLGQRYPDFEVIVLDDHSSDSTLAV
LRRIADTKAGARLRILEGRELPEGWHGKAWACQQLAEAATGELLLFTDAD
TRHQPSALARAVEAMRRCGAGMLSLTPAQEMESFWEKLIVPLVYHILFSY
LPISMVSKSSSPAFCYAIGQFILFRREAYEQIGGHRSVCNNIVEDVWLCK
AVKRSGGKVAAFDGTDVVNCRMYRGFGEVWQGFSKNLFAGLGNNIVGLFA
LMAFVALLYLAPYAFLASSALLGDRTVALFWLPLAQVGVALFIRFLIAVK
FRQPPWPAALHLFSQVMLLLIAFNSFRLTVFGKGPEWKGRNYPLSGHGH
>CT1773 conserved hypothetical protein
MAEYDKDKQAKEYYTDIPVNTHGFFLKGAHSLDWGMKNRLSRIFRPDTGK
TVMFAIDHGYFQGPTTGLERPDVNIVPLMKHADAIMLTRGMLRTTVPPSL
TKAVVMRCSGGPSILKELSDEELAVDIEDAIRMNVACITLQVFIGGEYET
RSIHNMTKLVDMGLRYGIPTMAVTAVGKDMVRDAKYFRLATRICAELGAQ
IVKTYYVPEDFETVIASCPVPIVMAGGKKIPELDALTMSYNAIQEGAAGV
DMGRNIFQSDNPEEMMLAVNKIVHEGFTPNEAYEYFNTLVAAK
>CT0179 conserved hypothetical protein
MDFECQFVCELKELAPVPALLIRTQTTMSELGSLFEAGYHDILQLLAGQG
KSPSGPPFARYFGMSAGTFEVEFGFPVEGGVEGSGRVVTGLTPSGKAASS
LYIGPYGEIEAVYDALMKWVDDNGFDLSGEAYEIYLDNPAETAPDQLRTR
VSLMLHES
>CT1583 TonB family protein
MSSKLQSVHHEAARKAQRWTFREQFLDTDRLRQINYGNLALRREAHLFLT
HGVVVAVLLLSIFWLVSANWNRVMGMFGGGHNQQAAVQCYEVVTNVTQLP
PPPSIAPEPSKVKAAAAPVEAPKVGKIRKVAEAPPDQTFATQKEIKQAIT
QGPASQDGGGSSGSCDTVVEFVNCQNPPTVVSTPRLVYPEMARIAGLEGR
VFVRVLISEEGRPMKAEIVKRIPADQTVFDKEAVRIAMETKYTAGVQNGR
KVRVWMTIPVRFTLHES
>CT1438 hypothetical protein
MKFQALRYGEFFQGKISLYHHKRPLIVRFLECINKGTQEWLVDFYPGNGL
ASGMVLAIDHDNQHVLCEITGKKGRGRYIRVLNPKITIEELWQQSELALA
DRLER
>CT1832 conserved hypothetical protein
MSPISQKGEAVCLSVRVQPRSSKSGVAGMYGEQLKICLKSAPVDNAANKE
CCELLAKALGVPRSSVSVMKGASSRSKVLKVEGVTPAAVREALVGMLGDE
AEAGA
>CT1952 hypothetical protein
MRQIMPLITIKALRNPQGFFFCEQTISKTTYFGVCVNSP
>CT0260 phosphoglucomutase/phosphomannomutase family protein
MSLMISVSGIRGVVGKSLTPENLTAFTMAFATWIRRRKQALNPGSLAKPI
IVIGRDSRPTGAVITGLVSNALSLCGCDVIDVGMTTTPTVEIATAGEGAD
GGLIVTASHNPVEWNALKMLDEKGEFLTASDVGELLEIAEKKAFDFARWD
GIGTVTANGSWDAAHIDKILKLSCLDLDLVKSMNFRVLIDAVEGAGSYIV
PELCRKLGITEIKTLACEGTGIFPRNPEPIEQNLRQTMAILASESCEFGI
IVDPDVDRLALVCEDGTLFGEEYTLIACADFYLKHHKGPVVNNLSSSHAL
FDIARKHEVECFSAKVGEANVIEVMKEKEAVIGGEGNGGIILPELHYGRD
ALAGIALFIQAFAGWKSATGGTLSEFRKTFPDYFMSKKKVELTTLSKESL
DRIFDTVAANHPDAECNRLDGLKLDFENGWVHLRPSNTEPIVRIYTEAST
QAEADALAARFIEEIETAASANPAKN
>CT1076 conserved hypothetical protein
MKPPFNGTVLVAGATGRTGQLVVRRLQAHGIDFRLFVQSGQKAIELLGPE
IVDKLVIGSVLSDQEVEAAVRNIDAVICAIGGNVMNPDAPPPSAIDRDGV
IRLATAAKAAGVETFVLISSLGVTHPEHPLNKYGRVLDMKLAGEDAVRKL
YGEAGFRYTILRPGGLLNGPAFRHELRFDTGDKISGLIDRGDVAEAAVIS
LWHPKAKNKTFELIKAGDEEVTQTSLEGFFEGL
>CT1879 conserved hypothetical protein
MRKTQTYKSLPYNPALRDRAKALRKAGILHEALLWFELKSNKLNGLDFDR
QKIIGNYIVDFYCAERSVVIEVDGSSHDSKQIEDRERDAYLNGLGLTVIR
VLAKDVLRNLEGVVEFLKDHPALTGTPPEEGNKTALTGTPPEEGNKTALT
GTPPEEGNKTALTGTPPEEGNKTALTGTPPEEGNKTTLTEARE
>CT1094 hypothetical protein
MKAPEERLGSFYLGAEYDLQSGLRLEQPVHYDARDLTTHAVCVGMTGSGK
TGLCIDLLEEAALDKVPVILVDPKGDMTNLLLQFPDLLPEDFLPWIDQDE
ARRKGQTPEELAAATASMWKNGLFDWGIAPERIRELKDSAEFTIYTPGSD
AGIPVNILGSLAAPRLDFDTEAEAIRERIAGTVSALLGLVGINADPVKSR
ESILLSGIFEHFWRAGTDLDLATLIGSIQNPPMRQVGVFDVNTFYPEKER
FELAMSFNALLASPSFQSWLSGQALDIASVLYTADARPRVAIFSLAHLSE
NERMFFVTLLLENVLTWTRAQQGTSSLRALLYFDEVFGYLPPVSEPPSKR
PLMTMLKQARAFGVGCVLVTQNPVDLDYKALTNTGTWFIGKLQAERDKAR
VLEGLKSAIRTAGGSDAEFDFDTVISQLGKRVFLMHNVHEDRPVIFQTRW
AMSFLSGPMTRPQIKTLMAGHKNGSTAQPATASPRPASPAIAETPATSSA
AKAMPSGYSTVSVLPRLDPSIKQLFVPAATAPAAALAKRSLGAPVSAELL
YEPALIGCASVDFVDARRGISENRRLVLACTEFDAAGNPDWEKSQRLVAM
QNQLSDRPDSKAAGFATLPETLVNARKLAACSRSLSDFLYRRERLPLSIH
PATDLFKGEGESERDFTIRLQQVAREQRDQEVDALEKKYATQLERIAEKI
KKEERELAQDEAEHQNRKAGELIGLGETMLGFFLGRKSTRGISAAINRRR
MTANAKAEVDESVETIEELKRQQEKIEAKLKRLAAEITARWEHPESALTT
EEIAPRRSDVMIQMVTTGWLPFWQVTLDQASGGGTLYLPAYEAADES
>CT0859 membrane protein, putative
MVFKQIRNWIVPGVLGLLLLMMPDISFADGTTQIAATAWWVWVLVLFVFS
FLLGIVAVLAGVGGGVLFVPIVSGFFPFHIDYVRGAGLLVALAGALSAGP
PLLRKGIADLKLGLPMALVGSISSIAGALMGLAMPAKNVQLLLGIVILGI
TAIMLKAGKSGYPEVKEPDALSKMLGISGCYFEEFGQHEVSWQVHRTLVA
TVLFFIIGFIGGMFGLGAGFANVPVFNLLMGVPLKVAVATSGLVLSINGS
AAAWVYMFNGAVLPIIAVPAVGGMMLGSKIGAKLLPKVNTRTIRMIVITI
LALSGFRSLLKGMGG
>CT1302 hypothetical protein
MIKTIQSRLESLADEPTAGILRRFFKTGPGEYGEGDRFRGIRVPVLRKLC
REFLHAGVEVISELLDSPWHEDRMLALLLLIERYQSSSESGREALYEFYC
TLTGRINNWDLVDLSAPCIVGRHLHTRDRSRLYRFVESSSLWERRIAIVS
TFHFIRNNDFSDTLALAERLLTDPEELLHKATGWMLREVGKRDQPLLEAF
LEHYAIAMPRTMLRYAIERFPEDERKGWLKRR
>CT0073 cytochrome c-555, membrane-bound
MKRFLPLLATGLIVLGGCGLEKPPAKLAEEIDKAEKADQPKTEAAAPAAA
PAPAPAATPTDPALAEGKTIYEGGCNACHDAGMMGAPKPGDKAAWAPRIA
KGEESVIKNTINGLNGMPPKGGNAALTDEQLTNAAKYLISISK
>CT2278 conserved hypothetical protein
MNSLFTIVSGIRFDEPWWLLLLPLTIAGSAVYRLFWRKNRQGVLFPSVSE
LRSSGFAALSLFSKFPEWLHWLVLVLIVLALSGPSAPFPPSSRDTVGIDI
MIALDVSDSMNTPDFGGKSRFAGARTAAMRFIDNRPADRIGLVVFSGGSF
TRCPLTLDHEVLGRLAETVAPGFFDEPGTAIGTAILTATNRLKASSSKEK
ALVLITDGENNAGEVTPETAARLAANYGIRIYTVFAGKEARAFENTSNTA
LNRKGRSELETVARISGGRMFSAGDVFGLMKSFRDIDRLEKTRLKGRMPS
RTMALYPWLLLSAVCLLLAEQALSATRFIRIP
>CT0014 hydrolase, haloacid dehalogenase-like family
MMIQNDTKPKAFIFDMDGVLVDNMRMHAQSWVDLFADYGLSGLDPERYLV
ETAGMKGLDVLRYFLDPSISPEKADRLTELKDILYRVMNRNAIVAMPGLE
TFLDRAANAGIRLGIGTGAGPKNIDYVLGLTGLTSRFEVVVGAHMVRHGK
PHPETFLQVAERLGADPASCIVFEDALPGAEAAAAAGMSCVAVTTTNRPE
AFAAFDNVITTIDHFDGLMPEALLELNRAVKTMS
>CT1364 peptide ABC transporter, permease protein, putative
MVAQRANKSSIPALYRIVPGLGLLSAGKRSSGLFYLLATLAFFALFAARL
DLVIISLRSLAAGVTLLVLSPTNFSQIFTIETLEFWIAALYLIIVPFLLL
RFSVRSYRKVLKEKDQSQSGEASLWQLGMISFKRNGISVAASMVIFVLYS
VAFLAPLLSPFSPYDQQDFLVTAYQPPLTRLDALVLRQHQTMVMPLRPGS
DMASKAANTLILDSQKLRSRNEEHNLKFVNSYRIESNELLYRQGMREKSL
PIDQLVNVSGSSQKPVYAVTKTFLLGTDQYGRDIFSRVIYGSRISLSIGF
LVVLISVSLGTVIGISSGYFGGWVDNTLMRIVDVLIAFPALFLILIIIAT
FGNSIYLIVITLSFTGWMGVARIVRGQVLSLKEQEFILAAKSLGLSNLRI
IFRHLLPNTLTPVIVAATLRIGSIILTEAGLSFLGLGVQPPTPSWGNIIN
EGRDSLLNHWWISTFPGVAILTTVVCFNLIGDGIRDALDPRMRG
>CT0196 glycosyl transferase
MGQFLPHWNQYGIQADTLCVSGKEASRNIARALAVCGNYDYVWIQRKVFP
PPLVWLFSRKANVIFDFDDAIHVKQVMLTGKQEPESWLKRQWIARTLQRS
AMVLAGSDALKDYAEQYNGNVHLVPTPFETPPQKLNVHQKNKTVTIGWIG
INANLYFLRQIDETLSLLAEKYPWVQFSLMSGKMASGMKTPWKLTPWSSE
SEKSWLSEIDIGIMPLTDDEWCRGKCAFKLIQYMAYGKPVVASNVGANRS
TIEQGVNGFLADSAKEWLDALEQLVLNEGLRRRMGDESRRIHLERFERAE
VQRTIANLIAEHHRSATAG
>CT1490 hypothetical protein
MASPRTNAICKSNEMKADNIVQPEPFTLRIFSSSP
>CT1885 conserved hypothetical protein
MSTTENVSERTGLAFGGGVVLGAAHIGVLKAMEETGFRAECVSGTSIGSF
IAAMYAFGKSWREIEAVALELDWSDLSGLTLSGYGLLSIRKFGKIVRAQL
GSRRIEDAPLPLAIVATDICTGNEVVLREGDVATAVMASSSIPGIFKPVE
QGEMLLVDGVLTENVPVSPLKEMGASRIACIDLFGRHSFRRPEHLSDLLL
NAFYSAMRAISQIQISKADLVIAPDLSRFSLVDMSAVPEILDTGYREALP
LLESWRDAHR
>CT2218 hypothetical protein
MQGHAISIQPRCVITPMLYAFEKLATGQCPAYSAFDSIYDTYFPWRY
>CT0993 hypothetical protein
MCKMITRLTLRNFKNVQEQTYEFTEFDLLVGRNNSGKSTVLQAIAIWQYC
IDEFHRSNRSGTKGIQIVLPNFTALPVPEFNLLWRNKTDREYPFEEGKKK
QKFILIQIIVEWKVSANKREEFGIDLRYQSPQTIYAIPEGGWGRFRELED
ILPRIAYVPPFSGLDPMEKWLDIAPIRQQVGKGQPGSVLRNLLYKVKRDE
ERGDDWGELASIVKKWFSVEICEPVYDKERDVHIKVEYRQNGKEFDIISG
GSGFHQTLTLLAFLYGYNPTVILLDEPDAHLHVNLQREILDYFKRKSQEK
SIQFLIATHAEEFAKGVDASQIISLMGQEPERIESVPKVIRAMAEVSNED
IVRTLAYPYIVYVEGESDERIIRAWSSACGADEVIDKVCFKAMSGGDKQK
MKKFADKHFEALKQIVPKLQRIILLDYDESEDYHPQKDNEVLYEWQRKNI
ENYLFVPDVWKRVASEKIGLPEGDLFLEPVLRLIDDYFESQNLVLPKKQS
WKNVSAEVFKIVDGKKLLFERDDSLFHQLSKLFHQLSKNNPSIQVLREEI
ALNMKREEIALNMKEEEIHEDVHEFMNKLKSLFNP
>CT0258 conserved hypothetical protein
MFRTLFDIAIRHILGRKRQTLTTMLAVSVSTMVLITTISLTRGLLDSFTE
TIIDAAPHIRIKGEKIDPMPTNLFDSLAVSRKAFVTDNIGRDEPEEVRNY
GRILDIVSSQAFSGKVVAASPFVESQIIAVKGNRTQPVVLKGVDIDREDR
ISHIGRSLTSGDLVLFRKTPDALLVGSSVATDLGVELNDQVTIITPDGRS
RQGKVTGIFFTGINAADNTILSSLKLGQIVEGMPPNKVTGIALKVVDPLN
DAPLARDLERMTGYRCLTWQEENASVLVLFKRIGSIVLSLVGFVGVVSGF
GVANILVTTVFEKSRDIAVMKSFGFSSAQMVGLFVFEGFLVGLGGALTGG
ILATGSIGFLASLHIESSQGPLTKSGFSMSWNPWYFFFVIVVTVIISTIA
AAIPSLRAARLEPVTVLRESNL
>CT0671 hypothetical protein
MLNYFLLVCLLIARFFPLSRQDSGKDVFRVEFSLCRKEYIDNHVNCQKEN
SRLNPAR
>CT0695 nucleotidyltransferase family protein
MEIPLDMLRTICKECSVRKLSIVGSIARGDEGPESDVDLLVEFKRQGSPL
RQYMETKKRFEKLFHRKVDLIERSAMRNKRFEASVLQDEKVIYEA
>CT0441 hypothetical protein
MALPSVVLPRCTAAGVTAVSGGILDLHVVNSGGYLLVLRRLVRGKTAMWC
LAGFSGLWGNLYCGQNGFLTAAIAGIALLSLERRPVLAGVFIGLLAIKPH
LAVLFPVALLAIGAWRTLITAAVTAITFMAIGTLTLGTAVLKAFFASLGD
ARHLCLENGSLLWKKMPSVFAFLRLLGTPVTWAYVVHCIVAAVAVIAVWQ
VWRHCQNWNLRGAALMTATFLVSPYAYDYDLAWLAFPIAWLAVDGLRNGW
LRGEREALVAAWLMPLLMSPIAGALKFQIGPLVLCSLLWITVRRANAASM
MGGMATDAYADQFEPLP
>CT0667 hypothetical protein
MLLWEKDSRNSGTRMGFFMVHLLSGRTADRAARGETGFSSKEEG
>CT1216 hypothetical protein
MDQPFFLESTELFCRVFRYEFHQNVFLNLKSVLLERSWFF
>CT1874 6-phosphogluconate dehydrogenase, decarboxylating, putative
MGSNMVEHLLELGHEVVAFDLSAEAVKAIAAKGATAASSLQHLVGELAAP
RVVWMMVPAGRPVDAVIDGLTPFLRAGDIVIDGGNSRYTDSVARAEKLRK
QGIRMLDIGTSGGLDGARHGACMMAGGDREAYEHVEPVLRDLCVENGYGY
MGASGSGHFVKMVHNGIEYGMMQAIGEGFELLRASGYDLDNQNVARVWSN
GSVIRGWLMDLAGKAFAQGNDLGWLGGKVADSGEGRWTVEAAIELGVAVP
IISGSLFRRFQSQNEEHFSDKVVAALRHEFGGHAYEKPGEGEGKA
>CT0481 membrane protein, putative
MANLWIHALTVFMGFFAIMNPIANVPIFLSLTEGDDKKTTAMVASRALLL
AFLIVTIFSVAGKLIFDLFGITLPAFQITGGLLVFLIGFHMLQGDQSSVQ
HPSETGKKKSPEAALSVTVSPLAMPILAGPGTIATAMNFSTGENFMEMAV
TIVVFGVLCLITYVLFVSGEKFVTYIGASALGVITRMMGLILAVIGAQMV
IAGIHGAFGLGAG
>CT0115 hypothetical protein
MGFAKSTGDDVNNASLRLKRLRKARPPHYLSDHNGYSQLFCKRRAVLSFF
MIPLESKFF
>CT1363 hypothetical protein
MNQSQNVLKLPKKRHLLHDLQTAIRKKSRLF
>CT1973 CRISPR-associated protein, CT1973 family
MDNEKEKKTGRQKQFVEFVIGLCQRDKGAAAALRRADNPATEYQSWEYLA
GFNIDLEKPFERIPYAAIAAAIARAKAERNGSAGIGKAIAFCYEDRSKSD
QAKARLRRLLACNSVEEACRILRPLFSLIDSKAAVTLDYAELLSQLLWFN
DDSNRIKTDWATDFYRHAAKTENEEVKA
>CT0642 hypothetical protein
MHRKERALELFSNRCNCSQAVFAAFRQTKVLDEASALRLATMFGGGVAGS
GGGMCGAVTGALMVLSMRYGMGGVEELVNRKKTYELGRQFIEEFEKRMGS
ARCESILGLCIGEPENLQKARELKLFETVCVSAVATASDILEEMLCAEG
>CT0975 sodium:solute symporter family protein
MAQLTPLDISFIAGYLLLTLLVGLFFSRRASENVGEFFLSGRKLPWWIAG
TGMVATTFAADTPLAVAGFVAKNGIAGNWVWWTFVSGGMLTVFFFARLWR
RANILTDLEFIELRYSGKAASFLRGFKAVYFGLFINAVIIGWVNLAMFKI
IKIMIPGLDPQLTIVGLVIFTAIYSGMSGLWGVSITDAVQFVIAMAGCII
LAVLAVNSPQVTAAGGLKQALPDWMFDFFPSFSSTVNTSATGAYALPFTA
FAAMAFVQWWASWYPGAEPGGGGYIAQRMMSAKDEKNSLLATLWFTIAHY
CVRPWPWIVVGLASLVMFPNLPAGQKEDGFVHVMNAVLPAGLKGLLIAAF
MAAYMSTLSTHLNWGTSYLINDLYKRFIKREAEQNHYVLMSRIVTAVTAI
FALYITFYVLETITGAWEFIIQCGAGTGFVLIMRWFWWRLNAWSEITSMV
APIVAYSYINQFTTIVFPESIYIIVLFTITCTLAVTWLTKPTDREKLHAF
YRTTRVGGALWKPVADEMPDVKGDTGFPALFADWFFGIVLVYATLFGTGK
LIFGEPVAAALYYAAGALAGVMIYRDLSKRNWKTFD
>CT0765 hypothetical protein
MKLFPDEDKKKNFMKRGLPVVLAVVWTPIIWMVLAAFLGPAMERVIGVWQ
VTVAILAVATLLAMVALIRLFKTLGLKIFDNIG
>CT0494 polysulfide reductase, subunit C, putative
MTFVHQEVWHWQIATYLFLGGLGGATFAISAVLHLFEGCDRKMLSVAVMS
SIAFLVIGTVFLLADMLQPLKAIYALTNPRSWIFWGVVFINAYFVAAIAY
VIPLLEEWPMLQPIIQKIPAPILGLLERFNKLVALGGSAAGFLVAIYTGL
LISAAPAINFWNTPALPLLFVISGFSTGAAWLLLLSMLSSNPGAQAISAK
LEQLDAILIVTELIILGAYFNFAMFLPTSARASAEFLFHSPVFIVGFFVA
GLLVPLAIESWGIFFGGHSEKDKPKLTMLLASALVLVGGYLLRIYVLKAG
MFQYPW
>CT1769 hypothetical protein
MTKANDPIKNIQSRIDELEQTINERGEQIRKRTRQLKEDLQAELSPMEML
KRHPVEAAGASFVTGIVAGRVIRSLFGRKPRAAAIQPSAESAPQAAHAPQ
QKQPSQIGVAVGAIGVELLHAAKDLAVTWLKSQVEAKKK
>CT1372 hypothetical protein
MKRVSIIAFPELETKKRQCLHNRVACCIFVAI
>CT1712 para-aminobenzoate synthetase, putative
MARRGARETHSDRSPRPVMALYEKLASPGSLWFESTLPGALYGDSLFFSD
PLETLTLHAGDSVAPWFATLESRLDAGLCLAGWLSYEAGYLLDPALAALA
SAGADRELLGWFGVYGRPERVRRETVEAEDAAAAARSCAVSGFGFEFSEA
EYCERIDRLRTEIAAGNVYQANFTGRCRFSFDGAVEALYVKMKRRQPSPW
SAFLNTGDRQILSFSPELFFASDGRLIETMPMKGTAPRRERPEEDLAEKA
GLAKCEKNRAENLMIVDLLRNDLGRICATGSVQASGLFETQTYPTLHQMV
STVRGELRPATRLHDLFRALFPSGSVTGAPKVRAMQLICELEKSLRGVYT
GAVGFMLPEGRMAFNVAIRTIELRGQSGVYGTGSGIVWDSDPHAEFRECM
LKTRILADLVPPSDPSVPGIFETMQWNGGEFLLAGDHLDRLVSSAMALGF
TFDRAAIAEALSAKERELRKNGGRHRVRLTLSHDGGILITSEPFDFDASG
KSVRVCIAAERVDSRDPLLRHKSVARERYDRALREARERGFGEVLFLNER
NEVTEGAISNVLARIDGRWLTPPESCGLLNGVFRRYLLRSRPWIVEKAFT
LDDLHRADMVFVCNSLRGVRPVAIVFPE
>CT1522 hypothetical protein
MKTIMEQALLDQALAMSPNERVEFAQLILASIEHEDEKIRQKWITEVKDR
MAALKSGKAKLIDFDSLYHED
>CT2027 hypothetical protein
MLRTSWIKKDMKKAEGLFSEVCFLKNKFVDFRPEQ
>CT1493 xanthine/uracil permease family protein
MRNFFEFDRHGTSYQQEVLAGLTTFFTLSYIIVVNPAILEATGMPRGASM
TATILTAVFGTVLMGLYAKRPFAVAPYMGENAFIAYTVVKTLGYSWQTAL
GAILVSGVLFTLITLTGARKWLAEAIPMTLKHSFTVGIGLFLAFLGLSNM
GVVALGVPGAPVKLGDLTTIPALCGLGGLALTGALLVRKVTGALVIGMAA
TTMLMLSFGLLQLPQTLFSLPPSIAPLWLQVDITGALTWGFVGVILSVLV
MDFVDTMGTLIGLSARARLLDENGNLPEIEKPMLVDALSTTAAALFGTTT
AGVFIESASGIEQGGKTGFTALVVAGLFLLALFFAPILTIVPPQAYGPVL
VLVGMFMIESAGYFDFNDYTELLPAFLTIVMMLFTFNIGVGITMGFISWV
VIKALAGRFREINAGMTALAILSATFYIFYPYH
>CT1657 conserved hypothetical protein
MAGNPISDEPNRLTAEGNGYGDPQAQLIDDRKFQRLMKAYETTVETRKLE
IELFWSRSLFFWGFIASAFVASATLRRYSSDISVVVACFGFVCSVAWSLG
NRAGKFWQESWEMKVERIEPSVTRAMFAQPEAVQTNKNFWLRGRRFSVSK
LAIALSDYTIILWVAVVV
>CT0540 S-adenosylmethionine synthase, archaeal-type
MSVPRNITIERSEAVPMDQQPFELVERKGIGHPDTICDSIMEAVCIDLCR
EYNSRFGHICHHNIDKGLLVAGRSLPKTGGGMILEPMKLIFGDRATYTCN
GQLVPVGEIAEAAAKRWIRENLRFVDPDQHLLFQNEIKPGSPELTDAFAR
KVIGANDTSVGVGYAPFSETERIVLATEQFLNSPALKERFPEAGEDVKIM
GCRNGRKLQLTVAVAFVDRFIPNANHYFERKSALRHELLSFIESQSCNID
SVAIDINSLDDPSRGEEGVYLTVLGTSAEGGDGGQVGRGNRVNGLIAFNR
NQTMEAAAGKNPVNHVGKIYNVLSHELARRIHREIEGVVSVTVFLCSQIG
KPVDRPLMASARITPEPGANMAELQSRATSIIDRELDGIDAFCQKLADGE
FRVC
>CT1574 conserved hypothetical protein
MKETLSGVVTRVTGASYIVETGDGLKVRCRTVPGTVSENEGSNLVAVGDR
VEFRPKASETDMAEGVITRVEERRTALVRRREVRRNRSKEKEQVIVANID
QLVLITSFDDPPFNSRLVDRYLVFAESEKLPLLIVVNKIDLDEEGMVEED
LEVYRQLDCNICLVSAEDGRGIEELRELLRDRVSAFSGHSGVGKSTLINL
LVGCEELRTAETSGKTGKGVHTTTSSAMFQLPGGGYVIDTPGIREFNLAG
ITRENLRFYYTEFLRYMPECTFSSCSHTVEPGCAVIAAVESGSIARERYE
SYLALLDSLAE
>CT1872 hypothetical protein
MPLTELLNLFIRDGHVSILNATHEILRRTIFLIKMRFRIVQSAERSFFPV
FPLDRYFAYPMEWLS
>CT0827 hypothetical protein
MGELIMKKTIITSLVAIAAFGFAGTAHADSFATYSSLNTLSAGTDDPNGV
SYGYSYDWGGVSSSCCNGSSAVGSYTEIDAFGVGDGSLSAVSGFQGKAEQ
GANSSFATAAGIAASSNSGLTNYGYPVVDVSVDGGAYAQSSSYASYSWMT
VWGGSTSW
>CT2030 aspartokinase/homoserine dehydrogenase
MRVFKFGGSSIASAAKISNVAGIIRRELKSTPLVVVVSAIGKVTDMLAET
AALAGNGDAAYRDKLEGIASLHGGIIRELFGTEASAEETWLGEMMAELND
VLHGVFLLRELSDKSLALVLSYGERLSCRIVSRYMHVSGTPAECVDARSV
IVTDDNHCFAKVDRLATGKLIHERFRSFDVLPVVTGFIASAPDGSVTNLG
RGGSDFTATILGAALHAEEVWIWTDVDGFYSADPKRVPDARVIPEISYAE
AMELSHAGAKVLHPLAVQPVMKAGIPLRIKNSFNPEKPGTRIGIEAAGAE
ALPGTVTGLTSINHVVLLSLSGSGMAGVPGTASRLFTCLARHSINIIFIS
QASSEQSISLAIAPGQASMAKKVLEEEFAREIEERRIDPVSVRRNLAMVA
VVGNKMLGHPGVSAQLFETLGKNGVNVIAVAQGANEMNISLVIDSADENK
ALNCVHESFFLSMRKVHVFIVGTGTIAKSLISQIRRHRATLQKELGLDVV
VAGLANTRSMCIEPAGIDLEHWHDSLKPRESHEGIGQYIRLIQERNLHNT
IVVDCTASRQVAECYPALLRANISVATANKLGMAGPWDLYRKIMDALRSS
NAKFLYETNVGAGLPVINTLNDLKNSGDKIVCIEGVLSGTLSFIFNELRK
GGRFSEIVRKAKESGYTEPDPRDDLSGADFARKLLILGRELGYQLEYADV
ECQSLVPEPLRGEMSPAEFLDQLSSIDDWYVDEMASAASEGMTIAYTGEL
RDGKAKVGLKRVPLESPVAGLNGSENLVVFTTERYLKTPLVVKGPGAGGE
VTAGGVFADILRIASYLV
>CT0492 hypothetical protein
MITQILNNVNDYMLFSAFLRTKSRRDGNWVG
>CT1310 hypothetical protein
MVSPFLRREKNFLMNAGIGGVASSYGVKKDPY
>CT0415 carbon-nitrogen hydrolase family protein
MERLNIALVHLAVRHGEPEHNRRELIRLNRQAAEAGARIIVNTELAVSGY
SFRSPKEVAAVAEPVDGPSVQAMAEIAEAAGCYIVLGYPEIDPCTGICYN
SAAVLGQDGKLVLNYRKVTAEARWACPGSHMQESLFETPWGRAAVLICSD
SYYGLIPRAAALRGADLLLVPANWPGGSLDPRELWRARACENGCALVACN
RTGKDRTMECYDAVSCAYGADGSVIAERSSPDSEVFHVELPLSRGKLRSS
SRERFASRTPERYRSIYLDMRYANDMTSWHSLPEPAPVQVHCLHELYFEP
GDVSVLDSLLHDREKAAGLVVVLPMLRVANREEAVGFLLDVARAHGAAFC
AGLVDTDGVAELICCRPDGSVHRRCPERDDFVLIDLDHLRLTLLSKEECR
HPEAVTALAKEGCDLAVVPETGFDEGDRAVLGSRSIEQLAIAACGRDVSF
ICLPPVDHYRWEEAVGEGVDGASMLIEVEKLRRKRFYDRIDVELLLARNG
RSPDACRADETGKREEETP
>CT1367 ComEC/Rec2 family protein
MRYFLSAYPSVRLLACAVAGIMAAIFLPVSPVAWFGVAIASCIVTALLLF
VSRKRHPAGTVSLFSAACYLTAVFSAFAFHASASYRLAPSPSLLSWVGRD
VILSGVVDGRPVAWNGTARLRLRVNEVFEDGQTTRVSDRVKVVVRMPGDE
DPQFQEGDFVRVKGRPALIPVASNRDEYDARFRERLKGTHVQIFCAGPWQ
LLREPPKPGFSVVPSIVNPVRNYLNSAIDRNFPEGGARHFIKGMVLGERD
LMPEELYEAFRRTGTAHVLAVSGLHVALLAYAVNLCLQRLKVTQAGRWLS
LFIIVAVIGLYSFVTGNAPSIKRASVMTAVIIAGGTIGRKSYPVNSLAAA
DLLILLFDPFDLLTPGFLMTNGAVLGILTIYPHLSGVVAHGKGVLRPLAH
FFWSAFSVGFSAMIGVSPVIAWYFGTFSPSGIAANLPVVLFSTLAMYASF
PMFLFHGFASGIASLFGAASWFFAGLALSCAELFSRLPLASVEVRPGLFD
VAVFYLTFAFAFYAIVNRAWGRLALFVLVGMNMLLWHQVLRPVQKPPQVV
TINMGRDVAVLFSSSGETCLVDAGRRDGSWERIRQQASVWNLATPVSVAS
LFSPASVIRSLPVSPTASGAAPHRSFVIRMLDDKVLRIDSKSHSLLLVSG
LKRLEAQRADGADVVFWIYRFTGKEWHRLDAWIAETRPRRMLLVPGPFMT
AEQRELLNRYAASRPQVEVRSRSRQTVWY
>CT0146 hypothetical protein
MTKYLTELWDLIRLNPKKFVIRALLVLAAIGFIFGDFGLVTRISMELENR
KLEKLLAEEQEKIVELRSTIKNAYQPDSVEKVARERFNFHKKGETVFIIR
EK
>CT0691 hypothetical protein
MRHISSTQNKSHNKALHRAAIPLRSIAAGELGR
>CT1305 hypothetical protein
MLKDLPVDNVSIIEQDGFRHPLLFYQGIAIPASEVYHTSQRNRSMKKSLF
LATAGFAGLLLGTPQNNAQAEVNLNINIGGPRYVGNYGPDFIYLDDYGFA
VSWGWDYDVIRLGNFYFIYRDGYWFRSPSYRGPWARVRYWELPYQLRRYS
WNDIRRRRDREYRRYDRAYWDRYFRDQRARYGDGPLYAGDYGPDFIYLDD
YGFAVSWGWDYDVIRLDNFYFIYRDGDWFRSSNYRGPWARVRYWDLPYQI
RRHDWNDIRRRRDTEYRRHDKTFWDRHFREQRMQIQRDDRDRRPDFRPDG
RPDFRNDDGKSGGGNIQRLEGRPDLKPLPLPQAPPQPQGVPDGKPDGKWF
HDRGSDRKEFERKDSELKSFEGKGSDGRGFEEKGSDGRRFDRSSFERKDS
DGKSFDRKGFDGKSFEGKGSDRKNSDGKGSDNKDSEDNSNGKSDKSGRFH
GNPESGWNQVR
>CT2000 CBS domain protein
MSPQSAEAIIIILLILFEGVLSLAEFAIISSSPARLRELREAGYPSASVA
LKLQDNAARFLSSIKVTAILITTLTSVLGGLFLAEPLAALFSHLAILEPY
SHPLALTVVIASLAYLTHVIGGLLPKKFALRHPEAIAVRIAGFMNKLCTI
SSPAVLLADASAALLLRAFGIEANEKPQVSDEDVMLMIRQGAKKGVFESV
EYEMISRIFRMSDKRASALMTPRNEIEWLDLERPDEELVARIKASGRSRF
PVAKGSLDELQGVVRSLDLVNFSLSSKGSIREAIRASMKPPLFVPESVPA
FHVLELFKKNRAHMALVIDEHGSVQGAITLTDVLESIVGDVPADDMEGDQ
KTIVRRSERTWLVDGMVPVDEFLTAFNLDAEKFFEENEPRYDTMGGFMMT
RLGEVPSVSDTVKWGGLTFKVIKMNGKRVGRILVEQEAKNAEKKITKL
>CT1600 hypothetical protein
MQFKTRHPKHSMHLDTFPGLFKLERKVLYLPVLFYLSAHIAA
>CT0875 hypothetical protein
MHLVVRYSVDQGRKHLFDLRRSVDHIGQCFVIDPVLRDFRSYRNNSEGRF
LITLYVLENRCETARRVTALRRGTNEVLPKKTSSKISIFFPENDQIIAGF
TLKSLFNGSF
>CT0072 BchE/P-methylase family protein
MAWNKQVALVFLPSDSGVDGARSLYGDTEPPGLRRWLDSGVRNLIKRTQF
AIPPLALMILASIEVPGVEQTLCDLRFEEFPFDRQWDLVGISVQTGMASK
AFELADRLRSRGVKVVLGGPHVTMFPDSCRAHADCLVHGEADDLWRDVLA
DLVTDSLKAEYRPDAPPDLQLHRPVSRASLKKNRYFTTNLVQTGRGCPYQ
CDFCNVHVLNGRVLRQRKIDHIVDEVRRFSEEDKRIFFFVDDSINGDPLF
ARELFKHLIPLNITWFGQATTALGQQPELLDTFARSGCRALLVGIESIEP
SSRIAHKKTQNRTDELAGNIRAIRKSGISLYGSFIYGLDGDTLQTPQAIL
DFIADTRLDVPGINTLRPIPGTKVFDRLREEGRLLFDPEDITSFRYTFGQ
EMLYRPKNIPLPEFIESYSELTRKVFNIGNAIRRGLEAPGINAAVLFFNL
FYTHLYRLSRNDLRKQLETLGSA
>CT1104 hypothetical protein
MRSKDWERTNGWSGFVKVSKQKVIQGMSEIRIVRAVTRKPSRVIPHYRSA
TFSFKPPS
>CT0644 heat shock protein, HSP20 family
MLMKIAKDPMRLFDDIWSGSQMAVAPSFKVDISEDENAYHLDAELPGIAK
EQIALNIEDDVLTIKAERTHKEEEKKKNYHRVERTYGSFSRSFNIGEIID
QEHIGATYDNGVLHVTLPKTQPAKKTKEIPIN
>CT0223 hypothetical protein
MFALLRWLFNGNQATKDMAINQRKRGGLPLRFAAFHELTEEEDGHKTRSY
NRE
>CT0999 conserved hypothetical protein
MKRGSVVTIALQGNDGKLRPAVVVLSDYFPEHPSVTVLPIISDLRSTPFF
RIDVEPEAQNGLLKPSRIMIDKAQAVPSEKIGKVIGLLDDTKMMAVNRAL
ALWFGFA
>CT2225 hypothetical protein
MIMSSTYRETVVDGYNLIHKLGKAGPGASMADLRERLEAMLARYRQKARR
HVILVYDGGSGPKPLTLTGAVDVTFSGTVKSADRWIIDHVRSLGVRATMT
FVVSSDREIQRYSKAYGAKCVDSETFIDELAAMGIAIDKDGRRKGQQSGV
KMNKNASGLLSDKEVDYWLGLFARKR
>CT1073 hypothetical protein
MRMQLKNRSKMATALLASATMLLPSAKNALADAAPEEGIFSLKYLNYHDT
QTGDTNLTAGMSMDRMTVNALSFYGMVPIAGKWSIAGTFIEDSVTGASPA
YHGWGFPSESKNDSTSGASGELRHAGDISVTRYFSRGTLSLGTSYSQESD
YISRGLSLNGTLSTENKNTTFSLGAAYSSDTVYLDKPAVIESKQSDTPGR
KRIVSVLLGVTQVMSQNDVMQVTATYTHGDGYYSDPYKDPDLRPGKRRMF
TLMTRWNHHFDGPDGTARLSYRYYTDTFGIEAHTFTAEYIQPLPHGWEIT
PTVRYYSQSSARFYVPTEDDPRAKTPTDGMEYYSEDQRLSAFGAFSYGVK
VLKELGWSWSADVKYEHCEQRYDWGINGHGDPGIPAFSFRSLQVGLSRKF
>CT2238 siroheme synthetase, putative
MTGSIHTEPQAAAKRGYVYIAGAGPGDPELLTLKADRVLRGADVILFDDL
VLPQMLEPYKAEKIYTGKRKDAHHFAQDEINQEIVRHALMGKTVVRLKGG
DPFIFGRGGEEIETLRQHGIGYEIIPGITAAHGASAYSEIPLTMRKVSSS
VAFCTGHPVNSIQVPDTDTIVYYMVASNVHDVLDKVAASGRSGETMVAVV
QNATRYNQRVITSTLDEFRKREKAVYSPALLIIGQNISQYIEENWFSRKK
KVLMTGEAPKKYPPADYITVPFPCQQVAGADLGAVKACIEGIDRFSMLFF
QNRFAVRYFFKYLFEHGRDVRHLAHLVICTANRSVASALQEYGIIPDCCL
DREGVDAIAAMLRKEELTGQRILLSGAEHVDELVAGQLREGGNEVTPLVV
YVHGAQDQVEKIDLDFIDEIYFASADCVKKFRGMYDAIPARIAVTPADER
TAEEMRRQFGG
>CT0614 3-isopropylmalate dehydratase, large subunit, putative
MAQTITQKIFARAANRKFVDPGQSVWLNVDVLLTHDVCGPPTFDIFKQEF
GPNAKVWDPSKVVVLPDHYIFTANEHAHRNIDLLRQFAAEQGLPNYYDVG
TDRYRGVCHVALAEEGFNLPGTVLFGTDSHTCTSGAFGMFGSGIGNTDAA
FILGTGKLWEKVPDSMKFTFEGQMPEYLTAKDLILQILGDITTDGATYRA
MEFDGEAVYSLPIDERMTLCNMAIEAGGMNGIIAADAVTEAFVKARTSKP
YEIFTSDPDAQYHSMYRYNVEKMEPIVAKPHSPDNRATVHSVAGTPITKS
YIGSCTGGKLTDFKLAAKILKGKKVAVTTNIVPATVLVASQLETEMYDGQ
TLRHIFEEAGCNIALPSCAACLGGPSDTVGRSVDNDVVVSTTNRNFPGRM
GSKFASVYLASPLTAAASAITGKLTDPRDFL
>CT1982 peptidase, M23/M37 family
MSNERFQHFLLRLKRSELFQKLLHFRASSEASLFIALFSVTFFSVSLFLF
ATSRQASSEHPSDSLVQTMLKSIGFASDAEPGTNNEDQNDPSQMTGQNAL
GKEESGYTILTAPGLTPAEVQQMTEQLEKLTPRQKTIVNDTRIVSIKGTL
HSSMANELHKRHLSWLIPKLNKILDAKFDFATDVKAGATYRILFQEQRNG
SQFIGTGDILAVEISSKGRNFNAYLFTNDTGETAYYDENGWAMLQVRTMY
IQPCRYSRISSGFGYRIHPITGRSEFHAGIDLVAPMGTPVFAVADGRIVF
SGWYGYSGNMIAIAHDAGRIQTMYLHLSGFSPAVHYGNTVKQGEIIGYVG
STGRSTGPHLDFRIVKNGQWQNPLLALQQPMLWRSLSSTEFQHFMAKVQT
YHEQLGRQSPDMVDQYRQTPRNVALK
>CT0008 ROK family protein
MNRWGIDLGGTKIEGVILDSELRPLIRHRIPTGQEQGYGHILMQIKSLVG
TMAEKSGLGLPEKIGIGTPGRADGSDGVISNSNTICLNGMPLLRDLQEAL
RLEVVIDNDANCFALAESMLGAGRDEMARPGATAFGIILGTGVGGGIVRD
GRIIRGAHGIAGEWGHNPLPGEHAACYCGRRGCVETVVSGPALERHYAAL
SGRKASLQEIAASTGRDRFARQTIERLVSKFGVALATVINILDPDLVIIG
GGVGNIRQLYSPEARQAIAANVFNRSFDIPLLPPMLGDSAGVFGAALLSG
PPLIAQY
>CT0727 glucose-1-phosphate thymidylyltransferase, putative
MKAIIPVAGVGSRLRPHTFSQPKVLLNVAGKPIIGHIMDKLIESGIDEAV
IIVGYLGGKIEEYLTSHYAIKLTFVTQADQLGLAHAVHMCRPHVIDEEPL
FIILGDTIFDVDLKLVLGSSISTLGVKEVDDPRRFGVVVTEGDRIVRLVE
KPEQPVSNLAIVGLYFLHRAGTLFNSIDYIITNDIRTKGEFQLTDALQHM
IDLGEPFSTFPVQGWYDCGKPETLLSTNEVLLQRDTRQKSLPGCIINPPV
FIADSATVTNSIIGPNATIAEHAVVRDSIIMNSIIGRKSQVSEIMLDRSI
VGNNAIVSAMGHELNIGDYSEIRMG
>CT2145 peptidase, M24 family protein
MTRIEIGERIRRLQARLAESGMQAALLLMPVDIFYFTGTRQNSALWVPAE
GKPVLLVRKSLVRAKAEGLIDDIRPFPSSKELSALLGREGDKVGMTFDAV
PMAQHAWYSKVLPGRTFADVTMIVRELRSVKSSAEIEMLRHSAEMLVSIF
FEVPTFLKTGMREVDLAAEVEYRLRRIGHEGYVRMRAFNQELFGGMVVSG
GAASYGFFDGAVTGKGLSSASPQGASLDAIRENEPVLVDFAGVFNGYIID
MTRMFVIGRLDPELQRAFDVSLEIQEAVRRAMVPGAIGEEIYKQAAAMAE
AAGLGCNFMGMPGEQSRFVGHGVGLELDELPLLAQGFGMPLQAGQVVAVE
PKFVIPGKGAIGIENTFVVTEHGGERLTGLSDEVVEL
>CT0445 hypothetical protein
MSILYLAGHLFLHFQGKVMTFHHGIGVNIACGGTPPGVKVNLGFGSDDGN
VFIFPGMGSDVIGNFKGLWKGIQAVGKAAGSVRAHPKEKCTLLEQGESHD
KSGNEHPDAEPAQVRHAALQDSCQRIHAQTSPY
>CT1694 hypothetical protein
MHSAPDRKTSAIGNHNAMKTIYQQVASIYLNI
>CT1654 transcriptional regulator, ArsR family
MSNADNMARMFKVLSVGSRVRIVELLKERSLCVNALAKALDITPAAVSQH
LRVLRDAEVVLPERRGYSVHYRVDREVLAEWKLVASALLGVTNENE
>CT1080 hypothetical protein
MKKLFAVFLVFVAMALLPVLASAKSASGPLAFGLYRYEQHSKINAETYWK
CDYPVFERSKAGDIINAAILKAVISQAPSPDSKPAAASIEAAASAFIKEC
DEQMKDAQAHSWAWQSETSGEVLLDRPGMVTVSIFTYAFTGGAHGMSVTQ
YLIFDTATGRPLGLNDLFKPGFEAMLDKLIERRFRQMRGLSATDPLNGEK
GGLFENKITHNENFAVTGSGIRFLYNQYEIAPYAAGQITVDLSFDELKGI
LKPLPALKPIKP
>CT1027 hypothetical protein
MMVFGVLHERILGIEQVISIDMTVKFYSQKMKIIVYCEHWAGVQ
>CT0608 hypothetical protein
MNVMVLVAVGKMDKELFSFHNPSVSILRCQLSIINYSSYLPVLEPQPTNS
FIPS
>CT1314 conserved hypothetical protein
MLSVFVVNLRSNSAPFSFSRLLMHTTIVNERSLRTCNFPITLQDIRTLKE
LYRLKAETRDLRKPIVRNIMKQRVVGKGCLESLKNALYSLETIYIDDYTG
QRLLRIDGMKQIEVDLTYEIRELQKDIYYLEYGEDRFIEYLAKFIPGFTD
YVTEGVEMLRGKSFNAFITDRDGTTNNYCGRYRSSIQPIYNSVFLSRFAK
NCCRYPMIVTSAPLKDFGILNVSINPEHIFVYAGSKGREFIDIDGQFHSF
PIEPGKQELIRLLNERMQLLLLDPSFEKFNFIGSALQMKFGQTTIARQDI
TRSVNEAESAAFLEKIKGIVRDIDPEGKNFRIEDTGLDIEIILTIDVDPK
TGMIRDFDKGDGLEFICRKMNIDHTGEPVLVCGDTSSDIPMLKKAMEMYD
DVWAIFVTRDEKLMQRVREICPKSYMVPYPDILLTILGLLSL
>CT0007 PAP2 superfamily protein
MGGLEQADARLFQLLNHSLATPALDDLMPFLTSPKHSVHILVLLALFILV
RKGKDALFVIPLLLFAVGIADYTASGIMKSLFHRVRPCFALEGVRLLVDQ
SHSWSFASSHAANLTAIASLVWLFFWRGETVDKVFTVMVIAYASMVAFSR
IYVGVHYPGDVLGGVVIGLASAAVIYTAFAWIVKNVVHRRVMQREGSE
>CT1133 CRISPR-associated protein, CT1133 family
MSGKRSYQSARLGERGGEKMILQALYDYYQRKAADPESGIAPEGFEWKEI
PFIIVIDREGNFVSLEDTREGDGKKKKAKPYLLPKSVGRTGSNSYKTSFL
LWDHYGYVLGHSRSESDKDQAMAEKQMPSFIEKLRSLPENVKGDDGVLAV
IRFYEKGEYKKVKESDNWGECTKIIGCNMSFRLDGEVDLVPCRDAVKRYI
ETQIGESADDAVGLCLVTGKKAAIARIHSDTPINKDSKKFVSFQKNSGYD
SYGKEQAFNAPISESAVFAYTTALNMLLGKNSKNKVQVGDATTVFWSEKQ
DVFEEDFPAFFGYSKDDPDADVRAVKALYEGIKSGHAQMDSKTRFYVLGL
APNSARISVRFWHTGTIAEFAGNIRQHFDDLEIIRSPKDSGHFSMFWLLS
AMAHEGKVDNVPPNLSGQIFQSVITGGLYPATMLQQAIRRIRATQEVTRI
QASILKACLNRFSRIYNTKAKEITVALDPTNNNPGYRLGRLFAVLEKIQE
EASPGLNATIRDRFYGAASSTPVTVFPQLLKLKNHHLSKLDNAGRRVNFE
RMLAGVFEGIGNEMPSHLSMEDQARFAIGYYHQRQDFFKKKDSENNN
>CT0454 hypothetical protein
MRLYIREYFRTLCDFCYSVKRRFWNFCVVRVIVLFSLFWSIILINLGINA
INPKNICVARRYGFEKLTDMLSRLFSVCFTGNNEFFLTLAKVSRRDGLSG
GMEAPDLTSCSAQLLPGNKRRFFLILQSARGRGV
>CT0635 hypothetical protein
MQRRVELKLEQRRFYVWLGIAAAAHVAVVAAVVVLQLLYVRMHPPMKIVN
VSLVQMPGLPGPAGGPKSPETPPAAIEKQAETPELASAKKVAEPPPQPVK
KIAKPVAAVKKIPEKPPVKAPVKAPEPASKTQSAADERKKIAEPVAAVKK
IPEKPPLKAPVKAPEPAPKTQSAADERKNLQEALERLKSKSASQKAETGK
SAAPSNLSSTLANLQKKVASGGGGPARSGSGSGAGGGRYGTGGGGAFDSY
KARIADIIQNNWSFSSQMVRSTSGMEVYVSLLILPDGSVNEIRYDRKSSS
EYLNNSVKNALAKSMPFPSLPREYGAKGIWVGFVFTPEGVGR
>CT1803 hypothetical protein
MLDKQRMAILNTQGADSQVDRLSRRMIHKQFTLQ
>CT1599 hypothetical protein
MNDGRLQRFFWICAGTPVEIIEKYPTEHAKYFGIGATIFFTALFAALSGG
YALYFVFAGAPFDWFASILFGIF
>CT0607 conserved hypothetical protein
MSLKERIDQELKEAMKSGDKIRLNAIRSIRAALLEKEVSIRVGGKGELNE
EQELEVLMGLAKRRRDAIEQFTAGNRPDLVETEAAELKVLEEYLPQQLSD
EEVEAAIREIIAQTGATSMKEMGKVMGLAMKTLKGKADGGKVQNLVKSLL
SA
>CT2013 hypothetical protein
MKEFPGLKILAALLIPLLFCACAVDRPPTGGPPDRSPLSVTSTLPASASV
NTSPQTIRIAFNHYVGRNDLSKSIFFAPRIDDYEVSIHGKEADIRLYSPL
QQNRTYTLTLRTPLKSLDGNHQLDRSWVLAFSTGPVIDQGTIEGRVWTNR
LAPMQNATVMAYNASRSNAVPERRPDYIAQSGPSGEYRFEYLAPGSYRIV
AITDNNGNLQFDPETEVFAVAATPTVQTGMAGVGLRFAPEDYSARSLQSC
RIINNREIEITFKNAIPARSFELSAIRIENTATGASLPVLGYFSLSRSSE
DTTYRILTAPMEDRAFYRLRFSPGDAESQTSELTFSGNAHTERYPELSVS
IVPANGADNVITETIRPESGSSIELQCNLPVVESSVKPAVTLSLSEKGQQ
IPVPFTISRIDSRTFAIVASQGFQHSKDYLVQVKPGILKGLVGEPSKTAL
VQSRFSTAGPDAYGEISGSGRANAPAVVVEARRTGSEASRRMVAKTDASG
TFRFDFHDLPAGEYTIAGFIPSASGAISPMTRWNSGSVAPFAPSDPFAAL
TITVRGGWTTEDVRLDIPSARRSGPDDAKSPEKP
>CT0398 conserved hypothetical protein
MNVNNFRLKLGGKEYIPIVIGGMGVNISTSELALSAERLGGIGHISDAEV
MFVCDQLFGTSYVADKRQMYASNVNNRDKSSVHFDLEQLAEAQKRFVSHT
ISQKTGNGAIFMNCMEKLTMNNSAATLKVRMEAAMEAGIDGITLAAGLHL
RSLDLISEHERFRDVKLGIVISSVRALSIFLKRAMRLQRPPDYIVVEGPL
AGGHLGFGPDDWQSQSLQSIVAETLAFLKKENLDIPIIPAGGIFTGTDAV
EYLQAGAAAVQVATRFTIARESGLPDEVKQHYINASPEDVEVNLSSPTGY
PMRMLKQSPTLYYSRKPNCEGLGYLLDNNGQCSYIDDYYEALESRNPAEG
RFVVKNHTCLCTGMARYDCWTCGHTVSRLKETVNRLPDGSWQLPSAEDIF
MDYLLSENHSIRKPEIKKQG
>CT1923 comEA-related protein
MNFLNNIAVKLGITRAEITAVTLLTFFLLLGGALKYSGSVQNTDKAIKKA
ETARYSEAEVDSLLSLAMKPGDTVAAETSGVVAENAEQEETAPKSSARRT
SKKQFSGTIVFRTASASQLQMIPGVGPMMAKRLIEFRKQNGGKVEHFNDF
LKVKGIGKKKLELLQKHLTLN
>CT1526 Rab family protein
MSDLDVIRQIEQELGMQLEPVDKLKWYSKGYKLDKDQRVTAIGLYDCGSD
TLDRIIQPLESLKSLSELSLSSNQITDISPLASLNSLSMLWLDRNQITDI
APLASLNSLSMLWLFGNKISDIAPLESLKSLTELQLSSNQITDIAPLASL
KSLTELSLSGNNISDIAPLESLKSLTELSLSSNQITDIAPLASLKSLTEL
SLSSNQISDIAPLESLKSLTELQLSRNQISDIAPLESLKSLTELQLSSNQ
ITDIAPLASLKSLTELQLSRNQISDIAPLESLNSLSKLWLNGNQITDIAP
LASLNSLTELELSSNQITDIAPLASLKSLSTLWLSSNQISDIAPLASLES
LSELSLSSNQISDISPLASLNSLTGFDVRRNPIKRLPETITGFDMEILWN
DFSSSGFITFFDNPLESPPPEIVKQGKEAVRQYFQSIEEARSKGEALVHL
QEIKVHLIGDGMAGKTSLLKQLIGETFDPKESQTHGLNVVTKQAPNIKGL
ENDDELKECLFHFWDFGGQEIMHASHQFFMTRSSVYMLLLDSRTDSNKHY
WLRHIEKYGGKSPVIVVMNKIDENPSYNIEQKKINERFPAIENRFHRISC
KNGDGVESIAKSLKSAVLHPDSIYGTPLAPSWIKVKEKLVEATTAQRYLN
RTEVEKICNDSGITDPGERKTLLGYLNNLGIVLYFEALDLSEIYVLDPHW
VTIGVYRIINSSKTKNGHLNTSALGYILNEEQIRCDEYDPAKNNKFTYTL
LEQRYLLDIMKQFELCYDEGKGLFIIPSNLPTQIDNEPEITEGEPLRFIM
KYDYLPSTIIPRLMIAMQHQILDRMQWRYGMVLKSQDHEGALAKVVAETK
DSTITIAIQGEPRCKREYLSIIWYEIKKINANFTNLDVKEFIPLPGHPDE
LVEYKELLGLEKMGRDEYVSGKLEKVFSVSKMLDSVISKEERNKERLMGD
INIKLENIGNPTIPIHQQVEVNVSQETVQHVENLQGFFENLKADILREAE
LEIDDPKERKRLANELELAENAITKMDAAVKSGKNKLKPDVKDRLGEFID
NLANENSRLRKGIALVMNGAEKVQKLARYYNNVAPFFDLPSVPPVLLGKE
KT
>CT0370 conserved hypothetical protein
MRTAEFWIGRLGLERHPEGGWYRETYRSEGSYGFNGNSPFGSPRSYATSI
FYLLEHGDRSRLHRIRSDEQWYFHAGSPLDVHCFPETGDPSLFTLGDDPN
AGQVLHSWVSAGHWFGASLSEQADAPGTYALVSCVVAPGFDFRDLTFADP
AALTAQFPAHAQIIEKLS
>CT1915 hypothetical protein
MSIEQGLNSRRENDKQVESQQWFQKLIQADQYVGDIYSINYETARVIIHD
FYREKVGGIPSLSFLIATRVDPSKTDIDFKKEDASFVLLRVMDAAALPQD
KEAERIRVETAQRISGETEKHWDDAGSMDLRTKNILGFAGVQCRIIGTFF
LEENGQNGDAPLNLKFGSDISNYYPNRGLKVYKPNGKALEQIVNYADPTS
IQAHTEKYGNTERVKLGFVRYASTNRKYQQVDDVPVYIYPADLLSQKSAL
FGMTRTGKSNTTKIIAKSVFELRKNENPNDRPLRIGQIIFDPNGEYANEN
VQDNNSALKNVWQLLPNGVKANEVITYGITRHPNDLERTLMLLNFFETSN
TQIGKSIIDSILSEDSTNYIKQFCQVSFDEPDPNDRSATTRYNRRLLAYR
SVLARAGFQVPPSLRASTRGLFNQDLITALQTGRNNNPPTPEYVSAAQVF
SNPNPAWGQLANAFEALDKFIRDSSSNYTAFENAYVSRPNGSGDRWADED
LKKIIGIFQYSNGTRKIGKAAEQHSADTTSDYAEDIYNHLVQGKLVIIDQ
SSGEPELNKSSATRIMTKIFKENQRKFVQGETNIPEILVYVEEAHNILPA
GNDLDLSDIWVRTAKEGSKYRIGMVYATQEVSSIQKNILKNTANWFISHL
NNTDETKELCKYYDFADFEPSIRRAQDKGFLRVKTLSNLFVIPVQVDRFE
V
>CT0246 conserved hypothetical protein
MLESMTGYGSAERSEKGMKVLVELRSVNNRFAEIGVKLPRQLLSWELEVR
ELIRASFQRGKISAFVQLQLEEAEALPVTVNPSKVRGYKALLETVRREAE
IEAPVTLDHVLRFSEIFEPDHTILEHPDEIWPIAQSALIEAIGNLKEMRR
KEGEELAIDFLSRIDEIERTLKEIKEIAAGNLETIRKRLASKISAIAGCD
VEYSKDRLEMEIVMAAERLDITEECIRFSSHNKFFIEELNNSSSGSGRKL
NFLLQEQLREANTIASKSQNADISQKVVHIKEELEKVREQLQNIE
>CT0743 hypothetical protein
MNAHHKPEMNDLRMTGIIGNLLRFGVLLAASIVLVGGILFLVHHGGELPE
YHIFRGSSSPLRTVPGVIHGLMAFQKRAIIQLGLLLLILTPVVNVLFTIF
AFWVEKDKLYVGVASLVFMFLLFSLVGGKF
>CT1335 hypothetical protein
MRESEMKRAFWIVIIALLSMQFPAVWQTVKAAEPEMLAIPAGSGAITVRI
TGLRSTNGNLSVALFNAKKGFPGKYERAVRRAKIPAAGSEPLVVFGDIPY
GTYAVAVQHDENANGKLDANFLGMPKEGVGSSNNPKSKFGPPSFDDASFV
LDKKTMELTINLRYL
>CT1245 heterodisulfide reductase, subunit C/succinate dehydrogenase, subunit C
MEHTIRTDSLKRQLEQTTGNHYACCYQCGKCTAGCPAGGFMDNPPARIMR
LVQAGYVEEAMQSDSLWYCIGCMTCTARCPQNMEIAGTMDAVREMALKSG
IISSDKAKKLVTAFHTSFLNTVRKSGRLQELALVNSYKLRTRTFLQDAGA
GLKMIKQGKINPVTAITAKETVEQKGQIEKIFKIAEQESHTAAPKRKPVK
RTFKANEPVRITPGMTIGYYPGCSLSGTAKDYDISIRKMCDRLGIRLHEI
EDWNCCGASSAHATNHKLSVLLPARNQALADQQGFDYVLAPCAACLNRQV
TARKALMESEELRNELKSIMGRDTECKAQFIGIMQVLEGMDPEEIKKRVT
HPLKDMPLACYYGCLLVRPFDAMGYDDPENPTKMEAIIEALGAKPVDWAY
KVECCGAGLTMAQQEMIEDLTHKIAKNATVNGAGGFVVACPLCHTNLDMR
QEGMRKRYNDVGEMPVYYISDLVAMACGASPEEVALDKHFVPATGMVRNH
>CT0058 HIT family protein
MTRYDPDCIFCKIATGHIPANLVYKNDHVAAFHDINPVAPIHVLIIPLEH
IRSLSDLKDGDSEIAAQILLAARIVAEKTGVLESGYRLVFNNGEDALQSV
GHIHAHLIGGKTMGWPPFAGREVAHGQD
>CT1357 hypothetical protein
MHVAIIGGGISGIAAAFYLAQQKVSIDIFESENQIGGRAGSDFLQERRVD
FGGKNIGRNYLLFREFARAYGDPAFEYFGINTSQLINGRIVSISREGSAW
LNLLKIIRLCGLRKGITRLYPHIQAILNDRSQGVLCSDYFRALSENFDHL
TLDRYLNQRCIDRVIRPITIRMNGAEPEECYPGNFGSNLALALDSFEQLT
LGMYNLLDTFIASHRQESSFRILTGHRVTSIAKDQDKFRINYLNGAVSGT
GSYDRIISALPAYSLAELLQDELPEASRLLNKISYFPVSIAIVKYRDEVF
PQNRRAMVFDRNSPLSNAGAYGLNDLDIVRYTFSGKASRAAISEHSTPEE
VISLGEKTASPYFSIKDNPKEAFVFRYFPKGLCAYSPKHHLLLEEIDRLV
NRLSGFGTTGDYRRGASIEACFRAAKECIEKVVGDGS
>CT1831 conserved hypothetical protein
MRTVVQQVREASVSIGGELHSSIGTGLLVLAGISRDDTADDLAWMSRKIP
NLRIFEDEKGKMNRSLKDIGGALLVVSQFTLYADASRGNRPGFSESAPPE
LARVLFDRFVESIRHEAGCPVETGVFGADMQVSLVNDGPVTIILESPKKP
>CT0507 hypothetical protein
MKKHFIISMIVALGMAGFTGVTFAADAPAAKPAATAPAGEKKAEAPKAEA
KKKAVKKKKAAKKKVAKKAEEKKAEEAK
>CT1178 hypothetical protein
MDRVALKPFRRIAVILLAGAVIATSTSCSRDERKKEEEKHLESMMSILVQ
VQKNLGRIRQKEAVVVRLSSDVEGRKPKSAEQIGREINTNIRFIDSTLSA
SKNLVATLEKQNHESQYRIAALDRMTGQLKGELDKKELELGAMKREIAKL
NRQIARLTNTVDVMDEAISDQEDQMVKAYYVVGTVDQLVSKGILVPPGPF
SRFFGMRPVLANDFDLRPFRQVDITETKDIYFDKPVSRLHVITPHTKGSY
ELVGGKTSSLLLIRNEAEFWKKSRCLVIVVE
>CT0749 hypothetical protein
MTVSVAQLILKYIEEDKFLDAIQCVQNEILKIEVKPELAGADRRQIKNLT
AIMDKLSEAAMFGSEWDEGRRAKKAAIVKLQKVSAA
>CT0903 transcriptional regulator, putative
MALFGKKSSDSSKSVAVPKAFTIQIDPSKFNLATKPSGPVHEQLETLKQK
LTKLSSNVENNLMLVIRASTKKDKALASSAFEFDERYIQKGKFEVEYLTL
AYLNFQQLDEADRKTVAHARMILKELERIAQFCLNIADKTEYIQLANIQV
LHKDEYDLKLMGDDTAEMIKKAVEAFVSGNSKHATETLDMMKKIDDLYQK
AVAKLKAEVNDSNIINSTGILSVVEHVHTCAEISCAIARHFC
>CT1088 citrate lyase, subunit2
MSILANKDTRAVIIGGVAGVNAAKRMAQFDYLINRPLTVQAFVYPPEAGQ
QKEIFRGGELKNVTVYPSLAPALNEHPDINTALIYLGASRATEAAMEALE
SPNIQLVSMITEGVPEKDAKRLKKLAQKLGKMLNGPSSIGIMSAGECRLG
VIGGEYRNLKLCNLYRQGSFGVLTKSGGLSNEAMWLCAQNGDGITSAVAI
GGDAYPGTDFVTYLEMFEKDPATKAVVMIGEVGGNLEEEAAEWLAAEPRR
IKLIAAIGGTCQEVLPQGMKFGHAGAKEGKKGAGSARSKMNALREAGALV
PDTFGGLSKAIKQVYEELLASGAIKPKPEIDEALLPELPPSVQEVMKQGE
VVVEPLIRTTISDDRGEEPRYAGYAASELCSKGYGIEDVIGLLWSKKLPS
REESEIIKRIIMISADHGPAVSGAFGTILAACAGIDMPQAVSAGMTMIGP
RFGGAVTNAGKYFKMGVKEYPNDIPGFLAWMKKNVGPVPGIGHRVKSLRN
PDQRVKYLVSYVKNETSLHTPILNYALEVEKITTAKKENLILNVDGTIGC
ILMDLGFPEHSLNGFFVLARTIGMIGHWIDQNNQNSRLIRLYDYLINYAV
KPERPVPDKK
>CT1096 outer membrane lipoprotein Blc
MFRKLIRKYFHPTGPETVPEVDVMKYCGTWYEIASIPSKQQRGCASTKAE
YTLDAAKGKVMVRNSCKRNGREKSIRATAVPVEGSGNAKLIVTFFRYIKA
DYWVIGLAEDYSWALVSNPSATRCWVLSRTPYMDDATYRQLLEQLRLKGI
NTDELVKTVQA
>CT0464 membrane fusion protein, putative
MQNLQKMLPKFSIALAVVFAATIGYLLTRGNTVDVEARKISRSELVEAIY
ATGYVEAENIANLRAEFSGTVRSIGALEGQRVSKGQAIIVFDSVQPRLAV
NEARAAVAEEMAAVHDNDLRLQRNRTLFQAGAISRQDFEAAERNSTASRE
ALRQRQMQLKSREDDLKKLEVVAPFSGILTLQNVKEGDYVQSGTLVATVT
DSSRYLVVVEVDELDVPRLRTGLKAVIAFDSMPEKRFDATVVRIVPQTDR
VTKTSRVYLKLDDSVAAIQGGMTATANIIYNTKKGTLLVSKSSVFEEEHQ
SYVWKIVKGKLKKQPIRTGDSDLVFIEVVKGLNAGDVVVTSPQENYRDGM
EARIVRESPKKQS
>CT1615 conserved hypothetical protein
MRIAPYGTGSVVKTAIFCFVIFITALFLPQPGGVILATAALGFLLFTLYF
YRDPERKIPDGKGLVIAPADGKIVLKQTLDHPVTGPGSTLVSIFMSPFNV
HVNRIPVDGLVRDLRYHEGKFLMAFDHRSMTDNERMEITLDTAAGPLWFC
QVSGFVARRIVCDLEAGQEVASGKRFGMIKLGSRVDIVLPSSIQIKVKEG
MKTTAGETILGQTGGF
>CT0562 conserved hypothetical protein
MSFVPTKVFFTKGVGRHKEYLSSFELALRDAKIEKCNLVTVSSIFPPKCE
RISVEEGLKHLKPGQITFAVMARNSTNENNRLISASVGVALPADESQYGY
LSEHHPYGETAEQSGEYAEDLAATMLATTLGIEFDPNKDWDEREGIYKMS
GKIVNSFNITESAEGETGMWTTVISCAVLLP
>CT0787 hypothetical protein
MTRQRRASAPIQYRPHNRNKMDGFWSQRHEIRHAKKPNRPKESTSVRAIC
NSGGCERATSTR
>CT0847 conserved hypothetical protein
MSGIASDLELNCEGLNCPLPILKTKKAIDNLQSGQVLKMIATDPGSVNDM
ASWAKRTGNDLIEHTEDGGKHIFYIKKK
>CT0985 P-II family protein
MKYIVAMIQPHKLPDVKKSLAAAGIRKMTVTNALGCGAQGGYTEMYRGIP
SEVNLLKKVKLEIAVNEEFADATVKAIIKGAKTGEIGDGKIFIFDLPECI
RIRTEEKGHAAIG
>CT1795 hydrogenase assembly chaperone hypC/hupF
MSKKTLSTVNYQLSTDTMCLAIPGQVIEIREENGLKMGTVDISGALTKAC
LEYVPEIAIGQYTIVHAGFALKIIDEKEAAESLKLWDELIKSGAFDVDGE
LPPSPIQKPEA
>CT0786 conserved hypothetical protein
MFYFDPAYFLFALPPLLLGIWAQFKVKSAFKKYSQVATQNGVTGAQAALR
ILQRGGLENVNVEMTSGMLSDHYDPRQKVLRLSEEVYSLPSIASVGVAAH
EAGHALQDKVNYSPLAIRSAMVPVVSIGSNLGPILFMIGLFMQGVLGSSL
AWAGIILFAGTALFALVTLPVEFDASRRAKELLVSQGIVSQREMAGVNAV
LDAAALTYVAAAAQAIMQLLYYVMVMNRRKD
>CT1065 hypothetical protein
MNGEFPKGVHIMPEGRDRFGQLKPSFPVDFEPLTAGRVLPKL
>CT1768 hypothetical protein
MSRMKQEPTNVNRKEERERQRKGIPGLIDSTVSSTIDDLKAIIDAKLELF
KIELTEKVALVSAFVLLLVVLMIGVAYLITTIALLFGELFGHVWLGYLLV
SMVFILTFAFFTKVKPNALKNFIHKILLSAND
>CT0917 hypothetical protein
MTPVTSKNTLRGISRTFLVVSFTIIAIAATVGGLGVLADNAFLLKLHPYI
FFVGFGNLAILLFNRYLTSSIYPELRIDPARQRLFIGLVVLALVMVTIAV
AVGLPLLKAAAGLMLMGVVAVPLYELLTTLSVAKIWREVSVRYYIFDVVF
LLNANLGLFTLGLKEAFPDNGIIPFFVTQSAYFLGSSFPLSISVMGFLYA
YAWRRTDKMELARKLFSLWFYIFVGGVLFFLVVILMGNYLGMMLISHLLT
FGVMALLYTFGVFLRNYFRSNFAHPALAFMLGGLAFLFATSAFGILNIYY
AKGILFGSYPPIRGDKMWIYHAHTHSALLGWITLSFIGMIYIVLPSIQKT
GSLELLQAGDPLGQLLGAGTMNRAFVSLSIILVAGVAIIVSFFSGEQLVL
GIGGIVYAAVALYLLRNLTHDPVFKTGDKS
>CT0799 acetyl-CoA carboxylase, carboxyl transferase subunit beta/methylmalonyl-CoA mutase, C-terminus
MLYSKLLADNFVCATCGHRYVRLSARDYIELILDENAFTEHQETRYIIDR
DILNFPEYANKLHEERVKNGMTTALITGDGAIDGKEVVLCATSFGFLGGS
FCMSTGEKVWRAAKIAIENRRPRILVAKVGQDGHDRGAKVIAAAFADIGF
DVDISPLFQTPEEIVQQALDNDVHIVGISSLAGGHKTLVPQVVEGLKEAR
RGDILVIAGGVIPERDYDYLYERGIAGVFGPGTVIAEAAIKLLALLLEHH
Q
>CT0577 hypothetical protein
MAPGTELGDPFNDRNLVIPRSFFVSSPKLFPQYGGVVYIALSNLTVTIRN
GILCVLTSGKSWNKKDLSINNQLSILETPHENRPPVHFPW
>CT0253 hypothetical protein
MPSQESWNVTLVVSDNGRTKTIVSARHAAEYRQGEKQEIRLDGGINVQLI
NRDGSVTLITAGRGIVHDNQDIEAFDNVVIRSADNTVIRTEHISRSSSNR
MIRSDKYVTISGPSRTIRGYGFESDDAMKRYRIFHASGEALSK
>CT1140 hypothetical protein
MSNDELKGRALKPALSASGITGSPFLSRYDAFKHHQFSTKMKNKQDTTEF
QSDDYDQRQVKKS
>CT0043 conserved hypothetical protein
MPGFIGREQHSVDEKGRLLIPARFRRRFLLQENDPATGAPSRSPVLYVFK
ADDGSLELYEPSVWSEKEQQLLKLSDFNPDERLLTTMIYARLDQTELDRS
GRIALSREMLDHAGIVKDAVIIGANAKMTVWEPLRLERLLSDNASRFAPL
ANRYI
>CT1903 BchE/P-methylase family protein
MNILLLYPEFPDTFWSFKHALKFVRKKASLPPLGLLTVASMLPPAWQRRL
VDLNVRKLRSEDIAWADLAFVSAMAVQQDSAREAIARCKAAGLPVVAGGP
LFTTGYAEFPEVDYFVLNEAEITLPRFIADLENGVPERYYTADEFPALGC
TPVPEWKLLDMKQYASMAIQFSRGCPYRCDFCNVTALFGHKIRTKTSSQI
IAELEELRRHGWQDSVFFVDDNFIAHRAYLKKELLPALIEWRKTKGATNE
FYTECSINLADDAELMKLMVDAGFNRVFIGIETPEVTALQACGKQHNTSR
DMLDNIRKIQNAGLEVQGGFIVGFDSDTPSIFQKQIEFIQKSGIVTAMVG
ILQALPGTKLYERLKNEGRLLPHSSGDNADGNTNIVPMMDPEILRRGYRE
MMKHLYAPKYYYQRIRTLLEAYRAPQLKSRFRMSQFMAFMRSTVMLGVIG
RERFQYWKMLAWTAVNRRQALPLAVTLAIYGHHFRKVCSLHLKERQHGSI
>CT0383 methyltransferase, putative
MSDDTIDRFFGYLNWLFNPFHGLKTTEVYDLIGTSSLTENALYLNLGYWR
KADTIDEASEALALLVAKRGGMGPGDIVLDCGYGFGDQDILWARQLKPEK
IIGLNITSSQVERARKRVADAGLEQSIDLREGSATAMPIENESIDLVVSV
ESAFHYRTREAFFREAFRVLRPGGRLVTADIVPTENSGNPFRRMEQWFSW
NLVAGKFNIPQENYYLIPSYQNKLTKAGFVQIDIKSIRDDVYEPLHAYLA
KNRTFFAKMHPLARIMAQLTLNRSAESVYAGLDYILSYAEKP
>CT1669 hypothetical protein
MRMISRRLRAAMTVWLLLVLSIPGVLRAKESSSSANGKPAVFPEVKSIKI
TGNKALTTEEIREVMSTSTRNSFFGTGLFAGARRPFIADDFEKDISLIRK
LYTFKGYFFADVEPTVKRSKNGDVSITIRIRENQPTLLDSLSYAGLDSIP
ENLRSRYLKKSLLKLQQIFSVEKLIEERDRTLDFFREHGYTFFHPDSIRI
TVDTLGLHAGVRFDISLPGRYAYGPVKVFVHDPLKKDNPAIAKTFVRDNV
SVTIYGHQKFSPKVFSSAIAWKQGALTRQSLEQRTLENFGSTNLFSSISM
QKDAARAGAIPITIDLDPAPKHQIEPKLLVDNRYGSLFVGGALSYENRNL
FGAGQQLKLSTNYGRQLSSDTNVLSSLSADQYDKLIPYEFNIKADLVIPR
LGKQGSYYNGTVEYAQSKLPVLLDSKRELIRATYSTRPTRVSKLDFDFFE
LEVARKDSLRGFQQLFKNDLAQNIGINPADPVAVNRGLDSLLQTRLNQTF
RLRYSLSNREKSTRSKRSPIWNFSVTAEESGSLLWLIDKYIDKKSYAGFT
DSDPQIFGTPYNQYVKLDTQLAVTKNLSPKRQVAARIALGWMSPYGKADT
TPEDHRFYAGGSNSMRGWVFGTLGPGSCPNAAVSNFGADIKAELGLEYRI
QFFKVFGQESGIALFTDAGNIWDRTGPYAFSLRSLTQDFAWDWGAGLRIG
SPIGPFRFDFAWKLHDPANPQPWRFSQTKLTDFTFNFGIGETF
>CT0344 conserved hypothetical protein
MTKAQHPGATPVDLSFTRYVHSVNAFSPLFSCRRSICDATVLRRCCFLLY
FRNNCNFKLLASSPVAFDWQREGIKQTQCMKILFGVQGTGNGHISRSREL
VRKLKEDGHEVQVIISGRKEEELREIEVFAPYKVLKGFTLVTRRGKMSYM
ETMFQLDFVSLWTDVLSLDTSCVDLVITDFEPVTSMAARFKGIVSVGFGH
QYAFPFHIPIARGNLFEKYTLLNFAPARYNAGLHWDHFNQPIFPPVIPQT
LYDAVRPEEDPQKILVYLPFEEVEDIDAFLRPFDAYRFFIYGKVKEDRDD
GHLCFRGYSREGFLRDLMECSGVVCNAGFELPGEALHLGKKMLLRPLDGQ
IEQESNALAMVQLGYGMSMHTLDGEVLKEWFAMPPGKPLNYSRTVDYIAE
WIASGRWNDLRVFTDAAWKDHCREESKS
>CT1301 prolyl oligopepitdase family protein
MDAPSANVVETVCGERIADPYRPLENLKDPKVAAWYRRESDHARQVLDAI
PGRNELIEKMKEFDQRRKEKVFDLSITDNDHYFYLKQTPVDETGKLYTRK
GYKGQERLLFDPTTYKDGSGSTFVISEVAPNIDASKVIVTVSPNGSENSV
MLIIDVRDGHIWPERIDRCWFASPSWLPDGKSFFYNRMNTADLHDKAREL
DSKVLLHVVGTDPSTDREIFSRTHNPALPIKPEDIPSVIYDRKSEKIFAF
VGSVDPRVTAWYAPARFWNEKTIPWKTLFRPEDDVYDFAATKHNLYVFTP
KNAPRFKVLKTSLDHPDLATAETVIPEPAEGTLTALALTNEGLFYTISTN
GVREELYHLNYGSTKPEKIETPFEAGTMSIGSKGFDRPELWTVIGGWNHD
YRRYRYDAKHNRFIDETLSSKASYPEYDNLEVKEVMATSYDGVKVPLSLI
YNRGIRMDGKNPVLIYGYGAYGNSMTPFFNPSFLLWTYKGGILAVAHVRG
GGELGDAWHKAGMKSTKPNTWKDLIACAEYLIHEGYTSPEHIAINSASAG
GILIGRAITERPDLFAAAMPQVGVLNAVRGEFSPNGPVNVPEFGTVKNPE
ECKALLEMDAYLHIRDGVKYPAVLITAGMNDPRVPAWQPAKFAARLQEAT
TSGKPVLFFTDYKAGHGIGDTKTKQFESLADMLSFGLWQTGGAAQ
>CT1071 conserved hypothetical protein
MNAAHQVAFPDFGSLILFDLLVVITAIALSRDFNEDGINDLSGFGKDTLI
VQGGVEPFEEKSFNHAFLDQALTKLPDGFGIENPVAGFESQKALETEPIG
NLVFHLIIRKTVEALQDEELEHRCPVKRRSAHFAQIGRLLECDLKNWAEE
IPVDMLFQFHQWIFELGQTLRKKILVEKAQGIDVLHGNEVD
>CT1496 hypothetical protein
MQRISHRPMFTIRNSLFIFLCVMAGMSLPCQQPAAMAAKISYASEIVKDV
REDKVYLLEKIRKQLTKPSEKILVEALLTEDGPKAAKLYRKQLEEHPDPQ
LDPISSSRLAAYEFAVSTTPGLPVMQARASSESRPALMTIAQPLQPKQPV
SKPDSSLKRTPPPAGPAHAASKGDTVSTRLAPPPAQASGGGFTLQFGSFD
SITNAEQMVAQLQYTAPARVQQINGVYKVRLRRTFTTQQEAAAFARTLPI
ESIVVPPQP
>CT0251 UDP-N-acetylglucosamine pyrophosphorylase, putative
MSLAIVIMAAGKGTRMKSALPKVLHEANGKPLVAYVIEKSQALDPDKIVL
IIGHQAELVRAATAGFPFDYALQEPQLGTGHAIMQAEPFLKDFSGEIIIL
SGDAPLFTGRTLRELIDFHRSRQAVATVLTAEMDDPTGYGRIIRSDAGEE
VLRIVEQKDATEEEKAVTEINSGVYVFNANELFSALHGITNKNAQGEYYL
TDVFGICFGKGKKVCAFKVADANEIRGINTPEQLREAELLLQGEKYC
>CT1077 hypothetical protein
MHTIPAPFQIYAYWAIALIIGLAFFRKDVTAPNPPFDKRRIGLLVFSVVI
LVLNAYIYSNSTTDGGRMLDPWSALIFSVGNGIAETFMFYAVFRLGQSLV
GKFTQNQWAGFLVGFLFFMSYSGLIHGLFWLNILPEHVVQTSPFKPFFMP
TQLLIALSWALSFFWYRDLRTVFFLHAVVDLTMIMNVKFSMFP
>CT0332 hypothetical protein
MVKQLTKHGNSMAMVIDKPILELIGADADTPFEITTDGQALILTPLKNPK
GGEAFGVALEKVNTRYARALKKLAE
>CT0257 isopentenyl-diphosphate delta-isomerase, putative
MQEASAAITAERKHSHVDICLNRPVCFDGQDTGLDSWRFEHNAAPEVDFA
QIDLSTEFLGHAIGLPLMISSMTGGYGNALALNRALGEAAERFRIPLGVG
SMRQALEGSSHRESFSVVRSSAPSVPIFANIGAPEVAAGLSRDQLSTLID
LIEANGLIVHLNPAQELFQPEGGTDFSGFLDRLHDITATIGVPVIAKEVG
CGISATVARKLADAGVRAIDVAGAGGISWQKVEECRYLDRFGHEERFSPS
ALDEFLNWGIPTAECLTSIQTLKRQNPEYDALSVISSGGIRNGLDIAKSI
ALGANIAASAQHLLKALHSGTLEETIRTWANDLRAAMFLTGSATIEQLKH
ARIYRKP
>CT0572 pentapeptide repeat family protein
MAEKKHLDRLKAGVASWNHWRKAQPEVRPDLSQADFSAADLKGIDLSEAD
LVGATFAKATLSGADLRGADLRGADLSGARLDGVNLSRSTIDLSTRYDGV
TGCQIGVNGLYSPSTDSAALMRLDPPGNSMQGANAEAVVESLRQARKLHT
FSVILAGIAMLFIVIKPKTITLPYLAGSFKFDDFSYAFLATILSAALLSQ
VVSFIDSALQGARYLNDRRAAMLVGHFPWLLSKYESDPANKRQSRVLRFF
MIFHPLIYLYFFVQWSALAIGDWDSVIRHYQQMPIIFGEYLLPVVYVILI
RICLHLFRLSEGFQKPILFDAETERSRRTDMERLTEAIERQSALTAELVD
VLKKREGKS
>CT1099 acetyltransferase, GNAT family
MIIGDSSAGTIFVCEMDESIIGMVSLLNLVSTALGKKVAMLEDMIVDPEW
RGQGIGAMLLDHACSWARENGYGRITLLTDGDNVPAQRFYGAHGFARSTM
VAFRKQL
>CT1132 CRISPR-associated protein, TM1801 family
MTTVDKRYDFVVLFDVQDGNPNGDPDAGNLPRIDAETGMGLVTDVCLKRK
VRNYVQLLGQDIFIKEKAILNNKIDEAYKALNIDLNAAPADSKDGSKRNK
PGVAQGGEVDKGRVQMCTKYYDIRAFGAVMSTGANAGQVRGPIQMTFARS
VEPVVALEHSITRMAVATEAEAEKQSGDNRTMGRKYTVPYGLYRAHGFVS
ANLASQTGFSAEDLDLFWDALLNMFEHDRSAARGLMSTRGLYVFEHSSAL
GNAPASQLFERITVKRKEDSEGPARSFKDYEVLVDESNLGEVKLLRKLG
>CT0598 metallo-beta-lactamase superfamily protein
MEIEFYGATRRVTGSCHILRAAGYTVLLDCGLVQGSREVEALNREPFPFE
PSEIDAVVLSHGHIDHSGRLPLLVNRGFRGPIYTHHGTIELCEILLRDSA
TLSENDARFMRKHGQRDDEPLYTVDDAQRCVRQMEGVRYGERREVLPGIA
VTLLDAGHILASAFVKLEITEGDATRTLVFSGDLGQYDSPILNNPDAIAY
ADVVLVESTYGDRLHRNFDSTVKEIGEIIETSRRDCGNILIPAFSIGRSQ
ELLYLFGEHYREWELEQWQVFLDSPMAIEASRIYWLHEELWDAEARLFRR
HMRGMPPVGNLHLTRRVEESMKINELREGAIIIAGSGMCNGGRIVHHLKR
NIERPECHIIITGFQAEGTLGREIVEGRKEVRLHGRSYRVRAQLHTIGGL
SAHGDRSDLLRWLKSREGSPQVMIVHGEEGVKESFKGFLRDELSVEALIP
KPGDRLDLVSNELYRVESE
>CT1197 chlorobiumquinone synthase BchC related protein
MQGEAQALVFLKANKLKLQSVKYVANRPRDILVRTIASTITPGLDRLLLT
NKPVSHKVLAYPVMPGSETIGQVMQVGPEVTSVKEGDFVYAFKGDCWVGI
DPYYGCHAEVIPTSEENVLALGRKPIHRDLLTGLVGYVLSAMEKVALDPS
MRVLLLGLGSVGLMVSEYLHYRGIRHVDALENFPLRGQLSHAENIGIEIV
DFTDDFNDRYDLVIETTGRILMVEKVMRLLKPKAKVLLMGSYEVLGYDYR
LIQHKEPVIVCSSVTDKQHLIEAKALLETEAFETEKFFTNVFPVSQYELA
YRIALDSKEAIKTVISWI
>CT1115 hypothetical protein
MKKGVDMRYFYDSHCHMMNLSHPNLSAIIKRIYNDSIKPLLLKYSVYLKA
ALLLLLFVIPVVVITLLLTGHFVVIKWILYAVSLIAVILFVFVVVKFGDK
KKREIEISKIKSNLLDKVKEKLANVMNLLAAMETDIGDCLIQMEEELRKK
IPLNNVLVISGNGEKKEYDKIVLTPLIMDFGLKDSGKTNLIYKVRWKPIV
AQVEDLCIGIRDYYLYRDKYITGHAEPLFQIIPFMGINTQNYYSEKDNTT
GKSISVSLVQLLDKNFSEFKYDTSPQMRRKKIDAVNWRQFNGDIESIGSY
YFLGIKVYPPLGFDPWPEDDIERAKVCYLYQYCIDHNIPITAHCSPGGFL
VDDDFKNFSSPYKWEQVLDYTDEKNNKPFERLRLNLAHFGGADSKVWRKK
IADMILKKDTVSSKYKYENLYTDISYQGVDKKSYDALKDLLDRYDSAERA
RLIERLIFGSDFMINLQDINSYSQYLDFFFKTNALTLEEKDMLCNKNAER
FLFVG
>CT0639 hypothetical protein
MQKEKWQEPTKDVWFSSWIDIWFMNNKTGGWA
>CT0683 conserved hypothetical protein
MKLTRIAKLRLRVFRDFAWPTELHPFARFNVIYGWNGSGKTTLSWLLSLV
EKKATLPEAEGEARLEFDGTTKVAGSAFASAHLPQVRVFNRDFINATLSQ
TSGIAPIYFLGEDSIEKQARVEQLKQELATIISKLKASEAGKTKAESTLD
DFCKGRAKVIKELLTSANSQTYNNYDKRNFKRAVEAMDSQRAAGAVLSDD
QKAQLHSQKNAQPKPLVGKVVAPSIELDALTSKVDTLVGRSVVAQTLDEL
TSNAKLAVWVQEGLHLHSGEHATDTCRFCQQPLQAARRAALEAHFNDAFA
GFQKDLSELLSKLNKAKQSVASLSLPDDSRFYEALEHEVSTARAKVLSAK
EETEAALDALIARVEAKRDQPFAPITTQEQATANPSSMTDSVAAFNEIVE
RHNRISQDFTASVNSACEKLEASYVAEAYAEFVRLTDAVKAAATELNVLT
AKQAEIKAQIAELERAILDHRRGADELTAELRAYLGRDELRFEVKGTGYA
LTRGGQYVAHLSDGERTAIAFLYFLKSLQDRAFDLKNGIVVIDDPVSSLD
DNALFSAFGYMKDRTKDAGQLFILTHSFSFFRLVKNWFHHLPGQRKKKIE
DRPGRFFLLRTRRHTDGSRTSELGHLDPLLEEYESEYQYLFKRVHDEAQR
NDVVELEHHYGLPNVARRLLEAFLAFKFPEIRGEGVLYKRLERVDFDGAK
KTRILRLLDTYSHADAIPDPEHDLSLLAETQPVLREVLELMEAVDRDHYL
GLIKMVAPSSVEEQGEP
>CT1048 5-formyltetrahydrofolate cyclo-ligase, putative
MEFAIESKPELRKRLLARRRAMTLERWLSDSEKIQAHAASLPKLREAARV
HCYVSMEHDREARTLELLEKLALEGKAVYMPYIEQGIMKTSIYHSAQKFR
ISVSAPPTPEPLVLSGEERFEVVFVPLTGFDRRGGRIGFGKGWYDRFFSR
LSGHGIHPVKIGLAFSFQEVPSVPCDPWDEPLDMVVTENEIINCQYNSK
>CT1074 ApbE family protein
MHRFGFRAMGSGCEIVLAGTTKKKAKALAMLGMEEIAWIERKYSRYQPES
VVSRINDAAGKAWVACDDETSALLDYTDAVYRSSGGLFDVTSGVLRRAWN
FDKTEVPSQESLFPLLKLVGWQRVERRENDVRLPQEGMQLDFGGIGKEYA
ADAAAEILCEEGVRHGYVNLGGDIRVIGPKPGGEPWTIGIRDPRDPAGTI
ASISVSSGAIATSGDYERFFEVEGRRYCHIINPKTGFPVSQWRSVTVLAP
LATTAGSCTTVAMLLEADGLNYLKSSGFNYLAINQSGEMYYQD
>CT0812 transcriptional regulator, ArsR family
MAPISKKDDGLCQQTCDHPDVVESVRQAMPDENQQQELAQLFKVLGDHTR
VRILNALYRSELCVCDLTSILAMNQSAVSHQLRVLRDARIVRSKKQGKNV
LYALDDSHIAELIKIGFEHVQER
>CT0916 hypothetical protein
MPEPFTLMSLFARQSPALIASRIALAFVWVYQGAVPKLVCPSPVELGLLS
YLGPLYGFMFSVMGYGEIAFGLLLLLTPWRWPFLLNIAAMLSLLGFVSLY
EPRLLAEAFNPVSLNAAVIALSLTAYWEMGKVSASNQ
>CT0581 hypothetical protein
MKSQMKMDYQKIKKRLGYSFAFVFGFFFGISTCVNFTIDAKWTDFVSLAF
TAAGVALGYITFFRWWRNKKKDDSYRVSKDYLNALNEVQEVIREIDFQYF
YLCPAPGLLVEGDEVSFKRIKQVDQLSHQLYLCRVNLVNAKSELNFWDVN
LSAAFEKEHEELLKCLANLKVVMTGLSSQLFHYYKNHSNEYMTEIDRHKK
MFNGYLKSIRDILNKRRSLKFDGIFTFK
>CT0884 hypothetical protein
MSDTQGSLFFRINGQMPLVQSVGQNLPEGWSLGKHSSRRERLVFYDTFEE
EAFRNGLAVMHRKGTLSIIDLESGAVEAETPLAQTPPSFFATDLPEGKVR
KRLLQCSTLRAFINRCAVDRFISSWRILNRDNKTVATLDHESLHPVDKTS
KTVFPQHFSITPLKGYHKELSPMLMALPESVDAYRIVSFKERFMTIMEAA
EPLGRGYSSKLRLQLDAHASIHENVRRLLQFTTSIMEANEEGIRKDIDSE
FLHDFRVAIRRSRSILRLLNGVFDPEKTAWMLAGLRELGKRTNDLRDSDV
YLLRREEYTSLLPPSLRPALDPFFSDLEADKRLHHRQFCRYLTGREYSGF
MTSLKEFIAEGELPDPETAPLAAEPTGDVAAKTIRKALKKVLVHGRRTGS
ETSDAELHELRIDCKKLRYLLEFFASLFPPKATAQVLRQMKTLQDNLGTF
VDLTVQMEFLQSRLETIPADRGGISEAAAIGGLLTTLYRKREKVREHFHE
IFSGFDSNETGELFDELLTGLA
>CT0819 hypothetical protein
MKLRRLACMINIGSSQNFVGGMKDAHMTIPDEQV
>CT0650 alanine dehydrogenase family protein
MSVSIAIPKERAQDERRVAISPAGVQILVEQGIRVVVESNAGCFCNFHDQ
DYAEAGAIIVTSPEELYPQANVIVKVSPPQPEELQLLLPGQMLISAVHLG
TVSRQMLKTLIDKNITALGFEFIETRDGELPIVRTLSEIAGSLAIQTAAK
YLETGYGGSGILLGGIAGVPPAHVTIIGAGTVGLFAAQDALGLGAQVTVI
DKEINRLRRFEAFFNRHLVTAIANEHYISQLAKMSDVMIGALSPKLKLNK
PLVSEQVVKTMKPGSVIIDVSIDQGACFATSRHTTHTNPIYVKYGVTHYC
VPNIPSAVAKTATFALTNTLLPFLLKLNAHQTIPEILWNSHSLRKGTYLY
KGYITKKILAELTDLPFREIDMLLATA
>CT0568 rod shape-determining protein MreC, putative
MLTRKTTDVSSFFRFIAKHTAYLYFLLYCTLSIMLMQLQRKETLDAIRER
GLAINAAIGKQFTDATAIFTQERDNQHLFLQNARLFARLLRQQAALRDAA
ELKAIEANAPQWAGHFKVARVVDRRFSATDNMLIIDAGSRQGVARDMAVL
TPDGLVGRVIDVSQNYAKVMPVINRNFMVSVVSDSTRTNGLLAWQNGNER
LAKMEHVPVSSKLLVGEGVATSGYSTFAIRGIPVGQIIRISKDKLFYNVD
VRLAVDFSSLSWVLVSLAKPSMEKIELMQSPDSPGKGE
>CT1508 conserved hypothetical protein
MSEPTYFMPPEWAPHASTWLSWPHKLESWPGKFEPVPAVFAELAYQLSRS
ETVNINVLDDAMEAQARELLKERDPEGKYAERIVFHRIPTNDAWCRDHGP
NYVIRTQDGRRDKVIMNWEYNAWGGKYEPYDDDNAVPERVAKAQGLPMVS
TGMVLEGGAIDVNGAGLLLTTTACLLNPNRNPSLGKAEIEAQLRRYLGIE
KVLWLGDGIAGDDTDGHVDDMARFVNENTVVIAVEEDPEDENYKPLRENY
ELLKTMTGLDGKPLNIVKLPMPEPVYYDGERLPASYANFYIANTVVLVPT
YRCPRDQQAIDILQQCFPKREVVGIDCSDLIWGLGAIHCVTHEEPAM
>CT0210 hypothetical protein
MFPMFFPVSYLQKGFALRIAYSGAVRPFLTVFRDKG
>CT0912 hypothetical protein
MTGLSQSQASPMQIQPGNAAFNPWTDAALDTIRDVNQALTLYAEMRVVPA
HHDAFLAAIDTVSAKLRVLPGFLSLALKQMSGDSTMVKNYPETYKGVLAT
AYLDGVAAGTQPYFYNLFVRFADGRAARAAGFEALFETHIHPLLHAMAPR
GGDGPELLAYRAVLQSVVAGDRHAIYRGAEEIRSFLRRPVELPERETVTV
ENHVMVPEDKHAAWEPQVAILLQVAQDTFEPQDEPSGVGLPGARDNRYYR
KALSTEILRNAHADGGLRAYIMHGVWESVWDHENSHLDPRFLAAAGPVGA
AAVVGPVEPFYLTRRLVVAD
>CT0646 transcriptional regulator, ArsR family
MSKSMTITEEEVKKWQFNMPDEMLDAVSNRFKLLSEPMRLKILRALCDRE
HTVQEIVKEVGASQANISKHLALMHDNGVVNRRKEGLKCYYRISDDSIVY
ACFLISKSVVENLQDRLSWIQKVNTNLTT
>CT2285 rubredoxin:oxygen oxidoreductase, putative
MTDNKILPITDDVSWIGVLDPGLITFDIVMETKYGTTYNSYFINAEKKTV
VETTKVKFWPTYIEKLKKVVNPEEIEYIIVDHTEPDHSGNVHNLLSVAPN
ATVVGSGNAIKFLRDQTGHDFKHLVVKHGDKLDLGNKTIHFIGAPNLHWP
DTIYSWLEEDRVLFTCDSFGCHYCNEAMYDDLCGDFDDAFKYYFDAILRP
FSKYMLQAIEKIRPLDIKVICPGHGPILRSDWKKYVDLSERYAKSAIAMP
NEKSILIAYVSAYENTSVLAQKIAEGLRSACDFNVDVCDIENMSAEKLEE
KIAHSMGIIIGSPTINQNIVHQIYGIFAVLNPLRDRGKLAAAFGSYGWSG
EGVKIIESNLANLKLKVFDQNVMVKFQPHEPEFEQCREFGKAFAEKMIEM
YNLTCNIK
>CT0821 conserved hypothetical protein
MNLFTGITGFQWDEGNLAKNPEKHGVSNSESESIFFNQPLIVADDPKHSE
IESRWYALGQTNEGRKLFVVFTVKKDKIRIISSRDMNIKEKSTYEKADKD
RS
>CT0572.1 conserved hypothetical protein
MKNGFTVSKLLCSMSRFAVVPIRRVSVLFIFSIILLFADGGNAMSRMQPP
DGVVAGVVNAFGSRDAVRLNRFVHPKQGVVVIYRQGVFNVFKAVSRIDFR
KPVPEYFPYPKIRGGAPLRYAALPVYDCGREAWSKTGLFCDPKHRDVLLS
TMAINLKRSGLKEISQETIDRFRALEAKSVRVVLVDVNGNDLVFYLTRIG
ERWYLTILDRVSSDCSA
>CT0983 conserved hypothetical protein
MIHIGWQESIFVTGFLLFNPRHYPIQTIMSIEIRRVNTSRERKQFIKFAW
KVYRKDPELNRNWVPPVISDYMKTLDTERYPLYEHADLAMFTAWKDGVMV
GTIAAIHNHRHNEVHQDKVGFWGFFECVNDQKVADALFEAAAMWLKSKGL
DTMRGPVSPSMNDQCGMLTKGYDSPPVFLMLYNPPYYNDLCLNSGHKIGQ
ELLAWYIDQKMIDIGRLSRIAQHVLKREGLTVRDMDMKKYDSEVEKIREI
YNKAWEKNWGFVPMTDKEFEFMAKSLKPLADPHFIYFVEDKNGKAIGFSL
TLPDINQALKHVNGNPFTPWGLVKYLWYKRNISMFRTITMGVLPEYRNKG
IDSIMNARISEYGGKYGLFASEMSWVLKSNEAMSKLAKVIGGVPYKEYVI
YEKAI
>CT1582 hypothetical protein
MEKENKKEAARKKSLGELGELFAIKALVDKKFDRIRNLNDKLMNETFADI
ECEKEGKNYIISVKARNKYQKNGKVNTRYNLGSDVYTKAVMAEKKYDAIA
HWIAIQFDKNSFSIYFGSLEELQGSKAIPVDKCEKGIIGEIWEHDKRHFF
DFDYYTNQKK
>CT0991 oxidoreductase, Gfo/Idh/MocA family
MRIGVAGVGKLGEFHTNLLKQIAAEASDVEFSGVFDLNPERLQEIGKKYG
VACFSTLEELAASCDAAVIATTTSSHYAIARQLLEARLHLFIEKPITATL
EEADKLIRIEQEKGLTIQVGHIERFNPALLAVEPYIGEPMYIQAERLSGF
SRRVTDVSVVLDLMIHDIDLVLSLIKSDIRSIAASGVKVFSNELDMATAR
IEFENGAIANVTASRLSRSKLRKMRFFTRNPKSYASLDFTSGKSEVFRLV
PPDQLSSKNPIKNFATKKILEQFGEIQESLKEMALDYVSPDVPKINALKE
ELEQFIDAIRKGKPAKVTSEEGRRALMVAGKITDEIRNNTADLD
>CT0289 GTP-binding protein, Era/ThdF family
MQPPLFSCGHVTFVGAPNAGKSTLLNRLLDHKLSIVTPKPQTTRKKITGI
YHDDRSQIIILDTPGIMDPKQSLHESMLEITRRSLRESDVIVALIPFQKG
DEPIDRKFASELIEQWVKPTGKPFVIALNKADLVPEETAKEAQTEIISKY
KPVATLALSALTGGNIPELVELLRPLLPFDEPIWPDDILSTEPERFFVGE
IIREKIFLQYGREIPYSTEVVIDEFKEQHENNPSRKELIRCSVIVERNSQ
KQIIVGQKGAAIKKLGQAARKEIEELLDRPVYLEIFVKVRPDWRKKKNLL
KSYGY
>CT1407 hypothetical protein
MVLFSRVLTLLPLDRYILFCCPGFQAGVRREKVSLKVETL
>CT0324 D-alanyl-D-alanine carboxypeptidease, putative
MRLLRKLTAALACAVILLMAAAFPANAGAADFRKLDTLLNGAVHDSVFPG
ASLAVIYRGKTVYHKAFGRLTYDPQSAPADTTTIYDAASLTKAVVTTSIA
MQLVERDSLDLHAPVARYLPGFACNGKERITIEQLMRHTSGLRPHVFYAK
TCRTPSDVFRAIEQDSLTYRPGSETKYSDLNFILLGRIIEKLTGQSLPAN
FHARFAAPLGMRSTLFNPPAGLRTRIAPTAPDTTWTLPTPRPLVNDQNAA
LLGGAAGHAGLFTTTGDLIKMVRMLMNGGEYHGHRYIQAKTVRMFLRKTD
APRALGWDIITPGKSSAGTRFSANSWGHLGFTGTSIWVDPEKDLAVILLS
NRVWPTEENKKIRTFRPLLHDTVVECVEEK
>CT2209 glycosyl transferase
MNGGKTAVIVLGWNGAADTLACLASLAKVRQPAFTVILADNGSTDGTVGL
VRQAFPEVEILELGRNLGFAAGNNAAFRSLRGRGFDRVVFLNNDTVVDPG
FLQPLLDELQKPWVGIAAPKILYMDDPGRIWYAGGVLESATGLIEHTGIR
QPDGPRFDTPEPVWYATGCCLAMRCRVFEEVGGFDERFRMYGEDVDLSMK
VRERGLIVMYQPASRIWHRVSASSGGEMNLGKQLRKSGAAMMLFAKHGMI
GGLVLYPLLLPFRALLGLLRFQFFRWTSSQEREEA
>CT1766 3-oxoacyl-(acyl-carrier-protein) reductase, putative
MSVSGKKVCFMTGASGKLGSEIALAIAGQGYSIFFTWQHSEKKAKETLEK
IRWVSPESQMVRCDVSNIAEIEKAFAIFSEHFNRLDLLITSASNFFRTPL
LDVTEPEWDSLVDTNLKGAFFTMQQASRIMLKQPFVSRIITMTDISANLV
WRNYAPYTVSKSGIQHLTRIFAKEMAPKILVNSIAPGTISAYSGRDEEPE
ADLVGKIPLERLGDPMDIVMAIRFLMETEYITGQVINVDGGRMLF
>CT0808 hypothetical protein
MLLESILASESFRASELPAVPDRMRKPQKATSGGLFDFEE
>CT2203 hypothetical protein
MPHERRRVTIEEPPMKNKQPVPHPLHGHAHKRKYEKRLFRSRHKSIGQPP
GSLIHIGE
>CT0922 cation transporting ATPase, E1-E2 family
MLFAAAISENRRAVILLRNEAVGSVEVFARVSLESNIGELGLMAGATLLG
IPLLLTAVQILYVNLATDGLPALALAVDPAEPGIMNRPPNERKQGIFTRP
IMALMLAGGLWSALVNLVLFEWARHSGRSLSEAMTMTFVSLVLIQFFKAY
NFRSEREHVFKNTFTNRWLNLAIIWELVMLAAIIYVPVLTVPFGTFAMPP
HDWLIVICGALTIVPVIEGVKWLIRSGRV
>CT0424 transposase, truncation
MTRKKNKTPDIQGELIGQLLRESGSPQALFDNGGLFDQFKKRLIEKALEG
ELDEHLGYPKHTPIVP
>CT1116 hypothetical protein
MSQGDLLFITLAVKNFLAHFDRCSREYFGLVQRCKR
>CT0514 conserved hypothetical protein
MAIATARAGGKETGTEQMYILQVTNSRSTMSKMSNVKEVNASTAFSMIKK
GALLVDVRETREINRKAFGVSDYLSVPMSRFQSSLHEIPAERKVILACHS
GNRSSIASRILVNSGHRKVHNLQHGIISWEREGLPVRKKESPSPLTMLFQ
MFRKEA
>CT1161 type III restriction system endonuclease, putative
MLMTKSAEDILLIAKPQARPRIYAYSIADDAHAGLLKIGQTTREVKQRVA
EQLRTAAITNYRVELDEDAARADGSIISDFDVRAALARKGFAVVTGEWVR
CAVADVQTVLTELRTGQKLTGTHSETFAMREEQREAVEKTLAYFESIWAE
DADAAPRFLWNAKMRFGKTFTTYQLAKQLAARRVLVVTFKPAVEDAWQSD
LEAHIDFDGWQFFSRNTQDNPSTADADRPLVYFGSFQDLLGRDAAGNIKP
KNEWIHTINWDLVVFDEYHFGAWRETAGELFEGEEEAVAKKEAKLEYAAG
LEEVNEDLMVLSEQECDFLPITSKAYLYLSGTPFRALASGEFIEEQIFNW
TYVDEQRAKERFAREHPGQQNPYAALPELRLLTYQMPDELLAIASGGEFD
EFDLNEFFAATGTGEQAQFKHKDEVQKWLDIIRGQYFPKASEYLKTGTKP
PFPYADVRLLPYLQHSFWFLPNVAACHAMANLLAEKHNTFWHNYTVIVAA
GAGAGIGLDALPPVRKAIGSGFDTKSITLSCGKLTTGVTVPQWSAILMLR
NLKSPETYFQAAFRVQSPWSIKNPNGDDPNEEEILKPVCFVFDFAPTRAL
RQLSDYAIGLAPHESSPEKAVADLVSFLPVLAFDGANMTQIDAGGILDIA
MAGTSATLLARKWESALLVNVDNETLRRILNNPEAMAAVERIEGWRSLGD
NVIEAIINQSEKIKALKKKAKDEGLTPTEKRELSAEEKEFKSKRKLVQKK
LIKFATRIPAFMYLTDFRENTLQDVVTKLEPDLFLAVTGLTVEDFHLLVR
LKVFNTEQMNAAVFAFRRYEDASLGYTGIDSHRGLTHYGLYDTVVAREES
LYHSGCGLTPRSS
>CT1719 transcriptional regulator, Crp/Fnr family
MSLDNNKGKASHATRQILEKYLTLRSFRKGQLLWSEGDTDGLLVFLKSGR
VKIYRLLPMGKAITLYIFGKGSVFGFMPFFDDAPYPAYAQALDDCEADVI
SRSGLRQAVHQDPEVAIVLMKQLAQRLRDAFDTIERLQSKGANPKVAAAL
MALMDDSPNIAGRPLFITLPVASHEFAQSLGLTPETLSRSITHLVEKNIL
KRLQRNRFQILDLEALEKVAESALR
>CT0425 hypothetical protein
MLKMYVDYYVAVLSGFLQQYFGVKSQKGVTMIEYALIASLIAVAVIAVLL
TVGSNLQTVFSYVGSNLTT
>CT1556 conserved hypothetical protein
MRPLRKTVGLALSGGGANSIAQIGVLKALEEENVPIDFIAGTSMGALIGG
LYSSGYSASELESLAHSLPWQKLVSLDNEAPRTNTYLEQKSIRDRASIAI
RFEKFKLVVPKSLSASQTLTRTIDLLILNAPYHTAHSFSDLPVGFRAVST
DLLSGQRVTLTSGPLSEAMRASSTIPILNLPIKRNGQKLADGGLVANLPV
DELDHFDAGYKVAVDTHGRMYTDSSDIDVPWKAADQAMTILTQIQYPQQL
EKADMVITPDISNHKATDFSDIRELIAAGYAKGRLLAPIIKRNIQLAPRH
DIDIAGFSKSIEGIPNSAGYIEQARTARAIVRSDNRIRKILDELLQTGLF
TRVHARLDRRSRSVTFVLEPLPRIEKIEVTGGPANAIPEKEISDTFLPIT
GKLYTSEIATGSLEKLIRLYRDKGYSLVGIERASVSGETLTIRLTSGRIE
NVKIEQDRKLTKPLTVRRELAVDTTKAFRYAEAEKTISNLYGTGVFNRVS
LSTENPDEQSDNPNSTLRIKLDEKLPTVLRIGVRYDETSNAQLLLDFRNE
NLYGTGNSAGVWAKIGQKNNRFNLEFSMPRIGSTPLTMFTRAFFDQRDFE
TRQLALIGQSGLQATGEPRSLGIQRYGITTSFGTRIGKNGRLTADFTVQN
AQSYIRDNIDEPFATGNVNLASIGGQFTFDNRDSSFLPSGGRYTNIRYTS
TSALLNNADSFWQFSALHEENFSISSATTLQLTALAGTSSEEVPLSEKFF
LGGTGNAYSYRFIGLKDSDLIGNNIAVAGAMLRYKVPMQLLFPTSLTLSY
NIGNVWEKRKQMSISKLIQGVGAGLVWDTPIGPAQFTVAKPFAFERDDVN
DSARIDFTDTVFYFSLGHDF
>CT1817 hypothetical protein
MPFHTIFFINQIQEEGMNDTGYDYSIVRGFAFSALFWLVVGLVIGLWIAF
EMFNPALNLTHQHIHPLRL
>CT0438 hypothetical protein
MLKSINSQKGVTMIEYALIAALISTVTILALSQVGQNLVTLLVSVVNAFS
SAPPN
>CT0259 ABC transporter, ATP-binding protein
MRMGAGAASGFRTQQDETLGNRKKGSVDRYIIKQLLGYIKPFKGLVAGAV
ALTAFGAILTPLRPWLTRIAIDDHIAKGDHKGLAVISLLMLLVIVLDGVK
QYAATWLTQLIGQKAVYAIRLDIFRHLQRLPIRYFDRNPVGRIITRTTND
VEALNEMLSSSLITIIGDMLQLLFIVAMMFLTDWRLTLVVLSILPVMIYS
TIFFKNKMRQAFLDVRTHLARLNAFFQEHITGMKVVQLFSREEAEFMKHS
AINADHRDANIKTVFYFSIYSPLIEFLSSAAAGLVVWFSATRIMQTDLSV
GVVVSFVQFIWLFFRPLQHLSDQFNIMQTAITSSDRIFRLLEEPLDAEPT
ESAHSLDSFRDRIRFDKVWFAYDEEHWVLRDISLEIKAGETVAIVGATGS
GKTTLINILSRFYPYAKGSVTIDGIELSDIPRHDLRKLVGVVMQDVVLFT
GSIRENLSFGDPSISDETIHEAARIVGADRFIEKLPDGYDYRIRENGSGL
SAGQKQLLAFVRALLYNPDILVLDEATSSVDTETESLIEQATDRLMKHRT
SIIIAHRLSTIQHADKIVVLHKGTIRETGTHQELLAQRGLYYKLYLLQHP
ERGRMEGAGVKNSPGQSDKSEPADLPSTNCSPATPSPAPALLPGAKESLP
QSC
>CT0188 c-type cytochrome, putative
MMEAQFLFFRSCPAVLKGCVFFGNYNQQGAFQMMKSVKAAMLATTMVAMS
LTAHAAETPSVALGKKLFNDPSLGGAAGAKTCASCHPNGKGVENAWKNPN
LAKQINTCILGALKGKPLPLESVEMKSLVLYHQSLKPASIKP
>CT1388 hypothetical protein
MLTCAKRGLSGARSNAETVQSAGLKEERFERFVAKTGSNRADKPIPANKT
RRHEAPGKS
>CT2097 conserved hypothetical protein
MPTYHYRCSQCGHEEEVFQRMTDNALTKCPKCGAESYERVISADGGFVLK
GSGFYSTDYCGSKSSSKGSESGGGCSTGSCPFTK
>CT1203 conserved hypothetical protein
MAMQMIPFPADFSLRVLDLGAGTGLFAAMVAQAYPNATFHLTDISEAMLE
VARKRFAGNPRVSFAVQEHLELAEEPEFDLVIFAFSIHHLEHEAKRELLC
KIFHALRPGGAFINADQALGATTENEESYESQWFSDVSANGATAEAIHAA
KERMRADRNATLADQLAWLEEAGFGEVCCAYARFRFVVYGGRRG
>CT2210 glycosyl transferase
MPFRRLYPRWLMNGRTATEPGESDVPASPFTFDLMNPLSWITAARKLRNI
KPDVLLIAYWSGVLAPLCALMRRASGLPTFVLLHNFTSHEPISGESMLKR
MLVSSSDGFITLSQTVESELRAFAPAAKTLRLFHPLYERQTPGPSKAAAR
RSIGLPEDAPVLLFFGYVREYKGLDTLFEAMALVLQRESSARLVVAGEFI
LGSSRFREEARRLGIDGAVEFREGYVPAGEVATLMAAADAVVLPYRSATQ
SGIVPLALGHGVPVIACDTGGLGNQVEHGRTGWLVREEGAEALADGILDF
FRERERLPLEEGISDFRRRNSWREFASQAATFLETCSRRSA
>CT0169 oxidoreductase, short-chain dehydrogenase/reductase family
MQNILIIGATSAIAEATAQQFAAKGHRFYLLARNEERLKTIASDLLVRGA
SAVETALFDANDTVNHRTVLEKAKATIGSFDIVLIAHGTLGNQKECETNP
ELALKELQTNALSTISLLTHTANTMEQQHHGTIAVISSVAGDRGRPSNYV
YGTAKAAVTTFCEGLRARLHKSGVHVLTIKPGMVETPMTEGISAPDMLLA
KPEQIASDIVNAIEKKKDVLYTPWFWKYIMMAIIHIPNTIFKKSSL
>CT0203 peptidyl-prolyl cis-trans isomerase, PpiC-type
MALMSKLRDKTHIVLFVLVAAFLALIVFEWGMNFTGPTRKAGVAGKVNGE
SISMNEYEALYNNIVAGFRQSNPGVEITSGLDAKFREQAWNYVVDQTLLA
QLLKKYGITVTDQEVLDAVNNPVNPPAIIRQNFTDPRTGKIDRQLLEQAR
SDPKAKDFWLNAQEAIKRELMVNKLVMTLRTMVFVTDPELTEVVQRQFTT
FAGSFIPFPYSYAGAETNFPVKDDEIKAWYDSHKEQFREEPVRSAEFVFF
PLTPSRQDSLQVKKEIDGLIPQFAAAKSDSEFVKIQSDLPNSANVTLSRA
DFSPAAGQALFSSPKLVPGQIVGPIADEGYYRLLKIKSVTTGEPVASASH
ILIRLNPADKAEAARAMGLLKKISEELKGGASFASLAAKYSEDPGSARNG
GFVGWFTKDRMVPQFAQAVFAGKPGQIVGPVQTQFGLHIIKIEGFDNRRI
VCSEVARQIKASTQTSETIKRQAQAFLTEAKSKGFEAAAKAQRLEVGKTG
DFTRQSLLAVPGMGEAITGFAFKAKDSDISDVLDAEKGFVVAKLLTQNDT
GYHQLDAQLKEMIKTELVREKQGAALKSKLVALSKSSGGSLDAIAAKDPS
LRKITSKEIRWRDGYIDGYGVDPQLVEGMAGMKLNTLSQPVQTSGGYALV
QLTSRQLAPGTDLAAEKQKVLPQLMQARQQQFLSEYLQSYRRNAKIEDFR
>CT0648 RNA polymerase sigma-70 factor, ECF subfamily
MKENKQTLTAQEQLKQQEFQKEAVAHINSLYNYALHLTMNPDDAHDLVQE
TYLKAYRFFDSFERGTNCKAWLFKILKNNYINKFRKNAREPGKVDYDLIK
DFYHTIKDVQSDTTEAESDFFHSLLHEEVYQALQSLPEEFREVIQLCDIE
GFTYEEIANMVESPIGTVRSRLYRGRKLLRAKLEDYAKKHGYDTEIDQ
>CT0381 cytosine deaminase, putative
MSDLNHCMELAFREAIKAYESKEVPVGAVVLDPNGLIIGRGYNQVETLSD
ATAHAEMIALTSAMATIGSKYLEGCTLAVTLEPCPMCAGAIVLSKISRVV
FGAWDPKMGAAGTVLNITACNALNHQPEVYGGIMERKAESLLQDFFRGLR
GR
>CT1745 hypothetical protein
MNLSFSKASSLVLAGMLCSAPTFAAMPLETDDTGTQGAGKFQIEAGMEYA
RDHETVNGDSVREKEWELATTFSYGLSDTIDLVAGVPWSWSKVRVNGQTV
RDENGIGDLSLQLKWRFFESDDKRTSFALKPGISLPTGDDEKGFGNGRVG
GDVTLIATHTVDRGALHLNLGYEYNNYSIAEVRESSRKSIWRASLAGEVE
VAKRLKAVADIGVETNEERDSDTNPAYILGGLIYGVSDDVDLDFGVKGGL
NDAETDTTWLAGITMRF
>CT0813 membrane protein, putative
MPDVMTVLINILSASWSVLLDAAPWVLFGFLVAGLVKAFVPEKLVAAHLG
RGFSSIVKASAIGVPIPLCSCGVIPAAAGLKKQGAGKGAVASFLVSTPET
GIDSIAITYALLDPLMTIFRPVAAFVTAIATGVAVSFTGTDEPAAAPASG
GGSSGSSCSCGCGHKKVEKPGVAQKIRSGFSFAFGELLGDVGVWLLGGVL
LAGLISVFVSGQFVERYLSNDVVAMVMMLAISVPMYVCATSSTPIVAALA
LKGISPGAALVFLLAGPATNAASLPVISKLLGKKGTVAYLVVIVLMSLLF
GILVNYLYAWLGLDTKNWVSRGAHEEGGVVAIVSAIVLVVLIARARFEAW
RSVREH
>CT0495 polysulfide reductase, subunit B, putative
MARYGMVMDMRTCVGCQACMAACSTENQTPFWSEKFRTHVEDKETGAFPD
VRRVQLPRLCMHCENTPCLSACPTGATHMNKDGIVLVNYDRCIGCYACCI
ACPYDARYAYDSEDVQKERELYGKLVTHDVPHVDKCTFCVQRLSEKLEPA
CVATCPTHTRIFGDLDDRKSEVHKLAASGKAQALNQGLGTSPKVFYIPS
>CT0601 conserved hypothetical protein
MERSAIRQISGWLWEIPRSYRSDMRVPARFYASEAMLEQILADRSLEQLV
NVATLPGIVGFALAMPDIHEGYGFPIGGVAAFDPDAGIISPGGIGYDINC
GVRLLATSQPFESVREKIPDLVKEIYRQVPSGVGHGNRITFSSKQLEQIL
RDGAPRMVAFGYGEPEDLGHIESGGVIDVADPSKVSQYAKQRGGDQLGTL
GTGNHFVEIDRIDAIFDQEAAVRMGLFEGQIVIQLHTGSRGLGHQIATDY
IRVMNRAMPKYGIEVPDRELACAPFCSPEGQEYFSAMSAGANFAWANRQL
ITWEIRQAWRAVVGDDPLRVVYDVAHNIAKVETHEIDGHRRQLLVHRKGA
TRAFVGQPVIIPGSMGTASFVLEGGLASMHESFGSSCHGAGRRMSRTKAK
HMVQGSQLRQELEAIGVSVQAGSMQGLAEEASAAYKDIGEVVSTVVSAGI
ARKVVRLVPVGVMKG
>CT1965 membrane protein, putative
MFLKVQASLIDFVLKEWLLVGSGVVLVLTSVYIKRLPEYSANEIQVLFLL
FVLFIAVNGLLKSGTILKIAQKIEKGKLIPLKLVVITFFLSMLVTNDISL
IVIVPLTLSLTVNRKGILVILEALAANAGSALTPVGNPQNLFIYWFYNVP
PGVFIKTIAPFSLMFLVLLIIASLSFRTKRVLQENHVQNINKKAFVYGVL
LAIVLLAVFHVLPVLSAVVVILFALIFDRKSLNVDYALLFSFLFFFGIAD
NLKVILGPKITHSEHIFLFSVLASQVMSNVPAALLFANCTPKWQALLWGV
NAGGFGSLFGSLANLIAYKIYVKNKGTNDTVGFTVKFLVIGYIALFVSIG
LYFLLYGADVSL
>CT0678 conserved hypothetical protein
MSGENVRAFVPAPLPPDPPLEDTPARRKLLEEVTLALGRLDSITLLLPDP
ELFLYSYVRREAVLSSQIEGTQSTLTQLMLFELEESPGVPFDDVVEVSNY
VAALDHGVARLKEGFPLSNRLIREMHTVLLFRGRGSNKGPGEFRRSQNWI
GGTRPGNALFVPPPPHLVPQCMADLERFLHDENNPYPSVLKAALAHVQFE
TIHPFLDGNGRIGRLLIAFILHHDGILSRPLLYLSLYFKRHRETYYRLLD
RVRTEGDWEAWTDFFLEGVRETAGNAVDTARRLIALFEADQQKIQSLGRS
ASSTLQVFQAFKERPLLTVGRISERTGLSFPAANQAVGRMEKKLGIVREI
TGRRRERAYAYDQYVAILNEGAEE
>CT0490 hypothetical protein
MYCSVVVYNGIHHPNKDVRVFLKSYVLLFEFIAKCFQVAASDWPFFIWSG
RGTIG
>CT0910 hypothetical protein
MLVEEPSMEEALRHLLPKIIGNRAGWKVINMGSKGRLMKELPNRLRGYKQ
RMDKGEKIKIIVLIDRDNDNCHDLKRQLEDMARKAGLQTKTAAGTGGAAF
QVVNRIAIEELEAWFMGDTAALQCAFTSLRGVRFPNSFNNPDNDGTWERL
HHFLKQNGIYRKSYPKIDAARTIAKHMDPGRNRSRSFQYFVQGVEACL
>CT2050 conserved hypothetical protein
MNEECRMDDVIVRLKKVNGQVQALMRMIETGEECQKVVTQFQAAKAALDN
TFSLVLNRNLQNCLNRHDSGSVEKIIKLISKK
>CT1477 DegT/DnrJ/EryC1/StrS family protein
MQFIDLITQKNRIRENLMKRIERIIDSAQFVMGPEVLEAEKKLAEYVGTK
HCVSCASGTDALLIPLMAKGIGPGDAVLTTPFTFVATAEVVSLVGATPVF
VDVLPGTFNINPDGVAPAVEEARKKGLNPKALIPVDLFGLPAEYDRLEKV
AAEQGLWILEDAAQGFGGTVGGRRAGSFGLVGATSFFPAKPLGCYGDGGA
IFTNDDELLELLISVRVHGGGSDKYNNERIGLNGRFDAIQAAVINEKLTI
FDDELDLRNKVAAAYSARLKDRVVVPEVPEGYTSCWAQYSVLANSTDDRT
KLMAALKDAGIPSAIYYPIPLHLQKAYENLGYKPGDFPVSEDFSARVFAL
PMHPYLKEEEIEQICRVITRA
>CT1272 conserved hypothetical protein
MIRSVTVYCSSSNLAPEPYFSEAESLGRGLAERGIDLVFGGGHVGLMGRT
ADAALKAGGTVKGIIPRFLEEREVAHPGLTELHVVETMHQRKMLLTDWAD
AFVILPGGLGTLDELMEILTWKHLGQHRKPIILLNTEGFWNQLLQFFERI
AAEKMVKPGYESYYDICNSASDVLAMIDRQNTSV
>CT0225 glycosyl transferase
MAAEALADEHLVMLAYRDPAIGDHIDVQKVRLPFRWEFDLPTIFSLVRLV
WRKKIDILIPTKRKDYVLAGLVCRLTGAANILRLGIDRPLKNTPVQRLIY
GWLADGIIVNADKIRRTLALSPWIDPDKVRVIYNGIDREKLEPDSVLPCK
KPFSFTIGAAGALIPRKGFDYLIRSFARFSAGSNEITDAGLVIAGTGPER
ESLEKLAADLGISGRVRFTGHLTNPYPVMRACDLFVSASTSEGLANVLLE
SMALHCVPVSTLSGGADELIEDGRNGFLVRYGDEKRLAEIFSELYKNPGK
IASVAANAHQTIMQRFSIELMRKDLFEFCSEINDRKKGT
>CT0005 conserved hypothetical protein
MNIVPILLIRFYQSFISPLLGPSCKYHPTCSNYAIEAFRQHNFFYASWLT
VWRVLRCNPFSKGGYDPVPPKSVKSAGNSKDSK
>CT1000 ribonuclease II family protein
MSRKKGKADIVKIAVGVMLLNGLEPDFSLEAEHQLESIDGPGKENGSEIL
DLTSLLWCSIDNDDSRDLDQLTACEVQEDSSIIIYVAIADVDTLVKKGSP
IDKHAWINTTSVYTSAKVFPMLPLRLSTDLTSLNANENRLAIVTIMKIGA
DGELITSTVERAWVRNKAKLAYDSVAAWLEGNGELPPAARAVPGMDQQLR
HQDQVAQKLRLRRHAKGSLEFETFQPRAVFEGDRVVDIKEQEKNRARQLI
EEFMISTNTCTANFLAEKGVASIRRVVKSPERWRRIVNVASEYGYALPGA
PDGKALESFLALRFKEDPLRFPDLSLTIIKLMGSGEYVVEFPGQEPIGHF
GLAERDYTHSTAPNRRYPDLITLRMTKAFLTNSPPPYGIDELEYLAVHCT
RQEDAARKVERRVRKSEAALLMQSMIGHHFDAIVSGHSEKGSWVRIFTPP
VEGRLVRNVGKIKVGQKIKVKLVLADVDRGFIDFERV
>CT1400 hypothetical protein
MIGHSEVIRCFGMSGKVYFFNDSGNIQKGVHDLPPKGLACFVKIKNMMMQ
AKPAEVSMVDALVDPWLGAIGRDFEGYRNHCRRVFIFACTLAGAEGESRE
KIAIAAAFHDLGIWTDNTFDYLEPSKRLASAYLASTSKAEWTDEIKAMIE
QHHKVTPWSCKPGWLVEPFRKADWIDVTLGARNFELNRSYIREIQRRYPN
AGFHATLARLSFERMKTHPKDPLPMMRW
>CT0809 hypothetical protein
MNQATAEKINTLFSNFDPTRCFIPYISMYEIEALYFSDPPTLATTSGAPL
KAIEHILAECGEPEKINDHTTTAPSKRLEKLSNRSRKTTTGIAIATAIGI
PKMRDACPLFNNWVTELEKLAC
>CT1516 Sec-independent protein translocase protein TatA, putative
MDVGGPELLLILVVILILFGGQKIPELARGLGKGIKEFKKAQADIESEFH
KAVDGVSDTVKQAGSSEKKS
>CT0741 hypothetical protein
MRGEKIQVVARLRDRSLPKFFLCFLTLILATKRIKKIYYRMIKVKKHVI
>CT1315 hydrolase, haloacid dehalogenase-like family
MSMNFKGVIFDLDGVITGTAKIHSLAWEAMFNSFLQNYAEVNNEPYVPFD
PVHDYLKYVDGKPRMEGVKSFLASRGIEIPYGELDDTPEKETVCGLGNRK
NSLFTKILVKEGPEVFQTSVDFIKALKARGIRIGIASSSRNCQLILRLAK
LEELFETRVDGEVSMELKLKGKPNPDIFITAAANLGLEPYDCVVVEDAIS
GVQAGSKGNFGLVLGIAREIEGIKLKEQGADIVVRDLGEITIEEIDKWFD
TGLEHEGWNLHYDSWSPKDERLRESLTTTGNGYMGVRGAFESGMTSAHHY
PGTYLAGVFNKLPSEVHGQTVWNNDMVNAPNWLPIEFRIGNGAFINPLEQ
KILSYRQNLDMRHAVMEREMVIQDTLGNITRMKSKRFCSMDNPHIAAIRY
TIQPVNYSAEIEIRSTIDGRVQNRNVLRYNTLSTDHLEHVDHGRTGKDEG
IFLHVRTNHSKIDIVTHAKTTLRCGYHAKSVCEGNITSSPRWISEHFRLQ
VSADRSCSIDKVVSIHTSRDAGHNDPVAAGKESLASAGSFDQLLERHIEA
WDKIWQKADMKIDGDRFTQMVIRLHIYHLVSTVSPHNVNIDASIAARGLS
GEGYRGHYFWDEIYIMPFFIQHLPEVARALLMFRYHRLDAAREYARDNGY
HGAMYPWQTADDGREETPTIHYNPKSGAWDPDLSCRQRHVSIAVFYNAWR
YVHDTGDTEFLNSYGAEMMFEIARFWASIATFSPDDGRYHIEGVMGPDEF
HEELPGSGKPGLKDNSYTNIMTAWLLEKAIEISQRLDPAVMDGLMEKIGI
GHDEFMKWRDISGKMNVLIDQNGILEQFDGYMGLKELDWEHYKLKYGNIH
RMDRILKAENDSPDHYKVAKQADVLMTFYTLSPAEVCAILENLGYHVADP
LRFVRDNYAFYEPRTSHGSTLSKVVHSIISSYLPNGHEMAWNWFIEALRS
DIHDTQGGTTPEGLHCGVMAGTIDTVTRYFSGIAFHKDMLNIQPNFPSHW
RRLETNLTFQKSWYRIVITPKSVSVTLTESDANELPAFIGGRSVTLKKGE
ELTVQLG
>CT2233 thiol:disulfide interchange protein, thioredoxin family protein
MKRLASPFAPFVAALMVFSLSLAFSAQADARPTPAPSFSGVTVDGKPFSS
ASLKGKAYIVNFFATWCPPCRSEIPDMVQVQKTWASRGFTFVGIAVNEQL
PNVKNYMKTQGIIYPVMMATPELIRAFNGYIDGGITGIPTSFVIDASGNV
SGVIVGPRSKADFDRIVKMALGAKAATK
>CT0111 conserved hypothetical protein
MIIKQLSVFLENRAGRLTELTGILADNDINISAFSIADTTDFGILRVITG
KPELAEKVLKEQGFAVKITDVIGMIMPNKPGALHHALQILTDNGISIEYM
YAFTNGEGRATAVIRTDTPQKAIEVLQLHKMELLKTGDVYQL
>CT1154 hypothetical protein
MRKKILFVCGSMNQTTQMHQISEHLREYDQWFTPFFSDGLLGKASDLGML
EFTIMGKKRASKAIDYLTSHNLQLDIGGTLHRYDLVVTCTDLIVPKHIKR
TKIVLVQEGMTEPETILFHLARNFRWVPRWIAGTAMTGLSDTYEKFCVAS
EGYRDLFISKGVRPEKIEVTSIPNFDNCERFLENDFEHRDYVLVCTSDNR
ETFIYENRKRNIRKYLDMADGRQLLFKLHPNENVVRATREIELYAPGSIV
YAEGKTEEMIANSQMMIAQFSSTIFVGSALNKPVYCGLEPDYLKRLTPLQ
NRSAARKIAEVCREVIEK
>CT1824 cation efflux family protein
MSEHHHDHSHDHGHAGHQYHAVGSIQIAFFMNFGFTILEAIGGVMTNSTA
ILANAVHDFGDSIAVGQAWYFEKLSGRTGDKRYSYGYQRFSIFGALVSAL
MMLASSFLVLVEAVPRLLHPEHPNAKGMVAFALVGVAVNALAMLRLKGQA
GMNARVIALHLLEDVLGWLSVLLVSVVLLFTSLPILDPLLAIVITLYILT
GVVKNLRAMVPVFLQAVPSELSLDKVVADIQQTEHVTGVHHAHLWSLDGQ
RTVFTAHLEIGCDVNPAEYASIKEEIRKLVARHGIYHSTVELEYPGEVCR
NESHKDG
>CT1098 hypothetical protein
MFLAEKNIQQSGARPDVSGFGNTDSRHNNNKVLMKAHLSMLLKNLDNNNA
FKLLQALQR
>CT1087 sulfide-quinone reductase
MKKVLILGGGIAGVAAAIAFRKRGFEVEVVSDREFLFIYPIAIWIPVGTE
RFRNVAFPLEKVARKHGFSLTLDTVTAINASRDSVTLEKAGQRSDFDFLV
IALGSDKMKHEGIEHTLSICGAPEHSIRLKEKIDALIERGHGKIAFGFGG
NPKDPSAVRGGPGFELFFNLHHKLTKLGIRDNFEMTFFAPMAQPGAKMGQ
KALDMMAVMFKAKNFRQRYGKKITRFEVDGVVFEDGSKLESDLTMFIPAG
SGHSVFRNSDLPLSEAGFVKIDDFCRVVGVDGWYAVGDSAALEGPEWKAK
QGHIAEFMADCAANNCLAEHFGHQEPMRGYQEHLNVLCVMDTGDGAGFVY
RTGHSEMFIPMPVVGHWLKKAWGYYYKLSKMKYIPRIPGM
>CT1320.1 hypothetical protein
MHERKAAIGDIMKKHLLLATLASGLLFFSPSGQALADVDLHVNVGGPGFV
VDYNPEFFYVPDLGYSISYGGPYDIIMYGGYYYLYHNGYWYRSHHYRHGP
WVIVDYRRLPYRIRRYRWDDIRRYREVYYRRIHPDRFREHRDRDWRDRWD
DRRDRRDDRWDRHDDRHDDRRDGERRF
>CT0970 conserved hypothetical protein
MPNCRKLQGHFLMKRTAPLFFLLFFLLLVKTSIARAEAFRYGLDVLDAQK
CFQLQGKRVGMITNAAAVSRSGEPGYRVLLRNGVDLKFLMAPEHGFSLDY
EAGKKVDNAGIGDSLKIWSLYGNSRKPDISLLKTIDVLIFDLQDAGVRCY
TYISTMKLAMEACNEAGIVFMVLDRPNPLAPIPVSGFVLEPRFESFVGAA
ELPFIHGMSVGEIAGWLQKRRFPGLSLQIVRMQGYRRDRFADDLPGFCFR
PPSPNLHDFKTLLLYPATVMLEGTDVSEGRGTEAPFRMFGAPFIDSKALI
RELETYRLPGVKFYRTTFTPERSKFSGVECEGIRLKVTDRERFDPFMTST
AILLSLQKLWPEQTGLYRHAAFFDQLAGTDRYRLMIQQQRPIAEILDAVR
AQVRAFDAASRDRFLYP
>CT1194 hypothetical protein
MLKMSVVSIGEVLVDRAVLDARFSCNLDLCHGDCCVEGELGAPIDDREAR
FLESAVEPLRSMLPERNLRYIRRHGCAEVYQGNLYTKTIDGRECVFVYHE
NGKALCAVETAWKKGLLDATKPLSCRLFPIRVRKKFGLDYLVYEQHTMCR
DARRQGAEQDVRLIDFLEAPLVEKYGHDWYMSLKEFVASI
>CT0100 conserved hypothetical protein
MTLDELNRLLDPQVLALIDAHASDDPATFAMQFHGRGDLPVRAIAEQIAC
RKKAAAKLPSLSRFPMLYTRLGLEQASGERAAEWKASLMRGWRAIDLTGG
LGIDTLFLAQRFDSVVSCERNEALARLAEANRRMMGVTNVETLIGDSEEL
LAGYADDSFDWVLVDPARREHGGRSAGLSASSPDVVRLHDMLLRKARRVC
IKASPALEISGLETQLPTLSEVIAVSVDGECKEVLLLLYREREAGLTPEI
RAVCLGSETFEIVSSGGVPPARVVAEAPGTWLYEPDTAIIKARLTGELAR
QFHLEFLNRTVDYLTSDRLIEPFPGRSFRIEECRPFRQKSFRKELAELEI
TNAAIQRRDFPLSVEELRKRYKIGESSERYLFFTKNATGSLIWLSCRKP
>CT2004 membrane protein, putative
MNTPQFNKIAVLVATLLISALFLTMIRQFLVTILLAGIFTGLAYPLFSRF
ITLTRGHRSLSASMTLVIFFMMVFLPLLAVFTVVILQAVSLSSTAIPLIR
EQLRDPEGFLRMLSSLPFYKDIESYSDLILEKAAEILGNLGSSVLSSFSA
ITWTAIYDLVLFIIFWYTMFFLLRDGHELLERIKYYLPLNESDQRRLFDR
FVSVTRASLKGSLIIAVIQGTLAGLAFYVAGINQAVFWGAIMAMLSLLPL
IGSPIIWVPAVIILALSGNYAQAIGLFLFCSIIVGQIDNVLRPILVGRDT
SMHELFIFFGTLGGIGMFGLPGFIIGPVVAALFVTVWDIYGETFNESLIE
RRSGGGQAAGSDGSAP
>CT0070 aminotransferase, class V
MKKRLFTPGPTPVPENVMLRMAAPIIHHRNPEFMEILERVHENLKYLFRT
TQPVVVMTCSGTGGMEAAISSLFRQGDKLITINGGKFGERWSELARIYTG
NCVEEKIEWGTAISPERIAELLDEHPDAMGVCITHSETSTGTASDVRALC
AAIRERSEALILVDGITAIGAHEFHFDDWGADICITGSQKGLMMPPGLAL
VAVSERAQEIIHNRKHQPQYYLSLRKALKSHAGNDTPFTPAVSLIIGLDE
ALQMLRAEGIENVWARHEALAGACRLGCQALGMELFSESPSYAVTAVWLP
EGADWKEFNTTLKIKNGITVAAGQDDFKGRIFRISHLGYYDELDMLTLMG
GLERSLKMMEIPFRVGAGVSAVQRAFLGE
>CT1488 hypothetical protein
MMDLVLMGLFWQIGFYRTMLVEDTLFQMNKG
>CT1724 arsenite efflux transporter
MSISTKQLSFLDRYLTLWIFLAMGIGVLSGFLYPQIAGFWNQFQSGTTNI
PIAIGLIVMMYPPLAKVKYEELGDVFRNTKLLGLSLLQNWVIGPVLMFAL
AVTFLSDMPHYMAGLIMIGLARCIAMVIVWNELAKGDTEYAAGLVAFNSV
FQVLFFSVYAWLFLTVLPEKLGMTSFHVKITIGEIARSVFIYLGIPFIAG
FLTRFVMLRVKSREWYEREFIPKISPLTLVALLFTIVVMFSLKGEYIVKI
PMDVLRIAIPLLLYFVIMFLVSFWLGKKIGADYSKSATLSFTAASNNFEL
AIAVAVAVFRIDSGEAFAAVIGPLVEVPALITLVNVSLWFREKWFRVIDN
A
>CT0551 ABC-type export system, membrane protein
MNVMKPVSGIYRVALKLLMNDKSKFTALLVGITFAVFLMVMMMSMFSGIL
ARSCAPVYNIGAKIWVMDPAVNNTNSSIPMPDYILSASRSIPGVKYAVPL
YFGSALVKLKDGTYQAATVVGLDDATLFGRPEMLEGRIEDVFAENAFVVV
KDAEYAKLESPKIGTQFEINDNRGTVVGLAKVASSGLFGIPTLYTTYNRA
ITYIPSARFTISYVLIEPKSEADVPGIQKAIAKLGYEALTREQFVQKVSN
FYTFKTGVGINILMMTGISFVVGLSISGQTFYTFILENIGKFGAMKAIGA
KGNELVSMILVQAVVTSLIGYGLGVGAASTFMIVAKLRMPDYAAQLTYQT
LGLALVMVMIIAAVSSYIGVRKVLQIQPFDIFRG
>CT1979 conserved hypothetical protein
MLFHNQTIDFTMLATNENRLVEILLQCQPGQPRTRGTWEVDHQGTPFILP
SIGGITLNLQVGDPAFGWEGDHIEPGVSCTADTHKPFEHPNVTVQMLSCV
GNTATIVSGEAKGESGVVIGHHGGSEHIIVDFPREVKEKMAYGDTIMVRS
KGQGLKLTDFPDVSLFNLDPALLAKMKINIAEDGVLEVPVTTLVPAYCMG
SGIGSAHVAKGDYDIVTSDPGAVEEFGLDRIRFGDFVALLDQDNRYGRAY
RKGAVTIGVVVHSDCREAGHGPGVTTIMTCATRGIRPVIDPKANIADLLG
IGTRL
>CT0870 hypothetical protein
MIGGGSHFFIFAPQWGHSGASVIFMVFLIESKFG
>CT1199 ABC transporter, permease protein
MSKRSVKPFIPALSLLFALLAGSLIIAATGSDPIEVYQKMLRSTFTSGYG
IGQVLFRATTLIFTGLAVALPFRVKLFNIGGEGQLLMGAFAAALCGIALP
AGTPALVAAPALILVASAAGAGWAMVAGWLKVRRGVNEVISSIMLNFIAL
AITGYLLTNRFAIPSTVHTPAIVAGGWLPDFDTLFGLGWHSPANLSLFIA
LAITAGAAVLLYRSRYGYDMIASGLNPQAARHAGIDTARHTLGAMAMGGA
MAGLAASNLVLGYKHWFEAGLSTGAGFMGIAVALLAGTNPTGIIIAAFLF
AWLDYGGLAVNTLVPKDIFMMVQAITILSIISIPALFKNRLKED
>CT0363 membrane protein, putative
MKLQPYRPDRNILFAGLFLIIFLSFFAGIGSGPLFDVDEGAFSEATREML
VSKNYLTTYLNGALRFDKPILVYWLQLLSIKTFGISEFAFRLPSAVAAAL
WAGAIFFFVRRERNETEAILATALMALSLQVSIIAKAAIADAVLNLCLAI
ALTALFTHWRTKRRSWIYIAFAAMGFGMLDKGPVAILIPIAVSFLFFLAQ
KELTAWFRAMLNPTAIALFLLIALPWYVLEYREQGQAFIDGFFMKHNVRR
FNGSMEGHSGSLFYYIPVVLIGIMPSTGLFFGLFTKLRQRLADPLWQFCL
IWFAFVFVFFSLSGTKLPHYMIYGYTPLFILVGAELSKIERSWLLALWPT
GLLLLLAALPFALEMLMPSMENLFIQAQLQGFLTEMESNHFSLLMLAAAA
LCMLLQFVPALAPSMRFIIGGALLTFSFNFVAMPMAARVMQEPVKEAAQL
ARKEQLKIVMWKVYYPSFFVYSGSFAERRFPEPGDVVLTTIDRLEKLGPV
ERLYGRNGIMLVRMPTQPTSR
>CT1521 putative addiction module component, TIGR02574 family
MTASAEKIMNDALRLTPVERAEMIERLFQSFDNHRKAEIDAAWAAEFESR
LDAYKEGKIKASPVEEVMARINKR
>CT1093 hypothetical protein
MSLIILFYPGRHEYTTVEKIGTYFVIYQYSSSFPATSYPSES
>CT0230 NAD-dependent epimerase/dehydratase family protein
MKILVTGAAGFIGFHLCERLASRGDDVVGIDNINDYYDQRVKYGRLAYSG
IAESAIEYGKTVQSSKYPNYRFVKLNLEDKEGIDNLFKAEKFDALCNLAA
QAGVRYSLTNPASYVSSNIVGFVNLLEAARHNSLGNFCYASSSSVYGLNE
RQPFSVHDNVDHPVSLYAASKKSNELMAHTYSHLFGIPTTGLRFFTVYGP
WGRPDMALFLFTKAALEGRPIDVFNYGNMQRDFTYIDDIVEGVVRVLDHP
AQPNPDWSGAAPDPGTSSAPYRVYNIGNNKTVKLMDYIEALENALGVTIE
KNLLPIQPGDVPSTWANVSDLVKDFDYKPETTVQEGVNRFIAWYREFFKV
>CT1673 conserved hypothetical protein
MKLPRRKFEIIQEHAVRDLPYECCGLLVGRKVVDHRGNIDNIVVEVAPCR
NVLYYGKENGFEIAYNEFIDVEREAHSLGLVVVGSYHSHINSTAVPSRND
IDFASAGHSMLIISLYGGVPREVTSWLRRDSGGFHQEQIKVIA
>CT2020 photosystem P840 reaction center, large subunit
MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFL
FQRTETRSTKWYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQV
FPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFH
FLVGSVLIFGGWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAK
SYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINSETFMSFVFA
VIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVA
FQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFP
AYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQL
NGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICE
LNIFGTNITMSFYWLKPLPIFQWMFADPSINDWVMAHVITAGSLFSLIAL
VRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLW
GIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYF
WTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIR
WLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQT
NSPLPPPVSHAAVSGQQMLAQLVDTLMKMIA
>CT0953.1 hypothetical protein
MAPGRIYLCEKGCLFSSRQLHEGERETLPVYGQSVMLVKLHQNGRITPAI
RWAIQSSSLSVSQLAARHGIGKAHSPAMEKPRPG
>CT0737 preprotein translocase, SecG subunit
MLNSFVVIFALLAALLLIVSVLLQSPKAGSGLTGGISSLGTVQTLGVRRT
GDFLSKTSAILAGLVMVLCFIAQFTLPARHQEGTGSSILQKSAPASLPVN
NLPQSLPTGNIQPAAAPAEQPAAPAK
>CT2067 pentapeptide repeat family protein
MKIHSQAFPAIILAITLSAPSIAHAYNRDQLQLLQKSVAEWNAMRNQHPE
LAIDLSKANLEDAKLNGANLSKANLSKADLSGASLDKANLEGANLSMTYL
KKANMKAVNAAHAWLADANLNGAFMKDASLKAANLARANLRWAKMSGADL
EQASLKDAVLFEADMEGANLKGTQFNKARFLGNAILKDAVLSNNSVLPSG
EPVTRGWAMMHDARFAKEEPAAPLEFVPPVVAAPVIIPENMPAGGVQAIP
AQPVPDVREVEEWNAMRQNNPEAKIEMTEEKLGHAELGGVDLRKASLSKS
DFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDRAFLKGA
DLSGANLKGAILYGAMLYGANLDGAILTNVSLFDANLEKASLKGADLTGA
TLIGVNLTNALISPATTTPSGKKATRGWATLNNAVFIDK
>CT0190 hypothetical protein
MTVTHILLDDQNPLHRELPIYRSGKINTVRLADKSYKIYDSMEISAHDYT
ALFYYGVIEQLNALPFISESNNGLDSWDEAFLPSGAIGRMIEIIDECVGE
IRGKSPEKVMLGWQDDPERIAYWREIDPAETLGFLRDFQKFAAKAAKEGY
DLEFIL
>CT1900 hypothetical protein
MKKILLSLALLSAAMLSTRPSFAFGHELLDEPLAIIADQQASLEQTAKNS
TYELLVAKRSITLKNGVFKAGDNPDNFIEARLVRSVICDLNKDNKPDIAV
IIEHHGMGSAGFFELSALLSGAKGFTQTRPVLLGENIEIKEFSVSSNMWR
PEELDIVYLGHQESDSHANPTEQKRARYFLDDDGQLSNDFSHIQIVKKPA
LYLYPVRTTKIEVRLSPKGKVIRTIPDYNNRWRVTVQKDGMIDGQYHYLF
YEAALDKKIELPRRGWSVRYGDLAGWFDSHLHEMGLNRAEAEDLKEYWLK
NLPDSPYYTIRLIEPDVVNKRLGLKIHPKPDSELRVLLNFTPTEKPEKIK
APKLTSFRRKGFTAVEWGGILDDGRMAENVH
>CT1646 tetrapyrrole methylase family protein
MNSRDEHKGTLYVVATPLGNLDDMTFRAVNTLRNAGAIACEDTRRTSILL
KHFGIEGKRLVSYHSFNEERAVRQVIELLEEGSDVALVTDAGTPAISDPG
YTMASAAHAAGLPVVPVPGASALTAALSVCPLPSSSFFFAGFLPHKKGRK
SRLEFLASIDSTIVLYESPHRIGRLMEEVKEHFPDAQVFAAREITKMHEE
YVTGTPDELANHFTGQKQRGEFVVVVHPPDKHSKKRQEHADHQ
>CT1926 conserved hypothetical protein
MSNISQHGIWLLAHGKELFLSYADFPWFRDQTVKSILNVKEQSPGHFWWP
DLDVDLTEEIIENPERFPLVADARVIYK
>CT2148 hypothetical protein
MYNFFRILQNLFSDFFHLLGGHHFSQGKTLASTSIAQFPTISKVK
>CT1051 hypothetical protein
MSVTFSYLAETDYPVFTLGGSTADAARRLAASGCACAPVLDGERYLGMVH
LSRLLEGRKGWPTVKEKLGEELLETVRSYRPGEQLFDNLISVAAAKCSVV
PLADEDGRYEGVVSRKRILGFLAERIHSGEGGLTMEIEVPPTGAKLSEII
ETIEKNDASILSFTSWTTGEGRIIFFRVATHDFFRLVRNMENYGYLIRYH
SAFPNAGYDELREKALEFIHYMDM
>CT0658 sensor histidine kinase
MFLSRPLVRIASAADRFLHGETEVKIPVKHDDEIGRLARAFNYLSEEIVR
LTRKEEWLREVLSSIREAIIVTNASGEIVLANPAASRLFMIEAARKRSIP
VTNIEDAAMRELFERVQKRQSGIYNEELTAMTSKGKRTLKVTAVPVMRRG
ALFDGTVLVINDVTRLRNLERTRRDFVSSVSHELRTPLASIKGYTETLLE
GAMNDPENATAFLNIIHQEAEQLTALVNDVLDLSRIESGKIAYSFEPVDV
KPQLEKTAALFEPAAARKGVRIELNAPEGLPAVLADRSYFDIVVRNLIDN
AIKYVDAESGRVRVSAYATGDSVSIEVADNGIGIPQADLDRIFERFYRVD
KARSRELGGTGLGLSIVKHIVLAHKGKVEVRSRINRGTTFTVTLPVAGA
>CT1066 conserved hypothetical protein
MLQSERAHQPLSEEELNMLETFLASTEAPEECMNSIEMIDGFLTAVVIGP
EVVPEHRWIKYMLDPENQRENLFNSPEDESRITDLLNRHVNAIDAQFESE
PEGFLPIYEMFSYSEEEERQIAIEEWALGFILGMELSHEAWQPLFADEST
AMLAGPLFVLGKVTDDYDNMSQEEKDQMIDMLDESIIGIYAFWQQQAEEE
KGV
>CT0728 hypothetical protein
MQKIQDFSFFPCLFHYDVTMIFYRTKARKHNTTCKNAISPIT
>CT1696 hydrolase, haloacid dehalogenase-like family
MIEAILWDNDGLLVDSESLFFEMTRTFFAEAGLQVEAEYWGVEYLGNAKH
SYQIAAELGLAPELIPSLLDRRNEAFVQRLRHSVPLMPKVRETIEALAGT
VRLAIVTGSPRDKVLLMHGNNGLLDHFEVIVTDDEISNPKPHPEPYLKAM
EMLGVKPERCLAVEDSQRGLDSAVAAGLRCIAVPNALTKVQRFDRAHAVE
ADVSGVLKHVNATKRLAR
>CT1020 hypothetical protein
MKKVLSLLSMLVLTPSASLLLAEPAPAAPAASSSPLIEQAEAARKEADAL
GYEWRDTAKILDSAREALQRGDQAESDKLASKALFQARAAKAQAQFMDKN
WQMMIPKN
>CT0712 hypothetical protein
MTRRINPNRRSVTINGFYVTSSAPNETVNWTVSTGGGGTTAGPVPEPATV
MLLGIGGLLAGGRKLYESRKEEVAF
>CT0405 hypothetical protein
MINMNDSNFPARITPALSLQRQHFYAKRNHNCH
>CT1119 lipoprotein, putative
MKTVMLMLTSLLMAGCSVLGKREAAEPPYELLKHDGAFEVRRYGPMVIAE
TILDEKSYSAASGKGFNRLAGYIFGKNRSKTSISMTAPVLQERSSEKISM
TAPVLQQPQKGGWSMAFVLPEGFTLQSAPEPLDPEVKLRELPPSTIAVVT
FSGLHSAANLEKYSRQLQAWLKKQGYRALSEPKLASYDPPWTIPFLRRNE
VQIRIEPDHGESGKE
>CT0556 hypothetical protein
MQSKNRHNGRASLQVQIKSGFKIKFVFLSLK
>CT0867 heterodisulfide reductase, subunit A/hydrogenase, delta subunit, putative
MADDKKIGVYLCTDCGIGEALNVDELEAVAKKEFKVPVCKQHPNLCSNEG
VQIIKDDLNNGEVNKIVIAACSQRVNNDVFNFDPLTYVTERVNLREQVVW
TAPKGDDAKEGTQLMAADYLRMGITRVQKSEVPIPKIADVNRTVLVVGGG
VTGLTAALEASQTGYKVVLVEKAQELGGWAKKMHRVFPTKPPFAEIEQPA
IGNKIEAVKKDGNITVHTGTTIASIEGGPGEYTVTIDRNGTTETFPAGAI
VLAAGWKPYDASKLGHLGYGKHRNVVTNLEFEQNVAKTGGKVTRPSDGKP
AKNVVFLQCAGQRDKDHVPYCSTVCCNVSLKQAKYVRESDPDAGAFIVYK
DMRTTGLYENFYKSAQDDEGIFLTKGEILGLSEESDGSLVIEIDNQLLGR
KMKVKADLLVLATGMVSNMVPDGMSVNNLTPEYIGKMVQRETSDGVIEAL
EPESLILNLKYRQGPEMPHHKWGFPDSHFICFPYETRRTGIYSAGAVRHP
MDAVQASADATGAALKAIQCMELTAQGRAVHPRTWDRTYPEIRFESCTQC
RRCTVECPFGAYNEKADGTPLEFPSRCRRCGVCMGACPQRVISFRDYSVD
MISAMIKSIEVPDEGTFVIGFVCENDAYPAFDMVGLNRIGMNTNFRFIPL
RCLGGLNLAWIADALSRGVDGILLLGCKYGDDYQCHYVKGSQMANERLGK
VQETLDRLMLEAERVEQVQLAINEWDKLPGILEEFSKKIKDIGDNPYKGF
>CT0356 hypothetical protein
MAVFADMIKAEKSYGRVKVESAKVRKQRRILKDTDTIHRRPSG
>CT1566 conserved hypothetical protein
MQVIIVEDEKTLRFSPLADLKPVYDLVTGCFSLRQRFVEALGARQKLTWH
LRRHIAPWFAEANPGSVVNRVLEDEVLLVNGRLICDAAVTQFIDARKIAP
GEALIQNGNLLFCRTTAEPLHVLETVFPDTIDGMVLAGAFSCVEVSGFRL
IENLWDPVAMHPAMMREDGAALALGRIEGEVHPSAILVNPSAITVEKGAE
VKAGAVLDASDGFIYIGAGAVVEPVALLMENVYIAPGARVKSGARIYSNV
CIGGGAKAGGEIEDSIMEPFSNKQHDGFLGHSYISSWCNLGAGTDTSDLK
NNYSPVSIETAHGKMATGQQFLGLLMGEHSKCSIGTRFNTGTVVGISSNI
FGNGMPAKYVPSFSWGDGNPGTARYEADKAVETARKVMARRKVEMSAAYE
AMFRAVAGE
>CT0681 hypothetical protein
MCRLISTPRKMARRVQFRISQEVSIMAKDYLSIASEIKELEDLLAAIPED
NVIERLSLESRLESARAALTVLPQQIAPKARLTFRGRPVFGSHGIAADFG
SKAAGAFSDAFAAVCAGLSEGLRYMGPIPNRDENQLLITGTAIGSFGFEF
ELPAPDPSLFPPETEKTQEAMVKIEELFRLSAEGTDDEIAEVIEEVHPRA
IKKVYEFLELLVQQEAYCGLEFADRFFRFADYKQIKASCERLKSDNIQER
EETYRGEFQGVLPTARTFEFQVMDQKGPIKGKIDLTIADPDVLNREWLHK
PVTVKFNVMQVGQGRPRFTLMTLDDLRP
>CT1393 conserved hypothetical protein, truncation
MTSYKPEYLIPNLLDLVAEGEGLRIEFKRLIHSAPKIARSITAFANTSGG
VILIGVDDDRRIVGIQSEKEALQVIDEAMRFHIEPKPRIEVHFEEFKRRM
VLLVDIPKSPERPHFHIEPLIRRDTGKHGVERRVFIRDGSHNKAASDDRI
ELMLSSREPLKVAFTGRERCLLDWLNEHDRITAEEFADSAGIPMKEARRI
LVSLVRAGALRLDTANGDNSYTLAHR
>CT0446 transposase
MQSLSSHYHQLLGLPSNWEVENVNLSMSSRQVEIRLAFTGKQGECPICGQ
SCLIYDHAAEQRWRHLDTMQFETILVARLPRCQCKEHGVKTVQAPWAARH
SRFTLLFESFAVELLLHCANIKAASRLLRLNWHTVNQIMRRAVQRGLVRR
KTETVEYLGIDEKSFKAGQHDVTTLTDLGERRVLEVVEHRTTEATKELLA
SLNDSQQAGVKAVSVDMWKPFIHAVQELLPKADLVHDRFHISKYLNEAVD
LVRRKECRQLDKAGDKRLIGSTYVWLRNPENMGEQQQAELGKLMDAEFRT
GKAWSLKNMFRAFWQLGCADAGTFFFEYWSKRIDEVGLVPLTKVKELLQR
HFGNVLTWFKHPITNAVSEGLNSKIQIVKASARGFHRFESYRIRILFYCG
KLNMAIGS
>CT0624 conserved hypothetical protein
MLILSKLLPLLVLPPGICLLLAIAGLLFKRRSLVWISLALLWALSLPVVG
QALMHRTEEPWHRVPVEQVRKADAIVVLSGMLQQIHGAPLGEFGEAADRF
ESGIDLFKAGKAPVLVFTGGQMPWDPDCVPEGELLAARARLRGVPAGSIR
LTSTVANTAGEALATALLLGVSPGKPKRIILVTSAFHMQRALMLFMAAGF
EVEPCPVDFWATDLKSRTTLLDFIPSANAMNDSATALREMIGQIVYSSGM
RFMTLAR
>CT0341 hypothetical protein
MLHCLCVIFNRAIHPRVARGSCPETVKPLLAPYLLLFPCYIRHPKFPAST
LAASAQLVQACFLWFMKSIRESG
>CT1214 hypothetical protein
MGTRWFAKNTTMSKNSIVIGVDLDGVCADFYGRMRQIASEWFERPIDELP
EEVSWGLSEWGITNPSQYDSLHRFAVTQRELFSSMEAIPGARKYLRQLSD
EGFRIRIITHRLFIHYFHATAVQQTVNWLDSHGIPYWDLCFVKEKTQVGA
DIYIEDSPENVAQLRGRGLFTICFGNSTNRHIEELRAASWQDVYDMIKAF
VT
>CT2060 sensory box histidine kinase/response regulator
MLKGSIEALEKKIAMLENRASIAEQKADALKTALESARAGYWEWNVENGT
ILIDRQWAVISGYNSADFDMLTMEQWRELCHPADLGVVTKSIEELLDGSM
ERLELELRVRHKNGEWIRVLDCGKVTGRSKNGKPSCLTGSRQALASLHDK
PESPVKNVPEELQALVDNIPAAIYHLDVSGQATIRFRPPAFLKTLVSEHA
GTTRLNTLSMIHHDDRHMLSNAYSKLREAKHSLTLVYRIVTPEGKLHWIE
DHMRSSFSDDGLFSGIDGILCEVTDRIARLEKTRKLESQLSKSQRLETIG
TLAGGIAHDFNNILTPILGYAEMGLSSIDEDDPIHEYFAEIMQAAERAQK
LVSQILTFSKAEEGKTAPVSIQEVIDEVIRLMRPSLPSSVSIEEDIDSSC
HKVIADPAQMHQVVVNLCTNALHAMEQTGGVLKIVLREVSTGNGMPPIAP
ELPDGNYAELIISDTGTGMDSRAIERIFEPFFTTKSVEKGSGLGLSVVHG
IVTGANGHISVESTQGKGSAFHVWLPVIKSNTPDRLEQNPLLAKYAASVL
FIDDEPATVNLVTIMMTKLGYNIRAENSPVEALKFFREQPDQFDLVITDL
TMPEMTGIQLSSEIHKIAPSIPVILMTGYGKMIDHDMPLRHYGINHLLKK
PLKLAHLALAVKEVLSSTTNLLTEI
>CT2059 response regulator
MKTNDPILVIEDEDDIRQMICDILEEDGYATVQASNGNEGLQLLQKTPEI
RIVITDLLMPEKEGIAMISELREDFPWIKVVAISGGGILIPENYLNRAKA
VGSDATLCKPFESGELLSIIEELNR
>CT0167 conserved hypothetical protein
MPREEIPWFPVIKPELCNGCGDCKVLCKPGVFELGEPDPTGIHRPKLIVA
HPMNCLVLCDRCVPICTSGAIVLPKKEDFEKYVEYLD
>CT1889 hypothetical protein
MNQRLTAALLVGISVAFIAAEILVAVFGSFDQGWMVLFLSLYAGFVGLLF
GLSTLLEGRREEVESVSERRARARRDGLVGNLLDDYEIDEEFLGRGVRKP
RSKKPSPSSSSGASKERIPDDEELKAAVTAYAGMVGGIVTLRETIESMDD
SAFLSMARKAGMGGVTRERVLALVVEMVSAQGPTKSDESPALSLSIDKES
FDDYIKRCMTEPEVCIDDDATDSEGFSVGLDASDLSSRPGTPPTEFSHDP
KAVMERFKRSTEKR
>CT1980 hypothetical protein
MKRFDTNIAIHEKILLKKELFRQLRANSAYQRSNSFSFPGFFYNIVIF
>CT0627 hypothetical protein
MSEQMVVWWDSESEPERWRVEVQRKNSLRQYESV
>CT2047 AcrB/AcrD/AcrF family protein
MNEGIAGRLAKQFINSKITPLLMLASLLVGIMATFMTPREEEPQIVVPMV
DIYIPYPGATAAEVQERVAKPIERAVTEIKGVDYVYSTSMPDFALVTVRY
KVGDSAEESMVKLWATLMKYMDKMPPGVQMPLIKKVSIDDVPVLNLTFWS
KDKSPYELRRIAANVADQLKQTENIGDVEIKGGLKRQIRVQLDKQKLAHF
NITPLQIARQIQSTNSQMTSGDFKDFNENIVVKSGKFLQSKEDVGNIVVG
IYGASPVTLKDVATITDGPEEVNNYTEFGWGAASGEKDKSDYPAVTLTVA
KRQGTDATALANKVLARIDGMKGSLIPADVTVTETRNYGETASEKVFTLL
EHLVMAVVAVTIVVGFFLGWRGALVVFMSVPITFALTLLVYFLLHYTLNR
VTLFALIFVTGIVVDDSIIIAENIHRHFAMKRQPRLQAAITAISEVGNPT
ILATFTVIAAVLPMVFVSGLMGPYMSPMPIGASIAMIFSLLVALIATPWL
SLRLLKSDEGHHEEYDIKKTVYYRLFDRILTPFIDSTFKTWIAFGVVGLM
LAGAIAMIPLKMVQMKMLPFDNKNEFQVIIDMPEGTALEKTAQVAKEISG
YLKTVPEVKSIQYYAGVNAPINFNGLVRHYYLRRGDNMADIQVNLVHKSE
RSAQSHDIAKRVRAGVQEIANRYGANAKIVEIPPGPPVLSTIVAEIYGPD
QASRVALAKEVKKAFASTPGVVDVDWMVEADQKVYDLVINKEKAALRGVT
PEQIAQTLRMSLAGVDVGLVHMPDEIEPVAIQLRLPKADRTSLKDLSNIF
VQSQGMNSLTSRLRPGMQIPLSELVTVKEKIQDKSLFRKDLKDVVYVTAD
VAGVTESPVYAMLELDKKIEAIKVPGGYKISPLYTAPPKTENRLAMKWDG
EWQITYEVFRDLGTAFAVVLVVIYMLIIGWFQSFKTPLIMMISIPLSLIG
IIPGHFILHAFFTATSMIGMIALAGIMVRNSVLLIDFIQIRREEGIGLKQ
AVIESAAVRTRPIILTSGAVVIGSVVMLFDPIFQGLAISLIWGGVLSTIL
TLVVVPLVYYLSEKKGEKKRLENMKSA
>CT1062 hypothetical protein
MLNTFLESLLSRLKSAFDLEHPGSLPGEVVIVLITISFMLTFFYLLWRIV
WFSWLLYGFSLAYRNIRYMGVSVQSRERSRAFGIRQNGKLAGYGLIGLCR
SGFKIGPLFAAPLSLPKRFSARSKARFLKAH
>CT0180 lycopene cyclase, putative
MNVSNLHCIVIGAGIGGLAAGALLARQGMQVVVLEAQRYPGGCAATFTFG
AYRFDAGATVGCGFHPGGPLDLLGRELGIDWPVHAEPLAWQYRHRDLRLD
LLSSRDSIISRFPRSKPFWDEQARLARTLWRLSADALPWPVSNLQDVAGL
AGRALATLPDSAFLLSFMQRTALAWLASHGLDSDAEFVRFIDAQLMISVQ
TTSRHANALNAAIALDLPVAGTWRVEGGIGTVAQCLADSIGRDGGAVLYG
KKVIRLDTIKRGALGVETADGDALATDALVANLTPESLDILDEFRPESEQ
DAPPSDGWSAFMLYLGVDASLFKKAGADHLQIVAPEGELGECRSLFVSAS
PAGDAGRAPEGQRAVTVSTHTKPERWFEAMRQGREVYEALKQEYTDKVLA
LLYEQLPDARGAIRSMTAATPHSWQQWTGRHHGLVGGYAQTSLFGVRGPA
TKYDNLFLVGDSIFPGQSLPGVVTGARRTVELLLRRAGKLGRL
>CT1253 hydrolase, alpha/beta hydrolase fold family
MLHYKKHVIAEDAPWVVFVHGAGGSSAIWFLQIKEFVKHFNVLLVDLRGH
GRSKHITTSKEVRHYNFEVITRDIIEVLDDLQIQQAHFIGISLGTIIIRN
LGELAPERVASMVMGGAIIRLNVRAKVLVAVGNFFKSLVPYMWLYRFFAW
IIMPKARHRKSRIMFVNEAKKVAQKEFMRWFTLTYELNPLLKYFEEKDTG
IPTLYLMGDEDYMFLPAVKYIVKRHTNSYLEVISNSGHVCNIDQPQEFNS
RAIKFLCNVSLQSLPESVDTMPQLAAV
>CT1230 hypothetical protein
MVGVNLFQEGTNYDRLHTVSYRLPFFACSTSPVLFSVIIFPLAF
>CT1697 BchE/P-methylase family
MAYYKNICFVEAPQAMVTPFPRYISDCIGVCYLAAAVEDIVESMAMPENY
YNDGIFESFEKLLKSRPFDLVAISSMTGGFNNAERLARIAKKHGVTVVMG
GFHPTALPEQVLDLGCVDLVVIGEGEATFRELVEKGPSRDVKGLCWKENG
VFVHTGIRELIKDVDSIRFPLRSLRPKRFGETSENYTIDTIYTSRGCPWT
CSFCANDKMHKHWRGRSAENVVDEIAQLHDRKKKKLLKIWDANFLTNIRR
VEKICDLMIERGLTNFRIVTETRAKDVIRAERILPKLRKIGLSKVGLGIE
SPNPKTLELMNKENSLAEVTTAINLLNQYGVGSEGYFIIGHYSESVEDTM
PYPKFARALGLRQTLFMAMTPYPGTRIFDEYARENNVTSFDWDLYNNFVP
VIRTAHMENRDIMRMMVYGNVAFCDYRSVLKRDDNRGVLLTFLKDLFQLV
FLLKVNKTLDRKEASQIVFEAFELWLGTNPALDYTRTPPPPPLKESTGVC
LEHAASGKRIDFVVEQAGDRRVLKLRKVAASEPAPFPAVSLDEVVDAAFS
LSMDALMRLLYRNELMRNNPRLTPRQVLALFADRDIRQVAMRFWRLYRGC
FK
>CT0042 conserved hypothetical protein
MDYQDYHEPVLARESVSLLVTLPGIYIDGTLGGGGHSLQLLRHLQTLNGD
SLLVGIDQDTHALEAAGKKLGEFGDRRRLVRGNFGMVKELVEPMMRGAGN
GLPVMGFLLDLGVSSFQIDTPERGFSYLKDGPLDMRMDPDGPLTAADIVN
GYEEAALAKLFFRYGEEPLGGRIARAIVSARSSAPLTTTGELAEVIRAAC
PRKDLAIKTLSRIYQALRIEVNDELGVLEQALEDGFELLSSGGRFAVISY
HSLEDRIVKRFFAAKTSADWGPKGLALREPLKPAEAVLVVRKPLEATAEE
ISRNPRSRSARLRVIQKL
>CT0285 GTP-binding protein
MSLRCGIVGLPNVGKSTLFNAITAKQAEAANYPFCTIEPNVGTVLVPDPR
LSELARVVKTPVIVPAVLEIVDIAGLVRGASKGEGLGNQFLSHIREVDAI
IHVVRCFEDPNIIHVEGKIDPAGDIATIETELMLADLDSMEKRIDKLRKG
ARKEKDQQALVDLAEKIVAGLGEGVPVRSILENDEERAMAKQFFLITAKP
VLFAANVAETDLPDGNEHTATVAKIAAENGSKMLIISAKAEADIAELPEE
ERPDFLESLGLEMSGLDRLIMAAYNLLGLHNYFTAGVKEVHAWTIRKGAA
APEAAAAIHSDFEKGFIRAEVMAYEDLITLGSEQKVKEAGKMRSEGKEYV
VKDGDVITFRFNV
>CT0182 conserved hypothetical protein
MNKYFYDGTPEGLVSAIGAILESGDDPEQTVLSIRQDTLFEEGLFLRTDS
AVAEALFQRLRERAPDAVQTLWYFTMTEVDGLATSLLRYIALAFEHGDQV
NGYLTHPDVKAVVATARKVGRELHRMKGLLRFEQLRDGTWLARMEPDHNV
IQPLARHFSRRLRTQEWFIYDARRHSAAHWDGHALSFGTLERFSRPELSP
EERVMQQLWQTFFKTIAIPERKNPRLQQSNMPAKYWKYLTEKQGE
>CT0174 hypothetical protein
MKYPILVFLLILSFLMTSYKITYSEESLSPEYISSTGEEFVFATTYPGGE
KYGYPLWHKWPMVLGGELSYQDYVGRKGKLEDRIIYEPSGISKFRKAVME
NGEVLYLDVAGNIPPNGIYFQLATLNLSE
>CT0240 N utilization substance protein A, putative
MPRKQLKGETHDQKAQIASAFGEIEQSKIFLDKRSESAAVKMDIADLLKE
IIQKQLRKDYDPEVESNIFINPERGDFEVYILKKIVKEVDLPTIEIGLDE
VRQIDDSLELGDYYEEGPIKLDDYLTRKSIQIIKQSVQKKVRDLERLAVY
EECLEKVGEVVAGEVYQVRPNEVIFTYNTSKDHRVELVLPKSEMMKKDNP
RRTPRMKLYVKRIEREKVKVRNDDGTVEEREKPDGGMKVIVSRVDDRFLY
KLFESEVPEILDGLIVIKGIARVPGERAKVAVESTSSRIDPVGATVGYRG
KRIQSIVKELNNENIDVIYYTDEPQVFIARALQPAKIDPMTVHADMKTRK
ARVMLKPDQIKYAIGKNGNNIHLAEKLTGYEIDVYRDVIDKSLEDPNDID
IIEFREEFGDDMIYQLLDGGLDTAKKILKGGVEKIEEALLGTQKSEELFV
FNKTRKPVKPKERRITDDEKRYWKKIAETIYRTVREQFNDEDLKALLDDS
HERSLKGDGIDPEEIRDKETN
>CT1385 hypothetical protein
MMEWSSNGSSGIDSDATALRDRQITANLLAIAILEPLFYFAVTFLITYSG
HRKLHIFAAPAHKNELANAKIRPFCSQGNRRPG
>CT1107 hypothetical protein
MNSRFISMKKQGISEMFTIPMFCRFAYLSA
>CT0474 hydrogenase, putative
MYRIVSAIFLAENIKKFEIEAPMIAKKRKAGQFVIIRIDQNGERIPLTIA
DSDPEKGTITIIAQGVGKSTRELNCKEAGDSIADVVGPLGTPSHIENFGT
AISIGGGVGTAIAYPTAAALKEAGNYVITINGARSKDLVILEDEMKAVSD
EAYITTDDGSYGFHGFVTQKLQELIDSGKKIDFVLAIGPIPMMKAVADVT
RPYGIKTVVSLNPIMIDGTGMCGGCRVTVGGEIKFGCIDGPEFDAHQVDF
KNLADRNRIYQHEEKEANDTFAHECNLTRQA
>CT1445 hypothetical protein
MGKRQIIYTASQIGGARELLDKEINLVTKERRVWHGYVTAIDQDKIELRD
SRFWKHTFKVADIDKIYGEVVTDY
>CT0620 transposase, internal deletion
MTRKKDKTPDIQGELIGQLGYPKHLPIVPNTANSRNEHGTSVRDMQAMLL
WSSTRSKSPKRSSAA
>CT1637 conserved hypothetical protein
MPFEKYTVPVSGRSDAAIEEELTRLRAELAAEYDLREVVHRFGESDFTFL
SVLDSYALLDRIDPAAFVKDEQMPYWAEIWPAAVTLSRQIVETGELAGKS
VLELGAGVGMASIAAARSGARVLCTDYSTEALRFVAYNAMKNRVPLDTAR
LDWRMVKGAEKFDAVIAADVLYERVNLLPIVTAIDALLAPGGAAYIADPR
RRLADQFLELVHENGFEVAETRMFDAEGDQTVAVTIYKLQRLKA
>CT2044 CBS domain protein
MDQLITLRTLPASALMQKDYHTIKGSSTVAEALQLMKKTGESGLVVEPRN
EDDCYGIVTEKDILEKVIDPGEDLHRDPWNTPVFQIMSKPIISINPSMRI
KYALRLMKRTNVRRLTVMEGNKVIGVLNMTDVLHAVEELPVHDDHIAL
>CT1677 hydrolase, isochorismatase family
MITPKETLLLVIDIQEKLAPAVFQSDRVIKNTGKLIRACKLLGVPVVHTE
QYPKGLGRTVDELGVLIGDDLPFEKLSFSCCGNEEFMKRLRVLGCNDILV
VGMETHVCVYQTCVELLEFGYNVHLVTDGVSSRTEENRALGIRCIERAGA
VPTSTEMAMFELLRVAEGDTFKAISKIVKED
>CT0633 ExbB/TolQ family protein
MLEHASGGIVSMIAGSGPVVVMVLLVLILFSIISWAIIAWKFGFVRKSMN
ESQDFLDSFFDLRNADKLFAASENYRNSSLARVFRAGYIEYSGMRDRATA
SNVKERIMLAVKREVNAESKRLTHMVPFLATVGNTAPFIGLFGTVWGIMS
SFQNIGLMKSASLAAVAPGISEALVATATGLVAAIPAVMGYNYLAHQIGQ
IERDMEDFALDFVDSYLNQ
>CT0206 DNA polymerase, bacteriophage-type
MITQESLFDPAPEEAAQPNSPLEGLGELCRITVECRKCRLAQTRKNVVFG
EGNPQAGLFVIGEAPGADEDAQGRPFVGRSGQLLDKILLAIGFERQDVYI
GNIIKCRPPENRNPLADEIDCCKPWLMQQLGIIKPKVLLLLGKVAANTIL
ENTQSMGLMRGRIIKWKGFDCVVTYHPAALLRNPNWKRLCWEDVQMLRAH
YDKVCPNG
>CT0506 hypothetical protein
MLSCAVSKQFLRREKNFERKLSGPRINKQEGNKEASEKRNKKTKSK
>CT1334 conserved hypothetical protein
MGVLEAICVSVGKGTVKLAVPSAELRESWGIEGDAHAGEWHRQISLLAGE
SIDLVREVLPELDYGMFGENLVTRGIELAALAVGDRLRVGDAALLEVTQI
GKKCHNGGCVIQQATGDCIMPREGIFCRVLAGGHVAPGSPVVVVSDRSAG
SD
>CT1596 glycogen phosphorylase family protein
MSKSISVVKKLSRNLWWSWDTEAKSLFKNLSPLLWERVNHNPVELLRHIS
ADELEARCACELSSTIDDVNKRFEDYMAEKNTWAAQHAPQLVANPVAYFS
AEFGIHESLRLYSGGLGILSGDHIKSASDLGLNFIGISLFYKEGYFRQYL
NHDGWQIENYPLQYPESLPIEKVTTADGKDLIIEVIIAQSTVFAQAWSLK
VGRATLYLLDTNIPANEMHYRDICSRVYGGDQNMRINQEILLGIGGIKLL
KALKVEPAVYHMNEGHSAFLTLELLADELAEGRSFEEAKAKVKEQCVFTT
HTPVPAGHDRFSRDMMLYTFSKYVALCGMEMEEVLRLGAENPADTSGLFT
MTVLALRLSRCANGVSKLHGEVSREMWKHLYATDDPKEVPIGHITNGIHT
KSWSSGFTERFWKHHAKSLDELFASRESAEAVLSKVSDSELWCLRYQLKR
NLIDYIARYLSNQLYHQHQYATPYKPCDSSKSAKSALSPDVLTIGFARRF
ATYKRAPLIFQDLDRLNAIVNHPTMPVQIIFAGKAHPHDDAGKEYIRQIV
EHSHRPQFLGKVVFLENYNIGVAKRLVSGVDVWLNNPVRPMEASGTSGQK
TVLHGGLNLSIMDGWWCEAYNGTNGWSIGSGEQYENYDEQYRFDANKLYE
VLENQIIPTFYERNEHNLPVRWLKNIRNAIATIAPVYNTDRMVKEYTMRY
YLKEPRP
>CT1158 hypothetical protein
MSKKMICYCSSVTEETIVSAIRRGATTLKKIQDTTGACTVNRCKELNPSG
RCCSGDILDLIERETGSRPSSPCCCE
>CT1084 conserved hypothetical protein
MHFVITGSLGHIGKPLAIQLSKEGHAVTVVSSNTERIKEIESIGAKAAIG
SVEDPGFLTETFRGADAVFTMVPPNMNAADWKAWIAGIGKNYVAAIKESG
VKHVVNLSSIGAHMVDKCGPVSGLSQVELAMDEMKGVNVHHLRAGYFYTN
FLNSIGQIKNQGIITGNYMPDLKMVLVHPADIADIAARALKDESYLGRGF
SYIASDEKTPAEIVTILGKATGRMDLKWVERTDKEEYDDLIAIGLPEEIA
KNYTEMGAALRSGDMNADYFRNRPVMAGWRTFDSFMPEFIATYNA
>CT1974 CRISPR-associated protein, CT1974 family
MIATMLTLSRKDVKALRITDSYSLHRVIYSLFEDVRSEAEKRSSVPSGFL
FADKGGDAKGRKILILSDRPPLQPAHGELVSRPVPEEFLQHRFYKFEVTL
NPTRKENKSGKRVPIKTREEVAAWFGGKSQTSWGFSVDPARLDVRMLPVM
QFSKQGDRTVTHGAARVSGMLRVENRDLFIESFNKGIGRGRAFGFGLLQI
EPLKDNSNH
>CT1877 hypothetical protein
MEHLNQLPVPALHRFERVPEGTTLGSYRDTETPLLVIDNEERDVTSRSLD
FSGQRIITKLPVVPSMFIRNVRVVMQDCLGYIMEPYKVKEKV
>CT0195 sensor histidine kinase
MDGRKTAMTPSLQRRLTAMLAGAIVLAALVAAATSFYFSYREAKEFQDDM
LRQIAVLQSRGEWETVGAGALSVPSSSASLSDPESRITIFHFTDRSAPRW
LAGHLAPGFHTLRQEGEPVRVFIVHGPSAMTAVAQPTDTRDELALNSALR
TLAPGLLLLPLMALLVMRIVRSALKPVTTLSRLLDQQQPGQPVPLPERGI
PEEIMPFVEALNRLLGRISLLLGQQRRFIADAAHELRSPLTALSLQAENL
RQASSADQMRARLVPLQSGIERARQLTAQLLDLARVQAHPSGAAPVDLPA
LARELIAGYLPLAEARGIDLGLDDSGPVLLDGDPESFTLILKNGLENALK
YVQPGGTVTVRLAVEHGDGIAEVIDNGPGIPESERERVFDAFYRTPGSGQ
PGSGLGLTIAREAARSLGGDLRVLSGPDGQGTVFRYRQRGREAG
>CT0447 hypothetical protein
MPGKNRDSKQWDRQIEQPQEEELRLQKRHAEHGAWLFLLAVLLLVAGIRY
HLLNVPMERDEGEYAYGAQLMLQGLLPYEHLYSMKLPGIYGAYALVLSIF
GQTHTGVHAGLLLINAITSILIFLLARRLINPLTGLAAAGSFAVLSVSPS
VQGVFANAEHFVIVPAVAGFLILLIALEKRRWWLFFAAGLLLGLGFVIKQ
HGFAFILAGIVIFFTQYPGGRPFSWKPLGWHALALCSGILLPYAVVCIIF
LASGSFAQFWFWTFKYPRAYISELSFKDGLYNFLNNFGLILRDSWPLWFL
AYIGLRPPIWNKDERKSRTLLLILSLFSALRRISWKDVAVMGNLA
>CT1578 integrase/recombinase-related protein
MGRLFCIQNKVRGNPMLQNIISSAKALKPVTLHWLRHSYATHLLESGTDL
RYIQELLGHKSSKTTEIYTHVSQKSLQKIKSPFDDL
>CT1790.1 LemA family protein
MIRYISRVFPFLLALVMLSGCGYNTMQQNEEAVNRAWGDLESQLQRRADL
VPNLVATVKGAANFEKETLQAVIEARAKATSIQLTPEMLSDPDAMSKFQS
AQGELSSSLSRLLVAVERYPELKATQNFRDLQVQLEGTENRISVARQRYN
EAVQVFNSSIRVFPNSLTNSLVLKLKPKAYFKADEAARAVPKVSF
>CT0729 Nudix/MutT family protein
MANGTVKIAEQSGVLPIAGDKIVLITARGSGRWIIPKGYIEKGMSPAESA
AKEAWEEAGIVGSVRHEEIGTYSYRRPSGIFSVRIYPLEVESLLEQWDEM
HVRQRRLVTPSEAIEMICLKELRSLITDYLIKRFDF
>CT1698 conserved hypothetical protein, truncation
MHNLGGLAAGYAVATVARCDVIDRRTIAIEIGMQNSGLGVTLANQFFQPL
AALPGALFSLWHNLSGIALARHWSRKATFVASEA
>CT1261 ferredoxin, 4Fe-4S
MALYITEECTYCGACEPECPTNAISAGSEIYVIDAASCNECAGFADSPAC
VAVCPAECIVQG
>CT0882 hypothetical protein
MLPEWLPQKPAQGRYDTIAIPSIFNSGCCIGKRGITAIEIEDGMIRLVYW
TRDVQKYRYSGERLHTVEELGNSGIYRAVLNEDYLDYVFSRIRLLA
>CT1967 hypothetical protein
MKKNRNTVIGIFSIVFGGMIYVLWRKQSLLMFSWFYSIGLKPFVALLREF
AHSYSLVMPEWVYFSLPNALWAFGGILLFYSIWKDACSERMFWVLLFSMT
AVGSEIGQLVGIVPGTYDTTDISLMLIFIPLAITIGNKNHAIKEVSDAKG
L
>CT1205 conserved hypothetical protein
MNSTNANPVFSDRSAKMKKVLIFLLFLLIIAIVAVDSFLLPYYTESGSQT
TVPNVTNMTYDAAKRELRRVGLNAMKSYNVRYLPDVPPDRVIDQVPEPGS
IVKPGRSIILVLNRLDKPSYPVPDLVGRTEAEARTELERLGMVVAEVQTQ
VVSDADQDGRVLSQSIPPDVVLKSGSQVSFIVGKLEQEPTGMRLVIVPDV
LGMSVDQARSVIVKDGLLLGKISYENSDLLVPGTVVSQKPSANAMVQFGR
PVDITVVGNPN
>CT1940 conserved hypothetical protein
MANISLYISGQTELDDVSEFFQKRLIGKGETPIAFFDGVFYESHQERVGN
IVYQDYLIYTDKALYLWARGASKDFLDRFSLGAVSVNSRNKDSAFATMNL
KIRREGKEPIYVIFDMVEIREADTIVRLQTLVESIIEDYLGINYRQEIPQ
DTADRIFQAARTLCPPRQIALQLDTPQAPTPDAGIGYGQDLLEQYRASAG
YEQPQMPYPPYQPSGTGGASPRSMGSQDAMRGLESMLPADPASLKRIAEQ
IKNMVGDAPFKMRDQVMKDLQHVPGDMATVLNALNELLSNIAGNPMAERF
VMNAIKTAVANDGVLGSLSKIIKMTGFGSGGGKKSSRPPANESPREERST
AKERKSPFVDEEPDEDGSTIRRKKISIKDDDNHTGADLFAGNDEPITSRE
SRPPVSRSSVPDDSDSGAEPGGRRKKLSIKMEDDNEIARKLMSYDEPERE
APSPAPVEASSAVSTENGSGIRRKKIAIKAEEGSGAEADIARKLMDYDEA
SRSAVTSALSGEVPGSVSSEEPSEPEPLRRKKIQIKADVEQESEPIVKAE
PEEPTRKKIQIKTETEPELPVVSPELEIKSMLEPEPQLEVPVQKASQIEA
EPEEEIDISEEVIFSALGEIEEMEGSSISQEYMIIESDIPVRRAVSEPVK
EPVSEPVIEKKPEQVRHSSGKVSHGKRRGR
>CT0763 ferrochelatase, putative
MVACTVSKRRYSVVLVTYGEVEKLSVRALWPSSRKIIEVITRKIVPLPKL
LLYMIADYRSAKHYLDWKLHCYRSRLVEINRRQALGVEAALRRNPRFAGE
NDEIEVADAYYFVKPYLEEVLDRFWSRSYGIVVVPMIPVESAFSCGPACQ
MVTDHFGENHLGLVRVVRGLWKDDELHRLYLDHLFSSLPAAWQGAKGGEK
QGLVLVIHGTIVKNRKGEKPSVFTGLDETMEFFRLMKEKIMADPRNVFGE
VKLGCLNHSRGGEWTPETVELALDEFAREGFRSVAMFPFGYFADNSETDY
EAKKLLDRSSADRKHYIPCVNDSPVFAAWLASRIATEIERLDVQREVFSD
GPSRK
>CT0872 lipoprotein, putative
MENILDKLGIELNEQTRLTNDSKFTFNCHSGLSCFNTCCSNLDIVLTPYD
ILRMKNRLGLTSVEFISEYTEPVIQKESKLPFLKLKLGDEGKCRFVAAEG
CTVYGDRPAACRYYPLGFGIYKNEEAGDDFYFLIREDHCKGFEEEREWTV
GEWRKNQGVDEYDDKNRVWMEVILNKKLYSPELEPDEKSLKMFFMASYDL
DAFRDFVFESRFLDVFDIEEERLEMIKNDEAELMLFAHQWLLFALFKKPT
MNLKQV
>CT0815 copper-transporting ATPase, E1-E2 family
MTTKTYSVKGMHCASCAAIITKKLSKVEGIEKADVNLATEKAQLEFTGEA
VTDDALNELLGKYGYGLVSEQPPATPSAPFVDRKAAAEKAKMEKEMELQA
QLAKVQLALPIALMVFILMMWDIAARSFPSMPGMPIPMALFNIVSMALAT
YMVFRIGAPFLHGIVMFVQSGAASMDTLIGIGTLTAYLYSTVITLMPEVR
ELLRVPNYTYFDTTIVVIGFVLLGKYLEARSKLKTGEAIEKLIGLQAKSA
IVRRGGAEVEVPLEQVQKGDLIVVRPGTKVPVDGTITEGSSSIGESMVTG
EPVPVDKTVGDPVIGGTINRQGAFVFTASRVGEETVLARIIAMVEEAQGS
KAPIQNIADRIAAIFVPAVLVIAALTFILWLTVGSLFIDFSTALSFGIMG
MVGILVIACPCALGLATPTAIIVGIGKGAEYGILIRNAQSLETLSTVDTV
VFDKTGTITTGTPSVTDVLPFDDTVDPEKILQLAASIENRSEHPLAQAIL
AAAKEKGIDLHEVTGFEALEGIGVKGTIDGKNISVRKPDSGDSSRSRIAE
LQQQGKTVVILEEEGAPLGLVALSDTIKTEAADAVKALHAKGVKVIMLTG
DNKLAANFMARQAGIDKVIAEVLPQEKADRVRELQQQGRKVAMAGDGIND
APALALADVGIAMATGTDIAIESAGVTILQGGIRKVAQSITLARATMRVI
RQNLFWAFIYNVIGIPLAAGALYPFFGIFLNPVFSGVAMAGSSVSVVTNS
LRLKAKKL
>CT1515 hypothetical protein
MLLTPSEKITREYNRLWIVSIARILLALAIGFTLYEATIHHPILPPRMDY
GDKVLHAAAFFALTMLTEISFPGLKSLLPKLLFLLGFGIFIEWIQSFLPW
RSSDVSDFLADCAGIALCFVPVLLTRLTLRLSDH
>CT1425 hemX protein, putative
MANIFIENSLLFALTQIVPLLYIVTTALYGIHFFKETPLAGSLKQPALIL
TVVTHVADLGLLTSTAGYRLSYSAYNLMSMVALTLAITYMFIEFTTKSDK
TGFFVIAFAAGSALFSSILSSQTVDSGLAFSGLGIGVHLFAAIFGFSSVA
IAGLYSGMYLVLFRQIRLNRFGLLFQRLPNLETLEMLIMHAVAFGFFFLS
VTIVAGVLEQHASKEVINLFEPRLVSLFVIWALYGISLVIKPLFGWDIKH
MAVLLIALFVLVTALLFIMSLVTPSFHGMSI
>CT0959 hypothetical protein
MSKKFDVKEQARDILEENLDMEAVIYLGRISEEMEQIFISNPDPSFADVQ
RIVNEYFTTDGRPAAFIEDWLRTADEHTRSRGLDETERPRAILSDLGVFR
FMWFLKERGLTEEQINIVLTGAVQQATGQQGE
>CT2204 hypothetical protein
MRKVLILTRNFPPYVTGSASRVWKFASNFTTIGWEPVVVAPPAIAGMAGG
VKSGANPVNEVCRIAPDLDAGELDAAGRCALLHGQEVPGTTGLFKGRQAK
SFKSATDGELWLKNATATVERLLEEDQEIELLYAQGPPLEPLWLAFDIAR
KHSLNIMLDITEPLDPSMPRPGASRSSAAAKAEEQVMLSGVPILTPNRLL
KEYFLKKYHGKLDNNLVTIVPPAFDPSHPAFRRQEPNRSGTVLRIALLVA
DLPVANLKALVSGLEAWIQADGIIAGGIEIVLLGDGVPELLRRTAKKRVR
KLFTVDATGGIDSELEECRKTAFFCAVLGNSAASASIVPDRLVDALGMGL
PLCVIAPDGEASRLVLDAGGMSAPAGDAKAIMELFRAMASAWSFGNLHST
PDWLREKHAIGTVMHELTRAIASQPLH
>CT0904 hypothetical protein
MFFQFLLSMSLKKAGLARLLKVFNALTQQKCRAIAQEISAQV
>CT0905 hypothetical protein
MKKQVLFMPTIRALSNSYRFFFSGDASDKDIEALLSRLRAFKEKEFALKN
QIAENPLDNLFHETGRAFSADKIFDEEERINLKPATGEAIVKELEKYNLS
DTSEDLKGVAFEPFPGRTFRGEIGRFFTPRTIVRHTQAGPCGEQRRGHTR
RSPVHGLAQRLRIDLASTEVSAP
>CT1909 hypothetical protein
MTEKIYTEENGEYLKNNPTWHVEDSPWKAKQILKMLNSNPINPKSIAEIG
CGAGEILNQLHLSMPNYVSFTGYDISSDAIRLAKTREKERLEFKHENFLE
TSARFDLLLMIDVFEHVDDYLGFLKLSKSKAKNTIFHIPLDISVQAILRN
KLMSRLRYIDG
>CT1274 hypothetical protein
MAYIMATSLIFIYSTDSGAVSTLLDTGHKMLSPSTTVGMEVAKTS
>CT2074 hypothetical protein
MKSKLSEEKDKWDSSDVVDIGQEKSCYHKGFTGKPVTAR
>CT0702 hypothetical protein
MPVFLTVTLFNQAMQAALCLAQAEAAQATANADNAKARRMEKFKELKKMG
LSNNQLDNFIP
>CT0471 hypothetical protein
MPGDSTNFIFMLSPTTSTFDMQKKATLLVSALMLSSTPLFAAMPLVTDDT
GTQGAGHGQIEIGFESTSDKETEAGVSCKETGGAISATFSYGLTDNIDLV
VGLPWEWDTVKENGLKVADENGIGDLALQIKWRFYELPDSGFNLAIKPGL
TIPTGDENKGFGTGKVSGDVTLIATREAKLATFHVNLGYSRNAYKLDEIS
ESSRKNIWHASMATELNVTDKLRAVGDIGIETNSDKDSDTDPAWILGGLI
YAVNDNTDLDIGIKGGLNDAETDTTLLAGVTMRF
>CT0724 hypothetical protein
MVMDGVNGLFSRVMRVFCGKTLPDNNNNHL
>CT0235 PTS system, IIA component
MRPHSAMLSSTMKIEALLTEKHINLNLGSSSKDEVIDTLIAMVADHDKVR
DLKQLAEDVRKREREMSTGIGKNIGLPHAKTSAVTEPVLALATLSNEVDF
ESIDNQPVKIVFLLATPETMLAEHLKLLGRITRLAGRDDVRRKLIDAATP
ADVLELFKQEERDLPQI
>CT1631 peptide ABC transporter, permease protein
MLRYIFKRLLIAVPLIFGVLTLTFFIIRLAPGDPAAFFIQPGISPNVAEQ
IRQQYGLNDPLPVQYIKWLGNVLHGDFGRSFSRAQQPVFDVIAEALPVTM
TIAVLTLIANFVFGIIIGVISAVKQNSFLDRFLTVTALFFYSMPEFWLAL
MMIILFALKWPLFPVSGLNEIGAESYGTFGFIMDRIWHLALPVTVLSING
SAGIARYVRGSMLEVIRQDYVRTARAKGLSERVVILKHALRNALLPVVTL
MGSSLPFIFSGALFIEVIFAFPGMGRVTVEAIFARDYPLIIANTFVSGTM
IVFGNLLADVLYAVVDPRIKL
>CT1629 TPR domain protein
MASVSGNSTAPDSAAVVTVQDPLSDYVQGLFLDMKGDYWGAIDIFRKVLV
RKPADPAVHYSISQSYYRLAVLDSARVYGEAAVRLDPSNRYYLRYLAVVA
HDMRDYDRAAELYGQASLLEPDRTEIMYLQGLEYMAAKRLEPALEVFRKA
VRIDPYNEAAFAQTLALEIALKHYPEAIDTSKQLLKLGGNERKIGITLAE
LYTKTGQETLAVQTLQELIAGDRSNITYWIALFDHYIMVGRNDDFHRELA
VFLERASLPPESLHDLAKLYILRSGKDSLYVAPTIALLDELTARRPRDSE
LFMLKGMFGMMHGHQQEGVVLFRKAVQLDSSNATAWEYLISTQLDLGQKR
QAFALLAKARRRLPGQRFRWSVLEGSLLLSSHKLRRAVAVLETVAGTKRK
PGDPNLLIQANINLAMACDLLGMKKRSRSAYERVLDLDPHNTLAMNNLAY
LFTEEGITLRKALRLATNAVMLEPENGVYLDTLGWVHYKLGNFELARQFL
EKAAATGLDEPDIYRHLGEVYRKLGNEPKAREMLEKARTVEKTQGNKKSG
H
>CT1380 hypothetical protein
MKIMADTALLQSIVKLTRPLFIILSLALCGCIQMHTTVHVRKDGSGTIEE
KMLFSEMLSGIMKEKGEGLPALPKKDQLREMSAEFGPDVKVVNVKKVENS
SGSGFIVTYAFDDIEKVRIGNVQKMSKKLTADSTAVKSDSTVVQKPETWF
TFTMKRGANPELTINKEAMLNSSSRGEVAKKPVSTQEKEQMLDMISAFLK
GMKLEIDVVVDGRVISSDASYRADNTITLYAMDFDQLMTHRDILTGKYDG
LSDRDFARRSGKDSGLKFEFKDKVHVIFN
>CT1440 moaA/nifB/pqqE family protein
MTYVFGPVSSKRLGQSLGVDLLPSKSCTWNCIYCQLGRTTAFVTERREFF
PKEEILSEILETVASGKPIDWITFVGSGETTLYKGLDWLIAEVKKISKIP
VAVITNGSLLSDPEVRRELLEADAVLPSLNAGSPELFERIDRPAPGFTFE
KHVEGLRLFRQEYRGKLWVEVMLIRGVNDSEEALKEMAAVLAEIRPDMIH
LVMPTRPAPESFVGIPDEELIQRAVFVLSSSAPVLHPAKGEMNLGSVGNL
LDTVAGIASRHPVQERELEAALGKLFDGDAAKIRETMGELLASGRFEKVR
QGSELYWIVKSA
>CT0669 ABC transporter, permease protein
MISLKQISVGAALAVLILYGALIVSLAWFLNGTTLRETLLSDRTLFSVNL
SLMAATVATALALLLAIPAAYALSRFNFMGKGAAETILEFPIIVSPAALG
AIILIFFNNPLGEWVQTHVMYFVFTFAGIVLAQFVTILGLAVRMLKTAFD
EVPAELETVARTLGGSPRHVFFTVTLPLARNGLIAAFILTWAKALGEFGA
TLMVAGSMAMRTETLPIAIFMRLSSADIEGTVALILVLVGIGLTALYTAR
RLLRMNTHV
>CT1344 conserved hypothetical protein
MDIAKLLTTYKHIAIVGISEKPDRASHAVARYLIHAGYTIYPVNPTLSSV
LGLECWPSLSDIPAEKRERIEIVNIFRKPQDVPPVVDEAIAIGAKTIWMQ
LGITNEAAAEKARKAGLDVVQNRCISVEHMHLVS
>CT0871 hypothetical protein
MFLFRECNHDNSSREFLQQHMTNIPPFFPVPGQEKRQTKPPK
>CT1420 peptide ABC transporter, periplasmic peptide-binding protein, putative
MTAGQPINRISDLTAMHKSLLSIAALLLVTLFAGCGPKSQKSRPDTIVVA
VSADFDHLNPLLIQMSLAREVCTMIYPQLVKPSFDEKSGAISYQPNAAER
WEFSPDGRNVTFHLNSKAVWEDGKPLTSKDFSFSYRLYADPNVASSRQDY
LYDLLLKPDGSVDFDKAVETPDDKTLVLHFNKPMAENIVLDHFNDLMPVA
EHLFRDIKPQQIRQQAATLPIMGAGPFKVKEWQRQSKLVLESNPTSVLPR
PAASPALTFMVVPEYTTRLAMLKSGQIDALVSAGGINPKDAAELAKSNPE
IAIIPVADRYFDSVVWLNIDGEAWRNGKKIKPNRFFGDRRVRQAMTYAID
RQAIIDGFMGPKHATIVNTTLSPAYTSIIDTSLPAYAYNPDKALALLKEA
GWTPGPDGILQKNGQKFSFELAAPTGNPRRNYAATIIQQNLRKIGIDCRL
RFDESLMFNKNQNEYRYDAALSGLAAETLPFQLVIWGSDFEKKPFNSSAF
QNAELDRVVARLTGPLPQTEQAKLWKEYQQILANEQPRTFLYYYDELEGF
NKRVENVNLSLLATLGNMYEWKVGKKK
>CT0559 hypothetical protein
MGLKWIFTALFSVLFRPEVFWSDARERFREVNAMKDYAAPVIAIVQFVKL
PFIGTPRMAMLLAIISFTIDVAVLYLLTGVMDSVAEAERSAPVQHEIMTA
LSFSLTPIWLAEPFGFAGTWRWLFIAAALAYTVFISRTGLQAMLGSDESG
VEAFSGKSAFLVGAMAMISSLLQNGLIRFFISI
>CT0431.1 conserved hypothetical protein
MARLRSIKRLHSQRGVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQN
AADAAALAGAGSLIDTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQE
QDVNVAIQPGYWNLITPSFTSNTGLVTHTGDGNIPAVQVTITLSHLKFFF
APILGIPEGTVQATAIAAVSPPTGGTGLFPMAIGGCLFNLFWDSVHNTPK
LDPATGQPYEIQVYSVYSGGAGASCDSGQWTSFQTDANNVPFIRDLIKNG
NSIPLSIGDSIWIQPGTEATVYDSVPTNVDVAVPVVDNVATHSSQTVIAI
AGFHITGVVKHGNKSVVTGHLIPQSMVPSLHPGNGTGIPYGAYTPPFLVK
>CT1658 hypothetical protein
MRPAHIIRILPLCTALLFAGCGGNSTPQSETGGGNAAVQQQASKIIEPCQ
LITQDEAAKLLGEAVKPAEKSEKQVVGMKLCMYKPASGEPIPFLQVSLTQ
DACFPPGGAGAASIYKSLKDNFEGMRTDIDGVGDEAFIASGGIYIMADGY
YIQIGAGNTSNAAIRARLIEAGKMAVAKLKTLE
>CT1172 hypothetical protein
MCLIMFIFFIVFVFICRLLLLFCVVYWFFYCFYFNVSVLILCFLFFLNLF
VFLQALIFL
>CT0963 transketolase, N-terminal subunit
MAKHLKPYPAEKKGKYLQLSTIDELKDMARQVRRDVIRMLAKANSGHTGG
SLGMADIFTALYFKILKHHPHQFKGEADQDMLFLSNGHIAPVWYSVLARS
GYFSLNELNYLREINSYLQGHPTCESGLPGINIASGSLGQGLSAAVGAAL
GLRMDGKKGEVFCLMGDGECQEGQIWEAAMSAAHYQLGNLIGIVDYNNQQ
IDGEVSEVMDIEPFADKWRAFGWDVLSCDGNDIEHFIDTLEYLRKDTGRT
KPVVVLAKTVMGKGVPFFEGTMPDKSNWHGKPPSKEDEAKALEILGVTIF
GDF
>CT0677 hypothetical protein
MSAHEKDLTAFINMSTRPKTTRAKNDLLFTWALPGPAGQEYMPLIKRKAV
FV
>CT1755 ABC transporter, ATP-binding protein
MKLLERTIAEIYEEIPEVSMFLGDYGLDPLERGIPLGHWLESLPEEIYED
IGIDREGLVENLEEYLRQLEMMQLGKLPPVGDITIIGGHDKSGTPEEVRL
TLTRGSVTSIVGPTGSGKSRLLADIEWMAQNDTPTGRRILVNGELPDTDL
RFSLEYKLVAQLSQNMNFVMDTTVAEFVAMHAESRMVERVEEVVREIVTQ
ANLLAGEPFEATTPVTSLSGGQSRALMIADTAFLSSSPVVLIDEIENAGI
DRKKALSLLVRKEKIVLMATHDPILALMAERRLVIRNGGILKVITTTPKE
KENLEKLEALDNRLLALRTRLRSGELIEDAG
>CT1535 nitrogen regulatory protein, P-II family
MKEIISVIRINKVNETKKALIDAGIPAFTATGRVMGRGKGQVHYDILQGA
EAGHPEAIAQLGNAPRLVAKRILTVVVPDDLAQTAIDTIIKTNQTGKPGD
GKIFVTPILDTIRVRTGEEGDEALK
>CT0116 ArsA ATPase family protein
MRNIVFTGKGGVGKTSIAASTAVRAAALGYKTLVISTDPAHSLGDSFDIE
LGPSPVKVAENLWGQEVSVYGDLSLNWEVVREHFAHLMEVQGIEGIYVEE
MGVLPGMEELFSLSYIKRYNESSEYDLLVVDCAPTGETLRLLSIPETFGW
MLKLMRNMEKYVVKPVIRPLSKRISRLHDFVPDTDVYDQVDHLFSSVEGI
IELLSDNSKTTVRLVMNPEKMVVKESMRALTYLNLYNITIDQIVINRVYM
DDVDGQYFKGWKEIQKKYIAEIESSFGPIPITRVPLFRTEVLGLEMLKKV
GETVYGDRNPLDIFYHEEHTEIVKPGEGHYVMKLRLPFVFDNRMEANIVQ
VGDLLTIRIGNHQKSVVLPTFLAGMKVTHAGYEEKWLAIEFRKKETAK
>CT1876 conserved hypothetical protein
MQFTNDSSMSSNIAKTEFSEQDFRQFQQNLRKETLMLMEWFSEDVFENRQ
VMCGFELEGWLVDQNCNPAARNEELLARVNNPLVMAGLSKYNFELNVAPH
PLNHCLPEFLRGELQTLWDSCSRHAREMGCQTLMVGILPTLQDRMLTLQN
MSSMQRFHALNREILRTRSCHPLKINIEGPNDRLEVVHNDVMAEAAATSL
QIHFQVPLSKTAAFYNVAHVLSAPMAALSANSPFLFGRELWDETRIPLLE
QAAHTPSFVDPTGRPVSRVTFGRDYVRDSLKEVFLENLDGYPVLLPVTFN
HDPGMMNHLRLHNGTIWRWNRPLIGFGENGRPHLRIEHRVPAAGPSIPDI
IANILFFYGAMLHLQPEVPQASISFEEARTNFYAAARSGLDAQVRWTSGN
SMPVETLILQHLIPGAILALAAAGFRSSDLRYYLVDILAQRVASHRNGAW
WQKAFVKKHGPDFRMLTQAYLENQNLGTPVHEWSI
>CT0883 ATPase, ParA family
MKTIALYSIKGGVGKTAAAVNLSFLAASPTTPVLICDLDPQGASSFYFRI
KASRKYNSEKFLRGNSKILKNIKATDFDNLDLLPSDLSYRNLDIELSESK
KPKKLLSKNLEGLEEEYRYVFFDCPPNLTLLSESVFRASDMILVPVIPTT
LSVRTFNQLVEFFTQNGLDSSKIFGFFSMEEKRKTMHREIVEEFSANPAM
LRQTIPYSSDVEKMGLTRAPLNATHPKSNAAQAYNKLWEEVRMSIDR
>CT0403 conserved hypothetical protein
MEQQKLRELLETLHQELGQIGTVDEKTAEVLATLNEDISKLIEGGKDAAE
NEESLTDRMGEAVAHFEEEHPRLSMLIQHVLDSLARMGL
>CT2041 succinate/fumarate oxidoreductase, iron-sulfur protein, putative
MSDSTIQHHEESRDVTFRVHRFNPQVDSKPYFDDYTIKLEKGITVLRALN
YIKEHVDSTLTFRAFCQAGICGSCAMRINNMSKLACTTQVWDELEKAREP
GVIKIEPLRNLPHIKDLVVEMDPVVGKMKKYSNWVVSRMPEEQWGKKEFL
ISEEEFQTYDKATDCILCASCVSECTILRAHKEYVSPVVLLRSYRMNADS
RDGIHDQRLADLVQDHGVWDCTHCYRCQETCVKNIPIMDAIHGLREDAIE
RRGTKDTSGARHAEAFMDDIEKKGKLVEATLPIRTNGIAWTLKNLLPMAV
KMIIKRRTPPPPPLVKASKGIQSLRKEMKEMASHITKDHEKKSGE
>CT1044 hypothetical protein
MLTEEVILIAEKQCGSRIFAFCRNGDKSGYRWYITIMFGKVMLCMNNLAA
GMKVWISIRSSLAVSN
>CT2090 hypothetical protein
MVVKFRLWLKLCSRKESFFSMKTNPAGSVVRSGRNKMLTGGGKSGTFHIF
MNEKQGFSVLLSFMMMAEACSRIVLPVSGGTV
>CT0664 hypothetical protein
MDINDLFDKIMKSINQFADEIAEQRLQEKMQDTGRGPKNGGKNAENFEAK
EKQGVTSFPKRE
>CT1837 TPR domain protein
MQQTVWQNPEDARGYLNLGKEYARQQRFDDAIQAYRRAIKLEPGLDEAYS
ALGAAYFDKKEFNAALPWMQKRVDIAPDDSLRQFDLGNVYFQLNRYNDAV
ASYQKAIDNSYSFQEAWYSMAVCYIKMGKIDEARKIHKWLQTKNNYLAVS
LERHLQNDVPDKAGK
>CT0347 hypothetical protein
MKRGLYSTFDQPLFSKHQFFLVHFEMDARSVMVHSPIKQTPTAMKQRFLQ
AFFIVFAISSFFGGVSLAADSSPAVSKKAATSPESGKTVSKDISKPGVWS
KFEMLSFTDRVVFRDTMAGLTGVGYEPLVVRKQIVEGVNYEFFCNARAVY
PGTDWHPAMVLIYKPLKGNAVIKKISKIDGR
>CT1904 hypothetical protein
MFMLFKGLGALIEREKLPNNQKHKSLESLWPNRCPESL
>CT0934 transporter, putative
MMESQSVLSNFLLFAGGLAVFLLGMKSLSAGLRKASSGRVAHLLSTITGN
PVSALLSGVAATVAVQSSSMVMLTLIGLVQSQILTFSQSLGVVLGAEIGT
TMMAQLIAFSPGEFGLPLFALGFFLSLIRRQRVLSLFGEAISGLGLLFFG
LKLMGMAVEPLQSSPQFLSILTTLDNPLLCVLAGAVMTAIIQSSGAFIAI
LITMAQQGVLTLEVAVPLMFGANIGTCITAAIGSAGSVRPARRVFLAQLM
FNVTYVIVFLIFLPPWLDLIRAISPSDAAGGLARQIANAHTFSNLFMALL
FLPFLPQFGRMLVKLLPDDPDEMGRIPAVWHLRTSALATPEVAVGLARLE
ISRMNKILGRMASALPVPFIGGGNGRDLIYKRLGVSEGLMMREEKLDFLE
QKVTAYLIRVMQEAVGEPLIREAAALMALAKEIESAGDVVETLLADFPSG
KLNVGLTAEGKAEIERLHEQVCREIAAMNLAIEEMSPRRASAVIEEGKAF
DRIFFDLGFSHMRRIKSRSESERTHDLHMELLRALDVLHHQAMSMARTIV
GMTADRNA
>CT1300 conserved hypothetical protein
MKVIGINGSPRRAGNTSIMLKTIFEVLEDEGIETELIQVGGTNIKGCRAC
YACIKNKNSECSTKGDGFNEIFAKMVEADGMILGSPTYFADITPELKALI
DRAGFVSRTNGQLFRHKVGASVVSLRRGGGIHAYDSINHLFQICQMFMVG
STYWNLGFGGRDGGEVVNDTEGMENMRDLGHSMAFLLKKLHN
>CT2098 recombination/replication protein RecO, putative
MVLRDIKYRDQSKICLLLTREYGQVSVILKGGRSAKSRIGPLFSPGNVID
AVLYKKGNRDIQFLSDASLVLSPLSESPDLDRFGVLYRVLDLIRYASTHE
EKNVPLFTITHSAIWRLCHAERNFQTILAWFLLRLVGVLGFAPSLDRCVF
SNADLASSIEEMKLDELLFVHDPGGFALPGSAVTMGAAIQTVPVNRYHFI
RNLAATGGNAPCPAAPADDIAAVTALLQEYCARHLDRMPHRKHLDIVSRL
ISA
>CT0674 vrlI protein
MNEMEDRWLSITEICKYLGVSNDTVYKWIDKHGMPAHRMGRLWKFKKDEV
DEWVKAGGAAESSDAGKSRNL
>CT1927 conserved hypothetical protein
MSPTVFCEQGFRFFFFLREEKRMHVHVISGDGEAKFWLEPELELAKNHGY
SRIQLKQIESIVEAHSDELVKAWRKHFSS
>CT1248 hydrogenase, iron-sulfur binding protein, putative
MGIQEQYRSEAAALLSSGEVKLVIGFSAGSTADRRRPFFARTPEEAEKFV
LDAACIANLSTYLVAEGLLSDGKKVGVFLKLDGIRSVNILISEAQLKPEQ
VVILGYAIENGKDVVPLEGRNISDFNIGEKIRSHTPPPHVIEAAEKIEAM
SAQERFEFWKEEFAKCIKCYACRQVCPMCYCRRCIVDVNQPQWISTSSHT
LGNFEWNLVRAFHLAGRCAACGACDRACPVNIPLRLLNYRMGKEVRSAFD
YVAGENSDQKPVLASFKQDDPETFIL
>CT1222 lipoprotein, putative
MRQLNRKLLRDLLHMKGQMLAVTAVVACGIAMFVSMSNVKYSLEMTRADY
YSRFRFADLFVQLKRAPEFTLEAVRRIPGVAIVAPRIVTNVTLDVPGLDE
PATAQLVSLPDRGEPALNGVFIAEGRMLDPSRPEEVIVSKPFMKANRLKP
GDHIGAVINGRWKRLLIVGVGLSPEYIYEVQPGAFFPDNRRFGVFWMSHK
ALASALDMTGAFNDLSLTLTHGASEKEVIRQLDRMFSRYGSLGAYGRDQQ
LSNRFIGDEIRILGMEITVLPTIFLAVAVFLLNIVLQRLVSTQREQIAVL
KANGYDDEAVGLHVLGFALAPTVAGVLLGTGLGAWLGSALLRLYGDVYNF
PRLLYVFRMENALGAVLLSFGAAVGGALAAARRAVKLPPAEAMRPESPPV
YRPGLFDKSPIAHKLSTPVRIILRNIERHPFKSALSVTMISLAVAILVAG
RYTFDSVERMSEVEFGSRHREDVTVIFNDAMPPSVANSFASMKGVLEAEY
YREEPVKLTNGYRSRRQTLKGLQKTDGLQRLIDSHDRPQHLPHQGLLLTT
TLADLLGVKPGDVLHVDFLQGARRSVEVPVAGTIDEILGMSAYLRLDELN
RLAGDGGALSAGVLKIDASKSAQLNDRFKHTPGVAGIMMLQALKKSFSEM
IAKSMNTSTFILTSFACVLAFAMIYNGARITLSERARELSSLRVLGMTKR
EIAVILLGEQAVFCIAAIPLGFLFGIILSTLLAKALSSELYRLPLVFTPA
NFLFAFTVMVTVAVVTGALVGRKIVKLDLIAVLKTRE
>CT1924 hypothetical protein
MSRSQTACAFDANSTTIARVKSTGHGSFTMTMCRTIEGGLDDLGSPRGSK
LAGKLLSALKEWRNEPVALSFSPAELMTLPAWFPSGSSAEYCDSLCRIEA
GYFLHEADRWQWHDMVLEPTPDQPSGLDRRMLLFYPVKPAQFIENELLKH
ARVGWRGVHVEAVARLSSVTGETLAVLELEERYAALSISTNGKISYFRYW
PVKDGSEREYFAIRELTSAPIDGAPVKVTGSAASAKVIERIGRETACAIE
PLELHPWVSVEKGASKGKSPTATIRAVSTAIMALNGG
>CT0685 hypothetical protein
MLGHGRIYTCEKGCLFSSRQLVKANGNPYGQSVMPVKLHQNGRITPAIRR
AIQSSSLSASQLAARHGVANLKALMPVGVKYLPKMPDGPSRRFNVHSSP
>CT1808 TPR domain protein
MSLLDFFDDNLNPSEGFFPDRPEGASDPDSIHDPEELLDLIIQLNEEGLH
ETSLVAARRLEELAPYNAETWFHLGNSLTLNGLFDEALEAFQRAVLLSPA
DNEMALNLALAYFNTGRLDEALEEIERVVSDSTIARDICFYRGLILQRLE
RFEEAEKNFEQTLQLDPEFGEAWYELAYSQDILGKLDNSLVAYEKAIDLD
PYNINAWYNKGLVLSKLKRYPEALEAYDMALVISEDFSSAWYNRANVLAI
TGRIEDAAESYTKTLEIEPDDINALYNLGIAREELEQYSEAIACYKRCIE
LNPEFADAWFALACCFEALENYEASLDAIGHALVEMPECIEYLLLKAEIE
YNLGRLDQSLKTYEKIIPLDPDSPQIWLDYAMVLREAGAMDASIRALEES
ISLQPLSAEAHFEIAATYFAMGDNQSTLKALSKAFKIDPDKKQLFQSVFP
ELYQQDAVRRLLEIS
>CT1092 ABC transporter, ATP-binding protein
MTKLLELRDLSFGYGKGGPPVFEHLSLDVGKGEFLVVKGPSGSGKSTLLR
LICRLNTPRSGAILFRGRNTTEIPPSELRSKVTYVPQIPQMIDGTIRDNL
LLSFTFAQAQRKSPPDDASLERMLEAFYLQGVGLDQSANTLSVGQKQRLS
IMRAILTEPDVLLLDEPTSALDTESASMVFSIVERLNTGEGKTVLIVTHS
EYLPAVPQARSCTFRNGKLECS
>CT0532 exsB protein
MRAVLLVSGGMDSLVATALAHREGLELAAMHVNYGQRTWQKELECFRQIC
VHYGIVDRLEVDAMFLSAIGGSSLTDATIPVGPADLAGTDIPTSYVPFRN
ANFLSMAVSWSEVIGANRIYIGAVEEDSSGYPDCRKVFYDAFNQVIEHGT
RPETRIEIVTPLIAMSKAEIVRRGMELDAPFHFSWSCYKSEGKACGVCDS
CARRLRAFASVGMDDPVEYEVRPNYLQ
>CT0458 conserved hypothetical protein
MEPHYRVTIFGSARISEGDEAYRDVYDIARGLAAEGFDIVTGGGPGLMRA
ANSGSKSVSNGGQSIGLNIKLPHEQCPNPYLDIKEEFDRFSGRLDAFMAM
SDAVVVAPGGIGTMLELFYSWQLVQVQHLCETPIILFGEIWTSLLLWLET
EVLPRHLFERKDMHSIFHVMEASEVVDLIIKIHKARPETEHVCRNFNKYR
LDIEQAGKK
>CT0274 carbon-nitrogen hydrolase family protein
MIRLATVQFTPRLGERQANLEAIRSLLDPVEADIVVLPELCSSGYFFTSR
EELAPFAESPGGVACSFFQGLADAKRAIIIAGMPETAQGCFYNSVFVFRP
GVADPLVYRKSHLFYKERFVFEPGDTGFPVIRDEQLDISIGIMLCYDWRF
PEVSRVLALGGADLIACPSNLVTDAWRKVMPARAIENKLYVAVANRCGTE
TRGDETLLFKGCSAVYDPYGETVALADADNDRVLLAEIDPRSCRDKSFNE
FNDIFADRRPELYGAICCPRR
>CT1213 conserved hypothetical protein
MHSLYLKPKEHRRLVSGHLWVFSNELREVPRDIAAGETVQLFTHDGRLLG
AGFFNPQSLIAFRLLTRGEEQPDRDFFRRKLLEALKLREKIYPESETNAW
RLVHGESDGLPGLVIDRFDRAFVLQSFSAGIDQHLPLFCELLRELFDPKA
IVVRNESPLRELEGLPLYRETVLGESSDMHQEIRDSGISYRVNILEGQKT
GFFLDQRENRRHIRKYAAGADVLDVYTNDGGFALNAMHAGAKSTTMVDIS
QEALQRAEQNARTNGFGNFSIVAADAFETLGQLRHENHTFDLVILDPPSF
TKSRKTVPTALKAYTKLNRLGLQLVRNEGYLATASCSHHVSEEDFLAAIH
LGAMQAGKHLRLISRAAQPPDHPVLLAMPETSYLKFACFYVTNL
>CT2065 conserved hypothetical protein
MAHKSEYGKKVLVAGATGKTGSWVVKRLLHYGVPVRVFVRCEEKARRLFG
EGVEVVTGKIQDAEAIRRAVSGCDAVISALGSSAMSGEASPSEVDRDGAI
RLIDEAAKAGVRHFAMVSSIAVTKWFHPLNLFGGVLSMKLAAEEHLRKIF
GSEGRSYTVIRPGGLRDGEPLQHRLHVEQGDHLWNGWMNRSDVAELAVLS
LWVEKAANKTFEVIIETPEPQESLAGCFDKLAE
>CT0087 conserved hypothetical protein
MSSFQFFGMQPYQADPSSQINAALDSIKALVISLDGVLTSGVITLDGEGR
EMPTLFARDLAGLREALRLGMKVAIIAGRQAGAFRQMLEATGPIDLFLDG
EERLDAYEAFKSRHGLQDDECACIADDIDDLELLKKAGLPVTPINGAEYL
RNRVAYISVFEGGRGCVREIVEMVLDHQGRWKYSEKQAQG
>CT0028 C-20 methyltransferase
MSNNDLLNYYHRANELVFKGLIEFSCMKAAIELDLFSHMAEGPKDLATLA
ADTGSVPPRLEMLLETLRQMRVINLEDGKWSLTEFADYMFSPTPKEPNLH
QTPVAKAMAFLADDFYMGLSQAVRGQKNFKGQVPYPPVTREDNLYFEEIH
RSNAKFAIQLLLEEAKLDGVKKMIDVGGGIGDISAAMLKHFPELDSTILN
LPGAIDLVNENAAEKGVADRMRGIAVDIYKESYPEADAVLFCRILYSANE
QLSTIMCKKAFDAMRSGGRLLILDMVIDDPENPNFDYLSHYILGAGMPFS
VLGFKEQARYKEILESLGYKDVTMVRKYDHLLVQAVKP
>CT2199 hypothetical protein
MPSLDWIGKKAVVNHHKKVAFDRLRRLALRCSPGC
>CT0361 membrane protein, putative
MLKTVIQLLLTVAALAIVLNKTDIGRLTSLMSHANPWYLLGAILFFNISK
IVNAIRLNRFFTSIGLRLSAWYNLKLYYLGMFYNLFLPGGIGGDGYKIYV
LKKNHGLTAINIFGAVFWDRVSGIFALIFLSALLIIPSSFATLYPGEMVW
IWVIAGATYPLSLLLTWLFYRQFMPVFIITAFESLVIQATQVISAWFILM
ALSATANQIDYLAIFLISSVATILPLTVGGAGAREVTFFYALNHLGLDVN
TGIALSLIFFAISAISSLVGILSRIRHEKSGEPLPAASE
>CT1004 hypothetical protein
MLPRSISGISADGKQFIDLIDGLTLPPLVRNKYLRYILDEHHSAVYAYGV
LALRGDSDMGEIEEVLDVVAANAEHYIMGHGKVIRGENGNVSGPKP
>CT1775 hypothetical protein
MNYSFKTLWNAMFLAVGPVWFVLVWMIWSSGQLKTAEDHTLFLGLVIPGF
ILIYVSGFLIQKRHAKKIQGQHS
>CT2245 hypothetical protein
MTRAWEQIVKENRNVLLERWTSSVVAMLPGGMSHGSLVATAIAEELGVLL
DAVADRSMQAAEPIMRITRILAVQDIPPSKSLSILFMLKGMIEALPVECD
HPCRDRLEELTLQAFDSYMKHRETIYQIKYDEARRKMHMALRRAEA
>CT1723 hypothetical protein
MLYFGKNRRHRHWASACRCIGIEMNPIQTRNHMNRPTARTLSAAMILLSA
GLVIGGCSPTVKVEAPDKPITINMNIKIDHEIRIRVDRDIDNAIGKRTDI
F
>CT2092 conserved hypothetical protein
MSETKHDTLRFGIVTDIHYNPESKTGNQTQAGLERCIEHWTREGAEFVIQ
LGDLISREGPEAESDLIAVRDMLARFPGKVYHVAGNHCLAVPPERYKTIM
GLDSLYYTFSSHGIRFIVLNGMDVSAVNDPQTKADRHLLEYYRDNVKAPF
YCGAIGARQLEWLVNELDLALKNEEPVIILSHLPLLEETTDEKHGLLWNH
EELTAILFRYPNIRACLSGHYHSAAHARSDGIHFIVLPAFAGWPPGECCL
TVKITGENINIGRQDAPPLFDIPLP
>CT0060 hypothetical protein
MIKRRICHMFGAMIPLLLATLFMAACNGNTPKRVTDIDGNTYGTVNIGGH
VWMAENLRVTRYRNGDPIAEVKEGASWTAQTAGARCSYDNSPENGKTYGF
LYNWYAVSDPRGLAPEGWHVATDKEWQALADALGGEQEAGAGLKAPGKWG
NSSGETQSSGFNALPSGARRDADGVFLMLGQFARFWTSTPASNGKALARA
LGFYDNALRVGEVVPRNGFAVRCVKD
>CT1271 glycosyl transferase
MLTPRFDIQPAVSVILPTFDRAPLLASAIESVVAQNFAEWELIVVDDGSS
DETFEIVDGYLNQHANIRYMKHRNRKAALSRNAGIQAAFGRYITFLDSDD
LYLPEHLESRLHLLESMPEVRLISGGFVCEGDPWVRDRDDPEKLIHVSEC
IAGATMFGRRELFLDIGGFRALDYAEDTDLWERAARSHKVLKIDQPVSYI
YRRSPGSITRTYKPAKP
>CT1401 hypothetical protein
MMNRDEKEASKSIGLAWAASLVVTALITLLASFWLQSASIINGGSTGSQK
QALLANMRMNIARSAEKEKSAVLTTSDEESARFAEESKAATREVNRDLKT
LEAIIAKSGSAKEKELVARFGKSWAEVQQINAGLLESAPQHTNDKALELS
NSIGADLMHKIDDNLTKLTAKVTPPARKAQMDKIAGDARIAIRNIALLQS
RHIDAMADADKKKFESSMQAEQTKVSSALKALDKMTSKQSRPYIREAKTD
FSEFMRVNSEIVRLSTLNTNRSTAALSLGDKRKADAGCDRALEALQKLAN
GKP
>CT1500 transcriptional regulator, MerR family
MAFESSKSYYSIGEVSRISGIPAYLLRYWEDYFSELAPARDTRGNRRYTN
RDIAMVLNIKDLVYEKGYKLNKASQMVKGGKVDHDGADHKTNEILKLQKQ
LLREQKEYEITGERRRMLLLEIKDEIEDILELLG
>CT0733 hypothetical protein
MEAKNLKETPTVVKRVLESYTSLEGAVFFVQF
>CT0323 gamma-carotene desaturase
MQRREFFQHFLKRAGIGAGALGAATAGLVGYYQPRKEVFDTSGKNNDELA
EKLTTPKKAVVIGGGLAGISSALELARRNFEVTLVEASPSLGGKLTGWSI
EALGEQFPVEHGFHGFFDQYYNLNEMFASAGIGSDMFTASPGYPVIFSDR
QVEVFGQTPKWFPFNILSVVQQSKRLDIMSFLKDYPGLWPVISMFRYQYD
RTFRDWDSIDFMTYCRRGEVLPAFIDTVLHPFSDATMNRMEVLSAAEAMR
YFHFYFMGSPEGLAFRIITKDCMSALIEPLERKMTSLGVRVLKGRKAQNL
VMQDGRVTAVRLDGAGAANGEVASIPKREVPVTGWLQHMSDAGIPVLVAR
RGASWVALDGRCTHMGCPVAPEVSTGGFHCPCHDGRFNAEGLPVSGPPKA
PLPRLDVREAGEMLVIGQASSSSSPVVVTAEELPCDYCVVASGVRGTREL
IALTRPGNSGFAGQVAALGEADPYVVWRVWLDRPLPSADFPFYTVSGYTY
TDSITFYSSFQQPFIDWAKRTGGCVVELHAYAVAPQDVRPEPEIRATMLQ
ELHAMFPESKNATIRHEIFMMQSNFTRWAPGDHAKRPGVETPYANLFLAG
DWVSTKAPVFLMEAAVFTGRMAANAISAKESLRQKPLPIVPMNGIFA
>CT0099 exopolyphosphatase, putative
MQGRVLANNCKTMSNATERIACIDVGTNTALLLVADLDAAASNIVTVDHR
QTIVRLGQNVDEYRMIHPEALDRLIACMTEYRNLCDGLGVQRILAVGTSA
LRDAANRDEVIAAVKGETGIEIRCISGDEEAALTFFGAVAGLPEVPEPFT
VIDIGGGSTEIIMGTVEQVDSAVSINIGSVRMTERFCAAQPPSPEAFEAA
KKEINRKLARSLPPFFAGRQQVFGVAGTLTTIAQVCLGDRHFDAAKVQGY
RLEYDAVHELLDRLRAMKLNEIVALGIPEGRADVFTMGVLILHQFMRMLG
VGSLTVSIQGLRYGVAQQELQKLLMLRNRT
>CT1244 hypothetical protein
MLHRLMTADVMVSWKLPDVSNAGEPFGCGDTVRFYVKVSGVPAVFG
>CT0278 RNA polymerase sigma-70 factor, ECF subfamily
MDRSFSELVAEHQDMVVNTCYRFVFNREDAEDLAQEVFIEVYRSLDKFRE
ESKLSTWIYRIAVTKSLDHLRRLKRKKRFSSLKRVIGIDDPADSIPAASN
DNPADVLDSKERLSVLQNALDGLPDNQKTAFLLSKQDGYSNQEIADILQT
TIPAVESLIHRAKKNLQSRLERHYRSGN
>CT0848 hypothetical protein
MMTDHHVSELFRKPGFQKAIQKTPGQTQDVQKRRSPRPEAKKRAPRRIIF
S
>CT0734 conserved hypothetical protein
MGIVINLFLIIAASIVFFVVGFYIGRFFLERIGTTKVLEAEERAVQVIQE
AQKEANDYKELKVNEVNQEWKKRKREFDSEVTIKNNKFAQLQKQIRQKEV
TLANQMRDIKETEKKLQEQREELKHQTQNVQNRSAELEKTILEQNQRLES
ISNLTAEEARQMLIDNMIAKAREEAAETVHQIHEEATQKADRIAEKIMLT
AIQRISFEQATESALSVVHIQSDELKGRIIGREGRNIKAFENATGVDIIV
DDTPEVVILSCFDPLRREMAKLTLQKLLVDGIIHPVAIEKAYQDAKKEIE
DVIMSSGEEAISSLQIPDMPAEIVNLIGKMRFHTVYGQNLLQHSREVAML
AGLMAAELKLDAKQAKRAGLLHDIGLVLPETEMPHALAGMEFLKKFNMSP
VVLNAIGAHHGEVEKASPIADLVDAANIVSLSRPGARGAVTAEGNVKRLE
SLEEIARTFPGVIKTYALQAGREIRVIVEGDNVSDSQADVLAHDIASKIE
SEAQYPGQIKVTILREKRSVAFAK
>CT0491 hypothetical protein
MLVKAKDYGIACVGITDYFLINSYKRLIELINDDSRLNALLNPPYADYAK
QLLVLPNIEFRSSTIVRHVDIEGKTATREPIFTSFSPTQFRRKRLKRIFF
ES
>CT1349 ABC transporter, ATP-binding protein
MSKQPYIIRISNLCRYYTMGDQTVKALDDINLDFRRNDYAAIMGPSGSGK
STLMNILGCLDTPTSGRYELNGQHVADMDDDELARIRNREIGFVFQTFNL
LPRLNCLRNVELPLVYAGVEPEERLERARQALEQVGLVDRIDHKPGELSG
GQTQRVAIARALVNHPSIILADEPTGNLDTATSHDIMEIFSKLSDAGNTI
ILITHEEDIARFTRRIIRLRDGRIESDSAP
>CT0600 conserved hypothetical protein
MMSRSIQNPLAALVALLMLMPVMLRAGQKGEAMTFELTSTAFRNMGAIPA
LYTCEGKNISPPLTWKNIPKGTKSLVLIVDDPDAPDPAAPKFTWVHWVLY
NIPPGKTGFAEGAGNHPAETEMQEGFNSWNRGGYGGPCPPIGTHRYFFKL
YALDTVIDDLLSPLKADVEAAMQGHILGETVLIGTYKKRGK
>CT1888 spoU rRNA methylase family protein
MRRESFPPLSKAMLGRLARLGQKKHRDSEGLFLAEGLRTVSELLQSLSDP
SMLHALVFDEKAAGQLDGLERFAGKAWLAGPNEFKRLAQTTSPQGVVAAF
RKPESGEFRPASARSFVVALDDVQDPGNVGTIIRTAAWFGAEAVICGRGT
ADPYNAKSVRSSAGSIFALAIDTTPDLAKTLRRLQADGFTVAASALDGQD
YRFFAEWPARRVLVIGNEANGISAEILALADRRLLIPPAGARPAVESLNA
SVSAGILMATIHG
>CT2224 chorismate mutase, putative
MSESCEINNDHWRELEEWRGKIDEIDRQVAALLCQRLQFAGNISSVKSRI
GEAVLQPEREKEVLINVLAHTDSPAMSQALERIYHSILNESRLFQQEYKN
GQQNHSSR
>CT0071 hypothetical protein
MKRALLLLIALFSFDAVRSNLEQYNLFWKASRQAQAGHYAPAIQQYRQLL
DRYPAGLLRCEASFNLACAEYAMKHYRRAAELFAALPPGDATLSKTAGYN
QGNALAMEAFRSRKGPAQEALLGRALAFYRRALLDNPQNADARINYEIVL
RAMQHRQPPSPAPQGGGSPDGKGQDGGGAVSQLILENARQEEARQMRKYF
KPLPTKPSEQNQPDW
>CT0992 drug resistance protein, putative
MKKSPLAILFLTVLLDLIGFGIVLPLLPTYAKELGASPFMIGLIASIYST
MQFIFSPIWGKLSDKIGRRPVMLSSIFLTLVSYVFFSKAVTIPLLILARS
LSGIGSANIAAAQAAITDVTDSKSRSGAMGMLGAAFGIGFIIGPLVGGVL
MTNFGISMVGLFAAGLNFINFTLALFLLNETNPHTEGFLSLFRKNPESVV
HTNNSLFASLAHKSSAYADKIHEVFSSRPVALLMIINFIYTLAIVNMQVS
AILLWSDVYHATEQQVGYLFAYVGFFTVIVQGVMLPKMTRNYGEHKLMVL
GHITSFIGVFFIPFIPVTSLFTVGLAILLFFAIGTSLVNPLNISMISLYS
YKQKQGQIMGFAQSVNALARILGPFSGSILYGYDHRMPYYVAGALTVVGT
FISMTLFKYEIEAFEPTTEMAE
>CT1690 hypothetical protein
MLAQPFDDIPLKLRQLALLLLQQRFDVEVLELLQVVQAYSAVFIPVAGFS
GMIFETTGRTTMGRRSCDGFFSPQIWQACACVSFVLMVVSR
>CT1443 hypothetical protein
MKARSQELVEKSVSAMVSAIEVYNKPDFKYREETFSILAINAWELLLKAK
WLKDNGNKVRMLYVTEKKLRPNGKPYKHAKVKMTGAGNPLTHSLDYLAKR
MAEKKTLADAAHKNIIALCEIRDSSVHFYNKSGVFAVRLQEVGSASVRNY
AKAAQSWFDIDFSQYNFYLMPLAFVNPQQPGDAIILSKEEKNVATFISSL
EAAGDPEADYAVSVNVELKFLKSKADDAIKVQVTNDPSAPKVQLTEEHLK
DKYPLNYAALTKACSARYFDFKQNQKYHDLRKPLKSDKRYCHVKKLDPDN
PKSAKQAWFSNAVFNVLDKHYTVKG
>CT1351 ABC transporter efflux protein
MTLPIRETVIQAATSLAVNKLRSALTTLGVAVGVFSIIAVMTALDAVDRS
IASGLSSLGANTFQIQKNPATVFGEGHNRNLYANRKDITWQEAQLFKKYM
GQSARNIGLIITSQAAQASYGNEITNPDVTLTGGDESFAPANGFDVIEGR
NLNDGDLRYASDTAVIGSDVAAYLFQQGQNPVGKQIRVNGKAYTVIGVFS
KKGPAFGQSQDNFVLIPITRFLEEVNQESSIAISVEATSQKTYQQTIDQA
IGAMRLARGLTVKMPNDFEIRTNESLVDSFRDIQRIVSIGAFIISFMALL
TAGVGIMNIMLVSVTERTREIGIRMSVGAPRRSILQQFLLEALLLSIGGG
VLGIVAGAAAGNLVAVKFNLPVMFPWLWVVVSLTVCSVIGISFGLFPAWK
ASSLDPVTALRASR
>CT0928 conserved hypothetical protein
MIASKTATTRRYVITGGPGSGKSTLIEALEARGQRCYPEVSRELIRREAR
RPNGVMPWNDLEAFARLAFTEMLLQHDHAEEAGERCFFDRAIPDIFGYLL
ERGIDIQESWLDVHRRCRYERTVFILPPWPEIYVNDAERPQTLAEANALH
NAIHAVYESLGYELIEVPRMPVEARCEFVLGRLCCGKEEAIKYSRKA
>CT0897 phosphate ABC transporter, permease protein, putative
MPGRYCCKYVTIPAFFLVRQKIPLTRQECSIRHRERAATHDRKHVVSVKR
VSKINRAVMSAEVVRGSAHRSGDFVVSASKRKMQKVAKVGGEGVLLVLAS
FVAIVVLFIFYFVASDAIPYFKLRGFSEFFTSTSWYPANDPPQFGALAII
YGSVMVTLGSTLLAVPMGVAAAICLSDILPFSVRQYAKPVIEMLSAIPSV
AYGFFALVILAPLLQANGGPILMWAWLLLASPFLLLAVIVAADLLSAKID
NDKQRHLVSKVLYVVMGGASMLLLYVVAGTLDRMEILTGTNALNVSIILS
FMALPTIVSVSEDSLQAVGRDLREGSYALGATRAETIVKTVLPAASSGIL
AAVILGVMRALGETMVVWMASGNSSNMPSPWYNYLAPIRTLTATIAGDMG
EADQVTGSARYHVLFAMGLLLLVVSFVSNLISERIVARQRRVLAGQ
>CT1829 conserved hypothetical protein
MTSRMIKKIKAFFAGAGIGALGGLIGLGGAEFRLPLLLGMFAFPPLEAVI
LNKAMSLLVVASALPFRAATISWETLFAHWTIVVNLLAGSLAGAWAGASW
ATKLRSETLYRVIAILLLGIALVLLTGHQTTTTGTPLFDSPALLMTTGVI
AGLGIGIVAALLGVAGGELLIPVIVFLFGADIKLAGSLSLAVSLPTMLVS
FARYSKDSSFVVLGANKSFVLIMAIGSIAGAWLGSRLLGIVPDSYLLPML
AAILLISALKVWKHK
>CT1031 ATP synthase, putative
MNEREPEKLSAKDKPLEKRVGDSELRKIRARKNATRSIWEGFAMFGIIGW
AVAIPTLIGVAVGIWLDRHYPSPHSWTLTMIVVGVVIGCLNAWHWVSEEN
RNIDKEE
>CT1139 oxidoreductase, FAD-binding
MQLFGDILIHGRNPYLLQELIGSEHRRNRIFEHARKDIETIQSSANGEPR
VLAIVETLRAKLEKFRAEIERLPEFRRKLKKELAPIVGAKNVLYDPFSIV
AHATDATDWRLYLPVAVVTPDDEAQVAPLIAAIAKLGLRVIPRGAGTGLT
GGAVPLRSDCVIINTEKLNHVRGITERTFHLKDSHTVTGSVIEVEAGVIT
ETAMHYADEHGLVFATDPTSEWACTIGGNIAENAGGKMAVRWGTCIDNLL
EWKIAMPGGKLWTVRRTDHQLRKILPEDTVTYEVLDQHGAPLKRIILRGT
EIRKQGLWKDITNKALGGVPGLQKEGTDGVITSAVFVLYPKYEEKRTLCL
EFFGPDMDEASRVIVELSKAFPYQNVEHETLLALEHFDDEYIRAIDYKVK
ASRPQTPKAVLLIDIAGHTEAEVEAGVERVRALLEKHPNTLMFVARDQAE
SILFWQDRKKLGAIARRTNAFKLNEDIVIPIDQLAVFARFIDDMNVEEER
YSQLQYVERIDAMLRESSNPESLTPFEAKIPAGLGLCDLIRNRLEAADPL
LLRSLTLLQEFRTELNQLFRGYPKTLDAIEADFKYVRDRRIVLATHMHAG
DGNVHVNVPVLSNDRPMLERADHVIDKVMEKVISLGGVVSGEHGIGVTKL
KYMDKARIDELTAYRREVDPDGIMNPGKLEDYEALNHIFTPSFNLLELEA
HILKRGKLEALSKKVDYCIRCGKCKPDCCVYYPARGMFYHPRNKNLAIGS
LIEALLYDAQRERSTDFKLLQWLEEVSDHCTICHKCLKPCPVDIDSGEVS
ILEREILSARGFKNSPPITKMTLNYLANRSPFYNKMFRNAVLRFGGAAQR
AGTKITAPLQASRDAQSVIPPLRLLRSSVPPVPEKTLRDVLPPCDSDQVL
VFEPTSKEATSTVFYFPGCGSERLHSTISMAAIHILLETGTRVVLPPPFL
CCGFPLNVNAKEEAYTSIVLRNTVMFSQIREMFSYLDFDACVISCGTCME
GLEVMDAPKLFGNRIVDVSRYAYEKGMRVDGGSTQSLYHAPCHDSLKGKA
CDLLRDVGGFGKVTNVPHCCSEAGTLALSRPDITDSMLHRKREALKESMH
GEATATILTNCPSCVQGLGRNRDMGIEPKHIAVALAEKHSGPDWMERFLV
QAAKAQAVMF
>CT1786 nifU protein, putative
MKDYLPKTDPLYDKVISALETVRPYLQVDGGDCQLVGITKDMVVDVKLLG
ACGSCPMSTLTLRAGVEQAIKKANPEIVRVESV
>CT0759 hypothetical protein
METVPCPITGSREFTPFLQALDRFNLHGEPWQLVQSSASGLVMLNPRPGP
DEMATHYPAEAYDPFLNQTNSRSLRDQCYLAISDVLMAGKASMVMKGIKK
PADATQVLEVGCSTGRLLLRIRGDFGVPLTNLFGVETDRQAASTARRAGL
RISKPDLCDADFDSRRFDRIIFWHALEHLHRIGEALDQARELLRPDGQLI
IAVPNIESLDARGYGPNWIALDAPRHLYHFSPDTLQKLLEKHGLSILNIG
RWIPDTLYNVWYSEKLERSINGKPFGISGIARAGVRAAKSLAAGRNPKRA
SSMIVRAVRMKR
>CT1510 carbon-nitrogen hydrolase family protein
MNTDQVRIALVQMSCVENPQENLRKAQERIRQAAAGGANIVCLQELFTTL
YFCQTEEYEPFGYAEPIPGPSTAALQELAAELGVVIVASLFEIRAKGVHH
NTAAVIDADGSYLGKYRKMHIPDDPGFYEKFYFVPGDLGYKVFDTKFGTI
GVLICWDQWYPEAARLTALRGADILFYPTAIGWATSETSQEVRASQRQAW
KTSHLGHAVANGVFVAAANRAGTEGELEFWGNSFVSDPFGQVIAEAAHNS
EEILYADCDLSKIGFYRSHWPFMRDRRIDSYGDITRRWIDE
>CT1162 A/G-specific adenine glycosylase, putative
MNTALVEAFQAKIFDFYEKNRRSFPWRLTTDRYAVMVSEVMLQQTQADRV
ASRFARWLERFPDVRSLASASLREVLEEWSGLGYNGRGQRLHRAAAMIIE
RYGGEVPAEPAQLIELPGIGVYTSRSIPVFADNLDIAAVDTNIRRVLIHE
LNLSESITPKALLDVAEVVLPKGRSRDWHNALMDYGAMELTGKKTGIAPL
TKQSSFKGSRRWYRGALLRELIAAGELSREAVEERYADCPHGIGSIVDSL
VMESMIEEYGEQRMLRIAGENSP
>CT2252 hypothetical protein
MIFIKMSLLNDLLEKTLSEIWEIMPWNLVDQMAENSELIVISMFEVRRCS
KNEFCENEQR
>CT0778 nickel-dependent hydrogenases b-type cytochrome subunit
MGRIIEEIYVWRLPVRLYHWVNAFSIAVLLGTGLYIAYPVLAAPSGEAVF
HHGIATWRGVHFAVAFVFIANFLFRMYYALMARDDQYARFGGFQPWKPSW
WGKPFKEQMAAYLFLRKGEPDYVGHNPVAALTHFLFIFCGSVFMIASGLA
MYGENNPGGLSETLFGWMLPLFGGSCNLRFAHHLMAWIFPFYILVHLYAV
FRHDIVDRSSVTSSIITGYKHKVENEAAL
>CT1998 hypothetical protein
MKTVRSLCFRSLLLLIVVLPFFSCSHQGQDADTASLYDEAARFYRLKKYS
DALDRYDRALAADTLNGFSQKALDALCRKSRIEFLTGRYAGAFRTWDAIR
RHGGKNLPDSLHTAVALDTGKMYAELGMYGQAASVMASLRNPDAWQLFDE
ARFLFRAGKIIEALRIYTKLSVSDDNAIKISGLSGILDCALTGRVTGLDT
PDNLAGKIAMISGRVMKMNTSPEVKIKALRIAARSLQQMEKQRPNASYLL
FRALAIAQEAGYPRLVAILQYESNNLIVRKPDTYRSVIEYFGQRNMPFAK
VAALFMLGRSVELTPAERIEAYRLGLAACQHYGIPATATDYVTLEREAAG
ELCDLLAAERRYIELFDASAMADYLEHRRLVYAGISGFRLPPGHEAVQNE
IIELTRDISGLLQRKINMLEEGTGFALASVVDQAIREKQGRLIELIVEAS
KVDSTVPERLQPRLLTLRTLQKSLRSDEALIRFLVRDSLSTSMLVSSREL
QIVTAKVTKEQVRARFTALRQRLASASPNLEAILAADDDRRWLSGTLLQS
MGDRLSDYRHIIFVSRNAEPFHLLGRGPMLGSDHQVSWLFSADELLLYAA
VKSQGDIVFFDASNPEKAAIYKLFHPEDQLFLSWKPMQENEDAGLKQLLK
KASESGASSSDILKKAVQHSGPIGAQAWLWLGPYGAK
>CT2280 hypothetical protein
MPHSFHYTRRKRSDMEQQTSGQRILDPIERAKLGVKVFNLPYSQAEALID
DYVSGKNYDQASVDYFKDQVATQIHIREKSAELLVTGGEIIKLITRSFMQ
NLPKSIDRS
>CT1851 partitioning protein, ParB family
MSKKALGRGLKALISEEGFAVAEKAEETEKMQDGVIGSLPVEKIKVNPFQ
PRQAFEETALNELRNSIIENGVIQPVTVCRDGEGYLLISGERRLRAVKSA
GFKFIPAYVIEAHEDASKLELALIENIQREDLNAIEVALALRSLVTKCNL
TQDEVAQKVGKNRSTVANFLRLLKLPRQIQDSIRTREISSGHARALINLP
SEHLQLKVWRQIMARQLSVRQTEALVNNMFKDKPKTASPAPAPRAVQIDQ
IEARLRERLATKVSLVEKKGGQGEIHIKYFSGEDLDRILELIGQ
>CT1085 phosphate acetyl/butyryltransferase family protein
MPIDAMSDRNHAFASDEPPEIQVHPHDRYHAVIKKCASLPPLVTAVVHPV
DSQVLSAVSDAVMEKLITPLLVGPAGRIEKAADEAGIDLSKWQVIDTPHS
HAAAEKAVELAAAREVGAIMKGSLHTDELLGPIVARGSGLRTGRRLSHAY
VMDTAGYHKWLIVTDAVVNISPDLCAKADICRNAVDAWVALTGESCLPKI
AVLAAVEVVNAAMQATLDAAALCKMAERGQITGCIIDGPLAFDNAISRQA
AKEKHIVSQVAGDADILLAPDIEAGNILAKQLTFISHADAAGVVMGAKVP
VILTSRADNLRTRLLSCALAVLVQRAKEEGRIK
>CT0793 hypothetical protein
MSDQVSTAPCDHEWFDSYQRFSSFSPESVRDGCHPGIMEHLQSSQKAIVL
VHGLSDSPYFMAAINPKIRGTRYLFSAVFDHAVTTMDAHRPFFLPIFPPT
AESNSYASIATPAS
>CT2053 hypothetical protein
MACFHDNEIKLIYFKQLSKRGDVKKSIDNRFCFQDYCRQSGDVWDASAME
LQAGNTGVFAFDRFQERWPWSVALAGVI
>CT1739 hypothetical protein
MKRQPVGRPRINSDRGRFPDAGVSKFEYNINQEKHMQEQSNKNNEQKAPA
CISTIGVSRCRCGAYHLRYRYVDVAIPRETLYLIMEECFRYEEECAKREG
QRPEAMVFSLGVVTLAILPLDFAAFSKAVGDAVNEDLGIGRLFAGAEAGD
NGLADGTQN
>CT1348 membrane fusion efflux protein, putative
MANTKRSGKLRNIFIIGGALFAIGVAALIWLNTREKAVEVTTEKVFRKEV
VHTVTATGKIQPETEVAMSPDVSGEIIELPVVEGQEVKAGQLLFRIQPDI
YVNQVKQSRAQLNLSKAQSMEAKARMLKAEDDFRKADILYKDKLISQTDW
LSAKTNAETSRAAWKAARYSIDQNQSLLDQNEERLTKTVVRSPINGTIIS
LSSKPGERVVGTGQFPGTEVLKIANLDNMLLKVEVNENDIVNVQVGNPVT
VTVDAFGDRTFKGEVSEIANSAKTQAANTQEEATNFEVKIKILNHQRLLK
PGMSGTANIETQRVPNALVVPIQSVTMRTAGGKKAATPTDSTSNNKVVQL
NQNHRLTDETEGVFVVEGDHVRFRKVKTGTTDNTHIIILEGVKEGEEVVS
GSYGAISRELQDGSTVKLQKKR
>CT0165 hypothetical protein
MASSLKNRKFRAENCCGWLFAAHRSSTHKGVAGLPFMCFPMPASYR
>CT0417 AslB/AtsB family protein
MLVVTTACNLDCAYCYEGGGNGQGEMMDLDTALRALDLVAASNQPFHVQL
TGGEPLLAGELVFRILEYIRNNNLPATTAIQTNGVLLDHQTARRLHSFGT
AVGVSVDGLPAIQEQLRGQGAATWKALAMLDSERVPFSVTTVLSSRNTRD
LSTLGMALHSMPAASAIGLDLLVQKGSAAAKIGVQPPDAVTIRSGVTRLL
ATLDVLNAQRSRPLILRERQTVLKALGRDEARPYCQACTGSSLAVTPRGE
LYPCTQTLGDARFHLGTLDHPRLSGTALPDGRLPKREECAGCGLQGRCPG
DCPSRLHYNGADGAGLVCALYRTICDYCLYKGEIPS
>CT0880 hypothetical protein
MQFLKVYSVFGEEVVGCHVRIENLSVACSVVSAGGTHPSEKCVCNHEFSS
VSLHQKKHPHLEKLLETSRPIKLDSSSKVLILSDLHMGNGGRRDEFRRNS
ELVRSMLQDYYLPGGYSLVLNGDIEELFKFSVEDITKVWGHIYDLFLQFE
KNGFFWKTYGNHDSDLFEERNYPLSKHLLESIRFQYGDEVMLLFHGHQAS
ILLWETYPLVSRAVVLFLRYVAKPIGIRNFSTAYNSRRRFAIEKSIYEFS
NQAKIVSIIGHTHRPLFESLSKVDHLKYKIEELCRQYPSALPEERLAIQE
RIGELKAMLDACFTEGKKIGLRSGRYNNIAIPSVFNSGCAIGKRGVTALE
IDGDRIRLVYWFKEKQGRRFVSDRNSEPEQLGDTGFYRIVLNEDHLDYVF
SRIRLLA
>CT0448 hypothetical protein
MFSSNGYGSAPNIEYLSLNATDDGLERYRGLKQ
>CT0496 polysulfide reductase, subunit A, putative
MHKKIISRRQFLAAAGVLGGMSLLRPIWGLATASTSESTAAGGATTWVPS
ICNFCSSFCDIKVQTRESDGVRRIVKIDGNTESPLNRGRICARGQAGLRQ
TYDPDRLKQPLIRVAGSKRGEWNFRAATWDEAYSYIVSRLQKINPWEASL
IGGWTACVSYMHFSLPFCQTMQIPNIVASPLQHCVTAGHLGTDLVTGNFN
VHDEILADFENARYILFSLNNASVAAISTARAVRFGQAKKNGAKVVCLDP
RMGELASKADEWIPVKPGTDHAFFLAMLHTLLREKLYDADFVSKHTNAPF
LAYKDKNGAVHLAADMSGGKPSSYYVLDSISGGVRAVPGYINTNEKAAGG
GRIQPALNAPAGLTWKGHAVKTVFDMFIEESEPFTPEWAAAITDVPAETI
RRIAIEFGQARPAMVDPGWMGARYHHLIGQRRLQAIIQTLVGGIDKPGGW
MMSGEYHHRSEKAWHNMQHGIDSSHEPPVERPGMGFAHALLDIFANPKAW
EHGKPALSFAWAMEQQKQGKPSAFLPAMADTGLLEAVKGELTFNGEPYRM
KALIMNAANPMRHYFPAKRWEEILTNDNLDLVVAIDVLPSDTTLYADVIL
PNHTYLERNEPLLYPLGPETGIGYTTRLRAIEPLYDTRDTTDILCEIARR
MGRLEPYLDGIAEYAGLDPQLLRNEVAAAQRAKKPLNEAFLKTAYVAIGK
FSGKLTGKEMTGAQVEATIRDKGLLLLKNADEVVKEMNLPRKLPVPTMSG
RLELFSPILESFAQQAGQQPLFNPVLGYVPRVLTNQGDKNDLSADEFYFT
YGKVPVVSHASTNSNNALLAAVTEPKKGQFMGLWMNAAKAKTLGLSNGQE
VEVTNLRYGPKVKATLFTTEMIRPDTVFLPSAYGSKNKKLSIAGGKGTAL
NELMPYSIEPIAASFMSQEFTVSVKPVNS
>CT2236 muramoyltetrapeptide carboxypeptidase, putative
MNILIPKALRRGEVIGLISPSSTCAEPEKIERAVTYLERCGYKVKTSLYL
NRSEHDPAHTDRYKLHDLHQMFADREVRAIFCLRGGAGATRLLDRIDYGL
IAANPKILVGYSDITALSLAVFRKTGLVNFSGPMAATELLAPSSYTEEHF
WGMLTDPGYSKHLTNFSEHPISCIRPGAVTGRLVGGNLSVLSSLVGTPYL
PSFSGALLFTEDVNEPAYRIDRMLSHLFNAGLAQKCRGLMFGQFSKNPAD
ENRDYRFDKMFTYYANRMHDGVPVMTGLSYGHIRELMTLPVGARCRLEIS
PERFAFGAVDAPVSR
>CT0224 glycosyl transferase
MNILFINSIGRNKFGGGEKWMVNAAKGLTDRGHNVVLASKRNSRLLDYAA
GSGVRTEVMEIRGDFSPLATLKIAAWMKRHQTDILICNLNKDVRVAGLAA
RLVGRPVVLARHGMLLCSKKWKHKLSLTRLTDGIVTNSRTIREAYAGYGW
FSENFVKVIYNGLTIPAEVTAFDFASRYPGKKIIYSAGRLSKQKGFEYLI
DAAAMLKRKRDDLLFAISGEGKLEIELKNRVAALGLKGSFVFLGFTPDIY
PFLKGCDLFVLASLFEGMPNVVMEAMAMQKPVIATDVNGARELMGASPES
LVCDTGLIIPPKNPAAIAEAIEQIIDNPALLEAYGKAGHQRVETHFTVPI
MIDNLEKHLQSKLAEKAGRC
>CT1081 hypothetical protein
MTVFMMRYTDKTHQQLLNHAAPAIKHSMMSPGACRLRIISSVQ
>CT0575 hypothetical protein
MSKTALLPTVTAVRMVWAMSGKFSNDAPSQTMATYVEDVLKGIEEQAPQI
SSDERSYVTSAVATMKASLRSIDVISKGRDLNFKENEKLRSAYLESVKES
LDFGNKAQDFLKSLPAMTIAGAGGVTVAQYFFKASTFELWGFGLILTGIG
YFFNQYIVLWVRRKKQMLYVTQDYERGLYYEQYLDRVRLVLLALFLDLER
IHKRVFRENYEADTTSVAVQSIIDDILSGVRSTFCPYAYKHMAEKKVTPE
LWTLCESGIRKAVENCPLWEGGQQSENRIDSL
>CT0279 phosphoglucomutase/phosphomannomutase family protein
MQVKFGTDGWRAIIAKEYTYDNLKLAALASGKYFLSHPDKSNGVCVGYDT
RFMSKEFARYTAEVLSSMGLKVFLSSSFVSTPAVSLYTREHKLAGGVMIT
ASHNPPIYNGFKVKASYGGPAHPEVIDEIEKNLSGIDPSTLVKPAENLIT
MTDIKSEYIAYLESKLDLKLIRESGLKIAHNAMYGAGQDIVTRLLDESMV
NCYHCSVNPGFDGINPEPIPPYITDFVEFFKEVETDVAIINDGDADRIGM
LDEKGEFVDSHKLFAIVLKYLVEQKGQRGEVAKTFALTDIIDRICQKHDL
KMHLLPVGFKYVSRLMTTNDILIGGEESGGIGITSFLPERDGVYTGLLML
EIMARRQKTLTGLVEELYDEYGFFSYNRLDLRVAESKKAAIIEAASGGKL
KSIAGYPVTSFNDLDGFKYHFEGGWLLIRASGTEPVLRIYCEADSTEKVE
KVLAFASRLA
>CT1353 OmpA family protein
MTMKTIKKFSKPAALLLLASTATVTTGCQSTTNAGRGAGYGAAAGGLIGG
IIGSNNGSWVQGALIGAAIGGAAGAVIGDYMDKQADEIRQEVQGAKVERV
GEGIRVVFDTGLLFSTDSATLNANSRYNIEKLARILNRYNDTNVVIEGHT
DNTGTEASNQILSERRAESVATLLRTYGVSGRRLTAIGYGETRPVATNET
EAGRRLNRRVEVLIYANDALKRQAQAGELKL
>CT1884 hypothetical protein
MRPPWFCVEPLMFTEHKFQSVEPARNEKKPARHVMDTGRNRP
>CT0924 hypothetical protein
MQYQENSNPIHLEEKDMQTIYADGIANMLLVDGVVRFDLVNVTSVEKDKE
PNVRSNATVALSLPAVIRIQDQLTKMIDKMVEDGILTKNNPAAN
>CT1220 conserved hypothetical protein
MSHTTSTLLIAGAGASGMLAAIAARRVAREHGVADERLRIVLLERNPKPG
NKIAISGGGHCNLTHDADVKSLLEKGFLNKGERRFLRHAIHMFSNADLLK
LFGRYGLKTEVREDGRVFPVSGRAGEVLDLLRRMVEESAVTLVTGARVER
LECGAAGFVARAGERRFEADAAILATGGASWGSAGTTGDGNRLAVAVGHT
ITPVLPALAPNYFTVPPRPELVGITLRNILLVASVDGASDSRRGDVLISH
RGISGPACLSLSRSAAGFLASGKKVTIFVDLFPGHDEGKLSAFILDQAAR
HGSRQVRTFLQRCPLAPERLDAPVASSNAETIPNAFADEIMRQAGIDREV
TMSGLTKAQRQCLVSTLKRLALGAVHKVPLDRAEVSAGGVSLREIDPKTM
QSKIHPRLYCCGELLDYAGEVGGFNLQAAFSTGWVAGSHAARMIIETVHF
SLRRNRQDHQLCGRTA
>CT1564 conserved hypothetical protein
MADGWRVFKIMAEFVNGFETMSRIGPAVTVFGSTRVKEGDAEYQLGETMG
KLLAETGFAVITGGGPGAMEAANKGAQSKGGASVGFNIKLPNQQRPNRYI
DYDKLVTFEYFFIRKVMFLKYSQAFIVLPGGFGTLDELSEAITLIQTGKS
QKFPVIMMVADYWGDFYGWIKKRMLDEHGFIRESDLDFIFIEDDPAKVIE
IILSFYPEGYRINF
>CT1455 ferric uptake regulator family protein
MKQTGSDTIAKVEALFKSYMKEEGLRCTQERLSVLREMYGSSTHLDADEL
FVRLRKKGVAISRATVYHTLDLLFRFNLVTKIDLGHKHTHYEKSWGVTNH
LHIICLKCGHVSEATSCELPELMETLCVGHGYSLGSFSLQLFGECADKEA
CARRISNKHKEKT
>CT0382 hypothetical protein
MRTDKLYRTRENRNWSKKRGIRDAVKGNFGQAQQRYGPFGQEQHDGGLGH
FSGDESGSAVQPAFLRRFEWRCWLASRIENIHRIRSPQAAAMHCCLSAYR
LAWQGLIINIATSSWACSIDFFPDMPFEAWTSNMTYLRTDSG
>CT1643 hypothetical protein
MDDPAIMWNAGNIRGRHPQLRLTKGFKSGEKSRVEVAVAVARTIGETNVS
GADSGKDATMPSIQGHLALSTHSSSRRNRQPSLFQAITNRRNGIRRLTKR
MKLSTRGHACSNCRCHWATSCFLPVSFSPERISTITGKASVRAATAPKPS
VRTEAGLPCATRQARRRR
>CT1570 hypothetical protein
MNAGECSRQKKYPLLRTLSERYVKHDIDSTKKLAMHIT
>CT1143 oxidoreductase, short chain dehydrogenase/reductase family
MRKSLGVVITGGSAGLGLAMAREFLRAGDRVVICSRRESNLKSALQMLGS
DVPDRNVYGMVCDVSLPAQAADFAAFAAAKLGIIDRWINNAGTAGRKRRP
LWELDLSDIDETCRTNLSGSMMLCAEALRVMLRQPASADEPLYHLFNMGF
SSAGLRSSPTSVPHRASKRAVAIMSKLLRQELEAAGIRSVGIHELSPGLV
LTDLLLRDATPAQKRFFNAMAETSETVAATLVPAIRAITGRGSTLRYQPV
LFMFAKLAASAFGYRKERFFDSEGKPWG
>CT1993 conserved hypothetical protein
MRQRMSRMPSSFYIHTFGCQMNQADSEIVTALLRAEGFVPSADETNADIV
LLNSCAVRENAEERLGNILMHLKGRKRRCKELVIGVLGCVPQFERERVFS
DYPFVDFIVGPDNYRELAGLVAGLREAVARPALLDYDQTETYAGIEPVRA
GSISAFLPVMRGCNNHCAFCVVPVTRGRERSVGFERVVAEVVALEKAGFR
EVTLLGQNVNSWRDAEKGLDFAGLLEGVSLAVPSMRIRFTTSHPKDISEA
LVKVIAARPNLCNHIHLPVQSGSSRMLDLMKRGHTREEYLDRIAMIRSYI
PEVAITTDLIAGFCTETEKDHRETLSLMEAVGYDTAFMFHYSVRPGTWAA
RNLPDDVPDTVKKARLQEIIELQNAISREIFQREIGKTVEVLAEAESKRS
ESMLMGRTPENRVVVFSRGRFNPSDTLLVKITGATSATLSGEAV
>CT0982 conserved hypothetical protein
MNIDKAVYAVAGFFVIASVLLSIYHNQNWLWFTGFVGLNLFQAAFTGFCP
LAKILKAAGVKPGHAFE
>CT0814 conserved hypothetical protein
MDKQLEAARKFLKDNVRQMVDFSQTAQNRRVPPPPIEKPMPEEAKTIPLP
VIGKAKSAGNIDLWSAIGQRESCRFYADEPMSLDELSLLLWATQGVRLKL
DAGHALRTVPSAGCRHAFETYLCVLNVESLDKGIYRYLPLEHALLCSHAP
EKLESRIVRATLGQRFTGDAAVVFVWTAIPYRMEWRYGLAAHKVIALDAG
HVCQNLYLACEAIGAGTCAIAAYDQDEMDRLLDVDGEEEFTIYLAPVGKK
G
>CT2226 hypothetical protein
MLNSMKSLRLHRSCTVFFFVIEIIRQNMLCKPFYGVFPWSFAG
>CT1722 conserved hypothetical protein
MHMITTYRLATLAKTFVLFFTILAASAASAFALDLETARAKGLVGEVDNG
YIAIPPGAGTEAQQLVETVNKQRLAVYSEIAAKNGISVEVTGQRTFEKRY
PSFPAGTWVKIKGVWSKK
>CT0796 band 7 family protein
MLFMNILVLLALAVAFFVSAVKILPEYERAVIFRLGRIIRAKGPGLIILI
PYIDRMVRVDLRTVTLDVPPQDIITRDNVSVKVSAVVYFRVIDPIKAIID
VADFHFATSQLAQTTLRSVCGQGEMDNLLAERDEINERIQSILDKDTAPW
GVKVGKVEVKEIDLPEGMRRAMAKQAEAERERRSKIINAEGEFQAAQRIS
EAAAIIAQNPAALQLRYLQTLQDIAVENNSTTIFPVPIDLLTSFFEKKA
>CT0485 hypothetical protein
MKEYKVLTQKDRFFGGTFDPEKLEKAINSYATEGWVVVSVATASIPSLTG
AREEMIVVMEREK
>CT2144 outer surface protein, putative
MIFPLLVAGVCSAPSLAQAAESPYVSLSGGLGLLNNSTVNGNNDAIKYKT
GYLINGAVGLKNQDVRVEAEIGYHRNSVDSSDYADPRGAHVSMWTFMANG
YYDIPMKDSVVSPYVMGGLGVADASISGGNMSGSPSSTEFAWQLGAGVGI
KAAKNTTFDLGYRYLSPSDAKFDGAKVSLASSNILAGIRYSF
>CT0998 hypothetical protein
MVIVKLICSGFPYQRLSTMAHNDFIVQKTTIKKANTATALRLYDGF
>CT1188 conserved hypothetical protein
MTDFEWKSNYLKSHKLKTLGVIIVLVLRYLYTSLFIYGFVHKIIHGWMWS
DILAHHFTKRLHELLATASTSSSFEATVAVWQASYLEHFALPLVMPIAWI
VTIGELAIGIALLFGVTTRINAAFGLFMLLNFAAGGYYNLTIPPLVAISI
LLIVLPTGHWLGLDRSLNRKYPESPWFR
>CT0365 hypothetical protein
MEKERQRLVEEEKREESEQQRCRKQQVLFLISFIYHGVFVCFDDVYLAFS
KPFGTIFGGSFSSIMMIFPTVSPYRSYGCYRRK
>CT1356 hypothetical protein
MTQQTNHHSGTANGSRCVVLDTASYCLLLKGDSPPFSAEERSLLIAESPE
EACDFQCLLGDACQPFLSQEYRRIHPGLYEKLEAIAISGWNNDAPFGIDT
AAHLMKKLPLDLNCLINSPIALSPNDPLPALIQKNLVHKHVEANVLVSEP
FTAGRLRYFNIFSETAELKFDHQSPHVQGLLILEALRQAGIASAHYQGLP
LDGKLALLNYNTSFYHFLEQESPIICRCYTDFTSSKTSDDAEACIYMQVF
QWGRLCADAILKGFACTNAERYEQKEQRLKKIIERHKTNFDSKLKRMYES
MVSTQCM
>CT1366 hypothetical protein
MKKQQQLDLLEKALVSTGHKIRYEKGSFVGGDCRVKENMIVVVNKFLPIE
GKIATLAAVLRKINPPALSPDVVKIIDTVVPTNLFSRENI
>CT2216 hypothetical protein
MILLSNEKMVAMMKRVHLMRALLIALAAVTMLFGTARAGWDPADEVNAER
TIGMFRKADPSLERFFSKAYGYAVFSDIYKGGFMIVGGGHGKGVVFDQGR
PIGHATVTMFNVGPQLGGQSFSEIIFFKDRAALADFTKGNYELSAQFAVV
AVRAGMATNTDYSNGVAVFAMPNAGLMGELTVGAQKFTFRQYR
>CT1416 glycosyl transferase
MDTVTKLSIIIPAWNEETGIARTLETLLALTAGRGDTEIIVSVSGNDRTK
AIARTFPVVVCHSEKGRAVQMNAGAKLATGSILYFLHADTTPPPSFCDDI
FSAIERGARAGCFRMTFDDPDWLMQLYGWFTQFPLPVCRGGDQSLFITKE
LFGQIGGFDERLQIMEDIEIIDRLQRHAGFHILDSTVTTSARKYHTNGTV
RLQAIFGTIHLMYALGFSQKNIVAFYRDSIR
>CT0502 RNA polymerase sigma-70 factor, ECF subfamily
MAGFSVNSPKARKSDQELVAGVLEDKKQFAAIVQRYEEPLFRYIVRQGAR
DRELARDILQEVFIKVYLHLNDYDASLPFSSWIYRIAHNETITHFRKEKN
RPLVLDKGDDDEFFGKIVDDLELSRADGQYDVSDIQTVLERLEPRYRDIL
VLKFFEDKSYEEISDILQMPQGTVATLINRAKKKVKASLEKN
>CT1549 membrane protein, putative
MNDGKATPGEAGCRENQLLKLPGFGNFTIIDRYIARQFLTIFLFALASFA
ALFILINLVENLDRFLDRHISFGRILIYYLSGLPDTFLLTSPLSVLLASL
FVTGKLSMQSELPALKSAGMSLSRLMKPFLLVTMAIAALNTINSCFIAPA
MYDWSKGFEKRYLKKQQDNGEEPLHIRESNNRILTVAKIGPDKKSATTVS
LETFNGSQIVSRIDADSLRIITRHKYWIFYNTKQRTFSKGAETLVTRAGA
DTLKLSLAPNTFKMIDTDPDEMNIVQHIDFIWQKARSGLPGLERATVKLH
TKLALPLASMIIVLIGVPLSSKKKRSGLAVEISISLLIGLLYLGMLKTIG
SLGYDGLLNPVLAAWLPDILFIIAGTFLYRSADH
>CT0558 zinc metalloendopeptidase
MNDLRSSAQQMESSRAQLQRVYHEKELAMKSQQEQLQSYSAKKKEKEVVL
SEIQKDKQTYAARITEVRRKQQIMQKKIESLIMAQQALIQKEQERARLEA
LRRQQRLAEARKRREAERRRKEAERQRVLAERRGEAPATKEPPPEKTLPP
EKEPLPVVPDQTESEIDRVSVDFDQGESLPWPVNNGVVVRRFGSSLDKEL
NIVTVSNGIDISVPVGTPVKSVSGGKVVQVAYLPTFGNVVIVRHPKSYLT
VYANLGRVSVAKGEIIRSRQLLGFSAAMPEGGSTVHFEVWKGKVKQNPQK
WLR
>CT2207 glycosyl transferase, group 1 family protein
MTVVTVPFLKNIPSGEPQLFDNALIRWLVALAGRMIAAFWCQALGFSSSE
RLIGLSNIYWGGVAASMPCRVRFYDANDDHLGFTSGQPWLRESMHRFVEK
ADLIFYVSNPLLDKLSPRPEQRCVELGNGVEFDHFAVPRPETPEQLCSLP
GPILGYAGAMDWIDADLVASVARAWPEYSVVLVGPAYAHDWADRHVELLS
LPNVHWVGKVGYDELPAWVQRFDLALMPLERSPLKRASNPNKLYEYAAAG
VPILAIDYCDAVRKASGVAHVASTPEEFVRLVTQALADGRKAARQAFARA
HSWEALAETMVHELRNAMQRRLS
>CT0582 conserved hypothetical protein
MRGEIDFSEYPEKGPVFAPLKEPGFFRKAFIEGGTIAWPNGADIAPESLY
EKLLQKEQNRDSVLH
>CT1558 conserved hypothetical protein
MNTFLIDYQRIRTPKKGFSKLFCDYSSESEARTKLLADCFHLDYRKDGDY
YRHLGFLASRNFRREALVELLTEQNERFDGSERQQREIEKLRSPRCMAIV
TGQQTGLFTGPLYTIYKALTAVVLARKQKELFPEYDFVPVFWIESEDHDF
DEASSTVLFSGGGLEQITAEAAHRLPDQMAGATQLGASIGATVQEFLDLL
PDTEFKPEIAEILESCYEPGVTFEIAFARTMNRLFREHPLILLSAQDTRF
KQLAVEVLCREVETAPASSYDVVAQSSILESMGYPAQTKPRAVNLFYLNQ
LGQRLKIEQPSPDNFLIVPDRQRYTRHQLLEICQDHPEKFSPNVILRPIV
QDAVLPTFAYIGGPGEISYLAQFRKAYEHFGLSMPFVIPRGSFTLVEPKI
ARTMDKVLKATGRPSFSRRQVYEAVFEDVQELRKSMVSGGDSQKLDALFE
QVESEVTRSLSTLEPALVKMDPTLQAALSASSGQITKIIGTIKEKTYRAG
RRKHDELLQQLDKAELNLFPDGKPQERSINIFHYLNKYGPSLIGELAKVL
QGYSTEAHLIVEL
>CT1949 NAD-dependent epimerase/dehydratase family protein/3-beta hydroxysteroid dehydrogenase/isomerase family protein
MFMDGVILVTGSTGFIGSRMVDALVGQGRRVRVLLRPESRSTLSAGYREG
VEEVCAAYGDPEALGRAVSGVASIIHLAGVTKAVDEAGFAEGNVRPVENL
LEAVKRHNPGLGRFLLVSSLAAMGPASSPSPGVMESDRPRPVSAYGRSKL
LGEAVARRHAGSVPLTIVRPPAVYGPGDRDILEVFTMMKNGYLLSAGPGR
RQRFSMIHVDELIRGILLALDSENAAGQDYFITSPRGYAWDEVIAAARPV
LGFRRLLRLNLPKPLVFGLGAVLGGVAKLTGCPALINKDKANELVQDFWV
CSPEKAERELGFTASIPLETGVPETLVWYRQQGWL
>CT1041 hypothetical protein
MAKKQTFGDKQKKGTVDFKMAKLVYSVKSEKTNAWKFVEKSVRIPNGENE
LDVLKKAMAGQGK
>CT1715 SUA5/yciO/yrdC family protein
MQTLVTDRPEEAASWLNRSETVAFPTETVYGLGADAFNPDAVAKIFKAKG
RPSDNPLIVHVATPEQIGEVAAEIGETAKILVERFFPGPLTLVLKKRAAV
PEIVSAGLPTIGVRCPSHPIASEFLRHCKHPVAAPSANVSGRPSPTDWKT
VYHDLGGKIACLLRGEPSTIGLESTIVDCSVNPPRLLRTGAVSIELLREL
VPDIEIATACKPGEAPKSPGQKYRHYSPEAEIILVESPPSNLQTDLHAAW
IGLAAPPAGAAKSLQCRDMDEYARVLFGFFRTCDAEGIRTIYCELPPEKG
IGRAIRDRLVKAAGEEVKNSDKQ
>CT2108.1 lipoprotein, putative
MLRYLFIFFLTALTACASRYAAAPDAKTQELFRTWWRVEEVDGRKAEFIH
GQRRDMHIILYASRKMVGSGGCNQISGSFAQSPGRIRFGAITSSKMMCQP
VVMSRERALISALRKSSSYIVRGRRLTMYDRTGHEVLRFMAVPFR
>CT1474 ABC transporter, ATP-binding protein
MQLQVARHRPGTGEDRLMLEARKLVKSYALPGQPPLKILDGIDLSVAPGE
MVTVIGASGSGKTTLLNLLGTLDTPDEGELIFDGSPVFQGSRCLLSKKEL
AAFRNRKIGFVFQFHHLLSDFTALENVAMAEFIGTGKLKPAKERAAVLLE
KLGLKARLDHLPSELSGGEQQRVAIARALMNKPKLVLADEPSGNLDSRNS
RMLYELMASLSKERQTSFVIVTHNEEFAATADRCLHMQDGRLQACGG
>CT1350 ABC transporter efflux protein
MNSTLRQFRYETVESLNIAVYQIRANKIRSFLTALGVIIGIVAITMMGTA
INGIDIGFDRSLAMLGYDVIYVQKGSWSTMGSWWRYRNRPDIETRYATEI
NRIIAGKTLSELIVAVPQMSTIQASARYRDREILQIFALGTNQDYPLTAS
GDLSAGRFFTAQESAAGDAVAVIGNDIATGLFPDGRAVGKSIRLRNHNVR
VVGVFKKQGKFLGLFSFDNQLIMPLGAFTKVYGKTSMVTIRVKVRDEKRI
PEAKEELTGLMRRIRRLPPGKADDFGINEQQAFKSQLDPIKNGIAVAGIF
ITGMSLFVGAIGIMNITFVSVKERTREIGLRKALGARRRTILLQFLIESV
MICLVGGVIGLVTALSITVLIQNLLPDFPVSFSPMLVLASLVVSVATGII
SGLAPAISASRLDPAVSLRYE
>CT0686 hypothetical protein
MMNPSKIFALAIACGIVLLTFNWMAQAQVMYEPLQNKAPLSIYKYPRVKL
GADLEANLTLIKADNGRLNITGTVTNVGKSSCKTASVAELIMNLGYAPQY
SYAKTGVSDILVSRSFNNLKAGDSIVVNAVYQIPDFGGWASANLPGNAKR
LFTLRVIKQDASSYKPDEDSNIENNVADDVVFYRDLTH
>CT0429 conserved hypothetical protein
MKRPIFVVIALTLGAMTAFIAARWMSGPKASGPSVVIVEQPIAAGRPILA
GQIKAISWSGSVVPQEAFSRTADVVGRIALVPMIPGEPVLPGKLAPIGAT
GGLSSIIPAGKRAISVRVNDVVGVAGFALPGSYVDILVSGRDVSGQPFSR
IVLSKVKVLAVEQDTVAEKDKPKVVNAVTLELSPQESEKLDLARNIGALS
LVLRNELDTTVVNSVGVRLSDVVYPQRGVPNTSSQFKQAAPVQASQAAPA
QVSQAVPARQYRGVEEIRGISRQQATTP
>CT1954 hypothetical protein
MKRVKDQESGGAGRLWRAWKAVKAVMISGVIRNGNNVIGLKNKGNRYLAG
FFQVGFIGW
>CT0218 hypothetical protein
MQQTFFYYNYENYLLTFLTIVTSVLDLTDSPLLGKGSSRLCYLHTPMTPT
SASRSLIPVILMSKSRS
>CT0548 glycosyl transferase
MFCKVMKIGISCHHTYGGSGAVATELGKALAMKGHTVHFFSQAAPFRLGL
YSRNIYCHEIEAMNYPLFETPYHSLALASKIAEVAFYEKLDVVHAHYAIP
HAISAMLARQMLEEKCPAAECFRIVTTLHGTDITIVGADRSMSDAVRLAI
NKSDGVTAVSGYLRDETIRMFTPRKKIEVIHNFVDTAVFKRLPGRRDLLG
LDGGKVVIHISNFRPVKRIMDVLAVFESIRREIPATLLLVGDGPDRSEAE
TWVRNYGIGDRVRFLGKLDDIVPLLSIADLMLMPSNVESFGLAALEAMAC
GVPVVVTDAGGFPEFVRQGVDGFRHPHGDIEGMSRSALSILRDDEVWQRF
SGAAANQAGRFKTALKVKEYEAFYRRLIDEARERKAQ
>CT0755 alpha-amylase
MTNPDSPALSPVERSLAEIRLAELTAAKTCYSSPATWEDEVLYFLMLDRF
SDCREHGGFNDVSGAPVVADGTRTTLLFRIEADANNADWQQWFEAGRGWC
GGTIAGMRDKLGYLKRLGVTAIWVSPVFRQVTGSDSYHGYGIQNFLDVDP
HFGTREELRDFVADAHQLGIRVILDIILNHAGDVFSYHDNQPYFYYQGWQ
WPVKGYRLNQGDAGSIPFSDGEAHLEISMEAAIWPVEFQSETTWTMKGEI
RNWDVFPEYLQGDFCSLKDIDHGWAPDDPAESWDLEKRISLFRPSAALDH
LIKVYRFWMAYADIDGFRLDTVKHMEPGAVRYFASAIHEFAQTLGKENFP
IIGEITGGRSYAMQILDVTGLDAALGIGDLPDKLEFLVKGWRSPGNPDTS
EQEGYFDLFCNSLLDGKNSHQWYSKHIVTMIDDHDQVGEQRKYHFCGDSP
EGRKLLKAALGLNLATEGISCIYYGTEQAFNGADPRSDDHSWGDVFLREC
MFGGPFGSLQSTGRHFFNEEHEVYRFAGRLAEFRKSEIALRRGRQYLRKV
SATGSEDDFVYPQPVNGEMHWVVAWSRIFAETECLCAINTSLERELTVWA
VVDHQLNPPGKTMRCVFSSSPEQEGEEIKAGPVCGSAVKITVPPGGFVIY
R
>CT0118 hypothetical protein
MGNYKFKAYYDEAYPPVPDKATLFWRKFIPWQLFRFFILNIKMIRIVVGG
HS
>CT1520 hypothetical protein
MQGQVSRLPCPPFCDLHHDGSLDIFSFARTKIPEDKQEY
>CT0634 ExbD/TolR family protein
MMTGQKTRLMSEINVTPFVDVMLVLLVIFMVTAPMMTSGMKVDVPQTTHE
RMDIDPKGLVVSVDASRKIMINNYQLDESQISERLPKILESMKAEEVYLK
ADKTLPYGFVMSVMAAIRDAGVEKVGMVTEPLVTDSQKR
>CT0411 iron(III) ABC transporter, permease protein, putative
MSNVAELRAPAHPWPSLVPKSGMILFMLILLVLLFMLDIALGSVSIPLKS
VVAILFGSDQEPVAWQKIVTTIRLPKAITAVIAGAALSASGLQMQTLFRN
PLAGPSVLGISAGASLGVAMVMLVSGSAANAFAIRQLGLGGSWLIVIASS
LGAAAVLLVVLAIAVKIKDNVVLLIVGIMVGNITVSLISIWQYFSEPEQI
QDYLIWTFGSLGGVLGNQLWVLGIVVGAGLLVSFAASKPLNVMLLGENYA
KSLGMSTFSIRITVIAATSLLAGSVTGFCGPIGFIGIAVPHLTRSILNTS
DHRFLMPSSCLVGAILMLVCDIIAQMPGNQTTLPINVVTALIGSPVVIWV
IIRQRNLKSSFA
>CT0184 ABC-type drug export system, ATP-binding protein
MQLPTTDIAIQTDRLTRSFGSNVAVRELNLTVRSGEIFGLLGPNGAGKTT
TIKMLTTMLPPSSGAATVAGHSILCDSIGVRKRIGYVSQMISADGALTGF
ENLLLFARIYNVPRRKRTQRIDETLAFMGLTEARDKLVCTYSGGMIRRLE
IALSMLHRPEVLFLDEPTIGLDPAARLQVWKRLKELLETFGTTILLTTHD
MEEAEELCDRIAIMKEGVIAAEGSAEELKQRAGTESMNQVFIHFAGEFSD
SKPNFRDINRARRTAKRLG
>CT1996 hypothetical protein
MVSGKPSASGFAFLPACPIPGKGDFLSSRLLFRPNKTKIILIY
>CT1960 conserved hypothetical protein
MERGINWIDTAAVYGLGHAEELVGKALRGLCEKPLVFTKCGLVWDENRAI
GVIVYSPMLSGMLTGAMTRERALNLPADDWQRNA
>CT2254 conserved hypothetical protein
MTYLLDANVFIQAKNLHYGLDFCPAFWEWLIESNASGKVFSIDKVAEEIA
TGADELTDWMHNHASDLFLNTDSGTVEKFGQVSTWATSQKYEPTAINTFL
NAADFYLVAHALSGGYVLVTHEVSSNSQRKIKIPDACRGLQLQCMTPYEM
LRREQARFILR
>CT0744 cytosolic long-chain acyl-CoA thioester hydrolase family protein
METYKLVMPEHLNHYGFLFGGNLLKWIDEVSYIAVTLDYPGCNFVTVGMD
NIKFKKSIRQGTILCFESKKNHIGTTSVEYTVDVTREEISTGSRELVFTT
RITFVSVDENGRKKAISA
>CT0831 conserved hypothetical protein
MIQPAEALRRPEAYHHPVEKVIEVVETHISWIFLTGQFAYKLKKPVNLGF
LDFSTLERRKHFCEEELRLNRRLCPDLYLDVLPVTESDGKIRIGGDGEAI
DYVIRMVQFDRRFELDRLLRRGELTKREIGEAAEVIAAFHAEAPRADPSA
KFGTPEVILKPMLENLDLTEEVARTIEERSDIEKIRHWTLTEHRRLGGVM
RERKALGMVRECHGDLHTGNMVIRDGKITIFDCIEFSHVLSIIDVMSDVA
FLFMDLEHSGHPELAWHFLNAWLSKNGDYNGLQVLRLYCVYRAMVRAKVT
SIRVAQESDEEEKAKTLAEHHSYIRLALGYTQPRKPMLLITYGVSGSGKS
TWAARLADLGGFIHIRSDVERKRLFGIDSLERSAGKGFDIYTPEATQKTY
DVMLDAASTALSAGFPVIVDATFPDARKRAPFIRLARAMNCECRILCFQA
THETLRERVRTRHKKGSDASEADQKVLEAQLHAIESPAGDEKALCIQIDT
EGEVTIEALLTALKM
>CT0011 deoxyhypusine synthase, putative
MEERSMQKAGFLKEPIKHIGITKHNVVPMVEEMADMAFQARNLARAAFIV
DLMQKDKECAVILTLAGSLISAGLKQVIIDMLEHNMVDVIVSTGANIVDQ
DFFEALGFKHWKGSQFVDDSELRELAIDRIYDTYIDEDELRVCDDTIAII
ANSMQPGAYSSREFIVEMGKYIEEKGLDKNSIVYKAYEKGVPIFCPAFSD
CSAGFGLVHHQWHNPDQHVSIDSVKDFRELTKIKIENDKTGIFMIGGGVP
KNFTQDIVVAAEVLGYENVSMHTYAVQITVADERDGALSGSTLKEASSWG
KVDTVYEQMVFAEATVAMPLIAGYAYHKRNWEGRPARNFNAMLDAKPVNA
>CT1741 hypothetical protein
MIGMIGGRRSQILVSRDFRAAVSQPFSKNHKLFIIMSLTDVREYLQRERS
ASLKQISSHFKADSSLVESMLDQWILKGRVVVKQRDVFGAACCGKCGGKE
HIHYEWVYEWVE
>CT0957 hypothetical protein
MKFSSTSTLFLLLLSCLCAPSRNGMCKEQPQQILVMWWNVENLFDTKNDP
KVDDQEFTPMGKAHWTEKKLLLKRLRIAQVFNAIRAEREYGKYPDIVAFA
ETENRQVFAGTLAALDRATYAIDYHESPDPRGIDIGLAWNPATVKFTGSK
PYKVRLNNRRGTRFVIAAGFTAASNHFTIVLNHWPSRSFDTQWSETNRIA
AARVARHIVDSLRTCNPQSEIIVMGDFNDQPENHSVKDVLGSSFDRKAVR
HASSRLLYNCWNEASSPGSYFYRNHWEQIDQMLVSAALLDEKGLSIDKTS
FRVFSIPAMFDRFGKGLYSTYKQGKFKGGYSDHLPLLLKVRIKP
>CT1753 conserved hypothetical protein
MTTMKNIRKACSSLLMLVVLFSFCGLQAEPAQSGPGEVKVFGLVEHPLTL
TVESLRRMKPVEKGATAIVCDSGQTKQTMRSFKGVLLRDILDSTKVVMPN
PRERGEYYALVRSLDGYNVIFTMNELRYGVAGDGAWLVFEENGKPVETGG
PFVIFCDNDRANGPRHVKMVESIEVSKVNAAP
>CT1251 hypothetical protein
MKMDTKRGRKRSLFLCKSFLILHDFPHPNIILIFRHH
>CT1186 hypothetical protein
MKENGNKGRKKIAHSKFIIHNEKKPVVQTGFFFHDMHPAVSR
>CT1684 hypothetical protein
MFLLHEQFARNFLAAISAFHFLHPEKPIAFIVFSSKCIWNV
>CT0114 ferredoxin oxidoreductase, alpha subunit
MNRSRLLLLGAEAVAQGAIDAGLSGVYAYPGTPSTEITEYIQRSAPARQS
GIRSAWSANEKTAYEAALGMSYAGKRSLVCMKHVGLNVAADAFMNSAITG
VNGGLVLAVADDPSMHSSQNEQDSRVYGRFAMVPVLEPASQQELYDAMFH
AFELSEAVRLPVIMRMTTRLSHSRAGVELKASLEQNPLSAEYAPGRFVLM
PQNARVQYRHLLDLQPMLEALSESSFLNQPVEGHGPLGIIAFGIGYNYAM
EARKAHSIECRLLKIGQYPLPRTQVNELFEACSEVLVVEEGYPVYEELFR
GYFGSAKVRGRMDGTLPRDGELTTDLVAAALGVKPPETATTPEIVVGRPP
ELCKGCGHRDMFDAINIAIKENADHHVFSDIGCYTLGALPPYNAIHTCVD
MGAAITMAKGAADAGLRPAVCVIGDSTFAHSGMTGLLDAVNDRTPVTVII
ADNNTTAMTGGQCSSASGSKLVNICIGLGVEEAHIRTIVPLRQHLHENVS
VLRDEFAYNGVSVVIAQRECIETAAKKKRRAAASAQ
>CT0977 alpha-amylase family protein
MVDPFDYFIDTISTIKASATVMPDTTEMSGQNWSASATVYNLFVRYNAAF
DHDRDGHIRTEPLPCGFRETGTLLKAIAMLPYIKRMGVNTLYLLPLTSIG
KINRKGELGSPYAIKNPYELDETLGEPVLDLPLEMQLKALVEAAHHLGMR
VVFEFVFRTASIDSDWIGEHPEWFYWLRKEGNLASYGPPHFDHDTLARIY
EQVDKHDFHELPEPDEAYRSRFAPTPEKITLGANGYIGTTADGGKCVIAS
AFSDWPPDDRQPPWTDVTYLKLHDSPRFNYIAYNTIRMYDEALEQPEFRA
MKLWREIAGIIPHYRKEFGIDGAMIDMGHALPPELKASIVAEARKDAPEF
AFWDENFDPSPKLKEEGFNAVFGSLPFVIHDVQYIKGLLNFLNRNGVAIP
FFATGENHNTPRVCHTLAGREAGRRRALFLFTLGAVLPAMPFIHSGMEIC
EWHPVNLGLNFTDDDRQRFPAETLPLFAPAACDWENTNGLEPICDDIRKV
LDIRSRHFDLIRCGDIGSIVQPYISDPALLTVMRKSGGRNLLFAGNSNFE
EPVTGTMEFGQKKMELDELISGRTLMVSDHKLMLDFAPGQCVLFEIPSAD
EE
>CT2205 hypothetical protein
MKKAPWPSLVAIALLALLLVVPFGPLLTLRFVPGSPDSVAPMALDKALEA
LQAQSGRYPLWQPWTFSGMPTVEAFSYLSELYLPNLLFGFLHFDPMYIQL
LHLVFAGMGGFVLARRLGLGSIPAFLSGSAFMLNPYMTAMLVYGHGSQLM
TAAYMPWVFWAALRLSEKGRLADAGLLALMLGLQLQRAHVQIAWYTWMLA
VPLLVVKILIDTKPPGVSKGKVGVLALAALALGGAIALQVYLPALGYLPF
SARSGAGDAAEAYRYATLWSMHPLELITYLMPGAFGFGGITYWGFMPFTD
FPHYAGLVVLGFAIAGVVAGRKKPMVLFLSAMTALALLLSFGNFFSPVYD
LFYYFAPKFSSFRVPSMALVVVALCLALLAGYGLQAWLDRPLVESSPVFK
WGGLVIGVAAVFFLAFEGELKQLLRAAFPAVQIDNYDLVPMVGNLRWELW
SGSLFVLIVVAAAIAGLLWIAARGMIGARAVAIVLVALSCADLGWIDHRI
VSPDDHSLRVSPLVERTALDRALEGDEITRFLASRPGVFRIYPAGRLFTE
NKFSLAGIESVGGYHAAKLGVYQELLARTDNLANLDVLRMLNVGYVLSPA
PIDNPALKAVAAGKLNLISGEVPVAVYELAGSMPRAWFAPWAVAVQSDDE
AIAAVMAGRGADGGAFVTGVPWQGMERFSTGTVLSMQRSAESIAMKVRAE
GDALLVLSEVFYPERWKLTVDGREQPTLKIDGIIRGIAVPPGEHEVRFVY
DRSRFETGRTVSLVATLLSIGLIAAGIVTGRTSSKTIKSSDKP
>CT2143 hypothetical protein
MVVLCFDFFIVYQNMSHCSIKSLQKNYNYATGSRILKTAGFIGILTVFQL
PVRYLLTTLFMDMAFRYVRWPDCELVSMKGV
>CT1404 hypothetical protein
MIVFDAAKRSSFAGENNRPEPVIAHFSYIGRKSTAG
>CT2139 hypothetical protein
MKEVITFKSGKFIFCDQYDKPRIEKLLVEASTLNSAISDLPILPKWSSQI
DPELLYSSVAGTAAIEGNSLSADEVRELDDGKIPDAGHTAKDRLEITNLI
GAYRWLDEQKANFATSRLLTEEHIRDLHRQITSGLPYEDNIPGTYRNGMV
KVGNKAHGGIYTPPKIIEDVEMLMREFIDWIDSDDLLNENVFVQAALAHF
HFSLIHPFWDGNGRTARLIEAMLLQAAGIRYVPKMLSNYYYRHVDDYYRA
FSDTIRASKDVTPFLEFNLHGVIESLQQMKNRIANFIRVLAMRDYLHFLV
SNKAITKRQNDLLALLLDDPSGKPFTLHELQRAMPYAMLYRKVSEMTARR
DLKNLLERKLLVVDADNRYSLNLRGFDS
>CT1519 conserved hypothetical protein
MRTRFWIFSMIALLTLAGCSNYRVVSDYDRTIPFERYKTYRWSDKGSAGI
SDDILANNPLIYKNIKSVVDRELATKGFVLKASGPVDFTVFPHARVRERV
VIEPSGFFGYGCGYCPGWGWRSYPPYWYDPYPYPVFSHYEEGTLIIDIID
SRSGEVAWAGIARGILKDYDSSVQMNRDLDEVLTKIMAQFPPMVK
>CT1594 hypothetical protein
MSRSSGNTPHYRHRSLSCAISNCSTVTPTSSTLMSTSPTIAALLDESIQL
ELNLAKLYTLFNDHFEEDEEFWWQLSMEERSHAALLQQEKKQPQPLQFFP
ENLLAKDLDALKANNARIIAETERFAISPFSREEALNLALHIEMSAGEAH
FQEFMESETGSLTADLLQQLASEDQNHAKRIREYMKEQGVKEKKQA
>CT0844 hypothetical protein
MPSSVLFLFLFRNLTIVFIEEHLGSAFRESVTHIDIKGGA
>CT0703 trans-sulfuration enzyme family protein
MAKVGDQIVRIFDADRQTDEAGRHPLFTEPDANYHGLRWAIDLPEPLAPI
AFALRVRTVPLRNLGAAISPDNSWIFIQGVETLPVRMIRHSENALKVAGH
LKNHPKVAWVRYPGLPDDPSYALASRDLKRGFGGMVVFGVKGGYDAAVKI
IDTIDLFSHLANVGDAKSLILHPASTSHSQMTEEQRVASGLSSDLIRLSI
GLEHPDDLIAALDDVLAGV
>CT1904.1 RNA-binding protein
MNIYIGNLDYNVTEADLSGAFGEFGTVSKANVIIDKFTGRSKGFGFVEMP
DDAEANEAISQLNESSLNGRKIRVNEAKPREERPAARPRY
>CT1319 hypothetical protein
MPRLSSRGQSINKKPGFSPISFRPHPYPLVISTITGDTAIHAIHFRSRIR
>CT1492 thiolredoxin peroxidase
MSVLVGRPAPDFNAAAVVNGSTFVDSCQLSAYRGKYVVLFFYPLDFTFVC
PTELHAFQEKLDEFKKRNVEVLGCSVDSKFSHFAWLNTPRSKGGIQGVTY
PLISDINKTIAKDYDVLTPDGSVALRGLFLIDKEGIVRHQVVNDLGIGRN
IDEVLRIVDALQFTEEFGEVCPANWNKGDKTMKPTDEGLKEYFAE
>CT0526 hypothetical protein
MLDSREYCRCSVMFLIPGYSFTASYQRFSCYPPSAPTHNHRKQI
>CT1754 receptor, putative
MICRECRSDSFFIHRYNCKNISFMNVLNSRSCRILMVAAIIGSASSQAWA
ADTVPGWTADEIVITATRTENPVSKLPMAVEVITRQEIEESGSLNLADVL
AEAEDVNALEPVNGRLGVAKLRGLGSSLTLVLIDGYRLQSGFQGYSDLRE
IPAGMIERIEIVRGSGSALYGSDAVGGVINIITRKPTKDLHGGLSISGGE
SRAGEAGTVETDGWVSGSAGKLGFAVAGTYYDRDRYDRDQSDLMTDGDDR
RIASGSASLTCDLTPGVKLTGGIVYADNSLDGIRTQNSGDFDRWVDSDRL
LVHAGAEIKTGEESNLSLKVARSTYDWRSDMDNHDGVPTVTSATASGTTT
TTETTMASRTKVSQDYDQFDARWTGRIAEVHRLTTGVEYRTETRTDSGST
VTTKVVTKTGLVNSGPTTMVATDKTDISHDVDNLGLYLQDEFTGLKPLTI
IAGVRYDDHSDFGSEYSPKVAVLLPVDSHLKLRASYGEGFRAPSIYELYT
GSLTTRRSIVLSNPDLKAERSKTWEVGADFSRGGFNAGVTAFRNEVRNMI
SLVLADDTTTPDTYQYQNLSKAMTRGIEISASLALPHGFTLSDRVSFLDT
ENLDTGEALFFAPDVANVLRLDYANSRFGLKGNVRVVSTGTQYSSADEKI
SGYTLVNCYLSKSVSKHAELFAGVDNLFNDDANTGYGNNEGAGAMGTYFY
GGLNFKL
>CT1784 GTP-binding protein
MKPLIALVGRPNVGKSTLFNRILRQKSAIVDPTPGVTRDRHISPGEWQGK
QFLLMDTGGYAPENDTLSKAMLEQTMRAIEDADAVIFIVDARSGLTYLDL
DIAKILQKTFKDKKIFFVANKVDNPQVALEAQSLVKSGFTEPYLISARDG
AGVADMLEDVLNSLPCPEGEEIEEDDSIKLAVLGRPNVGKSSLVNALLGT
ERHIVSDVPGTTRDAIDSVLKRNGEEYVLIDTAGLRKRTKIDAGIEFYSS
LRTARAIERCDVALVLLDARLGLESQDMKIIHMAIERKKGVLILVNKWDL
VEKDSKTSKAFTDNLQNQLGNIGYIPVIFTSALTKKNCYRAIDTAAEIAL
NRRQKISTSNLNRFLQETLTMRHPATKSGKELKIKYMTQIDSDHPVFAFF
CNDPELLENNFRRFLEKRLRESFDFAGIPITMRFLRK
>CT0984 hypothetical protein
MKKTAKLIALAAVLFAGFGSTSAKADEGFKIGADVVSSYVWRGAEIGDSP
AIQPNLSYTFKNGLNVGLWGSYAIEKNTPRINNSDYRYKEVDLTVSMPVG
PVTFAVTDYYVPVEGGETNTFDFGKDSANTVEVSGTYTYKNASLMAGVFV
GGNDYDNAWYCEANYKFYDKNGYTAKATAGLGNEGYYGDGEGKKLALVNT
GISISKDRYTASAIYNPDTEKSYLVFMASF
>CT1466 TIR domain protein
MSPKVFVSHASEDKDRFVLQFAERLRQKGIDAWLDKWEMLPGDSLVDKIF
EEGIKEAKAVIVVLSKFSVEKPWVREELNAAFVKRINNGSKLIPIVIDDC
EVPEALKSTLWEPIADLSAYDKSFDRIVASIYGANDRPPIGPQPEYVQSF
VQAIGNLNNIDSLVLRFSCEEVLKTGNAFVNPERVFLKDDKPILPEDELK
DSLEILDGGGYIKLMRTLGGGFFPYQITTYGFDVYANASIPDYQGKIAAV
VSAIVNEKLMSNAKIQERLKENKIIVDHILNVLENKGHIKQSKMIGGLSE
IFNVSPSLKRALSGG
>CT2103 hypothetical protein
MKPMYYLVAAALSIMLSIYVFIFGTWANSQLVAIFIGLWAPTIICLGIFN
ILMNIHDEMCCAHKRIEGRQTGHDRCGGG
>CT1985 hypothetical protein
MANKEMKHPGNVPGPFYVTSPDDPDGEGCSACTICYNAAPDFFAEDPDGY
AYVAQQPQSDADIALCREQIAACPTNSIGDDG
>CT2052 hypothetical protein
MNKASTMTDTFNYTTIFAPLGFFIGGIFLVLLLNKFIGKSQNKKSS
>CT1052 peptidase, M20/M25/M40 family
MHNEAFSTIAARVREAAHNLYPEVAALRRHLHQHPELSYQEFQTTAFIKK
YLSGLGIEAEPPLMETGVIALLRGEGAPPSGERRTVALRADIDALPLQEE
NGHDFCSTVERCMHACGHDMHTAMLLGAATVLSGMKDALNGDVLLIFQPA
EEKAPGGAKPLIEAGLLKKYKPSAIFAQHCFPSVKSGSIAMCKGGFMAAA
DELYVTIHGQGGHASAPHKTRDPILASAHIITALQHLVSRVAPPHESAVL
SIASISGGHATNIIPGNVTMMGTMRTMNEELRALLHKKFEKTVRQVADAF
DVEAEVEIRRGYPVLYNDPAMTDLAWEAGKEYLGDGNVRQSEPVMTAEDF
AYYLQECPGSFWQLGTGLPDSAPGNLLHSPTFDPDEHALETGMGMMSYLA
LRFLAG
>CT2003 hypothetical protein
MDRHPEIVTRERVNKTKKSGPMMNSGRSFFMKSGQS
>CT0790 hypothetical protein
MPDIRIKPTRSTLTMMRIASFIVALVGAGMIAGVVSNGLLEEGGVFVMVW
IAACVGIVGFAIFSAFNRKVIEDGIVSIEDAGESQSASDAQSAEARLKTL
DGLKRQKLISDDEYQKQREKIIQSV
>CT0961 DedA family protein
MPETSSMLESVVAYLQQAEPSSVYAVLFLSAYFENVIPPIPGDVPIALAG
YLLTFSHITFVAALFWSTIGSVAGFMTVFLLSRFLGLKLYAVGECEARHK
FAQSIHKLFPPSEMEVVRQKFSGHGYLAVVVNRFLFGSRAVICIVAGMLH
LKIPLVLAASFVSSLLWNILLLSTGYLLGSNWDKIGQYAVLYTAPFTILF
TGFIVWKVVAYLKKRKQQADGA
>CT1734 cytochrome c, putative
MTFRFSPAILIGCTAISLATAYSTNAFAQVTGGQELYERHCSACHSMLPP
PKSAPPVAGLSYFYHKAFADREQGVRHIMEFVAKPAVEKSKLRPPAISRF
GLMPAVELDARDLRTVSEWLWDSYDPKFQPPDCPAK
>CT0656 DNA-binding response regulator
MAETSILIVEDDRNLAGLLKYNLEKAGYGCIHAASGEEALDELQRHAVNL
VLLDIMLPGIDGFEVCRRIRQNVQWSDLPIVMLTAKGEEIDKVFGFELGI
DDYVVKPFSPRELNLRIRAILKRDRRNRSNVQEVLRSGGIELDIGRHEAT
LDGRPLVLTLMEFKLLALLMKRKGQAQTREVLLSDVWDVDKSINTRTIDT
HVTRLREKLGDAGRFIRTVRGLGYKFDENGDNLNES
>CT2083 conserved hypothetical protein
MLVCDFKTTCLNWVNVDDVVMGGVSNSAMQLTQDGTAVFAGNLSLENSGG
FASVRTVLERRNYADFAGFRIRVKGDGKRYSFRARNDERFDGVVYKFDFE
TVPDEWMEIDLSFAGFIPSFRGRTLVDVPPLDSSNIVQIGLLVSNKQAGA
FWLEIAWIEAYRADTVASSFR
>CT0745 fibronectin-binding protein, putative
MLRNYFTLYHAAAELHDRLADGYLFEIHSQQKNELTLAFVTQEGEHLQLI
VTVRSPHFSLYTREGLNRKNRNTASIMTKVYERQVTGVAISPADRQISLA
LDDGHALVLRMFSAETNMLLVRDGVIVDAFKDARKLENTPFGETTAGTPY
FRALEQLASDPALFRRKLEEGDSTVPLDRRLLAMLPGFDHKLVRRLLAIT
ESDATEQLYSAFVAIFYDLASPTPCVIENPGEPPAFSLFPPAPEGEATTF
ESVIDALNHYSRKMYRHLHLRERAVAMRRELTAKIGKLEKELAANAGHDP
EEISQRNERYGHLLTGAIGAIEPLDAAVTVPNLFAPGSPDVTIPLKRGLN
LQENAAWYFTQAKKNRKKAAALKLRRAELRAELDHLRQKLATLDEAESTD
RLQQALETSSSSGRRASGNSKNKKEKMPPFKTIPISDKITLYIGRNSANN
EKLTFGFAKPDDLWLHARGASGSHCVLKGATMRHTSEIQRAAEIAAWHSS
AKHSELVPVICTQKKYLKKDHKTPGNVIIEREQVLMVKPLRE
>CT0902 hypothetical protein
MKLVHRFPVFKSILAAFALLVVQLRAAVAEPLSFSTGQTVSVPSYSHIFV
GNRLKTFDLTTSLAIRNSDPETPITVTRVDYFDASGRFVRAMMKTPLVIR
PVSTLVYVIDESDKTGGVGASFLVS
>CT0596 hypothetical protein
MQQTLSLPSIHHNQRKNAALLAIRISNSFLALSPQQKSNCIVRFYPAKDE
LIF
>CT1434 hypothetical protein
MASKDLDKVEEELKAAPNREIPPINASYIEHQPALFSHSWPLLAHPDKGM
IFFSGDSANTMNVFDQFMVSRGLYYGTSGLKAQPGSMRIFTTAEMAPGAK
GRPKAFDKETKKGFSDHFPVEMVVDIV
>CT1447 serine protease
MKKKLTMLKSAALVSVGITAGALAFSNLDFSFNGGGNNGFMVSSHPNSSI
AAETLRNHPIRTLNDLNDAFVDIAESATPSVVTIYTETEVDRRIMTPFDF
FGKSFGEMFDFPLPEEPNVRKEVIHGLGSGVIVSQDGYILTNNHVIDQAG
SIAVMTSDNRKFKAKIVGTDPRTDLAVLKISGSGLKPIAFGDSDKLRVGE
WVLAIGSPLGENLARTVTQGIVSAKGRVNVGVADYENFIQTDAAINPGNS
GGPLVNIGGELVGINTAIASRTGGFEGIGFAVPSNMAYRVYTSLVKNGKV
ERGYLGVTIQDIDENIAKGLQLKSPEGVLVGTVMQGGPAARAGLKSGDVI
LEFNGRKVNSAAELRNRIAAMAPGSSAAIRINRDGAILTLNARLESLPDN
ATASARSTESKNELLGFSVAPLTPELAGRLNLKADSRRIVVTSVSKSSRA
FSVGLRPGDVVISVDKKPVDSVAAFNAIVKNKKRGDLLFLLVERGWSRMY
FAFNL
>CT0832 hypothetical protein
MMGNFTGTALRFSRIVLRDGNDKADCSRRVHNLRSLQACRL
>CT0845 membrane protein, putative
MKHVDTEIPTEEQIHEAEKFGFHGQVQAAKIRFDRYFPGVLASITVAAAA
TFLSDHYGAPTMLFALLIGMAFRFLSEDESRALVGIQFASTTVLRIGVAL
LGMRITLGQIQSLGVKPVVMVFFSVLLTILFGLALSKMMGRGKRFGVLTG
GSVGICGASAALAIAAVLPQDEYSERNTIFTVISITALSTLAMIAYPVVA
QWFGLDHQAAGIFLGGTIHDVAQVVGAGYSVSEQTGDTATVIKLLRVSML
VPVVFILSLIFHKRNQKDGNAPRRTLLPPFIIFFVLFVGINSLGVVPKPA
TQFINDVSRWCLVTAIGALGMKTSLKSLFEVGWKPVSIMIAETVFLAVLV
LGSVVWMS
>CT0428 hypothetical protein
MLKMYVDYWVAVLSGFLQQYFGVKSQKGVTMIEYALIAALIAVAVIAVLL
TVGSNLKTVFSYVGSNLTT
>CT1774 oxidoreductase, short-chain dehydrogenase/reductase family
MEAGRIDAADTPAELVELVYASRLLGKDDRLVMHGGGNTSVKCELTDFIG
NHVNVIFIKASGVNLASVDAGDFTPVRIDPLRKLQKMYESGQRHSEEDIR
RFSTREFKNFLYLNLFTLTDHMVSRSLSPSIETLLHAFLPHRYILHTHSL
ALLTLSNQTDGERLCREALGDGYGQVPYIQPGLGLANLAHDAYEKNPSIE
GLVLHKHGLVTFGDSAKEAYDRMIDGVNRIEERIASAARKVFASAPMPTA
IASVEEVAPIVRGACSFEKTPGEKDYQSFVLEFRTSPALLDYLKIADLEA
FSKKGAMTPDFIIRTKNHPLVAPAPDAADLEGFGKELRARAKRFTEEYRS
YFERQQQATGMDVSMIDPMPRVVLVPGLGLFGLGLSAADAKLTADIAEHS
AVAMLDAESIGCFESISEKEAFEIEYWDMEQAKVRKSHNGGEFAGKVALV
TGGAGAIGLATAKAFKAKGAEIVIMDIDPAALEKAAAELGSGTLTIPCDV
TNAAAVREAFDTVCRTFGGLDIVVSNVGVAWQGRIGDVSDELLRRSFELN
FFSHQTVSQNAVRIMRRQGIGGVLLYNVSKQAVNPGPDFGPYGLPKAATL
FLLRQYALDHGRDGIRANGVNADRIRSGLLTPEMIKARSAARGLSERDYM
AGNLLGLEVSAEDVADAFVHLALETKTTGSITTVDGGNIAAALR
>CT1606 DNA polymerase III, delta subunit, putative
MSWNSIVGHEPQLRVLKTALGANRLAHAYLFTGPEGSGKESVAFELAKIL
NCRSSGNLSGEGSCGECESCRQTDLLMHPNIEYLFPVEAALLETIDPSKK
ENKKLTEARERYEALLDEKRKNPFFTPAMERSMGILTEQVVMLQQKASLA
PRDGGKKVFIISQAERLHPTAANKLLKLLEEPPAHVVFILVSSRPESVLP
TIRSRCQLLNFARPRPAEIEAWIARRAPQLDKTERHFIVSLSRGNLCSAL
ELIEAETGEGAAPAVVGIRNKAIDYLRNILVPAKFHEAIGTCEELAKSST
RTEQLIFLDALLLFFQDVTRRSIDHAFPELNNPDIAANTDRFVKAFPNRD
LYRASTAIEEAMRSINRNASVLLVMAGLTAELRGILQGKR
>CT0256 isoprenyl synthetase
MSSPITQAQVESKYRQYHAKINEALAACFPKEKPATLYDPARYILEGKGK
RIRPFLTLLAAEAVSGKSDNALGVALGIEVLHNFTLMHDDIMDQADLRHG
RPTVHKQWNVNAAILSGDMMIAYAYELALKAISSRHAEIIHIFNDANITI
CEGQALDMELEQRKDVTIADYLDMISKKTGRLISAALEAGGVAGDGTPEQ
IAALVTFGEKIGRAFQIQDDYLDIMAGDGKSGKVPGGDVINGKKTWLLLR
SLELAEGADRELLQSIFDNNGTSPDNVPAVKAIFEKCGVLNETRAKINED
TEAALAALDALPFEEGRGYLRGFANILMKRDF
>CT1435 hypothetical protein
MEQLLSSLISVWSDISPMQKVIVLSAVGVMSVVWWIRHLDKEAGAD
>CT1737 iron-dependent repressor
MLNQTVSVMTRKRSGMTERTSGAVGGKPEKLSESSEMYLQVIWRLTERER
EVSVSDIAKAIGHSLSTVSEKIVRLTEAGLLRHEWREGVSMTPKGRQAAC
RTLRKRRLVETFLFKMAGYGIHELHEEACRLEHVISDRLSDALDRLLGYP
MHDPHGHPIPAHDGALRSELLEPLSTVDEGRTVRIAQLRSADPEVLEYIA
QLGLLPGRSCTVMQKAPFQGPLTIADGSERIAIAFEIASIIDVQAEPDER
FDEAGGGPEPFAPPARQSAAQ
>CT2069 hypothetical protein
MQGYTPLNQREGGVMSQFRVGDSIIYHKPKSSVSPGPRARQVYALEHGEH
YHYVVDKFWKVTAVNGDGTIEVITRTGKTHRLPVNDPNISKAQPLQQLFH
RKRFPN
>CT0716 glutathione-regulated potassium-efflux system protein KefC, putative
MILLCFPGSEQGIGMHQFEFLGQLVLIGALAIVNILVFQKIRIPPVIGLI
FTGIMLGPTGFHVIRNSGLISTLAEMGVVLLLFTIGLEFSADDLKKLRKI
VLFGGTAQILLTGLVIAMFSYWLMDAIGKSVGSKEALVLGFSFSVSSTAL
CLKILSDRGELGFDHGKIALGILIFQDMAIVPLMFGFSFLTRGSSMPLES
SFEEIALLLLFAIGMFGGFRLLMPKIVRIITELHAGEVLVLGALVLCFGA
AWLASLIGLSLALGAFMAGMVIASTDGSHRISRTIDPFREAMTSIFFVSV
GLLLDVNMIELPWLIAIALVVLVVKGLIMTGILMALGFSLRVSLMSGMVL
AQIGEFSFVLAGTAKDAGLLDQHMFQSMLAVIVVTMIVTPALISAAPKFA
AQVAPALGFMPLASKPEPKQPARAAAGPIVCRGEIHAAIIGFGLIGRNIA
AVMNATNLNYTVLDTDRKTVKTMRRQGEPLFYGDCTERKSLLRIGVDHSR
AVVICIPEIDAAIQCIRLVREINPGAFIIVRSRSLESTNRFYRAGADAAV
TELFETSIQMFSELLKHFRVEPETILAQQEIIRREGGNIFRELVTETDGS
GDDPAKKSQNGAGFVTSET
>CT2147 conserved hypothetical protein
MTNAFKGIPEELLKPHASPCVSIFMPTSRTFPDNTQDPVRFKNLVSRAEA
DGIAFSTKREMAPLIERLRLLQDDASFWNHTLDGLAVFISPDYFRIFRLQ
QSVLEQAHVTDAFYIRPLIRIYQIVERFQVLALTRSEVKLYEGTRDHLDE
IELAPEVPKTMTDALGTEITPPHMTIASYGGTGTAMRHGHSSRKDEEALD
NERFFRAVDQGINEYHSSSSGLPLVLVALPEHQGLFRSISRNQRLVAEGI
EIDPAALGLEAMRQKAWQVMEPYRERKIDQMIARFREAEGGKLGSDNPYA
IAIAAVAGNVSHLLLDGQKFWPGQIDPVSGDILLDEASQASGRDVFEELG
AAVLARGGEVLVLPSERMPSASGVAAIFRHD
>CT0828 hypothetical protein
MPRCLWRKVFRHCLQDTPLLAAGFFIKKIKYRRAMKKITPLLLLASCMLS
SPVLADEPTTTVGVDNNQDVSNCSSVSQAGATSVAGVGVGNWSQIFEASH
PIPYLPGTPGVTANAPTLFSMQGLPAQVKGLSLLTQNLYNANYHDVAIGS
SQGTKIIFNASYPAPKPEKKNRNVYVNLDGVARGEVVGSLTVQSRKDKAE
EVDFATLLYDARQYIAANHKLDGYDVTLLTVPNTVSYSMGVDGKASGMTV
APLVSGLINGPLGAMTALSTGFSRNGGITVPTARIGVTFLVLVDSGKSQV
VDLREYYNMLEKGSTNGNGNGNNKKKYEAIQPKESAE
>CT0801 conserved hypothetical protein
MPERFRPVAFARATDGIEAEAKRAADEGLTPLGEPLVGAGGKKIVFLHPR
QTNRALVEFVEPKGTH
>CT1217 hypothetical protein
MSVHSTLQLASDAIEDARKRLERARVDADDDYEIRQALRHLEEASDYLRK
VSTELKQHG
>CT0493 conserved hypothetical protein
MTPLQKAYRYKFLSQCLAYPNEAFIPALNEVLEKIDADRDPRQTLVAAFE
REETEPLQAEYTRLFLNGYPHTICPPYESVYLEKRMHGDAAVSVAAAYTE
WEISVEPGLIDHLATELEFLAFLASAESLDNTVSENASKASKAFMQQHVT
RWVPQFIEDLKAGATMDCYRMLGEVMEKTLAPLSPKS
>CT1030 conserved hypothetical protein
MTTNFLAILGGFALGLFYFGGLWLTVRKGLFSPHPALLFLTSTLLRTASV
IGGFLLISSGDPVRLLFAVGGFVAAKVASIAFGRRNSAPEHRDKEETPCI
>CT0567 hypothetical protein
MNKYLLYTIALVILAFVQRFLVSKLLILHASPDILAIFIAFISMSTGQRT
GTNFGFGAGLIAGILSGDLGLSALLGTVQGFVAGFFHVPQKSHATSVKKK
RMFYAASATALIAGNLLQSLLSDPLSLPLYVRVPETVILGTLMSMMLAVL
VYHFALKKLLKD
>CT0595 conserved hypothetical protein
MEGRRVFAGLPVGRELAEAVGEFRQGHSGLRVRWVKPENLHLTMVPPWQC
LNVDAVCRALSGEAARQAPFEVSFERVSFGPDPRRPRLIWATGKAPAGMP
EFARSLRAPVGAPGEPRKSFLLHLTIARFNSHDFKAMGAHTLRETVLWYG
TLDTICLYESILKPGGAHYRELCRFGLGGKSVVGSMPPPVSA
>CT0528 hypothetical protein
MTSDDRNKRNPETLGAIGKVTLEKLASLSSKIRDDVATEADRVNHEILSE
IVALASNRVSVEKNESILKEGGSAWTERTESHYPHIRLHGGALEDDPLKM
MIR
>CT0684 type I restriction system endonuclease, putative
MAFLSEAAVEQALLDQLRDLGYGIEREEDIGPDGHRPERESHDEVVLKKR
FEAAVARLNPGLPAQALQEAVWRVMQSELPSLLEENRRLHKLMTEGVDVA
VQTVLQQAEALSSEWAVPKSRTGGARG
>CT1561 conserved hypothetical protein
MIFPCEKAVWYYLPQIRADIAKELVKTGMTQSSAAKMLGVTPAAVSQYLH
KKRGGQTIKSRLYKQEIRNAVDKLRDGAAEPELYSIVCNCCQILKNDDKE
IGGSSYGDKTAG
>CT2100 hypothetical protein
MMEGIFCTVLDLKNELLNIKKVGSSQNLVGGMKDAHMTIPDEQV
>CT0509 ribonuclease II family protein
MGYGESRQLPGFWYVLHKLQEEGVIDKDSDRCYGWAGFEADTYEEHLIEG
PHFPQPGKERYKVGQIYTGKLTTHPNGYGFVDVDGFDDDIFISADMLGQS
LHLDQVEVQVTKVPESYASRQSPHQRCEGLVVNVIERRLVTVVGTLHREN
RRFLLKPDQRKILPEIHIPLKAAKKAKAGDKVLAGELEFLKSGTIQARVI
EILGTAGESQVEVSAIARGLGIDETFEPELLTFAEKVREAITDEDLKERL
DIRDKDVFTIDPVDAKDFDDALSIETLGNGGGYKVGVHIADVSHYVPENS
ALDKEARKRATSVYLVDRVIPMLPSRLSEKVCSLNPGVDRLAFSVFFNIT
KKGEVTKFEFHKTVIHSKRRFTYEDVQQILDAGKGDYFRELQALDQLSKK
IRAQRMESGGLEFETEEVRFKLGSNGEPVEVIKKERLDSHRLIEEFMLLA
NRTVAAYLTARYAENEKNPHPVIYRVHGAPQMEKVQVLASFVRKIGFDLK
LDRKGKDSATVSSKALRELLQKVRGTNVEFLVNELVLRSMSKAVYSPLND
GHFGLGFEHYTHFTSPIRRYPDLIVHRILFEYETTRKKRRKVTPERISQI
TATITEVCQITNEREKIATEAERESIKLKQVEYMSAHLGNTYDGVITGAT
EYGIYVRMTDFAIEGLVHMRNLKDDYYEYDEATYSLVGRRKHRRLQIGQR
LKVKVHEVDLTRRTIDLTLA
>CT2022 pyruvate-formate-lyase-activating enzyme, putative
MEKKPLYHFMPGTMTWSFGTPGCNFKCANRQNWAISQMGQDKSIPLATPE
AIVRNAMNTGCSSISCTYTEPTIFAEYALDVMQLARQTGLRNIWISNGYL
SPLCLKTVTPWLDAINVGLQSMDDAFYRRVCGARLDPVLDSLRLIQESGM
HLEITTLVIPGHSSDPAMLERLAGFIAHDLGTGVPWHIIPFYPEISWKMQ
DTPPTPAESIEQAFEIGRKAGLSFIYAGNAHSDTFCDQCSARLVGRKASP
FGDYRIERFDTGGRCPVCHAPSPMRD
>CT0933 conserved hypothetical protein
MNLHTKTWVQRLPVPIDVAWDFFSQPGNLARITPPEMMLKAEGGNAGTAI
YEGMKLNFVLYPFMMIPVRWTTEIMKVSKPDFFEDRQLSGPYEQWYHRHL
FRDIEGGTEMTDIVEYALPFDLFGEVVEALIVGPRLDEVFEYRRCRVAEI
LGEMASETR
>CT1765 molybdenum-pterin-binding protein
MNISARNIFKGSISSIVKGAVNAEVTITLASGTPIVSIVTIGAVERLGLQ
EGMAASAIIKASTVILGTNLHDAKMSARNILCGTVTRVIDGPVSCEVDLE
IGVGEVLSAVITHGSAEKLGFAEGSHACAIFKASSVIIGVD
>CT0517 tia invasion determinant-related protein
MKKYAITSIAAAMLSAPAITATADPLYISLSGGLNLMSNSDAKVSDTETS
IKNAVEYKRGYALEGAFGEKTGVFRGEIAVGYQSSDVDKVLGSDIVEQLG
EIDNYEDLTVTASALTVMYNVYADYDMKGILSPYLMGGLGAAFVDMGTSF
KVDGVEYDSSYDKTVFAWQLGAGLGIKITNNVALDLGYRYFKTGDLDLEN
KTKLSFGGSKILLGMRYNL
>CT1681 ABC transporter, permease protein
MESLAVVFILQVLRISVPYLFASVGAIFSERGGVINLALEGLILAGAFGA
MLGQYLTGSAWAGIGFALALGLVVSLLHAFVTITLRADQIVSGIAINILV
MGATRFGLGLLFGSAMNSARIAGMEVSVPLFDPLLVIAVFTVGVAQFVLF
RTPYGLRLRAAGESAKAVETAGLDVRRLRYSGVLISGALAALGGVFLAFQ
QHSFTDNMSAGRGYIALAAMIIGRWSPAGAALASLLFAAAEAMSMWLPSG
WLPSQLVQSLPYLITLLVLAGFVGKSAPPKELGVPYEPE
>CT2197 hypothetical protein
MCLRSFDRNLKQTHSKSSNKKNNPAFILDR
>CT0022 hypothetical protein
MILLLVSENLEHQINTVKRDEGYHEIDRLDHTQQIDDQHQRHDNQKPECD
TAENDERENLRLLVKSVLKEQVPREAVENNHKPGNENRVDVNRIVCRAPV
NTPP
>CT1255 spoU rRNA methylase family protein
MGRLTPEAYADSARHPVTLMLYNIRSMWNVGAMFRTADAAGIDEIVITGY
TATPPRKEIDKTALGAQETVPWRHFADPVEAITVLKRESKKIFGLEIAEN
SRSYSSLTAGDFPLALIVGNEVEGIGDALLSHCDGVIEIPQYGVKHSLNV
AVAAGVALFECVRVFRKNG
>CT0863 hypothetical protein
MTVFRETGIPCFFISLISNGFVNRGHSLQIFG
>CT1622 DNA helicase, putative
MMGVPQRDLLTRLLDYIEEQAKAINPRAFRLSNTSEFLRYRLDLAGLPGV
EFDISVEGDHFWLRVDRLDAYKPPLIDEKNRDFIRIGDDPVGAKPSIDDE
ALKAHFFAAAEGKTAEEIEALERQQRESLDEALAQYSALWEAWAESEKPR
RQTIDLYGALFAIKHQLEAEEATRPTELVWGIGIATWQLTLPESQGKVSK
IDFEYPLLTQQMEVGLDESTMALFLRPRATDTRYEGDAFASCVGRAAVEV
ERAVRQQLKQNQEYPVTPFDPGSYTDLLKLVATNMDSHGAYRPLPEGDGT
VPTPGEHLVVTDSWALFTRPRSNNFLLDDLQRLKERLTEGCDIPKGPAAL
VTLPSDEPAPFENIRFRGLSGYGPDRGSAEVRELFFPLPYNQEQVTIVKQ
LEQAEGVAVQGPPGTGKTHTIANIICHYLATGRRVLVTSKGEPALAVLQG
KIPEEVQPLTVAMLTGDRESLRQFENAINAIQARVSQLNPELTREEIERC
TSRIDRAHAELASIDRRIDEIAESQLSEVPVDGHPMRAQQLAELAVSGEA
EHGWFDDEISLAPENAPPLSDEEAGRLREARRKLGGDLVYVDANVPSADD
LLPPDDVAQLHTVLARMREIERLVDQGALPALKAATPEVLDEARQLLGAI
DAVHSILQEIDELGESWAHELRRRCRQSSFEAERQALEALFDEIAALTEA
RAAFLQRPVEIPPEALGSSKVSEAVQRGAQTGKPFGLIAFGKSEVKEAVA
KVKVAGLPPSNADDWLHVKSFMELHVRVISFITRWNQFALALSAPVLEGG
VGELRRIELVASTARKAHQLAMQHDVALVRRAEAVFAQAPVTLLHGTSEE
IAKVREHLMVHLSRADLAKAATQLSTLKSRLAGTSGPVSEKLRALVDEAL
GNEALAAERVAAEYAALLGELRRIAALNVELALVSEAANRFAQAGAPKFA
ARIRSVPVAQTGEDTALPVNWRDAWRWARVKRHLEQIESRDELVKLSARR
RDLEQGLAKLYREMVALAAWMETKLNASPRVLEALQGYATAIRRIGRGTG
PNATRHRRDARHHMTSAASAIPCWIMSHAKVSESMPADIGAFDLVIVDEA
SQSDIWALPAILRGKTILVVGDDKQVSPDAGFVSAQRVQELRDRFLAEQP
FREAMTPGSSLYDLAARVFAARQVMLREHFRCVPPIIAYSNRTFYKGAIQ
PLRIPKGSERIDPPLVDVHVENGVRSRDDSNREEASFIAEEIAALLADER
FAGRTIGVVSLLGMEQAKYIDTLARKRCNTAELFRRKFECGDARTFQGSE
RDIMFLSMVVDPGNCKALSGNMFEQRFNVAASRARDRMYLVRSVTASHLS
DLDLRRSLLHHFDKPLIADKEEAEELIDRCESGFEREVYSALVERGYRVI
PQVRTGAYRIDMVVEGAGDLRLAIECDGDAFHGPDRWPHDLARQRVLERA
GWTFWRCFASTWRLHKDEVLGELTERLSAMGIEPLGSIARAPKLVEKRSL
IVSVYEETPH
>CT0412 iron(III) ABC transporter, ATP-binding protein, putative
MSDSMIQTRNLAIGYQLRQREFIAWRKPSHPASTVVAGSINLDIASGELV
CLLGPNGSGKSTLMRTLAGVQQPLSGSVLLKGRDMAGLHPKEIAKLLSLV
LTDRVMTGNMSVYALVALGRYPYTGWMGTLSDADEELVRRAIETTGTRAF
AHRHIGELSDGERQKVMIARAIAQDTPVILLDEPTAHLDLPNRLEIISLL
KRLAREEGKAVVLSTHELDLALQASDRIWLMKRSGDRVCSIVSAIPESLV
LEGHLEKAFSRNGFEFDQYSGSFRFRHEGSSQVGLLGDGVTAYWTRRALE
RAGCRVVQGASNIRHVEISGDESRNMQWRFFPGENGEPVVTGSLDELLRA
VVRSSKEKEVS
>CT0710 hypothetical protein
MTIMYEKGGGMAGSALQTWEKVLEYASVPLHGTMSRKIRKGVKLQINEGT
VYENAVLFISDLFLRVTEDSADTSVNTYYSIDSIASIRTYSTKE
>CT0921 hypothetical protein
MLPQIALVESRLPHRDDEGPSVGSPEIVHSPSGDLPIAASLQSTPACSNS
RTGGGNSLEAPSMHQQLLKASRRRRFRGSAR
>CT0705 hypothetical protein
MMTIPHEPAYWLLTAGCLCIASIPALMRSIAKGKAIAIVALWLALCITTL
WQFGLLPGVATTLISAVWGVILLVASLIFSGIKSMPNQRFEKR
>CT1262 hypothetical protein
MPRLHEKAFLSLLGLLLQPIKKEALSTASFFIWKTEGVNQP
>CT1061 conserved hypothetical protein
MRHRITTLRCLFMAIVAIYGSLALFPAKASCQNDKKLEVPVRTQQSDNDH
RGITVQTSDLDDGVTGVVGKVYIEASPKHVWAAITDYNNHKSFVPKLIDS
GLISDNGREQVMFERGKTGIFLFRKTVYIKLSLQGEYPKRLDFHQIEGDF
KVYEGDWLIERASDGKGSILTFRAKIKPDFFAPAMFVRKVQQNDLPMVLA
AMKKRAESAEGSLRVARTSSLKQSTQPSADSAIAD
>CT1131 CRISPR-associated protein Cas4
MYAETDFIALSALQHYVFCPRQCALIHLEQVWSENLYTAEGREMHERADS
AVTSYREGVKVTRSVPLRSAVLGVSGVADVVEWHRREGGFEPFPVEYKRG
KPKKHDADKVQLCAQAMCLEEMRSCAIPSGALFYGQTMRRLDVVFDEVLR
SKTVAAAAGVHGLFSLGVTPHPEFGPKCKLCSLVDECLPEVLERHGSAKR
YVQKLYRELSGEEI
>CT1276 conserved hypothetical protein
MKIIIPLDEYSGPGSQVCDHFGSAPFFATVDMVSGEVSIIDNQNAHHDHG
QCTPADSLADMGASAIVVKGIGPRAAAKMQSLGMDAYMAGSARTLADVIE
QFGSGILNKLDVQQTCQGHGCG
>CT0273 polysaccharide biosynthesis protein
MFSKLKLLAKDTVIYGASTILARSLNYVLVPLYANKLTTFDNGIQAVIYA
NIALANVIFTYGLETSYLKVASDVIKRNEDERPLFSTAFFSLFVTSILFS
ALMLLFAPSIAVAIGLAPESGVFIRYAAAILFLDTLLVVPFAELRLKRKA
IPFALAKVMGVVGGVISTFVLILGLHAGLSGVFIGEALGSVVSLLFILPV
LKNLKPTFSPGMCRQLLGIGLPYVPTGIAGLLIHLIDRNLLIRIPQQDID
RLYGAGFQASDITGIYGRVAAFGVALQMFIQIFRFAWQPFFLQHADDPEA
KPLFKQVMNLSGIAVIVLAVACTFFVPDLVRYHWGGKLYLLPPKYWMGMS
ILPWIFFSYVFDMISTNLSAGILITGKTKYLPVVTFAGAAVTTLGCWILI
PLGGMDGAAVAILLGAAVMCLCMGWYSVRFYPISYDWGRLSLLLGAGLAF
AVWHDDLLVWLAGFGISGLLAMMVKLLIVLLYLVLGTLIFRNEASAVVKM
VQRKLRPAGSSGSR
>CT2228 hypothetical protein
MMVCVVFNGRFSIALQALSPESKKKMEDRMMKMRGCSDDAASLDPETAGN
VRLGIIDRIRPADGSARIDFVGA
>CT0017 hypothetical protein
MRHFRVFLRKPKRITSAAQPLETGSSGLHFSAVNPRQAPASFFPSLRRFT
VRLTAARQDQLVSYFFTCFKGVSMSQSVMKKSVLRILPGLLCLALPLSSC
SSSKSPKATMTATPVELRYREATEKIAKRKYNDAIVILESLMFSTRATAL
EDDVLKALADSYYKKKEYILAADTYRRLLQQTPDSPYARDAQFMLAKSYE
KLSPFHELDQEYTVKAINEFETYLDQYPSDDSAQAANDLELYKNLMKVNP
DNASYREKYEAAKEELASGSPARYSQKAISELRERLAHNRFSIARQYFKL
KKYRAAEIFYDVVINQYPDTKWLESAWIGKIDSEIKQNNWFEARQSIETF
QQLYPDKAKLIEPAAKRVTAHYSNKRDPKSKE
>CT0923 amino acid permease
MKNSFRKKPLSLLLEEMKSEHRLNRVLGPLALTSLGVGAIIGTGIFVLIG
VAAHDKAGPAVTLSFALAGLACVFAALCYAEFASMAPVAGSAYTYAYATL
GELFAWIIGWDLILEYAVASATVAHGWSHYFQDFMGIFGLHIPEIFSRAP
LDFDPATGSLVLTGSMFDLPAVIIVLIVTVILVKGIRESAGFNTAMVIVK
VAIVLLVIVLGAQYVKPENWQPFAPFGYSGLSVFGHTILGETGAGGAPVG
VLAGAAMIFFAYIGFDSISTHAEEARRPERDVPIGIIASLIICTMLYVAV
AAVITGMVPYDQINIDAPVSYAFKQVGLDWAQFLVSLGAITGITSVLLVM
MLSQPRIFLAMARDGLLPNKFFGVVHPKFKTPWNATILTGIFVAILGAFL
PLRLLAELVNIGTLFAFVVVCAAVLIMRRTNPDAERPFRAPLVPFVPIAG
ILTCLLLMFSLPAENWWRLIVWLLIGFCIYFFYGRHHSVLNQHHD
>CT2269 conserved hypothetical protein
MHIVDLSYPVTADMPLWPGTPAPNFSDLHTVGRDGFGERWLQLSSHTGTH
LDAPAHLFEGAVSLDRLPVDHFIGKGALLDLRDAQPEPLSLDQLLLQRAT
IESAEFLLLHTGWSRFWGTAAYDRGYPVFAEEAAAWLAGLGLKGVGIDAP
SFDDPDSEELPIHRRLLGAGFVLIENLTALDRLGGHEFFLSVLPLPIAGA
EACPVRAVALIASFAANQPI
>CT1635 site-specific recombinase, phage/XerD family
MKRSAPVQFAKKDVANCRWLGEFLVHLESTRNVSAKTVTAYTTDLIQFFE
FLTDESGHQEMSAVDPELVEVADVRRFMGDLLDSGIKPRSIARKLASVKS
FYRFLLDTGKIERSPLSLVLTPRLERKIPDFLSEEEASRLFDQLVLSDQE
SVGPEQGQKAAVQRFELARDRAVLELLYGCGLRLSEVTGLENADVDLVHG
FLKVTGKGRKQRIVPFGEPAAEALRNYFEVRRNFFRILKEGAGETSKVFV
TAKGRQIYPMLVQRMTKRYLSPVTESARKNPHMLRHTFATHMLNGGADLK
SVSEMLGHSNLTTTELYTHVTFNRLRDAYTKAHPRA
>CT1342 lipoate-protein ligase A, putative
MRAFTDGSFQREFGGGCCLWRFYAWSPPAVSLGRNQNPAEIDRERCRAEG
VDVVVRPTGGRAVFHADELTYSFFADTLLPNEVIYQMVHETIARALSKVG
VTAEFCRSQPDFRSRYACPESVSCFTASARYELQVDGRKIVGSAQRRNGH
VLLQHGSLPLSMRHRQLSRYLAGASRELVQAVDADMERKTASLDEFADTG
YADLVPLLIAEAGKSFGSEAKMLTLGEIGRLEGFQQSL
>CT0198 MFS transporter family protein
MEMKQEKRFFGLSGNVFFAGLVSFCMDVSSEMIYPLVPLFLASVLGVNKA
MIGLIEGSAESMASLVKVFSGWYSDRLGKRKRLMLAGYSVSTLSRPLMAL
AGGWHQVLAARLVDRFGKGVRTAPRDAIIADSTEPSWLGRAFSFHRSMDT
MGAVVGPAIAFIGLQVYHSSYRQLFWLSSLPGVLAVLIIVFFIREKRDAP
VRADAEKPRSTGGGKLDRRAWFFIVIVAVFALGNSSDAFLILRAGQLGVP
AAMIPAVYLLFNLVYSITAIPAGIVADRYGRKRLILAGFVLFAALYAGFA
VAGSPLAVWVLFALYGVFMGLTEGIQKAFLATLTPEGLKGTAFGLYAGAV
GLAALPSSLIAGVLWDRVSPSATFWFGAATALLAAALFAVFIAGIKSKPS
FGE
>CT0807 phytoene desaturase
MQGMNGEKKKVLIIGGGLAGLTAVKRLVDRGFQVKVLEKRPIYGGKVSAW
KDEEGDWIESGTHCFFGAYGVLYDLMKEIKTYHAVIWKEHQLTYTLEGGN
SFTFNTWDLPSPLHLLPAIIKNGYFTFGEMAAFSKSLIPLALQKANYPPT
QDHLTFAEWAEQKKFGKRLMDKMFRPMSLALKFIPPEEISAKIILDVTET
FYRIPDSSRMGFLKGSPQEYLHQPLVDYSTQKGAVFQNNITVDELLFDGQ
QIRGVQLRNGEILDADYYVAALPIHNLCKVLPSSLKQQDRFFGDLDRLKG
VPVISVQLWYDREISPIDNVLFSPDGVIPVYANLARTTPDYRMLRGERFE
GKTRFEFCVAPARELMALTKEEIIARVDQSVRANFPKETQGAKILKSTLV
KIPRSVYAPLPGMEKFRPTQKTPVGNLFLAGGFSQQLYYDSMGGAVMSAN
LAVDALVKAASDNGH
>CT0539 hypothetical protein
MTTEAMMKSDSLPVEITRNGDCHFAANRYVRSTCNIHHQPEFHPLTR
>CT0345 nitroreductase family protein
MEGSLSRGVEIMKLRELVARSRSIRRFDEHVAVNDATLRDLVELVCYTPS
AANRQLLRFLPVTGADMSDKVFPCLKWAGYLEDWPGPEPGERPAAALVML
CRNEDLPGAACDSGIAAQTIMLGAAEKELGGCIVAAIDRERLMASLGIPD
AWTVLLVIALGKPAETVVIDQIKPGDDIRYWRDKHGIHHVPKRQVDELLV
TAEQLRERG
>CT1687 hypothetical protein
MNLQAHGFQLREVQFFYCRGFGWLLSKDTSGIPAVFVTGRCL
>CT0760 HIT family protein
MQRMYSPWRDVYMQTFKEEKPFTPEEGKSVFADIPPEQDEERYVLCRGEF
CFAILNLYPYNCGHLMVIPYLQTPDFGDLDAQTMVEIFNMSNLCMKALKM
TIKPQGFNFGANLGRVAGGSIDEHIHFHIVPRWEGDTNFMPVLGETKVLS
NDLRQTYIQLREAIKKLQSEPKKP
>CT2048 hypothetical protein
MITASPKASWNITALIEFFSVVTAKLKTMKGMSTRIIVPLFLLLPALLGA
CGGHKKEQTPSSSTTATAPAIAAKVATVTESADAAGRKILLVPTSSVFRK
AGTDQVFVVRPDSIVTLRWVSTGHTQGASTIVLSGLDKGETVVESPSADL
REGARISTDTHTEDRAKEAQHQ
>CT0649 carotenoid isomerase, putative
MVDVIVIGAGIGGLTAAALLQERGFSTVVFEKNRFPGGSCSSFEKGGYTF
DAGASVFYGFCDDDAMGTLNLHSRIFRKLGIAVETIPDPVQIHYHMPGGF
DVPAWFDRDRFLESLCRRFPHERTGIRKFYDELESVYEILNSLPAGSLED
VIHLGVVGARHPLKVMALGVKTLFSMGNVARRYISDKELLRFIDLEAYSW
AVQDAVSTPLVNAGICLADRHHGGINYPVGGSGTIPAALVKGFEKFGGSI
RFGSEVSKVIVKNGSAVGVRLSDGTEVSAKAVVSNATVWDTFSKLVDDPR
LRIPDDRFIKAPSWFQIWLGVDGSIVPPGFHMHHIIVDDWSKYDELGGTI
YFSAPSVLDPSLAPPGKHALHLFVTAETWQWEQYEYRSQEYKAAKEAFAK
SLIARAERLLPGIQDATELMVTATPQSHARYLNRRDGSYGPLLKPGQNIL
LKPQNRTPIKNLFAAGDSTFPGQGVIAVTYSGVSCASYIARQLGKALDYL
>CT1560 hypothetical protein
MSGSSKTVKTGSAPAGFRDAALFTFVAILAGFGIELLTSGGGINVPGWPF
NLVFLLLFGAVIFTVGLMFREHPVVSWLGGIPLGLSLIVGLALLSLVGGV
LPQDKYPPDSIVALLRLNGMFSSWPFAFVTLLFLFNLGLSLVWKTIPFRV
ANLQFMLFHGGFWLALSCGLLGTAQLERLVVPLYEGKESDVAYNRESETA
IHLPFSIYLKDFQIDEYTPQFALYDPQKDQVIEPKSKLMPELRKGVKAEW
AGIGSVTVLDYLPDALPDANGTPVPVEGKKGVPFAKVRIDENGKISEQWI
SSGGPQLKPRFVPMGNAFIVMADGAPKAFRSDVTLIGAGGERKAGTLEVN
KPVDFHGWKLYQVGYDEKAGRWSTLSLVEAVRDPWLPAVYLGFFMIMAGN
VLYFWKGFKKMEEA
>CT1620 hypothetical protein
MHYEPQKLMGMGTTTETLYFDDYGRKEAVERVTESNVMGIKTYEHTMQVT
DGHTGISYEIKKTVNGKDETSKVTTKSDLREFQEMAQAMAKSLDVNELKK
NMDYREEGTETIAGVTGKKYSVAMSKEQSDARVYGVMYKNIVLKSEMGSI
STKAASIEENVVGSASKFAIPARHTVQEVNVAEEMEKAANGGSGE
>CT1472 major intrinsic protein
MVSGNDNNHHAAQRARMLLTQIAPNFLDPKHEWQRIFAELWGTFLLVLVA
AGGPVAATSSGNHAGDALLPVAPGLMVMAIIYFMGTVSGAHLNPAVTLAF
AMRRNFPWVRVPGYILAQVAGGWLAALFLGFMFGNAAVAPGMTLPGHEVT
PLKALVMEMVLTAALVNTILGTSSGARNIGTNGAIAVGGYIALAGMWAAP
VSGASMNPVRSLAPALVCGDTTLAWVYVAGPIAGALIGVVFEWILKGPPT
TAGTVAAQGTLDIDDREG
>CT0647 conserved hypothetical protein
MNCNKAMVLMSAAIDGELSAKEEEELAQHLAECPACRAEFQDAKKTKIII
KERIVRFKAPSTLIESITRLTTITS
>CT0270 carbohydrate isomerase, KpsF/GutQ family
MSERLDENFSRAIDLMLACTGKIIISGMGKSGIIGQKIAATLSSTGTTAI
FLHPAEAAHGDLGVVSEGDTVICLSKSGMTEELNFILPALRERKATIIAF
TGNPRSYLAMNADVVLDTGVEQEACPYDLAPTSSTTAMLAMGDALAICLM
KKKNFTDQEFALTHPKGSLGKQLTMRVGDVMATGDALPVVSEDAMLSDLI
LEMTSKRYGVSGVVDAEGKLTGIFTDGDLRRLVQTGESFLDKKAVEVMTP
NPKTVAPDMKAKACLELLETHRITQLMVCDEKRCPVGIVHIHDLVTLGL
>CT1849 hypothetical protein
MVVETVPGRPSKVSALLGGAVLLVLTTTLPYLTLINAFLFAGIIIAGAVA
AWYYIMRNQIRLEPGEAFVLGAMSGFIGGALSVLVAYLLEKWFGYIPGLE
SLRLLVAWATSLAPENAETFQQMLAMVTAPKDIALSDLLVSMILTGMFYA
PFAGLGGRVTVFFLKRQARKR
>CT1517 hypothetical protein
MVSNERISTILAIKNGGSFQRALTAGFRPDYQSLLALCDGEEGRNLSFFY
QIKTE
>CT1767 hypothetical protein
MEHQIPVNDTSQEQPKTGPAVHDEIPEPFRKISQKVSEAFSEFKESETWE
KMLDARDKARDYITENPVNSFFYALGAGMFLGFLLKRK
>CT0761 hypothetical protein
MRFQPKPGSLPKQICLVSDGDFETVNLRINACKS
>CT1589 hypothetical protein
MSNTKKGLATAGYNDDLAKLEEIFLPNFFYKKMQYYFSSSGLISNDVLFV
HYTSTESALDIIREKRVLMRNALHMPDRQEVQDGFNIMDGLLSNENNHWV
EFRNRIEVVLPGVVDRVMKIYSDHSHGRNDGTYFLSVLEHDESEKELGRL
SMWRAFCGQSQPVAMFLRLPALSAVSQVLRIFFNPVLYKGKGQQHLELAE
VIKNVENHKSFLERLDPDLVTSAIVSMILINVLCVKHKVFKEEREWRCVY
LPKCFTSETSARLIEPGVEEQVGASRNVYKMPLNAAIDPVLSDIDLSKIF
DSLIIGPSKSPYATYEVFCDELKKIGVSDVESKVRVTEIPVR
>CT1463 hypothetical protein
MPTSIGQKGKMKKALLITGLVASLLAVLGLWLHRSYTILKTKPPAPLTTD
VKLEQPSSLFNLPISIEHTVLADYLNGKIRGNFLNADLWLQKKHKERVSL
ALTREENITISSNGHKLFCTFPVSAEARLTDSRFGKFLAKLLVWPIHAKA
VVTFSTPIALDRNWHLKTRFKIENIRWEEEPVLKIGPFRKHIRADVDTLL
SDNKRGLTALLDAEIDKEASLYPTVSDVWKDLQKPIVLTRKPVPVWLRFH
CNDITGHISLNKRAIVCNARIMTNMRMLTDTTAISPPTPLPRFRQTPRDS
ISTISDVNFYALVPFASINRNLNDVFMNRRFSRSGYDIVVRSVEAYGSSS
GLSVAIMTDLDLKSHIVISGRPRYDIPTHTLSIDHFDYSIDTGNPIIRTR
ELILHDAIRDSISTRLDVQIGSLVDRLPTIITRAVSKAKAGRTIDLTIDS
LAIRKCDIRVGRNNIYLLVNATAKNALRIKRIKSGKVIRIRKQAETKDQN
PTSQLPRPPDTDNKSRLLPVTARPLQLTNHF
>CT1678 conserved hypothetical protein
MNDSQDTAMEKESGKSKRFGQRGEHLVIIQMALVAIYFFTPAWPDLRSGE
LYRHLALVRWIGLVAGMAGGAVFGIGGSLNIRKYLTPLPYPVEHSRLVDT
GVYALVRHPLYSSQLFAAAGWTIFSLSLTHLIVTLVALIFFNYKASKEEA
WLMERHPEYRKYASRVGKFVPGFGRLKA
>CT1598 hypothetical protein
MTQFYQASPRLVLAVLIAIVIARPLELKIFDKEIREKLKERYLADQQAQI
TRLQESFKTNYALELSMISRHQSEYDELDRDTARLREELKAEVFGDKTST
TSGIVGYGSYAKNKEAVIQSKQARLDYLARKLASLEEFVNRQKEVEGINS
NLMLSDAMLDQKASKAGFADRNWALGALTHATDDVSRSSAHAVTFITLLI
IAFECAPLIVKLLSDAGPSDVDIRESEARIIARLANPALIDSPRSVRHYN
TPSFRSMPGRRNRFSGYRRRR
>CT0106 universal stress protein family
MTTNHVKPSKIMLCPVDFSPSSERALLYAAEHCPADAELIVLYVGDAGNG
DRGTMLREHLHQFSSYSDLLSAYGCRVRFAVEYGSPGATIIEYASKTGAA
MIVLGSHGASNLGRLLVGSTAESVMRHAPCPVLVLRSPEGVNETGTVQRK
QKEAIL
>CT0237 conserved hypothetical protein
MRPQVTIYGKPECCLCDDALKVLEAVRKRIPFDIEKRDISGNADLIERYG
LSIPVIFVDGKLAFKHRIDKERLIALLKGQ
>CT0549 hypothetical protein
MLCLKSTKIGGELKINIKRVQSCYHLIFPVH
>CT0636 tolB protein, putative
MRSTRNSFACLCIMLFGMLFVPFTLRAEEVGEYIAIRKEGASRIAVVLDK
TSADGGKQREWARSLDVTINKGLDFTGLFNLLPAPLNIRNGQNGGLNFAS
IASVGGDIYAGGSVTKRSGRPVLEMHVYDSSGKSLLARTYTGEESQLRAI
GLRFCADLVELLTGKRSVFGTRIVFVANRTGNKEIYMCDFDGENVVQLTN
SRSISLTPAVSPDGTYIAWTDYTSGKPNLYIKNIATGAKVSVNKHGVCIS
PAWRPGTNTLVTTLSYEGDQDLYLIRADGTVERRLTKGGGIDVSPTFSPD
GSKMAFVSTRQGGPQIFIQDMNSGQVRRLTYSGIYNTQPSWSPNGDKILY
SSMQKSGEINIFSINVDGSGLLQLTSGSGNNEHPSWSPDGSMIVFSSTRD
GRRRLYVMNADGSNQRPLLNMQGEQQQPSWSVSK
>CT1714 membrane protein
MDRMFKTAKVETDSDHLDFRYEFARKAIHLSSLLIPLIYWHIGKKQALLI
LTPVTAGFFLVDVAKHFVPFLSTWYHSTFGTMLRHHELNRERLHLNGATW
ITLAAFALIAFFPKTIAVAAFAMVSVSDTVAALVGKRFGRHRFGQKSFEG
SLAFFVSALPVVLSIPGMIFPAAIVMAITGTITEALVLKIGVFRIDDNFS
VPLAGAIAGLCCYTWFFPEALKALAH
>CT1312 lipase, putative
MPIRSRYITLGGHRHRYIDTGGNAPVMLLLHGISSSADYYGPSMSLLARS
FRVLGLDLLGFGESDKPRTIPYTLQLYADLIHEFLWETDAFAHGEVYGTG
HSMGGKYLLATALLYPGTFKKMVLSNTDGFIVLPSFARAISLPGVRHVLK
PLVTGERIAAKMLDMAIHNRQAIDDETYRKVLQIARDHDAFETVMSLNRN
MLKLDLKRTGLRARLRELKQPVLIIWGEHDRYISPKIAHIVKRELPHAKL
LIFKDCGHSPMLEYPEQFSTAITEFIHQEPPLP
>CT1787 FtsK/SpoIIIE family protein
MLAALFSIAAVFGFHAEDEPYIVTLPWYELFSSAAKAVAGTIHNPFGLFG
ARVSVFFIRVLLGYPSVMPLFGFLVLGWHLFRAKPLGPGLFFLVYTLLMA
LDLSAMFGLSMLPLADLMSGATGRMMASFLSTVIGYPGAWALTAIIAAVL
TFYMGRDFIVDTIAGVSGFFGKLLATVQAIRAERHRKRREKEEMRVRKKA
ERMAAVLEKEQRKRDKKAQRARKAGDASKQKAAPFENSPETPAPVMDVEP
APPLLNPAVSEPVVIPAEVEEIRTPEPAPVRPEEGPEMIIKPGVQEAEAD
LDERALKVRTHDHVKYRFPSIDLLRRPKDEDESYDERHLAETKDRLLEKL
RIYKIDVIRIATTVGPRVALFELELAPEVKISRIKSLENDLAMAMASSSG
GIRIIAPIPGKNAIGVEIPISKPRPVVMRSVLQVEKFKNNSMALPIVLGK
SISNEVIVDDLAAMPHLLIAGATGAGKSVAINVLLTSLLYSKKPDEVKFV
LIDPKRVELKPYKLLKDHFLPKIPGMEEQIIVTDPQKAVSALRSVVREME
HRYELLEQCGVRNIGEYNRKMKDEAMFYLVVVVDELADLMITAGREVEEP
ITRLAQMARAVGIHLIVATQRPSVDIITGIIKANFPSRIAFQVASKVDSR
TILDVSGAEQLLGSGDMLFQSAKMSKPQRIQCPYISLSEVDAITEFIGQQ
PPLRAECMLPEPPSSSGNGSSSGFDQDRGRRDSMFEEAARLVVMHQQASV
SLLQRRLRLGFSRAGRVMDQLEQSGIVSAGDGSKPREVLVKNEDSLELLL
RNLD
>CT1956 hypothetical protein
MHIRKHSVCRNILRTFEIALFFAISVLGYGLLLKASSFNTKSKKRDRESI
VVQNAVDLNGRHRELKVLSGTLLFPNDTKAALPNRYTFTGQSFLAISPLP
ALALHLLCEKELTVNYRNRNSRISRPYNLFEQNPVLLN
>CT0726 HNH endonuclease family protein
MLLQKSKVLVLNASYEPLSICDARNAVLLLFCGKAMMVASHPEHRIRTVT
ENFPLPSIVRLMVYVRIDYRGAVLNRKNLFRRDGFRCQYCGCKDGSLTVD
HVMPKSRGGEDTWENLITACKSCNTKKGNRTPSEAGMAMLNKPCRPSNIT
LMRQHYRSISDEWKPYLFMS
>CT2196 EntD/Gsp/HetI/Sfp family protein
MMIISKEAVTLIHTDTHIAGIPEEKLFETLTDEEKEKADRFRFDNDRHNF
LLRRGLLRLLLGETLSIEPSLIRFSSTQVGKPFMTFPENTGLYFNLSHSG
RQIVYAFSKHPEMGVDIERIRTVDDIDLLARKYFSAEEYAIIVNLPSREK
NKAFIRIWSIKEALIKASGWPLEHGLAAFDVATQYRMTRFKMPFGANRSL
TCITPVFDYMCGFATALAIQLDNNEPLNLRRYSLQNGEYIEL
>CT0162 alpha oxoglutarate ferredoxin oxidoreductase, beta subunit
MTDTHTCLTAKDFTSNQEPKWCPGCGDFMVLQQLKNAMAELCLKTEEVVV
VSGIGCSSRLPYYINTYGVHGIHGRAMAMASGLKVARPDLSVWVGTGDGD
ALSIGGNHYIHTVRRNLDINVVLFNNEIYGLTKGQYSPTSKVGLRTVTSP
TGVVDYPINTIALTLGAGGTFVARVMDRDGKLMKEIFKRAHNHKGTSIVE
IYQNCPIFNDGAFRAFSDKERKDDTTLYLEQGQPLVFGANGSKGIYLDGF
KPTVIDLEKSGVSKDDLWIHDENDLIKANILSRFFDDPNSTEEFLPRPFG
IFYVEDRFTYEQALSAQIDKAQEKGEGTLEELLAGNSTWTIN
>CT0783 hypothetical protein
MLTTSIPLLPFHFHHQMLYRSLSPNAEEYGVFSCLNAETYNWQAGLN
>CT0850 hypothetical protein
MTIVFRHEASCSRLPQGNQKRKECALKAEKIGKKKSTLNSMVPG
>CT0861 conserved hypothetical protein
MSDTKKLAIIASKGTLDWAYPPFILASSAAAMDMEAVVFFTFYGLPLLKK
EIDAKVTPVGNPAMPMHMPFGSKEFQSINWPIPNFISGNIPGFDTMATML
MKETFKKKGVATVEQLREICLESGVRFIACQMTMEVFGFDKSEFIEGVDY
GGAASFLEYAAEANISLFI
>CT1896 peptidyl-prolyl cis-trans isomerase, FKBP-type
MAQAKKGDKVLVHYTGTYDDGTVFDSSVERGPLEVTIGTGMVIPGFDRAL
LDMEPGQKKTVNIPVDDAYGPRAEELIAEVPRERIPAEIPLEIGQQLQLS
LADGGEVIVMIVDLTDTTVTLDANHPMAGLDLNFELELVEIL
>CT0997 hypothetical protein
MFFIGYYIQIYGLLYMYLMSRITFLLDFIYLMFL
>CT0805 conserved hypothetical protein
MTATPRRTTTMAMPNFGDMMKQLQEAGAKMQDLQKQLEKLVSEGEAGGGM
VRAKVNGRQKLLELTIDPEIMDDVDMVQDLVVAAVNKALEASAQLAQSEI
QKAAGGMINPADLMKQFGGQG
>CT2242 cytochrome DsrJ
MQVAHRGSLPAIAEESPVVAAATVKPGGAPIDSSKCILPTEYMRAHHMQI
LNKWRHDSVREGNRTFVNPQGEHFDKSLNTCLGCHGSNPMFCFMCHEYAN
VKPTCWNCHLSPMEVSQ
>CT0536 glycosyl transferase
MQASGKTTPAVTVIIPHLRNRPTLDACLDALRKTTFRDFAVLVVDNGGDA
SDLAGLESCYPEISVLHLPENAGYAGGCNAGLKLVISPYVVFLNDDTVVE
PEWLGCLVEAAECDPQIAALQPKILSLPEHRQGRRVFDYAGAAGGLIDRL
GFPYCLGRSFGGREVDAGQYDEMCDIFWASGVALFVQREVAEKLGGFETE
FFMHMEEIDLCWRMLLAGYRVRSVPQSVVWHEGGASLSEGSPLKVYYNHR
NAMLTLLRNRSTVPLVLLLPLRIALEAAAVLYYLAGGKAGVMRAGQVARA
FADVLRRLPETLRQRREIQRSRRVSDRELFRNTPLSIFLPRRPD
>CT0835 hypothetical protein
MVASDRKAYFRSVSRELQAAGLPVDMNKAL
>CT1467 hypothetical protein
MKLVIQMVREDDEKYEEPCRKQRGILKAILEYFTP
>CT0877 hypothetical protein
MYRNDYRAGGFKILSAIDSRGNRIFKHILASTLLLVLASFLPYLFGLSGL
RFGIGSLLFGIIPLMVSVQLFFSRSDRDTAGFSERPLPTCRCF
>CT1986 WD-repeat family protein
MRRRPATIKRSTLMGFLSKIFGKKEVELKRPQVREDAALIKTLVGHEDRV
LGVRFSPDGKKLVSGSFDEKVKLWDVETGNAIHTMSGHTTWVKCVDYSPK
GDKVASGSIDSTVRIWDVATGQCLHVCKGHDTEVRMIAFSPDGKTVASCS
RDTTIKFWDTETGNEVKTLFGHKSYIECIAFSADGKKLVSCGEEPVVKIW
DLETGKNIANYPTGDTLSHFVSFSPDGSQIALCGRDAKVKVLDAATGQML
KVLEGHEDGVRALCYNPAGTLIASAANDESVRLWDVAKGALVHTYRGHTH
EVQSVAFSPDGKVIASGSDDFKIKLWGVV
>CT1518 hypothetical protein
MLSRRSFFDVNIRKKPSTHFWLDKIHKRGKKSGVK
>CT1309 hypothetical protein
MKKKLVMTALVAGLLGSAGTVFAASSNDQPAPCARPGISGPVDARGYGMG
PEARFARQQDMRWQRMKQALGLSDSQEQKMMELRRGFYQESRPVRQSMRN
LQRDLALESVKKSPDGRKIANLTQRIGQQSVRLAQLESRHLRELAKVLDA
RQIDRLLQMRDNFGGKRWGRG
>CT0212 hypothetical protein
MNAGKKGVIYTCITGGYDELLNHTFISPEWDYVCFSDDMGINNEKNAQWE
IRPLRFEKLDDVRNQRWHKLHPHLLFPESGLSLWVDGNVDILDGEIFHDI
DRALNANLLIAPSLHPERNCIYDEFDACRQLGKDDPDVMGRQEYLIKKDG
FPKAKGLFETNIIFRCHSHPMVITIMEEWWYWVEQYSRRDQLGFTYVLWK
NNYTVEPLSPVSYRFSPGVRFRYGAFHITKEQLIKEKAALEIKVQRFEAL
ICGRLVKVLYKIRKSTVKRWCRVKMQLLNSLCCRK
>CT1083 hypothetical protein
MHRHHGKCKYLPKNLTNQPAQTNKRFLTDEHPNRILLKNGRDFNGSFSGI
EKDMIQFCFLRT
>CT0208 hypothetical protein
MDRVKKFSLFVMTAASAVVIVLCIAAALVLNSGMVDLFAKKQLLSMFNNE
YRGRLELKEVKLRFPDEVTLVGPGIFEEGAAKPAARADRLTLKFNFLSLL
RPKITLLSFREVDVDGGHASIAEYPDGQLNIGKIFTRRHPELPEMLAIEK
FRARRLKLRNSTVSWKPANAPAYRLQNLQLDMSRAFVAKYEFMGTIKQMQ
FTMPDRGLTLKKGSGSLAFSSVRSDVLGLDLETAKSHAKLSVSIDGLDIF
SGISKKSLLNKKTFIHIESLGIDTSELNRFIPIPALPSGVYRIKGDAKGT
FSDLEMLPVSIEHDGSSVALQGKILNLLDPESLSFNLQIDKSKISSALLT
KVLTDERYRSLAKEAGDVNFSGMLRGRLDQWMTGIDFKTGLGSGSTKFDT
KRLGGGKYQLDGDFNIEKTEPHRLLGIRGVKSGFSGSGSFNGTGSASGIE
NAHLETSVKSAFWQQQTISSGSVTLDLKGKKADLSSDLKSPDGGSLIMAG
LIDFSSLAPSYSVGGSVKKLDLSKATGLQDYRSDLNGRFDLKGRGFDPAS
LNIKASFVLEPSSFSDFHFRERSAISASIAQSAGSSAVSLESEAVDLAVQ
GSASMSQMIEALQMAAACIARETGSTAAIRLPRGPSPWTFNYKLAVRDLT
PLKPLLPAKEFRFKGSASGKATLSGGRLSMDTALSSTTLSNGPSFQLNNT
AMTGSMQCTAAGVAAARLSGTAGTVNTFGRELKNLRLVSSFDNGRLAASL
DLAIPRFSEKLSAAFTARRSGNAAAVSIDRLAFTTPSGVWQTAPGGTLDV
AKEFIRFNRVRFAKGTQSLQLNGLLSNSVSGTFRGTLSGINLTEAKYFLP
DSAQKPMSGTINADFTVSGAPGSKTSDLDLRGSGVTWDQLNVGAVHLTAR
HAGEQLRFEYNSRGAATPAGTTATVPVNTITGSGSIPLVLRYSPFEARIP
DNRPVSISMRSDDLSASIITYIVPIFDHAEGVIPTDLRVTGRMPKPEIFL
TTRLRGTDVRIAPTQVTYRVNGQIIGTPSRIDFGRLEVKDAENGTGAVSG
MIGLDGLKPVTVNLSGSFSNLLLYNKKDMKDDTSFGTIRGTTGNLRFYGE
LSAPVAEGDLVLNSVNFSLYRKGSNESAKYIGVEKFIKFVPRRPAPKPVE
AAAPPEKLEFHYNLLDILQIKNLRLSNSAPVKGTMIFDRIRGERVEAQMN
NLSLVVNKTGQRFSLFGSIDITGGKYTFSNTSFELENSGRVAWNNEEIRD
GRLIDIYGVKQVTATDVQTGERDNVRLLIAVSGTIEKPDVRMGYYLNDDT
QPYSSANMIGRQSSHVDPNADLNVISMLFSRQWYLNPERQGSRGVSPVSS
VGISAGTGLLSSQVSSIVQNLAGLESFNVNLGAGANGNLSGLEVSYAMLV
PGTGGKVRFVGTNTTPVAGSRTTTNYYNGSSQKIEYRVTPKVYVEAFRSY
GMTTSDATYTNLQKPTENWGVSVSYREKFHTWSQFWDHLFGGKKKGREKD
KDKKE
>CT0449 hypothetical protein
MPERYRKSGGHGALDAETVWSLPWWERAGREGYTGEARMVSSLSETPRL
>CT2005 conserved hypothetical protein
MDLFYVLPHQLDLEHAWAIIDGEEFHHLARVLRCQPGDVVPITDGAGFAA
ELMVGSIGKHSLDGAICNPRTVPPPKTQVTVALSLLKSPQRFDLFLEKAT
ELGAARIVPMITKRTVSTPDSGKVGRKLERWRGIVQSAARQSRRYHLPEL
MSPLTFREALVLDGYDLRLIAHESENAFPSFEPAGKKILFLVGGEGGFTD
AEVADAVEAGFTPLSFGESVLRAETAGIFAVALVRARLLAEADPAKRL
>CT1223 hypothetical protein
MSVHSSLNGKRFFASRLLITFRKGVFALNLHTMAHEAA
>CT0499 P-II family protein
MKKIEAIIRTSQYEAVKKALHEIDIDFFTWWDVTGQGNDKKHRTEIYRGT
VRDTSYISRRYLSIVVRDVNLQKTVDCLIKNARTGEVGDGKIFVSDIEQS
YRIRTGESGDESLYNKD
>CT0272 hypothetical protein
MRIMSWIFIFGFALSVFFLLMYFLSKFVNHMKMEQNMEIESFKDSLIDKD
NPVGLTGEELEKMKQQQAEAQAHLREVISKIPVIQKDGKFQVDMDAVRQQ
KAAAAKTNGSTGPATGKN
>CT0074 hypothetical protein
MKKNVGHQDRNSLFSVGVPSSEIIWRSALKMINFVSLNRHSLIEQQ
>CT1097 hypothetical protein
MVLSKQLYINKPAQVTAKFHTFTFGELVTV
>CT1858 hypothetical protein
MRLFSFTRECHAPAWHELKVNMSCLEELSMKAIS
>CT1328 conserved hypothetical protein
MGTKGREIVGDSIDRVLELLNKAFADEWLAYYQYWIGAKIVEGPMKDAVI
AELVQHAADELRHADMVSMRIIQLGGTPLTSPKQWFEWTNCGYEAPDDKF
VKMILEQNISGEQCAITTYNSIIQEIGMKDPVTYNLAVQILQDEVTHEED
LQALLEDLGVFLRK
>CT0419 hypothetical protein
MKFSVTCIMTAMLMSVAPADVVRASGRGDEFGAEFDGLESRSWYRETTTS
NSDSDRDFFSRLRVKSAGQTLKRQVSQIELNAGSSSNLPTFRQRYNETST
TRSNPDLQRERAVKTGLALSFAPSEKLTFTWKPAAWLELTPSYLYQHARN
EETGLWLPPFHSIPCRDRSCSNLSAVSPFAPT
>CT1882 glycosyl transferase
MKVALYAGTYVKDKDGAVRSIYQLVSSMIKNGHQVVVWTPDFTPEANASV
PVNLTPSVPIPLYPDYKLGFYNAVTERQLDEFAPDIVHISTPDIVGRKFL
HYAKRKGLPVGSAYHTDFPSYLSYYRLGFAEPAVWRFLRKFYNACDVTLA
PNESVRERLTGKGIERVELWSRGIDKELFDPSRRSAKLRRAWDAEGRTVI
IYAGRFVLYKDIEVVMSLYQRFADEGLGDRVRFVMIGSGPEEAQMRARMP
EAVFTGYLTGTTLPEAYASGDLFLFPSTTEAFCNVTLEALATGLPAVVSD
VGGCQELVERSGGGFVAKAGDVGDFYACCTKLMQDGELFRSMRERGLAFA
KDKSWAAVNGKLIDRYLELIAAKARR
>CT1063 conserved hypothetical protein
MSSSSVLAESCNGSQKKRVFVVGATGYIGKFVVRELVSRGYEVISFARPR
SGVNASTTEDETRRQLQGSEVRFGDVSNLESLLRDGIRGEHFDAVVSCLA
SRNGGIKDSWDIDYQATRNSLDAGMKAGINHFVLLSAICVQKPMLEFQRA
KLKFEKELRESGVTYSIVRPTAFFKSIAGQIEKVKNGKPYVMFGDGKLTA
CKPISEGDLARFITDCLEDPEKQNKILPIGGPGEPVTNLDQALMLFELLG
RKPKLKKVPIQIFDVIIPLLTLISKFLPSFAEKAEFARIGKYYCSESMLV
WDPVKKRYDADATPSYGTETLRDFYKRVLKEGLAGQELGAHAMF
>CT1991 hypothetical protein
MDCDYSFRLPVLMNTGRKNDDDTPHEHPFFRSSSV
>CT1226 hypothetical protein
MLPLVRYLKKQCIRAIRKHKLKLFYLKSSFPSRSVINKQF
>CT0368 hypothetical protein
MTKKPDDNDNAGGRARKKGFWSITMALDALLSVTFLVLAAYIGLYTSTAG
GDHGRDLFRFSLLLGAYGIWRAIRGVIRYRQKDEKS
>CT1860 hypothetical protein
MLISFPEPRFEALSIQAALVKMGGKYRFITLENFSDTATSSSKTVMGKLT
ADNGHMNLGSKNYDDLNSFQNDPDKLLSL
>CT0878 transposase, internal deletion
MTRKKEDPGHSGRAGRAARLSKASLPIVPNTSNSRNGHGSKTIIVDNDQF
TIRSPRDRNGSFEPQLIPKRQTRFKGFDEKILAMDARGMSVRDMQAMLLE
LYQVEVSEALISSVTDAIKLIYLAMQNIAKKWTMPLRNRGAVINQFSITF
EGRVPLVPPMKTTVDTLG
>CT0657 hypothetical protein
MKAKLSFRLGLVFGLIIFFSLAFSYVYQGRQLRKVLFQNVRTSLLHDLRL
TAEMLESRPPGWADPAISDQWTGRVGKMLDVRVTLIDLSGKVIGDSYVPA
AKLDKLENHRNRDEFKAALSGGIGEAQRFSVSVRENMLYMAMPVGKPVPW
AVLSWPSRCTTSLPSKRRSTKASRVASTGRC
>CT0389 cobalt chelatase CbiK, putative
MSKHKKRSQINKKAILLAHFGTTYPSALPSLENIRRQVRARIPGIEVRHC
FTSNMVRNIWSARRRDPQRWLAEGVADEFLNVQGFLGAIGNLQDGNYRTI
IVQPTHMYHGEQFEDLKSYVSALQSIRTIKRVWSPFEKVVLSRPALGTCG
VEHDYTEDIKEVAALFDDDVERARQLDAALVYVAHGNDFFSSGVFRETGD
VMRSRYPGAQIHIGMIEGRPGVEEIVAEVTGKGCTTVLLRPFMITAGDHA
HNDIADDDTGSWKGAFEAAGCRVEIRMEGLGSDDRFASIFAKRILETAED
HGIDLFSASDKID
>CT2231 hypothetical protein
MLWASRKTECAGRPVSLASLNRLLRFSLKPVQNRSCSFSRISRRWRFRPG
REMTIMRITVR
>CT0214 hypothetical protein
MDTAINPHSLWAKTIPTIFIVFFVKYVKAYML
>CT0967 conserved hypothetical protein
MLKVRRSWAYTRWFGWYSRRQFRRHFHSLRVQMPTGLPEMDTSIPVIFYG
NHAYWWDGFWSQLCTELYFRQNLYIIIEYAQLVKHQFFTRLGAFSIDRSN
ARSAIGTLDYAAECLVAPSERQNALWIFPQGLIEHVDKRPLLFFRGTAGI
VQRVFKKRESIYLVSVVSRIEYLEEQKPELFLSFGSPRVLKRADWQGLDA
LTAFMQQTTESHLDKVKQAVIERRLDGFDMVIRGSKSINRRVEAFRSLFG
KGRPAEE
>CT1197.1 ABC transporter, ATP-binding protein
MATAVKLSGISKTFGSLKANDNVSLSIEAGSIHALVGENGAGKSTLSNII
YGLLHPDSGTIEIDGKAVKFSSVRQAIEAGIGMVHQHFMLVPTLSVTENI
ILGKEESRLALPTRRIGKEIRQLSQQHGLEVDPDALVSTLSVGEQQRVEI
LKLLYRRAKLLILDEPTAVLSPPETARLFATLRSLAAEGRTVLLITHKLD
EVLTVSDSVSVMRKGSLVGTVPTTSTSKEDLARMMVGRDVLLRTANAPQT
PGKTVLSIDKLTYRSPLGIDKLTGLTLHVRAGEIYGIAGVEGNGQSELLS
LLWGTFDRSGKTGGSITIDAQETLGKNPSEIAALGVSMIPEDRHKSAIIA
EYGIEENLILGRHREKAFHRGIGFDRDAVHKNATAMIEQYDIRCAAGTNP
PIASLSGGNQQKIVVAREMERPQLKLLVLAQPTRGVDIGAIEQIHKRIID
ARKSGLAILLISSELEEIVALSTRIGCLYKGAIRHEFSETEVWRGRDHES
GFEKEIGMHIT
>CT0418 cobN protein, putative
MKPISLCYFCVNPSEISTLSAGLRRYNDGGGQAELKARIAAQLSTPEQIA
VFARAAASADAVIIRLMGGKASFPAFDAFLEALAERREAGQATPLILISA
GGGDDEAAELAQQHSALYGTEAGDRLRRYIQNGGVINVANMLRYLHHLIH
GGENDADEPVEMPHEGIYHPDWPAFDDFEGYLAKHVDPSKPTVGLWFYQN
YFVDGDLAAYDYLIRQVEERGANIIAVFHHRYRDAMRGNKGADYVAERYF
RRPDGTSRIDVLVNPMLFALSMASPEYRSILPGLDVPFLQAFNTFQSREQ
WRESIQGLGTMEVSYNAAQPEFDGALMTVPFSTRENMGIDPLTGGEVLRI
MPVEERVSKLADMALRWAALRRKPNADKRIAIIFHHYPPRNDRIGCAVGL
DSFESIRLLLERMAAEGYVVERQYENGDELAKELVTRMTCDRRWLTPEQM
AEKAEAKAGSKLFQPWHEALPAAVKQKMVKNWGEMPGELFVHDEELLFPG
TINGNVFITIQPSRGSIERQDQMLHDPDIPPPHHYLAHYRWIRDVFKADA
VMHVGKHGSLEWLPGKALGLSEECYPDLSIMDLPNIYPYIINDPSEGTQA
KRRSNCCIIDHMTPVFTNADLYEEMAVLEGHLRSYAEARNSDPGKLDVLR
PMIWDAVLAADLDKDTGYTREKAFADFDKFLEVLHSALDEIADTMISDGL
HTMGVAPDGDRLVELLVQLTRLEQGSVPSLRESIVTAMGFNYDELLGRKG
EPVFGPTSETGGEMIRRAHEHALAMVKLLSANGYSTQAPEFVQAELPALV
TPDVMAALRYICDDLVPRLLKVTDEIDASLKGFAGRFVDPGPSGAPTRGQ
ADILPTGRNFFSIDPQRIPTPAAWKVGCSLGDALVQRYLAEKGEYPRNIG
IILFGGATMRSGGDDLAEIFYLMGVRPVWKKGSGYVQGVEIIPLNELGRP
RLDVTPRISGFFRDAFPLLVERIDDAVRMVAALDEPLESNLLRRNVLADV
EEYRKQGMNDEEALREASFRVFGCPPGTYGAGVSELIESKNWQTQGDLAA
NYIRYSSHAYGRGSYGQQKPDTFKRVLSRMDATVKNEDSREYDMFSCTDY
YNYYGGLITAAKSQRGGELPEAFMGDSSDPNRVQVRTTFEEAKHIFRSRL
LNPKWLDGLKRHGYKGAGDISKVLDIILGWDATAEVVDDWMYERVAGKYV
FDEEMKKWMEEVNPYARQNILDKLLEAISRGMWNATDEMKQRLQEEYLET
EGQLEEINE
>CT0731 hypothetical protein
MIQPGFMDAANEHNDHQLLAGLEKNVGRLVEQLSECRKENELLKSEVLSL
QNILRSFKLPGTEGPEPKVSGTSGEGFSYADKLQIKQKLVMILQKIEREL
RGEKAGF
>CT0629 PTS system, IIA component
MLKSVLDALENGRLVELPDNDKSDALQFLATILEAVPSIPAGSNIVEIVL
EHEKNSNTSLGFGWACPHARVNFDGDLCCAIGWSPAGIDYGIAGEPLVRL
VVMFFVPENQRNTYLKEVSTLARVLKSNPEFQNLEPIGDLDQVRNTLLDM
THQAMSEGGKNYRARMIQLDTLTKSAGDAGHYWLKGMQIDALLVVAGPDI
KPLVLSQNAELSRLATQDTQLVPAIASGNHYELQGWQIVRRSSTAYAGER
VIYDCLAIKTRQGQGAPEAGG
>CT1105 GGDEF domain protein
MIFGIVALILAAGGTAWFGAFQSSRLHQISEDESLRMRLVECRTSISSIE
QLFGDEHLVLGKQADIQSKLRIECARFSEIGRSLEMKQGSELNQWLKHRQ
PLAAPVSRESLQLMVVDMDGMIEKLSAKITDEKAQLTASLNYYLVFITLL
LVVVVLAGGKLIISSYRRSIIPLHKLSERLALLNCNLSESLHDTAEAAET
LLNGEDPSADMRSVSESVASMCHDIEEKNRKLDELHIRDEKTNLYNYRHF
KEHLIIDVERARRFNGDISLAMIDIDNFKNYNDRYGHVAGDRALARIAEI
IRKECRMTDIPSRFGGDEFAILFPKTDRATALDISERLRNIIYAEPFEHE
SKQPGGQLTVSIGVASWLDDATDGTSLIVNADKALYKAKSSGKNMVLAFT
REV
>CT0330 ham1 family protein
MEPKHPEITIVLATGNKDKVRELKPVLEALASGIHVRSLHDLGLDIDVEE
TEPTLEGNARLKADAIFELVAPRLDWFIALADDTGLEVDALGGAPGVYSA
RYAPVPEGVARTYEDNVRHLLSEMRGKSKRTARFRTVIAMKGRLPAANGS
AVEIEETTDGHIDGLITTEPKGNGGFGYDPVFAPEGMDRTFAQLGIDEKN
AISHRGRAVVAAAKRIGEYLSQCGIQ
>CT1347 outer membrane efflux protein, putative
MVKKSVFIGILLLILPALTIHAESQPARTLSLHECITIALNNASDVKKAE
NARHLSGVDLLRRYGNFLPKVTSSASYVPRSVNRSYTSYSPLYGAGSDTA
INMTSRTSTVDMSLTASLNLFNGLSDYAALQAALDRKKAAGFTLQRAKET
IAYDVTQHYYQVLLDKELLDIARENLKTSRDLLTLTERQFNIGLKSITDL
YQQQAEASNSNLAVINAENQLRRSKLELVRRLRIDPAEEIALEPVDTAAI
EKLSPEVDIAALSAASLQQRADLKAQGLERDAARQDVRQVAGSRLPRLDL
AFTMSSGAIDSYKTTMLGQTYDYAYPSVSKQLKNGIDHAVSLNLSWTIFD
GFSTRYNVESAKVVSRNRQLDYEELKDGIEIDLKQVAGDYQAAFTRIESA
KKSLKASESAAQGITRKYDLGASNFVELSSARAALFSARSTLTQAIYNLA
LQKALLDYTSGVGIEEVSNASSH
>CT0906 DNA-dependent ATPase, SNF2 family protein
MKLVRNTGSDRVADLIHPAVLDGRQLDAVTPGLSIFAWAQLLRGLRRAER
CRMVLPADLSMMPLLGSATDRSXRNQLQSRWLAGGMRDWIVEKADIRLAR
FGVPQGALVIRNAEAQPVQALLGSFALTTDGLGLTPGNPLSLIQASETPE
EALRLSQWFDGQWASLPDDPGAKAAVIEALNQIARHRDPHLTYALILHNL
FASLGDELDEERIVKSATGIRNTVVWKKLYKFQRDGVVGAIDKLNRFGGC
IIADSVGLGKTFEALAIIKYHELRNDRVLVLCPKRLRDNWTLYKANDRRN
VLASDRFNYDVLNHTDLSRDGGSSGDIDLAHVNWGNYDLVVIDESHNFRN
KKTPQAGGETRYDRLMRKIIREGVKTRVLMLSATPVNNRLADLRNQIAFA
TEGDDTALSDHGIGSIDITTRLAQKQFNRWLDLDESERTPSRLIEMLGFD
YFTLLDLLTIARSRRHIEKYYGTAETGRFPERLRPINIKADVDRAGAFPA
IAQINLEIRRLHLASYAPLRYVLPHKKEAYNRKYSTQVKGGTGFFWQEDR
EESLIHLLRVNVLKRMESAVPSFALTVQRQLRDVEATLARIEAQAEDLEE
IDLSIDDLDLDDPAFESLVVGRKVKVLLKDVDLIRWKQDLIEDRNRLATL
VSAAREITPERDDKLAKLREVIEDKCRHPINPGNRKVIVFTAFADTARYL
YEQLASWAKTTLRLDSALVTGAGSNQTTLLGLRKDLVSILTAFAPRAKER
PEELAGEGELDLLIATDCISEGQNLQDCDWLINYDIHWNPVRIIQRFGRI
DRLGSPNQRIQLVNFWPNMELEEYINLEQRVSGRMVLLDISATGEENLIE
QQSGNPMNDLEYRRKQLLKLQDAVIDLEDLSTGVSIADLTLTDFRIDLAE
YLRAHPGVLEALPLGTMAITTTADAEIPPGIVFCLRAEDAAAARVFEPGY
PLAPHYLVHVGDDGTVLLPFTQAKQILDRLKRLCVGRDLPDAAACARFDK
ATKQGEEMRHAQRLLAAAVASVVGKTEERAVASLFTPGGTHALAGEFAGI
NDFEVVAFLVILPEVSA
>CT2270 hypothetical protein
MKKKFLRFFTTLLFVCSFIPGKLNAAPTATHDGVYLDEIAIGSGYAWGHL
KFSEADYNAVPIFARFGFNMNSVFGMKESKSTLQLALEPFCNPVTEPDSG
VETGLNVFIRYLQPVAPSVKLVGEIGSGPMYLSINSAEQGKAGFNFLNQF
GLGAQVAVSPKSAITVGYRFRHLSNAGTSEPNRGINSNAVVVSYSLLY
>CT2282 hypothetical protein
MSQNLTASVFVCQSVVHFFINKLRVLWLFDQAISLFWLAKASL
>CT0275 hypothetical protein
MTVQQHFKWRVFISTGLLLTFLVMLISGIVLYISPPGRVANWTDWNIFGL
TKKGWQNQHVVFGFAFIILSIFHLFVINWKAFLSYIKTKASKGLNSPAEL
VTSLLLFAVLAAGTLWHLPPFEQIIALGDRLSGSWENKTGGPPVPHAEAM
PLDELGALPQVNDPAEKMVAKLRAAGIEVRDTRQTLTEIADKNGVKVEKL
YGIISKGKQSGELQGSGWGRKTLTEAAETIGVSPLALQQALKQQGIEAAP
GESIRDIATKNSMEPSELVSRINRMAEKR
>CT1183 florfenicol resistance protein, putative
MEKNEISEERRTQEKEKQHGHRLNIRRLGRKELTELLTRLGEPAYRANQL
HRWLYSNQALRFEEMSTLSKQLRQKLASEWIIHPASLVGTERETTDASLV
TGNPTAKFLIKLEDNELVESVLIPSEERITACISSQIGCPLRCTFCATGH
MGFRRNLTASEITDQVFLLEKEAQKRHWRGLTNIVFMGMGEPLLNLDNVL
ESIGTLTEKDYQFSISERKITISTVGLPVEMDRIARSGLKTKLAISLHSA
DQLIRERMMPIAADITLDKLAKAINSYNSVTSQPVTLVYMLLEGINDSPE
DARKLVRFAKRVLCKINLIDYNSIVTLKFKPGCSSSKTMFIQQLLDAGLL
VTVRKSQGATINAACGQLATRPVR
>CT1718 hypothetical protein
MSHHHQTKNLGLSIVINAFIFMVIHQLLQPIVINVQPKPKHTPGTRIVHC
SIPGRPLCTSAFPSPFFRSGRTSARIEKTRSRSSMLVQIPWRPRNSFEIS
SRDFVLNVMADISI
>CT0738 conserved hypothetical protein
MRKGWEMFKGNIGEFIGFTLICFAASIVSSKMAAFGSLLFSAIAAPLYAG
YTIAAFRIMTGQELQFSDFFKGFNYFLPLFLAGLASGILVAVGLVLLILP
GIYLAIGYMLATFLIIDHGMEFWQAMETSRKIITKNWFAFFVFAVVLFLV
NVLGALALGVGLLVSAPVTACATAIAYKEIVGLHSAEW
>CT0328 conserved hypothetical protein
MPYSSSGVGHNGFDKADLHIHTKCSDGLFTPEEIVHKAIHVGLKAISITD
HDTVTGIDQAKPLALELGLELIPGVEMSSAYKGYDIHILGYFFDYQQSEL
KGYLDHCRLLRTERAERMVQKLAKMGVKIEIEQIIMKAQNGSVGRPHIAA
VLQDEGFVKSFSEAFSKYLGSHSPAYVKSIETHPEEVIRLINEAGGLSFL
AHPAQNVPDEILRQLISFGLDGLEIIHPSHDTYRQNYYREIANEYFLLFS
GGSDYHGPKDHEDNFGQVWIPYEWVTKMKSRLAPAVKE
>CT1971 CRISPR-associated helicase Cas3
MRHYPKSLKNADAPVEHPPLPLERCLAKSRKIDASRSVAGRTVLDHCRIA
GEVARELIARSPAFLRESFFPDGSALVAASHDIGKVSPTFQKKIYTAIGN
ADPTILDVLNDVDSEIEKNWGGHAGVSQCALEALEAGKDIAAIAGCHHGY
APKLAGKTADAEAFGGAAWQRQRARLLERLAEATGERFPKIRNLLHARVL
AGLTSVADWIGSGATFDDPGEAWQSCIADAVDAAGFTPPRLRPGLSFREI
FGFEARPIQRSLIEKACRPGAYILEAPMGIGKTEAALYAAYALVSAGKAR
GIYFALPTQLTSNRIHERVERFLDKVLEADSPHRNALLVHSNAGLQKFEF
GADAAPGCSWFSASKRGLLAPFAVGTIDQALMASMNVKHGFVRAFGLAGK
VVILDEVHSYDAYTGTILDRLVRELRQLHCTVIILSATLTGERRSALLGA
ARSAEAAYPLISALPAEAREPVEIAPETMPGNRVAIHQTLDIDEAMDEAL
NRADSGQQVLWIENTVTEAQEAFKLLAARSSGMAIECGLLHSRFIHCDRE
ALEEKWVTRYGADGAAARRERGRILVGTQVLEQSLDIDADFLVTRLCPTD
MLLQRIGRLWRHSFHQRPAGARCEAWIVSAQFEAAEQNPGEAFGKSAKVY
SPYVLLRTLQAWSGVEALSLPGDIRDLLERTYEARPESEAMAKHKADLQR
RREILESFALQGVSFGLDARPDTNVATRYSDIDTVELLLLQSVSHNHAAH
ETTVTLLDGQQLTLPHDFAAGRKREQRQLAARLATQTLKVAEYAAPADPG
RNTLTWLKPYFYLGDPSKSESLLRVAIVGEGGLLRLPGGGAAAEKYELSY
NPRLGYQYKKR
>CT1476 putative addiction module component, TIGR02574 family
MKLSVFERIQLVEDIWNSIAAEASDTIELLSQTQKDELHRRVAEHRADPS
TAVPREQVKSRLFSGKS
>CT0944 hypothetical protein
MKKYPFRLGTSSYIIPDDILPNVRYLADKVEDIELALFESDEFSNLPSPE
VIAELVALAGEHGLTYSVHLPLDVYLGSPFRDERERSVGKCRRIIDLTEA
LPKSAFVMHFEAGKGVDINAFSDEERQIFVESLGDSARMLLEGCGEPVSM
FCAENLNYPFEIVWPVVEQFGFSVALDVGHLEYYGFPTADYLDRYLSRAK
VLHMHGTTGGRDHNSLACMRPEALDLVVEALRKVEGEPKVFTLEIFSEAD
FLSSVETLERFSS
>CT0953 ABC transporter, ATP-binding protein
MSMMDQVGQTVLSVDGVSKSYLMGEVTVKALHDISLDFGAGQLVVLLGAS
GSGKSTLLNIVGGLDVPDSGTVSFKGQDINAKGEAGLTAFRRRSIGFVFQ
FYNLIASLSALENVQLVTELVENPMPPEEALRLVGLGDRINHFPSQLSGG
EQQRVAIARAVAKRPELLLCDEPTGALDYRTGKLVLEVIEKVNRELGTTT
LVITHNVSISGMADRIITLRSGEVVEDRRNEKKISPSELSW
>CT1370 hypothetical protein
MRISVRFIMLLHGERWRFRKMSMDGSEREALLSTSSI
>CT1812 hypothetical protein
MTRNDSGDLISRNDSTNHMNTTQPQKMLPEATFSDRMLEIGIKYKKQLTA
LVVVICLAAGGTLFWMQKTKVDEVQASLALAKITPWIEMGDVNKAVNGEG
SIKGLNSIIKTWGGTPSGKTARLYMAYILLNSGKPDDALSIYKGFSSDNK
DLQASALAGAAACHVQKKAFAEAAPEYEKASETAENETLKSMYLTKAAES
YSAAGQADKAAKLYDQVIKTWPATSSAGMAQRALLRLAGAGVQIPQI
>CT1159 type III restriction system methylase
MNYVKSKQRVADHGEVFTPAWLVEAMIDLVKDETERIDARFLEPACGSGN
FLVPILQRKLAAVERKFGKSNFEKQHYALFAVMCTYGIELLADNIAECRA
NMLEIFADYLTLDESDELYRAAIYVLSQNLVHGDALKMRTSDGQPIAFAE
WGYLGKGKFQRRDFRLDSLAMSSTFSVEGSLFAHLGKHEIFTPTRTYPPM
TVSELAARAAEANL
>CT1980.1 peptidase, M23/M37 family
MSSPQFHHFLLRLRRSALFRSRPLLLASSTALLVVLFFSVSLILSATSRQ
TPSGHWSASPIHALGKSLGLKSDDELGLNNESDKVTLDEGENDPAHRVEK
KILQRGESIYTILTAAGLTPAEVHELTSQIKGNRAIKGLRAGKTYELETG
KNGTFISFSMQSSPYEVLHLVKDQQTGKLSAESETIEYDTRVATIEGTLR
SSLSSELRSRNRGSLNPKIRKILSSRLNFRRDIQPGATYRILFQEMWNEN
HCVGTGDILAIEINSKGQRFNAYQFTNAKGDTAYYDENGRAMMQGRSMFI
KPCSYRRISSGFGYRINPVTGQHQFHGGVDLAAPVGTPVRAVADGRVIFR
GRKGNAGNLVTIAHGGGLHTMYMHLSRYAASCRYGKRVKQGDIIGYIGST
GRSTGPHLDFRIVRNGHLQNPLVALKQTAPRRSLSPAELHSFLARVQTYQ
HQLNTDRPVMVADTGKSGEPVL
>CT0846 hypothetical protein
MMIASGVYCGSQVKALSIQRPSERNRNLAKKIIDQGKTNKPVQVKAKTIR
NAYVKQEKRTKKHAFFFCRPIAF
>CT1504 hypothetical protein
MDFHALLRVIHITAFAAWFGTIFATLFLLKTLEPGLTGEKKQAEEYSLLL
RRFIKLETKVADVAVISVLLSGLLLAHFYEGWHPWVFVKIGLMILQIALT
MGYIIKAIQPITYPCDTAKYRAWYRLFTISFSMFGIVLLVTFFLR
>CT0304 alpha-amylase family protein
MSIDSRISAKEHFYINRVARDRYHLDDPELLLLPTDRHKAMALAERQTEA
INRQISLVEGEQAKKILPAQFHAMKLLHELQHHIIDKTSGLQSAALTVLE
SETSHWHTIEYLGKFLERFPTWNIYRSKTTLETSLKNPSSRTAVLEEAFL
VWINNRNPALKEFSRLISDHELHGDQAYPHLVKAMQQAIRSSGPIGSASN
LEELLLNPSRYAPDSLLDQLRFIRLHWGDLLSDSPWRALLDEAIVLIEDE
DRYLFFENLAQDQTTHGGTFGEKEAHAPSYDDLANAPARYSHDSSWMPEV
VMIAKSTYVWLDQLSRQYGRHIARLQDIPDEELDLLAGRGFTALWLIGLW
ERSYASRKIKQLQGNPEAKASAYALESYEIAHELGGYKGYVDLRNRAMQR
GIRLASDMVPNHTGIDSELVRNRPEWFLSTAEPPYPNYTYNGPNLSKDSR
YSIHIEDGYWNRSDAAVTFKRVDHLTGDTRYIYHGNDGTSMPWNDTAQLN
FLDPEVREGVIQQILHVARMFPIIRFDAAMVLAKMHIQRLWFPMHGHAPG
IPSRGAWSMSMAEFETAMPQEFWREVVDRVAQEVPDTLLLAEAFWMLEGY
FVRTLGMHRVYNSAFMHMLKKEDNAGYRGLIKKTLEFDAEILKRYVNFMN
NPDEDTAIAQFGRGDKYFGVCMMMVTMPGLPMIGHGQVEGFTEKYGMEYA
KAYYDERPDEELVERHYREIFPVMKQRSLFAGVAHFQLFDLFAPDGQVNE
NVFAYTNQHNGKQSLFIFNNRYEASEGWIRISTGKLDNGSMKQTLLGDAL
SLPSEHDSYVIFRDQRSGLEFIRSCSQVRNDGLYIALGGYQYNLFMEFRV
VRPSKLKPYDQVCEELNGRGVASIEIEALSMSLRPIHQIVATAIESFIKS
EKSAFGKPEKFAEAFGMQCESLLEAVAERFAEIMEQPLTPPSGIAEKAAN
SYLRALGYESLLEEVENIKRVQVSLGLDEETNQTFHSLAKPLIALNCIQE
MVRENGLLEKQIIGQWLLGNTLKKVFTDKATSSWPVNSAEAVDLISCLLS
PRTEPGNNETPDEQLMASIRTLHESGDHHFRTFMQVQHLHGKEWFRERQL
TLLASWLMVQNLMLNESPSGKTVLQTWLDAIEKLEMSAFISGYELGALLK
LAAGKK
>CT0362 glycosyl transferase
MKLSVVIPLMNEAESIGPLFDALASALQGIDHEIILVDDGSTDSTVAEIK
RLAPANARLVVLNKNYGQTAAMSAGIDEAQGELIATMDGDLQNDPADIPM
MIAHLESKGLDVVAGRRAGRKDGMVLRKIPSKIANAMIRNLTNVHIRDYG
CTLKVFKRDVAKNLGLYGELHRFIPVLVQLYGARMEEVDVRHHARKYGKS
KYGIGRTLRVLSDLFFMVFYQKYAHRPMHLFGSLGFVTLFLGLGINLYLL
VLKILGNDIGGRPLLSLGVIMTFIGIQLITTGFIAEFIMRTYYESQNKKP
YIIREIIDKSK
>CT1169 conserved hypothetical protein
MDVASALEGNWEFRVDHKNIKIFSSKIRGSQVLGFKGEAVFEASLRKLIS
LFHDFGNYGKWVHQLSEMEVLHKSDELDYVVRQVLNTPWPIPKREMIVRT
ALHASEEGALALTMTGIPDYVPLKPDFHRVREARGGWILMPVDGGKVHVT
FVMHLDPGSDIPPALSNAALFEVPFYSLLKMRDLAQNPSYKPAWPSVVDN
HVTIIEDVPDKH
>CT1129 CRISPR-associated protein Cas2
MLVLVTYDVNTETPAGRRRLRRIAKTCQNYGQRVQFSVFECNVDPAQWVK
LRSKLLNEMDPKLDSLRFYFLGSNWQGRVEHEGAKEPRDLEGTLIL
>CT2101 conserved hypothetical protein
MNFIESELLRIFVGEQEKLHHRPLYELLVSEALERGMAGATVFRGLLSFG
LRHKVHTSKIFELAGELPMVIEIVDITEKIEEFLPVVEALLRDSGAQSLV
TREAVRQWTT
>CT2200 conserved hypothetical protein
MCCKPFKGRKKTPTPLQPQSANKKKSSVAALVCTHQNCSISPIHYLTYAL
QQQRLSTHKPITMQRLLSSSRYLVIIAVAGTFLAALTLLLYGGISVVQLI
ADTVMHAEISGKTGKMLVLGFIESIDLFLLGTVFFIISLGLYELFIDDTL
ELPHWLVIHTLDDLKNKLIGVIVVVMSVVFLGHVVQWHGEQELIWLGGAI
ALVTAGLTYFVGSKKK
>CT0792 AslB/AtsB family protein
MTSREEFLEKLAQAGTLNLIFQLTDQCALSCRYCFAKGSHPSGGLRIADD
LLDAAIRSAFDTRHHQVSFEWTGGEPFFAGIDFYRKVDRLQKKYATKPYA
NTIQTSGYVHDRELIAWLAGHGFRISSTIDGPPELHDFQRPVNGGGPSLG
AVLATRETIIEHQGHCGCICTVTRNSLGKEGAILDYYRSLGIEAFHSNPY
YYFSKNLVGDESLALDADGYAAYFIAQFNAWFEGGRKLPMPGTLNYILRS
LTAGAGLKQSVCTFGGRCLTNFLAITPDGDAWLCPKFAGFDEMRLGNVGK
MAITDILSDANPAMARLIDERLDAIHTCEADECRFQYLCNAGCPYYSFIA
SGGRNIAVKDSLCTGKQLLFEYLESVVELIDPARLPEPSPLEHA
>CT1497 sigma-54-dependent transcriptional regulator
MSRSIEFIGRSAGIVQLRELAMQVADTDVTVLITGETGSGKEVLARFIHD
HSRRAGKSMIPVNCGAIPSGILESELFGHEKGAFTGAVQARKGYFESADK
GTIFLDEIGEMPLETQVKFLRVIETGQFQRVGASETISADTRIIAATNRN
LNQAVAEKHFREDLYYRLKSVELLIPPLRERGSDILMLAEHFVHEFERKH
AIAFEGFSQESGELMLRYPWPGNVRELRNLIEALLILERGKKITPDILEK
HLVQRSRHKGLVHEPGKSEKNELHLIYSSLIQLRQEIDEIRQMLQQALLY
RQPTSPLLLPALPAPPAAQDSTLPSGGNEAPVPLKELEKRAIGEALTKFD
GNKRKTALALGINERTLYRKIKEYGL
>CT1707 SMC family protein
MYLSKIELFGFKSFAHRVRIHFDKGLTAIVGPNGCGKTNVVDAIRWVLGE
QKSMLLRSPKMENIIFNGTKRLKPLSFTEVSITIENTRNILPTEYTEVTV
TRRLYRNGDSDYLLNMVPCRLKDILDLFADTGMGSDAYSVIELKMIEEII
SNKSEERLKLFEEAAGITRYKQRRKQTFRQLESASRDLARVDDVLAEVEK
KVRNLRLQVRKAERLKEIREELRTLDLTLSAISMDEHLQKLRPLLDSIAA
EERQCHELAATIAKLDSAHQESELRQLELERKLADAQKELNASNQLVHTL
EKQLLQHKEKQKNLLQTIERLNYSIADKGRKRLEQEALSKELSEKQTPLQ
EVCTAQLAEFERLKKQEVELNSALDASRQALQSERRAVAELQKSLNALNL
TRQSLRTRKEHLEGSVNRLDQRKRDLERSMEQAEPERRRTSEAIEEKKIA
LDELKKEEERLVALKASITEQSEKKKEELLSLKSEHNHLNNRIALCNSIL
EKFEGLPEGVAFLEKQRAGKPGLGCLSDLISVRENDKKAINAALGESLGY
YLCRNLEEARLAVSSLAKADKGKVHFLILDLIDGGAKIDYAEIEGARRAI
DLVETPAELSKALNLLLQHCYVVADLDAAEQLGKKHPEALFITEKGEKFT
RRGMLYGGSAKGGESVRLGKKAERDRLQKQMAGMAETIAEAENALAVLRK
EFSAIDTERVKRAAASISQEISALEKRLARLEAEERSGADQIAHADRERT
ALIASMQSVLDELEKTQPETLRIEAEIETAQQKVNVMQEELSAGESRSRA
LHAELQAQQGRYRDAQLDLEKHRFRASACQQTIVTLSDEIEGMQHQIARA
EKEVAELGQSIAQATAEHEQAVVVSARQQEALNELESSYRDLQTKNHDTL
SNLRDLRRKHDLSQQMLAEFNNRKAKLEQEIAHLQATVMERYGVELEMMP
AHVPEGFDVAASRERLAYLQKQKEQFGGVNELALEEYESEKERLDFLTAQ
KEDLVSAEKQLRETIEEINRTALEKFRETFNQVRKNFIRIFHDLFDPEDE
VDLLITTSDEDPLEAHIQIVAKPRGKKPLAIEQLSGGEKALTALSLLFAI
YLVKPSPFCILDEVDAPLDDANVGRFIKLLKKFENNTQFIIVTHNKKSMA
SCQALYGVTMEEEGVSKLIPVKIENARSEETAS
>CT0189 conserved hypothetical protein
MSEKKISILGCGWLGLPLARFLAGEGYTVKGSTTSEAKITKMKEAGVEPF
RIVIDESIEGDISSFLDSEILVVNIPPKRREDVVEYHVGQISLLIDALAD
SPVKHVLFVSSTSVYPASGGEVVESDAADPDAADSPAGRALMYVEEMLRS
ESAFNTTVVRFGGLIGPGRNPAEFIQRMTEITSPAHPVNLIHLDDCVHVI
AEIIRQEAWGETFNACAPLHPTRSELYAAAAESHGLAALQEERSSDTNFK
IVNSDRIVEKLGYTFLHPDPLAMARGGN
>CT1941 hypothetical protein
MTASGSQQQKRRKAATKEELRRKACKQKSSEGGKPVTASENRLNGLLIWQ
TSIQLLW
>CT0489 hypothetical protein
MGNSELQPVGKSAGGQAWLLLALDELLAIVRECINPKVSRSGLDRCLRRH
GRAL
>CT2007 glucokinase, putative
MPSWGLGIDLGGTNIKIAVVDELKGILFEDTQPTDVPSGPDGVVRQLAFM
ASELYQRATETLDAGEFAGIGLGAPGAVDAEKGTLSYPPNLPGWGRVALR
DELRLRLEEAHSLSSPVIIENDANAAAYGEAIFGGGNAFRDFMLVTLGTG
VGGGIILDRKLYRGPTGTAGEIGFMIVDFEGESVHAGIRGTIEGLIGKER
IVEMACSEQIGAERSPRLAELCNRDFSRLSPRHLEQAAREGDAAALRTWE
RIGTILGVGLANITALMDIRKFVIGGGIAAAGELIFKPAMMQLHRSTLPS
MHDGLEIVPARLGNKAGIYGAAALCFNAAKPPASDA
>CT0723 hypothetical protein
MDTVPKALLRELSPELAERFPVSRLSLRDIHLHEKTASANRRTPAVLVVR
LTKNRKGNYVTFKVLLYLGISFRRPSGQGGRSDFRRRAG
>CT0027 hypothetical protein
MLSSLSKLTPEQLEEIRSLEQQLGKTLLSFSEYDVVSDDLSEADLATIKA
LEEKLGTMLVAVRGLSQNA
>CT1853 3mg, DNA-3-methyladenine glycosylase
MKRLGADFYQMPTILLAERLLGKIFVHHEVSGRVTKGRIVETEAYLGDGD
EACHAWRGMTERNHVMFGPPGHLYIYFSYGCHYLANIVSEQKGIAGAVLL
RAMEPIEGIEWMQERRGTTDERALMSGPGKLTQALGLGPAHYGESLLGDI
CWLEEAPDIPPELIGTSPRIGISRSTDLLWRKFIAGSPYISKTQPGPPPK
KRKKGLESS
>CT0346 aat, leucyl/phenylalanyl-tRNA--protein transferase
MIKVEDILRAYRHGFFPMADSREGTVSWCQPYQRAVVPLDSFRPSRSLRR
VIGKKRFTIKINSVFEQVIRACSQPRSTGQETWLSEEIIKVFLKLHRLGV
AHSVESWQDGELAGGLYGLSMGGAFFGESMFFFRSDASKVAFAWLVGYLK
RKGYLLLDAQIMNPHLESLGAIEIPHEEYMVQLERALGKKISFV
>CT0161 accA, acetyl-CoA carboxylase, carboxyl transferase subunit alpha
MAAKVVLDFEKPLYELEEKLNEMRVYLKSGEVDSMSEGRAGLKREIESLE
AKVESLRKTIYKNLTRWQKVQLARHPERPYTLDYIYMMMKDFVELSGDRN
FGDDKAIIGGFARLEDESAGFSQSVMVIGHQKGRDTKSNLYRNFGMAQPE
GYRKALRLMKLAEKFRKPVITLIDTPGAFPGIEAEERGQAEAIARNLYEM
AALKVPVICVIIGEGASGGAIGIGVGDRILMAENAWYSVISPESCSSILW
RSWNFKEQAAEALKLTADDLLAQGIIDRIVTEPLGGAHHNPEQMASTLKS
ILIEELQGLLTKDARTLVDERIAKFSGMGVWEES
>CT0158 accB, acetyl-CoA carboxylase, biotin carboxyl carrier protein
MNLKEIQQLIEIVNVSALDEVIIKDGQSEITLRRNNSKAQAVLPTAPVVA
QPIPAPLPAVRQAVQEQPAPAAAEPAVSDLIEIYSPIVGTFYRSPSPDSL
PFVNEGDKVKPGDVLCIIEAMKLMNEIESEVSGTIVEILVENGQPVEYNQ
ALFRVKP
>CT0157 accC, acetyl-CoA carboxylase, biotin carboxylase
MFKKILIANRGEIALRVMHTCREMGICTVAVYSTADADSLHVRYADEAVC
IGPPLSRESYLNIPRIIAAAEVTNADAIHPGYGFLAENADFAEVCSSSNI
KFIGPSAEMINKMGDKNTAKSTMIAAGVPVVPGSEGLVEDVAHAIETAKK
IGYPVIIKPTAGGGGKGMRVVHEESQLEKNLKTAQSEAGMAFGNSGVYIE
KFLENPRHIEIQILADQHGNVVHLGERDCTVQRRHQKLIEETPSPVVDEA
LRTKMGEAAVAAAKAINYEGAGTIEFLLDKHKNFYFMEMNTRIQVEHPIT
EQRYDVDIVREQISIAAGGSLEGKTFIPRGHSIECRINAEDPEHMFRPSP
GEIQVFHTPGGPGVRIDSHCYASYVVPSNYDSMIGKLIVTAHNRDEAIAR
MSRALDEFIIMGIKTTIPFHKQVMHDPVFRSGEFDTSFLDSFRFEKP
>CT1555 accD, acetyl-CoA carboxylase, carboxyl transferase subunit beta
MVWFKRAIPSIRTKDKRDTPEGLWSKCDSCGAALHKKQLEDHLYTCPHCG
FHFRISPDLYFSFLFDDGKWEEFDAQLRAADPLKFVDTKPYPERVRSTMQ
KSGKSEACRNATGKLGGSDAVISGMDFGFIGGSMGSVVGEKIARAADKSV
ELNAPLIVISQSGGARMMEGAFSLMQMAKTSARLTRLGERGIPFISLMTD
PTMGGISASFAMLGDLNISEPKALIGFAGPRVIRDTIKRDLPEGFQRAEF
LQKHGFVDTIVHRKDLRAQLIKLLGHMK
>CT1525 ackA, acetate kinase
MPFVPVIFLRIFAMMPRPVCTILALNTGSSSIKFSLYESGDTEELLFSGS
LTRIGLPDGRFSVTDPEGRFIDCERVDVPDHAAACRYVFSWITQHGPGMP
DAVGHRVVHGGPRHTTPEKVTSELLDSIAEIVPYAPEHLPQALNAIRYAA
SELPGVFQVACFDTAFHSTMPPLAKLCPIPEEYRNQGVQRYGFHGLSYQY
ILSQLQAEGDPFARKGRLILAHLGHGASMAAVLDGRSVETTMGFSPAGGL
VMSTRTGDLDPGVVLFLLQQGHLDSDGLRDMVNKKSGLLGVSGLSDDMRD
LLDAEAENEQARLAGELFCYSARKHIGALVAVLGGLDMIVFTGGIGLYSP
DVRERICSGLEFLGVQVDHEKNLNHSGIISTDTSRVVVKVIETNEEVTIV
RETRRVLENG
>CT0543 acn, aconitate hydratase
MSLISQYRAHTAERAQLGIPPLPLTAAQAIELIALLKQNPVQEQEYLLDL
FVNHISPGVDDAALEKAAFLDAIIRGNASCAVITPVEAVRILGTMLGGYN
VKPMIDALSSADNAVAEAAALALKKTLLVYDSFDTVVELAKTNSYAKEVL
ESWANGEWFTSKPALAEKMTLTVFKVPGETNTDDLSPASEAFTRSDIPMH
ALSMLRSKMDDPIATIAQLKEKGYPLAYVGDVVGTGSSRKSGINSVQWHL
GEDIPAVPNKRTAGVVIGGIIAPIFFNTAEDSGALPIQADVTSMEMGDVI
DLYPFKGVIEKNGKVISTFSLEPNTLADEVRAGGRIPLIIGRNLTRKARK
TLGLGEESIFSRPEQPADTGKGYTLAQKIVGKACGLPGVRPGMYVEAETL
TVGSQDTTGPMTRDEIKELAALSFSADLFMQSFCHTAAYPKPSDVQLHRS
LPYFIMSRGGVALKPGDGVIHTWLNRMVLPDTLGTGGDSHTRFPIGCSFP
AGSGLVAFAGVTGTMPLNMPESVLVRFTGELQPGITLRDLVNAIPYVAIK
RGLLTVEKKGKKNIFAGKVLEIEGLPQLKVEQAFELSDASAERSAAACTV
RLDKEPVIEYLQSNVKLLAQMIEEGYGDANTIKRRIGKMEEWLANPQLLE
PDADAEYAAVIEINMSEITEPILACPNDPDDVATLSEILADEKRPKNIDE
VFVGSCMTNIGHFRALGEVLRGKGQAKTRLWVVPPTKMDMKKLIEEGYYS
IFGTAGARTEVPGCSLCMGNQARVADNAVVFSTSTRNFDDRMGKGAKVYL
GSAELAAVCALLGHLPSKDEYMEIAGSLSKNSDKVYRYLNFHEVTSQELQ
LLVD
>CT2117 acpP, acyl carrier protein
MSAAEIKDKVYDIIVSKMGVNKDQIKPESKFADDLGADSLDTVELIMELE
NEFGVQIPDEDAEKIGTVQQAIDYIVNKKVS
>CT1935 acpS, holo-(acyl-carrier-protein) synthase
MEIGVDIVEIARIRSSYDRFGEAFMKKILTSAEMAQCLSKPDPVASLAGR
FAAKEAVSKALGTGIAKGLTWHSIEVLNDETGKPCVSVYAPSFSGRVSIS
ISHDRYSAVAMALFEPR
>CT1652 acs, acetyl-CoA synthetase
MEQYEKLYADAAADPDKYWGDLAEQFHWFKKWDSVLEWNAPYAKWFNGGT
TNIAYNCLDVHVGSWRKNKAAIIWEGEEGNERILTYGELHRQVSKFANVL
KIAGIKPGDKVAIYMGMVPELVIAVLACARVGAVHNVIFAGFAAHAITER
VNDSRAKIVICADGTRRRGSTINLKNIVDEAIINTPSVKNVIVLKVTGEE
IHMHDGMDHWWHDLMGLAVDECEPAQVDSEHPLFLLYTSGSTGKPKGILH
TTAGYMVHAASSFKYVFDIKDEDIYFCTADIGWITGHTYIIYGPLLNGAT
VFMYEGAPNYPQWDRFWDIINRHKITILYTAPTAIRAFIRAGNEWVTKHD
LSSLRLLGTVGEPINPEAWMWYHKYVGQEKCPIVDTWWQTETGGIMISPM
PGATPTKPGTATRPLPGIMVDVVRKDGTPCNANEGGYLVVKRPWPSMLRT
IYGDNERYEKTYWSEFPGMYFTGDGARKDDDGYIWIMGRVDDVVNVSGHR
LGTSEVESALVSHEAVAEAAVVSRPDEIKGNALVAFVTLKDGYEGDAKLR
DSLGKHVAKEIGAIAKPDEIRWAKGLPKTRSGKIMRRLLRELATSNEIKG
DVTTLEDLGVIENLREQEDEG
>CT1185 adk, adenylate kinase
MRIILLGAPGAGKGTQAQYISEAFGIPQISTGDMLRAAVKAATPLGLAAK
KIMDEGGLVPDDLIISLVKERIAQPDCANGCLFDGFPRTLAQAEALRAGG
IRIDHIIEMNVPDEEIIKRMSGRRVHLASGRTYHVTFNPPAVPDKDDLTG
EPLVQRNDDCEETVRKRLKAYHELTEPLVGYYRNLSMNGSADAPKYSRIA
GIGTVEQIKDEIIATLNG
>CT0166 alaS, alanyl-tRNA synthetase
MNSREIRQSFLDFFAGKEHRIVRSAPVIPAEDPTLLFTNAGMNQFKDVFL
GKGTREYTRAADTQKCIRASGKHNDLEDVGRDTYHHTFFEMLGNWSFGDY
YKKEAIGWAWELMTEVWKLPKERLYATVYHDDEESFKLWQSETDIEHSHI
LKFGDKDNFWEMGETGPCGPCSEIHIDLTPDGSGGPLVNVGDHRVIELWN
LVFIQYDRQADGTLVPLPQKHVDTGMGFERVAAVLQGKSSNYDSDVFAPL
FDRITELTGTVYTASLDSPSDIAMRVIADHCRTLTFALSDGAMPGNEGRG
YVLRRILRRAVRYAGTLGCHEPIIYKMVEVLVRTMGDVFPELEKQQATVE
KIIRAEEESFLVTLGRGTEIFNEVVADMKTAGSTTISGADAFKLYDTFGF
PLDLTRLMAAEVGLGIDEAGFEHCMMEQKTRARMDRKGKMQMQDDGGEWQ
WFAPEAPTTFVGYDMLETEATLVAARVSGDKLLMVLDQTPFYAESGGQVG
DHGTLETAGYRLDVTDTRKDGEQIVHVVTSAHDKVRDCAVTPADVSFDGV
VSVKAAVDRDRRVATERNHTATHLLHAALRKVLGEHVQQKGSLVTPERLR
FDFSHFSKVSAEELEQVEHEVNAEIRKAASVTKHADVPYEEALALGALAF
FGDKYADRVRVVEVPGISIELCGGTHVGNIGQIGMVKIVSESSVAAGIRR
IEAVTGAAAEALLWQEYRDLQEIKNLLKLKADEEAGPKIKELLEEKKALD
KQLQESRLAGLLDRLAASLAGGEEVNGCRIMTERLDGVSGDELRQAAVAL
RERVPCAVGLLCGVADGKVSLVAFASDEAVKSLKLDAGKLVKEAAASVKG
GGGGKLELATAGGKEPAGIDKAIEVFVASVKSALQ
>CT0706 ald, alanine dehydrogenase
MKREAENPNHFSAMNIGIPREIKIRETRVACTPAGVRQLVGAGHRVVVER
GAGEASGFSDEKYRLAGAVLASSAEEVWKSELVVKVKEPLAEEYRFFRKE
LVIFTYLHLAGVPGLAKALVDSGVTAIGYETVEVGGRLPLLAPMSEVAGK
MSVLMGGYYLSKHNGGEGKLLCGVPGVLPGRVLVLGGGVAGMSAARIAAG
LGAEVTVMETNHDRMRELEAQLPAEVHTIYSNEQHLEELLPGTDLLIGAV
LLPGATAPKLVTRGMVQSMKPGAVIVDIAIDQGGCVETSRPTSHVDPVFI
EEGVVHYCVTNMPAAYPATSTEALTGVTLPYVRRLADLGLESAMVVMPGL
AGGLNVWNGKITQEAVARSLGMACGENPFA
>CT1886 aldA, aldehyde dehydrogenase
MTYDASTHAGLRCYFDSGQTRPFEWRRAQLRGLDAFLREREHEIAAAVHA
DLRKPVAETWLTETAWLRSEIRFVLKRLHRWMRPKKVGVPLHYQPARAFV
ERDPLGVVLIIGAWNYPLQLCLAPLIGALAGGNVSVVKPSEMAPATSALL
ASELGRYVDPQAVRIVEGDGEASARLLEHCFDHIFFTGSRRTGQAVMQSA
ARHLTPVTLELGGKSPVIVTEKADLRLAARRIAWAKFLNAGQTCVAPDYL
LVQEGVKEPLQGLMKEALRLYYGSDPEASADYGRIVDDRNFRRLEALLCE
GSLVEGGGSNKASRYIAPTILHDVPEDSDLMKEEIFGPVLPVRSFTTLEE
AVSMVRALDAPLAVYLFSRDRSELRYLIEQTRSGGVCCNDLLFQASIPGL
PFGGRGMSGMGRYHGKAGFDTFTTERSVLDRGGFPDPDLRYPPYSSGRFD
LLKKIVTLFS
>CT1922 alr, alanine racemase
MSASHEQNSAAAPNGPNLSEALISLGNLRHNLACIRAITGPQCRVMGIVK
ANAYGHGATQVTATLEAEGVRDFGVANIYEAIELLQEHRMLPDSRILAFA
SPLAGHIDLYLQHGVEMTVCDHETARAAESIAAACGRRLQVQLKVDTGMG
RLGVTPEEAAELLELIEACPNLELTGIYTHFAESDKPEGFTARQLERFLH
VTGAYERRTGKTVTKHAANSGAIISMPDARLDMVRPGILLYGCHPVDAAP
STVPVRPVMQFQSRVIFVKEVPAGTAISYNRTWSAPKATRIATISAGYAD
GFHRALSNQARVSIGGKSFPQVGTITMDQTMVNLGSDDSVKVGDTAVLFG
WDGPSAGEQALAAGTISYELLCSVSRRVRRIVV
>CT0054 amiB, N-acetylmuramoyl-L-alanine amidase
MLLHVTNGPEQGYTIKVQGSTRPDGVFLADIGSFARALRLGSVFDGKQMQ
IDEAFGDTVTSCILLEGSDFAVIASAAGDTPKRVLQLASAPVIRQDKVYL
PVDQACRMFTLWLGRQLRYNASENRIDATLDSRRPDASLQSIGRLPNDAL
PPNPPEQAAPSSDNRTDARAAGGKTTIDDIKIETRANGAIIRFSASGDMR
NASFLRPDNSGILYLTIENATGNPARLARNYPEGAIKSISPKLLDSGALQ
FTIALDIPTSSIKSSSFKYDAAKNDYVISIMNDVDVEAIHLSEKERRIQE
GLSRDVDKWKLDAVVIDAGHGGKDPGAIGTQGTHEKDVALNIARDLGMFI
RQKWPDVRVIYTRKEDRFIPLKERGQIANRYGGKVFISIHCNSAPNNSKV
RGPEVYILGPHKTDAALKVAMFENSVITGEENFAEKYKGFSDEYLIMSSM
AQSAFATQSTELAQDVLQRLSRNNSNNGLGVRQAGFMVLWTPSMPSILVE
TGYLSNPSEEKLLRDRKEQTKIAYAIFQGLQQYRSRYETRMTTAARPVN
>CT0498 amt-1, ammonium transporter
MVQKLIFILTGVLLMLGIAMGGDLHAYTQPDPSGAKVGVTADISAANPGK
PTAAEITDQIGKNKVSINMVWVLITGFLVMFMQAGFAMVEAGLTRAKNAA
HTMSMNMMIYPIGMLGFFVSGFAIMFGGLGTLGTLGGYEGLNQEFAINLF
GHTIGLFGMKGFFLNNVYDVGVFALFLFQMVFMDTTATIPTGSMAERWKF
SAFAIYGIFVGTIIYPLYGNWVWGGGWLATLGSNFGLGHGHVDFAGSSVV
HMTGGVLALVGAIIIGPRIGKYNKDGSPNAIPGHNIPMAIVGAFILAFGW
FGFNPGSTLAGTDLRISVVAVNTMIASATGAAASMLYMWLFKTKKPDPTM
MINGLLAGLVAITAPSAFVNVQAAALIGLISGVLVIEAAFFIEKVLKIDD
PVGAVAVHGVNGAWGCLALGLFADGTYGDGWNGVAGPVTGLFYGNPGQFY
AEVIGVLANIIYVGAIGWVVFKLIDVTIGNRVNPDDELDGLDIPEMGVEG
YCGIRLDKNSETPLSK
>CT0986 amt-2, ammonium transporter
MKSFLSKSGAIAAGLLLAAATFSPPIQAATPTPDAVNAFAIDNFFLFICA
VLVLFMQAGFALVETGLNSAKNAVNILFKNLMDMAIGGIIYYFVGYGLMY
PGEAFSGGFFGFGGPGISTAMPEIAGGTLYPAVDFLFQVAFAATAATIVS
GAVAGRMQFRAYLIYSAVISAVVYPISGFWLWGGGWLKALGFHDFAGSLL
VHALGGFAGLAGAIVLGPRIGRFNEDGTPNAMPGHNLALSTLGVFILLIG
WYGFNPGSQLAIVGGDNTAAVMKIAVNTTLAACAGAVVAMIFAWGLFKKP
DLTMALNGMLAGLVAITANCDVVSYNASLIIGAVGGILVVLGIMLLDKLR
IDDPVGAWPVHGLNGIWGGVAAWIFGGQPMVAQLVGSLVVPFWGFVTMMV
LFLILKAMGILRVHKDEEMKGLDISEHEEEAYYGFDIYTTQ
>CT0133 amt-3, ammonium transporter
MAKKISAAVMLFFAVMMAHGTVWAGEPVSIDTGTTSWMLTSTALVLLMVP
GLAMFYGGLVRTKNVIGTMMHSFGAMVIIGVLWPLVGYSLSFGHGILGGL
VGWDPKYFMLQGIDDTIMASHIPEYVFAMFQGKFAIITPALIAGAFAERV
NFKGYLLFIALWSIFVYSPICHWVWAGDGFLFNLGAMGAIDFAGGTVVHI
SSGVTGLVAALFLGSRRGYPSNVTSPNNLVMTLVGAGLLWVGWFGFNAGS
AIASDLATARALTVTQVAAASGAFTWLIIELVHHNKATSLGVASGILAGL
VAITPAAGVVQPSGAFALGAIAAIVCYLGLMLKSKLGYDDSLDAFGVHGV
GGIVGALCLIFFIRPSWMADAAVKAGGSWTVWQQLGVQATAVGITVVYAA
VVSLVILFIVEKTIGLRAKENDEMSGLDHSMHSEQGYGLINPN
>CT0865 apsA, adenylylsulfate reductase, alpha subunit
MGVEKNEFKFSQKPEVVYVDTDILLIGGGMACCGAAYEAAKWATPKGLRI
TMVDKAAVDRSGAVAMGLSAINTYCGENDPADYVKYVRADLMGIIREDLV
YDLGRHVDNSVHLFEEWGLPIWKRDEDGSTMDGAKPAPKLTEGGKPVRSG
RWQIMINGESYKVIVAEAAKKALEYNRKETGVEQNLYERVFISELIHDKN
NPSKVAGAIGFSVREHKAYVFTAKTMLLACGGAVNVYRPRSTAEGQGRAW
YPVWNAGTTYALAAQAGCELVLMENRFVPARFKDGYGPVGAWFLFFKCKA
TNSLGEDYCATNLAAANKDFGKYAEDPHKLTTAMRNHMMMIDMKAGKGPI
LMRTHEAMAALAETMTPKQIKHLEAEAWEDFLDMCIGQAVVWAGNNIEPE
KTPSELMPTEPYLLGSHAGCAGIWVSGPGDIAGVPAEWSWGYNRMTTVDG
LFTAGDGVGASGHKFSSGSHAEGRIAGKSMTAYCLDHADYKPELGRDVDE
VIAEIYAPMETFAKYKDYSTDPSVNPNYIRPKMFQARLQKIMDEYVAGVS
TWYTTSKTMLEKGLEHLSLLKEDAEKMAAADLHELMRAWENYHRLMAGEA
HARHILFREDSRYPGYYFRADHFYVDDENWKCFTISKYDRDSKEWTLSKR
DYVQVVPD
>CT0293 apt, adenine phosphoribosyltransferase
MPIKSRIRTVPDYPKKGIMFRDITTLIKDPVGFRLVIDNMTQHYLSNGVD
FDAIVGIESRGFILGGALAYTLGKGFVPVRKPGKLPADVVQLEYELEYGT
DKIEMHTDALVQGQRVLLVDDLLATGGTALAAAGLVEKLGGVVASMAFIV
NLPDIGGEKKIREKGYNIYFLTEFEGD
>CT1111 argB, acetylglutamate kinase
MLVEALPYIRKFEGKTFVIKYGGAAMKDEVLKNIFAENVTLLRKVGIKVV
IVHGGGDAITKTSAKLGLETTFVHGKRVTDRQTVDVIQMTLAGKVNQDIV
QLINKDGGNAVGVSGLDADTILAKPSPNASTLGLVGEVAEINTRYIDLLC
DAGLIPVIAPIGYDMEGNIYNINADDAAAAIAVALKAEKLIYVSDVEGVR
VGNRILKTICKADAAELIEKGIITGGMIPKVVSAYQTLDGGVGKVHLIDG
QITHSLLLEVFTNEGVGTQFVNELEQEPTAEEGAS
>CT1109 argC, N-acetyl-gamma-glutamyl-phosphate reductase
MNHIPMQNKKVTVSVIGASGYSGAELVKLLMKHPGIVIEELYAHTQAGKR
FTELYPSIPCDKTFQTYAGQTNSDVYLLALPHGEALQLVPGIVAAGKKVI
DLSGDFRLKNTAEHKRFYGGDKSAEDVLQYGMPELFRDEIAGSTAISNPG
CYATSIILGLAPLFLGGMAGLDVESVNVTAVSGISGAGRSAKLELSFSEM
SGNMRAYKVGKHQHTPEIMQTLGTSVTDPSFRFVFTPMIAPYVRGIYSVL
NVRLASPVAMEPVRELYAGFYANAPFVRLRDGVTEVSHVAYTNFCDISLA
FESDGSLVIITAIDNLVKGAAGQAVQNMNLMLGFGETTALL
>CT0367 argD, acetylornithine aminotransferase
MNTPAINLETEKQLFFHNYARLPLDIASGKGSFLYTASGERYLDMIAGVG
VNAIGYGDKRLEQAITEQASKYIHVSNLFMQKPQFDLAAKLLEISRMSKV
FFCNSGTEAIEAAIKLARRFAARNGDTDKTQVLSLTNCFHGRTYGALSLT
AKPKYVDGFEPLVPETGMIDFNDVEDLERKVSNRTAAVFVEFVQGEGGIH
KVSEAFIAKLKELAKEHDFLIVADEIQAGCGRTGAFFSYMPFDIQPDLVC
VAKPLGGGLPLGAIIGSEKVAEVFTPGSHGTTFGGNPVACAAGLAMIEAI
LADGLMQNALEVGSMMRTAFEKMAEKHAQILEIRQYGLMIGVTVHREAKY
YVEEALKRGVLVNATSNNVIRLLPPLSISKEEAQLCLDTLDAIFTEEAKA
>CT1112 argF, ornithine carbamoyltransferase
MSQDNKGNGSKKRDFLGFTGLDAAKIIELFDYSLFIKQQRETNRNSDEFR
PIRHKTVAMIFNKPSLRTRVSFELGVYELGGHAISLEGKSIGVGERESIE
DIARLLSRYNDAIVARLHEHEIIETLAKHADIPVINALTDLSHPCQVLAD
AFTLYEKGLWRDDIKVVFVGDGNNVANSWIELAGILPFHFVLACPEGYLP
DETLLKQARSNAKGTIEILHDPMEAAKQADVLYTDVWTSMGQEEEMAERL
KAFAPFQINAKMVAEAKPSAVIMHCMPAHRGQEISAEVMDGPQSIIIDEA
ENRLHVQKALMVKLMNHDVYRKFHLTHRLHRAANRLKA
>CT1114 argG, argininosuccinate synthase
MSKEKIAVAYSGGLDTSVMIKWLKDKYEGAEIVAVTGNLGQKMEVDNLEQ
KAIATGAKSFHFVDLRKTFVEEYIWKALKAGALYEDVYPLATALGRPLLA
KALVDVALAEGCTMLTHGCTGKGNDQVRFEVAFAALAPHMKVVAPLREWE
FTSREQEIAYAMEHNIPVSATKKNPYSIDENIWGISIECGVLEDPMVAPP
ADAYQITTAPELAPDEPTVVDIEFAQGVPVALDGQQMEGLDLIVRLNELG
AMNGVGRLDMIENRVVGIKSREIYEAPAATILHFAHRELERLTLEKSVFQ
YKRNIGQDYANLIYNGTWFSPMRKALDAFVDETQKPVTGMVRIKLYKGSM
TLLGRTSPNSLYNEALATYTEADTFDHKSAEGFIKIYGLGLKTFHEVNKS
E
>CT1055 argH, argininosuccinate lyase
MSDQSNQKELLWQSRFSEPFDREALKFSSSVHVDGLLYREDIQGSIAHAT
MLGEQGIISKEEAGQIVTGLKAVEKEIESGELTPVWEDEDIHTVIENRLK
ELIGPTAGKLHSGRSRNDQVATDTRLYLRRNIDRIGELLKAMQSTLLDKA
EQYKHTIMFGYTHLQRAQPISAGHYYMAWHSMFGRDAQRLADLRKRANIS
PLGAAAFAGSTLPLDPARSAELLEFDGVFTNSIDAVSDRDLVIEFVSACS
MIMMHLSRFSEDVILWSSAEFNYLSISDAFATGSSIMPQKKNADIAELVR
GKTGRVYGNLMNLLTIMKGLPLSYNRDMQEDKPPLFDTAETTASSLSVFR
RMIEKTWLNEERLARLTAEDLSLATEIAEYLVKKQIPFRDAHRITGKIVA
YAIEQGKTLPTISLDEYRTFSEAFDEGIYDDLKPDASVNSKKTAGSCSFK
SVEEQIARAKAAR
>CT1110 argJ, arginine biosynthesis bifunctional protein ArgJ
MRPVVADPASDAVFWPEGFTLAGINCGIKPTSKDLMLMLCDEPASTASVF
TTNLCCAAPVELSKAALSASGGKMRAVICNSGNANAATGAAGMQNARLMA
ESTAQAFDLNAGEVLVASTGVIGQQLPVEKIAGAMPSLKATSGSTQWCDA
AEAIMTTDTFPKAFGVDVKLSGGTARIAGIAKGSGMICPNMATMLAFLGT
DATIEPALLQELLAAANAVSFNAITVDGDTSTNDMAAIMASGKGPEVLRG
SDNARLFGEALRSVMTMLAQLIIVDGEGATKFVELRVTGAKSNEEARMAA
MTVANSPLVKTAIHGEDANWGRIIAAAGRSGASFVQEELSVYFDDEPILK
PGLDANFSEERAKEVMSKEEFTITLSLGKGAGRATVWTCDLSHGYIEING
SYRS
>CT1113 argR, arginine repressor
MNKASRQKKIRELIENHEVSGQQELLGMLEKEGIVVAQATLSRDFAEMGV
VRNRTSDGGYRLAVPEEPQVDIIKGLVGMEVLSVASNETSIIIRTLPGRA
HGVGSFLDRLTNDNILGTIAGDDTVLVIPSTVRKISSVKSYIQKILSQP
>CT0015 argS, arginyl-tRNA synthetase
MRAFFLPFIQDALQKAGIETDKEIQIDKPNDKKFGDFSTNIAFLVAKEAR
KNPRELAGQLIGLLDFPEGTVTKTEVAGPGFINFHLAPAFFMRSAQEVLA
KGEGFGCNESGKGLKAIVEYVSANPTGPLTIGRGRGGVLGDCIANLLETQ
GYEVTREYYFNDAGRQMQILAESVRYRYLEKCGQVIEFPETHYQGDYIGE
IAETLFIEHGDGLAATDELTIFKEAAEAVIFSSIRKTLERLLITHDSFFN
EHTLYQSREGQPSANQRVIDALDAKGFIGNYDGATWFMTTKLGQEKDKVL
IKSSGDPSYRLPDIAYHVTKFERGFDLMVNVFGADHIDEYPDVLEALKIL
GYDTSKVKIAINQFVTTTVGGQTVKMSTRKGNADLLDDLIDDVGADATRL
FFIMRGKDSHLNFDVELAKKQSKDNPVFYLQYAHARICSLVRMAEKEVGF
DEATAIGAGLPLLSSEPEIDLASALLDFPDIIQSSLRQLEPQKMVEYLHT
VAERYHKFYQECPILKADEHLRTARLELSLAVRQVLRNGFKILGISAPES
M
>CT1918 aroA, 3-phosphoshikimate 1-carboxyvinyltransferase
MSTFQGEVTTLPPDKSISHRAALIGALAEGTTEISNFSGGYDNQSTLSVL
RDAGISIRQEELSAGDGRIERRVVIESNGLWSFREPSVPLMCNNSGSTMR
MMAGIMAAQPFRSELVGDASLMKRPMKRVADPLRQMGAEISLSDAGTAPV
VIHGTKALKTIEYRLPVPSAQVKSLVAFAALHADGQSKIIEPIRSRDHTE
LMLGLATIDRPDGVREIIIDGRKPIAAKPFKVPADPSAACFMIALGLLGE
RSEIVLRNVCLNPTRVAYIDVLQEAGAGLGIENVRSEGGEPVGDIVVRSC
SGLKPLRISDHGVVAGVIDEVPMLAVLSAFASGEFELHNAAELRTKESDR
IDALVVNLQRLGFECEQYADGFVVKGRKTVASEEVEIECFDDHRIAMSFT
IAAEAAGASLRLSDRDVAGVSFPNFFALIDSLRQ
>CT1406 aroB, 3-dehydroquinate synthase
MQTPSSHIIVRTPVIDSVGELYATRGLGKKTVLLFDENTRRLFGDAIIES
MQRQGFRTIELVVPARETSKSVSTAWKLYGQMIEADVDRSWNLLCAGGGV
VGDLGGYIAASYYRGIPVVQLPTTLLAMTDSSIGGKVAINHPLGKNLIGF
FHMPELVLIDPAYLRTLPSREIYGGMSEVVKYGFIADREFFDLLTKHWDE
VVRLEEPWLSKAVSRSAFIKANVVEKDFRETSGLRATLNFGHTFAHGLEK
MAGYRNLRHGEAVTIGMVCALFLSHRPGFLAEADLREGLALLARFRFPRG
LVNKRFLSLDRDELVESMLSDKKKIDKQLRFVLLDRIGHAFLHDKDITKT
DVLQAIDDAMGWFSSAGF
>CT1432 aroC, chorismate synthase
MIRYFTAGESHGPALSAIVEGMPAGVALTESDINDQLARRQQGYGRGGRM
KIETDRAEVLSGVRFGKTIGSPVAMLIRNRDWENWTTSMAQFEDHATEVQ
KITIPRPGHADLTGFVKYGFDDIRPVIDRSSARETAARVAAGSLARAFLR
QLGIQIGSYISTIGPVSEAAAPASLQELLDAGAESLAAEADKSPVRMLDP
EAETAAIAAIDQAKADGDTLGGIVELYITGVPMGLGSYVQHDRRLDSELA
AAIMSIQAIKGVEIGPAFDNARKPGSQVHDELFAGGEKGLRRETNRAGGI
EGSMSSGQPIHIRAAMKPISSLVSPLSSFDLATLEAVQSRFERSDTCAVP
AAGVVAEAVVAPVIANALLEKLGGDHMAEIKERLEVYRAALRMRFEK
>CT1809 aroE, shikimate 5-dehydrogenase
MSNSQSKKIFGLIGKHVDYSWSPLIHNTGFEALGLPCVYTIFNIPSPEMI
GDALKGSRALGIAGFSVTIPYKKTVVPFLDELSPEALSIGAVNTIVNDNG
RLLGYNTDIDGFAAPLLPMAESIRNRPVCIFGNGGAALAAVEAFRLRFNP
SSVLLVVRDTQKAEDMLEEYAYRDLVTIHAGREIDQPACSKLIRDCRVLV
NATPVGTAGRNDHIHSILPTGHGLLHDGQIVYDMVYNPPETPLLAEARAA
GATVIAGIEMLIAQAARAFSIWTGQELPVDLVRKTVLAAIEKSEG
>CT0981 aroF, phospho-2-dehydro-3-deoxyheptonate aldolase
MQQLQDLRVSRIITLSSPKALKEKLPVTEHIADTVYKARQEVENILTGKD
SRMLVIVGPCSIHDIKAARDYASRLFALRKELESEFCIVMRVYFEKPRTT
IGWKGFINDPHLNDTYDIEHGLFHARKLLIELNEMGLPAATEFLDPISPQ
YVADLISWAAIGARTIESQTHRQMASGLSMPVGFKNATDGRLQVAIDAIR
SAMHQHSFLGIDQDGHSSVITTKGNPFGHLVLRGGSSKPNYDADSIAHAE
KQLRKADLTPHLLVDCSHANSGKKCGNQLKVWADILAQKEQGNTSIVGVM
IESNIFSGNQPFPVDPRELQYGVSITDECVSWEETERMLRDGAAFMKQVR
SKA
>CT1405 aroK, shikimate kinase
MKHHSLIFLTGFSGSGKSTIGPLLANSLGFEFIDLDREIELTAGKSINRI
FAEDGEAAFRSLELRTLEKIGQQERMVVSLGGGVLENDRCFELIRSHGTL
IYLKSSPEILTLRLQHKTDRPLLKGPDGRKLTREEIQQRIAELLKKREPR
YLKADLVLFTDSKKIGASVEELTRKIERHIRRASKNNTNEK
>CT1190 aroQ, 3-dehydroquinate dehydratase
MMSATSLLVMNGPNLSRLGKREPEVYGSLTLDEINRGIAVAFPEVSFEFF
QSEHEGALIEKLFEIEGRGGFSGVVLNAGALTHYSIALRDAISAVTMPVV
EVHLSNVHKREEFRHKSVISAVCIGVIAGFGVESYHLGVRALLGRGNR
>CT1725 arsC, arsenate reductase
MKKSILVLCTGNSCRSQMAEGFLRSFDSELDVFSAGTVPASEVHPLAVQV
MREKGIDLSANRPKKVDEFLTKPFDYVVTVCDSAKESCPLFTGQVKHRQH
IGFDDPAAATGSSEEVLAEFRRVRDEIEQEFGKFYRETIRGEKP
>CT1928 asd, aspartate-semialdehyde dehydrogenase
MSSEAGYRVAVLGATGLVGRTMIKVLEERNFPVSELVPLASPRSRGEVIT
FNGREFVTEVPSAEIFKGVDIALFSAGATVSKEWAPVAAEAGAIVIDNSS
AFRMDEGIPLVVPEVNPETIFDREGKAAPIIANPNCSTIQMVVVLKPLYD
AYGIRRVVVSTYQSVTGKGKAGRDALESELAGNIPDEFTHFHQIAFNAVP
QIDAFTENGYTKEEMKMVNETKKIMGDDTIQVSPTTVRIPVYGGHGEALN
VELRSDFDIDQVRALLAGSPGIVMQDDPSARLYPMPMTSYERDEVFVGRL
RPDYWHPRTLNLWVVADNLRKGAATNAVQIAELLVGNL
>CT0864 aspB, adenylylsulfate reductase, beta subunit
MPSFVIKEKCDGCKGQERTACMYICPNDLMKLDVERMKAWNQEPDQCWEC
YNCVKICPQQAIEVRGYADFVPLGGNVIPLRGTDAIMWTIKFRNGILKRY
KFPIRTTSEGSIDPYSGKPEPDYANLKKPGFFNMTEYPTL
>CT1242 aspC-1, aspartate aminotransferase
MFDEIEFDRIKRLPKYVFAAVNELKMAERRAGRDVIDFSMGNPDGPTPQH
IVDKLVESVSKPKTHGYSVSKGIYKLRGAIGGWYRDKYNVDLDLDREVVV
TMGSKEGYVHLVQAITNPGDLAIVPDPCYPIHSQAFILAGGNVHRMKLEM
NEDYTLDEEAFFHNVETALRESSPKPKYLVVNFPNNPTTATVELPFYERL
VELARRERFYIISDIAYAEITFDGYVTPSILQVSGAKDVAVESYTLSKTY
NMAGWRMGFMVGNAKLVGALEKIKSWLDYGTFTPIQVAATIALTGDQSCV
REIREVYRKRRDVLISSFANAGWDIVSPKASMFVWARIPEQMRAMGSLEF
SKKLLIEGKVAVSPGIGFGTYGDEYVRVAMIENEERIRQAARNIKRFLKN
NAGS
>CT1381 aspC-2, aspartate aminotransferase
MGLSLGERCGLVMQSDIRAMSIECARMGGINLSQGVCDTPVPPVVLQGAA
DAVMQGANIYTHHTGIRELREAIVAKQRRFTGVAFDAEREVVVSAGATGA
MYCAFQALLNRGDEVIVFEPFYGYHVSTLRAVEAVPLFVPLDPSDGWSFR
VEALEAAVTPKTKALLINTPANPSGKVFSCDELALLAEFAIRHDLFVFTD
EMYEHFVFGGLSHVSMASMPGMRERTITISGLSKTFSVTGWRIGYALCDP
RWAASIGYFNDLFYVCAASPLQVGVTAGLRLLGDDYYHGLAAEYEVRRDL
FCAALAEAGLEPHVPQGAYYVLAGTERLPGATAREKAMHILRETGVASVP
GSAFYHDGGGENLVRFCFAKESHVLEAACEKLLKLR
>CT1323 aspS, aspartyl-tRNA synthetase
MSKEPTTADGLQNRFRTHYCGRLNRKSEGELVRIAGWVHRIRDHGGLIFI
DLRDHTGICQLVVLPENESQFKLAETLHSESVISAEGKVVLRSDETVNPR
LASGAIEVVVSSIQIESNADPLPFPVADDMPTSEELRLKYRFLDLRREKL
HENIIFRSKLTAAVRKYLTDLDFIEIQTPILTSSSPEGARDFLVPSRLHP
GKFYALPQAPQQFKQLLMVAGFPRYFQIAPCFRDEDARADRSPGEFYQID
IEMSFIEQDDLFVILEGMFKHLVENMSHKRITQFPFPRISYKDVMNRYGS
DKPDLRIPLEIQDVTELFVNSGFKVFASNTAEGSCVKAMVLKGMGNESRL
FYDKAEKRARELGSAGLAYIQFKEEGPKGPVVKFLSEAEMNALKERLGIV
TGDVVFFGAGKWEKTCKIMGGMRNYFADLFPLDKDELSFCWIVDFPMFEY
NEEDKKIDFSHNPFSMPQGEMEALESKFPLDILAYQYDIVCNGIELSSGA
IRNHRPDIMYKAFEIAGYSKEEVDARFGHMIEAFKHGAPPHGGIAPGLDR
LVMILRDEQNIREVIAFPMNQSAQDLMMAAPSEVTAQQLKELCIRIELPE
EEK
>CT2033 atpA, ATP synthase F1, alpha subunit
MSTAVRPDEVSSILRKQLAGFESEADVYDVGTVLQVGDGIARIYGLSKVA
AGELLEFPHNVMGMALNLEEDNVGAVLFGESNAVKEGDTVKRTNILASIP
VGEAMLGRVINPLGEPIDGKGPINTDIRLPLERRAPGVIFRKSVHEPLQT
GLKAIDAMIPIGRGQRELIIGDRQTGKTAVALDTIINQKGKGVQCIYVAI
GLKGSTVAQVVNTLEKYGAMEYTTVVSATASDPAPLQFIAPFAGATIGEF
FRDTGRHALVVYDDLSKQAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLL
ERAAKITDDIEVAKKMNDLPEPLKPLVKGGGSLTALPVIETQAGDVSAYI
PTNVISITDGQIFLESNLFNAGQRPAINVGISVSRVGGAAQIKAMKKVAG
TLRLDLAQFRELEAFSKFGSDLDKSTKAQLDRGARLVEILKQGQYIPMPV
EKQVAIIFIGTQGLLDSVDLKQVRRFEEEFLAMLEQKHPEILSSIAEKGT
LESDIASKLKELGEKYVASFKEKSKA
>CT1029 atpB-1, ATP synthase F0, A subunit
MHLSSDEVILWQSGFLKLNLTIVTTWALMLLLAGGSALITRRLSTGITIS
RWQSMLEIIVTMAHRQISEVGLQKPEKYLPFIAALFLFIATANLCTVIPG
YEPPTGSLSTTAALALSVFIAVPLFGIAESSLVGYLKTYAEPTPIMLPFN
IVGELTRTMALAVRLFGNMMSGDMILVILLTISPLVFPVLMNILGLLTGM
VQAYIFSILATVYIAAATRTR
>CT0021 atpB-2, ATP synthase F0, A subunit
MHHILDNSTFSFEPFGEVHLPHLEVAGFDISITKHVVMIWLAAILLVVIA
SAAGASVKKMSANQAPKGVANVFESLVDFISNDVAKPNIGHGYEKFLPYL
LTVFFFILVCNLLGLIPYGATATGNINVTLTLSVFTFVITQFSAFKAQGV
KGYLQHLTAGTHWALWIIMVPIEILGQFTKPFALTIRLFANMTAGHIIIL
SLFFISFILKSYIVAVAVSIPFAIFIYLLELFVAFLQAYVFTMLSALFIG
LATAHSDSHDGHELEATARHGDGLTV
>CT1032 atpC-1, ATP synthase F1, epsilon subunit
MRLKILLPYRVFAIKERVLNIVAETELGSYGFLPNRLDCIAPLVPGILMY
KTHDEGETFVAVDQGILVKTGPEVVVSVRHAIGGVDLGQLESAVKQQFLD
LDERERSVRSTMAKLESSFIRRYMELKHE
>CT2235 atpC-2, ATP synthase F1, epsilon subunit
MASSDKAFTLDIVTPQKLFFSGEINSVIAPGLNGLFQVLKGHAPLLAALK
SGKVRLSLSDRSEDTFQIAGGFFEVSGNKAILLTEEVS
>CT1033 atpD-1, ATP synthase F1, beta subunit
MGKTEPEQTGIVTSIRGSVVDMRFDELLPSIYSVVKTGREMEVTVEILMQ
LDRRHVRGIALTPTEGLCRGMKARNTGSPLKAPVGKGTLSRMFDVFGNAI
DRRGPVTNVTWRSVHGAPPQLSRRSTKSEVFETGIKIIDLLVPLERGGKA
GLFGGAGVGKTVLLTEMIHNMVSKESGVSIFCGIGERCREGEELYRDMSE
AGVLDNMVMVFGQMNEPPGSRFRVGLTALTMAEYFRDDLHQEVLLLIDNI
FRFIQAGSEISGMIGQMPSRLGYQPTIGTELSALEERIANTGTGAITSIQ
AVYVPADDFTDPAAVHTFSHLSASLVLSRKRAGEGFYPAVDPLSSGSKMA
GESIVGRRHYDLAREVRRVLAQYAELKDIIAMLGLEQLSAEDRRLVGRAR
RLERFFTQPFFTTEQFSGLAGKSVPIANTIDGCERILRDEFENYPERALY
MIGSIAEAQEKTVIETTMSESVAAKPEGGN
>CT2234 atpD-2, ATP synthase F1, beta subunit
MQEGKISQIIGPVVDVDFPEGQLPSILDALTVTRQDGSKLVLETQQHLGE
ERVRTIAMEGTDGLVRGMSAVNTGKPIQVPVGGEVLGRMLNVVGDPIDGK
GPVPAKKTYSIHRAAPKFDELSTKTEMFETGIKVIDLLEPYSRGGKTGLF
GGAGVGKTVLIMELINNIAKQQSGYSVFAGVGERTREGNDLWHEMMESGV
IDKTALVFGQMNEPPGARARVALTGLSIAEYFREEEGRDVLLFIDNIFRF
TQAGSEVSALLGRMPSAVGYQPTLSTEMGELQDRITSTKKGSVTSVQAIY
VPADDLTDPAPATAFTHLDATTVLSRQIAELGIYPAVDPLDSTSRILDPN
IVGDDHYNTAQAVKQILQRYKDLQDIIAILGMDELSDEDKLVVARARKVQ
RFLSQPFFVAEAFTGLAGKYVKLEDTIKGFKEIIDGRHDNLPEAAFYLVG
TIEEAVAKAKTL
>CT0020 atpE, ATP synthase F0, C subunit
MEGLGYLGAGIGAGLAAIGAGLGIGNAAASAAEGTARQPEAASDIRTTMI
IAAALIEGVALFGEVICVLLALK
>CT0019 atpF, ATP synthase F0, B subunit
MLTSGIILLEGGLLNPNPGLIFWTALTFLIVLVILRKTAWGPILSMLEER
AKSIQSAIDRAHTAKDEAEAILKKNRDLLAKADAEADKIIREAKEVADKL
RADLTEKAHDESRKIIASAKEEIEQEKRRALDVLRNEVADMAVKGAEKII
RTTLDADKQKAVVNDMIKEMAASRN
>CT2032 atpG, ATP synthase F1, gamma subunit
MPTLKDIRIRLKGVKSTQQVTKAMKMVAAAKLRRAQDRAIQARPYAGKLK
EMLASLSTKVDTSVNPLLSPREEVNNVLVILVTSDRGLCGGFNANIIKMA
QRLIHEEYAALHAKGGVTMICAGTKGTEFFRKRGYKLAAAYPGVFQNLSF
DSAREIADKASKMYLSGEVDRVVLVYNEFKSVLAPNLRTEQLLPITPEGG
DAKTASSEYLYEPSPAAIIDELVPKHLNTQLWRVMLESNAAEQAARMAAM
DSATENAKELIRVLNISYNRARQAAITKELSEIVAGADALKQ
>CT0018 atpH, ATP synthase F1, delta subunit
MSSAIASRRYAVALLEVAVEGNFLEKVTEDLQKIQEVLSGSHELVLALKS
PLINVDLKSKILEEIFRNKVDEKTMVFIKLLAHKKRAALLAGVISEFNAL
IDERNGVINADVKSAIKLSDEQAKELVNSLSVRTGKKIRAKMRLDENLIG
GVTVKIGDTIIDGSISHQLEMLRHSLVAQPA
>CT2151 bchB, protochlorophyllide reductase, ChlB subunit
MRLAFWLYEGTALHGVSRVTNSMKGVHTVYHAPQGDDYITATYTMLERTP
EFPKLSISVVRGQDLARGTSRLPGTVEQVDKHYKPELIVVAPSCSTALLQ
EDLGQMARASGVDQSKIMVYAVNPFRVAENEAAEGLFTELVRRFAAEQPK
TEKPSVNLLGFTSLGFHLRSNLTSLRRMLKTLGIEVNVVAPWGAGIDDLK
KLPAAWVNIAPFREIGCQAAGYLKEKFGMPSITEAPLGVNATLRWLRAII
AEVNKIGAEKGMAPMAMPELRDFSLDGQSAPSSVPWFARTADMESFSNKR
AFVFGDATQVVGVTKFLKDELGMKIIGAGTYLPKQADWVREQLEGYLPGE
LMVTDKFQEVSAFIEEEMPELVCGTQMERHSCRKLDVPCMVISAPTHIEN
HLLGYYPFFGFDGADVMADRVYTSAKLGLEKHLIDFFGDAGLEYEAEEPE
AFTEPTMSGNGTVASVSSAEAPSEAAVVTATATGELSWTAEAEKMLGKVP
FFVRKKVRKNTDNYAREIGEPVVTADVFRKAKEHLGG
>CT1422 bchC, 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
MEAKKSKAIVFSGVNQIELREVTLKPVSSTDVLVETWWSSISTGTEKMAL
NGLIPSPPFIFPFIPGYETVGRIVEAGDHVNQGLIGKFAYVAGSFGYEDV
NAAFGGASQFIVCPVESLTVLDGIANPQCGIALPLGATALHIVDLAEVKN
RKVLVLGQGAVGILAAELAKRFGASLVAVTEPHQRRLDISTSDIKVNPEK
QDVSVALAGHEFDVLIDSTGIMSAIETGLRFLKFHGKVIFGGYYQRMNID
YSQAFNKELSFIAARQWAKGDLHRVRELIAAGKINAEKIFTHQCTVDDNL
MEAYMQAFSDSDCLKMIIHWKHGNEAGEHFPTCNTAN
>CT1296 bchD, magnesium-chelatase, subunit D
MIAFTDIVGMDLAKQALMLLAVDPSLGGVVIPSTVGSGKSTLARAFADIL
PEGTPFVELPLNVTEDRLIGGVDLEATLASGQRVVQHGVLSKAHKGVLYV
DSLSLLDSSAVSHIMDAMSRGAVIVEREGLSEVHPADFMLVGTYDPSDGE
VRMGLLDRIGIIVPFTPVNDYRARKQIVSLVMGTRNEEDTQDELRMLRGI
IGAAREQLHHVSITNEQIKGLIQTAISLGVEGNRVDIFAIRAAIANAALN
QRTEVDDEDLKLAMKLVLVPRATRMPEREPNPEEMAQDEPPPQEEQPQDE
AEDQNAPPDEADSDADEEQEETPDMIEELMMDAVETELPDNILNISLASK
KKAKSGSRGEALNNKRGRFVRSQPGEIKSGKVALIPTLISAAPWQASRKA
EQAKKGIKSTAALIIGKDDIKIKRFRDKSGTLFIFMVDASGSMALNRMRQ
AKGAVASLLQNAYVHRDQVSLISFRGKQAQVLLPPSQSVDRAKRELDVLP
TGGGTPLASALLTGWETAKQARAKGITQIMFVMITDGRGNIPLGAAYDPN
ATKASKEELEKEVEALALSIQADGIASIVVDTQMNYLSRGEAPKLAQKLG
GRYFYLPNAKAEQIVEAALS
>CT1959 bchE, magnesium-protoporphyrin IX monomethyl ester oxidative cyclase, 66 kDa subunit
MKILMIQPNYHSGGAEIAGNWTPSWVAYIGGALKQAGFNQVKFVDAMADD
LPDETIEEIIRKNQPDVVMTTNITPSIFKAQDIMKIAKKVNKNIRTIMGG
IHSTFMYPQVLTEAPETDYVVRGEGEEVTVNLMKAIAAGNDKETRSEITG
IAYIDENGKVFATAAHPVIEDLDTLSPDWSLYDWDKYIYTPLNCRLAVPN
FARGCPFTCTFCSQWQFWRRYRARSPKNFVDEIEILVKKYKVGFFILADE
EPTINKQKFVSLCQELIDRKLDVTWGINTRVTDIMRDEDLLPFYRKAGLV
HVSLGTEAASQMNLNRFRKETTIEENKYAIKLLQKNGIVAEAQFVMGLEH
ETPETIEETYQLCKDWDPDMANWTIYTPWPFSDLFKELGDRVEVRDYSRY
NFVSPIIKPDNMEREDVLKGVLKSYGRFYARKTFFSYPWIKDPYVRKYML
GCLKAFAKTTLTKRFYDIDRVKTKNRKIEIDLGFDQSRILTQEEAKNLKE
RRPEMIADMSFGLKEAGYQREHDEHNWDEFDESTIRDRTSSTVRNC
>CT1421 bchF, 2-vinyl bacteriochlorophyllide hydratase
MPRYTPEQLEKRNASKWTTVQAILAPIQFLIFLAGLTVTYLYSQGIWVTD
FWWVTFFVALKTFMLVLIFVTGGFFELEVFGKFAFAHEFFWEDFGSAIAM
IVHISYFILFFWIKPAEHILILTAYLAYLSYLVNAAQFVIRLLLEKHNEK
KLKASGAV
>CT1610 bchG, bacteriochlorophyll synthase, 34 kDa subunit
MNGSDTLNPELQLNIDEKLHQSGISQSRQKIIRQALENVNRPGFRIEPSA
ILPLMKPVTWFPPMWAFACGVVSTGESVTDNISILVRGVILAGPLMCAMS
QTMNDYFDREVDAINEPERPIPSGKISKQASWLITFGLILTGFAVALSIH
PYVMAIAFVGVLMSHAYSGPPIRAKRNGWFGNLIVGLAYEGVAWLTGSFA
ITQGVPSKESIALAIIFSLGAHGIMTLNDFKSVVGDKIRKVASIPVQLGE
KNAAILASAVMDIAQIAAIAILVAKGSTIPTAIAVTLLIAQLPMQKILID
HPAEKAVWYNAFGTLLYVLSMMVCAVGIRP
>CT1957 bchH-1, magnesium-protoporphyrin methyltransferase
MRFLFITMEPTNNGVLKSAAAELNREFDLDLKVSIFNLGLNHGRHFWEKL
EQEIPRADFIFGSMLFSEEIVRPLEELLATAACPVCMITSNPALINQTRI
GKFSLQKNAEKEKQQGIFKQLASKLKPTHSSSESQRQLSLVRNVGKLMKH
IPGKARDIHTFISAHQFWLNGSKENMRRFLCLLIDRYIPGYKGKLPQNDP
IFYPDTALYHPDAGKPFSTTFELREWQEKHLPAKGAGRVAILVMRATLLS
ENMLHVINLLRELESRDVQCCIAYSGGLDFRPALEGFFDPASSESISIDL
IINATGFSLVGGPAETRSAEAVGILKKIGVPCFNLIPLAFQPISQWRENN
LGLAPLQAALSVAVPELDGAIEPHVFAGLEEGSDRTLPLESETRALADRI
TRLVRLRKKSNAEKKLAIVLFNFPPNLGNAGTAAFLDVFESLLQLMRKLK
NEGYDIELPASVDDLRNKLLEGNRLVFGTDGNVAAHYPVEQYRKAFPAYE
RIEPFWGDAPGELLNDGSRFHILGAMFGNLFIGQQPSFGYERDPMRLLMA
KDAAPNHAFAAFYSWLDREFGADALLHFGTHGALEFMPGKQVGLSQECWP
KQLIGGLPHFYCYCVNNPSEAAIAKRRGFATLVSYMAPPLEHAGLYKGMR
QLRELVGAWRSHPSAEALEEIRKMAGTLDLDHPSDEIGDEEYVTWLNNEL
YLIEERMIPLGLHVLGQAPSAESLVDNLALLVSHSRPELDNRSLPELVCT
GLRLDYDRLIEHHEEEMLLRESWQKVTSICHEAVKRFVGKLPETLPEGVS
TAAFLEGTLPVRMNEANTWLQKSAAIKPRQLDRLWHFLNGILAAMLENRE
IEAITQALAGAYIPPSPGNDLVRNPAIVPTGRNIHSLDPYSIPTPFATKA
GERSAEELLEQYRRETGELPESIALILWGTDNLKSDGEGVAQALALLGAR
TKTDELGKIGDVELIPLEKLGRPRIDIVVTVSGIFRDLLSHQVRLLDRAV
RLAAAADEPESMNFVRKHVLQQATELGITANEAADRIYSNAPGCYGSNVN
HLIESSTWEEEQQLADAFVNRKSFAFSPEGDWRESPEILRSALKNVTLTF
QNIDSFEIGLSDIDHYYEYLGGVSKSVEVLGGTKPKVMVGDTGGFGKGQK
IRSLESMVALEARTKLLNPKWYEAMIEHGYEGVREIEAHLTNTYGWSATA
SAVKNWTYQQFTETFLQDRQMLERMAALNPNATMSMTRRLLEASSRGFWE
ADEGTIEQLQELYDELETRIEGAHTPEV
>CT1295 bchH-3, magnesium-protoporphyrin methyltransferase
MAQRRKITAIVGLEQYNAGLWRKIKSMLDKDAELVQLSDVDLEKQNPEAA
KAIREADCVFMSMINFKEQVDWFKEQLNQGTNEKTIFIFESMPEAMALTK
VGSYQVTEGKSGMPDMVKKIAKMLVKGRDEDALYGYMKLMKIMRTILPLV
PNKAKDFKNWLMVYSYWLQPTPENIVNMFRLILREYFDSNVKVEPIVDVP
NMGLYHPDAKEYFKDVKSFKSWSKKRGVNFDKSQKMALLFFRKHLLQEKT
YIDNTIRTLEKHGLNVFPAFVMGVEGHVLVRDWLMKEKIDLLVNMMGFGL
VGGPAGSTKPGTAAEARHEILTGLDVPYMVAQPLLVQDFESWHELGVSPM
QVTFTYSIPEMDGAVAPVILGALQDGKVETVQERLDRLAILSKKWMRLRA
TPNRDKRVALVVYDYPPGLGKKATAALLDVPTTLLRILERLKKEGYNVGT
LPESPEKLFEMLDRATDYQIMQNKPEAIKVSREKYNELVTYHERERIEER
WQAFPGEIAPIGSDEVFLGGLRLGNIYIGVQPRLGVQGDPMRLIFDKANT
PHHQYISFYRWISREFDAHALVHVGMHGSVEWMPGLQTGLTGECWPDALL
GEVPHFYIYPVNNPSESTIAKRRGLATMVSHVVPPLARAGLYKELPALKE
LLADYRERNQAQGEDVEQVQEAIMTKAELLNLTDDCPRRPDEPFSDFVSR
LYIYIVELENRLISNSLHVFGEAGPLESQIITITETIKNRGENGRSLPYI
FIDTSGKNGHYGSYEEISSLSRKGDEAAIKLREWAENACREFVKQTMFDR
KNPLQAFELVTGGSRMPEEDKPFIQRIIQEGAMMIQALSDNSSEMNSLVK
VLEGGYISSGPGGDLVRDGMNVLPSGRNIHSIDPWRIPSETAFKRGTLIA
DGLIAKHIAENDGQYPETIAEVIWGLDTIKTKGEAVAVVIRLMGAEPAYD
AFGKISHYSLTPLDKLGRPRIDVLMQLSPIFRDAFGILMDQLDRLVKDAA
KADEPHEMNYIKKHVDEALAEGMDFEAATARQFTQAPGSYGTYVDDMIED
SAWENEGDLDDLFIRRNSSAYGGGRKGEKQPEILQKLLGSVDRVVHQVDS
TEFGISDIDHYFSSSGSLQLAARRRNTKTSDIKLNYVESFTSDIKLDEAD
KLLRVEYRSKLLNPKWFEGMLKHGHSGAGEISNRVTYMLGWDAVTKSVDD
WVYKKTAETYALDPEMRERLATLNPQAIKNIVGRMLEAHGRGMWKADQSM
IDELQEIYADLEDRLEGMADE
>CT1297 bchI, magnesium-chelatase, subunit I
MTQTANAAKKTTSTKASAAKEAKVKVTAEEKAVTEVKKPAAKKKSALAFP
FTAIVGQEEMKLSLILNIIDPRIGGVLVMGHRGTGKSTTVRALAEVLPLI
PRVKGDIYNRTVEQYIEMEAAGKGAPAIKPEDVETELIPVPVVDLPLGAT
EDRVCGTIDIEKALTSGVKAFEPGLLAQSNRGFLYIDEVNLLDDHLVDVL
LDVAASGKNVVEREGISIRHPARFVLVGSGNPEEGELRPQLLDRFGLHAR
ITTINDVAKRVQIVKLRREFDEDPEAFMKKVSREQQKLRKKIVAAQQLLP
QVTMDDAVLTDIAKLCMNLGIDGHRGELTITRTAHAYAAWEGDKKVTMKH
VREIAGLCLRHRLRKDPLETVDAGEKIDRELAKVLGEAEAAA
>CT2014 bchJ, bacteriochlorophyll synthase, 23 kDa subunit
MSSSPSRIGPNSIIQTVGALETAYGKNETEKLLKKIGQGYLINNLPSEMV
EESKFHALVTALQKELGETATAGILKESGERTAKYLLKVRIPGPFQTIVK
LLPAGLAFKVLLFAISKNAWTFAGSGEFSYGSKPSPNVMVKVTFPSHPVV
SNFYLGTFTALLRELVSPKTEIKADIRKEGSAIRCNYLCKI
>CT1992 bchK, bacteriochlorophyll c synthase
MVRNISSPEAHTVIFNRPGAVFTISSVMSVASQGLVDKVKAHLELLDPVT
WISVFPCLAGGVMASGAMQPTVHDYLLLAALFLLYGPLGTGFSQSVNDYY
DLELDRVNEPTRPIPSGRLSEKEAIWNWSIVLVIAVALSSWIGTSIGGQR
GMIFVGSLLAGLVIGYLYSAPPFKLKKNIFFSAPAVGFSYGFITYLSANA
LFSDIRPEVLWLAGLNFFMAVALIVMNDFKSQEGDAKEGMKSLTVMIGAK
NTFLVAFIIIDLVFAVFAWRSYMWGFTTLMYFIIAGLVLNIIIQIPIYRD
PKSGITLVQHAVDDGFGNAIGKSEVQEHNAFLRFQVVNNILFLTNQMFAA
ALIGAKYM
>CT2150 bchL, protochlorophyllide reductase, iron-sulfur ATP-binding protein
MSLVLAVYGKGGIGKSTTSANISAALALKGAKVLQIGCDPKHDSTFPITG
KLQKTVIEALEEVDFHHEELSPEDIVETGFAGIDGLEAGGPPAGSGCGGY
VVGESVTLLQEMGVYDKYDVILFDVLGDVVCGGFSAPLNYADYAVIIATN
DFDSIFAANRLCMAIQQKSVRYKVQLAGIVANRVDYTKGGGTNMLDQFAE
QVGTRLLAKVPYHELIRKSRFAGKTLFAMDPNEPELAECLAPYNEIADQI
LSEKPIASVPKPIGDREIFDIVGGWQ
>CT1958 bchM, magnesium-protoporphyrin IX monomethyl ester oxidative cyclase, 25 kDa subunit
MSSPSFNVEEHKKMLQSYFNGQGFQRWASIYGDDKLSTVRSTVRQGHAVM
MDKAFEWLQKTGLPKGSKILDAGCGTGLFTIRLAKSGYRVKAADIAEQMV
NKTREDAEKAGVAGNVEFEVNSIESVSGTFDAVVCFDVLIHYPAEGFAHA
FSNLAGLTKGPIIFTYAPFNNILAFQHWIGGFFPKKERRTTIQMIKDKEM
QRVLSELGLKIVSQQKISFGFYHTMLMHVARR
>CT2152 bchN, protochlorophyllide reductase, ChlN subunit
MMPVSSDCQILKEDNVTHSFCGLACVGWLYQKIKDSFFLILGTHTCAHFL
QNALGMMIFAKPRFGVALIEEADLSRAEPQLEAVIEEIKRDHNPSVIFLL
SSCTPEVMKVDFKGLAHHLSTDKTPVLFVPASGLVFNFTQAEDSVLQALV
PFCPEAPAGEKKVVFVGSVNDITADDLRTEAEQLGIPVGGFLPESRFDKL
PAIGPDTVLAPIQPYLSRVCSRLNRERGSQVLTSLFPFGPDGTKTFWEDL
AAMFGIKVDLSDRAEAAWEKIKPQTDLLKGKKIFLTADTMMELPLARFLK
NAGAEVVECSSAYINKKFHARELEALEGVKVVEQPNFHRQLEEIRATRPD
MIVTSLMTANPFVGNGFIVKWSMEFTLMSIHSWSGVFTLANLFVSPLLRR
ESLPEFDESVWLEGVMPSAQ
>CT2256 bchP, geranylgeranyl hydrogenase
MLYDVAIIGGGPSGAAAAEILARAGHSTILIERNLANVKPCGGAIPLGLI
EEFDIPDELVEKKLTRMSVRSPKGETIFMHMPNGYVGMVRRERFDRYLRE
KAQKAGAEVVEALVKKIERSVDRFTLQLFNKEGEALPPVEASYVIGADGA
NSKTADELGFPPNDLKVIAMQQRFHYCDELKPYEELVEIWFDGEVSPDFY
GWIFPKTDHIAIGTGTEEHRNDIKQLQHRFVEKIGIAEKPYLNEAAKIPM
KPRRSFTQERAILVGDAAGLVTPANGEGIFFAMRSGKLGAEAMIERIRNN
TPLSSYEKKFRKLYSPIFFGLQVLQSVYYKSDRLRESFVAICRDKDVQGI
TFDSYLYKKMVPAPWSVQMKIMAKNIYHLAKGS
>CT1777 bchQ, bacteriochlorophyll c8 methyltransferase
MDDDSNQKPLFHMALGVLTSLTPPQHHIELVDEHFHDKINYDGDYDMVGI
TSRTIEATRAYEIADEFRKRGKTVVLGGLHISFNPEEAAAHADCIVVGEA
DNLWTTLLDDVANNRLKERYDSKDFPPVKAITPLDYARIAKASKRTKVDG
TKSIPIYVTRGCPFNCSFCVTPNFTGKQYRVQDPKLLKHQIEEAKKYFFK
ANGKNSKPWFMLTDENLGINKKKLWESLDLLKECDITFSVFLSINFLEDP
TTVKKLVDAGCNFVLAGLESIKQSTLEAYNKGHVNSAEKYSKIIEDCRKA
GLNIQGNFLFNPAIDTFEDIDELVQFVKKNHIFMPIFQIITPYPGTQMYH
EYRESGLITIEDWEKYNALHLVIKSDRYEPLLFQYKVLKSYVEVYTWKEI
LLRTLYNPRKLINLVTSIAFKKHLAAQLKAFERNHKMNPAMLSGVKPVMN
G
>CT1423 bchX, chlorophyllide reductase, BchX subunit
MATRRANTFQPATRPTEYLMAPRTIAIYGKGGIGKSFTTTNLSATFAMMN
KRVLQLGCDPKHDSTTSLFGGISLPTVTEVFAEKNAKNEQVQISDIVFRR
DIPGFPQPIYGIELGGPQVGRGCGGRGIISGFDVLEKLGIFEWEIDIILM
DFLGDVVCGGFATPLARSLSEEVLLVTSNDRQSIFTSNNICQANNYFRTI
GGRSRLLGLIVNRDDGSGMAENYAKAAGINVLMKVPYNLQARDMDDSFDF
AIKLPEVGEPFKKLATDILNNAITPCEASGLDFKDFVRLFGDVSEELPAA
ATADELFKRKGETAADPEAHDPERQQLLACIEKLPEPEREIYTLHEIEGK
SPEQIAGLKGIGEQEVKAHIARARKAMRKLFFEL
>CT1826 bchY, chlorophyllide reductase, BchY subunit
MHPQSMCPAFGGLRVLMRIDGAQVCMAADQGCLYGLTFVSHFYAARRSIV
SPELMNAQISGGTMIDDVRCTIEKIAEDPSVRFIPVVSTCVAETAGIAEE
LLPKRVGNADVLLVRLPAFQIRTHPEAKDVAVSSLVKRFGAFGEPKKGKT
LVVLGEIFPVDAMMIGGVLQKIGVESVITLPGADLDDYVQAGRASACAVL
HPFYERTAALFESAGVKIVGGNPIGANATGQWIERIGEALDLDPETVKTV
AEEERQKAKGMMAGFAERMHGSVIVAGYEGNELPLVRLLLEAGLDVPYAS
TSIARTALGEEDHRLLTMLGTEVRYRKYLEEDMEAVLEHKPDLVIGTTSL
DSFAKEHGIPAIYYTNNISARPIFFASGAASVLGMIAGLLEKREIYGRMK
EYFMPSA
>CT2125 bchZ, chlorophyllide reductase, bchZ subunit
MAKTIRDESTASAYWAAVNTFCALKDVHVIADAPVGCYNLAGVAVMDYTD
AIPYLENLTPTSLTEREISSSGSSQIVQETIEKLMGSGKQLILVSSAESE
MIGSDHQHMLAMKYPSVRFFASDSLGENEWQGRDRALAWLFDQFDDGQPS
QIEPGTVSIIGPTYGCFNSPADLAEVKRLVTGAGGRVAHVYPFESKAAEI
TKLKNSAAIVVMYREFGAALAEKLGRPVLYAPFGIEETDRFIGKIGRLCG
TPEQATRFIAEEKRTTLSPVWDLWRGPQSEWFPTIRFGVVASKSYADGIK
HVLGDELGMQCLFSLDSAEVDNNAVRKEIVQKQPQFLYGRMPDKIYLAEA
DAKSRFIPAGFPGPIVRRALGTPFMGHSGVVWLVQEIVNALYDTLFNFLP
ITRRQQAAAPTKPLKWTPEANAILDGIVRKAPFISQISFGRELKRKAENL
AASRGADTVTPDILQQLG
>CT0662 bcp-1, bacterioferritin comigratory protein, thiol peroxidase, putative
MALLQAGQKAPEFTAKDQDGKEVSLRDYTGRKVVLYFYPKDDTPGCTKEA
CAFRDNLPNFEKVDAVVLGVSVDGQKAHRKFADKYELPFTLLVDDEKKIV
EAYGVWGLKKFMGREYMGTNRVTYLIDEQGTIEKVWSKVKPETHTAEVLD
WLQQKT
>CT2001 bcp-2, bacterioferritin comigratory protein, thiol peroxidase, putative
MIEEGKIAPDFTLPDSTGKMVSLSEFKGRKVLLIFYPGDDTPVCTAQLCD
YRNNVAAFTSRGITVIGISGDSPESHKQFAEKHKLPFLLLSDQERTVAKA
YDALGFLGMAQRAYVLIDEQGLVLLSYSDFLPVTYQPMKDLLARIDAS
>CT0047 bioA, adenosylmethionine--8-amino-7-oxononanoateaminot ransferase
MTIDLDFDRCHLWHPYTSMADPLPVWPVKRASGVMIELEDGRKLIDGMSS
WWAAIHGYNHPVLNRAVTEQLGRMSHVMFGGLTHEPAIELGKILTSLLPD
PLDRIFFCDSGSVAVEVAIKMALQYWLAAGKPGKKRLLTVRSGYHGDTFM
AMSVCDPVTGMHSLFSGAVPEQLFVEAPACGFNEPWREEAIDKMRQALED
HANTIAAVIIEPIVQGAGGMRFYSPHYLRRLRELCTEHGVLLIFDEIATG
FGRTGKLFAMEYASVTPDIVCLGKALTGGYMTLAATVTTGHVADTISGGN
PGLFMHGPTFMANPLACAVAVASLKLLLSGDWQSTVWRIERQLAEELAPC
TGMTGVRDVRVLGAIGVVELDRPVDMAKIQQAFVERGIWVRPFGRLVYLM
PPFIIRDNELTRLTSVICEVIGAEYR
>CT0052 bioB, biotin synthetase
MKSRLHPDIEKAYAVLDTGEPLSLELASALGRLPDSEVLDLVSLANRVKA
RHAANHGAIHACSIMNAKSGVCGENCRFCAQSKHNSAEVDVYELVDENKV
LEQARSAWEQGIGHFGIVTSGYGYLKVTPEFERILGMIDRLHRELPGLHV
CASLGVLGDAPAAELARHGIAHYNINIQVDPARYGELIADTHAVNERIGT
IRRLRSNGIGVCCGGILGVGETMQERIGMIFALRDLDVTVIPLNVLVPID
GTPLEGAAPVSVPEIAKTFAICRLAHPTKIIKFAAGRETVMKDFQGLLML
AGADGFLTGGYLTTRGRDISTDRQLARQLSKFS
>CT0049 bioC, biotin synthesis protein, putative
MNGVIDKQLVRRRFRRALPTYAGHAEVQRRMAVRLVALIENAGASTHLGR
VFEFGSGSAMLTSILFERYSANEFFANDLVAESRAFVEKAVTGRNVERLT
FLPGDVERLDPLPGNLDLAVSNATVQWLHDPARFFDRLATSVKPGGIVAF
STFGAENMHEIAALGEAALPYRSLDKIAALSGELFELVAIEDDIVRQEFD
TPEAVLRHIRKTGVNGVARRAWTRSQYLDFLQRYRSAYPSGEGVTLTWHP
VYCCFRKKKS
>CT0048 bioD, dethiobiotin synthase
MLFQKEKVMKGQVLAISGIDTGIGKTVVTGLLARCFAETGWRTITQKIAQ
TGCEGVSEDIAEHRKLMGIDLQEADLDGTTCPYLFRFPASPHLAATVEGR
EIDFMTIRRSTFRLQKLYDLVLLEGVGGLLVPLTPELLFADYVRDAGYGL
VLVSASRLGSINHTLLSLEACARRGIPVRAIVYNRYFEADERIAANTREV
IAAALKRYGFGEAPVIDLNTSGLSAEPGDLQRILNPPGQ
>CT1951 bioF-1, 8-amino-7-oxononanoate synthase
MPGKNVEVKPQTVEKTKDIFKKCVDFTLADEVKALGVYPFFRPIDDSEGP
VVSFEGRKLVMAGSNNYLGLTNDPNVKQASIDAIKKYGTSCSGSRYMTGT
VRLHIELEEQLADFFEKECCLLFSTGYQTGQGIIPTLVQRGEYVVADRDN
HASLVAASIMAIGGGANQVRYRHNDMADLERVLQNIPESAGKLIVSDGVF
SVSGEIVDLPALVALAKKYNARIVIDDAHAVGVIGKGGRGTPSEFGLVNE
VDLIMGTFSKTFGSLGGYVVGERSVINYIKHTASSLIFSASPTPASVAAV
LATLKIIREQPQLTERLIANTDYVRQGLLKAGFTLMPSRTAIVTVLIADQ
MKTLYFWKKLFDAGVYVNAFIRPGVMPGHEALRTSFMATHEKEHLDKVIT
EFCSIGRELGVI
>CT0051 bioF-2, 8-amino-7-oxononanoate synthase
MSGGFTNAGQVEPPIVADIAVELAKLKAKQRFRSIPATGERSGRFVTVGG
RKLLNLSSNDYLGLGADRELFSSFIAQIRDDRFDDGRFAMTSSSSRLLTG
HHPVCDQLESAIAAAYGSETALVFNSGYHANTGILPALATRHDLILSDRL
NHASIIDGLRIADADYRRFRHADYDHLEEQLETAARECYRQIFIVTESVF
SMDGDLADLRRLVKLKRRFKAVLIVDEAHGAGVFGERGLGLCEALGVTNE
IDIIVGTFGKSLASAGAYAVMRGLFREYLVNTMRTLIFTTALPPMTLAWS
LATFTKQLTMQRERDHLLGLAATLRGSLREAGFDVPGESHIVPVVLGEDR
TAVEMAGALREAGFHALPVRPPTVPENSSRLRFSLRADLTSGDIAALAET
MKRGAA
>CT0053 birA, birA bifunctional protein
MNPVTATILQRLAGDGGFVSGGDLCNELQISRSAVWKHIVALRKAGYAIE
ATSGKGYRLESLTGSPVAEEVAPLLATESFGRNFIYLESIDSTNLRARAS
AREGAVEGTVVVADSQTEGRGRMRRAWVSPAGVNLYCSIVLRPPVPSIRV
PEIPLVAAAAIHEAVTQECPGLAAFIKWPNDIITGGRKLCGILCEMESEP
DCTHFVVVGFGLNVNLDPVPEELRKIATSIAIETGLRMSRARLLAAVLNR
FEQLYRDWLEKENLAFILPYLEAHSWLKGRELKIEQFNTVLTGTEAGLSP
QGHLLLRTVDGALLSVTSGEAHLLPVNHEPRSA
>CT1823 blc, outer membrane lipoprotein Blc
MKKLFLPMLLWMAALAGCASAPEGIVAVDNFKLDRYLGTWYEIARIDNWF
ERGSDHVSATYTLRDDGKVQVLNKGYYPDKKKWKTAKGKAKFAGSPNVGA
LKVSFFGPFYGPYNVFALDRENYSWAMVTASSRDYFWILARTPQIDDALY
EKLLEKAKSQGFDTAKVMRTLQ
>CT1036 bmpA, basic membrane protein A
MTYRKYSSFFFRLSSILTMFMLLLTGCAGKKKVSESNPNAYKVGLVFDVG
GRGDKSFNDLAYNGLEQAKKKLGIQFDYIEPSGEGADREAALRQMAADPD
VKLIIGVGLLFTDDITAIAKEFPDKKFACIDYNPQPGAEIPSNLSGIVFE
EKKGSFLAGAIAALESKTGIIGFIGGMDSNIIRKFESGYIEGAKYVRPDI
KLITNFIGMTGSAFNDPAKGKEIALGQYSQGADIIYQAAGASGMGVIEAA
RESKKLVICTDMGLEWPAPENMLTSINKAINKAVLTTIDEAMHGKFEGGK
QRVFGLDNRYTDYVWNSDTEKLIDQSVHERIESIRKDILDGKIKVQE
>CT1567 bmrU, bmrU protein
MGRAMGDSFTFIFNPAADKGRAADKTALIERSLAHFEVASLETTRFAGHA
AEIARAAAGEGSTLIACGGDGTLNEVVNAVAGQPVKVGVLPVGSANDFLK
TFNPSAKEHEVRIRGFAGATSRKVDLGKVEFGGGESRYFVNSIGIGFTGR
IASTVKSVKWLRGELSYAWALVSVLLGYSAVKMHITLDTVEGKIELDEPV
FAFSVSNGRVEGGKFRIAPEADPFDGLLDVCILKAVSKWRVPGYVLKYLK
GSQIHDPNVIYCKARSVEVFLSVSEAMHMDGEVIEKVGGAIAITAEPLAV
EMLYEP
>CT0941 btuR, cob(I)alamin adenosyltransferase
MSQKRVLLFTGNGKGKSTAAFGMLSRALGHGLKVRVIQFVKAQEGVGEVL
FFTRFEDVEWDHYGKGFLPTNPDSPMMEKHKQAAEFGFEEALDALASDEY
DFVLLDEVCFALSKEIIPLEPLIKAIVDASDKIIVLTGRNAPQALIDVAD
TVTEMKMIKHGYEQGLLSQAGVEE
>CT2260 cafA, ribonuclease G
MKKNVKKQLLMNKIGDEIQVALVEEGRLAELIIERPESMRSIGDIYLGRV
HKVVEGLKAAFVDIGQKSDGFLHFSDVGTTSEDYRALVEDDDNDENGGDE
DDDENEGDDNAEAPAPPREAARPAGQQKSEQGQQSSGDAAERRQSYTQMI
AQKLKPNDSILVQVIKEPIGTKGSRLTSDITIAGRFMVLLPFGGGQIAVS
RRVVSRKERGRLKKLVRSMLPEGFGAIIRTVAEDQDESLLKQDLEKLLAK
WTQIEEKLQDAQPPQLIFKEDTIISSVLRDSLTTDVTEIVANSKSIYTET
LNYIQWAAPDMVKNVSLYEGKLPLFEGYGIAKDVESIFSRKVWLKSGGYI
IIEHTEAMVVVDVNSGRYAAKREQEENSLKTNLEAAREVVRQLRLRDIGG
IIVVDFIDMMDPKNSKKVYDAMKAELRNDRAKSNILPLSDFGIMQITRER
IRPSLMQRMGDQCPACGGTGVVQARFTTINQVERWLRKFALQQKVPFQQL
DLHISPTVLEPMRNSDMKTEMKWFLQHLVFVRIKSDESLRSDDFRFYNRK
TGADITAEFN
>CT0066 carA, carbamoyl-phosphate synthase, small subunit
MQEIPAILVLENGSVYRGTAFGHAGEATGEVVFNTSLTGYQEILTDPSYT
GQMVVMTYPLIGNYGITPNDNESKKIWASALIVREASNVYSNYESTRSLD
ATLKEAGVMGLAGIDTRKLVREIRQKGAMRAVISTLCDDVAALKAQAAAI
PEMTGLDLVQRVTTGERYTVDNPDAKYHVVAFDYGIKTNIIRQLNAEGCK
VTVVNAKTTADEVLAMNPDGIFLSNGPGDPFAVTYAIDTIRELAARNSTL
PIFGICLGHQLLSLAFGAKTYKLKFGHHGANHPVKNLLSNTIEITSQNHG
FAVEMESLPGELELTHKNLYDMTVEGIRHRELPCFSVQYHPEAAPGPHDS
HYLFKEFTELMERLKN
>CT1672 carB1, carbamoyl-phosphate synthase, medium subunit
MPKREDIKSILVIGAGPIVIGQACEFDYSGTQACRALKEDGYRVILVNSN
PATIMTDIELADATYIEPITPYYVTKIIEKEKPDALLPTMGGQTALNTAV
KLAESGVLERHGVELIGAKFRAIRKAENREFFGDAMRKLGLEMAKGFFVR
NEKEAKEALEEIGLPIVIRPSFTLGGTGGGFAETKADYYDMVRRGLAESP
IGEVLVEESLIGWKEYELEVIRDLADNVIIVCSIENVDPMGVHTGDSITV
APAQTLTDRQYQELRDASIKIIREIGVETGGSNIQFAINPKSGRIVVIEM
NPRVSRSSALASKATGFPIAKVAAKLAVGYTLDEITNDITKTTPASFEPS
IDYTVVKVPRWDFEKFKNVDSRLGVQMKSVGEVMAFGRNFREALQKSLRG
LEIGRAGLGCDGKDIMNVIDMTQQQRKFAKEDLLEKIKIPKADRMFYLRY
AFQAGATVEEIHQATGIDPWFLDNIRQIVEFEDELRQMAAGSAA
>CT1592 carB2, carbamoyl-phosphate synthase, large subunit
MTTPLPELSEQAQALSQKLPASAMKKAKEHGFSDFQIATIFQTSEAAVRE
LRNHYGVISVFKTVDTCAAEFDASTPYHYSTYDEENESVRSDKKKVIILG
GGPNRIGQGIEFDYCCVQAVFALRESGYETIMVNCNPETVSTDYDIADKL
YFEPLTFEDVIRIIEHEQPLGVIVSFGGQTPLKLSTKLDKAGVHILGTSS
DGIDLAEDRKRFGALLEKLDIPHPEYGTATTFAEAQEITNRIGYPVLVRP
SYVLGGRAMKIIYNDDSLKEYVDQALFITEKYPLLIDRFLETAVEFDIDA
LADSTDCVISGIMQHVEAAGIHSGDSTSILPYYNIDPAAIATMKEYTRKL
AEHIGVIGLMNVQYAVQNGKVYVIEVNPRASRTVPFVGKATAIPVVKIAT
RIMLGEKLCDLRKEFNLKDCDELGLKHMAIKEPVFPFSKFVKSGVYLGPE
MRSTGEAMSLAESFPEAFAKAYQAANMHLPLSGTVFISVNDQDKNQRIIN
IARELYRMDFDLVATAGTHAFLTQHGIDCKKIFKVGEEGRPNIFDIIKLG
KIDFVINTPRGERALHDEEAIGSASVQNGVPFVTTIEAAEASVRAIDCIR
CQEFGVKSLQEYATYRDQ
>CT1130 cas1A, CRISPR-associated protein Cas1A
MKKHLNTLFVTTQGSYLSKEGECVLISIDRVEKTRIPLHMLNGIVCFGQV
SCSPFLLGHCAQLGVAVTFLTEHGRFLCQMQGPVKGNILLRRAQYRMADN
YDQTATLARLFVIGKIGNARVTLARALRDHPEKTDGEKLKNAQHVLAGCI
RRLQEATDQELIRGIEGEAAKAYFSVFDECITADDPAFRFEGRSRRPPLD
RVNCLLSFVYTLMTHDIRSALESCGLDPAAGFLHKDRPGRPSLALDMLEE
FRSYIGDRLVLSLINRGQIHAKDFDISETGAVAMKDDARKTLITAYQQRK
QEEIEHPFVGEKMAVGLLWHMQAMLLARYIRGDIDMYPPFVWR
>CT1977 cas1B, CRISPR-associated protein Cas1B
MKSNTLGDEGNPARLLIKVTRETLPQVKEKYPFLYLEKGRIEIDDSSIKW
IDCDCNVVRLPVAMLNCILLGPGTTVTHEAVKVMAAANCGICWVGDDSLM
FYASGQTPTSNTRNMTHQMKLAANPAKALEVARRLFAYRFPDANLENKTL
PQMMGMEGLRVRKLYEEMAVKYKVGWKGRRFEPGKFEMSDTTNKILTASN
AALYSIILSAVHSMGYSPHIGFIHSGSPLPFIYDLADLYKQQVSIDLAFS
LTADMAGYYDRHKIASEFRKRVIEIDLLGKIGPDIETILGKKQCSS
>CT0029 cat2, 4-hydroxybutyrate coenzyme A transferase
MSYRTISAEEAVMAIKSGDRVFLHTAAATPLRLIEAMVSRASHLRDVEIV
SLHTEGDAPYVRPEFQESFRLNAFFVGRNVRSAVQCGYADAIPIFLSDVP
AMFYDGVMPLDVALVHVSPPDRHGYCSLGVSVDATRAAVHTARHVIAQVN
PNMPRTHGEGLLHVSKIHSLVEVDDPLPETPRHELTDVERQIGQHIASIV
EDGATLQMGIGAIPDATLAALTNHKDLGIHSEMFSDGVVDLVEKGVVNGR
CKKTHNEIIVASFLVGTRKLYDFVDDNPLVEMFGSDYVNDTKEIRKNPKV
TAINSAIEIDMTGQVCADSIGTRHFSGVGGQMDFIRGAALSPGGKPIIAL
PATTRKGESRIVTQLKPGAGVVTTRAHVQYVVTEYGIVNLHGKNLRQRAE
ALATIAHPDFREEILRQAHELYGRKSNCLID
>CT0929 cbiA-1, cobyrinic acid a,c-diamide synthase
MNSNATTKRNATINTRQSRAFLVAAPSSGSGKTTITLGLLRLFARRGMAV
QPFKCGPDYLDTMLHAMAASTGETARPGLNLDTFMASKEHVRSLFARNAA
SADISVIEGVMGLFDGAEKSDGSSAEIAALLGLPVIMVIDGSKIAYSAAP
ILYGFKHFDPSVKVAGVIFNRVGSASHYSHLEAAAKDVGVEPLGYLPRNS
DITISERHLGLNTSAEYDREKLIDAMAEHIEKTVNIERLLDVARIELPEI
EQAQPVTKKLDKVIAVARDEAFNFIYHDNLETLKRYGEVRFFSPLHDREL
PKADLVYLAGGYPELHAEALAANKTMRKAIAEWCRNRGATWAECGGLMYL
GKTLTLADGAAHPMCGVLDLDTTMQEARLTLGYRKVYPAGADGVELRGHE
FHYSRIWRQGEIDTIAEIRNARGQQVGTKLFRVGNTIASYLHLYWGEHDY
LANWFDVLR
>CT0940 cbiB, cobalamin biosynthesis protein CbiB
MTWLLMPAAFLLDLLLGDPQWFPHPVRLVGKLASGAERLFRGMRFLPLRL
AGILTALVVIGTTVVTVLALVVAAFLLHPLAGVVVSVLLLYFAIAPRDLY
NHGSAVRDALRAGDLELARQKVAMMVGRDTSMLDEEGVARAAVESVAENT
SDGVTAPLFYGLLFGPAGAWLYKASNTLDSMFGYRNERYREFGWASARFD
DFMNYLPSRLTVLAVAGAAFILRLDPAGVFRSVKLGATLHESPNAGYPEA
AFAGALGVTLGGERSYGGVTKTVPTLGLREGALDATTITGAMRLMYVASV
LFMACGALLLFILSKSLQS
>CT0387 cbiCH, precorrin-3B C17-methyltransferase/precorrin-8X methylmutase
MPAVNPFALCVMNGTITVVGLGPGSDSMMTPQVLDAIRTADAVVGYTGYM
KSIAHLIPDSVQTVATGMTGEMQRAEEAFALAAEGRHVVVVSSGDAGIYG
MAPLIVELHASGRWPEVAVEVLPGISAFQAAAARLGAPVSHDFCAISLSD
LMTPWEVIEKRIEAAASADFVTAVYNPRSRDRYWQIHRFRELFLRHRSPS
TPVGIVRNVTRKDESVVLTTLGEFDPDSLDMFTMMLVGNSTTFFSGERMV
TPRGYFSREERGLRPVGQSIMSSSFRAIAELMKPDSRPTDERWAVMHTIH
TTADFEMQELFHATPGAIQRWHEALTAGGATIITDVTMVQSGLRKAALER
YGVTVRCYLHDERVTELAKSAGITRSQAGMRLAAKEHPDALFVVGNAPTA
LLELASLLHRGGFAPMGVIGAPVGFVNVVESKHRLKAAAGATPIAVIEGR
KGGSAIAATIVNAAFSLDEAEAMNPGRDV
>CT0384 cbiDJ, precorrin-6X reductase/cobalamin biosynthesis protein CbiD
MILLFGGTTEGRQAAALCDRLGLPFIYSTKTRVEPFATACGEFRHGALDE
AALSELVSSSGVRLVIDAAHPFAARLHRSLFEVCQHLAVRLIRFDRPSVD
VPDFEGLHSVADFTEALALLDRLNPAKLLALTGVQSIAPLRPWWEQHDML
LRILPTQASMELARREGIPEAQLLPMNPDSSDEALEKLVRKHGVDCLLAK
ESGESGFFSSKLRVAERCAIPLIVIRRPPAPAYDVVATSVEELEALLPDA
VTLKPLRTGYTTGACAAAATKAAVIALFDGAAPDLVTITLPSGKVASFGI
VSYRAERGRSRCGVRKFAGDDPDVTHGLLIVSEVSLLPNGEPGEVRFLQG
EGVGRVTLPGLEIPPGEPAINPVPRQMIRECIAGELRQRGLRHAVAVTIS
VPGGNAVARKTMNERLGITGGISILGTSGEVIPYSIEAWLASIRQSIAVA
STNGCTTLALTSGLRTERHLRARYPDLPELACIHYGNYIGRTLELVHQHG
GFSRVIVGVMLAKATKLAQGQADLSSRVVQLDPRWLANLSADLGYSPVIA
NQLADLHLVRNVTDLIPFSSDEPLYTAIANASYRACRRWLPDTSLTFVLF
DMEGQAVVVD
>CT0386 cbiET, precorrin-6y c5,15-methyltransferase, decarboxylating
MSEQFTLIGLSDSTEPHLDPSAIEAIRAHRIFAGGTRHREIVGALLPSAY
RWITIAPPIDEVLSQLAGADEPVVVFASGDPFFYGFGATLQKRFPGASIQ
SFPTFHSLQMLAQRCLIPYQSMRHASLTGRSWEELDCALIAGERLIGVLT
DTRKTPTEVARRMLDFGYSSYRMAVGESLGGSDERVTRCTLEEASGMQFG
KLNCLLLEASEPPKRWFGIPENLFDGLPGRPNMITKMPFRLAALAALELG
RARTFWDVGFCTGSVAIEARLRFPGLAVTAFEKRPECDALLEVNARRFGA
PGIAKVMGDFLEQDHRALCGNDGVDAVFIGGHGDRLGELFAAVATHLAPG
GRVVMNAVRRSSAEAFTASAARHGMELAEPLKLTVGDHNPVSVMKAVMPG
>CT0385 cbiGF, precorrin-4 C11-methyltransferase/cobalamin biosynthesis protein
MHERIAIIAITETGIALARSLKSLLVADGFAGCGLFSSRVAEGVEPIESV
PAFVRHSFGSFDAFLFIGSLGICVRSIAPVLQGKLRDPAVINCDESGRFV
QSVLSGHAGGANALAGRVARLLGAQAVLSTSSDVQGLWPLDILGREEGWG
VEFASPVAGESMTTAMAAFVNHEPTTLLLDVRDSLTDELERTAPPFVTIA
YRYEEVDFSTCSLLLAVTPRIIEASVQTVFYRPKVLCVGVGSERGIDPER
FVSSISEQFASNGFSMRSIRSVGSVDFKLNEEAFIAFAEACGTTLTGFTP
EQLESVGPVPNPSDVVFRKTGVHSVSEASAALLSGENRWLIEKQKISLDG
IPEGEPRHYTFAVSLLRGAERRGRIAIVGAGPGDPELVTVKGKRYLEQAD
LILYAGSLVPEKLTHYAKPGALVRSSASLSLEEQFALMERFYRQGKFVVR
LHTGDPSIYGAIQEQMAFFDAEGFEYEIVPGVSSFQAAAAVLQSQFTVPE
KVQTIILTRGSGRTPVPDKERLSELARARATMCIYLSAEWSDQVQSELLE
HYAPDTPVAVCYRLTWDDQQVWRGRLDGLASLVRESGKTRTVLLVVGEAI
GARGGRSKLYDPSFTHGFREGHGA
>CT0388 cbiL, precorrin-2 C20-methyltransferase
MNNQGSIISVSLGPGDPGLITVKALSQLREADVIYYPGTVSASGAVTSVA
LDILKEFDLDPSKLRGMLVPMSRSRGAAEASYAANYASMAEEVQAGRRVA
VVSVGDGGFYSTASAIIERARRDGLDCSMTPGIPAFIAAGSAAGMPLALQ
SDSVLVLAQIDEIGELERALVTHSTVVVMKLSTVRDELVSFLERYAKPFL
YAEKVGMAGEFITMEVDALRSRAIPYFSLLVCSPHCRQSTLSPFAS
>CT0394 cbiM, cobalamin biosynthesis protein CbiM
MTNLKFRFFAVASMAVAFMLFGGGEAYAMHIMEGFLPPGWSLFWWLVSLP
FFVLGFISLRRIVASNPRMKLLLAMAGAFAFVLSSLKIPSVTGSCSHPTG
VGLGAIMFGPSVMSVLGAIVLLFQALLLAHGGLTTLGANAFSMAITGPFV
SWGLYKLFDSLRSPRWLSVFVAASIGDLATYVVTSFQLAFAFPSITGGVV
ASAVKFLGIFAITQVPLAVSEGILTVMVYNAIMAYTGQSFFGAQSLSREV
K
>CT0393 cbiN, cobalt transport protein
MMTKASTKSNLLVKNLLLGVLVIALAAVPLMSLKHAEFGGSDDQAEHAIT
QLHPEYKRWFTPFWEPPGGEVESLLFALQAAIGAGVLGYGLGFLRGKHEA
SGFEQQ
>CT0391 cbiO, cobalt transport protein
MPSPDILRTEALRCVWPDGTLALDGVDLSIRAGQVTALLGSNGAGKSSLL
LSFNGILKPQSGRVLLLDEPLDYSARGLKALRQKVGIVFQNPDAQLFASS
VYEDISFGLCNLGLPEPETRRRIEAAMELVGVSALARKPVHHLSFGQKKR
VALAGVLAMEPSVLLLDEPTAGLDPQGADAIMRFIRELQRSRGMTVVVAT
HDIEMAPLFCDRVCIMERGRVLFEGEISAIVEHRDLVRQAGLRLPRIAHL
IEILVSKDGFTLPGNVMTIGAARKVLKTLKENQKSDERR
>CT0938 cbiP, cobyric acid synthase
MTNIAILGTASDVGKSIVATALCRIFSNAGVDVAPYKAQNMSNNSGVTPD
GFEMGRAQIVQAQAARVAPHADMNPVLLKPNTDTGAQVVLQGKVCADKSA
REYFGDTQRWAEAAFESLDRLMVRHELLVIEGAGSCAEMNLYQRDFVNFK
TARRAGAAVILVADIDRGGVFAQVVGTLAVIPPEDRALVKGVIINRFRGD
KSLFEGGVKMLESMTGVPVLGVIPYFRGFTIDAEDAVPLSSVVDPKQEPS
GDKIGVAAIYFPHISNFTDLAPLERDPSVELHYLHRPKSLDGYKALILPG
SKNVRGDLAWLETMGWRDEIEKFRKRGGIIVGLCGGYQMLGASIADPYGV
EGAPGASAGLAMLPVETVLEREKALCNSVGKIAGTPFFVSGYEIHMGRTA
LEPGASPLLEVTERNGVATDDFDGAKSADGQVTGTYFHGFFDRPEVRTWF
LRLLDGGYESPRGASVADPFELLAKHFSENLDLEKLFAIAGLSVKGEKES
>CT0392 cbiQ, cobalt transport protein
MMTLDEHAARSRLRDVAPVYKLFYALPPIAMVLWADSIVFSLLVLLLMGI
TVVVKGGVRPVDYLRWLTLPAGFLLIGTIAVAVDASASPEAFIVSAPLGG
VHVGVTSAGLAMAAHLCFRALASVSCLYVIAFTTPVADLARSMASVGLPP
LFVEMTLLVYRFVFQLFDTSSAIATAQKSRLGYAGLQATFRSLSALASNL
FVFSARRSEELYVAMECRGYDGAIRVLAPTSGARQPSLAGIALVEAALLC
VAVATHFGRLF
>CT0588 cdd, cytosine deaminase
MNRDHEFMALALEQARKSYDEGGVPVGAVMVENGKVLAAGHNQRVQQGDP
IAHGEMDCIRKAGRCARYDTVTLYTTLSPCMMCAGAIVQFGIGRVVVGED
RNFKGNAGFLREHGVEVSLLDDEGCRSLMDAFIAERPDLWDEDIAGNG
>CT0233 cdsA, phosphatidate cytidylyltransferase
MLMQSNLMQRVAVAIVGIPLLLWLNMQGGLYFLGLVLALSLMATWEFWRL
ATHRAHPPSVVILLPLTAFVQLDFYYGFIGYWEAILAVVMLLYVLEIWRN
QGSQFMNLGATLVGLLYVNLSFGALLRLRLSETTGEGSGEALVLLMLLCV
WAADIFAYFGGRGFGGKFIKKRLFARISPKKTWEGYLAGIAGSALAAWAC
SAYISGCPDGRAIPAGLLIGVVAPAGDLLESMFKRDAGVKDSSGVIPGHG
GVLDRFDTVMFVSPLLYFLVHHW
>CT0970.1 cfa1, cyclopropane-fatty-acyl-phospholipid synthase
MLDHLFQQKFETLLAAANITINGNRPWDIRVHDRRMFRKTMLQGNLGFGE
SYMEGWWDCDDLEELFYRILSAGVDQQLVTLASALEYLQGALINLQKPAR
AFTVGKHHYDAGNDLFRAMLDPLMMYSCAYWHEADSLDEAQQNKLCLVFE
KLDLKPGMKLLDIGCGWGGAARFAAEHYGVQVVGITVSKEQASFARELCK
DFDVDILLMDYRQLEGEFDRIVSIGMIEHVGYKNYRTYFDTARRCLKPDG
RMLVQSIGSNESVTGTDPWIEKYIFPNSMLPSPSQISRGFEGRFVLEDWH
SFGYDYALTLKAWESNITRKWPQIEKHYDKKFYRMWRYYLLSCAGAFRAR
TIQLWQILLTPSGIKGECCIPHQPVKRKFRDRGNDPAKLHRMPAPALGQR
REFPRSDVAAES
>CT1969 cfa2, cyclopropane-fatty-acyl-phospholipid synthase
MSGIYESKLRALLESAGIAIGGSNPWDITVHNPHFYKRVVTESHLGIGES
YMDGWWDCEALDQFFYRVLRARLDAKVSQASRALGNILGVLVNLQKPSRA
FTVGEVHYNVGNDLYEAMLDKRMLYSCGYWKDARNLNEAQENKLRLIFNK
LELQPGMRVLDIGCGWGGAARFAAEHYGVSVTGVTVSSEQKKMADKLRNN
LPVEVRLVDYRQLDGSFDRIYSIGMFEHVGVKNYRRFFEIARNCLKSDGL
FLLHTIGSKRSSTHTDKWTHKYIFPNSMLPSARQITTAAEGQQLIEDWHA
FGNDYDRTLMAWHRNFEEHWPQLRHAYDERFYRMWRYYLLSAAGSFRARN
VQLWQILFSNNGITGDYYVPREHKAVLLKN
>CT1270 chlG, bacteriochlorophyll synthase, 34 kDa subunit
MALNAKRSAVLPHEVLFLGFTCYFQALSFDQPNTSRVSNNSPDNIPFDTT
QERASSGMSRRKPFVQGRRSFEPVSSLALFVRFLKPVTWIPVVWSFICGA
IASGAFGWQQIGEIKFWLAVLLTGPLATGTCQMLNDYFDRDLDEINEPNR
PIPGGAISLKSATLLIALWSLLSVVVGWLVHPLIALYVVVGIINAHLYSA
NPIKLKKRLWAGNIIVAVSYLIIPWVAGEIAYRSDFSLHAITPSLIVATL
YTIASTGTMTINDFKSIEGDRQVGIHTLPAIFGERKAALIAAILIDLGQL
MAAGYMFMIGKAVYGWVTAALVVPQFLLQFSLVRSPRTMDVRYNAIAQNF
LVAGMMVCAFAIKSINP
>CT2281 clpB-1, ATP-dependent Clp protease, ATP-binding subunit ClpB
MHFDPNKFTVKAQEALQAASMLASSKQNQQIEPEHLLSVMLGDHDNIACQ
IARKLETPVDTLLSVVDREIDRIPKVTGASATGQYISSDLGKVFDTALKE
AEQLKDEYISSEHLFIAMSEAGVKVSKLLKDAGIDRNAILKVLTSFRGSQ
RVTSQNAEESYQSLKKYSRNLNDLVIKGKLDPVIGRDDEIRRVLQILSRR
TKNNPVLIGEPGVGKTAIVEGIAQRIVGGDVPENLKSKQIAALDIAALVA
GTKFRGEFEERLKALVKEVQASDGEVILFIDELHLLVGAGSAEGSMDAAN
ILKPALARGELRCIGATTLDEYRKHIEKDAALERRFQTVIVDQPSVEDTV
SILRGLKEKYEIHHGVRIKDAALVAAAELSNRYIADRFLPDKAIDLIDEA
CSRLRLEIDSEPEELDRINRELRRLEIEREALKRELEATGSV
>CT0089 clpB-2, ATP-dependent Clp protease, ATP-binding subunit ClpB
MNTADTRQRLLDIEKQIASLREEQATVKAQWEAEKELIHTSRRLKEELED
LRVQAENYERSGDYGKVAEIRYGKIAEIEKALEENNRKIEARQASGDLIM
KEEIDAGDIADIVSRWTGIPVSKMLQSERQKLLGIESELHRRVVGQDEAV
RAVSDAVKRSRAGMGDEKRPIGSFIFLGPTGVGKTELARTLAEYLFDDED
ALIRIDMSEYMEAHTVSRLVGAPPGYVGYEEGGQLTEAVRRKPFSVVLLD
EIEKAHPDVFNILLQILDDGRLTDSKGRTVNFKNTIIIMTSNIGAQLIQS
EMEHLEGRDADAALAGLKEKLFQLLKQQVRPEFLNRIDEVILFTPLTREN
LREIVTIQFNRIKETARRQRITLEISDEALMWLAKTGFDPAFGARPLKRV
MQRQITNRLSEMILAGQVGEDDTVEIGLENDAIVMKKK
>CT0191 clpC, ATP-dependent Clp protease, ATP-binding subunit ClpC
MEGNFSNRVQDVIRLSREEALRLGHDYIGTEHLLLGLIREGEGIGAKILK
NLKVDLFQLKQKIEENTHPKVPATQMGNVPLTKQAEKVLKITYLEAKICK
STIIGTEHLLLSILKGDDNIAAQILEQFGVTYDQVRDELMTITTGRSEAY
EPPMEGSYSSGSGSPARPSKKSETKRGERTKTPVLDNFGRDITRLAMEDK
LDPIIGREKEIERVAQVLSRRKKNNPVLIGEPGVGKTAIAEGLALKIVQR
KVSRVLYDKRVVALDLAALVAGTKYRGQFEERMKALMNELERSRDVILFI
DELHTIIGAGGASGSLDASNIFKPALARGELQCIGATTLDEYRQYIEKDG
ALDRRFQKIMVEPTSVEETIQILNNIKNKYEAHHHVHYSEDAIEKAVKLS
ERYITDRFLPDKAIDVMDEAGARVHLSNIHVPQEILELEKSIEEIKSEKN
KVVKMQNFEEAARLRDKEKNMLEALDHAKQEWEEQSMESVYDVTEADITS
VIAMMTGIPVAKVAQSESKKLLTMEAELKKEVIGQDEAIKKITKAIQRTR
AGLKDPSRPIGSFIFLGPTGVGKTELAKALTRYLFDSEDALIRADMSEYM
EKFSVSRLVGAPPGYVGYEEGGQLTEKVRRKPYSVVLIDEIEKAHPDVFN
ILLQVLDEGVLTDGLGRKVDFRNTIIIMTSNIGAKEIKSFSTGGGMGFAP
PSDATGDYKAMKSTIEDALKRVFNPEFLNRIDDTIVFHQLEKSDIFKIID
ITAGKLFKRLKEMGIEVEIDEKAKEFLVEKGYDQKYGARPLKRALQKYVE
DPLAEEMLKGRFTEGSVIQITFDEKEKELRFLDGASSAEPTPKKSKKEET
LDKE
>CT1553 clpP, ATP-dependent Clp protease, proteolytic subunit ClpP
MANINFGFDHHAKKLYSGAIENSINSQLVPMVIETSGRGERAFDIFSRLL
RERIIFLGSPIDEHVAGLIIAQLIFLESEDPERDIYIYINSPGGSVSAGL
GIYDTMQYIRPDISTVCVGMAASMGAFLLASGTSGKRASLPHSRIMIHQP
SGGAQGQETDILIQAREIEKIRHLLEDLLAKHTGQEVSRIREDSERDRWM
SAVEAKEYGLIDQIFEKRPAPKSEE
>CT0404 clpX, ATP-dependent Clp protease, ATP-binding subunit Clpx
MIKDKEPRKGQNGGRKSYGSEPPVVCSFCGRTSQEVNSMVAGPRAFICDR
CILSSVEILRKEISAIRHPDQSPEPAFQPRLKSPVNIKEALDQYVIGQEQ
AKKSLAVAVYNHYKRLDAHDWSSGDEVVIEKSNILLIGPTGTGKTLLAQT
LANLLEVPFTIADATSLTEAGYVGDDVETILARLLHASDFNLERAERGII
YVDEIDKIARKSANVSITRDVSGEGVQQALLKILEGSVVGVPPKGGRKHP
EQQLININTKNILFICGGAFEGLDKIIARRVSKSSMGFGSKVRGKQTGYD
PEILKLVTQDDLHDYGLIPEFIGRLPVMSVLEPLDAVALRNILVEPKNAL
VKQYKRLFEMDGVELEFTDEALERVVAIAIERGTGARALRSVLENVMIDI
MFELPTRKDVQKCVITAETIDKTGGPVYEKKDGKERKIA
>CT0286 cmk, cytidylate kinase
MESPIEKKPIIVAIDGPAASGKSTTAKILAARLGYTYIDTGAMYRSVTLK
VLREGLLDEIRKDETRIAELLQTITIGFQGQRVFLDGEDVSEAIRENRVS
REVSFISSLKPVRDKLRELQQEMGRKRGIVMDGRDIGTVIFPDAELKIFL
IADPAERAKRRHAELLLKAGGAAVPSVEALEEEIKQRDRDDEQRTHAPLK
RHPDAVLLDTSNMTIDEQVNVVYDLVNKIVEQQSL
>CT0390 cobA, uroporphyrin-III C-methyltransferase
MSDGKGKVFLVGGGPGDPELLTIRAHNVLQSADVVLHDALISPEILALLP
NGAERISVGKRLGDGKDQTDRQTKINDLLVRHAREGKCVVRLKAGDPFMF
GRGIEEVRALAAAGVPCEVVPGITTGIAAADLCGIPLTERHRNSSVLFCT
GHTADYSLGHFAAVIELMKAGTPLVMYMGFENLDKIVERFIDSGLSPELP
ACAVSRVSRSDQTLVAATIGTIVQQIRERELSLPVVFIIGEHAVPEGACP
DQSDASDPSDQNHNEQQ
>CT0945 cobP, cobalamin biosynthesis protein CobP
MPEVIYVTGGARSGKSCYALKLAERYKCRVFLATAEAFDGEMQRRIDKHK
QERDERFTTVEEPVYLDKALRALPDGTEVVLVDCLTVWLGNMMHYLGDEA
AINERIDALLDVLKNPPCDIIFVSNEVGMGIVPENAMAREFRDLAGTLNR
KVAERATQAYLLCSGLPLVLKK
>CT0948 cobS, cobalamin 5'-phosphate synthase
MPGLRNLSKSSSEMLSGLVTALRTLTALPVPGRDAERFSSSLYWFPVVGL
VIGGIVVLLARAGMGVGWPELAAVLALLGGLILTRGLHADGLADLADGFF
GGRTREAALRIMKDPNVGSFGSLALIGVMLFKWICLLELARAEAYGMIAA
GAVLSRTAQVLLAARMPYARSDGGTAMAFVEDAGWPHLLVASISGVVLLF
VLLDWQLAPSLILLFGSVVALFFVGWLSHRKIGGITGDVLGACSELVEAA
VWLLAALWLKGLFWAIA
>CT0946 cobT, nicotinate-nucleotide--dimethylbenzimidazolephosphoribosyltransferase
MTDRFQQLLASIKPVDMNLTSTVKAHLDDLTKPQGSLGRLEEIVMKYCIA
TGTTKPSLSKKKVFCFAGDHGVAAEGVSAFPAEVTPQMVYNMLGGGAAIN
VLSRHAGADLEVVDMGVNHDFAEHPMLRRCKVKHGSANMAEGPAMSIEET
LQAIMAGAELAIEARNQGYELLATGEMGIANTTPATALYATLLGLPVEAI
TGRGTGIDDERLHHKVAVIEKAIEVNRANLATPLEVLAALGGFEIAGICG
LILGAASVGMPVVVDGFISSSAAVCAIKLSCTVSDYLFFSHLSNEQGHRA
VMQKLGARPILDLDLRLGEGTGAAMAMQVIEASVKIYNEMATFSSAGVSG
KND
>CT1318 comA2, comA2 protein
MAQDSIFTVPVTPEEINETQIVEGQMARHLGIEITAVGPDSMTATMPVDH
RTIQRIGILHGGASLALAETVGSIAASYCVDREKQFIVGQEINANHLRAV
RQGESSVHATATPLHLGRTSQVWDIKIRDDKGRLVCVSRFTAAVLEKRG
>CT2006 comF, competence protein
MHLLFPEVCILCQKPLGEGEEHICAGCFNDFNPFPSVLAGGAALKSTVRA
HFGEKAVPAAAWCLYPYRSRGSLHEAMHAMKYGGLFPLGELFGKRLGELI
CQGGVPVGFDAIVPVPLHHLKRIERTYNQAEALARGMAGLIGLPVATRSL
ERCVYTGTQTGLGLEARRENMAGAFRPGRERCPARVLLVDDVLTTGATMV
SAAKVLKAAGAVEVAFATVALTEKE
>CT0610 cpsG, cold shock-like protein CspG
MAKSKVKWFDGKKGYGFILNPDGGEDIFVHFSSIISDQSFKVLNQDADVE
YDLDKTQKGLQAKNVRELSVSAATVASGVENPAVRLGVQGEFNSLPQ
>CT2102 crcB, crcB protein
MVMKNVLLVGAGGFAGSVARYLVALAVPFSGTGFPFATFAVNLLGSFLIG
FISELALSTTLLSPEARLLLTTGFCGGFTTFSTAMYETGGLMRDGEALYA
SLYVAGSLAGGLACLFSGTLLAKLWQ
>CT0301 crtC, hydroxyneurosporene synthase CrtC
MNITTDSLQQAWHRLDAPGSYEWWYFDAEDESEGISVVFIWFVGFAFSPY
YLSHYEEWKAHRRDDQPYPLDYGGFSFQLYQDGRETINFIKEGGRELFAS
EDGGIGVRFEGNRFVYDPLRDEYRLSIDFSFPARDRSVQASFSFRPLHRF
DYHFDTDLHAGVDFRHQWVLSVPKAEVHGLLDITSLSSDKRQVLQFRGRG
YHDHNLGTVPMYESIDRWYWGRTFSRRCDLIYYVVFLRGCSAEPQAVLML
LDHKTGRQSTFDAVRVSESRFTRGLFAPVHGKTLRLEAEGVSVEVQHQKA
LDTGPFYLRYTSLLSMMIGEEAQEEVRGISEFLNPAPLKSRLMQFFTASR
VWRAGKQSAMYVLYNFFKHRFERVHRINRKKF
>CT0708 crtK-1, crtK protein
MQSWYDGLNKPKLTPPNKVFWPVWSTLYVLIAIALIVYFRTPVKPHATTV
LVILAAHFLAGFSWTSIFFGKKKILAALIDLLFMDATLAAIIVFFANTNP
LAAALLAPYFCWSLLATWLNFGIWRLNPGKR
>CT1660 crtK-2, crtK protein
MNKQILTLALCIGLCLAVGFAGSTFTPKPASWYYTTLVKPSWNPPDWLFP
PVWTILFIMMGTALAKVLGTGWKKNEVNVGVVLFAIQLMLNLGWSASFFG
MQSPLAGLVDIVLLWIFIVLTMLAFARVSKPASLLLVPYLCWVSFASYLN
FTILQLNP
>CT1942 csmA, chlorosome envelope protein A
MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGS
MGGGSLRGSSPEVSGYAVPTKEVESKFAK
>CT2054 csmB, chlorosome envelope protein B
MLSNNSKHIRIMSNGTNIDVAGAINTLAETFGKLFQMQIDVANTALKALA
DVAEPLGKTATDLIGSFTGAATQVLQSVSSAIAPKK
>CT1943 csmC, chlorosome envelope protein C
MSESYQKLRKDFKELDFTDRLTFLAESLLLTGQSAIVGGLEVAGRVVETV
TGTVGSLIDASGITNILGGSGGVVGETIDRVAITVKDVSRSAGELYNDAV
RNVENATSNAAKAVGDVGVSASEAVKNIAGSFQKTTGNK
>CT2064 csmD, chlorosome envelope protein D
MADEEKIDTMKSFDFAVKSITEAGVNQLNLISNTIQSAVPAVTNAAQSLT
NAVSVSVKTVSEAAGALAGALGELGGAVANLAGALTNSAVSIAQSGVSAV
TNAIGSVLQAKKI
>CT2062 csmE, chlorosome envelope protein E
MNNPRGAFVQGAEAYGRFLEVFIDGHWWVVGDALENIGKTTKRLGANAYP
HLYGGSSGLKGSSPKYSGYATPSKEVKSRFEK
>CT1046 csmF, chlorosome envelope protein F
MANESGNIGVFGDLFTAVGDLAQQAVDMAGSALKTATDTVQPVTNACVQL
CTTSINSATQLVEGATKAITTAIAPKQ
>CT1417 csmH, chlorosome envelope protein H
MATEETNMPAAEAPKAAAGAPNTSAGNGDMAHLIGNMGILIDSTIESVQG
VISTVSSATGQIIEGVTTTINSEPVKEIINNVNSVSGQIIEGVTNTLKSE
QIQNSFNELGKFWTGMISNLNAMVNSNQVKNLFDNVSAGINQLAGGIFPQ
GMPPMFMGASSGEEKRKVVHQIPVVHTSESGAATLKAMTPQPTAAPAAPA
APAAPKNKPEGE
>CT1382 csmI, chlorosome envelope protein I
MNLIINDKTASSSVGQTIGKAARLNHAHVGYVCGGHGLCQACYITVQEGA
DCLAPLTDVEKAFLSPRQIAAGGRMACQATIAKEGTVKVLSRPEEVRRMV
FSNPFQLIGYAADMGKDTAQQIVPGVQNLIGRIQRGEMGGKDALGDMIES
IQGAAGLVVEAIQQGPMALPIPFKEQIADLISKLPLPQIQLPSISLPQLP
SISFPQLPFSLPKLPFSLPFLPQQPQATASLEKVTITVQPPAKD
>CT0651 csmJ, chlorosome envelope protein J
MIIYINDKPCNAKVGDLLLNTAKLNKAHIGYICGGNGICQSCFVYVLEGA
ECLSEPGEDEKAFISDKLFAEGGRLACRTTIVKEGTIRVLTRAEKFRRIV
LGLNVPGFITYAQTIGYNVTNKLPSGVSSIVSRVQSGRLNPVDTIGKIAS
GLSPASQLVYNNFIEAFPFMQAPVNMVSGVAKSAIDNASGALCTISGGRL
HLPGSTCTAHDKPAEAIERITISAK
>CT0652 csmX, chlorosome envelope protein X
MNITVNDRECSAQVGDRLLDIARANHSHIGYFCGGNAICQTCYVRVLEGA
ELLSPMSDAEKAMLSDKLIKEGTRMACQTLIEKPGKITVLSEVEAAKRLT
LENPLQLPAYMGKMGWEAAVKFTDTIAFQARREQGEHALEPTQLLHDVAA
AISDAIQLVFNAVQAAFGVNRSTDKPEIKADKGCGCDTTLLAKTATCDNG
HGRIVSPEVLQLHQERSAVCN
>CT1362 ctc, general stress protein Ctc
METRALSVNLREVKKNGAAKLRRLGQVPAVVYHKGEATVAISVEEISLNK
LVHSSESHMIDLQYPDGKSVRSFIKDVQFDPVTDRVIHADFQLFSTDEVV
EMEVPIHLEGECPGVKIGGGKIQINVHTLPLKGKPEAMPEHFTIDVSALE
LGQTLHIRDLQAIAPEGVQILGDADTSVVSVVAPRKEAETAAEGATAEA
>CT1171 ctpA-1, carboxyl-terminal protease
MSRIFTVIIMLLIGGFGYFIGWRMNSGTGGKMVDTYNLIRSYYVDPVNAD
SLQSAGVRGMLQSLDPHSIYLEPEKAAITQSNFNGNFEGIGVEFDVVRDT
LLVVTPLAGGPSESAGIMPGDRIIGIDSKNVIGIKPSAVLGKLRGERGTR
VRLRIYRPLTRRTIDFLVTRGKISTSSIDAAFLDDARTGYIRISQFVETT
GSEFHNAVQKLRDKGMRKLIIDVRGNPGGYLDQAVAVADELLPAGKLIVY
TKSRHGGDDQMKYLSTSGGIFEKGEVCLLVDRGSASAAEILAGALQDNRR
ALLIGELTFGKGLVQRQFKLPDDSVVRLTVSRYFTPSGRQIQREYDDGLQ
GRERYYKEMFTRNLPGGFIKKYGDLMYREGQNISVYSTGSLLKNTAADSL
RAVLAKAGGIMPDYWVFGKPYSNLYQELYAKGVIEDIALKLIDDPSDPVQ
KYRSSATSYLDGYHVPVNLEALVRKECAEAKVQFNQAEFDRDRADIALAL
KARVARHLFGTEEQIRVLVSEGDPVMKVARHFIEKSAS
>CT2258 ctpA-2, carboxyl-terminal protease
MSWFTTQRHSGLCRDFGRRWRRVATISSAIALICAQPLFAVPAAKPNEDF
FSIAKSIELLGDVYKNVAQNYVDPVNVSEFMYSGIDGMLGQLDPYTAFLD
EEQSGELDEITSGQYAGIGVTLGIFSGDLFIISVIDGQPAAKAGLKVGDQ
IIAIDGVKVSKKSIDEVRSTIKGSPGTNIRLSIKRDGQGPLTVISLTRGE
VRISSVPFFGLFGSSGYVQMNSFSEHSREELSAAIRKIRQEAAKNRVVLN
GIVLDLRGNPGGLLTSAVEVAGLFVEKNSRIVSTRGRAADSEQVYVTKTE
PQEPTLPLVVMIDGDSASASEIVSGAIQELDRGVILGENSFGKGLVQSII
NLPYDHILKMTTAKYYTPSGRLIQKPIARDESRRKVVLSNGDADSTKVFY
TRNRRKVYGGGGIRPDVVAKADSLSEYQHKIENSGLLFRYASRFHRKHPE
FRLQQLSSEPLYDDFNRFLEKEHFSFRSGAQKTLDSLKTLVQKEAGADKA
LAGQLDALDKALAASTRRNISRDSLHITAALQREIMRHYDERAALKRAIE
DDPVAAKAFALLGDQKRYRSLLKP
>CT1607 cutA, periplasmic divalent cation tolerance protein CutA
MSESKSGGYCMVITTAPSREEAEKLAQGILENCLAACVHLSDIRSFFFWD
GEMQNDDEVSLFIKTTKKRYDALESYIQEYHPYDVPEIIQLPITGGSPEY
LAWLDAMTGSSK
>CT1818 cydA, cytochrome d ubiquinol oxidase, subunit I
MDTLFLARLQFALTSVFHFFFVPLTLGLSIFTAIMETAWVRTGKEKYRQL
AKFWGHLFLINFAIGVVTGIVMEFQFGMNWSQYSRFVGDIFGVPLAIEAL
LAFFLESTFLGIWVFGWDRIPKGLHAASIWLVAIGSNLSALWILVANSFM
QSPVGFHMAADGSRAEMTSFSALLFNPYVWLQFPHVITAGIATGGFLVIA
VSVWHLMKKTADEEQFRTSLKFGAIYAFIGSLLVTLAGHTQMQEMVHNQP
MKVAAAEALWHSENPASFSLFTVGDEENLKDVFSIRVPGMLSFLAYNKFS
GEVKGISELQQEAVAKYGPGNYIPSVITAYWSFRFMVGAGTLMLLAAVVA
LFKVIREDYNFGKLTGALLLSAFILPFVANSAGWLLTETGRQPWIVVGLL
KTEQAVTPASVVSSAELLTSVVVFTLIYSVLTLVDVFLLKKYATAGLHGA
E
>CT1819 cydB, cytochrome d ubiquinol oxidase, subunit II
MDLQTLQIIWFILVAVLFTGYFILEGFDFGVGILLPFMGKDDLERRAVIN
TIGPFWDGNEVWLITAGGAIFAAFPHWYATLFSGFYLALLLMLVALIFRG
IAFEYRSKRDSAAWRSFWDWSIFLGSAIPALLWGVAMANFIRGVPIDASM
NYTGGFFNLLNPYALACGLASLSVFTLHGAVFLTLKTTDELHERAMGMAK
KLWIPATVLSLVFGVYTYFETDISTRLGVNPGAIPIFSVLSLLSVIVLLN
KDASGWAFVMTAISIAFSTITIFMGLFPRVLVSSTNPDWSLTIYNASSSQ
YTLGIMTTVAAIFVPIVLLYQGWSYWVFRQRISKDSKMEY
>CT1822 cydC, ABC transporter, ATP-binding protein CydC
MKTFLRLTGLTRPFAWWMLLAALIGFATIGSGIGLLMASAWLITTAALMP
ALSALQIGITGVRFFGISRGVIRYAERLISHNTTFRILTRLRVWFYDAVE
PLAPARLAAYRSADLLKRIVDDIQSLENIYARVLAPPITAALITLLMWFV
VGHWSTAAAEGVLTSQLIAGIAVPLLTARLAAGTARGITALQGEQQVLAV
DMVQGMSELRVFGMVGDYTEKLRDAEVKKLALQKCAAFIEGLHESLTGLA
MNGAVIWILYSMLPMVRTGAVSTITLASVVFGVMASFEAFLPLTGSVQNI
EADVRAGERLFEIIDAKPEVVPPAKPEPFPKSTGIEVKNLAFTYPGSDRP
ALDGVSFSVPQGGRTAIVGPSGAGKSTITSLMVRFWNPTQGEISIGGQTI
DKLDPEELRRNIAMVSQRTYLFGQTIRENLLLAAPDATDERLRQALTLAG
LDSLQSRLDDWVGQHGMNLSGGEQQRLAIARMILQDAPIIVLDEATANLD
AITEQALLDTLDTTSQGKTVLAITHRLHRMERYDEIVVLYEGRTIERGSH
EELLKSDGFYAGMWKLQHQRKA
>CT1821 cydD, ABC transporter, ATP-binding protein CydD
MNIDRNLMRLLAEQKRPFIFSGISGAAGALMLVAQAWVLSGIIETVFRQA
PAWQTILPLVGLFALFSTLRVLFGWAGHHEAKKGTLAIRKTLTERLSGTV
AALGPSYTRSGQSGRIVTTLLKGVESVDAWFSQYIPQLFLSLIIPVVILA
AVFPADWLSGLILVLTAPLIPVFMILIGKRASAATEKQWNTMSRMSGHFL
DMLQGLSTLKLFAQAKTRRDGIAEASENFRHSTMQVLKIAFLSSLTLELV
GTLGTAVVAVSIGVRMLGGHLPFRPGFFALLLVPDFYLTLRQLGTKFHAG
MEGVTASKEMYEILDRSKEIPKAGSRELTATDISSQPIVFEGVDYRFPGS
DKPALDSVSFAIEPGTVTALTGPSGAGKSTLLNLLLRFIEPADGTISLGD
RKTQEFNLDSWYRQIAWVPQHPFLFNATIRENLLMARRDATPDEIDNALK
QAGLLDMVRSLPDGLETMIGEQGARLSGGEAQRLSLARAFLKDAPVLLLD
EPTSHTDPILEAQLRKAMEKLMQGRTVVMIAHRLESIRNADRIVVLDRGR
LVQSGTHDELMADKGFYRQAIFSSIEEAAA
>CT0604 cysD, o-acetylhomoserine (thiol)-lyase
MSEDNTFRFETLQVHAGQEPDPVTGSRAVPIYQTTSYVFENAEHGADLFA
LRKAGNIYTRLMNPTTDVLEKRMAALEGGKAALGVASGHSAQFIAIATIC
QAGDNIVSSSYLYGGTYNQFKVAFKRLGIEVRFVDGNDQEAFRKAIDENT
KALYMESIGNPAFHVPDFDAIAKIARENGIPLIVDNTFGCAGYLCRPIDH
GASIVVESATKWIGGHGTSMGGIIVDAGTFDWGNGKFPLFTEPSEGYHGL
KFYEAVGELAFIIRARVEGLRDFGPAISPFNSFMLLQGLETLSLRVQRHL
DNTLELARWLERHDAVAWVNYPGLESHPTHALAKKYLTHGFGCVLTFGVK
GGYENAVKFIDSVKLASHLANVGDAKTLVIHPASTTHQQLSAEEQVSAGV
TADMVRVSVGIEHIDDIKADFSQAFENLA
>CT1436 cysE, serine acetyltransferase
MSVQDIWSLIREEACLECEREPEIRLFLEQHILRYEEFAPALAMLLSVKL
GSKHFPPPVLEGIFEDFYRQSPESVRCAACDMEATRERDPAAVNYFEIML
FLKGYQALQSYRLAHWLWQNDRKSLAYFLQNRMSEVFAVDIHPAAKIGKG
ILLDHATSLVIGETAVVEDNVSLLHEVTLGGTGKDSGDRHPKVGKSVMIG
AGAKILGNIKIGEGAKVGAGSVVLDDVPPHYTVAGVPAHIVGRTEVPEPS
LDMNQRLIFPEKQKPKGEQHSCL
>CT0700 cysK, cysteine synthase
MATIANITNTIGRTPLVRLNKLAKGLDADILLKLEYFNPLGSVKDRIGRA
MIEAAEAEGKIDARTLIVEPTSGNTGIALAFVCAQRGYRLLLTMPETMSI
ERRKLLRYLGAELVLTPGAQGMKGAIEEAARIVEAEKNSFNPGQFRNPAN
PLIHQATTGPEIWNDTEGRVDMLVSGVGTGGTITGTSRFIKALKPNFKSI
AVEPKDSAVLSGGQPGPHKIQGIGAGFVPDVLDVSLLDEVVTVSNDDAIS
TARALASQEGILCGISSGAAVHAAIEVAKRSSSAGKTIVAIIPSTGERYL
STALFEDIEA
>CT2158 cysS, cysteinyl-tRNA synthetase
MSSLHIYNSLTRTKEPFKPIHPGLATIYVCGPTVYGHAHLGHAKSYVSFD
VVVRWLRHVGEEQGYKVRYVQNITDVGHLTDDADEGEDKIQKQARQERIE
PMEVAQYYTRSFYEDMDRLGVERPNIAPTATGHIPEQIALVERLIESGHA
YESNGNVYFDVNSFEGYGKLSGRTDQEALQSGGRVAERSDKRNPSDFALW
KKAEPGHIMKWQSPWGEGYPGWHLECSAMAMKYLGDTIDIHGGGMENKFP
HHDCEIAQSEAATGKPFVRYWMHNNMVTVDGVKMGKSLKNFVNLKELFGK
FDPLVIRFFILQSHYRSPLDFSEAAIRASQSGFEKLQETYKRLVESAEGK
GQLDVATFEQKITDALNDDFNTPVAIAVLFEFIKALNGALDKDGLDAASK
SGAQNLFDSYAGKVLGILKSRDELLAGESGESAQTLNDVMAVLLELRKEA
RASKDFATSDKIRDLLMERGIEIKDTREGATWSKKKA
>CT1265 dacA, D-alanyl-D-alanine carboxypeptidase
MGRPHGRFYATRKVLFTLLAFFVLTLIPLTASAKRRPAPEGEDAVAAYIV
KETGSPEFLKSKNADTPRSPASLTKIMTCLLAIESGRMNDVVTIPLEATQ
VEPTRAGFKPGEQFRLRDLVKAAMVNSSNDAAFAIAIHLGGSLDAFVAMM
NARARALGMSHTVFTNPAGYDRGIYAGNRTTARDLVILTERAIRFPEFNA
IAKLDRVDFNELSTGKIYSLRTHNKLLERYPYSVGIKTGYTSMAGPCLIA
RALRNGKDMLVIMLSARTDRWSLASTMFDQGFGLDAGPVQVAETAEKSPR
RVAVDAPEAPHSSVSVRRAQALEALRLKVESRRGQSAVTEIRGMSMNVRS
SGADSSVKARMEKRKASAVIKLRGKSSRADRIALRAGKNASARRKQQLAM
ARQRNHKNGVVLKSAKKNASGNVALKSHNKAQRSIKTARKGAVKRSTADA
KSAKSRSWKEGLSLSERAVRSPNG
>CT1624 dapA, dihydrodipicolinate synthase
MSNRLISGSAVALVTPFHQDGSIDFEAMRRLVRFHREAGTDIVLPCGTTG
ESPTLTNEEEAEIIRVVCEEAGESMMVAAGAGNNDTRHAIELARNAEKAG
AQAILSVAPYYNKPSQEGYYQHFRHVAEAVSIPVIIYNVPGRTGSNVNAQ
TILRLARDIENVVAVKEASDNFEQIMTLIDERPENFSVMTGEDGLMLPFM
ALGGDGVISVAANQVPKVVKGLIDAMKAGNLEEARAINRKYRKLFRLNFI
ESNPVPVKYALSLMGMIEEVYRLPLVPMADANKAILRAELEKLSLV
>CT1850 dapB, dihydrodipicolinate reductase
MKITLVGNGRMGRQIADVVAASGAHVVNRVLDVNDTIDAAAFSGSDVIID
FTVRSAFLANYPALIASGVPVVVGTTGWDDVMPQVREEVVKARSTMLYSA
NYSLGVNIFFRTLREAARLIAPFEQFDIALSEQHHTGKADFPSGTAIKAA
DEILNSNPRKRTIVRELQEGRKLQSDELQVASIRLGSVFGVHSAIIDSES
DTIELTHTAKNRTGFASGAVRAAEWLVQRHATSPGFYTMDDFLNDLFSA
>CT2259 dapD, 2,3,4,5-tetrahydropyridine-2,6-dicarboxylateN-su ccinyltransferase
MEQYHQIKEQIIAFSLLSADQLKADAQVKSVFAEFKTLLNEGKIRAAEPS
GDGWTVNQWVKQGILVGMKLGVLIESHVDLAGLGSASFIDKDTYPLREFT
AADGVRIVPGGSSVRDGAYLAPSVVMMPPAYVNVGGYVDEGSMIDSHALV
GSCAQIGKKVHLSAAVQIGGVLEPVGAMPVIIEDEVMVGGNCGIYEGTIV
KKRAVIGTGVILNGSTPVYDLVNGTVLRKSAAGPLVIPEGAVVVAGSRQV
KGEFAAEHGLSIYTPLIVKYRDERTDSATALESALR
>CT2021 dapF, diaminopimelate epimerase
MSGAGNDFIVIDNRQGLFNLTHEQVRAMCTRRTGIGADGLILLETSETAD
FRMNYHNADGFPGTMCGNGGRCAVWFAHLIGIRPTGKHYRFEAGPSTYEA
EVTGEESVRLHMLPPSDFRDGLQAGAWNCHFVDTGSPHAIAYVNNLDQLD
VLTEGGNIRHNKELFPDGTNVNFLEITAPDALSIRTFERGVEDETLACGT
GTVAAALMSFRLGKVTSSLVRVKVKSGETLMVGFNEMMDEIYLEGPARAV
YRGTITL
>CT1345 ddlA, D-alanine--D-alanine ligase A
MSKQTVALFFGGKSAEHEISIISARSIAAQIDRNRYELSPLYIDRDGKWH
ASECSQQVLDTDIAALLRSGTPESAGKRLDELTAAAAGECFDFGSFLKNT
DVAFIALHGSYGEDGKLQGCLDTFGIPYTGCGLTASALAMDKVLTKLCAM
NAGIAVAEFMTITSCAYMANPLETIVEITKRFDWPLFVKPASLGSSVGIS
KVRNAEELAAALENACGLDSKALVEAAISGREIEVAVLGNSDPLASEPGE
IIPGSDFYSYEDKYIKNEAKIVIPADLPEGVAEEVRKAALTVFKALGCEG
MARVDFFVENGTNRVILNEINTIPGFTDISMYPMMMAASGIGFAELVEKL
LLLALEKRSITHKI
>CT0597 deaD-1, ATP-dependent RNA helicase DeaD
MAATFLSMTRKETMNEPTMSEQQDNSVNFRSLELAEPLLRALEAVGYENP
TPIQASTIPLLLEGRDVLGQAQTGTGKTAAFALPVLSNIDLSATDPQALV
LAPTRELAIQVAEAFHTYAEFMPGFHVLPIYGGQDYGVQIRNLKRGVHVV
VGTPGRVMDHMRKGTLNLDNLKCLVLDEADEMLRMGFIDDVEWILDQTPE
SRQVALFSATMPQPIVRIARKYLKAPAEITIQTKTTTVETIRQRYWMVGG
HHKLDALTRILEVEPFDGIIIFVRTKTETVALAEKLQARGYAAAALNGDM
VQSARERTIEQLKDGTLNIVIATDVAARGLDVERISHVINYDIPTDTESY
VHRIGRTGRAGRSGEAILFVSPRERGMLFAIERATRKRIELMELPSTEIV
NDKRIAKFKQRITDTVGAEDLEFYIGLIGQYCQEHDVSELEAAAALAKLL
QGDEPFLLSAKPERERPVRREEREPRYDREHGRDSFRDDFRKEREPRRQS
KGGLYSDEKRETYRLEVGKVHDVKPGNIAGAIINEIGLDPEAVGKIVIND
QYSTVELPAGMPNDVFQELKKVRVCGRQLRVSKMDEHQGGYGDRHDRGFG
GDRSGNRSFGKDRSGDRGFGGDKKKSFGKPSGGGKRKKDDGAFFTGFTGT
KKKRMKD
>CT1034 deaD-2, ATP-dependent RNA helicase DeaD
MPFSALGIIDHLRKALAEEGYNSPTPIQKEAIPVILEGNDLLACAQTGTG
KTAAFALPVLQRLHQSRMHGEKRKIRCLVLTPTRELAIQIGESFTAYGRH
TGLINTVIFGGVNQNPQTARLVRGVDILVATPGRLLDLIGQGHLHLRDIE
YFVLDEADRMLDMGFIHDIRRVLAVLPKKRQSLFFSATMPPEIIKLSAAI
LHNPKEVMVTPVSSTVEIINQQILFVDRENKNSLLAHLLKERNIESALVF
TRTKHGADKVARFLAHHDITAEAIHGNKSQNARQRALGNFKTRQTRVLVA
TDIAARGIDIDELEYVINIDLPNIPETYVHRIGRTGRAGNRGAAYSFCNA
EEKAYLRDIEKLIARKIPVIEDHPFPMVNTIPENAAASKQPARKVPHQKK
PSGTHPGNSHWKRK
>CT0178 dedA, dedA protein
MELFSKLADFILHIDRHLQLLASEYGLWLYGILFLIVFCETGLVVFPLLP
GDSLLFAAGSLAAIPMSQLSPHWLFVIFTTAALLGDTVNYWIGRSLGPKV
FHFEKSAFFNPEHLQKTRGFFEKYGGKTIIIARFIPIVRTFTPFVAGVGA
LPYPRFIMFSIVGALLWVGVFSYGGYFFGQLPVIKANLKLMIVGIIAVSL
LIPAIEFVKHHFRRKASLDAD
>CT1454 def, peptide deformylase
MILPINTYSDPVLAMKAKPLKGVDSAIEELIAEMFDTMYKAPGIGLAAPQ
VGHSLRLVVVDISTIKEYADFKPMVVINPRIVAVRGRSLMEEGCLSVPGI
AGNVVRPSAITLHYRDEKFEEHTADFHSMMARVLQHEIDHLDGTLFVDRM
DKRDRRKIQKELDAIAEGRVKADYPLARDVNRVEAEA
>CT1815 deoD, purine nucleoside phosphorylase
MTEQRQKIREAVEFIRKKTTAEYPVGIVLGTGLGGLVKEIEIDFSLDYAD
IPYFPISTVETHHGKLIFGTLAGKKVVAMQGRFHYYEGYSMTQIVFPIRV
MKELGMKTLGITNACGGMNPGYSKGDIMLIDDHINLLGANPLIGPNDPEM
GPRFPDMCAPYSPRILEIAEKVALEHGIKVQRGVYVAVTGPCLETRAEYR
MLRAIGADVVGMSTVPEVIAAVHQGTEVFGMSIVTDECFPDCLVPVSIEE
IIEVSSRAEPNMTTIFRNVVANL
>CT0207 dfp, pantothenate metabolism flavoprotein
MLTGRNILLGISGGIAAYKTPQLVRLLKKSGAGVQVLATASALKFVSELS
LATVSNHPVLTGIFPSHEAREDEYTRHIALGEWADAFVIAPATANTLARL
ATGLCDDMLSLCFLTLRPGKPVLIVPAMDGHMYDSPSVQRNLATLRAQGC
RLMEPESGSLASGQCGLGRMPEPETIFEALTGMLAAPEASALCGKSVVVT
AGPTREKIDGVRFLSNYSSGKMGFAIAANAARRGARVHLVTGPVNLPTPS
GVERIDVESAVEMLEAAQPLFGECDLFIGAAAVADYRPEAPVEGKIKKNE
AEMEIRLVRNPDILAEFATNRKLGQLAVGFALETAGGIDYARKKLADKKL
DLVAFNVYDGRTSGFEVDTNALTLLARDGGVTELPLLPKEEAAARLLDAV
ESLLPR
>CT0002 dnaA, chromosomal replication initiator protein DnaA
MSDTIQQEAPDNLQVTPTHGRSFAEKVWSACLGLIQENINTLAFKTWFLP
IRPLSFSGSELTIEVPSQFFYEWIEENYSVHVKQALRQVIGPEAKLMYSI
VIDKSQGQPVTIELPHQIDAAPAERSVRPEAPGQKASAERERLEIARPRF
ESNLNPKYTFSTLVRGDCNSLAFAASKSIAQNPGQNAFNPLVIYGGVGLG
KTHMMQAIGNSVLENRITDAVLYVSSEKFAIDFVNAIQNGNIQEFSAFYR
NIDVLIIDDIQFFAGKEKTQEEIFHIFNTLHQSNKQIILSADRPIKEIKG
IEDRLISRFNWGLSTDIQAPDYETRKAIIQSKLKQSGVSLDPVVIEFIAT
NVTSNVRELEGCIVKLLAAHSLDNQEIDLQFAKSTLKDIIRYNTKQLTLE
TIEKAVCSYFSITSNDLKGKSKKKEIAVGRQIAMYLSKDMTDSSLKTIGL
HFGGRDHSTVIHALNTIEKKIAASNEERKKIEELRKRIEIMSM
>CT0205 dnaB, replicative DNA helicase
MKPREPQLDLSRDIDFSKESRVPPYSLEVEQEVLACILLEDDPIEQVIQI
FGDNGEAVFYEKRHQIIYKAMMLLYQKRQAIDLITVSEELSRIGELENIG
GRPYLGELSGKVISSANIEFYARLVKEKYLYRRLISISSHISSAAYNTSM
DIFDLVENASQQFFNISQAGVKKKATSIKELLKTATRMIENLGSSHSSVT
GIGSGFSELDEYTAGFQPSDMIIIAARPSAGKTAFALALARNAAVNFNTP
VLFFSLEMAEIQLALRLMCAEAYVESQLVRRGQISPEMMNKIINSMDALA
DAKLFIDDTPGISIMELTAKTRRMKQEHGIGMVVVDYLQLVTPVRDGRSN
REQEIAQISRSLKALAKELNIPVIALAQLNRSVEQRSGDRRPQLSDLRES
GSIEQDADMVIFLSRPEMYGIKNFEDGSSTKDITEVVIGKQRNGPIGTVR
LLFLKKYGRFQSTANSYLTAADSGQQQPEPARGDGYPEASLPEPPPLDMD
SFINNSDAAPF
>CT0840 dnaE, DNA polymerase III, alpha subunit
MQPDLRSALEFVHLHTHTHYSMQSSPIFPGDLFAACKKQGMTAVAVTDYG
AMFNMPELFGQAKKAEVKLIMGAEIYLAGTGSRDSGKPATLILLIRDDIG
YRNMCVILSRAARDGFSNGLPHVDRSVLEECRDGLVCLSAAHSGLIGRAL
LSGNETEAANFANYYRDLYGEHFYLELQKHGAPYEEQLVPGTIRLSEQLG
IPLVATNNVHYLDRRDSGCYRAMIANRTKQRLNSQNLQCLPGHENYFKSA
AEMSLLFDNAHGELDNTVKIAEACSYTFINKDPHLPKFPLPEGFDSEKEY
LRHLTWEGAKEKYGAADGTIPEEVKARIELELGVIEKMGYSAYFLIVSDL
IAASRKMGYSVGPGRGSAAGSIIAYLTGITRIDPLKYKLLFERFLNPERI
SMPDIDIDFTPVGKQKVLDYTVQKYGAESVAKVVAIGTLGAKAAIRDAGR
VLDVPLKAVDQLAKLVPSRPGTSLEDAFREVKELKRLVDTDPQYQQLVQY
ARAMEGRARNVSMHAGAVVITDGALEEQVPLYVSNKIETEERKYADEFDQ
NDIDGTKAESSDEKQVVTQFDKNCIEQAGLLKIDYLGLETLAVIDETLRL
IKKRHGIDIDLEKVPMDDRKTFRIFQEGKMAGIFQFESSGMQSYMMRLQP
TTIGDIIAMSALYRPGALNAVIDEHRNAVDLFIDRKHGREEIDYMHPMLE
EILKETYGVIVYQEQVMQISQVMGRFSLGKADNLRKAMGKKDPKLMQKFK
EEFVEGAASIDVNKALATRIFDLMAEFAGYGFNKSHSAAYGVLAYWTAYL
KAHYTIEFITAILNSEIGDTERMKHLTDEAKGFGIFMLPPSINKSDALFS
VEEHKGRPCIRVGLSAIKQVGGGARAVVAARLRRDGKPFLNLFDLTASVD
LRAMNRKALECLIQAGALDEIDPNRGKLLANVDKAIKFGQIQNKAVTLGQ
GGFFNDDFSDGQAGVHYPDLDNAEPMPESEKLQYEKRLVGFYLSHHPLDR
FRRDWEAFASLTLDKRDVTPSKLYKAIGVIVSVKPYQDRKGKQMLFGVLE
DFTGKADFTVFASVYEQYHHMLKTDEVVMLSVEAEVKDGGLKLLVREVAP
LKKVRSALVKKVVLRIDADDASQLGKLQQVREIFEKHKGGTPVDFEVRAT
IGSCNETLKLFARNTPIEADDEVLDQLEEILGPDNVRITG
>CT1484 dnaJ, dnaJ protein
MKRDYYEILGVARSADKDEIKKAYRKLALKYHPDKNPDNKEAEEKFKEVN
EAYEVLSNDDKRRRYDQFGHAGVGSSAASGSGPGGAGYGDINDIFSAFND
MFSGGGGRARTGGSPFSGFEDVFSGGFSGSGSGRRRSAGIQGTDLKIRLK
LTLEEIAKGVEKTIKIKKLVTCRECNGTGSKSGKTEICPTCHGSGEVRQA
TKTMFGQFMNISVCPTCGGEGRVVKDRCPSCYGEGIKQGEATVKITVPAG
VQDGNYLTLQGQGNAGPRGGAPGDLIVVIEEKPHELFKRNGDDIIYDLSV
GFPDLVMGTKIEVPTLDGHVKLTIPAGTQPNTMLRIGGKGIGHLRGGGSG
DLYVRVNVFVPKEVSGKDRDLLKELKKSTVICPNHGDENHEKSIFEKAKD
IFS
>CT0643 dnaK, dnaK protein
MGKIIGIDLGTTNSCVAVMQGTQPTVIENSEGSRTTPSMVAFTKTGERLV
GQAAKRQAVTNPKNTIFSIKRFMGRKYDEVPNEKKLASYDVVNEGGYAKV
KIGDKTYSPQEISAMILQKMKQTAEDFLGEKVTEAVITVPAYFNDAQRQA
TKDAGKIAGLEVKRIINEPTAAALAYGLDKKKENEKVAVFDLGGGTFDIS
ILELGGGVFEVKSTDGDTHLGGDDFDQVIINYLADEFKKQEGIDLRKDAI
ALQRLKEAAEKAKIELSSRTDTEINLPFITATQEGPKHLVINLTRAKFEA
MSAALFDKLFEPCRRAIKNSKFDIKEIDEVVLVGGSTRIPKVQALVKEFF
GKEPNKSVNPDEVVAIGAAIQGGVLQGDVTDVLLLDVTPLSLGIETLGGV
MTKLIEANTTIPTRKQEIFSTAADNQTSVEVHVLQGERPMASDNKTLGRF
HLSDIPPAPRGVPQIEVTFDIDANGILNVSAKDKATGKEQSIKIEASGKL
TEAEIEKMKEDAKAHAAEDQKRKEEIELKNSADSLIFSTEKQLTELGDKL
PADKKAAIESALEKLKEAHKSGRVDAIKPAMDELSKVWSDAASNLYGQPG
AEPQPETNGHAGGSKGGDGAVNAEYEVIDGDDK
>CT0001 dnaN, DNA polymerase III, beta subunit
MKFNTTIKRLQEAVNKVILAVPAKSLDARFDNINLTLENGMLTMFATDGE
LSITTNCDVASTDKGNIAIRARTLQDFLRSMYETDVTFDIERQSIGELGT
IHIATDKGRYRIPCTFTTKPESQKRNFDLSIELQQNELQDMIHKTLFACS
VDGMRPAMMGVLFEFDEEYITAVSTDGHRLVRCRKKPGVTVAEKQKIVVP
ARVLSIIQRMLTNEEVKMTIDSERRNVKFKTATMELESALIVEPYPNYEA
VIPVENEKQLVTDRSQLHDSVKRVGRFSSIGDLKISAQDSQIKVMAENAS
EGESAQEELPCTFTGEEITIGFNAKFVEAALSHIETEQVSIEMSSPTTAV
LFKPKYEERQDDLIILVMPVRINN
>CT1039 dnaQ, DNA polymerase III, epsilon subunit
MLELARKIQASRRCRKGLLPENICRYVSLFDHPLQRNTPLDELRFVIFDT
ETSGFDLVKDRILSIGAVSMKGSTIDIADSFEVLLRQEAIGGKDAVSVHG
ILKRDLTQGMEEGEAVCRFLDYLGNGVIVAHHADFDIAMVNRVLSQRYGI
KLLNEALDTASFAKRLEKGPYYNLAHKSGEYRLDNLCARYGICLYDRHTS
AGDAYLTAQLFQRLLAVGRKAGIDTLGKLLLK
>CT1324 dnaZX, DNA polymerase III, gamma and tau subunits
MSYQVIARKYRPSRFADITAQEHITGTIQNSLRMGRVGHGYIFSGLRGVG
KTTAARVFAKAVNCQRMIDDPQYLKEVTEPCGVCESCRDFDAGASLNISE
FDAASNNSVDDIRLLRENVRYGPQKGRYRVYIIDEVHMLSTAAFNAFLKT
LEEPPPHAIFIFATTELHKIPATIASRCQRFNFKRIPLDRIQGQLRQICD
AEGITADADALQLIARKAQGSMRDAQSILDQVIAFAIDSEGERAIRYDKV
SELLSYIDDEHFFMVTDAIANADAAAMLEVAGMVNRNGYDEQDFLEKLIE
HFRNFLIIHNLRSSKLIERPEPVKERYQRDALRLTPAAIMAMTDFLMQTQ
RELKFYAEHQFRFELALLKLIELGRGVWPSPTASADEKKKLEPVAPPSAP
ASAAPFKPERSTRRSKTASSPESGTAPSGLPPVPEPPPLPTKGSASSRPA
ESKLAASIDLGSWKQAFSKFGSNADQHLPARNRLRVEENVAGTDSGAPVA
VLEQLRMEWGRFLEHLSAKGLQVLVSHLHSSELMSCSPSGVVELGCCRKF
SFEELQHDSALLETEMADFYRLPLKLAIRYDAERDACTREKSIFTMFKEL
SETNEVVRYLISEFGGELVY
>CT0333 doc, death on curing protein
MRFLDLHEVLHIHRDQITRYGGTLGVRDMGLLTSAIAMPTAMFKGDFLHT
DIYEMAAAYLFHLVRNHPFLDGNKRVGAVSAIVFLALNGYDFEAPENDLV
EMVYGVARSEFEKSDVALFMRRWSVKW
>CT0255 dprA, DprA/SMF protein, putative DNA processing factor
MFRGVAPPPERSSTPLPPTMTTTPEPGAALLLLIVSQLPGIGPSRARALL
NRFGATPALLEADYDALRQIPGIGETTARETAERLANPAWRDKAREKGEN
QLARAERLEASVITILDPAYPPLLKEIYDPPLLLFARGNPEALCVPSLAV
VGTRKATAYGKQATEFICREMVNNGYAILSGLAYGIDMMAHRAAVESNGV
TIAVLGCGIDRIYTDPAGRLWPRILERGAIVSEEWIGIKPEPGNFPRRNR
LIAGLAQGTLIVESDIKGGSMITASYALEQNREVFAIPGSIFSGTSRGTN
YLIQQNHAKPVFSAEDILAELNPAQHPALKTHPEASEPFDLNIEESCIVE
ALQSGAMHIDLIAEKTGLKIDALLVHLFELELKRIIEQEPGQIFRKRTP
>CT1075 dsbD, thiol:disulfide interchange protein DsbD
MLFMRSASAMAADFLDPEQAFVPSAELTANRSIAVHWKIAKSYKLYRDQI
KVGVSGGKASLGEPVFPEGILFTDPSTGEKQVIYHDELRLEVPVKQASAP
FTVKVEYQGCAEDGLCYPPISKSFKVDPSRPGALAAVESTPDTGASTGLQ
PAASDAAPAVAASSANAVDNGGKNDLSLAQSTLESGSLWKVFAAFLLFGL
LLSFTPCVLPMVPILSSIIVGEGETTKAKSFLMALAYCLGMALVYTSLGV
AAGLAGEGLAGALQKPWVLVMFSLLLIGLSLSMFDVYQLQAPASLQNSLS
KTSSKLKGGRFVGVFFMGAISALIVGPCVAAPLAGTLVYISQTKDVVIGG
LALFSMAMGMSVPLLLIGLSAGSLLPKAGAWMIGVKYVFGLMLIAVAIWM
VTPVLPPQALMVAWGALGILCAVFAGVFGHLPEKLTVGGKFKKALGLVLF
IIGVMELAGAASGGTNPLEPLAGLRGGSSVAAANNGKTAELAFKKIRTVE
DLDRELHASAGKKPVMLDFYADWCVSCKEFEKFTFSNAKVQQGLADVTLL
QVDVTANTADDKALMKRFNLFGPPGIIFFDKSGNEQVDNRIVGFVEAEEF
LKHLEKL
>CT0852 dsrA-1, sulfite reductase, dissimilatory-type, alpha subunit
MSANDSAVNESCHCGGCGSSGNGKFLNETPMLDQLESGPWPSFISGFKAL
AERTEKPMLRGVLDQLEYSYKTKMGYWKGGLVTVDGYGAGIITRYSMIKD
KFPEAAEFHTMRIQPAPGLHYNTTMLRELCDIWEKYGSGIITLHGQTGDI
MLQGIEQDKVQACFDELNQKGWDLGGAGAGMRTGVSCIGPGRCDNACYDN
LKLHLEALKHFSPQVHRPEWNYKLKFKFSGCPNDCTNAIMRSDLAVIGTW
RDSIQIDHDEVKAWIAEKGVDALVNNVINRCPTKAIRLQDGDIDISTRDC
VRCMHCINAMSKALSPGKDKGIALLIGGKNTLKVGVNMGSLIVPFMKMET
DEDREAFIELIEEIIDWWDDAGLDHERIGETIERVGLKQFLDGVGIEYDI
NQISRPRDNPYFKAKY
>CT2249 dsrA-2, sulfite reductase, dissimilatory-type, alpha subunit
MSANDSAVNESCHCGGCGSSGNGKFLNETPMLDQLESGPWPSFISGFKAL
AERTEKPMLRGVLDQLEYSYKTKMGYWKGGLVTVDGYGAGIITRYSMIKD
KFPEAAEFHTMRIQPAPGLHYNTTMLRELCDIWEKYGSGIITLHGQTGDI
MLQGIEQDKVQACFDELNQKGWDLGGAGAGMRTGVSCIGPGRCDNACYDN
LKLHLEALKHFSPQVHRPEWNYKLKFKFSGCPNDCTNAIMRSDLAVIGTW
RDSIQIDHDEVKAWIAEKGVDALVNNVINRCPTKAIRLQDGDIDISTRDC
VRCMHCINAMSKALSPGKDKGIALLIGGKNTLKVGVNMGSLIVPFMKMET
DEDREAFIELIEEIIDWWDDAGLDHERIGETIERVGLKQFLDGVGIEYDI
NQISRPRDNPYFKAKY
>CT0853 dsrB-1, sulfite reductase, dissimilatory-type, beta subunit
MSSQERTWKTIESGPHTYEEALHPVVRKNYGKWKYHEIPKPGVLKHVAES
GDTIWTVRAGTPRQDTVDMVRQLCDVADKYSDGFLRFTVRNNVEFLTPNA
ENVEPMIAELESLGFPVGGTGMCVSAVSHTQGWLHCDIPATDASGVVKSM
MDTVYNEFKDMQMPNKVRLSTSCCSINCGGQADIAVVVKHTRPPRINHDH
LVKTCELPKAVARCPVAAIRPTVVNGKKTLMVDEAKCICCGACFGACPAM
EINHPEHSKFAVWVGGKNSNARSKPSTMSLVAHNLPNNPPRWPEVTDVVG
RILTAYKAGGRPWERIGEWINRIGWKRFFEETGLEFDDNMIDSYRHARTT
FNQSAHIRF
>CT0851 dsrC-1, sulfite reductase, dissimilatory-type, gamma subunit
MAIEVNGMSVETDENGYLVNLDDWTEEVAVKIAEDEGIAMEAGHWDLVKF
LRNYYKEYQIAPAVKVLTKAVASEKGMDKKEASEFLYALFPKGPALQACK
IAGLPKPTGCV
>CT2250 dsrC-2, sulfite reductase, dissimilatory-type, gamma subunit
MAIEVNGMSVETDENGYLVNLDDWTEEVAVKIAEDEGIAMEAGHWDLVKF
LRNYYKEYQIAPAVKVLTKAVASEKGMDKKEASEFLYALFPKGPALQACK
IAGLPKPTGCV
>CT0855 dsrE, dsrE protein
MNIGILLKEGPYNHQAADTAYKFAEAAIAKGHKVDAIFLYNDGVINATKL
GDPPQDDRNIAARWTELNQKHGVEVLACIAASKRRGINDDVMIDGAEITG
LGTLTDIAIRNDRLLTFGD
>CT0856 dsrF, dsrF protein
MSEEQDIKKIMHVMRRAPHGSIYTYEGLEMILIMAAYEQDLSVAFIGDGV
YALKKDQDTAGIGIKGFSKTFMALDGYDVEKLYVDKQSLEERGLTEDDLL
VDVEVMDSSKIGRLMNEQDVVIHH
>CT0857 dsrH, dsrH protein
MLHTINKSPFENNTFTTCVRFLQPGDPVLFIEDGVYAVQAGNRFGALIQS
VLKKNNPVYALKPDLDARGISAIAEGVKTVDYAGFVDLVEEHQVNSWL
>CT2243 dsrK, dsrK protein
MSNKYALKPDELKKEFEQKKPRLLKGEFAGKDWWDLPVEFRDGNWCFPAK
PEVLDELHFANPRKWAATDKDWQLPAGWEKTIRDGMKDRLKRFRSFKVFM
DSCVRCGACADKCHFFLGTGDPKNMPVLRAELVRSVYRNDFTGLAKILKD
FSGSRTLTQDVIKEWHMYFHQCTECRRCSVFCPMGIDTAEITMMVRELLN
LIGVNNNWILAPVANCNRTGNHLGIEPHTFKQNIMSMVDDIEDLTGVRVN
PTFNRKGAEILFITPSGDVFGDPGVYTMMGYLLLFHHIGLDYTISTYASE
GGNFGMFTSNEMMKKINAKMYHEAKRLGVKWILGGECGHMWRLVHQYMNT
MNGPADFLEEPVSPITGTKFTNAKATKMVHIVEFTADLIKHGKLKLDPKR
NDHLRTTFHDSCNVARGMGMFEEPRYVLNKVCNVFHEMPENTIREQTFCC
GSGSGLNAEEFMDTRMRGGFPRASAVAHVREKHKVDSLVTICAIDRASLP
ALMRYWNPGVTVYGLHELVGNALIMDGEKKRTEDLRENPMAGFEEDDDDE
>CT2244 dsrM, dsrM protein
MKKVLKPLIAVIVLALIPWAGITYGKLDYLFAIVIPYASAAILVLGMLFR
LVDWIRRPVPFNIPTTCGQEQSLDWIKTNPLECPSNPFMAAMRVLSEVFL
FRSLFRNTRAELYGGPKLVYGSYKWLWLGGLAFHWSMLIIVIRHARFFLE
TLPLPIELLENADRFLDVTVPAFYITDAIALAAITFLVLRRLSDEKMRIL
SLSTDYFPLFLFGAIVVIGISMRYVTKIDIMPVKALAMTLAHFGFDAPEP
IGVLFYIHLFLVCVLLAYIPFSKLVHMGGIFLSPTRNLPNNSRAKRHKNP
WNPDIKFRTYAEYEDEFREKMKKAGLPVEKQ
>CT2251 dsrN, dsrN protein
MISASWKSAGKTTVSLGLLRLLAENGVPVVSFKKGPDYIDPMWHRVASSG
ECYNLDTWMMGEAACRDTFIRNCARRPGSIALIEGNHGLHDGMDMAGADS
SAGLAALLDAPVLLVIDSRRMNRGVAAQVLGLQAMPPKVRIAGVILNHVT
SSRQESKQRAAIETFCNVPVLGAIPADSSLLLPERHLGLVTVGEASDAEA
FIRVAAQQVERHCDFAAIRKLFEEASPLSHSEPREVFSPKKEATVKIGVF
RDAAFCFYYPDNLDALRDAGAELVFIDTFKTTALPEIDGLYLGGGFPESF
FAELSANTSLLRDMRGRIEAGIPAWAECGGLIYLCRSATWEGRQWPLASV
LPIDIAYQRKPAGCGYLELESRAGSGWFPVGERVRAHEFHYSKPVAGNVD
LACQFDVARGFGLTGREDGLLYRNLFASYAHFHAAANPGWAERFVGLALK
FKEQGRLERD
>CT1418 dut, deoxyuridine 5'-triphosphate nucleotidohydrolase
MIKVKIVRLNQKAILPVYATAHAAGMDVSACLDAPVTVPSSASALIPTGF
AIELPEGYEAQLRPRSGLALRHCISLPNSPATIDADYRGEVGVILINHGR
EPFTVSHGDRIAQMVVAKVDHVVFEEVESLSETARGEGGFGHTGVQAKAE
CL
>CT0125 dxr, 1-deoxy-D-xylulose 5-phosphate reductoisomerase
MKSLSILGSTGSIGLSTLDVVRRHPERFSIAALAEGHDVEMLLKQIDEFR
PSLVSVRDEASRERLKGMLGDHKPEILCGLEGAAEVAAVDGADMVVSAIV
GAAGLVPTVRAIEAGKDIALANKETLVVAGQLVSDLVKKHDVKLLPVDSE
HSAIFQSLVGHRTEDIERIILTASGGPFRKTPAEELKNVGPEQALKHPQW
SMGAKITIDSATLMNKGLEVIEAHWLFDMPAEKIGVVVHPQSIIHSMVEY
IDGCVIAQLGVPDMRAPIAYALAWPERCETGIGKLDLTKVATLTFEEPDM
ERFPALRLAFDALKAGQTYPAVLNAANEIAVAAFLDKKIGFTDIAGTVDK
TMQAHEAWTPITLEEYLQADKWARETARQLIG
>CT0337 dxs, 1-deoxyxylulose-5-phosphate synthase
MADLVSAPAMISQAYPLLSSIHSPADLKKLSLHELELVAAECRKKVIELV
SQNGGHFGSSLGVVELTVALHYVYQSPTDRIIWDVGHQAYVHKILTGRLA
QMETNRRYHGLAGFPKRSESPHDAFGTGHASTSISAAAGLAAARDLAGRK
EKVVAIIGDGSLTGGMAFEAMNHLGDTKSDVLVILNDNQMAISPSTGGLK
NYLVNLTLNKTYNRLRKFVWDSLSLLHNEIGETAKTAVHRIEDGIKAAFT
PGAYFEALGFRYFGPIDGHNMEQLIKALREMRQLHHPKLLHVITTKGKGF
KPAEENQPKWHASVGGFDIETGKNVKAPGKPAKPKYQEVFGEALVELALK
DPTITAITAAMPSGTSLDLFQQAIPSRCFDVGIAEQHAVTFAAGLACGGF
KPVFAVYSTFLQRAYDQLIHDVALQNLHVVFAIDRAGLVGEDGPTHHGAF
DLSYLNVVPNLTIMAPGDEQELRNMLYTALYDIKGPVAIRYPRGSGSGAT
LHKEFTPVPVGRGRILRDGKSVALLGIGTMSNRALETAALLEAAGLDPLV
CDMRFLKPLDTEIIDMAASRCTHIVTIEENSIIGGFGSNVVNYLHHAHPG
IKCISFGLPDAFVTHGSMDELYREVGLDAESLSGKILEFYKDKP
>CT0159 efp, translation elongation factor P
MVSISNVSRGAIIRWNGAPHSIESLVHRTPGNLRAFYQASMKNLKTGRNV
EYRFSATEQVDVIVTERKKYQYLYRDGEDYVMMDTETFDQINVPEVAIGP
ASRFLKDSVMVDIVFADDGSILEVELPTFVELEVTETSPASKDDRATSGT
KPAIVETGAEVNVPMFIQTGSIIRIDTRSGEYMERVKK
>CT0105 emrA, multidrug resistance protein A
MAETQQSNIEAPDNEKGKPKPERSMGRLVIFGILLAIGLVWGGMKLIRSL
SYEETDDAQIAGNIYPVIPRVPGKVVEVLANDNQMVKKGDVLIRLDPSDY
QIKRDMAEAQLLKARAAVSGAKADIIAAAAAQIKLAADLRRSQNLQKQDV
ISRAELDAATAGATAASAQHAAAGDNYKAALAQAKLAEAELKNAELQLSW
TTITAPADGKVSKKNVQPGQYVTPGQQLIAIVGSGDLWVVANFKETQLEH
MRPGQKVIIKVDAFPGKELKGHIDSISAGTGAEFALLPPDNASGNFVKVT
QRVPVKIVFDEKTDLPLAAGMNVIAEVKVK
>CT1962 eno-1, enolase
MKIQNVNAIEILDSRGNPTVEVNLKLEDGTISRAMVPSGASTGEREATEL
RDGDKKRYGGKGVLKAVENVNSAIAKAIENKHFTNQRELDYFLIELDETN
NKSKLGANAILGVSMAFARAKAQSSRTPLYQYLGGSNAHIMPVPCMNVIN
GGKHADNTIDFQEFMIAPHNAPSFRESIRMGEEVFHALKAVLKLKGLSTG
VGDEGGFAPDLKSNEQAVEMILEGITKAGYKPSVDVSICLDPASSEMWEN
GKYKFFKSTQKLVSSDEMVKLWESWVNQYPIVLLEDGMAENDWEGWKNLT
DVIGNKIEIVGDDLFCTNKSILLNGINKGVANSILIKLNQIGTVTETLET
IELAYKNSYNCFVSHRSGETVDSFIADLTVGINAGHLKSGSGCRGERIEK
FNQLMRIENELGKSAQFAGLKAFKNAK
>CT0145 eno-2, enolase
MSVITRIHARQIMDSRGNPTVEVDVHTESSFGRAAVPSGASTGVHEAVEL
RDKDKSVFLGKGVLKAVENVNTLINDALLGMDVTEQEAIDAKLIELDGTP
NKSKLGANAILGVSLACAKAGAEYSALPLYRYIGGTTAKTLPVPMMNVLN
GGAHADNTVDFQEFMIMPIGFERYSDALRCGAEVFHSLKSLLHDRGLSTA
VGDEGGFAPNVESNEQAIELVIEAIGMAGYKAGAPTDRGGLGDGHVMIAL
DPASSEFYDAEKKKYVFKKSSGRELSSEEMASYWADWASRYPIISIEDGM
AEDDWEGWKMLTDKIGGRVQLVGDDLFVTNSKRLAEGIEKGVGNSILIKV
NQIGTLTETLQAIELAKRNGYTSVISHRSGETEDTTIAQIAVATNAGQIK
TGSMSRSDRMAKYNELLRIEEELGSTALYPGIGAFRV
>CT2141 etfA, electron transfer flavoprotein, alpha subunit
MKLLLVGEVRDGRVTAETAELFGFAACFSAEVSMVLAGSTDDLPSFEGKL
YRADGVNAFDLACHKRLVLAAVEREQPDAVVFLHSSHGWELAPRVAFAMQ
SAQVSGVVGLDDGGYVVESCNGKMRRTVKPLTDRVVLTLQLGAFDAPAMA
GIPEVTALDVEPDSTIEFLDCVQPERGIDLTRAGVIVSAGRGVGSAERVE
LVRALADALGGEVGASRPVVDAGWLERARQVGSSGQSVSPALYVACGISG
AIQHLAGMKGSGFVLAINTDRDTPITSVADVLAVADVAEFLSALTAAIRA
RR
>CT2142 etfB, electron transfer flavoprotein, beta subunit
MNILVCVKQVPDMEGHFISNSSGSWFDEAGLAWRMNEYDTFAVEEAIRLK
EQLGGEARVTVLSVGPARVVETIRKALSTGCDDGVHIVDPEAPERDPWQI
ASMIAGFAVGRGFDLIFTGMQSEDRGSAQVGVLVAERLGIASVTGITAFE
WQDGAMTVERELEGGRRCRLRLKAPALMTCQLGLNSPRYPTLPNIMKSKR
TMLTVLSPESVGLESPKVLSCNFRPHEPNGAGVILEGDAGAMAARVLEIL
EAKGLVSGKGGAR
>CT2115 fabD, malonyl CoA-acyl carrier protein transacylase
MKAFVFPGQGSQYCGMGRDLYERFPEAQELMDKADSILGFSITNIMFNGS
EDELKQTRYTQLAIFLHSFAAATLLGREGVKMAAGHSLGEYTALCFSGAI
SFEDAVRLVAKRGELMQNAGQQNPGTMAAIIGMADDALDALLEEASASGI
VQAANFNSPGQIVISGDVDAVRKAVELASSKGARMAKELVVSGAFHSPLM
KPAEKEFAETLDTIAIRDAEIPVCMNVVAKPVTAATEIRANLISQLTSSV
LWSQSVQAMVDAGITEFVEVGPQKVLQGLIKRISKSTMCSGVDTADQVDA
MRTPA
>CT2118 fabF, 3-oxoacyl-(acyl-carrier-protein) synthase II
MKRVVVTGIGVLSPIGLSAGAFWNALMEGKSGAAPITYFDTTNFATTFAC
ELKNFKADEYIDRKSADRMDPYCQYGVISAEQALKDSGLDLTAIDPTRIG
VVHGSGIGGMTVYDQQFRQYLERGPRRVSPFFIPMLIPDIAAGQISIRNG
LMGPNYATASACATSLHAIMDAVMLLQMGMADYMVCGGSEAPITQMSVAG
FNSAKALSTRNDAPTKASRPYDVDRDGFVMGEGAGSLVIETYESAVKRGA
KIYAEIVGMGASADAYHLTAPHPEGLGACSAMTTALNMAGITPDKIDYIN
THGTATPLGDLAEIKAIKKVFGEHASKLSISSTKSMTGHLLGAAGVVESI
ACILALQNQTVPPTINLDNVDPEIDVDVTPNVPKQRSIEYALNNGFGFGG
HNGCLIFRKAPAA
>CT2116 fabG, 3-oxoacyl-(acyl-carrier-protein) reductase
MFTGKTAVVTGAARGIGQSIALDLAAKGADLVIGDIKAEWLTETEEALKQ
LGAKVSCKELDVTSTDACQKVFDEVAKENGRIDILVNNAGITRDGLLMRM
SEEDWDAVLTVNLKGVFNCTKAVTRTMMKQRSGSIINIASIIGLMGNAGQ
ANYAASKGGVIAFTKSIARELASRNVRANAIAPGFISSKMTDALSEEVRQ
KMLEAIPLGVFGTPQHVADAVAFLASDQSAYITGQVLSVNGGMYM
>CT2114 fabH, 3-oxoacyl-(acyl-carrier-protein) synthase III
MKAAITTTAKYLPEEIMSNQDLERILDTNDEWITTRTGIKERRILRDPKK
ATSYMCTEVARQLLEKRGISADEIDLIIVATMSPDMLFPSTACLVQGNIQ
AKNAWGFDLSAACSGFVYGLYTGAQFIESGNCKKVMVIGADKMSSILDYE
DRTTAILFGDGAGGVILEAANEEGYGVLDARLYSDGLNGREHLLMPGGGS
LHPASHETVDQHMHFIQQDGKQVFKAAVIAMADVAEEIMQRNNLTAETID
WLVPHQANQRIIHATAERMGITEEKVMMNIARYGNTTAGTVPICLAELDE
QGKLHKGSNLVLVSFGAGYTWGGVYVRWQ
>CT0350 fabI, enoyl-(acyl-carrier-protein) reductase
MPEKAHYGLLKGKKGIVFGPLDESSIGWQIALHAYREGAQVALSNVATAI
RFGKLQELSELCGNAPILICDASKNEEVDNTFRELKETMGSVDFIVHSIG
MSQNIRKQVPYEELNYEWFMRTLDVSGISFHRLVAYALKNEALNDGASIV
ALSYIASQRNYWTYSDMGDAKSLLESIARSFGPRLAPRGIRINTISQSPT
YTKAGSGIPGFEKMYDYSELMSPLGNASAEECAEYTMTILSDLTRKVTMQ
NLFHDGGYSSMGATIPMIKLAHEALHDKELAERVGLEGRHSSR
>CT0046 fadD, long-chain-fatty-acid--CoA ligase
MTNAPSNAPWLSHYDEGIPSSLAPYPRVTLPDILREAARKHPEDPALLFL
GNTISYGELERESNAFAAALHASEVRKGNRVAVLLPNSPQMIIAEFGIWK
AGGIAVMLNPLWTEHELERAIDECEAEIAVVLAPFYEKINHLRSRTSLKT
VVITDLHDYFPAAMRNASPANGAVATMLQSSDLRMPAMIESYSGSQTPAV
EVSPKDPALFIFSGGTTGKPKCAIGRHEASVMNGMQVDAWFRPVLGSDRV
PVMLNLPLHHVYPQVAIIGYGFVTRSPLVLIPDPRDFELLIKTIKQYKVG
LLPGIPTLFNALAAHPLLKEAPGSLDSLKLIISAAAPLHNKTRRRFKELT
GATIIDAYGLTEAMVSPVCQPLNGIRKNGSVGLPVPDVEMRIVDADTGIE
VLPSMEIGEIVIRSPQLMTGYWKNPEETAEVLRDGWLYTGDLGYIDDDGY
LYIVDRKKDVIKPSGFQVWPSEVEEVIAMHPAVLETGVAGVPDDYQSEAV
KAWVVLHKGHSLDAEQLKNWCRQTLAPYKVPKHIEFCEQLPKSALGKVLR
QALVEQHLTS
>CT1053 fbaA, fructose-bisphosphate aldolase, class II
MKKITGYKELGLVNSRELFAKAISGGYAIPAYNFNNLEQLQAIIMACVET
ASPVILQVSKGARSYANQTLLRHLAAGAVEYAAELGREIPIVLHLDHGDS
FELCKDCIETGFSSVMIDGSHLSYEDNVSLTRKVVEFAHQHDVTVEGELG
VLAGIEDEVHATKHTYTEPDQVEDFVGKTGVDSLAIAIGTSHGAFKFKPG
EDHKIRLDILAEIEKRIPGFPIVLHGASSVPQDLVQMINAHGGKLKDAVG
IGEDQLREAARSAVCKINIDSDGRLAMTAAVRKVLDEKPEEFDPRKYLGP
ARDALKQLYMHKIINVLGSNGKA
>CT0358 fbp, fructose-1,6-bisphosphatase
MNKLTTIESHFLQLQKRYPEINSEVTDLLNDVAFAAKLVRREVVRAGLAD
ILGLAGSTNVQGEEVKKLDLFANERLINAIGQHGRFAIMGSEENEEIIKP
PKFESGEYVLLFDPLDGSSNIDVNVSVGTIFSIYRLKSGEPSQASLEDCL
QKGADQIAAGYVIYGSSVMMVYTTGHGVHGFTYDQTVGEFLLSHENITTP
EHGKYYSVNEGSWQEFNDGTKRFLDYLKEEDKATGRPYSTRYIGSFVADF
HRNLLTGGVFVYPATKKHKNGKLRLMYEANPMAFICEQAGGRATDGYRRI
LDIEPKELHQRTPLYIGSKNDVLIAEEFEQGKR
>CT2080 fccA, sulfide dehydrogenase, cytochrome subunit
MLAAAPLLLASGNGFATTGPAAKPAVKPVTESRGEILSLSCAGCHGTDGN
SSSVIPSIYGKSPEYIETALIDFKNGSRTSTVMGRHAKGYTGEEIHLIAE
YFGNLSKKNH
>CT1015 fccB-1, sulfide dehydrogenase, flavoprotein subunit
MGNTISRRTFNRLLISGLAGSSLLMSGGPLMASAPKAHVVVIGGGFGGAT
VARYLRQLDPSISVTLVEPKKVFHTCPMSNWVIGGLFSMQNTAHTYHALR
SRYGVEVVQEMATGIDPVKKTVKLKGGRMLSYDRLVVSPGVDFIWDAIEG
YSRDVAESSMPYAWEAGPQTLLLRRQLLGMKDGENVIICAPKNPFRCPAA
PYERASLIAYYLKKSKPKSKVIILDDKEVFTKQDLFMLGWDRLYRGKIEW
RSASAGGKVERLDPAKMTVATEFGDEKGGVINVIPPQKAGRIAVETGLAD
TSGWCPVNPANFESLQHPGIHVIGDAALVGTMPKSGTAANTQAKALAAWL
VASFGGGNAGEHDLASLCYSLLAPGYAISVAGGYIQSPEGIKDNPDTVHL
TSMEATTAQLAGEAEQALQWYHNISQDTWG
>CT2081 fccB-2, sulfide dehydrogenase, flavoprotein subunit
MSISRRDFNKLLLAGAAGSAFGLFGSGNTAFAARKRVVVIGGGFGGAATA
KYLKKLDPTLAVTLIEPKPAFVTCPFSNWVLGGLRTMKDITHTYTALRTR
HGVNVIADRVVSVDAAKGTLRLAGGRVIGYDRLVVSPGIDFKYDTIPGYS
QKIAKSKMPHAWQAGPQTILLHRQLQAMKNGGTVVICPPDNPFRCPPGPY
ERASLIAYYLKQHKPKSKIVILDAKEKFSKQGLFTKGWESRYPGMIELRG
STGGGKVLGVDAKAMTVETDLGAVKGDVINVIPAQKAGKIAFEAGLTNEK
GWCPVNPSSFESTIHQGIHVIGDACIAGAMPKSGFAASSQGKVAAVAIIN
LLRGQEPAPPSLVNTCYSLIGPKYGVSVAGVYQLSPTGIVEIPGSGGRLR
PMPATNSWNRRRFSPKAGTPISARISGDKPLDSCGVPAVNHQAPLPLTHK
ATPDCRKR
>CT1761 fecD, iron(III) ABC transporter, permease protein
MKGVPLILVCLFALGGASLLSLGVGRYPVSPLAILSWLLTGRSADSNLPV
VLLNIRLPRLIAAIAAGGALSLSGAAYQGLFRNPMVSPDILGVSSGSGFG
AALGILFSLPVAAVQALSLAGGITAVFAAVLVSRAIGRNGDSVLVLVLSG
IVISSLFGALLSLLKYIADPLDKLPAITYWLMGSFADIRTGELGAAVAMV
FAGAIPLLLVSWRLNVLSFGEEEARSLGLHTERMRVAVILSATLVTASMI
SICGIIGWVGLVVPHISRFLGGANHRRLMPVSFLVGAAFMVAVDTVARSA
ASVEIPVGIITAVLGAPFFIWIMKRSSLRAW
>CT1760 fecE, iron(III) ABC transporter, ATP-binding protein
MVKLKEQSIALRNAGLGYNGRAILNNVDLEIEPGEIICLLGQNGAGKTTL
FRTMLGFIAPVNGSVTLAGREVSRLSPTEIARIVSYVPQSYAMPFAYPVS
DVVLFGRSAHLGLFASPGARDRRIAAECLELLEIGHLASRPFNELSGGER
QMVVIARALAQEARFMILDEPASNLDYGNQVRLLRKVRALAGRGIGILMA
THHPDHAFMAASRVAVLSGGKLSHDGPPEATLTPDTLRSIYGVEVQVFDT
PQNGHASRRVCAPVVE
>CT1743 feoA-1, ferrous iron transport protein A
MKLSELKKGQSARIVAMPSSGRLRKRLNEMGMLVGEIVRIEGVAPLGDPV
EMTVRGYRLSLRRSDIENITVEEVR
>CT0057 feoA-2, ferrous iron transport protein A
MKLSELKAGDRAEVTSVAAEPAVRRRLMDLGLVRGAKLKVLRFAPLGDPI
EVNCNGMLLTMRRNEAEGITVHILAGDEGHPHGWPGFRRRHRFGKRA
>CT1742 feoB-1, ferrous iron transport protein B
METAAQKQRVSAKVPTSPLKRIVAVVGNPNCGKTTIFNALTGLNQRVGNW
PGVTVDKKSGRFRHDGQEYELVDLPGIYSLSSLSQDEEVARSYILSGEAG
LVVNIVDASNPDRNLYLTSRLLEMGVPVVVALNMVDAAEEQGISVDPARL
SSLLGCPVIPMVASRGEGIEELKAAIAKGFTTGGASVPSAKVRFPAELDA
AIRRVAESTGTQARELGYDPVWFAMKLLEHDQELEKRLDGAALTSLFEER
KRVADALGDDPDIVIADAHYRFVSELSAAAITRKESTRKTLSDKIDSLVL
NRFLGIPLFLGVMYLMFLFTIKLGGAFIDFFDIAAGALFVDGFGRLLGAV
GSPGWLTALLASGVGRALQTMATFIPTIGFMFIFLSMLEDSGYMARAAYV
MDRGMRAIGLPGKAFIPLLVGFGCNVPAIMSARTMSDERDRIMTVMMTPF
MSCGARLPVYALFAAAFFPTGGQNLIFLLYLMGIAVAIMTGFILKKTLLK
GEPSPFIMEMPPYHLPTLKGVILRVGDRLGSFLLKAGKVLVPVIVVLGFM
SSIGTDGSFGNEENENSVLVATGKAIVPVLQPFGIQEKNWPAAVGLFAGV
FAKEAVVGTLNNFYAAKEAKAAVDGGGERFSLLAKLGEAVATIPENLAGI
AGSLFDPLGINVGDLSDRGAVAKEQGVSASIYGSMVNRFDGRVGAFAYLV
FILLYFPCVASIGVIYRETNLAWTLFSVVWTTGLAYVFAVLSYQIGTWGA
HPGSSAAWVAAMLLVFAVAVTLMYLAGNGAFGRKGKILEEA
>CT0056 feoB-2, ferrous iron transport protein B
MSQQKTITVALAGNPNAGKSSLFNALTGAHQRVGNFPGVTIEKHEGYLDY
KGYRITVVDLPGTYSLTPYSPEEIVTRRFIIDEKPDVVVNVVEGTNLERN
LMLTVQLMEMEVDLLVALNMMDEVEEKGISIDLDQLEQLFGSHIVPISAR
NRKGLDELLDHIVSVSEGRIEIKKNKITFSAEVEKAIDKIALLLAHEREL
DAAANHRWLAIKLLENDREVYTQVQKFPVWVKIELALQEAITECGILHNT
DPEALITEDRHAFVRGAMQECVHLPKATRASVSDYIDMVVLNRVLGLPVF
FLVVWAIFQLTFTLGKPLVEALAYAFDLLSDTVAPHLPAGMVRSIFVDGV
ISGVGSVVEFLPNIVLLFMGLSFLEASGYMARAAFVIDKVMHRFGLHGKS
FIPMITGFGCSIPAIMATRTLKSRSDRLATIMTIPFMSCSAKLPVYVLLA
GAFFPPAMAANVMFGIYLLGIMIGLWTAWLLKSTVLKSDSEPFVMELPPY
RWPTLTSVVFQARMKAVMYLKKAGTLILGAVIVIWAASNFPRSSALDAEL
AQEKAKIEATAVAPELKAEQLQKLKARIDAGQLEYSLAGRSGKLLEPLIR
PLGFDWRIGISLVTGLAAKEVVVSTLGTIFSIGHAAGQTSLSQILRSDPS
FSKATALSLMVFVLLYIPCVASVGVMKKEVGAWKPVLLYSVYVLAVAWIA
SFITYHLALLWL
>CT1167 ffh, signal recognition particle protein
MFDNLSDKLELTLKKLAGQATINEVNIGIAMRDIKRALLSADVNYKVAKK
LVEDIREKSLGESVIKSVSPAQMIVKIVNDELTEIMGGQNQPLNLPPKKI
PAIVMVAGLQGSGKTTFCAKLAKRLKKNGKNPILVAADVYRPAAVEQLKT
LGEQIDVPVFSIEEQDAMKAALGGLEAAKAGAKDVVIVDTAGRLQIDEAM
MAEAEALKNRLSPDELLFVVDSMMGQEAVNTAKAFNDRLDFDGVVLTKLD
GDARGGAALSIRQVVEKPIKFMSVGEKVDDLDVFYPDRMAQRILGMGDIV
SFVEKAQEALDLEKTMAMQSKLMKNEFDLDDFFDQLQQLKKMGSIQGLIE
MVPGLNKMVPKQDLENINFKPIEAIISSMTKQERKHPEIINGSRRKRIAL
GSGTRVQEVNMLLKQFAEMKKMMRSVNRLAKSGRKITTENLALDKFLKR
>CT1738 fld, flavodoxin
MKKTAVVWASQTGNTKEAAELIATEIGRPDVDLFEVSREELLRLTEYDML
IIGTSTWGAGELPHGWREAVSALDQLDLTGKTVAFFGLGDQLVYGDWYVD
AMGILHDRFVSRGAKVVGAWPSDTYEFSSSKALRDGMFVGLALDADNQEH
LTRERIRRWVCSIRPYLS
>CT1499 fmoA, bacteriochlorophyll A protein
MALFGSNDVTTAHSDYEIVLEGGSSSWGKVKARAKVNAPPASPLLPADCD
VKLNVKPLDPAKGFVRISAVFESIVDSTKNKLTIEADIANETKERRISVG
EGMVSVGDFSHTFSFEGSVVNLFYYRSDAVRRNVPNPIYMQGRQFHDILM
KVPLDNNDLIDTWEGTVKAIGSTGAFNDWIRDFWFIGPAFTALNEGGQRI
SRIEVNGLNTESGPKGPVGVSRWRFSHGGSGMVDSISRWAELFPSDKLNR
PAQVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGM
VGKFNNFNVDAQLKVVLPKGYKIRYAAPQYRSQNLEEYRWSGGAYARWVE
HVCKGGVGQFEILYAQ
>CT1453 fmt, methionyl-tRNA formyltransferase
MGLRVVFMGTPEFAVPSLRRIAAMKPQFETVLVVTGCDKPRRSKNSPPEP
TPVKQAALELGLPVLEADDVSSHEFALQVAAARPDVIVVAAFRVLPPEVL
ELPPLGTFNLHGSLLPAYRGAAPVNWAIINGDAETGVTTFFLQKSVDTGN
IITMDRTPIGPDENAFELLKRLSEIGAGTVERTLTMIADGAVMPEKQDER
FATKAPKLNRENTRIDWNQPVQRLHDFIRGLALKPAAWTTFGGKSLKIYK
AKACAIETAPDEPGTLRIADGRLLVAGTDGWIELLSVQAEGKKAMDGELF
ARGLRARKEMLRFL
>CT1937 folB, dihydroneopterin aldolase
MKLHHRSCIRLVNAVFYARHGVHEEERRLGGRYEVDAELFFDSTEAAAAD
DLLKTIDYGQAYTIISEVMIAGEPAALIETLAARAAARLIEELALAEKVT
VKVRKRALPLGGLCDYAEAEHVIERH
>CT0263 folC, folylpolyglutamate synthase
MHVAGTNGKGTVASCVASIFSTSGRKTGLFTSPHLVDFTERIRIDGQQIG
QARVAEYCTKLQPAVETGATFFETTTAMAFAFFADEGVDAAVIETGMGGR
LDATNVVQPEIVIIPSIGMDHTEWLGGSLREIAAEKAAIIKRCSRVFTAV
PEAGEAFAPIREAAEAVGAELHQVEREAECLVEEVCPGALALRISLDGGE
SRQFRAALTGSFHAPNVCLAVMAARSEGISWEHIDDGLARLGASGYRARL
ERIADKPVVMLDVSHNPEGMQKTAQSILELRNCFRFLYVIIGVAADKDAA
GIVHHIAPIADEIVAVDLPVERSLQAEVLERLCVEAGAQYVSSRHSAAEA
LEFLDQRVEPEDMILVTGSFYLSGEVAAMERFRSPGASAGAI
>CT0781 folE, GTP cyclohydrolase I
MKQEKTVSPTVENNRSAESRLSQCDLDECFDESHDRDEEVLGSMTDAVYS
LLKGVGEDPEREGLLLTPERVAKSLRFLTKGYRQDPEQLLKKAVFTESYD
EMVLVKDIDIYSMCEHHMLPFFGKAHVAYIPDGKIVGLSKIPRVVEVFAR
RLQVQERLTQQIRDAIQNVLNPRGVAVVIEATHMCMVMRGVEKQNAVTTT
SAMSGDFMTSQSTRSEFLRLIGNH
>CT1938 folK, 2-amino-4-hydroxy-6-hydroxymethyldihydropteridin e-pyrophosphokinase
MAPVTAYIGIGSNVGDRLGYLQQAVDHLAQLPGMRVSGASRVYMTEPFGD
PNQERYFNAVIAVQTSLDPTDLRTRCKAIEHDLGRPDRYQRWSPRTIDLD
ILLYSDLCIESDLLVIPHAEMHHRKFVLIPLLDLANPVHPRLRHSIAELL
KSCEDHSVPVRLVQELKIRPT
>CT1706 folP, dihydropteroate pyrophosphorylase
MMKQNLGDQSRYRLNCAGAMLDLSARPAIMGIVNLTPDSFFDGGSYGSAG
EAVQLERALESAMAMARAGAEIIDVGGESTRPGSAPVSAEEEIRRTIPFI
ELLRRQSDVLISIDTWKSEVAAKALRAGVNIVNDISGFSFDPKMPGVCAR
HHCGVVLMHTPAKPDALRWSYNTSAETEEVMNRVTTFLSRSIAIAREHGI
ESIIVDPGLGFGKTVEENYRLLARLDELHKLGCPVLAGISRKSFLGQAIR
RTGEETPPPSERLLATISANTIALMNGADILRVHDVDAAIQARAVVLATR
RASD
>CT1577 frr, ribosome recycling factor
MTVRDVIQKIEPRMKKTIEAFQHELASIRTGKATTALLDRVKVEAYGSQM
PLKQVGNVGVLDAHTLTVQVWDKSMVGATERAIRDAGLGLNPSADGQNIR
ISIPPLTEERRKEFVKLTKKFGEDSKVSLRNHRRDLIHEVEKLEKEKLIS
EDELKRGKKDADDLLHKYEKQITELIAQKEKEIMEV
>CT1740 ftn, ferritin
MLSKTILDKLNHQVNFEAASAHLYLQMSAWLLTQSLDSTAAFFRAHAEEE
KAHMMKLFDYINETGSLALIGEVATPAPEWKSHIELLEAAYNHELAITQS
INDLVDTALREKDYSTFQFLQWYVAEQHEEEYLFSSMLHKARIINTMDGR
ALFRFDEEVRKSVLHHEHHQQKPMFLQVGPAPKHHDGHDGLHAHQHSSHW
SGH
>CT0031 ftsA, cell division protein FtsA
MNSKLLMPKSNIVVGLDIGTTKVCVVVAEKDDVGKLNVLGKGRANSEGLQ
RATVVNINKTVDAIRKAVADAERESSIKIKGVNVGISGAHVHCIYSNSEI
SVNQSGIVNESDVRRFLEKAKTNIRYLDIDHEIIHVIPQEFIVDDQDGVL
DPIGMAGTIMRGSAYIVVGLRTKIRNIKQCIEKAGLEVSAMTFEPVASGL
AVMKESERRSGVVVIDIGGGTTEVAIYIDGAIRYSEVIKVAANDVTHDVA
YGIKALNDVAEEIKIQHGCAYAKVLDKEEEILIESIEGRPSKSFPKSSLT
VIIEARMMEIFELVRDIIKRSGYYDYLNAGVIITGGGALLPGTGELARDI
LGLDVRTGYPEGVSGGIKDAINNPMYATVMGLVAHSLQNNLYQDYGEVAP
APSGQTQEPVTSSPEPSQAQQSPEQNPDTPPTGKKFVDRLKKFWDQL
>CT0590 ftsE, cell division ATP-binding protein FtsE
MIAFVNADLDIRKKPIFRNLNFTIQPGELVYIVGRSGSGKTTLLKSLYME
IKPVKGEVIVNGFSSRKIRKGKIAVLRRKLGIVFQDFRLLEDRNVYDNLA
FVLKVTGTKHSLIKEKVMTALKEVGLEQAAKEMPQNLSGGEQQRVVIARA
LVGDPVAIIADEPTGNLDPETSIEILEYLKKINAKGISVIIGTHDYDLVR
HHPSRTLMIKDMNLVECTIEPAPSGTCWHPVPKQAAVAS
>CT0127 ftsH-1, cell division protein FtsH
MANNPFKLNNPYNNEPDNGPRKPRFSIFYYIAVILLIIGFQLAFFWSGST
REIAYSDFRKLIDQNRVESVKLAPEKIYVQLKEDSLSTATNKPFGQNPPA
FQMPGKNSSKNEVTVNPVRDEQLIPLLESKGIHYEAIPGNGWINELLQWL
LPFGLLIGIYFFMFRRMGGPGSQFMNIGKNKAALYENLDEHTRITFKDVA
GLDEAKAEVMEVVDFLKDPKKYTKLGGKLPKGVLLVGPPGTGKTLLAKAV
AGEANVPFFSISGSDFVEMFVGVGAARVRDLFKSAKEKAPCIIFIDEIDA
VGRSRGKGFMMGANDERENTLNQLLVEMDGFATDKGVILMAATNRADVLD
SALLRPGRFDRQIVVDRPDLKGRTDIFAVHTKNLSLSPDVNLKALASQTP
GFAGAEIANAANEAALLASRRGKQSIEMKDFEDAIERVIAGLEKKNKVIN
PREKEIVAYHESGHAIVSWLMPENDPVQKISIVPRGVSALGYTLNIPLED
RYLMTRSELIARICGLLGGRVAEEIIFGEISTGAQNDLERVTEIAYNMVI
VYGMSEKVGYLSFLESNNPYYGGPGIDKKYGDETARLIDNEVKEIVEAAR
KQVHQMLSDNRDKLEMLAKELLSKEIVQYCRIEEILGKRPAGKFSEHLAH
DCQNGVDMAMSQLHTEPEEAASPAVSAQQETATDPERKELEEAVERLRQS
RNLSSN
>CT0297 ftsH-2, cell division protein FtsH
MQPNKSNDKAPRNRPDNRFMPPDRDTDQDDRFPGKEGNGDRFPRFILFLM
LAVLGLFVFQRFFSQDISPEISYNEYRSIVSGGSLSEVTVKTSQDNSALL
TGKLKAPDSLRLTSGSTVRSDQFMVRLPGFDRAQADSLSASGVQVKITQS
SDEFSNFLLLMLPWGLFGFAYFFIFRRMSMQNDVQRNIFSFGKSRAKLIS
EFDVKVTFNDVAGVDEAIEELKETVEFLMNPEKFQKIGGKIPKGVLLLGP
PGTGKTLLAKAIAGEAKVPFFSISGADFVEMFVGVGAARVRDLFETAKKN
SPCIVFIDEIDAVGRSRGAGLGGGHDEREQTLNQLLVEMDGFTARDNVIL
IAATNRPDVLDSALLRPGRFDRQITIDKPDIRGRKAILEIHTRKKPLDSS
VDLETIAKSTPGFSGADLANLVNEAALLASRYNQTEITADNFEEARDKVL
MGPERRSMYISEEQKKLTAYHEAGHVIVSKFTSGSDPIHKVTIIPRGRSL
GQTAYLPLEDRYTQNREYLIAMITYALGGRAAEELIFNEVSTGAANDIEK
ATEIARKMVKNWGMSDKLGPINYGDGHREVFLGKDYSHVREYSEDTALQI
DVEVRRIITECMDNARKILTAHVRILHEMAARLIEKESLDSEEIDAIVSS
ETAVANSPA
>CT0040 ftsI, penicillin-binding protein 3
MNQARTTPENHRGTSDHEFSLRLGIMVALMLLFCLAIVSKLLGIQVFDVK
KYRAKASRQYETIVTEKARRGRILDRYGRPLAESVESISFYADPEQVNDA
GATAKLFANTFGKSRDYYLDLLRKNKRFVWLERNVPVSQAAKLMSLKIKG
VGFRREQHRYYLNVAAQLIGLTDRDNKGISGLEKSFESQLSGRDGVRIFQ
RSATGERYPAPDADQVQPLAGNDVTLTIDADVQGILEEEIAQAVKEFKAS
GAMGIIMDVRTGEIIAMANYPTFDLNRRSGLTADQMRNRAVTDMFEPGST
FKIVMASAATEALGWRAETPVDGHGGSITIYGKQVRDHEPFGAMNFQKAI
TESSNVVAASTAMRVGATTFYRYAHALGFGQKTGVGLIGESGGFLKPVSR
WGKMTLPWMGYGYQVMATPLQVLQAYATLANDGVRMRPFIIKRVSGPDGK
TIEETRPLKVVQALKPETAHYMAREYFRAVVEKGTGMSAKIEGIPVAGKT
GTAQKLHNGSYQGWRYYVASFVGFFPYDNPQYAAIIVVDEPQTAYYASAV
AAPVFSRVCGRAVACSLEMQKRLSMKSPEKELLDRVSTVVVPELKGLRES
EAAKLLEWNGLKLEPSGSSGGSVTSQSIAPGTKVQKETTVRVRLAKRVQN
K
>CT0035 ftsW, cell division protein, FtsW/RodA/SpoVE family
MASLLPSTGRGENIAGKLLLLIVALLMCIGVVVVYSSGAGWAEQKFSDPQ
YFLWRQLTFAIAGMAVIFVVGAIDYHIFRKISKLFLFVSIGLLAILLLLK
LAHVIHGAARWLGFGPLKFQASDLAKYAIIFHFSRLLSEKRAYIKDLHDG
YYPMLVLLMIVVALVALEPNFSTASIIAIIGFTLMFIGGIRIKYLLATAS
LLIPIAAVFAIAAPYRVARLVSFGGGEKELSYQVRQALLGLGNGGLFGLG
LGASKQRELYLPLSYNDFVFVIIGEEYGFIGALVILLLFSGLFACGIIIA
KHAPDLFGRYVAIGVTFAIVFFAFINIAVACHLMPTTGVALPFISYGGTA
LLFNSLGIGLLVSISRYRKKVETIERAQALLESKGGSL
>CT0200 ftsY, signal recognition particle-docking protein FtsY
MGFFDKLGLSRLKEGLTKTRDTLRDKLAFVSRGKTEVDDEFLEELENILV
AADVGVETTLDIVDAVTVRSKGKTYRSEEELNEMVMGEIRNLLVESGHEH
PVDFDAPLSAKPYVIMIVGVNGAGKTTTVAKLAHNYDKAGKKVVIAAADT
FRAAAYEQLKIWADRAGVPIIGQGQGADPASVVFDSVSSAVSKGTDVVLV
DTAGRLHNKSHLMEELAKIMRVAKKKIPEAPHEVLLVLDGTTGQNAVQQA
REFTKFVNVTGLVMTKLDGTSKGGIVLSISRELNLPVKYIGVGEKIDDLQ
LFDRKSFVEALLEKEK
>CT0030 ftsZ, cell division protein FtsZ
MAFELDPGLFDSDQDSGVNIKIVGVGGCGGNAVNNMMDRKISGAEFVVFN
TDRQALLNSKAPIRVQIGKKATNGLGAGADPAKGRLAAEDDRELIAMQLR
GADLVFIAAGMGKGTGTGAAPVVASIARNMGILTIGVVTRPFSFEGQIKA
RIADSGITELRKYIDTLIIVENEKILSIADEGVSATEAYNMANDVLFRAV
KGIADIITHHGHVNVDFADVRSIMQSAGDAVMGSAAAAGERRALKAASDA
VTSPLMEGVKMRGAKGVLVNITGDVTMRDIADAMNYIEEQVGSDAKIING
YVDEPQVSGEIRVTVIVTGFKRVEPGEERQPASSSGQQEKTLPKAHHVQG
FGRLVQSSGYPEQVVEDMRIPAYIRKSRSIQEPFDIGRACSTTRSRPSGG
SAPMPGQSDEDEQIRKGATDTPAYLRRKNNPPLQ
>CT0833 fumA, fumarate hydratase, class I, aerobic
MIALITETSTNLPSDVRKAIADAVGNDAADSRAGLAMSAITLNIDMAVDN
VGPVCQDTGMPTFFVHTPKGADQLAMKRDIEEAVVEATRTGKLRPNAVDS
LTGKNSGNNLGCHVPVIHFEPWDRDEIEVKLILKGGGCENKNIQYSLPAD
IPGLGRAARDLDGVRKCILHAVYQAQGQGCSPGFIGVGVGGDRTSSFELA
KKQLLRSVDDTNSDAVLAELEQEILDKANRLNIGPMGFAGKTTLLGCKIG
TSHRVPASFFVSVAYNCWAYRRLGVIIDPKEGSISEWQYRYPGEIKRMAR
GAGIPLTGREVVLTAPVSEETIRSLKVGDIVIVDGEMHTGRDAFHHYIMH
HDLPEGLDIRGGIIYHCGPVMLKNEAGEYTVVAAGPTTSIREEPYQSDVI
EKLGLRAVIGKGGMGPKTLAGLQKHGAVYLNAIGGAAQYYARCIEKVTGV
DFLEEMGVPEAMWHLQAKAFPCIVTMDAHGNSLHKQVDEESFAMLENIGK
EQA
>CT2192 fusA-1, translation elongation factor G
MARQVALDRVRNIGIMAHIDAGKTTTTERILYYTGRLHKMGEVHEGGATM
DWMEQEKERGITITSAATTCFWTPKYGNYAGLNHRINIIDTPGHVDFTVE
VERSLRVLDGAVALFCAVGGVEPQSETVWRQANKYGVPRIAYVNKMDRVG
ANFFETVKAIRERLGANPVPIQIPIGQGEVFAGFVDLIRMKGIIYDKEDG
STYTEVEIPHDLENEARTWRINMLEAVSELDETLLEKYLNGEDITEEEIR
TVLRQATLGVTIVPVLCGSSFKNKGVQFMLDAVIDYLASPVDDGEVEGHD
PKTEEPIVRQPKDEEPFAALAFKIATDPFVGKLTFFRVYSGVLNAGSYVL
NSTTGKKERVGRVLQMHSNKREERDAVYAGDIAAAVGLKDVRTGDTLCDE
SKPIVLEKMVFPEPVIEIAVEPKTKADNDKLGMSLAKLAEEDPTFRVKTD
EETGQTLIAGMGELHLEILVDRLKREFKVEANVGQPQVAYRETIRGTVEY
EGKFVRQSGGKGQFGLVVLRVEPLEEGKGYEFVDEIKGGVIPKEYIPAVN
AGIQEAMKDGVVAGFPMQDIKVTLIDGKYHEVDSSEMAFKIAGSIGFKGA
AKKANPVLLEPIMKVEVITPEEYLGDVMGDLSGRRGHIEGMGQRAGAQFV
SAKVPLSQMFGYSTDLRSMTQGRANYSMEFESYREVPRNIAEALQEKRVG
KDSE
>CT0144 fusA-2, translation elongation factor G
MQAVPTDQLRNIVVTGHSGTGKTMLCESLALCMGVINRLGSIEDGTTLSD
YASDETERKHSLNTSLIHGVWNEKKINIIDTPGLLDFHGDVKSAMRVADT
VLITVNAATGVEVGTDTVWEYTKEYYKPTMFVLTKLDADRADYNATIEAL
RDHFGHLVTPIQFPAEEGFGHHILIDVLLMKQIEFSPDKPGSMVISEIHD
LYRKKAEVLHQQLVEAVAETDEELMNHFFEEGTLTEDELRAGIKSALVTR
TFFPVFCTSPLHLIGSERLLNAIVNLCPSPIERGPEHAFCSVMNDEKLLP
PDPDGSTIAFIFKTMSEPRVGEISYIRVYSGHIESGHELIDVQTGQLEKL
GQVYTMLGQKKIPVDKLLAGDIGMVVKLKNSHTNDTLADKGVNCRISPII
FPEPVLSSAIVPVTQGDEEKISAGLHHLHEEDPSFAIEHDVEFNQTILKT
LGETHLDIIISRLRNKFNIQVEVAPVRIPYRETIRVSASAQGKFKKQSGG
RGQYGDVWIRIEPLERGSGFEFASEVVGGVVPTRYIPAVEKGLRESIAEG
SLAGYPVVDLKAVVYDGSHHPVDSSEYAFKIAASMAFKAAVEKAKPLILE
PIYSLTVQTPDQFTGEIVGDISSKRGRILGMDTESRFQVIKALIPQASLS
TFHHALTRLTQSRARYNYTFSHYEEAPAEIANQLIAEKTAKQ
>CT0024 galE, UDP-glucose 4-epimerase
MKILVIGGAGYIGSHVAREFLDRGYQVTVFDNLSTGREENLFDDAEFVRG
DIFDAEMLAEVMNRGFDGCVHLAALKAAGESMQKPEEYSVHNICGTIGTI
NQAVASGIKCLLFSSSAAIFGSPAYLPIDENHPKKPENYYGFTKLEIERI
LEWYDRLKGLKFAAVRYFNAAGYDVRGRIRGLERNPANLLPVIMEVASGV
RPMLSVFGTDYPTRDGTCIRDYVHVNDLATAHVLAFEQVIESGESLSVNL
GSETGVTVLEMLEAARRLTGKEIMAEFAPRRAGDPANLVATSAMARELLG
WVPQYSDLDTLVESTWNVYRDVNAGSGRQ
>CT1480 gapA, glyceraldehyde 3-phosphate dehydrogenase
MAKVKVGINGFGRIGRLVFRQAMENPEIEIVGINDLTDVKTLAHLLKYDS
SHKKFNGEVTIEGDNLIVNGRTIAICAQKDPAQLPWASLGATLVVESTGI
FTSREAASKHLAAGAKKVIISAPAKDKIDATIVIGVNDKSITGKEEIISN
ASCTTNCLAPMTKVLNDNFGIVKGFMTTVHAYTNDQNILDLPHKDLRRAR
AAACSIIPTSTGAAKAIGEVLPELAGKLDGFAMRVPIPDGSVTDLSVIIE
KSATKEEINAVMKAAAEGPMKGILEYNVDPIVSCDIVGNAHSCIFDSPLT
MSSGNMVKIVGWYDNELGYATRVVDLLGIYSKFV
>CT0268 gatA, glutamyl-tRNA(Gln) amidotransferase subunit A
MQFHGYEDLRSRLLSGELTCEQVISDYLQRIDSSRDDNIFTVVFHDEAMA
RARELDSKLQRGEAPGVLFGMPIAIKDNIAMKGAPLSCASKILAGYESVY
DATVIKRMQAEDAIFVGRTNMDEFAMGSSNENSAIGPVPNPYDKTRVPGG
SSGGSAAAVANDLAMVALGSDTGGSVRQPAGFCNIIGLKPTYGRISRYGL
VAFASSFDQIGLLAANCDDAALVLGVIAGKDEHDATSSHHDVPEYDTAMD
HVSVDGLRIGVPRAFFPESLNADVAGVVKAGLKKLEEAGAELVEIDLPES
DYAIAAYYILVTAEASSNLARFDGARYGYRSPDSPDLSSMYVNSRTEGFG
AEVKRRIMLGTYVLSAGYYDTYYKKAQQVRRVFQDKYREAFEKVDVIFGP
TSPFPPFGIGDKMDNPLEMYLADVFTVPASIVGMPAISVPVGFDSLGLPV
GAHLICNFFEEGKMLGIARHLQTLCQTAPSN
>CT2215 gatB, glutamyl-tRNA(Gln) amidotransferase subunit B
MNYEIVVGLEVHCQLNTESKAFCGCSAKFGKPANTNVCPVCLALPGALPV
LNARVVEDAVKLGLATNCTIARHSILARKNYFYPDLPKGYQISQYEEPIC
SEGVIHIDLEEGGKDVRLVRIHIEEDAGKSIHDIGDDTYIDVNRCGVPLL
EIVSYPDMRTPKEASAYLQKLRQIVRYLGISDGNMEEGSLRCDANVSVRP
VGATEYGTRTEIKNMNSFRNVERAIEYEAKRHIEVIEGGGTIVQETRLWD
ADKLETRSMRGKEHAHDYRYFPDPDLVPVLVDDGMIRRMQEELPEFPEDR
AARFVSEFGIPAYDAGVITVDRELADYFESTVKVSGDAKASSNWVMGEVM
RTLKEKYLDIHKFAISPERLGGLIKLINAGAISNTIAKQVFEIMQQDEAT
AEAIVEREGLAQVSDRGAIEAAIREILEANQKQLEQYRSGKTQLFGFFVG
QCMQKMKGKANPKMVNDILRSMLDA
>CT1833 gatC, glutamyl-tRNA(Gln) amidotransferase subunit C
MSVTTKDVAYIAELARLKFTDAEQEKMASELNMILHYIEKLNGIDTEGVE
PLSTIHDQINVLRADVEHTPLSNDEALKNAPDSQDRFFKVPKVIG
>CT0064 gcp, o-sialoglycoprotein endopeptidase
MNILGIETSCDETSAAVLSDGSVRSNIVSSQRCHTDFGGVVPELASREHE
RLIVSIVDAAITEANIAKNDLDVIAATAGPGLIGAVMVGLCFAEGLAWAL
GKPFVPVNHVEAHIFSPFISDEPGHREPKGDFVSLTVSGGHTLLSVVRQD
LGYEVIGRTIDDAAGEAFDKTGKMLGLGYPAGPVIDRLAREGDSDFHRFP
RALTASSQTSKSYRGNFDFSFSGLKTSVRTWLEAHDSEYVQKHQADLAAS
IQSAIVEVLVEKSVAAALLHKVNAISVAGGVSANSGLRSAMQAACDRHGI
ELFIPALAYSTDNAAMIATMAQLMIARGKYRIEDNSYGVAPFARFEAARK
GAR
>CT0147 gcpE, gcpE protein
MVNSKASSFRFSFFIHNSAFIILFPPSFSRLLFIPAIGYLQDHLQSLSTS
RLMSEERLVSGNIIDYPAPVYSYRRRVTREVPFGTIFLGGYLPIRVESMI
TAHTMDTAASVEQCRRLYEAGCEIIRLTVPTEKDAENLKNIREQLRRDGI
DTPLVADIHFSAKAAMKAVEFVENIRINPGNYATGAKFSSKDYTDDEYRA
ELDKVREEFTPLVRKARSLGVSMRIGTNHGSLSDRIVSRYGNSPEGMVEA
ALEFSRICEDEGYYDQLFSMKSSNVRVMIQAYRLLVARADAELRYAYPLH
LGVTEAGDGDEGRIKSAMGIGALLEDGLGDTIRVSLTEDPVNEVPVGFAI
VKKYNDMLLVRGDRAHLPVKHVIEHERKSAGHVQLPFEPFSYSRRPSISI
DGAGIPVGGDALPGVETAAHAPITDTESLRDEILARLDPGKPEDAIRSEL
VSVGVGSAEDISLLKALLDSLGNLREKIVVSTADTSIVPALLPLCGRVRL
DIVEGETLGTGLIESLHDRNAAIEFCFIHEKSSENVPAEVLVRLAAKLKA
RGLQRVMLSIVSDAPLYSTRKLALELKKAGLDYPIAVRYRRLDGERSGVL
IQSAIQAGTLFCDGIGDLIALETNMPASEEVSLCFNILQAARIRMSKTEF
ISCPGCGRTYFELEKTTALIKQRVSHLKGLKIGIMGCIVNGPGEMADADF
GYVGSGKGRVSLYVGKECVEENIPEAEALERLIELIRQNGKWVDPV
>CT1626 gcvH, glycine cleavage system H protein
MNIPDNLRYTKDHEWIKLLEDGLTALVGITDFAQSELGDIVFVETKPVGT
KVAAHGTFGTVEAVKTVADLFAPAAGEIVEVNAGLDDAAIVNSDPYNEGW
IVKMKLDNPADVEALLSPADYSALIGE
>CT1625 gcvP1, glycine cleavage system P protein, subunit 1
MPFIVNTDAERAEMLREIGVENFEALIADIPEEIRLKKALDLFPAMGEPE
VKSLLEKMASGNAATCDHVSFLGAGAYDHFIPSAVKTIASRSEFYTAYTP
YQAEVSQGTLQAIYEYQSVMCRLYGMDVANASMYDGASALAEAALIALNV
TGRNGIVVAGKLHPYTSQVLETYLEAAGDRSIVQNGLENGIGSVEALEAL
VSSETAAVIVQQPNFYGCLEEVEAIGEIARKNGALFIVSADPVSLGVLEA
PGNYGADIAVGEGQSVGNAQSFGGPYLGILTVKQAHVRKIPGRLVGMTKD
KDGNDGFILTLQTREQHIRREKATSNICSNQALCALQSVVHLSLLGKEGI
RDVANRSMQKAHYLADRIAELPGYSMKFSAPFFREFVVETPVPSATIIEK
MLEKKVFAGVDLSAWGEDGLLVAVTEKRTKEELDSFVSELAALG
>CT2123 gcvP2, glycine cleavage system P protein, subunit 2
MKEQLIFDLSRSGRKGYSLSPLDIPERPADELLPSKFLRKEPAELPEMAE
SEVVRHFIRLSNLNYHVDKNMYPLGSCTMKYNPKINDYTCDLPGFASMHP
LQPESTSQGALQLMYELAEMLKEIAGMKAVTLQPAAGAHGELTGILLIKK
YHEKLGNKRHKLLVVDSAHGTNPASAALGGYECVSVKCDESGCTDMGDLR
AKLDGEVAALMLTNPNTVGIFEKQIPEIEKLVHGNGSLLYMDGANMNALL
GITRPGDMGFDVMHYNLHKTFSAPHGGGGPGSGPVGVSERLVEFLPVPVI
EKFEKDGQTRYRLNSSKPNTIGRMMNFYGNFSVLVRAYTYIRMLGADGLR
RVSENAIINANYLLQRLVEHYALPYPRPVMHEFCLSGDRQKKEHGVRTLD
IAKRLLDYGYHAPTVYFPLIVSEALMIEPTETEAKETLNAFADAMIAIAE
EAKSNPDLIKSAPTTTPVKRLDEAQASRQLNICCQH
>CT1788 gcvT, glycine cleavage system T protein
MKKTALSAWHEAAGAKMIDFGGFLMPVQYTGIIAEHKAVREAAGLFDVSH
MGNFYVRGARALEFLQYMTTNDLAKIVDGQAQYTLMLYPDGGIVDDLIIY
RVSADTFFLIVNASNCEKDFDWLSSHIGQFEGVALENHTSELSLIALQGP
KSFDILARVFPGAGIDKLGSFHFIKLPFEGAEIMVARTGYTGEAGVEICL
PNERAVALWSALMEAGKSDGIQPIGLGARDTLRLEMGYSLYGHEIERDVN
PLEARLKWVVKLNKPNFIGKQACEQVEINPRKSVVGFSLEGRAIPRQHFK
VYNSDKQEIGEVCSGTVSPTLQEPIGTASLLLDYAQPGTPIFVEIRGTMQ
PGAVRRLPFVHADRP
>CT2023 gdhA, glutamate dehydrogenase
MAGLTKENPFDIARRQLDAAAGIIGLDAEVLELLRWPMREMHVTIPVKMD
DGAVRAFHGFRVQYNDARGPNKGGIRFHPDETIDTVRALAAWMTWKTAVM
DIPLGGAKGGVICNPKTMSPGELERLSRSYIRQVGRLLDLEKDVPAPDVY
TTPQIMAWMADEYSFMQGHNDFGVITGKPLALGGSLGRGDATARGGIICI
REAAKMLGINLRGKPAAINGFGNAGAFAHKLAVELLGMKVVAVSDSKGSI
YNPDGFDHQALMEYKKQHGSVADFPGSTPLTDAGLLELDVTVLIPAALED
EISCRNARNIQAKIVAELANGPTTPEADKILHERGVYLIPDLLCNAGGVT
VSYFEMVQNASGWYWEEEVVHRQLEKKMAAAIKAVHQAAVQYSVDNRTAA
MIVAIRRVAEAMKLRGWV
>CT2283 gidA, glucose inhibited division protein A
MYDVIVVGAGHAGCEAALAVARGGLHCLLITSDLSAVARMSCNPAIGGVA
KGQITREIDALGGEMGKAIDATGIQFRMLNRSKGPAMHSPRAQADKTQYS
LYMRRIVEHEPNIDLLQDTVIGVSANSGKFSSVTVRSGRAIQAKAAILAC
GTFLNGLIHIGMDHFPGGRSTAEPPVEGLTESLASLGFSFGRLKTGTPPR
IDSRSVDYTIVTEQPGDVDPVPFSFSSTSVANRNLVSCYLTKTTEKTHDI
LRTGFDRSPLFTGKVQGVGPRYCPSIEDKISRFPDKSSHHIFLEPEGTDT
VEMYVNGFSTSLPEDIQIAGLRSIPGLEEAKMIRPGYAIEYDFFHPWQIR
STMETRPVENLFFAGQINGTSGYEEAAAQGLMAGINAVRKILGKELIVLG
RDQAYIGVLIDDLITKETKEPYRMFTSSAEHRLILRHDNADLRLRKIGYD
CNLVSSDDLHRTESIIKRVQHCLEVMKTAKVTPAEINTLLMNKGLQELKT
PARALSLIKRPGISLQDILEHSLSVRSAAEELCNDPRVAEQVQIEIKYEG
YIKREQLVADRIARLDSLHIPDNFNYDSLNSLSSEGREKLLKHRPATIGQ
ASRILGVSPSDVSILMIRLGR
>CT2160 gidB, glucose inhibited division protein B
MQNDLHLLEGLCRQHGIPVTKNALTLLVRYARLLEAWNLKVNLVSRKEHA
PVIVKHVFHSLLIARIHDFKPGETVLDLGTGGGLPGIPLAILFPETSFLL
VDSTGKKIAACKAMIKELGLENVIALHSRVEELKGVIFDTVLSRQVAPLE
ELCAYSARLLKHDGVLICLKGGSLNEEIAEAVLSREKHLGFPASVDQLPI
GEIDPMFSEKQIVIARW
>CT2012 glgA, glycogen synthase
MARRNFKVLYVSGESSPFVRVSSLADYMASFPQALEEEGFEARIMMPKYG
IINDRKFRLHDVLRLSDIEVPLKDKTDMLHIKVTALPSSKIQTYFLYNEK
YFKRYGLFSDISLGGDHKGSAERIIFFSVGVMETLVRLGWQPDIIHCHDW
HAGLVALLAKTRYAKHDFFKKVKVVQTIHNVYRQGVFPSKAFQKHLDPEV
CDALDMEGGEVNLLATGIKHADLVTTTSDRYARQLLDDPELSFGMDKALK
ACGDRFHGILNGMDTRQWNPSSDKLIKKRYSAEQPEMKLEDKKVLLEEVG
LPFSEETPVVGVIGSFDQYQGAEIVKASLAKLLELDIQLIVFGSGDKEFD
QALKETAEENEEKMAFRPEFTDAFYHQMIAGLDILLMTSRIEACGMMQMF
AMNYGTVPVAYAGGGIVDTIEEVSGDKGTGFIFTDYTPEALTAKLQEALA
LFANRERWSALMLECMGRDFSWSTSAGQYAELYRNLLG
>CT0130 glmS, glucosamine--fructose-6-phosphateaminotransferas e
MCGIIGYIGRREAAPLLLNGLKRLEYRGYDSAGMAVLNGSMKMLKKKGSV
SNLEELLNVSGTVMLGATVGIAHTRWATHGDPSDRNAHPHMNVSGDIALI
HNGIIENYSALKQELMGEGYVFESDTDSEVLVHLIDRIWKNDSALGLEGA
VRQALRHVEGAYGICVVSSREPDKIVVARKGSPLVIGLGDGEFFIASDAA
PIVEHTNKVVYLSDGEMAVVTRDSYTVKTIENVEQQKRVTELDFSLEKIE
KGGFEHFMLKEIFEQPEVMRDVMRGRVRVEEGRVHLGGIHDYLDRLKQAK
RIMICACGTSWHAGLIGEYLIEEFARIPVEVDYASEFRYRNPIVSSDDVV
IVISQSGETADTLAALRLAKEKGAMVMGICNVVGSTIPRETLCGMYTHAG
PEVGVASTKAFTAQVIVLFMLAMALSKGRTISQEEIKLNLRELAEVPDKV
AWILEQNDAIKEIAVKLKDARNALYLGRGYNFPVALEGALKLKEISYIHA
EGYPAAEMKHGPIALIDEDMPVIVIATRDNTYAKILSNIEEVRSRKGRVI
AIASEGDREIERLTEDVIYIPQASAAVLPLLTVIPLQLLSYHVATLRGCN
VDRPRNLAKSVTVE
>CT1411 glnA, glutamine synthetase
MSNESKKPVASYYGALTFGTEAMRAKLPKEVFKALQDTIKAGKKLPADIA
GVVAHGMKEWAMEHGATHYTHWFQPMTGTTAEKHDAFLTTQMDGTVIERF
SGEQLIQGEPDASSFPSGGMRSTFEARGYTAWDPSSPAFLMKGGKGMTLC
IPTVFISYHGEALDEKTPLLRSMDAVSKAAIRLLDTIGITGVTKVNTYAG
PEQEYFLIDKKFYAQRPDLIMTGRTLLGALPPKGQQLEDHYFGSIPDRVL
EFMQEVEEELFLLGIPAKTRHNEVAPHQFEIAPIFEQVNLASDHNLLVME
VMRKVADKKGFALLLFEKPFAGINGSGKHNNWSIGIDGGMNLLDPGDTPE
SNISFLVFLVAVLKGVLKRSAILRASVASIGNDHRLGANEAPPAVITVFL
GDLLEKVLDAIESGKVDLKTEKQILDLGLSHVPVLNKDYTDRNRTSPFAF
TGNKFEFRAVGSSQPISVPNMVLNTIMAEALDDLNAEILAKIEGGMAKED
AILAAVRDGIIATKAVRYPGDNYSEDLQRAAAERGLPNMKNTPESVRAWT
DKDTVSMFVKYGVLTAEEIESRYNVRIERYVKGIDIEARTLLLMIKTMVI
PDASEYQGDLASSFNNLAAAAESIGLSDAALQSQAGLLKTLAEDLSKLID
LTAILEETIEEMEEQESELDKADFCSARLLPCMNAIREVADKIEVQVDRS
RWQLPTYSEMLFEH
>CT0185 glpK, glycerol kinase
MAILSIDQGTTGTTCMIYDRTGSVLARAYRELTQHYPQAGWVEHDPEEIW
RTVVECVAEVRGAYSGHIEAIGITNQRETTVVWDRRSGQPVHRAIVWQCR
RTAKLCNRYRSEEAAIRAKTGLPVDAYFSATKIRWILDAHPEIDPSNLLF
GTIDTWLIWKLTGGEVHATDLTNASRTLLFDIHERRWSPELCELFGVPLS
MLPEARPSMGGFGSVRTIPALDGVLIAGVAGDQQAALFGQCCFAPGSVKN
TYGTGCFMVMNTGEKFVSSSHGLLTTLALDGAGRSCYAVEGSVFIGGAVM
QWLRDGLQLIGSAAESETIARSVESNGGVYLVPAFVGLGAPHWNMEARGT
ITGLTRGSSRAHIVRAALESIAFQSHDVFRAMVADIGIQPQSLTVDGGAV
SNEFLMQLQADLLGVPVHRPRNIESTALGAACLAGLEAGVWGSAAELRVL
NSVERVFTPAMPEGEREALLAGWQKALRQTLTS
>CT1834 gltA, citrate synthase
MSDSDHRNSLTIIDNRTGKSYELPIENGTIRTMDLRKIKESEDDFGLLGY
DPAFLNTCSCKSTITFIDGDKGILRYRGYPIEQLAEKSSFLETAYLLIKG
ELPDKERLAVWTYNIRHHTMTHANLTKFMDGFRYDAHPMGILVGTLGALS
TFYPDAKNVGEEESRKLQVRRLIGKMPTLAAMSFRHSLGFPYVLPDNDLS
YAGNFLSMMFRMTEKNYKPNPVLEHALDVLFILHADHEQNCSTNAVRSVS
SSMVDPYSAIAAGCAALYGPLHGGANEAVIHMLKQIGSVDKIPEFIKLVK
SGEGRLMGFGHRVYKNYDPRARIIKKIAFDVFEETGRNPLLDIAIELERI
ALEDDYFIQRKLYPNVDFYSGLIYQAMGFPTEMFPVLFAIGRTPGWLAQW
IEHVKDPEQKIARPRQIYLGEKCRDYVPIDERPRKGDDEQKFGICRL
>CT0401 gltB, glutamate synthase, large subunit
MPENMKAKHEGGLYDPQFEHDACGVGFVANIKGVKSHQIIKQGLQVLVNM
KHRGATGYEKNTGDGAGILIQIPDKFMRKVCAQRNIELPEPGKYGVGMVF
LPPDLTQRRAIEDICRQMVQAEGQKYLGLRKVPTDNSTLGQTARSQEPVV
KQIFVGRGNDNMSDLEFERKLYIIRRRIFKRVRFTSGLLGSSFFYISSFS
ARTIVYKGMLNPEQVEEFYPELRDPDMESAIAMVHSRFSTNTFPSWDRAH
PYRFLSHNGEINTLRGNVNWMKAREKNMQSSIFKGALEEIKPILLEEGSD
SATLDNAFELLVMCGRSMAHAAMMLIPEPWSGNESMDPDKRAFYEYHSCL
MEPWDGPASVVFTDGIQIGAVLDRNGLRPSRYYITSDDLVVMASEVGVLD
IDPEKIIKKDRLQPGRMFLVDTKEGRIISDEEIKKSIASEKPYTEWIERN
VIDLASLPERERMKNPDEDNYSITARQKAFGYTNEDLTLQIRPMAQNGTE
CIGSMGNDTPLAVLSNRPRLLYDYFKQLFAQVTNPPIDSIREEIVTSTTV
MLGAEGNLLESDEINCRRIRIPHPILTDEDLEKIRGIDKPGFKAITLPIF
YNVAEGGRGIQETMQDLYRQAEKAINQEGVNIIILSDKGELEKSRAPIPV
LLALAGFHHFLISAGLRTKVGLIVESGEPRTVHHFSMLIGYGAGAINPYL
AFETIRQQVAQGRITHDEKKAIKNYVKAAVKGVVKTMAKMGISTVQSYRG
AQIFEAVGLNTEVVDTYFTKTPSRIEGIGLDTLADEVRKRHEAAFPPGGN
KVNRGLEAGGDRKWRHDGEFHLFNPETIHYLQHSCRTGDYELFKKYEKLI
DDQSEHYCTIRGLMDIRFSEKPIPIDEVEPVEAIVKRFKTGAMSYGSISK
ESHETLAIAMNRLGGKSNTGEGGEEPDRFVRDANGDSRMSAIKQVASGRF
GVTSEYLTNAEEIQIKMAQGAKPGEGGQLPGTKVYPWIAKTRHSTPGVGL
ISPPPHHDIYSIEDLAQLIFDLKNANRSARINVKLVSTVGVGTIAAGVAK
AHADVVLISGHDGGTGASPISSIMHAGMPWELGLAETHQTLMLNNLRSRI
VVEADGQLKTARDIVIAAMLGAEEFGFATTALVVMGCIMMRCCQDDSCPV
GIATQNPELRKNFKGKPEHVENFMRFLAQGVREYMAKLGIRTLNELVGRS
DLLATSRTIKHWKAKGVDLSKILHQVDTGDNDTPYCTITQDHGIEESLDM
RVLMAICEPAIKRGEKVSTTLPIKNTNRAAGTIVGHEVTKAYGSKGLPDD
TIHLKFIGSAGQSLGAFIPKGMTIELVGDANDYIGKGLSGGRIIAYPPKS
SKFVPEENIIVGNVAFYGATSGEAFIRGMAGERFCVRNSGMEAVVESVGD
HGCEYMTGGKVVILGKTGRNFAAGMSGGVAYVYDVDGAFTGRCNLEMVSL
SAVEAEDELEWLRSKIEQHVEVTGSELGKGLLATWPNASQRFVKVLPNDY
KRAIDAMKEVEAMGMTGDEAVMAAFEKNVHDPSRASGN
>CT0473 gltD-1, glutamate synthase, small subunit
MAIPRQKMPAQDPVERVGNFKEVNLGLTPEQAQQEALRCIQCKDPVCIAG
CPVNIKIDQFIKLIAEGDFMGAVRKIKEDNVLPSICGRVCPQEDQCEKVC
VIGKKHEPVAIGNLERFVGDYERTSGQKIDPKIAPPTGKKVAVVGSGPAG
LSCANDLAQYGHKVVVFEALHELGGVLMYGIPEFRLPKEIVREELDGLRR
MGIEFRTDVVVGRTITIDELMEEEGFDAVFIGVGAGLPWFMGIPGENLVG
VLAANEFLTRVNLMKAYDFLKSSDTPVFDCKGKNVAVFGGGNTAMDAVRT
AKRLGAEHAYIVYRRSEKEMPAREEEIHNAKEEGIEFLLLTTPLEFVGDE
KAWLTGAKCQKMELGEPDDSGRRRPVPVEGSEYILPIDMAIISIGNGPNP
LIHQTTPDIEVSKRETIVVDVNTMQTSKENVYAGGDIVTGGATVILAMGA
GRKAAAAINEKLGGTAKNFNEW
>CT0402 gltD-2, glutamate synthase, small subunit
MGKLKGFMEYRRALPVDREPLERIKDWNEFHEEMSAEQLSDQGARCMDCG
TPFCHSGFMLNGMTAGCPIHNLIPEWNDHVYRGFWRDAWERLMKTNNFPE
FTGRVCPAPCEGSCVLGIIQPPVTIKNIEYSIIEHAFAEGWVEPKQIAVR
TGKKVAIVGSGPSGLACADQLNKAGHTVTVFERDDRVGGLLMYGIPNMKL
DKRLVVQRRVDLMKEEGVSFVTGTEVGVNYPVDKLLSEYDAVVLCIGATN
PRDLNADGRNLDGIHFAMEFLRASTKAVLDGTEPVLSAKGKDVVVIGGGD
TGTDCVATSLRQGCKSVIQLEIMPKPADFRQEDNPWPEWPKVFKVDYGQE
EAAAVQGGDPRRYLMMTKKFIGENGRLSAVEVSKVEWIKQEGRTIPVPVS
GSEEIIPAQLVLLAMGFLGPEAQLLQSLGVEQDSRSNIKADEKSYRTSVD
KVFAAGDARRGQSLVVWAINEGRAAARECDRFLMGCTSLP
>CT0299 gltX, glutamyl-tRNA synthetase
MAGQKVRTRFAPSPTGYLHVGGLRTALYNYLFAKRMNGDFVIRIEDTDQS
RKVEDAEKKLISTLEWAGIIADESPMHGGNYGPYVQSQRLSIYRDYCTRL
LEDKNAYYCFSTPEELEENRQLQLKQGLQPKYNRKWLPEDMGGNMPESEI
KKKLDEGAPYVVRMKVPDYVSVWFEDMIRGPIEFDSATIDDQVLMKSDGF
PTYHFASVIDDHLMEFTHIIRGEEWLPSMPKHLLLYEFFGWEPPKFAHLP
LLLNPDRSKLSKRQGDVAVEDYMRKGYSSEAIVNFVALLGWNEGEGSEQE
VFSMEELISKFSLERVGKAGAVFNVEKLSWLEKQYIKTRPVEKIVGNIKP
VLQAKLAEFSPEMSVERITSDDYLAKVVELMRERVNFEHEFVTFSSYFFF
EPESYEEEAVAKRWTPNVPPLLQEFADLLEANDDFTAENIEAQLKAFVAP
KGLKPAVLIHPIRIAVSGVSFGPSLYHMLEVLGKEAVLRRIRRAIERIEV
PAA
>CT1590 glyA, serine hydroxymethyltransferase
MDNDILKRLDPEVFEAIANETKRQTETLELIASENFTSKAVMEACGSVMT
NKYAEGYPGKRYYGGCEFVDVAENLARDRAKKLFGCEYVNVQPHSGSSAN
MAVLFAVLKPGDAIMGLDLSHGGHLTHGSKVNFSGQFFDAHSYGVDKETG
IIDMNKVEEMARRVKPKLIITGASAYSQGFDFKAFREVADKVGALLMADI
AHPAGLVAAGLSANPMPHCHFVTTTTHKTLRGPRGGMIMMGKDFENPLGL
TINTKNGSRVKMMSEVIDAEVMPGIQGGPLMHIIAGKAVAFGEALQPEFK
AYAQQIKDNAAAMAAKFLAAGYHIVSGGTKNHLMLLDLRNKNVNGKVAEN
LLHEAGITVNKNMVPFDDKSPFVTSGIRIGTPAMTTRGMKVAEAEKIVEF
IDRVISAANDANVADVCKAVRAEVRELCLGFPLNNYGSLV
>CT2255 glyS, glycyl-tRNA synthetase
MNKLVSLAKRRGFIFPSSEIYGGLSSCFDYGPLGSEMKKNIKDLWWNAMT
RRHQNIVGIDASIMMNPTVWEASGHVASFNDPMIDDKTTKRRYRADHLIE
NHIEKLHRDGKEAEAAAIKVAYEAAGGTEDPNRTLYNIIIEAGIKAPDTG
SADWTEVRQFNLMFQCNMGAVADSAGVVYLRPETAQGIFVNFHNVREASR
MKVPFGIAQIGKAFRNEIVKGNFIFRMVEFEQMEMQYFVKPGTQLEAFEA
WREERFRWYSETLGMSKEKLHWYKHDKLAHYADLAYDIKFEFPFGIEEIE
GIHSRTDFDLSQHQKYSGKSMEYIDQTTNERYIPYVVETSSGCDRTFLAL
LSDAYQEDVVDGEPRVMLKLAPKVAPVKAAVLPLMKKGEMGEKAAQLCRD
LSESFMVQYDDAASIGKRYRRQDEIGTPFCFTVDHDTLENGTITVRYRDT
AAQERINMSKAAEFLATKLM
>CT0247 gmk, guanylate kinase
MSAEQVLDQGRLIVFSAPSGTGKSTVAKLVMERLGSLEFSVSATTRQMRA
GERDGVDYHFLSREEFEKKIAENGFIEHEFFFGNFYGTLLDKTIDAIKAG
HNLLFDLDVKGALNLKRIFGDQALLVFLKPPSMEELARRLQARDSESAEA
LKSRLERAEMELSHAGEFDFVVVNDDLGRTVDAVATRIAEFLPQP
>CT0889 gph, phosphoglycolate phosphatase
MMNHSVTQKFSAVVFDMDGTLLDTLADISYSLNSVLEEEGYPTHPVEACR
AMVGFGMRELVRKALPESAHDEAITEPLLKKLQARYAEHWNDSSRPYDGV
VELLDAIDRLGLKKAILSNKPDRFTRQCAEELLAPWKFDVIMGFREGIAP
KPDPTGALLVAKELGVEPASILYVGDSGVDMKTANAAGMYPLGVTWGYRP
GDELLATGAAKLVSHPTEIIPLLTA
>CT0399 gpm, phosphoglycerate mutase
MKKLVLLRHGESQWNRENRFTGWVDVDLSEKGREEARTAGQLLKDEGFVF
DLAYTSVLKRAIRTLWTVLDEMNLMWIPVTKNWRLNERHYGALQGLNKAE
TAQRHGDEQVLIWRRSYDTPPPALTESDEFWPGKDPRYASLSSQELPATE
CLKDTVARFLPYWHETIAPQIRDGKNVIITAHGNSLRALVKYLDNISDED
IVGLNIPTGIPLVYELDDDLKPLKSYYLGDQEELKKKVEVVVKQGKA
>CT0092 gpsA, glycerol-3-phosphate dehydrogenase, NAD-dependent
MKITVLGAGSWGTTLAMLLANKGHEVRLWAHRPEFARALEADRENKRYLK
GVLFPDNLRVVENLHDAVETAEMIVTAVPSHALRETAAAFAHLPLDGKII
VNVAKGIEQHTGKRMSEVLLEALPRIAPEQIAVLYGPSHAEEVARQQPTT
VVACSVSEATARRVQEAFHTSSFRVYVNTDLIGVEIAGSVKNVIAIAAGI
SDGLGFGDNAKAAIITRGLAEISRLSSKLGADPLTLSGLSGIGDLVVTCL
SQHSRNRYVGEQIGKGRKLDEVIGEMSMVAEGVLTSKAVVKLAERLGVEM
PISQAVYEMLYENKPAPQAILELMERDPKPEHY
>CT1527 gpt, hypoxanthine-guanine phosphoribosyltransferase
MIENVPFTELISAERIAARVAELGAEISRDLAGIDELTVVCVLKGGFIFT
ADLVRHITIPCRIEFIRASSYGTHRASTGKVMLDHHHDPHVEGKNVLLVE
DILDTGLTITRVLEELRGHNPASLHVCTLLDKPSARTTPVKADFTGFTIP
DVYVVGYGLDAAGKHRELPYVASLNA
>CT1446 greA, transcription elongation factor GreA
MSDRIYLTRDGYNRLKEELHLLMTETRKEVLEKIAEARSHGDLSENAEYD
AAREEQSQLEAKIGEIENKLASATILDPKQIKTDRVYILTSVKLRNLDAE
DEIIEYTLVSSEEADSDLGKISVRSPVGRSLIGKSVGDKVTISVPKGELH
YEILDIFVK
>CT0530 groEL, chaperonin, 60 kDa
MTAKDILFDAEARTKLKVGVDKLANAVKVTLGPAGRNVLIDKKFGAPTST
KDGVTVAKEIELVDPVENMGAQMVREVASKTSDVAGDGTTTATVLAQAIY
REGLKNVTAGARPIDLKRGIDRAVKEVVAELRNISRSISGKKEIAQVGTI
SANNDPEIGELIAEAMDKVGKDGVITVEEAKGMETELKVVEGMQFDRGYL
SPYFVTNSETMEAELDEALILIHDKKISNMKELLPILEKAAQSGRPLLII
AEDIEGEALATLVVNKLRGTLKVAAVKAPGFGDRRKAMLEDIAILTGGTV
ISEEKGYKLENATMAYLGQAARITIDKDNTTIVEGKGKQEEIKARINEIK
GQIEKSTSDYDTEKLQERLAKLSGGVAVLKIGASTEVEMKEKKARVEDAL
HATRAAVQEGIVVGGGVALIRAAKGLAKAVADNEDQKTGIEIIRRALEEP
LRQIVANTGTTDGAVVLEKVKNAEGDYGFNARTEQYENLIEAGVVDPTKV
TRSALENAASVASILLTTEAAITDVKEDKADMPAMPPGGMGGGMY
>CT0529 groES-1, chaperonin, 10 kDa
MNLKPLADRVIVKPAPAEEKTKGGLYIPDTGKEKPMYGEVVAVGPGKVSD
AGQVVAMQVKAGDKVLYGKYSGTEVHVEGEDYLIMRESDIFAILG
>CT0569 groES-2, chaperonin, 10 kDa
MLFDKRKNVTDKFIVVGDRVLIKPKSLDETTKSGIYLPPGVQEKAKIQSG
YVLKTGPGYPVGPPNDTDEPWKERAESPQYIPLQAKTGDLAIFVQSSAWE
IEYEEERYLIVPNSAILLLIREDDDLEDSLS
>CT1485 grpE, grpE protein
MSRKHHKEQEEIQEQETISAGAAETPAEETAAIPAATEADMDAEISARDA
EIQKLREEVMRRAAEFENFRKQKEREAALSGTRMLENIVRELLPLIDDLK
RLMSHIPAEMQAMAEAKPFIEGVELIHKNFMSLLERKGVKEIEAKGKMLD
VNFHEAITQIDAPGAEPDTIVEEYQTGYTLGDRVIRHAKVIVAK
>CT0175 guaA, GMP synthase
MATSLQSVIVLDFGSQYTQLIARRIREIGIYSEIFPYHTKAETIRAHQPK
AIILSGGPNSVYDEKAFMPDPEVFSLGVPVLGICYGLQAIAKHFGGNVES
SSKQEFGRAKMLVNHDESESLLFRDIPDSDVWMSHGDKVTQLPEGFRVTA
STANAEVCAIESFGSKAALKVYGLQFHPEVQHSLYGKQLLSNFLIDIAGI
TPDWSPKSFIQHQIEEIKRVAGDSTVVCGISGGVDSTVAAVLVSKAIGDK
LHCVFVDNGLLRKDEAVKVMEFLKPLGLNISLVDASDLFLGRLKGVASPE
KKRKIIGRTFIQVFEKNIHDEKFLVQGTLYPDVIESVSVKGPSETIKSHH
NVGGLPKRMKLKLIEPLRELFKDEVRAVGRELGIAEDILMRHPFPGPGLA
VRVLGSLTRERLDVLRDADQIFIDELKSSGLYSKVWQAFSVLLPVQSVGV
MGDKRTYENVLALRAVESTDGMTADWAHLPHDFLAKVSNRIINEVRGINR
VVYDISSKPPATIEWE
>CT1293 guaB, inosine-5'-monophosphate dehydrogenase
MDKILYDALTFDDVLLVPAYSNVLPKETVVKSRLTRQIEVNIPLVSAAMD
TVTEAELAIALARAGGIGIIHKNLSIDEQARQVAKVKRFESGIIRNPIHL
FEDATIQDAIDLMIRHSISGIPVVEHPTPEGCLLLKGIVTNRDLRMTASS
DEKITTIMTTNLVTAKEGIDLLTAEDILMRNKIEKLLIIDDNGYLKGLIT
FKDIQKRKQCPDACKDSQGRLRAGAAVGIRANTMSRVDALVAAGVDVVAV
DTAHGHSQAVLDMVATIKQKYPELQVIAGNVATPEAVRDLVKAGADAVKV
GIGPGSICTTRIVAGVGMPQLTAIMKCAEEAKKTDIPLIADGGIKYSGDI
AKALAAGADSVMMGSVFAGTDESPGETILYEGRRFKAYRGMGSLGAMSEP
EGSSDRYFQDVSAETKKYVPEGIEGRIPAKGKLDEVVYQLIGGLKSAMGY
CGVRTITELKENTRFVRITSAGLRESHPHDVMITKEAPNYSTSA
>CT0141 gyrA, DNA gyrase subunit A
MQREKILPISIEEEMRDSYLDYSMSVIVSRALPDVRDGLKPVHRRVLYGM
HELGLQAGKPYKKSARVVGEVLGKYHPHGDSAVYDSLVRMVQDFSLRYPL
IDGQGNFGSVDGDSPAAMRYTEVRMKAIAGEMLKDLDKETVDFSLNFDDS
LEEPTVLPAAIPNLLVNGASGIAVGMATNIPPHNMREVVSGLIALIENPE
IEIGDLMKHVTAPDFPTGGIIYGYEGVRQAYLTGRGKVVIRARAVVEVTQ
KNGRESIIVTELPYQVNKVRLIEKIVELVHDKKVEGIADIRDESDREGMR
LVIELKRDAVAKVVLNNLYKHTPMQDTFGVIMLALVDGVPRVLNLKEMMQ
YYIRHRNEIVLRRTQYDLNAAEKRAHILEGLKICLDNLDEVISTIRQSPD
TPAAQSRLMDRFGLTEAQSKAILEMRLQRLTGMERQKIDDEYRETLALIE
ELKSILESPAKQMEIIKAELLKVSDVYGDERRTELRPQEGDFSIEDMIAQ
EDVVITITHEGFIKRFPVSGYRRQHRGGRGVAGAQAKNEDFIEHMFIAST
HNYILFFTTAGRCYWLKVYEIPEAGRAARGRSLANIMELPPGEKIRTYIN
VRNFDDPHFIIMATAKGIVKKTSLEEYSHPRRTGINAITIDEGDELIEAR
LTDGDHQIILAKSSGYAVRFPESEVRSMGRTAMGVKGITLDEDERCISMV
TTKRNDTSLLAVTDNGYGKRSKVEDYRMTKRGARGVITIKAHEKIGNLVG
LLDVNDEDDLIIITTNGIVIRQHVSDIRVLGRNTSGVRLIRLDAGDRISA
TARVPKSDDDSATEPLGDEGQIDLEF
>CT2263 gyrB, DNA gyrase subunit B
MQETDIQTAQNASTEYGATNIQVLDGIEHVRKRPAMYIGDIHSRGLHHLV
YEIVDNAIDETLAGYNDYIHVAMNPDGSVTVTDHGRGIPVDIHPVKKKSA
LELVMTVIGAGGKFDKGAYKVSGGLHGVGASVVNALSEWCEVEVYRDGKI
FRQTYRRGVPQGGVEEIGTTDQRGTKVTFKPDPEIFKITEFRKDIILDRM
RELAFLNSNLRIIVQDAEGNEEIFHYEGGLKEFVRFIDSNRLSLLKEPIF
LSGERDSTMVEIALQYNDSYQENIFSYVNNINTHEGGTHVTGFRKALTRT
LNAYAQKNDLLKNVKITLTGDDFKEGLTAVISVKVPEPQFEGQTKTKLGN
SETQSIVETIVNEQLAEFAESNPGTLKLIIEKVKSAAISREAARKAKELT
RRKSVLESSGLPGKLADCSINDPDHCELYIVEGDSAGGSAKQGRDRSFQA
ILPLKGKILNVEKARLHKMLENEEIKTIILALGTSIGEEEFSPEKLRYGK
IIIMTDADVDGAHIRTLLLTFFFRYMRSLIEAGKVFIAQPPLYLVKSGRE
QQYAWDEDERLSIVEQMKKLQKGKANVSIQRYKGLGEMNPEQLWSTTMDP
AHRSLLQVTVENAMEADQVFSTLMGDKVEPRRDFIEKNARYVRRLDV
>CT0920 hdhA, 7-alpha-hydroxysteroid dehydrogenase
MRLQGKIALVTGAAGGIGSATARCFAREGATVVLVDIDLEACSRVCDDIA
QSIGQASCSGVDLTSEKQVVELFTNIRRDYGRLDIVVNIAGGDCEPAASV
ETIDMEMAMKNLDMNLKSCMLCCREAAKIMKPQAYGRIVNMSSLVWRGSP
NQFSYSASKGGIFAFTRSLALALGAFNITANALAPALVEVEAFTRALGPE
RWQALAKASAERYPLGRIATPDDVAKAALFLASDDASFITGQILEISGGA
RL
>CT0866 hdrA-1, heterodisulfide reductase, subunit A
MSVETILVVGGGISGITTAVEAAEVGYNTVLVEKKPYLGGRAAQLNKYFP
KLCPPYCGLEMNFRRIKPNPKITVYTMTEVESVSGQEGNYSVRLKVSPRF
VNEKCTACNACAEACPVERPNEFNFGMDKTKAAYLPHVLSYPMRYVIDRD
ACKDNSCDKCVKACKYNAIDLNMKPQTIEMKVGSIVYATGWNPYDATRMD
NLGFGRVKNVITNMMMERLAAPNGPTGGKLLRPSDNKEVKKVVFVQCAGS
RDENHLNYCSSICCMASLKQATYVRERIPDSQVVVAYIDLRAPGKYEEFL
NKVEADANVKLVKGKVAKIEEDKATGGVILEFEDVEGGGKRHEHADMAVL
ATGMEPCVKANSMLSFEENGFINGGSAAGIYSTGVAKRPSDVTTAIQDAT
GMALKSIQSLVRS
>CT1246 hdrA-2, heterodisulfide reductase, subunit A
MPETISTQGFRMAKIGVFVCHCGENIASKVDTGNLVKALSDHPGVEICNE
YKYFCSDPGQETVKRAIRENNLTGVVVAACSPRMHETTFRKACAEAGLNP
YMLELANIREQCSWVHSDKDQATKKAIEITRSLVEKVKRNNELKPISVPI
TRRALVIGGGIAGIQAALDIAGAGREVILVEREPSIGGHMSQLSETFPTL
DCSQCILTPRMVEAIQHPNITVLTYSEVEEVDGYIGNFRVKVRKKARYVD
IDKCTGCGDCIQKCPVKKIPSEFECGLGNRTAIYTPFAQAVPNVPVIDKN
RCTYFKNGKCKICQKTCQIENCIDFEMQDTFEEYEVGAIVVATGFQIQDT
SVYGEYGYGKYKDVITGLSFERLASASGPTAGKILRPSDGKEPETVVFIQ
CAGSRDPSKGVKYCSKICCMYTAKHAMLYAHKHHGSNSKIFYMDIRAAGK
GYDEFTRRAIEEDEAEYLRGRVSKVFEENGKLIVRGVDTLLGKPVEVAAD
MVVLATAIVPQPDAKEFAKRIGIGYDEYGFYNEAHLKLRPVETATAGIYL
CGACQSPKDIPDSVAQASAAAAKVLGLFNREQLEREPVVAAVGESTCAGC
WGCVYACPYNAIEQKEIRDRNGNLIKEVASVNPGLCQGCGTCVTFCRSNS
IDLAGFTEKQIFAEVMAL
>CT1426 hemA, glutamyl-tRNA reductase
MNIISVGVNHKTAPIEIRERIALSEVQNKELVTDLVSSGLASEAMVVSTC
NRTELYVVPGMPEVNCDYLKDYIISYKNARNAVRPEHFFSRFYCGTARHL
FEVSCAIDSLVLGEGQILGQVKNAYRIAAEVGTAGILLTRLCHTAFSVAK
KVKTRTKLMEGAVSVSYAAVELAQKIFSNLSMKKVLLIGAGETGELAAKH
MYAKNARNIVITNRTQSKAEALAEELGTNRVLPYESYKEHLHEFDIIITA
VSTKEYILNAAEMQQSMAKRRLKPVIILDLGLPRNVDPEVGALQNMFLKD
IDALKHIIDKNLERRRAELPKVKAIIDEELVAFGQWLNTLKVRPTIVDLQ
SKFLEIKEKELERYRYKVSEEELRRMEHLTDRILKKILHHPIKMLKAPVD
TADNIPSKVNLVRNIFDLEEPNQSLQ
>CT1431 hemB, delta-aminolevulinic acid dehydratase
MSQLDLLNIVHRPRRLRRTAALRNLVQENTLTVNDLVFPLFVMPGTNAVE
EVPSMPGSFRFTIDRAVEECKELYDLGIQAIDLFGIPEKKTEDGSEAYND
NGILQQAIRAIKKAVPELCIMTDVALDPFTPFGHDGLVRDGIILNDETVE
VLQKMAVSHAEAGADFVSPSDMMDGRIGAIREALDESDHSDVGILSYAAK
YASSFYGPFRDALHSAPQFGDKSTYQMNPANTDEAMKEVELDIIEGADIV
MVKPGLAYLDIVWRTKERFDVPVAIYHVSGEYAMVKAAAANGWIDEERVM
MESLLCMKRAGADIIFTYYAKEAAKKLR
>CT1427 hemC, porphobilinogen deaminase
MKKELIIGTRSSPLALWQAEFTKAELSRHFPELNITLKLVKTTGDVLLDS
PLSKIGDMGLFTKDIEKHLLAKEIDLAVHSLKDVPTGTPEGLVISSFTKR
EDTRDVIISKSGKGLKDLPPNARMATSSLRRMSQLLSMRPDLQILDIRGN
LNTRFKKFDDGEFDAMMLAYAGVYRLEFSDRISEILPHDVMLPAVGQGAL
GIETRTDDAETIEIVRVMNDSNTEICCRAERALLRHLQGGCQIPIGCYGS
YISGTLKLLAFVGSVDGKTALRNELTKPVNTPEEAEAVGIELAEVLLSMG
AEKILADIRKTR
>CT1428 hemD, uroporphyrinogen-III synthase
MKTVLVTRPKHQAEPFVRELAQYGLDSVVFPTIEIRPVTGWSVPDLTRFA
GIFFTSPNSVQFFLERLLEESPDELPNLQQARVWAVGKTTGGDLEKHGVS
IEPLPKSADAVSLMSGIDASEIEGKTFLFVRGSLSLGTIPEVIAKRGGIC
VELTVYDNIQPSLEETQKIKSLLTEGKIDCLSFTSPSTAINFFEAIDSKE
VPSDVLIAAIGTTTSSALEKLGVKVDIIPEYFDGPNFAKAIAAALS
>CT2039 hemE, uroporphyrinogen decarboxylase
MLKNDLFIRALKRQATPRTPIWVMRQAGRYLPEYRAVREKTDFLTLCKTP
ELACEVTIQPVDLMGVDAAIIFSDILVVNEAMGMNVEIIETKGIKLTPPI
RSQADIDKLIDPDIDEKLGYVLDAIRLAKKELNDRVPLIGFSGAAWTLFT
YAVEGGGSKNYTWAKKMMYREPKMAHQLLQKISDCISAYLVKQVEAGADA
IQIFDSWASALSEDDYREFALPYIKQNVAAVKAAYPEIPVIAFAKDMNTI
LSDIADCGADAVGLGWNIDIAKARKELNDRVCLQGNMDPTVLYGTPEKIK
SEAAKVLKQFGQHNDHSGHVFNLGHGILPDVDPANLKCLVEFVKEESAKY
H
>CT1487 hemK, hemK protein
MPEEKVWSVVELLKTTIAFFAEKKIDEPRLSAELLLGHVLGLQRLQLYLD
HERPLTLKELEAFRAACRERLQGRPVQYIAGEAFFYGYQFFVDERVLIPR
PETELVLEHAMERLAASGLDSADSPSILDVGTGSGCIAITLALRLPGARV
TAADVSADALDVARRNADAHGVSERIRFVEADALSASFADAVGGPFDLLV
SNPPYIPEAEWATLQEEVRRYEPRLALVAPTGFEYYQSIAVAAPSLLRKG
GVLCFELHADGAAEVRNLLGSSFADVQVMQDYNKLDRGLSCMAQ
>CT2099 hemL, glutamate-1-semialdehyde 2,1-aminomutase
MPVLTRSAELFEKAKKFIPGGVNSPVRAFKSVGGTPIYMAKGQGAYMTDV
DGNTYLDYVGSWGPFILGSMHPRVTAAIEYTLRNIGTSFGTPIELEIEIA
ELLCKIVPSLEMVRMVNSGTEATMSAVRLARGYTGKDKIIKFEGCYHGHG
DSFLIKAGSGVLTLGDPDSPGVTKGTANDTLNATYNDIESVKAIVNENKG
QVAAIIIEPVAGNTGVIPAKKEFLVALRELCDAEGIVLIFDEVMCGFRVA
LGGAQELYGVTPDLTTMGKIIGGGLPVGAFGGKRHIMENIAPLGSVYQAG
TLSGNPLALTAGLETLKILMEENPYPELERKAAFLEAGFKANMEKLGLNY
TQNRVGSMACLFFTETPVVDYKSAITADTAKYGKYFHSMLDQGIYLAPSQ
FEAMFTSFAHTDEDLEKTVKANYNALVAATK
>CT0372 hemN, oxygen-independent coproporphyrinogen III oxidase
MTTDVVAKLALKYSNPGPRYTSYPTIPSWSEDGVTQEQWKEAMVKGFNES
NETTGISMYIHIPFCENYCYFCGCNAHRTTDHSLEKPYIDALIKEWQMYL
DVFPGKLNVKELHIGGGTPTFLSPENLIRLVDTLFRDVNKMDDYMFSFET
NPRSTTKEHLEALYSVGFRRMSFGIQDFDPVVQAEINRPQSFELVKEKID
MAREIGFNSVNFDLVYGLPKQTLATVTDTIQKVMELRPDRLAFYGYGHNP
HLYEGQRRFKLEDLPVGDVKQELYDQGRAMLESIGYHEVGMDHFAIEGDA
LYEAMKNGTLHRNFMGYTENTTQMMLALGASSISDTWYAFAQNERTDDRY
MEEVNKGRFPIMRGHLLSDEDLVLRRHILNLMCRQETSWEDPKLYTEELD
IARYRLEDMENDGIVVLGEKSVKVTEIGVPFLRNVCMAFDAHLWRSDSLS
KAYNVSRDIQKEYIEKARQAKLQQTS
>CT1384 hflX, GTP-binding protein HflX
MTTIPSPEPRERAVLVGITSTPDIPRHLVEEYLDELKFLADTAGADVITS
IIQEKKQPDPATCIGSGKAEDLAGLVEADSIDIVIFDDDLTPVQVRNLER
ILKCKVIDRTGLILQIFAIRAKSAQARTQVELAQLEYLLPRLSGAWTHLS
KQKGGIGTKGPGETQIETDRRLVRNRIASLKKKLRAVSLQHDTQTRGRAA
VPRVALVGYTNAGKSTLMNALCPEAGAYAENRLFATLDTKTRRLELKINK
LVLLSDTVGFIRKLPHTLVESFKSTLDEVLQADFLLHVIDVSHPGFEEHM
QVVRETLKEIGVKHDHIIEVFNKIDALDDPAILTGLRGKYPDAVFISAVR
GLNLSALKETIANYVARDYKTRKVKTHVSNYKLIGYLYDHAEVIDKKHVD
EDVLLTIRVHRNNLKQIDAMLKASASKNHAAANLQHHETHD
>CT0477 hisA, phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase
MLIIPAIDIKEGKCVRLTRGDFAQKKIYLDNPCDMAVIWRKQNAKMIHVV
DLDAALTGETVNFERIREIVNVLDIPIQVGGGIRSVEAVEKYLDIGVSRV
VIGSAAVTNPGLIADLLKKYRPSQIVVGIDAEHGVPKIKGWTESSNMQDY
ELAGEMKKLGVERIIYTDITRDGMLQGVGYETTKRFAEKAGMKVTASGGA
TTSDDLHKLRSLEKYGVDSVIIGKALYECNFPCQELWYAYEQGLGIDGEF
STARKKECCS
>CT0735 hisB, imidazoleglycerol-phosphate dehydratase
MTQRIASHSRKTAETDISATVNLDGSGTSAIETGVVFLDHMLTNFSRHSG
IDVQLRCSGDLEVDDHHTVEDVALVLGKAIVDALGDKKGIGRYGWAIIPM
DEALAQCSIDLGGRSYCVFRAEFQRPVIQGLSTEMVEHFFVSLSRTMNAN
LHLAVLEGRNTHHMIEALFKSLAYAMKQAVKVESTEIKSTKGAI
>CT1256 hisC, histidinol-phosphate aminotransferase
MLNPALEHIETYKVEGGQEAEVKLNQNENPFDLPSWLKDKILDQFRHEPW
NRYPDILPYRGMAAYASFLGVKPELVIMSNGSNEMLYTIFMACLGAGRKV
LIPEPSFSLYDKLARLQQAGVVEVPMHDDLSFDVDAIIEAARREKVDFIV
LSTPNNPTSKSLSHDEIERIVEAADAIVLVDEAYVEFSREQSALDLIDRY
PNLIVLRTMSKALALAGMRIGFAIANPELLAEISKPKIPFASSRLAEITL
MAVLENYSLVTDAVQYILAERGRIEAELTEIPGIHTFESDTNFLIIRVAN
ASEVFRKLKNAGVLVRNVSGYPLMENCLRFNVGLREENDRLLELLKKL
>CT0546 hisD, histidinol dehydrogenase
MLTIYHFPQDEAALREQLNRTVSFDPDAQRTVDDILYRVRTEGDAAVLDY
TERFQGIRLYDMRVPEAEIEAAYAAADPEFIAILEEAFANITAFHRNEAE
KSFFYEQKGGVILGQRVTPMEKALLYVPGGKAAYPSSVLMNAAPAQVAGV
DEISMTTPCDAEGKVNPHILAAAKVAGITSVYRLGGAQAVAAFAYGTATI
PKVDIVTGPGNKYVALAKKQVFGHVAIDSIAGPSEVVVIADAGAEPEFIV
MDMFAQAEHDPDASAVLITPSAELADAVRETAARLAGTMLRGEVITRALT
DNGAIVVTGSMQEACKVSDMIAPEHLELHVDNPWEILPDLRHAGAIFMGQ
WSCETVGDYFAGPNHTLPTNGTARFFSPLSVRDFVKHTSIIAWSKSELAR
TGEKIARFADHEGLQAHAEAVRVRLKHL
>CT1514 hisF, hisF protein (cyclase)
MLAKRIIPCLDVRDGRVVKGINFEGLRDAGSILEQARFYNNEMADELVFL
DISASLESRRTTLEEVLKVSGEVFIPLTVGGGISSVERAHDAFLHGADKV
SVNTAAVSEPELISRIAEKFGSQAVVVAIDVKKVDGRYIVHTHSGKKPTE
YEAVEWAHKVQELGAGEILLTSMDRDGTQEGYDNEILKMISTTVHIPVIA
SGGAGNLEHLYRGFTDGHADAALAASIFHFRTYSIRQAKEYLRERGITVR
L
>CT1988 hisG, ATP phosphoribosyltransferase
MSNNKVLKLGLPKGSLQDSTLELFANAGFHFSVQSRSYFPSIDDDELEAI
LIRAQEMGRYVSLGAFDAGLTGKDWIIETDADVVEVADLVYSKASMRPVR
WVLAVPESSPIKTVKDLEGKHIATEVVNITKKYLAENGVNASVEFSWGAT
EVKPPELADAIVEVTETGSSLRANKLRIVETVLQSNTQLIANKAAWADPW
KRKKIENMAMLLQGAINAQGKVGLKMNAPKAALDKIMSGIPALRQPTVSD
LADKEWVALEVIVSEKIVRTLIPELKRAGAEGIFEYNINKLID
>CT0476 hisH, amidotransferase HisH
MVFIADYGAGNLRSVHKAFDYLGIEAVVSDKASEMSRYDKVLIPGVGAFG
PAMEALNRQGFDEAIREHIDKGRSVLGICLGMQLFLSESEEMGAYKGLDI
VPGKVLRFTSSTDKIPQIGWNSVDYCKDSVLFRNVPDQSYFYFVHSYYCA
PDEPESVAATTFFAGKKFCSAIEKNGIFAVQFHPEKSSEAGLQVLKNFAE
F
>CT0463 hisI, phosphoribosyl-AMP cyclohydrolase
MGENQETQKSFLETVKFDSNGLVPAIVQDHETGKVLMMAWMNLESLKMTL
EKKKACYWSRSRNKLWLKGESSGNMQEVHDILIDCDGDTLLLKVSQKGGA
CHVGYHSCFYRRTVDGESMEICDTLMFDPEEVYGKQS
>CT0236 hisS, histidyl-tRNA synthetase
MTQFQGVKGTRDIFPDEISRWHYVEGVVHSVAALYGFSEIRTPVFEYTEL
FQRGIGATTDIVGKEMFTFLPDPNGRSLTLRPEMTAGVMRACLQKNLLSQ
APVHKLWYISDLFRKERPQAGRQRQFTQFGAELLGVSNPAAVAEVLTLMM
QVFETLGLHGLRLRINSLGDLDDRARYREALRAYFQPYEADLDELSKERL
KKNPLRILDSKNPALQEMIMGAPRLYHSLKPESVAEFEQVLAYLDDRQIA
YNVDHLLVRGLDYYCHTAFEVTSSELGAQDAIGGGGRYDGLARELGASAD
LPAVGFAVGMERLMIVMEKQGLFATLNPHGPLVSVIIQQKELAGHAMQVA
FHLRKAGIKTEIDLAERSMKAQMRDANRMGSAYALFVGQSEFESGIYALK
NLVTSEQTSLDLDAIIEVLREPAARESLRP
>CT1486 hrcA, heat-inducible transcription repressor HrcA
MNYRDLTLRERQVLGIIIQSYVVSAMPVGSRTIARNYNLGLSDATIRNVM
ADLEADGFISQPHTSAGRVPTDKGYRYYVDLIMNVSRIDEEEKRMIDDRF
SNRNSELKGTSAEVLGTAARVLGSISRQLAVVLPPRLSNAVFERLDIVQL
ASSRIMVVIAIQSLFVKTIVMELNAEISRQKIDAVVDVLNERLPGLTLEE
IRSTIAQRLSDFKGSEELMNSIVSSADTLFDESSILEQLYVSGTENIVDQ
PEFKQPEKVRDIITMIEDKFGMARLVDNAVPSALRQVSECEVAISIGTEN
RTGKAADLTIVSSPYFAGKMIGRVGVMGPKRMNYEHAVRVVNYMAGCLSE
ALSGNN
>CT0675 hsdM-1, type I restriction system adenine methylase
MARSDSDKNGNGGNLGFEAELFKAADKLRGNMEPSDYKHVALGLIFLKYI
SDAFEAKHKALLAEDALAAEDKDEYLADNVFWVPKEARWSHLQANAKQPT
IGTLIDDAMRAIEKDNASLKGVLPKDYARPALNKVMLGELIDLISGIGHL
LPSPSGRGAGGEGQSFDILGRVYEYFLGQFAGAEGKRGGEFYTPRSVVRV
LVEMLEPYSGRVYDPCCGSGGMFVQSEKFVQEHGGRIGDIAIYGQESNYT
AWRLAKMNLAVRGIDADIRWNNEGSFHKDELRDLKADYILANPPFNISDW
GGDRLREDVRWQFGVPPVGNANYAWLQHIYWHLAPNGTAGVVLANGSMSS
NQSGEGEIRRAMLEADAVDCMVALPGQLFYSTQIPACLWFLARNKNPANG
KTGGLRDRRGHVLFIDARKMGVLVDRTRRELSDEEIQKIARTYHAWRGEP
DAGDYADVAGFCKSATLDEIRKHGHVLTPGRYVGAAEVEDDGEPFEEKMA
RLAAQWRQQRDEAAKLDAAIEANLKELGFWE
>CT1881 hsdM-2, type I restriction system adenine methylase
MTSIQQRAELQRRIWQIANDVRGTVDGWDFKQYVLGALFYRFISENFAAH
MEAGDDGIRYAELPDSVITPELKDDAIKTKGYFIYPSQLFANVVARANTN
DSLNTDLAAIFTAIESSANGYPSEQDIKGLFADFDTTSNRLGNTVKDKNQ
RLAAVLKGVAELDFGPFDDAHIDLFGDAYEFLISNYAANAGKSGGEFFTP
QHVSRLIARLALHGQKSVNKIYDPACGSGSLLLQAKKPFDERLIEDGFFG
QESNHTTYNLARMNMFLHNINYDKFNIQLGNTLLEPHFADEKPFDAIVSN
PPYSVKWIGSDDPTLINDERFAPAGVLAPKSKADFAFVLHALHYLSAKGR
AAIVCFPGIFYRGGAEAKIRQYLVDNNYVETVIALAPNLFFGTTIAVNIL
VLSKHKPDTTTQFIDASALFKKETNNNVLLDEHIEQIMAVFASKEEVPHV
AQSVPLERIAANNYNLSVSSYVEARDTREVVDIAQLNAELKTTVARIDEL
RKQIDAIVAEIEGEEDEA
>CT1878 hsdR, type I restriction system endonuclease
MVSRMTDYKTIAESNNFIVLDRYMPDWRVAEGYQSEADLERELIDDLRRQ
GYEFLPAIKTPEAMLANVRVQLQALNDVQFSDGEWARFVETWLDKPSDGI
VEKTRKIHDDYIHDFVFDDGRIQNIHLVDKKTLVRNKVQVIRQFVTTPAL
CATPPREGNFSGGEQLWDQFPSFGGVPVGRGGYNRYDVTILVNGLPLVQV
ELKRRGVAIREAFNQVHRYSKESFNSAHSLFKYLQLFVISNGTDTRYFAN
TTRRDKNSFDFTMHWAKADNTPIRDLKDFAATFFQKHTLLSVLLHYSVFD
VSNTLLVMRPYQIAATERILWKIKSSHQAKTWSTPEGGGYIWHTTGSGKT
LTSFKAARLGTELDFIDKVFFVVDRKDLDYQTMKEYQRFSPDSVNGSDST
AGLKRNLEKDDNRIIVTTIQKLNNLMKSEPDLPIYHKQVVFIFDECHRSQ
FGEAQKNLRKKFKRYLQFGFTGTPIFPENALGAETTASVFGRELHSYVIT
DAIRDEKVLKFKVDYNDVRPRFKAIETERDEKKLSAAENRQALLHPERIR
EITQYILTHYRQKTHRLQPGAKGFNALFAVSSVEAAKLYYEAFKTQQKDS
AKPLKVATIFSYAANEAQDAVGDIADEGFEVSALNSSAKEFLNAAIADYN
ALFKTNFSVDSQGFQNYYRDLAKRVKGTDDSGKRLPADEQVDLLIVVGMF
LTGFDAPTLNTLFVDKNLRYHGLLQAYSRTNRIFDATKTFGNIVTFRDLE
QATIDAITLFGDKNTRNVVLEKSYREYMEGYTDALTGQARRGFVEVVQEL
QARFPDPAALEKEADKKAFVRLFGEYLRAENVLQNFDEFAALKALQSVNT
GDPAAVEAFKAQHYLSDEDLAALQAIKLPPERTMQDYRSTYNDIRDWLRR
EQAGVEKEKSTIDWDDVVFEVDLLKSQEINLDYILELIFERNKETRSKAE
LVEEVRRVIRASLGHRAKESLVVDFINQTDLEQLADKASVIEAFFTFARA
ELQREAQELIEAEKLNAEAARRYIATSLKREFASDTGTDLNAVLPRMSPL
NPQYLTKKQSVFQKIAAFVEKFKGVGGQV
>CT0679 hsdS-1, type I restriction system specificity protein
MRRELGMGSEWLGEECEIVMGQSPPSETCNTVGIGIPLLNGPTEFGPHHP
SPAQFTTDVRKRAIPGDILFCVRGSTTGRMNWADQEYAIGRGIAAIRHKF
KPELQPFVRAVIECYLPELLAQATGSTFPNVSAQQLSNLKWPELAADEQR
AIAYILGTLDDKIELNRKQNETLEAMARALFKAWFVDFEPVRAKLEGRWQ
RGQSLPGLPAHLYDLFPDCLVDSELGEIPEGWEIGSFADVVEIIGGSTPK
TSVSEYWGGDIPWFSVVDTPASSDVFVVQTEKSITQSGLNESSARLISKG
TTIISARGTVGNLAIAGCDMTFNQSCYALRSKNSLGSYFVFLSAQRMVEQ
LKAMAHGSVFSTITRQTFEAVQTVLPPENVLQQFERSFASLFDEILNNVN
ESRTLAKLRDTLLPKLISGELRVNDAKRILARIELDTATQRREQ
>CT1880 hsdS-2, type I restriction system specificity protein
MSGREFLQKLLDGERVEWKALGEIIQLEKGRQLNKDLLSSSGRYPAYNGG
MSYSGFTDSYNYSENKTIISQGGASAGFVNFVTTKFYANAHCYVVLPDTE
VVDNRYIYHFLKLNEERLTSCQHGAGIPALRASEITSLKIPIPCPDNPKK
SLAIQAEIVRILDAFTELTAELTAELTARKKQYAYYRDRLLTFTTPPYGH
PSKGGELFSLFGHPSEGGELFTPYGHSVEERELNSPSLKGWQAQPDGVVP
VEWKTLGEVGHFIRGSGIQKSDFKASGVGCIHYGQIHTHYGTWTTETKSF
IDPEFANRLKKAKPGDLVIATTSEDDDAVAKAVAWIGTEDVAVSTDAYIF
RHTANPKYMSYFFQTDMFQEQKKPYITGTKVRRISGDNLAKILIPIPPLA
EQERIVAILDQFDALTNSLTEGLPREIELRQKQYAYYRDLLFSFPKASFG
GVPEGRDQFPSFGGVPEGRGGLNA
>CT1191 hslU, heat shock protein HslU
MIQPDEPQDFPVKLIDKEQLTPTQIVEQLDKYIIGQKDAKRSVAIALRNR
LRRQNVSEELRDEIMPNNIIMIGPTGVGKTEIARRLAKLAKAPFVKVEAS
KFTEVGYVGRDVESMIRDLVEQAVAMVRSEKTEEVREKAALLVEERLLDI
LLPPVSGLEESEHVGDEEEAVVVEGDAEVVVEKNLEREINRKSRQKMRER
LRDGRMEDRQIELEVSSDGQGGMMQIFGPLGQMEEIGNIMQDLMSGMPKK
RKKRRMTIAEARKYLEQEEVQKLIDMDAVVKEALRKVEDSGIVFIDEIDK
IAAPTTGAGGKGPDVSREGVQRDLLPIVEGTAVSTKYGVVKTDHVLFIAS
GAFHVARPSDLIPELQGRFPIRVELKSLTEEDFFLILTQPRNALIKQYRA
MLKTEQIDLEFTEEAIREIARTAAKVNETVENIGARRLHTILTNLLEELM
FGIPEMVMDGTIDRNIVIDDNQVREKLGKLVADRDLSQYIL
>CT1192 hslV, heat shock protein HslV
MGYEKPQIRSTTVIGIIRDGKAALGSDGQMTLGNTVMKHSTRKIRSLYQG
RFITGFAGATADALTLLDRFESKLEAYSGKLDRAAVELAKDWRTDKYLRR
LEAMLAVVSTDKALIISGTGDVIEPEDGIVAIGSGSMYALAAARSLMKHT
TLSAEEIVRESLQIAADICIYTNDHIVIETL
>CT0829 htpG, heat shock protein HtpG
MSSNPTSSVREFEYKAEMKQLLNLIVHSLYTHPEIFLRELISNASDALGK
ARFRMLSSDEGLDKSGDLKITITVDKESGSFVIEDTGIGMSEEELISNLG
TVASSGTLGFMEALKEQQKEGQRLDANLIGQFGVGFYSVFMVTDEVTVET
KSIESGLQGWRWKSSGQGSYTIEPVEREARGTRISFILKEEFREFAQEYR
VEQIIKKYSNFVEYPIYIGSRQINSMTALWQRPKSELKQEEVNEFYKFIA
NDFKDPLDYLHVSVEGAVSFKALLFIPSEAPMELLYNQGALEKRGPQLYV
KKVLIQHECRDLLPEYLRFVSGVVDTEDLPLNVSRELVQASPVMAKIKQI
LTTKLLGWFDTIAKEEPEKFRAFYKAFGTILKIGLNTDFTNRDKLIDLLR
FETTKTVEGEYVTLKEYVGRMAEGQTEIYYHSGSSRAQMLAHPNLEYFRK
RDIEVLLLSDPVDVFVIPSIFEYDKKPLKSIEKAEIDMSTVEPEGERLSA
EGTVGVISLFKEVLGERVADVVESRRLVSSPVTLVSGKDALDSQFEKMMK
MMNKDADMPSTKKILEINTAHPIIRNLAGKHAVGLSTDPVVRAAVTQLFE
SALLLEGDLESVADYVSRMNELVEAATRS
>CT1799 hupA, hydrogenase expression/formation protein HupA
MHEMSIAMSVVEAVVDKAREEGGGKITGIDLVVGRLAGVEVESLKFCFGA
AARGTLAEGAELVIEEPEGRGRCEACGAEFPVTSFYAKCSACGQFRVKIE
SGRELAVRSFTIE
>CT0160 hupB, DNA-binding protein HU-beta
MSKAELAEKIAEQTGLTKADAERAVNAFINVVTSTLKSGDDVTLVGFGTF
TTGDRAERQGRNPQTGKTITIAAKKVVKFKPGKALKEEVGG
>CT0780 hupD, hydrogenase expression/formation protein HupD
MEVVRALEASGDWPETVQFVDGGTQGLYLLDYFESCDALMVFDSIIPVEF
EPKVYCYRKEELPAFIHRKMSAHQMGLSELLAVARLHGREPSEIVLIGAP
PHDLGLGNPLSEPMLRHLARAVETGRELLEEWLVKVKASGFPCVAGDIRM
PHRF
>CT0777 hupL, uptake hydrogenase, large subunit
MNATQQTQDHLVHFYYLHALDWVDVVSALKADPGQASAIAQSISAWPKSS
VGYFKDLQKRLVAFVESGQLGIFSNGYWGHPAYKLPPEVNLIAVAHYLEA
LDFQKEIVKIQTVFDGKNPHPNYVVGGMACAIDPNSDTAINIERLNMIKK
IIDETQTFIDQVYIPDLIAIAGFYKEWLYGGGLGNFMSYGDFPETTLDDY
EKLLWPRGIIMGKDLTTVLDVDPRDMSQIKEEIAHSWYTYSNGSDQGLHP
WEGETNPKYTAPKPPFEYLDTDKKYSWLKTPRWKDQPVEVGPLASVLVAY
AKGDSMIKDTVGMVLSKLEVGPEALYSTLGRTAARGIECKQTAGFMRHFY
DELVANVKTGDYQTFNSERWEPLTWPKECKGFGYTEAPRGSLGHWIRIEN
GKIGDYQIVVPSTWNASPRDGRGNSGAYEAALKGTPMHDPSQPLEILRTV
HSFDPCLACASHVFDMNGNEITKVRIV
>CT1894 hydA, hydrogenase/sulfur reductase, alpha subunit
MKVDFNIDIHHLPRVEGHGDIRISVRDGKLVDAKWAVVETPRFFEVMVKG
LSAERVPFLTSRICGICSISHSLASIRSLERAMQITPPETARIIRLLAMH
GETLQSHALHLFFLAAPDFMGTPSVVPLMQSHPEVVEAGLLLKELGNELS
IATTGRATHPVSLVLGGVSKAPAKQRLAEIKQMIADRKPMLDRATDFFMT
LRVPEFVRETEFISLHDGKSYPYIGGNLVSTDGVKRDENDYLAMTNEYTV
DFSTSKFTRLSRESSAAGALARFNNNYAQLHPRAKEAAAAFGLEPVCHNP
FMNNIAQLVECHHLVADAEELIDRLLDDDLRNIKADYKPRAGAATGAVEA
PRGVLYHYMETDESGKVVKGDCIIPTTQNNANIHYDLHALAEQSLAQGMG
EKEVEKLCEMLVRSYDPCISCSVH
>CT1891 hydB-1, hydrogenase/sulfur reductase, beta subunit
MIYKVISKDEFRNFVDALVRANTSYGPRQVDTDRNGEPIYQFMPVSSASE
IAFDYTVTTSSAKHFLMPFREELSKFSFRDGDWDQEVKYDAPPIVLVGLR
PCDINAINILDEVMLKGPYPSPYYLARRKNTFIIGMDHLPLPDCFCRSMN
RHTTDHGFNLFASDIGESYFLSINSSKAFNFLKEFETTEPTEEDDCKLIE
RRKLIKQSFKTNVEVTGLPVFLDLEFDSPIWKKWGDKCLNCGSCAMVCPT
CYCYNVEEHFETNLESSSRQRRLYSCNLIDFAEVTGGHNFRPKNGDRLKY
RYYHHYRGFAVNDNQQICVGCNRCGRACLAGINPKDVINDLRLEKESCVT
CVSPSPAKT
>CT1249 hydB-2, hydrogenase/sulfur reductase, beta subunit
MTRILLRNKLDECLAAWQKAGFSVLAPVKRHEMSCFGEVQKSGDMALDLV
LTERTIKDQFFPQTEPLIKYRIGKQQIDSETMTPPEKKRVFFGVRPCDAS
GLAIDDPLFGWDYKDDYWFRRREKSVIVTIACAQADDFCMCTSVKLSPDS
TKGADVMLRPLSDGSGWQVEAVSDRGTALVDTVSSLLQESSAEAAPVPQV
AEKFDVEKVMEWLADKENFESQFWKDIALRCVGCGSCTFLCPTCHCFDIQ
DEGDTYQGIRRKNWDSCSFPLFTMHTSGHNPRNTQSTRWRQRIMHKFNYY
RGKFGVNSCSGCGRCTRQCPVDMGITETLQAITNLPR
>CT1893 hydD, hydrogenase/sulfur reductase, delta subunit
MNHLGFDREKIRIGSFDFTCCEGCQLQLANKESTLPEFLALLDIRNFREI
SSERLDDYDIALVEGSITRQDEVERLKAIRAQAKTLVAYGTCACFGGMNA
QKNKFDKEECIRTVYGDKEIDTMQESHKISDFVQVDYSIPGCPVNKEEVE
RIVVSIATRSPISLPKYPVCVECKQRLNTCLFDLGEVCLGPISRAGCNAV
CPTGKTVCLGCRGPADGINYDSFVQLVKEHGLSENEMNEKLAFYNGFAEY
LSHEG
>CT1892 hydG-1, hydrogenase/sulfur reductase, gamma subunit
MVTDHGYKCRITNIVPLSEHEKLFQLRIVDPRERELFTFRPGQFLMLEVP
GYGEAPISISSATSNREFIELCIRKAGHVTSALFEAKQGAFVAVRGPFGT
SFPMEAMQDHDVLLIAGGLGIAPLRAPLFWINDHRDHYRNVSFLYGAKEP
SQMLFTYQFEEWKTVSHIDLHTIVEKPDDQWTGRTGMITLLFDEITIDPK
NTWAIVCGPPVMFKFVCTHLDKLGIPMNRMFVSLERHMNCGMGKCCRCMV
GSTFTCLDGPVFDYWSVMNLKEAI
>CT1250 hydG-2, hydrogenase/sulfur reductase, gamma subunit
MIYSPFPMRVVSKRAEAPGVNTLKLEFVKQEDHEFFKANYRTGMFGLYGV
FGEGESTFCVASPETRKEYIECTFRQSGRVTSTLANTDAGDIVTFRGPYG
NRFPIEEFEGKNLLFIAGGIALPPTRSVIWSCLDQREKYRDVTIVYGART
VADLVYKNELDEWKQRDDVRLVLTVDPGGETPDWQDHVGFVPTVLEQAAP
SPENTIAVLCGPPIMIKFTLTALEKLGFTAENVYTTLENRMKCGIGKCGR
CNVGSIYICKEGPVFTAAEVQAMPQADL
>CT1798 hypB, hydrogenase accessory protein HypB
MCDTCGCSGDGGAVLRKPGVKDYHVHVGDDGHHHHHDHQHSHDHGHSHDH
AHEHHHDHHHGEARKVQMEQDVLLQNNMLAERNRGWFEARRVLALNFLSS
PGSGKTSILEKTIPALLEQCPVTVIEGDQQSTNDADRIDALGVPVIQVNT
GTGCHLDAQMVQRAVRELDPPERSLVCIENVGNLVCPALFDLGEAAKVVV
ISVTEGDDKPLKYPTMFHEADICLLNKTDLLPYVDFDVAKCREYAMQVNH
HLEWIEVSAKTGEGFDQWIAWLKTKLAAL
>CT1794 hypD, hydrogenase expression/formation protein HypD
MKFIDEYRDPARARALLDRIRQVARHDWTIMEICGGQTHSILRNGIDQLL
PPNVQLVHGPGCPVCVTPLETIERALAIAARPNTILTSFGDMLRVPGNSK
DLFMARSEGADVRVVFSPLEALQIARDNPSKEVVFLGVGFETTAPANAMA
VHQAAREGLTNFSELVSQVMVPPAMRAMLSSPGNRVQGFLAAGHVCAIMG
YEEYEPVAAEFGVPIVPAGFEPVDLLDAILKTVELLEAGRSGVVNAYGRV
VSREGNPEARRVMQEVFEVADRPWRGIGVIPASGLVLRPEYKRFDAEKRF
DVGHIAPQESPLCRSGEVLQGHLKPSDCPAFGKECTPQTPFGATMVSSEG
ACAAYYRYHRNSNT
>CT1792 hypE, hydrogenase expression/formation protein HypE
MTMQLSCPSPILRHETVQMAHGAGGRLSQELTARVFMPHLGNPVLDQLDD
QARFEAEPGRIAFTTDTYVVSPIFFPGGNIGDLAVNGTVNDLAVGGAVPR
YLSAGFVLEEGLPLSELERIVKSMADAARKAGVVIACGDTKVVQKGQCDR
IFINTSGVGFIPPGRDVSCRNLRPGDAVLLSGTIGDHGMAILTTREGLSF
QSRIQSDSAALNGMIADLLAAAPNLHAMRDPTRGGVAATLNELALSSSVG
IELDEATIPVREEVRGAAELLGIDPLAVANEGKMLVVVPAAEADAALAAM
RLHEHGREAAVIGKVTEEHPGMVVMRTPFGSRRIVEMPLGEQLPRIC
>CT1797 hypF, hydrogenase maturation protein HypF
MAENEARGALSGLSQRERRRIEVRGIVQGVGFRPFVWRLAHQLDLAGFVR
NASSGVVIEVEGKPDALDRFETALRSEAPPLARIDSIDRHSIPPSVCGEP
GFIIPESSGGEAMQTLISPDIATCPACLADIADPAGRRYRYAFTNCTDCG
PRYTIVERIPYDRPFTTMKRFELCPDCQREYNDPGDRRFHAQPNACPACG
PKLELCDAVGVRLAVGDEITAAGELLAGGKILAIKGIGGFHLAVDASNEA
AVRRLRSRKGREEKPFAVMVRDLAAARALCDISPEEKAALASPQAPIVLL
RARQGLSLAPSIAPGNDRLGVMLPYSPLHWLLLREGPKVLVMTSANFSEE
PLVADNAEALERLAGIADAFLMHDRPIARRCDDSVVMSMAGAVRLIRRSR
GYAPAPVRLAESGPPVLGTGGELKSALCLVKGGEAFMSQHIGDMKNYEAY
RHFDDVAAHLQRIFQTEAELLVHDQHPAYMTTRWALEQGKPTLGVQHHHA
HLASCLAEHQHSGPAIGLTLDGAGYGTDGTVWGGEVLIGDAARAMRFASL
EPMPLPGGDVAVRQVWRTALGWLHRSGVSPEGLECFRQPQSAQVLELLRK
EVGTSESSSCGRLFDAVASICGLRHEARYEGQAAIELMQAAGGRIAEAGY
RFGFERKPNRWLMLISPMLRDIAAAVRAGASSTEISRHFHRTLVGIFSEI
TRMAYLETGLKTVVLSGGVFQNQLLTETLARELESNGYQVLMHAQVPTSD
GGISLGQAVIGREFLRGSYRGVDN
>CT0351 icd, isocitrate dehydrogenase, NADP-dependent, monomeric type
MASKSTIIYTKTDEAPALATYSLLPIIQAFTRGTGVDVEMRDISLAGRII
ANFPENLTEAQRIPDYLSQLGELVLAPEANIIKLPNISASIPQLKAAIKE
LQEHGYNVPDYPEAPSNDEEKAIQARYAKVLGSAVNPVLREGNSDRRAPL
SVKAYAKKHPHRMAAWSVNSKAHVSYMTDGDFYGSEQSVTVPAATTVRIE
YVNGANEVTVLKEKTALLAGEVIDTSVMNVRKLREFYAEQIEDAKSQGVL
FSLHLKATMMKISDPVMFGHAVSVFYKDVFDKHGALLAELGVNVNNGLGD
LYAKIQTLPEDKRAEIEADIMAVYKTRPELAMVDSDKGITNLHVPNDIII
DASMPVVVRDGGKMWGPDGQLHDCKAVIPDRCYATMYGEIVDDCRKNGAF
DPSTIGSVPNVGLMAQKAEEYGSHDKTFIAPGDGVIRVVDADGSVLMSQK
VETGDIFRMCQTKDAPIRDWVKLAVRRAKATGAPAVFWLDSNRAHDAQII
AKVNEYLKDLDTDGVEIKIMPPVEAMRFTLGRFRAGQDTISVTGNVLRDY
LTDLFPIIELGTSSKMLSIVPLLNGGGLFETGAGGSAPKHVQQFQKEGYL
RWDSLGEFLALTASLEHLAQTFGNPKAQVLADTLDQAIGKFLENQKSPAR
KVGQIDNRGSHFYLALYWAEALASQDADAEMKARFAGVAQALAEKEELIN
AELIAAQGSPVDIGGYYQPDDEKTTRAMRPSGTLNAIINAM
>CT0311 ileS, isoleucyl-tRNA synthetase
MPATYPEYPSSLSYSAMEAQIREFWIERNIFRKSLEKDAPKGIYSFYEGP
PTVNGKPGVHHLFSRTIKDVVCRYHTMQGYQVPRKAGWDTHGLPVEISVE
KKLGLKNKSHVEEYGVGEFNREARALVYHHIDDNREGWGKLTERMGYWVD
MDSPYITCDNNYIESVWWALKTIFDKGLIYKDYKIVPQDPKSETVLSSHE
LALGYKEVQDPSVYIKFRLKDSGESILVWTTTPWTLISNVALAVGRDIDY
VRVKHRETGEVLILAESRLSVLVEKIGDESAWEVIDRCKGSDLEGRDYEP
LFNYFSPERRAWYVVCGDFVSTGEGTGIVHIAPAFGADDYELSKQYQLPM
LQPVARNGCFTAEVPDYEGMFFKDADKPIMQRLKEEGKLYRRETIQHTYP
FSWRYDVPVIYYARESWYIRTTDIAPRMVALNKTINWNPPEIGTGRFGNW
LEENKDWALSRERFWGTPLPVWVAEDFAIGDGPDSGKLFAVGSVAELREG
FIEIDGEEMNLGDALDKGLVELDLHKPFVDRIWFIRDGKRFNRTPELIDV
WFDSGSMPFAQLHYPFENKELFDKTFPADFIAEGVDQTRGWFYTLHAIAT
LIFDRPAYRNVVVNGHILDKSGQKMSKSKGNVVDPFESMEQYGADAIRWY
LMITSPPWRPKLFNAAEIEEEQRKFFRAFINSYNFFVLYANVDGFRYEEA
DIPFTQRSELDRWVLSSLNTLIAEVTSRMEQYDLTGACRLIGDFTVDDLS
NWYIRRSRKRFWKGEMGPDKLSAYQTLSTVLETLAKLMAPFVPFIAEKIW
LDLKSVGGTSKAESVHLADWPVADESCIDAALEERMKKAQIITSLVRTMR
EKAGIKVRQPLRRILLAAAEPGSRAAYELVSDIIKEEVNVQKIEYVEDED
GSVISKKAKPNFKTLGPRFGKDMKLLAEEIRIMSHKQISRLEKEGSIEID
LGGRICTVLREDVDIVHEDIEGWLVAADDAHRIMVALDTEITEELEMLGL
ARELVSRIQTLRKESGLEITDRIALTIAGSEKLLAAARKSESYIMDETLA
TSIVLLPLDDSQPGDGVEQVNNELCRLSLEKSGS
>CT0618 ilvB, acetolactate synthase, large subunit
MHNNGEKLIGSEIFFECLRRENVEYIFGYPGGALLKVYETLHDVEDIEHI
LARHEQGATHMAEGYARATGRPGVVLVTSGPGATNTVTGITNAYMDSTPL
VVFTGQVPSSLIGNDAFQEADIVGITRPITKHNFLVKDVRELATTIRKAF
YLATNGRPGPVLVDMPKDVLNAECTFEWPENVDIRGFKPTIKCHANQVSK
AAKMIAKAKRPLFYVGGGVISAEASAELRKLAIDQQIPVTMTLQGLGAFP
GDHPLSMGMLGMHGTYWANQAVSNCDLLIAVGARFDDRVTGKVDTFATHA
YKIHNDIDPTNVDKNIKVDLPVVGDSKDFLASLIEAMPKSREDRSAWLAE
IEKWRKQCPLDYEIEPDSLKTEFVIDEVSRQTKGHAVVVTDVGQHQMWTS
QYYKFTEPRSIITSGGLGTMGFGLPSAIGAAFGVTDRPVLLFSGDGGLMM
NIQEMVTAVYNKLPIKIFLINNSYLGMVRQWQELFHQEKYTFTDLASSNP
DFVKVAEAFGCKAMSASNPEAARAAITEALAYNDGPVLVDFRVIRKDMVF
PMVPAGGSISDMLLARLNPKTMV
>CT0616 ilvC, ketol-acid reductoisomerase
MNIYYEQDADLAVLQNKNIAILGYGSQGHAHALNLKDSGMNVCVGLKTDS
ASCAKAREAGLKVDTVAEAVKWADIVMILLPDQTQKSVYDNEIAPNLKSG
ATLAFGHGFNIHYKQIVPPADVNVIMIAPKSPGHLVRRTYTEGNGVPCLI
AVHQDATGDAKAIALAWAKGIGGTKAGVIETSFKDETETDLFGEQAVLCG
GSAELIKAGFETLTEAGYPAELAYFECMHELKLIVDLYYEGGLSRMNYSV
SDTAEYGGMTRGPRVVTSAAKAEMKKILEEIQDGRFAKEFIDECNSGYKK
MNELRESNRNHPIEVVGAKLRGMMSWLKKK
>CT0619 ilvD, dihydroxy-acid dehydratase
MRSDTIKKGFEKAPHRSLLKATGCVSTRDDFSKPFIGICNSFNELIPGHA
HLQELGRIAKEAVREAGGVPFEFNTIGVCDGIAMGHVGMRYSLASRELIA
DSVETVVEAHRLDGLVCIPNCDKITPGMMMGALRTNVPVVFVSGGPMKAG
HTPSGKTVDLISVFEAVGKCSTGEITEDELQTVEECGCPGCGSCSGMFTA
NSMNCLCEALGFALPGNGTILAADPRRNELVKAAAGRIIDLVKKEVRPRQ
ILTRTSMLNAFALDLAMGGSTNTILHTLAIASEAELDFDFSELNDLSAKT
PYICKVSPATTEVHIEDVDRAGGISAILKELSKVEGLLDLSAPTVTGKTL
GENIASAEVLDRTVIRSVEEPYSTTGGLAVLYGNLAPNGAVVKTGAVSPA
MMKHTGPAKVYDCQDDAIAGIMNGDVKSGDVVVIRYEGPRGGPGMPEMLS
PTSAIIGRGLGDSVALITDGRFSGGSRGACVGHVSPEAADRGPIAAVQTG
DMITIDIPARSMTVALDDETIRQRIEALPKFEPKIKKGYLARYARMVTSA
NTGAVLKNDF
>CT1605 ilvE, branched-chain amino acid aminotransferase
MDNSLKIWMNGELVGWSDAKIHVMSHVVHYGSSTFEGIRCYDTVKGSALL
FLDEHVRRLWESSKIYRIEIPYSETEIKDAIISTIKANNHKACYVRPLVF
RGQGALGVNPHRASIEVAIATWEWGTYLGEDVLENGVDVKVSSWHRLAPN
TLPSWAKAGGNYMNSQLIKMEALSDGYAEGLALDHNGYVAEGSGENIFVV
RNNIIYTPLAAQSILPGFTRHAVMHIARELGYEVRETPIPREALYIADEI
FLTGTAAEITPVRSVDRIPIGNEHRGPVTEALQHEYLKIVHSGEDPYNWL
TFI
>CT0617 ilvN, acetolactate synthase, small subunit
MKHLISVLVENKFGTLNRVAAMFSARGFNLESISIGETEDPEISRMTIVT
RGEDRIISQVLKQLNRLIDTIKVTDLTHQPHVERELLLLSLKLSKSTQHE
IFELANVFKGKVVDIKQKSITIEFVGSPDKINTAIDLFRPFGIRELARSG
AVAIHRGES
>CT1430 imp, myo-inositol-1(or 4)-monophosphatase
MPMTPDLQLALELAEKAGKLTLDYFGRRSLQVFSKRDDTPVTEADRKAEE
LIRQGISAKFPGDGLFGEEFNERPSGNGRRWIIDPIDGTRSFIHGVPLYG
VMIALEVDGVLRLGVINFPALGELYHAEIGAGAFMNGSSVQVSAIAETAA
ATVVFTEKEYLLDPPSTHPVDLLRSSAGLVRGWGDCYGHMLVASGRAEVA
VDKIMSPWDCAAVIPIVTEAGGCCFDYRGRRSIIDGEGLVSANRSMGEAL
IEAIGKGERAR
>CT2167 infA, translation initiation factor IF-1
MAKEESIEVEGEILEALPNAQFRVKLENGLEVLAHVSGKIRMHYIRILPG
DKVKVQISPYDLSKGRITYRYK
>CT0241 infB, translation initiation factor IF-2
MAIEEKQSRFRISDIAKELQVSPREVLQFVKQAGGKVASTSSMVGEDMRD
MIFGNFSQEKKRVDEARKIRAEKQKRLTRLEEQSRKAYEKEQQLKESLSI
APLPAPVLHAPEVKIEIPPETATTPVAAEPPAILPVVSTPQPEPVADLPL
VTEPVVAEPVAEAEPVVEAPVAETAGPEVMTPLVQTLPESMQAYEAPQKI
GGLTVLGTIDVISEAERKKKSRKKSFRESAVELKGEFENVLSVDSEDGEA
AKKKAAKPDGGEDVGVKKKKGKKKKKVEIDDKVISKNIKSTISGMDDSGL
SGSRQKFRKQRRMEREREFEEAEAMREAEKTLIRVTEYASPHELAELMGL
TAKEIIQKCFSMGKFVTINQRLDKETIELIGLEFGFEVEFISEIEATTTE
ELVDNAEDLQTRPPVVTIMGHVDHGKTSLLDYIRRSNVVAGESGGITQHI
GAYEVSLDDGRHITFLDTPGHEAFTAMRARGAQVTDIVILVVAADDSVMP
QTIEAINHAKAAGVPIVVAINKIDKPEANVEKIKAQLSEAGVLVEDWGGE
SQCQEISAKKGIGISELLEKVLAEAEIRELKGNYSRDILASGVIVESELD
KGKGVVSTVLVQRGFLKVGDPFVAGNSMGKVRALMDERGKRIHEAGPSTP
VRVLGFEDMPQSGDVLTVMASDRDARDLAQKRQIIKREHEFRRSTRVKLD
SIARQIKEGLKKELSVIIKADTDGSIQALADGLMKIHNEEVKVQIIHQGV
GQITETDVLLAAASDAIIIGFRVRPNVNAKRLAEKEDLDVRFYSVIYHVL
EDVEKALEGMLSPELHEESLGSLEIRQVFRVPKVGNVGGAYVLEGKVSRD
AKVRLLRDGVQIFEGQLDSLKRFKDDVKEVDAGYECGVSLKGYDDIKVGD
VIEAYKIVEKKRKL
>CT2127 infC, translation initiation factor IF-3
MKKQKTSGNQKPKVSYRINEQIRVPEVRVVFPEGGMQVMKTQDARRMAEE
RGIDLIEVQPNAQPPVCKLENYGKLIYKMDKKDKDLKKKQKTTSLKELRF
HPNTDKHDFDFKTAHLEEFLRKGNRVRATIVFLGRSIIYKDKGFELADRL
TERLSTVANRDGEPKFEGKKLFVYFEPDKKKIEQFEKQRAMAEKIASLPP
LPPDNSGEPEDDE
>CT1995 iscS, iscS protein
MPSRQIYFDNNATTPLHPEVKKELVAAMEMYGNPSSMHAFGREARANVED
ARHRVAAFMGADESEIVFTGSGSEGNNTVLSLFACGTSQCFPGMKPKIVT
TKIEHPCVLETSQCLVHRGVDVSYLDVDAYGKIDLGQLEEYLKSDGVGLV
SVMMANNEIGTIQDIPAISALAHQYGALMHTDAVQAFGKIPVDVNELGVD
FLTISGHKIYGPKGIGALYVRKGTPYCPFIRGGHQERGRRAGTENTLGIM
GLAMAVDMRKIEMAAEAERLRGFREMLREGISARIDDALFNGHPTDSIPN
TLNVSFPGAEGESILLYLDLAGIAVSTGSACASGSLDPSHVLLATGVDAE
RAHGSIRISMGRSTTVEDVEYMLDVLPGIIERIRNMSTAYIKGGPHAAIR
>CT1994 iscU, iscU protein
MLQSGEWAYTEKLKEHFESPKNVLQGNDTSAFDGVGMEGNLQCGDQMMVA
IKVDKESEKITDCQWKTYGCASAIASTSILSEMVKGMTLDQAFNISPKEV
AKELGGLPENKIHCSVLGDKALRAAINDYFIRNGMSDRVQKVQARTVCQC
MNVTDHDIEEAVLEGARTFYELQEHTKISTVCGQCREDAENELQKYIHLH
FGS
>CT1317 ispD, 4-diphosphocytidyl-2C-methyl-D-erythritolsynthas e
MKTVVIIAASGVGKRMKLDGGRSKQMLEIGGQPVIWHTMKAFQEASTVES
VYIATLPDSIPVFKEIAKANGFTKITAIIEGGKERQDSIGNCMKLIEQEI
ENSGVMPDAILVHDGARPFIQPEEIDDIARLSATHGACVPATKPKDTIKY
VGCNPEIFGETLDRSRLLQVQTPQGFAPAKLIEAHRLAGEEQWYATDDAA
LVERYFPQQAIRIYETGYHNIKITTPEDVFIGEAILAGLKARKSKN
>CT1495 ispE, 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
MKHFSVKACAKINLGLLITSRRADGYHTLETIFAPIDWFDTLEFTESDAI
SMECSNLDLLVDDSNLCIRAAKALQEHTGVKRGATIKLLKRVPFGAGLGG
GSSDAAATLNALCKLWQIDVPSAELHKLAVKLGADVPYFLEMKGLAYAAG
IGEELEDLNLALPWHVVTVFPEVQVPTAWAYKNFHRQFERPVPDLKTLVR
RLCHERDISVFGVFENDFASVVFEHYPVVREVRDALAASGAQFVSLSGSG
SAVYALYEGRADAVKAAEAMSARFRINMTPAGFRME
>CT1601 ispF, 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
MRIGIGIDVHQFAEGRKLIIGGVEVPSPIGLLGHSDADVLLHAISDALLG
AAALGDIGKHFPDTSPDYKDADSMELLRHVCKLLEQEGYKPVNVDTMLLL
EKPKIAPYIDQMRRNIARCLGLEINAVSVKATTNEKLGYVGRQEGACAHA
VCLIENA
>CT0088 kdsA, 2-dehydro-3-deoxyphosphooctonate aldolase
MQKFSVGTVSLPGPKGLFLIAGPCLIENRRMAMDVAAELDRIRKKEDVRV
IFKGSYRKANRSSASSYTGIGDRQALEILREIRDYFGMPVLTDVHETSEV
ELAASYVDVLQIPAFLCRQTDLIVAAASTGLAVNIKKGQFLAPEDMALAA
AKAAATGNKKIMLTERGTTFGYHNLVVDYRGLPIMAESGWPVILDVTHSV
QLPGAGAGVSGGDRRFLMPLARAAVAAGVDGLFFEVHPDPATAMSDASTQ
APLAGFGEMVRELMQLQRCMQSIREEFHSR
>CT0593 kdtA, 3-deoxy-D-manno-octulosonic-acid transferase
MASLALAAYRLLSPLQPNLLRLLSSRKPRLKSFLDARENLFEELEARLHA
LPEPSCRLWIHASSVGEFEQARTIIAELRAQIPDMDVAVSFFSDSGYEAR
KHYPDAAAVFYLPLDTPENARRLVDMIGADIFMLMRYDFWPNHLEAIRKS
GARMILAAAALPPGSPYLKPWLRGLYRDLFSLFDAIYTIDSKDREMFLNQ
FACKNVFTAGDPRFDQVVERQKKSDERAAKLKPLFRDRMVLVGGSTWEPD
EAILIPAWLSLRQKLSLVLVPHKVDRPNIERLLNNLRQQGIKAVTISEMD
EQFDPAQQVLVVNQTGYLAELYTIAAIAYVGGGFGVNVHNTIEPAVHGIP
VLFGPRYGNSPEATGLIEAGAATVITDEPELRKALSALVEDAGHLKHTGA
KSSSFVNARLGATAIIARDIAQHCRTKS
>CT0965 kdtB, lipopolysaccharide core biosynthesis protein KdtB
MSKKAIYPGTFDPFTNGHLDVLERALNIFEHVDVVLAENSQKQTLFSVEE
RFDMVREVVRDLPNVSVDVLREGLLADYARQAGASAIVRGVRQVKDFEYE
FQMSLLNRHLYPEVTTVFLMPNVKYTYVASTIIREVSMLGGDVSKFVHPY
VLDQLSRKRAERRAH
>CT1830 kpsU, 3-deoxy-manno-octulosonate cytidylyltransferase
MNAVIVIPARLSSSRLKEKMLADLEGEPLIVRTWQQATKSRLASRVVVAT
DSERIFAVLREAGAEVVMTSPDLTCGTDRIAEAAEQVGGDVFVNLQGDEP
LIDPATIDLAIAPFFGEGPLPDCTTLVFPLKPDERQIIDDPHVVKAVLDT
RGNALYFSRSPIPYRRETLPDTKYYRHIGLYAFRADVLKAFVALPPSMLE
RAESLEQLRLLENGYRIRCIETTTDTPGVNTEEELEEVRRLFRERFGA
>CT0803 ksgA, dimethyladenosine transferase
MTKVEYKHTHIAAKKKLGQNFLLDRNIPRKIVRESGIKEGDRVVEIGPGF
GALTTAILEVMPSFTAIEKDRELAKFNREEHPQIELIEDDFLKVPLEPLA
AGGKLSVLGNIPYSITSPILFRLLDNRHLIASATLMIQHEVAQRIAAVPG
TKEYGILAVQMQAFCDVKYLFKVGRAVFKPRPDVDSAVIKMVPKAVDPVK
DSEGFRTFVRRVFHQRRKTLLNNLKEYYDTSGVPEPTLKLRAESLSVPAL
ITLFTQLKLIARGDASGQLLLKRRR
>CT2073 kup, Kup system potassium uptake protein
MSLAALGVVFGDIGTSPLYAIRECFHGEFSIPVNTPNVLGVLSLLIWALL
LIVTLKYLTFIMKADNEGEGGILALTALIISHSKKNKSERWILVSLGLFG
AALLYGDGMITPSISVLSAVEGIQIIAPSFGPLVIPVTIAILAGLFLFQH
HGTAKVGSFFGPIILLWFTSIGLCGLVEIVKYPAILKAVFPWYGLEFLVN
NHAKGFLVLGAVFLAVTGAEALYADMGHFGRRPIRLTWSLLVLPALLLNY
FGQGAVLLSEPAKSWNPFYALVPSWGIIPMVILATLATIIASQALITGIF
SLTQQGIQLGYIPRLTVQHTSASHIGQIYVPAANWALMFSTIALVAGFGS
SSKLASAYGVAVTATMLISAVLFYYVARDLWNWNRLGLNLLMGMFMLIDL
SFFGASVSKLFHGAWFPLVIGFALFTLMLTWKQGRLLLMKQIQDRTLTVS
EFTESLAIQQPQRVKGQAIYLTANPDVVPMALLHNMRHNKILHSEVGLLH
FSTERVPRVPNSKKVEVIQLNYGMYKIIARYGFMEYPNIRQVLALANQQG
MHFRTDAISYFINREKIVTGMKSKMSVWRKKLFALMARNALSATAYYDLP
SGQVIEIGVQVQI
>CT1451 lepA, GTP-binding protein LepA
MPPTGTEVGMIRNFCIIAHIDHGKSTLADRLLEVTHTLERNQMSTAQVLD
DMDLERERGITIKSHAVQMRYTAKDGQDYILNLIDTPGHVDFSYEVSRSL
AACEGALLVVDATQGVEAQTIANLYLAIEAGLEIIPVINKIDLPSSDVEG
VARQIIDLIGVNRDEILRVSAKNGIGVDDLMEAIVARVPAPADNRQMPLR
ALIFDSVFDAYRGAIAYIRIVDGVLKKGDRVRFFANDKIFMADEIGTMSL
KRNPVDILEAGNVGYLICSIKDVKDAKVGDTVTLVENPAAERLAGYKDVK
PMVFSGLYPVESNEFEDLRESLEKLSLNDASLVYTPETSAALGFGFRCGF
LGLLHMEIIQERLEREYGVNIITTVPNVEYRVIMTSGETVEVDNPSKMPE
TTKINWIEEPYVSMQIITMSEYIGNIMKLGMERRGEYKNTDYLDTSRVNI
HFEFPLGEVVFDFHDKLKSISKGYASMDYEYIGYRRSDLVKLDVLLNGEP
VDALSSIVHRSKSYEWGRKLCQKLKGIIPRQMYEVAIQAAIGSRVIARES
ISAMRKNVLAKCYGGDISRKRKLLEKQKEGKKRMKQVGRVEVPQEAFLAV
LNIDE
>CT1450 lepB, signal peptidase I
MKKQNTTPNGKNGKTQSREWFEALVIAALVAAILRMFVIESYRIPTGSME
KTLLAGDFLFVNKFVYGAKVPFTDISLPKVHDVRRGDIIVFKFPRDRSLN
YIKRCIALPGDNLEIRNQQVYINGKGMQLPPHAQFIGTKMPAGVPEFQIF
PSMSDYNKDNYGPIHIPRSGDVITLTSATLPLYRDLIAYEGHTVSLVGDQ
VFLDGQAANRYTVSRNYYFAMGDNRDNSLDSRYWGFLPENDIVGQAMMVY
WSWDPDLPLLFDPVEKIASIRWNRIGLAVH
>CT0612 leuA-1, 2-isopropylmalate synthase
MVIELYDTTLRDGTQGEHINLSVQDKLLIAERLDEFGVDFIEGGWPSSNP
KDEEFFLKARKLNLKHARLTAFGSTARSLDNVENDPNLVGLVRCEAPVLT
IFGKTWKAHSVKSLGISDDENAELIYRSVKFLVESGREVFFDAEHFFDGW
KDNAGFAERMIAAAVDGGASRVVLCDTNGGTLPHEIAAIVTRVREIIGVS
VGIHAHNDSDLAVANSIEAVRAGATQVQGTINGIGERCGNANLVSIIPNL
MLKLGAEFSHVQDLKSLTSMSKFVYEILNLPPDSKAPFVGKSAFAHKGGI
HVSAVMKESSLYEHIDPMLVGNRQRVLVSELAGQSNIRYKAQELGISLPE
KGEVFKNLVNHVKKLEHQGYQFDGAEASFELILRRELGQFKPYFEVLESK
VVIQNGQEIKAVDQAVMKVMVGDETEQTVADGDGPVNALDKALRKALLHF
YPDIRMIRLIDYKVRVLEEKSGTSAKVRVLIESSDGQNSWGTVGVSTNII
EASLQALNDSINYYLFYNQSKAAATAPASEALNS
>CT2107 leuA-2, 2-isopropylmalate synthase
MSTMNYKKYAPYPTVGLKDRTWPDKTITKAPIWCSVDLRDGNQALPIPMS
VDEKVEFFRLLVSVGFKEIEVGFPSASATEFAFVRKLIENNLIPDDVSIQ
VLTQSREHLIRRTFEAIKGAKNAIVHLYNSTSRQQRDIVFRMSREEIITI
AVAGTRLVRELKEASGNPGIRFEYSPESFTGTELEYALDVCHAVMDEWGA
SATNKVILNLPSTVELSTPNVYADRIEWFCRHIKNRDAVLLSVHAHNDRG
TAVATSELAVMAGADRIEGALFGNGERCGNMDIIIMALNLMTQGVDPELD
FSNLPHIKQVYSRCTRMTVHPRHPYSGDLVYTAFSGSHQDAISKGMKAHQ
RAADGVWDVPYLPIDPEDVGCNYEAIVRINSQSGKGGIAYVLEKEYGIQI
PKWMQPDFGAVVQEVTDRTGEELSAEQIHELFQKEYIGATEPYLMKKCTI
SWSDEDPDRNDEVATIVSASMVGPQGEFSFRAKGNGPLDAFVRGMVSHTG
IDFHVDEYAEHAIGHSSDAMAIAYIRLSFDDVALVCGSGIDSNISLASIK
AIVSALNRYKKA
>CT0615 leuB, 3-isopropylmalate dehydrogenase
MMYKIVSIPGDGIGPEVVAGALDVLNAVAKKHGFEVSVEEHLFGGASYDV
HGSMLTDETLEACKNCDAVLLGAVGGYKWENLPHDKKPEAALLKIRKELG
LFANLRPARVYDALVASSTLKTEVVQGTDFMVFRELTGGIYFGQPRGYDE
TRGWNTMVYERYEVERIARLAFEYAQKRGNAKVTSIDKANVLEVSQFWRN
IVHEVHQDFPEIELVDMYVDNAAMQVVRNPKQFEVIVTSNLFGDILSDIS
GMITGSLGMLPSASIGSEHALYEPIHGSAPDIAGQNKANPIATIASVAMM
FENSFNRPEVAADIYAAIEGALAAGFRTGDIAAAGEAISSTTEMTAAIVA
RI
>CT1650 leuS, leucyl-tRNA synthetase
MKYDFSALEKKWQSRWADEQTFASSADQEKPKYYVLDMFPYPSGSGLHVG
HLEGYTATDIMARYKRCQGHNVLHPMGWDAFGLPAEQFAIKTGTHPRLTT
EKNVASFRETLKSMGFSYDWSREINTTDPNYFKWTQWIFLKLYEKGLAYI
SEVDVNWCEELKVVLANEEVDEKIADGYTVVRRPLRQWVLKITAYAERLL
KDLDEVDWPENVKQMQRNWIGRSEGMEIDFELRCHRTNLRVYTTRPDTLF
GATYLVISPEHPMAEKLAIAQQLVAVKKYIEQAKLKTELERTGLQKEKTG
VFTGSYAINPANGEALPVWISDFVLTSYGTGAIMSVPAHDSRDWEFAKKF
GLPIREVIKSPHDVQERVFDGKESVCVNSANDEISINGLDFKTAFDRMAA
WLESKGKGKRKVNYKLRDWVFSRQRYWGEPIPIKHYEDGTMRPETNLPLT
LPEVEAYQPTSTGESPLANIESWLYGEDEHGKFRRETNTMPQWAGSCWYY
LRFIDPQNSDALVDPSLEQYWMNVDLYIGGAEHAVLHLLYSRFWHKVLYD
LGVVSTKEPFQRLFNQGMILGEDNEKMSKSRGNVIPADHVLSTYGADALR
LYEMFLGPLDQVKPWNTHGIEGISRFLNKVWRLVWDENTETQKTTEDKPS
EAILKRMHKAIKKVTEDTEQLKFNTAISEMMVLVNELHKAGCYSRETTET
LLVLLSPFAPHITEELWQALGHAESISGAVWPVFDAKLATDDVLTIAVQV
NGKLRGTFEAPAGCTKEEMIESAKKVESVAKFLDGQQIVKEIAVPGKLVN
FAVKPQQ
>CT1090 lgt, prolipoprotein diacylglyceryl transferase
MIDFTTWWQHLPSQMNPVIFSIDGIAIRWYGTMYIVAFAIVYLLSKYRIS
NEKLPFDKTFPGDALTWAMGGVLIGGRIGYILFYGFDWFLQDPVGTLIPI
KFGNGSCAFSGINGMSFHGGLIGVAIALWLFTRTHKVDFLKTVDLFIPAL
PLGYTFGRLGNFINGELYGRVTTSAIGMYFPAAPTVALRHPSQLYEAFFE
GIVLFIILWTIRKKAPWPGYLSGLYLIGYGTVRFFIEFFREPDAQLGFVF
LNFSMGQVLCFLMIAAGIGILVWSKQRAENADVMMGGKR
>CT0457 lig, DNA ligase, NAD-dependent
MDKAKAQQEIGKLRAEIERHNRLYYLEAKPEISDFEFDKLLKRLIMLEKE
FPELVTPDSPSQRVGGGITKEFPTVVHRDPMLSLSNTYSIAEVADFCNRV
EKLVAAEGGGTPEYVAELKYDGVAISLLYRDGLLVCGSTRGDGRQGDEIT
ANLRTIPSIPLRLEMPDLPLFGTALSGEIEVRGEVYMRKDDFERLNEERP
EEERFANPRNATAGTLKLQDSAEVARRRMSFVAYYLKGYDGEAPTHLKRL
EQLKSMGFMTGAAAKLCKGMDEIADFIAEWSEKRWTLPYETDGVVLKLNE
VSLWERLGATAKSPRWAIAYKYPAQQAKTVLQGVVFQVGRLGTITPVAEL
KPTRLAGSIVSRSTLHNFDEIKRLGVRIGDHVMIEKSGEVIPKVVSVVLD
ERPAETAEIEVPSECPVCGTRLERPEGEVNWYCPNEEGCPAQKRGRILHF
ASRNALDIQNLGESLVTQLVDRGLVSDAGDLYSLTQEQLAGLDRMAAKSA
QNVLDALEKSKKQSYARLLFALGIRHVGAATARELAHACPSIDRLREMDE
EALAAVPDIGPVIAASIRDFFAKPWVQAMLQKLAEAGLPMQAGEEKALVN
NNFEGQSVIFTGALERHVRQEAEEMVRERGGRIVSSVSKKTTLVVAGKEA
GSKLEKAIKLGVKVIDEDEFERML
>CT1078 lipA, lipoic acid synthetase
MNSGPGKKPDWLKIKLASGSSFASTRKLLNRHSLHTVCRSAMCPNLHECW
SKGTATFLLLGNVCTRSCRFCAVGTECRPAMPDPEEPSKIAEAVKTMKLR
HAVLTSVNRDDLADGGATHWVETIRAIREVNPGVSLECLIPDFSGNEQSL
DLVMQELPEVLNHNIETVPSRYAAVRPQALYERSLAVIERAKRQFRLATK
SGMMVGMGETEEELEASLHDLRGHGCDMVTIGQYLQPTAAHLPVSRYVTP
EEFERYREIALDAGFRHVQSGPFVRSSYHAEAFEPVEKIS
>CT1498 lnt, apolipoprotein N-acyltransferase
MNGAYRGALSRLLSKPFFLPLLSGLLLGISFPTWPAVHLEPLAWIALVPL
LLSLEHEERFGPFFRKSWMSMLLFCLIALWWVCLATFVGGILTVFVQSLF
SVVPLVVFYYFKKRAGFRSALLALPFIWTGWEWAYMQQDFSLGWLTFGNS
QANLLWMVQYADVTGVWGVSFWLLTFNVLVLLLFMEKESFQVKVGIVMVM
LVMIATPLLYARQVFRNTALDNTSPKVRVALVQPDIDPHEKWDGLGPEET
LSRLYSLTGQSVRGERLELIIWPETAIPFYIRLPENKPYMDSVRRMVMRW
NTPLLTGFPDEVPVFPNSARGEAVAASGAEYAAYNASMLLHPAGGPVQIY
RKMRLVPFGERVPYSEYFPWLERLSFSMSGISSWAKGREATVMHFTSRDG
QPVRMANIICYESIFPGQVSTFVRRGAQFLTLVTNDGWYGTSYGPWQHAA
IDRLRCIENRRAMARCANTGVTLFYDICGRSYAETPWWQQSVLTADVPLE
SRITFYTAHPDLVPHVCLGIAGVLALVAAVRKR
>CT1602 lolC, lipoprotein releasing system
MTFLDHFRFAWVHLRERKRQTSLTVLGVAVGSAMMITTIAVARGSSLNVF
LKLIDVAPHITIGADRVVPEVPDNLVGMMQGRIAFVRKNVTTDRKVVIKN
YSQVIATVSPMREVVDISPYVTSKLLARNKNRFTSCFAKGVVPSLEGEIA
GLKKNLLDPEALTELGWTPNGIILGSMLAEKLKVKYRDTIMLVDKEGHEY
PVMVVGRFRSGFNTKDDKEAYVNLALAQRMESLASNTVTGIGLRIANIGQ
ADALAARIQKLTGYETKSWSESNKNVIDFYNRNGTITLVLVSFVFVVAGL
GVSSVMTTVVLQKVKDIAILRSMGVQRGSITRIFMLEGLIIGATGSLVGS
PVGHLICDLISRIRFAPSSAGVISSDRLLVAETPDAHLIVIGFGILIAVI
SSVGPARRATSYLPVRVLRGEVG
>CT2038 lpcA, phosphoheptose isomerase
MTQQCNCSEGSCGGTRYEELVLERMLYSARLKESVARRDSDVIVAMASMI
ADTFREGGKVLLCGNGGSAADAQHLAAEFTIRYRSSVHRPALPAIALSTD
TSALTAGANDLGFDEVFVRLTEAYGCPGDILIGLSTSGNSASVLKALEFA
RKRGLKTLALLGGDGGAIKPHADLAVVVPHTGSADRIQECHIAVGHVIVE
LVEKMMGYD
>CT1298 lpd-1, dihydrolipoamide dehydrogenase
MQQADTLAAQFDVAVIGSGPGGYEAAIHAARYGLKTCIVEKAVLGGVCVN
WGCIPTKALLRSAEVFDLAKNPETFGVNVGNVSFDLAQAVKRSRNVALKS
SKGVAYLLKKAAVEVLAGEAVLTGGAGVMVTMPDGSVRMLGAKNIIVATG
STPRVIPGLEPDGKKIITSREALILKEVPKSMIVVGGGAIGVEMAWFYAK
AGSKVTIVELMPRMLPAEEAEVSEALKRSFEKAGITVHCGAKLDNVAVSE
SGVSAELVVEGSAPQTLNASCLLVAVGVTGAIDGLGLDAVGVETERGFIR
TDGQCRTSAPGIYAIGDVRGGMLLAHKASAEAAIAVEAIAGKSPEPLSEP
LIPRCVYAQPSVASVGLTEEAAVNAGYQVAVGRSQFAASGKANAYGQLEG
FVKLVFDAATGKMLGGHLIGHDAVELIGELGLACRYGVTAGGLVNTVHAH
PTLSETVREAAFDALQSMG
>CT1961 lpd-2, dihydrolipoamide dehydrogenase
MSTKFDVIIIGGGPGGTPAAMQLASQGKTVLLVEESGKLGGACLFVGCIP
SKIIRHWADEYAVKLKYSAQEALSPEDREAAWNEIMRKMQTILSQRSGAA
MQMLKHLSNLRFVAGHAKFVSNNELVINEKDTGRKEKYTFNKAIIATGSH
SFIPPFKGNGVQDVLTSEVLFSQDKLPESLLIIGGGPIGIELAQMLTKLG
TKCTIIELLDSILYGVVETEFVSIISNQLSSLGVNIYTSSQVQEINKSDG
HFDVTFTDANGSEHKENFEDVLVVTGKVPNIESLNLDSTDIKYDRKGIIV
DEYLETSVKGIYATGDVTHGPKFAHTATYEAHIASANISAGNNQKVDFSK
NTWVLFSEPEIVAAGFTEAQAVQEGYDIITGVYDYKIDAAAQVMNSPFGY
LKYVVNKKNSEIIGVHICMNNASSLAGEASLIIANRLILKNVAETIHPHP
TLTEAFGILAQKMLSKS
>CT2008 lpxA, acyl-(acyl-carrier-protein)--UDP-N- acetylglucosamine O-acyltransferase
MRNIHATAVIGSGAVLGEGVEIGPYTVIEDDVVIGDRTVIGPHVHIADGA
RIGNECRISTGAVLATAPQDLKYAGEKTYLHIGDRTVIRECVTLNRGTKA
SGKTVVGSDNLIMAYVHAGHDCVIGNHVVIANSVQFGGHCHVGDYVVVGG
LAGVHQXVRIGRYAMVGGISRAALDVPPFVMAGGHASFRYEGLNVIGLKR
RGFTSEQLGNIRDAYRIIFQSGLLLSKALEAVRNDLPQTPEVVEILDFFA
SGVYNRKFLKPFNS
>CT0280 lpxB, lipid-A-disaccharide synthase, putative
MTCRRKLFVLAGEVSGDLHAAGPVRELLAARPDTKVFGVGGRKLAELGAE
LLYTTDQMSIMGFVEVLKHAAFLRKAIRELKAAIVREKPDAALLIDYPGM
NLHLAAFLKKQGVPVIYYISPQVWAWKERRVEKIRACVDRLLVIFDFEVE
FYRRHGIDAEFVGNPVVEELAELKFAPKPEFLARMGIDSDARIVGLLPGS
RKQEIEKIFPEMLGAAKHIGEQGKTVFLLGRSPHIDPALYDRYLREAGIE
PLDCTSYEVMRYSDLELVTSGTATLESLCFAVPMVVLYKTSPLNYFIGKR
LVKLHNIALANIVACGLLSEKQAVPELIQHEANAGNISRKVLEILCNDAV
SSSMRRELREARGRLSSDSPSRHVAAVLFEYL
>CT1662 lpxC/fabZ, UDP-3-O-3-hydroxymyristoyl N-acetylglucosamine deacetylase / (3R)-hydroxymyristoyl-(acyl-carrier-protein) dehydratase
MLIHQRTLQNEISLTGIGLHTGHECTITFKPAPVNTGYIFVRTDINDCPE
IPALIDHVVDVLRGTTIGIGDVKVHTTEHVLAALYGLQIDNCRIELSGPE
PPVLDGSSNPFAEALLSAGIAEQDEPKNYLVIDETIEFHNPEKSVDIVAL
PLDGFRMTVMVDYKNPALGSQHSGLFDLDKEFLREFSPCRTFCFLSEVEA
MANQGIIKGADIDNAIVIVDKQLDETEVQTLADKVGVDASHLVLGQNGIL
NNRELRFSNEPARHKLLDLLGDLALLGMPVKAQILAXRPGHASNVEFVKQ
LKKYADRNKLARQYQHEKKAGVIFDINAIQNILPHRYPFLLIDKIVEFKL
DEKIVSIKNVTMNEPFFQGHFPGNPIMPGVLIIEAMAQTGGIMMLNGKEN
IKESVVFFMGIDKARFRKPVLPGDTLVIEAVMTNMRRTVCQFDAKAYVRG
ELVCEASLMATVMEKKN
>CT1360 lpxD, UDP-3-O-3-hydroxymyristoyl glucosamine N-acyltransferase
MKIADIKAFLGRYFDPVELVGTGDIEIVGPAKIEEASTGHVSFVANEKYY
RYIAQTGASLVIVSQKAPLDDASPGTSFLKVADPYTAFVFILQHFSGKRR
IADTGIAASASVAASVRLGENVSLGEHVVIGENCVIGDGTVIGPGTVLMD
GVTVGSGCTIFPLVTIYDGTVIGDRVTIHSGTVVGADGFGFAPQKDGSYI
KIPQMGTVEIGDDVEIGANTTIDRATMGATVIEKGAKIDNLVQIAHNCRI
GGDTVIASQAGISGSVKIGRQCLIGGQAGFAGHLELADRTSVAAKAGISK
SFLEPGLAIRGVPAQPMRDQLRQEAQVRGLGEMKSKLEALEAKLLALQQQ
LGE
>CT1676 lpxK, tetraacyldisaccharide 4'-kinase
MNSQQDLTMHNRSAAILLRPAAALYGMVMSLRNCLYDQGIFKSWHSPIPV
VSVGNITTGGTGKTPLVDWIVKFYEASGIATAIVSRGYGRRTKGVQLVSD
GGRLLLGSRDAGDETAMLAARNPRTIVVVAEKRVEGVQFLMHQFADRLPG
VIVLDDAFQHRKIARDLDIVVVNAGAPEEIDAMLPAGRLREPLRGLRRAH
LIILGKITDDANSATLLQTLRETGKPVIRSKIKPGKLIHVDGSENETNES
VKTLAFAGIGAPEGFLHSLKTAGIKIAATKFFRDHEPYTESAIRSIIGEA
KRQGLVPVTTEKDWFRIADEPELAEMLRQVGCRYLTITPEFPDGTQELER
QLLDVLKR
>CT1810 lspA, signal peptidase II
MALFYLLAIAAALLDRVTKLLAIHYLRDGAQSIVIIPDWLKLTYAENLGI
AFSVRFLPPTGLLFLTLAISAGVVWYVHKSNNRSPLFLTAFGLILGGGIG
NLIDRVMLGHVVDFIYFDLYHGALFGIPLDLWPIFNVADSCITIGACMIV
LFHEKIFTRKHA
>CT1371 lysA, diaminopimelate decarboxylase
MLDSHFFSFSDGILCCESVALDELARQFGTPLFVTSRQSLVSQYRSFEEA
FASLPHFTCYSVKANFNLAVIRTLAEEGCGCDVNSGGELYRALTAGVPAN
KIIMAGVGKSEAEIKYGLTSGVMMIKAESISELKAINSVAEHLGKVAQVG
VRINPNVTAETHPYITTGDSKEKFGIDEAGLGEVFELFKTLPNLELHGLD
MHIGSQIFDPEYYVAATQKLLEVLESARHLGFDIKWLDLGGGFPVTYDPQ
KPAPPITKFAEKLIPMLQDKGVTVIFEPGRFIAANASVIVTKILYRKKNQ
IGKEFFIVDAGMTELIRPALYQSHHEVLSVKQHDRSVVADVVGPICESSD
FFARHRTIDDAPEGELLAVLSSGAYGAVMSSNYNGRLRPAEVMVDGDEVT
LTRRRETYEQLVQNEL
>CT0095 lysC, aspartokinase
MVVMKFGGTSVGSAAAMRQVIANVAEKKKSSAPLVVLSACSGITNKLIQI
ADAAGSGCLEEAQQLVGEVRQFHLDLIGELIESEELQQEVIAKIEVYLTR
LERLTEGIEIVGELTERSKDRFCSFGELLSTSVFAAALNEAGVSCKWIDV
RTVMITDDRFGFARPLAEICQKNTSEIIKPLLDAGTVVVTQGYIGATESG
RTTTLGRGGSDLSAALFGAWLHSESIEIWTDVDGVMTTDPRIVPEAKSIR
VMTFSEAAELAYLGAKVLHPDTIAPAVQKNIPVYVLNTWHPDSKGTLITN
DPELLAGKSHGGLVKSIAVKKAQAILNIRSNRMFGRHGFMSELFDVFERF
GISVEMISTSEVSVSLTVDDAVVSEPLIKALGALGEVEIEHKVATVSVVG
DNLRMSKGVAGRIFNSLRNVNLRMISQGASEINVGVVVDESDVQAAVASL
HCEFFAESQCDAIFEKPAGS
>CT1387 lysS, lysyl-tRNA synthetase
MSNAPEQKNPQNDPSPAVSLNDQMKRRFEERTHLAEAGINPYPYKFDVTT
TSKAIIDSFSDENPADVSVAGRIMAIRRMGKASFLHIQDSEGRIQIYLKK
DDVGEASYNTFKLLDIGDIVGVSGFTFKTKTGEISVHARQFELLAKSLRP
IPIAKEKEVDGQKVIYDAFSDRELRYRQRYVDLIVNPEVRGTFIKRTKIV
ALMRNYFASNGWLEVETPILQPIYGGAAARPFTTHHNALDMQLYLRIANE
LYLKRLIVGGFDGVFEFAKDFRNEGIDRFHNPEFTQVELYVAYKDYIWMM
ELVEDLLHKACVEVNGKDSTMFLGNEINLKPPFRRLTIADSIREYTGMEI
RGKSEAQLRDIAKDLGLELDPKISSGKIIDEIFGEFVEPKLIQPTFITDY
PEEMSPLAKKHRSEPGLVERFELIVGGKEVCNSFSELNDPVIQRERLEEQ
ARLRQRGDDEAMIVDEDFLRALEYGMPPCAGLGIGIDRMVMLLTGQDSIR
DVIFFPHMKPE
>CT0283 lytB, penicillin tolerance protein LytB
MKINLDRTSSGFCIGVQGTIHVAEEKLAQSGELYCLGDVVHNEVEVKRLE
ALGMETIDIPAFEELRNAEVLIRAHGEPPSTYETARKNNLAITDTTCPVV
AKLQRTAKMLHQLGYQVVIYGKKVHPEVIGINGQCDDEGVVIKHPDLSDP
EEIAPLDLSRKTALISQTTMDVPGFYELKRNLEKLFAEHGHRNPGTQSGE
WMAVRDIDITAEKTGALAMPKLVFKDTICRQVSSRNGKLRDFALANDCIV
FAAGRKSSNGQVLYSICKDANPHSYFIEDVDEIRPEWFVGENGKPVESVG
ICGATSTPMWLLEKVANYIDKTFGDGSSNPNA
>CT0974 maf, maf protein
MTSHRKLILASQSPRRRELLAMTGIPFETASVEIDETFDPVLTAEENVME
ISKQKAEAVLRSISADEACAVVLGSDTTVVLDGKPLGKPGDFDHAFDMLS
TLQGRSHEVLTGFCILHNGKAITDYARTIVEIGPMTPREITRYIEVMKPF
DKAGSYGIQDPLLACFVTGIDGCYYNVVGLPVSKVYAALKPLFPAEG
>CT2168 map, methionine aminopeptidase
MITIKSEREIELMREAGRLVARVLDMLENEIRPGISTKRLDELAEQFIRD
HNAVPSFLNYVPKGESGVTPYPATLCVSINEEVVHGVPSTKRIIHEGEIV
SVDCGVYKSGYHGDSARTYIIGEVDPAVRQLVDVTRECLDLGIEQAVEGN
RLHDISAAIEKHARSFGYSVIENMVGHGIGSELHEEPAVPNYGRPHTGVK
LRSGMTLAIEPMIALGRSRRAVSKRGAWAAVTEDGSYSAHFEHTIAIGKA
QAEILTK
>CT2086 mazG, mazG protein
MKHEANPSIETLKESVLKHNAVTPAEHFERVVNLVRVLRSECPWDRKQTP
ESLAHLLLEESYELVHAIDTGDDPELKKELGDLFLHVCFQVLLADEAKKF
SFIDVFEALCHKLISRHPHVFGDVKAETEQDVLGNWENLKMKEGRTSLLD
GVPKAMSELLRAYRVQKKVAGIGFDWPSDEGVLDKLTEEIGELRNAASKQ
EREEEFGDLLFTIVNYSRFIDTNPEDALRKATNKFMDRFRKVEASVLASG
KSWKEFSAEELNGLWNEAKKAK
>CT1507 mdh, malate dehydrogenase
MKITVIGAGNVGATTAFRLAEKQLARELVLLDVVEGIPQGKALDMYESGP
VGLFDTKVTGSNDYADTANSDIVVITAGLPRKPGMTREDLLSMNAGIVRE
VTGRIMEHSKNPIIVVVSNPLDIMTHVAWQKSGLPKERVIGMAGVLDSAR
FRSFIAMELGVSMQDVTACVLGGHGDAMVPVVKYTTVAGIPVADLISAER
IAELVERTRTGGAEIVNHLKQGSAFYAPATSVVEMVESIVLDRKRVLTCA
VSLDGQYGIDGTFVGVPVKLGKNGVEHIYEIKLDQSDLDLLQKSAKIVDE
NCKMLDASQG
>CT1511 menA, 1,4-dihydroxy-2-naphthoate octaprenyltransferase
MSVSSSSQLSAFQAWMLAIRPKTLPAGAMPVVIGAALAAASGVFKPLPAL
VALICALGIQIATNFINEIYDFRKGADTAERLGPTRTVAAGIITEQTMIR
VSIVLGVSVFVLGLYLVAIGGWPILLIGVLSLLFAWAYTGGPFPIAYSGL
GDVFVFIFFGLVAVGGTYYVQALSLPMEVLVAAAAPGAFSVCILLVNNIR
DIDTDRKVGKMTLPARIGAPAARALYVALVVLAYLVPFYMISTGYSLWCL
LSLLSIPLAIGMVRTLYASEGQALNAVLAGTGKVLTVHGLLFSLGLVIPN
IISIFRP
>CT1846 menB, naphthoate synthase
MPVEPGQRRFSSTTIEISDMSTVNWITAGEYSDILYHKTEEGIAKITINR
PERRNAFRPQTVDQMIEALQDARNDSQIGVIILTGAGDLAFCSGGDQKIR
GNAGYADEKGVNKLNVLDFQRDIRTCPKPIIAMVAGYAIGGGHVLHMLCD
LTIAAENARFGQTGPRVGSFDGGWGASYMARLVGQKKAREIWYLCRQYNA
QEALDMGLVNTVVPLEKLEEETIQWCREILANSPLAIRCLKAALNADCDG
QAGLQELAGNATLLYYMSEEGQEGRNAFVEKRKPDFSKFPKRP
>CT1847 menC, o-succinylbenzoate-CoA synthase
MKPLHADICRYEMDFTAPVTVRGVLLARRQGLLLRLKSEGVTAYGEVAPL
IGLHTESLDEALQALATFIPELSRLDWNASDGRQRLLDEAALPPSVTTGI
EMALINLEATERSSLPSFTDEFPPASKIPVNALLAGDPQAVLNRAAKRYA
EGFRAFKLKVRKGELDGAVACIRALHEAFGDKAELRLDANQSLEFDEAVA
FGKALPPGCVAYIEEPLTDAALISDFHAATGLPSALDESLWQRPELLDEI
GPDPLGALVLKPNCIGGIAKSLDLAAKAHRMGLQAVYSSAFESSVSLGLY
ALMAAVSSPAPAASGLDTASFLARDLTATPFATPDGFADPAAAWRDSLRV
RPDMIETVKSWSL
>CT1839 menD, 2-succinyl-6-hydroxy-2, 4-cyclohexadiene-1-carboxylate synthase
MGRGRSENRGHSCHNAADLMNSKQITTLWCAVIVEELIRQEAGFFCISPG
SRSTPLTLAVASNPKARFRMFPDERSAGFYALGYARATGMPAVLVCTSGT
AVANYFPAVVEASADAQPMLVLSADRPFELLECGANQAIRQQNIFGSYTR
WSFELPEPGIATPLASLLSTVDHAVRKSLSLPAGPVHLNLPFREPLEPEA
PDPGHPWAAPLETWQASGEPWSRFARPLHEPSAESIVTLRELLAQAERPL
FVAGSMSNAADGEAVAALAESLGVPLFADLTSGIRLSSDCTPWQLAFQNE
AFVERFQPDVVIHFGGHVIGKQPAMALRKQPPLHYVVVREHPGRFDPDHN
VTLTLEASPAAVASALEGCREPVPGIRCRDAFSAASGIIDKMACVPELAV
SEISAPRIVSSLAGDGHALFVANSMPARDMDLYAAPVAQKPLQVALNRGV
SGIDGIISTAAGFSAGLGKPTTLLIGDISFLHDLNALCLLNHPWNPLIVI
VLNNHGGSIFSFLPIASQTDRLDECFATPQNFSIESAARTFGIDYACPET
NGDFTQLYAEALTTKKSLIIEIRSDREKNLLLHRSLKARLDPVFEKADCS
R
>CT1848 menE, o-succinylbenzoic acid--CoA ligase
MELVTQAAQTFGDQPALITDERRWSFADLDGDTARIATAFEASSIRRGDI
VALVAPNSPALVLSLMALMRMGAVAAPVNHRFPANHIEGVLARLNPAMTL
DAAKLDAFVADAIARTGATFTAATEMERPVSVIHTSASSGKPKAAVHSLS
NHYHSAMGSAQNLPFGPGDCWLLSLPLYHVGGYSMLFKCLLGGGALAVPS
PDAALAESLTHFPVTHLSLVPTQLYRMLRADGGPERLRSLRALLLGGSAV
SAPLLREAICERVPLYLTYGSTEMSTQVTTSPTPVTKARGDSGVVLPYRE
VAISVDGEILVKGECLFMGYLDNGELREARDKNGWFHTGDMGELSGDARL
TVLGRKDNMFISGGENIHPEEIEKALTSIVGIEEAVVVPAPDAEYGMRPV
AWIKARSDSPDDATIIASLKSTIGKLKTPVAFHRIQEWQTIPGSAKIDRS
WYRKLAEK
>CT1838 menF, menaquinone-specific isochorismate synthase
MSGTCKTMCLTRPENEPRNPMQADDSIPMMQKALQLLEASVKAAMRQRDG
SPAVSGQPMLERFSAPLGEVDATRWLSAQRLFPRLFWMNREKSEWIAGIG
EADRIEITESGPNDRSFLVLEEAMTRKNPHARYIGGFCFNNLQKQNKLWS
AFSPGLFILPLVSIEYRDGQTLMTCSLWLEPGNDRQKGLEQLLAALSAVS
AGDAPETAGIPEMTQVSYCPDKAQWIENCETILRNFDEGKLDKVILARQT
ELSFAGKVPAIRFLLDYPFPENTAYRFYFEPVEGHAFISFTPERLYRRDG
DMLETEALAGTVTKEALKADDSIASELLLNSEKDIREHRFVKDTIYRELQ
PVCSDIDMQEKVGVLQLNRLAHLLAKCKARLLPEFSNDSTVLRQLHPTPA
VGGVPREKAMSLILSIEPFCRGWYAAPVGWLNRDAAEFAVGIRSALVNDD
RVYLYSGAGLVRGSNPESEWEEVDQKIGDILAITQQTS
>CT0462 menG, ubiquinone/menaquinone biosynthesis methyltransferase
MMSSSKETAKSLIQTKSRSSIRNMFDEVAPTYDFLNHLLSLGIDNYWRVV
AAKKARKQLEGEREPKILDVATGTGDLAASMAKIPGAKVTGYDLSPEMLA
IARKKYPNIEFLEGFAEKMPFDDRSFHVVSAGFGVRNFEDLAQGMKEFHR
VLKPGGCAYIIEPMIPRNAVMKKLYLIYFKNVLPKIAGMFSKSTFAYDYL
PNSVEQFPQAEAFTKILKNAGFKKAEYFPMTFETSILYVAMK
>CT1845 menH, thioesterase, menaquinone synthesis gene
MTTISLHLTTVGDPALPKIVFLHGFLGSGSDWLSFARKLENRFCSILVDL
PGHGEAGIPADGDPKLFFMQTVEALKSNIRRLRAEPCVLVGYSMGGRIGL
ALALLYPELFSKAIIVSSSPGLQTDEKRASRRKSDEGIARKIERNFEGFI
GFWYDQPLFSTLKSHSLFREVEAQRKQGTPQNLARALRLLGTGNQPSFWD
KLPGNRLPMLFCVGEKDAKYVDIAKQVVELCPSSSLELFEHCGHTLHIEE
PERFLASVERFIETHPHNSISHDDL
>CT0605 met2, homoserine O-acetyltransferase
MTRMDHRSIISDTTQYFESNEPLQLELGGELPGVRVAYRTWGTLNAEKSN
VILVCHALTGNADADSWWCGMFGEGRAFDETRDFIVCSNVLGSCYGTTGP
MSVNPLSGRHYGPDFPRITIRDMVNVQRLLLRSLGIDRIRLIVGASLGGM
QVLEWGAMYPEMAGALMPMGVSGRHSAWCIAQSEAQRQAIAADAEWQDGW
YDPEVQPRKGLAAARMMAMCTYRCFENYQQRFGRKQREDGLFEAESYVRH
QGDKLVGRFDANTYITLTRAMDMHDLGRGRDSYEAALGALKMPVEILSID
SDVLYPRQEQEELARLIPGSRLLFLDEPYGHDAFLIDTETVSRMVCEFKR
QLIVDN
>CT1368 metF, 5,10-methylenetetrahydrofolate reductase
MLVKEIFDARTEPVFSLEFFPPKKQDDWDKLFETISNLFPLDPSYVSVTY
GAGGSTRERTHNLVTRIQQETGLTVVSHLTCIGAEKSEIESVLQNYREHG
ITNVLALRGDKPADITTLEEATKDFPHAIDLVKFIKENFPEMGIGVAGFP
EGHPETPNRMKELEFLKEKVDAGADYIVTQLFFDNHDYFDYVERCELAGI
TVPIIPGIMPIMSKKGMIRMCELALGSRIPSKLLRKVLEAADDKEVAEIG
VEWATNQVQELLDHKVKGVHFYTLNLSEATLKIFRNLKRG
>CT1857 metH, 5-methyltetrahydrofolate--homocysteinemethyltran sferase
MNDNLYSLIEQRILVLDGAMGTMIQRHGLDEQDYRGERFASHDHPLKGNN
DLLVITRPDIIRSIHCDFLDAGADIIETCTFNANPISQSDYQLQDLTREL
NVAAAKIARSAADEFTAKTPDKPRFVAGSIGPTNKTLSLSPDVNNPGFRA
VTFQEMVDNYTAQLEGLHEGGVDLLLVETVFDTLNCKAALYAIEEYAVKT
GWQVPVMVSGTVVDASGRTLSGQTTEAFWISISHMPSLLSVGLNCALGSK
QMRPFIEALSNIAESYVSVYPNAGLPNEFGEYDDSPEYMAAQIAGFAESG
FVNIVGGCCGTTPTHIRAIAEAVKTLPPRKRPANKHVLRLSGLEPLVVDE
TTGFINVGERTNVTGSRKFARLIKEANYDEALSIARQQVENGAQVIDVNL
DEGMLDSEKVIVEFLNLIASEPEIAKVPVMIDSSKWSVIENGLRCTQGKS
IVNSISLKEGEELFKERARKIMQYGAAAVVMAFDEQGQADSLHRRIEICS
RAYKILTEEVGFPPEDIIFDPNVLTVATGIDEHNNYALDFIESVRWIKQN
LPHAKVSGGISNVSFSFRGNEPVREAMHTAFLYHAIHAGLDMGIVNAAQL
GIYEEIDPELLVYVEDVLLNRRDDATERLVAFAETIRDGGEKAEAKNAEW
RNAPVEERLKHALVKGIVDYIDEDTEEARQLYPSPLEVIEGPLMNGMNHV
GDLFAEGKMFLPQVVKSARVMKRSVAALIPYIEEEKSKNCDTSAKAKVLL
ATVKGDVHDIGKNIVSVVLACNNFDVIDIGVMMPCDKILEALAEHKPDVL
GLSGLITPSLEEMAHVAKEMERLGMNIPLIIGGATTSKVHTAVKLAPCYP
SGAVVHVLDASRSVPVVSNLCNPAQRDSYIAALKDEQEAMRKSHAERMAA
KKYVSLDAARDNRLTIDWEAETIDKPAQTGVTVLEDVTVGALRPYIDWAP
FFWSWELHGVYPQILEDEKVGEEATKLFNDATALLDRIDSEKLLGIKGVA
GIFPANSIGDDIFVYADDERSIIRTVLHTLRQQGEKHGEANLALADFVAP
RESGVNDWIGCFTVTAGLGIQNLLDEFTAENDDYHRIMTQALADRLAEAF
AEMLHEKVRRELWGYAPGEILGNEELIAEKYRGIRPAPGYPACPDHTEKA
IIFDLLNAEAATGVTLTETFAMNPAASVCGLYFANPASKYFVLGKIGKDQ
VEDYANRKGLEVAEAEKWLAPSLNYDPA
>CT0722 metK, S-adenosylmethionine synthetase
MLDEFIKQDPNSRVACETFVTTGQVIVGGEVTSKGIVDVQTIARKTITEI
GYTKGEYMFDANSCGILSALHSQSPDINRGVDRKEEIADEFDRVGAGDQG
MMFGYACTETPELMPAAIQYAQELVRLLAEIRKEGKIMTYLRPDAKSQVT
LEYDGNDNVLRVEAVVVSTQHDPEPAGMSEAEFQAVIKNDVIENVIKKVI
PAKLIDENTKFHINPTGRFEIGGPHGDTGLTGRKIIVDTYGGAAPHGGGA
FSGKDPSKVDRSAAYAARHVAKNIVAAGLADKCTVQVSYAIGVARPISIY
INTHGTSKHGLSDEQIQEKAEAIFDLRPLAIIRRFNLDRPHGWCYRDTAA
YGHFGREQFPWEKTEKVAELKAALGL
>CT0969 metS, methionyl-tRNA synthetase
MTHIPKRTLVTTALPYANGPVHLGHLAGVYLPADIYVRYKRLCGHDVIHI
GGSDEHGVPITITADKEGISPQEVVDRYHTMNAEAFAKCGISFDYYGRTS
GPVHHQTAREFFLEIEKKGIFVKKTEKQFFDPKAGRFLSDRYITGTCPVC
KTPGANGDQCEQCGTHLSPTELIDPKSKLSDATPELRETLHWYFPLGRYQ
KQLEAFVERHTGDWRSNVVNYSRTWLNQGLADRAITRDLAWGISLPLDSE
EAKGKVLYVWFDAVLGYISFTKEWAEKQGDAELWRRYWQDPETRIINFIG
KDNVVFHTLMFPAILMAWNEGRSEGRYELADNVPASEFMNFEGRKFSKSR
NYAVYLGEFLERFPADTLRYSIAMNYPENKDTDFSWSDFQNRTNGELADT
LGNFIKRSIDFTNSRFGGQVPADIDLEAWDSLGIDWLASFGKLEAAYDGF
HFREATAQTMEIARFANRFLTESEPWKVIKVDPEAAGRTMAVSLNLCHTL
ALLFWPIVPETANRIWKMLGFEGTIDELVEPGNPVWRQALEPGLKKGHKL
LGSSEILFSKIEDKDIEPEMKKIEALLAEAEQREAAKQPVPMTFKPEITF
DDFQKIDLRVAKVVACEPVKKANKLLKLQLQVGSEQRQVLSGIAQYFTPE
QMVGKNVVLVANLADRTMRGELSQGMILTVEGADGRLFLLEPQGEGINGN
SVS
>CT0653 mfd, transcription-repair coupling factor
MSAPRNASLEFSSFARNPYRRRFMKFSANPTPQFASIIKRPVDLVLDSLA
ASAPYRALREALSVATTGEHRAVDICGVRGSLAPFIAAKLFRDFDAPVVL
FCNADEQELYDNDLPLLLGGKPFRNTADELSPALGMLSRRETLVVLAAFE
DLSIEVCGTESSNERLFSLAAGSDAGYDALMQFLKSNGFEKREFVENEGE
FSVRGSIIDVFCYGSREPVRIEFFGDTVSSLRNFDTDSQLSTSAIESVDL
FGSFTQESAAESKPAGILDYLPDTAIVIIDDATAMQGSDHRALLEAALPR
FRHVVIQRINKQGIDFNSSEQQRLQGNFRLLAGRLQEEAERGLKPLFACA
SRREIEELAEFIADENTSKSPDAIEWIPANLHSGFAFGELNLYTESDIFG
KFHTHKAHRKRKVRGISLKELQRLKVGDYVVHEDYGIGVFRSLETIQVGD
SEQECVLVEYEGGDQLYVNVQNINLLSKYTASEGSLPNLSKLGSSKWSAK
KERVRKKLRDIAAKLIRVYAKRKMTPGFAFGPDSIFQREFEASFMFEETP
DQLKAIQEVKKDMQSPSPMDRLICGDAGFGKTEIAMRAAFKAVENKKQVA
ILTPTTILTHQHGESFARRFANFPVNIAVLSRFVPRKEQKETIERIASGA
MDIVIGTHRLVSKDVVFKDLGLLVIDEEQHFGVEVKEKLRHQFPGVDTLT
MSATPIPRTMQFSMLGARDISIVSTPPKNRQPVETIITEFDPETVRAAIK
REIQREGQVFFLHNRITSLEETALKLRELVPYARMATAHGQMPAKELENV
MMDFMQQELDVLISTSIIGSGLDISNANTIIINRADMFGLSDLYQLRGRV
GRSERKAYCYLITPPLHTLKREAVQRIAVIETFTELGSGINVALRDLDIR
GAGNLLGAEQSGFIHEIGFDLYQKMIEETVAELKLTEFNHLFSDSEKAAL
KPQRPCDMIFFFDALLPDYYITATQERFSCYDRISKAADNTALQNIAKEL
EDRFGAMPAEVQNLLALARLKHLGSSLGLEKIDLQQSSATIFLPSDEDKE
FYDSAFFQNLIVALQDGSIKEYHPQFKHEKKMKLVFRHPETADTAPLALI
ARYEALLKQIAER
>CT0630 mgt, magnesium-transporting ATPase, E1-E2 family
MFRTLYNKISSTSIREEAAQSHVRSEDEQFLLKLCNAPVDEALRLMESRV
DGLDSQEASKRLSKYGKNEISLATRPSFLQDILHRLSSPLVIQLVLIAVV
SGVTGDMTSSAIVGMMILLSVGTSYVLERRSGNAVEALGKRVQSRAHVIR
NGLEAEVPLSELVPGDIVQLQAGSVIPADLRLISAKDFFVGQAALTGETM
PVEKSADAGETGNLGILELRNACFQGSSVSSGSARAVVVNTGSRTFFGAI
AERLNQRRDETDFDKGIRSFTWLMIRFMVVMVSTVFLIVGLTKGNWLESL
FFSLSVAVGLTPEMLPMIVTVNLAKGALSMARKKVIVKQLSSIQNFGAID
ILCTDKTGTLTQDHVLLEMAVDVMGEQSDNVLRYAYLNSFFQTGLRNLLD
RAVLDHQEFAVDGNCKLIDELPFDFQRRRMSVVVDYEGDHVLISKGAVEE
IFACCDRYQIDDEIYPLIGMIRDDLFEEVAALNNNGYRVLAIAYNEFPPD
RKRFTHDDEKNLILLGYIAFLDPPKDSTAQALVKLREAGVKVKILTGDNA
LVTRKICKDVGMAVNRVVTGDELARLSPDLFGKAVEEADVLAKLSPLQKE
EVVQSLRKQGHVIGFMGDGINDAPALRAADVGISVDSAVDVAKESADIVL
LEKSLMVLDDGILEGRRVFTNIIKYIRMGASSNFGNMFSVVGASYLLPFL
PMQPIQILLNNLLYDFSQTGIPTDNVDEEQVRSPRKWDIGNIKWFMIVIG
PISSIFDYATFALMWFFFNSKLFIDPAATGAAKAHAVQLFQTGWFVESLL
TQALIVHIIRTRKIPFLESHASLPMLLTTLAVMAIAVWLPYSPFASLFGM
VPLPLAYFGWIALFLLSYAALTHKIKTWFFKRFGGN
>CT1854 mgtE, Mg2+ transporter MgtE
MIGTPILPEIRELIEQRNFSALQRLFDDWLPVDLAELISDLPENEQAILF
RLLPKDVATETFEYLDFDAQQNLLNALTQKDVTHILNSMSADDRTALLEE
LPGPVAQELIKLLSFKEFKIAKTLLAYAEDSVGRLMSPDFLSVKKDWEIG
KVLEYIRTYGHESETLNVIYVVDEHGKLVGEMLARDLLLSALDKKVEEII
DEDKLITLTATQDQKDALETFKRYDRVALPVVDSNGYLIGIVTVDDMLDV
AEEEETENIQKFGGIEALEEPYIDVPLLELIKKRAGWLVILFLSEMLTAS
AMSYFEGELAKAIVLATFIPLVISSGGNSGSQAATLIIRSLSLGEISIHD
WWKVMRREILSGLMLGSILGTIGIIRVVLWATILGHLTKVWVLIGITVGC
SLVGIVLWGTLTGSMLPLLLKRLGFDPATSSAPFVATLVDVTGIVIYFTV
ASLVLKGVLL
>CT0973 miaA, tRNA delta(2)-isopentenylpyrophosphate transferase
MNTKPVLVILGPTASGKTELAFRIARQTGGEIISADSRQIYRGMDIGTAK
PPRWMLDEVKHHFIDKKEIGEPFSAGDFAEQAAEKIRELHQRGITPVVAG
GSTLYLEGLLKGFAELPPADPEIRAQLTRELERHGAEALYRRLEALDPEQ
AKTLDPTKTQRLIRSLEIIEISGTTVTALQSKTPGPPTGINFTVIGLDLP
RELLYERINQRTSAMIQAGLEAEARYLFDKFRDEWRSKNLNALATVGYRE
LFEHFEELHDLDTAVSLIAQHTRNYAKRQLTFFRNRLDVEWVKAPLDEAE
IEALVEFFSTRQDDSVPHSPIAIAKKQNA
>CT1333 moaA, molybdenum cofactor biosynthesis protein A
MNRESAKQLVLTDRFGRTVDYVRIAVTSACNLRCTYCLKEDAPTQTQQLD
VVETSKLIALLAGMGVRKIRFTGGEPLLHPSIPELVRIAKATPGIDTVCI
TTNGVLLDRQLDALVEAGLDGVNLSLDTLDREKFTSITRRDRFEQVSKAL
DRLLATPSLTVKLNTLMLRGINNDEIPAFVELTREHDLTVRFMELQPFDD
HQIWRTGRFMGAERIRERLADAYPELEAITGHSTEHYSFSLPGHRGSIAI
IPAFSRNFCSSCSKLRITADARLISCLYHHESIDLAPALKGEMNEVELKK
RIIEAVQQKPKDGLKSSHDTAASSMSQIGG
>CT1330 moaCB, molybdenum cofactor biosynthesis protein CB
MEFTHLDENGMVRMADVSGKPPTRRKACASGRIVMLPETIALLRRKELPK
GNVLAAAKIAGIQAAKQTSTLIPLCHQLNLSWIDIEFEIGDDSIGIAATV
ITRESTGVEMEALAAVSVAALTIYDMCKAVDKTMEISAIRLDHKTGGKSS
AAEYHPRTAILVMSDSIAAGTATDHSGAILREGLQKAGCAVEALTITPDE
PVEIAATVEAWIGEGIEFIVTSGGTGLGPRDLAIDTLAPKFTRRLPGVEQ
ELLRWGQTKTRTAMLSRLAAGVIGNTVVVCLPGSTSAAKDALEVLVPAIF
HAFPMLKGDGHA
>CT1332 mobBA, molybdopterin-guanine dinucleotide biosynthesis protein BA
MGKRMLSMFHPFEIALCGLSGSGKTTLLEKLIRRFSTDGFEVAAFKHGCH
RFDIDREGKDSDRFRRAGAVPVLIVDREKEALISSGTGRLDIAGLTLNAD
LLFIEGFKELPVPKVLLIDERREILPSLESGAIPEVLALAHDGDAEDMER
FGLPLFHRDDVARISDFSAEFFRRAAQSVPLNGLVLAGGRSLRMGRDKAQ
LRYHEASQLDRTAALLGGVCNEVFISCRSDQLDQYSDARLPGIADSYLDL
GPLGGLLSAQRHAPGAAWLVAACDLPFIDEAVIAALRAGRHPFRFGTAFA
GSDGRPEPLLAIYEPKSRRRLLERHASGNDSLRAFLMNSRVQFIEPNDAS
KLRNVNDPAAMDDALRAISKGGQ
>CT0908 mod, type III restriction system methylase
MKQLTANDPETRSHDLVAENIARLKALFPELVTEGPDGAAVNMDVLKQLV
GDKTVTDCDEKYGLNWHGKRRARQLALTPSTGTLRPCPEDSVDWDTTQNL
MIEGDNLEVLKLLQKSYAGKIKLIYIDPPYNTGKDFVYPDDFKDNIRNYL
ELTGQVEGGRKISSNTEASGRFHTDWLNMMYPRLKIARQLLSPTGVIAVH
IDEHELEALVIVLRDLFGEENELGVTIWDKRNPKGDSRGIAYQHESLVLF
ARDAEELFARSPVRRAKRNAERMLNAASRFVTECGSIAEARAAYRSWVKS
QSTLSGGESMYDKLSETGRVYRLVSMAWPNKKRAPADYFVPLKHPITGKD
CPIPERGWRNPPATMKKLLEDGLIEFGPDEIMQPQRIYFLDENMYENVPS
IVPFGGSDDELLKELSVPFEQPKPVDFSVQVISWCSSKDEIVMDFFAGSG
TTGHAVMAQNAADGGNRRYILVQLPEPLDPENKDQKVAAEFCDKLGKPRT
IAELTKERLRRAAKKIKEENPLFAGDLGFRVFKLDSSNIRAWEPDRDNLD
QTLFDHVEHLKEGRTEQDILYELLLKLGLDLCVPIETRTIAGKAVHSIGG
GVLLACLATRITREEVEPLAQGIVAWHQALAPAGDTTCVFRDSAFADDVA
KTNLAAILEQHGIANVRSL
>CT1544 modA-2, molybdenum ABC transporter, periplasmic molybdenum-binding protein
MFKHLRTLFFAFLLALTTPQAFAGEIRLSAGAGMKEVLDVLSGNFAKAHP
GTTFIKNYAAAGALAMQIENGAPADVYISADSKWVEYLMAKKLLAPAWIS
PFAWNEIVVVGNPALKVSSMNDLTKLGKIAMGNPNSAPAGEMAMEAIRSA
GLENPLTGKLVMTRDMPQTMMYAETGTVDAAFVHLTEALTAKKAKILFTV
PHRLYTRTPFTMALTSTGAANSEAKLFFNYLRSTKAKRILERYGYLMK
>CT0451 modB, molybdenum ABC transporter, permease protein
MHLTPEDFSAIALSMKVAVTATALSLPFGFATAWVMVFKRFRGKVLLEVL
VNLPLTLPPVVIGYFLLLMLGRNGWIGQALSSVGIELVFTWKAAVLASAT
VGFPLLVRSIRLGMESIDQQLIDASRSLGAPWYDTLATVILPLSFRGMVA
GSSLMFARSLGEFGATIIVAGNIPGLTQTIPLAIYDYASSPAGTSMALSL
CLVSIALSVSVLFLHELIGKKLEHGGKA
>CT0450 modC, molybdenum ABC transporter, ATP-binding protein
MKLCIDIEKRLGDFSLKLKTEISGERVGIFGASGSGKSTLVHLIAGLIKP
YAGEIYLDDTCLFSSRKKIDLSPEKRRISIVFQQATLFPHLGVKSNLLYG
YKRCRPSERRIRPEAIIEVLNLGHLVGRGVGKLSGGEKQRVALGRAVLAN
PRLLVMDEPLSALDDSLRYQIIPYLRSVSAEFGIPCLFISHSLTEMQLMT
ERVLVVRDGRLAGISTPDELARTGMATNPRGYLNLLRLDGATQKDGLFHY
PWGNGHLIISEGRTGEESLFELSSRDIILFKQHPEAISARNLLECRVAEL
FDSDGRKGVVLESGGKTLVAEIVRSAAEELGIAPGTMLFAAIKASAFRRV
S
>CT0453 modD, molybdenum transport system protein modD
MYSLTPDSEIERYIEEDVPYGDLTTTLLGIGDEPGEITFTTRDRTVLCCT
EEAGRVLAKCGAEVLSMLPSGTIAAPGTEILRASGPAAALHAGWKVAMNL
LEYASGISTRTATIVERARAANPGINVVTTRKSFPGTKKIAMKAIMAGGA
FPHRLGLSESILVFSQHTVFLGGLDPFLDKLSDLKKLAPEQKILVEADSA
VEALRIAAAGADVVQVDKMPVAELAALVSEIRSRFPGVAVSAAGGINGEN
AAAYAATGIDIIVLSSVYFGKPADISTSILPVKSN
>CT1543 modE, molybdenum transport protein ModE
MSKTKNPIGIEGSIWFQKSQSRFLGGDRIALLEKIDELGSINSAAKAVGI
SYKTAWHLVNMMNNLSEKPLVDRMTGGKGGGGTVLTREGRQVIEKYRIVQ
EEHRKFVENLEERLGDTGNLYQFLRRISMRISARNTFSGVITELTRDAVN
AEIIITLNGGQQIVSTITNGAIDNLGLKKGMSAYAIVKSSSVMVGRDLQD
KKLSARNIICGTVQRVIEDSVNSEIDIEIGGGNSISAIITETSTSRLNLK
EGEQACAIFKASDVIIGVN
>CT1331 moeA, molybdopterin biosynthesis protein MoeA
MMTVVHEAHESKQKQAKTMTTVHEAHEIIATTTPLIEATVSVPLLQLQGR
VLAEDVRAGFAMPRFTNAAMDGFAVRFGEIADASDGAPVTLPVSQELAAG
ALNVAPLETGTCCRIMTGAPIPEGADTVVPFEETSGFGSDAVEFYKAPKK
GANIRHAGEEMQPGALLAAAGTRITPAEIGLFATFGIAVALVRRQPRVSI
ITVGDELRMPGEQIEPLAIYNSNLALLASCVEGAGAKVVGMRQLRDNRQA
IREALALAIAESDVVITTGGISTGEFDFMYDALNELGVEQKFWKVAQKPG
KPVYFGTTGSGKIVFSLPGNPVSALVCFLEYGLSALARMQGVAPAPKFTA
LLNEPFPTDRKRHRFLFGRLSVEAEALRCQLSAQVESYMLTALSGANCLV
EAPPSAEPLPAGSLVTCAWLPWANAC
>CT1479 mpl, UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl- meso-diaminopimelate ligase
MSSIYFIGIGGSAMASVAVALSHMGHAVTGSDTQLYPPMSTYLENHNIRY
FNSFSAENLKSATPDLVVVGNAISRGNPDLEYALDQHMELISMPQLVRRE
LIGRHTSIVVAGTHGKTTTTSLVAWLLEAGGLLPGFLIGGIPENFGDGCR
PSGLSEPGFFVTEGDEYDSAFFDKRSKFLHYRPDIAIINNVEFDHADIFD
SLDDIKKSFRLLVNLVPSNGLLIVNADDPNSMEVSAKAFCRVETFGLNGN
AEWTAADIATDTDGTSFTVVRDSEAIGRVKVPLFGNYNVMNALAATAAAI
RAGVSFESVTRGLCSFKRPKRRMELVGEYAGGITLIEDFAHHPTAIRLTL
GAIAEHYPGRRIIACFEPRSNTTSRNIFQHELSECFGDAAIVVLGKVNRP
ERYAPEERLDAALLCRELKANGKRVFAAGSENYPEDIVRFIEAEQQPGDV
VVLLSNGSFSGLKEMLAESFQKNS
>CT0037 mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase
MLYYILRYINELYSLPGMGVIEYLTFRASAAAITALLIIIFAGQRFIRFL
KSKFVEPIKEEAPPEHRKKKDVPTMGGLMIIFAIEVSAFLWAKIDDPHVW
LIMLAVFWMGLIGFIDDYQKVVLKVKGGLAGHYKLIGQVTLGLVIGFYTW
NDPVFSVLLSDTTVPFFKKLSVDYGIFYIPVVIFIITAVSNAVNLTDGLD
GLAAGNAAIVTFALGVFAYLCGNAVYSGYLSIPFISGAGEVAVVSMAIVM
ACVGFLWFNSSPAEVFMGDTGSLSLGSAIAVIALMIKQELLLPVLAGVFF
VETLSVSMQVAWFKISKKLYGEGRRIFLMAPLHHHFQLKGWAEQKIVIRF
WIISILLFLTSLMTLKLR
>CT0566 mrdA, penicillin-binding protein 2
MDDFLRGIKLTILLIAAVFLFFTARLAWLQIVQHDDISSRSGSIRRIWEQ
APRGHFIDRNGITVLENQALYTLKIIPNELRASSIPYLAYLLEIPVDELN
EKVAEAKDYSPFAVSTIYRDLNEFVVARISENLWRLPGVIIEIENKRKYS
DLFRGTHLFGYLRNVSKEQLDTLAEKGYTPDDKIGFSGLERIYEEELRGE
KGVRYELVNPLGMLMGKYNEGKNDIPSVKGNDLYLTIDAHLQRLAENLLR
ATGHPGAVVAIDPSDGGVLALCSEPDFDLDILNGKTRKKEWAEIALSPEK
PLFNRAIQAVYPPGSTYKLVLAIAGLEEGVIKPEDTIISTGSWNYGGRIF
HDHGGRGHGIVNMKKAIIESCNIYFYQLMLKVGLDTWDKYGKMFGFGQRE
GIDLPGERRGLLPTTEYYNRRYGEGRWTKGYLVSLGIGQGELGVTPLQLA
NYAATIANNGTWHQPHIVRGYRDTRTGIYVPIDHASRTLPISKETFGIIK
EAMQGVVQQGTGTLAQVPGVTVAGKTGTAQNPHGKDHAWFICFAPVDHPK
IAIAVLVENAGFGGSISAPIAREMINYYLVEKNKPKTQGADSTAVAKIKK
LNDSLNTKRPAAKTAIDSTSTAPQSDLEGGD
>CT0547 mreB-1, rod shape-determining protein MreB
MGIFSDLFRDIAIDLGTANTLIFIRNKGVVLNEPSIVARDRNTGKVVAIG
HDALLMHEKTHPGIVTIKPLANGVIADYEATEELIRGLINKTKKQFSLGI
RRMVIGIPSGITEVEKRAVRDSAEHVGAKEVYLVAEPMAAAIGIGIDVKE
PMGNMIVDIGGGTTEIAVISLGGIASGESLRVAGTDITSAIIRHFRKAYN
LAIGERTAEEVKIRIASAYKLDKELTMNVRGRNLVTALPEEREINSATIR
EAIATPISQIITSIKKSLEVTKPELSADILDRGLFLAGGGALIKGLDKKI
NEETKLMVHISEDPLTAVARGTGAVLEDLEKYRSVLLSTKRY
>CT1470 mreB-2, rod shape-determining protein MreB
MILGNVLNIETFVDLGLDPGSANTLMCIKDQGIVVNEPTIVAVESESGQL
LAFGHEALNMHQKMHPGIQTIMPVTNGIIGDYENTQKLFRELLHNVKPRI
LFGIHRLVVSIPLSITEVGKRAFFDMAEHLGAKEAWLVLEPIAAAIGAGL
NPFEPVANLIVNLGAGTTQIAVISLGGIVSGESLSVSGNQINNAIIENLR
EQNNLAISEYAAEHIKRNIAATDRADRESRLTVKGFNLLTGFPDTQEIST
AVLREIITTPLQEIVTAIKKCIEVLADKPDVAVDILERGIYLTGGGALLS
GIDKKIQSETGLAVTICEEPQTTVGKGLCTILENFEHYRPVLLDNNKKHK
Q
>CT1243 mscL, large conductance mechanosensitive channel
MLKEFREFALKGNVVDMAVGIIIGGAFGALVNSLVNDLLMPPLGLLLKGV
DFSNLFVVLKEGTPPGPYIALADAKTAGAVTLNYGLFVNALIGFLIMAFA
VFLLVRSINRLRSLSEKSAAPAVAPQTKECPFCFSIIPLKAVRCPNCTSQ
L
>CT1278 msrA, peptide methionine sulfoxide reductase
MKYKTLTPEEKRVIIDKGTERPFSGKYYLTKEKGVYQCKQCGADLFRSDA
KFDSGTGWPSFDDAIPGAVRQETDRDGMRIEILCANCGGHLGHVFYNEGF
TSKNARYCVNSVSLSFEPSDKPATATEKAVFAGGCFWGVEYHFKKMKGVL
STTVGYTGGKTAHPTYEQVCSGRTGHAEAIEVEFDPSVVSYEELAKLFFE
IHDPTQVDRQGPDVGEQYRSAVFYQNDQQKKIAEALIEQLKKRGYDVVTS
VEKGGEFWPAELYHQDYYEKTGHKPYCHIYQKRF
>CT0719 mtd, methylenetetrahydrofolate dehydrogenase
MKVLEPNPVAASFRDAVRRQISEEQLTINIVGILASDDPASITYADYTRA
GCEDVGIHFDLRKCEPESVRATLEAANRDSAVHGIFVYYPIWGDKRDAEL
RDLISPHKDVEGLSPHWIKKLYANERFDDTERRFKSILPCTPLAIIKLLE
VTEAYAPFGLPFGGQQITIFNRSEVVGRPLAYMLSNDGARVYSFDINGGF
VVDVNSSDHESRPVTREEALSQSDIVITGVPSPHFEKVRAEELKPGAICL
NFSYIQNFEPEAKEAASLYIPRVGPMTVAMCMRNALQLYHNYHHEV
>CT2025 mtgA, monofunctional biosynthetic peptidoglycan transglycosylase
MKFLRNIVLFILLLLAVDVGRYFFVPDVSRLVHTNPGKTAFMEYREAEWR
SEGRDKTIEQRWVPLKRVSPSLIKAVLISEDNNFWHHEGFDFEAMEGAIE
KNIKAGEFKFGASTISQQLAKNLYLSPSKNPLRKIKEAILTWRIEQTLSK
RRILEIYVNVAEWGDGIFGIEAAARHYYGVRASQLTASQSARLAAALPNP
ILYPPTGSSRFVKARAKHIYAIMVRRGLVVPDYSEVMTAPDAPVVQPPDS
IVVGVPEQLIHQASQPDSIKQESTPEPAAEDTSENTQSGK
>CT0555 murA, UDP-N-acetylglucosamine1-carboxyvinyltransferase
MDKLVIRGGKQICGTIPASGSKNSALPIIAATLLTPDGTFAIDRTPDLKD
VRTFIQLLNYLGAETSFENNLLKVSTGQLKSIEAPYELVKKMRASIYVLG
PLLARFGHTRVSLPGGCAFGPRPVDLHIMVMEKLGATVTIEKGFINARVN
GSRLRGTHIDFPISSVGATGNALMASVMAKGTTILDNAALEPEIECLCNF
LVKMGAKIDGIGTTTLVIDGVDQLKAVEFENIFDRIEAGTLLCAAAITGG
SVTVTSVAPEQLASVLDAFRQSGCTVTTNGNSVTLTAPAELNPVDITARP
YPEFPTDMQAQWMALMTQARGDSTIIDRIYLERFNHIPELNRLGAHIEIR
DNWALVHGPQELTGTKVMSTDLRASACLVLAGLVAKDTTEVLRVYHLDRG
YEAIEKKLTALGADIRREKYQEFS
>CT0033 murC, UDP-N-acetylmuramate--alanine ligase
MPPMELGKTKNVHIVGIGGAGMSAIAELLLKSGFAVSGSDLASGEVIDKL
RELGAVIHQGHQAENVGASDVVVYSSAVRPESNVEILAAQKLGIPVIKRD
EMLGELMRHKSGICVSGTHGKTTTTAMVATMLLEAGQSPTVMIGGVSDYL
KGSTVVGEGKSMVIEADEYDRAFLKLTPTIAVLNSLESEHMDTYGTMDNL
RDCFAEFANKVPFYGRVICCVDWPEIRRIIPRLNRRYTTFGIEEHADVMA
SEIEPGDGGSTFTVEAFGERYPGVRLNVPGRHNVLNALAAFSVGLEIGLP
PERIIAGLARYSGMRRRFQVKYRGADGVLVIDDYAHHPTEVKATVRAARD
GWKEHRIVAVFQPHLYSRTAEFAGEFGWALSRADTVYVAGIYPSREKAED
YPGITGELVAEASRTAGAKNVWFTEEHEALLAALQEEAAPETLFLFMGAG
DITHLAARFAAWCTEMRSNADATAS
>CT0036 murD, UDP-N-acetylmuramoylalanine--D-glutamateligase
MKPEELKGKVASVIGAGKSGVSAAGLLARAGARPFLSEFGAVSPEAAATL
RQLGVPFEEGGHSERVFEAALCIVSPGIPQTVPVIREMHARGIPVVSEIE
LASWFCPARIIGITGTDGKTTTATLLHRICAAEGERKGFRAFSVGNIGIP
FSSEVPGMTAADIAVLELSSYQLEACFDFRPNIAVLTNVTPDHMDRYGGS
IEAYATAKYRIHARQGAGDTLIYNHDDPILRAHFDRSEPWPFRLVRLGLR
AETLDVAPGDFVSVEDGEIVVRASGSTERLMRVDEIMKPGFRGEHNLYNA
LSSVAAALAAGVAPETMRGVLAGFGGVEHRQELAGNACGLNWINDSKATS
VNALRQALQSVPAGMVLIAGGRDKGNDYSAIADLVREKVACIVAIGESRR
KIADAFRGVTPVVEAASLAEAVELARQNARPGASVLFSPACSSFDMFRDF
EDRGRQFKQLVRELT
>CT0039 murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2, 6-diaminopimelate ligase
MKEIREGAPGAQLDDLVAALGALAERRGGDGARAVITGVTCDSRAVTPGA
LFVAVRGLVADGHHFIGAAIEAGAVAVACEELPAAYSDSVTWLVVPDARK
ALAELSKAFYGNASDKLMLIGVTGTNGKTTTARLVTSMLNASGVAAGYIG
TGLCRIGNHDIPLERTTPEPNRLHDLFRQMVDAGCRAAVMEVSSHSLVLD
RVHGLFFRAAVFTNLTPEHLDFHETMEEYAEAKRLLFDQLNAEGFAVINA
DDPRAEFMAARLAPERVFCCSTGDNTSLCDPARRFHAVITASTVEGSKAD
VTFDGQSMAMQVPLPGAYNVMNMLEAFTVGVGLGIDPATALRSLAAADAI
AGRMERIWSRDRSRCAVVDYAHTPDALQKALEALRAVTPADAKLAVVFGC
GGNRDRQKRPEMGRIAAELADRVILTSDNPRDENPEAILDEVEAGMAGRV
HLRIADRAEAIRRAVEQLGAGDILLVAGKGHEAYQEIRGVKHHFSDRECL
EACFAQMK
>CT0038 murF, UDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate--D-alanyl-D-alanyl ligase
MKMKGALMFSDFERAGTVVARDVGEGYRLDDPVVVIDSRKAVDGAVFVAL
PGERTDGHRFVGEVFANGASWAVVSREWFVEKGAEHQGDDRRFLVADDPV
KAFQQLAAAYRERFDIPVIGIGGSNGKTTTKEMLAAVLSTSFNVLVTQGN
YNNHLGVPLTLLQMRRDTEVAVIEMGINHPGEMEFLSSLAKPTHGLLTNI
GHEHLEFFGSLDGVADAEAALFRYLEAHGGTAFVNLDDHRLAAAGASLSR
KTGYGAQPGAGRAWWAEQIGADRVGRVSFTLCSESGVHQPVAMQFVGRHN
VINAVAASAVGAHFGLAPAHIAEGLGTLLPAKGWKRMELFEDGGIVVLND
TYNANPDSVRLALDTLAAIECRGRRIAVLGDMLELGDNSAIEHESIGRYI
RQLPLDACLTLGDAAQLICEQAGGRCLRHFGTMDELRGFLSEYVQPGDAL
LFKGSRGMKLELAADDLIKQNQQHSI
>CT0034 murG, UDP-N-acetylglucosamine--N-acetylmuramyl- (pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase
MKVLFAGGGTGGHLYPGVAMAAELKKRVPGISISFAGTSAGIEATEVPRL
GYRLVLFPVRGLKRGLSIRALVENALILGDFAKSLSMAMALVRKEQPDVV
VGTGGYVSAPLLLAAQLSGKKTLIQEQNAFPGVTTRLLARMATEVHLSFE
ESRKFFGGKSEVFVTGNPAREFPAESRESCLDFFGLDRSLPTLLVFGGSR
GARAINNAVLKLCHRLEGTVNLIWQTGALDADRMRGEIGTSATRWIGPYI
QEMGKAYGAADLVLCRAGASSLAELTNLGKPSVLIPYPYAAADHQRHNAM
ALVSAGASVMIDDSKIGEEASFDVILTLLRDREKLAQMGEAARREGHPGA
AATLAERIIALSKS
>CT0284 murI, glutamate racemase
MPQHKVSSDSPIGIFDSGIGGLTVVKAVQAALPSERIIYFGDTARVPYGS
KSQVTIRKYAREDTELLMKHQPKLIIVACNTVSALALDVVEQTAGGIPVI
GVLKAGAELAVQKTKSGRIGVIGTQATIGSNAYTCAIREEKETLEVFPKA
CPLFVPLAEEGFIDHPATRLVAEEYLAAFTGKEIDTLVLGCTHYPILRKI
IESITGPEITIIDSAEAVASKAGELLAARGLLNQSPEKALPHLMVSDLPQ
KFRELYRLFMGTELPDVELVGM
>CT2028 mutL, DNA mismatch repair protein MutL
MASIARLPDIVANKISAGEVVQRPASVVKELIENSIDAGASRITVIIKDA
GRQLVQIIDNGCGMESDDVLLSVERFATSKISEVDDLDALRTLGFRGEAL
ASISSVSHFELKTRKAGNSLGTLLRSDGGVIETPQPAQCEPGTSIAVRNL
FFNVPARRKFLKSNATEFKHIHETVKAFVLSYPEIEWRMMNDDEELFHFR
TSDVRERLSHFYGEGFGESLIEVTEENDYMTIGGYLGKPGMMVRQKYDQY
FFINRRLIQNRMLVQAVQQAYGELLEERQSPFALLFLGLDPSLVDVNVHP
AKLEVRFEDEKSIRSMVYPVVKRAVRTADFSSEASFAAPSAPTVSGEVDL
PEVSSRKLSYSSFSGKASTTGDLYRNYRAGAFSAPSSVSPMLFDSSLETS
LSAGSRPTPMVQESLLTPSVDQPDTGDGENPVAPEKEPKIWQLHNKYIIC
QIKTGLMIIDQHVAHERVLYERAIDIMNEAAPNSQQLLFPQKIDLKPWQY
EVFEEISDELYRLGFNIRPFGGMSVMIEGVPPDVRDGAEATILQDMIAEY
QENAAKLKLEKRDNLAKSYSCRNAIMTGQKLSVEEMRMLIDRLFATRMPY
VCPHGRPVIIRLSLGELDRMFGRT
>CT1503 mutS1, DNA mismatch repair protein MutS
MAKSAQGRTKEPTPMMRQYLEVKERYPGYLLLFRVGDFYETFLDDAVTVS
SALNIVLTRRSNGGAGEIPLAGFPHHASEGYIAKLVTKGFKVAVCDQVED
PALAKGIVKREITDIVTPGITYSDKILDDRHNNYLCAVAPVKRGREHMAG
VAFVDVTTAEFRMTELPLGELKDFLQSLRPSEILISSRDKELRESLAKSL
FSGALFTTLDEWMFTEEQAARVLENHFKTHSLKGFGIEGYEAGRIAAGVI
LQYLEEAKQGSLKYLVRIGLVESGETMTLDIQTCRNLEIISSMQDGSLNG
SLLEVIDRTKNPMGARLLRRWLLHPLRKLEPVVRRHDAVGELLDAPEMRE
GIRGMLGGIIDLERALARIATSRAMPREVRQLGSSLAMIPQLKSLLEGSK
SLRLRELALRLDPLPELAETIEKALDAEASGTLRDGGYIRAGYHAELDEL
RAISSGARDRLLEIQQQERQRTSISTLKVQYNKVFGYYIEVSRANSDKVP
EYYEKKQTLVNAERYTIPALKEYEEKILTAEEKSQLLEHQLFQELCAMIA
EQAASIQTTAAALAELDCLACFASCADEFGYCRPVMNEGTELSIRAGRHP
VLERILGADEPYVANDCQVGSEQQLLIITGPNMAGKSSYLRQVGLVVLLA
QVGCFVPAESAEIGLVDRIFTRVGASDNLTSGESTFLVEMNEAASILNNA
TERSLLLLDEIGRGTSTFDGMSIAWSMCEYIHDQLRSRTLFATHYHELAE
LESRFERIVNFNATVVETADTVIFLRKIVRGASDNSYGIEVAKMAGMPPE
VIERAREILAGMERREVEVPVQRQALPLRVESRQISLFEEEESRLRKALS
GIDINRLTPLDALMELKRLQEIALGKGA
>CT0334 mutS2, DNA mismatch binding protein MutS2
MDETTVTESVSLFRKLEFDKVARHAAGFCISAMGSDLLMEMGLPDECERE
LVRVLELKNFLLEGEPLPFSRLPDTRFLLSKLEVLDSWLNAAELLDIFYL
LQSSAQLRKFMFTSRERFPALNEFTIRIWLEKSIQYSISQAIDERAVVRD
TASDALYEIRKKLGEARDGLRRKMERILRRCQNEGWLMEETVALKNGRQV
LPLRVENKHRLPGYIQDYSQTGQTVFVEPAETLEISNRIQELEIAERREI
ERILKALSDGVRQELENVRHNERIMAAFDSLYARARLAVETGSMLPKISA
GRRLKIVKGYHPWLFISHGFSKEKVFPLDMELDENEQVLVISGPNAGGKS
VAMKTVGLLVCMLRHGYLVPCSESSEFPLFNSIFIEIGDEQSIENDLSTF
SSHLSAIRDILDHAQPDSLVLIDELCSGTDVEEGSAIARAVIEELLERGV
KTIVTTHLGELKLYAHRREGVVNGAMEFDRHGLAPTFRFLKGVPGNSFAF
AMMRRMGFSDEIVNRATGFLTTGHTGLEEMIEDFRKSAASNRELERELRR
ERLEAESIRSALTLQRAELRKKMQELKSRGYRDLDRQLEQARKEIRDLVR
EVKEHPGDDATLHKARTKLAGMKRELAVKGEQVEREVAPQADLSIRPGDT
VRIGDTNTTGEVESIQGDSAVVQCGNFRLTTALRGLKKISRAGAKKLQKE
AATGAAAGKSWSVKSSTLESTRLDLRGLTGDEAIAEIGRFIDALAVHRMP
FGTIVHGKGSGALRLRTAEFLKQHSRVKSFRLGDWQEGGAGVTVVEMR
>CT1247 mvhD, hydrogenase, methyl-violgen-reducing type, delta subunit
MSEPFEPKIVAFVCTYCTYAGADLAGTSRLNYAPNVRIVRLPCTGRISPM
FILKALQKGADSVLVSGCHPGDCHFTAGNYHARRRWTVFRALLSFTGIPE
ERIRFSWISAAEGAKFAELINEITDDTRKLGPFTQYQELQKVIETQSTY
>CT0570 nadA, quinolinate synthetase
MTIATDMHKEAAGLSTEALLRRVQALKKEMNAIILAHYYTLPEIQQAADI
VGDSLALARAAEKTSADVIVFAGVYFMAETAKILNPGKMVLMPDPGAGCP
LADSCPEEEFRAFRQAHPDAIAITYVNSSAAIKKLSDIICTSSNAEHIVR
QIPPEQQIIFGPDRNLGAWVMKRTGRDMLLWQGFCYVHDAYSEVYMIQAK
AMYPDAELIAHPECREEVLRQASFVGSTSALLDYTEKSPRKSFIVATEPG
ILYEMEKRSPGKVFIPAPKDPANPRSVCKQMKQNTLDKLYLCMVNRSPEI
TVDESLREGALKSIKRMLEMSA
>CT0561 nadB, L-aspartate oxidase
MTQEVKTDVLVIGSGIGGLYFAINMADHATVTIITKKESSTSNTNWAQGG
IAAAIAGDDTPELHIADTLDAGAGLCNEAIVSILVHEGPAHIRRLIELGV
EFTTNPDHTLNLGKEGGHSRNRIVHAKDLTGREVERALLARANAHPNITL
LEHHYALELITEHHLGIKTNDITCYGAYVLDTLNHKPKKILAKVTMVASG
GLGHVYLHTTNPEIATGDGIAMAYRAGAEIANMEFIQFHPTSLFHPKAKS
FLISEAVRGFGGILRNKEGEAFMHRYDRRENLAPRDIVARAIDSEMKKNG
DECVFLDVTHIKAEKVREHFPHIYETCLGFGIDMTKEMIPVVPAAHYSCG
GIRTDSWGRSTINHLYACGETSCTGVHGANRLASNSLLEALVFAWRSSED
IRAELKSIHFKHEFPDWDDSGTTSPEEWILVAHNKKEAQVIMNDYVGIVR
SDLRLDRARRRIDFLKEETEAYYKKTKITPQIIELRNIIKVASLIIQGAI
KRRESRGLHYTTDFPQKDDKHYLADTVLRSF
>CT1936 nadC, nicotinate-nucleotide pyrophosphorylase
MAVMAEKRTNHAFREFFETCRLKAMQLALEEDRFQGDITTEATVDQNQLG
LGYIEVKSEGIIAGVEVARQVFQSLDAALEFTAYVKDGKRVYPGERVLEV
KGRIASILIGERTALNFMQRMSGIATRTNMYVERVSHTNASILDTRKTAP
ALRYYDKEAVRIGGGTNHRFGLFDMILIKDNHIDAAGSVEEAIRRAKAYC
QEQGVSAKIETEVRSISELVRACASRPDMILLDNFMVDDLAEAVRWIKAN
GFGNILLEASGNIGLHNVSEVAMTGVDFISIGELTHSVKALDMSMKIERA
>CT0016 nadD, nicotinate-nucleotide adenyltransferase, putative
MRTAVFGGSFDPPHNGHLALSLFARELAGLDRLIVSVSKNPFKAAADASD
DDRSAMARLLVAEINVAGVFAEISGWELQQSGPSYTIDLLRHVEERCPGD
ELVLLVGEDSYLQMPQWKFASEILKHCTIAVFGRSDIDAADAPPSDPLLP
AIHYDFDMPVSATKIRRLAAAGQPIGQFVPSSIAQYIAEHKLYSA
>CT0560 nadE, NH(3)-dependent NAD+ synthetase
MKPQNLHFDYGLVEAILVPFIRNEIRKFGFGSVVLGLSGGIDSAVVCELA
VRALGVENVLALMMPYKTSSQESLDHAELMVDRLGIRYEIMPVTEVVDAF
FATRPDASRLRRGNVMARSRMLCLYDVSARDGCLVLGTSNKTELMLGYGT
MFGDMASAVNPIGDLYKTQIFGLARHLGIPAPLIDKPPSADLWEGQSDEA
DLGFSYEEVDQLLYMMLEERMDRDAILAEGIDSAFYQRVRSMVVRNQYKR
MMPVIAKLSSRTPGIDFRYARDWQEVR
>CT2124 nagA, beta-N-acetylglucosaminidase
MANGVDSGTGTFPKPWSAQTVFSNREPWVERTLRNMSVREKVGQMIVAKV
DAVYKNDDDPQYQLISRLVSEGKIGGIMFLKGDVQSAGILANHFQELSKV
PLLVSADMEKGVAMRLDGATKFSPAMAISAAGNPALARRMAEIVAREARA
IGIHQNYAPTVDLNINPANPVINTRSFGDRIPLVNAMSAAVIDGLQSNGV
AATAKHFPGHGDVTVDSHLALPVLEGDRRRLENYELKPFRSAIANGVLSV
MVGHLAVPKLTGTMEPASLSRTIVTGLLRNELGFKGLIVTDALNMKALQS
NGLTPGEVAVRAVQAGNDMLLFPEDPELVFDAVCAAVENGEISEQQIDHS
VQRILQMKHWLGLDRRKLVDLSRLSERVGTKENKRIAEQIAEQSLTLLRD
RNRTIPLRFPQNGQLVNIILNDRPGQKVGREFVDTLRTSYNVTSLRLTPQ
SQPEFFQEASRAVAHASAVILTTGIQAWSKSVPSGLSQLQCDFVRSLPSM
APKGTPILFISFGTPYILDAFPEIGSALCAYSENEETDAAILKALKGELV
PRGTLPVSLESVKP
>CT0369 ndh, NADH dehydrogenase
MKKKVVIVGGGFTGLNTARILSNRKDVEVTLIDRKNYHLFQPLLYQVAMA
ALGEGDIATPLRNMLAGYDNVTVFKGNVCNVDLEQKKVKTDFGDIEYDYL
VLACGAQHHYFGKNDWEEHAPGLKNLAQASEIRRRVMEAYEAAERTNDMK
ERKKQLTFVIVGGGPTGVELAGSIGEMSRYTLSKFYRHIDPKLTRIFIVE
AAERILGTFSPELSSKATRELEKLGVQVWTSSMVSDVDADGVQIGRERIE
AATVLWAAGVKASEIGQNMGVQTDRSGRIMVEADLSLPGHPEVFVGGDQA
CYTLENGSTLPGMAPVAMQEGKAIGRMILDDLKGKPRKPFKYRDKGQMAT
IGRNRAIVEIGNLKFDGAIAWFTWLLVHIYYLSTFKHRVFVLMQWAWSYF
TFGYGARLIVNKDWRFYREQKSSPCDDKKA
>CT0770 ndhA, NADH dehydrogenase I, subunit 1
MSSSPSLNTWSDALSGFSIGWFPLGLVIVAAIPLVFIALYALTYGVYGER
KISAFMQDRLGPMEVGFWGLLQTLADILKLLQKEDIVPTVADKFLFVIGP
GILFVGSFLAFAVLPFGPAFIGADLNVGLFYAVGIVAIEVVGILAAGWGS
NNKWALYGAVRSVAQIVSYEIPASIALLCAAMLAGTLSMQQIILMQAGPN
GFLHWFLFTNPIAWLPFLIYFISSLAETNRAPFDIPEAESELVAGYFTEY
SGMKFAVIFLAEYGSMFMVSAIISIAFLGGWTSPLPNIGGLELNTMTSGP
VWGAFWIIMKGFFFIFVQMWLRWTLPRLRVDQLMYLCWKVLTPFSLVSFV
LTAIWVINH
>CT0776 ndhB, NADH dehydrogenase I, subunit 2
MFEMPSGAEIQSIISILKGGAGYFVPEIYLSALFMVLILLDLITGKKNRG
LLATATIAGLLGSVYFIFKQQTMPEVQFFFGMYALDRFGIFFKYFFVVSG
VLAVLTTVIDEQLKKHESGMGEYYALLVAMVVGMMMMASSTDLLMMFLSM
ELVSLSAYALTGYLKREPRSSEAALKYLVYGAVSSGMLLYGFSLLYGMTA
ETNLTRISMVLAAHGYDPLAMILAVLLIMAGLGYKMGAVPFHFWSPDVYE
GAPTPVTAYLSVASKAAGFAMLMRFFFVAVPHGFDMYVSPLHIDWLSILI
LVSAASMIYGNVVAIWQKNVKRLLAYSSIAHAGYLLLGIITMDQLGTQAV
LVYLAAYLLMNYGAFYVVILIANHTGSENLDDYKGLGKRMPLLGAALTVF
LISLVGLPPTFGFIGKLMIFSALLAKGSLFMWLALIGILTSVISLYYYML
IPLNMYLRESNTPEEGVIATGMGAKIVTASLMILTLWFGLFFQPIANYAR
YSTSIFGAFLN
>CT0766 ndhC, NADH dehydrogenase I, subunit 3
MDQTLSGFGTVFVFLVLGIVFVVGGYLTARMLRPSRPNPEKNSTYECGEH
AVGSAWVKFNIRFYVVALIFIIFDVEVVFLYPWATVFKQLGAFALVEVLI
FVGILVLGLVYAWVKGDLDWVRPTPNIPKMPEMPVRRSGKANG
>CT0775 ndhD, NADH dehydrogenase I, subunit 4
MLSFIVFLPIIAGLVILAVPSSQKQIIRIVSLLAALGQGVLAVLIWRHYD
PTMAGIVAAPGGSPVGSFQMIERIPWISLDLGSFGPLNIEYFLGVDGLSI
TMVLLTALISIIGVLSSWPIQKQVKGYFILYNLLSTAMMGCFVALDFFLF
YVFWELMLLPMYFLIGIWGGPNREYAAIKFFLYTLFGSVFMLLVMIGLYF
SVTDPLTGHHTFSLVAMADQANYIKGTILGPDSVTWRYVAFIVLFVGFAI
KVPMFPFHTWLPDAHVEAPTPISVILAGVLLKLGTYGMMRINFPLFPEVY
QAGLYVIGVFGAINIIYGAFCALAQQDLKKMVAYSSISHMGYVLLGLAAA
NTEGMIGALYQMFNHGTITAMLFLLVGVIYDRAHARQIDKFGGLATYMPV
YTGFVIVAWFASLGLPGLSGFISEALVFVGAFSAPVTRPIAMVSVLGIVF
GAAYLLWSLQRMFLGKRKPDALYDLEVDVDGHEHIHFHDWKGKLDLDARE
LTMLVPLVIIVIALGIYPMPVMGLITTSVNKLVQVLTPVAMSAVH
>CT0773 ndhE, NADH dehydrogenase I, subunit 4L
MLTQQLLSIGVNHFLTISVILFGLGMFAVMTRKNAIVILMGVELILNAAN
INFLTFSKYNGGMEGVMFSLFVIVLAAAEAAIALAIVINIFKTFKTVDVS
SVDTMKE
>CT0774 ndhF, NADH dehydrogenase I, subunit 5
MMHSLIQLSIAVLLLPLLSFVVLIFFNRRLPRGGDFVGVGLLGTTLAIAL
YIFWTVIVQHYDPAFRLAWDFTWLDFGNVPGVGPLQVKMGIVIDNLAAIM
LAMVSLISFLVHLYSTGYMKGEMYYGRFFAYLGIFTFSMFGIVLSDNLFS
IYIFWELVGLSSYLLIGFYFHKDSAADAQKKAFLTNRVGDIGMWLGILIL
YSQFHTFGYQEIFNHIKNGDFHMSQAWLTAAGILLFMGCVGKSAQFPLHV
WLPDAMEGPTPVSALIHAATMVAAGVYFVARIFVLLTPDALHVIAFIGAF
TAFMAATIAITQFDIKRVLAYSTVSQLGYMVLGLGVGSYSAALFHLLTHA
FFKACLFLGSGAIIHAMHHEQDMRWMGGLYKKMPWTFVTFSIATLALAGL
PLTSGFLSKDAILAGSLGFAQAEGGGIYYLVPVFGFGAAVLTAFYMGRQI
WMVFFGENRTHLKPKSAHSMDDHDDGDVDHIHDAAHGGHDHHPVHEVAWN
MRLPLVVLATLSVFIVFSPDPLDGGKGWFMHLVQTPATVVSVAEMAHEGA
QLHAPEAGKLLAEAHTDTQHVEAATSREAEGQHGPVFADPRQAEIAHMTH
ANHYTAIKLSSIMVVIGIGMALIVYVFRIIDPDKTAQAIRPLYLYSFNKW
YWDEIYDATIIKGSILISKILAWFDTNIVDGLVNGVATIFRKFAFFNGGV
DKYVVDGLVNFTAFTVQTTGAVFRKIQTGKVQTYLVMVVFAVLGWFALYF
AHLVK
>CT0772 ndhG, NADH dehydrogenase I, subunit 6
MNQLTIAIIFYIFAAVTVLSAAFVVFSKNVIYSAFSLLFTFFGVAALYVF
LSADFIAVTQVVVYVGGILVLLLFGVMFTNTIMSTELKADVLNVVPGILL
TLLLIVGMLFTFYTTGSWMPGEMQLNGSVVQSIGLETMSRYMLPFEMFSI
VLLVALLGAAYLARYDKANKKEH
>CT0767 ndhH, NADH dehydrogenase I, 20 kdA subunit
MGLLDARISNRNVLVTSVDNVMNWARLSSLWPMGFGLACCAIEMMATNAS
NYDLERFGIFPRSSPRQSDLMIVAGTVTMKMAERVVTLYEQMPEPRYVLS
MGSCSNSGGPYWHHGYHVLKGVDRIIPVDVYVPGCPPRPEALIGGLMKIQ
ELIRMEGLGISRQDALKKLAGKRVDPQQVIDQVRKSATA
>CT0771 ndhI, NADH dehydrogenase I, 23 kDA subunit
MSEYFSNIKTSVTTIATGMGITLKHFFNAVKRKGDAGIDDADYFRQVDGL
CTLQYPKEAIPTPPHGRYRLYCNINDCIGCKQCERACPVECITIETIKTT
SDDLEACGKTSGGQQKRMWVPVFDIDLAKCMTCGICQSVCPTDCLYHTPV
ADFSEFDVSNMMYHFGNLSKIEAEAKRKKLAEQQAQAAKEKAAAGGAPAP
KPAPKAAPQPNPGDAR
>CT0768 ndhJ, NADH dehydrogenase I, 30 kDa subunit
MEEAANQMSPAVQQSKAAYDNIKERFGDAISEFDANPTMPFFEVLDVSKW
VDIALYMRDNSLLQFNYLACLSGVDYPEEQKLGIVCNLECIGKYTHKIAV
KVKCPRDGGSIPSVSCVWHTANWHEREAYDMYGMVFEGHPDLRRILCPED
WTGFPLRKDYQVQETYHGIKVPY
>CT0769 ndhK, NADH dehydrogenase I, 49 kDa subunit
MVLSMGPQHPSTHGVLRLECITDGEVVVEAEPYLGYLHRCFEKHCEKIDY
PAIVPYTDRMDYLAGMNNELAYCITVEKLLDIEIPRRVEFIRVIVAELNR
IASHLVAIGTYAIDLGAFTPFLFCFRDREHIMSLLEWISGARMLYNYIWI
GGLAYDVPADFKTRVAEFVTYFRPKAKELYQLLTENEIFVKRTYDIGIMP
ADVAINYGWSGPMLRGSGVKWDLRRNDPYSVYPELDFDVPVPDGKFSVVG
DCLSRHLVRALEMEESLKIIEQCLDKMPEEPNFNSRALIPKKIRPKAGEV
YGRAENPRGELGYYIVSDGKSTSPVRCKARSSCFVNLSAMKDLSKGQLIP
DLVAIIGSIDIVLGEVDR
>CT2002 ndk, nucleoside diphosphate kinase
MERTLTILKPDCVRKQLIGAVTNMIERAGFRIVAMKKTRLTKETAGAFYA
VHKERPFYGELVEFMSSGPCVPMILEKENAVADFRTLIGATDPAQADEGT
IRKLYADSKGENIIHGSDSAENAAIESAFFFAAEEVVRVD
>CT1153 neuA, acylneuraminate cytidylyltransferase
MRTAAIIPARGGSKGLKNKNIHPIAGLPLLAWSVLQALDAEHVDQVFVTT
DDAAIAQVARQFGAEVIDRPERISGDKATSESALLHALEVIAERYGAEPE
TVVFLQATSPLRKPGDIDRAIELFRLEGADSLISVTRADDLTIWEQRGGD
WNSVNFDYRNRGMRQDRPSQFIENGSIYLFTPSVLREFGNRIGQKLSVYL
MEFWQTWEIDTIEEVDLVEFYLKQKGIDRSFLRS
>CT0795 nfeD, nfeD protein
MVKRLLKFCVIPAIAVFLLVAMPVAGQAAQIRAMSLTGSVNAGSAAYFLR
VLDEANRDNDTLLLVELDTPGGLVSSLRQMVQGVMASRVPVVVYVAPSGA
QAASAGALLLLSANVAAMAPGTETGAAHPVDISGGGEKGSVMGKKIENDL
AAFARSLAQKRGRSPEWAERAVRESIASTASEALAAGVIDTVADNRKELL
VSIDGRKVETAIGELIIRTTNVPVKEASPTFGEEVMMAIADPNIAYFLLL
LGLAGLWFELSTPGAVLPGVAGAIALVLGAWAMQLLPVNVTGLLLILLAI
LFFGLEIFVVSSGALAIAGLVALFIGSVMVFNQPELGLVINWWVFLPLFL
SFSAGVLLLVFVVFRSTRRKAISGREGLVGETGTVERAIGEGKDGKVFVH
GELWDASANGLIPAGSQVTVTGIEGMRLMVKQNSKEE
>CT0316 nfo, AP (apurinic or apyrimidinic) endonuclease, nfo family
MKRVGAHVSIAGGVENAPLNAQKIGAKAFAMFTRNQRQWHSAPLTAASIE
AFRRNCDEAGFLPEHILPHDSYLINLGAPEADKIEKSRKAFVTEMQRAEA
LGLTMLNFHPGSHLNLTDEDACLKTIAESVNRSLDATAGVTAVIENTAGQ
GSNLGWRFEHLARIIELVEDKSRVGVCLDTCHLFASGYDLRTPEAFDATL
REFDRVVGLLYLKGMHLNDAKQKLGSKVDRHECLGKGMIGIDAFAHIMRH
PALEEIPLILETPNAEGWAEEIAMLYGFTNE
>CT1529 nifA, nitrogen fixation specific regulatory protein nifA
MLIAQEQEQSSISLLAEVSRAVNNEEDISKVLRLVLFILSEHMNMLRGMV
TILNRTTGEMVINESFGLTDEQRQLARYKVGEGIIGQVVKTGRPSVVPRI
NEEPLFLDRTQSRAEENKEELCFICVPIKVGREVIGTLSADRRISVPAEN
GDKKLWRKSDRIDVLQYYVDLLSIIAAMIAQAVRIKQLDDEKSIDGAHER
TGTPALGFQIIDREIEEELPETERPANIIGNAKPMMSLFKMIDKISKTSA
TALILGESGVGKELVANAIHFKSMRNGKPFIKFNCAALPESIVESELFGH
EKGAFTGASAQRRGRFELADGGTLFLDEVGELSLPTQAKLLRIIQEKEFE
RVGGSKTIKTDVRIIAATNRNLEEQIQKGNFREDLYYRLNIFPITVPPLR
ERKTDILLLADYFVEQFNKTNHKGVRRISTTAIDMLMRYHWPGNVRELQN
CIERAVILSEDNVIHGYHLPPSLQTAESSGTPYTGDLQQKLDAIEKEMII
EALKRTKGNMTKAAIQLGLSDRIMGLRMKKFNIDYRKFRA
>CT1540 nifB, nifB protein
MTLNIKNHPCFNDSSRHTYGRIHLPVAPKCNIQCNYCNRKFDCMNENRPG
ITSKVLSPRQALYYLDNALKLSPNISVVGIAGPGDPFANPEETMETLRLV
REKYPEMLLCVATNGLDMLPYIEELAELQVSHVTLTINAIDPEIGQEIYA
WVRYQKKMYRDRQAAELLLENQLAALQKLKRYGVTAKVNSIIIPGVNDQH
VIEVARQVASMGADILNALPYYNTTETVFENIPEPDPMMVRKIQEEAGKL
LPQMKHCARCRADAVGIIGEINSDEMMAKLAEAALMPKNPDEHRPYIAVA
SLEGVLINQHLGEADRFLVYALDEEKKSCTLVDSRQAPPPGGGKLRWEAL
AAKLSDCRAVLVNSAGDSPQSVLKASGIDVMSIEGVIEEAVYGVFTGQNL
KHLMKSSQIHACKTSCGGDGNGCD
>CT1536 nifD, nitrogenase molybdenum-iron protein, alpha subunit
MEAKVLIPDPSKIKEELINKYPAKVAKKRSKSIVVNDPEIVPEVQANVRT
VPGIITQRGCAYAGCKGVVLGPTRDIVNIVHGPIGCSFYAWLTRRNQTRP
ETPEHENYITYCFSTDMQEEHVVFGGEKKLKVAIQEAYDLFHPKAIAIFS
TCPVGLIGDDVHAVAREMKEKLGDCNVFGFSCEGYRGVSQSAGHHIANNG
VFKHMVGNNNEVKPGKFKLNLLGEYNIGGDAFEIERLLEKCGITLVASFS
GNSTVGAIENAHTADLNVIMCHRSINYMGDMMETKYGIPWMKVNFVGAES
TAKSLRKIAEYFGDEELKAKVEEVIAEEVPAVKAIIDEIRPRTEGKTAML
FVGGSRAHHYQDLFSELGMTTIAAGYEFAHRDDYEGREVLPKIKIDADSK
NIEELKVTADPELYNPRKSKAELEELKAKGLEINGYEGMMKQMMKKTLVV
DDISHYESEKLIEMYKPDIFCAGIKEKYVVQKMGVPLKQLHSYDYGGPYT
GFKGAVNFYKDIDRMVNNPVWKMIKAPWEKSEPESLEASYVAS
>CT1538 nifE, nitrogenase iron-molybdenum cofactor biosynthesis protein NifE
MDTQKISLLEGREKQVYEKTAGGVEVDIACDKTSLSGSVSQRACVFCGSR
VVLYPVADAIHIVHGPIGCAAYTWDIRGAVSSGPELHRLSFSTDLQEMDV
IYGGEKKLYKSLIELIDQYQPNAAFIYSTCIIGLIGDDIDAVCKKVAKEK
GIPVLPVHSEGFKGTKKDGYKAACMALMKLIGQGSTEGISKYSINILGEF
NLAGEAWIIREYYEKMGIEVVATMTGDGRIDDIRRSHGASLNVVQCSGSM
TTLAKEMEEKYGIPYIRVSYFGFEDMSKSLYDVAQHFPERPDIMEKAKEI
VRDEIRKYYPEMQKFKAALAGKKAAIYVGGAFKTFSLIKALRTIGMSVVL
AGSQTGNKQDYKNLKEMCDEGTVIVDDSNPVELSKFVLEKEADLLIGGVK
ERPIAFKLGVGFCDHNHERKIPLAGFIGMYYFAKEVYESVMSPVWQFAPR
KGGAK
>CT1533 nifH, nitrogenase iron protein
MRKVAIYGKGGIGKSTTTQNTVAGLAEAGKKVMVVGCDPKADSTRLLLGG
LQQKTVLDTLREEGEEVELEDIIKEGYRNTRCTESGGPEPGVGCAGRGII
TSVNLLEQLGAYDDEWELDYVFYDVLGDVVCGGFAMPIRDGKAEEIYIVC
SGEMMAMYAANNICKGILKYADAGGVRLGGLICNSRKVDNEREMIEELAR
RLGTQMIHFVPRDNFVQRAEINRKTVIDFDPTHPQADEYRALAKKIDENK
MFVIPKPLEIDELESLLIEFGIAN
>CT1628 nifJ, pyruvate flavodoxin/ferrodoxin oxidoreductase
MTRTFKTMEGNEALAHVAYRTNEVISIYPITPASPMGEYSDAWAAVDVKN
IWGTVPLVNEMQSEAGAAAAVHGALQTGALTTTFTASQGLLLMIPNMYKI
AGELTPCVIHVSARSLAAQALSIFCDHGDVMSVRGTGFALLASCSVQEVM
DMALISQAATLESRVPFLHFFDGFRTSHEISKIEVLSDEQIRSMINDELV
FAHRMRRMSPDAPIIRGTSQNPDVYFQARESVNKYYEACPSITQKAMDQF
AKLTGRSYKLYQYYGAPDADRIIIMMGSGAETALETVEYLNNHGEKVGLV
KVRLFRPFDVATFIASLPSSVKSIAVLDRVKEPGSAGEPLYLDVVNAVAE
SYQEGKCASMPSVLGGRYGLSSKEFTPAMVKAIFDNMNAESPKNHFTVGI
DDDVTKKSLAYDETFSIEPDSVFRALFYGLGSDGTVGANKNSIKIIGENT
DNYAQGFFVYDSKKAGSITTSHLRFGPEQIRSTYLITEAQFVGCHHWVFL
EMIDVAKNLKQGGTLLINSAYAPDVVWSKLPRPVQQHLIDKQAKLYTIDA
YKVAHESGMGQRINTIMQACFFAISGVLPREEAIEKIKDAIRHTYGKKGD
EVVQQNIKAVDNTLANLHEVKIGAVADSTKELRSPIVGDAPEFVCNVLAK
IIAGEGDSIPVSKLPADGTYPLGTTKFEKRNLAQEIPVWAPELCIECGKC
SMVCPHAAIRIKVYEPKHLENAPATFKSLDAKAKNWEGMRYTVQIAPEDC
TGCQLCVNACPARDKQVEGRKALNMHEQAPLRETESACWSFFINLPEFDR
NKINQRLIKEQQLQQPLFEFSGACSGCGETPYVKLMTQLFGDRLVIGNAT
GCSSIYGGNLPTTPYAANPQGLGPTWSNSLFEDTAEFALGFRISIDKQQQ
FAKELVKKLAGDIGENLATAILNATQNSEPEIFEQRERVAVLKDKLQQMK
SDDAKNLLAVADMLVKKSVWAVGGDGWAYDIGYGGLDHVTASGKNVNMLV
LDTEVYSNTGGQASKATPKAAIAKFAAAGRIATKKDLGLISMSYGNAYVA
SVALGARDEQTLRAFIEAEAYDGPSIIIAYSHCIAHGFDLSMGLEHQKAA
VDSGHWLLYRYNPDRLKEGLNPLQLDSKKPKMPVAEFLNMENRFRILKKT
HPDLAKKYFEAIQHEVNARWAHYEHLANRSIEGEA
>CT1537 nifK, nitrogenase molybdenum-iron protein, beta subunit
MLLRHTTKEVKEREGLTINPAKTCQPIGAMYAALGIHGCLPHSHGSQGCC
AYHRSTLTRHYKEPVMAATSSFTEGASVFGGQANLLSAIETIFTVYDPEV
IAVHSTCLSETIGDDLQQITKKASDDGKIPEGKYVIYASTPSYVGSHITG
YANMVTSMTEQFAVSTGEKKDQVNVIAGWMEPSDMREIKSLASRLGVKIV
LFPDTSDVLDAPQTGKHEFYPKGGITINELKSAGDSKCSLAVGCISAEPA
AIALEKKCKVPFETVDMPIGLSATDRFIMALSKAGSVKVPDEITAERGRL
VDVMVDMEQYFYGKKVALFGDPDQLIPLTEFLLDLGMIPAHIVSGTPGLR
FEKRMKEILERAPGANFRNGPQADMFLMHQWIKNEPVDLLIGNTYGKYIA
RDEDIPFVRFGFPILDRIGHSYFPNVGYSGSLRLVEKILGVLMDRQDRTS
LEEKFELVM
>CT1539 nifN, nitrogenase iron-molybdenum cofactor biosynthesis protein NifN, putative
MTTNASAMTAKTATQNACKLCTPLGACLAFRGIESCVPFLHGSQGCATYI
RRYLISHYKEPIDIASSNFNEETAVFGGSHNLQLGLKNVTQQYKPQVIGI
ATTCLSETIGDDVPMILKDYKAIMNDPNLPTMIFASTPSYSGSHIDGFHT
AVRSAVKTFAVGGAKKNLLNLFSGMISPADIRYLKEILKEFGMPFMLLPD
YSQTLDGGPWGEYHRIPPGGTPTSAIADSGSAAASIEFGSTLEAKKSAAG
YLEAEFGVPRHQLPLPIGIKATDRFFALLEELTEKPMPEKYEDERRRLVD
AYADGHKYIFGRKAMLYGEEDLVISMAAFLREIGVVPVLCASGGKSGQMK
QRMLELIPDMDEQGIEACEGVDFVDIEHEAERLKPDMLIGNSKGFTMSKK
HELPLIRIGFPIHDRFGGQRLHHLGYRGTQELFDRIVNTVIEERQKSSPI
GYTYM
>CT1528 nifV, homocitrate synthase
MIRKPWIIDTTLRDGEQAPGVVFSPHEKKRIAAMLAETGVDEIEVGYPAI
SAAERKVIREIVAMKLPVRLTSWSRANMADIELAAECGTDAVHISFPASR
LYLELIHKKDDWIQEQLHALVSKARERFDFVSVGGQDATRSSTDFLQRFM
LDAEAAGAKRFRIADTVGIATPVSVMALGAALRQSSSLPLEFHAHNDLGM
ATANAFTALNEGFEAVSVSVTGLGERAGNAALEELAMALALNGDFDTHLD
TSMLSRLCDAVATASGRAIQEQKPVVGRSAFQHESGIHCAALLQDPLSYQ
PFLPSRVGRSDFEIVIGKHSGTAAIIAHFNRRGITISKKEARELLDLIRS
QSDRLKRALRTDEIDALREQNSVKHA
>CT1710 nth, endonuclease III
MTTSVSEKIAFIEKALSVIWPNPKSELNFESPFQLLVATIMAAQATDKKV
NELTAVLFKAAPDAASMSRMDVEDIRTIIRPINYYNNKAKNILAMSRRLV
DEFGGEVPASREALESLPGVGRKTANVVLGNAFGIPAMPVDTHVHRVSNR
IGLCKTSKPEETEEALVKVIPEEKLIDFHHYLLLHGRYTCKAKKPECANC
AIREICEWPEKTL
>CT1709 nusB, N utilization substance protein B
MKTYRRQLREKIIQALYTLELRDVDTDSAANWLLTKEIMDDPNAMKFFNH
LMQSIVRNREEIDRYIAKHTFNWDMSRIAIIDKNILRMALAEILYCEDIP
PKVSINEAIEIAKKFSSTDKSSKFVNGILDAIFNDMKAEGRIKKCGRGLV
DHSEQKMQKTENNR
>CT0150 nusG, transcription antitermination protein NusG
MSAKKKVVKEQHPPQLHWYALRIYSGHERKVKESIEMEVERCGLSESIKQ
VYIPYERFVEVKNGKKRSLTKNAFPGYVLIEAVLDKQTRNLIMDIPSVMG
FLGVDDNPTPLRPDEVEKILVPGGAIEHRAVVEAPFKVGDSVKVIDGPFS
SLTGIVHEVCTERMKVKVMINFFGRSTPTELDFSQVKPVSQ
>CT0834 oadA, oxaloacetate decarboxylase, alpha subunit
MKKIRFMDVSFRDGFQSCYGARVKTEDFLPVLEAAVEAGTDNFEIGGGAR
FQSLYFYCQEDAFEMMDACRRVVGPDINLQTLSRGANVVGLVSQSRDIID
LHAKMFKKHGVSTIRNFDALMDVRNLAWSGQCIVNAGLKHQVVIALMGLP
PGLNEPYCHTPQFYLDKLKEILDAGIPFDSVAFKDASGTTTPAVIYETIK
GARKMLPEGTVLQFHTHDTAGMGVACNFAAIEAGIDIIDLAMAPVSGGTA
EVDILTMWHRLRGTDYTLDIDQEKYLEVERMFIEHMDKYYMPPEAKEVNP
VIPFSPMPGGALTANTQMMRDHGTLHFFPEVIRNMREVVAKGGFGSSVTP
VSQFYFQQAFANTVQGPWKKIVDGYGKMVLGYFGKTPAAPDPEVVALASE
QLGLEPTVQDVHDINDRNPDLGIEHNRKLLEEAGLPVTDENIFIAATCGA
KGISFLKGDKPMGIRYKADVEAEEKAKHSEEELKVTSHGNSLQDRLSDLI
KPAGRSNLSGNYMVMVDGKSFNVVIADGMVMAQSIASGAQPFVMPVPTAV
SAPQQHRGTPVMPSMPGNVFKMEVEAGQKVEEGQEVAVMEAMKMESPVKA
PKSGIVTVVLAKPGDAVSAAQALMYIE
>CT2213 obg, GTP-binding protein Obg
MKFVDSAKISVKAGDGGRGCVSFRREKFVPKGGPDGGDGGRGGHVYLRAN
KQLTTLLDFKYRKSYIAGRGGHGLGARKSGKDGKDVIIGVPCGTVVRNVE
TGEVICDMVEDGQEIMIAKGGRGGWGNQHFATATRQAPRFAQPGEPGEEY
ELEMELKLMADVGLVGFPNAGKSTLISVLSAARPKIADYPFTTLVPNLGI
VRYEDYKSFVMADIPGIIEGAAEGRGLGIQFLRHIERTKTLLIMVPSNTE
DIAAEYATLLKELEKFDPSLLSKPRLVVITKMDIAPEDFTMPELEKGVKV
LAISSVAGQGLKALKDELWRQVSLQNQSPSEHAGS
>CT0996 ogg, 8-oxoguanine DNA glycosylase, putative
MPDIMITHKKIYSSLLNTKSRLNFKDTVQSGQSFRWNLNETLKSYYSSVI
YNSIIFICEINSEMIEVLCTDQELSGKPINEFLKHLFSLDFQEETVFSSR
FQQEFPEVWNLVQSYRSVRVMRQDPFEIMVTFMCAQGIGMHLIRRQVSMI
AERYGQKIMLELPEGNLTFHSFPPPQALASADPNELAVCTNNNRIRAANI
IAMARSFESGKLALACVDSGKCDLETLRETLCVHSGIGLKIADCIALFGL
GRFDAFPIDTHVKQYLWEWFGIEEARHSLTEKNYRILQEKARAILGAEYA
GYAGHILFHCWRKEVKKMKAF
>CT1289 ogt, methylated-DNA--protein-cysteinemethyltransferase
MQQESILTNYSSFSCRRMPVGLIGIRAHGGRVTDLLFMHVEPSTALSAPG
SRLIDEAFQQLEAWFSGRLREFALPLVEPRSAFERRVREAMLAIPYGQTA
SYGELAAAIGSPGAARAVGAACGRNPLPVIVPCHRVLGAGGRIGGYRGGT
EMKRWLLDFERRNSG
>CT0254 ompH, outer membrane protein OmpH
MKNLNTPIMSDFKGIKKAVSGFLFMSLTLGMSFSAPQAFAADNAQKIGVV
DYGKIFQMMPETKAADQTLQAMKNQSNAELAKQQSALQSAIQAYQKERKP
NAVKEKELRAQEENLQKSALEKQRLLAQKEQALIIPIRQKIDVAVATVAK
KDGYSMIFDKNARVYGDEQSDITYKVLDKLNIK
>CT0112 paaK, phenylacetyl-coenzyme A ligase
MIWNEHHECMGRDQLASLQGERLRQMVERIYYNVPFYRNKLQEMGIEPGD
IRSIDDLKKLPFTTKQDLRDNYPFGLFAVPQQDIVRFHASSGTTGKSTVV
GYTHNDIMMWSEVVARSLTMAGVTKHDIIQVAYGYGLFTGGLGLHYGAEK
IGASVIPISGGNTKKQLQLLEDFGSTAIACTPSYAAYLGEAIMEEKLDRS
KIKLKAGIFGAEPWTEEMRSQIQQLLGIKAYDIYGLSEVIGPGVSMECSI
QHGMHVFEDHFIPEIINPETGEVLPYGELGELVFTAVTKEALPLLRYRTR
DLTRLHVERCDCGRTLVRMEKCVGRSDDMLIIRGVNVFPSQVESVLLEMS
ETKPHYLLVVDRQNNLDTLEIQVEVEDQFFSDEVKELEGLRKRISGNLTS
ILGLHANVRLVEPGTIERSQGKAQRVVDKRKLNEHKETP
>CT1565 pabA, para-aminobenzoate/anthranilate synthase glutamine amidotransferase
MILVIDNYDSFTYNLVQYIGELGAEVAVYRNDELTVEQALALEPEKIVIS
PGPGTPADAGISIPLINVVKGNIPLFGVCLGHQAIGEALGGKVVRAGQIM
HGKTSQIYHDGKGVFRGLPNPFTATRYHSLVVERESLPAELEITAWTEDG
VIMGLQSSELGLYGVQFHPESIMTSVGHDLIRNFLEL
>CT1341 panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase
MNQPSGNKLPHVTTRRMLDMKERGEKIAMLTAYDYTMARILDRSGVDAIL
VGDSASNVFAGHSTTLPMTVDEMIYHAKAVVRGVQAETRRAMVIVDMPFM
SYQLSPEDAVRNAGKIMKEHECDAVKMEGGKVIAEAVKRITDIGIPVMGH
LGLMPQSIYKYGSYKVRAMEEEEARQLIEDAKIIEEAGAFAIVLEKIPSK
LAGEVSRLLTIPTIGIGAGPECDGQVLVINDMLGLSTDFRPRFVRRYADL
SSVIEQAVKSYVEDVRSNSFPSEDESY
>CT1647 panC, pantoate--beta-alanine ligase
MQIINDPAEMQKIAEKLRLQHQYIGVVMTMGALHEGHLSLVKLAKAHAGT
VIMTIFVNPTQFGPNEDFYRYPRPFEQDAALARSAGVDYLFAPSTEAMYP
DGYSTSIDPGPIATRFEGASRPGHFGGMVTVVVKLLGITRPHLAVFGEKD
AQQLAIIRRVVTDLNIGTTILGAPIVRESDGLATSSRNIYLSSNERQQAT
VLYRAIRYAKMEIDKDRTDLEAIAGEAEALVRSEPDAEPDYLCFVDDATF
EPVTQAVTGKAYRLIMAVRIGSTRLIDNWRFDYQ
>CT0252 panD, aspartate 1-decarboxylase
MKLHMLKSKIHNAIVTSGDLEYEGSITIDKELLDMADMIANEKVLVVNNN
NGERFETYIIEGTRGLREIQLNGAAARCALPGDEIIIMTFAEMEPEEARN
WKPMIVIVDRMNNPKRRHRVGSEDEYLG
>CT2232 pckA, phosphoenolpyruvate carboxykinase
MEPIPINAPDSVRNLKLLQWVRETAELCQPDSVCWCDGSVEEYDRLCNEM
VASGTFIKLSEQKRPNSYLCRSDPSDVARVEDRTFICSIRRQDAGPTNNW
VAPKEMKATLNKLLAGCMKGRTMYIIPFSMGPLGSHIAHIGVEITDSPYV
VTNMRIMTRMGRAVLDLLDEEAEFVPCLHSVGAPLEPGQQDVPWPCNDTK
YIVHFPEERSIVSYGSGYGGNALLGKKCFALRIASSMARDEGWLAEHMLI
LGVESPEGEKDYVAAAFPSACGKTNFAMMIPPGEMEGWKITTVGDDIAWI
KQGKDGRLYAINPEYGFFGVAPGTSEKSNPNAMATLHANCIFTNVALTPD
GDVWWEGMTDTPPDFLIDWQGKPWVPGCERPAAHPNARFTAPAHQCPVID
ENWENPDGVPISAFIFGGRRGDTIPLVYQSANWYYGVYLAATMGSEKTAA
AAGKIGDVRRDPFAMLPFCGYHMGDYFNHWLHVGRTLTDPPRIFGVNWFR
KDENGKFLWPGFGENMRVLKWIIGRVHGRAAAVESPLGWMPRYESLDWRG
LDGFTRDKFSTLMSVDREAWKQELFSHEELLEKLYDRLPKEFTHIRELML
STLWRSPEHWELAPERYTAEH
>CT0202 pcm, protein-L-isoaspartate(D-aspartate)O-methyltrans ferase
MARERQEMVVELKRYGISNARVLDAFLTVRRHLFVDAQSRPYAYSDNAMP
IGFGQTISQPYTVAYMTSLLVERVPSGKVLEIGTGSGYQAAILAELGYRV
YTIERIAGLYAAAGRVLDALGLPVHPRLGDGTLGWPEEAPFDGIIVTAAA
PREPHTLMSQLAEGGVLVVPIGDLGSQQMTVIRRRGERFEHEIFHNFAFV
PLCGREGWADNNE
>CT0591 pdxA, pyridoxal phosphate biosynthetic protein PdxA
MHIVFSTGDIHGIGPEIILKSVLALPSGEDTYMVAGSLKALEFYRDLLGL
PVELRRIEGPEAIAKIAAEPGVLHVLSVAEPDFIQPGSLSKSAGEIAMRS
LETAATLCRDGICDALVTAPLHKEAIALAGYRNTGHTDFLAGFFGVSSQI
MLFVDPVSGLKVALATIHEPLSMVPELVRTMDMDGFFTTLAGSMQRDFRL
ESPKIAVLGLNPHASDGGVMGNEETTVIRPAIERLAASMQIDGPFPADGF
FGAKRYRNYDLIVAMYHDQGLLPFKVLAFDTGINVTLGLPIVRTSPDHGT
GFDKAGKGTASERSFLEAAKLAATIATNRTQTTG
>CT0331 pdxJ, pyridoxal phosphate biosynthetic protein PdxJ
MRLAVNIDHIATLRNARNEGHPDPVEAALLAEKHGAAGIVCHLREDRRHI
KDDDLARLREEITTKLDLEMAMTDEMQKIALSVRPDLVTLVPEKREELTT
EGGFAIQKHFTRLTEFVKPLRDKEIGVSVFIEPEEEAIELAAEAGANIVE
FHTGTYSLCTSDKQTAYELERIRNSARIAREMGLTVVAGHGLSVLNIAPF
KELHDIEEVSIGHAIISRAVFIGLPAAIQEILDLIRR
>CT1180 pepA, aminopeptidase
MKCTVTAKESGLVNADILVQFFSKKEMKRDAGKVLAGLGVVASPDGDFKA
SAGEIAMLYRQASGKEASRVILAGVGEGKTAEDYRKAADSVARKTVDLHL
GVLALDCSPIDDWAKQSKQKPEELAAILVEGVLSGAYRFDRLKSGKLDKE
ETKEDKPKNIEELVLAGCGSRLEAIEKGAGKGMIIGACQNRARDLVNLPG
NHLSAEDLAEAAIEAGKRGGFEVTVFDKKKIVELGMGGLLAVNKGSEQPP
TFVILDYKPKGKAKKTIALVGKGVTFDSGGISLKPAQGMDEMKSDMSGAA
VVIAAIEAAASLGLPLRVVGLVPATDNMPGGSAQKPGDVITTMSGITVEV
GNTDAEGRLILADALFYAKKEYNPDVIIDLATLTGACIVALGNSVAGLFS
NDEKLAESIFEAGQSSGEKVWRLPLWDEYDELIKSDVADVHNTGGRGAGT
ITAAKFLEKFIDGHKHWAHIDIAGPAFWAKGGSKTPGATGFGVRLLLDLL
KGWS
>CT1058 pepD, aminoacyl-histidine dipeptidase
MSNIFLSLEPKILWSHFYRLTQIPRPSGHEEAVRTYVADVAKRCGLEWLV
DEAGNIIVRKPASAGMERRRGVILQAHLDMVPQKNAGTAHDFTKDPIDAV
IDGEWVHARGTTLGADNGIGVAAALAVLESDDLRHGPLEALFTVNEEAGM
TGALGLKPGVLRGDILLNLDSEDEGELFIGCAGGLDGTMRFDYSCEPLPP
GYSGIEIRVSGLRGGHSGMDIDLGRGNANKIMNRLLQMGREHHGLLLASI
DGGSLRNAIPRESVALVAVPSAQKVAFLDELHSLASAIGLELNGVDPELR
VEAADAALPVGMIDDTVAQRLFDAVAACPNGVHRMSEAMAGLVETSNNLA
RVHSDGRAVSVECLLRSASVEGMRELADAVAGIAERAGTVAAFENGYPGW
KPNPASPILKCMVKVYRDRFGKTPEIRAVHAGLECGIIGANNPQLDMISF
GPTIRHPHSPDEKVECSSVLKFWELLVATLEEVPEP
>CT1608 pepP, aminopeptidase P
MSQDLLHTYRSSLRAKIFRKMAAAGLDSLLVTDLATIRWLTGFSGSNAKL
LFAGDSTSVLFTDFRYQEQVRQETSGIATVILKDPLPVELASGLFRLGDR
MALQADHVTWHEQQQLSEKMGNREFTPVSSFFDEFREIKDIEELDRMRRA
VALSETVLEAVIGMIGPGVTEIDIAAEITYRHRKLGAEKDSFDPIVAGGI
RGAMPHAKPTAVAFEPGALIVIDMGCIVDGYASDQTRTVAFGKVSEEQRT
VYRIVQEAQQLGIDAAKAGMAARDLDAEVRNFIAAAGYGEAFGHGLGHGV
GVEVHEAPRVGTASTGTLREGTLFTIEPGIYLPGRFGVRIEDMVALGPNG
AEPLQRFTKELIEL
>CT0303 petB, cytochrome b-c complex, cytochrome b subunit
MAENTQNPAAGTAPAKPKPAVPGAAKPAAPGAAKPAAKAPAKPAAPAATA
PSGVYKPPVDRPDPNPFKDSKMSGVASWFQERFYVLNPIIDYMKHKEVPK
HRLSFWYYFGGLGLFFFIIQILTGLLLLQYYKPTETDAFSSFLFIQGQVP
FGWLIRQIHAWSANLMILMLFIHMFSTFFMKSYRKPRELLWVSGFILLVL
TLGFGFTGYLLPWNELAFFATQVGTEVPKVAPGGAFLVEILRGGPDVSGE
TLTRMFSLHVVLLPGLVMLVLAAHLTLVQVLGTSAPIGYKEAGLIKGYDK
FFPTFLAKDGIGWLIGFALLVYLAVMFPWEIGVKANPLSPAPVGIKPEWY
FWAQFQLLKDFKFEGGELLAIILFTIGGVVWMLVPFLDRQASQEKKSPMF
TIFGLLVLAFLLINTYRVYDSYVLHLPK
>CT0302 petC, cytochrome b6-f complex, iron-sulfur subunit
MAQTGNFKSPARMSSLGQGAAPASAGAVTGGKPREEGLKGVDFERRGFLQ
KIVGGVGAVVAVSTLYPVVRYIVPPAKKIKIVNELAVGPASDVPNGTGKI
YQFNDDKVIVVNHGGSLTAVSAICTHLGCLVHWDEAADMIACPCHGAKYT
QDGKIISGPQPLPLKQYKVKIEDGKIVVSIA
>CT1603 pfk-1, phosphofructokinase
MKKIGILTSGGDCGGLNAVIKGAALMALNKGLELYVIPNGYAGLYNLLNQ
DRLVRLDEARLDQFQASFAGSEAGHSRVKIKAISNPDKYNRIKVGMKKFD
LDALIISGGDDSGSVMIDLNHNGIQCIHCPKTMDLDLQTYSVGGDSTINR
IAQFVEDLKTTGRTHNRVLVTEVFGRYAGHTAFRSGVAAEADCILIPEIP
VDWDVVYEHIVERFTRRIRQSDVHSGTYTIVVAEGMKNADGTDIVDESAG
IDAFGHKKLAGAGKYVCQEIKKRFKTDPRMPQFMKDTGMFVEGIYEIPEV
REVHPGHLVRAGHSSAYDINFGFEAGAAAVLLLLEGKTGVTISKVKGRKI
EYIEASKAIEQRYVDLDQVAMYESIGTCFGRTPAAYEPILREVDGVYERI
Y
>CT0250 pfk-2, phosphofructokinase
MHIGVLTGGGDAPGINACIKTIVTISTEKGYRVTGIRRGWNGLLAFDPDD
PASRTEHIVDLDSELVRRIDRTGGTLLHTSRINPGNLKKKEIPPFLRNSP
HLLRTGLHHSGNFDLTDHVLRSIDALGLDVIIVIGGDDTIGYADHLAKAA
VKVIGVPKTMDNDVYGSDYCIGFGTAVTRSVQYIHQLRTSSGSHERITIV
ELFGRSTGETCLVSAYLAGVDRALIPEVPFDPETLADYVIQDKAANPSQY
AMIAISEGARMIGSKMIEYCGRRYDEDGHEPAGIGQLTRETISMLTGQDV
ICQQLGYLMRSGIPDALDLMVGFNFAQLAVELIAEGTFGVMVALQKGIYT
CLPLAEVSSNTKQVDISELYDPRYYRPKMRSVMGKPMFLY
>CT0988 pgi, glucose-6-phosphate isomerase
MYLSRSAEWSALESHYQDISHQAMIDLFSTDPNRHERFSLSFNAIHLDYS
KNRISARTMELLMDLVRRSGIEKKRRQMFEGEQINFTEHRSVLHTALRRP
PGYTMTIDGNDVASEVSDVLDQMKAFCKKVISGEWKGYTGKRITDVVNIG
IGGSDLGPFMVTEALKPFAHGKLKVHFVSNVDGSHLVETLRGLNPETTLF
IIASKTFTTQETLANAVSARAWFLVKAGNRDHVAKHFVAVSTNREKVEEF
GIDPDNMFRFWDWVGGRYSLWSAIGLSIALYLGFDRFRELLAGAHAMDEH
FLNAPLEENMPMILAMLGIWYNNFFGAHSQAIIPYDQYLHRFPAYLQQLD
MESNGKRVDRAGHEVDYATGPVIWGEPGTNAQHAFFQLLHQGTEIVPVDF
IVSLKSQNPVGEHHDMLVANCFAQSEALMKGKSEAEARAELEAAGLSGGD
LEKLLPHKLFPGNRPTNTIVLDELNPFNLGSLIALYEHKVFVQGVVWNIN
SFDQWGVELGKQLAKAILPEFDAVDPVETHDASTNALINRYRQFRNGLKF
PKSNQLKMF
>CT2222 pgk, phosphoglycerate kinase
MQKKTLSDISLQGKRVLMRVDFNVPLDQDRNITDEKRIVEALPSIRKVID
NGGRLILMSHLGRPKGKVNPAFSLSPVAKRLSELLDCPVTMAGDCIGTEV
MQQVLALQDGDVLMLENLRFHPEEEANDPDFARELASLGEIYVNDAFGTA
HRAHASTEGITHFMQTAVAGYLIEKELRYLGTALNDAQRPFVAILGGAKI
SGKIDVLESLFEKVDTVLVGGAMVFTFFKAQGLDVGNSLVEENKLELAVS
LLEKAKAKNVRLLLPEDVVVAGEISADAPSRVEPVSAISAGMIGLDIGPA
TIETYRKEILDAKTVLWNGPMGVFEIDQFARGTFAVAQALADATAEGAIT
IIGGGDSAAAIAKAGLSDKVTHVSTGGGASLEFLEGKELPGIAALND
>CT0336 pgpA, phosphatidylglycerophosphatase A
MRQWLGRIFGSAFGIGYVPFAPGTFASGAAALLYLYIPTIRELPLLALLI
ALSIVLGVWAGGAMEKEYGEDPSQAVIDEVAGQWISLFAIPFSPLAVLLA
FIFFRLFDVLKPGIVDRAQHLPGGWGIMADDVLAGILANLLLRLVMLALP
MLPYGLSL
>CT0335 pgsA, CDP-diacylglycerol--glycerol-3-phosphate3-phosph atidyltransferase
MTFSNQLTILRIILVPVFVLLLMQDSAWYRLLGVIVFVTASLTDIYDGYH
ARKYGEVTRLGAFLDPLADKLLITTAFLFYVWEGYLALWMVLLVAARDIV
VTGLRVYAEHIDHPVVTSKEAKYKTFAQNLFAYVIMLFILLKEKSFFGPK
MAAFMEVILHSPWLGYAMFAITLFTVWTGVSYLISNRRLIFRNSAGGR
>CT1666 pheA, prephenate dehydratase
MTNWLIAYQGEPGAYSEIAALRFGEPLPCESFDDVFSAVTEQKADYAVIP
IENSLGGSIHQNYDLLLRRPVVILAETFVKVEHCLLGLPGASVETATKAM
SHPQALVQCHNFFATHPQIRAEAAYDTAGSAKMVAESRDKSALAIASKRA
GELYGLDILKENLADEEWNITRFFCIAHENNPDISHLKVRPDVARQKTSI
VFALPNEQGSLFRALATFALRGIDLTKIESRPSRKKAFEYLFYADFIGHR
EDQNVHNALENLREFATMVKVLGSYGVVNP
>CT2130 pheS, phenylalanyl-tRNA synthetase, alpha subunit
METSIQQLQQEIDGYQIRNAKELEAFKLEFTVRKGKIAGLFGQLKTVDSA
DRPRIGQLLNALKLSAEAKVADAESLFAQNAETEAPALDLSLPGRRSFLG
SEHPVQKVLGDMKRIFTAMGFGIATGPELELDEYNFDRLNFPPNHPARDM
QDTFFITRGQEEGDVLLRTHTSPVQVRVMLDEKPPIRVICPGKVYRNEAI
SARSYCVFHQLEGLYIDKNVSFADLKATIYSFARQMFGSDVKLRFRPSFF
PFTEPSAEVDVTCYLCGGKGCRVCKKSGWLEIMGCGMVHPNVMRNCGIDP
EEWTGYAFGMGVDRTALLRYKIDDIRLFFENDVRMLSQFMA
>CT0730 pheT, phenylalanyl-tRNA synthetase, beta subunit
MKISVNWLKEFVPSLSFDCSGLVDYLTFLGLEVEDVFEQKLPDQKVIVGK
IVEVRPHPNADRLRICMVDTGEGELRQIVCGAPNVEAGMMVPVATIGAVL
TAVSGETFTIKPAKIRGEHSSGMICAADELGLSDDHDGVMVLDEACEIGQ
PLARYLETDTVLDIAVTPNRPDALSHLGVARELADCNEIVYPQAPVIEFT
RGGGLIEVQDEESCPYYTATVIKGVTVGPSPRWLARRLEQIGLRPKNNIV
DITNYILHSFGQPLHAFDYHQLAGSRIVVRSDAESSFMALNKVEYQLQPG
MTVVCDAREPVAIGGVMGGLHSAVTDKTTDILLEAAYFNPASVRKTAKQL
QLSSDSSYRFERGVDPCNVKRAAEYAIAMILEIAGGNVDSAEAWGDMPAA
QKIVSLRPKRVNAVLGSSITASRMVRLLEKICIKAVSQEAVSDDVDSIAF
SVPSFRVDIEQEIDLIEEVARLYGYNNLEPAPVMVSSYPVSRKVPEYFPD
YLRSIMIGLNFREVLTNPLIRKAEADCFSSMLVNVLNPISEELEVLRPNL
APSLLKVVGYNMRHGNRELRLFEVAHGFEKQPEAGRGNEGPLSAFLEKEL
LSMVITGRREPRSWNRQDENVDFYDLRGVVEMLLEKLNLLEKSAFNIYNA
RTIGIEITSTENGKTSVLKAGTVQQVNREVLDVFGLDQDVYLAELDVTLL
ERCFESGVIYEPPSKFPVVERDLSFVLPRHIPAQRLIDLAKASDPRVRSV
RIFDVFDRGTTQGEPSTRSVALSLELADRSGTMNEEAISAVISKVIDNAR
SELGAVIRQV
>CT1003 phnA, phnA protein
MSDLPNCPKCNSEYTYENGLLLVCPECAYEWSHLEAGDAEQVRVWKDANG
NILQDGDTVTVIKDLKVRGASGVIKGGTKVKNIRLVEGDHDIDCKIDGFG
TMQLKSKFVRKG
>CT0900 phoU, phosphate transport system regulatory protein
MSERPVHEYIKELSEALVQLSDKVLQNFNEALYAVTHKDVQSARKIRIVD
DEIDQTEVKLEERCLAFLALQQPVARDLRTMVTIIKINDDLERIGDLAVH
IIERMPALGHEVMERYQFEKMGNLSATMVRKAIEAFVTRDRLLADNVCAM
DEDVDAMHRMIFKKVADAMKACNTDTEQLLAVLSVSRYIERMADHATRIA
REVIYLVTGEIVRHSDDTFEKLIQSLKD
>CT0511 phr, DNA deoxyribodipyrimidine photolyase, classII
MAKAITIDERRILRLNQREDRQGPVIYWMSRDQRVRHNWALLFACQKANQ
LGQPLEVVFTLSPSFLGAPMRHYDFMFRGLREVETRLRELGVPFTVLYGE
PGETLPKYTEKRNAGVVVADFSPLKLVRGWKLAVAQQLSCAFYEVDAHNI
VPCWLASPKQEYAARTIRPKLNALQGEFLTGFPEPELRHQPDTLPPPVQW
NAMETLLKVDRSIKAVPGLEPGETAAEARLRSFVTGRLSRYADERNDPNS
GAVSGLSPYLHFGQLSAQHATFEAARSKASEVNREAFVEELFIRRELSEN
YCYYNERYDSFDGIPEWAKKTLMEHAGDHRDAIYTPEQFERAQTHDPLWN
AAQTQLLETGIIHGYMRMYWAKKILEWSATPAAAFDIALMLNDRYALDGR
DPNGYVGVAWSIGGLHDRPWTERPVYGTIRYMNSNGCKRKFDVPRYIAEM
TGKSQATLF
>CT2036 plsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase
MNFQTLLFFLVIIPVMFFGMLFALVLNLFDPSGDQFHKMAAWWGRFSARV
LGIEVKVEGEENYNPNKNYLVVSNHAGMADIPLILGSMKLNLRFVAKEEL
GKIPVFGPALKSAGYVFIKRGQNREALQSMLKAADTLKAGRSIHIFPEGT
RSKTGKILPFKRGAFIIAEKAKVPVLPVTIVGSNLITPKKSLKINHGTVR
LIIGKPIEPAKAEALMKESYSVISENLEKSAA
>CT2113 plsX, fatty acid/phospholipid synthesis protein PlsX
MTIVVDAMGGDNAPACVVEGVIDALRESGNRFEILLIGQEEKVAPLLQQY
DTGALKLRFVHAPEVITMEDVPATAVKAKQESSLVRGLKLCKAKDADAFV
SAGNTGAMMAASLFVLGRIPGVLRPTIYAYFPRLGEGLTNLVDVGANVDC
KPENLVQFAEMLTIYQRYAAKIEQPVVGLLNIGEEEGKGPDYLKQAWKML
QKAHEEQKINFIGNIEGHDILAGKATIVVCDGLVGNTILKFGESIPHFLG
AIFKPALEKLVKEGKLDQNSAVLAGQTFKGIFEPFDVEKFGGVPFLGVDG
ISIVGHGRSSARAIKNMIYMAEHMIEQRVNERIAKMLA
>CT1649 pnp, polyribonucleotide nucleotidyltransferase
MFIKKKIDLGHGKVITIETGKMAKQADGSAVVTMNDTMVLATVVSSKTPP
SPNQDFFPLQVEYREKYSAAGKFPGGFFKREGRPSEKEILSARLIDRALR
PLFPDGYYQETQIIISVISSDTINDADVLGGIAASAAIMVSDIPFANPMS
EVRVGRINGLFIVNPDINELAQSDMDICIGGTEDTICMLEGEMKEISEAE
MLDAIKFGHDAIKKICALQRELAAEVAKPKRPFSPTVAPDELVNFVEEHC
SAELKALAYTPLAKEERAEKTKAIYTQTIRKTLTHFTDRVGPDQIEADPT
SAFCLNEHMIEECIHAVEKKVMRHMILDDGKRLDGRTLEQVRPISIELGL
IPRAHGSALFTRGETQALVTLTLGTKKDAQSVDTLTDDKDKRFMLHYNFP
PFSVGEIGRVGGAGRREIGHGNLAERAIKMVMPSEQEFPYTVRLVSDILE
SNGSSSMASVCGGTLAAMDGGIPLKKPVSGIAMGLIKEGDRYAVLSDILG
NEDHLGDMDFKVAGTRDGITACQMDIKIDGLDYHILETALEQARKGRLHI
LDVMAEAIPESRADIGKYAPRLTTIQIPVDAIGMVIGKGGETIRSITEET
GAEINIDDDGTVTIACSSPEATKAAVETIKTLVSKPEVGTIYMGKVRDIR
DELGAFVEFLPKTDGLVHISEIARERIAKVSDVLKVGDRIKVKLIDVRKD
PRTGKTKFALSIKALLDTDQQAETNGEAKPARD
>CT1667 polA, DNA polymerase I
MTSENQIGLFDAPVSEPKPAATPQKSPADAEKTKPGLFLLDGMALVYRAF
YALQQARMSTRDGVPTGAVFGFATSLLRIIEEYRPDYLAVAFDSPEKTFR
HEKYKAYKANRPAPPDDLINQLDNIRELIRACGIPLIIMPGFEADDLIGT
TARKFEADCQVFIVTPDKDMSQLVHDGVRILKPGKKQNEFELLGSREVAE
QFGAPPEQFIDLLTLTGDTSDNIPGAKGIGPKTASSLLKKYGSLEGIYAN
IDALTPKTRQSLEAFREMMPLVRELVTIRTDLDLPLTLEALHASKPDPEA
LFALLAKLELKSIAARLPAVLQIDTSAASSSENAPAMANSVDPGDPQLIP
AGEGSEYHLIDSKEAFDKLLTLLENSGGFAVDTETTSLDTFTAELAGISC
SVKPGEAFFVYFGTPGLDAKTTVARLKPLLENPEIPKTGQNLKYDILVLK
KYGVELAPVGFDTMLASYVLNPEARHNLDDMAALYLGRQTTKYTELVGTG
KQTIGIFEVEPRKLSDYACQDADIALRLRYSLEEQLEKTPELLEVCRKLE
FPLVRVLADMEHKGISIDTAHLEQLSIKVTGELATLTERIFEATGETFNI
DSPKQLGHILFDVLKLPAKKATKTGYSTNVQVLEELAMLHPVASDLLEYR
SLQKLKSTYIEALPKMINPLTGRVHTSFNQSITATGRLSSSNPNLQNIPI
RTELGKEIRRAFIPSNPGNLLLSADYSQIELRIAAELSGDPMLIEAFRNH
EDIHAATARVIFDTKEITPDMRRKAKEVNFGVLYGIQPFGLSQRLGIPQK
EAKEIIDTYMAKYPGMFSSLQTIIEEAKKKGYVTTLMGRRRYVPDLNSAN
SNIRKAAERVTMNTPIQGTAADIIKFAMCSISRELKKGEFRSAMLLQVHD
ELLFEMPPDEEARLREMVERNMIEAASKCGLKNVPVEVDTGSGKNWLEAH
>CT0176 ponA, penicillin-binding protein 1
MSKKFVMKIAGLLLVVLLGAGAVFGLDLFKGLPSVQELENPKPELASLVY
SEDGKLLHKFFLKNRTFVPLRSISRWAPAALIATEDVTFYQHWGVDLRRL
AIALGENIIKGRTRWQGASTITQQLAKNLFLTQERTLSRKAKELVTAVQL
ERTYTKNEILALYLNTVYFGSGAYGIESAAQTYFGKPASQLTIPESATLI
ATLKNPTAYDPSKDPSSSLARRNLIIGLMEKAGFITHAQAVRAKATPLVL
HYTPVTQQGLAPYFTEYIRQTIKSSSMLEGVNVYRDGLTIQTTLDSRMQQ
YAQQAIGEQAAALQAQFDRSWRCPESLKIQFIKESARYKEMVNEEGVPAN
TALARLKADSAWLNNLLHQKTRVQMALIALDPSNGHVKAWVGGTNFGPDD
YRYQFDHVWQAKRQPGSAFKPFVYAAAIDQGIPANYRILDQPLALKSGNG
GVWVARNAEGGSGGMTTLRDALAHSLNQVTVRLAYEFLSPSEIISYARRM
GITSPMQPNLSIALGTSEVSPLELARAYTTFANNGAWCDVLPVTKVLDKH
GRVIAEHQPSSHLGLDPATNYVMVTMLKDVIDRGTGIAARTRYGLDMEVA
GKTGTTQSQRDAWFAGFTPKLVAVVWAGFDDERIHFTSMEYGQGARAALP
AWANFMKKCYSDPSLKLEKTYFPMPDNVIAVPIAGSSTEPSDVLNRNVSV
EYFTPKGYARYQAGDFNRAPQPVNSGAVGNPDSTGAQQPPVPTAPTAPTA
PASTDSTGSRARR
>CT0823 ppa, soluble inorganic pyrophosphatase
MLRLDRVLFSSVLYPENYGFIPKTLGEDHDPLDIVVISQCSIVPMCIVKS
RVIGVMRMIDHGENDDKIIAVAADDMSVSDIHDIGDLGKHFKMELQHFFE
EYKALEQKTVLVEEFQDAATAKQILLDSIKRYSETYA
>CT1640 ppc, phosphoenolpyruvate carboxylase
MITSTTSVVDFDKALLDFRYLLDCFIEVLENLGQEALARHIGGDSKTPIR
EFDCAERAIQAWSIVFQLLNMAEENSAAQYRRRIESSEGVESLSGLWGEA
LRHLKNLGVTEAEIAAELPQIEVEPTLTAHPTEAKRRTVLEQHRELYLLL
VKRENRMWTPAERDKIRTDIKVVLERLWRTGEILLQKPEVSAERQNVLHY
LHTIFPDILPILDNRLEKAWAVAGFDPAITADPRNLPRLSFGSWVGGDRD
GHPLVTAETTQETLQEMRRHAIELVHRQLERLASMLSLSGRLQSPSRAME
ERMETLRHKLGKAGALAFERNLHEPWRQFVNLMIAALPAGTTENEHYCYR
RHTELLKDLELLMSSLTAINASTIARGDVAPVYRIVQTFGFHLGRLDIRQ
NSRFHDLALSQLMMAAGLDGKGFPEWSEEARLEFLERELLSPRPFTHPDM
HAGPEAETVIACYRVLFDHYRQFGPDGIGSLIVSMTRNLSDLLVIYLFAR
EVGLMVRHGDGDACPIPVVPLFETIDDLERSHDILDRFLAHPVTRRSIAL
QQELHGRQKPVQQVMVGYSDSNKDGGIVTSLWSLYRSEERLIETGRKHGV
DILFFHGRGGSISRGAGPTHRFIRAQPHGSLDAGLRVTEQGETISQKYAN
RISAAYNLELFLAGVTEARLVHRKEGYRPHPLEKVMDTLARSSNKAYRQL
IEADGLLEFFRQATPIDIIESSRIGSRPARRTGSQSLEDLRAIPWVFSWN
QARFGISGWYGAGSALEELRATDPETFEVMRKSDFSWAPFHYIINNIATS
ILSVDPWIMEQYASLVENKNIRENLLGMIQAEYLRTMRLLDLLYGGPLRE
KHFNVARFIETRQEGLSKLHRLQIDLLKSWRSAKASGNEQQADALLPELF
LTVNAISSGLRTTG
>CT1682 ppd, pyruvate,orthophosphate dikinase
MPNSHANEESAAAKKYIYSFAGGAAEGDASMKNLLGGKGANLAEMANIGL
PVPPGFTLSTEVCAYYYDHQKTYPANLFETEIPAALKKVEDYLGKKFGDP
ENPLLVSVRSGARASMPGMMDTILNLGLNDKTVEGLARKSGNPRFAWDCY
RRFVSMYGDVVLDLKPADKKQIDPFEEILEQKKQELGIHLDTELGVDDLK
DLVGKYKKAILEKTGKTFPENPEEQLRGAISAVFNSWNNERAIVYRKLNH
IPGWWGTACNVQAMVFGNMGEDCGTGVAFTRDAATGDNIFYGEFLMNAQG
EDVVAGTRTPLKIEQLAQEKPAIYNQLEEIRSILEKHYRDMMDIEFTIEN
DKLFMLQCRVGKRTGLAAIKIAVDMYNEGLIDEKEVLRRIEPEQLNQLLR
PVFDLKEKKAAIDSGRLLATGLNAGPGAATGRVYFNADDAMEANARGEKV
ILVRIETSPEDIKGMNAAEGILTERGGMTSHAALVARQMGKVCVAGCGTL
RIDYKAGEMRVAGKDIVIKEGEYISIDGTTGEVIAGEVKTKNSEILEVLI
DKTLDPADAPTFRIYNQLMQWADKYRKLNIRTNADQPDQAEIAITFGAEG
IGLCRTEHMFFGGDRIDAMREMILADDIAGRKVALDKLLPYQRDDFYGLF
KAMGSRPVTVRLLDPPLHEFLPHTDAEIDDLAGKIGKTSAEIKARIESLH
EFNPMLGLRGCRLGILHPEIIVMQVRAIIEAACRIKQEGQEIVPEIMVPL
VSTVKELEITSEVIHKTARSVISEQGVGVKYLVGTMIEVPRAAITSDQIA
QAADFFSYGTNDLTQMGLGMSRDDSGQFLPIYQQQEIFARNPFESIDIDG
VGRLVSISAKEGRSVKPDLKLGICGEHGGDPATVEFCHKTGLNYVSCSPF
RVPIARLAAARAALL
>CT1813 ppiA, peptidyl-prolyl cis-trans isomerase, cyclophilin-type
MPEKFIIKTSMGDISIALYDDTPRHRDNFVKLVGEGYYDGIRFHRVIEGF
MIQSGDPLSRFDEKRMMHGTGGPDYRVPAEIKHPNKKGTIAAARDNNPQK
ASSGSQFYINQADNGFLDGEYTVFGVVESGIDVVDAIAAVETDMRDNPLK
PVTIETITPATA
>CT0887 ppk-1, polyphosphate kinase
MSDPSLYINRELSWLHFNRRVLDEAIRQDQHPLIERVKFIAIFSSNLDEF
FMIRVAGIEKQVEAGIRKKTIDGLTPSEQLERIRAEVIEQLKLRNTCLYG
DILPALAAEGITFVHFADLPEKEQAVLNAWFRKEIYPVLTPLAFDTGHPF
PFMSNLSLNLAIELDEVEHGNLKFARVKVPSVLPRLLKLNDIEGLGNDPS
CMRFLWIEELIQQNLGLLFPTMKIVQSHQFRIIRNADIEIEEDEAGDLLQ
TIEKGVRSRRYGNVVRLDISPEMPDFVRQLLINNLEIEEKNVYEIDGALG
MSCLMELLDIDRPSLKDEPFIPFNMFEEQRNGDIFSAISSGDLLFYHPYD
SFKPVVDFIDRAASDPDVLSIKQTLYRVGSNSPIVKALMKAAESGKQVAV
LVELKARFDEENNIGWARALEDVGAHVIYGLPGLKTHAKLTLVVRREPQG
LKRYLHLGTGNYNPSTGKLYTDYSFFTDDELLAGEVSELFNALTGYFRYT
GYRFLLVSPINTRKRIIEMIEREIALARKSSGGRIIMKMNSLVDPATIQA
LYRASRAGVQIDLVVRGICCLKPGIPGVSENIRVISIIGRYLEHSRAYYF
ANGGSPELYLGSADIMPRNLDDRVETLFPVFDPSLVERVRNDLELQLSDN
LKAWKIGPDGNWTLVRNDAPKVNSQERFMKRRTQKKKTTGIKGRLGLN
>CT1049 ppk-2, polyphosphate kinase
MTDSENGTLVPAAIAEPDLRDPLLYVNRELSWIDFNQRVLEEALDSAAHP
LLERVKFLSIFSSNLDEFFMIRVAGLDDQCAAGINERSVDGLTPIEMLER
IRERVIGQLRQRNACFFDDILPALKRKGIEFVSVSSLSVEQQQLLQHYFR
KEVFPVLTPLAFDTGHPFPFMSNLSLNLAIELEDEESGAIKFARVKVPGI
LSRIIRLDQIEGLGFDDGRIRLLWLEDLVEHNLDQLFPKMRILQCHPFRI
IRDADIEIEEDEAGDLLESIEQGVRSRRYGKVVRLDINPDMPHSIRSLLV
KNLETYERNVYEIGGVLGMSALMELLKIDRPDLKDELFVPNNPLDDKRTA
DIFAEMRSGDMLLHHPYDSFKPVVDLIWQAARDPDVLSIKQTLYRVGSNS
PVVKALMFAAEQRKQVAVLVELKARFDEENNILWARSLEDAGAHVVYGLP
GLKTHAKLTMIVRREQEGLRHYLHLGTGNYNTVTARIYTDYSYLTTDPVL
ADDVTELFNSLTGYSKHREYRSLIVSPLNMRRWIMEMIRHEVDHQKHTGN
GRIVMKMNALVDEEIIRALYRASMAGVKIDLAIRGICCLKPGIPGISENI
RVVSVIGRFLEHSRVYYFNNGDHARIFLGSADIMPRNLDKRVETLFPVIE
PRLVESIKSDLELTLSDNRKSWEMQPDGTYIRKRGGRPAVDSQRLFMRRS
LRRKKNIKKKVKGL
>CT1054 prc, carboxyl-terminal protease
MAAVGQAAQPALLKPTPNQEEAARYIAQYLLQNHYRKVPVNDSLSQQIFK
RYLDNLDNNRSYFTAPEVEKLRQEFGAHLADDFVSGNPADGFAIYNQFLK
RAREKMAYMKNALEKTTFNFSSPETLELERSKTAPWPANEAELHDLWRKE
LKYQYLSAKYSGEKGKNIKSEVMKTLDNRLKIFNQQKPEDAFQAYMYAVT
TSFDPHTDYFSPDEYENFQIDMSRSLEGIGAKLQMENEYTVVNEIIPGGP
AFKSKLLKKGDKIVGVGQGDKGEIIDVIGWRINDVVKKIRGPKGTVVRLK
VLPASQAGKGPAKIIRLVRAKVDLQEQAAQKKIIYDNGHKIGVIVLPSFY
LDFEGERQQKHNYASTTKDVARIINELKQENVEGIIVDLRENGGGSLEEA
VNVTGLFTGRGPVVQVSNALGGKMVLNDENYPMLYRGPLVVLVNRYSASA
SEIFAAAIQDYGRGLIVGDRTFGKGTVQSIVTIQRPFSMFMKQADLGQLK
LTIAKFYRISGGSTQHIGVLPDIVLPSLIDPEVVGEDTYTSSLPWTTISR
AAYTPLGFVTKEDIVLLKKEFAEQSAKDKLYQSYLADLATLNRIRQKKSV
SLQEKGFQVENKTLKEIQDRWGDQNADTGKKKKTDFILQEAARILDDLVA
LKSRPAVPAAAPLARPAVRRAVPVK
>CT0123 prfA, peptide chain release factor 1
MFDKLQSIKDKFQTIEQQLSDPEVVADQNRFRKLNKEYSSLKEIVRAYDE
WSRTKKQLDEAHSMQKNENDPEMRALVEEEAGELQERLPKLEQQLKILLL
PKDEADSRNAIIEIRAGTGGDEAGLFAADLMRMYQRYAERQGWSCQTLEV
SEGSVPGSLKEVSLEVSGHNVYGILKFESGVHRVQRVPETETQGRIHTSA
ASVAVLPEAEEVDVEIRKEDLLIDTFRSGGKGGQNVNKVETAVRITHVPS
GIVVACQEERSQLQNRERAMKMLRSKLYDLQIAEQQKSRADLRRSMVTTG
DRSAKIRTYNFPQSRVTDHRIGFTSHALPQIMQGELDPLIEALRMHDQAE
RLQAETA
>CT0818 prfC, peptide chain release factor 3
MELAKEIARRRTFAIISHPDAGKTTLTEKLLLFGGAIHTAGAVKSNKIRK
SATSDFMEIEKQRGISVATSVMGFEYKGKRINILDTPGHKDFAEDTYRTL
TAVDSVILVVDSMKGVEEQTERLMEVCRMRHTPVIIFINKLDREGRNPFE
LLDELEEKLDIQTCPMTWPISQGQTFKGVYNLFDKSLNLFEANTSQIGQK
LTDIEGIDDPKLADWVGTANAAKLREDVELIEGVYEPYELEMYREGLQAP
VFFGSAVNNFGVRELLDTFIDIAPCPHEREASERIVYASEAALSGFVFKI
HANLDPNHCDRTAFFKICSGRFERNKFYQHTRLGKKVRFSNPTQFMAQEK
NVIDEAWPGDVIGLYDNGSLKIGDTLTEGETLHFRGIPSFSPEIFKVLEN
RDPLKTKQLEKGIRQLTDEGVAQLFVQYGTRKIVGTVGELQFDVIQFRLE
HEYGAQCSFTPLRFHKAFWITSDNQKQLDEFMRRRANVIAFDKEDHPVFF
AETEWMLKIAKEDFPEIEFHSTSEFKTKNQD
>CT1187 priA, primosomal protein N'
MLMYAIVYVERIYRDEPFRLSMPEGLKTTIQPGCQVLLNLARHKASAYTG
YVWSLEKAGEGDIEGEVLDLLNSGVPVLTPVMLKLALWIADYYAALPIDC
LSTTLPAPLRSTVDDVVELAGFQLESPELRVKSTGLRRSILKALATEKRL
TVRQLRKRLGRRELYSAIAELERAGLVNVRKSFVETKPRTVTTWKLAKTL
PDEPEKLIARSPKKREAFELLASRPEQLFRAGEGGISRTVFSGLVTLGLA
EKIETAAPSGESLRFDEPQKEIHSLSPHQQQALDALTKALYEQQFRTFLL
HGVTGSGKTLVYIELLRRVLAEGKTAIVLVPEISLTPQTAARFRHYFGDD
IQIMHSAMSDREKYDAWQRLRQGKARIALGARSTIFAPLENVGAIIVDEE
HDAAYKQDRTPRYHARDTAVMRAMFENALCVLGSATPSFESYRNALEGKY
TLLELPERIDNARMPSIELVWMPGSQRVTPSISGALYDAIRERLKRDEQV
ILLQNRRGFAGSLLCLDCGHTPQCRHCNIPLVYHASDRSLRCHYCGHIEP
FRQTCPACKSENLFYKSSGTERIEEELGELFPEEKILRMDIDTTSTKDAH
ASMLTAFREKRARILLGTQMVAKGLDFPEVTLVGVLMADIGLNLPDFRAA
ERIYSLLMQVAGRAGRSSMPGEVLLQLYNRDNELFQHVIRADYRHFFEAE
MATRRELAYPPFTRLIKFECSSPSEAVAEKGAVALRKHLRPLVPEAFGTI
LGPAPAGISKIKGRYRSQLIIKLTGIKLSAALLRQVQYETLSAFRGENLV
ITVDVDPQHLL
>CT1473 proA, gamma-glutamyl phosphate reductase
MTEHEAIVKQLQAVQNASRKIVPLNEETINGLLVELADRIPAAADAILEA
NRKDLERMDPADPRYDRLLLNEARLNSIATDLRNVAALPSPLHRVLEERT
LPNGLELKKVSVPLGVIGIIYESRPNVTFDVFALCLKSGNATVLKGGSDA
AYSNIAIVNLIQTVIRDRGLDPDMIYLLPAEREAAHILLNAVGYIDVIIP
RGSQALIDFARKHSTVPVIETGAGIVHTYFDQSGDLAMGRDIVFNAKTRR
PSVCNALDTLIVHESRIDDLPVLVVPLEEKNVQIFADEPAYYKLLGRYPD
ELLEMASPEHFGTEFLSLKMSIKTVGSLEEALNHIARHSSKHSEAIIASD
QTTIDAFMKQVDAAAVYANTSTAFTDGAQFGLGAEIGISTQKLHARGPMA
LKELCSYKWLVTGQGQVRTA
>CT1457 proB, glutamate 5-kinase
MSHQHPVYRRIVVKVGTNVITGRNGKLDPAILDSLTSQIAALMQDGVEVI
LVTSGAVSAGRGIVSLSGNLTPVETRQVLAATGQIRLINAYNDRFKKHGI
TGAQLLVTKGDFRDRQHYLNMRTCFEALLQQKIVPIVNENDAVAITELMF
TDNDELSGLIASMLQVDANIILSNVDGLFDTQSEGNPVIEEIDPGTKNFS
QYIRPGKSEFGRGGMLTKCNIAHKLSRLGITVHIANGTTPGILQTIAHGG
KAGTKFIAQKPKQSRKRWVALSEGLEKGAVIINQGAIDALTSGQRATSLL
PVGITGVEGSFQRGDIIRICSTDGKVIGYGMASCTAEKARSAMGQKGLKP
VIHYDHLYLVP
>CT0077 proC, pyrroline-5-carboxylate reductase
MARLSIGFIGTGRIAQALISGLSHDPNIVICGYDKMPDALHSVSLQYGVN
TEESIESLARDAEIIVIAVKPYQMAEVLAELKPALHGQHLIVSVAAGIST
GFIESNCPEGTRVVRVMPNTPAFVGEGMTALCKGMHATADDLLVAERIFN
AIGKTAIIEESGMDAATAVSGSGPAYMFRIIDSLAEGGEACGLDRETAQL
LAAQTMLGAAKMVLSGHKSPEELVREVTTPGGTTEAGLKAMDERDLRGAL
IDTVRAAAARSKELMK
>CT1491 proS, prolyl-tRNA synthetase
MADKITSRQEDYSQWYIDLVRSAKLADYSDVRGCMVIRPNGYAIWEKMQA
ALDGMFKQTGHVNAYFPMFIPESFIAKEAEHIEGFAPECAVVTHGGGEEL
AEKLYVRPTSETIIWSSYKKWIQSYRDLPILINQWANVVRWEMRTRLFLR
TTEFLWQEGHTAHATPEEAQEEVIRMINIYRTFAEEYMAMPVIVGKKSES
EKFAGADATYCIEAMMQDGKALQAGTSHNLGQNFAKAFDCQFQTKDGVLD
YVWATSWGVSTRLIGALIMAHSDDKGLVLPPKLASRQVVIIPILKGNKDE
VRARARFIAKTLNRHGIPTFVDDSENNSPGWKFAEYELQGIPVRIELGPR
DLEQGKCIVARRDTFEKTELLLDDELTINIEEILNNIQQNLYDRALQFRL
DNTVEATTWEEFKASVEKGFVIAHWDGTHETEALIKEETKATIRVIPTDE
EYRQQYNMDEPGTCIRSGKPAAQKVVFAKAY
>CT1361 prsA, ribose-phosphate pyrophosphokinase
METPIKIVAGRSNPELAKKIAAYLGTPLCDAKAENFSDGEISVNYFESIR
GSDMFIIQSTNPPADNLMELLIMIDAAKRSSAYRITAVLPYYGYARQDRK
DKPRVAITAKLVANLLTQAGADRILTMDLHAPQIQGFFDIPFDHLYSSVV
LIDHVKNMDIADNLVVASPDVGGVKLARKFASELGTELVIVDKRRPKANV
AEVMNIIGDVKGKNVLLVDDMIDTAGTIVNAAKAIKEAGGLKIYAAATHP
ILSGPAIERINTSVFEKVIVTDSLVSEHDFCSKIETVTISNLFGEAIKRI
YDGESVSYLFDSKNISQKITNHH
>CT1890 prtC, collagenase
MPAKPQLISPAGDWTSMRAALDAGADAVYFGAEGFNMRAASKGFSPGDFG
GIARLCRSHGAKAYLALNSVIYDAELDEVDRTAAAAKAAGLDAVICWDLA
VIEACRRHEMPIHLSTQASVSNINALRFYASLGARMIVLARELTLEQTAA
ITKCIAAEKLPVTVESFVHGAMCVAISGRCFLSQELFGRSANRGECVQPC
RRSYRIADVEEGFELELGSNYVMSPKDLCALPFLDKLFDAGIGAFKIEGR
NRSPEYVATTTAAYRKAIDFIAAHRNDKDFDEAYRSLTDRLQNDLVRVYN
RGFSEGFFFGKPADAWTKHSGSAATETKSYVGVVRKYFPKAGVAEILVHA
PSVDSGVRLSIQGPATGLVTVPDAELHLDGQLANRIEQGQIFTVRCDRVR
KNDKVYVLLKN
>CT2019 pscB, photosystem P840 reaction center iron-sulfur protein
MAEPVENKNQAPAPGAKVPPKGAPAAPKAGAPAAPKGPVAPKAGAPAAKT
GASAAKQAGKPRLASLGVTLGRSGVRQESALPYVKPKAVPPPKPAAPAAK
GAPAPKGAPAAPAAKAAPGAPVAKAAPKAKKHYFIIENLCVGCGLCLDKC
PPKVNAIGYKFYGDVQEGGFRCYIDQAACISCSACFSGDECPSGALIEVL
PDGEVLDFSYTPPERLDFDLRFLHRFHREAR
>CT1639 pscC, photosystem P840 reaction center cytochrome c-551
MDKNSNGKLIALAVGGAVLMGALFFSVSFLTGYIPAPNHSAILTPLRSFM
GWFLLIFCASIIIMGLGKMSSAISDKWFLSFPLSIFVIVMVMFLSLRVYW
EKGRTTTVDGKYIRTTAELKEFLNKPAATSDVPPAPAGFDFDAAKKLVDV
RCNKCHTLDSVADLFRTKYKKTGQVNLIVKRMQGFPGSGISDDDAKTIGI
WLHEKF
>CT0641 pscD, photosystem P840 reaction center protein PscD
MQPQLSRPQTASNQVRKAVSGPWSGNAVHKAEKYFITSAKRDRDGKLQIE
LVPASGRRKLSPTPEMIRRLIDGEIEIYILTTQPDIAIDMNKEIIDMENR
YVIDFDKRGVKWTMREIPVFYHEGKGLCVELHNKIYTLDQFFK
>CT1340 pssA, CDP-diacylglycerol--serineO-phosphatidyltransferase
MGTEERKARQKRYPPVLQDSGERPRRRFPYVSRSFVPSVFTVMNMVSGYV
SIVMAGEGSFVIAGWLIFLAAFFDTIDGFVARLTNSSSEFGVELDSLSDL
VSFGAAPAYLVYKFGLEHIGMPWGLLLSSLLMVGSGLRLARFNINLIDYH
KDSFSGLPTPAQAMTVASFVLWMSVEPLFTELALQKVLMVLSVLLATLMV
SKVNYDALPKPTLESFRSHPVQMILYVIAIFCVLVFQAKAFFVAMLLYIL
LGIVRSLTRTVQEWQL
>CT0899 pstB, phosphate ABC transporter, ATP-binding protein
MQMTAEEKQTSAKIPAERSTCDIYIPPERKAIAGGGKPHVVARDFSIYYG
EFEAVKKVNAEILSKYVTAIIGPSGCGKSTFLRAINRMNDLIPNCHTTGA
LMFDGEDIYGKFTDEVLLRKKIGMVFQKPNPFPKSIFDNIAYGPKLHGIK
DKKKLSEIVEKSLRKAALWDEVSDRLDKNALGLSGGQQQRLCVARALAVE
PEILLLDEPTSALDPKATAKIEDLIQELRGSYTIMIVTHNMQQASRVSDY
TMFFYEGVLVEHAPTTQLFTRPKDSMTEDYITGRFS
>CT1252 pth, peptidyl-tRNA hydrolase
MKLIVGLGNPELRHAATRHNIGFDVIDHLAGSSTFSSGKGNYRFTKITAP
GGPLILVKPMTYMNLSGHAVVAAMNFWKIERENLLVICDDVNIPLGTIRI
RAKGSAGGQNGLKHIIQSLGSEEFARLRVGVGGENMPASLSSFVLSKFTA
EERKCIDKVIPVCADAVLDFASLGVEHAMTKYNGQVC
>CT2211 ptsH, phosphocarrier protein HPr
MIVQEVIIRNSAGLHTRPAAAVVKLASRFKSDFFIEMDGLEINAKSIIGV
MSLAAPKGSRMVLKLEGEDEAEAAKHLIEFFEQGFGEA
>CT0204 ptsI, phosphoenolpyruvate-protein phosphotransferase
MVYRKAPKHSGDAPDSSANPADRPASTGKERRYQGIGSAKGFAIGETYEF
VRETIEHETADLSPENIEGEIERFMTALHRSEKELKKIERVTTRKIGRLY
SDLFQAQIMLLKDPVLTGNITRRIRQELKPAHIVIEQEFGKLLEHFLNSD
DVIFRERAADLHDLKERIIRNLHIRKLHSWVPEGSIVVSHHLSPADIILL
SRSNIKGFATDTGGKTSHVSLICKSLNIPIVVGLGNFSQKAVSGERIILD
GNEGLVITNPLDETVDTYLKKREEESKREADDSIMAHRHAFTRCGVRISV
YSNIDFKEEIEHLDSMGAEGVGLFRTENLFLDDLKPPKEAAQQEYYRKMG
EMLAPKPLVIRLFDIGGDKLIYSPVKEPNPNLGWRGVRILIDVPEILDAQ
LQAVIRANIHGNIDVLIPMISSVEEIMHIKQAVEEHYKHIRSLTTEPLDK
PGIGAMIEVPAAVELIDEITQIVDFVSIGTNDLTQYTLAVDRNNLIVQDL
FEKFHPAVIRQLHRVISTAQKNHCRVSLCGDMGSDPLATPYLIGCGLREF
SIVSSDIPALKAMVGKYTVEECEALAAECLKLSSASAIKAHLEAFVKAH
>CT1633 ptsK, HPr(ser) kinase/phosphatase
MNFDQKGLKKRSITVAYFFEHIQKRFDIKFRRLNELDEQKCRIHERDLHR
PGLAIAGFTKLFTYKRVQILGNTETRYLNHLSDEERKTAFANFVSFRMPC
IILTSNNKLDQELVDMATGAGIPVFITRCSSTKTIYYITDFLDEEFSLYQ
QYHGSMIDVYGVGVLLTGKSGLGKSEVALDLIERGHGLVADDVVVVKRKG
ETKTLVASRNNIIDHFMEIRGLGVVDVRQNFGIRAIRDRKEVQVVVELLE
WSKESEYERLGLDQKMVKLLGVDLPLVQLPIFPGKNITAIIEVVALNFLL
KHYAGYVPAEALTERIRNVINKERAKAPAPSTSFEEYNDEND
>CT1441 pucC, pucC protein
MERVRKKSRNPESEGTVKQDLEMRASSFLASALPASSYRGVISASVHGTW
RSTPKMIFDAHEEAPFENHDNQIHPDFQGWPQDAILEKARVLCKLWPCLM
KRLSDSSNPFPRFMNKFFRIFNLVRLSLFQIGFGIMLGFVQDILNRVMIK
ELFLPATIALGLISLKELLAILGVKVWAGNLSDRYAIFGYRRTPYVLIGL
VSCIVSFILAPTTAYEVRLDGTGSLVSIIFSALGDVGLWKLSAIFLVFGF
GLQVATTAYYALIADMVDEKDIGKIAGASWTLMVLTAIISNYSIGSYLKV
FTPERLTQVAEIGGLVALTFGLIAVLGVERRNAGVGVHKEKHSIPFSQAI
RLLASSPNTMLFALYIFISIFALFANEVVMDPFGAEVFGMQVSETTKLFK
PVMGGTQLIFMLLTGFLLSRIGTRRGAYFGNVFGAVGFGLIIAAGFMHDV
QFLRIALVVTGIGLGAASVSNITMMMNMTAGRSGIYMGLWGTAQSLAIFI
GHSSAGVIRDLVFHFSGNHMLAYAAIFVLEIIAFTISSLVLPHVSREAFE
AESAEKMLELEAAAEAG
>CT2154 purA, adenylosuccinate synthetase
MEEKIFSRPAASATVLVGTQFGDEGKGKLVDYLSDKYDIVVRYQGGANAG
HTICFDGKTVVLHLIPSGIFNKDCICVIGNGVVIDPNALMDEIKKVEELG
YDVKGRLYISHNAHLIMPYHKRLDSLSESCLSGDNKIGTTGRGIGPSYED
KFARKGIRVVDLLDRDVLKEKLRENLAAKNKLISKVYEQEEIDVEAIIRE
YEEFDKAIDPYVTNTQLFLNRQIKAGKTILLEGAQGCLLDVDHGTYPFVT
SSNPTSGGACTGSGVAPNHVGKIIGVCKAYTTRVGNGDFPTELDDETGEA
LGRIGCEFGATTGRKRRCGWLDLVALRYSVTVSGVTELALTKLDVLDTFE
EIKVCTSYMLDGKEIFDFPTEHQTLSRVQPVYKSLKGWMASNAKAKSFAE
MHPNAQAYVNFLEEALEVPVTFISVGPGRDETVFK
>CT0989 purB, adenylosuccinate lyase
MIPRYSPKDISAIWSDEAKFERWLQLEIAAVEARMEAGIVPADALATIKE
KARFNVEEILIIENETKHDVIAFLTNVAGYVGPESRYIHEGLTSSDVVDT
CLAMQMRDAGKIIVADIESLIEVLGKKAVEHKYTLQMGRTHGIHAEPTTF
GLKLLLWYEEMKRNLERMKRALETVSVGKISGAVGTYQHLSPDIEAAVCQ
KLGLKPSSISTQILQRDRHAEYATTLAIVASSIEKFSTELRHLQRTEVRE
TEEFFSKGQKGSSAMPHKRNPITFERLTGLARVVRSNSIAAMENVALWHE
RDISHSSVERVIMPDSTIALVYMLRTFRDSIETLLVYPERMKQNFDTSYG
LTLSQTLLLALTGKGLTREDAYRLVQRNAMKSWEEKIQLKELVIHDPEIL
EHISAEEINQLFSPETIQDKLKNSVDIIFKRNGL
>CT0960 purC, phosphoribosylaminoimidazole-succinocarboxamidesynthase
MNKVTLLHEGKAKKVWQTDNPDLIIQEFKDDATAFNNKKKGSIADKGVTN
NAIASRLFTMLGENGIPTHFVGKLNDRDMLCKKLDILLIEVVTRNIAAGS
LVRRYGFKEGTVLSKPIVELYLKNDELDDPLMNEDHAVALGLGTCEEVAH
LKAMAKKINSLLVPWFAERKLRLVDFKLEFGRHKGEILLGDEISPDTCRF
WDAESGEKLDKDRFRLDLGGVEDAYSEVKRRVLEQ
>CT1674 purD, phosphoribosylamine--glycine ligase
MKVLIIGSGAREHAMAWAVARSSKVSTVFVAPGNGGTATMGGKVRNTPVK
ATDIDALLELVAKESIGLTVVGPEQPLEAGIVNRFREAGFKVVGPTAEAA
QLETSKVFAKEFMKRHGIPTAGYEVFRDYASAKAFLETCPTFPQVIKASG
LCAGKGVVVAMSRDEALEAIHEFFESRIFGDAADEVVIEAFLSGQEASVF
ALTDGQNYQLFLSAQDHKRIGEGDTGKNTGGMGAYAPAPLVTPEVMRRVE
EEVIRPTLAGMRADGYAYTGFLYVGLMIDKGVPSVVEYNARLGDPETQVV
LPMLKSDLFDALLASVEGGLEVVPFEMQEGAAATVVMASAGYPDAYETGK
VITIDPTVNDMEGVLVFHAGTRRDGDALVTSGGRVLSVTACAGSLKEALD
RVYRAVDAIEFEGAYCRRDIGAKAL
>CT0371 purE, phosphoribosylaminoimidazole carboxylase, catalytic subunit
MTQVSFDAQAPLVGILMGSDSDFDIMKEAATVLNEFGIPFEISVISAHRT
PGDLEAYASSAEERGLKAIIAGAGGAAHLPGVTAAMTPLPVIGVPIYSKK
LSGQDSLYAIVQMPAGIPVATVGIDNARNAALLAVQMLALCDASLMQKLR
EFRQKLAEASRQKTAKIREKLRANG
>CT0313 purF, amidophosphoribosyltransferase
MCGVFGVFNSKTPAEDTFYGLYSLQHRGQEAAGIVVAEYNKAKKKTLFKQ
HKGLGLVSEVFKDEQIFENLSGYAAIGHNRYSTTGASKSNNNIQPFSLTY
RSGSLAIAHNGNLTNSRVLRKELTEEGVIFQASSDTEIIPHLAARSKEKE
PLHQIYDALRQVEGAFSIVILANNQMIAARDPYGVRPLALGKKIDPATGE
LAYVVASETCAFDIIKAEYIRDIEPGEILLIDHLAVDNEKPVSLYLPPVE
RKARCIFEYVYFARPDSFIFRHSVDKVRRNLGKNLARESTIERDPNDKEL
AVVSVPDSSNTAALGFVRESNKLGKPARFEHGLIRNHYVGRTFIQPGIQS
REIKVRSKYNIVRGVMQGRPIILVDDSIVRGTTAKMLIKLVREANPKEIH
LHISSPPITNPCFYGMDFPSKRQLLTHMLADAEHELGDIEKIREYIGVDS
LRYLSMQGLLNSVPEFEGETCSYCTACFSGDYPIPIADATTDKEEND
>CT0320 purH, phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
MLDPVIKRALVSVSDKTGIVDFCRELASMDVEIFSTGGTLKLLQGAGIVA
QSISTITGFPEIMDGRVKTLHPKIHGGLLAVRDNAEHQKAARENGIQFID
LVVVNLYPFEATIAKADVTFEEAIENIDIGGPSMLRSAAKNNESVTVVTD
VADYATVLDEMRSNGGATTRATRLTLAAKVYALTSRYDTAIAAYMAKAAG
VKGAGDTMTLKLEKELSMRYGENPHQSAGLYKMDDGNGVRSFSAIFEKLH
GKELSYNNMLDIAAATGIIEEFRGEEPSVVIVKHTNPCGVAQAPTLCEAY
RKAFSTDTQAPFGGIIAFNRPLDMETASAVNEIFTEILIAPAFEDGVLEM
LMKKKDRRLVLQKQPLPKAGWEFKSTPFGMLVQERDSKTVAPEELKVVTK
RQPTAEELADLMFAWKIVRHIKSNTILYVKNRQTFGVGAGQMSRVDSSKI
ARWKASEVALDLRGSVVASDAFFPFADGLLAAAEAGVTAVIQPGGSIRDN
EVIEAADANNLAMVFTGMRHFKH
>CT1240 purL, phosphoribosylformylglycinamidine synthase II
MSTEPEVNLKLAQEHGLNEEEYAKICEILGRTPSFTELGIYSVMWSEHCS
YKNSIAVLKTLPRDGASLLTQAGEENAGLVDIGDNLAVAFKIESHNHPSA
VEPYQGAATGVGGIHRDIFTMGARPVASLDSLRFGSPRDPRVRYLVDGVV
RGIGDYGNSFGVPTVGGEIYFEDCYTGNPLVNVMSVGLVEHHKTVSATAW
GKGNPVLIVGSSTGRDGIHGATFASEDLSEASEDKRPSVQVGDPFAEKLL
LEATLEAIATGAVVGLQDMGAAGLTSSTSEMSARGIEKTGSGGIEIDLDL
TPAREKGMTAYELMLSESQERMLIVAEKGREHEIIDVYKKWDVSAVVIGQ
VTDDNMLRVRHHGEVVAEIPATSLVLGGGAPVYIREAVEKKPDTPAADLV
ADDSLDFKALSLQLLSRPNIASKAWVYQQYDSMVQTNTVTPAGQTDAAVI
RIKGTKKGLAMKTDCNSRYVYLNPKAGGAIAVAECARNIACTGAKPLAVT
NCLNFGNPYKPEVYFQFKSAVEGIGDACRMFDTPITGGNVSFYNETSLGG
GRTAIYPTPTIGMIGLLDNIDNLVESTFRKAGDAIVLLGAPELSLDGSEY
LVMQYGTPGTDSPAVDLNHEKNLQELLVTLASKKLINSAHDVSDGGLAVT
LAEKSIMNRERMLGFEVDLECSCKEGTAIQKQLFSEAQGRVVISVDPGRV
GAVIEEADRLNIPGRVIGKVTPEGASIAVNGKPVAEFMIDELLHAYGHAL
ESALHLEEL
>CT0094 purM, phosphoribosylformylglycinamidine cyclo-ligase
MDYKKAGVDISAGEEFVRLIKPHVRQTFTPQVMTDIGAFGGFFQPDFAKY
EKPVLVSSIDGVGTKLKIAIELGKYDTVGSCLVNHCVNDILVCGARPLFF
LDYYACGKLKPEIAASVVTGMVKACRENGAALIGGETAEMPGVYDVEDFD
LAGTIVGMVDQPHIINGSKIEAGDVMIGLPSTGLHTNGYSLARKVFEGRM
NETFAELDGTVGEELLKVHRSYLPVIEPLLGTGDLRGMSHITGGGLMGNT
MRIVPEGLALSVDWNSWPEPLIFDLIRKEGSVPEEDMRRTFNLGLGLVMI
VAKDRVDHIMGYLKSKDENAYIVGEVVKA
>CT0319 purN, phosphoribosylglycinamide formyltransferase
MIDKKKRLAVFCSGTGSNFKALFHAIIERELPAEIVMCLSNRSQCGAIDF
AKEYGIETLHLSESQFGSHDDFARAMLSELRDRQIDMILLAGYLRKIPDA
VIAAYPEKIVNIHPSLLPQFGGHGMYGMRVHEAVIASGETRSGATVHFVN
EEYDKGRIIMQNHVPVLPGDTPKTLAERVLRCEHRLYPAALEKLLDKQP
>CT1338 purQ, phosphoribosylformylglycinamidine synthase I
MADVTVGIVVFPGSNCDHDTEYAVASFPGVKPVMLWHNDHDLKGCDAVIL
PGGFSYGDYLRCGAIARFSPIMREVIDFAGKGRPVLGICNGFQVLVECGL
LEGALIRNAGRRFVSRQTTISVANNATIFTDRYQKGEVLRVPVAHGEGNY
YASPETIESLESNGQVVFRYTDAWGNATAEANFNGSMNNIAGIVNKQGNV
LGLMPHPERASEKLLGSEDGRRLFESLFAHLAGA
>CT1060 purT, phosphoribosylglycinamide formyltransferase 2
MPKTIMLLGSGELGKEFVIAVKRLGQRVVAVDSYDDAPAQQVADRREVID
MLDGRALDAIVAKHRPDIIVPEIEAIRTERFYDYEKQGIQVVPSARAANF
TMNRKAIRDLAARDLGLRTANYRYASSLEELRSAVGEVGLPCVVKPLMSS
SGKGQSTVRSEAEIEAAWSYSQSGKRGDIAEVIVEAFVPFHTEITLLTVT
QKNGPTLFCPPIGHRQERGDYQESWQPCRISPEQLEEAQNIAGKVTEALT
GAGIWGVEFFLADDGVYFSELSPRPHDTGMVTLAGTQNLSEFELHARAVL
GLPVPDITLLRAGASAVILADRVGNNPSFDGLDAALAEPGSDIRIFGKPV
MRPYRRMGVALMSGEKGSDVDELKRQAIANAEKVMIKCDETV
>CT1825 purU, formyltetrahydrofolate deformylase
MIAAAAPSKAILLLSCPDRVGLVARIANFIYERGGNILDLNEHVDVDERQ
FFLRVSWSLDHFSIPAEDLESAFAPLGREFRANWQIRLSGKRSRMAVFVS
KYDHCLREILWRHSLGEFDIDLPLVISNHPDLAPLVEAHGIPFHVIPVTP
EAKAAAEQRQMALCDEHGIDTIVLARYMQVLSPEFTRRWVGRIINIHHSF
LPAFVGGNPYRQAYRRGVKLIGATSHYVTDELDEGPIIEQDIIRITHRDT
LEDLVRKGRDLERLVLARALRLHCDHRILLNGRKTVVFD
>CT0956 pvp, pyrophosphate-energized vacuolar membrane proton pump
MYGLVVCLFGMIFGLIQYQGINKLPVHAAMKEISDLIYETCKTYLITQGK
FIIILWALVAAIIVAYFGGLNHLAPDKVVFILACSLLGIAGSYTVAWFGM
RINTFANSRTAFASLGGKPFPTYAIPLRAGMSIGMLLISIELFAMLCILL
FIPVDYAGPCFIGFAIGESLGASVLRIAGGIFTKIADIGSDLMKIVFKIK
EDDARNPGVIADCTGDNAGDSVGPTADGFETYGVTGVALISFILLAIKDP
SIQVSLLVWIFAMRLVMIVASAVSYWVNDALAKMKYGNADEMNFEKPLIT
LVWLTSIVSIVLTYIASYMLIAQLGDGTMWWKLASIITCGTIAGALIPEL
VDRFTSTECAFVRNVVQCSKEGGAALNILSGLVAGNFSAYWMGLAIIVLM
GAAFGFSTLGLDVMMLAPSVFAFGLVAFGFLSMGPVTIAVDSYGPVTDNA
QSVYELSLIETLPNISNSIESEFGFKPDFENAKRYLEANDGAGNTFKATA
KPVLIGTAVVGSTTMIFSIIMILTGGLADTGAIAKLSILWPPFLLGLLMG
GAVIYWFTGASMNAVTTGAYYAVAFIKKNIKLDGVTKASTEDSKKVVEIC
TRFAQKGMINLFLTIFFSTLAFACLESYLFIGYLISIALFGLYQAIFMAN
AGGAWDNAKKVVETELHAKGTELHDASVVGDTVGDPFKDTSSVALNPIIK
FTTLFGLLAIELAIKLPTTISVSLAVVFFLLSLVFVHRSFFSMRIAVDKD
>CT1800 pyrB, aspartate carbamoyltransferase
MNHLTGLFGLPASTLVELLDLATGYREGLNREPETFAPLLSNRRIALVFF
ENSTRTRFSFELAARHLGAGTLSFTAASSSVSKGETLSDTIRNLEAMKVD
AFVLRHPSSGAAEFVASITKRPVINAGDGTHEHPTQALLDLLTLREYFGK
IEGLKIMILGDILHSRVARSNIIGLKTLGAEIAVCAPTTLLPGRIDQLGV
RVFTGIDEALAWADAAIVLRLQLERATGGYIPSLEEYSASYGLTDEKLDR
LKRLMPVLHPGPINREIEISNLVADRIQPPGYSSSMLMEQVTNGVAVRMA
VLHRLLAK
>CT1042 pyrC, dihydroorotase
MSTLFLNARLLNPAENLDTVGSIKIGDDGLIEAVATGGESIPAKAEDNVI
DLAGKVLAPGLFDMHCHFREPGQEYKETLETGSAAAVAGGFTGVALMPNT
RPVIDSPLGVAYIRHHSAGLPIDLEVIGAMTVESRGEALAPYGKYASYSV
KAVSDDGTAIQSSQIMRLAIEYAANFDLLLIQHAEDKHLTAGGIMNDGAV
SAMLGLKGIPEVAEPIMIARDLQLIAWLKKHKLNGAVAEPRYHVAHISTA
ESVALVRKAKAAGLKVTCEVTPHHFTLTEHDLSSSIEKGNFIMKPPLASV
ENRDALIEGLRDGTIDAIATDHAPHAKHEKECPPDQAAFGIIGLETSLGL
TITELVDKGVITLSQAIELLSTNPRRIMGLETILFRAGRKANLTIIDPDC
EWIVSESDFGSKSRNTPFMGRKLKGRALGIYHNSKLIMR
>CT1597 pyrDI, dihydroorotate dehydrogenase
MNSPAAVKLGRGLNLRSPVMLASGTVSYGEELSQLCDLSKIGGLVTKAIS
PEPRTGNPPQRIAETSCGMINAIGLANVGLDKFVSDKVPFLQTLDTQVIV
NIAGKSIDDYCQVVERLDTVEGIAAYELNLSCPNVKGECMIMGVSPDATR
EIVSTLRKQTRRHLMVKLTPNVTSISAIALAAEEAGADSVSLINTVVGMA
VDYRKRKPLLKNVTGGLSGPAIKPIALAKVWEVSKAVKIPIVGMGGIASF
SDAMEFLLVGASAIQIGTMNFVYPDIGEKIAVAIDKFFSQPDAPAFQEYV
GSLMV
>CT1946 pyrDII, dihydroorotate dehydrogenase, electron transfer subunit
MLQTSAISDVRTRISAIRPAGAGVSILSMPCPKIAAAAKPGNFVNIKINA
ADQPLLRRPFCIHNVQGDIIDVMVKNVGRGTALLCEASCGESLLVLGPLG
NSFGTGTGDFDTALLVSGGIGTAPMLFLEKTLAAAGIPFHHLVGGRSRSD
LLTTSLSNVSTATDDGSEGFHGNVVQLLEKYLTEQTDAGRVKVFACGPNP
MLKALASFCRARAFPCELSLESIMGCGVGICYGCMVELSNADGEKESILL
CREGPVIDGNRFTT
>CT0083 pyrE, orotate phosphoribosyltransferase
MFKSSGALLDGHFRLTSGRHSNSYFQCAKVLQYPEYLSAICGQIAGFFRE
SGITTVISPAIGGIVVGTEVGRQLGVKTIFAERKEGAMMIRRGFSIDPSE
QVLVVEDVITTGGSVAEVIELVKAAGATVAGVASVVDRSNGKVRLADKQF
SLLTMEVVSYAPEECPLCKEEIPIYAPGSRTNPQC
>CT0129 pyrF, orotidine 5'-phosphate decarboxylase
MSSARDKANRRIASLQSMLCVGLDSDPSKIPTLFHSMERPVLEFNRAIIR
ATGEHAAAYKVNTAFYESRGLAGMRDLDDTLQALPPECLSIADAKRADIG
NTSRHYAKAFFETWPFDAITVAPYMGFDSLEPFFEYDDKLVFVLCLTSNP
GSADFEERILDDGRPLYRAVLDRVRSWQRNGNAGIVVGATKASLLQELRQ
EAPELFFLIPGVGAQGGSMQEAVNQGADPDRGGAVVNVSRALIFPKGDFR
SISEFEEAVRREAAKLHDDIKEVL
>CT0142 pyrG, CTP synthase
MARPKNVKHIFVTGGVISSLGKGILSASLGLLLKSRGLRVAIQKYDPYIN
VDPGTMSPYQHGEVYVTDDGAETDLDLGHYERFLDEPTSQACNLTMGRVY
KSVIDKERRGEYLGGTVQVVPHVIDEIKEKMGDLAKNGSIDVLITEIGGT
IGDIESLPFLEAMRQLKLELGEHNLLNIHLTFVPYIKAASELKTKPTQHS
VKMLLETGIQPDILVCRSEKPLSREIKNKVGHFCNVHDLDVIGLNDCDTI
YEVPLMLLKEQLDLRVMKKLGLKKFREPNLEYWKNFCEKVKHPKDGEITI
GICGKYTEYPDAYKSIIESFIHAGASNDVRVLVKMLRAEDAEDPKFDISS
AFKGISGLLVAPGFGDRGIEGKVRFVQYARENNIPFFGICLGMQCATIEF
ARNICDLPDANSTEFNKRTRFPVIDLMEHQKKVKEKGGTMRLGSYPCILK
EGSKAHELYGKFLINERHRHRYEFNNQFRKLFEEKGMIFSGTSPNGDLVE
IVELKNHRWFVAVQFHPELKSRVQKVHPLFDGFVHAAKEFAQGKRQLSLE
VEMPRLSSTEMENAG
>CT1779 pyrH, uridylate kinase
MRKYRRILLKISGESLAGESGYGIDAGVLESFADDIKEATDLGAEIALVI
GGGNIFRGLSAAAASMDRVQADYMGMLATVINSLALQDALERKGIFTRLV
TAIKMEQIAEPFIRRRAVRHLEKGRVVIFGAGTGNPYFTTDTAASLRAIE
IEADVIVKGTRVEGVYDSDPEKNPNAEFFPKISYVDVIRKNLRVMDMTAI
TLCRENTLPIVVMNMNIKGNFTRLLKGEPIGTLVHVGEE
>CT0545 queA-1, S-adenosylmethionine:tRNAribosyltransferase-isom erase
MKVSDFDYALPEERIACYPPAKRGSTRLIVLNRATGSITHSAYASLHEEL
RPGDLLVLNNSRVIRARLIAHKPTGARIELMLLEKHEEAQNLALYRGRLR
IGDRLLAHGVELSVEALPSDGIARLSCATGNLADLFDAHGSVPIPPYLKR
DAEELDRERYQTVFAELPGSVAAPTASLNLTDELLDKINAKGVGIAHVTL
HVGLGTFLPIRSERFEEHVMHREFYNIPQTAARKIGETKASGGRVVAAGT
TVTRALEHAAPKLIESGFTQEVSSEADIFIYPGYEFRIIDALITNFHAPR
STVLMLTAAFAEKELLMRAYHEALDNGYRFLSYGDSMFIS
>CT1316 queA-2, S-adenosylmethionine:tRNAribosyltransferase-isom erase
MRLSNFRYTLPKTRIADHLESPRDACKLMVLNRRKKEIEHKVFTDIVSYF
KKGDLLVVNNSRVFPAKIFGQKEKTDAKIEVFLLRELNKEAGLWDVLVDP
ARKVRVGNKIYFEDDVVAEVVDNTTSRGRTIRFLNPDIDVFQMVEKIGHV
PLPPYFTRKPKETDRTDYQTVYASQTGAVVAPMAGLHFTIPLLQQLQKIG
VKILPLTLHPSLSTFNAIEVEDVSKHKMDSEYFNIPYQTAMEINETKVNK
SGRVFVVGTTTCRALEANATVDGKIKFGQGWTDKFIYPPYNFKVTDALIT
NFQQPETTLMMVVSAFAEHRLLMEAYKTALKNNYRFLAYGDAMLIV
>CT1195 radA, DNA repair protein RadA
MAKSTVRYVCSACGAVSLKYQGRCFECQSWGTLVETHLEEPDAKVRKKRP
AGSMPEVQNLDDDAPSGFHRTLTGIGELDRVLGGGLMEASAILVGGEPGI
GKSTLMLQLVPRLAGKKVLYVAGEESPNQIRERARRLSIKAPNLRLVSEV
ALERILDAIANEQPEMVIVDSIQTVYSSDYQSSAGTITQIRECAASLIRA
AKEQNFILLIIGHITKEGSLAGPKALEHMVDTVVQFEGENYQRYRIIRSV
KNRFGPTNEIGVFKMGEEGLTEVSNPSEFFISDRRTDVPGTAVLAGIEGS
RALMVEVQALVSRTGYSMPQRISTGFDLKRIAIILAVLEKRLQFQTAGQD
VFVKIAGGLKLVEPAADLAIAAAVASGLQDKPCNPTACCCGEIGLSGELR
AISDGERRIREAVHLGFTSIVLPESNTRELKPSLKKLPIRIAGCRTLHEA
LEAMGV
>CT0611 radC, DNA repair protein RadC
MRIHDIDPDNRPRERFLRSGKESLSPAELLALILRSGTAGLNIIDTCNKL
ISEHGLERLADLSIQELQKTPGIGEAKAMQIAAIFELQRRLHFARNMNLK
VKGARDVFEYMKGRIPDETKEHLFVLFLSTKNQILRHETITIGTLTASLI
HPREIFKAAIRESAHSIILVHNHPSGDVQPSNADKQVTSILKKAGDLLQI
ELLDHVIVGNNDWFSFRDHALL
>CT1772 rbcL, ribulose bisphosphate carboxylase, large subunit
MNAEDVKGFFASRESLDMEQYLVLDYYLESVGDIETALAHFCSEQSTAQW
KRVGVDEDFRLVHAAKVIDYEVIEELEQLSYPVKHSETGKIHACRVTIAH
PHCNFGPKIPNLLTAVCGEGTYFTPGVPVVKLMDIHFPDTYLADFEGPKF
GIEGLRDILNAHGRPIFFGVVKPNIGLSPGEFAEIAYQSWLGGLDIAKDD
EMLADVTWSSIEERAAHLGKARRKAEAETGEPKIYLANITDEVDSLMEKH
DVAVRNGANALLINALPVGLSAVRMLSNYTQVPLIGHFPFIASFSRMEKY
GIHSKVMTKLQRLAGLDAVIMPGFGDRMMTPEEEVLENVIECTKPMGRIK
PCLPVPGGSDSALTLQTVYEKVGNVDFGFVPGRGVFGHPMGPKAGAKSIR
QAWEAIEQGISIETWAETHPELQAMVDQSLLKKQD
>CT0242 rbfA, ribosome-binding factor A
MSIRTDKVSSLLQRELSAIFEKELPRSGPLVTVTEVRMTADLGIARVYVS
VIGSEAQRAEVMEYLDAENKMIRKTLSAKIRHQFRRIPELEFYEDRLFEQ
ANRIEQLLKSVKPARDEEQH
>CT1100 rbr-1, rubredoxin
MNVKESEPAQADLQASWMCAECGYIYDPAEGNLETNIRPGMPFDKLPDDW
SCPVCNHPKNQFTKFISQL
>CT1101 rbr-2, rubredoxin
MEQWKCNICGYIYNPETGDPEGDIPAGTSFESLPDSWMCPVCGAGKEEFT
KI
>CT2024 rbr-3, rubredoxin
MQKWVCVPCGYEYDPADGDPENGIEPGTAFEDLPEDWVCPVCGVDKSFFE
PVS
>CT1930 recA, recA protein
MEKEATPQQQPPVVDPARLKQLNLAIETLEKQFGKGAIMRLGDDSAVMHV
QVISTGSMALDYALGVGGLPRGRVTEIYGPESSGKTTLALHAIAEAQKNG
GIAALVDAEHAFDPTYARKLGVDINALLVSQPESGEQALSIVETLVRSGA
VDIIVIDSVAALVPQAELEGEMGDSVVGLQARLMSQALRKLTGAISKSSS
VCLFINQLRDKIGVMYGSPETTTGGKALKFYSSVRLDIRKIAQIKDGEEL
VGNRTKVKVVKNKVAPPFKTAEFDILYGEGISVLGELIDLAVEFGIIKKS
GAWFSYGTEKLGQGRENVKKLLKEDETLRNTIRQQVRDMLTGAPTA
>CT1069 recB, exodeoxyribonuclease V, beta subunit
MTIHILDHAAVELSGMNLIEASAGTGKTYAIASLYLRLLVEQELRPEQIL
VVTFTEAATQELRARIRRRIREALDVMRGEPTEDEFLVKLRDHTDVTGST
EQAATLLEAALAEFDMASIFTIHGFCLRALQDHAFESGALYDTELMDDQS
ALAGQVIDDFWRERFFGDANRLLAYALLKKWTPDSLMAFLQSLQLSERDV
VLPVYDEAEIARIEDEAQAAFDEVCRIWQEEQDVILTILRSDKSLSRSEK
FYRDDKVEPLLQALDEFVEDCNPFALFDGFEKLTTSGIAAGIKKNGTAPA
HPFFEACEQLQSAADERLLALKSELVAFYRKRLPERKKEGNVRFFDDLLT
DLYHALADGPEADALARALRNRYKAALIDEFQDTDPVQYDIFRRIYANSD
APLFLIGDPKQAIYSFRGADIFAYFQAASDVGEERRFTLNTNWRSTPKLL
DGFNRLFDHARQPFVFDGIPSPPLVAGRLIPESELTGKPPLELCLLDGGE
GDGSLKSGDATQFAAKACASMVAETISGGTDASGIAVIVRTHQQARLMQA
TLRKAGIVSVMRSDESVFASIEAEEVRILVTALADPGRETLVRAALVTPI
FGLNGSDIARLNSDENALVERLQEFRDYHRLWRERGFMVMARELMRREGV
RERLLATAGSSGERALTNVLHCFELLHREEHARGLGMEGLAAWFGERVAT
RDSGEEYQIRLESDEPAVKIVTAHVSKGLQYPVVFCPFQFVSGNDKDEVA
LCHDDAGRMVRDFGSASLNRHRTLAARETLAENLRLFYVALTRAERRCIF
FTARIVDGRKSGKPPQLTPASYLLYASDDARRSANPVAEAADELQTIGVD
DMADKLQKMAEESDGAMQVRRLSLGDDFHVRVPPAGSTERPELVLRQFKK
ILDTDWRIASFTTLSRHKQVAFELPDRDESAPAEPAASILLPEAERSIFA
FPKGAGAGILMHSIFETLDFASASCADIDEACAAALDRYGFDKEWQPALA
SMTREVLATPLSSPNGSFTLGGLKPRTWFTELEFFFPLKRITAPELAEVL
VRHGVIPTDTDPAAALQALDFAPVKGMLMGFMDMVFETQGRYYLLDWKSN
HLGNSVEEYRREQLDRAMTENLYRLQYLLYTVALDRFLSLRVPDYRYETH
FGGAIYVFLRGVSAERGEAFGFYRDLPSAELIRELSELLVEMTPGEGEE
>CT1068 recC, exodeoxyribonuclease V, gamma subunit
MLEIYTSNRLETLVSAFGKMVASTLLASPFEREWIVVQSRGMQRWLSMRL
ASQFGVWAGAEYPFPNALIQRLFEWLELSKQADTERFSKETISWSLMRFL
PDLLERDSFAPLRAYLDGDRDGLKLFQLSGRIADTFDQYTLFRPDLLAAW
EAGGDDAARDWQPELWRALVAEHSGKHRGQLQSAFLRKVRTTRLPADFPK
RISLFGVSYMPPYHIEILQALAVRIPVNLFLLSPTREFWTDIVSRRRLFR
MSESERALSVEGNPLLASLGKSGREFAELLLDVGNVEDEYDLYDDPGTAT
LLRALQSDILNLRGDGTDGRHPMPDASDCSVQVHSCHNPLREVEVLHDNL
LDLLDKLPGLEPRDIVVMTPDIETYAPYIATVFGASGPEEPRLPHSIADR
RMLDEGGVAPAILKLLELYGSRHTAPALFDLLSSPPVSRHFRLGEEELDT
VRRWIKETRIRWGMDEGARSKLGLPPFRENSWRAGLERLLLGYAMPDEGR
LFNGVLPFEVAGDAETLGKLADFIDAVDQLSTRFSRSRPLGEWRNHFFWM
LENFIEADADSERELAHVKSEIDSLTSLAENSRFEGEVSAQVMIAWLRGR
LEQFEMGLGFMTGGITFCAMLPMRSIPFRVVAMIGMNDGAFPRQQRPPGF
DLLAREPRKGDRSVRGDDRYLFLESLLSARDVFYLSYVGQSVRENTPLPP
SVLVSELLDAIERGFEFPEGEDAASRLVVQHRLQGFSPAYFTKGSSLFSY
SADNFKALAGRNSLAERPFIEEPLNGFTDEDRTVTLDDLARFFANPAAYF
LERRLGLKPSAAVEPLEEREPFEVAGLERYAMRQELLEAVLEGHDGCAML
PLFRSRGLLPPATHGELLFRKLLVEVEEFATKVQELKGGGTFSQLDIDLD
IGGFRLTGKLDHLLPTGQLLYRCARMRAQDRLRAWLLHLAYHAVENGATQ
ETCIITLDRSIRYRPVDDTTKRLETLLDLYREGITEPLPFFPRTSLAWAE
KAEKPEADRRKAALGQWLDGFGGIEGEGNDPAIRRCFGQEPPFGDRFKSI
ADKLLLPMIEHEGKV
>CT1070 recD, exodeoxyribonuclease V, alpha subunit
MIDFIERPIDWQAAAFLCRGVTENAELLRTVVSLLSQAVGQGHVCLDLEE
IAEELVRIPEREEPLRLLPDVASLIAALRSLPTLGGPGEHCPLILDAAGR
LYLYRYFRYETLLAEAIRARAASVSLEIDEAALAARLDRYFGVDKSGEDR
QRQAALAALRRHFSVISGGPGTGKTTTVVRILGLLLEQPGEARMRIAMAA
PTGKAAARLASSITSLREALPCADEVKRAIPSQVTTIHRLLGTIPGSTGF
RHNERNPLPCDVVIVDEASMVDLPLMTALVTALPPHARLILIGDRDQLAS
VEAGAVLGDICCAAESPVSFIAGCVTVLDRNYRFGDGSGIAALSRAVNAG
DHSEALRLFADSGSGIALEATPTRETIKKRLVAPVLEGFRPCLEAKTPAE
ALRLFDRFRILTALREGPWGASGINHAAETFLNEAGLIRLDSQFYRGRPV
LVTENDYNHKLFNGDTGIMLPDPETGNLRAFFAAPDGTVRAIPPEFLPKH
ETAFAMTVHKSQGSEFDRVLMVLPPEDSVLLTRELIYTGITRAKQSVTIW
SDAAVFSAAVKRRTERRSGLRERLVNS
>CT2288 recF, recombination/replication protein RecF
MRLDSISIANFRNHTLLEFEPGHSITNIYGRNGSGKTSILEAIHYCALTR
GFSGNNDREYLKFGEELFTIRSSFTSGQGIATKVSITYSPKREKRILVNE
QELQTFSSHIGTIPCVTFTPREMVIINGAPAERRRFIDTAICQYDRKYLS
DLLLYRRILQQRNALLSSEQDPRVIDSALDVLTDQLVAIATEIVLVRKRF
IEHFTSMLGGVYQWIPEGAEPSILYQSSLGHHENLYEKDKIQQVFRERFE
TLKQQELQRRQTLAGPHRDDLQFYLNKREIRKYASQGQQRAFLVAMKMTL
QGYLYEASGEIPITLLDDLFSELDEVVSGTMVETLATKGQVIITSTEKKK
GKGISFFSVDDYKSSKEP
>CT1575 recG, ATP-dependent DNA helicase RecG
MPPFLPLQVIKGVGPKRAVILAEAGIRSIADLYDCFPRRYLDRTTIKKIG
ALRDGETVTVVGSVTGTRFEGGGRGGSRFKAQITDGSGVLELTWFRGVHY
FSKTIRSGELVAAHGRVTFFGRTPGMQHPDFDKLGGDDESGDGQRDDELY
KTGAIIPIYPTTEAMKQAGLNSAALRRIVHRAFREHPLRITEYLSPEIIA
ANNLMPIGEAYRQLHFPDSAEQLERARYRMKWSELFFAQLFFALRRTEER
RHLTSVRFERSGEKTASLHERLPFTMTSAQKQAVREIYHDLKSGRQMNRL
LQGDVGSGKTLVAQFAMTLAVDNGLQAAFMAPTEILAFQHYAGLKNSLEP
LGIRVALLTGRQKKKEREEKLARLERGEIDIAVGTHAIIEAGVQFRRLGL
VIIDEQHRFGVLQRKALQEKAENPHVLLMTATPIPRTLTMGIYGDLDVSI
IAEMPAGRKPIQTRLCCEAEKPELYRLLRKQIAEGRQAYIVYPLVEESEK
IDLKAATESYEQLRREVFPELRLGLIHGQLPAAEKEAVMAEFRSGRLDIL
VGTTVIEVGVDVPNATIMVIEHAERFGISQLHQLRGRVGRGEHASSCFLV
YTKLTGDAKDRLQAMAATGDGFRLSEIDLQIRGAGNMLGREQSGAASGLR
IADLLTDGDIMRAARAAAFELIRRDETLTHPENALIRDYYMTHFRKRISL
ADVG
>CT0360 recJ, single-stranded-DNA-specific exonuclease RecJ
MKRYRWTCLEPEPTLVQALSEAINVSSPIAAALVNRGISSFEEARRFFRP
SLDEIPSPFLFNDMKRAVGRLSKAIFGGEKIMVYGDYDVDGTSGTAMLSL
FLREMGAEVCHYINDRFTEGYGLSEAGIAWAFEQGVSLIVTVDCGIRAID
EVQACVGKGVDVIVCDHHEAGELPAACAILNPKVEGCGYPFRELCGCGVA
FKLMQAMVEARGESQDRWRNYLDFVAVATAADMVSLQGENRAYLREGLEL
MRRSPRVSFQAMAANMKLNLAEFSMMNITYGIAPRINAAGRMESAGAAMQ
WLLSSDESEARLRAAELEALNVRRRQIDAEITARAETMVAGHCASFCSSI
VLYDEAWHLGVLGIVASKLLDKYQLPTVVMGRMNGLIKGSVRSVDQLNIY
DVLHECRDHLEQFGGHHQAAGLTLRPEQLEAFRRRFDEVCRELLPVEARQ
KTLLIDADLTLDEITPKFLNVLEQFAPFGFSNREPLFTATGCRLVGKPKL
LRERHVKFTVRGEQSSSFEVIAFDRPDIFNDLEAHGSASALQLVCIPERN
QWNGREYVQLRVMDMAIG
>CT1618 recN, recombination/replication protein RecN
MLKSLYVRDFALIDELSVSFAPGLTIITGETGAGKSILMGALNMVLGERA
SAEVVRAGARKAVIEAVFGGEHYENIGEMLDEEQIERTPELILRRDISAT
GQSRCFINDTPCTVSLLKRAGKQLVDLHGQHDHQLLLHTETHAGMLDGFG
LLHAETAQYRNTLEEYRKLRHELQSLNERADMLRKKRDFIDYQYRELDAA
ALVEGEERSIDEEINLLENAETLFNLGTALGEHLYASESSAYTTLSSAVH
LLEKLSAIDKSFEPWLEELRGATATVEELNRFVGSYIDGIDFNGDRLEAL
RERQILLQRLAKKHGKSIDELIALRDRLAEELSLEENLAGELTTIEAEIQ
KARKALSASAETLSAHRREAAERLEQEIMTGLATLGIPHSTFEVRFTREA
LPDGDIEIDGTRYRAFDNGCDRIEFMISTNLGESPKPLAKVASGGEISRV
MLAMKSALARSAELPILVFDEIDTGISGKVAQSVGFSLKRLSRMHQIIAI
THLPQIAAMSDLHLAVVKRIQADRTLTGVTPLDKDEHVREVARLFSGTEI
TETSLQLAEELIEAGRSA
>CT0806 recR, recombination/replication protein RecR
MRYSSGAVEALIEEFAKLPGIGRKTAQRLTMHVLHERRSEVEKLASALID
VKEKVIRCSVCQNITDLGVDPCHICTSAGRDRSVICVVESPTEVLAFEKT
GHYKGLYHVLHGVISPLDGVGPDDIKVRELIARIGVDSTGGVREVVLALN
PTVEGETTSLYISKLLKPLGINVTRIARGIPVGAELEFIDEATLSRAMEG
RSAI
>CT1778 recX, regulatory protein RecX
MDEGKKSSALDHALRLLAGRAHGRAELESKLKKKGFDSESIAKALARLDE
LNLTDDRAFAQSCTASMARRKPEGRLKTRARLKQKGLPDNIIDEALNGCD
QTELCRSAAEKKLRTLPASPDQKKKKLITFLKNRGFDWETIRETVKLVLG
EESARSDQLD
>CT1545 relA, GTP pyrophosphokinase
MLAQIEQEHYSKLHEILRLCRANLKNYDESLIQRAFFMCYRAHEGEKRAS
GEPFFYHPVEVAKLLVTELPLDGVSVAAALLHDVIEDSGYTYEDISAELG
AEVADIVEGLTKISEIMVNRETTEAEGFRKMLLSMVKDIRVILIKFCDRL
HNMRTLDSLPEHRRLRMALETRDIYAPLAHRFGLGKMKVDLENLALKYID
PEMYDYLLKKVRLSRNERVAYLNKMIAPIKDDLEKQGFTVELQGRAKHLY
SIYNKMRMKNKKFDDIHDLYGIRVIIDTEKISDCFAVYGYITQQFPPIPQ
HFKDYISIPKHNGYQSLHSAIIGPKGHVVELQIRTRRMHEFAELGVAAHW
RYKEKISKDDAALDSFLKWARELIKDADSATAFMEGFKLNLYHDEIYVFT
PKGDMKVLPAGATPIDFAYAIHSEIGNGCIGAKVNGKIVRLNTELRSGDR
VEVITSKSQRPKADWLKIVVTHRAKLKIRAAINEERRQEIEKGRSIWEKM
LSGSKKLFTENDAIRAIRQHGIKTPADLFNALAKQQISSEEVLERISHPH
RPAETHESTVQARAPHKDFAEIAREVQERLGYQKDEVTIAGLNNISYSYA
KCCNPVPGDDIIGFVTTGGTVKIHRKNCVNVTNENSVKSERIVSVAWNRK
VDTDFLAGIRIVGEDKIGMTNQITGVISKFDTNIRTIVLNAKDGIFTCNL
MIFVKNTDKLTTLMDKLRKVQGVFTVERLSN
>CT0911 res, type III restriction system endonuclease
MKLHFEPNLDYQLQAIEAVCDLFRGQEICRTEFTVTRQTSLAFAKSDPGV
GDLGVGNRLTLLDDEILANLRDVQIRNGLAPSETLASGDFTVEMETGTGK
TYVYLRTIFELNKRYGFTKFVIVVPSVAIKEGVYKSLQITEEHFKSLYAG
VPFDYFLYDSAKLGQVRSFATGANIQIMVVTVGAINKKDVNNLYKDSEKT
GGEKPIDLIRATRPIVIVDEPQSVDGGLQGQGKAALDAMNPLCTLRYSAT
HVDKHHMVYRLDAVDAYEKRLVKQIEVASATVEDAHNKPYVRLVSVSNKK
GTISARVELDMQTAGGKVRRQEVTVQDGDNLEQTTGRAMYANCRIGEIRV
AKGDEYLELRVPGGELYLKPGQAWGDVDALAVQREMIRRTIREHLDKEKR
LRPQGIKVLSLFFIDEVAKYRSYDVDGNPVKGDYARIFEEEYRRAANLPG
YRTLFQEVDLTRAAEEVHNGYFSIDKKGGWTDTAENNASNRENAERAYNL
IMKEKEKLLSLDTPLKFIFSHSALKEGWDNPNVFQICTLRDIQTERERRQ
TIGRGLRLCVNQQGERVRGFEVNTLTVVAMESYEQFAENLQKEIEEDTGI
RFGVVEQHQFAGIPVSQPDGSTVPLGVEQSRALWEHLKTAGYIDDKGKVQ
DSLKQALKDDTLVVPEPFAAQRDQIVAGLKKLAGRLEIRNADERRQVRTR
QAVLHSPEFKALWERIKYKSTYRVHFDNKKLIKRCIRAVQEAPAIPKTRL
QWRKADIAIGKAGVEATEREGAATVVIDEADIELPDLLTELQDRTQLTRR
SIQRILSGSGRLKDFKRNPQAFIELTAEAINRCKRLAVVDGIKYQRLGNE
HYYAQELFEQEELTGYLRNLLDANKGVYEQIVYDSDTERTFGDQLEKNDA
IKVYAKLPGWFTVPTPLGSYNPDWAVLVEKDGAERLYFVVETKSGLFAED
LREKERAKIECGKAHFKALEVGEAPARFVMARTVDEVLADP
>CT1258 rfaD, ADP-L-glycero-D-mannoheptose-6-epimerase
MIIITGGAGFIGSAMLWELNRNGTDEVLIVDDLGRASEGRWLNLRGLRYT
DFIHKDDLPDLLEHDRLPKIDAVIHMGAISSTTEQDANLLLRNNYEYSKM
LASWCAKKGVRFIYASSAATFGDGSEGYSDGIEVLDRLRPLNMYGYSKHL
FDCWALRNGILEKAAGLKFFNVYGPNEYHKEDMTSVVFKAFHQIGDNGKV
RLFRSHNPQYADGEQLRDFVYVKDCTKIMQWLLETPSATGLFNIGTGQAR
SFRDLVIATFTAMDRPVSIEFIDMPETIRDKYQYYTCADSAHLRQAGYTG
QMTPLEEGVRDYVQNYLSKPSPHLDTLAFERQ
>CT0305 rfbA, glucose-1-phosphate thymidylyltransferase
MKGIILAGGSGTRLYPVTKGVSKQLLPVYDKPMIYYPLTTLMLAGIRDIL
VITTPDDQSSFVKLLGDGSDWGINLSYTVQPSPDGLAQAFILGRDFIGDD
DVCLVLGDNIFFGYGFSGMLEEAVHVVERRRKAVVFGYYVSDPERYGVVE
FDSDGQVFSIVEKPEKPKSNYAVVGLYFYPNDVIDIAASVNPSSRGELEI
TSVNQTYLDRGDLVCSIMGRGFAWLDTGTHESFQEAGNFIETVEKRQGLK
VACPEEIAWRNGWIGDADIERLASPLLKNQYGQYLLNLLERRI
>CT0308 rfbB, dTDP-D-glucose 4,6-dehydratase
MHILITGGAGFIGSHVVRHFLNRYADYTITNLDKLTYAGNLANLKDVESN
PNYRFVKGDIADGAFLLDLFKEQRFDAVIHLAAESHVDRSIESPVEFVIT
NVFGTVNLLNAARATWEGRFEGKRFYHISTDEVYGSLGSEGMFSESTPYD
PHSPYSASKASSDHFVRAFHATYGLPVVISNCSNNYGSHQFPEKLIPLFI
NNIRLEKPLPVYGQGLNVRDWLWVVDHARAIDEIFHRGAVGETYNIGGHN
EWTNIDLIRLLCRIMDRKLGREAGSSEKLITWVTDRAGHDLRYAIDASKL
QRELGWAPSVTFEEGLEKTVDWYLENQAWLDEVTSGAYQHYYEKMYAGR
>CT0306 rfbC, dTDP-4-dehydrorhamnose 3,5-epimerase
MQIIRTSIPDVLLFEPEVFGDERGWFCESFRQDIFEQHAGCHRFVQDNES
FSRYGVVRGLHYQKPPHVQGKLVRVIRGEVLDVAVDIRKGSPTFGHHTAQ
LLNESNRRMMWIPPGFAHGFAVLSQTAVFSYKCTDYYAPSHDAGIRWNDP
AIGIEWSVPESEIRLSDKDLHHPMLHEIEGIVLDA
>CT0307 rfbD, dTDP-4-dehydrorhamnose reductase
MNILVTGSRGQLGSELQKLQEVHGWQEWFFMDLPELDITDALAVERVCRD
RRIGAIVNCAAYTAVDRAESDAEAAFRVNRDGAAVLAAVAMEVGALLLHV
STDYVFDGSSNRPYCEDDPVAPCGVYGLSKWEGEEAIRASGCSYIILRTA
WLYSVYGQNFVKTMLRLGSERQSLGVVFDQVGSPTWAADLAGTIVSILDQ
CDPVRSYSETFHYSNEGVCSWYDFAKSIMDAEGLSCKVLPIESSNYPTPA
RRPHFSVLNKRKIKSTLGLEIPYWHDSLLRMLTELRKTAGKS
>CT0264 rho, transcription termination factor Rho
MSNNSVSKGLDINALQKKKVHELNAIAKELGVITAGFRKEELIYKIIEAQ
SLKNPDSESGQVMVNTGVLQVIPEGYGFLRSSNYNYLSSPDDIYVSPSQI
KRFNMRTGDTVSGQVRAPKEGERFFALLKINTINGKDPEVTRERPFFENL
TPLFPHSRLKLETRQNEYSGRIMDIFTPIGKGQRGLIVAQPKTGKTILLQ
MIANAIIKNHPEVYLIVLLIDERPEEVTDMARSVEAEVVSSTFDEDPERH
VQVADMVLEKAKRLVEVGHDVVILLDSITRLARAHNTIIPHSGKILSGGI
DANALTKPKRFFGAARNIEEGGSLTIIATALVDTGSRMDDVIFEEFKGTG
NMELVLDRRLSERRIFPAIDILRSGTRKEELLFTQEELSRTWLLRKYLGD
KNPIECMEFMREKIVETKDNKEFFKYMNA
>CT1591 ribBA, 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II
MMSNNQFDSIESAIEDIKNGKLIIVVDDEDREDEGDFIAAAEHVTPEMVN
FITKEARGLLCVAIPMERARELQLEPMVQRNTSQHETNFTVSIDAIAEGV
TTGISAYDRYMTLKMLADPSSTADDFSRPGHIFPLRAMDGGVLRRVGHTE
AAVDLCRLAGCQPAGLLCEILHDDGSMARVPELLKLKEKLGMKLITIKDL
VAYRMQRSKLVHRAVESKLPTAYGEFKLIAYETIVDQQNHMAFVKGDVGN
GEPVLARVHSQCATGDTFGSLRCDCGHQLETALRMIEKEGRGVLIYLMQE
GRGIGLINKLKAYNLQDEGFDTVEANEKLGFKPDLRDYGIGAQILQDLGV
RKMRLMTNNPKKIVGLEGYGLEIVERVPLEIEPNEVNRHYLQTKRDKLGH
MIQMASGNERILFERLADEQLKQHKKD
>CT0747 ribD, riboflavin biosynthesis protein RibD
MATNPALAARPEDETYMWRCLELAERGAGSVSPNPMVGSVIVCAGRVIGE
GWHRQYGGPHAEVDAIASVEDESLLRQSTLYVNLEPCSHYGKTPPCADLI
VEKRIPRVVVGCLDPHEKVAGKGIARLREAGIEVTVGVLEAESERLNEAF
MTSHRKGRPFVALKTAQTLDGRIATSLGASKWITGEESRCQVHRLRCIYD
AVLCGASTVLADNTELTVRFCAGRQPLRVLLDPRLQVPHEFRIFNDAAKT
LVFALREEADPDLVRQLEARGIEVATVGEAEGSLDFAEVFAELHKRRLLS
VLVEGGSRLASAMVRSGFVDKYYVFIAPKLFGGDGLASFGALGVTHPDYA
VKLSFSGIRRFGEDLLLEAYPVH
>CT0756 ribE, riboflavin synthase, alpha subunit
MFTGIVKDVGAIAASARQGSGMRLKVRYTSEAEFGDLAIDESVSINGACQ
TAVAVGPGWFEVDTVAETLKKTTLGSFRPGTKVNLERAVRPMDRLGGHFV
LGHVDGVGRVLRIEEVGGSRMISVAFDSRFDAWIVSAGSIAIDGVSLTVA
SVEPGQFTVAIIPYTFGHTTITGLAAGSEVNLEFDILGKYVARQHTAAAA
PSQEPSRITESWLSGQGFA
>CT0244 ribF, riboflavin biosynthesis protein RibF
MRVVVLQGDTVLDSVNGLPVQLSPEPSAVTIGSFDGLHVGHRKIVGSMIG
HARELGLRSVVVTFEPHPRIVLEGGDGCPVRLLTTFDEKISQFSSMPIDL
LFVVRFDRQFASKSSEAFIREVLVKMLGARHVTVGYDHGFGKRRSGSEET
LHMLGAECGFGVDVVGEVIVAGSPVSSTRIRGLLEAARIRDANECLGAPY
AISGTVVEGDKLGRTIGFPTVNLALPDRCKMVPAHGVYAASIEIDGREYS
AMMNIGRRPTVSEDGEVRVEAHIIGFSGDLYGRFLIVRMLDFIREERRFA
SIDELRTQLELDKKEAGFCKK
>CT2037 ribH, 6,7-dimethyl-8-ribityllumazine synthase
MQVQNIEGSLNASGLKFALVVSRFNDFIGQKLVEGAIDCIVRHGGSADEI
TVIRCPGAFELPSVTRKAMLSGKYDAIVTLGVIIRGSTPHFDVIAAEATK
GIAQVGMEAAIPVSFGVLTTENLEQAIERAGTKAGNKGFDAALAAIEMAN
LYKQL
>CT1165 rimM, 16S rRNA processing protein RimM
MELFLTGVVLKPKGLKGELKVKPVTDFPESFLTRREYYIGKTPEDAVLRK
VQSARFHQGFAWLVFEGAGSREGAEALVGCGLYVTRDALVAMPDDRAYIH
ELIGLDVFDETEGRVGKISDVLQMPAHDVYEVDTGDRKVLIPAVEDFITE
TDLEKGMVRLKRFKEFL
>CT2077 rlpA-1, rare lipoprotein A
MKKRFILYSHLVLALSFFMSVSPFNAFASQNGAIAQSSPQKNSSSRNAVL
KFTTAKPLDNGNRFMVAIGKASYYTNRFRGQTTANGETFDMKEFTAAHRS
LPFGTIVRVTNLNNGKMVFVKINDRGPYVKNRIIDLSKAAAKQLDLVDSG
VGRVKIEAYN
>CT2273 rlpA-2, rare lipoprotein A
MKTQKRIGALLLMTLLSLQACSTYRSTTTPPAQRISPEEAYRLGKLKNKP
YLIDGRLYVPMSFDQVYGYEETGLASWYGKETLDQHNGQPTAYGEIFDPD
LPSAAHKYLPLPMIVRVTNLENNTSIIVRVNDRGPFVDGRVIDLSAAAAK
ALGFYGKGTARVKIESVYR
>CT1299 rluD, ribosomal large subunit pseudouridine synthase D
MTLQVSKVQTPMRIDRYLTQQVENATRNKVQKAIEEQRVLVNGKPVKSNY
RIKSCDHIHITFLRPPAPELAPEDIPISILYEDDDLMVIDKEAGMVVHPA
FGNWTGTLANAILHHLGKDADDLDKTEMRPGIVHRLDKDTSGLIIVAKNP
VALHKLATQFARRQVEKVYKAIVWGVPNPPSGTIKTNIGRSHKNRKVMAN
FPFEGAEGKHAITDYLVVEDLAYFALMDMTLHTGRTHQIRVHLQHLGHPI
LGDQTYGGATPRTLPFSKSEPFTHNLLELMGRQALHAETLRFRQPTTGEP
LAFTAPLPEDMKTVLEKIRSVMKRTTQTL
>CT2119 rnc, ribonuclease III
MSLQFLRSEASDGAGETSDASSADFLLDPQTATHLARLTGRPCNRLIYRT
ALTHRSVLHDHHSEEHKPESNQRLEFLGDAVLDLLISEHLFKQFPGSDEG
HLSSNRAKIVNRKSLAAFALELQLGEHLIIGESADKQKIRTSESALADAL
EALVGAIYLDQGLAGAERFITNHVIAKVDLHKLVEAEYNYKSRLIEYTQS
RQLPPPLYTVITEEGAEHEKTFVVEVSCNGQPLGRGTAPRKKDAEQLAAK
EAMKRLESGDLGNLNEPSPQNS
>CT1612 rnhA, ribonuclease H
MEKTITIYTDGACSGNPGKGGWGALLMYGSSRKEISGYDPATTNNRMELM
AAIKGLEALKEPCRVQLYSDSAYLVNAMNEGWLKRWVKNGWKTAAKKPVE
NIDLWQEILKLTTLHRVTFHKVKGHSDNPYNSRCDELARLAIKENS
>CT2261 rnhB, ribonuclease HII
MLSTDFEYPLWESLSQVCGIDEAGRGPLAGPVVAAAVVFPRHFRPTGIFA
KLDDSKKLTAELRDELALAIRESAESWALGVVDAETIDRINILQATMLAM
NLAVESLGSTPEFLLVDGNRFRPVLPIPYQTIVKGDSKVFSIAAASVLAK
THRDELMTTSAAEYPEYGFEVHFGYPTARHVEAIARHGRCAIHRQSFKLR
KLGEK
>CT0290 rodA, rod shape-determining protein RodA
MMEKYNKFDLLWLLVPLSGLIVMGLMAVYSATNGSTESVPLFYKQLSWAV
TGAIAASIIYFMDYRVVKDNAYFMYAAGIILLIAVLVFGKKVAGATSWVR
FGMFSFQPSELTKMFTIIAMARFLSDDQTDIGNMMDLGKALAIALVPAGL
IMLQPDMGTTLTCLSFIVPMIVLAGFDLYYILLGVVPVALMLSGFFNLTI
LATIAVLSMVMFFLLRKKFYLHQFLVTGGGLLGGLLTWKFTSVILKPHQI
KRIQIFLDPTADPRGAGYNALQAKIAIASGGIFGKGFLHGTQTQLRYIPA
QWTDFIFCVIAEELGFLGSTLLLLLFAALVLRLVWMVGAIKNRFVELLLA
GYASLLLTHVVINIGMTIGVMPVIGVPLPFISYGGSSLVANMMMVGLAMN
FSKNRRHIGY
>CT1670 rpe, ribulose-phosphate 3-epimerase
MPGKTTLLAPSILSADFTNLKASVELAEKAGADWMHCDVMDGIFVPNITF
GSFIVQAIKQCTSIVIDTHLMIVDPDKYIEDFAKAGSDQITVHLEACPHL
HRTIQLIKSLGVKAGVSINPATPVSLLEPVLADLDLVLLMSVNPGFGGQK
FIPGAIKKIMQLDAMRMEMNPEMVIAVDGGVTEENAAMIVDAGADALIAG
TAFFRAPDPVAAAKKLKGLE
>CT1050 rpiB, ribose-5-phosphate isomerase
MKIAVGSDHAGVELKKFVLSWLEKHCYDFEDMGPYSAESVDYPDYGHKVA
EAVARGDFDQGILMCGTGIGISIAANKVKGVRAANVCNPEYAALARQHND
ANVLAFGARFNDDSSAAKILESWFASEFEGGRHQRRVDKIEFCC
>CT0152 rplA, ribosomal protein L1
MSMAGKNYRNASAKVNRAQEYELAEAIEKVKEITTTKFDATVDVAIKLGV
DPRHADQVVRGTVMLPYGTGKTVSVLVVCKENKAEEAREAGADFVGFEDY
IEKIQNGWTDVDVIVATPDVMGQLGKVARILGPRGLMPNPKSGTVTMDVA
KAVKEVKAGKIEFRVDKAGNIHAPVGKVSFDSANLAGNITSFIKEVVRLK
PSAAKGQYLQGITISSTMSPGVKVKKDKFVA
>CT2186 rplB, ribosomal protein L2
MAIRKLAPVTPGTRFASYAGFDEITKSTPEKSLLVPIKRTGGRNSTGRVT
SRHMGGGHKRFYRIIDFKRNKDNVPAKVAAIEYDPNRSARIALLHYVDGE
KRYILAPKNLKVGDRIESGEKVDIKVGNTMPLKNIPIGSDVHNIELKIGK
GGQIARSAGAYAVLAAREGNYATLKMPSGEIRKVRIECRATIGVIGNAEH
ENISLGKAGRSRWLGIRPQTRGMAMNPVDHPMGGGEGKSKSGGGRKHPKS
PWGQLAKGLKTRNKKKASTKLIVRGRKAK
>CT2189 rplC, ribosomal protein L3
MGAILGKKIGMTRLYNDKREAVPCTVIQAGPCYVTQVKSTEKDGYEAYQL
GFGERDEKKVSKPLAGHYKKAGKNPGYILSEVSKSLIVGELEAGATVPVD
VFKEGDKVNVLGVTKGKGFAGVVKRHNFGGGSRTHGQSDRLRAPGSVGGS
SDPSRTFKGTRMAGRMGGKNKTVQNLVIVKVMPESNLIVVKGAVPGPKNS
YVKIVSTTK
>CT2188 rplD, ribosomal protein L4
MELKVLNIQGAETGEVVTLNDEIFAVEVSEHAMYLDVKAILANRRQGTHK
AKTRAEVRGGGKKPFRQKGTGNARQGSTRSGLMVGGGAIFGPQPRTYDQK
VNRKVKQLARRSALSAKAAAGQIVVVDDFSFEAIKTRPVADMLKNLGLAE
KKTLLMMPHHDNVVSTSGRNIEKLNVMVADQASTYDILNSQVVLFQKGAL
QKIEETLG
>CT2177 rplE, ribosomal protein L5
MPARLEVYYRETVVPKLMERFKYKSIMMVPRLEKISVNIGVGEAAQEPKL
LETAMQELGQITGQKPQVRKAKKAISNFKLREGQAIGCRVTLRRKIMFEF
MDRFISVAVPRIRDFRGLSDTSFDGRGNYNVGIREQIIFPEIDIDKVPRI
NGMDISFVTSAKTDEEAYELLSLLGMPFKKKNQ
>CT2174 rplF, ribosomal protein L6
MSRIGKMPIRLADQAQVEVKENMINVTGPKGALSQALVDEVIVSVADGAV
TVQRKDDSKRARAMHGLYRMLVSNMVEGVTKGFTRKLEMSGVGYRAELKG
NLLALTLGYSHMIYFQPPAEIKIEVPDPTTVLVSGIDKALVGQIAAKIRS
FRKPEPYRGKGIKYEGEVIRRKEGKAAGK
>CT2132 rplI, ribosomal protein L9
MKIILRKDVATLGDAGEVVTVKNGYANNYLIPQGYAIRATEGTLKALETE
KKQQARKIELQRTNARELAAKIEQMTLKVLAKAGESGKLFGTVTAGDIAE
ALKAQGVDIDRRKIHLEAPIKALGKYEADAKLFMDITAKLSIEVEAEGAS
EE
>CT0153 rplJ, ribosomal protein L10
MMKRDTKEQIAQEIAEKFQKSQGFYFTEFQGLDVQKMSQLRLEFRKAGIE
YRVVKNTLIKKALKDAADVDKLAAGLKNTTAVAFAYDDPIAPAKIIKKFS
KDNEALKFKMASIDGQVFGSDSLPQLSEMLSKTENIGRLAGLLNNMVASV
PMVMNAVMRNLVSVIDQVGKLEK
>CT0151 rplK, ribosomal protein L11
MAKKITGFIKLQIPAGGANPAPPVGPALGQKGVNIMEFCKQFNAKTQSEA
GMIIPVVITVYSDKSFTFVTKTPPAAVLLLKEAKLQKGSGEPNRNKVGTV
TMDQVRKIAELKRPDLNSIDLEGATQMVIGTARSMGIVVEG
>CT0154 rplL, ribosomal protein L7/L12
MSSIETLVEEIGKLTLTEASELVKALEEKFGVSAAPAVMAGAVMAAPAGE
AAAAEEKTEFDVVLKSAGANKINVIKVVRAITGLGLKEAKDMVDGAPKTV
KEAVSKDEAEKIAKELKDAGAEVELN
>CT1783 rplM, ribosomal protein L13
MSKTLSFKTYSAKPGEVKRTWHIIDAENQVLGRMAAQIANVLRGKHKPQF
TPHIDTGDFVVVTNAAKVALSGKKRDDKTYFSHSHYPGGVRIDSVKDLLQ
KKPEKVIEHAVWGMLPHNNLGRQLFKKLKVYAGPEHPHAAQMPVEMKINQ
>CT2179 rplN, ribosomal protein L14
MIQKETNLVVADNSGAKKVRCIHVFGGTGRRYAALGDQIMVSVKAAVPGG
VVKKKDVCKAVVVRCVKEQKRKDGSYIRFDENAVVLLNAQGEPRGTRIFG
PVARELRDKRYMKIVSLAPEVL
>CT2170 rplO, ribosomal protein L15
MDLSSLRPAKGAVKARKRVGRGPGSGNGTTAGKGNKGQQSRSGYQRPVIE
GGQMPIYRRLPKFGFTPPNQKAVACVNVAQIQMWIEKGLVGEEISVLDLK
HLCNASNQEYFKVLGNGELTSTVTITAHFFSKSAEEKIAKAGGKIVKAYR
TLEEAAKVNGLPFEEALLTPKAKVVKVKKEKKSVKS
>CT2182 rplP, ribosomal protein L16
MLMPKRVQYRKTQRGRMKGNAQRGTAVTFGSFGLKAMEPAWITSRQIEAA
RIAMNRYMKRDGKIWIRIFPDKPVSKKPAETRMGSGKGSPEFWVAVVKPG
RVMFEADGVPREVAVEAFRLAAKKLPIKTKFIVRPDYEG
>CT2161 rplQ, ribosomal protein L17
MRKVKPARKLGRTSAHRKATLSNLSTQLLIHKRIETTEAKAKETRKVVEK
IITKARKGTHHAQREVFGALRDKEAVRELFEEIVGRIGSRNGGYTRIIKL
APRYGDAAKMAVIELVDYAEAPSAAPVVSKQDRAKRVKGSKKAESRSQEN
EGGDAAE
>CT2173 rplR, ribosomal protein L18
MSQVDKTARRQKIKARSRAVVRGTQERPRLCVFRSLSQIYAQLIDDESGK
TLMAASSMSKENAGLTGTKSEVSAAIGKQIAEKALAQGISRVVFDRNGFR
YHGRIKALADGAREAGLIF
>CT1163 rplS, ribosomal protein L19
MDQLIQLVEATQQRNDIPEVRPGDTVRIQLKVIEGEKERLQAFEGVVIGD
KGMGASKTITVRKISHGVGVERIIPVNSPNIESVTVVRSGKARRAKLFYL
RKRTGKAALKVKERKSASAEA
>CT2129 rplT, ribosomal protein L20
MPKSTNSVASKARRKRILKKAKGYWGSRGNVLTVVKHAVDKAEQYAYRDR
RVKKRNFRSLWIMRINAAARQNGVSYSRLMDAIHKKNIEIDRKALAEIAV
KDPAAFSLIVKTALD
>CT1505 rplU, ribosomal protein L21
MQALIEISDKQYLVKAGDKIFVPKQKAAAGDVIEVKTLMQVNQADSALKA
GTATIKVLEHVRDETIIVFRKKRRKRFQKRNGHRQHMTQVEVLSL
>CT2184 rplV, ribosomal protein L22
MQAKAILRHTPTSPRKMRLVAGLVRGKRVDQAKAILHNSTKSASRNVMVT
LKSAVANWSQLNPDERLNDNELFVKAIFVDEGPSLKRLLPAPMGRAYRIR
KRSNHLTIVVDKVENKVTK
>CT2187 rplW, ribosomal protein L23
MKNPLLRPWLTEKSTKLTEQKGQYVFQVKIDADKFDIKKAVEEKFGVDVV
SIRTINCLGKSKRQYTRKGLIAGKKSDWKKAIVTLGDGQTIDYYAKPAEK
SEK
>CT2178 rplX, ribosomal protein L24
MKTGIKKVKLHVRKNDEVTVIAGNDKGKSGKVLKVFPQKGRVIVEGVNIR
KRHMRPTQGMPQGAIIEREFPIHVSNVKKS
>CT1506 rpmA, ribosomal protein L27
MAHKKGGGSTKNGRDSNPKYLGVKAAGGSVVNAGTIILRQRGTAIKPGNN
AGLGRDHTIFALVDGTVHFRNGRNNKKRVDIIPS
>CT1611 rpmB, ribosomal protein L28
MSKVCVLTGKRPKYGNNVSHANNHTRTRFEPNLHTKRIWIEEERRWVKVR
LTAKAMKIMSKTGTAELAKLLK
>CT2181 rpmC, ribosomal protein L29
MKNYEIAAMDKKELLSKIKELENRLADLNFYQAIEPAQNPMVFRNLKRDI
ARMKTRLTQIDRQEKSNA
>CT2171 rpmD, ribosomal protein L30
MSEKMIKVTQVRSVIGGTKKQKDTIKALGLGRPNHKVEIKDNACTRGQIR
VVQHLVKVEEL
>CT1576 rpmE, ribosomal protein L31
MKPEIHPKYTKVTVNCANCGTTFETRSTRNNNIKVDICSKCHPFYTGKQV
LVDTAGRVDRFNKRFAKAAPKASAQ
>CT2112 rpmF, ribosomal protein L32
MANPKAKMSKSRRDKRRAQFNARTKAAVTVVCPNCGEPTLPHRACRHCGH
YKGRQVTGKSVVA
>CT1373 rpmG, ribosomal protein L33
MAKGKENRIVITLECTEAKKEGVPVSRYTTTKNKKNTTERLILKKYNPNL
KRHTEHKEIK
>CT0003 rpmH, ribosomal protein L34
MKRTFQPSNRKRRNKHGFRQRMATKNGRKVLSARRAKGRHSLSVSSSMSA
SKR
>CT2128 rpmI, ribosomal protein L35
MPKMKSHRGACKRFKATASGKVKRERMNGSHNLEHKNRKRTRRLHQSTLV
DSTKEKQIKRMILA
>CT2166 rpmJ, ribosomal protein L36
MKIYSSIKKRCEHCRIIKRKGKRFVICKVNPSHKQRQG
>CT2162 rpoA, DNA-directed RNA polymerase, alpha subunit
MIYQMQMPTKIDVDEATHTGSFGRFIAQPLERGYGVTLGNAMRRVLLASL
PGTAITGIKIDGVFHEFSTIDGVREDVPEIVLNLKKVRFKSNCKRSCKTT
LTLAGPKDFLAGDIVAQEGEFEVLNKDLHIATINSEATVTIDIYIGRGRG
YVPAEENRSDGMPIGFIAIDSIYTPIKNVKLTVENTRVGQKTDYEKMILD
VETDGSITPDDAISLAGKIINDHITFFANFSPTEEEFSEEEYKQLDDEFE
SMRKLLQTKIEDLDLSVRSHNCLRLAEIDSLGDLVSRREEELLNYKNFGK
KSLTELKEQLEKFNLKFGMDITRYQLKG
>CT0155 rpoB, DNA-directed RNA polymerase, beta subunit
MKVADATPTPCIDFSKIQSIINPPDLLKVQLDSFHNFIQDSVPLEKRKDQ
GLEKVLRSAFPITDTRGLYLLEYISYSFDKPKYTVEECIERGLTYDVSLK
IKLKLSYKDEADEPDWKETIQQEVYLGRIPYMTDRGTFIINGAERVVVAQ
LHRSPGVVFSEAVHPNGKKMYSAKIVPTRGSWIEFQTDINNQIFVYIDQK
KNFLVTALLRAIGFAKDEDILGLFDLVEEVEVSSKSSKREQLLGQYLASD
IIDMTTGEVVPARAAITEEIIDQIVAAGYKTVKVMKTTSPEKGVDKSVII
NTILNDSSATEEEALEIVYEELRSNEAPDIDAARSFLERTFFNQKKYDLG
DVGRYRIQKKLQNELAELSAYLEKRPELKELSDAIYERILQTISTYSEEP
IGEDILVLTHYDIIAVINYLIKLINGMAEVDDVDHLANRRVRSVGEQLAA
QFVIGLARMGKNVREKLNSRDTDKIAPADLINARTVSSVVSSFFATSQLS
QFMDQTNPLAEMTNKRRVSALGPGGLTRERAGFEVRDVHYTHYGRLCPIE
TPEGPNIGLISSLSVYAEINDKGFIQTPYRVVENGQVTDKVVMLSAEDEE
NKITVPVSIELDENNRIAAESVQARTKGDYPLVLAEEVNYMDVSPVQIVS
AAAALIPFLEHDDGNRALMGANMQRQAVPLLVSEAPIVGTGMEAKVARDS
RAVIVAEGPGVVQCVTADRIEVRYDLDPENNTVSLLDPDEGVKVYKLIKF
KRSNQDTCISQRPLVHNGQRVNAGDVLADSSSTDNGELALGKNVLVAFMP
WRGYNFEDAIVLSERLVYDDVFTSIHVHEFESSVRDTKRGEEQFTRDIYN
VSEDALRNLDENGIVRIGAEVKERDILVGKITPKGESDPTPEEKLLRAIF
GDKSSDVKDASMHVPAGMKGIVIKTKLFSRKKKIGMDVKERMEAIDKQFD
LKEADLRSRFAKWMKQYLNGKKSVAITSDKGKVLVPEGTVIDEALLAKFN
GLPFLESIDLSKGIVSGAKTNENVVRLIREYRLKLKDLSDERENEKYKIN
VGDELPPGIEELAKVYIAQKRKIQVGDKMAGRHGNKGVVGKILPIEDMPF
MEDGTPVDIVLNPLGVPSRMNIGQLYETSLGWAGKKLGVKFKTPIFSGAT
YTEVQEYLEKAGLPGHGKVKLFDGRTGEQFHDEVTVGYIYMLKLSHLVDD
KIHARSIGPYSLITQQPLGGKAQFGGQRFGEMEVWALEAYGAANILREML
TVKSDDVIGRNKTYEAIVKGQNLPEPGTPESFNVLIRELQGLGLEIRIDD
RVP
>CT0156 rpoC, DNA-directed RNA polymerase, beta-prime subunit
MIFSQGSSPLKGDFSKIKFSIASPESILAHSRGEVLKPETINYRTFKPER
DGLMCEKIFGPTKDWECYCGKYKRVRYKGIICDRCGVEVTTKSVRRERMG
HISLAVPVVHTWFFRSVPSKIGALLDLSTKELERIIYYEVYVVINPGEPG
EKQGIKKFDRLTEEQYFQIITEYEDNQDLEDNDPAKFVAKMGGEAIHMLL
KGLNLDEIALNLRKVLKESGSEQKRADALKRLKVVEAFRKSYEPQKRTRK
KSTGLFPEEDSPELYIYEGNKPEYMVMEVVPVIPPELRPLVPLEGGRFAT
SDLNDLYRRVIIRNNRLKKLIDIRAPEVILRNEKRMLQEAVDALFDNSRK
ANAVKTGESNRPLKSLSDALKGKQGRFRQNLLGKRVDYSGRSVIVVGPEL
KLHQCGLPKSMAIELFQPFVIRRLVERGIAKSVKSAKKLIDKKDPVVWDV
LEKVIDGHPVLLNRAPTLHRLGIQAFQPTLIEGKAIQLHPLVCTAFNADF
DGDQMAVHVPLSPEAQLEASLLMLSSHNLILPQSGKPVTVPSQDMVLGMY
YLTKARFGDVGQGQLFYSMEEVIIAYNEERVGLHAQIFVKYDGKVDQVSD
PVRLVDTLVPEEQAERRAWLKSQIEQKKLLVTTVGRVIFNQHMPEEIGFI
NKLINKKVAKELIAQLSSEVGNVETARFLDNIKEVGFDYAMRGGLSIGLS
DAIVPETKVKHIKNAQRDSAKIIKEYNRGTLTDNERYNQIVDVWQKTSNL
VADESYEKLKKDRDGFNPLYMMLDSGARGSREQVRQLTGMRGLIARPQKS
MSGQPGEIIENPIISNLKEGLTVLEYFISTHGARKGLSDTSLKTADAGYL
TRRLHDVAQDVIVTIDDCGTTRGLHVERNIEEETSGQIKFREKIKGRVAA
RDIVDVINDKVVVKAGEIITDELAAAIQDNIGVEEAEIRSVLTCESKVGI
CAKCYGTNLSVHKLVEIGEAVGVIAAQSIGEPGTQLTLRTFHQGGAAQGG
IAETETKAFYEGQVELEDVKSVEHSIITEDGIEETRQIVIQKNGKLNIID
PDSGKVLKRYVVPHGAHLNVEHGQMVRKEQVLFSSEPNSTQIIAEMPGFA
KFIDIEKGVTYKEEVDPQTGFAQHTIINWRSKLRASETREPRVAIVSESG
EIRKTYPVPIKSNLYVEDGQKIVPGDIIAKVPRNLDRVGGDITAGLPKVT
ELFEARIPTDPAIVSEIDGYVSFGSQRRSSKEIRVKNDFGEEKVYYVQVG
KHVLATEGDEVKAGDPLTDGAVSPQDILRIQGPNAVQQYLVNEIQKVYQI
NAGVEINDKHLEVIVRQMLQKVRVEEPGDTDLLPGDLIDRSTFIEANEAV
AEKVRVIDRGDAPARIIEGQLYKQRDITKLNRELRRNGKSLITIEPALQA
TSHPVLLGITSAALQTESVISAASFQETTKVLTDAAVAGKVDHLVGLKEN
VIVGKLIPAGTGLRKYRSIRLRDNEAEEAEAVEAASDEEI
>CT1193 rpoN, RNA polymerase sigma-54 factor
MAEMRLQQRQTAQLSAQQVMTNQLLQLPLMQLEQRIYDEVQDNPMLELVE
ERRQDDGAAGSTQAATADSAEMFDSVSRFERSSMKVRADGGNRETVSAGR
TSGSASGGEERFFQAVQHDTLHERLLRDLSLQEGIGEREVRIAAEILGNL
DSDGYLTEPLEVIIDGLRQSDIDASEADVREIQQKIWYLDPPGVAVANLR
ERLLVELSVYEHEHDPEAVGVARTILNEAFDDFMNKRFDRLLKKLNLQKR
QLEAAIDVITSLDPHPGEAFFDEGGHYISPDFIVIYENGALTAVLNDRSS
LSVRVSDEYREVLAKRKVPKEDRQFMRQKLQRANEFVTALQVRRQTLLKV
MEALLVAQAKFFIDGPRYLQPLVMKTIAEQTGYDISTISRAVNGKYVQTR
FGVFELKYFFSGAVSTDEGEELSSRIIKQQLRDLIEGENPVEPLSDDRLA
ELLSGKGVQIARRTVAKYREQMQIPVARLRKKIG
>CT0288 rpsA, ribosomal protein S1
MESLYTSTLSEISEEEIVKGRIVSISNKDVTIDVGFKSEGIVSLLEFRDD
DEIKVGDDVEVYLENIEDKMGQLILSKRKADVLRIWDKIYDSIENDTIIN
GKIINRVKGGMTVSLSGVEAFLPGSQIDVKPVRDFDALVGKTMDFRVVKI
NPVTQNIVVSHKVILEEEYAAKREEMLANIKVGMVLEGTVKNITDFGIFV
DLGGLDGLVHITDITWGRINHPSEVVELDQPIKVVVVGFDENTKRVSLGM
KQLEAHPWENIEIKYPVGIKATGRVVSITDYGAFVEIEKGIEGLVHISEM
SWTQHIKHPSQFVSLNQEVEVVILNIDKEHTKLSLSMKRVSEDPWIALSE
KYVEGSLHKGTVSNITDFGVFVELEPGVDGLVHISDLSWTKKIRHPSELV
KKGQELEVKVLKFDVNARRIALGHKQINNDPWGEFEQKYAVGAECTGAIS
QIIEKGVIVILPGEVDGFVPVSHLLQGGVKDIHSSFKIGDELPLRVIEFD
KENKRIILSALEYFKDKSKEEIEAYLQAHPNEKKEIEAASAELEPQPKGR
>CT1781 rpsB, ribosomal protein S2
MPTKFQLEEMLRAGVHFGHLARRWNPKMKPYIFMEKNGVHIIDLKKTLVM
AEEALKAIEAIASTGREIMLVGTKKQAKVIIAEQAERAGMPYVCERWLGG
MLTNFSTIRQSIRRMNAIERMETDGTFDMITKKERLMLIREKDKLVRILG
GIANMNRLPAALFVVDIKKEHIAVKEARSLGIPIFAMVDTNCDPDEVDYV
IPANDDAIRSIDLMVKAVADTILEARTLQVEQEVLAEMDEAAEEETAND
>CT2183 rpsC, ribosomal protein S3
MGQKVNPTGFRLGIIKDWTSRWYDDGPVIAEKIKQDQVIRNYVHARLKKE
KAGIAKIVIERTTKHIKINIYAARPGAIVGHKGEEINNLSQELTRITGKE
VKIDVIEVIKPEIEAQLIGENIAYQLENRVSFRRAMKMAIQQAMRAGAEG
VRIRCAGRLGGAEIARAEQYKEGKIPLHTLRANVDYASVTAHTIAGAIGI
KVWVYKGEVLVQRLDAIEEEEMKKMQERRGDSRGRGRGDGRGAKRRRRPA
KKA
>CT2163 rpsD, ribosomal protein S4
MARFRGSITKVSRRLGIALSPKAEKYLERRPYAPGQHGQSRRGKVSEYAL
QLREKQKMKYLYGILEKQFRNYYKKAVAQRGVTGDNLVKMLERRLDNVVY
RCGFSPSRAGARQLVTHGHMLVNGKKVNIPSFLVSPGDQIEFRQKSRNLD
AVADSLNKVPDSRIPEWIQVDKANRKAVFLAIPEREAVQEPFNEQLVVEL
YSK
>CT2172 rpsE, ribosomal protein S5
MSKKSGRNIKPGELNLKEKLVHINRTAKVVKGGKRFGFNAIVVVGDKEGH
VGYGLGKANEVQDAIAKGVEDGKKNVIKVPIVKGTIPHPIVAKYGSAKVL
MKPATPGTGLIAGGAVRAVLEMAGIHDILTKSLGSSNPHNVVKAAIKGLQ
NISDANDVAERRSKSLKEVFES
>CT2135 rpsF, ribosomal protein S6
MNTLKQYECTVIIDGGLQDDAIAAVMELVKKTVTDKGGVINNVLEVGRRK
MAYLIRKTSIGYYAHIEFDAVPSVIAEIERVFRYEEAILRFLIIQLSSPL
LEMRKRVEKYSVMLGSPEDQAEAEADADAKN
>CT2193 rpsG, ribosomal protein S7
MAKKGSGYGPRGGDFRYNDEAVARLINAIMLDGKKVVATKIVYDAFDIIA
NKVEGGDALEVFRKAMGNVAPLVEVRSKRVGGATYQIPMEVPASRRTALA
FRWIKQFAARRGGRSMAEKLAAELLDASNEQGASVKKRDEVHRMAEANKA
FAHFRF
>CT2175 rpsH, ribosomal protein S8
MPVTDSIADFITRIRNAGSAKNKTTDIPYTRVRENLSKLLLEKGYIKNYT
VITSEQFPFIRVELKYMQDGQHAIKEISRVSKPGRRVYQGKDIKRYLGGL
GLFILSTSKGILTDKEAREQNVGGEVLFRIY
>CT1782 rpsI, ribosomal protein S9
MKEVIDTVGRRKTSVARVFMSPGKGKIVVNKLPVEEYFKDEFKRSQALKP
LAVAEKQNDFDITINVKGGGLTGQSGAVSLAIARALVEFDESIRAALRPD
RLLTRDPRMVERKKYGKKKARKSFQFSKR
>CT2190 rpsJ, ribosomal protein S10
MAVQQKIRIKLKSYDHSLVDKWALKIIDVVKQTEAIIFGPIPLPTKTHVY
TVNRSPHVDKKSREQFAFSSHKRLIEIINPTARTIDMLMKLELPSGVDVE
IKS
>CT2164 rpsK, ribosomal protein S11
MATASRKKKKVKVTPEGTVHIKASFNNVMVTITDTLGNTVSWSSAGKNGF
KGSKKNTPYASQVTSEAAAKEAYDLGMRYVDVLIKGPGSGRDAAIRALQG
VGLEVRSIRDITPLPHNGCRPPKRRRV
>CT2194 rpsL, ribosomal protein S12
MPTIQQLIRHGRSMKASKTASPALEKCPQKRGVCTRVYTTTPKKPNSALR
KVARVRLSNKIEVTAYIPGEGHNLQEHSIVLIRGGRVKDLPGVRYHIVRG
SLDTSGVADRKQSRSKYGAKVPKAGAAPAKKK
>CT2165 rpsM, ribosomal protein S13
MRLAGVNLPLNKHAVIALTYVYGIGNTSAKNILAKAGVAPDKKISELSDE
EAHAIREIIGNEYTVEGEARAEQQLSIKRLMDIGCYRGLRHRRSLPVRGQ
RTRTNARTRKGKRKTVAGKKKAGKK
>CT2176 rpsN, ribosomal protein S14
MARKSIIARNEKRKKLVEKYAAKREELKAAGDYQALSQLPRDSSATRLRT
RCVLTGRGRGNYRKFGLCRNMFRKLALEGKLPGVRKASW
>CT0245 rpsO, ribosomal protein S15
MGLTKEHKTEIITKFGDSATDTGKAEVQVALFTRRITDLTGHLQQHPKDK
HSRRGLLMLVGKRKRVLNYLKKVDIERYRKVLADLDLRK
>CT1166 rpsP, ribosomal protein S16
MVKIRLKRAGRKKMPFYQIVAADGRAPRDGKFLEVLGHYNPTAKPHTVTI
EKDRVAYWLNVGAQPTDTVHSLIRGTGLLHEMNLKRRGLSESDIAAQMEA
WRQKEAERRQKRLNAKLRRRQAKKAAEAAGSAEG
>CT2180 rpsQ, ribosomal protein S17
MSSGAETRGRKKSWLGKVVSDSMDKGIVVAVERRVQHPVYKKYFKKTTRL
MAHDENNEAGVGDLVRITECRPLSKNKSCRLVEIVEKAK
>CT2133 rpsR, ribosomal protein S18
MSNALASKKKVSKNQVVFFDYRDERKLKRFINDQGKIIPRRITGLSAKEQ
SLLTHSIKWARFLAIIPYVADEYK
>CT2185 rpsS, ribosomal protein S19
MPRSLKKGPFIEFKLEKRILDMNSKGEKKVVKTWSRSSMISPDFVGHTVA
VHNGKTHVPVYVTENMVGHKLGEFAPTRLFRGHAGGKAEKGGSAPRKK
>CT0261 rpsT, ribosomal protein S20
MPLHKSAEKRLRQAARRNERNRARKKELKGVLKNMQKLIDANAAKSEVEA
AYKAAVQKLDRLGVKRYIHPNKASRKKAQLTKALNNYTPTAS
>CT1919 rpsU, ribosomal protein S21
MVSVQVNENESIDKLLKRFKKKYERAGVLKEFRKKAYFVKPSIEKRLKRS
RSKRRAQRANEERNS
>CT0987 rsbW, anti-sigma factor
MSYYWLSLPSMIEEIPRLRHFLGVVARIEGYRDAFILDLELTVHEAFVNA
VRHGNHGNAAFPVTITLEAGDIDGERFLEVRVQDCGEGFHPERAIAVICS
SRNATAFGGRGLLFVDRFVESYRIEQASGGCVVVLRYIPY
>CT0479 rsuA, ribosomal small subunit pseudouridine synthase A
MKKQVQEEKVRINKYLAMCGVASRRAADQLVLEGKVSVNGHIADEPGFKV
DPRNDEVIVDGRLMATPEARKVYILFNKPRNVITTNHDERDRQQILDFID
VPERVFPVGRLDRKSTGLLLLTNDGTLAHRLMHPSSQVQKEYLAALDAKF
PPAMLQKLTGGMRLKDTGEKVSPCRAKILDDGMSVLVSIHEGKNHQVRRM
FSTLGFEVKRLDRVAYAGLTLGELRRGEWRFLSRNEVEKLYRLCGGH
>CT0262 ruvA, Holliday junction DNA helicase RuvA
MFAFLRGELVTVSREEAVVEVSGIGYLLHISSGTSRRLPPEGSQVRLFTH
HYVREDAQQLFGFLDEEELQLFRLLLTIGGVGPKLAMAVLSGLSVGEIQE
AIVANRPETLYGITGVGKKTASRIILELRDKILKIQPAASGKTAGAPQAL
QLNEDALAALMTLGFPKPAAQKAISGILETSPGLSVEEVVRAALIAIHNN
F
>CT1630 ruvB, Holliday junction DNA helicase RuvB
MRIEALNTAPDATEARFEEQIRPQKMGDFAGQKKLIDNLKVFITAARKRG
EALDHVLLSGPPGLGKTTLAHIIAAEMGGSIKITSGPLIDKAGNLAGLLT
SMKKGDILFIDEIHRLAPAVEEYLYSAMEDYRIDILLDSGPASRAVQLKL
EPFTLVGATTRAGLLTSPLRARFGINSRLDYYNPELLQSIIIRAAGILNI
GIDEDAAMEIARRSRGTPRIANRLLRRARDFAQVAGDASISLAVARRTLE
SLEIDEGGLDDMDKKILEAIVRKFNGGPVGLASLAVSVGEEQDTIEEVYE
PYLIQMGYLSRTPRGRVATRLAMSRFAHPGISSQGSLFDTAEDG
>CT1664 ruvC, Holliday junction resolvase
MIVLGIDPGSRKTGYGVIAETAAGYRVLGCGLVRPRAADTLHERISQLCA
GLDEVIEQLKPEAVALETAFVGRNVRSALILGQVRGAVLATVMRHSLPVR
EYAPREIKLSVTGTGSACKEQVAAMLSRMLELGGELKPLDVTDALGIAYC
DLARGASSLGGQLRKNGKGRSKGWAAFVNEHPELMA
>CT0721 sahH, adenosylhomocysteinase
MTTEAAVLDYKVADISLAEWGRKEIEIAEKEMPGLMATRKKYEGKKPLAG
ARIAGSLHMTIQTAVLIETLVELGADVRWASCNIFSTQDHAAAAIAAAGV
PVFAWKGETLDEYWWCTRQILEFEGGLGPNLIVDDGGDATLMIHFGYKIE
NDPSMLDKTPGNAEEKALLQQLKAVFAEDNQRWHKVAAGMKGVSEETTTG
VHRLYQMMEKGELLFPAINVNDSVTKSKFDNLYGCRESLADGIKRATDVM
IAGKVVVVLGYGDVGKGCAHSMRSYGARVIVTEIDPICALQAAMEGFEVT
TMEEAVKEGNIFVTATGNKDVITLDHIKQMRDEAIVCNIGHFDNEIQVDA
LNNFKGATRINIKPQVDKYVFENGNCIYLLAEGRLVNLGCATGHPSFVMS
NSFTNQTLAQIELWQNDYKVGVYRLPKKLDEEVARLHLGQIGAKLTTLTK
EQADYIGVPVEGPYKPEHYRY
>CT0862 sat, sulfate adenylyltransferase
MALVNPHGKEKVLKPLLLTGDELVSEKERAKSMKQVRLSSRETGDLIMLG
IGGFTPLTGFMGHADWKGSVETCTMADGTFWPIPITLSTSKEQADTIAIG
EEVALVDDESGELMGSMKVEEKYCIDKAHECREVFKTDDPAHPGVLMVMN
QGDVNLAGPVKVFSEGSFPTEFAGIYMTPAQTRKMFEENGWSTVAAFQTR
NPMHRSHEYLVKIAVEICDGVLIHQLLGKLKPGDIPADVRRDCINVLTEK
YFVKGTTIQAGYPLDMRYAGPREALLHALFRQNFGCSHLIVGRDHAGVGD
YYGPFDAHHIFDQIPEGALETKPLKIDWTFYCYKCDAMASMKTCPHEPAD
RLNLSGTKLRKMLSEGEEVPEHFSRPEVLEILRRYYAGLTEKVDIKMHSH
AIGK
>CT1147 sbcC, exonuclease SbcC
MIIQKLRFANLNSLQGEWEIDFTRPEYLSDGLFAITGPTGSGKSTILDAI
CLGLYGQTPRLGRITKSSNEIMARHSGDCFAEVTFATASGAYRCHWSQHR
SRHKRGGELQSQKHEISDAVTGALIQTKLQETLQEVEARTGMDFDRFTRS
MLLAQGAFTAFLQADADQRAPVLEQITGTKIYSVISMKVHERRRDELARL
ERLQLECEGIRLLGDEERRALETERAECIEKERELNAGVETESAAQRWLQ
EIARLEAALAAIGSEAAALDAEREAFRTDEERLRLARSAAAVEPQYAVLK
LKRAQLRRDSEELAAKEAGRPKMEQRWLEAEAQHEATEKALASAQESVTS
ARPLIAQVRQLDAVIEGKRQEAERRKRELEALEERRGRLDAERAAASASV
ESARNGLQKLRAWQTEHHADASLVGALSGIRHAFDELHPAAERERAVASG
IASVQQAIGQLRDEIAKIELESKASQAAFDEARKALDERKASFTARLDGS
TLKALRVELDLLRDRRHLLEEIVALYKSGKELLPKITELGANIETLESEE
ADATRKLEHARELLAHAEREAAAQENLARMADRVRSLEEERRRLVAGQPC
PLCGSEHHPYVEEQVLPESDEAALDVAKRAVQEYSRSVRELEISVAERRT
EIEQMRQWRDDLAAMRETSGRQCLALLEKAGVEAPAKEAEPVVLENLAEN
KRKVEELAGRIEQLDVLEQRIRDDEARLQQLNEAARSDKRRLEQAVDQQR
VNRFDLDRLEREREAAQCELAERRQLLRRLVEPYGVAVGDEIAESLVAEL
EARRDAWRREEERGRSLEESLRIGEVSLRNVAEQIAALDADLAMRREEVE
AAFAGVETSKKERRELFGEKLPDAEEERLESLVQAARDRLAASSERFNAA
RQALAVLDTSIAELRRAVEEEREEVERREAEFGAALAGKGFADEAAFVAA
CLPEAERERLESAATSLEHRKRDLDLKRSDREARLRDELERRLTTLSADE
LAVRIGELQAQLRDVQGRVGAIASRLEAHRQAADEHRSKMAAVEAQKAEC
RRWEMLHSLIGSADGKKFRNFAQGITFEIMIAHANKQLMRMTDRYVLLHD
LSGALELSVIDQWQAGEVRSTRNLSGGESFIVSLALALGLSQMSSRNVSV
DSLFLDEGFGTLDEEALETALKTLGTLQQSGKLIGIISHVPALRDRIATH
IKVTPLTGGRSAIEGPGVSGPR
>CT1148 sbcD, exonuclease SbcD
MPLRILHTSDWHLGRSLFGRSRLHEFEAFLDWLAGTVESRGIELLVVAGD
IFDTTTPGNATQKLYYQFLNRIALTPGSPCRHVVITSGNHDSPTFLDAPK
ELLRAFDVHVLGATTGDPADEVRLLRDRQGSPEAIVCAVPFLRDRDIRTV
EPGESLEDKIRKLSDGVSLHYAEVARIAAERRRSLGADIPVIAMGHLFTA
GCVTVEEDGVREIYAGALSIVCSTAFPPVFDYVALGHMHVPQRVDKTEHM
RYCGSPIPMGFGEAKQQKLVLQVDFEGREPTVTEIEVPCFQPLERLAGSL
DELSAAIAALRSAGSNAWLEIDYRGDEAPATVQEAMEQAVAGSLLEIRRI
RNNRPLQQALRHSAANETLQQLDPEAVFNRCLEAYGTDESLRPALVESYR
EVLRSIREEDVMAEKEGKS
>CT0136 scpA, segregation and condensation protein A
MFRISLEEFEGPLDLLLFFIKRDELDIYNIPISKITGDFIAYIHAMRRLN
LEVAAEFIYMASMLMSIKARMLLPRAEPVDGEADEFDPRTELVQRLLEYK
RIKEGASELELMALDRERMFPRGYFEELEPAVIDEMDEPVNRPTLYHLML
AYQSVLDNMPKVRTQNVTDAPVTVEEQSALIMARLGERLQVSFTSLFQEF
REAIVIVVTFLAVLELCRNRKISVIVKEGVNDFWISQRDHAE
>CT0478 scpB, segregation and condensation protein B
MQEQRQQLLRSLEALIFSSEEPVNLQTLSQITAHKFTPSELQEAVDELNR
DYEATGRTFRIHAIAGGYRFLTEPEFADLVRQLLAPVIQRRLSRSMLEVL
AVVAWHQPVTKGEIQQIRGASPDYSIDRLLARGLIEVRGRADSPGRPLQY
GTTEVFLDLFHLPSLKDLPKLREIKEILQEHEEQQYLAADGDLPVAADED
EKPRMERIE
>CT2267 sdhA, succinate dehydrogenase, flavoprotein subunit
MIQLNANAPGVPLADQWDAYKAGCKLVSPNNKRKLDIIVVGTGLAGASAA
TTLGQLGYNVKSFCYQDTPRRAHSIAAQGGINAAKNYQNDGDNVFRLFYD
TIKGGDYRSRESNVYRLASISPEIIDICVAQGVPFAREYGGLLANRSFGG
AQVSRTFYARGQTGQQLLIGAYSAMSRQIAAGTVQLYSRRDVLDIVVVDG
KARGIIARNLVTGEIERHSAHAVVLATGGYSNVFYLSTNAMGSNATPAWS
AYKKGALFANPCFTQIHPTCIPVHGEFQSKLTLMSESLRNDGRIWVPKEK
SDAELIRQKKLRPEQIHESKRDYYLERRYPAFGNLVPRDVASRAAKERCD
AGFGVGSTGLAVYLDFGDAIERLGRAEISARYGNLFQMYQRIVDDNPYRT
PMMIYPAVHYTMGGLWVDYELMTTVPGLYSIGECNFSDHGANRLGASALM
QGLADGYFVLPYTISNYLSHEINTPPIPTTLPEFHLAARDVTDRLDRLKK
SNGKESVDHFHRKLGKIMWEYCGMSRNEAGLTKALGLIEELKAEFASGVN
IPGGLKEYNPELEKACRVEDFIELGDLMVRDALHRKESCGGHFREEYQTP
DHEALRNDDEFAYVAAWEYKGRNDEPEMHREELRFETVTPSQRSYK
>CT2266 sdhB, succinate dehydrogenase, iron-sulfur protein
MKFTLKIWRQKNADDKGRMVSYKVDDISPDSSFLEMLDQLNQQLIAKGED
PVSFDHDCREGICGACGLYINGRPHGPLKGITTCQLYMRAFRSGETICVE
PWRSGAFPVVKDLIVDRSALDKIIQAGGYISINSGGLPDANTIPVSKHDA
DAAFDAAACIGCGACVAACSNASPMLFVGAKVSHLALLPQGRIEAARRVQ
QMVAAMDALGFGNCSNTYACQAECPKEISIANIARMNREFLTAKLFAEKE
KNIGFTL
>CT1239 secA, preprotein translocase SecA subunit
MLKIIAKIFGSKHEKDIKKIQPIVDRINEIYGTLNALPDEAFRNKGVELR
KKVRDKLIPFETKIKETEHKLERPDMSHEEHEKLNIELEQLRNKYEEATA
AILDEVLPETFALVKETCRRLKGHTYTVMGHEMVWDMVPYDVQLIGGIVL
HQGKIAEMATGEGKTLVSTLPVFLNALTGRGVHVVTVNEYLAQRDMEWMR
PVYEYHGLSTGVILAGLYSNQRRNAYLCDITWGTNSEFGFDYLRDNMAGS
EEEMVQRDFYFAIVDEVDSVLIDEARTPLIISGPVPNSDTDTKYREIKPW
IEQLVRAQQNLVATLLDQAEKTLKEKPNDFDAGLALLRVKRGQPKNKRFI
KMLSQPGIGKLVQSVENEYLKDNSSRMHEVDDELFYAVDEKANTIDLTEK
GREFLGKLSHQDQDLFLLPDVGSEIAAIEADKNLQPTDKIRKKDEVYRLY
SERSDSLHTIGQLLKAYTLFAKDDEYVVQNGQVMIVDEFTGRVLAGRRYS
DGLHQAIEAKENVKIEGETQTMATITIQNYFRLYSKLAGMTGTAETEASE
FFEIYKLDVVVIPTNHPIARHDQDDLVYKTRREKYNAIVNKVQELNAKGQ
PVLVGTASVEVSETLSRMLRAKRIQHNVLNAKQHAREAEIVAMAGQKGAV
TIATNMAGRGTDIKLGPGVREMGGLFILGSERHESRRIDRQLRGRAGRQG
DPGESIFYVSLEDDLMRLFGSDRVIAVMDKLGHEEGDVIEHSMITKSIER
AQKKVEEQNFAIRKRLLEYDDVMNQQREVIYTRRRKALKMGRLKNDIMDL
LQDYCYTVAKKFHESNDPAGLEEQVLRELSVEFHVEASAFEREPFEQTAE
ALYKAASEFYHRKENSLPDEIMQQIEKYAVLSVIDQKWREHLREIDSLRE
GINLRAYGQKDPLLEYKQEAYKLFVELLREIEHETLSVAFRLFPVSQDES
EEIEARQRRQAVRQERLVAQHAEAESTYKIAADGGMNATLWMPGDEIVVQ
QPVRTEKKPGRNDDCPCGSGKKYKNCCGANE
>CT0149 secE, preprotein translocase SecE subunit
MKKYIEKVDKYYRDVVGEMRKVSWPTREEVKDMTIVVLTVSGILALFTFV
VDWVISTVMGKLL
>CT2169 secY, preprotein translocase SecY subunit
MKLTESIRNINKIPELRQRILYTLLLLFVYRLGSHITIPGVDASAVASAT
TSHSNDLFGLFDLFVGGAFARASIFSLGIMPYISASIIIQLLGAVTPYVQ
KLQKEGEEGRQKINQYTRYGTILIALLQAWGVSVNLSSPSSFGKVVVPDP
GFFFTITAVIILTASTVFVMWLGELITERGIGNGISLIIMIGILARFPAS
LVAEAQSVSFGSKNWIVEIVIMALMVAIVAIVVILTVATRRIPVQHAKRV
VGRKVYGGGTQYIPIKINTAGVMPIIFAQSIMFLPSTFLSFFPESEAMQK
VAAAFAYDSWFYAVVFGLMIVFFTYFYTAIAFNPKEVADTMRRQGGFIPG
VRPGKNTEEFIDNILTRITLPGAIALAVIAILPTFLTKFVNVTPGFANFF
GGTSLLIVVSVGLDTLQQVESHLMMRHYDGFMKTAKARGRR
>CT0173 serB, phosphoserine phosphatase
MHELLLINISGPDKPGLTSKITEVLARYNVPVLDIGQSVIHNHLSLGMLI
EVPKASASAPILKDLLFTAHTLKLEIEFSPISTRDYHKWVGEQGKPRYLL
SLLGRKITSEHLERVTTLVASHGFNIDTINRLSGRLPLKDEGDSGKTKAC
IEFSLRGAPANEEQFRTEMLAITDSLGIDIAFQEDNIFRRTRRLVVFDMD
STLITSEVIDELAKEAGSGELVAAITEQAMRGELDFTESLKKRVGTLAGL
EESTLQKVAERLQLTEGAEHLFYNLHRLGFKTAILSGGFTYFGRYLQKKL
NIDYVFANELEIVDGKMTGNVIGQVVDGKRKAELLEQIAATENIRLEQTV
AVGDGANDLPMLGKAGLGIAFRAKPIVRETAKQAISTLGLDAILYLMGFR
DRDALAV
>CT0606 serS, seryl-tRNA synthetase
MLDITYIRQNPDDVKEMLRRRQQQGDAPKVDRLLERDAERKAMVQRTDDL
KALRNRVSKEIANIKRTGQGSADELIGQMKQVSDEIADLDLALSALEAEI
EELLLTLPNKLHKSVPEGRSAEENVLYKGPVSFEHNLDFPVKNHLELGKS
LGILDFERGAKISGAGFPVYTGKGARLERALINFMLDTHSANHGYTEVFP
PFMVNQESLRGTGQWPKFADQVYHMPEDGLYAIPTAEVPVTNLHRGEILD
ADKLPIAYAAYSACFRREAGSYGKDTRGFLRVHQFNKVEMVRFTRPEASY
EALEEILGHAEAILVALKIPYRVITLCSGDISANAAKCYDIEVWSPAENK
YLEASSVSNFEDYQARRSNIRFKPDSKSKPEFVHTLNGSGLATSRLMVSL
LEHYQTADGKLMVPEALKPYTGFEIIE
>CT0091 sixA, phosphohistidine phosphatase SixA
MKTLYLVRHAKAGWKDPAQSDFDRSLTKQGRRQAEEMSERLRKKGITPER
LISSPAHRALETAEIFADTLGIERREIVQKIEIYEGGIDALAVIVRSLAD
EDNTVMLFGHNPMISHFVQWLTAKPAEAMNTCGIAKIELDCDHWRDTAEG
SGKLVWYKFPERE
>CT1211 sodB, superoxide dismutase
MAYQQPALPYADNALEPHISANTIGFHYGKHHAAYVKNYNGFVEGTPYDA
MSLEEVIIQTASDASKTGIFNNGAQAWNHSFYWNCLTPNGGGEPTGEIAT
KITEDFGSVDKFKEELKNAAATQFGSGWAWLVLDGGKLKVTKTGNAQNPM
TSGQTPLLCIDVWEHAYYLDYQNRRPDHVAAVIENLINWEFINANYAAAK
>CT1019 soxA, sulfur oxidation protein SoxA
MKKTIQRGLFTGALVLMTAMTAKPANAEVNYQALVDADVKAFQGFFRKEF
PDVKLEDFGNGVYALDEDARKQWKEMEEFPPYELDVEAGKALFNKPFANG
KSLASCFPNGGAVRGMYPYFDEKRKEVVTLEMAINECRVANGEKPYAWEK
GDIARVSAYIASISRGQKVDVKVKSKAAYDAYMKGKKFFYAKRGQLNMSC
SGCHMEYAGRHLRAEIISPALGHTTHFPVFRSKWGEIGTLHRRYAGCSNN
IGAKPFAPQSEEYRDLEFFQTVMSNGLKYNGPASRK
>CT1021 soxB, sulfur oxidation protein SoxB
MFRDEPFEADSRRVKVTRGFTNVYGNSNRHLFMNLSRREFLRILGFAGAA
GMLPNLASATKASSDLYDFGRFGDIRLMHITDTHAQLMPIYYREPSMNLG
LGSAFGRPPHLVTEAFLKYYGITPGTPLAHAFTAINYTEAAQRFGKVGGF
AHLKTLVDRMRGDFGADKTLLLDGGDTWQGSGTAFWSRGMDMVEACNLLG
VDVMTGHWEFTYLEDEVLKNLAAFKGDFVAQNIKVKEEALFSGAKAFDEN
TGHAFRPYVVKTLGKHRVAVIGQAFPYTPIANPARFIPDWTFGINARDMQ
ELVDSVRAKEKPDAVVLISHNGMDVDIKLAEVVSGIDVIFGGHTHDGVPQ
PFIVKNARGRTLVTNAGSNGKFLGVIDLKLGDGGVKEFKYKLLPVFANEL
PAHQGMQSLIDKIRAPYLDKLREPLATAGSLLYRRGNFDGPFDQIICDAL
RQRNDAQISLSPGFRWGTTILPGQTITMEHVLDQTCMTYPETYVRDMTGQ
QIKDILEDVADNLFNRDPFYQQGGDMVRTGGLDYRIDPTATMGKRIDNMR
LENGTPVEASKNYRVAGWATVGAKSPGEPVWDTVVAYLKDKKVVEVTKLH
TPELKNVGSNPGIDRS
>CT1016 soxX, sulfur oxidation protein SoxX
MARGFAAGDHGFSLKQVFNSLNKRGHMKSSGIIAAAAVLLLPSLGKAAAP
AAVDSSVEKGKALALDTNKGNCIACHMMGDGEFPGNYGPPLSQMKERYPD
RAVLRKQVWDASAINPKSIMPPFGKNGILNDSEIDNIVDYLYTL
>CT1017 soxY, sulfur oxidation protein SoxY
MGISRRDFCRTIAGSAASFAVLAVMPGRLLASWNEKAFSASKLDEAIAAK
FGSLPIEDSTAIQIKAPEIAENGAFVPVSVSTTIPGATNISIFTPANFSP
MIASFDVLPRMIPDVSLRMRMAKTSNLVVIVQAGGKLYRATREVKVTIGG
CGG
>CT1018 soxZ, sulfur oxidation protein SoxZ
MKIKAVVQNNIVSVKVLIPHPMDTGRVKDQAGALIPAHFITEVTATIGGD
TVFHAELGSGVSKDPYLSFQFKGAKAGDMLKVSWVDNKGASETAEAAITA
M
>CT0879 spoU, tRNA methyltransferase SpoU
MPQARLRQTNLIALVSPERFRKIRELLEKRQTDLTLVMDNVNKPHNLAAI
IRSCDAVGIHQVHAISYRSSIKTKQHTAAGASRWLKVHLNESFVPVGEKL
RQSGMQILVAHYCPEAVNFRSIDYTRPTAIVMGEELKGPSQEALGLADNF
IYIPMMGMVESLNVSVAAALILFEAQRQRQAAGLYDTRHFSDSDIERLLF
EYTYPRLAAVLRDQGKPYPKLNENGAFTE
>CT1644 sppA, protease IV
MADKNRKKGGCFRAGCLTAVVAVLLLVGLGGWFVHQRSNRLPARFVLSVP
LTGELDERPPDAGPFPFGKGRHLLSFEELLTILDRAKTDRRVDSVLLRID
GLGASPAKIQELRSSIAALRKSGKKVTAFLVTPEDKDYQLAVACDSIIVQ
KGSWMTLDGLKAELFFVADPLKKLGVSFQAAQWKKYKSAVETFTRNSASP
ENLEETNALLDDAWSDYLDSVSRQRRIGKDAFRKVVDSLAVLTPEKALGL
HLIDRVATERELEQEYARRLNKPAAELLVGGREYLGATGGMRPQGGGDRI
AVINITGMIVSDGAGGMSEGDGTDVATVKEALQTAIDDLKVKAIVLRIDS
PGGDALAASTMLELLNEAKAKKPIVASMSGLAASGGYMVALAGDKIFAEP
LTITGSIGVFSLKPDLSSLLEKTGIRREVLIRGRFADAETPFRAFDDASF
RKFVELTGTVYEDFIAKVAKGRHMTPAQVDAVAGGRVWSGKRALEVGLID
QIGGLGDAVQEAKKLAGVKKEVKPALLYLPAQKTWLEYLLGSDTSDMVSA
LTAAMVRESLGQLQPLSRLPGAGIARFLLRTDAPQVLALDPVEVVIK
>CT1947 ssb-1, single-strand binding protein
MARGLNKVMLIGHLGNDPERRETASGQSVVNFTLATSEGFKDSSGNLQER
TEWHRIVAWGKLADICSQYLKKGRQVYIEGRLQTRSWDDNKTGDKKYATE
IVCTDMQMLGAKDSGGGTSDASYSQNRPSYSRPSRPEPSSGNYGASPSSG
GAQEFEKDDLPF
>CT2134 ssb-2, single-strand binding protein
MPEINSVIIAGNLTKDPVFRQTNSGGTPVVNFSIACNRRFRDSNHQWQED
VCYVGVVAWNKLAESCRDNLRKSSAVLVDGELQSRTWKAQDGTSRTVVEI
KARRIQFLNKRKKNGEDDEEGFIEDDTHETHHLDDDGHMYEYKYLSSD
>CT0380 sucC, succinyl-CoA synthetase, beta subunit
MNIHEYQGKGILKQFGVAVPKGIVAFSAEEAKQAAEQLFEEQSSPVVVVK
AQIHAGGRGKAGGVKLAKSPEEVFEIAQKMLGATLVTHQTGPEGKEVRRL
LIEEGMNIDKEFYLGITLDRTTSSNVLMVSTEGGMEIEKVAEETPEKLLK
IHVDPVYGLQGFQARKAAFFLGLQGEQFRNGVKFIEALYNAYTTIDASLA
EINPLVITKEGRVLALDAKINFDDNALYRHSDFHDLRDITEEDPLEYEAS
KSNLNYVRLDGNVGCMVNGAGLAMGTMDLIQLSGGRPANFLDVGGGASSK
TVEEGFKIILGDKNVKAILVNIFGGIVRCDRVAGGIIEAAKNIGLKVPVI
VRLEGTNATEAQKMLDESGLNLISAKGLRDAAEKVQKALATA
>CT0269 sucD, succinyl-CoA synthetase, alpha subunit
MSVLVNKDTRLVVQGITGGEGTFHTSQILEYGTNVVAGVTPGKGGILYNG
NEKDQFCRPVPVFDTVQEAVDKAEANATVIFVPAPFAADAIMEAAAAGLK
VIICITEGIPVNDMMKAYSYVQAKGAVLVGPNCPGVITPGEAKVGIMPGF
IHKKGTIGVVSRSGTLTYEAVHQLTEVGLGQSTCIGIGGDPIIGTRFLDA
VKLFAKDDETEGLVMIGEIGGSAEEEAAEYIKKYFKKPVVGFIAGRTAPP
GRRMGHAGAIVSGGKGTAEEKIKAMEAAGIKVVENPADIGEAMLKALGRA
>CT0374 sugE, sugE protein
MAWVMLFVAGLFECAWAVGLKYSEGFTRPVPSVLTIAAMLVSFWLLSVAM
KIVPVGTAYAVWTGIGAAGVAVAGILLFNEPRDLARVFCIFLIIAGVAGL
RVLAGK
>CT1661 suhB, extragenic suppressor protein SuhB
MNLELQTAVKAAKAAGAITLSRFGELSHREIVAKEYKDFVTEVDKQCEAT
ITATITESFPDDGLLCEEGTSGSGASGRTWIVDPLDGTLNFIHSFPVFGI
SIAMRDASGELAVGVVYQPVLDELFTAVKGRGAFLNGKRISVSTREEMQS
YLFATGLPFRDYDHYMDGYIGLLRDVIKDSAGIRRAGSASIDLAYTAAGR
FDGFFEYRLFPWDFAAGVLLVREAGGIVTGIAGSDDVFAHTSILAGSPLT
HPLLLEKARRHFGA
>CT0603 sun, sun protein
MTARELALRVLLELDGMRKSEELLNRMHEHAGLGKNDRALAKELVAGTLK
YRLQCDFIIARFYRHDYAKAATVLKHILRLGVYQLLRLDRVPKSAAVNES
VKLARKFKGDHLARLVNGLLRNISKATIDLDAWTAEMPESKRLSILYSFP
EWLAARWLMRYGPDATLAMLAHGNLPPATGYRINRLKANPETLLARPELS
DAKRVADADGLDRFFFSKQFALLEPLLKEGLVSVQNPAQGQACLMAAPEP
GSTVFDMCAAPGGKSTFMAELMENRGRIIALDRTPAKVARIASNASALGI
TIIEPREGDALTFDPGCAMDTILLDAPCTGTGVLGRRAELRWRTTPEKLR
ELAALQAAMLDRAASLLRPGGVLLYATCSVEPEENELQTEAFLKRHPEFI
AEASRLTLPGSHEGFDGGFAARFRKTEG
>CT2264 surA, peptidyl-prolyl cis-trans isomerase SurA
MKKVLFAVLAALMIAMNGFADAAASTGLDRIVAIVGNEIILASDVNEQEL
MLHLQYPETRKDPQLRKRILENMINQKIILTKAKIDTVKVDEKSVDDQAA
ARYSSLRAGFPSVSAMESRFGMPVNRLKQHIREDIRDQQMIEAFRRKNFH
EVTVSYDETMAFYNQEKGALPEAPETVSVSQIIKMPLVSEAARQAALDKI
KAVQQQLEAGGSFATLAREYSDDPGSREKGGDLGFTRKGELVPSFEEAAS
VLKPGQISGIVETRFGYHIIQLIDKEGDRIHTRHILALFDRSKTDIPATI
ALLKSIRKDVLSGKATFAEMAKKYSDDPASATNGGLITSGSGNPDLEVAT
LRPDLRKIIDGLKSKGDISQPEKIESDKSAPFYALFMLNSRTPAHVLTPE
HDFAQIQELALNHKSQELFNAWIEKLKKEVVVKVMSDI
>CT1557 surE, stationary-phase survival protein SurE
MTTKPQKPHILVCNDDGIEGLGLHALAASMKKLGSVTVVAPAEPQSGKSH
GMTLGEPLRIRRYQKNNRFFGYTVSGTPVDCIKVALSHILDAKPDLIVSG
INYGSNTAMNSLYSGTVAAAREGAIQNVPSLAFSLTTYENADFTYAAKFA
RQLAREVLRRGMPPDTILSANIPNVPEKEIRGILFTRQGRSRWEESTIER
HDMYGNPYYWLAGSLQLHDNDLAEDEYAVRHNYVAVTPITCDMTDHRFRS
ELETWGLQNTIKK
>CT0435 tadB, tadB protein
MWLVYLVWSRFFDPNNKSKNQRLKNIQTTIQWGGQAPSSLLTNLEEHELE
TWLRSRSRAFEKLVNLVQQSRSSFSAGSVLGLMLALFAVVLLAGLLTKTN
ILFLLVLAVAIASMPVMWLSRKAKKRRMAFEAKLPEALDYISRSLRAGHS
LSSAIGMIGKEFPDPLGGEFKTVFDEMNFGIPFKEAFAHLSNRIRSNDIS
FFVIALMIQHETGGNLAELLGGLATTIRERFKLRGKVRTLSSEGRISALV
LGSMPFVFATIISLINPRYILPLFNTPQGHTLLYIAGGLMLVGMYVLNNM
VKIKV
>CT0010 tal, transaldolase
MKFFIDTASLDEIKAANELGVLDGVTTNPSLIAKIVKDSTNFTYADFKAH
IAKICEIVDGPVSAEVTTLKAGEMIAQGEELAAIHKNIVVKCPLTVDGLK
AIKHFSSNGIKTNATLVFSPTQALLAAKAGADFVSPFVGRLDDISTSGME
LVRQIVTIYDNYGYLTEVIVASVRNPLHVVESAMVGADIATIPYSVIKQL
ANHPLTDKGLEKFMEDAAVMKP
>CT1397 tgt, tRNA-guanine transglycosylase
MKFSVLSTDRHSAARCGVLSTSHGDIPTPIFMPVGTRASVKSVEPNELKA
LDAKIILANTYHLYLKPGNDILFKAGGVHRFMNWDGPLLTDSGGYQVYSL
TDLRKISEEGVMFKSHLDGSKLHFTPENVVDTQRIIGSDIMMPLDECPPW
PAEKEYVQKSGELTLRWAERARKHFLSTRPHYGYEQFQFGITQGGTFDDL
RAHSSRALVDMDFDGYAVGGMAVGEPAEEMYRMLELSHTILPESKPRYLM
GVGTPANILNAIERGIDMFDCVIPTREGRNGRVYTRKGTINLRAGKYADD
FRPIDEGFDNHVCRNYSRAYIRHLLNVGEILGLKLCTMQNLSFYLWLTRT
AREHIAAGDFTDWKESFLMEFNSNDKN
>CT2084 thdF, thiophene and furan oxidation protein ThdF
MSPSDLHLPVPGHPIAAIATPVGVGALAIVRISGAGVLDLADRVFRKVHG
SGKLAEAAGYTAHFGRLYDGEEMVDEVIALVFRAPRSFTAEQMVEFTCHG
GPVVVGRVLRLMLDNGCRLAEPGEFTRRAFLNGRIDLLQAEAIGEMIHAR
TESAYRTAVSQMKGDLSVRLGGLREQLIRSCALIELELDFSEEDVEFQSR
DELTMQIETLRSEVNRLIDSYQHGRIVSEGVSTVIAGKPNAGKSTLLNTL
LGQERAIVSHMPGTTRDYIEECFIHDKTMFRLTDTAGLREAGEEIEHEGI
RRSRMKMAEADLILYLLDLGTERLDDELTEIRELKAAHPAAKFLTVANKL
DRAANADALIRAIADGTGTEVIGISALNGDGIDTLKQHMGDLVKNLDKLH
EASVLVTSLRHYEALRNASDALQNALELIAHESETELIAFELRAALDYVG
QITGKVVNEEVLNTIFDKFCIGK
>CT1442 thiC, thiamine biosynthesis protein ThiC
MNQENASCPKKHFFGPASSRITVKGTIYPIEVGMRRVALTRSYECKGERF
DAMPLYDTSGPFGDAEREHDVRKGLEPVRDRWGFDRGTVESVGGELSMTG
RKPRVAKAGEAVTQMHFARKGIVTPEMEYVAIRENQALEAWIEKCGGKPV
TPEMVRSEVARGRAIIPANINHPEIEPMIIGRNFRVKINANIGNSALGSS
IDEEVEKAVWSCRWGADTVMDLSTGKNIHQTREWILRNSPVPIGTVPLYQ
ALEKVGGKAEELSWEVYRDTLVEQAEQGVDYFTIHSGILAATLPDAEARQ
TGIVSRGGSIMARWCRAHKQENFLFTRFDDICDILRSYDVAISLGDALRP
GSIGDANDAAQFGELKTLGELTLRAWKRDVQVMIEGPGHVPLHMIRENME
MQLKHCHEAPFYTLGPLVTDVAAGYDHVNSAIGGTLIASLGCSMLCYVTP
KEHLGLPNRDDVREGVIVHRVAAHAADIAKGSATAWLRDELMSKARYAFA
WEDQFSLALDPLKTRQIHAQNIAATGDTSATAKYCTMCGPDFCSMKRSQE
TTAAGL
>CT1176 thiD, phosphomethylpyrimidine kinase
MMQNYITVLTIAGSDGSGGAGIQTDLKTIAANNCYGLSVITAVTAQNTTE
VRSIHNIPPAFIGEQFKTIVDDIRIDAVKIGMLGSLEAAETVVELMKSLN
EVPVVLDTVLRSSSGKSLLDAEALLVMKQLFHLTSLITPNLPEAAILTGR
SMAPTTQAEIEVMAKDLQREGAKSVLVKGGHGEGDQCNDCLLHEGQLFWY
SNPKIDTLNTHGTGCTLSSAIACGLAKGLPMNEAVAEAISYTRKALLAGA
SWRLGHGNGPLEHFPDRTVERRPGKLQ
>CT1175 thiE, thiamine-phosphate pyrophosphorylase
MTLPRRVLCVITDEHSNPVELARMALQGGAGMVQLRRKTASGQELYEWAI
RIQALCSEQQALFIVNDRVDIAMAVHADGVHLGQQDLPASAARALLAPDA
IIGVSVSNATEAIKAAEEGASYIGVGHIFPTFSKDKPSEPLGTASIRPIG
RAAQLPVIAIGGIGHDNAAEVIRAGASGIAVISAVSDSDDPETATRELVR
RIRQ
>CT0696 thiF, thiamin biosynthesis protein ThiF
MHVSLSDEQCQRYARHLALPEVGEAGQEKLLHSKVLVIGAGGLGSPAAFY
LAAAGVGTIGLMDGDTVDLSNLQRQILHTTASVGANKTASAQERLKALDP
SIRIETHPFRLRKENATEILARYDFVIDATDNFASRFLIARACHEASKPW
SHGGIRNFHGQTMTIIPGQTACYCCIFHEEDESKEAIPQGPIGALPGVIG
SIQAIEAIKYLLNIGTLLTDALMTFDALTMSFRKVAVRRNSRCALCG
>CT0698 thiG, ThiG protein
MDSLRLGTYTFSSRLILGTGKFSSTSAMIKAVRASGTQLVTVALRRFNRE
QAEDDLFGPLSEIEGLTLMPNTSGAATAKEAIKAAHIARELSGSPFIKVE
IHPNPHHLMPDPIETWEACKILAAEGFIVMPYIPADPVLAKRLEEVGCSS
VMPLGSAIGSGQGLSTAEMVKIIIRESSVPVIVDAGLRSPSEACAAMEMG
CEAVLVNSAVAVARDPAAMALAFAKAVEAGFEARNAGLMPRSGSAVATSP
LTSFLGATR
>CT0697 thiH, ThiH protein
MIALPAWLTDERLSEDIEPLLRQTDNESLERLAAEAQAVTLRRFGRVISL
YTPLYLSNFCSSGCVYCGFASDRRSPRRKLDTDEIEKELLAMKALGVSDV
LLLTGERTNSVGFDYLRRAVDIAARHMPRVAVEAFPMSVAEYRGLAECGC
TGLTIYQETYDPDHYRELHRWGPKQDFLERLETPERAITGGIRSVGIGAL
LGLSEPVGEALAVLRHARYLCKTYWKAGVTVSFPRIRPQEGGFQPSFTVS
DRFLARMIFAFRIGMPDVDLVLSTRESSNFRDGMAGLGITRMSIASRTTV
GGYVEKETAGASQFEVSDNRSVEAFCAALRAKDLEPVFKNWDAAYNNPLP
AEECT
>CT0199 thiL, thiamine-monophosphate kinase
MSYNPISEIGEFGLIDRLAKITAPSTKQRPELIEGIGDDCAVWQSDESVV
QVATTDILTEHVHFDLLTTPLHHLGSKAISVNVSDICAMNAMPDYAVISI
AVPPKMPVEMVEELYKGMNHAAEIYGLAIAGGDTSSSASGLFISVSMTGT
TTPELLTLRKGASPGDLICVTGTLGGSTAGLHLLQREKATMIEQMRNNEP
YNKEVMAELQEYTEAIRCHLLPKARIDIIDFFSEEGIVPTAMIDISDGLV
SDLGHLCRRSGVGAKIDESKLPVLPEARAVAEEFQQDVFDWALTGGEDYQ
LLFTVPKSQYDLIASHRDITVIGEISEKEEGMMLTDIFGMTIDMADMKRG
FDHFAG
>CT0699 thiS, thiamine biosynthesis protein ThiS
MAGAITLACTIRFEANAEHVYLSSGTKQTVYHYPFPMSTIHITLNGERKE
VPAGSTVSELLVIADADRQPVAVVVNEHIVRPDQRDSYILQERDQVEILV
FAGGG
>CT2034 thrB, homoserine kinase
MKTVTGFASATVGNVACGFDVLGFAITEPGDEVVLALHDERRSDCPVSIT
SIVGDGGALPLDPKKNTSSFVVLKFLEYIRTTKGISFDGHIDLVLKKNLP
LSSGMGSSAASAAAALIAANELFGSPCTKMELVHFAIEGERVACGSAHAD
NAAPAMLGNFILIRSYNPLDLITIKPPKNLFGTLVHPHTELKTSFARSVL
PKSIPLSTATQQWGNVGALIAGLLMEDYDLIGRALVDVVAEPKRAPLIPG
FNEVKQAALDAGALGCSIAGSGPSVFAFSSSRQTAEAVGSAMQSAFLHSR
AALQSDMWVSPICSQGARIISTTS
>CT2035 thrC, threonine synthase
MIFYSTTKASAPVTMKKATLEGLAPDGGLYVPSTMPHFSAEEIALLENGS
FNNIAFAVAKKFAGDEIPLDRLSELIDECFTFDTPLHELDPDTFVEELFH
GPTLAFKDYGARFLARMTGYFAAEESRLITVLVATSGDTGSAVAYGFHGI
PNTRVVLLYPSGKVSRLQEQQLTTAGDNVFALEVKGDFDDCQRLVKQAFV
DQSLRQKLTLTSANSINISRLIPQSFYYAWAALQLRQRKPDALPIFSVPS
GNYGNLTAGVMAKMMGFPIGHFIAASNANDSVTRYLDEGRYEPKPTVRTL
TTAMDVGNPSNFARLRYFFENDFRKMGQEITGIAVSDAETLETIRSVYEK
YGYVMDPHTAVGYRALDRFRQDHAGAGTPGVVLSTAHPVKFDEAIKEATG
NEVPLPETMEEIMSKPKKATLIGNRYEELARFLSELNTQTN
>CT2126 thrS, threonyl-tRNA synthetase
MSEITDDRQQVIITLPDGSERTYSSGVTGLEIAESIGKKLAEAALAFTID
GKPRDLDTPVTENARVSIITFDSPEGKEIFWHSSSHLMAHAIEELFPGAK
FGAGPAIEQGFYYDIACEHRFNEEDLRAIEAKMLEISKRNLSINREEMPR
EQAIEYFSNERKDPYKVEILEDTLKDVPTVSVYHQDGFADLCSGPHLPST
GKVKAVLLTNISSSYWRGDSSRETMQRIYGITFPSDKLLKEHIARLEEAR
KRDHRKLGAELGLFMLTPEVGSGLPIWLPNGAIIRNELETFLKAEQRKRG
YVPVYTPHIGNIELYKRSGHYPYYSDSQFPPLTYKDETGREEQYLLKPMN
CPHHHLIYSQTLRSYRDLPIRLTEFGTVYRHEQSGELNGLVRARGFTQDD
SHIYCRPDQLVDEICNAIDLTRFVFNTLGFDEVETRLSLHDPENQSKYGG
TAEVWGQAEKDVKEAADRMGIKYFIGIGEASFYGPKIDFIVRDAIGRKWQ
LGTVQVDYVMPERFDLSYIGSDGQKHRPVVIHRAPFGSMERFIGVLIEHT
AGNFPLWLAPVQAVVLPIAEEVHDYAREVYAKLHEAGIRTELDLRSEKIG
KKIREAELSKIPAMLVIGRNEQEKGEVSLRRHRKGDEGSFGVDELIARLC
EERDRRF
>CT1554 thyX, thymidylate synthase, flavin-dependent
MQVRLISVTNPLIEIDNRQLTPEGLMAYCARVSSPHQETPDYEKLLSYCI
QNKHWSVFEMVDMTVEITTSRAISPQILRHRSFCFQEFSQRYAKAQTTEK
YKPRRQDVKNRQNSLDDLDEATVKWFDEAQEKIAQLAFESYEEALEKGIA
KECARVLLPLATQTRLYMKGSIRSWIHYLEVRTDPATQKEHRDIAKAVQA
IFVEQFPVTSKALGWKYGC
>CT1934 tig, trigger factor
MQKNITNVSEIAQELEIILTAEEYQPEYDQQLEEARKSVRIKGFRQGHVP
VGMLKRIIGPSIEAEVAEKMASKYFAAIAEEEKINPASRAQIESYNYEDG
KLTIKISYEIHPEFELKDFSEYTFTQAEYTISDEDVDREIKLILRGHGTM
VTSEDAAAEGDTVIGDVTKLDADGADIEGSKNENHHFNLEYLPADNPFRM
ALEGKKAGDVVDVTVKPKEEGGETNRFRIEIKEVKHLELPELDDELVKEI
SQQRFEKVEDFRNDIRLQLQAHFSDKSEYDLLEAISSKLIEEHPVPTPSA
MVAHFQNILLENAKRQVGGQFPKGFDEREFFNAMKPNAEKHARWLLISQK
IAKENNLEVTDEDIKAFAEKEAEKEPSLTVDQLLNTYLSTEFKDYIIDTI
LKEKIYDVIKSKVTITKEATPVPAHN
>CT1614 tilS, tRNA(Ile)-lysidine synthetase
MNKTEKKFLENMHRQKLVANGDAVLLAVSGGPDSMALLHLFASVASVLHC
RLGVAHCNFMLRGDASDADESFVRDACAELGIDFHVRRFDTASVSSAWKK
SIEETARLLRYDFFGELCREASYTRIATGHHSDDNAETVLFNLFRGAGIS
GLRGIRVRHGAIIRPLLPFTRREIVAYLEEKRVVWRDDHTNEGIEYDRNF
IRNRVIPVIEERFAHKLMPSLQRISEHAGELEEFIDLHISRLLEAHPGLD
LAGGKLHVGTMRQLSMFERKEILKRALKLQGLSVGSNVLNRIAGLLDNQA
GRSVPAGAGVEVVLHDGFLRFRQTGNPSDHR
>CT1870 tkt, transketolase
MTVRVRPISLNLRNNPIMTRTRSIDEEAVATIRLLAVDMVEKAKSGHPGL
PLGAAPMAYTLFTKIMRFNPANPEWPNRDRFVLSAGHGSALLYSMLHLCG
YGLGMDELKQFRQLGSRTPGHPEYGHTPGVETTTGPLGQGIATAVGMAAA
ERFLATKLNTAERALIDHFTYVICGDGDLMEGISSEASSLAGHLRLGRLI
CLYDSNHISIEGSTGLAFTEDVARRYEAYGWHVLSHIDGNDLAAIEQAVR
NAQEIDDRPSLIIVNTTIGYGSPHKQGTASAHGEPLGPEETKLVKQAFGF
DENESFVVSDRVYDHFHAIAERGAALEAEWQTQWQSFRQEEPGLASALTD
LLAGRFPESWLPEMPKFTTGEKLATRQASKSVLAKIAEKAPLLAGGSADL
APSNGTLIGAAFEAGSYGGSTFHFGVREHAMGALVNGMALSKMMIPYGAT
FLVFADYMKPALRIAAMMQSPSIFIFTHDSIGLGEDGPTHQPIEQLAMLR
AMPGFDVYRPADANETAAAWLLALKRRKPAALVLTRQGLPVLDDADGRLR
NGVTKGGYVLAEWADTDDDDDRRIIIVATGSEVHPALEAKTLIEQEGFAA
RVVSMPSRELFAEQPAEYRAAVLPPTVRARAVVEAAATFGWHDIAGDGGI
VIGIDRFGMSAPSSQVMEHFGFTAENIAARALELLNRK
>CT1313 tmk, thymidylate kinase
MLISFEGIDGAGKSTQVMKLKRYLQERGREVLALREPGGTPVAEEIRELL
LERRNDITPVGELLLFAASRAELVQQVIQPALENDSDVILDRFFDSTTAY
QGYGRGLDLDMLAEINRIASCRLVPDVTFYLDLTPEDALMRKFSEKSLPL
AFESEELDRMENSGLDFYRRVREGYHKIGGENPNRIIIIDALLSPSEIHR
KIISSIDALCTKTA
>CT0055 topA, DNA topoisomerase I
MASKSATPSARNRTLIVVESPSKAKTINKYLGDDYTVYASVGHIKDLPKK
EIGIDFDNHYEPRYEVIAGKEKVVRQLKKLAGEADKVLIATDPDREGEAI
AWHISNEIDFAKKPVFRVLFNEITRNAIIEAIGHPQQIDYRLVRSQQTRQ
ALDKIVGYKISPFLWNVVYRGLSAGRVQSVVLRLICERETEIEAFTPQEY
WTIYADFTTENGETFRTKLVKVNGEKPDITSGEQAEAIASAIQGRLFAIS
EIVPKVIQRKQPLPFTTSLLQQAASNQLGFGSQKTMRTAQQLYEGIDLGS
EGATGLITYMRTDSTRISGEAVAQAHDYISQQFGPEYKGFGGQGKAGKNA
QDAHEAIRPTSVTRTPESLRRHLTPDQFRLYELIWKRFVASMMAPAKIEQ
TRVDVDDQKKEFVFRATGNKVLFPGFFRVYDDQQELDYEAQKSTREELEK
EQIVKLPERLSVDEQLKLAELDRRQSFTRPPARYTEASLVKELDNYGIGR
PSTYASIFSTLQDRRYVELQKKKIIPTELGRDVSAILVANFPDLFNVRFT
AEMEDELDKVASGDDEYEKVLDRFYRPLESVLSHRKSDPLIPQNRDAVLC
EKCGKGHMIVKWTTSGKFLGCSNYPKCKNIKAITTNKAKPKETGVRCPVC
GEGHMLLRNGRLGPFLACSNYPKCNTLLNLSKQRHIEPIKTPPITTDLAC
PKCGAPMYLRTGKRGLWLGCSKFPKCRGRLSWSTLDPETQARWEKAMNEH
MAAHPSLAITMLDGKPAPMNLPIEDIIARAEDAGLISATSEQPEAEATT
>CT1444 tpiA, triosephosphate isomerase
MVGNWKMNNTIAESVDLATAIAEKVGADGVQCEVGIAPTFPALYEVCKVI
EWSGIRLCAQNCHYESDGPFTGEVSTRMLAAAGCSYVILGHSERRQLFGE
TNATVNLKVKKALAEGLSVILCVGETLDERERGVTGQIVTAQVVEGLIDV
TDISKLVIAYEPVWAIGTGKTATKEQAQEVHALIRAKVTELYGQKAADHL
RIQYGGSVKPSNAAELFAMPDIDGGLIGGASLNADDFMAIVEAAG
>CT0754 tpx, thiol peroxidase
MIMATITLKGNSIHTAGELPAVGSQLPAFTLVKSDLSEVSPADFAGKKLV
LNIFPSLDTAVCAASVRRFNKEAGERGDAVVLCISADLPFAQGRFCTTEG
LDNVVPLSVYRSPEFGLDYGLTITDGPLKGLLSRAVIVTDASGKVLYAEQ
VPEIVQEPDYDKALAALA
>CT2088 treS, trehalose synthase
MPKKQAVTSYQPESLWYKDAIIYELHVKTFCDSDNDGIGDFRGLKSRLGY
LESLGVTAIWILPFYPSPLRDDGYDIADYKSVNPDYGTIEDFRDFLEEAH
RRGIKVITELVVNHTSDQHEWFKKARKAPKGSPERDFYVWSDDPTKYGEA
RIIFQDFEASNWTWDPVAGQYYWHRFFHHQPDLNFENPAVHQALFDVLDF
WLGMGVDGLRLDAVPYLYEAEGTNCENLPQTFEFLRKLRSYVDEHYPNRM
LLAEANQWPEDSAAYLGKGDMCHMNFHFPLMPRMYMALATEDRFPIIDIL
AQTPEIPESCQWASFLRNHDELTLEMVTDEERDYMRRVYAHDVKARINLG
IRRRLAPLMSNDRRRIELMNIMLLSLPGTPVLYYGDEIGMGDNYYLGDRD
GVRTPMQWNADRNAGFSRANPQRLLLPVIIDPEYHYEAVNVEVQESNPNS
LLWWMRHTIATARRFRAFSRGTIEFLNANNPKVLMFIRSYEDETILTVIN
LSRNAQVINVDLSAYEGCVPEEIFSMNRFPRVHKTPYMVALGPYGYFWLR
LIREESPADRSALLEKVALRAGSWQELFVGRSLDQFETAILPPYYKSVRW
FGGKARNIIRIRVTDTVPVAGMENTAFAMTEVNYPSGENERYQLPLTFVP
VERGNLADELFYRHAIARVELDGLEGFLIDASADETFRSRLLDLILHRET
WSGAAGKITADAGKMLENSCLSTAGEEPPASHLMGLEQSNTSIRYGEQLC
LKLYRKIDSGMAPEIEISRVLTDQTEYRNIPVYLGSFDYGKSYRERCSLG
ILQNFVPNESDGWQLSLGHVRRYYEDVLSRCSQGVLPPTLPELSGTTSKL
PELIHELIGEFYFQMVGKLAERTAGMHRALGSVDTDAAFSPEPFTTLYQR
SIYQAMTDQVKRSMIFLRESIHAVPKEARPLAHKLLDMQKEILEQFEPIR
KEKIDTVKIRIHGDYHLGQVLYTGNDFVILDFEGEPARSLSERKIKRSVY
RDLAGMLRSFDYAAFNVLMQNQAIRPEDRKALEPWAELWSYYVGQHFIDV
YTQQTEGSGLIPKEPRQRELLLRSYLMNKAIYELNYELNNRPEWLPIPMN
GILRLIRN
>CT1164 trmD, tRNA (Guanine-N1)-methyltransferase
MRIEIISVIPDFFASPLEKGLLGIARRKGLAEIHVHNLHDYGLGKYQQVD
DAPFGGGAGMVIRPEPVFACIEKLQAERSYDEVIFMTPDGQAFDQKRANR
LSRKGNLIILCGHYKAIDERIRRTLVTMELSVGDVVLSGGEIPALMVMDA
VLRLIPGVLGDGESALTDSFQNELLDCAWYTRPAEFRGMKVPDVLLSGHH
EKIELWRQENARERTLERRPEMLGKESEKE
>CT1855 trmU, tRNA(5-methylaminomethyl-2-thiouridylate)-methyl transferase
MSSPQHVIIGLSGGVDSAVAACLLIKQGYHVTGLNIRVLDTPEDTPTLAP
SAMRISDSEEFDFPVFTLNLSAKFARDVVGYFHDDYLAGRTPNPCMVCNK
AIKWFGLFEAMRLLRADLVATGHYARTELRDAVTRLLKGVDPEKDQSYFL
WMLTQAELAKTLFPLGGYTKAEVRELARSFGVHAAEKKESQEICFVPHDD
YCAYLANAIPGLEARVAGGEIVDQAGKVIGHHRGYPFYTIGQRRGLGVST
GEPVYVTEIDAEHNRIHVGSKADLECRSLIASGMNWIGIATPDKSFEAEA
RIRYRDRQSACMIEPMDDNRAWVSFREPKQGVACGQAVVFYDGDEVLGGG
IIAKVNPEAPPQKILG
>CT0535 trpA, tryptophan synthase, alpha subunit
MKENRITRLVKQDKKLLIAYYMPEFPVPGATLPVLEALQESGVDLIELGM
PYSDPIGDGSVIQDAAHKAISHGVHVGSIFELVRRARNGEGCKKITTPIL
LMGYCNPLIAYGGDCFMADAVKAGVDGLLIPDLPPEESEDFLERAKHFGL
SVIYLISPVTPPDRIELIDSMSTDFSYCLAVNATTGTGKLDVAGMDEKIA
EYLKRVRQHTKKKFVVGFGIKDRERVRKMWELADGAVVGSALLQHVATAK
TPEETAELAAGFWKSLR
>CT0521 trpB-1, tryptophan synthase, beta subunit
MKQKVIYSAPDEFGHFGTFGGKFIPETLVKNAADLEEEYLKAKNDPEFHQ
TLDNLLRHYVGRPTPLYHASRLSEKQGGAQIWLKREDLCHTGAHKINNAL
GQVLLAKRMGKKRIIAETGAGQHGVATATVCALFGLDCIVYMGEEDIRRQ
APNVARMKLLGTEVRPVTAGSRTLKDATSEAIRDWMNNPEETFYIVGSVI
GMHPYPMMVRDFQSVIGRETRQQVLDQAGRLPDVIVACVGGGSNAIGMFY
EFLPDAKEVELIGVEAAGEGLEGKHAASLTKGEIGVLHGSMMKLLQDEYG
QVQEAHSISAGLDYPGVGPEHCYLQKLGLVCYTSTTDKEALAALDALAKT
EGIICALESAHAVHYAMKRAAEMPKESIIVVNLSGRGDKDMGTIMQELKL
>CT0192 trpB-2, tryptophan synthase, beta subunit
MSTEPTKILLSEDEMPRQWYNIQADLPSPMPPPVGLDGNPIGPDALAKVF
PMNLIEQEVSTERWIDIPEEILGILKLWRPSPLYRARRLEAALGTPAKIY
YKNEGVSPAGSHKPNTAVAQAWYNREFGIKYLTTETGAGQWGSALAMSCK
LIGIECKVFMVRISFDQKPFRKIMMNTWGAECIPSPSPLTAVGRRILEED
PDTPGSLGIAISEAIEQAVERDDTRYALGSVLNHVMLHQTIIGLEARKQF
DKIGRYPDIVIGCAGGGSNFAGISFPFLYDKIHGKDVQVIATEPEACPTL
TRAPYAYDSGDVAMMTPLLPMHSLGHTFIPPAIHAGGLRYHGMAPLVSHT
KQLGLIEATALPQTECYEAALLFAHTEGFIPAPETSHAIAQTIREAKQAK
EEGKEKVILMNWSGHGLMDLQGYDAYMSGKISDYPLPEELLQRSIAASLE
GHPPVPGC
>CT1671 trpC, indole-3-glycerol phosphate synthase
MTYLTRILETKAREVAELKKLKPERRYREACGDLPATRDFRSAITSRDGG
INLIAEVKKASPSRGVLVEDFRPLDIAARYAELGASAFSVLTDSHYFQGS
PDYLKAITQQFSIPVLRKEFIIDESQIYETRLMGADAALLIVAALEPSQL
RDYLQLFAELGLHALVEVHDRRELDIAIEQGSTIVGVNNRDLRDFTVDLM
TSVNLKREYPEGVLSVAESGLKRRDDVLLMQDAGFDAVLIGEGLLASEEL
RQFSWG
>CT1609 trpD, anthranilate phosphoribosyltransferase
MRQQEILQKLLEGHDLSRQEMETCMNSIMENRFTDAGTGAILALLQKKGA
TPAEVIGACASIVSKSTPVTLDQQAVDTCGTGGDHTGTFNISTAAAFIAC
GAGIPIAKHGNRSITSKCGSADVLEALGYQVDLPPCATEELFRETGFAFL
FAPLYHPSMKAVAAIRKELGIKTIFNMLGPIINPAGVRRQLIGVFDPSVM
DIYAEVLLLNGCEHAMLVHGSTGNSMGLDEPSVCGPTSMIELHQGQIIRH
TVEPEDFGLGRWDIGELAGGDSHVNAQIIREILDGSAPQAKIDAALFASA
ITCYVSGKASCIDEGMSLSKGSLETCEALDKMNLIIKTNQRLAQKCASAT
N
>CT1448 trpE, anthranilate synthase component I
MIRSFSDNTADPAKEPSFILTPLVRTFQADTETPVSVYLKLQRPYSCLLE
SVEGEERMARYSYIAVDPVAVLKGTVGGEILLDVRDERFRQLSAIVEQER
DLRAVVDRCMAMFSSEKLPRHKSGSQQMSTSGVFGYFGYDTMHLIEKIPA
AEQPDPAGMPDLCLLFCDTLVIFDNVMRKLFLVTNYLDAADRTRADRKID
ELAALLQKPLVPALVPFQPEKPEPVISNTTRDEYYAKVLKAKEYILSGDI
FQVQISQRLKRKLHTRPFDVYRALRTINPSPYLYFFDFEDFHVVGSSPEL
LVKVERDHTGRRMVDTRPIAGTRRRGESFEEDERNAKELISDEKERAEHL
MLIDLSRNDIGRIAKIGTVETNEMMVIEKYSHVMHIVSNVRGELQDGLTA
MDAFWSCFPAGTLTGAPKVRAMEIIYELEKEKRGLYGGAVGYLDFRGQLN
TAIAIRTMVIRDGVIYFQAAGGIVADSTPEFEYEETMNKMRAGLTALESI
ETFV
>CT0718 trpF, N-(5'-phosphoribosyl)-anthranilate isomerase
MTKIKICGITRAQDALEAALAGADALGFNFSRKSPRRIDAETARSIIAGL
PPLVTPVGVFVEQSPEEINDICRHCGLLVAQLHSDDYDAEKTLQIKGVRV
IRVFRPSPGFEVSQVRKFTEKTGCRSFLFDAYSPAMAGGTGQSIEAQTAG
SLFDETRDFSWALLAGGLKPENVGDAVTLIRPWGVDTASGVESGPGIKDA
LKIRQFVEAVRKADRSLTNCC
>CT0013 trpS, tryptophanyl-tRNA synthetase
MSTQRILSGMRPTGKLHLGHYTGALENWIAQQNLLHPDGSRAYETCFLIA
DYHSLTTSLDTSSLYAHSIDMLVDWLAAGVDPEKSLVFRQSQVKEHAELF
LLFSMLITTARLERNPTLKEQVRDLNMDSLVYGHLGYPVLQAADILLYKG
NVVPVGEDQIPHVEITREIARKFNSHYQHPELGDVFPEPAPKITKFARLV
GLDGKAKMSKSLGNTILLSDAPEEVMAKMRPAVTDTQKVRRNDPGRPEVC
LVYSYHQKFTGESQLVEIETGCRSGALGCVDCKKMCAANISAELAPILER
RKHYEAQPELVKEILYEGESKARKIAGETMKEVREAMSLGESNA
>CT0978 truA, tRNA pseudouridine synthase A
MARSKRTIKMQIEYDGTGYSGWQRQPGDVVTVQGEIERVLERIMQEPVSI
DGAGRTDSGVHARRQVASFATCSPMPLGRLIYSANSLLPSTIRINAMRQA
PESFHARFSATSREYRYFLLEHPSAIDSRFAGCSHGKPDVGAMNRLALML
IGTHDFAAFSKETPDQYGTLCTVTAARWYRSGRFHVFRIEANRFLRSMVR
FLVAGMIEVGMGRLEEGAFARMLESGHRPPKLKPADAAGLFLWKVRY
>CT0243 truB, tRNA pseudouridine synthase B
MIDQGRISVLSEEGDYLLVDKPLDWTSFDVVAKIRGAYKRNGAKRKVGHC
GTLDPKATGLLILATGRKTKTISSLELLDKAYEGTIRLGAKTVSHDTESE
EYDLRDVPSLDERAIREAATSMIGERMQQPPMHSAVWHNGKRLYELARQG
HEVKERKARQIEIHQFEITGIELPYVHFYIRVSKGAYIRVIAHELGELLG
VGGYLKSLKRVAIGQYQLSDAMSVDAVVDEITRAASVIEE
>CT0785 trx-1, thioredoxin
MAQTLDDLIRTSELPVFIDFWADWCGPCKMVAPSVKQLASEFKGRLIVVK
VNVDQQPDAAARFQVQGIPALMLFVGGQLKWRTAGAIPYQQMRQEVLKAI
G
>CT0841 trx-2, thioredoxin
MSGKYFEATDQNFQAEILNSDKVALVDFWAAWCGPCMMLGPVIEELAGDY
EGKAIIAKLNVDENPNTAGQYGIRSIPTMLIIKGGKVVDQMVGALPKNMI
AKKLDEHIG
>CT0842 trxB, thioredoxin reductase
MDKDIRDVVIIGTGPAGYTSAIYTGRANLKPLVIEGPQPGGQLMITTDIE
NFPGFPEGIPGPELMGRMREQAARFGVEFQFGSITEVDVSRSPFSLMLDN
GQEILARTLIIATGANAKWLGIESEEKYRGRGVSACATCDGFFFRNCRVF
VVGGGDTAMEEALYLTKFASEVVLVHRREEFRASKIMSLRASKNEKITTM
LNQVVDEILGDDMKVTGIRLKNVKTGELTEHACDGVFIAIGHEPNAKLFK
GQLDMDDYGYILTKDHSTETSVKGVFACGDVQDFTYRQAVTAVGTGCMAA
IEAERFLESIR
>CT1780 tsf, translation elongation factor TS
MSQISAKDVKELRDTTGVGMMECKKALEETGGDMQKAVEYLRKKGAAMAA
KRADREASEGVVCILMSDDQKTGVILELNCETDFVARGEVFTGFANELAT
LALSNNCESREDLLGIKLGEAYGNETVEEALKSMTGKVGEKLELKRMARL
TAEAGVLESYIHPGSQLGALIAIDTDKPAEAKALAKDLAMQVAAAAPIEV
SRDAVSTELVEKEKEIYRQQALAEGKKEEFVDKIVMGRLNKYYQEVVLTE
QTFIKDQNTKVSGVLDDFMKKNQAQVKVKAFVRYQLGA
>CT2191 tuf, translation elongation factor TU
MAKESYKRDKPHVNIGTIGHVDHGKTTLTAAITSVLAKQGMATLREFSDI
DKAPEERERGITISTAHVEYQTAKRHYAHIDCPGHADYIKNMITGAAQMD
GAILVVAGTDGPMPQTREHILLARQVNVPALVVFLNKVDIADPELLELVE
MELRELLTEYGFPGDDIPIIKGSALKALEGDPEAEKQIMELMDAVDSYIP
QPVRDIDKPFLMPVEDVFSISGRGTVGTGRIERGRIKVGDEVEIVGIKPT
AKSVVTGIEMFQKTLDEGQAGDNAGLLLRGVDKNALERGMVIAKPGSITP
HTKFKAEVYILKKEEGGRHTPFFNGYRPQFYFRTTDVTGSVTLPEGVEMV
MPGDNLSIDVELIAPIAMEESLRFAIREGGRTVGAGSVTKIVE
>CT1424 typA, GTP-binding elongation factor family protein, typA subfamily
MSRKQNIRNIAIIAHVDHGKTTLVDSIFKQTGAFRENQHVDVRVMDSNPQ
ERERGITIFSKNAAVQHKGCKINIVDTPGHADFGGEVERILKMVDGVLLL
VDAFEGPMPQTKFVLRKALELHLKPIVVINKIDRPQADPEKVHDQVLDLF
IALGADEDQLDFPYIFASAKNGIAKCNMSDPDGDMSLLLDMIVKEIPAPE
ADDDAGFQMLVTSLDYSDYIGKIAIGRIQRGKVAPGNQLTLVTQDGVVGK
GTVTKLFLFDRTQRVEAMEATAGDIVALAGIAAANVGETLTTPDQPEPIE
SFEISKPTLSMLFSVNDSPFAGQEGKEVTSRKIRERLMKEIMTNVALNVE
ETDSADTFRVSGRGELHLSVLIETMRREGYELAISRPEVILREENGVTME
PVEHVTIDVPEEYTGVVIEKMGRRKAEMTNMSTLRGGMNRLEFEIPTRGL
IGYNLEFTTDTKGEGMMSHVFHNYQPYKGKLPSRETGALVSAETGVAVAY
AISSLEDRGTFFIGPNAKVYEGMVVGESTRDLDITVNICKTKKLTNMRAS
GSDDSIRLTPPKRLSLEQALEFINDDELLEVTPENIRIRKKILNADLRAK
ATKKAKAMA
>CT0084 tyrA, prephenate dehydrogenase
MQQGIHTISFVGLGLIGASLMQALKRAAGATGRNIEMIGFDPAFDAADIV
AITGECGLDRFEPDPAKLYNADLVVLCAPVVTNIALLDEAKRHIRKDTLV
SDVSSTKAEIAAKAQELGIEFIGMHPIAGREQQGYQAASPELLDGRLVIL
CTECATLETTLATELAGLLRAAGCKPLFMSPEEHDRVYANISHLPQLIST
ALMAHCRENVEWAGPGFASMARLAGSPWAVWRDIVETNRSNIADEMEAFS
ALLADVAGEVRGGNFEALESKFREANDLYQRLQERSSS
>CT0563 tyrS, tyrosyl-tRNA synthetase
MIFPSVKEQLDIIVNNTVEVISTDELERKLTKSLKNGTPLKIKLGADPSR
PDLHLGHSVVLRKLREFQDLGHEAILIIGDFTAMIGDPSGKSKTRPQLTA
EEARENGKSYFEQASKILDPEKTTICYNADWLSQMHFADVIRLSSHYTVA
RMLERDDFEKRYRSQTPISIHEFLYPLAQGMDSVHLKNDVELGGTDQKFN
LLVGRDLQREYGIEPQVCITMPLLVGTDGSEKMSKSLGNAICFNDTPEDM
YGRTLSIPDSLIETYWNLLVPHHSGNDKPIAERIAADPRETKRELARELV
AQYYSAEEAAKAQEHFDRVIVNKQAPTDLPTVEFEEATMPVVELLMALAA
FPSKNEARRMIQQGAVQAGNEKISDINAVIELTENPVIIRAGKRKFFKVA
RAKKSF
>CT0079 ugpQ, glycerophosphoryl diester phosphodiesterase
MTFEIQAHRGARAFYPDNTLQAFCKAADLGCRVIELDLNVSRDLRMVVSH
DPWVSCASGDNPSKHYLYAMNYDEIAQLDCGEASADFPFRHSVRAVRPIL
SEVFGTVEEQLRRAGRPGEMIYNLEVKSWPGLDGAAHPLPEEYAALVTRE
IAASELERRVRLQSFDDRILVAARNLAPTLCYGLLVEERAVFDRFPERPG
FVPEYVNPRLDLVDESLVSWLHGLGAKVVVWTVNHPEDMLRMKRFGADGI
ITDHPEVALHLSGLNGS
>CT0267 uppS, undecaprenyl diphosphate synthase
MPQWFTSKSDPEDTRIQETLKSSCVLPRHIGIIMDGNGRWAKIKGKSRIA
GHVAGVESVRDVVEASSQLGIENLTLFTFSIENWKRPKPEISALMKLLIK
VLRKEAVKLLENDIRLEVIGDMEMIPDDVRKTLDETIELTRKNRGMAMTI
ALSYSGKWDITQACRHIALQVKQGLLDPETIDENLFASYLSTASMPDPDL
LIRTSGEYRISNFMLWQNAYSEIIFSNTLWPDFRRNELYEAIREFQKRER
RFGKTSEQLKTNEVE
>CT1689 uvrA1, excinuclease ABC, subunit A
MDFSHISIRGARVHNLRNISLDIPRNKLVVITGISGSGKSSLAFDTIYAE
GQRRFMETLSPYARQFIGNIERPDVDFIEGLSPVISIDQKSTNRSPRSTV
GTVTEIHDFVRLLYAKAGRRYDPVTGQMLQKQSEESIREAIMVLPEGTKL
QILSPLVTGRKGHYRELFDRLIKKGYLRARIDGEYVEMSAGMQLERYKSH
NIELVIDRLVAGPGIGERLTQAVKLAISMSEHKSSVICDTDDPERGELYF
STQYAYADGTVPVDVLAPNQFSFNSPYGACPECNGLGEVMQLSAELMVPD
MTRSLEEGAIEPFGKPGKRNLWQVIRAIAKKYGFSLTTPVAEIPVEAFNI
LLYGSGSKTFDVGYSYAGKEHLYPQTFQGAVPYVEEVRMNASTSKLREWA
ESFMIRQTCPVCGGARLKQESLHVKVDGLNIAEVEALPLPEALEFFRALP
PKLTTKEKLVAAPVLHEITKRLDFLLNVGLGYLTLDRNSQTLSGGEAQRI
RLASQLGSQLSGVLYVLDEPSIGLHQRDNHKLIDSLRRLRDLGNTVLVVE
HDKDTMLQADQIIDLGPGAGEHGGLVVAQGTASTLGPESVTAGYLNGTLS
VSFPPSKKKKSEQRYLKLKGCRGNNLKQVDAEFPLGSLISVTGVSGSGKS
SLINETLYPLLARHFYRSKILALPCDGVEGIELIDKVVNVDQSPIGRTPR
SNPATYTGAFTFIRDFFTRLPEAQIRGYKAGRFSFNVKGGRCEVCQGAGT
RKIEMNFLPDVYVQCEHCKGKRYNRETLQVKYKGKSIADVLEMPIEEAAL
FFEDFPRIRRILDTMQSVGLGYLKLGQPSPLLSGGEAQRIKLSAELAKIQ
TGKTLYILDEPTTGLHFQDIQHLLAVLRKLVEKGNTVIVIEHNLDIVRNS
DWIIDLGPEGGNGGGHIVATGTPEEIARRSDTHTGHYLKLDAGDLLTDTD
>CT0527 uvrA2, excinuclease ABC, subunit A
MLGGVSTHNLKHVTVRIPRNRFVVLTGLSGSGKSSLAFDTLYAEGHRRYV
ESLSAYVRQFLERMPKPEIETVEGIAPAIAIEQKAIPKNPRSTVGTVSEI
YDYLRLLYARIGKIYSRDTGELVLKHTPEDVSLQARFFDEGTRFYVGYPF
PSHHDERHHDRTVADELKNLLKKGFFRLLKGESVIDISDDKERAGVEMMK
RAELSELLVLVDRFVAKKDEKVYSRVAQAAESCFAESGGQAVIRVAGGKE
FRFSDRLELNGVTYQEPSPQLFAFNSPIGACTSCQGFGRIAGIDEDAVVP
DKSLSLKEGAISCWNSEKYSWYRKQLLKIAPEVGIPVDVPYEKLGRVHRE
MLWKGIPERSFKGLWPFFEEIEKDAGYKVHYRVFLSRYRGYATCPECEGA
RLNPDALQVRVSGKNIGEVVRMTIADAQTFFASLDISPFDRSVADSVLRE
INKRLGYLMDVGLDYLTLDRLTHTLSGGEFQRINLSTSLGSPLVGAIYIL
DEPSIGLHQSDSARLVALLKRLRDLGNTVVVVEHDREIIEAADEVIDLGP
RAGRLGGEVVFHGPPSAMAGAEGSLTADYLSGRKQIAVPAERRKPDFSRC
IEIIGAMQNNLRNIDVRFPLGVMTCVTGVSGSGKSTLVNDILKLGIIRAK
EGSREKVGTHRAIKGVEFIEKVEHVDQSPIGKSSRSNPATYLKIFDDIRQ
LFASTRESKSRGWKPGYFSFNIPGGRCETCAGEGTVKIEMQFLADIEAVC
EECEGKRYKASTLEVKYNGLSISEVLDLTVEEAISFFGQEKSIQKKLQVL
QEVGLEYIRLGQSSSTLSGGEAQRLKLASFISRSDVSNTLFIFDEPTTGL
HFDDISKLVRCFEELMARGNTLVIIEHNPDIIKQADWIIDLGPGAGVRGG
DIVAVGTPEEIVACEASLTGRHLRGYL
>CT1546 uvrB, excinuclease ABC, subunit B
MRNVVHRGGEFKLESPYGPTGDQPSAIKALTDGVLRGDRWQTLLGVTGSG
KTFTVSNVIAQVNRPTLVLSHNKTLAAQLYGELKQFFPHNAVEYFISYYD
FYQPEAYIPSLDKYIAKDLKINDEIERLRLRATSALLSGRNDVIVVSSVS
CIYGLGSPEDWMAQIVELRQGMELDRDEFLQRLVALHYFRDDVDLSPGRF
RVRGDVIDLVPGHEELALRIEFFGSEIDSIHTFDPKSGEIIGRDEYAFIY
PARQFVADSEKLEVAMLAIENELAERLNALRAEEKLVEAQRLEERTRYDL
EMMKELGYCSGIENYARHIAGRKPGERPWCLLDYFPEDFLVVVDESHVTL
PQIRGMYGGDRSRKTVLVEHGFRLPSALDNRPLRFEEFEEMVPQVICVSA
TPSAHELMRSGGVVVEQLIRPTGLLDPQIEVHPVAGQIDHLLARIRERIA
KGQKSLVLTLTKRMSEDLHAYFRKLGLRSQYLHSEIKSLERMQILRELRA
GDIEVLVGVNLLREGLDLPEVALVAILDADKEGFLRDATSLMQIAGRAAR
NVEGLVLFYADKITDSMREVLDETERRRRIQREYNEKHGIEPRSIIKSVD
QVLNTTSVADAEERYRRKRLGLQKRPELELRGVLDSMSRSDVMLMVAEMN
AEMQKAAEQTDYEKAAYLRDEILMLQERIEQMTE
>CT1807 uvrC, excinuclease ABC, subunit C
MSDTAAAPDKTAKSALAEKLATLPTSPGVYRFSNAAGTVIYVGKARNLRN
RVRSYFNSQGRQPGKTAVMVSHIADLNVIITSSEVEALILENNLIKELKP
RYNVNLKDDKSYPWLVITNERFPRIFLTRQVRRDGSLYFGPYTEASQLRL
ILDLIGSIFPVRSCKYKLTEEAVASGRYRVCLDYHIHKCKGPCEGLQSEE
EYQAMIREIVTLLKGKTSALLRDLSAEMQKKAKELKFEEAAALKAQIEGL
KRYAERQKIVSTEAIDRDVFAVAAGEDDACGVVFRIREGKLIGSRHTYLS
NTGNTPLPNLLASFVEHYYLETPDLIPQELMLQAELPEEELEALRQLLSS
RQTERRQVRFTVPRIGEKAHLIAMCLDNAKHHLHEFMVQKKLRGEIARKS
PALESLKQVLHLGKLPERIECFDNSHFQGTDYVSSMVTFVSGKPKKSDYR
KFRLKSFEGSDDYAAMREAVTRRYGGSLAGELPLPDLVLIDGGKGQVNVA
WQVLQELGLDLPVAGLAKRLEEIYVPNEPDPYNLPKTSPALKLLQQLRDE
AHRFAITYHRKLRSDRTIRTELTGIKGVGEKSAEKLLKHFGSVESVSKAS
IDELSAVAGRKTAESIYRYFNAGDAP
>CT0343 uvrD, DNA helicase II
MWAANLLFPEALTDFLQGLNDVQRQAVLATEGPVMVLAGAGSGKTRVITY
RIAHLIRNVGVLPQNILALTFTNKAAGEMRQRIDSILEYGSASSLWIGTF
HSVFARLLRSYIHLIGYDRNFSIFDADDSKSLIKQCMKELNLSPETLPVN
LVHSAISKAKNSFVLPPEFHRKAIDDQSQKISLVYERYVHKLRENNALDF
DDLLIKPIELFTEQPRVLEQLQDYFKYLLIDEYQDTNRVQYLVAKMIAEK
HRNIFVVGDDAQSIYSWRGADISNILNFNDDYGDAQVFKLVDNYRSTKTI
LDAANSVISRNTHQIKKELVANAGMGEPITLIEAFNEKREAEKVAEHIKS
IRLSKGAQYRSFAVFYRTNAQSRVIEDMMRQNRIPYKIFGSVSFYKRKEI
KDAVAYMRLVLNDRDSESLLRVINFPPRKIGDVSINKLKEFAEERNISLY
EAVKRAGEGGFQARLVNALTQFTDLIEAMRKQAEEGTAYDVLSMLYETTG
LLSLLKEENTPEALARHENLQELLSMTRDFADHNPNSATLGDFLSTISLA
SDYDETQESDNYVSLMTVHAAKGLEFPVVFVTGMEEKLFPLNSYEPGELE
EERRLFYVAITRAREKLFLSWAQSRYMYGQPQMCMRSTFINEINPNIIVT
EGGRKLSESPKKVAATAMAGRPVFGSSLRPQPGGSPAAKVPTVKPAGSAR
PTGAAKSEYRPGTRVQHAIFGPGTVLEVQGSGAKQKVKINFRTAGEKTLM
VQYANLKIV
>CT1552 valS, valyl-tRNA synthetase
MSDHSAQNLEKTYNHHEVEERWRSAHWEAIGTFHAEHSRVLKEGATPYTV
LMPPPNVTGSLTLGHVLNHTLQDIFIRYARMMGKEALWLPGTDHAGIATQ
TVVEKKLRKEGVTRHDLGRRDFLDKVWEWREEYGGLILRQLRKLGISCDW
RRNLFTMDERASEAVINTFVALYREGLIYRGRRIINWCPVSQTALSDEEV
IMKSRRDKLVYISYPLAKDPTRSITIATVRPETILADVAIAVNPNDERYA
DLIGELVIVPIAGRHVPVIADDYVDIEFGTGALKITPAHDPNDYEVAKRH
NLPVFSVIGKDARMTDECGYAGMDRFDARDKIVADLAELGYLVKLEEYEH
NVGYSERADVVVEPYLSEQWFVKMQPLAEPALKVVNDGEIRFHPEHWINT
YRHWMENIQDWCISRQLWWGHRIPAWYDDKGNVWVASSYEEACHLAGTDK
LSQDEDVLDTWFSSWLWPLTTLGWTGPHSDNDDLRAFYPTDTLVTGPDII
FFWVARMIMAGLHFKGDVPFRDVYFTSIIRDMKGRKLSKSLGNSPDPLKV
IDTYGTDALRFTIVYIAPLGQDVLFGEEKCELGRNFATKIWNASRFVFMQ
REKLFATREEFVEAFANFTPQRELMSSAGRWLMSRYNAMLERYHQAMANF
KVNDMVKIVHEFFWGDYCDWYVEALKSELTGDITEERGRHAVCLAVSVLE
GVLKALHPVMPFITDEIWHAIAPRSAEETIATEAMPQPDASWRGEDAAAF
DLVRNMVSEIRSLRSAFNVPHDLRAQAVIRASSPAALVALQTGRAIFPAM
TKCEVELGESVERPAHSAASVVDGNELFIKLEGLISFEKEKQRLEKEITK
VTAYIESLEKKLSNEKFVSNAPADVVAKEKEKLEESRSMVLKLQGNLEVL
S
>CT0325 vpsC, vpsC protein
MRNLKELKWSDLKTGIFFLLGLGFAAYLGLVIGKNTSIFTGVTTIKVMTR
DVNGLAENNFVSVAGKKIGTVSKLDFSTQNDSLFVVAELKLQNEYANLVT
KDSKASIRSLGVLGDKYVDIKAGTGKPVKEGDFIQLVPEDGLSALTDNAK
STIEKLNTLLDKLNNGDGPAGRLISDKQMGEDLQKTVMNLRKSSDELTRV
SAELHNGKGLLPKLINDKSLADNTEQTIVNLKNAAAETETLMKQLNEGKG
TLGKLNSDPALYNNLSKTLISLDTLLIDLKKRPNRYVRFTLF
>CT0602 xerD, integrase/recombinase XerD
MPEREEPWRKTLETFLNYLTLERNFSGNTRASYLNDLGRYLAWLHECGVK
PEEAAPGDIRKFIQELHEIGLEASSIARNISAIRSFHKFLLTERLATMNP
AENIHQPKLARYLPSVLTIEEMATLLDAPLKRHPTSTFMLRDKAMLEFLY
ATGVRVSELLGLSRLNLHMDDGFVRVFGKGSKERLVPVGQTAISWMKRYL
DELRPGMMSATSHDTIFLNSRGGPLSRMAAWNIVREHAVIAGIEKPISPH
TFRHSFATHLLEGGADLRVVQEMLGHSSIIATQIYTHIDRSFIKEVHKTF
HPRG
>CT1524 xfp, xylulose-5-phosphate/fructose-6-phosphatephospho ketolase
MTEMTTPLSPRELDLMNAYWRAANYLSVGQIYLMDNPLLKEPLSKEHIKP
RLLGHWGTTPGLNFLYVHLNRIIRNRDLDIIYIAGPGHGGPALVANVWLE
GTYSEYYPDVSFDEAGMKRLFRQFSFPGGIPSHVAPATPGSIHEGGELGY
ALSHAYGAVFDNPDLVAACVIGDGEAETGPLATAWHSNKFLNPKRDGAVL
PVLHLNGYKIANPTVLARISHEELEQLMIGYGYKPYFVEGDDPATMHQMM
AATMDRCFDEIAEIQRRARVDGVTERPMWPMIVFRSPKGWTGPKVVDGKP
AEGSWRSHQVPFSTVRDNPEHMALLETWLKSYRAEELFTADGVLLPELQE
LAPRGKKRMGDIPHANGGLLLKELRMPDFREYGIDVPKPGSVEAEAPKPM
ARFLRDIMKMNEKAANFRVFGPDETASNRLGELFEETDRTWMAGMLPTDD
HLSRDGRVMEILSEHTCQGWLEGYLLTGRHGFFSCYEAFIHIIDSMFNQH
AKWLKVTGAEIPWRRPIASLNYFLTSHVWRQDHNGFSHQDPGFIDHVVNK
KSSVIRVYLPPDANSLLSVTNHCLRSRNYINVIVAGKQPAWQWLDMESAV
RHCTSGIGIWEWASNDANEGEPDVVMACAGDVPTLETLAAVKILRKLAPE
LKIRVVNVVDLMTLQPKEEHPHGLADRDFDDMFTTDKPIIFAYHGYPWLI
HRLTYRRTNHHNLHVRGYKEEGTTTTPFDMVVMNELDRFHLVADVANRVE
SLRPQAAYIKQYVRDRLIEHKEYITKYGEDMPEVRDWRWED
>CT1828 xseA, exodeoxyribonuclease, large subunit
MIKTAQSVGELTRAIKNELESLFPFVRVKGELSNVKQHSSGHVYLTLKDS
EAQIPAVIWKSVRARSPLELRDGLEVIAEGRLEVWPPAGRYQLICTALFE
TGEGEQRLALERLIAKLAKAGWFDAERKKPIPRIPRRIGMITSPTGAVIR
DMSDVFARRFPAAELLLYPVQVQGERAVESIVRALRYFNDPPKPEHKPDV
LIVARGGGSSEDLQAFNDEAVAEAIYRSAIPVISAVGHETDLSVADMVAD
LRAGTPSIAAERAVPDREELLRLIDNLVNRQSILMEGLISGAQLQVDSIT
SSYAFNRPVQMLGQFEERLALMEKEMKRAIAEKVRDREQQLTAAADRLAM
LDINRTLKRGFALVTQDDRYVTSAKSLEADAKIGLTFHDGQREARVTE
>CT2214 xseB, exodeoxyribonuclease, small subunit
MPASSKSRTSAVPTIEELIQRLEEITRNIENPDTGLENSIALYEEGMSLA
EECRKRLLETRKKLETINPAETARPAKPENAPESPRMNDLFGTES
>CT2136 xth, AP (apurinic or apyrimidinic) endonuclease, xth family
MNIFTWNINGIRARKEALAAWLDSRKPDIVVLEEIKAQVSEIPETIRDFA
GYRKFWNGSTFKKGYSGVGLLVRDGSLDEFTCEPPPFDIENRTLVLHAPQ
FTLIGTYVPRGDGEERYAVKLRYLADLKAFIAELLTEGREVILLGDINVA
LRDIDVHRSQNKPGAIGLRPEERAAIEAHLGLGLLDIVRELNPDKKDLFT
WWPNWKFARERNLGWRIDCIYLTPALARKVSGVSVDLDEKSSDHAPVSVK
LSL
>CT1559 ycf5, cytochrome c biogenesis protein
MTLVEFPAITFAAVVCWAVGSLLQIPAKNSKVSGPLSWLFLFSGTLVLVW
FLASYWIALDRPPLRTLGETRLWYATLIPVVGFLVEYRYKIGWLKYYCMG
LATVFLGINYMHPDVFEKALMPALQSPWFVPHVVVYLVGYVLLAASAVSG
WHNVFLNFRGKRSAKGESMAHYLALLGFVLLTFGLIFGALWAKEAWGHYW
TWDPKETWAFISWLAYLGYLHLVACKTDPRKLQWYLGLAFFVLLVCWFGV
NYLPSAQNSVHTYSGAS
>CT1873 zwf, glucose-6-phosphate 1-dehydrogenase
MSGTLDNFTIVIFGASSDLASRKLFPSIFQLARWGHMPESFRLIGVGRQE
QSHEEFRAFVRSRLLEHSPEAAGDAARLDAFCLRLFYARVDLDDPASYEV
LRDEIMREEKVGGSTCRNLMFYLSIPPSLAPAVVRNLGKAGLGGREQSCT
GWRKLIAEKPYGHDLESARELNRVINEVFEENQVYRIDHYLGKEPVQNIM
VFRFSNGIFEPLWNRQYIAQVQITIAEDFGIRDRGAYYEESGLLRDIIQN
HGLQLLTAIAMEPPVDLSADSIRDEKAKVLRSVRHFTPDSVRESVVIGQY
EGYRNEKNVAPDSKVETFAAVKFHVDNWRWSGVPFYMKAGKNLAKRVTEI
VITFRCPPQNYYGPSGGAACCTPNQVVLGIQPDETVAIRFGAKRPGEQMI
TDPVFMKFDYRDTFQGEGLTPYHRLLLDAIEGNQMNFIRKDSVEYSWEII
DSLKNSIDGQVPEQYPVHSWGPESSRIYG