TitleGenColors Logo

Gene list

Applied filters:

COG category: Cell wall/membrane biogenesis
Organism: Chlorobium chlorochromatii CaD3, CaD3
Gene type: CDS

Number of genes found: 156

Free access
Sort by:

 



# Chlorobium chlorochromatii CaD3, CaD3

>Cag_0650 exopolysaccharide biosynthesis protein
MQSVVKQQNITDFHCHILPSVDDGSPNIASSVEMARLLVQAGYKQVYCTS
HLIKGMYEVTNSELLHTRDMLQQELNKQDIALQLLFGREYYLDEFLLDFL
TEPLLFEGTNLLMVEIPNNTSADVVKNTLFAIARRGYTPMIAHPERCQLL
EITTYEQKSTKGWKKWFMGGKEDSTMPEHTNSLLRYLQQLGCMFQGNLGS
FKGLYGHRVKANARAFEQYGLYTHFGTDGHSPDMLRSLF
>Cag_1023 Glycosyltransferase-like
MFQQRKPRLLWANLYCLLDSSSGASISVREMLRQLAYNGYEVEVIGATIF
DAVSGMSALPPQWKKRLETTDILELNDAPLRHKLLMTNSHQRDAVTALEE
AKWYEFYLHTLNTFKPDVVWFYGGRPFDYLISDEAKHRGIPVAAYLVNGN
YTKTRWCRDVDCIITDTQATADYYHRKNGLTLTPVGKFIDPKMVVAAEHL
RRNVLFVNPTFEKGAALVVQIALQLEQLRPDIQLEVVESRGSWRGMVEYV
SARLGKPRTGLSNVQVMPHSRNMRPLYSRARMVLAPSLWWESGSRVLAEA
MLNAIPALVTDNGGNREMVGEGGIAIALPANYHAKPYIELLTSELLEQFV
AQIICCYDDEQFYQTLVAQATLYGCTTHHISTSTQKLLKVFGKLIASSSK
ELSYK
>Cag_1483 acyltransferase, HtrB/MsbB family
MVRRLAWMKLNKHKSKELAQKIVYYLIIFFGFIFRKLNYATTVQLAAWLG
DLFFNVVKIRRQLVLQNLAMVFPARREKEIRQLARQVYRNQAENLLDMLR
LSSMRTSEDASKLLTIDTTEFLAKTTHVKKGAVLVSAHFSNWELLARSIG
LLVTPLHVVVKRLRNSFVDQKINQWRAECGNHVIYKNAAFREGIRMLQKG
GVLSFLGDQSDPKGTFFMDFLGRRTSVFVGPAYIALKAEVPLFVVMGYRL
GNGGYKAELQEIDMNGLQATKQDAEELVRRYTAVVERYIYQYPAEWFWFH
DRWKRVG
>Cag_0463 surface antigen family protein
MKHSYSLLAVALCVATCTTNVQLANAAPKRAATQKTAPAKSAKAAEPLEP
QAALDTNLEAPLSPTSNQAEKVSVIKAIHFSGLQSINESELLNSLPLKVE
QRVALPGTELTNALNYLWKLQFFSDIRVEKEEVGKNGVTLTFHLTELPVL
DTITFRGNKKFDINELKSESNLVSSKKVSEQDVMTAANKLEKLYASKGYV
TARVAYQVEPTANNRVNVLFTVTEGNKVSIDKITFHGNNAFTQKKLRGIL
NETHQNSWWRSIFGSPKFDNEKFAADKELLLDFYRDNGYRDARILRETMS
YTKDNKGLLLDIYVDEGRRYHIGTITWSGNSKDFATTEVLQKTFRIKTGD
VYNAKLIGERLNFSQDNSDVSSLYLDRGYLSFRADLEEKVVHPDKVDLTI
SLREGELFEINTVNIKGNTKTKDHVIRRELYTVPGDMFSRKNVVRSIREI
SMLNYFDPEQIKPDVEPNQQNSTVDITYNVTEKQTDTFNASVGYSGVGFT
GALGVTFNNFSLKDLFNSEAYRPLPHGDGQKLSLQWQFGTSNYRTLSLNF
TEPWAFGTPTSVGFSAFKTHSSYDYTDDDTYNPTVIDQFGTTLSAGRRLS
WPDDYFAINWKLKYLHSKGGFLRFIDFNDPNAPEEADEISITQTISRNSI
DSPIYPRHGSKNSLTAQLAGGVLPGTVDFYKITGLSSWYIPVTKNLVWNI
STQHGVLSTFSETDYIPYTDYFYMGGSGMSSLPTTPLRGYEDRSIGTKLG
ASSTTDTSLYAGKVYSKFSTELRYPLTLSQSVSVYALTFVEGGNLWQKTS
EVNFADLKKSAGFGLRLYLPIIGQIGLDYGYGFDAVESEPEKTKQGWSFL
FSFGNSIE
>Cag_0225 NAD-dependent epimerase/dehydratase family protein/3-beta hydroxysteroid dehydrogenase/isomerase family protein
MKETILLTGSTGFIGQRLLHYLAEEKCHIKVLLRPESPQTALPFDCEIVR
GSFDDSQTLAKAVRGTTHIMHLAGVTKARDEDGYDAGNVMPLQNLLAAVR
HECPDLKRFLYVSSLAAAGPAPEGITGLTESDAPAPVSAYGRSKLRAETS
CHAQARHIPITIVRPPAVYGPGDKDVLQIFQMMAKGVLIGAGHPQKQRFS
LIYVDDLVTGMVQAMRAEKALNRTYYITSPTAYGWNELIAQAQPLLGFKK
LRQFTLPMPFLLGVAGLMGAIGELQGKAPLINRDKVNELVQNYWVCSGKQ
AQLDFGFTATTPLQEGLATTIAWYRKKGWL
>Cag_1114 DegT/DnrJ/EryC1/StrS family protein
MAGAELIGKEELAEIQELFSKEKVNLYRYGGGNYKAREFEEKFAAWMGVK
YAHAVSSGTAAIHCALAGAGIQPGDEVITTAWTFIAPVEAISALGATPVP
VEIDETYHLDPLEVEKAITPKTKAVVAIPMWAPPKMDELVALCNKHGLIL
IEDAAQALGASYKGRKLGTFGKVASFSFDAGKTLHTGEGGIIVTNDKEVY
DRAAEFSDHGHMHLPGLPRGKDPRRAKGLNLRLSEVTAAIGLAQLAKIEM
ILSQAKENKKKIKDAIRHLDNIVLRPFSDEAGAQGDTLIFRVRNREAALQ
FEAHLMEHGFGTKILPEALDWHFAGVWNHLLPEYERYQNVDLETLWTNTG
TMLRSSICLNIPVLMDDDTIKRLINAIVTGAEKIG
>Cag_0648 Periplasmic protein involved in polysaccharide export-like
MRQHRLFLHLFFVGSLALLLNGCASYRNLPAGSHQHITAKPNSTISREAV
VIADPNSSQDSSYKPSDYRVGPNDVLYVNVSGKPEFIGTAGGSSSFKGYR
VDGRGYIYLPIAGKVSVAGLPMYEVRQKIQEVMRRYFNDPWVVVEIAEYR
TRQIFVFGAVKKPGPQVIPSLGINIAQALASADVQNSGQNYKKIRIIRSL
SPTEGELLVVDFDSVLHGRSLPMQLQEGDIVYVPKSTLASWNTTISELLP
SLQAFSAVLQPFVNIKYLQD
>Cag_1039 Outer membrane protein and related peptidoglycan-associated (lipo)proteins-like
MKRLLRSLFIPATLLLGACCIEDDIVVAPAAVPAPVAVPAPKPTPAPAVV
VVPAPVAVVVVAPVPPPLPKPIVLPALVDVFFDFNKSELGTTRNEQLKRN
ANWIKAHPTSNIIIEGHCDERGTNEYNIALGERRANSAKDYMVTLGVDPA
RLTTVSYGEEKPVAIGSTEEAWAQNRRVHFIAE
>Cag_0729 Peptidase S41A, C-terminal protease
MPPCRVEAFWPLATSRAASATTLQPTALHDETGKYISQTLLQYHYRKPAT
NDSLSLQIFNRYLEQLDGSKSYFVASEVESLRKVYGTRFDDELLAGKSKS
GFGMYNFFLKRAKEKMRFMKATADTARFSFMQPEEFELDRKRTPFLPDRR
QLTALWRRELKYQWLTLKHSGEKNSSIRAELSKSYASRLSLLQRQTPNDA
FQSYMAAVTTSFDPHTSYLSPDDYTNFQIDMSRSLEGIGAKLQTEGQYTV
VGEIIPGGPAFKTGFVKKGDKIIAVGQGSSAPMVDVTGWRINDVVKQIRG
PKNSIVRLKILPASQGGVASTKVVQLVREKIDLQEQAARKSIIQQNGLKI
GVITIPSFYLDFEGQQKQATNYASTSRDVARIVEELQREELSGIILDLRD
NGGGSLEEAVNVTGLFITSGPVVQVSNASGGKSVVRDDDRRIFYSGPLAV
LVNRYSASASEIVAAAMQDYKRGIVIGERTFGKGTVQSIVKLTRPFHFFG
KAPEFGQLKLTVAKFYRISGGSTQHKGVVPDITMPSLIDTSSVGEDTYSS
SLPWSTISPALFRPIADVTPEHVTQLRQKQQVRIDTSRLYKTYMRDLATL
NRIRKKKSITLQDSSFKSDVETLRQIEKNWGESNELDSTHTKSGGKALER
DVLLQQSSAVMADFVELKTTERQTVIRAVPALN
>Cag_0050 UDP-N-acetylmuramyl-tripeptide synthetase
MTAMNCIPCCNNLPSLTVGAFITALQAVEVTEQAGACHMEHCITAVSSDS
RDMIEGALFVAVRGFCTDGHRFIESAIERGACIIVCEELPQSITHGCLYL
RVIDARKALAHVAALFYGNPAKQLRFIGVTGTNGKTTTSRIITEMLTAFA
VPSGYIGTNLCRIGKRDIPLERTTPEAHELHALFALMVEAGCKAVVMEVS
SHALLLQRVYGLRFDAALFTNLTQDHLDFHHTMQAYAEAKQLLFDQLQPD
GVAIVNTDDAYAPLMLQRVAPSQRVCCTLQANVPSVLQCPQYGQDFQAEV
RHASLAATEMELRFPKGETELLTVGLAGSYNVMNVLQAAATGYCLGLQPA
AICRALAAVHSVDGRMERVGRPNLPYQIFVDYAHTPDALQKALETLKALK
GEEARLMVLFGCGGNRDRLKRPIMGSIAASLADVVILTSDNPRDESPDAI
LDEIEQGMQGAAHQRIADRAEAIRTAISQLQAGDVLLIAGKGHERYQEVA
GKRTYFSDQEEVGRCL
>Cag_0093 heptosyltransferase
MQLPIQSILIIRLSSIGDIVLTTPLIRLLAAAYPHATLDYCTKLPFIPLL
AHNPRLSHLFTPELLPTTAYDLVVDLQNNRRSQLLVNSLKSRYHVAYHKE
NWKKWLYVHLKLNLYGNSWLHVVDRYRQAMDSFQPLADDSAGCELYPASE
ELEFAASVTAHAGKVLAICCGANHFTKRYPALHFATVVRLLMERHPSLTV
LLLGGKEDVPQVNAILEAVPSALRSQMVAFAGECSLMQSAALLARSDAVL
CNDTGLMHIASAFGKQLFVLFGSSVREFGFLPYHTPYELFEVSKLRCRPC
SHIGRSRCKKKHFRCMHALSPESIAQRIHLYFEKR
>Cag_1473 conserved hypothetical protein
MASKPSILLFSEDFPPNYGGIGQWAIGVAQSIHRMGYPLHLLTRYMNPEA
EMLQNREPYPVIQVHGKRWSQFRSFYTYSAIKNIYKKGIKPDIVIATTWN
IARGITRILKKNKTKLVIVVHGLEVTRTMPWLKTRWLQQTLNAADAVIAV
SNFTRDRVIERCNINPSKVHFLPNGVDPQRFFPRSNTTHLQEKYNLHNKK
VILTLARLQERKGHDKVIEALPTVLKEIPNAHYLISGALKGTYYKTLQQQ
VSNLRLNEHVTFTGFVDSADLNAFYNVCDVYIMPSRELEKKGDTEGFGIT
FLEANACEKAVIGGRSGGVADAIDDGKTGYLVNPLDSNEIAEKLIYLLSN
PELATQFGKQGRQRILTSYTWDAVTKKLLATIA
>Cag_0336 Secretion protein HlyD
MKQKSKLIVSAIALFVVALVAWFFLFKGNKGEEERYRDVQVVRGAISDVV
ATTGVVEPKNRLKIQSSIAGRIEEIVVSEGEMVRKGQVLALLSSTERAAL
LDAARLQGKSEEAYWKKVYKETAVLAPLDGQVIVRSIEPGQMVNGSDSLF
VLSDRLMVKAFVDETDIGRVKVGQQATIQLDAYPDISVRGKVEHIDFESR
LQNNVTMYYVDIIPEQIPSVFRSGMSATITIIVKQKPNALLVPLEAVQQR
NGQSVVLQRNNASSSSAKVRYCAVQTGLRNEQMVEIVAGLGEQDAVLLPD
TAFALPSKKGGTNPFRPQRSPNRP
>Cag_0053 UDP-N-acetylmuramoylalanine-D-glutamate ligase
MDVAGKKVTVLGGGKSGVAAALLLQQLGATVLLSEHGALSSEAMQRLQAA
HIAYEANGHSEQIYSADFCVLSPGIPPTAPVVQQMEAHSIPLYSEIEVAS
CFCKARMVGITGTDGKTTTSTLIHTLCEADGKRHGYRSYSVGNSGIPFSS
MVLAMQPNDVAVIELSSYQLERSISFHPQVSLITNITPDHLDRYGQNMQR
YAEAKYRIFMNQQAGDTFIYNQDDSMLQAAFGASQIAVPCRSVAFGLEPL
TNVQLDKRRVLVNGNMVVVRQNDGALQPIVAVDEVLNRAFRGKHNLSNVL
AAVAVGEALGIGSEVMRQALTAFGGVEHRQELVATIDGVEWINDSKATNV
NAMRQALEAVPAPMILIAGGRDKGNNYATVSHLIERKVCLLIATGESREK
LASFFKGKVPVIAVPTIDEAVAIAHQQAKAGESVLFSPACASFDMFNNFE
ERGAFFKQCVRQVL
>Cag_0057 FtsQ protein, putative
MPDDEYQEYYDPEEGELVEEEVLEEPLPAPTSGGGSFVLIVVALVLLLVA
GLASVALQWKQKVVVRNFIVEGESVLKEQEILAPIEFAKGHNLQLLEVGV
LKSQLLALPYVHDVVVRKEFNGTIRLRLHEREPVALTVHNGHIMVIDREG
FLLPWRNTVAQRYPKLLTVYGTERYAKSERGLQRLHERDVAVILEFIAAL
AESDYASLLIRELHLDATNTTWSKASQSSSHFIFGNDGRFNEKLKNFEIF
WQKVISKKGFTFYNIVDLRFKDRVFTIPSVISPSPQEITPL
>Cag_0371 putative ABC transporter, integral membrane protein
MIKTLLSIAMRHLVGRRRQTLTTIAGVAISTMVLITTNSLTRGLLDSFVE
TIVNVAPHIIVKGEKINPMPINLFSGKQNNAIAIVEDNIQKQEREEVQNY
RQIIALLDTPAYASRVTAISPYVQSQVMAVKGSRNEALIIKGVDINQEDK
ITGIRKKLVAGDVAQFEKNATALLVGRTVARDMNIELNDQVVIIPASGKS
WQCKVAGIFFSGVNAVDNSVLVSLKFGQIIEGLPDNKVSGIALTVKDPFN
NKPLATELEQLSGYRCLTWQEENANILSLFSRIGYIVFSLVAFVGVVSGF
GVANILVTTIFEKSRDIAIMKSLGFSARQLVGMFVVEGFLVGFAGALAGG
VLALGAITIFASIPVESSQGPITKTGFSMSYNPLYFFIIIGITVLISTIA
ALLPSSRAARLEPVSVLRDSSV
>Cag_0652 GDP-mannose 4,6-dehydratase
MHKVINNVFNCLIMSSHSSFSNNKVALITGITGQDGAYLAELLLGKGYIV
HGIKRRASSFNTQRIDHLYKDPHDIQKNVADGTQHSALYLHHGDLTDSSS
LIRIIQQTQPDEIYNLAAQSHVAVSFEEPEYTANSDALGALRILEAIRIL
GLEKKTRFYQASTSELYGLVQEVPQKETTPFYPRSPYAVAKLYAYWITVN
YREAYGIYACNGILFNHESPVRGETFVTRKITRALARIKLGLQQCLYLGN
LEAKRDWGHARDYVEMQWLMLQQEQPEDFVIATGIQYSVRDFVNAAAKEL
GMAIRWEGEGVDEKGYWAVEAYSDMPQQIVPIIEVDPRYFRPTEVETLLG
DATKAKERLGWVPKTTFDELVAEMVREDLRSAERDELVKQSGFKVLDFNE
>Cag_1684 Rare lipoprotein A
MKHEKHFFRLSYCVIATALVAFSSSVFSLPSEAATRRSAATYRSKALSEG
TASFYSTQFHGRKTANGETFNMNQLTAAHPSLPFGTLVKVTNMDNGKNVV
VRINDRGPYVKGRIIDLSKSAAIKIGILKEGVAQVKVEPVKPTINTQVAG
>Cag_0947 D-alanine-D-alanine ligase and related ATP-grasp enzymes-like
MNNSVGFRIECKSTHIVSGYLLGMTQPSLVAQLQFGETISYKALIDRLVL
CLSNYLPPQQLKELSFAQNDAQSFAKVIVLIVTGLQESVGLPVLGIAKVM
NQSNKALKELEKPVYLFQLFFPSFEPQAAKLLLEWLLNTLNHLKEKQTAL
SETQHKALQQLFQKLRVMAPGGTNNIRFIRAAHTLGIPLLSLPGGVFQYG
WGCNARWFDSSITDATPAIGVKLARNKLVTNALLKIGGLPVPEGRRVSSL
EDALLYAKQLGYPVVIKPADLDQGAGVYADLRNSDEVREAFAHARKLSPT
IMLERHSVGKVFRITVFRGEAVAVVERLPAGVLGDGVSTVQALVEKVNKD
PRRSKTSFSLMKPIVIDNEAQMMLDREQMSLNTVPLAGQFIALRRAANVS
TGGDVILLSPHEYDASYANLARRAAALLRLDVAAVDFIAHDIGKPWQQAF
ATVIEVNAQPQMGGVRTNLNKQLLASYVHGDGTIPAVMVLGADSATIVRQ
VRERYGNELSGLGSVSSDGVFIGNNAIGNGGKNIITEIQALIVDSTITAL
LFAGEHNNLLSQGLPLPFCHYLILSDCSSEVNDIARQLDIFHKHIKNEVW
LVQGHVLQGHVERLFGQDKIRLFVSIMEIVTAIKQTFDVNIMSAKIN
>Cag_0056 UDP-N-acetylmuramate--alanine ligase
MELGKTQRVHIVGIGGAGMSAIAELLLKSGFSVSGSDLSTGDVTDRLTAH
GAVIYKGHQEGQVADSDVVVYSSAIRSEENVELRAALKAGIPVIKRDEML
GELMRYKSGICISGTHGKTTTTAMIATMLLEAGESPTVMIGGISDYLKGS
TVVGEGKYMVIEADEYDRAFLKLTPTIAILNSLESEHMDTYGTLEELKQA
FITFANKVPFYGRVICCVDWAEIRKIIPSLNRRYITFGIEEPADVMATDI
VLLEGSTTFTIRAFGIEYPNVRIHVPGKHNVLNALAAFSTGLELGISPER
LIAGLGCYSGMRRRFQVKYSGNNGLMVVDDYAHHPSEVKATVKAAKDGWQ
HSKVVAVFQPHLFSRTRDFADEYGWALSRADEVYIADIYPAREKAADHPG
VTGELVANAVRKAGGKQVHFVNGMEELYTALQTHVAPQTLLLCMGAGDIT
HLATKVAVFCKEHNADH
>Cag_0335 FusA/NodT family protein
MTTKNFIKQVQIFSMKQYIASTLLLFLLIGNAPSVYAEVLSWEQCVAEAR
RAHPSLVQANAIVQQASANRRIVGSSRLPNVALALNAQQQGSSDGTSTDH
IGSSLSLHQLLYDGSKTSKQLSGADEALRAAEAAAQLTNAEVRYQLRSAF
VALLKAQELVELTNEIAERRQKNLRLIRLRYNGGREHIGSLRQAEADVAE
ATFEVEQAKRELTLAQRHLALALGRQKSVALRVQGSLQAAPFSLKKPDIE
QLLTIHPATQQAAAQSRAARYELEASRSAFSPTLALTSSLGRTAASYFPL
ESVDWQAGLSLAVPIYSGGEGKARVAKARAYALEQQAAAQAKVLQVTGAL
EAAWTRLQDAEQAIAVRRRFVEAANERATIASAQYSNGLLGFNEWMIIED
NLVNAKKRLLEASAALFVAEAQWLEAQGGGLNEAEK
>Cag_0051 UDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate-D-alanyl-D-alanyl ligase
MKATLQRNDLEAVGELVFHGEAPPSFELAEPHVVIDSREVSEGGLFVALH
GERTDGHRYVNDVFQHGATWAMVNRSWYEAEGHPLPPHHKGFLVVEDTVA
GLQHLAVRYRNTFSIPIVAIGGSNGKTTTKEMVAAVLASDSSAVSMSQGN
RNNHLGVPLTLLQMRHSTERAVVEVGINHPNEMAMLAELVAPTHLLLTNI
GHEHLEFLGDLDGVAKAETQLYDYARQHGATAFINADDERLRAAAEGMPF
RIDYSLHEAVDSLVWAEDVTVERDGRLSFLLVTKGRSEQERLRLHFTGRH
NVLNAVAAATVGLQFGISLHHICEGLAGLQPAPGWKRLEVVEVGGVRLLN
DTYNANSDSMRRAIDALCDMPCNGRRIAVVGDMLELGDAAEVEHQAVAHY
IQRSLVTKLFTFGTQAAAICRHAPELCYGSYSEHSALLDDLLHVLSEGDV
VLVKGSRGMRLELIVDGVVHALQPKS
>Cag_1084 ADP-L-glycero-D-manno-heptose-6-epimerase
MIIITGGAGFIGSAMLWELNAHGEEDIIIVDELGSTTTQQWRNLSGLRYS
DYIHKNDFIPLLERNALKGITAIIHMGANSSTTETDADHLMSNNFGYSKS
IATYCMEHHIRLIYASSAATYGDGANGYSDDSNGITTLRPLNMYGYSKQL
FDWWALQQGVLNYAVGLKFFNVYGPNEYHKGDMSSVVYKAFHQIHQNGSV
KLFQSHRPDYGHGEQSRDFIYVRDCTAIMLWLLEHPLGGLFNVGSGVARS
FRELVTATFAALGKEPAIEYIPMPETLRDKYQYYTCATMEKLRQAGYTAP
FTSLEEGVRDYVQTYLSATSPYLGKG
>Cag_0146 Monofunctional biosynthetic peptidoglycan transglycosylase
MVLAAILLFLVIDIGRYAVYPNIGRLVDENPTKTAFMEYREAEWQREGLE
DKRIRQRWVPLKQVSPNLIKAVLIAEDDKFWKHEGFDYKAMEHALEKNIR
TKKISMGGSTISQQLAKNLFLSPSKNPIRKIKEAILTWRMENTLSKRRIL
ELYVNVAEWGDGIFGIGEASRHYYGVAPSQLSARQASRLAAVLPNPIRYS
PIGSARYVRNRSNIIHAIMVRRGIVLPDYNEVMELPMDSTAVDSVNIGIP
FNLLEQAINADSTLSATPSVPATEVVKQEEKSEVGASVEGNGKGVGAP
>Cag_0649 Uncharacterized protein involved in exopolysaccharide biosynthesis-like
MIEAQDQQQQFFQEQEIHLADYLNILLRRRKVFIATFLGIFIGVFLVTFL
MQPVYQASSTVYVKKDSGKMGINELVMPGGDNSIQAEMEVIKSRTIAEQV
VQKLNLDWSIKSSSSSSFCRILSMSVDPSLKELKLILKDNSHYEIQNTKG
KIIGKGRNRVPVALPFGVLTVAFSGEEGDVFELQRQPLYKAVAALKSAIK
VKEVGRLTNVVEISYEHTNAVLARDVVNSLVQAYLDQSLAFKTQEAGKSV
SFIEEQLQGIRTDLNKAETNLQEYKSSSGVVLLDAEAQELIKKFSSLEQA
RVGVTLQKKQLEFALAAQQESMRTGKPYSPAVMKDDPLVASMAQQLANLE
VQKRALMVDYTSNHPSVVNVQAQIDEIQNKIRSTYKTGLNNLARQEVDIA
QRLAMYEGELRGIPLEERDLARYTRIAKVTGDIYTFLLQKHEEARIAKAS
TISNINIIDTAIIPATPIRPEKLKYLLIGFALSFIAAIALALLVDYFDDT
VKDENQAKNLLGYPYLVTIPYIGKRENGVHKDVQKSDEKDELSFIAYSQQ
KSIAAEAFRSLRTAVHFSALKQKHKITVFTSTFSGEGKSTISTNLAATIA
QTGERTLLIDCDFHKSSLYKRIGMKQVPGVTEVLAGDKPLSEALQQTVVH
NLMFLATGTPPPNPSVILGSNEMKDLIQLLKNDFDQIVIDAPPTLPVSDS
VLLTSIADVVLVVMEAGAIPAKAVTRLGEILKSAKAPVAGFIFNNKSLKG
GNGYGYGYGYGYGYGYGYGYGYGYGHEDENKKKSSFLALLQPVISKMSNI
KFRGKM
>Cag_0193 mannose-1-phosphate guanylyltransferase, putative
MKAFVLAAGLGTRLRPLTNHSPKPLMPVLNVPSLFYTLFLLKEAGIGEVI
CNIHYHAAQMRSVVEAHNLAGLQITFSEEPEILGTGGGLKKCEALLEGED
FVLVNSDIITSIDFRALIERHRISGNLGTLALYETPDAASIGWVGVEDGL
VKDFRNQRHTNLFSSFIYTGTAVLSPDIFRYMHAEYSSIVDTGFAALLER
NSLGYYEHCGLWMDVGTLPHYWQANLDATGAIRRTFGAMQQSFGVAPHVV
APSATISPAATVENSVVGADCVIPDGCTVRNSVLLSGVVLQPNTPLCNAI
ADSQTVTILEPSSLL
>Cag_1016 conserved hypothetical protein
MKPFSGTVLVAGATGRTGAWVVKRLQHHAFDYRLFVRSGEKALELFGAEV
IDKLTIGSIENTEDIRAAVRHADALICAIGGNAGDPTAPPPSAIDRDGVM
RLAQLAKAEGVRHFILISSLAVTRPDHPLNKYGQVLTMKLAGEDEVRRLF
SEAGYCYTIIRPGGLLDGAPMEHALISGTGDQITTGVIQRGDVAEIALLS
LINPQAINLTFEIIQGEEAPQQSLDAYFPQA
>Cag_0384 conserved hypothetical protein
MKKVLVAGATGYLGRYAVEAFKKRGYWVRALVRNLDKAKQPGPYFAPEIA
SLADEIVVGDATLPATIATVCDGIDVVFSSLGMIKPDFVHTIFEVDYQAN
MNLLDLALKAKVKKFIYVSVYDAHRMMNIPNVQAHEKFVRELKAAKIDST
IIRPNGFFSEIGQFVARAHKGFMLLVGDGYQRSNPIHGADLAEVCVDAVD
RSDKEIGVGGPEIFTYQEMMDLAIEIAQNQPFIFPLPLWAADTLVAATGL
VNRDVHDVALFATTLSRIDVVSPEYGTHRLRDFFMQCKAAL
>Cag_1164 penicillin-binding protein 2
MQGIKALVIAVFALLFVRLLYLQVINFQEPGSVSASNSVRRIWIHSPRGR
MIDRNGSILVDNQPLYSVRIIPAEFNEAKLGYLAWLLELPKEEVAAALAK
GRSYSRFAAATITRNVNEITIARLSENLWQLPGVIVTADNKRLYSDSLRG
AHLFGYLRSIPKEKMEELSEKGYSQDDKIGFSGLEKIYEERLKGQKGARY
EMVTPLGKYAGKYDKGNSDIAAVRGDDLYLSIDGGLQMLAEKLLTRTGKS
GAVVALDPNDGGVLALASAPDYSLNTFNGFTDPKGWREIITSPQKPLFNR
TVQAVYPPGSVYKMVLAMAGMEEKLVDPERHIYDGGVFIYGGRRFLSHGG
RGHGSVNMRNAIAVSSNVYFYTIIFNVGFENWTKYGDMFGFGRKTDIDLP
GERKGLLPSAEYYDKRYGKKGWTKGYLVSLSIGQGELGTTPVQLAAYTAA
IANKGTLFQPHIVNGYRDTQTGRYIPVPYQQTKLPISPETFALMHDGMRG
VVLQGTGTLANVPGVAVAGKTGTAQNTHGKDHAWFIAFAPVEQPKIALAV
LVENAGFGGSISAPIARELIHYYINLRNRRPTATRGDVKVNISNDAAADS
SSAAASTNGEPQDEPSPNSEIDKDKGNSTSPSVQKSANDTQSESLPE
>Cag_1166 rod shape-determining protein MreC, putative
MQKFFTLLIKYNAYLLLALYCSIALLVIKLQEEEIFVELRNNGLEFSAAV
NEQFMSIAYFLHLSHENSRLMRLNTDLLNKILHYDNILLEERRYKALFSD
STFNASPYIKAQVVDRKFSATDNMLIIDSGWRHGVKKDMAVLVPQGLVGR
VVAVSENYAKVMPVIHPDFRVCVVADSSNCSGILVWNGGREWIANVDHIP
ISSRLRLNEQFRTADFSTFALRGIPVGRIVRIVPDKLFYTVDVRLAVDFS
ALHYVLVAPQKVEAEKVRIVSDSSAALLRTPQL
>Cag_1781 lipoprotein releasing system
MNFTDHLRIAFVHLRERKRQTILAALGVAVGSAMLITTIAIARGSSDSVI
AKVIDTAPHVLITAERVTPLVPDNLVPRSKQHITMLTKNITPDEPEIIKN
YSEVVQRISSIKELESVSPFVVSKLLARNKTRFTPCIARGVVPELEAEIA
GLKKNVLEPTALEELASTPNGIIVGELLAKKLALRYRDRMVLVTKKSEEF
PVTIVGRFSSGFNRKDESEAYINLALAQRMEGISSNSVSGIGLRTTSVDK
ASITADEVEKLSGYKSESWDETNRNVIEFYNRNALITLVLVGFVFIVAGL
GVSSVMTTVVLQKIKDIAILRSMGMMAKSITRIFMLEGLMIGILGVLVGS
PAGHIICHLIGTIRFEASTAGSIKSDRLTVSESPEVHLIVIVFGILIAVL
SSLSPARKATRYVPVNILRGNIGG
>Cag_1204 Secretion protein HlyD
MNNIVKPVLFIALCSSLTLSGCGDKPEMGRDGAKNQEQPALVVQQLQPSD
AVVVSRYPALLEGKVTVEIRPQVDGVLRSILIDEGAFVKAGQPLFAIDDA
VYRAQYQGALASQHAAEARVVAAKLEVERLQPLVQNQVIAEVQLETARAN
YKAAQAAFDVAVAATRSAKVNLDYTVIKAPINGFVGKITLRQGSLVTKNQ
AAWLTMLSDVSEVYASFSISENELLRFRQQYGQSDGRLGSGSNNVPATLV
LADGTRYPQQGRLATLSGQFDALTGAMRVRAIFANPQALLRSGGTGSVEL
SSSYNNVILVPQSATVEMQDKVFVMRLMQGNKVQKQAITIAAKSGNNYVV
TGGIQAGDTIVLVGADRLQDGMVITPRYETSPAQSATINSAVKRP
>Cag_0345 dolichol-phosphate mannosyltransferase
MFEQLKFSREIVASMLYRVEGIFTNGRAAALIIIPTYNESDNIRRLLEEL
TCCYAGIADILVIDDNSPDGTADCVRALQNTKGSLALLVRDAKLGLGTAY
ITGFSYALQHGYQYVIEMDADYSHDPASVVDLLTASSSADLVIGSRYVNN
TVNVVNWPLSRLILSKMASLYTRLITGLPIADPTSGFKCFRAEVLRSIAF
EHVQSQGYSFQIEMNVRAWKKGFVLKEIPIVFVDRTVGKSKMSRNNIREA
IWMVWWLKVQALLGRL
>Cag_0515 dTDP-4-dehydrorhamnose reductase
MNILVTGSRGQLGSELQALSVRYPQHSFFFYDLPELDITNSEQINHICNA
HHIEVIINAAAYTAVDKAESDAETAFRVNSDGAALLATYAKENHALLLHI
STDYVFDGTSSVPYKESDPATPLGVYGRSKWEGEERIRAINPSHLIIRTS
WLYSMYGANFIKTMLRLGGERSEVRVVFDQVGTPTWAADLAEALLSMLSS
IYKGKHYSATYHYSNEGVASWYDVASAVMEMSNLSCKVLPIESHDYPVPA
PRPHYSVFNKAALKSDWNISISHWRTSLAAMLHSMKTSHHS
>Cag_0793 3-beta hydroxysteroid dehydrogenase/isomerase family protein
MSKNILVTGATGFIGSNLVRKLVTTTEHRISILVRKNSDISALADVRDRI
QLVYGDITQRSSLDAAMQGVHHVYHSAGLTYMGDKKNSLLYKINVDGTHN
MLDAAIAAHVDRFIHVSSITAVGIAFDKKPVNEATPWNFHALGLEYARTK
HLSEKEVAKAIQRGLDCVIVNPAFVFGAGDINFNAGRIIKDIYNRRLPFY
PLGGICVVDVEIVSDAIITAMAKGKTGERYIIGGDNVSYKQLSDTISRVT
GAPKVLLPLPFWTARILYSLLDLYHNRRRLSKLFNLSMFKVASHFLYFDS
SKAARELNMRYEPHEQSIRNAYEWYRERNML
>Cag_0506 glycosyl transferase
MPTPHCHLSVVIPLYNEQESLPELLQQLEQALHHPSLQALFAEPLEYEII
MVDDGSTDGSASSIRRLATKHCNVRLISFQRNFGKTAALSAGFAASSGEL
VCTLDADLQDDPSAIAALITKLHEGYDLVSGWKQQRRDPLSKTIPSKFFN
AVTRLFTGLTIHDANCGLKLYRHDVVKRLELHGDMHRYIPVLAAWMGFAI
TELPVPHHPRKYGTTKYGFSRFIAGLLDFLSVLFITRYLRRPLHFFGTAG
LLSALCGFGISLYVTLDKVLLHKPVSNRPILFLGILLMILGVQLFSTGLL
GELLSTTNNRHSGFIIRETFNVTDEQVQALRQ
>Cag_0240 dihydroflavonol 4-reductase family
MSKLSIALTGATGYIGSQVLLELLKRFKGELDCRVLVRGSSNYAWLEALP
VQVIAADVLEPIALHEALRGVDTLFHCAGLVSWTRRFRSQLYEVNVVGTR
NVLHAALYNGVRRVVMTSSIAAVGMSEDGAPANEAALFKEWQRRNGYMEA
KHLAELEALRAVAEGLDVVLLNPGVVIGVDHHNPASLSSSNRTLRQMYDE
KLWVAPAGSTGFVDVRDVAMAHIAAWEKGKSGERYIVVGHNVSFHELLSR
LSALNNGVAAKVLTVPRSVGMVAALGGEAWSLLTGNPSFIAFESIGTSAR
QLAYNNERSLCELGIAYHDLEETFQTILK
>Cag_0745 glycosyl transferase, family 25
MKVHVISLKRCTERRKAFMDMNPHVDYLFFNAVDGSTIPEKVLSNPLLFE
KGLPYTKGAYGCALSHLLLWNKAIKENCVLTIAEDDAIFRKDFHVMQNKL
LSSISSDWDIILWGWNFDSILSLNVLPDVSPTVMVFSQEKLRESINTFIE
KVTYPSSLFFLDKCFGIPAYTITPQGAIKFKSLCFPLKFFSLWFPLLNRK
LPNNGIDIAMNKIYSSTNSYVSFPPLVVTKNEHAISTIQTNRNT
>Cag_0449 L-alanyl-gamma-D-glutamyl-meso-diaminopimelateligase
MTSNRIPCAISLLILYFESNYFSMSFFYFIGIGGTAMASVAVALSRAGHY
VIGSDTQLYPPMSTFLEEHGISYCHGFAEENLRSFSPDAVVVGNAISRGN
PELEYALEQHLELLAMPDLVRRHLIANNTSVVVAGTHGKTTTTSLVAWML
EAGGLQPGFLIGGIPENFGLGCRPSAGEGAGFFVTEGDEYDTAFFDKRSK
FLLYRPDIAIINNVEFDHADIFNSLEDIKRSFRLFVNLIPRNGLLLVNGD
DPVALECAEKAFCPVERFALHANAEWSATNIHSEEEGSSFELLHHGKSVG
TFHVPLFGNYNIMNAIAALAAAYRCGVSFEALADGLPHFQRPKRRMELLG
EFADGITLIEDFAHHPTAIRVTLEAIAQRYNGRRIVACFEPRSNTTTRNI
FQEELASCFAPASVVVLGKVHRPERYGDHALNTALLQEQLQSAGKEVFLA
GNDADYPADIIRYLEAHRRHGDVVVLLSNGSFGNLKQMVMERWR
>Cag_0052 Phospho-N-acetylmuramoyl-pentapeptidetransferase
MLYYFLKYINDIYHFPALRVIDYLTFRASAAAITALLITLLIGPKLIAYL
KQRIIEPIKEEAPPEHRKKKQLPTMGGLLIIFAFELSVFLWAKVDSPHVW
LVMVAVLWMGAVGFLDDYLKVVKKVKGGLAAHYKLIGQVLLGLLVGLYAW
FDPSMAVLRDTTTVPFFKNLTINYGIFYVPVVIFIITAISNAVNLTDGLD
GLAAGTSAIAFIGLAGFAYLAGNAVYATYLNIPFIQGGGEVAIVSMALVM
ACVGFLWFNSNPAEIIMGDTGSLALGSAMAVIALLIKQELLLPLMAGVFV
LETFSVSLQVLYFKYTRKRTGEGKRIFLMAPLHHHFQLKGWAEQKIVIRF
WIMAILFFLASLMTLKLR
>Cag_1254 hypothetical protein
MKGKPTFVIALASCLIVSCTNNTPKTDADSTMKEVSAEMADMKSYVIQYK
NEVKASGAKSSAIITQYIDMKNDKMSIETESYTELNNSKISEKSLAIYDK
EWTYIINLKDKTGVKMKEDQAEDDPMDMIKSDDTITFRQMIEKEGGKIIG
NEQFLSKDCIVVEMKQEGQISKMFYYKGIPLKIESPAYTMEAIKFEENIT
IPVSKFTIPAGITLSEVPVMP
>Cag_1474 heptosyltransferase
MASKFIQQKRFTRTLVARLLQLLSGKHKQTQHYHGTPKSIIILAQEKLGD
SILLTPLLKNLHQLFPQIKIHLVCFSKASATFFKNDSHITAIHQPKIDGL
AYYRFIRSHTFDILFNTKDSPSTSFLLQTVLIRAKYKVGHIHQHHTGLFN
YEIPINFHSQMAAKNCALFSFLNVQVKPEDCRPYLPANNISKEVATLLSV
PHKSNIIGINISTGSPDRQWQEEKWYKLISDFPEQRFIVFAAPNDIASKQ
RLETLPNVMTTPQTKNIYEVGLLVERLVLLITPDTSLVHVASCYNIPIIG
LYTTIEHDLSRFSPYLIDYHIVRSPTNKVEDIALDAVKTTLKKQLEC
>Cag_0795 Prolipoprotein diacylglyceryl transferase
MSSFLHWWQILPFSMNPVIFSVGSFAVRWYGMMYIIAFAVVYLVVRYRLA
TEKLPFQTTFVGDALTWAMVGVVIGGRLGYIFFYGFADFLANPLQSFFPW
ICSPDGSCRFSGISGMSYHGGVLGVIGAMWLFTRSQQQNFFQVFDLFMPA
IPLGYTFGRLGNFINGELYGRVTEAAIGMYFPTAPTIALRHPSQLYEAFF
EGIVLFVVLWMLRKHSPFAGFLSALYLFGYGFVRFFIEFFREPDAHLGFV
FFSFSMGQVLCIAMMVAGIALFAVAKKLSNNTIKA
>Cag_1475 glycosyl transferase
MKILFLCSAKKWGGNEKWTLLAAQALATKHKVIVGYRSEVIGNRCTTASI
KLPFINEIDLITIGKLIKVIKKEKIDIIIPTKRKEYVIAGILSRVCHIKN
IIRLGIDRPLTNTFLHKLIYQILCDGIIVNANKTKATLAQSPWLDASKIK
VIYNGLDKENLLKEAKNSFTPPHPFLILSVGSLISRKGFDFLLRAFALFI
KRNKNTDVGLIIAGDGPLLNHLKELTKSLAIDNNVHFTGFIANPYPIMKA
SNIYITASQAEGISNALLESIALGCVPISTYSGGAEEIIQNNDNGFLVEY
NHEEKLALLIENLYNHQEVREKITTKAKHMVETTFSSERMAIEITTFCQS
IISKKHQ
>Cag_1476 glycosyl transferase
MVHAAKGLADKGHTVVLASKKHSKIIDYAHSKGVKTTVFEIRGDVSPITT
LKIAHFLKKHAIDVLICNLNKDVRVAGLAARTVNTPVVLARHGMLLCGNK
WKHKVTLTQLVDGIITNSKTIKEAYQNYGWFDENFVKVIYNGLSIPENIQ
THDFSKQFPNKKIIYSAGRLAEQKGFTYLIEVAAQLQKERNDLIFVVSGE
GKLEETLKQEVNNAGLSDSFYFLGFTADIYPYLKGCDLFVLASLFEGMPN
VVMEAMAMKKPVIATDVNGARELMIDGETGIIVPPREPKNMADAIRKIID
NSDALIEMGQKGYERVTSTFTTQAMADALEHHLLEKLAEKKSYKTT
>Cag_1821 mannose-1-phosphate guanylyltransferase, putative
MMSTFRQHVYAVVMGGGFGKKLWPVSKRKRPKQFIDLFNDGTMILKTLQR
IAGLVPEENILVITSALGKQLLLELSPHFQESNIIVEPACRNTAPCIALA
SAHIKKRDPEALTIILPADHLVRDSDAFELIMQAALLQAQHSMGLVTLGI
MPTRPETAYGYIQATESLPMPEGFGVDDRFKLFAVKAFAEKPDYATALNF
LETRDFYWNSGIFVWHIKAIWQEFQRSMPDLYHDFLTIYNHLGTQSEQKI
IEDVYSWIHPCSIDRGIMEKAERVFVLTGEFGWTDLGCWDEVLHVAGDLP
VLVAEQEGGHHMEIACENLFVKTMPDKVIATIGVKDVMIIETDKALLVCH
KGQSHRVREIVDMLRRAGLEDYL
>Cag_0320 3-deoxy-D-manno-octulosonatecytidylyltransferase
MNAIIIIPARLGSTRLPEKMLADIEGEPLIVRTWRQAMQCCRASRVVVAT
DSVKIAEVLTTYGAEVVMTSPEARCGSERIAEAARQFACDVVVNLQGDEP
LISHETIDLALEPFFSPNPPDCSTLVFPLQPDDWAQLHDPNQVKVVLNRE
GYALYFSRSPIPFQRNQLTSTQCYRHVGLYAFKAEVLQCFAALPPTMLEE
AESLEQLRLLEHGYRIRCMVTHDDQPGVNTAEDLELVRTLFKQRHQEA
>Cag_0340 Glycosyltransferases probably involved in cell wall biogenesis-like
MQNKKASWILFVEVGLLIVLLFTYLAIRIYYTINAPFSTIEFILGFFFLC
SEVFILYHSFFFFIDILREALYDAEPKRKEPGKDASVAILVPARHEPREV
VENTLLTCINIAYENKTVYLLDDSSIEKFKQEAREVSKKLGAKIFTRVGN
RGAKAGIVNDILKTLKEKYVVIFDADQNPMPNFLRTLVPLMEGDDKLAFI
QTPQFYSNGDASPVAMTSHNQQAIFYEYICLGKSLNHAMFCCGTNVIFRR
EALQDVGYFDEDTVTEDIATSLRLHAKGWKSMFLYKAYTFGMAPEDLASY
FKQQNRWALGTSQLLRKFIHLFFTNIKALKPVQWIEYSISTTYYFIGWAY
FFLMAGPVMYIFFNIKSYNIDPLSYVFTFIPFLVLSNIVFFQGMARKSYT
RLNMLKVQMLTALSMPVYLSATVIGVLNLDKKGFQVTPKGGATNVPLKAL
WPQLLLFSIVIVTLLWGTIRIVFFEFDYMLVINVIWTTLQAFMLSGLFYF
NKK
>Cag_1408 Membrane-fusion protein-like
MDEHMNANLDASLSALRALRDFKGREGEFWLSMATHISRLFQAERVVLLR
RAEVGWNPLSFWPLASARSAIPPSAQLAELATTAKQTAVAYAPLPEQLGT
TLQHAVSTAAEQSSEKKADNISGNTLLAFHLHTESADGEIMALLWRAHDS
AALRNGDLLKRELLVDLPLQYRQSNTTRPLTSATPESLDVVLSMNEHTTF
TGAAMSLCNELAFRLSCSRVSIGWKDGEYIRLQAVSHTEKFDRKMSVARA
LEVVMEECFDQDEELLVPEVAGTSTTIIREHRAFVAKQGVGAILSLPLRL
GNEVVAVLSCERDKPFSADDIRSLRIICDQVTRRLGDLKHFDRWFGAVAL
DKVRNWASSLIGTDKTVHKIYAVVGSILLLFLLFGKMEYKVEAPFILRTH
DLALLSAPFDGYIERVSRKPGDLVTTGDPLILLDTRQLLLEESRSAADVL
RYQQEEKKAMAQNALAEMKVAEALRRQADSRYQMIRYNLQHADIRAPFSG
IVVEGDLEKLLGAPVRKGDVLLKVAKLEKLYIEIKVAERDIQEFKVGQEG
EVAFISQPSKKYTVVVDRIEPMAVTEQKGNVFLVLGHITEARDAWWRPGM
SGLAKVSVGERHILWIWLHRTLDFFSMKLWW
>Cag_0791 Cell wall-associated hydrolases (invasion-associated proteins)-like
MSQKTICPSTHWMGCPYSLWQRFSRAVAALATLSTLSCIAPSSIVLANPI
APPQSTAVDVVAENPSTPIALDPPTPSKLEQLMGNMGNYFGIRYRFGGQT
PAGFDCSGFVRYMFEKVYNIKLPHSSREMSSLGDRISREELKPGDLVFFH
SGKNRINHVGIYIGNDAFIHSSLSKGITEDKLQHRYYDKRYAGAVRILPD
ITLPFSTPRQEDAQLEIIKPS
>Cag_1330 Apolipoprotein N-acyltransferase
MLQLSNYNQVRRSHYVAALLSGLLLGVAFPSYPFIRLELLAWVALVPLLL
SLRGVERFGALFRRVYFSMLVYGAIALWWVSLATLPGGMLTVFTDALFRS
IPFFLFYLLKKRVGYHFALSAFPFLWIGWEWLYMQQELSLGWLTFGNSQA
LLTPMVQYAEITGVWGVSFWLVWFNVLTVFAVIGKQRNRLAIVASMALMV
ALPLLHSWWLFANAELNSAAKRELRVTLMQPNTDPHHDYDRAVMVPHYLR
ISSQAVRAEHPALVIWPETAILFPLLEPVYHPEFQMLEGTLKEWDAALLS
GVIDRVTRAEQWSSIYNASVLLQPAGQVPQMYRKMHLVPFAERVPFLDYI
PWLGYATMSLSGISGWDKGTEHVIMRLKTPQGTVRIANIICYESIFPEHV
ARFVGRRAELLTIVTNDGWYGTSYGPYQHLAIGRFRAIENRRAVARCANT
GVTAFIDRYGRIMAEVPWWQEATLTADVPLERDLTFYTQYPDLLPQGALA
ASCLFIAMALVRRKRFDEV
>Cag_0676 teichuronic acid biosynthesis
MFTDNLISIITPVYNTYPFLFRLVQSVQMQKVNVEHIIIDDASTDHSYET
LIEYAKKYSNIRLIRLPLNRGPVVARNEGIKIAQGRFLAFLDADDLWLPN
KLEIQLSLMRKNNWSISFTDYRFISFDGTLVGKIVNGPNIVDRDLHFATR
SGMGCLTVMVDRQKFLNFSFLETDPITTRAEDFWAWAELLKTTVAHRVPY
DLARYTVVPGSRSSNPWYKAKVIWTIYREIEKMSFIKALLYYISFSISAT
KKRLLSTPRYKIEDIDGDKGKEWLDLVNLSSNKHS
>Cag_0672 pyridoxal phosphate-dependent enzyme
MILMNDFKAEPPELREAMLGAAQRVIESGWYVLGNEVVSFEKQWAAICGV
DYGIGVGNGMDAIEIALCSLSIGVGDEVITTPMTAFATVLAILRSGAIPV
LADIDSDTGLLSIESVRRCISKKTKAILLVHLYGQVRDMDKWTALCKATD
LYLVEDCAQAHYAQWQGNVAGSFGIAGAYSFYPTKNLGAIGDAGMLITND
ADIADKAKRLRNYGQSTRYYHPELGMNSRLDELHAAMLSERVKWLHSFTE
RRWQIAEYYREHIDNPLINLLSAPEERTAHVYHLYVVTTAYRDALQVYLQ
ENQIQALIHYPIPVHFQDPCKNILRDPKGLAKSEYHAAQCLSLPCHPQMS
DADIEHVANTVNSFKVS
>Cag_0806 RfaE bifunctional protein, domain I
MEFDKPRTMPLSALPPSLPEFEALLALFQGKRIAVVGDIMLDAYIFGHVS
RISPEYPVPVVDVTREEHRLGGAANVAQNTRAMGAETILFGVTGNDRNRD
TLVELFKQQGLTTNALICDPSRPTTCKTRILSQNHHITRVDFESRQEVSA
DIEAQIVHTFESMIASLDAVVLEDYNKGMLTAHLIERIIAISRKHKVPVL
VDPKHRNFFAYKGCTIFKPNLSEMATSLGIAIPNCNAEVEAACKILRDKL
EAETIVVTRSEQGMSIYNGNFTHIAASSLDVADVSGAGDTVIGMLALGAA
AGMDIVTNTSLANLAAGTVCQEVGAVPVKSEKLLKAYRDYLLQQ
>Cag_1911 OmpA domain protein
MKTRKYAFAAFALLALAGCSSKSSVAPAPSASSSAPQALATPLAEPVVPP
PAIASEPLPMYTPPVTSQPLTPSATTTTLVPATVKGYGYDQWQKGPLGDI
FFEYDSATLDESAQMQLQQNAALLQQFIVESIQIEGHCDIRGTSEYNLAL
GERRATTAKEYLMRLGVPASRLETVSFGEERPFDNGNSEDAWAKNRRVHF
VLIKQ
>Cag_0986 outer membrane efflux protein, putative
MKHLRTWINQRKRTFYNISSTLAFLLTPLPALTLMAVSLQVYAGENATPT
RLTLEQCITIALERATPLKKADNNLTLQGTDVLQRYGSFLPRLTLSAGYT
PVQQQKSYTTLSGTMPPTLLTTESDALSMQLTTSLNLFNGFGDMAALQAG
LNRRDAARLSVARARETVVYDVTQAYYQALLDRELLLIARENLQASRDQL
TLTERQYQAGLKSLIDREQQAAETADSQLRVMKAESRAEQSLLELLRRLQ
LDPLTSLELQTAADVVNGDAPYTLAADELIARAREQRNDLKSQQAQSKAN
RWQEREAAAQRYPSLDLNLTASTSATGDVEQRIAGIEKKYSYPPLSDQLG
NATSYSVTLSMNWVLFDGFRSRYSLQSAHLNYLNQQLDVEDAKRNLAIDV
RKAIAEYDAARQQISAARVSLQAASAAFNGIKRKYELGAATFVELSSARA
ALFNARSSLSQATYSLALQKNILDYVSGSTSFSK
>Cag_1974 2-dehydro-3-deoxyphosphooctonate aldolase
MQQKFSIGSITVPDCELPLLIAGPCVIESRAMAFEIADELQRISQAEGVR
FIFKGSYRKANRTSAASFTGIGDEDALTILADIRQKYGMPVLTDVHESAE
VALASRYVDVLQIPAFLCRQTELLVAAGESGLAVNIKKGQFMAPDDMRLA
AAKVARTGNNRILLTERGSSFGYHNLVVDFRGIAKMAESGYPVLYDATHS
LQLPGAGQGMSGGEREYMLPLARAAVATGVDGLFCEIHPNPEKALSDAAT
QIPLAEFGVIIHQLLHLYRCVQPLLSH
>Cag_1940 Rare lipoprotein A
MPKHIYKLWLILPLILSACTTTRAPFRAISPEEAYQQGKLKQNPYVINGT
TYLPLRYEEALAYEENGLASWYGKETLIQNNYQLTAYGEVFDPSKPSAAH
KYLPLPALVRVTNLDNNNSIVVRVNDRGPFIGDRVIDLSAEAAKRLGFYE
KGMARVKIEVLNK
>Cag_1729 conserved hypothetical protein
MQNNTPSYSGKVLVAGATGKTGQWVVKRLQHYGIAVRVFSRDPQKAETIF
GKDVEIIVGKIQDTNDVARAVTGCSAVISALGSNAFSGESSPAEVDRDGI
MRLVDAAVAAGVTHFGLVSSLAVTKWFHPLNLFAGVLTKKWEAEEHLRKH
FSAPNRSYTIVRPGGLKDGEPLQHKLHVDTGDNLWNGFVNRADVAELLVI
SLFTPKAKNKTFEVISEKEELQTSLAHYYDTL
>Cag_1572 conserved hypothetical protein
MQKATISILGCGWLGLPLAKTLIAQGYNVKGSTTSEAKLDVLQEAGIEPY
LVTFEPEIEAEDAVSFFQSDILIVNIPPGRREDIVEYHIMQFSSLIDALG
QSPVRSLLMVSSTSVYPSLNQEVIEEDAVDPESPSGQALLMVEEMLMQES
GFQTSIVRFGGLVGYDRTPARYLSALKEITNPNHPMNLIHQDDCVGIISE
IIRLEQWGEVFNACSPIHPLRSEYYNRAADDAGVARLPLGAVDDSMGYKI
VSSEKVVKALNYTFKHSDPIG
>Cag_1698 Outer membrane protein and related peptidoglycan-associated (lipo)proteins-like
MFFCVNYKPTSLMKKSVSPFVRSVMVPGMLLMGACTCQPVALEPALQPAP
APAPVVVPPPPPPAPPAPKPAPAPAPVVVAPPPPVVVAPPPPPPAPIVVT
KAMILGDILFDFDKSFIRKDAVPQLQDVAAWMKEHPTKNVTIEAHCDSKG
SEAYNIALGKRRADAAKAYLVNKGIDSNRLKTISYGKDKPLMNGIDESAR
ARNRRVHFVVE
>Cag_1481 Glycosyltransferase-like
MQKSINIFTPDKITLPLSWVGHIPFAAWLVEILHPQILVELGTHSGNLYF
AFCQTVKKNQLLTKCYTVDTWKEAQHSGIYDNNVYNEILAYNSTIYGDFS
QLFCMTFDEALTKFENGSIDLLHINGEYTYQAARKDFETWLPKMSNKGII
LFHDIMVKEREFGVYRLWEELSSQYGHFEFTHSHGLGVLLTGKNQHPAIE
SMAQDFQDTQKKKLISGYFEHSGYAIELEYQHQSDATKIHKLSQQLKSQS
LQINSLQKHNQSLQHEIQELNKSLVRITNSSSWRLTKPIRKWSKSLRKRF
RKIRYFLTGETAENVPKRLTTLCNNWFKPTDKTRILIIDSWIPAPDQDSG
SMDTFLTMKALVELGYDITFIPKDLKAKQKYVQLLENEGVRYPDLSKAAI
SIEEFLKVAGHYFDLVMLYRVDTASSFLAMVKHYAPQAKIVFNTVDLHFL
REQRNAELSGSDIMRKNALKTKEHELQLMQQADSTIVLSNVEFDLVKKIK
PEVNLELMPFFRMIPGRSAAFHERKNIVFIGGFKHQPNLDAITYFISEIW
PKVHLKLHDAKLRIIGSNPPKELYRLVDSDNTIELLGYVANLDPEFNTCK
LTVAPLRSGAGIKGKIVTSLSYGVPCVASPIASEGMELIPDKDLLVAKEP
DEFANKIIKLYTDEALWNALSDNALTTVEERYSYKAGKKRIGDFLNKLLG
SSRHSVWGSEEFLQNNLANETDDTDGKKRIIIELPSFDKGGLEKVVLDSI
LAFNKNKFHFLIVTPGKLGELSTVATNAGLSVIQLPDINHEAAYERLVIK
YRPHASMSHFSHLGYPVFINHHIPNITFIHNVYAFLSEKHKKEIMMYDHA
VTRYIAVSPKVACYAEKNLGINQEKITIIPNGLCITEHEERQKRATPALR
DDFGLNKNDFVFLNPASYNLHKGHYIMVDALQIVTKKRKDLKILCVGNIV
HEPHYHELQQYIISCGLSEHMLMLGYISKIENIMPIVDACIMPSFIEGWS
IAMNEAMFYGKPLIMTDTGGASEVIENNDIGILIPNEYGASDLLDRTTLD
KLAYKPHHYKISSMVADAMIAFADNHEYWKKAGEKGRKKIYRHYAFKNVV
AQYEEIMNQVTEPVTYEPQ
>Cag_0919 probable ABC transporter permease protein
MCMKPALWIALRFSFARKRFRIINFISAITLAGITIGVATLLVVLSVLNG
FQELARTFFLSLESPVQLVSSHNNAITVTPALLASIRTLEGVATAEPYAE
GEALLATQNKSELVMLKGLSPTAHQHLQNYINAPQPLFTDSTIAVGELLA
LRSSLYPNQPIQLFSPELLSLGLESLTQPYLIAALTIPRTSLHTIFSIHK
LFDDRYALSSLPLARQVLLLGDNNYTGIEIRGKHGVKGETLQRTLQQWLT
KEGVEKSYRIRTLEEKYQSVFSVMQLEKWITFSILMLVIVLASLSLTGSL
AMTVVEKQQELFYLRCLGFNTPSFTALFVMQGAITGITGTTLGTALGWGI
CAAQQHFGFVQLPSRTAFIIDAYPVAMQLSDFFVVGGAAIALCLIVSLYP
ARKAALIASSRTV
>Cag_0744 Type I secretion outer membrane protein, TolC
MRCHLKKIASLLLLLVLSVSAFPLHAETLDLATAYRKAMEYDARLRAAKA
DNAIYREEVGKARSQLRPNIRGNASRGRSTTQRGNKYGFYPADSYNTVNY
GVTFRQTIFNFSSTAAYDQAKLVAMKSDTDFRKEEEMVMVRIAEAYCNVL
FAEDNLAFNNSFKTAAKEQLQQAKKRFAKGVGTLIEVEEAQASYDQADAQ
GIDMQNNLEFSRRELEHLTGIYPSELRAVDAAKLPLFAQQESFEVWLERA
RTANASVESARHEILIAKKEAAKQRGAQYPSLELVAGRNYSESENNYSIG
AIYNTYSVSMQLSWPIYTGGYGSSSIRQADAKKIKAEEQYSLQVRQMESD
VRKYYNSVAGSIALVKAYQQAVNSREVALKGMKRGFQAGLRSNVEVLDAE
QKLFASRRDLAKSRYQYILNLLMLKQAAGVLQPQDVDEVNGWFAKASLK
>Cag_0743 metalloprotease secretion protein
MAEKITSPLDKGAANAPDAKKYRDTRSPIRLGIWILLVGFGSFLLWASFA
PLDEGVPCQGLVGIATKRKVVEHLRGGTVQAVHVREGEIVQEGQVLISLD
SQTARARFDEIHQHYIGMRATADRLQSEMRGAGSIAFHPDLLRESDKSLV
RKNIENQKALFASRRQTLQILTEQLIAIKSLVSEGFAPLSQQRDLELKIA
EFKSSTASQLAQVQLEVEADAEKSRALAAELADTEIRSPASGQVVGLQVQ
TVGAVIQPGQKIMDIVPLDESLLIDAKIAPHLIDSVHQGLAVDINFSAFA
HAPQLVVEGNVESVSKDIVTDPPSSGTQPGASYYLARIAVTKEGLKQLGK
REMQPGMPATVVIKTGERSLLTYLLDPLMKRIHVSMKEE
>Cag_1346 Peptidase A8, signal peptidase II
MKLFFSLALFVVAADQFSKYVALRFLRDANQSISIIPNFFSFTYAENRGI
AFGLEPAPPALLLLFTMMISAAVLWYVLRSNNRRLIFLLPFSLILGGGVG
NMIDRMVRGYVVDFIYFNLYNGYVGNIYLSLWPIFNIADSAITIGGTMLL
LFHRTLFPDDPIA
>Cag_1106 glycosyl transferase
MIFSYQPTVSIILATFNRAHYVAHAVQSVVEQTIDDWELLIVDDGSDDAT
FDVVAPFLAQHSNIRYMKHKNRNAALSRNAGIQASFGQYITFLDSDDRYA
PNHLASRLKIMAENPAVDLLSGGFWCEEPTLVKDRDNPKRLINIRECIVC
GTLFGKRELFFDLEGFRNVAYAEDTDLWERASLRFVTKKIAAPESYLYQR
ALDSITLTYQATMNG
>Cag_1734 Dihydrodipicolinate synthase subfamily
MPTPYLSGSAVALVTPFKSDSSIDFEAIARLTEFHVAAGTNIIIPCGTTG
ESPTLAEEEQVAIIKTVVEAAQGKLMVAAGAGTNNTHHAVELAKNAEKAG
AAAILSVAPYYNKPSQEGFYQHYRHIAEAVAVPIIIYNVPGRTGCNVAAS
TILRLARDFDNVLAVKEASENFTQISELLEERPSNFAVLTGEDSLILPFM
AMGGNGVISVAANEIPAQIRQLVESAGSGDLVTARTLYSRYRKLLKLNFI
ESNPVPVKYALARMGMIEENYRLPLVPLSAESKRAMDEELTLLGLV
>Cag_0751 hypothetical protein
MKINIIDPGLFHQAGHHLDLDIKVCKVLKNLGHDIAIYSSINVTDKIKSF
FEHYGEVTPIFKAIPYFNVENIDRLAGGYIAFNRQSKILAEDFTKVASAD
LWLFPSIFCAQLNACAISGSKTPIAGCIHLSATKEYAQDEMFWRYALINC
KDRSIELNLGVMEPEHLMEYQKIASNDFEIISFPIPYEGVPIERPRKTLR
KVGFFGHQRDEKGIHLVAKLTDLLTKKGYEIVIQDSTESFKSTGYSNVTI
LGYVANIALEISKCDCVILPYNPIKYQTKGSGILWDALASGVPVIAPVGT
AIGRWIQYCGSGRLFHEFDVNSIIHQFELLSENYDEYVSIAIKNASIWSV
KHGIDKFVSSLLNFKKSC
>Cag_1635 Alanine racemase region
MCEALISLEHLRGNVRALQAHLNGRAHIMGIVKANAYGHNVHFVASTLET
CGIHNFGVANIHEALELKQGGALQKPATIIAFASPLPSHLPYFIEHGITM
TLCDHATFVAARDIAAALERPLQVHVKIDSGMGRLGVAPHEAMALLQAVD
ASPFLELTGVYTHFADSATPNSFTHQQLAIFKTIAAEYEHAAQRTICKHT
ANSGALLSLQASWCDMVRPGILLYGYHPSQECPTQLNVQPVMQVQAKVMF
IKKVAAGTSISYNRTWQAPTERFIATIAAGYADGYHRLLSNNAAVLINGK
RYPQVGTVTMDQIMVDLGSSHSVQVGDSAVLFGWDTLSANELAAQAHTIS
YEMLCAVSARVKRRVV
>Cag_0247 conserved hypothetical protein
MLLIGTRMVDWSSLQTYLVIALLLLLFALVVMLGANSLFQQLVVRIRATM
RGVSLPLELVAWPFRLLMVLAGMAVLANNVPLSARIDVLIKHSLLIGGII
GVTWFLTRLMLVFEQVVLDYYATKVRESEAVRKVATHISLARKIIDALLV
LVAIAGALMTFDTVRQVGLSILASAGILSVMVGLAAQKSLTTLIAGVQIA
ITQPVTIGDEVVVENEKGTIEEITLTYVVMKIWDERRMILPITWFLDRPF
ENWTRTSPELLGSVFLSIDYLLSPDVLRQELERLVATTPLWDGRVVKLQV
TNSTERSMEIRALVSAANASQLWDLRCLVREGLIAFLRNNYNDLLPHLNI
EIERGRSPQNR
>Cag_0605 glycosyl transferase
MELSVVIPLMNEADNIEPLFSALNKALRNIEHEIVLVDDGSTDNTVETIQ
RYATATTKLVVLNKNYGQTAAMAAGIEQASGELIATMDGDLQNDPDDIPM
MIRYLYDNNLDVVAGRRAARQDGMLLRKIPSKIANAMIRNLTNVHMHDYG
CTLKVFKRNVAKNLGLYGELHRFIPVLVQLYGAKMAEVNVRHHPRKFGTS
KYGIGRTCRVLSDLLFMLFFQKYSQKPMHLFGSLGFISLAIGMSINAYLL
AIKILGEDIGGRPLLSLGIILTFIGIQLITTGFIAEFIMRTYYESQNKKT
YIIKDVVSKN
>Cag_1616 RfaE bifunctional protein, domain II
MAAISPKILSWQNAAEQVQAWRAMGNKVVFTNGCFDILHAGHVHYLQAAK
SLGQRLCIGLNSDASVQRLKGPKRPICNEVDRATLLAALEVVDMVVLFDE
DTPERLIAALLPNVLVKGADWAVEQIAGAATVLQHGGEVLTVPLLDGRST
TGVIERIIERYSL
>Cag_0920 conserved hypothetical protein
MKLLWYLATRFALKQRSSSKPTFVVVLAVAGIAAGTAALLLTLAIVNGFA
ATISQKLVSFNAHLQLRHSSQEFFQEERSERLQLTSHPEITHFTPFLEQR
FVLRCRNSAQQGRWSSKAVLVKAMPSAERQNFIKRYSVRGEEKECRVAEG
MVGIYLGRSLAEELNCKVGQKVMLIRDSNQASQRLLADATSLPELLASLA
IEPAVVCGIYSTGLQEGFDDYLLLADLSALEQSRKGFISGYEANVRHIER
LPTTVQELTALTNNRLYGYTLFQRYANLFEWIKLQQNIMPLLIVTITVVA
VFNIMATLLVLIIEKTGEIGMLSALGLAPQRIQGLFMLQALLLSITGITL
GNLLALGLALFEQHYHLIRLPEKSYFITEVQLLLNPMDNVVVSLSVLALC
LLFAWLPSTIAASLKPARALDA
>Cag_0668 Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)-like
MTLKLLACGDVVNFSAKQDFVDNKLTQIIQNSDIAICNFEAPIFHQDMRP
IKKAGPHVYQSKESVQYLKNVGFNFVSLANNHIYDYRQKGIFDTIQELKK
FDLEFIGGGTTFEEAYKTTIIEKNGITIGLLAGCENEFGCLYEEQNRGGY
AWLFHHKIEDNIRELKAKCDFIVFISHAGVEDIELPIKEWRDRYQRLCDV
GVDVIIGHHPHVPQGYEKHNKSLIFYSLGNFYFDTTSFKNKSDDSFSLLI
EFEKNKNIGFEIIYHKKINGQTVLVDKKDVSFDLDYLCGILEDGYLKRND
EISIELFNKYYYRYYQEALGVIACNLNFLSQIKWLIKKIIFSKKEIDNKN
LMLLHNIRIDSHRFIVQRALSLLSERI
>Cag_1352 hypothetical protein
MVSIAGGAVTVTQNRGFVNGRVFTGYNFNKNIAVELGYLQTGDTTANIAG
VSGTLVAYTGELKASVSGIDYSILIRPISNNEWNGLFIKAGGHYLEMDQK
GSLTFAGIGTNTIKESVNGSGFLIGIGYDTPITDNIDLRSAYTYYGKIAG
DSDSEANFFTIGLLAKF
>Cag_0647 putative glycosyl transferase, family 4
MPFFDSQHPVFFILATLVSSVGLWAIVATQHLHGEHSHDTSEGPQKFHSL
PTPRIGGLAIALTLFGFSSFISSLALVTIIAALPAFLAGLLEDLTKRVSP
RSRLIATFISAIVAWWLTGYHITSLDISFIDSLLVYLPLSLLFTSFAVAG
VAHAVNMIDGFNGLAGGTLFCMIGAFTLIAYSFGDLEIVSLGVLILSVLA
SFLFFNFPFGKIFMGDGGAYLMGFLVAWIAVMLPMRHPDVSPWASLVICS
YPVNEALFSIGRRALQKMSLGQPDSEHLHSLIKKNIIRPNFSYLEPHFRN
SMVAPLCWLYALVPIILAVTFYESIVVLIAIWIGSIVLYMLLYWFVARLA
INKESLL
>Cag_0671 UDP-glucose 4-epimerase
MNYKNKQVLITGGLGFIGSSLARSLVKQGAHVTIVDSLIPQYGGNLFNIS
DIRGKLTINVCDVRDPFAMDYLLQGQDYLFNLAGQTSHMDSMSDPKTDLD
INATAQLSILEACHKTNSDIKIVFASTRQLYGKPDYLPVDEKHPIRPVDV
NGINKLAGEWYHLLYNNVYGIRACALRLTNTYGPGMRVKDARQTFLGIWV
RLLIEGKPIKVFGDGMQLRDFNYVDDCVDALLLAGVNDSANGKVYNLGST
EVVGLKTLAEMMVNFYDGATYELVPFPPERKAIDIGDYYSDFSLITKELG
WEPKVGLQDGLKKTVAYYQVNHAHYWA
>Cag_1448 outer membrane protein OmpH
MKTSSFFAVSRRIATGVMLSLTLAAPQAFAAQESGKIGVIDSAKILQQLA
DTKQAESALQAAAAPMQKELDRMNQDYQQAVAAYRQKAATLAKTAREQKE
KELTTKGKAIEKYQQDNFGRGGALEKKQQELFSPVRQKVLTAVEAIAQKE
GISVVVEKNSAIYATTDADITYKVLNQLNVK
>Cag_1154 UDP-3-O-(3-hydroxymyristoyl) glucosamine N-acyltransferase, LpxD
MMTIQEIYEYLSRFFTPVELIGNGEELIHAPAKIESAQAGEVTFVANKKY
LRFLALTEASLVIVERSLAVEEYVGKHSFLKVNDPYSAFVFLLQRFIPPR
RIAKQGIAATASIGSNVTIGENVSIGEYAVIGEHCSIGNNTVIAAHSVLL
DHVTIGSDVVLFPHVTCYDGTRIGNRVVIHSGAVIGADGFGFAPQQDGSY
IKIPQIGIVEIGDDVEIGANTTIDRATLGSTVIESGVKLDNLVQVAHNCR
IGAHTVIAAQAGVSGSTTLGNHCIVGGQVGFAGHIEVSDHIQVAAKAGVS
KSFMQSGIALRGYPAQPMREQLKYEAQLRTVGDLHAKLKALEQELKALRG
SEMPLQNTL
>Cag_0512 Glucose-1-phosphate thymidylyltransferase, long form
MKGIILAGGSGTRLYPVTRGVSKQLLPIYDKPMIYYPLTTLMLAGIRDIL
IITTPEDQAQFQRLLGDGDDWGISLSYVVQPSPDGLAQAFLLGEKFIDGD
DVALILGDNIFFGYTFSSILERAVQSVTQEQKATIFGYYVSDPERYGVAE
LAPDGCVRSLEEKPQQPKSNYAVVGLYFYPPNVVEIAKTIKPSERGELEI
TTINEVYLQEGNLHCSLLGRGFAWLDTGTHESFQEAGNFIQTVEKRQGLK
VACPEEIAWRAGWISDSDVQRLAEPLMKNQYGQYLQQLLHKRDKL
>Cag_0047 methyltransferase
MALHDTYHDPVLAAEVVATLVQRSGIYVDGTLGGGSHSLALLQALQAQGL
LESSLLIGIDQDSDALAMAAERLQAWQPYTRLLKGNFRDMASLVQQLCDA
EGRACAVTGVLLDLGVSSFQLDTAERGFSYMRSGPLDMRMDNTAPLTAAE
LINHADEAELARIFYHYGEEPRSRALARAVVQQREKMGNFTTTEELAALV
RRLTHGGEKAVIKTLSRLFQALRIAVNDELGALHEVLEGALELLDGNGRL
AVMSYHSLEDRVVKHFFTHHAQCDWGPKGVALREPLSQGALTIVTKRPML
ASADEIERNPRARSAKLRVAAKNQPKTI
>Cag_1152 OmpA family protein
MALSLYKIAKPVALLLIPATLSVTWGCQTTTPSSNAAQKAKIGAAAGGAI
GALIGSRTGSWAKGALIGAVVGGASGAVLGNYMDKQAAAIDQNVEGAQVQ
RVGESIRVVFDSGLLFTTGSSTISAASRSNIEQLARILNTYGDTNVVIEG
HTDSIGNEATNQVLSEKRAESVATLLKVYGVAPNRMSAVGYGETRPVATN
ETEAGRNLNRRVEVLIYATDALRQQAASGQLRL
>Cag_1479 Glycosyltransferases involved in cell wall biogenesis-like
MKAPSVSVILPLYNHEQYIDKTLTSIFEQTSPPDEIILIDDGSTDQGFEK
ATLILKNDPRAHLIQQENKGAHNTINRGIELARSEFIAILNTDDLFLPSK
IERCRNIIENYPEIDFICGDINIIDENSNMVTSGETVKWLEHAHDFQMRC
HSLDLGLLNDNYVTTTSNMFFSKNLWEKNKGFQNLRYCHDLDFILTALSS
SKVVIDYNYKHINYRTHSHNTIKEKISKIRYEMAAVMANAMMCNNAIVSK
STLLDDIKSLGGIIDKKNNALLLSVLMTLRSRYTCKIPYYEQLQEDAIAA
ELYKLLN
>Cag_0490 GTP-binding protein LepA
MGLPNTDSCRIRNFCIIAHIDHGKSTLADRLLEITRTLDRTQMGSAQVLD
DMDLERERGITIKSHAIQMRYNAADGLEYTLNLIDTPGHVDFSYEVSRSL
AACEGALLIVDATQGVEAQTIANLYLALDAGLDIIPVINKIDLPSSDVEG
VARQIIDLMGVKRDEILAVSAKAGIGISELMESIVHRIPPPAVKNNEPLR
ALIFDSVFDAYRGAVVYLRIVEGLLKRGDKVRFFASDKLFTADEIGIMTM
TRQPREQLASGNVGYLICSIKDVKDAKVGDTVTLADNPAVERLSGYKEVK
PMVFSGLYPINSNEFEDLRESLEKLALNDASLIYTPETSVALGFGFRCGF
LGLLHMEIIQERLEREYGVNIITTVPNVEYRVFLTNGEEVEVDNPSVMPE
AGRIKQVEEPYVSMQIITLADYIGNIMKLGMERRGEYKTTDYLDTTRVIM
HFEFPLAEVVFDFHDKLKSISKGYASMDYEYIGYRDSDLVKLDVMLNGDT
VDALSIIVHRSKAYEWGKKLCQKLKGIIPKQMYEVAIQAAIGSRIISRET
ISAMRKNVLAKCYGGDISRKRKLLEKQKEGKKRMKQVGRVEVPQEAFLAL
LNIDE
>Cag_1430 UDP-N-acetylenolpyruvoylglucosamine reductase
MTPTKHQAATWYSNVSCQITTNVALAERTNYCIGGVARYVATPTSLAELS
ALLYAVQQERLPLALMGGSTNSLFSDEDFEGVVLSLEQMAQMVWLSDDEL
FCEAGVENSDIAQALFEVGKSGGEWLYRLPGQIGATVRMNARCFGGEISA
ITRAVLTMSLSGELQWQEPTEIFQGYKQTSLMGSSAIVVGVVLRFTESAP
PTAIRAEMEHYEGERLARHHFDYPSCGSTFKNNYSAGRSSGVIFDELGFR
GVREGGAMVSEHHANFIFNYENATAGDVLKLAAQMRHAALERADIALDLE
VECIGRFERGLLEACGVPSVTNAHNSSKAWAGMLWLPNEFATTTSVAPIA
EPSYPRQLMRGALMGYARFDREFPTGVMVEVEQLCARSEAATNPSAPFMR
WRTYAAPDSSLFTLKAPEPAAFTDGLWRYGVSELFISSVTDGGYLEFEMT
PNGHWVALRFSTPRQRAAGYETLSAALWQGNITVQQGAGWFGMELSWELL
APFINAEGVIALQCAGSSGRGEFGLFPSWATPSTPTDFHQPQQFYPIRL
>Cag_0675 putative glycosyl transferase
MNKILSVITVVKDDYSGLIETAKSVSLIVPSELVEYIIWINESSFEIIHN
IDLVKKLASKVIVGTDYGIFDAMNKALNYAAGDYVLFLNAKDLVIQGFNI
EKMLRPCLLKVQYIDYFGRLRKVVRNHKIDFGIPYCHQGMILPRKGYLFD
ENLKYGADYLALINMNLNWPLPISDSGLIHYDTTGVSTVNRWESDKWTAS
IIKRKFGIFYSLRYLFYCLIKLSIKRLYDVKIILTKKLNIFYVHR
>Cag_1546 glucose-1-phosphate thymidylyltransferase
MLKELIYSAFSNKIGHRKRDCNKAMKAIIPVAGVGTRLRPHTFSHPKVLL
NVAGKPIIGHIMDKLIAAGITEAIVIVGYLGDMIEEWLLQNYDIKFTFVT
QSELLGLGHAISMCKPYIPEDEPLFIILGDTIFDVNLEPVLKSTCSTIGV
KEVVDPRRFGVAVTENGAIVKLVEKPDTPVSNLAIVGLYLLQHSAALFKS
IDYLIEHNITTKGEYQLTDALQRLLDEGEKFTTFPVQGWYDCGKPETLLA
TNEILLSDNPPSKTYPGCIINDPVFIAESAKLENAIIGPYTTIGEDVVIK
DAIIKKSIIGNKAQVKHIMLGNSIIGNNAIIRGTPHEINIGDFSEIRVS
>Cag_1182 capsular polysaccharide biosynthesis protein I
MNVLVTGAAGFIGSTLCKRLLERGDRVTGIDNLNDYYDVSLKEARLAQLQ
PYENFTFVKGDLADRAGMEALFAKGEFEGVVNLAAQAGVRYSIENPHSYV
ESNIVGFLHILEGCRHHGVKHLVYASSSSVYGANETMPFSVHDNVDHPLS
LYAASKKANELMAHTYSHLYNIPTTGLRFFTVYGPWGRPDMALFLFTDAI
LKNKPIKVFNYGKHRRDFTYIDDIVEGVIRTLDHTATPNPAWSGATPDPG
SSKAPWRVYNIGNSQPVELMDYIQALENELGRTAIKEFLPLQPGDVPDTY
ADVDQLIEDVHYKPQTSVPEGVKRFVAWYKEYYGVKG
>Cag_1451 UDP-N-acetylglucosamine pyrophosphorylase
MALAVLIMAGGKGTRMKSDLPKVLHQANGRPLIHYVLETAATLNPAKTLL
IVGHKANDVQQATAHYPATALLQEPQLGTGHAVMQAEAELRNFEGETIIL
SGDAPLVTTATLQAMLALHHAEAATATLLTAELDDPTGYGRIVRVNNSSS
IEKIVEQKDATPNEQTIREINAGVYVFNTRWLFEKLGELNTNNAQQEYYL
TDLFSICFKTGKKVCAYKTATPNEILGINTPAQLQQIEEILKKGGVV
>Cag_1679 hypothetical protein
MRHISSALYRYAFHSAALCTMLYVAAPLHVVAAAESVGTRKANVQEAPFL
TTSLSNATPYVGEEITIHYTLYFQEIAPKIVRESQPSMQGLWAQASNPDR
FINSKSVQFKGKAYRSAVIKSYRVAPLQSGQLPIDGYTMHYVLSENEGEG
EKSITAPSVAITARPLPQPIPQGYSGAVGTFSIAQHSSAQQVRVGEPLTV
TLTINGTGNLFTVTPPTLSVPASFRADTPNKKIEVNSDANAKSGSITLTQ
TIWAEQAGSFTLPACSFIAFNPNSKSFATLRTEPLTIIVTPALQSGGQSA
SKGDSLSSSMAASEDGTAASFPVIPAVAGALIVIMAGVGAWLLRHRKGSA
QTSIEGSVPKSSVAITARSAKEQLFAQLHLVGIKHPNGLTRSELLQALCT
TRLSPEMQQQFMALLDALDALLYAPGAANPLLTPELAERFILFDKELNQT
YSAS
>Cag_0448 DegT/DnrJ/EryC1/StrS family protein
MCCRDSTEKNEMVVYFVRSYSFHLDRQHIMQFIDLLTQKERIKGALLRRF
EDILDRGQFIMGPEVTELEAQLAAYAGTRHCVSCSSGTDALLMPLLAKGI
GAGDAVITTPFTFVATAEVINLAGATPIFVDVLPNTFNINPELIGEAVAE
AKQKGLQPKAIIPVDLFGLLADYERLNAVANEHHLWILEDACQSFGGSFN
GGKAGSFGLVGATSFFPAKPLGGYGDGGAIFTDDSELDMLLRSVRVHGSG
ADKYSNDRIGINGRLDSLQAAVLLEKLTIFDEELATRQSIAEIYNERLDE
RLVVPTIPDGYRSAWAQYSVLAASAEERDMLMQALQQEGIPSMIYYKIPL
HLQKAYRSLGYNTSDFPVSEDLSRRIFSLPMHPYLKDEEIEQICTVLLQT
K
>Cag_1755 outer surface protein, putative
MKKRVAGIVAGCLALACYATPAQAQMPYISGAVGLATLSDVNNVAQGAFE
DGHRLMGAFGIDSGSTRIEAEIGVQNNGVKTLADDIKITTFMGNLYYDFE
LPMAPIKPFAMAGAGMVDVEQKQLGQDTSFAWQVGAGVGFSIIPMVTVDL
QYRYFATASDAQLGATDYSIDASHVMLGLRVGL
>Cag_0049 Penicillin-binding protein 3
MTNPEFRNRLGIVIGGFAVFIIIIIAMLLNIQVINVEKYKKKAERQYVKQ
VTEYARRGAILDRRSRVLAESVESITFYASPKQISKSLLFDEEGEAVINK
RNNKQQTFDNTQGVATLFAKHLGANRQIYLKALKKRKGVAVLAKKVPIEK
ALPLITEIKSRKMHGIWHEKEQQRSYLNVAAQLIGMTGNEKSSVDGGSGL
ELQLNKELKGVNGKRYYQRHATGELYTAPDVAQKAPKSGNSVQLTIDSDI
QSIVEDELSKAVAEFQADAATGVVMDVRTGEILAMASSPTFDLNRRSTWT
QDNSRNRAVTEMFEPGSTFKLVMAAAATEVLHRKSSDYVYAHNGVMPYYK
LKIRDHEPYGTITFKEALMHSSNIVAATTAMKLGRETFYAYTKNFGFGQK
TGVGLVGEARGIVRPLEKWDSTTLPWMGYGYQVMVTPLQILQAYATVAND
GELMRPYIIKKVVSPEGKVIRETAPEVVRRVLKPETARYVSREYFKAIVD
SGTAKNPILQSLHAAGKTGTARRASAGSYAVRSYVSSFVGYFPVSSPRYA
MIILVETPRTSYYAAAVAVPVFARIASRMVACSQEMQKMLAIRSPEQELI
DSLATVTVPDLRGLKGREAQRMLSWLGLTMEHSGDFNGVVVSQSVSSGTQ
VAKAKTVVVRLSK
>Cag_0299 conserved hypothetical protein
MQIVVFEDKRVEEFQPLVLLKPLYALFVGFRSLREKLEYAVRGRATLTYH
IRRYLAPCYQEQHPELVVNRLDEDDILLVNGRLLGDEAVAQVIVEGACEV
GSAFMQNGTMLFARVHRHMLIGENGLLPDVIDTERLAEQLRVEEVNGFRL
LEHIWDIIALHPDELLRDAETLELGRIEGEVHHAAALVNRSNIYVGAGAV
VRAGAVLDADDGFVAIGAGAIVEPQAVLMQNVVLAPWARAKIGAKLYSNV
AIGMASKVGGEVEDSILEPFVNKQHDGFLGHSYLSSWCNLGAGTNTSDLK
NNYSEVPLRRNGELVTTGLQFLGLLMGDHSKCSINSMFNTGTIVGAGANI
FGGGFVPKEVPSFAWGGSHGFEHYDVEKAVETARKVMARRKVTMSASYET
MFRSVAGLSGNSLFI
>Cag_1670 glycosyl transferase
MKIALYAGTYVKDKDGAVRSIYQLVNSFKKAGVEVVVWSPDVDPTYNHGS
LVVHQMPAMPIPLYPDYKLGFFSRATRQQLDAFAPDIIHISTPDIIGRTF
LLYAKERAIPVASAFHTDFPSYLEYYHLGFAVKPTWRYLRWFYNKCDVTL
APNESVQQKLESHGITNVASWSRGIDKELFDPSRRSEAQRATWKVDGKTV
FIYAGRFVPYKDTEVVMQVYERFMQSDYANRVAFVMIGSGPDEEEMCRRM
PDAIFTGYLTGADLPTAYACGDLFFFPSTTEAFCNVTLEALACGLPSIVS
DVGGCRDVVERSSAGLVARSGNSDDFYAKCLELLNNPERYQVMRERGLAY
AEQQSWAAVNGALIERYRRMVNQAQR
>Cag_0514 dTDP-4-dehydrorhamnose 3,5-epimerase related
MHVIPTTIPEVLMLEPKVFGDERGYFFESFRQDVIEEHIGQVHFVQDNES
KSSYGVLRGLHFQKPPYTQSKLVRALFGKVLDVAVDVRHGSPTFGQHITC
LLDSERKNMLWVPKGFAHGFVVLSPEAVFAYKCDNYYTPSHDAGIAWNDP
ALGIDWQLPLADVRLSSKDAAQPSLSSVDCFPYDAYRQAELYPPLIS
>Cag_1255 Outer membrane protein and related peptidoglycan-associated (lipo)proteins-like
MTINHFTMKKNLLTLSLLLSATVPATAQINLKSIFDSSAKKAERNAAQRI
EKKIDKKVNNTFDSVENNLDNVKNNSENNNYAFQEPVSNSIPVKQQSTLS
WNKYDFVPGTEIIFEDDFTGERNGEFPSRWDITKGTVEIAEWGGEKVVWF
KNTNTNVPDAILPYLKNRSTDYLPDEFTFEMDVYFHADYRLNKDYYIFFY
DAKNQTKIFSPSKPIRIDYNSVTYNHIGDLYQGQNKLKPKEGWRHVAISF
NKRALKGYLDDARLLNIPNVEFNPTGIMISSHNSGGKGQPFVKNIRIAKG
AVPLYDKFLVDGKFVTTGITFDINKATIKPESMGTINYVVKMMQEHPELK
FSVEGHTDSDGADANNQTLSEARAQAIVNKLIESGIAKERLTSKGWGESK
PITNNETAEGKAQNRRVEFIKI
>Cag_1282 UDP-N-acetylglucosamine1-carboxyvinyltransferase
MNKLVIQGGRPLSGTLTASGSKNTSLPIIAATLLHGGGTFTLHRIPNLQD
IDTFRQLLHHLGAESSLENNTLTISTANVNSILAPYELVKKMRASIYVLG
PLLARFGHARVSLPGGCAFGPRPIDLHLMVMEKLGATITIETGFIDAVAK
EGKLRGAHIHFPISSVGATGNALMAAVLAEGTTTLTNAAAEPEIETLCNF
LIAMGATIRGVGTTELEIEGTSSLHAVTFNNVFDRIEAGTLLAAAAITGG
DITLLEANPRHMKSVLKKFAEAGCTIETTPDSITLKSPETLLPVDVTAKP
YPSFPTDMQAQWIALMTQANGTSRITDKVYHERFNHIPELNRLGAGIEIH
KNQAIVHGIRKLSGGPVMSTDLRASACLVLAGLVAEGTTEVLRVYHLDRG
YENIEMKLRTLGASIERQKYQEF
>Cag_1072 sialic acid synthase
MIVMAEIRIGSRMVGNGHPVFVIGEIGINHNGSLENAFKLIEGAARAGCD
AVKFQKRTPDLCVPKDQRDIERDTPWGRMKYIDYRYKVEFGKEEYAAIDG
CCKEHGIEWFASCWDEEAVDFMEQFTPPCYKAASASLTDLPLLKKTATTG
RSLIISTGMSTMEEVERTVAELGMEKLLIAHTNSTYPSPIDELNLRMITT
LKALYPEVPIGYSGHEVGLATTWAAVALGATFVERHITLDRAMWGSDQAA
SVELSGLAKLVENIRDIEKALGDGVKRLYEGEAAARKKLRRTS
>Cag_0169 Tetraacyldisaccharide-1-P 4'-kinase
MSNPLRLAFRPFALLYEAIVQTRNQLFNRAVLRAWESPMPVVSVGNLSAG
GTGKTPMVDWVVKYYLSIGFKPAIISRGYKRQSKGVQLVSDGNNVLLSSR
EAGDETAMLAWNNPDAIVVVASKRKQGVKLITKRFAQRLPSVIILDDAFQ
HRQIARSLDIVLVNAEEPFVEAAMLPEGRLREPKKNLLRADVVVLNKITD
LEAATPSIKALEEMGRPLVKARLSTGELICFSGDATTLDEPATAHHLNAF
AFAGIAKPESFVTSLQHEGVNVGATRFVRDHAPYSAKMLRAIRRQAEEQG
LCLITTEKDYFRLLGQPELLSIITALPCYYLKIAPDIFDGKALLQEKLNA
VVHYVPKPEPPKKIEEPYRRW
>Cag_1298 conserved hypothetical protein
MALGFHIQQQRRNVAIVLPMLLPLLLLPFSTLLSATPAKKAPFTASPKSS
FTTTILFTAKDSVLYHAEKQTMSLWGNAVMKDAATSVTSPKMVIDLPASR
LTAYGEQASPTIPAKPAIFSDPQGSFNSSAILYDFATRKGTTTALSSNYN
GLQVKGGNVERLENGVLKITDATFTTCLEEEPHYWFESTTMTITPEGKVT
AKPLYMYMRPEIFSARLPAVPVLALPYMKFSTKPSRTSGVLLPSVSRFDN
STALSGMGYYWAIDEYADLRNEGDIAQNGSWRLGERLRYRKRDRFSSEVQ
GEFKQYQTANEWNARVIHNHRFNATTQLDANLAFSGGERRYEVNSMDSQT
LVSEQTAANASMAKSFGDNKALLALTLNGWRDLRNDNGVTNLTASYYQQP
LFLSQPPTMYDEKAWQLGGYTLLKSNQTRWFGDNQNGHTITAALEADYFT
RFNPASQLLLRQGVVVQAEKPVDELSTHNYQRTTLLLPLQANARLFEHLY
LQSGITAIQTLSTDEDESNYATLLLHSAASTRLYGTLATPALTPLLGLSA
LRHTFIPELRFSWNPPFSALPISNDATNSYVWPFDTPFVIGIEPYFGALP
DGQNRIGITLKNILHGKFHSNKQNAADSIAASTDKSVALLTLNVATALNT
ATEDFRWQPLTITASSNALTPSLFISGGTMYDFYSFEPATGNRIARLNSD
EGKSALRFVKGYANISVALQGDVSSKSAPPTQPVVARTEQALYADRFRMS
SFAAVDYSLPWQATFSLYSESDKSNPLQAVSTTLLHSAVRVALSPTWQAG
FTTGYDLDRKTMVFPLVQAYHDLHCWQIGMQWVPSGEFKGYALSVGLKNF
PAF
>Cag_1407 peptidase, M50 family
MAGEGTTYSESWHLVANLHLKLRSTVEVQRQFYRGEKWYVLKEPFTNRFF
RLQPAAYEFIMRLSPEQSVEEVWLNCLKEFPDQAPGQGDVVSMLGQLYAM
NLLETDVSPDTRQMFARYSQNRRKEKFAQLLNILFIRLPLFDPEPLLQRL
SWVIHRVISIPGAIVWFTALLVAAKLLTDNSAGVFDSAQGVLAPGNLFLL
YIALVFVKSVHEFGHATVCHRFGGEVHTMGVMFMIFTPLPYMDATSSWGF
RSRWERALVGAAGMISELFIAAFAAIYWALSGEGALHSLAYNVMFVASIS
TLLFNINPLLRLDGYYILSDLIDIPNLHNRAVEHLKYLLERYVFHYSGAT
PVAKNRREEIELTLYGVLSSIYKVVIFLGITLFVADKWLILGTLLAITGV
VTGFFLPLYRFVRYLFFDARLYSCRPRVASATALFLLIFLVVTGLLPFPR
NVSAPGIVESYPDVKVVNLSAGSVVEYLVTPGQRVVAGQPLVRLENPELD
FALASAESRVTEIRVRERLALGKRIADLKPVQQELAAVEAELAKLQRDKQ
ELVVRARAAGQWIPATRESIAGMWVERGRYLGSIVGGSRFRFAAVVPQEE
ASELFRAPVAHIEVKLWGEAFRSIRTESVTLVPYQHEKLPSAALGQQAGG
SVAVVQNSRQEADNAAEPFFLINALLEEQSGVHLLHGRSGTIRMSLSPEP
LLVQWSRRIYQLFQKRYQVS
>Cag_1863 glycosyl transferase, group 1 family protein
MKPYKLLWFSEIQWDFLSTRKQRLLARFPDEWHILFIEPFTLGRKHHWLP
VKRGRVWVVTVPFLKTIPFRFGALLKRPLVRTLAGLPGIAIMHLWTLLLG
FSSSQRIIALSNPYWGKVASHLPCRFRCYDANDDHLAFPSTPSWLPDWLQ
RYLSTTSLVFSVSKELTARLPLSSSTKVVELGNGVEFNHFATPRQNKPSQ
LAALSGKILGYAGAMDWLDVDLLEKVAQTYHQYHLVLLGPAYEHGWMERQ
LGLQALPNVHYFGKIEYSELPAWVQAFSVALMPLVANPLKQVSHPNKLYE
YLATGVPVVAMNYCSAVEAAADVVHVAQSYEEFVQLVPIALADNRREARQ
AFAKQHSWDALAATMVHELQHAWQESAP
>Cag_0041 outer membrane protein, putative
MKQNISFHKKISATSLALLLATSSMSYAVEPTSSPSTAFAAPSVTPLTPL
TLAQALQKMQAHYPALHAASEEVMAADARVRQSKSSFLPQVTANAGYLWR
DPVSEMSFGGGTPMQFMPHNNYHATVSAEAILFDFGKRSRELALAQSGTR
TAEEQVALSRREAAWQVVQLFYGILFLQEEQRVQQKEFQALNKALEFTTK
RYQAGTATSFDLATTKARLAALQSRMADSAHALERSEMHFCRLTEMNATQ
PLALQGSLMASVAPSSNQAQLTEQALKNRVETRLAREAEAAAGQRQALAS
KGGAPQLRGNVAYGVANGYQPDIDEIRTTLSAGVTLDVPIFSGFRTTARQ
QESAAALRAATQRRLDAEAQAATEVAELLNALQHNGEKLNATAMQAEQAS
LAASHARARYENGMATTLDLLDTEAALSQAELARLQAAYAVTLNRYALQR
ATGEVFW
>Cag_0627 outer membrane efflux protein, putative
MLHHKKRGVVTFTGKQLLVVVLLFLLPFAGVQGAENSVVKSGNAVTLEEA
LQIGLQRNRTLEVARLDRDIAHQKIRETWADVLPKLTLSGTYTRSLKPSV
LLLPPNPLFPSGELQTSSDNAAFVGLDLRQPLFNASAMAGIRAANIVRSL
SDASYRKTEMAVLTDIKLAYYDVLIAREQVKLIEQSIARWEQSRRDTRAL
FRQGIAADIDTLKAFLSVENLRPDFIQAESRVASAMTTLKNLMGVPADSA
IVLSGKLELPSGTKASYPATTELAAREAFEQRPDLRQIALQADAEAENVN
SLKAERYPLLSLFGKLEAQTSFNDGINPSESRWPVSSSAGVQLSLPLFTG
YRTSARIEQATLSRRQTLTRLEEQKASVRAELETALLHLHEAQQRIEVQS
KTIAVAERSYTISRLRFREGIGSRLELSDAELQLVKARTNYLQAVYDYLV
ATTRLDKSLGRRSALLPLTR
>Cag_0972 Secretion protein HlyD
MKRRTFIVIIAGVLASVVGGYLVLHQEKQEAPKIAVQAERPIPVSIVAVA
AATVSDTLNLVGEIQAIREADIVAEVEGQVRRIAVEPGERKEKGALLVAL
DDEVAAARKKKAELHYRQAARTAERYSALYRDGAVSLSAYEAMELQREEA
QAELVSATKHMQNAAIRAPFGGVVTSRLVSEGELVRVGTKVAHMADFSRT
KVVLYAPERTLPLFVLGKSVLVSSNLFPDKRFSGKVSAISDRAERAHNYR
IEVLLNNESRSIPFRSGMFARVLVPSEGERQALVVPRRALVNGMRNPAIF
VVRNGRAWFTPIIAGMEMPREFEVLGGLVAGDSVIVSGQQELRDKARVEV
VHR
>Cag_0212 glycosyl transferase
MLFLLFQVAIAFALLVFLAIGVANRYELGRLRHAALQSRVPFVSILVPAR
NEAHNIERCINSLLQQRYESFEVLVLDDGSTDATPTLLAELAQHAGGVLQ
VLQGDPLPQGWHGKAWACQQLGEAAHGDLLLFTDADTVHHPTALARSVAA
LQASQASMLSMTPLQTMHSWWEKIVVPLVYVVLMNFLPLRFVRTTSIPAF
SFANGQFILIERTMYRQLNGHAAVRQQLVEDVWLCMAVKKAGGRVVAING
VDLVSCRMYRSGKEVWEGFSKNIFAGLGYYHSALFGLLALIALFYIIPIA
LLTTSVVQANYSATHFWLPLVQVVLAFANRWLVAFTFHQSRFMVFFHPLT
MVAFFAIACNSWYWIVSGKGAGWKGRRYQFTE
>Cag_0590 Penicillin-binding protein 1A
MKSLLYKSLFLLLAYVLSVSATAPSAYAMRSLLGLPSVEELENPNPELAS
LVYSEDGVLIHKYFNKNRTFVPLRSIPRSTRYALIATEDAEFYNHWGVNV
RRVFVAMGENLFRAPKRWHGASTITQQLAKNLYLTQERTFSRKFKELITA
IELERTYTKDEILALYFNTVYFGAGAYGIESAAQTYFGKSASQLTLPESA
TLIATLKNPTAYNPAKNPAGSISRRNLILGLMEKNKFITPQQAAKAKRTP
LTLKYTPLNQQGLAPYFAEYIRQTIKPATILGDLNLYRDGLTVRTTLDSR
MQKYAQQAAVEHLASLQAAFDRSWRWPENLKNQIIRESERYKELVGSGMS
DGQAMARLKADNVWLHNILREKTRIQVALVAIDPNNGHVKAWVGGNSLSP
DEYKYQFDHVWQARRQPGSTFKPFVYTAAIDKGLPANFQVLDQPLQLSSG
DGIWSPRNSDGSSGGMTTLRSALTRSLNQVTVRLAYEHLSPAEIISYAKR
MGINSPMPNDLSIALGTAAVSPLELAGAFTPFANNGIWSEPISILKVEDK
HKRFITSQKPNSRFAIDSTTNYVMVSMLRDVINRGTGASVRSYGFTAEAA
GKTGTTQNMKDAWFAGFTPQLVAVVWTGFDDERIKFTSMEYGQGARAALP
IWAKFMQRCYSDPTLKLGSRYFHIPETVIAVPTSSAQNNMAADLLGGNVS
FEYFTPKGFEYYQSHPELAISSPPPMAPIDSSSNGVSAVTMPALKPVAPV
PSAAKPKVEKPH
>Cag_0025 Large-conductance mechanosensitive channel
MSVMKEFKEFAVKGNVVDMAVGIIIGGAFGAIVNNLVSQVILPPLGLLIG
GVDFSSLYIILKEGATQAAPYNSLAEATAAGAVTLNYGVFLNSVFSFVIM
AFAVFLLVKSINMLRRTEGSKSAVPAPSTTKECPYCLSSVPLKATRCPQC
TSELK
>Cag_1477 glycosyl transferase
MLKILCLHIAHKQASYRYRVEQFLPYWKQYGIEFEPVCIVGKNYFEKLQL
ALSSNKYDYVWLQRKLLSPFFINLITKRSKLIYDYDDALYSIESQRNNKP
KPTHPGSKQSIERLNYILKRASLVFAGSEALYNYSARYNASATFLIPTAF
PAQSNISLPSKINNNSVTIGWIGSIQNLFFLSIIDDVTAAIQQRYPDVRF
SVMSGKPPEGLKTHWDFVAWSKEGEDAWLRSIDVGIMPLVDDEWSRGKCA
FKLLQYMAYGKPVIASAVGANYAAVLHGESGFLAKTLDEWRSAFEIMITN
RALSFSMGQASLNHFLLHYELRHVQNKIVSLLQ
>Cag_0517 UDP-glucose 4-epimerase
MKILVIGGAGYIGSHVARAFLDKGYEVTVFDNLSTGMRENLFAEARFVHG
DILHPAQLHAVMAEGFDGCIYLAALKAAGQSMLHPDAYAEANIGGAINIL
NQAAATGLGTIIFSSSAAVYGSPNYLPIDEAHPTAPENFYGYTKLAIEQL
LAWYDKLKNIRYAAIRYFNAAGYDPDGRVKGLELNPENLLPIVMEVAAGI
RPKLNIYGNDYITRDGSCIRDYVHVSDLATAHVSAFEYIQRTKQSLTVNL
GSEQGVSVLEMVERARAITGRPIPADIVERRAGDPANLVASSSKARELLG
WVPQYSDVDTLIASTWQMYQRFVKK
>Cag_1294 glycosyl transferase
MRFMKIGIACHHTYGGSGAIATELGKALAARGHHVHFFGKEAPFKLGAFV
RNIFYHEVEVMHYPLFDSPFYSLALASKIAEVAFYEKLDIVHAHYAIPHA
LSAMLALQMLEDKCAAAHCFKLVTTLHGTDITVVGADRSMQDVVRLAINK
SHGVTAVSHFLKEETVRMFQPKRDIAVIHNFVDTQLFSRMAAGDIREQLG
LGSEKIVIHISNFRPVKCIGDIIAIFQAIATESNATLLLVGDGPERSEAE
LLVRQLGLCSRVRFLGKLLDIVPLLSLADVMLMPSNVESFGLAALEAMAC
GVPIIATNVGGFPEFIESGKHGYLLPPGDVAAMTEKALHLLNNPDEWQRI
SMACVKQAACFNVSRLVEQYEKYYTKLMG
>Cag_1198 RND efflux system, outer membrane lipoprotein, NodT
MKRHPNETGENDIVPQITWFPTIMPHTKKKTRLLLAATQLAIATTIVGCS
APKESMPPDVAMPDAYRGAATVAAPSDSTIAQMPYQNFFADTALTALIEQ
TLAHNADLQSALKNIELAEQTLDAAKVVWLPSLNLSAQTIRNESSEHGVR
RTPKEFTAAVSASWEVDVWGKIKNRKQSVLANYLKSQEAVKALKTRLVAD
VASGYYNLLMLDEQLAVARKNLALADKTLAMMQLQYQAGQFTHLAIRQQE
AARQQLAATIPQIEQAVAVQENALSVLSGSMPNAITRNPSLLQVKPTNTF
AVGIPAAMLHNRPDVQAAEFALKAATADMKESGAAFYPSFTITAQKGVSA
LQSSDWFNVPSSLFSVVQGTMLQPIFQRGQLEVTYKQSQVKRDQAALAFR
QSVVKAVAEVSDALVRIEKLQTQEQLAEERVATLQQAVRNADMLFRSGLA
TYVEVMSVQSNAHNAELTLADLRRQRLTATAELYRALGGGWR
>Cag_1991 Peptidase S41A, C-terminal protease
MFPQRESKPRHKQSQRNGWRIIQRMATALLALSLPTTTLAYPQAESQSFA
VVSSIELLSEVYRELAAGYVEPLDTALLMKTGIRGMLRSLDPYTTLLERD
DADELADITRGRYVGIGISLATLEKKLYVTAVNEESPAAAAGIRTGDAIL
AINEAKVANIAVDSLRTLLHGTNGSPITFQLERRGSAPRTTTVQRQSVPL
KSVPYYELHNNIGYIALDGFTTRSPHEVRSAWQSLQQQATANKQPLRGLI
VDLRDNSGGLLDAALEITSLFVPNGSEVVSIKGRSTHSHSTLKTTTEPLD
ATLPVALLINGDTASAAEIVAGALQDVDRAIILGERSYGKGLVQSVKKLS
YGNTLKFTTAKYYTPSGRLIQKELKKESSPHSTNADSKQALASAVPDTTQ
RFYTRNHRIVYGGGGIMPDVEIKEPASPYVTALRKRGMIFLFANEWYATH
SDDAPASSALLPSQTELLAHFEKFLQQKEFRYTSNAAKRLEELKSAMKES
GRENPEALRTMEREVELADTEERNREAKQVAVALESAILRHASEHLARQA
ELRHDALVLQAEELLIYPARYRAMLKASSTRK
>Cag_1132 acylneuraminate cytidylyltransferase
MQTVAIIPARGGSKGLKYKNIYPVAGKPLLAWTIEQARASQFVDKVFVST
DSEDIADIAKEYGAEVIERPADIAGDKATSESAILHALNVIQAEHHITVS
AVVFLQATSPLRKQGDIDGAIELFRRENADSLISVTKADDLTIWEQRKSG
EWASVNFDYRNRGMRQDRPAQFIENGSIYMFTPETLHRFNNRIGEKLVAY
EMEFWQTWEIDTLNEIELVEFYMKRKGLM
>Cag_0122 LipD protein, putative
MKLSLADALSRAREQNYTVKAARSRIAQAEGQITQSRQSLLPKVTLSETF
MVTNDPGAALVYKLQHNTIEQSDFMPSKLNNADVIDDFHTSVQVMQPIYN
ADAKKGRSMALVAKKGQEFMAERTAETIALHVSKAYYGLLLARKNSEAID
GSLAIMQGYNAETARGFNVGMLSRSDKLSTEVRLAELQEQKMMMEDEIKN
ATDALRVLLNLDPTVTIVPTTDLNVDGSMPSVKDGGALEQRSDLQAMEVF
RQVASLQAEMADASRLPRVNAFAQGNLHGATPLEGGSSWALGVNVQWNIL
DAKVSEGQMQEAKAKKLEAMYSYEAAKSSGTAEINRALRSLKTAKARLAI
ASKSLEGAKVSFDHIGKQYKTGMAMTMELLMREQAFTYAKMRLNQAAFDY
NVAKSELEYYKGN
>Cag_1472 glycosyl transferase
MNILFMNSARTWGGTEKWTHMAAESLAHEHKVALVYRKNVVGDRFTTSKF
QLPCLSHVDVYTLYQLTRIIRQEKIEVIIPTKRKDYLLAGIASRICGISN
ILRLGIVRPLKLPIIHKLMYHSLVDAVIVNAQQIKTTLLQSPFMVADKIY
VIRNGLDTTQLNKKSQPIAPKIFDFQISTVGILTKRKGHDFLLRGFAQFI
NQEPNANAGIVIIGDGVLKNELQELVEKLHLTQHVHFTGFVENPYPLMAA
SDVIAMLSTNEGISNALLEGMYLENVPISTFVGGTTEFIQDGKNGFLIDY
GNENKLATTLLTIYNNNILKENISLAAKSTILTQFSLTRMTQTLTQLCKT
TIKNKAAQHAAYN
>Cag_0615 Outer membrane protein-like
MSQQIFLTINQLVIMKQHQNISTMGGKIIAIALLAPLFGFSQPSTSKAAE
GDSAPSQATLAPAISAADMQASPSVQTAPTIVAAPQVPTASGLRLQQFLA
SVVDNNDEIKVQKLEWLSNERLLKASRGMYEPVLKVSATRESNHMQNTAQ
EYLQTYSQHYEFSEANNIWSSSIEGLTPFGSTYRLGYDYKKLQNSLQSAM
AVPTDEEYVTFLGLTLTQPLLKGSGQEATNANIRISRANADIAYEGYRQA
SVEAVARAVQLYWQCYGAQEKLAMRQRSATIAEELLQANKSRYEAGKVDY
TAVLDAESGLRLRQALVAAAEQTELTSRKNLLSLAGESAMAQVPATIRME
DVPDCSPLSPDYKQVYEKALTSYPQYLSALATVERENFRATYAHNQEKPQ
LDVKGSYGYNGLGTTVDNSLDRLGSTDFPSWSVGLELTFPLIGDMKSRNE
ATAARLKKEQAIRRLEMQKIELSNQMDIVAGLVSRVYSQVQNYEKVVAIN
AELVRIEDTRFKLGKSDTRMLLEREEEYLKVSESLLDSRLAYQYALVNLY
ALEGSLLTRYGLTLSDKTSATTLTQGM
>Cag_0949 D-alanine-D-alanine ligase and related ATP-grasp enzymes-like
MIKLTHQHTYSGTNIYSTESAIVVLFEIEKEELKIAKNNILSVKHSLSNL
FESYVLKDFVTELELGDFIVSFAHILLTEVRGFINTAKSFKNDDQVVLIL
GYHVPKVSLLALKVALNIYINIEKLNSSDLNKILIDFWDICRKYHPDYQA
GILMEACYKKNIPVLPFINGTKFWQYGWGEKSRIFMESRSNTDGSIAHTL
SNDKPITKAVFNSLGVPTPKDVVIKSSNELETAIAAIGYPCVLKPTNTGG
GKGVIANIRNFTQLLNAFTYARQFTKDAIMVERFICGDDYRLMVIDGKFV
AAIKREPAKVIGNGKSAIRELIQQLNSKRSINLRKSNYLRPILIDKILLE
QLAKQEMSLDNILSAERHISLRSNANLSTGGTCTDVTHLVHPTITQCVEQ
LSVTTGFGTAGFDYITTDISSSLQKSGGAFIEMNTTPGLDVTIAAGWSVE
KIGSITLGDTVGRIPIHLHISNTPLDLYKLPINFDSHLARVSENNVCIGQ
CCYHIDDLQPWAAVKSVLRNKTVQVVEIFCTVSEIIKHGMPVDYVNNIFI
SNIDIPENWMNVLKEHASEIIFH
>Cag_0546 hypothetical protein
MKKALLTALLFGMAAVPAQQLHANGFNYNYVEGQYVKSSMNNVDGSGYAI
TGSVALHDNVALNAGYSNDSYDYDIDTNGYNVGLTYHVPVADSTDILFNA
SLEQAEYSQPLIGSDDDTGYSIGVGIRHKVASAVELNASVYNVSIGEDSA
FGVDAAVLVEVSKNFYLGVEYGTSEDIDAIGFGVRAGF
>Cag_1409 Membrane-fusion protein-like
MKKNIGLALGGAVLFIMILLFFFNPFSASDKSTTLVTVQRRDAVPPPNST
LHDSTSYAHATEEAGFISGIVEPYNDATIGLVVQGKIASIWVPEGRRVGR
GGIILTLEKSQEELEVARRKIIWQNQAELKSAEAVVTTLTETLRLNRKLY
DETRSVSREELDKLSLQWENAVAERDRLRNQKLREKVEYEIAAKTLDSRL
LKAPFSGTVEKVFLKVGEIYQPGQPLVRLIDADRCVLTANIDDTKNYKFK
QGMTVELHVMEGSSEVIRKGTITKVPIAVDPASGVMQVKAFFDNRDGRIK
PGTTAKMRVPQE
>Cag_0987 Secretion protein HlyD
MSKKKNSINRKTLWLLLAALVIGSGSLIFWLRSREKPIEITTEKAFEKEV
VHLITATGTIQPELVVAMSPDVSGEIIELPIKDGEEVKQGALLFKIQPDI
YVNQVQQSQAQLNAALSQSAEANARKLKAEDDFRKANLLYKEKLISQTDY
LASKTNAEAATAAYKASLFAVDQQKSLLVQNRDRMNKTTVRAPINGTIIA
LNSKAGERVVGTGQFPGTEVLRLANLDSMQVEVEVNENDIVKVTQGNPVT
ITVDAFGDRKFSGVVREISNSAIAKAAGTQEEVTNFAVKIRILNHNRLLK
PGMSATADIEAERISKALVVPIQSVTIRGASKPQHDEGEQKGVSVAPSGK
ASEGQQGVFVVSKNKAWFRPVTTGTTDNTHIIVTSGIRKGEEVVSGSYSA
ITNQLKNGSLIKRLTPETP
>Cag_0176 conserved hypothetical protein
MKKVWRTIALFLFCASPLHGEEVATVPTPPSIQKATQPHPYVNTIRISGN
KAISEEELRLIISTRAHKSFFGSGLFGGAKKAFNAEEFERDIFLIKKLYT
YKGYFAAEVDTTITRLSKGKKVNLAIRIKEHQPARIDTLRYFGLEKVPER
LEKKFLAESRLQVGKIFSVEQLIEERDRSLNFFREQGYTFFHEDSIRVKV
DTVGTHAGIAMNFHLPERLQYAPIHAVVQNSRRNEKKPREQTFQLEGIHG
KVIGRQRINPELITTAVAFRQGQYTSQSKEQRTLQNLGATNVFSSLSITP
DSVRSGLLYTTISLEAAPKHELAPKILVDNRYGPLFFGGSMAYENKNLLG
RGEQLRLSANYGTQIEKKSSLLSNLAPSEYDAFNPYDFSVKSTLVRPVSK
QNGNYYSTTIEYATTKQPVLLSNRNALIRATYNAKLGSTSRLNFDFFDVE
WVQKDSLRGFKPLFQKELATNIGIDPTNDAAINAGIDSLVSTHFNQTFRL
RYQSKSKPRSETQIGRTLWNTDLLLEESGSLAWLVDRYLDTSRRNGFTSN
DPQIFGTAYSQYLKLESGVSFVNLSTTNSQFAGRIRAGWMAPYGKATATP
EERRFYAGGANDLRGWIFGTLGPGKNRSEATANFGANIKLTTSLEYRLKF
FRFLNQPSGITFFTDAGNIWDNDGTYGFNSKSLFRDMAWDAGAGLRLGSP
IGPLRFDIAYRLHDPTQAHPWQLKHLNGSDYTFTFGIGEAF
>Cag_0178 Membrane proteins related to metalloendopeptidases-like
MALCSQQDFPIFAKIKSIAYHYLFSLSMNKRVVIPAKAGIHTCNKCNGLP
RRLRLLNIPLHWRGGRRSLTGWLTKSAMDCFTSFAMTVWGTSVVTERRRL
LSIMVALGSIVPIHKAATLYGAEKQNLQAALLPEQAGQMIEELVFALEPE
EGAATEKGELTDNVAPTSSLFASIPNIKPVYGTLSSLFGMRMHPIYNMPL
FHSGIDIAAPIGTKVHATGDGIVAFVGNSKGYGQKITINHGYGYKTIYAH
LSKMVVQQGDNVRRGDTIGLSGNSGTSTGAHLHYEVLRYNQRLDPSAFYF
EEHGARKFTAIQSKQSPKDNS
>Cag_1115 3-deoxy-D-manno-octulosonic-acid transferase
MPISCHHMQSFPLYQTLFPLLAGVAQGVKALHPQIAAFFEVRQHLFTTLQ
QQLATMPNNGFRLWVHAASVGEFEQARPIIAALQARHPNLRLFISFLSPS
GYNARKNFPNAAAVFYLPLDTAANARKLVALLKPDALLLMRYDFWPNHLL
AAKKYGTTLVLAAAVLQPQSAYFNPLLRRFYKKLFHLFNAIYTVAERDTQ
AFKEHFGYRNAITAGDPRFDQVVARSRNRAAVANLRAHYEGRKVLVAGSV
WEADEQLLIAAWQELNPRPSLIVVPHQTEPEKIAHLCSLLDERNLSYARI
STFPESFQPEQQILIIDQIGYLAELYSIASIAYVGGGFGVNVHNTLEPAV
YAIPVLFGPNHHNSPEAAALLEAGGATVVQQQSELHAALQCLCSNESERQ
RQGSAAGTFVQARTGATAMVVEYLEGVANVVKWQGS
>Cag_0447 conserved hypothetical protein
MRYLFVHQNFPGQFKFLAPTLAANKSNKVVALCMKPQAPTIWQGVEVRSY
SANRGTTKGVHPWVSDFETKTIRAEACFMAAQQLKAEGFTPDVIIAHPGW
GESMFLKEVWPHAKLGIYCEFYYHPEGADVGFDPEFPPKSESDRCRLRLK
NLNNIVHFQIADAGLSPTHWQASTFPEPFRSRITVAHDGIDTTLLSSNVA
VRLTLNNSLTLTRKDEVITFVNRNLEPYRGYHVFMRALPELLQQRPNARV
LLVGGDKVSYGAKPEGEESWKEHFIAEVRPRISDADWARVHFLGTIPYNI
FVQLLQLSTVHIYLTYPFVLSWSLLEAMSIGCAIVASNTKPLLEAIHHNE
TGQLVDFFDEKGLVENICELLDNTNERARLGANARRFAQATYDLRTICLP
QQLAWVESLSKK
>Cag_0821 UDP-glucose/GDP-mannose dehydrogenase family protein
MKITIFGSGYVGLVTGACFAKVGNDVLCVDIDENKINRLRKGEIPIYEPG
LDDMVDECIKAGRLHFTTNIQEGVEFGLYQMIAVGTPPDEDGSADLSHVL
SVAHSIGSYMQEYRIVINKSTVPVGTADLVRNKIRSVLAERNIVLDFDVV
SNPEFLKEGDAVDDFMKPDRIIVGVDNPRTRELLRFLYAPFNRSHERFIA
MDIRSAELTKYAANSMLATKISFMNEIANIADRVGADVEAVRKGIGSDSR
IGFSFIYPGIGYGGSCFPKDVQALERTATKHGYTARLLQAVEAVNDDQKA
SLVTKIKNHFNGDISGKMFALWGLAFKPNTDDMREAPSRRIIAELLEAGA
TVQAYDPVAIEEARRIYGDSRGIHFAESPEAAAQGADALVVVTEWLLFRS
PDFEMLKRELRSPLIFDGRNIYSPEFMEQEGFTYYSIGRPTRCAQ
>Cag_0871 Secretion protein HlyD
MKAIEKKTVRLLAVIVPLFLLFAFVAFRSGPLAPVPVTVTRVQQQALTPA
LSGIGTVEARYAYRIGPVASGRVLRLLVDVGDTVRAGQVVGEIDPVDLEE
RLLARRAAVLRAEAVVRSAAAKLHDAEARQRFAVGQEQRYTELLAVRAAS
SEQVEAKRQEAEVARATALSSQAALVAAREELAAAKADYQGLEEQRRNLL
LIAPADGLVTNRLVEAGSTVVAGQSVLEIINPQSVWVAARFDQLQSAGLR
AGLPATVRLRSQAGAPLAASVERIEPLADRITEELLAKVVFNVLPNPLPP
IGELCEVSVGLPSLPRTPVVPNASVHRSESHGGKLGVWVVEGSSLRFVEV
RIGATDQMGNVQILSGLKGGEQVVVYSKSALNEGKRITIVKPTSKQGAQG
VQGGLQ
>Cag_1059 conserved hypothetical protein
MDNKKRVFVVGSTGYIGKFVVRELVARGYHVVSFARERSGVGAATTAEQL
RQDLKGSEVRFGDVGNMQSLRANGIRGEHFDVVVSCLTSRNGGIQDSWNI
DYQATRNALDAAKAAGATQFVLLSAICVQKPMLEFQRAKLKFERELQESG
LTWSIVRPTAFFKSIAGQVEAVKNGKPFVMFGNGRLTACKPISEADLARY
IVNCIDDSSMQNRILPIGGPGPAITPLDQGMMLFELLGREPKFKKMPIQM
FDVIIPVLALLGKIFPQFKEKAEFARIGKYYCSESMLVLDPKTGNYNAAI
TPSFGSDTLREFYGRVLKDGLKGQELGEHAMF
>Cag_2018 conserved hypothetical protein
MEKKKVLVAGASGYLGRYVVKAFAEQGYSVRALVRSPKKLAEEGANLEPA
IAGLIDEVILADATNTALFKDACKGVDVVFSCMGLTKPEPNITNEQVDYL
GNKALLDDALQHGVKKFIYISVFNADKMMDVAVVKAHELFVQALQSSTMP
YTVIRPTGFFSDMGMFFSMARSGHMFLLGDGTNHVNPIHGADLAQVCVNA
VEKNEHEINVGGPDTYTFYETMTLAFTVLGKNPWITSVPMWIGDAALFVT
GLFSQQLAGMMAFAVTVSKIDSVAPAHGTHHLVDFYRALAAKQA
>Cag_1379 Peptidoglycan-binding LysM
MLWLPTALPLEAAEPARNNPNALRRSSISDVLDSLVNATYFKDEYFTAPS
REGGVSFPSTFVPQFSDSVYSSRIAALRRKTPMPLVYNAQVKGYIRMYAV
EKRSYTAKILGLTKIYFPLFEEKFDTYNVPLEMKYLAIVESALNPTAVSP
AGAKGLWQFMYGTGKMYGLESSSFIEDRYDPYKSSVAAARHLRDLYQIYG
DWFLALAAYNAGPGNVNKAIRRAGGVKDYWAIWDYLPAETRGYVPAFIAV
HYIMSYHNEHNIRPLEPAYLYRDIDTLRTSRMVTFEQISETLGISASDLE
FLNPQYKIGVIPASTGNGNVIRLPRRYVAQFQRREQEIYAYRSARTMERE
ALYARLESVRAGAGEQSSESSKGMGNQKIHIVQRGETLGSVARLYRTYIS
QLIAWNNLVDADIMVGQRLVVFGGEDNSPVAAPEPPKSTVPPKAPPIERQ
PTAAPEVRAAAPPKRIAVTRSTQTVTRDELVALTETPTVATDNTSAKAEP
IFHVVEPGQTLFAIATQRKVTVNQLMLWNNLKSVQIKAGQKLIVSSDGQS
GRDNSQ
>Cag_1868 glycosyl transferase
MRIVLLSPFPPLKGGIAHCSGALHAALTAAGDAVVVLPFKKLYPSFPSFL
FSALSPPTPSNATLVLYNPLTWLSAVRRIREQKPELLVIAYWSGVLAPLA
LLFCRLSGTRMLLLLHNLTGHEAFWGESFLQRKLLSSVAGVVTLSHTVTR
QVQHVAPSLPTLTLFHPIEKLPAPSFSKLEARKALGLTSNAPVLLFFGYV
RRYKGLDLLLQALPHVVAQEPSLQVVVAGYFYEPLPRYQQIAETLGITHN
VTFHAGYVPSEKNATYFAAADGVVLPYRAATQSGVVPMAFAYGVPVIVTP
VGALSEMVQHGTTGWIAKAASPDAIAAALREWLANRERWSAMRSSIEAMR
DSVSWERFAAECQPFFASLIDKGRR
>Cag_0905 D-alanine--D-alanine ligase
MSRTTVALFFGGQSAEHEISIISAQSIAAHLDTERFTLLPIYITHSGEWL
CDGFARTLLTTNLASKLRGSSREETAAALQQMVRNAAQAPCNRNLAALGV
DVAFLALHGSFGEDGRMQGFLETCGIPYTGCGVLASALTMDKALTKLCVA
DAGIAVAQGTNILSADYLANPNAVEASVEAQVSYPLFVKPASLGSSIGIS
KVHNREELHPALQAACALDWKVVVESTVKGREIEVAVLGNADPIASVCGE
IEPGKEFYDFQDKYMGNSAKLFIPARIPESLQEEVRRSALTAYRALGCSG
MARVDFFVDESTNSVVLNEVNTIPGFTDISMYPQMMEASGISYRNLITRL
LELALEPLRR
>Cag_0009 Peptidase M50, putative membrane-associated zinc metallopeptidase
MDTTFYFIIAIFILVTAHELGHFLTAKLFGMRVEKFYIGFDFWNLRLWSK
QIGETEYGIGLIPLGGYVKISGMVDESFDTDFQGKPPQPWEFRAKPVWKR
LIVLAGGVAMNMLLAAAIFVGVTMSIGESRTSVSTPAYVEQGSVFADMGM
QTGDLIQAVNGKAVESWEEALDPEFFTASTLTYTLLRNGQEVTVTAPSNI
MSLINDQKGLGIRPVMPPLIGEVLPDMPARAAGIQPNSVIVAINGKSVVD
WHEVVGTISANAGKPLQITWKHLAFADGKEPSVADIRASGEMFVATIVPT
EAGKIGMALQQTIASERRKLGIGESLTSGVQQTWKATVMTVQGFGKILTG
KEDLSKSVGGPLKIAEIAGQSARQGVLGFLFFLAMLSISLAVINILPIPA
LDGGQFVLNAIEGIIGRELPFELKMRIQQIGVALLMSFFAFIFINDILNF
FKR
>Cag_0544 hypothetical protein
MKKALLTALLVGMSAAPAQQLFAKGFNYNYVQGQYVKPSMDNVDGGSGFA
ITGSVALNDNFALNASYNDASFDNDIDASGYNVGVTYHMPVADSTDILLN
AAFEQAEASAFGISTDDTGYSIGAGIRQKVASAVELEAGIYNVSIFDDSN
IGFGAAALVDVAKNIALGVSYENLDESNTIGVGVRAGF
>Cag_0606 conserved hypothetical protein
MPLQTQKNSNPAVQLAVLALLIGVSFFATLGATPLFDVDEGAFSEATREM
LISKNYLTTYLNGAPRFDKPILIYWLQLLSVQILGINEVAFRLPSAIASA
VWALLLFFFVKKESDSQQALIASGLLVLSLQVSVIAKAAIADALLNCLLA
LSMFAVMRYYKNNSKTALLTAFAAIGLGTLTKGPIAIIIPLAVTFLFSVL
EGTLKKWFRMVFYLPGIALFCVIALPWYLLEYQDQGMAFIEGFFFKHNIS
RFNTSLEGHSGSLFYYFPVLIVGMMPFTALLFATMWRLPKLLSTPTNRFL
LIWFGFVFIFFSLSGTKLPHYMIYGYTPLFIVMARIVPQLKHPTRLVILP
ALLLVLLAATPMLAEQALPMFDDLYIQSLLIALAQESGMECTIVSLTTLA
ALIIMQLPRKLSIEAKILTTGLFFCLYMNAYLAPLVGSVLQQPIKEAALL
AKQQGYKVVMWKTYNPSFLVYSESLVEKRQPQAGDVVLTTVKHIAKLNVT
TLLYSKHGIVLAKVPY
>Cag_0516 dTDP-glucose 4,6-dehydratase
MHLLITGGAGFIGSHVVRHFLTRYPSYTITNLDKLTYAGNLENLRDVEQL
PNYRFVKGDITDALFIMELFQANHFDGVIHLAAESHVDRSIANPTDFVIT
NVLGTVNLLNAAKASWQGAFESKRFYHISTDEVYGTLGNDGIFTESTPYD
PHSPYSASKASSDHFVRAWHDTYGLPVVISNCSNNYGSFQFPEKLIPLFI
HNIIQQKPLPLYGKGENIRDWLWVVDHAAAIDVIYHKGKLGETYNIGGHN
EWSNLALVRLLCTIMDKKLGRENGSSEKLITFVTDRAGHDLRYAIDSTKL
QRELGWVPSITFEEGLERTVDWYLANGEWLHNVTSGEYQHYYEAMYQGR
>Cag_2022 NLP/P60 family protein
MNFQPFPKNTPWRFPRISGLWLVSTALILLISISGCQSYWVRTKYHIESK
YSSKKRKSHSARLAPNGGYQFAQPLPMRLSQSAMASFLSEVEQLRGTRYR
SGGCSPDGFDCSGLVCYLMKRHFGLLLPRSSADMATHGVMISSTSALQKG
DLLFFSIDGRTIDHVGIVTGWNRFVHAATSGVREDYISSSYYRERFAFGA
RLLIAE
>Cag_0813 Peptidase S41A, C-terminal protease
MSRIVTVALMLVVLLFGIFLGTRISGRRVESSGAGQQKIVEAYNLMRQFY
VDEVGGDSLAGAGIEGMAGLLDPHTVYLEPEKVTYEQAEFEGNFDGIGIE
FDIVNDTLLVVTPLSGGPSAAVGLASGDRIIGIDGVSAIGITQRDVLKKL
RGKQGSMVQLDVFRPLDGKRMDFSVTRGKISTSSIEAAFMVNQQVGYIRL
SRFIATTADEFRSSLQLLKQQGMKRLLLDMRGNPGGFLEQAVAVADEFLS
EGKLIVYTKSRKGSLPDERYEARSGDTFERGDVVVLIDRGSASAAEIVAG
ALQDNKRAVVVGEPSFGKGLVQQQLPFADGSALRLTVARYYTPSGRQIQR
VYRKGVAGREHYFEESMSNISPNKLFDDPDTLLYYENNNVSVYNTSTLPS
LLLSLKGKKGENNRLTDLRDAGGIIPNYWVNARSYSSFYQELYRTGLYDE
VARKLLDDPHSLVQKYRDSLERFMTSYTEEPNFEALLAKACQSKGVRFNR
VALLQDRHAIVLALKGRMAHQLFGSSGQIKFYVKTADPLVRVATSVPLST
R
>Cag_1478 glycosyl transferase
MREYLISVIIAVYNPNAIFLQKAIQSVLNQSFPVLELILVNDGGNEEFRN
LLPTDSRIKVFRKVNEGVALARNYAIQQSQGEYIAFLDQDDYWFPHKLEK
QISMIPSDQPQCFVVSPIQIIDSVGSVVDKNNLAATSLYKNNLSLVNPFL
GLCYGNYIYSSTPLIHKKVFEIVGLFDVAAQPHDDWDMYLRILYAGVPFF
RYTDSALSVWRIHDSNESHKIKAMLLSKCYVEKKLLELNLVAPVREVVTI
NLLFDNVELAHLFYKENNTPEFRFLMKRYLPSLIRVFFVRFNKTFELDKI
LFRRIRKIILKSFRRYIVSFLRCNG
>Cag_0055 N-acetylglucosaminyltransferase, MurG
MKVLFAGGGTGGHLYPAIAMAAELQRLSPNVSVAFVGTKSGIEATEVPRL
GYKLHLVSVRGLKRGFSPKLLIENLRILFDFARSLGITIQLLRSEAPDVV
VGTGGFVSAPLLFAAQLLGKKTLIQEQNAFPGVTTRLLSLFASEVHVAFN
EARRFLLNKRHVFLTGNPARLFQPMDAAQARARFGLQHNRPTLLVFGGSR
GARSLNNAIASQLDTLLASVNLIWQTGSLDGEKLKAEVKPSPYLWMAPYI
EDMEAAYSAADVVVCRAGASSLAELTNLGKVALLVPYPYATADHQRHNAQ
SLVEHGAALMVADSEAFTKLVPTALELLQNSGKRAAMSVAAAKQAHPDAA
GVLAKRVLGLSR
>Cag_0475 Glycosyl transferase, family 19
MPKKLFVLAGEVSGDIHAAGVVAQLLQAHSNVTVFGVGGAHLKKLGATLL
YDTAQMSIMGIVEVVKHAGFLRRVIRELKAAIEREKPFAALLVDYPGMNL
HMAAFLHNLGIPVIYYVAPQAWAWKEGRVKTIRATVDRLLVIFDFEVEFF
RRHGIQTEFVGHPVIEELAGLAVPSRQDILQRHALSPDTRLIGLLPGSRK
QEIAYIFPAMLEAARKVSQTHKVAFLFGRAPNLKADHFRLLEEYGDLTII
ECGAHGVMHASELLLVTSGTATFEALCFGAPMIVLYKTNALNYFIGKRLV
KLHNISLANIVAEGLLSNSRTVPELLQDEATPEAIYQQVSTLLHDGKTLA
AMRAKLLMARAKLASVEPSKRVAAVIAEYL
>Cag_0573 Glutamate racemase
MNTENPIGIFDSGIGGLTVVKAIQAALPSERIIYFGDTARVPYGPKSQVT
IRKYAADDSAILMRYQPKLIIVACNTVSALALDVVEKCCGSVPVLGVLKA
GASLAVQASRNGRIGVIGTQATVCSNAYACAINQQAPDYEVTSKACPLFV
PLAEEGFIEHDATALIAEHYLAPLRTHNIDTLVLGCTHYPILRNVIERTI
GSNIRIIDSAEAVALQAGEMLRERNLLNQSPDKKTPHLLVSDLPQKFSTL
YQLFMGSELPDVELVEV
>Cag_0552 TonB-like
MNKPVADESPFLAYGITVALALLATLWLSAILLQNNAPLFVTDEGAHSST
RSGKMVIRTISLMSNSPDTPATTNTSTNSQTDLTTSTQNNTSSIAPTTPS
TNSAAPTEVAAQQPTVSNQQNGGESNQITRSTISNASSEGGESGTTTTAT
TSSTSDNAIQATACDVMPRFVVAKKPIYPEQARRAGMSGKVYVNVLLSEE
GRPIKAMVVKRQPTECTLFDAVALKAVMESRYSPGIQNGKAVKVWLTVPM
RFELK
>Cag_0651 GDP-L-fucose synthetase
MHSSKIYVAGHRGMVGSAIIRILKEQGYSNIVVRSREELDLTDQAAVRAF
FASELPNEVYLAAAKVGGIHANNTYPAEFIYQNLMMEANVIDAAFRCGVK
KLLFLGSSCIYPRMVPQPMQENALLTGLLEPTNEPYAIAKIAGIKLCESY
NRQYGVSHGVDYRSVMPTNLYGVGDNYHPDNSHVIPALIRRFHEAKVNNS
QAVTIWGTGTPRREFLYVDDMALACVYVMNLDNEVYSKHTEPMLSHINVG
CGYDVTIHELALLIGKLVGFAGNIVFDSSKPNGTPRKLMDSSRLNALGWK
ATVDLEQGLGLAYDDFLRQKLSH
>Cag_0033 hypothetical protein
MLGAWRNITWIFMNSRTNILMPLTNARGLGKVTFFTGLDFPIVMKPSLMG
EPVLTAQAFYFLERLDSVVTASFPSNKVPLPHEIDYIIDNYLFEYSKRHP
DKKITSKITEFVFWQEDPDNAYFSYDWKLTECLVLDTLNDTSIIDCNDPK
RTMGEIFDWCLYKPYFEDALEEYKNKLEEAAKYVANVKTQNHSSLGTGEY
QLPIIRVNSKPLTLAQVNMLEVVSADRKIDLTSDTYEVNRGMKSSTNYFL
PQEVITVNNRHNPQLLAYYFSAVRDYSPISQFKNYYNVLEYFFEEAPNHL
GITAKTEAEQIIAVLKLFIDPVELNKKFNEIDKATLALIEKPQITSSGEN
IAGIDFSVTDILAEYGRHIYQIRNACIHSKKTRKGKSTPRFIPSYDEEKI
LEYEMPILQWIAIQCIEKESII
>Cag_1822 Glucose inhibited division protein
MYLLPNTTTITKSTMPHPQQPHAILEGLCKEQSLALAPEMVDKLVEYGRL
LEEWNNKINLISRKEDAPIIIKHIFHSLLITQHHTFQTGEKVLDLGTGGG
LPGIPLAIIAPQATFLLVDATGKKITACQEMIATLKLTNVTARHLRVEEL
HGETFHTIVSRQVALLNKLCAYGEPLLHEKGKLICLKGGSLEQEIKQSLE
ASQKHHGFPARVEEHPIDEVSPIFSEKKIVIAYR
>Cag_0558 Secretion protein HlyD
MKSFQKFLPRYSIALVVLFSATVLFLLTRANTVDVEVGEVSPSDLVQAIY
ATGFVEADTVAELHTEASGRVIAVGALEGEQVRAGQTIVQLDATRAQLAV
REARAALAEQQAIVNDNRLRFARRQALFREGAISRQELEESERSRLQSEE
LLQQRQLQIGMKAEEARKSSILAPFSGTLTLQSLKVGDYAPANTLVAQVV
DSNGFMVVVEVDELDMPRLRVGQAATVAFDALPDKRYKAVVSRLVPHTDR
ITKTSRLYLTLQELPASIQEGMTATANIVYNVRPQALLVRKNALVDEQRK
SYVWKIEKGALKKVEVELGASDISFVEVLRGVRAGDKVVLSPAKTLRDGM
EAKITSIKKL
>Cag_0579 penicillin tolerance protein LytB
MKIHLDRTSSGFCIGVQGTINVAEDKLQELGQLYSYGDVVHNEVEVKRLE
ALGLVTVDDAGFKQLSNTSVLIRAHGEPPATYTIAAENNLAITDTTCPVV
SRLQRTTRLLFQLGYQIIIYGKRVHPEVIGLNGQCDNCALIIKHADLSDP
KELEGFEPSAKTALISQTTMDIPGFYELKANLEAYVARVNGSAVEPWMAI
RDIDITADMSKVRTMPRYVFKDTICRQVSSRNQKLHDFSLANDVVVFVAG
KKSSNGQVLFNICKAANPRTFFIEDIEEINPEWFAAHEGKAVESVGICGA
TSTPMWHLEKVANYIEATYANSESIIAQ
>Cag_0076 Cell wall hydrolase/autolysin
MHYFSVRYYRWLFSCLLLCAVVLLTPSSAFAASATNSGSLSLSVQRSTFS
YTVRVLTIREGEQQLVDLESMARALRLSFSREPEAIVLKEPFTNSNVRCM
VAAGNPFVAVQPASSGGNPLLVQLQATPVMRQSRLYLPVEQACRLFSLWL
ERDVRYQPSSGRIQAMLKGKAVVPTFLADAKQRQRRATSIAATSNASSRT
STVITGVSVDERANGAIIRFTASGPPATFSLAPPQPDSSGVVQLQFEQTT
PTSRLRFQRFNGALVRSITPQQKSGQPLHFTIVLDSRFQFVTPLEAQYDK
ARNRYELLVRTEANVEEILRREKEQHIAQTLSHDVAKWKLDTIVLDAGHG
GKDPGAIGLRGTQEKDVVLNIVRDLGNFIEQQWSDVRVVYTRSNDAFVPL
HERGRIANKSGGKLFISVHCNASVNRSARGSEVYILGAHKNSAALNVAMM
ENAVIRNEVDYQESYKGFSEEYLIMSSMVQSAFSRQSTLLAQQIIRPVAE
KQEGNNRGVRQAGFMVLWTPSMPSALVEVGYISHPAEELLLRDRQRQKAV
AYAIFKGIERYRKSYESNVMAALN
>Cag_0198 Acyl-(acyl-carrier-protein)--UDP-N- acetylglucosamine O-acyltransferase
MQSFIHPTALVGQGAQLGEGVTVGPYSVIEDDVVIGSGTTIQAHVHINAG
ARIGNNCKIFSGAVLAGEPQDLKFSGEKTLLIVGDRTVIRECVTLNRGTK
ASGQTVIGSDNLFMAYSHVGHDCVIGNHVVVANGVPFGGHCEVGDYVVVG
GLAGVHQFTRIARCAMIGGISRVSLDVPPFVMASGHESFRFEGLNLIGLK
RRGFTTDQITLIRNSYRIIFQSGLLLANAIEKVKAEVPQEPEVVEILEFF
TSGKHGRKFIRQFNQ
>Cag_0868 RND efflux system, outer membrane lipoprotein, NodT
MMLFPRKRESRKILMVGLSTTKRVRTKIIMKKIIWLALPTAIVLAGCSSS
HTLQSPTIALNDRYQQNSAHPQLTEAEGQQQQLVVESVAARWWEAFGSPK
LNRLIEQSLKQNPTLAAAEATLRQAEALANAKYNSTLYPRLDAVGSAQRL
QLNNSRNGVEGGEKRFNLYNGSLSSSYNFDLSGANNRQLDALQAKANYQH
YQLAGARLRLATEVAVTAIRQAQLGAQMEALERLIALGNEQLTINRERLR
LGAIASHELLEVERMVAEQRAALPAMRHAYQQSRHALALLEGSTPDNATL
PTFTLAEFQLPATLSMRIPSQFVRYRPDIQAAEALMMAANAEYGAAAAKA
YPQLTLSASLGSQALTTAALFGSGTAVWSVAGQLVQPLFNPSLGDEKKAA
NAAFEAATAHYRQSILAGLRDVADLLSALYNNAIALAALASGAAFADEQV
ALTEQRYKLGAASYLEVVQAQSEATQLQLELLAARAQRLSNSAVLYQAMG
GGEMLSPSGRE
>Cag_1593 glycosyl transferase
MKPLPLTIIIPTYNEEDGIRNSIEQLLKLIEQEDGVEIIVSDASSDATLS
IVQQLPVRWCQSQKGRAQQMNHAARLASGNILYFLHADTLPPKGFIADIR
QAVQDGKQAGCFQMRFDDEHPLMQFFGWCTRFPALICRGGDQSLFITREL
FEKIGGFTETMELMEDYEIIQRIEAYTSLHILEKCVTTSARKYHQNGILP
LQYHFGMLHLMYASGVSQRDLVAYYHANIR
>Cag_0005 Glucosamine-fructose-6-phosphate aminotransferase, isomerising
MLNSTISVAKQKGRVKELESEANALAPATIGIGHTRWATHGEPNHRNAHP
HLNKAGDIALIHNGIIENYSLLKQELQAEGYTFVSDTDSEVLVHLIDRIW
QRDPSLDLESAVRQTLRHVEGAYGVCVISSREPDKIVVARKGSPLVIGLG
KDEFFIASDGAPIVEHTNKVIYLSDGEMATVTRHGYCIKTIENIEQYKEV
TELDFSLEKIEKGGFEHFMLKEIYEQPTVMHDVMRGRIRADEGKIMLGGI
ADYLDKLKHAKRIIICACGTSWHASLIGEYLIEEFARIPVEVEYASEFRY
RNPIITSDDVVIVVSQSGETADTLAALRTAKEKGALVMGICNVVGSTIAR
ETLCGMYTHAGPEIGVASTKAFTAQVMMLYMLALLLGKGRTIAQSELSLS
LRELAALPEKAARILELDSQIRQIADRYKEARNVLYLGRGYNFPVALEGA
LKLKEISYIHAEGYPAAEMKHGPIALIDEDMPVVIIATRDNTYAKILSNI
EEVRSRKGRVIAIASEGDQEVKRLAEEVIYIPQASNAITPLLAVIPLQLL
SYHIATLRGCNVDRPRNLAKSVTVE
>Cag_1081 D-alanyl-D-alanine carboxypeptidase
MLRIALIPMLRHHFTSAFRYRHRSTIQHSSLFVATLLLLLLQALPLFARS
NDELLVEQSHDRISAYMLKEYGTPSMLMGKNISTPLPPASLTKVLTSIMA
IESGRLLQDVVITRESTLVEPSKAGFTVGERIKLIDLVKAAMVSSSNDAA
FAIGIYLSGSVDAFVDAMNYKARQIGMRNSHFTNPAGYDRGQYAGNVSTA
EDLMRLTEYAVRNSTFNQIARMDRAVFVEQSTRKVYNLRTHNKLLAHYPH
SVGIKTGWTTRAGGCLIARAVKGDKDLLVVMLNAKISRWDTAASMFDLAF
NDRLPTSQFVASNGGQNLEQSERVIKGEQAALLAAAATPALLHAGGKALQ
AKQSGVVSKLSKEKKLSRKDRLALKKQKGKLSKKERLALKKKQKLSKKEK
LALSKKEKKLSRKERLALKKQKGKLSKKERLALKKKQKFSKKEKVAQTKQ
MRKAKRNELLANKVDKKKSKKEF