TitleGenColors Logo

Gene list

Applied filters:

COG category: Secondary metabolites biosynthesis, transport and catabolism
Gene type: CDS
Genomic element: chromosome

Number of genes found: 66

Free access
Sort by:

 



# Geobacter sulfurreducens PCA, PCA

>GSU0632 conserved hypothetical protein
MATRFLSSLKAAVKESGIFPSYLARKRAAARFLRGSGLEIGALHFPLQVP
PGVTVKYVDYVSREENIRKFPELDAASIVPTDYLEDGFTLASIPDCSQDF
VIANHVLEHASNPLQVLSNWARVLRPGGTLFITVPIGSRCFDKGRPLTPL
QHFIDDYELVRDGTSQSLDERNRLHYREWLTISTPNSDARNRHYRNLSGE
ELESLVETMSAGKAEIHFHVFSRESFVTLLDYFTTSIRSDFSVKAVIRSR
GGAESLAILERSPRAGR
>GSU0585 fumarylacetoacetate hydrolase family protein
MKTARLSPSGNEFPIGKILCIGRNYAEHIKELGNETPDAPVVFMKPATAV
IGDGETIIIPSYSRECHHEAELALLIGKGGKDIPPERALEHVAGYGVAID
LTLRDVQAELKKKGLPWEIAKGFDTSCPLSSFVPADRVADPHDLHITLRV
NGEMRQDGSSSLMIHRIPQILSYMSGVFTLEPGDVILTGTPAGVGPLAAG
DAVEAEVVGVAAIRVTVK
>GSU1899 virulence factor Mce family protein
MALSTEKKVGFFFMAGLVVLGVMLELGERWNPFEKNLPYVTYLSSTTGLK
VGDPVRLAGVEVGKITRIDIEDGRVKVGFEVKPGTRIKTDSVATIRLTNL
LGGQFLGISFGTQTADILAPGSEVKSREIANIDIIVDNVSDLTKDARTFL
NDLNTNQNEVLGKISTMLDENRGNLKGAVQNLNSITAKMDRGEGSLAMLL
NDKALYQNTNELATSLKTVTGKIERGEGSLGKLVNEDALYVEAKGALAEL
NAGAKDIKEIAAKINKGEGSVGKLVHDEALYNELRDASKNISDVARKINE
GQGTLGKLVNDDKLYRDTAAAMKKLDKAADGLSDSGPISVLGSVVGTLF
>GSU2510 hypothetical protein
MTRLSACRACGGPLAPWLEGVADPQTGERFHLTRCGRCGLGHTEPHPADM
APYYGAAYHGGRHGVTAAWCARRRIRWVTAACGGDGAGRRLLDVGCGDGT
FLLEAGQRGWRTVGTELNPAAAREAGLDVRGSVDAADDGIPYDCITLWHS
LEHMTDPGRLLTRLAAMLGHGGVLVVAVPDAGGLQCGFFGRHWLHLDVPR
HLYHFNHGSLGRLLAADGCAVVRTMHHEFEYDLMGWLQSALNALLPVPNI
LFAALTGRRRRGEHGALTLISLLLAVLFAPAATLLVIGETLLGRGGTLVM
VAQKTPRT
>GSU1047 conserved hypothetical protein
MNDQVRLHYETWVYPRYPLAASVRRSDTYALSLDALYARFNGTLPPAAAR
RILLAGCGSFSPYPTAVANPGVPVTALDLSAANLRRARLHAWMHGRFGIS
FEQGDLMDPAAAQGEYGFIDSFGVIHHLADPLAGLRALERRLAPGGILRL
MVYSRGARRVVESARRALRFAGVRDVARLKDLLRRSPPGSRLSHAVEASG
EGGGDAGLADALLHPRARIFSIDELMAMVGETGLVPLRFAHSRALAEPGA
EVVRLRGLEQAGEPVPNFVLYLGREPRGGCGLAPDALLMLNPALRHDVGV
LRLAPLAVPPRLGLKNPVLGFADRRFLRRFSTPLPVSALTPTDRERVGPF
LDALFVIPFR
>GSU0630 conserved domain protein
MTITDDFIAKSKYSDLGRYCVTAFVKSVADGLAPGSSLLDAGAGECAYKG
LFRHCDYKAADMAIGDSTWNYDHLDYKAPLDNLPIPDASFDAVLCTQVLE
HLQKPLECVKEMYRVLKPGGRLFLTVPMAQDEHQTPYDYFRYTSYGLKYL
CTAAGFSEVSVVPMGGMFLRWAYELPRALRMFPKARNSGVSGIRLAGVFL
YPLRVALGLVIPVCQAMLLFLDRFDTVRNDPWGWELTARK
>GSU1103 long-chain-fatty-acid--CoA ligase, putative
MAQPLEFTVGGLLDHIAARYPDNDALVYVDRGLRYSYRQFNEVCREVAKG
LLRLGVKKGDHVSIWAYNVPEWVILQFATAKIGAVLVTVNTNYKSAELEY
ILNQSDSSTLFLVKSFKDTDYVATVNEVVPELAGSEAGALSSPKLPFLRN
VVFIGSETPAGMLNFEAIAAMGQDVSDAELAAVEATLDRHDVINMQYTSG
TTGFPKGVMLTHFNIINNGFNIGECMKFTEKDRLCIPVPFFHCFGCVLGV
MACVTHGTTMVPVEIFDPLSVLRTIEKERCTAVHGVPTMFIAELEHPDFP
KFDLTSLRTGIMAGSNCPIEVMKKVISQMNASEITIAYGQTESSPVITQT
RTDDAIELRVATVGRALPDVEVKIVDIETGAELPPGKQGELCTRGYLVMK
GYYKMPEETARAIDADGWLHTGDLAVMDENGYCKITGRIKNMIIRGGENI
YPREIEEFLYTHPKISDVQIYGVPDRKYGEQVMAAVILKKGDTMTEEDVR
DFCRGKIANYKIPKYVKFVDSYPMTASGKIQKFKLREMAIKELGLEGPGE
TA
>GSU0925 ABC transporter, ATP-binding protein
MSGGEQGIRMDKVSYSVGGRTILTEFDLFLAPGVNRTILGRSGAGKTTIL
RLMLGLIRPQAGTISIDGIPIGSLSESELIRLRKQIAIVFQGGALFDSLT
VGENVGYRLLEEGRLGEGEIERIVLEKLSFVGLEHSIDLYPAELSGGMKK
RVAIARALAAEPRYIFFDEPTTGLDPVGVYNIQHLMLRLQGEGKTTLMVT
HDLETAFAVSERFSFLHEARLLFEGTEEEMRRCAVPDIREFLTPSEASLF
RNSD
>GSU1343 isochorismatase family protein
MNYTELLNPQNSAVIFIDFQPQMTFGVANIDRQTLFNNVILLAKAARIFK
VPTILTTVETKSFSGNMWPQLLDIFPGQEPIERSSMNSWEDAAFVAAVKA
TGRKKLVMAALWTEVCLAFPALGALKAGYDVYAVEDASGGTTLTAHNAAM
RRVEQAGAVPVTSLQVLLEYQRDWAHKETYNDVIAVVKEHCGAYGQGVEY
AYTMVHGAPPSRGGAGH
>GSU1227 ABC transporter, ATP-binding protein
MHGLAPSFFHKEGCDCRVDLTDTRGVSIRIENLNKYFGEKHVLKDVNLAI
NAGETFCIIGPSGTGKSVLLRHIVKLDRPDSGEIFIDGHPVFVNAGKDTP
SDYRYSMVFQSSALFNSLTVGENVGLWLREKRICKEHRIREIIREKLAMV
GLENTEQLKTSELSGGMKKRVAIARSLAMNPDLILYDEPTAELDPVTTDE
LANTILKLKETTKNTTIIVTHDLNFALYVSDRIAMMHDGRIVDVGTPTEI
KASQNPIVRGFIYTTTKGIKGE
>GSU3417 dioxygenase, putative
MNRRDFLKTAGLTGIALGLPGCARSLPFGRDVFPDFGDDARPYLGLATSL
REEHDYEARVEGTIPAGLRGTLYRNGPALFDRGGMRKRTLLDGDGMVQAF
RFGDRGIRYANRFVRTRKFVEEEAAGRFLHPTWSTQAPGGIWTNVWPTER
LLSQAGITVFPWRGRLYAFDESSFPYELDPDTLATVGETTFGLPRDLTTY
SAHGKFDPVTGEWLHFGIRYGPRTFVHLTTFNADGTLRRHRALELPRAIY
MHDWFVTERHVVFHLHPVEIAYWPFLLGIRSMAESLRWRPERGTILMVAE
RDGEAPPRLVETEACYLWHTFNAWEERGKITADFVGYRNPDHFISDDPVI
TAVMLGRRGTYSYPGEVMRYRIDPARGTAAREVLHQGSCEWPRIDERLRC
RPHRTGYMLRCLPGEFFWSIVMGLDPVTGRTDEYSFGRGVYCTEPVFAPR
PDTLAGGPGWLLVELYDSRTRTSSLAILDADRIADGPLALVRLTHHVPFS
YHGWWQPAS
>GSU0570 conserved hypothetical protein
MSDVTNTWNERYDTEEFVYGREPNAFLAGVSAMMPPGDVLCLAEGEGRNA
VFLARQGHRVLAVDASAVGLSKAARLAEEHGVRIETLTTDLADLVIEPGR
WDAIVSIFCHVPPPVRRVLHRQAVAGLRPGGLFVLEAYTPAQLELRTGGP
PTVELLMTLADLREELAGLEFLQAREIERDVVEGRLHTGRGAVVQIVARK
P
>GSU0816 ABC transporter, ATP-binding protein
MIRLVDVHKSFGSQVVLDGLTMEIPEGKITAVIGPSGEGKSVLLKHMIGL
MKPDRGEVFVEGENITTMRRYQMNRVWEKFGMLFQNAALFDSLTVFENVA
FPLEEKTHLSRSEISDRVHDALEHVGLRNVDKKFPDELSGGMKKRVGLAR
ALLLNPRIILFDEPTTGLDPIICRAIHELIRETHERFGYTAVIVSHEIPE
IFDISENVAMLYRGKIIEQGTSEDIRRSEHPVVRQFISGSLEGPIQFI
>GSU0679 tellurite resistance protein-related protein
MGEEQERWDQRYLSEECLLGERPSRLLAEWIDELKRLCPGRRALDIACGE
GRNSIFLARHGFAVTGLDISPVGLDKARRWAAREKLSVDFRLTDLEGYRI
TGTYDLIINFNFLLRDLIPHEVASLAPGGMLIFDTILQSPTAPVPHKKEY
LLQPGELERLFAPFPGTVLYSAEFPDDATPTAKLIFRNTPDAQADR
>GSU0993 conserved hypothetical protein
MDADKGSGTMQVIDDGRCFICGKDNPIGLKAEFVTDPHERRAETRVRIPE
VFQGWQGVTHGGIISALLDEICAQACMASGLQIVTSELRLRYREPVPTGS
EITVIGQVTGERRRLVDVKGWVELDGRVMAEAEVTMFRVG
>GSU1128 conserved hypothetical protein
MYEHLYEEPMTEETALPFELPEWIACAPFEEYLGMTIEEADGGRAVLTMP
FRVKHAQGKGLMHGGAITALADTAVAMAIKSLLPEGSHFVTMEMTLKFHA
PIHGGTVKSVAEAVREDERTIRGTAEVFDGNGIKAATFTSVFRIKRR
>GSU0459 beta-ketoacyl synthase domain protein
MDIAVTGLAAISAAGVGIEPLRETVAQRQCRLTPVPVEVLGEDGYLWGKA
DGFRAADFMPPLKARKFDRCSLLATVAAGMALADAGIDPKGGDPTRIGIA
LGCGFGGIANSVEFLGGYFSRGVEGLVPMLFPNTVANAAASNASIEHGLK
GPNVTQVQRFCSAEAAIQMACRFLEEGRADVMLAGGVDELTPLMMAGFQA
VGQLRTYARSFSEGCGILVLERRDHAERRAARLRGRITGLRTVGMLPAGR
EQETVDRLMPPAAPALVSLSGIAADITALIAPLPAVPTLESGNLIGRSLA
MGGLALASLLLVLPEGARGLHLAASPEGPYHAIDVTGGMRA
>GSU2627 biotin synthesis protein, putative
MIDRRKVRNAFHRGAADYDAYAAVQKRVMERILSLLFAEGVEPARILDVG
AGTGALALRLADRYPSAAITCVDLAHGMARQARDNLGRTMERLVAVADAE
HLPLRDGVFDLVVSTSTFQWLTTLDRAFAEARRVLADDGLFAFALFGDGT
FKELKASYRAALHSVPRGGRDRTHRFFTRDEVRAALARAGFRSVEVFDED
EVEYHPDVPAFLRSVKRIGAGNASPVAGRGLSGRRVMETMMRTYAERFGG
ADGIPATYTVVYGVGKR
>GSU2898 high-molecular-weight cytochrome c
METALVRKIRGMGIRTKIGITVMLVMACFLYQAMFRPLIGDTATNTYYFT
LDSAAVNLGADGYTTTRTSRDGKISMKPGVYTTSRYVTASVSTAEQNMIR
AYGPVYATKQTLTAPSVTIGMRDRNGTTNNMYWKAYVYAYNPKGTANNGV
LLWTSDEKEAHPAVQTPLELTFTNPQPKDVEAGYRLKVVVTCRMASTSSS
ARFYWGNSTNYSYFTVTEAPYVANSVTVNNLSDYYSGQLASVTQGDGAIP
MLKMDLYSNVSGGATWSGGKLDKIGTNTSVYVNEDEPGDVTFSIFKDANG
DGLFQKTDTQVGGPYTFTQLTGQAYELATPQTITTTPQRYFIVYSIARNS
TYGTTVGARVANSSYFAVTGAAGGVVNVTSTSSSTPTIQYGGTAVTKIYA
ADWDEGTTLAGISETGGPSATDTACITSNTTGSGFPLVGLLNYPSHTCAS
VAGRGYSATTAQPDFIRLYFAGAGYHSVMKTIKGGSFVYRVYTPFGGGTV
TLQLFYVTSDGVRVNAPITSRYTTGSSISQTITTSLAGQDFSNVPAGARL
GIQIGVTAGMRIGLGSSVNAQLMVQETAAENENVDVGNGSAIPNATVYAS
DVNKVIDSFTLTAAKAKTVTAVTIKGNATFTGTNIKEVKLYADAGTIGTL
DGSDTLLGSTATISGNTATISGLNLAISTAVRRYLVVVNIGDAPNTNVIL
TAVVDDLTVASTGGIGVDNDSTSATLTILPTTTLSDFIAAEPPNAIIPWN
AGPTKVDAFGLRTNGGVNDTIRNVTVTLSTTSGLPAGKVISDYVGRVDIV
TAAGTSLGHLTAPTMADNWQVPTTGLAATQIPTDYYVAITPKGNQGITFT
VKGRVTSVTHSRTTNALLVNDAGSATILMDEEPPNESSLTAVTGTYHNDT
DRAEVNLSWLGTTDAGGQPVTYKLVRGLGNAPAPRTCTVDNAKTFLAYQG
PATSVVDKGLDEGVNYGYRLCVIDSVNNINAGVTASATAAIKNRCNELPE
LIVNPTASYVKAGTTVKLTIGIKNKDTGVCGPTTFSLVTQGTNIDDSNFT
VAAFEANDFVISTNNGSKYTHLDITAKPGAIEGAVKTFHVKVVKSSGGET
VCPDPIEVVVNKYGTMMHSSLQLGTQKYGKWGVNFTCSTCHSPDATNIKQ
VRNVITTPTGPRPVLFDTISTAINANVAGVFGNDRRSGTASTNVCEVCHH
RARFHQYSAAKVAWKDHNNNGDCLKCHPHSIGFKTKATGQSCDDCHGNPP
TSYEMLVVPPTEVLFPFASNAGSHGKHNARQVTCTACHSNANHLVTATPD
MQLNLGFSVANGTFPGFVGSVTTGTIRTLAPGNDYSWSGAAGTTIQQAPN
TIMTCSVYCHGWEGNGGYNTEPAWTGITQVGCGSCHAATADVPPPSGSHA
KHAGNEPGYGNGIACAKCHGFRNYSTSASHINGNVEWDLAANSTTARYAG
VAAGSTGAKAPTAPGSYGTCSNLYCHSDVQSNNGTGGPTSFATPVWGGST
NCNSCHQADPNTTGGHPQHAGEEVTGFDCRICHANGGSTNSLNHGNSKIN
FMFTGLGENTHYSYSSAKTPGSAPYGTCYNGNCHGARRTLAWEPPNHAVP
LCEKCHTTSPSAAGFYSTSGPGSTTSKTDAYVGAHFQHITSMPFRYSARI
DCSGCHLKPTGPYTPGHIDSALPAEVIFGAIAGSGVQNGYSSAEHQPSYN
YASRECSNVWCHGGGMASNVGAGPYGSAVTDGASLGSPAPAVWNSPYLTG
VGTNDCVKCHAFPPAAPLPGYTHWDDNNNRPFVANQCILCHKHVDNTGYA
FKDPKLHVNGVVDSCNTCHGRPPVDEAGMTIPAVGALTPGMVGAHQAHAL
NPSIGKDCNVCHYQYSQEMPSYDMEMGFNAYGGRVTSGTFYGYSTLSDNY
SPRIVYKSTNAGTVVRRTTNADTLNTCANLYCHGGGTSTRAALQGGSNTR
PNWEGGSSQAACGTCHGVTADTYHATGSHDAHVSTAFGKPRLGCSNCHGV
KENNYHVDGKVEWAFYSTAQRLNQKVANPQYTPAAGNGTAGASGATNGLA
PSTAFGTCAVYCHSDGRGNYASPLPVWGGAPMNCGSCHKNQTSAFTDSHQ
KHSASSANGGYGIDCFICHLGSGSGNPKHVNGDIDVVFNSTVVGVTATYD
SGAKKCFSILCHDTTAVAGPTWGVPSTGTYDGGTHKPTCIGCHSGEVNTR
AAVIPQFGGESHHVQGVQISNTVCYQCHWEANANGTANTTYHTRTAGQPV
NLVIRTTTSRPVAYTEGSTGTAYTSNGTRTELAKLNSNCLGCHNATNAAS
QPFGDGMTPTQYAWDGKSIAERYSVATTTTWAKVTGNNTVAKSLTKAYSA
HGRADLNQRGWTVGNSTTGEVYANTSGTVNVLCYDCHNSHGTSATGIMSS
YSSATGRNMGGILKATSNGIGGYTADYTPYAGGDAVEPNKNAYNPGAALC
FDCHNTASAGATAPWGYGPTASGGTFGSTQAIYGYHDTPYFGSGTFANTQ
TYAYKALNPDNKGGHFGASSSLTTTAAKPINGLCTPCHDPHGVSPSLGAN
QAYAVPLLKGTWVTSPYKQDAAPASKTEARGGGKKRSAMNVGSTPGYRID
QNSMGIAAAATRSQWTFPNNASSQTPSTMQGTTDAQFAGLCTGCHAQADL
NNTAAPATSNWKTMRRVHNTVKGWATASGGNANNKVHAFTCSKCHTPHNA
KLPRLLVTNCLDVKHRGRAASGGSMTGPASQSGSKGAGVGRFPQGGGGTG
DQPLGTAGKWFFGKATQSTSITTNSQTLCHQSATAGGSTYSQDGQLWNTK
SPW
>GSU2792 conserved hypothetical protein
MAEGFKDYFSDTSDAYRTYRPEYPDALFAWLAGLPPRRDAALDCGCGTGQ
ASVVLASYFPRVYAVDPSAGQIASAVPHEGVVYRVAPAEQTGLPGASVDL
VVAAQALHWFDFDRFYPEVRRVGRPGSVFAAFSYGLLSIDADLDRIIGRF
YREVIGRYWPPERAHVDDGYRSIPFPFPEIAAPPFAMEARWELEHLLGYL
ATWSAVREYRQRLGTDPLPELAREVRDAWGIPEEGRTIVWPLALRVGRIA
>GSU0815 mce-related protein
MKRVTLELIVGIFVLVGIICLAWLSVRLGQMELLGGDHYQVSADFDSVSG
LKKGATVEIAGVEVGRVDRIELDSSNDRARVYLRIRDGVKLQDDVIASVR
TSGIIGDKFIKLKPGGADKLLADGGRIRDTESTVDLEELLSKYIHGNVE
>GSU2536 dienelactone hydrolase family protein
MKQSIRAGLMLVVAALVVLAAFPAAAKVRGRVVEYRDGTVTMKGYLAWDP
ALKGKRPGVLVVHEWWGHNEYARKRARMLAGLGYTALAVDMYGEGKQAPH
PDDAAAFAAEVMKNGNLMKDRFMAAMNLLKDQPTVDPGRIGAIGYCFGGA
VVLNMARQGVDLAGVVSFHGSLATDRPAEPGGIKARVLVLNGAADRFVPP
EQVGAFAAEMARSGAEFGFISYAGAKHSFTNPEADAFAKRFGLDVAYDAR
ADRRSWAEMKRFFHLCFSGG
>GSU2290 pyrazinamidase/nicotinamidase, putative
MENDAALLIVDVQNDFCPGGSLAVPEGDTVVPVLNGYISVFRTAGLPIFA
SRDWHPRMTSHFKEHGGQWPVHCVQGSHGAQFHPDLALPGNAIVISKGMD
PERDDYSAFQGTAADGTPLPTLLAARGIRHLYLGGLATDYCVKESALEGI
RHGLIVTVLTDASRGVDLAPGDSERAVQEMMRAGVRMTSLEGIRQEQRLR
E
>GSU0817 conserved hypothetical protein
MLVSSIEKIGALTLFVVREMGKMLIFLTYALVNIVIRPGKPIHIYKQIHF
IGAKSLFVIVLTAAFTGMVLGLQGYYTLAKFGSEGMLGSAVALSLIRELG
PVLSALMVVGRAGSAITAEIGIKKITEQIDALKTMALEPFKYLVSPKILA
ALIALPLLCAIFDVVGIYGGYVVGVKLLGVNPGAYFSEMERSVEWKDVWS
GIVKSFSFGGIIAWVCCYKGYHASHGAEGVSRATTEAVVMSSVLVLIWDY
FLTSVML
>GSU0196 thioesterase family protein
MSETSPQQGAESICPPVDLSGDAGWVPFDAPSLVGESLRFVSGEPDGNRF
RVRYYRDSEQHLHARIWFGPETEGPPGHAHGGAVSAVMDEALGLAAWAAG
YPIVVGNLNISFRTMLPLQKVVTVESRVVSAQGRKVMVHGRLFCGDAVYA
EGECLCITLPGR
>GSU0696 glucose 1-dehydrogenase
MPLKGKVAVVTGGAQGIGKAVVKKLMEKGCAVVMADTDWEAGEETAAGFA
GLGRVLFVPADVGREDDVRVLVERAASHFGRLDILVCNAGVFRSVPLEHC
SLDEWQRLIGTNLTGAFLCAKHAAPFLACHGGSIVTIASTRAFMSEPDTE
AYAASKGGLVALTHALAVSLGPGVRVNCISPGWIETCEWQKASRRRPAAH
SEEDRSQHPAGRVGTPEDVASLAAWLVSPEAGFVTGVNFVVDGGMTRKMV
YV
>GSU0813 conserved hypothetical protein
MKWILLPILFVTLAVSGTALAQPSPTETVKKTVDDVIKIVSDKELKKPQN
EKRRRQEIKRAIGTVFDSAEMAQRAMARHWRDRNAAEKKEFVDLFENLLE
NSYAGKIESYNQEKVVYLKEAIDGDYAEVRSKIVTARRDEYSLDYRLMLK
GGKWVVYDIVIEGVSLVSNYRTQFNKIITNQGYAELVKKLRSKNKEISMP
>GSU1282 hypothetical protein
MKRIVIGIAAAFALCSASWAMGADDARTEPGTCRQIAELTRKYDSGRALP
GDMAVDGRPCLRSEAADCLFGVLEKVLAKCRSEGKEALEPEERALILRLR
EELDAELESRHGYHTLRDEVEAMLAKPDLYDYEYRVGVNGFVRGEGAGSF
RLPDFSQAPGHAEGRFLYRVKPYLFWHPVAWLDLHAEGQGYGFSGGSQDY
GKISLYQGFAEILCPLRDGNSLKAGRQELVYGSSFMLGSDSFYKGLVYDA
VRLRITPLAPLTVDIFGGWYASPWSDGTEGGLAGGYATWNISDGTGLEFY
GFRDSGSAERHDGEHRNSFGVRATAKFGPVSCEVEPVWQTGRLFNGVDAN
ESIRAWGGHADIMIDTDLGGIGNHFFAGAAYGSGSRDAALGVSGRKEFLN
QATDSSLTGDMNVVGDLSGLDVGDYHASGLQIYTLGWGVDLTRDLSLTAT
GRYFLANYVPDGLSRRIGLETDFTLTYAISDALSIIAGYDRFFTGRFFSD
ASGSNDDIHYGYLMFQFDLSHTKPKKALGK
>GSU0549 conserved domain protein
MNQDPTGEYIRTITSHCDLRGAEVLEVGCGAGRITRDLARHAARVVAVDP
DERALAQARAAVTAANVSFLPMPDGVLSFPPASFDIVIYTLSFHHVPLHQ
MDESLATAAGLVRPGGAVVVVEPGEGGSFTEAKERFGAGSGDERPAQEAA
IRAMHALPGWTVGDTVHFRVGFLFTDEDDFISSKLPGFAEQPAAYQQEVR
AFLASHRTPEGIVLDAGRRLNVLRRR
>GSU3168 beta-ketoacyl synthase domain protein
MEQDCESIACGKGSKTRAQKSGTGGRIAVTGFGLVTPMGVTSWRSVSAVI
RNRSHFAWHETVLVADAPDGTALRGATISRVSGEGVRFGLTGSERSLALL
APALREATSGLSPSPGDAVPAWIINGIPSEEVGAIPCPADILPLLSPMEH
LPVGREAGSGRCLFLDRVAEAAKALREGRCPRALVAAVDSLCFLPVLEEL
LAAGRLLSGPNPEGIIPGEAAGAILLEREESARKRGAPIYAFVSSRGHGI
DPAPRTGGRPSQGRGLTEAFFQAFDGLPTTGGEMGLVVADLNGERQRALG
WAVTEERVFGPSHRERQLWLPAFSVGECGAALGVVQTVVAVAALAKDLAN
GEQVALCSSDDGGETRVLCLDQGDFADRHALNRWRRERQAKTNERVT
>GSU0628 conserved domain protein
MHRRAWRFVSFDYCPVCDTRSMIVYSRELERWLADLTASWEAGEDFKRML
AMRENFLCVTCMANSRMRMLARTVLDLCGHATSGDLARRLCSDPLFSVYE
TAAYNIFRIDALKACPRYVVSEYLAPDRFGETIGGVRNESLECLTFPDDS
FDVVINSDVLEHVADLDKSLEEVRRVLKPGGYHVLTVPVDYSLEHTVERA
RMGQGGIEYLKSPVMHGDTVRNTGVLVFRDFGRDAASCLSREGFPCLEMQ
LPGRHGEIISVFIGKKAA
>GSU1222 histone deacetylase/AcuC/AphA family protein
MPARTALIYSNDFARFSYGDDHPFKIQRFILAFELMRAYGLMELPNVKIL
DCPRAAEEALLTFHAPDYLDRLREFSESDDARADFRYGLGDLDNPVFRGL
YDWARLGAGGTIEAARLVAEEGYDIAFNLAGGWHHAHRAKASGFSYLNDA
VVAINLLLEKGLRVAYLDIDAHHGDGVQEAFYDTDRVLTISIHESGMYFF
PGTGFEGETGTGAGTGYSVNIPLVAHADDALFMKAFDEVAFPLLAAYNPD
VLVTQLGADTFRTDPLTRLEVTTHSYTYILRKLKALGIPWVAVGGGGYNL
VNVARAWTLAWGVMNGVELPPRLPDSFVSIIGRLGYPNRMLLDAMHWAQE
DDRNQALDAVERSIAVIRKTIFPVIIGSYGETSGE
>GSU2204 cytochrome c family protein, putative
MRHIGKWLMPLPVLILCAPLALLAEEGHKPFSATMETGGAGLAVSDDISR
VNEYSSVRTEPGINPYGKVDIQIDKGGVELDLNSRYLDSRDQTHGARIDV
KRFFKSSFSYDAFQHWLDHDKLQYLDASIPAAPVAGAFDTTGAILGVPTA
TGTTNLYAYTANPIGPNFAPNFLVTRRSDGARFVTNVAPSDTATYAVQQL
GRASLYGEDMVPNQDFSIVRREWKSNSDFTIPQLPNLTFHFGFREETREG
WEQSIGMSKCTSCHITGQSKQISESTRDLTAGVTGKFGLLTMNYTYLNRQ
FRENGADPVRRYDPALSPGAALPANYYTGANSPPTFDNRMLYDYRDGYLP
YDVTPDSNKDSHVVKAKVDLPRDTTVFASYVKATVDSDKSDDPGYFTLDK
KTLESEYDAWSGKVSTTLWKRLTLTLRGKLEKLQDDDVAITYTPIAYNDG
NAANDIFGFPGATASDILMLQTINRHSSASRDVATLGLDSVYRLAKRTTL
RLSYEFKNEDRDDSFFGTTKTHTVKAALNARPTNTLSARASYTFKAIENP
FQNPTAGLVPFTQSASGYGYLVGNGPTYGVEFYDRRTADLTNRPDTVHEG
SLSTTWSPSPRFSATAFVRVKNESNDLNKTTWKQETYVPGVSLWYAPSDK
VSMTLAYTYLKQTTENSMCQGLYDG
>GSU2400 conserved hypothetical protein
MLKGYTLPRTPRGTSSLAPLPPWHYVGNAIAVEFDAAPTAAAAFLPEGLE
LHSGRCAAYFVEWQYASDTGEEYLDPIRSQYRETIILLSASFEGAPVAYC
PFIWVDQDVSLMRGLVQGWPKQIGSTWITRAYDLPSKAAPVVGPGGRFGA
TLSAKDRRLLEAQVTLREVTETLPSPGFAKAVNTRYFPELVAGKHDSPAV
HELVQLKSRDVRVSPVWKGDAALKIFDHPYLELPDLKPASVLAGYRFSFA
LTVDDLIPLRDLRADTQAADDRATAVE
>GSU3018 conserved hypothetical protein
MGTSGMQCEKKFLDKACSTYKTAGSYQQQAMKDLILRTFLPYMTNNTKSV
CLSLGYAEGYEAKILSECVKELDVIEGSQQFYEQGLEDNIANVTLHYSLF
EDFIPENNKKYDYICANYVLEHVENVTIVLSSLKEMLKDDGLIFSVVPNA
RAFSRQLALHMGLIDDLKGLTENDINHGHRRVYDRVAFNRDLEGAGLSII
AQGGIMLKPLADFQMDKLIDTGILQTEHLNGLYRLGMEYPDFCSALYAIC
KK
>GSU2583 isochorismatase family protein
MNAPYKYTRLSKDDAALLLVDHQAGLISLVQDFSPGEFKNNVLAIAACGK
YFNLPTILTTSFENGPNGPLVPELKEMFPDAPYIARPGNINAWDNEEFVA
AVKKTGRRQLIIAGVVTEVCVAFPALSAVEEGYEVFVITDASGTFNEVTR
HTAWLRMQAAGVQLMNWFGMACELHRDWRNDIEGLGRLFSNHIPNYRNLM
TSYFALMGTK
>GSU0569 isochorismatase family protein
MKRALLVIDVQNEYFTGALPVSYPEGSFPNILAAMDTATANGIPVVVVRH
ASRRPDSATFRPGSPGWELHPEVARRPFDLLLEKNLPGSFTDTNLEAWLR
ERGIDTLVISGYMTQMCCDTTSRQAFHRGFAVEFLADATGTLAFANSAGA
VTAEELHRAVLVTQQLMFATVMTTGEWIGSFR
>GSU1900 transporter, putative
MGMFDRVGKKVLSFHEVLGEMLMLLGRTVWFFREAPRNLPSIFNQMAIIG
YETVPIASVMAFFVGMVLALQTGVELQKYGTQNIIGGIVGLSMVRELGPV
MTSFLVAGRVGSAMAAEIGVMKVYEEIDALKTLDINPVRYLAMPRLIACL
VCVPALVIFANAVGIVGGAIMSHLHPKIFISYSTYYDSLKTALKLKEVGA
GLVKATVFGGIIALVACYTGFKTSGGARGIAQATTRAVVLSFMLILVADY
FLTRILM
>GSU1252 conserved domain protein
MRTRIKPGRIAGFLAQMMLLLAPTVSLAEVCANTIQADVVAIDQQIVHNR
LGAFNPISMIFALRQDVVNAANGLTEAEGGILTPGNVRLRDDKRPRPLVL
RVNEGDCLRISFQNLLAPTPLVLPPDAAGEFAGQPATRHVGMAIMGVSFA
DMVSSGVNVGNNPVSGYAAPGETKVYLVRGEHEGGYHINSIADNVSGEGI
QGQTSFGLFGALNVQKKGAVWYRNQTTNAELLMATTGLTPSGHPIIDYDA
VYPVGHRYAGQPILRILDPVTNKIVHNEVNAIITGPNGGDFPAGTYQRNP
TYPDRDRAFREFTVMFHDEVETVQPFPIFIDPQFKFTLGGVKDGFMINYG
SAAVGTEVVANRSGVGPTWDCAECKFEEFFLSSWAVSDPALLVDVPANTT
DVNGNLIVGPKATKALYPDDPSNVFHAYINDRAKIRNLHFGKEFHVFHLH
AQQWLFSPDDDGSNYLDAQGIGPGGSYTYEIAYNSGNRNKIVGDTIFHCH
FYPHFAQGMWALYRLHDVFEAGTQLDANGLPIATARALPDGEIAAGTPIP
AVVPIPTLPMAPIPGKVRIAQVPGYPGGQVTYDEPDKNAGYPFSLGMKAG
HRPATPPLDLIDDGGLPRHIVRGTNNPLNPSTLDPAITHHEETTLSFDKV
LVAAEAEQVPESGSRSERAAMDFHAREFHPTFLPNGSPGQFVTNGLPPVP
GAPYNDPCRGDKGGAVGVPRTYKGAVIQFDLKMNKVGHHFPQSRIITLWG
DAQATVDGTRPPEPFFIRANSNDCVNFYHTNLVPSVYEQDDYQVKTPTDV
IGQHIHLVKFDVTASDGSANGFNYEDGTMSPDEVRERIHAFNLTGGLIQP
DGVTKVALAAKPHPYFGSTFNGRDITGARTTVQRWYIDNIRNNRGEDRTL
GNVFTHDHFGPSTHQQTGLYASLLTEPQGSRWRDPQTGQFMGGRFDGGPT
SWRADIITTNPAESYREFMVHVADFTLAYEDGACHTVPCVNPAKAIKPPG
MEEIGLPFLFRKPQICPNGTLPPCPEAISAEDPGTFLVNYRNEPVAERVR
LPGTNVQAPGLAGDLAYALSSRVQRANPLLNQQPAFYPPLTPNVLPGDPF
TPLFQVYDNDRVNVRIQAGADEESHTASIHGVKWLQSYGSPNSGFRNSQQ
LGISEQFQLRMPVIPDRLQVGQTADYLYTLNASSDGYWSGIWGLMRSYAV
RQPNLLPLPNNPIGTVPFTAANDASFSGPCPTSALVRSYDVTAVAARTAL
PGGSLVYNSRIGPQGIGPLQDPDAVLYVRTGDLNPDGTLKAGVPVEPLVL
RAAAGECIDVTLRNSLPAVLTESPGYSALHPIVEFFNFNEVRTSSIVGLH
PQLVEYDVTRSDGTVVGNNQDQTVPPGGVRQYRWYAGDVKVVNNMRVATP
IEFGASSLISTDLIKHASRGALGALIIEPLGSSWIEDYPTPHPSVSAGQR
PSRASATVIRANGTTFREQVLVIQDDVALRFGDNTPVPFVTGMEDALDTG
MKAFNYRTEPLWFRLGFSPLNLPFQNQAINFANVLHNSTTGGDPETPVFT
VNAGDETRVRLVQPAGHNRHHTFALYGHVWQREPHTNNSTRLGLNPKSFW
RGSQDVVGPASHWDFLLDHGAGGAFQAKGDYLYRDMLPIHFLNGLWGIMR
VQ
>GSU2414 membrane protein, putative
MTLHGRLLRRQLATSRRQSAIFVLCVALSIVTLVSLAGFGRSVHSSMLRD
ARALHGGDVIVESRSPLSPGLTAAVNRVIAAGRAEGARINEFYSIIRPAG
REDSLLAHIKAVEPGYPFYGTVDLASGRPFRQVLAPGRAIAEQTLLDRLG
LRVGDRLRLGDATLTVADVVTQEPDRPVNVFSLGPRLFVAAADLPSLGLV
GQGSRVSHTIILKVANPRETDRIAAELRASALRDRERVDTYRTAQSGVKR
FFDNFLFFLNLIGIFTLLLAGIGIQSSLAAYLAEQRPSIAVMKALGATGR
FLVVHYVAVASVLGIVGTALGIGASFLLQGVLPELFRGLLPATVEFRIAA
PAVAEGLVLGFLTVTLFTLLPLWQLRAVKPRAILGKEEDPTVRSRAVWVT
TGAVVLFFLAMVLWKVEEPRSGVNFVLGVGGLILLSLACTEAVLRLLRRA
RPRRLAARQALRGLFRPRNATRPIIVTLTAALAVIFAITLVERNLDASYI
RSYPPDAPNLFFIDIQPGQKEDFSRTLGMPALFHAIVRGTVTAINGRPID
REAERRKRGDNLSREFNLTYRDSLLEDERIIEGKSLFRPDWQGVQVSVLD
TVVEMSPMAVGDVITFRVQGVPIEARISSIRTRTRAAVQPYFYFVFQPAV
LRDAPQTFFTAVRVEKPGIAQLQNRIAARFPNVSIFDLTETVAVFARVMG
RLSVIVRFFTLFSVVAGALIIVSSVIATRQARLREAVFFTILGARGRFVL
SVFTMENLIIGGVSGFLALGLAQAASWIVCTRVLDVSWQPFPAVSAALVA
ATVVLVVAVGLGASLSIIRKKPVVFLREQAEE
>GSU2657 spore coat protein A
MTNVRVRIVAVVAVSIGLALFALGGPRKACSQPVPGGTLDPLTIPKFVTP
LVIPPEMPKSTVQPGVPAAAYNIAVRQFKQQILPGGVWNTVNGRSDTFGA
TTVWSYGRAQDKIPVGFIAPAPLSSNISFNYPAFTVENTSGIMTRVRWIN
DLVDAKGNYLPHLLPVDQTLHWANPPATGCIDGTNRTECRTFNTAPYTGP
VPLVTHVHGAHVNASSDGYPEAWWLPAAKNIPAGYAARGTVFDQFDPRNT
VKGSAYFAYENDQPAATLWYHDHTLGITRNNVYAGPAGFWLVRGGANGDA
FVDDGTSAALNDGRLPGPAPRAGMGDPNFNAAIRATIREIPVVIQDRSFN
ADGSLFYPDNRAFFEGLNVSGATPPQFPGAGVLNIPFIPNSDISAIWNPE
VFFNTMVVNGTTWPQLESAPARYRLRLLNGCNSRTLNLTLFTVTGAGPDG
IMGTADDVLGAEIPFYQIGAEQGFLPQVVMIKTGQYTPLPGNGAIPAGLA
APDPMQALLMGPAERADVIIDFTGLADGTVVRMINTAPDAPFGGFPDAPA
DIDTTGQVMQFVVKASLIQPGDALTTPPENLVLPAEASLPATVAVRQLTL
NEEESTRLCVQAQPDGSITTLFVDPMPLPGFLSACAAAGGMPMGPREAKL
GVLVADPMTGMMMSMPMMWADIITEAPVTGTTEIWEIYNLTMDAHPIHPH
LVRFEVVDRQPFDMMTFMPSGPAVPPEPYEQGYKDTVLAYPGQITRVKAT
FDKIGLYVWHCHILEHEDNEMMRPYMVKIDPAFPDVNADGKLAVTDALDL
LKKLKSPLLAGAPYDLTGDGTLDVRDVLALLRTIVFGPPR
>GSU1576 oxidoreductase, short chain dehydrogenase/reductase family
MRRLEGKIALVTGAARGIGEAIARAFATAGAFVYLTDINDAQGAVVAGQI
GAEAAYRRLDVREEADWQCVTTEITGRHGRLDIVVNNAGITGFEDGMVAH
DPEHASLDSWHAVHRTNLDGVFLGCKYAIQEMRRTGTGSIINISSRSGLV
GIPAAAAYASSKAAVRNHTKTVALYCADQGLAIRCNSLHPAAILTPMWEP
LIGPPGAEREQRMKEFVWDTPLRRFGTPEEIAAVALLLASDEVTYMTGSE
ITIDGGILAGSVARPSVD
>GSU0802 oxidoreductase, short chain dehydrogenase/reductase family
MSLNDAKVVVIGGSSGMGLAVAKMAADEGARVVIAGRSEEKLRQAADEIR
QPVETRSLDVTQEQAVQAFFVETGELDHLVVTAATGVAGSFLELETPSFR
QIFDSKFWGQYFAARYGAQRIREGGSITFFSGVAAAKPVDGLSAYAAVNG
AVEALCRSLAVELAPLRVNAVSPGIVDTPAYAGMSPAERKRMFDALAARL
PARRIGRPEDVAAAVISLMKNGYVTGSVVYVDGGHRLV
>GSU1214 tetracenomycin polyketide synthesis 8-o-methyltransferase, putative
MEPKNWTPAELLQLSSGYWNIGALHAAVSLDVFTPLAGGDLSAEELAERL
GADGRALAMLLDALTAMELVHKRADRYGAAPAACRFLSQDSPDYLGHIIR
HHHHLVAGWARLDDAVRSGGPVRQSSSHGSDESARESFLMGMFNLAMLSA
PRVVPRIDLSGRQRLLDLGGGPGTWAIQFCLHNPGLRAVVYDLPTTRPFA
ERTIARFGLADRVSFEEGNFLSGEIPGRYDVAWLSQILHSEGPEGCAVVI
EKAVTALEPGGIIMVQEFILNDAKDGPLFPALFSLNMLVGTARGQAYTEG
ELKAMMAAAGVRDLRRIPLDIPNGAGVIAGTVA
>GSU2713 conserved hypothetical protein
MRQRLAPLLTIMLLLQCVTAAFAAPTLSVDKPFFDFGTIPQGKKLDHVFT
LKNKGDSPLSIVRTKSSCGCTVISLPRKTIEPGGSVELKTTFDSTTFGGK
VTKTITVETNDPANPNYTLTLTGVVSEVLVVAPRQLNLGQIKAGTSGTFT
VTVDNKGNRPVKITSATSPMPQVKVTAGKQSIKPGESTSITISVAPRPED
RFLSGFIIIKTDMPGKPEITVPVYGSVSK
>GSU2441 conserved hypothetical protein
MDIPRIFTITESAHRIHNPITPEKLTTLGAALRLEPGTRVLDLGSGSGEM
LCTWARDHGIIGTGIDMSQLFTAQAKLRAEELGVADRVTFIHGDASGYVS
DDKAGVAACVGATWIGGGVAGTIELLARSLRPGGIILIGEPYWRQVPPTE
DVARGCLANSTSDFLMLPELLASFGRLGYDVVEMVLADQDGWDRYEAAKW
LTMRRWLEANPDDELAKEVRAQLTSEPGRYASYTREYLGWGVFALMPR
>GSU1002 isochorismatase family protein
MGIMEKFFLDRQQAVLVVIDVQEKLCAAMDPEVLERLTKNTGILLEAAQD
LGMPVVATEQYVKGLGCTLPVLKEKIEGDACEKMTFSCCGDDAFLNRLAA
LGRKQVIITGMETHVCVLQTVIELLERGYHVHLVRDAIMSRRKENWFVGM
EVARDAGAVITSTEAALFQLLRVAGSEEFKKLSKLVR
>GSU0924 ABC transporter, permease protein, putative
MPLTSIEHTVKSFLAEFQAFCVLSLRAVVRIFRRPGYYREFVIQFDKMGV
GSLFIICLTGLFTGMVMALQALIQLKPFAATSYVGGMVAVTMVKELGPVL
SSLMVAGRVGSSITAELGTMVVTEQVDAMRVEGTDIVSRLVTSRLKALML
AMPMLALVTDAVALLGGYIIAAGYDINLLMYWKSLPQFMVFQDLIEGVMK
PFVFGTLIALIGCYVGLSTSGGAEGVGTSAKRAVVLSSVMVLVADFFMTK
IFIVFR
>GSU0690 conserved hypothetical protein
MTRGLSGAVPLAHFFLRERVKPGDRVVDATCGNGHDTLFLAELVGPEGRV
WAFDIQDAALAATAKRLEAAGCGERVELVNGGHERLAELVPGPVTAVVFN
LGFLPGAENGTITTPATTGAALDQATELLLPGGIVTVAVYTGHPGGPEEE
AAVDAWAASLPPARFNVWRCRQGNRSSAAPYLVVAECRA
>GSU1506 hypothetical protein
MTGIVTANWWTSLIYRVFVRTACLRAGRGLLPNIWEWLWRLSCNIIEGPV
DITIHNRPAIVNFGYTYPIYMRCYPELNAPLVELTYQCWRSRGTAICIVD
VGAAVGDTMLLLHSNLPEAVGSFVCIEGDQEFYRYLQHNLGHMTEGRLIN
VVVADQETEVSDLVRIHTGTASAQGERTRGASTLSAVVTEEIDLIKIDVD
GFDGRVLLGAEDLLKRCRPLVIFEWHPSLCRQTGNNWTDHFDVLARCGYS
RCLWFTKFGHFSHFSFGHCREEIDTLAEYCMSDVVKDWHYDVIALHDDSD
LSLLLLSQLGFAKARPSRY
>GSU1844 IPT/TIG domain protein, putative
MTRKKRCAGLLLAPVLALSLFSRAEAVLLDVGPIVPQVINSSPPQHGFPL
WYRDTNRVPLELCLSRTASVNGPMCLTQEPFPAQPFNFPNNFGPEAFWWS
ADAIMAMPGGGDARLIMALEAAFAVGDPIPGDQVSFARIRIRIDTPVAGT
YVVTYPYGEMTFNVTDTDAGINYTRDIGIATNNFNGALLGDVGPFLYWDT
GPVAVGDELFVGDPNVDHRVLGSPFPDPLNPSQFSNFFRVRGPAAVGTLQ
TDLFAVMGKIYQTPIPTPLTVDRLTYSRDAAGMQFHAAATTQPVSNQVNP
ALPFPQNFALTGVPSALEVTGTGLPTQTMITNDPADGKFFSASSFFADPG
TLPATVQVTNINDEPDTVVTVPLVDDVTVFRATYRPQSGTLSIAADSADD
VANPVLQAYMPGMTAPLGTLVNGQLSVSFPLVDTSVTPTKTHSVPPVWVT
VKSAAGGEASALVTVRDISPPVVAGFSPASGVVGTAITLFGSNFSPFLAE
NIVTFNGTPATVLSATDSFLTVEVPPLATTGAIAVTTSGGQAASVASFIP
RYTTSVTLAGTGAGSVNSVPVGIACTVGTCSGEFDYNTALELVQSASSGS
QFDGWSGDCTGTGPCTLSTTADWAVSAVFSIQPNVRIGALTYFGTAQAAF
DAVQNGEVILARAMILPGSDPVYDRPGISSTFSGGYADFTEPLTQSDYTT
IVGSLTINRGELVVDQVMVN
>GSU1394 laccase family protein
MTSARMLLATAIAILTLGSGTGTALAFLKPDKTPIGPGDTPDYLTTPNWA
NSPPMRKFVDTLPGLGSGNANNLGQFLSVAVPDITTYPGSDYYEIELRQY
SEQMHSDLPPTTLRGYVQVNNGTDTTSCTDPSLNLATPCTTANNTVAPAP
GPRHLGPIIVAQKDRPVRVKFINKLPTGAGGNLFIPVDQTVMGSGPFQID
YDPVTKQATALKSGTFTQNRAELHLHGGRTPWISDGTPHQWITPAGEMTD
YPTGVSVENVPDMPDPGPGAQTYYWTNQQSSRMLFYHDHAWGITRLNVYV
GEAAGYLIRDAVEQELITAGTIPSAELPLVIEDKTFVDPATIVATDPTWA
WGSQPWTGTGPMTPVKGDLWWPHVYMPAQNPFDITGIAPMGRWAYGPYFW
PATNNPFQPIPNPYYSAACDPAGDPATTPGLLGGPYGQFCQPPEIPSTPN
PSWGAEAFMDTPLVNGTAYPVVDVDPKPYRLRVLNANHDRFVNLQLYKAD
PTVDPNATPGSDAKCLALGGCATETEVKMLPALDYSTDPTWPATWPADGR
PGGIPDWTTRGPDWVMIGTEGGFLPKPVVIPSHPVTWNNDVTTFNAGNVN
GGSLILGPAERADVIVDFSQYAGQTLILYNDAPAPWPAIDPHYDYYTGAP
DNRAMGGADTTLAGFGPNTRTIMKIRVAAGAGAPFNLAALQAAFTSGSNS
GGQPSVFQRSQDPIIVGQGNMNPAGDPAVFSAFLFPETYDAYNKAYDRVF
PTSWPNWGVSRINDKVLNFIGSDGSTTYQYNPADTTPLPWDPTKTTKGGM
PMKFKAIQDEQGETFDDYGRMRAALGLELITPGAGRVNFIVQTYSDPATE
VLQEDGIQIWKITHNGVDTHPVHFHLFDVQVLNRVGWDGFIRLPDPTELG
WKDTVRISPLEDTIVAMKPVKPKMPFGVPNSFRPLNPATPLGDTTELSMV
DPTTGQAWATPNINRFMNFDWEYVWHCHILSHEENDMMRPMQFIPVTNLP
DAPTLNTAIVTVNSVVLNWTDSTPPSAPTTLGNPKNEIGFRVERCAGSSC
TDFAPIGTALANATSYTDLAVNSTTTYRYRVVAYNALGDSPVSNVLSADT
AVISRPIITVSPLTANFGNVTVGFTSSPTNITVTNTGQLPLDVTAFTPSG
GNAAMFTIQNGSCGTLPVTIAPAANCTFSVTFAPTTAGIVTANLQITSND
AASPVPNISLSGTGISPTTNPVRINTTYYPSLAAAFTAAASGNTIQAFGV
LFVEPAVNLNTTGTVTFRGGYDALFGTSTGMTTLQGVFTITNGALVVNNL
TIQ
>GSU1575 hypothetical protein
MTSVQNHYDRLLGPLYGWMLGDGEKARERARQELHDAGISAGDGLAVDLG
AGNGLHAVPLAESGYAVVAIDTCQSLLDELRESSGTFAITAVRDSLETFR
RHCPTAPDVVVCMGDTLTHLPSREAVQGLISEVAASLAPGGIFVTTFRDY
VRGVLEGPARFIPVQSDEEKILTCFLEYGRDRVIVHDILHTRALSGWEMT
VSAYPKVRLDPEWVGAQLGQRGLQVCTEPGQRGMVRIVARKI
>GSU0926 mce-related protein
MKRSDTISWSQVRGGLFVLAALAFFAGGVLIMGDKTKFFVPKGRLSVIMT
DVAGLKVGAPVWLAGVDVGIVTDIRFERPEQSNEVEAVLEVDEEALKKIG
RDSVITVKTRGLLGEKYVDITPTRQIIATPVTRLQGTSLPKLDDVVQKAG
EAFERVNAVVAKTERGEGTLGRFASDPKLYDNLVRLTVELNTIAGSVNRG
EGTLGMLNRSREPYDRLMKILSRADDTLAEIQSSEGTLSKLIRDRELYDK
LVAVADKAGGAADEVRELNRKLVAKEGTLGLLLADRGFYDKGMSLIDRAD
RSLSSLEEITTRLQRGEGTAGKLLSDKELYDKLNRMVDDLDLLVRDIKEN
PKRYVKFSLF
>GSU3181 beta-ketoacyl synthase domain protein
MRTLAITGASCVTAVGHDGPSTAASVRAGISRFAEYDDYRDENDNPITAA
RICGIHDSWDTPQRMAGVAALCLEKLLDEYFRRDARRPSQIHLFLGVASD
ERPGPRYEESSLFPLRGIIGKWTDKPGLQAIRRGNASMMCSLEQAGRLID
SNPDAVCIVGGVDSLLRTSTLNWFEKDCRLKSASYGRHQGLIAGEAVGFM
VIGEHAGAKQADRPILGSIAGLGLAVEPAPRASSALGRNSGLTEACHAAL
SGVQKKAIRAIFSDLNGENSRAREWGMAEMRCFDKPDESRRLWTPANCYG
DIGAASGVALASIAMQGLVRGWLQSPILVTCSDDHGPCGALVLESEK
>GSU1513 conserved domain protein
MSIEMRRVRGGIGDIATCPSCHNEVNSFNYRVDYESGESFIYKCPFCDLM
FMYPLMLSELTKRPMESVDDADMFNSSVLKRLHEQLIVKKEIAVARALLG
RHNFSLLDVGCGTGWISAIWRDLGADVTGLEPSASRRRIARERHGIRVLD
SFVEELGSEESFDVVTIRHVLEHLENPLEALRHIHSHTRHDGLLVVVVPN
IDCLGRFLFDTRWSWILPWHCLFFNPRSLQGLVERAGFKVENVYQTPSPL
WYPESFFRVLPGSSELRERFYARLNIWALLPFAPLVAAGYLSGFSDNITL
IARACRHK
>GSU2746 conserved domain protein
MVITGSDITMGSRHVSIERYERKESLKMWIGDERPDFEGEERGIGLGPDR
VDLSERSRAAEAHARNAEKKQAAEDTEEAGCGCAEDNLEPRLRLLKDLIE
QLTGRSIRVFDMEEAKKAADTADGEGAAAGGEDGGAGFGIEYDFYESRYE
FESTAFAASGVIRTADGQEIRFDLGMLMEREHFSETSVSLRAGDAVKKDP
LVINFNGTAAELTDLRFSFDLDADGTADRIASLGSGSGYLALDRNGDGVV
NNGSELFGPATGEGFAELAAYDDDGNGWIDERDAVFSQLKVWTGASATSA
GTLTGLGQAGIGAIYLGNQSTPFDLKDGDNQLLGSVRSTGIFVGEKGSVG
TVQQIDLAV
>GSU2248 conserved hypothetical protein
MAVTWVEAGKLFPEAIKKLQHVRVLMDIGCGIRPQQFLRPLVHICCEPFG
QYVEHLQTLIKERTDRNYVVINATWAETVRLFPPKSVDTVILNDVIEHLE
KEEAVNLLRATERIARRQIALFTPLGFMPQSHPDGKDAWGLDGGAWQEHK
SGWQPDDFDETWDIVAAEVFHKSDSMGNELEKPFGALWAIKTFDDELSTG
QGVLSVKQKMYSLVDYTFKKMSKLTR
>GSU0242 acpP-1, acyl carrier protein
MSDQLIVQVKQMIIDALRIEGMSPDDIDTDAPLFGEGLGLDSIDALQLVV
AMEKDFGVVVPDAATGTKVFASVRSMADHIAANRK
>GSU1604 acpP-2, acyl carrier protein
MSSIEKRVKEIVAEQLGVDEAQVTNDASFMDDLGADSLDTVELVMALEEE
FDIEISDEDAEKIQNVQDAIDYITEHT
>GSU0229 alkK, medium-chain-fatty-acid--CoA ligase
MTDTLIPRTPSAYDYPLLIKSLLRNPVVDNPDQEIVYRGVIRHTYRDLRE
RVRRLANVLTGLGVKAGDTVAVMDWDSHRYLELFFAVPMIGAVLHTINVR
LSPEQILYTIDHAEDDLLLVNSEFLPILEQIRGRIDTVRGYVLLTDEEKM
PESHIPFVGEYEALLAAASGEYDFPDFDENTRATTFYTTGTTGLPKGVYF
SHRQLVLHTMGVMASLGTVFAHGRLHQGDVYMPITPMFHVHAWGVPYLAT
MLGIKQVYPGRYSPDLLLDLIEKERVTFTHCVPTILHMLLKHPHAKRVDL
AGLKMIIGGAAMSRALCCEALERGIDVFTGYGMSETCPILTFSRLTPEML
AGSPAEQAEVRCLTGLSLPFVDLRVVDPETGAEQPRDGRSAGEIVVRAPW
LTQGYLKDHRTSEKLWEGGFLHTGDVAVRDERGYVRITDRTKDVIKVAGE
WVSSLELEDILAHHPAVAEVAVIGQPDEKWGERPLALVVLKPEEAGRVGE
KDLAHFVREYADKGMVSKQVVLLKVRLVDAIDKTSVGKISKVTLREKYLA
>GSU0074 elbB, enhancing lycopene biosynthesis protein 2
MKKIGVVLSGCGVYDGSEIHEAVLTLLAIDRNGAEAVCMAPSMEFREVNH
LTSQETGATRNALVEAARIARGKIRDVKDVSAVELDAVIFPGGYGAAKNL
CTFAEKGAAATINPEVARLIREMAVAKKPIGAICIAPALIAATLGRDYKP
KVTIGTDAGTAAAITETGSEHVSCPVAEFVVDRENKIVTTPAYMLANRIS
EAAEGIEKAVKAVVEMA
>GSU0460 fabF-1, 3-oxoacyl-(acyl-carrier-protein) synthase II
MDKTRIAITGLGIFCAAGKDLASFTDALLHGRCGIGPVDLFDVSPFPSHI
GAQVRDYSPLDYFDRADARRLSRTDQFAAVAAAEALGMSGAREHYDPFTV
GICVGAGASGMIHGEAWLRDRLAGGKGRPGDLRCILPDRTTTALAERFGL
WGYQGSITTACSSSATAIGWGADLIATGQLDACLCGGSDTLSILTYAGFN
SLRVVDPEPCSPFSLGRQGISLGEGAAFVVLEREETARSRGARIYGRVLG
YALAGEAHHMTAPEPSGSEAARVMRAALDNAGVSAGEIGWVNAHGTGTPL
NDVVESKAMKLVFGADVQHVPLVSTKGMTGHCLGAAGSIEIVATVTALGA
GIIPRTLNFRGSDPECDLDYCHDGPRESSATVALSNSFAFGGNVTSVVIG
R
>GSU1605 fabF-2, 3-oxoacyl-(acyl-carrier-protein) synthase II
MRRVVVTGVGAVSPLGVGNAANWDALVSGTSGIGHITRFDASDLPVRIAG
EVKGFDPEQYIDKKEVKKMDLFIQYAMAAAHYAMEDSGLQITEENAERTG
VLVGAGLGGLPTIEKYHAAMLEGGHKKISPFFIPMLIINLAPGHISIKYG
AKGPNLSSVSACATGTHSIGDAYHMIKRGDADAMIAGGTESTVTPLGIGG
FAVMKALSTRNDDPTAASRPFEKNRDGFVLAEGAGIVVLEEYEAAKKRGA
KIYAEIVGYGLTGDAYHLTAPAPEGEGAARCMKMALNNAGVRPEEVDYIN
AHGTSTPFNDYYETLAVKSVFGDYAKKVMVSSTKSMTGHLLGAAGGVEAV
FTLMAMDKGVIPPTINYQEQDPECDLDYVPNAAREKSITYALSNNFGFGG
TNATLLFKKV
>GSU0461 fabG-1, 3-oxoacyl-(acyl carrier protein) reductase
MEFKDSIVVVTGGTRGIGRAISLHFARQGALVTAAYRADDEAARALEAEA
AGLPGSIAVIRADVGTAEGAMAVIDAASGESGTLHVLVNNAGIIRDGYLA
MMAEDDWDAVMRANLSPLFHCCKWGVRKMLARRRGAIINLSSVSAFAGTA
GQTNYAATKGAAVSFTKSLAREVGPLGIRVNAVAPGLIETEMIAGMKREM
VDRIVGSSILGRTGRPEEVAEAVAFLASDRASYITGQCLVVDGGIL
>GSU1603 fabG-2, 3-oxoacyl-(acyl-carrier-protein) reductase
MSLAGKIAVVTGASRGIGREIALRLAREGADVAVTATTLDSARKTADEIE
QIGRRALALAVDVADAAAVEALFASVVEAFGKVDILVNNAGITRDGLLLR
MKDADWDAVLDVNLKGAFNCTREAAKLMTKARSGRIVNIGSVVGEMGNAG
QVNYCASKAGMIGMTKAVARELAKRGITVNAVTPGFIETDMTAVLSEKVR
ESLMQQIPLERFGSPEDIANAVHFLVSDMGSYITGHVLSVNGGMYM