TitleGenColors Logo

Gene list

Applied filters:

COG category: Secondary metabolites biosynthesis, transport and catabolism
Organism: Mycobacterium bovis AF2122/97, AF2122/97
Gene type: CDS

Number of genes found: 261

Free access
Sort by:

 



# Mycobacterium bovis AF2122/97, AF2122/97

>Mb0069 Mb0069, PROBABLE OXIDOREDUCTASE
MTKWTAADIPDQTGRTAVITGANTGLGFETAAALAAHGAHVVLAVRNLDK
GKQAAARITEATPGAEVELQELDLTSLASVRAAAAQLKSDHQRIDLLINN
AGVMYTPRQTTADGFEMQFGTNHLGHFALTGLLIDRLLPVAGSRVVTISS
VGHRIRAAIHFDDLQWERRYRRVAAYGQAKLANLLFTYELQRRLAPGGTT
IAVASHPGVSNTELVRNMPRPLVAVAAILAPLMQDAELGALPTLRAATDP
AVRGGQYFGPDGFGEIRGYPKVVASSAQSHDEQLQRRLWAVSEELTGVVY
PVG
>Mb0076 Mb0076, CONSERVED HYPOTHETICAL PROTEIN
MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAG
SACPDMNLVDLGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRA
RRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLT
RSGGHCWFLGGVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPW
QPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAGVDGIEHCTFL
SEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRR
LIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSRHGFTSTEVLTGATAAA
AASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL
QASAVGYNTPS
>Mb0092 Mb0092, POSSIBLE METHYLTRANSFERASE/METHYLASE
MDQPWNANIHYDALLDAMVPLGTQCVLDVGCGDGLLAARLARRIPYVTAV
DIDAPVLRRAQTRFANAPIRWLHADIMTAELPNAGFDAVVSNAALHHIED
TRTALSRLGGLVTPGGTLAVVTFVTPSLRNGLWHLTSWVACGMANRVKGK
WEHSAPIKWPPPQTLHELRSHVRALLPGACIRRLLYGRVLVTWRAPV
>Mb0100 Mb0100, POSSIBLE OXIDOREDUCTASE
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPR
EFIKLGRIIGQIVPYYEPMYHHEDHPEIFVSSTEEGQGVPKTGAFWHIDY
MFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVST
HDPRRHIKIRPSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYI
CATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFIHTQHYQVGDI
ILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
>Mb0103 Mb0103, CONSERVED HYPOTHETICAL PROTEIN
MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQ
SELPSRLAANPSIAGWLRELEAVCTEFG
>Mb0150 Mb0150, CONSERVED HYPOTHETICAL PROTEIN
MTELDDVSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAATNPLIR
DEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVHRVACDYQAVRTHFF
DEYFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKA
GILQSHGAVPTARRHAVAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPY
LPGDAADRLFDMVTALSAPGSQVAVEAFTMNTKGNTQRWNRMRERLGLDI
DVQALTYHEPDRSDAAQWLATHGWQVHSVSNREEMARLGRAIPQDLVDET
VRTTLLRGRLVTPAQPA
>Mb0151 Mb0151, CONSERVED HYPOTHETICAL PROTEIN
MRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLLVTNAGA
GAIWEAMLDPTLVAKAAAIDAETAAIVAYLRSYQAVRTNFFDTYFASAVA
AGIRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAENGVT
PSAGRREVPADLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQDRL
FTQVGAVSVAGSRIAAETAPVHGEERRAEMRARFKKVADVLGIEQTIDVQ
ELVYHDQDRASVADWLTDHGWRARSQRAPDEMRRVGRWVEGVPMADDPTA
FAEFVTAERL
>Mb0153 Mb0153, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARDGTGAGSA
MADEVVAEIRDKGGRAVANYDSVATEDGAANIIKTALDEFGAVHGVVSNA
GILRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSYGRVVVAT
STSGLFGNFGQTNYGAAKLGLVGLINTLALEGAKYNIHANALAPIAATRM
TQDILPPEVLEKLTPEFVAPVVAYLCTEECADNASVYVVGGGKVQRVALF
GNDGANFDKPPSVQDVAARWAEITDLSGAKIAGFKL
>Mb0227 Mb0227, CONSERVED HYPOTHETICAL PROTEIN
MKRLSGWDAVLLYSETPNVHMHTLKVAVIELDSDRQEFGVDAFREVIAGR
LHKLEPLGYQLVDVPLKFHHPMWREHCQVDLNYHIRPWRLRAPGGRRELD
EAVGEIASTPLNRDHPLWEMYFVEGLANHRIAVVAKIHHALADGVASANM
MARGMDLLPGPEVGRYVPDPAPTKRQLLSAAFIDHLRHLGRIPATIRYTT
QGLGRVRRSSRKLSPALTMPFTPPPTFMNRIKKPLSKPSGRPPPHTNRAP
SSMPLAM
>Mb0229c Mb0229c, POSSIBLE METHYLTRANSFERASE (METHYLASE)
MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAMIGDLWL
ATHSEPPVGRTLLDVGGGPGYFATAFSDAGVGYIGVEPDPDEMHAAGPAF
TGRPGMFVRASGMALPLADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKP
GGLVVLSYTVWLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSS
LFAVSAAEGLRWAAGTGAALAVFPRYHPRWAWWLTSVPVLREFLVSNLVL
VLTP
>Mb0289 Mb0289, CONSERVED HYPOTHETICAL PROTEIN
MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAFCRAVGG
SWADVLDGKLPDHKLKSTDFGEHFVNFQGARTKYFDEYFRRAAAAGARQV
VILAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRR
EIAVDLRDDWPQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDA
LAGRRSHVAVEDGAPMGPDEYAAKVEEERAAIAEGAEEHPFFQLVYNERC
APAAEWFGERGWTAVATLLNDYLEAVGRPVPGPESEAGPMFARNTLVSAA
RV
>Mb0311 Mb0311, PROBABLE DEHYDROGENASE/REDUCTASE
MNTGTAVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRAAMEELG
EPNRCSVLEVDLASVRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGIAF
TDDGVEMTFGVNHLGHFALVTGILDWLARPARIVVVSSGTHDPSKHTGMP
DPRYTCAADLAHPPTDQNTPAEGRRRYTTSKLCNVLFTYELDRRLDHGEQ
GVMVNAFDPGLMPGSGLARDYPPILRLAYRLLSPMLRVLPFVHSTRVSGE
HLAALAVDPRFAGVTGQYFAGAKAIRSSAESYDRAKALDLWETSERLLAQ
VT
>Mb0322c Mb0322c, POSSIBLE CONSERVED MEMBRANE PROTEIN
MIVVWEHLCMNPEDDPEARIRELERPLADVARASELGGSQSGGYTYPPGP
PPPPYSYGGPFGGPSPRSSSGNRAWWILAAVVVVGVLVLVGGIAAFSAQR
LSQGNFVVLSPTPSVSRAVPTPTAQPATTLPPAGASLSVSGVNVNRTIAC
NDSIVSVSGMSNTVVITGHCTSLTVSGMRNSVTADSVDTIEAAGFNNEVT
YHSGSPKISNAGGSNSVQQG
>Mb0324 Mb0324, POSSIBLE MUCONOLACTONE ISOMERASE
MEFLVTMTTRVPDSMPADAVERVRAREAARSRELAAQGKLLRLWRPPLRP
GEWRTLGLFAADDNGELEQLLASMPPRSWRTDDVTPLGAHPNDPVGQGIT
IAPGKGPEFLIATTIMVPPGTPAQVVDDTVAREARRAPELAGRGHLVRLW
ALPDGPDGQRTLGLWRARDPGELMAILESLPLAGWMTIETTPLSPHPDDP
IRMP
>Mb0333 Mb0333, HYPOTHETICAL PROTEIN
MGPKGSLRLVKRQPELLVAQHEHWQDTYRAHPVLYGTRPSEPGVYAAEVF
NADGVQRVLELAAGHGRDTLYFAGQGFTVVATDFSDVAVAQLRRSAQARG
VSARVQPIVHDLRQPLPVKTGSIDGAFAHMALCMALSTSEIHAVVAEVGR
VLRPGGKFIYTVRHTGDAHYGAGQAHGDDIFECAGFAVHFFRRELVARLA
TGWVLEEVHDFEEGELPRRLWRVTVTKPA
>Mb0336c Mb0336c, CONSERVED HYPOTHETICAL PROTEIN
MRLTHPARRYLSSQAARPTGAFGRLLGRIWRAETADVNRIAVELLAPGPG
ERVCEIGFGPGRTLGLLAAAGAQVSGVEVSTTMIAIAAHHNAKAIAAGLI
SLYHGDGVTLPVADHSLDKVLGVHNFYFSPDPRASLCDIARALRPGGRLV
LTSISDDQPLAARFDPAIYRVPPTLDTAAWLGAAGFIDVGIKRSADHPAT
VWFTATAT
>Mb0363c Mb0363c, CONSERVED HYPOTHETICAL PROTEIN
MTDASVHPDELDPEYHHHGGFPEYGPASPGAGFGQFVATMRRLQDLAVAA
DPGDAVWDEAAERAAALVELLSPFEADEGKAPAGRTPGLPGMGSLLLPPW
TVTRYGTDGVEMRGSFSRFHVGGNSAVHGGVLPLLFDHMFGMISHAAGRP
ISRTAFLHVDYRRITPIDVPLIVRGRVTNTEGRKAFVCAELFDSDETLLA
EGNGLMVRLLPGQP
>Mb0447c Mb0447c, PUTATIVE DEHYDROGENASE/REDUCTASE
MTANDNKTRKWSAADVPDQSGRVVVVTGANTGIGYHTAAVFADRGAHVVL
AVRNLEKGNAARARIMAARPGAHVTLQQLDLCSLDSVRAAADALRTAYPR
IDVLINNAGVMWTPKQVTKDGFELQFGTNHLGHFALTGLVLDHMLPVPGS
RVVTVSSQGHRIHAAIHFDDLQWERRYNRVAAYGQAKLANLLFTYELQRR
LGEAGKSTIAVAAHPGGSNTELTRNLPRLIRPVATVLGPLLFQSPEMGAL
PTLRAATDPTTQGGQYYGPDGFGEQRGHPKVVQSSAQSHDKDLQRRLWTV
SEELTGVSFGV
>Mb0533 Mb0533, POSSIBLE METHYLTRANSFERASE/METHYLASE (FRAGMENT)
MGGCSITCLNISEVPNETNRKKNRQAGLDRSIRVIHGSFDDIPEPDSGYD
VVWSQDAILHAPDRRKVLEEAFRVLRPGGELIFTDPMQADDVPDGVLQPV
YDRLNLRDLGSMRFYA
>Mb0561c Mb0561c, POSSIBLE OXIDOREDUCTASE
MSKRPLRWLTEQITLAGMRPPISPQLLINRPAMQPVDLTGKRILLTGASS
GIGAAATKQFGLHRAVVVAVARRKDLLDAVADRITGDGGTAMSLPCDLSD
MEAIDALVEDVEKRIGGIDILINNAGRSIRRPLAESLERWHDVERTMVLN
YYAPLRLIRGLAPGMLERGDGHIINVATWGVLSEASPLFSVYNASKAALS
AVSRIIETEWGSQGVHSTTLYYPLVATPMIAPTKAYDGLPALTAAEAAEW
MVTAARTRPVRIAPRVAVAVNALDSIGPRWVNALMQRRNEQLNP
>Mb0575c Mb0575c, POSSIBLE BENZOQUINONE METHYLTRANSFERASE (METHYLASE)
MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQ
PELAALIVQGKFRGDVLDVGCGEAAISLALAERGHTTVGLDLSPAAVELA
RHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYL
QSIVRAAAPGASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIK
PARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHLG
>Mb0582 Mb0582, PROBABLE METHYLTRANSFERASE/METHYLASE
MELSPDRIMAIGGGYGPSKVLLTAVGLGLFTELGDEAMTAEAIADRLGLL
KRPAIDFLDALVSLDLLARDGDGPGSHYRNTPETAHFLDEARPTYAGGLL
KIWNERNYRFWADLTEALKTGKAQSEVKQTGRPFFEALYADPRRLEAFMA
AMDAASRRNIELLAKRFPFERYRRLCDVGCADGLLSRIVAAAHPHLQCVS
LDLPAVTEIARRKLTAEGLGERVQACAGDFLADPLPAADVITMGQILHDW
NLDRKQQLVAKAYEALSKEGAFIVIETLIDDARRENTTGLMMSLNMLIEF
GDAFDYSAADFRGWCGEAGFRSFEVIPLAGGSSAAVAYK
>Mb0673 Mb0673, PROBABLE DIOXYGENASE
MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRYLRNGPN
PVAEVDPATYHWFTGDAMVHGVALRDGKARWYRNRWVRTPAVCAALGEPI
SARPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCD
FDGTLHGGYTAHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTV
DIEVAGSPMMHSFSLTDNYVVIYDLPVTFDPMQVVPASVPRWLQRPARLV
IQSVLGRVRIPDPIAALGNRMQGHSDRLPYAWNPSYPARVGVMPREGGNE
DVRWFDIEPCYVYHPLNAYSECRNGAEVLVLDVVRYSRMFDRDRRGPGGD
SRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHRFAYTVGIE
GGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAE
DDGILMGYGWHRGRDEGQLLLLDAQTLESIATVHLPQRVPMGFHGNWAPT
T
>Mb0698c Mb0698c, CONSERVED HYPOTHETICAL THREONINE RICH PROTEIN
MVEKPLRADRATHSRLATFALALAAAALPLAGCSSTANPPAATTTPATAT
TTTATSGPTAAPTVTTGESTTASIQIGDMLTYGSIGTTATLDCADGKSLN
VAGSDNTLTVNGTCETVTVGGANNKIAFDRIDERLVVVGLDNTVTYKNGD
PTIDNLGAGNRINKE
>Mb0706 Mb0706, PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICAPVSGSV
TYPPATSEDLGETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGR
LDIVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAG
NGGSIVVVSSSAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNS
IHPYSVDTPMIEPEAMIQTFAKHPGYVHSFPPMPLQPKGFMTPDEISDVV
VWLAGDGSGALSGNQIPVDKGALKY
>Mb0746c Mb0746c, CONSERVED HYPOTHETICAL PROTEIN
MPRAHDDNWDLASSVGATATMVAAGRALATKDPRGLINDPFAEPLVRAVG
LDFFTKLIDGELDIATTGNLSPGRAQAMIDGIAVRTKYFDDYFRTATDGG
VRQVVILAAGLDARAYRLPWPAGTVVYEIDQPQVIDFKTTTLAGIGAKPT
AIRRTVYIDLRADWPAALQAAGLDSTAPTAWLAEGMLIYLPPDPRTGCST
TAPNSVLRAARSLPNLSRALWISTQAGYEKWRIRFASTAWTSTWRRWCIP
ANAATSSTTCAPRAGTLRAQCGPTYSGAMVCPFPPHTTTIRSAKSSSSAV
V
>Mb0747c Mb0747c, CONSERVED HYPOTHETICAL PROTEIN
MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLINDQFAEPL
VRAVGVDVLTRLASGELTASDIDDPERPNASMVRMAEHHAVRTKFFDEFF
MDATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLA
ELGATPTADRRVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPE
AQDRLLDNVTALSVPDSRFATESIRNFKPHHEERMRERMTILANRWRAYG
FDLDMNELVYFGDRNEPASYLSDNGWLLTEIKSQDLLTANGFQPFEDEEV
PLPDFFYVSARLQRKHRQYPAHRKPAPSWRHTACPVNELSKSAAYTMTRS
DAHQASTTAPPPPGLTG
>Mb0752c Mb0752c, CONSERVED HYPOTHETICAL PROTEIN
MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVNDQFAEPL
VRAVGVDFFVRMASGELDPDELAEDEANGLRRFADAMAIRTHYFDNFFLD
ATRAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGL
GAAPTTDRRTVAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQ
DRLLDQITAQSVPGSQFATEVLRDINRLNEEELRGRMRRLAERFRRHGLD
LDMSGLVYFGDRTDARTYLADHGWRTASASTTDLLAEHGLPPIDGDDAPF
GEVIYVSAELKQKHQDTR
>Mb0788c Mb0788c, PROBABLE OXIDOREDUCTASE
MPRFEPHPARRTTVVAGASSGIGAATATELAGRGFPVALGARRMDKLAEL
VDKIRADGGEAVAFPLDVTDPESVKSFVAQTVEALGEVELLVSSAGDMLP
GQLHEVSTEAFAEQVQIHLVGANRLATAVLPAMVARRRGDLIFVGSDVGL
RQRPHMGAYGAAKAGLAAMVTNLQMELEGTGVRASIVHPGPTLTGMGWQL
SAEQVGPMLADWAKWGQARHNYFLRPSDLARAIAFVAETPRGCVVVNMEI
QPEAPLRDAPAHRQKLVLGEEGMPG
>Mb0792 Mb0792, PROBABLE DEHYDROGENASE/REDUCTASE
MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAVAKQIVA
DGGTVIHVPVDVSDEDSAKAMVDRAVGAFGGIDYLVNNAAIYGGMKLDLL
LTVPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYS
NFYGLAKVGVNGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELV
KNMVQTIPLSRMGTPEDLVGMCLFLLSDSASWITGQIFNVDGGQIIRS
>Mb0853 Mb0853, CONSERVED HYPOTHETICAL PROTEIN
MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAPLVRAVG
MDVYTRLVDWQIPVEGDSEFDPQRMATGMACRTRFFDQFFLDATHSGIGQ
FVILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATER
RTVAVDLRDDWATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNIT
ALSAPGSRLAFEFVPDTAIFADERWRNYHNRMSELGFDIDLNELVYHGQR
GHVLDYLTRDGWQTSALTVTQLYEANGFAYPDDELATAFADLTYSSATLM
R
>Mb0862 Mb0862, CONSERVED HYPOTHETICAL PROTEIN
MNDKRRAIYTHGYHESVLRSHRRRTAENSAGYLLPYLVPGLSVLDVGCGP
GTITVDLAARVVPGSVTGVEPTDDALSLARAEAQLHRLSNISFTTSDVHK
LDFPDDAFDVVHAHQVLQHVADPVRALQEMRRVCTPGGIVAARDADYSGF
IWFPKLPALDRWLDLYERAARANGGEPDAGRRLLSWARAAGFDDVTPTAS
VWCFATASAREWWGLVWADRILQSDLAHQLVDSGLATAAQLEEISTAWRE
WAAAPDGWLAIPHGEILCRA
>Mb0869c Mb0869c, PROBABLE OXIDASE
MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGAAGMTAA
IDAAEAARPHSGRTVTATLTPQPARIDLGGPIVSTLTYGNTIPGPLIRAT
VGDEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRF
SVPDPGTYWAHPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTD
GIGKSPQQLYGELTDPNKPTMQNTTGMPEGEGVDSNLLGGDGGDIAYPYY
LINGRIPVAATSFKAKPGQRIRIRIINSAADTAFRIALAGHSMTVTHTDG
YPVIPTEVDALLIGMAERYDVMVTAAGGVFPLVALAEGKNALARALLSTG
AGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLPVTLGGTMA
KYDWTINGEPHSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIK
ADGSPGARKDTVIVLPKQKMRAVLVADNPGVWVMHCHNNYHQVAGMATRL
DYIL
>Mb0874c Mb0874c, PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MDGFPGRGAVITGGASGIGLATGTEFARRGARVVLGDVDKPGLRQAVNHL
RAEGFDVHGVMCDVRHREEVTHLADEAFRLLGHFDVVFSNAGIVVGGPIV
EMTHDDWRWVIDVDLWGSIHTVEAFLPRLLEQGTGGHVVFTASFAGLVPN
AGLGAYGVAKYGVVGLAETLAREVTADGIGVSVLCPMVVETNLVANSERI
RGAACAQSSTTGSPGPLPLQDDNLGVDDIAQLTADAILANRLYVLPHAAS
RASIRRRFERIDRTFDEQAAEGWRH
>Mb0917c Mb0917c, CONSERVED HYPOTHETICAL PROTEIN
MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVFCRAAGG
EWADVLDGKLPDHYLTTGDFGEHFVNFQGARTRYFDEYFSRATAAGMKQV
VILAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRR
SVAVDLRDEWQIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDT
LASPGSHVAVEEATPLDPCEFAAKLERERAANAQGDPRRFFQMVYNERWA
RATEWFDERGWRATATPLAEYLRRVGRAVPEADTEAAPMVTAITFVSAVR
TGLVADPARTSPSSTSIGFKRFEAD
>Mb0919 Mb0919, CONSERVED HYPOTHETICAL PROTEIN
MRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAADALFLWAETPTRPLH
VGALAVLSQPDNGTGRYLRKVFSAAVARQQVAPWWRRRPHRSLTSLGQWS
WRTETEVDLDYHVRLSALPPRAGTAELWALVSELHAGMLDRSRPLWQVDL
IEGLPGGRCAVYVKVHHALADGVSVMRLLQRIVTADPHQRQMPTLWEVPA
QASVAKHTAPRGSSRPLTLAKGVLGQARGVPGMVRVVADTTWRAAQCRSG
PLTLAAPHTPLNEPIAGARSVAGCSFPIERLRQVAEHADATINDVVLAMC
GGALRAYLISRGALPGAPLIAMVPVSLRDTAVIDVFGQGPGNKIGTLMCS
LATHLASPVERLSAIRASMRDGKAAIAGRSRNQALAMSALGAAPLALAMA
LGRVPAPLRPPNVTISNVPGPQGALYWNGARLDALYLLSAPVDGAALNIT
CSGTNEQITFGLTGCRRAVPALSILTDQLAHELELLVGVSEAGPGTRLRR
IAGRR
>Mb0921c Mb0921c, PROBABLE OXIDOREDUCTASE
MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGGAAVSIQ
AFDGVEVALSRYSYLVSLLPSRIVADLGAPVRLARRPFSSYTPAPATAGR
SGLLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPL
RTREQARRDIVEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATD
ALIGTFARMHEPSLMQNICFLYHLVGGGTGVWHVPIGGMGSVTSALATAA
ARHGAEIVTGADVFALDPDGTVRYHSDGSDGAEHLVRGRFVLVGVTPAVL
ASLLGEPVAALAPGAQVKVNMVVRRLPRLRDDSVTPQQAFAGTFHVNETW
SQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAGAQTLTVFG
LHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIET
TTTLDLQRTLGMTGGNIFHGALSWPFADNDDPLDTPARQWGVATDHERIM
LCGSGARRGGAVSGIGGHNAAMAVLACLASRRKSP
>Mb0937c Mb0937c, POSSIBLE DIOXYGENASE
MDITIVGKYLSTLPEDDDHPYRTGPWRPQTTEWDADDLTTVTGEVPADLD
GIYLRNTENPLHPAFATYHPFDGDGMIHVVGFRDGKAFYRNRFIRTDGFL
AENEAGGPLWPGLAEPVQLAKREHGWGARGLMKDASSTDVIVHRGIALTS
FYQCGDLYRIDPYSANTLGKESWHGRFPFDWGVSAHPKVDNKTGELLFFN
YSKQEPYMRYGVVDQNNELVHYVDVPLPGPRLLHDMAFTENYVILNDFPL
FWDPRLLERDVHLPRFYPEIPSRFAVVARRGNDIRWFEADPTFVLHFTNA
YEQGDEIVLDGFYEGDPQPLDTGGTKWEKLFRFLALDRLQSRLHRWRLNM
VTGAVHEEQLSESITEFGTINADYAASSYRYTYAATGKPSWFLFDGLVKH
DLLTGNHECYSFGDGVYGSETAMAPRVGSSAEDDGYLVTLTTDMNDDASY
CLVFDAARPGDGPICKLALPERISSGTHSAWVPGAELRRWDHAESPAAAV
GL
>Mb0950c Mb0950c, PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTSSELDAV
AEQIRAAGRRAHTVAADLAHPEVTAQLAGQAVGAFGKLDIVVNNVGGTMP
NTLLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMG
RLAARGFAAYGTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVV
AANDELRAPMEQATPLRRLGDPVDIAAAAVYLASPAGSFLTGKTLEVDGG
LTFPNLDLPIPDL
>Mb0970 Mb0970, PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRLTELKAE
LSQRYPDIKIAVAELDVNDHERVPKVFAELSDEIGGIDRVIVNAGIGKGA
RLGSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVK
GVPGVKAAYAASKAGVRSLGESLRAEYAQGPIRVTVLEPGYIESEMTAKS
ASTMLMVDNATGVKALVAAIEREPGRAAVPWWPWAPLVRLMWVLPPRLTR
RFA
>Mb1079 Mb1079, PROBABLE OXIDOREDUCTASE
MARQRFRDQVVLITGASSGIGEATAKAFAREGAVVALAARREGALRRVAR
EIEAAGGRAMVAPLDVSSSESVRAMVADVVGEFGRIDVVFNNAGVSLVGP
VDAETFLDDTREMLEIDYLGTVRVVREVLPIMKQQRSGRIMNMSSVVGRK
AFARFAGYSSAMHAIAGFSDALRQELRGSGIAVSVIHPALTQTPLLANVD
PADMPPPFRSLTPIPVHWVAAAVLDGVARRRARVVVPFQPRLLMVGDAFS
PRYGDRVVRLLESKIFGRLIGSYRGSVYRHQPTESAKAQAAQPERGYSSA
R
>Mb1176 Mb1176, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVGGLGDRA
RFAQADVTDEAAVSNALELADSLGPVRVVVNCAGTGNAIRVLSRDGVFPL
AAFRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQ
IGQAAYSASKGGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAE
AKASLGQQVPHPSRLGNPDEYGALVLHIIENPMLNGEVIRLDGAIRMAPR
>Mb1178 Mb1178, CONSERVED HYPOTHETICAL PROTEIN
MTSGAAASASRVDHPLFARIWPVVAAHEAEAIRALRRENLAGLSGRVLEV
GAGVGTNFAYYPVAVEQVIAMEPEPRLAAKARIAAADAPVPIVVTDKTVE
EFRDTETFDAVVCSLVLCSVSDPGAVLAHLRSLLRRGGELRYLEHVASAG
ARGRVQRFVDATFWPRLAGNCHTHRHTERAILDAGFVVDSSRREWAFPAW
VPLPVSELALGRAHRT
>Mb1218c Mb1218c, CONSERVED HYPOTHETICAL PROTEIN
MRIAGVGLGQLLLALDATVVSLVDAPRGLDLPVASTALIDSDDVRLGLAA
AAGSADVFFLIGVTDDEAVRWVDDQARQRAPVAIFVKHPSDSVVAGAVRA
GSAVVAVEPRARWERLYHLVNHVLEHHGDRADPTDDSGTDLFGLAQSLAD
RIHGMISIEDAQSHVLAYSASNDEADELRRLSILGRAGPPEHLQWIGQWG
IFDALRAGREVVRVAERPELGLRPRLAIGIHQPGVGALRPPVFAGTIWVQ
QGSQPLADDAEEMLRGAAVLAARIMSRLATQPNTHALRVQQLLGLAELNA
TTAPVDVSTIARELGVAAEGNATLIGFDTAENRDTAVRHVRLVDVMALSA
SAFRHDAQVAANGSRIYVLLPQTTTGRAVTSWVRGTISALRAELGVALRA
AIAGPVAGLAEVNPARVEVDRVLESAERHPILGQVTSLAEARTTVLLDEI
VTLVGTDQRLVDPRIRDLGAQDPVLAQTLRAYLDAFGDIGAAARSLQVHP
NTVRYRIRRIEQLLSTSLGDPDVRLLFSLGLRAMERTA
>Mb1226c Mb1226c, CONSERVED HYPOTHETICAL PROTEIN
MAWQQPSPRIRELIREGARIALNPSPEWIEELDRATIAANPAIANDPVLA
KVVQTANRANLVYWAAANLRDPGARVPANLGTEPLRMARDLVRRGLDTVA
FNIYRTGEHIGWRFWMGIAFELTSDPQELRELLDVSARSVNDFIEATLTG
IAAQVQSEHDELTRSTHAERLEVVGLILDGAPISPERAEAKLGYPLSRAH
TAAIIWSDELDGDHSYLDRAADLFCHAVGSTRPLTVVAGAASRWAWVTDA
DGLDIDTVQAAVDNAPGARIAIGTTANGVEGFRRSHLEALITQRTLSRLR
STQRVAFFADVKMVALISQNPDAASEFITSTLGDLESASPDLQTALLTFI
NEQCNASRAAKRLHTHRNTFLRRLESAQRLLPRPLDHTSVHVAVALEALQ
WRGNKAHALSSPGRRSNSVPA
>Mb1277c Mb1277c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MEGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLADTEHRL
KAISTPVKTDRLDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIE
VSQFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVINISSVFGLFSAP
GQAAYNSAKFAVRGFTEALRQEMALAGHPVKVTTVHPGGVKTAIARNATA
AEGLDQAELAETFDKRVAHLSPQRAAQIILTGVAKNKARVLVGVDAKVLD
LVVRLTGSGYQRIFPIITGRLIPRPR
>Mb1368 Mb1368, PROBABLE HYDROLASE
MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGAVDCRGG
APGTRETDLLDPANSVRFVDALLLAGGSAYGLAAADGVMRWLEEHRRGVA
MDSGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVG
VGARAGALKGGVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADL
VGEFALRAPPAEQIAALAQLSSPLGAFNTPFNTTIGVIACDAALSPAACR
RIAIAAHDGLARTIRPAHTPLDGDTVFALATGAVAVPPEAGVPAALSPET
QLVTAVGAAAADCLARAVLAGVLNAQPVAGIPTYRDMFPGAFGS
>Mb1379 Mb1379, PROBABLE ACYL CARRIER PROTEIN (ACP)
MWRYPLSTRLALPNTPGVASFAMTSSPSTVSTTLLSILRDDLNIDLTRVT
PDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIA
AKYRDE
>Mb1406 Mb1406, CONSERVED HYPOTHETICAL PROTEIN
MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPRRVVNQS
DAADRVAELFLDPGQRERIPRVYQKSRITTRRMAVDPLDAKFDVFRREPA
TIRDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPG
VDVAIVKELGLSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVV
CIELCSVNAVFADDINDVVIHSLFGDGCAALVIGASQVQEKLEPGKVVVR
SSFSQLLDNTEDGIVLGVNHNGITCELSENLPGYIFSGVAPVVTEMLWDN
GLQISDIDLWAIHPGGPKIIEQSVRSLGISAELAAQSWDVLARFGNMLSV
SLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIRR
>Mb1412c Mb1412c, PUTATIVE TRANSFERASE
MPGIDFDALYRGESPGEGLPPITTPPWDTKAPKDNVIGWHTGGWVHGDVL
DIGCGLGDNAIYLARNGYQVTGLDISPTALTTAKRRASDAGVDVKFAVGD
ATKLTGYTGAFDTVIDCGMFHCLDDDGKRSYAASVHRATRPGATLLLSCF
SNAMPPDEEWPRSTVSEQTLRDVLGGAGWDIESLEPATVRRELDGTEVEM
AFWNVRAQRRGS
>Mb1438c Mb1438c, PUTATIVE METHYLTRANSFERASE
MTVYTPTSERQAPATTHRQMWALGDYAAIAEELLAPLGPILVSTSGIRRG
DRVLDVAAGSGNVSIPAAMAGAHVTASDLTPELLRRAQARAAAAGLELGW
REANAEALPFSAGEFDAVLSTIGVMFAPRHQRTADELARVCRRGGKISTL
NWTPEGFYGKLLSTIRPYRPTLPAGAPHEVWWGSEDYVSGLFRDHVSDIR
TRRGSLTVDRFGCPDECRDYFKNFYGPAINAYRSIADSPECVATLDAEIT
ELCREYLCDGVMQWEYLIFTARKC
>Mb1440c Mb1440c, PUTATIVE METHYLTRANSFERASE
MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAAAGIGPG
VRVLDVAAGSGNISLPAAKTGATVISTDLTPELLQRSQARAAQQGLTLQY
QEANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVI
SWTCEGFFGRMLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLK
TARGLLEVKRFDTAQAVHDYFKNNYGPTIEAYAHIGDNAVLAAELDRQLV
ELAAQYLSDGVMEWEYLLLTAEKR
>Mb1464 Mb1464, CONSERVED HYPOTHETICAL PROTEIN
MAEAGGGPISVIARHMQLIRDDFISELFDKMKAEIRGLDYDARMADLWRA
SITENFVTAVHYLDRDTPQSLVEAPAAALAYARAAAQRDIPLSGLVRAHR
LGHARFLEVAMQYVSLLEPADRVSTIIELVNRSARLVDLVADQLIVAYEH
EHDRWLSRRSGLQQQWVSELLADTPVDVPRAERALGYRLDGVHIAAVVWV
DSAVPIGDVVAQFDQVRCLLAGELGPELGPVANSLMVPTDEREARLWFSP
APTRAFAPSRIRAAFESAGIRARLACGRVGDGLRGFRASLKQAERVKALA
LAGGARPGGRVMFYDDVAPVALLADDLEELRRFVTDVLGDLSVDDERNSW
LRETLREFLLRNRSYVATADAMILHRNTIQYRVIQAMELCGQNLDDPDAA
FRVQMALEVCRWMAPAVLRAKQ
>Mb1467 Mb1467, PROBABLE DEHYDROGENASE
MTTAVVVGAGPNGLAAAIHLARHGVDVQVLEARDTIGGGARSGELTVPGV
IHDHCSAFHPLGVGSPFWAAIDLQRYGLTWKWPDVDCAHPLDDGTAGVLY
RSIEATAAGLGPDGKRWQRAVGDLAAGFDELAEDLLRPVLNMPRHPIRLA
RFGPRAALPATAMARRFHTERARALFGGAAAHVYTRLDRPLTASLGLMIL
ASGHRHGWPVARGGSGSITKALAAALDAYGGTVATGVTVTSRRDIPDADI
VMLDLSPAAVLGIYGDVMPTRINRSYRRYRAGSSAFKVDFAIEGDVGWTN
PDCRRAGTVHLGGTFAEIADTERQRAQGTMVQRPFVLVGQQYLADPSRSV
GNINPIWAYAHVPFGYTGDATAAVIDQIERFAPGFRDRIVATVSTSTTEL
QTYNRNFIGGDIIGGANDRLQVIFRPRVAVDPYAIGVPGVYLCSQSAPPG
AGIHGLCGYHAAESALRWLRKRR
>Mb1488 Mb1488, POSSIBLE TRANSCRIPTIONAL ACTIVATOR PROTEIN
MALRETSPRIHELIREAARIALNPTQEWLDEFDRAILAANPSIAADPALA
TVVKRSNRAHLIHFAAANLRNPGAPVPANLGPEPLRMARDLVRVGLDALA
LDIYRIGQNVAWRRWTDIAFGLTSDPDELHELLDVPFRTANEFVDTTLAG
ITTEMQLERDKLTRDVPAERRKIVQLLIDGAPISREHAEARLGYPLDRSH
TAAVIWGDQAQGDHSHLDRVADAFGHAGGCPHPLVVVAGAATRWVWVKDA
PGFDIDLIHEVLHDIPDARIAIGATAPGIEGFRRSHRDALTTARMIIRLE
SPHRVAFFTDVEMVALLTENAEGADDFIQRTLGNLESASPALKTTLLTFI
NQQCNASRAARLLFTHRNTLMNRLETAQRLLPRPLADTTIHVAVALEAQQ
WREKQTSDPPAKKESNGTKMR
>Mb1535c Mb1535c, PROBABLE METHYLTRANSFERASE
MLDVGCGSGRMALPLTGYLNSEGRYAGFDISQKAIAWCQEHITSAHPNFQ
FEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPPDVEH
YLDEISRVLKPGGRCLCTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIH
KKRPEEAIGLPETFVRDVYGKFGLAVHEPLHYGSWSGREPHLSFQDIVIA
TKTAS
>Mb1539 Mb1539, CONSERVED HYPOTHETICAL PROTEIN
MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMYRVQERI
LTEIGKERLARAGELGVLRLMMKYDPHFFTFLEIPEVLSIVDRVLSETAI
LHLQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFT
RDTGATLVVPGSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAG
RNTSGKDRLAINHQFTRSFFKQQIDYVRALGDAVVLEQPARTQQLLGWYS
RVVTNLDEYYQPPDKRLYRKGQG
>Mb1550 Mb1550, Probable methyltransferase
MTITALTVTLPLLWRRLTTAGVKYADQGHFVGSAGVPAADAGGRDAASEQ
IARWTQTCTVVLVCGHGPAKWAFRSWCTSRSCDTLPVALRYRLQSNPLVG
KLTTKYFLPLGTRQVGDHVVFFNFGYEEDPPMALPLSESDEPNRYCIQLY
HQTASQVDLTGKEVLEVSCGAGGGASYIARNLGPASYTGLDLNPASIDLC
RAKHRLPGLQFVQGDAQNLPFPDESFDAVVNVEASHQYPDFRGFLAEVAR
VLRPGGHFLYTDSRRNPVVAEWEAALADAPLRTISQRDIGAQAKRGLDAN
TARSQEAIGRRAPVLLAGLTRCAVRVLDWDLRRGGGFSYRIYLFAKD
>Mb1559c Mb1559c, CONSERVED HYPOTHETICAL PROTEIN
MSDPLTAQEQHKRRQAVRELMPRTPFIGGLGIVFERYEPDDVVIRLPFRT
DLTNDGTYFHGGVIASVMDTAGAAAAWSNHDFDRGTRAATVAMSIQYTGA
AKRCDLLCHARTARRRKELTFTEITATDPDGNIVAHAVQTYRIV
>Mb1570 Mb1570, POSSIBLE FATTY ACYL-COA REDUCTASE
MNLGDLTNFVEKPLAAVSNIVNTPNSAGRYRPFYLRNLLDAVQGRNLNDA
VKGKVVLITGGSSGIGAAAAKKIAEAGGTVVLVARTLENLENVANDIRAI
RGNGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNAGRSIRRSLE
LSYDRIHDYQRTMQLNYLGAVQLILKFIPGMRERHFGHIVNVSSVGVQTR
APRFGAYIASKAALDSLCDALQAETVHDNVRFTTVHMALVRTPMISPTTI
YDKFPTLTPDQAAGVITDAIVHRPRRASSPFGQFAAVADAVNPAVMDRVR
NRAFNMFGDSSAAKGSESQTDTSELDKRSETFVRATRGIHW
>Mb1623 Mb1623, HYPOTHETICAL PROTEIN
MARTFEDLVAEAASASVGGWDFSWLDGRATEERPSWGYQRQLSQRLANAT
AALDLETGGGEVLAGAGNFPPTMVATEAWPPNAAMATRRLHPLGAVVVIT
GDKPPLPFADAAFDLVTSRHPSTRWWTEIARVLRAGGSYFAQHVGPATLW
DLREHFLGPREHNGADQYAQVVRTCITDAGLEIVDLQMERLRVEFFDVGA
VIYFLRKVIWFLPDFTVEGYHDRLRALHERIQAEGPFVTYSTRALIEARK
PS
>Mb1710 Mb1710, Possible long-chain acyl-CoA synthase
MVDLNFSMVTRPIERLVATAQNGLEVLRLGGLETGSVPSPSQIVESVPMY
KLRRYFPPDNRPGQPPVGPPVLMVHPMMMSADMWDVTREDGAVGILHASG
LDPWVIDFGSPDEVEGGMRRNLADHIVALSEAVDTVKDATGHDVHFVGYS
QGGMFCYQAAAYRRSKDIASVVAFGSPVDTLAALPMGIPANMGAAVADFM
ADHVFNRLDIPSWMARMGFQMMDPLKTAKARVDFVRQLHDREALLPREQQ
RRFLESEGWIAWSGPAISELLKQFIAHNRMMTGGFAISGQMVTLTDITCP
ILAFVGEVDDIGQPASVRGIRRAAPNSEVYECLIRAGHFGLVVGSRAAQQ
SWPTVADWVRWISGDGTKPENIHLMADQPAEHTDSGVAFSSRVAHGIGEV
SEAALALARGAADAVVAANRSVRTLAVETVRTLPRLARLGQLNDHTRISL
GRIIDEQAHDAPKGEFLLFDGRVHTYEAVNRRINNVVRGLIAVGVRQGDR
VGVLMETRPSALVAIAALSRLGAVAVVMRPDTDLSASVRLGRVTEILTDP
TNLDAARQLPGQVLVLGGGESRDLDLPADALEQGQVIDMEKIDPDAVELP
AWYRPNPGLARDLAFIAFSSADGDLVAKQITNYRWAVSAFGTASTAALGR
RDTVYCLTPLHHESALLVSLGGAVVGGTRIALSRGLRPDRFVAEVRQYGV
TVVSYTWAMLRDVVDDPAFVLHGNHPVRLFIGSGMPTGLWERVVEAFAPA
HVVEFFATTDGQAVLANVAGAKIGSKGRPLPGAGRVELGAYDAEHDLILE
NDRGFVQVAGVNQVGVLLAQSRGPIDPTASVKRGVFAPADTWISTDYLFW
RDDDGDYWLAGGRGSVVRTARGMVYTEPVTNALGLITGVDLAVTYGVLVR
GRHVAVSAVTLLPGATITAADLTEAVASMPVGLGPDIVHVVPQLTLSGTY
RPTVSALRANGIPKAGRQAWYFNSGGNEYRRLTPAVRTELTGQHRRGNA
>Mb1741 Mb1741, Probable oxidoreductase
MEEMALAQQVPNLGLARFSVQDKSILITGATGSLGRVAARALADAGARLT
LAGGNSAGLAELVNGAGIDDAAVVTCRPDSLADAQQMVEAALGRYGRLDG
VLVASGSNHVAPITEMAVEDFDAVMDANVRGAWLVCRAAGRVLLEQGQGG
SVVLVSSVRGGLGNAAGYSAYCPSKAGTDLLAKTLAAEWGGHGIRVNALA
PTVFRSAVTEWMFTDDPKGRATREAMLARIPLRRFAEPEDFVGALIYLLS
DASSFYTGQVMYLDGGYTAC
>Mb1758c Mb1758c, CONSERVED HYPOTHETICAL PROTEIN
MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEPLVRAVG
LDLFTKMMDGELDMSTIADVSPAVAQAMVYGNAVRTKYFDDYLLNATAGG
IRQVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPS
AIRRAVPIDLRADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFD
NITALSAPGSMVATEFVTGIADFSAERARTISNPFRCHGVDVDLASLVYT
GPRNHVLDYLAAKGWQPEGVSLAELFRRSGLDVRAADDDTIFISGCLTDH
SSISPPTAAGWR
>Mb1878 Mb1878, CONSERVED HYPOTHETICAL PROTEIN
MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLLQLTGVV
HGGVYCAMIESIASMAAFAWLNSHGEGGSVVGVNNNTDFLRSISSGMVYG
TAEPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP
>Mb1887c Mb1887c, POSSIBLE OXIDOREDUCTASE
MAVEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAKELDVDA
VVCDTTDPTSLTEARGLFPRHLDTIVNVPAPSWDAGDPRAYSVSDTANAW
RNALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALS
NWIAGQAAVFGTRGITINTVACGRSVQTGYEGLSHTPAPVAAEIARLALF
LTTPAARHITGQTLHVSHGALAHFG
>Mb1896c Mb1896c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE
MPGRTSIGVKIRDKVQDKVIAITGGARGIGLATAAALHNLGAKVAIGDID
EAMAKESGADLDLDMYGKLDVTDPDSFSGFLDAVERQLGPIDVLVNNAGI
MPVGRIVDEPDPVTRRILDINVYGVILGSKLAAQRMVPRGRGHVINVASL
AGEIYAVGVATYCASKHAVVAFTDSARLEYRSAGVKFSMVLPSFVNTELI
AGTGGIKGFKNAEPADIADAIVGLIVHPKPRVRVTKAAGSMIVAQRFMPR
QVSEGLNRLLGGEHVFTDDVDMEKRRTYEARARGEE
>Mb1914c Mb1914c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MKAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQLGAERL
WARAVDVTDKAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYE
AAVRVVDVNFKAVLTGAYAALPYLKKAPGSLMFSTSSSSGTYGMPRIAVY
SATKHAVKGLTEALSVEWQRHGVRVADVLPGLIDTAILTSTRQHSDEGPY
TISAEQIRAAAPKKGMFRLMPSSSVAEAAWRAYQHPTRLHWYVPRSIRWI
DRLKGVSPEFVRRHIAKSLATLEPKRK
>Mb1921c Mb1921c, CONSERVED HYPOTHETICAL PROTEIN
MVPVDLRRDWPTPLRQAGFDPNQPSAWLAEGLLAFLPPDAQDRLLDNITA
LSAPGSR
>Mb1922c Mb1922c, CONSERVED HYPOTHETICAL PROTEIN
MPRTNNDAWDLATSVGATATMVAAARAVATRADNPLIDDPFAEPLVRAVG
IDFFTRWAAGNIKATDVDDPDGTWGLQRLADLLAARTRYFDAFFRDATSA
GIRQAVILASGLDARAYR
>Mb1931c Mb1931c, CONSERVED HYPOTHETICAL PROTEIN
MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLVQDEYAK
HFITASADPYLEGLLANPRTSEDGTAFPRLYGVQTRFFDDFFNCADEAGI
RQAVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKA
HRVAVPADLRTDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFAR
IDELCAPGSRVALGALGSRLDHEQLAALETAHPGVNMSGDVNFSALTYDD
KTDPVEWLVEHGWAVDPVRSTLELQVGYGLTPPDVDVKIDSFMRSQYITA
VRA
>Mb1963c Mb1963c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MSVLDLFDLHGKRALITGASTGIGKRVALAYVEAGAQVAIAARHLDALEK
LADEIGTSGGKVVPVCCDVSQHQQVTSMLDQVTAELGGIDIAVCNAGIIT
VTPMLDMPLEEFQRLQNTNVTGVFLTAQAAAKAMVKQGQGGVIINTASMS
GHIINVPQQVSHYCASKAAVIHLTKAMAVELAPHKIRVNSVSPGYILTEL
VEPYTEYQPLWEPKIPLGRLGRPEELAGLYLYLASEASSYMTGSDIVIDG
GYTCP
>Mb1976 Mb1976, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGDAADAAAT
KIGCGAAACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLID
TTVEDFDRVIAINLRGAWLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGG
TGAYGMSKAGIIQLSRITAAELRSSGIRSNTLLPAFVDTPMQQTAMAMFD
GALGAGGARSMIARLQGRMAAPEEMAGIVVFLLSDDASMITGTTQIADGG
TIAALW
>Mb2000 Mb2000, CONSERVED HYPOTHETICAL PROTEIN
MGEANIREQAIATMPRGGPDASWLDRRFQTDALEYLDRDDVPDEVKQKII
GVLDRVGTLTNLHEKYARIALKLVSDIPNPRILELGAGHGKLSAKILELH
PTATVTISDLDPTSVANIAAGELGTHPRARTQVIDATAIDGHGHSYDLAV
FALAFHHLPPTVACKAIAEATRVGKRFLIIDLKRQKPLSFTLSSVLLLPL
HLLLLPWSSMRSSMHDGFISALRAYSPSALQTLARAADPGMQVEILPAPT
RLFPPSLAVVFSRSSSAPTESSECSADRQPGE
>Mb2013c Mb2013c, POSSIBLE DEHYDROGENASE (FRAGMENT)
MGRLEGKVAFITGVARGQGRSHAVRLADGQARALGKVDVEACGALVGEVE
VWGRDVRDDRRVFVESPADEFGACRRVARQGIRVVGLPVSQRELVEPEAG
CAARRSAAGSQ
>Mb2026c Mb2026c, CONSERVED HYPOTHETICAL PROTEIN
MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADCCWNQLA
VTPDTRMPASSAAGRDAAAYDAWYDSPTGRPILATEVAALRPLIEVFAQP
RLEIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVS
RHFGAVLMAFTLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLY
ALRAARGQPGYRDARFYTAAELEQLLADSGFRVIARRCTLHQPPGLARYD
IEAAHDGIQAGAGFVAISAVDQAHEPKDDHPLESE
>Mb2080 Mb2080, CONSERVED HYPOTHETICAL PROTEIN
MTTIEIDAPAGPIDALLGLPPGQGPWPGVVVVHDAVGYVPDNKLISERIA
RAGYVVLTPNMYARGGRARCITRVFRELLTKRGRALDDILAARDHLLAMP
ECSGRVGIVGFCMGGQFALVLSPRGFGATAPFYGTPLPRHLSETLNGACP
IVASFGTRDPLGIGAANRLRKVTAAKNIPADIKSYPGAGHSFANKLPGQP
LVRIAGFGYNEAATEDAWRRVFEFFGQHLRAGSPGEP
>Mb2093c Mb2093c, CONSERVED HYPOTHETICAL PROTEIN
MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSHRILWPD
REYRPDLDILIAGCGTNQAAIFAFTNRAAKVVAIDISRPALDHQQYLKDK
HGLANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLR
RDGVVAAMLYGKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTY
HPLRNYLTKARDLLSDSALVDTFLHGRQRSYTVEECVDLVTSAGLVFQGW
FHKAPYYPHDFFVPNSEFYAAVNTLPEVKAWSVMERLETLNATHLFMACR
RDRPKEQYTIDFSTVAALDYVPLMRTRCGVSGTDMFWPGWRMAPSPAQLA
FLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQSLWRLDFVA
VALPASG
>Mb2153c Mb2153c, Probable oxidoreductase
MTSLQGKVVFITGAARGIGAEVARRLHNKGAKLVLTDLSKSELAVMGAEL
GGDDRLLTVVADVRDLPAMQAAAETAVERFGGIDVVVANAGIASYGSVLK
VDPQAFRRVLDVNLLGNFHTVRATLPALIDRRGYVLIVSSLAAFAAPPGM
APYNMSKAGNEHFANALRLEVAHLGVSVGSAHMSWIDTALVRDTKADLPA
FAELLARLPWPLNKTTSVNKCAAAFVNGIEGRKDRVYCPGWVALFRWLKP
LLSTRVGQRPIRNTVAKLMPQMDAEVAALGRFASAYTESLENS
>Mb2266 Mb2266, CONSERVED HYPOTHETICAL PROTEIN
MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSAMQERLP
FFADLEASQRASVALVVQTAVVNFVEWMHDPHSDVGYTAQAFELVPQDLT
RRIALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFT
AATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAP
ATVLVGTPAPGPNGSNSDGDSERASQDVRDTAARHGRAALTDVHGTWLVA
IVSGQLSPTEKFLKDLLAAFADAPVVIGPTAPMLTAAHRSASEAISGMNA
VAGWRGAPRPVLARELLPERALMGDASAIVALHTDVMRPLADAGPTLIET
LDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQPRDAYVLR
VAATVGQLNYPTPH
>Mb2282c Mb2282c, Possible transcriptional regulatory protein
MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLPPATSME
IAEAAGLEERYVREWLGGMTTGQIVEYDAGSSTYSLPAHRAGMLTRAAGP
DNLAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVF
DAALIDVVLPLVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTG
IDFSDEAVAAGTEEAARLGLANATFERHDLAELDKVGAYDVITVFDAIHD
QAQPARVLQNIYRALRPGGVLLMVDIKASSQLEDNVGVPLSTYLYTTSLM
HCMTVSLALDGAGLGTVWGRQLATSMLADAGFTDVTVAEIESDVLNNYYI
ARK
>Mb2286 Mb2286, Possible oxidoreductase
MAKDLVATVPDLSGKLAIITGANSGLGFGLARRLSAAGADVIMAIRNRAK
GEAAVEEIRTAVPDAKLTIKALDLSSLASVAALGEQLMADGRPIDLLINN
AGVMTPPERVTTADGFELQFGSNHLGHFALTAHLLPLLRAAQRARVVSLS
SLAARRGRIHFDDLQFERSYAPMTAYGQSKLAVLMFARELDRRSRAAGWG
IISNAAHPGLTKTNLQIAGPSHGRDKPALMERLYKTSWRFAPFLWQEIEE
GILPALYAAATPQADGGAFYGPRGRYEVAGGGVREAKVPAAARNDADSKR
LWEVSEQLTGVSYPKSR
>Mb2391c Mb2391c, CONSERVED HYPOTHETICAL PROTEIN
MVLPKPTPRGRELIRQAAKVALHPTPEWLDELDRATLAAHPSIAADPALA
TVVSRANRSHLIHFATANLRKPGQPVPANLGPDPLRMARDLVRRGLDASA
LDVYRVGQNVAWQRWTEIAFGLTTDPQELHELLTLPFRSASEFIDATLAG
LAAQMQLEYDELTRDVHAEHRRIVELILDGAPISRQSAEAKLGYPLDRSH
TAAIIWYDDPDDNQNHLDHTARAFGRALGCPQPLIAVASAATRWVWVSDA
ATLDTDRIHQVLDHAPHARIAVGTTARGIDGFRRSHRDALATQRMLARLR
SQQRLAFFADIHMIAVLTENPDSAADFITSTLGDLESASPQLLTTVLTYI
NEQCNASRAAHVLHTHRNTLLRRLETAQRLLPRPLDHTIIQVAVAISALQ
WRGSQTSDPVETPVEGITSPPPESLGRRRSRLAQLER
>Mb2446 Mb2446, HYPOTHETICAL PROTEIN
MDNLPIESAESTRLAKAAMTRRFYTRSVVKGEITLPAVPSMIDEYVTMCA
GLFAGVGRKFSDEELAHLRAVLQGQLAEAYAASQRSTIVISYNAPMGPTL
HYQVRAQWRTVAQEYENWIATREPPLFGTEPDARVWALANEAADPTTHRV
LEIGAGTGRNALALARRGHPVDVVEMTPKFADIIRSDAERDSLDVRVIMR
DVFSTMDDLRQDYQLMVLSEVVPDFRTTQQLRNLFELAAQCLAPGARLVF
NAFLANGDYAPDQAAREFGQQMYTGMCTRAEMSAAAAGLPLELVADDSVY
DYEKTHLPPGAWPPTSWYADWIRGLDVFTTNVESCPIEMRWLVFQRRR
>Mb2493c Mb2493c, CONSERVED HYPOTHETICAL PROTEIN
MLEKAPQKSVADFWFDPLCPWCWITSRWILEVAKVRDIEVNFHVMSLAIL
NENRDDLPEQYREGMARAWGPVRVAIAAEQAHGAKVLDPLYTAMGNRIHN
QGNHELDEVITQSLADAGLPAELAKAATSDAYDNALRKSHHAGMDAVGED
VGTPTIHVNGVAFFGPVLSKIPRGEEAGKLWDASVTFASYPHFFELKRTR
TEPPQFD
>Mb2655 Mb2655, POSSIBLE METHYLTRANSFERASE (METHYLASE)
MANKRGNAGQPLPLSDRDDDHMQGHWLLARLGKRVLRPGGVELTRTLLAR
AEVTDADVLELAPGLGRTAAEILARNPRSYVGAESDPNAANLVRHVLAGR
GDVRVTDAADTGLSDASADVVIGEAMLTMQGNAAKHTIVAEAARVLRPGG
RYAIHELALVPDDVAEQVRTDLRQSLARALKVNARPLTVAEWSHLLAGHG
LVVEHVVTASMALLQPRRVIADEGLLGALRFAGNLLIHRAARRRVLLMRH
TFRRHRERLTAVAIVAHKPHVDS
>Mb2694c Mb2694c, CONSERVED HYPOTHETICAL PROTEIN
MTAQFDPADPTRFEEMYRDDRVAHGLPAATPWDIGGPQPVVQQLVALGAI
RGEVLDPGTGPGHHAIYYAAKGYAATGIDGSVAAIERARDNARKAGVSVN
FQVGDATTLDGLDGRFDTVVDCAFYHTFSTAPELQRCYVRALRRASKPGA
RLYMFEFGEHNVNGFSMPRSLSEDDFRQVLPVGGWEITYLGTTTYQVNLS
VEALELMAARNPDMADQVRCVLERFRAIKPWLVGGRVHAPFWEVHATRVD
>Mb2760 Mb2760, CONSERVED HYPOTHETICAL PROTEIN
MAELTETSPETPETTEAIRAVEAFLNALQNEDFDTVDAALGDDLVYENVG
FSRIRGGRRTATLLRRMQGRVGFEVKIHRIGADGAAVLTERTDALIIGPL
RVQFWVCGVFEVDDGRITLWRDYFDVYDMFKGLLRGLVALVVPSLKATL
>Mb2771 Mb2771, PROBABLE DEHYDROGENASE
MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQIASVPYP
LSTADDLAATVELVEDAGGGIVARQGDVRDRASLSVALQAGLDEFGRLDI
VVANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVL
ISSAAGLVGIGSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHP
CGVDTPMINNEFFQQWLTTADMDAPHNLGNALPVELVQPTDIANAVAWLA
SEEARYVTGVTLPVDAGFVNKR
>Mb2772 Mb2772, CONSERVED HYPOTHETICAL PROTEIN
MARNPAAQTAFGPMVLAAVEQNEPPGRRLVDDDLADLFLPRPLRWLAGAT
RSAVLRRLLISASEWSGRGLWANLACRKRFIGDKLDEALGDIDAVVILGA
GLDTRAYRLTRRVRMPVFEVDLPVNIARKAKTVRRVLGELPLSVRLVALD
FEHDDLLTALAEHGYRTEYRVFFVCEGVTQYLTERAVRRTLEGLRAAAPG
SRMVFTYVRRDFIDGTNRYGTRTLYHTVRQRRQLWHFGLDPEEVAGFLAD
YGWRLTEQAGPEELVQRYVEPTGRNLNASQIEWSAYAEKSEPVTPR
>Mb2787 Mb2787, PROBABLE ALANINE RICH HYDROLASE
MPKTTDTAATPDGTCAVRLFTPDGPGRWPGVVMFPDAGGVRDTFDRMAAK
LAGFGYVVLLPDVYYREGDWAPFDMKTAFGDPQERARIMFMIGTLTPDRV
TRDADALLNYLASRPEVIGDRFGVCGYCMGGRMSVVVAGRLPDRVAAAAA
FHPGGLVANSPDSPHLLADRISATVYIGGAENDPSFTADHAEKLDKAFSA
AGVPHRIECYPAAHGFAVPDNPSYDAAADERHWAAMTETFGAALN
>Mb2788c Mb2788c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVVLTARRQEAADEAAA
QVGDRALGVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLL
EQDHARFAKIFDVNLWAPLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSP
AMGMYNATKAALIHVTKQLALELSPRIRVNAICPGVVRTRLAEALWKDHE
DPLAATIALGRIGEPADIASAVAFLVSDAASWITGETMIIDGGLLLGNAL
GFRAAPSTEH
>Mb2817c Mb2817c, CONSERVED HYPOTHETICAL PROTEIN
MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARSVAKRRN
EFITVRHCARIALDQLGVPPAPILKGDKGEPCWPDGVVGSLTHCAGYRGA
VVGRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDR
ILFCAKEATYKAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGS
TLSGPPLTTLRGRWSVERGLVLTAIVL
>Mb2882c Mb2882c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MMDLSQRLAGRVAVITGGGSGIGLAAGRRMRAEGATIVVGDVDVEAGGAA
ADELSGLFVPTDVCDEDAVNGLFDGAAETYGRIDIAFNNAGISPPEDNLI
ENTELAAWQRVQDVNLKSVYLCCRAALRHMVLAGKGSIVNTASFVAVMGS
ATSQISYTASKGGVLAMSRELGVQFARQGIRVNALCPGPVNTPLLQELFA
KNPERAARRMVHVPLGRFAEPDEIAAAVAFLASDDASFITASTFLVDGGI
SSAYVTPL
>Mb2937c Mb2937c, POSSIBLE D-AMINO ACID AMINOHYDROLASE
MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGVVATVAA
GALDETGCPEVVDAAGKWVVPGFIDVHTHYDAEVLLDPGLRESVRHGVTT
VLLGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYI
EAIDALPLGPNVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLL
DEALEAGMLGMSGMDAAIDKLDGDRFRSRALPSTFATWRERRKLISVLRH
RGRILQSAPDVDNPVSALLFFLASSRIFNRRKGVRMSMLVSADAKSMPLA
VHVFGLGTRVLNKLLGSQVRFQHLPVPFELYSDGIDLPVFEEFGAGTAAL
HLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDAVIVECPDK
SLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNK
LAAEPSVHMGFSDAGAHLRNMAFYNFGLRLLKRARDADRAGQPFLSIERA
VYRLTGELAEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYY
GGLRRMVNRNDATVVATGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRA
GELGAALSRSA
>Mb2939c Mb2939c, CONSERVED HYPOTHETICAL PROTEIN
MKRVDTIRPRSRAVRLHVRGLGLPDETAIQLWIVDGRISTEPVAGADTVF
DGGWILPGLVDAHCHVGLGKHGNVELDEAIAQAETERDVGALLLRDCGSP
TDTRGLDDHEDLPRIIRAGRHLARPKRYIAGFAVELEDESQLPAAVAEQA
RRGDGWVKLVGDWIDRQIGDLAPLWSDDVLKAAIDTAHAQGARVTAHVFS
EDALPGLINAGIDCIEHGTGLTDDTIALMLEHGTALVPTLINLENFPGIA
DAAGRYPTYAAHMRDLYARGYGRVAAAREAGVPVYAGTDAGSTIEHGRIA
DEVAALQRIGMTAHEALGAACWDARRWLGRPGLDDRASADLLCYAQDPRQ
GPGVLQHPDLVILRGRTFGP
>Mb2976 Mb2976, POSSIBLE METHYLTRANSFERASE (METHYLASE)
MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINWAYEEDP
PMDLPLEASDEPNRAHINLYHRTATQVDLGGKQVLEVSCGHGGGASYLTR
TLHPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVL
NVEASHCYPHFRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATP
LRQLSQRQINAEVLRGIGNNSQKSRDLVDRHLPAFLRFAGREFIGVQGTQ
LSRYLEGGELSYRMYCFTKD
>Mb2978c Mb2978c, HYPOTHETICAL PROTEIN
MRLPGMLRPTAERHFHSIFYLRHNARRQEHLATLGLDLGNKSVLEVGAGI
GDHTQFFLDRGCKVLCTEPRGENLDVIRQRFGSNPNVTVDHLDLDGDLPA
EAHQYDVVYCYGVLYHLSRPAEALAWMCDRAVDLLLLETCVSYSGEDEPF
LVSERASSPSQAITGTGCRPSRVWVMNRLREKMPHVYVTATQPRHRQFPL
DWRANGPIASTGLARAVFVASRAPLNLPTLVEELPMVQRRC
>Mb2979c Mb2979c, CONSERVED HYPOTHETICAL PROTEIN
MQFQDVRLMRVVVCRRLGPAKGQRRWHPLDLGTTGCFENLGAQRPTYRMR
AIRMLECAMPNRLVRSLQRWRPFGLPPHRWRLAPWYWRGLQVTLEPGSAI
AWIVRLTGGFEETEIDIAAALYSALYPDRCILDVGANVGIHSLAWARLAP
VVALEPAPGTHSRLEANVAANGLQDRIRTLRTAAGDAVGEVDFFVAADSA
FSSLNDTGRIRIRERTRVPCTTLDALAAELPLPVGLLKIDVEGLERAVIA
GAAELLRRDRPVLLVEIYGGAASNPDPERTIADIRAYGYEPFVYADDAGL
QPYQRHRDDRYCYFFIPSRKG
>Mb2980 Mb2980, CONSERVED HYPOTHETICAL PROTEIN
MKSLKLARFIARSAAFEVSRRYSERDLKHQFVKQLKSRRVDVVFDVGANS
GQYAAGLRRAAYKGRIVSFEPLSGPFTILESKASTDPLWDCRQHALGDSD
GTVTINIAGNAGQSSSVLPMLKSHQNAFPPANYVGTQEASIHRLDSVAPE
FLGMNGVAFLKVDVQGFEKQVLAGGKSTIDDHCVGMQLELSFLPLYEGGM
LIPEALDLVYSLGFTLTGLLPCFIDANNGRMLQADGTFFREDD
>Mb3017c Mb3017c, POSSIBLE 2-HYDROXYHEPTA-2,4-DIENE-1,7-DIOATE ISOMERASE (HHDD ISOMERASE)
MTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNYADHIAE
MGGRPPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACK
DVPAAQAVDNILGYTIGNDVSARDQQQSDGQWTRAKGHDTFCPVGPWIVT
DLAPFDPADLELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGD
LILTGTPAGVGPIEDGDTVSITIEGIGTLTNPVVRKGKP
>Mb3021 Mb3021, POSSIBLE ALANINE RICH DEHYDROGENASE
MDVTVVGSGPNGLATAVICARAGLNVQVVEAQATFGGGARSAADFEFPEV
LHDVCSAVHPLALASPFFAEFDLPARGVTLTVPDIAYANPLPGRPAAIAY
HDLAHTSAKLDDGASWRRLLGPLVAHSETVVEFMLSDKRSLPTALGSVLR
LGLRMLAQGTPAWRSLAGEDARALFTGVAAHAISPLPSLVSAGAGLMLAT
LAHSVGWPIPVGGTQAIADALIADLRAHGGRLAAGVEITEPQRSVVVFDT
APTALLRVYRDKLPHRYAKALRRYRFRAGIAKVDFVLSDEIPWSDPRLRR
AATLHLGGTRDQMARAEADVAAGRHADWPMVLAACPHVADPGRIDETGRR
PFWTYAHVPSGSTLDATETVTSVLERFAPGFRDIVVAARAVPAARMADHN
ANYVGGDITVGANSTWRAIAGPTPRLNPWRTPIPKVYLCSAATPPGAGVH
GMCGWYAARTLLRTEFGITRMPPLGHELRP
>Mb3056 Mb3056, CONSERVED HYPOTHETICAL PROTEIN
MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENYWFRRHQ
VVYQRLAPRCTARDVLEAGCGEGYGADLIACVARQVIAVDYDETAVAHVR
SRYPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLR
GSGLLMVSTPNRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAM
CGLFHGPRLRDMDARHGGSIIDAQIMRAVAGAPWPPELAADVAAVTTADF
EMVAAGHDRDIDDSLDLIAIAVRP
>Mb3063c Mb3063c, CONSERVED HYPOTHETICAL PROTEIN
MRARFGARAPWLVETTLLRRRAAGKLGELCPNVGVSQWLFTDEALQQATA
APVARHRARRLAGRVVHDATCSIGTELAALRELAVRAVGSDIDPVRLAMA
RHNLAALGMEADLCRADVLHPVTRDAVVVIDPARRSNGRRRFHLADYQPG
LGPLLDRYRGRDVVVKCAPGIDFEEVGRLGFEGEIEVISYRGGVREACLW
SAGLAGSGIRRRASILDSGEQIGDDEPDDCGVRPAGKWIVDPDGAVVRAG
LVRNYGARHGLWQLDPQIAYLSGDRLPPALRGFEVLEQLAFDERRLRQVL
SALDCGAAEILVRGVAIDPDALRRRLRLRGSRPLAVVITRIGAGSLSHVT
AYVCRPSR
>Mb3064c Mb3064c, CONSERVED HYPOTHETICAL PROTEIN
MTRSSNIPADATPNPHATAEQVAAARHDSKLAQVLYHDWEAENYDEKWSI
SYDQRCVDYARGRFDAIVPDEVIAQLPYDRALELGCGTGFFLLNLIQAGV
ARRGSVTDLSPGMVKVATRNGQALGLDIDGRVADAEGIPYDDDAFDLVVG
HAVLHHIPDVELSLREVVRVLKPGGRFVFAGEPTTVGDGYARTLSTLTWR
VVTNATKLPGLRGWRRPQGELDESSRAAALEALVDLHTFTPQDLQRIAHN
AGAVEVQTATEEFTAAMLGWPLRTFECTVPPGRLGWGWARFAFTSWKTLG
WVDANVWRHVVPKGWFYNVMITGVKPS
>Mb3083c Mb3083c, PROBABLE SHORT CHAIN ALCOHOL DEHYDROGENASE/REDUCTASE
MLQRGAGQYFAGKRCFVTGAASGIGRATALRLAAQGAELYLTDRDRDGLA
QTVCDARALGAQVPEHRVLDVSDYQDVAAFAADIHARHPSMDVVLNIAGV
SAWGTVDQLTHDQWSRMVAINLMGPIHVIETLVPPMVAAGRGGHLVNVSS
AAGLVGLPWHAAYSASKYGLRGLSEVLRFDLARHGIGVSVVVPGAVKTPL
VNTVEIAGVDRDDPRVNRWVERFSGHAVTPEKAADKILAGVTRNRYLVYT
SADIRALYAFKRYAWWPYTLVMRRVNVFFTRALRPGP
>Mb3112 Mb3112, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLAKTVRLA
QALGAQVKSDRLDVAEREAVLAHADAVVAHFGTVHQVYNNAGIAYNGNVD
KSEFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVP
GQSAYNAAKFAVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATV
ADGEDQQTFAEFFDRRLALHSPEMAAKTIVNGVAKGQARVVVGLEAKAVD
VLARIMGSSYQRLVAAGVAKFFPWAK
>Mb3199 Mb3199, PROBABLE SHORT-CHAIN DEHYDROGENASE/REDUCTASE
MTSLAERTALVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAIDVSDPR
VIPLQLDVTDAVSVAEAADLATDVGILINNAGISRASSVLDKDTSALRGE
LETNLFGPLALASAFADRIAERSGAIVNVSSVLAWLPLGMSYGVSKAAMW
SATESMRIELAPRGVQVVGVYVGLVDTDMGRFADAPKSDPADVVRQVLDG
IEAGKEDVLADEMSRQVRASLNVPARERIARLMGN
>Mb3251 Mb3251, POSSIBLE SHORT-CHAIN DEHYDROGENASE/REDUCTASE
MSLNGKTMFISGASRGIGLAIAKRAARDGANIALIAKTAEPHPKLPGTVF
TAAKELEEAGGQALPIVGDIRDPDAVASAVATTVEQFGGIDICVNNASAI
NLGSITEVPMKRFDLMNGIQVRGTYAVSQACIPHMKGRENPHILTLSPPI
LLEKKWLRPTAYMMAKYGMTLCALGIAEEMRADGIASNTLWPRTMVATAA
VQNLLGGDEAMARSRKPEVYADAAYVIVNKPATEYTGKTLLCEDVLVESG
VTDLSVYDCVPGATLGVDLWVEDANPPGYLPA
>Mb3351c Mb3351c, POSSIBLE METHYLTRANSFERASE
MSVQTDPALREHPNRVDWNARYERAGSAHAPFAPVPWLADVLRAGVPDGP
VLELASGRSGTALALAAHGRQVTAIDVSDVALLQLDSEAVRRGVADRLNL
VQADLGCWEPGETRFALVLSRLFWDAAIFHRACEAVMPGGVLAWESLALS
GAEAGTASAKRRVKPGEPACLLPADFTVVHEGQGNCDSAPSRIMIARRSP
LPGA
>Mb3359c Mb3359c, HYPOTHETICAL PROTEIN
MSQTPGDPEQTTATRRLSHRHTHLAAHTTPTLRRKGPPFRAEMGCFCVCS
AQVQEVAKNSLRGVPESVVMSYSYFVELPRLEDIEPGAHTDVLIANSRVD
QGRIRAAVEAVFDAHPALGTVFEPRVDTLTSRPGGGGWGWGVEPPGAAVA
EVIARHSASFDMYTGRLFAVSLLPGSPDRLVLTASRLCVDDASWQTVVED
LVRQYDESVLVPAR
>Mb3374 Mb3374, POSSIBLE METHYLTRANSFERASE (METHYLASE)
MTCSRRDMSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLDLGAGTG
KLTTRLVERGLDVVAVDPIPEMLDVLRAALPQTVALLGTAEEIPLDDNSV
DAVLVAQAWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGE
IIGRDGDPVRDRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCIT
SPAQVRTKTLDRVRQLLATHPALANSNGLALPYVTVCVRATLA
>Mb3432 Mb3432, CONSERVED HYPOTHETICAL PROTEIN
MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMTRTDNDT
WDLASSVGATATMIATARALASRAENPLINDPFAEPLVRAVGIDLFTRLA
SGELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAA
GLDTRAYRLPWPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVD
LRNDWPTALKNAGFDPARPTAFSAEGLLSYLPPQGQDRLLDAITALSAPD
SRLATQSPLVLDLAEEDEKKMRMKSAAEAWRERGFDLDLTELIYFDQRND
VADYLAGSGWQVTTSTGKELFAAQGLPPFEDDHITRFADRRYISAVLK
>Mb3440 Mb3440, PROBABLE DIOXYGENASE
MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKVVFFRGQ
HQLDDAEQLAFAGLLGTPIGHPAAIALADDAPIITPINSEFGKANRWHTD
VTFAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWA
LHTNRYDYVTTKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLA
GDFVRSFVGLDSHESRVLFEVLQRRITMPENTIRWNWAPGDVAIWDNRAT
QHRAIDDYDDQHRLMHRVTLMGDVPVDVYGQASRVISGAPMEIAG
>Mb3510c Mb3510c, CONSERVED HYPOTHETICAL PROTEIN [FIRST PART]
MSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPDLLRQLVDE
SKASEVVEPWSLRLSHPELLYHPTQSWVVDDNFDLDYHVRRSALASPGDE
RELGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMHHSLIDGYT
GQKMLARSLSTDPHDTTHPLFFNIPTPGRSPADTQDSVGGGLIAGAGNVL
DGLGDVVRGLGGRQRGR
>Mb3515c Mb3515c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MNSRAPRNLAVSSPSAQVTGRMVQNGENLFQFRREGPQVQLSFQDRTYLV
TGGGSGIGKGVAAGLVAAGAAVMIVGRNPDKLAAAVKDIEALKTGAIGYE
PADITDEEQTLRVVDAATAWHGRLHGVVHCAGGSQTIGPITQIDSQAWRR
TVDLNVNGTMYVLKHAARELVRGGGGSFVGISSIAASNTHRWFGAYGVTK
SAVDHMMKLAADELGPSWVRVNSIRPGLIRTDLVVPVTESPELSADYRVC
TPLPRVGEVEDVANLAMFLLSDAASWITGQVINVDGGHMLRRGPDFSPML
EPVFGADGLRGVVG
>Mb3532c Mb3532c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVN
DVASALDASDVVDEIGAAAADAGAKAVAVAGDISQRATADELLASAVGLG
GLDIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDK
AKDAEGGSVFGRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALG
RYGVCANVICPRARTAMTADVFGAAPDVEAGQIDPLSPQHVVSLVQFLAS
PAAAEVNGQVFIVYGPQVTLVSPPHMERRFSADGTSWDPTELTATLRDYF
AGRDPEQSFSATDLMRQ
>Mb3549 Mb3549, HYPOTHETICAL PROTEIN
MPVSQHTIAGTVLTMPVRIRTANLHSAMFSVPADPAQRLIDYSGLRVCEY
LPGKAIVMQMLVRYVDGDLGRYHEYGTAIMVNPPGTQRRGPRALTRAAAF
IHHLPVDQVFTLEAGRTIWGFPKIMADFNVTDGRRFGFDVSADGRLIAGI
EFSTGLPVPTLGWQMLKTYSHHDGVTREIPWEMKVSGLRARLGGARLRLG
DHPYAKELASLGLPKRALLSQSAANVEMTFGDGHPI
>Mb3560c Mb3560c, POSSIBLE OXIDOREDUCTASE
MTGMLKRKVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERLDDVAKQ
IIDTGRRAVAVRTDITDDDDVSNLVQATLAAYGKADVLINNAFRVPSMKP
LAGTTFEHIRDAIELSALGTLRLIQAFTPALAQSHGAIVNVNSMVIRHSQ
PKYGTYKMAKSVLLAMSHSLATELGEQGIRVNSVAPGYIWGDTLKSYFDH
QAGKYGTTVDQIYQATAANSDLKRLPTEDEVASAILFLASDLASGITGQT
LDVNCGEYHT
>Mb3565c Mb3565c, PROBABLE ACETALDEHYDE DEHYDROGENASE (ACETALDEHYDE DEHYDROGENASE [ACETYLATING])
MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGLARAAKL
GLETTHEGVDWLLAQPDKPDLVFEATSAYVHRDAAPKYAEAGIRAIDLTP
AAVGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEI
VASVASVSAGPGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPM
IMRDTIFCAIPTDADREAIAASIHDVVKEVQTYVPGYRLLNEPQFDEPSI
NSGGQALVTTFVEVEGAGDYLPPYAGNLDIMTAAATKVGEEIAKETLVVG
GAR
>Mb3566c Mb3566c, PROBABLE HYDRATASE
MLRDATRDELAADLAQAERSRDPIGQLTAAHPEIDVVDAYEIQLINIRQR
VAEGARVVGHKVGLSSPIMQQMMGVDEPDYGHLLDDMQVFEDTPVQASRY
LSPRVEVEVGFILAADLPGAGCTEDDVLAATEALVPAIELIDTRIKDWQI
KICDTIADNASAAGFVLGAARVPPADLDVRAIDAKLTRNGEVVAEGRSDA
VLGNPATAVAWLAGKVESFGVRLRKGDIVLPGSCTFAVEARAGDEFVADF
TGLGLVRLSFE
>Mb3578c Mb3578c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MGLVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDIGVGLDGSPASGG
SAAQDVVDEILAAGGQAVADGSDISDWDQAANLIQAAVETYGGVDVLVNN
AGIVRDRMIANTSEEEFDAVIAVHLKGHFATMRHAASHWRGLSKAGKAPK
DIDARIINTSSGAGLQGSVGQGNYSAAKAGIAALTLVGAAEMRRYGVTVN
AIAPAARTRMTETVFAEVMAKPQEGFDAMAPENVSPLVVWLGSAESRDVT
GKVFEVEGGIIRVAEGWAHGPQVDKGVKWDPAELGPVVSDLLAKSRPPVP
VYGA
>Mb3579c Mb3579c, PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MTLAEAADAINFGLAGRVVLVTGGVRGVGAGISSVFAEQGATVITCARRA
VDGQPYEFHRCDIRDEDSVKRLVGEIGERHGRLDMLVNNAGGSPYALAAE
ATHNFHRKIVELNVLAPLLVSQHANVLMQAQPNGGSIVNICSVSGRRPTP
GTAAYGAAKAGLENLTTTLAVEWAPKVRVNAVVVGMVETERSELFYGDAE
SIARVAATVPLGRLARPADIGWAAAFLASDAASYISGATLEVHGGGEPPP
YLGASSANK
>Mb3589c Mb3589c, PUTATIVE OXIDOREDUCTASE
MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGADVVISD
HHERRLGETAAELSALGLGRVEHVVCDVTSTAQVDALIDSTTARMGRLDV
LVNNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGG
VIVNNASVLGWRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSP
SIARHKFLDKTASAELLDRLVAGEAFGRAAEPWEVAATIAFLASDYSSYL
TGEVISVSCQHP
>Mb3657 Mb3657, CONSERVED HYPOTHETICAL PROTEIN
MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERELPTVIA
NSTTVVKGLARPGHVPVDRVDHDWVRIDNLLLHGTRYEALPVHPKLLPVI
EGVLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIA
LCDFTADNGATQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWH
TAAANRTDAPRPALTINFCVGFVRQQVNQQLSIPRELVRCFEPRLQELIG
YGLYAGKMGRIDWRPPADYLDADRHPFLDAVADRLQTSVRL
>Mb3725 Mb3725, CONSERVED HYPOTHETICAL PROTEIN
MTDEVMDWDSAYREQGAFEGPPPWNIGEPQPELATLIAAGKVRSDVLDAG
CGYAELSLALAADGYTVVGIDLTPTAVAAATKAAEERGLTTASFVQADIT
EFAAYPAGSAGRFSTVIDSTLFHSLPVDSRDRYLSSVHRAAAPGASYYVL
VFAKGAFPAELEVKPNEVDEDELRAAVSKYWKIDEIRPAFIHVNPVTIPP
QLAGAPVEFPPYDHDEKGRVKFPAYLLTAHKAG
>Mb3756 Mb3756, POSSIBLE TRANSFERASE
MFVEYTKSICPVCKVVVDAQVNIRHDKVYLRKRCREHGSFEALVYGDAQM
YLESARFNKPGTFPLRFQTEVRDGCPSDCGLCPDHKQHACLGLIEVNTHC
NLDCPICFADSGHQPDGYAITAAQCERMLDTLVAAEGEPEVVMFSGGEPT
IHKQLLEFVDAAQARPVKTIIINTNGIRLASDRRFVDQLATRNRPGHPVH
IYLQFDGLDEATHRRIRGHDLRDVKQRALDNCAAAGLTVSLVAAVERGLN
EHELGAVIRHGMAQPGVQSVVFQPVTHAGRHVQFDPLTRLTNSDIIACIT
AQLPEWFRPGDFFPVPCCFPSCRSITYLLTDGEHVVPIPRLLNVEDYLDY
VSNRVIPDLAIREALENLWSASAVPGTDTMTAQLQRATAALNCAEGCGIN
LPEALTHLTDRVFAIVIQDFQDPYTLNVKQLMKCCVQQITPDGRLIPFCA
YNSVGYREQVREQLTGVPVPDIVPNAIPLAGLLADAPHGSKQANTGGSIA
RLAGPTRGAPMALPPQQIKACCADAYSRDIVALLLGDSFHPGGATLTRRL
ADQLGLRSTGDPRRVADIAAGPGASARLLASDYGVAVDGVDISEINVKRA
QAAVAQTGLTERVRFHLGDAESVPLPDDTFDALVCECAFCTFPDKNAAAQ
QFARILRPGGLAGITDVTVGDGGLPAELTPLAAWVACIADARTVTDYTDI
LEGAGLRTRHIESHDESLLDMIDRIDARITALHVAAPEILADNGIRHDSV
RDFTALARAAVQTGRIGYTLMIAEKP
>Mb3788c Mb3788c, PUTATIVE HYDROLASE
MPMEHKPPTAVIQAAHGEHSLPLHDTTDFDDADRGFIAALSPCVIKAADG
RVVWDNDAYSFLDGAAPTSVHPSLWRQSQLTAKQGLYQVVPGIYQVRGFD
ISNISFVEGDTGLIVIDPLVSTEVAAAALDLYRAHRGADRPVVAVIYTHS
HVDHFGGVLGGTTQADVDAGKVAVLAPEGFTAHAVQENIYAGSAMMRRAG
YMYGTVLARGLRGHVGCGLGQTLSTGEVSLVVPTVDITETGETHTIDGVE
IEFQMAPGTEAPAEMHFYFPRFRALCMAENATHNLHNLLTLRGALVRDPR
AWSGYLTEAIDTFADRTDVVFASHHWPTWGREKIVEFLSQQRDMHSYLHD
QTLRLLNQGYTGVEIAEMFQLPPALQRAWHTHGYYGSVSHNVKAIYQRYM
GWFDGNPGWLWPHPPEALAPRYVDALGGIDRVLELAREAFDAGDFRWAAT
LLDHAVFADSEHAAARGLYADTLEQLAYGAECATWRNFFLTGAAELRDGN
PGSSGQVPAPTFFAQLTPDQIFDVLAISINGPRAWDLDLAIDFTFTEPDV
NYRLTLRNGVLIHRKLPADPATANATVTVGDKVRLVAAALGDISSPGFEV
FGDRTVLQTFLSVLDRPDSAFNIVTP
>Mb3793c Mb3793c, CONSERVED HYPOTHETICAL PROTEIN
MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARIFVDAAG
DGIWSMYTNRTLLAGATDLDPDLRAPIQQMIDFMAARTAFFDEYFLATAD
AGVRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQ
PASQLVNVPIDLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLL
FERIDALSRPGSWLASNVPGAGFLDPERMRRQRADMRRMRAAAAKLVETE
ISDVDDLWYAEQRTAVAEWLRERGWDVSTATLPELLARYGRSIPHSGEDS
IPPNLFVSAQRATS
>Mb3816c Mb3816c, CONSERVED HYPOTHETICAL PROTEIN
MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEPLVRAVG
VEFLTRWATGELDAADVDDPDAAWGLQRMTTELVVRTRYFDQFFLDAAAA
GVRQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQP
TADLRMVPADLRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLL
DHVTDLSAPGSRLALEAFLGSADRDSARVEEMIRTATRGWREHGFHLDIW
ALNYAGPRHEVSGYLDNHGWRSVGTTTAQLLAAHDLPAAPALPAGLADRP
NYWTCVLG
>Mb3820 Mb3820, PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE
MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPDDPRRED
AAAAMKQAGARSVELIDFDALDTDSHPKMIEAAFSGGDVDVAIVAFGLLG
DAEELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAG
ERVRRANFVYGSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAH
LKEAPLTVDKEYVANLAVTASAKGKELVWAPAAFRYVMMVLRHIPRSIFR
KLPI
>Mb3859c Mb3859c, PUTATIVE DEHYDROGENASE
MTGYDAIVIGAGHNGLTAAVLLQRAGLRTACLDAKRYAGGMASTVELFDG
YRFDIAGSVQFPTSSAVSSELGLDSLPTVDLEVMSVALRGVGDDPVVQFT
DPTKMLTHLHRVHGADAVTGMAGLLAWSQAPTRALGRFEAGTLPKSFDEM
YACATNEFERSAIDDMLFGSVTDVLDRHFPDREKHGALRGSMTVLAVNTL
YRGPATPGSAAALAFGLGVPEGDFVRWKKLRGGIGALTTHLSQLLERTGG
EVRLRSKVTEIVVDNSRSSARVRGVRTAAGDTLTSPIVVSAIAPDVTINE
LIDPAVLPSEIRDRYLRIDHRGSYLQMHFALAQPPAFAAPYQALNDPSMQ
ASMGIFCTPEQVQQQWEDCRRGIVPADPTVVLQIPSLHDPSLAPAGKQAA
SAFAMWFPIEGGSKYGGYGRAKVEMGQNVIDKITRLAPNFKGSILRYTTF
TPKHMGVMFGAPGGDYCHALLHSDQIGPNRPGPKGFIGQPIPIAGLYLGS
AGCHGGPGITFIPGYNAARQALADRRAANCCVLSGR
>Mb3862c Mb3862c, CONSERVED HYPOTHETICAL PROTEIN
MAMNLLHRRHCSSAGWEKAVANQLLPWALQHVELGPRTLEIGPGYGATLQ
ALLGLTASLTAVEVDNSMVERLNRRYGQRARIIRGDGTQTGLPDDHFTSV
VCFTMLHHVASAQLQDQLFAEAYRVLQPGGVFAGSDGVPSLPFRLIHIAD
TYTPIAPADLPGRLRAVGFTDIHVDVAGARLRWRATKPVAA
>Mb0034 acpA, PROBABLE ACYL CARRIER PROTEIN ACPA (ACP)
MKEAINATIQRILRTDRGITANQVLVDDLGFDSLKLFQLITELEDEFDIA
ISFRDAQNIKTVGDVYTSVAVWFPETAKPAPLGKGTA
>Mb2268 acpM, MEROMYCOLATE EXTENSION ACYL CARRIER PROTEIN ACPM
MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSMVEIAVQ
TEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENP
DAVANVQARLEAESK
>Mb3423 acrA1, POSSIBLE MULTI-FUNCTIONAL ENZYME WITH ACYL-CoA-REDUCTASE ACTIVITY ACRA1
MRYVVTGGTGFIGRHVVSRLLDGRPEARLWALVRRQSLSRFERLAGQWGD
RVRPLVGDLTELELSERTIAELGDIDHVLHCAAVHDTTWADATRAVIELA
ARLDATFHHVSSIAVAGDFAGHYTEADFDVGQRLPTPYHRMTFEAERLVR
STPGLRYRIYRPAVVVGDSRTGEMDTIDGPYYLFGVLAKLAVLPSFTPML
LPDIGRTNIVPVDYVADALVALMHADGRDGQTFHLTAPTAIGLRGIYRGI
AGAAGLPPLLGTLPGFVAAPVLNARGRAKVLRNMAATQLGIPAEIFDVVG
CAPTFTSDTTREALRGTGIHVPEFATYAPGLWRYWAEHLDPDRARRNDPL
LGRHVIITGASSGIGRASAIAVAKRGATVFALARNGNALDELVTEIRAHG
GQAHAFTCDVTDSASVEHTVKDILGRFDHVDYLVNNAGRSIRRSVVNSTD
RLHDYERVMAVNYFGAVRMVLALLPHWRERRFGHVVNVSSAGVQARNPKY
SSYLPTKAALDAFADVVASETLSDHITFTNIHMPLVATPMIVPSRRLNPV
RAISAERAAAMVIRGLVEKPARIDTPLGTLAEAGNYVAPRLSRRILHQLY
LGYPDSAAAQGISRPDADRPPAPRRPRRSARAGVPRPLRRLGRLVPGVHW
>Mb2299 cyp121, Cytochrome P450 121 CYP121
MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYAL
CTQVLEDRRFSMKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMK
AITPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVL
GIPQEDGPKLFRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITT
GLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGSFLTTALISLI
QRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVR
KGELVLVLLEGANFDPEHFPNPGSIELDRPNPTSHLAFGRGQHFCPGSAL
GRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW
>Mb0789c cyp123, PROBABLE CYTOCHROME P450 123 CYP123
MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNFWAVSRH
HDVLQGFRDSTALSNAYGVSLDPSSRTSEAYRVMSMLAMDDPAHLRMRTL
VSKGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVIS
ELIGVPDTDRARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIA
EFRRRPANNLTSALLAAELDGDRLSDQEIMAFLFLMVIAGNETTTKLLAN
AVYWAAHHPGQLARVFADHSRIPMWVEETLRYDTSSQILARTVAHDLTLY
DTTIPEGEVLLLLPGSANRDDRVFDDPDDYRIGREIGCKLVSFGSGAHFC
LGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGFAHLPISVQ
AR
>Mb2289 cyp124, Probable cytochrome P450 124 CYP124
MGLNTAIATRVNGTPPPEVPIADIELGSLDFWALDDDVRDGAFATLRREA
PISFWPTIELPGFVAGNGHWALTKYDDVFYASRHPDIFSSYPNITINDQT
PELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVS
SMIANNPDRQADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGF
GDPDLATDFDEFMQVSADIGAYATALAEDRRVNHHDDLTSSLVEAEVDGE
RLSSREIASFFILLVVAGNETTRNAITHGVLALSRYPEQRDRWWSDFDGL
APTAVEEIVRWASPVVYMRRTLTQDIELRGTKMAAGDKVSLWYCSANRDE
SKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVAFDELRRQM
PDVVATEEPARLLSQFIHGIKTLPVTWS
>Mb3575c cyp125, PROBABLE CYTOCHROME P450 125 CYP125
MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAA
PIWWNGQDPGKGGGFHDGGFWAITKLNDVKEISRHSDVFSSYENGVIPRF
KNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQER
AQKIAAEAAAAGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEM
TGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIVTQLIQADIDG
EKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPE
TAADEIVRWATPVTAFQRTALRDYELSGVQIKKGQRVVMFYRSANFDEEV
FQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD
LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH
>Mb0801 cyp126, POSSIBLE CYTOCHROME P450 126 CYP126
MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEHTPDGEG
FWSVATYAETLEVLRDPVTYSSVTGGQRRFGGTVLQDLPVAGQVLNMMDD
PRHTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIA
AELPMQMICILLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGS
RLYTYALELIAGKRAEPADDMLSVVANATIDDPDAPALSDAELYLFFHLL
FSAGAETTRNSIAGGLLALAENPDQLQTLRSDFELLPTAIEEIVRWTSPS
PSKRRTASRAVSLGGQPIEAGQKVVVWEGSANRDPSVFDRADEFDITRKP
NPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEPAEWTRSNR
HTGIRHLVVELRGG
>Mb2291c cyp128, PROBABLE CYTOCHROME P450 128 CYP128
MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTRRRASSG
GIPVATSMWRDAQTVRTYGPAVAKALALRVAGKARSRLTGRHCRKFMQLT
DFDPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNH
DTLSSARGVTFSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMV
DQLARELVGGLLTQTPADVVSTVAAPMPMRAITSVLGVDGPDEAAFCRLS
NQAVRITDVALSASGLISLVQGFAGFRRLRALFTHRRDNGLLRECTVLGK
LATHAEQGRLSDDELFFFAVLLLVAGYESTAHMISTLFLTLADYPDQLTL
LAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPAGSLVLLAW
GAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILR
EIVANIDRIEVVEPPTWTTNANLRGLTRLRVAVTPRVAP
>Mb1429c cyp132, PROBABLE CYTOCHROME P450 132 CYP132
MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGSDITRFR
CAGRRFVSISHPDYVDHVLHEARLKYVKSDEYGPIRATAGLNLLTDEGDS
WARHRGALNSTFARRHLRGLVGLMIDPIADVTAALVPGAQFDMHQSMVET
TLRVVANALFSQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYD
TLIWCIYSGVHLPPPLREMQEITLTLDRAINSVIDRRLAEPTNSADLLNV
LLSADGGIWPRQRVRDEALTFMLAGHETTANAMSWFWYLMALNPQARDHM
LTELDDVLGMRRPTADDLGKLAWTTACLQESQRYFSSVWIIAREAVDDDI
IDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPTDRPRCAYL
PFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRP
KHGVHVIGRRR
>Mb0334c cyp135A1, POSSIBLE CYTOCHROME P450 135A1 CYP135A1
MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFSLRVPPY
ADNLVVYTRPEHIKEIFAADPRSLHAGEGNHILGFVMGEHSVLMTDEAEY
ARMRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALT
LDIILRVVFGVTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWK
RFFHNQTKIDEILYREIASRRIDSDLTARTDVLSRLLQTKDTPTKPLTDA
ELRDQLITLLLAGHETTAAALSWTLWELAHAPEIQSQVVWAAVGGDDGFL
EAVLKEGMRRHTVIASTARKVTAPAEIGGWRLPAGTVVNTSILLAHASEV
SHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTEGAVILQEI
FRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP
>Mb0583 cyp135B1, POSSIBLE CYTOCHROME P450 135B1 CYP135B1
MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFTLHVAGF
GHMVYLSDPAAIKTVFAGNPSVFHAGEANSMLAGLLGDSSLLLIDDDVHR
DRRRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITL
EVILRTVIGASDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSR
LRRRIEEADALLYAEIADRRADPDLAARTDTLAMLVRAADEDGRTMTERE
LRDQLITLLVAGHDTTATGLSWALERLTRHPVTLAKAVQAADASAAGDPA
GDEYLDAVAKETLRIRPVVYDVGRVLTEAVEVAGYRLPAGVMVVPAIGLV
HASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGATFAMVEMRV
VLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQA
TAQGAGCPAARGGGPSRAVGSQ
>Mb3085 cyp136, PROBABLE CYTOCHROME P450 136 CYP136
MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKKLAEPPP
GSGLKPVVGDAGLPILGHMIEMLRGGPDYLMFLYKTKGPVVFGDSAVLPG
VAALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRR
IMQEAFVRSRLAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDI
ASMVFMGHEPGTDHELVTKVNKAFTITTRAGNAVIRTSVPPFTWWRGLRA
RELLENYFTARVKERREASGNDLLTVLCQTEDDDGNRFSDADIVNHMIFL
MMAAHDTSTSTATTMAYQLAAHPEWQQRCRDESDRHGDGPLDIESLEQLE
SLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIAYPGMNHRL
PEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFGQLEIK
TILHRLLRRYRLELSRPDYQPRWDYSAMPIPMDGMPIVLRPR
>Mb3710c cyp137, PROBABLE CYTOCHROME P450 137 CYP137
MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGLPAPRGF
RAAFAAAYAVAYLAGGERRMLRLIRRYGPIMTMPILSLGDVAIVSDSALA
KEVFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPQHLRRRKLLTPPLHG
AALDRYVPIIENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDD
PEEVRRLGRPFERLLNLGVSEQLTVRYALRRLGALRVWPARARANTEIDD
VVMALIAQRRADPRLGERHDVLSLLVSARGESGEQLSDSEIRDDLITLVL
AGHETTATTLAWAFDLLLHHPDALRRVRAEAVGGGEAFTTAVINETLRVR
PPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYEHPHEFRPE
RFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDD
EPERIVRRSIMLVPRRGTRVRFRPAR
>Mb0141 cyp138, PROBABLE CYTOCHROME P450 138 CYP138
MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVRRYGKAF
TANILMYGRVVVVADPQLARQVFTSSPEELGNIQPNLSRMFGSGSVFALD
GDDHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSM
MHITLNAILRAIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRL
SPWGRLAEWRRQYDTVIDKLIEAERADPNFADRTDVLALMLRSTYDDGSI
MSRKDIGDELLTLLAAGHETTAATLGWAFERLSRHPDVLAALVEEVDNGG
HELRQAAILEVQRARTVIDFAARRVNPPVYQLGEWVIPRGYSIIINIAQI
HGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAAFANMEMDV
VLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR
>Mb1694c cyp139, Probable cytochrome P450 139 CYP139
MRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFANADAFSW
SQTFESLVPVDGPTALIVSDGADHRRRRSVVAPGLRHHHVQRYVATMVSN
IDTVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFLGEQLQP
LLDLTRRPPQVMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADARTAPRP
DDHMLTTLISGCSEEGTTLSDNEIRDSIVSLITAGYETTSGALAWAIYAL
LTVPGTWESAASEVARVLGGRVPAADDLSALTYLNGVVHETLRLYSPGVI
SARRVLRDLWFDGHRIRAGRLLIFSAYVTHRLPEIWPEPTEFRPLRWDPN
AADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVARAMLQLPAQ
RTHRIRAANFAALRPWPGLTVEIRKSAPAQ
>Mb1912c cyp140, Probable cytochrome p450 140 CYP140
MKDKLHWLAMHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPFYDEVRS
HGALVRNRANYLTVDHRLAHDLLRSDDFRVVSFGENLPPPLRWLERRTRG
DQLHPLREPSLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLL
DRFAEQPGIVDVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSL
DIGIPWRQYLRVQQGIRGFDCWLEGHLQQLRHAPGDDLMSQLIQIAESGD
NETQLDETELRAIAGLVLVAGFETTVNLLGNGIRMLLDTPEHLATLRQHP
ELWPNTVEEILRLDSPVQLTARVACRDVEVAGVRIKRGEVVVIYLAAANR
DPAVFPDPHRFDIERPNAGRHLAFSTGRHFCLGAALARAEGEVGLRTFFD
RFPDVRAAGAGSRRDTRVLRGWSTLPVTLGPARSMVSP
>Mb3548c cyp142a, PROBABLE CYTOCHROME P450 MONOOXYGENASE 142 CYP142A [FIRST PART]
MTEAPDVDLADGNFYASREARAAYRWMRANQPVFRDRNGLAAASTYQAVI
DAERQPELFSNAGGIRPDQPALPMMIDMDDPAHLLRRKLVNAGFTRKRVK
DKEASIAALCDTLIDAVCERGECDFVRDLAAPLPMAVIGDMLGVRPEQRD
MFLRWSDDLVTFLSSHVSQEDFQITMDAFAAYNDFTRATIAARRADPPTT
WSACW
>Mb3547c cyp142b, PROBABLE CYTOCHROME P450 MONOOXYGENASE 142 CYP142B [SECOND PART]
MSSEVDGERLSDDELVMETLLILIGGDETTRHTLSGGTEQLLRNRDQWDL
LQRDPSLLPGAIEEMLRWTAPVKNMCRVLTADTEFHGTALCAGEKMMLLF
ESANFDEAVFCEPEKFDVQRNPNSHLAFGFGTHFCLGNQLARLELSLMTE
RVLRRLPDLRLVADDSVLPLRPANFVSGLESMPVVFTPSPPLG
>Mb1813c cyp143, PROBABLE CYTOCHROME P450 143 CYP143
MTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYL
TRREDVLAALRNPKVFSSRKALQPPGNPLPVVPLAFDPPEHTRYRRILQP
YFSPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLY
GLPLEDRDRLIGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRR
NPGPDVLSQVQIGEDPLSEIEVLGLSHLLILAGLDTVTAAVGFSLLELAR
RPQLRAMLRDNPKQIRVFIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAG
SPVRLCMAAVNRDGSDAMSTDELVMDGKVHRHWGFGGGPHRCLGSHLARL
ELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRWS
>Mb1806 cyp144, Probable cytochrome p450 144 CYP144
MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAESVQDPY
PLYERMRAAGSVHRIANSDFYAVCGWDAVNEAIGRPEDFSSNLTATMTYT
AEGTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQF
TVQAADRLWVDGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKW
GYAATQLLEGLVENDQLVAAGVALMELSGYIFEQFDRAAADPRDNLLGEL
ATACASGELDTLTAQVMMVTLFAAGGESTAALLGSAVWILATRPDIQQQV
RANPELLGAFIEETLRYEPPFRGQYRHVRNATTLDGTELPADSHLLLLWG
AANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALARLEARIVLR
LLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ
>Mb0787c cyp51, CYTOCHROME P450 51 CYP51 (CYPL1) (P450-L1A1) (STEROL 14-ALPHA DEMETHYLASE) (LANOSTEROL 14-ALPHA DEMETHYLASE) (P450-14DM)
MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQLAGKQV
VLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEML
HNAALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSA
CLIGKKFRDQLDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNG
LVALVADIMNGRIANPPTDKSDRDMLDVLIAVKAETGTPRFSADEITGMF
ISMMFAGHHTSSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHA
LRQIPQLENVLKETLRLHPPLIILMRVAKGEFEVQGHRIHEGDLVAASPA
ISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRHRCVGAAFA
IMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTG
V
>Mb3241 entC, PROBABLE ISOCHORISMATE SYNTHASE ENTC (ISOCHORISMATE HYDROXYMUTASE) (ENTEROCHELIN BIOSYNTHESIS)
MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRSGTAPIL
LGALPFDVSRPAALMVPDGVLRARKLPDWPTGPLPKVRVAAALPPPADYL
TRIGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTA
YGYLVDLTSAGNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADP
KLDAANAAALASSAKNRHEHQLVVDTMRVALEPLCEDLTIPAQPQLNRTA
AVWHLCTAITGRLRNISTTAIDLALALHPTPAVGGVPTKAATELIAELEG
DRGFYAGAVGWCDGRGDGHWVVSIRCAQLSADRRAALAHAGGGIVAESDP
DDELEETTTKFATILTALGVEQ
>Mb2237c ephD, Possible short-chain dehydrogenase EphD
MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVLWDGVVP
LLAERFRIVRYDNRGVGRSSVPKPISAYTMAHFADDFDAVIGELSPGEPV
HVLAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRP
WRPRTFLRAISQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIP
VDQIHHSETLARDAAHSVKTYPANYFRSFSSSRRGRAIPIVDVPVQLIVN
SQDPYVRPYGYDQTARWVPRLWRRDIKAGHFSPMSHPQVMAAAVHDFADL
ADGKQPSRALLRAQVGRPRGYFGDTLVSVTGAGSGIGRETALAFAREGAE
IVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFAERVSAEHG
VPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVER
GTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLT
TICPGVIDTNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVAD
AIVSAVKKKKPIRPVAPEAYALYGISRVLPQALRSTARLRVI
>Mb1519 fabG1, PROBABLE 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG1 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE)
MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSG
APKGLFGVECDVTDSDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRM
TEEKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQ
ANYAASKAGVIGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQ
GALQFIPAKRVGTPAEVAGVVSFLASEDASYISGAVIPVDGGMGMGH
>Mb1385 fabG2, PUTATIVE 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG2 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE)
MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEATEVAAKR
LGGDDVALAVRCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMR
TMTEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMV
GQTNYSAAKAGIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRI
WDQKLAEVPMGRAGEPSEVASVAVFLASDLSSYMTGTVLDVTGGRFI
>Mb2025 fabG3, POSSIBLE 20-BETA-HYDROXYSTEROID DEHYDROGENASE FABG3 (Cortisone reductase) ((R)-20-hydroxysteroid dehydrogenase)
MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEGKAVAAE
LADAARYVHLDVTQPAQWTAAVDTAVTAFGGLHVLVNNAGILNIGTIEDY
ALTEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVAC
HGYTATKFAVRGLTKSTALELGPSGIRVNSIHPGLVKTPMTDWVPEDIFQ
TALGRAAEPVEVSNLVVYLASDESSYSTGAEFVVDGGTVAGLAHNDFGAV
EVSSQPEWVT
>Mb0248c fabG4, PROBABLE 3-OXOACYL-[ACYL-CARRIER PROTEIN] REDUCTASE FABG4 (3-KETOACYL-ACYL CARRIER PROTEIN REDUCTASE)
MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPLTGSLLI
GGAGRVVEPLRAALEKDYDLVGNNLGGRWADSFGGLVFDATGITEPAGLK
GLHEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGK
ELRRGATTALVYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADD
STPPADWEKPLDGKVAIVTGAARGIGATIAEVFARDGAHVVAIDVESAAE
NLAETASKVGGTALWLDVTADDAVDKISEHLRDHHGGKADILVNNAGITR
DKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNGSIGEGGRVIGLSSIAG
IAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGFIETQMTAA
IPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQA
MIGA
>Mb1779c fadD1, POSSIBLE FATTY-ACID-COA LIGASE FADD1 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MTDTIQSLLRQHVSDPTIAVKYGGLQWTWSQYLAESAARAAALITIADPQ
RPTHIGSLLGNTPEMLAQLAAAGLGGYVLCGLNTTRRGDALAADVRRADC
QIVVTDADHRALLDGLDLAGARILDTSTPRWAELVAGDGAFVPYREVDTM
DPFMMIFTSGTSGNPKAVPVSHLMATFAGRSLTERFGLTEQDTCYVSMPL
FHSNAVVAGWAPAVVSGAAIAPATFSATGFLDDVRRYHATYMNYVGKPLA
YILATPERDDDADNPLRVAFGNEANDKDIEEFSRRFGVQVEDGFGSTENA
VIVIREPGTPPGSIGRGAHGVAVYNGETVTECAVARFDAHGALTNADEAI
GELVNTTGSGFFTGYYNDPEANAERMRHGMYWSGDLAYRDSEGWIYLAGR
TADWMRVDGENLTAAPIERILLRYKAINRVAVYAVPDEYVGDQVMAALVL
RAGDTFDPDAFEAFLDAQPDLSTKARPRYIRIAADLPSTATHKVLKRQLI
DEGTAVGKADTLWVREPRGSAYHHASGPAKAI
>Mb0102 fadD10, POSSIBLE FATTY-ACID-COA LIGASE FADD10 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRYRELVAE
VGGLAADLRAQSVSRGSRVLVISDNGPETYLSVLACAKLGAIAVMADGNL
PIAAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRE
SEHSLDAASLAGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDI
LQKEGLNWVTWVVGETTYSPLPATHIGGLWWILTCLMHGGLCVTGGENTT
SLLEILTTNAVATTCLVPTLLSKLVSELKSANATVPSLRLVGYGGSRAIA
ADVRFIEATGVRTAQVYGLSETGCTALCLPTDDGSIVKIEAGAVGRPYPG
VDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNPERTAEVLI
DGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGV
REAACYEIPDEEFGALVGLAVVASAELDESAARALKHTIAARFRRESEPM
ARPSTIVIVTDIPRTQSGKVMRASLAAAATADKARVVVRG
>Mb1462c fadD12, POSSIBLE LONG-CHAIN-FATTY-ACID--COA LIGASE FADD12 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MRIRQAFGLIATMRRAGLIAPLRPDRYLRIVAAMRREGMGFTAGFAGAAR
RCPDRPGLIDELGTLTWRQLDERGNALAAALQALPAGPPRVVGIMCRNHR
GFVDALLAVNRIGAHILLLNTSFAGPALAEVVTREGVDTVVYDEEFSATV
DRALAEKPQATRIVAWTDEDHDLTVEKLVAAHAGRRPEHTGSHGKVILLT
SGTTGTPKGARHSGGGIGTLKAILDRTPWRAEEVTVIVAPMFHAWGFSQL
VLASSLACTIVTRRRFDPEATLDLIDRHHATGLVVVPVMFDRIMDLPAEI
RNRYDGRSLRFAAASGSRMRPDVVIAFMDQFGDVIYNNYNATEAGMIATA
TPADLRTAPDTAGRPAEGTEIRILDQQFTEVPTGEVGTIYVRNDSQFDGY
TSGAAKDFHAGFMSSGDVGYLDENGRLFVVGRDDEMIVSGGENIYPIEVE
KTLATHPDVAEAAVIGVDDQQYGQRLAAFVVLKPGVSATPETLKQHVRDN
LANYKVPRDIAVLDELPRGITGKILRTELQSRVGS
>Mb3116 fadD13, PROBABLE CHAIN-FATTY-ACID-CoA LIGASE FADD13 (FATTY-ACYL-CoA SYNTHETASE)
MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCADVLTALG
IAKGDRVALLMPNSVEFCCLFYGAAKLGAVAVPINTRLAAPEVSFILSDS
GSKVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPA
VECGGDDNLFIMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRL
LLPLPMFHVAALTTVIFSAMRGVTLISMPQFDATKVWSLIVEERVCIGGA
VPAILNFMRQVPEFAELDAPDFRYFITGGAPMPEALIKIYAAKNIEVVQG
YALTESCGGGTLLLSEDALRKAGSAGRATMFTDVAVRGDDGVIREHGEGE
VVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYLYIKDRLKD
MIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQN
EVSEQQIVEYCGTRLARYKLPKKVIFAEAIPRNPTGKILKTVLREQYSAT
VPK
>Mb1087 fadD14, PROBABLE MEDIUM CHAIN FATTY-ACID-COA LIGASE FADD14 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MYGTMQDFPLTITAIMRHGCGVHGRRTVTTATGEGYRHSSYRDVGQRAGQ
LANALRRLGVTGDQRVATFMWNNTEHLVTYFAVPSMGAVLHTLNIRLFPE
QIAYVTNEAEDRVILVDLSLARLLAPVLPKLDTVHTVIAVGEGDTTPLRE
AGKTVLRFAELIDAESPDFGWPQIDENSAAAMCYTSGTTGNPKGVVYSHR
SSFLHTMAACTTNGIGVGSSDKVLPIVPMFHANGWGLPYAALMAGADLVL
PDRHLDARSLIHMVETLKPTLAGAVPTIWNDVMHYLEKDPDHDMSSLRLV
ACGGSAVPESLMRTFEDKHDVQIRQLWGMTETSPLATMAWPPPGTPDDQH
WAFRITQGQPVCGVETRIVDDDGQVLPNDGNAVGEVEVRGPWIAGSYYGG
RDESKFDSGWLRTGDVGRIDEQGFITLTDRAKDVIKSGGEWISSVELENC
LIAHPDVLEAAVVGVPDERWQERPLAVVVVREGATVSAGDLRAFLADKVV
RWWLPERWAFVDEIPRTSVGKYDKKAIRSRYAEGAYQITEVHT
>Mb3536 fadD17, POSSIBLE FATTY-ACID-COA SYNTHETASE FADD17 (FATTY-ACID-COA SYNTHASE) (FATTY-ACID-COA LIGASE)
MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAAALRERL
DPARPPHVGVLLQNTPFFSATLVAGALSGIVPVGLNPVRRGAALAGDIAK
ADCQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLAD
LFMLIFTSGTSGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLF
HSNAVLVGWAVAAACQGSMALRRKFSASQFLADVRRYGATYANYVGKPLS
YVLATPELPDDADNPLRAVYGNEGVPGDIDRFGRRFGCVVMDGFGSTEGG
VAITRTLDTPAGALGPLPGGIQIVDPDTGEPCPTGVVGELVNTAGPGGFE
GYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWMRVDGENLG
TAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRA
FLTEQPDLGHKQWPSYVRVSAGLPRTMTFKVIKRQLSAEGVACADPVWPI
RR
>Mb3542c fadD18, PROBABLE FATTY-ACID-COA LIGASE FADD18 (FRAGMENT) (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MAASLSENLSCHSSNMCRLSGNAATNLERPGEEPPGDRCTRRQAVRPART
LAKKGNIPVGYYKDEKKTAETFRTINGVRYAIPGDYAQVEEDGTVTMLGR
GSVSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQA
RPGCRPSLAELDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQ
TEARPADDVHAGHVTSGS
>Mb3544c fadD19, PROBABLE FATTY-ACID-COA LIGASE FADD19 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLAHHLIDQ
GVQKDDKVGLYCRNRIEIVIAMLGIVKAGAILVNVNFRYVEGELRYLFDN
SDMVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYS
AIAAGSPERDFGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTD
FATGEFVKDEYDLAKAAAANPPMIRYPIPPMIHGATQSATWMALFSGQTT
VLAPEFNADEVWRTIHKHKVNLLFFTGDAMARPLVDALVKGNDYDLSSLF
LLASTAALFSPSIKEKLLELLPNRVITDSIGSSETGFGGTSVVAAGQAHG
GGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYYKDEKKTAE
TFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAA
LKGHPDVFDALVVGVPDPRYGQQVAAVVQARPGCRPSLAELDSFVRSEIA
GYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG
>Mb0276 fadD2, PROBABLE FATTY-ACID-COA LIGASE FADD2 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MPNLTDLPGQAVSKLQKSIGQYVARGTAELHYLRKIIESGAIGLEPPLNY
AALAADIRKWGEVGMLPSHNARRAPNRAAVIDEEGTLTFSELDEAAHAVA
NGLLAKGVRAGDGVAILARNHRWFVIANYGAARVGARIILLNSEFSGPQI
KEVSDREGAKVIIYDDEYTKAVSLAQPPLGKLRALGVNPDDDKPSGSSDE
TLAELIAHSSTAPAPKASRRASIIILTSGTTGTPKGANRNTPPTLAPIGG
ILSHVPFKAGEVTLLPSPMFHALGYMHAALAMFLGSTLVLRRRFKPALVL
EDIEKHKATSMVVVPVMLSRILDQLEKTEPKPDLSSLKIVFVSGSQLGAE
LATRALGDLGPVIYNMYGSTEVAFATIAGPKDLQFNPSTVGPVVKGVTVK
ILDENGNEVPQGAVGRIFVGNAFPFEGYTGGGGKQIIDGLLSSGDVGYFD
ERGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEAAAIGVDDKEF
GARLRAFVVKKPGADLDEDTIKQYVRDHLARYKVPREVIFLDELPRNPTG
KVLKRELRKL
>Mb1217c fadD21, PROBABLE FATTY-ACID--COA LIGASE FADD21 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEVFRRTRI
VAHEVRRHCTTGDRAVILAPQGLAYIAAFLGSMQAGAIAVPLSVPQIGSH
DERVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLT
GNSPSFRVKDLPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFG
DRNGVAPPDTTIVSWLPFYHDMGLVLGIIAPILGGYRSELTSPLAFLQRP
ARWLHSLANGSPSWSAAPNFAFELAVRKTTDADIEGLDLGNVLGITSGAE
RVHPNTLSRFCNRFAPYNFREDMIRPSYGLAEATLYVASRNSGDKPEVVY
FEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCIECPAGTIG
EIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLS
EDEMFIVGRMKDMLIVYGRNHYPEDIESTVQEITGGRVAAISVPVDHTEK
LVTVIELKLLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGS
IPTTTSGKIRRAACVEQYRLQQFTRLDG
>Mb2972c fadD22, PROBABLE FATTY-ACID-CoA LIGASE FADD22 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLGEVLRNR
GLSSGDRVLLCLPDSPDLVQLLLACLARGVMAFLANPELHRDDHALAARN
TEPALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYA
TYTSGTTGPPKAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAY
GLGNSVWFPLATGGSAVINSAPVTPEAAAILSARFGPSVLYGVPNFFARV
IDSCSPDSFRSLRCVVSAGEALELGLAERLMEFFGGIPILDGIGSTEVGQ
TFVSNRVDEWRLGTLGRVLPPYEIRVVAPDGTTAGPGVEGDLWVRGPAIA
KGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEVIGGVNVDP
REVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDL
HRGLLNRLSAFKVPHRFAVVDRLPRTPNGKLVRGALRKQSPTKPIWELSL
TEPGSGVRAQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAV
CAEAAKMLGEPDPWSVDQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPET
VGWDYGSISGLAQYLEAELAGGHGRLKSAGPVNSGATGLWAIEEQLNKVE
ELVAVIADGEKQRVADRLRALLGTIAGSEAGLGKLIQAASTPDEIFQLID
SELGK
>Mb3856 fadD23, PROBABLE FATTY-ACID-COA LIGASE FADD23 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQVYRRTLN
VAAEVRRHAAIGDRAVILAPQGLDYIVAFLGALQAGLIAVPLSAPLGGAS
DERVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLD
SPIRSNIVDDSLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYF
ADTGAVPPLDLFIMSWLPFYHDMGLVLGVCAPIIVGCGAVLTSPVAFLQR
PARWLQLMAREGQAFSAAPNFAFELTAAKAIDDDLAGLDLGRIKTILCGS
ERVHPATLKRFVDRFSRFNLREFAIRPAYGLAEATVYVATSQAGQPPEIR
YFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTNTECPPGTI
GEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFV
SEDKFFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAIAVPSNGVE
KLVAIVELNNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIP
ITTSGKVRRAECVKLYRHNEFTRLDAKPLQASDL
>Mb1556 fadD24, PROBABLE FATTY-ACID-COA LIGASE FADD24 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MVASSIPTALRERASVHPNGAAITYIDYEQDWAGVAETLTWSQLYRRMLN
VAEPLRHVGATGDRAVILAPQGIEYVVGFLGALQAGRIAVPLPVPHAGAH
DERTISVLSDTSPAVILTTSGAVDDVRECAQPQPGQSAPSIVELDLLDLD
SRQRSRSPGARPTGRDTPETAYLQYTSGSTRTPAGVMVSNKNVFANFEQI
VADFFAPEGGVVPPDLTVVSWLPLYHDMGLLLGAIMPILAGVPTVLTSPV
GFLQRPARWIQLLARNGRTISAGPNFAFELAVRKTSDDDMDGLDLAGVHT
ILNGSERVHPATLKRFAERFGRFNFAAAALRPAYGMAEATVYIATRNVNE
PPEIVDFESEKLPAGQAIRCPSGSGTPLVSYGVPRSQLVRIVDPDTCIEC
PQGSVGEIWVQGGNVASGYWHKPEESKRTFGARIVTPSAGTPEAPWLRTG
DSGFVSGGELFIIGRIKDLLIVYGRNHAPDDIEATIQEITSGRCAAIAVP
DHGTEKLVAIIELKKRGDSDEDVADRLRIVKRDVAAAIFDSHGLSVADLV
LVSPGSIPITTSGKIRRAQCVQLYRRREFTRLDA
>Mb1548 fadD25, PROBABLE FATTY-ACID-CoA LIGASE FADD25 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWSQLYRRT
LNLAAQLREHGSTGDRALILAPQILDYVVSFIASLQAGIVAVPLSIPQGG
AHDERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLD
LDARPSSGSRSAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIM
TSYYGVYGKVAPPGSTVVSWLPFYHDMGFVLGLILPILAGIPAVLTSPIG
FLQRPARWIQMLASNTLAFTAAPNFAFDLASRKTKDEDMEGLDLGGVHGI
LNGSERVQPVTLKRFIDRFAPFNLDPKAIRPSYGMAEATVYVATRKAGQP
PKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVDPDTGIERP
AGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGD
SGFLSEGELFIMGRIKDLLIVYGRNHSPDDIEATIQTISPGRCAAIAVSE
HGAEKLVAIIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVL
VSPGSIPITTSGKIRRAQCVELYRQDEFTRLDA
>Mb2955 fadD26, FATTY-ACID-CoA LIGASE FADD26 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWSQVYSRA
CIIAEELKLCGLPGDRVAVLAPQGLEYVLAFLGALQAGFIAVPLSTPQYG
IHDDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLD
LDSPRQMPAFSRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYG
YFGDPAKIPTGTVVSWLPLYHDMGLILGICAPLVARRRAVLMSPMSFLRR
PARWMQLLATSGRCFSAAPNFAFELAVRRTSDQDMAGLDLRDVVGIVSGS
ERIHVATVRRFIERFAPYNLSPTAIRPSYGLAEATLYVAAPEAGAAPKTV
RFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPETMVENPPG
VVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLG
VISDGELFIMGRIKDLLIVDGRNHYPDDIEATIQEITGGRAAAIAVPDDI
TEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVS
PGSIPITTSGKIRRSACVERYRSDGFKRLDVAV
>Mb2966 fadD28, FATTY-ACID-CoA LIGASE FADD28 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MIVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQLYRRTLN
VARELSRCGSTGDRVVISAPQGLEYVVAFLGALQAGRIAVPLSVPQGGVT
DERSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLD
APNGYTFKEDEYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYF
ADTDGIPPPNSALVSWLPFYHDMGLVIGICAPILGGYPAVLTSPVSFLQR
PARWMHLMASDFHAFSAAPNFAFELAARRTTDDDMAGRDLGNILTILSGS
ERVQAATIKRFADRFARFNLQERVIRPSYGLAEATVYVATSKPGQPPETV
DFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTCIECPDGTV
GEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFV
TDGKMFIIGRIKDLLIVYGRNHSPDDIEATIQEITRGRCAAISVPGDRST
EKLVAIIELKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAP
GSIPITTSGKVRRGACVEQYRQDQFARLDA
>Mb2974c fadD29, PROBABLE FATTY-ACID-CoA LIGASE FADD29 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MKTNSSFHAAGEVATQPAWGTGEQAAQPLNGSTSRFAMSESSLADLLQKA
ASQYPNRAAYKFIDYDTDPAGFTETVTWWQVHRRAMIVAEELWIYASSGD
RVAILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIHDERISSALRDSAP
SIILTTSSVIDEVTTYAPHACAAQGQSAPIVVAVDALDLSSSRALDPTRF
ERPSTAYLQYTSGSTRAPAGVVLSHKNVITNCVQLMSDYIGDSEKVPSTP
VSWLPFYHDMGLMLGIILPMINQDTAVLMSPMAFLQRPARWMQLLAKHRA
QISSAPNFGFELAVRRTSDDDMAGLDLGHVRTIVTGAERVNVATLRRFTE
RFAPFNLSETAIRPSYGLAEATVYVATAGPGRAPKSVCFDYQQLSVGQAK
RTENGSEGANLVSYGAPRASTVRIVDPETRMENPAGTVGEIWVQGDNVGL
GYWRNPQQTEATFRARLVTPSPGTSEGPWLRTGDLGVIFEGELFITGRIK
ELLVVDGANHYPEDIEATIQEITGGRVVAIAVPDDRTEKLVTIIELMKRG
RTDEEEKNRLRTVKREVASAISRSHRLRVADVVMVAPGSIPVTTSGKVRR
SASVERYLHHEFSRLDAMA
>Mb3591 fadD3, PROBABLE FATTY-ACID-COA LIGASE FADD3 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MINDLRTVPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGAAAALIA
LGVEPADRVAIWSPNTWHWVVACLAIHHAGAAVVPLNTRYTATEATDILD
RAGAPVLFAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIA
TGAGALDAVAARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASAS
WAANGKITSDDRYLCINPFFHNFGYKAGILACLQTGATLIPHVTFDPLHA
LRAIERHRITVLPGPPTIYQSLLDHPARKDFDLSSLRFAVTGAATVPVVL
VERMQSELDIDIVLTAYGLTEANGMGTMCRPEDDAVTVATTCGRPFADFE
LRIADDGEVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDIGAVDQAGN
LRINDRLKDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVG
RAFVVARPGTGLDEASVIAYTREHLANFKTPRSVRFVDVLPRNAAGKVSK
PQLRELG
>Mb0411 fadD30, PROBABLE FATTY-ACID-COA LIGASE FADD30 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSRVTAVSA
YLISYGRHADRRRTAAISAPQGLDYVAGFLGALCAGWAPVPLPEPLGSLR
DKRTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEP
SGDNCDLDSQLSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYF
RHEGGAPRLPSSVVSWLPLYHDMGLMVGLFIPLFVGCPVILTSPEAFIRK
PARWMQLLAKHQAPFSAAPNFAFDLAVAKTSEEDMAGLDLGHVNTIINGA
EQVQPNTITKFLRRFRPYNLMPAAVKPSYGMAEAVVYLATTKAGSPPTST
EFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDPDSNIELGP
GRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGD
LGFIVGDEFYIVGRMKDLIIQDGVNHYPDDIETTVKEFTGGRVAAFSVSD
DGVEHLVIAAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLL
VPPGALPKTTSGKISRAACAKQYGANKLQRVATFP
>Mb1960 fadD31, PROBABLE ACYL-COA LIGASE FADD31 (ACYL-COA SYNTHETASE) (ACYL-COA SYNTHASE)
MNDGSRQELRVRSGLLQIEDCLDADGGIALPAGTTLISLIERNIKYVGDL
VAYRYLDHARSAAGCALEVTWTQFGMRLAAIGAHVQRFAGPGDRVAILAP
QGIDYVCGFYAAIKAGTVAVPLFAPELPGHAERLDTALRDSEPAVILTTA
AAKNAVEGFLNNVPRLRKPTVLVIDQIPDREGELFVPVELDIDAVSHLQY
TSGSTRPPVGVEITHRAVGTNLVQMILSIDLLNRNTHGVSWLPLYHDMGL
SMIGFPAVYGGHSTLMSPTAFVRRPLRWIQALSEGSRTGRVVTAAPNFAY
EWAAQRGLPAQGDDVDLSNVVLIIGSEPVSIDAVTTFNKAFAPYGLPRTA
FKPSYGIAEATLLVATIDHAAEPTVVYLDPEQLGAGHATRVAPDAPNAVV
HVSCGHVARSLWAVIVDPDTGPEAGAELPDGEIGEVWLQGDNVARGYWGR
PEETRMTFGARLQSPLAEGSHADGSAIDDTWLRTGDLGVYLDGELYITGR
IADLLTIDGRNHYPQDIEATAAEASPMVRRGYITAFTVPASDGDDRNQRL
VIIAERAAGTSRSDPRPALDAIRAAVCNRHGLSVADLSFLPAGAIPRTTS
GKLARQACRAQYLSGRLGVH
>Mb3831c fadD32, PROBABLE FATTY-ACID-COA LIGASE FADD32 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE
MFVTGESGMAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKLAYRFLD
FSTERDGVARDILWSDFSARNRAVGARLQQVTQPGDRVAILCPQNLDYLI
SFFGALYSGRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVR
KFIRARSAKERPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRI
PSGVQITHLNLPTNVVQVLNALEGQEGDRGVSWLPFFHDMGLITVLLASV
LGHSFTFMTPAAFVRRPGRWIRELARKPGETGGTFSAAPNFAFEHAAVRG
VPRDDEPPLDLSNVKGILNGSEPVSPASMRKFFEAFAPYGLKQTAVKPSY
GLAEATLFVSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPNAVAQVSAG
KVGVSEWAVIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFK
NILKSRISESRAEGAPDDALWVRTGDYGTYFKDHLYIAGRIKDLVIIDGR
NHYPQDLECTAQESTKALRVGYVAAFSVPANQLPQTVFDDSHAGLKFDPE
DTSEQLVIVGERAAGTHKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGT
IPRTSSGKIGRRACRAAYLDGSLRSGVGSPTVFATSD
>Mb1380 fadD33, POSSIBLE POLYKETIDE SYNTHASE FADD33
MSELAAVLTRSMQASAGDLMVLDRETSLWCRHPWPEVHGLAESVAAWLLD
HDRPAAVGLVGEPTVELVAAIQGAWLAGAAVSILPGPVRGANDQRWADAT
LTRFLGIGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVAS
EGPAVLQGTAGSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWL
PLYHDMGLAFVLSAALAGAPLWLAPTTAFTASPFRWLSWLSDSGATMTAA
PNFAYNLIGKYARRVSEVDLGALRVTLNGGEPVDCDGLTRFAEAMAPFGF
DAGAVLPSYGLAESTCAVTVPVPGIGLLADRVIDGSGAHKHAVLGNPIPG
MEVRISCGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPDDWFATGDL
GYLGAGGLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGT
GDRSTRPGLVVAAEFRGPDEANARAELIQRVASECGIVPSDVVFVSPGSL
PRTSSGKLRRLAVRRSLEMAD
>Mb0036 fadD34, PROBABLE FATTY-ACID-COA LIGASE FADD34 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MTAALLSPAIAWQQISACTDRTLTITCEDSEVISYQDLIARAAACIPPLR
RLDLKRGEPVLITAHTNLEFLSCFLGLMLHGAVPVPIPPREALKTTERFM
TRLGPLLRHHRVLICTPAEHDEIRAAASTDCQISRFTALAEAGDEQFGRA
TAQQLADTATADWPLCTLDDDAYVQYTSGSTAAPRGVVITYRNLLSNMRA
MAVGSQFQHGDVMGSWLPLHHDMGLVGSLFAALFNSVSAVFTTPHRFLYD
PLGFLRLLTSSGATHTFMPNFALEWLINAYHRRGADIEGIDLHKMRRLII
ASEPVHAEGMRRFAATFAGVGLAPTALGSGYGLAEATVAVSMSAPNTGFR
TETHAAAEVVTGGRVLPGYEVRIDAAPGARAGTIKLRGDSVAAKAYVGGK
KLDALDEEGFCDTHDLGFLVDDEIVILGRQDEVFIVHGENRFPYDIEFII
RGESEQHRTKVACFGVNERVVVVLESPLDSIIDKAEADRLRCQVVAATGL
QLDELITVRRGAIPTTTSGKLKRRAVAQAYRDGTLPRLATHAWTADPDSA
PKTTRSSLEGAH
>Mb2533c fadD35, PROBABLE FATTY-ACID-COA LIGASE FADD35 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MAAAEVVDPNRLSYDRGPSAPSLLESTIGANLAATAARYGHREALVDMVA
RRRFNYSELLTDVHRLATGLVRAGIGPGDRVGIWAPNRWEWVLVQYATAE
IGAILVTINPAYRVREVEYALRQSGVAMVIAVASFKDADYAAMLAEVGPR
CPDLADVILLESDRWDALAGAEPDLPALQQTAARLDGSDPVNIQYTSGTT
AYPKGVTLSHRNILNNGYLVGELLGYTAQDRICIPVPFYHCFGMVMGNLA
ATSHGAAMVIPAPGFDPAATLRAVQDERCTSLYGVPTMFIAELGLPDFTD
YELGSLRTGIMAGAACPVEVMRKVISRMHMPGVSICYGMTETSPVSTQTR
ADDSVDRRVGTVGRVGPHLEIKVVDPATGETVPRGVVGEFCTRGYSVMAG
YWNDPQKTAEVIDADGWMHTGDLAEMDPSGYVRIAGRIKDLVVRGGENIS
PREIEELLHTHPDIVDGHVIGVPDAKYGEELMAVVKLRNDAPELTIERLR
EYCMGRIARFKIPRYLWIVDEFPMTVTGKVRKVEMRQQALEYLRGQQ
>Mb1225 fadD36, PROBABLE FATTY-ACID-COA LIGASE FADD36 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MLLASLNPAVVSAADIADAVRIDGDVLSRSDLVGAATSVAERVAGAHRVA
VLATPTASTVLAITGCLIAGVPVVPVPADVGVTERRHMLTDSGVQAWLGP
LPDDPAGLPHIPVRTHARSWHRYPEPSPGAIAMVVYTSGTTGPPKGVQLS
RRAIAADLDALAEAWQWTAEDVLVHGLPLYHVHGLVLGLLGSLRFGNRFV
HTGKPTPAGYAQACYEAHGTLFFGVPTVWSRVAADQAAAGALKPARLLVS
GSAALPVPVFDKLVQLTGHRPVERYGASESLITLSTRADGERRPGWVGLP
LAGVQTRLVDDDGGEVPHDGETVGKLQVRGPTLFDGYLNQPDATAAAFDA
DSWYRTGDVAVVDGSGMHRIVGRESVDLIKSGGYRVGAGEIETVLLGHPD
VAEAAVVGVPDDDLGQRIVAYVVGSANVDADGLINFVAQQLSVHKRPREV
RIVDALPRNALGKVLKKQLLSEG
>Mb0220 fadD4, PROBABLE FATTY-ACID-COA LIGASE FADD4 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MPRGELYKRFRLVMGGIAPCGSGRRAATYPRRMQIRPYIAADKPAVILYP
SGTVISFDELEARANRLAHWFRQAGLREDDVVAILMENNEHVHAVMWAAR
RSGLYYVPINTHLTASEAAYIVDNSGAKAIVGSAALRETCHGLAEHLPGG
LPDLLMLAGGGLVGWMTYPECVADQPDTPIEDEREGDLLQYSSGTTGRPK
GIKRELPHVSPDAAPGMMPALLDFWMDADSVYLSPAPMYHTAPSVWTMSA
LAAGVTTVVMEKFDAEGALDAIQRYRVTHAQFVPAMFVRMLKLPEAVRNS
YDMSSLRRVIHAAAPCPVQIKEQMIHWWGPIIDEYYASSEASGSTLITAE
DWLTHPGSVGKPIQGGVHIVGADGSELPPNQPGEIYFEGGYPFEYLNDPA
KTAASRNKHGWVTVGDVGYLDDDGYLFLTGRRHHMIISGGVNIYPQEAEN
LLVAHPKVLDAAVFGVPDDEMGQRVMAAVQTVDSADANDQFAGELLAWLR
DRLSHFKCPRSIAFEPQLPRTDTGKLYKSGLVEKYSV
>Mb0172 fadD5, PROBABLE FATTY-ACID-CoA LIGASE FADD5 (FATTY-ACID-CoA SYNTHETASE) (FATTY-ACID-CoA SYNTHASE)
MTAQLASHLTRALTLAQQQPYLARRQNWVNQLERHAMMQPDAPALRFVGN
TMTWADLRRRVAALAGALSGRGVGFGDRVMILMLNRTEFVESVLAANMIG
AIAVPLNFRLTPTEIAVLVEDCAAHVMLTEAALAPVAIGVRNIQPLLSVI
VVAGGSSQDSVFGYEDLLNEAGDVHEPVDIPNDSPALIMYTAGTTGRPKG
AVLTHANLTGQAMTALYTSGANINSDVGFVGVPLFHIAGIGNMLTGLLLG
LPTVIYPLGAFDPGQLLDVLEAEKVTGIFLVPAQWQAVCTEQQARPRDLR
LRVLSWGAAPAPDALLRQMSATFPETQILAAFGQTEMSPVTCMLLGEDAI
AKRGSVGRVIPTVAARVVDQNMNDVPVGEVGEIVYRAPTLMSCYWNNPEA
TAEAFAGGWFHSGDLVRMDSDGYVWVVDRKKDMIISGGENIYCAELENVL
ASHPDIAEVAVIGRADEKWGEVPIAVAAVTNDDLRIEDLGEFLTDRLARY
KHPKALEIVDALPRNPAGKVLKTELRLRYGACVNVERRSASAGFTERREN
RQKL
>Mb1238 fadD6, PROBABLE FATTY-ACID-COA LIGASE FADD6 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPNSKASIG
TVFQDRAARYGDRVFLKFGDQQLTYRDANATANRYAAVLAARGVGPGDVV
GIMLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIA
ESDLVSAVAECGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKD
TAFYIFTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKGSDTLYSCLPL
YHNNALTVAVSSVINSGATLALGKSFSASRFWDEVIANRATAFVYIGEIC
RYLLNQPAKPTDRAHQVRVICGNGLRPEIWDEFTTRFGVARVCEFYAASE
GNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRVRRVPDGEP
GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMG
HAAFVDRLGDTFRWKGENVATTQVEAALASDQTVEECTVYGVQIPRTGGR
AGMAAITLRAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKS
RKVELRNQAYGADIEDPLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG
>Mb0123 fadD7, PROBABLE FATTY-ACID-COA LIGASE FADD7 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MASDFGPRIADLVEVAATRLPEAPALVVTADRIAISHRDLARLVDELAGQ
LTRSGLLPGDRVALRMGSNAEFVVALLAASRADLVVVPLDPALPITEQRV
RSQAAGARVVLIDADGPHDRAEPTTRWWPLTVNVGGDSGPSGGTLSVHLD
AATEPNPATSTPEGLRPDDAMIMFTGGTTGLPKMVPWTHANIASSVRAII
TGYRLSPRDATVAVMPLYHGHGLIASLLATLASGGAVSLPARGRFSAHTF
WDDIKAVGATWYTAVPTIHQILLERSATEPSGRKPAALRFIRSCSAPLTA
QAALALQTEFAAPVVCAFGMTEATHQVTTTQIEGIDQTETPVVSTGLVGR
STGAQIRIVGSDGLPLPAGAVGEIWLRGTTVVRGYLGDPTITAANFTDGW
LRTGDLGSLSAAGDLSIRGRIKELINRGGEKISPERVEGVLASHPNVMEA
AVFGVPHQLYGEAVAAVIVPRESAPPTREELVQFCRERLAAFEIPASFQE
ASGLPHTAKGSLDRRAVAERFGHSV
>Mb0566c fadD8, PROBABLE FATTY-ACID-COA LIGASE FADD8 (FATTY-ACID-COA SYNTHETASE) (FATTY-ACID-COA SYNTHASE)
MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELLRSPTHN
GHLLVGALKRHQNKPVLFLGDTRLTGGQLADRISQYIQAFEALGAGTGVA
VGLLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLI
IDPNPMFVERALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQP
QPLVAADLPPDQVIGLTYTGGTTGKPKGVIGTAQSIATMTSIQLAEWEWP
ANPRFLMCTPLSHAGAAFFTPTVIKGGEMIVLAKFDPAEVLRIIEEQRIT
ATMLVPSMLYALLDHPDSHTRDLSSLETVYYGASAINPVRLAEAIRRFGP
IFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVALLDEHGKPV
KQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYY
IVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVVGAPDEKWGEAVTA
VVVLRSNAARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGL
GKPDKKAVRARFWEGAGRAVG
>Mb2269 kasA, 3-OXOACYL-[ACYL-CARRIER PROTEIN] SYNTHASE 1 KASA (BETA-KETOACYL-ACP SYNTHASE) (KAS I)
MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIHALEDEF
VTKWDLAVKIGGHLKDPVDSHMGRLDMRRMSYVQRMGKLLGGQLWESAGS
PEVDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGA
AAVIGLQLGARAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEG
PIEALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFGEAGALMLIET
EEHAKARGAKPLARLLGAGITSDAFHMVAPAADGVRAGRAMTRSLELAGL
SPADIDHVNAHGTATPIGDAAEANAIRVAGCDQAAVYAPKSALGHSIGAV
GALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYGDYRYAVNN
SFGFGGHNVALAFGRY
>Mb2270 kasB, 3-OXOACYL-[ACYL-CARRIER PROTEIN] SYNTHASE 2 KASB (BETA-KETOACYL-ACP SYNTHASE) (KAS I)
MGVPPLAGASRTDMEGTFARPMTELVTGKAFPYVVVTGIAMTTALATDAE
TTWKLLLDRQSGIRTLDDPFVEEFDLPVRIGGHLLEEFDHQLTRIELRRM
GYLQRMSTVLSRRLWENAGSPEVDTNRLMVSIGTGLGSAEELVFSYDDMR
ARGMKAVSPLTVQKYMPNGAAAAVGLERHAKAGVMTPVSACASGAEAIAR
AWQQIVLGEADAAICGGVETRIEAVPIAGFAQMRIVMSTNNDDPAGACRP
FDRDRDGFVFGEGGALLLIETEEHAKARGANILARIMGASITSDGFHMVA
PDPNGERAGHAITRAIQLAGLAPGDIDHVNAHATGTQVGDLAEGRAINNA
LGGNRPAVYAPKSALGHSVGAVGAVESILTVLALRDQVIPPTLNLVNLDP
EIDLDVVAGEPRPGNYRYAINNSFGFGGHNVAIAFGRY
>Mb2072 lppI, Probable lipoprotein lppI
MRIAALVAVSLLIAGCPREVGGDVGQSQTIAPPAPAPSAAPSTPPAAGAP
ITTIVSWIEAGHPVDPAAYHVATRDGVTTQLGDDVAFSASSGTVACMTDA
RHTSGTLACLVRLANPPPRPETAYGEWKGGWVDFDGIHLQVGSARADPGP
FVYGNGPELANGDTLSIGDYRCRSYQAGLFCVNYAHQSAVRFASAGIEPF
GCLKPAPPPDGVGVAFGC
>Mb3326c lpqC, POSSIBLE ESTERASE LIPOPROTEIN LPQC
MPWARMLSLIVLMVCLAGCGGDQLLARHASSVATFQFGGLTRSYRLHVPP
AEPSGLVISLHGGGGTGAGQEALTDFDAVADAADLLVVYPDGYDKSWADG
RGASPADRRHLDDVGFLVALAAKLVHDFDIAPGHVFATGMSNGGFMSNRL
ACDRADIFAAVAPVAGTLGVGVTCNPSRPVSVLEAHGTADPLVPFNGGAV
RGRGGLSHSISVASLVDRWRAVDGCQGDPSAAELPDVGDGTMVHLFDSSS
CAAGTEVISYQIDNGGHTWPGGRQYLPKAVIGATTRAFDGSQVIAQFFAT
HGRD
>Mb0690 lpqP, POSSIBLE CONSERVED LIPOPROTEIN LPQP
MLRRVAILLAAVLAFAGCSGGTRLAAGFGNGNSVHTLDVDGAGRSYRLYK
PVGLPSSAPLVVMLHGGFGSAKQAERSYGWDELADSEKFLVAYPDGYHRA
WNANGGGCCGRPAREGVDDIGFVRAVVADIANNVSIDPARVYVTGMSNGA
IMSYTLACNTSIFAAIGVVSGTQLDPCQSPRPVSVIHIHGTADPLVRYHG
GPGAGFARIDGPPVPDLNAFWREVNRCGALDTTTEGPVTTSGATCADNRR
VVLLTVDDAGHRWPSFATQTLWRFFAAHFR
>Mb0179 lprK, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRK (MCE-FAMILY LIPOPROTEIN MCE1E)
MMSVLARMRVMRHRAWQGLVLLVLALLLSSCGWRGISNVAIPGGPGTGPG
SYTIYVQMPDTLAINGNSRVMVADVWVGSIRAIKLKNWVATLTLSLKKDV
TLPKNATAKIGQTSLLGSQHVELAAPPDPSPVPLKDGDTIPLKRSSAYPT
TEQTLASIATLLRGGGLVNLEGIQQEINAIVTGRADQIRAFLGKLDTFTD
ELNQQRDDITRAIDSTNRLLAYVGGRSEVLNRVLTDLPPLIKHFADKQEL
LINASDAVGRLSQSADQYLSAARGDLHQDLQALQCPLKELRRAAPYLVGA
LKLILTQPFDVDTVPQLVRGDYMNLSLTLDLTYSAIDNAFLTGTGFSGAL
RALEQSFGRDPETMIPDIRYTPNPNDAPGGPLVERGNRQC
>Mb0609 lprL, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRL (MCE-FAMILY LIPOPROTEIN MCE2E)
MRCGVSAGSANGKPNRWTLRCGVSAGHRGSVFLLAVLLAPVVLTSCTWRG
IANVPLPVGRGMGPDRMTIYVQMPDTLALNTNSRVRVADVWVGTVRDISL
RNWIATLTLELEPTVRLPANATAKIGQTSLLGTQHVELAAPPIPSPQPLK
SGDTIGLKNSSAYPTVERTLASVALILTGGGIVNLDVI
>Mb3525c lprN, POSSIBLE MCE-FAMILY LIPOPROTEIN LPRN (MCE-FAMILY LIPOPROTEIN MCE4E)
MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSVTVEMAD
VATLPQNSPVMVDDVTVGSVAGIVAVQRPDGSFYAAVKLDLDKNVLLPAN
AVAKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEV
FSALGVVVNKGNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNR
QVHDIIDALDGLNRVSAILARDKDNLGRALDTLPDAVRVLNQNRDHIVDA
FAALKRLTMVTSHVLAETKVDFGEDLKDLYSIVKALNDDRKDFVTSLQLL
LTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRIGETFFTTAYFDPNMAHMD
EILNPPDFLIGELANLSGQAADPFKIPPGTASGQ
>Mb2965c mas, PROBABLE MULTIFUNCTIONAL MYCOCEROSIC ACID SYNTHASE MEMBRANE-ASSOCIATED MAS
MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPDRWDADD
YYDPEPGVPGRSVSRWGGFLDDVAGFDAEFFGISEREATSIDPQQRLLLE
TSWEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTG
LNNSVASGRIAHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLAL
AGGCAVLLEPHACVAASAQGMLSSTGRCHSFDADADGFVRSEGCAMVLLK
RLPDALRDGNRIFAVVRGTATNQDGRTETLTMPSEDAQVAVYRAALAAAG
VQPETVGVVEAHGTGTPIGDPIEYRSLARVYGAGTPCALGSAKSNMGHST
ASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFVPQAVTPWP
NGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLF
MLSSTSSDALRQTARQLATWVEEHQDCVAASDLAYTLARGRAHRPVRTAV
VAANLPELVEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQ
LLASEPVFAATIAKLEPVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAV
QVALAATMEQTYGVRPGAVVGHSMGESAAAVVAGALSLEDAARVICRRSK
LMTRIAGAGAMGSVELPAKQVNSELMARGIDDVVVSVVASPQSTVIGGTS
DTVRDLIARWEQRDVMAREVAVDVASHSPQVDPILDDLAAALADIAPMTP
KVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVFAE
LSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGA
ALDYSALYPAGRLVDAPLPAWTHARLFIDDDGQEQRAQGACTITVHPLLG
SHVRLTEEPERHVWQGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAA
AEVFGEAAEVRDITFEQMLLLDEQTPIDAVASIDAPGVVNFTVETNRDGE
TTRHATAALRAAEDDCPPPGYDITALLQAHPHAVNGTAMRESFAERGVTL
GAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYRIHPALLDACFQS
VGAGVQAGTATGGLLLPLGVRSLRAYGPTRNARYCYTRLTKAFNDGTRGG
EADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRAL
PEVGDGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWS
VQDTPPNDQAGLEKLGSQLRGRDGVVIVYGPRVGDPDEHSLLAGREQVRH
LVRITRELAEFEGELPRLFVVTRQAQIVKPHDSGERANLEQAGLRGLLRV
ISSEHPMLRTTLIDVDEHTDVERVAQQLLSGSEEDETAWRNGDWYVARLT
PSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEFVASDRVPPGPGQIE
VAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEGVTGHQV
GDRVGGFSEGGCWRTFLTCDANLAVTLPPGLTDEQAITAATAHATAWYGL
NDLAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLR
DMGVEHVYDSRSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFG
GRFVEIGKADVYGNTRLGLFPFRRGLTFYYLDLALMSVTQPDRVRELLAT
VFKLTADGVLTAPQCTHYPLAEAADAIRAMSNAEHTGKLVLDVPRSGRRS
VAVTPEQAPLYRRDGSYIITGGLGGLGLFFASKLAAAGCGRIVLTARSQP
NPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATATGLPLRGVLH
SAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSG
AALLGSPGQGAYAAANSWVDVFAHWRRAQGLPVSAIAWGAWGEVGRATFL
AEGGEIMITPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGE
MFASTGQRSRGPSKFRMELLSLPQDEWAGRLRRLLVEQASVILRRTIDAD
RSFIEYGLDSLGMLEMRTHVETETGIRLTPKVIATNNTARALAQYLADTL
AEEQAAAPAAS
>Mb2405 mbtA, BIFUNCTIONAL ENZYME MBTA: SALICYL-AMP LIGASE (SAL-AMP LIGASE) + SALICYL-S-ArCP SYNTHETASE
MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDTVLSDAA
RRWPDRLAVADAGDRPGHGGLSYAELDQRADRAAAALHGLGITPGDRVLL
QLPNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVAD
VASGFDYRPMARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAP
PADPGSPALLLVSGGTTGMPKLIPRTHDDYVFNATASAALCRLSADDVYL
VVLAAGHNFPLACPGLLGAMTVGATAVFAPDPSPEAAFAAIERHGVTVTA
LVPALAKLWAQSCEWEPVTPKSLRLLQVGGSKLEPEDARRVRTALTPGLQ
QVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNADGEPVGPGE
EGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTG
RVKDVICRAGETIAASDLEEQLLSHPAIFSAAAVGLPDQYLGEKICAAVV
FAGAPITLAELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVR
QLGIATGPVTTQRCH
>Mb2404c mbtB, PHENYLOXAZOLINE SYNTHASE MBTB (PHENYLOXAZOLINE SYNTHETASE)
MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMSLVGRWR
RKGIAVDFATLAATPTIEAWSQLVSAGTGVAPTAVAAPGDAGLSQEGEPF
PLAPMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALR
HPMLRVQFLPDGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDA
KSHQQLDGAVFELALTLLPGERTRLHVDLDMQAADAMSYRILLADLAALY
DGREPPALGYTYQEYRQAIEAEETLPQPVRDADRDWWAQRIPQLPDPPAL
PTRAGGERDRRRSTRRWHWLDPQTRDALFARARARGITPAMTLAAAFANV
LARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVDLTGARTAA
ARAQAVQEALRSAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGD
LFCPDVTEQFGTPGWIISQGPQVLLDAQVTEFDGGVLVNWDVREGVFAPG
VIDAMFTHQVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPS
TEALHDGFFRQAQQQPDAPAVFASSGDLSYAQLRDQASAVAAALRAAGLR
VGDTVAVLGPKTGEQVAAVLGILAAGGVYLPIGVDQPRDRAERILATGSV
NLALVCGPPCQVRVPVPTLLLADVLAAAPAEFVPGPSDPTALAYVLFTSG
STGEPKGVEVAHDAAMNTVETFIRHFELGAADRWLALATLECDMSVLDIF
AALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEVGG
GRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFE
VQDAANLPPDWASVPYGVPFPNNACRVVADSGDDCPDWVAGELWVSGRGI
ARGYRGRPELTAERFVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKI
SGYRVELGEIEAALQRLPGVHAAAATVLPGGSDVLAAAVCVDDAGVTAES
IRQQLADLVPAHMIPRHVTLLDRIPFTDSGKIDRAEVGALLAAEVERSGD
RSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFFALGGDSVLATQV
VAGIRRWLDSPSLMVADMFAARTIAALAQLLTGREANADRLELVAEVYLE
IANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAA
AAYRWLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDW
HLTAPLTLFGHCMGAIVAFEFARLAERNGVPVRALWASSGQAPSTVAASG
PLPTADRDVLADMVDLGGTDPVLLEDEEFVELLVPAVKADYRALSGYSCP
PDVRIRANIHAVGGNRDHRISREMLTSWETHTSGRFTLSHFDGGHFYLND
HLDAVARMVSADVR
>Mb2403c mbtC, POLYKETIDE SYNTHETASE MBTC (POLYKETIDE SYNTHASE)
MSDNDPVVIVGLAIEAPGGVETADDYWTLLSEQREGLGPFPTDRGWALRE
LFDGSRRNGFKPIHNLGGFLSSATTFDPEFFRISPREATAMDPQQRVGLR
VAWRTLENSGINPDDLAGHDVGCYVGASALEYGPALTEFSHHSGHLITGT
SLGVISGRIAYTLDLAGPALTVDTSCSSALAAFHTAVQAIRAGDCDLALA
GGVCVMGTPGYFVEFSKQHALSDDGHCRPYSAHASGTAWAEGAAMFLLQR
RSRATADRRRVLAEVRASCLNSDGLSDGLTAPSGDAQTRLLRRAIAQAAV
VPADVGMVEGHGTATRLGDRTELRSLAASYGTAPAGRGPLLGSVKSNIGH
AQAAAGGLGLVKVILAAQHAAIPPTLHVDEPSREIDWEKQGLRLADKLTP
WRAVDGWRTAAVSAFGMSGTNSHVIVSMPDTVSAPERGPECGEV
>Mb2402c mbtD, POLYKETIDE SYNTHETASE MBTD (POLYKETIDE SYNTHASE)
MAPKQLPDGRVAVLLSAHAEELIGPDARAIADYLERFPATTVTEVARQLR
KTRRVRRHRAVLRAADRLELAEGLRALAAGREHPLIARSSLGSAPRQAFV
FPGQGGHWPGMGAVAYRELPTYRTATDTCAAAFAAAGVDSPLPYLIAPPG
TDERQAFCEIEIEGAQFVHAVALAEVWRSCGVLPDLTVGHSLGEVAAAYL
AGSITLSDAVAVVAARANVVGRLPGRYAVAALGIGEQDASALIATTGGWL
ELSVVNASSTVAVSGERQAVAAIVDTVRSSGHFARGITVGFPVHTSVLES
LRDELCEQLPDSEFMEAPVQFIGGTTGDVVAPGTTFGDYWYANLRHTVRF
DRAVESAIRCGARAFIEISAHPALLFAIGQNCEGAANLPDGPAVLVGSAR
RGERFVDALSANIVSAAVADPGYPWGDLGGDPLDGDVDLSGFPNAPMRAV
PMWAHPEPLPPVSGLTIAVERWERMVPSTPVAGRHRHLAVLDLGAHRALA
QTLCAAIDSHPDTELSAARDAELILVIAPDFEHTDAVRAAGALADLVGAG
LLDYPMHIGARCQSVCLVTVGAEQVDAADAVPSAGQAALAAMHRSIGFEH
PEQTFSHLDLPSWDLDPVLGVSVITAVLRGFGETALRGSVNGYTLFERTL
ADAPAVPNWSLDSGVLDDVVVTGGAGAIGMHYARYLAEHGARRIVLLSRR
AADQATVAMLRKQHGTVIVSPPCDITDPTQLSAIAAEYGGVGASLIVHAA
GSVISGTAPGVTSAAVVDNFAAKVLGLAQMIELWPLRPDVRTLLCSSVMG
VWGGHGVVAYSAANRLLDVMAAQLRAQGRHCVAVKWGLWQAPKAGEPARG
IADAVTIARVERSGLRQMAPQQAIEASLHEFTVDPLVFAADAARLQMLLD
SRQFERYEGPTDPNLTIVDAVRTQLAAVLGIPQAGEVNLQESLFDLGVDS
MLALDLRNRLKRSIGATVSLATLMGDITGDGLVAKLEDADERSHTAQKVD
ISRD
>Mb2401c mbtE, PEPTIDE SYNTHETASE MBTE (PEPTIDE SYNTHASE)
MWFVQMADPSGALLNICVSYRITGDIDLARLRDAVNAVARRHRILRTTYP
VGDDGVAQPTVHADLRPGWTQYDLTDLSQRAQRLRLEVLAQREFCAPFEL
SRDAPLRITVVRTAADEHVLLLVAHHIAWDDGSWRVFFTDLTQAYSRADL
GADLGPEHRPSAASGPDTTEADLNYWRAIMADPPEPLELPGPAGTCVPTS
WRAARATLRLPADTAARVATMAKNTGCTPYMVLLAAFGALVHRYTHSDDF
LVAAPVLNRGAGTEDAIGYFGNTVAMRLRPQSAMSFRELLTATRDIASGA
FAHQRINLDRVVRELNPDRRHGAERMTRVSFGFREPDGGGFNPPGIECER
YDLRSNITQLPLGFMVEFDRAGVLVEAEHLVEILEPALAKQMLRHFGVLL
DNALAAPDNTLSGLALMDERDAARLREVSRGERFDTPVKTLVDLVNEQTT
RTPDATAVVYEGQHFTYHDLNEASNRLGHWLIEQGIGSEDRVAVLLDKSP
DLIVTALGVVKSGAVYVPVDPSYPQDRLDFILADCDAKLVLRTPVRELAG
YRSDDPTDADRIRPLRPDNTAYLIYTSGTTGLPKGVAVPHRPVAEYFVWF
KGEYDVDDTDRLLQVASPSFDVSIAEIFGTLACGARMVIPRPGGLTDIGY
LTALLRDEGITAMHFVPSLLGLFLSLPGVSQWRTLQRVPIGGEPLPGEVA
DKFHATFDALLHNFYGPTETVINASRFKVVGPQGTRIVPIGRPKINTTMH
LLDDSLQPVPTGVIGEIYIGGTHVAYGYHRRAGLTAERFVADPFNPGSRM
YRSGDLARRNADGDIEFVGRADEQVKIRGFRIELGDVAAAIAVDPTVGQA
VVVVSDLPRLGKSLVGYVTPAAGGDGPADVGVDLDRIRARVAAALPEYML
PAAYVVLDEIPITAHGKIDRAALPEPQIASDTEFRAPQTATERRLAQLFG
ELLGRDRVGADDSFFDLGGHSLLATKLVAAVRNAFGVDVGVREIFEFATV
TALAGHIDTLDSDSARPRLTRVDHDGPVRLSSSQMRSWFNYRFDGPNAVN
NIPFAAALHGPCDTNAFAAAITDVVARHEILRTVYREIGGVPHQIIQPPA
EVPVRCAAGSDAAWLRAELNNERGYVFDLETDWPIRAALLSTPEQTVLSL
VVHHIAGDHWSAGVLFTDLLTAYRARSTGQRPSWAPLPVQYADYSVWQSA
LLDDGAGIVGPQRDYWIRQLGGLAGETGLRPDFPRPALLSGAGDAVEFRL
GAAIRDKLAAVSRDLGVTEFMLLQAAVAVVLHKAGGGVDVPIGAPVAGRS
EANLDQLIGFFINIVVLRNDLRGNPTLREVLQRTRQMALAAYAHQDLPFD
QVVEAVNPQRSLSRNPLFDIVVHVREQMPQDHVIDTGPDGDTTLRVLEPT
FDAAQADLSVNFFACGDEYRGHVIYRTELYERATAQRFADWLVRVVEAFA
DRPDQPLREVEMVSAQARRRILDRSNAGAGTARVYLLDDALKPVPVGVVG
DVYYGGGPAVGARLARPSETATRFVADPFAAQPGSRLYRNGERGVWKADG
QLELLAEIERLPTAQAAPVPAEPADTETERALAAILADVLEVGEVGRYDD
FFNLGGDSILATQVAARARDGGIPLTARMVFEHPVLCELAAAVDAKPHVE
AEPDDKHHAPMSTSGLSPDELSALTASWDQWP
>Mb2400c mbtF, PEPTIDE SYNTHETASE MBTF (PEPTIDE SYNTHASE)
MGPVAVTRADARGAIDDVMALSPLQQGLFSRATLVAAESGSEAAEADPYV
IAMAADAAGPLDIALLRDCAAAMLTRHPNLRASFLHGNLSRPVQVIPSSA
EVLWRHVRAHPSEVGALAAEERRRRFDVGRGPLIRFLLIELPDECWHLVI
VAHHIVIDGWSLPLFVSELLALYRAGGHVAALPAAPRPYRDYIGWLAGRD
QTASRAMWADHLNGLDGPTLLSPALADTPVQPGIPGRTEVRLDREATAEL
ADAARTRGVTISTLVQMAWATTLSAFTGRGDVTFGVTVSGRPSELSGVET
MIGLFINTVPLRVRLDARATVGGQCAVLQRQFAMLRDHSYLGFNEFRAIA
GIGEMFDTLLVYENFPPGEVVGTAEFVANGVTFRPVALESLSHFPVTVAA
HRSTGELTLLVEVLDGALGTMAPESLGRRVLAVLQRLVSRWDRPLRDVDI
LLDGEHDPTAPGLPDVTTSAPAVHTRFAEIAAAQPDSVAVSWADGQLTYR
ELDALADRLATGLRRADVSRETPVAVALSRGPRYVAAMLAVLKAGGMIVP
LDPAMPGERVAEILRQTSAPVVIDEGVFAASVGADILEDDRAITVPVDQA
AYVIFTSGTTGTPKGVIGTHRALSAYADDHIERVLRPAAQRLGRPLRIAH
AWSFTFDAAWQPLVALLDGHAVHIVDDHRQRDAGALVEAIDRFGLDMIDT
TPSMFAQLHNAGLLDRAPLAVLALGGEALGAATWRMIQQNCARTAMTAFN
CYGPTETTVEAVVAAVAEHARPVIGRPTCTTRAYVMDSWLRPVPDGVAGE
LYLAGAQLTRGYLGRPAETAARFVAEPNGRGSRMYRTGDVVRRLPDGGLE
FLGRSDDQVKIRGFRVEPGEIAAVLNGHHAVHGCHVTARGHASGPRLTAY
VAGGPQPPPVAELRAMLLERLPRYLVPHHIVVLDELPLTPHGKIDENALA
AINVTEGPATPPQTPTELVLAEAFADVMETSNVDVTAGFLQMGLDSIVAL
SVVQAARRRGIALRARLMVECDTIRELAAAIDSDAAWQAPANDAGEPIPV
LPNTHWLYEYGDPRRLAQTEVIRLPDRITRERLDAVLAAVVDGHEVLRCR
FDRDAMALVAQPKTDILSEVWVSGELVTAVAEQTLGVLASLDPQAGRLLS
AVWLREPDGPGVLVLTAHVLAMDPASWRIVLGELDAGLHALAAGRAPSPA
RENTSYRQWSRLLAQRAKALDSVDFWVAELEGADPPLGARRVAPQTDRVG
ELAITMSISDADLTARLLSTGRSMTDLLATAAARMVTAWRRQRGQQTPAP
LLALETHGRADVHVDKTADTSDTVGLLSAIYPLRIHCDGATDFARIPGSG
IDYGLLRYLRADTAERLRAHREPQLLLNYLGSLHVGVGDLAVDRALLADV
GQLPEPEQPVRHELTVLAALLGPADAPVLATRWRTLPDILSADDVATLQS
LWQGALAEITA
>Mb2399c mbtG, LYSINE-N-OXYGENASE MBTG (L-LYSINE 6-MONOOXYGENASE) (LYSINE N6-HYDROXYLASE)
MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGANWQASG
GWTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATA
SFAEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWAL
CTHETTVQADALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAE
RVAVIGGGETAASMLNELFRHRVSTITVISPQVTLFTRGEGFFENSLFSD
PTDWAALTFDERRDALARTDRGVFSATVQEALLADDRIHHLRGRVAHAVG
RQGQIRLTLSTNRGSENFETVHGFDLVIDGSGADPLWFTSLFSQHTLDLL
ELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGPGFPNLSCL
GLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR
>Mb0175 mce1A, MCE-FAMILY PROTEIN MCE1A
MTTPGKLNKARVPPYKTAGLGLVLVFALVVALVYLQFRGEFTPKTQLTML
SARAGLVMDPGSKVTYNGVEIGRVDTISEVTRDGESAAKFILDVDPRYIH
LIPANVNADIKATTVFGGKYVSLTTPKNPTKRRITPKDVIDVRSVTTEIN
TLFQTLTSIAEKVDPVKLNLTLSAAAEALTGLGDKFGESIVNANTVLDDL
NSRMPQSRHDIQQLAALGDVYADAAPDLFDFLDSSVTTARTINAQQAELD
SALLAAAGFGNTTADVFDRGGPYLQRGVADLVPTATLLDTYSPELFCTIR
NFYDADPLAKAAAGGGNGYSLRTNSEILSGIGISLLSPLALATNGAAIGI
GLVAGLIASPLAVAANLAGALPGIVGGAPNPYTYPENLPRVNARGGPGGA
PGCWQPITRDLWPAPYLVMDTGASLAPYNHMEVGSPYAVEYVWGRQVGDN
TINP
>Mb0176 mce1B, MCE-FAMILY PROTEIN MCE1B
MKITGTVVKLGIVSVVLLFFTVMIIVIFGQMRFDRTNGYTAEFSNVSGLR
QGQFVRASGVEIGKVKALHLVDGGRRVRVEFNIDRSVPLYQSTTAQIRYS
DLIGNRYVELKRGEGKGANDLLPPGGLIPLSRTSPALDLDALIGGFKPVF
RALDPAKVNNIANALITVFQGQGGTINDTLDQTAQLTSQIAERDQAIGEV
VKNLNIVLDTTVKHRKEFDETVNNLENLITGLRNHSDQLAGGLAHISNGA
GTVADLLAENRTLVRKAVSYLDAIQQPVIDQRVELDDLLHKTPTALTALG
RANGTYGDFQNFYLCDLQIKWNGFQAGGPVRTVKLFSQPTGRCTPQ
>Mb0177 mce1C, MCE-FAMILY PROTEIN MCE1C
MRTLEPPNRMRIGLMGIVVALLVVAVGQSFTSVPMLFAKPSYYGQFTDSG
GLHKGDRVRIAGLGVGTVEGLKIDGDHIVVKFSIGTNTIGTESRLAIRTD
TILGRKVLEIEPRGAQALPPGGVLPVGQSTTPYQIYDAFFDVTKAASGWD
IETVKRSLNVLSETVDQTYPHLSAALDGVAKFSDTIGKRDEQITHLLAQA
NQVASILGDRSDQVDRLLVNAKTLIAAFNERGRAVDALLGNISAFSAQVQ
NLINDNPNLNHVLEQLRILTDLLVDRKEDLAETLTILGRFSASFGETFAS
GPYFKVLLANLVPGQILQPFVDAAFKKRGISPEDFWRSAGLPAYRWPDPN
GTRFPNGAPPPPPPVLEGTPEHPGPAVPPGSPCSYTPPADGLPRPWDPLP
CANLTQGPFGGPDFPAPLDVATSPPNPDGPPPAPGLPIAGRPGEVPPNVP
GTPVPIPQEAPPGARTLPLGPAPGPAPPPAAPGPPAPPGPGPQLPAPFIN
PGGTGGSGVTGGSEN
>Mb0178 mce1D, MCE-FAMILY PROTEIN MCE1D
MSTIFDIRNLRLPQLSRASVVIGSLVVVLALAAGIVGVRLYQKLTNNTVV
AYFTQANALYVGDKVQIMGLPVGSIDKIEPAGDKMKVTFHYQNKYKVPAN
ASAVILNPTLVASRNIQLEPPYRGGPVLADNAVIPVERTQVPTEWDELRD
SVSHIIDELGPTPEQPKGPFGEVIEAFADGLAGKGKQINTTLNSLSQALN
ALNEGRGDFFAVVRSLALFVNALHQDDQQFVALNKNLAEFTDRLTHSDAD
LSNAIQQFDSLLAVARPFFAKNREVLTHDVNNLATVTTTLLQPDPLDGLE
TVLHIFPTLAANINQLYHPTHGGVVSLSAFTNFANPMEFICSSIQAGSRL
GYQESAELCAQYLAPVLDAIKFNYFPFGLNVASTASTLPKEIAYSEPRLQ
PPNGYKDTTVPGIWVPDTPLSHRNTQPGWVVAPGMQGVQVGPITQGLLTP
ESLAELMGGPDIAPPSSGLQTPPGPPNAYDEYPVLPPIGLQAPQVPIPPP
PPGPDVIPGPVPPTPAPVGAPLPAEAGGGQ
>Mb0180 mce1F, MCE-FAMILY PROTEIN MCE1F
MLTRFIRRQLILFAIVSVVAIVVLGWYYLRIPSLVGIGQYTLKADLPASG
GLYPTANVTYRGITIGKVTAVEPTDQGARVTMSIASNYKIPVDASANVHS
VSAVGEQYIDLVSTGAPGKYFSSGQTITKGTVPSEIGPALDNSNRGLAAL
PTEKIGLLLDETAQAVGGLGPALQRLVDSTQAIVGDFKTNIGDVNDIIEN
SGPILDSQVNTGDQIERWARKLNNLAAQTATRDQNVRSILSQAAPTADEV
NAVFSGVRDSLPQTLANLEVVFDMLKRYHAGVEQLLVFLPQGAAIAQTVL
TPTPGAAQLPLAPAINYPPPCLTGFLPASEWRSPADTSPRPLPSGTYCKI
PQDAQLQVRGARNIPCVDVPGKRAATPKECRSKDPYVPLGTNPWFGDPNQ
ILTCPAPGARCDQPVKPGLVIPAPSINTGLNPAPADQVQGTPPPVSDPLQ
RPGSGTVQCNGQQPNPCVYTPTSGPSAVYSPASGELVGPDGVKYAVANSS
TTGDDGWKEMLAPAS
>Mb0604 mce2A, MCE-FAMILY PROTEIN MCE2A
MPTLVTRKNRRAWLYVEGVVLLLVGALVLVLVYKQFRGEFTPKTELTMVA
SRAGLVMEAGSKVTYNGVEIGRVGSISEIERDGRPAAKLVLDVNPRYISL
IPVNVVADIEAATLFGNKYVALSAPKIPQQQRISSHDVIDVGSVTTEFNT
LFETITSIAEKVDPIELNATLSAVAQAPDGLGGKFGESIVNGNQILAQLN
PRLPQLGYDVRRLADLGEVYVDASPDLWSFLQNALTTARTLTSQQRDLDA
ALLAATGAGNTGEDVFARGGPYLARAAADLVPTATLLDTYSPELFCMIRN
FHDAAPKVADAVGGNGYSLAAAGTILGAPNPYVYPDNLPRVNAHGGPGGR
PGCWQTITRELWPAPYLVMDTGASLAPYNHVELGQPMFTEYVWGRQYGEN
TINP
>Mb0605 mce2B, MCE-FAMILY PROTEIN MCE2B
MKTTGTTIKLGIVWLVLSVFTVMIIVVFGQVRFHHTTGYSAVFTHVSGLR
AGQFVRAAGVEVGKVAKVTLIDGDKQVLVDFTVDRSLSLDQATTASIRYL
NLIGDRYLELGRGHSGQRLAPGATIPLEHTHPALDLDALLGGFRPLFQTL
DPDKVNSIASSIITVFQGQGATINDILDQTASLTATLADRDHAIGEVVNN
LNTVLATTVKHQTEFDRTVDKLEVLITGLKNRADPLAAAAAHISSAAGTL
ADLLGADRPLLHSSFGHLEGIQQPLIDELAELDHVLGKLPDAYRIIGRAG
GIYGDFFNFYLCDISLKVNGLQPGGPVRTVKLFGQPTGRCTPQ
>Mb0606 mce2C, MCE-FAMILY PROTEIN MCE2C
MRTLTEFNRGRVGMMGAVVTVLVVGVAQSFTSVPMLFATPTYYAQFADMG
GINTGDKVEIAGVNVGLVRSLAIRGNRVLIGFSLPGKTIGMQSRAAIRTD
TILGRKNLEIEPRGSEPLKPNGFLPLAQTTTPYQIYDAFVDVTKAATGWD
IDAVKRSLNVLSETFDQTAPHLSAALEGVKAFSDTVGRRGEQIEQLLANA
NRIARVLGDRSEQVNGLLVNAKTLLAAFKQRSQALRILLTNVSEASAQVS
GLITDNPNLNHVLAQLRTVSEELVKRKNELADVAVLLGRYTAALTEAVGS
GPFFKAMVVNLLPYQILQPWVDAAFKKRGIDPENFWRSAGLPEFRWPDPN
GTRFPNGAPPAAPPVREGTPKHPGPAVPPGTPCSYTPAAGALPRPDNPLP
CAGATVGPFGGPDFPAPLDVQPSPPNPDGPPPTPGILSAGRPGEPAPAVP
GIPMPLPPNAPPGARTQPLEPFPDGTGGSNQ
>Mb0607 mce2Da, MCE-FAMILY PROTEIN MCE2DA [FIRST PART]
MSTIFDIRSLRLPKLSAKVVVVGGLVVVLAVVAAAAGARLYRKLTTTTVV
AYFSEALALYPGDKVQIMGVRVGSIDKIEPAGDKMRVTLHYSNKYQVPAT
ATASILNPSLVASRTIQLSPPYTGGPVLQDGAVIPIERTQVPVEWDQLRD
SINGILRQLGPTERQPKGPFGDLIESAADNLAGKGRQLNETLNSLSQALT
ALNEGRGDFVAITRSLALFVSALYQNDQQFVALNENLAEFTDWFTKSDHD
LADTVERIDDVLGTVRKFVSDNRSVLAADVNNLADATTTLVQPEPRDGLE
TALHVLPTYASNFNNLYYPLHSSLVGQFVFPNFANPIQLICSAIQAGSRL
GYQESAELCAQYLAPVLDALKFNYLPFGSNPFSSAATLPKEVAYSEERLR
PPPGYKDTTVPGIFSRDTPFSHGNHEPGWVVAPGMQGMQVQPFTANMLTP
ESLAELLGGPDIAPPAAGNQLARTAECV
>Mb0610 mce2F, MCE-FAMILY PROTEIN MCE2F
MLTRAIKTQLVLLTVLAVIAVVVLGWYFLRIPSLVGIGRYTLYAELPRSG
GLYRTANVTYRGITIGKVTGVEPTERGARATMSIDNGYQIPTDASANVHS
VSAVGEQFVDLVSTRTSGPYLRHGQTITTTTVPSQIGPALDAANRGLAVL
PKDRVASVLHEASEAVGGLGSSLNRLIEATQAIAHDVRGSLEDIDDIIER
SAPIIDSQVNSGNEIARWAANLNTLAAQTAQTDPAVRSILANAAPTADQV
NATFSDVRESLPQTLANLEVVIDMLKRYHNGVEQALVFLPQSGAIAQSVT
TEFPGQAGLGVGGLALNQPPPCLTGFLPASEWRSPADTSTAPLPKGTYCR
IPMDASNVVRGARNNPCVDVPGKRAATPRECRSNEAYVPGGTNPWYGDPN
QMLSCPAPAARCDQPVKPGQVIPAPSVNNGINPLPADQLPGTPPPVNDPL
QRPGSGTVQCNGQQPNPCVYTPSTFPTTIYDVQSGKVVAPDGVVYSVEAS
THAGADGWKVMLAPTG
>Mb3529c mce4A, MCE-FAMILY PROTEIN MCE4A
MSGGGSRRTSVRVAAALLAGLMVGSAVLTYLSYTAAFTSTDTVTVSSPRA
GLVMEKGAKVKYRGIQVGKVTDISYSGNQARLKLAIDSGEMGFIPSNATV
RIAGNTIFGAKSVEFIPPKTPSPKPLSPNAHVAASQVQLEVNTLFQSLID
LLHKIDPLETNATLSALSEGLRGHGDDLGALLSGLNTLTRQANPKLPALQ
EDFRKAAVVANVYADAAGDLNTVFDNLPTINKTIVDQKDNLNDTLLATIG
LSNNAYETLAPAEQNFIDAINRLRAPLKVTSDYSPVFGCLFKGIARGVKE
FAPLIGVRKAGLFTSSSFVLGAPSYTYPESLPIVNASGGPNCRGLPDIPT
KQTGGSFYRAPFLVTDNALIPYQPFTELQVDAPSTLQFLFNGAFAERDDF
>Mb3528c mce4B, MCE-FAMILY PROTEIN MCE4B
MAGSGVPSHRSMVIKVSVFAVVMLLVAAGLVVVFGDFRFGPTTVYHATFT
DASRLKAGQKVRIAGVPVGSVKAVKLNPDHSIDVAFAIDRSYTLYSSTRA
VIRYENLVGDRFLEITSGPGELRKLPPGGTINVAHTQPALDLDALLGGLR
PVLKGFDADKINTITSAVIELLQGQGGPLANVLADTGAFSAALGARDQLI
GEVITNLNAVLATVDAKSAQFSASVDQLQQLVSGLAKNRDPIAGAISPLA
STTTDLTELLRNSRRPLQGILENARPLATELDNRKAEVNNDIEQLGEDYL
RLSALGSYGAFFNIYFCSVTIKINGPAGSDILLPIGGQPDPSKGRCAFAK
>Mb3527c mce4C, MCE-FAMILY PROTEIN MCE4C
MLNRKPSSKHERDPLRTGIFGLVLVICVVLIAFGYSGLPFWPQGKTYDAY
FTDAGGITPGNSVYVSGLKVGAVSAVSLAGNSAKVTFSVDRSIVVGDQSL
AAIRTDTILGERSIAVSPAGSGKSTTIPLSRTTTPYTLNGVLQDLGRNAN
DLNRPQFEQALNVFTQALHDATPQVRGAVDGLTSLSRALNRRDEALQGLL
AHAKSVTSVLSERAEQVNKLVEDGNQLFAALDARRAALSALISGIDDVAA
QISGFVADNRKEFGPALSKLNLVLANLNERRDYITEALKRLPTYATTLGE
VVGSGPGFNVNVYSVLPGPLVATVFDLVFQPGKLPDSLADYLRGFIQERW
IIRPKSP
>Mb3526c mce4D, MCE-FAMILY PROTEIN MCE4D
MMGRVAMLTGSRGLRYATVIALVAALVGGVYVLSSTGNKRTIVGYFTSAV
GLYPGDQVRVLGVPVGEIDMIEPRSSDVKITMSVSKDVKVPVDVQAVIMS
PNLVAARFIQLTPVYTGGAVLPDNGRIDLDRTAVPVEWDEVKEGLTRLAA
DLSPAAGELQGPLGAAINQAADTLDGNGDSLHNALRELAQVAGRLGDSRG
DIFGTVKNLQVLVDALSESDEQIVQFAGHVASVSQVLADSSANLDQTLGT
LNQALSDIRGFLRENNSTLIETVNQLNDFAQTLSDQSENIEQVLHVAGPG
ITNFYNIYDPAQGTLNGLLSIPNFANPVQFICGGSFDTAAGPSAPDYYRR
AEICRERLGPVLRRLTVNYPPIMFHPLNTITAYKGQIIYDTPATEAKSET
PVPELTWVPAGGGAPVGNPADLQSLLVPPAPGPAPAPPAPGAGPGEHGGG
G
>Mb3524c mce4F, MCE-FAMILY PROTEIN MCE4F
MIDRLAKIQLSIFAVITVITLSVMAIFYLRLPATFGIGTYGVSADFVAGG
GLYKNANVTYRGVAVGRVESVGLNPNGVTAHMRLNSGTAIPSNVTATVRS
VSAIGEQYIDLVPPENPSSTKLRNGFRIQRQNTRIGQDVADLLRQAETLL
GSLGDTRLRELLHEAFIATNGAGPELARLIESARLLVDEANANYPQVSQL
IDQAGPFLQAQIRAGGDIKSLADGLARFTWQLRAADPRLRDTLAGAPDAI
DEANTAFSGIRPSFPALAASLANLGRVGVIYHKSIEQLLVVFPALFAAII
TSAGGVPQDEGAKLDFKIDLHDPPPCMTGFLPPPLVRSPADESVREIPRD
MYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPRGYVPVGTNPWR
GPPIPYGTEVTDGRNILPPNKFPYIPPGADPDPGVPIVGPPPPGQVAGPG
PAPHQPAQPAPPPNDNGPPPPFTSWMPPGYPPEPPQVPYPATIPPPPPPE
GTGPPPGPAPGPQPQASGPAYTIYDQLSGAFADPAGGTGIFAPGMTGASS
AENWVDLMRDPRQL
>Mb0556c menE, POSSIBLE O-SUCCINYLBENZOIC ACID--COA LIGASE MENE (OSB-COA SYNTHETASE) (O-SUCCINYLBENZOATE-COA SYNTHASE)
MLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTGPPKGAM
LTAAALTASASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVE
LNVSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTDPAATAALAELD
AVLIGGGPAPRPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLR
VLAGGRIAIGGATLAKGYRNPVSPDPFAEPGWFHTDDLGALESGDSGVLT
VLGRADEAISTGGFTVLPQPVEAALGTHPAVRDCAVFGLADDRLGQRVVA
AIVVGDGCPPPTLEALRAHVARTLDVTAAPRELHVVNVLPRRGIGKVDRA
ALVRRFAGEADQ
>Mb0674 mkl, POSSIBLE RIBONUCLEOTIDE-TRANSPORT ATP-BINDING PROTEIN ABC TRANSPORTER MKL
MRYSDSYHTTGRWQPRASTEGFPMGVSIEVNGLTKSFGSSRIWEDVTLTI
PAGEVSVLLGPSGTGKSVFLKSLIGLLRPERGSIIIDGTDIIECSAKELY
EIRTLFGVLFQDGALFGSMNLYDNTAFPLREHTKKKESEIRDIVMEKLAL
VGLGGDEKKFPGEISGGMRKRAGLARALVLDPQIILCDEPDSGLDPVRTA
YLSQLIMDINAQIDATILIVTHNINIARTVPDNMGMLFRKHLVMFGPREV
LLTSDEPVVRQFLNGRRIGPIGMSEEKDEATMAEEQALLDAGHHAGGVEE
IEGVPPQISATPGMPERKAVARRQARVREMLHTLPKKAQAAILDDLEGTH
KYAVHEIGQ
>Mb3596c nat, ARYLAMINE N-ACETYLTRANSFERASE NAT (ARYLAMINE ACETYLASE)
MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPLLGVPVD
DLSPQALADKLVLRRRGGYCFEHNGLMGYVLAELGYRVRRFAARVVWKLA
PDAPLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTT
HEPYRLEDRVDGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHP
ASKFVTGLTAAVITDDARWNLSGRDLAVHRAGGTEKIRLADAAAVVDTLS
ERFGINVADIGERGALETRIDELLARQPGADAP
>Mb0104 nrp, PROBABLE PEPTIDE SYNTHETASE NRP (PEPTIDE SYNTHASE)
MHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARFLAALHATV
LDNPVQLCVLENSGADYPDLVPRLRFGDIVRVGSADEHLQSTWCSGILGK
PLVRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARYLTTDPAGE
TPSVGAGLAKLREAHRRETAKVEESRGRLSAVVQRELADEAYHGGHGHSV
SDAPGTAAKGVLHESATICGNAFDAILTLSEAQRVPLNVLVAAAAVAVDA
SLRQNTETLLVHTVDNRFGDSDLNVATCLVNSVAQTVRFPPFASVSDVVR
TLDRGYVKAVRRRWLREEHYRRMYLAINRTSHVEALTLNFIREPCAPGLR
PFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLPACKTHPKV
AERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPARQPSTPAW
FLDSARGVHQFLGRRRFVYPWVAWLVQRGAAPGDVLVFTDDDTDKTIDLL
IACHLAGCGYSVCDTADEISVRTNAITEHGDGILVTVVDVAATQLAVVGH
DELRKVVDERVTQVTHDALLATKTAYIMPTSGTTGQPKLVRISHGSLAVF
CDAISRAYGWGAHDTVLQCAPLTSDISVEEIFGGAACGARLVRSAAMKTG
DLAALVDDLVARETTIVDLPTAVWQLLCADGDAIDAIGRSRLRQIVIGGE
AIRCSAVDKWLESAASQGISLLSSYGPTEATVVATFLPIVCDQTTMDGAL
LRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVTAADGSRRR
AFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRIAEDPAVSD
VAVELHSGSLGVWFKSQRTREGEQDAAAATRIRLVLVSLGVSSFFVVGVP
NIPRKPNGKIDSDNLPRLPQWSAAGLNTAETGQRAAGLSQIWSRQLGRAI
GPDSSLLGEGIGSLDLIRILPETRRYLGWRLSLLDLIGADTAANLADYAP
TPDAPTGEDRFRPLVAAQRPAAIPLSFAQRRLWFLDQLQRPAPVYNMAVA
LRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPRQLVIEARRADLG
CDIVDATAWPADRLQRAIEEAARHSFDLATEIPLRTWLFRIADDEHVLVA
VAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYVDYTLWQRE
ILGDLDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVADQRGASLV
VDWPASVQQQVRRIARQHNATSFMVVAAGLAVLLSKLSGSPDVAVGFPIA
GRSDPALDNLVGFFVNTLVLRVNLAGDPSFAELLGQVRARSLAAYENQDV
PFEVLVDRLKPTRALTHHPLIQVMLAWQDNPVGQLNLGDLQATPMPIDTR
TARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEAQAIDVLIERLRKVL
VAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPVSIPQMLAA
QVARIPEAEAVCCGDASMTYRELDEASNRLAHRLAGCGAGPGECVALLFE
RCAPAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVAAVTTAGLR
SRLAGHDLPIIDVVDALAAYPGTPPPMPAAVNLAYILYTSGTTGEPKGVG
ITHRNVTRLFASLPARLSAAQVWSQCHSYGFDASAWEIWGALLGGGRLVI
VPESVAASPNDFHGLLVAEHVSVLTQTPAAVAMLPTQGLESVALVVAGEA
CPAALVDRWAPGRVMLNAYGPTETTICAAISAPLRPGSGMPPIGVPVSGA
ALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTASRFVACPFGGS
GARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVATALAELAG
VGQAVVIAREDRPGDKRLVGYATEIAPGAVDPAGLRAQLAQRLPGYLVPA
AVVVIDALPLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKTVAGIFARV
LGLERVGVDDSFFELGGDSLAAMRVIAAINTTLNADLPVRALLHASSTRG
LSQLLGRDARPTSDPRLVSVHGDNPTEVHASDLTLDRFIDADTLATAVNL
PGPSPELRTVLLTGATGFLGRYLVLELLRRLDVDGRLICLVRAESDEDAR
RRLEKTFDSGDPELLRHFKELAADRLEVVAGDKSEPDLGLDQPMWRRLAE
TVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKLKPFTYVSTAD
VGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEVLLREANDL
CALPVAVFRCGMILADTSYAGQLNMSDWVTRMVLSLMATGIAPRSFYEPD
SEGNRQRAHFDGLPVTFVAEAIAVLGARVAGSSLAGFATYHVMNPHDDGI
GLDEYVDWLIEAGYPIRRIDDFAEWLQRFEASLGALPDRQRRHSVLPMLL
ASNSQRLQPLKPTRGCSAPTDRFRAAVRAAKVGSDKDNPDIPHVSAPTII
NYVTNLQLLGLL
>Mb1184c omt, PROBABLE O-METHYLTRANSFERASE OMT
MSAHKPAKQRVALTGVSETALLTLNARAAEARRRDTIIDDPMAVALVESI
DFGFAKFGPTGQGFALRARAFDMAAQHYLDQHPAATVVALAEGLQTSFWR
LDVAIPGGQFRWLTVDLPPIVDLRTRLLPSSPRVSVCAQSALDYSWMDSV
DPAGGVFITAEGLLMYLQPEQALGLIAQCAQTFPGGQMLFDLPPRWFAGW
SRLGLRTSLRYKVPRMPFSMSVAQAADLVNKVPGVVAVRDLRVPPGRGLW
VNMALSTVYRLPVFDPLRPCLTLLEFSRPARG
>Mb0272c oplA, PROBABLE 5-OXOPROLINASE OPLA (5-OXO-L-PROLINASE) (PYROGLUTAMASE) (5-OPASE)
MVGAGWHFWVDRGGTFTDVVARRPDGRLLTHKLLSDNPARYRDAAVAGIR
ALLANGEAGTRVDAVRMGTTVATNALLERTGERTLLVITRGFGDALRIAY
QNRPRIFDRRIVLPEMLYERVVEVDERVTADGRVLRAPDLEALGEKMRQA
HADGIRAVAVVCLHSYLYPGHEREIGTLAQRIGFAQISLSSEVSPLMKLV
PRGDTTVVDAYLSPVLRRYINQVADQMRGVRLMFMQSNGGLAQAGHFRGK
DAILSGPAGGIVGMVRMSALAGFDHVIGFDMGGTSTDVSHYAGEYERVFT
TQVAGVRLRAPMLDIHTVAAGGGSILHFDGSRYRVGPDSAGADPGPACYR
GGGPLCVTDANVMLGRIQPTHFPSVFGPSGDQPLDAGTVRRGFTDLAADI
AARTGDDRSPEQVAEGYLRIAVANMANAVKKISVQKGHDVTRYALTTFGG
AGGQHACAVADALGIRTVLIPPMAGVLSALGIGLADTTAMREQSVEIPLG
PAAPQRLASVAESLERAARAELLDEGVPGERIRVVRRVHLRYEGTDTAIP
VQLAEIETMATAFESSHRALYTFLLDRPLIAEAISVEATGLTDQPDLSQL
GDQANDTTGSSETVRIYSNGLWRDAPLRRREAMRPGDVLTGPAIIAEANA
TTVVDDGWQATMTETGHLLAQRVVTPPRPDAATRAGFEAGFEADPVLLEI
FNNLFMSIAEQMGFRLEATAQSVNIRERLDFSCALFDPDGNLVANAPHIP
VHLGSMGTTVKEVIRRRLSGMKPGDVYAVNDPYHGGTHLPDITVITPVFN
TGGEDVLFFVASRGHHAEIGGITPGSMPADSREIHEEGVLFDNWLLAENG
RFREAETRRLLTEAPFGSRNPDTNLADLRAQIAANQKGVDEVGKMIDHFG
RDVVAAYMRHVQDNAEEAVRRVIDRLDNGAYRYRMDSGATIAVRITVDRA
ARSATIDFTGTSAQLDTNFNAPTSVVNAAVLYVFRTLVADDIPLNDGCLR
PLRIVVPEGSMLAPTHPAAVVAGNVETSQAITGALFAALGVQAEGSGTMN
NVTFGNERHQYYETVGSGSGAGDGYHGASVVQTHMTNSRLTDPEVLEWRY
PVLLREFAVRQGSGGAGRWRGGDGAVRRLEFTEPMTVSTLSGHRRVRPYG
MAGGSPGELGRNRVERADGSTVELAGCGSTHVEPGDTLVIETPGGGGYGP
ASTSARRRR
>Mb3854c papA1, PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA1
MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPPSYVQAR
QIRSFSEQAARGLDHSRLLIASVEVFGHCDLRAMTYVINAHLRRHDTYRS
WFELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWD
CFSFGVIQRADSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPI
GLSEAGSYVDFCVRQHEYTSALTVDSPEVRAWIDFAEINNGTFPEFPLPL
GDPSVRCGGDLLSMMLMDEQQTQRFESACMAANARFIGGMLACIAIAIHE
LTGADTYFGITPKDIRTPADLMTQGWFTGQIPVTVPVAGLSFNEIARIAQ
TSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVGPLSAVTKL
FEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRA
IRSVCMRIANGGTAERVGNVVALSPGRRNNIERMTWRSCRAGDFIDICNL
KVANVTVDREA
>Mb3850c papA2, POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA2
MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQAQHLRR
YRDHVARGLDMSRLMIFTWDLPGRCNIRAMNYAINAHLRRHDTYHSWFEF
DNAEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFG
IIQSDDHFTFYASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPA
GRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPHFPLPLGDLSV
PHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAALAERELTNCE
TFDVVTTTDTRRTPTELRTTGWFTGLVPITVPVASGLFDSAARVAQISFD
SGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAPLSTVANSD
LNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMK
SIYIRTADGTLAILKPGT
>Mb1214 papA3, PROBABLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA3
MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVPVSYMQA
QHIRGYCEQKAKGLDYSRLMVVSCQQPGQCDIRAANYVINAHLRRHDTYR
SWFQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQW
GCFRFGIVQGCDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAP
LELPPAGSYDDFCRRQHTFSSTLTVESPQVRAWTKFAEGTNGSFPDFPLP
LGDPSKPSDADIVTVMMLDEEQTAQFESVCTAAGARFIGGVLACCGLAEH
ELTGTTTYYGLTPRDTRRTPADAMTQGWFTGLIPITVPIAGSAFGDAARA
AQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAGAAPLSVLL
TAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVAR
YLATLKSVFQRVAESGQQQNVA
>Mb2964 papA5, POSSIBLE CONSERVED POLYKETIDE SYNTHASE ASSOCIATED PROTEIN PAPA5
MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFDALLETH
PVLASHLEQSSDGGWNLVADDLLHSGICVIDGTAATNGSPSGNAELRLDQ
SVSLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTG
DPGPITPQPTPLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPA
VLAHPGLPQAVPVTRLWLSKQQTSDLMAFGREHRLSLNAVVAAAILLTEW
QLRNTPHVPIPYVYPVDLRFVLAPPVAPTEATNLLGAASYLAEIGPNTDI
VDLASDIVATLRADLANGVIQQSGLHFGTAFEGTPPGLPPLVFCTDATSF
PTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEHHGHIAEPG
KSLEAIRSLLCTVPSEYGWIME
>Mb2971c pks1, PROBABLE POLYKETIDE SYNTHASE PKS1
MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQRATEPVA
VVGIGCRFPGGVDGPDGLWDVVSAGRDVVSEFPTDRGWDVEGLYDPDPDA
EGKTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEH
AGIDPLSLRGSATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVS
YVLGLQGPAVSVDTACSSSLVAIHWAMSSLRSGECDLALAGGVTVMGLPS
IFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVVVLERLSDARRLGHS
VLAVVRGSAVNQDGASNGLTAPNGLAQQRVIQAALANAGLSAADVDVVEA
HGTATTLGDPIEAQALLSTYGQGRPAEQPLWVGSIKSNMGHTQAAAGVAG
VIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRR
AAVSSFGISGTNAHLILEEAPVPAPAEAPVEASESTGGPRPSMVPWVISA
RSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRAVVVGASRE
QLIAGLAGLAAGEPGAGVAVGQPGSVGKTVVVFPGQGAQRIGMGRELYGE
LPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEFAQPALFAVEV
ASFAVLRDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLVVARGRLMQ
ALPAGGAMVAVAASEDEVEPLLGEGVGIAAINAPESVVISGAQAAANAIA
DRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQAREPQLGLVSN
VTGELAGPDFGSAQYWVDHVRRPVRFADSARHLQTLGATHFIEAGPGSGL
TGSIEQSLAPAEAMVVSMLGKDRPELASALGAAGQVFTTGVPVQWSAVFA
GSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHALLGAVVER
PDSDEVVLTGRLSLADQPWLADHVVNGVVLFPGAGFVELVIRAGDEVGCA
LIEELVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYSRGDQSQGWLL
NAEGMLGVAAAETPMDLSVWPPEGAESVDISDGYAQLAERGYAYGPAFQG
LVAIWRRGSELFAEVVAPGEAGVAVDRMGMHPAVLDAVLHALGLAVEKTQ
ASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCDATGLPVLT
VRSLVTRPITAEQLRAAVTAAGGASDQGPLEVVWSPISVVSGGANGSAPP
APVSWADFCAGSDGDASVVVWELESAGGQASSVVGSVYAATHTALEVLQS
WLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQAENPGRIVL
IDTDAAVDASVLAGVGEPQLLVRGGTVHAPRLSPAPALLALPAAESAWRL
AAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVGVNFRDVVAALGMYP
GQAPPLGAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAVVDQQLVTR
VPQGWSFAQAAAVPVVFLTAWYGLADLAEIKAGESVLIHAGTGGVGMAAV
QLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEEKFLAVTEG
RGVDVVLDSLAGEFVDASLRLLVRGGRFLEMGKTDIRDAQEIAANYPGVQ
YRAFDLSEAGPARMQEMLAEVRELFDTRELHRLPVTTWDVRCAPAAFRFM
SQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARHLVGAYGVR
HLVLASRRGDRAEGAAELAADLTEAGAKGQVVACDVADRAAVAGLFAQLS
REYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAAWNLHQATSDL
DLSMFVLCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGLAGISLAWG
LWEQPGGMTAHLSSRDLARMSRSGLAPMSPAEAVELFDAALAIDHPLAVA
TLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDATSSKSALAQRL
HGLAADEQLELLVGLVCLQAAAVLGRPSAEDVDPDTEFGDLGFDSLTAVE
LRNRLKTATGLTLPPTVIFDHPTPTAVAEYVAQQMSGSRPTESGDPTSQV
VEPAAAEVSVHA
>Mb1688 pks10, Possible chalcone synthase pks10
MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHASAKVNSR
HLVLPLEKYPKLTDFGEANKIFIEKAVDLGVQALAGALDESGLRPEDLDV
LITATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHD
YLRGAPDGVAALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGV
KRAQDIGADGPDILDSRSHLYPDSLRTMGYDVGSAGFELVLSRDLAAVVE
QYLGNDVTTFLASHGLSTTDVGAWVTHPGGPKIINAITETLDLSPQALEL
TWRSLGEIGNLSSASVLHVLRDTIAKPPPSGSPGLMIAMGPGFCSELVLL
RWH
>Mb1693 pks11, Possible chalcone synthase pks11
MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAAAKVNGR
HLVLPLQQYPSLTDFGDANEIFIEKAVDLGVEALLGALDDANLRPSDIDM
IATATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRD
YLRGAPDDVAVLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGD
RRAEQVRAGGPDILDSRSSLYPDSLHIMGWDVGSHGLRLRLSPDLTNLIE
RYLANDVTTFLDAHRLTKDDIGAWVSHPGGPKVIDAVATSLALPPEALEL
TWRSLGEIGNLSSASILHILRDTIEKRPPSGSAGLMLAMGPGFCTELVLL
RWR
>Mb2074c pks12, Probable polyketide synthase pks12
MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGV
DSPEGLWQMVADARDVMSEFPTDRGWDLAGLFDPDPDVRHKSYARTGGFV
DGVADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSA
TGVFAGLIVGGYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVD
TACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPTVFVEFSRHRGLAP
DGRCKPYAGRADGVGWSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQD
GASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEA
QALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHEL
LPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTN
AHVIIEAVPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDD
GLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRG
TATAAGKTVFVFPGQGSQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDW
SLVDVLRGAPGAPGLDRVDVVQPVLFAVMVSLAELWKSVAVHPDAVIGHS
QGEIAAAYVAGALSLRDAARVVTLRSKLLAGLAGPGGMVSIACGADQARD
LLAPFGDRVSIAVVNGPSAVVVSGEVGALEELIAVCSTKELRTRRIEVDY
ASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADYWY
RNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSE
AIVVPTLGRGDGGLHRFLLSAASAFVAGVAVNWRGTLDGAGYVELPTYAF
DKRRFWLSAEGSGADVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPN
VQPWLADHAVSDVVLFPGTGFVELAIRAGDEVGCSVLDELTLAAPLLLPA
TGSVAVQVVVDAGRDSNSRGVSIFSRADAQAGWLLHAEGILRPGSVEPGA
DLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTAMWARGEEIFAEV
RLPEAAGGVGGFGVHPALLDAVLHAVVIAGDPDELALPFAWQGVSLHATG
ASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSG
SGPDRLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQAL
AAVQSWLTDHESGVLVVATRGAMALPREDVADLAGAAVWGLVRSAQTEHP
GRIVLVDSDAATDDAAIAMALATGEPQVVLRGGQVYTARVRGSRAADAIL
VPPGDGPWRLGLGSAGTFENLRLEPVPNADAPLGPGQVRVAMRAIAANFR
DIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSVFGFFPDGSGTL
VAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQRVLIHA
GTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEF
EDKFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPG
VIAQQYPGVRYRAFDLFEAGPDRIAQILAELATLFGDGVLRPLPVTTFDV
RRAPAALRYLSQARHTGKVVMLMPGSWAAGTVLITGGTGMAGSAVARHVV
ARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVACDAADRAALAK
VIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHE
LTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAI
SLGWGLWDQASAMTSGLDAADLARLGREGVLALSTAEALELFDTAMIVDE
PFLAPARIDLTALRAHAVAVPPMFSDLASAPTRRQVDDSVAAAKSKSALA
HRLHGLPEAEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLT
AVEMRNRLKSATGLSLSPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPA
VRTTSEDPIAIVGMACRYPGGVNSPDDMWDMLIQGRDVLSEFPADRGWDL
AGLYNPDPDAAGACYTRTGGFVDGVGDFDPAFFGVGPSEALAMDPQQRML
LELSWEALERAGIDPTGLRGSATGVFAGVMTQGYGMFAAEPVEGFRLTGQ
LSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLALA
GGVTVNATPDIFVEFSRWRGLSPDGRCKAFAAAADGTGFSEGGGMLVLQR
LSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRAALANAGL
SAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNM
GHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAP
RVWPAGARTRRAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVV
SAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGD
RDRLLAGLDELAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMGMGLH
AGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNSTEFAQPALFAV
EVALFRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRL
MQALPAGGAMVAVQAAEEEVRPLLSAEVDIAAVNGPASLVISGAQNAVAA
VADQLRADGRRVHQLAVSHAFHSPLMDPMIDEFAAVAAGIAIGRPTIGVI
SNVTGQLAGDDFGSAAYWRRHIRQAVRFADSVRFAQAAGGSRFLEVGPSG
GLVASIEESLPDVAVTTMSALRKDRPEPATLTNAVAQGFVTGMDLDWRAV
VGEAQFVELPTYAFQRRRFWLSGDGVAADAASLGLAASEHALLGAVIDLP
ASGGVVLTGRLSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGV
VDELTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSRGDVGTGWVLH
AEGALRAGSAEPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGL
TAMWRRGDEVFAEVALPADAGVSVTGFGVHPVLLDAALHAVVLSAESAER
GQGSVLVPFSWQGVSLHAAGASAVRARIAPVGPSAVSIELADGLGLPVLS
VASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQPSAAVEPLPVCAWGTT
EDSAAVVFESVPLAGDVVAGVYAATSSVLDVLQSWLTRDGAGVLVVMTRG
AVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVV
TTGEPQVLWRRGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENL
RLELIPDADAPLGPGQVRVAVSAIAANFRDVMIALGLYPDPDAVMGVEAC
GVVIETSLNKGSFAVGDRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAA
ATTSVVFATAHYALVDLADVQPGQRVLIHAGTGGVGMAAVQLARHLGLEV
FATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSL
AGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEAG
PDRIAQILAELATLFGDGVLRPLPVTTFDVRRAPAALRYLSQARHTGKVV
MLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPG
AAELVAELAAAGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGA
LDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLV
GSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQASAMTSGLATV
DFKRFARDGIVAMSSADALQLFDTAMIVDEPFMLPAHIDFAALKVKFDGG
TLPPMFVDLINAPTRRQVDDSLAAAKSKSALAHRLHGLPEDEQHAVLLDL
VRSHIATVLGSASPEAIDPDRAFQDLGFDSLTAVEMRNRLKSATGLSLSP
TLIFDYPNSAALAGYMRRELLGSSPQDTSAVAAGEAELQRIVASIPVKRL
RQAGVLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDD
E
>Mb3830c pks13, POLYKETIDE SYNTHASE PKS13
MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDESVPMVE
LGLSSRDAVAMAADIEDLTGVTLSVAVAFAHPTIESLATRIIEGEPETDL
AGDDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGIT
DLPDGRWSEFLEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADN
IDPQQRMALELTWEALEHARIPASSLRGQAVGVYIGSSTNDYSFLAVSDP
TVAHPYAITGTSSSIIANRVSYFYDFHGPSVTIDTACSSSLVAIHQGVQA
LRNGEADVVVAGGVNALITPMVTLGFDEIGAVLAPDGRIKSFSADADGYT
RSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLIAPNQDAQA
DVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADR
PALLGAVKTNVGHLESAAGAASMAKVVLALQHDKLPPSINFAGPSPYIDF
DAMRLKMITTPTDWPRYGGYALGGVSSFGFGGANAHVVVREVLPRDVVEK
EPEPEPEPKAAAEPAEAPTLAGHALRFDEFGNIITDSAVAEEPEPELPGV
TEEALRLKEAALEELAAQEVTAPLVPLAVSAFLTSRKKAAAAELADWMQS
PEGQASSLESIGRSLSRRNHGRSRAVVLAHDHDEAIKGLRAVAAGKQAPN
VFSVDGPVTTGPVWVLAGFGAQHRKMGKSLYLRNEVFAAWIEKVDALVQD
ELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVIGQ
SLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEY
SADEIREVFSDFPDLEVCVYAAPTQTVIGGPPEQVDAILARAEAEGKFAR
KFATKGASHTSQMDPLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGG
EPIHDVEYWKKGLRHSVYFTHGIRNAVDSGHTTFLELAPNPVALMQVALT
TADAGLHDAQLIPTLARKQDEVSSMVSTMAQLYVYGHDLDIRTLFSRASG
PQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVALPDGRHVWEYAPR
DGNVDLAALVRAAAAHVLPDAQLTAAEQRAVPGDGARLVTTMTRHPGGAS
VQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPET
PAEDADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEP
EDLPWEVPLIELGLDSLMAVRIKNRVEYDFDLPPIQLTAVRDANLYNVEK
LIEYAVEHRDEVQQLHEHQKTQTAEEIARAQAELLHGKVGKTEPVDSEAG
VALPSPQNGEQPNPTGPALNVDVPPRDAAERVTFATWAIVTGKSPGGIFN
ELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNIEALADKVRTYLEAG
QIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPADTPMYGF
ERVEGSIEERAQQYVPKLIEMQGDGPYVLVGWSLGGVLAYACAIGLRRLG
KDVRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYE
QLEELDDEGQVRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQP
YDGHVTLYMADRYHDDAIMFEPRYAVRQPDGGWGEYVSDLEVVPIGGEHI
QAIDEPIIAKVGEHMSRALGQIEADRTSEVGKQ
>Mb1041 pks16, PUTATIVE POLYKETIDE SYNTHASE PKS16
MSRFTEKMFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAGGLAAAG
VGLGDVVGVLAGFPVEIAPTAQALWMRGASLTMLHQPTPRTDLAVWAEDT
MTVIGMIEAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVG
EDDLALMQLTSGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSW
LPCFHDMGMVGFLTIPMFFGAELVKVTPMDFLRDTLLWAKLIDKYQGTMT
AAPNFAYALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDLLDAG
KPFGLRPSAILPAYGMAETTLAVSFSECNAGLVVDEVDADLLAALRRAVP
ATKGNTRRLATLGPLLQDLEARIIDEQGDVMPARGVGVIELRGESLTPGY
LTMGGFIPAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPT
DIERAAGRVDGVRPGCAVAVRLDAGHSRESFAVAVESNAFEDPAEVRRIE
HQVAHEVVAEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT
>Mb1691 pks17, Probable polyketide synthase pks17
MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHV
GKVVLTMPDAWAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGE
HTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAG
VLDDAVITGLTPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGI
VGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQASAMTEHLGE
RDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELP
ALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLTPEQRHRELTELVCSNA
AIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY
PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPD
DKTRLIKRLQAILTDCTAPPASSGPSTTHDDEDITTATESQLFAILDDEL
GP
>Mb3855c pks2, POLYKETIDE SYNTHASE PKS2
MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPELLWKAL
LRGDDLITEVPPDRWDCDEFYDPQPGVPGRTVCKWGGFLDNPADFDCEFF
GIGEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGD
YTMVAADAKQLEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGL
TAVHMACRSLHEGESDVALAGGVALMLEPRKAAAGSALGMLSPTGRCRAF
DVAADGFVSGEGCAVVVLKRLPDALADGDRILAVIRGTSANQDGHTVNIA
TPSQPAQVAAYRAALAAGGVDAATVGMVEAHGPGTPIGDPIEYASVSEVY
GVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRNLHFTRLPD
EIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQ
TEAQPHAASTPPTGTPALFTLSASSADALRQTAQRLTDWIQQHADSLVLS
DLAYTLARRRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDR
GPVWLFSGQGSQWAAMGADLLTNESVFAATVAELEPLIAAESGFSVTEAM
TAPETVTGIDRVQPTIFAMQVALAATMAAYGVRPGAVIGHSMGESAAAVV
AGVLSAEDGVRVICRRSKLMATIAGSAAMASVELPALAVQSELTALGIDD
VVVAVVTAPQSTVIAGGTESVRKLVDIWERRDVLARAVAVDVASHSPQVD
PILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHTVR
FSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQ
PLPLGLRRLLTDLHNAGAAVDFSVLCPQGRLVDAPLPAWSHRFLFYDREG
VDNRSPGGSTVAVHPLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIH
NVAALPGAAYCEMALSAARAVLGEQSEVRDMRFEAMLLLDDQTPVSTVAT
VTSPGVVDFAVEALQEGVGHHLRRASAVLQQVSGECEPPAYDMASLLEAH
PCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATATMLAEVALPGSI
RSQQGLYAIHPALLDACFQSVGAHPDSQSVGSGLLVPLGVRRVRAYAPVR
TARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHN
RVLNERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFRE
HSAACTTMRWPLHDDQLAAADQLRDQVGSDEFSGVVVLTGSNTGTPHQGS
ADRGAEYVRRLVGIARELSDLPGAVPRMYVVTRGAQRVLADDCVNLEQGG
LRGLLRTIGAEHPHLRATQIDVDEQTGVEQLARQLLATSEEDETAWRDNE
WYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPGDMQTIELAAFHRVP
PGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGVVTAVGP
GVTDHKVGDHVGGMSPNGCWGTFVTCDARLAATLPPGLGDAQAAAVTTAH
ATAWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTP
QRRELLRNMGIEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAG
LKLLAFRGRFVEIGKRDIYGDTKLGLFPFRRNLSFYAVDLGLLSATHPEE
LRDLLGTVYRLTAAGELPMPQSTHYPLVEAATAIRVMGNAEHTGKLVLHI
PQTGKSLVTLPPEQAQVFRPDGSYIITGGLGGLGLFLAEKMAAAGCGRIV
LNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAERLVATAVATGL
PVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDW
FCLFSSAAALTGSPGQSAYSAANSWLDAFAHWRQAQGLPATAIAWGAWSD
IGQLGWWSASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPV
IGAPWLVAFAERSRFFEVFSSSNGSGTSKFRVELNELPRDEWPARLRQLV
AEQVSLILRRTVDPDRPLPEYGLDSLGALELRTRIETETGIRLAPKNVSA
TVRGLADHLYEQLAPDDAPAAALSSQ
>Mb1213 pks3, PROBABLE POLYKETIDE BETA-KETOACYL SYNTHASE PKS3
MRTATATSVAVIGMACRLPGGIDSPQRLWEALLRGDDLVGEIPADRWDAN
VYYDPEPGVPGRSVSRWGAFLDDVGGFDCDFFGLTEREATAIDPQHRLLL
EVSWEAIEHAGVDPATLAESQTGVFVGLTHGDYELLSADCGAAEGPYGFT
GTSNSFASGRVAYTLGLHGPAVTVDTACSSGLTAVHQACRSLDDGESDLA
LAGGVVVTLEPRKSVSGSLQGMLSPTGRCHAFDEAADGFVSGEGCVVLLL
KRLPDAVRDGDRVLAIVRGTAANQDGRTVNIAAPSAQAQIAVYQQALAAA
GVEASTVGMVEAHGTGTPVGDPVEYASLAAVYGTEGPCALTSVKTNFGHL
QSASGPLGLMKTILALRHGVVPQNLHFCRLPDQLAEIDTELFVPQANTSW
PDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEPAASGPELTPEAGGLAL
FPVSATSAEQLHVTAARLADWVDQNGNAGSRVSMRDLGYTLSCRRAHRPV
RTVVTASSFDELSAALRDVAGDQIPYQPAVGHDDRGPVWVFSGQGSQWPG
MGTELLVAEPVFAATVAAMEPVIARESGFSVTEAMSAPQTVSGIDRVQPT
IFAVQVALAAALKSYGVRPGAIIGHSLGEAAAAVVAGALSLHDGLRVICR
RSRLMSRIAGSGAMASVELPGQQVLSELAIRGISDVVLSVVASPTSTVVG
GATQSIRDLVAAWEQQDVLAREVAVDVASHTPQVDPILDELLEVLAEVDP
TAPEIPYYSATLWDPRERPSFTGEYWVENLRYTVRFAAAVQAALKDGYRV
FGELAPHPLLTYAVEQNAASLDMPIATLAAMRRGEQLPFGLRGFVADVHN
AGAKVDFSVQYPDGRLVDAPLPSWTHRTLMLSREDSHRSHTGAVQAVHPL
LGAHVHLLEEPERHVWQAGVGTGAHPWLGDHRIHNVAAFPGAAYCEMALA
AARTTLGELSEVRDIKFEQTLLLDEQTVVSSAATIAAPGILQFAVESHQE
GEPARRASAMLHALEEMPQPPGYDTNALTAAHESSMSGEELRKMFNSLGI
QYGPAFSGLVAVHTARGAVTTVLAEVALPGAIRSQQSAYASHPALLDACF
QSVLVHPEVQKATVGGLMLPVGVRRLRNYHSTRSAHYCLARVTSSSRAGE
CEADLDVFDQAGTVLLTVEGLRLAAGISEHERANRVFDERLLTIEWERGE
LPEVPQIDAGSWLLLSASEADPLTAQLADALNAVGAQSTSVASASDVAQL
RSLLGGRLTGVVVVTGPPTGGLTQCGRDYVSQLVGIARELAELPGEPPRL
FVVTRSAASVLPSDLANLEQAGLRGLMRVIDSEHPHLGATAIDVDNDETV
AALVASQLQSGSQEDETAWRNGIWYTARLRPGPLRPAERRTAVVEYRRDG
MRLQIRTPGDLESLEFVTFDRVAPGPGEIEVAVTASSVNFADVLVAFGRY
PTFEGYRQQLGIDFAGVVTAVGPDVTEHRIGDHVGGMSANGCWSTFVRCD
ARLAVTLPPELPVAAAAAVPTASATAWYALHDLARICSDDKVLIHSGTGG
VGQAAIAIARAAGCEIFATAGSAQRRQLLHDMGVEHVYDSRSTEFAEQIR
GDTDGYGVDVVLNSLPGAAQRAGIELLAFGGRFVEIGKRDIYGDTRLGLF
PFRRNLSLYAVDLALLTHSHPHTVRRLLKTVYQHTVEGTLPVPQTTHYPI
HDAAVAIRLVGGAGHTGKVVLDVPRTGEGVAVVPPEQVRTSRPDGAYLVT
GGLGGLGLFLAGELAAAGCGRIVLNSRSTPSPHATRVIERLRAAGADIQV
ECGDIADAATAHRVVAVATASGLPVRGVLHAAAVVEDATLANVTDELIDR
CWAPKVHGAWNIHRATAAQPLEWFCLFSSAAALVGSPGQGAYAAANSWLD
AFAHWRRAQGLPATSIAWGAWAEIGRATALAEGTGAAIAPAEGARAFQTL
LRYGRAYSGYAPIMGTPWLTAFAQRSRFAEAFHATGQNQPATGKFLAELG
SLPREEWPRTVRRLVSDQISLLLRRTIDPDRPLSDYGLDSLGNLELRTRI
ETETGIRVSPTKITTVRGLAEHVCDELAAAQSAPV
>Mb1554c pks5, Probable polyketide synthase pks5
MGKERTKTVDRTRVTPVAVIGMGCRLPGGIDSPDRLWEALLRGDDLVTEI
PADRWDIDEYYDPEPGVPGRTDCKWGAYLDNVGDFDPEFFGIGEKEAIAI
DPQHRLLLETSWEAMEHGGLTPNQMASRTGVFVGLVHTDYILVHADNQTF
EGPYGNTGTNACFASGRVAYAMGLQGPAITVDTACSSGLTAIHLACRSLH
DGESDIALAGGVYVMLEPRRFASGSALGMLSATGRCHAFDVSADGFVSGE
GCVMLALKRLPDALADGDRILAVIRGTAANQDGHTVNIATPSRSAQVAAY
REALDVAGVDPATVGMVEAHGPGTPVGDPIEYASLAEVYGNDGPCALASV
KTNFGHTQSAAGALGLMKAVLALQHGVVPQNLHFTALPDKLAAIETNLFV
PQEITPWPGADQETPRRAAVSSYGMTGTNVHAIVEQAPVPAPESGAPGDT
PATPGIDGALLFALSASSQDALRQTAARLADWVDAQGPELAPADLAYTLA
RRRGHRPVRTAVLAATTAELTEALREVATGETPYPPAVGQDDRGPVWVFS
GQGSQWAGMGADLLATEPVFAATIAAIEPLIAAESGFSVTEAMTAPEVVT
GIDRVQPTLFAMQVALAATMKSYGVAPGAVIGHSLGESAAAVVAGALCLE
DGVRVICRRSALMTRIAGAGAMASVELPAQQVLSELMARGVNDAVVAVVA
SPQSTVIGGATQTVRDLVAAWEQRDVLAREVAVDVASHSPQVDPILDELA
EALAEISPLQPEIPYYSATSFDPREEPYCDAYYWVDNLRHTVRFAAAVQA
ALEDGYRVFTELTPHPLLTHAVDQTARSLDMSAAALAGMRREQPLPHGLR
ALAGDLYAAGAAVDFAVLYPTGRLINAPLPTWNHRRLLLDDTTRRIAHAN
TVAVHPLLGSHVRLPEEPERHVWQGEVGTVTQPWLADHQIHGAAALPGAA
YCEMALAAARAVLGEASEVRDIRFEQMLLLDDETPIGVTATVEAPGVVPL
TVETSHDGRYTRQLAAVLHVVREADDAPDQPPQKNIAELLASHPHKVDGA
EVRQWLDKRGHRLGPAFAGLVDAYIAEGAGDTVLAEVNLPGPLRSQVKAY
GVHPVLLDACFQSVAAHPAVQGMADGGLLLPLGVRRLRSYGSARHARYCC
TTVTACGVGVEADLDVLDEHGAVVLAVRGLQLGTGASQASERARVLGERL
LSIEWHERELPENSHAEPGAWLLISTCDATDLVAAQLTDALKVHDAQCTT
MSWPQRADHAAQAARLRDQLGTGGFTGVFVLTAPQTGDPDAESPVRGGEL
VKHVVRIAREIPEITAQEPRLYVLTHNAQAVLSGDRPNLEQGGMRGLLRV
IGAEHPHLKASYVDVDEQTGAESVARQLLAASGEDETAWRNDQWYTARLC
PAPLRPEERQTTVVDHAEAGMRLQIRTPGDLQTLEFAALDRVPPGPGEIE
VAVTASSINFADVLVTFGRYQTLDGRQPQLGTDFAGVVSAVGPGVSELKV
GDRVGGMSPNGCWATFVTCDARLATRLPEGLTDAQAAAVTTASATAWYGL
QDLARIKAGDKVLIHSATGGVGQAAIAIARAAGAQIYATAGNEKRRDLLR
DMGIEHVYDSRSVEFAEQIRRDTAGYGVDIVLNSVTGAAQLAGLKLLALG
GRFIEIGKRDIYSNTRLELLPFRRNLAFYGLDLGLMSVSHPAAVRELLST
VYRLTVEGVLPMPQSTHYPLAEAATAIRVMGAAEHTGKLILDVPHAGRSS
VVLPPEQARVFRSDGSYIITGGLGGLGLFLAEKMANAGAGRIVLSSRSQP
SQKALETIELVRAIGSDVVVECGDIAQPDTADRLVTAATATGLPLRGVLH
AAAVVEDATLANITDELIERDWAPKAYGAWQLHRATADQPLDWFCSFSSA
AALVGSPGQGAYAAANSWLDTFTHWRRAQDLPATSIAWGAWGQIGRAIAF
AEQTGDAIAPEEGAYAFETLLRHNRAYSGYAPVIGSPWLTAFAQHSPFAE
KFQSLGQNRSGTSKFLAELVDLPREEWPDRLRRLLSKQVGLILRRTIDTD
RLLSEYGLDSLSSQELRARVEAETGIRISATEINTTVRGLADLMCDKLAA
DRDAPAPA
>Mb0412 pks6a, PROBABLE MEMBRANE BOUND POLYKETIDE SYNTHASE PKS6A [FIRST PART]
MTDGSVTADKLQKWFREYLSTHIECHPNEVSLDVPIRDLGLKSIDVLAIP
GDLGDRFGFCIPDLAVWDNPSANDLIDSLLNQRSADSLRESHGHADRNTQ
GRGSINEPVAVIGVGCRFPGDIDGPERLWDFLTEKKCAITAYPDRGFTNA
GTFAESGGFLKDVAGFDNRFFDIPPDEALRMDPQQRLLLEVSWEALEHAG
IIPESLRLSRTGVFVGVSSTDYVRLVSASAQQKSTIWDNTGGSSSIIANR
ISYFLDIQGPSIVIDTACSSSLVAVHLACRSLSTWDCDIALVGGTNVLIS
PEPWGGFREAGILSQTGCCHAFDKSADGMVRGEGCGVIVLQRLSDARLEG
RRILAILTGSAVNQDGKSNGIMAPNPSAQIGVLENACKSARVDPLEIGYV
EAHGTGTSLGDRIEAHALGMVFGRKRPGSGPLMIGSIKPNIGHLEGAAGI
AGLIKAGVDG
>Mb0413 pks6b, PROBABLE MEMBRANE BOUND POLYKETIDE SYNTHASE PKS6B [SECOND PART]
MLMVERGSLLPSGGFTEPNPAIPFTELGLRVVDELQEWPVVAGRPRRAGV
SSFGFGGTNAHVIVEEAGSVGADTVSGRADVGGSGGGVVAWVISGKTASA
LAAQAGRLGRYVRARPALDVVDVGYSLVSTRSVFDHRAVVVGQTRDELLA
GLAGVVAGRPEAGVVCGVGKPAGKTAFVFAGQGSQWLGMGSELYAAYPVF
AEALDAVVDELDRHLRYPLRDVIWGHDQDLLNTTEFAQPALFAVEVALYR
LLMSWGVRPGLVLGHSVGELAAAHVAGALCLPDAAMLVAARGRLMQALPA
GGAMFAVQAREDEVAPMLGHDVSIAAVNGPASVVISGAHDAVSAIADRLR
GQGRRVHRLAVSHAFHSALMEPMIAEFTAVAAELSVGLPTIPVISNVTGQ
LVADDFASADYWARHIRAVVRFGDSVRSAHCAGASRFIEVGPGGGLTSLI
EASLADAQIVSVPTLRKDRPEPVSVMTAAAQGFVSGMGLDWASVFSGYRP
KRVELPTYAFQHQKFWLAPAPSVSDPTAAGQIGASDGGAELLASSGFAAR
LAGRSADEQLAAAIEVVCEHAAAVLGRDGAAGLDAGQAFADSGFNSLSAV
ELRNRLTAVTAVTLPATAIFDHPTPTELAQYLITQIDGHGSSAAAAANPA
ERIDALTDLFLQACDAGRDADGWKMVALASNTRERMSSPVRNNVSKNVAL
LADGISDVVVICIPTLTVLSDQREYRDIANAMTGRHSVYSLTLPGFDSSD
ALPQNADMIVETVSNAIIDVVGGSCRFVLSGYSSGGVLAYALCSHLSVKH
QRNPLGVALIDTYLPSQIANPSMNEGFSPNDTGKGLSREVIRVARMLNRL
TATRLTAAATYAAIFQAWEPGRSMAPVLNIVAKDRIATVENLREERINRW
RTAAAEAAYSVAEVPGDHFGVMSTSSEAIATEIHDWISGLVRGPHP
>Mb1689 pks7, Probable polyketide synthase pks7
MNSTPEDLVKALRRSLKQNERLKRENRDLLARTTEPVAVVGMGCRYPGGV
DSPETLWELVAHGRDAVSEFPADRGWDVAGLFDPDPDAVGKSYTRCGGFL
TDVAGFDAEFFGIAPSEALAMDPQQRLLLEVSWEALERAGIDPITLRGSQ
TGVFAGVFHGSYGGQGRVPGDLERYGLRGSTLSVASGRVAYVLGLQGPAV
SVDTACSSSLVALHLAVQSLRLGECDLALVGGVTVMATPAMFIEFSRQRA
LSADGRCKAYAGAADGTAFAEGAGVLVLARLADARRLGHPVLALVRGSAV
NQDGASNGLATPNGPAQQRVITAALASARLGVADVDVVEGHGTGTTLGDP
IEAQAILATYGQRPADRPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHG
VLPKTLHVDVPTPHVDWSAGAVSLLTEPRPWHVPGRPRRAGVSSFGISGT
NAHVILEEAPAVEPVGAAHGNDPVAVPWVLSARSAQALTNQARRLLAWVG
ADENVRPLDVGWSLVNTRSLFDHRAVVVGADRTQLMEGLTGLAAGVPGAD
VVAGRAQTVGKTAFVFPGQGAQWLGMGAQLCATAPVFAEHIHRCERALRE
HVEWSLLDVLRGAPGAPGLDRVDVVQPALWAVMVSLAELWRSVGVVPDAV
IGHSQGEIAAAYVAGALSLWDAAAVVALRSRLLVRLGGAGGMVSLACGQP
QAEKLASQWGDRLNIAAVNGVSSVVLAGETDAVTELMQRCEAEGIRARRI
DVDYASHSAQVDAIREELIAALRGIEPRTSTVAFFSTVTGELMDTAGVNA
EYWYRSIRQPVQFERAVRNAFDGGYRVFVESSPHPVLIAGIEETLVDCDR
GATGEPIVIPTLGRDDGGVGRFWLSAGQAHVAGVGVDWRAAFADLGGRRV
ELPTYAFARQRFWLDGLGAVGGDLGGVGLVGAEHGLLAAVVQRPDSGGVV
LTGRISVVAAPWLADHAVGPVVLFPGTGFVELALRAGDEVGCSVLQELTL
QAPLVLPADGVRVQVVVGGVEQSGTRNVWVYSAAGQADSSPGWTLHAQGV
LGVGSVQPAAELSVWPPVGARAMDVADGYQVLAARGYGYGPAFRGLQALW
RRGAEVFADVTLPEGVPIRGFGIHPAVLDAALHAWGIVEGEQQTMLPFSW
QGVCLHASGAARVRVRLAPVGRGAVPVELADPQGLPVLSVRQLMVRPVSA
AALSRSTAGDRGLLEMIWTPVPLEGGDIGDDAVVWELPPHAGAQAGGDVL
AAVYRGVHEVLEVLQSWLASDATGLGVVVTRGAVGPVDDDVTDLAGAAVW
GLVRSAQAEHPGRVVLVDTDGSVAVEDAVGFGARSGEPQLVVRRGRVYAA
RLAPVAAGLTLPSASAGGWRLVAGGGGTLADVVVAPVAPVELATGQVRVA
VGAVGVNFRDVLVALGMYPGGGELGVDGAGVVVEVGPGVTGLAVGDRVMG
LLGLVGSEAVVDARLVTMVPAGWSLVEAAAVPVAFLTAFYGLSVLAEVAA
GQKVLVHAGTGGVGMAAVSLARYWGAEVFVTASRAKWDTLRAMGFDDIHI
SDSRSLEFEEAFLRATEGSGVDVVLNSLAGEFTDASLRLLPSGGRFIELG
KTDIRDGQTVAERHRGVRYRAFDLVEAGPDRIAAMLSEVVGLLAAGVLAR
LPVKTFDARCAPAAYRFVSQARHIGKVVLTIPDGPGGQSGLAGGTVVVTG
GTGMAGSAVATHLVRRHGVANLVLVSRSGEQADRAAEVAALLREGGAQVA
VVSCDVADRDALAALLAGLDPRYPLKGVFHAAGVLDDAVITGLTPDRVDT
VLRAKVDGAWNLHELTEDMDLSAFVVFSSMAGIVGTPAQGNYAAANAFLD
GLVAYRRSRGLAGLSVAWGLWEQASAMTRHLGERDRARMTQAGLAPLTTE
QALGFLDTALQADRAVVVAARLDRAALAGAGAALPALFSQLAAGPTRRRI
DAADTAVSMSGLVSRLHALTPERRQRELTDLVISNAAAVLGRSSSVDINA
HKAFQDLGFDSLTAVELRNRLKTATGLTLSPTLIFDYPTPATLAEHLDSR
LVTASGSDQQSLSDRVDDITRELVVLLDQPDLSANVKAHLRTRLQTMLTS
LTTEDDDIAAATESQLFAILDEELGS
>Mb1690 pks8, Probable polyketide synthase pks8
MSGTTTHVDYLKRLTADLRRTRRRLSDLEAKLSEPVAVVGMGCRYPGGVD
SPETLWELVAQGRDAVSDFPADRGWDVYGLFDPDPDACGKMYTRRGTFLE
HAGDFDAGFFGIGPSEALAMDPQQRLLLEVSWEALERTGIDPTKLRGSAT
GVFAGVIHAGYGGQLSGELEGYGLTGSTLSVASGRVAYVLGLEGPAVSVD
TACSSSLVALHLAVQSLRSGECDLALAGGVTVMATPAAFVEFSRQRALAR
DGRCKVYAGAADGTAWSEGAGVLVVERLVDARRLGHPVLALVRGSAVNQD
GASNGLTAPNGPSQQRVIRAALASARLRAVEVDVVEGHGTGTMLGDPIEA
QALLATYGQDRVEPLWLGSIKSNIGHTSAAAGVAGVIKMVQAMRHGVMPK
TLHVDVPTPHVDWSVGAVSLLTQPRAWSVHGRPRRAGVSSFGISGTNAHV
ILEQAPVVESVVPEVASPTAASAVPWVLSARSEQALAGQAQRLLAFVAAN
PDLDPIDVGWSLVKTRAMFEHRAVVVGADRGALLAGLAALAAGESGAGVA
VGRARSVGKTVFVFPGQGAQWVGMGAQLYAELPLFALAFDAVAEELDRHL
RLPLRNVLWEGDEALLTSTEFAQPALFAIEVALATLLQHWGISPDFLIGH
SVGEIAAAHLAGVLSLTDAAGLVAARGRLMAELPAGGVMVVVAASEEEVL
PVLVDGANLAAVNAPHSVVVSGCEAAVSDIADHFARRGRRVHRLAVSHAF
HSLLMEPMLAEFTRIAAGISVSKPRIPLVSNVTGQMAGAGYGDGQYWVEH
ARRPVRFVEGVQLLNAVGATRFVEVGPGGGLTALVEQSLPLGEALSVAMM
RREHPEVSSVLGAVATLFTAGAQMDWPAVFGSPGRRIELPTYAFQRQRYW
LPPTSAGSADISGVGLLAARHGLLGAVVEQPDSDVVVLTGRLSVGEQRWL
ADHVIAGVVLLAGAAFVELALRAADQVDCGVVEELTVVTPLVLPTVGGVQ
LQVVVGVGEMGQRPVSIYSRNAESDSGWVLHARGVLGAKAVAPAADLSVW
PPLGAAPVDVDGAYQRFAELGYEYGRAFQGLTAMWRRESELFADVAVPDD
VDVTLSGFGIHPLVLDAALHAMGMVGEQAATMLPFSWQGVSLHAAGASRV
RARIAPAGDGTVSVELADQAGLPVLSVQALVMRSVSSQLLSAAVAAADAA
GRGLLEVAWLPVELAHNDISADLVVWELESFQDGVGPVYSATHRVLVALQ
SWLAQERAGGLVVLTQGSVGQDATNLAGAAVWGLVRSAQAEHPGRVMLVD
SDGSMDVGDVIGCGEEQLMIRNGTAYAARLAQLRPQPILQLPDTNSGWRL
VAGGAGTLEDLTLASCPAKELAPGQVRIEVRALGVNFRDVLVALGIYPGA
AELGAEGAGVVTEVGPGVTGLAVGDPVMGLLGVAGSEAVVDARLVVKLPN
RWPLTDAAGVPVVFLTAYCALRVLAQVQPGESVLVHAAAGGVGMAAVQLA
RLWGLEVFATASRGKWDTLHTMGCDNTHVADSRTLAFEETFWLTTEGRGV
DVVLNSLAGEFTDASLRLLPRGGRFIEMGKTEFGTPRSLPRTILGWPTGL
ST
>Mb1692 pks9, Probable polyketide synthase pks9
MQPTGIAIIGLACRFPTVVSPGDLWDLLRDGREATGSIDNVADFDADFFN
LSPREASAMDPRQRLALELTWELLEDAFVVPETLRGQPIAVYLGAMNDDY
AVLTLAADRVDHHAFAGTSRAIIANRVSFAFGLRGPSVTIDSGQSSSLVA
VHLACESVRTGEAPLAIAGGVHLNLARETAMLEQEFGAVSPSGHTYAFDE
RADGYVPGDGGGLVLLKPVQAALDDGDRIHAIIRGSAVGNAGHSATGLTV
PSVAGQVDVIRRAMSGAGVDCHQVHYVEAHGTGTKIGDPIEARALGEIFA
ARQRRPVSVGSVKTNIGHTGGAAGIAGLLKAVLAIENAVIPPSLNYVGAP
IDLDSLGLRVDTALTPWPVADEPRRAGVSSFGMGGTNAHVILEQGPTQSP
EIVESVAAAGSNAPVAVPWVLAARSPQALTNQAGRLLAHLTADDGLTALD
VGWSLVSTRSVFDHRAVVVGADRGRLMAGLAGLAAGEPGAGVVVGRARSV
GKTVFVFPGQGSQWLGMGRQLYGRYSVFARAFDEVVAVLDGQLRLSVRQV
MWGADAGLLESTEFAQPALFVVQVALAALLQDWGVLPDLVMGHSVGEIAA
AYVAGALSLVDAARVVAARGRLMQALPAGGVMVAVAASEDEVAPLLTEGV
CIAAVNAPESVVISGEQAAVGVVVDRLVGLGRRVRRLAVSHAFHSVLMDP
MVEEFSKVLADVCVRAPRIGLVSNVTGQLAGAGYGSPAYWVEHVRKPVRF
FDGVGLAESLGARVFVEVGPGAGLEASVALLARDRPEVESVLAGVGRLFA
EGVAVDWSSVFAGLGGRRVELPTYGFARQRFWLGDNGELSVDQTGKDAGA
IARLQSLAPPELQRQLVELVCFHAAIVLGRKSSHDIDPECAFQDLGFDSM
SGVELRNRLQMAIGLPGLSLPRTLIFDYPTASALAECLGQLLGGQHESSD
DESIWQLLKNIPIHQLRRTGLLDKLLLLAGQPEESLAGRTVSDEVIDSLS
PEALIGLALDEDENDIR
>Mb2069c pncA, PYRAZINAMIDASE/NICOTINAMIDAS PNCA (PZase)
MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHVVATKDF
HIDPGDDFSGTPDYSSSWPPHCVSGTPGADFHPSLDTSAIEAVFYKGAYT
GAYSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNG
LATRVLVDLTAGVSADTTVAALEEMRTASVELVCSS
>Mb2956 ppsA, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSA
MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSRDAVVLS
GELSELLGRTVSPIDFWEHPTINALAAYLAAPEPSPDSDAAVKRGARNSL
DEPIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPP
EVAAALARTTRWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWE
ALEHAGIPPGTLRRSATGVFAGACLSEYGAMASADLSQVDGWSNSGGAMS
IIANRLSYFLDLRGPSVAVDTACSSSLVAIHLACQSLRTQDCHLAIAAGV
NLLLSPAVFRGFDQVGALSPTGQCRAFDATADGFVRGEGAGVVVLKRLTD
AQRDGDRVLAVICGSAVTQDGRSNGLMAPNPAAQMAVLRAAYTNAGMQPS
EVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTE
AAAGIAGFIKTVLAVQHGQIPPNQHFETANPHIPFTDLRMKVVDTQTEWP
ATGHPRRAGVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGK
TMQRVSATAGMLADWMEGPGADVALADVAHTLNHHRSRQPKFGTVVARDR
TQAIAGLRALAAGQHAPGVVNPAEGSPGPGTVFVYSGRGSQWAGMGRQLL
ADEPAFAAAVAELEPVFVEQAGFSLHDVLANGEELVGIEQIQLGLIGMQL
ALTELWCSYGVQPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRSRLMA
PLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQIDE
LITRVRARDRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGII
STTYADLHTQPVFDAEHWATNMRNPVHFQQAIASAGSGADGAYHTFIEIS
AHPLLTQAIIDTLHSAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHP
PHTPHPPEPHPPIPTTPWQHTRHWITTKYPAGSVGSAPRAGTLLGQHTTV
ATVSASPPSHLWQARLAPDAKPYQGGHRFHQVEVVPASVVLHTILSAATE
LGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSPAAGTPSDRWTRH
VTAQLSSSPSDSASSLNEHHRANGQPPERAHRDLIPDLAELLAMRGIDGL
PFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADS
RLYVPASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPS
LSMRSLRYRALDFGLDVGRAQPPASTGPVEAYCDATNFVHTIDWQPQTVP
DATHPGAEQVTHPGPVAIIGDDGAALCETLEGAGYQPAVMSDGVSQARYV
VYVADSDPAGADETDVDFAVRICTEITGLVRTLAERDADKPAALWILTRG
VHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDLAINDDLGEFGPALA
ELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYLITGGLG
ALGLLMADWLADRGAHRLVLTGRTPLPPRRDWQLDTLDTELRRRIDAIRA
LEMRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQL
VTSMTGDAVRQVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQ
GSYAAANSYLDALARARRQQGCHTMSLDWVAWRGLGLAADAQLVSEELAR
MGSRDITPSEAFTAWEFVDGYDVAQAVVVPMPAPAGADGSGANAYLLPAR
NWSVMAATEVRSELEQGLRRIIAAELRVPEKELDTDRPFAELGLNSLMAM
AIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHDVSQDNQISAL
SSSAGSVLDSLFDRIESAPPEAERSV
>Mb2957 ppsB, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSB
MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCRFPGDVD
GPESFWDFLVAGRNAISTVPADRWDAEAFYHPDPLTPGRMTTKWGGFVPD
VAGFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTA
VMMGVYFNEYQSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVA
VDTACSSSLVAVHLACQSLRLRETDLALAGGVSITLRPETQIAISAWGLL
SPQGRCAAFDAAADGFVRGEGAGVVVLKRLTDAVRDGDQVLAVVRGSAVN
QDGRSNGVTAPNTAAQCDVIADALRSGDVAPDSVNYVEAHGTGTVLGDPI
EFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATLAVQRATIP
PNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAH
VIIEQGSELAPVSEGGEDTGVSTLVVTGKTAQRMAATAQVLADWMEGPGA
EVAVADVAHTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVS
HQDGSPGPGTVFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQA
GFSLRDVIATGKELVGIEQIQLGLIGMQLTLTELWRSYGVQPDLVIGHSM
GEVAAAVVAGALTPAEGLRVTATRARLMAPLSGQGGMALLGLDAAATEAL
IADYPQVTVGIYNSPRQTVIAGPTEQIDELIARVRAQNRFASRVNIEVAP
HNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWATN
MRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTK
SAAKYLSIGTLQRDADDTVTFRTNLYTADIAHPPHTCHPPEPHPTIPTTP
WQHTHHWIATTHPSTAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPI
RPAVSADPPSTAAWLVVADNELCHELARAADSRVDSLSPPALAAGSDPAA
LLDALRGVDNVLYAPPVPGELLDIESAYQVFHATRRLAAAMVASSATAIS
PPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEHPEIWGGIIDLDD
SMPAELAVRHVLTAAHGTDGEDQVVYRSGARHVPRLQRRTLPGKPVTLNA
DASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATG
TDLIAVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTD
DDVTTMFRPKLDALALLHRLSLKSPVRHFVLFSSVSGLLGSRWLAHYTAT
SAFLDSFAGARRTMGLPATVVDWGLWKSLADVQKDATQISAESGLQPMAD
EVAIGALPLVMNPDAAVATVVVAADWPLLAAAYRTRGALRIVDDLLPAPE
DVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATVMGMPPTEPLDPSAG
FFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYLATVLPE
LLEIGATAVATQQATDSYHELTEAELLEQLSERLRGTQ
>Mb2958 ppsC, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSC
MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGCRFPGGV
NNPEQFWDLLCAGRSGIVRVPAQRWDADAYYCDDHTVPGTICSTEGGFLT
SWQPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGT
QTSVFVGVTAYDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGAR
GPAVVIDTACSSSLVAVHLACQSLRGRESDMALVGGTNLLLSPGPSIACS
RWGMLSPEGRCKTFDASADGYVRGEGAAVVVLKRLDDAVRDGNRILAVVR
GSAVNQDGASSGVTVPNGPAQQALLAKALTSSKLTAADIDYVEAHGTGTP
LGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVAGLMKAVLA
VHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFG
VSGTNAHVVIEQAPDPMAAAGTEPQRGPVPAVSTLVVFGKTAPRVAATAS
VLADWLDGPGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRAL
AAGQSAPGVVAPREGSIGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAI
AELEPEFVAQGGFSLRDVIAGGKELVGIEQIQLGLIGMQLALTALWRSYG
VTPDAVIGHSMGEVAAAVVAGALTPAQGLRVTAVRSRLMAPLSGQGTMAL
LELDAEATEALIADYPEVSLGIYASPRQTVISGPPLLIDELIDKVRQQNG
FATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLGIS
LGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSIS
DTLRASYDVDNYLSIGTLQRDAHDTLEFHTNLNTTHTTHPPQTPHPPEPH
PVLPTTPWQHTQHWITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELD
PDLLWLADHVIDDLVVLPGAAYAEIALAAATDTFAVEQDQPWMISELDLR
QMLHVTPGTVLVTTLTGDEQRCQVEIRTRSGSSGWTTHATATVARAEPLA
PLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGPAFQGIVGLAVTQ
AGVARAQVRLPASARTGSREFMLHPVMMDIALQTLGATRTATDLAGGQDA
RQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTD
ANGQPLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGAL
LLIGDPAAGDPLLPALQSSLRDRITDLELASAADEATLRAAISRTSWDGI
VVVCPPRANDESMPDEAQLELARTRTLLVASVVETVTRMGARKSPRLWIV
TRGAAQFDAGESVTLAQTGLRGIARVLTFEHSELNTTLVDIEPDGTGSLA
ALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSGDLAAEARHQVVNLD
SSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAAGLNFSD
VLKAMGVYPGLDGAAPVIGGECVGYVTAIGDEVDGVEVGQRVIAFGPGTF
GTHLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVL
IHSATGGVGMAAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSV
DFADEILELTDGYGVDVVLNSLAGEAIQRGVQILAPGGRFIELGKKDVYA
DASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVL
PVTAFSLHDAADAFRLMASGKHTGKIVISIPQHGSIEAIAAPPPLPLVSR
DGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVAAAIAELNA
SGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNM
TDSAARRVFAPKVTGSWRLHVATAARDVDWWLTFSSAAALLGTPGQGAYA
AANSWVDGLVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQG
LAAMQAVLTADRGRTGVFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSG
QRRGGGAIRAQLDALDAAERPGHLASAIADEIRAVLRSGDPIDHHRPLET
LGLDSLMGLELRNRLEASLGITLPVALVWAYPTISDLATALCERMDYATP
AAAQEISDTEPELSDEEMDLLADLVDASELEAATRGES
>Mb2959 ppsD, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSD
MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIGCRFPGN
VTGPESFWQLLADGVDTIEQVPPDRWDADAFYDPDPSASGRMTTKWGGFV
SDVDAFDADFFGITPREAVAMDPQHRILLEVAWEALEHAGIPPDSLSGTR
TGVMMGLSSWDYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPA
VAVDTACSSSLVAIHLACQSLRLRETDVALAGGVQLTLSPFTAIALSKWS
ALSPTGRCNSFDANADGFVRGEGCGVVVLKRLADAVRDQDRVLAVVRGSA
TNSDGRSNGMTAPNALAQRDVITSALKLADVTPDSVNYVETHGTGTVLGD
PIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAGFIKAVLAV
QRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGL
SGTNAHVVVEQAPDTAVAAAGGMPYVSALNVSGKTAARVASAAAVLADWM
SGPGAAAPLADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPR
VGVVDCDQHAGGPGRVFVYSGQGSQWASMGQQLLANEPAFAKAVAELDPI
FVDQVGFSLQQTLIDGDEVVGIDRIQPVLVGMQLALTELWRSYGVIPDAV
IGHSMGEVSAAVVAGALTPEQGLRVITTRSRLMARLSGQGAMALLELDAD
AAEALIAGYPQVTLAVHASPRQTVIAGPPEQVDTVIAAVATQNRLARRVE
VDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADADYW
SANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVM
STMNRELDQTLYFHAQLAAVGVAASEHTTGRLVDLPPTPWHHQRFWVTDR
SAMSELAATHPLLGAHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIM
PAAGFAEIALAAASEALGTAADAVAPNIVINQFEVEQMLPLDGHTPLTTQ
LIRGGDSQIRVEIYSRTRGGEFCRHATAKVEQSPRECAHAHPEAQGPATG
TTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAETEISIPDEAPRH
PGYRLHPVVLDAALQSVGAAIPDGEIAGSAEASYLPVSFETIRVYRDIGR
HVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLP
LEQKIFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKS
MADDFVQQWRSPMRRVHTADIHDESAVLAAFAETAGDPEHPPVGVVVFVG
GASSRLDDELAAARDTVWSITTVVRAVVGTWHGRSPRLWLVTGGGLSVAD
DEPGTPAAASLKGLVRVLAFEHPDMRTTLVDLDITQDPLTALSAELRNAG
SGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVRQGASYVVTGGLGGL
GLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRGDVAS
PGVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKAT
GALRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRR
ASGLPAAVINWGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIR
TGVARLRADRALVAFPEIRSISYFTQVVEELDSAGDLGDWGGPDALADLD
PGEARRAVTERMCARIAAVMGYTDQSTVEPAVPLDKPLTELGLDSLMAVR
IRNGARADFGVEPPVALILQGASLHDLTADLMRQLGLNDPDPALNNADTI
RDRARQRAAARHGAAMRRRPKPAVQGG
>Mb2960 ppsE, PHENOLPTHIOCEROL SYNTHESIS TYPE-I POLYKETIDE SYNTHASE PPSE
MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQELRDAGV
SDKTLADPAYVRRAPLLDGIDEFDAGFFGFPPLAAQVLDPQHRLFLQCAW
HALEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFD
QFSLFLQNDKDFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLS
GECDMALAGGSSLCIPHRVGYFTSPGSMVSAVGHCRPFDVRADGTVFGSG
VGLVVLKPLAAAIDAGDRIHAVIRGSAINNDGSAKMGYAAPNPAAQADVI
AEAHAVSGIDSSTVSYVECHGTGTPLGDPIEIQGLRAAFEVSQTSRSAPC
VLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSPNPELRLDQ
SPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHA
EPAGPQVILLSAQTAAALGESRTALAAALETQDGPRLSDVAYTLARRRKH
NVTMAAVVHDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRV
VFLFPGQGAQHVGMAKGLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFD
GTATDLERIDRSQPALFTVEYALAKLVDTFGVRAGAYIGYSTGEYIAATL
AGVFDLQTAIKTVSLRARLMHESPPGAMVAVALGPDDVTQYLPPEVELSA
VNDPGNCVVAGPKDQIRALRQRLTEAGIPVRRVRATHAFHTSAMDPMLGQ
FQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFADE
LDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDD
RDTFLRALGELWSAGVEVDWTPRRPAVPHLVSLPGYPFARQRHWVEPNHT
VWAQAPGANNGSPAGTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVS
SVDRNANFFDLGGDSLMAISIAMAAANEGLTITPQDLYEYPTLASLTAAV
DASFASSGLAKPPEAQANPAVPPNVTYFLDRGLRDTGRCRVPLILRLDPK
IGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAPAEFTGLSNRSVP
DGVAAGSPEERAAVLGILAELLEDQTDPNAPLAAVHIAAAHGGPHYLCLA
IHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAAL
ATHPAALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLS
VEQTSELDDGRRRFRRSIQTILLAALGRTIAQTVGEGVVAVELEGEGRSV
LRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIG
YGLLRYVYAPTGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLP
VREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAEALERTFPLAL
SALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG
>Mb2953 tesA, PROBABLE THIOESTERASE TESA
MLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDYVAFSRE
FSADVKRIAVQYPGQHDRSGLPPLESIPTLADEIFAMMKPSARIDDPVAF
FGHSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSDRE
MLDLFTRMTGMNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSCPI
YAFIGDKDWIATQDDMDPWRDRTTEEFSIRVFPGDHFYLNDNLPELVSDI
EDKTLQWHDRA
>Mb0173 yrbE1A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE1A
MTTSTTLGGYVRDQLQTPLTLVGGFFRMCVLTGKALFRWPFQWREFILQC
WFIMRVGFLPTIMVSIPLTVLLIFTLNILLAQFGAADISGSGAAIGAVTQ
LGPLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV
LASMLVATLLNGLVITVGLVGGFLFGVYLQNVSGGAYLATLTLITGLPEV
VIATIKAATFGLIAGLVGCYRGLTVRGGSKGLGTAVNETVVLCVIALFAV
NVILTTIGVRFGTGR
>Mb0174 yrbE1B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE1B
MSTAAVLRARFPRAVANLRQYGGAAARGLDEAGQLTWFALTSIGQIAHAL
RYYRKETLRLIAQIGMGTGAMAVVGGTVAIVGFVTLSGSSLVAIQGFASL
GNIGVEAFTGFFAALINVRIAGPVVTGVALAATVGAGATAELGAMRISEE
IDALEVMGIKSISFLASTRIMAGLVVIIPLYALAMIMSFLSPQITTTVLY
GQSNGTYEHYFQTFLRPDDVFWSFLEALIITAIVMVSHCYYGYAAGGGPV
GVGEAVGRSMRFSLVSVQVVVLFAALALYGVDPNFNLTV
>Mb0602 yrbE2A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE2A
MTTHAVIITYLRDQTQPAVDAIGGFYRTCVLTGKALVRRPFHWREAIEQG
WFITSVSLLPTLAVSIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQ
LGPLTTVLVIAGAGATAICADLGARTIREEIDAMEVLGIDPIHRLVVPRV
VAATIVAALLNGAVITIGLVGGFVFSVFIQHVSAGAYVGTLTLVTGLPEV
IISVVKSATFGLIAGLVGCYRGLTTKGGPKGVGTAVNETLVLCVIALFAT
NVVLTTIGVRFGTGH
>Mb0603 yrbE2B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE2B
MVESSTASAAAVLRARYPRTAASLDRYGGGTARRLERTGTFARFTRISVV
QIGWALRRYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAI
QGFASLGNIGVEAFTGFFAALANTRVAAPIVSGVALAATVGAGATAQLGA
MRISEEIDALEVMGIKSISFLVSTRILGGLVVIMPLYALALDMAFTSGQV
VTTVFYGQSNGTYEHYFRTFLRPEDVGWSVVEVVIIAVVVMITHCYYGYT
ASGGPVGVGQAVGRSMRFSLVSVVVVVLLAELALYGVDPNFNLTV
>Mb1999 yrbE3A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE3A
MVIVADKAAGRVADPVLRPVGALGDFFAMTLDTSVCMFKPPFAWREYLLQ
CWFVARVSTLPGVLMTIPWAVISGFLFNVLLTDIGAADFSGTGCAIFTVN
QSRGTGSLERGRFIGPQDHRVAAALEVTAPLLRS
>Mb3531c yrbE4A, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE4A
MIQQLAVPARAVGGFFEMSMDTARAAFRRPFQFREFLDQTWMVARVSLVP
TLLVSIPFTVLVAFTLNILLREIGAADLSGAGTAFGTITQLGPVVTVLVV
AGAGATAICADLGARTIREEIDAMRVLGIDPIQRLVVPRVLASTLVALLL
NGLVCAIGLSGGYAFSVFLQGVNPGAFINGLTVLTGLRELILAEIKALLF
GVMAGLVGCYRGLTVKGGPKGVGNAVNETVVYAFICLFVINVVMTAIGVR
ISAQ
>Mb3530c yrbE4B, CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN YRBE4B
MSYDVTIRFRRFFSRLQRPVDNFGEQALFYGETMRYVPNAITRYRKETVR
LVAEMTLGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGDIGIEALT
GFLSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAV
HSVSYLVSTRLIAGLVAIIPLYSLSVLAAFFAARFTTVFVNGQSAGLYDH
YFNTFLIPSDLLWSFMQAIAMSIAVMLVHTYYGYNASGGSVGVGVAVGQA
VRTSLIVVVVITLFISLAVYGASGNFNLSG