TitleGenColors Logo

Gene list

Applied filters:

COG category: Secondary metabolites biosynthesis, transport and catabolism
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 420

Free access
Sort by:

 



# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP0774c hypothetical protein
MLSTFSISQSGGGTACRRTPPVKPSPGGRTPMSNFDSIDFFTDPSLVPDP
HPYFDYLRSQNPVLRLPHYGVVAITGYEEATEVYKDPETFSNIVALGGPF
PPLPVTPEGDDISAQIDAHRSSFPMFEHMVTMDPPEHTKARSVLAKLLTP
SRLKQNEEFMWRLADRQLDEFLHNGECEFIAEYSKPFATLVIADLLGVPE
EDHTQFRTVLGADRPGARVGALDHETVGINPLEWLDDKFCNYIEERRREP
RADVLTFLAEAKYPDGSTPPVIEVVRSATFLFAAGQETTAKLLSAALQVL
GDRPDIQQQLREDRSLIPAFIEESLRMESPVKSDSRLARRATTIGGVDIP
AGTVVMILPGAANRDPRRFENPHEFDLRRKNVREHMAFARGVHSCPGGPL
ARVEGRVSIERILDRMPHIEINETHHGPAGDRRYTYEPTYILRGLSELHL
TFTTADAVAPVG
>MAP3325 hypothetical protein
MSLNGKTMFISGASRGIGLAIAKRAAQDGANIALIAKTAEPHPKLPGTVY
TAAKELEEAGGQALPIVGDVRDPESVEAAVAKTIEQFGGIDICVNNASAI
NLGSITEVPMKRFDLMNGIQVRGTYAVSQACIPHLKGRENPHILTLSPPV
LLGKEWLEPTAYMMAKFGMTLCALGIAEEMREAGIASNTLWPRTLVATAA
VQNLLGGDEAMGRARKPEVYSDAAYVILNKPAREFTGNMLLCEDVLVESG
VTDLSVYNCVPGSQLGVDFWVDGVNPPGYTGP
>MAP2355 hypothetical protein
MMPPSLLHTWFELLSPEQIVMAYGMTENLGLTALRGDEWLEHPGSVGRGF
RDTEIRILDSQQRPLGPGEDGDVYLRAPMSAGYRYLGGAPPLPSTEDGFR
SAGDIGHLDEDGYLYIVDRRVDMIISGGANVFPAEVESALAGHPAIADVV
VIGLADPQWGRRVHAVVQRADGASLTEQQVIDYAKGRLAPYKAPKTVEFV
DAIPRTAATKVNRSAMIAARGG
>MAP3287 hypothetical protein
MPDWTAAQLPSFAGRTVIVTGANAGLGEVTARELARVGGHVILAVRNTDK
GRAAADRMAGVATGRVEVRELDLQDLASVRRFADGIDTVDVLVNNAGIMA
TKHAVTVDGFEGQIGTNHLGHFALTNLLLPKLTDRVVTVSSLMHHFGYIS
LKDLNFRSRPYSAWLAYSQSKLANLLFTSELQRRLDAVPSSLRALAAHPG
WSHTNLQGNSGRKLGDAAVLAVDRIVSTDADFGARQTLYAVSQDLPGDTF
VGPRFGLYGRTQPTWRNWPAKRAGTAAALWELSEELTGTKFPL
>MAP3135c hypothetical protein
MSLLTITKLTDSVGAEVTGLDPAALAHDDSVGEAVLDALEDNGVLVFRGL
YLDPAAQVAFCGRLGEVDHSSDGHHPVPGIYPITLDKSKNASAAYLKATF
DWHIDGCTPLGDECPQKATVLSAVRVAERGGETEFANSYAAYDALTDDEK
RRFGALRVVHSLEASQRRVYPDPSPELMARWRSRRTHEHPLVWTHRSGRK
SLVLGASADYVVGMDVDEGRALLEELLQRATVPERVYRHGWSIGDTVIWD
NRGVLHRAAPYDPDSSREMLRTTVRGDEPIQ
>MAP1731c hypothetical protein
MLRHNAIREAADLLHDAQRTRRPIGPLTERFPGLDVAAAYAIQQANLNRR
TGEGRLVVGHTSWSSTPPPPPRIVPTGPNWSAVGVRDDA
>MAP3493 hypothetical protein
MTGIDFSLLDVPRAAPVDDPEAYLRAVIAWHFGADTGSPFWLRTARTLSF
DPLTDVHTFDDLRRFPNLVNELRHVAVGDLIPRGYPAQDRPLIFESGGTT
GAPKRTVQMPDWVAQVVDWQTEDFAAGGFRRGEGFLCMMPSGPHGVGYFS
RLVSQRLSAVFHAVDIDPRWVKKLAARDAAADVAGYVEHVVEQAVFVLQT
QDVANLHTTGPLLAAMARDDRVVELVDAKIGYLLLSGAHVDADTLDLLRG
IFPRTTITMAFGSTMVLSQAVTRTDGDTFVFDPRTPYVVFWVVDPDTGER
VRYGQPGQVVMHHLSRTMFIPNNLERDLAVRRPGPAGELSDSLSEVRPVQ
SFEGEAVIEGVY
>MAP3742 hypothetical protein
MRLSAERFEVLSRRAAEHGATVNVAIVTAFSRAIARYGDRDHFLLTLTTM
DRHAFTPAVGQLVGDFTGTSVLEVDVRGQRTFAELLHGVGDRLFDDMDHS
TTGGVNVARLLGQRDDDRGEQTPVVFTSTLGATTRIDSGATSLLHPIQGR
GLSQTPQVLLDCQVAEIDGMLEVNWDTRDQAVPAEVLDRAFADFRHALDL
LSTDASAWHRPLLPAQPPETTPVEGPRTHHEPALIHTGFLRNVLVTPDAV
AIRHGDRATTYAELLAAATAVADTLAATGVRPRDYVGIRLPQGPAQIAAL
LGALLARAAYVPLDVGWPTHRVDQIAAQCSLAALCEPDGEVDRLLADPQT
WSPRAAVVPEPHSEVLAPDDTAYVIFTSGSTGVPKGVMMAHGAVVNTLTD
INDRLAIRASDSVLAVSQHTFDLSVYNIFGVLAAGGTIVFADGNETSNPQ
AWCDAITDHRVTVWNSVPAQMQLLLDHVNGTQTLPSLRNVMLSGDWIPVS
QPPEIAALAPNASMLSLGGATEAAISSICHPLAAQVYQRSVPYGTAMRNQ
SVRVLTHRGEPASPWQIGEIHIGGLGLAQGYLGDPERTASAFVVHPVSGE
RLYRTGDYGRLIHDGVIEFLGRRDNQVKIHGHRIELAEVDSALSALAGVH
AAVSTVVGQGPHDRLLAAVVVADAADDDEKRQRRDIAAAVRCAAADAHRN
STADLDGDKLGHFASTARAVAIESMCAALRTVMDVGERIDFAELAGRLAL
PQRLERLLRRWLDALEEAGMVVMHRGPSIELRGGPELSACSAQWQHVRAL
GAAVDYGDELLDYVGQCVENLSGLLAGTVDPLALLFPEGTLDTATAAYRD
NLVSRYTNGLVIAGVAERARWATPTRPLRVLEVGAGVGGTSVGLVAALAH
HHVRYTFTDVSHFFLGQAAKMFAEHDFIDYRLFDVNQPPAAQGLPPGSFD
VVVCANVLHNAVNVDDTLAMFERLLAPGGSLVFIDATATNHPLMISMEFK
EGLHGFTDARAGTNSAFLSYAQWGDALERSPFGEVWSFPPPEHPLSKLGQ
HVFWCDSASGAHALRPGELTERAHAVLPSYMVPQQVICLRSLPLTTNGKI
DRAALTAELESLRAAARRFGRGGDGVSASLDAMQTRIAEVWVSVLGLPST
EALSPNSDFFALGGDSLLLAQTIGRIRREIDEAAGIAWDDLLRAMVSDPT
LAAAAQAVQGCGPTADSGDYSAELDSPLVYLGAGQGFRSGTEIAVCVHDG
SGGLAPYEQLVSELRSAADGAPELYGLRRVAGDAYSRIAPPELFTTLASR
YGAAITALAPSKVHLVGYCMGGLLATEIAKCLEESGIVVAPVTVVSSYRV
PFDIEDDDILDYCFAQIMGVSPSDLGLRVADDAIRAAFAAVRSRHADLIP
PGALRAVAAPELASALDQSAAPGEQRLKLLAQSGVLGDSWTAESLTELRA
IFVHSLRSVVQGAADPFLGDVHFLRQRGDIHFLPTLKEDMTEFWRGYCLG
ELTITEIDGNHFDCLTGENARSVAGLLLGSWVVRT
>MAP1555c hypothetical protein
MPAGSPENHVSAELLGILRDDLNVDVSRVTPDARLVDDVGLDSVAFAVGM
VAIEERLGVTLTEEELLSCETVGDLQAAIAAEPRETRDE
>MAP4205 hypothetical protein
MRFRNVLFPYVYRFGQPVFDRLFYHRYRRQAMSHATGRLLMVGLGPGTDL
MFLPAAVTSVAAVEPEASFRRMAARLAARHGVAVDVVAGSGESIPFPDNS
FDSVHIGLVLCSVRDVAATLAEIRRVLVPGGRLVVLEHVRGDGLTGRLQD
LIARPWSWLAGGCEPNRRTGAAIAAAGFDTSMLRSVPRTPVPFPCKPHLQ
GFAVLSPR
>MAP0680 hypothetical protein
MTEPAALVFEERRFSVPELDALADGWAAALAKDGVTAGRRVAVMTSNRPE
FLAVLLAIWRLAATAVLISPAWKRDEVEHALALTDPGHAVGDHPVLAGLM
PMLHLDEPVPPAQPTAGAPPRAGDAVLVFSSGTTGLPKAVRHTHASLDEA
VRHWREALRLTRRDRIQVATPPSHILGLLNLLTALKTGACVRLHPRFDID
KVLHHIESDRVTVEMAVAPIALAIASHPDLESYDLSSLRYIVWGATPVNA
EVAQTVTRRTGVGWLPAYGTTELPVIACNPIEDARLDTVGRPVPGVEVRV
VSLASGEPAGAGEVGEIQARSASLMAGYLPGEATGEAIRDGWYRTGDAGW
LDTGGWLRITDRLKEMIKVRGFQVAPAEIEAVLQGHPAVADCAVFGVPDG
LNGEAVVAAVAVSAPVDVTELTALVRQKLASYKHLSRVVVVPEIPRLPSG
KVLRRVLKERHGCTSDS
>MAP0761 hypothetical protein
MLKYRGSGLVKAGLIGVVLAVMVILVGLSPDRFIAWATMVRYQALFTEAG
GLATGNPVVVSGMKVGTVSDVKLHRGDALVTFALKGNILLGSETTAHIRT
GTLLGERMLTLESAGTGTMHPMALIPVSRTSSPYSLTEAVSDLTTDTAGT
NTTALNQSLDTLAATLDQIAPQMGPAFDALTRLSRTLNSRNKNLGELFKS
AGDVTGVLSERSQQVNKLILNSDVLLQVLVARRQEIVDLLANTSAVAKQL
TALVHDNESKLAPTLERLNSVTAMLEKNRDNISKALPGLKKFEITVGEAI
SSMYAYSAFVPNFLAPQLFQPFLDYLWGFRTFDTALGPGHPSPVPRSLIP
WPYNGIPVCPGCTLGGRVGGSQ
>MAP1163 hypothetical protein
MARAVVVGAGPNGLAAAIQLARRGVEVQVLEAGETPGGGARSGELTLPGV
IHDLCSATHPFGVGSPFWKEIDLPSYGLVWKWPQIDCAHPLDDGSAGVLY
RSIEQTTAGMGPDATRWRRALGDLAAGFDQLASDLMRPVLRVPHHPVRLA
RFGPRALLPATVLARWFRTEQARALFGGAAAHIYTRLDRPLTASLGLLFL
ASGHRYGWPVAEGGSGSIIHALVAALRAHGGDVATGVTVSDRRDIPDADI
VMLDLTPAAALRLYGELMPGRIARSYRRYRQGSSAFKVDFAVEGDIPWTN
PHCARAGSVHLGGSFAEIADTERQRAQGKLAPRPFVLVGQQYLADPSRSA
GGINPIWAYAHVPFGYTGDATALVVDQIERFAPGFRDRVVATVRRGTADL
AAYNANYIGGDILGGANDGLQVILRPRVSVSPYATGVPGVYLCSQSTPPG
PGIHGLCGYHAAEAALGWLRRRGG
>MAP4197c hypothetical protein
MTRTDQDSWDLASSVGATATMVAAARALASTGERPIINDPFAAPLVRAVG
LDFFRRLVDGEVAPADPQRGERDLQLETDSIAVRTRFFDDFFTGAARDGI
RQSVILAAGLDARAYRLDWPAGAVVYEVDQPKVVEFKTNTMAALDARPAA
QLRTVSIDLREDWPEALRANGFDVTQPTSWSAEGLLMYLPPEAQDRLFDN
ITALSAPGSRLATEYHPDATGTTMAQRAQEFNDRWARVGCDIDLSGLFFD
GERSNVVEYLTGRGWRVSARPRRDLFDDYGLAYPEDDETAQFPNIVAVSA
ELG
>MAP2071c hypothetical protein
MTFWSLIAEAARRGSPRPLLADEHGRSMTARQLYDAACVAAAALAERGVR
RGAVVSWQLPTTLETMVLMAALARLGAVQNPIIPVLRESEVRFITGQLNT
EYFVAPGLWRGFDHGGLARALSAERGFEVITVDLAAPPAAGALRLPGADP
DSLPAPPQSADEARWIYYSSGTTAAPKGIRHTDSSVIAGSAGVVGMVGAT
SSDVDPIAFPVAHIGGAAMLATALLTGMWLVLFEAFDPAATPLAIAAHNP
TFLGTATPFFVAYLEAQRAQGNRPLFPSLRGCLAGGAPITAELSRRVRDT
FGVAGIANAWGMTEFPCATSPSLTAAPEVLDHTVGPPVPGVEVRVVDGAE
NELAAGQEGELRLKGPQCFLGYADPTLDADAFDDQGWLRTGDLGLIDADG
NVRVTGRTKDAIIRNAENISALEIENALAAHPAVADVAVIGIPDPRTGER
VCAVVVPATADGVTLESLVQHCRSRGLSRYKHPERLVVVDTLPRNQFGKV
IKKDLRDAFG
>MAP2642 hypothetical protein
MGESPSITGVAVKFPPNRYTQSEAIRALTDIAGPEFRRFAHSSGVEFRNT
ALRLPRYRELSGFSEANDAYLEVALDLGEQALVAALDRGKVKPSEVDIVF
STTVTGLAVPTLEARLATRVGLRPDVKRVPLFGLGCVAGAAGVARMHDYL
RAFPDQTAALLAVELCSLTIQRHDTSIANLVATSLFGDGAAAVITEGARR
AGAEHTGPRILATRSRIYPDTEEVMGWKIGGDGFRIVLSADVANIAEKYL
GEDVRDFLADHGLAPRDVTTWVCHPGGPRVIEAVESVLDLPADALDHTRN
SLRENGNLSSVSVLDVLAANLADPPAPGSIGLMIAMGPAFCSELVLLAF
>MAP2530 hypothetical protein
MSDRFALTDRVAVITGAGTGIGRASALVLAEHGADIVLAGRRPDPLQATA
KEVEALGRRALIVPTDVTETHQCQDLVDATLADFGRLDILVNNAGGGETK
GITRWTEEEWHDVVDLNLGSVWFLSRCAVKPMMTQGSGAIVNISSGASLI
AMPQAAIYAAAKAGVNNLTGSMAAAWGRKGIRVNCIACGAIRTDGLLADA
AKGGFDVDQLGAMNAMGRIAEPDEIGYGVLFFASDASSYCSGQTLYIHGG
PGPAGI
>MAP0025 hypothetical protein
MTSSNPPLPRAVDDRLVSILRDDLDLKFESLSPTTRLIDDLGMDSVAFAV
SLVAIEERFGTQLSEEDLLYCTTVGDLQSAIAARVNA
>MAP1851 hypothetical protein
MPNSFELDGRGPSDRQLFGIGIAVLVVSALLTTVMLVKSTGRLDDYVRVV
ADLVNVGDGLPQKSDVKYHGVLVGMVDDVVPAAHGKPNYVHINLKEDYAK
SIPASVTARVVPSNVFAVSSVQLVGNGPGTKIRDGAHIPEDKRLPTVLFQ
TTVSKLRDLLAAAGRGRDDRSVGILAALGAATDHRRVSLLNAGAQLNRLL
DQLNSIVSTDTGPSTVSALLDATRGLAQTAPDLLDALHQAVEPMQTFAET
RGQLASLLSGADYTVGTTRQSFDNHIDQLIRITADFTPVLGVLAQKSNNF
VPAVTKLDNLANQFMEQVWNPQTNLGNMRAMLTFTPSSTYTRADCPHYGE
LKGPSCFTAPLIPVRPDLPDVLLPQNYQPPKDLAPPPGTVIGPDGNLVAV
GPPLINPTPNLTDPNPPLAPGITPSPPVPGSANPDNPSPGAPATPQPQQP
WVAPVAPKAPWIPQSSFRPSFGGNVGPVGSQYERTMLGAITGGPASEATE
LLLGPVARGTTVSLAHPPGPSPAPRPGEGK
>MAP0578 hypothetical protein
MKRLEGKRILVTGAASGIGQATALRLLDEGAAVAASDVAVDGLERTRDRA
AQAGTAERLVTVTMDVGREDAVVDGVRVAVAQLGGLDSLVNAAGMLRAAH
THQTSLQLWNQIIGVNLTGTFLVVREALPALLDNARSAVVNFSSTSASFA
HPYMAAYAASKGGIQAFTHSLALEYARRGLRAVCVAPGSIKSGITDATGG
YIPEDADWSLFSRLLPVLPTTVESSGTGMAEPTAVAGVIAMLVSDDGAFI
TGTEIRIDGGTHA
>MAP1690c hypothetical protein
MTDRQERAMSFGSIAEDYDGLRPQAPPEAVDWLMPPDCGVAVDVGAGTGL
FTRALAGRAAHVIAVEPDARMRAVLAHRSPAVRVLEGRGEAIPLPDRCAD
AVFVSSAWHWMDPQRAVPEIGRVLRDGGRFGLIWTSRDRDVDWVRDLDLL
PGDDTSEADAPDRFRRRHENVVLPDPQIFHHVARETFTFVRTMTLDDAVA
MLATYSRIITASPADRARRLAKARAVLTERFPGAHAIDIPMQSRCWRADR
IARAG
>MAP1871c hypothetical protein
MTIDDRALHVGREQLTELDGGPFPLTRGQLDIWLAQETDQQGARWQLGYL
LRIEGTVDPWLLEHTIRQVVREAEPLRAAFFQVDGQVFQKAVDYPDVELA
CYQRMGSQDPVQEAYRLAASIQRTVMPLDGPLFKFALLQTRVDEFYLFVC
CHHIVADGIGLALICHRIGDVYNALASGAPIPPAYFGSLSDLIACEAEYE
ASTDYLDDRAYWAENLPSESEPGYRLAPASEREPYESSAPIELDPVAVAG
IQKLSQQLGVRRSSVLAAACALLVGGCDLENSEIVFEFPVSRRVRPEAQT
VPGMITGFVPLVLKASPGSSVASFCEHVDTRLGEALQHQRFPVHTIENKR
RGSTQTSNRVILNFIPTTNLANIAGARVSGTLTHTNLVDQLGLDFFSDHD
RLFLGMQPGATTGGFGGAGQWLSDCDVHDVVGRLERVLVAMTADPGRRLS
SVDVLGMGERVRLDELGNKAVLAGPQGVSVSIPDLFARHVAGTPDAVAVS
FDGRSVSYRQLDEASNRLAHLLVARGVGPGERVALLFSRSVDAVVAILAV
LKTGAAYLPIDPGVPDERIGFVLADAAPIAAITGTGLAERLDGHDVAVIG
VDDPDVVPAVDAQPSTGLPGPAPDDVAYVIYTSGTTGVPKGVAITHRNVT
QLLGSLDAGLPAPGVWALCHSLAFDVSVWEIFGALVRGGAVGGGAGVGDR
FSAGAARHVGGRTGQRVDPDPLGGGGIAG
>MAP0565 hypothetical protein
MAGSGMPSHRSMMIKVSIFAAAMLLVSAGLVVVFGDFRFGPESTYHATFV
DVSRLKAGQKVCIAGVPVGAVEAVKLNRDNTIDVTFGVDKRYTLYSSTRA
VIRYENLVGDRYLEITSGPGELRKLPAGGTINSQHTQPALDLDALLGGLR
PVLKGLDADKVNTISSAVIQLLQGQGGALSNVLADTSAFTSALGQRDQLI
GDVINNLNTVLTTVDQRSAQFSASVDQLQQLITGLAKHKDDIAGAIPPLA
STTTDLTELLQNSRRPLQGVLENTRPLATELDNRKAEVNNDVEQLGEDYL
RLAALGAYGSFFNIYFCSVTIKINGPAGSDILLPLGGQVDPSKGRCAFVK
>MAP2378 hypothetical protein
MTDLAELVAPAHTAVITQEVQGAVVGPDAGLGALAAEARRVALPNIVRLL
PPARAAGVRIVHCLVQRRPDGLGSNHNAKIFALGRRSGQGRVDISPGTPG
ATLLPELGPEPSDLVLSRWHGVGPMGGTDLDAVLRNLGVSTLVVVGVSLN
IAIPNVVMDAVNAAYRVVVPRDAVAGVPAEYGEAVIANTLSLLATITTTD
ELLRAWSRP
>MAP0202 hypothetical protein
MNDYDAIVIGAGHNGLTAAVLLQKAGLRTLCLDAKLYAGGMASTVELFDG
YRFEIAGSVQFPTSQAVVDELGLDTIPTIDLDVMSVAVRGVGDDPLVQYS
DPVKLFTHLNEVHGADAVNGMAGLMAWAQAPTRALGRFEAGTPPKSLDEM
YACATNEFERSAIDDMLFGSVTDILDRYLPDREKHGALRGSMTVLAVNTL
YRGPATPGSAAALAFGLGVPDGDTMQMKKLRGGIGVLTSHLCDVLERHGG
ELRLRTKVTEILVEHAGSGGRVTGVRTESGETLSAPIVVSGIAPDVTLNE
LLDPAALPADLRARYARIDHRGSYLQMHFALQEAPVFAAPYQVLNEPVMQ
ASIGLFCTPEEVQQQWEQARRGVVPADPTVVLQIPSQNDPGLAPEGKHAA
SAFALWFPIEGEVDYGRAKIEMGQRVIDKITRLAPNFEDSIIRHTTFTPR
HMGVMFGAPGGDYCHGLLNANQVGPNRPGPRGFLGQPIPIEGLYLGSAGC
HGGPGITFIPGYNAARQALTDRSG
>MAP2230c hypothetical protein
MSTVEGLRDPASSFAIVGYAARFPDAADADEFWRVLAEGRDAVSEVPKDR
WDADEFFDPDPDAPGKVVTRRAGFVDDVTGFDAPFFGMSAREARLLDPQH
RLLLETAWRAVEHCGTAPTDLANTNTGVFIGISTHDYLGMASDELTYDQI
EAYTAIGTSSAAAAGRISYRLGLQGPAVAVDTACSSSLVAIHQACQALRL
GECDLALAGGANVLLTPATMITFSQAHMLAPDGRCKTFDAAADGYVRGEG
CGVLVVKRLEDAIRDGDRIRAVIRGSAINQDGASGGLTVPNGVAQQRVIA
EALQRAGIAPGDVDYLEAHGTGTSLGDPIEVQAAGAVLGAGREASRPLLI
GSVKTNIGHLEAAAGIAGVIKVILALEHGMLPQHLHFRDPSPHIPWDRLA
VQVVKEATAWQRNGRPRVAGVSSFGFAGTNAHVILEEAPDGTGRAAVPPA
AVRPPGQRRFHVLPLSARTPAALVQIAGEYRDWLSAHPDTALADLCLTAG
AGRAHFEHRAALVVDSTGSAGELLAALADDRPAPGLLRGVCEDAPKTAWL
FTGQGSQYAGMARELFDTEPVFAETLTRCATVLADVLEKPLLDVIFDTDD
PASEETLRQTSYAQPALFAVEMGLARLWQSWGFEPDVVLGHSVGQYSAAC
VAGVLRLEDGARLIAERGRLFGSLPAGGRMVAAFAAAARVESLTDEFPSL
SVAAYNGANTVLSGPGQELEQAVARLTAEKVRCDWLDTSHAFHSALRDPI
LDEFESYANQFDFGPPQRILICNRTGSALGRSVKLDGAYWRRHARQPVEF
AKSVRTLADLGCKVLLEIGPQPVLTAAALRAWPDPATAPRAIASLRRNTA
DHRQITEALAEAYIVGHLPDFGAAVGPARKLDLPTYPFQHRQYWFSEQRV
PPGSQQTSTQTTAVRLLEDGRIAELAELLDGASVDQHTLDVLTKLAAQHN
QQRKSRSIAEDRYEIRWEKSTVPGPEAEATWLVVGDDDDAVAPLVDALTA
RGHQHRILVLPASDADEERVEIVLRAAAAEEAPLRIALVAALDSGTAPSM
RSLLRMQHRVLAGTRRLFRAAAAAELRQPIWLVTRGAQRVTDADTVFPDQ
SCLWGFGRAASLEYPQLWGGLADLAQGSAEEWSRLVRVASAPRDPAVGED
QIALRGSDIYVPRLTRRSGQPAAAALTLRADATYLVTGGLGSIGLEIAGH
LAAQGARHLVLTGRRAPGEAAQRRIDALSQQHGCEVRVIAADVADAHHVA
RLLGAVRAELPPVAGIVHAAGEIGTTPLSDLEDAEIDRVFAGKVWGAWHL
SEAAADLRLDFFVSTSSIASVWGGLGQTAYGAANAFLDGLAWRLREQGVN
GISVNFGPWAAGMADENARARLAQRGIKTLSPADALAGLADVMATAAAQG
VVARIDWTRFLPLYQQAGQRSLLAELAREVPDFAPAPTPSGKTGLVERLA
GAPVQQRRKLMTEFLRNAVAEVTRVDAAEIREDAGFFDLGMDSLMAIELR
RHIEQGVGKQIPATLAMDYPRLSDVAEYLLGDVLDLRGQPDAKPAPPAGT
PAPAGRTDEPIAIVSMACRFPGAADPEAFWELLSGGVDAIREVPEDRFDI
DEFYDPDPETPGKMYSRFGGYLDQIDEFEPEFFGISPREAVWIDPQQRLA
LETVWEGLERAGYSPAALRGSRSGIFVGVGANEYSHLLSAESVDKIEPHF
ITGNALNAISGRVAFALGLEGPAVAVDTACSSSLVAVHQACQALHSGDCD
LALAGGVNVLLSPVSNIAASRARMLSPVGRCKTFDATADGYVRSEGCGIL
VLKRMSDAVRDGDRVCAVIPASAVNQDGASSGLTVPNGGAQQRLIATALS
RAGLSGDDVDYLEAHGTGTALGDPIEVQAAGAVYGATRDTDRPLLIGSVK
TNIGHLEAASGVAGLIKVVLSLQHEMLPQSLHFETPSPHIPWDSLPLRVV
DKAVPWQTNGRPRRAGVSSFGFTGTNAHVLIEEAPQPVTTQHATPAAESD
ARPVDVLPLSARSPEALLALAQRYGEWLHAHPDADIGDLCFTVGAGRAHF
EHRAALVVDSVRSAREALADLADNRTRPGVARGECTDPPTTAWLFTGQGS
QYPGMARELFDAEPVFAETVTRCADAVDGLLPRPLTEVIFATDKDTGGEA
GKTLRNTAFAQPALFAVEMGLARLWQSWGVEPDVVLGHSVGQYAAACVAG
VFSLEDGARLMAERGRLFGGLPDGGRMVAVFADAQYVEELANDCPRVSVA
AYNGPNTVLSGPGADLEQIVARGASDGIRCSWLETSHAFHSELLDPVLDE
FESYAAHLDFAVPTLPLVCNRTGTVLTTETPLDARYWRRHSRQPVQFAES
VRTVAALGCTVLMEIGPQPVLTGAAVQVWPEHLAAPRAIVSLRKGVGDRR
QIAEGLAAAYVSGHRPEFAALHRRPRRKLELPTYPFQRRRFWPKASGIAG
IDGQGAAASGLLGSAKELASGDCVYTSRLSVKSQPWLADHVIYGTVVVPG
ATYAAMALAAVGPPARVQNVYFYEPIILPDKASREVQLSLHPLQEGGWSF
RVDSRPYGVPDAEWSLNADGTVVAGIEDEPAADPADHIDAVIERLDRSRP
QLLFDSFAESELEWGTTWSTSLKSLWMGDGEAVGDVTVGDELAEHLGTEP
MHPVLLDLCTGVVFPAFPALLAAEQGFNDLLLPLRYGQVSLREKMPRRFY
CRGKWRPSALDSETQVFDIDFIDRDGRHLGGIREFTVKRAPREALLRGLG
GDATRLLYTLGWHEVPPQPSNDGGANPSGTWLIAGFDELAAQLPGCIPFD
RNTDPESFGQLLARAEDRGGPVTGIVWRATQSGGQESSAESVARLETEIA
GLLTAVHALQNADGSAVKLPGGLWIVTERGVATESGEPVDPVQAALWGLG
RTIVNEEPALRCRLVDSDGSEQAVRSLAGLLGAPLDEPELAVRQGKLLAS
RLLPWARSGHLAVPRGTDYVLAPTERGAIDNLRLIEQEAPPPAEGCVQVR
VEAGGLNFREVLNVLGLYPGDPGPLGGDFAGTVTQLGSGVTGLEVGTRVY
GFMQGAFASRFNVPVQLLAPIPDGVSAVDAATIPAAALTARLAFDWAQLK
PGDRVLIHAASGGVGLAAIQLAQRHGATVFATASTYKRATLRKLGVEYVY
DSRSTDFADQILADTDGAGVDVVLNSLTNEGFIEATVRATAQHGRFVEIA
KRDIWTREQMAAARPDIAYEIVALDWACVEQPEHIHALLTEVSDGLASGE
WRPLPAEIYPLAEAKTAFRRMQQARHIGKIVLQMPKPLQPRGDRSYLITG
GLGAIGLHLAAYLAQLGAGEIVLTSRRAPDAAARRAIDDITERYRCRVHT
FAADVGDAAQVEQLLARIRAELPPLAGVAHLAGVLDDALLSQQSPERFRV
ALAPKAFGASYLDHATRTDDLDFFIVSSSVSSLLGSPGQANYSTANAWLD
GLVAARQARGLPATGVNLGPWAQGGMASSEAARANIGAQGLIPLEPTAAL
AALAEVVANGTGQATVIKANWQRAAKLLGDARPPILDLVLPSAVGEASGD
SELLKQLQEIPVAQRAAFITEFLQREVQGFLRLAQPPAATSRFLDLGTDS
LMAVELRNRLHSQFGGAFTINATAVFDYPTIGGLAEYLAGQLPDAEPEPG
AAVAANPNPGSETGTGEAEPEREAAPEPS
>MAP1503c hypothetical protein
MSEGQYGTFHLPRLDFATLPMSVDRGLGWKTLRDAGPVVFMNGHYYLTRR
EDVLAALRNPKVFSSTVLQPPGHPLPVLPLAFDPPQHTRYRKILQPYFSP
HALGKSRPVLERHAAEMIAALADRGECEVMADFAHLYPFQVFMDLYGLPL
QDRDRLLDWKNAVVGEKPFVTESDVEKSEQLLAYLADAIAQRRQHPGTDM
LSQVMTGEGNFTDIELLGMSHLLILAGLDTVTAAIGFSLFELARRPQLRK
ELRDNPKQTRVFIEEIVRLEPSAPVAPRITTEYVEVGGMTLPPGTSVRLC
MAAVNRDDSDSMSTNELNMDGKVHRHWGFGGGPHRCLGSHLARIELTVVV
AEWLTQIPDFELPDGLRAGDQLPLKEFRAQGAAAALGLKDPPRCSGTVPA
CRTTCGIRSRRPERTRRCPRPSAGHGRGAADRPSSCRG
>MAP3253 hypothetical protein
MKAQSLEGVLISGGSSGLGAATVAAVADRGGVPLVIDRKPPAADVPYVGC
DLADTDAVASAVESLAGKVGGRITGVFTPAGIDSCGTLEQVPAKDWEQVI
AVNLVGTAAVIRAALPYLKATGGRIVTCASTLGIKAVSDATAYCASKFGV
VGFTRALAAELAGQVGVTLLIPGGMHTPFFDGRPEQYKPPADAKLNQPED
VAQTVLFALCQPAGCEVREMVVCASTESSWP
>MAP3222c hypothetical protein
MAVAGRTEQVWDDRLPGTIGSTVADIEAAGGRAVPVRADLTDRDDVARLV
DSAREALGPITILVNNAAFTAPGRPPAPGGEARAKPAAGGAKPASGGAKP
ASGGAKPGWPGFVSTPLHAYRRHFDIAVFAAYELMQRVCPDMIGAGGGAI
INITSVASRLPGDGPYADRSGGVLPGYGGSKAALEHLTQCVAYDLADHRI
AVNALSPSKPILTPGLSYYARDFDDTASADEFARAAVELALVDPGRVTGR
TIGHLQVLDGSFRPFGLD
>MAP2189 hypothetical protein
MPAGHRAPSHPLASRRPTFRYARPLAGLATVIAAAAVLALALTQFRGGFT
QTVPVTVVAGRAGLMMNPGAKVKLHGAPVGSVASIQDRADGQADIHLALD
PSRLQLIPANVLVNIVATTAFGAKFIQLIPPESPSPQRLRAGQVLDAGRV
TVEINSVFQQLTSVLAKIDPAKLNETLGAMATALNGRGARLGQMFSDLNN
FLGKIEPSLPNLSHDIEATAEVSSAYADAAPDLITVADRTTRISQTLVDE
QQNLDALLISSIGLADLGNDVLGTNREKLAEVTHLLAPTTDLTNEYHQAL
TCSLKGMFPLALQPPTPVPGLEVLGGITLGSERYRFPKNLPKVAARGGPQ
CDHELPLAYNTFPPFDVADIGANPWQYGQQGILLNSDGLKQFLYGPLDGP
PRNTAQIGQPG
>MAP1787c hypothetical protein
MSIVIDYSQYATYIERRRHHPATEGVRHGVGAIPAGRTGRDRHGRREGSR
GGHRPGAGRGGRHCRRDRAHRGRYRRTIEGIEAAGGKGLALVADAMSRPD
GERVVNTAMERLGRIDILVNNVGGSSYARFLDITDEDFRHTFDWCVTSAF
IMSQLVAAHMLAAGHGSIVNISSGSARFGIRALTAYCVAKGGLEALTRAM
GQELAPKIRVNAIALGSFATDGLRGSLDLMPGSLEKMLAATPLHRLGDVE
DLGRLTVYLCTRDCYATNAIFHVDGGLDSNNSPLPIPDY
>MAP3261c hypothetical protein
MAAAGSLQERANMSRTVVVGASSGLGRCIGVGLAQRGDRVALLARRRQRI
EAAAKDAGPGAVAIECDVTDQASCASAIGEAADALGGIDNIVYTPAVGPL
VRMVDTDADTWRRIFDTNVIGASLVTAAAVPHLSASAGKAVYLSSDAGAF
GPPWPGLGAYGVSKAALERLVEAWRAEHPDIGFTCLIVGECAGGEGDGQT
GMNQGWDMDLAMKAVPLWSSRGCMPGKLMPVEDLIDVVHTILRTNASTSM
PLVVARGAPASQAAFAEANQS
>MAP1716 hypothetical protein
MDLTDKVVAITGGARGIGLATAKAFLAAGAKVALGDLDTELAEKQAVELG
GDPAVVGLSLDVSDPASFVAFLDDVEARLGRLDVLVSNAGIMPTGPFVDE
PPTMSRRMIDVNVYGVLNGSRLAAARFVPRGAGHIVNIASLAGVTGEPGM
ATYCGTKHFVVGFTESLHRELRPHRVGVSLVLPGIINTELSAGTKVPGWA
RPLATAEPEDVAAGIVAAVSKDKPRKTVPATLGALLKSVSLLPDTPRFAV
AHAVRFDQLVSGADPNARAAYHRRLAEES
>MAP0710c hypothetical protein
MDGFLSGFDGRAAVVTGGASGIGLATATEFARRGARLVLSDVDQPALEQA
VNGLRGQGFDAHGVVCDVRHLDEMVRLADEAFRLLGGVDVVFSNAGIVVA
GPLAQMNHDDWRWVIDIDLWGSIHAVEAFLPRLLEQGTGGHIAFTASFAG
LVPNAGLGTYGVAKYGVVGLAETLAREVKPNGIGVSVLCPMVVETKLVSN
SERIRGADYGMSATPEGAFGPLPTQDESVSADDVARLTADAILANRLYIL
PHAAARESIRRRFERIDRTFDEQAAEGWTH
>MAP2594 hypothetical protein
MAGVGLGQLLLALDATMVSLVDAPRGLDQPVASAALIDSDDVRLGLAAAA
GSADVFFLLGVGDDEALRWIDTQARDRVPVAVFVKEPSDALVGTAVAAGS
AVVAVDPRARWERLYQLVNHVLEHHGDRADAADDSGTDLFGLAQSLAERI
HGMVSIENAQSQVLAYSASNDEADELRRLSILGRAGPPEHLEWIGQWGIF
DALRSGTQVVRVAERPELGLRPRLAVGIHQPDPDARRPPVFAGTIWVQQG
AQPLADDAEQILRGAAVLAARIMARLAARPSTQARRLQQLLGVTDSETLA
PVDLTAIAAELGLAADGCAALIGWAAAEGASRHTRLTDVIALSASAFRHD
AHVAGHGSRTYVLLPQPPNRSVSSWVRGTIAALRAELGVQLRAVIAAPVP
GLSAVAAARAEVDRVLDSAERHPLSFGQVTSLAEARTTVLLDEIVTLVGR
DERLVDPRIVALHDREPVLAQTLRTYLDAFGDIAAAAHALRVHPNTVRYR
VRRIEKLLSVSLADPEVRLLFALALRVAER
>MAP1478 hypothetical protein
MPAGQKGRPMGYDTVITNGRWFDGTGGPSAMRDIGVRDGRVVTIAAGPLD
TAGATVIDASGQWVIPGIIDIHTHYDVEILCEPELSESLRHGVTTVLLGS
CSLSTVYLDNVDAGDIFGRVEAIPRRYVIEHLEAARSWTNPKEYVAELER
RNLGPNVAAFIGHSDMRAATMGLDRATRKDVRPSAAELARMEAMLNEALD
EGFVGMSSQQLLFDKLDGEVCRSRTLPSTYAAPRELRRLNAILRRRDKIL
QSGPDITNPLSVLSQLMTSLGFGRPRLKTSLLSAADIKAIPFVIHVMDRL
ARVVNALGGDFRWQHLPVPFEVYADGISLVVFEEFGSGAAALHLQTEIER
NRLMRDEGYRRRFRKDYDSRFGPRVWHRDFFDAEIVACPEQAVVGKSFGQ
VGLERGGLHPVDAFLDLVVEHGERLRWRTTISNHRPEVLRKMAQSPTVQM
GFSDAGAHLRNMAFYNSGLRLLRHAHDAQKAGKPFISMEHAVHRLTGELG
RWYGIDAGTLRVGDRADIVVIDPARLDESLDRYAEHPVASYGGLSRMVNR
NDDTVTAVLVAGKLAFGSGRAAPQLGRQRYGQFLRAGRPSRTLVSTG
>MAP1863c hypothetical protein
MVETSISDLLCERASLQPNDTAFTFIDYERDWNGVAETLTWSQLYRRTVN
AARALQHCGATGDRAVIMAPQGLDYIVAFLGALHAGQVAVPLGGVSDERV
DSVLRDASPTVILTTSSAAETVAGYVKSQSGRPVPSVVEVDLLDLDTQAA
SGAGGQNRPELAYLQYTSGSTRQPAGVMGSQRNLLANFGQIMSDFCAEYG
GSPRRTPPSCRGCRFITTWV
>MAP2365 hypothetical protein
MPLSTGPTGQPVVPTLDFTGETSPYPFFEHMRRTDPVWHGSLADASQLPE
ELRPEDEWVLFDYESVSQAFRDDRIFSSHKYDETIGLVMGHTILAMGGRE
HHDHRNLVAKAFRATALERWEPSVIGPVCEQLVDEIKNDGHADLVKAVTF
EFPTRIISTLLGLPAEDLDLFRRLSLDLISIPTDIEAGLNAATELYDYFL
KQVEQRRRKPTDDIIGDLVAAEIDGEKLTDEAIIAFLRLLLPGGLETTYR
SSGNLLYLLLTHPEQLAMVYRDRSLIPMAIEEGLRFETPLTMVTRTTTEE
VEIGGKTIPANAQIDMCMGSANRDETRWTDPNAFDIRRPRQAHIAFAGGI
HMCLGMHLARLETRVMLNSLFDRVRDLAFVPDDGTGEESKIVGLTFRSPN
KLPVTFAPAA
>MAP0268c hypothetical protein
MTQPDTDWDAAYRQAAPPPWSIGRPQPELERLIDEGKFRSDVLDSGCGHA
ALSLRLAALGHTVVGLDASATAIAEATAAAAAQGLTTATFARADVTDFAD
YPPGSEGRFATIVDSGLFHALPPQRRQDYLRSIFRAAAPGAALYILAFAA
GALAPAHPDRPGPQGFTETELREAVSVLWHIDDLHAARVYGNDDSAGAPD
SPLAHLEHDGEGHFMAPGFLVSAHKPD
>MAP0561 hypothetical protein
MTSSTNATDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVNDIATALDA
SDVIDEINAAGSKAVAVAGDISERATADELVAQADGLGGLDIVVNNAGIT
RDRMLFNMSDEEWDLVIAVHLRGHFLLTRNAAAYWRSRAKEAGGRIFGRI
VNTASEAGLVGPVGQANYGAAKAGIISLTLTAARALGRYGVCANAICPRA
RTAMTADVFGDAPEVPAGGVDPLSPEHVVNLVTFLSSPAAAEVNGQVFIV
YGPQVTLVAAPTAEHKFSAQGPAWDPAELSATLQDYFAGRDPEHNFSASA
LMEQ
>MAP3085c hypothetical protein
MSDATRIADIAATRARFGERTPLLVETVLLRRRASEKLGELGVSDWLFTD
EALQQATAAPVAVHRAGRLAGPGLVVHDVTCSIGTELAALRARGVTAVGS
DIDPVRLAMARHNIGSAAWLCRADALHPVTRDAVLMVDPARRVDGRRRLR
IDDYQPALSPLLDSYRGRDFVVKCAPGIDFEQVRRLGFAGEIEVTSYRGS
VREACLWSAGLAAAGVRRRATVLDRGEQITDADPDDCPVRPVGRWIVDPD
GAVVRAGLVRHYGARHGLWQLDPDIAYLSGDRLPATVRGLGVLEQLAFDE
RRLRQALTASDCGALEVLVRGVRVDPDALRKRMRLRGSRPLSVVITRIGA
RNSGQVTAFVCQPSR
>MAP2382 hypothetical protein
MMTNVPTAAGDETVSLRDPYPFFARKRREAGVFAGTVMDYSKTPESLMPK
QEYSAVSFDAVNTVFRDGRVFSSKPYDKTIGLFMGPTILAMEGKKHRDHR
NLVSAAFKSKALARWEPTIVRPICNALIDDFIDAGTADLVRQFTFEFPTR
VIARLLGLPDEDLPMFHTRAVQLISYHVDYERAFEASAALKDYFLEQIEQ
RKSKPTEDIIGDLVTAEIDGEKLSDEAIYSFLRLLLPAGLETTYRSSGNL
LYLLLTHPDQFAALQADRELLAPAIEEGLRFETPLTTVQRFTTEDTELQG
VRIPARSVIGVCIGSANRDERRWERSEEFDIFRKHVPHISFAAGEHTCLG
LHLARLETRVAMECLLNRLTNVTLLSDGDPHIHGQPFRSPTALPVTFDAK
>MAP0603 hypothetical protein
MRFENKVGIVTGSGGGIGQAYAEALAREGAAVVVADINAEAAEAVAKQIV
ADGGTAISVAVDVSDPESAKAMADRTLAEFGGIDYLVNNAAIFGGMKLDF
LLTIDPEYYKKFMSVNLDGALWCTRAVYKKMTKRGGGAIVNQSSTAAWLY
SNYYGLAKVGINGLTQQLSRELGGRNIRINAIAPGPIDTEANRTTTPKEM
VDDIVKGLPLSRMGTPDDLVGMCLFLLSDEASWITGQIFNVDGGQIIRS
>MAP1707 hypothetical protein
MPSRRVLITGASRGIGRAVADRLAEGGHEPIGLARSAPKDFPGEFYEVDL
ADPYATAATLDKIVGHAAVHAVVNNVGFARFGRLGSIELDHLFDTYNLNV
RAAVQVVQAALPGMLDAEWGRIVNVTSLTTLGTPERTPYAAAKAALEACT
RIWAGELASAGITVNAVAPGPTETDMYRERSPVGSAREARFLQSIPLHRV
ARPREIAHAICFLLDEDAGYITGQILRVDGGGSIAA
>MAP3563 hypothetical protein
MTDLDSSLPPSVRTAGDSWTITELVGATALGVAAARAAETAGPDPLIRDE
FAGLLVSSASPAWARLADPELSWLDDDPHGKRAHRVGIDYQAVRTHYFDE
YFDGALRAGIRQVVILAAGLDSRAYRLNWPAGTTVYEIDQPKVLEYKTET
LQRHGATPAAVRRPVPVDLRDDWPAALTAAGFQAARPTAWLAEGLLPYLP
SDAQDRLFEMVTALSAAGSQVAVEVFGMNSRSNAQRWLRMRERLGLDVNV
AALTYHEPDRSDAAAWLARHGWRVHSVDNRDEMARLGRPVPEDLSDEAVR
STLLRAHLGGSTG
>MAP3933c hypothetical protein
MAPHDKQHRDWSEADVGDQSGRVVVITGANTGIGYETAAVLAHRGAHVVL
AVRDLEKGNAALSRIVAASPNADVTLQQLDLASLASVRSAAEALRAAYPR
IDLLINNAGVMWTPKQVTEDGFELQFGTNHLGHFALTGLLLDHLLGVRDS
RVVTVSSLGHRLRAAIHFDDLHWERRYDRVAAYGQSKLANLLFTYELQRR
LAAAPDAKTIAVAAHPGGSNTELARHLPGIFRPVQAVLGPVLFQSPAMGA
LPTLRAATDPAVQGGQYYGPDGFLEQRGRPKLVESSAQSHDEQLQRRLWA
VSEELTGVHFPV
>MAP0598c hypothetical protein
MTTSTVVPRISGGEEEHGHLEEFRTDPIGLMQRVRDECGDVGWFQLVDKH
VILLSGAQANEFFFRSADEDLDQAEAYPFMTPIFGKGVVFDASPERRKEM
LHNSALRGEQMKGHASTIEGEVKKMIADWGDEGEIELLDFFAELTIYTST
ACLIGLKFREQLDHRFAEYYHDLERGTDPLCYVDPYLPIESFKRRDEARV
KLVALVQEIMDQRLANPPKDKADRDMLDVLVSIKDEDGKPRFSADEITGM
FISLMFAGHHTSSGTSAWTLIELIRHPDVYAEVLAELEELYADGQEVSFH
ALRSIPKLDNVVKETLRLHPPLIILMRVAKGEFEVEGFPIHEGDYVAASP
AISNRIPEDFPDPDAFKPDRYNKPEQADIVNRWTWIPFGAGRHRCVGAAF
AQMQIKAIFSVLLREYDFEMAQPADSYRNDHSKMVVQLARPAKVRYRKRN
A
>MAP0881 hypothetical protein
MKWVTYQGDDGERVGVLRDDTIYAMGSGVTLLDLIARGAEGLRHAGEEAQ
RSPAATVPAARARLLAPIPRPPSIRDSLCFLDHMRNCQAALGAGRVLADT
WYRVPAFYFACPATVLGPYDDAPTAPGSAWQDFELEIAAVIGTGGKDLSV
EQAEQAIIGYTIFNDWSARDLQQLDSQLGIGQAKGKDSGVTLGPYLVTPD
ELEPYRRNGKLDLRVTASVNDTVIGSGSTAQMDWTFGEVISYASRGVTLV
PGDVIGSGTVPTCTLVEHLDPAALESFPGWLHDGDVVTLRVEGLGETRQT
VRASGAPHPLAARPNPDAPPAAPRVNRAAARVPYTRGLHQVGDRVWAWTL
PDGGYGWSNAGLIAGDGASLLVDTLFDLALTREMLTAMRPITASAPITKA
VITHSNGDHTHGNQLLDPAVRIIAAHGTAEEIAHGMAPEMLAMAQTADLG
PVATRYTRERFGHFDFSNITLRNADETFDRSLAVEVGGRRVDVLNLGPAH
TAADSVVHVPDAGVLFGGDLLFIGCTPIVWAGPIANWVAACDAMIALDAP
TVVPGHGPVTDPDGIRAVRGYLVHIAEQAEAAYRKGLSWSEAADTIDLGE
YATWLDAERVVVNVYQRYRELDPHTPQLEAMALLVMQAEWLARRS
>MAP2354 hypothetical protein
MTESAPAELAANPLADGIPFGAKLQQLAGQRGDDTAVTVVALDGTATSLT
FAELDARANQWGRALAATGAQTGSLVALAIPNSQHLVLATLGCWKIGAVP
VPMHWDLPEWERDRVREVIDPAVVVDETSRWGLEARAAGESEAALPVAVS
PTANGICSSGSTGVPKVILNLAPSLWIPQHGEPFLSNWTPVAQPQTIMVP
APMYHTNGFAPLLMLLGGDHLVILEKFDAALVLDVIERFRITNFTATPTM
LARIAARPDVRQRDLSSIVFILQGPR
>MAP3328c hypothetical protein
MNTVHFDFTDARVLVTGGTSGIGNAIATAFADSGAAVTVTGTRAAATGYP
DIDLGAFSYRQCHIQDPESVDALANSLSDLDILVNNAGGPYPAGDEYDPD
GYVASVTQNMFGPMRLTMRCHDLLKSSRAAGGASVVNVVSMSAFRSAVFV
PGYASSKMGLVALTMNLSRRWAGDGIRVNAIAPGLIDTRMTHPAMGIPEV
MDVEIGFHTPLGRPGTPADCAGAALFLCTEAASYITGSTIAVDGGYLTV
>MAP0465 hypothetical protein
MANRYLQGPFAPMHHEYTLTGLEVVGTIPDYLDGRYLRNGANPIGDVDPE
RYHWFLGDGMVHGIRIRDGKAEWYRNRWVRSPATSKALGQPAPPGHFGFF
PIGANTNVIGHAGKTLALIEGGIANYELTEELDTVGVCDFAGTLTGGYAA
HPHRDPETGELHAVSYNPYRGNRVQYSVIDVEGRARRTVNVTVGGSPMMH
DFSLTERYVVFYDLPVTFDEAVGAAMTAPRGLRPIARLMLSALIGRVAIP
DPIAARLPAATTDRRMPYSWNPNYPARVGVMPRDGEDADVRWFDVEPCYV
FHPMNAYEEDGTIVLDVVRNPKMFDRDRTGPNEGPPTLDRWVIDLAAGKV
RESRIDDRGQEFPRIDERLVGKRHRYGYTPTVLEGIEGGDCLLKHDLIGG
STQSRCLGPQKALGEFVFHPSSPAAAEDEGVLMGYVYDRATNRSELAILD
AQTLEDVASIKLPHRVPAGFHGNWVPSTN
>MAP0547 hypothetical protein
MTSTIPEAIANIDLADGNFYADRRASREAYRWMRANQPVFRDRNGLAGAT
TYQAIQDAERNPELFSSTGGIRPDQPGMPYMIDMDDPAHLLRRKLVNAGF
TRKRVKEKEPSIGTLCDTLIDAVCERGECDFVRDIAAPLPMAVIGDMLGV
LPTERGMLLKWSDDLVCGLSSHIDPTSAEFQTVMDAFAAYTAFTMDIIAK
RRAEPTDDLFSILVNAEVEGQRMSDDEIVMETLLILIGGDETTRHTLSGG
TEQLLRHRDQWDALVRDPSLLPGAIEEMLRWTSPVKNMCRTLTADTEFHG
TELRAGEKIMLLFESGNFDESVFDDPDSFDIRRNPNSHMAFGFGTHFCLG
NQLARLELSMMTERVLKRLPDLRLADDGDLPLRPANFVSGLEAMPVVFTP
SAPLLR
>MAP3721 hypothetical protein
MVALGNINEWHPPHGPVTMWMAAPAAREAARAARRSDLAPSYQQNQHLWA
SYQGKAMNRQLPRLMIVAWDIPGTCDIAAMTATINAHVRRQDTYHNWFEY
DNGTFVRRVIDDPEAIEFVPVALGHKTADQVRAHALTTTPDTLEWGCFTY
GIVQHADYFTFYASVDHLHIDGLSAALIFLDVHLTYQELAQTGHQPAGLP
EIRSYRDYVARQREKAATLTLSSPEIKDWIEFARDTDGDWPSFPLPLGDT
WSSTRGDLLTVELINAEDTESFDAACVAADARFIGGVLACTGLAEHELTG
KKTYHGFTAKDTRTPGVDSMTVGWFASLIPITVPTAGETFAQAARTAQKS
FDDAQRLADVPVERVLELATPDELGIKLPTQLPMMLSFLDFRKIPLNGLW
AETKFGTYGDSLSHGGVNMWINRQAANTTVTMSFPDNVIARESVLRYIAT
LIQVFARVARPTGDEPVVVAPQVKPDDTFAPAPDTDDHEAA
>MAP3609 hypothetical protein
MLTPFVRRQLVAFGILTVISLLVLGIYYLQIPSLVGIGRYTLKAELPASG
GLYPTANVTYRGITIGKVTDVEPTEHGAEATMSIDSHYKIPIDAVANVHS
VSAVGEQYLDLVSSGNPGKYLSSGQTITKGTVPAEIGPALDTANRGLAVL
PKEKIGQLLDETAQSVGGLGPNLQRLVDATQAIVGDFKNQITDVNDIIEH
SGPVIDSQVKSGDAIERWARNLNRLGAQSAQEDSHLKSLLRQAAPTADQV
NDVFNDVRESLPQTLANLEVVIDLLKRYHTGVEQVLVFLPQGASIAQTVA
APFPNMAALDLALSINQPPPCLTGFIPASEWRSPADTSLQPLPTGTYCKI
PMDTPANSVRGSRNIPCTDIPGKRAATPRECRDPKPYVPAGTNPWYGDPN
QILTCPAPAARCDQPVKPGMVIPAPSVDNGLNPAPSDRVAGTPPPVSDPL
SRPGSGTVQCNGQQPNPCVYKPSGPPTAVYSPQSGELVGPDGVKYSVENS
TRTGDDGWKEMLAPAG
>MAP0112 hypothetical protein
MKAGKRKALAVACSAVMASSGCATNGLASLPLPAPGIGSGGYLLNAVFSN
ALNLPAHAKVKLAGADVGQLESMVARNYTAVTTLRIMDGVRVPVGSTAEL
RSATPLGDVFVAIKPPVPADPSAPLLKAGDTIGLPATRAAATVESVLSSA
ALLVNGGAVRNFTNIVNGAGKATGDQGRAFGDLINRTNSLLTKLNARSDQ
IDAAVTETAALADRLDAKNQAITDVLRAVGPATDVLSGNADEIADVVDEL
GATTRQLSKFPSIAGTDKTGRSVIKDANAIAAAWNDVVLSPDTSLAGLNR
LIPPFVKATPSQAISVRGSFDRLVLGSRPGTGAETGGFKGDPAFHGPMRR
DWNYLIGSIKYVLWRLQERVVGRGPQTPMGQSPWTPSGPPLPPAPAGQAP
PDPMVPEPPR
>MAP0577 hypothetical protein
MQLSFQDRTYLITGGGSGIGKGVAAGLVSAGASVMIVGRNPDRLAGAVEE
IAPLADRAGNGGAIRYEPTDVTNEDEVARAVDAATAWHGRLHGAVHCAGG
SLTVGPITHTDSEAWRNTVDLNVNGTMYVLKHVGRELVRGGGGSFIGISS
IAASNTHRWFGPYGVTKSAIDHMMMLAADELGESWVRVNSIRPGLIRTDL
VDASVIQSPEISADYAQCTPLPRVGEVEDVANLAMFLLSDAAGWITGQCI
NVDGGHMLRRGPDYSSMMVQMFGQDALRGVV
>MAP4282 hypothetical protein
MSQSLTDKVALITGAARGQGRAHAARLAAEGADIIAVDLAGPLPPSVPYD
SSTPEDLAETAELVRAAGRRVVTAQTDVRDLDALTSSVDTAVGELGRLDV
IVANAGICSPAPWNRITAQAFRDTIDTNVVGTWNTVMAGAQHIIDGGRGG
SIILIGSAAGINMQSFMVHYTASKHAVVGMARAFAAELGRYNIRVNSLNP
GAVATPMGTGRMRDALRAAADDYPHLRGLHKPLLPEGIAQPEDIADAVAW
LASDQSRLVTASQVSVDLGVGYV
>MAP0764 hypothetical protein
MFAIGCCVMVTATGCAFHGLNSLPLPGAVGRGPGANIYHVELPNVGTMES
NSPVMIDDVVVGSVGQMRVQGWHADVEISVKRDVVVPANVVATVGQTSLL
GSMHVELNTPLGQQGSGRLQPGATIPLSRSSAYPSTEQTLSSLGAVVNGG
GLGQIGEIIHNFSAALSGREGAVRDLITRLDTFVGTLDDQRDNIVDSIQA
LNRLAGTFAEQRDVVSQALQKVPPALDVLIKERPRLTAALDKLRVFSNTA
TRLVNDSQADLVQNLKNLEPTIRALADVGPEFGTAIAAGFVFPLTQNFVD
RAVRGDYFNLHVDLDLSIPRLKRGLMLGTHWGQLDQPLVPEPGDPYYLQY
THDPLHDPLRPPWVGPGPRQIVGDPLPGPPPGAAPLPGPPPGAAPLPGPP
PGAASLPDAGLGQTPPAATAPTEGGG
>MAP1132c hypothetical protein
MTLDTTTADASVAARHRTVWALGDYALMAEEVMAPLGPTLVEAAGIGPGV
RVLDVAAGSGNITLPAAAGASVVSTDLTPELLQRSRDRAAGMGLTIDYRE
ANAQALPFGDGEFDAVVSAIGVQFAPDHQRAADELVRVCRPGGRIGLISW
TPEGFFGRMLATIRPYRPSLSHPVPPAALWGRPGYVRALLGERIGEITTA
RGMLPVNRFGSAEDVHAYFKQHYGPTIEAYANIGHNRVLAAELDAQLVEL
AAQHLSDGTMGWEYLLVTAEKRSG
>MAP3385 hypothetical protein
MARTDDDTWDLATSVGATATMVAAGRARATRDGLIDDPFAEPLVRAVGVD
FFTRWAAGELDAADVDVPGAAWGMQRMTDMLTARTRYIDAFFAEAGAAGI
RQVVILASGLDARAYRLPWPAGTTVFEIDQPRVLEFKAATIAQLGAEPTA
PVRAVAVDLRHDWPSALRQAGFDVGRPAAWAAEGLLGFLPPQAQDRLLDN
VTALSADGSQLVAEVFANTGASGDALNAAGEKWRRHGLDVALDDLGFPGE
RNDPASYLQQLGWQPVRTPLNQMLANNGLPLQSTEPGAPFAQNYYCTAVL
NKAG
>MAP3600 hypothetical protein
MTAQLAHHPTQANEQPYLSRRQNWVNQLERHALMQPNATALRFLGKGLTW
GELHGRVRALADALSRRGSASATG
>MAP3741 hypothetical protein
MTAAELVDHLRGIGVQLWADGENLRYRAPQQVLTADLKAQLAAVKTDVIT
LLAEETTLLRAPQDRFEPFPLTDVQAAYLVGRTSAFQWGGVGCHGYAEFA
VDHTVATPSAEQYREAWRKVADRHDMLRCVVHPEGYQVICPDVPDDGLVI
HQCHTVEDVAGVRAGVTEHLRNRIYPLGEAPMYDLVITMGPDDTVVHLSV
DLLIADFVSISILMTDFQQCLLDPECDLAPVDFSFRDYLLNLARERSSAA
GSARRERDLAYWRDRLDQLPSPLSLPVLPDD
>MAP1622c hypothetical protein
MTTPQFGSQRSDDDNWDIVSSVGYTALLVAGWRALHAVSPRPLVRDDYAK
TFIAASGDPYLTGVLANPGTSEDELAFPRLYGAQTRFFDDFFDAAGAAGI
RQAVIIAAGLDSRAYRLEWPPATTVFEVDLAKVLEFKARVLGEQGAVPKA
RRVEVAADLRADWSRPLEAAGFDVESPSAWSVEGLLPYLTDEAQHALFTR
ISGLSAPGSRIAIGALGSRLDHDQLHALEESHPGVDVSGNVDFSALTYEP
QSDPAEWLAAHGWVVDPVRNTLDLQAGYGMTPPEVDVKIDGFMRSQYITA
AR
>MAP0508 hypothetical protein
MSLSEAPKEIAGHGLLEGKVVIVTAAAGTGIGSATAKRALAEGADVVISD
HHERRLGETADQLAALGQGRVESVLCDVTSTAQVDALFASAHARMGRIDV
LVNNAGLGGQTPVVDMTDDEWDRVLDVTLTSVFRATRAALRYFRDAPHGG
VIVNNASVLGWRAQHSQSHYAAAKAGVMALTRCSAIEAAEYDVRINAVSP
SIARHKFLEKTSSADLLDRLSAGEAFGRAAEPWEVAATIAFLASDYSSYL
TGEVISVSSQHP
>MAP0009 hypothetical protein
MLATGHPHPQAPSPAVVAPQLRRGPLTPRDGASSLKRMAVLDSATRFFGS
EAIQDPYPLYERMHAEAPMHRIGDSVFYAVCGWDAVHEAIERVEDFSSNL
TATMVFHEDGTVTPFDMGAPGAPMHALATADDPVHAVHRKILLPHLSAKR
IRIIEEFATQTADRLWDENLSDGRIEWMSAIANRLPMMVVCRLLGLPDDD
VDKLIRLGYATTTLLDGIVAPEQLEQAGMAAIELSGYVLEHFEKASEKPE
SSLMADLAARCAAGELEQLPALGIMLTLFSAAGESTASLLGSAAWILADR
PAIQRQLRENPELLSTFIEETLRFEAPFRGHYRHVWRDTTLGGIELPEGA
HLLLMWGAANRDPTHFKDPNEFRLDRAAAKSHLSFGKGVHFCVGAALARL
EAHIVLRRLLERTSWIDATDVGDWLPSILVRRRERLGLAVR
>MAP0718c hypothetical protein
MQTIKAAGLFDVDAGEIVRPGILRVDGDRIVGVGDSGPTGADDNVIDLGE
AILLPGLMDMEVNLLMGGRGEKPGLSQVQDDPPTRVLRAVGNARRTLRAG
FTTVRNLGLFVKTGGYLLDVALAKAIDAGWIDGPRVVPAGHAITPTGGHL
DPTMFAAFAPHALELTVEEGIANGIDEIRKAVRYQIKHGAELIKVCHSGG
VMSLTGPPGAQHYSDEELRAIVDEAHRRGMRVAAHTHGADAVKHAVAAGI
DCIEHGFLIDDEAIASMVEHGTFLVSTRRLAEGMDVSHAPPELQAKAAQM
FPKSRTSILAAHRAGVKIAVGTDAPAIPHGRNADELVTLVEWGLPPVAVL
RAATVTAAELINSSDRGRLAEGYLADVIAVPGNPLEDITVTQHVSFVMKG
GKVYVHNQN
>MAP0108 hypothetical protein
MLALGATVLLIAGLITGGLLLKSTGRLNDYVRVVAELTNVGDGLPARSDV
KYHGLLVGAVDNVIPAAYGKPNYVHINLKPEYAQDIPSAVTARVVPSNVF
AVSSVQLVDGAPGPSIRNGARIPEDLQLSTVIFQTTISKLRDILAATGRG
REDHTVGILAAVAAATNNRRGPLLTAGAQLTRVLDELNAIVATDPGPSTV
SALLDATRGLQSTAPDLVDVLHDAVRPMQTLVEKREQFRSLVTGSYHTFS
VNRQAFDNHTDQLIEMTQNLTPVLGFFAMNSDKFVPIFTRLNRLSDKFFQ
EVWDPELDTGNMRVNLALTPTYTYTRADCPRYGQLQGPSCFTAPTIAVRP
DLPEVLLPQNYHPPTDLAPPPGTQIGPDGNLVATGPPLYNPNPSLADPNP
PLPWWPWQIGPAPRVPGTADPDDAPPPPPPSPAPPGPPPSPAPPGAVAPA
AYGGNVGPVGSQRERDQLGLITGQGRPASVATQLLLGPVARGSAVSLQPR
AATGGPT
>MAP0701c hypothetical protein
MPVPPAGDTPSERRQLSRRGFMAAGIAGGLALAGCGQSKPDPSRNTRMAA
AIATAEAARPHSGRTVTAHLTPQPVQIDLGGPVVRTLAFGDSIPGPVIRA
AIGDEVAVTVANRLDHPTSVHWHGISLRNDMDGAEPATPNIAAGHDFTYR
FSVPNPGTYWAHPHTGLDEDTGLYLPVIVDDPAEADYDAEWIVVLDDWTD
GVGKSPAQLYGELSNPNKPPRNPPETTTTSTNPTTTETSPVTTTETTSTT
PTTAAAAPPGVGSSELLGGDAGAIAYPYYLVNGRIAAFPKTFNAKPGQRI
RIRLINTASDTAFRVALAGHSMTVTHTDGYPVVPAQVDALLIGMAERYDV
TVTAGDGVFPLVAVAEGKNALGRALLSTAAGSTPDPQFQPGELTKRVGTV
EMFTATTPVNLGRPDAGLDLPIVLGGNMIQYNWTINGEPYSKTNPLLVHE
GQRPTLSFENTTMMYHPIHLHGHTFQVIKPDGSPGARKDTVMVLPKQKLA
AVLSADNPGTWVMHCHNTYHQVAGMETRLDYVL
>MAP0762 hypothetical protein
MAALAAVVLVGLIVAGAAVLVRNTFFGQKTITAYFTTATAIYPNDEVRVS
GVKVGNIKSIEPQGTQAKMTLKVDHDVPIPADAKAVIVASNLVSARYVQL
SPAYRDSGPVMPDGAVIPVERTAVPVEWDEVKTQLMRLATDLGPKSGVSG
TSVGRFIDSAANALDGNGDKLRQTLAQLSGVGRILANGSGNIVDIIKNLQ
TFVGALRDSNVQIVQFNDRLATLTSVVNDSKSDRTRR
>MAP3753 hypothetical protein
MAHPGGAIPDNRLAFTDQMSFLSVRATGQGTVAQCVWIYERTVDFEGLRR
FHRNLDLGLLGRRIERSPLPFGRDRWVSSPPSVDIEVAEGARPRSELSEW
ADERGRLPVNAERGPAWHLGVAHFTDGSTAVSLVASHLVVDGLGFCLAIA
DAVNGTARDLGYPPPRSRHRLRAVVQDAYQSARGTHEVARALFTAVQAGS
RRRSDVARVRAARPMPIHRSNPDDGVMAPAITIYIDPDEWDARAKSLGGT
SNSLFAGVAAKLAEHSGCRSADDGTVAISFPVSDRTEDDMRANALSFVIV
IVDPAQATTDLRGIRGAISQALQTLQENPEELLQVLPLAPLTPKRVMRKL
AHVAYGYTDVGCSAIGELDPAVGRPDGTDADHVFIRGVRQRFTRQNFDRP
RGIFMVSGRIRGRMFITIVAYQLEGRDSKHHLHELVAQTLSEFDLTGTID
>MAP0199 hypothetical protein
MAMNLLHRHRCSSAGWAKEVADELLPWALDGVELGSRTLEIGPGYGATLS
APLDRTASLTAVELDPVMADRLQRRYGDRARVIQASGTETGLPANHFTSV
VCFTMLHHVPSPQLQDQLFAEALRVLQPGGTFAGSDGVPSWAFRLLHVAD
TYNPIAPKDLPGRLAAAGFADIHADARGGRQRWRAVKPID
>MAP1855 hypothetical protein
MTATGARGRLTALAAAMAVTTAVTMATAGCATNGLASLPLPAPGLGSGGY
SLNAVFSNALNLPMNAKVKLAGADVGQLESMVARNYTALTRLRIRDGVQL
PRGSTAELRTATPLGDVFVALKPPPGDHDAPLLKNGDTIGLESTAAAATV
ESVLSSAAVLVNGGAVRNFTNIINGFGKATGDQGQAFGDLIRKSNELLGT
LDARSDQISAALTQLSTLADQLDAKNHTITDLMTAAGPATSALADNTSEL
SEVAQQVGDTSRLLARFPALGGTDTSGRSMIRDLNTIAGAANDVAMSPDT
SWQSINRLIPALVKSTAGNSISVNVSVDKIMLGSLPDIGFPGDIGLHGPH
HYNVNLLVGTLKYTLWRLQERVVGRGPNSPQVPVVPDPTVPGQIDVAPGP
IPPQPGSPP
>MAP1803 hypothetical protein
MTTIQIDTPDGPIDALLSTPAGQGPWPGVVVIHDAFGYGRDKQSINDRIA
RAGYLALTPNMYARGGLVRCITRVMKELAAQRGRALDDILAARDHLQAMP
ECTGRVGIAGFCMGGRFALVMSPKGFSASAPFYGTPLPGNLDEVLEGACP
VVASFGGRDLTGKGAPEKLRKVTADKNITADIKVYPVAGHSFANELPAQP
LLRIAGFGYNEEATEDAWRRVFSFFGEHLAATP
>MAP1747c hypothetical protein
MSDRLRDIRELAGHTDTYRDELYRRWTGLLSYRYIGRKHSSMNLGETDDT
VTIRRDMRNEAGGIMVAPLAISSPEGCQTDMVAVPNPVIASVQIIDPGYD
VKRVEIVGSGIVHQGRTMGYGRCTIVDADNPGRVIAFNEGQGAIIGVPPE
GLDRMDVSGTELVIEDSDELPPLWRAFGASRRADGHWTLPELNTELASPD
AALHIGPQHVVLETAAIDLAAEVAGTRKLQVVSWHVMFMSRGKVGPFRVE
GTAHPGASGRVGVRMLLHDEGNADKAVTSAAAIFEVVG
>MAP3166c hypothetical protein
MTDPLWTAYAEKVDKVDGWFFEADVELFSHLLARQTAEGISGDMLEIGTY
QGKSAILMGYGLRDDEELVICDLFEAVVDHTHGSPSSRDQYSGLDQQQFL
ANWDRFHTRRPMIEVCESSQLDLADRAVRLAHIDGCHAYPCVAHDIELAV
RHTADRGVVVLDDYRGVETPGVAAAVWQAVGNGVLFPFAATYMKLYACAS
PADQHYWLEQVRGRGDICAFPDFEFPSYRSISEAQLRPLDR
>MAP2393c hypothetical protein
MDLGLANAAAVVVGGSRGMGLATARCLADEGARVAVIGRSRDALDSAVTD
LTRRGSPDALGLVADIGDDAAVGQAFGELARRWDGELNALVITVGPGAAG
TFEDLTDEQWRQAVEDGVLGMVRCVRAALPLLRKAQWARIVTFSAHSTQR
QSVLLPAYTAAKSMLTSVSKNLSLLLAKDEILVNVISPGSIASESLVGWA
NSVGVDGRDPYALMEAIGKHFGHPAHMPRAGLPEEIGPVAAFLASRRNSY
MTGANINVDGGSDFT
>MAP4189c hypothetical protein
MTRTHDDEWDLASSVGATATMVAAGRAMATKDPRGLIDDPFAEPLVRAVG
VDFFTKMMDGELDLDAIENATPVRIQSMVDGMAVRTKYFDDYFVDATDAG
VRRVVILASGLDSRAYRLPWPAGTVVYEIDQPRVIEFKSNTLAEVGAEPT
ATRRTIPIDLRGDWPAALSAAGFDPAAPTAWLAEGLLIYLPPEAQDRLFD
NITALSAPGSTIATEFVPGIVDFDAERVREMSGSFRQHGVDIDMASLVYA
GERNHVIDYLNGLGWRAEGVTRTELFHRHGIEVPAPENDDPLGEIIFISA
TRTR
>MAP4314 hypothetical protein
MFDLKITGGTVVDGTGAQRYRADVGIRDGKIVDVVRADGSEAGGLATAEA
AETIDATGKIVAPGFVDIHTHYDGQVSWDSLLEPSSGHGVTTVVTGNCGV
GFAPVRPGTEQWLIELMEGVEDIPGTALTEGITWGWETYPEYLDAIGKQK
FAIDVGSQVAHGAIRAYAMGERGARNEPATPDDIEAMGRLVREAIEAGAL
GFSTSRTMGHRAMDGEPVPGTFAAEEELFGLGRAMAAGGQAVFELAPQGA
AGEDIIGPKKELDWMRRLSGEINRPVSFALIQVDADPNLWREMLDLSADA
HAAGARLYPQIAARPFGMMIGFQGHHGFSHRPTYRRLAAECSREELAQRL
ADPAVKAAILAEDDLPVDPTLLFDGMFALVQHSLHRLYALGDPPDYEPTP
DRTVAAIAEARGEDPLATLYDLMLEADATNMLMLPLFNYADGNCDAIREM
LLHPAGVLGLSDGGAHCGMICDASYPTFLLTHWARDRSRGDKLALEYVIR
KQSRDTAHLFGLTDRGTIEPGKKADINVIDMDALRLHPAAMAFDLPAGGN
RILQGASGYAATIVSGTVTRRNDVDTGARPGRLVRGAR
>MAP1977c hypothetical protein
MDTSCEPGEAGRFWEERYRGAERVWSGRVNPRLAELAADLPAGRALDLGC
GEGADALWLAQRGWTVLAVDISATALRRAAEAAALRKLLARIDFQRHDLN
ESFPEGMFDLVSAQYFHSPVHLDRDAVLRRAATRVKPGGVLLIVDHGAAP
PWAQHDGHHIPGVEEVLGSLRLDPDGWTRLRAEAAGREMTGPNGEVGTLM
DNIMMLRRTEQAYAAS
>MAP2007c hypothetical protein
MTALETTDEFTDRITAAIDGASLALLLSIGHQTGLLDTMAGVPPATSDRI
AEAAGLNERYVREWLAGMTTGRVVDYHPATAEYSLPAHRAKVLTRAAGPD
NLALVALFLPLLAEVEQKIIGCFRTGGGLPYTEFPRFHALMAEQSGVVFD
TALVDVVLPLVDGLVPRLRRGVDVADFGCGSGHAINVMAQAFPASRFTGI
DFSEHAIAAGIAEAAERGLANVSFESRNLADLDRADAYDLITVFDAIHDQ
AQPARVLANIHRALRPGGVLLMADVKASSRLEDNIGVPMSTYLYTTSLMH
CMTVSLAAGGAGLGTAWGTQLAVAMLGEAGFADVRVAEIESDPINNYYIA
RKS
>MAP0830c hypothetical protein
MVIGGGHNGLVAAAYLARAGLRVRLLERLGQVGGAAVSAHAFDGVGVRVS
RYSYLVSLLPPRIIDDLGARVRLARRRFSSYTPDPATGGARGLLIGAPGN
PFAAVGADGDAPGFAGFYQRCRLVTERLWPTLLEPLRSRRHARRHVVDGG
GADAAAAWRALVEEPIGAAIADAVGNDLVRGVIATDALIGTFARLDDPSL
TQNVCFLYHVLGGGTGDWDVPVGGMGALTTALATAAVRHGAEILTGAEVF
AVDPSGAVRYRSGDDEHVARARFVLAGVTPTVLAGLLGQHPAPAIAGAQV
KVNMALRRLPRLADGGVTPEQAFAGTFHVNETWTQLDTAYARAAAGQVPN
PLPCEAYCHSLADPSILSDDLRAAGAHTMTVFGLHTPHALARGADPDTLR
GQLTDRVLASLNSVLAEPIQELLLTDARGRPCIETTTTADLERTLNMSGG
NIFHGGLDWPFADDDNPLDTPARQWGVATAHERIMLCGSGARRGGAVSGI
GGHNAAMAVLSSLSSR
>MAP1574c hypothetical protein
MADTASIGLKVRDKVVVITGGARGIGLATATALHKLGAKVAIGDIDEVRV
KESGAALDLDVYGKLDVTDPHSFSDFLDEVERQLGPIDVLVNNAGIMPLG
RVVDESDAVTRRILDINVYGVILGSKLALARMIPRGRGHVINVASLAGET
YLAGAATYCASKHAVVGFTDAARIEYRRSGVTFSVVKPTFVNTELIAGTS
GAKGVRNAEPSDIADAIVKLVAHPRPRVRVTRTAGAIIASQKFMPRALSE
GLNRLLGGEHVFTDAVDVEKRQAYEARARGEQ
>MAP3593 hypothetical protein
MTGGSAQPPRVALVTGAAGGQGRAIAERLRGNGYAVAACDRRIDELAATV
AASGDDRLIAVELDVTSEQQWRSAVERVVDRFGALSALVNCAGVLHRTPL
PQETADAFENAWRVNCFGAFLGMRAALERLRGTPGASIVNICSTGAIHPF
PQHCAYGSSKWALRGLTQTAAAELAPAGIRVNAVFPGPIATPMLDQATQT
RLAAAASFGRIGQPREVADAVAFLVSAEASFITGAELIVDGGQCLQIR
>MAP3140c hypothetical protein
MTTVLRAARWADVVTGKIHAPAVIVVDDERISAMNPEGPLPDSATTVDLG
DTTLLPGLMDMELNLLIGGPGGPEGLPSPMHGVQDDPAYRTLRGAVNART
TLEAGFTTVRNLGLMVKTGGYLLDVALQRAVDAGWHAGPRIYPAGHAVTP
YGGHLDPTVFQRLAPGIMPLSVAEGIANGVPDVIACVRYQIRHGAKLIKV
SASGGVMSHSTAPGAQQYSDAEFAAIADEAHRAGVRVAAHAVGDSAIQAC
IRAGIDCIEHGFLASDETIQMMVDHGTFLVSTTYLTEAMAIDRIAPELRK
KALEVFPRAKAMLPKAIEAGVRIACGTDAPAIPHGQNAKELGALVQRGMT
PAQAIRAATVVAAELIEADDELGRLAPGYLADIIAVPGNPFDDIAATLDV
RFVMKNGQIYKTPAA
>MAP1850 hypothetical protein
MTASAYQPFAPISVPLARLYRRGKVPVIRLGHLLVFFVRALVAVPLALRQ
YSGEFLRLLSNITWGNGSIVVGGGTAGVAVVLGMTVGALVGIEGYNFLDL
LGLGPATGFVSSLVNTRELAPLMASLAFAMQGGCRFTAQLGSMRIAEEID
AMESIAIRPIPYLVTTRLIASVVAIVPLYAACLAIGYLSTQIVVSIGSGG
STGSYLHYFTLMLAGQDIVYSLFKAVVFVWIASTIQCYYGYYASGGPEGV
GVAAGHGMRASITVVIIVNMLLTMALWGVDSGARFGG
>MAP2128c hypothetical protein
MEFLVAATTQVPDGTPAEAVDDLRARVSARCRELARQGQLLRLWRAPSPP
GRWRTLGLFAAADDNALEQLLASTPLRAWRTEEVTPLPVHPNDPTARLIT
PEPVSGGTAEFLQAITIRVPADAPQRVVDDVLAREAERAGELGAQGCLQR
LWWLHSGPGEPRVLALWRTADTESLAAVLRSLPLHAWLQVDTTTPLHTHP
DDPVSGSLAGR
>MAP3510 hypothetical protein
MSETLAPPLRFDDRVAVVTGAGRGLGRAYAHLLAARGAKVVVNDVGGALD
GAGVDTGPAAQVVDEITAAGGDAVACTESVATPEGGRAIIETALARYGRL
DVLVHNAGNVRRASLKQMSYEDFDAVLDVHLRGAFHVLRPAFPVMCRAGY
GRIVLTSSIGGLYGNQGVANYAAAKAGVIGLSNVAALEGAAEGVRCNVIV
PAALTRMADGIDTSAYPPMGAELVAPVVGWLAHESCSVSGELFIALAGRV
ARAVIAESPGVCRPGWTVEDVGEHLDAIRYVEAPLIFPVVPDGHAEHIRY
SFELAQRANEQGALHG
>MAP3590 hypothetical protein
MDFAYDPFDAEVMANPLPYYRILRDHHPVYYMPQWDTFALSRFDDIWRVL
EVNNGTFVASEGTLPPASVLAQHNDGPVDDPPLHPLPFHAMFDADLYGEI
RRTHSRPFRPRAVTDLEGRIRTLANERLDELLARGSFDLTQEYGGVVVAT
IVCELLGIPTDLAPQVLAAVNAGSLAEPGVGVDTGQARPNYFEFLLPAVQ
RRRADPSGPPLEVVDGLLGYQLPDGSALDDLEVATQMLCIFIGGTETVPK
IVAHGLWELSRHPDQLAAVRADPQHNIPVAREEMIRYCAPAQWFARTVRK
PFDIHGQTPNPGQRVITLLASANRDEREYPDPDDFVWDRPIRRSLAFGRG
QHFCIGYHLARLEVAVLLQEWLRRVPDYAIRADAATRLPSSFQWGWNKIP
VEV
>MAP1439c hypothetical protein
MTVVLADIDGDAVAALRDELAAGGGAAHDAACDVRDPAAVQDLADRAYDI
GPVRLLVNNAGIEQFGYLWDTPVVNWQHVMDVNVSGVFYGVRAFLPKMMA
AGQQAWVWNIASVGAVVAMPLQAPYIVSKHAVLALTECLHLEVQATGHDD
HVHVQAVLPGPVRSNIFESAGGVDPDAASDAAAAEAHRSAMLGIKAASMD
ALEAAEMIFRQSTEGHFYLHTHPDSVGAAMRERAKVLAAQQAPPLRTETR
FDSAPH
>MAP2486 hypothetical protein
MTSSTSSASESAAAPSGRDVIAQFLPQSPFVVKLGIVAERLDEDEVRLRL
PWDPSNVTIGDMVHGGAIATLADLTVMAAAWCGAQAPPQLRGVTVSMALD
FMAPARASDVIGVGRVLRRGRSLVNCEAEIVDPQGTLVAKALATYKVG
>MAP0727 hypothetical protein
MTNEFSELDFFRGSELIENPYPYYEALRQRCPVTKESHHNVTMITGWDEA
CAVLNDAETWSSCISVTGPFPGFPVPLEGDDVTELIERHRDELPFSDQLP
TLDPPTHTNHRSLLMRLITPKRLKENEDAMWVLADQALDTFLAPGHGEFI
KGFAGPFTLLVIADLLGVPEEDRDKFVKGIRQHSGGGVGGTGEETLAHSP
LEFLYGLFFDYVRDRRRQPREDVLTGLATATYPDGSIPEVEDVARVASNV
FSAGQETTVRLLGAALQTLGERPDIQAQLRKDRSLIPNFIEESLRHESPV
KGDFRLNRRPVTVGGVDLPAGTTVMVVQAAANRDPRRFDDPATFDPARKN
ARQHISFGRGIHSCPGAPLARAETRVAIERLLDRTTDIRINENIHGPAND
RRYQYVPTYILRGLTELHLEFTLA
>MAP2115c hypothetical protein
MRRKLSSIAWRVAIFTAVCLLFTFTLIAVFGQLRFEDRTGYQAVFTNISG
LKSGNFVRIAGVEVGKVGDLTLHRDGTVTVGFAVDKGVRLSEGTKAVVRY
ENLIGDRYLALEEGPGSPRRLPPGATIPLARTSPALDLDALIGGFRPLFR
ALDPDQVNALSGQLLRIFQGQGGTLASVLSQTSMLTSTLAGRSQLIGELI
TNLNTVLRTFATRDHEFSDGLDKLAQLVDGLAQRRDDISTGLAYINAAAG
SITDLLSQSRQPLKDVVQQTDRMSGQVLSDRDYVDNLLKELPDIYQVLAR
QGLNGDYFGFYFCEVLLKLNGKGGNPIFVKLLGQPSGRCTPK
>MAP1468c hypothetical protein
MVKPNLTLEIPDLRGKFAVVTGANSGLGFGLAKRLAAAGAEVVLAVRDPA
KGDQAVAAIRREVPQAKLTIRQLDLSSLRSVAALGEQLTAEGRPIDILIN
NAGVMAPPRRQQTSDGFELQFGTNHLGHFALTGRLLALLRAADSARVVTV
SSIAATQRKLDFADVNAEHGYQPMYSYGVAKLAQLMFAVELDRRSRLGGW
GLMSNAAHPGLAKTNLLSGASYGRSAPTLQARLTRLTWRLLPFMWLDIDE
AVKPTLYAAVSPDAQGARYYGPRGFYETARGGVTFARVPPLARSEPEMAR
LWRLSEQLSGVDYPG
>MAP3012c hypothetical protein
MGEFDNTVAVVTGAARGQGRSHAVALAQQGADVIVVDICADLPAIPYALG
TEAELAETVRLVESAGRAAVPVIADVRDLQALRAGVQAGIDRLGDIDVVV
ANAGVVAIGVTEAESEPVFNTIVDTNLKGVWHTMLATVPSIVRKGRGGSV
VLVSSSQGLTGRGGDGSAAMFAYAASKHGVVGLMRSAANAYAPHKIRVNS
VHPSGVATPMILNDFVVNRMLENPNPALSQTLLPEVPLVESRDVTGAVLW
LAGPRSRYVTGVAIPVDAGHVVM
>MAP4293 hypothetical protein
MTDELRGRRILVTGAATGIGAAAVSVLTDAGADVVATYHTTPPPPDLTAH
WLQCDARDADAVSALVHRAAEHLGGLDVLVHAAGLWQPGIPGCIGADDIS
FLLDTNVKATILTNQAAHAAMKAQDPKGGRIINFGSSEAVMGSPISAVYA
ATKGAVQAWTRSAAKAWAADHVTVNALAPAVQTPGADRLRAFLGPDAAAL
IDQQMQMMIPLGGALGEPARDLGPMLVFLAGSGSGFITGQLLAVDGGLMM
VGA
>MAP1254 hypothetical protein
MNLGDLTNLVEKPLAAVSNIINTPNSAGRYRPLYLRNLLDAVQGRTLEEA
VDAKTVLITGASSGIGEAAAKKIAEAGGVVALVARTRENLEKVASEIREN
GGSAHVYPCDLSDMDAIAAMADRVLDDLGGVDILINNAGRSIRRSLELSY
DRIHDYQRTMQLNYLGAVQLILKFIPGHARARLRPDHQRVVGRGADPRAA
VRGVHRQQGRAGQPVRRAAGRSRQRQRQIHHRAHGAGAHPDDQPDHVVRQ
VPGADARAGGRGDRRRDRAPAAAGQLAVRAVRRRRRRGQPGRDGPGAQPG
VRHVRRLQRRQGR
>MAP2528 hypothetical protein
MDVPMAGKVEGKVAFITGAARGQGRSHAITLAREGADIIAIDVCKQLDGV
KLPMSTPDDLAETVRQVEALGRRIIASQVDVRDFDAMQAAVDDGVTQLGR
LDIVLANAALASEGTRLNRMGPKTWRDMIDVNLNGAWITARVAIPHIMAG
KRGGSIVFTSSIGGLRGAENIGNYIASKHGLHGLMRTMALELGPRNIRVN
IVCPSSVATPMLLNEPTYRMFRPDLENPTVEDFKVASRQMHVLPIPYVEP
ADISNAILFLVSDDARYITGVALPVDGGALLK
>MAP2852 hypothetical protein
MTELTGTTVANTRTVEGFLNALQDADYDAAEAALADDLVYENVGLPTIHG
RARAMKLFRRMEDRAAFEVKIHRIAADGGAVLTERTDALIFGPLRLQFWV
CGVFEVQNGRITLWRDYFDFFDMLKATARGVAALLLPSLKATF
>MAP1442 hypothetical protein
MSRLLSGKTALVTGSSRGIGRAVAQRLAAAGATVAVTARSHSSSLSTRAG
TATALPGTIGETIELIEAAGGSAFGIAADLEDADQRDGLVDAVLDRTGRI
DILVNNAGFADYSLVEDMSLETFDRTVEHYLRVPFVLTKCAVPHMRKQGA
GWIVNIGSVTGVAPVRPYREYNKASGDVVYAAMKAALHRFTQGVAAELLD
ASIAVNCVGPSTAVRTPGAAQLIPESFPTEPVEYLAETVLAMCHLPAAER
TGLVAFSLHYPWSQQLPVHSLDGANLLPPLRPPANANPNILPAGV
>MAP1614c hypothetical protein
MATVEPTTKPVPNLPPGFDFTDPDIYAERLPVEELAEMRRVAPIWWNEQP
IGAGGFDDGGFWVVTKHKDVKEVSLRSDVFSSLQKTALPRYKDGTVAEQV
ERGKFVLLNMDAPQHTRLRKIISRAFTPRAVERLRDDLRERARRIVEAAA
AEGSGDFVEQVSCELPLQAIAGLMGVPQEDRKKLFHWSNEMVGDQDPEFA
SNDAITASVELIMYGMQMAADRAKNPGEDLVTKLVQADIDGHKLSDDEFG
FFVILLAVAGNETTRNSITQGMMAFTDFPDQWELFKRERPATAADEIVRW
ATPVTSFQRTALQDYELSGVKIRKGQRVVMFYRSANFDEDVFDDPFTFNI
LRDPNPHVGFGGTGAHYCIGANLARMTIDLMFNAIADAMPDLESIGKPER
LRSGWLNGIKHWQVDYHTNGSSKCPVAH
>MAP0517 hypothetical protein
MTRAEACPPFAEAINLGLTGRVVLVTGGVRGVGAGISSVFAAQGATVVTC
ARRAVEGLPYEFHSCDVRDDDAVKALIDTIADRHGRLDVVVNNAGGSPYV
LTAESSAKFNRKIIELNLIGALSVSQHANAHMQKQAQGGSIVNICSLSGR
RPSPGTGAYGAAKAGLESLTQTLAVEWGPKVRVNACVVGMVETEQSELFY
GDADSIAAISKNVPLGRLAKPADIGWAAAFLASDAACYISGASLEVHGGG
EPPHYLATTNASAIK
>MAP3729 hypothetical protein
MPLSLRPAAALFGAEIGGIDLRAPLTREQRDELQRLLQRYRVLFFRGQQL
STAHQIEFAEAFGPILIFRSVVPADPQHPGVHNVDGSTVGWHLDASGLIE
PPVASVLRAVEIPDRGGDTVWADGMAAYDGMPDDLKSRLEGLSATHTAPN
QHPLVAHPLVSHHPDIGRRYLNINLAPWVDTRILGMSTSHSSALVEQLRA
HHLRSDYQLRFRWSAGAVVLWDNRGMQHTGIRDYGDDTRRRLQRICIAHF
TEGVTGRA
>MAP2951 hypothetical protein
MGFVAQQQSVPGVQAKMEPIPDCGENSYRGSGKLLGKKAIITGGDSGIGR
AVAIAYAREGADVLIAYLNEDDDARDVARHVTDAGRKCVLVPGDLSDPAH
CRAVVDRAVRELGGVDILVNNAAYQMMHKNLDEISDEEWDYTFRLNVGAY
FYLTKAALPHLRAGSSIIGSSSVNSDTPNPTLAPYAATKAAIANFSASLA
QLLGDKGIRVNSVAPGPIWTPLIPSTMPPDSVESFGDNVPLGRAGQPAEL
APIYVLLASDEASYISGARVAVTGGRPIL
>MAP2117c hypothetical protein
MNATAIAKPMTALGQFFLLSAEALAAAVRGPWAWREILEQIWFVARVSIF
PTIMLSIPYTVLIVFVLNILLVEIGAGDLSGAGAGLASVTQVGPVVTAMV
VSGAGSTAMCADLGARTIREEIDAMKVIGVNPVQALVVPRIIAATFVAVM
LYAVVAVIGLTGSYIFVVFVQHVTPGAFVAGMTLVTGLPQVVISLIKATL
FGLSAGLIACYKGLSVGGGPTGVGNAVNETVVFSFMALFFINILTTALGV
KVTAK
>MAP3603 hypothetical protein
MGVALAHIPHALSHYRKETLRLIAQIGMGTGAMAVIGGTVAIVGFVTLSG
SSLVAIQGFASLGNIGVEAFTGFFAALINVRIAAPVVTGIAMAATVGAGA
TAELGAMRISEEIDALEVMGIKSISFLATTRIMAGLVVIIPLYALAMIMS
FLSPQITTTVLYGQSNGTYEHYFRTFLRPDDVFWSFLEAIIITAVVMITH
CFYGYNAGGGPVGVGEAVGRSMRFSLVSVQVVVLAAALALYGVNPNFALT
V
>MAP2861 hypothetical protein
MPRSSEGPLTGKVAFITGAARGQGRAHAVRLAADGADIIAVDLCDQIASV
PYPLATPEELAATVKLVEDIGSRIVARQADVRDRESLSAALQAGLDELGR
LDIVVANAGIAPMSAGDDGWHDVIDVNLTGVYHTIKVAIPTLVKQGTGGS
IVLISSSAGLAGVGSADPGSVGYVAAKHGVVGLMRVYANLLAGQMIRVNS
IHPSGVETPMINNEFTREWLAKMAAATDTPGAMGNAMPVEVLAPEDVANA
VAWLVSDQARYITGVTLPVDAGFLNK
>MAP0599c hypothetical protein
MPRFDPLPERRPAIVAGASSGIGEATAIELAAHGFPVALGARRVEKLNDI
VGKINADGGEAVGFHLDVTDPNSVKSFVAQAVDALGDIEVLVAGAGDTYF
GKLAEIAGDEFESQLQIHLVGAFRLASAVLPGMLERQRGDLIFVGSDVAL
RQRPHMGAYGAAKAALVAMVNNFQMELEGTGVRASVVHPGPTKTAMGWSL
PAEKIGPALEDWAKWGQARHDYFLRAADLARAITFVAETPRGGFIANMEL
QPEAPLADNKDRQKLALGEEGMPGQ
>MAP4087 hypothetical protein
MSTIFDIRNLRLPKVSARVLVIAALAAVFVFIAAVAGVQLYRKLTTTTVV
AYFSETLALYPGDRVQIMGVRVGSIDKIEPAGDKMRVTLHYNNKYRVPAN
ATASILNPSLVASRTIQLSPPYTGGPVLRDGAVIPIERTRVPVEWDQLRD
SINAILRQLGPTPQQPVGPFGDIIESAADNLSGKGKQVNETLNSLSQALT
ALNEGRGDFVAITKSLARFVTALYQHKQQFVALNDNLAQFTDWFTQSDHQ
VSDTIAHLDQVLDAARKFVNDNGSALTHDVGNLADVTTTILQPEPRDGLE
TALHVLPTFAGNFNNLYQPAHSSLVGLFVFPNFANPIHFLCSAIQAGSRL
GYQDSAELCAQYLAPVLDAIKFNYPPAGLNPFNSAATLPKEVAYSEERLR
PPPGYKDTTVPGIFARDTPFSHGNHEPGWVVAPGMQGTDVQPFTAGMLTP
ESLAELMGGPDAAPPPPGVTAAGPPNAYDESNPPSPPWFSRPTPPGPGR
>MAP2176c hypothetical protein
MTLNNIATMPSAFPSWIKLAPGRGGPASGTGATIVFPHAGAAAASYRVLA
AALAAGADTYVVQYPQRAERLGDPAHESVHDLAAGLFRAAPWPGVAPLTL
FGYSMGGVVAFEFARVAEANGTPVRKLWVSAGPPPCVVGDMPELPTTHDG
LLADIADLGGTDPELLADEEFSELLTTAVRADYQAFNGYDPSPDVRIGAD
IHVLGGHHDHRIATGVLRQWERHTAGSFAMSLYDGGHFYLYDHVETVAAQ
VNAG
>MAP3459 hypothetical protein
MTRSRHDRSLSFGSAAAAYERGRPSYPPEAIDWLLPVGARQVLDLGAGTG
KLTTRLVERGLDVVAVDPIPDMLEVLRSSLPETRALLGTAEEIPLEDNSV
DVVLVAQAWHWVDPERAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGR
IIGSDGDGRTHVTLPEPFTDLAHHEVEWTNYLTPQALIDLVASRSYCITS
PTEVRTRTLDQVRHLLATHPALANSTGLALPYVTRCTRATLAG
>MAP1819c hypothetical protein
MDDTGAAPVLILGGRSEIGVELARRLAPGTTVVLAARNADRVNDQVDALK
AAGASAVHTREFDADDLASHGPLVASVVADHGPIGTAVLAFGILGDQARA
ETDAEHAVAIVHTDYVAQVSMLTHLAIAMRAAGRGQLVVFSSIAGARVRR
ANYVYGSAKAGLDGFASGLADALHGTGVRLLTCARDS
>MAP2113c hypothetical protein
MTITRDALRKATALSLVLTLAVASVLVGGKLWRAVEKNSYAAYFAETNGL
FVGDEVRILGVAVGAIDKIEPQSAGSKVTFSVDKKYAIPAAARAAVLSPS
LVTARAIQLVPAYSGGPTLSPGAAIPLSRTAVPVEWDDFRKQLEKLTDAL
QPTTAGGVNSVGEFVNSAADNLRGQGDTARDTVLKLSEAISALGDHADDI
FSTVRNLQLLVSALYSSSDLLASFNTNLAAVTTLLTNTPNEVGSALKSLD
GALSDVRDFLAENREAMGVTVDRLGSITTALNDSRGDVKQILHIAPTVFQ
NFLNIYQPAQSAMTGILALNNFADIPQFICSSIEAASRARLARVSKLCLQ
YLNPIIKNRIYNYIPAGINPFVGTQARPSEITYSEDWLRPGYTPPDGGPP
PEAPAPPGQPAPADQPPAEPGPPPNPTSNSLQVLRDLMLPTGPS
>MAP0093 hypothetical protein
MTASTYIPGLARPFVGAYRVAAAPTMRLGHMLVFFVRAVLAVPTVLRQYR
TEFLRLLSNIAWGNGSIVVGGGTAGVAVVLGFTAGALVAVEGYNFLNLLG
LGPATGIISSLVNTRELAPIMASLAFAMQAGCRFTAQLGAMRIAEEIDAL
ESLAIIGNWRYWNRPNSGFGEISQY
>MAP3553 hypothetical protein
MHERSSHRPAGRGPPASRGPSPKLLQGIGFAVSRRTMMRRLSRRYGNVFT
LRLPMWGPVVMVSDPQLAKQIFTTTPDELGNIQPNLSRLFGSGSVFGLEG
DDHRRRRRLLAPPFHGKSMKNYESIIEEETLRETAGWPEGESFPTLPPMM
RITLNAILRAVFGAEGAELDELRRLIPPWVTLGSRLAALPKPQRYPRFGP
WGQLDRWRRHYDGVIERLIAAEQADPNFAERTDVLALLLRSTYDDGAAMS
HKEIGDELLTLLAAGHETTASTLAWAFERISRHPELLARLVEEADNGGNE
LRQATILEVQRARTVIDFAGRHVYPDVYRLGEWVIPRGYSIIVGIAQIHD
NPDVFPDPRRFDPQRFIDNKPSALSWIPFGGGTRRCVGAAFANMEMDVVL
RTVLRHFTIETTDAPDEPWHCRGVAFTPKHGGRIVVHRR
>MAP3677 hypothetical protein
MLRKRRIHRPQGRFHLRRDHEPLPARAVPRFGGRREHRRRHHLAPPLDPS
RGRAMKDFRAPVNGVDGDSASALVAVPKSQGAQTIRPDSVSRLVGALLDD
HAAVVGRVRSVIRSRLPVYRSVADEALEAELEWVLRSAVGGREALHEPQI
AGLAAIGEARAHDGVPVDDMLRAWRIGVEVVVECAREAARRLGVDDARVL
ELVQSALAWSDIAMATSAKAHRRTERALALEAEESDAEFVRGALMGSLPA
AELRMHAELRGLDPGAEYVAVRARLGGDGPHLRLEQSLGFQDPAHSRRGL
CALLDGDLAGFLIEPPRDVEGVVGFGPPRPLTRLSESYRMAARALVTAEA
CGLRGAYDIAALGLRTAVAIDADVGELLRKRYLEPLSVGGSSRELIATVR
TYLACGMHVERTATRLFVHQNTVRYRLARFEELTGASLRDTEVVTEVWWV
LELAAMRL
>MAP4083 hypothetical protein
MAAISAPGALRARYPRTAANLDRYGGGTVRRLWQIGIFARFARISIGQTG
WALRHYRRETLRLVAEIGMGTGAMAVVGGTVAIIGFVTLSGGSLIAIQGF
ASLGNIGVEAFTGFFAALANTRIAAPIVAGVTLAATVGAGATAQLGAMRI
SEEIDALEVMGIKSISFLVSTRILGGLAVIVPLYALALDMAFTSGQVVTT
VFYGQSNGTYEHYFRTFLRPEDVGWSVFEVVIIAVVVMITHCYYGYTASG
GPVGVGHAVGRSMRFSLVSVVVVVLLAQLALYGVDPNFNLTV
>MAP2719c hypothetical protein
MVVAEHQRHPGGGFNPPEPTTKGGPDYGRFIDAVRALQDHARAVDAPDEV
ITQAADQLEKVSALLAPFDADEWASPSGRRMDLPMRGNILTIPMSAQKGA
DGRMHGQARFARFHLGRNGAVHGGSLGMLFDTVLGLTASVLTGSRRQRTA
YLKIDYRSIVPIETELQFDAGVDRVDGRKIFVSGRLTHGERLLTEADALF
VRLKPGQP
>MAP0536 hypothetical protein
MAGMLDGKVVVISGVGPGLGTTLAHRCADSGADLVLAARTAERLEKVAKE
VNDGGHRALAVRTDITDDDEVAYLVETTMATYGRADVLINNAFRVPSMKR
LAGTSFQHIRDAIELSALGALRLIQAFTPALETAHGSIVNVNSMVLRHSQ
AKYGAYKMAKSALLSMSQSLATELGEKGIRVNSVAPGYIWGDTLQAYFEH
QAGKYGTTVEQIYAATAANSDLKRLPTEDEVASAIMFLASDLSSGITGQT
LDVNCGEYHT
>MAP0262 hypothetical protein
MDHKPPSPVIEAAHRACVLPFSDDADFRDADRGFIAALSPCVVRGADGRV
VFDNDAYAFLDGPAPTSVHPSLWRQSTLAAKHGLYEVVPGIYQVRGLDIS
NVTFVETDTGIIVIDPLVSTEVAAAALTLYRTHRGGDRPVVAVIYTHSHV
DHFGGVLGVTTQADVDAGRVEVLAPEGFVEHAVQENVYAGPAMLRRATYM
YGTLLPRGPRDHVGCGLGQAASMGEVALIVPTVDIRETGETHTIDGVEIE
FQMAPGTEAPAEMHFYFPQFRALCMAENATHNLHNLLTLRGALVRDPRAW
SGYLTEAIDTFADRADVVFASHHWPTWGRDGIVEFLSLQRDLYAYLHDQT
LRLLNQGHTGVEIAEMFRLPPALERAWHARGYYGSVSHNVKAIYQRYMGW
FDGNPARLWPHPPEALGPRYVAAMGGIDRVVDLAQQAFDSGDYRWAATLL
DHAMFTDGEHAGARELYADTLEQLAYGAENATWRNFFLSGATELRDGNFG
TAGQVTSPTMLAQLTPEQIFDGLAIRVNGPRSWGLDITVDVTLADTAVNH
RLALRNGVLVQRKVPADPATATVTVRLANKIRLLALAAGDFASPGLELTG
DRGALQALVGALDAPDPDFNIVTP
>MAP1964c hypothetical protein
MADAVRELVDATIRTEVDDAVVAEARSAIEAVTASLRRRTRPVGVSYRVN
GRPLPLGNAAIGVCNPIAPPIVVHHEGDGRCWSEFVLGSAYEGPPRLVHG
GVSALVLDHMLGEAASEGLSKARFTGTITVKYLRGTPLGPLRCDAWIDRR
EGVKVFARGTISDAAGVTVEADGVFIEPAWARETQ
>MAP3191 hypothetical protein
MSLSVFAAELGPTGGATMESFIHLRKGRTPGRLHADLDGLKDDELGRGGF
TGRTANMYRRHDPTAYRVQGPLRPIDVLTSELKPSDASDANGGPLLMFAN
PDCRISLSRRSEPMPFYVRHVDGDLLCFVHGGAGLLHTEFGPMPYRQGDW
VYLPKATTWRQLPDAETTLLMIEATDEFRVPPPGPLGRHFPFDPSQATIP
EPQALDDDPAHDEYEVRLVHEGGPTTLFYQHNPLDVEGWRGDNFAFTFNI
ADYNVVTSDSVHLPPTVHLFMQATGVYVMNFLPKPAEGVPGTERTPWYHR
NVDYDEIAFFHGGSLYGIPMPPGLISHAPQGVHHGAPEKARERARRKFDE
YARVDWQVIAVDTRRRLTPSAEVLAHDLGQH
>MAP2402 hypothetical protein
MPEASAPTIDHLVRSRAAEFGGKPMVIDPGSRISYDQLDTATRELAAVFV
QAGVGKGTRVGLIMPNNTRWVLIAIALTRIGAVLVPLSTLLRAGELVAQL
RVAAVQFLVSVDEFRGHRYLDDVAAARSELPALQQVWPNEQLDAAAAGAR
AGQIVDAMTQTVTPADPLVIMFTSGSSGTPKGVWHSHGSALGAVQSGLAA
RCIDADSRLYLPMPFFWVGGFGSGILSALLAGATLVTEEIPRPETTLRLL
ESERVTLFRGWPDQAETLARHAGTVGADLSALRPGSLQALLPPEQRARPG
ARATLFGMTEAFGPYCGYPADTDMPVSAWGSCGKPFDGMEVRIVDPDTGA
PVGAGTAGIIQIRGPHTLRGMCGRSREELFTVDGFYPTGDLGHLDDAGFL
FYHGRADDMFKVSGATVYPSEVERALRTIDGVDSAVVTNVPGATGDRVGA
AVVCRELTAAQLRAAARNLLSSFKIPTVWLVLRSDDDLPRGGTGKVDVRR
LRELLADADRRQETRVQG
>MAP0569 hypothetical protein
MHDRLTKIQLAIFAVITVITLTVMAVFYLRLPATFGIGTYGVSADFVAGG
GLYKNANVTYRGVAVGRVESVGLSPNGVIAQMRLNSETAIPSNVTATVKS
VSAVGEQYIDLVPPSAPASTKLRNGSRIERQNTRIGQDVAELLRRSETLV
NSLGDTRLRELLHETFIAANGSGPELARLFESARLLVDEANANYPQVSQL
IDQAGPFLEAQIRAGADIKSLSDGLARFTSEVRQADPQLRDTLATAPGAT
DEASTAFSGIRPSFPALAASLANLGRVGVIYHKSIEQLLVVLPALFAAIT
TAAGGGPQDEGAKLDFKLDLNDPPPCAVGFLPPPLMRTPADETVRELPKD
MYCKTAQNDPSTVRGARNYPCQEFPGKRAPTVQLCRDPKGYVPIGRNPWR
GPPVPYDTPVTNGLNVLPPNKFPYIPPDAEPDPGTPIVGPPPPGVVPGPG
PLPNKQPAYAPPPPNDNGPPPPFTSWQPPGVPPVPPQLPYPKWLPPPAPP
EGINPPPASGAGEAVGAAARRAAAGQRPGVRHL
>MAP0350 hypothetical protein
MSYSHPDSMRGQVAIVTGAAQGVGKGIAAALLERGAAVLLVDIQQETLEA
TATELRALGRVERLVTDLRDPDSAPRIAAAAVDAFGSVHGLVNNAIATNE
PKAFVDITTDDLALGYEVGPRATFLLMQAVHPLLVKEGGGAIVNLGSGTG
TGGEPRWGGYAAAKEGIRGLSKVAALEWGRDNIRVNVVCPFAESDGVKLW
KQFAPNDYAKAVGRVPMKRIGDVRTDVGALVAFLLSTDATFITGQTIHVD
GGIGCFR
>MAP4089 hypothetical protein
MLPRMIKTQLVLLTAVAVAAVVVLGWYYLRIPSLAGIGRYTLYAELPQSG
GLYRTANVTYRGITIGKVTGVEPTERGARATMSIEDGYRIPADAAAHVHS
VSAVGEQYVDLVSTAGREPYLADGQTIHKSTVPSQIGPALDAANRGLAVL
PRDKIASLLYETSQAVGGLGPSLRRLVDATQAIAHDFRGSIDDVDDIVER
SAPIIDSQADSADTLGRWAANLNTLAAQTARQDPALRSILTNAAPTAEQV
RATFGGVRESLPQTLASLEVVIDMLKRYHNGVEQALVFLPQSGAIAQSVT
AQSPGQAALGVGAISLNQPPPCLTGFLPASQWRAPADTSTAPLPAGTYCK
IPMDATNVVRGARNYPCVDVPGKRAATPRECRSTEPYVPQGTNPWYGDPD
QILTCPAPSARCDQPVKPGLVIPAPSVDNGLNPLPADRLPGTPPPISDPL
QRPGSGTVVCNGQQPNPCTYTPSALYDVRSGTVVGPDGVVYSVANSATIG
DEGWKTMLGQAR
>MAP2410 hypothetical protein
MAGKLEGRVAFITGAARGQGRAHAVRMAAEGADIIAVDIAGKLPSCVPYD
PASPDDLSETVRLVEAANRRIVAAVVDTRDFDRLRKVVDDGVAALGRLDI
IVANAGVAAPQAWDDITPEDFRDVMDINVTGTWNTVMAGAPRIIEGGRGG
SIILISSAAGMKMQPFMIHYTASKHAVTGLARAFAAELGKHSIRVNSVHP
GPVNTPMGSGDMVTAVGQAMETNPQLSHVLTPFLPDWVAEPEDIADTVCW
LASDESRKVTAAQIPVDQGSTQY
>MAP2198 hypothetical protein
MLDRLLHRSKTSRGALAVVTGAGSGIGAAFALELGKRGGTVVCSDIDQAA
AQRTADAITQHGAKALATRCDVSQFGDVQALAEQSQSCFGAPPTLVINNA
GVGAGGAAIGDAPLDDWQWTLGINLWGPIHGCHVFTPILRDAAPSAAPRG
IINVASAAAFGAAPGMAAYNVSKAGVLSLSETLAAELSGTPVRVTVLCPT
FVKTNILESGRISEESGELAAKLMRWTGFSADKVARICLDAHDRGDLYCM
PQLDAQIGWHIKRLAPQAYTRAAGLVSRINLP
>MAP1455c hypothetical protein
MLDRYGRIDVLVNNVGHWLRHPGGFADTDPQLWDELYRINLHHVFLVTHA
FLPTMIDRGAGAIVNVSSVEGLRGYPEDPVYAAFKAAVIGFTRSLAVQVG
NHGVRVNAVAPDVTESLQVPYSQWLSAEEQSQWPRWVPVGRMGLPEDQAR
VILFLASDCSSFITGHTIPTDGGTTAAGGWFRSSRRPGREWTNRPADP
>MAP3749 hypothetical protein
MGRLAGKVALVTGGGRGQGRSHAVHLADEGADLIVVDIGEDIPSNQYALA
TRADLDDTAKLVEKAGRRVVAAQVDVRDRVGLKALLDEAVTQLGGLHVIV
ANAGICPLGNDIPVQGFVDAFDVDFIGVVNTVHSGLPHLNAGASIIVTGS
VAGLVPQAGGVSGQGGLQGPGGDGYGLAKKVIRDYTRSLALTLGPQQIRV
NAIHPTNVNTEMLHNPAMYQTFRPDLTNPSREDAEVTFPFMQAMPIPYID
PCDVSHAVVYLAADESRYVTGQQLFVDAGASLKLGM
>MAP1995 hypothetical protein
MNDNPFAGPIAKHPRSPLETLDTVPESVLRRLKQYSGRLATEAVSVMQDQ
LPFFADLEASQRASVSLVVQTAVVNFAEWMQDPKSNVRHTAQAFELVPQD
LARRIALRHSVEMVRVTMEFFEEVVPLLARSEEQLTALTVGVLKYSRDLA
FTAATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTT
APATVVVGTPAPGRDGSTGPGDSERASQHVRDIAAQHGRAALTDVHGTWL
VAIVSGQLSPTDKFLGDLLEAFSDGPVVVGPTAPMLTAAYHSASEAISGM
NAVGGWRGAPRPVLARELLPERALMGDASAIVALHTDVMGPLADAGPTLI
ETLDAFLDSGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPMQPRDAYV
LRVAATVGQLNYPTHPPSAAGAAMPAVPLPVNGAARGQSGG
>MAP2356 hypothetical protein
MAEPNEDVEITRGDRFIKTAVAILGETGRTDFTVQEVVARSKTSLRAFYQ
HFSSKDELLLGLFDRTIAQSAQTWRAETAGLDSTAALKLVIDRVSQQPES
STQDSLNRALTLYNQHLAETRPREYARVLSPLHRLIRDIVGQGITEGVFN
AGLDVGAAAAIVMQTMMGAQRLHWLGAELNGTPVDAGQLYDFCSRALGIR
DTDDDSSAPSLAELFGQIGMRPGTRNGQFAMTMPVSPAVVNTSGALQGGL
IATLVDVAGGQFGLDYLRPGTTMTTADLFVRYLRPVRQGSAFAVPRMLRS
GRRAMVMQVDIFGDGDDELLATATVNFAIINGDTPSSGLPPAE
>MAP2604c hypothetical protein
MPAATATPVAVIGMACRLPGGIDSPQRLWEALLRGDDLVGEIPADRWDAD
LYYDPEPGVPGRSVTRWGAFLDDVGGFDCDFFGMTEREATAVDPQHRLLL
ETSWDAIEHAGLDPASLAGSQTGVFVGLTHGDYELLSADCGAAEGPYGFT
GTSNSFASGRVAYTLGLHGPAVTVDTACSSGLMAVHQACGSLGGGESDLT
LAGGVVVTLEPRKSVSGSLQGMLSPTGRCHAFDVNADGFVSGEGCVMLLL
KRLDDARRDGDRILAVLRGTAANQDGRTVNIAAPSETAQVAVYRKALQAA
DIDATTVGLVEAHGTGTPVGDPIEYSSLAAVYGTDGPCVLGSVKTNFGHL
QSASGPLGMMKAILALQHGVVPRNLHFTRLPDEMARINTELFVPQENTQW
PSNGHHPRRAAVSSYGMSGTNVHAILEEAPAPEAAGSAAPEAVGPLLFPL
SSTSAEQLRVTAARLAAWLDEQAGDALAGPTGWGLRDLGYTLTRRRAHRP
VRTVVSASGFAELRTELRAVADGDIPYQPAVGQGDRGPVWVFSGQGSQWS
QMGAELLDKEPVFAATIAAVEPLIAAESGFSVTRALSAPPAVTGIDKVQP
TIFAMQVALAETMKSYGVRPGAVIGHSLGECAAAVVAGGLSLGDGVKVIC
RRSRLMARIAGRGAMASVELPGQQVLSELSIRGISGVALSVVASPTSTVV
GGGAEAIRELVAAWQQQDVMAREVAVDVASHSPQVESILDELVEVLAELE
PTAPEVPYYSATL
>MAP2381 hypothetical protein
MKYQGRVVVVTGAGSGIGRALTQALTAGGAHVAASDIYDNGLAETQASCG
PGQVTPYRIDVADRDAVLGFADEVRRKHGPASMVFNNAGVDLFASVADMS
WEDFDWLMGINVGGVVNGTKAFLPQLIEAGSDRRPSRLVNLSSAFGLIAV
PYQGAYSTSKFAVRGFTEALRQEMIIERHPVTVHCVHPGVVRTNFGANMR
TSDTEDPDLAAQLFDRAALTTPARAARLILRGAEKNRARILIGADGRAMA
ALPRLLGVAYAGLLARAARLTDSRAAHSGR
>MAP1665c hypothetical protein
MLTVDFDRLGIGPSSKVIDVGCGAGRHAFEAYRRGADVVAFDQNEAELRS
VDTVLRAMADSGEAPAGASATVVVGDALKLPYADQTFDCVIASEILEHIP
HDDAAIAELIRVLKVGGTLAVSVPRWLPERVCWLLSDEYHSNEGGHVRIY
RASDLRAKILSGGMTLTHAHHAHALHSPFWWLKCAVGVHNTDHRAVAAYH
KLLVWDLMRRPKITQLAESLLNPLVGKSVAMYFVKQQDAKSQATAGYSIA
SV
>MAP0348c hypothetical protein
MSIIAGYGKGAAMGVLEGKVAIVTGTSRGVGVGIAHELLRAGATVVGCAR
SPLDTIPGIEPDWTERAFQRVCDQGDYRGIDAFVTDVVATHGRLDILVNN
AGGTVPAPHAESIPELVQRIQGSPAADDDYARTALFHAFAVQMNLIGPLW
FAIRTYRQMQTQDRMGCIINISSGAGHPAGSPTLVSYGAAKSGLNHLTRS
LAQEWGPKVRVNCVALGPTMTENFRSFVLPKDDPTGEKYFAAVPLRRAGE
PAEVGRICVFLAGGQADFVNGTTIECDGGMLPGVLYDAGLKTITDLL
>MAP3745 hypothetical protein
MTVRIGALTEPTPGNRTVVVFPHAGASPRFYASWCRLLPAGVDLYGVTYP
GRDMLLDEPVPETLADLACNCAVELKPIIYSSSSVVVFGHSMGSLVAFET
IRELEQDGVSVTALVASGADAPHLETDQSWHRAADEDLIQHLAELDSRSR
DVFAVPELRRMLLPTVREDFRLVESYRAELHPPVSCPIHVMTGDADPEVT
AHRWAQWAVHTRAPWHVRSFAGDHFYIRTHEDDVVRHLCGILTSAPAATQ
>MAP2649c hypothetical protein
MEQLFDDLEDFGAFDDAVSGDVRDPYTELARLRREEPIQRLDTSGMPHEE
SKPVFIVYRHEDAQQMLRDNETFSSAAVIAAFGPVLGERVMLGIDEPVHG
RLRSLVSKAFSQKALARWEDELVGRVGNSLIDRFAGNGKADLVKEFTFDY
PSRIIAGLLGLPEQDYPQFQRWSISLLSWILNPERGLAASAALCDYFAPI
LAARRAEPKDDLISGLAQAEIDGEKLEDEEIYSFLRLLLPAGVETTYRAL
GSLLLALLSDPEQLDAIRGDRSLLPQAIEEGVRWEPPLLTITRVATRDTE
LGGVPIPAGSTVMPMLGAANRQEDRYPDPDRFDIFRAPKSHLGWGHGVHV
CLGMHLARLEMRTAVNLLLDRLPNLRLDPDADDPHIRGQVFRSPTSVPVL
FDPQ
>MAP0671 hypothetical protein
MNDQHRATYTHGHHESVLRSHRRRTAEDSAGYLLAHLTPGLSVLDVGCGP
GTITADLAARVAPGQVTAVDQAADVLDVARAEAEQRNLSNVSFGTADVHR
LDFADDTFDVVHAHQVLQHLSDPVAALREMRRVCRPGGIVAVRDADYAGF
IWYPELPALDLWRDLYRRVARANRGEPDAGRRLLSWARRAGFDDITPTGS
LWCYATPETRDWWGGMWADRILHSTVARDLVSLGLAAREQLEEISAAWRE
WAAAPDGWIAIPHGEIICRASAFSRGTGPESVARRRRRQTGRRPRQSATP
PRRCSSRRWLPARGGRRSRRPRRP
>MAP2076c hypothetical protein
MPRTDNDTWDLSTSVGATATMVAAARAIATNADNPLIEDRFAEPLVRAVG
VDFFTRWVTGDLVAADVDDHDSGWKLEHMPVAMAARTRFFDSFFQAATQA
GIRQAVILASGLDARAYRLAWPAGTTVFEIDQPQVIEFKTATLAKLGATP
QATLRTVAVDLRDDWPKALVEAGFDKGQPTAWIAEGLFGYLPPEAQDRLL
DNITALSADGSRLACEAIPDMSEVDTEKAQEMMRRATAKWREHGFDLEFG
DLGYQGERNDVAEYLDGLGWRSVGVPMSQLLADAGLEAIPQTNDSVSVAD
TIYYSSVLAK
>MAP0759 hypothetical protein
MTQNVPAGRGAIAGRPARPGHAHPPAGRNYLPPLLGLATILIIGLIFAVA
VGLFQGSFTETVPVTVISQRAGLVMNPDAKVKMRGVQVGKVASIESLPNG
QAAIHLAMDPSQMHFIPSNVLVDIASSTVFGAKSIQLVEPAQPSAQRLRA
GQTLQGQHVMVEINTVFQELVSVLSHIDPPKLNESLGALAQAFSGRGPQL
GQSLSDLDSFLARLEPSLSAFRHDLSVLPTVSNAYADAAPDLVKTAANAT
RISKTLVDEQHNLDALLISAIGLADIGNDVLSTNRQPLTNVLHLLVPTTD
LTNEYGPALTCSFGGLITIAHGPPLSEPSINISASLTWGGERYRYPTNLP
KVAATGGPKCMGLPTLPFNTNPPQFITDIGANPVGYGNPQLLINSDLLKQ
LLYGPIAGPPRNSAQVGMPG
>MAP0079 hypothetical protein
MAALQDKVALVTGASSGLGAETAKLFSSQGATVFGIGRDTVRLAEVFAGI
ERGGFASVDIASPQACTEAVAQCVRDFGGLDVLVNVAGIHRMRRTESMTD
EDWEHDLAVNLNGPFFLCRAALPELLERGGNIVNVSSIAGVEGQAYSAGY
CAAKHGLIGLTRALAVEYTADRLRVNAVCPGGMLTPQIERFSAPDDPNYD
LILRTAAPRGMMRPLDVANVIAFLASDAAAAVHGAVYRVDNGKGAG
>MAP1979 hypothetical protein
MTEAGGFVDQTVTGVAEPQPMYKALRESNPVFRSTQAVVLSRLADIEMAL
KHTELFSSNMDAVDLGNVRPLIPLQIDPPDHAKYRRILDPLFTPREMARR
EPLVTELVNEMIDRFAPRGECDFHAEFAVPLPCTVFLQLLGLPLEDLDRF
LLWKDGVIRPAGDSGFDRRHESSAGVAQQIYEYFDKAIDEHIAVPRDDVL
SAMIAADVGGQPLSREELLDICFLFLIAGLDTVTDSLDCFFVYLARHPQH
RRQLVERPDVLPGAVEELLRWETPVPGVARVATQDVEVGGCPISKGERVS
PLLGAANTDPAEFPDPEIVDFTRSPNRHRAFGGGPHRCLGSHLARMELRV
ALREFHRRIPDYEIRPGTQLTYTAALRSVESLPLVFPVR
>MAP4190c hypothetical protein
MSEGRTDGDTWGPAQSVGATATMVAAARAVASQGPDALLDDPLAEPLVRA
VGLDPFIRIVEGKLDFPDDPLFNRRARAEQITVRTRFFDDFFIDATEAGL
RQAVILASGLDTRAYRLTWPAGTVVYEIDQPQVIAFKTDTLANLGAAPTA
ERRTISIDLRDDWPAALREGGFDVTRPTAWSAEGLLPYLPPEAQDRLFDN
ITALSAPGSRLATEHVPDPNAFSDERLARISERWQRLGLNLNAADLFYRG
ERNVVADYLTGKGWRVTPHPARQLYARNGFEFPEDEMRATFGEMSYVDAT
LTGGRG
>MAP1373c hypothetical protein
MRYRPGEALLALYRRRGPVIDAGAGRHGYTLLLGAEANKFVFANADAFSW
RATFENLALVDGPTALIVSDGDDHRRRRSVVAPGLRHRQIQDYVTTMVSC
IDRVIDGWRPGQRLDVYQHCRAAVRRSTAESLFGPRLAVHSDALGEHLQP
LLDLTHQPPQLVGLQRRINAPAWRRAMAARQRINNLVDTLIADARAAPNP
NDHMLTMLIDGRGDEGYTLSDNEIRDAIVSLVTAGYETTSGALAWAVYLL
LSQPGAWATAAGEVRRVLAGLPPAAADLSGLTYLNGVVHETLRLYPPGVI
SARRVMRDLRFKGRRIRSGRLLIFSPYVTHRLHEIWPEPRRFAPERWNPD
APGYRRAAPHEFIPFSAGLHRCVGAAMATTEMTVMLARLLARTRLRLPAQ
RLRAANVAALRPTPGLTVEVIDSVPAQ
>MAP1856 hypothetical protein
MIRAVAGFANLVVRAVRTGHRQQVWLSVAGLVLILVVATAYLLIGALRVT
PFASSYRVTVQLPESGGLLPNQDVALRGVRIGRIESLQITDNGVNAVATI
TSKVRIPANTVAHVSALSPAGEQFINFEAASDAGPYLHDGSFITSERTTV
PVSLAQLLGDADGLLAQVDPRKIELIKKELSLSKEGPAKLTAIVDGGLFL
LSTLDSVLPETTSIIKTSRVVLNLASDKNNGLGAAATELNHTLTGVARMQ
AGYRRLTSQTPQTLSAVDNLFADNSDTMVQLLGSMATMSQLLYLRVPALN
ALFPDYRGSVLDAVTSVFHDHGVWATADLYPRYVCDYGTPAHASSAADYY
EPFMYTYCRDDDPAVSIRGAKNAPRPGGDDTAGPPPGADLGRRTDPTPRG
RYTIPTPYGGPQLPIEPPH
>MAP3109 hypothetical protein
MTATISTPQYLLDQARRRFTPTLNTIPGMGAIEKRLLAHEWQTKVLAEPP
AGSGLKPVLGDAGLPILGHIIELFRGGPDYALFLYRNHGPLIYLDSPIMP
AVTALGPDATQAVFSNRNKDYSQKGWHPVIGPFFNRGLMMLDFDEHMYHR
RIMQEAFTRSRLTGYVEHIDRVATAIVADWPTNDARFLFHPAMKELTLDI
ASLVFMGHEPGTDHDLVTTVNQAFTTTTRAGGAIIRQPIPPFKWWRGLRA
RQLLEDYFSERVKERRNATGNDMLTVLCHTEDDDGNSFTDDDIVNHMIFL
MMAAHDTSTSTTTTMVYNMAAHPEWQERAREESARLGDGPLDIEALEKLE
TLELIMNESLRMVTPLPFNMRQAVRDTELLGHYIPAGTNITIWPGMNHRL
PELWTDPDKFDPERFAEPRAEHKKHRYAFAPFGGGAHKCIGMVFGQLEVK
TVVHRLLRRYRLELARPGYQPRWDYGGMPIPMDGMPIVLRPL
>MAP2885c hypothetical protein
MTVTSTITADDGVTLAVHRYTDIDPARPTILAIHGFPDNHHVWDGVADEL
TGAPYNFVAYDVRGAGESSCPATRSGYGFAQLVRDMGAVIDSLGVGRVHL
LAHDWGSIQAWAAVTDKAVTGESVMDKVASFTSISGPHLNYAGTFLHSAR
TPRAVARVVRQLIASGYIGFFLCPGAAELSFRAGIGVKVIAALERIGHTS
TRSQRRDTRRSLRDYLNGLQLYRQNMPAPMLAPGPRLPETTVPVQTLVPR
RDVFVTPALQRFTGAIPQRARVIEIEGGHWVVTSRPDVIARLTTEWVDLQ
SAGAVAPQLTGERRELPGERREVRGRLALVTGAGAGIGRATAVELARQGA
RKVVTVDRDRAAADQTADAVRAAGAGAAVYQVDVSDEAAMNDLAAQVRNE
HGVVDILVNNAGIGMAGRFLETSPANWDAIMGVNVRGVIWGSRAFGTQMV
ERGQGGTIINVASAAAYLPSKSMVAYGTTKAAVLALSESLRADFADEGIT
VTAVCPGFVNTNIAKSTIYAGMSAQQQEWARDRADTAYRRRNFTPEATAK
AIVKAIKTGPAVVPIAAESRIGYALRRISPGAIRLLARWDIRQT
>MAP2774c hypothetical protein
MWCSVRPCDDGHTMTDLQGKVAVITGGAGGIGRALGRRLGHEGMKVVLAD
VLADPLQEATRALADEGIEAAGVVTDVTDYSSVEALAKEALRRFGAVDVV
CNNAGTGAVSEGYLWEHDLADWRWGIDVNVLGVIHGLKAFVPILLERGEG
HVVNTCSGNGGFAPIARGAMGGPATAVYPMTKAAVLCLTESLYTHLEMTG
TRVRAHVLFPGGFLNTGIWESWRHRPPRYAPTQQRRTPEQTLDKVVARFE
AAGARVEFTPLETVADLVVDGIAADRFWMMGPPAPSDDVVRRKAASILSR
GAPDYLVDILGRSAGGNSENQGENQ
>MAP0704 hypothetical protein
MKNDDTARNNTMSDTLTSTATEQTADIPDYPMPRQAGCPFAPPPDVMALA
HDKPLSRVRIWDGSTPWLITGYEQVRELFSDSRVSVDDRLPGFPHWNAGM
LSTVHKRPRSVFTADGEEHTRFRRMLSKPFTFKRVEGLRPTIQQITDEHI
DAILAGPKPADIVSALALPVPSLVISQLLGVPYEDADMFQHHANVGLARY
ATGEDTVKGAMSLNKYLAQLVEAKMENPAEDAVSDLAERVKAGELSVKEA
AQLGTGLLIAGHETTSNMIGLGVLALLENPDQLAVIRDAEDPKVIASAVE
ELLRYLSIIQNGQRRVALEDIHIAGETIRAGEGIIIDLAPANWDARVFPE
PDRLYLHRSGADRNVAFGYGRHQCVGQQLARAELQIVFHSLVRRIPTLQL
AIPIEEVPFKDDRLAYGVYELPVTW
>MAP0758 hypothetical protein
MALRAAYPRLTRQLERPVALLAGIGDHALFYGKALAGMPFAATRYTREVV
RLVAEISMGAGTLAMIGGTVVIVGFLTLAAGGTLAIQGYTSLGNIGIEAL
TGFLAAFINVRIVAPVVAGIGLAATFGAGVTAQLGAMRINEEVDALESMA
IRPVAYLVSTRILAGMLAITPLYSIAVILSFVASQFTTTFLLGQSQGLYQ
HYFNTFLNPIDLLWSFLQAILMALTILLIHTYYGYFASGGPAGVGNATGN
AVRTSLIVVVSVTLLVSLSIYGTNGNFNLSG
>MAP1480 hypothetical protein
MRMGEADRHRWDERYAANGPPPLSSVAPPGVFARHADVFPAAGRALDLAC
GQGTAAVWLALRGLRVLGLDVSPVAIGQARDLARRAGVGERCRFDVADLD
RGLPVGPPADVIVCCKFRDRRLDRAIVERLAPGGLLAIAVLSEVGAGPGP
FRAAPGELTAAFADLQPIAAGEADGQAWLLARPSQSEPVVERR
>MAP3062 hypothetical protein
MSALVPDVPPHPPAGTAPAGDAVMMLTGERTIPGLDIENYWFRRHEVVYQ
RLARHCAGRDVLEAGCGEGYGADMIAGVARRVIAVDYDEAAVAHVRGRYP
RVDVMQANLAQLPLPDSSVDVVVNFQVIEHLWDQTQFVVECARVLRPSGL
LLMSTPNRITFSPGRDTPINPFHTRELNAVELTELLVGGGFRDVSISGLF
HGPRLREMDARHGGSIIDAQIARTVADDPWPPQLAADVAAVTVDDFELVP
SGADAASGHHIDDSLDLIAIAVAP
>MAP2650c hypothetical protein
MAEPRSVVITGASRGLGFASALRMYREGWRVVAAMRTPDQGMPLLRRAIR
DGTGQDPDDDRLIGVQLDLTDSASVAAAAKAIEEAVGAPSAIVHNAGISA
AGMVEETDTALWQRMFATSVLGPVALTQALLPSMRAAGRGRIVLVSSAAG
VRGQPGTAPYSAAKGALERWGESLAGEIAPFGLGVTVLVAGTYDTEIITD
AGTTDDRNFSGPYARLHTTMNTRGRLAVSFARRPERFTDGLYKALDDRAP
FRRRGVGPDASVLLAANRILPASGMHHLSRVVLGIPRQGSMRDGAWPLTH
GQRAMVLVARVLPQPVLQRLAALAGRFSSPKSAARQGDQEG
>MAP2981c hypothetical protein
MRVLAIYVSVPAARDLEDAMPYDVIIRDGLWFDGTGGAALTRTLGIRDGV
LVDVAESLDEAGCPEVIDAAGKWVLPGFIDVHTHYDAEVLLDPGLRESVR
HGVTTVLLGNCSLSTVYADSQDAADLFSRVEAVPREYVYGALISNRTWST
AADYVKAVDALPLGPNVGSLLGHSDLRAAVLGLDRATDPAVRPTGDELAK
MAALLDEALDAGLLGMSGMDAAIEKLDGERFRSRALPSTFATWRERRKLI
EVLRRRGRILQSAPDLDNPALALMFFLTSSRIFGRGRGVRMSMLVSADAK
SMPLAVHTFGLGTRILNKLLRSSVRFQHLPVPFELYSDGIDLPVFEEFGA
GTAALHLRDQLQRNELLADTDYRRRFRREFDRVKLGPSLWHRDFYDAVIV
ECPDKSLIGKSFGAIADERGLHPLDAFLDVLVENGERNVRWTTTVANHRP
KQLDKLAADPSIHMGFSDAGAHLRNMAFYNFALRMLKRTRDAQAAGAPFL
TIERAVHRLTGELADWFGIDAGTLRPGDRADFVVIDPAGLDESVDAYHEE
AVPFYGGLRRMVNRNDDAVIATGVGGVVVFGGSHRKGEFRDGYGHTVKSG
RYLRAGQRVRARVGAPRVTA
>MAP0256 hypothetical protein
MPRTDNDSWDITQSVGATALGVAAARAAETESENPLISDPFARIFVEAAG
KGMWSIYADPALLTKADDLEPDLRGRLQLMIDFMATRTAFFDEFFLAAAD
AGVRQVVILAAGLDARSWRLPWPDGTVVYELDQPKVLDFKSTTLREHGAQ
PKAELVNVPIDLRQDWPKALQEAGFDASRPAVWSAEGLVRYLPAQAQDLL
FERIDALSAPGSRLASNVPDSGFTDPDRLARQREDMRRMRAAAAKLVDAE
ITDFDDLWYPEERTPVDSWLRERGWDVSTATFAELMARYGRSIPQGAQDS
MPPTLYVSARRRAG
>MAP0668 hypothetical protein
MTDLAQVDYFTDADVAQDPYDYWDYLREQGPVFREPHYGVVAVTGYQEVQ
AAFKDVESFSAVNAIGGPFPPLPFTPEGDDISELIEAHRHEFPIFEHMVV
MDPPEHDKARSLLGRLLTPRRLQENKDYIWQLADRQFDEFIANGHCEFLS
EYAKPFATLAIADLLGVPDEDRPQIRRNLGAGNAPGARVGALDHEPVGSN
PLQYLDDLFSGYIADRRERPRDDVLTGLATATYPDGSTPPLLEVVRPATF
LFAAGQETVTKLLSAAVQVLGDQPELQARLRADRGLIGPFIEEALRMQSP
TKVDFRLARKTTTLGGVHIPAGTVIMLCLGAANRDPRKFENPNEFRIDRK
NVREHIAFGRGIHTCAGAPLARVEGQITINRLLDRTSELRINKAKHGPAS
SRQYRYESTFLLRGLTELHIEFTRAG
>MAP0612 hypothetical protein
MTLAGTLSQIDFTDLDNFANGFPHHLFAVHRREAPVYWHEPTDNTPDGEG
FWSVASYAETLEVLKDPATYSSVTGGERPYGGTLLQDLAIAGQVLNMMDD
PRHSQIRRLVSSGLTPRMIRLVEDDLRARARRLLDAVVPGEPFDFLVDIA
AELPMQMICILLGVPESERHWLFQAIEPQFDFGGSRKAALSQLSEAEAGS
RMYEYGQQLIAAKRAEPTDDMLSVVANATLDDAAAPALSDLELYLFFSLL
FSAGAETTRNAVAGGLLALAEHPEQLRWLRDDLGALPTAVEEMVRWTSPS
PSKRRTATRDATLGGQSIKAGQKVQIWEGSANRDASVFDRADEFDVTRKP
NPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGAVRVVRPVEWTRSNR
HTGIRHLVVELRAEQ
>MAP4068c hypothetical protein
MTVGAAPPSVFDSDLPTLHYHSDETPAQVYPRLREAQRRAAVAIGPHGPE
VLSYHLVRSVLRDPRFQIPPGINLLAQGIDSGPLWDKVANSLLCLEGDAH
HRLRSLCSKAFTPRTVARLHDTMAAVMNELVDRVAAAGRCDVVTDIARPY
PVPIICALLGAPREDWRRFSSWADDVFKAFSFTVDLREVEPVVMRAWREL
DDYVDEMVARRRHNLTDDLLSDLIRVGDEGDRLDAAELRMLAGGLLLAGT
DTTRNQVAASVQVLCEHPDQWELLRQRPELAMRAVEETMRHSPIACGTLR
LVVEDAELDGHLFPAGTAVLVNTFAANRDPVVYNDPDRVDITREAAPPIL
TFGGGVHYCLGANLARREIAEALNVLANRLRNPRLAGPAPWKPMVSLSGP
TSLPIEFDR
>MAP1673c hypothetical protein
MQDPPNRSFDPSLLRDAVIRRIYRRTSAEGTIRVPAVPAMIDEYLQLCGN
VCATLGVWYAPEQFAQLRSALEVELAKAFKAFPRSDVLISYHAPFGTGVN
FHVQADWRTIEADYEQWVAVRPPPLFGTEPDARVLTLAAEADDPATYRVL
DVGAGTGRNALALARRGHPVDAVEMTAKFAEAMRADAERESLGVNVIQSD
VFTAMENTRDRYQLMVLSEVVPDFRTAHELRGMFELAAECLAPGGRLVFN
TFLAHDGYVPDAAAVQFSQQCNSMVFTRDEVMGAAGGLPLALVADDSAYE
YEKTHSPAESWPPTGWFEGWAGGLDVFDVERDDSPIELRWIVFAKTRAG
>MAP0711c hypothetical protein
MTGRVEGKVAFVTGAARGQGRSHAVRLAQEGADIIAVDICKPIRAGVVDT
AIPASTPEDLAETADLVKGHNRRIVTAEVDVRDYDALKAAVDSGVEQLGR
LDIIVANAGIGNGGDTLDKTSEEDWTEMIDINLAGVWKTVKAGVPHMIAG
GRGGSIILTSSVGGLKAYPHTGHYVAAKHGVVGLMRAFGVDLGQHMIRVN
SVHPTHVKTPMLHNEGTFKMFRPDLENPGPDDMAPICQMFHTLPIPWVEP
IDISNAVLFFASDEARYITGVTLPIDAGSCLK
>MAP1469c hypothetical protein
MTQPSTDADTDIPDFPMTRAPGCPFAPPPKVLQLNADKQLSRVRIWDGST
PWLVHGYQAIRALFADARTSVDDRLPGYPHWNEGMLATVHKRPRSVFTSD
AEEHTRFRRMLSKPFTFKRVEALRPAVQKITDDHIDALLRGPNPGDIVST
VSLPVPSLVISELLGVPYEDAEFFQTQAQRGMGRYATEEDTAQGAASLAK
YLANLVRAKMQSPSEDLVSDLAERVNAEEISVREAAQLATGVLIAGHETT
ANMISLSVAALLEHPDQRALLCDTDDPKVIATAVEELMRYLSIIQTGQRR
IAIEDIEIGGETIRAGEGIILDVAPANWDARQFPNPDRLDLRREDGPHVG
FGYGRHQCVGQQLARMELQIVLPTLLRRVPTLRLAAPLDELPFKHDALAY
GLYELPVTW
>MAP0562 hypothetical protein
MIEQLAVPARAVGGFFEMMIDTGRAAFRRPFQFGEFLDQTWMIARVSLVP
TLLVSIPFTVLVAFTLNILLREIGAADLSGAGTAFGTITQLGPVVTVLVV
AGAGATAICADLGARTIREEIDAMRVLGIDPIHRLVVPRVLASTVVALLL
NGLVCAIGLSGGYVFSVFLQGVNPGAFINGLTVLTGLRELVLAEVKALLF
GVMAGMVGCYRGLTVKGGPKGVGNAVNETVVYAFICLFVINVVMTAIGVR
ISAK
>MAP3330 hypothetical protein
MRRVAAVKGISMGLRGLADKVAVVVGGATGIGAATAARLAGEGCRVVIGD
VAVDAARQTADRIAAAGGTATQVAFDLADPASVATLIDCAATTYGGVDLL
FNVGADMSTIRADTDVVDIDFDVWDRVMTVSLRGYVAAMKYAIPRMLDRG
GGAIVNMSSAAAFQGEPARPAYATAKAGIGALTRHVASRWGKDNIRCNAV
APGFTATETIRSVPQWPELEAAALKRIRGPRVGDPADVATLVAFLLSAEG
DWINGQVINIDGGTVLR
>MAP3602 hypothetical protein
MTSTSTSRRGPYLVGYLRDQLETPLTLVGGFFRMCVLTGKALFRWPFQWR
EFVLQCWFIMRVAFLPTIMVSIPLTVLLIFTLNVLLAQFGAADLSGAGAA
IGAVTQLGPLTTVLVVAGAGSTAICADLGARTIREEIDAMEVLGIDPIHR
LVVPRVIAATLVATLLNGLVITVGLVGGYLFGVYLQNVSGGAYLATLTTI
TGLPEVVIATVKAAVFGLIAGLVGCFRGLTVRGGSKGLGTAVNETVVLCV
VALYAVNVVLTTIGVRFGTGH
>MAP2928c hypothetical protein
MMDLTQRLAGRVAVITGAGGGIGLAAARRMHAEGATIVVADIDADAGAAA
ADELSGLFVPTDVADEDAVNALFDTAARRYGRIDIAFNNAGISPPDDDVI
ENTEPPAWQRVQDVNLKSVYLCCRAALRHMVSARRGSIINTASFVAVMGS
ATSQISYTAAKGGVLALSRELGVQFARQGIRVNALCPGPVNTPLLQELFA
KDPQRAARRLVHVPVGRFAEPGEIAAAAAFLASDDASFITASTFLVDGGI
SAAYVTPL
>MAP1869c hypothetical protein
MAKNYADRIQGVDPTGPYNLLGWSFGGVVAHEIAIELQRRGCPVRHLILL
DALLVIPSSDVLNSDVLVEQALNEVLRFCGIQVPEGDEPLTYERAEKLLR
ERGIEELPRYKPLLDSIVRNDNTNLALARAHEPGVFDGDVKLFAAAREKG
DQNSSSLRSWRPYVTGEISSYPVDCTHQQMMSTESLIQYGQQLKVVLEGG
G
>MAP3504c hypothetical protein
MRSTLADAARAAAQRTPERIALVDNEVRLNCATLYAQAGELAAALLARIP
AGSVVSFMLPNWHEAAVIYLAATLAGMVVNPILPSLRDHDLRFILEDACA
AMVFVPHRYGGHDYPAMLDRVTAAMSAAPQVVVLRGPATGRTGRHTPYRC
MLGGPPGGLPALDPDAVRMILYTSGTTSRPKGVLHTHNSIHALICQIRDH
WAIDPGDTFLVPSPLAHIGGSIYAFECPLLLGTTAVLMDRWDPARAVALM
TAHRCTHMAGATPFLQQLLSAAANARTRLPDLKVFICGGASVSPSLIRRA
AAYFDRAVVTRVYGCTEVPVATVGAPRPQEADYAADTDGRPGIAEIKLAA
HPAAPTGDGEICVRGPQMLRGYRHPEDDAESFDAAGFFRTGDLGRWALAD
SAGRYLVVTGRAKDVIIRSGENISAKEVEDLLADHPGIAEIAVVGLPDER
TGERACAVIVPTPGASPDVAGLLALLVSKGVAKFKAPEQVVLWDALPKND
AGKVLKHRIRAALSKDG
>MAP2191 hypothetical protein
MRGALMLKYRGANLIRPGFIGTVLILLVVAVGLAPDRLVTWATTIRYEAL
FSEAGGLLPGNKVLVSGVNVGTVSRVSLDDDGNALVDFGVNGKVRLGSLS
TAHIRTGSLLGARVLTLESAGAGLLRPSEVIPLSRTSSPYSLTEAVSDMT
TNLSTTSTDTLNQSLDALSATIDQIAPQLGPTFDGVTRLSRAINSRGESL
DALLKSATDVTGILSEHSQGVNSLILDGNVLLETLVRRRDAISELLANVS
AVSKQLTGLVADNESKLGPTLDRLNSVAAMLEKNRDNLSKALPGLKKFEI
TSGESVSNGFYYNAFVPNLAIPELIQPFFDYYFGFRRNDPNMPRALFPWP
HNGIPGGSR
>MAP2370c hypothetical protein
MTRRTIVITGASDGIGAAAARRLSRAGDRIVVVGRSQTKTAAVAAELGAD
HFVVDYADLSQVRALADKMRAQYPRIDVLLNNAGGVASRIELTADGYERT
YQVNYLAPFLLTTQLLDVLLESRATVVNTTSSSHKLILRATVDDLENTAN
RRPAVAYAYSKLAIVLFTKELHRRYHARGLSVAAVHPGNVNSNIGIASGS
RFLVFMQRYTPAALFISSPDQGADPLVRLASSPPDSEWTSGAYYAKRKIG
KTTRLADDPRLAAELWERTAARLG
>MAP2371c hypothetical protein
MSTTTMDEAAKLLADPMAYTDEQRLHAALTHLRANAPVSWVEVPNYKPFW
AITKHADVMDIERENMLFTNWPRPVLTTAEGDEMQAAAGVRTLIHMDDPQ
HRVVRAIGSDWFRPKAMRALKVRVDELAKIYVDKMLAAGPECDFVQEVAV
NYPLYVIMSLLGLPEADFPRMLKLTQELFGSDDSEFKRGSSNEDQLPALL
DMFGYFNGVTAARREHPTEDLASAIANARVDGEPLSDIDTVSYYLIVATA
GHDTTSATISGGLQALIENPDQLQRLRDNLDLMPLATEEMIRWVTPVKEF
MRTAAKDTVVRGVPIAAGESVLLSYVSANRDEDVFDEPFRFDVGRDPNKH
LAFGYGVHFCMGAALARMEVNSFFTELLPRLKSIELTGDPELVATTFVGG
LKHLPVRYSLA
>MAP1660 hypothetical protein
MSGPGHTITHVSDTARWTALHRATESARPDAVFRDPLAERLAGDHGRAIV
DHVPRTTRNGWWLVARTKIIDDAIAEAIAQGCDRVLNLAAGLDTRPYRLN
LPPDLAWVEADLPALLAEKTQVLADEVPRCRLTRTAVDLADAAARDAFLN
EALDGATKALVLTEGLLMYLDDSDVSALSAALQRPEVRWWMLDFAGPGLK
KMMNKKMAGMLANAPFKFAPDNGLAYFERLGWRTVQVEALYSAARRLRRL
PLVMRPLGWLPQPDPRRPGRRAWSAAALLTH
>MAP0301 hypothetical protein
MGTMSSDQVMDWDSAYREQAHFEGPPPWNIGEPQPELAALIEQGKFRSDV
LDAGCGFAELSLALAARGYTVVGIDLTPTAVAAATRAAAERGLTTASFVQ
ADITSFTGYDGRFATVVDSTLFHSLPVEGRDGYLRSIHRAAAAGASLFIL
VFAKGAFPAEMQPKPNEVDEDELRAAVGKYWAIDDIRPSFIVSNVPQIAD
APFEFPAHERDERGRMKMPAYLLTAHKAQ
>MAP2634c hypothetical protein
MHTGMTTVVVDNPFFARIWPVVATHETAAVRALRRENVAGLTGRVLEVGA
GIGTNFPLYPETVDEVIAVEPEPRLADRARAAAQVVPVRVVVTGETAEAV
GGDEPFDAVVCSLVLCSVRDPQGVLRRLYSLLRPGGQLRYLEHIASACAR
GRFQRFVDATLWPRLFGNCHTHRDTERSIVEAGFDVDASRREWTLPAWSP
MPVSELLLGRAHRPA
>MAP0600c hypothetical protein
MTVHVGDHELVLDPYDYDFHEDPYPYYKRLRDEAPLYRNDELKFWALSRH
QDVLQGFRNSTTLSNKYGVSLDPASRGPHASKTMSFLAMDDPAHLRLRTL
VSKGFTPRRIRELEPRVTEIATQHLDTMLDKAGSAAGGAVDYVDEFAGKL
PMDVISELMGVPQADRVQVRAWADGVMHREEGVTDVPPEAVEASLNLIVY
YQGMVEERRKKPTGDLTSALLEAEIDGDRLTDDEVLGFMFLMVIAGNETT
TKLLANAAFWGHKNPDQLTPVYDDLSRVPLWVEETLRYDTSSQILARTVS
GPLTLYDTTIPEGDVLLLLPGSGHRDERVFDNPDDYLIGREIGPKLLSFG
SGAHFCLGAHLARMEARVALTELFKRIRGYEVDEANAVRVHSSNVRGFAH
LPMSVEVR
>MAP0890 hypothetical protein
MLPEMTRQKILITGASSGLGAGMARAFAAKGRDLALCARRADRLDELKAE
LSQRYPAITVAVAALDVNDHEQVPKVFAELSDELGGIDRVIVNAGIGKGA
KLGSGKLWANKATIETNLVSALVQIETALEMFHKNGSGHLVLISSVLGNK
GVPGVKAAYAASKAGLSSLGESLRAEYAKGPIKVSVLEPGYIESEMTAKS
NSTMLMVDNETGVRALVAAMEREPGRAAVPWWPWAPLVQLMRVLPPRLTK
MFA
>MAP4082 hypothetical protein
MSSQAVIANYLRGQIQPGIDAVGGFFRTCVLTGKALFRRPFQWRETIEQG
WFITSVSLLPTLAVAIPLTVLIIFTLNILLAEFGAADISGAGAALGAVTQ
LGPLTTVLVIAGAGSTAICADLGARTIREEIDALEVLGIDPIHRLVVPRV
VAATIVATLLNGAVITIGLVGGFVFGVFIQHISAGAYVGTLTLITGLPEV
VISVVKSATFGTIAGLVGCYRGLTTKGGPKGVGTAVNETLVLCVIALFAV
NVVLTTIGVRFGTGR
>MAP3659 hypothetical protein
MSISLLLEMAASSNAERTAVVSQDVRLTTQELSDLADGGAAVVAGSNAKH
LVYVGTGGVLLPVLIFAAARAGVAFTPINYRLSAEGIRALIERLPEPLVV
VDARYRDMVGEAPGGVMDSDDFLAAARAAEPAADAFADPDSVAIVLFTSG
TTSQPKAVELSHNNLTSYITGTVEFGAADETDAALICVPPYHIAGVSAAL
SNLYAGRKMVYLPNFDAREWVRLINAENVTTATVVPTMLDRIVTVLENGD
PDTGAPIELPSLRNLAYGGSKVGLPLVRRALELLPHVGFVNAYGLTETSS
TIAVLTPDDHRAALAATTPEAARRLGSVGQAVAGIELQIRDEAGNVLPPG
ETGELFVRGEQVSGRYTGIGSVLDEDGWFPTRDIAMLDDEGYLFIGGRSD
DTIIRGGENIAPAELEEVLVEHPHVRDVAVVGVEDPQWGQAIVAVVVPAA
GVDPDPEELREFVRKSLRGSRTPDEVVFRDELPTTATGKVLRREIIATLA
GLRAAPAQQ
>MAP2033 hypothetical protein
MRVRLGARWMAMHGLPRAYFAVQARRGDPLARLLRSGTTGEDRYALMEQI
RARGPLMRAPFVWASVDHALCRQVLRDKRFGVTSPTEMELPRPVRALIAR
TDPGVANPVEPPAMVIVDPPDHTRYRQLVAQSFTPRAIEALNTRVAQVTL
ELIERIATIPQPDLIADFATRLPVAIIAEILGMPPDSYPRMLAWGRSGSP
LLDLGIDWRTYRDAIDGLRGVDEYLLAHFHQLRADPHSDNPFGRMAADGS
LTDRELTANAALIVGAGFETTVNLIGNGIVLLLRHPEQLALLHDNPDLWP
SAVEEILRIASPVQMTARTPACDVDIAGAHIGAGEMVGLFLGGANRDPKV
FSDPTTFDVTRPNAREHLAFASGIHACLGAALARIEGATALRALFENFPD
LRLTAAPQRRSLINLHGYTRLPAQLGGRRTTSATIPV
>MAP1870c hypothetical protein
MPESVTVSPQELHGMLVAEQVSVLTQTPSAVAALPADGLESVALVVVGEA
CPVEVVDRWAPGRVMVNAYGPTETTMCVAISAPLKPGSGVPPIGAPVSTA
ALFVLDRWLRPAPPGVVGELYVAGAGVAVGYTNRAGLTGSRFVACPFGAP
GTRMYRTGDLVRWRADGQLDYLGRADEQVKIRGYRIELGEIRSALVGLDG
VEQAAVIAREDRPGDKRLVAYVTESATGTADPAEIRARLAQRLPEYMVPA
AVVVLETLPLTANGKLDTRALPAPGYQNSDHRAPSSPVEEILAGVFADVL
GLDRVGVDDSFFDLGGDSLVATRLIAAIETTLNADLSVRAVFEAPTVSQL
ALCVGSDRGRREPLVAVERLAVVPLSFAQQRLWFIDQLQGPSPIYNMAAA
LRLTGRLDADALGTALADVVARQETLRTLFPAVDGIPEQVVIPAERADFG
WQVVDATGWPTDRLQQAIEATVRHSFDLATEIPLRARLFRIADDEHVLVA
VLHHIAADGLSMAPLVADLGMAYASRCAGHAPGWAPLPVQYADYSLWQRA
LLGDVADAESPMAAQLAYWEQQLAGLPERLALPTDRPYPPVADYRGASVM
VEWPTELQQRVRTVAREHNATSFMVIQAALAVLLAKLSASRDLATGFAIA
GRREPALDELVGFFVNTLVLRVDLAGDPSFTESLAQVRARSLAAYDHQDV
PFEVLVERLNPTRSLAHHPLVQVVLAWQNFAREDGVPAGLALGDLQVTPL
AADTQVARMDLTFTLGERWTSAGEPAGIGGSVEFRTDVFDAERIPTLIQR
LERVLAAMTADPGQRLSSVDLMDTDEHIYLDEIGNREALTQPAARVSIPA
MFADQVIRAPQAVATRCAGHSMTYRKLDEASNRLAHLLIEAGAGPGESVA
LLFNRRAEAVVAVLAVLKTGAAYLPIDPAHPTARIEFMVADAAPIAAITT
TELAERLDGCGLPIIDIADPRIDSYPHTALPVPDPDDIAYLIYTSGTTGV
PKGVAITHNNVTELLGSLAPDLARPGQVWSQWHSYSFDISGWEIYGALLH
GGRLVVVPEEVAASPDDLHALLIDEKVTVLCQTPSAAGTLSPQGLESVTL
LVGGEACPSELVERWGPGRVMINEYGPTETTMWVALSAPLTAGSTGSDAV
PIGSPVPGAAFFVLDQWLRPVPAGVVGELYVAGTGVGVGYVRRAGLTASR
FVACPFGESGTRMYRTGDLVRWGADGQLRYLGRADEQVKIRGYRIELGEI
RSALAGLDGIEQAAVIAREDRPGDKRLVGYVTESVTGAADPADIRARLGQ
RLPAYMVPAAVVVLDALPLTVNGKLNARALPAPEYIERERYCAPATPTEE
TLAGIYAQVLGLERVGVDDSFFDLGGDSLSAMRVIAAINKTLDAGLAVRT
LFHAPSVRGLCRQLGQGADEVENQVEIIPRRVPQGGHRRSAVLHSSRERA
ELAVSRSR
>MAP3030c hypothetical protein
MIAREIAEHPFGTPNFTGRSWPLADVRLLAPILASKVVCVGKNYADHIAE
MGEATGPVPADPVIFLKPNTAIIGPNVPIRLPANASPVHFEGELAVVIGR
PCKDVSAAQAAENILGYTIGNDVSARDQQRSDGQWTRAKGHDTFCPVGPW
IVTDLNPLDPGDLELRTEVNGEVKQRSRTSLLIHDIGSIVEWISAVMTLL
PGDLILTGTPAGVGPIEDGDTVSITIEGIGTLTNPVVRKGKS
>MAP4086 hypothetical protein
MHAAMRTLTEFNRTRVGLMGALVTVLVVGVGQSFTSVPMLFATPTYYAQF
TDAGGINVGDKVQIAGVNVGLVSSLVIRGDRVLVGFSMPGKTIGAQSRSA
IRTDTFLGRKNMAIQPRGADPLPPHGIIPSGQTTTPYQIYDAFVDVTKAA
TDWNIDVVKRSLNVLSETFNQTAPHLKAALDGVARFSGTIGNRDEQIKQL
LTNADKITRVLGDRSGQVDALLVNANTLLAAFRQRSQALSALLSNVSAVS
TQVSGFIKENPDAHQVLRQLGTVSDELVKRKKELSDVLVLVSRYTASLTE
AVASGPFFKAIVANLLPYQVLQPWVDAAFKKRGIDPENFWRSAGLPAFRW
PDPNGTRLPNGAPPPAPPVAEGTPDHPGPAVPPGSPCSYTPAADSPPRPG
NPLPCALSGQGPFGPVPDGFPAPLDVETSPPAPDGPPPSPGVPSAGRPGE
APPDVPGTPVPLPVHAPPGARTENPPDATGGSTR
>MAP1436c hypothetical protein
MDRFAGRRAIVTGAGSGIGAATAARLLDEGATVVAYDISAEGLARTRAAA
DDAGTGKRLTTAVLDISVEGDVIAAVDGAVADLGGLEVLVNVAAIQTCSH
THQTTLADWNRTLAVNLTGTFLMTRQALPALLDSGRGVVVNFTSTAASFA
HPYMAAYAASKGGILSFTHSLALEYAKQGLRAVNIQPGGVSTALANSTLD
KMPDGYDVGLWAKQTPLLHGKDSEILGDPSAVASVIAMVASDDGAFITGT
EIRVDGGAHA
>MAP1172c hypothetical protein
MATLLLHHPDFAAHRTAPGHPERPDRYRAVAAALSRPGFDALVRETAEPA
ELAATRYVHSNRYVDAPEAARPQHGYVYLDGGDTMMEPSTWETALRGVGA
TLQAVDRVLAGDVQNAFVACRPPGHHAETERAMGFCLFNNISIGARHAQR
KHGLMRVAIVDFDVHHGNGTQQIFYSDPSVLYASTHQMPLFPGTGAAAET
GVGNIFNSPLAPGDGGAELRAAFTDRFVPALQAFSPELIIVSAGFNAHER
DPLGSLTMTTDDFGWVTRELMKSAEKLCDGRLVAVLEGGYDLQALADSVT
AHVGELLKG
>MAP2183c hypothetical protein
MTAPALDRDRLRELFDLRSSYNAWAGGAYEDDPYPVWHRLREKGPVLPGV
LHELTGSTDTMFFHGLPYPDCPHFTVFDYDSCMIAYRNPEVFASSPEPVD
LEHGPLGLTNSMLSMNGEQHKRYRALVQPSFLPANGKWWIDNWISETVDL
LIDGLVHEGRAELNVDFCAAIPVLTITGSFGVPVEQALDIREALARDPQK
VVDLLKPVIAARPEEPRDDLISVLVQAELTDEDGAKDRLTDREIDSFVLL
LLGAGSGTTWKQMGTTLTTLLQRPELLEAVRADRSLLRPAIEEAIRWMPT
DPMFSRWVMADTELAGVSIPAGSVVHLALGAANRDPARWDRPDEYDITRK
FKPSLGFGQGSHICLGMHVARAEMTIAISALLDRLPNLRLDPDAEPPRFV
GMYERGATAIPVVFDV
>MAP2532 hypothetical protein
MTGLYYDPWNREIDADPYPIYQRLRNEAPLYYNERHDFWGLSRYDDVDAA
LRDPLRLSSAKGDILDVVKADPVMPPGVFINEDPPLHTIHRALVARAFTP
KKMRTLEDKIRAFCVASLDMVADSDRFDFVEDLGAELPMRTIGMLLGIPD
ADQPSVREHARATLQNDTGGPMPIRKDHYFDGDMFSDYVEWRKRPEWEID
MDNARRSRTSTVRGWDSMPAIVT
>MAP0125c hypothetical protein
MGMSQPTHEMFEAAYRGESPEMGEGARPPWSIGEPQPEIAALIEAGKFHG
DVLDAGCGEAAVSLYLAERGFTTVGLDQSPTAIKLAREKAARRGLTSASF
EVADISDFTGYDGRFGTIVDSTLFHSMPVELREGYQRSIVRAAAPGASYF
VLVFDRNAMPAEGPVNAVTEDELREVVGKYWVIDEIRPARIHANVPENFL
AGFEAFAGADIRDEDNGRKSVGAWLLSAHLG
>MAP1603c hypothetical protein
MKERLHWFAMHGFIRGAAALGARRGDVHARLIADPAVAADPARFYDEARA
RGTLVKGRVAYLTADHALAHELLRSEDFRVLVFGSNLPAPLRWLERRTRD
DLLHPLRAPSLLAVEPPEHTRYRKTVSAVFTPRAVAALRDRVERTAAELL
DQLTGGPGVVDIVGRYCSQLPVAIISEILGVPEQDRSRVLEFGELAAPSL
DIGLPWRQYRSVQRGIAGFSSWLAGHLQQLRSNPSDNLMSQLIQTAESGS
AETYLDETELAAIAGLVLAAGFETTVNLLGKGIRMLLDAPEHLDTLRRRP
ELWPNAVEEILRLESPVQLTARMALNDVEVAGRQLHRGDLVLVYLAAANR
DPAVFGDPHRFDIERPNAGRHLAFSGGRHFCLGAALARAEGEVGLRTFFE
RFPEARAAGAGSRRETRVLRGWSSLPVRLGPARSLAAAEAGGRPDEPTAG
>MAP2051c hypothetical protein
MVMTGTSAIELYYDPFDSGIDDNPYPVWQRMREEAPLYYNEKYNFYALSR
YEDVARELPNWQTYRSGRGTTADILFSNVEVPPGILLFEDPPLHDLHRRL
LSRVFTPRRMLAVEDLVRGFCVRELDPLVGAGGFDFIRDLGAMMPMRTIG
YLLGIPEEDQEKIRDRSVANIELSRDSDPAAVDANVFANSIALFADYIEW
RADHPSDDLMTELLRAEIDEPDGTRRPLSRTEVLAYTAMIAGAGNETTAR
LIGFMGQLLSDHPDQRRELAADPSLIPGAVEETLRFEPPSPVQARYVARD
AEHYGRVVPEGSFMLLLNGSANRDPRRFTDPDRYDIHRQGGGHLSFGQGL
HFCLGSALARMEARVAFEEVLKRWPDWEVDYANAERARTASVRGWARLPV
VTGG
>MAP3606 hypothetical protein
MRTLEPPNRVRIGLMGIVVTVLVIGVGQSFTSVPMLFAKPSYYGQFTDSG
GINTGDKVRIAGMDVGKVEGLKIDGDHIVVKFSIGTNTIGTESRLAIKTD
TILGKKILDVEARGSQQLRPGSTLPLGQSTTPYQIYDAFFDVTKAAQGWD
IETVKQSLHVLSQTIDQTYPHLSSALDGVAKFSDTIGKRDEQVKHLLAQA
NQVASVLGDRSDQIDRLLVNTKTLLAAFNERGQAINALLGNIAAFSEQVK
GLINDNPNLNHVLEQLRTVSDILVQRKDDLANGLTEVGKFLPSLNEAIAS
GPFFKVVLHNLALYQISQPWVDAAFKKRGIDPEDFWRSAGLPAYRFPDPN
GTRFPNGAPPPAPPVLEGTPDHPGPAVPPGSPCSYTPAADGLPRPDNPLP
CAGAVTGPFGGPGFPAPVDVMTSPPNPAGLPPTPGIPIAGRPGDAPPDVP
GTPVPLPTQAPPGARTENLAPAGPVPPPSTFAPGAAAGSAGTPRAGQPVA
GAVHQPRRDRRQRRSGGR
>MAP0111 hypothetical protein
MTSRRRKLIALAAAAAALAAVAIGVVGHYVKARLDTMTVTAQFDSAAGLY
EGNVVAVLGMPVGKVSKVTSKGSYVEAELTVDKKVKIPAAVRAVTISTSI
LTDRQVELTPPYRGGPVLKNHDTIGLTRTKTPVAFDRVLDMLDKVSKSLK
GDGKGGGPIADVSDAAVAITDGNGKKILAALDELSKALRLSSERGVTTRE
QLTTIITDLSSITEAAARNDAKVREFGSTTRQLSQILADEKFGTGATGRT
INRILEEVTTLMENNRDNLKQAVRNGDTAAKTLVDDQRGVAELLDVLPLT
LENLYNTVDQNNGAIRVHGLLDKALTDSQSAKELCNLMHLRQLGCSTGTL
QDYGPDFGLTYILDGLSAMGQ
>MAP0518 hypothetical protein
MGVVDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDIGVGLDGSPASGG
SAAQSVVDEITAAGGEAVADGSNVADWDQAAGLIQTAVETFGGLDVLVNN
AGIVRDRMIANTSEEEFDAVIAVHLKGRFATMRHAASYWRGLSKAGEAVD
GRIINTSSGAGLQGSVGQGNYSAAKAGIATLTLVGAAEMGRYGVTVNAIA
PSARTRMTETVFAEMMATQDQDFDAMAPENVSPLVVWLGSAEARDVTGKV
FEVEGGKIRVAEGWAHGPQIDKGARWDPAELGPVVADLLAKARPPVPVYG
A
>MAP1435 hypothetical protein
MYATSRTIGDTPSGVVPVRVDHRDDAAVSDLFDRVRRESGRLDLLVNNAA
TISDNLVSSKPFWEKPLDLADVLDVGLRSSYVASWYAAPLLVAGGRGLIA
FTSSPGSVCYMHGPAYGAQKAGVDKMAADMAVDFRGTGVATVSIWMGILL
TEKLRAAFGADPAALAATAEHAETPEFTGYVIDALFDDPGLAELSGQTLI
GAELAQRYGITDEGGRRPPSHRDMLGSPRTPSSVVVR
>MAP2862 hypothetical protein
MARNPVAQTAFGPMVLAAVEQNEPPGRRLVDDDFAELFLPAPLRWLVGAT
RFAPVRHLMIRGSEFTGPGLWANLACRKRFIADKLKESLDDINAVVILGA
GLDTRAYLLTRRVRIPVFEVDLPVNVARKFKTVRRVLGELPLSVRLVALD
LEHDDLLTALAEHGYRTDYRVFFICEGVTQYLSEATVRRTLDGLRAAAPG
SRLVFTYVRSDFIDGTNRYGTRTLYRNVRQRRQLWHFGLQPGEVADFLAE
YGWRLVEHAGPDELMQRYVVPTGRKLKASQLEWSAYAEKT
>MAP2015 hypothetical protein
MSLKTRPKKGLATRINGAPPPRVPLADIHLESLDFWGYDDDFRDGAFATL
RREAPISFWPAIEMDGFVAGNGYWALTKHEDVHFASRHPEIFSSVPNITI
NDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEASVRERAR
RLVSSLVANHPNGEAELVSELAGPLPLQVICDMMGIPEEDHQRVFHWTNV
ILGFGDPDLATDFEEFLQVSMDIGAYATALAEDRRVNHHDDLTTSLVEAE
VDGERLTSSEIASFFILLVVAGNETTRNAISHGVLALSRYPDERDKWFSD
FDRLTPTAVEEVVRWASPVVYMRRTLTRDVKLRGTKMKAGDKVALWYNSA
NRDESTFGNPWLFDVARTPNPHLGFGGGGAHFCLGANLARREIRVVFDEL
RHEIPDIVATEEPARLLSQFIHGIKRLPVAWTPPR
>MAP2899c hypothetical protein
MTGTLVSSVLPASDALAYSEVYSDPPGLAPLPEEEPLIARSVAKRRNEFI
TVRHCARIALGELGLPPAPILKGEKGEPRWPDGVVGSLTHCTGYRGAVVG
RTGAVRSVGIDAEPHDVLPDGVLNAISLPAERSEIPSALPGDLHWDRILF
CAKEATYKAWFPLTRRWLGFEDARITFEADHPGATTGGFVSRILIDPAAL
CGPPLTALSGRWSVARGLVLTAIVL
>MAP1344 hypothetical protein
MSVAEPKRDIAGLPLAPKNPLSYRERLRAIKEFHTGTNKLRDAGGPVTRV
TLGPRWLISPIVLATSPQGIRDIVSVRDGSIDKTSTVATELRRLLGPNLF
VLPHTEWLPRRRTLQPVFTRQRVREFGGHMAEAAESVCAGWPEDTEIDLD
AQCRTLTLRALGRSVLGLDLDERSDAIAEPLRVATSYAVRRALRPLRAPE
WLPTPSRRRARAAAGAIRALADEILQACRADPGREAPLVHALIAATDPET
GQALSDKEIRDEMIIFLFAGHDTTATTLTYALWALGRHPEYQARVAAEVA
ELPDRHLTPDDVARLGFTVRVLQEALRLCPPGPTGTRMATRDVEVAGYRV
EAGTMLAFGRMAVQTDPSLWDAPLRFDPDRFDPRRAGDRDRWQYLPFGGG
PRSCIGDHFAMLEATLALATIVRRVEIESLSDDFPLAVPFTMVAAAPIRA
MVRRRR
>MAP2735 hypothetical protein
MGRLSAGRRRMRKPLTCLPFDDARHLQAHQFLVDEAYLLDAQHYQAWLDT
LTDDVRYVMPVRVTTARGAGFDTSPGMAHFDEDKYSLSQRVARFATEHAW
TEDPPSRLRHFVTNVRTFVEDDRHLLVESAELLFRSRGDVNESALVSCGR
EDVLRWSEDRWKLARRSIFVDESVLRMQNLAVFL
>MAP2525c hypothetical protein
MSVEVAGSGSRKAPQFHFDRHTPEYRERFLDVTQEMHQRCPIAWTDTYGG
HWVAAGAGAVFELARCPHVSNDHDVNNERRGYRGVTIPLTTESDQIRGGM
LEMDDPEHRIYRSLLNPYLSPAAVSRWQPFIDDVVRACLDERIESGRIDF
VDDLANVVPAVLTLAMLGVPLRKWTMYNEPVHAMVYTPPGSPEAAKVHDM
WVSVVVDLFANLTEIREHPRPGIINALAQLRIDGEPAPDMEIIGMLTLLI
GGGFDTTTALTAHALKWLSEHPDQRARLHGELDVLLNPATEEFLRYFTPA
PGDARTISADMELDGIRFAEGERVWLSWAMANRDPSLFDNPNELMLARKA
NRHFSFGIGVHRCIGSNVARTVFKSMLTAVLERMPDYRCDRVNTVHYDTI
GIIQGMRNLPADFTPGKRLGPGLDETLDRLQSVCDSQGLARPITEYKEQA
RLPG
>MAP2172c hypothetical protein
MILADDAEHRDPEALGELMHRHSVAQVTAVPSLVSALLDSRPDAVRSLSR
LVCGGEPVSTSLLQRLVSVCDEADGGGPELLNNIGSTETSGAVSRGPLSP
PNPLVGKPVPGAQAYLLDDGLRPVPVGVVGELYYAGDQLARGYWKRPGLT
AARFVANPFGAEPGSRLYRSGDLARWTEDGQLEFVGRSDHQVQVRGFRVE
LAEVEAALAGADGVAAAAARTWEVHGGTSLAGYVVPQRPIADEAEKAAFA
AQVRAEIAATLPGYMMPSSLTVLDALPKTESGKLNRPGLPRPVVSTGGRT
EPTRTDTERALANVFAELLSTPEVGRFDDFFALGGDSILSVQLASRARAA
GLPVSPRMIFENPTVQQLAAALDALGDNDSDGRLDDQPADARFEPMSTSG
LSASDLAAVTQLWSSSREGTA
>MAP1252c hypothetical protein
MGRTFDELIAEADSVPVEGWDFSWLNGRATEERPSWGYQRLLSRRLANVS
AALDIHTGGGEVLAGAAPFPPTMAAIQTWPPNAALATARLHPLGAVVVAV
GDEPPLPFADHAFDLVTSRHPVSVWWSEIARVLRPGGSYFAQHIGPATMG
ELVEYFIGPQPQKWAEFHPDAVRAQVAAAGLRVVDVRMERLRAEFFDIGA
VVYFLRKVIWTVPDFSVARYRDRLAELHERIESQGLFVAHPTRVLVESRK
PE
>MAP0532 hypothetical protein
MPTKAKVAIVGSGNISTDLLYKLLRSDWLEPRWMVGIDPQSEGLARARKL
GLETTHEGVDWLLAQPEKPDLVFEATSAYVHRDAAPKYEAAGIRAIDLTP
AAVGPAVIPPANLRQHLDAPNVNMITCGGQATIPIVFAVSRVVEVPYAEI
VASVASVSAGPGTRANIDEFTKTTSRGVETIGGAKRGKAIIILNPADPPM
IMRDTIFCAIPEDADRDAIAQSIHDVVKEVQTYVPGYRLLNEPQFDEPSL
NSGGQAVVTTFVEVEGAGDYLPPYAGNLDIMTAAATKVGEEIAKETLATT
AGGAQ
>MAP1560 hypothetical protein
MSEPQEAIIPPDFSAPFDREIGLQFTELSPDGARARLEVTPKLLQPMGLV
HGGVYCSMIESMASVAAYTWLATRGGGNVVGVNNNTDFLRSIGSGTVYGV
VEPIHRGRSQQLWLVTITDDDDRVVARGQVRLQNLEVRNT
>MAP1862c hypothetical protein
MGLILGVCAPVLGGIPAVLTSPVSFLQRPARWMQLLAKESHTFSAAPNFA
FELAARKTSGEDMAGLDLADVLVILSGSERVHPATLRRFTEKFARFNLPG
KAVRPAYGLAEATVYALSRTPAQPPEIAHFDSEKLVTGTAERCESPTGTP
LVSYGVPRSPMIRIVDPDTGIECPEGTVGEIWIHGDNVAMGYWKKPQETE
RAFCGRLAAASSGAPEASWLRTGDSGFLSDRELFIIGRIKDLLIVYGRNH
SPDDIEATVQEITRGRCAAIAVPQGGMEKLVVIIEVKNQDPSREAADKLA
LVEREVTSAISNSHGIGVADLVLVAPGSIPITTSGKVRRASCVEQYRQGQ
FARLSR
>MAP1233 hypothetical protein
MDFLRNAGLMARNVSTEMLRHFERKRLLVNQFKAYGVNVVIDVGANSGQF
GSALRRAGFKSRIVSFEPLSGPFAQLTRKSASDPLWECHQYALGDADETI
TINVAGNAGASSSVLPMLKSHQDAFPPANYIGTEDVAIHRLDSVASEFLN
PTDVTFLKIDVQGFEKQVIAGSKSTLNESCVGMQLELSFIPLYEGDMLIH
EALELVYSLGFRLTGLLPGFTDPRNGRMLQADGIFFRGDD
>MAP0665c hypothetical protein
MGGRVEGKVAFITGAARGQGRSHAVRLAQEGADIIAVDICAPISSNSQIP
PSTPDDLAETADLIKGLDRRIVTAEVDVRDYYALKAAVDSGVEQLGRLDI
ICANAGIGNGGQTLDKTSEDDWRDMIDVNLSGVWKTVKAGVPHMISQGHG
GSIILTSSVGGLKAYPHTGHYIAAKHGVVGLMRTFAVELGQHFIRVNSVH
PTNVNTPMFMNEGTMRLFRPDLKNPGPDDLKVAAQFMHVLPVGWVEPVDI
SNAVLFLASDESRYITGLPVTVDAGSMLK
>MAP3160 hypothetical protein
MEDLRGDLAQRYKRTPTGGSSVDSAIVGADMAALDRDGYVIWENLLSTEQ
CAQIRETVRPWLGHTGRNSFEGRRTQRVYSVLSRTRMCDRLVDHPRVLAV
LDRLLMPNYLLSALQAINIQPGEAAQLAHHDDGFYPVPRPRAPLAAATIW
AIDDFTADNGATVLYPGSHRWGKRRPGPDDEAIPVVMPAGSCVLFVGTLW
HGGGANTTDRDRLAVTAQYCQPWLRPMEAFTLSVPRDIARTVSDDIRRML
GYSIHPPFVGAVDGLHPLRLLEMEPDA
>MAP2114c hypothetical protein
MTWKLPKSPAPQRPLMLGVLGTVILACVTVVAFQYNKLPFIKNTDDYAAY
FSEAGGIKPGNAVRVSGMGVGRVSDLRLEGTKVRIGFTVRKGVVLGDRTE
AAIKTETILGAKMLELTPRGDGRLSGVIPLERTTSPYDLPDALGDLTTTI
SGLDTTQLSAALTTLADTLKATPENLKPALQGVARFSDTLNSRDAQLRSL
LGNANHVSAVLGRRSQQIAGLVANSHALLAALLDERDSLDALMNHLTAVS
HQISGLVNDNRTQLKPALDKLNGVLEILDNRKEELQKTLPKFKRYAMSFG
ECLGSGPFFKAYVANLVPGQFGGPVLDADMYDRFLDPDQKLPSEVVDPPT
GTPPVPPENAPVPLWSQPPSPPPSTPPVRTIPPPSPHEFDQP
>MAP2538 hypothetical protein
MQGFAGKVAVVTGAGSGIGQALAVELGRAGAKLAISDVDTAGLAQTAEQL
AAIGAPVKADRLDVTEREAFLAYADAVNEHYGRVNQIYNNAGITFIGSIE
DSRFKDIERVVDVDFWGVVNGTKAFLPHLIASGDGHVINISSALGLFSAP
GQAAYVSAKFAVRGFTEALHQEMLRAGHPVRVTTVHPGGIKTAFARNATG
VEGLDHAELASLFEEQQAKTTPQRAAQLILDGVRRNKARVLVGPDVKAMD
LLVRAAGPNYERLLAGPVMGRVKEFVTRLLPKR
>MAP2348 hypothetical protein
MRTALVTGGSGGIGKGCARKLVERGYDVLLCARREAPLRAAAEEIGARHV
VADASDPIGFPSALSTLETVDLVVHAAGALGGTYARKQTFEQWRAIISAN
LDSCFVVTSAVLPKMTAGSRLVFISSSAAHEPMPARTAYSASKAGMNAFA
RALALEVDRDGISVHLVTPGPVATEMLQDVPFEMYAIAVSDVAEAVVFLD
TVDPSVDLPEIRLSAVQRGPFARPPVVPTEARRRAQRG
>MAP1451 hypothetical protein
MPQVSAAGRLAGKVALITGAARGIGRAQAVRFAQEGADIVALDLCGPVDT
VMVPPSTPDDLDHTASLVGEVGGRMHAELVDVRDLDGVQAATERGARRFG
GLDVVCATAGITSRAMTVEMDESVWRTMLDVNLTGVWHTCRAAAPHLIAR
GAGSMILTNSIAGLRGLVGVAHYTAAKHGVVGLMQSLAHELAPHRVRVNC
VHPTNVDTPLIQNDTVRSAFRPDLDRPPTRAEFAEAARAMNLLQVPWVDP
VDVANAALFLASDEARYITAVTLPVDAGATQR
>MAP2727 hypothetical protein
MAANELRTGSEPPLPGSHRADEAVAGHWLLARLGKRVLRPGGVELTRTLL
SHAQLTGADVVELAPGLGRTATEIVARAPRSYVGAEADPDAANVVRGVLG
DLGAHNAAVRVADAADTGLPDASGDVVIGEAMLTMQGDAAKRAIVAEAAR
VLRPGGRYAIHELALTPDTVSEEVSTDIRRALARAIKVNARPLTVAEWSA
LLAEQGLVVDHVATAPMALLQPRRLVSDEGLFGALRFARNVLVHRDARKR
VLTMRRTFRKHRRQLAAVAIVAHKPTASQTG
>MAP3879c hypothetical protein
MNLDSSAMNTHSHRRLAVVTGAGSGIGRAIALGLAAGGDRVVAADLDEAS
AAATAAEQPDLITAAPVDVADPARVAALRDRIHADIGVPGVVVNAAGWDR
TDQFLNATPEFAQKVVAINYLGPVHVCSAFLPGMIETHGGGRVVNVASDA
GRVGSAGESIYAGAKGGVIALTKSLAREMARHQITVNCVCPGPTDTPLFH
AQPEKLKEALVKAIPLRRLARPEEVAAAVLFFASQAASFVTGQVISVSGG
LTMAG
>MAP4043c hypothetical protein
MSKSPLRRFADQLVLATMRPPMAPQVLVNRPLIKPVELAGKRVLLTGASS
GIGEAAAEQFAREGARVVVVARRKDLLDALAERITRAGGEAIAMPCDISD
LDAADALVADVQQRLGGVDILINNAGRSIRRPLAESLERWHDVERTMVLN
YYAPLRLIRGIAPGMIERGDGHIINVSTWGVLSEASPLFAVYNASKAALS
TVSRVVETEWGDKGVHSSTLYYPLVATPMIAPTKAYQGVPALTPEEAGRW
MITAARTRPVRIAPRMAIAAKALDTFGPRWVNAVMQRQTVQPNREAGA
>MAP2261c hypothetical protein
MTGVIAERIGIMAVLTGESGTDRSGRPYDEIDLSSRAFWSGTAAERERSF
AVLRAERPVSWHPPVEDSLLPDPTDPGFWAVTRRADIVTVSRNNDVFLSG
HGVMFESIPAELLEASQSFLAMDPPRHTKLRKLAHAALSPRQVRRIEDSI
KANAKAIVEELRSAGSGCDFVDHCAKELPIRTLSDMMGIPESERERMAHA
TDALVSWADPEFLNGRPALEVLLENQMYLHQVVGDLATQRRERPGDDLIS
SLVTAEVDGDRLEDAEVAAFFVLLSVAGNDTTRQTISHTLRALTVFPDEK
FWLLEDFGHRIGTAVEEFIRWASPVMTFRRTAAADVELGGQTILAGEKVV
MFYPSGNWDTEAFDHPERLNLGRDPNPHVGFGGGGLHFCLGAHVARAQLR
AIFSELFRQLPGIQAGEPTYLAGNFVHAIRAMPCTF
>MAP3137c hypothetical protein
MRVAVVTGGASGMGEATCHELGRRGMKIAVLDVNEHAAQRVTDDLRTDGA
TALAVGADVTDRAAVEQAFAKVRSELGPVTVLVTSAGMFGFSPFLDITAE
SWSRIIDVNLTGTFHCCQVALPDMVAANWGRIVMISSSSAQRGSPFAAHY
AASKGAVITLTKSLAREYAPHGITVNNIPPSGIETPMQHQGQADGYLPSN
EQIAANIPLGYLGTGADIAAAVGFLTSDEARYITGQVLGVNGGAVM
>MAP2537 hypothetical protein
MQGFAGKVAVVTGAGSGIGQALAVELARSGAKVAISDVDLEGLAHTEEQL
KAIGAQYKADRLDVTEREAFLAYADAVKEHFGKVNQIYNNAGIAFTGDVE
VSQFKDIERVMDVDFWGVVNGTKAFLPHLIASGDGHVVNVSSVFGLFSVP
GQAAYNSAKFAVRGFTEALRQEMAAAGHPVAVTTVHPGGIKTAIARNATA
AEGLDQAELAKLFDKRLAKTTPQRAAQIILDAVRKKKARVLVGSDAKALD
ILVRLTGSGYQRLFGPVMSRLLPN
>MAP3496 hypothetical protein
MVQAARTNAAAKVNAIAQALDKVLNTVADRVANPARVSDPDRLRGAVGGR
TVLVTGASYGIGEATARRLAAAGATVLVVARSEERLGELTAAINAGGGRA
VAYPTDLTDESAVSALTKQITEEHGPLDVVVSNAGKSLRRSLHHQYDRPH
DFQRTIDVNYLGPVRLLLGLLPAMRDNGRGHVVNVSSVGVRVVPGPQWGA
YQASKGAFDRWLRSVAPELHADGVHVSTVYFALVRTRMIAPTPVLGRLPG
LSPDQAADVIAKAVIERPRTLEPPWVLPAELASVLLAGPADRAARLWHRR
FFSDAEEAGRER
>MAP0566 hypothetical protein
MSSNAPRERDPLRTGIFGVVLVVCVVLIAFGYAGLPFWPQGKIYDAYFTD
AGGINPGNAVYVSGLKVGKVTDVGLAGDSAKISFSVDRHVAVGDQSLAAI
RTDTILGERSIAVTPAGGGKATTIPLSRTTTPYTLAGALEDLGSNASNLN
KPQFEQALHVLTDTLHDATPELRGALDGVTSLSRTLDRRDEALQSLLAHA
KSVTGVLSQRAEQVNKLVDDGNELFAALDERRAALGRLISGIQGLSAQIS
GFVADNRKEFGPALNKLNDVLANLNERRDYITEALKRLPTYATELGEVVG
SGPGFNVNVYGVIPAPLLATMFDFFYQPGKMPASLADYLRGLIQERWIIR
PKSP
>MAP2427c hypothetical protein
MNAITDVGGIRVGHHQRLDPDASLGAGWACGVTVVLTAPGTVGAVDCRGG
APGTRETDLLDPANTVRFVDAVLLAGGSAYGLAAADGVMRWLEEHDRGVA
MDGGVVPIVPGAVIFDLPVGGWQCRPTAEFGYLACAAAARDDDAPAVGTV
GAGVGARAGALKGGVGTASRTLPSGVTVGALAVVNSAGEVVDRATGLPWM
TDLVEQFGLRPPPAEQIEAFAALPSPSNPLDNPLNTVIAVVATDAALSAA
ACRRVAVAAHDGLARSIRPAHTPVDGDTVFVLATGAVEVPPEADTPAAFS
PETRLATEVGVAAADCLAHAVLGGVLAADSVAGIPSYRDVLPGALGR
>MAP1383c hypothetical protein
MTQSTQTVVITGASAGIGRATAKEFGRRGANVALLARGAAGLQGAARDVE
AGGGKALALPTDVADHAAVVAAADETEAAFGPIDVWVNVAFTSVFAPFSE
ISAEEFKRVTEVTYLGYVHGTMAALAKMRPRDRGTIVQVGSALSQRSIPL
QSAYCGAKHAVNGFTESVRCELLHEGSRVRITLVQMPAVNTPQFSWVLSR
LPRHPQPVPPIYQPEVAARGVLYAADHPGRKQYWVGDSTMVTLLAQKFAA
PLLDRYLGRTGYDSQQTEQPVGGDRPHNLWQPLDQEPGSDHGAHGEFDDT
SHAHSPQLWASQHPVISGTGALGAAGLGTWLAARARRWTR
>MAP2745c hypothetical protein
MTGMVNPDRARSRTFVVTGAASGIGLATARRLLAEGGTVVGADLADPPGG
LGPRFTFVPADVTDESAVAAVLAAVPGRLDGVFHAAGVAGGGPVHLLDRA
EWDRVIGVNLTGTFLVAKAALARMIDQSRVDGERGSLVTVASVEGLEGTA
GGSSYNAAKGGVVLLTKNIALDYGPSGIRANVICPGFIETPMAHNVFSIP
GMEAPLASITREHALQRLGRPEEIAAMAAFLLSTDASFVSGQAIAVDGGY
TAGRDHGVVELFGFPS
>MAP0765 hypothetical protein
MLTRFVRIQLAIFLIAGTIGVISMVLFYIQAPTLLGIGRMTVTLELPATG
GLYRFSNVTYRGVQLGKVTAVGLTPSGVKATLSLSTSPKVPADLTAEVLS
VSAVGEQYVDLRPRTNSPPYLHDGSVIAMRDTKIPQPVGPMLDQVNALVQ
SIPKTKLGQLLDESFQGFNGSGYDLGSLIDSSQTLSRDANGVVDRTRALT
EDTGPLLDTQAQTTDSIRTWARSFAGISDVMVNNDSHFRTILEKGPETAN
EASRLLDQIKPTLPVLLANLTTIGQIGITYHPSLEQLLVLLPSAVAIEQA
AAAPNHPEGTAQGDFALTIDDPPICTVGFMPPNTWRSPDDLSDIDTPDGL
YCKLPQDSPLSVRGARNYPCMGHPGKRAPTVEICNSDQPFMPLAMKQHVL
GPYPLDPNLLAQGVPPDDRVTVNDRIFGPVEGTPLPPGAVPRGAPAGPRG
ENPPPGSVGAAVPPVPSSGPPALAPMSRMSADLPPIAPLDVPTPTELPPP
PPPPPAPAAPDQVDGAAPQAAPSAFAGKASKPAPSVVVAKYDPRTGRYVG
PDGKLYQQSDLVTPKAPKTWKDMLPT
>MAP3605 hypothetical protein
MKITGTLVRLSIFSVVLLIFTVMIIVVFGQMRFDRTNGYSAEFSNISGLR
AGQFVRASGVEVGKVSSVQLVDGGKRARVEFNVDRSVPLYQSTTAQIRYL
DLIGNRYLELKRGQGEGADKVLPPGGFIPLSRTQPALDLDALIGGFKPLF
RALDPQKVNTIATALVTVFQGQGGTINDILDQTAQLTSQLGERDQAIGEV
IKNLNTVLDTTVRHRQQFDQTINNLEVLITGLKDHGDQLAGGTAHISNAA
GTVADLLAEDRSLLHKTLNYLDAVQQPLIDQQDQLQDYLKKVPTALNMIG
RAIGSYGDFVNFYACDITLKINGLQAGGPVRTVRLFQQPTGRCTPQ
>MAP1854 hypothetical protein
MTRLGTKALAVVAGVAVVAAAVGIGWWLLRPDDDTITVTAQFDSASGLYE
GNVVAVLGMPVGKITKINPRGGYVEVEFTVDRGVKVPANAQAVTVSTSIL
TDRQIELTPPYRGGPVLGNHDTIGLPRTKTPVEFSSVLNVLDKVTKSLEG
DGHGGGPIADVLGDGAEVVNGNGEKIKAALGELSKALRLSSDGGAATREQ
ITTIVKNLSTLFDAVASNNTKLREFASTIHQVSQIMADEDLGSGNTGHKL
DQLIQRAGDLLDANRDNIKHAALSGNDTLKTVTDQRRDLAELLDLAPLVA
DNAYNMIDRANGSVRARFLTDRLLFDSQYTKEICNLMGLRQLGCSTGTIQ
DFGPDFGLTYVLDGLAAMGQK
>MAP3086c hypothetical protein
MTRSTDAVPTPHATAEQVEAARHDSKLAQVLYHDWEAETYDEKWSISYDQ
RCIDYARGRFDAIVPEHVLRELPYDRALELGCGTGFFLLNLIQSGVARRG
SVTDLSPGMVKVATRNGQSLGLDIDGRVADAEGIPYEDDTFDLVVGHAVL
HHIPDVELSLREVIRVLRPGGRFVFAGEPTSAGDVYARELSTLTWRIATN
VTKLPGLGSWRRPQAELDESSRAAALEAIVDLHTFTPGDLERMAANAGAT
EVRTVSEEFTAAMFGWPVRTFEASVPPGRLGWGWAKFAFNGWKTLSWVDA
NIWRRVVPKGWFYNVMVTGVKPT
>MAP1605c hypothetical protein
MKSIFITGAGSGMGREGAKLFHAKGWRVGAVDRNDDGLATLQQELGDDRL
WTRAVDVTDKAALDGALADFCAGNTGGGLDMMWNNAGIGESGWFEDVPYD
AAMRVVDVNYKAVLTGAYGALPYLKKSAGSLMFSTSSSSATYGMPRLAVY
SSTKHAVKGLTEALSVEWQRHGVRVADVLPGLIDTAILTTTTNHSNDGAA
PMTAEELRATAPKKGMLRLMPASSVAEVAWRAYHHPRRLHWYVPRSIRLI
DVFKGLSPEFVRRSIVKSLPALMPERQ
>MAP0698 hypothetical protein
MAEIDALWRYDRRRAVVTGCASGIGEQVVRQLGQLGADVIGLDQRRPGCA
LGEFHEVDLADPESIDRAAAGIDGPVDALFNIAGVSSGIGNPPLVVTVNF
LGTRQLTEALIPKMVAGSSIVSVSSLAAAGYREHLRQAAPLLDTATMREG
IDWCTSHPEELGTGYQLSKEALILYTMRSVTPLGARGIRINCTGPGVTET
PILDQLRTAYGQGFLDDIPKPLGRVSRPAEQAAVLLFLNSDAASYISGQV
VWVDGGNVGAAIARELEEGRTPWPV
>MAP1782c hypothetical protein
METWVMSISFETSESRADAELPVLPMPRAAHCPLAPPPEFVDWRQQPGLR
RALFQGNPVWVVSRYHDIRAALVDPRLSAKTIPDSIMPTDADNKVPVMFA
RTDDPEHHRLRRMLTGNFTFRRCESMRPQIQDTVDHYLDRMLDGGAPADL
VREFALPVPSLVIALLLGVPPEDLELFQFNTSKGLDQKSSDEEKGKAFGA
MYAYIEELVQRKAREPGDDLISRLITEYVATGQLDHATTAMNSVIMMQAG
HETTANMISLGTVALLGNPEIYARLGQTDDSAVVANIVEELMRYLSIVHS
QVDRVATEDLTIAGQLIRAGEFVVMNLPAGNWDTEFVDNPESFDADRNTR
GHLGFGYGVHQCIGANLARVEMQVAFATLARRLPGLRLAVPPEQLKFKDA
NIYGMKELPVSW
>MAP3567 hypothetical protein
MPGVQDRVIVVTGAGGGLGREYALTLAREGASVVVNDLGGARDGTGAGHN
MADQVVKEIKDAGGRAVANYDSVAEPAGAENIIKTALDEFGAVHGVVSNA
GILRDGTFHKMLFENWDAVLKVHLYGGYNVIRAAWPHFREQSYGRVVVAT
STSGLFGNFGQTNYGAAKLGLVGLINSLALEGAKYNIHANAIAPIAATRM
TEDILPKEVLAKLTPEYVAPVVAYLCTEENPNSASVFVVGGGKVQRVALF
QNAGVTYDKPPTVQDVAARWDEITDLSAAKQADFKLG
>MAP0672c hypothetical protein
MADNSLTDRTVVISGGSRGIGLAIGIAAARRGANVVLLAKTDTPHPRLPG
TVHTAAADVEAAGGKALAVVGDVRREEDVRRAVEATVQRFGGVDVCVNNA
SAIAVEPTAELSAKKFDLMQEVNIRGTFLLTKACLPYLRRAANPHVLTIS
PPINMNPRWLGAHPAYTLSKYGMTLLSLGWAAEFADDGIGVNCLWPQTYI
ATAAVANMADGDKLAESSRSPEIMADAAVEIVSRPAREATGDCYIDAEVL
HSAGVDDLSVYGGGEQPIPDLFLD
>MAP2879c hypothetical protein
MRAGWANAAAAITVVVAACPCPTGSADSTPDSPPFPIGQLGVPVHARAAS
GATADITVNSATWLPPGCTSHFATPTAGSACNVVELTITATSHRFFQVNQ
RYMFAGYGGGNQPWTHPDDAPQPATMAVDYQRLGKMPPLQTGGLHDGQTA
HGFVGFAMPAGGDLYLTINDPEQPAPYTEAGWIVHT
>MAP1853 hypothetical protein
MAGTTAGSMLDRSVRQRLSALTKRPLESYNKTWLGFVAVAVVAAVIAVML
VVHALGAGYRHYTAEFAQAASLRAGNPIVVAGIPVGTVTSMKLVGDHVEA
GLKVRDNISFGKDSRAQIRVTTILGSRYLALEPNGPARLPGNTFDLAHTE
VPYDLQAALQDATTTFEQVDSDRFAQSLAVLGKQLEGLPAVVPQAITNID
TLSSIIATRRDQLGQLLRSTEQVTNTLRRQQAGIGALVDQGQDLLGQFVA
RRAVFHAMMRSLSSLVDTLSRVVVDDRSGVDALLKDIRDFTGLVSAHDDL
LHSLLQVSPIFFREAANLTGDGNAINFNAPNVPLIDSWMCAISGRAKQFG
MIQYFKDCK
>MAP1861c hypothetical protein
MTSLRDKVVFITGGARGIGAEVARRLKAKGARLVLTDLNDAELTSLAAEL
GEERVLTAVADVRDLSAMQAAADRAVERFGGIDVVLANAGIASYGSVAQV
DPDAFRRVLDINVLGVFHTVRATLPAVIERRGYVLIVSSLAAYAACPGLA
PYNASKAGVEMLANALRLEVARHGVKVGSAHMSWINTALVRDTQNDLPAF
DRLLASLPWPLNKTTTVDKCAAAFVKGIQRRRARIYCPRWVALFRWAKPV
LSSRLGEMPLHSPTAELLPALDAQVAALGRSMSAANVELLRPDRGSARN
>MAP2637c hypothetical protein
MEIKDAVAVVTGGASGLGLATTKRLLDRGAQVVVIDLRGEDAVRELGDRA
RFVQADVTDEAAVGKALDTAESMGPLRINVNCAGIGNAIKTLSKDGPFPL
DAFKKVVGVNLIGTFNVLRLAAERIAKTEPIGPGTSPERGVIINTASVAA
FEGQIGQAAYSASKGGVVGMTLPIARDLARELIRVVTIAPGLFKTPLLGS
LPEEAQASLGKQVPHPARLGDPDEYGALAVHIVENPMLNGEVIRLDGAIR
MAPR
>MAP0871c hypothetical protein
MILDRFRLDDKVAVITGAGRGLGAAIAVAFAEAGADVLIASRTESQLEAV
AEQVRAAGRRAHVVAADLAHPESTAELAARAVEAFGKLDIVVNNVGGTMP
NTLLTTSTKDLKDAFTFNVATAHALTVAAVPLMLEHSGGGNIIKITSTMG
RLAGRGFAAYGTAKAALSHYTRLTALDLCPRIRVNAIAPGSILTSALDVV
ASNDELRAPMEKATPLRRLGDPVDIAAAAVYLASPAGEFLTGKTLEVDGG
LTYPNLDIPVPDL
>MAP2342c hypothetical protein
MRAPADFGVDRFTVPAVLDRRAAQHPDRVMMSIAGVDVTFAQMRQRSCAA
ANMLSDLGVGRGDRVALFSGTCPEWVYFWLGAARIGAVSAAINAAHKGDF
LLHALRLCRPAVIFTDPEHRSRAERAAAALEGPPRIVVQGDSLTATLSRA
ADRAPAEDRPDAGELGCLFYTSGTTGPSKAVATTWHYLFSVAATVAAAWE
FRQGEVLWTAMPLFHLSAAPSVLAPMLVGATTVLAAAFHPAEVWDDIRAH
GAIGFAGAGAMVSMLQNLPADPGDARLPLRFISAAPIAARSYRDIEKRYG
CRIVTMYGLTEAFPIAVKALADAGIPGTSGRPNPDFEVRILDAHGNSLPP
DTVGEIACRPRHPHVMSEGYIGDDLAVRPHPEWFRTGDLGRLDRDQNLTY
VDRIKDALRRRGENISSVEVETVVMGHPAVAEAAAVGVPGELGEDDVLVV
VTLRPGATLDCAELLDFCADRMPYFCVPRYVETVPELPKNAIGRIRKDLL
RARGLTTNVWDREKHGYVVRR
>MAP0685 hypothetical protein
MGGTNMTSVANPIPPQDFSGIVGHRFIYTYANGWQYEMYVKNATTIDYRI
HSGHVGGRWVKGQEVNLVQLDDDSYKISWTEPTGTCVAVNVLPSKRRIHG
VIFFPQWIRQHGERTVCFQNEHLDEMRAYRDRGPTYPIYEVPEFAYITLF
EYVGTDDETVIDTAPGDLPRGWSDRTN
>MAP3564 hypothetical protein
MSSLRTHDDTWDIKSSVGTTAVMVAAARAVETEQPDPLIRDPYAKLLVTN
SGAGVLWEAMLDPDIAARVEALDEESAAHLHHMRGYQAVRTHFFDTYFAD
AVAAGIRQIVILASGLDSRAYRLDWPAGTTVYEIDQPQVLAYKSTTLAEN
GVTPSADRREVAVDLRQDWPAALRAAGFDPTQRTAWLAEGLLMYLPAEAQ
DRLFTLIGELSPAGSRVAAETAPNHADERRQQMRERFKKVADEIGFEQTV
DVGELMYRDDHRADVTEWLNAHGWRATAEHSTAAMRRLGRWIENVPLADD
KDAFSDFVVAERR
>MAP0702 hypothetical protein
MVNELAGKVAIVTGGASGIGRGIVERFVAEGARVVIADIETERGERLAAE
LGGEAVFRRTDVSDIEQVGALVAAAVEKFGGLHVMVNNAGISSPLRRLLD
DDLADFHRVMGVNVLGVMAGTRDAARHMADNGGGTIINLTSIGGIQAGGG
VMTYRASKAAVIQFTKAAAIELARYDIRVNAIAPGNIPTPILGKSAGDMD
PEQRERFEARIREGMREDRPLKREGTPDDVAEAALYFATDRSRYVTGTVL
PVDGGTSAGKAMRSKRQG
>MAP2690c hypothetical protein
MTAPEADLSGWSVAPFSGGGYTHDVYRKGVGPGVVLIPELPGIHPGVLAL
GNHLVDNGFTVAMPSLFGEPGKPVSPGYLVAGMTRACVAREFAAFATNKQ
RPVSLFLRALARDLNASTPGDGVGVIGQCFTGGFALAAAVDESVLAPALS
QPSVPFPLGATRRRDPGVSEAELATVADRCANEGLCAMGLRFSEDWTSPR
ERFTALKQRLGDAFEVIEIDSRPGNEHGFGKTAHSVLTLEVREVDGHPAY
EARKRVVEFLTQRLGAR
>MAP0222c hypothetical protein
MHAAKISKPRPAPGADDGVMDANLAAVADTALLVAAIRAHETTRDDRLFA
DPFAARLAGDRGRELLAGALAATGESATAQIVVRTRFWDDALLEAAQQIS
QVVILAAGMDARAYRLAWPDGTVVYELDQPEVLAAKDGVLAGERPACRRV
AVGVDLAQDWPAALRRAGLDPSAPAVWLIEGLLQYLDEAAVTALFDRVDA
LSARGSVLLYDVVGKALLESEFMAPVLESMARSGAPWRFGTDDPGGLCER
LGWSATVTDVAEPGNRYQRWYAPAVPMDVAGAPRGYFVQATKQAVGD
>MAP0663 hypothetical protein
MARTDRDRWDLATSVGATATMVAAQRALSSDANLIDDPYAAPLVRAVGID
VYVRLVDGEIQPGTSEFDPHRMAKGMACRTRFYDDFFLDAARAGVGQAVI
LASGLDARAYRLPWPAGTVVYEVDMPDVIEFKTLTLADLGAQPTAQRRTV
AIDLRDDWAAALREERFDTQAPAAWSAEGLLVYLPEQAQDALFDNITALS
APGSRLAFDFVPDTAVFADPRWRAHHDRMSELGFEVDFNDLVYHGERSHI
VDHLSGRGCSLVPLFRVG
>MAP3034 hypothetical protein
MDVTIVGSGPNGLTAALICARAGLKVQVVEAQPTFGGGARTAADPDSAGV
LHDICSAVHPLALASPFFAEFDLPARGVQLAVPEISYANPLPGRPAAIAY
RDLDRTCAELEHGASFRRLLGPLVARSDDVVALLLGDKRSLPNSPTSALR
LGLRMLAQGTPAWGTLAGEDARALFTGVAAHIISRMPSLTAAGAGLMLAT
LAHSVGWPIPVGGSQAITDALIADLRAHGGELTAGAEVTEPPGGVVAYDT
APTALLRIYGDALPPRYAKALRRYTFGPGVAKVDFVLSDEIPWSDPRLRQ
APTLHLGGTREQMARAEADIAAGRHAQWPMVLAASPHVADPGRIDAAGRR
PFWTYAHVPAGSTLDATEAVTAVVERFAPGFRDVVVAARAIPAARLCDHN
ANYVGGDIGIGGNSAWRAIAGPTPRVNPWSTPIPKVYLCSAATPPGGGVH
GMAGYYAARTLLRREFGLGMPRLAP
>MAP4129 hypothetical protein
MGVAIEVNGLTKSFGSSRIWEDVTLEIPAGEVSVLLGPSGTGKSVFLKSL
IGLLRPERGSVVIDGTDILQCSAKELYEIRTLFGVMFQDGALFGSMNLFD
NAAFPLREHTKKKESEIRDIVMEKLELVGLGGDEKKFPGEISGGMRKRAS
LARALVMDPQIILCDEPDSGLDPVRTAYLSQLILDINAQIDATVLIVTHN
INIARTVPDNMGMLFRRKLVMFGPREVLLTSDEPVVKQFLNGRRIGPIGM
SEEKDEATMAEEQAALEAGQHAGGVEEIEGVPPQIVASPGMPERKAVARR
QARVREILHTLPQKAQAAILDDLEGTHKYRAHEVGD
>MAP2779 hypothetical protein
MGSLDGKVAFITGVARGQGRSHAVRLAREGANIIGIDICADIAANGYPMA
CRAELDQTVALVEEAGGKMLGTVADVRDFGQVKAALDAGVEQFGRLDIVL
ANAGIAPLAFRQLSIEEELAQWRAVTGVNLDGAYHTAWAAIPHLLAGNRG
GVIIFTSSTAGIKGFGGLQGGGLGYAASKHGIVGLMRTLADALAPLNIRV
NTVHPTAVNTMMATNDDMIEFLQKNPGAGPHLQNPMPVGMLEPEDVSAAI
AYLVSDEARYVTGVTFPVDAGFCNKV
>MAP1485c hypothetical protein
MNIAEHALAAAQSPALITDGGTISYGELHDRSRRVAAALHELGLRRGDGV
ALVLPNRPEFLEITWGCQLSGLYYTPVNTHFTADEVVYVIDDSDATAVFV
DASLPGIAARLRSANPAVHIGVGGKLPGWRDYEGVLGAAGDAPPVSDGSE
MLYSSGTTGRPKAVRRPLPQDGNGSWAQSVLELALIHKYGMTQRSVYLSP
APLYHAAGVNYTMAVNRVGAASIIMRKFDAETVLRLIETHRVTHAQFVPT
MFVRMLKLPEAVRDRYDVSSLRCVIHAAAPCPVDVKHRMMRWFGPVIHEY
YGGTEGFAGTTIGPQEWLAHPGSVGVPLAPVPCSTRTGGKSRSARPVSCI
SKAGPTSSTSRTPSKPRRCTTSAAGARWATWAISMRTATSTSPTARRSRS
CPAGSTSIRRKSRTCS
>MAP1004 hypothetical protein
MSEDARADGVSRQYDRWQYPPPVTDLDAWTTNHWDWFDPFWAHRLLWPDR
EYRPDLDILIAGCGTFQAAVYAYTNRAAKVVAIDVSRTALDHQQFLKDKH
RLHNLELHRLAIEEVAALDRDFDLIVSTGVLHHLADPLTGLTALGRCLRP
DGALGVMLYASYGRIGVEMLASVFRDLGLSQDEASVELVKEAVAALPADH
PVHTYLKGARDLSTDGGLVDTFLHARQRSYTVEQCLELVAAAGLAFQGWL
RNSPYYPHDALFGSAATQFQSALNRLPDTTLWSVMERLQPANATHFFLAC
RPERPRERYRIDFSSPTYLGYVPVLRTACLLSGDQIHLPGAKLTLTPAQL
PFVQQVDGRRSIAEIIDGVAGAGRDDRAREFGRRLFESLWRLDFLAMGLA
PGR
>MAP0980c hypothetical protein
MTGWTAADLPSFAQRTVVITGANSGLGAVTARELARRGATVIMAVRDTRK
GEAAARTMAGQVEVRELDLQDLSSVRRFADGVSGADVLINNAGIMAVPYA
LTVDGFESQIGTNHLGHFALTNLLLPRLTDRVVTVSSMAHWPGRINLEDL
NWRSRRYSPWLAYSQSKLANLLFTSELQRRLTAAGSPLRALAAHPGYSHT
NLQGASGRKLGDALMSAATRVVATDADFGARQTLYAASQDLPGDSFVGPR
FGYLGRTQPVGRSRRAKDAGMAAALWALSEQLTKTEFPL
>MAP0107 hypothetical protein
MASVIAVIPLYIACLAVTYLTCQVVANIISGGSIGPYLHYFTMMLSAKDI
AYSVLKCVVFVWLSSTVQCYYGFYAAGGPEGVGVAAGHAMRASITVVIMV
NMLLTMALWSIDAGARFGG
>MAP1564c hypothetical protein
MAVEVLVTGGDTELGRAVAEGFRDDGHKVTLVGARKSDLEIAAKELEAEA
IVCDTTDPAALEQARPLFPHHLDTIVHVPAPSWEAGDPRTYSIADTASAW
RNALDATVLSAVLTVQTVGDHLRSGGSIISVVPENPPAGSAQAAVKAALS
NWTTGQASVFGTRGITVNAVASGRGAQPGYDGLSRSPAPVAAEVARLALF
LTSPAARHITGQTLHVSHGALAHFA
>MAP2870 hypothetical protein
MPKITDSISTADGTCPVRLFFPDGSGPWPGVVMYPDAGGVRDTFDQMAAE
LAGFGYAVLLPDVYYRSGEWAPFDMATVFADQQERNRLFAMIGSVTPDRM
ATDAAAFFDYLAARPEVSGDRFGVCGYCMGGRTSLIVAGRLPDRVAAAAS
FHGGGLVTDSDDSPHLLADRMSATVHVGGAQDDASFTTDHAEQLDKALTA
AGVRHTIEWYSAAHGFAVPDNAPYDPAAAERHWDAMRDVFAAALPR
>MAP2194 hypothetical protein
MLTRFVRIQLTIFALASVVAMAFMFFQYMQVPTLLGIGKLTVTLELPDTG
GLYRFSNVTYRGVQIGKVTAVAPTATGAKATLQLDTSPKIPADVHAAVRS
MSAVGEQYVDLVPRSESGPYLCDGSVITARDTSIPRPVGPMLDRLSALVK
SIPKDKLGQLLNESFSAFNGAGYDLEWLLDSSGKLSRDASGVVDHTRALV
DDGAPFLDAQAQTADKTRRWAHNLAGFTDQMVTDDAQFRKLLHTGPGFEQ
EVSRLLDQLKPTLPVFLANLSTIGQIGVTYHPALEQLLVLLPPSVAAYGS
YGVTNNPTGLAVGRFTLTIADPPACTVGFLPPSQWRSPADTSEADTPDNL
YCKLPQDSPISVRGARTYPCMGKPGKRAPTVEICNSDKPYVPLAERQHAL
GPYPLDPNLLSQGLPPDDRAGIEDRTFGPVEGTPLPPGAAPAGTPPGPPA
PGSLNQPPTAPDVAAAPAAPSAFDSNSSGVSPSVAVLHYDPRTGRYVAPN
GQLYRQSDLVRANAPKTWQGMLLSSD
>MAP1475 hypothetical protein
MAAVSGADALLSGRGAVVSGGSRGIGRAVAELLAGLGAGVVVNGRDPQAV
QETVAAITAAGGRATAVVGAADDERIARSLVDECIGAFGRLDALINCAGI
AEPAGSSILNITADEFDHLIGAHLGTAFHTCRAAAPVMVEQRHGSIVNTS
SVAFLGDYGGTGYPAGKGAVNALTMAIAAELKAYGVRANVVCPGARTRLS
TGADYERHIEDLHRRGLLDDMTRQASLDSAPPVFVAPVYGYLVSDLARDV
TGQILVAAGGFVGSFDRQTPRLLGYRDHHRAGPWSIEEVHTMIGAAATP
>MAP0049c hypothetical protein
MPNALITGAGGGIGSAIATALAPTHTLLLAGRPSDRLDAVAQRLGATTFP
LDLTDGGDIEAACEVVEELDVLVHNAGLSIPGNVADSNVDEWRATFAVNV
FGPVELTLALLPALRRARGQVVFINSGAGRNASPGMASYSASKFALRAFA
DSLRNDEPELRVTTVYPGRTDTGMQRELIAFEGGSYDPDRFLKPETVAAA
VANVVATPPDGHVHEVVLRPARR
>MAP0752c hypothetical protein
MPDAWTDTYGGHRVAAGSHEVFELARCPAVSNDHDINGERRGYKGISIPT
ASRVSAVRGGILEMDDPEHRIYRTVLNPYLSPAAVKRWEPFIDEVTRAAL
DEKIEEGSIDFVDDLANIVPAVLTLAMLGIPLKKWKMYSEPVHAAVYTPE
HSPDIERVTAMHREMGLDMVNNMLEIRENPRPGIVNALLQMRIDGEPAPD
LEILGNLGLVIGGGFDTTTALTAHSLEWLSEHPEQRQLLSDERKTLLDPA
TEEFLRYFTPAPGDGRTFSEDFELDGTVFKEGERLWISWAMANRDPAVFH
DPDEVILDRKGNRHFSFGLGIHRCIGSNVARTVFKSMLIAVLDRMPDYRC
DPEGTVHYETIGVIQGMRKLPATFTPGRRIGAGLDETLEKLQRICDEQEL
ARPITERKEAAVID
>MAP0563 hypothetical protein
MSYDATLRFRRWFSRLQEPVDDFGEQALFYGQTMRYVPNALTRYRKETIR
LIAEMTMGAGALVMIGGTVGVAAFLTLASGGVIAVQGYSSLGNIGIEALT
GFLSAFLNVRVVAPVIAGIALAATIGAGATAQLGAMRVSEEIDAVECMAV
HSVSYLVSTRLIAGLIAIVPLYSLSVLAAFFAARFTTVYINGQSKGLYDH
YFNTFLIPSDLLWSFLQAIVMSIAVMLVHTYYGYNAHGGPVGVGIAVGQA
VRTSLIVVVVITLFISLAVYGVSGNFNLSG
>MAP0110 hypothetical protein
MADTRKLRWLRRRPLESYNRTWLGLAGLVVVAVLVAVSLGIKLLGVGYTH
YTAEFLQAATLRPGNPITVAGIEVGHVTSMKLAGDHVEAGLSVRDNVPLG
KDTRAVIKVMTILGSRYLELVPDGPGSLPANTIPLAHTEVPYDLQSLLED
ATTTFEQVDSDQFAQSLAVLGKQLGGVPPLVPQAVANLQTLSTITADRRG
QLGALLKSTQRVVNTLRNQQSNIGHLMDQGQDLLGHLVARQATFHAMFAA
LTELVDQLDKIVVNNRPMLDELFANLHQLTNMVGQHDDLVRNILQVAPVT
LRGLTNATGYGPVVEFNLPNGLAIDSWMCAISGRAKEFNMIQYFKDCK
>MAP3657 hypothetical protein
MKRLSGWDSVLLYSEAPNVHMHTIKAAVIELDADRRSLDVAAFRQVIAGR
LNKLDPFCYQLVEVPFSFHHPMWRENCEIDLDYHIRPWRVSPPGGRRELD
EAIGQIASTPLDRSRPLWEMYFVEGLANNRIAVVGKIHHALADGVASANL
LARGMDLQPGPEGGPYVCDPPPTTRQLMVSAFADHLRHVGRLPHTIRYTA
QGLGRVRRSARKLSPELTRPFEPPPTFMNHKLTPERRFATATLALADVKE
TGKRLGATINDMVLAMSTGALRTLLLRYDGQAQPLLASVPVSFDFSPERI
SGNRFTGMLVALPVDHDDPLERVAACHQNAISAKESHQLMGPELVSRWAA
YMPPAPTRAFFQWASARDGHNKILNLNISNVPGPRERGRVGGALVTEIYS
VGPLTAGSGLNITVWSYVDQLNISVLTDGATCKDPHEVTEAMVQDFIEIR
RAAGFSEDLTVVEAAMAPA
>MAP2190 hypothetical protein
MTGTGRIMVKFGVFAAVMLMLTTSLFFIFGQFRNGPTHSYSAVFIDASQL
KTGDSVRVAGIRVGTVNGIALQPDNKVVVDFDADDDVALTTGTRAAVRYL
NLVGDRYLELIIGPGSMRVLPEGSQISIDHTMPSLDLDLLLGGLKPVIQG
LNPQDVNALAGALLQVFQGQGQTLQSLLAKTSSFSNDVAGKNKAIETLID
NLNTVVRTLSDQGSQFSGAIERLQRLVTDLAHDRDPIGDAIQALATGTAS
VTDLLSAARPPLSGTVDQLNRLAPLLEHGKGRLDDALQRAPNNFRKLART
GAYGSFVNYYICGLTFRVTDLQGRTAVYPWLKQTEGRCAEP
>MAP0730c hypothetical protein
MTKPKLVFDPYSEDYFNNPYEIYRRMREEAPLYYDEKEDFYALTRHVDVA
AAFKDYETYSSARGCDLAMVRRGISPEQKSIIFMDPPEHRHMRSLLNKAF
TPRAIQSQRETIIEVVDKYLSAADPDNFDVVQDFSGPFPVEVITRMAGVP
EEYRQQVRHWIDTSLHHEPGQIEVSEAGMQANIDTAMYYFGLVQERRQDP
QDDMISRLIAAEIPGENGQMRKLDDIEITGFATLLGGAGAETVTKLLGNA
AVIFARHPDQWQKLQEDRDKIPGAVEELLRYEGPVQYNVRYTLKEAHVSG
GVIPAGKPVFLCGAAANRDPEAFTDADTFDIERDQTEAQHLGLGYGIHSC
LGAALARLESRIALERLLDFMPRYDVDWAGCRRVTMQNVAGWKNVPVKVL
R
>MAP4079 hypothetical protein
MWNWHSAPQLPPELIEAEPTIPLQQQAMVGYMASRTAFFDSFFLEATGAG
IRQAVILAAGLDARSWRLPWPAGTTVYELDQPRVLEFKESTLAEHGAQPA
CNRVAVPVDLRHDWPEALRQAGFDASAPSVWSAEGLMPYLPAAAQDLLFD
RIQGLTVAGSRVAVEALGPKFLDPQARAKRRERMDRIQALMARIDPDRAV
PRTDELWYFEEREDVGEWFGRHGWDVRVTPSDELMAGYGRGRRRPRSATS
CRGTCSSPRSGGRPEGLAFRQGESRARRHRRDVAGQHGFGNQCGGPDCGS
AQHRRAQVDHPAQQRGFSDDAPDAAPAERGEPGERGGQVVRLVDARGQHR
GVLEPLATALTQVRAHRMSRVADHHDGPARPGPGGGAVVKVVAQHLVAGR
RCQHPRNRFGPIGESCLQIGQFAARRELPFRSALGGEPIQAIRTHRHMAG
FDAGTKCLAGQLGVHRRSPHRAMRCSRRTGRRAGR
>MAP1602c hypothetical protein
MTEVSMIAVEDVVRGLWQALSRRDWDAVKTFLSDDCLYVDMPVPALSARG
PDDIVKRLKMGLEQLAGYQNHPGVLVSNGSDVLYEHSETWTFATGEQGVL
RFVTVHKVIDGKVTVWKDYWDFNSLVAFAPPNHFEGLANGDTSWVFDASA
LV
>MAP2603c hypothetical protein
MRFAAAVQAALKDGFRVFGELAPHPLLTHAVEQNAASLDVPIAALAAMRR
EQELPLGLRGFVGDLHSAGALVDFSVQYPAGRLVDAPLPTWSHRRLMLRR
ESVQRAHGSSVQAVHPLLGAHVHLREEPERHVWQGEVGTDAHPWLADHQI
HGVAAFPGAGYCEMALAAAAATLGERAEVRDVTFEQTLLLAGRTEVSSTA
TVTGPGRLEFTVDTHEDGERIRRAGAVLHALPPEHGGETGPPAHDVATLI
ADHPSRIEGAELRKAFGAIGIQYGPAFSGLAAVLVGDRDVGTVLAEVALP
GAIRSQQSGYGAHPALLDACLQSVIAAPELQRAAAGGLLLPVGVRRLRNY
HSTRNAHYCLTRITSSRPGECEADVDVLDQSGTVLLTVEGLRLAGAASEH
EHAQRLLDERLLTIEWEARELPEAPQGEPGSWLLLSACDAASNGDDALTT
QLADVLKTDGAQCRTVSLPPGAVGTEELRSLLSGGEPGGNGHRPLQRLTG
VVVVAAASEAGHPAAPRRGRDYVSHLATVARELSELPGESPRLFVLTRNA
AVVVAGDAPNLTQAGLRGLMRVIDAEHPHLSATQIDVDDATDPGHVARQL
QSRSGEDETAWRDGQWYTARLRPGPLRPADRRTTVVDHGRDGMRLHIRTP
GDLESLELTAFDRVPPGPGEIEVAVASSTINFADVLVAFGRYPMFEGYQQ
QLGGDFAGVVTAVGPGVTEHRVGDRVGGLSGNGCWGTFVLADARHAVTLP
PEIPLRDAAAVPTASATAWYGLHDLARIAPTDKVLIHSGTGGVGQAAIAI
ARAAGCEIFATAGSPQRRQLLRDMGIEHVYDSRSLEFAEQIRRDTDGYGV
DVVLNSLPGAAQRAGIELLALGGRFIELGKRDIYRDSQLGLFPFRRNLSL
FAVDLALLTHSHPHTVRRLLTTVYQRTAAGELPMPRTTHYPLQDAAAAVR
LVAAAGHTGKVVLQVPRTASSVAALPPEKVRPFRADGAYIITGGLGGLGL
FLAGEMACRDGDVGAGRIVLNSRSQPGEQARRAIERLRAAGADIAVECGD
IAEPDTAERLVTCATATGLPVRGVLHAAAVVEDATLTNVTDDLIDRCWAP
KVYGAWNLHQATAGQPLEWFCLFSSAAALVGSPGQGAYAAANSWLDAFAR
WRHARGAPATAIAWGAWSQVGRATALAEDAGVAITPAEGFRAFETLLRHD
RPYAGYAPIMGTPWLTSFAQRSPFAEAFRSAGQGRPDAGKFLAELRALPR
EEWPSAIRRLVSGQLSLLLRRTIDPDRPLSDYGLDSLGNLELRTRIETET
GVRISPATITTVRGLAEHLCDELADLPAAPPAGAPR
>MAP2344 hypothetical protein
MNVNAATAACGDDPAERGSAMTTAAVDLSDFSLWCNGFPDELFAELRRTR
PLFHHDLTPGVAATVHRDFWVATKHRHAVRLHRDTESFTAADGPLIQPVA
MFSSSPTIITMDPPELNKRRKLISNAFNPRAIAKLEDGIRARAARMIDSL
LAHGGGDWIEDVADALPMTVIGDILGIPERDRPRIFDLFDRILKALAPEA
HPRGGVELELFASVFDYAMQLTADKRRNPTGDIWSTLATAVITGEDGEEF
RLPANELEFFFFVLAFAGSDTTKNALAIGLQAFLANPEQVERYCADEALR
PTAVEEVLRWASPVAYWTRTAKVDVEMDGQRIAKGERVVSMLRSANRDEE
VFDAPFTFDIGRQPNPHVAFGGGGPHHCLGAMLARAELRAVFDELLLRCD
DIEIGPAKAAYPNLITNMSIYDEMPISLRRR
>MAP4085 hypothetical protein
MRITGTAVKLVVFWSVLAMFTVMIIVVFGQVRFDRTTGYSAVFTDAGGLR
AGQFVRASGVEVGKVAAVTLSDKDSRVLVEFNVDRSLALDQGTTASIRYL
NLIGDRYLELKRGTSGRRLPPGGRIPVEHTQPALDLDALIGGFRPLFQAL
DPNKVNSIAQSIITVFQGQGATITDILDQTAALTAALADRDKAIGEVINN
LNTVLATTVKHEKEFDRTVDKLELLITGLKNRADPLAAAAAHISDAAATV
AGLLGEDRPSLHGTLGHLEGIQQALINDLPTVDNVLEKLPGAFRVIGRAG
GIYGDFYNFYLCDISIKVNGLQPGGPVRTIKLFGQPTGRCTPQ
>MAP0266c hypothetical protein
MGYAEQLFDLTDRVVLITGGNRGLGREMAFGAARCGADVVIASRNLDNCV
ATAQQVEHETGRRAMAYQVHVGRWDQLDGLVEASYDRFGKIDTLINNAGM
SPLYDKLTDVTEKLFDAVVNLNLKGPFRLSALVGERMVAAGRGSIINVST
AGSLRPTPDIVPYAASKAGLNAMTEALAKAFGPAVRVNTLMAGPFLTDVS
RAWNLEAVQENPFRHLALQRAGDPREIVGAALFLASDASSFTTGSILRAD
GGIP
>MAP0344c hypothetical protein
MTTAPTESAESQGLLLQLLDPANRADPYRLYAQFRERGALQLPEANLAVF
LSYRDCDEVLRHPSSSSDNVNSTVAKRQAAAGTAPVRQGPPGFLFLDPPD
HTRLRRLVSKAFAPRVVAALEPDIRSLVDGLLDRAADKGELEIVEDFAYP
LPVAVICRLLGVPLDDEPQFSRASALLAQALDPFSTITGVPAEVASERQR
AGTWLRDYFHQLIEARRSRPGDDLLSGLIAVEESGDQLTEEEIVSTCNLL
LIAGHETTVNLIGNAVLAMLRDPGQWAALGADPGRAPAIVEETLRYDPPV
QLAGRIALDDMVIGGVEVPAGDVMMLLLAAANRDPAEFDRPDTFDPDRKC
LRHLGFGRGVHYCLGAPLARLEAGVALSAVTARFPRARLDGEPQYKTNVT
LRGLSRLVVAV
>MAP1345 hypothetical protein
MASRRARFRFFYRVGFTPWEGHPIGQGLRDLVEGAAGTQALPTGSALDLG
CGTGDCAVYLAQHGWNVTGVDYVAKPLDKARAKAAAAGVAVDFVRADVTQ
LSQSGIGAAFDLIVDNGCIHNMSGGDRDAYVREVSAVAAPDARLFIVAFP
PGGRFGVPGIDRAEIERRFTAGWTLLSTGQERALDDKTPTYYYLFQRRP
>MAP3015 hypothetical protein
MARWLITGCSTGFGREIAGAALQAGHRVVVTARRADAVRGFAEEFGELAL
PVALDVTDRDQIAAAVAAADAAFGGIDVLVNNAGHGYLSAVEEGEDAEVR
KLFDVNYFGAVDMIKAVLPGMRARGCGHIVNISSMTGLVANPPNAYYSST
KFALEAVTEALAAEVRPLGIKVTAIEPGAFRTDWATRSMKESGTPIADYT
DVAARKDLIKQFADHLPGDPRKVAEAVLMVTGLDEPPLRLLLGRDVLKAM
RDKIAAMSASIDEWEAVTKDVNFPGA
>MAP2526c hypothetical protein
MSRVAVVTGGGSGIGRAIVERLAHDRHRVAVLDVNEEAAEKVAARVAADG
AHAIAVPTDVAESASVAAAFESVRRALGPVQVLVTSAAITGFKPFGEITI
EDWNRHLAVNLTGTFLCLQAALPDMVEAGWGRVVTISSTAAQTGSPRQGH
YSASKGGVIALTRTIALEYAVHGITANTVPPFSVDTPMLRAAQEAGNLPP
VKYLAKASPVGRLGTGEDIAAACAFLCSDEAGYITGQIIGVNGGAVI
>MAP2584 hypothetical protein
MQTHPPLRSPSFPLHSPDFYAGNPYPAYRELRATAPVCWNDVTNFWALLK
YEDIRFVSSNPALFTSTRGITIPDPQLPNPVQQGSLIFTDPPRHRQLRKL
INSGFTRRRVSVLEPKIRKIVRGILDGIERGAVHEFAEQIAAPLPTRMIA
ELIGAPPDDWEQFRAWSDAATGTADPEIELDPAVAAGQLYEYFQRLIAAR
RARPRADLLSVLAEAEIDEHRLTDEDLLNFAFLLLVAGNETTRNLIALGT
LALIAHPDQYRLLVEEPARIPLAVEEMLRWNSPVVHMARTATADVEIRGQ
RIRAGEVVVMLYGSANRDEDVFGPDSEEFDVTRHPNPHIAFGCGEHSCVG
AQLARLEATVFFEELLRRYPRIELVGEVDRMRATMVPGVKRMPVRMGA
>MAP0531 hypothetical protein
MLSVATRDELAAELAQAERSGEPIPPLTAAYPEIDVVDAYEIQLINIRQR
VAEGARVLGHKVGLSSLAIQQMMGVDEPDYGHLLDDMQLFEDTPVKTNRY
LYPRVEVEVGFILNADLPGAGCTEDDVMAATEAFVPAIELIDTRITDWKI
ELCDTIADNASSAGFVLGAARVSPQDIDIKGIDAVLRRNGEVVAEGRTDA
VLGNPVTAVAWLARKVDGFGVRLRKGDVVLPGSCTKAIDAHPGDEFVADF
AGLGSVCLSFE
>MAP0754c hypothetical protein
MKTAVVTGGGSGIGLAVVERLRADGLNVASIDLRPSDAELAFTADVTDRS
QVDAALSAIRAQLGPVTVLVNAAGLDGFKKFNNITFEDWQRVIDVNLNGV
FHTTQAVLPDMIEAGWGRIVNISSSSTHSGAPYMSHYVAAKSAVNGLTKS
LALEYGPKGITVNAVPPGFIDTPMLRAAEKNGFLGDIEETIARTPVRRMG
TPQDIAAACAFLVSEEAGYITGQILGVNGGRNT
>MAP0950c hypothetical protein
MALARSAKRWLAPAAKRLAPRRFWRRKFRILDRLGRSRPDVQLVASLCDP
HRVSLDIGADLGEFTIAMAASSRSVIAFEPRPAQARDLAAMFDAVGAAVR
VEAVALSDEPGTIAMRVVESEPGRSTIDTDNSLGDLTGDQIRVIDVPVKR
LDDLNLDDVGLVKIDVEGHELAVLRGAPETLARNRPAIVVEAEERHHRGA
VAGITRLLTGLGYSGYFELDGARRPIAEFDPAVHQDPANAGSRQDGWTGR
GPYVNNFVFLPDER
>MAP3473c hypothetical protein
MVEQSIWMQKVAADPGHSHWYIERFRAMARAGDDLAGEARFVDAKAPRGA
RILDAGCGPGRLGSYLAAAGHQVVGVDVDPALIEAAEHDHPGPRWLVGDL
AELDLPARGITDPFDVIVSAGNVMTFLAPSTRVLVLSRLRAHLAADGRAA
IGFGAYREYEFTDFLNDAADAGFAPDLLLSSWDLRPFTEDSEFLVAVLRP
A
>MAP0092 hypothetical protein
MTAQDERALDEDRAPDSVATIQEWSAGYVVRHPLASLNTLGEQYILAVRT
VEYCVVDLFTGRFQWAEFVRQGAFMAATAVLPTVLVALPIGVTLSIQFAL
LANQVGATSLAGAASGLAVIRQAASLVAAVLTASAVGSAITADLGSRTMR
EETDAMEVMGVSVIRRLVVPRFAAAIMVGVALTGITCFVGFLASYLFNVY
FQRGAPGSFVATFSSFATTEDMIVALLKAVIYGAIVAVIACQKGLFTKGG
PAGVANSVNAAVVESILVLMIVNVGISELYNTLFPRTGL
>MAP3881 hypothetical protein
MAATAEVGVTATLGAAARAVATRQGLLNDPYAEPLLGAVGIDYLTRAIAD
HTFAADESPVGDDPAVTSLLDALAAHTRFVDEFLAEAGRAGIRQVVILAS
GLDTRPYRLWWPRGTTVYEIDRPRVLDFKAGVLRGLDARLATNRCAVGID
LRDDWPAALRRVGFDAAQPTAWVAEQLLVGYLKPAEQNRLLRRLTAASAA
GSRLAADHLPTWDPLQLEAERAFVEGWRRRGLDIDLASLTHLGEYHYVPE
YLATHGWEPAARSIADLLGGLGLGPRRGAGSGGAQFIPEYVTATRV
>MAP1377 hypothetical protein
MMDLSGKKAVVTGAGGDGLGQAIANRLGGLGADIALIGRTLEKVQRRGRE
VEERWGVKTVAISADMSDWDQVHNAVREAHWQLGGLDIMVNNPVMVAGGL
FETQTKEQIDFTVLGSLSMMMYGAHAALQFLLPQGSGKIINIGSVGGRIQ
QRGLVVYNACKAGVIGFTRNLAHEVALRGVNVLGVAPGIMLNPQLKQYVL
DPQDDQERAGRAAIIEAITQQVQLGRASLPEEAANMVAFLATEAADYLCG
QTIDVAGGQWMG
>MAP3777 hypothetical protein
MRQVVVLAAGLDSRAYRLDWPAATTIFELDQPQVLDFKREVLARAGAQPR
AERREIAIDLREDWPQALRDSGFDPAKPSAWIAEGLLIYLPASAQEQLFT
GIDGLAGHGSHVAVEDGAPMKPEDFETAVAEERAATAQGDQRVFFQLVYN
EQCAPATEWFGNRGWTAVGTPLADYLREVGRPVPGPETEAGPMIARNTLV
SAVRA
>MAP2773c hypothetical protein
MSSNVIRYGPRPPEAQVDHEIDATKAPIATEAVTVIYLTEPDIVAAVLPK
PLQPADEPLVRIQLQRVRIEGMAPFGSAVFSVTARHGDRYGDYPLFMPQS
TEQSVTGGRETFGEPKKLAQITVERDGDRVTAGVDRLGYRLIRLSGQVSG
RAELPPDQMNTEFYFKFLRAPDGSGITDPHLVYGEYHRHYELLENIDGTL
ELGESPLDPVADIVIRQVTSITWCRRRTVQVGRIAARVPQEWLLPYVHQR
YDDVALLAAPRPEPARA
>MAP1649c hypothetical protein
MSVLDLFDLRGRRALVTGASSGIGKQVAQAYLQAGARVAIAARDFEALQR
TADELTAGGTGGVVAIRCDVTQPDEVSGMVDRMIAELGGIDIAVCNAGII
SVSPMLEMPAAEFQRIQDTNVTGVFLTAQAAARAMMRQGRGGAIITTASM
SGHIINVPQQVGHYCASKAAVIQLTKAMAVEFAPHDIRVNSVSPGYIRTE
LVEPLHEYHRQWEPKIPLGRMGRPDELAGIYLYLASAASSYMTGSDIVID
GGYSCP
>MAP1866c hypothetical protein
MRHKGGEVSAPENLDFFTDKSVVDDPYDYYDAIRRCPVWREPAHGVVMVS
GYDEALAVQRDTDHALSVCNIVSGPWSGIPVNTGSDDISELIERYRKKVT
FGDYFITFDPPMHTAHRSLLSRLFTPKQLKNNEDFLWRLADEQLNRFIAN
GKCEMVIDYNFPFTLDAITDLLDVPEADRERFRRAAIASRLEGDRSGFVG
VKEEWFVEYVEERRRNPRDDVLTELALAKVPDGTTPEPIDVARVATFMFA
AGHGTTIDLLSLSMLTLAERPDLQDLLREDNSKIPAFIEEMLRIESPIKS
NFRLARRTTRIGDVEVQAGTSILVMNGAANRDPRRFDEPNEFRLDRPNIL
HHMAFGRGIHTCPGAPIARAEVRVSLERILSRMADIRLSEAKHGPAGARR
LRWDPTLLFRRLKELHLEFTPIR
>MAP3760c hypothetical protein
MSLPLDAADERNRYCIQLYHRTAAQADLSGRRVLEVGCGNGGGASYLTRT
MRPASYTGVDLNQAGIAFCRKRHIVPGLDFVRGDAQNLSFPDESFDAVLN
IESSANYTSLSGNVPGRGVNSGGVSVLM
>MAP4097 hypothetical protein
MSSRKLIGVHDHRGEYGVDGSFHTVSARAQAIGIGAQAAGLAVWAGISWA
RGKRRRAALAALSSAGIAANTALYMYATRAGKFVVWDRILSDLRLAGDET
LLDLGCGRGAVLLAAAKRLPRGRAIGVDLWQADQTDNSEQATLANAAAEG
VADRVELHTADMTALPLADESVDVVVSNLAIHNIPTRAGRRQALDEAVRV
LRPGGRLAIADLWETRQHAARLRELCWRNVRRRNLGWRMWYGGPWFSTRI
VTATKPE
>MAP0073c hypothetical protein
MSEALTVDPAIASMPRGGPDASWLDRRLQTEKLEYTDRYDIPDEVKQTVV
AALDRMGTRAGYHERNARTALDIVADIANPRILELGAGHGKLSAEILTSH
PTATVTVSDLDPTSVANIAAGPLGADPRARTQVVDATAIDAPDDSYDLVV
FAQAFHHLPPATAVRAIAEATHAGKRFLVIDLPRPRSVQLLLTPLILAPF
AAVLALVRPSLKHVMHDAYISALRAYSRSAFIALGKAADPRMGVEFLPLS
ACRLRPGHVGIVFTRPRAS
>MAP2184c hypothetical protein
MPGVLAGKSAIVTGAGSGVGRVSALRFAEEGARVVAADIDLDHAKETVCQ
IESAGGTAIAIGTDVSDEQQVQAMIAAAVDQYGRLDILFNNVGIPTPRLG
MIFEDHTLEDFNRLVAVNLGGVFLGCKYAVLRFKEQGAGGVILNTASVAG
LVGWGGSVYGATKGGVIQLTRAVAIEAAPFGIRVNAICPAAMPLTGFMAA
GGLEVDAEQQAAIAESVGGQHPLGRAITAEDCAEAALYLVSDAARNVTGV
ALPVDGGFVAK
>MAP0882c hypothetical protein
MTRSEGDTWNLASSVGATATMVAAARAAATRRPRPVLTDEYAEPLVRAVG
LDVFTKLASGELDPDDLERDVGFARMVDTFAARGRFYDDYFAAAGKAGVR
QVVIVASGLDARPYRLSWPAGTTVYEIDQPEVIAFKTATLSRIGAAPTAE
LRTIGIDLRQDWPAALQDAGFDPAQPTAWLAEGVLIGFLPPEAEVRLLDS
ITPLSAEGSRFAADYGSLNDASQASTEQARRTTEGWRRRGLDMDIAALTY
PGKHTDVAAHLGADGWATTTFGLADLFAAAGLPELTEAEQGPAATLSFVR
AIKS
>MAP3740 hypothetical protein
MQDNRTVRRPAPKMMSSTPGPPALHDGDRSVPAVFAEWVGRRPDAVALRT
VAATGIDDWTYQRLWDHVREIRDVAFSGLSAGIRIPMALPGGADYVAGML
AALAAGLIPVPVYLPSTREPQRFLARAQHILRDCEPSAVYTCGELVEVLE
RDPILGALPIRTPASTADGLAPHPGGTTADADHGEHVAFLQYSSGSTGKP
KGVVNTHQSILRQAAFAANVWNGDDDMHMVSWLPLYHDMGIFWGVFMPLL
NGGCTTLIPPHDFVRNPRIWLETVSRFRGNWIGGPDFAYRRCIEAFDGTA
LQSLDLSCLRLATNGAEPVRGTTLRDFTAKFRAAGLRDDVMAPQYGLAEA
GLGVTGSQTVRVWVEKSFDADALERGIAVEVAQPNPADGRSRALVSCGDG
AFGWDIQIVDPDRHMTLTDGEVGEIWVGGPGLPDGYWRQPEQTATTFGAR
TADGLGPYLRTGDAGFRYQGELYVCGRYRDLIIVGGRNHFPNDIEKTVEE
AHCGVAPGGACAVQPDAPQANGEWWLVLETGSPVEDLDDLSRILRRRILA
HHETAPERVVWVPCRTLPTTTSGKIRRRETLNRLTAGQLEVVHEVSPRAQ
APDTPAAPDDPPTELAQHLAAMLGVEPYELAPDADLTTLGLTSMMTAQIV
EWSSSQSRRLDFADLYAEPTLRSWQRLFDAAPPVQTGTSSVAASGPWPTT
PLQQAYWVGRGAEQPLGGVGCQTYFELVGARVDAGRLAAALDALTRRHPM
LRATFPDPGRCLITPEAVRLPLAVHDLTDAPVTTRDTHLAEIRRRLRTHR
FDIETGDTWTVELTRLPHGCIVHFAVDLIIADVTSIGTMLRDLAASYRGE
KLPAPSATFADLIQSTSPPPQACADRLPEGPQLPRVQEADISFLRHQHTL
SALATKAIDDACHNHGVTRAAVLLAAYTLVLRRWASQDDFLVNVTTFGRS
PEVSDVVGDFTETHLYRAQLDGQISFVDQAQVTQKGLRTALRAAPAPDLL
ATQLRSGTGHSGIVPVVFTYAADSPLLSAEDANTLGAIDEVVSMTPQVLI
DHQACRLGDDVVLSWDYRAGCFPPGVVDDMFEAYVTLLERLGGHDWSTPA
TPGLSAHSRLARAHRNATTTPAPAGLLYDAFRENAATHPARLALRWRPDD
YRGERHGDVIAQDRSQLTYGELDELARSVARAVAARHAAGSVIGIQLPKG
PSQIVAVLGVMMAGCTYLPVGVDQPAERLSRICARSAMAGLIRTDSDTQD
AGVAVSDITAMIECAPTDPIRIDPHDAAYVIYTSGSTGEPKGVLVSHAAA
LNTIVDVNRRNRIDTHDRLLALSALDFDLSVYDTFGALGCGAQLVTIPEH
ARRDAFHWLSLTTEFGITVWNSVPGLMDMLLIAAGDKAGSLPTLRSVFLS
GDWIPLDLPRRLRRAAPGVRLVAMGGATEAAIWSNEFVVDDVDPDWASIP
YGYPLANQMFRVVDDNGDDQPDYVAGELWIGGAGVALGYHNAPELTSDRF
VHDPTGSRWYRTGDMGCYWRDGTLQFLGRADSQVKIRGHRVECGEIEHAL
RGHPLVAAATVVPIHNCTALGAGIVVTGSGAEQFDDSTPGALRAHLAVRL
PQYMIPKVFVSCPELPLTANGKVDRGKIAARLEAAARAPQPLDTSSTLTV
VERLVAEVWSDVLGAPITGREDNFFAQGGDSLRATEAVARLTRRGVAGAE
VGQLLSHQTLGQFSAACVLADPASEASESAADVGEPVTPGEGFPLTRLQQ
AYTLGAAGLNGSTCAPTYFAVVLAAAPESAGIDLDRFARVVTRCVDEFAM
LRCALDADTTQRVQVDAGPVPVHDLDIQDDPDLLLRRMAAAPFDPHSVPV
IQCFAPSRSPRHVGLLISYLGLDARSLSTVVTTIIAEYQSQPRPRQVDPT
AAVFARFASESAWGENDVDNSVAGPPLLPLHDQRRDPFERVTFARRSFTI
EEQAAATLREHAAHLGVTPTALVFEAFAHALASIGAGQRFAVTVPKSYRP
DYAPADREVLGNFTRLALCEVDYGAVRPGSAEAVAAAQRELWRAVSHDGD
ITGGLAATRTAGGYPVVFTSTLGLTHQDASGLTNVRTLTQTPGVWLDCQT
EDEVAGIRMSWDIATNVVAAESISVAFSRFEEAVRRHAGQAEPPGTAVAP
AVGGSPGPEWASAVIAAALRHCRPEQVLPQYTMLVRRWEALRYVPSGYAA
SDVERAARRLAGIVTGAVSPQTLIGDPQLTPEALLLRDDRMRMALDDLAG
AIFGHARTLGRRLRVVEVGSRTGLITERLTELVGVVVEEYLCLEPNPTLA
GIAAGRRFPAPTRHVDAPDAASGVDVVICCGSLHQLPDAEAVLEAITVSD
DGWLWMVENSEATQATLISAAVLDPGLLASDSKTLRPADRWWRLIADHGW
RPTHMIQDGPGLTLIAHRPDKPGMPTPPAEQRRDGRWSRPAVPASSLPTD
ATVVATLAEIWQRHLAIPTPGVDDDFFLLGGDSLVATRVYADLRAAGFGQ
LAFVDLFNHSTLGELAAHAGPRTGPEVSVAAESTRGGTHDPNRFPLTVVQ
NAYRAGREGALILGGVAAHCYFEFELADFDRPRFDSAARQLVARHAGLRT
TVSPAGTDAASSGEVAVVHTAPIEPVVRDHDDVRAAMRDQIIDLTARPGI
DFGVQTRGDGRTVVGISMDNTMLDGASMMIALSELDHLYRGETVDQLPPL
ETSFAHYVWNHPELLPDADEAVLPRLAASRDYWRARLPSLPPAPKLADMS
LLFEIEEPRFERATATIPAVDWSQVTRSCRAEGVTVASFLLANYARVLSR
WSGTDHFCINVTLFDRDPDVVGIENVVGDFTSLVLLECRVDEPASIWESV
RALQRQLMTDLPHRGADAVWLQRELLRFHGNPTAALFPVVFTSGLGLVDA
SARAAVRFAEPVFAASQTPQTVLDFQVWESAGALKLSWDFVSQAVSPATA
RTQLESLVDGITGVATRSRRIEHKLGEGASNDELLQRVSRICASALGQPR
VEPHDNFFQLGGDSVSATKVVEQIGRELSASATLRLLFANPVIGDFAAKI
ADTDNADEPDLTVEEGML
>MAP1849 hypothetical protein
MTATLTTPLDGVDGVAAIRNWSVGYVKRHPVASLTTVGEQFVLGVRTIQY
FFYDLITGRFQWQEFVRQGAFMAGTAVLPTILVSLPISVTLSIQFALLAG
QVGATSLAGAASGLAVIRQGASLVAAVLMASAVGSAITADLGSRTMREET
DAMEVMGVSVVRRLVVPRFAAAVMIGVALTGVVCFAGFFASYMFNVYFQN
GAPGSFVSTFASFATTGDMILALLKAIVFGAIVAIVSSQKGLSTKGGPTG
VANSVNAAVVEAILLLMIVNVAISQLYIMLSPRTGL
>MAP2747 hypothetical protein
MPAAETRETLAGIVERHAQRRPDAIAIRYGERQWSWAEWSSRIRRAAGAL
RGAGIQRGQCVAFLDKNHPACLEVLIGGASVGAVTTVVNWRVIGDELVHV
LADSGARVLVVGAELRPAAEAAARRVPSLERIIEVGDEYESLLAAAEPAP
SDAGVDTDETALVIYSSGTTGRPKGVLLSQRALVNHAANLAPAFPFGDGD
ANLVAMPLFHVGGIGYALFGIRAGAPTIMTREPDAAALIGAVRAGATHAF
FVPPVIARFLDAGEAARASIAGLRYIVYGAAPMPLPLLHRALSTWPGTKF
VQVYGQTELCGAVTALSDDDHRDAARPQLQLSAGKAVQGCEIRIVDPNSC
AELPAGRSGEVWVRSNQNMSGYLNRAEATAETITADGWVRTGDVGRLDAD
GYVYIEDRLKDMIITGGENVYGPEVESVLIEHPAVVDAAVIGVPDDFWGE
SVKAIVVADGDVDAADVIEFCRRHLAGFKCPRTVDFVAELPRNASGKILK
TQLREPFWRDRDRRV
>MAP3503c hypothetical protein
MADMQVAIVTGASSGIGLGCATRLAGTGMAVLGTGRDPKRLAELETAIGD
PDRVATVAVDLTDDDAPRRIVDRALQRWGHIDFLINNAGVGSPKPLHETD
DDTLDYFLNLMLRAPFRLAREVLPHLPPGSAIINVTSTFAVVGGLRGGAY
SAAKGGLTALTTHIACQYGASGIRCNAVAPGVTVTPMVEKRLQDPRFRKI
NTEMTPHQRLGSVDDIAATVAFLCSPGGSFINGQTIVVDGGWSSTKYLSE
YALTSEWIAP
>MAP0238c hypothetical protein
MEIVASGPGFAAELRGVTIADVAASPDVYAQVRAAFEEHSVLIFRDQHVS
DEAQLAFSRRFGPLEVTKVGAVGRGSHLVVLKTLDDDGNVVPTDHRLALE
NKANQLWHTDSSFKRVPALASVLSSRIVPGRGGETEYVSTRIAFERLDPG
LRERVENSFAWHEYAYSRGKIAPDLARPEERAALPPQCWRLVWRNPVNGR
KALYLASHAYGIEGMEPAAARELLAALTEAATAPGASYLHSWRAGDVVMW
DNRATMHRGRPWPAHQPRYMVRSTIAATAADGLEAMYPPWHAVAR
>MAP0709 hypothetical protein
MAIDPSDILLTDRVAVVTGAGAGIGRGIAAGLAAFGARVAIWERDAQTCT
RAAESIGGLGIVTDVRDSGQVDAALQRTITELGTPAILVNNAGGVFSSPL
LETSENGWDALYRANLRHVLLCTQRIARQLVSVGAGGSIISLTSIEGVRA
APGYAAYAAAKAGVINYTKTAALELAPHGIRVNAIAPDITLTEGLEQLGG
EAATTAMGNIVPLGRPGHVDEIASAAVFLASDMSGYLTGQTLHVDGGTQA
SSGWYHDPRTGDYRLGPAG
>MAP4064c hypothetical protein
MTVPNDDTWGPATSVGTTATMAAAARAIATRDGVINDPFAEPLVRAVGVN
FLTRWAIGELVASDVDVEGSPWGLAQMPASIAARTRYFDEFYADAAAAGI
RQAVILASGLDTRAYRLDWPAGMTVFEIDQPAVIEFKTTALARLGAEPKA
DLRTVAVDLRDDWSTALATAGLDSSKPTAWIAEGLFGYLAPEAQDGLLDA
VTALSTPGSRLGSEAVPNTADMDPHAARERMRAATAKWRDHGFELDVDVI
SFAGERHDVGAYLQAHGWTTVATPMAELLADHGLPAIARADDDRQTMNGV
TYYTSTLGTGRQR
>MAP0336c hypothetical protein
MTVTNDTAVYYDPYDIGIITDPYPTYARLREEAPIYYNERYDFWALSRHS
DVERALANWQVFSNRRSDILELIQSKFDMPGGVMMFQDPPEHTVLRGLMS
RVFTPRRMAALEDQIRQYCIRCLDPLVGSSSFDIIAELASMMPMRVIGML
LGIPESEPVSVRDANDANLRTKPGAPLRVADADSIADGRIYADYVEWRSK
NPSDDLMTTLLNVEFDDEDGVRRKLTRKEVLHYTQVVAGAGNETTGRLIG
WLAKVLAEHPDQRREVYRDRSLLTRTVDETLRFEPTGPHVARWMAADFEC
YGTTVPAGSAMLLLFGAANRDPRRYTDPDTFNIHRDNISHITFGKGVHYC
LGANLARLEERVALDELLNRWPEWDIDYDTAQLASTSTVRGWERLRIVVP
>MAP4078 hypothetical protein
MPRTDDDSWEITESVGATALGVAAARAAETESENPLISDPFARVFLDAAG
DGMWNWFAAPNLPAQIAEAEPDLKPRMQGMVDYMAARTAFFDNFFLAATH
AGVRQVVILAAGLDSRAWRLPFEDGTTVYELDQPRVLEFKATTLAEHGAR
PTCHLVSVPVDLRHDWPAALRQAGFDAHAPSAWSAEGLLPFLPAAAQQLL
FERVQTLAAPGSRIAVEAPGPDFIDEAARERQRQTMQRVRDLMADLEPDR
DIPDVQDLWYFEEREDVGDWLGRHGWDVTVTPAPELMARYDRRPPHDIED
AIPQTRFVAAQRTERTRPDR
>MAP1489c hypothetical protein
MGRVTGKVAVISGAARGQGRSHARMLAAEGADIIAVDLCADIETNEYPLA
RPEDLDETARLVEKEGQRAITAVADVRDRVALSAAIDAGVAEFGHLDIVV
ANAGICPLTAGLPPQAFADAVDVDLGGVLNLVHASLKHLRAGASIIVIGS
NAAFMSSLNTSGAGSGIGGPGGAGYAFAKLAAAHYVNDFALALAPFSIRM
NAVHPTNVDTDMLHSPPMYRAFRPDLPAPTREDAEPVFPLVQAMPVPYVE
PEDISEAVLFLASDAARYITGQQLRVDAGGFLKVKPWSVG
>MAP3940c hypothetical protein
MTYDLIIRNGSIVDGLGGEPYVGDVAVRDGVIAAVGAVNGATANREIDAT
GRLVTPGFVDLHTHYDGQAVWSERLTPSSAHGVTTVVMGNCGVGFAPCRQ
SDHDVLVDVMAGVEDIPGVVMTDGLPWTWETFPEYLDTLEAGKRDIDVAA
YLPHSPLRVYVMGQRGADREPATAEDLAKMRALAKEAVEVGALGFASSRL
TIHKTESGSPIPSYDAAREEIEQIARGVVDGGGGLLQFVPDIPADGYQPV
LQTVFDVAEDVGLPLTFTLVVANSGDPTWPDAITMIEKANAAGGDITAQL
LPRPIGLIIGLQLSANPFVLYPSYREIAHLPLAERVAQMRKPEVRARILA
DKPGEGHPILYVAQMWDWIYPLGDNPDYEPDPSTSIAARARARGVDPMEE
AYDRLLDDDGRAMLLVATSNLQGNSLDTVGELLHRDDVVLGLGDGGAHYG
MICDASYSTYFLTHWARDRKSGRFSVADAVRRLTSVPARVAGLGDRGRIA
VGYKADLNVIDHAALRLHKPVISHDLPAGGRRLDQTADGYVATIVSGEII
AENGVPTAARPGKLVRGRRPGPAPLR
>MAP2982c hypothetical protein
MRWHLRGRSLPDEGPIELWVVDGRISTEPVAGADTVFGASGGGWIVPGLV
DAHCHVGLGEHGEIPLDEAIAQAEIERDVGALLLRDCGSPTDTRSLDDRD
DLPRIIRAGKHLARPKRYAAGFSRELDDEWQLPDAVAQEAKRGDGWIKLV
GDWIDRSVGDLAPLWSDEVLKAAIDTAHAHGARVTAHVFSEDALPGLINA
GIDCIEHGTGLTDDTIELMVSRGTALVPTLVNVVENFPGIAQAAAKYPTY
AAHMRDLYARGPSRIAAAREAGVPIYAGSDAGTMVAPGRIADEVEALKGI
GMTATQALGAACWDARRWLGRPGLEHGASADLLCFAEDPRSGPAVLRNPD
LIMLRGNIFRSPA
>MAP1420 hypothetical protein
MKRGDRAYPVTRGQLDIWLAEQTGHLDVAWQLGVLVRIDGAIDPALLHQT
MRHVVGEAESLRASFFEADGQVFQKAVEYSDVDLTFYDLSGSSDPEREVR
EMTASIQRTPMPLTGPMIKFALFRTGSAEYYWFTTCHHIAIDGMGIALVG
RRIAAVYTALASGKPIPPAFFGSLQDLVGGELEYEASAKFLEDKDYWLAH
RPGDGTAGHPPRPADDGRDPYSPSPPVQLDESVIGSVKELSKALGIRRSS
VLTAACALLVRGWCADGSDEVVLDFPVSRRVDPKSKTHPGMLAGVVPLVL
HAPAAATFADFCRHVDQRSREALRHQQFPTRTLDGEGDFSGPRQAPNRVV
VNFVPARLTLSLADVPATATYTSFGPVGHFGLFFLGFGDQQFLSTVGTGQ
PLANFDATDLAERLQRILAAMAADPARLLSSLDVLRDPEHAQLEALGNTA
VLTRTPGPAVSVPELFATQVARAPQDVALVCEGRSLTYRQLDEASNRLAH
LLAGLGAGPGQSVALLFSRSAEAVASILAVLKTGAAYLPIDPAAPETRIG
FMLADAKPVAALSTAELAGRLEGHGMTVIDVNDPRIQDRPATALPVPAAD
GVAYVIYTSGTTGVPKGVAVTHRNVTQLLGSLDAGLPPAGVWSQCHSYAF
DVSVWEIFGALLRGGRLVVVPEDVTRAPEELHDVLVNEQVSVLTQTPSAV
AMLSPQGLESVSLVVVGEACPAEVVDRWSPGRVMVNAYGPTETTMCVAIS
APLAPGMGSPPIGVPVDGAGLFVLDAWLRPVPPGVVGELYVGGAGVACGY
WRRGGLTASWFVACPFGAPGARMYRTGDLVCWRSDGQLDYRGRADEQVKV
RGYRIELGEVQAALAALDDVDQAVVIAREDRPGGKRLVGYITGTADPAEV
RTALAQRLPVYMVPAAVVALDAIPLTPNGKLDTRALPTPEYTGSRYRAPS
NAVEETVAGIYAHVLGVERVGVDDSFFDLGGDSISALQVVARARAAGLTC
RPRDVFVEQTVARLARVVGSGDRAAEVADEGVGPVPPTPIMRWLQAAERA
GGATDQFNQTVLVQAPAGVTETEVAIVLQALVDRHAMLRLRVTDDGADGW
SFEVPEAGSVQARDCLRSVDALSDEALLAARARLNPAAGTMLAALWVEAT
GQLAVIIHHLAVDAVSWWILLEDLNIAWALHRAGQPVELAPAGTSFARWA
RLLDEHARDPEVVGQLDRWKTVTSTPAALPAPRPDVDTYASAGRLSVELD
AETTAMLLGEVPAAFHAGIHDILLIAFGLAWTEFLGEPGAPIGIDVEGHG
RHEELGADIDLSRTVGWFTAKYPVSLDVAGLRWPQVAAGDPALGPVLKRA
KEQLRTLPEPLTYGLLRYLNTDVDLAGADPPIAFNYLGRQGAASDSAADG
WRISQDMSLLGAAAAVPMPLMHAVELNAGTIDTGAGPHLHAEWTWAPSVL
GAEQITRVSRLWFEALAGVCAHVRSGGGGGLTPSDIAPARLTQQQIDELQ
SRHRIADILPLTPLQQGLLFHSSTAQGNDGMDDMYAVQLDFTLTGPLDAD
RLREAVRTVVHRHPHLAALFCDQYDEPVQIIPADPAVEWRYVELDGTGAA
DADDLIEQLCAAERAAVADLAGQPVFRTALVRTGGDRHRFVLTSHHILLD
GWSLPILLREIFAGYYGQRLPAAGSYRAFLTWLAERDLDAARRAWGEVLS
GFDTPTLVAPEGRLGQGRRGFEKSCVPEQTTRALGELARSCHTTLSTVLQ
AAWAVVLTSLTGRHDVVFGTPRSRVGQLEVDDAEQMVGLLINTVPVRAEI
TATTTTAQLLAQLQNSHNDTLEHQHLALNEIHRVTGHDQLFDTLFVYENY
PIDSGMTLGADGLAIAEFTNREYNHYPLTVEALPGPELGLHIEFDTDVFD
TASIESLVQRLQRVLVAMSTDPDRRLSSLDLLDRGERELVLSTMSGAGVS
APIGVAPQLLAAAVAADPDAPAIVDGARELSYRELDDWSTRLARKLIQHG
VGPEHAAGVAIERCAELVVAWWAVTKVGGVYAPVNLDHPVERIASVLDTV
NAVCVLTCGTDEVAGAGPRPILRIDGLDLSGHSTEPITDADRRSPLRADD
TAYLIFTSGSTGVPKGVAVSHTGLLGWAAAQRELFGLGADARVLMVASPT
FDASVGELLLAAGSGAALIVAPPQVYAGEALTALLHNQRVGTAILTPTVI
STLDRGRLDGLHTLVAVGEACLPELVDGWAPGRQMFNGYGPSETTIWVTC
ARLTAGHPVRIGAPIPGVCARVLDGWLKPVPVGVVGELYLSGPALGHGYL
GRVDLTAERFVANPFGGPGERMYRTGDLVRWTPEGTLDYLGRADNQIKLR
GQRIELGEIENTLLACPQVTQAAVTVQDSAAGSQLVAYVTLDHGPSDADV
RHDTDDADDVAQWRHLYDDLYGADLAATFGEDFRGWNSSYTGEPIPLQEM
AEWRSATVDRIMSLRPRRVLEIGAGSGLLLSQIAPRCDRYVATDFSAVAI
DNLARSMEQLQLPWRDRVELLTQPAHVTDGLPPGHFDTIVINSVVQYFPN
AGYLADVIDNALELLAPGGSLFIGDVRNHALQGAFQTGIALARGGGADAA
EIRQRVRHAMLGETELLLAPEFFTNWADSRPAAAGLDIQLKRGLSDNELN
RYRYDVVIHKAPAPVRSVAAAPTWSWTDCTDCAGLRDQLAARRPAVVRVT
DIPQAGVIDDVRVEAALAAGLPVADALAAAGSDTAAAVAEELHRVGEATG
YRVAVTWGAQPGTLSAVFVQDGDQAAEPLTDLYLPPAGARQRTRHANDPR
ANTKIAQVRERLNAWLPEYMVPTHIVALDEFPMTTSGKLDRKALPAPDYQ
DADRYRAPSTAVEEILVGIYGQVLGLERVGVDDSFFDLGGDSLSAMRLIA
AVNASLNTDLGVRTVFEAPTAAELALRVGSEADRPEPLVAGERPAVIPLS
FAQTRLWFIDQFQGPSPMYNITVALRLSGRLDADALRAALADVVARHESL
RTVFATADGTPQQVVIPADRIGFACDVVDARGWPEDRLREAMSAAARYTF
DLSAESPLHTELFARGDDEHVLVVAVHHIAADGWSITPFARDLGVAYASR
CAGRDPDWAPLPVQYADYTLWQRAHLGDVDDPGSRIAAQLDFWTDALAGL
PERLQLPTDRPYPAVADHRGARLAVDWPAELQQQLRRVAREHDATSFMVV
QAAFAALLAKVSASSDVAVGFPIAGRPEPVLDELIGFFVNTLVLRVDLNE
LGGDPTFAELLAQVRRRSLAAFEHQDVPFELLVERLNPTRSMSHHPLVQV
LLGWENFPGEVTAPAAGLALGDLQVTPMPLHTNTARMDLTFSLAERFTES
GQRAGIAVTAEYRTDVFDGRTVEGLIERLQRLLTAVTADPQRRLSAVDLL
DANEHARLEKWGNTAVLARPATPVSVPARFAAQAARTPDAVALTCDGRSM
TYRELDEAANRLAHFMIHHGAGPGERVALLFPRSAEAIVAILAALKSGAA
YLPIDPALPAARVEFMLTDAAPIVAVTTAALAERLHGFDLTVIDVADPAV
ATQPATAPPVPDPDDVAHIIYTSGTTGVPKGVAVTQYNVAQLFDDLRIGI
ELSPRQVWTQFHSYAFDFSVWEIWGALLHGGRLVVVPETVSRSPNEFHDL
LVREHVTVLTQTPSAVGLLRTDGLDGTALVIGAEPCPPELVDRWAPGRTM
VNVYGPTETTMWACKSAPLTAGSGFPPIGAPVTRAAFFVLDDWLRPVPPG
VVGELYLAGDGVGVGYWRRPGLTAARFLACPFGEPGTRMYRTGDLVCWGP
DGQLRYLGRADEQVKVRGYRIELGEIQAALSALDGVEQAVVVAREDNPGD
KRLVGYVTGSVAPAKARAALAERLPAYMVPAAVVVLDSLPMTVNGKLDTR
ALPAPDYWHTGGYRAPESPTEEILAGIYAEVLGVQRVGVDDSFFDLGGDS
LTTMRLITAINSALDTDLPVRTVFEAPTIAQLAPRIAQSAGGLAPLVAAG
RPDVVPLSFAQNRLWFIDQSQGPSPLYNMAAALRLRGRLDAGALGAALGD
VVARHESLRTVFPSHQGTPRQLVVPAERAEFGWDVIDATDWPADRLDDAV
QDVTRHTFDLAAEIPIRAKLFAVSEDEHVLVIVVHHIAADGMSLTPLGVD
LSQAYASRCAGHAPGWADLPVQYCDYTLWQRAQFGDLNDPDSRIGTQLAY
WEDALAGMPERLALPTDRPYPAAADQRGDSVAVDWPAELQQQVRRIAREH
NATSFMVVQAALAVLLSKIGASSDVAVGFPIAGRRDPALDQLVGFFVNTL
VLRIDLTGDPSFAELLALVQARSLAAFEHQDVPFEVLVERLNPTRSLTHH
PLVQVMLAWQNFAGHDDPAAALALGDLDVTSVPVHDQSARMDLVFSLAER
WNPDGEFAGIGGRVEFRTDVFDAATIETLIERLRRVLEAMTGDPGRPLSA
VDLLDDAERAYLEEVGNTAILTRPASGRVSVPELFATQVARVPETVALVC
DDLSVTYRQLDEASNRLAHRLAAAGAGPGQTVALLFSRSAEAVAAILAVL
KTGAAYLPIDPSAPQTRVEFMLGDAEPIAAVTTAELAQRLAGRPVTVVDV
DDPGIDTLPNTALPLPDPDGIAYLIYTSGTTGAPKGVAVTHHNVTQLLGS
LDAGLPSPGVWSQCHSLAFDVSVWEIFGALLRGGRVVVMPEAVARSPHDL
HDALIARHVTVLTQTPSAVAMLSPQGLESVSLVLAGEACPPEVVDQWAPG
RVMVNGYGPTETSMCVSISAPLTAGSGIPPIGSPVDGAALFVLDESLRPV
PPGVVGELYVAGSGVAAGYLGRPSLTAARFVACPFGAPGARMYRTGDLVR
WRADGQLDYLGRADEQVKVRGYRIELGEIQAALSALDGVEQAVVVAREDN
PGDKRLVGYITGTADPAEARARLGERLPAYMVPAAVLGLDAIPLTPNGKL
DARALPAPDYAAGEYRAPESPTEEILAGIYAEVLGVQRVGVDDSFFDLGG
DSISAMRLIAAVNAALNADLPVRTVFEAPTVAALAPRIGEGGSGLEPLTA
GERPTVVPLSFAQNRLWFLDQLQGPSPVYNMAAALRLDGPLDTEALGAAL
GDVVARHESLRTLFAAPEGRPQQVVLPAERADFGWEVVDASGWSADQLDE
AIGATARYTFDLAAQIPLRAELFRLRDDRHVLVAVVHHIAADGMSITPLV
RDLGAAYARRCDGRGPDWTPLPVQYVDYTLWQRAQFGELADSGSRIAAQL
AYWQDALAGMPERLALPTDRPYPLVADQRGATVEIDWPAELQQRIGDVAH
RHNATSFMVIQTALTVLLAKLGANPDVAVGFPIAGRRDPALDDLVGFFVN
TLVLRVDAAGDPSFTELLARVRTRSLEAFEHQDVPFEVLVERLNPTRSLT
HHPLVQVMLAWQNFAGQDTGPAAGLSLGDVEITPIPVDTHTARMDLTFSV
GERWCESGEPGGIGGTVEFRTDVFDPDSIQTLIGRLRRVLEAMTDDPTQS
VWSVDLLDAGEHARLDTLGNRAALTGPPPRFDSLPTLFAEQAARTPDAVA
LVCGGRRMTYRELDEAANRVAHLLRVRGAGPGHTVALLFSRSAEAIVAIL
GVLKSGAAYLPIDPALPGERIGFMLADAAPMVAISTAELAPRLHGQHDVP
VIDVHDPAIEAAPSSALPPPGADDIAYLIYTSGTTGVPKGVAVSHRNVTQ
LLTADSGLPREGVWSQWHSLAFDVSVWEIFGALLHGGRLVVIPDSVVRSP
DDFHALLLDEQVSVLSQTPSAAGTLSPEGLEDLTLVVAGEACPAELVDRW
APGRTMINAYGPTEATVYTAISAPLQPGSPAGVPIGFPVPGAGLFVLDES
LRPVPPGVVGELYVGGAGVACGYWRRGGLTASWFVACPFGAPGARMYRTG
DLVCWRSDGQLDYRGRADEQVKVRGYRIELGEVQAALAGLDDVEQAVVIA
REDRPGGKRLVGYITGTADPAEVRTALAQRLPVYMVPAAVVALDAIPLTP
NGKLDTRALPTPEYTGSRYRAPSNAVEETVAGIYAHVLGVERVGVDDSFF
DLGGDSISAMRVITAINASLGVELAVRTLFEAPTVASLSWRAQTDTARGG
QAEEIVPVQTLKEGTGAPLFCIHAAGGLSWSYQVLGNHLDCPIIGIQQAE
PQHAAPRSIREMAQSYADRIQETYPDGPYHLVGWSFGGVVAHELAIELQR
RGCAIARLVLLDAQPGLDGSVTAPDAALAEQHMMEEALRSHLAAADHDQP
HAHRQFNQLVREAGAEGMSRHKRLFDVLFGNARNNIERSKIHEPGVFLGD
VTIFSAVRDHEDRSAFLAENWRPYVAGDIVIHEIDCTHDEILNADVVDSY
GQRLGQLLGAQRRRELTPPQRFGADPGDDEPPVR
>MAP2400 hypothetical protein
MPRHPLVQRIADVLDLDPHGRAIEYGGQWFSWAQLGATARQVAARTTGTE
VGMLLRNRPWQVAAFLGVLLGGGTVVVVNPSRGDERTRADLARLRLPLII
GEPDDLAALVTDDTPTMPISRLADPPGPAAPPAAAPRSVAVRMLTSGTTG
PPKRIDLGYDMLARSVLGVEPGTAPAPTEPRRDVAIVNSPLVHIGGVFRV
LQCVAEARPFVLLERFELNAWTAAVRRHRPRAVSLVPAALRTVLHSDLPR
ADLESIQVVTCGTAPLSADDADAFTEKYGIPVLTSYAATEFGGGVAGWTL
ADHRRYWRVKRGSVGRANPGAALRVVAEDGTPLGPDEIGLLEVKPGQLGP
DAGWLRTTDMARIDADGFVWIVGRADQAIIRGGFKVMPDDVRAALESHPA
VAGAAVVARPDPRLGETPVAMVELRAPSTTDALVQHLRERLARYEIPTEI
AIVDALPRTPSGKADLAAVRGYFAERPTVLRNHAR
>MAP1852 hypothetical protein
MRFRGPLIALTLFMIVSLTLTWLVYVSLRRDVAGDTARYSAVFSDVYGLR
EGDDVRMAGVRVGRVEKIELDGKLAKVSFVVQSEQRLYGNTLASVTYQNI
VGQRYLGLSLGKEGNPAQLPPGSTIPLERTEPSFDVTTLLNGYEPLFSLL
NPHDADNLTKGIIASLQGDTSSLTTLVGQTSTLTQTFAGRDQALGNVITN
LNKVVGNLAAQNDNLDGVITQTRSVVGELDRRRPDLVASVGSLARLSDRL
SASAADVYPALREFIDRQPGVTKHIMDVEPQVAFFGDNIPLLLKGLTRVG
NQGAYGNAYVCDVNFMGFFPGLNDVVPIIINAATPGNRAWHTPRCRSTVD
G
>MAP0567 hypothetical protein
MMQRLAGSRGLRYTTIIALVAVLVGGVYVLTSQAKTRKIVGYFTSAVGLY
PGDQVRVLGVPVGTIDTIDPRPTDVKITMSVSQDVKVPKDAKAIIMSPNL
VAARFIQLTPAYTGGPALADGASIGLDRTAVPVEWDEVKQSLTQLAVQLG
PTAGSMQGPLGAAINQAADTFDDKGESFHSALRELSQAAGRLGDSRGDIF
GTVKNLQILVNALSSSNEQIVQFAGNVASVSQVLADSSRHLDTTLGTLNK
ALSDIRGFLHENNSTIVDTVNNLNDFAKTLSDQSDNIEQVLHVAGPGIAN
FYNIYDPAQGTLNGLLSIPEFANPVQFICGGSFETAGGPRAPDYYKRAEL
CRERLGPVLRRLTVNYPPLMFHPLNTITAYKGQIIYDTPETQAKSATPVP
QLTWIPAKGAQTPPAAQNPADLQALLVPTAPQSGPAPAGGAPAPGPAPGS
AFGPRPGPPPGQAPGGLGADLGGGG
>MAP2518 hypothetical protein
MSHNVPVAFELPNADTWADPWPMYRALRDHDPVHHVVPPKRPEHDYYVLS
RHADVWAAARDHETFSSAKGLTVNYDDLELIGLQDNPPFVMQDPPVHTEF
RKLVSRSFTPRQVEAVEPKVRDFVVERIERLRAAGGGDIVAELFKPLPSM
VVAHYLGVPEEDRAQFDGWTEAIVAANTADGGVAGALGSAGDAVTSMMAY
FTGLIERRRTDPEDDTISHLVSAGVGADGDIAGTLSVLAFTFTMVTGGND
TTTGMLGGSMPLLHQRPDQRQRLVDEPELIPDAVEELLRLTSPVQGLART
TTRDVTIGRTTIPAGRRVLLLYGSANRDERQYGPDAGELDVARCPRNILT
FSHGAHHCLGAAAARMQSRVALTELLARCPDFEVDESGIVWAGGNYVRRP
LSVPIRVKS
>MAP4191c hypothetical protein
MPAYVISKGCRESREAYSGGSLLYRLNISLATTSTISDRGAHMTSTRYEG
DTWDLASSVGVTATMVAAARAMATRADNPLINDLFAEPLVKAVGVDLLSR
LAGGELDPAELNDVHDGAAGSAGAMSRMADNMAVRTKFFDEFFLNATKAG
IAQVVILASGLDARAYRLAWPAGTVVYEVDQPQVIDFKTTALAQLGAAPT
AERRVVAVDLRDDWPAALRAAGFDPARPTAWSAEGLLGYLPPEAQDRLLD
TITELSAPGSRLATESAPNPAPGEEEKLKERMQAISQRWRAHGFDLDMAG
LVYFGERNEAAPYLAGHGWRLNSVTIRDLFAANGLDPLDDDDTRMGEMLY
TWGIYE
>MAP4146 hypothetical protein
MAGQAGSLQGRVAFITGAARGQGRSHAVRLAAEGADIIACDICAPVSASV
TYAPASPEDLDETARLVEDQGRKALTRVLDVRDDAALRELVADGMEQFGR
LDVVVANAGVLSWGRVWELTDEQWDTVIGVNLTGTWRTLRATVPAMIEAG
NGGSIVVVSSSAGLKATPGNGHYSASKHGLTALTNTLAIELGEYGIRVNS
IHPYSVETPMIEPEAMMEIFARHPSFVHSFPPMPVQPNGFMTADEVADVV
AWLAGDGSGTLTGTQIPVDKGALKY
>MAP2030 hypothetical protein
MATRPNVSPDAIVDFDHHSDAFNLNELAVNAELRQRCPVAWNENYGGFWF
LSSYDAVSQTARDGDTFAHKYEPNAADGVDYQGEMGVPRPEGQPALGLGE
VDGPYHQALRHALAPFFSPGAVEKLKPFMEHSAHWFLDQQITTGQMDLVL
DYASPVPAILTMKLMGLPYDNWHLYANLFHSVMAVSQDSDEYAAAIAKVP
AMMQEVLDYAATRRAKPEEDLTSFLIRFEFDGHRLTDEQLLNILWNLIGG
GVDTTTSQTALTLLHLGTHPDLRQQLIDHPELYRTATDEFLRYFSVNQTL
SRTVTHDVVLAGQRLRKNDRVVISWLSANHDENEFHRPDEIILDRAPNRH
VAFGLGPHRCIGSHLARLMSEVMVRAVLVRIPDYQVDVDNVHQYLGNPSM
TGLGQLPVTFAPGRSRKALRPW
>MAP1324 hypothetical protein
MVLADVQAPSRYWENIAGETDLDVTDITGTLPDGLVGTLYRNGSGRWTVG
STQVESIFDADGMVSAFVLDGSGVRFRNRFVRTRHYLRSTAAGRLVDRGF
AYQRPGGPRANALRLPANTANTSVMVHRNQLLALWEGGPPHELDLDTLDT
IGPCNLGGALRGPVRAYSAHYRYDPMTNTKVNFGFDPYVPRIDPGHALRG
PRRLRRLRELAAEAVPRVRLRLYETGADGVTRYLRAVPLPGMGVVHDMAL
TTRYAVFVLSPLRINPWALTGRQSYWESIRFKPDAPSYFVLAPRDGGAVR
VVETDPFYHWHFTNAYDDGDDVVVELPRFAPQTYAGMKNYTAHIRSDIAQ
VDDYGAIEDAVVLTRFRIAASGRVTREPLADFGCEFPQIDPRRGTMRHHV
SYLTVQDPAKFPGQGIARVDHRTGEAQTYCPPGQVLVEPIFVPRPGGTAE
DDGWLLTVGYDESRHRSRLMVVDAAHVADGPVAEAWLPFHVPMSYHGTFT
TRVAQRTG
>MAP1381 hypothetical protein
MRGLQGKTFIVAGGSTGIGAATAERLASEGAAVTVGDINIEGANATVGRI
TQSGGRAIAVEFDLADEASVRNLVQKTIGEFGALHGLHNVGSDLSAENLG
RDTTLLDTDFDVWRRTLDVNLLGYVRTSRAVLAHLLQQGSGSIVNTSSGG
SLGTDPMHVAYNAAKAAVNQLTRHIANNWGAQGVRCNGVMPGLVMGETQK
QQNDIQLQQMFLQAAKVTRLGEPRDIAAITAFLLSDEAEWINGQVWYIGG
ASHMRQ
>MAP4128 hypothetical protein
MRAEVTAFDLPVTGRIPDHLDGRYLRNGPNPVAEVDPATYHWFSGDGMVH
GVALRDGRARWYRNRWVRTAHVCAALGEPAPGGLDPRAGMLSVGPNTNVL
GHAGQTLALVEGGGANYRLTEDLDTVGTCDFDGTLFGGYTAHPHRDPRTG
ELHAVSYSFGRGRRVQYSVIDTAGRARRTVDIEVSGSPMMHDFSLTDEYV
VIYDLPVTFDPVQVMPADVPRWLSAPARLVVQSLLGRVQLPGPMATAMNR
NRQPLHRMPYRWNDAYPARLGVMPRDGGNDEVRWFDIEPCYVYHPLNAYS
EIRDGAKVLVLDVVRYSRMFDRDLRGPGDTRPTLDRWTVNLTTGAVGSEL
RDDRSQEFPRINEALLGRRHRFGYTVGTDGGYPSDGASEMSTGLYKHDYA
TGSRRVAPLDPDLLIGEMCFVPNPAGGQDGAEDDGILMGFGYHRGRDEGQ
LVLLDAQSLEQVATVHLPQRVPMGFHGNWVPAG
>MAP1449c hypothetical protein
MSGLSGRVVLVTGAGRGIGRSHCQRFAEEGADVIAVDVPAAAPDLAQTAA
AVQQRGVRAATALADVSDFAALAAAVDEAVGRLGRLDVLVGNAGIHPAAA
PAWEITPQNWQQTLDVNLTGVWHTVKAGVPHMSRTARGGSIVIISSTSGI
RGTPGAAPYSASKHAVVGLARTLANELGPQGIRVNTVHPGAVATAMVLNE
ATFRRLRPDLDNATADDAAEALSARHLLPVPWVEPVDVSNAVVFLASDQA
RYITGTQIVVDAGLLARA
>MAP3744 hypothetical protein
MTTRPHRPRAIVVGSTFGAVYAEALAAPESPVELVGLLSTGSRQSADLAS
RLAIPLYTGMNSLPRVDIAFVVVRSGVVGGEGTRVCKELLSRGVHVVQEQ
PVHADEIMSLLHVAAENAVLYTVNDFYSRVAPMRQFICAAKTLDSLARIR
YVHARASLHVVYPLFTILASIVGPLTPARIVMPEQTGGAFVAGRIVLADV
PVDLLIQNELCASEPDNYARLLHTLTVGSDAGELVLDHTHGPTRWHPRPY
PDSWASQSGCPISERVGIDFDPTTATVRNELWPDAVRLAASEFLGSIGRV
RAGVTQRFVRATRLWSEFTSAMGPAAPINPVTSARLSASELVAL
>MAP2192 hypothetical protein
MTVRRRLLVTALLVGLLVGASGFLVRQTFFRPLTITAYFPSATGIYAGDE
IRVSGVKVGTVASVQPQPSRARLILHVDRHVSIPADAKAIIVAQNLVSAR
YVQLTPAYHRGGGPKMLDGAVIDTDRTAIPVEWDQVTEQLTRLATDLGPA
SDVSSTSVSRFITTAANALDGNGEKLRQTLGQLSGISRILANGSGNIVDI
IKNLQKFVTTLRDSNQQIVQFQNRLTTLSSVLDDSRSDLDAALSNLSVAV
GEVQRFVAGTRDKTAEQIQRLTTVTQVLVDHRMDVENILHAAPTAFSNGY
NIYNPNTPGAMGSFIINNLSNPVHFFCDAIGAVANVTAAETGKLCAEYAG
PGLRTANLNYLPIPTNIILQQIATPGKIIYAEPRLAPGAEGPSPTPPDVP
PAVSAYTGINGDSTPHSVQDLLLPDDRQPAPGDQPPPPGPGTPP
>MAP3013c hypothetical protein
MTIPWTPDRLGDLTGRRVIVTGATNGVGLGTARALAKAGAEVILAVRNTE
LGKQRAAQMGGSTAVEKLDLADLSSVRAFADRIEAPVDILINNAGALTDR
RTETVDGFEMTLGTNLLGPFALTNLLLPKVRSQIINVGSDAHRSATLHLD
DLHLRRHKWTRLGAYAQSKLAVMLWGLELDRRLRAAGSPIVTQLTHPGWV
ASNLSNLGDAPLKAVAHKAVKVVADRLANDIDEGAAPTLYCISEPIPPGS
YVGVSGRFGLRGGPVLIGRTPLACDYDTAARLVAFAERETGTELRV
>MAP1818c hypothetical protein
MRPGFVIGRMTEGMTPAPLSSTPAQVAAATARALAKGRRTVWIPWALGPA
ATVLRMLPQFIWRRMPR
>MAP3738c hypothetical protein
MSVTPGADPYALSAEFYEVMAIPHWDMKRQVLVSALTARGPVKDHVLDIG
AGTGLSTVTVADTIADVPIHAVEPSAAMRAALVSRILSRPDLIDRVTVHP
VNLEELDLPERLGAVVLFGVIGYMDKQARQHFWAALRPRLTPRAPVIVEV
MALDQPMPVPEMTIAQQRIGVRHNEVRISGQPAGSDAEHWTMRYVVSEGD
KVTREFTAEHTWHTVGLAELAHEAEAHDMTFEQLHPIIGVLHPR
>MAP0760 hypothetical protein
MRTRATLIKFAIFAVVMAMLTAFLFFIFGQYRTGATNGYSAVFTDVSRLK
PGQSVRVAGIRVGTVNSVSLQPDKKVVVKFDADRNIVLTEGTRAAVRYLN
LVGDRYLELVDGPGSPKRLPAGGQIPVSRTAPALDLDLLLGGLKPVTQGL
NARDVNALTSGLLQVFQGQGGTLDSLFTKATSFSNALADNDQTVQQLIDN
LNIVIGTISKDGKQFSGAVDRLERLVSGLSDDRNTIGSAIDALDRGTASL
ADLLAQARPPLTGTIDQLNRLAPILDNDKDRLDAAIGKAPKNYRKLVRLG
ANGATIPYYLCMLELRGTDLQGKTVRAPIFRSDAGRCTEP
>MAP0791c hypothetical protein
MASKQTLFRIFYRIGFTPWDGHPLAQSLRDLVEGTGDAAALPAGKALEIG
CGTGDCAIYLAQHGWNVTAVDFVAKPLERARAKAGAAGAAVDFVQADVTR
LGQAGIGTGFELIVDNGCLHNMSDADRDAYVREVTGVAAPQARLLIVAFV
PGGRFGVRGVEDAEMQRRFTADWTLLAAGPERELDGAERTPARYYLFQRR
>MAP3607 hypothetical protein
MSTIFDIRNIRLPKLSRTSVILGSLVIVLALVVGYVGWRLYEKLTNNTVV
AYFPAANALYPGDKVQIMGLRVGAIDKIEPSGDKMKVTFHYQNKYRVPAN
ASAVILNPTLVASRAIQLEPPYKGGPVLADNAVIPEERTQVPVEWDQLRN
SITNIISKLGPTAEQPKGPFGEVIESFADGLAGKGKQFNTTLTNLSRALT
ALNEGRGDFFAVVRSLALFVNALHQDDQQFVALNQNLADFTTRLAHSDGD
LANAIEQFDSLLTTVRPWLDKNRGVLTYDVKNLETATNALLQPDPLNGLE
TALHVLPTAAANINQIYHPSHGSVVAIPSITNFANPMQFICSAIQAGSRL
GYQESAELCAQYLAPILDAIKFNYPPFGLNLFSTAETLPKEVAYSEPRLQ
PPNGYKDTTVPGIWVPDTPTSHRNTQPGWIVAPGMQGQQVGPITAGLMTP
DSLEELMGGPNIAPVQSNLQTPPGPPNAYDENPILPPMGLNAPVPIPPPP
PGPGVAPGPVAPTPAPVSAPAPNAGGPAAPADFGGGQ
>MAP0113 hypothetical protein
MIDTLGRLVVGVVKAGHRQRTWLSGLALLVALVVGAAYLALGALRVNPLE
STYQVTIRLPESGGLLANQDVSVRGIRVGRIRSLRPIPSGVEVVANINAH
TRIPASSPVRVSGLSPAGEQYIDFEPTSNTGPFLSDGSVIGPQHTTTPIP
LSQVLADADGLLAQTDPKKLEIVKRELSLSNQGPQKLTDIIDGSTFLLST
MDPVLPQTVSMLKTSRVALTTLSDKNAGLSVAARNVGDLMAGVNKMDRGY
RRLVDQTPHALSTVDNLFDDNSDTMVGLLANLVTTARVVYLRVPALNAVF
PNYRGSTLEALMTTMHEHGLWATADIYPRYTCDYGTPRRPSSSADYPEPF
LYTYCRDDDPQVLVRGAKNAPRPGGDDTAGPPPGADLGQTTDPTPKGRFT
IPTPYGGPVLPIEPPH
>MAP2659 hypothetical protein
MTIVDRLRYDGKRALVVGGATGMGAAAAKSAAELGAEVIVLDYAPVTYDV
AKSIQVDLRDPASIDAALEQLDGPVHAVFSAAGIAEGTTDLMAINFLGHR
YLIERLLERNQLPSGSAICFISSVAGMGWENDLDLLNEFLATPDFATAQE
WVKAHEPEGIIHYGFSKKVVNAYVATQGYPLLKKGIRINAICPGPTDTPL
AQANADLWLTFAQDYRDETGSKVHTPEQMGDVMAFLNSAAAFGINGITLL
VDYGHTMASLTGAYPPGKPIIDIIMGRVKL
>MAP3730 hypothetical protein
MTASPDDTNALAAEWDERYAGLTTEMRDAEPNAVLITEVSKFAPGRALDI
GCGVGAEAIWLASREWDVTALDVSRVALARATARGRQAGVRVNWVRAALE
DAPLRVGGYDLVTAFYPALRHSSGRAAEESLLAAVAPGGTLLVVHHADVD
AEKAKSYGFDPADYLSHDDIATLLDGDWHVRVDRRQPRKTPARPELQHTH
DDVLLARRLR
>MAP3070 hypothetical protein
MRAIEGSADHVVVIGAGLAGLSAALHLAGRGRAVTVVEREAWPGGRAGRR
DIDGYRIDTGPTVLTMPDIIEDTFAAVGDTLARRLELCTLDPAYRAVFAD
GSSLDIHRDPDRMADAVAEFAGREQATGYRRLRAWLTRLYRTEFDGFIAA
NFDSPLSLLTPRLAQLIAIGGFRNWDRMVKRYLSDPRLQRIFTFQSLYAG
VAPRDALAVYAVIAYMDTVSGVVFPRGGVRALPDALAAAASDAGVQFRYG
TTVTALEKTGGRVGAVLTEQAGRISCDAVVLTTELPQTYALLGRTPRRAL
RLRHSPSAVVAHVGCPAVASQTAHHTILFGDAWDHTFADIIGAGQLMRDP
SLLVTRPTASDPTLAPRGRDLLYVLAPAPNLESGGIDWDSVGRGYVDDLI
ALVGQRLLPGLPGSATVLDVVTPADWARRGMAAGSPFALAHTFAQTGPFR
PANMVRGIDNAVLAGSSTVPGVGVPTTLISGRLAADRITGVSKRTPARID
MKAGSP
>MAP0757 hypothetical protein
MPALRLKVDMSGAMQAVGALFAMSADAVKYLFRRPFQWREFLEQSWFVAR
VSLAPTLLVAIPFTVLVSFTLNILLRELGAADLSGAGAAFGAVTQLGPLV
TVLIVAGAGATAMCADLGSRTIREEIDAMEVLGINPIQRLVTPRMLASGL
VAFLLNSLVVIIGVLGGYVFSVFVQDVNPGAFAAGITLLTGVPEVVISCI
KAALFGLIAGLVACYRGLSITGGGAKAVGNAVNETVVYAFMSLFVVNVVV
TAIGIKMTAK
>MAP3521 hypothetical protein
MLSADHRFQQTMTVRLGRRGRVPITGWVQSRRGVGMAGEDHYTRPCADSA
ALVLIDVQRDFYADDAPMRVEGTSAALGAMAELARPFRRRELPIVHVVRL
YRADGSNADPVRRRFIEDGARVAVPGSPGSQIAPELLPKAVELDHQLLLS
GGFQQIGPAEHVMYKPRWGAFYGTKLVQHLRESGTDTLVFAGCNFPNCPR
TSIYEASERDFRIVLVADAISGLYDRGAQECRAIGVAVRDTAQTLDWLGG
>MAP1380 hypothetical protein
MADNKVALITGAARGQGRAHAVRLSAGGADIIAVDIAGRLPESVPYESPT
RDDLAETARLVEANGRRAITAAIDVRDAEKLSAAVGHAVAELGRLDIIVA
NAGICCPAPWDQITGQAFRDTIDTNVIGTWNTVMAGAHHIIAGRRGGSII
LIGSAAGVKMQAFMVHYTASKHAITGMARAFAAELGRYNIRVNSLHPGAV
DTPMGTGRMRDALESAAATYPHLEGLHKPLLPDGIAQPEDIADAVAWLAS
DQSRFVTASGISVDLGVAYC
>MAP3663c hypothetical protein
MAVTDIFARRATLARSVRLLSQFRYERSEPARFYGALAADTAAMVDDLWR
AGHGESAAGRTLLDVGGGPGYFAAAFTDAGVRYLGVEPDPGEMHAAGPVV
AADTGTFVRASGMALPFADDSVDICLSSNVAEHVPRPWQLGAEMLRVTRP
GGLAVLSYTVWLGPFGGHEMGLTHYLGGARAAERYARKHGHPAKNNYGSS
LFEVSVADGLAWAASTGAALAAFPRYHPRWAWSLTSVPVLREFLVSNLVL
VLQPQ
>MAP2188c hypothetical protein
MTGIRVVAPELARRYEEQGWWTPDTLGDLLARGLKDNQHNTFRVHSAVRP
FAGTFGDVELLARRLAAGLRARGVGPGDVVAFQLPNWVEAAVTFWASALL
SAVVVPIVHFYGPKELRYILSSVRPRVFITAEGFGRMTYVPEVCAGVPTV
ALVGESFDALLEDEPMDATVATDPANPAVIAFTSGTTSDPKGVIHSHQTL
GFETRQLLANYPQGLGRQLTALPVGHFIGMLGAFLMLVLDGAPIDLTDVW
DPDKAIDLMDADGVALGGGPPYFVTSLLDHPRFTPDHLRYIKHIGLGGST
VPAAVTRRLADLGIVVTRSYGSSEHPSITGSQHTAPEAKRLFTDGKARAG
VEVRLADDGEILSRGPDLFVGYTDPVLTARAFDEDGWYHTGDIGVMDDDG
YLTITDRKSDIIIRGGENISALEVEEVLLAMPAVAEAVVVSAPDARLGEH
AAAVLRLKPGYGMPTMAEVREHFERAGVAKQKWPEELHEVADFPRTASGK
VQKYVVRQSIREKA
>MAP3495c hypothetical protein
MSAGHNGDMLARHVIRSLGSALVMAASGVAASAVSGSVIPTAAAQCPDVQ
VVFARGTGEAPGVGPTGQAFVDALHQRVGGRSFDVYPVNYPATDQWDTGI
DGIRDAGAHVVSMAHDCPNTKMVLGGYSQGAAVMGFVTSPAVPDGIDPAT
VPKPLTPDVANHVSSVVLFGMPNVRAMNFLGEPPVVIGPLFQDKTLKVCA
TEDPVCSDGMNFAAHDTYADDGAMIDKGVSFASSHLGLGGPGAASVASGH
GNFGE
>MAP2780 hypothetical protein
MATDGATGRVAGKRVLITGAARGMGRSHAVRLAEQGADCILVDICCTPTG
LDYPLATEEDLNETVRLVEKHGSRAVPKIVDVRDEAAMKAAVDAAVDELG
GLDGAVANAGVLTVGTWDTTTAEQWRLVLDVNLIGAWNTCAAALPHLIGG
GGGSLVNISSSAGIKGTPLHLPYTASKHGIVGMTLALANELAAQNIRVNT
VHPTGVATGMAPPGMHALIAEQRPDLVPIFLNALPAPLIEASDVSNAVLY
LISDESRYVTGLELKVDAGVTIR
>MAP3125c hypothetical protein
MSGQRCDGMVALVTGSSRGLGRAIAGRLAARGATVALTARTLDPDPKYQG
SLRQTRDEILAAGGKAVAVQADLSQPDERERLFAEVVDTVGAPDILVNNA
AVTFLRPLDGFPQRRARLMMEMHVLGPLHLCQLAIPAMRERGRGWIVNLT
SVGGDLPPGPPFSEFDRTAGFGIYGTAKAALNRLTKSLAAELYDDGIAVN
AAAPSNPVATPGAGTLDLAKTDTEDIALITETVFRLCTGDPKTLTGRIAH
TQPFLAEVGWPGTGPPVT
>MAP3116c hypothetical protein
MEINGKKAVVVGGASGMGRASAELFAARGADVAILDRESSDGKAVAEGIG
AAFYPVDVTDFAGTEETLQSAVDKLGGLHVIVTTAGGGIAKRTLTKNGPH
DLESFQSVIDLNLIATFNISRLAAAHMAKNEPEDEERGVIINTASIAAFE
GQIGQVAYTAAKAAIAGMCLTMARDLGSVGVRVLAIAPSLFATGLTQGIP
DEFATQLTKDAAFPKRLGRPEEYAKLALAIVDNPMLNGQCLRLDAGQRFA
PK
>MAP1427 hypothetical protein
MVAVTRRDFCKWGGIAALAACAGPPHGKADYTLRIATGTVELAPGRVVST
LTYNGQFPGPLLRFMQGRRTWVDVYNDTDAPEQLHWHGQHIGADVDGAAE
EGTPYVPAHGMRRISFVPGPAGFRFYHTHVVPRADLSRGQYSGLVGPVHI
ASKDDDAGAFDHEIFLTLKEFEPSLSRGGDMAVDFLAGAQEPDLRERGES
SMAASRGRGEQPGYEVGYRAFAINGRMLGHGDPIRVRTGQLVLLHVLNAS
ATETRSLALPGHAFTVVALDGNPVPNRAPVPVLWLGAGERVSALVSMTNP
GVWVLGDLSDDDREHGLGVVVEYAGRAGEPQWVPPPPFTWDYRLFAGPHP
APVAPHDHTIDLLIEKRNAADNGFNVWTVNGTPFAMDSNQPVLDVERGRR
YRLRLRNASDDLHPMHLHRHTFEITRFAGTPTAGVRKDVAMLGGYQSMEI
DFVADQPGLSLLHCHQQIHMDYGLMLLLNGV
>MAP3818 hypothetical protein
MRTPVTVGQHRHPFGRDIYVGRSGYVTEDAISIGGVNLADPDTYRAGMPY
GAFRKLRERAPVAWHPQKDGSGFWALTGYEEIHAVSRDSATWSSQINGAM
FDAPPPGEVPPVMIFMDPPQHTALRKLINKGFTPRQVTRLNEHIVEMAKQ
IVDDVIERGECEFADDVAGALPSYVIAEMLGIPLEDGRRLYQITEILHTG
SVGDSDDERQQAMVEMFQYGVELAVRKRAEPGDDIATSLLHAEVDGQSLS
DLEFNLFFMLLIDAGGDTTRNLVAAGILALLEHPQELQRLKADPSLMPTA
IEEMLRYTSPVTAFLRTATKDTELRGVPVKAGERVAMFYPSGNRDDSHFA
DPDRLDVGRAPNPHLAFGGGGTHFCLGANLARVEASAMVPEVLSRMNDLE
LAGPVERLRSDLINGIRSMPVRFTPGKRLGTA
>MAP2796c hypothetical protein
MTASFDPADPARFEEMYRDQRTSHGLPAATPWDIGGPQPVVRQLVALGAV
KGEVLDPGTGPGHHAIYYASQGFSATGIDGSAAAIERARANARKAGVSVN
FELADATKLDGFDGRFDTVVDCAFYHVFATEPELRSSYARALHRATKPGA
RLYMFEFGEHDVNGFKMMRSLSENDFRDVLPAAGWEISYLGPTTYQVNLS
AETIQMMAARNPDMADEAAKLLERFRAMEPWLQGGRVHAPFWEVHATRVD
>MAP0234c hypothetical protein
MVLDAVGNPQTILLLGGTSEIGLAICERYLQNAHARIILAAMPGDPGRDA
AVEQMKAAGARSVEVIDFEATDTDTHPKMIEQAFAGGDVDVAIVAFGILG
DAEELWQDQRKAVQAAEINYTAAVSVGVLLAEKMRGQGFGQIIAMSSAAG
ERVRRSNFVYGSTKAGLDGFYLGLGEALREYGVRVLVIRPGQVRTRMSAH
VKEAPLTVDKEYVANLAVTAAAKGKELVWAPAAFRYVMMVLRHIPRPIFR
KLPI
>MAP0522 hypothetical protein
MPSPNLPPGFDLLDPDVCVKGLPVAELAELRKSAPIYWVDVPGGTGGFGD
KGYWAITKHKDVKEISVRSDIFSSQQDCAIPVWPKEMTREQIDLQRNVML
NMDAPHHTRLRKIISRGFTPRAVGRLRDELDARAQNIAKTAAAAGAGDFV
EQVSCELPLQAIAGLLGVPQEDRDKIFRWSNEMTGNEDPEYAHIDPAMSS
AELIMYAMKMAEERAKNPGDDIVTQLIQADLDGEKLSDDEFGFFVVMLAV
AGNETTRNSITHGMIAFADNPDQWELFKKERPETAPDEIVRWATPVTAFQ
RTALEDYELSGVQIKKGQRVVMFYRSANFDEEVFEDPHRFNILRNPNPHV
GFGGTGAHYCIGANLARMTISLIFNAVADHMPDLKPLSAPERLRSGWLNG
IKHWQVDYTGKCPVAH
>MAP3507 hypothetical protein
MDDDLLRLEGRVVVVSGAGGGGIGTTVTAMAARAGATVIAVSRSKENLDE
HIAPLAARGLAVLPVAADASTDEGIAAVIDQARRADGRLYGLVNVAGGAE
PSTWMPSTRVSRTDWRKIFADNLETAFFMSQAVAAELLARRLPGSIVSIS
SISGMNTAPFHIAYGTAKSAIAAMTRTMALELAQSAIRVNAVAPGVTETA
ASRTYVADDPDRDRRAIAMGRRGRPEEQAGAILFLLSELSSYVTGQTLLV
DGGLDLRWSHLGADNTSLFLHDESFRATIRRM
>MAP3107c hypothetical protein
MAQKASGRYYAGKRCLVTGAASGIGRATALRLAAHGAELYLTDRDGDGLR
LTVEDARALGASVPEHRALDIADYDQVASFAADIHAAHPPMDVVLNIAGV
SAWGTVDRLTHEQWRKMIAINLMGPIHVIETFVPPMVAAGRGGHLANVSS
AAGLVALPWHAAYSASKYGLRGLSEVLRFDLARHRIGVSVVVPGAVNTPL
VNTVEIAGVDRDDPDVARWVRRFAGHAVSPEKAADKILAGVARNRYLIYT
SPDIRALYAFKRLAWWPYSVVMRQVNVIFTRALRPAPATLVRHDQLETHP
EQPDRPVELE
>MAP1997 acpM, AcpM
MAVSQEEIIAGIAEIIEEVTGIEPSEVTPEKSFVDDLDIDSLSMVEIAVQ
TEDKYGVKIPDEDLAGLRTVGDVVAYIQKLEEENPEAAEALRAKLETENP
EAVANVKARLEADSK
>MAP3482 acrA1, AcrA1
MRYVVTGGTGFIGRRVVTRLLQTRPEAQVWVLVRRQSLGRFERLSARGDT
PWGDRVKPLVGELPALELSDETLAELGHVDHLVHCAAIYDITAGEAEQRA
ANVDGTRAVIELAQRLGATLHHVSSIAVAGDFPGEYTEDDFDVGQQLPTP
YHQTKFEAELLVRSAPGLRHRIYRPAVVVGDSRTGEMDKVDRPYYFFGVL
AKLAVLPRFTPMLLPDTGRTNLVPVDYVADALVALMHADGLDGQTFHLTA
PKTVGLRGIYRGVAKAAGLPPLRGSLPGAVAAPVLKVRGRARVVRDMAAT
QLGIPAQVFDLVDLAPTFVSEKTRNALAGTGIEVPEFAGYAPRLWRYWAE
HLDPDRARRDDPRGPLYGRHVIVTGASSGIGRASAIAIAQRGATVFALAR
NGAALDALVDEIRANGGAAHAFTCDVTDSASVEHTVKDILGRFDHVDYLV
NNAGRSIRRSVVNSTDRLHDYERVMAVNYFGAVRMVLALLPHWRERRFGH
VVNVSSAGVQARNPKYSSYLPTKAALDAFADVVGSEVLSDHITFTNIHMP
LVRTPMIVPSHRLNPVRAISPERAAAMVVRGLVEKPARIDTPLGTLAEAG
NYFAPRTSRRVLHQLYLGYPDSAAARGVATEPRPAERTERTAPRKPRRPV
RAVTRGLRTPRPVRRLVRLVPGVHW
>MAP2259 entB, EntB
MSDTAVLVVDMMNSYQHPDAENLIPNVEKIIEPLTGLVRRARESAGVDLV
YVNDNYGDFTAQFSDLVRSALDGARPDLVKPIAPVSGEAASLTKVRHSAF
YSTALAYLLSRLGTKRLIITGQVTEQCILYTALDAYVRHFPVVIPTDAVA
HIDPELGAAACKMMEQNMSAELTTAAGCLG
>MAP3316 entC, EntC
MNDEPTFALCGPTQTLVADGVRRSYRDVAAAQAALRSQEVSIVLGALPFD
VRRPAALLTPDTVSVSDGPPDWPARKLPSVRVAAALPPPIDYRDRICRAR
EQLAATDNPLHKVVLARALHLVADAPLDARTILRRLIAADPGAYGYLVDL
SAAGHDHAGVALVGASPELLVARTGDRVECRPFAGSAPRAADPDTDAANG
AALASSAKNRHEHQLVIETIRAALEPLCDDLSIAAEPQLSRTATVWHLCT
PISGRLHDRSTTAIDLALALHPTPAVGGVPTDAAVELIADLEGDRGFYAG
AVGWCDARGDGRWVVSIRCAQLSADRRSALARAGGGIVAESDPDDEVAET
TTKFATILNALDVR
>MAP1955c ephD, EphD
MPAPQHSPQHFVHSADGTRIAVYDEGNPEGPTVVLVHGFPDSHVLWDGVV
PLLAERFRILRYDNRGVGASSAPKPVSAYRMDRFADDFAAVIGELSPGGP
VHVLAHDWGSVGVWHYLKRPGANDRVATFTSVSGPSQDQLVDYIFSGLRA
PWRPRAFARALGQALRLSYMILLSIPVLAPLVLRLTLSIPALRRNAVDNI
PVEQIHHSDRLAADAARAVKTYPANYFRSFAIRKQGVAVIDVPVQLIVNT
EDRYVRPYGYDHTPRFVPRLRRRDIRAGHFSPMSHPQVMAAAVHDFADMA
EGKPASRALLRAQVGRPRKAFGDTLVSVTGAGSGIGRATAFAFAREGAEL
IVSDIDEAAVKATAAEIAGRGGVAHAYVLDVSDAQAVEEFAERVSAAHGV
PDIVVNNAGIGQAGGFLDTPAEEFDRVLAVNLGGVVNGCRSFARRMVQRG
TGGHLVNVSSMAAYAPLQSLSAYCTSKAATFMFSDCLRAELDAAGVGLTT
ICPGLIDTNIINTTRFDAPAGARAERVDDRRGQLGKMFALRHYGPDKVAD
AILSSVQKKKPIRPVAPEAYALYGLSRVMPQGLRNAARLRVI
>MAP1209 fabG1, FabG1
MTDTATENTTESAADYGRPAFVSRSVLVTGGNRGIGLAIAQRLAADGHKV
AVTHRGSGAPDGLFGVECDVTDNDAVDRAFTEVEEHQGPVEVLVSNAGIS
KDAFLIRMTEERFTEVINANLTGAFRVTQRAARSMQKKRFGRIIYIGSVS
GMWGIGNQSNYAAAKAGLIGMARSISRELSKAGVTANVVAPGYIDTEMTR
ALDERIQAGALEFIPAKRVGTAAEVAGAVSFLASEDASYIAGAVIPVDGG
MGMGH
>MAP1692 fabG2, FabG2_1
MGLLDRKTAVITGANSGIGLATAERFLAEGVERVFITGRRQRELEDAARQ
LGARATAVRGDVGAPRDLDRLYGEVAAAGAGLDIVMANAGTTRVARLGEI
TDDDLDTLLGTNVKGVVYTVQKALPLLNDGASIILTGSTTADRGRAGLSI
YAATKAAVRSLARAWANELAHRNIRVNVLVAGSTATPGSDRLAAQTDPYV
SVEEFRAGRIATIPLGRFADPVEIAHAAVFLASDLSSFCTGSTVTADGGF
NQV
>MAP2408c fabG2, FabG2_2
MVQVSLLSGQTAVITGGAQGLGFAIAERFVAEGARVVLGDVNLEATQTAA
KQLGGDQVALAVRCDVTKSSEVETLIQTAVERFGGLDIMVNNAGITRDAT
MRKMTEEQFDQVIAVHLKGTWNGTRLAAAIMRENKRGAIINMSSVSGKVG
MVGQTNYSAAKAGIVGMTKAAAKELAYLGVRVNAIAPGLIRSAMTEAMPQ
RIWDSKVAEVSMGRAGEPSEVASVALFLASDMSSYMTGTVMEITGGRHL
>MAP1739c fabG3, FabG3_1
MAERLAGKVALVSGGARGMGASHVRSLVAEGAKVVFGDILDDEGKAVAAE
VGEATRYLHLDVTKPEDWDAAVATALAEFGRIDVLVNNAGIINIGTLEDY
ALSEWQRILDINLTGVFLGIRAVVKPMKEAGRGSIINISSIEGMAGTIAC
HGYTATKFAVRGLTKSAALELGPSGIRVNSIHPGLIKTPMTEWVPEDIFQ
SALGRAAEPKEVSNLVVYLASDESSYSTGSEFVVDGGTTAGLGHKDFSNV
ETDAQPDWVT
>MAP3577 fabG3, FabG3_2
MGRVDGKVALISGGARGMGAEHARLLAAEGAKVVIGDILDDEGKAVADEI
GDSVRYVHLDVTQPDQWDAAVETAVGEFGKLNVLVNNAGTVALGPLKSFD
LAKWQKVIDVNLTGTFLGMRVAVEPMIAAGGGSIINISSIEGLRGAPMVH
PYVASKWGVRGLAKSAALELAPHNIRVNSVHPGFIRTPMTKHLPDDMVTV
PLGRPAESREVSTFVLFLASDESSYATGSEFVMDGGLVTDVPHKQF
>MAP3692c fabG4, FabG4
MAPKVSSDLFSQIVNSGPGSFLAKQLGVPQPETLRRYRPGDPPLAGSLLI
GGEGRVVEPLRAALAKDYDLVGNNLGGRWADRFGGLVFDATGITTPEGLK
GLYEFFTPLLRNLGHCARVVVVGTTPDAAAGPHERIAQRALEGFTRSLGK
ELRNGSTVALVYLSPAAKPAATGLESTMRFILSAKSAYVDGQVFYVGEAD
STPPADWERPLDGKVAIVTGAARGIGATIAEVFARDGARVVAIDVESAAE
TLAETASRVGGTALWLDVTAPDAVDKITEHLREHHGGHADILVNNAGITR
DKLLANMDDARWDAVLAVNLLAPLRLTEGLVGNGSIGEGGRIVGLSSMAG
IAGNRGQTNYATTKAGMIGLTQALAPELYDKGITINAVAPGFIETQMTAA
IPLATREVGRRMNSLLQGGQPVDVAETIAYFASPASNAVTGNVIRVCGQA
MLGA
>MAP2872c fabG5, FabG5_2
MTSQDLTGRTAIITGASRGIGLAIAQQLAAAGANVVLTARKQEAADEAAA
QVGPQALGVGAHAVDEDAARQCVQLTLERFGSVDILVNNAGTNPAYGPLI
EQDHARFAKIFDVNLWAPLLWTSLAVKSWMGEHGGSIVNTASIGGLHQSP
AMGMYNATKAALIHVTKQLALELSPRVRVNAIAPGVVRTRLAEALWKDHE
DPLSSSIALGRIGEPIDVAAAVAFLVSDAASWITGETLVIDGGTVLGAAQ
GFRPAPTPS
>MAP2736 fabG5, FabG5_1
MTGWLDGKRALVVGAGSGIGRAVVDAFAAEGAKVAALERDSDKCDRLRRQ
LPEVAVAEGDATTRQANERAVALAVDTFGGLDTLVNCVGVFDFYRGVADI
DAGALADAFDEMFRTNVLSHLQSVKAALPALRAAGASSIVLTESASSFYP
GRGGVLYVSSKFAVRGLVSALAHELAPGIRVNGVAPGGTLGTDLRGLASL
GLQGTRLDDAPDRAAEVAARTPLRVALSGADHAWSFVFLASDRSRGITGE
TVRPDGGFGLGTPR
>MAP4294 fadD1, FadD1_2
MAEDTIQALLRKRWSDPGVAVKYGDTQWSWAQYLRDAAARAAALLGAADP
GRPMHVGTLLGNTPEMLSQMAAAGLGGYVLCGLNTTRRGEALAADVRRAD
CQFVVTDAEHRPLLDGLDLVGTQILDTSTPDWAGFVDAAGELTPHRDVAA
MDPFMMIFTSGTSGNPKAVQVSHLMAVFAGSNLVQRFGLTEHDTCYVSMP
LFHSNAVVAGWAPAVCSGAAIVPAKFSARNFLDDIRRYGATYMNYVGKPL
AYVLATPERADDADNPLRVAFGNEANDKDIEEFGRWFGVQVEDGFGSTEN
AVIVIREPGTPRGSIGRGIDGVAIYNSDTVTECPVARFDADGALINADEA
VGELVNTAGSGFFTGYYNDPDANAERMRHDMYWSGDLAYRDADGWIYLAG
RTADWMRVDGENLAAAPIERILLRHSAINRVAVYAVPDGHVGDQVMAAVV
LNDGETLTPAAFEAFLDAQPDLSPKARPRYVRIAADLPSTATHKVLKRQL
ISQGTAVAAGEVLWEREPRGTAYTVVASPDCGVPGDGSAPSTVAPV
>MAP1464 fadD1, FadD1_1
MIETVQQLLRQRRHDDTPAVAYGDKTWTWREHLAEAEAEAAALIARADPA
RPLHVGAALGNSPAMLRAMAAAGLGGYVLCGLNTTRRGSALLSDIHRSDC
QILLVDDEHLPLLDRLDLNGIQVLEVGSPAYADAVAAAPPLIPHREVTAA
DPFMMIFTSGTSGNPKAVRFAHGMAIMCGASLIFQYDVTADDVCYLAMPL
FHSNGVAAGWAVAVGSGALMVPAKFSPSRFLDDVRRYRVTYLNYVGKPLA
LILSTPERPDDADNTLRVAFGNEATDRDIAEFARRFGCRVVDSFGSSEFA
VIVVREDGTPPGSIGRPYPGVSIYNPTTLKECVVTQFDEHGALTNFEEAV
GELVNTQGAGPFVGYYNDPEATAERMRHGMYWSGDLAYRDADGWIYLAGR
TADWMRVDGENLAAAPIERILARLPDISQVAVYAVPDERVGDQVMAALVL
RAGAQLSPEEFGRFLASQPDLSPKAWPRYVRINDRLPTTATNKILKRALI
SAGVTAHDGVLWSRAPRGTRYAVAGENADGPAASVIAGQ
>MAP1159c fadD12, FadD12_1
MPNPVRETLGLIATMRRARLLAPMRPDRYLRIAAAMRREGMGMTSGFAAA
AQRCPDRPGLVDERGSLTWRQLDERCDALAAALQALQSGAPAVIGIMCRN
HRGFVEALVAADRIGADIVLLNTSFAGPALADVITREGVNAVIYDEEFTA
TVDRALAGRPDAIRIVAWTDTEHQHTVDKLIASKAGARPIRTGRKGKMIL
LTSGTTGTPKGAKQSGGNAGIGTLKAILDRTPWRAEEPVVIVAPMFHAWG
FSQLLLAASFACPVITRRKFDPEATLDLIDRHRATGLVVVPVMFDRIMDL
PAEVRRRYECRSLRFAAASGSRMRPDVVVAFMDEFGDVIYNNYNATEAGM
IATATPADLRAAPDTAGRPAGGTEIRILDPEFNELPAGEVGTIYVRNNTQ
FDGYTSGSSKDFHEGFMSSGDLGYLDSAGRLFVVGRDDEMIVSGGENVYP
IEVEKTLATHPDVAEAAVIGVDDEQYGQRLAAFVVLAPEARTTPEALKQH
VRDNLANYKVPREISVLDELPRSSTGKILRADLRARVGG
>MAP3497 fadD12, FadD12_2
MSAERVAPTAARALVRSGLLNPPSPRAVLRLLREASRGGTNPYTLLAVTA
ARWPGRTAIIDDDGALSYRELQRATESLARRLIRDGVAPGRAVGVMCRNG
RGFVTAVFAVALLGADVVPISTEFRSDALAVALRAHHISTVVADNEFAER
IAGADDAVAVIDPATAGAEESGGRPAVAAPGRIVLLTSGTTGKPKGVPRA
PQLRSAVGVWVTILDRTRLRTGSRISVAMPMFHGLGLGMLMLTIALGGTV
LTHRHFDAEAALAQASLHRADAFTAVPVVLARILELPPRVRARNPLPQLR
VVMSSGDRLDPTLGQRFMDTYGNILYNGYGSTEVGIGALATPADLRDAPE
TVGKPVAGCPVRILDRNNRPVGPRVTGRIFVGGELAGTRYTDGGGKTVVD
GMTSTGDMGYLDNAGRLFIVGREDDMIISGGENVYPRAVENALAAHPAVA
DNAVIGVPDERFGHRLAAFVVLHPGSGVDAAQLRDYLKDRVSRFEQPRDI
NIVSSIPRNPTGKVLRKELPG
>MAP2874c fadD13, FadD13
MPPVIEIAREHNPFPTTGVSRGRDGIPRYDELPATLVDMLADQVDARPDS
EAVVELGGGRLTYRQLWDRAARVAGGLRADGLRRGDRVAVRYPAGIDWVL
AFWGTVLAGGVAVAVNTRSAQPEVDFVLSDSGARLQLAPGDPLPDGKPYV
TEQLGAADTAALFYTSGTTGYPKGVPTTHEAFLTNTENAIRCLQQPRDLG
EDMRTLISVPLFHVTGCNSQLLAAARLGGASVILPALDLDALLNAVVAER
VSVMVTVPAIYALLLRHKDFAGTDVSRVRWVGYGGAPIAPSLVRTVKDAF
PHATVFNGYGMTETASLMTVLPDREAVEHADSVGYAVPSVDLGLIPFGDN
EPGVGELVTRGANVTAGYWNRPQATASTFAGGWLHTGDVVRVDDAGRVHI
IDRLKDIINRGGENVSSVEVEAVLLGAPGVADACVLGVPDDVMGEKVGAV
LFGDDDIDVPAVLEHCRGRLADFKVPQYVTVVDGPLPRNAGGKLLKARLR
DQVHWGDPLR
>MAP1008 fadD14, FadD14
MRRRCAVDGTMQDFPLTITAIMRHGCGVHGARTVTTATGDGYRRTSYREL
GDQAAQLANALRGLGVTGDQRVATFMWNNAEHLAAYLAVPSMGAVLHTLN
IRLFPEQIAYVANEAEDQVVLVDASLVKLLAPVLPGLHTVHTVIVVGDGD
TEPLRTSGKTVLRYADVIGAEPAEFDWPRIDENSAAAMCYTSGTTGNPKG
VVYSHRSSFLHTMAACTANGIGIGASDSLLPIVPMFHANAWGLPYAALMA
GADLVLPDCHLDPRSLVRMVEDLRPTVTGAVPTIWNAVLHHLEDEPDHDM
SSLRLVVCGGSAVPVSLMRTFEEKHGVQIRQLWGMTETSPLATMAWPPPG
TPEDQHWAYRGTQGQPVCGVQMRIVDDDGRVLPNDGTAVGEVEVRGPWIA
GSYYLGRDDSKFDSGWLRTGDVGRIDERGFVTLTDRAKDVIKSGGEWISS
VELENCLIGHPDVVEAAVVGVPDERWEERPLAVVVVKDGASVDADQLRKF
LADKVVRWSLPERWTFVDEIPRTSVGKYDKKAVRSRYADGGYRVIEARD
>MAP0556c fadD17, FadD17
MELSDDLTVTELLVPLTEIDDRGVYFEDSFTSWRDHLQHGAAIAAALRAR
LDPARPPHVGVLLENTPFFSAVLVAAGMSGIVPVGLNPVRRGDALRRDIA
RADCQLVLADANSAGTLGDIEHLNVDSAAWADEVAAHRGAPIVFRSASPA
DLFMLIFTSGTSGEPKAVKCSHGKVAIAGVTMTQRFGLGRDDVCYVSMPL
FHSNAVLVGWAVAAACQGSMALRRKFSASGFLPDVRRYGATYANYVGKPL
SYVLATPERPDDAQNPLRAVYGNEGVPADIERFARRFGCVVQDGFGSTEG
GVAIARTPDTPPGSLGPLPAGIDIVDPDTGASCPPGVVGELVNTAGPGRF
EGYYTDEAAEAERMAGGVYHSGDLAYRDEAGYAYFAGRLGDWMRVDGENL
GAAPIERVLLRHPDVTEVAVYPVPDPTVGDQVMAALVLAPGAEFDDEKFR
AFLAEQPDLGPKQWPSYVRISTELPRTVTFKVLKRQLAAQGVDCGDPVWK
IGR
>MAP2380 fadD19, FadD19_2
MSDTTTAFTVPAVAKAVAAAIPDRELIIQGDRRYTYRQVIERSNRLAAYL
HSRGLGCHTEREALAGHEVGQDLLGLYAYNGSEFVEALLGAFAARVAPFN
VNFRYVKSELHYLLADSEATALIYHAAFAPRVAEILPELPRLRVLIQIAD
ESGNELLDGAVDYEDALASVSAEPPPVRHCPDDLYVLYTGGTTGMPKGVL
WRQHDIFMTSFGGRNLMTGEPSSSIDEIVQRAASGPGTKLMILPPLIHGA
AQWSVMTAITTGQTVVFPTVVDHLDAEDVVRTIEREKVMVVTVVGDAMAR
PLVAAIEKGIADVSSLAVVANGGALLTPFVKQRLIEVLPNAVVVDGVGSS
ETGAQMHHMSTPGAVATGTFNAGPDTFVAAEDLSAILPPGHEGMGWLAQR
GYVPLGYKGDAAKTAKTFPVIDGVRYAVPGDRARHHADGHIELLGRDSVC
INSGGEKIFVEEVETAIASHPAVADVVVAGRPSERWGQEVVAVVALSDGA
AVDAGELIAHASNSLARYKLPKAIVFRPVIERSPSGKADYRWAREQAVNG
>MAP2388c fadD19, FadD19_3
MSEWTIGAVLDEIADVIPDRTMTVCGDRRSTFAESADRTRRLANFLSGNG
LGVHRERAVLQNWECGQDRVALVMHNDLYPDMVVGCLKARTVPVNVNYHY
TPREVGELFDYLRPRAVIYHRGLGPKFADVLGRGDVDLLIAVDDGSEAAQ
LPGAVSLDDALAQGDTGHPAPGSPDDLLMICTGGTTGRPKGVLWRQSDIY
VSSMVGADHACAQEIRDKVSGAAGAPWFAVSPLMHAAGMWTAFAAIMAGT
TVVLYDTGKKLDPRSVWETAQRERVGMMTMVGDAYAAPLVAELQRGSYDL
SSLYAIGTGGAATNPKYQQALLELLPHITLINGYGSSETGNMGFGHSRTG
TRTDTFTLREGGLVLAEDYSRFLSPGEPQLGWVAREGRIPLGYFDDPDAT
RKTFPVIDGKQVVISGDRAALEPDGTLRLFGRDSLVVNTGGEKVFVEEVE
EVLRAHPAVADALVVGRPSGRWGEEIVALVELRAGTGAAADELHAHCTSR
LARFKAPKEFLFVAAVQRLGNGKADYRWAKRHAVTEASEKAPMST
>MAP0550 fadD19, FadD19_1
MAVALNIADLAEHAIDAVPDRVALICGDEKLTYAELEEKANRLAHYLLDQ
GVKKDDKVGLYCRNRNEIVIAMLGIVKAGAILVNVNYRYVEGELRYLFDN
SDMVALVHERQHSDRVANVLPDTPNVKTILVVEDGSDKDYQRYGGVEFYS
ALEKGSPERDFGPRSADDIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTD
FATGEFVKDEYDLAKAAAENPPMIRYPIPPMIHGATQSATWMSIFSGQTT
VLAPEFNADEVWRTIHEHKVNLLFFTGDAMARPLLDALNKDHDYDLSSLF
LLASTAALFSPSIKERLLELLPNRVITDSIGSSETGFGGTSIVAKDAPHA
GGPRVTIDHRTVVLDEEGNEVKPGSGVRGLIAKKGNIPVGYYKDEKKTAE
TFKTFNGVRYAIPGDYALVEEDGTVTMLGRGSVSINSGGEKIYPEEVEGA
LKGHPDVFDALVVGVPDPRYGQHVAAVVQPRPGTRPSLAELDRFVRSEIA
GYKVPRSLWLVDEVKRSPAGKPDYRWAKEQTEARPADDVHAAHVSA
>MAP3714 fadD2, FadD2
MPNLLGLPGQATKAVAKVQQYVERGSAELHYLRRIIESGAFRLEPPQNYA
AMAADIYKWGEFGMLPSLNARRTPGRAAVIDEEGELSYAELDRAAHAVAN
GLIAKGVKAGDGVAILARNHRWFLIANYGAARVGARIILLNSEFSGPQIK
EVSEREGAKVIIYDDEYTKAVSKAEPPLGKLRALGTNPDADEPSGSTDET
LAELIEHSSSEPAPKADRHASIIILTSGTTGTPKGANRSTPPTLAPVGGI
LSHVPFKAGEVTSLPSPMFHALGYLHATIAMFLGSTLVLRRKFKPPLVLQ
DIEKYRPTAMVVVPVMLSRILDTLEKMDKKPDLSSLRIVFVSGSQLGAEL
AARALKDIGPVIYNMYGSTEIAFATIAGPKDLERNAATVGPVVKGVKVKI
FDDNGKELPQGEVVRIFVGNTFPFAGYTGGGNKQIIDGLLSSGDVGYFDE
HGLLYVSGRDDEMIVSGGENVFPAEVEDLISGHPDVVEATAIGVEDKEWG
HRLRAFVVKKEGADLDEDTIKHYVRDHLARYKVPREVIFLDELPRNPTGK
ILKRELREMEV
>MAP2235 fadD21, FadD21_1
MSQSSILSMLHARASLRPADIAFTFTDYERDWAGVRESLTWSQLSRRTIN
VARELHLHGSVGDRAVILAPQSLDYIAAFLGSMQAGLVAVPLPLPHRGSS
HERVSAVLADTSPSVVLTTSAVAEDITEYVDKACLDAVPKIVEIDSLNLD
VDGGPHVRAADLPSVAYLQYSSGSTRQPTGVMISHRNLKVNFEQLMRGFF
ADAHVKAPSDLTIVSWLPFYHDMGLVLGVCAPVLGGYRGELSSPLSFLER
PARWVQALASSPHAWSAAPNFAFDLAARKTTDGDLAGLDLGGVLGIISGA
ERVEPATLQRFVDRFAHFNFRDEMVRPSYGLAEATVFAAAGTWNESSGAV
RFDTDELSAGRARRCAAGSGTALVKYQVPQSPTLRIVDGDTQRECPPGVV
GEIWVHGENVADGYWRKLPHEQSCFGATLRDPSPGTPGGPWLKTGDQGFL
FDGALFIVGRIKDLLIINGRNHYPEDIEATVHEITRGRVAAISVPANGTE
KLVTIIEFKERSDSADTSHRIASVKSDVTAAISNAHGLNVDDVVLVSPGS
IPTTTSGKIRRAGCIEQYRQRQFTRLDS
>MAP2596 fadD21, FadD21_2
MHGSSVVSLMRERAGLQPDDVAFRYTDYEQDWAGVAETLTWAQLYQRTSN
LAREVARHGDTGDRAVILAPQGLSYIVAFLAAIQAGLIAVPLSVPQPGSH
DERVSAVLADTGPAVVLTTAAAAATVTEYLRRPDTGPAPAVVEIDSLDLD
EPNSSSIRISAAPDIAYLQYTSGSTRLPAGVMVSHRNLQVNFQQLMAAFF
PDFDGVAPRNTVCVSWLPFYHDMGLVQGVIAPILGGYPADLTSPVAFLQR
PARWIQAMSRADPVFSAAPNFAFELAVRKTTDADLAGVDLGNIISIVSGA
ERIHPATLRRFCKRFAPYNFHEHMMQPSYGLAEATVYVASRAEAGGPEVV
HFEPEKLSAGSAQRCPAQTGSPLLSYGAPTSPTVRIVDPDTNTERADGTV
GEIWVNGENVAQGYWRKPEETRRTFGGVLVNPSPGTPRQPWLRTGDLGFI
SAGELFIVGRMKDLLIVYGRNHYPEDIESTVQAVTGGRVAAISVPADETE
KLVTIIELKRRGDSDEDARRKLDAVKNEVTAAISNSHGLNVADLVPVAPG
SIPTTTSGKIRRAACVEHYLRGAFIRLDG
>MAP3752 fadD28, FadD28
MFGSSVQDVLRERAGMQPNDAAFTFVDYEHDWAGIAETLTWSQMYRRTLN
VAHELRHCTSSGDRAVILAPQGLDHIAAFLGALEAGLIAVPLAPPAGGAH
DERVHSVLHDTSPSVLLATSSVVGDVAKYAQPRSDGSGPSVIEIDSLDLQ
RSRVTFRRGNRPATAYLQYTSGSTRRPAGVMVSHKNLLANFRQIMSDYFL
DYGKVAPPDTTTVSWLPFYHDLGLILGICAPILAGVRSVFTSPVAFLQRP
ARWMQLLANNSRVFSAAPNFAFDLAARKTTDEDMAGLDLRDVLIIQSGAE
RVQPASMRRFQDRFAKFNLRDTVIRPSYGLAEATVYVATRTPALPPRVVY
FQPEILSSGHAKGCESGSGTALVSYGVPRSQTIRIVDPETSTECPPGTVG
EIWVHGENVSAGYWQRPHETEKTFGARLIGPSAGTPQGPWLRTGDSGFFF
DGELFIIGRMKDLLIVYGRNHSPDDIESTVQAIAPGRCAAVAVPAEGTEK
VVVIVESKKRGGSDEEVMNNLAAVKRELTSAISNSHGLAVADVVLVAPGS
IPITTSGKVRRATCVEQYKQDQFARLDV
>MAP3284c fadD29, FadD29
MLRTGRRDYIGFIFMIASRTRLSPAWKGDSSTMETLIDYLHLWEQRRPEQ
TLFRFVDVDGRELEHYTYRSFAERTRELAAYLSTEAGLRAGDCALLVYPP
GLEMVAALYACARIGVIAVPVSPPLPMSFESGLAKLGFIARDCQARAVLS
TKQFEYDFRMLLGQRHGGQPWSDAGLPELPWFATDGAQEFGGAPVPDTPG
DVLFLQYTSGSTSDPKGVIVSHENVIANVSAFTGGSEVLVSWLPQHHDMG
LISAYMFILLQGGTTHAMSPLDFLARPSAWLRLISDVRATHTPVPNFALE
YCLREDKVPAAELAGIDLSSLECIVVGAEPLRANTFHQFRERFAPYGLRP
EALTGAYGMAESTLIVSIRGRQTLTVNKRGLEKNVARVEKALPENSNQVP
LVSCGKPLEETVVRVVDPQTRQALGEGRVGEVWLAGPSKGRGYWNRPQLT
AEMFEARLAGDDEHSYLRTGDLGFLYEGELFVCGRSKDLIIVRGVNCYPS
DIEAVVERSAKQVRGGCVAALSVEQDDQEALVVVAEVRDEHNLPDARALA
RAIRRHCHIDPHTIVFAPPRSIPKTTSGKIRRASTRQLWLDGALPTFSSW
VNPAATRGHDGTAAGAGPLERFANLIESYDLTGDEDCSFADLGIDSLALA
ELRNDMQALLAEHGAAELADEVNTRLLQRLTVAEFFSLIRQFGEGSGQPL
DALRRALDQISAEYEAHEAAQMRADAALPLPDPAPARAGTPTDILLTGAT
GFLGPFLLSSLLARTPYTVHALVRATDPGHGLDRIVASLRKAQLWTPALE
AEVRARVRVICGDLAEPALGIGEPAFARLARDVDAVVHNGALVNYVRTYD
ALRPTNVEGTRELLRLAMTDHAKTFHLVSSTFIYGWSTQPVVGEWDANEK
MAGLDFGYSQTKWVAEQLALAAQRKGLDVRIYRPSLISPTRSGFGSQDDI
LVRLTAFMIEHGLAVNALNQISLLPADLIADHIVALMDLPDESGSVFNMT
ADDYYNLTDVTRILSERYGYRFDYHDIDSFAEQLNRRCTPDDQMYPLVDF
LTRSADKIAAMRDKRYDNTQYRHRRGLVPVRLREPALTETVDHLVRFLRT
ERMITEVEDEAQRSA
>MAP0780 fadD3, FadD3_2
MNSPRWQTIPEMVLSAADRFGDAEAVVDGPLRLTFQQVVERIRCAAGAFA
ELGVEKGDRVAVWAPNSAEWIIAAFGLLTAGGVLVPVNTRFKTEEAADII
VRARVKAVLVQKGFLGQDYAAPAGIPVIDIKSDFLSSGSPFSRPVNGTDI
SDIIFTSGTTGRPKGAMMNHRQTLRMYDEWATLADLREGDRYLQINPYFH
TFGLKAGLITSFLRGATMLPVPVFDVDTVVDLIERERITMLPGPPTLYHS
LLTVPDKSKLATLRAGVTGAADIPVELVRRIHDELPFQTLMTGYGLTEAG
NVTLSRPGDSFEDVATTAGVPCEGVEVRIADDGEVLVRGYGVMQGYLDDP
AGTAEAIDADGWLHTGDLGTFTETGRLRIVGRKKDMFIVGGFNAYPAEIE
GFLLNHPAVAQAAVIGVPDERMGQVGKAFVVANAEVSESDLLAWCRDRMA
GFKVPRTVEFLDALPLNATGKVMKDQLR
>MAP0506c fadD3, FadD3_1
MTTDPRTVPAALDRLARRLPDHDALITEDRSFTAAALRDEVHRAAAALIE
LGVRAGDRVAIWSPNTWHWVVACLAIHHAGAAMVPLNTRYTAAEAGDILA
RVGAPVLFGMGRFLGHDRLADLDRAALPALRHIVRIPIEADDPVPGSWDE
FIAHGTDLGAVAERAAAVTPDDVSDILFTSGTTGRSKGVLCAHRQSLSAS
ASWAANGKITSDDRYLCINPFFHNFGYKAGILACLQTGATLIPHLTFDPL
RALQAIEQHRITVLPGPPTIYQTLLDHPARRDYDLSSLRFAVTGAATVPV
VLVERMQSELDIDIVLTAYGLTEANGMGTMCRADDDAVTVATTCGRPFAD
FELRIDDSGEVLLRGPNVMLGYLDDPDATAAAIDADGWLHTGDIAFPTSG
SARWAARSWSPAPGPNSTSNP
>MAP1647 fadD31, FadD31
MDDALRRDDGVPGLLRIEDCLDADGGVALPPGVNLISLIDRNIANVGDTV
AYRYLDYSGSDDGTAHEVTWSQFGVRLEAIGARIQQVASRGERVAVLAPQ
GIDYVAGFYAAVKAGTIAVPLFAPELPGHTERLDTALRDSQPSVLLTTTV
ARDAVEQFLAGHPHLHRPRVIAIDEIPDSAAESFAPTELGMDDVSHLQYT
SGSTRPPVGVEITHRAVGTNLVQMILSIDLLDRNTHGVSWLPLYHDMGLS
MIGFPAVYGGHSTLMSPAAFVRRPQRWIKALSDGSRHGNVVTAAPNFAYE
WAAQRGLPGGGEDINLRNVVMIIGSEPVSMDAIRTFNKAFAPYGLPRTAF
KPSYGIAEATLFVATIAPQAEATAVYFDRRQLGAGHAVRVPANAPDAVAA
VSCGQVARSEWAVIVADGAELPDGQVGEIWLQGNNIGRGYWGMPEETRRV
FGATLRSRLPDGSHAAGAAPDGSWLRTGDLGVYLDGELYVTGRIADLVRI
DGRNHYPQDIEATVAEASPMVRRGYVTAFSVPAVDDSEAGRLVVVAERAA
GTSRQDPRPAIEVIRAAVAQRHELAVADVRLLPAGAIPRTTSGKLARRAC
RAQYLDGSLGAR
>MAP0219 fadD32, FadD32
MAYHNPFIANGKIKFPENTNLVKHVEKWARVRGDKLAYRFLDFSTERDGV
ACDISWSEFSARNRAVGARLQQVTEPGDRIAVLCPQNLDYLIALFGALYA
GRIAVPLFDPSEPGHVGRLHAVLDDCTPSTILTTTEAAEGVREFIRARSA
KERPRVIAVDAVPNEVNSTWVPPEADENTIAYLQYTSGSTRTPTGVEITH
LNLPTNVLQVLNGLEGKEGDRGLSWLPFFHDMGLITAMLSPVLGHNFTFM
TPAAFVRRPGRWIREMARKPDDAPDCEVFTVAPNFAFEHAAVRGVPKEGE
PPLDLSNVKGILNGSEPVSPSSMRKFYEAFKPYGLRETAIKPSYGLAEAT
LFVSTTPMDQAPTVIHVDRAELNKQRFVEVPADAPNAVAQVSAGVIGVDE
WAVIVDPETASELPDGHIGEIWLHGNNMGIGYWGKEEETNEVFRNILKSR
ISQSHAEGAPDDAMWVKTGDYGTYYKGHLYIAGRIKDLVIIDGRNHYPQD
LEYSAQEASKALRTGYVAAFSVPANQLPKEVFDNPHTGLKYDPDDTSEQL
VIVAERAPGTHKLDYQPIADDIRAAIAVRHGVTVRDLLLVQAGTIPRTSS
GKIGHRACRAAYLDGSLRSGVGSPTAFANSTD
>MAP0026 fadD33, FadD33_1
MNALAAAMRDAMTRSPHDLVVLDKDSDTWRSCPWPEVHGMAESIAARLLQ
RDRLGAVGLVGEPTVELIAAIQGAWLAGVAVSLLPRPRRGADPSEWAHST
LQRFGGIGVDTVLSHGGVLRTLAAADSAVSVCDLAQAARTPTSTGLEYQD
DSGVAILQGTAGSTGNPRTAVLSPAAVLSNMRGLIERLRLDGASDRACSW
LPLYHDMGLAFLLTSALSGMPLWQAPTGAFSAAPLRWAEWLSDSQATFTA
GPNFGYSMIGRYSGRVRDVDLGALRIAINGGEPVDCEGFQRFATAMAPFG
FRAAAATPAYGMAEATCAVTMPGCGEGLQIDERTESGVRQRYALLGGPIA
GMELRIAATDEPSGEGVGEVEIRGSSTMTGYLGDTTGRSDGDWFPTGDIG
YLVDGALVICGRSKEVITVAGRNIFPTEIEQVAAQVDGVRHGGVVAVGSK
SGAAQSRLLIAAEFVGADRDVTRSAVIKRVISVCGVTPADVVLMPPGSLP
RTSSGKLRRLEVRRQLLG
>MAP1554c fadD33, FadD33_2
MSELAAALTAAMRTGGSDLVVFDRESAAWRRHRWPEVHGLAEGIAAWLLD
RDRPAALGLVGEPTLEFVAAIVGAWLAGAGVSILPGPVRGAEGRRWADTT
LTRFAGIGVRTVLSHGSHLDALQALDPSRPDEMVVEDLAVAANTGRRCPE
PPAPHANPAILQGTAGSTGTPKTAALSPDAVLANLRGLNARLGVTPADVG
CSWLPLYHDMGLSFLLASALGGMSLWLAPTSAFTASPFRWLAWLSESRAT
ITAAPNFAYNLVGKYARRVSGVDLGALRVAINGGEPVDCAGFERFTTAMA
PFGFDAGAATPSYGLAEATCAVSVPAPGTGLRFADVSDETGTRRHAVLGA
PIPGTEIRISPRHDAPDGIGEIEIRGASMMDGYLGHAPIDHQNWFPTGDL
GFFSDDGLVVCGRAKELITLAGRNIFPTEIETVAAQVPGVREGAVVALGT
GENSARPGLIIAAEFAGRDRAGARAEVIQRVASVCGVVPSDVIFMAPGSL
PRTSSGKLRRLDVRRSLEAVD
>MAP2401 fadD35, FadD35
MPGDFATVGATLRHQARRRGDHPLLICDAERISYAEADVRSAELARGLIA
LGAGKGTHVGLLHPNGARFVVAMLAAARIGAVVVPFSTFVTARELREQLL
DSDVEILLSARSFRSHDYARRLSEAVSETDFDPGRRLFCTAAPQLRRVLF
APQTVGAPGGGIDPALLAAMEDDVQACDPLAIVYTSGSTSTPKGVVHTHG
ALLEHQRNLNGIRGLTADDRLFCNSPFFWIGGFAFGLLATLVAGSTLICS
NATDAGATLDLLEAEKPTMTNGFSAGIAHLAEHPSFADRDLSSMRRGNLY
PIMAVEARPADPELRHNMLGMTEAGGVVLIGDDEADQPEHRRGSFGKPAP
GFEARILDPDTGAAVAVGKVGELCIRGPYLMQRYHKRSREECFDPDGWFH
TGDLVRADADGYFYFAGRLGAMIKTAGANVSAVEVEKAIAAVTGGATAYV
VAIPDARRGQLVAAAVVWPDDRAALDPDALRERLKSELSAYKIPRRFASL
RRADVPLLSSGKVDLRQLRKLFDA
>MAP2580c fadD36, FadD36
MLLTSLNPSAVTATDIPDAVRIDGTVLSRADLLGAATSVAERVAGAGRVA
VLATPTAATVLAVTGCLIAGVPFVPVPADVGAAERRHMLADSGVRAWLGP
LPDEPDGLPHVPVRLHARSWHRYPEPSPDATAMIIYTSGTTGLPKGVVLS
RRAIAADLDALAEAWQWTADDVLVHGLPLFHVHGLVLGLLGSLRIGNRFV
HTGKPTPAGYAQARTDFGGTLFFGVPTVWSRVVADDAAARALRPARLLVS
GSAPLPVPVFDRLAGLTGHQPVERYGASESLITISTRADGERRPGWVGLP
LTGVQTRVVDDDGNPVPHDGETVGKLLVRGPMMFDGYLNRPDATAEAFDA
DGWYRTGDVAVVDDAGMHRIVGRESVDLIKSGGYRIGAGEIETALLGHPG
VAEAAVVGMPDEDLGQRIVAFVVPAGRVNPDDLIDHVAQQLSIHKRPREV
RVVDALPRNAMGKVLKKQLLSDG
>MAP3649 fadD4, FadD4
MQIREYLGAGKPAVILYPSGTVVTFDDLEARANRLAHRFRKAGLREGDTV
AILMENNEHIHAVMWAARRSGLYYVPINTHLTAAEAAYIVDNSSARAIVG
SAALRDTCARIGEHLPGGLPDLLLMADGDLDGWEHYPECVAGEPDTPIDD
ELEGDLLQYSSGTTGRPKGIKRELPHVHPAEAPGMMSALVGFWMTPESIY
LSPAPLYHTAPSVWSMSAQAGGITTVVMEKFDAEGCLDAIQRHRVTHGQF
VPAMFTRMLKLPEAVRHSHDLSSLQRVMHAAAPCPVEIKKQMIDWWGPII
DEYYASSEAIGSTLISAEEWLAHPGSVGKPMACEIHILDENGNELPPGQA
GEIYFSGGYSFEYLNDEAKTAASRDKHGWVTVGDVGYVDEEGYLYLTDRR
HHMIISGGVNIYPQEAENLLVTHPKVLDAAVFGVPDDEMGQRVMAAVQTV
DPGDATDEFGAELLSWLRDRLAHYKCPRAIAFEEQLPRTDTGKLFKNGLI
EKYSV
>MAP3601 fadD5, FadD5
MMVLMLNRPEFMESVLAINMLGAIAVPLNFRLTAAEIAFLVQDCQARVVI
TEAVLAPVATGVRDIESLLDTVVVAGGSSDDTVLGYEDLIDETGAAHQPV
DIPNDAAALIMYTSGTTGRPKGAVLTHTNLTGQTMTGLYTNGADINNDVG
FIGVPFFHIAGIGNMLTGLLLGIPTVIYPLGAFEPGQLLDVLAAEKVTGI
FLVPAQWQAVCAEQRARPRDLKLRVISWGAAPAPDALLREMSAMFPGTQI
LAAFGQTEMSPVTCMLLGEDAIRKRGSVGKVIPTVAARVVDENMNDVPVG
EVGEIVYRAPTLMSGYWNNPEATAEAFAGGWFHSGDLVRMDEDGYVWVVD
RKKDMIISGGENIYCAEVENVLASHPDIVEVAVIGRAHEKWGEVPIAVAA
VANDNLALEDLDEFLTERLARYKHPKALEIVDALPRNPAGKVLKTELRIR
YGGG
>MAP2571c fadD6, FadD6
MSDRDGGARTAVKLTDIAARVPTVLADLPVIARGTLTGLLAQPGSHKSIG
TVFQDRAARYGDRVFLRFGDQQLTYRDANAAANRYAAVLAARGVGHGDVV
AIMLRNSPNTVLAMLAAVKCGAVAGMLNYHQRGEVLAHSLGLLDAKVLIA
ETDLVSAVAECGGSGSTETLTAEDLERFAVSAPATNPASASAVQARDTAF
YIFTSGTTGFPKASVMTHLRWLKALAAFGGIGLRLKSSDTLYCCLPLYHN
NALTVALSSVINSGATLALGKSFSASKFWDEVIANDATAFIYIGEVCRYL
LNQPAKPTDRAHRVRLIAGNGLRPEIWDEFTQRFGIARVCEFYASSEGNA
AFINVFNVPRSTGIFPLPLAYVEYDPDTGAPLRGDDGRVRRVPPGQPGLL
LSPVNRLQPFDGYTDPESSEKKLVRNAFRDGDCWFNTGDVMSPQGLGHAA
FVDRLGDTFRWKGENVATTQVEAALASDGSVEDCTVFGVEVPRTGGRAGM
AAIKLRDGAEFDGRSLARTVYEQLPVYALPLFVRVVDSIEQTTTFKSRKV
ELREQGYGPEVKDPLYVLAGRDEGYVPFYDEYPDEVAEGKRP
>MAP3524 fadD7, FadD7
MAMTSEATPSATAAGPRLADLVEAAAQRAPRAPALLVASERNPIAYADLV
RLVDDLAARLRAAGLGPGDRVGLRAGSNPEFVVALLAASRADLVVAPLDP
ALPAADQLSRSRAVGARAVLVDRLGEGQTAPESAPCWPVTVTVGPDDGAP
TVDLTVTAAPTHDVTAPQGLRDDDAMIMFTGGTTGAPKMVPWTRHNIAAS
IRSIVAGYGLGPRDATVAVMPLYHGHGLLAALLATLASGGAVLLPARGKF
SAHTFWDDIAAVGATWYTAVPTIHQILLERARTEAPRGTHALRFIRSCSA
PLTAETAQALQDTFGAPVVCAFGMTEATHQVSTTAIDGAGHSENPGATPG
LVGRSTGPDIRIAGPDGQSLPADTVGEVWLRGATVVRGYLGDPAITAANF
TDGWLHTGDLGTLSAAGDLVIRGRIKELINRGGEKISPERVEGVLAGHPD
VLEAAVFGRPDQLYGETVAAVIVTRGSAAPTADELASFCRERLAPFEVPA
EFRRAAELPHTAKGSLDRRAVAEQFGESA
>MAP4048c fadD8, FadD8
MGDELLRHPIHSGHLTVGALKRNKDKPVLHLGDTTLTGGQLAERISQYIQ
AFEALGAGTGATVGLLSLNRPEVLMIIGAGQTQGYRRVALHPLGSLDDHA
YVLDDAGVTSLIIDPTPAFVERALGLLEKVPGLKQILTIGPVPEALSGSA
VDLVAEAAKYAPKPLVAADLPPDHIGGMAYTGGTTGKPKGVLGTAQSITT
MTTIQLAEWEWPENPRFLMCTPLSHAGAAFFVPTIIKGGELVVLTKFDPA
EVLRVIEEQKITATMLVPSMIYALMDHPDSHTRDLSSLETVYYGASAMNP
VRLAEAIRRFGPIFAQYYGQSEAPMVISYLAKKDHDEKRLTSCGRPTLFA
RTALLDADGNPVPQGEVGEICVSGPLLSGGYWNLPEETAKTFKDGWLHTG
DMAREDEDGFWFIVDRVKDMIVTGGFNVFPREVEDVVAEHPAVAQVCVIG
TPDEKWGEAVTAVIVLRPDHPSDEESVAKVTAEIQAAVKKRKGSVQSPKQ
VIVVDSVPVTALGKPDKKAVRARFWEGAGRAVG
>MAP1998 kasA, KasA
MSKPSTANGGYPSVVVTAVTATTSIAPDVESTWKGLLAGESGIHVLEDDY
ITKWDLPVRIGGHLKEPIDERMSRLDLRRMSYVQRLAKLLSTQLWETAGN
PELDPDRFSVVVGTGLGGAERIVESYDLMNEGGPRKVSPLAVQMIMPNGA
AAVVGLQLGARAGVITPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEG
PIDALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFGEAGALMLIET
EEHAKARGAKPLARLMGAGITSDAFHMVAPAGDGVRAGRAMTRSLELAGL
SPKDVDHVNAHGTATPIGDTAEANAIRVAGCQEAAVYAPKSALGHSIGAV
GALESVLTVLTLRDGVIPPTLNYETPDPEIDLDIVAGEPRYGDYQYAINN
SFGFGGHNVALAFGRY
>MAP1999 kasB, KasB_1
MRAAPARKVAGQVREGIFARPMTELVTGKTLPNVVVTGIAMTTALATDAE
TTWKLLLDSQSGIRTLDDPFVEEFDLPVRIGGHLLEEFDSQLTRVELRRT
GYLQRMSTILGRRVWENAGSPEVDSNRLMVSIGTGLGSSEEMVFSYDDMR
ARGMKAVSPLGVQKYMPNGAAAAVGLEHHAKAGVITPVSACASGSEGIAQ
AWRNIVFGEADIAICGGVETKIEAVPIAAFAQMRIVMSTKNDDPVGACRP
FDRDRTGFVFGEAGALMVIETEEHAKARGANILARIMGASITSDGYHMVA
PDPNGERAGHAMSRAIQLAGLTPGDIDHVNAHATGTSVGDVAESKAINNA
LGPHGGNAAVYAPKAALGHSVGAVGAVESILTVLALRDQVVPPTLNLENL
DPEIDLDVVAGKPRPGNYQYAINNSFGFGGHNVAIAFGRY
>MAP3485 kasB, KasB_2
MTELVTGETLPNVVVTGIAMTTALATDAETTWKLLLDSQSGIRKLEDSFV
EEFDLPVRIGGHLLEDFDDGLTRAERHRMGYLQMMSTVLGRRVWENAGSP
EVDSNRLMVSIGTGLGSSEEMVFSYDDMRARGMKGVSPLAVSKYMPDGAA
VAVGLERHAKAGVITPVSACASGSEGIAQAWRNIVFGEADIAICGGVETK
IEAVAIAAFAQMRIVMSTKNDDPVGACRPFDRDRTGFVFGEAGALMVIET
EEHAKARGANILARIMGASITSDGYHMVAPDPNGERAGHAMSRAIQLAGL
TPGDIDHVNAHATGTSVGDVAESKAINNALGPHGGNAAVYAPKAALGHSV
GAVGAVESILTVLALRDQVVPPTLNLENLDPEIDLDVVAGKPRPGNYQYA
INNSFGFGGHNVAIAFGRY
>MAP3417c lpqC, LpqC
MAYARWLWLAVLAVCVVGCGVRHVSAASARDISGTFRSGGMDRTYMLHVP
AGDPVGLVLSLHGGGGTGIAQRGLTGFDAVADAHNLLVVYPDGYEKSWAD
GRGASPADRHHVDDVAFLVGLVTKLQNDYRVAPGHVFVTGMSNGGFMSNR
LACDRADVFAAVAPVAGTLGVGVACNPSRPVSVWAAHGTADPLVPFKGGA
VRGRGGLSHAVSAEAMVDKWRKADGCQGDPSMELLPDARDGTVVHRFDST
SCAASTEVVFYRIDKGGHTWPGGKQYLPAAVIGPTTHTLDGSEAIAEFFL
AHARD
>MAP4288 lpqP, LpqP
MPAGCHHGPMRFGCARAVNLGLLPVLVVVLAGCLAGGHALGTPDSQSIPV
GPSTHTLQSGGTPRSYHLYRPQGLSEAAPLVVMLHGGFGNGEQAERAYHW
DAEADAGHFLVAYPDGLGRAWNAGTCCGEPAHAGTDDVGFVNAVVGAIAA
QIPVDRARVYVTGMSNGAMMALRLGCQSDTLAAIAPVAGTLLTDCSAARP
ASVLQIHGTADDRVPYAGGPGKAFALNGSPRVDGPSVESVNATWRAIDAC
GPPSSTTAGDVTTQTAGCADGRTVELISVAGCGHQWPGGAPSPLAEKVAG
IPAPSTALDATDTIWQFFARNHR
>MAP3608 lprK, LprK
MGALRVIRRRSWQGLTLLVAAMVLTSCGWKGISNVSIPGGPGSGPNSYNI
YVQVPDTLAINGNSKVMVADVFVGSIKAIQLKNWIATLTLGINKNVKLPK
NATAKIGQTSLLGSQHVELAAPPNPSPELLKDGDTIPLKNSSSYPTTEQT
LASLSLILRGGGIPNLEVLQNEVYNIFNGRGEAIRALLGKLDTFTNQLNQ
QRDDITHAIDSTNRLLTYVGGRADVVDRLLTDVPPLIKHFADTKQLLINA
VDSVGRLSQAADQYLSEARGPLHTDLQALQCPLKELGKASPYLIGALKLI
LTQPFDIDTVPKIFRGDYINISLTLDLTYSAVDNAFLTGTGLSGALRALE
QSYGRDPETMIPDVRYTPNPNDAPGGPLVERGDRNC
>MAP4088 lprL, LprL
MRRVAVAGRRALVVALAAMLTSCTWRGIADVPLPVGRGTGGDHMTIYVQM
PDTLALNTNSRVRVADVWVGTVRAITLKDWVATLRLDLDPGVRLPANATA
KIGQTSLLGTQHVELAAPKNPSAQRLKSGDTLLLQNSSAYPTVERTLASV
AVILNGGGIANLDAIQTEVLNILDGHAGQIREFLGRLDTFITELSSQRDD
LTRAIDSSNELLTVFANRRDTLDRVLTELPPLIRHFADTRELFADATESL
GRFSDAADRALTDARANLYRSLLSLQRPLRQLVPAAPFVAGALKLGLTAP
FNIDDVAQVIRGDYVNVSAALDLTLSTIDNTMLTGTGLSGALRALEQSWG
RDPATMIPDVRYTPNPNDAPGGPLVERGE
>MAP2112c lprM, LprM
MRRAVVAALVVACAAALSGCGWRGLNSLRLPGTAGGGPGSYTIQAQMPDV
VTIQENTRVRVDDVNVGNVTKIELQDWHALVTMRIDGDVHLPANSTAKLG
QTSLLGSMHIELAPPKGEPPVGRLTAGSVIPLSRASLYPTTEQTLASVSI
LLNGGGIGQLQEITQAVAKAFAGREADMRSLLSQIDEFIAHTNEQTDDII
AAAENLNALAGQVAAADPVVDKALTSVPKALAVLAQERTKIADTIDRVGK
FSAIAADTIHQSKQSLVDNLRNIAPALRSLADAGPSLTRGLDGLATYPWP
ASTVRNWFRGDYANLTLIVDLTLSRIDQGLFTGSRWEGNLTQLELQWGRT
IGMQPSPVTGGNPLTYPYHFGGY
>MAP0568 lprN, LprN
MSRMWLRAGGLATGSMLLAGCQFGGLNSLAMPGTAGHGSGAYSITVELPD
VATLPQNSPVMVDDVTVGSVAGISAEQRSDGSFYAAVKLALDKNVVLPAN
STATVAQTSLLGSMHIDLNRPKDRPAVGRLTDGSKIAEANTGRYPTTEEV
LSALGVVVNKGNVGALEEITDETYRAVAGRQDQFVDLVPRLAELTSGLNR
QVNDIIDAVDGLNRFSASLARDKDNLGRALDTLPEAIRVLNKNRDHIVEA
FSALHKLADVTSHILAKTKVDFAADLKDLYAAVKALNDNRRNFVTSLQLL
LTFPFPNFGIKQAVRGDYLNVFTTFDLTLRRLGETFFTTAYFDPNMAHMN
EILNPPDFLVGEMANLSGQAADPFKIPPGTASGQ
>MAP2178 mbtA, MbtA
MSPNTTPPAGVLDGFVPFPAERAAAYRAAGLWTGRALDTILTDAARRWPD
RTAVLDASGGTGFSYAGLDEQANRAAAGLADAGIAPGDRVLLQLPNGCQF
AVALFALLRAGAIPVMCLPGHRAAELGHFAALSQATALLIADTAAGFDYR
TMAAGLIEEHEALAHVIVDGDPGPFLSWAQLCERAPAGRPATPVDPGSPA
LLLVSGGTTGTPKLIPRTHNDYVFNATASAELCGLTRDDVYLAVLSAGHN
FPLACPGLLGAMTVGATTVFGTDPSPEAAFATIARHGVTVTALVPALAKL
WAQACEWEDNPPKSLRLLQVGGAKLEADDARVIRSALRVRPAKTASSSAT
QAAQLMRRAETPEPQGTSRNSVTAMPQAFQNRAGMSATFPHLLSKKASCK
>MAP2177c mbtB, MbtB
MVHAPARSEDIREEVAELLGVDVDAVQPGSNLIGQGLDSIRIMTLAGRWR
RRGIAVDFATLAETPTIEAWAQLVTAGRQDTDSAAPPADSSGDPSGETEP
FALAPMQHAMWVGRQDNQQLGGVAGHLYVEFDAGLLDSGRLRAAATALAR
RHPMLRVRFLPDGTQCITPAVECGDFPVHVEDLRELGTDEVERRLTALRE
AKSHQQLDGAVFELTVTLLPGGRSRLHVDLDMQAADAMSYRTLMADLAAL
YRGCDLPELSYTYRQYRHAVEAQDAQPQPRRDADRDWWARRLPELPDPPA
LPITAGRGANRSTRRWHWLDPQTRDALFARAQARGITPAMALAAGFANTL
ARWSSNSRFLLNVPLFGRQPLHPDVDALVGDFTSSLLLDVDLVGAHTAAA
RAQVVQDAMRTAAAHSAYSGLSVLRDLSRHRGTQVLAPVVFTSALGLGEL
FSPEVTGQFGTPAWIISQGPQVLLDAQVTEFDGGVLVNWDVREGFFPPGV
IDAMFAYHIDELLRLASADDAWDAPGPAALPEAQRAVREAINGRTAPPSG
EALHDRFFRQAERQPDAPAVFAGSGDLSYAQLRDQALAVAAALRAAGAGA
GDTVAVVGPKSAEQIPAVLGILSVGAAYLPIGADQPRDRAERILQSGRVR
LALVCGGRQLSLPVPGLVLADVLGGAPADAEIACARVDPGELAYVLFTSG
STGEPKGVEVTHDAAMNTVEFIGRHFEIGPADRCLALSTLEGDISVMDVF
VTLRTGGAIVVVDEAQRRDPDAWARLIDTHRVTVLHFMPGWLEMLVEVGR
GRLSSVRVVPTGGDWVRPEVVRRLRAEAPGLRFAGLGGATETPVHNTIFE
VTEPIPADWTALPFGVPLPNNVCRVVGDTGGDCPEWVPGELWVSGRGIAR
GYRGRPDLTAQRFVEHDGRTSYRTGDLVRYRPDGTLEFVGRADHRVKISG
YRVELGEIESALRRVPGVRTAVAALIAGAGESDVLAAQVGTDDPALTGEQ
VRQYLADLVPAHMIPRHVAVVERIGFTAAGKLDRRAVARELHSVVGQSHS
PGHRAASTPLEGALALILGDLLGRDDVGVDDDFFALGGDSVLATQAVARI
RAWLDAPDVMVADMFANRTVSALAAVLRAAEDDPDRLDHVAELYLEVIGM
DAESVLTATRQTTKS
>MAP2175c mbtC, MbtC
MTPMSENGFDAAGIDPVVIVGMGVEAPGGIETAEDYWELLAHGREALGPF
PTDRGWAVSELLAGSRRSGFKQIHDRGGFLSGAATFDPEFFGVSPREAVV
MDPQQRVALRVAWRALENSGINPDDLAGEDVGCYVGASATGYGPEMARFS
EHSGHLLAGTALSVISGRIAYTLGLTGPALTVDSSCASALVAFHVAVRAL
QNGDCDLALAGGVNVLGSPGFFVEFSKQHALSDDGHCRPYSAQASGTVWA
EGAAMFVLQRKSVALRAGRRVVAEVRATAINQDGRSAGLSAPSGDAQVRL
FRRALGESGVKPAEVGMIEGHGTGTRLGDRTELRSLAQTYGDTEPGAGAL
LGSVKSNLGHSLAAAGALGLAKVLVSAEHGAVPPTLHATEASGEIDWEHQ
GLRLAQTLTPWPAIDGQRTAAASAFGIAGTNAHLIVSMPEVA
>MAP2174c mbtD, MbtD
MMPTHGLPDGRIPVLLSSHDPELIRRDAAAILEYLDRIGESTEATGAVAA
TLLRLRRVRRHRALLRAADRAELAAGLGAIARGEEHALITRSARTTPPRI
AFVFPGQGNQWQAMGADAYRLLPAYREAADRCAQAFVSAGFPSPLPYLVS
PDEQSWTRPEIQGAQFTHAVALAEAWRCFGVLPEITIGHSLGEVAAAYVA
EAISLPDAVALVVARATVVDRLTGRYAMAVLGTGAADAESLLATTPGWVE
VSAVNGPASTVVAGDHDAVAAAVRLARQREIFAHQLFVDYPGHTSALRPL
RAALAELTPQSAFRDSPIRFIGSTRGAEVAATTDYASYWYENLCSTVRFD
LAVQCARSCGADAFVELSAHPSLLYPLSGIIDDESAVIVGSGHRDRPITE
SLSASMAAVATADPGYRWSETVPTPNQPALHGFPNAPMRATHLWAAPDRV
TEAAPRPRLTVAVEDWRHTTSPTTPTAADLGFAGAHPLTRRLADAAAAQG
GCRVVAPGEAEILAIVAPELDQLDAVAAVEQIGAREDAGLPDYRGLIGPR
CRAVWLLTVRAEHVDGDDDQACVAQAALAAMHRSIGFEFPDQAFGHLDLP
HRDVDAATAHAIIEVLLAEPAEVALRGEQSPRRHVRTFRAHNESADRPLD
DAALEHVVITGGSGAIGLHYARYCLQRGTRNLTLLSRNGIEAAVLRELTG
SHDARVSAPRCDITDRAAVTQAAARYAGSGATLLIHTAGIAQARSRTDLT
GADVAAVCAAKVRGLALMADVWPLRPDCRILACSSVFGVWGGYHHAAYAA
SNRMLDVLATQLRARGLDCTAIRWGLWQDAGVVAGSEIARTERSGLVAMD
PRRALQASLHRYESDPLIFDADFERLQVFFESQGMPMPFSDTPTRDDEAA
AAPAGKPLAELVRAELAATLHLGDSVSIDPSASLIDLGVDSLLALDLRKR
LRRTVGNSVPVARMLGGITVAELVDALRADATGGPVVPPTVQRTGAVSGA
HPSTLAMLERLDS
>MAP2173c mbtE, MbtE
MTETTGAGTRLDDERLELLRRKLAERGLSRSTGTARHDEPRMSVGQHRMW
FVQSVDPDSALLNICVSYRLTGGVDTGRLHRAVDAVAARHPVLHTTYDTT
DEGDPRPVLRADLRPEWAEHDLSGLTEQARRLRLDVLAQRDFRRPFDLSK
DSPLRVTAARLADEELMLLITAHHIAWDDGSWAPFFADLTRAYAAPGDFD
TTPVVPDLSADPTAIRREDLDYWRPLMADLPEPLELPGPNGSVVPSTWRA
QRAFAELPAEIVDRAAALARETGATPYMVLMAAFAALVHRYTGSTDFLVA
APVLNRGAATENVIGYYGNTVVIRLRPQSHQSFRELLAQTRDGAVGAFAH
SRADLDWLVRESNPDRRHGADRMTRVSFGQREPDGASFCPPGVRCERGDL
RGHFNQLPLSLMIELNRTPDGSGGGLVEAEYLVEVLDQQLVEQMLRHYRT
LLDSLLSDPDATLSACALMSDADAEWLRAVSTGEQFITPAATLSELVSHR
AARTPDAVAVVYEGHTYTYRDIEEESNRVAHWLIERGVGTEDRVAVLLDK
SPELVITALGVLKAGGVYLPVDPTYPQDRLNFILGDADAKLVLREPVTDL
ADYPATAPTELLRPLTPQNTAYLIYTSGSTGLPKGVPVPHAPIAEYFVWF
GDEYRIDETDRLLQVASPSFDVSIGEIFGTLIMGARLVIPRPDGLRDIGY
LTELLAREGITSMHFVPSLLGLFLSLPGVSQWRTLRRVPIGGEALPGEIA
DKFHATFDASLYNFYGPTETVVNCTSYPVEGAQGTRVVPIGRPKINTRVY
LLDNALQPVPPGVIGEIYIAGTHVAHGYHRRPQLTAERFVADPFSRGGRM
YRSGDLARRNANGDIEFVGRADEQVKIRGFRIELGEIAAAISVDPSVGQA
VVLAMDLPQLGKSLVGYVTPAQGAGTETVDVERIRARVAAALPDYMTPAA
YVVLDEIPITAHQKIDRAALPQPQIGAGTEYRDPATPTEHRIAQLFSGLL
GHERVGVDDSFFDLGGHSLVATKLVTAIRAECGVEIGIRDVFELATVGLL
AERVDQLSSGELTGTRPKLIATAHDEPLPLSASQLRSWFAYRMDGPSPVN
NIPFAARLTGPWDIDALIAAVGDVVARHEILRTRYVELDGVPYQVVGPPG
VEIPVRREDGPDDPWLQQQLDAERRHCFELDGELPIRVAVLRVANAAEHV
LSLVVHHIASDHWSAGVLFADVMTAYRARRGGEAPSWAPLRVQYADYAAW
QRTFLGDAGGQESAVAGAQREYWTRQLAGLPEDTGLRPDFPRQPVPSGDG
ESVDFHIDAATRVKLAEVCRELGITEFMLLQTAVAVVLHKAGGGVDIPLG
TPVAGRTEAELDQLIGFFVNILVLRNDLDGNPTLRELLKRARETALAAYA
HQDLPFDRVVDSVSPVRSLSRNPLFQVVVHVRDHLSATRVIETAPVGGDG
QDTVCTSLDPVFDMAHADLSVNFFGTDGASDIGYNCHLIFRTDLYRRTTI
ERLAGWLVRAVTAFADDLDQTLRDVALIDAGEQQRILRQWSRGAQPPRDR
PRTIPELLQPTRSLGADRIAVRCGAEHIDYPALHRRSDNLAALLVDRGVG
PGTLVGLSTRRGIELVVALVAIMKAGAGYFPVDPGYPSARKQFMLDDVRP
PVVVATVEAVDTMPALPGVELLSLDDPQVRALVDRDRPTPMIRCIWCSPR
APPASRKVLWERIARWRPGWIGSCGTTRRAPTTFDWRRPPSPSWKAAWRC
WPGWRRARR
>MAP2171c mbtF, MbtF
MTAAETDVAPDIEDVMALSPLQEGLYSLTTLAEFADGQPADDPYVIGMAA
DITGTLDVALLRDCAEKMLVRHPNLRASFFSRGVPRPVQIVPSRVELPWR
SVTASPAEVPTLETAERRRPFDLERGPAIRFLLIELPDAHWRLVLTAHHI
VIDGWSLPVFVNELMTLYRAGGDPSALPAAPRPYRDYIGWLAGRDLQASQ
RVWREHLAGLPGPTLLAASLGSVEATEGQRTALPRTTELRMPAEATARLA
AETRSRGITVNTLMQMAWALVLSRLTDTRDVVFGVTVSGRPPELTGVETM
VGLFINTVPMRVRLDPAATVGEQCRAVQRDAALLREHSYLGHAQLRALGG
IGEMFDTLLVYENFPMDGLTAGGELSAGGATFRPSALQTLSHFPIAVAAH
MEGGELVVLIEVVDGALGVIPADAVGRRLLTTAERLLRHWERPLRAVSVL
FDDEAAPLRAAGPIAPPSPKCLPARFADVVAQTPDAPAVSWAQGSLTYRD
LDEATNRLAAQLVALGVEPETPVAIKLFRGPRYVVAMLAVLKAGGMCVPM
EPGMPAPRVNSILRQSGASIVLDEERIDELLEAARSRHGGFEPPDIPPAQ
AAYVVFTSGTTGEPKGVIGTHGAVGAYADDHLDRVLRPAAAALGRPLRIA
HAWSFAFDAAWQPLVALLDGHGVHVVDEATQTDAEALVALIAEHGVDMID
TTPSMFAQLQAFGLLSEAPLTVLALGGEALGSAAWARIRNACNTTTMSAY
NCYGPTETTVEAVVAAIAEHAEPSIGRPTRHTRGYVLDSELRPVPCGATG
ELYLGGAQLARGYLGRAGETASRFVADPFAASERMYRTGDLVRRLPDGSL
QYVGRADAQVKIRGHRVEPGEIAAALESHPAVRHAGVLVRHRDGAPRLTG
YVATHQAAADTPSPAELRGMLSARLPRYMVPQRIIMVDEIPLTPNGKLDE
TALAAVDNAEAVDGAAPPQTGTESALAELIAELLGQPRVDVTADFLALGL
DSIMALSVVQAARARGIALRARLVLDCTSIRELAEAIDAESTPAAGEVDD
GTGPMPLLPNGRWLYEHGQPRRLAQTEAIRVPQTLRRDQLETALAGIVGG
HEVLRSRVDRATMTLVPGPAPDIGRELEEVAVVGDLRAAVADHAARAIDR
LDPERGVLLRAVWLRPAGGDGVLLLTAHVIALDPASWRVVLGELGAALTA
AATGHSPAAVREHTSYRRWAHALTARAPAGHRGVLGVRIGRRRPRSRRPA
RRPGPRPRPRPARPQRRHRRRPHPPAPGIGPAAAHAAGGRHRGHRDALAP
ATRTVHAAATAGPRNTRPRRQFGGRPGYTHHRHRRHGGPAQLHLPDPGRR
GGPATGGRQAGRDPRRRTRLRAAALPARRHRRTARRPPVAATAAELPGRR
PHRRRHRADARA
>MAP2170c mbtG, MbtG
MSTLAILGAGAKAVAVAAKASVLRDMGVEVPDVVAVERIGVAANWQASGG
WTDGAHRLGTSPEKDVGFPYRSALVPRRNAELDERMTRYSWQSYLIATAS
FAEWIDRGRPAPTHRRWSQYLSWVADHVGMTVVHGEVEQLAVTGDRWALH
THETTVHADALMITGPGQAEKSLLPGNPRMLSIAQFWDRAANHDRISAER
VAVIGGGETAAAMLNELFRHRVSSITVISPQATLFTRGEGYFENSLFSDP
TNWPALTLAERRDALARTDRGVFSSSVQEALLADDRIHHLRGRVTHAVGV
QGQIRLTLSTNRGSENLETVHGFDLVIDGSGADSLWFAPLFSQEALDLLE
LGLGGPLSSERLQEAIGYDLAVTDVTPKLFLPNLSGLTQGPGFPNLSCLG
LLSDRVLGSTLGPTNYPARRRHDERQPL
>MAP0109 mce1B, Mce1B
MRVRGPLIGLSLFMAIAIAVTWMVYATLRRDVAGPTTPYAAMFTDVYGLR
VGDDVRMAGVRVGRVEKVELAGKLAKVSFIVENDQRLYGNTVASVTYQNI
IGQRYLGLSLGETGSRTTLSAGSVIPVQQTDPSFDVGKLLNGFEPLFTLL
NPKDADDLTKGVLQSLQGDQASIPLLVEQTSTITKTLSARDQSLGDLISS
LTRVTDTVARQNDDLDQALNQTRDAVANFDDRRAALQDDVGSIARVTRRL
SAIADDVDPSLNELITREPGFSKHLVGIEPQLAFTGDNLPLLLKGFARAV
SEGSYGNAYACDLNATGFFPGLNDITTFIVNAATPGNAYPITTKNLGWHT
PRCRNMANG
>MAP4084 mce2, Mce2
MRWFWPWYTCNFAGTSPPKTKLTMVATRAGLVMETGSKVTYNGVAIGRVA
DISEIERDGAPAAKLVLDVDPRYVNLIPANVVATIEAATLFGNKYVSLSA
PENPSRQRISPRDVIDARSVTTEFNTLFETVTSIAEKVSPIELNATLSAL
AQALDGLGGKFGESIVNGNQILAQLNPRMPQIRYDVRRLADLAGAYTKAA
PDLLDFLSNAVTTARTLTRQQGDLDAALLAAVGVGHEGEDIFARGGPYLA
RGAADLVPTAELLDTYSPELFCMIRNFHDAAPEVAKAAGGNGYSLAAAGS
IVGAPNPYVYPDNLPRVNAHGGPGGRPGCWQKITRDLWPAPYLVMDTGAS
LAPYNHLEIGQPLATEFVWGRQYGENTINP
>MAP2116c mce3, Mce3
MSKHSSSGLIRRGGNQRNGIDPIWWAPTLFIVIGGLVALTAASFSGKFQT
FVPLTLVSDRAGLVMEDGAKVKLRGVQIGQVASIGTDVKTARLQLKIQPG
PFRYLPSNLEAEIKSTTAFGSKFVDLIVPERPSASPLKPGAVLRSRNVTV
EVNTVFENLQAVVQALDPAKLNAILSAFAQSVRGKGERIGEAITDANSLL
RTVNSRMDTIGEDWRLFGATTGVYSDAAQHILSILDSASTTSNTITDNQR
SLDSLLLSAVGFSQTGINVIGANESNIVRAMNLLDPTIALMQKYSPTYTC
LFQGAQWYVDHGGRDALGGNGYSVILDAALLFGDDPYRYPKHLPKVNATG
GPGGRPSCGSLPDPSANFPVRALVTDTGWGAAPNEIRTNVAAGNPWWANW
FPTTKNPPEAPRYFWRGGQPPP
>MAP0564 mce4, Mce4
MVAFAVLTYLSYTAAFAPIDTVTVSAPRAGLVMEQGAKVKYRGIQIGKVE
AIEYSGDQARLTLGINSKDMHFIPSNATVHIAGNTIFGAKAVEFIPPQTP
SPTSLRPDAHVAATSVQLEVNTLFQSLIDLLHKIDPVELNGTLSAFSEGL
RGHGDDLGGILSGLNTLTRQANPKLPALQEDFRKTAVVSNVYADAAPDLN
TVFDNLPTINKTVVDQQKNLDTTLLATIGLANNAYDTLAPAEQDFIDTIN
RVRAPLKVAADYSPEFGCLFAGIERGIKEFAPLLGVRKAGLFTSSSFVLG
APSYTYPESLPIVNASGGPNCRGLPDIPTKQTGGSFYRAPFLVTDNANIP
YEPFTELQVDAPSTLQFLFHGAFAERDDF
>MAP4038c menE, MenE
MLDGRDPALVVLGPPGDRESATLRAGLRVGEPVDDDVALVAATSGTTGAP
KGAMLTAAALRASATATHERLGGPGSWLLALPAHHIAGVQVLVRSLLAGS
TPVELDVSRGFDVTQLPAATRALGTGRRYASLVAVQLAKALGDPAATAAL
AELDAVLLGGGPAPRPVLQAAAAAGITVVRSYGMSETAGGCVYDGVPLDG
VRVRVSDGRIALGGATLAKGYRNPVDPDPFAEPGWFLTDDLGAVDDDGVL
TVFGRADDAISTGGLTVLPQPVEAALCTHPAVADCAVFGLPDDRLGQRVV
AAIVLRDGRAAPSPDALRAHLSRTLDATAAPREVHVVAALPRRGIGKVDR
AALVRRFAGGGEQ
>MAP0501 nhoA, NhoA
MTLDLGAYFDRIGYGGKAAPNLEVLRALMAAHTGSIPFENLDPLMGVPVD
DLSPAALTDKLVHRRRGGYCYEQNGLLGYALAEIGFRVRRLAGRVVWMQP
PDTPPRAQTHTVLAVTFPGSQGAYLVDVGFGGQTLPSPIRFETGNAQQTT
HEPYRLDDRGEGLVLQALVRDEWPPLYVFGTRTVPQIDLLVGSWYVSTHP
SSMFVTGLMVARTTADARWNLAGRELTVHRAQSSEKIRLDDADAVLDVLG
ERFGIDVAGIGQRGALLARIEQVLDA
>MAP3216 omt, Omt
MGIDTSAITPEEETAFLTLQARAINNGWARPILPDPGAADAVTKIDYDFD
GLGLVTPVVCQTSLRAKLLDDRVRAFIAQHADAVVVDLGAGLDDGYARVK
PPDTVDWYSVELPGLAALRDKVMPPGPHEHTIGISVTDPEWISAIPADRP
AIVIADGLFPFLNKEQIIAVMRAITDHFPAGQLAFNDYGRMVIGVWISKL
FPQRMFKKVNQLRAFEGYNDPHTPERWNPKLTLVEETSLASVPEVDLFPT
WLRIATRMAGWSKRTARSARILRFSF
>MAP1694 papA2, PapA2
MVTFGTVHNWDPGTGSVISWHATPAAREKARQAPISDVPASYQQLHHLRR
FSEHAARGLDMARLNIGVWDISGVCDVAAMTEAINAHLRRHDTYHSWFEH
RTDGRIVRHTFADPADIEFAALQRGEMTPTELRAHILATPNPLRWDCFTF
GLVQHPDHFTFYMSADHLVIDGMSVGVIFLEIHLTYAALVSGGRPLPLPE
PASYHDYCRRQHQHTEALTLQSPQVRAWIRFAQDNGGTLPSFPLPLGDPS
VPCGSGVVVAPLMDESQTERFDATCTKAGARFSGGVVACAAFAEYELTGA
ETYCAITPYDHRSTPAEFVTPGWFASFIPVTVPVAGASFGDAVIAAQASF
DSAIGLADVPFDRVLELSSFGGRISKPTGDVHMLSFADARGIPFSGQWDG
LNAGIYGDGRSSDQVLMWVNRFDTETTLTVAFPQNPVARDSVERYIRAVR
AMCLRVVEHGAAAVPNRRRVVAAVNASAARSTANAADRTDRRLQGAGFGA
NPVLR
>MAP2231 papA3, PapA3_1
MRIGKITIGSLGDWTPTPGPVTSWHPTAAAAEKVRQAPASPVPVSYMQGQ
HLRNYHERTAAGLDFSRQIIATCDVPGRCDISAMNYAVNAYLRRHDTFRS
WFHHSGDGEFVRHTVSNPADIEFAPIHHGEMTAEEIRAHVVAIPNPLEWG
CFTFGVVQNEEYFTFFAAMDHVHGDATLIGTTMLEANGMYAAASAGGEPL
ELPDAGSFDDFCAREREYTSALTVDSPEVRAWIDFAENNNGSFPEFPLPL
GNPSEATASAMVSELVMDAEQTERFESACTAVGVRFIGGLFACIALVEHE
LTGALTYYGLTPRDTRRTTDNFMTQGWFTGLVPITVPIAAASFADAAWTA
QTSFDSGQQLAKVPYYRVLELAPWLKWPQPNFPVSNFFHAGAAPLNAVLA
AADLGYANNIGIYSDGRYSYQLTIYVFRYGDGTAMAMMYPDNPVAHKSVA
RYTETMRSVCGRVADTGHWGRVA
>MAP3763c papA3, PapA3_2
MVLVGKVEVGPIHEWLPAAGSVVSWQASPASLDKARHAPNSAVPASYQQA
QHLRRFRENAARGIDMSRLLTAGWDIPGRCDIRVMTHVINAHLRRHDTYH
SWFEVEDGGRIARRTIDDPASIKFTPIEHGEMTSDEVRDLVLDTPNPLCW
DSFRFIVIQRADHFTICLCVDHLHIDAMFVGVGFAEIHLMYRALVAGRAP
LTLATAGSYDDYCVRQHRELDALTLESPEVRGWVEFFEDNNGTLPAFPLP
LGDGPVPCDMLSVQLLDERQTARFESVCLAAGARFSGGVFACAALVEHEL
TGTETYYGVIPVDIRRSQDELATTGWFVGFVPLTVSVAGSFSDIVRTAQA
SFDSNKDLANVPAERVVEMAPWLRMPQRGAPLLFFLDAGVPPLSALVNSH
LDGANARLYHDGRIPSQVAIRVNRLESETQVIVLFPNNPIARQSVTRYLA
VMKSVYARVVDGGDPVESHLNHAAMHPQLGSRREARNEWAGSNWSHPMQR
VAQPFG
>MAP2730c pcaG, PcaG
MTETACTPGQTVGPFLDLGLPYPGDARLVDDGDPRAVRLHGTVYDGVGAA
VPDALVELWQPDGAGRIVRQAGSLRRDPAVFTGWGRCATGEDGGYGFTTL
APGSVIDGRTPFFALTVFARGLLNRLFTRAYLPGAGPDTDPLLARVAPER
RTTLLCVAENGGRAYRFDIRLQGPGETVFLAYRTDER
>MAP2731c pcaH, PcaH
MDPECVASQGDITAEIARIAAQYRADDGGAGQPLLDYPPYRATILRHPKQ
PLVAVDPEAAELWAPCFGRDDVDPLDADLTAGHRGEPIGERVVVAGRVVD
EAGRPVAGQLVEIWQANAAGRYRHQRDRHSAPLDPNFTGAGRCLTGPDGW
YRFLTIKPGPYPWRNHHNAWRPAHIHFSVFGTAFTQRLITQMYFPGDPMF
ELDPIFQSILDPAARRRLIAHYDHDLTQPEYATGYRWDIVLAGGGRTPTG
AGND
>MAP1369 pks10, Pks10
MSVIAGVFGALPPHRYPQRELTDFFVSIPEFEGYEDIVRQLHASAKVGSR
HLVLPLEQYPTLTDFGVANRIFIEHAVTLGCAALSGALDEAGLKPEDLDV
LITTTVTGLAVPSVDARIAARLGLRDDVRRVPLFGLGCVAGAAGVARMHD
YLRGAPDAVAALVSVELCSLTYPGYKPSLAGLVGSALFADGAGAVVAVGE
RRAEQLDAAGPSVLDSRSHLYPDSLRTMGYDVGATGFELVLSKDVAAVVE
QYIEDDVTGFLGAHGLTTNDIGAFVSHPGGPKVIEAINAALGLPPEALEL
TWRSLGEIGNLSSASVLHVLRDTLAKPPPSDSPGLMLAMGPGFCSELVLL
RWH
>MAP1372 pks11, Pks11
MSVIAGVFGAVPPHRYSQREITDEIVKFPALREHEEVVRRLHAAAKVNSR
HFVLPLQQYHSLTDFGEVNEIFIDKAVQIGCDALLGALDEAGLRPQDIDT
IATTTVTGVAVPSLDARIAGRLGLRPDVRRVPLFGLGCAAGAAGVGRLHD
YLRGAPDGVAALVSVELCSLTFPTVKPTVSGLVGTAMFGDGAAAVVAVGD
RRAERLGATGPDILDSRSRLYPETLHIMGWNIGSAGMQLVMSPELPAVVE
KHLADDVTGFLAAHGLTTGVVGAWITHPAGPKVITAIAATLDLPAEAHEL
TWRSLGEVANLSSASVLHILRDTIAKPPPAGTPALMLAVGPGFGSELVLL
RWH
>MAP1796c pks12, Pks12
MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGV
DSPEALWQMVAEGRDVISEFPTDRGWDLAALYDPDPDARHKCYVNTGGFV
DNVADFDPAFFGIAPSEALAMDPQQRMFLELSWEALERAGIDPVKLRGSA
TGVFAGLIVQGYGMLAEEIEGYRLTGMTSSVASGRVSYVLGLEGPAVSVD
TACSSSLVALHMAVQSLRSGECDLALAGGATVNATPTVFVEFSRHRGLAP
DGRCKAYAGAADGVGWSEGGAMLVVERLSDARRLGHSVLAVVRGSAVNQD
GASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEA
QALLATYGQDRGQPLWLGSIKSNMGHTQAAAGVAGVIKMVLAMRHELLPA
TLHVDEPSPHVDWSAGAVSLLTEAQPWPGERPRRAGVSSFGISGTNAHVI
IEAVPAAEPAHADAPALPVLPWMVSAKSAAALRAQAARLAEYVRAHGELD
IADVGWTLAGRATFEHRAVVVGGDRDRLLAGLDELSTDDPAASIIRGVAT
PAGKTVFVFPGQGSQWLGMATELLDTAPVFAQHIQACEEAFAEFVDWSLI
DALRGAPGAAGMDRVDVVQPALFAVMVSLAELWKSVGVSPDAVIGHSQGE
IAAAYVAGALSLRDAARVVTLRSKLLRSLAGPGGMLSIACSTERARELLA
PYGNRVSIAAVNGRSSVVVSGEGAALDELAAFCADLALRTRRIDVDYASH
SVEVEAIREDLAQALTGIEPRSSRIAFFSTVTGNRLDTAGLDADYWYRNI
RQTVQFDQAVRSAAEHGYRTFIESSPHPALVAGIEDTVNDSLPGDTEAIV
IPTLGRDDGGLERFLTSAATAFVAGVNVAWRGVLDGAGFVELPTYAFDRR
RFWLSGEGAAADASGLGLGTSEHPLLSAVVELPASGGVVLTGRLSPSLQG
WLTDHAVSGTVVFPGAGFVELAIRAGDEVGCSTVEELTLQAPLMLPAKDS
GIGSVAVQVVVGEADESGRRDVSIFSRPDSDSPWVCHAQGTLSTGSIEPG
ADLSAWPPAGATKVDIADGYQRLAARGYGYGPAFQGLTAAWVRGDEVFAE
VRLPDAAGGVTGFGVHPALLDAAMHALIVGHQIAGDRDEVVLPFAWQGVS
LHAAGASAVRARLAPAQAPGTAASRAVSLELADGLGLPVLSVRAMVARSV
SERQLRAAVSAAGPDRLFEVAWAPVTVPAAHGEPPVHQVFESLAAEGDPV
GESYRRTHEALAAVQSWLTEHDSGVLVVVTRGAMALAGEDVTDLSGAAVW
GLVRSAQTEHPGRIVLIDSDAALDDSALTAALATGEPQVLLRDGTVYTAR
VHGSRAVDGIMTPPEDRPWRLGISSAGTFENLQLEPVPNPDAALQPGQVR
VALRAIATNFRDVMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSV
YGFFPDGSGTLVPGDVRLLQHKPAHWSYPEAAGISAVFTTAYMAFIHLAD
VKPGQRVLVHAAAGGVGMAAVQLARHLGLEVFATASRGKWDTLRAMGFDD
DHIGDSRTLDFEDKFRSATGGAGMDVVLDSLAGEFVNASLRLVAPGGVFL
EMGKTDIRDPGVVVREYPGVRYRAFDLFEPGRPRMHQWMVELAGLFDAGV
LNPLPVTTFDIRRGRAALRYLSQARHIGKVVMTVPGALSAGTVLITGGTG
MAGSTLARHLVTGHGVRDLALLSRTGPDAPGAAELVAELEAAGARVQVIA
CDAADRAALAGVIAGISAQRPLSGVIHAAGVLDDAMITSLTPERIDAVLR
AKVDAAWNLHELTRDMNLSAFVMFSSMAGLVGSSGQGNYAAANSFLDALA
AHRRAHGLPAISLAWGLWDQASAMTGGLDAADLARLGRDGILALSSDEAM
ELFDTALIVDEPFLAPARIDLGALRAHAVAVPPMFAELVNAPTRRRVDDS
LAAAKSKSALAHRLDGLPEAEQHAVLLELVRSHIATVLGSPTAEAIDPDK
AFQELGFDSLTAVEMRNRLKTATGLALSPTLIFDYPTPNALAGYIRTELA
GAPQEITHAPVVRATDDDPIAIVGMSCRFPGGVDSPEALWQMVAEGRDVL
SEFPTDRGWDLAGVYNPDPDVPGTCYTRTGGFVDNVADFDPAFFGIAPSE
ALAMDPQQRMFLELSWEALERAGIDPVKLRGSATGMFAGVYTQGYGMGAA
PIAEGFRLTGQSSSVASGRVSYVLGLEGPAVSVDTACSSSLVALHMAVQS
LRSGECDLALAGGATVNATPTVFVEFSRHRGLAPDGRCKAYAGAADGTGF
SEGGAMLVVERLSDARRLGHSVLAVVRGSAVNQDGASNGLTAPNGPSQQR
VVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGQPLW
LGSIKSNMGHTQAAAGVAGVIKMVLAMRHELLPATLHVDEPSPHVDWSAG
AVSLLTEAQPWPGERPRRAGVSSFGISGTNAHVIIEAVPAAEPAHADAPA
LPVLPWMVSAKSAAALRAQAARLAEYVRAHGELDIADVGWTLAGRATFEH
RAVVVGGDRDRLLAGLDELSTDDPAASIIRGVATPAGKTVFVFPGQGSQT
LGMGRQLHAGYPVFAEAFDAVVAELDRHLLRPLRDVIWGDDENVLNSTEF
AQPALFAVEVALFRLLESWGVRPDFVMGHSIGELSAAHVAGVLSLQNAAV
LVAARGRFMQALPEGGAMIAVQATEAQVRPLLGPDVGIAAVNGPAAVVIS
GDHDAAVAIAERLRAEGHRVHRLSVSHAFHSPLMEPMIDEFGTVAAGLAS
DKPVIPIISNLTGQPAADDFGSPEYWKRHVRDAVRFADSVRFAQSAGATR
FLEVGPSSGLTAAIEETLADAPVVTVSALRKDRPEPVALVNAVAQGWVCG
MDVDWRGALGTGHLVDLPTYAFDRRRFWLSGDGAASDAAGLGLAAGDHAL
LGAVVELPASGGVLLTGRLSSASQGWLADHAVGGVVLFPGAGFVELAIRA
GDEVGCGIIDELNLAAPLVLPAGGSVAIQVVVDGPDDSGARAVSVFSRAD
AGAGWLLHAEGVLRAGSAQPATDLSAWPPVGAVPVDLGDGYEQLAERGYR
YGPAFRGLTSMWRRGDEIFAEVNLPTDAGVSTTGFGVHPVMLDAALHAVM
LASDGDELPAGSMLVPFSWQRVSLHAAGAAAVRARIVPVSPSAVSIELAD
GLGLPVLSVASMVARPVTDQQLLAAVSNSGPDRLFELIWSAQPSTAVQPV
SLLNWGATELDATDTDSDSGRFAVLFESAPVAGDVVTEVYAATRAVLPVL
QTWLARDGAGTLVVSTRGAMTLPREDVTDLAGAAVWGLVRSAQTEHPGRI
VLVDTDAPLDADAVAAVLAVGEPQTLLRNGTVYTARVLGSRAVGALLVPP
EDGPWRLGMSSYGTIENLRLEPIPDADAPLGPGQVRVATSALAANFRDVM
IALGLYPDPDAVMGIEASGVVIETASQDGRVAVGDRVMGLFPDGTGTVAI
TDQRLLVKVPAGWSHTAAATASVVFATAYYALVDLADARPGQRVLVHAAA
GGVGMAAVQLARHFGLEVFATASRGKWDTLRDMGFDDDHLGDSRGLDFED
KFRAVTGGAGMDIVLDSLSGDSVDASLRLVAPGGIFLEMGKTDIRDPEVV
AAEHPGVRYRAFDLFEAGPDRIARLLDELAAMFGEDVLRPLPVTRFDVRR
APAALRYLSQARHVGKVVMTMPDAWTAGTVLITGATGMAGSAVARHVVTR
HGARNLVLVSRRGLDAPGAAELVAELTAAGARAEVVACDAADREALAKVI
ADIPMQHPLTGVIHAAGVLEDAVVTSLTPQRIDTVLRAKVDAAWNLHELT
RDLDVGAFVMFSSIAGLAGASGQGNYAAGNSFLDGLAAHRRAHGLPAISL
GWGLWDQASAMTGGLGAADLARFGRDGIVAMSSQEALELMDTALIVDEPF
LLPAHIDLAALRAKFDGGTLPPMFVDLINAPTRRQVDDSLAAAKSKSALL
QRLEGLPEDEQQAVLLDLVRSNIATVLGSSSPEAIHPDRAFQELGFDSLT
AVEMRNRLKAATGLALSPTLIFDYPNSAALAGYMYRELVGTSEQPTAAAA
PGEAEIQRVVGSIPVKRLRQAGVLELLLALANESTGTAQSATPAVTTEKD
IADMDLDDLVNAALLDDDDE
>MAP0220 pks13, Pks13
MRAGDDAERSDAEERRPTTVPEMREWLRNWVGRAVGKSPDEIDESVPMVE
LGLSSRDAVAMAADIEDMTGVTLSVAVAFQHPTIESLATRIIEGEPEAVD
AGDDMDWSRSGPAERVDIAIVGLSTRLPGDMNSPDETWQALMEGRDAITD
LPEGRWSEFLEEPRIAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADNI
DPQQRMALELTWEALEHARIPASSLRGEAVGVYVGFSNNDYQFLAVSDPT
VAHPYAITGTASSIIANRVSYFYDFRGPSVAVDTACSSSLVATHQAVQAL
RNGECDVAIAGGVNALLTPLVTLGFDEIGQVLAPDGRIKSFSADADGYTR
SEGGGMFVLKRVDDARRDGDQILAVIAGSAVNHDGRSNGLIAPNQDAQAE
VLRRAYKDAGIDPRTVDYIEAHGTGTVLGDPIEAEALGRVVGRGRPADRP
ALLGAVKTNVGHLESAAGAASLAKVVLALQHDKLPPSINFAGPSPYIDFD
GMRLKVIDSPTDWPRYGGYALAGVSSFGFGGANAHLVVREVLPRDVIERE
PEPTPAPQAAAEPTELPEPQAHALRFDDFGNVIPDPEAPEEEEHELPGLT
EEALRLKAIALEELAAQQESEPTKPLIPLAVSAFLTSRKKAAAAELADWM
ESPEGQASSLESIGRALSRRNHGRSRAVVLAHDHEEAIKGLRAVAEGKQR
PNVFSTDGPVTNGPVWAMAGFGAQHRKMGKNLYLRNEVFAEWIEKVDALI
QDERGYSVLELILDDSHEYGIETSNVVIFAIQIALGELLRHHGAKPAAVV
GQSLGEPASAYFSGGLSLADATRVICSRSHLMGEGEAMLFGEYIRFMALV
EYSADELKTVFADFPGLEVCVYAAPSQTVIGGPPEQIDAIVARAEAEGRF
ARKLQTKGAGHTSQMDPLLGEFSAELQGIKPMSPTVGIFSTVHEGTYIKP
GSEPVHDVAYWVKGMRHSVYFTHGIRNAVDSGHTTFLELAPNPVALMQIG
LTTAAAGLHDAQLIPTLAKKQDDVESMISAMAQLYVYGHDLDIRTLFTRA
KGPQDYANIPPTRFKRKEHWLDVHFSGDGSVIMPGTHVALPDGRHVWEYA
PRNGETDLAALVRSAATQVLPDAQLVASEQRAVPGPGARLVTTMTRHPGG
ASVQVHARIDESFTLVYDALVSRAGQNVAALPTAVGAGAAIAAAPTVAAP
EQPAQAPVEEDTNAETLSDSLTARYLPAGTGKWSPDSGETVAERLGLIVS
AAMGYEPEDLPWEVPLIELGLDSLMAVRIKNRVEYDFDLPPIQLTAVRDA
NLYNIEQLITYAIEHRDEVEALHEHQKTLTPEEIAKEQAQLLSGATPASV
AAPAPDPQAEPETQPAPPPPSDTPIPPPPTDPSGPSANGGQPKPSLAEAL
SSEAVTKALNSDVPPRDAAERVTFATWAIVTGKSAGGIFNPLPKLDADTA
AKMAQRLSERADGPITVEDVQAAETIEALAERVRTYLEAGQIDGFVRTLR
ARPEGSTKIPVFVFHPAGGSTVVYEPLLKRLPPDTPMYGFERVEGSVQER
AAQYVPKLLELNGDNPFVLVGWSLGGALAYACAIGLKRAGADVRFVGLID
TVRAGEEIPQTKEETRKRWDRYARFAERTFNVEIPAIPYEQLEELDDEGQ
VKFVLDIVQQSGVQIPGGIVEHQRTSYLDNRALETVQIEPYDGHVTLYMA
DRYHDDVIEFEPRYAIRQPDGGWGEYVADLEVVPIGGEHIQVIDEPIIGK
VGAHLTDVLNKVEAQTSQTSEVGK
>MAP0977 pks16, Pks16
MSRFTEKMYRNARTAKTGMVTGEPHNPVRHTWGEVHERARRIAGGLAAAG
IGPGDAVGVLAGFPVEIAPTAQGLWMRGASLTMLHQPTPRTDLAVWAEDT
MNVIGMIEAKAVVVSEPFLVAIPVLQEKDIKVLTVADLLASDPIDPVEVG
EDDLALMQLTSGSTGSPKAVQITHRNIYSNAEAMFIGAQYDVDKDVMVSW
LPCFHDMGMVGFLTIPMYFGAELVKVTPMDFLRDTLLWAKLIDKYKGTMT
AAPNFAYALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDLLDAG
KPFGLKPSAILPAYGMAETTLAVSFSECNAGLVVDEVDADLLAALRRAVP
ATKGNTRRLATLGPLLQDLEARIVDENGDVMPPRGVGVIELRGESVTPGY
LTMGGFIPAQDENGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPT
DIERAACRVEGVRPGCAVAVRLDAGHSRETFAVAVESNAFQDPAEVRRIE
HQVAREVVAEVDMRPRNVVVLGPGTIPKTPSGKLRRSNSVTLVT
>MAP3764c pks2, Pks2
MTHPLRRGTTMDESATPVAVIGMACRLPGAIDSPDALWAALLRGDDFVTE
IPLDRWDADEHYDPEPGLPGRSVCRWGAFLDDVGGFDADFFGISEREATA
IDPQHRLLLETSWEALEHAGINPAALTGSRTGVFVGLMHDDHTLRTADAG
ALDQPYGFMGNSFAVASGRIAHTLGLHGPALTVDTACSSGLVSVHVACRS
LHHGESDLALAGGATVLLEPRKLAAGSAQGMLSATGRCRAFDAAADGFVS
SEGCAMVLLKRLPDALRDGDRILAVVRGTAANQDGRTLSLATPSLTAQTA
VYRSALAAAGVDAGTVAMVEAHGTGTPVGDPIEYASLAELYGQDGPCALT
SVKTNFGHTQSAAGTLGLVKAVLALQHGVVPQNLHFTRLPDKIGQLNTNL
FVPQDNTPWPRKLQQPRRAAVSSYGVSGTNAHAIVEQAPNASGGVKPTPA
GPLLFTVSSSSDEGLRETAKRLAGWVKERAASVELSDLAYTLARRRAHRP
VRTAVIAGSVDELAAGLHEVADSRTSYLAAFGRDDCGPVWVFSGQGSQWT
QMGAELLANEPVFATTVAEVEPLIARESGFSVTEAMSAPHTLTGIDKVQP
TIFAMQVALAATMKSYGVRPGAVIGHSLGETAAAVVAGALSLQDGVRVIC
RRSRLMSRIAGAGAMASVELPAQQVLSELMARGIDDVVVSVMASPTSTVI
GGVTSTVRELVAAWEERDVLAREIAVDVASHSPQVDPILDELADALVELE
PTAPEVPFYSASLFDPREQPLCDADYWLDNLRHTVRFGAAVQAALEDGHR
VFVELSPHPLLTYAIEQTAASLDTPAAALAAMRREQALPHGMRDLLAEVH
SVGAAVDFSMLYPDGRLLDVPLPAWTHRHLLLGLDSRDTQPQAGPTVAVH
PLLGSHVRLPEEPERHAWQAEVGTATLRWLADHRIHEVAALPAAAYCEIA
LTAAHTALGDASEVRDLRFDQMLLIEDETPVFALAVMASRSAAEFVVETF
EEGERVRRATAILAAAGPGERPPAYDVPALLAARSSSVDGAELRNRFAER
GVQYGPAFTGLTAAHTSGDTVLAEVALPGAMRSQQRAYRIHPALLDACFQ
SVGAASEIQNSGGGSLLLPLGVRRIRAYASARTAHYCYTRVTSTTATGFE
ADLDVLNEHGAVLLSIQGLQMGTGASEEASRDRALNERLLTVEWHRQELT
EVDPVDAGTWLLVSTSAMADVAATELTDSLKLLGADCTSIRWSSRADPVA
NGKTLRDQLRVGRFTGVVLLTGPKDGDRDDECAARGGEYVRHVAHIVREL
AAAPGEPPRLYVVTRNAQTVLEGDGANLEQGGLRGLLRVIGNEYPQFRTT
QIDTDMHTGASLIARQLVSGSEEDETAWRNDEWLTARLRLSPLRPDERRT
AVVDHEHDGMRLHIRAPGDLQTMELVACERIPPGPGQIEVAVTASSINFA
DVLLAFGRHPSFEGRLPQLGTDFAGMVTAVGPDVIEHKIGDHVGGLCPNG
CWGTFVTCDANVAVTLPPGLTDAQAAAVTTAHATAWYGLHDLARVKAGDK
VLIHSATGGVGQAAIAIARAAGAEIFATAGSSQRRKMLHDKAIEHVYDSR
SVEFAEQIRRDTDGYGVDVVLNSLTGAAQLAGLKLLAPGGRFIEIGKRDI
YGDTKLGLFPFRRNLAFYGVDLGLMSVDHPLRVRELLDTAYERVADGLLP
MPASTHYPLTDAATAIRVMSAAEHTGKLVLDVARSGKSSAVLHPDSGRVF
RPDGSYIITGGLGGIGLFLAEKLAAAGCGRIVLSSRSEPNQRVQEMIELV
RAIGSDVVVECADITQPGTAERLVAAATTTGLPVRGVLHAAGVVEDSALG
GITGELLERCWAPKVVGAWNLHRVTAGEPLDWFCLFSSAAALVGSPGQGA
YAAANSWLDAFTHWRRAQGLPATAIAWGAWSELGRATGFAEDVGTAITPD
EGAYAFQALLCHDRAYTGYAPVIDTAWLSAFAERSRFAEAFRSTGKSPTG
TTRFLAELNTLPHDQWPTRLRRLVSEQVSLILRRTVDPDRPLSEYGLDSL
GNLELRTRIEAEAGVRIKSTAITTVRGLAAHLCDTLAEAAPTSQVRAE
>MAP1867c pks5, Pks5
MGGIVARRRLRHRNPARPLGRRRILRPRAGCAGPVGIALGRIHRRRRRLR
PRVLRDQRATAMDPQHRLLLETSWEAMEHAGLTEERVADSRTGVFIGLTH
GDYQLLAADTRSVEGAYGFSGSNFSLASGRIAYALGVHGPALTVDTACSS
GLTAIHLACRSLHEGESDLALAGGATLALDPRKFSAGSAEGMLSPTGRCR
AFDVAADGFVGGEGSVMLLLKRLNDALRDGDRILAVVRGTAANQDGHTVN
IATPSKSAQTAVYRAALAAAGVDAGTVGMVEAHGPGTPVGDPIEYASLAE
VYGIDGPCALASVKTNFGHAQAASGALGMMKAILALQHGVVPRNLHFTQL
PDDLARIDTKLFVPQQTTPWVSNGGHPRRAAVSSYGLSGTNVHAILEQAP
EPAPETVAPEHISAESPLLFPLSSTSADELRRTAGRLADWVHAHDDLALP
DLAYTLARRRVHRPVRTAVLAGDRAQLIEALREVADGDTPYPAAVGRGDR
GPVWVFSGQGSQWAAMGADLLATERVFAATVAQAEPLIARESGFSVAEAM
SAPQTVTGQDRVQPTLFTMQVALAATMKAHGVRPGAVIGHSLGEAAAAVV
AGALSLEDGARVICRRSRLMSRVAGTGATASVELPAQQVLSELTARGISD
VVVAVVASPQSTVIAGAAPTVRDLVTAWQERDVMAREVPTDVAFHSPQVD
PIMDDLTDALAEISPRPPEVPYYSATLFDPREQPVCDARYWANNMRRMVR
FATAVQAALEDGYRVFAELAPHPLLVRALEQTARSREIPMAALASMRRGQ
ALPHGLRGFVADLHNAGAAVDFSVLYPTGRLVDAPLPTWTHRRLWLTDGG
LESPTHGGCTVAVHPLLGPHVHLQEEPERHLWQADVGTAAQPWLADHQIR
NVVVLPGAAYCEMALAAARSVLGAAAEVRDIRFEQALLLDEQTTIDASAS
VSSPGVLHFTVQSHQGGEQARHASAVLGAAVDEQPAAHDLSALLAAHPHD
DDGAEVRRRMDRRGVQYGPAFAGLGAVHTGDETGTVLAEVALPRQIRSQQ
AAYGVHPALLDACFQSVEAHPAVRALGDGALGLVLGIRRLRAYSAARNAH
YCYTRVTKADTSGVEADIDVLDEHGAVLLAVQGLRVGTSASESGTRDRVL
AERLLNIEWRQRELPEPEHADAGSWLLIGTTATADVLASTLADTLKNRGA
QCTTMAWPQQADHTVHAEQLRNHWRGDGFTGVVVLTGPKNGDNDQESALL
SRDHVQHLVRIARELTDLPGEPPRLYVVTRNAQTVLPDDVPNLGQAGLRG
LVRVIGMEHPQLGAGQIDVDESTDAEALARQLLAGCEEDETAWRDGAWYT
ARLCPAPLLPEERHTAVANHECDGMRLQIRTPGDLESIELVACDRIPPGP
EQIEVAVSASSINFADVLVAFGRYPAFEGRLPELGTDFAGVVTAVGPDVT
DHKVGDHVGGLSANGCWGTFLTCDARLAVTVPPGLADDQAAAVTTAHATA
YYGLHELARIGAGDRVLIHSATGGVGQAAIAIARAVGAEIFATAGSEQRR
QLLRDMGIEHVYDSRTVEFADLIRHDTDGYGVDIVLNSVTGAAQRAGIEL
LAFGGRFVEIGKRDIYGDTRLGLLPFRRNLTFYALDLALMSFSHPDRLRG
LLRTVYRLTADGALPMPESTHYPLADAASAIRVMSAAQHTGKLVLDVPDA
GRSRVVVPPAQVRAFRSDGAYIVTGGLGGLGLFLAEKMASPGSRAGCGRI
VLCSRALPNPKALATLERIRQMGADIVVERGDIAEAGTAQRLLDVATATG
LPVRGVLHLAAVIEDATLANITDELIERDWAAKVYGAWNLHLALQESGAE
QSLDWFCSFSSAAALVGSPGQGAYAAANSWLDAFTRWRRARGLKATAIAW
GAWAQIGRGAALADSADVAITPDEGAYAFEALLRHDRACTGYAPITGTPW
LTAFAQRSPFAEAFRANGQSATGTSKLRAELEELPPEEWSTRLRRLISDQ
VSLILRRNVDPDRPLPEYGLDSLGGLELLTRIQTETGIRVSPADIAAIGT
IRGLADLLRDKLTPAGAAQAERV
>MAP1370 pks7, Pks7
MPRSSGSHPVRRWRWTPSSGCCSRCPGKRWSGPELTPRTLRGSATGVFAG
IFHGSYGGQGRVPGNLERYGLRGSTLSVASGRVAYALGLEGPAVSVDTAC
SSSLVALHLAAQSLRSGECDLALAGGVTVMATPAMFIEFSRQRALAADGR
CKAYAGAADGTGFSEGVGVLVLERLSDARRLGHSVLAVLRGSAVNQDGAS
NGLATPNGPSQQRVIRAALANARLGAADVDLVEGHGTGTMLGDPIEAQAL
LATYGQDRPVDEPLWLGSIKSNMGHTSAAAGAGGVIKMVQALRHGVMPKT
LHVDEPTPQVDWSAGAVSLLTEARPWPARDRPRRAGVSSFGISGTNAHVI
VEQYEPETIAPQGGDVVVPWVLSARSAEALTNQAARLLARVKADPGVRVL
DVGWSLVSTRSRFEHRAVIVGADGAQLLRRLADLAGGQPGAGVVTGRAQP
VGKTVFVFPGQGSQWPGMGAQLLDRSTVFAEHMHRCAGALAEHVDWSLID
VIRGTPGAPGLDRVDVVQPALWAVMVSLAELWRSVGVVPDAVIGHSQGEI
AAACVAGALSLQDAARVVALRSRLLVRLSGRGGMVSLACGRSRAEQLIAP
WGERLNIATVNGISAVVVSGEVDALTELLDRCAADDIRARRIDVDYASHS
VQVEEIRDSLAEALRGIAPRSSAVAFFSTVTGELMDTAALDGDYWFRSIR
QTVQFERAVRGAAEAGYRAFIECSPHPVLTAAIEETMPDGGQGCVIPSLG
RDDGGPDRFWLSAGQAFVSGVVVDWCAMLDGLGGRRVDLPTYGFVRQHFW
LPGGSTGSSDVAALGLRGAEHGLLGAVLPRPDSGGVVLTGRLSTSAQAWL
ADHAVGDTVLFPGAGFVELAIRAGDEVGCGVIDELTLSAPLPLPASGGVR
LQLVVGAPDEAGRRPLSVYSAAVHQDSEWMLHAEGVLRAGTVTPAGDLSI
WPPIGATAVDVTGGYARLAQRGYEYGRAFRGLRAMWQRGDEIFAEVALPD
DAAAGDDFGVHPVLLDAALHVLGVAGEKDQTVLPFSWQGVALHASGASRA
RVRIAPAGAGAVSVELADGAGLPVLSVRSVTMRPVSPGQLSAAMGTAQPS
GLLDVIWSPIALGGNDLGDEVTLWEPGEHGGDVVKSVHAAVTETLAVLQS
WLDGEGGGVLAVQTHGAVALAGEDVSDLAGAAVWGLVRSAQAEHPGRLVL
IDSDGSLDARAVIPCGEPQVVVRQGVAHAARLRPARAGATLGLPSGAWRL
DAGGEGTLGDLVVSRCPRTELADGQVRVAVAAVGVNFRDVLVALGMYPGG
GRLGAEGAGVVVEVGPAVTGLAVGDPVMGLLGVVGSEAVVDQRLLTAVPP
GLSLVAAAGVPVVFLTALYGLSVLAGLRPGERVLVHTATGGVGMAAVQLA
RHYGAEVFATASRGKWDTLRAMGFDDDHIGDSRSPDFEQKFLAATGGAGV
DVVLNSLAGEFTDASLRLLAPGGRFIEMGKTDVRDPDVVAQRYRGARYRA
FDLMEAGPDRTAAMLAEIVGLLQEGVLTPLPLKTFDVRCASAAYRYVSQA
RHIGKVVLTVPSGLGEVLSGCGGGLAQSTVLITGGTGMAGSALARHLVDR
YRVGHVVLVSRTGAQAAGAAELVDELQRAGASASVLACDVADRDAVATML
AGLPAGYPLRGIVHAAGILDDGLLSSLIPDRVDAVLRAKVDGAWNLHELT
KDLDLSAFVVFSSMAGIVGTPGQANYAAANSFLDGLVAHRRADGLAGLSL
AWGLWEQASAMTAHLGDRDKARMSRIGLAPLSTEQALAAFDAAMLVETPV
LVAARLDRAALSENIAALPPLLRELAAGPTRRVIDDADVTASMSGLAARL
HGLSPEARRRELVDLVCGNAAMVLGVPNPADINAGRAFQDLGFDSLTAVE
LRNRLKNATGLTLSPTLIFDYPTPVVLAEHLDSRLAGSGGDDQPDLMGRF
NDITRELQALLGAAHWNSDDKAVLRTRIHGLLGALPAGDGPDSAPLDEDL
EAATESQLFAILDEELGR
>MAP1371 pks8, Pks8
MPGTEQHLDYLKRLTADLRRTRRRVAELEGRLSEPVAVVGMACRYAGGVD
SPEALWDLVIEGRDTVSDFPVDRGWDVEGLYDPDPDAKGKMYTRQGSFLQ
HAGDFDAGFFGIGPSEALAMDPQQRIMLEICWEALERAGIDPSALRGTAT
GVFAGVIHAGYGGEVKGELEGYGLTGSTLSVTSGRVSYVLGLEGPAVSVD
TACSSSLVAMHLAAQSLRSGECDLALAGGVTVMATPAAFVEFSRQRALAP
DGRCKVYAGAADGTSWSEGAGVLVLERLGDARRLGHPVWAVLRGSAVNQD
GASNGLTAPNGPSQQRVIRAALANAGLSAVDVDVVEGHGTGTVLGDPIEA
QALLATYGQDRPADRPLWLGSIKSNIGHTSAAAGVAGVIKMVQAIRHGVM
PKTLHVDVPSPHVDWSAGAVSLLTDPRPWPEHGGPRRAGVSSFGISGTNA
HVIVEQAPAAAETEAAPASSAMPDAVVPWVVSARSAEALAGQARRLLDHV
TADAQASPLDVGWSLVSTRAVFEHRAVVVGCERGALATGLAGLASGRPDA
ATVVGRARATGKTVLVFPGQGSQTLGMGRQLYERFVVFARALDEAVAVVD
EHSRLPVREVMWGADPELLQSTEFAQPALFVFEVALAALWESLGVTPDVV
MGHSVGEIAAACVAGVLSLPDAARLVAARGALMAALPAGGVMVAVTAGEA
QVAPLVGGGVSIAAVNGPDAVVLSGEQEAVAAVAERLAGSGARVHRLAVS
HAFHSALMEPMLGGFAAAAAGIEPRPPRIPLVSNLTGQLAGPGYGTPQYW
VEHVRAPVQFLAGVRAAEQAGAGTFLELGPGAALTAAVDQSLSTEGATAI
ATLPKDRPETESALHAAGHLFTRGHRLDWAGVFAGLPARRVELPTYAFAR
ERFWLGGASLAGAPAGAAPVGGGTRAPELAHRLHALPRDEQQRVLRELVC
EHAAAVLGHPDGDAIDPHRAFADLGFDSLIGVELRNRLTTHTGTALSRTL
IFDYPTPTALADHLRRQLLHDEDPESDDERIWSALRRIPLRELRRTGLLD
KLLLLAGIPETATTDPKISDADIDSLSPDALIAMALNSADDDEAE
>MAP1774c pncA, PncA
MVRALIIVDVQNDFCEGGSVPVAGGAAVAPAINAYLDDAPGYDYVVATQD
FHIDPGDHFSDRPDYSSSWPVHCLAGSAGADFRPELDTTRVDAVFRKGAY
AAGYSGFEGVDDNGTPLLEWLRRRGVDEVDVVGIATDHCVRRTAEDAARA
GLTTRVLVDLTAAAAQDSAARALDEMRSAGIELVGVR
>MAP1242 pstA, PstA
MGGDLGESMARDDRAFPLTRGQLDIWLSQEAGFAGTQWQLGLLVKIDGKV
HRDALEQAITQAVAEAEPGRVSFFELDGQVVQKPIDYPHVELAFHDLTDH
ADPVAEAREMSSAIQRTPMPLNGQMFKFVLFQTGHDEFYLFGCCHHIAID
GLGMALVCRRVATIYSAMVAGKPIPDAYFGTVQDLIDLESGYEASPDYAE
DKAYWSEHLPPESGPVDRLPDAEGERDHYSPSASVQLDPSVANRIKELSK
KLAIRRFSVTTAACALLVRGWSGSGSEVALDFPVSRRVRPESKTLPAMLA
GVVPLVLSTAPESTVADFCKHVDKRIRELLAHQRFPVHTLEGDGLRQAPN
RVGINFIPSRLTLDLAGSPATASYTNHGPVGHFGLFFLGAGDQLFLSTAG
PGQPFASFGVADLAGRLQQILAAMTEDPDRPLSSIELLTGDEPALIDRWS
NRPALTEPAPAPVSIPQAFAEHVQRTPDAVAVTFGATSLTYAQLDEASNR
LGHLLADHGVGPGDCVAVMFPRCADAIVSMLAVLKTGAAYVPIDPAHASS
RMDFVLADAAPSAVITTSDLRSRLDDHDLLVVDVHDPAVEAQPGTALPWP
APEDTAYIIYTSGTTGTPKGVAIPHLNVTWLIESLDAGLPPGNVWTQCHS
SAFDFSVWEIFGALLRGRRLLVVPESVASSPEDFHALLVAEQVSVLTQTP
SAVAMLSPEGLESTALVVAGEACPTDVVDRWAAPGRVMLDAYGPTETTVC
ASISTPLTAGDPVVPIGSPIAGAAMFVLDKWLQPVPAGVVGELYLAGRGV
GHGYVRRPGLTASRFVPNPFGAPGSRMYRTGDLVCWGPDGQLQYLGRADE
QVKIRGFRIELGETQSVLAGLDGVEQAAVVAREDRPGDKRLVGYITGTAD
PAELRAQLADRLPPYMVPTAVMVLDALPLTGNGKLDKRALPSPEYAAGEY
RAPGDAIEEILADIYAQVLGVERVGVDDSFFDLGGDSILSMQVVARARAA
GVICRPRDVFVEQTVARLARVSQVAVDGELGAADEGIGPVQPTPIMRWLQ
DIDGPIDEFNQTMVLAAPAGVGVDDVAVVLQALLDRHAMLRLCLDDDGAG
GWDLHVPPPGSVDARAILRTVDVLSEAALARARSRLNPGAGLMLSAVWAS
ATNELALVVHHLAVDGVSWRTLIEDINIAWAQHHSGQEIALPVPGTSFAR
WSSILAEYAKSPAVVAAAAAWQQVVATPAVLPAVGPDDTYASEGQLSASL
DVQTTRLLLGEVPAAFHAGVQDILLIAFGLACTEFVGGGAPIGIDVEGHG
RHEEIASGVDLSRTVGWFTTKYPVALTISQRLDWARVVAGEAALGAVIKD
AKEQLRALPDGLSYGLLRYLNPEIEVQGPDPVIGFNYLGRLGGAAADLSD
EHWRLSPDSPSVSAAAAAIPLPLGHTVELNAGTMDTDAGPQLHANWTWAR
SVLTDEQLNRLSRLWFEALTGICAHVQAGGGGLTPSDIAPTLLDQGRIEQ
LERHYDVADILPLTPLQQGLLFHATGSHAEGDVYAVQLSVTLRGALDPHR
LHRALHTVVTRHPNLAARFCPELGEPVQIIPAEPEMAWRYLELDGGDIDE
QLEQLSADERAAVRELGDRPPFGAALIRTADTEHRFVLTVHHLVMDGWSL
PVLLQEIFACYYGARLPAPAPYRGFVTWLAARDVPAARAAWRAVLDGFDT
PTLVAPRGADAPGRRGVASFRVAAETTSAVSELARRRRTTVNTVLQAAWA
QLLMMLTGQHDVAFGTAVSGRPAELPGAESMVGLLINTVPVRAHATAATT
IADLVDQLQRAHNHTVEHQHLALNEIHRITGQDQLFDTLLVYENYPIDTA
ALSAADDLTATEFSCHDYNHYPLSLQVVPGDELGLRLEFDTDVFDPAAID
TLADRLRKLLAAMPADPDRPLRSLDLLDATEHTRLQRWGNRPALSRPATG
PSLPELFAAQVANAPHAVALRYAGRSMTYRELDEASTRLAHLLAGHGATP
GCFVALLFSRSAEAIVAMLAVLKTGAAYLPIDPALPATRIEFMLGDAAPV
VAVSTADLRARLEAFGLPVVDVAATGAQPGGPLPAPAPDNIAYLLYTSGT
TGVPKGVAVTHRNVAQLLESLHASLPGTGVWSQCHSYGFDVSVQEIWGAL
AGGGRLVVVPESVTSSPDELHALLIAENVTVLSQTPSALAALSPRNLHAA
LVIGGEPCPAALADRWAPGRVMINAYGPTETTVDAVLSTPLAAGAGAPPL
GSPVAGATLFVLDAWLRQVPAGVTGELYIAGAGVAAGYLGRPGLTAARFV
ACPFGDAGARMYRTGDLVRWDRDGRLHYVARADQQVKIRGHRIELGEIHS
ALAELDGVGEVAVIAREDRPGEKRIVGYLTGTADPAAIRARLAERLPAYM
VPAAVLAIEALPLTPNGKLDARALPAPEYAGGAYRAPSTPTEEIIAGIYT
QVLGLHRVGVDDSFFDLGGDSLSAMRVIAAVNAGLDARLSVRVLFEAPTI
AQLAARLGEGGHRFAAVVAAERPAVVPLSFAQSRLWFIGQLHGPSPVYNM
VAALRLHGPVDIGALGAALHDVVTRHESLRTVFAATDGTPAQVVLPPDRA
DIGWQVIDASGWSPARVDDAIRDTARHTFDLAAEIPLRAVLLRCGAEEHL
LVAVVHHIAADGWSLTPLVRDLARAYASRSAGRVPDWVPLPVQYVDYTLW
QRAQFGDLDDPHSLIAGQLRYWEHTLAGMPERLELPTDRPYPVVADFRGA
SVAVEWPAQLQQQISRLARAHNATSFMVVQAALAVLLAKVSASSDVAVGF
PIAGRRDPALDDVVGFFVNTLVLRVDVSGDPTVGELLARVRQRSLAAYEH
QDVPFEVLVERLNPARSLAHHPLVQVMLAWQNIEPTELSLGQVRVTPLPV
DTRTARMDLAWSLAERWAPDGSPAGIGGAVEFRTDVFDTATVEALTQRLR
RVLAAMTADPGRRLSSIDLLDPDEHARLDALGNRAALTRPQNPPTSIPAM
FAAQMARTPHAVALTANGRSVTYRRLEEHANQLAHQLIRYGAGPGDCVAL
LLERSAEAVAAILGVLKAGAAYLPIDPSLPSARIEFMLTDAAPAAVLTST
EFHCRLQDYHQTVIDVDDPSIREQPVTAPPAPAPDNIAYLIYTSGTTGVP
KGVAVTHRNATQLFASLGAAGLPAAPGKVWGQCHSLAFDFSVWEIFGALL
NGGRVLVVPDDVVRSPEDLCALLIEERVDVLSQTPSAFDALQRADSARRL
NPQTVIFGGEALIPHRLGGWLDGHPARPRLINMYGITETTVHASFREIVD
GDIDGNVSPIGMPLAHLGFFVLDGWLRPVPAGVTGELYIAGAGVAAGYLG
RPGLTASRFVACPFGGAGERMYRTGDLARWGADGQLQYLGRADEQVKIRG
YRIELGEIQSALAELDSVEQAAVIAREDRPGDERLVAYVTGTADPAQLRT
ALTERLPAYLVPAAVLVLDALPLTPSGKLDTGALPAPDYQGPEDYLAPAG
AVEEILAWLYAQVLGLPRRVGVQESFFDLGGDSLSAMRLVAAIYNALDIH
LPVRAVFEAPSVRSLSQRLNADPAVAQGLRADFASVHGRDATEVYASDLT
LDKFIDAATLSAAPALPGPGAEVRTVLLTGATGFLGRYLVLQWLERLELA
DGKLICLVRAASDDDARRRLERTFDSGDPALLRYFHELAADHLEVIAGDK
GRANLGLDDRTWQRLADTVDLIVDAAAVVNGVLPYQELFGPNVAGTAELI
RLALSTRLKPYSYVSTANVGDQIEPSAFTEDADIRVAGPIRTIDGGYGNG
YGNSKWAGEVLLREAHDLCGLPVSVFRCDMILADTSYAGQLNLSDMFTRL
LFSVVASGVAPRSFYRLDAHGNRQRAHFDALPVEFVAEAIATLGAQVGRD
AGIGFATYHVMNPHDDGIGLDEYVDWLIEAGYLIERVDDFDQWLHRMETA
LHALPERQRHQSVLQLLALRKARHVPPADPARGCLGPTERFRAAVQEAKI
GADNDIPHITAPVIVKYVTDLQLLGLL
>MAP0644c sseC, SseC
MCSAPKQGVTLPASVDLEKETVITGRVVDSDGQAVGGAFVRLLDSSDEFT
AEVVASATGDFRFFAAPGSWTLRALSAAGNGNAVVTPSGAGIHEVDVKIA