GenColors

Gene list

Applied filters:

COG category: Inorganic ion transport and metabolism
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 237

# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP1110 hypothetical protein
MTAMRLELDRVRLSYNGPAVINELSLVVRPGEILVLTGPSGCGKSTVLRA
LAGLLTPDGGRVLADGVPVTGTSGDRAMVFQDNALLPWRTVRSNIELALR
LRGQPRAGRRAAAERWISELGLAGFGDYLPKSLSGGMRQRVQLARGLAGA
PRAVMMDEPFGALDTRTRAAMQRLLIDTWRTHPTTIVFVTHDVDEALALG
DRIVVLGRAGEPPRASVEVPEPRSDRPHAELRAQIIDALNHAEAA
>MAP1809c hypothetical protein
MSIATALDAAPKRLGGAASGAPGGLWARRKWEVVRLVSPLALLALWQLGS
AIGVIAQDVLPAPSLILQAGVELTRNGQLADALHISTVRVVEGLALGGVI
GIVAGAAVGLSRWVEATVDPPLQMIRALPHLGLIPLFILWFGIGELPKVL
LVALGVVFPLYLNTFSAIRQVDPKMLETAQVLGFSFFQRFRRILVPGTAP
QVLVGLRQSLAIAWLTLIVAEQINADKGIGFLINNARDFLRIDIIIFGLT
IYALLGIITDAVVRLIERRAVRYRH
>MAP2018c hypothetical protein
MVTDRFDAIIVGAGFGGIGAAIQLKRLGFDNIVILDREDDLGGTWHVNHY
PGIAVDIPSTTYSYWFEPNPGWSRLFAPGGEVKQYAADVADKYDVRRHMR
FNTTVEGAQWDEDAEVWRVALAGGESLTTRFLITATGFLSQPHTPDIPGI
GSFGGKVVHTTAWDHDYRYQGRRIAVIGTGASAVQVVPELAKEAGELTVY
QRTATHVLPKVDFEFDPAVRRLFARVPAAQRALRWVTDVLLEIIMIVGAL
HFKESRGRGNISASDLAKINRFRWIRDKELRAKLTPDYDCGCKRPTFSNS
FYRVLTQPNVHLETNPIERIEPDGIVTADGRKTVIDTLVLATGFDLWEAN
FPAIEVIGRKGRNLGKWWRETRFQAYQGVSMPYFPNYLSLASPFAFSGLS
FFHTIEYQMRHMDRLLGEVKRRGATIFEVTEEANDRFMERMTKLLDNSVF
YAGNCATSRSYYFSPSGEASLLRPTSTLNSIREASSFPLSDYVIA
>MAP1075c hypothetical protein
MYARSTTIQAQPLSIDIGIAHARDVVMPALTEIDGCVGLSLLVDRQSGTC
IATSSWESIDAMRGSAARVAPIRDRAALMFDGSARVEE
>MAP3141c hypothetical protein
MARFPKPPEGSWTQHYPELGTAPVSYEDSINPEVYELEREAIFKRAWLNV
ARVEQLPRKGSYLTRELKVVNTSIILVRNGSGEIKAFHNVCRHRGNKLVW
NDMPLEETRGVCRQFTCKYHAWRYDLDGNLTFVQQEEEFFDLDKSRYGLV
PVHCDVWEGFVFVNFAKTPEQSLREFLGPMITALEGYPFDKMTSRWCYRS
EVKANWKLYMDAFQEFYHAPVLHANQSPTAYSKAAAEAGFEAPHYRLDGP
HRLVSTSGVRAWEMSPEMRKPIEDICRSGLFGPWDKPDLGPMPDGLNPAK
CDPWGLDSFQLFPNFVILFWGQGWYLTYHYWPTSYHSHVFEGTLYFPVPR
TPRERVAQELAAVSFKEYGLQDANTLEATQTMLESRVVDKFVLNDQEILL
RHLHKETAAWIDDYRRMSTAGV
>MAP3181 hypothetical protein
MIVWSALSLVALLGGIGVMFAVYGRWSQKVGWHSAETSTLSFRQPGDVGL
TPAQRACVWFFAIVSVLFLAQTLLGAAAEHYRADLSNFFGLDLARVLPYN
LARTWHLQLSLFWTAAAFLAGGIFLVPFIARREPKRQALLAYILLGAVAV
VVFGSLICEALSIYGVIPAGGLFSQQWEYLDLPRLWQILLIAGMFMWIAI
IWRGIRGRLKGESKMNMPWLFFFSGLAIPAFYTVGLLAGSDAHYTVADFW
RFWVVHLWVEDFLELFTTVMVAYMFVLLGVVREKIALGVIFLDVILYSAG
GVIGTMHHLYFSGTPVEHMALGAFFSAAEVIPLTFLTVEAWAFLQLGARQ
QSGDANPFPHRWAVMFLVAVGFWNFLGAGIFGFLINLPVVSYYQIGTALT
ANHGHAAMMGVYGMLAVGLAMFAFRYVIPADKWPEKLARISFWCMNIGLA
WMVFATLLPLGVLQLYHSVNDGYFEARSLGYITKPGNAVLEWLRLPGDVI
LIAGGVLPFVWIAWTALRNFRSGTTVQELPEHPLYTESPVMSEAGAAAAK
D
>MAP0388 hypothetical protein
MQSAASVRDTMDSAKRWVTLGFTWNGLRALGVPGDALASFPEEFRQGMAA
RADILGDTGRNHPDNWVGGLAGADLHAIAILFARDDAEHARATNAHDDLL
KRCQGVRRLSHLDLNATPPFNYAHDHFGFRDRLSQPVIEGSGEEPTPGSG
APLKAGEFILGYPDEVGPVANQPEPEVLSRNGTYAAYRRLREHVAVFRDY
LRSVAGADRQEELLAAKLMGRWRSGAPLVLAPDEDDPELGADPLRNNDFN
YKEMDPFGYACPLGAHARRLNPRDTAHNMNRRRMIRRGATYGPALPEGAP
DDGEDRGIAAFIICASLIRQFEFAQNVWINDRTFHELGNEHDPICGTQDG
TLDFTIPKRPIRRVLKGLPAFTTLTGGAYFFLPGINAMRYLAALGERS
>MAP1594c hypothetical protein
MYVCLCVGATNQTVSEVVARGATTSKEIAAACGAGGDCGRCRRTLRAILA
ASAKLETAPSV
>MAP1309 hypothetical protein
MMPSQPLARGGRHRRHVAAGTAVVLSGALGYIGLADPHDPASIYPPCPFK
WLTGWNCPFCGGLRMTHDLLHGELWAAVHDNVFLLAAVPTLAAFLLVRRA
RGRRSLPAAAVPAVVVATLVWTVLRNLPAFPLFPTVLGG
>MAP2654c hypothetical protein
MTAVQPDSTVGTDIWSTSRRLSMGDDACADQWQALGMLASALAGRAVAVA
GLPPGEPAWTDGQTIYVDAGAPGALKSLAVQASMIAAGSLRPDVVGRLLR
HRKLAQRYLTVEGHRALVANAPVLPRVLASLGDRGVAGRSDSPRASLALA
AARVALPDPAPEFGVIRAGKVLAACGRAARPDQPDDKAAPAHVPRRDGAA
DLAELDDAAVDDSDDPDLFTSPVGGGGALGKWLKKLLSSARKTGTGGGPP
GADSPTHRTDSGKRGAYAVASLASAPADEGNDEDPADGVRYPEWDVARGS
YRPAWCTVREVEPAITARATLAVDDAIAVRRPLARLGMGLHRRHRQPQGD
DIDIDAAVEARVEVRAGSVPDEAVYLDSLRRRRDLSVLLLLDVSGSAAEP
GTVGRTVHEQQRAAVADLAVALHDLGDRVALYAYYSQGRRAVSMVPVKRF
HDQLNAQVIRRLNSLEPGAYSRLGAAIRHGSAILEARGGTSRRLLVVLSD
GLAYDHGYERAYGAADARRALTEARRRGTGCVCLTVGAGTDVQSLRRVFG
TTAHATMARPDQLAGVIGPLFRSALRAAEVRRRTAANARAGTATSRRPHT
SAHVGAG
>MAP1484c hypothetical protein
MAVETSKPHTGVNATPVPVPWAVQTPDRIPKQRYYDPEFYALEKEMFWPR
VWQMACRLEEIPKPGDFVEYEIHDESVIVVRLDSQTVRAYHNACRHRGVK
LVEGNGNRRTFVCPFHGWCWSLDGRNTFVLRPETFAQENLAAADLRLVSV
RCELWGGCAWINLDDDAPALRDWMEPFASTYDAWRVESLRVEWWQSCRLP
VNWKLATAAFMEGYHVPQTHPQLLPGAQTGEPSAAVHPVVASSLYFMRTL
GEGMAGMTHQNDIRIAEGLQTMRLPSDPAAAMAAWRSALNDAVVDWHRAR
GSGMPDLNELDRRGITDAIGFAFPHHFILPTYSSASSYRIRPLGPEETLF
EIWSLTRLPGDASAGKPTPPEPMAPDDPRWPPIPAQDFSNLPRQQKGLHS
KSFEFMRLSDRIEGLISNFERVIDGFLAGLRYDALLPAIHKTNTTIDVPI
VDLGFGAVEAR
>MAP3731c hypothetical protein
MIRIDGVRWQYAGTDAAVLDGVDLHIRRGETVLLCGASGSGKSSVLRLMN
GLIPHFHQGSLDGSVHIDGTSVAELSLERVGRLTGTVLQHPRRQFFTAAV
DTELAFTLENFGTPPEQIRNRVGSVITEYGLAELTGHRLAELSGGQQQQI
ACAAAATHGPPLLLFDEPTANLAADAIERFTATLARLRSLGTTIVIAEHR
LHYLREIADRIVLLRNGRIAAEWSRKQFARLDDAALNAEGLRSNNSPVRN
HIPPACAYGASVAGTPSGTAAPASSPSEVVLRGIRCCFRGHRVLDIEEAR
FPAATVTAITGPNGAGKSTLARVLVGLQRHDGEVSFGGSRISRSRRQRMS
AIVMQDVQRQLFTESVRAELRLGAPPAAAGVASTLLRDLGLEEFADRHPL
SLSGGQQQRLVVAAARLSNRKIMVFDEPSSGVDRRHLRSITNVMRDVAAQ
GVVVILISHDQELLTLAADQELRMRVADTLNARSRRKAAGENACLETLSD
>MAP3594 hypothetical protein
MSELSIGIIGAGPGGLALGILLSQAGFGDFTIFDREDGVGGTWRINTYPG
LACDVKSHLYSYSFDLNAHWSRLWSGQPEILDYFQRCADKYGLGPHLRLG
TEIRSAHWDADTQRWRLTTASGHRHHFDVVVSAVGLFTRPLLPELVEEEP
FTGTVMHSARWDHSIPLHGKRIAVLGTGSTASQLIPELAKVAERVYSVQR
SPTWILPKPDRPYTQWERWAFAHLPLAKKLYRTRLWLRSESNISVIEHGS
EKTGQFTNIALGLLEASVPDEELRRKLTPDHPMGCKRLVFSSDYLPALTR
PNVEVLTSPARSLRRRSLVTEDGTEREVDLVVCATGYAAADYLGELDVTG
ERGVTLREVWRDGAYAYLGMAVPGFPNFFMLYGPNTNVGSNSVIFVLEAQ
ARYIVRALKYLRRHRRRYIAVRPAALADFVAKIDRWMVGTVWTTQCSNYF
RAPNGRVVTQWPRSARAFWSMTRRFRPADYRFQAPAMRVPAPASEADAR
>MAP3630 hypothetical protein
MSTVHSSIDHHPDLLALRARYERVAESMSAHFTFGLALLAGLYVAASPWI
VGFSATASLATSDLIAGIAAAFLAYGFATTLDRAHGMTWTLPVLGAWVIV
STWILPGVVLTAGMTWSNVVAGALLTFLGLNATYFGMRTRASAG
>MAP3773c hypothetical protein
MSSPAAPRRRRATVKQRTVLEVLRAQENFRSAQQLYQDIRQNQQLRIGLT
SVYRILRALAADRIAETQRAEDGEILYRLRTEAGHRHYLLCRQCGRAVAF
TPVDIEEHTRRLSRQHHYADVTHYVDLYGTCPLCQNTQP
>MAP1463 hypothetical protein
MPARRPGLRRVVASGAPRGPSMTDVSDVLVIGAGFGGLYAVHRAASSGLS
VTALEAAPDVGGTWYWNRYPGARCDVESVDYSYSFDEELQRSWQWTERFA
AQPEILAYLRHVADRFDLRRHYRFGADVVDAAFEHGRWRVGTSNGQTFAA
RFLICATGCLSAVNRPDIPGAQDFSGEVYFTAAWPREDPDLRGKRVGLIG
TGSSGIQATPIIAAQAESLVVFQRSANYTIPMPNRPFSAEEQQRIQEQYP
ERRRRSAYATSGTPHGMYHKNAVDTDPAERAEALWKRWREGGVLFAKTFP
DQTSDPAANDIARTFAEERIREIVTDPDVAADLIPVDHPIGTKRICTDDG
YYATFNRDNVRLVNLRREPIEAITADGMRTSTTTYPCDVLIFATGFDALT
GALTRINPTGPRGDRLRDIWADGPLTFLGMMVPGLPNLFSISGPGSPSVL
ANMVLHAEVQVDWVVDLLCATRRLGVTEVEPRRDAAVAWTRYVAEVAERT
LFPKAASSWYLGANIEGKKRVFMPYIGGFGTYRRHCEQVADRGYAGLVLT
TR
>MAP0488c hypothetical protein
MSLQAAAAVDHDADTVSLRGARLAFGDRVLWEDLDLSVSRGEFVAVLGPN
GSGKTSLLKVLLGQLPLSAGVGLVDGKPITSGSGRIGYVPQHRPMERDVM
LRGRDLVRLGLDGGRWGAAPLRPRERARRRATVDQALRQVNGELLADVRV
GVMSGGELQRMRVAQALVSDPLLLLCDEPLLTLDPANAKLVSALLERRRR
DAATTVIVVTHEINPILPYVDRVLYLVDGRFRIGTVEQVMNSETLSALYR
ADIQVVKVKGGYVVAGEHTDGHG
>MAP4098 hypothetical protein
MARPWNGLWLPPDAACMLGRMTRNQLTEQIVVARLAKGLTWQELADAIGR
PLLWTTSALLGQHPIPAELGRILVDKLGLDESAVPVLAAPPMRGGLPTAV
PTDPTIYRFYEALQVYGGALKEVIAEQFGDGIMSAINFSVDLQKKPHPSG
DRVVVTFDGKFLPYQWVSSEQ
>MAP1450c hypothetical protein
MTSGSDRRATPDVDVVVVGAGFAGLYALHKLRSNGLRVRVFEAGPDVGGT
WYFNRYPGARCDVESVDYCYSFSDALQREWDWSEKYATQPEILAYINWVA
DRLDLRRDITLNARVNSAVLDEAQLRWTVTTEAGERVTARFCVMATGPLS
AAMTPPFPGLDTFAGQVYHTAAWPHEPVDFTGKRVAVIGTGSSGIQSIPI
IAEAASQLYVFQRTPNYSVPAGNRPLSDSDRAEVKAHYAERRRMSWRSGG
GSPHVAHPKLTMEATPEERREAFEKRWELGGVLFSKTFADQMIDPVANEE
ARKFYEEKVRAVIDDPALADLLIPNDHPIGTKRICTDSNYFQTFNRPNVK
LISVRKTPIMSIDATGINTTDAHYDLDAIVLATGFDAMTGALAKIDIVGR
DGRRLSDDWSGGPRTYLGLGVDGFPNLFLVSGPGAPAVLANMVLHAEANV
NWIADCIAYLDAHDYTAVEATTDAVDDWGAECARRADATLFTKADSWYLG
ANVPGKPRVFMLFVGGFGVYLDICAEVANAGYKGFSLVKAR
>MAP1864c hypothetical protein
MRRIRFSGPRLQGLGWTPGQHIRLQVESLRESMLRLHPYPVLRTYSIYAA
DPDRGALDIVMVDHDGDPKGATPARRWAMAATLGDHVRMTRPQGKFVIRD
DAPYHVFVGEETASVAFAAMLRSLPPTAEVYGVVEAATEADHLPRARPLE
RVERGGAPAAKSAVLADALRRLPLPDHPGVAYLAGEARTVQALRQILITE
RGWKRNQVRTKPFWSPGRTGME
>MAP1792 hypothetical protein
MRGVWLQRRALRAGLSVRPMIVTLIGYLVLIDGMLNSLGWALDLVANHTL
INRVLMVGWGNMFDAGYFWHYNELWIGGAAGPGEKAYVAGLILTVFSMRV
AAAIGFLQMKRWGHQWMVVTCWMGVVIWSAYVFNMTMFADVRYAGVVFPV
IGWWLYDIFYITPFLAIPYLHTVNRETFSD
>MAP1807c hypothetical protein
MIRGVVALTATVGLLLTGCVSRTTTSGPTPPPAAVPLSDLSGLTLQVGDQ
KGGTEALLRAAGQLDNLPYRVAFSTFTSGPPQVEAATAGKIDFAITGNTP
PIFGAASNARIKAVSAYGGGGAGNRILVHADSPITSVSDLRGKAIAVAKG
SSSHANLLAQLDRAAIKPADVKFVYLQPADALSAFSQHQADAWAIWDPYT
AQAEQQIPVRSIAEAQGVTNGDWIGVASDQALADPKRNTALGDLLVRFET
AVRWARAHPQQWAQSYAATVGLDPQVAAVSQARSLRLPTELGDDVVASEQ
KLADLFAAAGQLQSAPRFANWVDRRFNAALRPGLVS
>MAP1336 hypothetical protein
MQGVAGGLLAGLGYAVINSALPRWLWTRGSALVSAMWGVATVVGPATGGL
FAQLGIWRWAFVVMAVLTALMALLVPVALARVDPAPAIPRMKVPVWSLLI
IGVAALAVSVAQIPHNTAATFGLLAAGIMLVGLFVIVDWRMHAAILPPSV
FSPGPLKWIYLTMGVLMAAAMVNTYVPLFGQRLAHLTPIAAGFLGAALAL
GWTVSEIVSASLENPRTVGRVVMVAPLVAASGLALGAVARHGDGSAWTAA
LWAVALLVAGTGIGMAWPHLSARAMASVNDPAEGGAASAAINTVQLTSAA
IGAGLAGVVVNTATGGDEMAAHLLFTVFTALSAAGVAVSYAATRATRQAQ
PVGNVG
>MAP3350c hypothetical protein
MDVKEVLLPGVGLRYEFTDHKGDRVGIIARRSGDFDVVVYAREDPDEARP
VLHLSNEEAEAVAQILGAPRIAERFTELAKEVPGLETGQVHILAGSPFVD
HPLGDTRARTRTGASIVAIVRDDEVLASPGPSEMLHARDVLIVIGTEDGI
AGVEKIIDKG
>MAP3180 hypothetical protein
MTAPETSAQQTSSQPLVSRGWIQGVALVMIFGFLVMGILAYRTYSASMPM
PDKVVSESGRLLFTGADITRGQELYQARGLMEYGSVLGHGAYLGPDYTAE
YLRTATQDVADQLRAQGVADPRERVVTEFRTNRYHPDTKTLVFTDRQAAA
FDHIQDRYGAYFGENSTKYGCCRT
>MAP3076 hypothetical protein
MNIRALLRQLRPSVRAKDWPLQVIPRTPWADQRPTFREAQPAVIDAALQR
CRRQPTGNWYAFAASTHVERGRPLGARVAGVDLVAWRDARGALCVGPRSC
PHLGADLATGTVCGGTLICRWHGLPLDGRAREFGWAPLPSHDDGTLAWVR
LDAVGGETPSPRPVIPPRPAGHTLASVAHLVGVCEPADIIANRLDPWHGA
WFHPYSFTRLEVLATPTEDANRFLVAVTFRMGRLGVPVVAEFDCPEARTI
VMRIVDGEGTGSVVETHATPIGSGPDGRPRTAVLEAVIAHSDRPGFARAL
WAAPLLTPVMRYAAGRLWRDDLAYAERRYEVRSQNR
>MAP1423 hypothetical protein
MIGLGGVWVLDGLEVTMVGNVSARLMEPGSGIALNPAQIGMAAAIYIAGA
CSGALFFGHLTDRFGRRNLFILTLALYLIATVATAFAFAPWYFFLTRFFT
GAGIGGEYAAINSAIDELIPARVRGRVDLVINGTYWLGSAAGAGGALILL
DTSNFAADLGWRLAFGIGAILGIFVLLVRRNVPESPRWLFIHGREEEAEH
IVGEIEEAVQQQTGRPLPEPQGKALRIRQRTAISFREIAAVAFKLYPRRA
VLGLALFIGQAFLYNGVTFNLGTLLSQFYAVPSGMVPVFFVLWALSNFAG
PLLLGHLFDTVGRKQMITLTYIGSAVVVVALALVFLTQAGGVWAFIGVLI
VAFFLASAGASAAYLTVGEIFPMETRALAIAFFYAMGTAIGGITGPLLFG
QLIDSGQRDHVVWSFLIGAVVMAAAGLVELWLGIAAEQRPLEDLALPLTV
DDAEDTEPQGDSAPVD
>MAP3726 hypothetical protein
MTLIKRCVRGNGWRTTLLMLLLLTAVAASLMVGRYPVGVGAMAGMLFGRL
PLLDTSFTPVDQTVLTQIRLPRIGCGVLVGAGLAASGAGYQTMFRNPLVS
PDILGVSAGAGFGGALALLLHAPYWQLEAMAFASGLLAAALALIIGRGIG
RDSAILLVLAGMVIASVFGALISVTEYLANPDDTLPAIVFWLMGGLGRQH
LDGLLAPALIIAAAVLVLYALRWPVTVVVSGDEDAHTLGVDTRRTWAAVV
GVYTLITATTVSLAGIVGWAGLLIPHIARALVGPGFGRLLLVSAALGGVF
VVGVDDVARAAASAEIPLGILSALIGAPFFLVVLAKMRRQWT
>MAP0619c hypothetical protein
MLSNAMDEAPYAAAKTPPHAPTGQPGATEREYPDKLDAALLRISGVCILA
TVMAILDVTVVSVAQRTFIDQFSSSQAVVAWTMTGYTLALATVIPITGWA
ADRFGTKRLFIGSVLAFMLGSLLCALAANVLQLIVFRVVQGIGGGMLLPL
GFMILTREAGPRRLGRLMSILSIPMLLAPIGGPILGGWLIDTSSWRWIFL
INVPIGLLTVALAAVVFPRDHPARSETFDAVGVLLLSPGLATFLFAVSSI
PGRGTVADRHVLIPAAMGLTLIAGFVGHAWHRADHPLIDLRLFRNPVLTH
ANVTMLVFATAFFGAGLLLPSYFQQVLHQTPMQAGVHMIPQGLGAMLTVR
LTGPLVDRQGPGKVVLVGIALITAGLGAFAFGVARQAPYLPTLLAGLAIT
GLGMGCTMMPLSVASVQALAPHQIARGTTLMSVSHQVGGSMGTALMSMIL
TNQFNRSPNIVAANKLAALHQKAAAGGTPIDQSAIPRQSLAPGFWGNVLH
DLSHAYTAVFVIAVALVVCTIIPASFLPKKPATETAGK
>MAP3508 hypothetical protein
MMTDVAKDANSAVAEELSTPMTIGVEAYISEDYARAERDKLWRKVWQQVG
RVEELPEVGSYLTYDILDDSIIVVRTGANEFRAHHNVCMHRGRRLIDTPE
GAKNALGRTRKSFVCGFHGWTYGLDGACTHIREQQDWRQALTPDNTHLRP
VRVDTWGGWLWINMDPDCEPLADYLFPAAKILEPFGLENMRYKWRKWLYF
DCNWKVALEAFNETYHVYTTHPEFNKFGEFKGWAKAQGRHSNIGYDAPED
MEATKSKIRLGIGADPRVSTAEMQVYTMEETNATTTQTLVNAAKRLVDEL
PEGTPADKVLEHWLASARRDDEARGVIWPTIPPDILGQAGTAWQIFPNFQ
IGQGLTSALCYGARPHPSYNPDKCIFEVSVFELYPKGEEPQTEWEYTPVG
DPRWRSVLPQDFSNMAAVQQGMKSLGFPGTKPNPYRERSTVNLHYQLSRY
MGTGAPRELSDKEHPLA
>MAP1137c hypothetical protein
MGRAGPGHQAPGELVTAQTGRRVAISAGSLAVLLGALDAYVVVTIMRDIM
TDVHIPINQLQRITWIITMYLLGYIAAMPLLGRASDRFGRKLVLQVSLAL
FMVGSVVTALAGHWGDFHLLIGGRTIQGVASGALLPVTLALGADLWAQRN
RAGVLGGIGAAQELGSVLGPLYGIFIVFLFHDWRYVFWINVPLTLIAMVM
IQFSLPSHEKVEQPEKVDLVGGVLLAVALGLAVIGLYNPEPDGKQILPSY
GLPLVLGAVVVGILFLLWERFARTRLIEPAGVHFRPFLAALGASLFAGAA
LMVTLVDVELFGQGVLGQDQTQAAGLLLWFLIALPIGAVLGGWIATRVGD
RAMTFVGLLIAAYGYWLIHYWRQDVLSQKHNVLGLFSVPVLHADLLVAGV
GLGLVIGPLTSAALRVVPSAQHGIASAAVVVARMTGMLIGVAALSAWGLY
RFNQIVANLTAAIPPNASLLERIAAQGTMYLKAFAMMYGDIFAATVVICI
AGALLGLLIGGRKEHAEEPEIVEPQAVSLGER
>MAP3690 hypothetical protein
MARMPAPRSDRNALDFFCAVIIVGVLAGIAGVATTLVLRVVQHATYHYSF
GALLAGVAASSPIRRVLGPMVGAALAGLGWWLLRRRTDVPPLAETIARSE
RVPRLAWSIDAVLQVLLVGSGASLGREGAPRQFASALGDFGIGWLRRLSS
RDREILLACAAGAGLGAVYAVPLAGALFAVRILLRTWRLRAVGAAFLSSG
VAVAVGSAVTGDQPNLRWPVEESTYLLTAHGLLLAPVALAVGLVFNRLMA
VARPARPMRTWTLIPALAGAGLVTGVCSHWWPELPGNGRSILTVSLASGM
TLASALAILLLKPVLTALFLRAGGAGGLLTPSLATGAAAGAALVLTINWA
TASQLHVPAVSLAGAAGVLAVTQGSPIWAAIFVWELARPPLWLFLLFLLT
ATAAHGLKVLAQGRTTTHAG
>MAP2441c hypothetical protein
MPARSMVVMPHGPVAHQAGTRGYRRMTAALCGAGLASFAAMYCSQALLPA
LSAHYRIGPATAALTVSLTTGALALSIIPASVLSERYGRIRVMLISGVAS
SVIGLLLPFSPSLGVLLFGRAAQGVALAGIPAVAMALLAEEVDASSLGSA
MGRYIAGTTIGGLAGRIVPSVVVQVGTWRVALLACSLITLAGTAVFAVLV
PRSRFFTPKPASVRAALRNLAGHLRNPVLAKLFAVGFVLMGGFVTVYNYL
GYRLAARPFGLAPSVVGLLFLLYLVGTGTSVVAGRLADRRGRPLVLGAAL
PIAVAGLLLTVPATLAAIVAGVGVFTGGFFAAHTVASGWVGAVAQRDRAE
ASALYLFSYYLGGSVAGAFGGVLYGVGGWSATVCFVVVLLMAGAALVALL
VRDNGFRIGRRVVTSVASVK
>MAP2534c hypothetical protein
MTTATPRTPGGGRRAPSGPAPGAHRWDLITRSSAHSQNPWNPLWAMMIGF
FMIMVDSTIVAIANPTIMADLHIGYDTVVWVTSAYLLGYAVVLLVAGRLG
DRFGTKNLYLIGLAVFTVASVWCGLAGSAAMLIAARVVQGVGAGVLTPQT
LSTITRIFPPERRGVAVSVWSATAGAASLVGPLAGGVLVDGLGWQWIFFV
NVPIGVLGLALAYWLVPVLPTQSHRFDLVGVGLSGVGMFLIVFGLQQGQA
AHWQPWIWALIVAGVGFVTVFVFWQSVNVREPLIPLVIFADRDFSLCNIG
VAIISFAATAMMLPLTFYAQAVCGLSPTRSALLIAPMAIANGVFAPFVGK
IVDRYHPRPVLGFGFSLLAIALTWLTFEMSPATPIWRLVLPFFAMGVGMA
FVWSPLTATATRNLSAQLAGAGSAVYNSVRQLGAVLGSAGMAAFMTWRIG
AEMPGQPAGGGEDSAGPVLPEFLRGPFAAAMSQSVLLPAFIALFGIVAAL
FLVGFRPWAHRDGGTDAFGPDDYGGDYDDDYDDDDAYVELILVREPEPEA
QQRAGQPRRPQPAPAPPADVRRRDPVESRRSVLDERPAQVQPIGFAHNGS
HVDGGKRLRQVAVRRAPKPGPPADRFTRPPRRHHPGPAGHHLGEGESLRG
QHHRPDPDDDPTGYGRHSSGN
>MAP1725c hypothetical protein
MSAVAGFAAVDVGGFAYAGGWLRPDSPLTPPRFADRFEHVYGRHDGFRRN
HAKGLSATGSFTSTGAGAAICRAAVFREGTVPVVGRFSLGGGLPDQADKP
ETVRGLGLLFDAGGQQWRTAMVNVPVFTDSTPEGFYERLLATKPVPSTGK
PDPQKMAAFLDRHPETAAAMKIIKQSPPSAGFADTTFYGLNAFLFTNSAG
ATVPVRWSVVPHDGGVAGAPGPTRGKDFLFDDLIRTLAQRPLKWRLILTL
GEPGDPTHDATKPWPQSRRTVDAGTVTITAVHTEEEGNARDINFDPLVLP
DGITPSDDPLPAARSAVYARSFTRRAEEPKSHSEVNVTRVLP
>MAP0835c hypothetical protein
MTATPRACNRDRVALQAVHFFMADMEAGMGPFLGVLLQSRGWTTGAIGAA
MTLGAIVGMVTVAPAGALVDATRHKRGCVIVVGLAAVAASAVILTSRQFW
TVATAQAVMCISGATIAPAMIGITLGVVGQAAFTRQNGRNQAYNHAGNMA
GAALGGVAGWVFGYAGIFWLAAGFAVATIAAVLAIPAGDIDHHVARGEAR
AAGEAPVKAMRVLARSRPLLVLAAAMVLFHLGNAAMLPLYGLAVVATHAN
PFTTVASTVVVAQAVMVPASLLAMRIAATRGYWPAILIALTALPVRGVLA
ASVITSWGVIPVQVLDGIGAGMLSVAVPGLVARILDGTGHINVGQGAVMA
AQGLGGALSPVLGGAVAQHLGFRAAFLLLAGLSLGALIIWVTFAPMLRRA
ARLPAAPSDRAGAPPTATADNQAK
>MAP2734 hypothetical protein
MPGVLENVRRGMIPAHIYNDPELFALEKRRLFARAWTFVGHESEIPHDGD
YMVRRVLDDSFIITRGSDGRVRAVFNMCLHRGMQVCRAELGNASNFRCPY
HGWTYRNDGRLTGLPFHREAYGGDDGFVKDQTLLPAPNFAGYDGLLFISL
DPAAPPLQDFLGDFRFYLDYYTRQSAGGVELRGPQRWRIRANWKIGAENF
AGDMYHTPHTHASIVDIGLFREPKAQKRKDGATYWAHRGGGTTYKLPPGG
FEERMRYVGYPDEMIGRIKQVWTPRQQRVVGEDGFMVSAATCFPNLSFVH
NWPKVRDGRDDETLPFISIRLWQPISENETEVCSWFAVDSAAPAQYKQDS
YKAYLMCFGSTGMFEQDDVENWVSLTTTAGGSMARRLLLNSRMGLYDDGR
PVVEALAPSAFHGPGRAQVGYNEHNQRALLAMWADYLQEGDACENR
>MAP2053 hypothetical protein
MSAQMTPVMQAASEFALIGGIGFTAFGIYLSVRRRRLHPLLLLCISAMSF
SWIEAPYDWAMYAQFPPAIPRMPSWWPLNMTWGGLPLFVPIGYISYFVLP
AVTGTALGRWLSARFGWRRPQTLLVVGLVVGFCWALFFNGFLGAKLGVFY
YGRVIPGLAIREGTVHQYPLYDSVAMAIQMMLFTYLLGRTDAQGPNVIEM
WAEHRSKSRVGASVLSVVAVVVVGNALYGAVFAPHLVTKLGGWVTAGPTG
ELFPGVPNQPR
>MAP4060c hypothetical protein
MQDIQRADDALPTASRAERLRGVIRHDLPSSLVVFLVALPLSLGIAIASN
APVLAGLIAAIVGGIVVGALGGSPLQVSGPAAGLTVVVAGLVSDFGWGVT
CFITVAAGAVQVLLGLSRVARAALAISPVVVHAMLAGIGITIALQQTHVL
LGGKSKSTAWHNLIGLPGQIIGAHRPGVLLGVLVIVILVAWRWVPAKVRR
VPGPLVAIVAVTVISVVFPFHVRRIDLDGSPLDALQLPDLPHGNWSGVAV
GVITVALIASVESLLSAVSVDRMHNGPRTDFNRELVGQGAANMISGAVGG
LPVTGVIVRSSTNVNAGARSRASAIMHGVWILLFTIPFAGLVDEIPTAAL
AGLLIVIGIQLLKPAHIETAMKHGDLAVYVVTAVSVIFLNLLHGVMIGLA
LAIALTGWRVIRAKIEAAQLDGEWRVTIEGAACTFLALPRLTRVLASVPR
GATVTVAIAVHYLDHAAHQAITDWQRQQEATGGTVRIEGAVVATGRRDEP
QVEAEMPGAAA
>MAP3954 hypothetical protein
MIPLPRPWVLAGAMLIGSAVGLLAGVAFTVAVQAHVRPDLAIAAVVGIPS
VIGLALILFSGRRWVTTLGAFVLAVAPGWFGVLVALRVTAGG
>MAP3560 hypothetical protein
MSEFTIPGLTDKQAARLTELLQKQLSTYNDLHLTLKHIHWNVVGPNFIGV
HEMIDPQVEAVRGFADDVAERIAALGASPQGTPGAIIKDRSWDDYSVGRD
TVQAHLAALDLVYNGVIEDIRQYIDETDELDQVTQDLLIGQAAQLEKFQW
FVRAHLESAGGQLAHKGKSTERAAAQSARGKS
>MAP0999c hypothetical protein
MTVTAIDPAEQAHPAPSAGTKRVQGGLLDPKMLWRSTPDALRKLDPRTLW
RNPVMFIVEIGAAWSTVLAIVGPTWFAWLTVIWLWLTVLFANLAEAVAEG
RGKAQAETLRRAKTQTMARRLRDWAPGSTGIEEAVSATALQQGDIVVVEA
GQVIPGDGDVVEGIASVDESAITGESAPVIRESGGDRSAVTGGTTVLSDR
IVVQITQKPGESFIDRMIALVEGANRQKTPNEIALNILLAALTIIFVFAV
ATLQPLAIYSKVNNPGVPDTQALNTSGVTGIVMVSLLVCLIPTTIGALLS
AIGIAGMDRLVQRNVLAMSGRAVEAAGDVNTLLLDKTGTITLGNRQAAAF
IPLAGVPPEELADAAQLSSLADETPEGRSVVVFAKQHFGLRARTPGELSQ
AQWVAFSATTRMSGVDLDGHSLRKGAASSVAEWVRSQRGSVPHQLGEIVE
RHLRRRRHTTGRRRERRRPGTGARCHPPQGRGEAGHAGTVRRDAADGHPD
GDDHRR
>MAP0146 hypothetical protein
MAWQLATAVTGFFAPSQLPPPGDVVAALSDLARHGELWTHLRASGWRVLA
GYASGAAAGLALGSLVGLSATARRLLAPTVAAFRTVPSLAWVPLLLLWFG
IDETPKILMVAIGAFFPVYTTTASALSHIDAHLVEVGRAYGRHGVSLLTA
VLLPAAAPELVNGLRLGLANAWLFVVAAELIASSKGLGFLLIDSQNSGRT
DVMLLAIVLLAGLGKLSDAALGAVETRIARRRG
>MAP0720c hypothetical protein
MPHFPKPAAGSWTENYPELGTGPVDYTDSIDPAFFEAEREAIFKKTWLNV
GRVNRLPRTGSYFTRELPSAGKGTSVIIVKTKDGSVKAYHNVCRHRGNKL
VWNDFPNEETSGACRQFTCKYHAWRYSLDGDLTLVQQEEEFFDLDKSNYG
LAPVRCEVWEGFIFINFDDNAAPLTDYLGPLAKSIEGYPFGEMTETYSYR
AEVGSNWKLFIDAFVEFYHAPILHQGQYTKEEAAKIQKFGYEALHYELAG
PHSLQSTWGGQAPPPDMSMVKPMDQVLRSGLFGPWDKPEIIEKLDLPPGV
NVKRVPQWGIDSWLFYPNFMLLIWEPGWFLTYHYWPTAVDRHIFESTLYF
VPPRNARERLAQELAAVTFKEYALQDANTLEATQTMIGTRAVKEFLLCDQ
EVLIRHLHKTTGDYVKEYQNNGHVAAR
>MAP3136c hypothetical protein
MTRSGRPPEGSWTEHYPELGTGPVSFKDSTSPEFYELEREAIFKRAWLNV
ARVEELPRVGSYLTKEIEAARTSVIVVKGRDERIRAFYNVCRHRGNKLVW
NDFPGEETRGTCRQFTCKYHGWRYDLTGALKFVQQESEFFDLDPAEYGLR
PVHCDVWNGFVFINFDPQPRQSLREFLGPMITGLDGYPFDKLTERYDWVA
HNNSNWKIFADAFQEYYHVPALHSQQVPPEVRDSNAVFTCGHFQLDGPHR
LVSTAGRRRWLMPPEYMYPIERATRSGLVGPWRTPDIGELPAGLNPGGIE
QWGISNFQIFPNLEILIYGGWYLLYRYWPTSHNTHRFEAYTYFHPARSVR
ERIEHEVAAVVLKEFALQDAGMLGGTQAALEYGVVDDFPLNDQEILVRHL
HKVVVDWVDAYRRERETVGV
>MAP0618c hypothetical protein
MLNGAMEKTRSAAFSVPLATASGPSLRDDEYPDKLDAALFRIAGVCGLAC
IMAVLDSTVVAVAQRTFIAQFGVNQAIVSWTIAGYMLAFATVIPITGWAA
DRFGTKRLFMGSVLIFTLGSLLCAVAPNILLLILFRVVQGVGGGMLLPLS
FVILTREAGPKRVGRLMAVGGIPILLGPIGGPILGGWLIGAYGWKWIFLI
NLPIGLTAFALAALLFPKDRSAPSEALDITGALLLSPGVAIFLCGVCSIP
GRHTVADRYVLVPALVGLVLIAAFILHAWYRTEHPLIDLRLFRNPVVTQV
NVTLLVFAAASVGVGLLVPSYFQIVGHETPMQSGLHMLPIGVGAVLTMPL
GGAVMDKHGPGKIVLTGLPLMAVGLAVFTYGVARQAAYSPVLVCGLAIMG
LGIGLTTTPLSAALMQALAPHQVARGTTLISVNQQVGGSIGAALMAVILT
NQFNRNPALMAANEAAGMHPVTGKRGLPVDPSTVPRPAMTPELAGHVSHH
LSHAYTAVFVLAVVLVACTIIPASFLPRKPPSPPAGD
>MAP1937c hypothetical protein
MVSRYSAYRRGLGDDTVSPEVIDRILIGACAAIYLALLGVSVAACVALAD
LGRGFHKAASSPHTTWVLYAVIIVSALIIAGAIPILLRARRISQAEPTAR
AMTAPARPPVRLGAGVARPATERAPHAPATTPDVGWSGEAVDRIWLRGTV
ILTGTMGAALIAVATATYLMAVGHDGSSWVGYGFAGVITAAMPVVEWLHI
RQLRGAVAEQ
>MAP2744c hypothetical protein
MSGGLTPDQAIDAIRGTGGAQPGCRALHAKGTLYRGTFTATRDAVMLSAA
PHLDGSTVPALIRFSNGSGNPKQRDGAPGVRGMAVKFTLPDGSTTDVSAQ
TARLLVSSTPEGFIDLLKAMRPGLTTPLRLATHLLTHPRLLGALPLLREA
NRIPASYATTEYHGLHAFRWIAADGSARFVRYHLVPTAAEEYLSASDARG
KDPDFLTDELAARLQDGPVRFDFRVQIAGPTDSTVDPSSAWQSTQIVTVG
TVTITGPDTEREHGGDIVVFDPMRVTDGIEPSDDPVLRFRTLVYSASVKL
RTGVDRGAQAPPV
>MAP3845 hypothetical protein
MIWATGRRAAPMYLNAERLALANQTVKETFEQCSVAWQAIPHWDTGDPSQ
TTVPNDNVNPPNNFLPLTSLPKPFEVTLAAAIAPTPDELLATVVYYTAKL
AADFDAAVIPGLLTATTPSQLVPGISPAQLLTALIEARAKVEKGGYRAPS
CLITDTIGVETLAASTIANGYAGTDVLLPPANINSLQRVDTLATDPQVRG
WLLGRRQRIAPGAAAEASPGEEAVDLAVSVPPSLEVVGDTSNNAIKLDVR
LSYALRIKDEAGLVVFRA
>MAP1922c hypothetical protein
MLVVSTDQAHSLGDVLGVPVPPSQAELVRVLADLETGRAEAGGGFLDALA
LDTLALLEARWRDVVATLDRRFPDSELSTIAPEELSALPGVQEVLGLHAV
GELARSGRWDRVVVDCASTADALRMLTLPATFGLYVERAWPRHRRLSLTA
EDARSAAVVELLERVSASVEALSALLTDGDLVGAHLVLTPERVVAAEAAR
TLGSLALMGVRVEELIVNQVLLQDDSYEYRNLPEHPAFYWYTERIAEQQS
VLEELDAAIGEVALVLTPHLSGEPIGPKALGALLDAARRRGGAAPPGPLR
PTVDLESGTGLGSIYRMRLALPQLDPSALTLGRVDDDLIISAGGLRRRVR
LASVLRRCTVLDAHLRGSELTVRFRPDPEVWPK
>MAP1620 hypothetical protein
MGLRDHHLSDHYLSDHREDIGLMRSRYAGEPFTTSTAEIAAALEDVSIPT
LLLSLVHITGDPRFIRDFKQMGIFLNEVQGFMSEEDKARARAEALSVITD
YRDRGCPEPEPLSPELIREMMDWAACEHVPDDYLPLICEELDLDGVDPRR
PAALPAERAAGLPVLVVGCGESGILAGVRLKQANIPFTIVEKNAGPGGTW
WENSYPGARVDVANHFYCYSFEPNNDWTHFFAEQYELQDYFTKVIDQHDL
AGQVRWQSEVLAAEWDDGDGTWTVSLRSADGHTETMRARALITAVGQLNR
PNIPAFDGAQTFRGPSFHSAAWDHSVELKGKRVALVGAGASGFQIAPAIA
ADVKRLTVFQRTAQWMFPNPMYHDEVGDGVRWAMRHLPFYGRWYRFLVLW
PGSDKGLDAAEADPNYADQEHAVSDVNAAAHLMFSQWITSQVGEDSELLA
KVMPDYPACGKRTLQDNGSWLRTLQRDNVELVRTPIDKITPHGIVTVDGA
AYDADVIVYATGFRHTDVLWPLKVTGRNGVDLHQMWGSRPYAYLGITVPE
FPNFFIIYGPGTHLAHGGSLIFQSELQMRYIDQCLARLCEPGVHSIEPKP
DAAIDWHRRTPGPDQEDGVVAPGGQALLFQERRRRDSHGEPMASQRVLVR
GARARLVAVHGADERCCATRK
>MAP3280c hypothetical protein
MQNERVQGFGFHTLALLTAVGFAGPLLASAKRFRIPVVIGELIAGLAIGR
TGFGVVDVADPTFQLLANIGFALVMFVVGTHVPLRARLMRSALPAALARA
TLAGGIAAVLGVALAVQFDTGHAALYAVLMASSSAALALPVIDSLRLRGP
RVLSVTTQIAIADAACIVLLPLVIDIRRAPTATLGSLAVAGCAAALFVLL
RAVDRKGWRRRLHAYSEQHRFALELRTSLLVLFALAALAVATQVSIMLAG
FAVGLVVGAVGEPHRLARQLFGITEGFFSPLFFVWLGASLQVRELGAHPQ
LILLGAGLGCGAVLAHCAGRLLGQPLTLAVLSAAQLGVPVAAATIGTQQH
LLAPGEASALMFGALLTIAAASIATGLAARRQGAPEPGEPAK
>MAP0933 hypothetical protein
MTIQTVMIVWSGRLTAKEKRVAPINSPVSGTDEALTRRGLRHALDKTTDL
AERELRVPLHYYRDPKITEIEEAQILRRVPLAIVPSAQLPNTNDYVVRSV
LGDSLLVTRDRSGASHVLLNYCRHRGAMPACGSGNTARFVCPYHAWTYKN
TGELFSVPGKAGFDSMNTKDYGLVELPSEERHGFIWAVLTADATIDLDAH
LGDFGAELALWNYSSYGYHTQREFTSEVSWKGALEAFAEGYHFPYVHGQS
LIGQNTLANTMVYDEFGKHHRIGFPFTWITNAATDPAASLEPLANMGVIY
WVYPNLILANSPVGLEIIDMLPAGAPTRCTVRHSWMARVPAADDEMRAAY
DAVFEGVHAAVRDEDFAMLPQCGEGVRHGQHDHMIIGRNEIAVQHMIRVF
AHELGVALA
>MAP1472c hypothetical protein
MNVINGMPAHALLVHFVLVLVPLTALLDIVCGLWPAARRGQLMLLTVILA
VVTMALTPITIDAGGWLYDQRADPSPILQEHATLGSAMTYFSAALLAVAI
VLALLGLIERRSDKRRLLTRGVVAVLALGIGIASMVQIYRVGDAGAQSVW
GGEIAHLKKAHPG
>MAP2784 hypothetical protein
MHVINGVRDPAASFPLDTATDDAGERRQANRAVAVSAAGLALTGLVELVI
AVVSGSVALLGDALHNLSDVSTSALVFVGFRASRKLPTERYPYGYERAED
LAGIGVALVIWGSAVVAGFESVTKLLRHGGTGHVGWGIAAAVVGVVGNQL
VARYKLVVGRRIRSATMVADAKHSWLDALSSAGAVLGLIGVALGWGWADA
VAGIVVTGFICHVGWEVTADIAHRLLDGVDPDIVTTAEAVAVSVPGVTHA
HARARWTGRTLRVEVEGFLDAATSLSDSDRIGRSVAAPWPRGCRRCRASP
GRHARPENPSRPAGFEVSPGRSARAAAGRSPTTGPTRRRRRHRVTRPRRV
RRRPRRRAPARRGTWPASAAARTATRPDPRGSTAHRRRWFGPAAGRGPRP
GAAQRNTTHRSAIPAR
>MAP3683 hypothetical protein
MAENHTTTNAGAPAPSDELSLTVGPDGPLLLQDSYLVEQMAAFNRERVPE
RQPHAKGAGAYGRFEVTADVSEYTKAAFLQPGAVTEVFARFSSGNSGERG
SADTARDNRGFSVKFYTTEGNFDLVGSDVPVFAIRDPMKFPNLIRAGGRR
ADNDLHDHNMVWDFWTSCPETAHLVTLVMGDRGIPRTFRHMNGFGLHAFS
WVNSAGEIHWVKYHFKTDQGIQWLPQEEGRRLAGTHPDCCVRDLYEAIAR
GKYPSWSLQVQLMPFADAKTYRFNPFDVTKVWPHADYPPIEIGTMTLDRN
VTDHHAEVEQAAFAPSNLVPGTGLSPDRLLLGRSFAYPDAHRARIGVNHD
QLPVNAARCAVRSYAKDGRMRFVNTADPVYAPNSAGGPQADPARAGEVHW
AADGQMLRAAYTLRRDDDDWGQAGTLVRRVMDDSQRERLVHNIVGHVSAG
VNEPVLSRVFAYWRHVDADIGRKVEEGVRANLNS
>MAP2066 hypothetical protein
MSSAGENTRRAEPVEPRGAAVLDSAHLGDIEGAFGRIRVGETEHARTWKT
RLLTLLAIVGPGIIVMVGDNDAGGVATYAQAGQNYGYSLLWVLLLLVPVL
IVNQEMVVRLGAVTGVGHARLINERFGRGWGWFSVGDLFLLNFLTIVTEF
IGISLAAEYIGVSKYVVVPVSAAALVAIMASGSFRRWERAMFIFIAITLL
QIPMLLMSHPQWGRAAKSFVVPSISGGVSSDAVLLIIAIVGTTVAPWQLF
FQQSNVVDKRITPRFMGYERADTVLGAFVVVIGAAALVMTGDWAARSTDT
VGGFTDAGATAHLLGQHRQVLGSIFAIVLMDASIIGAAAVTLATSYAFGD
VFGLKHSLHRGFADAKQFYLSYTAMVVVAAAIVLIPGAPLGLITTAVQAL
AGLLLPSASVFLLLLCNDREVLGPWVNRAWLNWVAGLIVGTLLLLSGILM
ATTLFPDLNVVAVAGYLTLALIILAAGAAPVLRWLARRQPARPGPRLPAR
GVDRSTWRMPPLALLEPVTWSPGTRLAMIALRGYLVVGALLLVVKAIQLS
R
>MAP1015 hypothetical protein
MMSRIDVVLRSARRRFRRLPAVEVPDAVRRGAVLVDIRPQAQRVREGEVP
GALVIERNVLEWRCDPTSEARLPEAVGDDVEWVIICSEGYTSSLAAAALL
DIGLHRATDVIGGYHALAGAGVLSRLAGGPVGAPLANTGAGERRWI
>MAP1088 hypothetical protein
MVVLQVFSVQLGLISLFPDGSVTSLLVAALVLAVPVSAPIAQVLGKNIDA
TLALPHVNTARAKGGTPGWVIRKHVVKNAAGPALTVTATTVGALLGGSVV
TETVFSRSGVGAVLLQAVSSQDISLIQGLVLLTAVAIVTANLAVDLIHPL
LDPRVTRVQRRGFTTRLGRFG
>MAP1087 hypothetical protein
MLGYVLARIGQSAIVLLAVFSLVFWGVSILPADPAAIFVAKGEGYFNPDI
VAQVKAFYGYDRPLWVQYFAQLNQVLHGHFGFSLSSGQAVTDRIGGVIGE
TLKLAATATGFAVLFAVSVTALATTCAPVRSVLRAIPPLFGAVPTF
>MAP3098c hypothetical protein
MTDVVTHTETAPTPAAKAAKPVHTRAVIIGTGFSGLGMAIALQKQGVGFV
ILEKADDIGGTWRDNSYPGCACDIPSHLYSFSFEPKPDWKNPFSYQPEIW
DYLKGVTEKYGLRRYIEFNSLVDRAHWDDDEHRWHVFTTDGREYVAQFLI
SGAGALHIPSLPDIEGRDEFAGPAFHSAEWDHTVDLTGKRVAVIGTGASA
IQIVPEIVGQVAELQLYQRTPPWVVPRSNPEIPPAVRAAMENVPGLRALV
RLAIYWGQEALAFGMTKRPNLLKVIEAYAKYNIRRSVKDKELRRKLTPHY
RIGCKRILNSSTYYGAVADPKTELVTDHIARITPDGIVTADGTHRPVDVI
VYATGFHVTDSYTYVQIKGLHGEDLVDRWNREGIGAHRGITVADVPNLFF
LLGPNTGLGHNSVVFMIESQIRYVADAIATCDKLGAQALAPTRAAQDRFN
DELQRRLGPSVWNSGGCSSWYLDEHGKNTVLWGGYTWEYWRATRAVKPQE
YQFYGIGSRPGV
>MAP0782 hypothetical protein
MFVCLCNGVTSQTVTEAVQCGASTTNEVARACGAGADCGRCRRTVQAILR
SSSGNRTPNSI
>MAP3727 hypothetical protein
MDVSSAAIAAEQLSFGYPGDGQRLIAVSLAVQPGQVCCLLGPNGAGKTTL
LRCLLGLLTPQSGTVRVAGDPIDRLSRRQLARRVAYVPQRSNTPFPFSTL
DIAVTGRTPYLRAMTSPSATDRRAAAAVLDRLGIGALADRPYAVLSGGER
RLALLSRAMVQDAPVLILDEPMAALDFGNESRILQVVAELAAAGRAVLMT
THQPWHALHSGDQAVLIADGRLIADGPVEQVVTAAALSELYGVPVRVLTA
TDDATGRPVYACAPVAAGDDR
>MAP0815c hypothetical protein
MSGRRQGDPGRVAAKPGRRPGNSAAAPHPGAANYPAGDTGDRRTRRPPPM
PSANRYLPPLGHQPQPDRGAAAPPRGPVAGERITVTRAAALRSREMGSRM
YWMVQRAATADGADKSGLTALTWPVVANFAVDAAMAVALANTLFFAAATG
ESKGRVALYLLITIAPFAVIAPLIGPALDRLQHGRRVALAASFVLRTALA
AVLIMNYDGASGSYPSMVLYPCALAMMVLSKSFSVLRSAVTPRVMPPSID
LVRVNSRLTMFGLLGGTIVGGAIAGGVEFVCTHLFKLPGALFVVVAVTVA
GASLSMRIPRWVEVTAGEVPATLSYRRDSEPLRRRWPEEVKNVPKKATAT
LRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHQANGWAQLAMLGM
IGAAAGVGNFVGNFTSARLKLGRPAVLVVRCTVAVTAVALAASVAGNLML
AVIATLVTSGASAIAKASLDAALQDDLPEESRASGFGRSESTLQLAWVLG
GALGVLVYTELWVGFTAVTALLILGLAQTLVSFRGNSLIPGLGGNRPIMV
EQEGARRGVGSPAVVAE
>MAP2131c hypothetical protein
MKKISGSPCIATLLMGLPVLAMTACSSPQHASTQPGTTPPVKSAAPTSSG
ATTTPAPGGGALTAELKTPDGRSVATATFDFTGGYVTVTVKTVANGVLTP
GLHGLHVHEIGKCEPNSVAPTGGAPGNFLSAGGHYQAPGHTGKPESGDLA
TLQVRQDGAAYLVTTTDAFTRDELLAGNKTALMLHGAEDTENAMDRVACG
VIGTG
>MAP3704c hypothetical protein
MPRREEPSHGLLDPVAKMLRLPFGTPEFIDRIVTGGVNQVGRRTLRMLIT
TWDAAGGGPFAASAIASTGMAKTAEIVQGMFIGPVFGPLLRILGADKVAV
RASLCASQLVGLGIMRYGIRSEPLHSMSVDAIVDAIGPTMQRYLVGDITR
>MAP2257 hypothetical protein
MAVFGSVARRSGSGGRAASVVTCQPSERKIMSATTIDRTTGRDGLLRLAM
RADAAISGLVGLAGIPLVGWLAEVSGTTTAFEYGMSAFLIGYGVLVFGLA
ALPSVRRAGMAVIIGNLLYTAAAVVLVLADVFPLTSTGVVLNLAAGVYTL
VFAELQYFGWRRARA
>MAP0395c hypothetical protein
MVATTSSGGAAVGWPARLTKARLHFVTGKGGTGKSTIAAALALTLAAGGR
KVLLVEVEGRQGIAQLFDVPPLPYQEVKIATAERGGQVNALAIDIEAAFL
EYLDMFYNLGIAGRAMRRIGAIEFATTIAPGLRDVLLTGKIKETVIRVDK
NRLPVYDAIVVDAPPTGRIARFLDVTKAVSDLAKGGPVHSQADGVVRLLH
SEQTAIHLVTLLEALPVQETLEAIEELAEMQLPIGSVIVNRNIPAYLQPA
DLAKAAEGDIDADAVRAGLQKAGITLDDKDFAGLLTETIEHATVIATRAE
IAQQLDALHVARLELPAISDGVDLGSLYELSESLAQQGVR
>MAP1761c hypothetical protein
MVRRIAGATCRSRESAWPAAVLVATTMLSVTACGHSGDNANHAAQSKPGG
GNAVKITLTNSAGKDGCALDTTNVPAGPVTFTVANTNAPGISEVELLRDQ
RIVGEKENLAPGLDPVSFTLTLDGGSYQLYCPGASTEYQTLTVTGKAPAT
PTGTIATVLSQGTKDYAAYIVNQIGQLNDGAKALDAAVQAGNLDAAKAAY
AKARLYWERSESTVEGFVLPGFAVGDNAGNLDYLIDMRESTPVDGKVGWK
GFHAIERDLWQAGAITPGTKALSTELVGNVGKLHGIVATLQYKPEDLANG
ASDLIEEIQNTKITGEEEAFSHIDLVDFSGNVEGAQQAYASLRPGLEKID
NNLVHQIDQQFQNVLATLDGYRDPGALGGYRTYTPALKASDAPKLTAVIQ
PLHQSLSTVAQKVVSAG
>MAP4315 hypothetical protein
MAAAAGNPARTRDEDAIGTPPPGPTLVPAERYYSPAFAALEVERMWPRVW
QLACMVDHVAAPGDYFEYRCGPYGVLIVRGDDGALRAFQNVCRHRGNSLC
SGSGSGLRELKCGYHGWTWDLAGALKRVPDRKGFGTLRLSDYPLIPARVD
TWAGLVFVNLDPDAMPLPEYLEAIPEDTAWCRLDEFRCYATLTVEVDANW
KTVADGYSETYHIQTLHPELLRCVDDIHAPQQIWGHTGKSDQPYGVPSPR
FEGALSDEEVWDAYVSTQGALMGAAEGTPFPAADHRPGQTVADLIADRTR
AFAASRGVDLGWADTDRITRLHQYNVFPNMTFLTNADHLTVMCSRPAPDP
AAAPDKGELVMFLTTRMPPGAPRTKPTDVRMSAGEAEPGLVLTQDIAVLA
GLQRGMHQPGFTHLVLSSEERRVINMHRNLERYLDLPAAQRMSGGAGT
>MAP0337 hypothetical protein
MPSSRPMPTSTDRMLGDRPNLCRRPCLAASRRVRQDAAGGHRSVKICGIG
VKASLRTRHRPAPTGTVTGMTVVEFTGGAAPRGALPSRSTLPGPGSVRVT
AAYAAALLVVYLVLAALGPHARQVAVSRMSTNVHNLGRGQLGTLIGSAFV
DDGGELFFWLPGLVCLLALGELLWRGKGLLVTFAVGHIGATMIVAVGLVA
AIESGMLPASVARASDVGISYGAMCVLGAITAAMPVRWRGVWAGWWLGTA
VVATVGADFTAVGHVVALLLGIGLSFRLRSTASWTPVHLALLCVGATFGY
LLLAGAASMAPIGGLAGAFIGVLARRPLGAC
>MAP1762c hypothetical protein
MQGVFGVFIGTFLIGLREGLEASLIVSIVAAFLKRNGQTLRPMFAGVAVA
VLLSVAVGVGLDLLATSLPQAQQEMMETVINAVAVVFVTSMIIWMNRNAA
QLKGELEREARQAVHRGGALALGAMAFLAVLKEGFETSVFLLAAAETSHG
SRWFAVLGGVLGIATSIGLGVGLYFGGLRLNLGRFFRVTGVFLVLIAAGL
VLGALRTAHEAGWLNIGQRQLFDLSGWIPSDSVLGAVTTGVLGIPADPRL
VEVLGWLLYAVPVLVVFLRPARLAATPRARGRLLATAATLLLAIAAVLAV
AAPARDTVDAARTRTVTDRAGHAAAVSMATGPHGRELTVTPAGTSTVHHI
QLVPADDQSVDGLPVQAWQASETAGVDGAPEITLDQLRDMTGGRLPVGLA
AARTPGPFQGQWSTTTVYTVLTRGDAVISAKAASNRTAVLTGGGLTGAKT
VSLGGLATDWSTSAAEDHATAAAIAAGDRNRGEGQLWNVWLPLVIAGFAV
ACALSALASVRVDRKREDERKAIDGEAHRRGNVPVS
>MAP2336 hypothetical protein
MQHDQLIDLTRRALKLARDRTTDLAPTAHVVDARDYTCVQRHQQDRAMLL
ASPQLVGYVSELPAPGTYCTKTVMGRSVLLTRTSDGSVKAFNNVCLHRQS
QVATGCGTASRFSCPYHAWTYDNTGRLVGLPGREGFPDVALRSAGLTELP
ATEFAGFLWVSLDPGATLDVATHLGPLADELDSWGIGRWSPLGEKVLDCA
INWKLAIDTFAENYHFATVHRQTFATIARSNCTVFDSYGPHHRLIFPLNT
ILELDDIPEDQWNPFHNMVVIYALFPNIVLSVTIANGELFRVYPGDRPGR
SITVHQNATPQDLSDESVAAGVQAVFDYAHATVRDEDYRLVESLQANLES
GARDHLVFGRNEPGLQHRHITWAKALAASTG
>MAP3872 hypothetical protein
MSTERAHSYAGDVSPLEAWKLLSDNPNAVLVDVRTDAEWRFVGVPDLSSL
GREVVFLEWNTSDGRHNPDFADQLRRQIEPAPAGQERPVLFLCRSGNRSI
GAAEVATQLGITPAYNVLDGFEGHLDANGHRGETGWRAIGLPWKQG
>MAP0044c hypothetical protein
MIPTRMQSSAPVEIWRSVRALPDFWRLLQVRVASQFGDGLFQAALAGALL
FNPDRAADPLAIARAFTVLFLPYSLLGPFAGALMDRWDRRLVLVGANVGR
LVLIAAIGTILAVRAGDLPLLLGALFANGLARFVGSGLSASLPHVVPREQ
VVTMNAVATAAGVVAAFLGANFMLVPRFLFGAGDRGAAAIVFLTVVPVSI
ALLLSWRFAPRALGPDDTWRAIHGPVLYAVITGWLHGARTVAQRPTVAAT
LSGLAAHRMVVGINSLLVLLLVHHLPGLEGGGFGTALLFFGAAGLGAFLA
NVLTPPAIRRWGRYASANGALAASAIVEVAGAELLLPVMVVCGFLLGVTG
QMVKLCADSAMQMDVDDALRGHVFAVQDALFWVSFIVAITVAGMVIPDDG
HAPVFALFGSVLYLVGLAVHGIVGRRGE
>MAP2065 hypothetical protein
MGTSEDKAPGQPVIHLSQLLRAPVLARSGETVGRVEDVIVRLRGADEYPL
VTGIVAGVGGRRVFVGDKSIHEYSADRVLLTKNKIDLRGFERREGEVLLR
TDVLGHRLIDVATVELVRAYDIELEQTTAGWMVARLDTRRPPRLFGLIKH
SGGHASRDWKAFEPLIGHARSDAVRRLSDRFGELKAAEIADLLEEADKAE
GGEILDRVHSDPELEADVFEELDPEKASRLLDEMPDDEVAALLGRMRADD
AADAIVDLRQSRRRRVLDLMPAPQRTKVITLMGFNPESAGGLMNVDSVSC
AASATAAEALALIASSHSIQPEALIKVHVLDEDRRLDGVVWVITLLQVDP
SETLERLMDSDPVRVNADADLTDIALLMADFNLYSIPVVDEQDHLLGVVT
VDDVLEATIPEDWRRREPAPRPIREITTAEDRPLPGGNAP
>MAP3028 hypothetical protein
MGTNQRASIVMSDDEIADFVVKSRTGTLATIGRDGQPHLTAMWYAVVDGE
IWLETKAKSQKAINLKRDPRVSFLIEDGDTYDTLRGVSFEGVAELVDDPD
VAHRVGVSVFERYTGPYTDEMKPFVEQMMNKRVCVRIVARRARSWDHRKL
GLPPMPVGGSTAPAVLGTDR
>MAP1596 hypothetical protein
MTTPRAAVNAPARADTGSGGERISPQRRNLIFVAIVLGMLLAALDQTIVA
TALPTIVANLGDAGHQSWVVTSYLLASTIVTALVGKLGDLYGRKRVFQAA
VLFFVAGSVLCGLAQSMAMLVGARALQGIGGGGITVTASALIGEVVPLRE
RGRYQGILGAVFGVTTVIGPLLGGYFTDYLSWRWAFWVNVPVSVIVIFVA
AAAIPALAASAKPVIDYAGIVFVGLGAAGLTLATSWGGSRYPWGSPTITG
LFAAAAVALGVFVVVERRAAEPILPVRLFASPVFTVCCVLSFVVGFAMLG
AMTFLPTYMQYVDGVSATTSGLRTLPMVVGMLFTSTGSGTIVGRTGRYKI
FPVAGTALMALAFLLMSRMQPSTPAVIQSLYLFILGAGIGLSMQVLILIV
QNTSDFEDLGVATSGVRFFRTIGSSFGAAIFGSLFVNFLNRRIGPALAAS
GAPPGAVSSPGALHRQPHEVAAPIVAAYAESLTEVFFWAAPVALVGFVLA
LFLREIPLRDIHDSTVDLGDAFGMPTTETPDQMLENAIARMLRGETGMRL
RSIAMRPDCRLDVAGLWGVLRINRYTQMYGAARLTDMAEYLRIPFEVLEP
TFSRLVTAGYAGSDGDRLWLTPAGAQQVGYVHSLLLAWLVDKLGRSPGFE
GRPDRQAVQAALERVAYRVLAQRDWHDEQPTAAITAAAR
>MAP2479 hypothetical protein
MRDDSPRMSPRRHIIVSGDDVLATTIAEELNRAGATIVKLPSEELAGADL
ARASAIVCAGRDDAKNLEIALLARKTNPHVRVVARLGNDVLRGAVAADNG
PGAILDVADLAAPSVVEACLSSSTHPVRAAGIDFLVSGAEAPRDATLREI
YGDLAPVAVIHGNDGATPGEVVPCPGRDHRVRAGDWTAMIGSADELAARG
IRTPRPPATRSRQTWLRRVLDAARAMRDDVNPMLFPAILLALTLLLVSTV
IVHFSYTKPRLSWLDALYFTAETITTVGYGEFTFAHQSAWLRIFAVGLMF
AGVTTTALLVAFLADLLLSRRFLQSAGLRRARHLRNHIIVVGLGSFGSRV
VADLTAAGYDVAVIERDENNRFLSTAAELDVPVIFGDATLRQTLESARVD
RARAVAVLTQDDMVNIEIGIVLREMLGPRVMPEVNRPDVPIVLRIYDRTL
GDAVAKRFGFENVRSTVDLAAPWFIGAAMGLQVLGTFSVGQRSFMVGAMH
VAAGSELDGLRMFEMSTQTRVIAITRRDTPVELHPRRDAWLRAGDTVYLV
GPYRELLETLRKGQPPQQPTNEERPADKATT
>MAP3992 hypothetical protein
MTSTNGPSARDSAGKARDAGSGDGQQGRTQFLTVAEVAALMRVSKMTVYR
LVHNGELPAVRVGRSFRVHAKAVHDMLETSYFDAG
>MAP3728 hypothetical protein
MIRAVLAAVCVATSAAGCGAAHHSPAEPTRTVVDMTGQHVQIPATVTRVA
TNIPLIPATIELLGGIDTVVAAARGSFNALFTTIAPATQQIPRSPPTSLN
AEQLLDLHPQVFFMTDLTPGLLPMLQRLQIPVVQITAFTSPQDLQKAVNL
VAQVLGGAAPARARQYDTYFDAVIQQVHAGAQTDRPTVYYAPGPDPTTTV
GADNIITASIEAAGGRNIAVEHGIGGHQPGAFAFPTITAETLLAWNPDVI
VASNARVADQLATDPTFATLNAVRDHHIYTCPVGIFPWCASSSEAALAPL
FLAKKLDPERFSDLNLANKVANFYIQFYGYSLTGPQVTAILDGAG
>MAP3931 hypothetical protein
MPRGHLTEPVTDSSPPAKGTFTVDMLSRAKRGVTAVFVAHGLLFASWAAH
IPQVKAGLGLDDAALGTALFGAPLGSVLATLAGHWALPRWGSHRLIPVTV
AGYAAAGTTVGLARSGPALFAALALWGMFQGTLDVAMNTQAGTVERRAGA
PMMARFHGMWSLGTLAGALIGAACVGAGIGLTAQLTVLGAVVLLVVVMLT
RRLLPDAADSVAAPPEPAAGRRMTPAVAILAAVSFASFLCEGAATDWSAT
YLRDVVGAGPSVAAASYAAYTLTMVVTRFGAARLHARLPSRRLLPALAVL
AVAGMSVALATADAAAGVLGFAALGVGVALLVPTAFSAAYGARGAGSAIA
IVAATGWLGYLLGPPLIGHLSEWVGLSGALVTIPVMMTVVAVAIRYTPAF
DTADEFHRAPAG
>MAP2599c hypothetical protein
MSNNITWHEHKISRGEREQLNGHKGCVIWFTGLSGSGKSTVANVVEQKLY
ERGIRSYLLDGDNVRYGLNAGPDLLEERHGPEFAQRFGLGFSAQDREENI
RRIGAVAKLFCEAGIIALTAFISPYVRDRDAIRATLDDGDFQEIFIDTPI
EICEKRDPKGLYKKARAGEIKGFTGIDDPYEAPPRPELRLDGAAKDAETL
AEEVIAHLERVGVIATDGLAHTRGRDEVTA
>MAP3865c hypothetical protein
MGAGHNHTPAETGDARLIPRMVMAAAILAAFFVVELVTSLLINSIALLAD
AGHMLTDVVAVFMGLAAVTLARRGSSSPARTYGWHRAEVFTAVANAGLLI
GVSVFILYEAIQRLREAPAVPGVPMIAVALAGLAANFVVALLLRSHSSGS
LAVKGAYLEVIADTVGSLGVLIAGVVTVTTRWPYADVVVAVLVALWVLPR
AISLARDALRILSESSPTHIDVEELRAALGAVDGVTGVHDLHVWTLSPGK
DMCTAHLISTGDSARVLRDARAVLSARGLAHATVQIDCPDDTECSDSF
>MAP1418c hypothetical protein
MWGSVLGLGMLAALNPVRLGLALLMISRPRPGSSLLAYWIGGLTVCVPEL
LIPVLLLNFTPMFGHPSHASPSTGLALGKIQIGLGVVGLSIAAVLTVRFA
ARQRAAAPPPDDRTSELSAAPGATIAMPRLLTRAQDVSPDDRSVLRRLLG
RMHSAWESGASWVAWVIGVISVPVDGVLFIVAIIAASGASVTAQVSASVA
FVVLMYAVVEVILVGYLATPGKTQSLLLVLHDWVRTYHRQILVALFTVVG
VSQLAQGLHLV
>MAP2098c hypothetical protein
MSPWLHSFAKRGLSAGACRAKHLLQRTLPRPAGRIAQVTLLRFRPAGPGA
LRPNELAQAAVMGALCAAIEILAAVIPFAQGLGVLGTVPMGLLAYRYRPR
ALMAATVAGGVIAFLIAGLGSLFMLVDCAWVGGLCGIVKRKGRGTPTVAL
LSLIAGVLWGAGWVAVLAVLTRLRHLFFDVITANANGVAAFLNWMHLQGV
GAGLKRYVADGLQHWPLLIFPYMILLVVVVSFISWSALSRLLDRMRKIPD
VHKLDPPDDGHAAIGPVPVRLENVRFRYPGAEQDALREVSLDVQAGEHVA
VTGANGSGKTTLMLILAGRQPTSGTVHRPGAVGLGEVGGTAIVLQHPKSQ
VLGTRVADDVVWGLPPGTDIDVHRLLREVGLDGLAERDTGSLSGGELQRL
ALAAALARDPKLLIADEVTTMVDQQGRDALLGILSGLAKRHQTALVHITH
YDNEAASADRVIKLSDSPDNAVAAETNAAAAPAVAVQHGSGVPVLELIDV
SHEYASGTPWSKVALRDVSFVVEQGDGLLIHGGNGSGKSTLAWIMAGLTT
PTSGSCLIDGRPTHERVGEVALSFQSARLQLMRDHVDTEVASAAGFSPTD
QDRVAEALMSVGLDPAMGKRRIDQLSGGQMRRVVLAGLLARSPRALVLDE
PLAGLDIGSQRGLLRLLENLRRERGLTVVVISHDTVGLEELCPRSLYLRD
GALQTASTAAGGMP
>MAP2882c hypothetical protein
MIWRRGGLIFVPEGGQLRWEGTMAKPPLSMKPTGWFQVAWSDEIGVGDVH
KMKYFDQEMVAWRAESGQLTVMNAYCEHLGAHLGYGGKVVGEVLQCPFHG
WQWSAEGRNVCIPYQDRPNRGRRMRTYPVVERNASVYIWHDLQRREPYFD
PPDVFAAFGDGSSADDYYPQQRLYRQALEMHPQYVLENGVDFAHFKFVHN
TPIVPVFTRHDFAEPVSYVDFTITFEGDDGQKIEDVNSGVQAINGGLGIA
VTKSWGMIDNRTISAITPVDERTSDVRFMVYIGRTPAKDAARAQRKAAEF
GDEVIRQFTQDIEIWQHQRYSDPPALAADEYQGFMAIRQWAKQFYPEFAA
QEA
>MAP1632c hypothetical protein
MPQRLGLAGLCLGTALIIMEANVLNVAIPSIRQALHASPAQSLWIIDAYT
LVLAALLLSAGRLGDRIGARRCYLLGLAVFSIASVLCALAASSAELIAAR
TIQGVGAAVLIPAPLGLISAMFSDLTARAKAVAVWVTIGGVGFAAGPLIG
GLLVSTFGWRSIFLINIPAAAIIAVMVRLTVAEASRSPLPFDYVGQALAI
VGLSAVVFACVESSALAWMSPFVLLPAVAAALILGLFVIDQRHRGRAGAW
VLLPVELLNNRPVNAGLMSGFVYNFTLYGLVLVYSYVFQSARGYSPVQTG
LAFAPLTVAALVTSLPAGRFVAAHGARRGIMIGMALSAIGLCALAFDAQR
MPFVVLSIAFGIFATGLSLSATGQTMAVMANASDQYKNTASSMLNTARQT
GGVIGVAALGAITSRDLLASAPVALTIAAAACLVAALGVATLIARHARTH
DSDQH
>MAP2377 hypothetical protein
MKVPFTWKVTGWFMVGWSPEFPIGEVRPLHYFGEDLVAYRDESGELHILE
AHCKHLGAHLGHGGTVVGDCVQCPFHGWRWGPDGTNRYIPYQPDRPNRGL
RLKVYPVREQYDCVFVWHQPHGKEPQWEMPDIFGKFPQFETDPAAYYRAY
PEFSRRAEREPVHPQIVAENAPDSAHFEYVHHATVTPRVLDWKIVEQEWQ
FVAGWPDANSDDPAALALRFHSHLFGLGGAISVFEGAQNHRLIFTCTPVD
DECSDLFYSIWWPRVPGDTADVPEGKLREVIEKQFLSTVFDDLQIWRYQK
YVEHPPLSKVDAKGYMALRKWATQFYELPPAGTSSPA
>MAP0144 hypothetical protein
MVSTNPAPARVTLRHVDRTFGTHTVLRDVDVEIEPGAVVALLGASGSGKS
TLLRLVAGLDRPSGGRIEIDGKAVRGIDPRCAVVFQEPRLLPWRSLAANV
AFGLPRGIERSERLAAVQRWLDVVGLREFAGHYPRQVSGGMAQRAGLARA
LARQPSVLLLDEPLAALDALTRLRMQDLLDAVQQRAGTTTILVTHDVEEA
VLLADRVLILRGEEGGAATHDVAIPKPRDRGDPRIAALREQLLEEVGVPR
RGYAVEQEAKTS
>MAP0394c hypothetical protein
MSTTPKQLDMAAILADTTNRVVVCCGAGGVGKTTTAAAIALRAAEYGRNV
CVLTIDPAKRLAQALGVNDLGNTPQRVPLAAEVPGELHAMMLDMRRTFDE
MVVQYSGPGRAQAILDNQFYQTVASSLAGTQEYMAMEKLGQLLAEDRWDL
VVVDTPPSRNALDFLDAPKRLGSFMDSRLWRLLLAPGRGIGRLVTGAMGL
AMKAMSTILGSQMLADAAAFVQSLDATFGGFREKADRTYALLKRRGTQFV
VVSAAEPDALREASFFVDRLSQEGMPLAGLVLNRTHPPLCSLPAERAIDG
TEMLEHDGDPETTSLAAAVLRIHADRAQTAKREIRLLSRFTGANPHVPVI
GVPSLPFDVSDLEALRALADQITSNQATAR
>MAP4062c hypothetical protein
MTVTPQEAADHGAAEADYFDVLIVGAGISGIDAAYRITERNPQLSYAILE
RRARIGGTWDLFRYPGVRSDSSIFTLSFPFEPWTRKEGVADGVHIREYLT
ATAHKYGIDRHIRFNSYVRSADWDSTSDTWTVTVEDGARDGERKLYRARF
LFFGSGYYNYDEGYTPDFPGIEEFTGTVVHPQHWPEDLDYTGKKVVVIGS
GATAVTLLPSLSDRAAKVTMLQRSPTYLISASKYGKVAAVARKVLPRKPA
HLVIRMYSALTEAVFFALSRKAPRLVRWLLRRKAINSLPPGYAVDIHFKP
RYNPWDQRMCLIPDADLYNAITAGRADVVTDHIDHFDATGIALRSGAHLD
ADIIITATGLQLQALGGATISLDGNEIKTNDRFVYKAHMLEDVPNLFWCV
GYTNASWTLRADITARATAKLLEHMTTHGYTHAYPHRGNEPMTEKPSWDI
NAGYVLRSVHALPKSGTKRPWNVRQNYLADAIDYRFDRIEEAMVFGRAAD
RAALAG
>MAP0133c hypothetical protein
MSPEVAERLRPGPVAARSPLLAHPRWFVPGKFRVSHQSMGIRRRRRKWAR
KEIRVADPAMRQTIMGTAIGNFMEWYDFGVYGYIATTLAEVFYPGKSVSG
LHLIATFSTLAAAFVVRPLGGFIFGPLGDRIGRHRVLVVTILMMTVSTTT
TGLLPTYSSIGIWAPILLVIARIFQGLSTGGEYVGAMTYLVEQAPDHKRG
MMVGFLPMGNLVGFVLAGMLVTGLQTWLPDQDMLSYGWRIPLLLGLPFGL
VALYLRLRLEESSAYQSANDSPHTPGGQGRQQIRRTVAQQWRPMLICAAL
VLTSQVADFMLTGYLPTYLRLFVRVGHTAGLVMIVTTLAILMATVVAVAS
LSDRIGVKPIMWTGCALLIGASVPAFLLIRFGGVYPVIFIGVLLIGLMEL
CFDSTGPAMLPALFPTNVRYGALAISYNISISLVGGVTPLIAQALVSATG
NVMVPAYMLIFGGAVGAVTLLFTPEVAGKPLPGSGPAVETEREARALADD
VR
>MAP3733c hypothetical protein
MTATSSTTQSSRRIDVRMSARDLINIGVFGALYIATVFAINVFAFINPLV
MLVALAVSMIAGGVPFMLFLTRVRHAGMVTVFAIITAGLLALTGHPPICF
VITVACALVAEVVLWLGRYRSRTMGVLAYAIYAAWYIGPLLPIFYARDEY
FSSPGMAQMGPRYLEEMERLLSPAVLIAFDLSTVVFGLIGGLLGVRLLRK
HFQRAGLA
>MAP3132 hypothetical protein
MTAPGAARPQKRAGSLRPGELAQASVMGALCAAIAIIAVVLPHGGGLGLL
GSVPTGLLAYRYRIRVLITATVAAGVIGFLVVGLSGLAAIALCAYTGGLA
GIVKRHRRGTPTVLAVSLVAAGVVGAGMVIALTVLTRLRQLAFHAIGAAV
DGAASVVARVPPLHAAAVRFAEFFAAALQHWQWMVLGYALVAIVGASLVG
WWALSRVLERLRGIPDVHKLDAPARNGPTRPVPVRLDRVRLRYPHADRDA
LRAVSLDVQAGEHVAVTGANGAGKTTLMLVLAGREPTSGTIERPGSVGLG
ELGGTAVVMQHPESQVLGTRVADDVVWGLPPGTTTDVGRLLGEVGLAGLA
DRDTGSLSGGELQRLAVAAALAREPALLIADEVTSMVDRQGREQLLTVLS
GLTERHRTALVHITHYNDEAEYADRTINLGDTQGDTALIRTATAPAPTCP
AGRGRRAPVLELAGVGHEYASGTPWSRTALRDVSFTVHEGDGLLIHGGNG
SGKSTLAWIMAGLTVPTTGTCLLDGRPAAEQVGAVALQFQAARLQLMRSR
VDLEVASAAGFSSDDRDRVSAALAAVGLDAGLAERRIDQLSGGQMRRVVL
AGLLARSPRVLILDEPLAGLDAASQRGLVELLAERRRETGLTVVVISHDF
AGLEQLCPRILHLRDGSLDANPVAARPDPVPVAPPTKRPAARRRPVVLLR
PVPGSSPIHELWAGTKLLVVFAMSLLLTVFPGWVAVGLATALAAAGLRLA
HIPRGVLPSVPRWLWIFLGVVGVTAALAGGAPTIRLGTASLGLGGLLDFL
RATALTVVLLGLGALVSWTTNVAQIAPAVATLGRPLRVLRIPVDDWSVAL
ALALRTFPMLIDEFRVLYAARRLRPRRPAQTRWARLRRPATDLIDVVVAV
ITVTLRRADEMGDAITARGGTGQISAAPSRPKRNDWIALSIASAVCAAAV
AAELALLAGH
>MAP3145c hypothetical protein
MGERAPRPGPLPREVWILSWANVMVALGYGVISPALPTFARSFGVSIKAV
TFLVTVFSLSRLCFAPISGLLTERLGERRIYIGGLLIVAVSTAACAFSQA
YWQLMLYRVFSGVGSTMFYVSALGLMIHISPADARGRIAGLFTTSFMVGA
VGGPAVGGLAAGWGLTAPFVVYGVAMLGVALVLFLGLRNSALAAPRPPTR
STVTMREALRVRAYRSALLSNFATGWSAFGLRMALVPLFVSDVIGRGIGT
IGVVLAAFAGGNALAVVPSGYLSDRMGRRTLLIVGLVTSGAATVWLGFVA
SLPVFLVAAGVVGVVTGIYMSPLQAAVADILGNEARAGLPVATVQMMSDL
GAIVGSMAVGWAAEQIGYGWGFFISGVVLLIAAVGWVMAPETRTATELEA
DLMAAESDVEPV
>MAP3227c hypothetical protein
MRQSPSASARRSHHGDGKHSPAPTGSLPCNHVTLTSPAAASAGDLPARPG
LRTVAAGSMIGTTIEWYDFYLYATASALVFKPLFFPNISPSAGTLASFAT
YAAGFGARPLGAVLSGHFGDRLGRKTVLVAALLVMGLVTTAIGALPTYAE
AGLAAPALLASLRVVQGLAVGAEWGGAAVLSVEHAPPGRRGLFGSFTQLG
SPAGMLLATSVFFGVRKATGPAAFLGFGWRIPFLLSIFLVAVGLFVRLRL
TDAEVFDRLRSRDELARLPIVQVLRTDARNVVITTGLRLSQIGLFVLLTT
YSLSYLQDSFGKGSGVGLVAVLISSALGFLSTPGWALLSDRVGRRPPYLF
GALVSVVALVLFFVAAGTGSAVLVVVAIVFGVNVVHDAMYGPQAAWFAEL
FDTRVRYSGSSLGYHIGAVLSGGFAPLIAASLLVAGGGRPWLIVGYFAVL
AAITVGAACAARETRGEPIG
>MAP2105 hypothetical protein
MARECDGPDRVMKKNCPLQCPRRYGGVAENAAGDQVDWAGLPVNLDFAFS
RDQRDKVYAQHLKRRHGTQFGTWRRGGQVCVCELASESISSG
>MAP1039 hypothetical protein
MRRCRVEGRGGGAAGGSARSGARAVPSPGGCGAAPTRVAGRRTSPVTAAS
PASMATGGGGFDPAAWVTSGNRRVPMSSRKYAGIQRGDVMKYAEDGHTRG
LSMPRSRGAVSGLLLVILGAWGALIPFVGPHFNFAYTPDRDWAWSSARGW
LEVAPGAATALGGLLLIVAGNRVAAMLGGWLAVLAGAWFVVGGQLAPLLG
IGSAGDPIAATERKRALLEVTYFSGLGALIIFVGGVVLARTSARLARDVQ
PLASDAPAAPAVEPYRDPAYDPADVSSGALTKPRTSADPEPKRGWRKNRA
GGNAAYLRWPHPQQ
>MAP0489c hypothetical protein
MGTALTRDATRVGRTTRWLAAVLAVTGWAALTGCTGPAHPHATAVVASTD
VWGSVARAVAGGHVAVASILSGSDQDPHSYEASPSDAAAIADAGLVVFNG
GGYDGWVDDVLAHHPGVARVDAYALLPDDGRPRNEHVFYHLGVAKAVAAA
VADRLAAIDPGNAADYRRNAAAFGRDADAIAGIEHTIAAAHPGGSVVATE
PVAFYLLEASGLVNRTPPALEAAVENETDPAPADLARALDLLDRHQVSAL
VVNPQTSASAVNGLREAARRAGVPVVEVRETLPDGADYLSWQRNTVGQLQ
TALQPVRSLQP
>MAP0145 hypothetical protein
MRPRHLTALAVVAAVTAVSGCGSSSGTTTTKDLHLDYAYYNPLSLVIRDQ
QLLEKKGYHVTWVLSQGSNKANEGLRSKALDFGSTGGSPALLARANGTPI
KTVDVYARGEWTALVVAKNSPINAVADLKGKKVAVTKGTDPYFFLLQSLA
TAGLSPADIEIVNLQHADGKTALERGDVDAWSGLDPFMAETIQQQGSRII
YRNPDFNSGGVLNAREDFITAHPDSVQLVVDTYEEARKWAKTHPAELAAL
LASQATVSQSVAQEELGRTALDIDPVPGDWLRAVLTRIEPLAVADGDIKS
DDAGRNALNTLIEPKYARQAR
>MAP1441 hypothetical protein
MSTGMAVTTESAGAAGPDPYDVGELRANLRQADPGVLVAVLAQLTGDPAV
VDRFAPKITHVPDPPEQAGVTDPETAAQLVDEIVTALRTPRRADAVPADD
LDLFARVAPVALGGEVGPEYLGLLVEQGGFQPSQPVLPRTAKLPAGFRVV
IIGAGIAGITAALACADAGIEYQIIERNDEVGGTWYTTRYPGIGVDTPSA
YYSLSRDINGDWSSYYPQGAEYQAYLVSVADKNDLRKHTRFGTEVEALWW
QERRRQWQIHSVGPDGTRDVSYANVVIPAAGYLNRPRWPELAGRETFSGI
SIHSAHWDPELDLTGKRVAIIGAGCTAVQIVDACVDQVAHLTVFQRQPHW
VAPRRRASDDVSTYQRWLGTRLPYYANWIRIKSYWGTADNNYPVILHDPQ
WAAEHLSVSPANDVLLRMCLDYIDRVFGAGSELARKVTPDFAPYGKRIIR
DPGGYYAALAREHVDVEASEPARVNQAGIVTADGRQIDLDVIIYATGYYL
DFLSTVDIRGRDGKKLTDEWGDAPRAYRGGMVPGFPNMFISSAPNYSPGH
GAGHNFGVEVMVHYVMECLQLMALRRATTVEVTQRAYEEYVADIDALMAG
TVWCHTPSAHTYYRSGGGRIVTAFPYRLVDFWRDHRAPSEEDLELR
>MAP0180 hypothetical protein
MTEHLDVLIVGAGISGVSAAWHLQNRCPTKSYAILERRADLGGTWDLFKY
PGIRSDSDMFTLGFRFKPWRSAKSIADGASIKAYIKEAAVENRIEPHIRY
RHRVVAADWSDADNRWTVTVEHDGQRSEITCSFLFACTGYYNYDEGYSPT
FPGAEDFGGTIVHPQHWPEDLDYASKRIVVIGSGATAITLIPALVNSGAG
HVTMLQRSPTYIGSLPGVDPFAERANRLLPDRLAHMANRWKAIAFSTFQY
QLSRKAPAYMRKTLMTMAKRRLPEGYDVEKHFGPRYNVWDERLCLAPDGD
FFRTIRHGKADVVTDTIDRFTTTGIRLNSGEELPADIIVTATGLNMQLLG
GVTPTRNGEPVDLTSLMTYKGLMFSGMPNFAITFGYTNASWTLKADLVSE
FVCRLLNYMDAKGFDFVEPQHPGEDVDELPFMDFTPGYFRRSMHLLPKSG
SRAPWRLKQNYFFDMRTIRRGRVDDEGLKFAKKRAPVAV
>MAP2733c hypothetical protein
MVGRPRAGRLQKVGSKVVAAIGRQEWMDRPSYRFEHLLSFAYNGLGSARN
TVTNALNGVWLGHPVHPPLASLTSGALGTTVALDALSVLPGQPASEVVGA
SRFATRALGVGILASLGSAVTGVTDWQHTHEEDRRVGAVHGLLNVAATAL
YVQSWFDRRRGRHGRGILLTALGYGITVAGSYLGGALVFESGIGIDQSGP
RLRTSAWTPVLPASSLNGKPVRVEVDGVGLVVCQTKPGEVAAYGEFCPHL
AAPMADGWLDRGRLVCPWHGSWFAAESGEVVRGPAAAPLPCYQARVVDGV
VEVRGEQQPAPGGAVGIAKGGAS
>MAP1109 hypothetical protein
MTAQVVADQVVGGVISAPPLARPRRGTLRWQSRVLRVVSVAAAIGLWQLL
TADKVRLLLRFDTLPTVTEIVGALHRRLAAGEYWLDLAQSLLRILTGFGL
AAVIGVATGVLLGRSRLFADVFGPLAELARPIPAIAMVPVAILLFPTDEA
GIVFITFLAAYFPIMVSTRHAVRALPTLWEDSVRTLGGGRWQVLTQVVLP
GILPGVFGGLSVGMGVAWICVISAEMISGRLGVGYRTWQDYTVLAYPQVF
VGIITIGVLGFATSAAVELVGRRVTRWLPRAQDGAR
>MAP3776c hypothetical protein
MATVGACCRSGGPAGVPRNRAGAHRDADRVSGKSRALDMKTVIVCGVGGL
LSRWHGAPIATVLVLTVVTSCSSSPTQTAEGSRNAASPSAIGRTAAPPCP
TAPLAVVVSVDQWGDIVSELGGACANVKTVLASSSVDPHDYEPSPADAAD
FMNAKLIVVNGAGYDSWASKLAGSSASGAPLVSAAAVTTTPDGANPHLWY
LPSAVTAVADAVTQELSRMEPPAAGYFSQRRAQFTSATRLYVNLIAKIKA
EAAGKSYGATETVFDYQAQAAGLVNKTPAGYRRASANESEPSPGDVDAFL
TALAGRHIDLLIYNTQTEGSIPEEIRSAAEQSSVPVVKITETVPPGETSF
EDWQYGQLVQLAKALHVAV
>MAP1755c hypothetical protein
MTINASAPQRQGAVAVVDTVTALPAAVAGDGHTADPFEPLTVGAMVDRVS
AIAVEKAAHPWAFLMRSLVGGAMVAFGVLLALVVSTGVKTPGVASLLMGL
AFGMSFVLILVSGMSLITADMAAGFLAVLQRALSIRSYVVLVAVGLVGNI
VGALVFVTVCAAAGGPYLGAFADRAATVGTQKAGQPFWTALLLAVLCTWF
LQTSMCMFFKARSDVARMALAFYGPFAFVIGGTQHVIANVGFVGLPLLLN
LFHPIAARGDIGWGIGDHGLLTNIGVTTVGNLIGGTVFVALPFWIIAHLQ
RRRILSTGALRPDG
>MAP1107 hypothetical protein
MRRHAALLASVLIAVAALGTGCSLESLSQSAGVVNVVVGYQSKTINTVTA
GTLLRAQGYLERRLADITTRTGTKYAVRWQDYDTGAPITAQMLAEKIDIG
SMGDYPMLINGSKTQANPLARTEMVSITGYNPKGALNMVVVSPDSRARTW
PTWPAPRSRPAWARPATAPWCGP
>MAP2081 hypothetical protein
MADSPAATSPTLVAKQTNVRRPRWDRDHPRYKWVALSNTTLGMLLAMINS
SIVLISLPAIFRGIGLNPLAPANIGYLLWMLMGYLVVTAVLVVFVGRLGD
MFGRVRIYNAGFAVFTVAAIALSFDPFPLTGGAVWLIGWRVVQGVGGAIL
MALSAAILTDAFPSNQRGMALGFNMVAAVAGSFLGLLFGGLLSEWDWRAI
FWVGVPVGVLGTVWGMRSLHELGVRTPGPLDWPGTVTFGVGLTVVLVGIT
YGIQPYGGHPTGWTNPWVLGSIAFGLLLLIVFCFIELRAPQPMVNVRLFR
SAGFGMGNLANLMSSSGRGGLQFMLIIWLQGIWLPLHGYRFESTPLWAGI
YMLPTTIGFLIAAPVAGWLADRFGARPFAVAGMLLMAVTFIGLLMIPVNF
DYRVFALLIFLNALGGGLFAAPNTAVIMSSVPPRDRGAASGVRSTFFNAG
SALSIGVFFSLMVVGLAGTLPHALSSGLQQQGVSAAVAQDAAALPPGGQP
VRGVSGLQPDRRIARTVARTAATRRQCRDTDGRTRRVAEDPRRSAAVRRV
RWGRRRAS
>MAP1434 hypothetical protein
MRMHDTDEIRLIEAQAVPTRFARGWHCLGLIRDFGDGKPHPIDAFGQKLV
VFRGGDGAINVLDGYCRHMGGDLSRGEVKGNEIACPFHDWRWGGDGRCKQ
VPYSRRTPRLARTRAWTTLQQDGMLFVWNDPEGNPPPPEVTIPRIEGAGS
DEWTDWHWYSTVVQSNCREIIDNVVDMAHFFYIHGSLPKQFKNIFEGHVA
TQYMNSAGRPDIGGEGARMLGTTSVASYWGPSFMIDDLTYHYEDADHQTV
LINCHYPIDANSFVLQYGIIVKKSDALPDDLAMQTAIALGDFVKLGFEQD
VEIWRHKARIDNPLLVEEDGPVYQLRRWYQQFYVDVADVQPDMVDRFEFE
LDTTRPYAAWMKEVEANLAARA
>MAP1152 hypothetical protein
MDFGSLPPEINSGRIYSGPGSAPLLAAAAAWHGLAAEMHSAAASYGSAIA
ELRTLWHGPSSTAMAAAAAPFIAWLGGTAAQAEQTAAQATAAAAYDSVFA
ATVPPPVIAANRALLASLIATNVLGQNTPAIAATEAHYAEMWAQDAAAMY
AYAGASAVATRLTPFGAPPQSADANAAADQSAAAASALQLSTASSVESAL
SQGVSQVPVAAQVNATAVTAAAQLPLSLTDITGILKTFNSVMGTISGPYT
PLGVANLAKNWYQIALSIPSVGTGIQGIGPLLHPKALTGVLAPLLRSDLL
TGSTALSSAGTVSASAGRAGLVGSLSVPANWASAVPAVRTVAAELPETML
DAAPAMAVNGQQGMFGPTALSSLAGRAVGGTATRAVAGSTVRVPGAVAVD
DLATTSTVIVIPPNAK
>MAP1089 hypothetical protein
MSARRASWPVRISAAVLILTAAWAAAPGLFTDRNPLKGRPVDKFQPPSAA
HWFGTDHLGRDVLTRVIYGTSHTVATAGLAVAVGLLLGSAVGIAAGVSGP
VVDAVAMRASDVLLALPGFLTSVWIVTAYGPGPLSVGIGVGIGSIAVFAR
VFRAEVLRVRALDYVEAAFLSGETRWSVIRRHIVPNAAGAVIALAVIDLS
GAILLISALGYLGYSAPPPTPEWGLLVAEGRRYLATAWWLSTLPGAVVLS
VILALGVLSRRALTSHRI
>MAP4283 hypothetical protein
MPTSEYQVSGMSCGHCEAAVHSEVARIPGVDGVSVSADTGRLVVTSAVPI
DTDAVLGAVDEAGFQAVLVA
>MAP3774c hypothetical protein
MTVRVLALQYEPHWWSILTSGFMTNALIGGTIVALAAGLVGYFVVIRQSA
FAAHALAHIGLPGATGAVLLGVPVAAGLGVFCVGGALAIGVLGKRAADRE
VVTGTVLALAIGLGLFFNSLATKSSGTMTNVLFGNLLAISRDQLAGFAIL
LAVLALIVGIIYRPLLFASVNPVVAEAKGVPVRALAMIFMALLGLTVTMA
VQAVGTLLLFALVVTPAATAIMLTPRPSMAMLVSTAIGLSSVLLGLGASA
MFNLPPSFPIVVLACGIWSAVWASNHRHRVIAKAVDEATNAPPALMSSTE
PTATTRK
>MAP3732c hypothetical protein
MTRTTTRVTQLDPRTKTVLVLASSIAVMAPGGEVFVPAAVIVGMLLAVAE
QAWVRAAILPSAAGATAAVAYLLPQAIPHPIIGAIGTVAAYLLRLIAVGA
IVIHLVNTTTPSEFTAALRATHIPRAITVSGSVMLRFLPTIVGEARAVSD
AMRLRGIGGTYGMLRHPVCTIEYFTVPLIASSLRVAEDLSATALLRGLGS
AARPTTMYPPRFGKADALIGCIVSALTVTTVLWPVKP
>MAP3073 hypothetical protein
MTGLGYTLPAALAVVIVCAAELTVLRTGLFRRPAYWLSMLIVLGFQVPVD
GWLTKRSSPVVIYDDRQISGLRFPFDIPVEDFLFGFAMVTAVLLLWERRR
ARR
>MAP3299c hypothetical protein
MRNGRSRKLSGLNQTLAAQRGHQLVGVVRIPEEHASPIRVITRRLAIALV
VLFAAAVIVYADRSGYRDLRGGSLTFLDCVYFSAVSLSTTGYGDITPYTE
TARLVHTLIFTALRIAFLAVLVGTTLEVLSERSRQGWKIQRWRSRVRNHT
IVIGYGTKGKTAVAAILGDETTQAEVVVVDTDRSTLEHAESADLVTVHGD
ATKADVLRLAGAQHAASIIVATSRDDTAVLVTLTAREIAPHAKIVASIRE
AENQHLLQQSGADSVVVSSATAGRLLGLATTTPSVVEMIEDLLTPDVGLA
IAEREVEQSEIGGSPRHLRDIVLGVVRRPAAAHRRPRGGRGRGQRPVALH
PQRGALMAIADFQLRSVPLLSRVGADRADQLRTDVEAAAAGWADAALLRV
DSRNQVLVADGRVVLGAAAELGDKPPPEAVFLGRLEDGRHVWAIRGALQA
PDDPEVRAEVVNLRSLGPIFDDTSSQLMSSAVALLNWHERSRFSSVDGSP
TRPARAGWSRVNPVTGHEEFPRIDPAVICLVHDGGDRAVLARQAVWPERM
FSLLAGFVEAGESFEVCVAREVREEIGLTVRDVRYLGSQPWPFPRSLMVG
FHAVADPAQDFAFNDGEIAEAAWFTRDEVRAALAAGDWSSDSESKLLLPG
SISIARVIIESWAALD
>MAP2097c hypothetical protein
MTAPSDTQSAAGRTRRPPRPVVLLVPVPGTSKIHELWAGTKLLVVLGVSV
LLTFFPGWVTVGLMLALLVAAARLAHIPRGALPSPRRWIWIVLAVGGITA
ALGAGSPVVSIAGLHIGLGGTLHFLRVTALSIVLIGLGAMLSWTTNVAEM
GPALATLGRPLRWLRIPSDEWAVALALALRAFPMLIEEFQVLYAARRLRP
NQTPRSRRARRRQQARDMIDLLTAAIVVTLRRADEMGDAITARGGIGQLS
AAPARPKLADWVTLTITVAAGAIGVALDSMIPFQ
>MAP0453 hypothetical protein
MGPTRKRDLTAAVVGAAVVGYLLVQGLYRWFPPITVWTGLSLLAVAVIEA
LWARYVRTKINDGEIGSGPGWLHPLAVARSLMVAKASAWVGALVLGWWIG
VLVYFLPRRSWLRAAAEDTSGAVVAAVSALALLVAALWLQHCCKSPPDSG
EHGEGAET
>MAP3739c hypothetical protein
MSIGAQMLDHARSNQTWTTAVAALACFLMTLDITVVNVALPSIQKDLGAS
LEGLQWVVNAYVLAFAALLLTVGSVSDRLGRKRLFLTGVAVFTVASALCV
ASRTESPLIAARALQGIGGALVFGTCLALIADAYTDAEEEQRRKAVGLAM
AAGAAAATLGPLIGGGLVEIGTWQWIFAINVPVGVALAICTALKVREPHA
PHAADNSRVDSVGAVVAIVVLFALNYGLLTGAAKGWGRGDVLAALAIGLA
GGVGFVLHQLRRGSEATLDLTLFRIPTFLAAIVLGFTVRALSFGVFPFLI
LWLAGAHGRSAFDIGLILSALALPLMVCAVLSTSVARAVGVRATMSIAMV
ITAAGLFLATLIRGDGSWTTILPALAVLGVGNGVAMPHLMNLAVDVVPSN
KAGMATGAANTAFPLGTATGVAAFGVVLSSFVHAKVAASGVIPVHSADSV
ASAIVAGVLRFPTQAMTAFATSAFTDALRLIFGIAGCAALVAAGLSGALI
THRPRVAAESPESVTE
>MAP3987 hypothetical protein
MRLGVLDVGSNTVHLLVVDAHRGGHPTPMSSTKATLRLAEATDSAGKITK
RGAEKLISTIDEFAKIADSSGCEELMAFATSAVREAGNSEEVLNRVRKET
GVELRVLTGVDESRLTFLAVRRWYGWSAGRIINLDIGGGSLEMSSGLDEE
PEVALSLPLGAGRLTREWLPDDPPGRRRVAMLRDWLDAELAEASENILEA
GTPDLTVATSKTFRSLARLTGAAPSGAGPRVKRTLTANGLRQLISFISRM
TTADRAELEGVSAERAPQIVAGALVAEASMRALSIESVDICPWALREGLI
LRKLDSEADGTALMEPSVRNAGGQVVDRNQNRSRGDKP
>MAP1808c hypothetical protein
MTLTAESGPLAGSRTDVAGELRHVDKWYGNRHVLQDVSLQIPSGQIVALI
GRSGSGKSTVLRVLAGLSHDHTGRRLVAGAPALAFQEPRLFPWRDVRTNV
GYGLTRTRLPRAQVRRRAERALADVGLADHARAWPLTLSGGQAQRVSLAR
ALVAEPRLLLLDEPFGALDALTRLSMHTLLLDLWRRHGFGVLLVTHDVDE
AVALADRVLVLEDGRVVHELAIDPPRRTPGEPGAHTERYRAELLDRLGVR
Q
>MAP3834 hypothetical protein
MRDDDSANRWSGVREAPTAPSRQLTGAVVIIALVAAISGMLYGYDTGVIS
WALLQLTQDFNITEGWQQVIAASILLGAVAGALTCSWLSDLRGRRGTLLM
LAVVFIVGALWCADAADSVMLSLGRLVLGFAVGGATQTAPMYVAELAPPA
YRGRLVLCFQIAIGVGILTATLVGAGGSISWRGPIGLACVPAAIMLWLLL
RLPESPRWLVKKDNRDAARAVLEHVRPEGYDVAAELDEATELARVERTAA
TRGWRGLRDAWVRPALVLGCGIAVFTQLSGIEMIIYYSPTILTDDGVYRS
VALQVSVCLGAAYLIAQLVGLAIIDRVGRRRLTLIMVPGAAVSLFALGLL
FITSDSGRDVIPYIMICLIAFMLFNGGGLQLMGWLTGSETYPLAVRPAAT
ALQSATLWGTNLVITLTMLSLIKAIGVGPLMWLYALFNVAAWIFVFFRMP
DLTGKTLEEIEYQLSEGKFRPSDFGR
>MAP0998c hypothetical protein
MMITGDNPLTAKAIADEAGVDDFLAEATPEDKLQLIKREQAGGKLVAMTG
DGTNDAPALAQADVGVAMNTGTSAAKEAGNMVDLDSDPTKLIEIVEIGKQ
LLITRGALTTFSIANDIAKYFAIIPAMFVALFPGLDLINVMRLHSPQSAI
LSAVIFNAIIIVLLIPLSLRGVRYTPSSASKLLSRNLYIYGLGGIVAPFI
GIKAIDLIVQFVPGMS
>MAP0487c hypothetical protein
MEHRLADMADHLFSLDITVHLLGHDFVQQALVAAALLGLVAGLIGPFIVM
RQMSFAVHGSSELSLTGAAFALLVGIGVGVGALIGSALAAALFGVLGRRA
RERDSVIGVVLAFGLGLAVLFIHLYPGRTATSFALLTGQIVGVGYTGLTM
LALVCLLVIAVLATCYRPLLFATVDPDVAAARGVPVHALGIVFAALVGVV
AAQAVQIVGALLVMSLLITPAAAAARVVASPGAAMLASVAFAEVSALGGI
VLSLAPGVPVSVFVATISFLIYLACWLIGRRREAAT
>MAP2414c hypothetical protein
MARGLQGVMLRSFGARDHTATVVETVRIAPHFVRVRMTSPTLFEDVDAEP
AAWLRFWFPDPDGSKTEFQRAYTISEADPAAGRFAVDVVLHDPAGPASRW
ARTVQPGTTIAVMALMGSSRFDVPDEQPAGYLLIGDPASIPGMNGIIGVV
PDDVPIEMYLEQHHDDDTLIPIAVHPRLRVHWVARRDEKSLAAALESRDW
SNWYAWATPEATTLKHVRARLRDEFGFPKSEVHAQAYWSAGRAMGTRRGD
EAATTDDGTETPEAIAAQADSTQRQPEAAPVPAARGNWRTQAAGRLLAPL
RWALIPSGVLQAVITLIQLAPFVLLVELARRLVAGAPAARLWDVGIAAVS
LLGLGALLGAALTLWLHVVDARFARDLRSALLRKLSRLPLGWFTARGSGS
IKQLLQDDTLSLHYLVTHAIPDAVAAVVAPVAVLVYLFAVDWRVALVLFV
PVLVYLVLTASLTIQSGPRIPQSQRWAETMSDEAGAYLEGQPVIRVFGGA
AASSFRRRLDEYVGFLVAWQRPLAGKKTFMDLVTRPSTFLWLIAAVGTLL
VVAGRMDPVNLLPFLLLGTTFGARLLGIAYGLGGIRAGMLAARRLQNTLD
EHELEVREPGEPTGESAQAVVFDNVGFGYRPDVPVIHDVSLTLRPGTLTA
LVGPSGSGKSTLAALLARFHDVDRGSITVGGRDIRSMTADELYARIGFVL
QETQLVHGTVAQNIALAVPDATAAQIEQAAREAQIHDRIMRLPHGYDTVL
GAGVGLSGGERQRLTIARAILADTEILILDEATAFADPESEYLVQQALNR
LTRNRTVLVIAHRLHTITRADQIVVLDHGRVVERGRHEELLAADGRYRRL
WEGGRRDAVTVGTAGEVAR
>MAP2825 hypothetical protein
MWDSGGMKHGSDSGFDGGFDDFDRNKSRPVLITAAAPSYEEQHRARVRKY
LTLMAFRIPALILAAVAYGAWHNGLISLAIVAASIPLPWMAVLIANDRPP
RSPDEPRRFDNARRRTPLFPRAEQAALEPPPAAQARWQPGGWDGIDRDRP
PFH
>MAP4272c hypothetical protein
MHKGATFAVAAVTAAIAPLAACANQQSSQPNTAPLTSSVPGSERLTTQLK
TAEGIPVANASFEFANGYATVTVEAGPNQVLSPGFHGLQIHAVGKCEANS
TAPTGGSTGDFESAGAVYQAPDHTGYPASGDLTALQVRSDGSAKLVTTSN
AFTAADLRTSSGSALILHQNANNLANTPAADSGKRLACGVIAASSATSTT
TTPTTSVTTSTTTVAVPPPSTSTSTSTVTVTGTPTATSTPTTTVTTPPSL
PPGR
>MAP0741c hypothetical protein
MATVDEIPPGTHKLVPIGRHGVGVYNVNGTFYAIANYCPHQGGPLCSGRP
RGRTIVDETAPGDSVMVRDLEYIYCPWHQWGFELATGTTAVKPEWSIRTY
PVRVVGNDVVVQA
>MAP0470 hypothetical protein
MPNTSPVTAWKSLKEGNERFVAGKPQHPSQSVEHRASLAAGQSPTAVVFG
CSDSRVAAELIFDQGLGDMFVVRTAGQAIDTAVLGSIEFAVSVLNVPLIV
VLGHDSCGAVKAALGAIEEGAIPGGFVRDVVERVAPSILMGRREGLSRVD
EFEERHVRETVAQLVSRSTTIAERIGDGTVAVAGVTYHLADGRAALCDHV
GDIGE
>MAP1127c hypothetical protein
MSSPAVPDHHTLIIGAGFSGIGAAIKLDKAGLPDYRVIEAGDGVGGTWHW
NTYPGIAVDIPSFSYQFSFEQSRHWSRTYAPGRELKAYAEHCADKYGIRS
RIRFNTKVLAAEFDDEPRLWRVHTDPGGTVTARFLISACGVLTVPNLPDI
DGVDSFGGITMHTARWDHGQDLSGKRVAVIGTGASAVQVIPEIAPIVKSL
TVFQRTPIWRFPKLDVPLPAPARWAMRIPGGKSVQRLLSQAYVEVTFPIS
AHYFTVLPLAKRMATLGKSYLRQQVRDPEVREKLTPKYAVGCKRPGFHNG
YLATFNRDNVRLVTEPIDKITPDAVATTDGEHHRIDVLILATGFKVMDPD
NVPTFAVTGPGGRSLSRFWDEHRLQAYEGVSVPGFPNLFTVFGPYGYVGS
SYFALIEAQTHHIVRCLKRAERLGAARVEVSEEANARYFAEMMRRRHRQI
FWQDSCKLANSYYFDKNGDVPLRPGTTPEVYWRSRRFNLDDYRFSA
>MAP2765c hypothetical protein
MVAAQGSSMLTAADFAAQWADVPPWEPPDEPPQRNGQRQQQASAEPTTWE
AFDLGPYLRGEIERPHPGIGISRSDGQRSLYPGREHAIVGETESGKTWFA
LGCAAAELNAGNDVVYIHYEEPDATSTVEKLCLLGVDPAVIKARFRFVAP
SRPVREEWLNALLDPSPTLVIHDGVNEAMALHGDEIKAVEGAAAFRRRLI
LPCLRVGAATLACDHLPMVRDGSRRDAYGSVHKGNALDGARFVLENSAPF
GRRLRGVSYVFVTKDRPGHLRANGRATKSPGKTFMGTLVVDDSQAFGPDF
TMRFFAPRDDDVPESDPNAELADAVFRVVAAAPDHAVGSMRLLFAELRNV
DIQFRDDDVRDVVDDLVVSGRLVEISGKRGAKGFRAVVEDADGDST
>MAP4065 hypothetical protein
MAIIHGADRGARPQSWEEVARVTETTTQPVADAVADRPSVSLRTRGRWLR
WGLLSVWGPGLVVMLADTDAGSLITASQSGAQWGYRMVLPQLILMPVLYV
VQEMTVRLGIVTGRGHGSLIRERFGRGWAWLSAFTLFASAIGALLTEFAG
VAGVGELFGVSRWVSIPVATIALLALALTGSYRRVERIGLAVGAAELAFL
VAMVMARPDPGALAHGLTSMPLGDSSYLLLIAANVGAVIMPWMIFYQQGA
VVDKHLSESTIRQARYDTAFGAVLTQLIMIAVVITMASTIGRHGDGAPLE
TVGQIAQSLTPYLGHVGGTVLFGLGMLGAALVAAIVASLAGAWGLAEVFG
WKHTLNQRPNRATAKFYLTYSLAHIVGAVLVLASVDLVNLAVDVEIMNAL
LLPIVLGLLLALEARALPEQWRMRGLHKHVTRALCLVTIGFGLYMVPQAL
GWA
>MAP2550 hypothetical protein
MGSVNRVYIARLARILVLGPLGESVGRVRDVVISISIVRQQPRVLGLVVD
LATRRSIFIPILRVAAIDPNAVTLSTGSVSLRHFEQRPGEVLAIGQVLDT
VVKVNDPELPELAGVDVVVTDLGIEQTRTRDWMVTRVAVRPQRRLRRRGP
VHVVDWRNVQGLTPSALALPGQAVAQLLEQFEGRKPVDVADAIRGLPPKR
RYEVLKALNDDRLADILQELPELDQAEVLSQLGTERSADVLEEMDPDDAA
DLLGVLNPTDAEMLLKRMDPGDSASVRRLLTHSPDTAGGLMTSNPVVLTP
DTAVAEALARARDPDLTAALSSMVFVVRPPTATPTGRYLGCVPLQRLLRE
APAELVGGIVDSDLLTLRPETPLVAVTRYLAAYNLVCGPVVDDENHLLGA
VTVDDLLDHLLPPDWRVDMQELDTAGRLEGLGGSG
>MAP3089c hypothetical protein
MPENDSAAADPDLLIELRDVSLRRGGNVLVGPLDWAVELDERWVIVGPNG
AGKTSLLRIAAAAEHPSSGVAFVLGERLGRVDVTELRSRIGLSSSALAQR
VPSDEVVRDLVVSAGYAVLGRWRERYEDIDYRRAVDMLESLGAEHLADRS
YGTLSEGERKRVLIARALMTDPELLLLDEPAAGLDLGGREELVARLADLA
ADPDAPALVLVTHHVEEIPPGFSHCMLLSEGRVVAAGLLTDVLTSENLST
AFGQAIALDVVDGRYFARRVRTRAAHRRQL
>MAP2717 hypothetical protein
MLSGRPNGEQGRRVATVRSGTMRSMGAQTALTLLLAAIAVIVLVDWISER
TRLPSATLLVLVGIGQALLPGPTIGLEPDVVMTCILPPLLYHAALESSLV
GIRRNLRTVVSLSVVLVLLTAASVGVAFSLLVGGATLAVGMVLGAAVAPT
DPVAALAVARKEGLPLNIVTLIEGEGLLNDATALTTLAVAVTVARGAAFS
APSAIREFVLAAVGGLVVGQVFAYARRLLRRWRHDVLTANAISLATPFLT
YLVAEKLSASGVLAVVVCGLIVGHDSPRVESGASRLQTRAVWRLVNFLLE
GVVFLLIGHQVPVILDELGGYALSTILVAVGVTVAVVLLVRPLWLLLTQA
LPRSLHTRLGNVDDSATDSAEPRPERTRERLSGREIVVLSWAGTRGVVSL
AAIFAVPVLTEGNAPFPDRDLLLFCTLVVVLVTLIGQGVTFGPLVRALGL
RAKTTDELRLRNRARAAALRAALDRLDSLDPEDGADVDPRVIDGVRQQLS
AQLERYEHRLTLLSDVDELPAAPAYEAAVGLRRVAIDAQRDELVRWRDAG
MLPDQSLRAIERELDHEESMLPMGLPRSPRRSQKR
>MAP3775c hypothetical protein
MRGGRLIWSHATFDIPAGGIVAVIGSNGAGKTTLLNMVLGLIPSATGRLE
IFGRRPGQANDNIGYIPQHYADSSGEAIRAADAVLLGLTGRRWAFGRSTT
SQQTRVAEALAAVEATDLGCRRLSTLSGGQRQRIAIAVALVARPQLLILD
EPLASLDLRSQRDIVALLARLHAELAVTILVVAHDLNPLLPILDSVIYML
DGRPRYVPVGDIMDDTLLTRLYGTPIHVHRARDGALYMRSAL
>MAP2579c hypothetical protein
MGEPSVRATTPGTPMRRIAAACLVGSAIEFYDFLIYGTAAALVFPAVFFP
RLGPTVATIASMATFATAFLSRPLGAAVFGYFGDRLGRKKTLVATLLIMG
ASTVSVGLVPSTASIGIAAPLLLTVLRLLQGFAVGGEWAGSVLLSAEYAP
TDRRGWYGMFTLLGGGTAGILASLTFLAVNLTMGEHSPAFMHWGWRVPFL
ISSGLIGIALYVRLNIDETPIFVEEKARHLVPKAPLTELLRLQRREIILV
AGSFVGGMGFIYLGNTFLVMYAHNHLGYSRSFIWGIGALGGLTSMACVAC
SAWISDRVGRRRVMLWGLVACLPWAFVVIPLIDTGRPVCYVVAVLGMFGT
AAVANGPTAAFVPELFATRYRYSGAAVAMNLAGIVGAAVPPLLAGTLLAT
YGSWAIGLMMASLVLASFVSVYLLPETRGAALDAAAAAEKVAAR
>MAP0616c hypothetical protein
MAPLITLVVGSLVAWVVGRLGVAYVDGWAPALAVGLAAMFVLTGIAHFAP
PLRADLVAIVPPRLPAPGLLVSLTGVLELLGALGLLLPATRAAAAGCLLV
LMLAMFPANIHASRMPDPPKSMTTRLPLRIGMEIVFLAAAVAVALGGR
>MAP0223c hypothetical protein
MTAPRLYRPSPPLAEHIEYFGYWRGDEALGVHTSRALPRGAVTAIVDVAG
RTDLGFYASDARTPLTVPPLFAAGAGATAYVVRVAPAHTVMTIHFRPAGA
LAFLGCPLSDLEDALVGLEELWGRDAALLREQLIDAGSPPRRVALLQAFL
VRRMRRNAVWPPARLAPVLRGADLDPSMRVSKAQELSGLSRKRFAALFRC
EVGLSPKAYLRVRRLQAALRALDTPARGATIAADLGYFDQAHFLREFRAF
TGVTPTQYARRRSSMPGHVELAR
>MAP2598c hypothetical protein
MNYTGHNGKPLVERVSTENAAARIEGLPRVPISKATAHEVISLSYGFFTP
LTGFMGRREVDATLDEFALPDSTLWSIPIVFDMSADDIAERDVKEGASVV
LDYLGVPMAILDVTEIYEYDLERMAEKTYGTTDPRHPGVKKTLGYHNRFI
GGDITLINEPVFNEPFKSFWLTPRQHQDALAAKHWHRVVAHQTRNVPHTG
HEALMKQAWLAANEDQPVDSLNTGVLVNAIIGQKRVGDYIDEAILLAQNA
LRTSGYFRENVHMVSFTLWDMRYAGPREAIFHAILRTNLGCTHHMFGRDH
AGVGDFYHPYDSQNILKQYRNQLGIKPVFLRENWYCPVCLEVTNSALCGH
EAQAQSFSGSLIRSILTDEVKPTQKVMRHDVFEVVMECAAKHGQGSPFVT
EEYLANRLPVFTLNQLEGS
>MAP0993 hypothetical protein
MVTGRLAGIDCGTNSIRLLIADVRDGRLRDVHRETRIVRLGQGVDATGEF
APEAIARTRAALSDYADLLKQHGVQRVRMVATSAARDVGNRADFFSMTAD
VLGAVLPGAVAEVITGADEAELSFRGAVGELDSAAGPFVVVDLGGGSTEI
VVGGSDGVTASHSADIGCVRLTERCLHSDPPTPEEVALARQVVRERLEVA
LGVVPVEGARTWVGVAGTMTTLSALAHDLPAYDSAAIHLSRVSGRDLLAV
CERLIGMTRAQRAALPPMHAGRADVIAGGAVVVEELARELRARAGIDELT
VSEHDILDGIVLSIAG
>MAP3682 hypothetical protein
MTENFTTTNAGAPAPSDELSLTLGPDGPVLLQDFYLIEQLAAFNRERVPE
RQPHAKGTGAFGRFEVTNDLSAYTKAAVFQPGTKTDVFVRLSGNAGERGS
ADTVRDTRGFSVKFYTTEGNFDLVGLDFPVFVIRDPIKFPQMVRSAKRRA
NNDCRDHNMQWDFWTLSPESAHQVAMIMSDRGIPKTFRHMHGFGLHTFSF
LNAAGEISWVKFHFKSNQGIEWLTQEEGDRLAGTDPDYCIRDLYEAIERG
DHPSWSVKVQIMPFEEAKTYRFNPFDVTKVWPHADYPLIDLGTMTLDRNV
TDHHTEVEQVTFAPHALVPGIGLSPDKLLLGRSFAYADAHRYRVGANHNQ
IPVNAPRCPVRSYSKDGQMRFVNSTDPVYAPNSYGGPKADPDRASVVKWA
VDGGMMVRAPYTLRPDDDDWGQAGALVRDVMDDAERERLVHNIVHHVTDG
VKEPVLSRVVEYWYNIDADIGKRVEDGIRAANLGR
>MAP1760c hypothetical protein
MEPDSTAVSRRRALGGAVLAGLSGAAIGAVGGGFAGHAVAAGQRGDDHDT
VDLRRSYPFYGQPHQGGIDTPPQRYAMFMSFSLASGAGRTELQTLLARWS
AAAAILQQGKPVGTVQPQVDVQPPADTGEADGLSPASLTVTIGLGPSLFG
DRFGLAARRPAVFTDLPPLNGDNLDPRLHGGDLSVQACADDPQVCYHAVR
NLARLGRNIVSPFWAVLGFGRASAGPGQHTPRNLLGFKDGTRNISSQAEY
DRFVWVDNSDQPWMNGGTYQVVRKIRMLLETWDVDRIGNQQRIFGRTKEE
GAPLSGRHEFDTPDFTAKGPDGNPLIDPMSHVGLAARENNDGIMIRRRSY
NYTDGLDANGQLNAGLLFVSYQKDPQDFIRLQNRLGAHDLLNEYIRHIGS
AIFAVPPAPAEGHYIAQSLFR
>MAP0631c hypothetical protein
MVNIVAVRRHGVHVRVIHVPPVQPQPILAPLTPAAIFLVLTVDDGGEATV
HEALQDISGLVRAIGFREPQKRLSAIASIGSDVWDRLFSGPRPAELHRFV
ELHGPRHTAPATPGDLLFHIRAESLDVCFELADRILKSMAGAVTVVDEVH
GFRYFDNRDLLGFVDGTENPDGALAVSSTAIGDEDPDFAGSCYVHVQKYL
HDMSAWTALSVTEQENVIGRTKLDDIELDDDVKPADAHIALNVITDDDGT
ELKIVRHNMPFGELGKSEYGTYFIGYSRTPRVTEQMLRNMFLGDPPGNTD
RILDFSTAVTGGLFFSPTVDFLDDPPPLPAPGTPAAPPARNGSLSIGSLK
GTTR
>MAP0204c hypothetical protein
MQVTSVGHAGFLIRTQAGSILCDPWVNPAYFASWFPFPDNSTLDWDQLGD
CDYLYVSHLHKDHFDAKNLAEHVNKDAVVLLPEFPVPDLRNALQELGFHR
FFETADSVKHRVGGLDVMIIALRAPADGPIGDSALVVSDGSTTLFNMNDA
RPVELDMLASEFGHIDVHLLQYSGAIWYPMVYDMPARAKESFGIQKRQRQ
MDRARQYLAQVGATWVVPSAGPPCFLDPELRHLNDDHGDPANIFPDQMVF
LEQLRAHGQGGGLLMIPGSTADFSGSTLNSLHHPLPTAEVEAIFATGKAD
YIAAYAERMAPVIAAERAGWAPATGEPLLEPLRALFEPIMSQSDEICDGI
GYPVELVLGPERVILDFPKRTVREPIPDEKVRYGFAIAPELVRTVLRDRE
PDWVNTIFLSTRFKAWRVGGYNEYLYTFFKCLTDERIAYADGWFAETHDN
SASITLDGWEIQRRCPHLKADLSKFGVVEGNTLTCNLHGWQWRLDDGRCL
TAKGHQLRSSRA
>MAP0886c hypothetical protein
MGSRLRPIPQTHRGDRHRRRRRAPPRPAGAHRGVGPGVRSCSPPGGARDT
ALANPRPTPAASDTATARAGDRAVADPGGDPLGRADADGAHHHVDAIIYG
TGFAIPAHVADDTITGAGGLPLRRAWPDGTEPFCGVAVRGFPNYFFASGP
DPGPQARYIVECLKLMQRTGSRRIEVRASSQQVFNERAQLRPVEPPPVAS
AFDLSASTPAGDDTYDGAATLEIAGDRHPVRVRLTGHLDPIDGRYHWQGT
VFGSPSQPLPGDLLGQARAATLTVGQRSAPARIVERTPWGTHTVAGIGAP
PYPGHR
>MAP3420c hypothetical protein
MMDLFAPPEVTSTLIHTGPGAGSLIEAAAAWQRVAVELENSVSSYASTLS
SLIESWDGPSAMAMLQSVQPYLLWLRETAQQSAQLANSAEAAATAFGTVR
STVVHPSVVSANRTRLAQLLATNRFGTNTAAIAETENEYQTMWANNSAAL
SRYQAASSQATSPLTQFNSPLAVTDPGGTANQQAAVMKASVDSSGSSVGS
VLNDLNMPGGFDPNAGWFNYFSTWGNQFISSGFPINLLGVWAQLATAQGV
ASVGGDIGSGLSEGLGATTASLANAIKGIGAGAVAPSGAMGVGVSLGKLT
APPAVVGLLPGTQTGVQLASAASPLPAAESGFPLMPMMVPPPTTSAGTGW
RKRKQQKYEDVAYGREVKGKVMPRNPSAG
>MAP2487c hypothetical protein
MAVTDDYLAHNAGYASSFEGPLPMPPSKHVAVVACMDARLDVYRILGLRE
GEAHVIRNAGGVITDDVVRSLAISQRLLGTREIILIHHTDCGMLTFTDDD
FKRGIQEETGIKPPWAAEAFADLAEDVRQSLRRIEANPFVTKHVSARGFV
FDVATGKLDEVKP
>MAP0142c hypothetical protein
MMSSNVRAARNRPGRTDPQPPTGAPLFVGVLGLLLATGWVANHFVGLMPA
ISDRDHLATTTLDGIFGIYALGLLPGLLVGGRTSDALGRRPVALTGSAAA
LVGTVAMLLSQHSPALFAGRLIVGLGVGLAISAGTAWASDLRGPAGAATA
GAVLTAGFAVGPFAGGVRAWAGPSGVRASFALAAAILALAAFAVVAAPQP
SPVTAPADPDGEETADAAPQGISRALSWAMPLAPWVFASATLGFVTIPGR
LHTALAAPVAAGTATLIVNGVSGAVQVLARALRWGPQAGTAGAVLAALGY
AVAAAAPPTLTPALGVPLFVVLGCASGLCLREGLIDLGGRRAATPARRPD
GLFYVVTYIGFGLPLILASVRPGVATAILSGMAVLAMTAAVGRAARLRRD
DHRQN
>MAP3031 hypothetical protein
MQTDPVATPDVGAGTRWSIMVVSLLATASSFLFINGVAFLIPSLQGARGI
RLDEAGLLASMPSWGMVVTLVAWGYVLDRVGERVVMTTGSALTALAAYAA
SSAHSMVLIAAYLFLGGMAAASCNTAGGRLVSAWFPPHQRGLAMGIRQTA
QPLGIALGAMVIPELAEHGPQAGLRFTALACVFGAVASVIGIVDPPRKPR
ASASDQELASPYRGSLTLWRIHAVAGLMMMPQTVTVTFMLVWLIRNLHWS
VAAAGALVTLSQLLGALGRVAVGRWSDRLGSRMRPVRYIAAAAVLVLLLL
AWADYLNSRWQAGLMVAISVIAVLDNGLEATAITEFAGPYWSGRALGIQN
TTQRMMAAAGPPLFGALISAAKYPPAWLLCALFPLAAMPLVPTQLLPPGL
ETRARRQSVRRLRWWRAVRSHALPDIVRRPGQPG
>MAP3517 hypothetical protein
MTSMHTRGRRTPGAAVQSSAVSDAAHVGDIVGAFGRIRRDGGGADGAAGG
RWRRLRTLAVITGPGLIVMVGDNDAGGVATYAQAGQNYGMGLLWTLVLLI
PVLYVNQEMVLRLGAVARVGHARLIFERFGRFWGAFSVGDLLILNALTIV
TEFIGVALALGFLGCPKIVAVPAAAALLFAVVAGGSFRRWERLMFLLIAV
NVLIFPMVMLVHPAPKATVAGLIPQFPGGLNSTVLLLVVAIVGTTVAPWQ
LFFQQSNVVDKRITARWIPYGRADLVIGIVVVMVGATALMAVTAFGLAGT
AAAGHFTDAGAVAAGLSAHLGRTVGVLFAIILLDASLIGANAVGLATSYA
VGDAMGKRHSLHWKITEAPLFYGGYAVLLAVSAAVSFSPDHILGLVTQGV
QALAGVLLPSATVFLVLLCNDRAVLGPWVNTVRQNIAAWTIVWCLVLLSL
ALTATTFFPDLSTGTIEAGLAAGAVLGVVAGAVMIVVGRRQRDLAEAEAI
VRTLGGGLDPEQVDELDDASSLTRAERRAVRRQDRENWQTPSLALLDRPA
MSPMRRAGLFTLRGYLVVAVVFVIIKLVQAGVVGPAGSL
>MAP2516 hypothetical protein
MPDTSRGPALLILFATLLATAGTGISIVAFPWLALQHRHSATDASIVAAA
MTLPLVLATLIAGTAVDSFGRRRVSLVCDWLSGAAVTAVPLTAWIFGAAA
IDVAELAVLAFCAAAFDPAGMTARQSMLPEAAARAGWSLDRTNSSYEAML
NLAFIVGPGLGGLLIATLGGINTMWVTAGCFALSFLAIGALRLDGAGKPP
RATRPVGLVTGIAEGVRFVWNLRVLRTLGLIDLAVTALYLPMESVLFPKY
FADQHQPAELGWALMALALGGVAGALGYAVLSARLRRRTAVLTATLTFGA
TTAGIAVLPPLPVILGLCAVTGVVYGPIQPIYNFVMQTRAPHHLRGRVVG
VMAGLTYAAGPLGLLVAGPLADAAGLKATFLTLAVPILAIGVVACGLPSL
RELDRAPQFADDPGP
>MAP3509 hypothetical protein
MDTQQDGCGPTETPDDIDIDALRQKYAHEREKRLRKEGSKQYIELEDDFS
GYYEVDPYTPVTPREPIREDIDVAVLGGGFAGLLSAAHLKKAGVDDVRII
ELGGDFGGVWYWNRYPGIQCDNESYCYIPLLEELDFMPSKKFADGAEIYQ
HCRNIGKHFGLYDSAIFSTQVHDLRWDEQIKRWRVSTNRGDDIRARFVVL
ASGPFHRPKLPGIPGIKTFGGHSFHSSRWDYDYTGGDSGGNLHKLADKRV
GVVGTGATGIQIVPFLARYAQHLYVFQRTPSTVDARNNTPTDPEWVKTLR
PGWQRERQRNFHAWTFEGMAPGQPDLVCDFWTELGRNTAARVLALDDPAS
LTPEQFMAIREEEDYKIMERLRRRIDTLIDDPATAEALKPYYRFLCKRPC
SNDDYLPSFNRPNVTLVDVSASKGVERATEKGLVANGVEYELDCIIYASG
FEITTEISRRYSIETIAGRDGLSLFDYWRDGYKTLHGMTSRGFPNQFYTG
FTQVGISANIAANYELQGEHIAYIIAEALKRGAATVEPSDEAQQQWCTTI
RETAVDNSAFDAQCTPGYYNNEGGGGGEGIRSHLGEPYGPGFYAFEDLLR
AWRDKGDLEGLVLGS
>MAP0424 hypothetical protein
MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGTPEPHGARLSVVVDHTW
RAEALADMISEAGLVAEIGRTDEDTPLVRTAVDPALSPLAAEWTRGAVKT
VPPRWLPGPRELRAWTLAAGNPEGEHYVLALDPHAPDTHSPLASALMRVG
IAPTLIGTRGGRPALRISGRRRLSRLVENVGEPPDSPEASAHWPRV
>MAP2200 hypothetical protein
MAFRPVSAAAASAGAILGGRTHPRAGGVSMARDHAHAALIIGAGFTGLGA
AIRLAEAGVDDIVILERADRVGGTWRDTTYPGASCDVPSLLYSYSFVKNP
TWSRTYSPAPEIYRHLEDMADRFDIRRRIRFGHEVSGLAFDEDAGVWTAT
TKNRKKFRARTVVLASGPLSDVSFPDIRGLDSYRGHKIHSARWDHDYDFA
GKRVAVIGTGASAIQIIPELVKQAGFVKVFQRTPCWVLPRLDVATPPAVQ
ALFAKVPAAQELARQALYWGHEASATALVWDTPLTSLVARLGKAHLRAQV
KDPWLRRQLTPDFRPGCKRMLVSSDYYPALQRDNCKLIDWPIATLSPAGI
RTSDGIEHHLDCIVFATGYDVHLTGPPFPVTGLGGRSLAAEWAGGAQAYK
SINVHGYPNLFVMTGPNSGPGHNSLLVYIEGQLDYAVRGITTILNDDLRY
LDVREEVQRRHNEAIQRRLTKTTWMSGCRSWYLTKDGFNGSMYPGFATQY
LRQMSDFRYQDYQAVARRARTPAASSA
>MAP2343c hypothetical protein
MVRTGYGVQVARTVSDSAPDSAEDPSEREFGQAGIALSTYRFPTGWFIVA
FGSDLAPGQVKRAHYFGEELVLFRTASGRVHVMDAYCQHLGANLGVGGTV
EGENIVCPWHGWQWRGDGSNALIPYSKIGCKNNVRIRTYPSMEWYGFVLA
WHERHGRAPYWQPPVLPELETGEYYPLHPHTQMVNRVKVHPQMIIENAAD
PYHVQYVHKAANPATTASFEVAGYHLHATVNAHFGGGRAQTWLTPNGPVD
AKIIYDNYSLGLGVVRFPSELVATVQVTGQTPVDEDYTDYFYTQASVREP
GDTGDVPTGRAARFLALQKEVIKQDFFTWENMKYLEKPNLAPEEAHDYAA
LRRWAHRFYPGPQPAPTDFGYTADGEPDPAAAKA
>MAP0127 hypothetical protein
MTNSGEGPMQRAGSPERQGVSTSRSERLREVLRYDLPASLVVFLVALPLS
LGIAIASDAPVLAGLIAAIVGGIVGGWLGGSPLQVSGPAAGLTVVVADVV
AEFGWGVTCFITVVAGVLQVLLGFSRIARAALAISPVVVHAMLAGIGITI
ALQQVHVLLGGSSKSSAWSNVTGLPAQILGAHRPGLVLGLLVIAILVAWR
WVPARLAIVPGPLVAIVVVTIISMVLPFKVSRIELDGSVLDAVRLPSLPH
GNWGAVAIAVITVTLITSVQSLLTAVSIDRMHTGPRTDFNRELIGQGAAN
IASGALGGLPIAGVIVRSSANVNAGAKTRASTIMHGFWVLVFAVPFAGLV
EKIPTAALAGLLIVIGIELLKPAHIETALRNGDLAIYLVTVTSVIFLNLL
HGVLIGLLLAVVVTGWRVVRARIEAEPVGDGWHVVIEGACTFLALPRLTG
VLASIPERTSVTVHLLTNYLDHAAHQAIGDWQRRHCATGGTVEVRDTAEP
AARRRNSHLSLVEQVSSPGGA
>MAP2376c hypothetical protein
MRMTEADTPFDVLVIGAGFSGLYMLHRLRQLGIPARVLEMAENVRGTWLF
NRYPGARCDIESIEYSYSFSEEIQQEWVWTESMPAQPEILAYLNYVADRL
DLRRDIQFGAEVVAMTFDEDAAMWSVRTRSGDTFRVPFVVAASGILSVPL
QPDIPGMNTFAGTSLFTSRWPAAGVDLTGKRVGVIGTGSTGVQLIPVVAR
EALHLSVFQRSPAYTLPWRVHRFQPGELDEMKARYGEIRAAQRAHPIGAA
RLSAFSVLLEMLGRPPLKSATPEERLRAIEEHGVLGALNWGDVFFDIEAN
RMAAELYGEAVARIVKDPETAASLVPVHPFACKRPIIDQGYYETFNRDNV
TLVDLRKSPIREVTPAGIRTEDRLHELDVIVYATGFDAMTGALSRIDIRG
RGGIGLAEFWATQGPLSYLGLAVAGFPNLFTVQGPGSPSAATNFVAALEQ
HVEWIGDCIGYLRANHIRTIEALSTAQQEWIEHTTALVAPTVLVHPSCNS
WYNGGNVPGKKRMYMGYTGGIPEYRRRCDEIAAGGYTGFKLA
>MAP1108 hypothetical protein
MGSAGHGTLVRALSRAGVNGVEVLNQQPQVGASALESGQVQALSQFVAWP
GLLVFQGKAKLLYDGAELNLPTLHGVVVRRSYAAAHPEVLAAFLQAQLDA
TDFLNAHPLQAARIVADASGLPPEVVYLYNGPGGTSFDTTLKPSLTEALK
SDVPYLKSIGDFADLDVDKFVVDEPLRAVFTARGLDYQAARARTTNPSTL
RGDPALAGELWLDGADTTQTTADPASLLRAVRDALGRGARVRAAYVPDTE
FGTRWFADKAFWVKDGQNYLPFGTAAGAGRYLAAHPGGIAVNYQQALGGS
V
>MAP1394c amt, Amt_1
MHGIDPAATAWLLASTALVLLMTPGLAIFYGGMVRTTGVLNMIMMSFISI
PLVTVAWLLVGYTMAFSQDGMSGLVGNLRHFGMLGITPDTTHGAVPELLY
ATFQLTFAIITAALVSGAIADRAKFAAWMVFVPVWTVAVYSVVAQWVWGP
GGWLARLGVLDYAGGLVVEIVSGSSALALALVLGPRIGFKKDAMRPHNLP
LLFVGVGLLWFGWFGFNAGSALAANGTAAAIFLNTLVAGCLGMLGWLSVE
QIRDGRPTTFGAASGVVAGLVAITPSCGTVNTLGATVVGLAAGVVCSFAA
GAKLRFNYDDSLDVVGVHFVGGVVGVSLIGLFATAVMTAGPQGLFYGGGV
AQLGKQALAIAVVALYAFTVSFLLAKVIDRVMGFRVSAEDETTGVDLTQH
AETAYAEGVHGHLPQRRPGPGDRLK
>MAP2988c amt, Amt_2
MRVTYPILGQPNTGDTAWMLASSALVLLMTPGLAFFYGGMVRARSVLNML
MMSISAMGVVTVLWVLYGYSVAFGDDVGNFMGKPTSYWGLKGLIGVNAVA
ADPSKGTAATDIPLAGTLPATVFVAFQLMFAIITVALISGAVSDRLKFAA
WLVFAGLWATFVYFPVAHWVFAFDGFASEHGGWIANKLHAIDFAGGTAVH
INSGVAGLMLAIVLGKRRGWPTTLFRPHNLPFVMLGAGLLWFGWYGFNAG
SATSSNGAAGSTFMTTTIATATAMLAWMLTERIRDGKATTLGAASGIVAG
LVAITPSCSSVNVLGALVVGLVAGVVCALAVGLKFKLGFDDSLDVVGVHL
VGGLAGTLLVGLLAAPESPAISGVTGVSKGLFYGGGWAQLERQAVGAFSV
LIYSGVVTLILALILKYTMGLRLNPEAEASGIDEAEHAESGYDFAVATGS
VLPPRVAVADTRNGLEEQRVGDKVEAEQS
>MAP2805 arsA, ArsA
MSVIAIAVFVVAYALIASDRVNKTFVALAGAAVVITLPMIRSDDVFYSRE
TGIDWDVIFLLLGMMIIVSVLRQTGVFEYVAIWSAKRARGSPLRVMILLV
LVTALASALLDNVTTVLLIAPVTLLVCDRLAITAAPFLMAEVFASNVGGA
ATLVGDPPNIIIASRGGLSFNDFLVHLAPIVVIVVGVLIALLPRLFPGAF
TVDPERVADVMSLEEKEAIRDPRLLVTCGVVLLAVFAAFIAHGPLHLEPS
LVALLGAGILVVASRLQPADYLSGVEWDTLLFFAGLFVMVGALVKTGVVK
HLARLAITATGGNTLTATMVILVASVVISGIVDNVPYAATMAPVVADLVP
ALGDHANPAVLWWSLALGTDFGGNLTAIGASANIVLLGIARRADNPISFW
EFTRKGVVVTAVSVALSALYLWLRYFVWG
>MAP0484c arsB2, ArsB2
MALTVALVLLAVVLGFAVARPRGWPEAVAAVPAALLLVGVGAIPVAAAEQ
QIADLSGVVAFLGAVLVLAKLCDDEGLFEAAGAAIARGRVGSAGMLRRVF
VIASAITAVLSLDAAVVLLTPVVLAAVRRQRTAVRPYAYATAHLANGASL
LLPVSNLTNLLAFHTAQLSFTRFTLLMAAPWLAAVATLYAVFRGFFAKDL
RVQPDPAALGAPPRPPVFVLVVVALTLAGFAVAQSVGIAPAWVALCGASV
LAVRSLARGHTTVTEIARSVHVSFLVFVLALGVVVQAVMRNGMDRAMSAV
LPSGSGLPALLAIAATAAVLANVVNNLPATLVLLPLVAPAGPVAVLAVLI
GVNIGPNLTYVGSLSNLLWRRVLRQHGVDAGVGEYTRLGVCTVPVSLLVA
VLALWASARLLGG
>MAP0982c arsC, ArsC
MAQGATIYHNPRCSTSRKTLELLRDNGFEPNIVEYLKTPPSRAELVKMIR
DAGIDVRTAVRKRESLYDELNLAEASDEQLLDAMIEHPILIERPFVVTPK
GTRLARPIDAVREIL
>MAP4171 atsA, AtsA
MVAGSGQSAEFSGRVELDIRDSEPDWGPYAAPTAPPNAPNILYLVWDDTG
IATWDCFGGLVEMPAMSRIAERGVRLSQFHTTALCSPTRAALLTGRNATT
VGMATIEEFTEGFPNANGRIPFDTALLSEALAEHGYNTYCVGKWHLTPLE
ESNMASTKRHWPTSRGFERFYGFLGGETDQWYPDLVYDNHPVSPPATPED
GYHLSKDLADKTIEFIRDAKVIAPEKPWFSYVCPGAGHAPHHVFKEWADR
YAGRFDMGYERYREVVLERQKAMGIVPSDTELSPVNPYLDVTGPRGEPWP
LQDTVRPWDSLNDEEKKLFARMAEVFAGFLSYTDAQIGRILDYLEESGQL
DDTIIVVISDNGASGEGGPNGSVNEGKFFNGYIDTVEESMKLFDQLGGPQ
TYNHYPIGWAMAFNTPYKLYKRYASHEGGIADTAIISWPNGIAAHGEIRD
NYVNVCDITPTVYDLLGMSPPETVKGIAQKPLDGVSFKAALDDPNADTGK
TTQFYTMLGTRGIWHEGWFANTVHAATPAGWSHFDADRWELFHIEADRSQ
CHDLAAENPDKLEELKALWFAEAARYNGLPLSDLNILETMTRSRPYLVGE
RDSYVYYPDCADVGIGAAAEIRGRSFSVLAEATVDTTGAEGVLFKQGGAH
GGHVLFIQDGRLHYVYNFLGERQQEVSSSVPVPLGRHLFGASYARTGTVP
DSHTPLGDLTLFIDDEVVGTLAGVSTHPGTFGLAGAGITVGRNGGSGVSS
RFKAPFVFTGGTIARVTLDLSGRPYRDVETEIALAFSRD
>MAP3791c atsG, AtsG
MTQLTPGDAGDRNDAGRKDNVLIVHWHDLGRYLGAYGHRDVSSPRLDRLA
AEGILFTRAHATAPLCSPSRGSLFTGRYPQTNGLIGLAHHGWEYRSGIRT
LPQILSEAGWYSALFGMQHETSYPKRLGFDEFDVSNSYCDYVAQRADEWL
RQSAEGVVGQPFLLTAGFFETHRPYPEDRYPPADSAEVQPPDYLPDTPEV
RGDLAAFYGAISTADAAVGRLLDTLADTGLDASTWVVFFTDHGPAFPRAK
STLYDAGTGIGMIVRPPTGRGLPPRVYDELFSAVDLVPTLLGLLGISVPP
DVDGVSHASALLRPDPDAAPVREHVYTMKTYHDSFDPIRAIRTKDYSYIE
NYASRPLLELPWDIEESPSGMAVAPLVTAPRPERELYDLRADPTEITNLL
AGDDADADEVAANLAVLLHDWRQRTGDVIPSEFAGTRIAARYTETYLQIH
HGARPTARSAIAADRGIEEEGNPAQR
>MAP1595 bfrA, BfrA
MQGDPEVLRLLNEQLTSELTAINQYFLHSKMQDNWGFTELAEHTRAESFD
EMRHAEAITDRILLLDGLPNYQRLFSLRIGQTLREQFEADLAIEYEVMDR
LKPAIILCREKQDSTTATLFEQIVADEEKHIDYLETQLELMDKLGVELYS
AQCVSRPPS
>MAP3236 catB, CatB
MATDHTSGAPDPKQRDLESARFRRDTGYLTTQQGVRVDHTDDALTVGERG
PTLLEDFHAREKITHFDHERIPERVVHARGAGAYGYFEPYDDRLAQYTAA
KFLTSPGTRTPVFVRFSTVAGSRGSADTVRDVRGFATKFYTEQGNYDLVG
NNFPVFFIQDGIKFPDFVHAVKPEPHNEIPQAQSAHDTLWDFVSLQPETL
HAIMWLMSDRALPRSYRMMQGFGVHTFRLVNARGEGTFVKFHWKPRLGVH
SLIWDECQKIAGKDPDYNRRDLWEAIESRQYPEWELGVQLVAEDDEFSFD
FDLLDATKIIPEEQVPVLPVGKMVLNRNPDNFFAETEQVAFHTANVVPGI
DFTNDPLLQFRNFSYLDTQLIRLGGPNFAQLPVNRPVAQVRTNQHDGYGQ
HTIPQGRSSYFKNSIGGGCPALADEDVFRHYTQRVDGQTMRKRAEAFQNH
YGQARMFFKSMSPVEAEHIVAAFAFELGKVEMPEIRSAVVAQLARVDDQL
AAQVAAKLGLPEPPEEQVDESAPVSPALSQVTDGGDTIASRRIAVLAADG
VDVVGTQRFTELMEQRGAVVEVLAPVAGGTLAGGSGGELRVDRSFTTMAS
VLYDAVVVACGPRSVSTLSDDGYAVHFVTEAYKHLKPIGAYGAGVDLLRK
AGIDNRLAEDTDVLNDQAVVTTKAAADELPERFAEEFAAALAQHRCWQRR
TDAVPA
>MAP1301 chaA, ChaA
MSKWLSRNVLSWTVVVPVLAVVVLALIWGERLGPVLVALAALFLIGAVVA
AVHHAEVVAHRIGEPFGSLVLAAAVTIIEVALIVELMASGGNETATLARD
TAFAALMITTNGIAGLSLLLGSRRYGVTLFNAEGSGAALATLTTLATLSL
VLPAFTTTQVGKEFSPGQLTFAAVASLLVYLLFVFTQTVRHRDFFLPIAQ
KGQKSLFEDESHADPPSTREALVSLVLLLCALVAVVGLAEEESPAVERAV
TAVGFPQTFVGVIIAALVLLPETLAAVRAARQGRIQISLNLAYGSAMASI
GLTIPAIALASIWLKGPLVLGLGAIQLVLLALTVVISVLTVVPGRATRLQ
GEVHLVLLAAYVFLAVSP
>MAP1810 cobG, CobG
MARTRDADACPGALQVHQAADGALARVRLPGGMLTPAQLTVLCDIADRLG
SPTLELTARGNVQLRGLTDVTAAAGALAAAGLLPSATHERVRNIVASPLS
GRSGGNLDVRRWVGELDAAIRAQPRLSELGGRFWFSLDDGRADVSGLRAD
VGVHVLPDGCAVLLAGRDTAVRLPPDQVVATLVGVATRFVQVRGSAWRVQ
ELDDPNQLLPGAECGPIAYPAVTKPPVGWITQDDGRVTLGAAVPLGLLSA
RVAEYLAALQAPLVITPWRSVLVGDLREEVADAALRVLAPLGLVFDENSP
WLSVSACTGSPGCARSTADVRADAALAVREGTAPGHRHFVGCERACGSPL
AGEVLLATGEGYRQLR
>MAP2542 corA, CorA
MFQGFDALPEVLRPIAHQPHPQPAPEAPPARATLVDCAVYDDGNRLPGVF
GYADALDKVREIESQGREGFVWVGLREPNQTEMQEVADVFGLHALAVEDA
VCAHQRPKVERYDDTLFLVLKTVNYVPHESVVLAREIVETGEIMVFVGRD
FVVTVRHGEHGGLSEVRKRMDGDPEQMRLGPFAVMHAIADHVVDHYLEVS
SLMLADIDSIEGLAFAPGSKIDVEPIYLLKREVVELRRCVNPLSSAFHRI
QTENKDVISKEVRRYLRDVADHHSEAADQIASYDDMLNSLIQAALARVGM
QQNNDMRKMAAWAGILAVPTMIAAIYGMNFHFMPELNWTWGYPAVMAGMA
VVCLVLYFQFRNRNWL
>MAP4284 ctpA, CtpA
MSTPPRHVDEGTFPDHTASTARIELEITGMTCASCAARIEKKLNKLDGVT
ATVNYATEKAAVSAPASYDPQTLITEIENAGYAAAVAKPSPPRDDPELAS
LRRRLVTATALAGPVIAVAMIPALQFQHWHWAALALTAPVVGWCGRPFHA
AAWANLKHGVATMDTLISIGTLAAFLWSLYALVLGAADRPGMRHDFELTV
GHGAHVSHVAAPCHVYFEVAAGVTLFVLAGRYFERRSKRTAGAALRALLA
LGAKDVAVLRAGAETRIPIERLAVGDEFVVRPGERIATDGIVVAGSSAVD
AAMLTGESVPVEVGVGDGVTGGTVNAGGRLVVRATGIGDDTQLARMAQLV
ERAQSGKADAQRLADRVSGVFVPVVLLLAVATLAGWLTAGGTLATALTAA
VAVLIIACPCALGLATPTALLVGTGRAAQLGVLIKGPEVLETTRAVDTVV
LDKTGTVTTGAMTVLDVVAADGTDRATLLRYAGALEAASEHPIAHAIARD
AKAELGPLPTPTGFRAVGGGGVHGRVDGHAVAVGRPRWLAERGLRPDAAL
AAAAARAEHDGKTVVAVGWDGRARGILALADTVKPCSAAAVRQFTRLGLT
PILLTGDNHTVARRIAGELGIGEVISGALPADKVEAVKRLQSAGRVVAMG
TGTDVAIEAADVTVVRGDLRAAVDAIRLSRRTLATIKTNLVWAFGYNLAA
IPLAALGMLNPMLAGAAMALSSVLVVGNSLRLRSFASIIPGA
>MAP3384 ctpC, CtpC
MNLASVRAIGDEGLTKDPALQVMSDAAGRMRVSVGWVRADSRRAVAVEEA
VAKCDGVRVVHAYPRTGSVVIWYSPRRCDRSAVLAAIGEAAHVTAELIPA
RAPHSSEIRNADVLRMVIGGAALALLGVRRYVFARPPLLGPSGRLFATGV
TVFTGYPFLRGALRSLRSGRAGTDALVSAATVASLVLRENVVALTVLWLL
NIGEYLQDLTLRRTRRAISELLRGSQDTAWIRLEHNEIQVATDTLQIGDE
VVVHDHVAIPVDGEVIDGEAIVDQSAITGENLPVSVVVGMPVHAGSVVVR
GRLVVRARAVGNQTTIGRIITRVEEAQHDRAPIQTVGENFSRRFVPTSFI
VSAITLAVTGDVRRAMTMLLIACPCAVGLATPTAISAAIGNGARRGILIK
GGSHLEQAGQVDAIVFDKTGTLTVGRPVVTNIVAMHKDWEPEQVLAYAAS
SEIHSRHPLAEAVIRSTEERHITIPPHEECEVLVGLGMRTWADGRTLLLG
SPSLLQAEKVKVSKKAKEWVDKLRRQAETPLLLAVDGTLVGLISLRDEVR
PEAAGVLKKLRANGIRRIVMLTGDHPDIAAVVADELGIDEWRAEVMPEDK
LAAVRDLQEEGFVVGMVGDGINDAPALAAADIGIAMGLAGTDVAVETADV
ALSNDDLHRLLDVRDLGSRAVDVIRENYGMSIAVNAAGLIIGAGGALSPV
LAAILHNASSVAVVANSSRLIRYRLN
>MAP0843 ctpE, CtpE
MNTGLTDAEVAQRVAHGQRNAVRQRATRSIADIVRANVFTRINAILGVLL
LIVLATGSVINGMFGLLIIANSVVGMVQEIRAKQTLDKLAIVGQAKPMVR
RQSGTRALPPDDVVLDDIIELGPGDQVVVDGEIVEEANLEVDESLLTGEA
DPIAKAVGDSVMSGSFVVAGSGAYRATRVGSQAYAARLAEEASKFTLVKS
ELRNGINRILQFITYLLVPAGLLTIYTQLFTTHAGWQKSVLRTVGALVPM
VPEGLVLLTSVAFAVGVVRLGQRRCLVQELPAIEGLARVDVVCADKTGTL
TESGMRVARVDELDGSGHDRIADVLAALAAADPRPNASMRAIAQTYSRPP
GWTVTATAPFKSATKWSGVSFAGHGDWVMGAPDVLLDSGSAAAGQAERLG
AQGLRVLLLGAADRAVDHPDAPGPITPVALVVLEQKVRPDARETLDYFAD
QGVSVKVLSGDNAVSVGAVAGELGLHGETLDARQLPSDLAQLADMLDTYT
TFGRVRPDQKRAIVHALQSHGHTVAMTGDGVNDVLALKDADIGVAMGAGS
PASRAVAQIVLLDNRFATLPYVVGEGRRVIGNIERVANLFLTKTVYSVLL
ALLVGFECLFAKALKADPLLYPFQPIHVTVAAWFTIGIPSFILSLAPNNE
RAHPGFVRRVLSSALPSGLIVGAATFASYLVAYHGRHATFQQQDQASTAA
LITLLVTALWVLAVVARPYQWWRVALVIASGLAYVVIFSLPLARKAFLLD
PSNVVVTLSALGIGVLGAAAIEVAWWIRAKMLGVRPRVWR
>MAP3498c ctpI, CtpI
MKIPGVSSVVAGVAGGAAQVVRAGVSTAAGAAGALQTLASPVAELAGPVI
QSMAQTTGRAIGLDGSADGAPAIVPPVRWHSGRRVHLDLDPLLPFPRWHE
YAPAVEEPVRRIPGVAKAHVEGSLGRLVVELADDADDAAVLDEVRSTVAS
VAADISWSKAEAAPPSAPFADPGNPLAILVPLTAAALDLVAMGAAVTGWV
TRLPAAPQTTRAAAALINHQPRMVSILEARLGRVGTDIALAATTAAAHGL
TQSFGTPLLDLTQRTLQISEAAAHRRVWRDREPQLASPDRPQAPVVPVIS
SAKSEVPRHSWAAAAAGEASHVVVGGTIDAAMDKAKGSMAGPVESYVDSA
ANGSLIAAVSALVAGGGTEDVAAAIEAGVPRAAHMGRQAFAAVLGRGLAN
SGQLVLDPGALRRLDRVKVVVIDGAALRGDHRAVLRVRGEAPGWDDDRVY
EVADALLHGEEAPEPDPDELPATGARLRWVPSQGPSAMPAQGLESADLIV
DGDRVGRVDVGWEVDPYAIPLFQTAHRTGARVVLRHVAGTEDLTASVGAT
HPPGTPLLNVVRELRADRGPVLLITALHRDFASTDTLAALAIADVGVALD
DPRAATAWTADIITGTDLADAVRILSAIPVARSASESAVHLAQGGTTLAG
LLLVTGEQEKGASPVSFRRWLNPVNAAAATALVAGTFSATRVLRLPDPTP
QPLTAWHALDPEIVYSRLAGGARPLAVETEPSWRRRLDDLSYSPALAPLR
APLQNVLRLASATRTELADPLTPILAVGAAASAIVGSNIDALLVAGVMTV
NAITGGAQRLRAESAAAELFAEQDQMVRRVVVPAVATTRRRLEAARHATR
TATVSAKSLRPGDVIDLAAPEVVPADARLLEAEDLEVDESLLTGESLPVD
KQVEPVAVNDPDRASMLFEGSTIVAGHARAIVVATGVGTAAHRAISAVAD
VETAAGVQARLRELTSKVLPLTLAGGAAVSTLALLRRASLRQAVADGVAI
AVAAVPEGLPLVATLSQLSAAQRLTARGALVRAPRTIEALGRVDTVCFDK
TGTLTENRLRVVCAVPDDVNPHDPFPELTAPQSAELVRAAARASARPQEG
QGHAHATDEAILTAASSLNGQRDSDWSMIAEVPFESSRGFAAAIGTVGNA
NGNPSDTPVLILKGAPEVILPRCRFADPEADQQRAEAVVRGLAEQGLRVL
AVAQRGWKHDTDDDDTDADAVDAAAHNLELLGYVGLADTARASARPLIEA
LLDAERDVVLITGDHPITARAIARQLGLPADARVVTGAELAGLDEDACAK
LVADVQVFARVSPEQKVQIVAALQRCGRVTAMVGDGANDAAAIRMADVGI
GVSGRGSSAARGAADIVLTDQDLSVLLDALVEGRSMWAGVRDAVTILVGG
NVGEVLFTIIGTAFGAGRAPVGTRQLLLVNLLTDMFPALAVAVTSQYVEP
DEAEYPSAADAEAARREHRRAVLTGPTPSLDAPLMRQIVTRGAVTAAGAT
AAWAIGRWTPGTERRTATMGLTALVTTQLAQTLLTRRHSPLVVATALGSA
GVLVGIVQTPVLSQFFGCTPLGPVAWTGVLGSTAGATAISALAPNWLAKQ
VAALEPGQQDA
>MAP2210c cysA, CysA
MIDAGTDRGDIAITVRDAYKRYGDFVALDHVDFVVPTGSLTALLGPSGSG
KSTLLRTIAGLDQPDTGTVTIYGRDVTRVPPQRRGIGFVFQHYAAFKHLT
VRDNVAYGLKVRKRPKAEIKAKVDNLLEVVGLSGFQGRYPNQLAGGQRQR
MALARALAVDPQVLLLDEPFGALDAKVREDLRAWLRRLHDEVHVTTVLVT
HDQAEALDVADRIAVLNQGRIEQIGSPTEVYDAPTNAFVMSFLGAVSTLN
GTLVRPHDIRVGRTPEMAVAAEDGTAESTGVARAIVDRVVKLGFEVRVEL
TSAATGGPFTAQITRGDAEALALREGDTVYVRATRVPPITAGATTVPALS
RDGADEATLTSA
>MAP0645c cysA3, CysA3
MARSDVRVSTDWAESNLDTPGVVFVEVDEDTSAYHAGHIPGAIKLDWRSD
LQDPVKRDFVDAQQFSKLLSERGIANDDTVILYGGNNNWFAAYAYWYFKL
YGHEKVKLLDGGRKKWELDGRTLSSDPVSRPATSYTAAAPDNSIRAFRDE
VIAAINVKNLVDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSK
AANEDGTFKSDEELAALYAAAGLDTGKETIAYCRIGERSSHTWFVLYELL
GHRNVKNYNGSWTEYGSLVGAPIELGS
>MAP2484c cysN, CysN
MAAPTTLLRLATAGSVDDGKSTLIGRLLYDSKAVMEDQWAAVEQTSKDRG
HDYTDLALVTDGLRAEREQGITIDVAYRYFATPKRKFIIADTPGHIQYTR
NMVTGASTAQLVIVLVDARHGLLEQSRRHAFLASLLGIQHIVLAVNKMDL
IGWDREKFESIRDEFHAFAARLDVHDVATIPISALHGDNVVTKSDQTPWY
EGPALLSHLEEVYIAGDRNLVDVRFPVQYVIRPHTHEHQDHRSYAGTVAS
GVMRPGDEVVVLPVGKRTRITAIEGPNGPVQEAFPPMAVSLTLADEIDIS
RGDLIARTHNQPRIAQDFDATLCWMADNTTLEPGRDYVIKHTTRTTHARV
TGLDYRLDVNTLHRDKTATALKLNELGRISLRTQVPLLLDEYTRNPSTGS
FILIDPHTNGTVAAGMVLRDASAQAASPNTVRHKSSAIAAARPRGKTVWF
TGLSGSGKSSVAMLVEQKLLEKGAQAYVLDGDNLRHGLNADLGFSMADRA
ENLRRLAHVAALLADCGNVVLVPAISPLAEQRELARKVHADAGFDFIEVF
CDTPIEECEKRDPKGLYAKARAGEITQFTGIDSPYQPPANPDLRLIPDGT
VEEQAQRVIDLLESRG
>MAP2058c cysQ, CysQ_2
MNDHELAARLATEAGRLLLGVRDEFADAPASERKAAGDKRSHDFLIEALA
AERPGDAVLSEEGADDPVRLRSERVWIVDPLDGTREFSELGRDDWAVHVA
LWEAGELVAGAVALPAQGVTFATPEVASPPVAPGKPRIVVSRTRPPAIAL
NVRDALDGVLVEMGSAGAKVASVVQGLSDVYVHAGGQFEWDSAAPVAVAR
AAGLHTSRIDGSTLAYNQPDPKLPDLVVCRPELADAVLAVTR
>MAP1877c cysQ, CysQ_1
MTPRNPSDEMTDAALATDLAAEAGELLLKVRDEVGFGYPWALGDAGDSLA
NALILGRLQAERPDDAVLSEEAYDDLSRLQHDRVWIIDPLDGTREFSTPG
RDDWAVHVALWQRPTNGRREITDAAVLPARGNIVYRSDTVTASAARVGVT
DTIRIAVSATRPPAVLHRMRQRLPIQPVAIGSAGAKAMAIIDGVVDAYLH
AGGQWEWDSAAPAGVVMAAGMHASRLDGSPMRYNQLDPYLPDFVMCRAEL
APVLLGAIRDAWR
>MAP2211c cysW, CysW
MTSSPGVRYGLRFVALAYIFVLLVIPVSLILWRTFRPGFGQFYAWVSTPA
AISALNLTLLVVAIVVPLNVFFGIPTALVLARNRFRGKGVLQAIIDLPFA
VSPVIVGVALIVLWGSAGALGFVEKDLGFKIIFGLPGIVLASIFVTLPFV
VREVEPVLHELGTDQEEAAATLGSGWWQTFWRITLPSIRWGLTYGIVLTI
ARTLGEYGAVLMVSSNLPGKSQTLTLLVSDRYNRGAEYGAYALSTLLMGV
AVLVLVFQVVLDARRGRAAGQA
>MAP0410 dppB, DppB
MGWYIARRIAVMVPVFLGATLLIYAMVFLLPGDPVAAIAGDRPLTPAVAA
ALRARYHLDDPFLVQYLRYLGGVLRGDLGRAYSGLPVSDVLAHAFPVTLR
LSLIALAVEAVLGIGFGVIAGLRQGGLFDSAVLITGLVIIAVPIFVLGFL
AQFVFGVRLGIAPVTVGNAATFTRLLLPGIVLGSVSFAYVVRLTRSAVAA
NAHADYVRTATAKGLSQPRVVTVHILRNSLIPVVTFLGADLGALMGGAIV
TEGIFNIHGVGGVLYQAVTRQEAPTVVSIVTVLVLIYLVTNLVVDLLYAV
LDPRIRYG
>MAP0411 dppC, DppC
MAERIRARGGFWRETWRRLRRRPKFIGAGLLILVILAVALFPALFTAADP
TYADPAQSMLPPSRTHWFGTDLQGHDVYARTVYGARASVTVGLGAAAIVF
VVGGALGALAGFYGGWIDAVVSRVTDVFFGLPLLLVAIVLMQVLHHRTVW
TVIAILALFGWPQVARIARGAVLAVRASDYVLAAKALGMSRFQILIRHAL
PNALGPVIAVATIALGLFIVTEATLSYLGVGLPPSVVSWGGDINLAQTRL
RAGSPILFYPAGALAATVLAFMMMGDALRDALDPASRAWRA
>MAP1868c efpA, EfpA_1
MTMSVAPTTRLWSRQFVAVIVAIGGMQLMVAMDGPVAVFALPKIQNEMGL
SDAARSWVITAYMLTFGGLMLLGGRLGDTIGRKRAFLVGVALFTFASGLC
GIAWAGGTLIAARLLHGAAAAIIAPTNLALIATTFPRGSARNAATAVFGA
MTGLGGVLGLVVGGALTDVSWRLAFLVNVPIGLAVIYLVLITRQETQTER
IKLDVTGAVLATVTGTAAVFGISMGPEAGWRSPITIGSGVVALAAFVAFV
VVERTAENPIVPFNLFLDRNRLAAFAAMFLAGGVFFTLTVLVGLYVQTMM
GYSPLRAGVAFIPFGLAMAIGVGVASKLVTWFPPRVVVIASGGLILGATL
YGSTFNRGMPYFPNLVVPLIVCAIGIGAVFVTLTLSVIASVDVDRIGPTS
AIAVMLQTLGGPLVLVVVQVAITSHALRLGGTLGPVKSMNAAQLHALDRG
YTYGLLWLAGVVALLGGVALLIGYTAAQVARAQEVKKAVDAGEL
>MAP2915c efpA, EfpA_2
MTALNDSERAVQNWTSARPDRPAPVRSTPPAETAPKPAAETAVKRTSKYY
PAWLPSRRFIAAVIAIGGMQLLATMDSTVAIVALPRIQNELSLSDAGRSW
VITAYVLTFGGLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEA
TMVIARLSQGVGSAIASPTGLALVATTFPKGPARNFATAVFAAMTAVGSV
MGLVVGGALTEVSWRLAFLVNVPIGLVMMYLARTALRETNRERMKLDATG
AVLATLACTAAVFAFSMGPEKGWISLTTISSGVVALGAALAFVIVERTAE
NPVVPFDLFRDRNRLVTFIAIFLAGGVMFTLTVCIGLYVQDILGYSALRA
GVGFIPFVIAMGIGLGVSSQLVARFSPRVLTIAGGYLLVLAMLYGWWCMH
RGVPYFPNLVLPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAVTLML
QSLGGPLVLAVIQAVITSRTLYLGGTTGPVKTMNDAQLQALDHGYTYGLL
WVAGAAVIVGAAALFIGYTPEQVAHAQEVKEAMDAGEL
>MAP3127 emrE, EmrE
MPGTGHNLRFLSCPVHTYLYMRARGGVLTYLFLICAILAEVVATSLLKST
QGFTRLWPTVICLLGYAVSFALLAVSISRGMQTDVAYALWSAIGTALIVL
IAVLFLGSPISVTKVVGVGLIIAGVVTLNLTGAH
>MAP3092 fecB, FecB
MLFGVIRPARLAAVTAALMVACGGCGSDRPAATTTRSLVTPTTQIAGAGV
LGNDRRPDESCARDAAEADPGPAKRQVHNAPGADPAPVQVSADPQRIVVL
AGDQLDALCALGLQSRVVGAALPDGASGQPAYLGGAVRGVPGVGSRSHPD
VKAIAAAHPDLILGSQGLTPALYPQLAAIAPTVFTAAPGAAWRDNLRAVG
AATARAGAVDGLLSGFSQRAGDVGARHDASHFQASIVQLTTGSIRVFGAN
NFPASVLGAVGVDRPAAQRFTDKPYLEIGATDADLAKNPDLSVADADVVY
LSCATPAAADRAATVLDSGPWRKLSANRDNRVYVVNDEIWQTGQGLIAAR
GIVDDLRLVNAPIN
>MAP3710c fecB2, FecB2
MRQGWNRRGFLQLAGAAGVAATAGAAGLSAGCSAHQPPPGGAGPGSVTVT
HLFGQTVVKEPPKRVVSAGYTEQDDLLAVGVVPIAVTNWFGDQPFAVWPW
AADKLGGAQPTVLNLDNGIPVDQIAGLKPDLIVAINAGLDADTYQKLSAI
APTVAQSDGDAFFEPWKEQAAAVGQAVFAAEKMKSLVAGVDQKFTDIGKK
NPQWTGKKALLMHGALWQGTVVATMAGWRTDFLNQMGLVIADSIKPFGTD
QRAVIPRDHIKSVLESADVVIWTTQNPDDQKALLADPEVAGSLTTAQNRH
IFTTKDQAGAIAFASPLSYPVIADQLPPQLTKILG
>MAP1669c furA, FurA
MSSTADYADRLRMADLRVTRPRVAVLEVVDANPHADTETIFSAVRMALPD
VSRQAVYDVLNALTAVGLVRRIQPLGMVARYESRVGDNHHHVVCRSCGTI
ADVDCAVGEAPCLTPSDDDNVLDGFVLDEAEVIYWGLCAECSTAGS
>MAP2139 furB, FurB
MTPSGDGAGVSVRSTRQRAAISTLLETLDDFRSAQELHDELRRRGENIGL
TTVYRTLQSMAAAGLIDTLRTDTGESVYRRCSEHHHHHLVCRSCGSTIEV
GDHEVEEWATAVAAKHGFSDVSHTIEIFGTCSECR
>MAP1668c katG, KatG
MSSDTSASRPPQPDTRTASKSESENPAIPSPHPKSNAPLTNRDWWPNQID
VSRLHPHVAEANPLGEDFDYAEEFAKLDVEALKADVISVMTTSQDWWPAD
YGHYGGLFIRMSWHAAGTYRIHDGRGGGGQGMQRFAPLNSWPDNVSLDKA
RRLLWPVKKKYGNKISWADLIIFAGNCALESMGFKTFGFAFGREDVWEPE
EILWGEEDEWLGTDKRYPGTGERELAQPYGATTMGLIYVNPEGPEGKPDP
IAAAIDIRETFGRMAMNDEETAALIVGGHSFGKTHGAGDADLVGPEPEAA
PIEQQGLGWKSSHGTGVGKDAITSGLEVVWTPTPTKWDNTFLETLYGYEW
ELTKSPAGAWQFTAKDGAGAGTIPDPFGGPGRAPTMLVTDISLRESPIYR
DITRRWLDHPEELADAFAKAWYKLLHRDMGLVSRFLGPWVPEPQLWQDPV
PPVDHPLVDDNDVAALKDKVLASGLSVPQLVKTAWSAAGSYRNTDKRGGA
NGGRLRLQPQRNWEANEPSELDKVLPVLEKIQQDFNASASGGKKISLADL
IVLAGSAAVEKAAKDAGYEISVHFAPGRTDASQESTDVDSFAVLEPRADG
FRNFARPGEKAPLEQLLLERAYLLGVTGPEMTVLVGGLRALGANHGGSKH
GVFTDRPGALTNDFFVNLLDMGTEWKASETAENVYEGHDRATGALKWTAT
ANDLVFGSNSVLRALAEVYAQDDNQGKFVEDFVAAWVKVMNNDRFDLK
>MAP1000c kdpA, KdpA
MSSTTAGLIFLAVLVAALVAVHVPLGDYMFRVYTTDRDLATERTIYRLIG
VDARSEQTWGAYARGVLAFSSVSIIFLFVLQLVQGKLPLHLHDPATKMTP
SLAWNTAVSFVTNTNWQAYSGETTQGHLVQMAGLAVQNFVSAAVGMAVAV
ALVRGFARRRTGELGNFWVDLVRGTLRILLPISIVGAVLLVAGGAIQNFH
LHDQVVTTLGGTAQTIPGGPVASQEVIKELATNGGGFYNANSAHPFENPT
AWTNWLEVFLILVIGFSLPRTFGRMVGNPKQGYAIASVMASLYLLSTGFM
LWFQLQHHGTVPSAVGAAMEGVEQRFGVPDSGVFAAATTLTSTGAVDSAH
DSLTSLGGMITMFNMQLGEVAPGGTGSGLYGMLVLAVITVFVAGLMVGRT
PEYLGKKINPREIKLAASYFLVTPLIVLTGTAIAMALPGERAGMANSGPH
GLSEVLYAFTSAANNNGSAFAGLSANTEWYNTALGLAMAFGRFLPIVLVL
ALAGSLARQGSTPDSAGTLPTHRPQFVGMVAGVTLIVVALTFLPMLALGP
LAEGIH
>MAP0997c kdpC, KdpC
MTLSNFIRLHWAALRALLVLTVITGLAYPLLVWVVAQFPGLRDHAEGSIL
TANGKPVGSRLIGQLFTDKDGNPLPQYFQSRPSAAGTGYDPTSSGGSNLG
PESIVDAPGKPGLLTTVCSRSAAVAKLEGVDGSRPFCTGGGVGAVLSVIG
PRDERGNVTHPARVVSVNEPCESTPTPFVSLYEGVRVECAKTGEDYSLGQ
IVPIRGAAPAAPAVPADAVTAGGSGLDPNISPAYADIQVARVAKVRHVRP
EQIRELVAQNSSGRALGFFGEPCVNVLQLNLQLDHRYPVTS
>MAP3349c kefB, KefB
MQISGTLLLQLGALLATLAVLGAAARRFALSPIPVYLLAGLALGKGGLLP
LATGGQFITTSAPIGVVLLLLTLGCEFSLAEFSSSMRHHLPSAAVDVVLN
AAPGAIAGWLLGLDGVAILCLAGVTYISSSGVVARLLEDLHRLGNRETPA
VLSVLVLEDFAMAAYLPLFAVLASGGGWLHAVVGMVVAVSALVAAFAASY
RWGHHVQRLVEHPDSEQLMLRVLGITLIVAAMAESLHASAAVGAFLVGLT
LTGETATRTRQVLAPLRDLFAAIFFLAIGYSVDPHELIPMLPAALILAAA
TAATKVATGIFAARHDGVARRGQLRAGTALIARGEFSLIIIGLAGSSLPA
VAALATSYVFIMAIAGPVLARYTGPRPAAPAT
>MAP1565 modA, ModA
MRRIGILTGLLSVVLIAGMTGCGSKSQPPPTAGKLMVFAAASLRPAFTQI
AERFKAQNPGTGIEFEFAGSSELATQLTQGATADVFASADTAQMDVVAKA
GLLAADPTNFASNTLVIVTAPGNPKRIGSFADLTRPGLTVVTCQRPVPCG
AAAHRVEDSTGVHLNPVSEEPSVTDALTKVTSGQADAALVYVTDARTAGS
KVATVNFPEAAGAVNVYPIGVLKQAPLATQARNFVDLVTSPPGQQILAQA
GFAKP
>MAP1567 modB, ModB
MPRWVYLPAAAGTMFVVLPLLAIAVKVDWPHFWWLITSPSSRTALLLSLR
TAAASTALCVALGVPMALVLARGGTRLVRLLRPLILVPLVLPPVVGGIAL
LYAFGRLGLLGHYLEAAGISVAFSTTAVVLAQTFVSLPFLVISLEGAART
AGADFEVVAATLGARPTTVWWRVTLPLLLPGVVSGAVLAFARSLGEFGAT
LTFAGSRQGVTRTLPLEIYLQRVTDADAAVALSILLVVVAAVVVLGLGAR
RLTGTDAR
>MAP1568 modC, ModC
MSELQLRAVVSQRRFEVEFSVAAGEVLAVLGPNGAGKSTALHVIAGLLRP
DRGLVRLGDRVLTDTAAGIDVPTHDRRVGLLLQDALLFPHMSVAANVAFG
PHSRRPMWRRGRRAEKATALCWLREVDAETLADRKPRQLSGGQAQRVAIA
RALAAEPDVLLLDEPLAGLDVAAAAAIRSVLRRVVTRIGCAAMLVTHDLL
DVFTLADRVLVLESGRIAEIGPVADVLTAPRSHFAARVAGVNLVNGTAEG
DGALLARSGARWYAAPAAPAGPLASGQRAVAVFPPTAVAVYREQPHGSPR
NTVEVTVAEMDVRGAAVLVRGAQQPDGAPGLAAEITVDAASELRLTPGDR
VWFSVKAHEVVLYPATAAAER
>MAP3306c moeZ, MoeZ
MSTPLPPLVAPADQLTADEMARYSRQLIIPGLGVDGQKRLKNARVLVIGA
GGLGAPTLLYLAAAGVGTIGIVEFDAVEESNLQRQIIHGVADVGRSKAAS
ARDSIAAINPLVDVRLHEFRLDASNAVELFGHYDLIVDGTDNFATRYLIN
DAAVLAGKPYVWGSIYRFEGQVSVCWEDAPDGRGLNYRDLYPEPPPPGAV
PSCAEGGVLGVVCASIASVMSTEAIKLITGIGESLLGRLMIYDALEMSYR
TIAIRRDPCDASRPAITTLVDYEQLCGAAPAASTDAATGGAEAAITPRQL
RELLDSGAKLALIDVREPVEFDIVHLDGAQLIPQSSINSGEGLAKLPADR
MPVLYCKTGVRSAQALAVVRQAGFSDAVHLQGGIVAWAQQMQPDMILY
>MAP1790 morD, MorD
MTERSEYSDGRAMALERSCAVTAVALSEQRREGVRLVTGERRGFGLDAAL
TFVHLPYPAPSDWTRRTLTCGVALQCSPSKERVTEFRLNELSARELRALT
LVEGAVALGWIASRWPGLLPEVQRLLPDVHPQAADMDGAQMLDRAIGLAA
TGLELTVPPLLGALPLAYTAPQGLTDRLRRSFGRMPWTTTQKRRPRPYSV
PVGGDGGVRNPNLPPPSRPQDNDLDVTPQHRPGIPYPEWNMWTQRFMHDH
VAVVEHADGRRLRRPVPVAVDVRKWFEEHTHRAMTSRLEDGSDLDVDQYV
SHYIDLTTGEAKEPRVFRDLLPSGRDVTTALLLDGSSSLGVHGGRVFQLE
LACADALSRAMTLARERHGVFVFTGNTRHRVEVRCLKDFEDRRFVPPSTL
GLSTRGYTRLGAPLRHLTSRLLAQPAERRLLIVIGDGLISDEGYEGRYAW
ADAAHAVEEANDAGVSMYYVGVGPTRVDPLPEVFGPRRSQRIRRIEELPR
VLAHVHRELVAA
>MAP1626c nanT, NanT
MTKPSPGRKLTADQRNSFIAALLGWTMDAFDYFIVVLVYADIAKTFHHSK
AEVAFVTTATLIMRPVGALLFGLWADRVGRRLPLMVDVMFYSVVGFLCAF
APNFTVLVILRLLYGIGMGGEWGLGAALAMEKVPVERRGFFSGLLQEGYA
FGYLLASVASLVVMDWLELSWRWLFGLSIVPALISLIIRYRVEESEVWEA
AQDQLRLTSTRIRDVLRNGAIIRRFVYLVLLMTAFNWMSHGTQDVYPTFL
GAHANHGAGLSSTTVKWIVVVYNVGAIIGGLVFGTLSQRFSRRYTVVFCA
MLALPIVPLFAYSRTAAMLGLGSFLMQLFVQGAWGVIPAHLTEMSPDAIR
GLYPGVTYQLGNLLAAFNLPIQERLAETHGYPFALAATIVPVLLTVAVLT
LIGKDATGIRFATSESAFLPTEMT
>MAP2102c narK3, NarK3_1
MARTRRIAHWDPEDLVAWEAGNKLIARRNLIWSIATMHVAFSIWYLWSVM
VLFMPQARYGFTTGDKLLVGATAALVGALVRIPYAMGTARLGGRNWAVLS
SLVLLIPTLAAIVLLAHPGLPLWPYLVCAALTGLGGGNYAAALANVESFF
PQRRKGFALGLTGGVGNLGAAGIQAVGLVVLATAGNQAPYWVCAVYLVLL
ALVGVGAALFMDNLDHSVEVGHVRSVLTVPDTWTISFLYMCASGSFIGFA
FAFGQVIAHNLIAGGQTHGQAALHAAEIAFAGPLLGSMARVVGGKLGDRF
DGGRVTLTVLAAMIVAGGFLVAVSTHDDLTRPSGAPVSLYTTAGYIAGFI
ALFICCGVGKGSVFKLIPSVFAQRSRALELGDTERRHWERARSAALIGFA
GSFGALGGVVINLALRQSYASTGSSTPAFGAFMLCYVAAAALTWARYVRP
RRARAAHRAGLAADGRAAGQLVRASDRFSGCSISA
>MAP3707c narK3, NarK3_2
MGRDHRITDWNPEDAAAWEAGNKRIARRNLLCTIAGDHVAFSIWTLWPVM
ALFMPAAVYGFSAGDKLLLGAVATLVGGCARIPYTLGIAAFGGRKWTTFS
AVVLLIPTVGTIVLLANPGLPLWLFVLCAALTGLGGGNYAASLANVNAFY
PQRLKGAAMAVNAGVANLGVAVIQLVGLLVLATAGHQAPYWVCAVYLVFL
AAVAIAAAMFMDDINHGTQLSTMRSILFERDTHVISLLYIATFGSWIGFS
FAFGQVMQVNFLENGESAKHAALHAAQLAFIGPLLGSVARIYGGRLADRV
GGSRVTLGVLAAMTLAAGLLVIVSTVDDQHAGAHSVSMIGYVVGFMVLFI
LSGMGNGSVFKLIPSVYEARSRGRDTSEDERRQWARAMSGSLIGICSAVG
AFGGVAINLALHQSYLSTGTETSAYWMFMASYVVAAIMTWLVYVRRPVAA
PGVSLPETQAARL
>MAP3712 narU, NarU
MTTTITPVPQPAAAPERHKGRHWIDDWRPEDPEFWETTGKAIARRNLIFS
IFAEHVGFSVWMLWSIVVVHMTAGPHGHPSASGWALTASQALCLVAVPSG
VGAFLRLPYTFAVPVFGGRNWTTISAALLLIPCLLLAWAVSHPGIPFGVL
VAIAATAGFGGGNFASSMANISFFYPEKDKGWALGLNAAGGNIGVAMVQK
VIPPVVIAGGGVALSRAGLFYVPLAVVAAVCAFLFMNNLTEIKADVKPVW
QSLRHADTWIMSLLYIGTFGSFIGYSAAFPTLLKTVFGRGDIALAWAFLG
AAIGSVIRPLGGKLADRVGGARITAASFVMLAVGAAAALWSVKAVNLPAF
FASFMFLFVATGIGNGSTYRMISRIFKIKGELAGGDPDTMVTMRRQAAGA
LGVISAVGAFGGFIVPLAYAWSKSQFGSIEPALRFYVVFFLGLLAVTWYC
YLRKHNAITRVGI
>MAP2924 nicT, NicT
MPALRPARRGTYSRARCPVPDTARAGRTPVTSTEIDRWPARATRFLGALA
PTEWWRLASMLGAILALHLIGWLTLVLLVAPGQYSLGGKAFGVGVGLTAY
TLGLRHAFDADHIAAIDNTTRKLMNDGQRPLAVGFFFSLGHSTVVFALAV
LLACGVRTVVGPVRDDSSALHHYTGLIGTSVSGVFLYAIALLNVVVLVGI
LRVLARVRRGDYDPHTDAAELERQLDNRGLMNRWLGRFTKSITQSWHCYP
VGLLFGLGFDTATEVALLVLAGTSAAAGLPWYAILCLPVLFAAGMCLLDT
IDGSFMNFAYGWAFSNPVRKIYYNIIITALSVAVAWVIGSIELLVLFADE
FGWRGSFWDWLGGLDLNTVGYAVVGMFVLTWAVALLIWRYGRIEERWAGA
DPRAGTGREA
>MAP2208 nirA, NirA_2
MTTARPAKARNEGQWALGNREPLNPNEEMKQAGAPLAVRERIETIYAKNG
FDSIDKSDLRGRFRWWGLYTQREQGYDGSWTGDENIEKLEARYFMMRVRC
DGGAISAAALRTLGQISVDFARDTADITDRENIQYHWIEVENVPEIWRRL
DAVGLRTTEACGDCPRVILGSPLAGESLEEVIDPSWAIAEIARRYIGQPD
FADLPRKYKTAISGLQDVAHEVNDVAFIGVNHPEHGPGLDLWVGGGLSTN
PMLAQRVGAWVPLHEVPEVWAAVTSVFRDYGYRRLRSKARLKFLVKDWGI
EKFREVLETEYLKRPLIDGPAPEPVAHPIDHVGVQRLKNGLNAVGVAPIA
GRVSGTILLAVADLAQQAGCDRIRFTPYQKLVLLDIPDDKLDEVVAGLEA
LGLQSQPSHWRRNLMACSGIEFCKLSFAETRVRAQGLVPELERRLADVNR
QLDVPITINLNGCPNSCARIQVADIGLKGQMVDDGEGGSVEGFQVHLGGS
LGQDSGFGRKLRQHKVTSDELGDYIERVARNFVKYRGEGERFAQWAMRAD
EDDLR
>MAP2035 nirA, NirA_1
MTTARPVKTRNEGQWALGDREPLNDTEKIKLADGPLNVRERIINVYAKQG
FDSIDKSDLRGRFRWMGLYTQREQGYDGSWTGDDNTDKIEAKYFMMRVRS
DGKAMSAHTMRTLGQISTEFARDTADISDRENLQLHWIRIEDVPEIWRRL
ESVGLQTTEACGDCPRGIHGSPLAGDSLDEVLDPSPAIEEIVRRSLNNPE
YANLPRKYKTAVSGLQDVSHETHDVAFVGVEHPEHGPGLDLWVGGGLSTN
PMLAQRLSVWVPLDEVPDVWEAVTQLFRDYGYRRLRAKARLKFLVKDWGI
EKFREILEQEYLNRRLIDGPAPAPVKHTIDHVGVQKIKNGLNAVGVAPIA
GRVSGTTLSAVADLMEQVGSDRARWTPFQKLVILDVPDDKVDELVTGLDA
LGLPSRPSSWRKNTMACTGIEFCKLSFAETRVRTQTLVPELERRLADVDA
QLDAPISVHLNGCPNSCARIQVADIGFKGQWIDNGDGTSVEGFQVHLGGG
LGEQSGFGRKLRQHKVTSEELGDYIDRVTRKYLEGRNDGETFASWALRAD
EEELR
>MAP3703 nirD, NirD
MSGCCSTTVSQAALFRLDDGTVRAVGNVDPFSGAAVLSRGIVGDRNGCPT
VQSPILKQAFSLEDGICLDDPSVSVPVYPVRITADSYVQVGRDYQPRAA
>MAP0869c nramp, Nramp
MFDYPHFLIRLWESRSTLAQRTQGSLKGNWYLLGPAFVAAIAYVDPGNVA
ANVSAGSQYGYLLLWVIVAANVLAGLVQYLSAKLGLVTGRSLPATIGKRM
SRPARLVYWVQAELVAIATDAAEVVGGAIALHILFGLPLLAGGLITGVVA
LLLLGIQDRRGQILFERVITGLLLVIAIGFAASFFVKTAPPEAVLSGLLP
RFRGTESVLLAAAILGATVMPHAVYMHSGLVLDRHGHPDEGPHRRLLLRV
TRWDVVLAMAVAGTVNAAMLLIAATNLQHRDVSASIEGAYAAIHNTLGAT
IAVMFAVGLLASGLASSSVGAYAGAMIMQGLLHRSIPMVVRRLITLCPAL
LILAVGYDTTRALVLSQVVLSFGIPFAVLPLIKLTSDRELMGSDANHRIT
TILGWGVGILISLLNMVLIWLTVTG
>MAP3212 nuoL, NuoL
MTHYTPLLVALPLAGAAILLFGGRRTDRWGHWLGCATAVAAFVVGVGLLD
ELLGRPADQRAIHERVFSWIQVGQLQVDLGLQIDQLSVCFVLLITGVGSL
IHIYSVAYMAEDADRRRFFGYLNLFLASMLLLVIADNYVVLYVGWEGVGL
ASYLLIGFWYHKPTAATAAKKAFVMNRVGDAGLALGMFLMFSTFGTLSYA
GVFAGAPAAGRGALTAMGLLLLLGACAKSAQVPLQAWLGDAMEGPTPVSA
LIHAATMVTAGVYLIVRSNPLYNLSPDAQLAVVIVGAVTLLLGAFIGCAK
DDIKRALAASTMSQIGYMVLAAGLGPTGYAFAIMHLLTHGFFKAGLFLGS
GAVIHAMHEEQDMRRYGGLRAALPVTFVTFGLGYLAIIGVPPFAGFFSKD
AIIEAALAAGGGRGYLLGGAALLGAGVTAFYMTRVMLMTFFGEKRWAPGS
HPHEAPGLMTWPMILLAIGSLFSGGLFAVGGTLQRWLEPVVGRHEEVTHA
VPVWISTALALGVVAIGIAVAYRLYATAPIPRVAPLSVSPLTTAVRNDFY
GDAFNEEVFMRPGAQLTHALVEVDDAGVDGSVNALAALVSATSNRLRGLQ
TGFARNYALSMLTGAVLVIALILAVRLW
>MAP2488 oppB, OppB
MTRFLARRLLNYLVLLALASFLTFCLTSVAFKPLDSLLQRSPRPPQAVID
AKAHSLGLDEPIPIRYAHWASHAVRGDFGKTVTGQPVGASLGRRVGVTLR
LLVIGSLIGTVAGIAAGAWGAIRQYRLSDRVVTMLALLVLSTPTFVIASL
LILAALRVNWALGVQVFDYTGETSPGVTGGAAALVDRLRHLVLPSLTLAL
AAAAGYSRYQRNAMLDVLGQDFIRTARAKGLTRRRALVKHGLRTALIPLA
TLFAYGVAGLVTGAVFVEKIFGWHGMGEWLVQGVATQDTNIVAAITLFSG
AVVLLAGLLSDVFYAALDPRVRVS
>MAP2489 oppC, OppC
MAGFASRRTLVLRRFGRNRLAVASLTLLVLLFVGCYTLPAVLPYSYQDLD
FDALLQPPNARHWLGTNALGQDLLAQILRGMQKSMLIGVCVAVISTGIAA
TVGSIAGYFGGWRDRVLMWLVDLLLVVPSFILIAIVTPRTKNSANILMLV
LLLAGFGWMVSSRMVRGMTMSLREREFIRAARYMGVSSRRIIVGHVVPNV
ASILIIDAALNVASAILAETGLSFLGFGVQPPDVSLGTLIANGTQSATTF
PWVFLFPAGVLVLILVCANLTGDGLRDALDPGSGPARGGRR
>MAP0872 phoS2, PhoS2_2
MKLNRFGAVLSVLSAGALVLSGCGSDNNGAGAGAGGSSSSKVSCGGKKAL
KASGSTAQANAMTRFVNAFEQACPGQTLNYTANGSGAGISEFNGKQTDFG
GSDSPLAPSEYAAAQQRCGSPAWNLPVVFGPIAITYNVAGLNSLNLDGAT
AAKIFNGAITTWNDPGIQALNPGVALPAEPIHVVFRNDESGTTDNFQKYL
DAAADGAWGKGAGKTFKGGVGEGAKGNDGTSAAIKATEGSITYNEWSFAQ
AQKLNMAKIITSAGPDAVAISADSVGKTIAGAKISGQGNDLVLDTLSFYK
PTQAGSYPIVLATYEIVCSKYPDPQVGTAVKAFLQSTVGAGQNGLADNGY
IPIPDAFKSRLSAAINAIT
>MAP4172c phoS2, PhoS2_3
MRTRSAVIAGVLMATTLVVSACGETPASLPYTAGAKVDCGGKQTLSASGS
TAQANAMTRFIAAYRTACPGQTLDYTANGSGNGIGDFLAGRTDFAGSDTP
LSGDQYAAAKRRCGGADAWNLPVVFGPLAITYNLAAVDSLVLDAPTLAKI
FNGTITRWDDPALALLNASMPAEDIRVVYRSDGSGTTDNFQAYLQSAAGG
VWNKGAGKTFNGGVGTGAVGNTGTAAAVKSTEGAISYNELSFALQQGLFA
AEIKTPASRRSLRPVRIGTDIFGKTIKGARIVGTGNDLVLDLSSFYNPAQ
PDVYPIVLATYEIVCSKYPGFDVAKAVKAFLQAAIGPGQVELARTGYIPL
SADFQAKVSGAVDAISSPQAPNPD
>MAP0651 phoS2, PhoS2_1
MRLDRQGGALAAAALTGCGSDENHRGTAAPSISGTTGTAGCGGKNKLTAE
GSTAQENAITMFNQVWGQYCPGKGLAYNPTGSGAGREQFIAGHVDFAGAD
SPLVADQIGPAAQRCGGNPAWDLPLVFGPIAITYNLPGNPALVVSSDAVA
KIFTGKITNWNDPILAALNPGVALPDTKITPIYRTDSSGTTDNVQKYLTA
AAPQSWAKGVGTEFQGGVGEGAAKSAGVIQAVRATTGAVGYVEKGFADQA
GMPYAKIATRGGVVPLTNETAGNAVNAAKFLSEGDDLVLDLHAMYASQEP
GVYPLLLVTYEIVCSKGYDPETLAAVKSFLGVAATSGQNGLSTAGYIPLP
DKVRQRLVTAINALQ
>MAP0654 phoT, PhoT
MAKRLDLKGVNIYYGSFHAVAEVTLSVLPRSVTAFIGPSGCGKTTVLRTL
NRMHEVVPGGRVEGSLLLDDEDIYGPGVDPVGVRRAIGMVFQRPNPFPAM
SIRDNVVAGLKLQGVRNRKVLDETAEYSLRGANLWDEVKDRLDRPGGGLS
GGQQQRLCIARAIAVQPDVLLMDEPCSALDPISTMAIEELISELKQDYTI
VIVTHNMQQAARVSDYTAFFNLEAVGKPGRLIEVDDTEKIFSNPSQKATE
DYISGRFG
>MAP0655c phoY2, PhoY2_2
MRTAYHEQLSELSERLGEMCGLAGVAMERATQALLQADLVLAEQVISDHE
AIAAMSARAEETAFVLLALQAPVAGDLRAIVSAIQMVADIDRMGALALHV
AKIARRRHPQHALPEEVNGYFAEMGSIAVELGNSAQEVVLSRDPEKAARI
REEDDAMDDLHRHLFSVLMDREWKHGVAAAVDVTLLGRFYERFADHAVEV
ARRVIFQATGRLPEEETKPASQ
>MAP0132c phoY2, PhoY2_1
MRSGFHRRLCLLNARLAEMCAMAADAIAQATHALLDADLLTAEGVITRQH
SIAALGLQAEETAFALLALQAPVATDLRAVVSALRIAADAQRMVELAVHV
AEIARRRHPDSAVPAEVRPIIAAMGEAAEALAAGAREVLLSQDPRRAAQI
RRDDDTMDELHRRLLSVLMDPAWTPGVAAAVDATLLGRFYERFADHAVEI
ARRVIFQATGR
>MAP4041c pitA, PitA
MPLPPESGAVNIQLFLLIIVVITALAFDFTNGFHDTGNAMATSIASGALK
PKTAVALSAVLNLVGAFMSTAVAATIAKGLIDSNIVTLELVFAGLVGGVV
WNLLTWLLGIPSSSSHALIGGIVGATIAAVGAHGVIWKGVISKAIIPAIV
SAILAILVGAVATWLVYRITRGVPKKRTEAGFRRGQIGSASLVSLAHGTN
DAQKTMGIIFLALISYGSVSKTAAMPPLWVIVSCALAMATGTYLGGWRII
RTLGKGLVEIQSPQGMAAESSSAAVILLSAHFGYALSTTQVCTGSVLGSG
LGKPGGEVRWGVAGRMGVAWLVTLPLAGLVGAVTYWIVHLIGGYPGAIIG
FALLVAVSATIYLRSRKVKVDHHNVNAEWKGDLTTGLEGADDHSPPPDAG
PPFGGPGDRYQSDDPTLKASAS
>MAP3022 ppk, Ppk
MMRHDRNVTEIDAETRPDENLWHSGDSAVGAPPAATPAAMTDLPEDRYLN
RELSWLDFNARVLALADDNSLPLLERAKFLAIFASNLDEFYMVRVAGLKR
RDEMGLSVRSADGLTPRKQLALIGEHTQRIATRHARVFLDSVRPALAEEG
IHIVTWADLDQAERDELSTYFTEQVFPVLTPLAVDPAHPFPFVSGLSLNL
AVMVRQTEDGGQHFARVKVPNNVDRFVELAAPRAGAEGENRGVVRFLPME
ELIAAFLPLLFPGMEIVEHHAFRITRNADMEVEEDRDEDLLQALERELAR
RRFGPPVRLEIADDMTEGMLELLLRELDVHPGDVIEVPGLLDLSSLWQIY
DLDRPALKDPAFVPDTHPAFADRESPKSIFATLREGDVLVHHPYDSFSTS
VQRFIQQAAADPNVLAIKQTLYRTSGDSPIVRALIEAAEAGKQAVALVEI
KARFDEQANIRWARALEQAGVHVVYGLVGLKTHCKTCLVVRREGSAIRRY
CHIGTGNYNSKTARLYEDVGLLTAAPDIGADLTDLFNSLTGYSRKVSYRN
LLVAPHGIRTGIIERVEREIAAHRERGQGRIRLKMNALVDEQVINSLYRA
SQAGVRVEVVVRGICALRPGVQGYSENIFVRSILGRFLEHSRIIHFRNIN
EFWIGSADMMHRNLDRRVEVLAQVKDPKLTAQLDELFESALDPSTRCWEL
GPDGQWTPSPQEGHTVRDHQVSLMERHRSP
>MAP0874 pstA1, PstA1_2
MTRAVDALDRPVKTEVFRPLSVRRRITNNAATIFFLGSFVVALVPLIWVL
SVVLERGWYAVTRSGWWTHSLHGVLPEQFAGGVYHALYGTVVQAGVAAAM
AVPLGLMTAVFLVEYGSGRLVRLTTFMVDVLAGVPSIVAALFIFSLWIAT
LGFQQSSFAVSLALVLLMLPVVVRSAEEMLRLVPDDLREASYALGVPQWK
TIVHIVFPIAMPGIVSGILLSIARVIGETAPVLVLVGYSRSINLDIFHGN
MASLPLLIYTELTNPEHAGFLRIWGAALTLIIIVAVINVIAAATRFLAGR
RR
>MAP0653 pstA1, PstA1_1
MTSMLDRPLKSRTFSPLSRRRRAANSVATVLVSLSLLVAVTPLVMVLCSV
VVKGFRAITSTVWWSHSQAGMTAFVTGGGAYHALVGTVLQGLVCAAISIP
IGLMVAIYLVEYGGGTPLGRLASFMVDILSGVPSIVAALFVYALCVATLG
LPRSEFAVSLALVLLMLPVIVRATEEMLRIVPVDLREASYALGITKWKTI
ARVVLPTGLSGIVTGILLAMARVMGETAPLLILVGYAQSMNFDIFSGFMG
TLPGMMYNQASAGAGINPIPTDRLWGAALTLILVIATINVVARVITKFLG
ARKS
>MAP0574 pstB, PstB
MFEFVDVVVERRGLRALDGLTAAIPGRGVTAVFGPSGSGKSTLLRLCNRL
ELPTSGRVSFYGSDIAGLDPLWLRRRVGMCFQRPTPFPGTVADNLRVADP
DADEARMRETLDRVALTGAWLDRDVLALSGGEAQRVCLARTLMARPRVLL
LDEPTSAVDAEAAEVIERAVRELAADGTPALWVTHDAAQVTRAADRVLRL
ERGRSLGLSQVGGGPDDGAVPR
>MAP0652 pstC2, PstC2_1
MVAAPFPEPSATPISPWGQGRPHAGDRIFRRLAQASGVLIVLVIAAIAVF
LLDRAVPALQRNRENFFGYGGNWVTTDTSAMHFGIASLLPVTVFVSLFAL
ILAMPVALGVAIFITHYAPRRAATPLAYAVDLLAAVPSIIYGAWGLYVLA
PQLRPVATWLNHSMGWCFLFADGNTSAAGGGTIFTGGIVLAVMILPIITA
VTREVFIQTPHDQIEAALALGATRWEVVRTVTLPFGRSGYISGGMLGLGR
ALGETVALLIILRGTQSAFGWSLFDGGSTFATKIAGAAAELDDRFKAGAY
IAAGLTLFVLTFVVDALARGAVAGVGRRAGP
>MAP0873 pstC2, PstC2_2
MTRGTATAQLPAPTTLNARVPRRGDRLFKAIAAAAGFTIVVAIALIAVFL
LLRAVPSLRVNHANFFTSAQSSTSDPHRLAFGIRDLLMVTVLSSLSALVL
AVPIAVGIAVFLTQYAPRRLARPFGAIVDLLAAVPSIIFGLWGIFVLAPQ
LEPVAAFLNRHLGWFFLFKTGNVSLAGGGTIFTAAVVLSVMILPIVTSVS
REVFRQTPHIQMEAAQALGATKWEVVRMTVLPFGRSGVIAASMLGLGRAL
GETVAVLIILRSAARPGHWSLFDGGYTFASKIASAAAEFSSPLPTGAYIS
AGFALFVLTFIVNALARAIAGGKVNG
>MAP3388c pstS, PstS
MVAPAVGGGRSCVSFARSGAVLSLLAAAALTLTGCGGDSKSSSGSGAHVD
CGGKKVLKDSGSTAQQNAIEQFVYAYVRACPGYTLDYNANGSGAGVTQFL
NNQTDLAGSDIPLDRTTGQTDRAAARCNSPAWDLPTVFGPIAVTYHLTGV
SGLKLDGPTVAKIFNGAITKWDDPALKAVNPGLNLPSTPIAVIFRSDKSG
TTANFQKYLDGASNGAWGKGTSEMFTGGVGQGASGNNGTSALLQNTEGSI
TYNEWSFAVGKQLSMASIITSAGPDAVPITKESVEKTIAGATFQGQGNDL
VLDTSSFYRPTQQGAYPIVLATYEIVCSKYPDAATGSAVKAFMQAAIGPG
QDGLDQYGSIPLPGSFTAKLSSAVNAIS
>MAP0187c sodA, SodA
MAEYTLPDLDWDYAALEPHISGQINEIHHTKHHATYVKGVNDALAKLEEA
RANEDHAAIFLNEKNLAFHLGGHVNHSIWWKNLSPDGGDKPTGELAAAID
DAFGSFDKFRAQFSAAANGLQGSGWAVLGYDTVGSRLLTFQLYDQQANVP
LGIIPLLQVDMWEHAFYLQYKNVKADYVKAFWNVVNWADVQKRYAAATSK
AQGLIFG
>MAP3921 sodC, SodC
MPKLLPPVVLAGCVVALGACSSPQHASSLPGTTPAVWTGSPSPSGAGAAE
AAPAAAPSITTHLKAPDGTQVATAKFEFSNGYATVTIETTANGVLTPGFH
GVHIHKVGKCEPSSVAPTGGAPGDFLSAGGHFQAPGHTGEPASGDLTSLQ
VRKDGSGTLVTTTDAFTMEDLLGGRKTAIIIHAGADNFANIPAERYNQTN
GTPGPDEMTMSTGDAGKRVACGVIGAG
>MAP3402 sseA, SseA
MPLPPDPHPSLQEYAHPERLVTADWLSANLGAPGLVIVESDEDVLLYDVG
HIPGAVKIDWHTDLNDPRVRDYIDGARFAELMDRKGISRDDTVVIYGDKS
NWWAAYALWVFTLFGHPDVRLLNGGRDLWLAERRETTLDVPTKTSTGYPV
VTRNDAPIRAFKDDVLAILGSQPLIDVRSPDEYTGKRTHMPEYPEEGVLR
GGHIPTARSIPWAKAVDESGRFRSRAELEELYGFLRPDDKTVVYCRIGER
SSHTWFVLTHLLGKPGVRNYDGSWTEWGNTVRTPIVAGEEPGQAPAGV
>MAP2046 sseB, SseB
MGARDQVLITATELADVIEAGDPVSILDVRWRLDEPDGRAAYLQGHLPDA
VYVSLEDELSDHTVSGRGRHPLPSGPSLQAAARRWGIRQDTPVVVYDDWN
RAGSARAWWLLRAAGLDNVRILDGGLAAWRATGGRLVSGPVEPVPGNVTV
PHGDLHSGNRPTVTTEQVAAGAATLIDARAPERYRGEMEPLDPVAGHIPG
AENLPSGEVLAADGTFLGDDALARVFAEHRIERHGAVAAYCGSGVTATVT
IAALAAVGRTAALYPGSWSEWCADPARPVERGGA
>MAP2213c subI, SubI
MDIRTAARWRPVLALVLTAGVVAGCHGGASDAVGGTGPADARTSITLVAY
SVPEPGWSKIIPAFNASDEGKGIQVVTSYGASGDQSRGVVDGKPADVVNF
SVEPDIARLVKAGKVAKDWNTDATKGIPFGSVVTLVVRKGNPKHIKDWDD
LLRPGVEVITPSPLSSGSAKWNLLAPYAVKSEGGAHGDAGVDFIRKLVTE
HVKLRPGSGREATDVFVQGSGDVLISYENEAIATERAGKPVEHLNLAQTF
KIDNPVAVVNTSPHLQAAVAFKNFQYTAAAQKVWAQAGFRPVDPAVAADF
RDQYPVPAKLWTIADLGGWSAADPQLFDKNTGSITKIYTQATG
>MAP3451 sugE, SugE
MPSATDTGRGAPYPASDKTAPWRCAMAWLILIASGVLEAVWATALSRSEG
FSRLGPSLVFFVALAFSMTGLAVAMRSLPVGTSYAVWVGVGAALTVTYAA
LVGDEPASPVKLVLIAGIVACVAGLKLLG
>MAP3449 sugI, SugI
MARGSRRGLLVGLTAASVGVIYGYDLSIIAGAQLFVTEDFGLSTRQQELL
TTMAVIGQIGGALFAGVLANAIGRQRSVLLILSGYAVFALLAAFSVGLPM
LLTARLLLGLTIGVTVVVVPVYVAESAPTAVRGALLTAYQLAIVSGLIVG
YLSGYLLADTHSWRWMLGLACVPAVLLLPLVFRMPDTARWYLLKGRVDDA
RRALLRVEPVARVDDELAEIDRAVSEEAASLPAMLAEMVRSPYRRATVFV
VVLGFLIQITGINAIIYYSPRIFEAMGFTGNFALLALPALVQVAGLVAVG
TALLLVDRVGRRPILLCGTAMMIVADVVLVAVFGRGPGGVIAGFAGVLLF
IFGYTMGFGSLGWVYASESFPSRLRSIGSSTMLTSNLVANAIVAAVFLTL
LHSLGGAGTFAVFAVLAVVAFAFVHRYAPETKGRQLEDIRHFWENGGRWD
>MAP2808 trkA, TrkA
MRVVVMGCGRVGSSVADGLSRIGHDVAVIDRDSTAFNRLSPEYAGERVLG
QGFDRDVLLRAGIEEADAFAAVSSGDNSNIISARLARETFGVKRVVARIY
DAKRAEVYERLGIPTIATVPWTTDRLLNALLRETQTAKWRDPTGTVAVSE
VVLHEDWIGHRVTDLEQATGARVAFLIRFGSGVLPEPKSVIQAGDQVYVA
AISGRAAEAAAIAALPPSEDL
>MAP2809 trkB, TrkB
MKVAVAGAGAVGRSVTRELLANGHDVTLIERNPDHVDVDAIPAAHWRLGD
ACELSLLESVQLQEFDVVVAATGDDKANVVLSLLAKTEFAVPRVVARVND
PRNEWLFTDAWGVDVAVSTPRMLASLIEEAVAVGDLVRLMEFRKGQANLV
EITLPDDTPWGGKPVRRLQLPRDAALVTILRGPRVIVPEEDEPLEGGDEL
LFVAVAEAEEELQKLLLG
>MAP2960c viuB, ViuB
MDVAGLPQPLTLDSFAELPAEKKPSVRTLTVRHVDAASRQIALDVVVHGE
HGIAGQWAATAQPGQPIYLMGPGGAYTPDPAADWHLLAGDESALPAIAAA
LEALPPSAVGKAFIEVAGHEDEIPLTAPDGVEVHWVYRGGRADLVPEDRA
GDHAPLIEAVTSAPWLPGQVHVFIHGEAQAVMHNLRPYVRKERGVDAKWA
ASISGYWRRGRTEETFRQWKKELAQAESAQA
>MAP2043 yjcE, YjcE
MFGLVLIVALVSTVIVGTVIGRRYRVGPPVLLIVLGVLLGLVPQFGHVRI
DGEIVLLLFLPAILYWEGLNISFREIRANARIIVFLSVALVIATAVAVSW
TARALGMDPHAAGVLGAVLSPTDAAAVAGLAKKLPRRSLTVLKAESLIND
GTALVLFAVSVHVAIGAPAISPPEVTLRFIGSYLGGIAAGLLVGGAVTLV
RKRIDAPQEEGALSLVTPFAAFLLAQSVECSGVVAVVVSALVLAYSGPVV
IRARSRLQSYAFWDIATFLLNGSLWVFVGVQIPGALRGIAGVDGGVRHAL
FVALVITGVVIVSRIFWGEFTTMLIRLIDRRAVQRERRVGWRQRFVTAWA
GFRGAVSLAAAVAVPMTTLSGAPFPDHSLLIFIVTVVILVTVLVQGSTLP
AVVRWARLPADVAHAEELQLARTRAARAALAALPAVADEVGVSDELRRRL
HKEYEEKAALVLATENGSPDNRILKTREKVRQVRLGVLEHKRREVTALRN
QNRIDDTVLRELQNEMDLEEVQLLAAAADEDDGDTE