TitleGenColors Logo

Gene list

Applied filters:

COG category: Inorganic ion transport and metabolism
Gene type: CDS
Genomic element: chromosome

Number of genes found: 237

Free access
Sort by:

 



# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP3299c hypothetical protein
MRNGRSRKLSGLNQTLAAQRGHQLVGVVRIPEEHASPIRVITRRLAIALV
VLFAAAVIVYADRSGYRDLRGGSLTFLDCVYFSAVSLSTTGYGDITPYTE
TARLVHTLIFTALRIAFLAVLVGTTLEVLSERSRQGWKIQRWRSRVRNHT
IVIGYGTKGKTAVAAILGDETTQAEVVVVDTDRSTLEHAESADLVTVHGD
ATKADVLRLAGAQHAASIIVATSRDDTAVLVTLTAREIAPHAKIVASIRE
AENQHLLQQSGADSVVVSSATAGRLLGLATTTPSVVEMIEDLLTPDVGLA
IAEREVEQSEIGGSPRHLRDIVLGVVRRPAAAHRRPRGGRGRGQRPVALH
PQRGALMAIADFQLRSVPLLSRVGADRADQLRTDVEAAAAGWADAALLRV
DSRNQVLVADGRVVLGAAAELGDKPPPEAVFLGRLEDGRHVWAIRGALQA
PDDPEVRAEVVNLRSLGPIFDDTSSQLMSSAVALLNWHERSRFSSVDGSP
TRPARAGWSRVNPVTGHEEFPRIDPAVICLVHDGGDRAVLARQAVWPERM
FSLLAGFVEAGESFEVCVAREVREEIGLTVRDVRYLGSQPWPFPRSLMVG
FHAVADPAQDFAFNDGEIAEAAWFTRDEVRAALAAGDWSSDSESKLLLPG
SISIARVIIESWAALD
>MAP1807c hypothetical protein
MIRGVVALTATVGLLLTGCVSRTTTSGPTPPPAAVPLSDLSGLTLQVGDQ
KGGTEALLRAAGQLDNLPYRVAFSTFTSGPPQVEAATAGKIDFAITGNTP
PIFGAASNARIKAVSAYGGGGAGNRILVHADSPITSVSDLRGKAIAVAKG
SSSHANLLAQLDRAAIKPADVKFVYLQPADALSAFSQHQADAWAIWDPYT
AQAEQQIPVRSIAEAQGVTNGDWIGVASDQALADPKRNTALGDLLVRFET
AVRWARAHPQQWAQSYAATVGLDPQVAAVSQARSLRLPTELGDDVVASEQ
KLADLFAAAGQLQSAPRFANWVDRRFNAALRPGLVS
>MAP4315 hypothetical protein
MAAAAGNPARTRDEDAIGTPPPGPTLVPAERYYSPAFAALEVERMWPRVW
QLACMVDHVAAPGDYFEYRCGPYGVLIVRGDDGALRAFQNVCRHRGNSLC
SGSGSGLRELKCGYHGWTWDLAGALKRVPDRKGFGTLRLSDYPLIPARVD
TWAGLVFVNLDPDAMPLPEYLEAIPEDTAWCRLDEFRCYATLTVEVDANW
KTVADGYSETYHIQTLHPELLRCVDDIHAPQQIWGHTGKSDQPYGVPSPR
FEGALSDEEVWDAYVSTQGALMGAAEGTPFPAADHRPGQTVADLIADRTR
AFAASRGVDLGWADTDRITRLHQYNVFPNMTFLTNADHLTVMCSRPAPDP
AAAPDKGELVMFLTTRMPPGAPRTKPTDVRMSAGEAEPGLVLTQDIAVLA
GLQRGMHQPGFTHLVLSSEERRVINMHRNLERYLDLPAAQRMSGGAGT
>MAP0395c hypothetical protein
MVATTSSGGAAVGWPARLTKARLHFVTGKGGTGKSTIAAALALTLAAGGR
KVLLVEVEGRQGIAQLFDVPPLPYQEVKIATAERGGQVNALAIDIEAAFL
EYLDMFYNLGIAGRAMRRIGAIEFATTIAPGLRDVLLTGKIKETVIRVDK
NRLPVYDAIVVDAPPTGRIARFLDVTKAVSDLAKGGPVHSQADGVVRLLH
SEQTAIHLVTLLEALPVQETLEAIEELAEMQLPIGSVIVNRNIPAYLQPA
DLAKAAEGDIDADAVRAGLQKAGITLDDKDFAGLLTETIEHATVIATRAE
IAQQLDALHVARLELPAISDGVDLGSLYELSESLAQQGVR
>MAP0142c hypothetical protein
MMSSNVRAARNRPGRTDPQPPTGAPLFVGVLGLLLATGWVANHFVGLMPA
ISDRDHLATTTLDGIFGIYALGLLPGLLVGGRTSDALGRRPVALTGSAAA
LVGTVAMLLSQHSPALFAGRLIVGLGVGLAISAGTAWASDLRGPAGAATA
GAVLTAGFAVGPFAGGVRAWAGPSGVRASFALAAAILALAAFAVVAAPQP
SPVTAPADPDGEETADAAPQGISRALSWAMPLAPWVFASATLGFVTIPGR
LHTALAAPVAAGTATLIVNGVSGAVQVLARALRWGPQAGTAGAVLAALGY
AVAAAAPPTLTPALGVPLFVVLGCASGLCLREGLIDLGGRRAATPARRPD
GLFYVVTYIGFGLPLILASVRPGVATAILSGMAVLAMTAAVGRAARLRRD
DHRQN
>MAP0394c hypothetical protein
MSTTPKQLDMAAILADTTNRVVVCCGAGGVGKTTTAAAIALRAAEYGRNV
CVLTIDPAKRLAQALGVNDLGNTPQRVPLAAEVPGELHAMMLDMRRTFDE
MVVQYSGPGRAQAILDNQFYQTVASSLAGTQEYMAMEKLGQLLAEDRWDL
VVVDTPPSRNALDFLDAPKRLGSFMDSRLWRLLLAPGRGIGRLVTGAMGL
AMKAMSTILGSQMLADAAAFVQSLDATFGGFREKADRTYALLKRRGTQFV
VVSAAEPDALREASFFVDRLSQEGMPLAGLVLNRTHPPLCSLPAERAIDG
TEMLEHDGDPETTSLAAAVLRIHADRAQTAKREIRLLSRFTGANPHVPVI
GVPSLPFDVSDLEALRALADQITSNQATAR
>MAP3280c hypothetical protein
MQNERVQGFGFHTLALLTAVGFAGPLLASAKRFRIPVVIGELIAGLAIGR
TGFGVVDVADPTFQLLANIGFALVMFVVGTHVPLRARLMRSALPAALARA
TLAGGIAAVLGVALAVQFDTGHAALYAVLMASSSAALALPVIDSLRLRGP
RVLSVTTQIAIADAACIVLLPLVIDIRRAPTATLGSLAVAGCAAALFVLL
RAVDRKGWRRRLHAYSEQHRFALELRTSLLVLFALAALAVATQVSIMLAG
FAVGLVVGAVGEPHRLARQLFGITEGFFSPLFFVWLGASLQVRELGAHPQ
LILLGAGLGCGAVLAHCAGRLLGQPLTLAVLSAAQLGVPVAAATIGTQQH
LLAPGEASALMFGALLTIAAASIATGLAARRQGAPEPGEPAK
>MAP2018c hypothetical protein
MVTDRFDAIIVGAGFGGIGAAIQLKRLGFDNIVILDREDDLGGTWHVNHY
PGIAVDIPSTTYSYWFEPNPGWSRLFAPGGEVKQYAADVADKYDVRRHMR
FNTTVEGAQWDEDAEVWRVALAGGESLTTRFLITATGFLSQPHTPDIPGI
GSFGGKVVHTTAWDHDYRYQGRRIAVIGTGASAVQVVPELAKEAGELTVY
QRTATHVLPKVDFEFDPAVRRLFARVPAAQRALRWVTDVLLEIIMIVGAL
HFKESRGRGNISASDLAKINRFRWIRDKELRAKLTPDYDCGCKRPTFSNS
FYRVLTQPNVHLETNPIERIEPDGIVTADGRKTVIDTLVLATGFDLWEAN
FPAIEVIGRKGRNLGKWWRETRFQAYQGVSMPYFPNYLSLASPFAFSGLS
FFHTIEYQMRHMDRLLGEVKRRGATIFEVTEEANDRFMERMTKLLDNSVF
YAGNCATSRSYYFSPSGEASLLRPTSTLNSIREASSFPLSDYVIA
>MAP0631c hypothetical protein
MVNIVAVRRHGVHVRVIHVPPVQPQPILAPLTPAAIFLVLTVDDGGEATV
HEALQDISGLVRAIGFREPQKRLSAIASIGSDVWDRLFSGPRPAELHRFV
ELHGPRHTAPATPGDLLFHIRAESLDVCFELADRILKSMAGAVTVVDEVH
GFRYFDNRDLLGFVDGTENPDGALAVSSTAIGDEDPDFAGSCYVHVQKYL
HDMSAWTALSVTEQENVIGRTKLDDIELDDDVKPADAHIALNVITDDDGT
ELKIVRHNMPFGELGKSEYGTYFIGYSRTPRVTEQMLRNMFLGDPPGNTD
RILDFSTAVTGGLFFSPTVDFLDDPPPLPAPGTPAAPPARNGSLSIGSLK
GTTR
>MAP2734 hypothetical protein
MPGVLENVRRGMIPAHIYNDPELFALEKRRLFARAWTFVGHESEIPHDGD
YMVRRVLDDSFIITRGSDGRVRAVFNMCLHRGMQVCRAELGNASNFRCPY
HGWTYRNDGRLTGLPFHREAYGGDDGFVKDQTLLPAPNFAGYDGLLFISL
DPAAPPLQDFLGDFRFYLDYYTRQSAGGVELRGPQRWRIRANWKIGAENF
AGDMYHTPHTHASIVDIGLFREPKAQKRKDGATYWAHRGGGTTYKLPPGG
FEERMRYVGYPDEMIGRIKQVWTPRQQRVVGEDGFMVSAATCFPNLSFVH
NWPKVRDGRDDETLPFISIRLWQPISENETEVCSWFAVDSAAPAQYKQDS
YKAYLMCFGSTGMFEQDDVENWVSLTTTAGGSMARRLLLNSRMGLYDDGR
PVVEALAPSAFHGPGRAQVGYNEHNQRALLAMWADYLQEGDACENR
>MAP0044c hypothetical protein
MIPTRMQSSAPVEIWRSVRALPDFWRLLQVRVASQFGDGLFQAALAGALL
FNPDRAADPLAIARAFTVLFLPYSLLGPFAGALMDRWDRRLVLVGANVGR
LVLIAAIGTILAVRAGDLPLLLGALFANGLARFVGSGLSASLPHVVPREQ
VVTMNAVATAAGVVAAFLGANFMLVPRFLFGAGDRGAAAIVFLTVVPVSI
ALLLSWRFAPRALGPDDTWRAIHGPVLYAVITGWLHGARTVAQRPTVAAT
LSGLAAHRMVVGINSLLVLLLVHHLPGLEGGGFGTALLFFGAAGLGAFLA
NVLTPPAIRRWGRYASANGALAASAIVEVAGAELLLPVMVVCGFLLGVTG
QMVKLCADSAMQMDVDDALRGHVFAVQDALFWVSFIVAITVAGMVIPDDG
HAPVFALFGSVLYLVGLAVHGIVGRRGE
>MAP3508 hypothetical protein
MMTDVAKDANSAVAEELSTPMTIGVEAYISEDYARAERDKLWRKVWQQVG
RVEELPEVGSYLTYDILDDSIIVVRTGANEFRAHHNVCMHRGRRLIDTPE
GAKNALGRTRKSFVCGFHGWTYGLDGACTHIREQQDWRQALTPDNTHLRP
VRVDTWGGWLWINMDPDCEPLADYLFPAAKILEPFGLENMRYKWRKWLYF
DCNWKVALEAFNETYHVYTTHPEFNKFGEFKGWAKAQGRHSNIGYDAPED
MEATKSKIRLGIGADPRVSTAEMQVYTMEETNATTTQTLVNAAKRLVDEL
PEGTPADKVLEHWLASARRDDEARGVIWPTIPPDILGQAGTAWQIFPNFQ
IGQGLTSALCYGARPHPSYNPDKCIFEVSVFELYPKGEEPQTEWEYTPVG
DPRWRSVLPQDFSNMAAVQQGMKSLGFPGTKPNPYRERSTVNLHYQLSRY
MGTGAPRELSDKEHPLA
>MAP2550 hypothetical protein
MGSVNRVYIARLARILVLGPLGESVGRVRDVVISISIVRQQPRVLGLVVD
LATRRSIFIPILRVAAIDPNAVTLSTGSVSLRHFEQRPGEVLAIGQVLDT
VVKVNDPELPELAGVDVVVTDLGIEQTRTRDWMVTRVAVRPQRRLRRRGP
VHVVDWRNVQGLTPSALALPGQAVAQLLEQFEGRKPVDVADAIRGLPPKR
RYEVLKALNDDRLADILQELPELDQAEVLSQLGTERSADVLEEMDPDDAA
DLLGVLNPTDAEMLLKRMDPGDSASVRRLLTHSPDTAGGLMTSNPVVLTP
DTAVAEALARARDPDLTAALSSMVFVVRPPTATPTGRYLGCVPLQRLLRE
APAELVGGIVDSDLLTLRPETPLVAVTRYLAAYNLVCGPVVDDENHLLGA
VTVDDLLDHLLPPDWRVDMQELDTAGRLEGLGGSG
>MAP1423 hypothetical protein
MIGLGGVWVLDGLEVTMVGNVSARLMEPGSGIALNPAQIGMAAAIYIAGA
CSGALFFGHLTDRFGRRNLFILTLALYLIATVATAFAFAPWYFFLTRFFT
GAGIGGEYAAINSAIDELIPARVRGRVDLVINGTYWLGSAAGAGGALILL
DTSNFAADLGWRLAFGIGAILGIFVLLVRRNVPESPRWLFIHGREEEAEH
IVGEIEEAVQQQTGRPLPEPQGKALRIRQRTAISFREIAAVAFKLYPRRA
VLGLALFIGQAFLYNGVTFNLGTLLSQFYAVPSGMVPVFFVLWALSNFAG
PLLLGHLFDTVGRKQMITLTYIGSAVVVVALALVFLTQAGGVWAFIGVLI
VAFFLASAGASAAYLTVGEIFPMETRALAIAFFYAMGTAIGGITGPLLFG
QLIDSGQRDHVVWSFLIGAVVMAAAGLVELWLGIAAEQRPLEDLALPLTV
DDAEDTEPQGDSAPVD
>MAP3181 hypothetical protein
MIVWSALSLVALLGGIGVMFAVYGRWSQKVGWHSAETSTLSFRQPGDVGL
TPAQRACVWFFAIVSVLFLAQTLLGAAAEHYRADLSNFFGLDLARVLPYN
LARTWHLQLSLFWTAAAFLAGGIFLVPFIARREPKRQALLAYILLGAVAV
VVFGSLICEALSIYGVIPAGGLFSQQWEYLDLPRLWQILLIAGMFMWIAI
IWRGIRGRLKGESKMNMPWLFFFSGLAIPAFYTVGLLAGSDAHYTVADFW
RFWVVHLWVEDFLELFTTVMVAYMFVLLGVVREKIALGVIFLDVILYSAG
GVIGTMHHLYFSGTPVEHMALGAFFSAAEVIPLTFLTVEAWAFLQLGARQ
QSGDANPFPHRWAVMFLVAVGFWNFLGAGIFGFLINLPVVSYYQIGTALT
ANHGHAAMMGVYGMLAVGLAMFAFRYVIPADKWPEKLARISFWCMNIGLA
WMVFATLLPLGVLQLYHSVNDGYFEARSLGYITKPGNAVLEWLRLPGDVI
LIAGGVLPFVWIAWTALRNFRSGTTVQELPEHPLYTESPVMSEAGAAAAK
D
>MAP2343c hypothetical protein
MVRTGYGVQVARTVSDSAPDSAEDPSEREFGQAGIALSTYRFPTGWFIVA
FGSDLAPGQVKRAHYFGEELVLFRTASGRVHVMDAYCQHLGANLGVGGTV
EGENIVCPWHGWQWRGDGSNALIPYSKIGCKNNVRIRTYPSMEWYGFVLA
WHERHGRAPYWQPPVLPELETGEYYPLHPHTQMVNRVKVHPQMIIENAAD
PYHVQYVHKAANPATTASFEVAGYHLHATVNAHFGGGRAQTWLTPNGPVD
AKIIYDNYSLGLGVVRFPSELVATVQVTGQTPVDEDYTDYFYTQASVREP
GDTGDVPTGRAARFLALQKEVIKQDFFTWENMKYLEKPNLAPEEAHDYAA
LRRWAHRFYPGPQPAPTDFGYTADGEPDPAAAKA
>MAP1864c hypothetical protein
MRRIRFSGPRLQGLGWTPGQHIRLQVESLRESMLRLHPYPVLRTYSIYAA
DPDRGALDIVMVDHDGDPKGATPARRWAMAATLGDHVRMTRPQGKFVIRD
DAPYHVFVGEETASVAFAAMLRSLPPTAEVYGVVEAATEADHLPRARPLE
RVERGGAPAAKSAVLADALRRLPLPDHPGVAYLAGEARTVQALRQILITE
RGWKRNQVRTKPFWSPGRTGME
>MAP3727 hypothetical protein
MDVSSAAIAAEQLSFGYPGDGQRLIAVSLAVQPGQVCCLLGPNGAGKTTL
LRCLLGLLTPQSGTVRVAGDPIDRLSRRQLARRVAYVPQRSNTPFPFSTL
DIAVTGRTPYLRAMTSPSATDRRAAAAVLDRLGIGALADRPYAVLSGGER
RLALLSRAMVQDAPVLILDEPMAALDFGNESRILQVVAELAAAGRAVLMT
THQPWHALHSGDQAVLIADGRLIADGPVEQVVTAAALSELYGVPVRVLTA
TDDATGRPVYACAPVAAGDDR
>MAP3776c hypothetical protein
MATVGACCRSGGPAGVPRNRAGAHRDADRVSGKSRALDMKTVIVCGVGGL
LSRWHGAPIATVLVLTVVTSCSSSPTQTAEGSRNAASPSAIGRTAAPPCP
TAPLAVVVSVDQWGDIVSELGGACANVKTVLASSSVDPHDYEPSPADAAD
FMNAKLIVVNGAGYDSWASKLAGSSASGAPLVSAAAVTTTPDGANPHLWY
LPSAVTAVADAVTQELSRMEPPAAGYFSQRRAQFTSATRLYVNLIAKIKA
EAAGKSYGATETVFDYQAQAAGLVNKTPAGYRRASANESEPSPGDVDAFL
TALAGRHIDLLIYNTQTEGSIPEEIRSAAEQSSVPVVKITETVPPGETSF
EDWQYGQLVQLAKALHVAV
>MAP0223c hypothetical protein
MTAPRLYRPSPPLAEHIEYFGYWRGDEALGVHTSRALPRGAVTAIVDVAG
RTDLGFYASDARTPLTVPPLFAAGAGATAYVVRVAPAHTVMTIHFRPAGA
LAFLGCPLSDLEDALVGLEELWGRDAALLREQLIDAGSPPRRVALLQAFL
VRRMRRNAVWPPARLAPVLRGADLDPSMRVSKAQELSGLSRKRFAALFRC
EVGLSPKAYLRVRRLQAALRALDTPARGATIAADLGYFDQAHFLREFRAF
TGVTPTQYARRRSSMPGHVELAR
>MAP1434 hypothetical protein
MRMHDTDEIRLIEAQAVPTRFARGWHCLGLIRDFGDGKPHPIDAFGQKLV
VFRGGDGAINVLDGYCRHMGGDLSRGEVKGNEIACPFHDWRWGGDGRCKQ
VPYSRRTPRLARTRAWTTLQQDGMLFVWNDPEGNPPPPEVTIPRIEGAGS
DEWTDWHWYSTVVQSNCREIIDNVVDMAHFFYIHGSLPKQFKNIFEGHVA
TQYMNSAGRPDIGGEGARMLGTTSVASYWGPSFMIDDLTYHYEDADHQTV
LINCHYPIDANSFVLQYGIIVKKSDALPDDLAMQTAIALGDFVKLGFEQD
VEIWRHKARIDNPLLVEEDGPVYQLRRWYQQFYVDVADVQPDMVDRFEFE
LDTTRPYAAWMKEVEANLAARA
>MAP4065 hypothetical protein
MAIIHGADRGARPQSWEEVARVTETTTQPVADAVADRPSVSLRTRGRWLR
WGLLSVWGPGLVVMLADTDAGSLITASQSGAQWGYRMVLPQLILMPVLYV
VQEMTVRLGIVTGRGHGSLIRERFGRGWAWLSAFTLFASAIGALLTEFAG
VAGVGELFGVSRWVSIPVATIALLALALTGSYRRVERIGLAVGAAELAFL
VAMVMARPDPGALAHGLTSMPLGDSSYLLLIAANVGAVIMPWMIFYQQGA
VVDKHLSESTIRQARYDTAFGAVLTQLIMIAVVITMASTIGRHGDGAPLE
TVGQIAQSLTPYLGHVGGTVLFGLGMLGAALVAAIVASLAGAWGLAEVFG
WKHTLNQRPNRATAKFYLTYSLAHIVGAVLVLASVDLVNLAVDVEIMNAL
LLPIVLGLLLALEARALPEQWRMRGLHKHVTRALCLVTIGFGLYMVPQAL
GWA
>MAP0998c hypothetical protein
MMITGDNPLTAKAIADEAGVDDFLAEATPEDKLQLIKREQAGGKLVAMTG
DGTNDAPALAQADVGVAMNTGTSAAKEAGNMVDLDSDPTKLIEIVEIGKQ
LLITRGALTTFSIANDIAKYFAIIPAMFVALFPGLDLINVMRLHSPQSAI
LSAVIFNAIIIVLLIPLSLRGVRYTPSSASKLLSRNLYIYGLGGIVAPFI
GIKAIDLIVQFVPGMS
>MAP0487c hypothetical protein
MEHRLADMADHLFSLDITVHLLGHDFVQQALVAAALLGLVAGLIGPFIVM
RQMSFAVHGSSELSLTGAAFALLVGIGVGVGALIGSALAAALFGVLGRRA
RERDSVIGVVLAFGLGLAVLFIHLYPGRTATSFALLTGQIVGVGYTGLTM
LALVCLLVIAVLATCYRPLLFATVDPDVAAARGVPVHALGIVFAALVGVV
AAQAVQIVGALLVMSLLITPAAAAARVVASPGAAMLASVAFAEVSALGGI
VLSLAPGVPVSVFVATISFLIYLACWLIGRRREAAT
>MAP1472c hypothetical protein
MNVINGMPAHALLVHFVLVLVPLTALLDIVCGLWPAARRGQLMLLTVILA
VVTMALTPITIDAGGWLYDQRADPSPILQEHATLGSAMTYFSAALLAVAI
VLALLGLIERRSDKRRLLTRGVVAVLALGIGIASMVQIYRVGDAGAQSVW
GGEIAHLKKAHPG
>MAP1725c hypothetical protein
MSAVAGFAAVDVGGFAYAGGWLRPDSPLTPPRFADRFEHVYGRHDGFRRN
HAKGLSATGSFTSTGAGAAICRAAVFREGTVPVVGRFSLGGGLPDQADKP
ETVRGLGLLFDAGGQQWRTAMVNVPVFTDSTPEGFYERLLATKPVPSTGK
PDPQKMAAFLDRHPETAAAMKIIKQSPPSAGFADTTFYGLNAFLFTNSAG
ATVPVRWSVVPHDGGVAGAPGPTRGKDFLFDDLIRTLAQRPLKWRLILTL
GEPGDPTHDATKPWPQSRRTVDAGTVTITAVHTEEEGNARDINFDPLVLP
DGITPSDDPLPAARSAVYARSFTRRAEEPKSHSEVNVTRVLP
>MAP2717 hypothetical protein
MLSGRPNGEQGRRVATVRSGTMRSMGAQTALTLLLAAIAVIVLVDWISER
TRLPSATLLVLVGIGQALLPGPTIGLEPDVVMTCILPPLLYHAALESSLV
GIRRNLRTVVSLSVVLVLLTAASVGVAFSLLVGGATLAVGMVLGAAVAPT
DPVAALAVARKEGLPLNIVTLIEGEGLLNDATALTTLAVAVTVARGAAFS
APSAIREFVLAAVGGLVVGQVFAYARRLLRRWRHDVLTANAISLATPFLT
YLVAEKLSASGVLAVVVCGLIVGHDSPRVESGASRLQTRAVWRLVNFLLE
GVVFLLIGHQVPVILDELGGYALSTILVAVGVTVAVVLLVRPLWLLLTQA
LPRSLHTRLGNVDDSATDSAEPRPERTRERLSGREIVVLSWAGTRGVVSL
AAIFAVPVLTEGNAPFPDRDLLLFCTLVVVLVTLIGQGVTFGPLVRALGL
RAKTTDELRLRNRARAAALRAALDRLDSLDPEDGADVDPRVIDGVRQQLS
AQLERYEHRLTLLSDVDELPAAPAYEAAVGLRRVAIDAQRDELVRWRDAG
MLPDQSLRAIERELDHEESMLPMGLPRSPRRSQKR
>MAP3739c hypothetical protein
MSIGAQMLDHARSNQTWTTAVAALACFLMTLDITVVNVALPSIQKDLGAS
LEGLQWVVNAYVLAFAALLLTVGSVSDRLGRKRLFLTGVAVFTVASALCV
ASRTESPLIAARALQGIGGALVFGTCLALIADAYTDAEEEQRRKAVGLAM
AAGAAAATLGPLIGGGLVEIGTWQWIFAINVPVGVALAICTALKVREPHA
PHAADNSRVDSVGAVVAIVVLFALNYGLLTGAAKGWGRGDVLAALAIGLA
GGVGFVLHQLRRGSEATLDLTLFRIPTFLAAIVLGFTVRALSFGVFPFLI
LWLAGAHGRSAFDIGLILSALALPLMVCAVLSTSVARAVGVRATMSIAMV
ITAAGLFLATLIRGDGSWTTILPALAVLGVGNGVAMPHLMNLAVDVVPSN
KAGMATGAANTAFPLGTATGVAAFGVVLSSFVHAKVAASGVIPVHSADSV
ASAIVAGVLRFPTQAMTAFATSAFTDALRLIFGIAGCAALVAAGLSGALI
THRPRVAAESPESVTE
>MAP2065 hypothetical protein
MGTSEDKAPGQPVIHLSQLLRAPVLARSGETVGRVEDVIVRLRGADEYPL
VTGIVAGVGGRRVFVGDKSIHEYSADRVLLTKNKIDLRGFERREGEVLLR
TDVLGHRLIDVATVELVRAYDIELEQTTAGWMVARLDTRRPPRLFGLIKH
SGGHASRDWKAFEPLIGHARSDAVRRLSDRFGELKAAEIADLLEEADKAE
GGEILDRVHSDPELEADVFEELDPEKASRLLDEMPDDEVAALLGRMRADD
AADAIVDLRQSRRRRVLDLMPAPQRTKVITLMGFNPESAGGLMNVDSVSC
AASATAAEALALIASSHSIQPEALIKVHVLDEDRRLDGVVWVITLLQVDP
SETLERLMDSDPVRVNADADLTDIALLMADFNLYSIPVVDEQDHLLGVVT
VDDVLEATIPEDWRRREPAPRPIREITTAEDRPLPGGNAP
>MAP1089 hypothetical protein
MSARRASWPVRISAAVLILTAAWAAAPGLFTDRNPLKGRPVDKFQPPSAA
HWFGTDHLGRDVLTRVIYGTSHTVATAGLAVAVGLLLGSAVGIAAGVSGP
VVDAVAMRASDVLLALPGFLTSVWIVTAYGPGPLSVGIGVGIGSIAVFAR
VFRAEVLRVRALDYVEAAFLSGETRWSVIRRHIVPNAAGAVIALAVIDLS
GAILLISALGYLGYSAPPPTPEWGLLVAEGRRYLATAWWLSTLPGAVVLS
VILALGVLSRRALTSHRI
>MAP3728 hypothetical protein
MIRAVLAAVCVATSAAGCGAAHHSPAEPTRTVVDMTGQHVQIPATVTRVA
TNIPLIPATIELLGGIDTVVAAARGSFNALFTTIAPATQQIPRSPPTSLN
AEQLLDLHPQVFFMTDLTPGLLPMLQRLQIPVVQITAFTSPQDLQKAVNL
VAQVLGGAAPARARQYDTYFDAVIQQVHAGAQTDRPTVYYAPGPDPTTTV
GADNIITASIEAAGGRNIAVEHGIGGHQPGAFAFPTITAETLLAWNPDVI
VASNARVADQLATDPTFATLNAVRDHHIYTCPVGIFPWCASSSEAALAPL
FLAKKLDPERFSDLNLANKVANFYIQFYGYSLTGPQVTAILDGAG
>MAP4062c hypothetical protein
MTVTPQEAADHGAAEADYFDVLIVGAGISGIDAAYRITERNPQLSYAILE
RRARIGGTWDLFRYPGVRSDSSIFTLSFPFEPWTRKEGVADGVHIREYLT
ATAHKYGIDRHIRFNSYVRSADWDSTSDTWTVTVEDGARDGERKLYRARF
LFFGSGYYNYDEGYTPDFPGIEEFTGTVVHPQHWPEDLDYTGKKVVVIGS
GATAVTLLPSLSDRAAKVTMLQRSPTYLISASKYGKVAAVARKVLPRKPA
HLVIRMYSALTEAVFFALSRKAPRLVRWLLRRKAINSLPPGYAVDIHFKP
RYNPWDQRMCLIPDADLYNAITAGRADVVTDHIDHFDATGIALRSGAHLD
ADIIITATGLQLQALGGATISLDGNEIKTNDRFVYKAHMLEDVPNLFWCV
GYTNASWTLRADITARATAKLLEHMTTHGYTHAYPHRGNEPMTEKPSWDI
NAGYVLRSVHALPKSGTKRPWNVRQNYLADAIDYRFDRIEEAMVFGRAAD
RAALAG
>MAP1152 hypothetical protein
MDFGSLPPEINSGRIYSGPGSAPLLAAAAAWHGLAAEMHSAAASYGSAIA
ELRTLWHGPSSTAMAAAAAPFIAWLGGTAAQAEQTAAQATAAAAYDSVFA
ATVPPPVIAANRALLASLIATNVLGQNTPAIAATEAHYAEMWAQDAAAMY
AYAGASAVATRLTPFGAPPQSADANAAADQSAAAASALQLSTASSVESAL
SQGVSQVPVAAQVNATAVTAAAQLPLSLTDITGILKTFNSVMGTISGPYT
PLGVANLAKNWYQIALSIPSVGTGIQGIGPLLHPKALTGVLAPLLRSDLL
TGSTALSSAGTVSASAGRAGLVGSLSVPANWASAVPAVRTVAAELPETML
DAAPAMAVNGQQGMFGPTALSSLAGRAVGGTATRAVAGSTVRVPGAVAVD
DLATTSTVIVIPPNAK
>MAP3141c hypothetical protein
MARFPKPPEGSWTQHYPELGTAPVSYEDSINPEVYELEREAIFKRAWLNV
ARVEQLPRKGSYLTRELKVVNTSIILVRNGSGEIKAFHNVCRHRGNKLVW
NDMPLEETRGVCRQFTCKYHAWRYDLDGNLTFVQQEEEFFDLDKSRYGLV
PVHCDVWEGFVFVNFAKTPEQSLREFLGPMITALEGYPFDKMTSRWCYRS
EVKANWKLYMDAFQEFYHAPVLHANQSPTAYSKAAAEAGFEAPHYRLDGP
HRLVSTSGVRAWEMSPEMRKPIEDICRSGLFGPWDKPDLGPMPDGLNPAK
CDPWGLDSFQLFPNFVILFWGQGWYLTYHYWPTSYHSHVFEGTLYFPVPR
TPRERVAQELAAVSFKEYGLQDANTLEATQTMLESRVVDKFVLNDQEILL
RHLHKETAAWIDDYRRMSTAGV
>MAP0127 hypothetical protein
MTNSGEGPMQRAGSPERQGVSTSRSERLREVLRYDLPASLVVFLVALPLS
LGIAIASDAPVLAGLIAAIVGGIVGGWLGGSPLQVSGPAAGLTVVVADVV
AEFGWGVTCFITVVAGVLQVLLGFSRIARAALAISPVVVHAMLAGIGITI
ALQQVHVLLGGSSKSSAWSNVTGLPAQILGAHRPGLVLGLLVIAILVAWR
WVPARLAIVPGPLVAIVVVTIISMVLPFKVSRIELDGSVLDAVRLPSLPH
GNWGAVAIAVITVTLITSVQSLLTAVSIDRMHTGPRTDFNRELIGQGAAN
IASGALGGLPIAGVIVRSSANVNAGAKTRASTIMHGFWVLVFAVPFAGLV
EKIPTAALAGLLIVIGIELLKPAHIETALRNGDLAIYLVTVTSVIFLNLL
HGVLIGLLLAVVVTGWRVVRARIEAEPVGDGWHVVIEGACTFLALPRLTG
VLASIPERTSVTVHLLTNYLDHAAHQAIGDWQRRHCATGGTVEVRDTAEP
AARRRNSHLSLVEQVSSPGGA
>MAP3683 hypothetical protein
MAENHTTTNAGAPAPSDELSLTVGPDGPLLLQDSYLVEQMAAFNRERVPE
RQPHAKGAGAYGRFEVTADVSEYTKAAFLQPGAVTEVFARFSSGNSGERG
SADTARDNRGFSVKFYTTEGNFDLVGSDVPVFAIRDPMKFPNLIRAGGRR
ADNDLHDHNMVWDFWTSCPETAHLVTLVMGDRGIPRTFRHMNGFGLHAFS
WVNSAGEIHWVKYHFKTDQGIQWLPQEEGRRLAGTHPDCCVRDLYEAIAR
GKYPSWSLQVQLMPFADAKTYRFNPFDVTKVWPHADYPPIEIGTMTLDRN
VTDHHAEVEQAAFAPSNLVPGTGLSPDRLLLGRSFAYPDAHRARIGVNHD
QLPVNAARCAVRSYAKDGRMRFVNTADPVYAPNSAGGPQADPARAGEVHW
AADGQMLRAAYTLRRDDDDWGQAGTLVRRVMDDSQRERLVHNIVGHVSAG
VNEPVLSRVFAYWRHVDADIGRKVEEGVRANLNS
>MAP1755c hypothetical protein
MTINASAPQRQGAVAVVDTVTALPAAVAGDGHTADPFEPLTVGAMVDRVS
AIAVEKAAHPWAFLMRSLVGGAMVAFGVLLALVVSTGVKTPGVASLLMGL
AFGMSFVLILVSGMSLITADMAAGFLAVLQRALSIRSYVVLVAVGLVGNI
VGALVFVTVCAAAGGPYLGAFADRAATVGTQKAGQPFWTALLLAVLCTWF
LQTSMCMFFKARSDVARMALAFYGPFAFVIGGTQHVIANVGFVGLPLLLN
LFHPIAARGDIGWGIGDHGLLTNIGVTTVGNLIGGTVFVALPFWIIAHLQ
RRRILSTGALRPDG
>MAP3509 hypothetical protein
MDTQQDGCGPTETPDDIDIDALRQKYAHEREKRLRKEGSKQYIELEDDFS
GYYEVDPYTPVTPREPIREDIDVAVLGGGFAGLLSAAHLKKAGVDDVRII
ELGGDFGGVWYWNRYPGIQCDNESYCYIPLLEELDFMPSKKFADGAEIYQ
HCRNIGKHFGLYDSAIFSTQVHDLRWDEQIKRWRVSTNRGDDIRARFVVL
ASGPFHRPKLPGIPGIKTFGGHSFHSSRWDYDYTGGDSGGNLHKLADKRV
GVVGTGATGIQIVPFLARYAQHLYVFQRTPSTVDARNNTPTDPEWVKTLR
PGWQRERQRNFHAWTFEGMAPGQPDLVCDFWTELGRNTAARVLALDDPAS
LTPEQFMAIREEEDYKIMERLRRRIDTLIDDPATAEALKPYYRFLCKRPC
SNDDYLPSFNRPNVTLVDVSASKGVERATEKGLVANGVEYELDCIIYASG
FEITTEISRRYSIETIAGRDGLSLFDYWRDGYKTLHGMTSRGFPNQFYTG
FTQVGISANIAANYELQGEHIAYIIAEALKRGAATVEPSDEAQQQWCTTI
RETAVDNSAFDAQCTPGYYNNEGGGGGEGIRSHLGEPYGPGFYAFEDLLR
AWRDKGDLEGLVLGS
>MAP4272c hypothetical protein
MHKGATFAVAAVTAAIAPLAACANQQSSQPNTAPLTSSVPGSERLTTQLK
TAEGIPVANASFEFANGYATVTVEAGPNQVLSPGFHGLQIHAVGKCEANS
TAPTGGSTGDFESAGAVYQAPDHTGYPASGDLTALQVRSDGSAKLVTTSN
AFTAADLRTSSGSALILHQNANNLANTPAADSGKRLACGVIAASSATSTT
TTPTTSVTTSTTTVAVPPPSTSTSTSTVTVTGTPTATSTPTTTVTTPPSL
PPGR
>MAP2825 hypothetical protein
MWDSGGMKHGSDSGFDGGFDDFDRNKSRPVLITAAAPSYEEQHRARVRKY
LTLMAFRIPALILAAVAYGAWHNGLISLAIVAASIPLPWMAVLIANDRPP
RSPDEPRRFDNARRRTPLFPRAEQAALEPPPAAQARWQPGGWDGIDRDRP
PFH
>MAP3594 hypothetical protein
MSELSIGIIGAGPGGLALGILLSQAGFGDFTIFDREDGVGGTWRINTYPG
LACDVKSHLYSYSFDLNAHWSRLWSGQPEILDYFQRCADKYGLGPHLRLG
TEIRSAHWDADTQRWRLTTASGHRHHFDVVVSAVGLFTRPLLPELVEEEP
FTGTVMHSARWDHSIPLHGKRIAVLGTGSTASQLIPELAKVAERVYSVQR
SPTWILPKPDRPYTQWERWAFAHLPLAKKLYRTRLWLRSESNISVIEHGS
EKTGQFTNIALGLLEASVPDEELRRKLTPDHPMGCKRLVFSSDYLPALTR
PNVEVLTSPARSLRRRSLVTEDGTEREVDLVVCATGYAAADYLGELDVTG
ERGVTLREVWRDGAYAYLGMAVPGFPNFFMLYGPNTNVGSNSVIFVLEAQ
ARYIVRALKYLRRHRRRYIAVRPAALADFVAKIDRWMVGTVWTTQCSNYF
RAPNGRVVTQWPRSARAFWSMTRRFRPADYRFQAPAMRVPAPASEADAR
>MAP1594c hypothetical protein
MYVCLCVGATNQTVSEVVARGATTSKEIAAACGAGGDCGRCRRTLRAILA
ASAKLETAPSV
>MAP2377 hypothetical protein
MKVPFTWKVTGWFMVGWSPEFPIGEVRPLHYFGEDLVAYRDESGELHILE
AHCKHLGAHLGHGGTVVGDCVQCPFHGWRWGPDGTNRYIPYQPDRPNRGL
RLKVYPVREQYDCVFVWHQPHGKEPQWEMPDIFGKFPQFETDPAAYYRAY
PEFSRRAEREPVHPQIVAENAPDSAHFEYVHHATVTPRVLDWKIVEQEWQ
FVAGWPDANSDDPAALALRFHSHLFGLGGAISVFEGAQNHRLIFTCTPVD
DECSDLFYSIWWPRVPGDTADVPEGKLREVIEKQFLSTVFDDLQIWRYQK
YVEHPPLSKVDAKGYMALRKWATQFYELPPAGTSSPA
>MAP0146 hypothetical protein
MAWQLATAVTGFFAPSQLPPPGDVVAALSDLARHGELWTHLRASGWRVLA
GYASGAAAGLALGSLVGLSATARRLLAPTVAAFRTVPSLAWVPLLLLWFG
IDETPKILMVAIGAFFPVYTTTASALSHIDAHLVEVGRAYGRHGVSLLTA
VLLPAAAPELVNGLRLGLANAWLFVVAAELIASSKGLGFLLIDSQNSGRT
DVMLLAIVLLAGLGKLSDAALGAVETRIARRRG
>MAP3773c hypothetical protein
MSSPAAPRRRRATVKQRTVLEVLRAQENFRSAQQLYQDIRQNQQLRIGLT
SVYRILRALAADRIAETQRAEDGEILYRLRTEAGHRHYLLCRQCGRAVAF
TPVDIEEHTRRLSRQHHYADVTHYVDLYGTCPLCQNTQP
>MAP2336 hypothetical protein
MQHDQLIDLTRRALKLARDRTTDLAPTAHVVDARDYTCVQRHQQDRAMLL
ASPQLVGYVSELPAPGTYCTKTVMGRSVLLTRTSDGSVKAFNNVCLHRQS
QVATGCGTASRFSCPYHAWTYDNTGRLVGLPGREGFPDVALRSAGLTELP
ATEFAGFLWVSLDPGATLDVATHLGPLADELDSWGIGRWSPLGEKVLDCA
INWKLAIDTFAENYHFATVHRQTFATIARSNCTVFDSYGPHHRLIFPLNT
ILELDDIPEDQWNPFHNMVVIYALFPNIVLSVTIANGELFRVYPGDRPGR
SITVHQNATPQDLSDESVAAGVQAVFDYAHATVRDEDYRLVESLQANLES
GARDHLVFGRNEPGLQHRHITWAKALAASTG
>MAP2053 hypothetical protein
MSAQMTPVMQAASEFALIGGIGFTAFGIYLSVRRRRLHPLLLLCISAMSF
SWIEAPYDWAMYAQFPPAIPRMPSWWPLNMTWGGLPLFVPIGYISYFVLP
AVTGTALGRWLSARFGWRRPQTLLVVGLVVGFCWALFFNGFLGAKLGVFY
YGRVIPGLAIREGTVHQYPLYDSVAMAIQMMLFTYLLGRTDAQGPNVIEM
WAEHRSKSRVGASVLSVVAVVVVGNALYGAVFAPHLVTKLGGWVTAGPTG
ELFPGVPNQPR
>MAP3775c hypothetical protein
MRGGRLIWSHATFDIPAGGIVAVIGSNGAGKTTLLNMVLGLIPSATGRLE
IFGRRPGQANDNIGYIPQHYADSSGEAIRAADAVLLGLTGRRWAFGRSTT
SQQTRVAEALAAVEATDLGCRRLSTLSGGQRQRIAIAVALVARPQLLILD
EPLASLDLRSQRDIVALLARLHAELAVTILVVAHDLNPLLPILDSVIYML
DGRPRYVPVGDIMDDTLLTRLYGTPIHVHRARDGALYMRSAL
>MAP1015 hypothetical protein
MMSRIDVVLRSARRRFRRLPAVEVPDAVRRGAVLVDIRPQAQRVREGEVP
GALVIERNVLEWRCDPTSEARLPEAVGDDVEWVIICSEGYTSSLAAAALL
DIGLHRATDVIGGYHALAGAGVLSRLAGGPVGAPLANTGAGERRWI
>MAP2599c hypothetical protein
MSNNITWHEHKISRGEREQLNGHKGCVIWFTGLSGSGKSTVANVVEQKLY
ERGIRSYLLDGDNVRYGLNAGPDLLEERHGPEFAQRFGLGFSAQDREENI
RRIGAVAKLFCEAGIIALTAFISPYVRDRDAIRATLDDGDFQEIFIDTPI
EICEKRDPKGLYKKARAGEIKGFTGIDDPYEAPPRPELRLDGAAKDAETL
AEEVIAHLERVGVIATDGLAHTRGRDEVTA
>MAP1760c hypothetical protein
MEPDSTAVSRRRALGGAVLAGLSGAAIGAVGGGFAGHAVAAGQRGDDHDT
VDLRRSYPFYGQPHQGGIDTPPQRYAMFMSFSLASGAGRTELQTLLARWS
AAAAILQQGKPVGTVQPQVDVQPPADTGEADGLSPASLTVTIGLGPSLFG
DRFGLAARRPAVFTDLPPLNGDNLDPRLHGGDLSVQACADDPQVCYHAVR
NLARLGRNIVSPFWAVLGFGRASAGPGQHTPRNLLGFKDGTRNISSQAEY
DRFVWVDNSDQPWMNGGTYQVVRKIRMLLETWDVDRIGNQQRIFGRTKEE
GAPLSGRHEFDTPDFTAKGPDGNPLIDPMSHVGLAARENNDGIMIRRRSY
NYTDGLDANGQLNAGLLFVSYQKDPQDFIRLQNRLGAHDLLNEYIRHIGS
AIFAVPPAPAEGHYIAQSLFR
>MAP3704c hypothetical protein
MPRREEPSHGLLDPVAKMLRLPFGTPEFIDRIVTGGVNQVGRRTLRMLIT
TWDAAGGGPFAASAIASTGMAKTAEIVQGMFIGPVFGPLLRILGADKVAV
RASLCASQLVGLGIMRYGIRSEPLHSMSVDAIVDAIGPTMQRYLVGDITR
>MAP1039 hypothetical protein
MRRCRVEGRGGGAAGGSARSGARAVPSPGGCGAAPTRVAGRRTSPVTAAS
PASMATGGGGFDPAAWVTSGNRRVPMSSRKYAGIQRGDVMKYAEDGHTRG
LSMPRSRGAVSGLLLVILGAWGALIPFVGPHFNFAYTPDRDWAWSSARGW
LEVAPGAATALGGLLLIVAGNRVAAMLGGWLAVLAGAWFVVGGQLAPLLG
IGSAGDPIAATERKRALLEVTYFSGLGALIIFVGGVVLARTSARLARDVQ
PLASDAPAAPAVEPYRDPAYDPADVSSGALTKPRTSADPEPKRGWRKNRA
GGNAAYLRWPHPQQ
>MAP3732c hypothetical protein
MTRTTTRVTQLDPRTKTVLVLASSIAVMAPGGEVFVPAAVIVGMLLAVAE
QAWVRAAILPSAAGATAAVAYLLPQAIPHPIIGAIGTVAAYLLRLIAVGA
IVIHLVNTTTPSEFTAALRATHIPRAITVSGSVMLRFLPTIVGEARAVSD
AMRLRGIGGTYGMLRHPVCTIEYFTVPLIASSLRVAEDLSATALLRGLGS
AARPTTMYPPRFGKADALIGCIVSALTVTTVLWPVKP
>MAP3733c hypothetical protein
MTATSSTTQSSRRIDVRMSARDLINIGVFGALYIATVFAINVFAFINPLV
MLVALAVSMIAGGVPFMLFLTRVRHAGMVTVFAIITAGLLALTGHPPICF
VITVACALVAEVVLWLGRYRSRTMGVLAYAIYAAWYIGPLLPIFYARDEY
FSSPGMAQMGPRYLEEMERLLSPAVLIAFDLSTVVFGLIGGLLGVRLLRK
HFQRAGLA
>MAP1484c hypothetical protein
MAVETSKPHTGVNATPVPVPWAVQTPDRIPKQRYYDPEFYALEKEMFWPR
VWQMACRLEEIPKPGDFVEYEIHDESVIVVRLDSQTVRAYHNACRHRGVK
LVEGNGNRRTFVCPFHGWCWSLDGRNTFVLRPETFAQENLAAADLRLVSV
RCELWGGCAWINLDDDAPALRDWMEPFASTYDAWRVESLRVEWWQSCRLP
VNWKLATAAFMEGYHVPQTHPQLLPGAQTGEPSAAVHPVVASSLYFMRTL
GEGMAGMTHQNDIRIAEGLQTMRLPSDPAAAMAAWRSALNDAVVDWHRAR
GSGMPDLNELDRRGITDAIGFAFPHHFILPTYSSASSYRIRPLGPEETLF
EIWSLTRLPGDASAGKPTPPEPMAPDDPRWPPIPAQDFSNLPRQQKGLHS
KSFEFMRLSDRIEGLISNFERVIDGFLAGLRYDALLPAIHKTNTTIDVPI
VDLGFGAVEAR
>MAP1087 hypothetical protein
MLGYVLARIGQSAIVLLAVFSLVFWGVSILPADPAAIFVAKGEGYFNPDI
VAQVKAFYGYDRPLWVQYFAQLNQVLHGHFGFSLSSGQAVTDRIGGVIGE
TLKLAATATGFAVLFAVSVTALATTCAPVRSVLRAIPPLFGAVPTF
>MAP3073 hypothetical protein
MTGLGYTLPAALAVVIVCAAELTVLRTGLFRRPAYWLSMLIVLGFQVPVD
GWLTKRSSPVVIYDDRQISGLRFPFDIPVEDFLFGFAMVTAVLLLWERRR
ARR
>MAP0204c hypothetical protein
MQVTSVGHAGFLIRTQAGSILCDPWVNPAYFASWFPFPDNSTLDWDQLGD
CDYLYVSHLHKDHFDAKNLAEHVNKDAVVLLPEFPVPDLRNALQELGFHR
FFETADSVKHRVGGLDVMIIALRAPADGPIGDSALVVSDGSTTLFNMNDA
RPVELDMLASEFGHIDVHLLQYSGAIWYPMVYDMPARAKESFGIQKRQRQ
MDRARQYLAQVGATWVVPSAGPPCFLDPELRHLNDDHGDPANIFPDQMVF
LEQLRAHGQGGGLLMIPGSTADFSGSTLNSLHHPLPTAEVEAIFATGKAD
YIAAYAERMAPVIAAERAGWAPATGEPLLEPLRALFEPIMSQSDEICDGI
GYPVELVLGPERVILDFPKRTVREPIPDEKVRYGFAIAPELVRTVLRDRE
PDWVNTIFLSTRFKAWRVGGYNEYLYTFFKCLTDERIAYADGWFAETHDN
SASITLDGWEIQRRCPHLKADLSKFGVVEGNTLTCNLHGWQWRLDDGRCL
TAKGHQLRSSRA
>MAP1309 hypothetical protein
MMPSQPLARGGRHRRHVAAGTAVVLSGALGYIGLADPHDPASIYPPCPFK
WLTGWNCPFCGGLRMTHDLLHGELWAAVHDNVFLLAAVPTLAAFLLVRRA
RGRRSLPAAAVPAVVVATLVWTVLRNLPAFPLFPTVLGG
>MAP0999c hypothetical protein
MTVTAIDPAEQAHPAPSAGTKRVQGGLLDPKMLWRSTPDALRKLDPRTLW
RNPVMFIVEIGAAWSTVLAIVGPTWFAWLTVIWLWLTVLFANLAEAVAEG
RGKAQAETLRRAKTQTMARRLRDWAPGSTGIEEAVSATALQQGDIVVVEA
GQVIPGDGDVVEGIASVDESAITGESAPVIRESGGDRSAVTGGTTVLSDR
IVVQITQKPGESFIDRMIALVEGANRQKTPNEIALNILLAALTIIFVFAV
ATLQPLAIYSKVNNPGVPDTQALNTSGVTGIVMVSLLVCLIPTTIGALLS
AIGIAGMDRLVQRNVLAMSGRAVEAAGDVNTLLLDKTGTITLGNRQAAAF
IPLAGVPPEELADAAQLSSLADETPEGRSVVVFAKQHFGLRARTPGELSQ
AQWVAFSATTRMSGVDLDGHSLRKGAASSVAEWVRSQRGSVPHQLGEIVE
RHLRRRRHTTGRRRERRRPGTGARCHPPQGRGEAGHAGTVRRDAADGHPD
GDDHRR
>MAP0145 hypothetical protein
MRPRHLTALAVVAAVTAVSGCGSSSGTTTTKDLHLDYAYYNPLSLVIRDQ
QLLEKKGYHVTWVLSQGSNKANEGLRSKALDFGSTGGSPALLARANGTPI
KTVDVYARGEWTALVVAKNSPINAVADLKGKKVAVTKGTDPYFFLLQSLA
TAGLSPADIEIVNLQHADGKTALERGDVDAWSGLDPFMAETIQQQGSRII
YRNPDFNSGGVLNAREDFITAHPDSVQLVVDTYEEARKWAKTHPAELAAL
LASQATVSQSVAQEELGRTALDIDPVPGDWLRAVLTRIEPLAVADGDIKS
DDAGRNALNTLIEPKYARQAR
>MAP0488c hypothetical protein
MSLQAAAAVDHDADTVSLRGARLAFGDRVLWEDLDLSVSRGEFVAVLGPN
GSGKTSLLKVLLGQLPLSAGVGLVDGKPITSGSGRIGYVPQHRPMERDVM
LRGRDLVRLGLDGGRWGAAPLRPRERARRRATVDQALRQVNGELLADVRV
GVMSGGELQRMRVAQALVSDPLLLLCDEPLLTLDPANAKLVSALLERRRR
DAATTVIVVTHEINPILPYVDRVLYLVDGRFRIGTVEQVMNSETLSALYR
ADIQVVKVKGGYVVAGEHTDGHG
>MAP1075c hypothetical protein
MYARSTTIQAQPLSIDIGIAHARDVVMPALTEIDGCVGLSLLVDRQSGTC
IATSSWESIDAMRGSAARVAPIRDRAALMFDGSARVEE
>MAP3076 hypothetical protein
MNIRALLRQLRPSVRAKDWPLQVIPRTPWADQRPTFREAQPAVIDAALQR
CRRQPTGNWYAFAASTHVERGRPLGARVAGVDLVAWRDARGALCVGPRSC
PHLGADLATGTVCGGTLICRWHGLPLDGRAREFGWAPLPSHDDGTLAWVR
LDAVGGETPSPRPVIPPRPAGHTLASVAHLVGVCEPADIIANRLDPWHGA
WFHPYSFTRLEVLATPTEDANRFLVAVTFRMGRLGVPVVAEFDCPEARTI
VMRIVDGEGTGSVVETHATPIGSGPDGRPRTAVLEAVIAHSDRPGFARAL
WAAPLLTPVMRYAAGRLWRDDLAYAERRYEVRSQNR
>MAP2257 hypothetical protein
MAVFGSVARRSGSGGRAASVVTCQPSERKIMSATTIDRTTGRDGLLRLAM
RADAAISGLVGLAGIPLVGWLAEVSGTTTAFEYGMSAFLIGYGVLVFGLA
ALPSVRRAGMAVIIGNLLYTAAAVVLVLADVFPLTSTGVVLNLAAGVYTL
VFAELQYFGWRRARA
>MAP0616c hypothetical protein
MAPLITLVVGSLVAWVVGRLGVAYVDGWAPALAVGLAAMFVLTGIAHFAP
PLRADLVAIVPPRLPAPGLLVSLTGVLELLGALGLLLPATRAAAAGCLLV
LMLAMFPANIHASRMPDPPKSMTTRLPLRIGMEIVFLAAAVAVALGGR
>MAP0180 hypothetical protein
MTEHLDVLIVGAGISGVSAAWHLQNRCPTKSYAILERRADLGGTWDLFKY
PGIRSDSDMFTLGFRFKPWRSAKSIADGASIKAYIKEAAVENRIEPHIRY
RHRVVAADWSDADNRWTVTVEHDGQRSEITCSFLFACTGYYNYDEGYSPT
FPGAEDFGGTIVHPQHWPEDLDYASKRIVVIGSGATAITLIPALVNSGAG
HVTMLQRSPTYIGSLPGVDPFAERANRLLPDRLAHMANRWKAIAFSTFQY
QLSRKAPAYMRKTLMTMAKRRLPEGYDVEKHFGPRYNVWDERLCLAPDGD
FFRTIRHGKADVVTDTIDRFTTTGIRLNSGEELPADIIVTATGLNMQLLG
GVTPTRNGEPVDLTSLMTYKGLMFSGMPNFAITFGYTNASWTLKADLVSE
FVCRLLNYMDAKGFDFVEPQHPGEDVDELPFMDFTPGYFRRSMHLLPKSG
SRAPWRLKQNYFFDMRTIRRGRVDDEGLKFAKKRAPVAV
>MAP1761c hypothetical protein
MVRRIAGATCRSRESAWPAAVLVATTMLSVTACGHSGDNANHAAQSKPGG
GNAVKITLTNSAGKDGCALDTTNVPAGPVTFTVANTNAPGISEVELLRDQ
RIVGEKENLAPGLDPVSFTLTLDGGSYQLYCPGASTEYQTLTVTGKAPAT
PTGTIATVLSQGTKDYAAYIVNQIGQLNDGAKALDAAVQAGNLDAAKAAY
AKARLYWERSESTVEGFVLPGFAVGDNAGNLDYLIDMRESTPVDGKVGWK
GFHAIERDLWQAGAITPGTKALSTELVGNVGKLHGIVATLQYKPEDLANG
ASDLIEEIQNTKITGEEEAFSHIDLVDFSGNVEGAQQAYASLRPGLEKID
NNLVHQIDQQFQNVLATLDGYRDPGALGGYRTYTPALKASDAPKLTAVIQ
PLHQSLSTVAQKVVSAG
>MAP3682 hypothetical protein
MTENFTTTNAGAPAPSDELSLTLGPDGPVLLQDFYLIEQLAAFNRERVPE
RQPHAKGTGAFGRFEVTNDLSAYTKAAVFQPGTKTDVFVRLSGNAGERGS
ADTVRDTRGFSVKFYTTEGNFDLVGLDFPVFVIRDPIKFPQMVRSAKRRA
NNDCRDHNMQWDFWTLSPESAHQVAMIMSDRGIPKTFRHMHGFGLHTFSF
LNAAGEISWVKFHFKSNQGIEWLTQEEGDRLAGTDPDYCIRDLYEAIERG
DHPSWSVKVQIMPFEEAKTYRFNPFDVTKVWPHADYPLIDLGTMTLDRNV
TDHHTEVEQVTFAPHALVPGIGLSPDKLLLGRSFAYADAHRYRVGANHNQ
IPVNAPRCPVRSYSKDGQMRFVNSTDPVYAPNSYGGPKADPDRASVVKWA
VDGGMMVRAPYTLRPDDDDWGQAGALVRDVMDDAERERLVHNIVHHVTDG
VKEPVLSRVVEYWYNIDADIGKRVEDGIRAANLGR
>MAP1792 hypothetical protein
MRGVWLQRRALRAGLSVRPMIVTLIGYLVLIDGMLNSLGWALDLVANHTL
INRVLMVGWGNMFDAGYFWHYNELWIGGAAGPGEKAYVAGLILTVFSMRV
AAAIGFLQMKRWGHQWMVVTCWMGVVIWSAYVFNMTMFADVRYAGVVFPV
IGWWLYDIFYITPFLAIPYLHTVNRETFSD
>MAP3028 hypothetical protein
MGTNQRASIVMSDDEIADFVVKSRTGTLATIGRDGQPHLTAMWYAVVDGE
IWLETKAKSQKAINLKRDPRVSFLIEDGDTYDTLRGVSFEGVAELVDDPD
VAHRVGVSVFERYTGPYTDEMKPFVEQMMNKRVCVRIVARRARSWDHRKL
GLPPMPVGGSTAPAVLGTDR
>MAP0720c hypothetical protein
MPHFPKPAAGSWTENYPELGTGPVDYTDSIDPAFFEAEREAIFKKTWLNV
GRVNRLPRTGSYFTRELPSAGKGTSVIIVKTKDGSVKAYHNVCRHRGNKL
VWNDFPNEETSGACRQFTCKYHAWRYSLDGDLTLVQQEEEFFDLDKSNYG
LAPVRCEVWEGFIFINFDDNAAPLTDYLGPLAKSIEGYPFGEMTETYSYR
AEVGSNWKLFIDAFVEFYHAPILHQGQYTKEEAAKIQKFGYEALHYELAG
PHSLQSTWGGQAPPPDMSMVKPMDQVLRSGLFGPWDKPEIIEKLDLPPGV
NVKRVPQWGIDSWLFYPNFMLLIWEPGWFLTYHYWPTAVDRHIFESTLYF
VPPRNARERLAQELAAVTFKEYALQDANTLEATQTMIGTRAVKEFLLCDQ
EVLIRHLHKTTGDYVKEYQNNGHVAAR
>MAP3350c hypothetical protein
MDVKEVLLPGVGLRYEFTDHKGDRVGIIARRSGDFDVVVYAREDPDEARP
VLHLSNEEAEAVAQILGAPRIAERFTELAKEVPGLETGQVHILAGSPFVD
HPLGDTRARTRTGASIVAIVRDDEVLASPGPSEMLHARDVLIVIGTEDGI
AGVEKIIDKG
>MAP0933 hypothetical protein
MTIQTVMIVWSGRLTAKEKRVAPINSPVSGTDEALTRRGLRHALDKTTDL
AERELRVPLHYYRDPKITEIEEAQILRRVPLAIVPSAQLPNTNDYVVRSV
LGDSLLVTRDRSGASHVLLNYCRHRGAMPACGSGNTARFVCPYHAWTYKN
TGELFSVPGKAGFDSMNTKDYGLVELPSEERHGFIWAVLTADATIDLDAH
LGDFGAELALWNYSSYGYHTQREFTSEVSWKGALEAFAEGYHFPYVHGQS
LIGQNTLANTMVYDEFGKHHRIGFPFTWITNAATDPAASLEPLANMGVIY
WVYPNLILANSPVGLEIIDMLPAGAPTRCTVRHSWMARVPAADDEMRAAY
DAVFEGVHAAVRDEDFAMLPQCGEGVRHGQHDHMIIGRNEIAVQHMIRVF
AHELGVALA
>MAP1127c hypothetical protein
MSSPAVPDHHTLIIGAGFSGIGAAIKLDKAGLPDYRVIEAGDGVGGTWHW
NTYPGIAVDIPSFSYQFSFEQSRHWSRTYAPGRELKAYAEHCADKYGIRS
RIRFNTKVLAAEFDDEPRLWRVHTDPGGTVTARFLISACGVLTVPNLPDI
DGVDSFGGITMHTARWDHGQDLSGKRVAVIGTGASAVQVIPEIAPIVKSL
TVFQRTPIWRFPKLDVPLPAPARWAMRIPGGKSVQRLLSQAYVEVTFPIS
AHYFTVLPLAKRMATLGKSYLRQQVRDPEVREKLTPKYAVGCKRPGFHNG
YLATFNRDNVRLVTEPIDKITPDAVATTDGEHHRIDVLILATGFKVMDPD
NVPTFAVTGPGGRSLSRFWDEHRLQAYEGVSVPGFPNLFTVFGPYGYVGS
SYFALIEAQTHHIVRCLKRAERLGAARVEVSEEANARYFAEMMRRRHRQI
FWQDSCKLANSYYFDKNGDVPLRPGTTPEVYWRSRRFNLDDYRFSA
>MAP0388 hypothetical protein
MQSAASVRDTMDSAKRWVTLGFTWNGLRALGVPGDALASFPEEFRQGMAA
RADILGDTGRNHPDNWVGGLAGADLHAIAILFARDDAEHARATNAHDDLL
KRCQGVRRLSHLDLNATPPFNYAHDHFGFRDRLSQPVIEGSGEEPTPGSG
APLKAGEFILGYPDEVGPVANQPEPEVLSRNGTYAAYRRLREHVAVFRDY
LRSVAGADRQEELLAAKLMGRWRSGAPLVLAPDEDDPELGADPLRNNDFN
YKEMDPFGYACPLGAHARRLNPRDTAHNMNRRRMIRRGATYGPALPEGAP
DDGEDRGIAAFIICASLIRQFEFAQNVWINDRTFHELGNEHDPICGTQDG
TLDFTIPKRPIRRVLKGLPAFTTLTGGAYFFLPGINAMRYLAALGERS
>MAP3089c hypothetical protein
MPENDSAAADPDLLIELRDVSLRRGGNVLVGPLDWAVELDERWVIVGPNG
AGKTSLLRIAAAAEHPSSGVAFVLGERLGRVDVTELRSRIGLSSSALAQR
VPSDEVVRDLVVSAGYAVLGRWRERYEDIDYRRAVDMLESLGAEHLADRS
YGTLSEGERKRVLIARALMTDPELLLLDEPAAGLDLGGREELVARLADLA
ADPDAPALVLVTHHVEEIPPGFSHCMLLSEGRVVAAGLLTDVLTSENLST
AFGQAIALDVVDGRYFARRVRTRAAHRRQL
>MAP4283 hypothetical protein
MPTSEYQVSGMSCGHCEAAVHSEVARIPGVDGVSVSADTGRLVVTSAVPI
DTDAVLGAVDEAGFQAVLVA
>MAP2733c hypothetical protein
MVGRPRAGRLQKVGSKVVAAIGRQEWMDRPSYRFEHLLSFAYNGLGSARN
TVTNALNGVWLGHPVHPPLASLTSGALGTTVALDALSVLPGQPASEVVGA
SRFATRALGVGILASLGSAVTGVTDWQHTHEEDRRVGAVHGLLNVAATAL
YVQSWFDRRRGRHGRGILLTALGYGITVAGSYLGGALVFESGIGIDQSGP
RLRTSAWTPVLPASSLNGKPVRVEVDGVGLVVCQTKPGEVAAYGEFCPHL
AAPMADGWLDRGRLVCPWHGSWFAAESGEVVRGPAAAPLPCYQARVVDGV
VEVRGEQQPAPGGAVGIAKGGAS
>MAP2765c hypothetical protein
MVAAQGSSMLTAADFAAQWADVPPWEPPDEPPQRNGQRQQQASAEPTTWE
AFDLGPYLRGEIERPHPGIGISRSDGQRSLYPGREHAIVGETESGKTWFA
LGCAAAELNAGNDVVYIHYEEPDATSTVEKLCLLGVDPAVIKARFRFVAP
SRPVREEWLNALLDPSPTLVIHDGVNEAMALHGDEIKAVEGAAAFRRRLI
LPCLRVGAATLACDHLPMVRDGSRRDAYGSVHKGNALDGARFVLENSAPF
GRRLRGVSYVFVTKDRPGHLRANGRATKSPGKTFMGTLVVDDSQAFGPDF
TMRFFAPRDDDVPESDPNAELADAVFRVVAAAPDHAVGSMRLLFAELRNV
DIQFRDDDVRDVVDDLVVSGRLVEISGKRGAKGFRAVVEDADGDST
>MAP1137c hypothetical protein
MGRAGPGHQAPGELVTAQTGRRVAISAGSLAVLLGALDAYVVVTIMRDIM
TDVHIPINQLQRITWIITMYLLGYIAAMPLLGRASDRFGRKLVLQVSLAL
FMVGSVVTALAGHWGDFHLLIGGRTIQGVASGALLPVTLALGADLWAQRN
RAGVLGGIGAAQELGSVLGPLYGIFIVFLFHDWRYVFWINVPLTLIAMVM
IQFSLPSHEKVEQPEKVDLVGGVLLAVALGLAVIGLYNPEPDGKQILPSY
GLPLVLGAVVVGILFLLWERFARTRLIEPAGVHFRPFLAALGASLFAGAA
LMVTLVDVELFGQGVLGQDQTQAAGLLLWFLIALPIGAVLGGWIATRVGD
RAMTFVGLLIAAYGYWLIHYWRQDVLSQKHNVLGLFSVPVLHADLLVAGV
GLGLVIGPLTSAALRVVPSAQHGIASAAVVVARMTGMLIGVAALSAWGLY
RFNQIVANLTAAIPPNASLLERIAAQGTMYLKAFAMMYGDIFAATVVICI
AGALLGLLIGGRKEHAEEPEIVEPQAVSLGER
>MAP1336 hypothetical protein
MQGVAGGLLAGLGYAVINSALPRWLWTRGSALVSAMWGVATVVGPATGGL
FAQLGIWRWAFVVMAVLTALMALLVPVALARVDPAPAIPRMKVPVWSLLI
IGVAALAVSVAQIPHNTAATFGLLAAGIMLVGLFVIVDWRMHAAILPPSV
FSPGPLKWIYLTMGVLMAAAMVNTYVPLFGQRLAHLTPIAAGFLGAALAL
GWTVSEIVSASLENPRTVGRVVMVAPLVAASGLALGAVARHGDGSAWTAA
LWAVALLVAGTGIGMAWPHLSARAMASVNDPAEGGAASAAINTVQLTSAA
IGAGLAGVVVNTATGGDEMAAHLLFTVFTALSAAGVAVSYAATRATRQAQ
PVGNVG
>MAP3420c hypothetical protein
MMDLFAPPEVTSTLIHTGPGAGSLIEAAAAWQRVAVELENSVSSYASTLS
SLIESWDGPSAMAMLQSVQPYLLWLRETAQQSAQLANSAEAAATAFGTVR
STVVHPSVVSANRTRLAQLLATNRFGTNTAAIAETENEYQTMWANNSAAL
SRYQAASSQATSPLTQFNSPLAVTDPGGTANQQAAVMKASVDSSGSSVGS
VLNDLNMPGGFDPNAGWFNYFSTWGNQFISSGFPINLLGVWAQLATAQGV
ASVGGDIGSGLSEGLGATTASLANAIKGIGAGAVAPSGAMGVGVSLGKLT
APPAVVGLLPGTQTGVQLASAASPLPAAESGFPLMPMMVPPPTTSAGTGW
RKRKQQKYEDVAYGREVKGKVMPRNPSAG
>MAP4060c hypothetical protein
MQDIQRADDALPTASRAERLRGVIRHDLPSSLVVFLVALPLSLGIAIASN
APVLAGLIAAIVGGIVVGALGGSPLQVSGPAAGLTVVVAGLVSDFGWGVT
CFITVAAGAVQVLLGLSRVARAALAISPVVVHAMLAGIGITIALQQTHVL
LGGKSKSTAWHNLIGLPGQIIGAHRPGVLLGVLVIVILVAWRWVPAKVRR
VPGPLVAIVAVTVISVVFPFHVRRIDLDGSPLDALQLPDLPHGNWSGVAV
GVITVALIASVESLLSAVSVDRMHNGPRTDFNRELVGQGAANMISGAVGG
LPVTGVIVRSSTNVNAGARSRASAIMHGVWILLFTIPFAGLVDEIPTAAL
AGLLIVIGIQLLKPAHIETAMKHGDLAVYVVTAVSVIFLNLLHGVMIGLA
LAIALTGWRVIRAKIEAAQLDGEWRVTIEGAACTFLALPRLTRVLASVPR
GATVTVAIAVHYLDHAAHQAITDWQRQQEATGGTVRIEGAVVATGRRDEP
QVEAEMPGAAA
>MAP4098 hypothetical protein
MARPWNGLWLPPDAACMLGRMTRNQLTEQIVVARLAKGLTWQELADAIGR
PLLWTTSALLGQHPIPAELGRILVDKLGLDESAVPVLAAPPMRGGLPTAV
PTDPTIYRFYEALQVYGGALKEVIAEQFGDGIMSAINFSVDLQKKPHPSG
DRVVVTFDGKFLPYQWVSSEQ
>MAP3227c hypothetical protein
MRQSPSASARRSHHGDGKHSPAPTGSLPCNHVTLTSPAAASAGDLPARPG
LRTVAAGSMIGTTIEWYDFYLYATASALVFKPLFFPNISPSAGTLASFAT
YAAGFGARPLGAVLSGHFGDRLGRKTVLVAALLVMGLVTTAIGALPTYAE
AGLAAPALLASLRVVQGLAVGAEWGGAAVLSVEHAPPGRRGLFGSFTQLG
SPAGMLLATSVFFGVRKATGPAAFLGFGWRIPFLLSIFLVAVGLFVRLRL
TDAEVFDRLRSRDELARLPIVQVLRTDARNVVITTGLRLSQIGLFVLLTT
YSLSYLQDSFGKGSGVGLVAVLISSALGFLSTPGWALLSDRVGRRPPYLF
GALVSVVALVLFFVAAGTGSAVLVVVAIVFGVNVVHDAMYGPQAAWFAEL
FDTRVRYSGSSLGYHIGAVLSGGFAPLIAASLLVAGGGRPWLIVGYFAVL
AAITVGAACAARETRGEPIG
>MAP0619c hypothetical protein
MLSNAMDEAPYAAAKTPPHAPTGQPGATEREYPDKLDAALLRISGVCILA
TVMAILDVTVVSVAQRTFIDQFSSSQAVVAWTMTGYTLALATVIPITGWA
ADRFGTKRLFIGSVLAFMLGSLLCALAANVLQLIVFRVVQGIGGGMLLPL
GFMILTREAGPRRLGRLMSILSIPMLLAPIGGPILGGWLIDTSSWRWIFL
INVPIGLLTVALAAVVFPRDHPARSETFDAVGVLLLSPGLATFLFAVSSI
PGRGTVADRHVLIPAAMGLTLIAGFVGHAWHRADHPLIDLRLFRNPVLTH
ANVTMLVFATAFFGAGLLLPSYFQQVLHQTPMQAGVHMIPQGLGAMLTVR
LTGPLVDRQGPGKVVLVGIALITAGLGAFAFGVARQAPYLPTLLAGLAIT
GLGMGCTMMPLSVASVQALAPHQIARGTTLMSVSHQVGGSMGTALMSMIL
TNQFNRSPNIVAANKLAALHQKAAAGGTPIDQSAIPRQSLAPGFWGNVLH
DLSHAYTAVFVIAVALVVCTIIPASFLPKKPATETAGK
>MAP3987 hypothetical protein
MRLGVLDVGSNTVHLLVVDAHRGGHPTPMSSTKATLRLAEATDSAGKITK
RGAEKLISTIDEFAKIADSSGCEELMAFATSAVREAGNSEEVLNRVRKET
GVELRVLTGVDESRLTFLAVRRWYGWSAGRIINLDIGGGSLEMSSGLDEE
PEVALSLPLGAGRLTREWLPDDPPGRRRVAMLRDWLDAELAEASENILEA
GTPDLTVATSKTFRSLARLTGAAPSGAGPRVKRTLTANGLRQLISFISRM
TTADRAELEGVSAERAPQIVAGALVAEASMRALSIESVDICPWALREGLI
LRKLDSEADGTALMEPSVRNAGGQVVDRNQNRSRGDKP
>MAP2066 hypothetical protein
MSSAGENTRRAEPVEPRGAAVLDSAHLGDIEGAFGRIRVGETEHARTWKT
RLLTLLAIVGPGIIVMVGDNDAGGVATYAQAGQNYGYSLLWVLLLLVPVL
IVNQEMVVRLGAVTGVGHARLINERFGRGWGWFSVGDLFLLNFLTIVTEF
IGISLAAEYIGVSKYVVVPVSAAALVAIMASGSFRRWERAMFIFIAITLL
QIPMLLMSHPQWGRAAKSFVVPSISGGVSSDAVLLIIAIVGTTVAPWQLF
FQQSNVVDKRITPRFMGYERADTVLGAFVVVIGAAALVMTGDWAARSTDT
VGGFTDAGATAHLLGQHRQVLGSIFAIVLMDASIIGAAAVTLATSYAFGD
VFGLKHSLHRGFADAKQFYLSYTAMVVVAAAIVLIPGAPLGLITTAVQAL
AGLLLPSASVFLLLLCNDREVLGPWVNRAWLNWVAGLIVGTLLLLSGILM
ATTLFPDLNVVAVAGYLTLALIILAAGAAPVLRWLARRQPARPGPRLPAR
GVDRSTWRMPPLALLEPVTWSPGTRLAMIALRGYLVVGALLLVVKAIQLS
R
>MAP0144 hypothetical protein
MVSTNPAPARVTLRHVDRTFGTHTVLRDVDVEIEPGAVVALLGASGSGKS
TLLRLVAGLDRPSGGRIEIDGKAVRGIDPRCAVVFQEPRLLPWRSLAANV
AFGLPRGIERSERLAAVQRWLDVVGLREFAGHYPRQVSGGMAQRAGLARA
LARQPSVLLLDEPLAALDALTRLRMQDLLDAVQQRAGTTTILVTHDVEEA
VLLADRVLILRGEEGGAATHDVAIPKPRDRGDPRIAALREQLLEEVGVPR
RGYAVEQEAKTS
>MAP2479 hypothetical protein
MRDDSPRMSPRRHIIVSGDDVLATTIAEELNRAGATIVKLPSEELAGADL
ARASAIVCAGRDDAKNLEIALLARKTNPHVRVVARLGNDVLRGAVAADNG
PGAILDVADLAAPSVVEACLSSSTHPVRAAGIDFLVSGAEAPRDATLREI
YGDLAPVAVIHGNDGATPGEVVPCPGRDHRVRAGDWTAMIGSADELAARG
IRTPRPPATRSRQTWLRRVLDAARAMRDDVNPMLFPAILLALTLLLVSTV
IVHFSYTKPRLSWLDALYFTAETITTVGYGEFTFAHQSAWLRIFAVGLMF
AGVTTTALLVAFLADLLLSRRFLQSAGLRRARHLRNHIIVVGLGSFGSRV
VADLTAAGYDVAVIERDENNRFLSTAAELDVPVIFGDATLRQTLESARVD
RARAVAVLTQDDMVNIEIGIVLREMLGPRVMPEVNRPDVPIVLRIYDRTL
GDAVAKRFGFENVRSTVDLAAPWFIGAAMGLQVLGTFSVGQRSFMVGAMH
VAAGSELDGLRMFEMSTQTRVIAITRRDTPVELHPRRDAWLRAGDTVYLV
GPYRELLETLRKGQPPQQPTNEERPADKATT
>MAP2081 hypothetical protein
MADSPAATSPTLVAKQTNVRRPRWDRDHPRYKWVALSNTTLGMLLAMINS
SIVLISLPAIFRGIGLNPLAPANIGYLLWMLMGYLVVTAVLVVFVGRLGD
MFGRVRIYNAGFAVFTVAAIALSFDPFPLTGGAVWLIGWRVVQGVGGAIL
MALSAAILTDAFPSNQRGMALGFNMVAAVAGSFLGLLFGGLLSEWDWRAI
FWVGVPVGVLGTVWGMRSLHELGVRTPGPLDWPGTVTFGVGLTVVLVGIT
YGIQPYGGHPTGWTNPWVLGSIAFGLLLLIVFCFIELRAPQPMVNVRLFR
SAGFGMGNLANLMSSSGRGGLQFMLIIWLQGIWLPLHGYRFESTPLWAGI
YMLPTTIGFLIAAPVAGWLADRFGARPFAVAGMLLMAVTFIGLLMIPVNF
DYRVFALLIFLNALGGGLFAAPNTAVIMSSVPPRDRGAASGVRSTFFNAG
SALSIGVFFSLMVVGLAGTLPHALSSGLQQQGVSAAVAQDAAALPPGGQP
VRGVSGLQPDRRIARTVARTAATRRQCRDTDGRTRRVAEDPRRSAAVRRV
RWGRRRAS
>MAP2131c hypothetical protein
MKKISGSPCIATLLMGLPVLAMTACSSPQHASTQPGTTPPVKSAAPTSSG
ATTTPAPGGGALTAELKTPDGRSVATATFDFTGGYVTVTVKTVANGVLTP
GLHGLHVHEIGKCEPNSVAPTGGAPGNFLSAGGHYQAPGHTGKPESGDLA
TLQVRQDGAAYLVTTTDAFTRDELLAGNKTALMLHGAEDTENAMDRVACG
VIGTG
>MAP2598c hypothetical protein
MNYTGHNGKPLVERVSTENAAARIEGLPRVPISKATAHEVISLSYGFFTP
LTGFMGRREVDATLDEFALPDSTLWSIPIVFDMSADDIAERDVKEGASVV
LDYLGVPMAILDVTEIYEYDLERMAEKTYGTTDPRHPGVKKTLGYHNRFI
GGDITLINEPVFNEPFKSFWLTPRQHQDALAAKHWHRVVAHQTRNVPHTG
HEALMKQAWLAANEDQPVDSLNTGVLVNAIIGQKRVGDYIDEAILLAQNA
LRTSGYFRENVHMVSFTLWDMRYAGPREAIFHAILRTNLGCTHHMFGRDH
AGVGDFYHPYDSQNILKQYRNQLGIKPVFLRENWYCPVCLEVTNSALCGH
EAQAQSFSGSLIRSILTDEVKPTQKVMRHDVFEVVMECAAKHGQGSPFVT
EEYLANRLPVFTLNQLEGS
>MAP3774c hypothetical protein
MTVRVLALQYEPHWWSILTSGFMTNALIGGTIVALAAGLVGYFVVIRQSA
FAAHALAHIGLPGATGAVLLGVPVAAGLGVFCVGGALAIGVLGKRAADRE
VVTGTVLALAIGLGLFFNSLATKSSGTMTNVLFGNLLAISRDQLAGFAIL
LAVLALIVGIIYRPLLFASVNPVVAEAKGVPVRALAMIFMALLGLTVTMA
VQAVGTLLLFALVVTPAATAIMLTPRPSMAMLVSTAIGLSSVLLGLGASA
MFNLPPSFPIVVLACGIWSAVWASNHRHRVIAKAVDEATNAPPALMSSTE
PTATTRK
>MAP3690 hypothetical protein
MARMPAPRSDRNALDFFCAVIIVGVLAGIAGVATTLVLRVVQHATYHYSF
GALLAGVAASSPIRRVLGPMVGAALAGLGWWLLRRRTDVPPLAETIARSE
RVPRLAWSIDAVLQVLLVGSGASLGREGAPRQFASALGDFGIGWLRRLSS
RDREILLACAAGAGLGAVYAVPLAGALFAVRILLRTWRLRAVGAAFLSSG
VAVAVGSAVTGDQPNLRWPVEESTYLLTAHGLLLAPVALAVGLVFNRLMA
VARPARPMRTWTLIPALAGAGLVTGVCSHWWPELPGNGRSILTVSLASGM
TLASALAILLLKPVLTALFLRAGGAGGLLTPSLATGAAAGAALVLTINWA
TASQLHVPAVSLAGAAGVLAVTQGSPIWAAIFVWELARPPLWLFLLFLLT
ATAAHGLKVLAQGRTTTHAG
>MAP3136c hypothetical protein
MTRSGRPPEGSWTEHYPELGTGPVSFKDSTSPEFYELEREAIFKRAWLNV
ARVEELPRVGSYLTKEIEAARTSVIVVKGRDERIRAFYNVCRHRGNKLVW
NDFPGEETRGTCRQFTCKYHGWRYDLTGALKFVQQESEFFDLDPAEYGLR
PVHCDVWNGFVFINFDPQPRQSLREFLGPMITGLDGYPFDKLTERYDWVA
HNNSNWKIFADAFQEYYHVPALHSQQVPPEVRDSNAVFTCGHFQLDGPHR
LVSTAGRRRWLMPPEYMYPIERATRSGLVGPWRTPDIGELPAGLNPGGIE
QWGISNFQIFPNLEILIYGGWYLLYRYWPTSHNTHRFEAYTYFHPARSVR
ERIEHEVAAVVLKEFALQDAGMLGGTQAALEYGVVDDFPLNDQEILVRHL
HKVVVDWVDAYRRERETVGV
>MAP2784 hypothetical protein
MHVINGVRDPAASFPLDTATDDAGERRQANRAVAVSAAGLALTGLVELVI
AVVSGSVALLGDALHNLSDVSTSALVFVGFRASRKLPTERYPYGYERAED
LAGIGVALVIWGSAVVAGFESVTKLLRHGGTGHVGWGIAAAVVGVVGNQL
VARYKLVVGRRIRSATMVADAKHSWLDALSSAGAVLGLIGVALGWGWADA
VAGIVVTGFICHVGWEVTADIAHRLLDGVDPDIVTTAEAVAVSVPGVTHA
HARARWTGRTLRVEVEGFLDAATSLSDSDRIGRSVAAPWPRGCRRCRASP
GRHARPENPSRPAGFEVSPGRSARAAAGRSPTTGPTRRRRRHRVTRPRRV
RRRPRRRAPARRGTWPASAAARTATRPDPRGSTAHRRRWFGPAAGRGPRP
GAAQRNTTHRSAIPAR
>MAP1922c hypothetical protein
MLVVSTDQAHSLGDVLGVPVPPSQAELVRVLADLETGRAEAGGGFLDALA
LDTLALLEARWRDVVATLDRRFPDSELSTIAPEELSALPGVQEVLGLHAV
GELARSGRWDRVVVDCASTADALRMLTLPATFGLYVERAWPRHRRLSLTA
EDARSAAVVELLERVSASVEALSALLTDGDLVGAHLVLTPERVVAAEAAR
TLGSLALMGVRVEELIVNQVLLQDDSYEYRNLPEHPAFYWYTERIAEQQS
VLEELDAAIGEVALVLTPHLSGEPIGPKALGALLDAARRRGGAAPPGPLR
PTVDLESGTGLGSIYRMRLALPQLDPSALTLGRVDDDLIISAGGLRRRVR
LASVLRRCTVLDAHLRGSELTVRFRPDPEVWPK
>MAP3132 hypothetical protein
MTAPGAARPQKRAGSLRPGELAQASVMGALCAAIAIIAVVLPHGGGLGLL
GSVPTGLLAYRYRIRVLITATVAAGVIGFLVVGLSGLAAIALCAYTGGLA
GIVKRHRRGTPTVLAVSLVAAGVVGAGMVIALTVLTRLRQLAFHAIGAAV
DGAASVVARVPPLHAAAVRFAEFFAAALQHWQWMVLGYALVAIVGASLVG
WWALSRVLERLRGIPDVHKLDAPARNGPTRPVPVRLDRVRLRYPHADRDA
LRAVSLDVQAGEHVAVTGANGAGKTTLMLVLAGREPTSGTIERPGSVGLG
ELGGTAVVMQHPESQVLGTRVADDVVWGLPPGTTTDVGRLLGEVGLAGLA
DRDTGSLSGGELQRLAVAAALAREPALLIADEVTSMVDRQGREQLLTVLS
GLTERHRTALVHITHYNDEAEYADRTINLGDTQGDTALIRTATAPAPTCP
AGRGRRAPVLELAGVGHEYASGTPWSRTALRDVSFTVHEGDGLLIHGGNG
SGKSTLAWIMAGLTVPTTGTCLLDGRPAAEQVGAVALQFQAARLQLMRSR
VDLEVASAAGFSSDDRDRVSAALAAVGLDAGLAERRIDQLSGGQMRRVVL
AGLLARSPRVLILDEPLAGLDAASQRGLVELLAERRRETGLTVVVISHDF
AGLEQLCPRILHLRDGSLDANPVAARPDPVPVAPPTKRPAARRRPVVLLR
PVPGSSPIHELWAGTKLLVVFAMSLLLTVFPGWVAVGLATALAAAGLRLA
HIPRGVLPSVPRWLWIFLGVVGVTAALAGGAPTIRLGTASLGLGGLLDFL
RATALTVVLLGLGALVSWTTNVAQIAPAVATLGRPLRVLRIPVDDWSVAL
ALALRTFPMLIDEFRVLYAARRLRPRRPAQTRWARLRRPATDLIDVVVAV
ITVTLRRADEMGDAITARGGTGQISAAPSRPKRNDWIALSIASAVCAAAV
AAELALLAGH
>MAP2487c hypothetical protein
MAVTDDYLAHNAGYASSFEGPLPMPPSKHVAVVACMDARLDVYRILGLRE
GEAHVIRNAGGVITDDVVRSLAISQRLLGTREIILIHHTDCGMLTFTDDD
FKRGIQEETGIKPPWAAEAFADLAEDVRQSLRRIEANPFVTKHVSARGFV
FDVATGKLDEVKP
>MAP1108 hypothetical protein
MGSAGHGTLVRALSRAGVNGVEVLNQQPQVGASALESGQVQALSQFVAWP
GLLVFQGKAKLLYDGAELNLPTLHGVVVRRSYAAAHPEVLAAFLQAQLDA
TDFLNAHPLQAARIVADASGLPPEVVYLYNGPGGTSFDTTLKPSLTEALK
SDVPYLKSIGDFADLDVDKFVVDEPLRAVFTARGLDYQAARARTTNPSTL
RGDPALAGELWLDGADTTQTTADPASLLRAVRDALGRGARVRAAYVPDTE
FGTRWFADKAFWVKDGQNYLPFGTAAGAGRYLAAHPGGIAVNYQQALGGS
V
>MAP0470 hypothetical protein
MPNTSPVTAWKSLKEGNERFVAGKPQHPSQSVEHRASLAAGQSPTAVVFG
CSDSRVAAELIFDQGLGDMFVVRTAGQAIDTAVLGSIEFAVSVLNVPLIV
VLGHDSCGAVKAALGAIEEGAIPGGFVRDVVERVAPSILMGRREGLSRVD
EFEERHVRETVAQLVSRSTTIAERIGDGTVAVAGVTYHLADGRAALCDHV
GDIGE
>MAP2097c hypothetical protein
MTAPSDTQSAAGRTRRPPRPVVLLVPVPGTSKIHELWAGTKLLVVLGVSV
LLTFFPGWVTVGLMLALLVAAARLAHIPRGALPSPRRWIWIVLAVGGITA
ALGAGSPVVSIAGLHIGLGGTLHFLRVTALSIVLIGLGAMLSWTTNVAEM
GPALATLGRPLRWLRIPSDEWAVALALALRAFPMLIEEFQVLYAARRLRP
NQTPRSRRARRRQQARDMIDLLTAAIVVTLRRADEMGDAITARGGIGQLS
AAPARPKLADWVTLTITVAAGAIGVALDSMIPFQ
>MAP2098c hypothetical protein
MSPWLHSFAKRGLSAGACRAKHLLQRTLPRPAGRIAQVTLLRFRPAGPGA
LRPNELAQAAVMGALCAAIEILAAVIPFAQGLGVLGTVPMGLLAYRYRPR
ALMAATVAGGVIAFLIAGLGSLFMLVDCAWVGGLCGIVKRKGRGTPTVAL
LSLIAGVLWGAGWVAVLAVLTRLRHLFFDVITANANGVAAFLNWMHLQGV
GAGLKRYVADGLQHWPLLIFPYMILLVVVVSFISWSALSRLLDRMRKIPD
VHKLDPPDDGHAAIGPVPVRLENVRFRYPGAEQDALREVSLDVQAGEHVA
VTGANGSGKTTLMLILAGRQPTSGTVHRPGAVGLGEVGGTAIVLQHPKSQ
VLGTRVADDVVWGLPPGTDIDVHRLLREVGLDGLAERDTGSLSGGELQRL
ALAAALARDPKLLIADEVTTMVDQQGRDALLGILSGLAKRHQTALVHITH
YDNEAASADRVIKLSDSPDNAVAAETNAAAAPAVAVQHGSGVPVLELIDV
SHEYASGTPWSKVALRDVSFVVEQGDGLLIHGGNGSGKSTLAWIMAGLTT
PTSGSCLIDGRPTHERVGEVALSFQSARLQLMRDHVDTEVASAAGFSPTD
QDRVAEALMSVGLDPAMGKRRIDQLSGGQMRRVVLAGLLARSPRALVLDE
PLAGLDIGSQRGLLRLLENLRRERGLTVVVISHDTVGLEELCPRSLYLRD
GALQTASTAAGGMP
>MAP0133c hypothetical protein
MSPEVAERLRPGPVAARSPLLAHPRWFVPGKFRVSHQSMGIRRRRRKWAR
KEIRVADPAMRQTIMGTAIGNFMEWYDFGVYGYIATTLAEVFYPGKSVSG
LHLIATFSTLAAAFVVRPLGGFIFGPLGDRIGRHRVLVVTILMMTVSTTT
TGLLPTYSSIGIWAPILLVIARIFQGLSTGGEYVGAMTYLVEQAPDHKRG
MMVGFLPMGNLVGFVLAGMLVTGLQTWLPDQDMLSYGWRIPLLLGLPFGL
VALYLRLRLEESSAYQSANDSPHTPGGQGRQQIRRTVAQQWRPMLICAAL
VLTSQVADFMLTGYLPTYLRLFVRVGHTAGLVMIVTTLAILMATVVAVAS
LSDRIGVKPIMWTGCALLIGASVPAFLLIRFGGVYPVIFIGVLLIGLMEL
CFDSTGPAMLPALFPTNVRYGALAISYNISISLVGGVTPLIAQALVSATG
NVMVPAYMLIFGGAVGAVTLLFTPEVAGKPLPGSGPAVETEREARALADD
VR
>MAP3865c hypothetical protein
MGAGHNHTPAETGDARLIPRMVMAAAILAAFFVVELVTSLLINSIALLAD
AGHMLTDVVAVFMGLAAVTLARRGSSSPARTYGWHRAEVFTAVANAGLLI
GVSVFILYEAIQRLREAPAVPGVPMIAVALAGLAANFVVALLLRSHSSGS
LAVKGAYLEVIADTVGSLGVLIAGVVTVTTRWPYADVVVAVLVALWVLPR
AISLARDALRILSESSPTHIDVEELRAALGAVDGVTGVHDLHVWTLSPGK
DMCTAHLISTGDSARVLRDARAVLSARGLAHATVQIDCPDDTECSDSF
>MAP3992 hypothetical protein
MTSTNGPSARDSAGKARDAGSGDGQQGRTQFLTVAEVAALMRVSKMTVYR
LVHNGELPAVRVGRSFRVHAKAVHDMLETSYFDAG
>MAP2534c hypothetical protein
MTTATPRTPGGGRRAPSGPAPGAHRWDLITRSSAHSQNPWNPLWAMMIGF
FMIMVDSTIVAIANPTIMADLHIGYDTVVWVTSAYLLGYAVVLLVAGRLG
DRFGTKNLYLIGLAVFTVASVWCGLAGSAAMLIAARVVQGVGAGVLTPQT
LSTITRIFPPERRGVAVSVWSATAGAASLVGPLAGGVLVDGLGWQWIFFV
NVPIGVLGLALAYWLVPVLPTQSHRFDLVGVGLSGVGMFLIVFGLQQGQA
AHWQPWIWALIVAGVGFVTVFVFWQSVNVREPLIPLVIFADRDFSLCNIG
VAIISFAATAMMLPLTFYAQAVCGLSPTRSALLIAPMAIANGVFAPFVGK
IVDRYHPRPVLGFGFSLLAIALTWLTFEMSPATPIWRLVLPFFAMGVGMA
FVWSPLTATATRNLSAQLAGAGSAVYNSVRQLGAVLGSAGMAAFMTWRIG
AEMPGQPAGGGEDSAGPVLPEFLRGPFAAAMSQSVLLPAFIALFGIVAAL
FLVGFRPWAHRDGGTDAFGPDDYGGDYDDDYDDDDAYVELILVREPEPEA
QQRAGQPRRPQPAPAPPADVRRRDPVESRRSVLDERPAQVQPIGFAHNGS
HVDGGKRLRQVAVRRAPKPGPPADRFTRPPRRHHPGPAGHHLGEGESLRG
QHHRPDPDDDPTGYGRHSSGN
>MAP0741c hypothetical protein
MATVDEIPPGTHKLVPIGRHGVGVYNVNGTFYAIANYCPHQGGPLCSGRP
RGRTIVDETAPGDSVMVRDLEYIYCPWHQWGFELATGTTAVKPEWSIRTY
PVRVVGNDVVVQA
>MAP3031 hypothetical protein
MQTDPVATPDVGAGTRWSIMVVSLLATASSFLFINGVAFLIPSLQGARGI
RLDEAGLLASMPSWGMVVTLVAWGYVLDRVGERVVMTTGSALTALAAYAA
SSAHSMVLIAAYLFLGGMAAASCNTAGGRLVSAWFPPHQRGLAMGIRQTA
QPLGIALGAMVIPELAEHGPQAGLRFTALACVFGAVASVIGIVDPPRKPR
ASASDQELASPYRGSLTLWRIHAVAGLMMMPQTVTVTFMLVWLIRNLHWS
VAAAGALVTLSQLLGALGRVAVGRWSDRLGSRMRPVRYIAAAAVLVLLLL
AWADYLNSRWQAGLMVAISVIAVLDNGLEATAITEFAGPYWSGRALGIQN
TTQRMMAAAGPPLFGALISAAKYPPAWLLCALFPLAAMPLVPTQLLPPGL
ETRARRQSVRRLRWWRAVRSHALPDIVRRPGQPG
>MAP2200 hypothetical protein
MAFRPVSAAAASAGAILGGRTHPRAGGVSMARDHAHAALIIGAGFTGLGA
AIRLAEAGVDDIVILERADRVGGTWRDTTYPGASCDVPSLLYSYSFVKNP
TWSRTYSPAPEIYRHLEDMADRFDIRRRIRFGHEVSGLAFDEDAGVWTAT
TKNRKKFRARTVVLASGPLSDVSFPDIRGLDSYRGHKIHSARWDHDYDFA
GKRVAVIGTGASAIQIIPELVKQAGFVKVFQRTPCWVLPRLDVATPPAVQ
ALFAKVPAAQELARQALYWGHEASATALVWDTPLTSLVARLGKAHLRAQV
KDPWLRRQLTPDFRPGCKRMLVSSDYYPALQRDNCKLIDWPIATLSPAGI
RTSDGIEHHLDCIVFATGYDVHLTGPPFPVTGLGGRSLAAEWAGGAQAYK
SINVHGYPNLFVMTGPNSGPGHNSLLVYIEGQLDYAVRGITTILNDDLRY
LDVREEVQRRHNEAIQRRLTKTTWMSGCRSWYLTKDGFNGSMYPGFATQY
LRQMSDFRYQDYQAVARRARTPAASSA
>MAP2579c hypothetical protein
MGEPSVRATTPGTPMRRIAAACLVGSAIEFYDFLIYGTAAALVFPAVFFP
RLGPTVATIASMATFATAFLSRPLGAAVFGYFGDRLGRKKTLVATLLIMG
ASTVSVGLVPSTASIGIAAPLLLTVLRLLQGFAVGGEWAGSVLLSAEYAP
TDRRGWYGMFTLLGGGTAGILASLTFLAVNLTMGEHSPAFMHWGWRVPFL
ISSGLIGIALYVRLNIDETPIFVEEKARHLVPKAPLTELLRLQRREIILV
AGSFVGGMGFIYLGNTFLVMYAHNHLGYSRSFIWGIGALGGLTSMACVAC
SAWISDRVGRRRVMLWGLVACLPWAFVVIPLIDTGRPVCYVVAVLGMFGT
AAVANGPTAAFVPELFATRYRYSGAAVAMNLAGIVGAAVPPLLAGTLLAT
YGSWAIGLMMASLVLASFVSVYLLPETRGAALDAAAAAEKVAAR
>MAP1450c hypothetical protein
MTSGSDRRATPDVDVVVVGAGFAGLYALHKLRSNGLRVRVFEAGPDVGGT
WYFNRYPGARCDVESVDYCYSFSDALQREWDWSEKYATQPEILAYINWVA
DRLDLRRDITLNARVNSAVLDEAQLRWTVTTEAGERVTARFCVMATGPLS
AAMTPPFPGLDTFAGQVYHTAAWPHEPVDFTGKRVAVIGTGSSGIQSIPI
IAEAASQLYVFQRTPNYSVPAGNRPLSDSDRAEVKAHYAERRRMSWRSGG
GSPHVAHPKLTMEATPEERREAFEKRWELGGVLFSKTFADQMIDPVANEE
ARKFYEEKVRAVIDDPALADLLIPNDHPIGTKRICTDSNYFQTFNRPNVK
LISVRKTPIMSIDATGINTTDAHYDLDAIVLATGFDAMTGALAKIDIVGR
DGRRLSDDWSGGPRTYLGLGVDGFPNLFLVSGPGAPAVLANMVLHAEANV
NWIADCIAYLDAHDYTAVEATTDAVDDWGAECARRADATLFTKADSWYLG
ANVPGKPRVFMLFVGGFGVYLDICAEVANAGYKGFSLVKAR
>MAP3834 hypothetical protein
MRDDDSANRWSGVREAPTAPSRQLTGAVVIIALVAAISGMLYGYDTGVIS
WALLQLTQDFNITEGWQQVIAASILLGAVAGALTCSWLSDLRGRRGTLLM
LAVVFIVGALWCADAADSVMLSLGRLVLGFAVGGATQTAPMYVAELAPPA
YRGRLVLCFQIAIGVGILTATLVGAGGSISWRGPIGLACVPAAIMLWLLL
RLPESPRWLVKKDNRDAARAVLEHVRPEGYDVAAELDEATELARVERTAA
TRGWRGLRDAWVRPALVLGCGIAVFTQLSGIEMIIYYSPTILTDDGVYRS
VALQVSVCLGAAYLIAQLVGLAIIDRVGRRRLTLIMVPGAAVSLFALGLL
FITSDSGRDVIPYIMICLIAFMLFNGGGLQLMGWLTGSETYPLAVRPAAT
ALQSATLWGTNLVITLTMLSLIKAIGVGPLMWLYALFNVAAWIFVFFRMP
DLTGKTLEEIEYQLSEGKFRPSDFGR
>MAP2414c hypothetical protein
MARGLQGVMLRSFGARDHTATVVETVRIAPHFVRVRMTSPTLFEDVDAEP
AAWLRFWFPDPDGSKTEFQRAYTISEADPAAGRFAVDVVLHDPAGPASRW
ARTVQPGTTIAVMALMGSSRFDVPDEQPAGYLLIGDPASIPGMNGIIGVV
PDDVPIEMYLEQHHDDDTLIPIAVHPRLRVHWVARRDEKSLAAALESRDW
SNWYAWATPEATTLKHVRARLRDEFGFPKSEVHAQAYWSAGRAMGTRRGD
EAATTDDGTETPEAIAAQADSTQRQPEAAPVPAARGNWRTQAAGRLLAPL
RWALIPSGVLQAVITLIQLAPFVLLVELARRLVAGAPAARLWDVGIAAVS
LLGLGALLGAALTLWLHVVDARFARDLRSALLRKLSRLPLGWFTARGSGS
IKQLLQDDTLSLHYLVTHAIPDAVAAVVAPVAVLVYLFAVDWRVALVLFV
PVLVYLVLTASLTIQSGPRIPQSQRWAETMSDEAGAYLEGQPVIRVFGGA
AASSFRRRLDEYVGFLVAWQRPLAGKKTFMDLVTRPSTFLWLIAAVGTLL
VVAGRMDPVNLLPFLLLGTTFGARLLGIAYGLGGIRAGMLAARRLQNTLD
EHELEVREPGEPTGESAQAVVFDNVGFGYRPDVPVIHDVSLTLRPGTLTA
LVGPSGSGKSTLAALLARFHDVDRGSITVGGRDIRSMTADELYARIGFVL
QETQLVHGTVAQNIALAVPDATAAQIEQAAREAQIHDRIMRLPHGYDTVL
GAGVGLSGGERQRLTIARAILADTEILILDEATAFADPESEYLVQQALNR
LTRNRTVLVIAHRLHTITRADQIVVLDHGRVVERGRHEELLAADGRYRRL
WEGGRRDAVTVGTAGEVAR
>MAP3180 hypothetical protein
MTAPETSAQQTSSQPLVSRGWIQGVALVMIFGFLVMGILAYRTYSASMPM
PDKVVSESGRLLFTGADITRGQELYQARGLMEYGSVLGHGAYLGPDYTAE
YLRTATQDVADQLRAQGVADPRERVVTEFRTNRYHPDTKTLVFTDRQAAA
FDHIQDRYGAYFGENSTKYGCCRT
>MAP2105 hypothetical protein
MARECDGPDRVMKKNCPLQCPRRYGGVAENAAGDQVDWAGLPVNLDFAFS
RDQRDKVYAQHLKRRHGTQFGTWRRGGQVCVCELASESISSG
>MAP1620 hypothetical protein
MGLRDHHLSDHYLSDHREDIGLMRSRYAGEPFTTSTAEIAAALEDVSIPT
LLLSLVHITGDPRFIRDFKQMGIFLNEVQGFMSEEDKARARAEALSVITD
YRDRGCPEPEPLSPELIREMMDWAACEHVPDDYLPLICEELDLDGVDPRR
PAALPAERAAGLPVLVVGCGESGILAGVRLKQANIPFTIVEKNAGPGGTW
WENSYPGARVDVANHFYCYSFEPNNDWTHFFAEQYELQDYFTKVIDQHDL
AGQVRWQSEVLAAEWDDGDGTWTVSLRSADGHTETMRARALITAVGQLNR
PNIPAFDGAQTFRGPSFHSAAWDHSVELKGKRVALVGAGASGFQIAPAIA
ADVKRLTVFQRTAQWMFPNPMYHDEVGDGVRWAMRHLPFYGRWYRFLVLW
PGSDKGLDAAEADPNYADQEHAVSDVNAAAHLMFSQWITSQVGEDSELLA
KVMPDYPACGKRTLQDNGSWLRTLQRDNVELVRTPIDKITPHGIVTVDGA
AYDADVIVYATGFRHTDVLWPLKVTGRNGVDLHQMWGSRPYAYLGITVPE
FPNFFIIYGPGTHLAHGGSLIFQSELQMRYIDQCLARLCEPGVHSIEPKP
DAAIDWHRRTPGPDQEDGVVAPGGQALLFQERRRRDSHGEPMASQRVLVR
GARARLVAVHGADERCCATRK
>MAP1418c hypothetical protein
MWGSVLGLGMLAALNPVRLGLALLMISRPRPGSSLLAYWIGGLTVCVPEL
LIPVLLLNFTPMFGHPSHASPSTGLALGKIQIGLGVVGLSIAAVLTVRFA
ARQRAAAPPPDDRTSELSAAPGATIAMPRLLTRAQDVSPDDRSVLRRLLG
RMHSAWESGASWVAWVIGVISVPVDGVLFIVAIIAASGASVTAQVSASVA
FVVLMYAVVEVILVGYLATPGKTQSLLLVLHDWVRTYHRQILVALFTVVG
VSQLAQGLHLV
>MAP3098c hypothetical protein
MTDVVTHTETAPTPAAKAAKPVHTRAVIIGTGFSGLGMAIALQKQGVGFV
ILEKADDIGGTWRDNSYPGCACDIPSHLYSFSFEPKPDWKNPFSYQPEIW
DYLKGVTEKYGLRRYIEFNSLVDRAHWDDDEHRWHVFTTDGREYVAQFLI
SGAGALHIPSLPDIEGRDEFAGPAFHSAEWDHTVDLTGKRVAVIGTGASA
IQIVPEIVGQVAELQLYQRTPPWVVPRSNPEIPPAVRAAMENVPGLRALV
RLAIYWGQEALAFGMTKRPNLLKVIEAYAKYNIRRSVKDKELRRKLTPHY
RIGCKRILNSSTYYGAVADPKTELVTDHIARITPDGIVTADGTHRPVDVI
VYATGFHVTDSYTYVQIKGLHGEDLVDRWNREGIGAHRGITVADVPNLFF
LLGPNTGLGHNSVVFMIESQIRYVADAIATCDKLGAQALAPTRAAQDRFN
DELQRRLGPSVWNSGGCSSWYLDEHGKNTVLWGGYTWEYWRATRAVKPQE
YQFYGIGSRPGV
>MAP1809c hypothetical protein
MSIATALDAAPKRLGGAASGAPGGLWARRKWEVVRLVSPLALLALWQLGS
AIGVIAQDVLPAPSLILQAGVELTRNGQLADALHISTVRVVEGLALGGVI
GIVAGAAVGLSRWVEATVDPPLQMIRALPHLGLIPLFILWFGIGELPKVL
LVALGVVFPLYLNTFSAIRQVDPKMLETAQVLGFSFFQRFRRILVPGTAP
QVLVGLRQSLAIAWLTLIVAEQINADKGIGFLINNARDFLRIDIIIFGLT
IYALLGIITDAVVRLIERRAVRYRH
>MAP3931 hypothetical protein
MPRGHLTEPVTDSSPPAKGTFTVDMLSRAKRGVTAVFVAHGLLFASWAAH
IPQVKAGLGLDDAALGTALFGAPLGSVLATLAGHWALPRWGSHRLIPVTV
AGYAAAGTTVGLARSGPALFAALALWGMFQGTLDVAMNTQAGTVERRAGA
PMMARFHGMWSLGTLAGALIGAACVGAGIGLTAQLTVLGAVVLLVVVMLT
RRLLPDAADSVAAPPEPAAGRRMTPAVAILAAVSFASFLCEGAATDWSAT
YLRDVVGAGPSVAAASYAAYTLTMVVTRFGAARLHARLPSRRLLPALAVL
AVAGMSVALATADAAAGVLGFAALGVGVALLVPTAFSAAYGARGAGSAIA
IVAATGWLGYLLGPPLIGHLSEWVGLSGALVTIPVMMTVVAVAIRYTPAF
DTADEFHRAPAG
>MAP3845 hypothetical protein
MIWATGRRAAPMYLNAERLALANQTVKETFEQCSVAWQAIPHWDTGDPSQ
TTVPNDNVNPPNNFLPLTSLPKPFEVTLAAAIAPTPDELLATVVYYTAKL
AADFDAAVIPGLLTATTPSQLVPGISPAQLLTALIEARAKVEKGGYRAPS
CLITDTIGVETLAASTIANGYAGTDVLLPPANINSLQRVDTLATDPQVRG
WLLGRRQRIAPGAAAEASPGEEAVDLAVSVPPSLEVVGDTSNNAIKLDVR
LSYALRIKDEAGLVVFRA
>MAP0993 hypothetical protein
MVTGRLAGIDCGTNSIRLLIADVRDGRLRDVHRETRIVRLGQGVDATGEF
APEAIARTRAALSDYADLLKQHGVQRVRMVATSAARDVGNRADFFSMTAD
VLGAVLPGAVAEVITGADEAELSFRGAVGELDSAAGPFVVVDLGGGSTEI
VVGGSDGVTASHSADIGCVRLTERCLHSDPPTPEEVALARQVVRERLEVA
LGVVPVEGARTWVGVAGTMTTLSALAHDLPAYDSAAIHLSRVSGRDLLAV
CERLIGMTRAQRAALPPMHAGRADVIAGGAVVVEELARELRARAGIDELT
VSEHDILDGIVLSIAG
>MAP3517 hypothetical protein
MTSMHTRGRRTPGAAVQSSAVSDAAHVGDIVGAFGRIRRDGGGADGAAGG
RWRRLRTLAVITGPGLIVMVGDNDAGGVATYAQAGQNYGMGLLWTLVLLI
PVLYVNQEMVLRLGAVARVGHARLIFERFGRFWGAFSVGDLLILNALTIV
TEFIGVALALGFLGCPKIVAVPAAAALLFAVVAGGSFRRWERLMFLLIAV
NVLIFPMVMLVHPAPKATVAGLIPQFPGGLNSTVLLLVVAIVGTTVAPWQ
LFFQQSNVVDKRITARWIPYGRADLVIGIVVVMVGATALMAVTAFGLAGT
AAAGHFTDAGAVAAGLSAHLGRTVGVLFAIILLDASLIGANAVGLATSYA
VGDAMGKRHSLHWKITEAPLFYGGYAVLLAVSAAVSFSPDHILGLVTQGV
QALAGVLLPSATVFLVLLCNDRAVLGPWVNTVRQNIAAWTIVWCLVLLSL
ALTATTFFPDLSTGTIEAGLAAGAVLGVVAGAVMIVVGRRQRDLAEAEAI
VRTLGGGLDPEQVDELDDASSLTRAERRAVRRQDRENWQTPSLALLDRPA
MSPMRRAGLFTLRGYLVVAVVFVIIKLVQAGVVGPAGSL
>MAP3731c hypothetical protein
MIRIDGVRWQYAGTDAAVLDGVDLHIRRGETVLLCGASGSGKSSVLRLMN
GLIPHFHQGSLDGSVHIDGTSVAELSLERVGRLTGTVLQHPRRQFFTAAV
DTELAFTLENFGTPPEQIRNRVGSVITEYGLAELTGHRLAELSGGQQQQI
ACAAAATHGPPLLLFDEPTANLAADAIERFTATLARLRSLGTTIVIAEHR
LHYLREIADRIVLLRNGRIAAEWSRKQFARLDDAALNAEGLRSNNSPVRN
HIPPACAYGASVAGTPSGTAAPASSPSEVVLRGIRCCFRGHRVLDIEEAR
FPAATVTAITGPNGAGKSTLARVLVGLQRHDGEVSFGGSRISRSRRQRMS
AIVMQDVQRQLFTESVRAELRLGAPPAAAGVASTLLRDLGLEEFADRHPL
SLSGGQQQRLVVAAARLSNRKIMVFDEPSSGVDRRHLRSITNVMRDVAAQ
GVVVILISHDQELLTLAADQELRMRVADTLNARSRRKAAGENACLETLSD
>MAP3726 hypothetical protein
MTLIKRCVRGNGWRTTLLMLLLLTAVAASLMVGRYPVGVGAMAGMLFGRL
PLLDTSFTPVDQTVLTQIRLPRIGCGVLVGAGLAASGAGYQTMFRNPLVS
PDILGVSAGAGFGGALALLLHAPYWQLEAMAFASGLLAAALALIIGRGIG
RDSAILLVLAGMVIASVFGALISVTEYLANPDDTLPAIVFWLMGGLGRQH
LDGLLAPALIIAAAVLVLYALRWPVTVVVSGDEDAHTLGVDTRRTWAAVV
GVYTLITATTVSLAGIVGWAGLLIPHIARALVGPGFGRLLLVSAALGGVF
VVGVDDVARAAASAEIPLGILSALIGAPFFLVVLAKMRRQWT
>MAP1632c hypothetical protein
MPQRLGLAGLCLGTALIIMEANVLNVAIPSIRQALHASPAQSLWIIDAYT
LVLAALLLSAGRLGDRIGARRCYLLGLAVFSIASVLCALAASSAELIAAR
TIQGVGAAVLIPAPLGLISAMFSDLTARAKAVAVWVTIGGVGFAAGPLIG
GLLVSTFGWRSIFLINIPAAAIIAVMVRLTVAEASRSPLPFDYVGQALAI
VGLSAVVFACVESSALAWMSPFVLLPAVAAALILGLFVIDQRHRGRAGAW
VLLPVELLNNRPVNAGLMSGFVYNFTLYGLVLVYSYVFQSARGYSPVQTG
LAFAPLTVAALVTSLPAGRFVAAHGARRGIMIGMALSAIGLCALAFDAQR
MPFVVLSIAFGIFATGLSLSATGQTMAVMANASDQYKNTASSMLNTARQT
GGVIGVAALGAITSRDLLASAPVALTIAAAACLVAALGVATLIARHARTH
DSDQH
>MAP2516 hypothetical protein
MPDTSRGPALLILFATLLATAGTGISIVAFPWLALQHRHSATDASIVAAA
MTLPLVLATLIAGTAVDSFGRRRVSLVCDWLSGAAVTAVPLTAWIFGAAA
IDVAELAVLAFCAAAFDPAGMTARQSMLPEAAARAGWSLDRTNSSYEAML
NLAFIVGPGLGGLLIATLGGINTMWVTAGCFALSFLAIGALRLDGAGKPP
RATRPVGLVTGIAEGVRFVWNLRVLRTLGLIDLAVTALYLPMESVLFPKY
FADQHQPAELGWALMALALGGVAGALGYAVLSARLRRRTAVLTATLTFGA
TTAGIAVLPPLPVILGLCAVTGVVYGPIQPIYNFVMQTRAPHHLRGRVVG
VMAGLTYAAGPLGLLVAGPLADAAGLKATFLTLAVPILAIGVVACGLPSL
RELDRAPQFADDPGP
>MAP3954 hypothetical protein
MIPLPRPWVLAGAMLIGSAVGLLAGVAFTVAVQAHVRPDLAIAAVVGIPS
VIGLALILFSGRRWVTTLGAFVLAVAPGWFGVLVALRVTAGG
>MAP1762c hypothetical protein
MQGVFGVFIGTFLIGLREGLEASLIVSIVAAFLKRNGQTLRPMFAGVAVA
VLLSVAVGVGLDLLATSLPQAQQEMMETVINAVAVVFVTSMIIWMNRNAA
QLKGELEREARQAVHRGGALALGAMAFLAVLKEGFETSVFLLAAAETSHG
SRWFAVLGGVLGIATSIGLGVGLYFGGLRLNLGRFFRVTGVFLVLIAAGL
VLGALRTAHEAGWLNIGQRQLFDLSGWIPSDSVLGAVTTGVLGIPADPRL
VEVLGWLLYAVPVLVVFLRPARLAATPRARGRLLATAATLLLAIAAVLAV
AAPARDTVDAARTRTVTDRAGHAAAVSMATGPHGRELTVTPAGTSTVHHI
QLVPADDQSVDGLPVQAWQASETAGVDGAPEITLDQLRDMTGGRLPVGLA
AARTPGPFQGQWSTTTVYTVLTRGDAVISAKAASNRTAVLTGGGLTGAKT
VSLGGLATDWSTSAAEDHATAAAIAAGDRNRGEGQLWNVWLPLVIAGFAV
ACALSALASVRVDRKREDERKAIDGEAHRRGNVPVS
>MAP1110 hypothetical protein
MTAMRLELDRVRLSYNGPAVINELSLVVRPGEILVLTGPSGCGKSTVLRA
LAGLLTPDGGRVLADGVPVTGTSGDRAMVFQDNALLPWRTVRSNIELALR
LRGQPRAGRRAAAERWISELGLAGFGDYLPKSLSGGMRQRVQLARGLAGA
PRAVMMDEPFGALDTRTRAAMQRLLIDTWRTHPTTIVFVTHDVDEALALG
DRIVVLGRAGEPPRASVEVPEPRSDRPHAELRAQIIDALNHAEAA
>MAP3145c hypothetical protein
MGERAPRPGPLPREVWILSWANVMVALGYGVISPALPTFARSFGVSIKAV
TFLVTVFSLSRLCFAPISGLLTERLGERRIYIGGLLIVAVSTAACAFSQA
YWQLMLYRVFSGVGSTMFYVSALGLMIHISPADARGRIAGLFTTSFMVGA
VGGPAVGGLAAGWGLTAPFVVYGVAMLGVALVLFLGLRNSALAAPRPPTR
STVTMREALRVRAYRSALLSNFATGWSAFGLRMALVPLFVSDVIGRGIGT
IGVVLAAFAGGNALAVVPSGYLSDRMGRRTLLIVGLVTSGAATVWLGFVA
SLPVFLVAAGVVGVVTGIYMSPLQAAVADILGNEARAGLPVATVQMMSDL
GAIVGSMAVGWAAEQIGYGWGFFISGVVLLIAAVGWVMAPETRTATELEA
DLMAAESDVEPV
>MAP1937c hypothetical protein
MVSRYSAYRRGLGDDTVSPEVIDRILIGACAAIYLALLGVSVAACVALAD
LGRGFHKAASSPHTTWVLYAVIIVSALIIAGAIPILLRARRISQAEPTAR
AMTAPARPPVRLGAGVARPATERAPHAPATTPDVGWSGEAVDRIWLRGTV
ILTGTMGAALIAVATATYLMAVGHDGSSWVGYGFAGVITAAMPVVEWLHI
RQLRGAVAEQ
>MAP0453 hypothetical protein
MGPTRKRDLTAAVVGAAVVGYLLVQGLYRWFPPITVWTGLSLLAVAVIEA
LWARYVRTKINDGEIGSGPGWLHPLAVARSLMVAKASAWVGALVLGWWIG
VLVYFLPRRSWLRAAAEDTSGAVVAAVSALALLVAALWLQHCCKSPPDSG
EHGEGAET
>MAP2654c hypothetical protein
MTAVQPDSTVGTDIWSTSRRLSMGDDACADQWQALGMLASALAGRAVAVA
GLPPGEPAWTDGQTIYVDAGAPGALKSLAVQASMIAAGSLRPDVVGRLLR
HRKLAQRYLTVEGHRALVANAPVLPRVLASLGDRGVAGRSDSPRASLALA
AARVALPDPAPEFGVIRAGKVLAACGRAARPDQPDDKAAPAHVPRRDGAA
DLAELDDAAVDDSDDPDLFTSPVGGGGALGKWLKKLLSSARKTGTGGGPP
GADSPTHRTDSGKRGAYAVASLASAPADEGNDEDPADGVRYPEWDVARGS
YRPAWCTVREVEPAITARATLAVDDAIAVRRPLARLGMGLHRRHRQPQGD
DIDIDAAVEARVEVRAGSVPDEAVYLDSLRRRRDLSVLLLLDVSGSAAEP
GTVGRTVHEQQRAAVADLAVALHDLGDRVALYAYYSQGRRAVSMVPVKRF
HDQLNAQVIRRLNSLEPGAYSRLGAAIRHGSAILEARGGTSRRLLVVLSD
GLAYDHGYERAYGAADARRALTEARRRGTGCVCLTVGAGTDVQSLRRVFG
TTAHATMARPDQLAGVIGPLFRSALRAAEVRRRTAANARAGTATSRRPHT
SAHVGAG
>MAP1808c hypothetical protein
MTLTAESGPLAGSRTDVAGELRHVDKWYGNRHVLQDVSLQIPSGQIVALI
GRSGSGKSTVLRVLAGLSHDHTGRRLVAGAPALAFQEPRLFPWRDVRTNV
GYGLTRTRLPRAQVRRRAERALADVGLADHARAWPLTLSGGQAQRVSLAR
ALVAEPRLLLLDEPFGALDALTRLSMHTLLLDLWRRHGFGVLLVTHDVDE
AVALADRVLVLEDGRVVHELAIDPPRRTPGEPGAHTERYRAELLDRLGVR
Q
>MAP3560 hypothetical protein
MSEFTIPGLTDKQAARLTELLQKQLSTYNDLHLTLKHIHWNVVGPNFIGV
HEMIDPQVEAVRGFADDVAERIAALGASPQGTPGAIIKDRSWDDYSVGRD
TVQAHLAALDLVYNGVIEDIRQYIDETDELDQVTQDLLIGQAAQLEKFQW
FVRAHLESAGGQLAHKGKSTERAAAQSARGKS
>MAP0337 hypothetical protein
MPSSRPMPTSTDRMLGDRPNLCRRPCLAASRRVRQDAAGGHRSVKICGIG
VKASLRTRHRPAPTGTVTGMTVVEFTGGAAPRGALPSRSTLPGPGSVRVT
AAYAAALLVVYLVLAALGPHARQVAVSRMSTNVHNLGRGQLGTLIGSAFV
DDGGELFFWLPGLVCLLALGELLWRGKGLLVTFAVGHIGATMIVAVGLVA
AIESGMLPASVARASDVGISYGAMCVLGAITAAMPVRWRGVWAGWWLGTA
VVATVGADFTAVGHVVALLLGIGLSFRLRSTASWTPVHLALLCVGATFGY
LLLAGAASMAPIGGLAGAFIGVLARRPLGAC
>MAP0886c hypothetical protein
MGSRLRPIPQTHRGDRHRRRRRAPPRPAGAHRGVGPGVRSCSPPGGARDT
ALANPRPTPAASDTATARAGDRAVADPGGDPLGRADADGAHHHVDAIIYG
TGFAIPAHVADDTITGAGGLPLRRAWPDGTEPFCGVAVRGFPNYFFASGP
DPGPQARYIVECLKLMQRTGSRRIEVRASSQQVFNERAQLRPVEPPPVAS
AFDLSASTPAGDDTYDGAATLEIAGDRHPVRVRLTGHLDPIDGRYHWQGT
VFGSPSQPLPGDLLGQARAATLTVGQRSAPARIVERTPWGTHTVAGIGAP
PYPGHR
>MAP0489c hypothetical protein
MGTALTRDATRVGRTTRWLAAVLAVTGWAALTGCTGPAHPHATAVVASTD
VWGSVARAVAGGHVAVASILSGSDQDPHSYEASPSDAAAIADAGLVVFNG
GGYDGWVDDVLAHHPGVARVDAYALLPDDGRPRNEHVFYHLGVAKAVAAA
VADRLAAIDPGNAADYRRNAAAFGRDADAIAGIEHTIAAAHPGGSVVATE
PVAFYLLEASGLVNRTPPALEAAVENETDPAPADLARALDLLDRHQVSAL
VVNPQTSASAVNGLREAARRAGVPVVEVRETLPDGADYLSWQRNTVGQLQ
TALQPVRSLQP
>MAP3872 hypothetical protein
MSTERAHSYAGDVSPLEAWKLLSDNPNAVLVDVRTDAEWRFVGVPDLSSL
GREVVFLEWNTSDGRHNPDFADQLRRQIEPAPAGQERPVLFLCRSGNRSI
GAAEVATQLGITPAYNVLDGFEGHLDANGHRGETGWRAIGLPWKQG
>MAP2441c hypothetical protein
MPARSMVVMPHGPVAHQAGTRGYRRMTAALCGAGLASFAAMYCSQALLPA
LSAHYRIGPATAALTVSLTTGALALSIIPASVLSERYGRIRVMLISGVAS
SVIGLLLPFSPSLGVLLFGRAAQGVALAGIPAVAMALLAEEVDASSLGSA
MGRYIAGTTIGGLAGRIVPSVVVQVGTWRVALLACSLITLAGTAVFAVLV
PRSRFFTPKPASVRAALRNLAGHLRNPVLAKLFAVGFVLMGGFVTVYNYL
GYRLAARPFGLAPSVVGLLFLLYLVGTGTSVVAGRLADRRGRPLVLGAAL
PIAVAGLLLTVPATLAAIVAGVGVFTGGFFAAHTVASGWVGAVAQRDRAE
ASALYLFSYYLGGSVAGAFGGVLYGVGGWSATVCFVVVLLMAGAALVALL
VRDNGFRIGRRVVTSVASVK
>MAP2882c hypothetical protein
MIWRRGGLIFVPEGGQLRWEGTMAKPPLSMKPTGWFQVAWSDEIGVGDVH
KMKYFDQEMVAWRAESGQLTVMNAYCEHLGAHLGYGGKVVGEVLQCPFHG
WQWSAEGRNVCIPYQDRPNRGRRMRTYPVVERNASVYIWHDLQRREPYFD
PPDVFAAFGDGSSADDYYPQQRLYRQALEMHPQYVLENGVDFAHFKFVHN
TPIVPVFTRHDFAEPVSYVDFTITFEGDDGQKIEDVNSGVQAINGGLGIA
VTKSWGMIDNRTISAITPVDERTSDVRFMVYIGRTPAKDAARAQRKAAEF
GDEVIRQFTQDIEIWQHQRYSDPPALAADEYQGFMAIRQWAKQFYPEFAA
QEA
>MAP0782 hypothetical protein
MFVCLCNGVTSQTVTEAVQCGASTTNEVARACGAGADCGRCRRTVQAILR
SSSGNRTPNSI
>MAP0815c hypothetical protein
MSGRRQGDPGRVAAKPGRRPGNSAAAPHPGAANYPAGDTGDRRTRRPPPM
PSANRYLPPLGHQPQPDRGAAAPPRGPVAGERITVTRAAALRSREMGSRM
YWMVQRAATADGADKSGLTALTWPVVANFAVDAAMAVALANTLFFAAATG
ESKGRVALYLLITIAPFAVIAPLIGPALDRLQHGRRVALAASFVLRTALA
AVLIMNYDGASGSYPSMVLYPCALAMMVLSKSFSVLRSAVTPRVMPPSID
LVRVNSRLTMFGLLGGTIVGGAIAGGVEFVCTHLFKLPGALFVVVAVTVA
GASLSMRIPRWVEVTAGEVPATLSYRRDSEPLRRRWPEEVKNVPKKATAT
LRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHQANGWAQLAMLGM
IGAAAGVGNFVGNFTSARLKLGRPAVLVVRCTVAVTAVALAASVAGNLML
AVIATLVTSGASAIAKASLDAALQDDLPEESRASGFGRSESTLQLAWVLG
GALGVLVYTELWVGFTAVTALLILGLAQTLVSFRGNSLIPGLGGNRPIMV
EQEGARRGVGSPAVVAE
>MAP2376c hypothetical protein
MRMTEADTPFDVLVIGAGFSGLYMLHRLRQLGIPARVLEMAENVRGTWLF
NRYPGARCDIESIEYSYSFSEEIQQEWVWTESMPAQPEILAYLNYVADRL
DLRRDIQFGAEVVAMTFDEDAAMWSVRTRSGDTFRVPFVVAASGILSVPL
QPDIPGMNTFAGTSLFTSRWPAAGVDLTGKRVGVIGTGSTGVQLIPVVAR
EALHLSVFQRSPAYTLPWRVHRFQPGELDEMKARYGEIRAAQRAHPIGAA
RLSAFSVLLEMLGRPPLKSATPEERLRAIEEHGVLGALNWGDVFFDIEAN
RMAAELYGEAVARIVKDPETAASLVPVHPFACKRPIIDQGYYETFNRDNV
TLVDLRKSPIREVTPAGIRTEDRLHELDVIVYATGFDAMTGALSRIDIRG
RGGIGLAEFWATQGPLSYLGLAVAGFPNLFTVQGPGSPSAATNFVAALEQ
HVEWIGDCIGYLRANHIRTIEALSTAQQEWIEHTTALVAPTVLVHPSCNS
WYNGGNVPGKKRMYMGYTGGIPEYRRRCDEIAAGGYTGFKLA
>MAP0424 hypothetical protein
MSQLSFFTAESVPPAVADLSGVLAASGQIVMVGTPEPHGARLSVVVDHTW
RAEALADMISEAGLVAEIGRTDEDTPLVRTAVDPALSPLAAEWTRGAVKT
VPPRWLPGPRELRAWTLAAGNPEGEHYVLALDPHAPDTHSPLASALMRVG
IAPTLIGTRGGRPALRISGRRRLSRLVENVGEPPDSPEASAHWPRV
>MAP1109 hypothetical protein
MTAQVVADQVVGGVISAPPLARPRRGTLRWQSRVLRVVSVAAAIGLWQLL
TADKVRLLLRFDTLPTVTEIVGALHRRLAAGEYWLDLAQSLLRILTGFGL
AAVIGVATGVLLGRSRLFADVFGPLAELARPIPAIAMVPVAILLFPTDEA
GIVFITFLAAYFPIMVSTRHAVRALPTLWEDSVRTLGGGRWQVLTQVVLP
GILPGVFGGLSVGMGVAWICVISAEMISGRLGVGYRTWQDYTVLAYPQVF
VGIITIGVLGFATSAAVELVGRRVTRWLPRAQDGAR
>MAP0618c hypothetical protein
MLNGAMEKTRSAAFSVPLATASGPSLRDDEYPDKLDAALFRIAGVCGLAC
IMAVLDSTVVAVAQRTFIAQFGVNQAIVSWTIAGYMLAFATVIPITGWAA
DRFGTKRLFMGSVLIFTLGSLLCAVAPNILLLILFRVVQGVGGGMLLPLS
FVILTREAGPKRVGRLMAVGGIPILLGPIGGPILGGWLIGAYGWKWIFLI
NLPIGLTAFALAALLFPKDRSAPSEALDITGALLLSPGVAIFLCGVCSIP
GRHTVADRYVLVPALVGLVLIAAFILHAWYRTEHPLIDLRLFRNPVVTQV
NVTLLVFAAASVGVGLLVPSYFQIVGHETPMQSGLHMLPIGVGAVLTMPL
GGAVMDKHGPGKIVLTGLPLMAVGLAVFTYGVARQAAYSPVLVCGLAIMG
LGIGLTTTPLSAALMQALAPHQVARGTTLISVNQQVGGSIGAALMAVILT
NQFNRNPALMAANEAAGMHPVTGKRGLPVDPSTVPRPAMTPELAGHVSHH
LSHAYTAVFVLAVVLVACTIIPASFLPRKPPSPPAGD
>MAP2744c hypothetical protein
MSGGLTPDQAIDAIRGTGGAQPGCRALHAKGTLYRGTFTATRDAVMLSAA
PHLDGSTVPALIRFSNGSGNPKQRDGAPGVRGMAVKFTLPDGSTTDVSAQ
TARLLVSSTPEGFIDLLKAMRPGLTTPLRLATHLLTHPRLLGALPLLREA
NRIPASYATTEYHGLHAFRWIAADGSARFVRYHLVPTAAEEYLSASDARG
KDPDFLTDELAARLQDGPVRFDFRVQIAGPTDSTVDPSSAWQSTQIVTVG
TVTITGPDTEREHGGDIVVFDPMRVTDGIEPSDDPVLRFRTLVYSASVKL
RTGVDRGAQAPPV
>MAP1596 hypothetical protein
MTTPRAAVNAPARADTGSGGERISPQRRNLIFVAIVLGMLLAALDQTIVA
TALPTIVANLGDAGHQSWVVTSYLLASTIVTALVGKLGDLYGRKRVFQAA
VLFFVAGSVLCGLAQSMAMLVGARALQGIGGGGITVTASALIGEVVPLRE
RGRYQGILGAVFGVTTVIGPLLGGYFTDYLSWRWAFWVNVPVSVIVIFVA
AAAIPALAASAKPVIDYAGIVFVGLGAAGLTLATSWGGSRYPWGSPTITG
LFAAAAVALGVFVVVERRAAEPILPVRLFASPVFTVCCVLSFVVGFAMLG
AMTFLPTYMQYVDGVSATTSGLRTLPMVVGMLFTSTGSGTIVGRTGRYKI
FPVAGTALMALAFLLMSRMQPSTPAVIQSLYLFILGAGIGLSMQVLILIV
QNTSDFEDLGVATSGVRFFRTIGSSFGAAIFGSLFVNFLNRRIGPALAAS
GAPPGAVSSPGALHRQPHEVAAPIVAAYAESLTEVFFWAAPVALVGFVLA
LFLREIPLRDIHDSTVDLGDAFGMPTTETPDQMLENAIARMLRGETGMRL
RSIAMRPDCRLDVAGLWGVLRINRYTQMYGAARLTDMAEYLRIPFEVLEP
TFSRLVTAGYAGSDGDRLWLTPAGAQQVGYVHSLLLAWLVDKLGRSPGFE
GRPDRQAVQAALERVAYRVLAQRDWHDEQPTAAITAAAR
>MAP1088 hypothetical protein
MVVLQVFSVQLGLISLFPDGSVTSLLVAALVLAVPVSAPIAQVLGKNIDA
TLALPHVNTARAKGGTPGWVIRKHVVKNAAGPALTVTATTVGALLGGSVV
TETVFSRSGVGAVLLQAVSSQDISLIQGLVLLTAVAIVTANLAVDLIHPL
LDPRVTRVQRRGFTTRLGRFG
>MAP1463 hypothetical protein
MPARRPGLRRVVASGAPRGPSMTDVSDVLVIGAGFGGLYAVHRAASSGLS
VTALEAAPDVGGTWYWNRYPGARCDVESVDYSYSFDEELQRSWQWTERFA
AQPEILAYLRHVADRFDLRRHYRFGADVVDAAFEHGRWRVGTSNGQTFAA
RFLICATGCLSAVNRPDIPGAQDFSGEVYFTAAWPREDPDLRGKRVGLIG
TGSSGIQATPIIAAQAESLVVFQRSANYTIPMPNRPFSAEEQQRIQEQYP
ERRRRSAYATSGTPHGMYHKNAVDTDPAERAEALWKRWREGGVLFAKTFP
DQTSDPAANDIARTFAEERIREIVTDPDVAADLIPVDHPIGTKRICTDDG
YYATFNRDNVRLVNLRREPIEAITADGMRTSTTTYPCDVLIFATGFDALT
GALTRINPTGPRGDRLRDIWADGPLTFLGMMVPGLPNLFSISGPGSPSVL
ANMVLHAEVQVDWVVDLLCATRRLGVTEVEPRRDAAVAWTRYVAEVAERT
LFPKAASSWYLGANIEGKKRVFMPYIGGFGTYRRHCEQVADRGYAGLVLT
TR
>MAP0835c hypothetical protein
MTATPRACNRDRVALQAVHFFMADMEAGMGPFLGVLLQSRGWTTGAIGAA
MTLGAIVGMVTVAPAGALVDATRHKRGCVIVVGLAAVAASAVILTSRQFW
TVATAQAVMCISGATIAPAMIGITLGVVGQAAFTRQNGRNQAYNHAGNMA
GAALGGVAGWVFGYAGIFWLAAGFAVATIAAVLAIPAGDIDHHVARGEAR
AAGEAPVKAMRVLARSRPLLVLAAAMVLFHLGNAAMLPLYGLAVVATHAN
PFTTVASTVVVAQAVMVPASLLAMRIAATRGYWPAILIALTALPVRGVLA
ASVITSWGVIPVQVLDGIGAGMLSVAVPGLVARILDGTGHINVGQGAVMA
AQGLGGALSPVLGGAVAQHLGFRAAFLLLAGLSLGALIIWVTFAPMLRRA
ARLPAAPSDRAGAPPTATADNQAK
>MAP1441 hypothetical protein
MSTGMAVTTESAGAAGPDPYDVGELRANLRQADPGVLVAVLAQLTGDPAV
VDRFAPKITHVPDPPEQAGVTDPETAAQLVDEIVTALRTPRRADAVPADD
LDLFARVAPVALGGEVGPEYLGLLVEQGGFQPSQPVLPRTAKLPAGFRVV
IIGAGIAGITAALACADAGIEYQIIERNDEVGGTWYTTRYPGIGVDTPSA
YYSLSRDINGDWSSYYPQGAEYQAYLVSVADKNDLRKHTRFGTEVEALWW
QERRRQWQIHSVGPDGTRDVSYANVVIPAAGYLNRPRWPELAGRETFSGI
SIHSAHWDPELDLTGKRVAIIGAGCTAVQIVDACVDQVAHLTVFQRQPHW
VAPRRRASDDVSTYQRWLGTRLPYYANWIRIKSYWGTADNNYPVILHDPQ
WAAEHLSVSPANDVLLRMCLDYIDRVFGAGSELARKVTPDFAPYGKRIIR
DPGGYYAALAREHVDVEASEPARVNQAGIVTADGRQIDLDVIIYATGYYL
DFLSTVDIRGRDGKKLTDEWGDAPRAYRGGMVPGFPNMFISSAPNYSPGH
GAGHNFGVEVMVHYVMECLQLMALRRATTVEVTQRAYEEYVADIDALMAG
TVWCHTPSAHTYYRSGGGRIVTAFPYRLVDFWRDHRAPSEEDLELR
>MAP3630 hypothetical protein
MSTVHSSIDHHPDLLALRARYERVAESMSAHFTFGLALLAGLYVAASPWI
VGFSATASLATSDLIAGIAAAFLAYGFATTLDRAHGMTWTLPVLGAWVIV
STWILPGVVLTAGMTWSNVVAGALLTFLGLNATYFGMRTRASAG
>MAP1107 hypothetical protein
MRRHAALLASVLIAVAALGTGCSLESLSQSAGVVNVVVGYQSKTINTVTA
GTLLRAQGYLERRLADITTRTGTKYAVRWQDYDTGAPITAQMLAEKIDIG
SMGDYPMLINGSKTQANPLARTEMVSITGYNPKGALNMVVVSPDSRARTW
PTWPAPRSRPAWARPATAPWCGP
>MAP2988c amt, Amt_2
MRVTYPILGQPNTGDTAWMLASSALVLLMTPGLAFFYGGMVRARSVLNML
MMSISAMGVVTVLWVLYGYSVAFGDDVGNFMGKPTSYWGLKGLIGVNAVA
ADPSKGTAATDIPLAGTLPATVFVAFQLMFAIITVALISGAVSDRLKFAA
WLVFAGLWATFVYFPVAHWVFAFDGFASEHGGWIANKLHAIDFAGGTAVH
INSGVAGLMLAIVLGKRRGWPTTLFRPHNLPFVMLGAGLLWFGWYGFNAG
SATSSNGAAGSTFMTTTIATATAMLAWMLTERIRDGKATTLGAASGIVAG
LVAITPSCSSVNVLGALVVGLVAGVVCALAVGLKFKLGFDDSLDVVGVHL
VGGLAGTLLVGLLAAPESPAISGVTGVSKGLFYGGGWAQLERQAVGAFSV
LIYSGVVTLILALILKYTMGLRLNPEAEASGIDEAEHAESGYDFAVATGS
VLPPRVAVADTRNGLEEQRVGDKVEAEQS
>MAP1394c amt, Amt_1
MHGIDPAATAWLLASTALVLLMTPGLAIFYGGMVRTTGVLNMIMMSFISI
PLVTVAWLLVGYTMAFSQDGMSGLVGNLRHFGMLGITPDTTHGAVPELLY
ATFQLTFAIITAALVSGAIADRAKFAAWMVFVPVWTVAVYSVVAQWVWGP
GGWLARLGVLDYAGGLVVEIVSGSSALALALVLGPRIGFKKDAMRPHNLP
LLFVGVGLLWFGWFGFNAGSALAANGTAAAIFLNTLVAGCLGMLGWLSVE
QIRDGRPTTFGAASGVVAGLVAITPSCGTVNTLGATVVGLAAGVVCSFAA
GAKLRFNYDDSLDVVGVHFVGGVVGVSLIGLFATAVMTAGPQGLFYGGGV
AQLGKQALAIAVVALYAFTVSFLLAKVIDRVMGFRVSAEDETTGVDLTQH
AETAYAEGVHGHLPQRRPGPGDRLK
>MAP2805 arsA, ArsA
MSVIAIAVFVVAYALIASDRVNKTFVALAGAAVVITLPMIRSDDVFYSRE
TGIDWDVIFLLLGMMIIVSVLRQTGVFEYVAIWSAKRARGSPLRVMILLV
LVTALASALLDNVTTVLLIAPVTLLVCDRLAITAAPFLMAEVFASNVGGA
ATLVGDPPNIIIASRGGLSFNDFLVHLAPIVVIVVGVLIALLPRLFPGAF
TVDPERVADVMSLEEKEAIRDPRLLVTCGVVLLAVFAAFIAHGPLHLEPS
LVALLGAGILVVASRLQPADYLSGVEWDTLLFFAGLFVMVGALVKTGVVK
HLARLAITATGGNTLTATMVILVASVVISGIVDNVPYAATMAPVVADLVP
ALGDHANPAVLWWSLALGTDFGGNLTAIGASANIVLLGIARRADNPISFW
EFTRKGVVVTAVSVALSALYLWLRYFVWG
>MAP0484c arsB2, ArsB2
MALTVALVLLAVVLGFAVARPRGWPEAVAAVPAALLLVGVGAIPVAAAEQ
QIADLSGVVAFLGAVLVLAKLCDDEGLFEAAGAAIARGRVGSAGMLRRVF
VIASAITAVLSLDAAVVLLTPVVLAAVRRQRTAVRPYAYATAHLANGASL
LLPVSNLTNLLAFHTAQLSFTRFTLLMAAPWLAAVATLYAVFRGFFAKDL
RVQPDPAALGAPPRPPVFVLVVVALTLAGFAVAQSVGIAPAWVALCGASV
LAVRSLARGHTTVTEIARSVHVSFLVFVLALGVVVQAVMRNGMDRAMSAV
LPSGSGLPALLAIAATAAVLANVVNNLPATLVLLPLVAPAGPVAVLAVLI
GVNIGPNLTYVGSLSNLLWRRVLRQHGVDAGVGEYTRLGVCTVPVSLLVA
VLALWASARLLGG
>MAP0982c arsC, ArsC
MAQGATIYHNPRCSTSRKTLELLRDNGFEPNIVEYLKTPPSRAELVKMIR
DAGIDVRTAVRKRESLYDELNLAEASDEQLLDAMIEHPILIERPFVVTPK
GTRLARPIDAVREIL
>MAP4171 atsA, AtsA
MVAGSGQSAEFSGRVELDIRDSEPDWGPYAAPTAPPNAPNILYLVWDDTG
IATWDCFGGLVEMPAMSRIAERGVRLSQFHTTALCSPTRAALLTGRNATT
VGMATIEEFTEGFPNANGRIPFDTALLSEALAEHGYNTYCVGKWHLTPLE
ESNMASTKRHWPTSRGFERFYGFLGGETDQWYPDLVYDNHPVSPPATPED
GYHLSKDLADKTIEFIRDAKVIAPEKPWFSYVCPGAGHAPHHVFKEWADR
YAGRFDMGYERYREVVLERQKAMGIVPSDTELSPVNPYLDVTGPRGEPWP
LQDTVRPWDSLNDEEKKLFARMAEVFAGFLSYTDAQIGRILDYLEESGQL
DDTIIVVISDNGASGEGGPNGSVNEGKFFNGYIDTVEESMKLFDQLGGPQ
TYNHYPIGWAMAFNTPYKLYKRYASHEGGIADTAIISWPNGIAAHGEIRD
NYVNVCDITPTVYDLLGMSPPETVKGIAQKPLDGVSFKAALDDPNADTGK
TTQFYTMLGTRGIWHEGWFANTVHAATPAGWSHFDADRWELFHIEADRSQ
CHDLAAENPDKLEELKALWFAEAARYNGLPLSDLNILETMTRSRPYLVGE
RDSYVYYPDCADVGIGAAAEIRGRSFSVLAEATVDTTGAEGVLFKQGGAH
GGHVLFIQDGRLHYVYNFLGERQQEVSSSVPVPLGRHLFGASYARTGTVP
DSHTPLGDLTLFIDDEVVGTLAGVSTHPGTFGLAGAGITVGRNGGSGVSS
RFKAPFVFTGGTIARVTLDLSGRPYRDVETEIALAFSRD
>MAP3791c atsG, AtsG
MTQLTPGDAGDRNDAGRKDNVLIVHWHDLGRYLGAYGHRDVSSPRLDRLA
AEGILFTRAHATAPLCSPSRGSLFTGRYPQTNGLIGLAHHGWEYRSGIRT
LPQILSEAGWYSALFGMQHETSYPKRLGFDEFDVSNSYCDYVAQRADEWL
RQSAEGVVGQPFLLTAGFFETHRPYPEDRYPPADSAEVQPPDYLPDTPEV
RGDLAAFYGAISTADAAVGRLLDTLADTGLDASTWVVFFTDHGPAFPRAK
STLYDAGTGIGMIVRPPTGRGLPPRVYDELFSAVDLVPTLLGLLGISVPP
DVDGVSHASALLRPDPDAAPVREHVYTMKTYHDSFDPIRAIRTKDYSYIE
NYASRPLLELPWDIEESPSGMAVAPLVTAPRPERELYDLRADPTEITNLL
AGDDADADEVAANLAVLLHDWRQRTGDVIPSEFAGTRIAARYTETYLQIH
HGARPTARSAIAADRGIEEEGNPAQR
>MAP1595 bfrA, BfrA
MQGDPEVLRLLNEQLTSELTAINQYFLHSKMQDNWGFTELAEHTRAESFD
EMRHAEAITDRILLLDGLPNYQRLFSLRIGQTLREQFEADLAIEYEVMDR
LKPAIILCREKQDSTTATLFEQIVADEEKHIDYLETQLELMDKLGVELYS
AQCVSRPPS
>MAP3236 catB, CatB
MATDHTSGAPDPKQRDLESARFRRDTGYLTTQQGVRVDHTDDALTVGERG
PTLLEDFHAREKITHFDHERIPERVVHARGAGAYGYFEPYDDRLAQYTAA
KFLTSPGTRTPVFVRFSTVAGSRGSADTVRDVRGFATKFYTEQGNYDLVG
NNFPVFFIQDGIKFPDFVHAVKPEPHNEIPQAQSAHDTLWDFVSLQPETL
HAIMWLMSDRALPRSYRMMQGFGVHTFRLVNARGEGTFVKFHWKPRLGVH
SLIWDECQKIAGKDPDYNRRDLWEAIESRQYPEWELGVQLVAEDDEFSFD
FDLLDATKIIPEEQVPVLPVGKMVLNRNPDNFFAETEQVAFHTANVVPGI
DFTNDPLLQFRNFSYLDTQLIRLGGPNFAQLPVNRPVAQVRTNQHDGYGQ
HTIPQGRSSYFKNSIGGGCPALADEDVFRHYTQRVDGQTMRKRAEAFQNH
YGQARMFFKSMSPVEAEHIVAAFAFELGKVEMPEIRSAVVAQLARVDDQL
AAQVAAKLGLPEPPEEQVDESAPVSPALSQVTDGGDTIASRRIAVLAADG
VDVVGTQRFTELMEQRGAVVEVLAPVAGGTLAGGSGGELRVDRSFTTMAS
VLYDAVVVACGPRSVSTLSDDGYAVHFVTEAYKHLKPIGAYGAGVDLLRK
AGIDNRLAEDTDVLNDQAVVTTKAAADELPERFAEEFAAALAQHRCWQRR
TDAVPA
>MAP1301 chaA, ChaA
MSKWLSRNVLSWTVVVPVLAVVVLALIWGERLGPVLVALAALFLIGAVVA
AVHHAEVVAHRIGEPFGSLVLAAAVTIIEVALIVELMASGGNETATLARD
TAFAALMITTNGIAGLSLLLGSRRYGVTLFNAEGSGAALATLTTLATLSL
VLPAFTTTQVGKEFSPGQLTFAAVASLLVYLLFVFTQTVRHRDFFLPIAQ
KGQKSLFEDESHADPPSTREALVSLVLLLCALVAVVGLAEEESPAVERAV
TAVGFPQTFVGVIIAALVLLPETLAAVRAARQGRIQISLNLAYGSAMASI
GLTIPAIALASIWLKGPLVLGLGAIQLVLLALTVVISVLTVVPGRATRLQ
GEVHLVLLAAYVFLAVSP
>MAP1810 cobG, CobG
MARTRDADACPGALQVHQAADGALARVRLPGGMLTPAQLTVLCDIADRLG
SPTLELTARGNVQLRGLTDVTAAAGALAAAGLLPSATHERVRNIVASPLS
GRSGGNLDVRRWVGELDAAIRAQPRLSELGGRFWFSLDDGRADVSGLRAD
VGVHVLPDGCAVLLAGRDTAVRLPPDQVVATLVGVATRFVQVRGSAWRVQ
ELDDPNQLLPGAECGPIAYPAVTKPPVGWITQDDGRVTLGAAVPLGLLSA
RVAEYLAALQAPLVITPWRSVLVGDLREEVADAALRVLAPLGLVFDENSP
WLSVSACTGSPGCARSTADVRADAALAVREGTAPGHRHFVGCERACGSPL
AGEVLLATGEGYRQLR
>MAP2542 corA, CorA
MFQGFDALPEVLRPIAHQPHPQPAPEAPPARATLVDCAVYDDGNRLPGVF
GYADALDKVREIESQGREGFVWVGLREPNQTEMQEVADVFGLHALAVEDA
VCAHQRPKVERYDDTLFLVLKTVNYVPHESVVLAREIVETGEIMVFVGRD
FVVTVRHGEHGGLSEVRKRMDGDPEQMRLGPFAVMHAIADHVVDHYLEVS
SLMLADIDSIEGLAFAPGSKIDVEPIYLLKREVVELRRCVNPLSSAFHRI
QTENKDVISKEVRRYLRDVADHHSEAADQIASYDDMLNSLIQAALARVGM
QQNNDMRKMAAWAGILAVPTMIAAIYGMNFHFMPELNWTWGYPAVMAGMA
VVCLVLYFQFRNRNWL
>MAP4284 ctpA, CtpA
MSTPPRHVDEGTFPDHTASTARIELEITGMTCASCAARIEKKLNKLDGVT
ATVNYATEKAAVSAPASYDPQTLITEIENAGYAAAVAKPSPPRDDPELAS
LRRRLVTATALAGPVIAVAMIPALQFQHWHWAALALTAPVVGWCGRPFHA
AAWANLKHGVATMDTLISIGTLAAFLWSLYALVLGAADRPGMRHDFELTV
GHGAHVSHVAAPCHVYFEVAAGVTLFVLAGRYFERRSKRTAGAALRALLA
LGAKDVAVLRAGAETRIPIERLAVGDEFVVRPGERIATDGIVVAGSSAVD
AAMLTGESVPVEVGVGDGVTGGTVNAGGRLVVRATGIGDDTQLARMAQLV
ERAQSGKADAQRLADRVSGVFVPVVLLLAVATLAGWLTAGGTLATALTAA
VAVLIIACPCALGLATPTALLVGTGRAAQLGVLIKGPEVLETTRAVDTVV
LDKTGTVTTGAMTVLDVVAADGTDRATLLRYAGALEAASEHPIAHAIARD
AKAELGPLPTPTGFRAVGGGGVHGRVDGHAVAVGRPRWLAERGLRPDAAL
AAAAARAEHDGKTVVAVGWDGRARGILALADTVKPCSAAAVRQFTRLGLT
PILLTGDNHTVARRIAGELGIGEVISGALPADKVEAVKRLQSAGRVVAMG
TGTDVAIEAADVTVVRGDLRAAVDAIRLSRRTLATIKTNLVWAFGYNLAA
IPLAALGMLNPMLAGAAMALSSVLVVGNSLRLRSFASIIPGA
>MAP3384 ctpC, CtpC
MNLASVRAIGDEGLTKDPALQVMSDAAGRMRVSVGWVRADSRRAVAVEEA
VAKCDGVRVVHAYPRTGSVVIWYSPRRCDRSAVLAAIGEAAHVTAELIPA
RAPHSSEIRNADVLRMVIGGAALALLGVRRYVFARPPLLGPSGRLFATGV
TVFTGYPFLRGALRSLRSGRAGTDALVSAATVASLVLRENVVALTVLWLL
NIGEYLQDLTLRRTRRAISELLRGSQDTAWIRLEHNEIQVATDTLQIGDE
VVVHDHVAIPVDGEVIDGEAIVDQSAITGENLPVSVVVGMPVHAGSVVVR
GRLVVRARAVGNQTTIGRIITRVEEAQHDRAPIQTVGENFSRRFVPTSFI
VSAITLAVTGDVRRAMTMLLIACPCAVGLATPTAISAAIGNGARRGILIK
GGSHLEQAGQVDAIVFDKTGTLTVGRPVVTNIVAMHKDWEPEQVLAYAAS
SEIHSRHPLAEAVIRSTEERHITIPPHEECEVLVGLGMRTWADGRTLLLG
SPSLLQAEKVKVSKKAKEWVDKLRRQAETPLLLAVDGTLVGLISLRDEVR
PEAAGVLKKLRANGIRRIVMLTGDHPDIAAVVADELGIDEWRAEVMPEDK
LAAVRDLQEEGFVVGMVGDGINDAPALAAADIGIAMGLAGTDVAVETADV
ALSNDDLHRLLDVRDLGSRAVDVIRENYGMSIAVNAAGLIIGAGGALSPV
LAAILHNASSVAVVANSSRLIRYRLN
>MAP0843 ctpE, CtpE
MNTGLTDAEVAQRVAHGQRNAVRQRATRSIADIVRANVFTRINAILGVLL
LIVLATGSVINGMFGLLIIANSVVGMVQEIRAKQTLDKLAIVGQAKPMVR
RQSGTRALPPDDVVLDDIIELGPGDQVVVDGEIVEEANLEVDESLLTGEA
DPIAKAVGDSVMSGSFVVAGSGAYRATRVGSQAYAARLAEEASKFTLVKS
ELRNGINRILQFITYLLVPAGLLTIYTQLFTTHAGWQKSVLRTVGALVPM
VPEGLVLLTSVAFAVGVVRLGQRRCLVQELPAIEGLARVDVVCADKTGTL
TESGMRVARVDELDGSGHDRIADVLAALAAADPRPNASMRAIAQTYSRPP
GWTVTATAPFKSATKWSGVSFAGHGDWVMGAPDVLLDSGSAAAGQAERLG
AQGLRVLLLGAADRAVDHPDAPGPITPVALVVLEQKVRPDARETLDYFAD
QGVSVKVLSGDNAVSVGAVAGELGLHGETLDARQLPSDLAQLADMLDTYT
TFGRVRPDQKRAIVHALQSHGHTVAMTGDGVNDVLALKDADIGVAMGAGS
PASRAVAQIVLLDNRFATLPYVVGEGRRVIGNIERVANLFLTKTVYSVLL
ALLVGFECLFAKALKADPLLYPFQPIHVTVAAWFTIGIPSFILSLAPNNE
RAHPGFVRRVLSSALPSGLIVGAATFASYLVAYHGRHATFQQQDQASTAA
LITLLVTALWVLAVVARPYQWWRVALVIASGLAYVVIFSLPLARKAFLLD
PSNVVVTLSALGIGVLGAAAIEVAWWIRAKMLGVRPRVWR
>MAP3498c ctpI, CtpI
MKIPGVSSVVAGVAGGAAQVVRAGVSTAAGAAGALQTLASPVAELAGPVI
QSMAQTTGRAIGLDGSADGAPAIVPPVRWHSGRRVHLDLDPLLPFPRWHE
YAPAVEEPVRRIPGVAKAHVEGSLGRLVVELADDADDAAVLDEVRSTVAS
VAADISWSKAEAAPPSAPFADPGNPLAILVPLTAAALDLVAMGAAVTGWV
TRLPAAPQTTRAAAALINHQPRMVSILEARLGRVGTDIALAATTAAAHGL
TQSFGTPLLDLTQRTLQISEAAAHRRVWRDREPQLASPDRPQAPVVPVIS
SAKSEVPRHSWAAAAAGEASHVVVGGTIDAAMDKAKGSMAGPVESYVDSA
ANGSLIAAVSALVAGGGTEDVAAAIEAGVPRAAHMGRQAFAAVLGRGLAN
SGQLVLDPGALRRLDRVKVVVIDGAALRGDHRAVLRVRGEAPGWDDDRVY
EVADALLHGEEAPEPDPDELPATGARLRWVPSQGPSAMPAQGLESADLIV
DGDRVGRVDVGWEVDPYAIPLFQTAHRTGARVVLRHVAGTEDLTASVGAT
HPPGTPLLNVVRELRADRGPVLLITALHRDFASTDTLAALAIADVGVALD
DPRAATAWTADIITGTDLADAVRILSAIPVARSASESAVHLAQGGTTLAG
LLLVTGEQEKGASPVSFRRWLNPVNAAAATALVAGTFSATRVLRLPDPTP
QPLTAWHALDPEIVYSRLAGGARPLAVETEPSWRRRLDDLSYSPALAPLR
APLQNVLRLASATRTELADPLTPILAVGAAASAIVGSNIDALLVAGVMTV
NAITGGAQRLRAESAAAELFAEQDQMVRRVVVPAVATTRRRLEAARHATR
TATVSAKSLRPGDVIDLAAPEVVPADARLLEAEDLEVDESLLTGESLPVD
KQVEPVAVNDPDRASMLFEGSTIVAGHARAIVVATGVGTAAHRAISAVAD
VETAAGVQARLRELTSKVLPLTLAGGAAVSTLALLRRASLRQAVADGVAI
AVAAVPEGLPLVATLSQLSAAQRLTARGALVRAPRTIEALGRVDTVCFDK
TGTLTENRLRVVCAVPDDVNPHDPFPELTAPQSAELVRAAARASARPQEG
QGHAHATDEAILTAASSLNGQRDSDWSMIAEVPFESSRGFAAAIGTVGNA
NGNPSDTPVLILKGAPEVILPRCRFADPEADQQRAEAVVRGLAEQGLRVL
AVAQRGWKHDTDDDDTDADAVDAAAHNLELLGYVGLADTARASARPLIEA
LLDAERDVVLITGDHPITARAIARQLGLPADARVVTGAELAGLDEDACAK
LVADVQVFARVSPEQKVQIVAALQRCGRVTAMVGDGANDAAAIRMADVGI
GVSGRGSSAARGAADIVLTDQDLSVLLDALVEGRSMWAGVRDAVTILVGG
NVGEVLFTIIGTAFGAGRAPVGTRQLLLVNLLTDMFPALAVAVTSQYVEP
DEAEYPSAADAEAARREHRRAVLTGPTPSLDAPLMRQIVTRGAVTAAGAT
AAWAIGRWTPGTERRTATMGLTALVTTQLAQTLLTRRHSPLVVATALGSA
GVLVGIVQTPVLSQFFGCTPLGPVAWTGVLGSTAGATAISALAPNWLAKQ
VAALEPGQQDA
>MAP2210c cysA, CysA
MIDAGTDRGDIAITVRDAYKRYGDFVALDHVDFVVPTGSLTALLGPSGSG
KSTLLRTIAGLDQPDTGTVTIYGRDVTRVPPQRRGIGFVFQHYAAFKHLT
VRDNVAYGLKVRKRPKAEIKAKVDNLLEVVGLSGFQGRYPNQLAGGQRQR
MALARALAVDPQVLLLDEPFGALDAKVREDLRAWLRRLHDEVHVTTVLVT
HDQAEALDVADRIAVLNQGRIEQIGSPTEVYDAPTNAFVMSFLGAVSTLN
GTLVRPHDIRVGRTPEMAVAAEDGTAESTGVARAIVDRVVKLGFEVRVEL
TSAATGGPFTAQITRGDAEALALREGDTVYVRATRVPPITAGATTVPALS
RDGADEATLTSA
>MAP0645c cysA3, CysA3
MARSDVRVSTDWAESNLDTPGVVFVEVDEDTSAYHAGHIPGAIKLDWRSD
LQDPVKRDFVDAQQFSKLLSERGIANDDTVILYGGNNNWFAAYAYWYFKL
YGHEKVKLLDGGRKKWELDGRTLSSDPVSRPATSYTAAAPDNSIRAFRDE
VIAAINVKNLVDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSK
AANEDGTFKSDEELAALYAAAGLDTGKETIAYCRIGERSSHTWFVLYELL
GHRNVKNYNGSWTEYGSLVGAPIELGS
>MAP2484c cysN, CysN
MAAPTTLLRLATAGSVDDGKSTLIGRLLYDSKAVMEDQWAAVEQTSKDRG
HDYTDLALVTDGLRAEREQGITIDVAYRYFATPKRKFIIADTPGHIQYTR
NMVTGASTAQLVIVLVDARHGLLEQSRRHAFLASLLGIQHIVLAVNKMDL
IGWDREKFESIRDEFHAFAARLDVHDVATIPISALHGDNVVTKSDQTPWY
EGPALLSHLEEVYIAGDRNLVDVRFPVQYVIRPHTHEHQDHRSYAGTVAS
GVMRPGDEVVVLPVGKRTRITAIEGPNGPVQEAFPPMAVSLTLADEIDIS
RGDLIARTHNQPRIAQDFDATLCWMADNTTLEPGRDYVIKHTTRTTHARV
TGLDYRLDVNTLHRDKTATALKLNELGRISLRTQVPLLLDEYTRNPSTGS
FILIDPHTNGTVAAGMVLRDASAQAASPNTVRHKSSAIAAARPRGKTVWF
TGLSGSGKSSVAMLVEQKLLEKGAQAYVLDGDNLRHGLNADLGFSMADRA
ENLRRLAHVAALLADCGNVVLVPAISPLAEQRELARKVHADAGFDFIEVF
CDTPIEECEKRDPKGLYAKARAGEITQFTGIDSPYQPPANPDLRLIPDGT
VEEQAQRVIDLLESRG
>MAP1877c cysQ, CysQ_1
MTPRNPSDEMTDAALATDLAAEAGELLLKVRDEVGFGYPWALGDAGDSLA
NALILGRLQAERPDDAVLSEEAYDDLSRLQHDRVWIIDPLDGTREFSTPG
RDDWAVHVALWQRPTNGRREITDAAVLPARGNIVYRSDTVTASAARVGVT
DTIRIAVSATRPPAVLHRMRQRLPIQPVAIGSAGAKAMAIIDGVVDAYLH
AGGQWEWDSAAPAGVVMAAGMHASRLDGSPMRYNQLDPYLPDFVMCRAEL
APVLLGAIRDAWR
>MAP2058c cysQ, CysQ_2
MNDHELAARLATEAGRLLLGVRDEFADAPASERKAAGDKRSHDFLIEALA
AERPGDAVLSEEGADDPVRLRSERVWIVDPLDGTREFSELGRDDWAVHVA
LWEAGELVAGAVALPAQGVTFATPEVASPPVAPGKPRIVVSRTRPPAIAL
NVRDALDGVLVEMGSAGAKVASVVQGLSDVYVHAGGQFEWDSAAPVAVAR
AAGLHTSRIDGSTLAYNQPDPKLPDLVVCRPELADAVLAVTR
>MAP2211c cysW, CysW
MTSSPGVRYGLRFVALAYIFVLLVIPVSLILWRTFRPGFGQFYAWVSTPA
AISALNLTLLVVAIVVPLNVFFGIPTALVLARNRFRGKGVLQAIIDLPFA
VSPVIVGVALIVLWGSAGALGFVEKDLGFKIIFGLPGIVLASIFVTLPFV
VREVEPVLHELGTDQEEAAATLGSGWWQTFWRITLPSIRWGLTYGIVLTI
ARTLGEYGAVLMVSSNLPGKSQTLTLLVSDRYNRGAEYGAYALSTLLMGV
AVLVLVFQVVLDARRGRAAGQA
>MAP0410 dppB, DppB
MGWYIARRIAVMVPVFLGATLLIYAMVFLLPGDPVAAIAGDRPLTPAVAA
ALRARYHLDDPFLVQYLRYLGGVLRGDLGRAYSGLPVSDVLAHAFPVTLR
LSLIALAVEAVLGIGFGVIAGLRQGGLFDSAVLITGLVIIAVPIFVLGFL
AQFVFGVRLGIAPVTVGNAATFTRLLLPGIVLGSVSFAYVVRLTRSAVAA
NAHADYVRTATAKGLSQPRVVTVHILRNSLIPVVTFLGADLGALMGGAIV
TEGIFNIHGVGGVLYQAVTRQEAPTVVSIVTVLVLIYLVTNLVVDLLYAV
LDPRIRYG
>MAP0411 dppC, DppC
MAERIRARGGFWRETWRRLRRRPKFIGAGLLILVILAVALFPALFTAADP
TYADPAQSMLPPSRTHWFGTDLQGHDVYARTVYGARASVTVGLGAAAIVF
VVGGALGALAGFYGGWIDAVVSRVTDVFFGLPLLLVAIVLMQVLHHRTVW
TVIAILALFGWPQVARIARGAVLAVRASDYVLAAKALGMSRFQILIRHAL
PNALGPVIAVATIALGLFIVTEATLSYLGVGLPPSVVSWGGDINLAQTRL
RAGSPILFYPAGALAATVLAFMMMGDALRDALDPASRAWRA
>MAP2915c efpA, EfpA_2
MTALNDSERAVQNWTSARPDRPAPVRSTPPAETAPKPAAETAVKRTSKYY
PAWLPSRRFIAAVIAIGGMQLLATMDSTVAIVALPRIQNELSLSDAGRSW
VITAYVLTFGGLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEA
TMVIARLSQGVGSAIASPTGLALVATTFPKGPARNFATAVFAAMTAVGSV
MGLVVGGALTEVSWRLAFLVNVPIGLVMMYLARTALRETNRERMKLDATG
AVLATLACTAAVFAFSMGPEKGWISLTTISSGVVALGAALAFVIVERTAE
NPVVPFDLFRDRNRLVTFIAIFLAGGVMFTLTVCIGLYVQDILGYSALRA
GVGFIPFVIAMGIGLGVSSQLVARFSPRVLTIAGGYLLVLAMLYGWWCMH
RGVPYFPNLVLPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAVTLML
QSLGGPLVLAVIQAVITSRTLYLGGTTGPVKTMNDAQLQALDHGYTYGLL
WVAGAAVIVGAAALFIGYTPEQVAHAQEVKEAMDAGEL
>MAP1868c efpA, EfpA_1
MTMSVAPTTRLWSRQFVAVIVAIGGMQLMVAMDGPVAVFALPKIQNEMGL
SDAARSWVITAYMLTFGGLMLLGGRLGDTIGRKRAFLVGVALFTFASGLC
GIAWAGGTLIAARLLHGAAAAIIAPTNLALIATTFPRGSARNAATAVFGA
MTGLGGVLGLVVGGALTDVSWRLAFLVNVPIGLAVIYLVLITRQETQTER
IKLDVTGAVLATVTGTAAVFGISMGPEAGWRSPITIGSGVVALAAFVAFV
VVERTAENPIVPFNLFLDRNRLAAFAAMFLAGGVFFTLTVLVGLYVQTMM
GYSPLRAGVAFIPFGLAMAIGVGVASKLVTWFPPRVVVIASGGLILGATL
YGSTFNRGMPYFPNLVVPLIVCAIGIGAVFVTLTLSVIASVDVDRIGPTS
AIAVMLQTLGGPLVLVVVQVAITSHALRLGGTLGPVKSMNAAQLHALDRG
YTYGLLWLAGVVALLGGVALLIGYTAAQVARAQEVKKAVDAGEL
>MAP3127 emrE, EmrE
MPGTGHNLRFLSCPVHTYLYMRARGGVLTYLFLICAILAEVVATSLLKST
QGFTRLWPTVICLLGYAVSFALLAVSISRGMQTDVAYALWSAIGTALIVL
IAVLFLGSPISVTKVVGVGLIIAGVVTLNLTGAH
>MAP3092 fecB, FecB
MLFGVIRPARLAAVTAALMVACGGCGSDRPAATTTRSLVTPTTQIAGAGV
LGNDRRPDESCARDAAEADPGPAKRQVHNAPGADPAPVQVSADPQRIVVL
AGDQLDALCALGLQSRVVGAALPDGASGQPAYLGGAVRGVPGVGSRSHPD
VKAIAAAHPDLILGSQGLTPALYPQLAAIAPTVFTAAPGAAWRDNLRAVG
AATARAGAVDGLLSGFSQRAGDVGARHDASHFQASIVQLTTGSIRVFGAN
NFPASVLGAVGVDRPAAQRFTDKPYLEIGATDADLAKNPDLSVADADVVY
LSCATPAAADRAATVLDSGPWRKLSANRDNRVYVVNDEIWQTGQGLIAAR
GIVDDLRLVNAPIN
>MAP3710c fecB2, FecB2
MRQGWNRRGFLQLAGAAGVAATAGAAGLSAGCSAHQPPPGGAGPGSVTVT
HLFGQTVVKEPPKRVVSAGYTEQDDLLAVGVVPIAVTNWFGDQPFAVWPW
AADKLGGAQPTVLNLDNGIPVDQIAGLKPDLIVAINAGLDADTYQKLSAI
APTVAQSDGDAFFEPWKEQAAAVGQAVFAAEKMKSLVAGVDQKFTDIGKK
NPQWTGKKALLMHGALWQGTVVATMAGWRTDFLNQMGLVIADSIKPFGTD
QRAVIPRDHIKSVLESADVVIWTTQNPDDQKALLADPEVAGSLTTAQNRH
IFTTKDQAGAIAFASPLSYPVIADQLPPQLTKILG
>MAP1669c furA, FurA
MSSTADYADRLRMADLRVTRPRVAVLEVVDANPHADTETIFSAVRMALPD
VSRQAVYDVLNALTAVGLVRRIQPLGMVARYESRVGDNHHHVVCRSCGTI
ADVDCAVGEAPCLTPSDDDNVLDGFVLDEAEVIYWGLCAECSTAGS
>MAP2139 furB, FurB
MTPSGDGAGVSVRSTRQRAAISTLLETLDDFRSAQELHDELRRRGENIGL
TTVYRTLQSMAAAGLIDTLRTDTGESVYRRCSEHHHHHLVCRSCGSTIEV
GDHEVEEWATAVAAKHGFSDVSHTIEIFGTCSECR
>MAP1668c katG, KatG
MSSDTSASRPPQPDTRTASKSESENPAIPSPHPKSNAPLTNRDWWPNQID
VSRLHPHVAEANPLGEDFDYAEEFAKLDVEALKADVISVMTTSQDWWPAD
YGHYGGLFIRMSWHAAGTYRIHDGRGGGGQGMQRFAPLNSWPDNVSLDKA
RRLLWPVKKKYGNKISWADLIIFAGNCALESMGFKTFGFAFGREDVWEPE
EILWGEEDEWLGTDKRYPGTGERELAQPYGATTMGLIYVNPEGPEGKPDP
IAAAIDIRETFGRMAMNDEETAALIVGGHSFGKTHGAGDADLVGPEPEAA
PIEQQGLGWKSSHGTGVGKDAITSGLEVVWTPTPTKWDNTFLETLYGYEW
ELTKSPAGAWQFTAKDGAGAGTIPDPFGGPGRAPTMLVTDISLRESPIYR
DITRRWLDHPEELADAFAKAWYKLLHRDMGLVSRFLGPWVPEPQLWQDPV
PPVDHPLVDDNDVAALKDKVLASGLSVPQLVKTAWSAAGSYRNTDKRGGA
NGGRLRLQPQRNWEANEPSELDKVLPVLEKIQQDFNASASGGKKISLADL
IVLAGSAAVEKAAKDAGYEISVHFAPGRTDASQESTDVDSFAVLEPRADG
FRNFARPGEKAPLEQLLLERAYLLGVTGPEMTVLVGGLRALGANHGGSKH
GVFTDRPGALTNDFFVNLLDMGTEWKASETAENVYEGHDRATGALKWTAT
ANDLVFGSNSVLRALAEVYAQDDNQGKFVEDFVAAWVKVMNNDRFDLK
>MAP1000c kdpA, KdpA
MSSTTAGLIFLAVLVAALVAVHVPLGDYMFRVYTTDRDLATERTIYRLIG
VDARSEQTWGAYARGVLAFSSVSIIFLFVLQLVQGKLPLHLHDPATKMTP
SLAWNTAVSFVTNTNWQAYSGETTQGHLVQMAGLAVQNFVSAAVGMAVAV
ALVRGFARRRTGELGNFWVDLVRGTLRILLPISIVGAVLLVAGGAIQNFH
LHDQVVTTLGGTAQTIPGGPVASQEVIKELATNGGGFYNANSAHPFENPT
AWTNWLEVFLILVIGFSLPRTFGRMVGNPKQGYAIASVMASLYLLSTGFM
LWFQLQHHGTVPSAVGAAMEGVEQRFGVPDSGVFAAATTLTSTGAVDSAH
DSLTSLGGMITMFNMQLGEVAPGGTGSGLYGMLVLAVITVFVAGLMVGRT
PEYLGKKINPREIKLAASYFLVTPLIVLTGTAIAMALPGERAGMANSGPH
GLSEVLYAFTSAANNNGSAFAGLSANTEWYNTALGLAMAFGRFLPIVLVL
ALAGSLARQGSTPDSAGTLPTHRPQFVGMVAGVTLIVVALTFLPMLALGP
LAEGIH
>MAP0997c kdpC, KdpC
MTLSNFIRLHWAALRALLVLTVITGLAYPLLVWVVAQFPGLRDHAEGSIL
TANGKPVGSRLIGQLFTDKDGNPLPQYFQSRPSAAGTGYDPTSSGGSNLG
PESIVDAPGKPGLLTTVCSRSAAVAKLEGVDGSRPFCTGGGVGAVLSVIG
PRDERGNVTHPARVVSVNEPCESTPTPFVSLYEGVRVECAKTGEDYSLGQ
IVPIRGAAPAAPAVPADAVTAGGSGLDPNISPAYADIQVARVAKVRHVRP
EQIRELVAQNSSGRALGFFGEPCVNVLQLNLQLDHRYPVTS
>MAP3349c kefB, KefB
MQISGTLLLQLGALLATLAVLGAAARRFALSPIPVYLLAGLALGKGGLLP
LATGGQFITTSAPIGVVLLLLTLGCEFSLAEFSSSMRHHLPSAAVDVVLN
AAPGAIAGWLLGLDGVAILCLAGVTYISSSGVVARLLEDLHRLGNRETPA
VLSVLVLEDFAMAAYLPLFAVLASGGGWLHAVVGMVVAVSALVAAFAASY
RWGHHVQRLVEHPDSEQLMLRVLGITLIVAAMAESLHASAAVGAFLVGLT
LTGETATRTRQVLAPLRDLFAAIFFLAIGYSVDPHELIPMLPAALILAAA
TAATKVATGIFAARHDGVARRGQLRAGTALIARGEFSLIIIGLAGSSLPA
VAALATSYVFIMAIAGPVLARYTGPRPAAPAT
>MAP1565 modA, ModA
MRRIGILTGLLSVVLIAGMTGCGSKSQPPPTAGKLMVFAAASLRPAFTQI
AERFKAQNPGTGIEFEFAGSSELATQLTQGATADVFASADTAQMDVVAKA
GLLAADPTNFASNTLVIVTAPGNPKRIGSFADLTRPGLTVVTCQRPVPCG
AAAHRVEDSTGVHLNPVSEEPSVTDALTKVTSGQADAALVYVTDARTAGS
KVATVNFPEAAGAVNVYPIGVLKQAPLATQARNFVDLVTSPPGQQILAQA
GFAKP
>MAP1567 modB, ModB
MPRWVYLPAAAGTMFVVLPLLAIAVKVDWPHFWWLITSPSSRTALLLSLR
TAAASTALCVALGVPMALVLARGGTRLVRLLRPLILVPLVLPPVVGGIAL
LYAFGRLGLLGHYLEAAGISVAFSTTAVVLAQTFVSLPFLVISLEGAART
AGADFEVVAATLGARPTTVWWRVTLPLLLPGVVSGAVLAFARSLGEFGAT
LTFAGSRQGVTRTLPLEIYLQRVTDADAAVALSILLVVVAAVVVLGLGAR
RLTGTDAR
>MAP1568 modC, ModC
MSELQLRAVVSQRRFEVEFSVAAGEVLAVLGPNGAGKSTALHVIAGLLRP
DRGLVRLGDRVLTDTAAGIDVPTHDRRVGLLLQDALLFPHMSVAANVAFG
PHSRRPMWRRGRRAEKATALCWLREVDAETLADRKPRQLSGGQAQRVAIA
RALAAEPDVLLLDEPLAGLDVAAAAAIRSVLRRVVTRIGCAAMLVTHDLL
DVFTLADRVLVLESGRIAEIGPVADVLTAPRSHFAARVAGVNLVNGTAEG
DGALLARSGARWYAAPAAPAGPLASGQRAVAVFPPTAVAVYREQPHGSPR
NTVEVTVAEMDVRGAAVLVRGAQQPDGAPGLAAEITVDAASELRLTPGDR
VWFSVKAHEVVLYPATAAAER
>MAP3306c moeZ, MoeZ
MSTPLPPLVAPADQLTADEMARYSRQLIIPGLGVDGQKRLKNARVLVIGA
GGLGAPTLLYLAAAGVGTIGIVEFDAVEESNLQRQIIHGVADVGRSKAAS
ARDSIAAINPLVDVRLHEFRLDASNAVELFGHYDLIVDGTDNFATRYLIN
DAAVLAGKPYVWGSIYRFEGQVSVCWEDAPDGRGLNYRDLYPEPPPPGAV
PSCAEGGVLGVVCASIASVMSTEAIKLITGIGESLLGRLMIYDALEMSYR
TIAIRRDPCDASRPAITTLVDYEQLCGAAPAASTDAATGGAEAAITPRQL
RELLDSGAKLALIDVREPVEFDIVHLDGAQLIPQSSINSGEGLAKLPADR
MPVLYCKTGVRSAQALAVVRQAGFSDAVHLQGGIVAWAQQMQPDMILY
>MAP1790 morD, MorD
MTERSEYSDGRAMALERSCAVTAVALSEQRREGVRLVTGERRGFGLDAAL
TFVHLPYPAPSDWTRRTLTCGVALQCSPSKERVTEFRLNELSARELRALT
LVEGAVALGWIASRWPGLLPEVQRLLPDVHPQAADMDGAQMLDRAIGLAA
TGLELTVPPLLGALPLAYTAPQGLTDRLRRSFGRMPWTTTQKRRPRPYSV
PVGGDGGVRNPNLPPPSRPQDNDLDVTPQHRPGIPYPEWNMWTQRFMHDH
VAVVEHADGRRLRRPVPVAVDVRKWFEEHTHRAMTSRLEDGSDLDVDQYV
SHYIDLTTGEAKEPRVFRDLLPSGRDVTTALLLDGSSSLGVHGGRVFQLE
LACADALSRAMTLARERHGVFVFTGNTRHRVEVRCLKDFEDRRFVPPSTL
GLSTRGYTRLGAPLRHLTSRLLAQPAERRLLIVIGDGLISDEGYEGRYAW
ADAAHAVEEANDAGVSMYYVGVGPTRVDPLPEVFGPRRSQRIRRIEELPR
VLAHVHRELVAA
>MAP1626c nanT, NanT
MTKPSPGRKLTADQRNSFIAALLGWTMDAFDYFIVVLVYADIAKTFHHSK
AEVAFVTTATLIMRPVGALLFGLWADRVGRRLPLMVDVMFYSVVGFLCAF
APNFTVLVILRLLYGIGMGGEWGLGAALAMEKVPVERRGFFSGLLQEGYA
FGYLLASVASLVVMDWLELSWRWLFGLSIVPALISLIIRYRVEESEVWEA
AQDQLRLTSTRIRDVLRNGAIIRRFVYLVLLMTAFNWMSHGTQDVYPTFL
GAHANHGAGLSSTTVKWIVVVYNVGAIIGGLVFGTLSQRFSRRYTVVFCA
MLALPIVPLFAYSRTAAMLGLGSFLMQLFVQGAWGVIPAHLTEMSPDAIR
GLYPGVTYQLGNLLAAFNLPIQERLAETHGYPFALAATIVPVLLTVAVLT
LIGKDATGIRFATSESAFLPTEMT
>MAP3707c narK3, NarK3_2
MGRDHRITDWNPEDAAAWEAGNKRIARRNLLCTIAGDHVAFSIWTLWPVM
ALFMPAAVYGFSAGDKLLLGAVATLVGGCARIPYTLGIAAFGGRKWTTFS
AVVLLIPTVGTIVLLANPGLPLWLFVLCAALTGLGGGNYAASLANVNAFY
PQRLKGAAMAVNAGVANLGVAVIQLVGLLVLATAGHQAPYWVCAVYLVFL
AAVAIAAAMFMDDINHGTQLSTMRSILFERDTHVISLLYIATFGSWIGFS
FAFGQVMQVNFLENGESAKHAALHAAQLAFIGPLLGSVARIYGGRLADRV
GGSRVTLGVLAAMTLAAGLLVIVSTVDDQHAGAHSVSMIGYVVGFMVLFI
LSGMGNGSVFKLIPSVYEARSRGRDTSEDERRQWARAMSGSLIGICSAVG
AFGGVAINLALHQSYLSTGTETSAYWMFMASYVVAAIMTWLVYVRRPVAA
PGVSLPETQAARL
>MAP2102c narK3, NarK3_1
MARTRRIAHWDPEDLVAWEAGNKLIARRNLIWSIATMHVAFSIWYLWSVM
VLFMPQARYGFTTGDKLLVGATAALVGALVRIPYAMGTARLGGRNWAVLS
SLVLLIPTLAAIVLLAHPGLPLWPYLVCAALTGLGGGNYAAALANVESFF
PQRRKGFALGLTGGVGNLGAAGIQAVGLVVLATAGNQAPYWVCAVYLVLL
ALVGVGAALFMDNLDHSVEVGHVRSVLTVPDTWTISFLYMCASGSFIGFA
FAFGQVIAHNLIAGGQTHGQAALHAAEIAFAGPLLGSMARVVGGKLGDRF
DGGRVTLTVLAAMIVAGGFLVAVSTHDDLTRPSGAPVSLYTTAGYIAGFI
ALFICCGVGKGSVFKLIPSVFAQRSRALELGDTERRHWERARSAALIGFA
GSFGALGGVVINLALRQSYASTGSSTPAFGAFMLCYVAAAALTWARYVRP
RRARAAHRAGLAADGRAAGQLVRASDRFSGCSISA
>MAP3712 narU, NarU
MTTTITPVPQPAAAPERHKGRHWIDDWRPEDPEFWETTGKAIARRNLIFS
IFAEHVGFSVWMLWSIVVVHMTAGPHGHPSASGWALTASQALCLVAVPSG
VGAFLRLPYTFAVPVFGGRNWTTISAALLLIPCLLLAWAVSHPGIPFGVL
VAIAATAGFGGGNFASSMANISFFYPEKDKGWALGLNAAGGNIGVAMVQK
VIPPVVIAGGGVALSRAGLFYVPLAVVAAVCAFLFMNNLTEIKADVKPVW
QSLRHADTWIMSLLYIGTFGSFIGYSAAFPTLLKTVFGRGDIALAWAFLG
AAIGSVIRPLGGKLADRVGGARITAASFVMLAVGAAAALWSVKAVNLPAF
FASFMFLFVATGIGNGSTYRMISRIFKIKGELAGGDPDTMVTMRRQAAGA
LGVISAVGAFGGFIVPLAYAWSKSQFGSIEPALRFYVVFFLGLLAVTWYC
YLRKHNAITRVGI
>MAP2924 nicT, NicT
MPALRPARRGTYSRARCPVPDTARAGRTPVTSTEIDRWPARATRFLGALA
PTEWWRLASMLGAILALHLIGWLTLVLLVAPGQYSLGGKAFGVGVGLTAY
TLGLRHAFDADHIAAIDNTTRKLMNDGQRPLAVGFFFSLGHSTVVFALAV
LLACGVRTVVGPVRDDSSALHHYTGLIGTSVSGVFLYAIALLNVVVLVGI
LRVLARVRRGDYDPHTDAAELERQLDNRGLMNRWLGRFTKSITQSWHCYP
VGLLFGLGFDTATEVALLVLAGTSAAAGLPWYAILCLPVLFAAGMCLLDT
IDGSFMNFAYGWAFSNPVRKIYYNIIITALSVAVAWVIGSIELLVLFADE
FGWRGSFWDWLGGLDLNTVGYAVVGMFVLTWAVALLIWRYGRIEERWAGA
DPRAGTGREA
>MAP2208 nirA, NirA_2
MTTARPAKARNEGQWALGNREPLNPNEEMKQAGAPLAVRERIETIYAKNG
FDSIDKSDLRGRFRWWGLYTQREQGYDGSWTGDENIEKLEARYFMMRVRC
DGGAISAAALRTLGQISVDFARDTADITDRENIQYHWIEVENVPEIWRRL
DAVGLRTTEACGDCPRVILGSPLAGESLEEVIDPSWAIAEIARRYIGQPD
FADLPRKYKTAISGLQDVAHEVNDVAFIGVNHPEHGPGLDLWVGGGLSTN
PMLAQRVGAWVPLHEVPEVWAAVTSVFRDYGYRRLRSKARLKFLVKDWGI
EKFREVLETEYLKRPLIDGPAPEPVAHPIDHVGVQRLKNGLNAVGVAPIA
GRVSGTILLAVADLAQQAGCDRIRFTPYQKLVLLDIPDDKLDEVVAGLEA
LGLQSQPSHWRRNLMACSGIEFCKLSFAETRVRAQGLVPELERRLADVNR
QLDVPITINLNGCPNSCARIQVADIGLKGQMVDDGEGGSVEGFQVHLGGS
LGQDSGFGRKLRQHKVTSDELGDYIERVARNFVKYRGEGERFAQWAMRAD
EDDLR
>MAP2035 nirA, NirA_1
MTTARPVKTRNEGQWALGDREPLNDTEKIKLADGPLNVRERIINVYAKQG
FDSIDKSDLRGRFRWMGLYTQREQGYDGSWTGDDNTDKIEAKYFMMRVRS
DGKAMSAHTMRTLGQISTEFARDTADISDRENLQLHWIRIEDVPEIWRRL
ESVGLQTTEACGDCPRGIHGSPLAGDSLDEVLDPSPAIEEIVRRSLNNPE
YANLPRKYKTAVSGLQDVSHETHDVAFVGVEHPEHGPGLDLWVGGGLSTN
PMLAQRLSVWVPLDEVPDVWEAVTQLFRDYGYRRLRAKARLKFLVKDWGI
EKFREILEQEYLNRRLIDGPAPAPVKHTIDHVGVQKIKNGLNAVGVAPIA
GRVSGTTLSAVADLMEQVGSDRARWTPFQKLVILDVPDDKVDELVTGLDA
LGLPSRPSSWRKNTMACTGIEFCKLSFAETRVRTQTLVPELERRLADVDA
QLDAPISVHLNGCPNSCARIQVADIGFKGQWIDNGDGTSVEGFQVHLGGG
LGEQSGFGRKLRQHKVTSEELGDYIDRVTRKYLEGRNDGETFASWALRAD
EEELR
>MAP3703 nirD, NirD
MSGCCSTTVSQAALFRLDDGTVRAVGNVDPFSGAAVLSRGIVGDRNGCPT
VQSPILKQAFSLEDGICLDDPSVSVPVYPVRITADSYVQVGRDYQPRAA
>MAP0869c nramp, Nramp
MFDYPHFLIRLWESRSTLAQRTQGSLKGNWYLLGPAFVAAIAYVDPGNVA
ANVSAGSQYGYLLLWVIVAANVLAGLVQYLSAKLGLVTGRSLPATIGKRM
SRPARLVYWVQAELVAIATDAAEVVGGAIALHILFGLPLLAGGLITGVVA
LLLLGIQDRRGQILFERVITGLLLVIAIGFAASFFVKTAPPEAVLSGLLP
RFRGTESVLLAAAILGATVMPHAVYMHSGLVLDRHGHPDEGPHRRLLLRV
TRWDVVLAMAVAGTVNAAMLLIAATNLQHRDVSASIEGAYAAIHNTLGAT
IAVMFAVGLLASGLASSSVGAYAGAMIMQGLLHRSIPMVVRRLITLCPAL
LILAVGYDTTRALVLSQVVLSFGIPFAVLPLIKLTSDRELMGSDANHRIT
TILGWGVGILISLLNMVLIWLTVTG
>MAP3212 nuoL, NuoL
MTHYTPLLVALPLAGAAILLFGGRRTDRWGHWLGCATAVAAFVVGVGLLD
ELLGRPADQRAIHERVFSWIQVGQLQVDLGLQIDQLSVCFVLLITGVGSL
IHIYSVAYMAEDADRRRFFGYLNLFLASMLLLVIADNYVVLYVGWEGVGL
ASYLLIGFWYHKPTAATAAKKAFVMNRVGDAGLALGMFLMFSTFGTLSYA
GVFAGAPAAGRGALTAMGLLLLLGACAKSAQVPLQAWLGDAMEGPTPVSA
LIHAATMVTAGVYLIVRSNPLYNLSPDAQLAVVIVGAVTLLLGAFIGCAK
DDIKRALAASTMSQIGYMVLAAGLGPTGYAFAIMHLLTHGFFKAGLFLGS
GAVIHAMHEEQDMRRYGGLRAALPVTFVTFGLGYLAIIGVPPFAGFFSKD
AIIEAALAAGGGRGYLLGGAALLGAGVTAFYMTRVMLMTFFGEKRWAPGS
HPHEAPGLMTWPMILLAIGSLFSGGLFAVGGTLQRWLEPVVGRHEEVTHA
VPVWISTALALGVVAIGIAVAYRLYATAPIPRVAPLSVSPLTTAVRNDFY
GDAFNEEVFMRPGAQLTHALVEVDDAGVDGSVNALAALVSATSNRLRGLQ
TGFARNYALSMLTGAVLVIALILAVRLW
>MAP2488 oppB, OppB
MTRFLARRLLNYLVLLALASFLTFCLTSVAFKPLDSLLQRSPRPPQAVID
AKAHSLGLDEPIPIRYAHWASHAVRGDFGKTVTGQPVGASLGRRVGVTLR
LLVIGSLIGTVAGIAAGAWGAIRQYRLSDRVVTMLALLVLSTPTFVIASL
LILAALRVNWALGVQVFDYTGETSPGVTGGAAALVDRLRHLVLPSLTLAL
AAAAGYSRYQRNAMLDVLGQDFIRTARAKGLTRRRALVKHGLRTALIPLA
TLFAYGVAGLVTGAVFVEKIFGWHGMGEWLVQGVATQDTNIVAAITLFSG
AVVLLAGLLSDVFYAALDPRVRVS
>MAP2489 oppC, OppC
MAGFASRRTLVLRRFGRNRLAVASLTLLVLLFVGCYTLPAVLPYSYQDLD
FDALLQPPNARHWLGTNALGQDLLAQILRGMQKSMLIGVCVAVISTGIAA
TVGSIAGYFGGWRDRVLMWLVDLLLVVPSFILIAIVTPRTKNSANILMLV
LLLAGFGWMVSSRMVRGMTMSLREREFIRAARYMGVSSRRIIVGHVVPNV
ASILIIDAALNVASAILAETGLSFLGFGVQPPDVSLGTLIANGTQSATTF
PWVFLFPAGVLVLILVCANLTGDGLRDALDPGSGPARGGRR
>MAP0872 phoS2, PhoS2_2
MKLNRFGAVLSVLSAGALVLSGCGSDNNGAGAGAGGSSSSKVSCGGKKAL
KASGSTAQANAMTRFVNAFEQACPGQTLNYTANGSGAGISEFNGKQTDFG
GSDSPLAPSEYAAAQQRCGSPAWNLPVVFGPIAITYNVAGLNSLNLDGAT
AAKIFNGAITTWNDPGIQALNPGVALPAEPIHVVFRNDESGTTDNFQKYL
DAAADGAWGKGAGKTFKGGVGEGAKGNDGTSAAIKATEGSITYNEWSFAQ
AQKLNMAKIITSAGPDAVAISADSVGKTIAGAKISGQGNDLVLDTLSFYK
PTQAGSYPIVLATYEIVCSKYPDPQVGTAVKAFLQSTVGAGQNGLADNGY
IPIPDAFKSRLSAAINAIT
>MAP0651 phoS2, PhoS2_1
MRLDRQGGALAAAALTGCGSDENHRGTAAPSISGTTGTAGCGGKNKLTAE
GSTAQENAITMFNQVWGQYCPGKGLAYNPTGSGAGREQFIAGHVDFAGAD
SPLVADQIGPAAQRCGGNPAWDLPLVFGPIAITYNLPGNPALVVSSDAVA
KIFTGKITNWNDPILAALNPGVALPDTKITPIYRTDSSGTTDNVQKYLTA
AAPQSWAKGVGTEFQGGVGEGAAKSAGVIQAVRATTGAVGYVEKGFADQA
GMPYAKIATRGGVVPLTNETAGNAVNAAKFLSEGDDLVLDLHAMYASQEP
GVYPLLLVTYEIVCSKGYDPETLAAVKSFLGVAATSGQNGLSTAGYIPLP
DKVRQRLVTAINALQ
>MAP4172c phoS2, PhoS2_3
MRTRSAVIAGVLMATTLVVSACGETPASLPYTAGAKVDCGGKQTLSASGS
TAQANAMTRFIAAYRTACPGQTLDYTANGSGNGIGDFLAGRTDFAGSDTP
LSGDQYAAAKRRCGGADAWNLPVVFGPLAITYNLAAVDSLVLDAPTLAKI
FNGTITRWDDPALALLNASMPAEDIRVVYRSDGSGTTDNFQAYLQSAAGG
VWNKGAGKTFNGGVGTGAVGNTGTAAAVKSTEGAISYNELSFALQQGLFA
AEIKTPASRRSLRPVRIGTDIFGKTIKGARIVGTGNDLVLDLSSFYNPAQ
PDVYPIVLATYEIVCSKYPGFDVAKAVKAFLQAAIGPGQVELARTGYIPL
SADFQAKVSGAVDAISSPQAPNPD
>MAP0654 phoT, PhoT
MAKRLDLKGVNIYYGSFHAVAEVTLSVLPRSVTAFIGPSGCGKTTVLRTL
NRMHEVVPGGRVEGSLLLDDEDIYGPGVDPVGVRRAIGMVFQRPNPFPAM
SIRDNVVAGLKLQGVRNRKVLDETAEYSLRGANLWDEVKDRLDRPGGGLS
GGQQQRLCIARAIAVQPDVLLMDEPCSALDPISTMAIEELISELKQDYTI
VIVTHNMQQAARVSDYTAFFNLEAVGKPGRLIEVDDTEKIFSNPSQKATE
DYISGRFG
>MAP0132c phoY2, PhoY2_1
MRSGFHRRLCLLNARLAEMCAMAADAIAQATHALLDADLLTAEGVITRQH
SIAALGLQAEETAFALLALQAPVATDLRAVVSALRIAADAQRMVELAVHV
AEIARRRHPDSAVPAEVRPIIAAMGEAAEALAAGAREVLLSQDPRRAAQI
RRDDDTMDELHRRLLSVLMDPAWTPGVAAAVDATLLGRFYERFADHAVEI
ARRVIFQATGR
>MAP0655c phoY2, PhoY2_2
MRTAYHEQLSELSERLGEMCGLAGVAMERATQALLQADLVLAEQVISDHE
AIAAMSARAEETAFVLLALQAPVAGDLRAIVSAIQMVADIDRMGALALHV
AKIARRRHPQHALPEEVNGYFAEMGSIAVELGNSAQEVVLSRDPEKAARI
REEDDAMDDLHRHLFSVLMDREWKHGVAAAVDVTLLGRFYERFADHAVEV
ARRVIFQATGRLPEEETKPASQ
>MAP4041c pitA, PitA
MPLPPESGAVNIQLFLLIIVVITALAFDFTNGFHDTGNAMATSIASGALK
PKTAVALSAVLNLVGAFMSTAVAATIAKGLIDSNIVTLELVFAGLVGGVV
WNLLTWLLGIPSSSSHALIGGIVGATIAAVGAHGVIWKGVISKAIIPAIV
SAILAILVGAVATWLVYRITRGVPKKRTEAGFRRGQIGSASLVSLAHGTN
DAQKTMGIIFLALISYGSVSKTAAMPPLWVIVSCALAMATGTYLGGWRII
RTLGKGLVEIQSPQGMAAESSSAAVILLSAHFGYALSTTQVCTGSVLGSG
LGKPGGEVRWGVAGRMGVAWLVTLPLAGLVGAVTYWIVHLIGGYPGAIIG
FALLVAVSATIYLRSRKVKVDHHNVNAEWKGDLTTGLEGADDHSPPPDAG
PPFGGPGDRYQSDDPTLKASAS
>MAP3022 ppk, Ppk
MMRHDRNVTEIDAETRPDENLWHSGDSAVGAPPAATPAAMTDLPEDRYLN
RELSWLDFNARVLALADDNSLPLLERAKFLAIFASNLDEFYMVRVAGLKR
RDEMGLSVRSADGLTPRKQLALIGEHTQRIATRHARVFLDSVRPALAEEG
IHIVTWADLDQAERDELSTYFTEQVFPVLTPLAVDPAHPFPFVSGLSLNL
AVMVRQTEDGGQHFARVKVPNNVDRFVELAAPRAGAEGENRGVVRFLPME
ELIAAFLPLLFPGMEIVEHHAFRITRNADMEVEEDRDEDLLQALERELAR
RRFGPPVRLEIADDMTEGMLELLLRELDVHPGDVIEVPGLLDLSSLWQIY
DLDRPALKDPAFVPDTHPAFADRESPKSIFATLREGDVLVHHPYDSFSTS
VQRFIQQAAADPNVLAIKQTLYRTSGDSPIVRALIEAAEAGKQAVALVEI
KARFDEQANIRWARALEQAGVHVVYGLVGLKTHCKTCLVVRREGSAIRRY
CHIGTGNYNSKTARLYEDVGLLTAAPDIGADLTDLFNSLTGYSRKVSYRN
LLVAPHGIRTGIIERVEREIAAHRERGQGRIRLKMNALVDEQVINSLYRA
SQAGVRVEVVVRGICALRPGVQGYSENIFVRSILGRFLEHSRIIHFRNIN
EFWIGSADMMHRNLDRRVEVLAQVKDPKLTAQLDELFESALDPSTRCWEL
GPDGQWTPSPQEGHTVRDHQVSLMERHRSP
>MAP0874 pstA1, PstA1_2
MTRAVDALDRPVKTEVFRPLSVRRRITNNAATIFFLGSFVVALVPLIWVL
SVVLERGWYAVTRSGWWTHSLHGVLPEQFAGGVYHALYGTVVQAGVAAAM
AVPLGLMTAVFLVEYGSGRLVRLTTFMVDVLAGVPSIVAALFIFSLWIAT
LGFQQSSFAVSLALVLLMLPVVVRSAEEMLRLVPDDLREASYALGVPQWK
TIVHIVFPIAMPGIVSGILLSIARVIGETAPVLVLVGYSRSINLDIFHGN
MASLPLLIYTELTNPEHAGFLRIWGAALTLIIIVAVINVIAAATRFLAGR
RR
>MAP0653 pstA1, PstA1_1
MTSMLDRPLKSRTFSPLSRRRRAANSVATVLVSLSLLVAVTPLVMVLCSV
VVKGFRAITSTVWWSHSQAGMTAFVTGGGAYHALVGTVLQGLVCAAISIP
IGLMVAIYLVEYGGGTPLGRLASFMVDILSGVPSIVAALFVYALCVATLG
LPRSEFAVSLALVLLMLPVIVRATEEMLRIVPVDLREASYALGITKWKTI
ARVVLPTGLSGIVTGILLAMARVMGETAPLLILVGYAQSMNFDIFSGFMG
TLPGMMYNQASAGAGINPIPTDRLWGAALTLILVIATINVVARVITKFLG
ARKS
>MAP0574 pstB, PstB
MFEFVDVVVERRGLRALDGLTAAIPGRGVTAVFGPSGSGKSTLLRLCNRL
ELPTSGRVSFYGSDIAGLDPLWLRRRVGMCFQRPTPFPGTVADNLRVADP
DADEARMRETLDRVALTGAWLDRDVLALSGGEAQRVCLARTLMARPRVLL
LDEPTSAVDAEAAEVIERAVRELAADGTPALWVTHDAAQVTRAADRVLRL
ERGRSLGLSQVGGGPDDGAVPR
>MAP0873 pstC2, PstC2_2
MTRGTATAQLPAPTTLNARVPRRGDRLFKAIAAAAGFTIVVAIALIAVFL
LLRAVPSLRVNHANFFTSAQSSTSDPHRLAFGIRDLLMVTVLSSLSALVL
AVPIAVGIAVFLTQYAPRRLARPFGAIVDLLAAVPSIIFGLWGIFVLAPQ
LEPVAAFLNRHLGWFFLFKTGNVSLAGGGTIFTAAVVLSVMILPIVTSVS
REVFRQTPHIQMEAAQALGATKWEVVRMTVLPFGRSGVIAASMLGLGRAL
GETVAVLIILRSAARPGHWSLFDGGYTFASKIASAAAEFSSPLPTGAYIS
AGFALFVLTFIVNALARAIAGGKVNG
>MAP0652 pstC2, PstC2_1
MVAAPFPEPSATPISPWGQGRPHAGDRIFRRLAQASGVLIVLVIAAIAVF
LLDRAVPALQRNRENFFGYGGNWVTTDTSAMHFGIASLLPVTVFVSLFAL
ILAMPVALGVAIFITHYAPRRAATPLAYAVDLLAAVPSIIYGAWGLYVLA
PQLRPVATWLNHSMGWCFLFADGNTSAAGGGTIFTGGIVLAVMILPIITA
VTREVFIQTPHDQIEAALALGATRWEVVRTVTLPFGRSGYISGGMLGLGR
ALGETVALLIILRGTQSAFGWSLFDGGSTFATKIAGAAAELDDRFKAGAY
IAAGLTLFVLTFVVDALARGAVAGVGRRAGP
>MAP3388c pstS, PstS
MVAPAVGGGRSCVSFARSGAVLSLLAAAALTLTGCGGDSKSSSGSGAHVD
CGGKKVLKDSGSTAQQNAIEQFVYAYVRACPGYTLDYNANGSGAGVTQFL
NNQTDLAGSDIPLDRTTGQTDRAAARCNSPAWDLPTVFGPIAVTYHLTGV
SGLKLDGPTVAKIFNGAITKWDDPALKAVNPGLNLPSTPIAVIFRSDKSG
TTANFQKYLDGASNGAWGKGTSEMFTGGVGQGASGNNGTSALLQNTEGSI
TYNEWSFAVGKQLSMASIITSAGPDAVPITKESVEKTIAGATFQGQGNDL
VLDTSSFYRPTQQGAYPIVLATYEIVCSKYPDAATGSAVKAFMQAAIGPG
QDGLDQYGSIPLPGSFTAKLSSAVNAIS
>MAP0187c sodA, SodA
MAEYTLPDLDWDYAALEPHISGQINEIHHTKHHATYVKGVNDALAKLEEA
RANEDHAAIFLNEKNLAFHLGGHVNHSIWWKNLSPDGGDKPTGELAAAID
DAFGSFDKFRAQFSAAANGLQGSGWAVLGYDTVGSRLLTFQLYDQQANVP
LGIIPLLQVDMWEHAFYLQYKNVKADYVKAFWNVVNWADVQKRYAAATSK
AQGLIFG
>MAP3921 sodC, SodC
MPKLLPPVVLAGCVVALGACSSPQHASSLPGTTPAVWTGSPSPSGAGAAE
AAPAAAPSITTHLKAPDGTQVATAKFEFSNGYATVTIETTANGVLTPGFH
GVHIHKVGKCEPSSVAPTGGAPGDFLSAGGHFQAPGHTGEPASGDLTSLQ
VRKDGSGTLVTTTDAFTMEDLLGGRKTAIIIHAGADNFANIPAERYNQTN
GTPGPDEMTMSTGDAGKRVACGVIGAG
>MAP3402 sseA, SseA
MPLPPDPHPSLQEYAHPERLVTADWLSANLGAPGLVIVESDEDVLLYDVG
HIPGAVKIDWHTDLNDPRVRDYIDGARFAELMDRKGISRDDTVVIYGDKS
NWWAAYALWVFTLFGHPDVRLLNGGRDLWLAERRETTLDVPTKTSTGYPV
VTRNDAPIRAFKDDVLAILGSQPLIDVRSPDEYTGKRTHMPEYPEEGVLR
GGHIPTARSIPWAKAVDESGRFRSRAELEELYGFLRPDDKTVVYCRIGER
SSHTWFVLTHLLGKPGVRNYDGSWTEWGNTVRTPIVAGEEPGQAPAGV
>MAP2046 sseB, SseB
MGARDQVLITATELADVIEAGDPVSILDVRWRLDEPDGRAAYLQGHLPDA
VYVSLEDELSDHTVSGRGRHPLPSGPSLQAAARRWGIRQDTPVVVYDDWN
RAGSARAWWLLRAAGLDNVRILDGGLAAWRATGGRLVSGPVEPVPGNVTV
PHGDLHSGNRPTVTTEQVAAGAATLIDARAPERYRGEMEPLDPVAGHIPG
AENLPSGEVLAADGTFLGDDALARVFAEHRIERHGAVAAYCGSGVTATVT
IAALAAVGRTAALYPGSWSEWCADPARPVERGGA
>MAP2213c subI, SubI
MDIRTAARWRPVLALVLTAGVVAGCHGGASDAVGGTGPADARTSITLVAY
SVPEPGWSKIIPAFNASDEGKGIQVVTSYGASGDQSRGVVDGKPADVVNF
SVEPDIARLVKAGKVAKDWNTDATKGIPFGSVVTLVVRKGNPKHIKDWDD
LLRPGVEVITPSPLSSGSAKWNLLAPYAVKSEGGAHGDAGVDFIRKLVTE
HVKLRPGSGREATDVFVQGSGDVLISYENEAIATERAGKPVEHLNLAQTF
KIDNPVAVVNTSPHLQAAVAFKNFQYTAAAQKVWAQAGFRPVDPAVAADF
RDQYPVPAKLWTIADLGGWSAADPQLFDKNTGSITKIYTQATG
>MAP3451 sugE, SugE
MPSATDTGRGAPYPASDKTAPWRCAMAWLILIASGVLEAVWATALSRSEG
FSRLGPSLVFFVALAFSMTGLAVAMRSLPVGTSYAVWVGVGAALTVTYAA
LVGDEPASPVKLVLIAGIVACVAGLKLLG
>MAP3449 sugI, SugI
MARGSRRGLLVGLTAASVGVIYGYDLSIIAGAQLFVTEDFGLSTRQQELL
TTMAVIGQIGGALFAGVLANAIGRQRSVLLILSGYAVFALLAAFSVGLPM
LLTARLLLGLTIGVTVVVVPVYVAESAPTAVRGALLTAYQLAIVSGLIVG
YLSGYLLADTHSWRWMLGLACVPAVLLLPLVFRMPDTARWYLLKGRVDDA
RRALLRVEPVARVDDELAEIDRAVSEEAASLPAMLAEMVRSPYRRATVFV
VVLGFLIQITGINAIIYYSPRIFEAMGFTGNFALLALPALVQVAGLVAVG
TALLLVDRVGRRPILLCGTAMMIVADVVLVAVFGRGPGGVIAGFAGVLLF
IFGYTMGFGSLGWVYASESFPSRLRSIGSSTMLTSNLVANAIVAAVFLTL
LHSLGGAGTFAVFAVLAVVAFAFVHRYAPETKGRQLEDIRHFWENGGRWD
>MAP2808 trkA, TrkA
MRVVVMGCGRVGSSVADGLSRIGHDVAVIDRDSTAFNRLSPEYAGERVLG
QGFDRDVLLRAGIEEADAFAAVSSGDNSNIISARLARETFGVKRVVARIY
DAKRAEVYERLGIPTIATVPWTTDRLLNALLRETQTAKWRDPTGTVAVSE
VVLHEDWIGHRVTDLEQATGARVAFLIRFGSGVLPEPKSVIQAGDQVYVA
AISGRAAEAAAIAALPPSEDL
>MAP2809 trkB, TrkB
MKVAVAGAGAVGRSVTRELLANGHDVTLIERNPDHVDVDAIPAAHWRLGD
ACELSLLESVQLQEFDVVVAATGDDKANVVLSLLAKTEFAVPRVVARVND
PRNEWLFTDAWGVDVAVSTPRMLASLIEEAVAVGDLVRLMEFRKGQANLV
EITLPDDTPWGGKPVRRLQLPRDAALVTILRGPRVIVPEEDEPLEGGDEL
LFVAVAEAEEELQKLLLG
>MAP2960c viuB, ViuB
MDVAGLPQPLTLDSFAELPAEKKPSVRTLTVRHVDAASRQIALDVVVHGE
HGIAGQWAATAQPGQPIYLMGPGGAYTPDPAADWHLLAGDESALPAIAAA
LEALPPSAVGKAFIEVAGHEDEIPLTAPDGVEVHWVYRGGRADLVPEDRA
GDHAPLIEAVTSAPWLPGQVHVFIHGEAQAVMHNLRPYVRKERGVDAKWA
ASISGYWRRGRTEETFRQWKKELAQAESAQA
>MAP2043 yjcE, YjcE
MFGLVLIVALVSTVIVGTVIGRRYRVGPPVLLIVLGVLLGLVPQFGHVRI
DGEIVLLLFLPAILYWEGLNISFREIRANARIIVFLSVALVIATAVAVSW
TARALGMDPHAAGVLGAVLSPTDAAAVAGLAKKLPRRSLTVLKAESLIND
GTALVLFAVSVHVAIGAPAISPPEVTLRFIGSYLGGIAAGLLVGGAVTLV
RKRIDAPQEEGALSLVTPFAAFLLAQSVECSGVVAVVVSALVLAYSGPVV
IRARSRLQSYAFWDIATFLLNGSLWVFVGVQIPGALRGIAGVDGGVRHAL
FVALVITGVVIVSRIFWGEFTTMLIRLIDRRAVQRERRVGWRQRFVTAWA
GFRGAVSLAAAVAVPMTTLSGAPFPDHSLLIFIVTVVILVTVLVQGSTLP
AVVRWARLPADVAHAEELQLARTRAARAALAALPAVADEVGVSDELRRRL
HKEYEEKAALVLATENGSPDNRILKTREKVRQVRLGVLEHKRREVTALRN
QNRIDDTVLRELQNEMDLEEVQLLAAAADEDDGDTE