GenColors

Gene list

Applied filters:

COG category: Function unknown
Gene type: CDS
Genomic element: chromosome

Number of genes found: 245

# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP1536 hypothetical protein
MADVDRALGGYDPNAGHSAHLAARPQRIPVPSLLRALLSEHLDPGYAAAA
AKRGAVDETRNRRPRVSGWLWQALAALLIATVFAAAVAQARSVAPGVRSA
QQLLLGNVRATEGSAAKLAQRRNELSAKVDDVQRHALADDAEGQRLLKRL
DALGLAAASTAVIGPGLKVTVTDPGAGPNLSDVSKQRVSGSRQIILDRDL
QLVVNSLWAGGAEAVSVGGVRIGPTVTIRQAGGAILVDNNPTSSPYTILA
IGPPHALRDAFDTSPGMQRLRLLQISYGVVVTVDVADGLTLPAGSVRDVK
FAKQIGPQ
>MAP3021 hypothetical protein
MSGKRADGGHDGAGDVALIIAVKRLAAAKTRLAPVFSARTRESVVLAMLT
DTLTAATRVPSLGSITVITPDEAAAAAAAGLGADVLADPTPEGHPDPLNN
AIATAERAVSGSFTNIVALQGDLPALQSQELAEAVAAARAHRRSFVADRL
ATGTAALFAFGTRLDPRFGSDSSARHRSSGAIELTGAWPGLRCDVDTPTD
LAAARRLGVGAATARAIAAH
>MAP0728 hypothetical protein
MADGRKGYAILTEAIKDPEGMKAYAKAAGSAMSGATVLAVDTAPTVIEGT
WHGDQTVVLEFESVDAARAWYESEGYQKAAKLRQAAADCNAVILAGF
>MAP0195c hypothetical protein
MPVRMDPQRFDELVSDALDLIPPELAAAMDNVVVLVDDRHPEEPDLLGLY
EGVALTERDSDYSGALPDAITIYRAALLDVCESEQQVIEEVAVTVIHEIA
HHFGIDDERLHQLGWA
>MAP0660 hypothetical protein
MTQDTSASRPLTSDVTGSDAAGLTEQSISARPADAGAAVAGGCPVSPLGY
EAPPAPLGPDSLTWKYFGDWRGMLQGPWAGSMQNMHPQLGAAVLDHSTFF
RERWPRLLRSLYPIGGVVFDGDRAPITGAEVRDYHTDIKGVDEQGRRYHA
LNPDVFYWAHSTFFVGTIHVAERFCGGITEEQKRQLFDEHVQWYRMYGMS
MRPVPKSWEEFQEYWDHMCRNVLENNEAARAVLDLTELPKPPFAQWIPNR
LWAAQRKLLAPFFVWVTVGLYDPPVRELMGYRWSARDEWLHRRFGDLVRI
VFALVPSRFRKHPRARAGLDRASGRIPLDAPLVQTPARNLPPEDERGNPK
HYCPMVS
>MAP1624 hypothetical protein
MSVLVAFSVTPLGVGEGVGEIVAEAVRVVRDSGLPNKTDSMFTVIEGETW
EEVMAVVQRAVEAVAARAPRVSTVIKADWRAGVSDAMTHKVASVERYLSD
G
>MAP2119 hypothetical protein
MGGAAGVGFGGLVGHRGGGRDRRAGGGRRAPEGSVVTGVSTVRVAPDETD
ETADATFTSTRPLDYLPGILLLIGVGVLGKYAQIWWNALAKHEHWTVPDI
EYVLWAIVIGLVITNTVGLHPIFRPGVLTYEFWLKAGIVALGSRFVLGDI
AKLGGISLVQILVDMTIAGTIIIAVARWFGLSGKLGSLLAIGTSICGVSA
IVAAKGAIRARNSDVSYAIAAILALGAVSLFVLPPLGHAIGLTDHEFGLW
AGLSVDNTAETTATGYLYSEHAGKIAVLVKSTRNALIGFVVLGFALFWAG
RGQADEIAPGVRAKAAFIWAKFPKFVLGFLVVSAIATAGWLTKGQTANLA
NVSKWAFLLTFAGVGLNTDIRQIARTGWRPLVVAVIGLTVVATVSLGIVL
LTSRVFGWGVTT
>MAP3065 hypothetical protein
MARQAGRVRAALAAVVADTAVTEAASLKDGRATLALPGGLRWVAVHQPDG
YHPPRRFVDSIGGDGLAALPARIAVRWRHIHEFEDVGGDRTRVIDRVETP
VPASLLRPMFDYRHRQLAGDLAAHRLAAEHGLRPLTVAITGSSGLVGSAL
TAFLRTGGHRVIRLVRRAARGGDERRWDPEDPDPGLCDGVDAVVHLAGAS
IAGRFTDRHREAVRDSRIGPTRRLAELLGRGRPRPAALISASAIGYYGYD
RGDETLTEDSDRGDGFLADVVADWERAITLALDAGVRVVQVRTGIVQSPR
GGTLKLMRPLFSAGLGGRLGDGRQWLSWIGIDDLIDVYHRALWDSRLNGP
VNAVAPQPVRNSEYTTALAAVLHRPAVLPVPSLGPRLLLGGQGARELAGA
SQRVAPAKLAAAGHRFRHPDIERALRHLLGRGG
>MAP2037 hypothetical protein
MSTLVLTAHGSRDPRSAANAEAVADRLRRMRPGLDVRLAFLELNAPNFVD
VLAGLPDSRRAVVAPLLLASAYHARLDIPEQIARAGARGIRQADVLGEDD
RLVAVLRGRLAEIGVSPLDDDLGVMVVAIGSSNIAANARTAKVASRLAAG
TRWACATTASRPGPRRRWPAPPTSCAVAARAGW
>MAP2448 hypothetical protein
MVMAVHLTRIYTRTGDDGTTGLSDFSRVSKNDPRLVAYADCDEANAAIGV
AVAVGRPGPELAGVLRQIQNDLFDAGADLSTPVVEDPEYPPLRVTQPYID
RLEKWCDTYNESLPKLNSFVLPGGSPLSALLHVARTVVRRAERSAWAAVD
AAPEGVSALPAKYLNRLSDLLFILSRVANPDGDVLWKPGGQQGGEPAPG
>MAP3293 hypothetical protein
MSTVEVMADLPFGFSSGEDPDKPGKKDPDSGSNPSDPFAAFGISGEFGMG
DLGQIFTQLGQMFSSAGSASAGGSDSGPVNYELARRVASNSIGFVAPIPA
TTNSAIADAVHLAETWLDGATALPAGTAKAVGWTPADWVDNTLETWKRLC
DPMAQQISTVWAASLPEEAKSMAGPLLQMMSQMGGMAFGSQLGQALGRLS
REVLTSTDIGLPLGPKGIAAILPDAVESFASGLERPRSEILTFLAAREAA
HHRLFSHVPWLASQLLGAVEAYAMGMQIDMSGIEELARDFNPASLSDPAA
IENLLGQGVFEPKATPAQTQALERLETLLALIEGWVQVVVAAALGDRIPG
AAALAETLRRRRASGGPAEQTFATLVGLELRPRKMREAAALWERLTEAAG
VDARDGIWQHPDLLPDADDLDDPAAFIDRVIGGDTSGIDEAIARLEQEGP
DSPGSGGDT
>MAP3616c hypothetical protein
MARSHPQHAAPNPNRNIKAVRTVRFWAAPLVITLALMSALCALYLGGILN
PTTNLRHFPIAVVNEDAGPGGAQIVDRLATGLDRNKFDIRVLSRDEAKHQ
LDTGRVYGSLLIPPSFSSKLRDFATSAVSPARPDKPSITVSTNPRAGTLG
ASIAGQTLNSAMGTANGIAAQRVMAEVTAQTGGAPLPGATQAGLSSPIEI
QSVVYNPLPNGTGNGLSAFYYALLLLLAGFTGSIVVSTLVDALLGYVPAE
FGPVYRFAEQVRISRFQTLLLKWAMMLLLGLLTSAVYLAIADGLGMPIDL
SWELWAYGVFAIAAVGITSSSLLSVLGTAGMLVSMLVFVIFGLPSAGATV
PLEATPPLFRWLAEFEPMHQVFLGTRSLLYFGGRGDAGLSQALTMTSAGL
VIGLLLGGIVTHVYDRKGFHRIPGAVEFAIAQDHQAQHQARRGKSTQQPD
SPAEPETPTEPESPDEPESPSAQT
>MAP1411 hypothetical protein
MNATANGEAQPQTGFQVRLTNFEGPFDLLLQLIFAHRLDVTEVALHQVTD
DFIAYTREIGPKLELEETTAFLVVAATLLDLKAARLLPAGQVDDDEDLAL
LEVRDLLFARLLQYRAFKHVAEMFAELEASALRSYPRAVSLEDRFTQLLP
EVMLGVDAERFAQIAAVAFSPRPVPTVSVGHLHEVKVSVPEQARKLLAIL
EARGSGQWATFSELVADCEGSMEVVGRFLALLELYRSRAVAFEQSEPLGV
LQISWTGERPVGETLVEVRDEL
>MAP4090c hypothetical protein
MDDQEAPDAPASRRAHILRLAVFAGFLAVVFYLVAVARVIDVGAIRAVVS
ATGPAAPLTYVVASALAGALFVPGSILAAGSGLLFGPLLGVFVTLGATVG
TATTASFVGRRAGRDSARALLGPARADRVDALIGRGGLWAVVGQRFVPGI
SDALASYAFGAFGVPLWQMAVGAFIGSAPRAFAYTALGASIGNRSSLLAY
AAVAVWCVSAIVGAFAAHRGYRHWRGRPKDEAR
>MAP1566 hypothetical protein
MQTTFDPELVVPDEARVTEFTGDNSLSRKDLSQHPIPPGSLTWKYWGRLD
VIFFGSGVVGTIAGAWPQMAKATSSSVLFTGDSSFGARSKIYKVRRQRSR
EYIYGTVYDAPEDAKKYGLKTRNMHKSIKGTLQDGTFHALNADTFYFGHV
TFFYHLLLKVVEQLYFDGAMPRAMKEQIFEESKEWYSMWGVDDSPQPATY
DDFERYLDNIERNHLVNSQVTQVMLEQFMERRVPPRWWPPVMKKFVWPWV
AGRRQVVVNSFPPHVQELFNLEWTPEDEEIARRFMRMYRRLYAILERVVP
LKFLYLPIAVEGFKREGVDPRKITLESAQQALRENRARRAARENASADET
NGVLASG
>MAP0870c hypothetical protein
MPIPVVQLGTGNVGIHALRALITDPQFELTGVWVSSDAKAGKDAAELAGL
AQPTGVLASTDLDAVLATGPRCAVYNALADNRLPEALDDYRRVLAAGINI
VGSGPVFLQYPWQVIPDELIKPLEEAAQQGNSSLYVNGIDPGFANDLLPL
ALAGTCQSIQQIRCMEIVDYATYDSAAVMFDVMGFGKPLDDTPMLLQPGV
LSPAWGSVVRQLAAGLGVSLDEVTQQHVRVPAPEDFDIASGHIAKGTAAA
LRFEVFGMVDGRPVVVLEHVTRLREDLCPDWPQPAQHGGSYRIEVTGEPS
YAMDICLSSRKGDHNHAGLVATAMRVVNAIPAVVAAPPGIVTTLDLPLIT
GRGLYRPE
>MAP0101 hypothetical protein
MVWQRWPTTGLAPPATSAAEYDTLISNLIATGVITDAGMSYFDVRPALRT
PTLELRVCDSCPRADTIVLITALFRALVEREIQGLRTGVPAAIVVPPLGR
AALWRAARSGLEGDLVDLIHPASRPAGDVVTDLVQMLRPQLEASGDWQAV
EGLARKALTQGSSAARQRRAMRTRNDLFDVVDHLIAETAAVAPGAHGTLA
TRRNGSDGG
>MAP3157 hypothetical protein
MFLPHQIIGQLINKQLDDNMRRYFFRGMEFAAPVGDPGWFGPDSAVWRVH
SHLPALIFGLQCAAFMETLDPSIYWMGMHHSRLIKRDSNGNPVSHVPVID
PEGAATRLGHSVAFFIGTAYGSPETAERLAKSVRAMHHTIKGTRPDGARY
DADDPEWLRWNYATVVWGIATAHELYHPMPLRGKALDRYYGEFVRVGHAL
GGTDLPATKAETLECLESYLPKLAVTHGKAMGTGPNVAMPQAAVDWAIRD
TMPKWAKQMLQHRDCNIIERTARRSAVWAIINGIHAASGPAPEFRQAQAR
VRGGTTVPHTVPSYVLGTDQVRSRAEVERSFQSV
>MAP3679 hypothetical protein
MPDLARRLRRTLWPIAQTTVAAGLAWYLAHDVLDHREPFFAPIAAAVCLW
MTNRVRAELAVEMIIGVALGIGAGTVVLAVFGAGPIGMGAAVMLSLTLAV
LIARSFSTQRSMFVNQSLISAILIMAFPHTGLGVERLYDALIGGGLAVVF
SILLFPKNPLAVLHDASGELLTALHDILAQICCRSADSASPDQGWALSAA
DRLQRCSASLIEARGTAGQLARVCPRRWPLRGAVGAADRQVVHVALFGSS
VLQLTRTLMSVRIRGGPHAEAFRAAVDDLRDADSALAAGHPATAAAHAES
ARRHSAALPAGSPASVAVVIDACIDELQQAIGPRRS
>MAP0455 hypothetical protein
MVQFDGLRPARLKVGIISAGRVGTALGVALERADHVVVACSAISHASRQR
AQRRLPDTPVATPPEVAAGAELLLLAVPDSELAGLVSGLAATEAVRAGTI
VAHTSGANGVGILAPLTRQGCIPLAIHPAMTFTGSDEDISRLPDTCFGIT
AADEVGYAIGQSLVLEMGGEPFSVAEDARVLYHAALAHASNHLVTVLSDA
LEALRAALRGSELLGQQTVDEQPGGIAERIVGPLARAALENTLQRGQAAL
TGPVARGDAAAVAGHLAALHRVAPELAQAYRVNALRTAQRAHAPQEVVEV
LAP
>MAP3958c hypothetical protein
MTAPRIPAGRFRQLGPINWVIAKLGARTVGAPEMHLFTTLGQRRLLFWTW
LAYGGRLLRGKLPTADTELVILRVAHLRGCEYELQHHRRMARTAGLDPDL
QAAIFAWPQRLEPVQARLTVRQQALLAATDEFVNDRTVSEATWRQLAEHL
DRRQLIEFCLLASQYDGLAATMSALAIPLDHPEGS
>MAP1076 hypothetical protein
MHTDVLDVDTSRRRIVDLTEAVRGFCWSRGDGLCNVFVPHATAGVAVIEM
GAGSDDDLVDTLERLLPRDDRYRHAHGSPGHGADHVLPALVAPSVTVPVS
AGEPMLGTWQSIVLVDLNRDNPQRSVRLSFLEG
>MAP1029 hypothetical protein
MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGERGDILNP
AMDLPEVHGHIAEIRRDEMAKAAEILGVEHTWLGFVDSGLPKGDPPPPLP
EGCFALVPLEEAIEALVRVVREFRPHVMTTYDETGGYPHPDHIRCHQVSV
GAYEAAGDYRRFPDAGEPWTVSKLYYNHGFLRARMQLLHDEAVKHGHEPP
FKKWLEHWDPAHDPFESRVTTRVECSAYFSQRDDALRAHTTQIDPDHDFF
AAPIAWQQRLWPTEEFELARSRVPVRLPEDDLCARAPGWPRRFTPR
>MAP2866c hypothetical protein
MSPGGGIVGAVSTRRLSVAQARRIAVAAQGFTEPRPAGAVTRAHLNRLIS
KIQVLQLDSVSVTVRAHYAPVFSRLGPYDREVLDRAAWGPRSSRLLVEYW
AHEAALMAVEDWPLLRWRMRQYRHGRWGTHIVQANPRLADAVVAAVAELG
PSTAGQIEAHLAAEPRRRKGAWWNRSDTKWVAEALFAAGVLTTATRVGFA
RHYDLVERVLPADVLARRVDDDEAIRELTLRAAGALGVGTEADIRDYFRL
SAAQVKPAIAELVAAGDLERVEVAGWPAPAYLRAGRAVPRTDRGTALLCP
FDPLIFFRPRVERLFEFHYRIEIYTPAAKRRYGYYVWPLLMDGRLAARVD
LKADRAEGTLRVLGAFAEPQAPRPRVAAVLAGELWSMASWLGLGGLSVAD
RGDLALALRAVA
>MAP1841 hypothetical protein
MWVGWLEFDVLLGDVRSLKQKRSVIRPVVAELQRKFSVSAAETGSHDLYR
RAGIGVATVSGDRGHAVDVLDAAERLVAAHPEFELLSVRRSLVRSDDLG
>MAP3557 hypothetical protein
MAHPEIKEPSAGHPITIEPTRGRVQVRVNGELIADSSAALELREATLPAV
QYIPFTDVAQDRLTRTDTSTYCPFKGEASYYSVTTSAGDTVDDVIWTYEQ
PYPAVAAIAGHAAFYPDKAEISISTD
>MAP1776c hypothetical protein
MERLDVVVGPNGAGKSTFIALTLAPLLPGSVVVNADEIARQRWPQDPASH
AYDAAKVAADTRAKLIDLGRSFIAETVFSHPSKLDLLHAARRAGFTVVLH
VLLIPEDLAVERVRHRVRAGGHDVPETKIRERHRRLWTPVAEAMTLADLA
TGYDNSRLRGPRVVARLSGGLTVGAVDWPAWAPETLRSRWPV
>MAP1230 hypothetical protein
MRAARTSQPADGPDLPRHPTRLPGDVRPVALFVLGMARSGTSALTRVLSL
CGSTLPAGMCGADGNNPRGYWEPRAAIMLNEAILRRHDSNWYDPTLRLQQ
EAAFGAAERAACIAEITAYLSTLPAAPLVVVKEPRITALPGLWFEAARRV
GFDVAAVIAVRHPQEVIASAAKYVSTSPELSSALWLKYNLLAERHTRGVQ
RVFVDYANLLDDWHREMKRIAGALEIELDTAEEGAIEEFLTADLRRQRHC
GPVTDLFGADWMSAVYAALRGAAHDDPLDTATLDRVFESYRASERDFRTA
FTDFQARTNSVVRRVFRPSMTMCSIRHGLCTPRIVSPIQMQSLPWWRRSR
GMDQYVICGVTGRTTLSDSTANHFSLGRTAICRLRCGNFCSARRLRIRRH
SSARRW
>MAP1628 hypothetical protein
MGPFVLRAALTGFALWIVTLFVHGLTFVGGDTKLQRVGIIFVVAVIFGLV
NAFIKPIVQILSIPLYILTLGLIHIVINALMLWITAWITEHTTHWGLQID
HFWWTAIWAAIVLSIVSWLLSLVIRRTAG
>MAP1891c hypothetical protein
MSTLHKVKAYFGMAPMDDYEDEYYDDRAPSRGFPRPRFDDGYGRYDGDDY
DDPRREPADCPPPAGYRGGYAEESRYGAVHPREFERPEMGRPRFGSWLRN
STRGALAMDPRRMAMMFEEGHPLSKITTLRPKDYSEARTIGERFRDGTPV
IMDLVSMDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLSPADVDVSPE
ERRRIAETGFYAYQ
>MAP0014 hypothetical protein
MVQTRQSPWRFGVPLVCLLAGLLLAATHGVSGGAEIRRSDAPRLVDLVRE
TQASVNRLSAQREQLAAKIDAAHGRSSDAALAAMLRRSAQLAGEAGMSPV
HGPGLVVTLQDAQRDANGRFPRDASPDDLVVHQQDIQAVLNALWSAGAEA
IQMQDQRIIATSVPRCVGNTLLLNGRTYSPPYTITAIGNAAAMQAALAAA
PLVTLYKQYAVRFGLGYQEEVRSDVQVVGHFEPDRLHFAQPNGPIGY
>MAP1030 hypothetical protein
MSGHSKWATTKHKKAVIDARRGKMFARLIKNIEVAARVGGGDPAGNPTLY
DAIQKAKKSSVPNENIERARKRGAGEEAGGADWQTITYEGYAPNGVAVLI
ECLTDNRNRAASEVRVAMTRNGGTMADPGSVSYLFSRKSVVTCEKNGLTE
DDILAAVLDAGAEEVEDLGDSFEIICEPTDLVAVRTALQDAGIDYDSAEA
GFQPSVTVPLNADGAQKVMRLVDALEDSDDVQDVWTNADIPDEILAQIEE
>MAP1612c hypothetical protein
MASDNASTIGPADGELLLHTGVTGRAARMGHRLTIAMTRWRAGVSWAGSR
PVRAELAVEVDSFEVLRGEGGVKGLSTAEKALVRSNALKSLNASRFPEIR
YTSDVIEHTGDGYRLTGTLQIRGAARNHVIDLGAEDLGEAWRLSVETTVR
QSDYGIKPFSLLMGSVQVADDVSLSFTAVHAKDRAEDR
>MAP3887c hypothetical protein
MRAARDRPLAPGAGVLVIATLYGGNDGINTLIPYADNAYHDARPELAYAP
QDVLHLDQQLGLNPALKGMAGLWNQRKLAVVRGAGYPKHDHSHFRSMDIW
QTASPDEPVSTGWIGRWLDATGDDPLRAVNIGAVLPPLAVGAKCTAAALS
PGGGAGKAGQFDAVTTALGDDDPDDTPAMAAVCKAYRAARTADATFASVK
PPAKQNNSLAAQLDVVAQAVKARVPARVYTVQLGGFDTHAGERGTQQRLL
QTFDEAVTGFVAQMAGRNVVLMAYSEFGRRVRANASQGTDHGTAGPVLIA
GAPVNGGFYGEQPSLTDLDNGDLKYTTDFRDIYHELLARTVGTDPAPSVG
AGRRDLGFLSA
>MAP2860 hypothetical protein
MPVVVVATMTVKPESVDTVRDILTRAVEEVHDEPGCQLYSLHQSGETFVF
VEQWADEEALKAHSTAPAIGKMFSAAGEHLDGAPDIKMLQPVPAGDPGKG
QLRP
>MAP2243c hypothetical protein
MSATQEAIDMATVAAGAAAAKLADDVVVIDVSAQLAITDCFVIASASNER
QVNAIVDEVEEKMRKAGYKPARREGAREGRWTLLDYRDIVVHIQHRDDRD
FYALDRLWSDCPVVPVNLDEDRQNPGDAGTP
>MAP3922 hypothetical protein
MTPASGWRAVSSVPARRGDAHIDFARSPRPTIGVEWEFALVDAQTRDLSN
EATAVIAEIGENPRVHKELLRNTVEVVSGICRTVPEAMEDLRQTLGPARR
IVRDRGMELFCAGAHPFAQWTTQKLTDAPRYAELIKRTQWWGRQMLIWGV
HVHVGISSPNKVMPIMTSLLNYYPHLLALSASSPWWTGVDTGYASNRAMM
FQQLPTAGLPFQFQTWAEFEGFVYDQKKTGIIDHVDEVRWDIRPSPHLGT
LEMRICDGVSNLHELAALVALTHCLVVDLDRRLEADESLPTMPPWHHQEN
KWRAARYGLDAVIILDADSNERLVTEDLDDVLNRLEPVARKLQCADELAA
VADIPRHGASYQRQRRVAEEHDGDLRAVVDALVAELEI
>MAP1390 hypothetical protein
MIDEALLNILVCPADRGPLVLVGQELLYNPRLRRAYRIEDGIPVLLMDEA
RDVDDEEHARLMAQVRPADPR
>MAP1149 hypothetical protein
MTTEVKDELSRLVVKSVSARRAEVTSLLRFAGGLHIVGGRVVVEAEVDLG
NVARRLRKDIFELYGYNAVVHVLSASGIRKSTRYVLRVANDGEALARQTG
LLDNRGRPVRGLPAQVVGGSIADAEAAWRGAFLAHGSLTEPGRSSALEVS
CPGPEAALALVGAARRLGVSAKAREVRGADRVVVRDGEAIGALLTRMGAQ
DTRLIWEERRMRREVRATANRLANFDDANLRRSARAAVAAAARVERALEI
LGDTVPDHLASAGKLRVEHRQASLEELGRLADPPMTKDAVAGRIRRLLSM
ADRKAKIEGIPDTESAVTPDLLEDA
>MAP3096 hypothetical protein
MSPTPFDRFPSSMRRARCTIWYMFTEDSVEIDAPPRLVWDVFTDVERWPE
WTASVTSLTGLDGPALAVGRRFAIKQPGMAKLVWQVTELIPGASWTWVQR
SPGARVAATHHVSARPGGGTLVRQQLDQRGALGALVGRLMAKKTKRFLAL
EARGLKARAEQLSRADGAHP
>MAP1066 hypothetical protein
MRGSAVPSLVRTAVRAGGKRLGAVWFNLLQTSLAAGLSWYLAHDVLDHPQ
PFFAPIAAAVSLSTSNVLRAQRAVQMMIGVTLGIGLGTVVQGMLGPGALP
IAVAAPVALGAAVFIGGGFIGHGMMFANQTVVSALLVLALYRGGAGPERI
FDALIGGAVAIVVAVLLFPADPRTVLGAARAGVLAVLHDVLSRAADVSSG
RRAAPPDWPLSAVDRVHEQLSGLLEARTTAWHVVAIAPRRWGLRDAVRAA
DHQAVHVALLAGSVLQLARAVAPGPGDRQGQPVSTVLLVLAAATALADRD
PAGACVYLGSARRHAARLRSGDGGDREPHVALADAVGACVDDLQRVIDLR
PG
>MAP1070c hypothetical protein
MGFADKTFGTGGPSAAAGGSYDADRLLAGYRAARAQQALFDLRQGPASGY
DEFVGPDGKVRPAWTELADAIGERGRAGLDRLRSVVRGLIDHDGITYTDV
EPGGRGQEPRPWQLDTLPIVLSAADWEPLEAGLLQRSRVLDAVLADLYGP
RSLLTEGVLPPELLFGHPGYLRAANGIEIPGRHQLFMHACDVSRRPDGGF
AVNADRTQAPSGAGYALADRRVVAHAIPDLYERIAPRPTTPFAQALRLAL
IDAAPDVAQDPVVVVLSPGIYSETAFDQAYLATLLGFPLVESADLVVRDG
MVWMRSLGTLKRVDVVLRRVDAEYCDPLDLRADSRLGVAGLVEAQHRGTV
TVVNTLGSGILENPGLQRFLPAMARHLLSETLLLPSAPVYWGGIDTERSH
LLANLASLLVKSTVGGKTLVGPALSSLQLAQLAARIETAPWQWIGQELPQ
FSSAPTDHAGVLSSAGVGIRLFTVAQRGGYAPMIGGIGYLLAAGPAAYTL
KSVAAKDVWVRPTERARAEAVGVPAVEPPAKTAAGTWAVSSPRVLSELFW
IGRYGERAESMARLLIVTRDRFHVYRHHQHSEESECVPVLMAALGRITGT
DTGTGAAAGGDAAETIAVAPSTLWSLTVDPQRPGSLVQSVEGLALAARAV
RDQMSNDTWMVLAGVERALALDSEPPDSLAEADALLTAAQTQTLAGMLTL
SGVAGESMVRDVGWTMMDIGKRIERGLWLTALLRATLAVVRGAAAEQTIL
ESTLVACESSVSYRRRTAGKVSVAAMAELMLFDAQNPRSLLYQVERLRAD
LKDLPGASGSSRPERLVDEIGTRLRRSHPAELERISDDGRRTELAELLDS
VHAELRSLAEVLTTTQLALPGGMQPLWGPDVRRVMPA
>MAP0356c hypothetical protein
MSEVVTGDAVVLDVQIAQLPVRALSALIDIAVIVVGYLLGLMLWAATLTQ
FDTALSNAILLIFTVLVIVGYPLILETATRGRSVGKIALGLRVVSDDGGP
ERFRQALFRALASLVEIWMLFGSPAVICSILSPKAKRIGDIFAGTVVVNE
RGPRLGPPPAMPPSLAWWASSLQLSGLSSGQAEVARQFLSRAAQLDPGLR
LQMAYRIAGDVVARIAPPPPGAPPELVLAAVLAERHRRELARLRPPAPWP
APGYPPAWPGSGPAPQWPAPGPANPGPPEGFSAGFTPPR
>MAP2087c hypothetical protein
MAAPVSLREDQLTRLVAVFPGSPSEARVAALIRRVCAQTSSLPPLPAPME
VGEPESETEAAVAEFAEQFSADVSAITHAQRSRLSKQLGDRTFGVVVQMY
IADFVLRVRAGLEALGVGSRYLGWLSGPISWDHGSDPSDLVFNDFLIVVA
RMRALDPVTSELVRLRGAAQHHCRLCNSLREGSALDAGGSETLYEEIERF
ESSGLLDERAKAALRYTDALIWTPAHLVADDVAEVRSRFSEAEAVELTFD
IMRNASNKVAVSLAADAPRVENGTQRYLIGADGQTVFS
>MAP2042c hypothetical protein
MKVRLLATVAALILSTVLGCHFESRNPPSTKTLQVPMNDVLTQSDISQNI
TLAVGNTLVVQLGSNYTTPYRWTPDAKIGDSAIVKQTSHEFVPPTSDALG
APGTEVWTFAALKPGSTTITTSYSSFVDKNAKPACTYTLSVTVR
>MAP3333c hypothetical protein
MCGRFAVTTDPAQLAQKIKAIDETTGAAPSDTAPNYNVAPTSTIATVVSR
HSEPEDEPTRRVRLMRWGLVPPWAKAGPDGAPETKGPMLINARADKVTSS
PAFRASAKSKRCLIPMDGYYEWRVNSDAAAGKKPRKTPFFMYSEDGEPLF
MAGLWSVWKPAKDAPPLLSCTIITTDAPGELAQIHDRMPLVMPERDWDRW
LDPDAPVDEELLTRPPDVHAIGMREVSTLVNNVRNNGPELLEPAKPEPEQ
ARLL
>MAP3812c hypothetical protein
MRTGVRCVLATVGVAACVVVTPAGVSLAAATQSHAFAIASVLPSSGEMVG
VAHPVVVTFRAPITDPAKRHAAEQTIDVKSTPAMSGKFEWLDNRVVQWVP
DRYWPAHSTIALTVGGVSTEIKTGPAVIGVASISEHTFTVSIDGVEAGPP
TSLPAPHHRPHFGEQGVMPASMGRPEFPTPVGSYTVLSKERAVTMDSSSV
GIPVDDPDGYLLTVNYAVRITNRGLFVHSAPWAVRSLGLENVSHGCISLS
PDDAEWYYDHVNVGDPVIVQD
>MAP2429c hypothetical protein
MVVMSAPTEPKSRPGTTGQRESAPEDVTASPWVTIVWDDPVNLMTYVTYV
FQKLFGYSEPHATKLMLQVHNEGKAVVSAGSREAMEVDVSKLHAAGLWAT
MQQDR
>MAP4174 hypothetical protein
MRSELADLPGGSFRMGSTSFYPEEAPIHTVTVEPFAIERHPVTNAQFAEF
VAATGYVTVAEQPIDPALYPGADPSQLQPGAMVFRPTPGPVDLRDWRQWW
HWVPGASWRHPFGPDSDVADRADHPVVQVAYPDAAAYARWAGRRLPTEAE
WEYAARAGATTTYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGTSPV
GVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVN
QTLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAELGSN
>MAP3796 hypothetical protein
MHRLLTSLCAAACVIVASVVLSPISAAAGAPWFANAVGNATQVVSVVSTG
GSNATMEIFQRTGTGWQSLRSGVPTHVGSAGMAPQAKSGVPATPMGVYSL
DSAFGTAPNPGTGLPYTQIVGPNYWWSGDDHSPTFNSMQVCPKAQCPFNT
AESENLQIPQYKHAVVMGVNKNKTPGGGAAFFFHTTDGKPTEGCVAVDDA
QLVSIMKWLRPGAVIAITK
>MAP1990 hypothetical protein
MRPVHPQLTARVEHHRSLPVLVWSFAEPRLCISSGPLGGGIGARDWLVNA
TVPLDYDRTDPHRHLVEIGAALGLAGTGCGLLTAVDVTRHHLGADGGVQV
TATVGLSSPAWAAAPDHHFRREAPHRVGTINIVVAAPVRLSEAALVNAVA
TATEAKAQALHEAGIRATGTASDAVVVHCPTDGAAEAFGGPRSTFGARIA
RAVHAAVLAGARSWMSGAAHRPPAGNPTPPRHFEPASAAGSAPRAAGRRA
GHRVGADGIEPPTAGV
>MAP3634 hypothetical protein
MSGGMPMSGWTRGTLFAALNAAVVSVVGLALVLSAGPALADPDPAPADPG
AVAAPPGPPAPPDPLAPPPPPDPLAPPPPAAPPAPWLPPAAQPAAAPAAG
QDPTPFTGTPPFGPPTFVPKTGSTVGVAQPIIINFPGRVDDAGAAISAVH
VSSVPPVPGKFYWMTPTQLRWRPLSFWPAHTAVTVDAGGTVTNFQTGDTL
VATADDATHQLTVTRNGTVEKTFPMSMGMTAGNHQTPNGTYYVQDKKASV
VMDSSTYGVPVNSTYGYKVTVEDAVRFDNVGDYVHSAPWSVDDQGKRDVS
HGCINISPANAKWFFDNFGPGDPIIVKNSSGGDYKKNDGSADWMN
>MAP3705c hypothetical protein
MALILVAHGTRRPGGVAMIEGLAAQVSTLVGGRVEVAFVDVVGPTPSEVL
AAARAAGRPAIVVPAFLSRGYHVRADLPAHVALSGHPNVTVTPALGPSGQ
IARIVGDQLLECGWRPNDSVVLAAAGTSDDKARADLHTAATWLSALTGSR
VTLGFAATGDPPLGEAVARARPHARRNGGRVVVASYLLADGLFQQRLHGC
GADLVSAPLSTHPGLARLIANRFRRALPPVLAATARHASRRTGPHQRAHA
PATRPVP
>MAP2781 hypothetical protein
MRTFAEVMTFDPPDDASPCASGLIDFVFGEVWSRPGLSRRDRRFVTLACV
AAADAQAPLEQHVYAALNSGDLSITEMREAVLHFAVYAGWPKASRFNIVV
DQQWARIAAERGEPAPDPEPLLPLVTAADPEQRLRGGERSFKDINCLPFA
PPRDNPYSGAGILNFVFGEMWLRPGLGMKERRLITVACVAFQDAEIPIMS
HVYAALKSGDVSFREMDEVALQFAAYYGCAKADHLNQAIAEQKQRVIAEN
HSDATPVG
>MAP3364c hypothetical protein
MALQTDTFCLDRRRRRRYCQPPTWCDSRGGFQMTTVDLQQPSDPAAWRDK
KRRLWLMGLIAPTALFVMLPLVWALNRLGWHAAAQVPLWIGPILLYVLLP
ILDLRFGPDGQNPPDEVMERLENDKYYRYCTYVYIPFQYLSVILGAYLFT
ATDLSWLGFHGGLGWAGKLGLALSVGVLGGVGINTAHEMGHKKDSLERWL
SKITLAQTCYGHFYIEHNRGHHVRVATPEDPASARFGETFWEFLPRSVFG
SLRSALRLEAQRLRRLGKNPWNPLTYPSNDVLNAWAMSIVLWGVLIAVFG
PGLIPFVVIQAVFGFSLLEAVNYLEHYGLLRRKIDSPSGKARYERCTPEH
SWNSDHVVTNLFLYHLQRHSDHHANPTRRYQTLRSLDGSPNLPSGYASMI
SLTYFPPLWRKVMDHRVLAHYGGDITRVNVSPRRRAKLLARYPAVTA
>MAP3150c hypothetical protein
MSAGRRIQVARVYDKVGPDEGQRVLVDRIWPRGVRKDDPRVGIWCKDVAP
SKQLREWYHHEPERFDEFTSRYKSELRGNPALDELRTLAKRGSVTLVTAT
RDLDISQAVVLAELLKSG
>MAP2551 hypothetical protein
MSKSTAVRRLYTPRTSRRYSPRLDPETVGQITESIARFFGTGRYLLLQTI
VVAVWIVVNVFAVRLRWDPYPFILLNLAFSTQAAYAAPLILLAQNRQENR
DRVALEEDRRRAAQTKADTEYLARELAAVRLAVGEVVTREYLRHELDDLR
ELLAELRPQSADGEPVSDADNREQTAKKSR
>MAP3075 hypothetical protein
MTAGATDRRRRILPAPTGLPDAGALPSRPRVAVVGGGIAGLTAATGLAER
GVAVEVIEREHYLGGRVGGWTEHHDGTDLAMNRGFHAFFRQYYNLRALLV
RLDPRLRMLVPVNDYPLIDAAGRRDSFRGLPRTPPLNAMAFAVRSPTFRL
RDFARIDARAAAPLAAVSVPGTYRRLDHIDAATFLEDIRFPEAARHLAFE
VFSRSFFADPAKLSAAELATMFHIYFLGSAEGLIFDVPSANYDSALWQPL
RSYLEQRHVRFRLGTSVGSIDAANRFAVHTDTDEELEVDAVVLATDVAGL
QRIVAASRGLANGEWRDRIAGLRTAPPFAVHRFWLDRPVSPRRPAFLGTA
GHKPLDNISVLDRYEREARAWAGAHHGSVVELHSYALDSAPSGAAALREL
RRIYPETAAAQIVHETLLHRSDCPLFAPGGYPHRPTVVTPTPGLLLAGDA
VRIDLPVALMERAATTGWCAANHLLKRWGLAGHPLVTVPTEGRSRLLRWL
ANREGATRS
>MAP0100 hypothetical protein
MTLSTMPRTVGIEEEFHLVDLTTRRLAPRAPELLGLLSDGYVAELQSCVV
ETNGSVVSTLAELRADLTERRRVLVDTAATLGLGVVAAGAVPLSVPSEMH
VTQTSRYQQILADYQLLAREQLICGTQIHVGIDDRDECPGGRSGSRLCSH
SACLEREFAVLVRRI
>MAP3154 hypothetical protein
MSVAPETTVALQDRFFRELPELAVRWQAETFPELRLLVLNEPLATQLGLD
TGWLRGPDGLRFLTGNLVPTGAAPVAQAYSGHQFGGFVPRLGDGRALLLG
ELVDNKGRLRDIHLKGSGATPFARGGDGLAAVGPMLREYVVSEAMHALGV
PTTRSLAVVGTGRPVYREATLPGAVLARVASSHLRVGSFQYAAATGNRDL
LRRLADHAIARHHPGAADAEQPYLALFEAVVAAQASLIAQWMLIGFVHGV
MNTDNMTISGETIDYGPCAFMEAYDPDTVFSSIDFWGRYAYGNQPVIAGW
NLARFAETLLPLFSENTEEAIALAERSFGVFQTRYDAVWATGMRAKLGLP
AQVDAEFAAALIDELLALLKANHVDYTSFFRQLGRAARGDDRSAAEPARE
MFMDLPGFDAWLARWRALGPDADAMDRVNPIYIPRNHLVEEALAAATDGD
LDPLDQLLAAVTAPYTERPGFERYASPAPEDFGKYQTFCGT
>MAP3576 hypothetical protein
MAIRVAHVGTGNVGGLALAELITNPRYELTGVCVSTPEKVGKDAGELCGV
GLDTGVVTGVAAVGDLDAVIAAKPECVVYCAMGDTRLPEAMADVMRILAA
GINVVGSSPGLLQYPWGVMPEKYIARVEDAARQGNSSIFISGVDPGFAND
LIPLALAGTCQRVEQVRCMEIHDYASYNGAEVMGYMGFGRPLDEIPMLLQ
PGVLSIAWGTAIRQLAAGLGIEVDEITESYQREPAPEDFDIAVGRVAKGT
LAVLQFEIRGMVNGHPAIVIEHVTRLRPDLRPDLPQPAAGDGSYRVEITG
EPSYAVDIVPSSRKGDHNHAAIAGAAGRIVNAIPAVIAAPPGIRTTLDLP
LVTGKGLYAPSTLVTT
>MAP3468c hypothetical protein
MTAWVDREFERHDFTDEDLVGLSTERVVFTECNFSGANLAESRHRASAFR
NCTFRRTSLWHSTFEQCTMLGSVFEQCRLRPVTFDEVDFTLAVLGGNDLR
GGAADPALWTTASLAGARVDVDQAVAFALARGLRLDG
>MAP3375 hypothetical protein
MSPAGEHGTAAPIEILPVAGLPEFRPGDDLGAAVAKAAPWLRDGDVVVVT
SKAVSKCEGRLVPAPADPEERDRLRRKLVDEEAVRVLARKGRTLITENRH
GLVQAAAGVDGSNVGRDELALLPLDPDASAAALRARLRELLGVEVAVLVT
DTMGRAWRNGQTDAAVGAAGLAVLHGYSGAVDQHGNELLVTEVAIADEIA
AAADLVKGKLTAMPVAVVRGLSVTDDGSTARQLLRPGTEDLFWLGTAEAI
ELGRRQAQLLRRSVRRFSAEPVPAELVREAVAEALTAPAPHHTRPVRFVW
LQTPAVRTRLLDAMKDKWRADLAGDGRPAESIERRVARGQILYDAPEVVI
PMLVPDGAHSYPDAARTDAEHTMFTVAVGAAVQALLVGLAVRGLGSCWIG
STIFAADLVRTELGLPADWEPLGAIAIGYAAEPAGPRGPADPGDLLIRK
>MAP1982c hypothetical protein
MSVKLADVIAVLDQAYPPWLAEPWDSVGLVCGDPDEPVESVTVAVDATPA
VVDEVPAGGLLLAHHPLLLRGVDTVAASTPKGALVHRLIRSGRSLFTAHT
NADSASPGVSDALAEALGLTVEAVLEPARPASDLDKWVIYVPGENADAVR
EAVFAAGAGHIGDYSHCSWSVTGIGQFLPHEGASPALGSVGQVERVAEER
VEVVAPARARAAVLTAMRAAHPYEEPAFDIFALLPPPGDAGLGRIGRLPQ
PEPLRRFVSRVDAALPATSWGVRAAGDPELAVSRVAVCGGAGDSLLAAAA
GAGVQAYLTADLRHHPADEHRRASEVALIDVAHWASEFPWCAQAADLLRS
RFGERLDVRVSAIRTDPWNVGHDGGEG
>MAP3153 hypothetical protein
MADMIVKSPDEVVAFLKAQHNLIEDMFDQVLHATDPKAREEPFATLRQLL
AVHETAEEMLVHPPARKEADAGDAVVDARLHEEHSAKELLSAIEKLDSTT
DQFLDEVTKLREAVLEHATREENEEFPALQRLDSDDLKRMGTAVRAAEAI
APTHPDPGVESATLNFAVGPFASMLDRARDLIGRAIG
>MAP0286 hypothetical protein
MMTAEFLEHQRSIGNDLLTPVPEYRFPGLLPGDRWCVTALNWLRAHHDGC
AAPVVLASTHESTLEVVPLEALQEHKIDVPDDLANL
>MAP0630c hypothetical protein
MNNLYRDLAPVTEAAWGEIELEASRTFKRHVAGRRVVDVSEPGGPAAAAV
STGRLIDVEAPTNGVVAHLRASKPLVRLRVPFTLSRYEIDNVERGANDSD
WDPVKEAAKKLAFVEDRAIFEGYAAASIDGIRSASSNKPLALPADPREIP
DVITQAISELRLAGVDGPYSVLLSADVYTKVSETTEHGYPILEHIDRLVP
GDIIWAPAIDGAFVLTTRGGDFDLQLGTDVSIGYTSHDADTVQLYLQETL
TFLCYTAEAAVPLTS
>MAP0281 hypothetical protein
MTRKAEIVAVFAICTAFMTASGAFGGFAARADDPEILYNGINQLRQACGP
IAEDPRLTEAAQQHADDMLRNGVSGHIGSDGSSPQARIAAAGYRSRYSGE
IVFWATGSAATPSEALDMWMQSPPHRAIILNCGFNAGGFATARDGNKMTA
VGDFATS
>MAP3571 hypothetical protein
MQKVQAAEDAWNTRDPDHVSLAYTPDSRWRNRDEYIVGRDQIVAFLTRKW
QRELGYSLRKSLWDFHDNRIAVRFQYECRDRSGQWYRSYGNELWEFTESG
LMARREASINDVPIDESQRRYFGPRPASEHGREIPLW
>MAP2223c hypothetical protein
MSLSNQLEDTGRGFRAARSERIFGGYNVASDAYDMAFDEMFDAAGAVRGP
YKGIYAELAPSDASELKARAEALSRAFLDQGITFSLSGQERPFPLDLVPR
VISAAEWARLERGITQRVKALEMYLDDIYGDQEILNDGVIPRRLVTSCEH
FHRQAVGIVPPNGVRIHVAGIDLIRDEEGNFRVLEDNLRSPSGVSYVMEN
RRTMARVFPNLFATHRVRAVDDYAAHLLRALRNSAATNEADPTVVVLTPG
VYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR
IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTY
VPTMIEYYLGEKPLLANVETLRCWLDDEREEVLDRIDELVLKPVEGSGGY
GIVFGPEASDKELAAVAKKIRDDPRSWIAQPMMELSTVPTQVGSSLAPRY
VDLRPFAVNDGEDVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLASR
ASSGDHELGAAEVVRSLPTAMPDPLVDDAPRSMAQQPQPTGPPQREQLEQ
QQQQRAGR
>MAP2073c hypothetical protein
MSSTPHPAEPHIGSVSSKLNWLRAGVLGANDGIVSTAGIVVGVAAATALR
APILTAGSAGLVAGAVSMALGEYVSVSTQRDTEKALLIQEHQELRDDPAA
ELDELAALYEAKGLTAATARTVAEELTDQNPLLAHAEVELGINPEELTNP
WHAASSSALSFAIGALLPLIAILLPPPTWRIPVTVVAVLIALVITGAVSA
RLGGAPQLRAVARNAIGGSLALAVTYTIGHVVGAAID
>MAP3586c hypothetical protein
MRAMGGRPMSLVAGRGPLSSDPAGRFSPPIPAEVVYVEPHPRRVQAVKDG
RSVIDTERALMVHRRGRPLSYVFPADEVAGLPGEPEPEAPGFVHVPWDAV
DTWWEEGRKLVHYPPNPYHRVDCRGTRRRLRVRVGGTTLVDTDHTTIVFE
TALPPRLYVDPAHVRTDLLRRSETTSYCNYKGFATYWSLVDGDRVVDDVG
WCYPDPPPESLPIKGFLSFDETRVELLAELPVSARS
>MAP1537 hypothetical protein
MIGIAALAIGIVLGLVFHPSVPEVVQPYLPIAVVAALDAVFGGLRAYLER
IFDPKVFVISFVFNVFVAALIVYVGDQLGVGTQLSTAIIVVLGIRIFGNA
AALRRRLFGA
>MAP2082 hypothetical protein
MNLGFLDFTSIEQRRADCDEELRLNRRFSPQVYLGVVDITEQNGHYRVGG
EAGSGEPAVWMRRLPEDGMLPAKLAGGDVDTRLARRIGRTLAKLHGRAET
GPDIEAYGSPSSVIANWQENFDQMGPFIGRTISPAINDEIRSYVQEFVGQ
QAALLERRVTEGHVRDGHGDLHAASVCIADGQIVLFDSLQFAPRYRCADL
ASEVAFLAMDFEYHGRGDLAWAFVDSYVRASGDDELPSLLDFYMCYRAYV
RGKVRSLRLAQTEKVPGGEQEALIAESRGYFDLAWAHAGGLPRPLMVVTM
GLPASGKTTLARALAGRLGLVHLSSDVARKRMAGIPPTRRGSDEFGSGLY
DPAMTRNTYAALRRDAARWLRRGRGVVVDATFGNPGERAQLRQLAHRLGV
DLHVVLCDADDDTLIARLKRRATEQGVVSDARIELWPQLRAAFTPPDEQA
SVLRVDATRDTEETVEQALGLLRARYRSS
>MAP3239 hypothetical protein
MPAGSRARFAAKPLTHSGTPVPLWLAAFQEAPLPALDLAGCPGLVVVAAH
PDDETLGVGAMITQLVAIGVRVQVVCVSDGGARPGSAASERLRAQTIRRF
ELRSATNTLNAPPPLSLGLPGGELTAHEDRLTGALTEILRAAGPGAWCAA
PWRGDGHPDHEAVGHAAWAACAHTGAALLEYPVWMWHWAVPGDPAVPWER
AHAVPAPAWAVSRKRLAAQRYRSRFEPTGGSAPALPGFVLARLLAVGEVV
FR
>MAP0605 hypothetical protein
MMDERRRKGLEKMNEVYGWEMPNVEGDAYFDLTVDHLFGSIWTRPGLSMR
DKRIMTLTAVTAIGNRDLAEIQINAALLNGELTETELKEMAVFLTHYLGF
PLGSALNGAVDAVVAKRRKAAAKGAGEDKKANVDAALKMHSGGERG
>MAP1704c hypothetical protein
MTASNPTVWLTLQAHNVPKLIDYYVETFGFVLTARYGDGETVDHAQLNWP
EGSGGIMLGSHKPGAEWCREPGTAGGYVVTADPDALYRRVRQHNADIIRP
LAETDYGAREFTVRDPEGNLWSFGDYGGANPPN
>MAP4077c hypothetical protein
MRVDGRDIIVSGSLLQPLTRRTNDILRLGMALIFLAVVITGSVITRPQWI
ALEKSVSQIVGVLSPTQSDVVYLVYGLAIVALPFMILIGLIAAGQWKLLG
AYAAAGLSAIVLLSISGTGLAAPRWHFDVLDRLTTLPAQLLDDPRWIGML
AAVLTVSGPWLPARWRRWWWALLLAFVPIHLVVSAIVPARSLVGLAVGWV
VGALVVLVVGTPALEVPLDAAVRALAKAGFEVCRLTVVRPAGRGPLILSA
DGQDADHTALIELYGPHQRSGGALRQLWGKLKLRDAETAPLLTSMRRAVE
HRALMAIAIGEAGLANTATVAVATLDRGWMLYSHKPPRGTPIDRCAKTTP
VQRLWEALRVLNDHQIAHGDLRAHHITVDDGAVLFGGFGSAEYGATEAQL
QSDIAQLLVTTSAHYDPKSAVRAAIDVFGADTILSASRRLTKVAVPKSVR
RSAPDSGAVISGARAEVKRQTGADQIKPQTITRFTRSQIIQLVLFGALVY
VAYPFISTAPTFFSQLRTADWWWALLGLLVSALTYVGAAAALWACADGMV
NFWMLSIAQVANTFAATTTPAGVGGLALSTRFLQKSGLSAMRATAAVALQ
QSVQVIAHLALLVLFSAAAGASMNLSHFVPSATMLYLIAGVALGIVGTFL
FVPTLRRWLATEVRPKLDEVVSDLAKLAREPRRLALILLGCAGTTLGAAL
ALWASIQAFGGDTTFVAVTVVTMVGGTLASAAPTPGGVGAVEAALIGGLA
AFGVPAAIGVPAVLLYRMLTCWLPVFVGWPVMRWLTKHEMV
>MAP1542 hypothetical protein
MGEVRVVGIRVEQPQNQPVLLLRETNGDRYLPIWIGQSEAAAIALEQQGV
EPPRPLTHDLIRDVIAALGHSLKEVRIVDLQEGTFYADLVFDRNITVSAR
PSDSVAIALRVGVPIYVEEAVLAQAGLLIPDESDEEGGTAVREDEVEKFK
EFLDSVSPDDFKAT
>MAP4100c hypothetical protein
MKLRLAIRELHRSERKLAHQLTVLAARHHSDQDIFHLARDLAGWSHRHLG
ELARHGRHYGLRLSADPRTAARTGVVQQRISGLLRHRPEPGLVLLADLRR
IHRLAAGVSLDWELLAQGAQAAKDAELLGLASRCHPETLRQMRWANAMLK
ELSPQVLMN
>MAP3619 hypothetical protein
MYKVGVWGPGSMGVIALRGVIDHPQLELVDLVVHSDAKAGRDAGELCGVA
PVGVVATQDPAAMLAGDADVVVYAAGANLRPLEAVEDMVSLLRAGKNVVS
CSVVPLVFPDAVDSAFSEPLRAAALEGQVSFFTTGIDSGFANDVLPLVLT
GVSRVIESVRVTEMFNYATYPDASAVYEILGFGQPPDYPAFAAQPGIFTF
GWGPVLHQLAAGLGVEIDHIEESNERIPAPESFDTPTGHIAAGTIAAMRS
TLTGYVGEKPTFVLDHVTRMRDDLAPDWPQPRIAITPKDLGYGLASGRGL
YRVEIEGSPSMRCEFEMAEDHDHDLGARIAGSSRMVNAIPAVCAAPPGLL
SALDLPLITGAGLVRPVLGPPPDSRLF
>MAP4307 hypothetical protein
MDDDCLKLTTYLAERRRAGNRFVSDVLLDLYARHRVECAVLLRGIGGFGT
GHRLRSDESLTLSEDPPVAIVATDTRTKIEALLDEVLAVKQRGLVTLERA
RLLHADVGAARPPEGDAVKLTVYLGRKQRVNGTPAYIGVCDLLHRRGLAG
ATVLLGVDGVVHGERRRASFFGRNVDVPMMIVVVGSGAHIGRVLPEIAAL
LRRPLFTLERVRVCKRDGQLLEPPHALPGVDERGLPLFQQLTVYTSESAR
HGGVPIHRAIVARLRQATAADGATVLRGVWGFHGDHPPHGDGLFALTRRV
PVVTIVIDTPAHIAESFAVIDELTGAEGLVTSEMVPALVSDDGGPGAGRM
ARHRY
>MAP0721c hypothetical protein
MRLQPLPAEQWDEATRQALAAMRGADTNNALSTLAHHPALAKAFLRFNVH
LLTASTLPPRVRELAILRVAHRRQCAYEWSHHVSMAKDEGITDEQIAAVR
CFAGDGAGPFDAFDHAVLAGVDELDEKSELSDRTWAALGERLDDRQRMDY
VFTVGCYTLLAMAFNTFGIQLEHAEQH
>MAP3040c hypothetical protein
MTSQPNDAHWQRPGESPEPTPGRPASARLVDPEDDLTPVGYPGDFNPSTG
TTTVIPYGGAAAVAGSGAAGYHLLEQQEPLPYVQPHSAARHAAPEPTDVD
DDEHHDRLLDVGRRGTQHLGLLVLRAGLGVVLGAHGLQKLFGWWGGSGVT
GLRNSLSDVGYQHADILAYVSAGGELVAGVLLVLGLFTPLAAAGALAFLI
NGLLATVSARPHAHTFSFFLPQGHEYQITLIVMATAVILCGPGRYGLDAR
RGWAHRPFIGSFVALLAGIAAGVGVWVALNGVNPIG
>MAP2797c hypothetical protein
MAKLDYDALNSAIRYLMFSVFAVRPGALGDQRDEVVDDASRFFKQQEERG
VVVRGLYDVAGMRADADFMIWTHAETVEALQATYADFRRTTALGRVCSPV
WSSVALHRPAEFNKSHIPAFLAGEEPGAYICVYPFVRSYEWYLLPDEERR
RMLAEHGMAAREYKDVRANTVPAFALGDYEWILAFEAPELHRIVDLMREL
RATDARRHTRAETPFFTGPRVPVEQLVNSLP
>MAP3535 hypothetical protein
MAIDSGRTRKADAARHDPGDIDVVLTRLRRAHGQLGGVIAMIEQGRSCKD
VVTQLAAVSKALDRAGFKIIASGLRDCITRTEQQPPLSIDELEKLFLSLA
>MAP3937 hypothetical protein
MRTRYRKVFRDVYIAKDAELTPAGKARAAWLSTGATLAGLSAAAIHGTKW
LDAAAPAEIVRADRHGQRGILVRSYTLADDEADSVSGMRVTTAARTAFDI
GCGLPAAKALPILDALLNATGIKPADVVAVADRHRGARGIRRLRASLELA
DGGAESPQETRLRVLLVRAGLPKPQTQIELRELRVRVDMGWREWKVAVEY
DGIQHWDDPYQRAWDIERIALLEAAGWAVIRVSAAMLSRPQVIVERVTAK
LAERGAYGRPRASSRA
>MAP0809c hypothetical protein
MRLILNVIWLVFGGLWMAVGYLAAALVCFLLIITIPFGFASLRIASYALW
PFGRTIVDKPTAGSGALIGNVIWVVLFGVWLAIGHLLSAAAMALTIVGIP
LALANLKLIPVSLMPLGKQIVPVGSPVPHAPPVAA
>MAP1404 hypothetical protein
MKMSGLLSRNTDRPGIVGTARVDRNIDRLLRRVCPGDIVVLDVLDLDRIT
ADALVEADIAAVVNASPSVSGRYPNLGPEVLVNNGVTLIDEAGPDIFKKV
KDGAKIRLYDGGVYAGDRRLVRGTERTEHDIADLMREAKSGLAAHLEAFA
GNTIEFIKSESPLLIDGIGIPDIDVDLRRRHVVIVADEPSAEEDLKSLKP
FIKEYQPALIGVGTGADVLRKAGYRPQIIVGDPDQISADALKCGAQVVLP
ADADGHAPGLERIQDLGVGAMTFPAAGSAIDLALLLADHHGAALLVTAGH
TANIETFFDRTRAHSNPSTFLTRLRVGEKVVDAKAVATLYRNHISAGAIA
LLALTMLIAVIVALWVSRTDGVVLHWITDYWNHFSLMVQKWVT
>MAP4348c hypothetical protein
MTGPARSGAAIRGAGRTVARGLIFLIQLYRHMVSPLRPATCRFVPTCSQY
AVDALDEYGLIRGSWLAAARLAKCGPWHQGGWDPIPERPGCRVNCQDASD
AWAVRATRGESGSLV
>MAP2910c hypothetical protein
MTTGLPSQTQVIELLGGEFARAGYEIEDVVIDAHARPPRITVIADGDDGL
DLDAAATLSRSASALLDKLDTIEDHYVLEVSSPGVDRPLRTPKHFRRARG
RKVDVVLSDNSTVTGRVGETGDDTIALVVRAGRDWAIREIPLGDVVKAVV
QVEFSPPAQAELELAGVGGTDKTEERRK
>MAP3976 hypothetical protein
MIFVTPSSAPRPFNRRVALATLGIGALAPGVLAACGGGTAKEAEKKEQPA
LRLKYQPADAAQNVVPTAPVSVEVSDGWFQHVTLSNSSGKAVAGTFNSDR
TVYTTTEPLGYDQTYTWSGSAVGHDGKTVAVAGKFSTVSPSKKISGAFQL
ADGQTVGIAAPIILQFDAPISDKAAVEKALTVTTNPPVEGSWAWLPDEAQ
GARAHWRTREYYPAGTTVNVQAKLYGLPFGDGAYGAEDISLNINIGRRQI
VKAEVSSHRIQVIRDEGVIMDFPCSYGEADKARNVTRNGIHVVSEKYADF
YMSNPAAGYSHVHERWAVRISNNGEFIHANPASAGAQGNTNVTNGCINLS
TSDAEQYYQSAIYGDPVEVTGSSIQLSYADGDIWDWAVDWDTWVAMSALP
PPTAHPPSTQIPVTAPVTPSNAPTLSGTPTTSTTTPASTGPATPGG
>MAP4256 hypothetical protein
MRHYYSVDAIRAAEAPLLASLPDGALMRRAAFGLATEIAAELTARTGGVA
GRRVCAVVGSGDNGGDALWAATFLRRRGAAADAILLNPERAHRKGLAAFG
KAGGRIVESVSPTTDLVIDGVVGISGSGALRPAAAEVFAAVDDAGIPVVA
VDIPSGIDAATGATSGPAVHAVLTVTFGGLKPVHALGDCGRVKLIDIGLD
LPQTDVLGFEAADVAARWPVPGPHDDKYTQGVTGVMAGSATYPGAAVLCT
GAAVAATSGMVRYAGSAHREVLAHWPEVIASPTPASAGRVQSWVVGPGLG
TDDAGAAALWFALETDLPVIVDADGLTMLAAHPELVANRAAPTVLTPHAG
EFARLAGSPPGDDRVGATRKLADTLGVTVLLKGNVTVIADPGGPVYLNPA
GQSWAATAGSGDVLSGMIGALLASGLPAGEAAAAAAFVHARAAALSAADP
GPGEAPTSSSRMVPHIRAALAAL
>MAP0548c hypothetical protein
MAEPFIGSEAVASGLVTPYALRSRFVRVHPDVYVPAGTALSAGLRARSAW
LWSRRRGVVAGRSAAALYGTKWIDDRAPAQLLYPYRRPPDGIQTWSDRLV
GDEIQTIGGMPVTTPARTALDIACRSPVDKAVAAIDALARATELKVLEIE
LLADRYRGRRGITRGRSVLPLVDAGAESPRETWLRLLLLRAGFPRPRTQL
PVRQYGALIACLDMGWEDIKLAVEYDGDQHRTDRRQFTKDIRRAEVLAEL
GWTVVRVTAEDTPAGIIARVSTAWTRRTCTGSEKPAGNSR
>MAP2025c hypothetical protein
MERPDTLDVLRLIANAPNIFESWSQMASQLFDTETFSPRMREVIILRVAH
LQDSPYELAQHVVFARAAGLTDRQIDALQDKADLDAAGFTDDERILIDTV
TELCTTHRLDDASFAKAHTLLGDEALTELLMIVATYYGLALVLNATDLDI
DAPT
>MAP1019 hypothetical protein
MRETSNPVFRSLPKQSGGYAQFGTGAAPMQGYQADPYAAPYATPYQETRA
SRPLTIDDVVTKTGITLAVLAASAVVSYFLVLSNVALAMPLTLVGALGGL
GLVLVATFGRKQDSPAIVLSYAVLEGLFLGALSFVFANFSVSSANAGVLI
GEAVMGTFGVFFGMLVVYKTGAIRVTPKFTRMVVAALFGVLALMLGNFVL
AMFGVGGGAGLGLRSGGPLAIIFSLVCIGIAAFSFLIDFDAADQMVRAGA
PEKAAWGIALGLTVTLVWLYIEILRLLSYLQND
>MAP1570 hypothetical protein
MDVTAATEYLARSTTLTSVGIIGYIIIGGLAGALASKIVRGSGAGILMDI
VIGIVGALIGGFILSFFVNTAGGGLIFTFFTALLGSVILLWIVGMVRRT
>MAP3934c hypothetical protein
MLTGRLDLMSLVVPPYPPARYTKDQPETSAWLKRADEPPDYQTAGVKYHY
LANQHDTAGDYGLYRVDIAPAGGGPGPHFHRAMSEAFFVLSGTMKLYDGT
EWTDGHQGDFLYVPPGGVHGFRNEADDPASILMLFAPGAPREAYFEGFAA
LADMTDEERREWFARHDNFWVQ
>MAP2903c hypothetical protein
MMLATTTLALKEWSAAVHALLDGRQRVLLRKGGIGEKRFELAAGEFLLFP
TVAHSHAQRVRPEHQDLLAPAAADSTDDELVIRAAAKVVAAVPVNRPDGL
PAIEDLHIWTAESVRADRLDFRPKHKLAVLVVSVTPLAEPVRLARTPDYA
GCKSWVQLPVHARLGPPIHDDETLAAVADRVRDAVG
>MAP2993 hypothetical protein
MATVVAFHAHPDDEVVLTGGTIARAVAAGHRVVVVTATDGRVHNEDTDHR
LDELRSSARILGAQRVECLGYADSGYGPLFYPDPPGRTRFGRADLDEAAG
KLAGILRDEHADLLLSYQANGGYGHRDHVRVHHVGKRAAELAAVPRVLEV
TMPREMLLRISDLAHLLRLPGPYEADIVGSAYAPRAAITHRINVFRFARQ
KRDAFAAHRSQIGRSGPAARLFGLLLRLPPQVFGALFSHEWFVDPALPTG
TVRRDIFD
>MAP2529 hypothetical protein
MSRVTPLRFRDWPPEMRDAMAALMPPNPRHPAPVTKDRPKAGNALGTLAH
HPSLARAFCTLQGHLLMGTTLSMRHREMLILRTAAVRGSAYEWTHHNFIA
PDGGISTEDVARIAFGPTAPYWSDFDAALLRSVDELIHDGQMSDATWATL
SSEFGTQQLLDTMFTVTSYDALMRMLKTCQVRLEDDVRELRAQAGLSVDG
DGA
>MAP0875c hypothetical protein
MRSIWKGSIAFGLVNVPVKVYSATEDHDIKFHQVHAKDNGRIRYQRVCEL
DGEVVEYRDIARAYESDDGQMVIITDDDIATLPEERSREIEVLEFVPANE
VDPMLFDRSYFLEPDSKSSKSYVLLAKTLAETDRMAIVHFTLRNKTRLAA
LRVKDFGKRDVMVIHTLLWPDEIRDPDFPILDKKVEIKPAELKMAGQVVE
SMAEDFNPDRYHDDYQEQLHELVQAKLEGGEAFTTEEQPKQLDETEDVSD
LLAKLEASVKARSGDGKAPAKKSPAKKTAAKKAPAKKGAAKKAPAKKAAS
RS
>MAP2881c hypothetical protein
MGTAPIRVFQVGSGNVGSEMIRRIATQPDLELIGVHCYSPEKIGKDTGQF
AGLAPNGVKFTGTVEEIIAAKPDVLTFHGVFPDEDLYVKVLEAGIDIVTT
ADWITGWRRDKNHPHPSGKPVTQLLAEAAAKGGATFYGTGMNPGLNQILG
VVCSADVAEIENVTTIESVDVSCHHSKDTWIEVGYGQPVDDPEIPAKLEK
YTRVFADSVYMMADCFDLTLDEVTFSYELGACTKDVDLGWYTLPKGSLGG
NYIKYQGMVDGVPRVETHLEWQMTPHTDPSWNVKGCYITQIKGDPCIYNK
HMIFPKPGVDLSNPDNFASIGMTVTGMPALAAIRSVVAAPPGLLTSADLP
LRGFAGRFKK
>MAP2674c hypothetical protein
MPPRAEEGNLWFEWSRSLDDPAEYVLVEAFRDGDAGSAHVNSDHFKRAMQ
ELPQALKSTPKIISQTVEATGWSRMGEMTVD
>MAP4244 hypothetical protein
MDPTLSYNFGEIEHSVRQEIHTTSARFNAALDELRARIAPLQQLWTSEAA
TAYQAEQLKWHRSATALNEILVQLGDAVRDGAEEVADADRRAAGVWAR
>MAP0183c hypothetical protein
MPPRKRRAPRLLVAVAALCLGSVAWSPMASAHVHAGSDNPVRGAMAVVTF
QVPNESNTGAATTALTVALPNVAAAHTETMPGWTARLDRDAASGTVRSVT
WTAAAGGGIGPDQFALFRLSVKLPDADTVSFPATQTYADGTVVKWDQPPL
PDGGEPEHPAPTLALAAGPAAGHQHPGGPAAADNAARWLGGAALVLAALG
IAIALVRRRA
>MAP1976 hypothetical protein
MDRWHRRVDSGDWDAIAAAIGEFGGALLPRLVTPREAARLRELYADDGLF
RSTIDMAPKRYGAGPYRYFRAPYPEPIEQLKQALYPRLLPIARDWWGKLG
RDPVWPDRLDDWLAACHAAGQQRSTALMLKYGAGDWNALHQDLYGDLVFP
LQVVINLSDPQTDYTGGEFLLVEQRPRAQSRGTATQLPQGHGYVFTTRER
PVRSARGWSAAPVRHGVSVVRSGQRYAMGLIFHDAA
>MAP1702c hypothetical protein
MVPGSHHQPLSQASGCGPTPSGRHTLAVGSHHWPGSQANGCGPTPSGRHT
LPVGSHHSPSAQSALLTAGVAIATAAIDANANMATTTSWRTNVFMVGLPS
LLPVIEPSPGLRVKAIGQVAGDGPAAKPSDEFVAAHRSAYKHKVGNRMDW
YSSSQSTFGADMDAPPGGGFAVNVFLRDGDTVYRTWHTNGRGTEQLSHSF
GLIDILPWGRQEEWQDSPHGWPSRPTHSGWPDSPAIARAYGPNDG
>MAP4073 hypothetical protein
MPVSEPAGYVEVRAYAELNDFLPAESRGAAVRRPFRAHQTVKDVLEAMGI
PHTEVDLIVVNGSVHGFDHRPRAGDRIAAYPMFEALDIGPTARLRPVPLR
DPRFVVDVNLGRLAWLLRLFGFDVWWSNDADDQTLAAISAEQHRILLTRD
RGLLKRRAVTHGLFVRPDDPEEQALGVIRRLDLTGRLAPLSRCVRCNAGP
>MAP3083 hypothetical protein
MQWTRSVKGSLAAGPALSARGYLGLNAQTPAGCSLMEWENTDNGRQRWCV
RLVQGGGFAGPLLDGFDNLYVGQPGAFLSFPVTQWTRWRQPVIGMPTTPR
FLGHGQLLVVTHLGQVLVFDSHRGQVAGSPLDLVDGVDPTDATRGLADCA
PARPDCPVPAAPAFSAATGMVVLGVWQPGAPSAGLVGLKYHPGQSPLLTR
EWTSDAVGAGVIASPVLSADGSTVYVNGRDQRLWALRAADAKVKWSAPLG
FLAQTPPAVTPQGLIVAGGGPDTRLAAFRDAGDHADQVWRRDDLIPLSSS
SLAGVGYTVVSGPPANGAPGMSVLVFDPGDGHTLNSYPLPAATGYPLGVS
VGTDRRVVAAISDGQVYGFAPA
>MAP2668c hypothetical protein
MPSITPSLWFDDNLEEAATFYTSVFPNSHIEGFNRTTEAGPGEPGTVLSG
SFVLDGSRFIGINGGPHFRFSEAVSFTVHCKDQDEVDYYWDRLSDGGEES
QCGWLKDRFGLSWQIVPDRLFELIGDPDRSRAAAATQAMYGMRKIVIAEL
ERAAASS
>MAP1007 hypothetical protein
MNPGANSPRPYRRLRHLATRADKHSKGGTSVSDANGPTARFGIDNTAVVK
VPVHHGPISDIDVSPDGRRLLVTNYGRDTVSVIDTHTCRVASTIAGLSEP
FAVAMSAADPNYAYVSTATAAYDAIEVIDVVTNWRIATHRLAHSLSDLAV
SPDGRYLYASRNAVRGADVAVLDTTTGELEVIELAAAAGTTTACVRVSAD
GRRLFIGVNGPNGGGLAIIETRTRTDGRRVGGRSRLVGTVELGLPVRDVA
LGNDGNTAYVASCGPVVGSVLDVVDTRAATIVSTHKINEITGPLTRMTLS
RDGERAYLISDDRVTVLGTRTQDVLGEVRVTKHPSALVESPDGHYLYVAD
HSGVLSVARLSSGTAPAAPCSGADEDDDATTGWLPELAPWEPVLA
>MAP1148 hypothetical protein
MSEPINRGIVALGGGHGLYATLSAARRLTPYVTAVVTVADDGGSSGRLRS
ELGVVPPGDLRMALAALASDSPHGRLWATILQHRFGGSGALAGHPIGNLM
LAGLSEVLADPVAALDELGRILGVKGRVLPMCPIALQIEADVSGLEADPR
MFRVIRGQVAIATTPGKVRRVRLLPDNPPATRQAVDAIMAADLVVLGPGS
WFTSVIPHVMVPGLAAALWATTARRALVLNLVAEPGETAGFSVERHLHVL
VQHAPGFTVHDIIIDAERVPSEREREQLRRTATLLSAEVHFADVARPGTP
LHDPGKLAAALDGVRAARKAPGASPVTATADIRVDGASPPAGGNGPAGSG
PRGDDAWR
>MAP1312 hypothetical protein
MQRRRTAARLLSAHREVRVNGPVVDVATKPRAGERVVVSADSRAARLIGA
LALFCAACWLIQILVHHHSNPSWHYADRLAWSLTVLVAVAWIARGIFLGR
PVTTMHAAVAAFFVLAGLGLHVLSFDLLGDVLIAGSGMVLMWPTSSHPRP
ADLPRIWELINATRDDALAPFTMQTGKSYHFSADGSAALAYRTRLGIAVV
SGDPIGDEAHFPELVADFAVTCHAHGWRIAVVGCSERRLELWKDSAVLGQ
TLRPIPIGRDVVVDVSSFDMVGRKFRNLRQAVKRTHNCGVTTEIVAEQEL
SDELLAELTEVVRESSKGAHADRGFHMNLDGVLEGRFPGILLIIARDATG
KVQAFHRYATAGGGSDITLDVPWRRRGAPNGLDERLSVDMVMAGKDRGAQ
RVSLAFAAFPEIFEDKNRGWTRRIFYRLTHVLDPLIALESLYRYVRKWHA
LDGRRYAVISMTQIVPLLFVLLSLEFLPRRRHL
>MAP3530 hypothetical protein
MPSEINNSETRLSWVLAVLAGVLGATAFTHSAGYFVTFMTGNAQRAMLGY
FRGDVVLSVTAGVLIVCFVAGVVIASVCRRHFWVDHPHGPTVLTTFSLVA
ATLVDVIDEGWEENLLDFAPIMLVTFGIGALNTSFVKDGEVSVPLSYVTG
TLVKMGQGIERHIAGGTAADWLGYFLLFASFVVGATVGGFISLFVNGTSM
LVAATVMCALTTGYTYFHSDRRALLDEA
>MAP3639c hypothetical protein
MPHDAPAHNLDLPREQTPRGRYWWVRWVILGVVAIVLAVEVSLGWDQLAK
AWMSMYEANWWWLLASVVAAAASMHSFAQIQRTLLKSAGVHVKQLRSEAA
FYAANSLSTTLPGGPVLSATFLFRQQRLWGASTVVASWQLVMSGVLQAVG
LALLGLGGAFLLGAKNNPFSLLFTLGGFVALLLLAQAVASRPELIEGIGS
RVLAWVNSVRGRPAETGLDKWRETLMQLESVSLGRRDLSVAFGWSMFNWI
ADVACLGFAAYAAGDHASVAGLTVAYAAARAVGTIPLMPGGLLVVEAVLV
PGLVSSGMSLPSAISAMLIYRLISWLLIAAVGWVVFFFVFRTENIADSDD
EPITGPLPVLPTPGGPPDPTDTALQGPLPPDRNPADPNPDKDV
>MAP2756c hypothetical protein
MVGIPVYLDVESRIDQRALMATSRALVDHFARVGNDISHGLGGSLSKAFA
AVDGTAARRDLLALQQEWRRAADVEADAAARMIRDQRRLAEATVKYGDDS
SRTAAAQAMLARSQRDHIDAMIAAEAAHGRLAKAGNETADSVSRMQKLGA
NPIFNAAGIGSVAAMGIGLVSATDAAGNFQQSLQRLHTVAEESPANLKAI
SDGVLKLSSVVGYAPQKLMDAALGVEKAGYRGSDAIKVLTASAQLASEEG
ADLGETISAVTTTMHDYHIPVEQAANVTSKLNVAVGLSKVSLQDFAGALH
NVEPVAAGVGESVNDLYASLAMLTQSGMGADQATQNMTHAITSLAKPTQQ
MSQEMGQLGLDARDIQEHFGERGLIGTANLLYDTIQSKLGPSGMVTLDAW
FKSKQVADSANEMFSKLPAQAQAVATAIQNNAEAYKQFRKDRGGLSVEEA
NLVEQWWNQEKALTGFNNQLKSGKGDVQSMIQALALMVGGQDNLRTVLQL
VGENGPKAAAAVKEVSQAQADGAGNTKGFTESQETYNAKMKDFKGALSAA
RIEIGQDFLPAMTEVVGVLGESAHWLTEHKPLLDAVTVGVGAMGTAWLAI
KGYNIASTILSPIGTGLGMLKDKLFSVETQATTASTAMRNMGPAAMAGEA
EVLAAADAEVAAEGRVTAAAGEANAALSGGRTGGAGLAAAAGPLGIAAAG
SMAANDIFGSLDRRFHTNLFTKLGEIPGSGAWVAGQLFGHAEGGPLHAPG
PKGHDSALFWGADGEHVLTHHEVQKMGGHSGVYKFRSDLMNGRIVLGRAG
GGALGYGGMAPDVAVASSLAGTPYSQGARDDCSGMVGRVILGAMGLPATN
LPTTKNMGQWLAALGFQPGIGGPGSISVGWYDHGPNPNDGHAAMTLSNGE
NAEAGGSHGNFVIGAGAAGAASSQFDHHMFLPTLYGEGAATGMPGFAAGM
GAGGFGGMGGGIPPGATPGTGPGGQPGYYTANPQRVAAAEERLRHLDAEI
DNAEKRRSELKATAKQSERDRLDEEIRHLKAERTQEQQRLAEAERGTFHA
MHGHRGAGGGENPFLPVPLADQFGLSKGLPGLAEWTVGFLEDLVLGPLET
AAWAAIGQAPPGAGGTGGGFGGLSAPGGLGAARFGFPNPAPLAAPPGAAG
ADDAAAAGLDTARGNTTGQAPSSGGGAGAGGGSEFEFVKSPSKPGLPLGP
LARPPAPDSAPPPADYKSWYGQGASDSFYRTWYPTPLGPVPTPSPNAYGP
PSHGSDWRWGPVHVAPPAPPGPMKSPLEQQQSMLGGAGSGGPLPSGPKPL
PMGPFIGGDPRWGTAHFASGGPSGTDTIPAWLSPHEYVEPSEAVDKYGPG
FMDAIRQGRIDPTSVRYYAPGGEVTDQPEPPPQQQAPAQQPQNMVKAPGA
PGGPAIEPPPGAPKPGDNASIHEPTGPGAVSPGSKQGLSDTATPGADVQQ
PGTGQGALPGIGFSGGIIGGLEGAATQAAAMGADMGTFGGAGGAVSSAMN
IGFQELNRAAAYGAQDVGIGVEGLLEALIPNSDATGADWSKTIPGRLLMG
VTGVRTAGQQNTAGQTQQPFASNASSDQYANVGNTQPQAPIQIMGPVHVQ
ANDPKQLHEGLNSQAAMANSANMIGTQFRGTAGGTG
>MAP2723c hypothetical protein
MDSVSLTDLAAEKLAEAKQSHSGRAAHTIHGGHTHELRQTVLALLADHDL
SEHDSPGEATLQVLQGHVRLTTGGDAWEGRAGEYVAIPAERHALHAVEDS
VVLLTVLKTIPSAH
>MAP3374 hypothetical protein
MLCEAQVKVTVLVGGVGGARFLLGVQQLFGLGQFRAQRHTHGRPDTAAGS
HELTAIVNIGDDAWIHGLRVCPDLDTCMYTLGGGVDPERGWGHRDETWHA
KEELARYGVQPDWFGLGDRDIGTHLVRTQMLNAGYPLTQITAALCDRWQP
GARLLPVSDDRCETHVVITDPDDGSRRAIHFQEWWVRYRAQVPTHSFAFV
GAEKAAATTETIAAIADADVILIAPSNPVVSVGAILAVPGVRGALRAAGA
PIVGYSPIIGGKPLRGMADACLSVIGVESTAEAVGRHYGARRGTGILDCW
LVSQDDHADIEGVAVRAVPLMMTDPAATAEMVSAGLQLAGVTP
>MAP3724 hypothetical protein
MAIHQSVQAPIGHVERGATDPPRPQRRRRGRTLGIDEGLMGVALLAGPAN
VIMQLAQPGVGYGVMESRVESGRVDLHPFKRARTTFTYLAVATNGSDAQK
AAFRRAVNKAHAQVYSTPESPVSYNAFDPALQLWVGACLYKGGLDIYRMF
IGELDGEDAERHYREGMALATTLQVPPKMWPADRAAFDRYWEESLAKVHI
DDAVREYLYPIAANRIRGVKLPGPIQRLSEGFALMITTGFLPQRFRDEMR
LPWDATKQRRFDRLIAVLATVNRYLPRFIRQFPFNVLLHDLDRRIKKGRP
LV
>MAP0276 hypothetical protein
MNAAPSDPSRRVWQAMLTWRAQDVSRMESVRLQVSGNRIKANGRIVAAAT
DANPAFGAYYDLQTDETGATKRLGMTVTLAERERVFSFARDEENMWLVTD
PQGEHRAAYNGALDVDVEFSPFFNALPIRRLGLQERAASVTLPVVYVNVP
EMSITADTVSYSSTGSRDEIKVHSPISDTTVSVDEQGFIVDYPGLAERI
>MAP0787 hypothetical protein
MAVQASSEIVIDAPPEVIMEALADMDAVPSWSSVHKRVEVIDKHPDGRPH
HVRVTIAVVGIHDTELLEYHWGPDWMVWDADRTAQQHGQHGEYNLSRLGD
DKTRVRFTITVEPWVPLPEFWVRRARKKILHAALEGLRKRVMTPEAFGRA
D
>MAP3445 hypothetical protein
MPASVSFCPRASRQRGSRRRASRQRDARAAARRFQAPSHIAGLPRLIGMT
QKTRIEPLPPQRAGLLTRAMYRIAKRRYGQVPEPFAVAAHHRRLMVASAV
HETLVDRASKTLPASVRELAVYWTARQIGCSWCVDFGSMLQRLDGLDIQR
LTEIDDYATSPAFTDDERAAIAYAAAITTDPHTVTDEQVDDLRARFGDAG
TFELTYQIGLENMRARVNAALGITEQGFNSGDACRIPWATGDTETSDPAA
NR
>MAP0817c hypothetical protein
MVAPLLRAQIDIDAPVATVWKLVSDLRRMPQWSPQCRWMKPLGPVRQGTR
TINLNRRNRLFWPTTCTVVEIIPDRKLTFRVDTNNTIWSYELEPTDTGTR
LIESRHAENGVTAFSNLSVKAFLGGTDNFERELLDGMNASLARIKAAAEN
TDRR
>MAP4070c hypothetical protein
MIGSDPGLLRLRMATRTTAALGLSLLALWLLTRATGQPLTVALLGVVITM
VASRSVNEPDPRQQRITMALLPVPAAVAITAAAVLAPHPVAGDVVFVLIV
FAAAYIRRFGARGRALGTVCTYVMTSHVLPDRPERVLRATIRALRARMAI
VIDTTAEAVRTGRLDERRRRRMRARTIRLNETALMVQSQIEDRADPGTLW
PGVTAEQLAPWLLDAELAIEWVATAGRRAAALGAELPEAARAELVSALTQ
LGRAIRLPEPGGLRDAASRARRLLDAATDDRPAGTAVRRLALAIINAATA
TADVRAIVDGAAGAAVPDVAGSERPPAAAGAAAEEPAEQEEEQPAGLRPT
TRQAIQVSVAASLAIITGELVSPARWYWAVIAAFVIFAGTNSWGETLTKG
WQRLLGTMLGVPAGVLVATLLTGHEAAALAGIFVCLFCAFYFMTVTYSLM
TFWITTMLALLYGLLGQFSFGVLMLRIEETALGAVIGATVAVVVLPTNTR
TAIRADTRAFLTSLSALIEACAAAMSGSAASPSEQARQLDRDLQQFRVTA
KPLLAGVAGLAGRRGIRRALRIFTACDRYGRALARSSEQYRGSPGPGPQL
AQAFSAAAAQTRRNLDALLEAIDSGHPPTLVSAADELDAAETAARQHDSD
GDGETRPDVRRFLTAVHALRQIERAVISTATNLGGHEDVRTTTAAPR
>MAP1986 hypothetical protein
MPRWLRGLSFLLRPGWVVLALVVVAFAYLCFTVLAPWQLGKHSRTSQQNH
QIEHSLTTPPVPLKTLLPQQNSAAPAEQWRQVSATGHYLADVQVLARLRV
IDSKPAFEVLAPFVVDGGPTVLVDRGYVRPLEGSRVPPIPRPPADTVTIT
ARLRNSEPAAGKDPFVGDGVRQVYSIDTEQIAVLTKVPLAGSYLQLVDGQ
PGGLGVVGVPQLDAGPFLSYGIQWIAFGILAPIGVGYFAYSELRARRAER
QPAAPAPEAPQSVQDKLADRYGRRR
>MAP0161 hypothetical protein
MSDPITYNPGAVADFATDVASRAGQLQSIFDDTSNRTHALQEFFAGHGAS
GFFEAQAQMLSGLQGLIDTIRQHGQTTSHVLDSAISTDQHIAGLF
>MAP1317c hypothetical protein
MCQTAAMTMSPGSPVPSLLPHLWKSALLSGILSLALGVLVLVWPGISILV
AAVAFGVYLLITGIAQVVFAFSLHVSAGSRVLLFISGAASLILALLAFRH
FGQGYAILLLAIWIGIGFIFRGVATTISAVSDPNLPGRGWNIFLGVISLL
AGIVVLASPFESIVTLAIVVGAWFVVIGVFEIISSFGIRKASKTLAG
>MAP2663c hypothetical protein
MTPYDVGLLILRLVLGVTLAAHGYNKFFGGGRIPGTARWFESIGMKPGKF
HATVAASTEMAAGLGLAAGLLTPIPAAGFVSLMLVAAWTVHRPNGFFIVK
EGWEYNLVLAASAVVVATLGAGRLSLDWLVFGKNWMDGWNGLLISLLLGL
AGAIGQLVIFYRPPAKQTG
>MAP1104c hypothetical protein
MSAADGVSSQTWTKVPAITLGFWVIKVLATTLGETGGDTVTMTLDWGYAA
GVALFGVTLVLLVAAQILAARFHPVLYWATIVASTTFGTVLADFADRSLG
IGYTGGSLLLLACLLATLGLWRWSQGTVSVATVHTPKVEAFYWATITFSQ
TLGTALGDWLADTRDFGYRRGALVFAAGLLVVAGLYFWTGVSRVALFWVA
FILTRPLGATVGDFIDKPVAQGGLAWSRPLATAVLAALIAVLLIVIPQRP
GRHPGRPEAGAAQSPTAT
>MAP3614 hypothetical protein
MLTFATGLADAISILVLGHVFVANMTGNVIFLGFWLAPRTSIDLTAVVVA
LPTFACTTILSGRLARHFGERVRAWISTVLATEIVLLVGLSVLAGSGILG
YQQDSKLLMIAILAVTFGLQHSSARQFGIQELSTTVLTSTIVSLGLDSRL
AGGTGERQRLRIGVVATMCAGAFLGATMSRYVVAPVFVVAAAVIAASLLI
FRFGPPAAKPAAAEAN
>MAP4000c hypothetical protein
MGRVTRHYVPAMSDNTVRVDPVVMQGAAASLSGAAEHLSAQLGQLDDQVG
QMLGGWQGASGSAYAAAWELWHRGAREVQLGLAMLARLVGQAGEAYASNE
AGAAQAERAVRGG
>MAP1988 hypothetical protein
MKRPAKRVADLLNPAATLLPAANVIMQLSLPGVGYGVLESPVDSGNVYKH
PFKRARTTGTYLAAATIGTDYDRALICAAVDVAHRQVRSRPSSPVSYNAF
DPKLQLWVAACLYRYFVDQHEFLHGPLDEASADAVYRDAARLGTTLQVPE
RMWPPARAAFDEYWKRSLDELRIDPPVREHLHGVASLAFLPWPLRVLAGP
FNLFATTGFLAPEFREMMQLDWTPAQQRRFGWLLFSLRLADRLIPHWVWI
LGYRIYLWDMRSRARQGRRVV
>MAP0303c hypothetical protein
MMTLSLSNHLAADSAHHALRRDVLDGLRRTPKSLPPKWFYDSVGSDLFDQ
ITRLPEYYPTRAEAEILRAQAPGIASASEADTLVELGSGTSEKTRVLLDA
LRERGALRRYVPFDVDAGILSASAAAIQREYAGIEIQAVCGDFEEHLTEI
PEGGRRLFVFLGSTIGNLTPGPRAQFLAALSAQMRPGDSLLLGTDLVKDT
ERLVRAYDDAAGVTAAFNRNVLAVINRELDADFDVDAFEHVARWNADEER
IEMWLRATVRQRVRVGALGLTVDFRAGEEMLTEVSCKFRPERVSAELAAA
GLRRTRWWTDGAEDFGLSLAVK
>MAP1580c hypothetical protein
MTVRIGTSGWSYDHWADVLYPPGTPSARRLARYIEVFDTVELNASFYRWP
KDSTFAGWREQLPDGFTMSVKAHRGLTHYRRLASPEPWIERFERCWELLG
DRRGVLLVQLHPEQQRDDARLDSFLERMPASIRVAVELRHPSWNDPAVYA
LLERRRAAYAVTSGLGLACIPRATTDLVYVRMHGPDPAANYAGSYSDDDL
RCWAERITAWDGDGKDVWMYFNNDLGGHAVRNALALRELVG
>MAP1338c hypothetical protein
MVSQPKTVSRLLDAEEVDQRHGVARRVCQFGGPGCGNGQFRRCQNNPIPE
GSAESIALRRFSPIAGPTANSPAAVCEPPHRARPKVTSIMSMPTLEPTFE
SGAGDIVAEPAPRRPRGRLLDPWAIAVLATALSAAWACRPSLWFDEGATI
SAAANRTLPELWRLLGHIDAVHGAYYLLMHGWFALFPPTEFFSRFPSALA
VGAAAAGVTVFTRQFAPRRTAVCAGAVFALLPRMTWAGMEARPYAFVAAA
AVWLTVLFVAAVRRGAPRRWVGYALALMLAILLNLNMVLMVPVYGVMLPL
LTARGARRSAALWWAGSSAVAVGAMTPFLLFAHNQVWQVNWIYPVSWHYA
FDIILRQYFDHSVALAVLSAVLIVAAAVARLAGVPAPPGDLRRLLILCAA
WMVIPTALVVVYSAVGEPIYYPRYLIFTAPAMAIVLAVCIVTLARRPWPI
AGAVLLCAVAALPNYLFVQRWPYAKEGWDYSQVADLIGSHAAPGDCLLVD
NTVPWRPGPIRALLATRPAAFRSLIDVERGAYGPKVGTLWDGHVAIWLTT
AKINKCSTIWTITNKDNSLPDHQSGQSLPPGSAFGQAPAYRFPGYLGFHI
VERWQFHYSQVVKSTR
>MAP3829c hypothetical protein
MERVTHSHSHGLPSGPAPVDRLPARIVVGLLIAIGVAVAAGAIVLWPSRQ
HVDIPMPLQNAAGGAVSTQAGHVLSSGLGDCGSPSVSQVLTGPPQPALPG
AGRCVLTQVAIDSGPNAGAATLLESSPGPGQPKFAVGDRIRIVRQVDDQG
ATSYAFYDFERGWALVGLAVAFAVVIVAVARWRGLLALVGILVAFVVLVT
FLLPALRDGAPAVPVALVAAVAILYAVIYLAHGVSLRTSAALLGTLSSLL
LAAGLSWAAIQLAHLTGLSDEQNSTVSAYLGSVSIGGLLLAGFIIGSLGV
LNDVTVTQSSTVFELARLGGSRRAIFTGALRVGRDHIASTVYTLVLAYAG
SSLPLLLLFSVANRSLTDVLTGESVAIEIARSAVGGMALALSVPLTTAIA
AVLAKPSGGRKRGEAGRGGPPPSV
>MAP3888c hypothetical protein
MEPQENRKNPQRIFSCTPHSYSMAGQSTLWVGTARMLRRAGFGVTGPEVD
AAVARGWPGHLDAMLAADPDADPGALATPMPALPAPQPPGKRATPAARKQ
YNQQLTEQQGVLSDWWIRRMVVVRQPFHEKLTLLWHNHFATSAQKVRVAA
QMAAQNQKLRTLSLGDFRTLAYAMLTDAAMLRWLDGQTSTAKAPNENLAR
EFMELFALGHGNGYTESDVRNGARALTGWVIGAGGATSVLPKRHDATAKT
LFGRSANFDAAGFCDAVLAQPKSAGYVAGRLWQQLAGDDPPSPPALDRLV
AAYGPGRDLRALTRAILTDPEFTGARASMVNTPIEWLIGVIRSLRVPVDD
PKRLKMIDATLRTLGQRPFYPPSVGGWPSGQVWLSTASAGARLRAATELA
HAGDLSGIENTPPTDRIDAVGYLIGVGAWSDRTARALQPLVGQPPRLVAA
AVNTPEYLTS
>MAP2161c hypothetical protein
MVATLFYTDELPDAGSLAVLGGDEGFHAATVRRIRPGERLVLGDRAGGLA
RCEVEHAGRDGLRARVLERWTVAPPNPPVTVVQALPKSERSELAVELATE
AGADGFLAWQAARCVANWHGPRVDKGLRRWRAVAKAAARQSRRPHIPTVD
GVLSTASLTSRIRDEVAAGAVVLALHESATDRLTDVPVAQAKSLFLVIGP
EGGIADDELAALRQAGALSVRLGPQVLRTSTAAAVALGALGVLTPRWDDA
ASP
>MAP1679c hypothetical protein
MAAALPLVKLLRDTVAVRMSVAVLLGVAVAVVVGNTVGWRFALVGWVVTA
GVYVVWTQLILFGMDAEQTRVWATREDPTRWVADAVILSASVASLAGVGY
VVAAGSHAGARAVAAAVLGILAVAASWFAVHTLFTVHYARLYYSGQPGGI
NFHDPEPPRFRDFAYVAFTVGMTYQVSDTEIGLTAIRSTVLRHALLSYLL
GAVILAVTINLIAGLGAKL
>MAP3642c hypothetical protein
MSLTEATAAGAEPVAPPAVLSGDALGGFPPPGGRVLLVWDAPNLDMGLGS
ILGRRPTALERPRFDALGRWLLARTAEVSAERPGVVVEPEATVFTNIAPG
SAEVVRPWVDALRNVGFAVFAKPKIDEDSDVDRDMLAHIELRRTEGLAAL
VVASADGQAFRQPLEEIARSGVSVAVIGFREHASWALASDTLDFVDLEDI
SGVFREPLPRIGLDSLPDQGAWLQPFRPLSALLTARV
>MAP0236c hypothetical protein
MRFVVTGGLAGIVDFGLYATLYKVVGVQVDVAKAISFIVGTITAYLINRR
WTFQAAPSTARFVAVMALYAVTFAVQVGLNHLWLAFLHYRGWAIPVAFVI
AQGTATVINFVVQRAVIFRIR
>MAP2740 hypothetical protein
MNVEPLLHSIPPLAVYLVVGGVVGIESLGIPLPGEIVLVTAALMSSHHDL
AVNPLGVGVAAVIGAVIGDSIGYAVGRRFGMPLFERLGRRFPKHFGPGHV
ALAEKLFNRWGARAVFLGRFIALLRILAGPLAGALKMHYPRFLTANVSGA
ICWAGGTTALVYFAGVAAERWLERFSWIGLVVAVVAGLTAAILLRERTSR
AIAELEAEHYRKTGNTATDPV
>MAP2118 hypothetical protein
MTNSPSSYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPSTSQAPQP
REWFAGVDDESILSLLDPLDAGTPDLAPPEIWRSYIRTVGRTPNGVWGGK
LMWNQTPLLLDRAKNLPDRSGDGLRAAIRDVIGEEPLLIHVYRPDVVSQA
VSFWRAVQTRVWRGRPDPARDARATYHAGAIAHVVTMLRAQEEGWRNWFA
EEDLKPMEIPYPVLWRNLTQVVAAILEQLGLDPQLAPEPVLERQADHRSD
EWVDRYRADAEKYGLPT
>MAP0796c hypothetical protein
MTVTVILELRFKPDEVAAGRELMGRALQDTRAFDGNVRTDVLVDEDDEAH
WLVYEIWETVEHDQAYRAFRAGEGKLTQLPPLLAAPPVKTRYVTSDI
>MAP3746 hypothetical protein
MTRTPESQQESDYGYVAHKDGYAKRLRRVEGQIRGIAKMIEEDKYCIDVL
TQISAANNALRSVALNLLDEHLDHCVSSALAEGGDEAQAKLSEASAAIAR
LVRS
>MAP1586 hypothetical protein
MCHRSHMDSAADPFDLKRFVDAQEPVYAAVVDELRAGRKRSHWMWFVFPQ
LRGLGGSAMADRYGISSLPEARAYLRHELLGPRLHECARLVERVQGRSVG
QIFGSPDDLKLCSSMTLFAHATDDNADFLAVLQKYYDGRQDPVTLARLAD
S
>MAP0992 hypothetical protein
MVDRADLDAVARQLGREPRGVLAIAYRCPNGEPGVVKTAPKLPDGTPFPT
LYYLTHPALTAAASRLETTGLMREMTERLSQDAELAAAYRRAHESYLAER
DAIEPLGTTFSAGGMPDRVKCLHVLIAHSLAKGPGVNPFGDQALSILAAD
PALAGVLQEGRW
>MAP0332 hypothetical protein
MPNTYRVVQWNTGNVGKSSLKSIVTNPTLELVGCYAWSAEKVGRDAGELV
GIPPLGVAATNDVDELLALKPDCVVYNPMWIDVDELVRILSAGVNVVTTA
SFITGGNLGDGRDRLLQACQQGGATIFGSGVSPGFAELLAIVSAMVCNRI
DKVTVNEAADTTFYDSPETEKPVGFGQPIDHPDLPAMAAKGTAIFGEAVR
LVGDALGVELDDVRCVAEFAQTTEDLVMASWTIPAGHVAGTYISWQGIVG
DQVLIDLNVRWRKGQTLDPDWKIEQDGWVIQIDGQPTVTTKVGFLPPPYF
EATTIEEFMDLGHIMTAMPAINAIPAVVAAAPGIASYADLPLTLPRGNAH
VAGQP
>MAP1337c hypothetical protein
MQIWLTGAPLLGGSVQGARHAVELFAAFGLTALIGLERTIQGKSAGLRTQ
TIVGTSSALIMLVSKYGFSDVLSAGSIVLDPSRVAAQVVSGIGFLGAGII
ITRRGAVHGLTTAAAVWESAAIGLATGAGLLLLAIAVVGLHFVSALAFNA
VERQLNARLRGTVRLRIIYANGRGVLRELLRLSGQRDWQLTELDADPHDI
DDGEVAVSMTLSGAKIADAAHVFAEVDGVAAVLSADDATD
>MAP2165 hypothetical protein
MSRLPGVSDRDAGLGARIAFFFTRRKLAQMTGLETAGMLEPLRMYAHIPR
LLNAYGRLEQAESRLDILSPRHRALAELKAATTVRCEYCIDLGSQIARRW
GITDEELLAMADYRDAACFSDVDKLILEYATAISRTPVEVSDELFDALRA
HFDTAQLVGLTHVITLGNLRARFNIALGIGSSGFSGNRVCALPDTPRP
>MAP3452c hypothetical protein
MSEPAEAGFLDRLRARHGWLDHLIRAYQRFDDRNGGFFAAGLTYYTIFAL
FPLLMVGFAVFGFVLARRPQLLSSIDDHIRSQVNGALGNQLQELMNSAID
ARTSVGVIGLATAVWAGLGWISHLRQALTEMWWDTQIESPGFVRNKLSDL
LAMVGTFAVIVATVGLTTVGHAAPMAALLKWLGIPQFVVFDWLFWIFSIL
IATLVSWLLFTWMIARLPREKVSFVDSMRAGLIAAIGFEVFKQVASIYLR
TVLRSPAGAAFGPVLGLMVFAYITAYLVLFCTAWAATASTDPRDRPVAPP
APAIIAPRVHLDEGLNTRQTLTAMGLGAVAALAFDRLTRRRR
>MAP1845c hypothetical protein
MHRPPVVHPRRRIPVTHALIVLLALLIGVVAGLRSLTAPAVVAWAALIGW
VDLHGTWASWMANIITVIVFTVLAVGELVNDKLPKTPARTAAPIFAARIV
LGGLAGAALGAWPHWTFTALGAGVIGAVLGTLGGYHVRRRLVAATGGKDL
PIALLEDVVAIAGGFAILAATGHVLTDYLLTAVK
>MAP3063 hypothetical protein
MNSSHDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLRV
LDTLAGEDRRGLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAMEAAS
VRSAPRSKSAGYQACTPEALRALGIRESAEAERALDDFATRWRHGGSPLL
RRLLDAGTVELLGGPLAHPFQPLLAPRLREFALREGLADAALRLGARKGR
GPGGIWAPECAYAPGLEHDYAAAGVTHFMVDGPSLHGDTALGRPVGDTDV
VAFGRDLQVSYRVWSPKSGYPGHPAYRDFHTYDHLTGLKPARVTGRNVPS
ESKAPYDPQRADHAVDLHVADFVDVVRNRLTSESERIGRPAHVVAAFDTE
LFGHWWYEGPTWLARVLRALPEAGVRVGTLHDAIAGGFVGDPVDLPPSSW
GSGKDWQVWAGDQVADLVQLNSEVVDTALSTVDKALSQAGSQPAALDGPV
PRDRVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATREIAG
ALASGRRDTAQRLADGWNRADGLFGALDARRLPR
>MAP0609 hypothetical protein
MTAAVTPKGERRRYALVSAAAELLAEGGFEAVRHRAVARRAGLPLASTTY
YFSSLDDLIARAVEHIAMIEVAQLRSRVSALSRRRRGPETIAEVLADLLV
GDVSGPGLTEQLISRYERHIACTRLPALRETMRRSLRQRAEAVAEAIERS
GRSVHIDLVCTLICAVDGSVVSALVEGRDPRAAAQGAVVDLIEVLAPIDQ
RPVQI
>MAP1547c hypothetical protein
MAEDVRWFAGSPLAAAFGRLALDQVAHREIAAAVDRSGRFADNFTDRGIR
SAAFTALAAFGDSSDVEASRADLKRLHRDVRGTGKGAFSDTRYSALDPEL
WTWVAVSGLNLLYQAYLRVCGHRLSTDEKEVVYQTLRRELQFLELPSKQG
KLPATLDEMLDYYDTVAAKHLADNEFLQFASRSFVAPPVPGLLPRQLRPV
LRLVWPVLTSLAARPVVVCSAAVAHPTMRRLLGVRWGAREQAEFAVYVAA
LQLGWRWLPRRLTLEPLAYNRYQYERLRDRYRSVLLDSFAAPGRG
>MAP3412 hypothetical protein
MSRPKWLPTWRLRAMFAAGLSAMYAAEVPGYATLVQVCTRVNADHVARHP
DAQRWGSLRRVTAERHGAIRVGSPAELRAVADLFAAFGMAPVGYYDLRRA
ASPIPVVCTAFRPVDGNELSLNPFRMFTSMLATRDRRYFDRDLRARVDNF
LAHRQLFDPALLARARSIAAAGGCADDAARPFVAAAVASFALSGKPIERS
WYDELSRVSAVAADIAGVCSTHLNHLTPRVLDIDELYRRMTGRGIAMIDA
IQGPPATGGPAVLLRQTSFRALAEPRRFRGDDGRITEGTVRVRFGEVEAR
GVALTPRGRRRYDAAMAADDPAAVWANYFPATDAQMAAEGLGYYCGGDPS
APIVYEDFLPASAAGIFRSNLDAETPGAAQAAPAVDDSGYDQDWLAGALG
REIHDPYPLYEALAQEAPR
>MAP0745c hypothetical protein
MRPGSPPPRTGTVRLTLDCRPADPGHGVRLRSGNRSLSEGPQRRRSALDR
ERYERGLRIRSDVLGEQYVNRALADADEFTRPLQDLVTEYCWGAVWGREE
LPRKTRSMLNLAMIAVLNRPNELRMHIKAALTNGVTREEIREVFLQVAIY
AGVPAAVDSFRIAREAFAELDRENS
>MAP1145c hypothetical protein
MAVESSLAWLLRSLGVDAFLNEIWATRHHHIDRCRPGYFDGLLPGPSAVD
GLLEQVRPDPAAVRLVKDGEDRDPAGYRRGDGTLNAGGARDGLADGYTLV
LNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPHSTGFVPHYDPHDVL
VLQIEGCKTWRVSDEPPVPPQQIQSRKGVGADGPASWTDVCLRPGDVLYL
PRGQVHSARTHSEPSVHLTVGLHAPTVLTLVTSALHALSLRDPRVHDRLP
PRHLDDAQVRAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVRRGRCPPV
GSVRDTVGVDGHTLVRKHQPLYARVMRAGDGVVLQFAQLSVSAGSDHEAA
MLFLAGRAEPFRVAELPGLSAAQQVGLAQTLILNGFLARLSDD
>MAP3784 hypothetical protein
MSQIMYNYPAMLSHAADMSGYAGTMQGLGADIASEQATLSNAWQGDTGMT
YQVWQAQWNEAMESLVRAYQSMASTHEANTMSMLARDQAEAAKWGG
>MAP2823 hypothetical protein
MSDQVAKPSRHHIWRITLRTLSKSWDDSIFSESAQAGFWSALSLPPLLLG
LLGTLAYVAPLFGPDMLPSILRQLISTAHSVFSPSVVNEIIEPTVRDIAN
NARGEVVSLGFLISLWAGSSAISAFVDSVVEAHDQTPLRHPVRQRFFALF
LYVVMLVGVVATAPLVVVGPRKVGEHIPLGLSNVLRYGYYPALIVGLTIA
VMILYRVALPEPLPSHRLILGAMLATTVFVTATLGLRFYLRWITSTGYTY
GALSTPIAFLLFAFFGGFAIMLGAELNAAVQEEFPAPKTHAHRLRTWLFS
RSPALERTVQTLASPIANPATQRREVAPAEPRG
>MAP3536 hypothetical protein
MTSKLSTTLHTSFPDAVDRITKALADQGFGVLTTIDVKATLKQKLGADME
DYLILGACNPALAHRALGIDRQIGQLLPCNVVVRSDPAEGDAVLVEAMDP
QLMVKVTGEAGALQEVADQATAKLQAAISALAG
>MAP1016c hypothetical protein
MSEPGSATDTDTGDEPAQQQTAPDGTRTGAQTAVAEHPAPKAPAPGTQQP
EKPWWVRHYTFTGTTVGLVFIWFSLTPSLLPRGAIFQGLVSGISGAIGYG
LGVFAVWLYRYLWAKPSSPPPPRWAWKILIPVGAVGMVLMAIWFHVWQDK
VRDLMGVAHLKWYDYPVAGVLSLVVLFTCVEIGQFTRWLVSFLVGRLDRI
APFRLSATIVVALLVVLTITLLNGVVLKFAMRTMNNTFASANNEMSPDTA
PPKTPLRSGGPESLVSWESLGHQGRVFIEGGPRVEQLTAFNGAPATEPIR
AYAGLNSADGITATAELATRELQRTGGLRRAVVAVATTTGTGWINEAEAT
ALEYMYNGNTAIVSMQYSFLPSWLSFLVDKENARHAGQALFEAVDRLVRQ
LPEAQRPKLVVFGESLGSFGGEAPFMSLNNVLARTDGALFSGPTFNNTIW
TDLTSTRDAGSPEWLPIYNDGRNVRFVARAANLARPKDPWDHPRVVYLQH
ASDPIAWWTTDLLFARPDWLKERRGYDVLPETTWIPVVTFLQVSADMAVA
QNVPDGHGHHYVADVADAWASVLSPPGWTPDKTDRLRPLLHANG
>MAP4069c hypothetical protein
MSEPVAPRVAVYLDFDNIVISRYDQIHGRNSFQRDKAKGLEQFTERLEQA
TVDVGAILDFASSFGTLVLTRAYADWSAEINAGYRGQLVARAVDLVQLFP
AAAYGKNGADIRLAVDAVEDMFRLPDLTHVVIVAGDSDYIPLAQRCKRLG
RYVVGIGVAGASSRALAAACDEFVIYDALPGVTALDRTPAPAETVPAKPR
TRRGKAAQPEEPDPQAAATALLTRALQIGFEKDDVDWLHNSAVKAQMKRM
DPSFSEKSLGFRSFSDFLRSRSDVVELDETSTTRMVRLRPHE
>MAP2297c hypothetical protein
MTVAPPAGTASTEAVPQHRTLVWPVLAGVAVLAGCTAAGIGTLSLASALT
ATGLPDPGPVTTVGLPFLRAAGEIAAVLAVGSFLFAAFLVPPQPSGVLDA
DGYRALRLGTVASGVWAVCAALLVPLTLSDVSGHPVADLRPAQMWSLAGL
ITTASAWRWTAILAAAITLASLPVLRWSWAPVLLAASLTTLIPLGLTGHS
SAGGSHDLATNGLLIHLVAAALWAGGLLALLAYALRGGQGGDHLGLATRR
FSAIALWCWVAMALSGLVNAAVRVQPSDLLATGYGRLVLAKAAALCLLGG
VGWRQRRVNVAALQAVSTLARARRALLRLTLIEAALFGLTFGIAVGLGRT
APPSPPARLPSIAEAEIGYDFDGPPTLTRILFDWRFDLIFGTAAIVLAGL
YVAGVVRLRRRGDRWPPGRTSSWLLGCLVLLFVTSSGVGRYMPAMFSMHM
VVHMCLSMLVPILLALGAPVTLALRALPAAGRGDPPGPREWLLAALHSRF
SRLLTNPVVATVLFVAGFYGLYLSNLFDTTASSHAGHLLMNLHFLLSGYL
FYWVVIGVDPTPRPIPPLAKLAVVFASLPLHAFFGVVLMGTRKVLGADYY
RSLGFSWHTDLLGDQRLGGGIAWSAGEFPLVIVMLALLVQWARSDRRTAK
RLDRAADRDDDAELAAYNAMLAQLAGRDKPGEGSATGQA
>MAP2631 hypothetical protein
MVVTGPGWWNRGMTEYAAFLRGVNVGGVNLKMADVKKALTDAGFTAVRTV
LASGNVLLQSKAGAAAVRAKAEAALRERFGYDAWVLVYDLDTVRRVVDGY
PFEREVDGYQSYVTFVADAAVLDELAALPAGPDERIRRGPDPLGVLYWQV
PKGSTLDSTIGKTMGKPRYKSSTTTRNLRTLVKVLS
>MAP3817c hypothetical protein
MTSEDHTAVATDTLTRSPSQTDARWQVLRASCWVLLVLTLLAAVVMAVRP
SLAAQAGAAQMGFLILFVVVHAWLGYSARGMAAFVAIAGIVAFALEAIGV
ATGFPFGSYTHHLPGPKPLGVPPVAIASWIIFGWLAWALARVIMRPVPGV
AVGGAERFTTPIVATLILGGYDLVYDPIGATAHDWYSYDHPTGALGVPLS
NFLGWLLTGWLIFQLVALVEPRFPGSPVTRTRTYWLLPCLFWLSTTTQIF
TSLIHPPDGFAVRGGKTIQLADVYESGASAALFTIVLTGIMALVRLYRRQ
ASKTLRSSADEA
>MAP3849 hypothetical protein
MSEHIRTPASLARRPTVAVLGAGIAGLTAAHELAERGFVVTVYEAQQDER
NGLGSEPAGAYPPVKLGGLAASQYSTVGTHDGSHAELRPFPGRPSRPTDL
GRAVAGEHGFRFFPAYYLHIWDLFQRIPVYELRNPGGWAPTARTVYDNVR
RVITQGVTIDGKPSLVFPRELPRSGAEFLGILGQLATVGLTFGDVATFQG
RMLRYLVTSPLRRARELQNMSAYDFFVGHDRRTGLNQYSYTPRCDALLRQ
MPKVLVAFDSRWGDARTNISTYLQLYLNMDRRDDKADGVLNGPTTESWFD
HWYRHLVALGVRFVRAAATRLDPVPVDPSRPPHLRPRVQITLTDGTRVTP
DYIVVAVDAPGAEFLTAPMRAAGTGGTVAGLDGFATSVPPPDGPLQPQST
RPRQRRDPYSMDELGRVPWDRFQTLCGIQYYFDTEFQLLRGHLLYAGTEW
ALSSINQTGMWERPPILDRDGHVSVLSVDIGDFNAASSHLVDESGRGKAA
RDCTPDEIATEVWRQIVAALTSSTGPAGEEFMPRPVWYALDRGLIMESGP
GQGRGRAVRNATPYLVPIVGDWDNRPSSDPWNPHGTSWSSVPPEDWWLED
LWRRNVWQARHGGYQVHNNAVVFAGTWNKTFTRLTSMEAACESGRHAVNA
ILDHYIWVQSGGLDRREKTTLSWEFPFGFLDQGLSSPIRMPTPAGDYCYV
FDIENREPADTRALRILDSRFCEWSLPHPLDVGAPTSFIPPPAGGQEMFG
PTTDYNQQLLAYLQAWRELLERWTAMSAATPSPAAPFTPPAAPFTPPFMP
PAAAPAPMPPAPADYTQQLFGYLQAWRQYLEQMAGASSGSAQQPTAPPTA
PPTMPPTMPPPPPPPSTPSTGGQAFGAAPPGQPGTSGDPTPGGSTPVSAT
AKGSTLTWPPPLLGLEPSSYVGTQDPPVSRFLPPNLVNRAEDRQEVLLRP
NDEYGYLHDLFSLNPGLARAISSTGPAPVASGSSFLGAMNRVGPDVSARV
APRSLFSSRAARAPSAGAITPGSRTG
>MAP3395 hypothetical protein
MSFAEATIARLPRLLQPYLLRHHELIKFAIVGGTTFVIDSAIFYTLKLTI
LEPKPVTAKVIAGIVAVIASYVLNREWSFRNRGGRERHHEALLFFAFSGV
GVLLSMAPLWFSSYVLQLRQPTVSLTVENVADFISAYIIGNLLQMAFRFW
AFRRWVFPDQFARDPDKALESALTAGGIAEIFEDAFEDDGGNVTLLRAWR
NRAGRLTQLGDSSEPRVSKTS
>MAP2613c hypothetical protein
MAETPRLLFVHAHPDDESLGTGATIAHYTAAGADVRVVTCTLGEEGEVIG
ERWAELAVDRADQLGGYRIGELTAALRELGVGEPCYLGGAGRWRDSGMPG
TPRRRRQRFIDADEREAVGALVAIIREQRPHVVVGYDPAGGYGHPDHVHV
HTVTTAAVAAAGAGNFPGEPWAVPKFYWSVFATRPFEAAVQALTPEDLRP
GWSMPSAEQFTFGYADEHIDAVVAAGPHAWAAKRAALAAHATQVVVGPTG
RACALSNNVALPILDEEHYVLVAGAAGARDERGWETDLLAGLEFGAAPRR
>MAP2367c hypothetical protein
MLAGLAFGSEIKRFRRSRLTRAAIVVLMLLPLVYGALYLWAFWDPFGHTN
KMPVALVNSDRGAVVSGQQFNAGAEIAKSLTADGSLDWHVVDLPEARNGV
DHGKYYFMVELPPDFSAAIASPVTGQPKKANLIAVYNDANNYISTSIGRT
AIEQVLNAVSSRISGQAVNQMLSVVVSSGSGIKQAADGAARLDDGAGQLA
AGLDSARTGSATLAAGATQLSDGINQATDPLLAVTKAVSQIGGSTEQLQQ
GATALAQANDQLGAIAAAQDAAASSLTSVIDQLSARADPLANNLRGIQDQ
LRGHQFTPQIRQQLTDAQNAAIAMTSGLRTPGSPLASALDQLGGKGQELT
NKLTQLRDGAQQVATGNAELAGGIAKLDDGAGQLKAGSAELATKLAEGAR
QVPNWTTQQKEAVADTIGGPVQLEASHENAAPNFGTGMAPFFLTLALFFG
ALVLWMVLRPLQYRAIAAEVIAIRVVLASYLPAAAIALFQAVVLYCVVRF
ALGMHAVHPVAMLGFMVLISGAFVAATQAINALVGPAVGRVLIMALLMLQ
LVSAGGMYPVETTSRPFQVLHRFDPMTYGVNGLRQLILGGIDARLWQAII
VLAAITAVALAISCLSARRDRLWNLSRLFPAIKM
>MAP4243 hypothetical protein
MATSNTVSTDFDLMRSVAGTTDARNEEIRAMLQAFIGRMSGVPPAAWGGL
AAARFKEMIDRWNAESVRLYHALHAIADTIRHNAATLQDAGQNHADHIAA
AGGSL
>MAP3966 hypothetical protein
MSSHQSAGKIDDGVVAHRASGRTSFAESFAGADPQADAQRRVALRRMKAV
ALSFLIGATGLFLACRWAQAQPGVGAWVGYVGAAAEAGMVGALADWFAVT
ALFRHPLGIPIPHTAIIKRKKDQLGEGLGTFVRENFLSPEVVETKLRDAQ
VSGRLGKWLSEAAHAERVAGETATVLRVLVELLRDEDVQQVIDRMIVRRI
AEPQWGPPVGRVLATLLAENRQEALIQLLADRAFQWSLNAGEVIQRVVER
DQRVVERDSPTWSPRFIDHLVGDRIHRELMDFTDKVRRNPDHELRRSATR
FLFEFADDLQHDPDTIARADAIKEQLMARDEIANAAATAWKTLKRLVLEG
VDDPSSALRSRIADTVVRIGESLRDDADLRDKVDNWLVRAAQHLVSQYGV
EITAIITDTIERWDAEEASRRIELHVGRDLQFIRINGTVVGSLAGLVIYA
VAQLLF
>MAP0437 hypothetical protein
MTPAPDLTLGTAVDWRFAATVGRRLARPGPPSTDYTRRQVIDELAGAATK
AEPLVRQVTGLLTDGADDGRVPAARIVDRPEWIAAAAESMRVMMNGTERP
RGFLTGRVTGAQTGAVLAYVSSGILGQYDPFATDRGAGAGCLLLVYPNVI
AVERQLRIEPSDFRLWVCLHEVTHRVQFTANPWLPGYMSQALALLTRDTG
DDLGQVLSRLADYARNRGNPASQADANSTGILGLVRAVQSEPQRQALDQL
LVLGTLLEGHADHVMDAVGPMAVPSVATIRRRFDERRHRKQPPLQRLLRA
LLGLDAKLSQYTRGKAFVDQVVGRVGMARFNAIWTGPETLPLPVEIEEPQ
RWIDRVL
>MAP1786c hypothetical protein
MRRVIQFSTGNVGRHSLRAIIGRPDLELVGVHAAGPEKIGRDAAQLCGLD
EPTGIIATDDIDALVALNADCVVYTSQGETRPMDAIEQMSRFLAAGTNVV
GTSMVWLVAPRHAEEWLREPLQRACEAGNTSLYVNGIDPGFSGDTLVHAA
VSLATRVSSITVQEIFDYGNYDDAEFTGAAMGFGTAPDDDSPMMFLPGVI
VAMWGGQVRSLADHLGVELNDVRQRIERWFTPERIDCTMMTVQPGRMAAV
RFATEGVRDGEPVITLEHVTRLTRAAAPDWEYPPDGHTGVHRVVVAGEPR
VEINTHVSHPLFDSTDAGCISTAARVVNAIDWVCRAPQGLIGVEDIPLAA
TMRGVLWQDR
>MAP0434 hypothetical protein
MRTFGVSLLVTAAALVLGYAYGGPKSLYLLLVLAALEVSLSFDNAVINAA
VLKQMSRFWQRMFLTIGILVAVFGMRLLFPLLIVWATAGLDPVRALELAL
RPPPHGALEFPDGSPSYQKLLTAAHPQIAAFGGIFLLMLFLDFLLIDRDI
KWLKWIEVPFARIGRLGQVSVVLSGLTLVLVGTGLTHSSQEAVTVLTAGL
LGMVAYLVVNGLSRAFRPSGVGQPQPAGPATGWAGLSLFLYLEVLDAAFS
FDGVTGAFAITSDPVVIVLGLGAVGSMFVRSITIYLVHQETLDRYVYLEH
GAHWAIGALAVIMLASIEPRLTVPEPVTASVGVFFIGTAVGFSVLRRRRE
SGRASAPGP
>MAP1890c hypothetical protein
MVLFFQILGFALFIFWLLLIARFVVEFIRSFSRDWHPTGITVVVLEIIMS
ITDPPVKLLRRLIPQLTIGAVRFDLSIMVLLLVAFIGMQLAFGAAA
>MAP3199 hypothetical protein
MTTLETLLHDPEMAGVWNLVPDRSAITFRIKNMWGLLTVRGRFTDFTGDG
QLTGKGAVFGRVDIRAASLDTGIGRRDQHLRSPDFFDVERFEKISVVVTG
LQPTKGKIADLRTDFTVKGVTAQLPLPVTILELDDGSIRITGETTLDRAR
FDLGWNRFGMIGRTATAAADVIFVRDSQ
>MAP0826c hypothetical protein
MAINIEPALSPHLVVDNAAAAIDFYVKAFGAEELGRLPRPDGKLAHAAVR
INGFMVMLNDDFPEVCGGKSMTPTSLGGTPVTIHLTVPDVDASFQRAVDA
GATVVVPLEDQFWGDRYGMVADPFGHHWSLGQPVREVSPEEMAAAMAAMA
AEQGAGQA
>MAP3004c hypothetical protein
MTGAVATEATGLTPDSPAGPAVPALSAWSVLVAGVIGLVASVTLTLEKID
ILLDPAYVPSCNINPILSCGSVMITPQASLLGFPNPLLGLVAFTVVVVTG
LLAVTKVVLPQWYWIGLAAGLVVGAVFVHWLIFQSLYRIGALCPYCMVVW
VVTIALLVVVASIAYRPALGDRRSGPGWLLFQWRWSIVALWFTAVFLLIM
VRFWDYWSTLL
>MAP4150c hypothetical protein
MSVEHLVHMLRAQGSFCASSGSPMYGDLFELVASDVEAGGVFADILSGHR
DAPSRDAIPLRLLGGLHRLVLDGRAGSLRRWYPSTGGSWDAGAAWPPILA
AAAEHAAALRAALDRPPQTNEVGRSAALIGGLLHINESCLPVRLFEIGSS
AGLNLRADHYRYRYAGGGWGPADSPVCIDDAWRGALPPARGVRIVERHGF
DIAPVDVGNPDGELTVLSYVWPDQAARLARLRGAIEVARRVPATLERRTA
ADAVGRLSLAEGALTVLWHSITWQYLSAGERAAVCAGVDALGARAGASAP
LVHLTMEPARDGPGAPIRFLVRARGWPDGGPRVLAQCHPHGPPVDWL
>MAP0922c hypothetical protein
MPTYSYQCTECDDRFDIVQAFTDDALTTCKHCSGRLRKRFNSVGVVFKGS
GFYRTDSRESGKKPAGSTNGSSSSDSSSSTSDSSGSSAKSGSGEKAVSSP
APAAAAAG
>MAP4063c hypothetical protein
MADSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTTIAWKGDEAIE
LTSSTGERVKAAVDVFKEKLIRRDISMKAFDAGEPQASGKTYKVNGTLKQ
GISSESAKKITKLIRDEGPKGVKTQIQGDEIRVSSKKRDDLQAVIAMLKQ
ADLDVALQFVNYR
>MAP1508 hypothetical protein
MATRFMTDPHEMRAMAGRFEVHAQTVEDEARKMWASSMNIAGAGWSGQAQ
ATSYDTMGQVNQAFRNIVNMLHGVRDGLIRDANNYEQQEQASQQILSS
>MAP1944c hypothetical protein
MRHVGEAMTVQNESNAKTHGVILTEAAATKAKSLLDQEGRDDLALRIAVQ
PGGCAGLRYNLFFDDRSLDGDLTAEFGGVTLTVDRMSAPYVEGASIDFVD
TIEKQGFTIDNPNATGSCACGDSFN
>MAP3862c hypothetical protein
MSTAVTALPDILDPMYWLGADGVFGSAVLPGILVIVFIETGLLFPLLPGE
SLLFTGGLLAAHPNPPASIWVLAPAVAIVAVLGDQTGYFIGRRIGPALFK
KEDSRFFKKHYVTESHAFFEKYGSWAVILARFAPFVRTFVPVIAGVSYMR
YPVFLGFDIVGGIAWGGGATLAGYFLGNVPFVHHNLEKIILGILVVSLMP
AFVAAWRGYRGRRRPTRTPERESERLPASE
>MAP1243 hypothetical protein
MPTMRLPKLPLDEAKAAADEAAVPNYMAELSIFQVLLNHPPLARAINDLL
ATMLWHGALDSRLRELVIMRIGWLTGCDYEWTQHWRVASRLGVPGDDLLG
VRDWRNYPGFGPTERAVLAATDDVVRHGSVSAASWAQCEHEFNGDTTVLV
ELVTVIGAWRMVASILQSLEVPLEDGVASWPPDGREPAS
>MAP0349 hypothetical protein
MKNSGRSTMSSPSKLRVIQWATGGVGKAAIQCVLNHPRLELAGCWVHSAE
KNGVDVGRIIGTQDLGVTASTSVDEVLALDADCVVYSPLIPNDDEVIAIL
RSGKNVVTPVGWVYPDPGNPRHRAVADAALESGVTLHGSGIHPGGITERF
PLMVSSLSSAVTHVRAEEFSDIRTYNAPDVVRHIMGFGGTPEEATGGPMA
GLLDGGFKQSVRMIADHMGFRIDPNIRTIQDVAVATADIDYDPFPITAGT
VAARRFRWQALVDGEPVITAAVNWLMGEQNLDPAWDFGGRGERFEVEITG
DPDVSLTFKGLQPETIAEGLVKNPGVVVTANHCVNAIPDVCAAAPGIKTY
LDLPLFAGRPAPGLATS
>MAP3805c hypothetical protein
MYPIDASGSIDRSYVRIPGCPSPPGGPTGVWQHGGVSSPTRPAVPAALDP
WIVAALAAAVSLAGAARPSFWYDEAATISASYSRSLTQMWHMLGNVDAVH
GLYYLLMHGWFRLVPPTEFWSRAPSGLAVGAAAAGVVVLGRQFSSRTVAV
VSGVFCAVLPRTTWAGVEARPYALSMMAAVWLSVLLVFAARRQSRWPWLC
FGLALVCSVLLDAYIALLLAAYAVFVGVCCRTRTVLWRFGISSAVAVGVL
LPFLLTVAGQAHQISWVASIGHRTVEDVVMQQYFERSPPFAVLSALLICA
AIALWLSRSAPPGPSERQLLVLATCWVGIPTAAIVAYSALVHPIYTPRYL
CFTAPAMALILGVCSAAIAAKPWVTTAVVGVFAIAAVPNYVRAQRNPYAK
YGMDYSQVADLITAKAAPGDCLLVNDTVTFMPAPMRPLLAARPDAYRKLI
DLTLWQRAVDRNDVFDTNLIPEVVAGPLSHCAVLWIITQADPSEPAHQQG
PALPPGPVYGATPAFAVPHDLGFRLVERWQFNLVQVFEATK
>MAP4036 hypothetical protein
MIPVTLLVVAKAPEPGRAKTRLAAAVGERAAAEIAAAALLDTLDAVAAAP
VAARVVALTGDLDGAARGAEIRARLASFTVIAQRGNDFGARLANAHADAA
DGLPVLQIGMDTPQVTAELLAGCARRLLDAPAVLGLARDGGWWVLGVAAP
VLADCLRAVPMSAADTGELTLKALRDNGIEVATVQTLVDVDVVGDVAVVR
DACPPASRFARATRAAGL
>MAP4115c hypothetical protein
MISRERERKPDDAAAAMAAWESFHAKAGPAIKAGDALAPAAAAAVITGGP
DAPVVTDGPFAETAEVACGYYIFEAENLDEALALARDVPVAAFGAVELWP
VVHAVEPSRRITGNDWLALLLEPAESAHTPGTPEWEAVAAKHAELHAAAG
DHIIGGAALHDKSTATTVRVRDGEVLITDGPYVESAEIATGIYLLGAADR
DEAVKIASMIPASTVQLRQLAGVSGL
>MAP1129 hypothetical protein
MEVPYAPRLVAGAWVVAGWVALAYGIYLTVLALRSPPGVELTGHWVLQPA
FKASMALLLTLAAAGHGQVRERRWLMPALLLSAVGDWVLALPWWTLSFLV
GLAAFLFAPMCFIGVLLPLVPLPSGASGPGRPSTPRIAAAVLMCLASIGL
LVWFWPRLGPDKLTLPVTLYIVVLTAMVCAALLAKLPTIWTAVGAVCFAA
SDSMIAIGRFILGNEALAVPIWWAYAAAQILITAGFFFGREAAGDAAGEA
TGDATGDAAEPVE
>MAP0610c hypothetical protein
MYFAGVDLAWAGRNPTGVAVIDSAGELVSVGAVHDDDEILAALDPYVRGE
CLVAFDAPLVVNNPTGQRPAETALNRDFRRFQAGTHPCNTGKPEFADGPR
AARLAAALGLALDPRSPRPRRAIEVYPHAATVALFGLQQTLKYKAKPGRS
LERLKSELLLLMAGVERLAHAPVPVRVGRHAGWRALRRAVECAQRKSELR
RAEDPVDAVVCAYVALFAQRRPDAVTSYGDPGTGCIVTPSLPAARRQSLR
SAPAADRSGPAPR
>MAP2272c hypothetical protein
MTATPREFDIVLYGATGFVGKLTAEYLAGAAPDKRVALAGRSTEKLRAVR
DSLGDAAQSWPVLQADASSPATLNEMAARTQVVITTVGPYTRYGLPLVAA
CAAAGTDYADLTGEAMFVRDSIDSYHKQAADTGARIVHACGFDSVPSDLS
VYALYRAARDDGAGELVDTDLVVRSFSGGVSGGTVASMLEVLDTASRDPE
ARRQLADPYTLSSDRGAEPDVGPQPDLPWRRGRQIAPELAGVWTAGFVMA
PYNTRIVRRSNALLDWAYGRSLRYSESMSLGSSPLAPVASAVVGGTAAAT
FGLGSRYFRFLPRRLVERIVPKPGTGPSPAARERGYYRIETYTTTTSGAR
YVARMEQRGDPGYKATSVLLGESGLALAFDRDKLPQLYGVLTPAAAMGDA
LLDRFPGAGVFLQVDRLAG
>MAP2061 hypothetical protein
MTTRSAAARDATELAAYRVAAMLLGVGTLHFLAPKPFDSIIPAELPGSPR
LYTYGSGVAELTVGALLVPRRTRRAAALAAAILFIGVFPGNLNMVRLWWD
KPWPMRIAALARLPLQIPMITTALRIRRNS
>MAP3632 hypothetical protein
MTNEHGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDVLTQISAVNS
ALRSVALNLLDEHLGHCVTRAVAGGGDDADEKLAEASAAIARLVRS
>MAP1067c hypothetical protein
MATWYDVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARTG
AEPAPGNVLGVRVADEGVKFALIDDAPQTFFTTPHFDGYPAVLVNLDAIS
VRDLEELITEAWLTQAPRKLVQEFLADSR
>MAP0317c hypothetical protein
MQPGPGGDMSALLAQAQQMQQKLMEAQQQLANAEVHGQAGGGLVKVVVKG
SGEVVAVKIDPSVVDPSDVETLQDLVVGAMADASKQVTRMAQERLGSLAG
GFGAPPGQPPAPAPGV
>MAP1224c hypothetical protein
MEGVTGSATSKIAETLRDLGCAIGAAARGVSRSRIAWTVAGITALVVLAS
LIPLPSPVQMRDWAQSVGPWFPLAFLLAHIVVTVVPVPRTAFTLAAGLLF
GPLLGVAIAVAASTASAMIAMLLVRAAGWRLTRLVRHRSMDTVEERLRQR
GWLAIVSLRLIPAVPFSALNYAAGASSVRVLPYGLATLAGLLPGTAAVVI
LGDALAGHPSSLLYLVSALTSALGLTGLVIEIRHFRRHHRRAHRHRDDEP
SPEPATIG
>MAP1640c hypothetical protein
MTDTSRFATIRVDQFVAAPPAKVWRMLTEPELMKLWWAEGQVAAVVGHRF
TLDMPGYGKQPCQVLEVDPPRRFVYTFTAAWTLTWRLEAEGEGTRVFLEH
SGFDLDDARMARAFEQMGPGWRDTVLPRLARVAVD
>MAP2721 hypothetical protein
MDLRALEELPLTYSEVGATASGDLPAGYHHQHVERQIGTGAERFEQAAGA
VLRWGMQRGSGLRVQASSEVAVVDAVVVVRLGFVPAPCRVVYVVDEPDIR
GFGYGTLPGHPESGEERFVVRHDPATSAVYAEVTAFSRPATWWARAGGPV
LRVGQRVIARRYLRAV
>MAP2240c hypothetical protein
MAVVVVTDSSARLPADLLGQWGIRVVPLHILLDGTDLRDGVDEVPADIHQ
RAATTAAATPAELADAYQQALADSGGDGVVAVHISSALSGTCRAAERTAA
DLDPNLRVVDSKSAAMGTGFVALAAARAAAAGAQLDAVADAARAAVSRGH
GFIVVHRLDNLRRSGRIGGPAAWLGTALALKPLLRIDDGKLVLAQRVRTA
GHATEAMIDRVCQVIGDGAAALAVHHVDNPGGAAEVAAALGERLPACDPA
IITELGPVLALHVGAGAVAVVVQQPGS
>MAP3953 hypothetical protein
MEPRPPSGVVITAAAAEVLARLQRQHGPVMFHQSGGCCAGSSPMCYPVGD
FLVGDRDVLLGVLDVGADGVPVWISGPQYAAHYRDKHTQLVIDVVPGRGG
GFSLEAPDGVRFLSRGRVFSAAEQAAVAAAPIITGADYQRGERPAARGAV
VADAACRTCP
>MAP0065 hypothetical protein
MTGAAEPSATISPGPLAADRRSADNRDCPSRNDFLGAAFAEVIGGPVGRH
ALIGRARIMTPLRVMFLIALVFLALGWSTKAACLQSTGTGTGDQRVANWD
NQRAYYELCYSDTVPLYGAELLSQGKFPYKSSWIETDSTGAQQIRYDGQP
AVRYMEYPVLTGMYQYVSMALAKTYTALSKLAPLPVVAEVVMFFNIAAFG
LALAWLATVWASAGLAGRRIWDAALVAGSPLLIFQIFTNFDALATAFAMA
GLLAWARRRPVLAGVLVGLGAAAKLYPLLFLGPMLLLGIRTGRLGAWART
AVAAVATWLLVNLPVLVFFPRGWSEFFRLNTRRAEDMDSLYNVVKSFTGW
RGFDPKLGFWQPPTVLNTVVAVLFVTCCAAIAFIALTAPRRPRATQLVFL
VVAAFLLTNKVWSPQFSLWLVPLAVLALPHRRVLLAWMTVDALVWVPRMY
YLYGNPNRSLPEQVFTTTVLLRDIAVITLCALVIRQIYRPDEDLVRWGGR
VDDPAGGPFDRAPDAPPGWLPDWLRPAGSRRGQAAEPEPALAGAT
>MAP0974 hypothetical protein
MTKLHQNPSPMLRLVVGGLLVVLAFAGGYAVSVCKTVTLTVDGVAMRVTT
MKSKVIDVVEENGFVVDDRDDLYPAADVAVHDAGKIVLRRSRPLQISLDG
HDSKQVWTTASTVDEALAQLAMTDTAPAAANRGSRVPLAGMALPVVSAKT
VRINDGGAVRTVHLAASNVAGLLSAAGAPLAESDQVAPPASAPIVDGMEI
QVTRNRIQQVTERMPLPPNARRIEDPDMNMSRQVVEDPGSPGTQDVTFSV
ATVNGVETGRLPIANTVITPARESVVRVGTKPGTEVPPISDGSIWDAIAG
CEAGGNWAINTGNGYYGGVQFDQGTWERNGGLRFAPRADLATREEQITVA
EVTRERQGWGAWPVCSGRAGAR
>MAP1906c hypothetical protein
MFLGTYTPKLDDKGRLTLPAKFRDALAGGLMVTKSQDHSLAVYPRAEFEQ
LARRASKASKSNPDARAFLRNLAAGTDEQHPDAQGRITLSADHRRYASLS
KDCVVIGAVDYLEIWDAQAWQDYQQTHEENFSAASDEALGDII
>MAP2222c hypothetical protein
MLARNAEALYWIGRYVERADDTARILDVALHQLLEDSSVDPDQASRVLLR
VVGIDPPDHDLDVWSLTDLVAYSTGAQGGCSIVDAVTAARENAKSAREVT
SSEIWECLNTTYHALPERERAAKRLGPHDFLSFIERRAAMFAGLADSTLS
RDDGYRFMVLGRAIERVDMTVRLLLSRVGDSASSPAWVTLLRSAGAHDTY
LRTYRGVLDAGRVVEFMLLDRLFPRSVFYSLRLAEHNLDELMHNRQSRVG
ATAEAQRLLGQARSELEFVQPGVLLESLESRLAGLQRTCRDVSDALALQY
FHVTPWVAWSDASQRARLVSRKGDG
>MAP1954c hypothetical protein
MITATREVAAPCERVWEVMAQGWTYTQWVVGNSRTRAVDADWPNPGASIR
HSVGVWPLVIDDATVVERSDPPHELVLRAHLGPLGAARITLRLHATRVGC
RVEMIEVPARGSVRLIPNQLALLAVYPRNQECLLRLAALAERQEPNPVT
>MAP3277c hypothetical protein
MRCRRLIRMTVPHTMPKTTAAFFVQAAVAFAISFVAALGGIYFLPLDPWP
RLFLGVTFLFLVSSAFTLAKVIRDQQEAATVRVRLDEARIERLLADYDPL
NTAN
>MAP1330c hypothetical protein
MRTVDEYAVRPWGLYLARPTPGRVQFHYLESWLLPSLGLRATVFHFNPGH
ERDHDYYLDVGDYLPGPTVWRSEDHYLDIEVRTGIRAELTDVDELLDALR
HGLLTPEVAERAIQRAVAAVDGLARNGYDLTRWLSEIDMELTWRGRAAAP
EPAP
>MAP0305c hypothetical protein
MTSPEAIADQLARTRARTLRLVDFDDDELRRQYDPLMSPLVWDLAHIGQQ
EELWLLRGGDPARPGMLPPAVEGLYDAFVHSRASRVDLPLLSPEQARAYC
RTVRAAVLDTLDALPDDPDAAFVYAMVVSHENQHDETMLQALNLRSGAPL
LRDTSVLPAGRPELAGTSVLVPGGEFVLGVDAADEPESLDNERRAHVLDL
PAFRIGTVPVTNGEWQQFVADGGYDEPRWWSRRGWQHRQAAGLTAPQFWH
PDARTRTRFGHVEDIPADEPVQHVSFFEAEAYAAWAGARLPTEMEWEKAC
VWDPSTGTRRRYPWGATPPSPAVANLGGAALRPAPVGAYPAGASACGAEQ
MLGDVWEWTTSPLRPWPGFAPMLYERYSQPFFDGDYRVLRGGSWAVEPGI
LRPSFRNWDHPYRRQIFAGVRLAWDVPGARDHR
>MAP2040c hypothetical protein
MNHYTYRVAWSPYLGEYVGTCLELPYVRRQGATAQEAMGAIEEAVDWYIA
SAESSGETLPTPMADRHYSGTIVVRTSPELHSRLAMEAAEQRVSMNQWVV
QKLSGRRPSETFGLSGFD
>MAP3520c hypothetical protein
MRRAGRYLFVMVAITVMALVAPVGRGGASVPLPQPVPGIASILPANGAVV
GVAHPIVVTFTAPAADRAAVERSIHVVSPAGVRGHFEWADDNTVVRFVPN
RYWPAHGHVSVGVQALTTGFDTGDALLGVASISKHTFTVSRNGEVLRTMP
ASMGKPSRPTPIGNFTALEKQRSVVMDSRTIGIPLSSPEGYKITAQYAVR
VTWSGVYVHSAPWSVDSQGYANVSHGYINLSPDNAAWYFNEVNVGDPIQV
VA
>MAP2013c hypothetical protein
MSEHGSKRVVVWGTGFVGKMVIAEIVKHPLFELVGVGVSNPAKVGRDVAD
ICGLPEPTGVIATDDVDALIALKPDALVHYGPTAMHAKENIALITRFLRA
GIDVCSTAMTPWVWPTMHLNPPNWIAPITEACELGESSCFTTGIDPGFAN
DLFPMTLMGLCSEVRRVRASELLDYTNYEGDYEVEMGIGREPEYSPMLEN
RDVLIFAWGATVPMIAHAAGIMLDEITTTWDKWVTPTERTTAKGVIKPGH
VAAVRFTINGVYRGETRIQLEHVNRIGLDAAPDWPSGHDNDVYRVDIEGT
PSIFQETAFRFTDGSGRDAAAAGCLATGLRALNAVPAVNELRPGWVTPLD
LPLIPGAGTIR
>MAP0363 hypothetical protein
MAELKSRLRADLTAAMKAQDKLRTATLRMLLAAIQTEEVSGKQAKDLTDD
EVIKVLAKESRKRAESAEIYTQNGRGDLAANEHAEARIIDEYLPTPLTEA
EVADVVDTAIAQVAEEIGERPSMKQMGMVMKAATAIAAGKADGARLSAAV
KERL
>MAP1069 hypothetical protein
MRDFTCPKCGQRLTFENSTCLNCGSALGFSIEQMALLVISEDETSGHAGF
VPASEYQLCANLLLAECNWLVPVNEPRLLCASCVLTTERPNDADSVGLAE
FARAEAAKRRLIAELHELRLPIVGRDQDPDYGLAFRLLSSAHENVLTGHQ
NGVITLDLAEGDDVHREQLRVEMDEPYRTLLGHFRHEIGHYYFYRLIAPH
PDHLRRFNELFGDPDADYQQALDRHYSEGAPEGWQENFVSSYATMHASED
WAETFAHYLHIRDTLDTSAWCGLAPASATIDRPALGPSAFQTIIDTWLPL
SWSLNMVNRSMGHDDLYPFVLPAAVLEKMQFIHAVVDGAG
>MAP1597 hypothetical protein
MSRIGTFADDDLAGWFVKSPDIGAALGAFSQAVYTKNRLPLRTRELARAV
IAHRNECVVCVNTRDEDGPAAGVDEELYDHVHEWRTWPGYSEQERLAAEF
ADRFATDHTGLRDDEDFWSRCAEHFSDELLADLALSCALWVGMGRVLRTL
DIGQACKLTIPSRG
>MAP2374c hypothetical protein
MVELYNLVVWNERNIELARELLGDTVIRHEVGAAHTLTHAEAVQRVVDMW
EMADSLHFDLNVVVEGDDGEHVAIVYDSTIRTKDGAETNIAGIEVFRVVD
GKITEVWNCGYQQGVWN
>MAP2237 hypothetical protein
MNIMRQSHGKGAPMSKPSTTRPLRVIQWTTGNIGRRSLHAIIGRPDMELV
GVYAHGAEKVGVDAAELAGWPEPTGVAATNDIDALLSLGADACCYNPLWP
NVDELVRLLESGVNVCTSAAWITGGKQTPEDRRRIEEACRKGNSTIFGSG
AHPGMTNMVGMVLSASCERVDEIRITESVDCSTYESAETQTAMGFSQDPD
TPGLAESVRRESEVFAESAAMMADAIGAKLDKMTFDVTFTAATGDSDLGF
MKIPAGTVGSVYGYHRGWVGERNVVSVGFNWTMGDHVVPPKPLEHGHVIQ
VFGLPNMRTVLHCLPPKDWTEPGFMGLGMIYTAMPVTNAVPAVVAARPGI
VTLADLPPVTGRLG
>MAP4290 hypothetical protein
MLGGNGLHGVSHPKVDDRAGVPAGTTSFYFRTRKALVHAMAGRLAELDVA
DFSMMAELAEDHATEFAGTAGLARIVMYVNSEPWLTRAKARYELALLAGR
DPELAAALSESADRLYALARNVVTQWHPAGSAPDPALVDDQATATLAFIN
GIMLTFVAGQPAVDDAGQLDRLIQGVIAGVAHVRGA
>MAP1974 hypothetical protein
MPAQAPQTSRHLEVERKFDVVESTVMPSFEGIAAVARVEQSPTQILDATY
FDTPAHDLARNKITLRRRTGGSDAGWHLKLPAGPDARTEVRAPLDASDAD
TVPTELVDVVLAIVRNRPLRPVARITTERDTQVLYDAAGQPLAEFSNDHV
TAWATPSAPEGPDDGSGGVDEGSGGFDDTAASPTDQVPTQHPPTQQEWRE
WELELLGGNGHAEGAAELLNRLSNRLLDAGAVPAGHGSKLARVLGRAPSP
NGARPPEDPLQRAIAEQVRELLVWDRAVRADAFDSVHQMRVTTRKLRSLL
RDYQDSFGLGDDGWVLDELRELAGILGVARDAEVLAERYERDLGALPPEL
VRGPVRERLVGGARRRYQVGLRRSLIAMRSQRYFRLLDSLDAIAARRPGF
PGAEEQAPVTIDTAYRKVRKAAKAAARVDREHPGDRHLRDEAIHRIRKRA
KRLRYTATATRAVRVAERAKAVQSLLGDHQDSVVSREHLSQEAQAAHAAG
EDTFTYGLLYQREADLAQRCRDELDDTLRKLGKAVHKAD
>MAP1538 hypothetical protein
MNMSQDPPDRPADPPDQGAAHGRHELPGKQPRPAIGPLRRTGLSGVLRGG
RSRFGFGTLAVLLCLLLGIAIVTQVRQTESGDSLETARPADLLVLLDSLR
QREGTLSAEVSELQNTLNSLQASGNNDQAAIQSAQSRLAALSILVGAVGA
TGPGVTVTIDDPAPGVSPEALLDVVNELRAAGAEALEVNDAHQSVRVGVD
TWVAGTPGSLTIDSKTLSPSYSILAIGDPPTLAAAMNIPGGAQDTVKRVG
GRMSVQQADRVDVTTLRQPKPHQYAQPVK
>MAP3134c hypothetical protein
MAASDRHPQRLTPLPADEWDEDTRAALASLIPAERANPVGAGNVLSTLVR
HPDLTAAYLPFNAYLLTRSTLSPRIREVALLRVVHRSKCGYLWSHHLPIA
RRAGLSESDIDDIRRGRCADPTDAAVVRAVDELTGDSTLSRQTWDRLCEL
FTQRQCMDLIFTIGGYLLLALAVNTFGVQEEETP
>MAP2523c hypothetical protein
MTKKLRVIQWTTGKVGKLSLRGVLDDPRLELVGVYAFSEEKAGSDAGALC
GRPDTGVLATTDIDALLALKADTVIYTPFMADIDDVVRLLESGLDVISTN
LFLNVGGIQGETKQRLAAACQRGGSSLYITGVNPGWINTMVTAMTAVCRD
VEMVSVSESADVSVYESPETWQSQGFSLSEAPPAVIETAKMWLSTFRDSV
QRMAVALGFELDDMEFFIEHATASERVDLGWWVIEKDTIAAMRAGWNGKV
NGKTVVQSNVAWYMTKLLNEGWQFDDDHYHVVIKGEPGVDTRIRFLPPDS
WGNHEWDTMTAMPAVSAAVDVAAAPAGILTLKDVGLPCAPAGLWLEG
>MAP0099 hypothetical protein
MRVAAKIAHKFEAAKGSAKKVFGRATGNTGMRAEGRAGQVKGNAKQAGDK
LNDAFKH
>MAP0357 hypothetical protein
MDVDAFVLAHRPTWDRLDRLVGRRRSLSGAEIDELVELYQRVSTHLSMLR
SASSDSMLVGRLSSLVARARSAVTAAHAPLSSTFVRFWTVSFPVVAYRSW
RWWVATGAAFFAVVVIVALWVAGNPEVQSALGTPSDIDQLVNHDVESYYS
EHPAAAFALQIWVNNSWVSAQCIALSVVLGLPIPLVLFENAANLGVIAGL
MFPAGKGGLLLGLLAPHGLLELTAVFLAGATGMRLGWSVISPGDRPRGQV
LAEQGRAVVSVAVGLVAVLLVSGLIEALVTPSPLPTFVRVGIGVVAEAAF
LCYIGYFGRRGVKAGESGDIEEAPDVVPAG
>MAP2024c hypothetical protein
MTARRGNLNVYRTLANAEKVFTSWMVAGDAALSSPVLPRRLRELIVLRTA
SAMDCAYELGQHRDVARTVGIDPDTIDAVISETGWQAGDLTPTELAVLHL
TTELVTTRRVAQPLFDRVHRALGTEATVEALMVINRYAGLALMLNALEVD
LDETARLPIPPTS
>MAP4269c hypothetical protein
MRKWEVDLTLIEAWMDALDDEEYDNLIAALEQLEEHGPITRRPFVDTLEG
SRHPNMKELRPRPTKAGAHIRVLFAFDTRSRAIMLIAGDKAGNWSKWYAK
HIPIADELFDAHQKRLHKAAAKATNRKPRKGKKR
>MAP2643 hypothetical protein
MYCLLVLAVGLERVAELLVSTRNARWSFTQGGKEFGRSHYPVMVFIQTAL
LAGCLVEPWALHRPFLGWLGWPMLAVVAASQGLRWWCITTLGRRWNTRVI
VLPQAPLVRDGPYRWLHHPNYVAVVAEGLALPLVHTAWLTAAVFTLANAA
LLRVRLRVENSALGYT
>MAP1006 hypothetical protein
MPAEPDYPQMAAARGRIEPAPRRVRGYLGDVLVFDTTAARYVWEVPYYPQ
YYIPLADVRTELLRDENHAQRVQFGPSRLYSVVAGGRTCESAARVFDADG
DGPLAGTVRFEWDPLRWFEEDEPIYGHPRNPYARVDALRSHRHVHVEREG
ITLADTSSPVLLFETGLPTRYYIDPTDVDFAHLEPSATQTLCPYKGTTSG
YWSVRVGDVVHEDLAWTYHYPLPAVAQIAGLIAFYNEKLDIVVDGTPLPR
PHTQFS
>MAP3291c hypothetical protein
MGMRPTARMPKLTRRSRILILIALGVIALLLAGPRLIDAYVDWLWFGELG
YRSVFSTVLVTRFVVFLIAGLLVGGIVFAGLAVAYRTRPVFVPSNDNDPV
ARYRALVLSRLRLVSVGVPVAIGLLAGIIAQSYWVRIQLFLHGGDFGIKD
PQFGKDLGFYAFELPFYRLLLSYLFVAVFLAFVANLLAHYIFGGIRLSGR
TGALSRSARIQLVTLVGLLVLLKAVAYWLDRYELLSHTRGGKPITGAGYT
DINAVLPAKLILMAIALICAAAVFSAITMRDLRIPAIGLVLLLLSSLIVG
AGWPLIVEQISVKPNAAQKESEYISRSITATRQAYGLTSDVVTYRNYTGD
AQATAQQVADDRATTSNIRLLDPTIVSPAFTQFQQGKNFYYFPDQLSIDR
YLDRNGALRDYVVAVRELNPDRLIDNQRDWINRHTVYTHGNGFIASPANT
VRGIANDPNQNGGYPEFLVNVVGANGNVVSDGPAPLDQPRVYFGPVISNT
SADYAIVGRNGADREYDYETSTETKNYTYTGLGGVPIGDWLSRSVFAAKF
AERNFLFSNVIGSNSKILFNRDPARRVEAVAPWLTTDSSVYPAIVNKRLV
WIIDGYTTLDNYPYSELTSLESATADSNEVAFNKLAPDKRVSYIRNSVKA
TVDAYDGTVTLYQQDEQDPVLKAWMQVFPGTVKPKSDISPELAEHLRYPE
DLFKVQRMLLAKYHVNDPVTFFSTSDFWDVPLDPNPTASSYQPPYYIVAK
NIAKNDNSSSYQLTSAMNRFKRDYLAAYISASSDPATYGRITVLTIPGQV
NGPKLANNAITTDPAVSQDLGVIGRDNQNRIRWGNLLTLPVGQGGLLYVE
PVYASPGASDAASSYPRLIRVAMMYNDKIGYGPTVRDALTGLFGPGAGAA
ATNIQPTEGGAPAASPPANAPAPAVTPGSAPPVAAPPVPDGSVTLSPAKA
AVLQEIQAAIGAAKDAQKKGDFAGYGAALQRLDDAITKYNNTK
>MAP0960 CpsH, CpsH
MIFVIMGMEVHPFDRLARAVDELARTDAVGEEFFVQLGTCRFEPRHARFE
RFLSFGDVCEQIRRASVAVTHAGAGSTLLCIEQGKHPVMVPRRSSRGEHV
DDHQLPFAEKLATAGLATVVRETTELPAAIAATRSRSAPADALGRARELT
GWLETFWRGLA
>MAP1588c ahpD, AhpD
MSVENLKEALPEYAKDLKLNLGSITRTTELNEEQLWGTLLASAAATRNTQ
VLTEIGAEAADTLSAEAYHAALGAASVMAMNNVFYRGRGFLDGKYDDLRA
GLRMNIIGNPGVEKANFELWCFAVSAINGCPDCVASHEHTLREAGVSRET
IQEALKAAAIISGVAQAIVASQTLATAG
>MAP1881 lppL, LppL
MSRRRTQGNPLLLPHALLRRGRHKLAIQGCIWGVALLLSGCSSNPLRGAP
PTIEPAGAAVSPPSAQQPAGSVRPLAAHAAAAVFDAGTHQLAVLAAASDP
ASVTLFGDPQVPPRVAALPGPATALTGDGRGTVYLATRGGYFVLDLATGR
VVRVGVADAAQTEFTAVAQRADGVLVLGSADGAVYTLASPTPAVPTAGPT
TGSTTGSTAAVANRNKIFARVDSLVTQGNTTVVLDRGQTSVTTIGADGRA
QQALRAGRGATTMAADAAGRVLVVDTRGGQLLVYGVDPLILRQAYPVSQA
PYGLAGSRALAWVSQTVSNIVIGYDLSTGIPVEKVRYPTVQQPNSLAFDE
ASDTLYVVSGSGAGVQVIDHAAGKR
>MAP2322c lppS, LppS_2
MRREMEGHWMTPLRGRRSWLAAAMALVAVGAVACGGGHTAAPPKVIFDKG
TPFADLLVPKLTASVSDGAVGVTVDAPVTVSVADGVLASVTMVNENGRSI
SGQLSPDGLRWSTTEQLGYNRRYTLTAKATGLGGAASKQMTFETSSPAHL
TMPYVSPADGEVVGVGEPVAIRFDENIANRAAAQKAITITTNPPVEGAFY
WLNNREVRWRPEHFWKSGTVIDVAVNTYGVDLGDGMFGEDNVKTHFTIGD
EVISTADDTTKTVTVRVNGEVVKTMPTSMGKDSTPTANGIYIVGARFKHI
IMDSSTYGVPVNSPNGYRTEVDWATQMSYSGVFVHSAPWSVGAQGHTNTS
HGCLNVSPSNAEWFYDHSKSGDIVEVVHTVGPTLPGTEGLGDWNIPWSQW
KAGNANT
>MAP1670c lppS, LppS_1
MASKSSDRLVGRILGVRRPRRFLAAVFVVVAVAGAIQLAAAAPCPHGGCP
GGAAPKPTGPARVRITPGPGARNVDPVAPVLVKAETGTLAGVEMVNEGGT
AVQGVMTPDNLVWKPTVALGYGRNYTLTITSRGTDGVETKQVSTFSTLRP
SNQTKVSFTTTSESPLRDGASYGVGTVIVAHFDEQIADRAAAERHLTVTT
NPKVTGSWFWVDGQNVHWRPEHYYAPGTTVTAEAKIYGIALGDGLFGQDD
SRVSFRIGDAHVSIADDATHQVSVFDNGVLVRTMPTSMGMGGTENIGGRS
ISLWTPPGVYTVLDKGNPVVMDSSTFGLPKNSRLGYRETINYATKISSDG
IYLHELDATVWAQGHQDTSHGCLNLNADNAKWFYDFSVPGDVVEVRNSGG
PPLTLAQNGDWTLSWDQWRRGSALKPT
>MAP0440c lpqG, LpqG
MPVATNAPKCVRSAIAAAAAGLAVVAISACDKHGPPPSGAAPRQVTVVGS
GQVQGVPDTLTADVGIEFTAPDVTTAMNQTNERQQAVINALVGAGIDHKD
ISTTEVALTPQYSNPEAGGSAAITGYRASNAVAVKIHPPDAASRLLALIV
GTGGDATRIRSVSYSIADDSQLVKDARTRAFNDAKSRAEQYAQLSGLHLG
KVLSISEATGNAPPPGGPPPSPRAMPMAVPLEPGQQTVKFTVTASWELD
>MAP1872c mbtH, MbtH_2
MSTNPFDDDNGSFFVLVNDEEQHSLWPAFADIPAGWQMVYGEADRAACLD
YIEQNWPDIRPKTLRDRLAAQRRGAGK
>MAP2169c mbtH, MbtH_3
MSANPFDDDTATFVVLVNDEDQHSLWPTFAETPPGWRVVFGAAGRADCLQ
YVEQNWSDIRPKSLIEALAGE
>MAP1419 mbtH, MbtH_1
MQLNPFDDDGGRFFVLVNNEEQHSLWPTFADIPPGWRVVFGEADRTACLQ
YIEESWPDIRAKSLRDTLSAGL
>MAP1524 mgtC, MgtC
METLSAVDFLLRLGVGVGCGALIGLERQWRARRAGLRTNALVAAGATLFV
LYAAATSDTSPTRVASYVVSGIGFLGGGVILREGVNVRGLNTAATLWCSA
AIGVLAASGHLVFAAIGTGTVIGIHLLGRPLGRLIDRDSSADEDESLLPF
LVQVVCRPKHEKYARAQIVQHAGGNDITLRGIHTGHPADDELTLTAHLLM
NGDAPARLERLVAELSLQPGVRAVQWYAGDEAQASDDRR
>MAP0016c pknB, PknB
MTTPQHLSDRYELGEILGFGGMSEVHLARDVRLHRDVAVKVLRADLARDP
SFYLRFRREAQNAAALNHPSIVAVYDTGEAETPSGPLPYIVMEYVDGVTL
RDIVHTDGPLPPRRAIEIIADACQALNFSHQNGIIHRDVKPANIMISTTN
AVKVMDFGIARAIADSGNSVTQTAAVIGTAQYLSPEQARGDAVDARSDVY
SLGCVLYEILTGEPPFTGDSPVAVAYQHVREDPVPPSQRHEGISADLDAV
VLKALAKNPDNRYQTAVEMRADLVRVHNGEKPEAPKVLTDAERSSLLSSG
AGAAGPSRTDPLPRQVLADSGEDRTVGSVGRWIVAVAALAVLTIIVVIAF
NTFGGGARDVQVPDVRGQVSADAIAALQNRGFKTRTLQKPDSAIPPDHVI
GTDPGANASVSAGDEITINVSTGPEQREVPDVSSLSYSDAVTKLKAAGFS
KFKQANSPSTPELLGKVIGTNPPANQTSAITNVITLIVGSGPETKQVPDI
AGQTVDIAQKNLTVYGFTKITQVQVDSPRPAGEVIGTNPPKGQTVPVDSV
IELQVSKGNQFIMPDLSGMFWTDAEPRLRALGWTGILDKGPDVDAGGAQS
HRVVYQNPPAGAGVNRDGIITLKFGQ
>MAP1025 pra, Pra
MPMTDQPPPGGAYPPPPSSPGSPGGQPTPHPGGQQPPPPPGGSYPPPPPP
GGSYPPPPPPSGGYAPPPPGPAIRTLPTQDYTPWLTRALAFVIDILPYVV
VHGIGTAILVATQQTACITDVTQYAVNQYCATQNSTLGLVAQWLASIVGL
FYLIWNYGYRQGTTGSSVGKSVMKFKVVSEVTGQPVGFGMSVVRALAHFV
DAIICFIGFLFPLWDSKRQTLADKIMTTVCLPLDGSESPPS
>MAP3346c pvdS, PvdS
MGLSFVAVSSAKNDGVPLKAGKKKSARAAVKIPDAVYEAELFRLQTELVK
LQEWVRHSGARLVVVFEGRDAAGKGGAIKRITENLSPRVARVAALPVPTD
RERGQWYYQRYIAHLPSKGEIVLFDRSWDNRAGIEKVMGFCTPQEYVLFL
RQTPIFEQMLIDDGILLRKYWFSVSEAEQLRRFKARLKDPVRRWKLSPID
LESVYRWEDYSRAKDEMMVHTDTPESPWYVVESDIKKHARLNMMAHLLST
IPYHAVEAPPVKLPRRPVVTGNYQRPPRELSRYVDDYAATLI
>MAP1893c yfiH, YfiH
MDSTRLADKRFADTGHVSVRIRRVTTTRAGGVSKPPFDSFNLGDHVGDDP
AAVAANRARLAAALGLKPERVVWMNQVHGDRVEVVHGPRDGAVDATDALV
TGTARLALAVVTADCVPVLLADARAGVAAAVHAGRVGAQRGVVVRTVEAM
RALGAQPADMSALLGPAVSGANYEVPAAMADEVEAALPGSRTTTAAGTPG
LDLRTGIACQLRELGVTAIDIDPRCTVADPALFSHRRDAPTGRLASLIWM
E