Gene list

Applied filters:

COG category: Function unknown
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 245

# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP3021 hypothetical protein
MSGKRADGGHDGAGDVALIIAVKRLAAAKTRLAPVFSARTRESVVLAMLT
DTLTAATRVPSLGSITVITPDEAAAAAAAGLGADVLADPTPEGHPDPLNN
AIATAERAVSGSFTNIVALQGDLPALQSQELAEAVAAARAHRRSFVADRL
ATGTAALFAFGTRLDPRFGSDSSARHRSSGAIELTGAWPGLRCDVDTPTD
LAAARRLGVGAATARAIAAH
>MAP3452c hypothetical protein
MSEPAEAGFLDRLRARHGWLDHLIRAYQRFDDRNGGFFAAGLTYYTIFAL
FPLLMVGFAVFGFVLARRPQLLSSIDDHIRSQVNGALGNQLQELMNSAID
ARTSVGVIGLATAVWAGLGWISHLRQALTEMWWDTQIESPGFVRNKLSDL
LAMVGTFAVIVATVGLTTVGHAAPMAALLKWLGIPQFVVFDWLFWIFSIL
IATLVSWLLFTWMIARLPREKVSFVDSMRAGLIAAIGFEVFKQVASIYLR
TVLRSPAGAAFGPVLGLMVFAYITAYLVLFCTAWAATASTDPRDRPVAPP
APAIIAPRVHLDEGLNTRQTLTAMGLGAVAALAFDRLTRRRR
>MAP1982c hypothetical protein
MSVKLADVIAVLDQAYPPWLAEPWDSVGLVCGDPDEPVESVTVAVDATPA
VVDEVPAGGLLLAHHPLLLRGVDTVAASTPKGALVHRLIRSGRSLFTAHT
NADSASPGVSDALAEALGLTVEAVLEPARPASDLDKWVIYVPGENADAVR
EAVFAAGAGHIGDYSHCSWSVTGIGQFLPHEGASPALGSVGQVERVAEER
VEVVAPARARAAVLTAMRAAHPYEEPAFDIFALLPPPGDAGLGRIGRLPQ
PEPLRRFVSRVDAALPATSWGVRAAGDPELAVSRVAVCGGAGDSLLAAAA
GAGVQAYLTADLRHHPADEHRRASEVALIDVAHWASEFPWCAQAADLLRS
RFGERLDVRVSAIRTDPWNVGHDGGEG
>MAP3966 hypothetical protein
MSSHQSAGKIDDGVVAHRASGRTSFAESFAGADPQADAQRRVALRRMKAV
ALSFLIGATGLFLACRWAQAQPGVGAWVGYVGAAAEAGMVGALADWFAVT
ALFRHPLGIPIPHTAIIKRKKDQLGEGLGTFVRENFLSPEVVETKLRDAQ
VSGRLGKWLSEAAHAERVAGETATVLRVLVELLRDEDVQQVIDRMIVRRI
AEPQWGPPVGRVLATLLAENRQEALIQLLADRAFQWSLNAGEVIQRVVER
DQRVVERDSPTWSPRFIDHLVGDRIHRELMDFTDKVRRNPDHELRRSATR
FLFEFADDLQHDPDTIARADAIKEQLMARDEIANAAATAWKTLKRLVLEG
VDDPSSALRSRIADTVVRIGESLRDDADLRDKVDNWLVRAAQHLVSQYGV
EITAIITDTIERWDAEEASRRIELHVGRDLQFIRINGTVVGSLAGLVIYA
VAQLLF
>MAP0363 hypothetical protein
MAELKSRLRADLTAAMKAQDKLRTATLRMLLAAIQTEEVSGKQAKDLTDD
EVIKVLAKESRKRAESAEIYTQNGRGDLAANEHAEARIIDEYLPTPLTEA
EVADVVDTAIAQVAEEIGERPSMKQMGMVMKAATAIAAGKADGARLSAAV
KERL
>MAP2613c hypothetical protein
MAETPRLLFVHAHPDDESLGTGATIAHYTAAGADVRVVTCTLGEEGEVIG
ERWAELAVDRADQLGGYRIGELTAALRELGVGEPCYLGGAGRWRDSGMPG
TPRRRRQRFIDADEREAVGALVAIIREQRPHVVVGYDPAGGYGHPDHVHV
HTVTTAAVAAAGAGNFPGEPWAVPKFYWSVFATRPFEAAVQALTPEDLRP
GWSMPSAEQFTFGYADEHIDAVVAAGPHAWAAKRAALAAHATQVVVGPTG
RACALSNNVALPILDEEHYVLVAGAAGARDERGWETDLLAGLEFGAAPRR
>MAP3849 hypothetical protein
MSEHIRTPASLARRPTVAVLGAGIAGLTAAHELAERGFVVTVYEAQQDER
NGLGSEPAGAYPPVKLGGLAASQYSTVGTHDGSHAELRPFPGRPSRPTDL
GRAVAGEHGFRFFPAYYLHIWDLFQRIPVYELRNPGGWAPTARTVYDNVR
RVITQGVTIDGKPSLVFPRELPRSGAEFLGILGQLATVGLTFGDVATFQG
RMLRYLVTSPLRRARELQNMSAYDFFVGHDRRTGLNQYSYTPRCDALLRQ
MPKVLVAFDSRWGDARTNISTYLQLYLNMDRRDDKADGVLNGPTTESWFD
HWYRHLVALGVRFVRAAATRLDPVPVDPSRPPHLRPRVQITLTDGTRVTP
DYIVVAVDAPGAEFLTAPMRAAGTGGTVAGLDGFATSVPPPDGPLQPQST
RPRQRRDPYSMDELGRVPWDRFQTLCGIQYYFDTEFQLLRGHLLYAGTEW
ALSSINQTGMWERPPILDRDGHVSVLSVDIGDFNAASSHLVDESGRGKAA
RDCTPDEIATEVWRQIVAALTSSTGPAGEEFMPRPVWYALDRGLIMESGP
GQGRGRAVRNATPYLVPIVGDWDNRPSSDPWNPHGTSWSSVPPEDWWLED
LWRRNVWQARHGGYQVHNNAVVFAGTWNKTFTRLTSMEAACESGRHAVNA
ILDHYIWVQSGGLDRREKTTLSWEFPFGFLDQGLSSPIRMPTPAGDYCYV
FDIENREPADTRALRILDSRFCEWSLPHPLDVGAPTSFIPPPAGGQEMFG
PTTDYNQQLLAYLQAWRELLERWTAMSAATPSPAAPFTPPAAPFTPPFMP
PAAAPAPMPPAPADYTQQLFGYLQAWRQYLEQMAGASSGSAQQPTAPPTA
PPTMPPTMPPPPPPPSTPSTGGQAFGAAPPGQPGTSGDPTPGGSTPVSAT
AKGSTLTWPPPLLGLEPSSYVGTQDPPVSRFLPPNLVNRAEDRQEVLLRP
NDEYGYLHDLFSLNPGLARAISSTGPAPVASGSSFLGAMNRVGPDVSARV
APRSLFSSRAARAPSAGAITPGSRTG
>MAP3536 hypothetical protein
MTSKLSTTLHTSFPDAVDRITKALADQGFGVLTTIDVKATLKQKLGADME
DYLILGACNPALAHRALGIDRQIGQLLPCNVVVRSDPAEGDAVLVEAMDP
QLMVKVTGEAGALQEVADQATAKLQAAISALAG
>MAP2042c hypothetical protein
MKVRLLATVAALILSTVLGCHFESRNPPSTKTLQVPMNDVLTQSDISQNI
TLAVGNTLVVQLGSNYTTPYRWTPDAKIGDSAIVKQTSHEFVPPTSDALG
APGTEVWTFAALKPGSTTITTSYSSFVDKNAKPACTYTLSVTVR
>MAP4256 hypothetical protein
MRHYYSVDAIRAAEAPLLASLPDGALMRRAAFGLATEIAAELTARTGGVA
GRRVCAVVGSGDNGGDALWAATFLRRRGAAADAILLNPERAHRKGLAAFG
KAGGRIVESVSPTTDLVIDGVVGISGSGALRPAAAEVFAAVDDAGIPVVA
VDIPSGIDAATGATSGPAVHAVLTVTFGGLKPVHALGDCGRVKLIDIGLD
LPQTDVLGFEAADVAARWPVPGPHDDKYTQGVTGVMAGSATYPGAAVLCT
GAAVAATSGMVRYAGSAHREVLAHWPEVIASPTPASAGRVQSWVVGPGLG
TDDAGAAALWFALETDLPVIVDADGLTMLAAHPELVANRAAPTVLTPHAG
EFARLAGSPPGDDRVGATRKLADTLGVTVLLKGNVTVIADPGGPVYLNPA
GQSWAATAGSGDVLSGMIGALLASGLPAGEAAAAAAFVHARAAALSAADP
GPGEAPTSSSRMVPHIRAALAAL
>MAP0357 hypothetical protein
MDVDAFVLAHRPTWDRLDRLVGRRRSLSGAEIDELVELYQRVSTHLSMLR
SASSDSMLVGRLSSLVARARSAVTAAHAPLSSTFVRFWTVSFPVVAYRSW
RWWVATGAAFFAVVVIVALWVAGNPEVQSALGTPSDIDQLVNHDVESYYS
EHPAAAFALQIWVNNSWVSAQCIALSVVLGLPIPLVLFENAANLGVIAGL
MFPAGKGGLLLGLLAPHGLLELTAVFLAGATGMRLGWSVISPGDRPRGQV
LAEQGRAVVSVAVGLVAVLLVSGLIEALVTPSPLPTFVRVGIGVVAEAAF
LCYIGYFGRRGVKAGESGDIEEAPDVVPAG
>MAP3679 hypothetical protein
MPDLARRLRRTLWPIAQTTVAAGLAWYLAHDVLDHREPFFAPIAAAVCLW
MTNRVRAELAVEMIIGVALGIGAGTVVLAVFGAGPIGMGAAVMLSLTLAV
LIARSFSTQRSMFVNQSLISAILIMAFPHTGLGVERLYDALIGGGLAVVF
SILLFPKNPLAVLHDASGELLTALHDILAQICCRSADSASPDQGWALSAA
DRLQRCSASLIEARGTAGQLARVCPRRWPLRGAVGAADRQVVHVALFGSS
VLQLTRTLMSVRIRGGPHAEAFRAAVDDLRDADSALAAGHPATAAAHAES
ARRHSAALPAGSPASVAVVIDACIDELQQAIGPRRS
>MAP1786c hypothetical protein
MRRVIQFSTGNVGRHSLRAIIGRPDLELVGVHAAGPEKIGRDAAQLCGLD
EPTGIIATDDIDALVALNADCVVYTSQGETRPMDAIEQMSRFLAAGTNVV
GTSMVWLVAPRHAEEWLREPLQRACEAGNTSLYVNGIDPGFSGDTLVHAA
VSLATRVSSITVQEIFDYGNYDDAEFTGAAMGFGTAPDDDSPMMFLPGVI
VAMWGGQVRSLADHLGVELNDVRQRIERWFTPERIDCTMMTVQPGRMAAV
RFATEGVRDGEPVITLEHVTRLTRAAAPDWEYPPDGHTGVHRVVVAGEPR
VEINTHVSHPLFDSTDAGCISTAARVVNAIDWVCRAPQGLIGVEDIPLAA
TMRGVLWQDR
>MAP1954c hypothetical protein
MITATREVAAPCERVWEVMAQGWTYTQWVVGNSRTRAVDADWPNPGASIR
HSVGVWPLVIDDATVVERSDPPHELVLRAHLGPLGAARITLRLHATRVGC
RVEMIEVPARGSVRLIPNQLALLAVYPRNQECLLRLAALAERQEPNPVT
>MAP2240c hypothetical protein
MAVVVVTDSSARLPADLLGQWGIRVVPLHILLDGTDLRDGVDEVPADIHQ
RAATTAAATPAELADAYQQALADSGGDGVVAVHISSALSGTCRAAERTAA
DLDPNLRVVDSKSAAMGTGFVALAAARAAAAGAQLDAVADAARAAVSRGH
GFIVVHRLDNLRRSGRIGGPAAWLGTALALKPLLRIDDGKLVLAQRVRTA
GHATEAMIDRVCQVIGDGAAALAVHHVDNPGGAAEVAAALGERLPACDPA
IITELGPVLALHVGAGAVAVVVQQPGS
>MAP3887c hypothetical protein
MRAARDRPLAPGAGVLVIATLYGGNDGINTLIPYADNAYHDARPELAYAP
QDVLHLDQQLGLNPALKGMAGLWNQRKLAVVRGAGYPKHDHSHFRSMDIW
QTASPDEPVSTGWIGRWLDATGDDPLRAVNIGAVLPPLAVGAKCTAAALS
PGGGAGKAGQFDAVTTALGDDDPDDTPAMAAVCKAYRAARTADATFASVK
PPAKQNNSLAAQLDVVAQAVKARVPARVYTVQLGGFDTHAGERGTQQRLL
QTFDEAVTGFVAQMAGRNVVLMAYSEFGRRVRANASQGTDHGTAGPVLIA
GAPVNGGFYGEQPSLTDLDNGDLKYTTDFRDIYHELLARTVGTDPAPSVG
AGRRDLGFLSA
>MAP1776c hypothetical protein
MERLDVVVGPNGAGKSTFIALTLAPLLPGSVVVNADEIARQRWPQDPASH
AYDAAKVAADTRAKLIDLGRSFIAETVFSHPSKLDLLHAARRAGFTVVLH
VLLIPEDLAVERVRHRVRAGGHDVPETKIRERHRRLWTPVAEAMTLADLA
TGYDNSRLRGPRVVARLSGGLTVGAVDWPAWAPETLRSRWPV
>MAP0356c hypothetical protein
MSEVVTGDAVVLDVQIAQLPVRALSALIDIAVIVVGYLLGLMLWAATLTQ
FDTALSNAILLIFTVLVIVGYPLILETATRGRSVGKIALGLRVVSDDGGP
ERFRQALFRALASLVEIWMLFGSPAVICSILSPKAKRIGDIFAGTVVVNE
RGPRLGPPPAMPPSLAWWASSLQLSGLSSGQAEVARQFLSRAAQLDPGLR
LQMAYRIAGDVVARIAPPPPGAPPELVLAAVLAERHRRELARLRPPAPWP
APGYPPAWPGSGPAPQWPAPGPANPGPPEGFSAGFTPPR
>MAP4063c hypothetical protein
MADSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTTIAWKGDEAIE
LTSSTGERVKAAVDVFKEKLIRRDISMKAFDAGEPQASGKTYKVNGTLKQ
GISSESAKKITKLIRDEGPKGVKTQIQGDEIRVSSKKRDDLQAVIAMLKQ
ADLDVALQFVNYR
>MAP3934c hypothetical protein
MLTGRLDLMSLVVPPYPPARYTKDQPETSAWLKRADEPPDYQTAGVKYHY
LANQHDTAGDYGLYRVDIAPAGGGPGPHFHRAMSEAFFVLSGTMKLYDGT
EWTDGHQGDFLYVPPGGVHGFRNEADDPASILMLFAPGAPREAYFEGFAA
LADMTDEERREWFARHDNFWVQ
>MAP3096 hypothetical protein
MSPTPFDRFPSSMRRARCTIWYMFTEDSVEIDAPPRLVWDVFTDVERWPE
WTASVTSLTGLDGPALAVGRRFAIKQPGMAKLVWQVTELIPGASWTWVQR
SPGARVAATHHVSARPGGGTLVRQQLDQRGALGALVGRLMAKKTKRFLAL
EARGLKARAEQLSRADGAHP
>MAP4290 hypothetical protein
MLGGNGLHGVSHPKVDDRAGVPAGTTSFYFRTRKALVHAMAGRLAELDVA
DFSMMAELAEDHATEFAGTAGLARIVMYVNSEPWLTRAKARYELALLAGR
DPELAAALSESADRLYALARNVVTQWHPAGSAPDPALVDDQATATLAFIN
GIMLTFVAGQPAVDDAGQLDRLIQGVIAGVAHVRGA
>MAP0236c hypothetical protein
MRFVVTGGLAGIVDFGLYATLYKVVGVQVDVAKAISFIVGTITAYLINRR
WTFQAAPSTARFVAVMALYAVTFAVQVGLNHLWLAFLHYRGWAIPVAFVI
AQGTATVINFVVQRAVIFRIR
>MAP2040c hypothetical protein
MNHYTYRVAWSPYLGEYVGTCLELPYVRRQGATAQEAMGAIEEAVDWYIA
SAESSGETLPTPMADRHYSGTIVVRTSPELHSRLAMEAAEQRVSMNQWVV
QKLSGRRPSETFGLSGFD
>MAP1537 hypothetical protein
MIGIAALAIGIVLGLVFHPSVPEVVQPYLPIAVVAALDAVFGGLRAYLER
IFDPKVFVISFVFNVFVAALIVYVGDQLGVGTQLSTAIIVVLGIRIFGNA
AALRRRLFGA
>MAP4244 hypothetical protein
MDPTLSYNFGEIEHSVRQEIHTTSARFNAALDELRARIAPLQQLWTSEAA
TAYQAEQLKWHRSATALNEILVQLGDAVRDGAEEVADADRRAAGVWAR
>MAP3157 hypothetical protein
MFLPHQIIGQLINKQLDDNMRRYFFRGMEFAAPVGDPGWFGPDSAVWRVH
SHLPALIFGLQCAAFMETLDPSIYWMGMHHSRLIKRDSNGNPVSHVPVID
PEGAATRLGHSVAFFIGTAYGSPETAERLAKSVRAMHHTIKGTRPDGARY
DADDPEWLRWNYATVVWGIATAHELYHPMPLRGKALDRYYGEFVRVGHAL
GGTDLPATKAETLECLESYLPKLAVTHGKAMGTGPNVAMPQAAVDWAIRD
TMPKWAKQMLQHRDCNIIERTARRSAVWAIINGIHAASGPAPEFRQAQAR
VRGGTTVPHTVPSYVLGTDQVRSRAEVERSFQSV
>MAP1006 hypothetical protein
MPAEPDYPQMAAARGRIEPAPRRVRGYLGDVLVFDTTAARYVWEVPYYPQ
YYIPLADVRTELLRDENHAQRVQFGPSRLYSVVAGGRTCESAARVFDADG
DGPLAGTVRFEWDPLRWFEEDEPIYGHPRNPYARVDALRSHRHVHVEREG
ITLADTSSPVLLFETGLPTRYYIDPTDVDFAHLEPSATQTLCPYKGTTSG
YWSVRVGDVVHEDLAWTYHYPLPAVAQIAGLIAFYNEKLDIVVDGTPLPR
PHTQFS
>MAP2165 hypothetical protein
MSRLPGVSDRDAGLGARIAFFFTRRKLAQMTGLETAGMLEPLRMYAHIPR
LLNAYGRLEQAESRLDILSPRHRALAELKAATTVRCEYCIDLGSQIARRW
GITDEELLAMADYRDAACFSDVDKLILEYATAISRTPVEVSDELFDALRA
HFDTAQLVGLTHVITLGNLRARFNIALGIGSSGFSGNRVCALPDTPRP
>MAP3395 hypothetical protein
MSFAEATIARLPRLLQPYLLRHHELIKFAIVGGTTFVIDSAIFYTLKLTI
LEPKPVTAKVIAGIVAVIASYVLNREWSFRNRGGRERHHEALLFFAFSGV
GVLLSMAPLWFSSYVLQLRQPTVSLTVENVADFISAYIIGNLLQMAFRFW
AFRRWVFPDQFARDPDKALESALTAGGIAEIFEDAFEDDGGNVTLLRAWR
NRAGRLTQLGDSSEPRVSKTS
>MAP2272c hypothetical protein
MTATPREFDIVLYGATGFVGKLTAEYLAGAAPDKRVALAGRSTEKLRAVR
DSLGDAAQSWPVLQADASSPATLNEMAARTQVVITTVGPYTRYGLPLVAA
CAAAGTDYADLTGEAMFVRDSIDSYHKQAADTGARIVHACGFDSVPSDLS
VYALYRAARDDGAGELVDTDLVVRSFSGGVSGGTVASMLEVLDTASRDPE
ARRQLADPYTLSSDRGAEPDVGPQPDLPWRRGRQIAPELAGVWTAGFVMA
PYNTRIVRRSNALLDWAYGRSLRYSESMSLGSSPLAPVASAVVGGTAAAT
FGLGSRYFRFLPRRLVERIVPKPGTGPSPAARERGYYRIETYTTTTSGAR
YVARMEQRGDPGYKATSVLLGESGLALAFDRDKLPQLYGVLTPAAAMGDA
LLDRFPGAGVFLQVDRLAG
>MAP3705c hypothetical protein
MALILVAHGTRRPGGVAMIEGLAAQVSTLVGGRVEVAFVDVVGPTPSEVL
AAARAAGRPAIVVPAFLSRGYHVRADLPAHVALSGHPNVTVTPALGPSGQ
IARIVGDQLLECGWRPNDSVVLAAAGTSDDKARADLHTAATWLSALTGSR
VTLGFAATGDPPLGEAVARARPHARRNGGRVVVASYLLADGLFQQRLHGC
GADLVSAPLSTHPGLARLIANRFRRALPPVLAATARHASRRTGPHQRAHA
PATRPVP
>MAP1019 hypothetical protein
MRETSNPVFRSLPKQSGGYAQFGTGAAPMQGYQADPYAAPYATPYQETRA
SRPLTIDDVVTKTGITLAVLAASAVVSYFLVLSNVALAMPLTLVGALGGL
GLVLVATFGRKQDSPAIVLSYAVLEGLFLGALSFVFANFSVSSANAGVLI
GEAVMGTFGVFFGMLVVYKTGAIRVTPKFTRMVVAALFGVLALMLGNFVL
AMFGVGGGAGLGLRSGGPLAIIFSLVCIGIAAFSFLIDFDAADQMVRAGA
PEKAAWGIALGLTVTLVWLYIEILRLLSYLQND
>MAP3063 hypothetical protein
MNSSHDRVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLRV
LDTLAGEDRRGLLTLGVTPVVNAQLDDPYCLDGMHHWLANWRLRAMEAAS
VRSAPRSKSAGYQACTPEALRALGIRESAEAERALDDFATRWRHGGSPLL
RRLLDAGTVELLGGPLAHPFQPLLAPRLREFALREGLADAALRLGARKGR
GPGGIWAPECAYAPGLEHDYAAAGVTHFMVDGPSLHGDTALGRPVGDTDV
VAFGRDLQVSYRVWSPKSGYPGHPAYRDFHTYDHLTGLKPARVTGRNVPS
ESKAPYDPQRADHAVDLHVADFVDVVRNRLTSESERIGRPAHVVAAFDTE
LFGHWWYEGPTWLARVLRALPEAGVRVGTLHDAIAGGFVGDPVDLPPSSW
GSGKDWQVWAGDQVADLVQLNSEVVDTALSTVDKALSQAGSQPAALDGPV
PRDRVADQILRETLLTVSSDWPFMVSKDSAADYARYRAHLHAHATREIAG
ALASGRRDTAQRLADGWNRADGLFGALDARRLPR
>MAP4243 hypothetical protein
MATSNTVSTDFDLMRSVAGTTDARNEEIRAMLQAFIGRMSGVPPAAWGGL
AAARFKEMIDRWNAESVRLYHALHAIADTIRHNAATLQDAGQNHADHIAA
AGGSL
>MAP1640c hypothetical protein
MTDTSRFATIRVDQFVAAPPAKVWRMLTEPELMKLWWAEGQVAAVVGHRF
TLDMPGYGKQPCQVLEVDPPRRFVYTFTAAWTLTWRLEAEGEGTRVFLEH
SGFDLDDARMARAFEQMGPGWRDTVLPRLARVAVD
>MAP0065 hypothetical protein
MTGAAEPSATISPGPLAADRRSADNRDCPSRNDFLGAAFAEVIGGPVGRH
ALIGRARIMTPLRVMFLIALVFLALGWSTKAACLQSTGTGTGDQRVANWD
NQRAYYELCYSDTVPLYGAELLSQGKFPYKSSWIETDSTGAQQIRYDGQP
AVRYMEYPVLTGMYQYVSMALAKTYTALSKLAPLPVVAEVVMFFNIAAFG
LALAWLATVWASAGLAGRRIWDAALVAGSPLLIFQIFTNFDALATAFAMA
GLLAWARRRPVLAGVLVGLGAAAKLYPLLFLGPMLLLGIRTGRLGAWART
AVAAVATWLLVNLPVLVFFPRGWSEFFRLNTRRAEDMDSLYNVVKSFTGW
RGFDPKLGFWQPPTVLNTVVAVLFVTCCAAIAFIALTAPRRPRATQLVFL
VVAAFLLTNKVWSPQFSLWLVPLAVLALPHRRVLLAWMTVDALVWVPRMY
YLYGNPNRSLPEQVFTTTVLLRDIAVITLCALVIRQIYRPDEDLVRWGGR
VDDPAGGPFDRAPDAPPGWLPDWLRPAGSRRGQAAEPEPALAGAT
>MAP3075 hypothetical protein
MTAGATDRRRRILPAPTGLPDAGALPSRPRVAVVGGGIAGLTAATGLAER
GVAVEVIEREHYLGGRVGGWTEHHDGTDLAMNRGFHAFFRQYYNLRALLV
RLDPRLRMLVPVNDYPLIDAAGRRDSFRGLPRTPPLNAMAFAVRSPTFRL
RDFARIDARAAAPLAAVSVPGTYRRLDHIDAATFLEDIRFPEAARHLAFE
VFSRSFFADPAKLSAAELATMFHIYFLGSAEGLIFDVPSANYDSALWQPL
RSYLEQRHVRFRLGTSVGSIDAANRFAVHTDTDEELEVDAVVLATDVAGL
QRIVAASRGLANGEWRDRIAGLRTAPPFAVHRFWLDRPVSPRRPAFLGTA
GHKPLDNISVLDRYEREARAWAGAHHGSVVELHSYALDSAPSGAAALREL
RRIYPETAAAQIVHETLLHRSDCPLFAPGGYPHRPTVVTPTPGLLLAGDA
VRIDLPVALMERAATTGWCAANHLLKRWGLAGHPLVTVPTEGRSRLLRWL
ANREGATRS
>MAP1944c hypothetical protein
MRHVGEAMTVQNESNAKTHGVILTEAAATKAKSLLDQEGRDDLALRIAVQ
PGGCAGLRYNLFFDDRSLDGDLTAEFGGVTLTVDRMSAPYVEGASIDFVD
TIEKQGFTIDNPNATGSCACGDSFN
>MAP2073c hypothetical protein
MSSTPHPAEPHIGSVSSKLNWLRAGVLGANDGIVSTAGIVVGVAAATALR
APILTAGSAGLVAGAVSMALGEYVSVSTQRDTEKALLIQEHQELRDDPAA
ELDELAALYEAKGLTAATARTVAEELTDQNPLLAHAEVELGINPEELTNP
WHAASSSALSFAIGALLPLIAILLPPPTWRIPVTVVAVLIALVITGAVSA
RLGGAPQLRAVARNAIGGSLALAVTYTIGHVVGAAID
>MAP0609 hypothetical protein
MTAAVTPKGERRRYALVSAAAELLAEGGFEAVRHRAVARRAGLPLASTTY
YFSSLDDLIARAVEHIAMIEVAQLRSRVSALSRRRRGPETIAEVLADLLV
GDVSGPGLTEQLISRYERHIACTRLPALRETMRRSLRQRAEAVAEAIERS
GRSVHIDLVCTLICAVDGSVVSALVEGRDPRAAAQGAVVDLIEVLAPIDQ
RPVQI
>MAP3829c hypothetical protein
MERVTHSHSHGLPSGPAPVDRLPARIVVGLLIAIGVAVAAGAIVLWPSRQ
HVDIPMPLQNAAGGAVSTQAGHVLSSGLGDCGSPSVSQVLTGPPQPALPG
AGRCVLTQVAIDSGPNAGAATLLESSPGPGQPKFAVGDRIRIVRQVDDQG
ATSYAFYDFERGWALVGLAVAFAVVIVAVARWRGLLALVGILVAFVVLVT
FLLPALRDGAPAVPVALVAAVAILYAVIYLAHGVSLRTSAALLGTLSSLL
LAAGLSWAAIQLAHLTGLSDEQNSTVSAYLGSVSIGGLLLAGFIIGSLGV
LNDVTVTQSSTVFELARLGGSRRAIFTGALRVGRDHIASTVYTLVLAYAG
SSLPLLLLFSVANRSLTDVLTGESVAIEIARSAVGGMALALSVPLTTAIA
AVLAKPSGGRKRGEAGRGGPPPSV
>MAP3796 hypothetical protein
MHRLLTSLCAAACVIVASVVLSPISAAAGAPWFANAVGNATQVVSVVSTG
GSNATMEIFQRTGTGWQSLRSGVPTHVGSAGMAPQAKSGVPATPMGVYSL
DSAFGTAPNPGTGLPYTQIVGPNYWWSGDDHSPTFNSMQVCPKAQCPFNT
AESENLQIPQYKHAVVMGVNKNKTPGGGAAFFFHTTDGKPTEGCVAVDDA
QLVSIMKWLRPGAVIAITK
>MAP2367c hypothetical protein
MLAGLAFGSEIKRFRRSRLTRAAIVVLMLLPLVYGALYLWAFWDPFGHTN
KMPVALVNSDRGAVVSGQQFNAGAEIAKSLTADGSLDWHVVDLPEARNGV
DHGKYYFMVELPPDFSAAIASPVTGQPKKANLIAVYNDANNYISTSIGRT
AIEQVLNAVSSRISGQAVNQMLSVVVSSGSGIKQAADGAARLDDGAGQLA
AGLDSARTGSATLAAGATQLSDGINQATDPLLAVTKAVSQIGGSTEQLQQ
GATALAQANDQLGAIAAAQDAAASSLTSVIDQLSARADPLANNLRGIQDQ
LRGHQFTPQIRQQLTDAQNAAIAMTSGLRTPGSPLASALDQLGGKGQELT
NKLTQLRDGAQQVATGNAELAGGIAKLDDGAGQLKAGSAELATKLAEGAR
QVPNWTTQQKEAVADTIGGPVQLEASHENAAPNFGTGMAPFFLTLALFFG
ALVLWMVLRPLQYRAIAAEVIAIRVVLASYLPAAAIALFQAVVLYCVVRF
ALGMHAVHPVAMLGFMVLISGAFVAATQAINALVGPAVGRVLIMALLMLQ
LVSAGGMYPVETTSRPFQVLHRFDPMTYGVNGLRQLILGGIDARLWQAII
VLAAITAVALAISCLSARRDRLWNLSRLFPAIKM
>MAP1076 hypothetical protein
MHTDVLDVDTSRRRIVDLTEAVRGFCWSRGDGLCNVFVPHATAGVAVIEM
GAGSDDDLVDTLERLLPRDDRYRHAHGSPGHGADHVLPALVAPSVTVPVS
AGEPMLGTWQSIVLVDLNRDNPQRSVRLSFLEG
>MAP2013c hypothetical protein
MSEHGSKRVVVWGTGFVGKMVIAEIVKHPLFELVGVGVSNPAKVGRDVAD
ICGLPEPTGVIATDDVDALIALKPDALVHYGPTAMHAKENIALITRFLRA
GIDVCSTAMTPWVWPTMHLNPPNWIAPITEACELGESSCFTTGIDPGFAN
DLFPMTLMGLCSEVRRVRASELLDYTNYEGDYEVEMGIGREPEYSPMLEN
RDVLIFAWGATVPMIAHAAGIMLDEITTTWDKWVTPTERTTAKGVIKPGH
VAAVRFTINGVYRGETRIQLEHVNRIGLDAAPDWPSGHDNDVYRVDIEGT
PSIFQETAFRFTDGSGRDAAAAGCLATGLRALNAVPAVNELRPGWVTPLD
LPLIPGAGTIR
>MAP3642c hypothetical protein
MSLTEATAAGAEPVAPPAVLSGDALGGFPPPGGRVLLVWDAPNLDMGLGS
ILGRRPTALERPRFDALGRWLLARTAEVSAERPGVVVEPEATVFTNIAPG
SAEVVRPWVDALRNVGFAVFAKPKIDEDSDVDRDMLAHIELRRTEGLAAL
VVASADGQAFRQPLEEIARSGVSVAVIGFREHASWALASDTLDFVDLEDI
SGVFREPLPRIGLDSLPDQGAWLQPFRPLSALLTARV
>MAP4269c hypothetical protein
MRKWEVDLTLIEAWMDALDDEEYDNLIAALEQLEEHGPITRRPFVDTLEG
SRHPNMKELRPRPTKAGAHIRVLFAFDTRSRAIMLIAGDKAGNWSKWYAK
HIPIADELFDAHQKRLHKAAAKATNRKPRKGKKR
>MAP3746 hypothetical protein
MTRTPESQQESDYGYVAHKDGYAKRLRRVEGQIRGIAKMIEEDKYCIDVL
TQISAANNALRSVALNLLDEHLDHCVSSALAEGGDEAQAKLSEASAAIAR
LVRS
>MAP3586c hypothetical protein
MRAMGGRPMSLVAGRGPLSSDPAGRFSPPIPAEVVYVEPHPRRVQAVKDG
RSVIDTERALMVHRRGRPLSYVFPADEVAGLPGEPEPEAPGFVHVPWDAV
DTWWEEGRKLVHYPPNPYHRVDCRGTRRRLRVRVGGTTLVDTDHTTIVFE
TALPPRLYVDPAHVRTDLLRRSETTSYCNYKGFATYWSLVDGDRVVDDVG
WCYPDPPPESLPIKGFLSFDETRVELLAELPVSARS
>MAP0796c hypothetical protein
MTVTVILELRFKPDEVAAGRELMGRALQDTRAFDGNVRTDVLVDEDDEAH
WLVYEIWETVEHDQAYRAFRAGEGKLTQLPPLLAAPPVKTRYVTSDI
>MAP1149 hypothetical protein
MTTEVKDELSRLVVKSVSARRAEVTSLLRFAGGLHIVGGRVVVEAEVDLG
NVARRLRKDIFELYGYNAVVHVLSASGIRKSTRYVLRVANDGEALARQTG
LLDNRGRPVRGLPAQVVGGSIADAEAAWRGAFLAHGSLTEPGRSSALEVS
CPGPEAALALVGAARRLGVSAKAREVRGADRVVVRDGEAIGALLTRMGAQ
DTRLIWEERRMRREVRATANRLANFDDANLRRSARAAVAAAARVERALEI
LGDTVPDHLASAGKLRVEHRQASLEELGRLADPPMTKDAVAGRIRRLLSM
ADRKAKIEGIPDTESAVTPDLLEDA
>MAP1986 hypothetical protein
MPRWLRGLSFLLRPGWVVLALVVVAFAYLCFTVLAPWQLGKHSRTSQQNH
QIEHSLTTPPVPLKTLLPQQNSAAPAEQWRQVSATGHYLADVQVLARLRV
IDSKPAFEVLAPFVVDGGPTVLVDRGYVRPLEGSRVPPIPRPPADTVTIT
ARLRNSEPAAGKDPFVGDGVRQVYSIDTEQIAVLTKVPLAGSYLQLVDGQ
PGGLGVVGVPQLDAGPFLSYGIQWIAFGILAPIGVGYFAYSELRARRAER
QPAAPAPEAPQSVQDKLADRYGRRR
>MAP3530 hypothetical protein
MPSEINNSETRLSWVLAVLAGVLGATAFTHSAGYFVTFMTGNAQRAMLGY
FRGDVVLSVTAGVLIVCFVAGVVIASVCRRHFWVDHPHGPTVLTTFSLVA
ATLVDVIDEGWEENLLDFAPIMLVTFGIGALNTSFVKDGEVSVPLSYVTG
TLVKMGQGIERHIAGGTAADWLGYFLLFASFVVGATVGGFISLFVNGTSM
LVAATVMCALTTGYTYFHSDRRALLDEA
>MAP3293 hypothetical protein
MSTVEVMADLPFGFSSGEDPDKPGKKDPDSGSNPSDPFAAFGISGEFGMG
DLGQIFTQLGQMFSSAGSASAGGSDSGPVNYELARRVASNSIGFVAPIPA
TTNSAIADAVHLAETWLDGATALPAGTAKAVGWTPADWVDNTLETWKRLC
DPMAQQISTVWAASLPEEAKSMAGPLLQMMSQMGGMAFGSQLGQALGRLS
REVLTSTDIGLPLGPKGIAAILPDAVESFASGLERPRSEILTFLAAREAA
HHRLFSHVPWLASQLLGAVEAYAMGMQIDMSGIEELARDFNPASLSDPAA
IENLLGQGVFEPKATPAQTQALERLETLLALIEGWVQVVVAAALGDRIPG
AAALAETLRRRRASGGPAEQTFATLVGLELRPRKMREAAALWERLTEAAG
VDARDGIWQHPDLLPDADDLDDPAAFIDRVIGGDTSGIDEAIARLEQEGP
DSPGSGGDT
>MAP1624 hypothetical protein
MSVLVAFSVTPLGVGEGVGEIVAEAVRVVRDSGLPNKTDSMFTVIEGETW
EEVMAVVQRAVEAVAARAPRVSTVIKADWRAGVSDAMTHKVASVERYLSD
G
>MAP1067c hypothetical protein
MATWYDVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARTG
AEPAPGNVLGVRVADEGVKFALIDDAPQTFFTTPHFDGYPAVLVNLDAIS
VRDLEELITEAWLTQAPRKLVQEFLADSR
>MAP3571 hypothetical protein
MQKVQAAEDAWNTRDPDHVSLAYTPDSRWRNRDEYIVGRDQIVAFLTRKW
QRELGYSLRKSLWDFHDNRIAVRFQYECRDRSGQWYRSYGNELWEFTESG
LMARREASINDVPIDESQRRYFGPRPASEHGREIPLW
>MAP3817c hypothetical protein
MTSEDHTAVATDTLTRSPSQTDARWQVLRASCWVLLVLTLLAAVVMAVRP
SLAAQAGAAQMGFLILFVVVHAWLGYSARGMAAFVAIAGIVAFALEAIGV
ATGFPFGSYTHHLPGPKPLGVPPVAIASWIIFGWLAWALARVIMRPVPGV
AVGGAERFTTPIVATLILGGYDLVYDPIGATAHDWYSYDHPTGALGVPLS
NFLGWLLTGWLIFQLVALVEPRFPGSPVTRTRTYWLLPCLFWLSTTTQIF
TSLIHPPDGFAVRGGKTIQLADVYESGASAALFTIVLTGIMALVRLYRRQ
ASKTLRSSADEA
>MAP2910c hypothetical protein
MTTGLPSQTQVIELLGGEFARAGYEIEDVVIDAHARPPRITVIADGDDGL
DLDAAATLSRSASALLDKLDTIEDHYVLEVSSPGVDRPLRTPKHFRRARG
RKVDVVLSDNSTVTGRVGETGDDTIALVVRAGRDWAIREIPLGDVVKAVV
QVEFSPPAQAELELAGVGGTDKTEERRK
>MAP1538 hypothetical protein
MNMSQDPPDRPADPPDQGAAHGRHELPGKQPRPAIGPLRRTGLSGVLRGG
RSRFGFGTLAVLLCLLLGIAIVTQVRQTESGDSLETARPADLLVLLDSLR
QREGTLSAEVSELQNTLNSLQASGNNDQAAIQSAQSRLAALSILVGAVGA
TGPGVTVTIDDPAPGVSPEALLDVVNELRAAGAEALEVNDAHQSVRVGVD
TWVAGTPGSLTIDSKTLSPSYSILAIGDPPTLAAAMNIPGGAQDTVKRVG
GRMSVQQADRVDVTTLRQPKPHQYAQPVK
>MAP3639c hypothetical protein
MPHDAPAHNLDLPREQTPRGRYWWVRWVILGVVAIVLAVEVSLGWDQLAK
AWMSMYEANWWWLLASVVAAAASMHSFAQIQRTLLKSAGVHVKQLRSEAA
FYAANSLSTTLPGGPVLSATFLFRQQRLWGASTVVASWQLVMSGVLQAVG
LALLGLGGAFLLGAKNNPFSLLFTLGGFVALLLLAQAVASRPELIEGIGS
RVLAWVNSVRGRPAETGLDKWRETLMQLESVSLGRRDLSVAFGWSMFNWI
ADVACLGFAAYAAGDHASVAGLTVAYAAARAVGTIPLMPGGLLVVEAVLV
PGLVSSGMSLPSAISAMLIYRLISWLLIAAVGWVVFFFVFRTENIADSDD
EPITGPLPVLPTPGGPPDPTDTALQGPLPPDRNPADPNPDKDV
>MAP1702c hypothetical protein
MVPGSHHQPLSQASGCGPTPSGRHTLAVGSHHWPGSQANGCGPTPSGRHT
LPVGSHHSPSAQSALLTAGVAIATAAIDANANMATTTSWRTNVFMVGLPS
LLPVIEPSPGLRVKAIGQVAGDGPAAKPSDEFVAAHRSAYKHKVGNRMDW
YSSSQSTFGADMDAPPGGGFAVNVFLRDGDTVYRTWHTNGRGTEQLSHSF
GLIDILPWGRQEEWQDSPHGWPSRPTHSGWPDSPAIARAYGPNDG
>MAP2222c hypothetical protein
MLARNAEALYWIGRYVERADDTARILDVALHQLLEDSSVDPDQASRVLLR
VVGIDPPDHDLDVWSLTDLVAYSTGAQGGCSIVDAVTAARENAKSAREVT
SSEIWECLNTTYHALPERERAAKRLGPHDFLSFIERRAAMFAGLADSTLS
RDDGYRFMVLGRAIERVDMTVRLLLSRVGDSASSPAWVTLLRSAGAHDTY
LRTYRGVLDAGRVVEFMLLDRLFPRSVFYSLRLAEHNLDELMHNRQSRVG
ATAEAQRLLGQARSELEFVQPGVLLESLESRLAGLQRTCRDVSDALALQY
FHVTPWVAWSDASQRARLVSRKGDG
>MAP1104c hypothetical protein
MSAADGVSSQTWTKVPAITLGFWVIKVLATTLGETGGDTVTMTLDWGYAA
GVALFGVTLVLLVAAQILAARFHPVLYWATIVASTTFGTVLADFADRSLG
IGYTGGSLLLLACLLATLGLWRWSQGTVSVATVHTPKVEAFYWATITFSQ
TLGTALGDWLADTRDFGYRRGALVFAAGLLVVAGLYFWTGVSRVALFWVA
FILTRPLGATVGDFIDKPVAQGGLAWSRPLATAVLAALIAVLLIVIPQRP
GRHPGRPEAGAAQSPTAT
>MAP2297c hypothetical protein
MTVAPPAGTASTEAVPQHRTLVWPVLAGVAVLAGCTAAGIGTLSLASALT
ATGLPDPGPVTTVGLPFLRAAGEIAAVLAVGSFLFAAFLVPPQPSGVLDA
DGYRALRLGTVASGVWAVCAALLVPLTLSDVSGHPVADLRPAQMWSLAGL
ITTASAWRWTAILAAAITLASLPVLRWSWAPVLLAASLTTLIPLGLTGHS
SAGGSHDLATNGLLIHLVAAALWAGGLLALLAYALRGGQGGDHLGLATRR
FSAIALWCWVAMALSGLVNAAVRVQPSDLLATGYGRLVLAKAAALCLLGG
VGWRQRRVNVAALQAVSTLARARRALLRLTLIEAALFGLTFGIAVGLGRT
APPSPPARLPSIAEAEIGYDFDGPPTLTRILFDWRFDLIFGTAAIVLAGL
YVAGVVRLRRRGDRWPPGRTSSWLLGCLVLLFVTSSGVGRYMPAMFSMHM
VVHMCLSMLVPILLALGAPVTLALRALPAAGRGDPPGPREWLLAALHSRF
SRLLTNPVVATVLFVAGFYGLYLSNLFDTTASSHAGHLLMNLHFLLSGYL
FYWVVIGVDPTPRPIPPLAKLAVVFASLPLHAFFGVVLMGTRKVLGADYY
RSLGFSWHTDLLGDQRLGGGIAWSAGEFPLVIVMLALLVQWARSDRRTAK
RLDRAADRDDDAELAAYNAMLAQLAGRDKPGEGSATGQA
>MAP4348c hypothetical protein
MTGPARSGAAIRGAGRTVARGLIFLIQLYRHMVSPLRPATCRFVPTCSQY
AVDALDEYGLIRGSWLAAARLAKCGPWHQGGWDPIPERPGCRVNCQDASD
AWAVRATRGESGSLV
>MAP0305c hypothetical protein
MTSPEAIADQLARTRARTLRLVDFDDDELRRQYDPLMSPLVWDLAHIGQQ
EELWLLRGGDPARPGMLPPAVEGLYDAFVHSRASRVDLPLLSPEQARAYC
RTVRAAVLDTLDALPDDPDAAFVYAMVVSHENQHDETMLQALNLRSGAPL
LRDTSVLPAGRPELAGTSVLVPGGEFVLGVDAADEPESLDNERRAHVLDL
PAFRIGTVPVTNGEWQQFVADGGYDEPRWWSRRGWQHRQAAGLTAPQFWH
PDARTRTRFGHVEDIPADEPVQHVSFFEAEAYAAWAGARLPTEMEWEKAC
VWDPSTGTRRRYPWGATPPSPAVANLGGAALRPAPVGAYPAGASACGAEQ
MLGDVWEWTTSPLRPWPGFAPMLYERYSQPFFDGDYRVLRGGSWAVEPGI
LRPSFRNWDHPYRRQIFAGVRLAWDVPGARDHR
>MAP2756c hypothetical protein
MVGIPVYLDVESRIDQRALMATSRALVDHFARVGNDISHGLGGSLSKAFA
AVDGTAARRDLLALQQEWRRAADVEADAAARMIRDQRRLAEATVKYGDDS
SRTAAAQAMLARSQRDHIDAMIAAEAAHGRLAKAGNETADSVSRMQKLGA
NPIFNAAGIGSVAAMGIGLVSATDAAGNFQQSLQRLHTVAEESPANLKAI
SDGVLKLSSVVGYAPQKLMDAALGVEKAGYRGSDAIKVLTASAQLASEEG
ADLGETISAVTTTMHDYHIPVEQAANVTSKLNVAVGLSKVSLQDFAGALH
NVEPVAAGVGESVNDLYASLAMLTQSGMGADQATQNMTHAITSLAKPTQQ
MSQEMGQLGLDARDIQEHFGERGLIGTANLLYDTIQSKLGPSGMVTLDAW
FKSKQVADSANEMFSKLPAQAQAVATAIQNNAEAYKQFRKDRGGLSVEEA
NLVEQWWNQEKALTGFNNQLKSGKGDVQSMIQALALMVGGQDNLRTVLQL
VGENGPKAAAAVKEVSQAQADGAGNTKGFTESQETYNAKMKDFKGALSAA
RIEIGQDFLPAMTEVVGVLGESAHWLTEHKPLLDAVTVGVGAMGTAWLAI
KGYNIASTILSPIGTGLGMLKDKLFSVETQATTASTAMRNMGPAAMAGEA
EVLAAADAEVAAEGRVTAAAGEANAALSGGRTGGAGLAAAAGPLGIAAAG
SMAANDIFGSLDRRFHTNLFTKLGEIPGSGAWVAGQLFGHAEGGPLHAPG
PKGHDSALFWGADGEHVLTHHEVQKMGGHSGVYKFRSDLMNGRIVLGRAG
GGALGYGGMAPDVAVASSLAGTPYSQGARDDCSGMVGRVILGAMGLPATN
LPTTKNMGQWLAALGFQPGIGGPGSISVGWYDHGPNPNDGHAAMTLSNGE
NAEAGGSHGNFVIGAGAAGAASSQFDHHMFLPTLYGEGAATGMPGFAAGM
GAGGFGGMGGGIPPGATPGTGPGGQPGYYTANPQRVAAAEERLRHLDAEI
DNAEKRRSELKATAKQSERDRLDEEIRHLKAERTQEQQRLAEAERGTFHA
MHGHRGAGGGENPFLPVPLADQFGLSKGLPGLAEWTVGFLEDLVLGPLET
AAWAAIGQAPPGAGGTGGGFGGLSAPGGLGAARFGFPNPAPLAAPPGAAG
ADDAAAAGLDTARGNTTGQAPSSGGGAGAGGGSEFEFVKSPSKPGLPLGP
LARPPAPDSAPPPADYKSWYGQGASDSFYRTWYPTPLGPVPTPSPNAYGP
PSHGSDWRWGPVHVAPPAPPGPMKSPLEQQQSMLGGAGSGGPLPSGPKPL
PMGPFIGGDPRWGTAHFASGGPSGTDTIPAWLSPHEYVEPSEAVDKYGPG
FMDAIRQGRIDPTSVRYYAPGGEVTDQPEPPPQQQAPAQQPQNMVKAPGA
PGGPAIEPPPGAPKPGDNASIHEPTGPGAVSPGSKQGLSDTATPGADVQQ
PGTGQGALPGIGFSGGIIGGLEGAATQAAAMGADMGTFGGAGGAVSSAMN
IGFQELNRAAAYGAQDVGIGVEGLLEALIPNSDATGADWSKTIPGRLLMG
VTGVRTAGQQNTAGQTQQPFASNASSDQYANVGNTQPQAPIQIMGPVHVQ
ANDPKQLHEGLNSQAAMANSANMIGTQFRGTAGGTG
>MAP2797c hypothetical protein
MAKLDYDALNSAIRYLMFSVFAVRPGALGDQRDEVVDDASRFFKQQEERG
VVVRGLYDVAGMRADADFMIWTHAETVEALQATYADFRRTTALGRVCSPV
WSSVALHRPAEFNKSHIPAFLAGEEPGAYICVYPFVRSYEWYLLPDEERR
RMLAEHGMAAREYKDVRANTVPAFALGDYEWILAFEAPELHRIVDLMREL
RATDARRHTRAETPFFTGPRVPVEQLVNSLP
>MAP1679c hypothetical protein
MAAALPLVKLLRDTVAVRMSVAVLLGVAVAVVVGNTVGWRFALVGWVVTA
GVYVVWTQLILFGMDAEQTRVWATREDPTRWVADAVILSASVASLAGVGY
VVAAGSHAGARAVAAAVLGILAVAASWFAVHTLFTVHYARLYYSGQPGGI
NFHDPEPPRFRDFAYVAFTVGMTYQVSDTEIGLTAIRSTVLRHALLSYLL
GAVILAVTINLIAGLGAKL
>MAP2024c hypothetical protein
MTARRGNLNVYRTLANAEKVFTSWMVAGDAALSSPVLPRRLRELIVLRTA
SAMDCAYELGQHRDVARTVGIDPDTIDAVISETGWQAGDLTPTELAVLHL
TTELVTTRRVAQPLFDRVHRALGTEATVEALMVINRYAGLALMLNALEVD
LDETARLPIPPTS
>MAP1338c hypothetical protein
MVSQPKTVSRLLDAEEVDQRHGVARRVCQFGGPGCGNGQFRRCQNNPIPE
GSAESIALRRFSPIAGPTANSPAAVCEPPHRARPKVTSIMSMPTLEPTFE
SGAGDIVAEPAPRRPRGRLLDPWAIAVLATALSAAWACRPSLWFDEGATI
SAAANRTLPELWRLLGHIDAVHGAYYLLMHGWFALFPPTEFFSRFPSALA
VGAAAAGVTVFTRQFAPRRTAVCAGAVFALLPRMTWAGMEARPYAFVAAA
AVWLTVLFVAAVRRGAPRRWVGYALALMLAILLNLNMVLMVPVYGVMLPL
LTARGARRSAALWWAGSSAVAVGAMTPFLLFAHNQVWQVNWIYPVSWHYA
FDIILRQYFDHSVALAVLSAVLIVAAAVARLAGVPAPPGDLRRLLILCAA
WMVIPTALVVVYSAVGEPIYYPRYLIFTAPAMAIVLAVCIVTLARRPWPI
AGAVLLCAVAALPNYLFVQRWPYAKEGWDYSQVADLIGSHAAPGDCLLVD
NTVPWRPGPIRALLATRPAAFRSLIDVERGAYGPKVGTLWDGHVAIWLTT
AKINKCSTIWTITNKDNSLPDHQSGQSLPPGSAFGQAPAYRFPGYLGFHI
VERWQFHYSQVVKSTR
>MAP2740 hypothetical protein
MNVEPLLHSIPPLAVYLVVGGVVGIESLGIPLPGEIVLVTAALMSSHHDL
AVNPLGVGVAAVIGAVIGDSIGYAVGRRFGMPLFERLGRRFPKHFGPGHV
ALAEKLFNRWGARAVFLGRFIALLRILAGPLAGALKMHYPRFLTANVSGA
ICWAGGTTALVYFAGVAAERWLERFSWIGLVVAVVAGLTAAILLRERTSR
AIAELEAEHYRKTGNTATDPV
>MAP0161 hypothetical protein
MSDPITYNPGAVADFATDVASRAGQLQSIFDDTSNRTHALQEFFAGHGAS
GFFEAQAQMLSGLQGLIDTIRQHGQTTSHVLDSAISTDQHIAGLF
>MAP3634 hypothetical protein
MSGGMPMSGWTRGTLFAALNAAVVSVVGLALVLSAGPALADPDPAPADPG
AVAAPPGPPAPPDPLAPPPPPDPLAPPPPAAPPAPWLPPAAQPAAAPAAG
QDPTPFTGTPPFGPPTFVPKTGSTVGVAQPIIINFPGRVDDAGAAISAVH
VSSVPPVPGKFYWMTPTQLRWRPLSFWPAHTAVTVDAGGTVTNFQTGDTL
VATADDATHQLTVTRNGTVEKTFPMSMGMTAGNHQTPNGTYYVQDKKASV
VMDSSTYGVPVNSTYGYKVTVEDAVRFDNVGDYVHSAPWSVDDQGKRDVS
HGCINISPANAKWFFDNFGPGDPIIVKNSSGGDYKKNDGSADWMN
>MAP0721c hypothetical protein
MRLQPLPAEQWDEATRQALAAMRGADTNNALSTLAHHPALAKAFLRFNVH
LLTASTLPPRVRELAILRVAHRRQCAYEWSHHVSMAKDEGITDEQIAAVR
CFAGDGAGPFDAFDHAVLAGVDELDEKSELSDRTWAALGERLDDRQRMDY
VFTVGCYTLLAMAFNTFGIQLEHAEQH
>MAP1224c hypothetical protein
MEGVTGSATSKIAETLRDLGCAIGAAARGVSRSRIAWTVAGITALVVLAS
LIPLPSPVQMRDWAQSVGPWFPLAFLLAHIVVTVVPVPRTAFTLAAGLLF
GPLLGVAIAVAASTASAMIAMLLVRAAGWRLTRLVRHRSMDTVEERLRQR
GWLAIVSLRLIPAVPFSALNYAAGASSVRVLPYGLATLAGLLPGTAAVVI
LGDALAGHPSSLLYLVSALTSALGLTGLVIEIRHFRRHHRRAHRHRDDEP
SPEPATIG
>MAP0434 hypothetical protein
MRTFGVSLLVTAAALVLGYAYGGPKSLYLLLVLAALEVSLSFDNAVINAA
VLKQMSRFWQRMFLTIGILVAVFGMRLLFPLLIVWATAGLDPVRALELAL
RPPPHGALEFPDGSPSYQKLLTAAHPQIAAFGGIFLLMLFLDFLLIDRDI
KWLKWIEVPFARIGRLGQVSVVLSGLTLVLVGTGLTHSSQEAVTVLTAGL
LGMVAYLVVNGLSRAFRPSGVGQPQPAGPATGWAGLSLFLYLEVLDAAFS
FDGVTGAFAITSDPVVIVLGLGAVGSMFVRSITIYLVHQETLDRYVYLEH
GAHWAIGALAVIMLASIEPRLTVPEPVTASVGVFFIGTAVGFSVLRRRRE
SGRASAPGP
>MAP1841 hypothetical protein
MWVGWLEFDVLLGDVRSLKQKRSVIRPVVAELQRKFSVSAAETGSHDLYR
RAGIGVATVSGDRGHAVDVLDAAERLVAAHPEFELLSVRRSLVRSDDLG
>MAP2523c hypothetical protein
MTKKLRVIQWTTGKVGKLSLRGVLDDPRLELVGVYAFSEEKAGSDAGALC
GRPDTGVLATTDIDALLALKADTVIYTPFMADIDDVVRLLESGLDVISTN
LFLNVGGIQGETKQRLAAACQRGGSSLYITGVNPGWINTMVTAMTAVCRD
VEMVSVSESADVSVYESPETWQSQGFSLSEAPPAVIETAKMWLSTFRDSV
QRMAVALGFELDDMEFFIEHATASERVDLGWWVIEKDTIAAMRAGWNGKV
NGKTVVQSNVAWYMTKLLNEGWQFDDDHYHVVIKGEPGVDTRIRFLPPDS
WGNHEWDTMTAMPAVSAAVDVAAAPAGILTLKDVGLPCAPAGLWLEG
>MAP0875c hypothetical protein
MRSIWKGSIAFGLVNVPVKVYSATEDHDIKFHQVHAKDNGRIRYQRVCEL
DGEVVEYRDIARAYESDDGQMVIITDDDIATLPEERSREIEVLEFVPANE
VDPMLFDRSYFLEPDSKSSKSYVLLAKTLAETDRMAIVHFTLRNKTRLAA
LRVKDFGKRDVMVIHTLLWPDEIRDPDFPILDKKVEIKPAELKMAGQVVE
SMAEDFNPDRYHDDYQEQLHELVQAKLEGGEAFTTEEQPKQLDETEDVSD
LLAKLEASVKARSGDGKAPAKKSPAKKTAAKKAPAKKGAAKKAPAKKAAS
RS
>MAP0014 hypothetical protein
MVQTRQSPWRFGVPLVCLLAGLLLAATHGVSGGAEIRRSDAPRLVDLVRE
TQASVNRLSAQREQLAAKIDAAHGRSSDAALAAMLRRSAQLAGEAGMSPV
HGPGLVVTLQDAQRDANGRFPRDASPDDLVVHQQDIQAVLNALWSAGAEA
IQMQDQRIIATSVPRCVGNTLLLNGRTYSPPYTITAIGNAAAMQAALAAA
PLVTLYKQYAVRFGLGYQEEVRSDVQVVGHFEPDRLHFAQPNGPIGY
>MAP1580c hypothetical protein
MTVRIGTSGWSYDHWADVLYPPGTPSARRLARYIEVFDTVELNASFYRWP
KDSTFAGWREQLPDGFTMSVKAHRGLTHYRRLASPEPWIERFERCWELLG
DRRGVLLVQLHPEQQRDDARLDSFLERMPASIRVAVELRHPSWNDPAVYA
LLERRRAAYAVTSGLGLACIPRATTDLVYVRMHGPDPAANYAGSYSDDDL
RCWAERITAWDGDGKDVWMYFNNDLGGHAVRNALALRELVG
>MAP2087c hypothetical protein
MAAPVSLREDQLTRLVAVFPGSPSEARVAALIRRVCAQTSSLPPLPAPME
VGEPESETEAAVAEFAEQFSADVSAITHAQRSRLSKQLGDRTFGVVVQMY
IADFVLRVRAGLEALGVGSRYLGWLSGPISWDHGSDPSDLVFNDFLIVVA
RMRALDPVTSELVRLRGAAQHHCRLCNSLREGSALDAGGSETLYEEIERF
ESSGLLDERAKAALRYTDALIWTPAHLVADDVAEVRSRFSEAEAVELTFD
IMRNASNKVAVSLAADAPRVENGTQRYLIGADGQTVFS
>MAP0605 hypothetical protein
MMDERRRKGLEKMNEVYGWEMPNVEGDAYFDLTVDHLFGSIWTRPGLSMR
DKRIMTLTAVTAIGNRDLAEIQINAALLNGELTETELKEMAVFLTHYLGF
PLGSALNGAVDAVVAKRRKAAAKGAGEDKKANVDAALKMHSGGERG
>MAP4036 hypothetical protein
MIPVTLLVVAKAPEPGRAKTRLAAAVGERAAAEIAAAALLDTLDAVAAAP
VAARVVALTGDLDGAARGAEIRARLASFTVIAQRGNDFGARLANAHADAA
DGLPVLQIGMDTPQVTAELLAGCARRLLDAPAVLGLARDGGWWVLGVAAP
VLADCLRAVPMSAADTGELTLKALRDNGIEVATVQTLVDVDVVGDVAVVR
DACPPASRFARATRAAGL
>MAP0728 hypothetical protein
MADGRKGYAILTEAIKDPEGMKAYAKAAGSAMSGATVLAVDTAPTVIEGT
WHGDQTVVLEFESVDAARAWYESEGYQKAAKLRQAAADCNAVILAGF
>MAP1570 hypothetical protein
MDVTAATEYLARSTTLTSVGIIGYIIIGGLAGALASKIVRGSGAGILMDI
VIGIVGALIGGFILSFFVNTAGGGLIFTFFTALLGSVILLWIVGMVRRT
>MAP3619 hypothetical protein
MYKVGVWGPGSMGVIALRGVIDHPQLELVDLVVHSDAKAGRDAGELCGVA
PVGVVATQDPAAMLAGDADVVVYAAGANLRPLEAVEDMVSLLRAGKNVVS
CSVVPLVFPDAVDSAFSEPLRAAALEGQVSFFTTGIDSGFANDVLPLVLT
GVSRVIESVRVTEMFNYATYPDASAVYEILGFGQPPDYPAFAAQPGIFTF
GWGPVLHQLAAGLGVEIDHIEESNERIPAPESFDTPTGHIAAGTIAAMRS
TLTGYVGEKPTFVLDHVTRMRDDLAPDWPQPRIAITPKDLGYGLASGRGL
YRVEIEGSPSMRCEFEMAEDHDHDLGARIAGSSRMVNAIPAVCAAPPGLL
SALDLPLITGAGLVRPVLGPPPDSRLF
>MAP4073 hypothetical protein
MPVSEPAGYVEVRAYAELNDFLPAESRGAAVRRPFRAHQTVKDVLEAMGI
PHTEVDLIVVNGSVHGFDHRPRAGDRIAAYPMFEALDIGPTARLRPVPLR
DPRFVVDVNLGRLAWLLRLFGFDVWWSNDADDQTLAAISAEQHRILLTRD
RGLLKRRAVTHGLFVRPDDPEEQALGVIRRLDLTGRLAPLSRCVRCNAGP
>MAP1066 hypothetical protein
MRGSAVPSLVRTAVRAGGKRLGAVWFNLLQTSLAAGLSWYLAHDVLDHPQ
PFFAPIAAAVSLSTSNVLRAQRAVQMMIGVTLGIGLGTVVQGMLGPGALP
IAVAAPVALGAAVFIGGGFIGHGMMFANQTVVSALLVLALYRGGAGPERI
FDALIGGAVAIVVAVLLFPADPRTVLGAARAGVLAVLHDVLSRAADVSSG
RRAAPPDWPLSAVDRVHEQLSGLLEARTTAWHVVAIAPRRWGLRDAVRAA
DHQAVHVALLAGSVLQLARAVAPGPGDRQGQPVSTVLLVLAAATALADRD
PAGACVYLGSARRHAARLRSGDGGDREPHVALADAVGACVDDLQRVIDLR
PG
>MAP1016c hypothetical protein
MSEPGSATDTDTGDEPAQQQTAPDGTRTGAQTAVAEHPAPKAPAPGTQQP
EKPWWVRHYTFTGTTVGLVFIWFSLTPSLLPRGAIFQGLVSGISGAIGYG
LGVFAVWLYRYLWAKPSSPPPPRWAWKILIPVGAVGMVLMAIWFHVWQDK
VRDLMGVAHLKWYDYPVAGVLSLVVLFTCVEIGQFTRWLVSFLVGRLDRI
APFRLSATIVVALLVVLTITLLNGVVLKFAMRTMNNTFASANNEMSPDTA
PPKTPLRSGGPESLVSWESLGHQGRVFIEGGPRVEQLTAFNGAPATEPIR
AYAGLNSADGITATAELATRELQRTGGLRRAVVAVATTTGTGWINEAEAT
ALEYMYNGNTAIVSMQYSFLPSWLSFLVDKENARHAGQALFEAVDRLVRQ
LPEAQRPKLVVFGESLGSFGGEAPFMSLNNVLARTDGALFSGPTFNNTIW
TDLTSTRDAGSPEWLPIYNDGRNVRFVARAANLARPKDPWDHPRVVYLQH
ASDPIAWWTTDLLFARPDWLKERRGYDVLPETTWIPVVTFLQVSADMAVA
QNVPDGHGHHYVADVADAWASVLSPPGWTPDKTDRLRPLLHANG
>MAP1890c hypothetical protein
MVLFFQILGFALFIFWLLLIARFVVEFIRSFSRDWHPTGITVVVLEIIMS
ITDPPVKLLRRLIPQLTIGAVRFDLSIMVLLLVAFIGMQLAFGAAA
>MAP1990 hypothetical protein
MRPVHPQLTARVEHHRSLPVLVWSFAEPRLCISSGPLGGGIGARDWLVNA
TVPLDYDRTDPHRHLVEIGAALGLAGTGCGLLTAVDVTRHHLGADGGVQV
TATVGLSSPAWAAAPDHHFRREAPHRVGTINIVVAAPVRLSEAALVNAVA
TATEAKAQALHEAGIRATGTASDAVVVHCPTDGAAEAFGGPRSTFGARIA
RAVHAAVLAGARSWMSGAAHRPPAGNPTPPRHFEPASAAGSAPRAAGRRA
GHRVGADGIEPPTAGV
>MAP1317c hypothetical protein
MCQTAAMTMSPGSPVPSLLPHLWKSALLSGILSLALGVLVLVWPGISILV
AAVAFGVYLLITGIAQVVFAFSLHVSAGSRVLLFISGAASLILALLAFRH
FGQGYAILLLAIWIGIGFIFRGVATTISAVSDPNLPGRGWNIFLGVISLL
AGIVVLASPFESIVTLAIVVGAWFVVIGVFEIISSFGIRKASKTLAG
>MAP2663c hypothetical protein
MTPYDVGLLILRLVLGVTLAAHGYNKFFGGGRIPGTARWFESIGMKPGKF
HATVAASTEMAAGLGLAAGLLTPIPAAGFVSLMLVAAWTVHRPNGFFIVK
EGWEYNLVLAASAVVVATLGAGRLSLDWLVFGKNWMDGWNGLLISLLLGL
AGAIGQLVIFYRPPAKQTG
>MAP0183c hypothetical protein
MPPRKRRAPRLLVAVAALCLGSVAWSPMASAHVHAGSDNPVRGAMAVVTF
QVPNESNTGAATTALTVALPNVAAAHTETMPGWTARLDRDAASGTVRSVT
WTAAAGGGIGPDQFALFRLSVKLPDADTVSFPATQTYADGTVVKWDQPPL
PDGGEPEHPAPTLALAAGPAAGHQHPGGPAAADNAARWLGGAALVLAALG
IAIALVRRRA
>MAP0332 hypothetical protein
MPNTYRVVQWNTGNVGKSSLKSIVTNPTLELVGCYAWSAEKVGRDAGELV
GIPPLGVAATNDVDELLALKPDCVVYNPMWIDVDELVRILSAGVNVVTTA
SFITGGNLGDGRDRLLQACQQGGATIFGSGVSPGFAELLAIVSAMVCNRI
DKVTVNEAADTTFYDSPETEKPVGFGQPIDHPDLPAMAAKGTAIFGEAVR
LVGDALGVELDDVRCVAEFAQTTEDLVMASWTIPAGHVAGTYISWQGIVG
DQVLIDLNVRWRKGQTLDPDWKIEQDGWVIQIDGQPTVTTKVGFLPPPYF
EATTIEEFMDLGHIMTAMPAINAIPAVVAAAPGIASYADLPLTLPRGNAH
VAGQP
>MAP0286 hypothetical protein
MMTAEFLEHQRSIGNDLLTPVPEYRFPGLLPGDRWCVTALNWLRAHHDGC
AAPVVLASTHESTLEVVPLEALQEHKIDVPDDLANL
>MAP2643 hypothetical protein
MYCLLVLAVGLERVAELLVSTRNARWSFTQGGKEFGRSHYPVMVFIQTAL
LAGCLVEPWALHRPFLGWLGWPMLAVVAASQGLRWWCITTLGRRWNTRVI
VLPQAPLVRDGPYRWLHHPNYVAVVAEGLALPLVHTAWLTAAVFTLANAA
LLRVRLRVENSALGYT
>MAP4100c hypothetical protein
MKLRLAIRELHRSERKLAHQLTVLAARHHSDQDIFHLARDLAGWSHRHLG
ELARHGRHYGLRLSADPRTAARTGVVQQRISGLLRHRPEPGLVLLADLRR
IHRLAAGVSLDWELLAQGAQAAKDAELLGLASRCHPETLRQMRWANAMLK
ELSPQVLMN
>MAP1029 hypothetical protein
MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGERGDILNP
AMDLPEVHGHIAEIRRDEMAKAAEILGVEHTWLGFVDSGLPKGDPPPPLP
EGCFALVPLEEAIEALVRVVREFRPHVMTTYDETGGYPHPDHIRCHQVSV
GAYEAAGDYRRFPDAGEPWTVSKLYYNHGFLRARMQLLHDEAVKHGHEPP
FKKWLEHWDPAHDPFESRVTTRVECSAYFSQRDDALRAHTTQIDPDHDFF
AAPIAWQQRLWPTEEFELARSRVPVRLPEDDLCARAPGWPRRFTPR
>MAP3812c hypothetical protein
MRTGVRCVLATVGVAACVVVTPAGVSLAAATQSHAFAIASVLPSSGEMVG
VAHPVVVTFRAPITDPAKRHAAEQTIDVKSTPAMSGKFEWLDNRVVQWVP
DRYWPAHSTIALTVGGVSTEIKTGPAVIGVASISEHTFTVSIDGVEAGPP
TSLPAPHHRPHFGEQGVMPASMGRPEFPTPVGSYTVLSKERAVTMDSSSV
GIPVDDPDGYLLTVNYAVRITNRGLFVHSAPWAVRSLGLENVSHGCISLS
PDDAEWYYDHVNVGDPVIVQD
>MAP0303c hypothetical protein
MMTLSLSNHLAADSAHHALRRDVLDGLRRTPKSLPPKWFYDSVGSDLFDQ
ITRLPEYYPTRAEAEILRAQAPGIASASEADTLVELGSGTSEKTRVLLDA
LRERGALRRYVPFDVDAGILSASAAAIQREYAGIEIQAVCGDFEEHLTEI
PEGGRRLFVFLGSTIGNLTPGPRAQFLAALSAQMRPGDSLLLGTDLVKDT
ERLVRAYDDAAGVTAAFNRNVLAVINRELDADFDVDAFEHVARWNADEER
IEMWLRATVRQRVRVGALGLTVDFRAGEEMLTEVSCKFRPERVSAELAAA
GLRRTRWWTDGAEDFGLSLAVK
>MAP2668c hypothetical protein
MPSITPSLWFDDNLEEAATFYTSVFPNSHIEGFNRTTEAGPGEPGTVLSG
SFVLDGSRFIGINGGPHFRFSEAVSFTVHCKDQDEVDYYWDRLSDGGEES
QCGWLKDRFGLSWQIVPDRLFELIGDPDRSRAAAATQAMYGMRKIVIAEL
ERAAASS
>MAP2903c hypothetical protein
MMLATTTLALKEWSAAVHALLDGRQRVLLRKGGIGEKRFELAAGEFLLFP
TVAHSHAQRVRPEHQDLLAPAAADSTDDELVIRAAAKVVAAVPVNRPDGL
PAIEDLHIWTAESVRADRLDFRPKHKLAVLVVSVTPLAEPVRLARTPDYA
GCKSWVQLPVHARLGPPIHDDETLAAVADRVRDAVG
>MAP3862c hypothetical protein
MSTAVTALPDILDPMYWLGADGVFGSAVLPGILVIVFIETGLLFPLLPGE
SLLFTGGLLAAHPNPPASIWVLAPAVAIVAVLGDQTGYFIGRRIGPALFK
KEDSRFFKKHYVTESHAFFEKYGSWAVILARFAPFVRTFVPVIAGVSYMR
YPVFLGFDIVGGIAWGGGATLAGYFLGNVPFVHHNLEKIILGILVVSLMP
AFVAAWRGYRGRRRPTRTPERESERLPASE
>MAP2993 hypothetical protein
MATVVAFHAHPDDEVVLTGGTIARAVAAGHRVVVVTATDGRVHNEDTDHR
LDELRSSARILGAQRVECLGYADSGYGPLFYPDPPGRTRFGRADLDEAAG
KLAGILRDEHADLLLSYQANGGYGHRDHVRVHHVGKRAAELAAVPRVLEV
TMPREMLLRISDLAHLLRLPGPYEADIVGSAYAPRAAITHRINVFRFARQ
KRDAFAAHRSQIGRSGPAARLFGLLLRLPPQVFGALFSHEWFVDPALPTG
TVRRDIFD
>MAP0870c hypothetical protein
MPIPVVQLGTGNVGIHALRALITDPQFELTGVWVSSDAKAGKDAAELAGL
AQPTGVLASTDLDAVLATGPRCAVYNALADNRLPEALDDYRRVLAAGINI
VGSGPVFLQYPWQVIPDELIKPLEEAAQQGNSSLYVNGIDPGFANDLLPL
ALAGTCQSIQQIRCMEIVDYATYDSAAVMFDVMGFGKPLDDTPMLLQPGV
LSPAWGSVVRQLAAGLGVSLDEVTQQHVRVPAPEDFDIASGHIAKGTAAA
LRFEVFGMVDGRPVVVLEHVTRLREDLCPDWPQPAQHGGSYRIEVTGEPS
YAMDICLSSRKGDHNHAGLVATAMRVVNAIPAVVAAPPGIVTTLDLPLIT
GRGLYRPE
>MAP0437 hypothetical protein
MTPAPDLTLGTAVDWRFAATVGRRLARPGPPSTDYTRRQVIDELAGAATK
AEPLVRQVTGLLTDGADDGRVPAARIVDRPEWIAAAAESMRVMMNGTERP
RGFLTGRVTGAQTGAVLAYVSSGILGQYDPFATDRGAGAGCLLLVYPNVI
AVERQLRIEPSDFRLWVCLHEVTHRVQFTANPWLPGYMSQALALLTRDTG
DDLGQVLSRLADYARNRGNPASQADANSTGILGLVRAVQSEPQRQALDQL
LVLGTLLEGHADHVMDAVGPMAVPSVATIRRRFDERRHRKQPPLQRLLRA
LLGLDAKLSQYTRGKAFVDQVVGRVGMARFNAIWTGPETLPLPVEIEEPQ
RWIDRVL
>MAP0349 hypothetical protein
MKNSGRSTMSSPSKLRVIQWATGGVGKAAIQCVLNHPRLELAGCWVHSAE
KNGVDVGRIIGTQDLGVTASTSVDEVLALDADCVVYSPLIPNDDEVIAIL
RSGKNVVTPVGWVYPDPGNPRHRAVADAALESGVTLHGSGIHPGGITERF
PLMVSSLSSAVTHVRAEEFSDIRTYNAPDVVRHIMGFGGTPEEATGGPMA
GLLDGGFKQSVRMIADHMGFRIDPNIRTIQDVAVATADIDYDPFPITAGT
VAARRFRWQALVDGEPVITAAVNWLMGEQNLDPAWDFGGRGERFEVEITG
DPDVSLTFKGLQPETIAEGLVKNPGVVVTANHCVNAIPDVCAAAPGIKTY
LDLPLFAGRPAPGLATS
>MAP0992 hypothetical protein
MVDRADLDAVARQLGREPRGVLAIAYRCPNGEPGVVKTAPKLPDGTPFPT
LYYLTHPALTAAASRLETTGLMREMTERLSQDAELAAAYRRAHESYLAER
DAIEPLGTTFSAGGMPDRVKCLHVLIAHSLAKGPGVNPFGDQALSILAAD
PALAGVLQEGRW
>MAP2161c hypothetical protein
MVATLFYTDELPDAGSLAVLGGDEGFHAATVRRIRPGERLVLGDRAGGLA
RCEVEHAGRDGLRARVLERWTVAPPNPPVTVVQALPKSERSELAVELATE
AGADGFLAWQAARCVANWHGPRVDKGLRRWRAVAKAAARQSRRPHIPTVD
GVLSTASLTSRIRDEVAAGAVVLALHESATDRLTDVPVAQAKSLFLVIGP
EGGIADDELAALRQAGALSVRLGPQVLRTSTAAAVALGALGVLTPRWDDA
ASP
>MAP0100 hypothetical protein
MTLSTMPRTVGIEEEFHLVDLTTRRLAPRAPELLGLLSDGYVAELQSCVV
ETNGSVVSTLAELRADLTERRRVLVDTAATLGLGVVAAGAVPLSVPSEMH
VTQTSRYQQILADYQLLAREQLICGTQIHVGIDDRDECPGGRSGSRLCSH
SACLEREFAVLVRRI
>MAP3004c hypothetical protein
MTGAVATEATGLTPDSPAGPAVPALSAWSVLVAGVIGLVASVTLTLEKID
ILLDPAYVPSCNINPILSCGSVMITPQASLLGFPNPLLGLVAFTVVVVTG
LLAVTKVVLPQWYWIGLAAGLVVGAVFVHWLIFQSLYRIGALCPYCMVVW
VVTIALLVVVASIAYRPALGDRRSGPGWLLFQWRWSIVALWFTAVFLLIM
VRFWDYWSTLL
>MAP3468c hypothetical protein
MTAWVDREFERHDFTDEDLVGLSTERVVFTECNFSGANLAESRHRASAFR
NCTFRRTSLWHSTFEQCTMLGSVFEQCRLRPVTFDEVDFTLAVLGGNDLR
GGAADPALWTTASLAGARVDVDQAVAFALARGLRLDG
>MAP1586 hypothetical protein
MCHRSHMDSAADPFDLKRFVDAQEPVYAAVVDELRAGRKRSHWMWFVFPQ
LRGLGGSAMADRYGISSLPEARAYLRHELLGPRLHECARLVERVQGRSVG
QIFGSPDDLKLCSSMTLFAHATDDNADFLAVLQKYYDGRQDPVTLARLAD
S
>MAP3557 hypothetical protein
MAHPEIKEPSAGHPITIEPTRGRVQVRVNGELIADSSAALELREATLPAV
QYIPFTDVAQDRLTRTDTSTYCPFKGEASYYSVTTSAGDTVDDVIWTYEQ
PYPAVAAIAGHAAFYPDKAEISISTD
>MAP1547c hypothetical protein
MAEDVRWFAGSPLAAAFGRLALDQVAHREIAAAVDRSGRFADNFTDRGIR
SAAFTALAAFGDSSDVEASRADLKRLHRDVRGTGKGAFSDTRYSALDPEL
WTWVAVSGLNLLYQAYLRVCGHRLSTDEKEVVYQTLRRELQFLELPSKQG
KLPATLDEMLDYYDTVAAKHLADNEFLQFASRSFVAPPVPGLLPRQLRPV
LRLVWPVLTSLAARPVVVCSAAVAHPTMRRLLGVRWGAREQAEFAVYVAA
LQLGWRWLPRRLTLEPLAYNRYQYERLRDRYRSVLLDSFAAPGRG
>MAP3154 hypothetical protein
MSVAPETTVALQDRFFRELPELAVRWQAETFPELRLLVLNEPLATQLGLD
TGWLRGPDGLRFLTGNLVPTGAAPVAQAYSGHQFGGFVPRLGDGRALLLG
ELVDNKGRLRDIHLKGSGATPFARGGDGLAAVGPMLREYVVSEAMHALGV
PTTRSLAVVGTGRPVYREATLPGAVLARVASSHLRVGSFQYAAATGNRDL
LRRLADHAIARHHPGAADAEQPYLALFEAVVAAQASLIAQWMLIGFVHGV
MNTDNMTISGETIDYGPCAFMEAYDPDTVFSSIDFWGRYAYGNQPVIAGW
NLARFAETLLPLFSENTEEAIALAERSFGVFQTRYDAVWATGMRAKLGLP
AQVDAEFAAALIDELLALLKANHVDYTSFFRQLGRAARGDDRSAAEPARE
MFMDLPGFDAWLARWRALGPDADAMDRVNPIYIPRNHLVEEALAAATDGD
LDPLDQLLAAVTAPYTERPGFERYASPAPEDFGKYQTFCGT
>MAP1704c hypothetical protein
MTASNPTVWLTLQAHNVPKLIDYYVETFGFVLTARYGDGETVDHAQLNWP
EGSGGIMLGSHKPGAEWCREPGTAGGYVVTADPDALYRRVRQHNADIIRP
LAETDYGAREFTVRDPEGNLWSFGDYGGANPPN
>MAP2082 hypothetical protein
MNLGFLDFTSIEQRRADCDEELRLNRRFSPQVYLGVVDITEQNGHYRVGG
EAGSGEPAVWMRRLPEDGMLPAKLAGGDVDTRLARRIGRTLAKLHGRAET
GPDIEAYGSPSSVIANWQENFDQMGPFIGRTISPAINDEIRSYVQEFVGQ
QAALLERRVTEGHVRDGHGDLHAASVCIADGQIVLFDSLQFAPRYRCADL
ASEVAFLAMDFEYHGRGDLAWAFVDSYVRASGDDELPSLLDFYMCYRAYV
RGKVRSLRLAQTEKVPGGEQEALIAESRGYFDLAWAHAGGLPRPLMVVTM
GLPASGKTTLARALAGRLGLVHLSSDVARKRMAGIPPTRRGSDEFGSGLY
DPAMTRNTYAALRRDAARWLRRGRGVVVDATFGNPGERAQLRQLAHRLGV
DLHVVLCDADDDTLIARLKRRATEQGVVSDARIELWPQLRAAFTPPDEQA
SVLRVDATRDTEETVEQALGLLRARYRSS
>MAP3199 hypothetical protein
MTTLETLLHDPEMAGVWNLVPDRSAITFRIKNMWGLLTVRGRFTDFTGDG
QLTGKGAVFGRVDIRAASLDTGIGRRDQHLRSPDFFDVERFEKISVVVTG
LQPTKGKIADLRTDFTVKGVTAQLPLPVTILELDDGSIRITGETTLDRAR
FDLGWNRFGMIGRTATAAADVIFVRDSQ
>MAP3520c hypothetical protein
MRRAGRYLFVMVAITVMALVAPVGRGGASVPLPQPVPGIASILPANGAVV
GVAHPIVVTFTAPAADRAAVERSIHVVSPAGVRGHFEWADDNTVVRFVPN
RYWPAHGHVSVGVQALTTGFDTGDALLGVASISKHTFTVSRNGEVLRTMP
ASMGKPSRPTPIGNFTALEKQRSVVMDSRTIGIPLSSPEGYKITAQYAVR
VTWSGVYVHSAPWSVDSQGYANVSHGYINLSPDNAAWYFNEVNVGDPIQV
VA
>MAP1612c hypothetical protein
MASDNASTIGPADGELLLHTGVTGRAARMGHRLTIAMTRWRAGVSWAGSR
PVRAELAVEVDSFEVLRGEGGVKGLSTAEKALVRSNALKSLNASRFPEIR
YTSDVIEHTGDGYRLTGTLQIRGAARNHVIDLGAEDLGEAWRLSVETTVR
QSDYGIKPFSLLMGSVQVADDVSLSFTAVHAKDRAEDR
>MAP0548c hypothetical protein
MAEPFIGSEAVASGLVTPYALRSRFVRVHPDVYVPAGTALSAGLRARSAW
LWSRRRGVVAGRSAAALYGTKWIDDRAPAQLLYPYRRPPDGIQTWSDRLV
GDEIQTIGGMPVTTPARTALDIACRSPVDKAVAAIDALARATELKVLEIE
LLADRYRGRRGITRGRSVLPLVDAGAESPRETWLRLLLLRAGFPRPRTQL
PVRQYGALIACLDMGWEDIKLAVEYDGDQHRTDRRQFTKDIRRAEVLAEL
GWTVVRVTAEDTPAGIIARVSTAWTRRTCTGSEKPAGNSR
>MAP1129 hypothetical protein
MEVPYAPRLVAGAWVVAGWVALAYGIYLTVLALRSPPGVELTGHWVLQPA
FKASMALLLTLAAAGHGQVRERRWLMPALLLSAVGDWVLALPWWTLSFLV
GLAAFLFAPMCFIGVLLPLVPLPSGASGPGRPSTPRIAAAVLMCLASIGL
LVWFWPRLGPDKLTLPVTLYIVVLTAMVCAALLAKLPTIWTAVGAVCFAA
SDSMIAIGRFILGNEALAVPIWWAYAAAQILITAGFFFGREAAGDAAGEA
TGDATGDAAEPVE
>MAP1070c hypothetical protein
MGFADKTFGTGGPSAAAGGSYDADRLLAGYRAARAQQALFDLRQGPASGY
DEFVGPDGKVRPAWTELADAIGERGRAGLDRLRSVVRGLIDHDGITYTDV
EPGGRGQEPRPWQLDTLPIVLSAADWEPLEAGLLQRSRVLDAVLADLYGP
RSLLTEGVLPPELLFGHPGYLRAANGIEIPGRHQLFMHACDVSRRPDGGF
AVNADRTQAPSGAGYALADRRVVAHAIPDLYERIAPRPTTPFAQALRLAL
IDAAPDVAQDPVVVVLSPGIYSETAFDQAYLATLLGFPLVESADLVVRDG
MVWMRSLGTLKRVDVVLRRVDAEYCDPLDLRADSRLGVAGLVEAQHRGTV
TVVNTLGSGILENPGLQRFLPAMARHLLSETLLLPSAPVYWGGIDTERSH
LLANLASLLVKSTVGGKTLVGPALSSLQLAQLAARIETAPWQWIGQELPQ
FSSAPTDHAGVLSSAGVGIRLFTVAQRGGYAPMIGGIGYLLAAGPAAYTL
KSVAAKDVWVRPTERARAEAVGVPAVEPPAKTAAGTWAVSSPRVLSELFW
IGRYGERAESMARLLIVTRDRFHVYRHHQHSEESECVPVLMAALGRITGT
DTGTGAAAGGDAAETIAVAPSTLWSLTVDPQRPGSLVQSVEGLALAARAV
RDQMSNDTWMVLAGVERALALDSEPPDSLAEADALLTAAQTQTLAGMLTL
SGVAGESMVRDVGWTMMDIGKRIERGLWLTALLRATLAVVRGAAAEQTIL
ESTLVACESSVSYRRRTAGKVSVAAMAELMLFDAQNPRSLLYQVERLRAD
LKDLPGASGSSRPERLVDEIGTRLRRSHPAELERISDDGRRTELAELLDS
VHAELRSLAEVLTTTQLALPGGMQPLWGPDVRRVMPA
>MAP3937 hypothetical protein
MRTRYRKVFRDVYIAKDAELTPAGKARAAWLSTGATLAGLSAAAIHGTKW
LDAAAPAEIVRADRHGQRGILVRSYTLADDEADSVSGMRVTTAARTAFDI
GCGLPAAKALPILDALLNATGIKPADVVAVADRHRGARGIRRLRASLELA
DGGAESPQETRLRVLLVRAGLPKPQTQIELRELRVRVDMGWREWKVAVEY
DGIQHWDDPYQRAWDIERIALLEAAGWAVIRVSAAMLSRPQVIVERVTAK
LAERGAYGRPRASSRA
>MAP4000c hypothetical protein
MGRVTRHYVPAMSDNTVRVDPVVMQGAAASLSGAAEHLSAQLGQLDDQVG
QMLGGWQGASGSAYAAAWELWHRGAREVQLGLAMLARLVGQAGEAYASNE
AGAAQAERAVRGG
>MAP3632 hypothetical protein
MTNEHGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDVLTQISAVNS
ALRSVALNLLDEHLGHCVTRAVAGGGDDADEKLAEASAAIARLVRS
>MAP4090c hypothetical protein
MDDQEAPDAPASRRAHILRLAVFAGFLAVVFYLVAVARVIDVGAIRAVVS
ATGPAAPLTYVVASALAGALFVPGSILAAGSGLLFGPLLGVFVTLGATVG
TATTASFVGRRAGRDSARALLGPARADRVDALIGRGGLWAVVGQRFVPGI
SDALASYAFGAFGVPLWQMAVGAFIGSAPRAFAYTALGASIGNRSSLLAY
AAVAVWCVSAIVGAFAAHRGYRHWRGRPKDEAR
>MAP2243c hypothetical protein
MSATQEAIDMATVAAGAAAAKLADDVVVIDVSAQLAITDCFVIASASNER
QVNAIVDEVEEKMRKAGYKPARREGAREGRWTLLDYRDIVVHIQHRDDRD
FYALDRLWSDCPVVPVNLDEDRQNPGDAGTP
>MAP3616c hypothetical protein
MARSHPQHAAPNPNRNIKAVRTVRFWAAPLVITLALMSALCALYLGGILN
PTTNLRHFPIAVVNEDAGPGGAQIVDRLATGLDRNKFDIRVLSRDEAKHQ
LDTGRVYGSLLIPPSFSSKLRDFATSAVSPARPDKPSITVSTNPRAGTLG
ASIAGQTLNSAMGTANGIAAQRVMAEVTAQTGGAPLPGATQAGLSSPIEI
QSVVYNPLPNGTGNGLSAFYYALLLLLAGFTGSIVVSTLVDALLGYVPAE
FGPVYRFAEQVRISRFQTLLLKWAMMLLLGLLTSAVYLAIADGLGMPIDL
SWELWAYGVFAIAAVGITSSSLLSVLGTAGMLVSMLVFVIFGLPSAGATV
PLEATPPLFRWLAEFEPMHQVFLGTRSLLYFGGRGDAGLSQALTMTSAGL
VIGLLLGGIVTHVYDRKGFHRIPGAVEFAIAQDHQAQHQARRGKSTQQPD
SPAEPETPTEPESPDEPESPSAQT
>MAP0610c hypothetical protein
MYFAGVDLAWAGRNPTGVAVIDSAGELVSVGAVHDDDEILAALDPYVRGE
CLVAFDAPLVVNNPTGQRPAETALNRDFRRFQAGTHPCNTGKPEFADGPR
AARLAAALGLALDPRSPRPRRAIEVYPHAATVALFGLQQTLKYKAKPGRS
LERLKSELLLLMAGVERLAHAPVPVRVGRHAGWRALRRAVECAQRKSELR
RAEDPVDAVVCAYVALFAQRRPDAVTSYGDPGTGCIVTPSLPAARRQSLR
SAPAADRSGPAPR
>MAP3784 hypothetical protein
MSQIMYNYPAMLSHAADMSGYAGTMQGLGADIASEQATLSNAWQGDTGMT
YQVWQAQWNEAMESLVRAYQSMASTHEANTMSMLARDQAEAAKWGG
>MAP1976 hypothetical protein
MDRWHRRVDSGDWDAIAAAIGEFGGALLPRLVTPREAARLRELYADDGLF
RSTIDMAPKRYGAGPYRYFRAPYPEPIEQLKQALYPRLLPIARDWWGKLG
RDPVWPDRLDDWLAACHAAGQQRSTALMLKYGAGDWNALHQDLYGDLVFP
LQVVINLSDPQTDYTGGEFLLVEQRPRAQSRGTATQLPQGHGYVFTTRER
PVRSARGWSAAPVRHGVSVVRSGQRYAMGLIFHDAA
>MAP3535 hypothetical protein
MAIDSGRTRKADAARHDPGDIDVVLTRLRRAHGQLGGVIAMIEQGRSCKD
VVTQLAAVSKALDRAGFKIIASGLRDCITRTEQQPPLSIDELEKLFLSLA
>MAP4115c hypothetical protein
MISRERERKPDDAAAAMAAWESFHAKAGPAIKAGDALAPAAAAAVITGGP
DAPVVTDGPFAETAEVACGYYIFEAENLDEALALARDVPVAAFGAVELWP
VVHAVEPSRRITGNDWLALLLEPAESAHTPGTPEWEAVAAKHAELHAAAG
DHIIGGAALHDKSTATTVRVRDGEVLITDGPYVESAEIATGIYLLGAADR
DEAVKIASMIPASTVQLRQLAGVSGL
>MAP3445 hypothetical protein
MPASVSFCPRASRQRGSRRRASRQRDARAAARRFQAPSHIAGLPRLIGMT
QKTRIEPLPPQRAGLLTRAMYRIAKRRYGQVPEPFAVAAHHRRLMVASAV
HETLVDRASKTLPASVRELAVYWTARQIGCSWCVDFGSMLQRLDGLDIQR
LTEIDDYATSPAFTDDERAAIAYAAAITTDPHTVTDEQVDDLRARFGDAG
TFELTYQIGLENMRARVNAALGITEQGFNSGDACRIPWATGDTETSDPAA
NR
>MAP1230 hypothetical protein
MRAARTSQPADGPDLPRHPTRLPGDVRPVALFVLGMARSGTSALTRVLSL
CGSTLPAGMCGADGNNPRGYWEPRAAIMLNEAILRRHDSNWYDPTLRLQQ
EAAFGAAERAACIAEITAYLSTLPAAPLVVVKEPRITALPGLWFEAARRV
GFDVAAVIAVRHPQEVIASAAKYVSTSPELSSALWLKYNLLAERHTRGVQ
RVFVDYANLLDDWHREMKRIAGALEIELDTAEEGAIEEFLTADLRRQRHC
GPVTDLFGADWMSAVYAALRGAAHDDPLDTATLDRVFESYRASERDFRTA
FTDFQARTNSVVRRVFRPSMTMCSIRHGLCTPRIVSPIQMQSLPWWRRSR
GMDQYVICGVTGRTTLSDSTANHFSLGRTAICRLRCGNFCSARRLRIRRH
SSARRW
>MAP3576 hypothetical protein
MAIRVAHVGTGNVGGLALAELITNPRYELTGVCVSTPEKVGKDAGELCGV
GLDTGVVTGVAAVGDLDAVIAAKPECVVYCAMGDTRLPEAMADVMRILAA
GINVVGSSPGLLQYPWGVMPEKYIARVEDAARQGNSSIFISGVDPGFAND
LIPLALAGTCQRVEQVRCMEIHDYASYNGAEVMGYMGFGRPLDEIPMLLQ
PGVLSIAWGTAIRQLAAGLGIEVDEITESYQREPAPEDFDIAVGRVAKGT
LAVLQFEIRGMVNGHPAIVIEHVTRLRPDLRPDLPQPAAGDGSYRVEITG
EPSYAVDIVPSSRKGDHNHAAIAGAAGRIVNAIPAVIAAPPGIRTTLDLP
LVTGKGLYAPSTLVTT
>MAP1906c hypothetical protein
MFLGTYTPKLDDKGRLTLPAKFRDALAGGLMVTKSQDHSLAVYPRAEFEQ
LARRASKASKSNPDARAFLRNLAAGTDEQHPDAQGRITLSADHRRYASLS
KDCVVIGAVDYLEIWDAQAWQDYQQTHEENFSAASDEALGDII
>MAP1597 hypothetical protein
MSRIGTFADDDLAGWFVKSPDIGAALGAFSQAVYTKNRLPLRTRELARAV
IAHRNECVVCVNTRDEDGPAAGVDEELYDHVHEWRTWPGYSEQERLAAEF
ADRFATDHTGLRDDEDFWSRCAEHFSDELLADLALSCALWVGMGRVLRTL
DIGQACKLTIPSRG
>MAP3888c hypothetical protein
MEPQENRKNPQRIFSCTPHSYSMAGQSTLWVGTARMLRRAGFGVTGPEVD
AAVARGWPGHLDAMLAADPDADPGALATPMPALPAPQPPGKRATPAARKQ
YNQQLTEQQGVLSDWWIRRMVVVRQPFHEKLTLLWHNHFATSAQKVRVAA
QMAAQNQKLRTLSLGDFRTLAYAMLTDAAMLRWLDGQTSTAKAPNENLAR
EFMELFALGHGNGYTESDVRNGARALTGWVIGAGGATSVLPKRHDATAKT
LFGRSANFDAAGFCDAVLAQPKSAGYVAGRLWQQLAGDDPPSPPALDRLV
AAYGPGRDLRALTRAILTDPEFTGARASMVNTPIEWLIGVIRSLRVPVDD
PKRLKMIDATLRTLGQRPFYPPSVGGWPSGQVWLSTASAGARLRAATELA
HAGDLSGIENTPPTDRIDAVGYLIGVGAWSDRTARALQPLVGQPPRLVAA
AVNTPEYLTS
>MAP3614 hypothetical protein
MLTFATGLADAISILVLGHVFVANMTGNVIFLGFWLAPRTSIDLTAVVVA
LPTFACTTILSGRLARHFGERVRAWISTVLATEIVLLVGLSVLAGSGILG
YQQDSKLLMIAILAVTFGLQHSSARQFGIQELSTTVLTSTIVSLGLDSRL
AGGTGERQRLRIGVVATMCAGAFLGATMSRYVVAPVFVVAAAVIAASLLI
FRFGPPAAKPAAAEAN
>MAP0826c hypothetical protein
MAINIEPALSPHLVVDNAAAAIDFYVKAFGAEELGRLPRPDGKLAHAAVR
INGFMVMLNDDFPEVCGGKSMTPTSLGGTPVTIHLTVPDVDASFQRAVDA
GATVVVPLEDQFWGDRYGMVADPFGHHWSLGQPVREVSPEEMAAAMAAMA
AEQGAGQA
>MAP3922 hypothetical protein
MTPASGWRAVSSVPARRGDAHIDFARSPRPTIGVEWEFALVDAQTRDLSN
EATAVIAEIGENPRVHKELLRNTVEVVSGICRTVPEAMEDLRQTLGPARR
IVRDRGMELFCAGAHPFAQWTTQKLTDAPRYAELIKRTQWWGRQMLIWGV
HVHVGISSPNKVMPIMTSLLNYYPHLLALSASSPWWTGVDTGYASNRAMM
FQQLPTAGLPFQFQTWAEFEGFVYDQKKTGIIDHVDEVRWDIRPSPHLGT
LEMRICDGVSNLHELAALVALTHCLVVDLDRRLEADESLPTMPPWHHQEN
KWRAARYGLDAVIILDADSNERLVTEDLDDVLNRLEPVARKLQCADELAA
VADIPRHGASYQRQRRVAEEHDGDLRAVVDALVAELEI
>MAP3153 hypothetical protein
MADMIVKSPDEVVAFLKAQHNLIEDMFDQVLHATDPKAREEPFATLRQLL
AVHETAEEMLVHPPARKEADAGDAVVDARLHEEHSAKELLSAIEKLDSTT
DQFLDEVTKLREAVLEHATREENEEFPALQRLDSDDLKRMGTAVRAAEAI
APTHPDPGVESATLNFAVGPFASMLDRARDLIGRAIG
>MAP3976 hypothetical protein
MIFVTPSSAPRPFNRRVALATLGIGALAPGVLAACGGGTAKEAEKKEQPA
LRLKYQPADAAQNVVPTAPVSVEVSDGWFQHVTLSNSSGKAVAGTFNSDR
TVYTTTEPLGYDQTYTWSGSAVGHDGKTVAVAGKFSTVSPSKKISGAFQL
ADGQTVGIAAPIILQFDAPISDKAAVEKALTVTTNPPVEGSWAWLPDEAQ
GARAHWRTREYYPAGTTVNVQAKLYGLPFGDGAYGAEDISLNINIGRRQI
VKAEVSSHRIQVIRDEGVIMDFPCSYGEADKARNVTRNGIHVVSEKYADF
YMSNPAAGYSHVHERWAVRISNNGEFIHANPASAGAQGNTNVTNGCINLS
TSDAEQYYQSAIYGDPVEVTGSSIQLSYADGDIWDWAVDWDTWVAMSALP
PPTAHPPSTQIPVTAPVTPSNAPTLSGTPTTSTTTPASTGPATPGG
>MAP0745c hypothetical protein
MRPGSPPPRTGTVRLTLDCRPADPGHGVRLRSGNRSLSEGPQRRRSALDR
ERYERGLRIRSDVLGEQYVNRALADADEFTRPLQDLVTEYCWGAVWGREE
LPRKTRSMLNLAMIAVLNRPNELRMHIKAALTNGVTREEIREVFLQVAIY
AGVPAAVDSFRIAREAFAELDRENS
>MAP1030 hypothetical protein
MSGHSKWATTKHKKAVIDARRGKMFARLIKNIEVAARVGGGDPAGNPTLY
DAIQKAKKSSVPNENIERARKRGAGEEAGGADWQTITYEGYAPNGVAVLI
ECLTDNRNRAASEVRVAMTRNGGTMADPGSVSYLFSRKSVVTCEKNGLTE
DDILAAVLDAGAEEVEDLGDSFEIICEPTDLVAVRTALQDAGIDYDSAEA
GFQPSVTVPLNADGAQKVMRLVDALEDSDDVQDVWTNADIPDEILAQIEE
>MAP1542 hypothetical protein
MGEVRVVGIRVEQPQNQPVLLLRETNGDRYLPIWIGQSEAAAIALEQQGV
EPPRPLTHDLIRDVIAALGHSLKEVRIVDLQEGTFYADLVFDRNITVSAR
PSDSVAIALRVGVPIYVEEAVLAQAGLLIPDESDEEGGTAVREDEVEKFK
EFLDSVSPDDFKAT
>MAP3150c hypothetical protein
MSAGRRIQVARVYDKVGPDEGQRVLVDRIWPRGVRKDDPRVGIWCKDVAP
SKQLREWYHHEPERFDEFTSRYKSELRGNPALDELRTLAKRGSVTLVTAT
RDLDISQAVVLAELLKSG
>MAP0101 hypothetical protein
MVWQRWPTTGLAPPATSAAEYDTLISNLIATGVITDAGMSYFDVRPALRT
PTLELRVCDSCPRADTIVLITALFRALVEREIQGLRTGVPAAIVVPPLGR
AALWRAARSGLEGDLVDLIHPASRPAGDVVTDLVQMLRPQLEASGDWQAV
EGLARKALTQGSSAARQRRAMRTRNDLFDVVDHLIAETAAVAPGAHGTLA
TRRNGSDGG
>MAP3805c hypothetical protein
MYPIDASGSIDRSYVRIPGCPSPPGGPTGVWQHGGVSSPTRPAVPAALDP
WIVAALAAAVSLAGAARPSFWYDEAATISASYSRSLTQMWHMLGNVDAVH
GLYYLLMHGWFRLVPPTEFWSRAPSGLAVGAAAAGVVVLGRQFSSRTVAV
VSGVFCAVLPRTTWAGVEARPYALSMMAAVWLSVLLVFAARRQSRWPWLC
FGLALVCSVLLDAYIALLLAAYAVFVGVCCRTRTVLWRFGISSAVAVGVL
LPFLLTVAGQAHQISWVASIGHRTVEDVVMQQYFERSPPFAVLSALLICA
AIALWLSRSAPPGPSERQLLVLATCWVGIPTAAIVAYSALVHPIYTPRYL
CFTAPAMALILGVCSAAIAAKPWVTTAVVGVFAIAAVPNYVRAQRNPYAK
YGMDYSQVADLITAKAAPGDCLLVNDTVTFMPAPMRPLLAARPDAYRKLI
DLTLWQRAVDRNDVFDTNLIPEVVAGPLSHCAVLWIITQADPSEPAHQQG
PALPPGPVYGATPAFAVPHDLGFRLVERWQFNLVQVFEATK
>MAP4150c hypothetical protein
MSVEHLVHMLRAQGSFCASSGSPMYGDLFELVASDVEAGGVFADILSGHR
DAPSRDAIPLRLLGGLHRLVLDGRAGSLRRWYPSTGGSWDAGAAWPPILA
AAAEHAAALRAALDRPPQTNEVGRSAALIGGLLHINESCLPVRLFEIGSS
AGLNLRADHYRYRYAGGGWGPADSPVCIDDAWRGALPPARGVRIVERHGF
DIAPVDVGNPDGELTVLSYVWPDQAARLARLRGAIEVARRVPATLERRTA
ADAVGRLSLAEGALTVLWHSITWQYLSAGERAAVCAGVDALGARAGASAP
LVHLTMEPARDGPGAPIRFLVRARGWPDGGPRVLAQCHPHGPPVDWL
>MAP3958c hypothetical protein
MTAPRIPAGRFRQLGPINWVIAKLGARTVGAPEMHLFTTLGQRRLLFWTW
LAYGGRLLRGKLPTADTELVILRVAHLRGCEYELQHHRRMARTAGLDPDL
QAAIFAWPQRLEPVQARLTVRQQALLAATDEFVNDRTVSEATWRQLAEHL
DRRQLIEFCLLASQYDGLAATMSALAIPLDHPEGS
>MAP1404 hypothetical protein
MKMSGLLSRNTDRPGIVGTARVDRNIDRLLRRVCPGDIVVLDVLDLDRIT
ADALVEADIAAVVNASPSVSGRYPNLGPEVLVNNGVTLIDEAGPDIFKKV
KDGAKIRLYDGGVYAGDRRLVRGTERTEHDIADLMREAKSGLAAHLEAFA
GNTIEFIKSESPLLIDGIGIPDIDVDLRRRHVVIVADEPSAEEDLKSLKP
FIKEYQPALIGVGTGADVLRKAGYRPQIIVGDPDQISADALKCGAQVVLP
ADADGHAPGLERIQDLGVGAMTFPAAGSAIDLALLLADHHGAALLVTAGH
TANIETFFDRTRAHSNPSTFLTRLRVGEKVVDAKAVATLYRNHISAGAIA
LLALTMLIAVIVALWVSRTDGVVLHWITDYWNHFSLMVQKWVT
>MAP3065 hypothetical protein
MARQAGRVRAALAAVVADTAVTEAASLKDGRATLALPGGLRWVAVHQPDG
YHPPRRFVDSIGGDGLAALPARIAVRWRHIHEFEDVGGDRTRVIDRVETP
VPASLLRPMFDYRHRQLAGDLAAHRLAAEHGLRPLTVAITGSSGLVGSAL
TAFLRTGGHRVIRLVRRAARGGDERRWDPEDPDPGLCDGVDAVVHLAGAS
IAGRFTDRHREAVRDSRIGPTRRLAELLGRGRPRPAALISASAIGYYGYD
RGDETLTEDSDRGDGFLADVVADWERAITLALDAGVRVVQVRTGIVQSPR
GGTLKLMRPLFSAGLGGRLGDGRQWLSWIGIDDLIDVYHRALWDSRLNGP
VNAVAPQPVRNSEYTTALAAVLHRPAVLPVPSLGPRLLLGGQGARELAGA
SQRVAPAKLAAAGHRFRHPDIERALRHLLGRGG
>MAP4070c hypothetical protein
MIGSDPGLLRLRMATRTTAALGLSLLALWLLTRATGQPLTVALLGVVITM
VASRSVNEPDPRQQRITMALLPVPAAVAITAAAVLAPHPVAGDVVFVLIV
FAAAYIRRFGARGRALGTVCTYVMTSHVLPDRPERVLRATIRALRARMAI
VIDTTAEAVRTGRLDERRRRRMRARTIRLNETALMVQSQIEDRADPGTLW
PGVTAEQLAPWLLDAELAIEWVATAGRRAAALGAELPEAARAELVSALTQ
LGRAIRLPEPGGLRDAASRARRLLDAATDDRPAGTAVRRLALAIINAATA
TADVRAIVDGAAGAAVPDVAGSERPPAAAGAAAEEPAEQEEEQPAGLRPT
TRQAIQVSVAASLAIITGELVSPARWYWAVIAAFVIFAGTNSWGETLTKG
WQRLLGTMLGVPAGVLVATLLTGHEAAALAGIFVCLFCAFYFMTVTYSLM
TFWITTMLALLYGLLGQFSFGVLMLRIEETALGAVIGATVAVVVLPTNTR
TAIRADTRAFLTSLSALIEACAAAMSGSAASPSEQARQLDRDLQQFRVTA
KPLLAGVAGLAGRRGIRRALRIFTACDRYGRALARSSEQYRGSPGPGPQL
AQAFSAAAAQTRRNLDALLEAIDSGHPPTLVSAADELDAAETAARQHDSD
GDGETRPDVRRFLTAVHALRQIERAVISTATNLGGHEDVRTTTAAPR
>MAP2723c hypothetical protein
MDSVSLTDLAAEKLAEAKQSHSGRAAHTIHGGHTHELRQTVLALLADHDL
SEHDSPGEATLQVLQGHVRLTTGGDAWEGRAGEYVAIPAERHALHAVEDS
VVLLTVLKTIPSAH
>MAP0922c hypothetical protein
MPTYSYQCTECDDRFDIVQAFTDDALTTCKHCSGRLRKRFNSVGVVFKGS
GFYRTDSRESGKKPAGSTNGSSSSDSSSSTSDSSGSSAKSGSGEKAVSSP
APAAAAAG
>MAP2374c hypothetical protein
MVELYNLVVWNERNIELARELLGDTVIRHEVGAAHTLTHAEAVQRVVDMW
EMADSLHFDLNVVVEGDDGEHVAIVYDSTIRTKDGAETNIAGIEVFRVVD
GKITEVWNCGYQQGVWN
>MAP1988 hypothetical protein
MKRPAKRVADLLNPAATLLPAANVIMQLSLPGVGYGVLESPVDSGNVYKH
PFKRARTTGTYLAAATIGTDYDRALICAAVDVAHRQVRSRPSSPVSYNAF
DPKLQLWVAACLYRYFVDQHEFLHGPLDEASADAVYRDAARLGTTLQVPE
RMWPPARAAFDEYWKRSLDELRIDPPVREHLHGVASLAFLPWPLRVLAGP
FNLFATTGFLAPEFREMMQLDWTPAQQRRFGWLLFSLRLADRLIPHWVWI
LGYRIYLWDMRSRARQGRRVV
>MAP4077c hypothetical protein
MRVDGRDIIVSGSLLQPLTRRTNDILRLGMALIFLAVVITGSVITRPQWI
ALEKSVSQIVGVLSPTQSDVVYLVYGLAIVALPFMILIGLIAAGQWKLLG
AYAAAGLSAIVLLSISGTGLAAPRWHFDVLDRLTTLPAQLLDDPRWIGML
AAVLTVSGPWLPARWRRWWWALLLAFVPIHLVVSAIVPARSLVGLAVGWV
VGALVVLVVGTPALEVPLDAAVRALAKAGFEVCRLTVVRPAGRGPLILSA
DGQDADHTALIELYGPHQRSGGALRQLWGKLKLRDAETAPLLTSMRRAVE
HRALMAIAIGEAGLANTATVAVATLDRGWMLYSHKPPRGTPIDRCAKTTP
VQRLWEALRVLNDHQIAHGDLRAHHITVDDGAVLFGGFGSAEYGATEAQL
QSDIAQLLVTTSAHYDPKSAVRAAIDVFGADTILSASRRLTKVAVPKSVR
RSAPDSGAVISGARAEVKRQTGADQIKPQTITRFTRSQIIQLVLFGALVY
VAYPFISTAPTFFSQLRTADWWWALLGLLVSALTYVGAAAALWACADGMV
NFWMLSIAQVANTFAATTTPAGVGGLALSTRFLQKSGLSAMRATAAVALQ
QSVQVIAHLALLVLFSAAAGASMNLSHFVPSATMLYLIAGVALGIVGTFL
FVPTLRRWLATEVRPKLDEVVSDLAKLAREPRRLALILLGCAGTTLGAAL
ALWASIQAFGGDTTFVAVTVVTMVGGTLASAAPTPGGVGAVEAALIGGLA
AFGVPAAIGVPAVLLYRMLTCWLPVFVGWPVMRWLTKHEMV
>MAP1891c hypothetical protein
MSTLHKVKAYFGMAPMDDYEDEYYDDRAPSRGFPRPRFDDGYGRYDGDDY
DDPRREPADCPPPAGYRGGYAEESRYGAVHPREFERPEMGRPRFGSWLRN
STRGALAMDPRRMAMMFEEGHPLSKITTLRPKDYSEARTIGERFRDGTPV
IMDLVSMDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLSPADVDVSPE
ERRRIAETGFYAYQ
>MAP1243 hypothetical protein
MPTMRLPKLPLDEAKAAADEAAVPNYMAELSIFQVLLNHPPLARAINDLL
ATMLWHGALDSRLRELVIMRIGWLTGCDYEWTQHWRVASRLGVPGDDLLG
VRDWRNYPGFGPTERAVLAATDDVVRHGSVSAASWAQCEHEFNGDTTVLV
ELVTVIGAWRMVASILQSLEVPLEDGVASWPPDGREPAS
>MAP2429c hypothetical protein
MVVMSAPTEPKSRPGTTGQRESAPEDVTASPWVTIVWDDPVNLMTYVTYV
FQKLFGYSEPHATKLMLQVHNEGKAVVSAGSREAMEVDVSKLHAAGLWAT
MQQDR
>MAP3134c hypothetical protein
MAASDRHPQRLTPLPADEWDEDTRAALASLIPAERANPVGAGNVLSTLVR
HPDLTAAYLPFNAYLLTRSTLSPRIREVALLRVVHRSKCGYLWSHHLPIA
RRAGLSESDIDDIRRGRCADPTDAAVVRAVDELTGDSTLSRQTWDRLCEL
FTQRQCMDLIFTIGGYLLLALAVNTFGVQEEETP
>MAP2631 hypothetical protein
MVVTGPGWWNRGMTEYAAFLRGVNVGGVNLKMADVKKALTDAGFTAVRTV
LASGNVLLQSKAGAAAVRAKAEAALRERFGYDAWVLVYDLDTVRRVVDGY
PFEREVDGYQSYVTFVADAAVLDELAALPAGPDERIRRGPDPLGVLYWQV
PKGSTLDSTIGKTMGKPRYKSSTTTRNLRTLVKVLS
>MAP2448 hypothetical protein
MVMAVHLTRIYTRTGDDGTTGLSDFSRVSKNDPRLVAYADCDEANAAIGV
AVAVGRPGPELAGVLRQIQNDLFDAGADLSTPVVEDPEYPPLRVTQPYID
RLEKWCDTYNESLPKLNSFVLPGGSPLSALLHVARTVVRRAERSAWAAVD
AAPEGVSALPAKYLNRLSDLLFILSRVANPDGDVLWKPGGQQGGEPAPG
>MAP0660 hypothetical protein
MTQDTSASRPLTSDVTGSDAAGLTEQSISARPADAGAAVAGGCPVSPLGY
EAPPAPLGPDSLTWKYFGDWRGMLQGPWAGSMQNMHPQLGAAVLDHSTFF
RERWPRLLRSLYPIGGVVFDGDRAPITGAEVRDYHTDIKGVDEQGRRYHA
LNPDVFYWAHSTFFVGTIHVAERFCGGITEEQKRQLFDEHVQWYRMYGMS
MRPVPKSWEEFQEYWDHMCRNVLENNEAARAVLDLTELPKPPFAQWIPNR
LWAAQRKLLAPFFVWVTVGLYDPPVRELMGYRWSARDEWLHRRFGDLVRI
VFALVPSRFRKHPRARAGLDRASGRIPLDAPLVQTPARNLPPEDERGNPK
HYCPMVS
>MAP1566 hypothetical protein
MQTTFDPELVVPDEARVTEFTGDNSLSRKDLSQHPIPPGSLTWKYWGRLD
VIFFGSGVVGTIAGAWPQMAKATSSSVLFTGDSSFGARSKIYKVRRQRSR
EYIYGTVYDAPEDAKKYGLKTRNMHKSIKGTLQDGTFHALNADTFYFGHV
TFFYHLLLKVVEQLYFDGAMPRAMKEQIFEESKEWYSMWGVDDSPQPATY
DDFERYLDNIERNHLVNSQVTQVMLEQFMERRVPPRWWPPVMKKFVWPWV
AGRRQVVVNSFPPHVQELFNLEWTPEDEEIARRFMRMYRRLYAILERVVP
LKFLYLPIAVEGFKREGVDPRKITLESAQQALRENRARRAARENASADET
NGVLASG
>MAP1390 hypothetical protein
MIDEALLNILVCPADRGPLVLVGQELLYNPRLRRAYRIEDGIPVLLMDEA
RDVDDEEHARLMAQVRPADPR
>MAP0195c hypothetical protein
MPVRMDPQRFDELVSDALDLIPPELAAAMDNVVVLVDDRHPEEPDLLGLY
EGVALTERDSDYSGALPDAITIYRAALLDVCESEQQVIEEVAVTVIHEIA
HHFGIDDERLHQLGWA
>MAP2551 hypothetical protein
MSKSTAVRRLYTPRTSRRYSPRLDPETVGQITESIARFFGTGRYLLLQTI
VVAVWIVVNVFAVRLRWDPYPFILLNLAFSTQAAYAAPLILLAQNRQENR
DRVALEEDRRRAAQTKADTEYLARELAAVRLAVGEVVTREYLRHELDDLR
ELLAELRPQSADGEPVSDADNREQTAKKSR
>MAP2118 hypothetical protein
MTNSPSSYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPSTSQAPQP
REWFAGVDDESILSLLDPLDAGTPDLAPPEIWRSYIRTVGRTPNGVWGGK
LMWNQTPLLLDRAKNLPDRSGDGLRAAIRDVIGEEPLLIHVYRPDVVSQA
VSFWRAVQTRVWRGRPDPARDARATYHAGAIAHVVTMLRAQEEGWRNWFA
EEDLKPMEIPYPVLWRNLTQVVAAILEQLGLDPQLAPEPVLERQADHRSD
EWVDRYRADAEKYGLPT
>MAP1411 hypothetical protein
MNATANGEAQPQTGFQVRLTNFEGPFDLLLQLIFAHRLDVTEVALHQVTD
DFIAYTREIGPKLELEETTAFLVVAATLLDLKAARLLPAGQVDDDEDLAL
LEVRDLLFARLLQYRAFKHVAEMFAELEASALRSYPRAVSLEDRFTQLLP
EVMLGVDAERFAQIAAVAFSPRPVPTVSVGHLHEVKVSVPEQARKLLAIL
EARGSGQWATFSELVADCEGSMEVVGRFLALLELYRSRAVAFEQSEPLGV
LQISWTGERPVGETLVEVRDEL
>MAP4069c hypothetical protein
MSEPVAPRVAVYLDFDNIVISRYDQIHGRNSFQRDKAKGLEQFTERLEQA
TVDVGAILDFASSFGTLVLTRAYADWSAEINAGYRGQLVARAVDLVQLFP
AAAYGKNGADIRLAVDAVEDMFRLPDLTHVVIVAGDSDYIPLAQRCKRLG
RYVVGIGVAGASSRALAAACDEFVIYDALPGVTALDRTPAPAETVPAKPR
TRRGKAAQPEEPDPQAAATALLTRALQIGFEKDDVDWLHNSAVKAQMKRM
DPSFSEKSLGFRSFSDFLRSRSDVVELDETSTTRMVRLRPHE
>MAP1536 hypothetical protein
MADVDRALGGYDPNAGHSAHLAARPQRIPVPSLLRALLSEHLDPGYAAAA
AKRGAVDETRNRRPRVSGWLWQALAALLIATVFAAAVAQARSVAPGVRSA
QQLLLGNVRATEGSAAKLAQRRNELSAKVDDVQRHALADDAEGQRLLKRL
DALGLAAASTAVIGPGLKVTVTDPGAGPNLSDVSKQRVSGSRQIILDRDL
QLVVNSLWAGGAEAVSVGGVRIGPTVTIRQAGGAILVDNNPTSSPYTILA
IGPPHALRDAFDTSPGMQRLRLLQISYGVVVTVDVADGLTLPAGSVRDVK
FAKQIGPQ
>MAP3374 hypothetical protein
MLCEAQVKVTVLVGGVGGARFLLGVQQLFGLGQFRAQRHTHGRPDTAAGS
HELTAIVNIGDDAWIHGLRVCPDLDTCMYTLGGGVDPERGWGHRDETWHA
KEELARYGVQPDWFGLGDRDIGTHLVRTQMLNAGYPLTQITAALCDRWQP
GARLLPVSDDRCETHVVITDPDDGSRRAIHFQEWWVRYRAQVPTHSFAFV
GAEKAAATTETIAAIADADVILIAPSNPVVSVGAILAVPGVRGALRAAGA
PIVGYSPIIGGKPLRGMADACLSVIGVESTAEAVGRHYGARRGTGILDCW
LVSQDDHADIEGVAVRAVPLMMTDPAATAEMVSAGLQLAGVTP
>MAP2025c hypothetical protein
MERPDTLDVLRLIANAPNIFESWSQMASQLFDTETFSPRMREVIILRVAH
LQDSPYELAQHVVFARAAGLTDRQIDALQDKADLDAAGFTDDERILIDTV
TELCTTHRLDDASFAKAHTLLGDEALTELLMIVATYYGLALVLNATDLDI
DAPT
>MAP3291c hypothetical protein
MGMRPTARMPKLTRRSRILILIALGVIALLLAGPRLIDAYVDWLWFGELG
YRSVFSTVLVTRFVVFLIAGLLVGGIVFAGLAVAYRTRPVFVPSNDNDPV
ARYRALVLSRLRLVSVGVPVAIGLLAGIIAQSYWVRIQLFLHGGDFGIKD
PQFGKDLGFYAFELPFYRLLLSYLFVAVFLAFVANLLAHYIFGGIRLSGR
TGALSRSARIQLVTLVGLLVLLKAVAYWLDRYELLSHTRGGKPITGAGYT
DINAVLPAKLILMAIALICAAAVFSAITMRDLRIPAIGLVLLLLSSLIVG
AGWPLIVEQISVKPNAAQKESEYISRSITATRQAYGLTSDVVTYRNYTGD
AQATAQQVADDRATTSNIRLLDPTIVSPAFTQFQQGKNFYYFPDQLSIDR
YLDRNGALRDYVVAVRELNPDRLIDNQRDWINRHTVYTHGNGFIASPANT
VRGIANDPNQNGGYPEFLVNVVGANGNVVSDGPAPLDQPRVYFGPVISNT
SADYAIVGRNGADREYDYETSTETKNYTYTGLGGVPIGDWLSRSVFAAKF
AERNFLFSNVIGSNSKILFNRDPARRVEAVAPWLTTDSSVYPAIVNKRLV
WIIDGYTTLDNYPYSELTSLESATADSNEVAFNKLAPDKRVSYIRNSVKA
TVDAYDGTVTLYQQDEQDPVLKAWMQVFPGTVKPKSDISPELAEHLRYPE
DLFKVQRMLLAKYHVNDPVTFFSTSDFWDVPLDPNPTASSYQPPYYIVAK
NIAKNDNSSSYQLTSAMNRFKRDYLAAYISASSDPATYGRITVLTIPGQV
NGPKLANNAITTDPAVSQDLGVIGRDNQNRIRWGNLLTLPVGQGGLLYVE
PVYASPGASDAASSYPRLIRVAMMYNDKIGYGPTVRDALTGLFGPGAGAA
ATNIQPTEGGAPAASPPANAPAPAVTPGSAPPVAAPPVPDGSVTLSPAKA
AVLQEIQAAIGAAKDAQKKGDFAGYGAALQRLDDAITKYNNTK
>MAP0787 hypothetical protein
MAVQASSEIVIDAPPEVIMEALADMDAVPSWSSVHKRVEVIDKHPDGRPH
HVRVTIAVVGIHDTELLEYHWGPDWMVWDADRTAQQHGQHGEYNLSRLGD
DKTRVRFTITVEPWVPLPEFWVRRARKKILHAALEGLRKRVMTPEAFGRA
D
>MAP0281 hypothetical protein
MTRKAEIVAVFAICTAFMTASGAFGGFAARADDPEILYNGINQLRQACGP
IAEDPRLTEAAQQHADDMLRNGVSGHIGSDGSSPQARIAAAGYRSRYSGE
IVFWATGSAATPSEALDMWMQSPPHRAIILNCGFNAGGFATARDGNKMTA
VGDFATS
>MAP2866c hypothetical protein
MSPGGGIVGAVSTRRLSVAQARRIAVAAQGFTEPRPAGAVTRAHLNRLIS
KIQVLQLDSVSVTVRAHYAPVFSRLGPYDREVLDRAAWGPRSSRLLVEYW
AHEAALMAVEDWPLLRWRMRQYRHGRWGTHIVQANPRLADAVVAAVAELG
PSTAGQIEAHLAAEPRRRKGAWWNRSDTKWVAEALFAAGVLTTATRVGFA
RHYDLVERVLPADVLARRVDDDEAIRELTLRAAGALGVGTEADIRDYFRL
SAAQVKPAIAELVAAGDLERVEVAGWPAPAYLRAGRAVPRTDRGTALLCP
FDPLIFFRPRVERLFEFHYRIEIYTPAAKRRYGYYVWPLLMDGRLAARVD
LKADRAEGTLRVLGAFAEPQAPRPRVAAVLAGELWSMASWLGLGGLSVAD
RGDLALALRAVA
>MAP3412 hypothetical protein
MSRPKWLPTWRLRAMFAAGLSAMYAAEVPGYATLVQVCTRVNADHVARHP
DAQRWGSLRRVTAERHGAIRVGSPAELRAVADLFAAFGMAPVGYYDLRRA
ASPIPVVCTAFRPVDGNELSLNPFRMFTSMLATRDRRYFDRDLRARVDNF
LAHRQLFDPALLARARSIAAAGGCADDAARPFVAAAVASFALSGKPIERS
WYDELSRVSAVAADIAGVCSTHLNHLTPRVLDIDELYRRMTGRGIAMIDA
IQGPPATGGPAVLLRQTSFRALAEPRRFRGDDGRITEGTVRVRFGEVEAR
GVALTPRGRRRYDAAMAADDPAAVWANYFPATDAQMAAEGLGYYCGGDPS
APIVYEDFLPASAAGIFRSNLDAETPGAAQAAPAVDDSGYDQDWLAGALG
REIHDPYPLYEALAQEAPR
>MAP1148 hypothetical protein
MSEPINRGIVALGGGHGLYATLSAARRLTPYVTAVVTVADDGGSSGRLRS
ELGVVPPGDLRMALAALASDSPHGRLWATILQHRFGGSGALAGHPIGNLM
LAGLSEVLADPVAALDELGRILGVKGRVLPMCPIALQIEADVSGLEADPR
MFRVIRGQVAIATTPGKVRRVRLLPDNPPATRQAVDAIMAADLVVLGPGS
WFTSVIPHVMVPGLAAALWATTARRALVLNLVAEPGETAGFSVERHLHVL
VQHAPGFTVHDIIIDAERVPSEREREQLRRTATLLSAEVHFADVARPGTP
LHDPGKLAAALDGVRAARKAPGASPVTATADIRVDGASPPAGGNGPAGSG
PRGDDAWR
>MAP2037 hypothetical protein
MSTLVLTAHGSRDPRSAANAEAVADRLRRMRPGLDVRLAFLELNAPNFVD
VLAGLPDSRRAVVAPLLLASAYHARLDIPEQIARAGARGIRQADVLGEDD
RLVAVLRGRLAEIGVSPLDDDLGVMVVAIGSSNIAANARTAKVASRLAAG
TRWACATTASRPGPRRRWPAPPTSCAVAARAGW
>MAP1312 hypothetical protein
MQRRRTAARLLSAHREVRVNGPVVDVATKPRAGERVVVSADSRAARLIGA
LALFCAACWLIQILVHHHSNPSWHYADRLAWSLTVLVAVAWIARGIFLGR
PVTTMHAAVAAFFVLAGLGLHVLSFDLLGDVLIAGSGMVLMWPTSSHPRP
ADLPRIWELINATRDDALAPFTMQTGKSYHFSADGSAALAYRTRLGIAVV
SGDPIGDEAHFPELVADFAVTCHAHGWRIAVVGCSERRLELWKDSAVLGQ
TLRPIPIGRDVVVDVSSFDMVGRKFRNLRQAVKRTHNCGVTTEIVAEQEL
SDELLAELTEVVRESSKGAHADRGFHMNLDGVLEGRFPGILLIIARDATG
KVQAFHRYATAGGGSDITLDVPWRRRGAPNGLDERLSVDMVMAGKDRGAQ
RVSLAFAAFPEIFEDKNRGWTRRIFYRLTHVLDPLIALESLYRYVRKWHA
LDGRRYAVISMTQIVPLLFVLLSLEFLPRRRHL
>MAP2061 hypothetical protein
MTTRSAAARDATELAAYRVAAMLLGVGTLHFLAPKPFDSIIPAELPGSPR
LYTYGSGVAELTVGALLVPRRTRRAAALAAAILFIGVFPGNLNMVRLWWD
KPWPMRIAALARLPLQIPMITTALRIRRNS
>MAP1628 hypothetical protein
MGPFVLRAALTGFALWIVTLFVHGLTFVGGDTKLQRVGIIFVVAVIFGLV
NAFIKPIVQILSIPLYILTLGLIHIVINALMLWITAWITEHTTHWGLQID
HFWWTAIWAAIVLSIVSWLLSLVIRRTAG
>MAP2674c hypothetical protein
MPPRAEEGNLWFEWSRSLDDPAEYVLVEAFRDGDAGSAHVNSDHFKRAMQ
ELPQALKSTPKIISQTVEATGWSRMGEMTVD
>MAP3277c hypothetical protein
MRCRRLIRMTVPHTMPKTTAAFFVQAAVAFAISFVAALGGIYFLPLDPWP
RLFLGVTFLFLVSSAFTLAKVIRDQQEAATVRVRLDEARIERLLADYDPL
NTAN
>MAP2823 hypothetical protein
MSDQVAKPSRHHIWRITLRTLSKSWDDSIFSESAQAGFWSALSLPPLLLG
LLGTLAYVAPLFGPDMLPSILRQLISTAHSVFSPSVVNEIIEPTVRDIAN
NARGEVVSLGFLISLWAGSSAISAFVDSVVEAHDQTPLRHPVRQRFFALF
LYVVMLVGVVATAPLVVVGPRKVGEHIPLGLSNVLRYGYYPALIVGLTIA
VMILYRVALPEPLPSHRLILGAMLATTVFVTATLGLRFYLRWITSTGYTY
GALSTPIAFLLFAFFGGFAIMLGAELNAAVQEEFPAPKTHAHRLRTWLFS
RSPALERTVQTLASPIANPATQRREVAPAEPRG
>MAP2529 hypothetical protein
MSRVTPLRFRDWPPEMRDAMAALMPPNPRHPAPVTKDRPKAGNALGTLAH
HPSLARAFCTLQGHLLMGTTLSMRHREMLILRTAAVRGSAYEWTHHNFIA
PDGGISTEDVARIAFGPTAPYWSDFDAALLRSVDELIHDGQMSDATWATL
SSEFGTQQLLDTMFTVTSYDALMRMLKTCQVRLEDDVRELRAQAGLSVDG
DGA
>MAP4174 hypothetical protein
MRSELADLPGGSFRMGSTSFYPEEAPIHTVTVEPFAIERHPVTNAQFAEF
VAATGYVTVAEQPIDPALYPGADPSQLQPGAMVFRPTPGPVDLRDWRQWW
HWVPGASWRHPFGPDSDVADRADHPVVQVAYPDAAAYARWAGRRLPTEAE
WEYAARAGATTTYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGTSPV
GVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVN
QTLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAELGSN
>MAP2860 hypothetical protein
MPVVVVATMTVKPESVDTVRDILTRAVEEVHDEPGCQLYSLHQSGETFVF
VEQWADEEALKAHSTAPAIGKMFSAAGEHLDGAPDIKMLQPVPAGDPGKG
QLRP
>MAP1330c hypothetical protein
MRTVDEYAVRPWGLYLARPTPGRVQFHYLESWLLPSLGLRATVFHFNPGH
ERDHDYYLDVGDYLPGPTVWRSEDHYLDIEVRTGIRAELTDVDELLDALR
HGLLTPEVAERAIQRAVAAVDGLARNGYDLTRWLSEIDMELTWRGRAAAP
EPAP
>MAP2781 hypothetical protein
MRTFAEVMTFDPPDDASPCASGLIDFVFGEVWSRPGLSRRDRRFVTLACV
AAADAQAPLEQHVYAALNSGDLSITEMREAVLHFAVYAGWPKASRFNIVV
DQQWARIAAERGEPAPDPEPLLPLVTAADPEQRLRGGERSFKDINCLPFA
PPRDNPYSGAGILNFVFGEMWLRPGLGMKERRLITVACVAFQDAEIPIMS
HVYAALKSGDVSFREMDEVALQFAAYYGCAKADHLNQAIAEQKQRVIAEN
HSDATPVG
>MAP4307 hypothetical protein
MDDDCLKLTTYLAERRRAGNRFVSDVLLDLYARHRVECAVLLRGIGGFGT
GHRLRSDESLTLSEDPPVAIVATDTRTKIEALLDEVLAVKQRGLVTLERA
RLLHADVGAARPPEGDAVKLTVYLGRKQRVNGTPAYIGVCDLLHRRGLAG
ATVLLGVDGVVHGERRRASFFGRNVDVPMMIVVVGSGAHIGRVLPEIAAL
LRRPLFTLERVRVCKRDGQLLEPPHALPGVDERGLPLFQQLTVYTSESAR
HGGVPIHRAIVARLRQATAADGATVLRGVWGFHGDHPPHGDGLFALTRRV
PVVTIVIDTPAHIAESFAVIDELTGAEGLVTSEMVPALVSDDGGPGAGRM
ARHRY
>MAP3375 hypothetical protein
MSPAGEHGTAAPIEILPVAGLPEFRPGDDLGAAVAKAAPWLRDGDVVVVT
SKAVSKCEGRLVPAPADPEERDRLRRKLVDEEAVRVLARKGRTLITENRH
GLVQAAAGVDGSNVGRDELALLPLDPDASAAALRARLRELLGVEVAVLVT
DTMGRAWRNGQTDAAVGAAGLAVLHGYSGAVDQHGNELLVTEVAIADEIA
AAADLVKGKLTAMPVAVVRGLSVTDDGSTARQLLRPGTEDLFWLGTAEAI
ELGRRQAQLLRRSVRRFSAEPVPAELVREAVAEALTAPAPHHTRPVRFVW
LQTPAVRTRLLDAMKDKWRADLAGDGRPAESIERRVARGQILYDAPEVVI
PMLVPDGAHSYPDAARTDAEHTMFTVAVGAAVQALLVGLAVRGLGSCWIG
STIFAADLVRTELGLPADWEPLGAIAIGYAAEPAGPRGPADPGDLLIRK
>MAP1974 hypothetical protein
MPAQAPQTSRHLEVERKFDVVESTVMPSFEGIAAVARVEQSPTQILDATY
FDTPAHDLARNKITLRRRTGGSDAGWHLKLPAGPDARTEVRAPLDASDAD
TVPTELVDVVLAIVRNRPLRPVARITTERDTQVLYDAAGQPLAEFSNDHV
TAWATPSAPEGPDDGSGGVDEGSGGFDDTAASPTDQVPTQHPPTQQEWRE
WELELLGGNGHAEGAAELLNRLSNRLLDAGAVPAGHGSKLARVLGRAPSP
NGARPPEDPLQRAIAEQVRELLVWDRAVRADAFDSVHQMRVTTRKLRSLL
RDYQDSFGLGDDGWVLDELRELAGILGVARDAEVLAERYERDLGALPPEL
VRGPVRERLVGGARRRYQVGLRRSLIAMRSQRYFRLLDSLDAIAARRPGF
PGAEEQAPVTIDTAYRKVRKAAKAAARVDREHPGDRHLRDEAIHRIRKRA
KRLRYTATATRAVRVAERAKAVQSLLGDHQDSVVSREHLSQEAQAAHAAG
EDTFTYGLLYQREADLAQRCRDELDDTLRKLGKAVHKAD
>MAP2119 hypothetical protein
MGGAAGVGFGGLVGHRGGGRDRRAGGGRRAPEGSVVTGVSTVRVAPDETD
ETADATFTSTRPLDYLPGILLLIGVGVLGKYAQIWWNALAKHEHWTVPDI
EYVLWAIVIGLVITNTVGLHPIFRPGVLTYEFWLKAGIVALGSRFVLGDI
AKLGGISLVQILVDMTIAGTIIIAVARWFGLSGKLGSLLAIGTSICGVSA
IVAAKGAIRARNSDVSYAIAAILALGAVSLFVLPPLGHAIGLTDHEFGLW
AGLSVDNTAETTATGYLYSEHAGKIAVLVKSTRNALIGFVVLGFALFWAG
RGQADEIAPGVRAKAAFIWAKFPKFVLGFLVVSAIATAGWLTKGQTANLA
NVSKWAFLLTFAGVGLNTDIRQIARTGWRPLVVAVIGLTVVATVSLGIVL
LTSRVFGWGVTT
>MAP3724 hypothetical protein
MAIHQSVQAPIGHVERGATDPPRPQRRRRGRTLGIDEGLMGVALLAGPAN
VIMQLAQPGVGYGVMESRVESGRVDLHPFKRARTTFTYLAVATNGSDAQK
AAFRRAVNKAHAQVYSTPESPVSYNAFDPALQLWVGACLYKGGLDIYRMF
IGELDGEDAERHYREGMALATTLQVPPKMWPADRAAFDRYWEESLAKVHI
DDAVREYLYPIAANRIRGVKLPGPIQRLSEGFALMITTGFLPQRFRDEMR
LPWDATKQRRFDRLIAVLATVNRYLPRFIRQFPFNVLLHDLDRRIKKGRP
LV
>MAP2223c hypothetical protein
MSLSNQLEDTGRGFRAARSERIFGGYNVASDAYDMAFDEMFDAAGAVRGP
YKGIYAELAPSDASELKARAEALSRAFLDQGITFSLSGQERPFPLDLVPR
VISAAEWARLERGITQRVKALEMYLDDIYGDQEILNDGVIPRRLVTSCEH
FHRQAVGIVPPNGVRIHVAGIDLIRDEEGNFRVLEDNLRSPSGVSYVMEN
RRTMARVFPNLFATHRVRAVDDYAAHLLRALRNSAATNEADPTVVVLTPG
VYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRR
IDDAFLDPLQFRADSVLGVAGLVNAARAGNVVISSAIGNGVGDDKLVYTY
VPTMIEYYLGEKPLLANVETLRCWLDDEREEVLDRIDELVLKPVEGSGGY
GIVFGPEASDKELAAVAKKIRDDPRSWIAQPMMELSTVPTQVGSSLAPRY
VDLRPFAVNDGEDVWVLPGGLTRVALVEGSRVVNSSQGGGSKDTWVLASR
ASSGDHELGAAEVVRSLPTAMPDPLVDDAPRSMAQQPQPTGPPQREQLEQ
QQQQRAGR
>MAP3040c hypothetical protein
MTSQPNDAHWQRPGESPEPTPGRPASARLVDPEDDLTPVGYPGDFNPSTG
TTTVIPYGGAAAVAGSGAAGYHLLEQQEPLPYVQPHSAARHAAPEPTDVD
DDEHHDRLLDVGRRGTQHLGLLVLRAGLGVVLGAHGLQKLFGWWGGSGVT
GLRNSLSDVGYQHADILAYVSAGGELVAGVLLVLGLFTPLAAAGALAFLI
NGLLATVSARPHAHTFSFFLPQGHEYQITLIVMATAVILCGPGRYGLDAR
RGWAHRPFIGSFVALLAGIAAGVGVWVALNGVNPIG
>MAP3239 hypothetical protein
MPAGSRARFAAKPLTHSGTPVPLWLAAFQEAPLPALDLAGCPGLVVVAAH
PDDETLGVGAMITQLVAIGVRVQVVCVSDGGARPGSAASERLRAQTIRRF
ELRSATNTLNAPPPLSLGLPGGELTAHEDRLTGALTEILRAAGPGAWCAA
PWRGDGHPDHEAVGHAAWAACAHTGAALLEYPVWMWHWAVPGDPAVPWER
AHAVPAPAWAVSRKRLAAQRYRSRFEPTGGSAPALPGFVLARLLAVGEVV
FR
>MAP0630c hypothetical protein
MNNLYRDLAPVTEAAWGEIELEASRTFKRHVAGRRVVDVSEPGGPAAAAV
STGRLIDVEAPTNGVVAHLRASKPLVRLRVPFTLSRYEIDNVERGANDSD
WDPVKEAAKKLAFVEDRAIFEGYAAASIDGIRSASSNKPLALPADPREIP
DVITQAISELRLAGVDGPYSVLLSADVYTKVSETTEHGYPILEHIDRLVP
GDIIWAPAIDGAFVLTTRGGDFDLQLGTDVSIGYTSHDADTVQLYLQETL
TFLCYTAEAAVPLTS
>MAP3953 hypothetical protein
MEPRPPSGVVITAAAAEVLARLQRQHGPVMFHQSGGCCAGSSPMCYPVGD
FLVGDRDVLLGVLDVGADGVPVWISGPQYAAHYRDKHTQLVIDVVPGRGG
GFSLEAPDGVRFLSRGRVFSAAEQAAVAAAPIITGADYQRGERPAARGAV
VADAACRTCP
>MAP2881c hypothetical protein
MGTAPIRVFQVGSGNVGSEMIRRIATQPDLELIGVHCYSPEKIGKDTGQF
AGLAPNGVKFTGTVEEIIAAKPDVLTFHGVFPDEDLYVKVLEAGIDIVTT
ADWITGWRRDKNHPHPSGKPVTQLLAEAAAKGGATFYGTGMNPGLNQILG
VVCSADVAEIENVTTIESVDVSCHHSKDTWIEVGYGQPVDDPEIPAKLEK
YTRVFADSVYMMADCFDLTLDEVTFSYELGACTKDVDLGWYTLPKGSLGG
NYIKYQGMVDGVPRVETHLEWQMTPHTDPSWNVKGCYITQIKGDPCIYNK
HMIFPKPGVDLSNPDNFASIGMTVTGMPALAAIRSVVAAPPGLLTSADLP
LRGFAGRFKK
>MAP1508 hypothetical protein
MATRFMTDPHEMRAMAGRFEVHAQTVEDEARKMWASSMNIAGAGWSGQAQ
ATSYDTMGQVNQAFRNIVNMLHGVRDGLIRDANNYEQQEQASQQILSS
>MAP0317c hypothetical protein
MQPGPGGDMSALLAQAQQMQQKLMEAQQQLANAEVHGQAGGGLVKVVVKG
SGEVVAVKIDPSVVDPSDVETLQDLVVGAMADASKQVTRMAQERLGSLAG
GFGAPPGQPPAPAPGV
>MAP0276 hypothetical protein
MNAAPSDPSRRVWQAMLTWRAQDVSRMESVRLQVSGNRIKANGRIVAAAT
DANPAFGAYYDLQTDETGATKRLGMTVTLAERERVFSFARDEENMWLVTD
PQGEHRAAYNGALDVDVEFSPFFNALPIRRLGLQERAASVTLPVVYVNVP
EMSITADTVSYSSTGSRDEIKVHSPISDTTVSVDEQGFIVDYPGLAERI
>MAP0099 hypothetical protein
MRVAAKIAHKFEAAKGSAKKVFGRATGNTGMRAEGRAGQVKGNAKQAGDK
LNDAFKH
>MAP2721 hypothetical protein
MDLRALEELPLTYSEVGATASGDLPAGYHHQHVERQIGTGAERFEQAAGA
VLRWGMQRGSGLRVQASSEVAVVDAVVVVRLGFVPAPCRVVYVVDEPDIR
GFGYGTLPGHPESGEERFVVRHDPATSAVYAEVTAFSRPATWWARAGGPV
LRVGQRVIARRYLRAV
>MAP3083 hypothetical protein
MQWTRSVKGSLAAGPALSARGYLGLNAQTPAGCSLMEWENTDNGRQRWCV
RLVQGGGFAGPLLDGFDNLYVGQPGAFLSFPVTQWTRWRQPVIGMPTTPR
FLGHGQLLVVTHLGQVLVFDSHRGQVAGSPLDLVDGVDPTDATRGLADCA
PARPDCPVPAAPAFSAATGMVVLGVWQPGAPSAGLVGLKYHPGQSPLLTR
EWTSDAVGAGVIASPVLSADGSTVYVNGRDQRLWALRAADAKVKWSAPLG
FLAQTPPAVTPQGLIVAGGGPDTRLAAFRDAGDHADQVWRRDDLIPLSSS
SLAGVGYTVVSGPPANGAPGMSVLVFDPGDGHTLNSYPLPAATGYPLGVS
VGTDRRVVAAISDGQVYGFAPA
>MAP0809c hypothetical protein
MRLILNVIWLVFGGLWMAVGYLAAALVCFLLIITIPFGFASLRIASYALW
PFGRTIVDKPTAGSGALIGNVIWVVLFGVWLAIGHLLSAAAMALTIVGIP
LALANLKLIPVSLMPLGKQIVPVGSPVPHAPPVAA
>MAP0455 hypothetical protein
MVQFDGLRPARLKVGIISAGRVGTALGVALERADHVVVACSAISHASRQR
AQRRLPDTPVATPPEVAAGAELLLLAVPDSELAGLVSGLAATEAVRAGTI
VAHTSGANGVGILAPLTRQGCIPLAIHPAMTFTGSDEDISRLPDTCFGIT
AADEVGYAIGQSLVLEMGGEPFSVAEDARVLYHAALAHASNHLVTVLSDA
LEALRAALRGSELLGQQTVDEQPGGIAERIVGPLARAALENTLQRGQAAL
TGPVARGDAAAVAGHLAALHRVAPELAQAYRVNALRTAQRAHAPQEVVEV
LAP
>MAP1845c hypothetical protein
MHRPPVVHPRRRIPVTHALIVLLALLIGVVAGLRSLTAPAVVAWAALIGW
VDLHGTWASWMANIITVIVFTVLAVGELVNDKLPKTPARTAAPIFAARIV
LGGLAGAALGAWPHWTFTALGAGVIGAVLGTLGGYHVRRRLVAATGGKDL
PIALLEDVVAIAGGFAILAATGHVLTDYLLTAVK
>MAP1069 hypothetical protein
MRDFTCPKCGQRLTFENSTCLNCGSALGFSIEQMALLVISEDETSGHAGF
VPASEYQLCANLLLAECNWLVPVNEPRLLCASCVLTTERPNDADSVGLAE
FARAEAAKRRLIAELHELRLPIVGRDQDPDYGLAFRLLSSAHENVLTGHQ
NGVITLDLAEGDDVHREQLRVEMDEPYRTLLGHFRHEIGHYYFYRLIAPH
PDHLRRFNELFGDPDADYQQALDRHYSEGAPEGWQENFVSSYATMHASED
WAETFAHYLHIRDTLDTSAWCGLAPASATIDRPALGPSAFQTIIDTWLPL
SWSLNMVNRSMGHDDLYPFVLPAAVLEKMQFIHAVVDGAG
>MAP1145c hypothetical protein
MAVESSLAWLLRSLGVDAFLNEIWATRHHHIDRCRPGYFDGLLPGPSAVD
GLLEQVRPDPAAVRLVKDGEDRDPAGYRRGDGTLNAGGARDGLADGYTLV
LNGLERYLRTVASLSHAIEVELNFPTRVNAYVTPPHSTGFVPHYDPHDVL
VLQIEGCKTWRVSDEPPVPPQQIQSRKGVGADGPASWTDVCLRPGDVLYL
PRGQVHSARTHSEPSVHLTVGLHAPTVLTLVTSALHALSLRDPRVHDRLP
PRHLDDAQVRAGLGEAVRDAVRALDDDAVIADGLGAMEEVLVRRGRCPPV
GSVRDTVGVDGHTLVRKHQPLYARVMRAGDGVVLQFAQLSVSAGSDHEAA
MLFLAGRAEPFRVAELPGLSAAQQVGLAQTLILNGFLARLSDD
>MAP0974 hypothetical protein
MTKLHQNPSPMLRLVVGGLLVVLAFAGGYAVSVCKTVTLTVDGVAMRVTT
MKSKVIDVVEENGFVVDDRDDLYPAADVAVHDAGKIVLRRSRPLQISLDG
HDSKQVWTTASTVDEALAQLAMTDTAPAAANRGSRVPLAGMALPVVSAKT
VRINDGGAVRTVHLAASNVAGLLSAAGAPLAESDQVAPPASAPIVDGMEI
QVTRNRIQQVTERMPLPPNARRIEDPDMNMSRQVVEDPGSPGTQDVTFSV
ATVNGVETGRLPIANTVITPARESVVRVGTKPGTEVPPISDGSIWDAIAG
CEAGGNWAINTGNGYYGGVQFDQGTWERNGGLRFAPRADLATREEQITVA
EVTRERQGWGAWPVCSGRAGAR
>MAP3364c hypothetical protein
MALQTDTFCLDRRRRRRYCQPPTWCDSRGGFQMTTVDLQQPSDPAAWRDK
KRRLWLMGLIAPTALFVMLPLVWALNRLGWHAAAQVPLWIGPILLYVLLP
ILDLRFGPDGQNPPDEVMERLENDKYYRYCTYVYIPFQYLSVILGAYLFT
ATDLSWLGFHGGLGWAGKLGLALSVGVLGGVGINTAHEMGHKKDSLERWL
SKITLAQTCYGHFYIEHNRGHHVRVATPEDPASARFGETFWEFLPRSVFG
SLRSALRLEAQRLRRLGKNPWNPLTYPSNDVLNAWAMSIVLWGVLIAVFG
PGLIPFVVIQAVFGFSLLEAVNYLEHYGLLRRKIDSPSGKARYERCTPEH
SWNSDHVVTNLFLYHLQRHSDHHANPTRRYQTLRSLDGSPNLPSGYASMI
SLTYFPPLWRKVMDHRVLAHYGGDITRVNVSPRRRAKLLARYPAVTA
>MAP3333c hypothetical protein
MCGRFAVTTDPAQLAQKIKAIDETTGAAPSDTAPNYNVAPTSTIATVVSR
HSEPEDEPTRRVRLMRWGLVPPWAKAGPDGAPETKGPMLINARADKVTSS
PAFRASAKSKRCLIPMDGYYEWRVNSDAAAGKKPRKTPFFMYSEDGEPLF
MAGLWSVWKPAKDAPPLLSCTIITTDAPGELAQIHDRMPLVMPERDWDRW
LDPDAPVDEELLTRPPDVHAIGMREVSTLVNNVRNNGPELLEPAKPEPEQ
ARLL
>MAP1007 hypothetical protein
MNPGANSPRPYRRLRHLATRADKHSKGGTSVSDANGPTARFGIDNTAVVK
VPVHHGPISDIDVSPDGRRLLVTNYGRDTVSVIDTHTCRVASTIAGLSEP
FAVAMSAADPNYAYVSTATAAYDAIEVIDVVTNWRIATHRLAHSLSDLAV
SPDGRYLYASRNAVRGADVAVLDTTTGELEVIELAAAAGTTTACVRVSAD
GRRLFIGVNGPNGGGLAIIETRTRTDGRRVGGRSRLVGTVELGLPVRDVA
LGNDGNTAYVASCGPVVGSVLDVVDTRAATIVSTHKINEITGPLTRMTLS
RDGERAYLISDDRVTVLGTRTQDVLGEVRVTKHPSALVESPDGHYLYVAD
HSGVLSVARLSSGTAPAAPCSGADEDDDATTGWLPELAPWEPVLA
>MAP0817c hypothetical protein
MVAPLLRAQIDIDAPVATVWKLVSDLRRMPQWSPQCRWMKPLGPVRQGTR
TINLNRRNRLFWPTTCTVVEIIPDRKLTFRVDTNNTIWSYELEPTDTGTR
LIESRHAENGVTAFSNLSVKAFLGGTDNFERELLDGMNASLARIKAAAEN
TDRR
>MAP1337c hypothetical protein
MQIWLTGAPLLGGSVQGARHAVELFAAFGLTALIGLERTIQGKSAGLRTQ
TIVGTSSALIMLVSKYGFSDVLSAGSIVLDPSRVAAQVVSGIGFLGAGII
ITRRGAVHGLTTAAAVWESAAIGLATGAGLLLLAIAVVGLHFVSALAFNA
VERQLNARLRGTVRLRIIYANGRGVLRELLRLSGQRDWQLTELDADPHDI
DDGEVAVSMTLSGAKIADAAHVFAEVDGVAAVLSADDATD
>MAP2237 hypothetical protein
MNIMRQSHGKGAPMSKPSTTRPLRVIQWTTGNIGRRSLHAIIGRPDMELV
GVYAHGAEKVGVDAAELAGWPEPTGVAATNDIDALLSLGADACCYNPLWP
NVDELVRLLESGVNVCTSAAWITGGKQTPEDRRRIEEACRKGNSTIFGSG
AHPGMTNMVGMVLSASCERVDEIRITESVDCSTYESAETQTAMGFSQDPD
TPGLAESVRRESEVFAESAAMMADAIGAKLDKMTFDVTFTAATGDSDLGF
MKIPAGTVGSVYGYHRGWVGERNVVSVGFNWTMGDHVVPPKPLEHGHVIQ
VFGLPNMRTVLHCLPPKDWTEPGFMGLGMIYTAMPVTNAVPAVVAARPGI
VTLADLPPVTGRLG
>MAP0960 CpsH, CpsH
MIFVIMGMEVHPFDRLARAVDELARTDAVGEEFFVQLGTCRFEPRHARFE
RFLSFGDVCEQIRRASVAVTHAGAGSTLLCIEQGKHPVMVPRRSSRGEHV
DDHQLPFAEKLATAGLATVVRETTELPAAIAATRSRSAPADALGRARELT
GWLETFWRGLA
>MAP1588c ahpD, AhpD
MSVENLKEALPEYAKDLKLNLGSITRTTELNEEQLWGTLLASAAATRNTQ
VLTEIGAEAADTLSAEAYHAALGAASVMAMNNVFYRGRGFLDGKYDDLRA
GLRMNIIGNPGVEKANFELWCFAVSAINGCPDCVASHEHTLREAGVSRET
IQEALKAAAIISGVAQAIVASQTLATAG
>MAP1881 lppL, LppL
MSRRRTQGNPLLLPHALLRRGRHKLAIQGCIWGVALLLSGCSSNPLRGAP
PTIEPAGAAVSPPSAQQPAGSVRPLAAHAAAAVFDAGTHQLAVLAAASDP
ASVTLFGDPQVPPRVAALPGPATALTGDGRGTVYLATRGGYFVLDLATGR
VVRVGVADAAQTEFTAVAQRADGVLVLGSADGAVYTLASPTPAVPTAGPT
TGSTTGSTAAVANRNKIFARVDSLVTQGNTTVVLDRGQTSVTTIGADGRA
QQALRAGRGATTMAADAAGRVLVVDTRGGQLLVYGVDPLILRQAYPVSQA
PYGLAGSRALAWVSQTVSNIVIGYDLSTGIPVEKVRYPTVQQPNSLAFDE
ASDTLYVVSGSGAGVQVIDHAAGKR
>MAP1670c lppS, LppS_1
MASKSSDRLVGRILGVRRPRRFLAAVFVVVAVAGAIQLAAAAPCPHGGCP
GGAAPKPTGPARVRITPGPGARNVDPVAPVLVKAETGTLAGVEMVNEGGT
AVQGVMTPDNLVWKPTVALGYGRNYTLTITSRGTDGVETKQVSTFSTLRP
SNQTKVSFTTTSESPLRDGASYGVGTVIVAHFDEQIADRAAAERHLTVTT
NPKVTGSWFWVDGQNVHWRPEHYYAPGTTVTAEAKIYGIALGDGLFGQDD
SRVSFRIGDAHVSIADDATHQVSVFDNGVLVRTMPTSMGMGGTENIGGRS
ISLWTPPGVYTVLDKGNPVVMDSSTFGLPKNSRLGYRETINYATKISSDG
IYLHELDATVWAQGHQDTSHGCLNLNADNAKWFYDFSVPGDVVEVRNSGG
PPLTLAQNGDWTLSWDQWRRGSALKPT
>MAP2322c lppS, LppS_2
MRREMEGHWMTPLRGRRSWLAAAMALVAVGAVACGGGHTAAPPKVIFDKG
TPFADLLVPKLTASVSDGAVGVTVDAPVTVSVADGVLASVTMVNENGRSI
SGQLSPDGLRWSTTEQLGYNRRYTLTAKATGLGGAASKQMTFETSSPAHL
TMPYVSPADGEVVGVGEPVAIRFDENIANRAAAQKAITITTNPPVEGAFY
WLNNREVRWRPEHFWKSGTVIDVAVNTYGVDLGDGMFGEDNVKTHFTIGD
EVISTADDTTKTVTVRVNGEVVKTMPTSMGKDSTPTANGIYIVGARFKHI
IMDSSTYGVPVNSPNGYRTEVDWATQMSYSGVFVHSAPWSVGAQGHTNTS
HGCLNVSPSNAEWFYDHSKSGDIVEVVHTVGPTLPGTEGLGDWNIPWSQW
KAGNANT
>MAP0440c lpqG, LpqG
MPVATNAPKCVRSAIAAAAAGLAVVAISACDKHGPPPSGAAPRQVTVVGS
GQVQGVPDTLTADVGIEFTAPDVTTAMNQTNERQQAVINALVGAGIDHKD
ISTTEVALTPQYSNPEAGGSAAITGYRASNAVAVKIHPPDAASRLLALIV
GTGGDATRIRSVSYSIADDSQLVKDARTRAFNDAKSRAEQYAQLSGLHLG
KVLSISEATGNAPPPGGPPPSPRAMPMAVPLEPGQQTVKFTVTASWELD
>MAP1872c mbtH, MbtH_2
MSTNPFDDDNGSFFVLVNDEEQHSLWPAFADIPAGWQMVYGEADRAACLD
YIEQNWPDIRPKTLRDRLAAQRRGAGK
>MAP1419 mbtH, MbtH_1
MQLNPFDDDGGRFFVLVNNEEQHSLWPTFADIPPGWRVVFGEADRTACLQ
YIEESWPDIRAKSLRDTLSAGL
>MAP2169c mbtH, MbtH_3
MSANPFDDDTATFVVLVNDEDQHSLWPTFAETPPGWRVVFGAAGRADCLQ
YVEQNWSDIRPKSLIEALAGE
>MAP1524 mgtC, MgtC
METLSAVDFLLRLGVGVGCGALIGLERQWRARRAGLRTNALVAAGATLFV
LYAAATSDTSPTRVASYVVSGIGFLGGGVILREGVNVRGLNTAATLWCSA
AIGVLAASGHLVFAAIGTGTVIGIHLLGRPLGRLIDRDSSADEDESLLPF
LVQVVCRPKHEKYARAQIVQHAGGNDITLRGIHTGHPADDELTLTAHLLM
NGDAPARLERLVAELSLQPGVRAVQWYAGDEAQASDDRR
>MAP0016c pknB, PknB
MTTPQHLSDRYELGEILGFGGMSEVHLARDVRLHRDVAVKVLRADLARDP
SFYLRFRREAQNAAALNHPSIVAVYDTGEAETPSGPLPYIVMEYVDGVTL
RDIVHTDGPLPPRRAIEIIADACQALNFSHQNGIIHRDVKPANIMISTTN
AVKVMDFGIARAIADSGNSVTQTAAVIGTAQYLSPEQARGDAVDARSDVY
SLGCVLYEILTGEPPFTGDSPVAVAYQHVREDPVPPSQRHEGISADLDAV
VLKALAKNPDNRYQTAVEMRADLVRVHNGEKPEAPKVLTDAERSSLLSSG
AGAAGPSRTDPLPRQVLADSGEDRTVGSVGRWIVAVAALAVLTIIVVIAF
NTFGGGARDVQVPDVRGQVSADAIAALQNRGFKTRTLQKPDSAIPPDHVI
GTDPGANASVSAGDEITINVSTGPEQREVPDVSSLSYSDAVTKLKAAGFS
KFKQANSPSTPELLGKVIGTNPPANQTSAITNVITLIVGSGPETKQVPDI
AGQTVDIAQKNLTVYGFTKITQVQVDSPRPAGEVIGTNPPKGQTVPVDSV
IELQVSKGNQFIMPDLSGMFWTDAEPRLRALGWTGILDKGPDVDAGGAQS
HRVVYQNPPAGAGVNRDGIITLKFGQ
>MAP1025 pra, Pra
MPMTDQPPPGGAYPPPPSSPGSPGGQPTPHPGGQQPPPPPGGSYPPPPPP
GGSYPPPPPPSGGYAPPPPGPAIRTLPTQDYTPWLTRALAFVIDILPYVV
VHGIGTAILVATQQTACITDVTQYAVNQYCATQNSTLGLVAQWLASIVGL
FYLIWNYGYRQGTTGSSVGKSVMKFKVVSEVTGQPVGFGMSVVRALAHFV
DAIICFIGFLFPLWDSKRQTLADKIMTTVCLPLDGSESPPS
>MAP3346c pvdS, PvdS
MGLSFVAVSSAKNDGVPLKAGKKKSARAAVKIPDAVYEAELFRLQTELVK
LQEWVRHSGARLVVVFEGRDAAGKGGAIKRITENLSPRVARVAALPVPTD
RERGQWYYQRYIAHLPSKGEIVLFDRSWDNRAGIEKVMGFCTPQEYVLFL
RQTPIFEQMLIDDGILLRKYWFSVSEAEQLRRFKARLKDPVRRWKLSPID
LESVYRWEDYSRAKDEMMVHTDTPESPWYVVESDIKKHARLNMMAHLLST
IPYHAVEAPPVKLPRRPVVTGNYQRPPRELSRYVDDYAATLI
>MAP1893c yfiH, YfiH
MDSTRLADKRFADTGHVSVRIRRVTTTRAGGVSKPPFDSFNLGDHVGDDP
AAVAANRARLAAALGLKPERVVWMNQVHGDRVEVVHGPRDGAVDATDALV
TGTARLALAVVTADCVPVLLADARAGVAAAVHAGRVGAQRGVVVRTVEAM
RALGAQPADMSALLGPAVSGANYEVPAAMADEVEAALPGSRTTTAAGTPG
LDLRTGIACQLRELGVTAIDIDPRCTVADPALFSHRRDAPTGRLASLIWM
E