Gene list

Applied filters:

COG category: Carbohydrate transport and metabolism
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 215

# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP0057c hypothetical protein
MTGAHKRLSVHNVTFYGAPPAELQAHWAALGVSRLSILDNQLLDPQFPML
LRRNDYTVEAVYHLFAGGHLTSDPGAAREALIAVIDAAASVGAHTVYLLT
GGRTGLTWPQAAERFAAMVAPCAVRAKAAGVVLAVENASSLYADIHIAHT
LRDTVALAEMSDLGICIDLFHCWAEGDFEAQLPRALPRTALIQLSDYVLG
DRALPGRAVPGDGAIPIEAFVAQALAAGYAHGFDLELIGPRIEREGRLES
ARRACDVVGAMLDKLGG
>MAP3169c hypothetical protein
MGGVAARRVAALRVMLVAYPIETLLLGALAIFVGGPIHPGALLWGGLYGI
GMAFGMWAFFAALGSGPISVVSPLAAVLNAAVPVAVGMALGERPGQAALV
GVVLALAAVMLVSREAPAEGTPYRFTPKVAWLTVVAGSAMGLNLVFLHQA
PHACKLWPLVFARVAASLVVFAMAGTSQNLKMPRGRPLRLAVAVAVLDIF
ANITMLAALHTWLLSLASILISLYPAATVVLAMVVLRERVTRWQGIGMVL
AMGSVAMIASA
>MAP3432 hypothetical protein
MSRENRSRRRLIGGAYRSLRLLGAVAAVALAASPLTPRTSLAAAAIPQPS
HIVIVVEENRSESGIIGNKSAPFITALAASGANMTQSFAETHPSEPNYLA
LFAGNTFGVTKDLCPVNAGAAPNLGSELLAAGYTFVGYAEGLPSPGSPVC
SAGKYARKHVPWANFTNVPAASSLPFSAFPMGNYASLPTVSFVIPNNDNN
MHDGSIAQADSWLNRQLSGYANWALANNSLLIVTFDEDDNSNVGASRNQI
PTVFYGAHVRPGNYAEQINHYNVLATLEQMYGLPKTGYAAGAAPITDIWG
>MAP2215 hypothetical protein
MESSIDLRTPGPLAPNAALRNPFPPIADYAFLSDWETTCLISPAGSVEWM
CVPRPDSPSVFGAILDRSAGHFRLGPYGVSVPSARRYLPGSLIMETTWQT
HTGWLIVRDALVMGPWHDLERRSRTHRRTPMDWDAEHILLRTVRCVSGTV
ELMMSCEPAFDYHRVGATWEYSANAYGEAIARATKQPDAHPTLRLTTNLR
IGLEGREARARTRMKEGDDVFVALSWTKHPPPQTYQEAADKMWQTTECWR
QWINIGNFPDHPWRAYLQRSALTLKGLTYSPTGALLAASTTSLPETPQGE
RNWDYRYAWVRDSTFALWGLYTLGLDREADDFFAFIADVSGANNNERHPL
QVMYGVGGERSLVEEELHHLSGYDHARPVRIGNGAYDQVQHDIWGSILDS
FYLHAKSREQVPETLWPVLKKQVEEAIKHWREPDRGIWEVRGEPQHFTSS
KVMCWVALDRGAKLAERQGEKSYAQQWRNIAEEIKADILAHGVDSRGVFT
QRYGSDALDASLLLVVLTRFLPPDDPRVRNTVLAIADELTQEGLVLRYRV
EETDDGLSGEEGTFTICSFWLVSALVEIGEVRRAKRLCERLLSYASPLHL
YAEEIEPRTGRQLGNFPQAFTHLALINAVVHVIRAEEEADGSGMFQPANA
PM
>MAP3152c hypothetical protein
MYEQVNSSAAEPDDIGSRIDPVLARSWLLVNGTHPDRFQSAVNSRADIVV
FDIEDAVAPKDKNAARDNVVSWLRAGNVDWVRINGFGTPWWADDLTALAG
TPVGGVMLAMVESVDHVTETAKRLPDVPIVALVETARGLERITEIAATKG
TFRLAFGIGDFRRDTGFGEDPATLAYTRSRFTIASKAADLPSAIDGPTIG
SNPLKLIEATAVSTQFGMTGKICLTPDQCSVVNEGLSPSQDEIAWAKEFF
AEFERDGGEIRNGSDLPRIARATKILELARAYGIEALDIDEEERDHSPAP
SDTYHY
>MAP1039 hypothetical protein
MRRCRVEGRGGGAAGGSARSGARAVPSPGGCGAAPTRVAGRRTSPVTAAS
PASMATGGGGFDPAAWVTSGNRRVPMSSRKYAGIQRGDVMKYAEDGHTRG
LSMPRSRGAVSGLLLVILGAWGALIPFVGPHFNFAYTPDRDWAWSSARGW
LEVAPGAATALGGLLLIVAGNRVAAMLGGWLAVLAGAWFVVGGQLAPLLG
IGSAGDPIAATERKRALLEVTYFSGLGALIIFVGGVVLARTSARLARDVQ
PLASDAPAAPAVEPYRDPAYDPADVSSGALTKPRTSADPEPKRGWRKNRA
GGNAAYLRWPHPQQ
>MAP2534c hypothetical protein
MTTATPRTPGGGRRAPSGPAPGAHRWDLITRSSAHSQNPWNPLWAMMIGF
FMIMVDSTIVAIANPTIMADLHIGYDTVVWVTSAYLLGYAVVLLVAGRLG
DRFGTKNLYLIGLAVFTVASVWCGLAGSAAMLIAARVVQGVGAGVLTPQT
LSTITRIFPPERRGVAVSVWSATAGAASLVGPLAGGVLVDGLGWQWIFFV
NVPIGVLGLALAYWLVPVLPTQSHRFDLVGVGLSGVGMFLIVFGLQQGQA
AHWQPWIWALIVAGVGFVTVFVFWQSVNVREPLIPLVIFADRDFSLCNIG
VAIISFAATAMMLPLTFYAQAVCGLSPTRSALLIAPMAIANGVFAPFVGK
IVDRYHPRPVLGFGFSLLAIALTWLTFEMSPATPIWRLVLPFFAMGVGMA
FVWSPLTATATRNLSAQLAGAGSAVYNSVRQLGAVLGSAGMAAFMTWRIG
AEMPGQPAGGGEDSAGPVLPEFLRGPFAAAMSQSVLLPAFIALFGIVAAL
FLVGFRPWAHRDGGTDAFGPDDYGGDYDDDYDDDDAYVELILVREPEPEA
QQRAGQPRRPQPAPAPPADVRRRDPVESRRSVLDERPAQVQPIGFAHNGS
HVDGGKRLRQVAVRRAPKPGPPADRFTRPPRRHHPGPAGHHLGEGESLRG
QHHRPDPDDDPTGYGRHSSGN
>MAP1778c hypothetical protein
MSGSAPVVVMGVSGSGKSTVGVTLARRLRVPFVDADTLHPPANIAKMAAG
TPLGDDDRRPWLDEVGEWLAAHRDGGVAACSALKRAYRDRLRAHCPDVAF
LHLSGSAALLGPRLAARSGHFMPAALLQSQLDALEPLGPDEAGLTVDAGR
EVDSIVDVVLRARR
>MAP3809c hypothetical protein
MKLNAHNLTGLKIPVPTYDRSQIGIGIAHFGVGGFHRAHQAMYLDRLLND
GLARDWGICGIGVLPDDRRMRDALRSQDYLYTLILEHPDGTQEPRVIGSI
VDFRYAPEDPGSVVDLLADPATRIISLTITEGGYQPPLSAAFELVAEALN
RRRERGLPSPTIVSCDNIIKNGDVARRAFTAHAEQVNPELAQWIRENTTF
PNSMVDRITPTPTPDLVDRLAAEFGVQDAEPVVAEPFAMWVLEDDFADGR
PPLDKAGVRLVDDVAPYEAIKLRLLNAGHQGLCYFAYLAGYREVHEAAQD
PVIAEFLARYMDSEAMPTLPRMPGLEDFRDGLIPRFANAYVGDQVARLCA
DSSDHIPKWLLPVVHDNLRSGGPVQLSAAIVASWARYAEGVDEQGRPIEV
VDRLADTLVPLARSQRENPDAFLANRAVFGDLIDAPRFREAYRWALDSLH
RRGARATLRALIGADGS
>MAP2688 hypothetical protein
MIGPMGDPSLTAELGRVLVTGGSGFVGANLVTTLLERGYQVRSFDRAPSP
LPAHPHLEVLQGDITDAGVCAAVVEGIDTIFHTAAIIDLMGGASVTEEYR
QRSFAVNVGGTENLVRAGQAAGVQRFVYTSSNSVVMGGQNIVGGDETLPY
TDRFNDLYTETKVLAERFVLGQNGVDGMLTCAIRPSGIWGRGDQTMFRKL
FESVIAGHVKVLIGRKSARLDNSYVHNLIHGFILADQHLTPGGTAPGQAY
FINDAEPINMFEFARPVVEACGVNWPRVRVNGPIVRAAMTGWQRLHFRFG
IPAPLLEPLAVERLYLDNFFSIAKASRDLGYQPLFTTEQAMSECLPYYVG
MFEQMKRQALAGKASA
>MAP2967c hypothetical protein
MGVTVAVTGPTGEIGRSAVTALEREPGVDAIIGMARRPFSPSSRGWQKTT
YQQGDILDREAVDAVVAQADVVIHLAFIIMGSRDESARVNLQGTRNVFEA
TVAAGRPRRLVYTSSVAAYGYHSDNPVPLTEDVPARGSAEHYYSAQKAAS
EAMLAEITKDSPLEVFVLRPCIVAGPGATALADAMPWNQLPGPLRAIVKA
VQAIPVLKPVVPDPGYPLQLVHHDDVASAIALAATAPAPPGAYNIAGDGV
VTVADVAKALGGRPVRVPAVAATAASAAISKAPLVPSMLEWLHTARTSMV
MDTTKAKTQLGWRPVHTSAQALAALASAV
>MAP2851c hypothetical protein
MRVAVVAGPDPGHSFPAIALCRRFADAGDTPTLFTGAEWLDTARGAGVDA
VELDGLAATDEDVDAGARIHRRAARMAVLNVPALRDMAPDLVVSDVITAG
GGMAAELLGIPWIELSPHPLYLPSKGLPPIGSGLAPGTGLRGRLRDATMR
ALTARSWRAGLRQRAAARAEIGLPARDPGPLRRLIATLPALEVPRPDWPD
EAVLVGPLHFEPTDRVLDIPAGSGPVVVVAPSTALTGARGLAEVALSCLT
PGETLPAGARLAVSRLAGPALAAPPWAVVGLGSQAELLRHADVVVCGGGH
GMVAKTLLAGVPLVAVPGGGDQWEIANRVVRQGSARLIRPLSADALVVAV
NEVLSSPGYRAAAQRAAAGIADVADPVRVCREALAG
>MAP3420c hypothetical protein
MMDLFAPPEVTSTLIHTGPGAGSLIEAAAAWQRVAVELENSVSSYASTLS
SLIESWDGPSAMAMLQSVQPYLLWLRETAQQSAQLANSAEAAATAFGTVR
STVVHPSVVSANRTRLAQLLATNRFGTNTAAIAETENEYQTMWANNSAAL
SRYQAASSQATSPLTQFNSPLAVTDPGGTANQQAAVMKASVDSSGSSVGS
VLNDLNMPGGFDPNAGWFNYFSTWGNQFISSGFPINLLGVWAQLATAQGV
ASVGGDIGSGLSEGLGATTASLANAIKGIGAGAVAPSGAMGVGVSLGKLT
APPAVVGLLPGTQTGVQLASAASPLPAAESGFPLMPMMVPPPTTSAGTGW
RKRKQQKYEDVAYGREVKGKVMPRNPSAG
>MAP1706 hypothetical protein
MAGLDNHLKRCRTALHGAVSAVIVAVVAVLGLAIAPVADAAAPMATLTVE
HTWQTGFIARFAITNSSTVPLSDWKLEFDMPAGQSVLHTWNSTLTQSGTH
YVLTPANWNRVIAPGGSATGGFRGVLTGTYSPPTNCVLNGQYRCT
>MAP0299c hypothetical protein
MGVVVIGQIGRDLVLRTDRPPTAGESATVLRREELLGGKGANQAVGLAQL
GVPVALIGVVGDDQAGTSILQQAQRDGIDVSKVARRGTTALLVDVVAAPP
ERMLLEDVPDSSLVRVADLDRSSSLFDTADTVSIQLQQPAATALEAAQRA
RQRGLRVVADGVPAPQVRADLLATLDVLRADATEASIIAGTEITTVEQAF
ELADHLLSGGPELVALAVPDVGDLLTWRGGSELFEFADVEVVDPTGAGDA
FVAGLVAALRDGAAPREAGRRAAAAAGATVQRLGGRPDLTGLRA
>MAP1596 hypothetical protein
MTTPRAAVNAPARADTGSGGERISPQRRNLIFVAIVLGMLLAALDQTIVA
TALPTIVANLGDAGHQSWVVTSYLLASTIVTALVGKLGDLYGRKRVFQAA
VLFFVAGSVLCGLAQSMAMLVGARALQGIGGGGITVTASALIGEVVPLRE
RGRYQGILGAVFGVTTVIGPLLGGYFTDYLSWRWAFWVNVPVSVIVIFVA
AAAIPALAASAKPVIDYAGIVFVGLGAAGLTLATSWGGSRYPWGSPTITG
LFAAAAVALGVFVVVERRAAEPILPVRLFASPVFTVCCVLSFVVGFAMLG
AMTFLPTYMQYVDGVSATTSGLRTLPMVVGMLFTSTGSGTIVGRTGRYKI
FPVAGTALMALAFLLMSRMQPSTPAVIQSLYLFILGAGIGLSMQVLILIV
QNTSDFEDLGVATSGVRFFRTIGSSFGAAIFGSLFVNFLNRRIGPALAAS
GAPPGAVSSPGALHRQPHEVAAPIVAAYAESLTEVFFWAAPVALVGFVLA
LFLREIPLRDIHDSTVDLGDAFGMPTTETPDQMLENAIARMLRGETGMRL
RSIAMRPDCRLDVAGLWGVLRINRYTQMYGAARLTDMAEYLRIPFEVLEP
TFSRLVTAGYAGSDGDRLWLTPAGAQQVGYVHSLLLAWLVDKLGRSPGFE
GRPDRQAVQAALERVAYRVLAQRDWHDEQPTAAITAAAR
>MAP1912 hypothetical protein
MATGTDTPAADPPAAKPSRGQWFSRLKAFAGSADSGPAKLGMLGSVLITL
GGLGAGSTRQHDPLLESIHMSWLRFGHGLVLSSIVLWTGVGLMLIAWLSL
GRRVLAGEATEFVMKATTGFWLAPLLVSVPVFSRDTYSYLAQGALLRDGL
DPYAVGPVANPNSLLDNVSPIWSITTAPYGPVFILVAKIITILVGNNVVA
GTALLRLCMLPGLVLLIWATPRLARHLGADEPTALWICVLNPLVLIHLMG
GVHNEMLMVGLMAAGIALTFGGRPVAGVTLITVAIAVKATAGIALPFLVW
VWARQLRDRRGYRPVPAFFAATAASLLIFAVVFAALSAVAGVGLGWLTAL
AGSVKIINWLTVPTAAANLIHAIGSGFFPVSFYPILRVTRLVGIAIIAIS
LPLLWWRFRRDDRDALTGIAISMLIVVLFVPAALPWYYSWPLAVLAPLAQ
SRRAVAIIGGLSTWVMVIFKPDGSHGMYSWLHFSLATAVALVAWYSLYRV
PQPAQ
>MAP0551 hypothetical protein
MTDFVKQHPRSPEGFFGWEAAGLRWLSSVDGGVPCARVLAVDATSLTLRR
LQSVPAGRDAAHEFGRRLAVTHDAGAAAFGAGPDGWDGPGYFGPLSQPLP
MSLRRHRHWGSFYAEERLVPMAERAAPRLAASTRDAIGAVAARCRAGDFD
DDDGPARLHGDLWSGNVMWTPDGVVLIDPAAHGGHRETDLAMLALFGCPH
YDAVLAGYQQVRALKPGWRNRIGLHQLYPLLAHVVLFGGGYAGQTDAAAR
AALAA
>MAP3592 hypothetical protein
MLRRIEHLPIGTVVRLPNADAAPIGRVADAGADAVIIAMIESPDQAAAAV
AATRYPPAGVRSFGPLRASLGHDPAAHESRVSVFAMIETAAALSGITQIC
AVDGLTGIYVGPADLAISLGAGVIGATRQPEVLDAIVHIHKAATKAGLVT
GIHAGDGSTGHAMAQLGFDMITLAAEAQALRRGAAEHLREATQ
>MAP1661c hypothetical protein
MLMPEIDRRRMIVTTGIGVLAAALPNPRAGASPAPPPAAPSGQSGTYLFQ
DEFDGPAGSAPDASKWAVAKARETIKDPTYWERPENIGQYRDDRRNVFLD
GKSNLVLRAAKNGPTYYSGKVQSLWRGGVGHTWEARIKLDCLTAGAWPAY
WLGNDDQGEIDVMEWYGNGNWPSATTVHAKANGGEWKTHNIAVDSAWHTW
RCQWDEAGIRFWKDYVDGAQPYFDVPASSLPDWPFNGPGYTVFPVFNLAV
AGSGGGDPGPGSYPADMLIDWIRVW
>MAP3815 hypothetical protein
MSVAVPIVSVMPALDPRERVERFVLRARKVMAHSLVRDRSDLLKELASGT
IKVQIQVDTVTGEATHQFSMELPPEEAFESFAARVRPFTIPKEPVYWAVV
LDALEELVSEDTLANIIDIEDLRAHWKTVVEGKKVAQAFYVVTEKGQLSD
VQLAEMWLNSDALHTQLIQSEVGKELSLDQRYRAATGVYARLGSCVNATH
YLVKYLVEEGLLELDPEVFTLPVLAKTSVEIKGAVYSAELGAEMPTDLSN
LDPEVWRPIHKDIDLLASSEHGSPEDGSSREPEE
>MAP0453 hypothetical protein
MGPTRKRDLTAAVVGAAVVGYLLVQGLYRWFPPITVWTGLSLLAVAVIEA
LWARYVRTKINDGEIGSGPGWLHPLAVARSLMVAKASAWVGALVLGWWIG
VLVYFLPRRSWLRAAAEDTSGAVVAAVSALALLVAALWLQHCCKSPPDSG
EHGEGAET
>MAP3762c hypothetical protein
MKFALAVYGSRGDVEPHAAIARELLRRGHEVCVAAPPDLRGFVESAGVTA
IDYGPDTRDVLFGKKTNPIKLLSTSKEYFGRIWLEMGETLTSLANGADLL
LTAVAQQGLAANVAEYCDIPLATLHCLPARVNGRLLPNVPSPLSRLAVSA
FWCGYWCVTNKAEESQRRRLGLSKASGSSTRRIVGRKSLEIQAYEDFLFP
GLAAEWAHWDGQRPFVGALTLGLPTDADAEVLSWIAAGSPPVYFGFGSLP
VKSPADTVAMISAACTRLDERALICAGTNDLTHVPRSGHVKIVAAMNHAA
IFPACRAVVHHGGAGTTAAGMRAGVPTLVLWMRNEQPLWGAAVKQMKVGS
SQRFSKTTEESLATCLRSILRPHYMTRAREVAKRMTKSSDSAAVAADLLE
NAARGETT
>MAP4123 hypothetical protein
MLTLGLDIGGTKIAAALVDSVGTLVHTAVRPTPNPAPADDVWDVVHALIA
EVVRAAGAPIAAVGIASAGPVDLPSGSVSPINIAGWHRFPLRDKVAAAVP
ATPVVLGGDGLCMALGEQWLGAGRGARFLLGMVVSTGVGGGLVLDGAPYP
GRTGNAGHVGHVVVELDGRPCTCGGHGCVETVASGPSMVRWARENGWSAA
PGAGARDLAAAAASDPLAQKAFHRSADALAAMIASVGAVCDLDVVVIGGG
VAQSGPLLFDPLRERLAHYAGLDFLSGLTVVPGELGGNAGLIGAARLATL
ARPAGS
>MAP3528 hypothetical protein
MNDAREAVEHHPEEGSHVQDGVVEHPEAEDFDNAAALPTDPTWFKHAVFY
EVLVRAFFDANADGAGDLRGLLAQLDYLQWLGIDCIWLPPFYDSPLRDGG
YDIRDFYKVLPEFGTVEDFVALLNAAHERGIRVITDLVMNHTSESHPWFQ
ESRHDPDGPYGDFYVWSDTSDRYADARIIFVDTEESNWTFDPVRRQFYWH
RFFSHQPDLNYDNPAVQEAMIDVIRFWLGLGIDGFRLDAVPYLFEREGTN
CENLPETHAFLKRVRKVVDDEFPGRVLLAEANQWPADVVEYFGDPSTGGD
ECHMAFHFPLMPRIFMAVRRESRFPISEILAQTPQIPEMAQWGIFLRNHD
ELTLEMVTDEERDYMYAEYAKDPRMKANVGIRRRLAPLLDNDRNQIELFT
ALLLSLPGSPVLYYGDEIGMGDVIWLGDRDGVRTPMQWTPDRNAGFSKAN
PGRLYLPPSQDPVYGYQAVNVEAQRDTSTSLLNFTRTMLAVRRRHEAFAI
GTFEELGGSNPSVLAFVRQVSNDGDTVLCVNNLSRFPQPIELNLQHWSGC
IPVELTGHVEFPRIGHLPYLLTLPGHGFYWFQLTACEEDT
>MAP2715c hypothetical protein
MSDPEQAEQDAERTILDRGVGEQDHLQRLWTPYRMTYLAEAPMKRGPNSS
GKSEQPFTDIPQLTDEDGLVVARGELVYAVLNLYPYNPGHLMVVPYRRVS
ELEDLTDAESAELMSFIQKAIRVIKNVSRPHGFNVGLNLGTSAGGSLAEH
LHVHVVPRWGGDANFITIIGGSKVIPQLLRETRQLLATEWAKQS
>MAP2257 hypothetical protein
MAVFGSVARRSGSGGRAASVVTCQPSERKIMSATTIDRTTGRDGLLRLAM
RADAAISGLVGLAGIPLVGWLAEVSGTTTAFEYGMSAFLIGYGVLVFGLA
ALPSVRRAGMAVIIGNLLYTAAAVVLVLADVFPLTSTGVVLNLAAGVYTL
VFAELQYFGWRRARA
>MAP0285c hypothetical protein
MERRTALKLPLLLAAGAAVARAPRASAEEAGRWSPERANRWYQAQGWLVG
ANYIPANAINQLEMFQPDTFDPRRIDTELGWAQFYGHNTARVFLHDQLWA
ADQRGFQTRLGQFVDIAARHRIKPLFVFFDSCWDPQPRAGRQRPPRPGVH
NSGWVQSPGAERLGDPRYIPVMRDYVTSVMTQFRNDDRVLGWDLWNEPDN
PARQYRNVERSDKEQLVANLLPQVFRWARAVDASQPLTSGVWRGDWGQPQ
GRSAISDIQLANADVITFHSYAEPAGFESRINELTPLGRPILCTEYMARP
RGSTVESILPVAKRHNVGAINWGLVAGKTQTYFPWESWDHPYTSVPKVWF
HDLIRPEGRPFQDIEALTVRKLAGSPT
>MAP1058 hypothetical protein
MCHMASTFTDSKSELMRRAAEQLTALQGRDAAEPPLTTRQKLALTCRALF
DAGHDSGLAGQITARAEADGTYYTQRLGLGFDEITDGNLLLVDEDLNVLE
GDGMANPANRFHSWIYRARPDVQCIVHTHAFHVAALSMLEVPLVVSHMDT
TPLYDDCAFLADWPGVPVGNEEGEIITAALGDKKAVLLAHHGHVIAGASI
EEACSLGILIERAAKLQLAAMAAGTVKQLPEELAREAHDWTLSPQRSRAN
FAYYARRALARHPDALTS
>MAP0879c hypothetical protein
MSYIDCFSRVVARRRMRALPLSNATLPLHSQRIDVPTYDRSALQRGVVHI
GAGNFHRAHQAVYFDDLACSGISDQWGVSAISLRSHDVKDLLSAQDGLYT
VVQRGHDHQTARVVGSIGSVHYAPNDGAAVRAALTDPRTRIVSLTITPNG
YFLDPVTREFDADHPDVRADLVASGCYGTAWGYLTEALEQRRRAGIAPFT
VLCCDNIPDNTQAARTALVSFAALKNPGLARWIDRHVAFPSTMVDRITPQ
TSDSEREFVERTFGVADNWPVLTEPHSQWIIEDAFSDGRPPLDEVGVQFV
TDVSDHKLIKTRLLNGTHIALACLATLAGYRRTDEAMKDPVIFDYVERLM
RDEVQPLLPPVPGMNTPDYRDTLLTRLGNPRMSDQLSRLARRGSTKIASF
LLPSLHEAIAQDRPHTLLMLALAGWARYLRGYDLNGRKIRIDDPEADLLT
KLANMAGNNPDPLLRHEILADLRLVPGFSERLGEMVASIDEHGVIATLRQ
SLQDDSRELVAR
>MAP3187 hypothetical protein
MSNDDLALALLLADRADAVTSARFGALDLRIDTKPDLTPVTDADRAAEAE
LRAVLARQRPNDSIVGEEFGGDTVFTGRQWIIDPIDGTKNFVRGVPVWAT
LIALLHDGVPVVGAVSAPALARRWWAAHGRGAFVAVNGAAARRLSVSSVA
QLDSASLSFSSLSGWAQLGLRDRFLALTDAVWRVRAFGDFLSYCLVAEGA
VDIAAEPEVSVWDLAALDILVREAGGTLTGLDGTPGPHRGSAVASNGLLQ
RQVLERLAL
>MAP1945c hypothetical protein
MMDDGSMASDDAASSLAPGRLQLPAMQVLVAPDCYADSMSALEASAAIAT
GWTRSRPGDRFIVAPQSDGGPGFVEVLASRLGETRRLRVSGPLKAMVEAE
WVFDRGSATAYLECAQACGLALLGGPPTPETAMAAHSKGVGQLIAEALRV
GATRIVVGLGGSACTDGGQGMIAELGGLDTARRQLGNVELIAASDTEYPL
LGPWGAARVFGPQKGADTATVAALEVRLQAWALILEAVAGRDVSAEPGAG
AAGGIGAGLLALGGRCESGAAIIAEHTRLSDDLDTAELIVTGEGRFDEQS
LHGKVVGFLADAARPRGIPVIVLAGQVDLDTATVRSAGIMAALSMAEHAG
SARLAQADAANQLMGLASVAAARLGNSGPSRYR
>MAP3804 hypothetical protein
MDRRSMMLMSGIGMLGAAMRLPGAWATPPAPEAPPSAGGGPYIFADEFDG
PAGSPPDPGKWTIQTWQDDVFPPVAGIYRDDRRNVFQDGNSNLVLCATQE
MGTYYSGKLRGNFRSMINQTWEARIKLDCLFPGLWPSFWGVNEDPLPDGE
VDIFEWYGNGQWPPGTTVHAASNGKTWEGKSIPGLVDGNWHTWRMHWGEE
GFEFSRDGAEYFKVPNKPIHVAGGAPDDFRWPFNNPGYWMTPMFTLAVGG
VGAGDPAAGVFPSSMLIDYIRIW
>MAP4145 hypothetical protein
MLARYLKAQLVVLLCGGLVGPIFLVVYFTLGLGSLLQWMFYVGLLITVAD
VLIALALANYGAKSSEKLAALEQNGVLALAQITGITETGTWVNNQQVVKV
HLHIAGPGLVPFDSEDRVIASVTRLGNLNARKVVVLVNPTTNEYRIDWER
SALVNGLVPAQFTVAEDNTTYDLSGQAGPLMEILQILKANNIPLNRMVDI
RSNPALRAQIQAVVRRAAQQQAPAGQPAPAGQPATGQPSAPVAAPPGPSI
AARLQELNDLHASGALTEQEYNSRRAAIIAEI
>MAP2510 hypothetical protein
MSCVFCAIVAGEAPAIRIYEDDDYLAILDIRPFTRGHTLVLPKRHSVDLT
DTPPETLAGMVTLGQRIARAARSTELADATNIAINDGSAAFQTVFHIHLH
VLPRRNGDKLSVAKGMLLRRDPDRDATGQILRLALARIDANQPD
>MAP3227c hypothetical protein
MRQSPSASARRSHHGDGKHSPAPTGSLPCNHVTLTSPAAASAGDLPARPG
LRTVAAGSMIGTTIEWYDFYLYATASALVFKPLFFPNISPSAGTLASFAT
YAAGFGARPLGAVLSGHFGDRLGRKTVLVAALLVMGLVTTAIGALPTYAE
AGLAAPALLASLRVVQGLAVGAEWGGAAVLSVEHAPPGRRGLFGSFTQLG
SPAGMLLATSVFFGVRKATGPAAFLGFGWRIPFLLSIFLVAVGLFVRLRL
TDAEVFDRLRSRDELARLPIVQVLRTDARNVVITTGLRLSQIGLFVLLTT
YSLSYLQDSFGKGSGVGLVAVLISSALGFLSTPGWALLSDRVGRRPPYLF
GALVSVVALVLFFVAAGTGSAVLVVVAIVFGVNVVHDAMYGPQAAWFAEL
FDTRVRYSGSSLGYHIGAVLSGGFAPLIAASLLVAGGGRPWLIVGYFAVL
AAITVGAACAARETRGEPIG
>MAP1688 hypothetical protein
MVRINGLDTEWHDDDLAAVAGSAADGVLVPKVETAQQVQALDRALDTLGA
AASLQLWVMMETPRAFLRAEEVASASDRLAGLVVGTNDLVNELHGRHVAG
RGPVVPALGLALLGARAAGKVVLDGVFNDITDDAGFRAEAAQGREMGFDG
KTLIHPSQIAPANELFGPSQQELADARKVVSAYQQAQAAGTSVITVDGRM
IESLHVRDAQRILALADLISERDAAS
>MAP1472c hypothetical protein
MNVINGMPAHALLVHFVLVLVPLTALLDIVCGLWPAARRGQLMLLTVILA
VVTMALTPITIDAGGWLYDQRADPSPILQEHATLGSAMTYFSAALLAVAI
VLALLGLIERRSDKRRLLTRGVVAVLALGIGIASMVQIYRVGDAGAQSVW
GGEIAHLKKAHPG
>MAP1632c hypothetical protein
MPQRLGLAGLCLGTALIIMEANVLNVAIPSIRQALHASPAQSLWIIDAYT
LVLAALLLSAGRLGDRIGARRCYLLGLAVFSIASVLCALAASSAELIAAR
TIQGVGAAVLIPAPLGLISAMFSDLTARAKAVAVWVTIGGVGFAAGPLIG
GLLVSTFGWRSIFLINIPAAAIIAVMVRLTVAEASRSPLPFDYVGQALAI
VGLSAVVFACVESSALAWMSPFVLLPAVAAALILGLFVIDQRHRGRAGAW
VLLPVELLNNRPVNAGLMSGFVYNFTLYGLVLVYSYVFQSARGYSPVQTG
LAFAPLTVAALVTSLPAGRFVAAHGARRGIMIGMALSAIGLCALAFDAQR
MPFVVLSIAFGIFATGLSLSATGQTMAVMANASDQYKNTASSMLNTARQT
GGVIGVAALGAITSRDLLASAPVALTIAAAACLVAALGVATLIARHARTH
DSDQH
>MAP1356c hypothetical protein
MTGRSTTAADPLGPDSLTWKYFGDLRTGMMGVWIGAIQNMYPELGAGVQE
HSILLREPLQRVARSVYPIMGVVYDGDRAPQTGRQITSYHHTIKGTDASG
RRYHALNPDTFYWAHATFFMLIIKVAEYFCGGLTEAEKRQLFDEHVQWYR
MYGMSMRPVPKSWEEFQEYWERTGREVLEINQATLDIFRMRIPKPKFVLI
PTPLWDQLFRPLVAGQRWIAAGLFEPAVREKAGMHWTPGDEVLLRLFGKI
VELAFLAVPDEIRLHPRALAAYRREQGRLPSDAPLVEAPPFMAPPRDRRG
LPMHYFPPPPRLPFDPALQPARALMERAGSLVHTTLSLAGLRAGRGRPKR
PVRAA
>MAP1689 hypothetical protein
MGQTAAMAEADRTWMVDTLLALLQTPSPSGRTDAVMQVIGDILDDFGVPF
TLTRRGALTAELPGESATTDRALVVHSDTIGCMVRGLKDNGRLELVPVGT
FSARFAAGARVRIFIDDPEEFITGTILPLKSSGHAFGDEIDTQPTGWEHV
EVRIDRKVSCRDDLVRLGLQIGDFVALITSPELTADGFVVSRHLDGKAGV
AIALALAKNFAENKVVLPHRTTIMVTITEEVGHGASHGLPPDVAELISVD
NAVCAPGQHSLEDGVTIPMADMHGPFDYHLTRKLCRLAQDRGIPFARDIF
RYYRSDAAAAIEAGANTRAALVGFGLDGSHGWERTHIDSLEAAYNLLHAW
LQTPLTFAKWDAKPTGSLRDFPSSKQPAPSERWVPLSRGDYESPGDASPG
THWPPSEGPQS
>MAP0835c hypothetical protein
MTATPRACNRDRVALQAVHFFMADMEAGMGPFLGVLLQSRGWTTGAIGAA
MTLGAIVGMVTVAPAGALVDATRHKRGCVIVVGLAAVAASAVILTSRQFW
TVATAQAVMCISGATIAPAMIGITLGVVGQAAFTRQNGRNQAYNHAGNMA
GAALGGVAGWVFGYAGIFWLAAGFAVATIAAVLAIPAGDIDHHVARGEAR
AAGEAPVKAMRVLARSRPLLVLAAAMVLFHLGNAAMLPLYGLAVVATHAN
PFTTVASTVVVAQAVMVPASLLAMRIAATRGYWPAILIALTALPVRGVLA
ASVITSWGVIPVQVLDGIGAGMLSVAVPGLVARILDGTGHINVGQGAVMA
AQGLGGALSPVLGGAVAQHLGFRAAFLLLAGLSLGALIIWVTFAPMLRRA
ARLPAAPSDRAGAPPTATADNQAK
>MAP0619c hypothetical protein
MLSNAMDEAPYAAAKTPPHAPTGQPGATEREYPDKLDAALLRISGVCILA
TVMAILDVTVVSVAQRTFIDQFSSSQAVVAWTMTGYTLALATVIPITGWA
ADRFGTKRLFIGSVLAFMLGSLLCALAANVLQLIVFRVVQGIGGGMLLPL
GFMILTREAGPRRLGRLMSILSIPMLLAPIGGPILGGWLIDTSSWRWIFL
INVPIGLLTVALAAVVFPRDHPARSETFDAVGVLLLSPGLATFLFAVSSI
PGRGTVADRHVLIPAAMGLTLIAGFVGHAWHRADHPLIDLRLFRNPVLTH
ANVTMLVFATAFFGAGLLLPSYFQQVLHQTPMQAGVHMIPQGLGAMLTVR
LTGPLVDRQGPGKVVLVGIALITAGLGAFAFGVARQAPYLPTLLAGLAIT
GLGMGCTMMPLSVASVQALAPHQIARGTTLMSVSHQVGGSMGTALMSMIL
TNQFNRSPNIVAANKLAALHQKAAAGGTPIDQSAIPRQSLAPGFWGNVLH
DLSHAYTAVFVIAVALVVCTIIPASFLPKKPATETAGK
>MAP1980c hypothetical protein
MKVVIEADGGSRGNPGPAGYGAVVWTPDRGTVLAENKQAIGRATNNVAEY
RGLLAGLGDALKLGATEAEVYLDSKLLVEQMSGRWKVKHPDLIELHAQAR
SLAARFDRISYTWIPRERNSHADRLANEAMDAAAAATGTAEKPGPAEQSD
PTAADAGKTAAAPSPAAPGWTGARGTPTRLLLLRHGQTELSVQRRYSGRG
NPALTETGRRQADAAARYLAARGGISAVFASPLQRAYDTAAVVAKALGLD
VTVDDDLIETDFGAWEGLTFGEAAERDPGLHRRWLRDTSTAPPGGESFDS
AAERVSRVRQKIIAAQQGSTVLVVSHVTPIKMLLREALDAGPGILYRLHL
DLASLSIAEFYSDGAASVRLVNQTGYL
>MAP0239c hypothetical protein
MTFLDAAAQSRTLRRARSDLVDGFRRHELWLHLGWQDIKQRYRRSVLGPF
WITIATGTTAVAMGGLYSKLFHLELSVHLPYVTLGLIIWNLINAAILEGA
DVFVANEGLIKQLPTPLSVHVYRLVWRQMILFAHNIVIYVVIAMIYPKPW
SWADLSVIPALALIVLNCIWVSLCFGILATRYRDIGPLLFSVVQLLFFMT
PIIWNDDTLRQQGAGRWSSIVELNPLLHYLDIVRAPLLGSHQELRHWAVV
LVLTAVGWLLAAFAMRQYRARVPYWV
>MAP3031 hypothetical protein
MQTDPVATPDVGAGTRWSIMVVSLLATASSFLFINGVAFLIPSLQGARGI
RLDEAGLLASMPSWGMVVTLVAWGYVLDRVGERVVMTTGSALTALAAYAA
SSAHSMVLIAAYLFLGGMAAASCNTAGGRLVSAWFPPHQRGLAMGIRQTA
QPLGIALGAMVIPELAEHGPQAGLRFTALACVFGAVASVIGIVDPPRKPR
ASASDQELASPYRGSLTLWRIHAVAGLMMMPQTVTVTFMLVWLIRNLHWS
VAAAGALVTLSQLLGALGRVAVGRWSDRLGSRMRPVRYIAAAAVLVLLLL
AWADYLNSRWQAGLMVAISVIAVLDNGLEATAITEFAGPYWSGRALGIQN
TTQRMMAAAGPPLFGALISAAKYPPAWLLCALFPLAAMPLVPTQLLPPGL
ETRARRQSVRRLRWWRAVRSHALPDIVRRPGQPG
>MAP0735 hypothetical protein
MGVSAYSGATATTRFVSPTPRPPSAVVAFRVAPGLCSKRDIKYFTLGGPL
LEAEKILITGATGKIAFPIARALAAAGNEVWGVARLKDPAAREKLAAAGV
KPVVLDVGAGDFSGLPEDFSYVLHAAVDTGAGDWNRCLETNAQRSGELLY
HCRTAKGFVFCSTGSVYAYQGRRPLTESDPPGVPLRANYSISKVAAEAVC
TWVARHFQIPLTIIRICSTYGPQGGAPADRLEAMLAGKPIRLHPDKPNNY
NPIYEDDYVELGIRAMEVAATPPVVVNWAGSETVSVEEYCTFMGQLIGVE
PKFEYTERAHTPLWPDVTRMHAVLGRTKVPWRDGFRRMIEARHPEITLRQ
ADPATS
>MAP0133c hypothetical protein
MSPEVAERLRPGPVAARSPLLAHPRWFVPGKFRVSHQSMGIRRRRRKWAR
KEIRVADPAMRQTIMGTAIGNFMEWYDFGVYGYIATTLAEVFYPGKSVSG
LHLIATFSTLAAAFVVRPLGGFIFGPLGDRIGRHRVLVVTILMMTVSTTT
TGLLPTYSSIGIWAPILLVIARIFQGLSTGGEYVGAMTYLVEQAPDHKRG
MMVGFLPMGNLVGFVLAGMLVTGLQTWLPDQDMLSYGWRIPLLLGLPFGL
VALYLRLRLEESSAYQSANDSPHTPGGQGRQQIRRTVAQQWRPMLICAAL
VLTSQVADFMLTGYLPTYLRLFVRVGHTAGLVMIVTTLAILMATVVAVAS
LSDRIGVKPIMWTGCALLIGASVPAFLLIRFGGVYPVIFIGVLLIGLMEL
CFDSTGPAMLPALFPTNVRYGALAISYNISISLVGGVTPLIAQALVSATG
NVMVPAYMLIFGGAVGAVTLLFTPEVAGKPLPGSGPAVETEREARALADD
VR
>MAP1657 hypothetical protein
MRGSKILITGPTGQIATPVARALAADNEVWGIARFTDAAARESLEQAGIR
CQTVNLAAGDFTGLPSDFDFVLNLAVAKSGSWDKDLAANAESVGVLMAHC
RNAKAFLHCSSAAVYDPPGDEPRSERSALGDNHKPLFPTYSISKIAGEVV
ARSVARIVGLPTTIARLNVPYGDNGGWPFYHMEMMLSGIPIPVPPGEPAR
YNPIHEDDIIATIPKLLDVASVPATTVNWCGDQTVSLQEWCGYLGSLVGR
EPVFEPSERALRGNPTTVDRMHELIGGTTVDWRDGMRRMAAKFHPELVGV
>MAP0464 hypothetical protein
MTGVVRLTLVSHGMTDAMADGRFPADEPLNELGRRQVQPIAAQLGGAASY
FVAPELRTRQTAGLLGCDGAPEPLLADLDCGRWRGRALQTVDEADLAAWL
SDPAGAPHGGESIADLIERASRWLNALTANPLHTAAVTHPAVIRAALLAA
LDADPASFWRVDVAPAACVVMHHRGGRWTLRL
>MAP2242c hypothetical protein
MKIRRLIMLRHGQTEFNAGSRMQGQLDSELSELGRAQAVAAAEVLGKAQP
LLIVSSDLHRAYDTAVRLGERTGLPIRVDRRLRETHLGDWQGLTHTQVDT
QAPGARLAWREDATWAPHGGESRVDVAARSVPVVAELVAGEPEWGDPDGP
DRPVVLVAHGGLIAALSAALLKLPVPNWPMLGGMGNASWVQLSGHSDDAA
DEFDDIRWRLDVWNASAQVANDVL
>MAP2641c hypothetical protein
MIHVPVTTGPRRWFKRASPRRARNATAGIRPENRPDVTDAVGTPPTRAVG
TPPTRAVGTPPPYAVGTPARRRAGLGSGVMRVLVTGATGYVGSRLVTALL
AGGHDVLAATRNPLRLKRFGWFEEITPVPLDAADPASLRSAFAGCGPVEV
VYYLVHAIGQPGFRDADKAAAANLAAAAKDAGVRRIVYLGGFVPAGEVLS
EHLTSRAEVAEALAVPDGPELVWLGAAMIIGAGSTSFEMMRYVGDRFPLM
PIPSWMDNPIDPISIRDVLHYLLAAADRDLVPAGAYDISGPDTTSYRRLL
KTYARLSGRWHTGLPVGRVDTGLASLVTGVALPVPPGLAGDLVESLDHPM
VASGSDLRQRVPDPPGGLLGVDDAIARALDDRSYRRPRPVNALADPHHLA
DTDPGWAGGDARRIRQLAGRVTPAIARPTLGLVNKVPGPVAGAVRTGLDI
LIALTPKVRPA
>MAP4021 hypothetical protein
MGEQTRVHVVRHGEVHNPDGVLYGRLPGFHLSEAGRAQAAAVAEALAPRD
IVAVIASPLQRAQETAAPIAARHGLAVDTDPDLIESANFFEGRRISPGDG
AWRDPRVWWQLRNPFRPSWGEPYTEIAARMETAVDKARARGTGHEVVCVS
HQLPVWTLRLHLTGRRLWHDPRKRECGLCSITTLVYDGDRLVDVEYSEPA
AP
>MAP0934 hypothetical protein
MAHRDRDSTDELLQELLWTYGPCGQEDAVRAVIARELEPVVDDLWTDDAG
NLIGYVAADASRGDAGAPDHRHRTAAIPGTATRVMAHMDELSMIVKRVEP
DGTLHLTQLGTMYPGNFGLGPVAVLGDNETLTAVLTLGSEHTTKESSRIW
ETKPDQGDRSLDWHHVYVFTGRSTDELAAAGVHAGTRVCVDRSKRSVVEI
GDYVGAYFLDDRAAVTALLNAARMLRERGQRPADDVYLVFTANEEIGGVG
GAYASATLPGDLTLALEVGPTEREYTTSVAGGPIVGYSDALCVYDKSVAD
RLLKIATDRGLTPQPAALGAFESDASHAKASGLTARAGLLCLPTLSTHGY
EVVARSAIDDMAAIVVDFVLQPGRPNG
>MAP0494 hypothetical protein
MMRPLSRHWCRVMGPLSHHSKEQLVRVLVTGASGGIGSAVVKELLAAGHH
VIGLARSEASAATVSGLGAEPLRGDIADLDVLQKAAVDTDGVAYLAFSHD
FSDVGDAIADEARAIDALGAALADTGKPLVLASGTPARPGSVSTEDDPFI
ADGPLAGRGRTGQAVVALAGRGVRSAVVRLPRAVHDAGGRYGLVGILIQL
ARQRGVSSFAGDGTQRWPAVHRDDAAALFRLALEQAPAGSVLHAVGDEGV
PLRAIAEVIGRRLGVPVESAPADTFGPLGQVFAVDQPSSSALTQRRFGWQ
PVGPGLLDDLETGVYPE
>MAP1423 hypothetical protein
MIGLGGVWVLDGLEVTMVGNVSARLMEPGSGIALNPAQIGMAAAIYIAGA
CSGALFFGHLTDRFGRRNLFILTLALYLIATVATAFAFAPWYFFLTRFFT
GAGIGGEYAAINSAIDELIPARVRGRVDLVINGTYWLGSAAGAGGALILL
DTSNFAADLGWRLAFGIGAILGIFVLLVRRNVPESPRWLFIHGREEEAEH
IVGEIEEAVQQQTGRPLPEPQGKALRIRQRTAISFREIAAVAFKLYPRRA
VLGLALFIGQAFLYNGVTFNLGTLLSQFYAVPSGMVPVFFVLWALSNFAG
PLLLGHLFDTVGRKQMITLTYIGSAVVVVALALVFLTQAGGVWAFIGVLI
VAFFLASAGASAAYLTVGEIFPMETRALAIAFFYAMGTAIGGITGPLLFG
QLIDSGQRDHVVWSFLIGAVVMAAAGLVELWLGIAAEQRPLEDLALPLTV
DDAEDTEPQGDSAPVD
>MAP2292 hypothetical protein
MMSAHEVMSAHETERGAMQSKPWWSSAVFYQVYPRSFADSDGDGVGDIDG
VTAHLDHLEQLGVDAIWLNPVTVSPMADHGYDVADPRDIDPLFGGMAAIE
RLIAAAHRRGIKITMDVVPNHTSSAHPWFQAALAAGPGSDARQRYFFRDG
RGADGELPPNNWTSVFGGSAWTRVLEPDGNPGQWYLHLFDTEQPDLNWEH
PDVFDDFEKTLRFWLERGVDGFRIDVAHGMAKPAGLPDSPDLESKVLHHS
DDDPRFNHPSVHDIHRDIRKVVNDYPGAVTVGEVWVTDNARWAEYLRPDE
LHLGFNFRLTKIDFDAVQIHDAIQNSLAATALQEATPTWTLSNHDVGREV
TRYGGGEVGLRRARAMAMVMLALPGAVFIYNGEELGLPDVELPDEVLQDP
TWERSGHTERGRDKCRVPMPWSGQAPPFGFSSRTDTWLPMPKEWAALTVQ
KQRDDPDSTLSFFRRALELRRRRVEFDGDGVELAGGDRRCGDVPASRRAG
VRPQQRRAARAAAAGRTGAGERAAAGREAAPGRGGLAGLAARPGRAFRRP
LLYMLTHGLLRVDSDRRRPGGHRGGGPAARSRGGQHRLDRPRLRRRRHRP
EVAIGVQQHARRAVSRIFQRLQVISVLRGTAHAAAGNRRRRNLRAGAGGR
TPAVGHRAAARAGRHRHHHRHRAVPVGPAVADRDRAARDLLPERDPGRRG
GAQKALPPRPGGDSGGSRAGSRKARAPIAFRGDGGRVRFIALVDDRAAQP
AAPAGRKGDQLLPEPAEIRCLLR
>MAP2081 hypothetical protein
MADSPAATSPTLVAKQTNVRRPRWDRDHPRYKWVALSNTTLGMLLAMINS
SIVLISLPAIFRGIGLNPLAPANIGYLLWMLMGYLVVTAVLVVFVGRLGD
MFGRVRIYNAGFAVFTVAAIALSFDPFPLTGGAVWLIGWRVVQGVGGAIL
MALSAAILTDAFPSNQRGMALGFNMVAAVAGSFLGLLFGGLLSEWDWRAI
FWVGVPVGVLGTVWGMRSLHELGVRTPGPLDWPGTVTFGVGLTVVLVGIT
YGIQPYGGHPTGWTNPWVLGSIAFGLLLLIVFCFIELRAPQPMVNVRLFR
SAGFGMGNLANLMSSSGRGGLQFMLIIWLQGIWLPLHGYRFESTPLWAGI
YMLPTTIGFLIAAPVAGWLADRFGARPFAVAGMLLMAVTFIGLLMIPVNF
DYRVFALLIFLNALGGGLFAAPNTAVIMSSVPPRDRGAASGVRSTFFNAG
SALSIGVFFSLMVVGLAGTLPHALSSGLQQQGVSAAVAQDAAALPPGGQP
VRGVSGLQPDRRIARTVARTAATRRQCRDTDGRTRRVAEDPRRSAAVRRV
RWGRRRAS
>MAP1500c hypothetical protein
MTELAPSLIELARRYGIATDYEDWTGRRVRVPETTLTAVLAALGVAAGTE
QERNDALTAKLRAYWARRLPATIVGRTGEQIRFWGHVTHGDPAEVWVRLE
DGTVRDGVEQVDNFTPPFDLDGRWVGEASFVLPTDLPLGYHRVWLRSGGS
EASAALIVTPDWLGLPERLGARRAWGLAVQLYSVRSRQSWGVGDLTDLTD
LAVWSAFRHGADYLLVNPLHAAGFSGPANRMEPSPYLPTSRRFVNPIYLH
VEAIPEFAELTKRSRVRRLRADVQTHAAGLDAVDRDSSWAAKRTALQWLH
QQPRSAGRQLCYAAFRDREGRALDDFATWCALAEEYGPDWHSWPETLQHP
DAPGVADFVEKHSEAVDFHRWLQWQLDEQLAAAQSQAIRAGMSLGIMHDL
AVGVHPNGADAWALQDVMALGVTAGAPPDEFNQLGQDWSQPPWRPDRLDE
HEYRPFRALIRAVLRHAGGVRIDHIIGLFRLWWIPRGSAPTEGTYVRYDH
EAMIGIVALEAHRAGAVVVGEDLGTVEPWVRDYLLLRGLLGTSILWFELD
RDGNGGPLPAERWREYCLSSVTTHDLPPTAGYLAADHVRLRDSLGLLTRP
VADELESDRADLAAWMAELRRVGLLADGETDSEQVILALYRYLGRTPSRL
LGVALTDAVGDRRTQNQPGTTDEYPNWRVPLTGPDGRPVLLEDVFTDRRA
AALAEAVRAAIAP
>MAP1768c hypothetical protein
MTALDTGAGRRTPKNPSPWRRHAWAGRLFVAPNMVAVAVFMLFPLGFSLY
MSFQRWDVFTPPKFVGLKNFTDLFSSDPLFLIAIRNTVIFTLGSVVPTVA
ISLAVAGVLNQKVRGIGIFRTIVFLPLAISSVVMAVVWQFVFNTDNGLLN
IMLGWVGLGPVPWLVEPRWAMASLCIVSVWRSVPFATVVLLAAMQGVPET
VYEAARIDGAGEIRQFFAITVPLIRGSISFVVIISIIHAFQAFDMVYVLT
GPSGGPESATYVLGIMLFQHAFSFLEFGYASALAWVMFAVLLVLTVVQLR
VSHRRSLETSRGLK
>MAP2323c hypothetical protein
MPRTEIIAALLALSSALCVATGDVLQQRAAHCITDRELGPVELFAGLLRS
QRWWWGAVLLLASIGLQAAALGGGSVLLVQALLMFSVLFALPINARLSHR
AVTAGEWVWAALLTAAVTVVVVVGNPQTGHAGAPLRTWAVVALVLGPLLA
GCVVAGRARGGAVAAVLFAFVSGSLWGVFAVLAKEVMARLGQGVGVVTRT
PELYACILVALGGVVWSQAAFRAGPLTASMPTLQVSQPVVASVLGVVVLG
ETLDTGRAGTVALLVAVVVMTAAIIKLARVEAVATRDRAEAQLHQPA
>MAP3739c hypothetical protein
MSIGAQMLDHARSNQTWTTAVAALACFLMTLDITVVNVALPSIQKDLGAS
LEGLQWVVNAYVLAFAALLLTVGSVSDRLGRKRLFLTGVAVFTVASALCV
ASRTESPLIAARALQGIGGALVFGTCLALIADAYTDAEEEQRRKAVGLAM
AAGAAAATLGPLIGGGLVEIGTWQWIFAINVPVGVALAICTALKVREPHA
PHAADNSRVDSVGAVVAIVVLFALNYGLLTGAAKGWGRGDVLAALAIGLA
GGVGFVLHQLRRGSEATLDLTLFRIPTFLAAIVLGFTVRALSFGVFPFLI
LWLAGAHGRSAFDIGLILSALALPLMVCAVLSTSVARAVGVRATMSIAMV
ITAAGLFLATLIRGDGSWTTILPALAVLGVGNGVAMPHLMNLAVDVVPSN
KAGMATGAANTAFPLGTATGVAAFGVVLSSFVHAKVAASGVIPVHSADSV
ASAIVAGVLRFPTQAMTAFATSAFTDALRLIFGIAGCAALVAAGLSGALI
THRPRVAAESPESVTE
>MAP1767c hypothetical protein
MGSSEAVIKRTVLRAATVYAALLGIAWCALFPIAWAVSGSLKTEGEVSEP
TLVPARPRWSNYTEVFALMPIGRMFVNTVLYAGCVTAGHVFFCSLAGYAF
ARLDFRGRNTLFVVYLGTLMVPLTVTVIPQFLIMRTLGWVDTPWAMIVPG
FFGSAFGTYLMRQFFRTLPTDLEEAAILDGCTPWQVYWRILLPHARPAVM
VLAVLTWVNVWNDFLWPLLMIQRSSLATLTLGLVRMKGEYVARWPVLMAA
SMLIMLPLVIIYAIAQRSFVRGIAVTGMGG
>MAP3860 hypothetical protein
MHTRRRWAAGREAGSSGMDQLWANRAASCEAAVTQRHLRRLWGLPATQLG
VVAWPPARRERLFGTWHYWWQAHLLDCLVDAQQRDPQPQRRARINRQVRS
HRLRNNLSWTNSYYDDMAWLALALERAARIAGVHRRKALPKLANQFLRAW
VPEDGGGIPWRKSDQFFNAPANGPAGIFLARYGDHLKRAEQMANWIDETL
IDPETHLVFDGIKAGSLVRAQYTYCQGVVLGLETELAARTDGPARQRHAA
RVRRLVAAVAEHMAPAGVLAGAGGGDGGLFAGVTARYLALVATTLPGSVA
EDVAARDTARRIVLASAKSAWDYRQTVDGLPVFGPFWDRDAQLPTAGGKQ
AEFVEGAVTASEIAERDLSVQLSGWMLMEAACNVSAESSHENRSAL
>MAP3850c hypothetical protein
MDVVALRDPVSSVVARFVPRAGMIGTSLRDGEVELLGQRRGLDAYVADGK
TMGIPILYPWANRLGGNTYAAEGAAVTLTPGRNGVRADPNGLPLHGVLAG
YPGWRVTAQTANELTAQLDFGADPKLLAGFPFPHLLQVAVRLAGRTLTLT
TTVTATGDTAVPVCFGFHPYLQLPGVPRAEWVIETPPLRHLGLDDHGLPT
GESELQPARREPLRDKTFDDAYDRVDEGAVFAVSGGGRRLEVCFEHGYPA
AQIFTPPAKPGEEAVICFEPMTAPTDALRRGGYRRAHPGRRAATRFSIRV
>MAP2692 hypothetical protein
MTVEGDRSSAATGRAPAQGRPRGEAPDRNLALELVRVTEAGAMAAGRWVG
RGDKEGGDGAAVDAMRELVNTVSMRGVVVIGEGEKDHAPMLYNGEEVGNG
DGPECDFAVDPIDGTTLMSKGMPNAISVLAVADRGAMFDPSAVFYMNKIA
VGPEAAHVLDITAPIAENIKAVAKVKALSVRDMTVCILDRPRHAKLIDEV
RETGARIRLITDGDVAGAISACRPHSGTDMLAGIGGTPEGIIAAAAIRCM
GGAIQAQLAPRDDEERQKAIDAGYDLDQILTTEDLVSGENVFFCATGVTN
GDLLKGVQYYPGGCTTQSIVMRSKSGTVRMIEAYHRLSKLNEYSAIDFTG
DSSAAYPLP
>MAP3733c hypothetical protein
MTATSSTTQSSRRIDVRMSARDLINIGVFGALYIATVFAINVFAFINPLV
MLVALAVSMIAGGVPFMLFLTRVRHAGMVTVFAIITAGLLALTGHPPICF
VITVACALVAEVVLWLGRYRSRTMGVLAYAIYAAWYIGPLLPIFYARDEY
FSSPGMAQMGPRYLEEMERLLSPAVLIAFDLSTVVFGLIGGLLGVRLLRK
HFQRAGLA
>MAP2696c hypothetical protein
MAKRPDTLAWRYWRTVVGVLAAVAVLVIGGLTGHVTRADDLSCSVVKCVA
LTFDDGPGPFDERLLQILKDNDAKATFFLIGNKVAANPAGAKRIADAGME
IGNHTWEHPNMTTIPTEDIAAQFTKANDAIHAATGRTPNLYRPAGGLSNP
VVRQTAGQLGLAEILWDVIPFDWANDSNTAATRYMLMTYIKPGSVVLFHN
TYSSTVDLVYQFIPVLKANGYRLVTVGELLGPRAPGSSYGSRENGPPVNG
LRDIPASDIPKLPDTPSPKPMPNFPITDIPGQNSGGPNNGA
>MAP0815c hypothetical protein
MSGRRQGDPGRVAAKPGRRPGNSAAAPHPGAANYPAGDTGDRRTRRPPPM
PSANRYLPPLGHQPQPDRGAAAPPRGPVAGERITVTRAAALRSREMGSRM
YWMVQRAATADGADKSGLTALTWPVVANFAVDAAMAVALANTLFFAAATG
ESKGRVALYLLITIAPFAVIAPLIGPALDRLQHGRRVALAASFVLRTALA
AVLIMNYDGASGSYPSMVLYPCALAMMVLSKSFSVLRSAVTPRVMPPSID
LVRVNSRLTMFGLLGGTIVGGAIAGGVEFVCTHLFKLPGALFVVVAVTVA
GASLSMRIPRWVEVTAGEVPATLSYRRDSEPLRRRWPEEVKNVPKKATAT
LRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHQANGWAQLAMLGM
IGAAAGVGNFVGNFTSARLKLGRPAVLVVRCTVAVTAVALAASVAGNLML
AVIATLVTSGASAIAKASLDAALQDDLPEESRASGFGRSESTLQLAWVLG
GALGVLVYTELWVGFTAVTALLILGLAQTLVSFRGNSLIPGLGGNRPIMV
EQEGARRGVGSPAVVAE
>MAP2660 hypothetical protein
MARVVIVGGHGKVALQLSAILTQRGDAVTSLFRNPDHADDVAATGAKPVV
ADIERLDTDALAGHDAVVFSAGAGGGNPARTYAVDRDAAIRVVDAAARSG
VKRFVMVSYFGAGPDHGVPQDDPFFPYAESKAAADAHLRVSDLDWTILGP
GRLTLDPPTGRIAVGRGKGEVSRADVASVAAAALADDSTIRRTIDFNNGD
VPIAVALAG
>MAP1720 hypothetical protein
MSHAGHSSGTRSGYVLSSANRNALHLASRGNLVHVFVTGGSGLTGPAVVS
ELLSAGHRVTGLARSAASADRLARLGAEPFTGSLDDLDRLREGAAAADGV
IHMAIGGDSGDLSGITKRDVDAIEAMGAALAGTGKPFVSTSGTMVMAPGR
VSEETDAPDPDAFAAYRIPGERACLGYAERGVRSSVIRLAPTVHGPGDYG
FVAMLVATARKTGVSAYIGDGTNRWPAVHRLDAATLFRLALEKAPAGTAL
HGTAESGVPLRDIAEKIGRKLDLPVVSVELDDAPAHFGSAALSSVFAADV
PASSARTRALLGWNPSHHTLLEDLEHGDYFTVAEGAGQG
>MAP0876c hypothetical protein
MTRGLVIGESLIDIVEDAEYVGGSPLNVAVGLGRLGRDVDFLTYLADDDH
GRRIAAYLKDAGVQLVSESRAAERTATARSTIAADGSADYEFDLDWRLSG
TPPVAPPLFVHTGSIAAVRDPGCLAVAALIDTYRVSATVTFDPNVRPSLI
ADRELAVSRIEHLVERSDIVKVSAEDLHWIDPDRSPEQLAQTWLGLGPAI
VAVTMADRGAVGYCAAGTARVPTRAVRVVDTVGAGDSFMAGLLDALWEAG
LLGGDRRGALRDIGIDTLTSALDAASLSSALTIARAGADLPDRAAVQAAL
RR
>MAP1137c hypothetical protein
MGRAGPGHQAPGELVTAQTGRRVAISAGSLAVLLGALDAYVVVTIMRDIM
TDVHIPINQLQRITWIITMYLLGYIAAMPLLGRASDRFGRKLVLQVSLAL
FMVGSVVTALAGHWGDFHLLIGGRTIQGVASGALLPVTLALGADLWAQRN
RAGVLGGIGAAQELGSVLGPLYGIFIVFLFHDWRYVFWINVPLTLIAMVM
IQFSLPSHEKVEQPEKVDLVGGVLLAVALGLAVIGLYNPEPDGKQILPSY
GLPLVLGAVVVGILFLLWERFARTRLIEPAGVHFRPFLAALGASLFAGAA
LMVTLVDVELFGQGVLGQDQTQAAGLLLWFLIALPIGAVLGGWIATRVGD
RAMTFVGLLIAAYGYWLIHYWRQDVLSQKHNVLGLFSVPVLHADLLVAGV
GLGLVIGPLTSAALRVVPSAQHGIASAAVVVARMTGMLIGVAALSAWGLY
RFNQIVANLTAAIPPNASLLERIAAQGTMYLKAFAMMYGDIFAATVVICI
AGALLGLLIGGRKEHAEEPEIVEPQAVSLGER
>MAP3978 hypothetical protein
MGHAPGPKPHRAVRRQIVPPALHIPESAAASVFRAVRLRGPVGRDVIANV
TSLSIATVNRQVIALLEAGLLRERADLAVSGAIGRPRVPVEVNHEPFVTL
GIHIGARTTSIVATDLFGRTLDTVETPTPRNPAGPALASLADSARRYLRR
WHRRRPLWIGVAIGGTVDSATGQVDHPRLGWRQAPVGQVLADALGLPVSV
ASHVDAMAGAELLLGMRRFLPSSPTSLYVYARETVGYALVIGGRVHVPTS
GPGTIATLPAHSELLGGTGQLESTVSDEAVVAVARQLRILPFAPANRASG
SAAGIADLLRVARAGNEQARELLAERGRVLGEAVALLRDMLNPDELVVGG
QAFTEYPEAMEQVEAAFAARSVLGPRDIRLTAFGNRVQEAGAGTVSLSGL
YADPLSAMRRAGALDARLREVARDESSA
>MAP4286 hypothetical protein
MALPPDDRRPACPAMNRRQFLGSLAASVVAGVGAARLIVDPQPRTFAQTP
AAAIPTSGPTAPQALLPPPPLSARIPLPGGGALMKIPGEGDLLALTVDDG
VNSEVVRAYTQFAKDTGVRLTYFVNGVYDSWTENLQLLRPLVDSGQIQLG
NHTWSHPDLTTLTKEQVAEQLSRNDAFLKKTYGIGAKPYWRPPYAKRNAA
VDAVAADLGYQVPTLWSGSLSDSTLITEDYILQMADQYFTPQAIVIGHLN
HLPVTHVYPQLVDIIRERNLRTVTLNDVFLKTP
>MAP2871c hypothetical protein
MRVLLSAYDSRGGVQPLAALAVQLRALGVDARVSAPPDAEFAELLARAGV
PLVAFGDSLRAMKTAATTPSEEQLRRQLDGLIAAQFDAAAAARECDALVV
AGFPPAAAAARSAAEKQGAHYVYVSYQPTILPSPHYPPPGYADAPSAAGL
IDNRLLWQLDGRYMQALFGETLNARRAAIGLPPLDDLRGYALTERPWLAT
DPTLDPWRPPGDLDVVQTGAWLLADDRRLPAELEAFLDAGAPPVYVGFGS
MPMRACRDVARVAIEAVRAQRRRAVVSRGWAGLGPIDDGEDCLVVGEVNH
QALFGRVAAVVHHGGVGTLTAAALAGAPQVVVPQGADQPYWAARVAELGI
GAAHDGPAPTVESLSAALGTALTERTRARAGAVAATIRTDGAARAARLLV
ARLAG
>MAP3989 hypothetical protein
MRPAIKVGLSTASVYPLRAEAAFEYAARLGYDGVELMVWAESVSQDIAAV
KKLSRRYRVPVLSVHAPCLLISQRVWGANPIPKLDRSVRAAEQLGAQTVV
VHPPFRWQRRYAEGFTEQVATLEAASDVMIAVENMFPFRADRFFGAGQSL
ERMRRRGGGPGPAISAFAPSYDPLDGHHAHYTLDLSHTATAGTDSLDMAR
RMGNGLVHLHLCDGNGLPADEHLVPGRGTQPTAEVCQMLAAGHFAGHVIL
EVSTSSARSATEREAMLAESLQFARTHLLR
>MAP0334 hypothetical protein
MRAMRVLITGGTGFVGGWTAKAISDAGHSIRFLVRNPGKLHTSVAKLGVD
VSDFAVADITDRVAVREALQGCDAVVHSAALVATDPRQTAEMLATNMQGA
QNVLGQSVELGLDPIVHVSSFTALFHPDLETLTAELPVVGGADGYGTSKA
QVEIYARGLQDAGAPVNITYPGMVLGPPVGDQFGEAGEGVRAALQMHAIP
GRSAAWLIVDVRDLAALHAALLEPGRGPRRYTAGGHRVPASDLAALLGEV
AGTPMVAVPIPDTALRVAGAVLDRAGRFLPFETPFTWAGMQYYTQMPASD
DSPSERELGITYRDPRQTLADTFAALSATS
>MAP0044c hypothetical protein
MIPTRMQSSAPVEIWRSVRALPDFWRLLQVRVASQFGDGLFQAALAGALL
FNPDRAADPLAIARAFTVLFLPYSLLGPFAGALMDRWDRRLVLVGANVGR
LVLIAAIGTILAVRAGDLPLLLGALFANGLARFVGSGLSASLPHVVPREQ
VVTMNAVATAAGVVAAFLGANFMLVPRFLFGAGDRGAAAIVFLTVVPVSI
ALLLSWRFAPRALGPDDTWRAIHGPVLYAVITGWLHGARTVAQRPTVAAT
LSGLAAHRMVVGINSLLVLLLVHHLPGLEGGGFGTALLFFGAAGLGAFLA
NVLTPPAIRRWGRYASANGALAASAIVEVAGAELLLPVMVVCGFLLGVTG
QMVKLCADSAMQMDVDDALRGHVFAVQDALFWVSFIVAITVAGMVIPDDG
HAPVFALFGSVLYLVGLAVHGIVGRRGE
>MAP3492 hypothetical protein
MIADESFPVDPWHVRETQLNLNLLAQSESLFTLSNGHIGLRGNLDEGEPY
GLPGTYLNSFYEIRPLPYAEAGYGYPEAGQTLVDVTNGKIIRLLVEDEPF
DVRYGELLSHERTLDLRAGTLTRSAYWRSPAGKEVKVVSTRLVSLAQRGV
AAIEYVVQAVGDFVRVTVQSELVTNEDQPQTSGDPRVSAILENPLEEVDH
ENTDTGALLMHRTRSSALMMAAGMDHEVEVPGRVEVSTDSRADLARTTVI
CGLRAGQKLRIVKYLAYGWSSMRSRPALRDQVAAALHSARYTGWPGLLES
QRKYLDNFWDCADVEVEGDPESQQAVRFGLFHLLQASARAERRAIAAKGL
TGTGYDGHAFWDTEGYVLPVLTYTAPHTVADALRWRASTLDLAKERAAEL
GLEGAAFPWRTIRGQECSAYWPAGTAAWHINADIAMAFERYRIVTGDDDL
EAECGLPVLVETARLWLSLGHHDRHGVWHLDGVTGPDEYTAVVRDNVFTN
LMAAHNLRVAAEACNRHPDGAKALGVTTEETAAWRDAADAVKIPYDSGLG
VHQQCEGFTTFAEWDFEHNTTYPLLLHEAYVRLYPAQVIKQADLVLAMHW
QSHAFTPEQKARNVDYYEPRMVRDSSLSACTQAVMCAEVGHLELAHDYAY
EAALIDLRDLHHNTRDGLHMASLAGAWTALVSGFGGLRDDEGVLSLDPQL
PDGISRLRFRLRWRDFRVTVDVNHADVTYTLRDGPGGELTIQHAGAELEL
STQSPTTVAVRPRKPLLPPPQQPPGREPTHRRAVSRSSPGQ
>MAP0086 hypothetical protein
MTRPRMPLTTASAVTFFYALGYPIGTLAVVAMTPMAALVLRFGLAAMVLA
VWTVIAKAGWPRGARLGHVIVAGLLTQGVQFCCLYEAVQLGAPAVLCAVM
IAMNPVATAILAATFLREPLGVQRTVALVLGVLAVLAACSQRLMSTHGVD
PVIVLLFVALLGLSAGGVYQQRFCGGVDFRAMSALQNGVALLPAAGLAML
TPFGVHDGRKALVAVAAMVLLNATLAVSLYVRAINAHGAAAVAMLFAVIP
AVAAALSWIILGQRPDQGVAAGLVLGGLACWINTRAASRPRTQRDGVPAA
APAELQSVSR
>MAP3529 hypothetical protein
MTEPAKLPWSDWLPQQRWYAGRNRRLTGAEPSVIVGLRDDLDLVLVDADY
ADGSRDRYQVLVRWDAAPVSEYSTVATIGAADDRTGFDALYDDEAPQFLL
SLIDSSAVRSASGAEVRFAKEPDAQLPLEAMAHVSDAEQSNTSVIFDRDA
IFKVFRRVSSGINPDIELNRVLGRAGNPHVARLLGTYEMAGADGTPETAW
PLGMVTEFAANAAEGWAMATASVRDLFAEGDLYAHEVGGDFAGESYRLGE
AVASVHATLAETLGTSQAAFPVDNVLARLSSTAALVPELTEYAATIEERF
AKLATETITVQRVHGDLHLGQVLRTPESWLLIDFEGEPGQPLEERRAPDS
PLRDVAGVLRSFEYAAYGPLVEQGAQNTDKQLAARAREWVERNRTAFCDG
YAAASGIDPRDSAPLLAAYELDKAVYEAGYEARHRPGWLPIPLRSIARLT
TA
>MAP2701c hypothetical protein
MSTVSGVSGVTVRPLYDSTDSLLRFAVRADATLTGLCGLAVAVAADPLSS
LTGLTSFQEYIGGAFFVLYGLVVFSLGALPDLRRAGISIVAANAVFTLAA
IVAAGVLPLTGAGVALTLASAAYTAAFAAVQYAGVRRLETA
>MAP1751 hypothetical protein
MTANTVLVTGAFGQVGKRCTQLLLQRGRTVVAMDLRNDNTAAVATELAAG
GYPGTLIPAYTDLLDAEAVRDLVTAHQPDAVVHLAAVVSPLSYRKPDLAR
RVNVGGTENLLAACTALPRPPLFLMASSAAVYGSRNPYRQPERITPDTPV
NPIDQYGQDKVLAEAAIRASGLPYALFRLGGIISPDTHASINGDYLLLMR
SMPSDNRMHAVDARDVALAFANGVDRAATISGKVLLIGGNESYVLTQREL
EDAMMEAVGLGRLGPSASLPGNPADDRGWSFTGWYDTTEAQQLLDFQEHD
WSQTVAWLAESQGRSRLALRLLGPLLRPLLRTILIVQRRLGGTRPLRRPV
DVDRKEVRHSGIGDRRLTGSVSGRSTPAARRRGVRPPAA
>MAP0837 hypothetical protein
MSVATAFNGGAPAQIGSGGDAVTLVEGGTFCLSNRHGDVPVGTSYGLFFR
DARVLSRWELRVDGQVAEALSVEATEAFAAQFILRRAPRSGLADSTLLIV
RERLVADGLRETISLHNLDKESTVVSLELHADADFADLFAVKEGRAATAG
ADMHVADGELVLRGRADQVRGLTVSASGDPIVLPGSLNWRVVVPPGGRWQ
TEVMVQPTWSNRKVRTRIPRGKDIESSAPARKIERWRDTATTVETDHAGL
AQVLRQTESDLGALLIHDADGRGRPYVAAGAPWFMTLFGRDSLLTAWMAL
PLDVGLSIGTLQRLAALQGRRVDALTEEEPGRIMHEIRRGPASADVLGGS
VYYGTADATPLFVMLLAESWRWGADEAVIRSLLPAADAALAWAEQYGDRD
GDGFVEYRRATDRGLINQGWKDSFNGINDAKGRVAEPPIALCEVQGYVYA
ALLARAELAEGMGELAQAAQLRERAQALRTRFGEAFWLPDRGWYAVALDG
RKDRVDALTSNVGHCLWTGIATDEHAAAIVERLAGEEMDSGFGLRTLATT
MGAYNPMSYHNGSVWPHDTAITVSGLLRYGHIPGALALAERLATGVLDAA
AAFGGRLPELFCGFPRSQFASPVPYPTSCSPQAWASAAPLLLLRSFLGLD
PHVPNRALTVTPHLPAEWGRIALSNLRLGATTVQLEAEGETVKVQGLGDD
WQLVTP
>MAP0618c hypothetical protein
MLNGAMEKTRSAAFSVPLATASGPSLRDDEYPDKLDAALFRIAGVCGLAC
IMAVLDSTVVAVAQRTFIAQFGVNQAIVSWTIAGYMLAFATVIPITGWAA
DRFGTKRLFMGSVLIFTLGSLLCAVAPNILLLILFRVVQGVGGGMLLPLS
FVILTREAGPKRVGRLMAVGGIPILLGPIGGPILGGWLIGAYGWKWIFLI
NLPIGLTAFALAALLFPKDRSAPSEALDITGALLLSPGVAIFLCGVCSIP
GRHTVADRYVLVPALVGLVLIAAFILHAWYRTEHPLIDLRLFRNPVVTQV
NVTLLVFAAASVGVGLLVPSYFQIVGHETPMQSGLHMLPIGVGAVLTMPL
GGAVMDKHGPGKIVLTGLPLMAVGLAVFTYGVARQAAYSPVLVCGLAIMG
LGIGLTTTPLSAALMQALAPHQVARGTTLISVNQQVGGSIGAALMAVILT
NQFNRNPALMAANEAAGMHPVTGKRGLPVDPSTVPRPAMTPELAGHVSHH
LSHAYTAVFVLAVVLVACTIIPASFLPRKPPSPPAGD
>MAP0877c hypothetical protein
MEADGINLNNATLPTLPIEAPNYDRGSVTVGIAHIGAGHFHRAHQAMYID
RLLQAGLAREWGICGVGVMPADWTMRDVLNDQDGLYTLILEKPDGSRDAH
VIGSIIDYRYAPDDPESALEVLAAPTTRIISLTITEGGYRDPEGPAFAFI
VEALDRRRRRGIAAPTIASCDNIENNGEIARRAVLANAEARSPELAEWVD
EHVCFPSSMVDRITPSTTLQMAAEVRRDFGVNDRWPVVAEPFTAWVLEDK
FADGRPPLERAGVLLVDDVAPYELMKLRMLNAGHQCLAYFAHLCGFEFVH
EAARDPLFAEYLWAYFESEAIPTLPPVPGIDLYDYGHRLIERFTNPGVRD
TIARLCAFSSDRIPKWLLPVINDNLANDGSVRIAAAAVASWARYAEGTDE
WGKPFEVVDQLADSLIPIARSQYENPTAFIEIAGVFGDLAHQPRFVQAYS
WALESLHRKGARATLEELVR
>MAP1384c hypothetical protein
MAEQTVDAGTVFAPRVLREYALLADGERGALIGPQGDVAWMCAPRWESDA
VFSNLVGGGGVYAITPTDVRHVWGGYYEPGTLIWRNRWITRDGIVECREA
LAFPGSPERVVLLRRLVACRGTVRVRVLLAPAAEFGKHGPGRPRLDDDGV
WSGRSGRLRWRWLAGCDARPVDSAGHGAGELLGCEITLDEGGRHDLVLEL
SERTLPERLPDPGLAWDATAAAWQRSVPALNCTIAPRDGRHAYAVLRGLT
SSSGGMVAAATLGLPERAHTNQNYDYRYAWIRDQVYAGMAAAAVGTTELL
DRAVGFIGARLLEDGADLKPAYTVTGGAVPGEETLDLPGYPGGTDKVGNW
VNQQFQLDVFGEALQLLAVAGAADRLDAEGWRAAQAAIRAIEKRWREPDY
GVWEIQDERWTQSRLACVAGLRAVAKLAGAGPELTACASLADAILADTAS
ESLHPHGYWQRSPRLPGVDASLLLPPVRGAVPADDPRTRATLDAVRRDLT
DDGFVYRFRHDERDLGEAEGAFLLCGFMMALAEHQLGHTHCAMRWFERNR
AACGPPGLFCEEYDVRQRQLRGNLPQAFAHALMLECTATLAEDAVHHD
>MAP1769c hypothetical protein
MLDRPFGRRSLLRGAGALSAAALAPWSAGCGSDDDGALTFFFAANPEERD
ARMRIIDEFARRHPDIKVRAVLSGPGVMQQLSTFCVGGRCPDVLMAWELS
YAELADRGVLLDLGPLLARDKAFAQQLQADSIPALYETFTFNGKQYALPE
QWSGNYLFYNKRLFDEAGVPSPPAAWEHPWGFSEFLNAARALTKRDASGR
AAQYGFVNTWGSYYSAGLFAMNNGVPWSDPRLNPTHFNFDNAAFQEAVQF
YADLANKYRVAPNGSETQSMSTPNLFAVGRAAMALGGHWRYQTYLRAEGL
DFDVAPLPVGPAVGKGQPACSDIGATGLAISSSSPRKEQAWEFVKFATGP
VGQALIGESCLFVPVLRSALKSDGFARAHRRVGNLGVLTDGPAFSQGLPI
TPAWEKVNALMDRNFGPILRGSRPATSLAGLSRSVDEVLRSP
>MAP3581c hypothetical protein
MTLCRCFVKRKVCRRHERMIPEPLVNGFCFGEGPRWFEGLLWFSDMLGEA
VHTTTMGGALTTLPLPGHCPSGLGFRPDGSLLIASTRDRRVLRYDGETVV
TVAELADIAPADLGDMVVDRAGRAYIGCQAFTGGAIIRLDPDDRATVVAD
DLDFPNGMAITPDGATLIVAESTGRRLSAFAIDDDGALSGRRVFADGLDG
PPDGIALDADGGVWAAMTLAHQFERIVAGGAVSDRIDIGDRVAIACALGG
PQRRTLFLLSSTDAYPQRLVGTRLSRLDAVTVTTPGAGLP
>MAP4076 hypothetical protein
MRSRMAPIALVATLAVVMTVIAPAPPRGYDGPPLFVANPVDHVVTLIGTG
TGGETVGEINNFPGATVPFGMVQYSPDTAGNYAGYNYDNPRATGFSMTHA
SVGCAAFGDISMLPVTGGLGVQPWNATERIAHDDTERGVPGYYTVRLPGA
GVTAELTATAHTGVGRFSYPGDGRPALLQVRSGSSLAGNSRATIQLGEDN
TTITGWATSGGFCDKPNAYTVYFAMKFSRPFISYGSWDGSTVYPGARSAN
APYSGGYVEFPAGSVLEARTAISYVGIEGARANLAAEGNAGFDDVRAAAA
RQWNAALSRVTVAGRNTDDLTTFYTALYRTMLHPNMFNDADGRYVGFDGG
IHRVADGHVQYANFSDWDTYRCLAALQALLFPDRAADMAQSLVTDAQQSG
ALPRWAFANAATGEMSGDSVVPLIVNLNTFGANDFDTAAALRYMVDGATK
GGVGLAGYVERRGIATYLRLGYAPFTLEFGRNGWIADASITLEWSIDDFA
ISRFADSLGDAATAAEFANRAQYWQNLFNASTRSVVPRSWSGFFRPGPAV
VVSPDNFGQVGYDEGNAEQYVWSVPHNVAGLVTALGGRAAVAERLDRFTK
KLNVGPNQPYLWAGNEPSFGVPWLYNYIGQPWKTQHTVDRVRGLFGPTAD
GEPGNDDLGALAGWYVWAALGLYPATPGTPILTVAAPLFDRIEIALPAGK
SIRIAAPGASGPHHPAYISGLKVDGRPTEHTWLPESIIRTGGTLAFSLAA
YPDKHWGTAESDAPPSFGAGSSAVAVNVPQPVVSIAPGHRGSVTLDAQRM
IDGAGAYTVSGAASDPGVAVTPTSGQFDAGGAASVSVPITVAPTVPQDYY
LVSLTTTVGQSTRTTTVLVDTQAEPESR
>MAP1402 hypothetical protein
MTPAERTVLMVVHTGREEATETARRVQKVLGDNGIALRVLSAEAVDRGPL
HLAPDDMRAMGVEIEVVDADPLAARGCELVLVLGGDGTFLRAAELARNAD
IPVLGVNLGRIGFLAEAEAEAIDKVLEHVVARDYRVENRMTLDVVVRHQG
TVSDHGWALNEVSLEKGPRLGVLGVVVEIDGRPVSAFGCDGVLVSTPTGS
TAYAFSAGGPVLWPDLEAILVVPNNAHALFGRPMVTSPAATIAIEIEADG
HDALVFCDGRREMLIPAGSRIEVKRCDTAVKWARLDSAPFTDRLVRKFRL
PVTGWRGN
>MAP0194 hypothetical protein
MSGRLILVRHGQSYGNVERRLDTRPPGAELTPLGRDQAREFARRSGRPAM
LAHSVATRASQTAAVIGGQLALPPVELGGIHEVQVGRLENRNDDEAVAEF
NAIYERWHRGELEVPLPGGETGSEVLDRYLPVLTELRLRYLDDHDFHGDI
VVVSHGAAIRLAAAVLAGVEADFALDHHLDNAQSVALTPITDGRWSCVRW
GTLTPPFYPEAAEIPATPVADAVRSTTDPMG
>MAP1795c hypothetical protein
MRIAVTGASGVLGRGLAARLLSKGHDVVGIARHRPDSWSSAADFIEADIR
DAAAVNRAVTGADVVAHFAWAHDTREGTGSQVNIAGTANVLAAMAETSAR
RIVFPSSPHVYGGGDTPRSEADAPAPSAADAVQQARVEEMLVASGLESVT
IRCALVVGRNVDNWVRRVLALPAFPGGTVDRPIQLVHLDDALRLLESAVL
SHEIPSGTVNLAAPGTTTFRRIAATLRRPVLRLGRPAPAELEPVLSAPLM
DTALLHERWGFRPAWQCGECVEDMALAVRGRVTVGRRVVSLPWRLANVAD
LPAVDAPAQDGVVPVPAGPEGVNGEFDTPIDPRFPTFLATNLSEALPGPF
SPSSASATVRGVRAGGVTIAERLRPGGLVQREIALRTVGVFAHRLYGAIT
SAHFMADTVPFAKPSMIVSNSGFFGPSMASLPVFGEQRPAETGWARRLLR
NTRNIGVFGVNLVGLSSGAARDTDDFIADVDRLERLAGDAAELDERRLLS
LIALARDQVVHGWVLASASFMLCAAFNVLLRAVCGRDTSAPAGPELVSAR
SVEAMQRLVVAAQRDPNVARVLTEPGERLDKLAVDAPEFHAAVLAELALI
GHRGPAELEMLSTSYADDPELLVRMVAKAMAATAAPPPQHPRIPLQAKPI
AALAARQLRDREARRDKVVRANWVLRALLREYGRRLTHAGVFGSADDVFY
LLVDELDALPADVAAVVARRRAEHRRLAGIVPPTVFSGSWQPTATSVPAL
AGGGTLRGVGVCGGRVRGRVRIVRPETIDDLQPGEILVAEATDVGYTAAF
CYAAAVVTELGGPMSHAAVVAREFGFPCVVDVQGATKSLPPGALVEVDGA
TGEIRVLELASEQATGGG
>MAP2053 hypothetical protein
MSAQMTPVMQAASEFALIGGIGFTAFGIYLSVRRRRLHPLLLLCISAMSF
SWIEAPYDWAMYAQFPPAIPRMPSWWPLNMTWGGLPLFVPIGYISYFVLP
AVTGTALGRWLSARFGWRRPQTLLVVGLVVGFCWALFFNGFLGAKLGVFY
YGRVIPGLAIREGTVHQYPLYDSVAMAIQMMLFTYLLGRTDAQGPNVIEM
WAEHRSKSRVGASVLSVVAVVVVGNALYGAVFAPHLVTKLGGWVTAGPTG
ELFPGVPNQPR
>MAP1336 hypothetical protein
MQGVAGGLLAGLGYAVINSALPRWLWTRGSALVSAMWGVATVVGPATGGL
FAQLGIWRWAFVVMAVLTALMALLVPVALARVDPAPAIPRMKVPVWSLLI
IGVAALAVSVAQIPHNTAATFGLLAAGIMLVGLFVIVDWRMHAAILPPSV
FSPGPLKWIYLTMGVLMAAAMVNTYVPLFGQRLAHLTPIAAGFLGAALAL
GWTVSEIVSASLENPRTVGRVVMVAPLVAASGLALGAVARHGDGSAWTAA
LWAVALLVAGTGIGMAWPHLSARAMASVNDPAEGGAASAAINTVQLTSAA
IGAGLAGVVVNTATGGDEMAAHLLFTVFTALSAAGVAVSYAATRATRQAQ
PVGNVG
>MAP0971 hypothetical protein
MRAIATRATVKNAAAALLCAAGVDALARRRHRDTLAILMFHGVEDRPPSP
PCWHVSGAALFRRQLRYVRAHFNVLALEDALDRLAGGTLPPRALAITFDD
GTRNLATHAMPVLRELELPAAVFLATGPIGTDRTLWPDRLWLAIAHSPAR
EIDLTPWGLGTRPLKSNIERGAAYTVVVEKLKKLPDPQRIAALDEILCVL
GHHDDGDGGPFRMLSWEQIRQLAADPLVTLYPHTVTHPILARCDDAKLHR
EITESCATIERETGSPPTVFAYPNGRLQDFDERAKDVLRARGVRWALSTA
PGFADRSCDPLALPRLAVGGDASLNYFKLLVSGGLPRR
>MAP0269 hypothetical protein
MQVLLVRHALPLRSEHGQGSDPDLSADGLAQIERLPKALERFPISRVVSS
PQRRAVQTAAPVAADRGLSVEIDDRFAEYDRDLPVYIPVEQIRDEMPEEW
ARLAQGHLPSAVDEDAFRARVRAAVDDLVAAADPEDTVAVFSHGGVINVL
LHEILGTARLLSFPVDYASVTRLLFSRSGQATVAAVNSTEHVWDLLPRNQ
RW
>MAP1399 hypothetical protein
MKTLAQQHDCLLIDLDGTAFRGRSPTEGAVQALARLPGRALFVTNNASRS
AAEVAAHLTELGFTATADDVATSAQSAAHLLAGQLPAGAPVLIVGTEALA
GEIAAVGLRPVRRYDDGPVAVVQGLSMTIGWPDLAEAALAIRAGAVWVAA
NIDLTLPTERGLLPGNGSLVAAVAAATGATPQVAGKPAPALLRDAAARGD
FRAPLVIGDRLDTDIEGANAAGLPSLLVLTGVNSARDAVYAKPARRPTYI
GHDLRALHQDAAALAVAPQPGWRVDVGDTAITVSGDGRDDGSGDGLSVVR
AVASAVWGSPDAGTLPIRAGDDRARAALARWSLVRTD
>MAP1573c hypothetical protein
MSLETSTTTPAGSPLSDRELDLIDKYWRAANYLSVGQIYLLDNPLLKEPL
SAEHVKPRLLGHWGTTPGLNLVYAHLNRIIRNRDADVIYVTGPGHGGPGL
VANAYLEGTYSEVYTGIEEDAEGLRKLFRQFSFPGGIPSHVAAQTPGSIH
EGGELGYALVHAYGAAFDNPYLVVACVIGDGEAETGPLAAGWHSNKFLNP
VTDGAVLPILALNGYKIANPTVLARIPHTELEALLRGYGYRPITVAGDDP
TDVHRQLAAALDEAFDGIAAIQGAARGGGEVQRPVWPMIVLRTPKGWTGP
KVVDGKRVEGTWRSHQVPLAETHDNPEHRAQLEEWLRSYGPEQLFDDDGR
LRAELRALAPTGDRRMSANPHANGGLLLHDLDLPDFRDYAVPVSRPGSVT
HEATRVLGTFLRDVIARNKDRFRMMGPDETASNRLDAVYGATEKVWLSAT
EPDDEHLAPDGRVMEVLSEHLCQGWLEGYLLTGRHGLFNCYEAFVHIVDS
MLNQHAKWLATSRELPWRRPIASLNYLLTSHVWRQDHNGASHQDPGFIDL
VANKRAELTRVYLPPDGNTLLSVADHCLRSRDYINVIVAGKQPALAYLDM
DAAIAHCTRGLGIWDWASTARSIGAEPDVVLACAGDIPTLETLAAADILR
RELPDLAVRVVNVVDLMRLQPDSEHPHGLPDREFDALFTRDRPVIFAYHG
YPWLIHRLTYRRANHAQLHVRGFKERGTTTTPFDMVMLNDLDRFHLVIDV
LDRVEGLASRAAMLRQRMVDARLAARMYTREHGEDDPAIANWTWEPSERN
SRSE
>MAP1766c hypothetical protein
MASVTFQQATRRYPGADRPALDNLNLIVGDGEFVVLVGPSGCGKTTSLRM
VAGLETVDSGRILIGDRDVTNVDPKQRDVAMVFQNYALYPHMTVAQNMGF
AMKIAKIPKAQIRERVLDAAKLLDLQPYLDRKPKDLSGGQRQRVAMGRAI
VRRPQVFLMDEPLSNLDAKLRVQTRNQIAALQRRLGTTTVYVTHDQVEAM
TMGDRVAVLRDGVLQQFAPPRELYRNPSNVFVAGFIGSPAMNLFTLQVVD
SAVSLGDWPIALPREIAAAASEVVVGVRPEHFELGGLGVEMEVDVVEELG
ADAYLYGRITGSGKVIDAPIVARVDGRNPPEKGSRVRLHPAPGHLHFFGR
NGQRIG
>MAP4031 hypothetical protein
MRVLVTGAAGFIGSRVAAALRAAGHDVVAVDALLAAAHGPNPLPPNGCHR
VDVRDADALAPLLAGVDVVCHQAAMVGAGVDAADAPAYGGHNDLATTVLL
AQMFAAGVRRLVLASSMVVYGQGGYRCERHGPVHPAAAAACGPGRRGLRA
PVPDRR
>MAP3248 hypothetical protein
MTGATTPKGNGETGFHPRPDGYQSPMRSLTRVLITGGAGFLGAHLCARLL
DDGVEVVSVDDLSTSGPAVRFGDRPGYRFVQRDVCEPGLIDEVGSGFDAV
FHLASAASPVDYQRRPIQTLCTGSAGTATALEIAERAGARFVLASTSEVY
GDPESHPQRESYWGNVNPAGPRSVYDEAKRFAEALTFAYHRLGRADVGAA
RIFNTYGPGMRADDGRMVPTFCLQALRGDPLTVSGTGLQTRSLCYVDDTI
TGLIALAHSDFAGPVNIGNPTELTVLSAAELIRELAGSTSTIQFTPPAAD
DPQRRCPDIRLARKRLGWRPRVDYRTGLSTTLAWFAERAGRTGESAQQLV
ATTDRRTR
>MAP0083c hypothetical protein
MVEPSTGAWPAQLPAVQVRFARPTAQLDRIVEFYRDVLHLPQLHSAGDDE
WSVVMFGLPGDQYHLEFVAHRDGIDGTAPTRENLLVFYFESAAQQQSVAA
RCREAGAEEVVLDNPWWRRNGADAFLDPDGWTIVLMPRPVPLPAPTA
>MAP3556 hypothetical protein
MSGKPKLVIGANGFLGSHVTRQLVADGAQVRAMVRAGANTRGIDDLSLTR
FHGDVFDTAVLSEAMDGVDDVYYCVVDTRAWLRDTSPLFRTNVEGLRNVL
DVAVTQPELRKFIFTSTYATVGRRRGHVATEDDVIGTRGLSDYVKSRVQA
ENLVMRYVAEAGLPAVAMCVSTTYGSGDWGRTPHGAFIAGAVFGKLPFTM
EGIQLEVVGVTDAAKAMVLAADRGRVGERYLISERMIALKEVVRIAADEA
GVPPPRRSISVPTLYALGALGDLRARLTGKDAELSLASVRMMRAEAPVDH
SKAVRELGWQPRPVEESIREAARFWAAMRNAKGKSVTPR
>MAP3633 hypothetical protein
MTVMTAETANAAARTWTPRIAAQLAILAAAAFTYVTAEILPVGALPAIAR
NLQVSLVLVGTLLSWYALVAALTTIPLVRWTAHLPRRRVLVASLTCLTAS
QLISALAPNFAVLAAGRVLCAITHGLLWSVIAPIATRLVPPSHAGRATMS
IYVGTSLALVVGSPLTAALSLMWGWRLAVVCVTVAAAVVTVAARLMLPEM
VLTEHQLAHVGPRSRHHRNGRLITVSLLAMVAVTGHFVSYTFIVEIIRNV
VGVRGPTLAWVLAAYGLAGLLSVPLVARPLDHRPKSAVILCMTGLTAAFA
VLTALAFGGPTGATTALIGTAAIVLWGAMATAVSPMMQSAAMRNGADDPD
GASGLYVTAFQVGIMAGSLLGGLLYERSVILMLSASGVLMAVALVGIAAN
RRMLDVAPTSSRDS
>MAP0241c hypothetical protein
MAPGSGPHIETRNAWVEFPIFDAKSRSLKKAFLGKAGGTIGRNTSNVVVV
EALRDITMSLELGDRVGLVGHNGAGKSTLLRLLSGIYEPTRGWAQVTGRV
APIFDLGVGMDPEISGYENIIIRGLFLGQTRKQMLAKVDEIAEFTELGDY
LSMPLRTYSTGMRVRLAMGVVTSIDPEILLLDEGLGAVDADFMKKAQSRL
QGLVERSGILVFASHSNEFLARLCKTAMWIDHGQIRMSGGIEDVVRAYEG
EDAARHVREVLAETAEQP
>MAP2254 hypothetical protein
MATTPARRRLRQLLDNGELIVAPGVFDGLSAHLARRTGHVAAYLTGAGVA
ASGFGLPDIGLVTATEMADRAAMIAAALGDVPLIADADTGYGGPMNVVRT
VRAYDAAGVAAIQLEDQVFPKRCGHLPDKQVVDAAVFEQTLAAALDVRSD
DDLLVVARTDARGPLGLDAAIERANRYARAGADIVFVEAPHDAGEIERIA
TEVDAPLLINLVLGGLTPLQSAARLHELGYAIAIHPGDLLMHTTFAMLHS
LCGLNGTDPAAHLPASPGDFFDLVGMAHWRALDEKYAHTGGRTWA
>MAP2843c hypothetical protein
MSKVDVAALIALCAALASAVGDVIRQRSAHEITDKPVGHLELFRMSLRDT
RWWLGGLAAITNYSLQAAALAWGSVVLVTALQVTALLFALPLYARLAHQR
IKPREWAWALILAAALAVVIIVGDPASGKQRAPLHIWVIVALVMVPVLVA
CVVAARRWAGSPFAPVLLAVVAGSSLALFAVLTKGVVEMSEHSLVGVLTS
PEFVPWLLVALTGMIFQQSAFRAGALTASLPTMTVAKPVVAGLLGVLVLD
ETLNAHGPKAFVLVGAVAVVIVATIALARGEAASINRPEPGPGSAPRKAR
DATADVAADDDDTGPFSGRLLVADSSRW
>MAP3133c hypothetical protein
MVRIASVVARQLLDCKARPLVEVEITTDTGHVGRGAAPTGTSVGAHEAFV
LRDGDPTRYRGRSVHRAVAAVRDEIAPALTGAELDDPRSLDRVMIELDDT
PDKHRLGGNAIYSTSIALLRAAAAAAGTPTYTYVGALLGLTPPTTVPMPS
FNMINGGRYGDVEQSFSEFLVVPYRAESIQAAVEKGVSLFEVLGEVLAEH
LGRTPLLASSYGYIAPSGDPHAVLELLAEAVERAGCADVMAFALDCASSE
VCDNGSATYAFNGGRVTAEALIDYARALSQEFPMLFIEDLLDGDDWAGFT
KAVQTVNRSIIVGDDLIVTNRDRLRRAVETSAVDGFILMPNQVGTIAEAL
DCFEYASQNGLLAIPSGRSGGVIDDVVMDLAVGLGAPLQKNGAPRSGERI
EKLNFLLRAAEGIPDCALADVPALARF
>MAP4308c hypothetical protein
MPVRAMRKWESSMSNQQQAERMTSGKGFIAALDQSGGSTPKALRLYGIED
SAYSSEKEMFDLIHQMRSRIITSPAFTGDRVLAAILFEQTMDRDIEGKPS
TTYLWETKGVVPILKIDKGLAEASDDVQLMKPIPGLDELLQRAVSKGVFG
TKERSVIGGANPVGIAAVVAQQFELAHQVLSHGLVPIIEPEVTISIADKA
KAEGILRDEITKQLDSVPDGQRVMLKLSLPTEANFYRPLIEHPKVMRVVA
LSGGYSREEANELLAKNAGLIASFSRALTEGLTVDQSDEQFNATLDKAIQ
SIYDASVAG
>MAP3165 hypothetical protein
MRFALATMGTRGDVEPFATIGRELQRRGHEIRMAVPPNYIRFVESAGLSA
VPHGPDQVKQNEDIVRKYGTAPNPMFLAWVISEDLKRLWPSLGTALMSLA
DGADLLLTDTSEHGLAANVAEYYDIPAATLHYYPSGGAAQLTQEAESAQR
RALGLSGPTAPAARRPMELQAYDEFFFPGLAAEWAQCGVRRPFVGALTLE
LPTEADDEVLSWIAAGTPPIYFGFGSSARVASPGDVIAMITTACAELGER
ALICSGVSDLTQIRTGDHVKAVSAVNHSTVFPACRAVVHHGGPGTTFAGI
RAGVPSLVLAVSVDQPLWAAVINQLEVGIGRHFSETTPDSLVADLRSVLD
PRYVTRTRAVAARMRTPAESASAAADVLEDAARQGSCG
>MAP0142c hypothetical protein
MMSSNVRAARNRPGRTDPQPPTGAPLFVGVLGLLLATGWVANHFVGLMPA
ISDRDHLATTTLDGIFGIYALGLLPGLLVGGRTSDALGRRPVALTGSAAA
LVGTVAMLLSQHSPALFAGRLIVGLGVGLAISAGTAWASDLRGPAGAATA
GAVLTAGFAVGPFAGGVRAWAGPSGVRASFALAAAILALAAFAVVAAPQP
SPVTAPADPDGEETADAAPQGISRALSWAMPLAPWVFASATLGFVTIPGR
LHTALAAPVAAGTATLIVNGVSGAVQVLARALRWGPQAGTAGAVLAALGY
AVAAAAPPTLTPALGVPLFVVLGCASGLCLREGLIDLGGRRAATPARRPD
GLFYVVTYIGFGLPLILASVRPGVATAILSGMAVLAMTAAVGRAARLRRD
DHRQN
>MAP3288 hypothetical protein
MSDANLAAVLALCAALASAVGNVVRQRSAQEVTDKPVGHLALFGMLLRDT
RWWLGGLGDIGSYVLLAAALDRGSVLLVMSLQVTALLFALPIYARMTHHP
ITGREWAWALLLAVALAVLIAVGDPTGGQQRAPLQTWLVVAAVIGPLLVL
GLLGARVWADRPVAALLLAAVAGSLLAMFAVLMKGVVDILEHNPGQLWRS
FELYALVFCGVAGMIYHQSAYRAGALTASLPTIIVAKPVVGGILGIIVLR
ETLVAGGWEWVVLAVTAVVVIVATVGLARGEAASMSAGAGRDVRPTDRPK
AAYQA
>MAP3834 hypothetical protein
MRDDDSANRWSGVREAPTAPSRQLTGAVVIIALVAAISGMLYGYDTGVIS
WALLQLTQDFNITEGWQQVIAASILLGAVAGALTCSWLSDLRGRRGTLLM
LAVVFIVGALWCADAADSVMLSLGRLVLGFAVGGATQTAPMYVAELAPPA
YRGRLVLCFQIAIGVGILTATLVGAGGSISWRGPIGLACVPAAIMLWLLL
RLPESPRWLVKKDNRDAARAVLEHVRPEGYDVAAELDEATELARVERTAA
TRGWRGLRDAWVRPALVLGCGIAVFTQLSGIEMIIYYSPTILTDDGVYRS
VALQVSVCLGAAYLIAQLVGLAIIDRVGRRRLTLIMVPGAAVSLFALGLL
FITSDSGRDVIPYIMICLIAFMLFNGGGLQLMGWLTGSETYPLAVRPAAT
ALQSATLWGTNLVITLTMLSLIKAIGVGPLMWLYALFNVAAWIFVFFRMP
DLTGKTLEEIEYQLSEGKFRPSDFGR
>MAP1587c hypothetical protein
MASVVTGPDWVQHVIWWQVYPLGFVGAYPLPQSSEPPQPEQHRLRRLVDW
FDHAIELGASGIALGPIFASRTHGYDTTDHYRIDPRLGDDTDFDHLIAQA
HRRGLRVLLDGVFNHVGVDFPRYRAAIDSDDHAAARWFRGSRGRFHTFEG
HDGLITLNHDNPEVVDYTVDVMAHWLGRGADGWRLDAAYAVPETFWAATL
PRVRDRYPQAWFVGELIHGDYAAVVRAAGFDSATQYELWKAIWSSLNDGN
FYELDWALQRHNAFLGRFSPLTFIGNHDVTRIASQLNNPAHLAHALVLLL
TVGGVPSVYAGDELGFRGVKEERFGGDDAVRPEFGSPPIPLDDFGTEVWR
LHQYLIGLRRRHPWLHGATTTALLLQNRHYVYETRSGDEALLIVLNIGDE
PLRVSLRELGGRRGRVIGGSAAPPPEVLDTVVVEPQGWRILSPI
>MAP3506c hypothetical protein
MIASSPTADGRIPARMSISRAISTQPSRYATPGSPVPAIDVAPGWRLDRV
TAPSRLFGANGLRTGPDGRIYIAQVTGSQISALDHHTGELETASPKGGDI
VAPDDVAFDAAGNLYATEVMDGRVSVRDTGGRTRVLRDDVPSANGITFHR
GRLFIGECREGGRLLEFDLAGGPPRVLLDNVPSPNAMEVGPDGLLYFPVM
GANEIWRVDPDGGEPQCVATGLGVPDSVKFDARGHIVSTQVASGQVLRID
PRSGEQRVLAQLNPGLDNCTFVGDRLFVSNFTGEITEISPDGTTRSVLPG
GLNWPLDLAVGHDGRLYVADGTYFYAALPDGALHTVGMLFSPGYPGFLRG
VAASGPDEFVVTTSGGQISRYRPEASESEVLADGFDQLYGIAPGPAGVVV
AELGTGRVLSLQRSGAVEVLAAGLREPVGVAIGHDGAALVAEAGAGRVVR
VDGSKVDTVMADLQRPQGILLHGGVLYVVDAGAKELIAFDPADKTRRTIA
SGLPVGPPPGVNPKPLKGMPPFSGPQGPFAGITAAADGTLYVSADGEGSV
LALRPTRDGH
>MAP1578 hypothetical protein
MHILVTDATGALGRLVAGQLIAAGHTVTGIAELPHPCLDRNVEFVCAPLR
DRVLRELTDEADAVIHLAPIDPTAPGNAAMDGLARVTDAAARAGSRLLFV
SQAAGRPELYRPAEDLVASSWGPSVVVRIAPPVGRQLDWMVCRTVATVLR
TKVSAQPMRVLHLDDLMRFLVTALDTDRTGVVDLASPDTVNLVTAWRMLR
ATDPRSRLHRVRSWSQLIPDLNVAAAQEDWLFEFGWHALDAVADTARGLV
GRRLDTAGAINHGAQLALPVEVLPQRAGNGAHSAAPEGVEGEFDDRIDPR
FPVFSATALTEALPGPLTPITLDVQLSGLRTASRVLGRVLALGGAVGEEW
DSRAIAVFGHRPYVGVSVNMVAATQLPGWDQDAVARDALVGRPQVGDPLP
FGEPALAGGALGSIAKAVAAGRSVALLRHLRANTRDYCAAAVAEQVDAAQ
LALLPQAALEVRLRLLRDRIHQGWILNALWLLDTGITAATLVRSQAGQSV
PGLGMITESGLVAAETAELAAMLAADPPLCALAADGNLASIRALAPKTAA
AVDAAVARIGHRGPGDAELASQTFGDDPAMLLSAAAAAAPAEPAAPATLA
ERLAANARGSKELAHDATMRFTHELRMTLRALGSLRVEADLIDAVDDVYY
LTCNELVTMPGDARLRIKRRRTERERLQVQCPPEVIDGRWTPPQHCAEGS
PARRPAG
>MAP0169c hypothetical protein
MTIVVTGATGNVGRPLVTALLAAGAPVRAVTRRSGPAGFPAQVELFDTAA
DALPGASAVFLNARALGGQLEDVVARCARGGVTKLVALSAINADDDFSRQ
PSRFRGDRNRQVEQLAVGSGLAWVSLRPTVFATNFAGMWSAQLREGDVVA
GPYAAASSAPIVETDIADVAAHALLTDDLVGQRIPLTGPQALSNAQLVEV
IGTVLGRSLRYQEIPAAAVRQRFVGLGLGAGFADAYLAMLAETLDKPALV
THDVEKILGRPALPFAHWVATHRDLFDRRNERKEGV
>MAP3242 hypothetical protein
MVSELALDGIQGVPAAGGVKRFVRYRARDVRAVTAGPAAGAGHAADRRSA
GLNGPLLGSSPAARSRRQVRALTTIAVIGATGTAGSRVVARLRPRDVAVV
EISRAQGVDVFDAPGLTRALEGVDVVIDVSNPVPPDGRSDIRQTLATAAR
NIMGACATQEVQRLVVLTIAGIEDPVFDEFPYYQAKREAKEILLDGPVPT
TVVKSTQWYEFATNHDAVSCDDNEVIVEDWLIQPIAADTVADVLVETALA
QTHTPRTITGPDAIRLPELTSKLLVRQGDRRAVRSVEPAVSALGAGALLA
SGHAVVVGPDVDTWLQTLAPPGGDGHRGAGGDGHRGADGDGKPRRG
>MAP2441c hypothetical protein
MPARSMVVMPHGPVAHQAGTRGYRRMTAALCGAGLASFAAMYCSQALLPA
LSAHYRIGPATAALTVSLTTGALALSIIPASVLSERYGRIRVMLISGVAS
SVIGLLLPFSPSLGVLLFGRAAQGVALAGIPAVAMALLAEEVDASSLGSA
MGRYIAGTTIGGLAGRIVPSVVVQVGTWRVALLACSLITLAGTAVFAVLV
PRSRFFTPKPASVRAALRNLAGHLRNPVLAKLFAVGFVLMGGFVTVYNYL
GYRLAARPFGLAPSVVGLLFLLYLVGTGTSVVAGRLADRRGRPLVLGAAL
PIAVAGLLLTVPATLAAIVAGVGVFTGGFFAAHTVASGWVGAVAQRDRAE
ASALYLFSYYLGGSVAGAFGGVLYGVGGWSATVCFVVVLLMAGAALVALL
VRDNGFRIGRRVVTSVASVK
>MAP0616c hypothetical protein
MAPLITLVVGSLVAWVVGRLGVAYVDGWAPALAVGLAAMFVLTGIAHFAP
PLRADLVAIVPPRLPAPGLLVSLTGVLELLGALGLLLPATRAAAAGCLLV
LMLAMFPANIHASRMPDPPKSMTTRLPLRIGMEIVFLAAAVAVALGGR
>MAP3704c hypothetical protein
MPRREEPSHGLLDPVAKMLRLPFGTPEFIDRIVTGGVNQVGRRTLRMLIT
TWDAAGGGPFAASAIASTGMAKTAEIVQGMFIGPVFGPLLRILGADKVAV
RASLCASQLVGLGIMRYGIRSEPLHSMSVDAIVDAIGPTMQRYLVGDITR
>MAP2579c hypothetical protein
MGEPSVRATTPGTPMRRIAAACLVGSAIEFYDFLIYGTAAALVFPAVFFP
RLGPTVATIASMATFATAFLSRPLGAAVFGYFGDRLGRKKTLVATLLIMG
ASTVSVGLVPSTASIGIAAPLLLTVLRLLQGFAVGGEWAGSVLLSAEYAP
TDRRGWYGMFTLLGGGTAGILASLTFLAVNLTMGEHSPAFMHWGWRVPFL
ISSGLIGIALYVRLNIDETPIFVEEKARHLVPKAPLTELLRLQRREIILV
AGSFVGGMGFIYLGNTFLVMYAHNHLGYSRSFIWGIGALGGLTSMACVAC
SAWISDRVGRRRVMLWGLVACLPWAFVVIPLIDTGRPVCYVVAVLGMFGT
AAVANGPTAAFVPELFATRYRYSGAAVAMNLAGIVGAAVPPLLAGTLLAT
YGSWAIGLMMASLVLASFVSVYLLPETRGAALDAAAAAEKVAAR
>MAP3630 hypothetical protein
MSTVHSSIDHHPDLLALRARYERVAESMSAHFTFGLALLAGLYVAASPWI
VGFSATASLATSDLIAGIAAAFLAYGFATTLDRAHGMTWTLPVLGAWVIV
STWILPGVVLTAGMTWSNVVAGALLTFLGLNATYFGMRTRASAG
>MAP0296c hypothetical protein
MHGLIGAARSPEQKRAQLRAGLESGRLQRFPGAFSPLVAKLVAELGFDGV
YVSGAALSADLGLPDIGLTTLTEVSGRGAQIAAVTELPTLIDADTGFGEP
LNAARAVTMLEDAGLAGCHLEDQVNPKRCGHLDGKAVVPVDVMVRRLRAA
VSARRDPNFVVCARTDAAAIEGLPAAIERARAYADAGADLIFTEALQSPT
EFQRFRAALDTPLLANMTEFGKSPLLSTGLLADIGYNVVIYPVTTLRLAM
HAVEAGLREIADTGTQSGLLDRMQHRSRLYQLLRYADYNHFDSDIYNFPR
>MAP1674c hypothetical protein
MKIVLAAYGSRGDVEPCAAVARELQGRGHDVRIALSPDQLGLAESAGLRA
VAYGPDSREQIDTATDFVHTVQNPISALPQLTERVTAVWAAKSETLAALA
AGADLLVAGMNEQRLAANVAQRVGAPLAALHFFPADTLELGWMQSHVTAA
AEAAQRLALGLPVDAGQSDRAALEIQAYDELCVPGLAEQWATAGHRRPFV
GSLTLGLASGVDDEVLSWIADGTAPVYFGFGSTPVSSPAETVAVISAACA
RVGERALICSGPNDFTGIPPEDHVKIVPAVNHAAVFPACRAVVHHGGSGT
TAAGLRAGVPMLILWLWLDQPVWAEAVSALEVGLARAFSASTLDSLTADL
RCVLDGRYHDRAREVATRMTKPGESVAAAADLLEEAARGG
>MAP4121 hypothetical protein
MRAVKCGDAGGFGMEITSALSTELFVGPPEAPLQLVRVAVAGCAERTPVR
VDGPGLRGRALAEAGAEVIEVAVTVDEPVVGQRRPARAGAGEAELPFEFT
VAEPGWTMFMVSHFHYDPVWWNTQGAYTSEWTEDPPGRARQTNGFDLVHA
HLEMARREPEYKFVLAEVDYLKPYWDTRPEDRADLRRFIAQGRVEVMGGT
YNEPNTNLTSTETTIRNLVHGMGFQRDVLGADPATAWQLDVFGHDPQFPG
LAADAGLTSSSWARGPHHQWGPATGDGSDGGVERMQFSSEFEWIAPSGRG
LLTHYMPAHYSAGWWMDSAASLAEAQDATYALFVQLKKVALTRNVLLPAG
TDYTPPNKWVTAIRRDWAARYTWPRFVCALPREFFAAVRAELAQRGGAPS
PQTRDMNPIYTGKDVSYIDTKQANRAAENTVLAAERFAVFAGLLAGAEYP
EAALAKAWVQLAYGAHHDAITGSESDQVYLDLLTGWRDAWELGRTVRDDA
LALLSRRVDGADVAVWNPLNGRRTDVVTARLDAAPGAGVRVLDADGVEVP
AHVEHGGRSVSWLARDVPSLGWRAYRLVAGEAASGWAPLRGSAIANEHYR
LAVDAERGGTVASLIADGRELIAEGRVGNELAVYEEYPSHPSQGEGPWHL
LPKGPVVCSSATRARVRAYRGPLGERVVVHGRIGTLLRYTQTLTLWRGVA
RVDCRTTIDEFTGADRLVRLRWPCPVPGAMPVSEVGDAVVGRGFALLHDG
DRALDTARHPWTLDNPAYGWFGLSSVARVRISDGGLRCLSVAEVVSPAET
TSGPLARELLVALVRAGVTATCSAADGPRYGHLEVDSNLPDVRVALGGPR
RNAFTAAVLADADPAYADELRRQLAATGRARLWVPARAPLAAVWLPDADL
RDARALPVLVVDGRDERELRAAIAALVDDLGDAEITVTQQSPSRTEGFDD
YTVALLNRGVPSFAVDTAGTLHTALLRSCTGWPSGIWIDDPRRSAPDGSN
FQLQHWTHHFDYALVSGPGDWRRAEIPARSAQFGNPLLAVAAGGGAGTGP
GLPPAGSLLRVQPAESVQLAALKAAGNPLASGSAARPDPTAVALRLVETA
GAGTRVAVDSDVATVSDLHAADLLEKPSRQVNSIDLHGYQVATVLARLHA
PAAPAGAAALAPHAEAAQPLYARYWLHNRGPAPLGGLPAVAHLHPARASA
DPDSRVTLRLSAASDCTDATLAGAVVLRCPDGWSADPAELPLRLCAGQHL
QADIAVSVPARAAPGLYPVRAELRLTGEHLPASWRQGVEDVCVVAVGDGE
RAPLIYLADEPAEVTLEPGEAADLTVTVGSHARAELALEAHLISPWGTWD
WMGPAAVGAVLPARGAVDLGFRLAPPAWLDPGQWWALVRIGCAGELLYSP
AVRVTVT
>MAP1687 hypothetical protein
MATQKPQTISYPAPEARPRRHHTDPLDPHVIVLFGATGDLAKRKLIPGLA
YLDQSELAPDIQIVATSLEDLSNDEFRELAKNAIDSFGTHKLTPDQWRNF
ADIITYIPQSAGSDALAAAVAEAEAKLGPNVRRLHYLSVPPKAARAVITM
LRDAKLVERSRVVMEKPFGTDLASAVALNDFVHQTFEESQIFRIDHFLGK
EAAQNILAFRFANGLFEPIWNRNFIDHIQIDIPEALGLDQRASFYEETGA
YKDMVVTHLFQVMAFVVMEPPTALEPFAISEEKNKVFRSMLPVKPSNVVR
GQYIGYREEDGVAKDSDTETFIALKVGIDNWRWAGVPVYLRTGKRMAEGI
RHHLDRVQGGTADHVPARVGGGHAGARPPDVRSCGQFAGVAVVLRQAARP
RHETGQAVHAILHPGDRNRGGRAGGLRTAHPGRDARRPHPVHHRRRHRIT
VGAFRRAAQRPATGQAVPAGDVGPQRNSPVDRAQRVAAAVRARVARSPDP
TPEWAAEKCCGSAQFGAEITDGLAEFPEDFVVGQLEAVVGVHPLGGQAGD
APAGGLQVVAGRAALAEPDQFGHVDVQSVLAPGAAHRPGAFVGEHPHVGD
VLGGQPAARRAERGDVVQQPVLGVRRQVGQQPLGDPGGTGAAVKAVVAQR
CWPVIAQIDGDGAALGGRLRAQPGQGVGLEGDDLGLVDLEHHGARRPGQP
VGPRVEPGGQDDGLANAGSGRGAEEIVEKPRPHRDLLAPLLQRHRHVVGV
ERDLAVQHPDEEGMPDRAHQRFGEPVVEQRVRTARGQRPGGRDHGRGRPH
AGGQIPAVALGTCRTGR
>MAP1880c hypothetical protein
MTVILLRHGRSTSNTAGVLAGRSDGVDLDDKGREQAAGLIDRIGDLPIRA
LISSPLLRCRRTLEPLADTLCLAPLVDDRLAEVDYGEWTGRKIGELVSEP
LWKVVQAHPSAAVFPGGEGLAQVQARAVAAVREHDRRLAAEHGADALWVA
CTHGDVIKAVIADAYGIHLDGFQRVTADPASISVIRYTELRPFVLHVNHT
GARLSAPLRAGPAPPPDQHDDQHHDKHHDTGDKAHTDPKSNGESQAAPAT
SVPSSDAVVGGSTD
>MAP2516 hypothetical protein
MPDTSRGPALLILFATLLATAGTGISIVAFPWLALQHRHSATDASIVAAA
MTLPLVLATLIAGTAVDSFGRRRVSLVCDWLSGAAVTAVPLTAWIFGAAA
IDVAELAVLAFCAAAFDPAGMTARQSMLPEAAARAGWSLDRTNSSYEAML
NLAFIVGPGLGGLLIATLGGINTMWVTAGCFALSFLAIGALRLDGAGKPP
RATRPVGLVTGIAEGVRFVWNLRVLRTLGLIDLAVTALYLPMESVLFPKY
FADQHQPAELGWALMALALGGVAGALGYAVLSARLRRRTAVLTATLTFGA
TTAGIAVLPPLPVILGLCAVTGVVYGPIQPIYNFVMQTRAPHHLRGRVVG
VMAGLTYAAGPLGLLVAGPLADAAGLKATFLTLAVPILAIGVVACGLPSL
RELDRAPQFADDPGP
>MAP4148c hypothetical protein
MKSMQELFEIAWQATQIGATTLKKTQPSSVQHKGDRDLVSDVDLTIQRDI
ANYLSQTTPDVALLAEESDHQPDIETAEWLWVLDPVDGTSNFVHGLPLCA
VSLALLHAGRTVVAVTHAPLLGKTYHAIESKGAFVNGERITASATDSLSG
AIVSLGDYAVGHDAGRLNAHRLALTAELVPRAERIRMIGAATLDLAFVAE
GALDACVMMSNKPWDTAAGTLIAREAGARLTDAQGNPHTHKSASTVVAAP
GIAEQLATVIRNTEVPSDRYAAAVG
>MAP0593c hypothetical protein
MASIFTKIINRELPGRFVYEDDDVVAFLTIEPMTQGHTLVVPREEIDNWQ
DVDSAAFNRVMGVSQLIGKAVCKAFRTERSGLIIAGLEVPHLHVHVFPTR
SLSDFGFANVDRNPSPESLDEAQAKIKAALAQLA
>MAP3931 hypothetical protein
MPRGHLTEPVTDSSPPAKGTFTVDMLSRAKRGVTAVFVAHGLLFASWAAH
IPQVKAGLGLDDAALGTALFGAPLGSVLATLAGHWALPRWGSHRLIPVTV
AGYAAAGTTVGLARSGPALFAALALWGMFQGTLDVAMNTQAGTVERRAGA
PMMARFHGMWSLGTLAGALIGAACVGAGIGLTAQLTVLGAVVLLVVVMLT
RRLLPDAADSVAAPPEPAAGRRMTPAVAILAAVSFASFLCEGAATDWSAT
YLRDVVGAGPSVAAASYAAYTLTMVVTRFGAARLHARLPSRRLLPALAVL
AVAGMSVALATADAAAGVLGFAALGVGVALLVPTAFSAAYGARGAGSAIA
IVAATGWLGYLLGPPLIGHLSEWVGLSGALVTIPVMMTVVAVAIRYTPAF
DTADEFHRAPAG
>MAP0387 hypothetical protein
MVATEHEWSKPAALAIPREGYFELERGRYGPLYPRTPACYGFSIIAKVKE
GREEAVRAYGKQIEEAIKADPHVLAALRLHYLRWLLFDVGSGLHFQYQGI
FDTDFDKYTEDAVQLFSQTGITTVFTNLEGFPEDWRENPDAFVKFVREHQ
CPSFLEYGEYPYVTADEIKKALRLKAAFQTMLDQMQ
>MAP0629c hypothetical protein
MATADGFHPDFDDATPHAARNRVYHVKASQLSSDTAQSEGLRRFAALSGN
SVGSEKLWMGETHALPGSASDNHHHGESETAIYVRTGNPEFVFHDGVHEV
RIATAPGDYVFIPPYLPHREENPDPHTTAEVVIARSTQEAIVVNLPALYP
LTDPD
>MAP3145c hypothetical protein
MGERAPRPGPLPREVWILSWANVMVALGYGVISPALPTFARSFGVSIKAV
TFLVTVFSLSRLCFAPISGLLTERLGERRIYIGGLLIVAVSTAACAFSQA
YWQLMLYRVFSGVGSTMFYVSALGLMIHISPADARGRIAGLFTTSFMVGA
VGGPAVGGLAAGWGLTAPFVVYGVAMLGVALVLFLGLRNSALAAPRPPTR
STVTMREALRVRAYRSALLSNFATGWSAFGLRMALVPLFVSDVIGRGIGT
IGVVLAAFAGGNALAVVPSGYLSDRMGRRTLLIVGLVTSGAATVWLGFVA
SLPVFLVAAGVVGVVTGIYMSPLQAAVADILGNEARAGLPVATVQMMSDL
GAIVGSMAVGWAAEQIGYGWGFFISGVVLLIAAVGWVMAPETRTATELEA
DLMAAESDVEPV
>MAP2062 hypothetical protein
MSLGAKTRVAGVDTHGRELVGRTDIATVLAVAAALMVGAGDVAQQQSAQQ
VTDQPVGTVALFRRLLRDRRWWTGSLVAAGGFGFQAAALGFGSVVLVQAL
LVTSLLFALLISARVNHRRITGRQAVWALLLAAAVAVVVTVGDPQEGTPR
GSLQTWTIVALVMGPALVGCVLGARLYPGPVAALLLGLMSGSLWGLFAVL
TKGVVDQLDRGIPALLRTPELYVWVLLAIAATAWEQSAFRAGPLTASLPA
VTVAEPVIGSVLGVTVLGETLRTNGIGLVALGISVAVMAAATVALARSQA
ASAPTADETPPFSTEKT
>MAP3761c hypothetical protein
MDTVIAMALIAATNPVRLGIALLLISRSRPVLNLLAFWLGAMATAITAGV
GVLTVVHSFAPTLMQSVSALSATPDARHTQIAIGLVILAITALVAVSSSV
RPTRVTSASVDRRAGLPRPNTPSTLARLRGRIRDVLTSDGLSVAFILGLG
SGFPAVEYPIALAAIAASGNSVGVQFSAAVMFTVVMLLVIEIPLVSYSAS
PAQTQAVMQRIHDWVLPRRWQLLAVVVAAEAVWMVAAGLGGA
>MAP1664c hypothetical protein
MHRSSLPAVPGVLTPDQCRQTALSIAAAQEPSGAIPWFDGGHTDPWDHVE
CAMALTTAGLLEPARAAFDWSRRTQRPDGSWPIQFRAGVIEDANSDSNFC
AYIAAGVWHHVLVTGDDSFAAEMWPVVRKAIDFVIDMQVGYGEIAWARSE
AGLLPEALLTGCSSVFHSIRCALALAGLVDDPQPEWELALGRLGHAITAH
PEAFTEKDRYSMDWYYPILGSAVRGPAAVARIKQRWDDFVVDGLGIRCVG
DRPWVTGAETCELVMALEALGRRSEAHRQFAAMQHLREGDGSYWTGLVFA
DGKRWPEERTTWTGAAMILAADALSDATAGAGIFRGDALPVGLQTDFDCE
CVASGR
>MAP3868 hypothetical protein
MLRTPLCRAGAVLASVVLLCSASATASADESIVIDLVRHGQSVANAAGLI
DTAVPGASLTQLGQQQAQTVAGVLAARGPVAAIFASQLIRTQQTAAPLAT
ALGMNVQVLPGLNEIDAGIFNGQPQFSLGGLAYLLAPIAWMLGLRIVPML
APGSADANGYEFRRRFDGALQTVYGSAAANAGGFGDVLQAVYAGAVTNPV
RAANGRITGVAFSSEFAIGVGTMMTVKNPDPLLLLFDPLPNTGIVVIQGS
PRDGWTLISWNGKPVHPGWSATRNGRGANGLCLMLDPSTPTGVCAAKPPG
PASAPVTARPGLS
>MAP4032 hypothetical protein
MVALRYHNVYGPGMPRDTPYSGVAAIFRSALEKGEPPRVFEDGGQMRDFV
HVDDVAAANLAALACRDGFTAVNVCSGQPISILQVATALCDARGGAVAPV
VTGQYRSGDVRHIVADPSRAARLLGFRAAVQPGDGLREFAFAPLR
>MAP0293 hypothetical protein
MTQLLRRSELAVPASNDNMFDKAAGSGADLVFLDLEDAVPPAFKEESRGK
AISALNDLDWGRTARAVRINGLDTPWCHDDLIEVVTAAGRNLDTVIIPKA
RRARDVWWVDVLLSQLEAKLNLGNKIRLEVLIEEVEGLANAEEIAVASPR
LDALIFGVGDFSLSQGARVDTNFVPLGEYPGDFWQYARNKVIVAARIAGI
DAIDAPYPDYRDLTGYERDARRAALMGYTGKWAIHPDQVPVANQVYAPTA
DEIARAEANLAAYREAESNGRGAVGVNGVLVDAAHVKMAQQTLARAALIN
GTGGSNDR
>MAP1418c hypothetical protein
MWGSVLGLGMLAALNPVRLGLALLMISRPRPGSSLLAYWIGGLTVCVPEL
LIPVLLLNFTPMFGHPSHASPSTGLALGKIQIGLGVVGLSIAAVLTVRFA
ARQRAAAPPPDDRTSELSAAPGATIAMPRLLTRAQDVSPDDRSVLRRLLG
RMHSAWESGASWVAWVIGVISVPVDGVLFIVAIIAASGASVTAQVSASVA
FVVLMYAVVEVILVGYLATPGKTQSLLLVLHDWVRTYHRQILVALFTVVG
VSQLAQGLHLV
>MAP2433 hypothetical protein
MRTARRARRTRADRNGVLVPGRVEIDNVQPVVSCGTYPAKAVVGEVVPVS
ASVWREGHEAVAATLVVRYLGPAYPPVTETRRVKAVQAPVADREGRVDQQ
RIKPLSLPMTMGPEPYVFHGQFTPDQVGLWTFRVDGWGDPIHSWRHGLVA
KLDAGQGETELSNDLLVGAELFERAATGVPRARREPLLAAAAALRTAGDP
VTRTALALAPEIEEILAEYPLRDLLTRGEQYGVWVDRPLARFGSWYEMFP
RSTGGWDDDGNPVHGTFATAAAALPRIAAMGFDVVYLPPIHPIGKVHRKG
RNNSPTAGPTDVGSPWAIGSDEGGHDAVHPDLGTIEDFDAFVARARELGM
EVALDLALQCAPDHPWAREHRNWFTELPDGTIAYAENPPKKYQDIYPLNF
DNDPAGLYDEVLRVVRHWIDHGVKFFRVDNPHTKPPDFWAWLIAAVKGID
PDVLFLSEAFTPPVRQNGLTKLGFTQSYTYFTWRTAKWELTEFGNDIAAL
ADFRRPNLFVNTPDILHAILQHNGPGMFAIRAVLAATMGPAWGVYSGYEL
FEHRAVREGSEEYLNSEKYELRPRDFAGALAEGRSLEPFITQLNTIRRLH
PALQQLRTIHFHGVDNDALLAYSKFDPATGDCVLVVVTLNAFGPEEATLF
LDMAALGMEPYERFWVRDEITGQEFQWGQANYVRIDPAQAVAHVINMPLI
PDEARMTLLRRR
>MAP1937c hypothetical protein
MVSRYSAYRRGLGDDTVSPEVIDRILIGACAAIYLALLGVSVAACVALAD
LGRGFHKAASSPHTTWVLYAVIIVSALIIAGAIPILLRARRISQAEPTAR
AMTAPARPPVRLGAGVARPATERAPHAPATTPDVGWSGEAVDRIWLRGTV
ILTGTMGAALIAVATATYLMAVGHDGSSWVGYGFAGVITAAMPVVEWLHI
RQLRGAVAEQ
>MAP3625 bglS, BglS
MTDDERFSLLVAVMGAGDMWPVRDERIPADVPMSAGYVPGVPRLGVPPLL
MSDAGLGVTNPGYRPGDTATALPAGMALAATFDPALARAAGELIGREARS
RGFNVQLAGAMNLARDPRNGRNFEYLSEDPLLTAAMVAESVAGIQQQGVI
STVKHYSLNCNETNRHFLDAVIDPDAHRESDLLAFELAIERAHPGAVMTA
YNKVNGVYAAANSVLLNDVLKDAWAYRGWVMSDWGATPGWECALGGLDQE
CGAQIDALLWQAESFGAPLRDAYADGRLPKDRLSDMVRRILRSIFAVGVD
RSDREPAPDLNAHNDIALRIARQGIVLLTNRGLLPLRPDSPARIAVIGGY
AQLGVPAGFGSSTVVPAGGYAAVVPIGGTGLEAGLRNLYLLPSSPLQELR
RLLPENTIDFDPGISPAEAVAAAQRADIAIVFAVRAEGEGFDLADLSLPS
GQDELIGAVAAANPNTVVVLQTGNPVTMAWLGAVNAVVQAWYPGQAGATA
IAEVLTGRVNPGGRLPITFPVDLHQTPRPQLPGAGLPWGTPSTIDYVEGA
DVGYRWFAAKGHVPMFAFGHGLSYTRFEHRDLVVRGGDTITAGFTVVNTG
DRAGADVPQLYLTAMPGKPCLRLLGFERVELGPGESRHVTIDADPRLLAH
YDGSIRSWRIAAGSHTVALGISATTLRSTATVELTARAFGR
>MAP1942c cbhK, CbhK
MTIAVTGSIATDNLMRFPGRFSEHLLAEHLQKVSLSFLVDDLVIHRGGVA
GNIAFAIGVLGGDVALVGAAGDDFGEYRDWLQRHGVNCDHVLISQTAHTA
RFVCTTDQDMAQIASFYPGAMSEARNIKLADLVSSLGKPELVIIGANDPE
AMVVHTEECRKLGLPFAADPSQQLPRLSGEEIDKLVDGAAYLFTNDYEWE
LLLSKTGWSEADVQQKVGLRVTTLGAKGVDIVERDGTTTHVGVVPETSQT
DPTGVGDAFRAGFLTGRSAGLNLERSAQLGSLVAVLVLESTGTQEWSWDH
EVAKSRLAGAYGEDAAAEIAALLA
>MAP0280 celA, CelA
MTFSAAGAVARWIVPLFTVAALVGIGPVAEPAPAVRLAADGNPLAGAPFY
VNPNSAAMRAAQSADPPSPELTTIANTPQAYWIVPGGSAATVGKYVGDAN
AAGAIPVLAIYGIPHRDCGSFAAGGMATADDYRGWIDGIAAGVGTSRVAI
VVEPDALAMADCLSGDQRQERYELVRYAVDTLTRNPNAAVYVDAGHLRWH
SPEDMAARLNQAGVAHARGFSVNTANFYTTEDEIGYGEAISGLTNGAHYV
IDTSRNGAGPAPDSELNWCNPSGRALGTPPTAATAGAHADAYLWIKRPGE
SDGSCGKGDPPAGNFVNQYAIDLVHNVGH
>MAP2310c citE, CitE
MNPHAAGPAWLFCPADRPERFAKAAAAADVVILDLEDGVAEADKPAARKA
LLETPLDPERTVVRVNAADTDEYARDLEALAGTAYTTVMLSKTESAAQVQ
TLAPREVIALLETPRGAVFATEIAAAQNTVALMWGAEDLVATLGGSSSRR
ADGSYRDVAHHVRSTALLTASAFGRAALDAVHLDIRDLDGLRAEAEDAVA
LGFAGTVCIHPSQVPVVRQAYRPPEEKLDWARRVLAAARSERGVFAFEGQ
MVDSPVLKHAQMMLRRAGETAAG
>MAP1174c devB, DevB
MSTRIEIFPDSQALVTAAAARLADTIADAVAARGRALIVLTGGGNGIGLL
KSLAGRPIDWSAVHLFWGDERYVPEDDDERNEKQAREALLSHVDIPSSQV
HPMPASDGEFGSDLAAAALAYEQLLAANAAPGQPVPNFDVHLLGMGPEGH
INSLFPDTPAVLETTRMVVPVEDSPKPPPQRITLTLPAIARSREVWLMVS
GAGKADAVAAAINGAAPVSVPAAGAVGLDTTLWLLDREAAAKLPPGTAVQ
G
>MAP1868c efpA, EfpA_1
MTMSVAPTTRLWSRQFVAVIVAIGGMQLMVAMDGPVAVFALPKIQNEMGL
SDAARSWVITAYMLTFGGLMLLGGRLGDTIGRKRAFLVGVALFTFASGLC
GIAWAGGTLIAARLLHGAAAAIIAPTNLALIATTFPRGSARNAATAVFGA
MTGLGGVLGLVVGGALTDVSWRLAFLVNVPIGLAVIYLVLITRQETQTER
IKLDVTGAVLATVTGTAAVFGISMGPEAGWRSPITIGSGVVALAAFVAFV
VVERTAENPIVPFNLFLDRNRLAAFAAMFLAGGVFFTLTVLVGLYVQTMM
GYSPLRAGVAFIPFGLAMAIGVGVASKLVTWFPPRVVVIASGGLILGATL
YGSTFNRGMPYFPNLVVPLIVCAIGIGAVFVTLTLSVIASVDVDRIGPTS
AIAVMLQTLGGPLVLVVVQVAITSHALRLGGTLGPVKSMNAAQLHALDRG
YTYGLLWLAGVVALLGGVALLIGYTAAQVARAQEVKKAVDAGEL
>MAP2915c efpA, EfpA_2
MTALNDSERAVQNWTSARPDRPAPVRSTPPAETAPKPAAETAVKRTSKYY
PAWLPSRRFIAAVIAIGGMQLLATMDSTVAIVALPRIQNELSLSDAGRSW
VITAYVLTFGGLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEA
TMVIARLSQGVGSAIASPTGLALVATTFPKGPARNFATAVFAAMTAVGSV
MGLVVGGALTEVSWRLAFLVNVPIGLVMMYLARTALRETNRERMKLDATG
AVLATLACTAAVFAFSMGPEKGWISLTTISSGVVALGAALAFVIVERTAE
NPVVPFDLFRDRNRLVTFIAIFLAGGVMFTLTVCIGLYVQDILGYSALRA
GVGFIPFVIAMGIGLGVSSQLVARFSPRVLTIAGGYLLVLAMLYGWWCMH
RGVPYFPNLVLPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAVTLML
QSLGGPLVLAVIQAVITSRTLYLGGTTGPVKTMNDAQLQALDHGYTYGLL
WVAGAAVIVGAAALFIGYTPEQVAHAQEVKEAMDAGEL
>MAP0990 eno, Eno
MPIIEQVGAREILDSRGNPTVEVEIALTDGTFARAAVPSGASTGEHEAVE
LRDGGERYGGKGVQKAVQAVLDEIGPAVIGLNADDQRLVDQALVDLDGTP
DKSRLGGNAILGVSLAVAKAAADSAELPLFRYLGGPNAHILPVPMMNILN
GGAHADTAVDIQEFMVAPIGAPSFAEALRWGAEVYHSLKSVLKKEGLSTG
LGDEGGFAPDVAGTTAALDLIGRAIESAGFKLGTDVALALDAAATEFYSD
GTGYKFEGSTRTAEQMAEFYAGLLGAYPLVSIEDPLSEDDWDGWAALTAS
IGDRVQLVGDDVFVTNPERLEEGIEKGVANALLVKVNQIGTLTETLDAVA
LAHHSGYRTMMSHRSGETEDTTIADLAVAVGSGQIKTGAPARSERVAKYN
QLLRIEEALGDAARYAGDLAFPRFALETR
>MAP3315 entD, EntD
MACSPARRLGPAAPACRPAESLHMGLHNHRLVLLRHGETEWSKTGRHTGR
TEVELTEAGRTQAERAGRALSELKLVEPLVISSPRQRSLATAELAGLNVD
EVSEQLAEWDYGSYEGLTTEQIRQSVPDWLVWTHGCPGGESVAQVSGRAD
AAVTAALRQMESRDVVFVSHGHFCRAVMTRWVELPLAEGSRFAMITASIA
VCGFEHGVRQLRALGLT
>MAP1232 epiA, EpiA
MTTTPGPLDRATPVYIAGHRGLVGSALVRRFEAEGFTNLIVRSRDEIDLT
DRAATFDFVSETRPQVIIDAAARVGGIMANNTYPADFLSENLRIQTNLLD
AAVAVRVPRLLFLGSSCIYPKYAPQPIHESALLTGPLEPTNDAYAIAKIA
GILQVQAVRRQYGLAWISAMPTNLYGPGDNFSPSGSHLLPALIRRYEEAK
AGGAEEVTNWGTGTPRRELLHVDDLASACLFLLEHFDGPNHVNVGTGVDH
SISEIADMVATAVGYIGETRWDPTKPDGTPRKLLDVSALRELGWRPRIAL
KEGIDATVSWYRTNADAVRR
>MAP2737 fliH, FliH
MSGRLDEQRALVAQGCRMAAARGLVDGILGHVSLRIDDERLLVRCRSDTD
DGVAFTRPGDIRLVRFDGTAGAPGELDGYRVPNELPIHVETMLADPRHTA
VAHLHPPAVVAADLAGITLRPIYGAHDIPGARLARGGVPVYERAVLIRDS
RLAGDMVAAMRGRPVVVCRGHGITSAAATVPQAVLQAISVDALARMSLRV
RAAGGTLRDIADVDWDDLPDLGPAFTAEAAWRHEVARIEG
>MAP4193c fucA, FucA
MKFVDDPEQAVLDAAKDMLRRGLVEGTAGNISARRSDGNIVITPSSVDYR
DMQLDDLVLIDPAGSVLQAAQGRSPSTEMQLHLACFAAFDDIGCVIHSHP
VWATMFAIAHQSIPACIDEFAVYCGGDVRCADYAASGTPDVGANAVKALQ
GRGAALIANHGLVAVGPRPDKVLHITALVERTAQIVWGARALGGPVPIPA
DVNRNFAAVYGYLRANP
>MAP3993 galE1, GalE1
MDPSNGHNSGPDDTAGNTVHYPKIVLVTGACRFLGGYLTARLAQNPMIKG
VIAVDAIAPSKDMLRRMGRAEFVRADIRNPFIAKVIRNGDVDTVVHAAAA
SYAPRSGGTAALKEINVMGAMQLFAACQKAPSVRRVVLKSTSEVYGSSAH
DPVMFTEDSTSRRPFRAGFAKDSLDIEAYARGLGRRRPDIAVTILRLANM
IGPAMDTTLSRYLAGPLVPTILGRDARLQLLHEQDALGALERAAMAGKAG
TFNIGADGIIMLSQAIRRAGRIPLPVPGFGVWALDSLRRANRYNEISRDQ
FDYLSYGRVMDTSRMRSELGYQPKWTTAEAFDDYVRGRGLTPIIDPDRVR
SLESRAIALAQRWGSRNPIPWGGVR
>MAP4072 galK, GalK
MTVRYAAPGRINLIGEHTDYNLGFALPIALPQRTVASFTRADDDAITVVS
DRADAPVRIPIDTGPGDVTGWAAYPAGVMWALRTAGHPVPGGVMSITSDV
PMGSGLSSSAALECAVLGAISSAAGIRIDATEQARLAQRAENEYVGAPTG
LLDQLAALYGRPATAVLIDFADLAVTPVAFDPDAAGVALLLIDSRERHTH
AGGDYAARRLSCERAAADLSASSLREAADRGCTDLAAVRDAVDARRARHV
LTENRRVIDCVAALNNSDYPEAGRIFTASHASMRDDFGITTERIDLIADT
ALRAGALGARMTGGGFGGCVIALAPVQRAEAIGHAVRRAATQAGFAQPVV
TRTRAAAGAGPCR
>MAP1164 gap, Gap
MTVRVGINGFGRIGRNFYRALLAQQEQGTADIEVVAVNDITDNSTLAHLL
KFDSILGRLPYDVSLEGEDTIVVGPAKIKALEVREGPAALPWGDLGVDVV
VESTGLFTNAAKAKGHLDAGAKKVIISAPATDEDITVVLGVNDDKYDGSQ
NIISNASCTTNCLAPLTKVLDDEFGIVRGLMTTIHAYTQDQNLQDGPHKD
LRRARAAALNIVPTSTGAAKAIGLVMPNLKGKLDGYALRVPIPTGSVTDL
TAELKKPASVEDINAAFKAAAEGRLKGILKYYDAPIVSSDIVTDPHSSIF
DSGLTKVIDNQAKVVSWYDNEWGYSNRLVDLVALVGKSL
>MAP2434 glgB, GlgB
MSHADQLARTHLAPDPADLSRLVAGTHHDPHGILGAHEYGDHTVIRAFRP
HATEVAALVGGDRFAMQHIESGLFAVALPFTNLIDYRLQITYPDSEPYVV
ADAYRFLPTLGEVDLHLFGEGRHERLWEVLGAHPRSFTTADGVVTGVSFA
VWAPNAKGISLIGEFNGWTGTEAPMRVLGSSGVWELFWPDFPIGGLYKFK
VHGADGVVTERADPMAFATEVPPHTASRVTLSSYTWGDADWMTQRAARNP
VFEPMSTYEVHLGSWRPGLSYRQLARELTDYVVEHGFTHVELLPVAEHPF
AGSWGYQVTSYYAPTSRFGTPDEFRALVDALHQAGIGVIVDWVPAHFPKD
AWALGRFDGTPLYEHSDPKRGEQLDWGTYVFDFGRREVRNFLVANALYWL
QEFHIDGLRVDAVASMLYLDYSRPEGGWTPNIYGGRENLEAVQFLQEMNA
TAHKSAPGIVTIAEESTSWPGVTRPTNLGGLGFSMKWNMGWMHDTLDYIS
RDPIYRSYHHHEMTFSMLYAFSENYVLPLSHDEVVHGKGTLWGRMPGNNH
VKAAGLRSLLAYQWAHPGKQLLFMGQEFGQRAEWSEERGLDWWQLDEQGF
SNGILRLVRDINDIYRSHPALWSQDTVPDGYSWIDANDSANNVLSFLRYG
KDGSVMACVFNFAGAEHSGYRLGLPSAGRWREVLNTDATVYNGSGIGNMG
GVDATEDPWHGRPASAVLVLPPTSALWLEPK
>MAP2564c glgC, GlgC
MREAPHVLGIVLAGGEGKRLYPLTADRAKPAVPFGGAYRLIDFVLSNLVN
ARYLRICVLTQYKSHSLDRHISQNWRLSGLAGEYITPVPAQQRLGPRWYT
GSADAIYQSLNLIYDEDPDYIVVFGADHVYRMDPEQMVRLHIDSGAGATV
AGIRVPRSEATAFGCIDSDESGRIRKFVEKPLDPPGTPDDPETTFVSMGN
YIFTTKVLIDAIRADADDDHSDHDMGGDIIPRLVDDGMAAVYDFSDNEVP
GATDRDRGYWRDVGTLDAFYDAHMDLVSVHPVFNLYNKRWPIRGESENLA
PAKFVNGGSAQESVVGAGSIISAASVRNSVLSSNVVVDDGAIVEGSVIMP
GARVGRGAVVRHAILDKNVVVGPGEMVGVDLERDRERFAISAGGVVAVGK
GVWI
>MAP2432c glgP, GlgP
MKALRRFTVRAHLPERLAALDQLSTNLRWSWDKPTQDLFASIDPELWQRC
GSDPVALLGAVKPARLDELAVDETFLGRLDELTADLNDYLSRPLWYQQQQ
DQGAQMPAAIAYFSMEFGVAEVLPNYSGGLGILAGDHLKSASDLGVPLVA
VGLYYRSGYFRQSLTADGWQHENYPALDPQGLPLRLLSDATGDPALVELT
LPDSAQLHARIWVAQVGRVPLLLLDSDVPENEHDLRGVTDRLYGGDQEHR
MRQEILAGIGGVRAIRAYTAIHGLPEPEVFHMNEGHAGFLGAERIRELMT
GSGLDFDTALAVVRSSTVFTTHTPVPAGIDRFPIDMVRLYFDDHADDAAG
APVLLPGVPTARILALGAEDDPTKFNMAHMGLRLAQRANGVSSLHGRVSR
AMFNELWPGFDSNEVPIGSITNGVHARTWAAPQWLQLGYELAGSDSFSDP
GVWLRVQQVDAGHLWGIRCQLRSLLVEDVRQRLRRSWLERGATDAELGWI
ETAFDPDVLTIGFARRVPTYKRLTLMLRDPDRLQQLLLDEQRPIQLIVAG
KSHPADDGGKALIQQVVRFADRPEVRHRIAFLPDYDMSMARLLYWGCDVW
LNNPLRPLEACGTSGMKSALNGGLNLSIRDGWWDEWYDGENGWEIPSADG
VADEQRRDDLESGALYRLLEEAVAPKFYERDERGVPPRWIEMVRHTLQTL
GPKVLASRMVRDYVEQYYTPAAQSWRRTIGPAEGATGGAEFGAARELAAY
RRRAEEAWPNIVITDVDSTGLPDSPVLGSKLTLTATVALGGLAPDEVTVE
AVVGRVDGSDALLEPVTVEMSYAGTAEGGNQVFSTTTPLPLAGSVGYTVR
VLPRHPMLAAGNELGLVTLAR
>MAP3262c glgX, GlgX_2
MTSAKAAAPTPTGEVWPGKAYPLGASYDGAGTNFAVFSEVAERVELCLFD
GDGTESRVALPEVDGFVWHAYIPGIEPGQRYGYRVHGPYDPAAGQRCNPN
KLLLDPYSKAVDGTFEWNQSLFGYNFGDPDSRNDDDSAPSMPKSVVINPY
FDWGNDRPPDHHYADTVVYEAHVKGLTQTHPDIPEQMRGSYGAVAHPVII
EHLQSIGVTAIELMPVHHFANDSTLIEKGLSNYWGYNTIGFFAPDTKYSS
AATPGGQVQEFKAMVRSLHEAGIEVILDVVYNHTAEGNHLGPTLSMRGID
NAAYYRLVDDDKRYYMDYTGTGNSLNVGHPHALQLIMDSLRYWVTEMHVD
GFRFDLAATLAREFYDVDRLATFFELVQQDPTVSQVKLIAEPWDVGPGGY
QVGNFPPQWTEWNGKYRDTVRDFWRGEPATLDEFAYRLSGSADLYEHTGR
RPVASINFVTAHDGFTLRDLVSYNEKHNEANGEDNNDGESHNRSWNCGAE
GPTDDEQVNALRARQQRNFLTTLLLSQGVPMICHGDELGRTQNGNNNGYC
QDNELTWIDWANADVDLLAFTRVVSALRAEHAVFRRRRFFSGKPVGRRGQ
DGLRDIAWFTPDGTEMTDEDWGANFAKSVTVFLNGHGIPDRDARGQRLLD
DSFLLCFNAHHESIEFTLPPKEFGASWQVVVYTGPEETTPADEVPGAGVL
NVDAHTAVVLRAPDGG
>MAP1270c glgX, GlgX_1
MSSNEPASAAGGGTHQPEVPTVWPGSPYPLGASYDGAGTNFSLFSEIAEK
VELCLIDSRGAETRIPLDEVDGYVWHAYLPNINPGQRYGFRVYGPFEPSA
GHRCDPSKLLLDPYGKAFHGDFTYGQALFSYDLKAVAAGGDDADPGIPPM
VDSLGHTMTSVVSNPFFDWGSDRAPLTPYHETVIYEAHVKGMTQTHPSVP
EQLRGTYAGLAHPAIIDHLKSLNVTAIELMPVHQFMHDSRLLDLGLRNYW
GYNTFGFFAPHNQYAANRNSSVAEFKSMVRSFHEAGIEVILDVVYNHTAE
GNHLGPTINFRGIDNAAYYRLVDTDLRRYKDYTGTGNSLNPRHPHVLQLI
MDSLRYWVTEMHVDGFRFDLAATLARELHDVDRLSAFFDLVQQDPIVSQV
KLIAEPWDVGEGGYQVGNFPGLWTEWNGKYRDTVRDYWRGEPATLGEFAS
RLTGSSDLYEATGRRPSASINFVTAHDGFTLNDLVSYNEKHNMANGEDNR
DGESHNRSWNCGVEGPTDDPDITELRYRQMRNFWATLMVSQGTPMIAHGD
EFGRTQNGNNNVYCQDSELSWMDWSLVDKNSDLLAFARRATTLRTKHPVF
RRRRFFEGEPIRSGDEVRDIAWLTPGGREMTHEDWGQSFHKCVAVFLNGD
AITAPNARGERVVDDSFLLCFNAGEQPVQFVMPGGDYAKEWTVELDTNEP
TGRKEGAEPLVVHTEEELTLPSRSLLILRKTL
>MAP1269c glgY, GlgY
MGLPVLSSYRLQLRGESSGFAFTFADAEHLLDYLDALGVTHLYLSPIMTA
TAGSSHGYDVTDPTTVSAELGGRDGLARLSAAARARGMGLVVDIVPNHVG
IDAPRQNPWWWDVLRNGRSSPYATFFDVDWDLDEDGRIVLPVLGSDDDAA
DLTVDGELLRLGDLAFPIAPGTAGGTGPEVHDRQHYRLVGWRNGVCGYRR
FFSITSLAGLRQEDPAVFAATHAEVGRWFAEGLVDGVRIDHPDGLSDPCG
YLTRLRELVGPDAWIVIEKILAADEALEPTLPVAGTTGYDVLREIGGVLL
DPNGAPALTALVDSSGVDYQAMPKMLAELKIHAATVTLASELGRLRRSIA
AAADVDHPLLHEAIAALLTNIGVYRCDYPGLVALLPTALAETQSAAPHLG
PALQVLAAALARGGEPATRLQQLCGAVTAKAVEDCLFYRDARLVSLNEVG
GEPHRFGVGAAEFHHSAATRARLWPHTMTTLTTHDTKRGEDVRARIGVLS
QVPSLWTEFVARWEVAAPSPNPATGQFLWQNIFGVWPVTGEVTAALRERL
HGYAEKAIREAAWHTSWNDPDADFEDAVHRWLDTVLDGPVAGQLTELVAQ
LNPHASSDALAQKLLALTVPGIPDVYQGTELWDDSLVDPDNRRPVDYRAR
RAALQALQHPKIRVVTTVLRVRRAHPDTFLHGDYAPLLADGDACDHVLAF
TRGADIVVAVTRWTVRLAEKGWGNTVLPLPDGTWKDALTGSVVDGPTSAA
QLFAELPVVLLERHHD
>MAP1268c glgZ, GlgZ
MTEQKTEFRVWAPKPARVRLDVEGRPHAMTRTDDGWWHAAVACAPDARYG
FLLDDDPTVLPDPRSPRQPDGVHTRSQLWDPAAATWTDSAWPGRSTHGAV
IYELHLGTFTAAGTFDSAIEKLDYLVDLGVDFVELMPVNSFAGTHGWGYD
GVLWYSVHEPYGGPDGLVRFVDACHARGLGVLIDAVFNHLGPSGNYLPRF
GPYLSSASNPWGEGINIADADSDEVRRYIIGCALRWMRDFHADGLRLDAV
HALVDTTAIHILEELSTETDWLATQLGRPLSLIAESDLNDPRLITARERG
GYGLTAQWADDIHHAIHTAVSGERQGYYADFGSIATLAHTLRHGYFHAAT
YSSFRRRRHGRPLDTSAQTGIPATRLLAYTCTHDQVGNRALGDRPSQNLT
AGQLAVKAVLALLSPYTAMLFMGEEYGASTPFQFFSSHPEPELARATAEG
RKTEFAEHGWDASKKGEIPDPQDPQTFARSKLNWDEVGTGEHARLHRLYR
DLIALRHNDPDLADPWLEHLTVDYDEDQRWIVLARGRLRIVCNLGAEPVT
VPVGGELMLAWDEPTVDADTTVLQGHSFAILSTPGQPVDN
>MAP0936 gmhA, GmhA
MTGPDVVPPSVELVQRRLAETIAVKQQMQNGVVAAQAVEVARAMIDCLRA
GGKVILFGNGGSAQDAGHLAAELMGRFAFDRPGLAAMSLPDATAAITAIG
NDYSYDEVFARQVLAAGRAGDVVVGLTTSGNSANVVRALEAARQAGMTTV
TLTGAGGGKVADVAQICIRIPSDDTGRIQEASLHLGHSICEMVEAALFPR
PS
>MAP1557c gnd, Gnd
MSSSVTPSRPTTGTAQIGVTGLAVMGSNIARNFARHGYTVALHNRSIAKT
DALLKEHGDEGKFVRCETIAEFLDALEKPRRVLIMVKAGDPTDAVINELA
DAMEPGDIIIDGGNALYTDTIRREQAMRERGLHFVGAGISGGEEGALNGP
SIMPGGPAESYRSLGPLLEEISAHVDGVPCCTHIGPDGAGHFVKMVHNGI
EYSDMQLIGEAYQLLRDALGKTAEQIADVFDEWNSGDLDSFLVEITAQVL
RQTDAKTGKPLVDLILDEAEQKGTGRWTVKSALDLGVPVTGIAEAVFARA
LSGSVAQRRATTGLASGRFGEKPSDAAQFTEDIRQALYASKIIAYAQGFN
QIQAGSAEYGWDITPGDLATIWRGGCIIRAKFLNRIKDAFDENPDLPTLI
VAPYFRSAIEAAIDGWRRVVVTATRLGIPIPGFSSALSYYDALRTERLPA
ALTQGLRDFFGAHTYGRIDEDPDKRFHTLWSADRREVPA
>MAP2670c gnd2, Gnd2
MQLGMIGLGRMGANIVRRVVKGGHECVVYDHNPDAVKAMAGEEKTTGVSS
LQELAAKLSAPRVVWVMVPAGTITTGVIEELATTLEPGDIVIDGGNSYYR
DDIRHSKTLSEKGIRLLDCGTSGGVWGRERGYCLMIGGDEEAFAHAEPIF
ATVAPGVDAAPRTPGRDGEVTRSEQGYLHCGPSGAGHFVKMVHNGIEYGM
MASLAEGLNVLRNADIGKHAQEGDAETAPLSNPEFYQYDFDIPEVAEVWR
RGSVIGSWLLDLTAIALHESPELKEFSGRVSDSGEGRWTSIAAIDEGVPT
PVLTTALQSRFASRHLDDFANKTLSAMRKQFGGHAEKPAN
>MAP3981 gpm, Gpm
MGETATLVLLRHGESEWNSLNLFTGWVDVGLTDKGRAEAVRSGELLAEQG
LLPDALYTSLLRRAITTAHLALDAADRLWIPVRRSWRLNERHYGALQGLD
KAETKARYGEEQFMAWRRSYDTPPPPIERGSTYSQDADPRYADIGGGPLT
ECLADVVVRFLPYFTDVIVPDLRSGKTVLIVAHGNSLRALVKHLDQMSDD
DVVGLNIPTGIPLRYDLDARLRPLVPGGTYLDPEAAAAGAAAVASQGRG
>MAP3631c ilvD, IlvD
MPTTDSARAADIKQPDIKPRSRDVTDGLEKAAARGMLRAVGMGDEDFAKP
QIGVASSWNEITPCNLSLDRLAKAVKEGVFAAGGYPLEFGTISVSDGISM
GHEGMHFSLVSREVIADSVETVMQAERLDGSVLLAGCDKSLPGMLMAAAR
LDLASVFLYAGSILPGVAKLSDGSEREVTIIDAFEAVGACARGLMPREDV
DAIERAICPGEGACGGMYTANTMASAAEALGMSLPGSAAPPATDRRRDGF
ARRSGQAVVELLRRGITARDILTKEAFENAIAVVMAFGGSTNAVLHLLAI
AHEADVALSLDDFSRIGSKVPHLADVKPFGRHVMTDVDHIGGVPVMMKAL
LDAGLLNGDCLTVTGATVAQNLAAIAPPDPDGKVLRALSDPLHPTGGITI
LRGSLAPEGAVVKSAGFDSDVFEGTARVFDGERAALDALEDGTITKGDAV
VIRYEGPKGGPGMREMLAITGAIKGAGLGKDVLLLTDGRFSGGTTGLCVG
HIAPEAVDAGPIAFLRDGDRIRLDVANRVLDVLVDPAEFDSRRTGFTPPP
PRYKTGVLAKYVKLVGSAAIGAVCG
>MAP1298 impA, ImpA
MDLDALVARASAILDDASKPFLAGHRADSAVRKKGNDFATDVDLAIERQV
VAALVEATGIGVHGEEFGGSAVDSEWVWVLDPVDGTFNYAAGSPMAGILL
ALLHHGDPVAGLTWLPFLDQRYTAVTGGPLRKNEIPRPPLTSIDLADALV
GAGSFSADARGRFPGRYRMAVLENLSRVSSRLRMHGSTGLDLAYVADGIL
GAAVSFGGHVWDHAAGVALVRAAGGVVTDLAGRPWTPASDSALAAGPGAH
AEILDILRNIGRPEDY
>MAP1840 lppK, LppK
MHRALSVALSAATALAAVGLAACSTSHPRSETSAASSLPPATSTPIAPPA
TTPLPPPEALTDVLGRLADPNVPGANKVSLVEGATPESAATLDKFTNALR
DNGYLPMTFTANDIAWSDQNPSNVKASIAVHTNQPNNANFTFPMEFTPFQ
GGWQLSRHTAETLLALGKSPQAPTSAAPEGPGAPAPPAEPAPPPEPAPTQ
TPPG
>MAP3041 lppZ, LppZ
MPLSGPDMRDACGGGVGNVRSLTVMRGGRSGRSVRRGLAALCGLVLLTSG
CARFNDAQSQPFTTAPELKPQPSSTPPPPPPLPPTPFPKACPAPGVMQGC
LESTSGLIMGPDSKTALVAERTTGAVKEISVSAEPKVKTVIPVDPSGDGG
LMDIVLSPTYQQDRLMYAYVSTPTDNRVIRVADGDIPKDILTGIPKGATG
NTGALMFTSPTTLVVLTGDAGNPALAADPKSLAGKVLRIEQPTTVGQAPP
TTALSGVGSGGGLCTDPVDGSLYVADRTPTADRLQRITKTSDVSTVWTWP
DKPGVAGCAAMDGTVLVNLINTKMTVAVRLAPNTGAVTGEPDVVRKDTHA
HAWALRMSPDGNVWGATVNKTAGDAEKLDDVVFPLFPQGGGFPRNNDDKT
>MAP3481 lpqD, LpqD
MPKRSMIRKVSVALAVLTATVTVAACGGSPQARSITVTFVRHAQSEANAS
GTIDTEVPGPGLSPEGKGQAEQVAHQLGRKDYDSVYASTMTRAQQTAAPL
AAELGKQVEVLPGIQEINAGWYNGKSESMAKSTYLVAPANWLKGDVSDSI
PGSISGKEFNDQFTAAVNKIYNSGHRNPVVFSHANSIMVWTLMNTRNAKD
SLMNTHPLPNTGRVVINGNPITGWTLVDWDGIRDFRS
>MAP3688 lpqI, LpqI
MALSRPLAAHAAVLAAVSELMIGCAHRAAPPPAHSSTSNTPAAAPAPRVC
VDPPAAPATLNLRDKLAQLLMVGVKNADDARNVVNGYHVGGIFIGSWTDL
TIFRGPLGDIAAKAGPLPLAVSVDEEGGRVSRLKSLIGTSPSPRELAQTQ
TVQQVHDLAADRGKKMRDLGITVDFAPVVDVSDEPDDAVIGNRSFGADPA
KVTEYAGAYAQGLRDAGLLPVLKHFPGHGHGSGDSHTGGVVTPPLDDLQN
DDLVPYRTLVTTPPVGVMVGHLQVPGLTGDVPASLSPEAVQLLRNGVGYK
GPPFTGPVFSDDLSSMAAISDRYGVSEAVLRSLLAGVDVALWVTTDEVPA
VLDRLQKAVASGELPAQRVDESLVRVATMKGRGPACGH
>MAP2548c lpqY, LpqY
MVIGRGRVRRAGAVALATLTIAAASSACVAGPRGLVISFYTTATDGATFA
AIAQDCTRQFGGRFAIQQISLPRAPGEQRLQLARRLTGRDRTLDIMSLDV
VWTAEFAEAGWALPLSDDPAGRAEADATVDTLPGPLSTARWHDRLFAAPV
TTNTQLLWYRPDLVLQPPRTWDAVVTEAARLHAAGRPSWIAVQANEGEGL
VVWFNTLLASGGGRVLSEDGRRVTLTDTPAHRAATVNALRILKSVATAPG
ADPSITRTDEGTARLAVEQGRAALAVNWPYALASMLDNAVKGGVPFLPLN
RDPRLAGSINDVGIFVPTDEQYRIAYQASQKVFDFAPYPGAAPGLPAKVT
IGGANLAVASTTRHRAEAFEAIRCVRSLQHQKYVAIQGGLPPVRTSLYSD
PQFQTKYPMYTIIRRQLTDAAVRPATPVYQTVSIRLAATLSPITGIDPER
TADQLSVEVQKAVDGKGLLP
>MAP3367c manA, ManA
MELLRGALRTYAWGSRTAIAEFTERPVPAAHPEAELWFGAHPGDPAWLET
DNGEITLLDALRADPEGQLGPGSRARFGDVLPFLVKVLAADEPLSLQAHP
SAEQAAEGYLREERLGIPLNSPVRNYRDTSHKPELLVALYPFEALAGFRP
ASRTVELLRALAVSDLDPFIELLGDQSDADGLRALFTTWITAPQPDIDVL
VPAVLDGAIAYLSSGATEFAAEAKTVLELGERYPGDAGVLAALLLNRVSL
APGEAIFVSAGNLHTYLRGFAVEVMANSDNVLRGGLTPKHVDVPELLRVL
DFAPTTEDQLRPRVRREGFGRIYETPTDEFAVALLELEDEYVGHEVDATC
SHDGPQILLCIQGCTTVHGKSGSLRLKRGMAAWVAADDAPIRLVAHEPTK
LFRATVGL
>MAP0695 mhpB, MhpB
MIRKVALALCCMSHSPLLNLPGPSRDLLDDIDAAIARARGFVEDYDPQLV
VIFSPDHYNGFFYKVMPPFCIGSSASGVGDYGTHAGPLDVPEKLAVDCAQ
AVLDAGVDIAVSASMDVDHGTVQPLEKLFGTAISRPVIPVFVNAIGVPLG
PMHRCRALGAAVGTYLATLDKRVLVMGSGGLSHSPPVPTLATAPEPVLQR
IVHGAPMTAEQRQARQAAVIDAARSFAAGDSALQPLNPTWDHRFLEIIDR
GSLSELDRWSNSFVTHEGGGSAHEIRTWVAAFAALQAAGPYETTMRYYTP
APELIAGFAIRTARPK
>MAP4247 mrsA, MrsA
MGRLFGTDGVRGVANRELTAELALALGAAAARQLASGSAPGRRVAVIGRD
PRASGEMLEAAVIAGLTSQGVDALRVGVLPTPAVAYLTGAYDADFGVMIS
ASHNPMPDNGIKIFGPGGHKLDDGTEDRIEALVGDAGPRPVGAGIGRVID
AEDAADRYLRHLSKASTLRLDGLTVVVDCAHGAASAVAPRAYRAAGARVI
AINADPNGLNINDNCGSTHLDSLRAAVVAHRADLGLAHDGDADRCLAVDA
DGNLVDGDHIMVVLALAMREAGELASDTLVTTVMSNLGLHLAMRSAGITV
RTTGVGDRYVVEELRAGDFSLGGEQSGHIVMPALGSTGDGIVTGLRLMTR
MAQTGSPLSDLASAMQTLPQVLINVTVADKATAATAPSVQTAVGQAAAEL
GDTGRILLRPSGTEPMIRVMVEAPEKDIAQRLATRVAEAVSAAR
>MAP3023 mutT1, MutT1
MPTQNSSTARRAGTRVVYAAGAVLWRPGDAEPDGGDVEVAIIHRPRYDDW
SLPKGKVDPGETAPVAAVREVREETGQCAVLGRRLDTVRYPIEQGVKKVY
YWSARATGGEFVPGDEVDQLLWLPVAEAMQRLNYTQDRKVLRHFNKKPAD
THTVLVVRHGTAGRKSRFSGDDAKRPLDKRGRAQAEALVPQLLAFGASQV
HAADRVRCHQTVQPLAEELDVPVHNETTLTEEAYAKNPKRARHRMLEIAG
EPGTPVVCTQGKVIPDLIAWWCDRDGVRPDKSRNHKGSTWVLSLSAGRLV
AADHIGGALAANVRA
>MAP3450 nagA, NagA
MGLIAAGTMVASGRVRRPGWVETSGGHIAACGTGPPPRPADREFPDGTLV
PGFVDMHVHGGGGASYTEPDGIAKAAAFHLRHGTTTTMASLVTGSPAELL
GGVRALAEATRDRTVAGIHLEGPWLSRARCGAHDARRMRDPDPHEIDAVL
DAGGGAIRMVTLAPELPGADAAIRRFADAGVVVAVGHTNADYRQTRHAIA
LGATVGTHLFNAMPPLHHREPGPALALLRDPRVTVELIADGVHVHPDVVH
AVIDAAGPERVALVTDALAAAGRPDGAFRLGPVHIDVVAGGRAGARDDDD
RGQHRHHGPALSHRGRVRRRPRRRARGRGADDVHDAGARAGARSGGQPAG
RP
>MAP1626c nanT, NanT
MTKPSPGRKLTADQRNSFIAALLGWTMDAFDYFIVVLVYADIAKTFHHSK
AEVAFVTTATLIMRPVGALLFGLWADRVGRRLPLMVDVMFYSVVGFLCAF
APNFTVLVILRLLYGIGMGGEWGLGAALAMEKVPVERRGFFSGLLQEGYA
FGYLLASVASLVVMDWLELSWRWLFGLSIVPALISLIIRYRVEESEVWEA
AQDQLRLTSTRIRDVLRNGAIIRRFVYLVLLMTAFNWMSHGTQDVYPTFL
GAHANHGAGLSSTTVKWIVVVYNVGAIIGGLVFGTLSQRFSRRYTVVFCA
MLALPIVPLFAYSRTAAMLGLGSFLMQLFVQGAWGVIPAHLTEMSPDAIR
GLYPGVTYQLGNLLAAFNLPIQERLAETHGYPFALAATIVPVLLTVAVLT
LIGKDATGIRFATSESAFLPTEMT
>MAP1175c opcA, OpcA
MIIDMPDATTTAVNKKLDELRERVGAVAMGRALTLIIAPDSEEILEESLK
AANDASHEHPSRIIVTLRGNPYADKPRLDAQLRAGGDTGASEVVVLWLSG
ALSGHAASVVTPFLLPDIPVVVWWPDVAPAVPAQDPLGRLAIRRITDATN
GVDPLAAIKSRLPGYTAGDTDLAWARITYWRALLAAAVDLAPHEPIESAL
VSGLKTEPALDVLAGWLASRIDGPVRRAVGELKVELARSSETIVLSRPQE
GRTATLSRTSRPDALLPLARRETGECLAEDLRRLDADEIYQSALEGIEKV
QYV
>MAP0573c otsA, OtsA
MAPGGGRGSKTAGYGNSDFVVVANRLPVDQERLPDGSTAWKRSPGGLVTA
LEPLLRRQRGAWVGWPGIVDEDVDHEDDPIVQDDLELRPVKLSADDVAEY
YEGFSNATLWPLYHDVIVKPIYHREWWDRYVAVNRRFAEATSRAAARGAT
VWVQDYQLQLVPAMLRELRPDLTIGFFLHIPFPPVELFMQLPWRTEIVKG
LLGADLVGFHLTGGAQNFLFLSRRLIGANTSRGAVGMRSRYGEVELESRV
VRVGAFPISIDSTALDQTARHRDIRRRAREIRAELGNPRKVLLGVDRLDY
TKGIDVRLKAFSELLAEGRAKRDDTVLVQLATPSRERVDSYQQLRNDIER
QVGHINGEYGEVGHPVVHYLHRPVPRNELIAFFVAADVMLVTPLRDGMNL
VAKEYVACRSDLGGALVLSEFTGAAAELRQAYLVNPHDLEGVKDTVEAAL
NQSVEEGRRRMRSLRRQVLAHDVDRWARSFLDALAESGPRDG
>MAP0783c pdc, Pdc
MPVTDAATEPAYTVGDYLLDRLAELGVSEIFGVPGDYNLEFLDHIVAHPR
LRWVGNANELNAGYAADGYGRLRGMSALVTTFGVGELSAANAVAGSYAEQ
VPVVHIVGGPSKDAQGTRRALHHSLGDGDFEHFFRVSREITCAQANLMPA
TARREIDRVLSEVREQKRPGYILLSTDVARFPTEPPEAALPRYTGGTSPR
ALAMFTEAAAALIGEHRITVLADLLVHRLQAIKELEALLAADVVPHATLM
WGKSLLDESSPNFLGIYAGSASAPAVRTAIEEAPVLVTAGVVFTDMVSGF
FSQRIDPARTIDVGQYQSSVAGEVFAPLEMGEALQALTAILTRRGVSSPP
VASPPAEPLPPPPPREQPLTQKMVWDRVCTALTPGNVVLADQGTSFYGMA
DHRLPQGVTFIGQPLWGSIGYTLPAALGAAVAHPDRRTVLLIGDGAAQLT
VQELGTFAREGLSPVIVVVNNDGYTVERAIHGETAPYNDIVGWKWTEVPN
ALGVTEHLAFRVQTYGELDDALTAAARHQDRMVLVEVVLPRLEIPRLLVE
LVRPTSPDGSPRR
>MAP3044c pfkA, PfkA
MRIGVLTGGGDCPGLNAVIRAVVRTCDSRYGSSVVGFQDGWRGLLENRRI
QLHNDDRNDRLLAKGGTMLGTARVHPDKLRAGLNQIKQTLDDNGIDVLIP
IGGEGTLTAAHWLSEENVPVVGVPKTIDNDIDCTDVTFGHDTALTVATDA
IDRLHSTAESHQRVMLVEVMGRHAGWIALNAGLASGAHMTLIPEQPFDVE
EVCRLVKRRFQRGDSHFICVVAEGAKPIPGSISLREGGIDEFGHERFTGV
AAQLGAEVEKRINKEVRVTVLGHVQRGGTPTAYDRVLATRFGVNAADAAH
AGEYGQMVSLRGQDIGRVPLADAVRQLKLVPESRYDDAAAFFG
>MAP0891c pgi, Pgi
MTSVHTLPDITATPAWDALRKHHDRIGDTHLRQFFEEDPDRGRELTVTVG
DLYIDYSKHRVTRETLRLLVDLARTAKLEERRDQMFAGVHINTSEDRAVL
HTALRLPRDAELIVDGRNVVADVHEVLDAMGEFTDRLRSGEWTGATGKRI
STVVNIGIGGSDLGPVMVYQALRHYADAGISARFVSNVDPADLIATLADL
DPATTLFIVASKTFSTLETLTNATAARRWITDALGDAAVAHHFVAVSTNK
RLVDDFGINTDNMFGFWDWVGGRYSVDSAIGLSVMAVIGREAFADFLSGF
HIVDRHFQTAPLESNAPVLLGLIGLWYSNFMGAQSRAVLPYSNDLARFAA
YLQQLTMESNGKSTRADGSPVTTDTGEIFWGEPGTNGQHAFYQLLHQGTR
LVPADFIGFSQPIDDLPTAEGSGSMHDLLMSNFFAQTQVLAFGKTAEEIA
AEGTPADIVPHKVMPGNRPSTSILANRLTPSVLGQLIALYEHQVFTEGVI
WGIDSFDQWGVELGKTQAKALLPVITANNSPAPQSDSSTDALVRRYRSER
GRTS
>MAP1165 pgk, Pgk
MAVHNLKDLLAEGVSGRGVLVRSDLNVPLDSDGEQGRITDPGRITASVPT
LSALVEAGAKVVVAAHLGRPKNGPDPALSLAPVAAALGEQLGRHVQLASD
VVGTDALARAEGLTDGDVLLLENIRFDARETSKDDAERLALARQLAELVG
PTGAFVSDGFGVVHRKQASVYDVATLLPHYAGTLVAEEIAVLEQLTGSTK
RPYAVVLGGSKVSDKLGVIESLATKADSIVIGGGMCFTFLAAQGFSVGKS
LLETEMVDTCRRLLDTYVDVLRLPVDIVAADRFAADAAPQTVPADAIPDD
LMGLDIGPGSVKRFTALLSNAETIFWNGPMGVFEFPAFAAGTKGLAEAIA
AATGKGAFSVVGGGDSAAAVRALGIPESGFSHISTGGGASLEYLEGKALP
GIEVLGRPQPTGGAA
>MAP3146c pgmA, PgmA
MVANSRAGQPAQPEDLVDLPHLVTAYYSIQPDPGDVAQQVAFGTSGHRGS
ALAGAFNEAHILAITQAIVEYRAAHGISGPLFIGRDTHGLSEPAWVSALE
VLAANDVVVMIDSRDRYTPTPAVSHAILSYNRGRTEALADGIVVTPSHNP
PSDGGFKYNPPNGGPADSDVTNAIAKRANEILRDGAGVKRVPLARARRAA
QRHDYLGNYVDDLPNVVDIDAIRSAGVRIGADPLGGASVDYWGEIAERHN
LDLTVVNPLVDATWRFMTLDHDGKIRMDCSSPDAMAGLIRTVTAEPGRFQ
IATGNDADSDRHGIVTPDGGLLNPNHYLAVAIEYLYTQRPSWPGGIAVGK
TAVSSSMIDRVVAGLGRKLIEVPVGFKWFVDGLIGGTLGFGGEESAGASF
LRRDGSVWTTDKDGIILALLASEILAVTGSSPSQRYQRLTAQYGTPCYAR
VDAPADRDQKARLSKLSAEQVTATELAGEPITAKLTAAPGNDAPLGGLKV
TTANAWFAARPSGTEDVYKIYAESFHGPQHLAEVQETAREVVNKVIG
>MAP3369c pmmA, PmmA
MSRPAATVHRVVKAYDVRGLVGEELDQPLVADLGAAFAKVMRAEGAGQVV
IGHDMRDSSPTLAAAFAAGVTGQGLDVVRIGLASTDQLYFASGLLDCPGA
MFTASHNPAAYNGIKLCRAGARPVGADSGLRTIADDVIAGVEDYDGPPGS
VGDRDVLADYGRFLRSLVDTSGLRPLRVAVDAGNGMAGLTTPAVLGPIES
ITLAPLYFELDGSFPHHEANPLDPANLVDLQAYVRETGADIGLAFDGDAD
RCFVVDERGNPVSPSTVTSLVAARELGREIGATIIHNVITSRAVPELVSE
RGGTPLRSRVGHSYIKALMADTGAIFGGEHSAHYYFRDFWGADSGMLAAL
YVLAALGEQDRPLSELTADYQRYESSGEINFTVADAGQCTEAVLRSFGTR
IHSLDHVDGVTVDLGDGSWFNLRSSNTEPLLRLNVEGRTAEDVAAVVAQI
SAEIAAITAGTTTEADAGAAP
>MAP3430 pmmB, PmmB
MTPEEWIAHDPDPRTAAELAACGAAELAARFARPLRFGTAGLRGPVRAGP
DAMNVAVVSRASWALAQVLKRRGPAGARVIVGRDARHGSAVFATVAAEVL
AAQGFSVLLLPGPAPTPVVAFAVRHTGAAAGVQITASHNPPTDNGYKVYD
GGGIQIVSPTDHQIEAAMADAPPADEIRRTPVPPAHTDLVQRYIERAAGV
RRGTGSVRVALTALHGVGGTVAVDALHRAGFGRIHTVASQFAPDPDFPTV
VFPNPEEPGATDALLTLAADVDADVAIALDPDADRCAVGIPDASGWRMLS
GDETGWLLGDYLLSQPQRADRPVVASTVVSSRMLSAIAARHGAVHVETLT
GFKWLARADAEVPGGTLVYAYEEAIGHCVDPAAVRDKDGISAAVLVCDLV
AGLIARGGSVSGRLDELARRHGVHDVAAVSRRVGTDGAAELMRRLRTSPP
ETLAGFGVALTDITDALIFTGGDDDTSVRLVVRPSGTEPKVKCYLEIRCA
PSDDLASSRRRARALRELLVATVRSW
>MAP2664 ppdK, PpdK
MPESAATPRRGCTTARTAPGSGAHRNASRDRDAAPTEQSVVLLDGTSSHP
RELLGNKGHGIEVMRRHHLPVPPAFCLTTAVGLRYLADPAATMDAVWDDV
LDRIGWLQAQTSRTFGRGPHPLLVSVRSGATQSMPGMMDTLLNLGINDSV
EQALAAEHGPAFARDTHRRFREMYRRIVGVERPVPSDPYTQLRAGIEAVF
ASWNSPRARAYRAHYGFADRHGTAVVVQAMVFGNHGPNSGAGAFFSRNPI
TGDDEPFGEWLPGGQGDDVVSGSVDVEPIAALRDEQPAVYDQLMAAARTL
ERLDRDVQEIEFTVEDGALWLLQTRAAERSAQAAVRSALQLRREGLIDDA
ETLRRVTPAHVRSLLMPALQPEIRLAAPLLAKGLPACPGVVSGTAYADVD
EALHAAERGEPVILVRDHTRPEDVSGMLAARGIVTEVGGAASHAAVVSRE
LGRVAVVGCGAGVAAMLAGRRITVDGAEGEVREGELSLSAWSENDTPELR
ELAEIARLISPLRAHSGGDHPRLDDDSESAVRAALAAGQTDVVSPTPLLV
MLTALRQGAP
>MAP2819 ppgK, PpgK
MTSTDSTAHTPAAPAAGPPPRRGFGVDVGGSGIKGGIVDMDTGLLIGERV
KLLTPQPATPSAVAKTIAAVVDAFEWTGPLGVTYPGVVTHGVVQTAANVD
KAWIGTNARDIISAELNGQEVTVLNDADAAGLAEEHYGAGRNQSGLVVLL
TFGTGIGSAVIHNGTLIPNTELGHLEVGGKEAEQRAASSVKERHGWSYEK
WAKQVTRVLVSIENALWPDLFIAGGGISRKADKWLPLLENRTPVVAAALL
NTAGIVGAAMAATSDVTH
>MAP1310 pykA, PykA
MSRRGKIVCTLGPATNSDELILALVEAGMDVARLNFSHGDYADHKAAYER
VRVASDATGRAVGVLADLQGPKIRLGRFATGPTYWADGETVRITVADCEG
SHDRVSTTYKQLAQDAAVGDRVLVDDGKVGLVVDGIEGDDVICTVVEGGP
VSNNKGISLPGMNVSAPALSDKDIEDLTFALELGVDLVALSFVRSPSDVE
LVHEVMDRVGRRVPVIAKLEKPEAVDNLEAIVLAFDAIMVARGDLGVELP
LEEVPLVQKRAIQMARENAKPVIVATQMLDSMIENSRPTRAEASDVANAV
LDGADAVMLSGETSVGKYPLAAVRTMARIICAVEDNSTAAPPLTHVPRTK
RGVISYAARDIGERLDAKALVAFTQSGDTVKRLARLHTPLPLLAFTAWPE
VRSQLAMTWGTETFIVPMMTSTDGMIRQVDKSLLELGRYRRGDLVVIVAG
APPGTVGSTNLIHVHRIGEDDV
>MAP2251 rbsK, RbsK
MTVCRDADPRQKARAPGLVWPPGRRLTSAPMTDVCVVGSVNLDLSLAVDA
LPRPGETVLASSLRQAPGGKGGNQAVAAARAGARVQFVGAVGDDAAAGQL
RAHLQANGVGLDGAVEIPGPSGTAIVVVDANAENTIVVAPGANGRFTLND
ERARAVLAGCDVMLTQLEIPVATAVAAARHARSGGAVVVVNASPAGRDPD
SLSELAAAADVVITNEEEASEWPWRPRHLVVTLGSHGARYVGADGEYSVP
SPDVDAVDTTGAGDVFAGVLAANWPLEPGSPDQRRLALRRACAAGALATL
VPGAGDCAPRAEEIERALRGKS
>MAP0430 rmlB2, RmlB2
MYGGPVRALVTGAAGFIGSTLVDRLLADGHTVVGLDNFASGRASNLEHLV
GNPAHVFVEADIVTADLEAILDEHRPEVVFHLAAQIDVRHSVADPQFDAS
VNVIGTVRLAEAARRTGVRKMVHTSSGGSIYGTPPTYPTPETVPTDPASP
YAAGKVAGEIYLNTFRHLYGLDCSHIAPANVYGPRQDPHGEAGVVAIFAQ
ALQSGKPTKVFGDGTNTRDYVFVDDVVDAFVKASGDAGGGQRFNIGTGVE
TSDRQLHSAVAAAVGGPDDPEFHPPRLGDLKRSCLDIGLAARVLGWQPKV
GLQQGVARTVEYFRNQHN
>MAP1135 rpe, Rpe
MPCNTGQPRGPLIAPSILSADFSRLADEAAAVTGADWLHVDVMDNHFVPN
LTIGLPVVQSLLATTTIPMDCHLMIENPDRWAPPYAEAGAHNVTFHAEAT
DNPIGVARDIRAAGAKAGISVKPGTPLEPYLEILPQFDTLLIMSVEPGFG
GQSFIPEVLGKVRTARKLIDAGELTILVEIDGGINADTIEQAAEAGVDCF
VAGLAVYGADDPAAAVEALRRQALGASQHLRR
>MAP2285c rpi, Rpi
MRVYLGSDHAGFELKQQIIAHLEQSGHQPIDCGAFSYDADDDYPAFCIAA
ATRTVADPDSLGIVLGGSGNGEQIAANKVPGARCALAWSVETAQLAREHN
NAQLIGIGGRMHTVAEALAIVDAFVTTPWSKAPRHQRRIDILAEYERTHQ
APPVPGAVG
>MAP2547c sugA, SugA
MTATLGETRPTPTSSAAPSVLGRTSEQRLALVLVAPAAILMLAVTAYPIG
YAVWLSLQRNNLAAPHDTAFVGLSNYATILSDRYWWTALAVTLGITVVSV
SAEFVLGLALALVMHRTLIGKGLVRTAVLIPYGIVTAVASYSWYYAWTPG
TGYLANLLPHGSAPLTAQIPSLAIVVLAEVWKTTPFMSLLLLAGLALVPE
DLLKAAQVDGAGAWRRLTRVTLPIIKPAVVVALLFRTLDAFRIFDNIYVL
TNGANNTGSVSMLGYDNLFKGFNVGLGSAISVLIFGCVGLIALVFVKVFG
AAAPGGDVDGR
>MAP2546c sugB, SugB
MLWAVIDTLVVVYALLPVLWIFSLSLKPTSTVKDGKLIPSAISLENYRGI
FRGDFFSSALINSVGIGLITTAVAVLLGAMAAYAVARLDFPGKRLLIGAT
LLITMFPAISLVTPLFNIERFLGLFDTWPGLILPYITFALPLAIYTLSAF
FREIPWDLEKAAKIDGATPAQAFRKVIVPLAAPGVVTAAILVFIFAWNDL
LLALTLTATKAAITAPVAIVSFSGSSQFEEPTGSIAAGAVVITVPIILFV
LIFQRRIVAGLTSGAVKG
>MAP2545c sugC, SugC
MAEIVLDHVSKSYPDGATAVRDLNLTIADGEFLILVGPSGCGKTTTLNMI
AGLEDISSGELRIGGERVNEKAPKDRDIAMVFQSYALYPHMTVRQNIAFP
LTLAKMKKPEIAQKVAETAKILDLTDFLDRKPSQLSGGQRQRVAMGRAIV
RHPKAFLMDEPLSNLDAKLRVQMRGEIARLQKRLGTTTVYVTHDQTEAMT
LGDRVVVMHSGVAQQIGTPDELYENPANLFVAGFIGSPAMNFFPATLTPI
GLKLPFGEVMLTPEVQQVIAEHPEPDNVIVGARPEHLSDAALIDGYQRIR
ALTFEVKVDMVESLGADKYVYFSTAAWAAHSTQLDELAAEADAHENQFVA
RVPAESKAAIGQTVELALDTTKLMVFDADSGVNLTVAPSGSP
>MAP3449 sugI, SugI
MARGSRRGLLVGLTAASVGVIYGYDLSIIAGAQLFVTEDFGLSTRQQELL
TTMAVIGQIGGALFAGVLANAIGRQRSVLLILSGYAVFALLAAFSVGLPM
LLTARLLLGLTIGVTVVVVPVYVAESAPTAVRGALLTAYQLAIVSGLIVG
YLSGYLLADTHSWRWMLGLACVPAVLLLPLVFRMPDTARWYLLKGRVDDA
RRALLRVEPVARVDDELAEIDRAVSEEAASLPAMLAEMVRSPYRRATVFV
VVLGFLIQITGINAIIYYSPRIFEAMGFTGNFALLALPALVQVAGLVAVG
TALLLVDRVGRRPILLCGTAMMIVADVVLVAVFGRGPGGVIAGFAGVLLF
IFGYTMGFGSLGWVYASESFPSRLRSIGSSTMLTSNLVANAIVAAVFLTL
LHSLGGAGTFAVFAVLAVVAFAFVHRYAPETKGRQLEDIRHFWENGGRWD
>MAP2818c suhB, SuhB
MMPSDDELVRLRSVAETLAAEAAAFVRRRRAEVFGTEPGGASAPDGGAVR
SKSTPTDPVTVVDTETERLLRDRLAQLRPGEPILGEEGGGPAEPSPPGDG
TVTWVLDPIDGTVNFVYGIPAYAVSVGAQVDGESVAGAVADVVGDRVYSA
AAGLGAHVSDGAGRQPLQCAAVDDVSMALLGTGFGYARQRRAAQAALLAR
MLPVVRDVRRIGSAALDLCMVAAGRLDAYYEHGLKVWDRAAGALIAAEAG
ARVVLPAPDGDGAGLVLAAAPGIADELLAVLERFDGLDPILD
>MAP1177c tal, Tal
MTAQNPNLAALSAAGVSVWLDDLSRDRLRSGNLQELIDTKCVVGVTTNPS
IFQKAFAEGHAYDSQIAELAARGADVDATIRTVTTDDVRNACDVLTREWE
NSDGVDGRVSIEVDPRLAGDTDKTIAQAVELWKIVDRPNLFIKIPATQAG
LPAITAVLAEGISVNVTLIFSVERYRAVMDAYLAGMEKAREAGHDLSKIH
SVASFFVSRVDTEIDKRLEKIGGERALALRGQAGVANARLAYAAYQEVFE
GGQRYQALKADGARVQRPLWASTGVKNPDYSDTLYVTELVAPNTVNTMPE
KTIDAVADHGVIRGDTVTGTGPDAQRVFDELAAVGVDLPDVFVVLENEGV
EKFVDSWTELMEETQKQLGSASK
>MAP1178c tkt, Tkt
MTTLEEISALTQPHLPDDWSELDSAAVDTIRVLAADAVQKVGNGHPGTAM
SLAPLAYTLFQRVMRHDPSDTHWLGRDRFVLSAGHSSLTLYLQLYLGGFG
LELSDIESLRTWGSKTPGHPEFRHTKGVEITTGPLGQGLASAVGMAMASR
YERGLFDPDAAAGTSPFDHFIYVIASDGDIEEGVTSEASSLAAVQQLGNL
IVFYDHNQISIEDDTNIALCEDTAARYEAYGWHVQRVEGGENVVAIEEAI
AAAKAVTDRPSFIELRTIIGYPAPNAMNTGKAHGAALGEEEVAAVKKILG
FDPDKTFQVRDEVIAHTRKLVDRGREAHQKWQTDFDAWAQREPERKALLE
RLTAEKLPDGWDADLPHWEAGSDAIATRKASGAVLNAVAPKLPELWGGSA
DLAESNLTTINNADSFGPPSISTKEFTASWYGRVLHFGVREHAMGAILSG
IVLHGPTRAYGGTFLQFSDYMRPAVRLASLMDIDTIYVWTHDSIGLGEDG
PTHQPIEHLAALRAIPKLSVVRPADANETAYAWRTILARGNGSGPVGLVL
TRQGLPVLEGTDADGVARGGYILGSDGEEAGQEPDVILIATGSEVQLAVE
AQKLLADKDIVARVVSMPCVEWFESQPDDYRDSVLPPSVSARVAVEAGVA
QSWHKLVGDTGKIISIEHYGESADYKTLFREFGFTAEAVAAAAEEVVDN
>MAP1166 tpi, Tpi
MSRKPLIAGNWKMNLNHFEAIALVQKIAFALPDKYYDKVDVTVLPPFTDL
RSVQTLVDGDKLRLSYGAQDLSQHDSGAYTGDISGAFLAKLGCTFVVVGH
SERRTYHNEDDALVAAKAATALKHELTPIICIGEHLEVREAGNHVIHCEE
QLRGSLAGLSAEQIGKVVIAYEPVWAIGTGRVASASDAQEVCAAIRKELA
SLASAQIADSVRVLYGGSVNAKNVGELIAQDDIDGGLVGGASLDGEQFAT
LAAIAAGGPLP
>MAP2090 uspA, UspA
MASAERRVAPRSTALGYALLAPSLFGVLAFLLLPILVVIWLSLCRWDLLG
PLRFVGLSNWRSVLTDAGFGNSLMVTAVFVAMVVPAQTALGLLAATMLAR
RLPGTGLFRTVYVLPWICAPLAIAVLWRWILAPTDGAVSAVLGHSIEWLS
DPSFALPLVSAVVVWTNVGYVSLSFLAGLLAIPEDIHAAARTDGANAWQR
FWRITMPMLRPTTFFVLVTGIVSSAQVFDTVYALTGGGPAGRTDLVAHRI
YAEAFGSAAIGRASVMAVVLFVILIGVTLVQHLYFRRRISYDLT
>MAP2092 uspC, UspC
MTRPRFSTLVAGAVALVAALLAAAAVLLDYSGQPHGDKTIVTVRVWGDEL
AEAYRQSFAAFTRAHPDIEVHVNMVAYSTYFNTLRTDVAGGSADDIFWLS
NAYLAAYADSGRLLNILDTLGTNAAADWERPVVEQFTRHGQLWGVPQLTD
AGIALYYNADLLGAAGIDPAQLNGLRWDPAGGDTLRPLLARLTVDADGNR
GDTRGFDPGRVRQWGYNAANDPQGIYLNYIGSAGGVFQRGDEFAFDNPAA
VSAFRYLVDLINRDHVAPSAADTNDNGDFSRNQFLAGRMALFQSGTYNLA
PVARDARFRWGVAMMPAGPVGRVSVTNGIAAAGNAATKHPAAVRQVLAWM
GSRQGNEYLGRYGAAIPAVTSAQPVYFRYWASRGVDVTPFFAVLNGPRIA
APGGAGFAAGNDALRPYFDEMFSGRGDVATTLRRAQAAANAAARR
>MAP2091 uspE, UspE
MTSPDRARANIAIYAGLLVGALITLAPFTLGLLTAFTSAHQFVTGTPLQL
PRPPTLSNFADLAGAGFGRAAAVTALMTAVILVGQLTFSVLAGYAFARLR
FPGRDALFWVYIATLMVPGTVTIVPMYLMMAQLGLRNTFWALVLPFMFGS
PYAIFLLREHFRMIPNDLVNAARLDGAHTLDVIVHVVIPSSRPVLAALAL
ITVVSQWNNFMWPLVITSGHKWRVLTVATADLQTRFNAQWTLVMAATTIA
IVPLVTLFVIFQRHIVASIVVSGLK
>MAP4195 xylB, XylB
MSRKDVTIGIDVGSTAVKALAADADGRVMARVRIPHELRIPAPDRLEHDA
EAAWRRGPSAALDELLHQPGVAAVAVAAMVPSMTAVDGSGTPITPGLLYG
DGRGRTPGGPQNQPLPALGEAAEFLRWTAARAPGAAGYWPAPAVANHALA
GQAVIDVATAATAFPLFDGTGWDAAACAERGARVEQMPRVHSMGAVVGQL
PGGAALATGAIDALCEQLVAGADRDGDVLVMCGTTLIVWATISQARQMPG
LWTIPHTTPGKSQIGGASNAGGLFLGWVDRVVAQADPASAQPGRVPVFAP
YLRGERSPFHDPDRRGVLEALDLTHDAAALRRAAYEASGFVVRQLIELSG
APVSRIVATGGGTRVGPWLQAIADATGHPVEVSAVAEGAALGAAFLARMA
AGLETTLTDAARWVRTERVVEPDPAWAAPVQDRYQRFLELGNRPCPAVVA
R
>MAP2671c zwf, Zwf
MADDDSHPSDLLVIFGITGDLARKMTFRALYRLERREELEHPIIGVASDD
ITLDQLLDRAREAIKATGETFDDAVFDRLAGRLSYLSGDVTDAGLYSELA
KKIGGDSRPLYYLEMPPSLFAPIVENLAKADLLERARVAVEKPFGHDLES
ARDLNARLRAVLDEDQILRVDHFLGKQPVEELQYLRFANNALAKLWDRDS
ISEIHITMAEDFGIEDRGKFYDAVGAVRDVVQNHLLQVLALVAMEPPVGA
GADDLNDKKAEVFRAMPSLDPEHCVRGQYRGYTEVPGVAKDSTTETYVAL
RAEIDNWRWAGVPIFLRAGKALPHKVTEVRMFLHHVPGFSFLPNRRPPEP
NQIVLRIDPDPGMRLQLSAQVGDSWHDVHLDSSFAVDLGEPVRPYERLLY
AAFNGDRQLFAREDAIEETWRIVQPVLDKPSRIHQYEQGSWGPEAAQALV
HGRHAWQQPWLPQSTSTKR
>MAP1176c zwf2, Zwf2
MSPARTAQQWHNPLRDKRDKRLPRIAGPCGMVIFGVTGDLARKKVMPAIY
DLANRGLLPPSFSLVGFARRDWSTQDFGKVVYEAVKEHCRTPFRQENWDR
LAEGFRFVPGAFDDDEAFGRLAETLDKLDAERGTGGNHAFYLAIPPKSFP
VVCEQLHKSGLARPQGDRWSRVVIEKPFGHDLESAQSLNKAVNAVFPEES
VFRIDHYLGKETVRNILALRFANQLFDPIWNSHYVDHVQITMAEDIGLGG
RAGYYDGIGAARDVIQNHLMQLLALTAMEEPVSFSPLALQAEKIKVLSAT
HLAHPLDETTSRGQYTAGWQGGEKVVGLLDEEGFAKDSITETFAAITLEV
DTRRWAGVPFYLRTGKRLGRRVTEIALVFKRAPHLPFDATMTDELGANAM
VIRVQPDEGVTLRFGSKVPGTAMEVRDVNMDFSYGSAFAEESPEAYEQLI
LDVLLGEPSLFPVNAEVELAWHILDPVLDNWASGGRPEPYEAGTWGPDSA
FEMLHRTGREWRRP