TitleGenColors Logo

Gene list

Applied filters:

COG category: Signal transduction mechanisms
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 134

Free access
Sort by:

 



# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP0917 hypothetical protein
MRAPLRATPSLSLRWRVMLLAMSMVAMVVVLMAFAVYAVISAALYSDIDN
QLQSRAQLLIASGSLAADPGKAIEGTAYSDVNAMLVNPGHAIYTAQQPGQ
TLPVGSPEKAVIHGELFMSRRTAGDQRILAVHLQNGTSLLISKSLKPTEA
VMNKLRWVLLIVGGVGVAVAAVAGGMVTRAGLRPVARLTEAAERVARTDD
LRPIPVFGSDELARLTESFNLMLRALAESRERQARLVTDAGHELRTPLTS
LRTNVELLMASMEPGAPRLPEQEMVELRADVLAQIEELSTLVGDLVDLTR
DDAGQVVHEPVDMSDVIDRSLERVRRRRNDIHFDVDVTPWQMYGDAAGLS
RAVLNLLDNAAKWSPPGGHVGVTMRQLDPSHVELVVSDHGPGIPPQERRL
VFERFYRSTSARAMPGSGLGLAIVKKVVLNHGGMLRVEDTVPGGQPPGTS
FYVLLPGRSLPPAGHSTPAGESETDQAEAATDPAVPVAGDTANSRESANV
ISVDSQSARAR
>MAP3568c hypothetical protein
MIDEYRQFPTRNGAQRALHRVISLLGAGRAVLTHCFAGKDRTGFVVATVL
EAIGVDRDVIVADFLRSNDAAPALRAQISAMIAQRQDTELTPEVVTWTEA
RLSDGVLGVREEYLAAARQTIDEKFGSLQAYLRDAGVGEADVQRLRAALL
A
>MAP1222 hypothetical protein
MTPTPSLQRRVTLVVLALLTVLLVVLGVTIDVTMGVLARRNLHDRLLAAT
SRADALSAAHTAPDLLAAELNGGGIRALVVTADGQAYGDRAISPDLAEGP
VEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPPDTTATAVVHPLPDGARLI
LVADTTQTTQVTRQLRGLMIAAGIVTLLIAALLLIAVSRAALRPLDRLTA
LANAITTGDRGRRLRPHRADTELGRAASAFDTMLDALEMSERRAQQAAQA
AQRAETATRRFLVDAAHELRTPIAGMQVAAEQLAHGASEHPDDGQYRRAG
LLLSDARRAGRLVNDMLDLSRIDAGLPLELHDVDVAAVLRDEADRAALLA
PQIGVRRTGLTALNVRADATRLSQIVSNLLDNARRYTPPGGAIEIDLRAG
DGAAEITVTDTGPGIPDDERERIFDRLVRLDAGRARDHGGAGLGLAIARA
LARAHGGDLVCLPHRGGARFRLTLPRATTQ
>MAP1001c hypothetical protein
MTRDRDTASGGFPRWFPSSLRRQLLFGVLAVVSVVLVTVGFVSVLSLRGY
VNAMSDADVAESLDAFSHQYTKYRNGEHVSPHPGTPPIQQAILEFTGQTP
GNLIAVLRNGAVIGSAVFSEDEPRPAPPDVVRDLAAQSWKDSPPRTEILG
RLGPYRVNSTVNGSDVLVVGVSHNLADRIIARKQLTTVALTASALLLTAG
LTVWVVGYTLRPLRRLAAIAAHVAAMPLTDDDHRITVRVQPQDTDPQNEV
GIVGHALNRLLDNVDSALAHRVDSDLRMRQFITDASHELRTPLAAIQGYA
ELTRQDSSTLPPTTEYALARIESEARRMALLVDELLLLSRLGEGQDLQSE
DVDLTEVVSDAVNDAMVAAPTHHWVKDLPDEPVWVRGDQARLHQLVSNLL
GNARVHTPPGVTVTTAIRCHRGGREAPFAELTVTDDGPGIDADLLPNLFE
RFVRADKSRSDGSGHGLGLAIVSSIVKAHHGSVGVESTAGRTVFRVRLPL
IGGPGTLGVAGL
>MAP2512 hypothetical protein
MTPPVGPPRGIRAALGMLAIGVVAFSVSSVAHPDAGHGIFSATALYSALN
AVAAGLIALRACRIPADRWAWALIAAGMACSAVGDVVYAVWVPDGRSPSV
ADPEYLAYYPFVYAGLLLLMRARLKRLPIAVQLDSVVCALTLTAVAAALT
AGPLHQAAVHAPKTVWVGLAYPWCDLMLLALAAGMLPILGWRNEIRWALL
VAGLVLFAVADGAYLFQTAAGSYRVGSLLDVCWPASSVLIAMASWAPPPA
TATQARRRFSPYVTPVASTIVALGVIVLAHHSRSAATLAALSLVVGAGRF
SLTFRDVSLLHSHDRHAMTDELTALPNRRQLVTALQGLPASASPGAGSMP
SRANPRRALLLLSLSDFHEITESIGRQFGDELLCHIANRLAGSVRRDDLL
ARVGDDQFAVLLADGANLTAASAQAGRLLEALSEPIALDPITIQVDGRIA
IALCPDHCDHPRELLSRAETALAHAKSARSKIAVYDSAFEAHRDNDTNLI
EELRTALFDTDELKLHYQPKIDGRDGSIHSVEAVLRWQHPTRGTLLPEEF
LPVAERAGLMRKISNRTLSMALQQVRSWREEGLRLTVAVNLSTTNLLDIE
LVGTVERLLANYDLPADALILEITESALVDSVRSRNTVTALQRLGIRISI
DDYGTGWSSLARLQEVSVDELKLDRIFVARLAHDARSVAIVRSTVALADN
LGADLVAEGVENEDTLDALRRYGCNITQGFVHTPPLPPDELRAWIASHAP
DPSQSRG
>MAP0130 hypothetical protein
MSPLPSQTATPVRLTLAGSARPDTVTAWRRALHRWLQHEVRAPDDVRDDV
VLGVNEALANCVEHAYRSHRGTGAMKLQASYDPGAESIRVCVSDRGRWQS
PSCRLSSDPRASRGIMLMHNLAEHCTIHARPDGTSVYLDYALDPNPVGAQ
RV
>MAP2672 hypothetical protein
MLAGGREAVKTVWNTANLVRKEGFGAAVRSSIEELADWAEVERPDLARVT
PDGRVVILFTDIEESTALNERIGDRAWVKLISSHDKLVSDLVRRQSGHVV
KSQGDGFMVAFARPEQAVRCGIELQRALRRNANRKRHEEIRVRIGIHMGR
SVRRGDDLFGRNVAMAARVAAQAAGGEILVSQPVRDALSRSDGIRFDDGR
EVELKGFSGTYRLFAVLASPDPD
>MAP3200 hypothetical protein
MSDSTPQSSVALRILVYSSNAQTRERVMRALGRHLHPDLPELSYVEVATG
PMVVKTMDQGGIDLAILDGEATPTGGMGIAKQLKDELDTCPPLVVLTGRP
DDAWLASWSRAEAAVPHPIDPILLGRTVLNLLRTPVR
>MAP1347c hypothetical protein
MSGTLDYLVTAAAAELMAATAADSAAISQRVLGNLVRDLGVDFSFLRHND
HTIRATILVAEWPPRNADPDPLGVVYFADADSVFAQAEHLKAPQVVRPEA
ANADYQRNIDEGTGWGTVSLAAVPLLSGDVTTGTLAFGKAGDREWLPEEL
NALQAIAALFAQLQARVVAEKQIHYLAEHDELTGLLNRRALIAYLTSASP
RGNRARSRSCSWPSTGSRSSTVASGNTPATGSSRPSPRSSARPSTFRRSS
LASAVTSSLSCPRRQWTSKRPRRSPGRCTACCTSRSRSTTKCLVAQ
>MAP2250c hypothetical protein
MTTSELTDAESDIAPAPAPQPAKRTRRTPLRSRFRFGIQSKILITMLLSS
ILGVAVIGLIGAVSGRNALRQVESERLIELREGQKRAVQALFREVTNSLI
VWSGGFTINEATAALTAGFARLANATITPAQQQQLVNYYDNQMLKPIKHA
TGDSIDINAVLPNSNAQKYVQAYYTAAPRPTPDSLPVQDAGDGSAWSAAN
VRFDFYVRDIATRFDYRDALLLDTQGNVVYSVMKGPDLGTNILTGPYRES
KLREAYQKALASNDVDFVWITDFQPYQPQLDAPTAWVVSPVGMNGRIEGV
MALPVPAAKINKIMTAGGHWEAAGMGPATETYLAGPDDLMRSDSRVFVED
PQQYRRDAIDAGTPPDVVDRAIHLGGTTLVQPVATAGLRAAQRGETGVVT
ATDYMGNRELEAYAPLNIPNSDLHWSILATRDNSDAFARLGRFSKNLVIA
VTAMIFVICVASMFVAQAAVRPVRRLEEGTRKISSGDYDINIPVRARDEI
GDLTAAFNEMSRNLAIKEELLNEQRRENDRLLLALMPESVLQRYREGEET
IAQKHQDVAIIYADIVGLDELFTEMPEAQLVGTVDELFRQFDSAAETLGV
ERIRTFHNGYLASCGVVTPRLDSIHRAVDFALEIGRIIDRFNSQTGHQLG
LRVGVNTGNVVSGLVGRSGLVYDMWGGAVSLAYQMHSGSPQPGNYVSSQV
YEAMRDVRQFAPAGTISVGGTDQAIYRLLER
>MAP3270c hypothetical protein
MTGNRFDELAAARDQTEKLLRVIAEIGAGLDLDATLHRIISAARELTSAP
YGALAVRDRHANLISFVHEGMDAETVRRIGHHPVGKGLLSLSLLDTPALR
MDDLTAHPAAAGFPEHHPAMRAFLAVPITIRGAVFGNLYLTHVDEARVFS
DADEMAARALAFAAAVAIDNAQVFERERMSVKWIEASREITTALLSRTEP
HRRPMQLIAERACALTDAEQAIVLVPADPEQPGNPEVDTLVVLAAVGLPA
ADVLGQRVPVRGSTSGAVFKSGEPLITDEFRYPIQAFTDAGRRPAIVMPL
RARDEVVGVIAIARGTEQPPFDESYLDLVRDFATHAAIALVLAAAREDAR
QLAVLAERERIAHDLHDHVIQRLFAAGMDLQGTLARARSPEVADRLNRTL
DDLQTIIEEIRTTIFALRSPAAVAGDFRHRIQRVIAELTENRDLVTTVRM
DGPMTAVGAELAEHAEAVTAEAVSNAVRHSGASRLTVQIGVADMFTLDVI
DNGRGIASGNTRRSGLANMTRRAEQLGGSCEISSPPGGGTRVHWTAPLLD
H
>MAP3068 hypothetical protein
MAVLHPARSADRFDPADCRYMADFARAHLFVYARSLGTVLRVDGEVDASN
ARDLTATIRRFGRLKTPLVLDLSRLRFISVESFRALLLLNDELRKSRVHC
CVVPGAAIRALLRLVHDNGLALADSVPEAFQNIEDIVRARREFLSDLARP
RPARTHAGSSAAS
>MAP0082 hypothetical protein
MDDILAQATIFQGIDPEAVAALSRHLQHVSFPRRRTVFVEGEPGDDLYVI
VSGAVKIRHQTADGRETVFAVLGPGDVFGELALFDPGPRTSTVITLTEVE
ALRMDRNALRTWIIERPEIAEQLLRVLARRLRHTNNTLCDLIFTDVPARV
AKQILDLAMRFGTDSGGPVRVEHHLTQKELAQLVGSSRETVNKALADFAQ
RGWIRQQGKALIIDQPAKLARRARA
>MAP2440 hypothetical protein
MSATQSTAQRIGRVLEKITRQSGRLPETPAYGSLLLGRVTESQHRRRIRI
QIIMTVMVLGANLIGIGAALILVIVAIPAPSVFNDAPAWITFGVGPAYVA
SALAVGTYWITRRTVLALRWAIEERKPTPTDERNTFLAPWRIALVDLVLW
GAGTLLLTVLYGLVDTMFIPRFLFGVSFCGVLVATACYLLAEFALRPVAA
QALEAGPPPRRLTAGIMGRTMTVWFLGSGVPVIGIALLALFEIWLRNLTE
TEFAVGVLIVSTAALIFGCLLMWILAWLTATPVRVVRAALKRVERGDLRG
DLVVFDGTELGELQRGFNAMVDGLRERERVRDLFGRHVGREVALAAERER
PKLGGEERHVAVVFIDIVGSTQLVTRRPPAEVVSVLNQFFGIVVEEVDRH
CGLVNKFEGDATLAIFGAPNHLDCPEDAALTAARAIADRLANEMPQCRAG
IGVAAGQVVAGNVGARERFEYTVIGEPVNEAARLCELAKSHSSRLLATGD
AIEGASEKERARWSLGDTVTLRGHERPTRLASPAGAADRPAGTD
>MAP3569c hypothetical protein
MPEALRELSGAWNFRDVADGAPMLRPGRLFRSGELSGLDDEGRATLRRLG
ITDVADLRAAREVARRGPGRVPDGVEVHLLPFPDLGEHEAGTDDQAPHEH
AFQRLLTGDGGRAVGGVRRRGRHPLHDRRIPAIPNA
>MAP2695c hypothetical protein
MPSTRAAAEERKSARRWLFRYGLDMSLSYVLAVGEAAAILIPLRGHTSVG
VNADFARQNTGPVLALIALGIIGVAVAGALSLAPTLRWYVAGEEPTPQQR
DAVMKLAGRQSAILVTAWAVSGGIFLLLNVAGGARLLLPIALGALLGGSA
AAGTGMLLAQRTLRPIMRAATLGAEPRLAVPSVYARLVLLWFLCSAFPIA
VIAALVVLRSYGWLVEKSASLDVPILVVSLAALVLGLPTMILTSRSISDP
IGEVVDAMAEIEHGRMETYVGAYERSQIGRLQTGFNRMVAGLAERDRLRD
LFGRHVGADVAQRAIEEGASLSGDVVEAAVLFIDLVGSTQLAESRPPQEV
AEVLNDFFRIVVNAVDEHHGLVNKFAGDAALAVFGVPLPTNQPASAALAT
ARTLGTQLRQLPVDFGIGVSAGRVFAGNVGAENRYEYTVIGDAVNEAARL
ADLAKTADRRILCSAAAIEAAGEAERGHWAECYSTVLRGRSETTHVSAPT
G
>MAP0153 hypothetical protein
MRVLSLRTIVIVAAISVMTLVVLLGTWVWVGVTNDQYNQLDRRLDSVSSL
GDISSLLTNPQHNSPDRATPDGNLVRTARIGGVTVSVPSNIVLPQLPDGY
ANTTINGVQYRVRTFTAGPASIALAAPLAEAQHRINELHLRVLLICASVI
GGTVVVGWVISLIMVNPFLLLAQQARAINAQSSPDEVQVRGVREAVEIAE
AVEGMLARIGKEQQRTKAALESARDFAAVASHELRTPLTAMRTNLEVLST
LDLPHEQRQEVIGDVIRTQSRIEATLTALERLAQGELTTVDDFVPFDITE
LLDRAAHDALRVYPDVEVSLVPSPTVLMIGLPTGLRLVIDNAIANAVKHG
NAGKIQLTVSSSGEGVEIAIDDDGSGIPESERATVFERFARGSTAARSGS
GLGLALVAQQAELHGGTAELQNSPLGGTRLLLRLAGDGRGPA
>MAP3540c hypothetical protein
MDAVDPDSRHQLAVRMAELVRGMAAPRRLDQVLAEVTAAAVEVIPGADIA
GVLLVRKGGEFETLADTDSLAARLDVLQHDFGEGPCAQAALQETIVRSDD
LRREPRWPRYAPAAVQLGVLSSLSFKLYTADRTAGALNLFSHRPDAWDTE
AETIGSVFAAHAAAAILAGSRAEQLYSAVSTRDRIGQAKGIIMERFGVDD
VRAFDLLRRLSQESQVKLVEIAQQIIDTRGQGA
>MAP3274 hypothetical protein
MLTTMANHGPDPTTPLWRAAQVFRLLSCVYATGFQIAINPDLLRPVLGWV
LFAGLIGWRAASALAYLRGFGRKPAWVLAELVVVVLLMWSNNLVASPHWA
ADNQTWPTTLWASNPTISAAIQFGPVGGMLTGLAVMTANFAVKNYFALNL
GHNATVIIELAIGMAIGMAAQTARRAHDELQRAARLTAAAQERDRLARQV
HDGAIQVLALVAKKGHEIGGATTELADLASEQERALRRWLACTDIDRDAD
GDTVDLRTLLRRRDSDRVSISLPGTPVRLGRWAATELDAAVGNALDNVVA
HAGPGAHAFVLVEDLGDSVLVSVRDDGVGIAAGRVEEAARQGRLGISQSI
VGRLASLGGTAELHSEPGAGTEWELCVPRREGRDDG
>MAP0023c hypothetical protein
MSSQRGLVARIERKLEATVDNAFARVFGGPIVPQEVEALLCREARDGVQK
LHGNRLLAPNEYIITLGMHDFEKVSADPDLTSSAFARYLADYIHEQGWQT
YGEVVVRFEQSSSLRTGQYRACGAVNPDVQPRPTVDDPVRPQSNNAFGEE
RGVAPMTDNSSYRGAQGPGRPGDEYYDERYGRPQDDARGGSEPQGAPDQR
GGYPPEQAGYPPQQGYPPPRHPEQGGYPEQGGYPPPQSYQEHGGYPDQRG
GYPEPGQGGYPPQQHGYPDQRGYPEPAQGGYPQSYEQRPPAPPGYSGQGY
DQGYRPPGGYPPPGGQPAGGPQGYGGYGDYGRGPARPDEGGYAPPEQRPA
YPEQGGGYDQGYPQGGGYGRQDYGSAEYTQYAEHAQGATYAPPGGGYPEA
PGSEYEYGQPADYGQQPEYAQPAEYGQPADYGQQAGGYGGYGQGGYGSTG
TTVTLQLDDGSGRTYQLREGSNIIGRGQDAQFRLPDTGVSRRHLEIRWDG
QVALLSDLNSTNGTTVNNAPVQEWQLADGDVIRLGHSEIIVRIH
>MAP1754c hypothetical protein
MSAPVQQLGIAVAVDGSPASNAAAYWAAREAAMRHCPLTIVHAVSTPTTM
YPPVPYPEALATNLEDEGKQAILHATKIAEEAMPADRQVPIGRKLLYSAP
VPALLELSGAVRMLVLGSAGHGLLARGLLGSVSSAVVRHANCPVAVVRDE
ELPDPHSAPVLLGTDGSPASELATEIAFDEASRRGVDLVAIHAWSDTAVT
EVFEIDWPIVEAEAERHLAESLAGWRERYPDVTVHRLVARDRAAQHIISS
SETAQLVVVGSHGRGGLARLLLGSVSNAVLHSVRVPVIVARPPA
>MAP1319 hypothetical protein
MSVMTGPTTDTDAAVPRRVLIAEDEALIRLDLAEMLREEGYEIVGEAGDG
QEAVELAERHRPDLVIMDVKMPRRDGIDATSEIASKRIAPIVVLTAFSQR
DLVERARDAGAMAYLVKPFTISDLIPAIELAVSRFSEIAELEREVATLSE
RLETRKLVERAKGLLQTEQGMTEPEAFKWIQRAAMDRRTTMKRVAEVVLE
TLGTPKE
>MAP3390c hypothetical protein
MTAPRPLRLTQLTVRGWLALVLWIMGTTVLVGAVLLHRTDQVSRQVADGV
GPARVAAARLQAALRDQETGLRGYLIAADRQFLAPYYDGQRTEQAAADEI
RRLVGGNAGNTELIADLDAVESAAADWRTNYAEPVIASVTPRAPAVVSAP
TADLGKAKFDHLRALFETQNMHLAQAAASATAELHEINGWRDWVLGAMVL
AILGTGLFLGLFSRAAITRPIAQLAASCRRITEGNFVETIVAPKRPKDIR
DMAIDVENMRKRMVEELDMSRSAQEQLDEQAAELRRSNTELEQFAYVASH
DLQEPLRKVASFCQLLERRYGDQLDERGLEYIGFAVDGAKRMQALINDLL
TFSRVGRLGTTEAEVQLDATLDAGLANLAAAVEETGAEIVRPAELPRIVG
DPMLLTMLWQNLIGNAIKFRHKDRRPRVVIECAPGEGTHDGEWLLSVSDN
GIGIPEEFSDKVFVIFQRLHGREVYAGTGVGLALVKKIVEHHGGTVRIDT
SYTDGTRFEFSLPMPGPVDETDAVALEGAQQ
>MAP3225 hypothetical protein
MLGPLDEYPVHQIPQPIAWPGSSDRNFYDRSYFNAHDRTGNIFVISGIGY
YPNLGVKDAFFLVRRGDTQTAVHLCDAIDQDRLNQHVNGYRIEVKEPLRK
LRLVLDETEGVAADLTWEGLFDVVQEQPHILRSGNRVTLDAQRFAQLGSW
SGRIVIDGQEIAVEPATWIGSRDRSWGIRPIGEPEPAGRPADPPFEGMWW
LYVPMAFDDFAIVLIIQEQPDGFRSLNDCTRVWRDGRVEQLGWPRVKIHY
RSGTRIPTGATIEASAPDGTPVHFDVESKLPVPIHVGGGYGGDSDWLHGM
WKGEKFTERLTYDMTDPAIIARSGFGVIDHVGRAICRDGDKAPVEGWGLY
EHGALGRHDPSGFSDWLTVAP
>MAP1995 hypothetical protein
MNDNPFAGPIAKHPRSPLETLDTVPESVLRRLKQYSGRLATEAVSVMQDQ
LPFFADLEASQRASVSLVVQTAVVNFAEWMQDPKSNVRHTAQAFELVPQD
LARRIALRHSVEMVRVTMEFFEEVVPLLARSEEQLTALTVGVLKYSRDLA
FTAATAYADAAEARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTT
APATVVVGTPAPGRDGSTGPGDSERASQHVRDIAAQHGRAALTDVHGTWL
VAIVSGQLSPTDKFLGDLLEAFSDGPVVVGPTAPMLTAAYHSASEAISGM
NAVGGWRGAPRPVLARELLPERALMGDASAIVALHTDVMGPLADAGPTLI
ETLDAFLDSGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPMQPRDAYV
LRVAATVGQLNYPTHPPSAAGAAMPAVPLPVNGAARGQSGG
>MAP0398c hypothetical protein
MDEILARAGIFQGVEPSAVTALTKQLQPVDFPRGHTVFAEGEPGDRLYII
VSGKVKIGRRSPDGRENLLTIMGPSDMFGELSIFDPGPRTSSATTITEVR
AVSMDRDALRAWIADRPEIAEQLLRVLARRLRRTNNNLADLIFTDVPGRV
AKQLLQLAQRFGTQEGGAMRVTHDLTQEEIAQLVGASRETVNKALADFAH
RGWIRLEGKSVLISDSERLARRAR
>MAP3389c hypothetical protein
MTTAGRAIDILLVEDDPGDELITREAFEHNKLNNRLHVAHDGEEGLNYLY
RRGEFADAPRPDLILLDLNLPKYDGRQLLEKIKSDPDLAQIPVVVLTTSS
AEEDILKSYKLHANAYVTKPVDLDQFMKAVRQIDEFFVQVVRLPSA
>MAP4074 hypothetical protein
MGWFRFYFEGERWVWSDQVQRMHGYQPGTVTPTTELVLSHKHPADRPQVI
DGINDMIRRRQAFSTRHRIVDTAGIIHHVVVVGDQLFDDSGELVGTHGFY
IEVTPAATRNREDSISAKVSEIAGRRGVIDRTKGMLMLVYGIDEDAAFNM
LKSLSQHGNIKLSVLAQRIAEDFTALGKEVITARSRFDQRLRTAHLRPPG
AGEAGSG
>MAP2079 hypothetical protein
MSTLLPVELGVLGPLQVRRHGTPVAIPGAKPRAVLTMLGLHGGALVSADA
LMELLWGEEPPRTAAKALQTHISSLRRILGDGVVLTQGAGWILADADVDA
ARYKVAARRGRDAAAAGDNGRAVACFDEALALWRGIPELPDGRRGTSEKT
RWIEGHAALVEDRADALLATGRAAEIIGELEAAVAEAPLRERRWGQLMLA
LYRAGRQGEALGAYQGARAQLADELGVDPGPELRRLESAIVAQDTSLDVL
VVQNLSSVTRAVTFLLTDIEGSTAAWEADAAAMAVALARHDELVEQVVTS
RGGRLIKTRGEGDATFSVFDRPSAGAAAAVELQDAIGHEPWALREPMRIR
VALHTGEAELRDGDYFGRAVNRAARLRSLAAGGQILCSGATAELVIDTLP
DDVVLTDLGMRQLRNLPRPEHVFELRLETGAQAAPPVQPSAAPMERPGLP
AVLLGPGPFVGRGRELDGLLSAWQSTLAGDMQAVLIAGEPGVGKTRLAGE
WSRRVYEQGAVVLYGRCDEDLGAPYQPFAEALRMLVPCLGPDRLRGLRGV
EALLPLVPGLTDVLPDLPTPTRADPDTERYALFDAVVALLAAVSAGAPVV
LVLDDLHWAAKPTLLLLRHLLRFGEHARVQVVGTYRSTDLDRSHPLAAML
ADLHRSAGFEPSNRLQLNGLDEQDVAAYVAEAGYDDDELARALASVTGGN
PFFLIQALHHVDESGGRWDQSTLPQGVREAVSRRLSRLSPETNKALATAA
VIGSRFALELVESVVGEDQVDAFDEACQAGIVIEEPGGRYRFNHAIVRQS
LLAELSSVRRMRLHQRIATTLENQPGAEDELLAELAHHYFECAWAGNAAK
AVQYCRRAADQAMTRLGYEGAADLYDRALHALEEIDDELPDRDDQVTELL
IARCEALLAAGDVGSAVGAVSQLQSATVDSARLSAWATCFDGQLSILSHP
ERLDEVEAAVGAAAARLAELDDAAGEATAHTVRAGCLARLGRIGDCEIAL
DNALTAARRAREHRRVNAVLANAPLAALWGPNPVPRAGGRCLDVVRLLRI
TTDSPAVEATSTRCQAVLEAFRGRAAAARRMIDSARRTVTELGLRHALLE
VEQFAGIVELVADDAAAAEPHLRKAYNGFRRMGLDADTAETAALLGRACL
VLDREAEADELCTESERLAGHALKASIAWRALRALLLSRRGDYDEARRMA
EEAVSVAGHTDLLVDHGGACLTLATVLNAAGETAGARAAAGRAVDLYERK
GAAALAERARRLLGEHTVPSAPTPPRSPSVELDTAATRVGERVMAAVHRK
DWDEFDRLFAPNVSIENRRKIVGVGFPSDAMRREVRRELEAGVTRIDHVM
VAARGERVALSRMITSAVDESPGAPHDELLQVYAVDENSQVVRQVWFDVE
DMDAAMAELDAAYARFEKRCPRPPLENGATRAYERLHAYFKARDWDAITD
LLTDDYCGDDRRSVVGAGIRRGPDAAIEDYRSAADIDVTDAGSDTVATRG
ARLALARAHYARNGKESEAFRVDFLQLVEIDSDGRIAAMAAFDLDDFDAA
IAALDTRYLAGEASAHARTWSVIAEGYAAFNRHEPPPTTPDCVSIDHRRG
TAFAPGELFSYMRAAWDDTPDLTSYIDAVHRLNDMGALVTHVARGTSPEG
VDAEWRYHHLLMVDGESITRSELFDESALDAALARFDQLTRPAPRLQNAA
SQADQRFWKHFPVRDWDAMAATLADDFLLDDRRQALNAGNRRGRDTEIAN
LKVMADLVGQGNVTSNVIAIRGERLALRRLRFSEQDRNPEAFYTEMIGIV
EINADNRLGAHIAFDPDDITGAIAELDARYLAGEAAPYARTWSTIAGALV
AHNRREVAVATPDAITLDHRRAVAFGPDEGSDFLRAGWELDQHLDLYVET
VHRVTEVGAVFTWAGYGTSHDGFDAEWRAVDLMTVDGEMISRVEVFDESD
LDEALERFDELTRPEPRLENGASRVVQQFLAKFAARDWDAMTEMLSHDIS
TVDRRRVVNAGIRQDRDAEINDFRSAADLDVTNATSDVIAIRGERLALIR
SQVSRGDVDTEAFHVDLLWLLEIGSNQKIAGWATFDPDDFDAAVAELDAR
YVAGEASAHSRTWSVILRAYDALNRRETPPTTSDWVTIDHRHASSFPPGG
LPALLAEWQLASDVSSAVEAVHRLDSLGAVFSTVSHETSREGFKAEWRSI
AVITVDRDLVDRLEVFDEADVDAAIARFDELHPHPGRLENAASQTYERFR
SCFAARDWDAMARLLTAETTVDDHRRVVSAETRRGRDVEIANMRAFAGLG
ATRSTATVIATRGERLVLCRTCIRGEDQQPGGFHIDMLIIVETSADKRIL
ARVAYDPDDIEAAFDDLNARYVAGDAAAHAHTWGLVTAAFAAINRHELPE
ISPDWVNIDHRRGATFAAGDMTSYIHDVFDDAPDFHVDVAAVHRLSDLGA
VVTMASHGTSTRGFQAEWREIGLLTFDGELLSRCELFDEADLPDALARFE
ELHRPAPRLHNTASEVSDRFLAYFAARDWDAMANMTADDFASDDRRRVVG
AGIQSGRDVDMNNMRAWAEVGITKIASDVIAIRGQRLFLGRTRFFGREQG
PGAFHSEVLGLVELDAEDRMLARVVFDPNDIDAAFEELDARYLTGEAAAY
ARTWSLLRASDAALDRYEVPAHTPDWANVDHRRATAFEPGGLIAYLRAGW
DLAEDVKNHILAVHRMTERGAVVTHLAHGRLRNRSDVEWREVALVMFEGD
LVSRCELFDEADLPDALARFEELHRPAPRPENVTSRLIELFLAHFAIRDW
DAMAELLADDFYSEDRRRTLNAGIRRGRDAEMANWHATGDVWMSDVSSTV
IATRGAHLALFRFAFASQEHEPEAFQAAALGVVQASDDNQATASIVFDPD
DFDAAFEELDARYLAGEAAAHARIWSVITRAYSRTNRCELPDMTTDSVLI
DHRTTVTTEREPLPGFLPSLWEIMPDLSVYIEAVHRLSDRGAVITQVARG
TSQQGFDAEWRAIVVGILDGDLYTRIEAFDDTDLDAALTRFDELSAEVRP
K
>MAP1540 hypothetical protein
MTDMDSGSQDQNGDEVTVETTSVFRADFLNELDAPAQAGTESAVSGVEGL
PAGSALLVVKRGPNAGSRFLLDQATTSAGRHPDSDIFLDDVTVSRRHAEF
RLENNEFSVVDVGSLNGTYVNREPVDSAVLANGDEVQIGKFRLVFLTGPK
QGEDGGSSG
>MAP4309 hypothetical protein
MGKHQVDQAVVLTVSGEVDMLSSPMLAEAIQTALAAKPAALIVDLSKVGF
LASAGMTVLVTTQAELQPPTKFAVVADGSATSRPIKLMGIDSVLSLYPSL
NDALSALSGA
>MAP3111c hypothetical protein
MTADDETRQYGFVHAALIYRSQQEFLDVVAGFVGDGLAGNEAVLLAVPPA
LMALLRDELYAGGEPPAGVRMADITESARNPSRFMAMQGAFVDEHPDRRV
RIVSQLAWPGRSREEFVACIEHEALVNEAMDGYPATALCLYDASTLDDGV
LADARATHPLLCKSGALQRSPDYAPGEVLERCNQPLPANPGAVTYLVRRS
ADLRPARSFAVDYAGWVGLSQESIEDLQLVATELATNSLMYTGGACRLAF
WQDDRYLVCEARDTGRFDDPLVGRLDPGPCGPASRGLYLVNAISDLVRTH
TTSTGTTIQAYLCFEPAVRPTG
>MAP2504 hypothetical protein
MSDERSSKVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLLNES
FSSDPVFRERMKREARTAGRLQEPHVVPVHDYGEIDGQMFLEMRLVEGTD
LDSVLKRFGPLPPPRAVAIITQIASALDAAHAAGVMHRDVKPQNILVTRD
DFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSDAEVTYRADIYA
LACVLFECLTGSAPYRADSAGVLVSAHVMDPIPAPSARRPGVPKAFDAVI
ARGMAKKPEDRYASAGDLALAAKEALSTPDQDRAATILRRSQEAALPPRG
SATPGPARCWRHRLPSTPGSPVPLRHKLAGAGRRRAARSQPPGSPGRRTT
RAATPIGPLRQHNSPGRRRGAGRRPNGGCGRSSPRSSPCSSSSRAGWASG
W
>MAP2507c hypothetical protein
MTSHDGLPALYVDDVHDGDDRDIEDLLDGLQGTARTERAELVRWLLAQGI
TAEEIRTTNPPLLLATRHLIGDDGTYVSTREISETYGIDMALLQRVQRAI
GLVRVDDPDAAVHMRADGEAAAFTQRFVDVGLDPDQVVLVAQVLAEGLSR
AAEVMRYSALSAIMRPGATELEIAKASKALVTQIAPLLGPMIQQMLFMQL
RHMMETEAVNAAERAAGKPLPGARQITVAFADLVGFTRIGEAVSPEELGQ
LANRLAILARDVTVPPVRFVKTIGDAVMFVCPEPRPLLDVVLKLVEAVDT
DNEFPRLRAGVASGTAVSRAGDWFGSPVNVASRVTAVARPGTVLVADSVW
DVIGDNGEFSGSFAGARRLKGIKNEVKLFRVRRGG
>MAP0834c hypothetical protein
MDTAASSPRVLVVDDDSDVLASLERGLRLSGFEVSTAVDGAEALRSATET
RPDAIVLDINMPVLDGVSVVTALRAMDNDVPVCVLSARSSVDDRVAGLEA
GADDYLVKPFVLAELVARVKALLRRRGATATSSSETITVGPLEVDIPGRR
ARVNGVDVDLTKREFDLLAVLAEHKTAVLSRAQLLELVWGYDFAADTNVV
DVFIGYLRRKLEANGGPRLLHTVRGVGFVLRMQ
>MAP3648c hypothetical protein
MTAEQVSQPPASGSSRFAVYDYVDGMERLVHAVQELSLARSLPDIQRIVR
SSARELTGCDGATFVLRDNDRCYYADEDAIAPLWKGSRFPMTSCISGWAM
LNRDAAVIPDIYRDPRIPHTLYRPTFVKSLVMVPIRKLDPIGAIGNYWAE
PHQPSEHEVRLLQALADSTSMAM
>MAP1342c hypothetical protein
MPDESATPADAGYDNAGVPTFDSVRDKIEARYATAQGAADLDAESPEGRS
VAEQYDERERAAARRLAQIRESMRPQQD
>MAP2026 hypothetical protein
MKSLVGTSFGQYEIRRLIGKGGMGEVYEAYDTKKGRAVALKLLTDNYADD
EKFRERFLRESRAAAILQEPHVIPIHDWGEINGVLYIDMRLVQGQTLHEM
LKTGSLEPRRATDIIRQVASALDAAHAAGLIHRDVKPQNIIVTPDDFAYL
VDFGIAEARGDTHLTMAGHTVGTFDYMAPERFGDEETTSAVDVYALACVL
YEALTGAKPFPVHSAEQAIRAHLSSPPPRPSAVNPHVPASFDDVIARGMA
KHPDDRYGSAGALGRAAKRALAPDPATSAGTNTLLAPQYVSAPSSYPPFA
AQYPYPATGPVSATDADQGGSKKLMVLTIVGVAVALLVGGTGLVIGLTTQ
RNSSTSEPSTSPLVSYTNPVPTYETEPARLPSTPTSAPQDATQQLHQIAN
DDRAFVRAQLADRWVPQLSSKRPGVVDNGVVWDNAMTLREHLQLRQRYPN
VKLLWSGDWSTFSGPDFWVTVAGLTFADSSGPLAWCRFQGFDRDHCAAKL
VSTTHPEAGSTAYN
>MAP1002c hypothetical protein
MTVMSGQSAAQRPRQAILGQLPRIYRADGSPIRVLLVDDEPALTNLVKMA
LHYEGWVVDIAHNGREAMAKFDRAAPDVLVLDIMLPDVDGLRILERVRQS
DAYTPTLFLTARDSVMDRVTGLTAGADDYMTKPFSLEELVARLRGLLRRA
SQQPAPTAETLKVGDLVVDTASREVTRGDTPVSLSSTEFELLRFLMRNPR
RALSRTEILDRVWNYDFAGRTSIVDLYISYLRKKIDSGREPMIHTVRGVG
YMLRPAE
>MAP3321c hypothetical protein
MSTLGDLLAEHTMLPGNAVDHLHAVVGEWQLLADLSFADYLMWVCRDDGM
LVCVAQCRPNTAPTVLQTDAVGTVVASERLTLVAETFASGAAKADDVAAQ
QDSWLPGTHVEASPVRYGGHVVAVLTRHQTAVAADRASGQLEIAYRECAA
DLVHMLAEGTFPDVGDVAMSRSTPRAGDGFIRLDVNGVVAYASPNALSAY
HRMGLTTELEGHNLIEITRPLISDSFEAQEMAEHVKDLLAGGKSMRMEVD
AGGATVLLRTLPLVVNGASAGAAVLIRDVTEVKRRDRALISKDATIREIH
HRVKNNLQTVAALLRLQARRTTNPEGREALIESVRRVSSIALVHDALSMS
VDEQVNLDEVIDRILPIMNDVASVDRPIRINRVGDLGVLDSDRATALIMV
ITELVQNAIEHAFDPTAAEGSVTIRAERSARWLDVVVHDDGRGLPEGFSL
EASDSLGLQIVRTLVSAELDGTLGMSEASERGTDVVLRVPIGRRTRMLL
>MAP0690 hypothetical protein
MGAPMSRLLVCLGSTGADEYGDRVAVAGDAELERVRAVHQMRSYRIGSVL
RVGVVGLMVAAMIIGTARSEWPQESVLILLYGIIALGAVALAFAPFRRWI
GTGRLAGVGRLEPFAFTVVDILALTIFQLLSTNGIYPLLIMAMLPVLLGL
DVSSRRAAVVLIFSMLGFAIAVLQDPVMSSSVRLPEAGFRFLLYGFLCCA
AFLVVRIEERHTRSVAGLSALREELLAQTMNASDVSQRRISEFIHDGPLQ
DILAVRQELVELDAAVPDDEHIGRALAGLQLASERLRQATFELHPAVLEQ
VGLGAAVQQLAEFTAQRSGIEISTDIDYSGRSAIDAVVFGVVRELLSHVV
QHSRARTAAIMLGRTDGTCVLHVVDDGVGFNQETVARRLGEGHIGLASHR
ARVEAAGGELVFLDVPVGTHVCVEVPLRN
>MAP2742 hypothetical protein
MNSVIVEPMSATHLTLSTKLVYELGDPNSTLRATTARSGSAVLIYAGGEI
DACNEDTWRHLVSEAAGVVTTPGPFVVDVTGLEFMGCCAFAVLTDEAKRC
RQRGIDLRLVSCEPIVGRIIDACGLSDILPIYPTVDSALSGADRW
>MAP0376c hypothetical protein
MRDSGRFGPIEWARAGRPLPSEYTSGDRGIAIDIDGEAALFALVDGLGHG
PPAAEAALRAVDTVTAAGAEPIEVLIQLCHRALEGTRGVAMTLARIDFAA
NTLTWTGVGNVTAHLVAKAPTGTEVRSSARLAGGIVGYRIPEIRPAQVVS
IRPGDLLVMRTDGTNTWTTSTSPPPPWPSPKVCWARTPRRPTTPWCWPPG
TGEPRHE
>MAP0259 hypothetical protein
MSQPRASAERVVMCRADGNPINVLVVDDEAVLAEMVSMALRYEGWNIATA
SDGASAIAAARNQRPDVVVLDVMLPDMSGLDVLHKLREENPQLPVLLLTA
KDAVEDRIAGLTAGGDDYVTKPFSIEEVVLRLRALLRRTGVTTVDSGAQL
VVGDLVLDEDSHEVTRAGEPISLTSTEFELLRFMMRNSKRVLSKAQILDR
VWSYDFGGRSNIVELYISYLRKKIDNGREPMIHTLRGAGYVLKPAR
>MAP0426c hypothetical protein
MPRTQFSFRDVAAARYGGIVTTAAALPGRISAFARWVVRTPWPLFWLSMM
QADIIGALFVLGFLRYALPPEDRIQLQDLPVVNLAVFATSLVVLFLAASV
VNLTLLMPVFRWQRRDNLLAESDPAATELARIRALRMPFYRTVTSMVIWC
IGGVVFIIASWSVARFAAPVVAVATGLGAAATAIIGYLQCERVLRPVAVA
ALRSGVPENVKAPGVILRQILTWMLSTAVPVLAIVLAVVADKTSLLHATP
EKLFTPILLLAVAALGIGLVSTLLVAMSIADPLRQLRWALSEVQRGNYNA
HMQIYDASELGLLQAGFNDMVRDLSERQRLRDLFGRYVGEDVARRALERG
TELGGQERDVAVLFVDLVGSTQLAATRPPAEVVHLLNEFFRVVVETVGKH
GGFVNKFQGDAALAIFGAPIEHPDSPGGALAAARELHDALLPVIGSAEFG
IGVSAGRAIAGHIGAQARFEYTVIGDPVNEAARLTELAKLEPGHVLASAI
AVSGALDSEALCWDVGEVVELRGRTAPTQLARPVKLATPEEVAPQEVSSE
VSG
>MAP3264c hypothetical protein
MPPDGSGGDVMREASTTIVTQLVELVTSLERENSDTAAGLYELIDNGVHH
VTGSQYAGITLAEKSKSVNSVAATHRYPMVLDAIQNKYGEGPCLAAAWEH
HVMHIADLAAEQRWQRYRRHALEQTPIRSILSFELFVDRTSMAALNFYAD
PPHAFTEESVELGTVYATHIALAWSMMRRQDQFRSALASRDIIGQAKGVI
MERFDLDAVEAFELLTRLSQQSNTRVVDIAAALIDSEHPLKQRRR
>MAP3844 hypothetical protein
MTDRQRELARLRAAVDAAARGGGECVLISGVPGVGKSALMKAFGVEVAGR
GCVFAYGRCRDGAPAPYAAVRDALGSLVRTMAATAPAERDRWRADLAGGV
SPIAGVLGELVPDLPRVLGETASHPELDATDSRRQLHRAATRLLSATASY
RTVVLAIDDLQWADRDTLLLLAELLTVSLRDVLVVGAHRAGGFDPAAAGI
AAENLTTIELEPLARDDVEELLATVCGKSGELPEVAAEFHHRTGGNPLQV
RQLLYRAQREGALLPVGSGGPPRWDLRVLTSIEVSATAAEFLGRYLDQLR
SADREVLSALSCIGGEFDLDDAAAAAARTPDVVAHALWACLELRLLEALD
MTGQRISNAISRDARYRFSHDRVAEAARAGLSPDEMRAVHLRIGRRLITL
GDDRLFEAARHVGVGGLAVADEAERTRFVEVLRRAAHQARAQASFPLALG
YCRDALDLLGEQRWAGQFALTRELALDAADAALRVGDVPVLNALLEEAEE
FLREPPDRARLAYLRMKGRVAENRLQEALEIGLAALDELGERVATQAGKP
RMGNAIVRMRLTMARWSTERLLQLPHCTDQRVIALHPILAELCNMAVLVR
PNLLPLLVRKQLELTLAHGHTSSSPLVITGYGIILVLLGDHAGSQRFGEV
GMLLAQRPELREARPQTVFMYLDYIHHWRHPIRDGLGELRDAVEEALDRG
DQENAGFLTAVLLSQSFWVGRPLPEIDALASSLIPHIRSQPVPSALCQAV
QQLCLNLMGRSADPFLLAGESAYDERQVLPAARQQGDEVALGAAAAIKQG
LHFWCGDHAGAAEATTEAIEHLGGLAGVAASQLVHLIGALSMIHCAPRDS
ATIRFVRQALASHRKWAAGAPENYAAPYALIQGAWARARGQHGKAERHLH
QAIELAEEHRLPAIGARAHEEAAALYAETGRQTLREHMLRSAYQRWLNLG
FAVRADWLAREHPWLLRRDLRTGSAGIDPIGAHQLVHALSGARTPDALAN
IILGSVADTTGAGRVLFITGEADNQSVRAIHDHGEVLIVEGPWTEVPYRR
DIVRRVIDGGAPVSVAADRAAATNSVLAAPITLHDRTLGVIYAEQGEPGR
NFGADHEEAIAFLCAQAAAPLWSFQLEARLRAADEYRQSLIDVQSRFVPN
ELLRILDIDDLRRVRNGHRVEREMTVLISDIRSYTAMIEDMNVAEAGNLA
MGFLRAVELPIISYHGMIQDVRGDEIVAVFESEADAVRAGLGMLRSLHEH
NQERRALGSEELRAGIGINTGAVAVGLVGGVNRMVLTIIGDAVNLAARIE
STNKRYGSALLISDRTYQRIAGSEEFDVRRMERVMVVNRRRPVTIYEVYN
GDSGPLRAAKRAAQPAFDEAFALFDAGDVDAARAAFQRCRDMLPDDPIAP
LHLAHCDAVARGEMLPGQEIALQNK
>MAP3271c hypothetical protein
MVKVFLVDDHEVVRRGLCDLLSSDPDLQIVGEAGTVAEAKARIPAARPDV
AVLDVRLPDGNGIELCRDLLSEHPDLRCLMLTSFTSDEAMLEAILAGASG
FVIKDIKGMELARAIKDVGAGKSLLDNRAAAALMAKLRGSAAQADPLSGL
TEQERTLLGLLSEGLTNRQIAARMFLAEKTVKNYVSRLLAKLGMERRTQA
AVFASKLNQQAGRPPTPPDWPG
>MAP3292c hypothetical protein
MNRRILTLMVALVPIVVFGVLLAVVTVPFVSLGPGPTFDTLGEVDGKQVV
AIEGTMTHPTTGHLNMTTVSQRDDLTLGEALTLWLSDTEQLVPRDLIYPP
GKSREDVDKANNADFKQSEDSAAYAALGYLKYPSAVTVAKVPEPGPSAGK
LKVGDAIDAVNGTPVATVEEFTGLLKNTKPGQTVTIDYRRKNEPAGVAQI
TLGANKDRDYGFLGVAVLDAPWAPFVVNFNLANVGGPSAGLMFSLAVVDK
LTTGDLAGSNFVAGTGTISADGKVGQIGGITHKMVAAHAAGATVFLVPAK
NCYEASSDNPSGLRLVKVETLAQAVDALHAITAGGQPPSC
>MAP0375c hypothetical protein
MSDDNDFHDQYARALRAYLASRDEASLAVGHELGRRALQEQISLLEIIEQ
HVRLVFEISQDVRIDAPIALEFLLQLLVPLDVATRGFIDGNRRYAEQRAR
AEGLADRDKFRNALVNSLQEGFFVADHEGSVIEVNNAFIEILGFPAEGLP
YRWPHPWLVDPKTASEQIGVVLRTGSAEYETPIRHADGHLAWVKVSINAV
KDSGSDRDAYVGTIRDVTAERAFAARESAVLRLATAVAVAKSVDELLSIT
LDECSTAIDVQRVVAVSWPSGDGDPAVQVAGRPAASSWRELDPWLRDTFA
EARHQLPLTAKTVDHPDNPGKSRGLVAVLSGAGDLALWLELRTPRWVSGE
DRLLVTVLIGHLSLALQHVRQFENARETSLTLQRAMLPPVQPPPGFAVRY
EPAVPPLEIGGDWYDVLPIGDHRIGIVVGDCVGRGLPAAAIMGQLRSSAR
ALLINGAEPAVLLDQLDSAASLIPNAYCTTVFLAILDTETGVLQYSNAGH
MPAVLAGPEPGTTTLLTDAASVPLAVRREDPRPQATRLLPPGSTLMLFTD
GLVERKHEPIDDGIGRAAEVLAQTMTLPLDTVADEVLRELAPAAGYDDDV
AMVIYRHQQSPLRIETAATADQLVRIRHRLADWLGAAGITGELAADIVLV
VNEACTNCVEHAYRGFVAGTMVLDARLGEGEVHTRITDYGSWKTPAANPV
NSGRGLPLMKALSQAMELRTSATGTVADITFRRPAE
>MAP0022c hypothetical protein
MQGLVLQLTRAGFLMLLWVFIWSVLRILRTDIYAPTGAVMVRRGLALRST
LLPSRQRRHAARYLVVTEGALAGGRITLSGQPVLIGRADDSTLVLTDDYA
SARHARLTQRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGTPIRIGKTA
IELRP
>MAP4317c hypothetical protein
MTDPAPEIAVLLVDDQDLVRSGLRRILRRKDGFVIVAECADGDEVPAAIA
AHRPDVVVMDLRMRRVDGIEATRRLGGRPPVLALTTFNEDELLSGALRAG
AAGFVLKDSSAEELIRAVRAVARGEGYLDPAVTSRVLTTYRKAAPGPRGA
AIAELTTRERDVLTLIGKGLSNSEIADELCISGVTVKSHIGRIFGKLDLR
DRAAAIVYAYDNGIVAPR
>MAP1437c hypothetical protein
MRERLLNVVHTHGPWRLIIAQHFASADELMATMSGAFPEGFTPSLDLFLT
PTFRGYLANYGTVLYPELHDCFYNAAFLEHAKRYWNAEYAKPEMMLFNIN
GPCANRDPGHLDSPSFRGVRHENAPTWLCSVMGKSGLFSDYLIKMAQVIT
WFCLDTGSGFTYWPDGPLKAPARVLPPINNRGVVVQNEMMVHRGEANGPL
EQQIPSGLAFDTVFAGDPSDRDHWLLKNGEDVIARHHTDELRFLVHWSAE
VFSDYDELKKNMEGSDDITIDRAIDMMLDDLHAKGIKLDAPSEPLHDPEF
IAALNAAYDVGGPTSYPDQAPLSAFQPA
>MAP3647c hypothetical protein
MEQRVRDRTAALERANEEIRRLSVTDELTGLNNRRGFYLLAGQKLSGSHR
LGHACVLAFLDVDGLKQVNDEQGHDAGDMLIKDVAEVLRSIRRESDILAR
IGGDEFCVMVTESDQDSAPLRERLAEAFAAFNATGDRPYRLSASVGLVRA
PVFDTASVDELLARADELMYVEKKARRTAG
>MAP2085 hypothetical protein
MTTRPSSMPFRPTGTTPHRRATRRHLHGRGRGAYCGKPRPVVVVQDDRFD
ATASVTVVPFTTSDVDAPLFRIVVQPSETTGLAETSRLMIDKITTVPRAS
LTRRVGRLPDGDIVRLDMALLVFLGLAE
>MAP0649 hypothetical protein
MELLLLTPELHPDPVLPSLSLLAHTVRTAPPEPSSLLEAGTADAVIVDAR
TDLSSARGLCRLLSTAGRSVPVLAVVSEGGLVAVNSDWGLDEILLPTTGP
AEVDARLRLVVGRRGGLADQESAGKVTLGELVIDEGTYTARLRGRPLDLT
YKEFELLKYLAQHAGRVFTRAQLLHEVWGYDFFGGTRTVDVHVRRLRAKL
GPEYEALIGTVRNVGYKAVRPARGRPPVAEADDAESESDSASESDAEDVH
DPLVDPLHTQ
>MAP1741c hypothetical protein
MAASSKRCGVLVGVDGSPASNFAVCWAARDAAMRNVPLTLVHMVNAATVW
PQVPMAAEAVAWQEDDGRRVLQEAVKIAEDATRNGRKLAITTELWHAPPA
PTLAQLSEEAELVVVGSTGRGAIGRLLLGSVSSGLVRRARCPVAVIHDED
PMMPYPQRAPVLVGIDGSPASELATAIAFDEASRRGVDLNAVHAWSDTQV
FGLPGIDWPAVRSEAERSLAERLAGWQERYPDVTVHRMVVCDRPARQLIE
QSESAQLTVVGSHGRGGLAGTLLGSVSNAVVHSVRMPVIVARPS
>MAP1683c hypothetical protein
MSKWQCRTSPYRGVTRRFDGMGRSLPHPSGDKDARPYRLRLRADAGSVFY
RTQQARMLRIFLGATIFLYAYGVVFTLFPIRPGLTLSNPAGGIVAVGLGL
GALAWLAARPDKPAPATVTAIVATPIVMAFHRVIIAEFVCLIAPMFLAMY
LRAFYSPRRGAALVAVLAGLSVAALAVAPTPKLSIDYAIFVIAIIGAAES
FGLLTRALVTAACTDPLTGLLNRAGWEIATADLLSRTRSATATVTVVALD
IDNFKTLNDTEGHVAGDQYLVRCAAFWRQVAPAGAVLARFGGDEFAVCIA
DHHPTSAKADQFVASVRRHTPDISVGTASGAGASADIASLYAAADAELYS
AKRRRRDPGLRPARG
>MAP4318c hypothetical protein
MLRAAICHLREQLRRRGELIPVGLTWASLIAVDAALLLGGTIGTLQRPAA
DLPVSLTAFAIALSPTVAFFVFNMKMTPGPLWATWTASTALTLFGTSTPI
RADFAPALLVLMVLVVATLASVVGGLLAAASAAALLLTAAALHRLDAIVL
YLGFIVAAWLLGYLMRTQRLLVAEQIEAQQMLAEHAAADERRRIAREVHD
VIAHSLSITLLHVTGARRALRQDRDVDDAVEALEQAERLGRQAMADIRRT
VGLLDNWPTKAAPTTPEPGLDDVASLVDGFRRAGLAVTLCVEGPTDHVSP
AVGLALYRITQESLANIAKHAPESKAGVVLDISPASARLAVTNILPAAVV
AARSPEGRGVRGMRQRVELLGGAIDVGPTRDGWSVCAEIPLQESDGSWRP
WWCGKA
>MAP0377c hypothetical protein
MAADIVVAIDNPDDIVEARKAGHQLALDLGFSLTDVTMIATAISEIARNI
TSYAGRGAVRVAVADREGRKALVVRAEDEGPGIADIERAMEDGYSTGRGL
GMGLPGARRLMDRLVVESTLGQGTVIEMWKWVPPRA
>MAP0860c hypothetical protein
MTDHLEVRISQVLPALPPDIVAKAFQAGATVSDLLHVSALTGTALTKAKL
YLPQIRRDAVRAYYDPRDDIRQDALAVLAMSPAARFAEKATLQLRSLRAD
LDANRSAIGALCGTQKVECERLAQLNDRIVRAASSSVLGSRSLIKLVSEH
DSVIEVLRANFGKLLALTESRGRMLTRATEVMDSALALAGCGVTDIHSSG
GTNRTPRTIYKRAARLGRTQTHRLKDEWVTDCKALLGRLADYADATEALR
QSIDESFAILQLAPGRTTLALSPERRAWTDPYWTATDPRGIRGDSP
>MAP4007c hypothetical protein
MTIAGTETRHDNCVFDCDGAQVRAHHRHLATVVHIRGVIDDANVDRISHH
IRRFTLGENPVVLDVADVTQFAEAGISLLYAFDADCRAAEVEWTLVASSA
VTAVLDDTGHDASFPIMRSVHEVLRDRADAIAVRRRMALPLVKKTS
>MAP2594 hypothetical protein
MAGVGLGQLLLALDATMVSLVDAPRGLDQPVASAALIDSDDVRLGLAAAA
GSADVFFLLGVGDDEALRWIDTQARDRVPVAVFVKEPSDALVGTAVAAGS
AVVAVDPRARWERLYQLVNHVLEHHGDRADAADDSGTDLFGLAQSLAERI
HGMVSIENAQSQVLAYSASNDEADELRRLSILGRAGPPEHLEWIGQWGIF
DALRSGTQVVRVAERPELGLRPRLAVGIHQPDPDARRPPVFAGTIWVQQG
AQPLADDAEQILRGAAVLAARIMARLAARPSTQARRLQQLLGVTDSETLA
PVDLTAIAAELGLAADGCAALIGWAAAEGASRHTRLTDVIALSASAFRHD
AHVAGHGSRTYVLLPQPPNRSVSSWVRGTIAALRAELGVQLRAVIAAPVP
GLSAVAAARAEVDRVLDSAERHPLSFGQVTSLAEARTTVLLDEIVTLVGR
DERLVDPRIVALHDREPVLAQTLRTYLDAFGDIAAAAHALRVHPNTVRYR
VRRIEKLLSVSLADPEVRLLFALALRVAER
>MAP0833c hypothetical protein
MKLLSRIVTRTPSLRARVAVATAIGTAIVVVIAGAVVWFGITSAWRERLD
RRLDETAGFAIPFLPRGLNEIPRSPKDQDTVITVRRDGQVKSNSDVTLPP
MEEGYADTYINGVRYRVRTVEVRTPQPATVEVGATYDDTIAQTNNLHRRV
ILICALSIGAAAVFAWLLAAFAVRPFKRLAQQARSIDADERPQVAVRGAS
EAVEIAEAMRGMLQRIWKEQDRTKEALASARDFSAVTAHELRSPLTAMRT
NLEVLSTLDMPDEQRKEVLGDVIRTQSRIETTLTALERLAQGELSTSEDH
VPVDITELLDRAAHDAMRIYPGLDVSLVPSPTCIIVGLPAGLRLAVDNAI
ANAVKHGGATRVQLSAVSSRAGVEIAVDDNGSGVPEDERQVVFDRFSRGS
TASQSGSGPGIGAGGPAGPAARRDGRAGKQPAGRCPAGPAPARAQLMGIE
Q
>MAP3235c hypothetical protein
MVIAAADGDLQSDLRGAVAIGASAGGVEALSNLAAGLSPDVPFAYLMVLH
VPAGAPSILARIIDRSGPLPASAAQDGAPLQPGHIYVGVPDRHLLVDGAH
VLLSQGPTENGHRPAINALFRSVALTFGPRAIGVLLSGVLDDGVLGMAAI
RSRGGVTVCQSPDDALFPAMPANALDAGVVDRQAAAADIGVVLKELAHRE
IEDPDMQRDPGMELENRIAMMSRFATDFDTEKLGPPSGYTCPDCNGSLVS
LSEGNYRCRVGHAWTAEALLSARDNELEGALWVAVRSLQEKARLARDMAG
KAGAGLLSRRYTQIAEESERALHVLSHRLAGAAAEGETR
>MAP1357 hypothetical protein
MRHMRRGRRCLASTKANNLARWNSRVAIWIRASPRNPRRRSRLRDAPHCR
CTTPPSGCTTETTAPGRWHSSGVRAGCCRATPTSAIRCLPPAMAARAADR
LLGDRDAVSREVSLGVLQVWQALTEGISRRPANPEVTLVFTDLVGFSTWS
LQAGDDAALALLRQVARAVEPPLLDAGGHIVKRMGDGLMAVFGDPTVAVR
AVLAAKEALRSVEVAGYTPRMRVGIHTGRPQRLASDWLGVDVNIAARVME
RATKGGIMVSSSTLDHIPQSELDALGVEAKRTRKPVFGPKPAGMPADLAI
YRLKTLKELSASDDTDETKPQP
>MAP1339 hypothetical protein
MGAYRTVVVGTDGSDSSMRAVERAAQIAGPDAKLIVASAYLPQHEDARAA
DALREESYKVSGTAPIYAILRDAKERAHQAGAKNVDERPIVGVPVDALVH
LAEEEQADLLVVGNVGLSTIAGRLLGSVPANVSRRAKTDVLIVHTTS
>MAP1221 hypothetical protein
MLVVEDSETIREMVSEALTEVGYHTEARRDGERLEELLDGIRPDLVVLDV
MLPGRDGFALIDVIRDWGDIGIVLITARDGLPDRLRGLDGGADDYVIKPF
ELAELVSRVGAVLRRRGRLPQVIQVGDVTLDPGAGVAARGGHRLDLTATE
LRVLTFLVEQRGRIVSAGQILNGVWGYDAYDPNLVQVHVSGLRRKLEAHG
PRILHTVRGIGYRLQPERS
>MAP3269 hypothetical protein
MHLSPGEPSEREGLTMTWNLDGQTANVEQALAGGHTQPAGWFRFYFADQR
WEWSEQVQRMHGYEPGSITPTTELVLSHKHPDDRARVAATIDEIVTNHQA
FSTRHRIIDTAGNVRHVVVVGDRLTDEQGEVIGAQGFYIDVSTPHEQVQE
EMMSARLAEISRNRARIEQTKGMLMLIYGISDSAAFELLKWLSQEGNVKL
RPLAEQIAEDFRGADVPLPTRSEFDHLLLTAHQRLKASDTRS
>MAP4336 hypothetical protein
MPGSPPTGPLPPVPAGIDRRRPELSDSALVSRSWAMAFATLVSRLTGFAR
VVLLAAILGAALSSAFSVANQLPNLVAALVLEATFTAIFVPVLARAEQSD
PDGGAAFVRRLVTLTTALLIVATALSVAAAPLLVRLMLGRTPQVNEPLTV
AFAYLLLPQVLAYGLTSVFMAILNTRNVFGPTAWAPVVNNVVALATLAVY
ALVPGELSVDPVRMGNAKLLVLAVGTTLGVFAQTGVLLVALRRQHVDLRP
LWGIDQRLKRFGTMAAAMVLYVLISQLGLVVGNQIASTAAASGPAIYNYT
WLVLMLPFGMIGVTVLTVVMPRLSRNAAADDTRAVLADLSLATRLTLITL
IPIVAFMTVGGPAMGSALFAYGHFGDVDAGYLGAAIALSAFTLIPYGLVL
LQLRVFYAREQPWTPIVIILVITAVKILGSMLAPHLTGDPKLVAGFLGLA
NGVGFLAGAVIGYVLLRRTLLPGGGHLIGVGEVRTILVTLTAAMLAGLVA
HVADRLLGLGALTAHGGGAGSLLRLLVLALIMVPITAAVMLRAQVPEARA
ALDAVRFRITGRGPRPRKPAAPDRSSHRRPVTYPEQRNSSPPGVNAVQEP
IRRRPPERANRARLVKGPEVTDRPMESAASSAGPGTGSGAPRPVADDFQP
DIPADQPDRPRKADPRPADQKNGDVGTRRGPLDVPRERTADSSTDDVHLV
PGARIAGGRYRLLVFHGGAPPLQFWQALDTALDRQVALTFVDPDRALPDE
VLQEILSRTLRLSRIDKPGIARVLDVVHTGSGGLVVSEWIRGGSLQEVAD
TAPSPVGAVRAMQSLAAAADAAHRAGVALSIDHPSRVRVSIEGDVVLAYP
ATMPDANPQDDIRGIGAALYALLVNRWPLAESGVRSGLAPAERDSSGNPV
EPMAIDRDIPFQISAVAVRAVQDDGGIRSASTLLNLLQQATAVADRTEVL
GPIDDSPSPSTALISPGNDPATFARRRRNVLIGVGAGLAVLVAALLVLAS
IVSKIFGNVGGGLNKDELGLNGPSSSTSAPQTTTSTAAGSVVKPTRASVF
SPDGDADNPGTAGQAIDGDPSTAWATEVYTDAVPFPSFKQGEGLILQLPS
PTVVGQVSIDTPSTGTKVEIRAASSPTPAGLNDTTVLAPAFTLKPGHNVI
PVRAGSPTSNLLVWISTLGTTNGKSQAGFSEITVQAAS
>MAP0949 hypothetical protein
MPRSLDLVVTSVATALMEATASTARSVSETVLAQLVEQFDVDASFLHHHD
AAGPVPVAEWPRRTDAAGPMAALRSEDVESVLAQCERQRRPVVSSPDRVD
GWLRRWLQRDKAAGAPSVAAAPLVSGDATTGVLGFVKFGARRWKPEEINT
LEAVAALFAQLQARIAAEERLRYLAEHDDLTGVYNRRALVAHLTERLAAG
NPGPVAVFYLDLDRLKPINDYLGHTAGDWFIRVFAQRIEECAGEQSMIAR
LGGDEFVVIPHQPMASEAAEAFARRLSTLLCDRLTIGGHVISRTVSIGLA
VGTPGADNCTDLLRRADEAVLTAKRAGGNQTAVSTDDMSLKRAFRNDIEL
HLQGDIESEGLLLHYLPEVDLWTGAVVGAEALVRWRHPIWGLLLPDSFIG
VAESTNLGAELGGGVMRSACADFSRWRANGVGHGAMLRINVSPIQLISRG
FVETVADTIGEFGIDAGSVCLEITERAVVHDTETTRKTLSELKEVGVQIA
IDDFGTGYAVLSHLKSLPVDMLKIDAGFVREVGTDAGDLAIVRAIIGLAE
AFGLEVVAEGVETPAAALTLMQHGCHRAQGFLLSRPIPGEAMEALLSARW
MPMPFLADREASSMGSI
>MAP1740c hypothetical protein
MPDSVIHRRHPASCAGARSRGSGSRPGGSICAVGGNQRGPRLMTSPGADR
YLSAAFRNKSLPERLRDLEAMVEAITDCAIIQLDANGDVARWCPGAEAMT
GYSAAEVLGRPVALLYTSEDRAAGLAESELAAARESGRCEFEGWRARKNG
QRFRAGVALSVFTDDAGSAIGFTTVMRDVTAEHQRAETMFHALLESAPDA
MVIVGPDGRIVMANAQADQMFGYPREELIGREVEILIPPRHRGSHERYRT
GFFAAPAARRMGAGLELWGMRCDGTVFPVDVSLSPLQTEQGVMVSAAIRD
ITEQLAVQAELTETRAQAEVLAERDRIAGDLQDHAIQRVFAVGLALQGTI
PRARSADVQQRLNAAVDDLHAVVQDFRTAIFDLRHTKTDVPGLRQRFDEV
IGRLAEGLATTVQYKGPLSVVEGELADQAEAVVAEAIGNAVRHAAATKLT
IAVEVADEVSIKVIDNGKGLPDDVSEAGLKTLRRRAERVGGTLTVGAAAG
GGTRLRWVAPLP
>MAP2011 hypothetical protein
MSARDDQASGRDDDQRLTEKPEPDEDDKKKAAEMMEAYEDKATIVLPGTG
GSVSGTAVNEWLDEDGNPKHEVAEGTDHADEGRDEQALRDQIEKDKALNE
VLREAAAAPNKGEKG
>MAP2968 hypothetical protein
MAEQSEAGAVASAAGGEKADPATVFAALAEIIYQGSDANEMYAAICIAAT
LTVRGCDHASLLVRENDRYVTVGASDQLARHIDELERRAGDGPCIDAIEE
ETPQIDPDLTTPSLWPKLASVLVAETPVRGAMGFRLLVDKRKGAALNLFS
DTPNMFDAESAGRAAVLAAFASVAINAVAKGEDAASLRRGLLSNREIGKA
VGMLMLLHEMTEDEAFDLLRRHSQALNIKLAEVARAVIDNRGQLPPEIEN
QLPPNT
>MAP1279c hypothetical protein
MARRSREQPAPDPQQPARQAGRRRPGLSWISIQSKVMVMLLVSSLASLGV
IGTVEYVAARNALQPAASERMVQLREAQKRAIETLFSDLSDSLVVYSRGS
TALEAVQAFTAGFDQLANAPVDPAQQRALVDFYTERLIKPVERDTGKKLA
LDAVLPADNAQRYLQAHYTVVAGGPFEGTGSAPDDAGDGSAWSAANARFN
DYFREIATRFEYRDALLLDTRGNVVYSVRKSASLGTNILTGPYRQSNLRG
AYEKAMAANSVNFVWITDFQPYQPQLGQPTAWLVAPIGAPGRAAGVLALP
LPIAKVNRIMTADKQWRAAGMGRTAETYLAGPDNLMRSASRLILEDPERY
ARDAVAAGTPRNTVDTAIRWGGTTLVQPVATAAVRAAQRGETGTVTDTGY
LGRRELAAYAPLSLPNSDLHWSILATRETAEANARVVSLTQTLVLTTTAM
VFVICVAALLVAQMFVRPIRRLQAGAREISAGNYDVTIPVTSRDEIGELT
AAFNDMSRNLQIKQELLGEQRKENDRLLASLMPEPVARRYREGEQTIAQE
HQDVTVIFADIVGLDEISAKLSGRELVGVVDELVREIDSAAETLGVEPIR
TLHNGYLASCGLNVPRLDSVQRTVEFAVEVQGIIARFNGRTGYHLGLRAG
INTGNVVSGLVGRSSLVYDMWGAAVNLAHQTRSSTTQTGIYVTSKVCEVM
RDIRHFTPAGTVTVGGVEQQIWQLSERTGP
>MAP3352c hypothetical protein
MMLMAQAVLPTRAELLAALSVAIDLGLGQPAEHMLRAALIATRLCDRLGL
SRQQRDCVYYTTLVMWIGCHADSHEYARWFGDDIAVRHDSYLVDWSGIPY
LRFLASNVGRGQPLAHRLSVMATLFANARGHISRLIHSHCASAALLADRI
GLGPNVQAALAYTFERYDGGGLPTGARGEDIPIQMRIAQLADMVEVHHRS
YGVAGAVAMVGARRGGQFDPGIADVFLRDADAILAGPQTGDAWAAALREA
PDRHRVDEQSLDAALVALGDFVDLKCPFTLGHSRAVARLAGQAARAAGLD
ADAVALTRRAGHVHDLGRIGVSNQIWSRPGPLSGSQFERVRLHPYLTVRI
LDQVPGLRRLAEVAGNHHECLDGSGYPRGLAGPALGMPDRILAAAVCYQS
GREPRPYREQLSEAEAARRLRGRVRCGELDPVAAEAVLHAAGQPVGPRPN
PRPDGLTPREIEVLGHVARGASNKEIAAALVISEKTARNHVERTYAKIGV
SNRIGASMYALQHGLVPAAPLHS
>MAP2948 hypothetical protein
MSDISRRGAYGPGEARTSRGLEMSLDVVVLTNEDDFESALPDLSRFARPA
RRAALSTDGDGQFGPADVAIIDARSNVAAAQAVSHRLTADHPATAVVALV
APADCAAVDVDCNFDDVMLPGTCAEELQARLRLAIGRRRGGLDGTLKFGD
LLLHPASFTASLDGRELNLTLTEFKLLNFLVQHAGQAFTRTRLMQQAWGY
EGNGRARTVDVHIRRLRAKLGTRHQSLVGTVRGVGYRAPTPPQPEWIVGH
QKP
>MAP3677 hypothetical protein
MLRKRRIHRPQGRFHLRRDHEPLPARAVPRFGGRREHRRRHHLAPPLDPS
RGRAMKDFRAPVNGVDGDSASALVAVPKSQGAQTIRPDSVSRLVGALLDD
HAAVVGRVRSVIRSRLPVYRSVADEALEAELEWVLRSAVGGREALHEPQI
AGLAAIGEARAHDGVPVDDMLRAWRIGVEVVVECAREAARRLGVDDARVL
ELVQSALAWSDIAMATSAKAHRRTERALALEAEESDAEFVRGALMGSLPA
AELRMHAELRGLDPGAEYVAVRARLGGDGPHLRLEQSLGFQDPAHSRRGL
CALLDGDLAGFLIEPPRDVEGVVGFGPPRPLTRLSESYRMAARALVTAEA
CGLRGAYDIAALGLRTAVAIDADVGELLRKRYLEPLSVGGSSRELIATVR
TYLACGMHVERTATRLFVHQNTVRYRLARFEELTGASLRDTEVVTEVWWV
LELAAMRL
>MAP3232 hypothetical protein
MRRRVETERVTTEADRPGVELVALVASAGGLEALTTVLRDLPRDFPAAVV
VQQHLAGHDSLLATILTRQSGRPVGWAANGRAVTPGEVVICPPGKALELT
PQGRCRLHGAQQHGARGADVLLTSIAGSYGPRGVAVVLSGSGRDGAAGTV
AMRRAGGVVIAESPATALYPSMPIAAAQAGADLVLGIGEIAPVLADLVHG
LPLPRRSPPADAPDEAYLDGGVDPDGIFARLAARFGANTPAARAELARLR
AAELRRRRQELSAGVGATRETVATARRRAEESRRRALLAHRAADEAWARS
EQEHRDRDG
>MAP0380 hypothetical protein
MSAPDSITTLVEDHDGVSVVSVSGEIDMVTAPALEQAIGAVVADSPPALV
IDLSAVEFLGSVGLKILAATYEKLGKETGFGVVARGPATRRPIHLTGLDK
TFPLYPTLDDALTAVRDGKLNG
>MAP1742c hypothetical protein
MSDPKHRGILVCVHGSAASDAAVAWAAREAAMRGLPITVIHAVAPVVVGW
PVGQLYADMPAWQQDGAQQVIDQARKIVIANQGGGTPPEMRTEIIYSAVT
PTLIDASRDAWMIVAGSQGLGALGRLLLGSVTAALLAHAHCPVAVVHADD
HAARAADAPVLVGTDGSPASEAAIALAFDEASRRGVGVVALHAWSDVGVF
PILGMDWRDSEAKGEELLAERLAGWQEQYPDVHVKRLVVCDTPSRWLVAE
AERAQLVVVGSHGRGGFPGMLLGSVSSHVAQSATAPVIVVRGR
>MAP0916 hypothetical protein
MRILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIASDRPDALVL
DVMMPRLDGLEVCRQLRSTGDDLPILVLTARDSVSERVAGLDAGADDYLP
KPFALEELLARMRALLRRTKPEDDAESVAMTFSDLTLDPVTREVTRGQRR
ISLTRTEFALLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYVGYL
RRKTEADGEPRLIHTVRGVGYVLRETPP
>MAP2078c hypothetical protein
MAPERLRRRTVTGAGRMVPGGAAPTWPRTRPFTLVSMDSDRDDSAAIRRD
RRELDRMHTQLDELMAARDQLEQLVRVIVEIGSDLDLDVTLRRVLKAAME
LTGARYAVLSSRASDGALLSFVHAGLDADTARQLGELAVGDGLRIDDVSV
DARSAHLSAHDPPLRALLGIPITVRAANFGNLYLADDRQGRVFTDSQEGA
VRAMATAAAAAIDNARLFERERESAKWTKASREITTALLSGDPQTGPLQL
IVNRALELAGAEQAILLVPREPESPGREADTLVVSATAGRYASQVIGRQV
PMDGSTTGGVARRGLPLITDSFQYPIEGFTDVGERSAIVIPLIADGAVLG
VIAVARDPQQPPFGNDYLDLVSDFARHAAIALALAAGRQHALNQELAQAD
TVDEAVRAAAEELRRLWRARRVLAVTFPSHTSATETAFGPPEVVSVGEPT
QWADLPSHMHQALGALRDGDLLTPNTTQPGTAGIALQHPDGVLVVWIDLT
DERSFTLEDQTLLTVLAGRLGQGLQRVHQVDQQRETALALQHAILGPADL
PNGFAVRYQAATRPLQVGGDWFDIVDLEDGRVALVVGDCVGHGLASATVM
GQVRSACRALLLENPSPAAALAGLDRFAARLPGAQCTTAVCAVLTPDTGE
LVYSSAGHPPPIVVHGDGSTQILDEGHTIALGIRRDWPRPEARVTIPARA
TLLLYTDGLVERRRSPLDRGIYRVATVAQDARGSSLDELATRIMSAVAPS
GGYQDDVVLLLYRHPAPLELNFPADVNQLAPTRTALRNWLRRVRLDPEQT
LNVLVAAGEAVANAIEHGHRHSPQGTIRLGATALGDQVRLTITDTGTWKV
PQATSYPHRGRGIPLMKSLMNDVDIRSDTGGTTVQLSARIT
>MAP1375c hypothetical protein
MRATTSTNACFRLAGNLDHTLSAPQHEQVWKRAKRGIDMSTSPSGLHGRD
RAAMTVNSNDETVPCAEPLGSDVVEGDGDLSAVERVLGLGEPQRVGRFRF
FLDGHRWEWSDAVARMHGYRPGQVQPSTELLLQHKHPDDRERVAAVLDQV
MRGKPFSSRHRIIDTAGRTHWVVVAGDRMLDDSGALAGTSGFYVDVTDSL
HSDITNVLSAVADSRARIEQAKGGADGRLWHLRRAGVRHPGVALAGNQPQ
AARRRRPVSGRGGQQGVPGNAAPGRPRPADPGMTAQGRIARLLLRPSPAG
IVAAQGRIARLLLRPSPAGIVAALGSGRWRTYWELRPYTWNTRHRWSSSR
SRSGSTTAPASASSGATATANPACCGC
>MAP3233c hypothetical protein
MNSTTDEADEAFEALLRYMRDSRGFDFTGYKRTSLMRRVRHRMDQAGYTA
FDEYLDFLQASSDEFTALFNTILINVTSFFRDPDAWEFVAAEVIPRLLAE
RGPNDPIRVWSAGCASGQEAYTLAMLLAEALGFDAFRQRVKIYATDVDED
ALTEARGASYDARAVESVPADLLARYFEQVNGRYVFHKDLRRAVIFGRND
LVKDAPISRVDLLVCRNTLMYLNAETQRNVLGRLHFALAPQGTLFLGHAE
MLLSHGDRFTPLSLKHRIFRKAVGSHSGLDHYDPAGAFYERHADLPGLAT
VRDLAFRASPVAQIVVTGEDTVAMINQQAETVFGLSARDIGRLLRDLEVS
YRPVELRAYLEQAKVERRSARIQDVKWQRPGAETVWFEIHVNPLVDAENG
LLGVSIVFFDVTATRALLDKVVQTNRQLEAAYEELQSTNEELETTNEELQ
STVEELETTNEELQSTNEELETMNEELQSTNDELHTINDMLRERSLELDD
AKSFLDSLVDSVRMGMVVVDREMKVILWNRGCEELWGLRADETTGTMLTA
LDIGLPLDAVRPLIGNAFVDPDGSGEVVVDAVNRRGRPARVRVTCTSFRG
NEGTVGGALLLMDVVG
>MAP2496 hypothetical protein
MRHAKSDYPDGVADHDRPLAPRGIRQAGLAGDWLRAGAPAIDAVLCSTAT
RTRETLRNTRIEAPVRYSERLYASTPGIVIDEINTVGDDVSTLLVIGHEP
TMSALALGLGGGSGANPAAAERISNKFPTSAIAVLAVPCRWTELELGAAT
LTDFVVAR
>MAP1101 hypothetical protein
MFSGGARACRIEGVHLGISARSAAVAAIVVFCALTVAGAGLDAILYRSLL
SGVDYAAAERVRDIAKALQSDSREELDNALLTTDQRVVATQLIGPDGSVV
KRSGSAPATPLVPTADFDFTLRRGLPDDAVSGDDMRVSGQKVATRSGVYT
VLVGGGSEAVEATARTLAILLATTAPVIVAVAAGATYWLVRRSLKSVDAI
RARVADISASDLAERVPVPDGRDEIAALAVTMNEMLSRLEAGHRAQQRFV
GDASHELRSPLTTIISGLEVAEAHPELLDAELAVNTLLPEAHRMRALIDD
LLLLARADERNLLRRKEPVALDDLAEAEAARVRRDADCAVHTDIRPARLL
ADPTAMTRMIRNLVDNAVRHAVSCVAIEVGSRDGTAVLTVGDDGPGIPPA
QRSRVFERFVRLDTDRARSGGGAGLGLAIVAEVVAAHGGTVTIGDRPGGG
TTLTVALPQHPAQHSSR
>MAP3391c hypothetical protein
MIPAQAAVTPAAQRQSLKPLSVLLVEDDRGDAVLVEELVTDAVTDIRVVW
AQSMAHAERELESTRPDCVLLDLHLPDPSGIDALNRIAKLDATVPIVVLT
GLNDEYFGASAVAAGAQDYLVKGRVEPEMLRRALLYAIERKRAELIAADL
HATQLRARENALLERGLLPSPLLLDDPGVDIVARYRPSREDALLCGDFYD
VVQTPDRVVHVLIGDVAGHGPHEAALGGGAADRVPSTHPGRRARRRTDAP
AGAGAALRTHRHRDLRDRAQPGDLAAAARGQRHPRRPSGDAAAGRRHRGM
GRTAGRAGARAARRRLAAPPAAAARGARPAVAHRRALRGVFGAGDPTARR
GRPARPGPLAGRSLRPGVRRRPDRRRPATGPAARRPHR
>MAP1318c hypothetical protein
MAAKSCGAAPLWPNGSSKRPDCVAAARAQSRARNQHYADSAARQSRVLAI
AAWLAVLVCADFILLQVVTGAWTWQIVSLNVVAALIFAIVPWLHRFGDLV
APLTFLGAAYVTVFVSCWDVGTASGGQYFFLVGACLVLLVMGIEHTVLAA
VLAAIGAGLVIAVEFAFPRDTGLQPAWAQSMSFVVTIVSACIMVFVAVWF
ALRDTQRAEAVMESEYQRSEALLANMLPGSIAERLKHSDGDIIADKYDDA
SVLFADIVGFTERASGTAPADLVRFLNRLYIAFDDLVDKHGLEKIKVSGD
SYMVVSGVPRVRPDHAAALADFALDMVKVAAALKDPYGRPVPLRVGMACG
PVVAGVVGSRRFFYDVWGDAVNVASRMESTDSVGRIQVPETMYERLAPEF
VLQERGRIEVKGKGVMRTWYLIGRKPAEAPTDLVAEQPHTAHV
>MAP0260 hypothetical protein
MADTRRVWSLRLRLLVGQIVVLALVCLGITAVTELALNHHLVKQLDGQLA
GTSFRSALMYPEPNHPRHEHSPYPRPGPGPRFLDAPGQPAGMVAAVVSHG
KTVDAGYLTSSGARDALTSTAQTQLEQIAGSRAPVTVNLDGLGRYRVVAA
PSRNGSDIIVTGLSMANIDATLIRMLVIFGIVTVIALAAATIAGVVIIRR
ALAPLRRVAQTAREVADLPLARGEVELPVRVRESDANPSTEVGQLGAALN
RMLDHIADALSARQASETRVRQFVADASHELRTPLAAIRGYTELTQRMGD
DREAVAQAMSRVASETERMTRLVEDLLLLARLDSGRPLEREPVDLSRLAV
DAVNDAHVAGPDHQWELDLPEEPVVVTGDAARLHQVLTNLLANARVHTGA
GTVVTTRLSTEPAHTVLQVIDNGPGIPAELQSEVFERFARGDSSRSRKGG
STGLGLAIVSAVVKAHNGTISVNSSPGHTEFTVRLPPNGWQPHASA
>MAP2616c hypothetical protein
MQPGRRHADGARPGCTLKRPGFTLDERTRAARVIQPVRYRWPRAARISDV
PFRNVAIVAHVDHGKTTLVDAMLRQSGALHHRGDDTQERILDSGDLEKEK
GITILAKNTAVHRHHPDGSVTVINVIDTPGHADFGGEVERGLSMVDGVLL
LVDASEGPLPQTRFVLRKALAAHLPVILVVNKTDRPDARIAEVVEASHDL
LLDVASDLDDEAAKAAEHALGLPTLYASGRAGVASTEQPADGEVPAGENL
DPLFDVLLEHIPAPSGDPEAPLQALVTNLDASTFLGRLALIRIYNGKLRK
GQQVAWMREVDGLPVITSAKITELLATEGVERSPTDEAIAGDIVAVAGLP
EIMIGDTLADPDHAHALPRITVDEPAISVTIGTNTSPLAGKVPGHKLTAR
LVRNRLDQELVGNVSIRVVDIGRPDAWEVQGRGELALAVLVEQMRREGFE
LTVGKPQVVTRTIDGQLHEPFEAMTIDCPEEFVGAITQLMAARKGRMEEM
TNHAAGWVRMDFIVPSRGLIGFRTDFLTITRGTGIANAVFDGYRPWAGEI
RARHTGSLVSDRAGTITPFAMIQLADRGQFFVEPGQDTYEGMVVGINPRA
EDLDINVTREKKLTNMRSSTADVSETLAKPLELDLEQAMEFCAADECVEV
TPEIVRVRKVELDATSRARSRARAKARG
>MAP0378c hypothetical protein
MPVPILKQGAILIASVQAALSDSDAERLRYDLMERVSRFRAQGIIVDVTA
IDVMDSFAARSLRTIAHMTRLRGADTVIVGLQPEVAFAMVQLGLAFDDMN
TALDLEEGIALLNRQLAQRKPTIGRDGGG
>MAP1346c hypothetical protein
MSIGVAAGAPGDETTSDLLRRVDQATRSAKSSGGNNVATFRPEMSTTDTI
RNDIELHLEGMIDGASGALVLHYLPEFDMRTGAVLGTEALLRWQHPTLGL
LMPDSFIQVVESINLGAKLGRLVMHSACAQFGLWQSRGVGKGAVLRINVS
PVQLVTAGIVDTVAATLDEFRLDPSTICLEITESVVVQDIDATRQTLFGL
KDIGVQIAIDDFGTGYSVLTYLKSLPVDSLKIDKGFVHSVDTNAGDLAIV
RSTLALADAFGLGVVAEGVETVAAAQTLLSLGCHRAQGFLLSRPLDSAAM
ESLLAQRVVPMNFSETGPAV
>MAP3472c hypothetical protein
MFSRPAEPAAAEAPGPAPLGGDAVPAKRPPAWSLSNWPVRWKVLAIVLVP
LMLATVFGVLRIHGAMANAAGLRLAAARADVVPAITKYMSALDVALLAGS
AGRDAEGAKKNYEARKNELQTRLGDTDVTDDVRAGVNNLLDAGQMLVNKV
AEGSLGLRERVTFYAPILLTAENVINASVRVDDERIRAQAQGLSRAVGAR
GQMTMQKILVTRGAELPEPQLRTSMATLAGTEPSTLFGMSEVLGAGSPEA
KTLQQQMVSRMAIMSDPASVLVDNPDLLRSIQTTDDIADQVIKNTTASVT
KSVHAQAAERRNSAILDTALVLAAIVIALAVVLLVARALVRPLRTLRDGA
LKVAHTDLEEEIAHVKAGGAEPIPAPLPVYTTEEIGQVAHAVDELHTQAL
LLAGDEARLRLLVNDMFETMSRRSRSLVDQQLALIDRLERNEDNPERLDN
LFRLDHLAARLRRNSANLLVLAGAKLARDQRDPVPLATVINAAVSEVEDY
RRVEIAGLPECSLLGAAAGGAIHLFAELIDNALRYSPPTTSARVSASRGG
DGGVVVRIADSGLGMNDADRRIANMRLQAGGDANPDPHPENARHLGLFVV
GRIAAWHGMRVGLRGPAANESGSGTTAEVYLPPTVLAGRVVAEPSGPRHI
RAVSSPSAKLASAIAAPAEEGGRHDGTQQPAARGTRNAGDASTPPVTLLP
RRNPGSSGIADVAAVPAPPAEPQPRRQRRELATPWWEASPAPRPAPERAP
EPAPEPQQPAPRPARAASDTSAFFAARPRTETPPEAPPKPPPKPKPKPEA
SIPPAASTGPADDDVIYRRMLSEMLGDPHDLVNSPDLDWQSVWDRGWTLA
AAAEDKPVESHTTDHGLPVRTPGARLVPGGAHGAAAEPDDEPGLNGRSPA
PRRPQHAAVTRDPEAVRASFSSHFGGVRTGRSHARESSEGPDQQ
>MAP3179c hypothetical protein
MSTAYPAPAIVVGVDGSRAAMHAAVWAIDEAVGRDIPLRLVYVIDPHGAP
GGHGPDTRLAAARAALADAHRAVDAFAQPVKVETEILWGNTAFKLLEQSR
SAVMLCVGQIGLNHACHGGPAIATSLVRSALCPVAVVQQAPSLPAAARVS
GVVAEVDNGTVLRHAFEEARLRGVGLCAVGNPSARVELERRLARWMRLYP
DVQAESAVLTGGVEQHLRADHRAGRLLVTDAYRAEALCHAGHSVLAVRCG
NL
>MAP0386c hypothetical protein
MTNFDDLTLLNDVLRAIYSADTADFPQVLDDLTAASARWVPGAQEAGITV
TSRQNEVSTPSVTDDCARLLDEFQQRYLEGPCLHAAWTRKVVVVDDLRTD
SRWPKYQADALARTPIRSILSLPMYAGELSMGALNFYAERPHAFSDDSRR
MAALFATLGSLAWSNVVRTQQFKEALSTRDMIGQAKGILMERYELDDETA
FNTLIKLSQSMNTPLRDIARRVIDDTTQR
>MAP2031c hypothetical protein
MSSSSRGSRLGTRFGPYELRSLIGTGTLGEVYRAYDTVKDRLVALKLLRG
ELDAGFRQRLWRDCRAVTRLQEPHVLPLHDFGEMDGVPFIDMQLVDDGGS
LKELLREQGGLEPSRAASITGQVARALDAAHAAGLMHLDVKPENILLTHD
HFTYLADFGLAQAAGDDKLSRTYMAPERFTTGSLGPQTDIYSLACVLYEC
LTGQPPFEGADPGELRSAHLLSPAPRPSIMRRGVGRAFDDIITRGMAKQR
SARFGSAGELARAASEAVFAAYEPVSAAAGLGGPRPLPTPPAQFDGPDDT
LGPPAAERPPRGRVGRLPVVVTAVAVLMLIAGVVLSVKSVVGTHHNSSAP
PPAPSTRALAPPPPTTPPPPLTPTLSRPVTGADGLGFIGETARCDPGNPP
AAVVRTAKSLAVVCQNLSGSYYYRGERIRDGAHIELSNAERVEDGFDVTN
PVDGVVYEVRPNRLRIISFGHVDSSEPVLQYATAS
>MAP4266 hypothetical protein
MDRIWQWVWDRHGAKYPWVIWAFGFTSMFVTYALWSLVITCYERSSHYLE
AVVVTGVGVVLQSLVVLPVRRRFRLMRPVSASDQVDRFQALNETYVWSRT
ARIRQLWFVPIWAATFFAIVSVLAGAHGMRVVEYVIVGGAMGIVTTLISL
HTFMEGSLRPVRAALAGDSGIGDALPRSRPTFAWWLEVSMLSALCNFTTA
GMMVGALLGRVSDSPLLPLLIACVATAALAAPVTVGAIVSPALHPIRDLA
EGTERVAAGDYTRRVPVVQDDDLGALTASFNRMQAGLTERQRLQAAFGTY
VDPALAARLLEQGDDVFTGERREVTVMFVDVRDFTPFAEANSAEDTVARL
NALFEIVVPAVVDGGGHVNKFLGDGALAVFGAPNDLADHADAAVSAALLI
HRLVAKRFGGVLRIGIGINTGVVIAGTIGGGGKLEFTLIGDAVNVAARVE
QLTKTTGDAILVTQQTVDALVSRPPGLSDRGTHALKGKSAPTAVFGLDPA
VTPSHRLG
>MAP2855c 35kd_ag, 35kd_ag
MANPFVKAWKYLMAKFNATIDERADPKVQIQQAIEEAQRTHQALTQQAAQ
VIGNQRQLEMRLNRQLADVEKLQVNVRQALTLADQATAAGDTAKATEYNN
AAEAFAAQLVTAEQSVEDLKTLHDQALNAAAQAKKAVEQNAMVLQQKIAE
RTKLLSQLEQAKMQEQVSASLQSMSELAAPGNVPSLDEVRDKIERRYANA
IGAAELAQGSVQGRMLEVEQAGVQMAGHSRLEQIRASMRDEALPTGGTPA
AGGTQAAPAPGQGAGDAVSEKPLGQ
>MAP3405 Rv0516c, Rv0516c
MESAMSVAQSWQQSRTAQFTARWGLTGTLITVDGELDAANADQLAAYVQQ
SVNRSRRVILDLRGLNFIGTAGFSALHRINVICSAAQTSWAMAPSPAVAR
LLRLCDPDGTLPVTTPKAEPLLEPRRANDDESPGPLLQLVTKPR
>MAP3118 cstA, CstA
MAITERDDGVSYVHTDHNLPPVAIVDRSPITARHRIVFAVIAFARGEPVN
AVWFVVAAICSYLIGFRFYARLIERKIVHPRDDHATPAEILDDGADYVPT
DRRVLFGHHFAAIAGAGPLVGPVLAAQMGYLPCTIWIVAGAVFAGAVQDY
LVLWISTRRRGRSLGQMARDELGASGGLAALLGAFVIMVIIIAVLALVVV
RGLAQSPWGVFSIAMTIPIAVFMGCYLRFLRPGRVAEVSVLGFALLLIAV
ATGSWVAETPWGASWLSLSPVTVSWLIIGYGFVASVLPVWLLLAPRDYLS
TFMKVGAIALLAVGICIARPVMQAPAVSHFASRGDGPVFSGALFPFLFIT
IACGALSGFHALISSGTTPKLLEKESQMRLIGYGGMLTESFVAVMALISA
AVLDQHLYFTLNAPAAQTGGTAATAAHYVNGLGLPGAPTTVDQLNRAAAA
VGEKSIVSRTGGAPTLAVGMAEVLYRVFGGAGLKAFWYHFAIMFEALFIL
TAVDAGTRVARFMLSDTLGNLGGPLARLRNPSWRPGVWLCSLAVAAGWGG
ILLMGVTDPLGGINTLFPLFGIANQLLAAIALTVITVIVVKKGLLKWAWI
PGAPLLWDLVVTLSASWQKIFSADPAVGYWTQHFQYLAAKSAGRTSFGSA
RNAHQLDQVIRNTFIQGSLSIAFAAAVVVVVIAGVLAALGAIRGPASRLS
RPLTEDEPVPSALFAPSGLIATAAEREVQRRWDAPGGKVVPKPAPRS
>MAP0230 embR, EmbR_1
MGFGVLGPLSVTAGGAQLPLGAPKQRAVLAMLLIHRNRPVSVDALIDGVW
GAAPVPAARTSIQSYVSTLRRLLRSALPNPNGLLASVPPGYQLNVADADC
DLGRFRGHKTAGVQAASRGRFEEASGHLAAALREWRGPVLDDLHNFAFVD
AFANLLLEERVAAHTARAEAEIACGRADGVIGELEALVAQHPYREPLWAQ
LITAYYVTERQSDALGAYRRLKTALAEGLGIDPGPTVKALQQRILRQEPL
GPGQPGKAARRAMLSTHKQSTVRSESAAAVIDNPVVARLRDKAGRHYQLN
GVTTRIGRLDDNEIVLDDTEVSRHHAVIIDTGTDFLITDMKSTNGVQVRG
RRVQRSATLVDGDHICIGNSEFVFEIRPA
>MAP2503 embR, EmbR_2
MAKRLEFGLLGPLEMTVDGDLVPLGTPKQRAVLAMLLMNRNNPVGIDRLI
TGLWDESPPSGARASLHSYVSNLRKLLSNAGVDPRTVLVAAPPGYRLNIP
DDRCDLGRFITEKTAGVQAAASGRFEQAGAHLAAALAQWRGPVLEDLTEF
RFVGTFATSLVEEKILTHIAQAEAEIACGRAFSVITELESLTREHPYREQ
LWAQLMTAYYLTDRQSDALAAYRRVQKSLADDLGIDPGQNLRMLNDRILR
QEPLDAKKNAMTTAAVTVTVLEHYTLASGRNAGALLHDAAGRAFPLRGST
TIGRLDDNDIVLDSPKVSRHHAVIVDTGTSYIINDLRSSNGVHVQDQRIH
SAAVLQDGDRIRICEHEFTFRLAEENQHAG
>MAP1965c glnE, GlnE
MVVTKPATQRPRLPSVGRLGLVDPQAAERMAQLGWYDHDDQAHVDLLWAL
SRAPDPDAALLALVRLAETPDAGWDELGAALLTERPLRGRLFAVLGSSLA
LGDHLVAQPRSWKLLRGNVSLPTHDELCAMFTGCVDEALADPGSAMVRLR
TLYRDRLLVLAALDLAATVEDEPVLPFTVVAAHLSDLADAALAAALRVAE
HNVCGDRTPPRLAVIAMGKCGARELNYVSDVDVIFVGERADTVTTRVASE
MMRLASEAFFQVDAGLRPEGRSGELVRTVESHIAYYQRWAKTWEFQALLK
ARAAVGDAELGRRYLDALMPMVWVACEREDFVVEVQAMRRRVEQLVPADV
RGREIKLGSGGLRDVEFAVQLLQLVHGRSDESLHVASTVDALAALGQGGY
IGREDAANLTASYEFLRLLEHRLQLQRLKRTHLLPEADDEEAVRWLARAA
HIRPDGRHDAAGVLREELRHQNLRVSQLHAKLFYQPLLESIGPAGLEIRH
GMTSEAAERQLAALGYEGPQSALKHMSALVNQSGRRGRVQSVLLPRLLNW
MSYAPDPDGGLLAYRRLSEALAGESWYLSTLRDKPAVARRLMHVLGTSAY
VPDLLMRAPRVIQDYGDGPSGPRLLETDPAAVARALVASASRYSDPVRAI
AGARTLRRRELARVASADLLGMLEVTDVCKALTSVWVAVLQAALDAMIRA
NLPDDGPQRGKAPAAIAVIGMGRLGGAELGYGSDADVMFVCEPAPGVDDS
AAVRWAASVAEQVRTLLGTPSVDPPLDVDANLRPEGRNGPLVRTLASYAA
CYEQWAQPWEIQALLRAHAVAGDAELGHRFLLMADKTRYPADGVSPEAVR
EIRRIKARVDAERLPRGADPNTHTKLGRGGLADIEWTVQLLQLLHAHEVP
ALHNTSTLECLDAIAEAGLVPADEVDLLRQAWLTATRARNALVLVRGKPT
DQLPGPGRQLNAVAVAAGWPTDEGGEFLDNYLRVTRRAKAVVCKVFGS
>MAP3894c glnH, GlnH
MLVAAGCGHTESLRVASVPTLPPPTPVGMEQLPPQPPLPPDGPDQNCDLT
ASLRPFPTKAEADAAVADIRARGRLIVGLDIGSNLFSFRDPITGEITGFD
VDIAGEIARDIFGAPSHVEYRILSSDERVTALQRGEVDVVVKTMTITCDR
RKQVNFSTVYLDANQRILAPRDSPITKVSDLSGKRVCVAKGTTSLHRIRQ
IDPPPIVVSVVNWADCLVAMQQREIDAVSTDDSILAGLVEEDPYLHIVGP
NMATQPYGIGINLNNTGLVRFVNGTLERIRRDGTWNTLYRKWLTVLGPAP
APPTPRYLD
>MAP0052c glnQ, GlnQ
MVCGSSALLAPPAAADRNQCAPAGPKSAAVLPENLTRGGSMSQPDEHTTP
TVEPLSSVRIDTLGLGTPGVLTVGTLSGAPPNVCITPTGQYSGFDNQLLR
AIAGKLGLQVRFVGTDFAGLLAQVASRRFDVGSSSIKATDERRQTVAFTN
GYDFGYYALVVPPGSAIKEFTDLAAGQRIGVVQGTVEESYVVDTLHLQPV
KYPDFATVYASLKTRQIDAWVAPAGQAATTIQPGDPAVVVANTISPGNFV
AYAVAKDNKPLVDALNSGLDAVIADGTWSNLYSQWVPRTLPPGWKPGSQA
AAPPKLPDFAAIAASQHHKAVGPAAPKSTLAQLRDSFFDWDMYRQAIPTL
LRTGLPNTLILTFSASVIGLVLGMVLAVAGISHARWLRWPARVYTDVFRG
LPEVVIILLIGLGIGPIVGGLTNNNPYPLGIAALGMMAAAYIGEILRSGI
QSVDPGQLEASRALGFSYATAMRLVVVPQGVRRVLPALVNQFIALLKASA
LVYFLGLVADQRELFQVGRDLNAETGNLSPLVAAGACYLILTVPLTHLVN
MIDARLRRGRATVDPEEPADMLTFGQEIT
>MAP0996c kdpD, KdpD
MMVDVTDVRDHHPKRGELRIYLGAAPGVGKTYSMLGEAHRRLERGTDLVA
GVVETHGRAKTAELLEGIEIIPPRYIEYRGGRFPELDVPAVLARHPQVVL
VDELAHTNTPGSKNPKRWQDVEELLDAGITVISTVNVQHLESLNDVVAQI
TGIEQKETVPDSVVRQASQVELIDITPEALRRRLSHGNVYAPDRIDAALS
NYFRRGNLTALRELVLLWLADQVDTALAKYRAENKITDTWEARERVVVAV
TGGPESETLVRRASRIASKSSAELMVVHVIRGDGLAGLSESRMAKIRELA
SSLDASLHTIVGDEVPAALLEFAREMNATQLVIGTSRRSRWARLFEEGIG
PRIVELSGKIDVHLVTHEESKRGFRASSLAPRERRVASWLAALIVPSVIC
AVTVTWLDPYLDTGGESALFFVGVLLVGLLGGIAPAALSAVLSGLLLNYY
LIAPRHSFTIAEPNSAITELVLLLIAVAVAVLVDFAAKRTREARRASQEA
ELLTLFAGSVLRGADLETLLERVRETYAQRSVSMLRESEDARAGGTKTQV
VACVGRDPCVSVDAADTAIEVGGPDSSEFQMLLAGRKLSARDRRVLSAVA
RQAAGLIRQRELAEEASRTEAIVRADELRRSLLSAVSHDLRTPLAAAKVA
VSSLRAEDVAFSPTDTAELLATIEESIDQLTALVGNLLDSSRLAAGAIHP
DLRRVYLEEAVQRALVSIGKGATGFFRSAIDRVKVDVGDAMVMADAGLLE
RVLANLIDNALRYAPNCVVRVNAGQVGDRVLISVIDEGPGIPHGAEEQIF
EAFQRLGDHDNTTGVGLGMSVARGFVEAMGGTITATDTPGGGLTVMVDMA
APQSEGAA
>MAP0995c kdpE, KdpE
MTRVLVIDDEPQILRALRINLSVRGYEVVTASTGAGALRAAAEHKPDVVI
LDLGLPDISGIDVLAGLRGWLTAPVIVLSARTDSSDKVEALDAGADDYVT
KPFGMDEFLARLRAAVRRNTAASEMEQPVVETESFTVDLAAKKVTKNGSE
VHLTPTEWGMLEVLVRNRGKLVGREELLKEVWGPAYATETHYLRVYLAQL
RRKLENDPSHPKHLLTESGMGYRFEA
>MAP2836 lexA, LexA
MHAVDPSLTERQRTILNVIRSSVTSRGYPPSIREIGDAVGLTSTSSVAHQ
LRTLERKGYLRRDPNRPRAVDVRGVDDDVAAPATEVAGSDALPEPTFVPV
LGRIAAGGPILAEEAVEDVFPLPRELVGDGTLFLLKVVGDSMVEAAICDG
DWVVVRQQHVADNADIVAAMIDGEATVKTFKRAGGQVWLMPHNPAFDPIP
GNDATVLGKVVTVIRKV
>MAP3604 mce1, Mce1_2
MARPVQTNAPRTPPYKLAGLAILVVGALALALIYGQFRGNFTPKTSLTML
ASRAGLVMDPGSKVTYNGVEIGRVGTISETVRDGKPAAKFTLEVYPRYLK
LIPSNVNADIKATTVFGGKYVSLTTPANPSPQKITPHTIIDARSVTTEIN
TLFQTITSIAEKVDPVKLNLTLSAAAQSLSGLGEKFGQSVVNANALLDDV
NPRMPQARKDIQGLAALGDTYADASPDLFDFLNNAVITSRTINAQQKDLD
QALLSAAGFGNTGADLFNKGGPYLARGAHDLVPTAQLLDTYSPEIYCLMR
NEHDALPATGAAEGGFNGYSLNMNTEALSGLGLIANPVSAVPVIASMVGG
IVGVVGGAPNPYIYPENLPRVNARGGPGGAPGCWQPITHDLWPAPELVID
SGNSLAPYNHLDTGSPYAIEYVWGRQVGDNTINP
>MAP3289c mce1, Mce1_1
MRSRDANSDSLMQINSQRIPPYKLLAVAVLLVLSLILALIYGQFRGAFTP
KTSLTMLASRAGLVMDPGSKVTYNGVEIGRVGTISETVRDGKPAAKFTLE
VYPRYLKLIPSNVNADIKATTVFGGKYVSLTTPAHPSPQKITPHTIIDAR
SVTTEINTLFQTITSIAEKVDPVKLNLTLSAAAQSLSGLGEKFGQSVVNA
NALLDDVNPRMPQARKDIQGLAALGDTYADASPDLFDFLNNAVITSRTIN
AQQKDLDQALLSAAGFGNTGADLFTKGGPYLARGAHDLVPTAQLLDTYSP
EIYCLMRNEHDALPATGAAEGGFNGYSLNMNTEALSGLGLIANPVSAVPV
IASMVGGIVGVVGGAPNPYIYPENLPRVSARGGPGGAPGCWQKITRDLWP
APELIMDTGNSIAPYNHLDTGSPYALEYVWGRQVGDNTINP
>MAP3360c mtrA, MtrA
MDSMRQRILVVDDDASLAEMLTIVLRGEGFDTAVIGDGTQALTAVRELRP
DLVLLDLMLPGMNGIDVCRVLRADSGVPIVMLTAKTDTVDVVLGLESGAD
DYIMKPFKPKELVARVRARLRRNDDEPAEMLSIADVEIDVPAHKVTRNGE
QISLTPLEFDLLVALARKPRQVFTRDVLLEQVWGYRHPADTRLVNVHVQR
LRAKVEKDPENPTVVLTVRGVGYKAGPP
>MAP3359c mtrB, MtrB
MWGSRRRTRSRWGRSGPMTRGMGAVSRAVGTAWRRSLQLRVVALTLGLSL
AVILALGFVLTSQVTNRVLDVKVKAAIEQIERARTTVGGIVNGEEARSLD
SSLQLARNTLTSKTDSASGAGTAGTFDAVLMVPGDGPRAATTAGPVDQVP
ASLRGFVKAGQASYQYATVHTDGFSGPALIVGSPASSQVANLELYLIFPL
KNEQATIQLVRGTMITGGAVLLVLLAGIALLVSRQVVVPVRSASRIAERF
AEGHLSERMPVRGEDDMARLAMSFNDMAESLSRQITQLEEFGNLQRRFTS
DVSHELRTPLTTVRMAADLIYDHSADLDPTLARSTELMVNELDRFESLLN
DLLEISRHDAGVAELSVEAVDLRSTVQSALSNVGHLAEDAGIELQVELPA
EEVIAEVDTRRVERILRNLIANAIDHAEHKPVKIRMAADEDTVAVTVRDY
GVGLRPGEEKLVFSRFWRADPSRVRRSGGTGLGLAISIEDARLHQGRLEA
WGEPGVGSCFRLTLPLVRGHKVTTSPLPMKPIPQPSPSGGQSPSTGPQHA
KDRARQREHAERSL
>MAP0689c narL, NarL_1
MADPATRETVRVVVADDHPLFREGVVRALVSSGAVNVVGEAEDGSAALEL
IKSHQPDVALLDYRMPGMDGAQVAAAVRADGLATRVLLISAHDESAIVYQ
ALQQGAAGFVLKDSTRSEIVKAVLDCAQGRDVVAPALVGGLAAEIRQRAE
PTGPVLSAREREVLHRIARGQSIPAIAGELYVAPSTVKTHVQRLYEKLGV
SDRAAAVAEAMRQGLLS
>MAP3275 narL, NarL_2
MTDDATPTTVMVVDDHPIWRDAVARDLAESGFAVVATADGVTAAQRRAGV
VRPDVVVMDMQLPDGDGAQATAAVLAVSPSSRVLVLSASDERDDVLQAVK
AGAAGYLVKSASKAELAQAVTDTAAGRAVFTPSLAGLVLGEYRRIAQSKD
DGPTRPRLTDRETEVLRYVAKGLSAKQIAEKLSLSHRTVENHVQATFRKL
QVANRVELARYAIEHGLDEEP
>MAP2160c phoH, PhoH
MTPRDTSAADAAGALQADAQVRISIDVPPDIVMGLLGSADENLRALERSV
IADLHVRGNAITISGESADVARAERVISELVAIVANGQVLTPEVVRHSVA
MLAGTDNESPAEVLTLDILSRRGKTIRPKTLNQKRYVDAIDANTIVFGVG
PAGTGKTYLAMAKAVNALQTKQVSRIILTRPAVEAGERLGFLPGTLSEKI
DPYLRPLYDALYDMMDPEVIPKLMSAGVIEVAPLAYMRGRTLNSAFIVLD
EAQNTTAEQMKMFLTRLGFGSKIVVTGDITQVDLPGGATSGLRSAMEILD
RVDDIHVAELTSVDVVRHRLVSEIVDAYAKFEEPGLTMNRAARRASGSRG
RR
>MAP2697c phoH2, PhoH2
MTDIRTYVIDTSVLLSDPWACSRFAEHEVVVPLVVISELEAKRHHHELGW
FARQALRLFDDLRLLHGRLDQPIPVGTQGGSLHVELNHTDPAVLPAGFRS
DSNDSRILSCAANLAAEGKRVTLVSKDIPLRVKAAAVGLAADEYHAQDVV
ASGWSAMHELETAAEDIDALFTEGEIDLAEARDLPCHTGIRLLGGSSHAL
GRVNPDKRVQLVRGDREAFGLRGRSAEQRVALDLLLDESVGIVSLGGKAG
TGKSALALCAGLEAVLERRTQRKVVVFRPLYAVGGQELGYLPGSESEKMG
PWAQAVFDTLEGLASPAVLDEVLSRGMLEVLPLTHIRGRSLHDSFVIVDE
AQSLERNVLLTVLSRLGTGSRVVLTHDIAQRDNLRVGRHDGVAAVIEKLK
GHPLFAHITLLRSERSPIAALVTEMLEEIAGPH
>MAP0591 phoP, PhoP
MTSATPTDAKPEARVLVVDDEANIVELLSVSLKFQGFEVHTATNGAQALD
RAREARPDAVILDVMMPGMDGFGVLRRLRADGIDAPALFLTARDSLQDKI
AGLTLGGDDYVTKPFSLEEVVARLRVILRRAGKGGAEPRSARLTFADIEL
DEETHEVWKAGQPVSLSPNEFTLLRYFVINAGTVLSKPKILDHVWRYDFG
GDVNVVESYVSYLRRKIDTGEKRLLHTLRGVGYVLREPR
>MAP0592 phoR, PhoR
MVVTRPRRGLPLRVGLVAATLALVACGLAVSGIAVTSILRHSLVSRIDST
LLDASRGWAQAPRRQSSSAYEGPDPGRPPSKFYVRGVSTDGTPFTAINDR
NAEPALPANNDVGPNPTTLPSVNGSDIQWRAVSVRGPHGLTTVAIDLSDV
QHTVSSLVWLQIGIGVAVLAVVGIASFAVVQRSLRPLVEVEQTAAAIAAG
QLDRRVPERDPRTEVGRLSLALNGMLAQIQQALASSESSAEKARGSEDRM
RRFITDASHELRTPLTTIRGFAELYRQGAARDVAMLLSRIESEASRMGLL
VDDLLLLARLDVQRPLEHHRVDLLALASDAVHDAQAMDPKRTITLEVLDG
PGTPEVFGDEPRIRQVLGNLIANALQHTPESADVTVRVGTDGDDAVLEVA
DRGPGMNEQDASRVFERFYRTDSSRARASGGTGLGLSIVESLVRAHGGTV
GVTTAPGQGCCFRVTLPRISDVPAVQAS
>MAP0018c pknA, PknA
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKQEFSQ
DPEFIERFRAEARTTAMLNHPGIAAVHDYGESQLDGEGRTAYLVMELVNG
EPLNSVLKRTGRLSLRHALDMLEQTGRALQVAHAAGLVHRDVKPGNILIT
PTGQVKITDFGIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDATPASDV
YSLGVVGYEVVSGKRPFSGDGALTVAMKHIKEPPPPLPAELPPNVRELIE
ITL
>MAP3387c pknD, PknD
MSDNGPAAQVGSWFGPYRLVRLLRQGGMGEVYEAEDTRKHRLVALKLISQ
QFSGNPEFSARLQREADIAGRLTEPHVVPIHDYGEIDGRFFVEMRLVDGI
DLGSLLHREGPLAPPRAIAIIRQVAAALDAAHAAGVTHRDVTPGNILVTP
SDFAYLADFGIARAASDPGLTQVGTAIGTYYYMAPERFTDDEVTNSVDIY
SLACVLTECLTGVPPYRADTVERLVAAHLTKTAPPLSQLRPGAFPPALDR
VIAKGMAKRPEDRYRTAGEFAAAAHEALTTSEQRKAATILRDGQIAALGA
GAAEQRSTHWPDSFAPSPSAETVVGPSPARAGAPSSGLIRAAPTGSGRVY
APGPDFGRPAAPTDNKRKQWIIVGAVALVALVAFVVAVVGYLSTASSGPA
KQAGGQSVLPFNGIDFRLSPGGVTLDGTGNVYVTSEGMYGRVVKLAAGSG
ATTVLPFNGLYQPQGLAVDGAGTVYVADFNNRVLSMAAGSNSQKELPFSG
LNYPEGVAVDSQGGVYVADRGNSRVLKLAAGSQNQTVLPFTGLNNPDGVA
VDPAGNVYVADTDNNRVVKLDAASNTQSELPFHDLSVPWGIAVDNGGTVY
VTEHDKNDVMKYPPGATSGTVLPFTALNTPLAVAVDRDQSVYVADRGDDR
VVKLVQ
>MAP1049c pknE, PknE
MTLDPDSFGHYRILELLGRGGMGRVYRAYDATTDRVVALKVLPPHLAEDQ
DFQQRFRREARIAAGLNDPHVVPIHGYGEIDGRLYVDMRLIEGRDLAHYI
TENGGRLSPQRAVAVIEQVAAALDSAHRAGLIHRDVKPMNVLVTTARDFV
YLIDFGLARAQADTALTQTGATMGTVAYMAPERFTGTTDHRADVYSLACV
LHECLTGKRPFAGDSLEEQLNAHLNTAPPRPSATAPEVPAAFDAVIARGM
AKDPERRYQSVTELAEAARAALAPGVVEKPSAPTPQPRAARRVRAAVVGA
SALTLAVVAAVVVAMVTHGHGPRGAAPKTPGSPAPGRPAPPLPAFVAPPD
LGANCQYRAVPDPSSRPVSPPPSGRVPTTPGQIGAVIATNLGDIGISLAN
SESPCAVNSFISLARQRFFDNTQCARLVDSPDGGSLLCGGPDVDGSGGPG
YEFADEYPANQYRPDDPALRATLLYPRGTVVMATEGPNTNGSQFALIFHD
SEMDPQSTVLGTIDPAGLATLDKIARAGIAGNRPSGPPANPVTITSVRIG
>MAP1332 pknF, PknF
MTIGNGASFAGYTILRQLGAGGMAEVYLALHPRLPRRDVIKVLAEAVTVD
PEFRERFNREADLAATLWHPHIVGVHDRGEFNGHLWISMDYVEGTDASRL
VKESYPDGMPLDEVSAIVQAVAGALDYAHARGLLHRDVKPANILLTHPEA
GERRILLADFGVARHLGDISGITETNVAVGTVAYAAPEQLTGSPIDGRAD
QYALAATAFHLLTGAPPFQHSNPIAVIGQHLHEDPPRLSDFRPELAGLDE
VFCQALAKAPEDRFDRCRAFAAAVRRECDGAAAIGPDARSRSVASPPHRR
RGPGRVIAAVTHRFSSQTRWAAALVCAVLVAVAATWSVLYSFQPGAPPAN
PALASKPSPPAAVAAPIAGGPVLNGTYKLDYDQTKRTTNGIGIRHDGAGT
NWWAFRSACTSSGCAATGTRLDDATHQTAGGPDGGQTDTLRFVGGYWQGA
PEQQRVGCTRPGGPAGATQQETIAWSLAPQSDGTLRGTETETVLSNECGA
QGAVVRVPVVATRVGDVPPGVTVADPASVINASPTATAPAPPVLGGLCSD
VGKVAYDPTNNEQIVCEGSSWAKAPITMGVHAAGSSCDRPGTSVFAMSTS
SDGYLLQCDPVTRTWTRPAG
>MAP3893c pknG, PknG
MAEPDNKSEQPEPGAEQMGPGTQPAEVGDDAQAGAATGRLQATQALFRPD
FDDDDDDFPHISLGALDTDSADRMTVATQALPPVRQLGGGLVEIPRGRDI
DPREALMTNPVVPESKRFCWNCGKPVGRSTKKSKGTSEGWCPHCGSAYSF
LPQLNPGDIVANQYEVKGCIAHGGLGWVYLAVDHNVNDRPVVLKGLVHSG
DAEAQAIAMAERQFLAEVVHPQIVQIFNFVEHVDRHGNPVGYIVMEYVGG
QPLRHGKGEKLPVSEAIAYVLEILPALGYLHSIGLVYNDLKPENIMLTEE
QLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTVGRTLAA
LTLNLPTRNGRYVDGIPDNDPVLGTYDSFRRLLRRATDPDPRRRFSSTEE
MSAQLMGVLREVVAHDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDG
QVHSEKLTAREIVTALQVPLVDPADVAAPVLQATVLSQPVQTLDSLRAAR
HGTLDADGVELSESIELPLMEVRALLDLGDVAKATRKLDDLAERVGWQWR
LVWYKAVAELLTGDYDSATTHFTEVLDTFPGELAPKLALAATAELAGDVD
EHRFYETVWKTNDGVISAAFGLARTLSAEGDRAAAVRTLDEVPATSRHFT
TARLTSAVTLLSGRSKSEITEEEIRDAARRVEALPPTEPRVLQIRALVLG
CAMDWLEDNKASTNHILGFPFTEHGLRLGVEAALRNLARVAPTQRHRYAL
VDMANKVRPTSTF
>MAP1914 pknL, PknL
MLDGRYLIESKIASGGTSTVYRGVDTRLDRPVAVKVMDPRYAGDDQFLTR
FQREARAVARLKDPGLVAVYDQGLDARHPFLVMELIEGGTLRELLGERGP
MPPYAVAAVLRPVLGGLAAAHRAGLVHRDVKPENVLISDDGEVKIADFGL
VRAVAAAGITSASVILGTAAYLSPEQVRDGAATPRSDVYAAGIVAYELLT
GRTPFTGDSMLAIAYRRLDADVPPPSAAIDGVPAQFDDFVQRATARDPAD
RYADAVEMGADLDAIADELALPGFRVPAPRNSALHRSAALHREAGRRAPA
AEPPARHPTRHLTRGPEEWPQPDPPAHVGAEPDDDEDDYEYQSVTGEFAG
IPISEFVWARQHNRRMVLVWLALVLAVTGMVATAAWTIGRNLNGLF
>MAP0021c ppp, Ppp
MTLVLRYAARSDRGLVRSNNEDSVYAGARLLALADGMGGHAAGEVASQLV
IAALAHLDDDEPGGDLLAKLDEAVRAGNAAIAAQVEAEPELEGMGTTLTA
ILFAGDRIGLVHIGDSRGYLLRDGELTQITKDDTFVQTLVDEGRITREEA
HSHPQRSLIMRALTGHEVEPTLTMREARAGDRYLLCSDGLSDPVSDETIL
EALQIPDVAEAAYRLIELALRGGGPDNVTVVVADVVDYDYGQTQPILAGA
VSGDEDQLTLPNTSAGRASAIRPRDESAKRVAPQPETPSRPRWSRRRLFV
VIALAVMLVLAGLTVGWWVIQRNYYVAEYNGRISIVRGIQGSLLGVPLQQ
PYLVGCLNARNELSLISYGQSGGSNCQLMTLQDLRRPGQVQVQTGLPGGS
LDQAESQLRQLLAEYLLPLCPPPRATSPPGAQATRSPVPETGGPASPAPP
TTSASPTPSTNATPGPASSSPAGPTTTSQTLTALPGPPLQPGIDCRTVA
>MAP1985 ptpA, PtpA
MSEPLHVTFVCTGNICRSVMAEKMFADQLRRRGLADAVRVSSAGTGNWHV
GECADERAAGVLRAHGYPTEHRAAQVGAEHLSADLVVALDRNHARMLRHL
GVDEDRIRMLRSFDPRTGAHTPDVDDPYYGDSKDFERVYTVIEAALPGLH
DWVDERLAQNGSS
>MAP3983 regX3, RegX3
MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGSAALAEFDRAGADIVL
LDLMLPGMSGTDVCKQLRARSSVPVIMVTARDSEIDKVVGLELGADDYVT
KPYSARELIARIRAVLRRGGDDDSEISDGVLESGPVRMDVERHVVSVNGD
TITLPLKEFDLLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKR
LRSKIEADPANPVHLVTVRGLGYKLEG
>MAP1047 relA, RelA
MAEENSAAQALDAPAESPPNPVIETPEPPTESLKTSSSASRRVRARLARR
MTAQRSTLNPVLEPLVAMHREIYPKANVQLLQRAFEVADQRHASQLRHSG
DPYITHPLAVATILAELGMDTTTLVAALLHDTVEDTGYTLAQLSEEFGEE
VGHLVDGVTKLDRVVLGSAAEGETIRKMITAMARDPRVLVIKVADRLHNM
RTMRFLPPEKQARKARETLEVIAPLAHRLGMASVKWELEDLSFAILHPKK
YDEIVRLVAGRAPSRDTYLAKVRAEIINTLNASKIKATVEGRPKHYWSIY
QKMIVKGRDFDDIHDLVGIRILCDEIRDCYAAVGVVHSLWQPMAGRFKDY
IAQPRYGVYQSLHTTVVGPEGKPLEVQIRTRDMHRTAEYGIAAHWRYKEA
KGRNGVPHPHAAAEIDDMAWMRQLLDWQREAADPGEFLESLRYDLAVQEI
FVFTPKGDVITLPTGSTPIDFAYAVHTEVGHRCIGARVNGRLVALERKLE
NGEVVEVFTSKAANAGPSRDWQQFVVSPRAKAKIRQWFAKERREEALEAG
KDAMAREVRRGGLPLQRLVNAESMSAVARELHYADVSALYTAIGEGHVSA
RHVVQRLLAELGGIDQTEEDLAERSTPTTMLRRPRSSDDVGVSVPGAPGV
LTKLAKCCTPVPGDQIMGFVTRGGGVSVHRTDCTNAASLQQQSERIIEVH
WAPSPSSVFLVAIQVEALDRHRLLSDVTRVLADEKVNILSASVTTSGDRV
AISRFTFEMGDPKHLGHLLNVVRNVEGVYDVYRVTSAA
>MAP0379c rsbR, RsbR
MSDSPSDFNSTGTAAQTVGISSEGLLPQLVQHLRQNRTILREEWARRITE
AELLTAMTPEELFSEATAVYDNYVEVLETGSVEALQAYARDLSERIIPRG
VETDEVVGIVLLLRDVLARSLFEKYQTEFEMLNRVLDAYEPAANRIANTV
AVSFVQERERIIRQQQEAIRELSTPVLQVREQLLILPIIGVLDSQRARQV
TEQLLRAIRANRAKVVVIDITGVPTIDSTVANHLVQTVDASGLMGASVII
TGLSSEIALTLVTIGLDLSKMNAVGDLQGGIEEAERLLGYEVTRTGEQTG
>MAP2361 rsbU, RsbU
MVAEKDWDRIVGAADDVRRVFDNVPALLVGLEGPDHRFVAVNAAYRALSP
PVDPIGLLAREVYPELESQQIFQMFDRVYQTGEPQSGTEWRVQADFEGVG
KAQEHFFDFIVTPRRGDDGSIEGVQLAFVDVTDRVRNRMAAEARLEELSE
RYRNVRDSATVMQQALLAPSVPVTPGADVTAEYLVAAQDTAAGGDWFDAI
PLGDRLVLIVGDVVGHGVEAAAVMSQLRTALRMQILAGYPIAEALEAVDR
FHKHVPGSSSATMCVGSLDFGSGEFSYCTAGHPPPLLVTADATARYVEPS
GAGPLGSGTGFPVRTEFLDVGDSILLYTDGLIERPGRPLGASTAEFADLA
ANIVSGQGGFVIESSVRTADRLCAETLELLLRSTGYSDDVTLLAAQRRTP
PPPLNLTLDATIHAARTVRTRLRQWLSDVGADADDISDIVHAISEFVENA
VEHGYATEVPDGVVVEAMLAGDGNLHASVIDRGRWKDHREGETGRGRGLA
MAEALVSEAHVSHGPEGTTARVTHRLSRPANFVADSLVSRAGDHGVRSAE
FVSTVTEPGRIVVSGDVDSNTASTLDRQIAVESRSGVAPLTVDLSAVTHL
GSAGVSALAAARERALRQGSDYVLVAPPGSPAHHVLSLVQIPVVSGLAEN
VVAEG
>MAP3407c rsbW, RsbW
MTNRLERTVHGGLQSSSSVSRPLPGDHMKDAGLHADKRPRGHRAVELHVA
ARLENLAMLRTLVGAIGTFEDLDFDAVADLRLAVDEVCTRLIRSATPDAT
LIVVVDPQDDQLVVEASAACDTHDVVAPGSFSWHVLTSLADDVQTFHDGR
EPNEAGSVFGITLTARRAASSR
>MAP3982 senX3, SenX3
MTVFSALLLAGVLSVLALAAGVAVGTRLSPRAAQRRQRISTEWTGITVAQ
MLERIVALMPLGAAVVDAHRDVVYLNDRAKELGLVRDRQLDDQAWEAAQQ
ALTGVDVEFDLRPAKRSGGRAGLSVHGQARLLSEEDRRFAVVFAHDQSDY
ARMEATRRDFVANVSHELKTPVGAMALLAEALLASADDSETVRRFAEKVL
VEANRLGDMVAELIELSRLQGAERLTNVTEVNVDSVVNEAISRHKVAADN
ANIEVRTDAPSGLRVLGDETLLVTALANLVSNAIAYSPPGSPVSISRRRR
GDNIEIAVTDRGIGIALEDQERVFERFFRGDKARSRATGGSGLGLAIVKH
VAANHNGTIGVWSKPGTGSTFTLSIPAAVASHRGDGESEQPQGRDVRPNR
SQREEELSR
>MAP1102c tcrA, TcrA
MKVLLVEDEPRLAATVARGLKAEGFVVVTVGNGVDGLAEATENPFDIVIL
DIMLPGRSGYEVLRRMRSNNVWTPVLMLTAKDGEYDETDAFDLGADDYLT
KPFSFRVLVARLRALVRRGAPERPVVLTAGSLSLDPARHTVQRGSTPIAL
TPREYGVLEFLMRNKDVVVTKADILANVWDAHHHGPDNVVEVYVGYLRRK
IDVPFGTNTIETIRGVGYRLLC