Gene list

Applied filters:

COG category: Transcription
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 298

# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP3689 hypothetical protein
MAGGTKRLPRAVREQQMLDAAVQMFSVNGYHETSMDAIAAQAQISKPMLY
LYYGSKEDLFGACLNRELGRFIDAVRADIDFQQSPNDMLRNTIASFMRYI
DANRASWIVMYTQATSSQAFAHMVREGREQIIELVAGLMRAGSRTPRTDR
EHEMMAVALVGAGEAVANRLSTGDIDVDEAAALMIDLFWYGLKGSPEERE
AAAAQPEADRSGAGG
>MAP1342c hypothetical protein
MPDESATPADAGYDNAGVPTFDSVRDKIEARYATAQGAADLDAESPEGRS
VAEQYDERERAAARRLAQIRESMRPQQD
>MAP0259 hypothetical protein
MSQPRASAERVVMCRADGNPINVLVVDDEAVLAEMVSMALRYEGWNIATA
SDGASAIAAARNQRPDVVVLDVMLPDMSGLDVLHKLREENPQLPVLLLTA
KDAVEDRIAGLTAGGDDYVTKPFSIEEVVLRLRALLRRTGVTTVDSGAQL
VVGDLVLDEDSHEVTRAGEPISLTSTEFELLRFMMRNSKRVLSKAQILDR
VWSYDFGGRSNIVELYISYLRKKIDNGREPMIHTLRGAGYVLKPAR
>MAP1652c hypothetical protein
MPELVPRQVVIVGFPGVQALDVVGPHDVFSGASLLTGGGYQVVLASVDGQ
PVTTPAGLGFVATALPDPGRPIDTVVLPGGAGVDAARGDAELIAWIKAVA
GRARRMVTVCTGAFLAAEAGLLDGHRVTTHWAFADRLASEFPAVDVDADP
IFVRSSDTLWTAAGVTAGIDLALALVEDDHGTEVAQTVARWLVLYLRRPG
GQTQFAAPVWMPRARRDSIREVQEIIEAEPGRPHTVDELARRAAMSPRHF
TRVFTAEIGEAPGQYVERIRTEAARRQLQETDDTVVTIAARCGFGTAETM
RRNFIRRVGISPDQYRNACA
>MAP3139c hypothetical protein
MELTDNILWLLKQAFYYSLTTINEAMREHGVSTAQIGVLRQLANEPGLSG
AELARRLLISPQGVQLALTALERRGLVERKQDPQHARILQAYLTGEGRKV
AETVVNDAIAAHQEVFGVLTEQEQQTLRELLGRVVEKGTGHELFTDHVDG
>MAP0298 hypothetical protein
MRCKSVILRPCPTVFANFVKGAGRRVVRVAKTFAGARLRRLREEQGLTQV
ALARALGLSTSYVNQLENDQRPITVPVLLTLTERFDLPTQYFAPDSDARL
IADLREVLAETPATPGQVEELVARMPAIGQTLVNLHRRLHDTTADLEALY
SRANLDISDTPGLPHQPMPFEEVRDFFYDRKNYIGELDVAAEEMFDHNRL
RIGGLDGQLARLLGEQLGVAVVIDDGQLLAPNSKRLFDPESKTVYLARWL
HPGQRAFQLATQIALLTQPELITAIIAGDDQLSDDARGVARIGLANYFAG
ALLLPYRRFLDAAERVRYDIDQLSRRFEVGFETVCHRLSTLQRPNARGVP
FIFVRTDSAGNISKRQSATAFHFSRVGGNCPLWVVHHAFSRPGQFLTQVA
QMPDDRTYFWIARTTSGGPSRYLGPHKSFAIGLGCDVDHADKLIYSVGID
LRDTESIVPIGAGCKICDRAACPQRAFPYLGRPVSVDPHRSTDLPYPPAI
TP
>MAP1494 hypothetical protein
MSNHDYVTYEEFGRRFFEVAVTPERVAAAFADIAGSEFAMEPIAQGPGKI
AKVSANVKIHEPRVTRRLGDTITFVIHIPLSIDLLVDLWLDKQRFVVSGD
IALRATARAAEPLLLIVDVAKPRPSDITVNVSSKSIRGEVLRILAGVDGE
IRRFIAQYVADEIDAPHSQAAQVIDVAQQLEQAWPG
>MAP4135 hypothetical protein
MPDLTARSVVLSVLLGAHPAHARASELVRLTSDFGIKESTLRVALTRMVN
AGDLIRSADGYRLSDRLLARQRRQDDAIDPKVRPWRGQWVTLIVTSVGDD
ARTRAALRNAMHDKRFSELREGVWMRPDNIDTQLGPDVTDRARVITARDD
DPADLAGRLWDLLGWARAGHRLLDEMAAAPDVPGRFVAAAAMVRHLLTDP
VLPDELLPPDWPGTRLRAAYHDFAAELMARRDPLFAEAT
>MAP0712c hypothetical protein
MTNPSEEPAWKQRAVERSIKTAKLRAAQRVQRFLDAAQAIIIEKGSTDFT
VQEVVDRSRQSLRSFYLQFDGKHELLLALFEDALSRSADQIRAATESHTD
PLERLQVAVQLLYEASRPDPTAKRPLFTDFAPRLLVTHPAEVKVAHAPLL
ALLTELMEAAGEAGKLRTTINPKRVAAMTMQTVMFIAQSSGGPDDATTHP
ITAEEVWDFCSRGFVA
>MAP2260 hypothetical protein
MTVGRAESPELVAVLAGRRIAVLTGAGISTDSGIPDYRGPESPPSNPMTI
RQFTGDPAFRQRYWARNHVGWRHMDDTLPNAGHRALATLEDAAVVTGVIT
QNVDLLHTKAGSRNVIDLHGSYARVICLGCGDTTSRAALAERLEALNPGF
IERTEAIGGLAVAPDADAVVADTASFRYLDCARCAGMLKPDIVYFGESVP
KDVVAAAYRLIDESDALLVAGSSLTVFSGYRFVRHAAARGIPIAIVNRGD
TRGDHLATVKIDGGCSELLTLLADELSPLPTH
>MAP0139c hypothetical protein
MWLYDRPVSLRDAVLAALLEGESSGYDLAKDFDASVANFWPATPQQLYRE
LDRLAGQGLIRARVVHQQRRPNKRMFSLTAAGRAAIRRFTATAPRPSVIR
DELLIKVQAADAGDMRAVRDAIRERRDWATAKLARYQRLRARLLDGRSEE
DYLARAERIGPYLTLIRGISFEEDNIRWAEHALAVIARRLPTTDADSDAG
DSRLVGPATNG
>MAP2335c hypothetical protein
MAEQLLIGDVTALTGIASGRIRHYEKIGLLHADHLSNGYRVFDVEQVLDL
LRIDLMRSLGVGINDIQRLLHGGHSTLADLLDEHRALLIQQRDRLDQLIR
ALDESCAQSRAAADGTDVSEHLLRQLATTHRDSIGVIGRLSAPLPAPVAA
MYADLFDEWDLPVPALFGQMRLPSAASTLLARLAETPGRQVLFDRLRRLA
ADVIALGESVSAATELAHNWIEQQLADPLPDDVVTVLRGVQPLLGQDPVI
VQGFRAWAGSISPAAARFIEEIARQTARRGLAAVSVIVLPAADRAQTVDA
TP
>MAP0198c hypothetical protein
MSEIGQHGLATSAQTLPPGARIARHRHPLHQIVYPSTGAVSVTTPAGTWI
TPANRAIWIPAGCWHEHKFHGHTNFHGVALDPARYRRGPAAPAVLAVTPL
MRELIIACSRARDTDAGAHQRMLAVLHDQLQATSVAEPLWIPTAVDGRLR
AACALLADNLRKPLTLRQIGERIGVGQRTLSRLFRDELAMTFPQWRTQVR
LQHALVLLAERRDVTSVAAECGWATPSAFIDTYRRAFGHTPGRLAPRPS
>MAP3579c hypothetical protein
MSHPVRATAIADTDTSTRQRILAATAEVLGRNGKTKLSLSDVATQAGVSR
PTLYRWFASKEELLSAFSAYERQIFESGLVKATAGLKGVDKLDAVLRFIV
DYQHSYSGVRMVDVEPEHTIAQFSWVIPQMREGLQRHLPGPNAAVKAATV
IRIAISHYIVRSDDADQFLAQLRHAVGIKAPTD
>MAP1105 hypothetical protein
MTGPAVLLIRITNPPGGRGWRNDPARRGFGWSGPNATRRNAVGRRLRPRS
PARKTPLSCVSVTVVAHPVPIGGPRADRARRVADVLRQQIHADAYPDGLP
AELELADEFSVSRNTIREALAVLKREGLIDRGPRVGTHVAQRKYDHALDA
LLGLKETFKDLGEVRNEVRAAMPVTAPPSVARRLRLAPGEQAVFIERLRY
LGELPLSLDLTYLAPAIGRQVLEHSLETNDVFALIEQVSGQRLGSAALAL
EAIPADAHSAATLQVPDGAALLMLERLTSLADGTPVDLEYIRMRGDRITM
RGNLFRAEPMRSNS
>MAP2953 hypothetical protein
MTEADDAPLGYLLYRVGAALRPEVSGALGPLGLTLPEFVCLRILSMFPGM
SSAELSRRAGVTPQAMNTVLRRLQEVGAVARPSSVSSGRSLPAHLTGAGC
TLLKRAEAAVRGADARILAKLTETQQREFKRMLQKLGSDG
>MAP2614 hypothetical protein
MKRTLPAAVVRLLRREAADRNAHGYYQEMPQLTDRTGSRSRRRGEVLERA
LYEATLAELTEVGYGGLTMEGIAARAHTGKAALYRRWDTKCELVHAALVF
ALPPVPELRSGRSARENLLAMFTAQRDLLAGKTAFPGIEVIQQLLHEPEM
RAIFADAVVRPRLKIVESILQSAVQDGDLDPKSITPLTARIGSALINQHF
LLNGSPPNRRELALIVDTVIPPRDSDRAAD
>MAP2347 hypothetical protein
MTALGIGPDARRRLSAPRIAEIVADELRRQIISGELADGDLLPRQEVLVE
QFNVSLVSLREALRILETEGLVSVRRGNRGGAVVHAPAKTSAAYMLGLLL
QSESVAVADLGMALQELEPACAALAARRPDRADTLVPELTKINESMATHL
DDGRLFTEIGRQFHDLVVRGCGNHTIIAVVGSLETLWTSHERQWADESSS
RGTYPSLAQRRAVLNTHVKLTETIAEGNVDRARRIAGRHLADTQTYVLSG
QPDQRIHAFSPQALARPRDLRRS
>MAP1481c hypothetical protein
MIRASRTGLSRGDRFCQPAISPPRAGPRDVLVSHPVLPERHRKVTTPKTF
ADLGVPARIVDALTARGITSPFPIQAETLPDTLAGRDVLGRGKTGSGKTL
AFSIPLVGRLSTGNRRPARPTGLVLAPTRELATQITATLEPLAAACGLRV
STIFGGVSQHRQVTALKAGVDIVVACPGRLEDLMRQRLITLEAVRVTVID
EADHMADLGFLPGVTRILAATPNDGQRLLFSATLDNGVDKLVTRFLRDAV
LHSVDEANSPVSEMTHHVFHVDSVQAKKELVHRLASGTGRRILFLRTKHQ
ARKVARQLTESGVPSVDLHGNLSQPARERNLAMFAAGSARVLVATDIAAR
GVHVDEVELVVHIDPPSEHKSYLHRSGRTARAGSAGDVVTVVLPEQREHT
RALMRKAGIDVAPQRVTAGSQAVHALVGPIAPPKPPAAAGVPSHPAGPHR
PAAAGQRRRRSGRSARTTAAHAVPHRPAAPVRRPDRRRASRAQGSAG
>MAP0491c hypothetical protein
MGNSHAECVSFADEPPGTGAHRHMPARATSKAESKADSSRVSDSQPREVM
NVAVLAESELGSEAQRERRKRILDATMAIASKGGYEAVQMRAVADRADVA
VGTLYRYFPSKVHLLVSALGREFSRIDAKTDRAAMTGGTPFQRLNFMVGK
LNRAMQRNPLLTEAMTRAYVFADASAASEVDQVEKLIDSMFARAMADGEP
TEDQYHIARVISDVWLSNLLAWLTRRASATDVSKRLDLAVRLLLGEHG
>MAP1121c hypothetical protein
MDRRSRRVTASLAADQGAAIGYHRGVSNLRIRQAAELLGVSDDTVRRWIN
QGALEVGHDAAGRKVIASEDLAHFSRTNAPAPPADPLSIGSSARNRFVGL
VTSVVSDKVMTQVEMQCGPFTVVSLMSTEAAEELQLRPGSVAVAVVKATT
VIVETARPSTTVASD
>MAP1412 hypothetical protein
MSENAPDVDLDSELEAGIPEIAEAAPMESGELGSVLEALLLVVDTPVTAE
ALAAATQQPVYRIATKLQEMADELTARDSGIDLRKTSEGWRMYTRARFAP
YVEKLLLDGARSKLTRAALETLAVVAYRQPVARARVSAVRGVNVDAVMRT
LLARGLITEAGVDEDTGATTFATTDLFLERLGLTSLADLPDIAPLLPDVD
TIEDLSESLDSEPRFVKLSGGQASDAALTFDVDQD
>MAP2201 hypothetical protein
MATERATYRSSRPEPGTIASVLHNVRRAPKRVRRQSREYRQLIEGAVSQL
FDAAVRHPHGDPNSGEYRIDDLARLAGTTTRNICVYRDRGLLPPPLRVGR
IALFNDTHLTRLRLITSMLDRGYTIAHVREMLSAWEQGKNLGDVLGLETA
IVGTWTTEKPETMSLAEAQRLVGDPRAFERLVALQVIRVDGSRATLTRPK
LIEAFNEIRGYGVEFDKLIDLHEQIVPEIDKISDMLVRAGAEHVLDRIKP
GEPLPADAEIAELITMLVRFRTQAVATVTATLASSIEANIESLVSRILAD
YLESSSSA
>MAP2316 hypothetical protein
MDMNDSRPTERFSPPQPRYSPSVDPAYADQTPYAPTYGPTMSPWAPAANE
TNPTKQLPAYWQQELPPGGDPHAPGMAPPPGEPKSPRWLWIAAGAAVLLV
VALVIALVLANDAIKTQTAVPPLPAMPEPSPETPTTSTHRSPSLIPAPIP
PTSEPEPPTATTGPAAMQDVVYTVSGEGRAISIMYIDTGDLIQTEFNVAL
PWSKQVSLSKSAVHPANVTIVNIGHSVTCTVTVNGVQVSRRVGGGLTICD
ARG
>MAP1726c hypothetical protein
MTRTQQRAAENRRTVIDAAREIIATQGVEALTLEAVAEKADVVVQTIYNR
VGGRSALLTAVAEQALEQSRVYMDPAYEADGTVEERMMLAANAYARFARE
RPHEFRILVEPPNEPEAVARIAELTRAQNARLTAVLREGMAAGLIRADLD
PDDVTTALWATFNGLLALAWRPGGPSGKPRNRRPPSCRVHRHGE
>MAP1770c hypothetical protein
MNAPRNLAELAQRFDDDRRHLRSVAFQLLGSLADADDAVQSAWLKASRAD
FAAVDNLSGWFTTITAREALDQLRARKRRAELPLAAPEELDRLAVPASSP
ADEDTLLAESVSTALLVVLDRLSPAQRVAFVLHDVFAMPFETIAELLNRS
PDAAKKLASRARARLASFAPAGPPRRDGHRIERHVEVVQAFLAASRGGDI
AALLQLLAPDVLRTVDPVLVPADVPTALRGATRVAQETRRFAGRARAGAV
MLIDGTPGIVIAARGRAQILLMIGIGADDRIHTIDITGDPERIRRATLAL
PPAGRFRNITT
>MAP2527c hypothetical protein
MDMAASPRRVGAETSQTRDALLEAVAQMMLEEGYASVTYRALAAKAGVTP
SLVQYYFPSLDDIFVAAIRRYSERNLQWLTEELQRRADDPLHALWESGWH
ESTSALMTEFMALGNHRKSIRSEIAAVTDSMRRVQVEALVAKFGNDARLL
ADLSFDAVVLLINGVPKLLGLEESVGVDTAHAELIAACERFLDAVEPRAK
PRRRSKKAPTRRR
>MAP2483c hypothetical protein
MRMSAKAEYAVRAMIQLATVPDGTLVKTDDLAQAQGIPPQFLVDILTNLR
TDRLVRSHRGREGGYELARPGKDISIADVLRCIDGPLASVRDIGLGDLPY
SGPTAALTDVWRALRASMRSVLEETTLADVATGSLPKHVAQLADDYRKQE
SQRHGTARTGD
>MAP0116 hypothetical protein
MRSVHRLRPAGRPPIARSAAAAATVICMPSASRASADPVKRRPKDRKVQI
ARAATDAFSELGYHAVSMENIAARVGISAAALYRHSQGKYDLFREVFLAL
GQQLVDATAFADDLPADADPGETLSALTTALIDTTIVNRTAGGLYRWEGR
YLRGDDQRKLAEQIKVINRRLQRPLIKLRPKLTSRQRWILSAATLSVIGS
ITDHRSRLGNAEIRATLSDIAAAVLKAELPTQRGRGAEPARAPTLTSAAG
SYELLLHESMRLFNERGYRETGMEDIAAAVGIPVASIYQYFPGKAAILAV
YYRRAADQLSADLSSILATSSDPEQALARLIEAYVTRSFANPELACVYYT
ERHNLPETDAVLLYNIQRSTVESWARLAVAARPELTLGRARYAVHAAFAL
AVDIGRLVYPNTGAPASTTVRRLMEVTVLGRPAAASRDGLRGAGRQRR
>MAP3927 hypothetical protein
MAPPRKHETDVILDAARALVLDGGPRAASVAAIAKASGAPAGTLYHRFGN
RDGILTAAWLRALERFQARALAADGDTPEDTAVAMAVAAVGFARALPDDA
RLLLTIRPGDLLDGEPDAAFRQTLAAMNAPLTQRLAQLARHLYGNARPRS
VDAVARAVADLPYAVVRRHAHDDPMPSWLETDVAASARAVLRSFGERP
>MAP0053c hypothetical protein
MPKKYGVKEKDLVVSHILHLLLTGKLRTGDRVDRNEIAVGLGVSRVPIQE
ALVQLEHDGIVSTRYHRGAFVERFDEATVLEHHELDGLLNGIASARAATS
PTPRILGELDTLLRSLRTAKDSRAFTDIAADYRRTVNDEYAGPGLHATIR
ASQNLIPRLFWTTYQGGRDDLLPFYEDETSAIHRRDPEAARAACVERSYR
MAQTMLGELFRRRVFTPPDDAGRVVAGPLPVLADADTICGGASLAL
>MAP0493c hypothetical protein
MARWEPDARARLVAAALDLFNERGYDQTTVAEIAERAGLTKSTFFRHFPD
KRDVLAAGQDAIAALLREGIAAAPADATPLALVCSGLKSAAAAFTPFNKE
LAPRLRAAIAASAELQERNALKQIGLALAVSEALQARGVPEPAAALAAEL
GALAVKTAYARWAEPDESGDLGDMACQALRELHAAAADLG
>MAP3876c hypothetical protein
MAVDYSELPAEEAVRAARAGARRRQVLDAAVKVMGRNGFHRMSMHDLAAE
ADVSVGLLYKYFGGKQDLLLATIVRILDVFRDQLAPVIDGAGDDVVDQLS
AGIRRYIQIVDENLDGVVLTYRESRTLGAAGRAQIKDLEIATAAPLRAVI
ETGIAQGVFRSVDVDLAVFDIMLLAHGWALKHWHFGPIYTLDEYFALQTR
HLLNGLISDERRADYADVLK
>MAP1814 hypothetical protein
MRNDAGMRCDVAREALSARLDGERPQVLAQQVDAHLEACRGCRSWLIGAA
VQTRRLASVTPGEGPDLVDKIMASIGEQPTGRPAWMRWLRSHYRRWGLIG
VGLFQVAIAAAQISGIDFGMVAGHMHGAMSGEHLMHESTAWLLALGLAMI
AAGVWPASASGVAAITGVYSVALLGYVIVDAFDGEVTATRIASHMPLLLG
LAFALLVARERVGSRRPGSSDATADAGFAAWAADAPAGRRRGHLRPINRA
APDPTASSTTTRRRIGKPAQLRWLGMIAPSDDEAVTELALSAARGNARAL
EAFIKATQQDVWRFVAYLCDAGSADDLPQETFLRAIGAIERFSGRSSART
WLLSIARRVVADHIRHLQSRPRAAVGADPEHVLRTDRHARGFEDLVEVTT
MIASLNPEQREALLLTQLLGLPYADAAAVCGCPVGTIRSRVARARDALLA
DGERSDLTG
>MAP4212 hypothetical protein
MADSRPHERDPIAAARANWERAGWGDVAPGMVAVTSVMRAHQILLARVET
ALRPYDMSFSRYELLRLLAFSRTGALPITKASDRLQVHVTSVTHAIRRLE
ADGLVQRVPHPTDGRTTLVQITELGRSTVEDATVTLNKQVFADIGMSDTE
SRQLASSIETLRRNAGDF
>MAP2894 hypothetical protein
MRADEERGGLTAVGQDYLKAIWNAQEWSPEGAPQKVSTKMLAEKIGVSAS
TASESIRKLAEQGLVDHEKYGAVTLTEAGRRGA
>MAP3758c hypothetical protein
MSVMVGYLECRPLGALAPYVDCGWVRSNTAGGALRVMPDGCVDLFVTAEG
AVMVSGPATTFYDQCAGTEGAMIGLRLRPGAAAAVLGHPVSELRDTQIRV
DSVFGTSASGLAEDVLEANAFRQRVALLAAVLARYVVKIDPVVDQPVAHS
IEMLRVHPARSVSDVASSVGLSERQLRRRFDAAVGFGPKRLGRIFRFQRL
LDLIHAGDHRIRWADLAIEAGYADQSHLISECLALAGAAPTELPGAARPR
HA
>MAP1788 hypothetical protein
MAVADRLSAREAKRLQTRERLMGAAIAEFARAGMAEADVGAIVAAAGVAH
GTFFFHFPTKEHVLLELERREEERIAKQFAQFLKSEHDLASALNEAVRLV
VGLERRLGDMLFNDFLALHFSQTRPQTEDGRDHPLIVLVANEIEGAQQRG
ETDPHVNPMNSAVFFLLGLYALLITTNHWPTGHALLEDYVARTLRSLKP
>MAP0475 hypothetical protein
MIFKVGDTVVYPHHGAALVEAIETRTIKGEQKEYLVLKVAQGDLTVRVPA
ENAEYVGVRDVVGQEGLDKVFQVLRAPHTEEPTNWSRRYKANLEKLASGD
VNKVAEVVRDLWRRDQERGLSAGEKRMLAKARQILVGELALAESTDDAKA
ETILDEVLAAAS
>MAP3575 hypothetical protein
MPADSSTPNGQTRREELLAVATKLFAARGYHGTRMDDVADVIGLNKATVY
HYYASKSLILYDIYQQAAERTLAAVHDDPSMTAREALYQYTVRLLDQIAA
NPEGAAVYFQEQPYITEWFTKEQVAAVREKETQVYEHVHGLIDRGIASGE
FYECDSHVLALGYIGMTLGAYRWLRPSGRRSAKEIAAEFSTALLRGLIRD
EEIRTTSPLGP
>MAP3097 hypothetical protein
MARTPDRQRRRELLDALIAEFAAGGIGDRSLRRVAEAVGTSHRMLLHHFG
SREGLLLAIVEEVERRQMRVLTELPRAPAEGFAAMWADLRRPELREFERL
FFECYSRAAQGEKPFARMLPDAVDDWLRQAETHSGAPFDPAMARLGLAVI
RGLLLDLVATGDEAGVDAAARAFVNLLNAGA
>MAP1831c hypothetical protein
MTPISTRLVRLLNMVPYFQANPRVTRAKAAADLGVSAKQLEEDLNQLWMC
GLPGYFPGDLIDFEFSGDTIEVTFSAGIDRPLRLTSPEATGLLVALRALA
DIPGVVDPEAARSAIAKIEGAAGAVGHDTTQAATAVDEPAPAESRAAAAV
RAAVQSRHALAIDYYSASHDTLTSRVVDPIRVLLIGGHSYLEAWSREAEG
VRLFRFDRIVEASELDEPAAPPEPVLHAPPDTSLFDGDPSLPSATLRVAP
HASWMFEYYPMRDVRELPDGSCEAVMTYASEDWMTRLVLGLGSGVQVLAP
HSLAQRVREAAAAALAAYEALS
>MAP3381 hypothetical protein
MPVQRVVRVIATALTLAVVLGTGIAWSNVRSFEDGIFHMSAPSLGKGGDD
GAIDILLVGLDSRTDAHGNPLSQQELETLRAGDEEATNTDTIILIRIPNN
GKSATAISIPRDSYVAAPGLGKTKINGVYGQTREAKRASLVKAGDSAEDA
AAQGTEAGREALIKTVADLTGVTVDHYAEIGLLGFALITDALGGVDVCLK
EPVFEPLSGADFPAGPQRLSGPEALSFVRQRHELPRGDLDRVVRQQVVMA
SLAHRVISGQTLSSPATVKRLEAAVQRSVVISAGWDVMDFVQQMQKLAGG
NVAFATIPVLDGAGWSDDGMQSVVRVDPHQVADWVGSLLHDQEQGKTEEI
AYTPAKTTASVVNDTDINGLAAAVSDVLSAKGFATGTVGNNDGGHVKTSQ
VRAAKSDDLGAKEVSKELGGLPVVADSSLAPGAVRVVLANDYSGPGSGLS
GTESIMPAKVSTAGAVSTNDKAPAPSPILTAGSDKPECIN
>MAP1446c hypothetical protein
MPAPAETPTAVIDRISLVLDAFDGPGRLTLAQIVRRTGLPRSSAHRMLER
LVQLRWLRRSGRDYELGMRLVELGSLAVHQDRLVRAATPLLVELHRATGL
VVHLAVLDGPDVVYLEKVGDRLNSVIPSRVGGRQPAHCTAVGKAILAYRD
EATDLDLGARPTKYSISSAPQLAAELAKVRAHGVAFEREESLLEFGCVAA
PIGGPGEAAAAVSVCGPLQRMALDQRLAAPVRMTAMGVWRNVEGGPQRVA
PTLQRLRPLPPMRPQHPVPKPRPALQYA
>MAP3331 hypothetical protein
MASDSRSRRRDGDERRRQLCDAAIRVLAEHGSRGLTHGQVDRYAGVPEGT
TSYYYRTRAALLQGVGKRVAEIDVANLQSVIDEPLDPLSPFAHLARLTMM
QASGPGLMLNRARHELLLGAARDPGLAETSQIFAGRINSMARDAIAHLQP
DIRDPALLDAQTTAVTTFIAGVFTRLAAGDRNVGDAEQLAHLLEAVATAV
ALRAQRS
>MAP4291c hypothetical protein
MESKRRTQEERSAATRDALIAAARKLWGLRGYTEVGTPEIAAAAGVTRGA
MYHQFADKAALFRAVVEAVEQDVMARMAVVVAESGASTPTDAIRAAVDAW
LEVSADPEVRQLILLDAPSVLGWAEFRDVAQRYSLGMTEQLLTEAIRAGE
LAEQPVRPLAHVLIGALDEAAMVIATADDPDRARRETRQVLQRLVDGMFD
AR
>MAP3505c hypothetical protein
MPQTISGDHRYLQIARALRKEIVDGVYPVGSQLPTEHQLCERFAVSRYTI
REALRRLREDNLVASRPRAGTRVVPRPASSSYAQDAMSIDDLLAFAAGAQ
LTIESNAMVTIDDELAARTGLEPGTQWLSVRGYRQADGASVPICRTEYYI
NRSFAAVGRLLQRHAGPIFPLIEDLFGVSVAQLHQEIAAVLLSPELADGL
GAEAGTAALQMRRTYTTSDGEVAQVTINTHLSPNFRYAMTMRRVTGQAG
>MAP1267 hypothetical protein
MAVHKSITATGAESIAASIEAAISAGSLAPGDALPPVRELAARLGVNANT
AAAAYRLLRHRGAVETAGRRGTRVRHRPATTPRSLLGLDVPAGVRDLSTG
NPHPALLPLAAAAPARPASGRAVLYGEPAISPALGEYARAALSADGVPAD
HLALTSGALDGIERALTAHLRPGDRVAVEDPGWANLLDLLAALGFSAEPV
RVDDDGPLPEDLARALGRGARALVLTTRAQNPTGAALSADRAAALRDLLS
GRADDVLLVEDDHCAGISGAALHPVAGCTAHWAFVRSASKAYGPDLRVAV
LAGDQRTVDRVHGRLRLGPGWVSHVLQQLAVELWSDTAASELVATAERRY
ADRRRRLCGALAERGVDAHGRSGLNVWVRVPDEAVAVSRLLGAGWAAAPG
ARFRMQTPAGIRITIADLTPDEIEPLADAVAQAVRPSGRPIV
>MAP3052c hypothetical protein
MVGAMTQTADRCEQASPWSPREAEILAVTLRLLQEHGYDQLTVDAVAGAA
HASKATVYRRWPSKAELVLAAFIEGVRQVAVPPNTGTLRGDLLALGETVC
EQVGHHASTIRAVMFEVSRHPALNDALQHQFLDQRRALIEHVLHQAVDRG
EISADAISDELWDLLPGYLIFRSIVPTRPPSRRTVQALVDGFLIPGLTAR
>MAP2038c hypothetical protein
MPLSPRMPELASFEAFLAIAETGSLGRAARELELTQQAISRRLATMEAQI
GVTLAVRTTRGSQLTPAGLIVADWAARLLEVAHEIDAGLGSLRKEGRERI
RVAASQTISEQLMPHWLLSLQAEATRRGEAAPQVILTATNSEHAIASVRD
GSADLGFVENPGTPKGLGSCVVGEDELVIVVPPTHKWARRSRVVTARELA
QTPLVAREPHSGIRDSLTVALRKVLGEDMQQAAPVLELTSAAAMRAAVLA
GAGPAAMSRLAVADDLAVGRLHAVEIPKLDLRRKFRAIWIGGRTPPAGAI
RDLLSHIIARTAVSK
>MAP1719c hypothetical protein
MACVAQPVRSDAARNREALIEVATRLFAAAAGGDEPSLRLIAREAGVGVG
TLFRHFPTREALVEAVYQDQVRRLTEGADQLLANHPPAQAMRRWMDLFTD
WLATKHGMLGTLRAMINNEQLGSGHTRIELLAAIDKILAAGRAAGDIGDH
ISSEDVAAGLIGIFTVAPTGGNSEQATRLLDIFMNGLSAPQSTVHTTTST
TDNP
>MAP0794 hypothetical protein
MGRPRSDTRERIQQVARELFSQRGVQRTSLQDIADRLGITKPALYYHFPS
REDLVRSILVPLIEEGERFVAEHESREHTEARELLEGYFDFHYRHRRDLV
LLLTELSTLIDLDLIDTVLAWRERLATLVFGKRPTLAQSTRAVVAFGGLQ
DCCVQFPDAPQAELRDKSVAAALDALGVRPRR
>MAP0074 hypothetical protein
MRTATELRGQILQAAGAEFAQYGLAGARIDRIARVAQASKERLYAHFRDK
ETLFREVVAAGNREFFSAVTLRPDAVPDFVGGIYDLAREHPEHHRMISWA
QLEGIALDEPHAEGQPAFAQHVAAIEAAQADGHVDGAWQPDDLLIVLFGI
ALAWANSPHPDAATNDPDANARRRAAAVEAARRIIAPSK
>MAP2789 hypothetical protein
MTPAPATHAVRIITTESVDVQQLADLAARTFPLACPTSAAREDIAAFIDA
NLSAECFARYLADPHRAILAAELDGRIVGYAMLIHDLAGDAAELSKIYVA
PEQHGAGTAAALMRLAVDTAGRWGVSRVWLGVNQKNQRAQRFYAKHGFTV
SGTRTFQLGAGREDDYLMSRTLR
>MAP3077 hypothetical protein
MTKRGAADRRLDRTELEKLMSADMRAITAQSDRIGRHFARQNNVSGTDFH
ALLHIMVAETAGTPLTPAQLRQRMDVSPAAITYLVDRMIDAGHIRREPDP
QDRRKTLLRYEKPGMALAHSFFTPLGAELRDALAHLSDRDLAAAHRVFAA
MIEAMSAFESRLAASTSKPPAAPGRKRATTAGRRGAPR
>MAP2096 hypothetical protein
MDRLDDTDERILAELTEHARATFAEIGDKVNLSAPAVKRRVDRMLDSGVI
KGFTTVVDRNALGWNTEAYVQVFCHGRIAPDELRAAWVDIPEVFSAATVT
GTPDAILHVLARDMRHLEVALERIRSSADVERSESIVVLSNLIDRMRP
>MAP2099 hypothetical protein
MDNVVDTAKLEGGQGLGADLLAVVARLNRLATQRIQMPLPAAQARLLATI
EAHGEARIGDLAAVDHCSQPTMTTQVRRLEEAGLVTRTVDPGDARAVRIR
ITADGRRTLNAVRADRAAAIQPQLALLDAEDRQVLRQAVDVLRRLLDNAS
LASSLAGRRQTS
>MAP4114c hypothetical protein
MADLDGVFRREWGPAVAALARWSGDLTVAEDAVQEACAEALRVWPRDGMP
ANPGGWLVTVARNRARDRLRRESARPGKELAAVLDDIRARTDAADPHPVR
DDELRMMFTCAHPALDRPSQLALTLRLVSGLTVAEIARALLQSEAAIGQR
ITRAKNKVRHANIPLRVPPAELLGERTPHVLGCVYSVFTEGYWSTAGPSA
IRDDLCDEGIRLGAELCALLPDEREAHALSALMLLHDSRRATRVDARGVP
VPLEEQDRTRWDRRRITAGLDRLRYAEGSSGAYLPQAVIAALHATAPSWE
QTDWVTICAAYDRLWQLTRSPVVAANRALAIGLRDGPDAGLAALEEVDEP
KLERSNLIATLRADLLRRAGRPAEALPWYRIALQSNGSEPGRKFLRRRIA
ECTSAAESATGDAFPENGRR
>MAP1221 hypothetical protein
MLVVEDSETIREMVSEALTEVGYHTEARRDGERLEELLDGIRPDLVVLDV
MLPGRDGFALIDVIRDWGDIGIVLITARDGLPDRLRGLDGGADDYVIKPF
ELAELVSRVGAVLRRRGRLPQVIQVGDVTLDPGAGVAARGGHRLDLTATE
LRVLTFLVEQRGRIVSAGQILNGVWGYDAYDPNLVQVHVSGLRRKLEAHG
PRILHTVRGIGYRLQPERS
>MAP0225c hypothetical protein
MDLSTRILFLMTDELDAQILHALQLAPRVSFRRIASVVGATEQTVARRYH
RLRRDGVVRVVGLENRWADGDADWVCRIRAAPDRISRLADALVRRPDVSH
ANVLAGWTDLVCVIRAPLGDTREDLLTQLLPRTAAVTDISIDLMLHSFGD
PVNAPWTGYGSALSDRQAQRIVAARAQPESGTHRSRPTAEDRPLLAALAD
DGRTPQSQLAARTGWSVARVSRRIAALEACGALVYDVDVLPERLGHRLSA
MLWLTVAPRDVHTVGERIAAHPQIAFAGATSGSKNLMAVAICRDSDDLYR
YLREQLGQLDGVLGYEVNVRTQRLKQHGSLVAHGRLINPHPGAGGRVTSQ
R
>MAP0237c hypothetical protein
MTNKRQSPADAARKNLEAELDRLRQRRDRLEVEVKNDRGMVGDHGDAAEA
IQRADELVVLSDRINELDRRLRAGPPDSDASATLPGGTEVTLRFADGEVV
TMHVISIVEETPVGREGETLTARSPLAQALAGRQPGDTVTYPTPQGEKQV
ELIAVKLP
>MAP4268c hypothetical protein
MTNLDDMRRRRPGNRTRIDAIKAEMDREVAQYRLRELREAAGYTQTTLAA
AIGVGQNRVSQMEHGDLGTSRVDTLRKYVEATGGELEVSVKRPDGSRVLL
SL
>MAP0045 hypothetical protein
MPPPEDPEDYVAPAAQRVRAGTLLLANTDLLEPTFRRSVIYIVEHNDGGT
LGVVLNRPSDTAVYNVLPQWTTLAAKPKTMFIGGPVKRDAALCLATLRVG
ADPQGAPGLRHVDGRVVMVDLDADPDAIAPLVEGVRIFAGYSGWTIGQLE
GEIERDDWIVLSALPSDVLVGPRSDLWGQVLRRQPLPLSLLATHPIDISR
N
>MAP3965c hypothetical protein
MAEQITAASVKMDGRKRRWHQHKVDRRNELVDGTIDAIRRLGGALSMDEI
AAEIGVSKTVLYRYFVDKNDLTTAVMMRFTQTTLIPNMAAALTSGLDGFD
LTREVIRVYVETVANEPEPYRFVMSNSSASKSKVIADSERIIARMLAVLM
RRRMQHAGMDTGGVEPWAYLIVGGVQLATHSWMSYPRMSRDELIDYLTML
SWNALKGIVEVGGSLEKFREEPHPSPIVPPREAAL
>MAP3718c hypothetical protein
MAQGGAQVSTAASPADRSGPPAELPTHRGRRTQAAIDLAARTVIARKGVL
AATIADIAAEAGRSAASFYNYYESKEAMVRQWALRFRDEANQRALSVTQH
GLTDRERAYGAAAAHWHTYRNRLAEVISVSQLTMINDDFAQYWAEICSIP
ISFIAESVRRAQAEGRCRDDDPQLIAEAIVAMFNQFCYVQLGSARCGEPD
DEACIQTLANIYYRAIYATEGDR
>MAP3959c hypothetical protein
MPTTCAEHQSRRGHRGCQLCEPGLHRPGNLGGVSKTFVGSRVRQLRHERG
FSQAALAQMLEISPSYLNQIEHDVRPLTVAVLLRITEVFGVDATFFSSQD
DTRLVAELREVTMDRDLDIDVDPTEVAEMVSAHPGLARAVVNLHRRYRIT
TAQLAAATEERFFDGSGTGSITMPHEEVRDYFYQRQNYLHELDTAAEDLT
IKMRLHHGDLARELTRRLTEVHGVHINRRIDLGDTVLHRYDPETKTLEIS
NHLSWGQQVFKMAAELAYLEFGDLIDSLVAQGKFTSEESHKLARLGLANY
FAAAAVLPYRQFHDVAENFRYDVERLSSFYSVSYETIAHRLSTLQRPSMR
GVPFSFIRVDRAGNMSKRQSATGFHFSSSGGTCPLWNVYETFANPGKILV
QIAQMPDGRNYMWVARTVERRAARYGQPGKTFAIGLGCELRHAHRLVYSE
GLDVSGGPGTVATPIGAGCRVCERDNCPQRAFPALGRALDLDEHRSTVSP
YLVKQS
>MAP2369c hypothetical protein
MSPVKPLRSRPTRGEVRDRILDAAAKVFAAEGFAGATIDAIGQAAGFTKG
AVYSNFESKDELFLALLDREFELRGQQIAIALDRSAGDTTAAAREVSRSV
LDSVRDHSDYYVLLVEYWLRAQRDPQLRECLIERRRAAADQALHIVESTD
TVPGDRRLTDIAQLVVTLNLGVAMEEVLRPGTINPDLLAQLITALLESIP
V
>MAP0916 hypothetical protein
MRILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIASDRPDALVL
DVMMPRLDGLEVCRQLRSTGDDLPILVLTARDSVSERVAGLDAGADDYLP
KPFALEELLARMRALLRRTKPEDDAESVAMTFSDLTLDPVTREVTRGQRR
ISLTRTEFALLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYVGYL
RRKTEADGEPRLIHTVRGVGYVLRETPP
>MAP0601c hypothetical protein
MSSDALVTITSDAGGETGQPPRNRRQEETFRKVLAAGIETLREKSYSDLT
VRAVAARAKVAPATAYTYFSSKNHLIAEVYLDLVRQVPYFTDVNDPMPTR
VEQVLRHLALVVADEPEVSAACTTALLSGGADPAVRAARDRIGVEIHRRI
TSAMGPDADPTTVSALEMSFFGALVQAGSGEFSYREIADRLAYVVRLILT
GTTQASPETEAGDTR
>MAP2418 hypothetical protein
MSTPVPRKRRRRPALIALVIIAACGCLALGWWQWTRFQSVSGSFQNLGYA
LQWPLFAWFCVYAYRKFVRYEEEPPRPRDTAAVTEIPAGLLPERPRPTPQ
PPDDPALREYNRYLAELAAQDAAQQNRTTT
>MAP1559c hypothetical protein
MAKLTRLGDLERAVMDHLWSTPEPQTVRQVHEALSAQRDLAYTTIMTVLQ
RLAKKNLVSQIRDDRAHRYAPVHGRDELVAGLMVDALAQAEDSGSRQAAL
VHFVERVGADEAEALRRALAELEANQRNNPASAGAGPEG
>MAP3681 hypothetical protein
MQAEDQVSPREMVRRGLLLDGLRQPVDLNAVDWHVRQRHPGAAPSEVQDA
TLAVIRSLVSDGLVRLGAQVMVGEHLGGVATEGERFVAWDQPLERSMHKI
SHVYLKHYDDPEKWMYAAWMQLTDKGEQLARSFEQADLDSYRKFQ
>MAP1757c hypothetical protein
MVTPVAVPPEGTDAGRLVGGVPLARLVRDFHRPMLNFARTMVDSPAVAEE
AVQEAWLQVLRSSDSFQGRSSVGTWLFGIVKHTAARHRRREARIRRHEVL
ADADIEAVDPLSGRMHPAGHPDAGHWRVPPSRRFLPEDRTVHRELLGYLR
EALDALPARQRQLVILRDLVGISAEEAAELLQLSAEAQRALLYRARGNLR
NELEKRYQP
>MAP3910c hypothetical protein
MTPSQTAVLTRLWKEGPSSASALAAAERVRPQSMATILAALGQRRLIERS
PDPNDKRRQVVSLTAAGRKRAESDRRAREEWLARAIQERYTEAERRVIVD
ALSLLERLTEQ
>MAP2738c hypothetical protein
MPAKRARVAGPAAAAPAADALAPQYPIESVDNALKLLLLLGEQPEIRLSE
ATRYLGVASSTAHRLLAMLAYRGFVRQDPVSKAYLPGPSLTGVAFAIFGR
LDIARSAAPLMRALSEQLRETVHVGMLDGASVRFVAAVEGPAAVRVASRL
GRALPAHCTSTGKVMLAQLSAAELRALLPRERLKRITAHSIGSRAALEAE
LAAIRERGYATNREESEEGVASVAVPIPTRAPGLRLALNAAAPQNRLDAA
RYPAVAAALVAAAKEIGDQLG
>MAP4287c hypothetical protein
MREGARGGWEPTVPALVNLVAAAGAPRLRAAFAEAGLDGIRPAQSLALVP
LAAGGLHASDLADRLRVSRQAVAQAVAALERHGYVTRTPDPADARARIIE
LTPRGRQALQVMRANAIAMEKRWRKVLGERRLTELRETLTVLLAAEREPG
G
>MAP3711c hypothetical protein
MRENRPMPASEAPPGLRERKKIKTRQAIRREAFRLIEQNGYAATTVEQIA
EAADVSPSTFFRYFGSKESLLLADDLDPLILDALKAQPRELSLPQAFRRA
YATVLSQLPDDQLEFESTRQRLMFSIPELKAAMYDEYYRTVTVAAEAIAH
RIGRAADDFEVRVFAGALTGAMMAAFDSAPTTPETIYRALDFMDAGMPLD
>MAP0458 hypothetical protein
MLLAIDVRNTHTVVGLISGSKEHAKVVQQWRIRTESEITADELALTIDGL
IGDDSERLTGAAALSTVPSVLHEVRLMLDQYWPSVPHVLIEPGVRTGIPL
LVDNPKEVGADRIVNCLAAFHRFQSPAIVIDFGSSICVDVVSAKGEFLGG
AIAPGLQVSSDAAAARSAALRRVELARPRSVIGKNTVECMQAGAVFGFAG
LVDGLVGRIREDVPGFGGDDVAIVATGHTAPLLLPELDTVSHYDQHLTLH
GLRLVFERNRDAQRGRLKTAR
>MAP3917c hypothetical protein
MDARRGTGPVRSLMFSWPDPGTRVTVRYRRPPGSVPPLTDAVGRLLTVDP
TVRVQTKTGAVVEFAPADVVALRVLTDTPIRTSQIRALEHAAAWPAAEQT
WLHGWLLRAGPPPGRPAGPSTDSAVPLDLSAHAEAIPEIAGWYQRRGLTP
RLAVPDRMLTPPPGLTCERTEQVVVQDLSRPSGDEPGVRVDASVAQAAVS
AAPDGTRWVGLWVPPGGHDAEARAGCEALLAWGAGRGATRAYLAVPDDDT
AALELAGALGFRLHHRRRYLTVPAGPVG
>MAP0061c hypothetical protein
MLELAILGLLIESPMHGYELRKRLTGLLGAFRAFSYGSLYPALRRMQAEG
LIAEDAAPAGTPVRRARRVYQLTDEGRRRFGELVADTGPHNYTDDGFGVH
LAFFNRTPAEARMRILEGRRRQVEERREGLREAVARASSSFDRYTRQLHQ
LGLESSEREVKWLNELIAAERAMPGLSEQT
>MAP1783 hypothetical protein
MTAVEDRPLRSDAARNVERILRAAREVYAELGPDAPVEAVARRAGVGERT
LYRRFPAKADLVRAALDQSIDEDLTPVIDSARHAKDPLHGLTRLVEAAIS
LGAREQNLLAAAHRAGSLTSDISVSLNEALGELVREGQRAGRIRADLVAD
DLPRLVAMLYSVLSTMDAGSEGWRRYVALVVDAISVDERRPLPPAAALRY
ASQPNSWPL
>MAP1641c hypothetical protein
MVVVTGVDRVFLALANPVRRELLQLLARHPLSAGELSERFELSRPAVAEH
LKVLREAALVADERRGRQRIYHLTAEPLAELGEWLHPFEKFWRTRLRRLA
EVAEELE
>MAP2394 hypothetical protein
MMEVPVVAKQATADKRQRRERGSINPDDIITGAFELAEQVSIDNLSMPLL
GKHLGVGVTSIYWYFRKKDDLLNAMTDRALSKYVFATPYVEASDWRETLR
NHARLMRKTFMGNPILCDLILIRAALSPKAARLGAQEMEKAVANLVQAGL
SPEDAFDTYSAISVHVRGSVVLQRLYEKNQSSDVGPRAIEDAVAIDPEKT
PLLAQVTRIGHRIGAPDETNFEYGLTCILDHASRLIEEGKGAKAGPSRQR
KPAKSSAARGRTKAPANR
>MAP3671 hypothetical protein
MPTVTWARVDPARRAAIIEAAESEFGAHGFSRGSLNVIARKAGVAKGSLF
QYFADKRDLYAFIADIGSQRVRGYIEQLIRELDPARPFFEFLTDLLDGWV
AYFADHPHERALHAAATLEVDTEARISVQNVIHRHYLEVLRPLVRDARLR
GDLRADSDTDALLSLLLLIFPHLALAPYMRGLDPVLGLDEPTPEQPALAV
RRLVAVLAAAFSPAPAHFATPRSEEVT
>MAP3544c hypothetical protein
MNIRTPLLRPGRAVKGLLPVCGRVHIVNSRTPSSRVRGSGRERSRESQSR
EERKEATRRAIIAAALKLLQDRSFSSLSLREVTREVGIVPAAFYRHFESM
ESLGLVLIDESFRSLRDTLRDARAGKLDPNRVIESSVEILVASVADRREH
WRLIARERNSGLSVLRYAIRTEIRLITSELATDLARFPGLHEWTTEDLNV
LATLFVNAMIVIAEAIEDSQSAEAREDIRRLAVKQLRMIATGIAGWHSTP
>MAP1259 hypothetical protein
MAPVNDLELTLSQARTREPPMLTIGEVARRSGVAATTLRYYEQIGLLPAP
IRLGGQRRYDEAVLSRLEVIALCKTAGFALDEIQRLFADDAPGRPASRAL
ARAKLAQIDAQLESLARARAVIEWGMRCTCPSIDACTCGIHPTRPAPVGP
DVRSARRF
>MAP4317c hypothetical protein
MTDPAPEIAVLLVDDQDLVRSGLRRILRRKDGFVIVAECADGDEVPAAIA
AHRPDVVVMDLRMRRVDGIEATRRLGGRPPVLALTTFNEDELLSGALRAG
AAGFVLKDSSAEELIRAVRAVARGEGYLDPAVTSRVLTTYRKAAPGPRGA
AIAELTTRERDVLTLIGKGLSNSEIADELCISGVTVKSHIGRIFGKLDLR
DRAAAIVYAYDNGIVAPR
>MAP0376c hypothetical protein
MRDSGRFGPIEWARAGRPLPSEYTSGDRGIAIDIDGEAALFALVDGLGHG
PPAAEAALRAVDTVTAAGAEPIEVLIQLCHRALEGTRGVAMTLARIDFAA
NTLTWTGVGNVTAHLVAKAPTGTEVRSSARLAGGIVGYRIPEIRPAQVVS
IRPGDLLVMRTDGTNTWTTSTSPPPPWPSPKVCWARTPRRPTTPWCWPPG
TGEPRHE
>MAP3967 hypothetical protein
MAPEEKLSAKVSNAASDMASDIGSFIRSQRELAQVSVRQLAEKSGVSNPY
LSQVERGLRKPSADVLNQIAKALRVSAEVLYVRAGILEPSDKSQVRDAIV
TDTAITERQKQILLDIYASFTQQNEATHEECPTQPDSE
>MAP3340 hypothetical protein
MYRHCPALTTFAAMRTHGWQGEPPRSDEDAVARIVAAADRCVRRRGGQTT
IADVAAELGVSRKTVYRYFPSTTALLRAVATEGTRKFLEAMAERLSTIDD
IAEAVVEGVVLTVTAVPDEPYLQLLLEEPSHALLRSITSETARRVGQQVL
VEHTSVDWERTPMASRTLDELVEWSLRAVHSFLANPSDPPRDADQLRGYL
RRWLAPAIRQWVQQAAAGTIV
>MAP2775 hypothetical protein
MAARPSPQTERVVNLFELLAADGSAGITLAEVSRRLHVHKASCHSMLSEL
LRAGWLLRDPVRKTYHLGPALVRLGREAAARYPAPVLARSVMDELAAATG
AHCVAFSVSEDYSTVVDQVRSRRGGGHPMPIGTQFPHRPPYGASTVAWAG
TEDRERWLAALPDEVRDRYREAIAAARQRGYAVGLHLLPDLRLQELALLV
RSAEVRSTRLSQLAQALTDELIHQEEWFPISLAPDRSYEVSHIDSPILGP
GPSIALMLSLVPSAEPMSGAAVTRLGTQLAAATRRLSEALADESG
>MAP0725 hypothetical protein
MPVARRYDTLLAKGEDRKQRILDVAQRLLTRNGWRNTTLAQIAGEAGVTP
AGLLHHFESKEQLLHAVLDARDLDDDTHADRTGDLFDQIAAVADRFYRAP
ELVGTFTVLLVENILPEAPLHDRMLARHRAATDIVADLIRRGQADGRYRA
DIDPAVKAVEILAFVHGMETTWLLDPSIPLPEVFKQYAETLARDFAPPKR
AETT
>MAP3223c hypothetical protein
MRLFRGNYDTVGSVMKADPSGLDKAPGAGRPRDPRIDSAILSATAELLVQ
TGYSNISLAAVAERAGTTKSALYRRWSSKAELVHEAAFPTAPTALEAPAG
DIAADMRMMIEATRDVFTTPVVRAALPGLVADMTADPALNARVMSRFADL
FTAVRVRLREAVDRGEAHRDVDPDRLIELIGGATMLRMLLRPEEELDDAW
VEQTTAIVVHGVRR
>MAP4139 hypothetical protein
MAAQPEPPTAAGRQRAGRPAARPAKLSREGIIDGALTFLDREGWDALTIN
ALATQLGTKGPSLYNHVDSLEDLRRAVRIRVIDDIITMLNRVGEGRARDD
AVLVMAGAYRSYAHHHPGRYSAFTRMPLGGDDPEYTAATRGAAAPVISVL
SSYGLDGEEAFHAALEFWSALHGFVLLEMTGVMDDIDTDALFSDMVLRLA
AGLERRTAHG
>MAP2442 hypothetical protein
MAVGDYEWFITLAELQHVTAAAEQLHVAQPTLTRMLARLEQRLGVALFDR
HGRRLSLNTYGRIFYEHARRAQLELDSARRAIADLADPAVGEIRLAYLSS
FGSTVVPRLIARFKESSPRVTFTLEKGAAESIADRVRSGGVDIGVVSPRP
VERTLAWRSLFRQRLGVAVPDGHRFARGAAVSMVDLADEPFVTMHPGFGM
RRLLDELCAAAQFRPRVALQAPNLTTAASLVAAGLGISLVPIDGSSYPSG
VSVKPLADADAYRDVGMIWDSGRPLPRSARDFIAAAAAVGRGGW
>MAP1470c hypothetical protein
MTAERIARSDRSTGTREAILSAAEVLFAERGMYAVSNRQISEAAGQGNNA
AACYHFGTRVDLLRAIEGKHREPIEKLRAQMLAAVGDSTELRDWVGALVR
PLTDHLSALDTPSWYARFAAQAMADPSYRHVVTKDALSSPLLVRTLDGIN
RCLPELPRRVRAERMVMVRNLLMHTCAEHEGALAEHGPRSRSAWPVAAEG
LIDAIVGLWRAPVHVAAAAGPTAQTTAPATGQTDHRGAP
>MAP3317 hypothetical protein
MSHRIRHAAPHDAAAITEMIHALAEFERAADQCMVTETRLAAALFDAAPT
VHGHVAEVDGTVAGMALWFVNFSTWDGVAGIYLEDLFVRPGFRRRGLARA
LLAALAAECVRRGYTRLSWAVLNWNTDAIALYDGIGARPQREWTTYRLSG
PRLAELAESS
>MAP2725c hypothetical protein
MGDPLTKSRKLLAPSAGESAGRRRAVLRLLRASPEPMSIAGIADVLGVHP
NTVRFHLDSLVADGQVEHVELDRKGPGRPPLMFRAVRQMDRGGTRHYRLL
AEILAKAFAAETDPAPKALAAGRAWGQQLDAERRRLPRDAAGAEQAIAQL
VDVLDELGFAPERRVIDGEQQVGLRHCPFLELAENSSNVVCPVHLGLMQG
AMETWGAPVSVDRLDPFVEPDLCLAHLTLQGAGR
>MAP4080c hypothetical protein
MVSVVGKNTSFSLDEHYSAFIESEVASGRYRSASDVVRSALRLLEDRETQ
LRALREALEAGERSGTSTPFDFDTFLDRKRTEASDGR
>MAP3099c hypothetical protein
MPRPARPHSATKPGAKVDARSERWREHRKKVRNEIVDAAFRAIDRLGPEL
SVREIAEEAGTAKPKIYRHFHDKSDLFQAIGERLRDMLWTAIFPSIDLKT
DSAREVIRRSVEEYVTLVDKHPNVLRVFIQGRSTGTPQSTVTILNEGREI
TLAMADLFDNELREMELDHAAVELAAHAAFGSAASATEWWLGPEPDSPRL
MSRAQFVAHLSTIMMAVIIGTAETLGITMNPDQPIHDAVVRKPAVS
>MAP2032c hypothetical protein
MRRLTEPDPVVLAQVDNPDEITARILAATLEQAELVGMRRTTMEDVARRS
GVGRATLYRRFPTKAALVDAVVLAEARRYLEGDAQAWAQGATFEDRMVNS
TVFSVTFMREHALLKKLLRTEPETILPSLTVDAGVILDYATEYTVGLLRR
ELYGTETTPAQERHLRTVAELHTRLTLSFIVTPHTSINLATIEDTRSYVR
NYLMPMVVGPR
>MAP4127c hypothetical protein
MPDSYNIDITLLLRRYAGEGTIATMTSQPAPQRNVRDGMLAAAVALLHEH
GPDALQTRKVAGAAGTSTMAVYTHFGGMRGLIAEVAEEGLRQFDAALTVP
PTDDPVADLFVIGAAYRRYAIRHRHMYRLMFGSTSAHGINAPAHNVLTLT
VAEIEQNYPSFAHVVRAVHRCMRAGRITAGPADDDGAVVATAAQFWASIH
GFVMLELAGYCSDDGSAVLPVLGSMTSNLLVALGDSPKRVSQSMRSAMLG
S
>MAP0938 hypothetical protein
MAGLTGARADELATMDIFAGCPAEDLAPLARRLRPLRAPAGQVLMRQGEQ
AVEFLLISSGSAEVKHVGDDGVVITEHTSPGMIVGEIALLRDIPRTATVT
TAQPLTGWTGDTEAFASMVHIPGVMPRLLRTVRQRLAAFITPIPIQVRDG
THLLLRPVLPGDSERTVHGHIHFSSDTLYRRFMSPRLPTPALMHYLSEVD
YVDHFVWVVTDGADPVADARFVRDENDPTVAEIAFTVADAYQGRGIGTFL
ISALSIAAEVDGIERFSARMLSDNVPMRAIMDRYGAVWQREDIGVITTVI
DVPHRRDLPFRKHEADEIRRVARQVIETVG
>MAP1479c hypothetical protein
MRHRRSSPAAPPTGPARRSDRRPGQTIRKVLDAGLQELRESSYAGLTMRA
VATRAGVSPASAYTYFPSKSALVAAVYLRFLRDLPVHTDVNETAQRRVSA
TLRDMAVVVADEPELTAACGAALMADDPAVKPLREQIGEEVSRRIAAALG
PGWPRAVKSTLQMTFSGALMTARFVSFAEISGQLDEAVDLILGAARAS
>MAP3230c hypothetical protein
MGLTTMTVRDDGAPVGAPAEAVRSDYGPVDPVLAVNGERRARPVAVEYKQ
FHGADAQSARRFFATAYDPGWRIAGVTNHCVVSHRRSDTGSMTVDEVAIQ
GRMTLEIPAGDTVVVIQPRAGALNVAGGPLATPDFPLLVAHGMACVLHCH
GARFDVVTIAAEVLQKVAAEWHTALSQQTQFLNWRPRSRAAVRAWHRALD
YAMATLASPDTARQPLIAAGMAGLLAGALLECYPSNLTEQDPVSDLALPE
TLKEAVSFIHRHATEDIGINDVAAAVHLTPRAVQYLFRRQLDTTPTEYMR
RVRLSRAHQELVAATTASSTVTEIAQRWGFAHTGRFAVLYRQTYGQSPHT
TLRQQTAV
>MAP0666c hypothetical protein
MPTTDRAGRSSKTGAATRRSRRNTDAGARTKLIEATARLMREEGYAAATS
RRVAAEAGVKQALVYYYFPTMDDLFVEVLRAGAESALTRMRALLTEDDPL
QALWLMNSDTALTALNAEFMALANHRKAIGAELKAYAERVRDIETTAATM
VLRANGIDLEEYPPVAISMLIAQAARSLCNEKAVGVTQGHDELRVFVQRQ
LSRLTAPAAASTTSG
>MAP2633 hypothetical protein
MRIAVLSGAGISAESGVPTFRDDKNGLWARFDPYELSSTQGWRDNPQRVW
GWYLWRHYLVADVAPNAGHRAIAAWQDHAEVSVITQNVDDLHERAGSRPV
HHLHGSLFEFRCARCAKPYTGELPAMAEPALEVQPPVCGCGGLIRPDIVW
FGEQLPDEPWRRAVEATESADVMVVVGTSAIVYPAAGLAELALSRGAAVV
EVNPEVTPLSASATLSIRETASQALPGLLQRLPALLN
>MAP2985 hypothetical protein
MAASRSNDWGPVSVPVSSLDPRAGNDHSDRSGALRGWQRRALVKYLAGQP
RDFLAVATPGSGKTTFALRVAAELLGQRAVEQVTVVVPTEHLKVQWAQAA
ARHGLALDPRFSNSNPRIAPEYHGVMVTYAQVAAHPTLHRVRTEQRRTLV
IFDEIHHGGDAKTWGDAIREAFGDATRRLALTGTPFRSDDSPIPFVRYEA
GPDGVRRSQANHTYGYPEALADGVVRPVVFLAYSGEARWRDSAGEEHAAR
LGEPLSAEQTARAWRTALDPAGEWMPAVIAAADQRLRQLRAHIPDAGGMI
IASDRVAARAYATLLTKITSETPTVVLSDDPGSSARISEFAASTSRWLVA
VRMVSEGVDVPRLSVGIYATSASTPLFFAQAVGRFVRSRRPGETASIFLP
SVPNLLQLASELEAQRNHVLGEPHRVSEGDPLDGDPATRTQNEKSELDNG
FTSLGADAELDQVIFDGSSFGTAAPAGSEEEADYLGIPGLLDAEQMRALL
HQRQDEQLQRRAGQPSAGDAPPATVHGQLRELRRELNTLVSIAHHRTGKP
HGWIHNELRRRCGGPPIAAASREQLRARIDAVRRLNAEHS
>MAP4202 hypothetical protein
MRTPVHDVGPPEDRYAMWDAAYVLGSLSAAQRREFEQHMAHCRGCREAVA
DISGVPALLSRLDHDEVAAIDEAGPAPQLSADLLPSLLTAVRRRRRRGRV
ATWVASAAAAAVLAIGVFVGLEGWSSTPARQVTASAEPMAQVGTTLLTST
VQVSGQHWGTSINLRCVCLAPLNAHHDTLAMVVVGRDGSRTRLATWVAVP
GHTASPAGSISTPVDQIAAVQVVAADSGQVLLQRSL
>MAP3271c hypothetical protein
MVKVFLVDDHEVVRRGLCDLLSSDPDLQIVGEAGTVAEAKARIPAARPDV
AVLDVRLPDGNGIELCRDLLSEHPDLRCLMLTSFTSDEAMLEAILAGASG
FVIKDIKGMELARAIKDVGAGKSLLDNRAAAALMAKLRGSAAQADPLSGL
TEQERTLLGLLSEGLTNRQIAARMFLAEKTVKNYVSRLLAKLGMERRTQA
AVFASKLNQQAGRPPTPPDWPG
>MAP2834c hypothetical protein
MHCPFCRHPDSRVIDSRETDEGQAIRRRRSCPECGRRFTTVETAVLAVVK
RSGVTEPFSREKVISGVRRACQGRQVDDDALNLLAQQVEDTVRAAGSPEV
PSHEVGLAILGPLRDLDEVAYLRFASVYRSFESADDFEREIQALREHRGV
ATPG
>MAP2166 hypothetical protein
MTSPDAADLTARFEAARPQLGALSYRMLGSIDDAQDAVQEAWLRLDRSGG
ADIANLDAWLTTVVARICLNMLRDRRTRDGAEPAAHLPDPIVEPAGEFDP
EHRAMLADAVGMALFVVLDTLPPAERLAFVLHDVFAVPFAQIAAIVDGTP
ESARKLASRARRRIERADPVPDGDLAAQREVVDAFFAAGRSGDFDRLVSV
LHPDVVLRGDFGPATATVRVQGAAAVAKLARSYAGPEREAWAATVNGAAG
AVIFVAGRAASVMGFVVRGGRIRAIDVLADPDRIAKLELGALRADH
>MAP3599c hypothetical protein
MLCGSHYGVDGLLTSPRRHRSKRKPRPAKPLKALASPVNAPMSMQPRSRR
RPLRRAQLSDEVAGHLRAAIMSGALRPGTFIRLDETAAELGVSVTPVREA
LLKLRGEGMVQLEPHRGHVVLPLTRQDIEDIFWLQATIAKELAAAATDHI
TDTEIDELDRINDALAAAVGSGDAETIAGLEFGFHRVFNQASGRIKLAWF
LLNAARYMPVLVYAADPQWGRAAVDNHRQLIAALRRRDTAAVIEHTVWQF
TDAAARLTEMRDRTGTPSNRG
>MAP0035 hypothetical protein
MSRESAGAAIRALRESRDWSLADLAAATGVSIMGLSYLERGARKPHKSTV
QKIENGLGLPPGTYSRLLVAADPEAELARLMAAQPAAEMPARRTGPVVVD
RHSDTEVLEGYAEAQLDALKSAIDRLPATTSNEYETYILSVIAQCVKAEL
LAASSWRVAVNAGADSTGRLMEHLQALEATRVALLKRMPTSLSARFDRAC
AQSSLPETIIAALVGVDVDEMWDIRNRGVISPSALPRVRAFTDAVESASR
NGASRGDDEEGS
>MAP0775 hypothetical protein
MTSASSGASSGASSGARRIGAPDAKNRGLLLDAAERLMLEEGYAAVTSRR
LASRAGLKPQLVHYYFRTMEELFLEVFRRRAEEGLAVQAQALQSDQPLWA
VWRFGTDPAFTQISMEFMALANHRKDMRAEIAYYAERFRDEQQRAVAAAL
ERYGVQNKDVPPVVWTVLMTSLSRFLVLEQAIGMSSGHAETVQLVESYLR
RLEGEPKPIAGVPSSWTVHQGRIEQSTPLGPSLDTTTAPSTVRSQ
>MAP0932c hypothetical protein
MCDDHAMRRHGWSGDIPADDEEAVARILTATRRAIDERGTVSVSQVASTL
GVTRQTIYRYFPTHEALLGATALSAVEGFLDRLAAHLGSITDPTGAVVEG
IAYTFEQLAHDRYLSLVFQPGKASAFTAGVTSDIAISFGRSILQRFEVDW
GAAGFAGTGLDELVEVMLRTLQSLIVDPGRPPRTGAELRRFLQDWVAPAV
RAHAISR
>MAP2559 hypothetical protein
MRSADLTAAARIRDAAIEQFGEHGFGVGLRRIAEAAGVSAALVIHHFGSK
EGLRKACDDHIAEQIRESKTEALQTNDPAVWFGQLAEIEEFAPLIAYVLR
SMQTGGDLAKMLWRQMIENAEGYLEEGVRAGTIKPSRDPRARAKYLGITG
GGGLLLYLQMHDNPTDLRAVLRDYSRDMILPALEIYTEGLMADRTMYDAF
LAAEDQGDSPWQLTVLRPSRSAGSPRTSVRCGHWTVSTSPCGREKSTVSW
GPTAPASRPRSASCWDW
>MAP3661c hypothetical protein
MSSTSRATNSPARCPTCGAAPSASAHVTCTGRHCGIRTRPDAGPGLTDRA
DHQAAGPAKRGGTRTKMLASAAEVMRERGAAGVTIDAVLARSGAPRGSVY
YHFPDGRNQILAEALRYSGDSITALIDQAAGRGGRALLREFVEYWERLLT
EGGFTAGCPVVAAAIGCSEDEGPKLSAEAGAILGRWCTALTRAFVNDGFD
DADAASLAVMSIAALEGGIVLSRSTRNAGPLHQVGEQLEFLIKAREFVVR
NKIPGKQPGSR
>MAP3112c hypothetical protein
MNAEPRTGPAKTLASALARDIEAEIVRRGWAVGESLGSEPALQQRFGVSR
SVLREAVRLVEHHQVARMRRGPNGGLYICEPDAGPATRAVVIYLEYLGTT
LADLLNARLVLEPLAASLAAERIDEAGIARLRAVLHAEQQWRPGLPMPRD
EFHIALAEQSKNPVLQLFIKVLMRLTTRYALQSRTDSETEALEAVDHLHT
HHSRIVAAVTAGDPARAKTLSERHVEAVTAWLQRHHAGDRNRGRTPRRPL
NSEVPQGKLAEMLAATIGDDIAADGWRVGSVFGTETALLQRYRVSRAVFR
EAVRLLEYHSIAHMRRGPGGGLVIAEPAAQASIDTIALYLQYRDPSREDL
RCVRDAIEIDNVAKVVKRLAEPQVAAFVASRRSGLPDDSRQTPDDVRRAI
AEEFDFHVGLAQLAGNAPLDLFLRIIVELFRRHWSSTGQALPTWSDVRAV
HHAHLRIADAVAAGDLSVASYRLRRHLDAAASWWL
>MAP2519 hypothetical protein
MPGARRSTDWLSGHRNEAAVDRILDAAQELYTQRDSESIGMNEIARAAGC
SRATLYRYFENREALRTAYVHREANRLGRAIMEQIDGIDEPRQRLTASIT
TTLRMVRGNPALASWFAVTRPPIAGELAEHSEVITALAAAFVGSLGPEEP
AVVERRARWVVRVITSLLLFPGHDEADERAMIEEFVVPIVTPVSARG
>MAP2856c hypothetical protein
MPPLVREVIGDVLREARTTQGRTLREVSDTARVSLGYLSEVERGRKEPSS
ELLNAICDALDVPLSSVLTDAGERMASEERAAVAAPAGPSIDASTKVVIP
PVASLALA
>MAP3195 hypothetical protein
MRAGPLMPAELLTAKGRQTRQAIEQAARKLFAERGFHGTTLADITSAAGK
SPAVFYRYFADKEDLLAALAESFLHEVVTPSGLSVHLPDSPDDDAFFTAV
VTGYWNMFKQNIGIMIAVAQLAATQQRFAAVQNEFRRFGIDLVAASVRRA
QEQGYGAELHPQHTAAAIALLFENFTTVFVGPSGLGIEISDEDAIATLST
IWKKTLYGV
>MAP2778c hypothetical protein
MVTVTAKGPGAPIVSGDPILGIVVDMLDTGGYEAVQLREVARRARVSMAT
IYKRYRTRDELIVAALEGWMDANRYARLPSLIDELPGESMYSDLMHVMRT
IFEPWERHPLMLRSYFQARSGPGGKRLIRRGVEAVVPVAKSVLAQADPGF
VNDLELILTGVVFGFLTRFAQGEIEVTEIVPGIERTVYWLTQAYQNADAG
RQPHISGGGISQPSEGSTSSLYCGQ
>MAP0930 hypothetical protein
MAEQLAAESGPPAVTIRALSEATAISNGAIYHAFGSRGGLLGHVWLRAGK
RYLSFQRAAVAKALSRGVSGEAAVDAVVAAAECPATFSHEHPASARFLLR
VRRDELLGAADVPAELADELRRLDDDLAHLFIELAEKVWGRRDQQAVGAI
RTCVVELPTALLIDAHALHDAAALQRLAAAVRAVLALTPPTTSTKPKPRP
TLTKGAPR
>MAP1732c hypothetical protein
MRRAEMAATKQPKRRKRADGEMSRERILDAATEIAAERGYEATSIGLVSA
KCGLPASSIYWHFKNKDDLIAAVIERSFADWRKAWQVPDEGAPRDRLAGL
AMQIAKVLMDSPDFIRLGLMLALERRPVEPRARAMFIQARAQAYDELADI
VRELAPGLTDKQIDQLVTYAIAGADGLFIAREIGGDAVNLVGLFELHAGA
LYDAALRMISENAGQ
>MAP3882 hypothetical protein
MTPESSPPADGRLRFVPATHDDPLARPLLAELAVEYAQRYGGTPSAHLSW
LPVPRDELAPPDGGLLIGVVDGVPVTGGAFRRYDAETAELKRIWTDAAHR
RRGYARALLAALEERTRACGYRRLYLITGNRQPEAEALYDATGYTRVPPD
PLPDWGPFRPIAFEKWLVPDER
>MAP4196 hypothetical protein
MTDQDRKAARREIADALLKALERRHEIADVVVESENKAAAVEAIVRLLDT
SHLAAEAVMGMSFDQLTIDSRRKILAELEDLNKQLSFTVGERPASSGETL
ELRPFSATEDRDIFAARTQDMGSAGDGSGGPAGDLDDEIRAATARLDDEE
AAWFVAVDSGQKVGMVFGELLNGEVNVRIWIHPDFRKRGYGTAALRKSRT
EMAWCFPAVPLVVRAPAAKPA
>MAP2138 hypothetical protein
MSPSPASMPAATDGAMGPEPGLPAHDHDGLAAAGHPGPADYPVPPPRDIL
DAAGELLRALAAPVRIAIVLQLRQSHRCVHELVDALGVPQPLVSQHLKIL
KAAGVVAGERSGREVLYRLADHHLAHIVVDAVAHAGEEPRP
>MAP4336 hypothetical protein
MPGSPPTGPLPPVPAGIDRRRPELSDSALVSRSWAMAFATLVSRLTGFAR
VVLLAAILGAALSSAFSVANQLPNLVAALVLEATFTAIFVPVLARAEQSD
PDGGAAFVRRLVTLTTALLIVATALSVAAAPLLVRLMLGRTPQVNEPLTV
AFAYLLLPQVLAYGLTSVFMAILNTRNVFGPTAWAPVVNNVVALATLAVY
ALVPGELSVDPVRMGNAKLLVLAVGTTLGVFAQTGVLLVALRRQHVDLRP
LWGIDQRLKRFGTMAAAMVLYVLISQLGLVVGNQIASTAAASGPAIYNYT
WLVLMLPFGMIGVTVLTVVMPRLSRNAAADDTRAVLADLSLATRLTLITL
IPIVAFMTVGGPAMGSALFAYGHFGDVDAGYLGAAIALSAFTLIPYGLVL
LQLRVFYAREQPWTPIVIILVITAVKILGSMLAPHLTGDPKLVAGFLGLA
NGVGFLAGAVIGYVLLRRTLLPGGGHLIGVGEVRTILVTLTAAMLAGLVA
HVADRLLGLGALTAHGGGAGSLLRLLVLALIMVPITAAVMLRAQVPEARA
ALDAVRFRITGRGPRPRKPAAPDRSSHRRPVTYPEQRNSSPPGVNAVQEP
IRRRPPERANRARLVKGPEVTDRPMESAASSAGPGTGSGAPRPVADDFQP
DIPADQPDRPRKADPRPADQKNGDVGTRRGPLDVPRERTADSSTDDVHLV
PGARIAGGRYRLLVFHGGAPPLQFWQALDTALDRQVALTFVDPDRALPDE
VLQEILSRTLRLSRIDKPGIARVLDVVHTGSGGLVVSEWIRGGSLQEVAD
TAPSPVGAVRAMQSLAAAADAAHRAGVALSIDHPSRVRVSIEGDVVLAYP
ATMPDANPQDDIRGIGAALYALLVNRWPLAESGVRSGLAPAERDSSGNPV
EPMAIDRDIPFQISAVAVRAVQDDGGIRSASTLLNLLQQATAVADRTEVL
GPIDDSPSPSTALISPGNDPATFARRRRNVLIGVGAGLAVLVAALLVLAS
IVSKIFGNVGGGLNKDELGLNGPSSSTSAPQTTTSTAAGSVVKPTRASVF
SPDGDADNPGTAGQAIDGDPSTAWATEVYTDAVPFPSFKQGEGLILQLPS
PTVVGQVSIDTPSTGTKVEIRAASSPTPAGLNDTTVLAPAFTLKPGHNVI
PVRAGSPTSNLLVWISTLGTTNGKSQAGFSEITVQAAS
>MAP3552c hypothetical protein
MTATDGEPDLGEIDPFRLRLLDGLATSIGERGYRASTVADVVRCARTSKR
TFYDHFAGKEECFLELLAVDIEKLGAKIAASVDPEADWHVQIRQAVEAYV
GYIEARPAITLSWIRELPSLGAAARPVQRRGLQLLTSLLIDLSASPGFRR
AQLPPLTAPLAVILLGGLRELTALAVEDGKPVRGIVEPAVDASVALLGPR
H
>MAP2632c hypothetical protein
MELGDWLRVDMKAGKPLFDQLRTQVIEGVREGALPPGTRLPTVRELAGQL
GVAVNTVARGYRELESAAIIETRGRFGTFVARYDPTDAAMAAAAREYVRV
ARGLGLDKADAVRYIESVPDE
>MAP2678c hypothetical protein
MAPFIAAAVEISDPQHPARVRAREYKTSVAARLSETAREAGAADPELLAE
QLALLLDGASVRTRALGTDAFRTAAGIAAALVERAIPPTAR
>MAP3565 hypothetical protein
MGSAGTGHPRRWVLLKGRLVASVASAFVMAVTGVGWTGYHTALGRIIISH
ALPNGAMSLGDNQNILLMGLDSRLDQNGRPLPPEIYDALHAGDESSGGYN
ANVLIIVHLSDGDGPVTAVSIPRDDYAELPGCPGSVCAGKIKQAYGLAYQ
QSLNAQAAGNSGAGATDLAAREQTAREAGRKAEIGAVTDFLGISVDHFVE
VTLGAFFQIAKAVEPITVCLAGDTRDSYSGADFHQGVQRIDAAEAMAFVR
QRRDENDGSFTDFDRTRRQQAFLVSLVEAARSGGALSSVSGLRKLLQVAR
DNVAVDAGLDLMQFAARASQLAGRPLSLYTLPISGFGKDPNGSDINLVDL
PTIRRIVNERFMSDAPAALDAAVPGSQPPPAPAEPAVLDVVNATTHDGLA
AALEEALAGRGFTRGSATTAPTQAEDSSIEYGPGAEAAARLLADQLHLPA
TAQSAVAPGTVRLTVGTAFPAADYLAHPGASGSGQVSTVSATGTGDHAPA
PTDLSQMTATSVPCVK
>MAP3891 hypothetical protein
MGQPQSQMEPESKLRQRTEGRLDRSRDPAILDAALAALAEHGYDATNMND
IAARAGVGKAAIYRRWSSKAALMTDALIYWRPELLNDDAPDTGSLAGDLD
AIVKRAKRNDNALISNDLVLRVALEAAHDPELATALNDLILFKGRRVLSA
VLAQAADRGEIDPNRDWSLVADVLTAMGLLRAISGQRVDGKFVRRVIDTL
ILPAVRAAPE
>MAP0490 hypothetical protein
MSSRRPASRCTILMKIVSITKPAGEARTRAHRSAVASPSVSPTPRRRATL
ASLADELKVSRTTVSNAFNRPDQLSADLRERVLATAKRLGYPGPDPVARS
LRTRKAGAVGLVMAEPLTYFFSDPAARDFVTGVAQSCEALGQGLLLVAVG
PSRSLDEGTAAVLSAGVDGFVVYSVCDDDPYLQAVLQRRLPVVVVDQPKG
LTGVSRVGIDDRAAMRELADYVLGLGHREIGLLTMRLGRDRRQGLVDAER
LRSPAFDVQRERITGVWEAMTAAGVDPDSLTVVESYEHLPESGGAAAKVA
LEANPRITALMCTADILALSAMDYLRAHGVYVPGQLTVTGFDGVPDAISR
GLTTVSQSSLHKGSRAGELLLKPARSGLPVVELLDTELIRGRTAGPPA
>MAP1697 hypothetical protein
MRGDPTPRSARGVYGISVASELSGIPPQTLRHYERRGLLTPARSDGGTRR
YSDDDLARLKRISQLVDQGVNLVGVAHILDLESKNDQLKADYSALELLNA
KLKTRQNPERAAAPRQCRGEARREAV
>MAP1057c hypothetical protein
MTALLRPVRRQRGLTLEALAAQTGLTKSYLSKIERGQSTPSIAVALKVAR
ALDVDVGRLFSDETAREKIAVERAGGPGGRRYRVLASSMLGKTMSPFVIR
PTEQLADDPHPEHAGQEFLFVHAGRVELDYGDQTFTLGPGDSAYFDASVS
HKVRSVGPEPAEVVAVAHAEPGSAAFAG
>MAP3562 hypothetical protein
MRMLPRGNRRPGRPPAAKADETRQRIIQAARLVFSERGYDGATFQAIAAR
ADLTRPAINHYFASKRALYQEVMDETNEFVIGVGIKEADRETTLVGRLTA
FISAAVKANAENPAGSAFIFGGVLESQRHPEWNTAENDSVRIAREFLIRV
VNDAIEHGEVAADIDASALVETLLVVMCGVGLYAGYVETYQEMLAVTGML
RQLLEGALWRPGS
>MAP3414 hypothetical protein
MATARRRLSPEDRRAELLALGAEVFGKRPYDEVRIDEIAERAGVSRALMY
HYFPDKRAFFAAVVKDEADRLYENTNMDDVTGLTMYEEIRVGVLAYMAYH
EQNPEAAWAAYVGLGRSDPVLLGVEDDAKNRQMEHIMSRINQLLAEVPAT
REPDVERHLRVILHGWLAFTFEICRQRIIDPTTDADKLADACAHALLDAI
GRVPEIPEELGEAMANSRL
>MAP0155 hypothetical protein
MATATRERFLTAATGLFRRQGYSGTGLKQIVAESRAPLGSLYHFFPGGKQ
DLAVQAIAHTAERYRELLDRVFARSTDIGQATVTWFNWAARALQETDYAD
GCPIGTVACEVASSNEALRQATQAVFASWQSRVAAELIGAGVPTAQARRL
ATFAVASLEGAILLARAHRSTRPLRDTGRVVADTLRAATDRG
>MAP3978 hypothetical protein
MGHAPGPKPHRAVRRQIVPPALHIPESAAASVFRAVRLRGPVGRDVIANV
TSLSIATVNRQVIALLEAGLLRERADLAVSGAIGRPRVPVEVNHEPFVTL
GIHIGARTTSIVATDLFGRTLDTVETPTPRNPAGPALASLADSARRYLRR
WHRRRPLWIGVAIGGTVDSATGQVDHPRLGWRQAPVGQVLADALGLPVSV
ASHVDAMAGAELLLGMRRFLPSSPTSLYVYARETVGYALVIGGRVHVPTS
GPGTIATLPAHSELLGGTGQLESTVSDEAVVAVARQLRILPFAPANRASG
SAAGIADLLRVARAGNEQARELLAERGRVLGEAVALLRDMLNPDELVVGG
QAFTEYPEAMEQVEAAFAARSVLGPRDIRLTAFGNRVQEAGAGTVSLSGL
YADPLSAMRRAGALDARLREVARDESSA
>MAP1848c hypothetical protein
MGFECRIDEGDGMPSANSSSLRDRRRAELLSQIQQTAHQLFAERGFEAVT
TEDIAAAAGISISTYFRYAPTKEGLLVDPAREVAATIVGAYRTRPAHESA
VEALIQLFITHAKEVGAAGDFDTWRHAVATAPHLLRKSVLMSETDHSNLI
QQVAQRMNVTAALDIRPALLVHTSLATVQFVLDGWLSSDIDESRPFHEQL
EEALRITLAGFDRPSDATPGRR
>MAP4295c hypothetical protein
MSPRTSNGLPVTAYLVLGVLAANDERLTAGEIKMRAELSVGHFYWSPSVS
HVRRELTRLLARGMVAQTGSQSGKRAITLYETTDAGRDALRRWVQHFPGQ
DQVVIKHPVILKTWLARGEDPERIVDTLERHLDATRARLDEALWSRRRSQ
ELGITADPEQRFSFAVLDYAIRGLYAEISNISQLRDEIAGGTSRDPVKRV
RRSKGQIRRRAPRSPG
>MAP1631c hypothetical protein
MADEQADSRERLISGTRELLWDRGYVGTSPTAILQQSGVGQGSLYHHFRG
KHDLVLAAEQQAAADMQRSIKEAFAGNRSAHDKIADYLTRQREVLRGCSV
GRLTADPVIVGDDQLRAPVAQTFEVLHTCLTRTIREGQRSGEISVELEPH
KVAAAISATIQGGYALARAANSVEPFNRAITGILQLIEASTAHRRRTNPA
AHDAVPAQRRRTSRSKNPSNHKPAR
>MAP2404c hypothetical protein
MSVPDFAARPQLSDDVAHLIRGRIFDGSYAAGSYVRLDQLAAELGISVTP
VREALFALRAEGLISQQPRRGFVVLPVTRRDVTDVANVQAHVGGELAARA
AVNITDDQLRELKQIQAQLEDAYAGDEHERTVRLNHEFHRAINIAADSPK
LAQLMSQITRYAPESVFPAIEGWPEQSMKDHRRILSALKKRDDKLARAAM
SEHLAAGAVPLIDHLVARGVVVDAGGPDPG
>MAP2181c hypothetical protein
MTTASISSVSRAEERAAERSAAVQRSRERIANQVRLMLDAALRLIREKGD
SFTTQELVKEAGVALQTFYRYFATKDELLLAVIADAMTDACARWRDAARD
LPDPVARLRFYVTAVIEVLDNEQGDGGTAKFVVSTHWRLHRVFPDELAEA
EKPFVDLLLGEINAGIEAGLLAPADPKWAAWFIAELVRSVYHYYAYAPRE
VDVKEQLWQFCLTALGGTARKSRGSRSK
>MAP0335 hypothetical protein
MTRRPHPGDSTTRDLLIDATIKVMVEDGYAAATSRRVAGEAGVKPALVHY
YFPTMDELYLDVFRRGAAAYQERQTRALTSERPLHALWDTLIEPKDTRLL
LEFMGLANHRKAIRAELAAWFGRWRDTQITALNSIVREHDMDAGEFPPAG
IAVILAAIGRMLILEDALGATAGHDEAVALVNRFIDRFELP
>MAP2026 hypothetical protein
MKSLVGTSFGQYEIRRLIGKGGMGEVYEAYDTKKGRAVALKLLTDNYADD
EKFRERFLRESRAAAILQEPHVIPIHDWGEINGVLYIDMRLVQGQTLHEM
LKTGSLEPRRATDIIRQVASALDAAHAAGLIHRDVKPQNIIVTPDDFAYL
VDFGIAEARGDTHLTMAGHTVGTFDYMAPERFGDEETTSAVDVYALACVL
YEALTGAKPFPVHSAEQAIRAHLSSPPPRPSAVNPHVPASFDDVIARGMA
KHPDDRYGSAGALGRAAKRALAPDPATSAGTNTLLAPQYVSAPSSYPPFA
AQYPYPATGPVSATDADQGGSKKLMVLTIVGVAVALLVGGTGLVIGLTTQ
RNSSTSEPSTSPLVSYTNPVPTYETEPARLPSTPTSAPQDATQQLHQIAN
DDRAFVRAQLADRWVPQLSSKRPGVVDNGVVWDNAMTLREHLQLRQRYPN
VKLLWSGDWSTFSGPDFWVTVAGLTFADSSGPLAWCRFQGFDRDHCAAKL
VSTTHPEAGSTAYN
>MAP2031c hypothetical protein
MSSSSRGSRLGTRFGPYELRSLIGTGTLGEVYRAYDTVKDRLVALKLLRG
ELDAGFRQRLWRDCRAVTRLQEPHVLPLHDFGEMDGVPFIDMQLVDDGGS
LKELLREQGGLEPSRAASITGQVARALDAAHAAGLMHLDVKPENILLTHD
HFTYLADFGLAQAAGDDKLSRTYMAPERFTTGSLGPQTDIYSLACVLYEC
LTGQPPFEGADPGELRSAHLLSPAPRPSIMRRGVGRAFDDIITRGMAKQR
SARFGSAGELARAASEAVFAAYEPVSAAAGLGGPRPLPTPPAQFDGPDDT
LGPPAAERPPRGRVGRLPVVVTAVAVLMLIAGVVLSVKSVVGTHHNSSAP
PPAPSTRALAPPPPTTPPPPLTPTLSRPVTGADGLGFIGETARCDPGNPP
AAVVRTAKSLAVVCQNLSGSYYYRGERIRDGAHIELSNAERVEDGFDVTN
PVDGVVYEVRPNRLRIISFGHVDSSEPVLQYATAS
>MAP0509 hypothetical protein
MDRVTGPANSRRDELLELAAAMFAERGLRATTVRDIADSAGILSGSLYHH
FSSKEEMVDEVLRSFLDWLFARYREIIDSESNPLERLKGLFMASFEAIEH
RHAQVVIYQDEAKRLLSQPRFSYIEDMNRQQRKMWLEVLNQGVDEGYFRP
DLDVDLVYRFIRDTTWVSVRWYQPGGPLTAEQVGQQYLAIVLGGITVDAK
KD
>MAP1721c hypothetical protein
MFGHSSKGVRPSWRREKPLPRISREQKERNRGRILAAAGEGFKARGIDGV
GIDELMKAAGMSHGGFYNHFPSKEDLALEVLHQGFTDSLDTVAAVIDTHA
HSGRAALHAIIDTYLSTEHRDHPEHGCASAALAADAGRHGVKAQEAYRRG
LQGYIGAFADLLRVSARQRGTKLDARRAREQAIGLFSQMVGAQLIARAVA
HADPGLSDEILSSNSKALKRLRM
>MAP4300 hypothetical protein
MTGPQGNGNSSSNKAARPTTADVARLAGVSTATVSYVLNNARGRRISPDT
RDAVYRAAKLLGYRPNLAARNLARGKSGVVLYVVPHVAVGEMPMQAGSRM
TTALAREGLLQVQVFETDDDQHVVDAIANLDPVAVTSLFPLSAAARRAVT
EAGIPHIEIGTLPALNDPHLSVGEMRIAHLVERGHRRIAFAYTGIARWRP
LGDYWFEGVSRAARLRGLPPVRADEVTLDTAASVVSAWVRDGVTAVCAQS
DEIACLVLYGIHRAGLRCPGDLAVMGVDASPMGMVSTPPLTTVQFDPCAV
ADAALAAVFERLGHAAPPSPELTDIARLVVRSST
>MAP0179c hypothetical protein
MTTAGQTRALRGRRAIRPTGDEREQAILATAERLLETRPFAGISVDDLAK
GAGLSRPTFYFYFKSKEAVVLSLLEPVIARADAEFDGAVQRLPTDPRRVW
RNGIKAFFTAFSSHRALARAATEALATSSELRAVWSGFMQKWIDQTAAMI
TAERERGAAPDTIPAADLATSLNQMNERTMMAALSAETPAVAEDRVVDTL
THIWVSSIYGESG
>MAP2912 hypothetical protein
MTSGKDADNAALCDALAIEHSTIYGYGIVSAMSPPSVNDMVVEALEQHRQ
RRDDVIAMLTARKVTAPVAAAGYQLPLVVGSPADAARLAARMENDGAGAW
RVVAEHAETGDDRAFAATALVQSAVMAARWNRVLGAWPITTSFPGGND
>MAP1832c hypothetical protein
MPMATSKVERLVNLVIALLSTRGYITAEKIRSSVAGYSESVSAEAFSRMF
ERDKNELRDLGIPLEVGRVSPMDPTEGYRINRDAYALAPVDLTPDEAAAV
AVATQLWESPELITATQGALLKLRAAGVDVDPLDDGAPVAIASPAGVPGL
RGSEEVLGILLSAIDSRQAVQFPHRPSRAEPYATRTVEPWGVVTQKGRWY
LVGHDRDRDATRTFRLSRIGPGVTPIGPPGAVTVPDGVDLRRIVAQTVTD
VSATPAGGQARVWVADGRATALRRAGRPVGTRQLGGRDGEVIELDIRATD
QLARDIAGYGADAVVLEPQSLRDDVLARLRAHAGADAVGAGRPGGRGGVS
A
>MAP0191c hypothetical protein
MNGDFPPRGRRPGRQGPPRQPGPEPSPVIPRPGGPAPSPHAPTQPLHRPP
PAPPARPAPPARPAPPASAIRPARPRRKRRWVRTVASALLVVLLLAGGGL
GAGVAWFDTKLHREPVLADYPGRPEPGRGSNWLIVGSDSRQGLSAEQQQQ
LATGGDIGDGRTDTILLVHLPGVGSGVAPTMVSIPRDSYVPIPEHGKDKI
NAAFAMGGATLLVRTVEQATGLRIAHYAEIGFGGFTALVDALGGVRVCSQ
APMHDPLAGLDLPAGCQTLDGRNALGYVRSRDTPRADLDRMVNQRQFVAA
LLHRAASPTVWLNPWRWYAAPRAVADALTVDRADHAWDLARLGWALHGSP
ATVTVPIGEFTSGDAGSVVVWDHARAARLFDALSSDGQLPAGSTDEQQPS
S
>MAP2052c hypothetical protein
MAQRPSTRRGAAAAPASIRRPRGAPRQLLLDTARARFARQDYRSTTTREI
AQAAGVTEHLLFRHFGSKAALFREALVLPFTDFVAEFGRTWQAVVPEETD
EEELARRFVGQLYDVFVEHQGLLLTLMASEALSEEEKADAGIAEVRRAIT
TLGRISAEGMHLRGLRSDHPDLPAHSTVAMIAGMAALRSTYFGAEPPARE
VIVDELIQAILHGFLHRNG
>MAP2591 hypothetical protein
MSTDATPATDTRELIVAAAFTCFGRQGLQKATIVDIAKQAGVSRSTIYEY
FSDKAAIVEACAEHASERFYREMAKAMDRGGTLEERLCAAAVFVTQARRV
IASEKYFDEDAISLLLTKDAAVLLRECVEFFAPYLAAARLTGEVRKDLDV
EAAGEWFARILFSLFSTPSPTLNLDDPDVTAEFVRAHVVRGFASERPRRR
>MAP0692c hypothetical protein
MTGSGTGSQTLARGLTALQMVADAPAGLTVQQLADQVGVHRTIAYRLLTT
LAEFRLVAKGEDGRYRPAAGLAVLGASFDRNVRQLCLPTLRALADELGTT
VSLLVAEGDQQVAIAVIVPSHVAYQLSFHEGSRYPLDRGAAGIALLACMP
PRPGERELVSRARERGWVTTYGEIEPNTYGLAVGVRRPAPSPPTCINLIS
HREDVVMRGKDAVMRAAEQLSKLLS
>MAP0264c hypothetical protein
MSSLISDRVTAAVERALDDRQREATEEVERILAAAVRVMERVAPEPPRVS
DIVAEAGSSNKAFYRYFAGKDELILAVMERGVAIVVSYLEHQMAKEATPR
GKIARWIEGTLAQVADPHLISMTRAAAGQMSAGTSWRAADQEMMRPLREL
LVEPVAALGSSDVDRDVEAVFSCTAATLRRYVGSAQQPAPDDIAHVVRFC
LRGLGVR
>MAP2948 hypothetical protein
MSDISRRGAYGPGEARTSRGLEMSLDVVVLTNEDDFESALPDLSRFARPA
RRAALSTDGDGQFGPADVAIIDARSNVAAAQAVSHRLTADHPATAVVALV
APADCAAVDVDCNFDDVMLPGTCAEELQARLRLAIGRRRGGLDGTLKFGD
LLLHPASFTASLDGRELNLTLTEFKLLNFLVQHAGQAFTRTRLMQQAWGY
EGNGRARTVDVHIRRLRAKLGTRHQSLVGTVRGVGYRAPTPPQPEWIVGH
QKP
>MAP2883 hypothetical protein
MTRQTSGAGVAGRRPNRRGHATRESMLEAALRSLASGEPGSVSANRIAKD
IGATWGAVQYQFGDTDGFWAAVLHRTAERRAATFSSLSSPVSPEAPLRER
VGAIIETLYRGLAAPDSRAIENLRAALPRDPDELERLYPRTAAELFSWGK
SWLETCQQAFAGLAVDPDRVREVAALIPGAMRGLVSERQLGSYADLDMAR
RGLTNALAAYLEQSRP
>MAP2908c hypothetical protein
MRTCVGCRKRELAVELLRVVAVPTGNGEFAAIVDTAGNLPGRGAWLHPAP
QCAQQAIRRRAFTKALRITGSPDTSAVVEHIESLHAPERPATEQVAKNMS
TP
>MAP4312 hypothetical protein
MEPSRRWGDDRAILDDEEARRRILEAAGRCIVRRGNTQFRMGEVADEAGV
SRSTVYRYFPGRDDVLLGLMLRRVDHALAESVRSLPAPEDPVRSVPRMVL
ARVESVDGNPLNEALFAAESTAMASALQKGSEPLVELLLRHYGPLLERWK
AAGLLYPDVDFRSVVQWLHTATLFLLAPSWRYRPHADKRRFVEQFVVRAL
VPQIRQ
>MAP3341 hypothetical protein
MTIVSEMPYGASMQSPATPSARGGGIREARRRETRARLFDAALAEISQRG
LAAADVSAIVADAGVARGTFYFHFPTKEHVLVEVERAEETRIVSELGGAT
GDLAAVLNQLVQHVLSAEKPWAQRFSGTCSDCTSARRGRSKTSWRSTRSD
SSWWASSAGPRTPGRSPPTLTRPNSRRSSSPGCSRCWPPAHTTVRC
>MAP2023c hypothetical protein
MAPKPKSTRLSVEDWLEVGYTVLAEEGVRALKVERLCQQAGVTRGSFYWH
FEDIDNYRAALVESWNKFLERDRQALSELDSLPPRQRLSAMMGTLVSPQY
WMLERAMREWARLDPVAAENIRAADRHLLRTVTKAYRDYGFSPEDAKLRA
ELTFAAGIGLLHLTGSAEQAQSLAQRERFLDLMLTEAGVASDH
>MAP2980c hypothetical protein
MARTQQQRREETVARLLDASIATIIEVGYARASAAVITKRAGVSVGALFR
HFDTMGDFMAATASEVLRRQLESFTKRVADIPADQPVLEVVLGILRDLTS
GPVNAVIYELTVAARTDEKLRETLQHELAQYAAKIDEVSRALPGVEGFPA
DTFPVLVALMRNVFDGAAVVEGVLPQPEIAARRIPVLTALLNAALPP
>MAP0770c hypothetical protein
MLDAALDLFAANGVSGTSLQMIADAVGITKAAVYHQFRTKEQIVIAVTER
ELGRLVPALEEAEAHDDGPQARDALLVRVIEMAVRDRRLVRTLQFDPVVV
RLLAEHEPFQRFMDRLYRVLLSDAGLDGRIEAAMFSGALSTAVMHPLVVD
IDDETLLDRVTDLSRRLLGLPRKPIE
>MAP2504 hypothetical protein
MSDERSSKVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLLNES
FSSDPVFRERMKREARTAGRLQEPHVVPVHDYGEIDGQMFLEMRLVEGTD
LDSVLKRFGPLPPPRAVAIITQIASALDAAHAAGVMHRDVKPQNILVTRD
DFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSDAEVTYRADIYA
LACVLFECLTGSAPYRADSAGVLVSAHVMDPIPAPSARRPGVPKAFDAVI
ARGMAKKPEDRYASAGDLALAAKEALSTPDQDRAATILRRSQEAALPPRG
SATPGPARCWRHRLPSTPGSPVPLRHKLAGAGRRRAARSQPPGSPGRRTT
RAATPIGPLRQHNSPGRRRGAGRRPNGGCGRSSPRSSPCSSSSRAGWASG
W
>MAP3411c hypothetical protein
MSETLDDVDRILVRELVADGRATLAELAASARLSVSAVQSRVRRLEARGV
ITGYSARIEPEAVGHRLSAFVAITPLDPSQPDDAPARLQHIEEIESCHSV
AGEESYVLLVRVESARALEDLLQRIRTAANVRTRSTIILNTFYDNRVYVP
>MAP1528 hypothetical protein
MCQTDRVGKRQQSREQIEARIIELGRRQLVDRGAAGLSVRAIARDLGMVS
SAVYRYVSSRDELLTLLLVDAYSDLADTVDRARETAGEQWSDDVIAIARA
TRRWAVEHPACWALLYGSPVPGYHAPPERTVAAGTRVVAALFDAVAAGIT
TGDIRLTNDPAPQPMSSDFERIRHEFGFPGDDLVIAKCFLLWAGVLGAIS
LEVFGQYGADTLTDVEAVFDAQLRLLVDVLTRH
>MAP1543 hypothetical protein
METQPPTRARSREGTTVSEQPRQEQLNLAEDGNAGTGERSVTAAAAPVQP
GLFPDDSVPDELVGYRGPSACQIAGITYRQLDYWARTSLVVPSIRSAAGS
GSQRLYSFKDILVLKIVKRLLDTGISLHNIRVAVDHLRQRGVEDLANITL
FSDGTTVYECTSAEEVVDLLQGGQGVFGIAVSGAMRELTGVIADFPGERA
DGGESIAEPEDELASRRKHRDRKIG
>MAP2140c hypothetical protein
MSSLQERLASVLREVLPSQEESDGALTVHHEGTIASLRVVNIAEDLDLVS
LTQILAWDLPLTKKVSDHVARQARDANFGSVSLVEKVNKTAVQRNSGKNT
AKLADVMLRYNFPGAGLTDDALRTLVLLVLDTGARIRHTLTD
>MAP0201 hypothetical protein
MVRPAQTARSERTREALRQAALVRFLAQGVEDTSAEQIAADAGVSLRTFY
RHFRSKHDLLFADYTGLHWFRAALDARPADEPIIDSVQAAIFSFPYDVDA
VTKIAALRHEELDPGRIVRHIRDVQADFAEAIAAQLQRRGRGAGAPTADQ
RVRAAVTARCIAAAVFGAMEVWMVGDERSLGELARLCHAALESLRAGITD
SWSTATAPQVSS
>MAP1431 hypothetical protein
MASETDPASQKPALAATSWALLGMMSYEEEVSGYDLKKWIDWSVDLYYWS
PSYSQIYTELKKLEALGLVTSRVERDEGTRSRRLYKITPAGMDAVTQWTN
HAPVDPPVLKHSVLLRVTFGHLSNPGRLKELLQEHVAYAEARHRKAMEDA
EGAEAEPAWAYSVLALRWAAKYYAAEREFALEMIKEIDEVDAILQRTAKG
GYGKPRPTPGYWREVEKQVQAKREAE
>MAP1323 hypothetical protein
MPAARRRRGRPAGSRVDPDERRTALLDAAERAIRAKGPSAGLVDIAGEAG
FVRSAVYAMYPGRDALLAELSRRTAQRLLGEITTRAAGSTDLRQRMALFF
DVISAWMQAEPNLYRALTGDPSTSVFEQLAGAVEQMLTTSSGSEQARRAA
APWSRAIVGSAVSAAEWWCRAGTMPRAELVEHLTALCWDGGAALPFDADD
IRTIDSGPH
>MAP2658 hypothetical protein
MAKAESLRAGSTADSPTRRAEILATAASLIASSGLRTSLQEIADAAGILP
GSLYHHFESKEAILVELTRRYQEDLERIGRAAQARLDEPDSRPVPEQIIE
LGSAIANCAVEHRAALQMSFYEGPGTDPELTKLTRQRPVAIQEAMLQTLR
AGRWSGYIKPDIDLPTLADRICQTMLQVGLDVMRHTASADPVAELLCRII
LQGLASRPPSDTALDRSKAFAAANEVIETWSDDSDADPGDKAALVRAVAR
AEFGRRGYEVTTIRDIAAAAGLGTGTVYRVIGSKDELLDSIMRSFGKKVE
AGWVSVLRSDATPTEKLDALSWVNVNALDQFSDEFRIQLAWMRLSPPTAN
PGWSYATRLRQMKSLLSEGLRTGEIAIAAPSTAMLARCLIALQWIPENIL
AEIGKRPALLHVRDTVLRGVAVRGAAVDN
>MAP0819 hypothetical protein
MPDSDARLASDLSLAVMRLARQLRFRNPSSPVSLSQLSALAMLANEGPMT
PGALAIRERVRPPSMTRVIASLAEMGLVDRTPHPVDGRQVLVSVSEAGAE
LVKANRRARQEWLAKRLSTLDDDERETLRHAADLMLALVDEGP
>MAP0656c hypothetical protein
MAAAPWERFFEPPPDDRLHRWRPDAAALEPVAAPEDDDEPGGCHTGGGVT
VADLIAKVGAPNRPVHRRAAPEETEPIEPEPVEPEPVGSEPPGPASGLPL
ELQQTQVIDDLAYSVDAVSELRELMATDYPNDGDSDQQSADAADTAAGEP
RRRRRPMLMAGRSVAALAAVLALATTGGAWQWSASKNARLNTINALDRNS
SDIRDPGGQYGDEDFLIVGLDSRAGDNANMGAGSTDDADGARSDTVMLVN
IPANRKRVVAVSFPRDLAITPMQCEAWDPSTGKYGPIYDPKTKSWGPKMV
YTETKLNSAFAFGGPKCLVKEIQKLSGLSINHFIAVDFAGFAKMVDALGG
VEVCSSTPLHDYELGTVLEHGGRQVLDGDTALNYVRARQVTTEVNGDYGR
IKRQQLFLSSLLRSLISADTLLDLNKLNNVVNLVINDTVVDNVKSKDLVQ
LGQSLQGMAAGHVTFVTVPTGVTDQNGDEPPRMSDMRALFDAIINDDPLP
EENDQNAQNLSASGSTTGAPKAPTTRKSPTPTPEPQREQVTTTSPSGVTV
RVSNATTQSGLAATATSQLKRDGFNVMTPDDYPSSVNTTTVLFSPGNEEA
AATVASSFANAKVQRVSGYGQVVQVVLGPDFKSVTAPSPSGSTMSLQIER
GSSNAPAKLPEDLTVTNAADTTCE
>MAP0152c hypothetical protein
MVSEYSLTMLSSGRDRLLSAALRLFAAKGYAATSVADIQRESGLAPGSGA
LYKHFGSKRELLEARSRTGSRAS
>MAP2895 hypothetical protein
MVRRHRLLETFLVNELGYAWDEVHDEAEVLEHAVSDRLVARIDAKLGFPQ
RDPHGDPIPATDGQVPTPPARQLWACGDGDTGTVARISDADSEMLRYFTD
VGINLDSRLRVLTRREFAGMISVAIESGDGAETTVDLGSPAARAIWVTA
>MAP0323c hypothetical protein
MVSLESLTPAELAEVHARHQQGYADLQAKKLSLDLTRGKPAPAQLDLSNA
LLSLPGDDYRDSEGTDTRNYGGLHGLPELRAIFGELLGIPVQNLIAGNNS
SLELMHDLVAFSMLYGGVDSQRPWKDEPSVKFLCPVPGYDRHFAITETMG
IEMIPVPMLSDGPDVGLIEELVAADPAIKGMWTVPVFGNPTGVTYSWDTV
LRLVQMRTAAPDFRLFWDNAYAVHTLTHDFIRQVDVLGLAAAAGNPNRPY
VFASTSKITFAGAGVSFFGGSLGNIAWYLQYAGKKSIGPDKVNQLRHLRF
FRDADGVRLHMLRHQQILAPKFALALEILDQRLSDSKIASWTEPKGGYFI
SLDVLPGTAKRTVALAKDAGIAVTEAGASFPYRKDPNDTNIRIAPTFPSL
PDLRDAVDGLATCALLAASESLLARDRV
>MAP3215c hypothetical protein
MTTTDPDDLTARARIRHAALREFGEKGYEGATIRSIAAAAGVSSGLLRHH
FGSKQELRQACDDYLVKTMRDLNAQVQDNAKRGDVHYVSARIPLGQHQDY
ITRALVEGGAGELFDALVSMTEEWLASADEHRAEPADVDAKSRATLITAM
ALAIPLLRQHISRGLGVDISTEQGDLRVAAILVDIYSNPLLTPEQAKSAR
DDLAGRLSAAAGPPGRPGRAAAAGRRPAPR
>MAP2252c hypothetical protein
MTAAAGVPLHRQLFLVLHDEIDRGVIAPGEALPTEQTLCDQFGVSRITVR
RALADLAAQGYIERRQGVGSFVRQHDPADVPTGRSYLQGLRQTQFETEAE
VVELGVRSAPRAVAEALGRSGELLHIVRVRRQRRTREPLMVTNAWLPPEL
APVLTERALRRAPLYELLSANGVAVDRVRHEITAEIAGPRNAQLLDVPIG
AALLRVNRLVYVGDRPHHHVSALLSPSRSRVLLSQSADDMETGDGLRIAH
DVGGPRG
>MAP0649 hypothetical protein
MELLLLTPELHPDPVLPSLSLLAHTVRTAPPEPSSLLEAGTADAVIVDAR
TDLSSARGLCRLLSTAGRSVPVLAVVSEGGLVAVNSDWGLDEILLPTTGP
AEVDARLRLVVGRRGGLADQESAGKVTLGELVIDEGTYTARLRGRPLDLT
YKEFELLKYLAQHAGRVFTRAQLLHEVWGYDFFGGTRTVDVHVRRLRAKL
GPEYEALIGTVRNVGYKAVRPARGRPPVAEADDAESESDSASESDAEDVH
DPLVDPLHTQ
>MAP0351 hypothetical protein
MTPESLRDRQRAQIRADIRRAAFRLFVERGYDAVTTEEIATAAGVSPRTF
FRHVPAKEELLLAPVRYGGAAIVHLLEGRPAGESPDVALINAIITRTRSF
EPADTEEWRQALLVAPGLLDKVTVHRPADKERATKLIADRMRVDPHTDIR
PGLLVQLAFAAADFGFQQWVLQSGRRWPLDRYVAEALKAVKSPHWRRK
>MAP0008c hypothetical protein
MARDDWLVGRDRRSAAAERIYAAAADLIARRGYDGFTIEALAARVHCSPA
TIYRHAGGKATIRDAVVGIQAARIVDTTREAIKDLRGPERVVTATIMALE
RLRSDPLAQLMRSIHAAPVSDWVINSPTVTALAAEMLGPGHDDPLAAQWL
IRVFLALWGWPLKDPAAERALVHRFLGASYRPGGERDSICGA
>MAP1736 hypothetical protein
MANPVGLRERRRRQTSADIRDAAVRLTLERGFDKVTVDEICAEAGISTRT
FFNYFPNKESAIAYGPSDIPPELVADFVAAGPAPYSVVLAELITLAAHHL
RDVPPRREHAANMLELAKTSPAVLAAFLADLERFQNQLTDIIVRRQGMQP
DDEMAPLISALALTAVRSGIEKWASGEASETDDTPMPYVERAAALVNSIF
TK
>MAP1663 hypothetical protein
MNHVDERLFALAEQVQGFMPAEEGRALYDTALRYLDGGTGVEIGTYCGKS
TLFLGAAAQQTGSVLYTVDHHHGSEEHQAGWEYHDASLVDEVTGLFDTLP
TFRRTLDAAGLDDHVVAVVGKSPIVARGWRSPLQLLFIDGGHSETAANQD
FDGWAKWVTPGGALAIHDVFPDPRDGGRPPYHIYCRAIDSGQFREVSATG
SLRVLERIGGRAGEPVQQD
>MAP3635 hypothetical protein
MRSPREKMVVSAALLIRERGAHATAISDVLEHSGAPRGSAYHYFPGGRTQ
LLCEAVDFAGEYVAAVIAGAESGSRLLDTLIDTYREQLRDSDFRAGCPVV
AVAVEAGEQSDAERPVIERAAAAFDRWTDLIAQRFVADGIRRRRAAELAV
LATSALEGAIVLARARRDPAPLDVVHRQLRDLLAAELAGETAAGRKSRHD
R
>MAP3308 hypothetical protein
MSELAKMPARRAVKSADRGRPTDGAAASNRRGNRLPRDERRGQLLVVASD
VFVDRGYHAAGMDEIADRAGVSKPVLYQHFTSKLELYLAVLARHVENLVS
GVQQALSTTKDNRRRLHAAVQAFFDFIEHDSQGYRLIFENDYVTEPEVAA
QLRVATESCIDAVYALISEDSGLDPHRARMIAVGLVGISVDCARYWLDSD
RPISKADAVEGTVAFAWGGLSHVPLTRS
>MAP1186 hypothetical protein
MQPIRCDQGNRPPRAPLLDRSPELRHTGVVKIRTSLDDAAVGAAGAAVPD
GHTRRAIVRLLLESGSITAGEIGDRLGLAAAGVRRHLDALIEAGDAESIA
PAAWQHAGRGRPAKRYRLTSAGRAKLDHAYDDLAAAAMRQLREIGGEEAV
QAFARQRIDAILAGVPGAASAADDDIEAAAERIATALTKAGYVATTTQVG
GPIHGVQICQHHCPVSHVAEEFPELCDAEQQAMAEVLGTHVQRLATIVNG
DCACTTHVPLAQTAKAPSPRRHTTSNKGASL
>MAP4151c hypothetical protein
MASSAEMEGPGSGMPQQRVGRRRSTTPHHITDVAIDLFTARGFAEVSVDD
VAAAAGISRRTLFRYYASKNAIPWGDFDTHLARLRELLDNVESRVPLGDA
LRTALLAFNTFDESETARHRQRMRVILQTAELQAYSMTMYAGWREVIAGF
VARRCECKTTDLAPQTVAWMMLGVALSAYEHWLGDESVPLPVALGNAFDV
VGAGLNGLDR
>MAP1711c hypothetical protein
MTNPRATNEDLTAKARIRNAALDLYAQYGEDRISLRDIASAAGVTLGLVQ
HHFKTKAGVRDAVDQLVVDYFAHALAQVPAEGSARHVAAARDEAVARMLR
DNPPVVNYVRRAVLDPSEDRLHLLDALIELTRREITALRGSGMASTKRPE
STQILGVLIRQMGELLLQPMVDAVWDRVAKPTDARKPRLRITVEN
>MAP1200c hypothetical protein
MPKVSEDHLAARRRQILDGARRCFAEFGYDKATVRRLEQAIGMSRGAIFH
HFRDKDALFFALAHEDAERMAEVASRAGLIQVMRDMLAAPDQFDWLATRL
EIARKLRNDPAFSRGWAERSAELAAATTDRLRRQTEAHRVRDDVPGDVLQ
CYLELVLDGLVARLASGEDPQRLSAVLDLVENSVRRNDSRPG
>MAP0985 hypothetical protein
MRAPRARMTGSERRQQLIGIARSLFAERGYEGTSIEEIALRANVSKPVVY
EHFGGKEGLYAVVVDREMSALLDGITSSLTNNRSRVRVERVALALLTYVE
ERTDGFRIMIRDSPAAISSGTYSSLLNDAVSQVSSILAGDFARRGLDPDL
APLYAQALVGSVSMTAQWWLDTREPKKEVVAAHLVNLMWNGLTHLEADPR
LQDE
>MAP0617 hypothetical protein
MTRHSYHHGDLPAVILAEAARLVAERGAERVSLRELARCAGVSHAAPAHH
FTDRRGLFTALATQGFELLTQALADARGDFADAALAYVQFALDHPGHYQV
MFNRSLLDASDGGLAAAEAAAGAELSRGVATLRDPHARADPRGAELAAWS
LVHGFATLWLNDAVDAAVRAADPMDTVRRIATMLFAG
>MAP3437c hypothetical protein
MPSLKLTVIRLIRHSRVRKHRSNCRAGKRDSLRRGYAPCHHQVSSMSTSL
SSPLCSFSSRATNTTRVTTAIRALAPTNAGVNRNVNRAASLGDRVMDHRS
TVLAPSSTPTQPANARPGRRLHTGADEWAPRPRPPTSAAVSRIAVLYACP
IQPRRRLLHEILRFGGKTDELIGFARALSVQTATLPGMSSHSPVSAAALA
SRLRMIMGDRKLSRTRLSHETGISRPSLSSKLDGKVEFTYSELLTIAQAV
DVPLDKLLAGDDDERPFRLSDLRPRPDRPL
>MAP2315 hypothetical protein
MTTSASESRAGGAQPPNRRSQLKSDRRLQLLSAAERLFAERGFLAVRLED
IGASAGVSGPAIYRHFPNKESLLVELLVGISTRLLAGARQVRARSADAAA
ALDGLIDFHLDFALNEPDLIRIQDRDLAYLPKPAERQVRKAQRQYVEVWV
GVLRELNPELAEADARLTAHAVFGLLNSTPHSMKSQGSGRGKPARAARSR
AIMRAMTVAALAAGNQCP
>MAP4217 hypothetical protein
MAAHNEPVGPRPLRAVPEGGYPDWEAVYADNAAWVYRTLFARVGNRADAE
DLTAEVFLAALRPLRLTASAAEVRAYLRATARTVLAAHWRETLGREITSI
EDIEQPPEHPDAISTAPQRVARVLDSLPDRYRRILESRFLQGNSIKEAAA
ELGISVANAKVLQHRALRLAAQVNEGGRP
>MAP3156c hypothetical protein
MSSPSRWAGVPLKDRRAERRALLVEAAYRLFGSGGEAAVSVRSVCRECGL
NTRYFYESFRDTDDLLGAVYDRVSEQLEEVIAAAIEQAGDSLRARTRAGI
AAVLGFSSADPRRGRVLFTDARTNPVLAARRRATQDMLRQGVLSEGWRLN
PRSDPVAAEVAAAMYTGAMAELAQQWLAGHLGGDLDAVVDYASKLVLR
>MAP3945 hypothetical protein
MRYARPVAQLTFQRARTEEKKRQRAEALVEAARSLALETGVASVTLTAVA
RRAGIHYSAVRRYFTSHKEVLLHLSAEGWVRWSNTVSDKLAQPGAATPSR
IAETLADALADDPLFCDLLANLHLHLEHEVDAERVVEVKRISTAATLSLA
DSIQRALPELGRSGSLDILLAAYSLAATLWQVANPPERLTDAYAEEPEVV
PPEWNLDFASALTRLLTATCIGLIPQSA
>MAP1541 hypothetical protein
MSAPDSSALAGMSIGAVLELLRPDFPDVTISKIRFLEAEGLVTPQRAASG
YRRFTAYDCARLRFILTAQRDHYLPLKVIRAQLDAQPDGELPAFNTPYTA
PRLVSVAGTDAGANAGLGSDTSAVAPAHVRLSREELLRRAGVDDELLTAL
LKAGIITAGPAGFFDEHAVVILQCSRALSDYGVEPRHLRAFRSAADRQSD
LIAQIAGPVGKAGKAGARDRADDLAREVAALAITLHTSLIKSAVRDVLDR
>MAP3027 hypothetical protein
MRQHSGIGVLDKAVGVLHTVAQSPCGLAELCERTELPRATAYRLAAALEV
HRLLARGDDGRWRLGPAVTELAAHVNDPLLAAGSAVLPLLRETTGESVQL
YRREGTDRVCVAALEPAAGLRDTVPVGARLPMTAGSGAKVLLAHSDAATQ
KAVLANAKFTERTLAEVRRRGWAQSVAEREPGVASVSAPVRDSRGAVIAA
ISVSGPIDRIGRRPGARWAADLVSAAEALTRRL
>MAP3798c hypothetical protein
MSIAAGQQPGRPAGPTRRTTAPRKRGDDTRARIIDETVRCIIEEGFAAAT
AKHVAERAGVTWGVIQYHFGDRNGLLMAAVDDGVARLVESLSSADVSELP
LRQRVEVVIDTAWSCYSSPTSMAAFEILHATRGALGDSSRRHLLDMNAAI
GQLGRLITTDPANAGVAEVIWATLRGVVLAQMVIGTTVDWSLERRALIDM
VTRVLQ
>MAP3548c hypothetical protein
MSRICWPSALNCGSARSSRRTVTPTYVTLSFVMVVDFGRPRDPRIDAAVL
RATVELLAETGYPGLLVSAIAQRAGTSKPAIYRRWPSKAHLVHEAVFPVG
ADTAIPDTGSTADDLREMVRRTMVFLTTPAARAALPGLIGEMAADPSLHS
ALLERFAGAIGGGLADWLAAAAARGEARPDVTAAELAETIAGVTLVALLT
RPTELDDAWVDRTTRLLLKGISA
>MAP0834c hypothetical protein
MDTAASSPRVLVVDDDSDVLASLERGLRLSGFEVSTAVDGAEALRSATET
RPDAIVLDINMPVLDGVSVVTALRAMDNDVPVCVLSARSSVDDRVAGLEA
GADDYLVKPFVLAELVARVKALLRRRGATATSSSETITVGPLEVDIPGRR
ARVNGVDVDLTKREFDLLAVLAEHKTAVLSRAQLLELVWGYDFAADTNVV
DVFIGYLRRKLEANGGPRLLHTVRGVGFVLRMQ
>MAP4123 hypothetical protein
MLTLGLDIGGTKIAAALVDSVGTLVHTAVRPTPNPAPADDVWDVVHALIA
EVVRAAGAPIAAVGIASAGPVDLPSGSVSPINIAGWHRFPLRDKVAAAVP
ATPVVLGGDGLCMALGEQWLGAGRGARFLLGMVVSTGVGGGLVLDGAPYP
GRTGNAGHVGHVVVELDGRPCTCGGHGCVETVASGPSMVRWARENGWSAA
PGAGARDLAAAAASDPLAQKAFHRSADALAAMIASVGAVCDLDVVVIGGG
VAQSGPLLFDPLRERLAHYAGLDFLSGLTVVPGELGGNAGLIGAARLATL
ARPAGS
>MAP0628c hypothetical protein
MVPDQLDLDAADLRISRGSVPASTQLAEALKAAIVKQRLPRGARLPTERE
LVDRTRVSRATVRAAVGLLERQGWLVRRQGLGTFVANPVKQELGPGVRTI
TEVLAASGITPNVDVLSHRVEDAPQRIAETLHLTKVLCVHRRFRDGEQPL
ALMIAYLPPGLGKAVEPLLKSATDTQTGRAMESTYTMWERRLDTPIARAS
YEIHAAGASVEVAGALNLPLGSPVLVLERTSYGHDDTPLEVVEFHYRPER
YRFSVTLPRTVPGRGAGIVERSR
>MAP0799c hypothetical protein
MSDGPLIVQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITP
LALWNARAAGHDAEQVVDALVSFSRYAVPQPLLVDIVDTMARYGRLQLVK
NPAHGLTLVSLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQ
MLLKIGWPAEDLAGYVDGEAHPISLTEDGWQLRDYQQMAADSFWSGGSGV
VVLPCGAGKTLVGAAAMAKAGATTLILVTNIVAARQWKRELVARTSLTED
EIGEYSGERKEIRPVTISTYQMITRRTRGEYRHLELFDSRDWGLIIYDEV
HLLPAPVFRMTADLQSKRRLGLTATLVREDGREGDVFSLIGPKRYDAPWK
DIEAQGWIAPAECVEVRVTMTDNERMMYATAEPEERYRLCSTVHTKIAVV
KSILDKHPGEQTLVIGAYLDQLDELGEQLGAPVIQGSTRTKEREELFDAF
RRGEVNTLVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRLLRPK
SDGGGAVFYSVVARDSLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI
>MAP0946c hypothetical protein
MTRAPTGSEHADRFTLLRPLLFTIAYEMLGSATEADDVLQDSYLRWSTVD
LATVRDTKSYLAQLVTRQALNALRAGARRREEYVGPWLPEPLLLDEQDPS
TDVVLAESISMAMLVLLETLSPDERAVFVLREVFGFDYDEIAEAVGKPAS
TVRQVAHRAREHVRARRKRHPGAGQAIDPKRNAELTAQFLATAASGDVEA
LMAMLAPDATWTADSGGVVSAARRPVVGAEKVARAITGLFRKAAEYATLR
VDTVTCNGAPAVLLYLGDRLEGVITVEIAADKITNFYVMRNPHKLAALAT
ARDVSRG
>MAP3336c hypothetical protein
MLQGPLADRDAWSAVGECAIEKTMAVVGTKSAMLIMREAYYGTTRFDDFA
RRVGITKAATSARLAELVELGLLKRRPYREPGQRSRDEYVLTQAGIDFMP
VVWAMFEWGRRHLPGRHRLQLTHVGCGAEVGVEIRCAEGHLVPPDELGMR
LAKNSG
>MAP1349c hypothetical protein
MGPTRLFPYDPPPAQRSAKERIRDAALSCFAIRGVAATSLRTVAEAAEVS
VGLVQHHFGTKLALVAAVDQYVLAQVGEALEATALTDAPVDGLDDAGQRL
TSLMAERPDVMSYLGRALAEGGTLGAVIFDGLLAISAAQRDEFVRQGNTR
PDLDPEWAALNPLILRVGAIILHPYIELRLRKSFFAEPELRRWDAAVTSL
IRHGLFLDAPEARERGRGE
>MAP3267c hypothetical protein
MADRRGNAGAPASDQGVYGISVAAELSGIAVQSLRLYERHGLVTPARSQG
GTRRYSADDLARLKRISKLVDAGVNLAGVARILALEDDNATLSAANTDLR
STNRALREAAKTADGDAATG
>MAP0659c hypothetical protein
MQTGQRRGRWTGVPLESRLALRRDNLINAGVQLLGGSGGPALTVRAVCRK
AALTERYFYESFTDRDEFVRAVYDDVCTRAMNTLTSATTPREAVERFVEL
MVDDPVRGRVLLLAPAVEPVLTRSGAEWMPNFIDLLQRKLSRIGDAVLQK
MIATSLVGGLSSLFTAYLHGQLGATRKQFIDYCTNMLYITAAPYVSAGEL
GKPQ
>MAP0661c hypothetical protein
MRMHADSSAGRLPDDQVRLVVEVFRMLADATRVQVLWSLTDREMSVNELA
EHVGKPAPSVSQHLAKLRMARLVRTRREGTTIFYSLENEHVRQLVIDAVF
NAEHAGPGVPGHHRGEGGLKSVAASSRHRPRGAAR
>MAP3542 hypothetical protein
MTPQVRPADRADIRALSATLARAFYDDPVMVWLFPDRRKRIARLSRVFAT
MTRHHHLAGGGVEVACAGAGIGAAALWDPPHRWRETPRAQLAMTPTYLRV
FGLRSMRGRAVQELMKSVHPEEPHWYLAVIGSDPGVRGQGFGQALMRSRL
DRCDAEHCPAYLESTKPENVPYYQRFGFTVTREIVLPDGGPSMWAMWRPP
R
>MAP1131 hypothetical protein
MAADHTTVDESLDAITDALLTASRLLMAISARSVGQVDETITIPQFRTLV
ILSNRGPVNLATLAGLLGVQPSATGRMVDRLVAAGLIDRLPHPTSRRELL
AALTTRGRDVVHQVTAHRRAEIAAIVAKMPPAERHGLVRALTAFTAAGGE
PDVHLDAEIDL
>MAP4047 hypothetical protein
MVAPEGCGETLENAEGPPHSQPSRNGGSSALVGRSDARRNRQRLLEAATA
AFTAHGASVSLESIARDAGVGIGTLYRHFPNREALVEAVYRAELAEVAAA
AAQLLQRHPPKTALRRWMDRYANFVATKRGMAESLQAIFESGALVPSQTR
DSIVGAVETLLRAGADDASLRADVQADDVVSSLIGIFLVSGSPEQTGRML
DLLVAGVTR
>MAP0050c hypothetical protein
MAPLYASDIGLDSRFMADSESNAAEVAELAEGLHRALSKLFAMLRRGDPS
GAAAGDLTLAQLSILVTLLDQGPIRMTDLAAHERVRTPTTTVAIRRLEKV
GLVKRSRDPSDLRAVLVDITPRGRAVHGESLANRRAALAAMLSQLPESDL
DTLMKALAPLERLASGDPTSGPAGVPPARKEA
>MAP0968 hypothetical protein
MPHHRPHGGAELSSTMTTHSTDGRADATRQQILRAASHQFARRPYHDVGL
DDILAEAELTKGAMYFHFKSKHALAVAIIESQTASGAVAVQDLLTRGLSG
LETLIDFSYLIAVKDIKTDLVRSGLNLMESVGLSDGLQEKLFDRWIKALA
RVAEQAKGEGDINEHCDPQDIGRLMVSLHMGLRKTSNLDDPERFLRDLEK
CWSLLLTGILQPDRTEYFQQFLRRRAALAINTSSTDDDES
>MAP0703 hypothetical protein
MTTIGRPVRTERASSTQEAILVAAERLYAEHGMFAVSNRQVSEAAGQGNN
AAVGYHFGTKADLVRAIEHKHRGPVEQLREQMVAELLESGAGGGRDAELR
SWVACSVRPLTDHLEELGNPTWYARFAAQAMTDPAYYNIIVKGALSSPSL
VQVVEGINRCLPDLPAEVHFERNIMARNLLMHTCADRERALAAGTSTSHR
SWRAAASGLIDAIVGLWLAPVTPYE
>MAP2665 hypothetical protein
MTELAVLQAIRLKGRVSRADLAATLGTDPDEIAGTVERLSAAGLVTGDAT
LWITPAGSARLTALLAEERRGIDAAAMAAVYDDFRAINADFKRLVTDWQL
KDGAPNRHDDAEYDDAVLARLDDAHARVTPVIEAAAAQLPRLNRYAAKLA
AALDKVRAGDTAWLTRPLIDSYHTVWFELHEELIVAVGLTRQEAARSGDA
Q
>MAP1124 hypothetical protein
MTNSQSDAALAAVLDRFDPSAGGPGAYDTPLGITNPPIDELLDRVSSKYA
LVIYAAKRARQINDYYNQLGEGILEYVGPLVEPGLQEKPLSIALREIHGD
LLEHTEGE
>MAP1477c hypothetical protein
MRSHGWAGNTPASDAEAIERILDAADRIIAERGSALRIADVARALGVTRQ
TVYRYFPGTQALLVASAMRSADGFLDRMAAHLDGVTDPVVAITEGMAFAV
EELACDHQVEFVLNQRHRGGQKVSIISDTALAFGRSMLHRYDIDWESYGF
DEAGLEELNEFSLRVLHSFLTDPGRPPRTGADLRRYLTRWIGPAIAYPQL
LRAMDALGAAEPQQRPRRRASKAS
>MAP2262 hypothetical protein
MPSVTRTPPGRRSPGREQRREQLERRLLDATERLMRDGASFTELSVDRLA
GAAGISRASFYIYFEDKGHLLRRLASRVFGELTDGARRWWSVAGRHDPDD
VRAAMTRIIATYRRHQAVLVALNEMSGYDPLTAQTYREILTGISGQLTRV
IEDGQADGSIRPRLAAATTASALTWMVERACQQNLPAQLPAYDAELADTL
AEIVWGALYLKPASGI
>MAP1634 hypothetical protein
MDLHQLECFIAVAEEGTFTAAAQRIHLAQSGVSAHIKALEREIGQQLFER
RPRTVRLTAAGNALLPYARAALDALAAGRASIDGLTGLLHGRLAIGTITS
ISPRSIDLPELLAAFHHEHPGVDLSLVEDTAAMLNRHISNGALDVAFTSL
TDEAVAGVRMRELHREPVIAIFLPSDPLSPCRKLTLADVADRPLITLPEG
SGLRWQLNRALRRAGVQAHIAFEAGDPDVLVALVAKGLGVGLVPQSALAQ
SDHVIGLPVSDHPPGRLGIIWPEGQAASPAARAFVEHATTATTKLRRPAE
LRQDR
>MAP2741c hypothetical protein
MTTPDAGSSGASARERVLTAAYELFSRRGIRAVGTDEVIARAGVARATLY
RHFATKNDLVLAVLQRREELWTYGLIEEQSRLRGATPEEQLLAIFDVMDE
WFRQGDRYEGCSFINVLLELGPEHPAGKACIAHIDRVRDIVRRRAVAAGL
TDVEDFASSWHILMKGAIVLAAVGDLDAARRARRMARTLIEEHRPPLVSD
TGEAV
>MAP4304 hypothetical protein
MADYRGGSGYVKRARSPASGRRLRIRRSEDHQMPTDPNGPAAPRRRSEKS
RTAIVTATRELLLERGFDGLTIEAVAARAGVGKQTIYRWWPTRPALVADV
MLEDADRLLASVEHSGDLAADLVGWVGKLVTSLTTPRGSAMLRTLTVACM
EHEDTAVKLRAGFSAPLHDSVRARLLAEGVDAATAESAADAIVGGVVYPI
LSGAQRYSRRRAEHTTRLIVAALTAG
>MAP1391c hypothetical protein
MTTTGTPRRRRGRPAGSSGSRERILASARELFARNGIRNTSIRAVAAAAG
VDSALVHHYFGTKEKLFAAAVQIPIDPMQVIGPLREVPVDDLGYALPSML
LPLWDSEVGAAFIATLRSILAGEEISLFRTFIQDVIGVEVGPRVDNPAGS
GVIRIQFVASQLVGVVMARYILELEPFASLPPEQIAATIAPNLQRYLTGD
LPDGLAP
>MAP1002c hypothetical protein
MTVMSGQSAAQRPRQAILGQLPRIYRADGSPIRVLLVDDEPALTNLVKMA
LHYEGWVVDIAHNGREAMAKFDRAAPDVLVLDIMLPDVDGLRILERVRQS
DAYTPTLFLTARDSVMDRVTGLTAGADDYMTKPFSLEELVARLRGLLRRA
SQQPAPTAETLKVGDLVVDTASREVTRGDTPVSLSSTEFELLRFLMRNPR
RALSRTEILDRVWNYDFAGRTSIVDLYISYLRKKIDSGREPMIHTVRGVG
YMLRPAE
>MAP0354c hypothetical protein
MSNPFTPPDGPFGARPGFGFGFGPGRVDRRALHEARRQARRDFREHLREH
AGGHDGPLGFGPGFGPGFGPGFGPGFGFGPGGPRGAWRRGGPGRGKRGDV
RAAVLALLAERPMHGYEMIQLIAERSNGLWKPSPGSVYPTLQLLADEGLI
TATETDGSKKLFELTDDGRAAAEKIETPPWDEIAEGADPGHMNLRAAIGQ
LFGAVGQSAHTASPEQQQRIVDILNNARREIYGILGED
>MAP1546c hypothetical protein
MSPDTASRPRSAGGQMVDPAMLLGLLPDSLATLDPTARTIVTAARTCFTE
RGFSNTTMQDIADAAGVGVATVYRRFRHKRNLVRLTIIDESVRLSTLISD
VAGRAASAEEGIAEVFAAFVHEASAPKLLTRSIRESPAAGELSAFLTDEH
LIAISRTYIASWLRRWQERGELADDLDTEVVGEMIGRLIISLIETPKSVI
PVYNLAKARDFARRYLVPLVLPAPAALPG
>MAP1978 hypothetical protein
MQANTDIDVQVRRRLRELRVQRGLTLQEVGARAGIDVSTLSRLESGKRRL
ALDHLPRLARALSVSTDELLQPAQAPDPRVRGSAHTHHGVTYWPLTRQGP
AGGLHAFKIRVSARRRTPPAELPVHEGQDWMYVLAGRMRLILGDRDFTID
PGEAVEFSTWTPHWFGVVDGPVEAIVIFGPHGERLHLHS
>MAP1376c hypothetical protein
MDTHRLKYFLRIAEQGSITGAAASLGIAQPALSRQIRLLEEDLGVALFRR
TRRGVQLTEEGERLRASTAAPLRQLELAMQYAGSPIARLARGVRLGILDT
TVDVFAAHLLSSLAAAFPKVTFSVETGSTDQLVEAMLKGAVDVAVINPVP
DDRIFYRDLLTEDLVVVGAATSDLDSGQMMPFTELVELPLVVPRSHTGIG
NVIENAALRRKVKISWRIATDSLALTKRLVAAGLVYGVLPLSACLNEIDS
NQLRYTPLTEPPLTHRVGVAATSNLELPREVTAKVGNILREETAHLIKTG
RWPARLLSPQPWDPAVT
>MAP2003c hypothetical protein
MTGCPRRAVWPRRDPPRRSISSAAAAASDASCVTVLQMLSIRNDKPDAVL
DTGERILAAAASCVVDFGVDRVTLAEIARRAGVSRPTVYRRWPDTRSIVA
ALLTRHVTDVMRAAPLLGDDRESLVRQIVTVADLLRQDRLVMSVLHSELA
PIYITERLGTSQHMLIDALAARLRVAQRNGSVRPGDPVQMATMVLLIAQS
TIQSAQIVEPLLDAEALAKELAYSLNGYLS
>MAP3108c hypothetical protein
MVQGRGVTSHPANPEPGARRRGDKQRQAILAVVRELLQEKPFAELSVSTI
SLRAGVARSGFYFYFDSKYAVLAQLMAEATEELEELTQYFAPRQPGESPE
QFAKRMVGSAAAVYAHNDPVMTACNEARHTDVEIRNLMDQQFEVVLAQIV
GIVEAEMKAGTARPISDDLPTLIRTLAGTTALMLTGDPVLTGPDSDRDRR
VAVLEQLWLHALWAGRP
>MAP0098c hypothetical protein
MTQPPAGLASSAPESRNNEADGESRPITRAAVLASALEIIDRDGVDGLSM
RRLGEAVGRDPMALYRHVPNKAAVLDGVVEMVFERLSLDTTTPDWAAALR
KLGHEFRDLARAHPNVVPLLVTRPLATPLGMRPPGILRHLEEVLTLLIGA
GFTGEDALHVYRALFGFLYGHVLTELQEIVERPEETDHVLRLGLHRLPID
QFGHLRELAPVWASYDPLAELDRGLDILLSGLAVRLTIPGAAPADTTEPQ
QGDHERT
>MAP4081 hypothetical protein
MALQPVIRRSVPEAVFDQIATDVLSGELPAGSALPSERRLAEVFGVSRPA
VREALKRLSAASLVEVRQGGVTTVRDFRRHAGLDLLPQLVFRDGELDAGV
FRSILETRLRIGPKVAELAAEQHGPELAALLDDSLRRLEASRSAVEWHRG
TLEFWDHMVDSSGSIVFRLMYNPFRAAYERAVGVLAAAIPAEINRTDAYR
ALAQAICAGDADEAGRGAREVLELANATMIGALERR
>MAP1705c hypothetical protein
MGVEAADTATALPAARPTPLILGGLHLRASHVRQRRGQAGVQLAVHPLAS
RALFGLPSAEIPVADYDATAVLGRDIVDLHHRVAEAHRWPDVFALVAQYL
IDARRRRDGATVRPEVLHAWHLLQRSRGLMPVTALADAVGVTTRHLATLF
RREVGHSPKTVAMLMRFQHATGLIAESARRHGRVDLARVAADTGYSDQAH
LSREFVRFAGVPPQRWLAEEFRNIQDGGHSLGSQWEHDCFESDRLVDTAS
P
>MAP2855c 35kd_ag, 35kd_ag
MANPFVKAWKYLMAKFNATIDERADPKVQIQQAIEEAQRTHQALTQQAAQ
VIGNQRQLEMRLNRQLADVEKLQVNVRQALTLADQATAAGDTAKATEYNN
AAEAFAAQLVTAEQSVEDLKTLHDQALNAAAQAKKAVEQNAMVLQQKIAE
RTKLLSQLEQAKMQEQVSASLQSMSELAAPGNVPSLDEVRDKIERRYANA
IGAAELAQGSVQGRMLEVEQAGVQMAGHSRLEQIRASMRDEALPTGGTPA
AGGTQAAPAPGQGAGDAVSEKPLGQ
>MAP0085c LysR, LysR
MLDIAPLRSLVAVADCGGFHRAAAALHLSQSAVSQHLRRIEAVVGEPIVQ
RSGRGVVFTEVGQKALRHARTILAAHDTALDDLGATENKVLTIGATEHGA
DVMLAGLSGALRDRLPDLRVRFRLDRNVSLADSIDRGLVDVAIMLDGAGL
DRANASGLVRLQWVSARTLTIAPKGPLPLVIFSEPCTLREPAFAILDSHR
IDYEIAAECGDLSGLYAAVRSGLGVALLPMIGKFPDGLGPAEGLPPANNA
SILVRGRTGIDAALLSTVDTAVRDLLATAT
>MAP0483 absR2, AbsR2
MMASLALIMAMGALGQVDDLTAVAEQALRRATTSFEASHMRFWFGAVYGR
ACRLTGRIDEFVSTARRLADSARDVPGLAYANLALLLGNAELARGAAAEA
ARLVQEALAGAQIHEVTSGLRPASYFALAEAHGRLGHPAQANEAIAGARS
CVPPDFLFMHTGLALATGWALAAGGQLREAVATAQAAARLARDRGQPTHE
LACIQAAAQWGDAAGAARARALADALSLPLADAIACHAEALRAGNGEALL
TVAAAYRAIGDAAAAADAAAQASAAFVEGQQHQRGRYAAALAGELAEECG
GLCTPALRTPAGLKLSGRQRDVIELAVAGLSNRQIAERLVMSVRTVEGHI
YRACQRVGAQSRDELATIIRTRPAGRES
>MAP1366 argR, ArgR
MTRSKSTTETTRAARQARIVAILSSAQVRSQSELAALLADEGIEVTQATL
SRDLEELGAVKLRGADGGVGVYMVPEDGSPVRGVSGGTARLSRLLSELLV
SADASANLAVLRTPPGAADYLASAIDRAALPYVVGTIAGDDTVFVAARDP
MTGAELADTLEKLT
>MAP0969 cprA, CprA
MARQVRSEATRQKILDSAIEVFGEVGYAAAGWSTIIERTGMTKGALYHHF
DSKESLASNIIEEGSDRLLSAFRNVCGSSSPGLENLVHGTFTIVEVLRSD
KMVRAAAQLATALSGFNGAASRFYANLVLETAQEARRAIKEGDLRDDIDP
DVLSASLMGTIFGARLIASTISGHGRIGDIIGDPTARLHQIWSLLLPGIV
SQASLPYFEQFLRREGMRHATANPAAQPEGE
>MAP0579c cpsA, CpsA
MARSGGAHRRHRAVRQPSRFRKGLTRTLGALVSVAAIGLTGAGYYVAHGA
LGGITVSNALQPDDPRSSGDNMNILLIGLDSRKDQDGNDLPYSILKHLHA
GDSDDGGYNTNTLILVHVSADNKVVAFSIPRDDWVPFSGVPGYNHIKIKE
AYGLTKQYVAQKLANQGTSSQKELETKGREAGRAATLRAVRNLTGVPIDY
FAEVNLAGFYDLAQTLGGVEVCLNHPVYDSYSGADFPAGRQRLDASEALS
FVRQRHGLDNGDLDRTHRQQAFISSVMQELQAAGTFTNIDKLKNLMAVAR
KDVVLSAGWTDDMIQRLGGLAGGNIEFRTLPVVRYDNIDGQDVNIVDPAA
IKAEVAAAIGANAPTSAPTSTSVAPDPSTVVDVVNSGSMSGLASEVSHAL
KKRGYTTGQVRDRESGDPAATTIEYGAGAQNDARNLANLLSLDAPNQPSP
SIAPGHIRLTVDTNFTMPTTDDTQLDDTTSTSSTTTSSKAKTYYYNGTTT
TYPTPDQGKPIDGGGVPCVN
>MAP0423 cspA, CspA_1
MPQGTVKWFNAEKGFGFIAPEDGSADVFVHYTEIQGTGFRTLEENQKVEF
EIGHSPKGPQATGVRSL
>MAP0669 cspA, CspA_2
MAQGTVKWFNGEKGFGFITPDDGTKDLFVHYSEIQGSGYRSLDENQRVQF
DVEQGAKGPQAVGVSTV
>MAP0810 cspB, CspB
MRPVPTGKVKWYDADKGFGFLSQEDGEDVYVRSSALPAGVEGLKAGQRVE
FGIASGRRGPQALSLKLIEPPPSLTKARREGPAEHKHSPDELHGMVEDMI
TLLESTVQPELRKGRYPDRKTARRVSEVVRAVARELDA
>MAP2521c deaD, DeaD
MTLPDSSTEAASPTFADLQIHPSVLRAIADVGYETPTGIQAATIPALMAG
SDVVGLAQTGTGKTAAFAIPILSKIDAASTATQALVLAPTRELALQVAEA
FSRYGAHLPKINVLPIYGGSSYAVQLAGLKRGAHVVVGTPGRVIDHLERG
TLDLSHVDYLVLDEADEMLTMGFAEEVDRILSETPEYKQVALFSATMPPA
IRKLTAKYLHDPLEVSTKAKTTTAENISQRYIQVAGPRKMDALTRVLEVE
PFEAMIVFVRTKQATEEVAERLRARGFSAAAINGDIPQGQRERTVAALKD
GGIDILVATDVAARGLDVERISHVLNYDIPHDTESYVHRIGRTGRAGRSG
TALLFVSPRERHLLKAIEKATRQPLTEAELPTVEDVNAQRVAKFADSITA
ALGAPGIDLFRKLVQDYEREHDVPMADIAAALAVQSRDGEEFLMAPEPPR
ERRERHTERRERTEKPRSTRPLATYRIAVGKRHKIGPGAIVGAIANEGGL
HRSDFGHIAIGPGFSLVELPAKLPKSTLKRLEQTRISGVLINLQPDRAAA
KARGRDGGKPRRKYGG
>MAP2431 dinG, DinG
MPELLATAVAALGGSEREGQQQMAAAVAQAFDTGRHLVVQAGTGTGKSLA
YLVPAIVHALRDDSPVVVSTATIALQRQLVDRDLPRLIDSLAAALPRRPQ
FALLKGRRNYLCLNKIHNGGPADGEEAADRPQEELFNPMAVSALGRDVQR
LTEWASSTDSGDRDDLKPSVPDRSWSQVSVSARECLGVARCPFGAECFSE
RARSRAGQADVVVTNHALLAIDAVSDSAILPEHALLVIDEAHELVDRVTA
VATAELTSAALGVAARRIGRLVSPELVQRLEATTATFAAAIHDGTPGRID
RLDDELATYLAALRDAASAARSAIDTTSDPKAASARAEAVAALSEISDTA
SRVLASFGPAIPDRTDVVWLDHEDNRGAMRPVLRVAPLSVADLLRDRVFS
RSTVVLTSATLTIGGSFDAMAAAWGLKGPDGDDPPWRGLDVGSPFQHAKA
GILYVAAHLPPPGRDGVGSAEQLTEIAELITAADGRTLGLFSSMRAARAA
AEAMRDRLSTPVLCQGDDSTSALVEQFSAEPQTSLFGTLSLWQGVDVPGP
SLSLVLIDRIPFPRPDDPLLGARQRAVAARGGNGFMAVAASHAALLLAQG
SGRLLRRVSDRGVVAVLDSRMATAGYGGYLRASLPPFWQTTNGAQVRAAL
QRLRTAATASGPG
>MAP3723c fadD27, FadD27
MSDTSSARPYRGVEAAERLATRRNRLLAAGLDLLGDQRPDISAVTVRGVC
RRAGLAARYFYESFTDKDEFVSCVFDWVVAELAATTQAAVATVPADEQTR
AGIANIVRTITDDARVGRLLFSTQLADPVVVRKRAESSALFAMLSGQHVG
NALQVPANDRIKAAAHFVVGGVGQTISAWLAGDVRLEPDELVDHLAALLD
ELAEPNLYRLTETRAEA
>MAP1027c greA, GreA
MTDTSVTWLTQESHDRLKAELDQLIANRPVIAAEINDRREEGDLRENGGY
HAAREEQGQQEARIRQLQDLLNNAKVGEAPKQSGVALPGSVVKVYYNGDK
SDTETFLIATRQEGVNDGKLEVYSPNSPLGGALIDAKVGETRSYTVPNGS
TVEVTLVSAEPYHS
>MAP2163c hrcA, HrcA
MGSADERRFEVLRAIVADFVATKEPIGSKTLVERHNLGVSSATVRNDMAV
LEAEGYITQPHTSSGRVPTEKGYREFVDRLDDVKPLSAAERRAIQNFLES
GVDLDDVLRRAVRLLAQLTRQVAIVQYPTLSSSTVRHLEVIALTPARLLM
VVITDSGRVDQRIVELGDVIDDHELSRLREMLGQALVGKKLSAASVAVAD
LAEQLRSPDGLGDAVGRSATVLLESLVEHSEERLLMGGTANLTRNAADFG
GSLRSILEALEEQVVVLRLLAAQQEAGKVTVRIGHETAAEQMVGTSMVTT
AYGTSDTVYGGMGVLGPTRMDYPGTIASVAAVAMYIGEVLGAR
>MAP3843 hspR, HspR
MAKNRKGHESRTFLISVAAELAGMHAQTLRTYDRLGLVSPQRTSGGGRWY
SEHDVDLLREVQRLSQDEGVNLAGIKRIIELTNQVEALQARVKELTEELA
QVRAGQRRDLAVLPKSTALVVWQPRKGGTRT
>MAP0995c kdpE, KdpE
MTRVLVIDDEPQILRALRINLSVRGYEVVTASTGAGALRAAAEHKPDVVI
LDLGLPDISGIDVLAGLRGWLTAPVIVLSARTDSSDKVEALDAGADDYVT
KPFGMDEFLARLRAAVRRNTAASEMEQPVVETESFTVDLAAKKVTKNGSE
VHLTPTEWGMLEVLVRNRGKLVGREELLKEVWGPAYATETHYLRVYLAQL
RRKLENDPSHPKHLLTESGMGYRFEA
>MAP2836 lexA, LexA
MHAVDPSLTERQRTILNVIRSSVTSRGYPPSIREIGDAVGLTSTSSVAHQ
LRTLERKGYLRRDPNRPRAVDVRGVDDDVAAPATEVAGSDALPEPTFVPV
LGRIAAGGPILAEEAVEDVFPLPRELVGDGTLFLLKVVGDSMVEAAICDG
DWVVVRQQHVADNADIVAAMIDGEATVKTFKRAGGQVWLMPHNPAFDPIP
GNDATVLGKVVTVIRKV
>MAP0987 mfd, Mfd
MTAPGAARPETPIAGLVELALTAPTFQQLIDTAAASPADLSLVGPASTRL
FVASALARLGPLLVVTATGREADDLTAELRGVVGDAVAVFPSWETLPHER
LSPGVDTVGARLTVLRRLAHPDDARLGPPLQVVVTAVRSLLQPMTPQLGL
VEPVTLSVGQEIEFEHVIARLVELAYSRVDMVGRRGEFAVRGGILDVFPP
TAEHPVRVEFWGDEVSEMRMFSVADQRSIPEIAVDTVISVPCRELLLTED
VRARAAELAAQHPASEPAITGSVSDMLAKIADGIAVDGMEALLPVLRPGK
QVLLTDQLADRTPVLLCDPEKIRTRAADLIKTGREFLEASWSVAALGTLE
NQAPIDVEQLGGSGFAELDEVRAAAVRGGHPWWTLSQLSDESAVELDVRA
APSARGHQHDIDGIFAMLRAHVSTGGHAAVVAPGTGTAHRVVERLAECDT
PAAMLESGAAPRAGVVGVLKGPLHDGIVIPGANLVVITETDLTGSRVAAV
EGKRLAAKRRNTVDPLALTAGDLVVHDQHGIGRFVEMTERTVGGARREYL
VLEYASSKRGGGSDKLYVPMDSLDQLSRYVGGQAPALSKLGGSDWANTKT
KARRAVREIAGELVSLYAKRQASPGHAFSPDTPWQAEMEDAFGYTETVDQ
LTAITEVKSDMEKPIPMDRVICGDVGYGKTEIAVRAAFKAVQDGKQVAVL
VPTTLLADQHLQTFTDRMAGFPVTVKGLSRFTDAAESRAVIEGLADGSVD
IVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVDVLTMS
ATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQVAAALRRE
LLRDGQAFYVHNRVSSIDRAAARVRELVPEARVVVVVAHGQMPEERLERT
VQGFWNREYDILVCTTIVETGLDISNANTLIVERADTFGLSQLHQLRGRV
GRSRERGYAYFLYPPHAPLTETAYDRLATIAQNNELGAGMAVALKDLEIR
GAGNVLGVEQSGHVAGVGFDLYVRLVGEAVEAYRAAADGQTVTTAEEPKD
VRIDLPVDAHLPPDYIASDRLRLEAYRRLAAAGSDDEIDAVVEELVDRYG
ALPEPALRLVAVARLRLLCRAAGITEVSAPSAATVRLSPITLPDSAQVRL
KRMYPAASYRATTSTVQVPIPRAGGVGAPRLRDVELVQMVANLVTALQGK
PQTDVGTGTPVAAMASEEGRG
>MAP3360c mtrA, MtrA
MDSMRQRILVVDDDASLAEMLTIVLRGEGFDTAVIGDGTQALTAVRELRP
DLVLLDLMLPGMNGIDVCRVLRADSGVPIVMLTAKTDTVDVVLGLESGAD
DYIMKPFKPKELVARVRARLRRNDDEPAEMLSIADVEIDVPAHKVTRNGE
QISLTPLEFDLLVALARKPRQVFTRDVLLEQVWGYRHPADTRLVNVHVQR
LRAKVEKDPENPTVVLTVRGVGYKAGPP
>MAP3275 narL, NarL_2
MTDDATPTTVMVVDDHPIWRDAVARDLAESGFAVVATADGVTAAQRRAGV
VRPDVVVMDMQLPDGDGAQATAAVLAVSPSSRVLVLSASDERDDVLQAVK
AGAAGYLVKSASKAELAQAVTDTAAGRAVFTPSLAGLVLGEYRRIAQSKD
DGPTRPRLTDRETEVLRYVAKGLSAKQIAEKLSLSHRTVENHVQATFRKL
QVANRVELARYAIEHGLDEEP
>MAP0689c narL, NarL_1
MADPATRETVRVVVADDHPLFREGVVRALVSSGAVNVVGEAEDGSAALEL
IKSHQPDVALLDYRMPGMDGAQVAAAVRADGLATRVLLISAHDESAIVYQ
ALQQGAAGFVLKDSTRSEIVKAVLDCAQGRDVVAPALVGGLAAEIRQRAE
PTGPVLSAREREVLHRIARGQSIPAIAGELYVAPSTVKTHVQRLYEKLGV
SDRAAAVAEAMRQGLLS
>MAP2909c nusA, NusA
MNIDMAALHAIEVDRGISVNELLETIKSALLTAYRHTEGHQNDARIEIDR
KTGVVKVIARETDEDGNVISEWDDTPEGFGRIAATTARQVMLQRFRDAEN
ERTYGEFSTREGEIVAGVIQRDSRANARGLVVVRMGTETKASEGVIPAAE
QVPGESYEHGNRVRCYVIGVTRGAREPLITLSRTHPNLVRKLFSLEVPEI
ADGSVEIVAVAREAGHRSKIAVKSNLPGLNAKGACIGPMGQRVRNVMSEL
SGEKIDIIDYDDDPARFVANALSPAKVVSVSIIDQAARAARVVVPDFQLS
LAIGKEGQNARLAARLTGWRIDIRGDSPGGAGGHSESRPEHGATHGMAHD
R
>MAP1098 nusB, NusB
MSKPLRGRHQARKRAVDLLFEAEARGLSPAEVVDVRTGLADTNPEVAPLQ
PYTAAVARGVGDHAAHIDDLISSHLQGWTLDRLPAVDRAILRVAVWELLY
ADDVPEPVAVDEAVQLAKELSTDDSPGFVNGVLGQVMLVTPQIRAAARAV
RGPRDT
>MAP4111 nusG, NusG
MTTFDGDPSAGDAVDLRETDEATEAADEAAETTEDAAESAETQGEPAEEV
DPAAALKAELRSKPGDWYVIHSYAGYENKVKANLETRVQNLDVGDYIFQV
EVPTEEVTEIKNGQRKQVNRKVLPGYILVRMDLTDDSWAAVRNTPGVTGF
VGATSRPSALSLDDVVKFLLPRGATKKVAKGAASTAAAAEAGGLERPAIE
VDYEVGESVTVMDGPFATLPATISEVNGEQQKLKVLVSIFGRETPVELTF
SQVSKL
>MAP1590 oxyR, OxyR
MLPRTLNEVKVIFPTTLIGMSDKTYQPTIAGLRAFVAVAEKRQFSGAATA
LGVSQSTLSQALAALEAGLGTQLVERSTRRVFLTPQGAELLPHAQAVVEA
ADAFTAAAAGSADPLRAGMRLGLIPTVAPYVLPTVLAGIAERRPGLTLRV
TEDQTERLLAVLREGALDAALIALPAETAGVTAIPIYDEDFVLALPPGHP
LAGKRRVPATALADLPLLLDEGHCLRDQALDVCHKAGVRAELANTRAASL
ATAVQCVTGGLGVTLIPQSAVPVEASRSRLGLAQFAAPRPGRRIGLVFRS
SSGRDDSYRELAGLISSQHQVRLVK
>MAP3676 oxyS, OxyS_2
MLFRQLEYFVALASERHFARAARACYVSQPALSEAIRKLEQELNVPLVRR
GQKFEGLTPEGERLVLWARRILADHDALKQEVAALQTGLTGELRLGVIPA
ASSIVALLTDPFCAAHPLVRVQLETSLRSAEIVQRLRRFELDAGVLYPDR
QDVADLLVTPLYTEQQVLIAGAELLPDPSETISWSDALTLPMCLLNQGMR
GRRLIDDALAALELTVTPQLETDSVATLLAHVGTGRWASIIPQTWIHSLR
PPAGARILRLENPTVTATVALVTSATEPGSVLTRALVQTARGAGINDVLR
KAAPGYSAG
>MAP3522 oxyS, OxyS_1
MRVLFRQLEYFVAVASERHFARAAEKCFVSQPALSAAIAKLEKELNVTLI
NRGHSFEGLTPEGERLVVWARRILAEHDAFKAEVDAVRSGVTGTLRLGTV
PTASTTASLLLSAFCSAHPLVKVQILSRLSAGELYRRLREFELDAAIAHT
APDDIPDVNLVPLYRERYVLLAPADLLASGAATMTWVQAAQLPLALLTPD
MRDRQIIDRAFADHGITLHPQVETDSVASLFAQASAGSWASIVPHTWLWA
SPLGAGIRAVELVDPVLTADVVLATKSHGPGSPIARALAASAGRLQLNDF
FDAQLLGVTRRR
>MAP4343c parA, ParA
MTQPLRKKGGLGRGLASLIPTGPAEGDAGPATLGPRMGDAAADVLIGGPA
PQEASPVGAVYREISPADIERNPRQPRQVFDEEALAELVHSIREFGLLQP
IVVRAIKESASGARYQIVMGERRWRAAQEAGLATIPAIVRETGDDNLLRD
ALLENIHRVQLNPLEEAAAYQQLLDEFGVTHDELAARIGRSRPLITNMIR
LLKLPIAVQRRVAAGVLSAGHARALLSLEAGPEAQEELATRIVAEGLSVR
ATEEAVTLANRAGTTTPTPPRRKPIQMPGLQDVADRLSTAFDTRVTVSLG
KRKGKIVVEFGSVDDLQRIIDVMAPPKP
>MAP0591 phoP, PhoP
MTSATPTDAKPEARVLVVDDEANIVELLSVSLKFQGFEVHTATNGAQALD
RAREARPDAVILDVMMPGMDGFGVLRRLRADGIDAPALFLTARDSLQDKI
AGLTLGGDDYVTKPFSLEEVVARLRVILRRAGKGGAEPRSARLTFADIEL
DEETHEVWKAGQPVSLSPNEFTLLRYFVINAGTVLSKPKILDHVWRYDFG
GDVNVVESYVSYLRRKIDTGEKRLLHTLRGVGYVLREPR
>MAP0018c pknA, PknA
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKQEFSQ
DPEFIERFRAEARTTAMLNHPGIAAVHDYGESQLDGEGRTAYLVMELVNG
EPLNSVLKRTGRLSLRHALDMLEQTGRALQVAHAAGLVHRDVKPGNILIT
PTGQVKITDFGIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDATPASDV
YSLGVVGYEVVSGKRPFSGDGALTVAMKHIKEPPPPLPAELPPNVRELIE
ITL
>MAP3387c pknD, PknD
MSDNGPAAQVGSWFGPYRLVRLLRQGGMGEVYEAEDTRKHRLVALKLISQ
QFSGNPEFSARLQREADIAGRLTEPHVVPIHDYGEIDGRFFVEMRLVDGI
DLGSLLHREGPLAPPRAIAIIRQVAAALDAAHAAGVTHRDVTPGNILVTP
SDFAYLADFGIARAASDPGLTQVGTAIGTYYYMAPERFTDDEVTNSVDIY
SLACVLTECLTGVPPYRADTVERLVAAHLTKTAPPLSQLRPGAFPPALDR
VIAKGMAKRPEDRYRTAGEFAAAAHEALTTSEQRKAATILRDGQIAALGA
GAAEQRSTHWPDSFAPSPSAETVVGPSPARAGAPSSGLIRAAPTGSGRVY
APGPDFGRPAAPTDNKRKQWIIVGAVALVALVAFVVAVVGYLSTASSGPA
KQAGGQSVLPFNGIDFRLSPGGVTLDGTGNVYVTSEGMYGRVVKLAAGSG
ATTVLPFNGLYQPQGLAVDGAGTVYVADFNNRVLSMAAGSNSQKELPFSG
LNYPEGVAVDSQGGVYVADRGNSRVLKLAAGSQNQTVLPFTGLNNPDGVA
VDPAGNVYVADTDNNRVVKLDAASNTQSELPFHDLSVPWGIAVDNGGTVY
VTEHDKNDVMKYPPGATSGTVLPFTALNTPLAVAVDRDQSVYVADRGDDR
VVKLVQ
>MAP1049c pknE, PknE
MTLDPDSFGHYRILELLGRGGMGRVYRAYDATTDRVVALKVLPPHLAEDQ
DFQQRFRREARIAAGLNDPHVVPIHGYGEIDGRLYVDMRLIEGRDLAHYI
TENGGRLSPQRAVAVIEQVAAALDSAHRAGLIHRDVKPMNVLVTTARDFV
YLIDFGLARAQADTALTQTGATMGTVAYMAPERFTGTTDHRADVYSLACV
LHECLTGKRPFAGDSLEEQLNAHLNTAPPRPSATAPEVPAAFDAVIARGM
AKDPERRYQSVTELAEAARAALAPGVVEKPSAPTPQPRAARRVRAAVVGA
SALTLAVVAAVVVAMVTHGHGPRGAAPKTPGSPAPGRPAPPLPAFVAPPD
LGANCQYRAVPDPSSRPVSPPPSGRVPTTPGQIGAVIATNLGDIGISLAN
SESPCAVNSFISLARQRFFDNTQCARLVDSPDGGSLLCGGPDVDGSGGPG
YEFADEYPANQYRPDDPALRATLLYPRGTVVMATEGPNTNGSQFALIFHD
SEMDPQSTVLGTIDPAGLATLDKIARAGIAGNRPSGPPANPVTITSVRIG
>MAP1332 pknF, PknF
MTIGNGASFAGYTILRQLGAGGMAEVYLALHPRLPRRDVIKVLAEAVTVD
PEFRERFNREADLAATLWHPHIVGVHDRGEFNGHLWISMDYVEGTDASRL
VKESYPDGMPLDEVSAIVQAVAGALDYAHARGLLHRDVKPANILLTHPEA
GERRILLADFGVARHLGDISGITETNVAVGTVAYAAPEQLTGSPIDGRAD
QYALAATAFHLLTGAPPFQHSNPIAVIGQHLHEDPPRLSDFRPELAGLDE
VFCQALAKAPEDRFDRCRAFAAAVRRECDGAAAIGPDARSRSVASPPHRR
RGPGRVIAAVTHRFSSQTRWAAALVCAVLVAVAATWSVLYSFQPGAPPAN
PALASKPSPPAAVAAPIAGGPVLNGTYKLDYDQTKRTTNGIGIRHDGAGT
NWWAFRSACTSSGCAATGTRLDDATHQTAGGPDGGQTDTLRFVGGYWQGA
PEQQRVGCTRPGGPAGATQQETIAWSLAPQSDGTLRGTETETVLSNECGA
QGAVVRVPVVATRVGDVPPGVTVADPASVINASPTATAPAPPVLGGLCSD
VGKVAYDPTNNEQIVCEGSSWAKAPITMGVHAAGSSCDRPGTSVFAMSTS
SDGYLLQCDPVTRTWTRPAG
>MAP3893c pknG, PknG
MAEPDNKSEQPEPGAEQMGPGTQPAEVGDDAQAGAATGRLQATQALFRPD
FDDDDDDFPHISLGALDTDSADRMTVATQALPPVRQLGGGLVEIPRGRDI
DPREALMTNPVVPESKRFCWNCGKPVGRSTKKSKGTSEGWCPHCGSAYSF
LPQLNPGDIVANQYEVKGCIAHGGLGWVYLAVDHNVNDRPVVLKGLVHSG
DAEAQAIAMAERQFLAEVVHPQIVQIFNFVEHVDRHGNPVGYIVMEYVGG
QPLRHGKGEKLPVSEAIAYVLEILPALGYLHSIGLVYNDLKPENIMLTEE
QLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTVGRTLAA
LTLNLPTRNGRYVDGIPDNDPVLGTYDSFRRLLRRATDPDPRRRFSSTEE
MSAQLMGVLREVVAHDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDG
QVHSEKLTAREIVTALQVPLVDPADVAAPVLQATVLSQPVQTLDSLRAAR
HGTLDADGVELSESIELPLMEVRALLDLGDVAKATRKLDDLAERVGWQWR
LVWYKAVAELLTGDYDSATTHFTEVLDTFPGELAPKLALAATAELAGDVD
EHRFYETVWKTNDGVISAAFGLARTLSAEGDRAAAVRTLDEVPATSRHFT
TARLTSAVTLLSGRSKSEITEEEIRDAARRVEALPPTEPRVLQIRALVLG
CAMDWLEDNKASTNHILGFPFTEHGLRLGVEAALRNLARVAPTQRHRYAL
VDMANKVRPTSTF
>MAP1914 pknL, PknL
MLDGRYLIESKIASGGTSTVYRGVDTRLDRPVAVKVMDPRYAGDDQFLTR
FQREARAVARLKDPGLVAVYDQGLDARHPFLVMELIEGGTLRELLGERGP
MPPYAVAAVLRPVLGGLAAAHRAGLVHRDVKPENVLISDDGEVKIADFGL
VRAVAAAGITSASVILGTAAYLSPEQVRDGAATPRSDVYAAGIVAYELLT
GRTPFTGDSMLAIAYRRLDADVPPPSAAIDGVPAQFDDFVQRATARDPAD
RYADAVEMGADLDAIADELALPGFRVPAPRNSALHRSAALHREAGRRAPA
AEPPARHPTRHLTRGPEEWPQPDPPAHVGAEPDDDEDDYEYQSVTGEFAG
IPISEFVWARQHNRRMVLVWLALVLAVTGMVATAAWTIGRNLNGLF
>MAP2819 ppgK, PpgK
MTSTDSTAHTPAAPAAGPPPRRGFGVDVGGSGIKGGIVDMDTGLLIGERV
KLLTPQPATPSAVAKTIAAVVDAFEWTGPLGVTYPGVVTHGVVQTAANVD
KAWIGTNARDIISAELNGQEVTVLNDADAAGLAEEHYGAGRNQSGLVVLL
TFGTGIGSAVIHNGTLIPNTELGHLEVGGKEAEQRAASSVKERHGWSYEK
WAKQVTRVLVSIENALWPDLFIAGGGISRKADKWLPLLENRTPVVAAALL
NTAGIVGAAMAATSDVTH
>MAP3009c recG, RecG
MASLTDRLDFVVGAKAAEQLEELFGIRTVDDLLRHYPRSYTEGASRWGAD
DERPPAGEHITIIDTITETKTWPMKKTPKKVCHRITLGAGRNKVTATFFN
ANYLKKGLTEGTKVMLSGEVGFFKNVMQLTHPAFLILDSPDGRNKGTRSL
KNIANASGASGEAVLDAYERHFFPIYPASTKMQSWDIFSCVRLVLDVLDP
VPDPLPEPLRAKFDLVCEDQALRDIHLAENEARRQRARERLTFDEAVGLQ
WALVARRHGELSESGPPAPPRPDGLAAELLRRLPFELTAGQREVLDVLSD
GLASTRPLNRLLQGEVGSGKTIVSVLAMLQMVDAGYQCALLAPTEVLAAQ
HLRSIRDVLGPLAMAGQLGGADNATRLALLSGSMTAAQKKQVRDEVAGGQ
VGIVVGTHALLQDAVEFHNLGMVVVDEQHRFGVEQRDRLRAKARPGVTPH
LLVMTATPIPRTVALTVYGDLETSTLRELPRGRQPITSNVIFVKDKPAWL
GRAWRRIGEEVAAGRQAYVVAARIDESDDDGAADQNAKAPETAEGLYARL
RSQELAQLRLGLMHGRLSAEEKDAVMAAFRAGDIDVLVCTTVIEVGVDVP
NATVMLVMDADRFGISQLHQLRGRIGRGEHPSLCLFASWAAPDSPAGRRL
TAVAETMDGFALADLDLKERREGDVLGRNQSGRAVTLRLLSLADHQEYIE
AARDFCVQAYAGNRFDPGLSLLAARFTDTDRIEYLDKS
>MAP3983 regX3, RegX3
MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGSAALAEFDRAGADIVL
LDLMLPGMSGTDVCKQLRARSSVPVIMVTARDSEIDKVVGLELGADDYVT
KPYSARELIARIRAVLRRGGDDDSEISDGVLESGPVRMDVERHVVSVNGD
TITLPLKEFDLLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKR
LRSKIEADPANPVHLVTVRGLGYKLEG
>MAP1047 relA, RelA
MAEENSAAQALDAPAESPPNPVIETPEPPTESLKTSSSASRRVRARLARR
MTAQRSTLNPVLEPLVAMHREIYPKANVQLLQRAFEVADQRHASQLRHSG
DPYITHPLAVATILAELGMDTTTLVAALLHDTVEDTGYTLAQLSEEFGEE
VGHLVDGVTKLDRVVLGSAAEGETIRKMITAMARDPRVLVIKVADRLHNM
RTMRFLPPEKQARKARETLEVIAPLAHRLGMASVKWELEDLSFAILHPKK
YDEIVRLVAGRAPSRDTYLAKVRAEIINTLNASKIKATVEGRPKHYWSIY
QKMIVKGRDFDDIHDLVGIRILCDEIRDCYAAVGVVHSLWQPMAGRFKDY
IAQPRYGVYQSLHTTVVGPEGKPLEVQIRTRDMHRTAEYGIAAHWRYKEA
KGRNGVPHPHAAAEIDDMAWMRQLLDWQREAADPGEFLESLRYDLAVQEI
FVFTPKGDVITLPTGSTPIDFAYAVHTEVGHRCIGARVNGRLVALERKLE
NGEVVEVFTSKAANAGPSRDWQQFVVSPRAKAKIRQWFAKERREEALEAG
KDAMAREVRRGGLPLQRLVNAESMSAVARELHYADVSALYTAIGEGHVSA
RHVVQRLLAELGGIDQTEEDLAERSTPTTMLRRPRSSDDVGVSVPGAPGV
LTKLAKCCTPVPGDQIMGFVTRGGGVSVHRTDCTNAASLQQQSERIIEVH
WAPSPSSVFLVAIQVEALDRHRLLSDVTRVLADEKVNILSASVTTSGDRV
AISRFTFEMGDPKHLGHLLNVVRNVEGVYDVYRVTSAA
>MAP3312 rhlE, RhlE
MTTPTSTTELTFAQLGVRDEIVRALDEKGIQHPFAIQELTLPLALAGDDL
IGQARTGMGKTFAFGVPLLQRITAGTAPRALNGTPRALVVVPTRELCLQV
TDDLTLAAKHLTADGGRPLSVVPIYGGRPYEPQIDALRAGADVVVGTPGR
LLDLAQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQIPDDRQSML
FSATMPDPIITLARTFMNQPTHIRAEAPHSAATHDTTVQYVYRAHALDKV
ELVSRVLQAESRGATMIFTRTKRTAQKVADELAERGFKVGAVHGDLGQVA
REKALKAFRTGDIDVLVATDVAARGIDIDDVTHVINYQIPEDEQAYVHRI
GRTGRAGKAGVAVTLVDWDELARWALIDKALGLDVPEPAETYSNSPHLYE
ELGIPAGAGGRVGAARKPQGPRRSAERATGKPDQKSETATRRSGTRRRRT
RGGQPVSGHPSGNGAASSNGEAAADAPAGSPPGNPGSSRRRRRRRKPADA
TAQSN
>MAP2464c rho, Rho
MTDTDLFTAGENTDANQLSTAVTTDTPDAKTHAPVGALTTMVLPELRALA
NQVGVKGTSGMRKNELIAAIKEVRGQANGAAPAEQANGAEPAEDSGKPDA
KDAKDDTTPDTKDANDHTGAQSAAEAPAPSEQNGATAEAPRRERRSASRD
AGAAGRGSDDADRETDGREQSKQDSREQSKQDGGDQDQQSSGGQQGRGSN
QQDDDGEGRQGRRGRRFRDRRRRGERSGEGGDAELREDDVVQPVAGILDV
LDNYAFVRTSGYLAGPHDVYVSMNMVRKNGLRRGDAVTGAVRVPKDGEQP
NQRQKFNPLVRLDSVNGGSVEDAKKRPEFGKLTPLYPNQRLRLETTPDRL
TTRVIDLIMPIGKGQRALIVSPPKAGKTTILQDIANAITKNNPECHLMVV
LVDERPEEVTDMQRSVKGEVIASTFDRPPSDHTSVAELAIERAKRLVEQG
KDVVVLLDSITRLGRAYNNASPASGRILSGGVDSTALYPPKRFLGAARNI
EEGGSLTIIATAMVETGSTGDTVIFEEFKGTGNAELKLDRKIAERRVFPA
VDVNPSGTRKDELLLSPDEFGIVHKLRRVLSGLDSHQAIDLLMSQLRKTK
NNYEFLVQVSKTTPGAMDND
>MAP2995c rnc, Rnc
MSSRQPLLDALGVELPDELLSLALTHRSYAYEHGGLPTNERLEFLGDAVL
GLTITDALYHRHPDRTEGDLAKLRASVVNTQALADVARKLCDGGLGAHLL
LGRGEANTGGADKSSILADGMESLLGAIYLQHGIDTAREVILRLFGALLD
AAPTLGAGLDWKTSLQELTAARGMGAPSYLVTSTGPDHDKEFTAVVVVAD
TEYGTGVGRSKKEAEQKAAAATWKALDVLDSAAQTSA
>MAP4233 rpoA, RpoA
MLISQRPTLSEEVLTDNRSQFVIEPLEPGFGYTLGNSLRRTLLSSIPGAA
VTSIRIDGVLHEFTTVPGVKEDVTAIILNLKSLVVSSEEDEPVTMYLRKQ
GPGEVTAGDIVPPAGVTVHNPELHIATLNDKGKLEVELVVERGRGYVPAV
QNRASGAEIGRIPVDSIYSPVLKVTYKVDATRVEQRTDFDKLILDVETKS
SITPRDALASAGKTLVELFGLARELNVEAEGIEIGPSPAEADHIASFALP
IDDLDLTVRSYNCLKREGVHTVGELVSRTESDLLDIRNFGQKSIDEVKVK
LHQLGLSLKDSPPSFDPSQVAGYDVATGTWSTEAAYDDQDYAETEQL
>MAP4130 rpoB, RpoB
MPAEPTQFAANAAGGPGLRESHEVLEGCILADFRQSKTDRPQSSSNGSSS
LNGSVPGAPNRVSFAKLREPLEVPGLLDVQIDSFEWLIGAPRWREAAIAR
GDAEPKGGLEEVLDELSPIEDFSGSMSLSFSDPRFDEVKAPVDECKDKDM
TYAAPLFVTAEFINNNTGEIKSQTVFMGDFPMMTEKGTFIINGTERVVVS
QLVRSPGVYFDETIDKSTEKTLHSVKVIPSRGAWLEFDVDKRDTVGVRID
RKRRQPVTVLLKALGWTNEQITERFGFSEIMMSTLEKDNTAGTDEALLDI
YRKLRPGEPPTKESAQTLLENLFFKEKRYDLARVGRYKVNKKLGLHAGEP
ITSSTLTEEDVVATIEYLVRLHEGQPTMTVPGGIEVPVETDDIDHFGNRR
LRTVGELIQNQIRVGMSRMERVVRERMTTQDVEAITPQTLINIRPVVAAI
KEFFGTSQLSQFMDQNNPLSGLTHKRRLSALGPGGLSRERAGLEVRDVHP
SHYGRMCPIETPEGPNIGLIGSLSVYARVNPFGFIETPYRKVVDGVVTDE
IHYLTADEEDRHVVAQANSPIDDKGRFAEARVLVRRKAGEVEYVPSSEVD
YMDVSPRQMVSVATAMIPFLEHDDANRALMGANMQRQAVPLVRSEAPLVG
TGMELRAAIDAGDVVVAEKSGVIEEVSADYITVMADDGTRHTYRMRKFER
SNHGTCANQSPIVDAGDRVEAGQVIADGPCTENGEMALGKNLLVAIMPWE
GHNYEDAIILSNRLVEEDVLTSIHIEEHEIDARDTKLGAEEITRDIPNVS
DEVLADLDERGIVRIGAEVRDGDILVGKVTPKGETELTPEERLLRAIFGE
KAREVRDTSLKVPHGESGKVIGIRVFSREDDDELPAGVNELVRVYVAQKR
KISDGDKLAGRHGNKGVIGKILPQEDMPFLPDGTPVDIILNTHGVPRRMN
IGQILETHLGWVAKSGWNIDGNPEWAVNLPEELRHAQPNQIVSTPVFDGA
KEEELAGMLSCTLPNRDGEVMVDGDGKAVLFDGRSGEPFPYPVTVGYMYI
MKLHHLVDDKIHARSTGPYSMITQQPLGGKAQFGGQRFGEMECWAMQAYG
AAYTLQELLTIKSDDTVGRVKVYEAIVKGENIPEPGIPESFKVLLKELQS
LCLNVEVLSSDGAAIELREGEDEDLERAAANLGINLSRNESASVEDLA
>MAP4131 rpoC, RpoC
MLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKDGLFCEK
IFGPTRDWECYCGKYKRVRFKGIICERCGVEVTRAKVRRERMGHIELAAP
VTHIWYFKGVPSRLGYLLDLAPKDLEKIIYFAAYVITSVDEEMRHNELST
LEAEMMVERKAVEDQRDADLEARAQKLEADLAELEAEGAKADARRKVRDS
GEREMRQIRDRAQRELDRLEDIWNTFTKLAPKQLIVDENLYRELVDRYGE
YFTGAMGAESIQKLIENFDIDAEAEQLRDVIRNGKGQKKLRALKRLKVVA
AFQQSGNSPMGMVLDAVPVIPPELRPMVQLDGGRFATSDLNDLYRRVINR
NNRLKRLIDLGAPEIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLK
SLSDLLKGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKLMALE
LFKPFVMKRLVDLNHAQNIKSAKRMVERQRPQVWDVLEEVIAEHPVLLNR
APTLHRLGIQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQMAVHLPLSAE
AQAEARILMLSSNNILSPASGRPLAMPRLDMVTGLYYLTTEVEGDKGEYS
PAAKDRPETGVYSSPAEAIMAADRGVLSVRAKIKVRLTQLRPPAEIEAEL
FGANGWQPGDAWMAETTLGRVLFNELLPVGYPFVNKQMHKKVQASIINDL
AERYPMIVVAQTVDKLKDAGFYWATRSGVTVSMADVLVPPRKKEILDQYE
ERAEKVEKQFQRGALNHDERNEALVEIWKEATDEVGQALREHYPADNPII
TIVDSGATGNFTQTRTLAGMKGLVTNPKGEFIPRPVKSSFREGLTVLEYF
INTHGARKGLADTALRTADSGYLTRRLVDVSQDVIVREHDCETERGIVVE
LAERQPDGTLIRDPYIETSAYARTLGTDAVDEAGNVIVARGEDLGDPEID
ALLAAGITSVKVRSVLTCTTGTGVCATCYGRSMATGKLVDIGEAVGIVAA
QSIGEPGTQLTMRTFHQGGVGEDITGGLPRVQELFEARIPRGKAPIADVT
GRVRLEDGERFYKITIVPDDGSEEVVYDKLSKRQRLRVFKHEDGSERVLS
DGDHVEVGQQLMEGSADPHEVLRVQGPREVQIHLVREVQEVYRAQGVSIH
DKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFEAENRRVVAEGGEP
AAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKE
NVIIGKLIPAGTGINRYRNIQVQPTEEARAAAYTIPSYEDQYYSPDFGQA
TGAAVPLDDYGYSDYR
>MAP2820 sigA, SigA
MAATKASPATDGPVKRTATKSPSSPAKRPAAKAANGSAPAKRATKTASRS
AKSEAGAAAEPAKKTRSSAKGADAKAPSARGTKAAKGGPADPDALDTDGA
VEDLDTEPDLEGEPGEDLDIDTDLNLDDLEEDVAADDADIEPGDAEEGED
EEAAAPKAAGATVADEDDEIAEPSEKDKASGDFVWDEDESEALRQARRDA
ELTASADSVRAYLKQIGKVALLNAEEEVELAKRIEAGLYATQLMSEMAER
GEKLPAAQRRDMMWICRDGDRAKNHLLEANLRLVVSLAKRYTGRGMAFLD
LIQEGNLGLIRAVEKFDYTKGYKFSTYATWWIRQAITRAMADQARTIRIP
VHMVEVINKLGRIQRELLQDPGREPTPEELAKEMDITPEKVLEIQQYARE
PISLDQTIGDEGDSQLGDFIEDSEAVVAVDAVSFTLLQDQLQSVLETLSE
REAGVVRLRFGLTDGQPRTLDEIGQVYGVTRERIRQIESKTMSKLRHPSR
SQVLRDYLD
>MAP2826 sigB, SigB
MNPMTVQAEREVAMANASTSRFDGDLDAQSPAADLVRVYLNGIGKTALLN
AAGEVELAKRIEAGLYAEHLLETRKRLGENRKRDLEAVVRDGQAARRHLL
EANLRLVVSLAKRYTGRGMPLLDLIQEGNLGLIRAMEKFDYTKGFKFSTY
ATWWIRQAITRGMADQSRTIRLPVHLVEQVNKLARIKREMHQNLGREATD
EELAAESGIPIDKINDLLEHSRDPVSLDMPVGSEEEAPLGDFIEDAEAMS
AENAVIAELLHTDIRSVLATLDEREHQVIRLRFGLDDGQPRTLDQIGKLF
GLSRERVRQIERDVMSKLRNGERADRLRSYAS
>MAP4275 sigD, SigD
MEISPSMTIPAGERLDAVVAKAVTGDHNALREVLETIRPIVVRYCRARVG
TVERGGLSADDVAQEVCLATITALPRYRDRGRPFLAFLYGIAAHKVADAH
RAAGRDLAYPTESIPDRWSNDAGPEQLAIEADSVSRMSELLEILPAKQRE
ILILRVVVGLSAEETAAAVGSTTGAVRVAQHRALSRLKSEMIAAGDCA
>MAP2557c sigE, SigE
MDRGARETGNTEWQLPVAANDEMPLIGMPNSEELIITTLLSPSSMSHAHD
PSADGWAEPSDGLQGTAVFDATGDKTAMPSWDELVRQHADRVYRLAYRLS
GNQHDAEDLTQETFIRVFRSVQNYQPGTFEGWLHRITTNLFLDMVRRRSR
IRMEALPEDYERVPADEPNPEEIYHDSRLGPDLQAALDSLPPEFRAAVVL
CDIEGLSYEEIGATLGVKLGTVRSRIHRGRQALRDYLAAHPDHDALRASS
A
>MAP1474c sigF, SigF_1
MTNAIAPTPTAARPTSQSDDSYEDVVEMFLELRRMPAESHEYRRQRERIV
ARCLPLADHVASHFARRGEGLDDLVQVARLGLMNAVNRFDPAKGPSFIGF
AVPTMMGEVRRYFRDYSWGMRVPRRLRELHVQISRTTADLAQQLGRAPNA
GELSQVLEVPREEIVECLVAGDAYRLDSLDAPQGADSSGTPRSVADSVGD
IDPQIEHITNREALRVLVATLPHREREVLRMRFFESMTQSQIAERIGVSQ
MQVSRILANTLRCLRDQLE
>MAP3406c sigF, SigF_2
MTARAAGGSVSRPNEYADVPDMFRELATAEPDSMEFQRQRDKIVERCLPL
ADHIARRFEGRGEPRDDLVQVARVGLVNAVVRFDVDAGSDFVSFAVPTIM
GEVRRHFRDNSWSVKVPRRLKELHLRLGTATADLSQRLGRAPTATELAAE
LEMDREEVVEGLVAGSSYNTLSIDSGGGSEEEEVRAIADTLGDVDTGLDR
IEDQESLRPLLEALPERERTVLVLRFFESMTQTQIAERVGISQMHVSRLL
AKSLTRLRDQLQ
>MAP3621c sigG, SigG
MRVPHHYRQRNGCGSAPRLQTLRTLMRVSLLAQTPEGDSVVGADFTAHAE
PYRRELLAHCYRMTGSLHDAEDLVQETLLRAWKAYDRFEGKSSVRTWLHR
IATNTCLSALEGRQRRPLPVGLGAPSADPTAELVERREVPWLEPLPALTD
DPADPSVIVGSRESVRLAFVAALQYLSPRQRAVLLLRDVLGWRAAEVAEA
IGTTTAAVNSLLQRARAQLDAVGPSADDQLAQPDSAETQDLLARYIAAFE
SYDIDRLVELFTAEAIWEMPPYTGWYQGARTIVTLIHQQCPAEGAGDMRL
LPLIANGQPAAAMYMRAGDVHLPFQLHVLDVRGDRVSHVVAFLDDSLFAK
FGLPAALGPRQEGVRA
>MAP3324c sigH, SigH
MVSTATSLLGEEQLAGFLASPGALSVLSGDTAAEGTGFIEMADSPDGPDG
VTSPEVPEAHAEPAAHEEAREETDAELTARFERDAIPLLDQLYGGALRMT
RNPADAEDLLQETMVKAYAGFRSFRAGTNLKAWLYRILTNTYINSYRKKQ
RQPAEYPTEEITDWQLASNAEHSSTGLRSAEVEALESLPDSEIKDALQAL
PEEFRMAVYYADVEGFPYKEIAEIMDTPIGTVMSRLHRGRRQLRGLLADV
AKERGFNRGQQTHEEVSS
>MAP0170 sigI, SigI
MNRSPSADDQVAEAWRRHRPYLVNLAYQMLGDIGDAEDVAQEAFLRLSAA
GDIDDVRGWLTVVASRLCLDQLRSARARHERPGDVGEQPAPPRFDPADRI
TLDDEVRTALLEVLRRLSPGERVSFVLHDVFGVPFEAIAQTVGRPVGTCR
QLARRARAKFVAAQPKLNTVGAAEHQLVTEKFITACANGDLAGLAAVLDP
TVWGVGTVLADPAPPPQVNHGPDAVATNLLRYLWPDVTLVGGPAGAPVLL
AFSRRRLFAVIVLSIRDARVVKIEAIADPAARSAG
>MAP3446c sigJ, SigJ
MPGEVQPEVLMSVAYRLTGTVADAEDIVQDAWLRRHGQDGAITDLRAWLT
TVVSRLGLDRLRSATHRRETYTGNWLPEPVVTGLGPHSGADPLAAVVAGE
DARFAAMVVLERLSPDQRVAFVLHDGFAVPFSEVAEVLGTSEAAARQLAS
RARKVVSAQPPPEPDPSHDEVVGQLMAAMAAGNLEAVVALLHPDVMFTGD
SNGKAPTAVQVIHGADKVVRFMLGLARRYGPGFYSAYQLGLVNGELGIYT
AGLPGGDGYREMCPRIMAMTVRERKVCALWDIANPDKFTGSPLGR
>MAP4201 sigL, SigL
MARVVGISRASGTAEAALMKALYDEHAAVLWRYALRLTGDASQSEDVVQE
TLLRAWQHPEVIGDTERSARAWLFTVARNMIIDDRRSARFRNVVGSTDTA
GAPEQSTPDEVNAALDRLLIADAMAQLSAEHRAVIERSYYRGWTTAQIAT
DLGIAEGTVKSRLHYAVRALRLTLQELGVTR
>MAP4337 sigM, SigM
MGFGRNGNGDRSDAELLAAHVAGDRYAFGELFVRHQRHLHRLARLTTRSP
EDAEDALQDAMLSAHRGAGAFRHDAAVGSWLHRIVVNACLDRLRRTKAHP
TVPLEDIYPVADRTAQVETTLAVQRALMRLPVEQRAAIVAVDMQGYSVAD
TARLLGVAEGTVKSRCARARVRLAELLGYLDAGAHAAAEGAAGQA
>MAP1102c tcrA, TcrA
MKVLLVEDEPRLAATVARGLKAEGFVVVTVGNGVDGLAEATENPFDIVIL
DIMLPGRSGYEVLRRMRSNNVWTPVLMLTAKDGEYDETDAFDLGADDYLT
KPFSFRVLVARLRALVRRGAPERPVVLTAGSLSLDPARHTVQRGSTPIAL
TPREYGVLEFLMRNKDVVVTKADILANVWDAHHHGPDNVVEVYVGYLRRK
IDVPFGTNTIETIRGVGYRLLC