TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Mycobacterium avium subsp. paratuberculosis str. k10, k10
Gene type: CDS

Number of genes found: 189

Free access
Sort by:

 



# Mycobacterium avium subsp. paratuberculosis str. k10, k10

>MAP2566 hypothetical protein
MSWGFGDDCWVQGRSDDQRELLDAESVAGHLLKADSMFAFLAAHRSQLFP
EEMFADLFPSQRGRPSVPAEVMASVITLQALHGFSDNETVDAVTFDLRWK
AACGLPITAGAFHSTTLTYWRRRLAASDRPNRIFEAVKTVVAETGVLAGK
TRRALDSTVLDDAVATQDTVTQLIAAIRRVRREVPGAAAVIEAHCSAHDY
DDPGKPAIAWEDKAARDRLVDGLVGDAHRVLGYLPDQELAPRAAEAVALL
ALIAGQDVEPVEGSDGTDGHWRIAQQVSGDRVISTVDADTRHAHKTVHRR
QDGFKAHLAVEPDTGIITDCALTKASGADNHEAVVGLSLLEGEHTPGAGP
GRFGVWHRGRPGGTGRRRPCRGDQTATATLAGSGWLHQRRFPHRFRRPHR
DLPGWTRNADPSQWRSHLRKILSLMPVGVPVHDRHARTQTHPSYPRAIIA
RCSRRGPRPRLASRIPPTPAHGGTLNGLAHSRQPQGPLPRNRQKQPLAAP
PRRSTEPTSAAHHGPDPHRHHLGHRLTRPHTKALPQPPTAATLNNGGAIS
FLSTHRV
>MAP2577 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP0104 hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRTNQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP0346c hypothetical protein
MPKLSAGVLLYRTGDGVVEVLIAHPGGPFWARKDDGAWSIPKGEYDEGED
PWPAARREFAEELGLAVPAGERIDLGTLKQSGGKLVTVFAVHGDLDVTEA
RSNTFALEWPKGSGTLREFPEVDRVGWFPVAAARTKLLKSQHGFLDRLMA
QPAVAGLSEGT
>MAP2026 hypothetical protein
MKSLVGTSFGQYEIRRLIGKGGMGEVYEAYDTKKGRAVALKLLTDNYADD
EKFRERFLRESRAAAILQEPHVIPIHDWGEINGVLYIDMRLVQGQTLHEM
LKTGSLEPRRATDIIRQVASALDAAHAAGLIHRDVKPQNIIVTPDDFAYL
VDFGIAEARGDTHLTMAGHTVGTFDYMAPERFGDEETTSAVDVYALACVL
YEALTGAKPFPVHSAEQAIRAHLSSPPPRPSAVNPHVPASFDDVIARGMA
KHPDDRYGSAGALGRAAKRALAPDPATSAGTNTLLAPQYVSAPSSYPPFA
AQYPYPATGPVSATDADQGGSKKLMVLTIVGVAVALLVGGTGLVIGLTTQ
RNSSTSEPSTSPLVSYTNPVPTYETEPARLPSTPTSAPQDATQQLHQIAN
DDRAFVRAQLADRWVPQLSSKRPGVVDNGVVWDNAMTLREHLQLRQRYPN
VKLLWSGDWSTFSGPDFWVTVAGLTFADSSGPLAWCRFQGFDRDHCAAKL
VSTTHPEAGSTAYN
>MAP2515c hypothetical protein
MRTRSKGPGRGSCQAWGSRGECNRIPWAAATEFRGGRPRMQDVGVLEHPR
TGQAFDSPVPAGSGWPGDPATPQTPVAADADQVIALARHAGAIPELDALV
SVCRACPRLVEWREEVAVVKRRAFADQPYWGRPVPSWGSARPRLLIVGLA
PAAHGANRTGRMFTGDRSGDQLYAALYRAGLVNQPTSVDAADGLRTKHIR
IVAPVHCAPPANVPTPVERDTCWPWLQAEWRLISEHVRVVVALGGFGWQI
ALRLPGVPAARKPRFGHGVVAELAPGVRLLGCYHPSQQNMFTGRLTPAML
DDVFRDAKGLAGIK
>MAP3670 hypothetical protein
MSHRNAPLSETGRLRLARCVVDEGWSLRRAAERFQVSVTTAERWARRYRE
LGEAGMADRSSRPHHSPNRTPTRTERRIIKVRVIRRWGPARIGYLLGIHP
STVHRVLTRYALAKLRWLDRSTGRIIRRMEPAGCGDLVHVDVKKLGKIPA
GGGWRMLGRAIGGHNSNADKSSGVFSKHRNPIRGYHYLHTAIDGYSRLAY
SEVLDDEIKETAAEFWTRANAWFAECGISVRKVLTDNGSCYRSRVFAQAL
GDIEHRRTRPYRPQTNGKVERFHRTLADEWAYARLYRSDTERCEEFTTWL
HTYNHHRGHTALGGQPPASRVPNLSGQYT
>MAP2031c hypothetical protein
MSSSSRGSRLGTRFGPYELRSLIGTGTLGEVYRAYDTVKDRLVALKLLRG
ELDAGFRQRLWRDCRAVTRLQEPHVLPLHDFGEMDGVPFIDMQLVDDGGS
LKELLREQGGLEPSRAASITGQVARALDAAHAAGLMHLDVKPENILLTHD
HFTYLADFGLAQAAGDDKLSRTYMAPERFTTGSLGPQTDIYSLACVLYEC
LTGQPPFEGADPGELRSAHLLSPAPRPSIMRRGVGRAFDDIITRGMAKQR
SARFGSAGELARAASEAVFAAYEPVSAAAGLGGPRPLPTPPAQFDGPDDT
LGPPAAERPPRGRVGRLPVVVTAVAVLMLIAGVVLSVKSVVGTHHNSSAP
PPAPSTRALAPPPPTTPPPPLTPTLSRPVTGADGLGFIGETARCDPGNPP
AAVVRTAKSLAVVCQNLSGSYYYRGERIRDGAHIELSNAERVEDGFDVTN
PVDGVVYEVRPNRLRIISFGHVDSSEPVLQYATAS
>MAP2477c hypothetical protein
MGTLLLAVSGVVFVAILGGVTFIERLASREIYRNPWMVLREDDIRRPDGS
AGIYSVVDKPSYALVMPYDGRRFGLVEQFRYPIGERRWEFPQGTAPELAD
ADPVELARRELREETGLRATTFEPLGRLDVAPGMSSQRGWVFLATGIVEG
EADREHEEQDMRSAWFSRDDVEQMIRTGVIADAQSIAAYGLFLLRPESTG
APVTDQGQ
>MAP3559 hypothetical protein
MTWHAVHGQSGQIGEHRVKSSARTVVFPGAVSFGHTLAPLRRGLGDPCFL
APGDGSIWRTSLLPSGPVTARISRAGTNAVQCVAWGAGAAQFVDMLPAML
GSEDDASDFVPRHPTVAAAQRRVPHLRLGRTGLVLEALIPAIIEQRVPGA
DAFRSWRVLVSKYGTPAPGPAPERMRVPPSAQAWRSIPSWEFHRANVDPR
RAQTVVTCARRAASLERLVSRPAAQARAALTSLPGVGEWTAAETAQRAFG
DADAVSVGDYHIPKMVGWTLLGRPVDDAGMLELLEPMRPHRQRVVRLLEA
SGLAREPRRGPRLPVQQIHSL
>MAP4066 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2225c hypothetical protein
MSEVSPLHLVLGDEELLVERAVAEVLAEARKRAGASGDADVPINRMRAGD
VSTYELAELLSPSLFADERIVVLEAAGEAGKDAAAVILSAAAEMPAGVVL
VVVHSGGGRAKALATELQSLGAVVHPCARITKLSERIDFVRKEFRRLRVK
ADEETVTAMLDAVGSDLRELASACSQLVADTGGAVDAAAVRRYHSGKAEV
KGFDIADKAVAGDIEGAAEALRWAMMRGEPLVVLADALAEAIHTIGRVGP
LSGDPYRLAAQLGMPPWRVQKAQKQARYWSRDSVATAMKVVATLNANVKG
AVADADYALESAVRRVAELAGGRNR
>MAP0028c hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRINQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP3244 hypothetical protein
MAGRFRIRLRAPPRCERLLGWVSEANGPGPQLRCGRPLRTAEKTGGFRMR
LRRSVLSKPGITRKRRGKGFSYHAPDGTTVRDPETLQRIKDLVIPPAWKK
VWISPHPNGHIQAVGVDAAGRRQYLYHCAWQQERAEEKFDRVLELSTELP
AWRARIAEDLAARGLTRERVLALGLRLLDRGYFRSGGEQYAEENESYGLS
TLLCEHVTVRRDAVEFDFPAKSGVRRTVRVEDREVVRAVRPLMRRRHRGD
RLLVCRTASGWSEVRADDLNDRFKELVGDQYTVKDLRTWHGTVLAAAAFA
EADAAVSRRVARRVEAAVMKEVAEELGNTPAVARGSYVDPRVITGYERGL
TIGAATRRAGRARDPDAAQEILDKATRMLIRRVAKGHSASGASVLPKSA
>MAP3088c hypothetical protein
MTSPEEAPLVPRPAATVMLVRDAPAGLKVFLMRRHSRMEFAAGVMVFPGG
GVDERDRNADLGGLGAWAGPPPQWWAQRFGIEPDLAEALVCAAARETFEE
SGVLFAGPAGAPDSIVGDASVYRDARRALADGTLSFADFLRTEKLELRSD
LLRPWANWVTPEAERTRRYDTYFFVGALPQGQRADGQNTESDRAGWTTPE
AALEDFSAGRTFLLPPTWTQLDSLAGRTVADVLAVQRQIAPVQPHVEIRG
DNWVFEFFDSDRYHRAREAGGLGWRH
>MAP1943 hypothetical protein
MAGPHSPNHTVGGQGPTPPSESQPLEFPDHPNAGDTGYAAAPQAPPGSAN
YAGPPPAPAPYPPRRSKRRLIVGLALAVALVAVMTVAIVYGVRTNGANTG
ATFSEGAAKTAIQGYLDALEHRDIDEIARNALCGLYDGVQDKRSDQALAK
LSSDAFRKQFSEVQVTSIDKIVYLSQYQAQALFSMRVSPVSGGPARGQVQ
GIAQLLFQRGQIMVCSYVLRTGGSY
>MAP0589c hypothetical protein
MKELTVAEQRYQAVLAVISDGLSISLVAEKVGVSRQTLHAWLARYEASGL
EGLVDRSHRPVSCPHQMPAVVEAALLERRRSRPYWGPRRLVFELAKRRVG
PVPSESAVYRALVRAGMIDPALRDRRSRKWKRWERGAPMELWQMDVVGGF
PLADGTSAKALTGIDDHSRMCVCARLMARERTRAVCDGLRAALAAYGAPQ
QILTDNGKVFTGRFNHPPVEVLFDAICRQNGIDHLLTQPRSPTTTGKIER
FHRSLRAEFLSNTRAFSNLKTAQQALDEWVHYYNTARPHQSLNMTTPAER
FTATASPVSPGDDVPASIDRDGQDWVSRRVTTNGVVSVAWQQVCVGAHYA
GARCDVHVDGELLRFWIGDQLVKTAARTNHAEVRNKRAFRTREQA
>MAP3300c hypothetical protein
MSDRYSPRELACALGLFPPTEEQAAVIAAPPGPLVVIAGAGAGKTETMAA
RVVWLIANGYAHPGQVLGLTFTRKAAGQLLRRVRSRLARLAGVGLGAASP
NHGAVDPEAAPVVSTYHAFAGSLLRDYGLLLPVEPDTRLLSETELWQLAF
DVVNRYRGPLRTDKTPAAVTSMVLRLWGQLAEHLVDTSQLRETHLELERL
VHALPAGPYQRDRGPSQWLLRLSGTQSERAELVPLLEALDERMRAVKVMD
FGMQMASAARLVAAFPQVGADLRGRYRVVLLDEYQDTGHAQRIALSALFA
GGVDDGLALTAVGDPIQSIYGWRGASATNLPRFTTDFPRSDGTPAPVLEL
RTSWRNPPRTLRVANAISAEARRRSVAVHALRPRPDAPPGTVRCALLPDV
VAEREWIADHIDAHYRRARADGVSPPTAAVLVRRNADAAPIADALRARGI
PVEVVGLAGLLSVPEVAELVAMLRLVADPTAGAAAMRVLTGPRWRLGARD
LAALWQRARALGGAGTAGGPATAEAIASAADPRNVEADAVCLADAIADPG
PAEGYSAAGYARIAALAAELSALRAHLDHPLGDLVAVVRRVMDLDCEVRA
AAAAGWAGTEHLDAFADVVAGYAERADTGGTDASASASVAGLLAFLDVAE
SVENGLPAAPLAVARDRVQVLTVHSAKGLEWQLVAVAHLSGGVFPSTTAK
TTWLTDAAELPPLLRGDRARPGALGIPVLDTSDVTNRKQLSDKISEHRRQ
LEQRRVDEERRLLYVAVTRAEDTLLVSGHHWAATGIKPRGPSDFLCEIKD
VIDASAAAGDPCGTVEQWAPAPADGERNPLRDNAVEAVWPADPLASRRGE
VERGAALVRQAMAAEPGDPGADVEGWAADVDALLAERARVTGPPPQALPG
QLSVSSLVGLARDPAGAARRLRHRLPSRPEPHALLGNAFHAWVQKFYGAE
CLFELGDLPGAADSDVGDTAELAELQAAFLESPWAARTPVAVEVPFEMPI
GDTLVRGRIDAVFAESDGGATVVDWKTGAPPDSPEAMRQAAVQLAVYRLA
WAALAKVPESSVRTAFHYVRARTTVVPEALPTSDELAGLLAPTAPV
>MAP1033 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2284c hypothetical protein
MPEGHTLHRLARLHQRRYAGAPVAVSSPQGRFAEAAAVVDGRVLRRTSAW
GKHLFHHYAGGPIVHVHLGLYGSFSEWERPGDGPLPDPVGQVRMRMVGAG
HGTDLRGPTVCEVIDEGQVSDVLARLGPDPLRDDADPSWAWQRIAKSRRP
IGALLMDQTVMAGVGNVYRSELLFRHGIDPYRAGRDVGEAEFDAAWTDLV
ALMKVGLRRGKIIVVRPEHDRGAPSYRPDRPRTYVYRRAGEACRVCGEPV
RTAVLEGRNVFWCPTCQK
>MAP2034c hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2965c hypothetical protein
MRAALGITGRCGRPRPAARPRPSAAPGRVAAMTTETTMTRIQLGAMGEAL
AVDHLTRMGLRVLHRNWRCRYGELDIIACDKCAELHWMQHFGTVAGSA
>MAP3377 hypothetical protein
MRDSVLALLTQWDPPDAAQDSLRHAVLAFVHARPDACRRECEPGHVTAST
LVLDHTGDRVLLTLHRRLGRWVQLGGHCDDDAGIVAAALREATEESGIDG
LRMAPGLAAVHVHPVTCSLGLPTRHLDLQFVAHAPAGARIAISDESEDLR
WWPVDGLPAGTDHALAYLVAQATRASR
>MAP1078 hypothetical protein
MVLTQHRVPDRPGDPDQDPGRGRRLGIDVGSVRIGVACSDPDAVLATPVE
TVRRDRSGKHLRRLAALVTELGAVEVVVGLPRTLADRTGTSALDAIDLAD
QLARRIAPTPVRLADERLTTVAAQRSLRAAGVRAKEQRAVIDQAAAVAIL
QSWLDQRRAATREAGDG
>MAP2985 hypothetical protein
MAASRSNDWGPVSVPVSSLDPRAGNDHSDRSGALRGWQRRALVKYLAGQP
RDFLAVATPGSGKTTFALRVAAELLGQRAVEQVTVVVPTEHLKVQWAQAA
ARHGLALDPRFSNSNPRIAPEYHGVMVTYAQVAAHPTLHRVRTEQRRTLV
IFDEIHHGGDAKTWGDAIREAFGDATRRLALTGTPFRSDDSPIPFVRYEA
GPDGVRRSQANHTYGYPEALADGVVRPVVFLAYSGEARWRDSAGEEHAAR
LGEPLSAEQTARAWRTALDPAGEWMPAVIAAADQRLRQLRAHIPDAGGMI
IASDRVAARAYATLLTKITSETPTVVLSDDPGSSARISEFAASTSRWLVA
VRMVSEGVDVPRLSVGIYATSASTPLFFAQAVGRFVRSRRPGETASIFLP
SVPNLLQLASELEAQRNHVLGEPHRVSEGDPLDGDPATRTQNEKSELDNG
FTSLGADAELDQVIFDGSSFGTAAPAGSEEEADYLGIPGLLDAEQMRALL
HQRQDEQLQRRAGQPSAGDAPPATVHGQLRELRRELNTLVSIAHHRTGKP
HGWIHNELRRRCGGPPIAAASREQLRARIDAVRRLNAEHS
>MAP2444c hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP3816 hypothetical protein
MSPQQSFRSLRYTYASLCLAARIRPIDIAELMGHRDVKTTLTVYAHLINT
DDHTGNMAALGSLAAPLQKPNYGNVIPLHG
>MAP0880 hypothetical protein
MNAPRAWPAEPTATPRVKLTNADKVLYPATGTTKADVFDYYTRIADVMIP
HIAGRPVTRKRWPNGVDQESFFEKQLASSAPDWLPRASVTHRSGTTTYPI
IDSATGLAWIAQQAALEVHVPQWRFVAEWTRSRAEELKPGPATRLVFDLD
PGEGVTMAQLAEVARAVRDLIDGIGLQTFPLTSGSKGLHLYTPLTEPVSS
KGATVLAKRVAQQLEKTMPKLVTSTMTKSLRAGKIFVDWSQNNGSKTTIA
PYSLRGRDHPTVAAPRTWDELDDPALRHLRYDEVLTRVARDGDLLAELDD
NQTPVPDRLTKYRSMRDASKTPEPVPDAKPAAGQGNTFVIQEHHARRLHY
DFRLERDGVLVSWAVPKNLPDTPAVNHLAVHTEDHPLEYGGFEGVIPKGE
YGAGRVVIWDSGTYDAEKFQDDEVIVNLHGRKISGRYALIQTQGDQWLAH
RMKDQNVFEFDTIAPMLATHGSVSALKASQWAFEGKWDGYRLLVEADHGT
LRVRSRRGREVTGEYRELRSLAKALEEHHAVLDGEAVVLDKRGVPNFHEM
QNRGKSARVEFWAFDLLYLDGRSLLRARYRDRRKLLEMLASGGALTVPEL
LPGDGDQALRQSAERGWEGVIAKRRDSTYQPGRRSSSWIKDKHWNTQEVV
IGGWKAGEGGRSSGIGSLLMGIPGPGGLHFAGRVGTGFTERDLANLKKTL
APLRTDQSPFDAPLPRSEAKGVTFVEPVLVGEVRYSEWTPDDRLRQSSWR
GLRPDKEASEVVRE
>MAP4334 hypothetical protein
MSDGEQGKPRRRRGRRRGRGAATSSEKQTNGQLTGDSTATKPRRSRAARR
APDRLRTVHETSAGGLVIDGLDGPRESQVAALIGRIDRRGRMLWSLPKGH
IELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKTVHHY
LMRFSGGELSDEDLEVAEVAWVPMRELPSRLAYADERRLARVADELIDKL
QSDGPAALPPLPPSSPRRRPQTHSRTRHSETRHSDKPATGRKNGHGPGP
>MAP2508 hypothetical protein
MSTPLLRGFPPVVDERARTLILGSFPSAQSLLTGQYYANPRNAFWSITGE
LFGFDAAAPYPRRLAQLRRHRIALWDVLHACRRAGSADSAIEPNSLVVNG
FGEFFAEHPGITRVYFNGAKAAELYRRLAMAPDHVCFQRLPSTSPAHVMA
PGAKLAAWAVLRNSA
>MAP0967 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP1328c hypothetical protein
MPELPDVEGFRRQLADALPGRRVRRVKVHDPGILRNTTATTLARRLTGRR
FAGPRRHGKWLVLPTDGPTLLIHSGMTGRPYYCADGAAEDRHQRLVVSLD
QGELRYTDLRKLRGVWLADDPDDLVPITGRQGPDALGLGLRDFRDALTAR
SARRRQLKSALMDQSVLAGLGNLLVDEICWRARIRPTRAVADLDDDEVKA
LHRAMTQVLRTAVRHGRVPGLPRWLTGARDAPDPHCPRCGGRLDHARVGG
RTTLWCPRCQPG
>MAP0253 hypothetical protein
MRELSVAEQRYQAVMAVISDGLSVSQAAEKFGVARQTLHRWLARYEAAGL
EGLVDRSHRPVSCPHQMPAVVEAEMLELRRSRPYWGPRRLVFELAKRGVH
PVPSESAVYRALVRAGLIDPAMRDRRSRKWKRWERGAPMELWQLDIVGGF
PLADGTSAKALTGIDDHSRMCVCAKLMARERTRAVCDGLRAALAAYGVPE
QILTDNGKVFTERFCHPPVEVLFDAICREHGIEHLLTQPRSPTTTGKIEQ
FHRSLRAEFLSGREPFTNLKVAQQALDEWVEDYNTTRPHQALKMITPAQR
FHAGAPASPPSNSCARHVDRSGDDWVSRRVCSNGIVCVSWQQVCIGRHYA
GARCDVHVDGDLLRFWVGDNLVKTAARTSRGEVRNKQALRTNAPA
>MAP0178c hypothetical protein
MFVRCDASILHADLDSFYASVEQRDDPTLRGRPVIVGGGVVLAASYEAKA
YGVRTAMGGAQARRLCPHAVVVPPRMHAYSRASDAVFRVFRECTPLVEPL
SVDEAFLDVGGLRRVSGTPVAIAQRLRADVRDRVGLPITVGIARTKFLAK
VASQQAKPDGLLLVPPDQELAFLRPLPVRRLWGVGAVTAEKLRAHGIATV
ADVAELSESTLGSMVGAAMGRQLYALSRNIDRRRVSTGVRRRSVGAQRAL
GRAGNTMSDSEIDAVVVNLIDRVTGRMRAAGRTGRTVVLRLRFDDFTRAT
RSHTLPWATSSTAPILAAARRLVAGAAPMIARRGLTLVGFAVSGIDRDGA
QQLMLPFQGRPPDAIDAAIDRVRRRYGKAALTPAVLLGRDPGLEMPHLPD
>MAP2439c hypothetical protein
MSRVRLVIAQCTVDYVGRLTAHLPSARRLLLFKADGSVSVHADDRAYKPL
NWMSPPCWLREEAGDAAPVWVVENKAGEQLRITVEDIEHDSSHDLGVDPG
LVKDGVEAHLQALLAEHVQLLGEGYTLVRREYMTAIGPVDLLCRDERGGS
VAVEIKRRGEIDGVEQLTRYLELLNRDSVLAPVRGVFAAQQIKPQARTLA
TDRGIRCVTLDYDKMRGMDSDEYRLF
>MAP0402 hypothetical protein
MSAGGSPLRNGDPAPARGTVPLTPDACPSWLRPLVDNVDHVPDAARRRLP
ADVLAMITTKAASALTSVRGAAREAAVLVLFSGPESGPPGGGPPTDADLL
VTVRASTLRHHAGQAAFPGGAADPTDDGPVATALREAREETGIDVSRLHP
LATMEKMFIAPSQFHVVPVLAYSPDPGPVAVVNEAETALVARVPLRAFIN
PANRLMVYRGDLGRRWAGPAFLLNEMLVWGFTGQVISALLDVAGWAQPWD
TSDQRELDAAMALVGERGDALE
>MAP0105c hypothetical protein
MCDGRVDRDRARHSCLGEEGRGRTGAQQTGQDAGTWWPPPRPGQDRRRGF
LGRRMRGNRSEFVTVIVTAVGAIEPHLSHDDVRTAIEGMGLSAAQLQRLS
RTLRRDGSVLTGPGGSDCAADIEQLILCLRQLGAMRVRAPRCAQCGRNDS
ETYSRKLKKRICRACSMQGWQPAVGECPGCGAVDKLIYRPRHGDGLLCRR
CKPEPDVDHAAKVRDGIAQLRTGLSATEIDRVASVFGTAVAQRELNWILQ
DTPGVFRGEIAHRSAVSVRLAELLVAAGADNVRLPQCPLCLRTVKLGSQI
DGLRCCHTCWGHHFSRGTCARCGCQRHLINYHGAGERLCHRCFEHDPVNH
EPCTRCGRVDFINHHDGQAKLCRRCYPAPTAVCSSCGRTRPCTRTRTGKP
ICGTCSAKQRPPQPCSVCGNIRSVHTRTDAGEPVCNPCARSREPCARCGK
TLGVSARLAGVGPLCSACLQREPAYFTDCVQCGAHGRTYHRGLCPACACP
GELRELFAKNGELSGAASRIVEALLQCDAMPVLRWVRRMRSNSELPAQLA
ELGDTLSHHDLDDLPASKSVEWLRNILVTAEGLPDRDPYLHRTEQYIAAR
LATISNRDDRAAVRAFTEWNHLRKLRARADKGPLKRNHGLAAQIMAAAIT
DFVSELNAHGLALASCQQAFVDDWLVRNPTRRQIHQFLAWAVHRGYAHDV
AAPVPQTRRTRHTLPGDDERWRLIQYLIEHPDLETRDRVAGLLVLLYSQP
AARLVTLKVADVTITDDAVQLTLGAVPLTVPSPVDRLLADLVQQRRGYAA
VTVGTNPWLFPGGRSGGHLSANQVGLRLKRIGISPRIARNTALIDLAGEL
PAVVLAKLLGFSIKRAVTWSEEAGNTRPRYAAEVARRNS
>MAP1825 hypothetical protein
MRLPLVLLDGASMWFRSYFGVPSSITAPDGRPVNAVRGFLDSLAVVITQQ
RPSRLVVCLDLDWRPQFRVDLVPSYKAHRVPEAEPAGEPDVEEVPDDLTP
QIDMIAELLEAYGIPTAGAEGFEADDVLGTLAARERDDPVVVVSGDRDLL
QVVSDDPVPVRVLYLGRGLSKATLFGPVEVAEHYGVPVDRAGPAYAELAL
LRGDPSDGLPGVPGVGEKTAATLLAQHGSLERVLAAAHDPKSKMAKGLRA
KLLGALDYIEAAGAVVRVATDAPVKLSTPSDAVPLVAADPHRTAELATEL
GVGSSIARLQKALDALPG
>MAP0338c hypothetical protein
MSWGFGDDCWVQGRSDDQRELLDAESVAGHLLKADSMFAFLAAHRSQLFP
EEMFADLFPSQRGRPSVPAEVMASVITLQALHGFSDNETVDAVTFDLRWK
AACGLPITAGAFHSTTLTYWRRRLAASDRPNRIFEAVKTVVAETGVLAGK
TRRALDSTVLDDAVATQDTVTQLIAAIRRVRREVPGAAAVIEAHCSAHDY
DDPGKPAIAWEDKAARDRLVDGLVGDAHRVLGYLPDQELAPRAAEAVALL
ALIAGQDVEPVEGSDGTDGHWRIAQQVSGDRVISTVDADTRHAHKTVHRR
QDGFKAHLAVEPDTGIITDCALTKASGADNHEAVVGLSLLEGEHTPGAGP
GRFGVWHRGRPGGTGRRRPCRGDQTATATLAGSGWLHQRRFPHRFRRPHR
DLPGWTRNADPSQWRSHLRKILSLMPVGVPVHDRHARTQTHPSYPRAIIA
RCSRRGPRPRLASRIPPTPAHGGTLNGLAHSRQPQGPLPRNRQKQPLAAP
PRRSTEPTSAAHHGPDPHRHHLGHRLTRPHTKALPQPPTAATLNNGGAIS
FLSTHRV
>MAP1329c hypothetical protein
MAERDRHGTDLAGTRRGAGAGAVTDFAALPDSVRAALRDEPVPQWCSPTL
ATLTEKRFSDPHWIFERKFDGMRCLAFRDGDRVRLLSRNRKPLNGTYPEL
VESLAAQRETRFVLDGEVVAFEGRRTSFARLQGRLGITDPDVARASSVRI
YYYVFDLLHLDGKSTVAVPLIWRKRLLRKAIDFTDPLRYAPHRVGDGMAA
YRAACGRGDEGVIAKRAESAYDGRRSENWLKFKCVRDQEFVVGGYTSPKG
KRLELGALLLGYYQGDQLRYAGKVGTGFDEATLHRLHQLLSGIACDTTPF
TRSLVPEAGAHWVRPELVVQIGFTEWTRDGKLRHPRYLGIRTDKDPTEVI
RETR
>MAP0799c hypothetical protein
MSDGPLIVQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITP
LALWNARAAGHDAEQVVDALVSFSRYAVPQPLLVDIVDTMARYGRLQLVK
NPAHGLTLVSLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQ
MLLKIGWPAEDLAGYVDGEAHPISLTEDGWQLRDYQQMAADSFWSGGSGV
VVLPCGAGKTLVGAAAMAKAGATTLILVTNIVAARQWKRELVARTSLTED
EIGEYSGERKEIRPVTISTYQMITRRTRGEYRHLELFDSRDWGLIIYDEV
HLLPAPVFRMTADLQSKRRLGLTATLVREDGREGDVFSLIGPKRYDAPWK
DIEAQGWIAPAECVEVRVTMTDNERMMYATAEPEERYRLCSTVHTKIAVV
KSILDKHPGEQTLVIGAYLDQLDELGEQLGAPVIQGSTRTKEREELFDAF
RRGEVNTLVVSKVANFSIDLPEAAVAVQVSGTFGSRQEEAQRLGRLLRPK
SDGGGAVFYSVVARDSLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI
>MAP0866 hypothetical protein
MIVVAGYCGLRWGELAALRWADVDLANKTLRVARAYSEEAPRGEMSPVKD
HQARTVPIPAIVSEELATFRTDQKPNDLVFPSANGTPLRNRNFRRDVFDD
AAEDLGLNITPHNLRDTAASLAIQAGASVVAVARLLGHESAATTLNHYAA
FFPTDLDDVASRLNAAARLTIAVQRARQDREARIEGELGQQPGYIEQLEY
DAAHPENAVEVDGEQASARDETEYLLSSPENARRLLEALGRDKAIHPAPS
DTTEAPTTHRPERSDEQ
>MAP2504 hypothetical protein
MSDERSSKVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLLNES
FSSDPVFRERMKREARTAGRLQEPHVVPVHDYGEIDGQMFLEMRLVEGTD
LDSVLKRFGPLPPPRAVAIITQIASALDAAHAAGVMHRDVKPQNILVTRD
DFAYLVDFGIASATTDEKLTQLGTAVGTWKYMAPERFSDAEVTYRADIYA
LACVLFECLTGSAPYRADSAGVLVSAHVMDPIPAPSARRPGVPKAFDAVI
ARGMAKKPEDRYASAGDLALAAKEALSTPDQDRAATILRRSQEAALPPRG
SATPGPARCWRHRLPSTPGSPVPLRHKLAGAGRRRAARSQPPGSPGRRTT
RAATPIGPLRQHNSPGRRRGAGRRPNGGCGRSSPRSSPCSSSSRAGWASG
W
>MAP4093c hypothetical protein
MAYLLDGTEDRLAARRQRLRDALAGFDAATIATTHQFCQLVLRSLGVAGD
TAAGVTLLDSLDELVAEIVDDLYLAHFGQDHDDPVLHYREALRLARDVVN
NPSAQLRPLHPEPGSRAAVSLRFANDVLAELDTRKRRLGVLGYDDLLTRL
ADALATEHSPARLRMHQRWPIVMVDEFQDTDPVQWQVIDRAFSGRSTLIL
IGDPKQAIYAFRGGDIVTYLRAAETAGQKKTLDTNWRSDSALLQRLQVVL
RGAQLGDPAIAVNDVAAHHRGHRLSGAPRNDPFRLRVVSRNTLGRRGIAN
LPIDELRQHIGADLAADIRALLAAGATFDGKPLRAGDIAVIVERHRDAQA
CFTALCDAGVPAVYTGNSDVFTSQAAED
>MAP2493c hypothetical protein
MKLHRLTLTNYRGIAHREIEFPDHGVVVVCGANEIGKSSMIEALDLLLEA
KDRSTKKEVKQVKPTHSDVGSEVSAEISCGPYRFVYRKRFHKKCETELTV
ITPYREQLTGDEAHERVRAMLAETVDSDLWHAQRVLQAASTAAVDLSGCD
ALSRALDVAAGDSGGLCGTEPLLIERIDAEYGRYFTPTGRPTGEWAAAIA
RLADAEVAVAECAAAVAEVDERVSRHAALTEQVADLSQRRIAAGPRVTAA
RQAADRIAELTAQRREAQLVAEAAAATKAAAAQAHSGRLQLLADIDTRAA
VVTQTHAQVQQAAADHAAAQADAQACDTALQHATEALAELERRVQEAQRT
VERLAGREEADRLAARLDKIDAIQRDRERISAELSTIAVTEELLRRIEDA
AAAVDRIGDQLASTSAAVEFTAVADIHLVIGDQQVSLPAGQSWSITAAGP
TAVEVPGVLTARVIAGATTLDVQAKHAAAQQELAAALTAGAVADLAAARS
ADQRRRELQGTRDQLGATLAGLCGDEPLDELRSRLARLRAEHPEQPAEPT
GGAADLAAARTELEALAQARACASAECDTRRQAATAAAARLAETATAATV
LQNRLDTLRAELDAATGRLAAERASVGDEELAASADAALRAEQAAERRVA
ELADALAAASPDAVAAELADATQESESLRERYEEAAGALREVTIELSVFG
SEGRQGKLDAAETEREHAASEHSQVGRRARAAQLLRSVMTRHRDTTRQRY
VEPYRAELQRLGRPVFGPTFEVDIDSDLCIRSRTLNGITVPYESLSGGAK
EQLGILARLAGAALVAKEDAVPVVVDDALGFTDPDRLAKMGQLFDSVGSH
GQVIVLTCSPDRYDGVKGAHRIDLSA
>MAP2140c hypothetical protein
MSSLQERLASVLREVLPSQEESDGALTVHHEGTIASLRVVNIAEDLDLVS
LTQILAWDLPLTKKVSDHVARQARDANFGSVSLVEKVNKTAVQRNSGKNT
AKLADVMLRYNFPGAGLTDDALRTLVLLVLDTGARIRHTLTD
>MAP2151 hypothetical protein
MATRHGFVRVQMRDGRREKVSQAHLIAVAAHGWPEYWTCEKCRETVPYIL
GRLDELGDDSPDNLKWVPDAEAMLWHVIGCCLENLMAGKPHEEEPNREWV
DSFVYDIEAAEFSGETYMCHPTSPKTEKLIQWGLKKEYEAEHQVT
>MAP1581c hypothetical protein
MPAHAGGTRTGMSNRDQRVRRLLDVAGQTYAAQARIKLSDKPMPLFQLLV
LCMLASKPIDAAIAVGAAAELFKAGLRTPKAVLDADRQTMIDAFGRAHYV
RYDESSATRLTDMAERVRDDYSGDLRELAARSEHDTASAKRMLKKFKGIG
DTGADIFLREVQDVWTWVRPYFDDRATGAAKKLGLPAEPDKLGSLAPQAN
ARLAAALVRASLDDDVRRRVSG
>MAP0034 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP4092c hypothetical protein
MVRAAAATMFFGETAETLAAGGDALTDRIAQTLREWAGHSRERGVAAIFE
AAQLLGMSNRVLGWQGGARHMTDLAHLTQLLGEVAHREHHNLPALRDWLR
RQRDERSGAPERNRRLDSDAAAVQIMTVFVSKGLQYPIVYLPFAFHRNTT
DSDVVLFHDEDGTRCLHVGGKDSPDFKTVERLGRKEDASDDSRLTYVALT
RAQAQVVAWWSPAYHEPNGGLSRLLRGRRPGEPVVPDRCEPAKISDADAL
ARLREWEAAGGPVIEESVPAPAPPLSGVPVPARLENRHFHRGIDTGWRRT
SYSGLIRQAQPSGVGSEPEAAGLDDEVGDILLAPATSVGDVASPMADLPA
GATFGTLVHAVLQTADPFAPDLAAELTARIREHSVWWPVAATPEDMAAAM
VPMHDTPLGPLAGGLTLRRIGLSDRLCELDFELPLAGGDRRSAPPEVRLA
DLSPLLRAHLPADDVLACYADRLTDRSLGAEPLRGYLTGSIDAVLRIPDG
AGHRYLVVDYKTNRLGDAQHPLTAADYDRPRMVEAMLHSDYPLQALLYSA
VLHRFLRWRLAGYEPDRHLGGILYLFVRGMCGPDTPVRDGHPSGVFDWRP
PSSLITALSDLLDAQGVAV
>MAP3376 hypothetical protein
MTTFAGKAAASADKVRGGYYTPAPVARFLAHWVRRAGPRIVEPSCGDGAI
LRELARLSDRVHGVELIPHEAAKARRFAPVSAESLFSWLATAEDGGWDGV
AGNPPYIRFGNWASPQRDPALALLRREGLRPTRLTNAWVPFVVAGTVLVR
DGGRVGLVLPAELLQVGYAAQLREFLLSRFRDITLLTFERLVFDGVLQEV
VLFCGVVGTGPARIRTVTLAGADALADIDVERLASAPALLHEHEKWTKYF
LDPAAIELLRALKASATLTRLGALAEVDVGIVTGRNAFFTFTDEQVTALG
LRRHCVPLVSRSAQLSGLIYDTDCRASDLAARQRGWLLNAPREPADPALT
AHIRAGEAAGVHRGYKCSIRTPWWSTPSLWQPDLFLLRQIHAAPRLTVNA
AGPTSTDTVHRVRVGPGVDPAALAAVFHNSVTFAFAEIMGRSYGGGVLEL
EPREAEQLPLPRPECADADLVGDVDLLLKAGELDKALDVVDRRVLIDALG
VPPRAVADCRAAWACLRDRRKRRASR
>MAP1930 hypothetical protein
MSVVLPTVTTVASDGATQLSFADVGPAFGPDEVALRDTTFVVVDLETTGG
RSTSTEETAPDAITEIGAVKVRGGVVLGEFATLVDPQRSIPPQIMRLTGI
TTAMVSDAPAIDAVLPMFLEFAGLDRGAVLVAHNAGFDVGFLRAAAQQCD
IAWPRPRVLCTVRLARRVLSREEAPSVRLASLARLFAVTTQPTHRALDDA
RATVDVLHALIERVGNQGVHTYADLRGYLPDVTPTQRRKRVLAEGLPRRP
GVYLFRGPSGEVLYIGTAVDLRRRVAQYFTGADPRGRMKEMVALAGAVDH
VECAHALEAGVRELRLLAAHAPPYNRRSRFPQRWWWVALTDEAYPRLAVV
RAPRHDRAVGPFRARADAADTAELLARFAGLRTCTNRLGRAALHGPLCPE
AEVAPCPAARGVTATQYAAAVARVAALIDGADNSALAAAVDQVAALAERR
HFESAARLRDRTATAVETLWRGQRLRALASLPELVAAAPDGQGGYQLAVV
RYGRLAAAGNARRGVPPMPVVDAITAAAQAIWPEPGPLGGALVEETALIA
RWLEAPGVRIVRVKGTPDAAGWASPLRSAGAWAAWAAAARSARLASEQAL
RASELLTEPHPTREQLFGRAGVDGGAGAGQPVLPGRQPFGAAG
>MAP1048c hypothetical protein
MSHRNAPLSETGRLRLARCVVDEGWSLRRAAERFQVSVTTAERWARRYRE
LGEAGMADRSSRPHHSPNRTPTRTERRIIKVRVIRRWGPARIGYLLGIHP
STVHRVLTRYALAKLRWLDRSTGRIIRRMEPAGCGDLVHVDVKKLGKIPA
GGGWRMLGRAIGGHNSNADKSSGVFSKHRNPIRGYHYLHTAIDGYSRLAY
SEVLDDEIKETAAEFWTRANAWFAECGISVRKVLTDNGSCYRSRVFAQAL
GDIEHRRTRPYRPQTNGKVERFHRTLADEWAYARLYRSDTERCEEFTTWL
HTYNHHRGHTALGGQPPASRVPNLSGQYS
>MAP2752 hypothetical protein
MSAALRLVTDTGTGTSDGELPPLQRYEIWMKGRGLSARTITDSMLTLRRL
ERVTRRPAHAVAALAISRFLADEALGPRSRYTYHVQLAGFFRWLANEDGA
PNIMAQIPRPRLPRSVPRPITTGQLQALLAVRMHKRTRVMILLAAFAGLR
AHEIAKVRGQDVDPDARTLHVVGKGGHAATIPLHPVLVEAAETMPRHGWW
FPANSRRPGQHVLSRGVVDAIGDAMRRAGIPGGTAHRLRHWYGTTLVASG
TDLRTAQTLLRHSNLASTAIYTEVYDDRRIEAIDRLTIPLPGEAERAQDD
RLRRQVDRCKQSIIRLLGGLEPGEHMSRTKLSQALRSDVRPHINEAIDEL
TAGGMLVTVIAGHGRHYRLDSGWTDRPEFHG
>MAP1824c hypothetical protein
MQQPLAEGLLMRSSRYAVVGRRFWELVRAGMSADDAGVAVGVSMAAGRLW
FADAGGVRPRFVDQSIPRRRPRLTVEEREEIQDGVARSESIRVIARRLGR
HPSTVMREIERNAICRGRYRARYRFGVRWRGGHDPRPRYRASLAHTRAHV
KARRSRPGKLATNQLLHDEVQTRLNEQHSPQQIAWRLRRDFPDDAEMWVS
QETIYQAIYVQGKGNLRRELHTCLRTGRALRKPRRRPGERRGHLRDMVNI
SERPPEVADRAVPGHWEGDLILGSTASGSAIGTVVERTTRFVMLLHLPDG
HGAEAVQEAIVAKMAGLPATLRQTLTWDQGKEMANHAAIAAATELDIYFC
DPHSPWQRGTNENTNGLLRQYFAKGTDLSAFPADYLDYVASKLNRRPRQT
LDWKTPAEALDELLCKPFTPPAVA
>MAP2296c hypothetical protein
MFETPLTVVGHIVNNPERRQVGAQEVIKFRVASNSRRRTADGGWEPGNSL
FINVNCWGRLVTGVGAALGKGAPVIVVGHVYTSEYEDRDGNRRSSLEMRA
TSVGPDVSRAIVRIEKPGYTGPSTDEATPAAAEAADAAGEDADRGDTVEP
VDTGPVPLSA
>MAP0340c hypothetical protein
MAAAEEVDVDGIAVRLTNPDKVYFPKLGSKGTKRHLIDYYRAVAGGPMLD
ALRDRPTHLQRFPDGIDGEEIYQKRIPQHHPDYLQTCRVTFPSGRTADAL
KVTHPAAIVWAAQMGTVTLHPWQVRCPDTDHPDELRIDLDPQPGVAFAQA
RSVAVDVLRPLLDELGLVGYPKTSGGRGIHVFLRIATDWDFVEVRRAGIA
LAREVERRAPDAVTTSWWKEERGARIFIDFNQNARDRTMASAYSVRRTPI
ATVSMPLTWDQLAGADPDDYTMATVPDLVSARENPWAAIDDVAQSIGPLL
RMAEADEERGLGDMPYPPNYPKMPGEPKRVQPSRDTEHKKKN
>MAP0427 hypothetical protein
MSGVFTRLVGQDAVKAELLGAARAARGDAGHSDAGAGTMTHAWLITGPPG
SGRSVAALCFAAALQCTADGEPGCGRCHACTTTLAGTHADVRRVVPEGLS
IGVDEMRAIVQIAARRPATGNWQIVVIEDADRLTEGAANALLKVVEEPPP
STVFLLCAPSVDPEDVAITLRSRCRHVALVTPSTEAIARVLVDSDGLTPE
TADWAASVSGGHVGRARRLATDPEARQRRERALGLVRDAATPSRAYAAAE
ELVAAAEAEAVVLTAERAEAETEELRTALGAGGTGKGTAGAMRGAAGAIK
DLERCQKSRQTRASRDALDRALIDLATYFRDALMASSGAGAVRANHPDMA
ERVAALAAHAPPERLLRCIEAVLECREALAVNVKPKFAVDAMVATIGQQL
G
>MAP1827c hypothetical protein
MRQALLQTSRLSSALRADEQTHRIAPSREPDDGFVAVIYRWARTADLAAA
LAAAEPAGTGSPLLAGDFVRWCRQVLDLLDQVRNAAPDPELRATAKRAIN
DIRRGVVAVDAG
>MAP1054 hypothetical protein
MRWASQAVAVNGKPVEDGALPGLQRLGLVRSVRAPQFEGITFHEVLCKSA
LNKVPNASALPFRYTVNGYRGCSHACRYCFARPSHEYLELNCGNDFDTQV
VVKTNVAQVLRRELRRPSWRRETVALGTNTDPYQRAEGRYALMPGIIGAL
AESGTPLSILTKGTLLRRDLPLIANAAAQVPVSVAVSLAVGDADLHRDVE
PGTPTPQARLGLISAIRAAGLDCHVMVAPVLPYLTDSVEHLDDLLGQIAA
AGAGSVTVFGLHLRGSTRGWFMEWLARSHPELVGRYRELYRRGAYLPPSY
RDMLRHRAAPLIAKYRLGGDHRPFSRAVAAAPAPEPAQPTLF
>MAP2142c hypothetical protein
MRLYRDRAVVLRQHKLGEADRIVTLLTRDHGLVRAVAKGVRRTRSKFGAR
LEPFAHIDAQLHPGRNLDIVTQVVSIDAFATDIVSDYGRYTCACAMLETA
ERLAGEERAPAPALHGLTVSALRAVADGRRSRDLLLDAYLLRAMGIAGWA
PALTECARCATPGPHRAFHVGAGGSVCPHCRPAGSTTPPPGVLDLMSALH
DGDWEFAEQTPQSHRNYVSGLVAAHLQWHLERQLKTLPLVERTYHIDRTI
ADQRATLIGQDMDCG
>MAP1722 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2768c hypothetical protein
MARDADAKRDKRAFGNIRKLPSGRFQVRYTGPDGSYITAPKTFAAKIDAE
AWLTDRRREIDRGLWDVASAKQPERVTFGAYAAGWLAGRQVAGRPIKART
REHYQAILDDHLLPAFGSRQLTSITPKDVREWYSGTLTNRPTMRSHAYSL
LRTIMGSAVNDELIDANPCRIVGAGRAKRVHKIRPATVEELPILTEAMPE
RLRLMVALASWCALRFGETIELQRGDIDLADEVIRIRRAAVRTHGGTFEV
TTPKSDAGIRDVAIPPHLVSAIEAHLAKYVGKKRDSLLFPNERGRHLQPS
TLNRHWGTKRGPQPAATICVGTTYGTPAQSWPPPPAPASPNSWPG
>MAP2108 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2628c hypothetical protein
MPKLQLVQEPAADALLDENPFALLVGMLLDQQVPIETAFAGPKKIADRMG
GLDAATIADYDPDKFAALCSERPAIHRFPGSMAKRIQALAQLLVDRYGGD
AAALWTAGEPDGKELLRRLKGLPGFGEQKARIFLALLGKQYGVTPPGWRE
AAGEFGKAGTYLSVADIVDARSLGQVRSYKKQMKAAAKAAK
>MAP3301c hypothetical protein
MHAAMSFTWDAQAGAVLAPGVRGTVRVLGGPGTGKSSLLVDAAVAQIEAG
VNPESVLLLTGSGRLPMAERSALTTALLRSAGAGPAVREPLVRTVHGYAY
AVLRRAAERAGEAPPRLVTSAEQDAIIRELLAGDLADGPRAATAWPAALR
PALSTAGFATELRNLLARCAERGVDPQALERLGRRCRRPEWVAAGQFARQ
YEQVMLLRAAVGTAAPEATTPALGAAELVGAALEAFAVDAELLAAERGRI
RVLLVDDAQQLDPQAARLVRVLAAGADLALIAGDPNQAVFGFRGGDPGSL
LDGAAPAVTLTRSHRCAPAVARAVSGVAGRLPGSSAGRRIEGAGPGEGSV
AVRLAASAHAEAAAIADALRRAHLVDGVPWSQMAVIVRSVPRAGARLPRA
LAAAGVPVTAPAASGPLAEQPAVRALLTVLLATADGLDGQRALALLTGPI
GRVDPVSLRQLRRNLQRANAGRPPGDFAELLVEALTGTAPPPGAPFRALR
RVRAVLDAAGRCHRDGQDPRYILWAAWHRSGLQRRWLSVSERGGPAAAQA
GRDLDSVTALFDITDDYVSRTSGASLRGLVEHVAALQLPGAEPVATAEQV
SVLSAHAALGREWDFVVIAGLQDGLWPNTVPRGGVLGTQRLLDVLDGVSA
DASVRAPLLAEERRLLVAAMGRARQRLLVTAVDSDTDGSDREAALPSPFC
YEIAQWAGEDAEPAALQPVSAPRVLSAAALVGRLRGVVCAPDGAVDELDR
RCAATQLARLAKAGVPGADPASWHGLIPVSTAEPLRGGGDVVTLTPSTMQ
TLTDCPLRWLAERHGGTDPRDLRSAIGSVVHALIAQPHRSPAELVAELDR
VWRHLPFAAQWHSDNELARHRAMLEAFAQWRANTRGALTEVGVEVEIDGT
LSTGDGREVRLRGRVDRLERDAAGRLVIVDVKTGKTPVSKDDAQQHAQLA
LYQLAVAHGLLGAAGGDAEPGGARLVYVGKAAASGVVEREQDPLTAAAAD
QWREALRRAADATAGPQFIARRNDGCTHCPLRPCCPAHADGSGR
>MAP0850c hypothetical protein
MKELTVAEQRYQAVLAVISDGLSISLVAEKVGVSRQTLHAWLARYEASGL
EGLVDRSHRPVSCPHQMPAVVEAALLERRRSRPYWGPRRLVFELAKRRVG
PVPSESAVYRALVRAGMIDPALRDRRSRKWKRWERGAPMELWQMDVVGGF
PLADGTSAKALTGIDDHSRMCVCARLMARERTRAVCDGLRAALAAYGAPQ
QILTDNGKVFTGRFNHPPVEVLFDAICRQNGIDHLLTQPRSPTTTGKIER
FHRSLRAEFLSNTRAFSNLKTAQQALDEWVHYYNTARPHQSLNMTTPAER
FTATASPVSPGDDVPASIDRDGQDWVSRRVTTNGVVSVAWQQVCVGAHYA
GARCDVHVDGELLRFWIGDQLVKTAARTNHAEVRNKRAFRTREQA
>MAP2157 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2416c hypothetical protein
MSWGFGDDCWVQGRSDDQRELLDAESVAGHLLKADSMFAFLAAHRSQLFP
EEMFADLFPSQRGRPSVPAEVMASVITLQALHGFSDNETVDAVTFDLRWK
AACGLPITAGAFHSTTLTYWRRRLAASDRPNRIFEAVKTVVAETGVLAGK
TRRALDSTVLDDAVATQDTVTQLIAAIRRVRREVPGAAAVIEAHCSAHDY
DDPGKPAIAWEDKAARDRLVDGLVGDAHRVLGYLPDQELAPRAAEAVALL
ALIAGQDVEPVEGSDGTDGHWRIAQQVSGDRVISTVDADTRHAHKTVHRR
QDGFKAHLAVEPDTGIITDCALTKASGADNHEAVVGLSLLEGEHTPGAGP
GRFGVWHRGRPGGTGRRRPCRGDQTATATLAGSGWLHQRRFPHRFRRPHR
DLPGWTRNADPSQWRSHLRKILSLMPVGVPVHDRHARTQTHPSYPRAIIA
RCSRRGPRPRLASRIPPTPAHGGTLNGLAHSRQPQGPLPRNRQKQPLAAP
PRRSTEPTSAAHHGPDPHRHHLGHRLTRPHTKALPQPPTAATLNNGGAIS
FLSTHRV
>MAP3182 hypothetical protein
MGAPATTVWLIAGYSLALLVVGWGFDVMARRASAHAARWRTGRFTYRPDH
DAWICPQDHWLWPTSFDPKHRVMRYRALPVVCNSCPVKAECTTSEHGREI
SREVDPWPHSDAGRFHRGIACCVAGLGIVLPLATLVTNHSAADALVLTAT
VLLVALLGLPLARHLWNTPSNAPQHLPHRTAIEDQVAAAIDRYSTRWGGW
KSKEDNAT
>MAP2148 hypothetical protein
MSKSRDLALKRYAKWLVDEGELSSDPLLGLKPPKGDQKVVNALTEDQLKR
LIAACQGKSLMDRRDETIVRLMAETGLRANETLSLQITDVNLDAGIVTIV
RGKGGKGRVSPFSVQTATAIDRYLRARRAHRLSNTGALWLGGGGKSLGYY
GLSKALKQRATAAGIETFHLHMLRHTAATRWLRAGGSESGLMSVAGWKNR
SMIDRYVGAAAASLAADEARRLNLGDI
>MAP2302 hypothetical protein
MRELSVAEQRYQAVMAVISDGLSVSQAAEKFGVARQTLHRWLARYEAAGL
EGLVDRSHRPVSCPHQMLAVVEAAVLELRRSRPYWGPRRLVFELAKRGVH
PVPSESAVYRALVRAGLIDPAMRDRRSRKWKRWERGAPMELWQLDIVGGF
PLADGTSAKALTGIDDHSRMCVCAKLMARERTRAVCDGLRAALAAYGVPE
QILTDNGKVFTERFCHPPVEVLFDAICREHGIEHLLTQPRSPTTTGKIEQ
FHRSLRAEFLSGREPFTNLKVAQQALDEWVEDYNTTRPHQALKMITPAQR
FHAGAPASPPSNSCARHVDRSGDDWVSRRVCSNGIVCVSWQQVCIGRHYA
GARCDVHVDGDLLRFWVGDNLVKTAARTSRGEVRNKQALRTNAPA
>MAP3487c hypothetical protein
MVTPSGSRVLAIWCMDWPAVAAAAAAELPATAPVAVTLANRVIACSSAAR
AAGVRRGLRRREAAARCPQLHVSTADADRDARFFEAVIAAVDDLVPRAEV
LRPGLLVLPVRGAARYFGSEEAAAERLVDAVAVSSVAGAECQVGIADRLS
TAVLAARAGRIVEPGGDAKFLSVLSVRQLATEPSLSGPGREELTDLLWRM
GIRTIGQFAALARGDVASRFGADGVAAHRLARGEPERPPSGREPPAELEA
VLDCDPPIDRVDAAAFAGRSLAGTLHQALMAAGVGCTRLAIHAATGSGEE
RHRVWRCAEPLTEDATADRVRWQLDGWLSNRTARDRPTAPVTLLRLRAVE
VVSAEALQLPLWGGLGEEDRLRARRALVRVQGLLGPEAVQVPVRSGGRGP
AERITLIPLGDEPVPQADPDLPWPGRLPEPSPAVLLDDPVELLDAQGNPI
RVSSRGLFSADPARLVVHGRGERLCWWGGPWPVDERWWDDRGQGGGRTAR
AQVLLESERALLLCYRQRRWYLEGSYE
>MAP3759c hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRTNQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP3003c hypothetical protein
MTRIIGGVAGGRRLAVPPRGTRPTTDRVRESLFNILAARRELTGLAVLDL
YAGSGALGLEALSRGAATALFVESDPRAAAVIARNIDTLGLPGATLRRGA
VAAVLAGGAATAVDLVLADPPYDVGAAEIDAVLAALAAHGWVRAGSVAVV
ERPAGSAPLSWPPGWSGWPHRVYGDTRLELAERL
>MAP1407 hypothetical protein
MAEHVFETASSETLYTGKIFALRRDQVRMPGGKVVTREIVEHFGAVAVVA
MDDDGNIPMVYQYRHAFGRRLWELPAGLLDVHGEAAHLTAARELMEEAGL
KAETWAVLVDLNSTPGFSDESVRVYLATGLTRVDRPEAHDEEADMTLEWY
PLADAARKVLSGEIVNAIAVAGILAAHAVTTGFERPRPVDSPWQDRPTAF
PARKAGR
>MAP3357c hypothetical protein
MSWGFGDDCWVQGRSDDQRELLDAESVAGHLLKADSMFAFLAAHRSQLFP
EEMFADLFPSQRGRPSVPAEVMASVITLQALHGFSDNETVDAVTFDLRWK
AACGLPITAGAFHSTTLTYWRRRLAASDRPNRIFEAVKTVVAETGVLAGK
TRRALDSTVLDDAVATQDTVTQLIAAIRRVRREVPGAAAVIEAHCSAHDY
DDPGKPAIAWEDKAARDRLVDGLVGDAHRVLGYLPDQELAPRAAEAVALL
ALIAGQDVEPVEGSDGTDGHWRIAQQVSGDRVISTVDADTRHAHKTVHRR
QDGFKAHLAVEPDTGIITDCALTKASGADNHEAVVGLSLLEGEHTPGAGP
GRFGVWHRGRPGGTGRRRPCRGDQTATATLAGSGWLHQRRFPHRFRRPHR
DLPGWTRNADPSQWRSHLRKILSLMPVGVPVHDRHARTQTHPSYPRAIIA
RCSRRGPRPRLASRIPPTPAHGGTLNGLAHSRQPQGPLPRNRQKQPLAAP
PRRSTEPTSAAHHGPDPHRHHLGHRLTRPHTKALPQPPTAATLNNGGAIS
FLSTHRV
>MAP1395 hypothetical protein
MRDAAEQLLVDPVEAARRLLGATLTGRGVSGVIVEVEAYGGVPDGPWPDA
AAHSYKGLRARNFVMFGPPGRLYTYRSHGIHVCANVSCGPDGTAAAVLLR
AAALEDGTDVARGRRGELVHTAALARGPGNLCAAMGITMADNGIDLFDPD
SPVTLRLHEPLTAVCGPRVGVSQAADRPWRLWLPGRPEVSAYRRSPRAPA
PGTSD
>MAP0973 hypothetical protein
MRVSSNRRERQAPPAPEPLAPLIDAHTHLDACAAGPETGAAGVRAIVDRA
AAVGVQAVVTVADDLDSARWVTRAAEWDPRVYAATALHPTRADALDDAAR
AEIERLVAHPGVVAVGETGMDLYWPGRLDGCAEPAVQREAFAWHIDLAKR
CGKPLMIHNREADAEVLDVLAAEGAPDLVIFHCFSSDAAMARRCVDAGWL
LSLSGTVSFRNARALREAVPLIPPGQLLVETDAPFLTPHPHRGTANEPYC
LPYTVRAIAELVDRRPEELAAVTTDNARRVYGLA
>MAP3304 hypothetical protein
MAPVTDEQVERVRALVAAIPPGRVATYGDIAAVAGLSSARIVGWIMRTDS
SDLPWHRVITASGRPARHLRTRQLELLRTEGVLATDGRIPLPEVRHRFGA
>MAP2150 hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRINQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP3467c hypothetical protein
MSWGFGDDCWVQGRSDDQRELLDAESVAGHLLKADSMFAFLAAHRSQLFP
EEMFADLFPSQRGRPSVPAEVMASVITLQALHGFSDNETVDAVTFDLRWK
AACGLPITAGAFHSTTLTYWRRRLAASDRPNRIFEAVKTVVAETGVLAGK
TRRALDSTVLDDAVATQDTVTQLIAAIRRVRREVPGAAAVIEAHCSAHDY
DDPGKPAIAWEDKAARDRLVDGLVGDAHRVLGYLPDQELAPRAAEAVALL
ALIAGQDVEPVEGSDGTDGHWRIAQQVSGDRVISTVDADTRHAHKTVHRR
QDGFKAHLAVEPDTGIITDCALTKASGADNHEAVVGLSLLEGEHTPGAGP
GRFGVWHRGRPGGTGRRRPCRGDQTATATLAGSGWLHQRRFPHRFRRPHR
DLPGWTRNADPSQWRSHLRKILSLMPVGVPVHDRHARTQTHPSYPRAIIA
RCSRRGPRPRLASRIPPTPAHGGTLNGLAHSRQPQGPLPRNRQKQPLAAP
PRRSTEPTSAAHHGPDPHRHHLGHRLTRPHTKALPQPPTAATLNNGGAIS
FLSTHRV
>MAP2203c hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP1785 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2964c hypothetical protein
MTLSAPGSAHLVLAQNVVHLDQAAAVFEAMLEGWRRQQSARFLRAGTIGA
RLRLIRRLEAFSGLYPWQWAPADGEAFIDHLRSTTVEAVSTARSYEIDIS
LFMEYLLDPRYGWAGVCADQFGDVPQAIFHEGNSIQHKLDYEGDPRRRPL
SYDEIQALFDAADARSGTIQGRGVKGALGAARDAAVLKTIYAFGLRRTEA
SRLDLVDLRRNSQAPQFGGFGVVMVRYGKAPKGAPPKRRTVLLVPEMDWV
VETLDQWLTEIRPRFSPPDRHPALWVTERGGRLSPRSINEAFVAARDDAR
LDRSLNLHCLRHSAVTHWTEFGYPARFVQEQVGHAHASTTSIYTHVSNEY
RNKLLKASLMGRLGDHWDGAPT
>MAP3713c hypothetical protein
MNHMTRPVSLEVAGRRVTITHPDKVVFPRAGGGETTGSAAGPHTKLDLVR
YYLSVADGALRGVAGRPMILKRFVKGITQEAVFQKRAPANRPDWVDVAEL
RYARGTSAAEAVIHDAAGLAWLINLGCVDLNPHPVLADDLDHPDELRVDL
DPMPGVSWRRIVDVALVAREVLEDYGLTPWPKTSGSRGFPIYARIARRWE
FRQVRLAAQTVAREVERRAPEAATSRWWKEEREGVFVDFNQNAKDRTVAS
AYSVRATPDARVSTPLRWDEVAGCRPEAFTIDTVPARLAEIGDPWAGMDD
AVGDLDRLLVLAEELGPPERAPKGAGTRSGGRRRSSMPLIEVARTKTRDE
AMAALDVWRDRHPEAAARLQPADVLVDGMRGPSSIWYRIRINLQHIPEDQ
RPPQEELIADYSPWQGYGGRQKPSWG
>MAP2502 hypothetical protein
MSWGFGDDCWVQGRSDDQRELLDAESVAGHLLKADSMFAFLAAHRSQLFP
EEMFADLFPSQRGRPSVPAEVMASVITLQALHGFSDNETVDAVTFDLRWK
AACGLPITAGAFHSTTLTYWRRRLAASDRPNRIFEAVKTVVAETGVLAGK
TRRALDSTVLDDAVATQDTVTQLIAAIRRVRREVPGAAAVIEAHCSAHDY
DDPGKPAIAWEDKAARDRLVDGLVGDAHRVLGYLPDQELAPRAAEAVALL
ALIAGQDVEPVEGSDGTDGHWRIAQQVSGDRVISTVDADTRHAHKTVHRR
QDGFKAHLAVEPDTGIITDCALTKASGADNHEAVVGLSLLEGEHTPGAGP
GRFGVWHRGRPGGTGRRRPCRGDQTATATPAGSGWLHQRRFPHRFRRPHR
DLPGWTRNADPSQWRSHLRKILSLMPVGVPVHDRHARTQTHPSYPRAIIA
RCSRRGPRPRLASRIPPTPAHGGTLNGLAHSRQPQGPLPRNRQKQPLAAP
PRRSTEPTSAAHHGPDPHRHHLGHRLTRPHTKALPQPPTAATLNNGGAIS
FLSTHRV
>MAP3256c hypothetical protein
MTATSKAPGADRYLPEQRDIEALKHAAETCRGCSLFADATQTVFGNGHPG
APIMLVGEQPGDQEDRAGAPFVGPAGRLLARALHDAGIDPGLTYQTNAVK
HFKFTRKDGKRRIHQKPGRTEIVACRPWLIAEIEAVHPRVIVCLGATAAQ
SLLGTSFRVSTQRGQPLKLPASPEVIPDVAPEPVLVATVHPSSVLRDRSE
RHDEVYRLFVDDLRSARSALG
>MAP0106c hypothetical protein
MVRAMLSGWAKQQLGGRLCAENTVKVRASCVGEFIEFSGAYPWEWTARMM
DEWSAHLVGSLSRAKSTIRQKQGAVRLFCSFITSPFYDWPAQCELWFGDH
PTQICHEWNTSAHLVDYEGDPGRRPFTRKELQDFLDCADAQVEKARRSRR
KGTLAAYRDTTMFKVMYGWGLRISELCRLDLADMYRNPHAPELGQCGFLH
VRYGKASRGSPPKRRTVPTLMPWAAEALLDYVNNIRPLYEPGQKQALWLT
ERRSQVKVRTLTGTFDDIRDEVGLDRKLTPHCFRHSFISHMTEDGVDPRF
LQEISGHRFASTTGIYTHVTGEFMNKMLTDALGRVSSFGEGHQ
>MAP0664c hypothetical protein
MRELSVAEQRYQAVMAVISDGLSVSQAAEKFGVARQTLHRWLARYEAAGL
EGLVDRSHRPVSCPHQMLAVVEAAVLELRRSRPYWGPRRLVFELAKRGVH
PVPSESAVYRALVRAGLIDPAMRDRRSRKWKRWERGAPMELWQLDIVGGF
PLADGTSAKALTGIDDHSRMCVCAKLMARERTRAVCDGLRAALAAYGVPE
QILTDNGKVFTERFCHPPVEVLFDAICREHGIEHLLTQPRSPTTTGKIEQ
FHRSLRAEFLSGREPFTNLKVAQQALDEWVEDYNTTRPHQALKMITPAQR
FHAGAPASPPSNSCARHVDRSGDDWVSRRVCSNGIVCVSWQQVCIGRHYA
GARCDVHVDGDLLRFWVGDNLVKTAARTSRGEVRNKQALRTNAPA
>MAP3854 hypothetical protein
MQQPLAEGLLMRSSRYAVVGRRFWELVRAGMSADDAGVAVGVSMAAGRLW
FADAGGVRPRFVDQSIPRRRPRLTVEEREEIQDGVARSESIRVIARRLGR
HPSTVMREIERNAICRGRYRARYRFGVRWRGGHDPRPRYRASLAHTRAHV
KARRSRPGKLATNQLLHDEVQTRLNEQHSPQQIAWRLRRDFPDDAEMWVS
QETIYQAIYVQGKGNLRRELHTCLRTGRALRKPRRRPGERRGHLRDMVNI
SERPPEVADRAVPGHWEGDLILGSTASGSAIGTVVERTTRFVMLLHLPDG
HGAEAVQEAIVAKMAGLPATLRQTLTWDQGKEMANHAAIAAATELDIYFC
DPHSPWQRGTNENTNGLLRQYFAKGTDLSAFPADYLDYVASKLNRRPRQT
LDWKTPAEALDELLCKPFTPPAVA
>MAP3078c hypothetical protein
MSHRNAPLSETGRLRLARCVVDEGWSLRRAAERFQVSVTTAERWARRYRE
LGEAGMADRSSRPHHSPNRTPTRTERRIIKVRVIRRWGPARIGYLLGIHP
STVHRVLTRYALAKLRWLDRSTGRIIRRMEPAGCGDLVHVDVKKLGKIPA
GGGWRMLGRAIGGHNSNADKSSGVFSKHRNPIRGYHYLHTAIDGYSRLAY
SEVLDDEIKETAAEFWTRANAWFAECGISVRKVLTDNGSCYRSRVFAQAL
GDIEHRRTRPYRPQTNGKVERFHRTLADEWAYARLYRSDTERCEEFTTWL
HTYNHHRGHTALGGQPPASRVPNLSGQYT
>MAP2193 hypothetical protein
MRLRRWWLFLSGRELVAPARCCRSRATRQHLSHRDRQYRHPGIEFAGDDG
RRHRGQRRQDAGQARSRRRRGLGETRCGGARKRGGRRRTDQPAGFDARGA
QSPAGPTAAGTAAARGHHPAGQGIVGRLQGVLGQIGDIVHNFSAALSGHE
TDVRQLLTRLDEFVGVLDQQRDRIIASIDSLNRLAGTFASQREVITQALR
KIPPALDVLIRERPRITAALDKLRVFSNTATQLVNETQADLVKNLQNLEP
TIQALADVGPDLSTVLGYVPTFPFTQNFIDRAVRGDYFNVFAVIDMTIPR
LKRTLLAGTRWGDPDAPLVPAPGDPWFSNYTYDPLGFGVTNPPLAPPPSP
GAPASAPPVMAPDMSVIAPVDQPSRGGG
>MAP2711c hypothetical protein
MTWLMVGIAILVAVLAVVGVWAYRTANRLDRLHVRYDLSWQALDGALARR
AVVARAVAIDAYGGASEGRRLAALADAAEAAPRAAREARENELSAALAAV
DPASLPAGLIAELADAEARVLLARRFHNDAVRDTLALAERRLVRLLHLGG
TAALPSYFEIVERPHALAHGDHGVLNHRTSARVVLLDDRGAVLLLRGSDP
ALAGQQAPKWWFTVGGEVQPGERLAEAAARELAEETGLRVAPSELVGPVW
RRDEVFEFNGSLIDSEEFYFVYRTQRFEPSRTGRTELEHSYIHGHRWCDA
ADIAQLVAAGETVYPLQLSGLLTDAAALASGRTPGPLLSIR
>MAP2226c hypothetical protein
MQTELPAERPQRRLRAEPDADAAVQPEEPGPIAGEDSPDDDQNSLLPRWL
PGAPEHRGWVARIRADPGRAGAIGLAIVAALAVLVTVFTLIRDRPAPVMS
AKLPPVEKVSTASPRSSASPSGGPDRPVVVSVVGLVHTPGLVTLAPGARI
ADAVQAAGGAVNGADTAGLNMARPLDDGEQIVVGLAPVPGQPPVLGSSVA
AGSTPAPKPPPGPGAAKAKPKTGDAVDLNTATVQELDALPGVGPVTAAAI
VAWRQTNGRFTSVDQLADVEGIGPARLEKLRALVRV
>MAP4336 hypothetical protein
MPGSPPTGPLPPVPAGIDRRRPELSDSALVSRSWAMAFATLVSRLTGFAR
VVLLAAILGAALSSAFSVANQLPNLVAALVLEATFTAIFVPVLARAEQSD
PDGGAAFVRRLVTLTTALLIVATALSVAAAPLLVRLMLGRTPQVNEPLTV
AFAYLLLPQVLAYGLTSVFMAILNTRNVFGPTAWAPVVNNVVALATLAVY
ALVPGELSVDPVRMGNAKLLVLAVGTTLGVFAQTGVLLVALRRQHVDLRP
LWGIDQRLKRFGTMAAAMVLYVLISQLGLVVGNQIASTAAASGPAIYNYT
WLVLMLPFGMIGVTVLTVVMPRLSRNAAADDTRAVLADLSLATRLTLITL
IPIVAFMTVGGPAMGSALFAYGHFGDVDAGYLGAAIALSAFTLIPYGLVL
LQLRVFYAREQPWTPIVIILVITAVKILGSMLAPHLTGDPKLVAGFLGLA
NGVGFLAGAVIGYVLLRRTLLPGGGHLIGVGEVRTILVTLTAAMLAGLVA
HVADRLLGLGALTAHGGGAGSLLRLLVLALIMVPITAAVMLRAQVPEARA
ALDAVRFRITGRGPRPRKPAAPDRSSHRRPVTYPEQRNSSPPGVNAVQEP
IRRRPPERANRARLVKGPEVTDRPMESAASSAGPGTGSGAPRPVADDFQP
DIPADQPDRPRKADPRPADQKNGDVGTRRGPLDVPRERTADSSTDDVHLV
PGARIAGGRYRLLVFHGGAPPLQFWQALDTALDRQVALTFVDPDRALPDE
VLQEILSRTLRLSRIDKPGIARVLDVVHTGSGGLVVSEWIRGGSLQEVAD
TAPSPVGAVRAMQSLAAAADAAHRAGVALSIDHPSRVRVSIEGDVVLAYP
ATMPDANPQDDIRGIGAALYALLVNRWPLAESGVRSGLAPAERDSSGNPV
EPMAIDRDIPFQISAVAVRAVQDDGGIRSASTLLNLLQQATAVADRTEVL
GPIDDSPSPSTALISPGNDPATFARRRRNVLIGVGAGLAVLVAALLVLAS
IVSKIFGNVGGGLNKDELGLNGPSSSTSAPQTTTSTAAGSVVKPTRASVF
SPDGDADNPGTAGQAIDGDPSTAWATEVYTDAVPFPSFKQGEGLILQLPS
PTVVGQVSIDTPSTGTKVEIRAASSPTPAGLNDTTVLAPAFTLKPGHNVI
PVRAGSPTSNLLVWISTLGTTNGKSQAGFSEITVQAAS
>MAP3984c hypothetical protein
MTPVPDFIVELRRRIGHAPLWLPGITAVTIRGRKVLLVKRSDNGAWTAVT
GIVEPGENPADCAAREVREETGVSARATRLAWVHVTRPAIHANGDHAQYL
DHVFRMEWLSGEPFPADDESTAAAWFDLDELPPMTADMRRRITLSANDDE
RTVFDTDGPPPARPSG
>MAP1072 hypothetical protein
MPEAVSDGLFDLPGAPPPGDHGLGVPAGAPLAVRMRPASLDEVVGQDHLL
APGSPLRRLVEGSGVASAILYGPPGSGKTTLAALISQATGRRFEALSALS
AGVKDVRAVIESARTALLRGEQTVLFIDEVHRFSKTQQDALLSAVENRVV
LLVAATTENPSFSVVAPLLSRSLILQLRPLSADDIRTVVRRAIDDPRGLG
GRVPVAPEAVDLLVRLAAGDARRALTALEVAAEAGESVTVQTVEQSLDEA
AVRYDRDGDQHYDVISAFIKSVRGSDVDAALHYLARMLVAGEDPRFIARR
LMILASEDIGMADPAALQVAVAAAQTVALIGMPEAQLTLAHCTVYLATAP
KSNAVTTALGAAMSDIKAGKAGLVPAHLRDGHYSGAAALGHAQGYQYSHD
HPDGVVAQQYPPDELVGVDYYRPTGRGAEREMAGRLDRLRAIIRNKRGRS
>MAP1793c hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP0889 hypothetical protein
MPELPEIEALADHLRRHAVGLPIGRVDVAALSVLKTFDPPISALHGQTVV
GAERWGKYLGLRTEGLFLIAHLSRAGWLRWSDRLTAAPLRPGKGPIALRV
HLGTPGAAPGFDLTEAGTQKRLAVWLVDDPARVPGIAALGPDALDLDVDA
LADLLAGNTGRIKTVITDQKVIAGIGNAYSDEILHVAKISPFATAGKLSD
KQLATLHDAMVTVLTDAVSRSVGQGAAMLKGEKRSGLRVHARTGLPCPVC
GDTVREVSFADKSFQYCPTCQTGGKILADRRMSRLLK
>MAP1771c hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP3814c hypothetical protein
MGARGLPSPPTGTRGDSCFFEWSSAARLRCRPVSLFGNLEPRVSGLLLAE
CLGHARRTRGLTADLEKLWKVIGLSEQDTVRVVMEPTRNAWVALASWFRH
HGARISMVPTTQSADLRAYYSKHTKNDHLDSKLLARLPLLHPEGLRDHSG
DGPGEPLRRLVKIRSSIVKRRTAVFQRLDAQLELLGPAWYDALGSSYGKA
ALALLARYADPHSLIRLGHARLTRFLIRHSRGAWREPHATLLLTAAQESL
QLWGSGANARIDFAELAADIATEAEQAQMLTEQIDDLDERAANLYAEADP
RGIIASAPGLGPVTCAVIAGRIGDPHRFHSLAAIRAYSGLVPKVSQSGQA
EQRHGLTKAGDPLLREALFAAADQARKTDPQLAAKYKRLMTTERHHDSAI
CHIATTLLTRIATCWRTGAHYVLRDTDGRPITFEEGRRIVRAHHAVDKKT
RINAASKRYSQRQKGRTGREPQESPSAPIHRPVHPSIKPTEVA
>MAP2494c hypothetical protein
MRFLHTADWQLGMTRHFLAGDAQPRYSAARRDAVAGLGALAAEVGAEFVV
VSGDVFEHNQLPPKVVGQSLEAMRAIGIPVYLLPGNHDPLDASSVYTGAL
FTAERPHNVTVLDRAGVHQVRPGLQLVAAPWRSKVPTTDLVGEVLDGLPE
TDDTRILVGHGGVDVLDPDRDKPSLIRLAKLEDALTRGAVHYVALGDKHS
LTQVGSSGRVWYSGSPEVTNFDDVEADSGHVLIVDIDETDPRRPVSVTAR
RVGCWRFVTLHRQVDSSRDIADLDMNLDLMTNKDRTVVRLALTGSLTVTD
RAALDACLDRYARLFAHLRTWDSHTELAVIPADGEFSDLGIGGFAAAAVE
ELVATARQQDSDTAADAQGALALLLRLADRGAA
>MAP4281 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP2608 hypothetical protein
MSHRNAPLSETGRLRLARCVVDEGWSLRRAAERFQVSVTTAERWARRYRE
LGEAGMADRSSRPHHSPNRTPTRTERRIIKVRVIRRWGPARIGYLLGIHP
STVHRVLTRYALAKLRWLDRSTGRIIRRMEPAGCGDLVHVDVKKLGKIPA
GGGWRMLGRAIGGHNSNADKSSGVFSKHRNPIRGYHYLHTAIDGYSRLAY
SEVLDDEIKETAAEFWTRANAWFAECGISVRKVLTDNGSCYRSRVFAQAL
GDIEHRRTRPYRPQTNGKVERFHRTLADEWAYARLYRSDTERCEEFTTWL
HTYNHHRGHTALGGQPPASRVPNLSGQYN
>MAP0428 hypothetical protein
MKELTVAEQRYQAVLAVISDGLSISLVAEKVGVSRQTLHAWLARYEASGL
EGLVDRSHRPVSCPHQMPAVVEAALLERRRSRPYWGPRRLVFELAKRRVG
PVPSESAVYRALVRAGMIDPALRDRRSRKWKRWERGAPMELWQMDVVGGF
PLADGTSAKALTGIDDHSRMCVCARLMARERTRAVCDGLRAALAAYGAPQ
QILTDNGKVFTGRFNHPPVEVLFDAICRQNGIDHLLTQPRSPTTTGKIER
FHRSLRAEFLSNTRAFSNLKTAQQALDEWVHYYNTARPHQSLNMTTPAER
FTATASPVSPGDDVPASIDRDGQDWVSRRVTTNGVVSVAWQQVCVGAHYA
GARCDVHVDGELLRFWIGDQLVKTAARTNHAEVRNKRAFRTREQA
>MAP0849c hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRTNQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP1150c hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRINQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP2156 hypothetical protein
MICAFITEHRARFGVAPICRVLTEHGCKIAPRTFYAWLARAPSARALWDT
VITEVLAGYYEPDEHGRRKPESLYGATKMWAHLQRQGIGVARCTVERLMR
ANGWRGVTRRKKVRTTIADPAAARAADLVKRKFRVPAPNMLLVADFTYVR
LASGAFVYTAFAIDAYAGRIVGWTCSASKEDRFLRQAIRDAAQLRSNEGN
PLLGNTVHHSDAGSQYTSVRFGETLSLSGLVPSIGTVGDAFDNALAETTI
GLYKTEAVRADSPFRRGPLNRLADVEMLTADWVHWYNTDRLMHRLGRIPP
VEYEAIYYAANTAQSAAAHQ
>MAP2274 hypothetical protein
MALDQSALLEVLDALRNADAADRIKQAAETIYQALIDAELTAVIGAGPHE
RSASRINQRNGSRPRTLSTIAGDLELRIPKLRSGSFFPALLERRRRVDQC
LFAVVMEAYLHGTSTRKVDDLVKALGADAGISKSEVSRICADLDTEVGAF
RDRPLSEQHFPYVFLDATYCKARVNHRVVSQAVVIATGVAADGRREVLGF
DVGDSEDGAFWTAFLRSLKTRGLSGVQLVISDAHTGLRSAIEAILIGASW
QRCRVHFLRNVLAQVPKGSAEMVAAAIRTIFAQPDAEHVREQLDTIAGML
GRQLPKVETMLREAADDITAFADFPVLHWKKIWSTNPLERLNKEIKRRTD
VVGVFPNPAALLRLAGSVLVEAHDEWQVAEKRYLSETTLALLHPRSDSAD
QSVAVPAAITA
>MAP1408 hypothetical protein
MTTVALETQLQGYLDHLTIERGVAANTLSSYRRDLRRYTKHLSDRGISDL
AKVGEDDVSEFLVALRRGDPDTGAAALSAVSAARALIAVRGLHRFLAAEG
LAELDVARAVRPPTPGRRLPKSLTIDQVLALLEAAGGESPADGPLTLRNR
ALLELLYSTGARISEAVGLDVDDVDTQARSVLLRGKGGKQRLVPIGRPAV
AALDAYLVRGRWELARRGRGTPAIFLNVRGGRLSRQSAWQVLQDAAERAG
ITSGVSPHMLRHSFATHLLEGGADVRVVQELLGHASVTTTQIYTMVTVHA
LREVWAEAHPRAR
>MAP1843 hypothetical protein
MIVQPEARQSAARPAGQVSRPALSPSRAADFKQCPLLYRFRAIDRLPEAP
SPAQLRGSLVHAALQQLYELPAAQRGPETALALVDPAWEQLLAATPELTA
DLDPTQHGQLLAEAQALLAGYYRLEDPTRFDPQCCEERVEVELADGTLLR
GFIDRIDVAATGELRVVDYKTGKAPPAARALAEFKAMFQMKFYAVALLRS
RGVPPTRLRLIYLADGQVLDYSPEYDELLRFEKTLMAIWRAIQTAGRTGD
FRPTQSRLCDWCPHQQLCPLFDGTPPPYPGWPDTLGTQDNSTLSDPVDLA
G
>MAP3755 hypothetical protein
MDFQFDVSIDGRPIKIVSIVDEHTRECLAGMVERSITGEHLIAELDQLAV
QHGTYPGCCGATMPPELACSAIAGWASGQIGPGSHQRLETRLQPPPPTLG
PGHWCGMARDPIILDQATPPGRQPSCNKRPRSSRMARWWVIWPSVMVKM
>MAP3480 hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP1173c hypothetical protein
MTGMADRTVRGGQERSRIKTLTQAALNADKTVEQVEDVLDGLSSTLKELS
SSLAALNATVERMETGLDHLDGTLASLDDLAKRLIVLVEPVEAIVARVDD
LVKVGETVMSPLSVTEHAVRGLVDRLRNRTAQ
>MAP2961c hypothetical protein
MTPADPRALRAWAYLSRVAEPPCPGLGALVRRVGPVEAADRVRRAAVDDA
LAKHTEARREIDRAAADLELIARRGGRLVTPDDDEWPVLAFAAFGAAAPR
PPGTAPMLAPLVLWAQGPARLDEVAHRAAAVVGTRAATAYGEQVAGDLVA
GLVQRDVAVVSGGAYGIDGAAHRAALDCDGVTVAVLAGGLDVAYPSGHSA
LLHRIGQHGLLFTEYAPGVRPARYRFLTRNRLVAAMAGATVVVEAGLRSG
AANTAAWARALGRVVAAVPGPVTSSASAGCHALLRNGAELITRAEHVVEL
IGHIGELAAEEPHPVTPLDGLGEAERRVYEALPGRGAATVEQLAAGCGLL
PEQALGPLAMLELAGLVRRQDGRWRIVRTAAGPAPPPA
>MAP3754 hypothetical protein
MATRKRHTPEQIVRKLTAARLRAAGNETAAVCRELGVSEVTYHCWRNQFG
GLKAEDAKRLKDSNARTPPSMGSG
>MAP1481c hypothetical protein
MIRASRTGLSRGDRFCQPAISPPRAGPRDVLVSHPVLPERHRKVTTPKTF
ADLGVPARIVDALTARGITSPFPIQAETLPDTLAGRDVLGRGKTGSGKTL
AFSIPLVGRLSTGNRRPARPTGLVLAPTRELATQITATLEPLAAACGLRV
STIFGGVSQHRQVTALKAGVDIVVACPGRLEDLMRQRLITLEAVRVTVID
EADHMADLGFLPGVTRILAATPNDGQRLLFSATLDNGVDKLVTRFLRDAV
LHSVDEANSPVSEMTHHVFHVDSVQAKKELVHRLASGTGRRILFLRTKHQ
ARKVARQLTESGVPSVDLHGNLSQPARERNLAMFAAGSARVLVATDIAAR
GVHVDEVELVVHIDPPSEHKSYLHRSGRTARAGSAGDVVTVVLPEQREHT
RALMRKAGIDVAPQRVTAGSQAVHALVGPIAPPKPPAAAGVPSHPAGPHR
PAAAGQRRRRSGRSARTTAAHAVPHRPAAPVRRPDRRRASRAQGSAG
>MAP0159c hypothetical protein
MTVTEVVVAQPVWAGVDAGKADHYCMVINDDAQRLLSQRVANDEAALLEL
IAAVTTLADGGEVTWAIDLNAGGAALLIALLIAAGQRLLYIPGRTVHHAA
GSYRGEGKTDAKDAAIIADQARMRHDLQPLRAGDDIAVELRILTSRRSDL
VADRTRAINRMRAQLLEYFPALERAFDYNKSRAALILLTGYQTPDALRSA
GGARVAAFLRKRKARNADTVAATALQAANAQHSIVPGQQLAATVVARLAK
EVMALDTEIGDTDAMIEERFRRHRHAEIILSMPGFGVILGAEFLAATGGD
MAAFASADRLAGVAGLAPVPRDSGRISGNLKRPRRYDRRLLRACYLSALV
SIRTDPSSRTYYDRKRTEGKRHTQAVLALARRRLNVLWAMLRDHAVYHPA
TTTAAA
>MAP3748c IS1110, IS1110
MAAGRLWAGVDVGREYHWVCVVDDTGAVVLSRKLVNDEQPIRELVAEIDQ
LAEEVSWTVDLTTVYAALLLTALAAADTPVRYLSGRAVWQASAVYRGGEA
KTDAKDARVIADQSRMRGADLPVLAPDDDLITELRMLTAHRTDLVADRTR
TINRLRQQLVAVCPALERVAALTSDRGWVMLLSRYQRPKAIRNSGVSRLT
KILGDAGVRNAASIADAAVTAAKTQTVRLPGEEVAAHLVAELAQGVIALD
ARITATDADIEGRFRRHPLAEVITSLPGIGFRLGAEFLAAVGDPARIGSA
DQLAAWAGLAPVPSDSGKRTGRLHTPQRYSRRLRPVMYMSALTAIRCDPQ
SRAYYQSKRDEGKQSIQATICPARRRTNVLYALIRDNRTWQPDSPPITES
AA
>MAP3286 IS1547, IS1547_2
MSADRPNRHLTVKEATSMVVVGADVHKRTHTFVAVDDVGRKLGEKVVAAT
TAGHAEAVMWARERFGTEVVWAIEDCRHLSARLERDLMGFGQSVVRVPPK
LMAQTRASARTRGKSDPIDALAVARGFLREPDLPVASHDEVSRELKLLVD
RREVLVAQRTATINRLLWRVHELDPDHAPKAGSLDLAKHRRILGDWLVTV
PGLVAELARDELADITRLTETINALAKRIGERVRVVAPVLLSLPGCAELT
AAKLVGEAAGVTRFKSEAAFARHAGVAPIPVWSGNTAGRVRMTRSGNRQL
NAALHRIAVTQIRLDGLGQTYYRHRIAVSGSKTEALRCLKRRLARVVFHH
LHTDHQNRIQPCQPAAA
>MAP1952c IS1547, IS1547_1
MSADRPNRHLTVKEATSMVVVGADVHKRTHTFVAVDDVGRKLGEKVVAAT
TAGHAEAVMWARERFGTEVVWAIEDCRHLSARLERDLMGFGQSVVRVPPK
LMAQTRASARTRGKSDPIDALAVARGFLREPDLPVASHDEVSRELKLLVD
RREVLVAQRTATINRLLWRVHELDPDHAPKAGSLDLAKHRRILGDWLVTV
PGLVAELARDELADITRLTETINALAKRIGERVRVVAPVLLSLPGCAELT
AAKLVGEAAGVTRFKSEAAFARHAGVAPIPVWSGNTAGRVRMTRSGNRQL
NAALHRIAVTQIRLDGLGQTYYRHRIAVSGSKTEALRCLKRRLARVVFHH
LHTDHQNRIQPCQPAAA
>MAP1287 IS1601_B, IS1601_B_3
MIFVGDDWAEDHHDVHLMDESGARLASRRLPEGLAGIGEFHQLLARHAEE
PDQVVIGIETDRGLWVEALTAADYQVYAINPMAAARYRDRHHVSGAKSDA
GDAKLLADLVRTDRHNHRRVAGDSADTEAVKVLARAHQNLIWTRNRHTNA
LRSALREYYPGALEAFDDLHDRDALAILGRAPTPMQAANLSLSKIRSALK
AAGRQRNLDTVAQDIQTALRAEQLAAPAAVTAAFGATTRATVGIIAELNR
QITDLEAELATHFETHPDADIYRSLPGLGVILGARVLGEFGDDPNRYTTA
KCRKNYAGTSPLTVASGRKRAVLARHVRNRRLYDAIDQWAFCALNTSPGA
RLFYDRRRAAGDLHHQALRALGNRLVGILHGCLRHRTHYDEHKAWAHRQT
NPDSQAA
>MAP0832c IS1601_B, IS1601_B_1
MIFVGDDWAEDHHDVHLMDESGARLASRRLPEGLAGIGEFHQLLARHAEE
PDQVVIGIETDRGLWVEALTAADYQVYAINPMAAARYRDRHHVSGAKSDA
GDAKLLADLVRTDRHNHRRVAGDSADTEAVKVLARAHQNLIWTRNRHTNA
LRSALREYYPGALEAFDDLHDRDALAILGRAPTPMQAANLSLSKIRSALK
AAGRQRNLDTVAQDIQTALRAEQLAAPAAVTAAFGATTRATVGIIAELNR
QITDLEAELATHFETHPDADIYRSLPGLGVILGARVLGEFGDDPNRYTTA
KCRKNYAGTSPLTVASGRKRAVLARHVRNRRLYDAIDQWAFCALNTSPGA
RLFYDRRRAAGDLHHQALRALGNRLVGILHGCLRHRTHYDEHKAWAHRQT
NPDSQAA
>MAP2050 IS1601_B, IS1601_B_2
MIFVGDDWAEDHHDVHLMDESGARLASRRLPEGLAGIGEFHQLLARHAEE
PDQVVIGIETDRGLWVEALTAADYQVYAINPMAAARYRDRHHVSGAKSDA
GDAKLLADLVRTDRHNHRRVAGDSADTEAVKVLARAHQNLIWTRNRHTNA
LRSALREYYPGALEAFDDLHDRDALAILGRAPTPMQAANLSLSKIRSALK
AAGRQRNLDTVAQDIQTALRAEQLAAPAAVTAAFGATTRATVGIIAELNR
QITDLEAELATHFETHPDADIYRSLPGLGVILGARVLGEFGDDPNRYTTA
KCRKNYAGTSPLTVASGRKRAVLARHVRNRRLYDAIDQWAFCALNTSPGA
RLFYDRRRAAGDLHHQALRALGNRLVGILHGCLRHRTHYDEHKAWAHRQT
NPDSQAA
>MAP2155 IS6110, IS6110
MPKKYDEATKSKAVRLVVDHRDEYDSEYGCIRAVATRIGVGPETLRKWVR
QAEIDGGERDGVTTATNRENRELKRKVAELEQTVEMLRAATTFFVRESDP
RHR
>MAP2443 alkA, AlkA
MHDDFERCYRAVQSKDARFDGWFVTAVLTTGIYCRPSCPVRPPFARNVRF
YPTAAAAQRAGFRACKRCRPDASPGSPEWNVRGDVVARTMRLIADGTVDR
DGVGGLAARLGYTTRQLERLLQAEVGAGPLALARAQRAQTARVLIETTEL
PFGDVAFAAGFSSIRQFNDTVRAVFESPPSVLRRRASARCASAANSPGAV
CLRLPVRTPFGFHGVFGHLAASVVPGCEEVRDGAYRRTLRLGFGTGIVTL
TPAADHVRCELVLDDFRDLTAAIARCRRLLDLDADPEAVDEALAADPQLA
PAVRKAPGQCIPRTVDEAELAVRAVLGQQVSIRAARTHAGRLVAAYGRAV
HDPEGTLTHTFPSVQQLADVDPIHLAVPKARQRTLAALVAGLADRSIVLD
TGCDWQSARTQLLALPGVGPWTAEVIAMRGLGDPDAFPAADLGLRVAAKR
LGLPSGQRSLTAASARWRPWRSYATQYLWTTLEHPVNHWPPQQPSKGILN
DVVKPPR
>MAP2521c deaD, DeaD
MTLPDSSTEAASPTFADLQIHPSVLRAIADVGYETPTGIQAATIPALMAG
SDVVGLAQTGTGKTAAFAIPILSKIDAASTATQALVLAPTRELALQVAEA
FSRYGAHLPKINVLPIYGGSSYAVQLAGLKRGAHVVVGTPGRVIDHLERG
TLDLSHVDYLVLDEADEMLTMGFAEEVDRILSETPEYKQVALFSATMPPA
IRKLTAKYLHDPLEVSTKAKTTTAENISQRYIQVAGPRKMDALTRVLEVE
PFEAMIVFVRTKQATEEVAERLRARGFSAAAINGDIPQGQRERTVAALKD
GGIDILVATDVAARGLDVERISHVLNYDIPHDTESYVHRIGRTGRAGRSG
TALLFVSPRERHLLKAIEKATRQPLTEAELPTVEDVNAQRVAKFADSITA
ALGAPGIDLFRKLVQDYEREHDVPMADIAAALAVQSRDGEEFLMAPEPPR
ERRERHTERRERTEKPRSTRPLATYRIAVGKRHKIGPGAIVGAIANEGGL
HRSDFGHIAIGPGFSLVELPAKLPKSTLKRLEQTRISGVLINLQPDRAAA
KARGRDGGKPRRKYGG
>MAP2431 dinG, DinG
MPELLATAVAALGGSEREGQQQMAAAVAQAFDTGRHLVVQAGTGTGKSLA
YLVPAIVHALRDDSPVVVSTATIALQRQLVDRDLPRLIDSLAAALPRRPQ
FALLKGRRNYLCLNKIHNGGPADGEEAADRPQEELFNPMAVSALGRDVQR
LTEWASSTDSGDRDDLKPSVPDRSWSQVSVSARECLGVARCPFGAECFSE
RARSRAGQADVVVTNHALLAIDAVSDSAILPEHALLVIDEAHELVDRVTA
VATAELTSAALGVAARRIGRLVSPELVQRLEATTATFAAAIHDGTPGRID
RLDDELATYLAALRDAASAARSAIDTTSDPKAASARAEAVAALSEISDTA
SRVLASFGPAIPDRTDVVWLDHEDNRGAMRPVLRVAPLSVADLLRDRVFS
RSTVVLTSATLTIGGSFDAMAAAWGLKGPDGDDPPWRGLDVGSPFQHAKA
GILYVAAHLPPPGRDGVGSAEQLTEIAELITAADGRTLGLFSSMRAARAA
AEAMRDRLSTPVLCQGDDSTSALVEQFSAEPQTSLFGTLSLWQGVDVPGP
SLSLVLIDRIPFPRPDDPLLGARQRAVAARGGNGFMAVAASHAALLLAQG
SGRLLRRVSDRGVVAVLDSRMATAGYGGYLRASLPPFWQTTNGAQVRAAL
QRLRTAATASGPG
>MAP3106 dinP, DinP
MSPRWVLHVDLDQFLASVELRRRPELAGLPVIVGGNGDPNEPRKVVTCAS
YEAREFGVRAGMPLRAAARRCPPDSGVTFLPSDPAAYDAASDQVMGLLRD
LGHPVEVWGWDEAYLAVTATDPIEVAEQIRGVISSETGLTCSVGISDNKQ
RAKIATGFAKPDGVFVLTDANWMALMADRPVDALWGVGPKTAKKLADLQI
TTVWQLAHSDAELLTATFGPRTGLWLLLLAKGGGDDHVSAQPWVPRSRSH
VVTFPRDLTDRAEMETAVTDLADQAVAEVLTAGRVVTRVAVTVRTATFYT
RTKIRKLDAPSTDRDVIVAAALRVLDLFELDRPVRLLGVRLELVMPD
>MAP1248 dinX, DinX
MESRWVLHLDMDAFFASVEQLTRPTLRGRPVLVGGLGGRGVVAGASYEAR
VFGARSAMPMHQAKRLVGVSAVVLPPRGVVYGVASRRVFDTIRAVVPVVE
QLSFDEGFGEPAQLAGAPAQDVEAFCEQLRRRVREQTGLIASVGAGSGKQ
IAKIASGLAKPDGVRVVRRAEERELLGGLPVRRLWGIGPVAEEKLHRLGI
ETIGELAALTDAEAANILGATIGPALHRLARGIDDRPVAERAEAKQISSE
STFAADLTTLEQLREAIEPIAEHAHHRLLRDGRGARTVTVKLKKSDMSTL
TRSATLPYATTEAAALVGVARRLLLDPREIGPIRLLGVGFSGLSEVRQES
LFPDLEMPAPQSDSQSVETAAEAMFGPGHDAGWRVGDDVAHPDLGHGWVQ
GAGHGVVTARFETRTSGPGPARTFPADSAELVRANPVDSLDWPDYVEGLQ
ESSAPPAEDVGGR
>MAP0001 dnaA, DnaA
MADDPGSSFTTVWNAVVSELNGEPVADGGAANRTTLVTPLTPQQRAWLNL
VRPLTIVEGFALLSVPSSFVQNEIERHLRAPITDALSRRLGQQIQLGVRI
APPPDDVEDAPIPPAEPFPDTDAALSADDGADGEPVENGEPVTDTQPGWP
NYFTERPHAIDPAVAAGTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPA
RAYNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTEEFTNDFIN
SLRDDRKVAFKRSYRDVDVLLVDDIQFIEGKEGIQEEFFHTFNTLHNANK
QIVISSDRPPKQLATLEDRLRTRFEWGLITDVQPPELETRIAILRKKAQM
ERLAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDKSLAEIV
LRDLIADASTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAM
YLCRELTDLSLPKIGQAFGRDHTTVMYAQRKILSEMAERREVFDHVKELT
TRIRQRSKR
>MAP0071 dnaB, DnaB
MAVVDDLTSGMDSSSPSEDFGRQPPQDLAAEQAVLGGMLLSKDAIADVLE
RLRPGDFYRPAHQNVYDAILDLYGRGEPADAVTVAAELDRRNLLRRIGGA
PYLHTLISTVPTAANAGYYATIVAEKALLRRLVEAGTRVVQYGYAGAEGA
DVAEVVDRAQAEIYEVAERRTTEDFVPLEDLLQPTMDEIDAIASNGGVAR
GVPTGFTELDEVTNGLHAGQMIIVAARPGVGKALALDTPLPTPTGWTTMG
DVAVGDELLGDDGRPTRVVAATDVMLGRPCYEVEFSDGTVIVADAAHQWL
TETRASRKSAQAAAVGYNRHKNQRTFAAVRTTAEIAETLRCPTQDRRLNH
SVVNARALELPDREFLVPPYTLGAWLGDGTSAAAQITAADPEIIMRIEAD
GVVAVPSGSAPYRYQLRLPPGAEQAPRRCVVCGKSFIPQTSQVRTCGRSC
GGRARFMSDPVPSPTCVRCGGPSAGMRLCLKCHSTVGTLQARLRTIGVLG
NKHIPTEYLRGSEAQRRALLAGLLDTDGTVTVGGAVQFSVTNQRLARDVN
ELIVSLGYRCQTSTKRVQGRSETSSIAYTLTFSTEDKVFALERKAIAHKE
RRAVTGTSRCGSRFIVDVRPIESVAVRCVEVDNDSHMYLASRAMVPTHNS
TLGLDFLRSCSIKHRMASVIFSLEMSKSEIVMRLLSAEAKIKLSDMRSGR
MSDEDWTRLARRMSEISEAPLYIDDSPNLTMMEIRAKARRLRQKADLRLV
VVDYLQLMSSGKKVESRQLEVSEFSRQLKLLAKELEVPVVAISQLNRGPE
QRTDKKPMLSDLRESGSLEQDADMVILLNRPDAFERDDPRGGEADFILAK
HRNGPTKTVTVAHQLHLSRFANMAR
>MAP1257 dnaE1, DnaE1
MNHSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGNMFGA
SEFYNAATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVSGSGS
YTHLTMVAENAAGLRNLFKLSSLASFEGQLGKWSRMDAELIAEHADGIIA
TTGCPSGEVQTRLRLGQDREALESAAKWREIFGADNFFLELMDHGLSIEQ
RVRDGLLEIGRKLNIPPLATNDCHYITRDAAHNHEALLCVQTGKTLSDPN
RFKFDGDGYYLKSAAEMRQIWDAEVPGACDSTLLIAERVQSYAEVWTPRD
RMPVFPVPEGHDQASWLHHEVMAGLRRRFPDGVGQDYIDRAEYEIKVICD
KGFPSYFLIVADLINYARSVDIRVGPGRGSAAGSLVAYALGITNIDPIPH
GLLFERFLNPERPSAPDIDIDFDDRRRGEMVRYAADKWGSDRVAQVITFG
TIKTKAALKDSARIHYGQPGFAIADRITKALPPPIMAKDIPLSGITDPSH
ERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMSSEPL
TEAIPLWKRPQDGAIITGWDYPSCEAIGLLKMDFLGLRNLTIIGDALDNI
KANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRDLLRRMQPT
EFNDIVAVLALYRPGPMGMNAHNDYADRKNGRQPIKPIHPELEEPLREIL
AETYGLIVYQEQIMFIAQKVASYTMGKADALRKAMGKKKLEVLEAEYKGF
YEGMTANGFSEKAVKALWDTILPFAGYAFNKSHAAGYGLVSYWTAYLKAN
YPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESLVNFASVG
QDIRFGLGAVRNVGANVVGSLIKTRNEKGKFTDFSDYLNKIDISACNKKV
TESLIKAGAFDSLKHARKGLFLVHTDAVDSVLGTKKAEAMGQFDLFGGDG
GCTESVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVD
TQIPAILDGDVPNETQVRVGGILASVNRRVNKNGMPWASAQLEDLTGGIE
VMFFPHAYSTYGADIADDAVVLINAKVAIRDDRIALIANELVVPDFSTAQ
VDRPLAVSLPTRQCTIDKVTALKQVLARHPGTSQVHLRLISGDRITTLEL
DASLRVTPSPALMGDLKELLGPGCLGG
>MAP3476c dnaE2, DnaE2
MGWFNGPPSWAEMERVLDSKPRRAGESAAPEPDGPLSRGRATYRPPDEGR
AARSSVPYAELHAHSAFSFLDGASTPEEMVEEAARLDLRALALTDHDGLY
GAVRFAEAAAELDVRTVFGAELSLGPSARTEAPDPPGPHLLVLARGPEGY
RRLSRQLAAAHLAGGEKGKPRYDLDALTEAAGGHWHILTGCRKGHVRQAL
SDGGPDAAARALADLVDRFGAARVSIELTRHGQPLDDERNAALAALAPRF
GVGVVATTGAHFAGPSRRRLAMAMGAIRARESLDSAAGWLAPLGGSHLRS
GAEMARLFAWRPQAVTAAAELGEQCAFGLALIAPRLPPFDVPDGHTEDSW
LRQLTMTGARDRYGSPEHAPRAYAQIEHELKVIAQLQFPGYFLVVHDIAR
FCRENNILCQGRGSAANSAVCYALGVTAVDPVANELLFERFLSPARDGPP
DIDMDIESDQREKVIQYVYDRYGRDYAAQVANVITYRGKIAVRDMARALG
YSQGQQDAWSKQISSWSGPADSPDVEGIPPQVIDLANQVRNLPRHLGIHS
GGMVICDRPIADVCPVEWARMENRSVLQWDKDDCAAIGLVKFDLLGLGML
SALHYAIDLVAEHKGIEVDLARLDLSEPAVYEMLARADSVGVFQVESRAQ
MATLPRLKPRVFYDLVVEVALIRPGPIQGGSVHPYIRRRNGVDPVLYDHP
SMEPALRKTLGVPLFQEQLMQLAVDCAGFSAAEADQLRRAMGSKRSTERM
RRLRSRFYDGMRALHGAPDEVIDRTYEKLEAFANFGFPESHALSFASLVF
YSSWFKLHHPAAFCAALLRAQPMGFYSPQSLVADARRHGVTVHGPDVNAS
LAHATLENAGTEVRLGLGAVRHIGDDLAEKLVQERKANGPFTSLLDLTAR
LQLSVQQTEALATAGAFGCFGMSRREALWAAGAAATQRPDRLPGVGSSSH
IPALPGMSELELAAADVWATGISPDSYPTQFLRDDLDAMGVVPAARLGSV
PDGDRVLIAGAVTHRQRPGTAQGVTFLNLEDETGMVNVLCTPGVWARHRK
LANTAPALLVRGQVQNASGAITVVAERLGRITLAVGSRSRDFR
>MAP2130c dnaG, DnaG
MSSPAGSRAQGDGRARGRGRIPDRDIAAIRERVRIDEVVGDYVQLRRAGA
DSLKGLCPFHDEKSPSFHVRPNHGHFHCFGCGEGGDVYAFLQKIEHVSFV
EAVELLADRIGHTITYSGPATSVQRDRGSRSRLIAANAAAAEFYAAALES
DEAAPARQYLTERNFDAEAARRFGCGFAPSGWDTLTKHLQRKGFEFKELE
AAGLSRQGRRGPMDRFHRRLLWPIRSSAGEVIGFGARRLFDDDPMEAKYI
NTPETLLYKKSNVMFGIDLAKRDIAKGHQAVVVEGYTDVMAMHLAGVTTA
VASCGTAFGDEHLAMLRRLMMDDSFFRGELIYVFDGDAAGRAAALKAFGG
EQNLAGQSFVAVAPDGMDPCDLRLRSGDAALRDLVARRTPLFEFAIRSAL
AELDLDSAEGRVAALRRCVPMVAQIKDPTLRDEYARQLAGWVGWSDVAQV
IDRVRSQSKHSAGAGRGGSGARVSRRAEQSAAPAGPAASRPDPRDPTLWP
QREALKSALQYPALAGPVFDSLTVESFTHPGYAAVRAAIEAAGGTSSGVT
GGQWIEAVRDRASSPLTAGLISELGVEAIQVDDEKLPRYIAGVLARLQEV
WMGRQIAEVKSKLQRMSPIEQGDEYHALFGDLVAMEAYRRSLLEQASGDD
>MAP0002 dnaN, DnaN
MDAATTTAGLSDLKFRLVRESFADAVSWVAKSLPSRPAVPVLSGVLLSGT
DEGLTISGFDYEVSAEAQVAAEIASPGSVLVSGRLLSDIVRALPNKPIDF
YVDGNRVALNCGSARFSLPTMAVEDYPTLPTLPEETGTLPADLFAEAIGQ
VAIAAGRDDTLPMLTGIRVEISGDTVVLAATDRFRLAVRELTWSAASPDI
EAAVLVPAKTLAEAARTGIDGSDVRLSLGAGAGVGKDGLLGISGNGKRST
TRLLDAEFPKFRQLLPAEHTAVATINVAELTEAIKLVALVADRGAQVRME
FSEGSLRLSAGADDVGRAEEDLAVDFAGEPLTIAFNPTYLTDGLGSVRSE
RVSFGFTTPGKPALLRPASDDDSPPSGSGPFSALPTDYVYLLMPVRLPG
>MAP0313c dnaQ, DnaQ
MSAVCWGRPASEPDAGWAVIDVETSGFRPGQARIISLAVLGLDAGGKVEQ
SVVSLLNPGVDPGPTHVHGLTAAMLEDQPQFADIAGDVVEVLRGRTLVAH
NVAFDYAFLAAEAELAGIELPVDTVMCTVELARRLDLGIDNLRLETLAAH
WGVTQQRPHDAFDDAMVLTGVLASALQRARERDIWLPVHPVTRRRWPNGR
VTHDELRPLKVLASRMPCPYLNPGPYVGGRPLVQGMRVALAAEVERTHEE
LVERILHAGLAYSDTVDRETSLVVCNDPAAEHGKGYLARQLGVPVISDAQ
FLDCVRAVIGGTSMDEFTDATPDPQLALF
>MAP0322c dnaZX, DnaZX
MALYRKYRPATFAEVVGQEHVTEPLSIALEAGRINHAYLFSGPRGCGKTS
SARILARSLNCVQGPTATPCGVCDSCLALAPNAPGSIDVVELDAASHGGV
DDTRELRDRAFYAPAQSRHRVFIVDEAHMVTTAGFNALLKIVEEPPEHLI
FIFATTEPEKVLPTIRSRTHHYPFRLLPPKTMRALIGRICEQEGVVVDDA
VYPLVIRAGGGSPRDTLSVLDQLVAGAEGGHVTYQRALGLLGATDLALID
DAVDALAAGDAAALFGAVESVIDAGHDPRRFATDLLERFRDLILLQAVPD
AASRGVVDAPEDVLERMRDQATRIGPATLTRYAEVVQAGLGEMRGATAPR
LLLEVVCARLLLPSASDTESALLQRVERIETRLDMSIPAGSTPAEPVEQP
VRFTRPSAAPKPASKPEPAAAKPEHEPEPQPEARPEPRAKPTPEPKPAPE
PARASETPSAPGELNAAAVRSMWSTVRDKVRQRSRTTEVMLAGATVRAIE
DNTLVLTHESAPLAKRLCEQRNADVIAEALKDALGVNWRVRCEAGSPASA
TAGPPHPKPEQPVPEPDSARRDEEEHMLAEAVRDEPAARRDPEEAALELL
QNELGARRIDGG
>MAP4132 end, End
MLIGSHVRNDDPLAAAQADGADAVQFFLSNPQSWKKPKPRDDAEALKASS
VPLYVHAPYLINVASANNRVRIPSRKILQDTCDAAAEINATAVIVHGGHA
DDNDMEAGFERWVKALDYLETDVQVYLENTAGGDHAMARHFDTIGRLWDR
IGDKGIGFCLDTCHAWAAGEALIDAVDRIKALTGRIDLVHCNDSRDAAGS
GADRHANFGTGQIDPDLLVTVVKAAAAPVICETADEGRKDDIAFLREHTK
S
>MAP2994c fpg, Fpg
MPELPEVEVVRRGLHAHVVGKTIGAVRVHHPRAVRRHEAGPADLTARLLG
ARITGTDRRGKYLWLLLDGCDTALVVHLGMSGQMLLGAVPRAEHVRISAL
LDDGTVLSFADQRTFGGWMLADLLEVDGSILPRPVAHLARDPLDPRFDAA
AVVKVLRRKHSEIKRQLLDQQVVSGIGNIYADEALWRAKVHGARIAATMT
GRQLTAVLDAAAEVMRDALAQGGTSFDSLYVNVNGESGYFDRSLDAYGRE
GESCRRCGAVMRREKFMNRSSFYCPKCQPRPRL
>MAP0006 gyrA, GyrA
MTDTTLPPGGDAADRVEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLK
PVHRRVLYAMYDSGFRPDRSHAKSARSVAETMGNYHPHGDASIYDTLVRM
AQPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEET
VDFIPNYDGRVQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLGELAE
AVFWALDNYEADEEATLAAVMERVKGPDFPTSGLIVGTQGIADAYKTGRG
SIRMRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRDGKLAGI
SNIEDQSSDRVGLRIVIELKRDAVAKVVLNNLYKHTQLQTSFGANMLAIV
DGVPRTLRLDQLIRHYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALD
ALDEVIALIRASETVDIARQGLIELLDIDEIQAQAILDMQLRRLAALERQ
RIIDDLAKIEAEIADLEDILAKPERQRGIVRDELAEIVEKHGDARRTRIV
AADGDVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGL
NQDDIVRHFFVCSTHDWILFFTTQGRVYRAKAYELPEASRTARGQHVANL
LAFQPEERIAQVIQIRSYEDAPYLVLATRNGLVKKTKLTDFDSNRSGGIV
AINLRDNDELVGAVLCSAEDDLLLVSANGQSIRFSATDEALRPMGRATSG
VQGMRFNADDYLLSLNVVREGTYLLVATSGGYAKRTAIEEYPVQGRGGKG
VLTVMYDRRRGRLVGALIVDEDSELYAITSGGGVIRTAAGQVRKAGRQTK
GVRLMNLGEGDTLLAIARNAEEAADEAVDESDGAAGSDG
>MAP0005 gyrB, GyrB
MAAQKKKAQDEYGASAITVLEGLEAVRKRPGMYIGSTGERGLHHLIWEVV
DNSVDEAMAGYADRVDVRILDDGSVEVADNGRGIPVAMHATGAPTVDVVM
TQLHAGGKFGGENSGYNVSGGLHGVGVSVVNALSTRLEVNIARDGYEWSQ
YYDHAVPGTLKQGEATKRTGTTIRFWADPDIFETTEYDFETVARRLQEMA
FLNKGLTINLTDERVTNEEVVDEVVSDTADAPKSAQEKAAESAAPHKVKH
RTFHYPGGLVDFVKHINRTKNPIHQSIIDFGGKGPGHEVEIAMQWNGGYS
ESVHTFANTINTHEGGTHEEGFRSALTSVVNKYAKDKKLLKDKDPNLTGD
DIREGLAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQKVCNEQLTHWFEA
NPADAKVIVNKAVSSAQARIAARKARELVRRKSATDLGGLPGKLADCRST
DPRKSELYVVEGDSAGGSAKSGRDSMFQAILPLRGKIINVEKARIDRVLK
NTEVQAIITALGTGIHDEFDITKLRYHKIVLMADADVDGQHISTLLLTLL
FRFMRPLIEHGHVFLAQPPLYKLKWQRSDPEFAYSDRERDGLLEAGLKAG
KKINKDDGIQRYKGLGEMDAKELWETTMDPTVRVLRQVTLDDAAAADELF
SILMGEDVDARRSFITRNAKDVRFLDV
>MAP1828c helY, HelY
MSPDPSAPDAATELVELTRFSSELPFALDGFQRRACAALERGHGVLVCAP
TGAGKTVVGEFAVHLALAAGGKCFYTTPLKALSNQKHTDLTARYGRDRIG
LLTGDMSVNADAPVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADR
MRGPVWEEVILHLPDEVRVVSLSATVSNAEEFGGWIQTVRGDTTVVVDEH
RPVPLWQHVLVGKRLFDLFDYRNAEAPGQPGAGREPRVNPDLLRHIAHRR
EADRLSDWQPRRGAGRGRPPARAGRPRFYRTPGRPDVIATLDAEGLLPAI
TFVFSRAGCDAAVQQCLRSPLQLTTQEERVQIAEVIEHRCGDLADADLAV
LGYYEWREGLLRGLAAHHAGMLPAFRHTVEELFTAGLVKAVFATETLALG
INMPARTVVLERLVKFNGEQHVALTPGEYTQLTGRAGRRGIDVEGHAVVL
WNPTEETTEPSAVAGLASTRTFPLRSSFAPSYNMTINLVQQMGPEQAHRL
LEQSFAQYQADRSVVGLVRGIERGQAMLDEIAAELGGPKAPILEYARMRA
RISEMERAQTRASRLHRRQAASDALAALRRGDIINIAHGRRGGLAVVLES
ARDSSDPRPLVLTENRWAGRISSADYSGNSAPVGSMPLPKRVEHRQPRVR
RDLASALRSAAAGLSIPAKRRRGDSDEGFHDPELASLREQLRRHPSHHTP
GLEAQVRQAERYLRIERDNAQLEKKVATATNSLARTFDRIVGLLTERGFI
ERRDGDPRVTDDGRLLARIYSESDLLVAECLRTGAWSGLKPAELAAVVSS
VLYESRGGEGPGTAFAAEAPT
>MAP1684 hrpA, HrpA
MAEPSFPVSGAQLRGRLDGLTIRDAARLGRRLKKLRGAAPDKLRQLADQI
AAAEAVVAARHAAVPAVSYPDLPVSERRREIADAIRAHQVVVIAGETGSG
KTTQLPKICLEAGRGIRGTIGHTQPRRLAARTVAQRIADELGSPLGGTVG
YTVRFADQVSDRTLIKLMTDGILLAEIQRDRRLLRYDTLILDEAHERSLN
IDFLLGYLRELLPRRPDLKLIITSATIEPRRFSEHFANAPVIEVSGRTYP
VEIRYRPLEIALPSATADDPDDPDHEIVRTETRDEVEAIVDAIAELEAEP
PGDVLVFLSGEREIRDTAEALTGLKHTEVLPLYARLPTAEQQKVFAPHTG
RRVVLATNVAETSLTVPGIRYVVDPGNARISRYSRRLKVQRLPIEPISQA
SAAQRAGRCGRVAPGVCIRLYSEADFAARPRYTEPEILRTNLAAVLLQMA
ALQLGDIENFPFLDPPDRRSVRDGVQLLTELGAFDRQGAITERGRRLARL
PVDPRLGRMILAAQTEGCVREMLVLAAALSIPDPRERPSDREEAARQKHA
RFADEHSDFMSYLNLWRYLREQRKELSGNQFRRLCRAEFLHYLRIREWQD
LVGQLRGIAGELGITEESGEPADPARVHAALLAGLLSHVGMRREDSREFA
GARNSRFVLAPGSVLSKRPPRWVVVAELIETTRLYGRTAARIQPEAVERV
AGDLVQRSHSEPHWDPDRGEVMAYERVTLYGLPLVSRRRVGYARIDPVLA
RELFIRHALVEGDWHTRHRFFADNARLRGELEELEERARRRDLVVGDDDV
YALYDARIPTDVVSARHFDAWWKKQRHQTPDLLTFTRADLLRTEETGDAD
RPDSWRAGDVTLPLTYRFQPGAADDGVTVHVPIDVLARLGGDEFAWQVPA
LREELVTALIRSLPKDLRRNFVPAPDTARAVLGAIDPAAEPLLPALQREL
RRRSGVSVPIDAFDLDKLPAHLRMTFAVESTDGTELARGKDLRALQERLT
TPARQAVADTVGARLQRTGLRGWPDDLDELPRVVQRSLDGRTVRGYPAFV
DTGAAVDLRVFATSAEQVRMMGAGLRRLVRLSIPSPAKAIARQLGPRTRL
ALGGNPDGSLPALLEDCADAATDALVPQPVWTRDEFTALRQRVAKTLGPT
TIELVGRVEQVLAAAQQVQLALPATPPPAQADAVADIRAQLDRLLPAGFV
TATGGARLGDLTRYLSAIRRRLDGLARAGQADRDRMRRVHAAQHAYDELV
DELPEAAKTAADVRDIAWQIEELRVSLWAQQLGTPRPVSEQRIHRAIDAA
RQRYRVALPPTLNL
>MAP3024c hupB, HupB
MNKAELIDVLTQKLNTDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVF
EQRRRAARVARNPRTGETVKVKPTSVPAFRPGAQFKAVVSGAQRLPSEGP
AVKRGVVGGAAKKTAAKKAPAKKAAAKKAPAKKAAAKKAPAKKAAVKKAP
ARKAATKAPVRKAATKAPAKKVAAKKAPAKKAATKAPAKKAASKAPARKA
AAKKTTARRGRK
>MAP3117 ligB, LigB
MSPSHAKLATVLLFDVATASADVGGTPSRLTKVARIADLLRRAAPNAALV
AIVVSWLSGELRQRQIGVGWAALRSRPPAAAHPTLTVVAVDAAFAEIGAV
AGKGAQARRAALLNALFAAATETEQTFLLRLLGGELRQGALAGIMADAVA
RAAGIPAAAVQRAAMLGGDLPAVAAAALSGEASALDAFTLRVGRPVAPML
AQTAAGVAQAIERHGGQAIFEAKLDGARVQIHRAGDQVTVYTRSLDDVTA
RLPEVVTATLALPVEALIADGEAIALRPDNSPQRFQVTASRFGRSLDVAA
AVAAQPLSVFIFDILHCDGIDLLDAPTTDRLAALDALVPPAQRVDRLLTA
DPDAAGRFLEATLAAGHEGVMAKAPGAPYQAGRRGAGWLKVKPVHTLDLV
VLAVEWGSGRRRGKLSNIHLGARDPATGEFVMVGKTFKGMTDAMLDWQTA
RFTELAVGGTDGYVVRVRPEQVVEVAVDGVQKSSRYPGGLALRFARVLRY
RDDKGPAEADTIDAVRALY
>MAP0341 ligC, LigC
MDLPVMPPVSPMLAKSVGAIPPAASYEPKWDGFRSICFRDGDEVELGSRN
ERPMTRYFPELVAAVRAELPQRCVIDGEIVIATDHGLDFEALQQRVHPAE
SRVRMLAEATPASFIAFDLLALGDEDYTGRPFRERRAALLEAVGGPGPSI
HVTPATTDLDTARRWFDEFEGAGLDGVVAKPLDITYQPDKRVMFKIKHER
TADCVVAGYRVHKSGADAIGSLLLGLYQDDGQLASVGVIGAFPMAERRRL
FTELQPLVTDFEDHPWNWAAHQAGERTPRKNEFSRWNAGKDLSFVPLRPE
RVVEVRYDHMEGRRFRHTAQFNRWRPDRDPRSCTYEQLEQPVTFRLDDIV
PGLGASRADHKPGKQIGAP
>MAP0987 mfd, Mfd
MTAPGAARPETPIAGLVELALTAPTFQQLIDTAAASPADLSLVGPASTRL
FVASALARLGPLLVVTATGREADDLTAELRGVVGDAVAVFPSWETLPHER
LSPGVDTVGARLTVLRRLAHPDDARLGPPLQVVVTAVRSLLQPMTPQLGL
VEPVTLSVGQEIEFEHVIARLVELAYSRVDMVGRRGEFAVRGGILDVFPP
TAEHPVRVEFWGDEVSEMRMFSVADQRSIPEIAVDTVISVPCRELLLTED
VRARAAELAAQHPASEPAITGSVSDMLAKIADGIAVDGMEALLPVLRPGK
QVLLTDQLADRTPVLLCDPEKIRTRAADLIKTGREFLEASWSVAALGTLE
NQAPIDVEQLGGSGFAELDEVRAAAVRGGHPWWTLSQLSDESAVELDVRA
APSARGHQHDIDGIFAMLRAHVSTGGHAAVVAPGTGTAHRVVERLAECDT
PAAMLESGAAPRAGVVGVLKGPLHDGIVIPGANLVVITETDLTGSRVAAV
EGKRLAAKRRNTVDPLALTAGDLVVHDQHGIGRFVEMTERTVGGARREYL
VLEYASSKRGGGSDKLYVPMDSLDQLSRYVGGQAPALSKLGGSDWANTKT
KARRAVREIAGELVSLYAKRQASPGHAFSPDTPWQAEMEDAFGYTETVDQ
LTAITEVKSDMEKPIPMDRVICGDVGYGKTEIAVRAAFKAVQDGKQVAVL
VPTTLLADQHLQTFTDRMAGFPVTVKGLSRFTDAAESRAVIEGLADGSVD
IVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVDVLTMS
ATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQVAAALRRE
LLRDGQAFYVHNRVSSIDRAAARVRELVPEARVVVVVAHGQMPEERLERT
VQGFWNREYDILVCTTIVETGLDISNANTLIVERADTFGLSQLHQLRGRV
GRSRERGYAYFLYPPHAPLTETAYDRLATIAQNNELGAGMAVALKDLEIR
GAGNVLGVEQSGHVAGVGFDLYVRLVGEAVEAYRAAADGQTVTTAEEPKD
VRIDLPVDAHLPPDYIASDRLRLEAYRRLAAAGSDDEIDAVVEELVDRYG
ALPEPALRLVAVARLRLLCRAAGITEVSAPSAATVRLSPITLPDSAQVRL
KRMYPAASYRATTSTVQVPIPRAGGVGAPRLRDVELVQMVANLVTALQGK
PQTDVGTGTPVAAMASEEGRG
>MAP2621c mutT2, MutT2
MPTQIVVAGALIRGSRLLVAQRARPPELAGRWELPGGKVAPGETERDALA
RELAEELDLRAGDIAVGDRLGGDIAVDGGMTLRAYRVRLLRGRPDARDHR
ALRWITAAQLHDLDWVPADRGWLGDLARVL
>MAP3896 mutT3, MutT3
MHGDGDGWVISERGAHYWGRYGAAGLLLRAPQPDGTPAVLLQHRAVWSHQ
GGTWGLPGGARDSHETPEETAVREANEEAGLLVDRLAVRASVVTAEVAGV
GGTRWSYTTVIADADELLHTVPNRESAEMRWVAEDEVADLPLHPGFAASW
HRLRTEPARLPLSHGDERRQYLPRTIELEDGVFVWCMPGDPADTDEASAQ
LCRRISALLPAPS
>MAP0469c mutY, MutY
MADIMPQPPADGSPLISVTDLLEWYRVARRDLPWRAPGVSAWQILVSEFM
LQQTPVSRVLPIWPDWVRRWPTPSATAAASAADVLRAWGKLGYPRRAKRL
HECATVIARDHGDVVPDDVDTLLTLPGVGGYTARAVACFAYRRPVPVVDT
NVRRVVARAVHGQADAGAPSAGRDHADVAALLPGDGSAPEFSVALMELGA
TVCTARAPRCGLCPLRRCAWREAGHPPATGPARRVQTYAGTDRQVRGRLL
DVLRGNDSPVTRAELDVAWLTDTAQRDRALYSLLADGLVTQTTDGRFALA
GEE
>MAP3416 nei, Nei
MPEGDTVWHTAAVLREHLLGETLTRCDIRVPRFATVDLTGQVVDEILSRG
KHLFIRVGAASIHSHLKMEGSWRVGPRVRVDHRARIVLETGAATAVGVDL
GVLQILDRDRDGEAVAHLGPDLLGEDWDPARAAANLAARPQRPIAEALLD
QRVLAGIGNVYCNELCFVSGHLPTTPVSAVADPRRLVSRARDMLWLNRFR
WNRCTTGDTRNGRQLWVYGRAGQPCRRCGTPIEFDDSGDRVTYWCPSCQR
>MAP0400 nth, Nth
MTTAKSSGRAKSTPAPPGDAAARRWSTESRVALVRRARRMNRILAQAFPE
AHCELDFTTPLELTVATILSAQSTDKRVNLTTPALFKRYTCALDYARADR
DELENLIRPTGFFRNKASALIRLGQALVERFDGEVPATMAELVTLPGVGR
KTANVILGNAFGVPGITVDTHFARLVHRWRWTAEKDPVKIEHAVGELIER
SEWTMLSHRVIFHGRRVCHSRKPACGVCLLAKDCPSFGLGPTEPPLAAQL
VRGPETEHLLALAGL
>MAP2445 ogt, Ogt
MIEYRTVDSPIGLLTLAGRDPVLTNLRMVDQTYEPSRTGWTENPRAFAGA
VEQLGAYFAGELTEFDIELDLRGSEFQRRVWRALQTIPYGETRSYGEIAE
QIGAPGAARAVGLANGHNPIAIVVPCHRVIGASGKLTGYGGGLDRKQTLL
ALERRHSPASLTLFD
>MAP3081 phr, Phr
MPALLWFRRDLRLHDHPALSAAADSDEVLACFVLDPRLQRSSGPRRLQFL
GDSLRVLRDELDGRLLVTRGRPDIRIPEIAKAIGASSVHVSEDFTPFGKR
RDARVRAALASVPLVATGSPYLVAPGRVTKPDGSPYQVFTPFLRRWRDTG
WRPPVKTGAASARWLDPARLGITDCEIPDPGATLDLAAGEEAARNKWKSF
VDNGLANYADDRHRPDIEGTSRMSAHLKFGTIHPRTLVADLDLRAAAARA
YLRELAFRDFYADVLHHRPASAWRNWNSAFDAIRTDTGAEAGRRFAAWKA
GETGFPFVDAGMRQLRQTGFMHNRVRMTVASFLVKDLHLPWQWGAHWFLQ
QLVDGDLANNQHGWQWCAGCGTDAAPYFRVFNPAAQGEKFDPSGDYIRRW
VPELRCADDPHLRTGERPPGYPAPIVDHATERAEALRRYRSM
>MAP0018c pknA, PknA
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKQEFSQ
DPEFIERFRAEARTTAMLNHPGIAAVHDYGESQLDGEGRTAYLVMELVNG
EPLNSVLKRTGRLSLRHALDMLEQTGRALQVAHAAGLVHRDVKPGNILIT
PTGQVKITDFGIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDATPASDV
YSLGVVGYEVVSGKRPFSGDGALTVAMKHIKEPPPPLPAELPPNVRELIE
ITL
>MAP3387c pknD, PknD
MSDNGPAAQVGSWFGPYRLVRLLRQGGMGEVYEAEDTRKHRLVALKLISQ
QFSGNPEFSARLQREADIAGRLTEPHVVPIHDYGEIDGRFFVEMRLVDGI
DLGSLLHREGPLAPPRAIAIIRQVAAALDAAHAAGVTHRDVTPGNILVTP
SDFAYLADFGIARAASDPGLTQVGTAIGTYYYMAPERFTDDEVTNSVDIY
SLACVLTECLTGVPPYRADTVERLVAAHLTKTAPPLSQLRPGAFPPALDR
VIAKGMAKRPEDRYRTAGEFAAAAHEALTTSEQRKAATILRDGQIAALGA
GAAEQRSTHWPDSFAPSPSAETVVGPSPARAGAPSSGLIRAAPTGSGRVY
APGPDFGRPAAPTDNKRKQWIIVGAVALVALVAFVVAVVGYLSTASSGPA
KQAGGQSVLPFNGIDFRLSPGGVTLDGTGNVYVTSEGMYGRVVKLAAGSG
ATTVLPFNGLYQPQGLAVDGAGTVYVADFNNRVLSMAAGSNSQKELPFSG
LNYPEGVAVDSQGGVYVADRGNSRVLKLAAGSQNQTVLPFTGLNNPDGVA
VDPAGNVYVADTDNNRVVKLDAASNTQSELPFHDLSVPWGIAVDNGGTVY
VTEHDKNDVMKYPPGATSGTVLPFTALNTPLAVAVDRDQSVYVADRGDDR
VVKLVQ
>MAP1049c pknE, PknE
MTLDPDSFGHYRILELLGRGGMGRVYRAYDATTDRVVALKVLPPHLAEDQ
DFQQRFRREARIAAGLNDPHVVPIHGYGEIDGRLYVDMRLIEGRDLAHYI
TENGGRLSPQRAVAVIEQVAAALDSAHRAGLIHRDVKPMNVLVTTARDFV
YLIDFGLARAQADTALTQTGATMGTVAYMAPERFTGTTDHRADVYSLACV
LHECLTGKRPFAGDSLEEQLNAHLNTAPPRPSATAPEVPAAFDAVIARGM
AKDPERRYQSVTELAEAARAALAPGVVEKPSAPTPQPRAARRVRAAVVGA
SALTLAVVAAVVVAMVTHGHGPRGAAPKTPGSPAPGRPAPPLPAFVAPPD
LGANCQYRAVPDPSSRPVSPPPSGRVPTTPGQIGAVIATNLGDIGISLAN
SESPCAVNSFISLARQRFFDNTQCARLVDSPDGGSLLCGGPDVDGSGGPG
YEFADEYPANQYRPDDPALRATLLYPRGTVVMATEGPNTNGSQFALIFHD
SEMDPQSTVLGTIDPAGLATLDKIARAGIAGNRPSGPPANPVTITSVRIG
>MAP1332 pknF, PknF
MTIGNGASFAGYTILRQLGAGGMAEVYLALHPRLPRRDVIKVLAEAVTVD
PEFRERFNREADLAATLWHPHIVGVHDRGEFNGHLWISMDYVEGTDASRL
VKESYPDGMPLDEVSAIVQAVAGALDYAHARGLLHRDVKPANILLTHPEA
GERRILLADFGVARHLGDISGITETNVAVGTVAYAAPEQLTGSPIDGRAD
QYALAATAFHLLTGAPPFQHSNPIAVIGQHLHEDPPRLSDFRPELAGLDE
VFCQALAKAPEDRFDRCRAFAAAVRRECDGAAAIGPDARSRSVASPPHRR
RGPGRVIAAVTHRFSSQTRWAAALVCAVLVAVAATWSVLYSFQPGAPPAN
PALASKPSPPAAVAAPIAGGPVLNGTYKLDYDQTKRTTNGIGIRHDGAGT
NWWAFRSACTSSGCAATGTRLDDATHQTAGGPDGGQTDTLRFVGGYWQGA
PEQQRVGCTRPGGPAGATQQETIAWSLAPQSDGTLRGTETETVLSNECGA
QGAVVRVPVVATRVGDVPPGVTVADPASVINASPTATAPAPPVLGGLCSD
VGKVAYDPTNNEQIVCEGSSWAKAPITMGVHAAGSSCDRPGTSVFAMSTS
SDGYLLQCDPVTRTWTRPAG
>MAP3893c pknG, PknG
MAEPDNKSEQPEPGAEQMGPGTQPAEVGDDAQAGAATGRLQATQALFRPD
FDDDDDDFPHISLGALDTDSADRMTVATQALPPVRQLGGGLVEIPRGRDI
DPREALMTNPVVPESKRFCWNCGKPVGRSTKKSKGTSEGWCPHCGSAYSF
LPQLNPGDIVANQYEVKGCIAHGGLGWVYLAVDHNVNDRPVVLKGLVHSG
DAEAQAIAMAERQFLAEVVHPQIVQIFNFVEHVDRHGNPVGYIVMEYVGG
QPLRHGKGEKLPVSEAIAYVLEILPALGYLHSIGLVYNDLKPENIMLTEE
QLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRTGPTVATDIYTVGRTLAA
LTLNLPTRNGRYVDGIPDNDPVLGTYDSFRRLLRRATDPDPRRRFSSTEE
MSAQLMGVLREVVAHDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDG
QVHSEKLTAREIVTALQVPLVDPADVAAPVLQATVLSQPVQTLDSLRAAR
HGTLDADGVELSESIELPLMEVRALLDLGDVAKATRKLDDLAERVGWQWR
LVWYKAVAELLTGDYDSATTHFTEVLDTFPGELAPKLALAATAELAGDVD
EHRFYETVWKTNDGVISAAFGLARTLSAEGDRAAAVRTLDEVPATSRHFT
TARLTSAVTLLSGRSKSEITEEEIRDAARRVEALPPTEPRVLQIRALVLG
CAMDWLEDNKASTNHILGFPFTEHGLRLGVEAALRNLARVAPTQRHRYAL
VDMANKVRPTSTF
>MAP1914 pknL, PknL
MLDGRYLIESKIASGGTSTVYRGVDTRLDRPVAVKVMDPRYAGDDQFLTR
FQREARAVARLKDPGLVAVYDQGLDARHPFLVMELIEGGTLRELLGERGP
MPPYAVAAVLRPVLGGLAAAHRAGLVHRDVKPENVLISDDGEVKIADFGL
VRAVAAAGITSASVILGTAAYLSPEQVRDGAATPRSDVYAAGIVAYELLT
GRTPFTGDSMLAIAYRRLDADVPPPSAAIDGVPAQFDDFVQRATARDPAD
RYADAVEMGADLDAIADELALPGFRVPAPRNSALHRSAALHREAGRRAPA
AEPPARHPTRHLTRGPEEWPQPDPPAHVGAEPDDDEDDYEYQSVTGEFAG
IPISEFVWARQHNRRMVLVWLALVLAVTGMVATAAWTIGRNLNGLF
>MAP1322 polA, PolA
MPKVRAVPATKAATKTATKTAAGAGEEQNRPTLMLLDGNSLAFRAFYALP
AENFKTRGGLTTNAVYGFTAMLINLLRDEAPTHIAAAFDVSRQTFRSDRY
PEYKANRSATPDEFHGQIDITKEVLAALGITVLAEPGFEADDLIATLATQ
AENEGYRVLVVTGDRDSLQLVSDNVTVLYPRKGVSELTRFTPEAVVEKYG
LTPQQYPDFAALRGDPSDNLPGIPGVGEKTASKWIAEYGSLQALVDNVDS
VRGKVGDALREHLASVIRNRELTDLIKDVPLAQTPDTLRLQPWDRDQIHR
LFDDLEFRMLRDRLFETLAAVEPEVDEGFDVRGGALEAGELAVWLAEHSS
GKRFGLAVAGTHLAYDADCTALAIVSADGEGRYIDTSALDPDDEAALASW
LADPGPPKALHEAKLAMHDLAGRGWTLAGVTSDTALAAYLVRPGQRSFSL
DDLALRYLRRELRAENPQQQQLSLLDDTDGTDDQAVQTLILRAVAVLDLA
DALDEELARINSTSLLSGMELPVQRVLAGMENAGIAVDLDLLSELQSDFA
HQIRDAAEAAYAVIGKQINLGSPKQLQAVLFDELQMPKTKRTKTGYTTDA
DALQSLFDKTGHPFLQHLLAHRDATRLKVTVDGLLNSVASDGRIHTTFNQ
TIAATGRLSSTEPNLQNIPIRTEAGRRIRDAFVVGDGYAELMTADYSQIE
MRIMAHLSQDAGLIEAFNTGEDLHSFVASRAFGVPIEEVTGELRRRVKAM
SYGLAYGLSAYGLSQQLKISTEEAKEQMDAYFARFGGVRDYLHAVVEQAR
KDGYTSTVLGRRRYLPELDSSNRQVREAAERAALNAPIQGSAADIIKVAM
IEVDKAIKEAGLRSRILLQVHDELLFEVAAGEREQLEALARDKMGGAYPL
DVPLEVSVGYGRSWDAAAH
>MAP1130 priA, PriA
MLSVPHLDREFDYLVSAEQSDDAQPGVRVRVRFHGRLVDGFLLERRHDTD
HQGKLGWLDRVVSPEPVLTAEIRRLVDAVAARYAGTRPDVLRLAVPARHA
RVEREARAAAGIPMPAPVDPSGWDGYGRGAQFLAALAESRAARAVWQALP
GEPWTDRFAEAAAQTVRTGRSVLAIVPDQRDLDALSHAVTSRIDATGVVA
LSAGLGPAARYRRWLAALRGSARVVIGTRSAVFAPLSDLGLVMVWADADD
SLAEPRAPYPHAREVAMLRAHQARCAALIGGYARTAEAQALVRSGWAHDI
VAARPVVRARTPRVIALDDTGYADERDPAARTARIPSVALRAARSALQDG
APVLVQVPRRGYVPSLACARCRTIARCRHCTGPLSLAESPGEGGGGLVCR
WCGRVDPTRRCARCGSDAVRAVVVGARRTAEELGRAFAGTTVITSSGDAV
VPEVADGPALVVATPGAEPRARGDYGAALLLDTWALLGRQDLRAAEDALW
RWMTAAALVRPRGDGGVVMVVAESSLPTVQSLIRWDPVGHAESELTARAE
VGLPPSVHMAAVDGTPAAVNALLDEAGLTGGQTLHADLLGPVDLPPGARR
PAGTPPGAPVTRMLVRVPRRDGLALAAALRRGVGVLSARQTHEPARVQID
PLHIG
>MAP2848c recA, RecA
MSAGLSRLLRRPRRVELESNTCSATVGDVRSVNLSVAFSSVTANRPIPVR
RHTDYRRGTMTQAPDREKALELAMAQIEKSYGKGSVMRLGDEMRQPISVI
PTGSIALDVALGIGGLPRGRVVEIYGPESSGKTTVALHAVANAQAAGGVA
AFIDAEHALDPEYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDI
LVIDSVAALVPRAELEGEMGDSHVGLQARLMSQALRKMTGALNNSGTTAI
FINQLREKIGVMFGSPETTTGGKALKFYASVRMDVRRIETLKDGTNAVGN
RTRVKIVKNKVSPPFKQAEFDILYGRGISREGSLIDMGVDQGFIRKSGSW
FTYEGEQLGQGKENARTFLMENDEVANEIEKKIKEKLGIGAVVTDDLSDD
GVLPAPVDF
>MAP4094c recC, RecC
MPLHLHRAERTDLLADGLGALLANPPADPFALELVVVPARGVERWLSQRL
SHVLGRARDGDGVCAGVQFRSPGSLIAEITGTADDDPWSPDALAWPLLEA
IDCSLDEPWCRTLATHLGHFDTGAEAELRRTRRYEAARRLAGLFAGYARQ
RPRLLVDWLDGDPGDLDEDLRWQPPLWRALLARVAADPPHVRHAETIARL
RRAPSDLPQRLSLFGHTRLACTDVELLDALATHHDLHLWLPHPSDRLWRA
LAGTHGVIPRRDDTSHRAVAHPLLATLGRDLRELQRGLPADPRTDEYLAG
DARPDTLLGWLQSDIAANAVRPVGRSLAAGDRSVQIHNCHGPARQVDVLR
EMLLGLLADDPTLQPRDILVMCPDIESYAPLIMADFGLGDVVHGAHPAHQ
LRVRLADRSLIQTNPLLGVAAQLLALAGGRVRASEVLNLAQTPAVRARFG
FTDDDLDTITGWVRQANIRWGLDRQHRRPYHVDFVHNTWRFGIDRILAGV
AMSDDSNAWIDATLPLDDVSSNRVQLAGQLAEFVARLQHVLDSLTGARPL
TDWLTAVADGIALLTRVRDADEWQSGQLQREFAAVSAQAGSRADTVLRLP
DVRALLTRHLSGRPTRANFRTGTLTVCTMVPMRSVPHRVVCLVGLDDGLF
PRLGVVDGDDALARCPMTGERDIRSEDRQLLLDAIGAATEKLVITYTGAN
EYSGQPRPPAVPLAELLDTLDITTAEPVRDRIVVRHPLQPFDIRNIIPGE
LVPGVPFSFDPTVLRAAHAATGEHCEPPKFISAPLPPPPPADVILADLVG
FFKDPVQGFFRALQFTLPHDVDGVQDAMPVDIDALQEWTVGDRMLRDMLR
GMTPEDARQAEWRRGTLPPGQLGWRRVTEIRDQAARLALEAHQHRAEQPG
GAHDVDIDLGRGRRLTGTVTPVYGDRLVAVTYSRLDGRHLLESWIPLLAL
AAHDRGRDWSAVCIGRMRRGTGIRVQRLGSPDDDPVELLRELVSIYDAGR
REPIPLPLKTSYAWAAARYGGDDPVAEARYRWKSSDRYPGEDQAPAHVRA
WGRGAPLDDLMRPVRPGEECDGEDNRLGAYAARLWLPMLRAERTTA
>MAP4091c recD, RecD
MTADVLPERVFDAGPLRPFADAGIFEAADVRVAQRLTALTGESDDRVALA
VALLVRALRGGSVCVDLRAVPAQVGAADLPWPAAGDWLAAVRASPLLGPP
PVLRFFGDLLYFDRYWLEEEQVCTDLLALSAPAGDVESSCYERLFPPGYE
EQRAAARIAVSQALTVLTGGPGTGKTTTVARLLALLVEQAERAGEPRPRI
ALAAPTGKAAARLAEAVAAEIEHLDPADRARLAGLTGTTLHRLLGPRPDT
SVRFKHNRGNRLPHDIIVVDETSMVSLTMMARLAEAVRPDTRLILVGDPD
QLASVEAGAVLADLVDGLAGRAGVRVAALATPHRFGSAIGALAAAIRAGD
ADRVLELLAAGGEHIEWVDSERPADRLREVLVSHALRLRSAALLGAAQAA
LATLDEHRLLCAHRDGPHGAVRWNTLVHNWIAEETGQPAWSQWYAGRPLL
VTANDYGLRLYNGDTGVTVAGADGLRAVIAGAAGPLDFATSRLGEVETMH
AMTIHKSQGSQADEVTVLIPPEDSRLLTRELFYTAVTRAKSRVRVVGPEA
SVRAAIERRAIRATGLRQRLRAVG
>MAP0003 recF, RecF
MYVRHLGLRDFRSWAHADLELQPGRTVFIGSNGFGKTNLLEALWYSSTLG
SHRVGTDAPLIRAGADRAVVSTIVVNDGRECAVDLEIAAGRANKARLNRS
PVRSTREVLGVLRAVLFAPEDLALVRGDPSERRRYLDDLATLRRPAIAAV
RADYDKVLRQRTALLKSLSGARHRGDRGALDTLDVWDSRLAEYGAQLMAA
RIDLVNQLAPEVEKAYQLLAPGSRAASIGYRSSLGAAASAEVNAGDRDYL
EAALLAGLAAHRDAELERGMCLVGPHRDDLELWLGEQVAKGFASHGESWS
LALSLRLAAFELLRADESDPVLLLDDVFAELDAARRRALAAVAESAEQVL
VTAAVLEDIPTGWQARRLFVELRDTDAGRVSELRP
>MAP3009c recG, RecG
MASLTDRLDFVVGAKAAEQLEELFGIRTVDDLLRHYPRSYTEGASRWGAD
DERPPAGEHITIIDTITETKTWPMKKTPKKVCHRITLGAGRNKVTATFFN
ANYLKKGLTEGTKVMLSGEVGFFKNVMQLTHPAFLILDSPDGRNKGTRSL
KNIANASGASGEAVLDAYERHFFPIYPASTKMQSWDIFSCVRLVLDVLDP
VPDPLPEPLRAKFDLVCEDQALRDIHLAENEARRQRARERLTFDEAVGLQ
WALVARRHGELSESGPPAPPRPDGLAAELLRRLPFELTAGQREVLDVLSD
GLASTRPLNRLLQGEVGSGKTIVSVLAMLQMVDAGYQCALLAPTEVLAAQ
HLRSIRDVLGPLAMAGQLGGADNATRLALLSGSMTAAQKKQVRDEVAGGQ
VGIVVGTHALLQDAVEFHNLGMVVVDEQHRFGVEQRDRLRAKARPGVTPH
LLVMTATPIPRTVALTVYGDLETSTLRELPRGRQPITSNVIFVKDKPAWL
GRAWRRIGEEVAAGRQAYVVAARIDESDDDGAADQNAKAPETAEGLYARL
RSQELAQLRLGLMHGRLSAEEKDAVMAAFRAGDIDVLVCTTVIEVGVDVP
NATVMLVMDADRFGISQLHQLRGRIGRGEHPSLCLFASWAAPDSPAGRRL
TAVAETMDGFALADLDLKERREGDVLGRNQSGRAVTLRLLSLADHQEYIE
AARDFCVQAYAGNRFDPGLSLLAARFTDTDRIEYLDKS
>MAP1403 recN, RecN
MLTEIRIESLGAISAAVGEFDRGLTVLTGETGTGKTMVVTGLHLLGGARA
DATRVRSGADRAIVEGRFTTTDLDETLVVRLDEMLDASGAERDEDGSVIA
LRSVNRDGPSRAYLGGRSVPAKSLGDFTAELLTLHGQNDQLRLMRPEEQC
GALDRFAKAGPALERYTKLRDAWLSARRDLADRRNRMRELALEADRLTFA
LGEIDAVDPQPGEDDALVAEIMRLSELDTLREAAAAARAALSADDADGSG
LSAVDTLGKARAALESTDDPKLLALAGQIGEVLTVVVDAAGELAGFLEEL
PVDASALEAKLARQSELRGLTRKYAADIDGVLAWARESRERLAQLDVSEE
GLSALAARVEELGRELAGAAADLSTIRRKAAKRLAKEVTAELSGLAMADA
QFSIDVSTDVVPAGSGSDDAAVLTLPSGERVRAGADGIDQVEFGFAAHRG
MDRLPLAKSASGGELSRVMLALEVVLAASRKETVGTTMVFDEVDAGVGGR
AAVQIGRRLARLARTHQVIVVTHLPQVAAYADVHLVVHGAGPRGTSVVRR
VTGDERVAELARMLAGLGESDSGRAHARELLDAAQKDEI
>MAP0316c recR, RecR
MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPPDIDRLTAVLARVR
DGVRFCAVCGNVSDDERCRICSDPRRDASVVCVVEEPKDVQAVERTREFR
GRYHVLGGALDPLSGVGPDQLRIRELLSRIGERVDDVDITEVIIATDPNT
EGEATATYLVRMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRR
AMV
>MAP3312 rhlE, RhlE
MTTPTSTTELTFAQLGVRDEIVRALDEKGIQHPFAIQELTLPLALAGDDL
IGQARTGMGKTFAFGVPLLQRITAGTAPRALNGTPRALVVVPTRELCLQV
TDDLTLAAKHLTADGGRPLSVVPIYGGRPYEPQIDALRAGADVVVGTPGR
LLDLAQQGHLQLGGLSVLVLDEADEMLDLGFLPDIERILRQIPDDRQSML
FSATMPDPIITLARTFMNQPTHIRAEAPHSAATHDTTVQYVYRAHALDKV
ELVSRVLQAESRGATMIFTRTKRTAQKVADELAERGFKVGAVHGDLGQVA
REKALKAFRTGDIDVLVATDVAARGIDIDDVTHVINYQIPEDEQAYVHRI
GRTGRAGKAGVAVTLVDWDELARWALIDKALGLDVPEPAETYSNSPHLYE
ELGIPAGAGGRVGAARKPQGPRRSAERATGKPDQKSETATRRSGTRRRRT
RGGQPVSGHPSGNGAASSNGEAAADAPAGSPPGNPGSSRRRRRRRKPADA
TAQSN
>MAP2970c rnhB, RnhB
MATTWPPRTVIRKSSGLRTLESALYRSGLGPVAGVDEVGRGACAGPLVVA
ACVLGPGKLESLAALDDSKKLTESVRERLFPVIRRYALAYHVVFIPPAEV
DRRGVHVANIEGMRRAVAGLSLRPGYVLSDGFRVPGLAVPSLPVIGGDAV
AACIAAASVLAKVSRDRLMVAMDAEHPGYGFADHKGYSTPAHSAALARLG
PCPQHRHSFINVRRVANGSGGRVVADCKPDLPLQRDEGR
>MAP1037 ruvA, RuvA
MIASVRGEVLEVALDHAVIEAAGVGYRVNATPSTLSTLRTGTQARLITAM
IVREDSMTLYGFTDAETRDLFLTLLSVSGVGPRLAMATLAVHDAGALRQA
LHDGDVAALTRVPGIGKRGAERMVLELRDKIGAAGAAGAPAGAARNGHAV
RGPVVEALVGLGFAAKQAEEATDKVLAAEPEAGTSGALRAALSLLGKSR
>MAP1038 ruvB, RuvB
MTAHDADWSDRDVSGALVPGEGDIDVSLRPRSLREFIGQPRVREQLQLVI
EGAKNRGGTPDHILLSGPPGLGKTSLAMIIAAELGSSLRVTSGPALERAG
DLAAMLSNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGA
TSIPLEVAPFTLVGATTRSGALTGPLRDRFGFTAHMDFYEPAELQQVLAR
SAGILGIELGAEAAEEIARRSRGTPRIANRLLRRVRDFAEVRADGVITRD
VAKAALAVYDVDELGLDRLDRAVLTALTRSFGGGPVGVSTLAVAVGEEAA
TVEEVCEPFLVRAGMVARTPRGRVATAQAWTHLGMVPPAGAAGLGQPGLF
D
>MAP1036 ruvC, RuvC
MRVMGVDPGLTRCGLSVVESGRGRTVVALDVDVVRTPSDAPLAERLLSIS
DAVEHWLATHQPDVVAIERVFSQLNVTTVMGTAQAGGVVALAAAKRGIGV
HFHTPSEVKAAVTGNGAANKAQVTAMVTRILALQAKPTPADAADALALAI
CHCWRAPMIARMARAEALAAQQRQKYKDKVDATLRAAR
>MAP0068 ssb, Ssb
MAGDTTITVVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQSGEWKDG
EALFLRCNIWREAAENVAESLTRGSRVIVTGRLKQRSFETREGEKRTVVE
VEVDEIGPSLRYATAKVNKASRSGGGGGGFGGGSRQQPAPASSAPADDPW
GSAPASGSFGGGDDEPPF
>MAP2567c tagA, TagA
MSDDEPIRCGWATARSGPDFELYRDYHDREWGQTVRDGVALFERMSLEAF
QSGLSWLTILRKRENFRRAFSGFDIDAVAGYTDADVQRLMADPGIVRNRA
KIEATIANARAAAELGGGTQLAELLWSFAPAPRSRPADASEIPSATAEST
AMAGELKRRGFRFVGPTTAYALMQATGMVDDHIRGCWVPAAVR
>MAP0425 topA, TopA
MADPKLKENTSGGNGSRRRLVIVESPTKARKLASYLGSRYIVESSRGHIR
DLPRAAADVPAKYKSEPWARLGVNVDADFEPLYIISPEKKHTVSELKGLL
KDVDELYLATDGDREGEAIAWHLLETLKPNIPVKRMVFHEITEPAILEAA
ENPRDLDIDLVDAQETRRILDRLYGYEVTPVLWKKVAPKLSAGRVQSVAT
RIIVQRERDRMAFRSASYWDIVAQLDASVSDPQASPPTFAARLTSVDGLR
VATGRDFDSQGQLRKADEVIVLDEPQATELAAGLQGAQLSVASVEEKPYT
RRPYAPFMTSTLQQEAGRKLRFSAERTMSIAQRLYENGYITYMRTDSTTL
SGSAINAARTQARQLYGEEYVSPSPRQYTRKVKNAQEAHEAIRPAGETFA
TPDAVRRELDGDEFRLYELIWQRTVASQMADARGTTLSLRIGGQAGDRQV
VFSASGRTITFAGFLKAYVETVDELAGGEADDAESRLPQLTQGQRLDAIE
LTPDGHATNPPPRYTEASLVKALEELGIGRPSTYSSIIKTIQDRGYVHKK
GSALVPSWVAFAVTGLLEQHFGRLVDYDFTAAMEDELDAIASGQERRTDW
LNNFYFGGEHGVADSIARSGGLKKLVGVNLEGIDAREVNSIKLFDDEQGR
PVYVRVGKNGPYLERMVTGDDGEPTPQRANLNDSLTPDELTLEVAEQLFA
TPQEGRVLGVDPATGHEIVAKDGRYGPYVAEVLPEPPPDDDDGSGAPAKK
GKKPTGPKPRTASLLRTMDLQTVTLEDALRLLSLPRVVGVDPETGEEITA
QNGRYGPYLKRGNDSRSLASEEQLFDITLEEALKIYAEPKRRGRQGAAAP
PLRELGADPATGKPMVIKDGRFGPYVTDGETNASLRKGDDVMSLTDQRAA
ELLADRRARGPAKRTAKKTSRKAPAKKAAKRG
>MAP3016c ung, Ung
MTARPLNELVEPGWARALQPVAEQVARMGQFLRAEIAAGRRYLPAGPNVL
RAFTYPFDEVKVLIVGQDPYPTPGHAVGLSFSVAPDVSPLPRSLANIFQE
YTADLGHPPPSCGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEPVTE
CAIRALTARQQPLVAILWGRDASTLKPILAQGNCVAIESPHPSPLSASRG
FFGSRPFSRANKLLAEMGADEIDWRLP
>MAP1341 uvrA, UvrA
MADRLIVKGAREHNLRGVDLDLPRDALIVFTGLSGSGKSSLAFDTIFAEG
QRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTNRNPRSTVG
TITEVYDYLRLLYARAGTPHCPVCGERIARQTPQQIVDQVLAMPEGTRFL
VLAPVVRTRKGEFADLFEKLNAQGYSRVRVDGVVHPLTDPPKLKKQEKHD
IEVVVDRLTVKASAKQRLTDSVETALNLADGIVVLEFVDHEHDAHNREQR
FSEKLACPNGHALAVDDLEPRSFSFNSPYGACPECSGLGIRKEVDPDLVV
PDPERTLAEGAVAPWSTGHTAEYFTRMMAGLGDELGFDVDTPWRKLPAKA
RKAILEGSDHQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMAQTESEQMK
ERYEGFMRDVPCPVCEGTRLKPEILAVTLAAGRHGKKSIAEVCELSISDC
AEFLNTLTLGPREQAIAGQVLKEIQSRLGFLLDVGLEYLSLSRAAATLSG
GEAQRIRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLIETLTRLRDLGN
TLIVVEHDEDTIAHADWIVDIGPRAGEHGGQIVHSGTYAELLANQESITG
AYLSGKESIEMPAIRRPVDRRRQLTVVGAREHNLRGIDVSFPLGVLTSVT
GVSGSGKSTLVNDILATVLANRLNGARLVPGRHTRVTGLDHLDKLVRVDQ
SPIGRTPRSNPATYTGVFDKIRTLFAATTEAKVRGYQPGRFSFNVKGGRC
EACTGDGTIKIEMNFLPDVYVPCEVCHGARYNRETLEVHYKGKTISEVLD
MSIEEAAEFFEPITGIHRYLRTLVDVGLGYVRLGQPAPTLSGGEAQRVKL
AAELQKRSTGRTIYILDEPTTGLHFDDIRKLLKVINGLVDKGNTVIVIEH
NLDVIKTSDWIVDMGPEGGAEGGTVVAQGTPEDVAAVPESYTGKFLAEVI
GRGGAAAGAPRRSSRRRKATA
>MAP1335 uvrB, UvrB
MAFATEHPVVAHSEYRPAEEAVEGLVRAGGRFEVVSPHAPAGDQPAAIDE
LERRIRAGERDVVLLGATGTGKSATTAWLIERLQRPTLVMAPNKTLAAQL
ANELREMLPHNAVEYFVSYYDYYQPEAYIAQTDTYIEKDSSINDDVERLR
HSATSSLLSRRDVVVVASVSCIYGLGTPQSYLDRSVELRVGTEVPRDALL
RLLVDVQYTRNDLSFTRGSFRVRGDTVEIIPSYEELAVRIEFFGDEIEAL
YYLHPLTGDVIRQVDSLRIFPATHYVAGPERMAHAISTIEQELAERLAEL
EGQGKLLEAQRLRMRTNYDIEMMRQVGFCSGIENYSRHIDGRGPGSPPAT
LLDYFPEDFLLVIDESHVTVPQIGGMYEGDMSRKRNLVEYGFRLPSACDN
RPLTWEEFADRIGQTVYLSATPGPYELSQTGGEFVEQVIRPTGLVDPKVV
VKPTKGQIDDLIGEIRKRTEADERVLVTTLTKKMAEDLTDYLLEMGIRVR
YLHSEVDTLRRVELLRQLRLGEYDVLVGINLLREGLDLPEVSLVAILDAD
KEGFLRSARSLIQTIGRAARNVSGEVHMYADTITESMKEAIDETERRRAK
QIAYNEAHGIDPQPLRKKIADILDQVYREADDTDTVEIGSGRSMSRGRRA
QGEPGRAVSAGIVEGRDTTNMPRAELADLIKDLTAQMMAAARDLQFELAA
RFRDEIADLKKELRGMDAAGLK
>MAP1146 uvrC, UvrC
MPDPATYRPAPGSIPVEPGVYRFRDPHGRVIYVGKAKSLRSRLTSYFADV
ANLHPRTRQMVTTAAKVEWTVVNTEVEALQLEYNWIKEFDPRFNVRYRDD
KSYPVLAVTLNEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRV
FPARTCSAGVFKRHKQIDRPCLLGYIDKCSAPCIGRVSAEQHRQIVDDFC
DFLSGKTDRFARELEQQMNAAAAELDFERAARLRDDLGALKRAMEKQAVV
LGDGTDADVVAFADDELEAAVQVFHVRGGRVRGQRGWIVEKSADPGDSGE
EQLVEQFLTQFYGEQAELGGAADESVNPVPREVLVPCLPSNADELTSWLS
GLRGSRVALRVPRRGDKRALAETVQRNAKEALQQHKLKRAGDFNARSAAL
QNIQEALGLSEAPLRIECVDISHVQGTDVVGSLVVFEDGLPRKSDYRHFA
IREAAGQGRSDDVASIAEVTRRRFARHLSEQNDPNMLSPEGKSRRFAYPP
NLYVVDGGAPQVNAASAVLEELGITDVAVIGLAKRLEEVWVPSEPDPVIM
PRNSEGLYLLQRVRDEAHRFAITYHRSKRSKRMTASVLDSVPGLGEHRRK
ALVSHFGSIARLKEATVDQITAVPGIGVATATAVLEALRPDESKAAP
>MAP0893 uvrD, UvrD
MSVHVTDAKPVSEAEQLLEGLNPQQRQAVVHEGSPLLIVAGAGSGKTAVL
TRRVAYLIAARGVGVGQVLAITFTNKAAAEMRERVVRLVGNRARAMWVST
FHSTCVRILRNQASLIEGLNSNFSIYDADDSRRLLQMIGRDMGLDIKRYS
PRLLANAISNLKNELIDPAAAVANLTDGSDELSRTVASVYGEYQRRLRAA
NALDFDDLIGETVAVLRAFPQIAQHYRRRFRHVLVDEYQDTNHAQYVLVR
ELVGTDTEDGVPPGELCVVGDADQSIYAFRGATIRNIEDFERDYPDATTI
LLEQNYRSTQNILSAANSVIARNSGRRDKRLWTDAGAGELIVGYVADNEH
DEARFVAEEIDALAQKGEITYNDVAVFYRTNNSSRSFEEVFIRAGIPYKV
VGGVRFYERKEIRDIIAYLRVLDNPGDAVSMRRILNTPRRGIGDRAEACV
AVYAENTGASFADALVAAAEGKVPMLNSRAEKAIAGFVELLDDLRGRLDD
DLGDLVESVLERTGYRRELESSTDPQELARLDNLNELVSVAHEFSTDRAN
AAALDESLDAADDEDVPDTGVLAEFLERVSLVSDSDEIPEDGAGMVTLMT
LHTAKGLEFPVVFVTGWEDGMFPHMRSLDDPVELSEERRLAYVGITRARQ
RLYLSRAIVRSSWGQPMLNPESRFLREIPQELIEWRRTAPAPSFSAPVSG
AGRFGTPRAAPTRAAMSKRPLLVLAPGDRVTHDKYGLGRVEEVSGAGESA
MSLIDFGSSGRVKLMHNHAPISKL
>MAP3297c uvrD2, UvrD2
MPTLADPLTAGLDDEQREAVLAPRGPVCVLAGAGTGKTRTITHRIAQLVA
GGHVAANQVLAVTFTQRAAGEMRSRLRVLAAAAGTDAAVGSVSALTFHAA
AHRQLRYFWPRVVGDTGWQLLDSKFAVVARAASRTRINASTDDVRDLAGE
IEWAKASLIGPEEYPAAVAAAGRDIPLDAARVAGVYAAYEALKARGDHVT
LLDFDDLLLHTAAAIENDPAVAEEFRDRYRCFVVDEYQDVTPLQQRVLAA
WLGDRDDLTVVGDANQTIYSFTGASPRFLLDFSRRFPDATVVRLERDYRS
TPQVVSLANQVIAAARGRVAGSKLHLVGQRPPGPVPTFHEHSDEPAEAAA
VAKSIARLIESGTPASEIAVLYRVNAQSEIYEEALTEAGIPYQVRGGEGF
FNRQEIKQALLALQRAAEREGEDAREEPLPAVVRAVLEPLGLTAEEPVGT
RARERWEALRALAELVDDEVARRPQLDFAGLLTELRMRADARHPPVVQGV
ALASLHAAKGLEWDAVFLVGLADGTLPISHALAHGADSEAVEEERRLLYV
GITRARTHLALSWALARSPGGRQSRKPSRFLNGIAPQTRAEAAPARSRRN
RSTATRCRICNNNLTTPAAVMLRRCETCAADIDEDLLVRLKAWRLDVAKE
QKVPAYVVFTDNTLIAIAELRPGDEQALIAIPGIGARKLEQYGPDVLELV
RGR
>MAP2958c xerC, XerC
MPDSEPILEEFDEYLALQCGRSAHTRRAYLGDLRSLLAFAGERGTGLDGL
SLPLLRSWLSAAAAGGVARTTLARRTSAVKAFTAWAVRRGLLTADPAARL
QVPKTHRTLPAVLRQDQALGAMAAAKTGAAQGDPIALRDRLIVEMLYATG
IRVSELCGLDIDDVDTGRRLVRVLGKGNKQRSVPFGAPAGEALHAWLADG
RPALANAESGPALLLGARGRRLDVRQARTVVHQTIAAVSGAPDMGPHGLR
HSAATHLLEGGADLRVVQELLGHSSLATTQLYTHVAVSRLRAVHDQAHPR
A
>MAP2686 xseA, XseA
MTRAATSEPNSAENPFPVRAVAIRVAGWIDRLGTVWVEGQLAQVSLRPDS
KTVFMVLRDPAADMSLTVTCPRDLVLNAPVKLAEGTQVVVCGKPSFYTGR
GTFSLRLSEIRAVGVGELLARIERLRRLLDAEGLFDPRLKRPIPFLPNMI
GLITGRASAAERDVKTVAAGRWPAVRFAVRNTAVQGPNAVAQIVEALREL
DRDPEVDVIVLARGGGSVEDLLPFSDETLCRAIAACRTPVISAVGHEPDN
PLCDLVADVRAATPTDAAKKVVPDTAAEQRLIDDLRRRSAQALRNWVSRE
QRALAQIRSRPVLAEPLRALTARAEEVHRARSAIRHDVTRLTAAETERIG
HLRARLATLGPAATLARGYAVVQTVGAGGPHVLRSVGDAPAGTRLRVRVA
DGAVAALSEGPTDGE
>MAP2687 xseB, XseB
MANNKKDEQAAAATPISRLGYEACRDELIEVVRQLEQGGLDLDASLNLWE
RGEQLAKRCEEHLAGARKRIEDALAAGEADDD
>MAP3916c xthA, XthA
MRLATWNVNSIRSRLPRVLDWLARTEVDVLAMQETKCADGQFPTLPFFEL
GYEVAHVGFNQWNGVAIASRVGLDDVRVGFEGQPSWSGKPEVAAAAEARA
LAATCAGVRIWSLYVPNGRAVGDPHYAYKLDWLAALRDSAAAWLREDPAA
QIALAGDWNIAPQDDDVWSMEFFAGATHVSEPERRAFNAILDAQFADLVR
PFAPGPGVYTYWDYTALRFPKRQGMRIDFILGSPALARRVTHAQIDREER
KGKGASDHAPVLVDVSRDEPPS